[Bug rtl-optimization/17264] New: [hppa] Missing address increment optimization for fp load/stores

tausq at debian dot org gcc-bugzilla@gcc.gnu.org
Wed Sep 1 18:22:00 GMT 2004


i have a bit of loop code that looks like this:

        pds = (double *)pcs;
        pdd = (double *)pcd;
        while (len >= 8*sizeof(double)) {
                register double r1,r2,r3,r4,r5,r6,r7,r8;
                prefetch((const void *)(pds+8));
                r1 = *pds++;
                r2 = *pds++;
                r3 = *pds++;
                r4 = *pds++;
                *pdd++ = r1;
                *pdd++ = r2;
                *pdd++ = r3;
                *pdd++ = r4;
                /* ... */
        }

gcc translates this to:
  2c:   2e 80 10 16     fldd 0(,r20),fr22
  30:   37 18 3f 81     ldo -40(r24),r24
  34:   36 94 00 10     ldo 8(r20),r20
  38:   2e 80 10 17     fldd 0(,r20),fr23
  3c:   36 94 00 10     ldo 8(r20),r20
  40:   2e 80 10 18     fldd 0(,r20),fr24
  44:   36 94 00 10     ldo 8(r20),r20
  48:   2e 80 10 19     fldd 0(,r20),fr25
  4c:   36 94 00 10     ldo 8(r20),r20
  50:   2f 40 12 16     fstd fr22,0(,r26)
  54:   37 5a 00 10     ldo 8(r26),r26
  58:   2f 40 12 17     fstd fr23,0(,r26)
  5c:   37 5a 00 10     ldo 8(r26),r26
  60:   2f 40 12 18     fstd fr24,0(,r26)
  64:   37 5a 00 10     ldo 8(r26),r26
  68:   2f 40 12 19     fstd fr25,0(,r26)
  6c:   37 5a 00 10     ldo 8(r26),r26

it is probably better to emit fldd,ma and fstd,ma instructions in this case. 
(this works for ldw/stw insns)

Dave Anglin writes:
I think we need to add combiner patterns for floating point loads
and stores with base register modification.  These need to be similar
to those for ldw and stw.  See pa.md (~ line 2465 in 3.4).

I haven't done a complete scan but I think we need to add SFmode
patterns using ldw and stw, DFmode patterns using ldd and std,
SImode and SFmode using fldw and fstw, DImode and DFmode using
fldd and fstd.  The half word and byte patterns need to be reviewed
to see that they are complete.

-- 
           Summary: [hppa] Missing address increment optimization for fp
                    load/stores
           Product: gcc
           Version: 3.4.2
            Status: UNCONFIRMED
          Severity: normal
          Priority: P2
         Component: rtl-optimization
        AssignedTo: unassigned at gcc dot gnu dot org
        ReportedBy: tausq at debian dot org
                CC: gcc-bugs at gcc dot gnu dot org
 GCC build triplet: hppa-linux
  GCC host triplet: hppa-linux
GCC target triplet: hppa-linux


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=17264



More information about the Gcc-bugs mailing list