[Bug tree-optimization/32921] [4.3 Regression] Revision 126326 causes 12% slowdown

pthaugen at gcc dot gnu dot org gcc-bugzilla@gcc.gnu.org
Tue Oct 23 20:29:00 GMT 2007



------- Comment #29 from pthaugen at gcc dot gnu dot org  2007-10-23 20:29 -------
    Found another example on PowerPC in the same benchmark that is not fixed by
the checked in patches.  Compiled with -m32 -O2. From the loop in procedure
FLUXI:

    revision 126325:

    .L47:
            lfd 0,0(9)       #* ivtmp.277, tmp346
            lfd 13,-8(9)     #, tmp347
            add 9,9,0        # ivtmp.277, ivtmp.277, ivtmp.309
            fsub 0,0,13      # tmp345, tmp346, tmp347
            fmul 0,0,12      # tmp348, tmp345, dtvol.23
            fneg 0,0         # tmp349, tmp348
            stfd 0,0(11)     #* ivtmp.306, tmp349
            add 11,11,10     # ivtmp.306, ivtmp.306, ivtmp.280
            bdnz .L47        #



    revision 126326 (and current mainline):

    .L83:
            lwz 0,48(7)      # <variable>.stride, <variable>.stride
            lfd 0,0(10)      #* ivtmp.277, tmp416
            lfd 13,-8(10)    #, tmp417
            lfd 12,0(5)      # dtvol, dtvol
            add 10,10,6      # ivtmp.277, ivtmp.277, ivtmp.306
            mullw 0,8,0      # tmp409, l, <variable>.stride
            fsub 0,0,13      # tmp415, tmp416, tmp417
            lwz 9,4(7)       # du.offset, du.offset
            lwz 11,0(7)      # du.data, du.data
            fmul 0,0,12      # tmp420, tmp415, dtvol
            fneg 0,0         # tmp422, tmp420
            add 0,0,9        # tmp412, tmp409, du.offset
            addi 8,8,1       # l, l,
            slwi 0,0,3       # tmp413, tmp412,
            stfdx 0,11,0     #, tmp422
            bdnz .L83        #


-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32921



More information about the Gcc-bugs mailing list