[Bug rtl-optimization/48128] Excessive code generated for vectorized loop

rguenth at gcc dot gnu.org gcc-bugzilla@gcc.gnu.org
Tue Mar 15 10:32:00 GMT 2011


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=48128

Richard Guenther <rguenth at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Target|i686-*-*                    |i686-*-*, x86_64-*-*
             Status|UNCONFIRMED                 |NEW
   Last reconfirmed|                            |2011.03.15 10:32:18
     Ever Confirmed|0                           |1

--- Comment #3 from Richard Guenther <rguenth at gcc dot gnu.org> 2011-03-15 10:32:18 UTC ---
Confirmed.  The fun thing is that the tree-level optimized code looks exactly
the same ...

On x86_64 we get

foo2:
.LFB1:
        .cfi_startproc
        movq    baz(%rip), %rdx
        movq    %rdx, -24(%rsp)
        movl    %edx, %eax
        movq    baz+8(%rip), %rdx
        movq    %rdx, -16(%rsp)
        movdqa  -24(%rsp), %xmm0
        movdqa  %xmm0, bar(%rip)
        movdqa  baz+16(%rip), %xmm0
        movdqa  %xmm0, bar+16(%rip)
        ret

so it spills everything to the stack here as well!?
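
For readers without the PR's attachment to hand: the assembly above loads baz
in two 8-byte pieces through %rdx, stores both to the stack, and only then
reloads them as one 16-byte vector for the store to bar, while the second half
of the copy goes straight through %xmm0. The low 32 bits of the first piece
also end up in %eax as the return value. A hypothetical C reduction consistent
with that shape might look like the sketch below. It is NOT the test case from
the bug report; the array length, element type, alignment attribute and
function body are all assumptions inferred from the assembly.

```c
/* Hypothetical reduction, not the test case attached to the PR.
   N, the int element type and the 16-byte alignment are assumptions
   inferred from the 32-byte copy and the movdqa accesses above. */
#define N 8

int bar[N] __attribute__((aligned(16)));
int baz[N] __attribute__((aligned(16)));

/* Copy baz into bar and return the first element that was read. */
int foo2(void)
{
    int i;
    int first = baz[0];

    for (i = 0; i < N; i++)
        bar[i] = baz[i];

    return first;
}
```

Compiling such a file with something like "gcc -O3 -S" and reading the
generated .s file is one way to compare against the listing above. Whether
this particular sketch triggers the same stack round trip depends on the GCC
version and target, so treat it as an illustration only.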


