[Bug rtl-optimization/48128] Excessive code generated for vectorized loop
rguenth at gcc dot gnu.org
gcc-bugzilla@gcc.gnu.org
Tue Mar 15 10:32:00 GMT 2011
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=48128
Richard Guenther <rguenth at gcc dot gnu.org> changed:
           What    |Removed                     |Added
----------------------------------------------------------------------------
             Target|i686-*-*                    |i686-*-*, x86_64-*-*
             Status|UNCONFIRMED                 |NEW
   Last reconfirmed|                            |2011.03.15 10:32:18
     Ever Confirmed|0                           |1
--- Comment #3 from Richard Guenther <rguenth at gcc dot gnu.org> 2011-03-15 10:32:18 UTC ---
Confirmed.  The fun thing is that the tree-level optimized code looks exactly
the same ...
On x86_64 we get:
foo2:
.LFB1:
	.cfi_startproc
	movq	baz(%rip), %rdx
	movq	%rdx, -24(%rsp)
	movl	%edx, %eax
	movq	baz+8(%rip), %rdx
	movq	%rdx, -16(%rsp)
	movdqa	-24(%rsp), %xmm0
	movdqa	%xmm0, bar(%rip)
	movdqa	baz+16(%rip), %xmm0
	movdqa	%xmm0, bar+16(%rip)
	ret
so it spills everything to the stack here as well!?
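
To make the data flow of the listing above easier to follow, here is a rough C model of what the generated code does. This is only a sketch: the declarations of bar and baz are not shown in this comment, so the 32-byte unsigned char arrays are an assumption inferred from the offsets in the assembly, and the function names copy_as_generated and copy_direct are hypothetical. It models the instructions, not the original test case.

```c
#include <string.h>

/* Assumed stand-ins: 32 bytes each, inferred from the baz+16 / bar+16
   accesses in the listing.  The real element type is not shown. */
unsigned char bar[32], baz[32];

/* Models the instruction sequence of foo2, not its source code. */
void copy_as_generated(void)
{
    unsigned char tmp[16];           /* stack slot at -24(%rsp)          */

    memcpy(tmp, baz, 8);             /* movq baz(%rip)   -> -24(%rsp)    */
    memcpy(tmp + 8, baz + 8, 8);     /* movq baz+8(%rip) -> -16(%rsp)    */
    memcpy(bar, tmp, 16);            /* movdqa -24(%rsp) -> bar          */
    memcpy(bar + 16, baz + 16, 16);  /* movdqa baz+16    -> bar+16       */
}

/* For comparison: both halves moved directly, the way the second half
   already is in the listing. */
void copy_direct(void)
{
    memcpy(bar, baz, 16);
    memcpy(bar + 16, baz + 16, 16);
}
```

Both functions leave bar with the same contents; the difference is that the first 16 bytes take a round trip through the stack temporary, assembled from two 8-byte pieces, while the second 16 bytes go straight through %xmm0.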