This is the mail archive of the
gcc-bugs@gcc.gnu.org
mailing list for the GCC project.
[Bug tree-optimization/51179] poor vectorization on interlagos.
- From: "Joost.VandeVondele at mat dot ethz.ch" <gcc-bugzilla at gcc dot gnu dot org>
- To: gcc-bugs at gcc dot gnu dot org
- Date: Sat, 30 Jun 2012 11:26:59 +0000
- Subject: [Bug tree-optimization/51179] poor vectorization on interlagos.
- Auto-submitted: auto-generated
- References: <bug-51179-4@http.gcc.gnu.org/bugzilla/>
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=51179
--- Comment #11 from Joost VandeVondele <Joost.VandeVondele at mat dot ethz.ch> 2012-06-30 11:26:59 UTC ---
It looks like this problem is solved in the current 4.7 and 4.8 branches. At
least on an avx machine, the best performance found by the code in comment #4
jumps from 5.3Gflops in 4.6 to 13.9Glfops in 4.7/4.8. Great work.
I can't test this right now on interlagos, but I guess this could be OK as
well.