[Bug target/38306] [4.4/4.5/4.6 Regression] 15% slowdown w.r.t. 4.3 of computational kernel on some architectures
Joost.VandeVondele at pci dot uzh.ch
gcc-bugzilla@gcc.gnu.org
Mon Feb 21 12:56:00 GMT 2011
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=38306
--- Comment #23 from Joost VandeVondele <Joost.VandeVondele at pci dot uzh.ch> 2011-02-21 12:53:30 UTC ---
(In reply to comment #22)
> What is the performance with 4.3 -O2?
4.3:
> gfortran -O2 -march=native -funroll-loops -ffast-math test.f90 ; ./a.out
Time for evaluation [s]: 4.373
4.6:
> gfortran -O2 -march=native -funroll-loops -ffast-math test.f90 ; ./a.out
Time for evaluation [s]: 4.347
so, same performance.
Given that vectorization only happens at -O3, it is an important optimization
level for numerical codes. Nevertheless, I would propose to remove the
regression tag, and instead refocus the bug on the what current trunk does at
-O3 vs -O2 -ftree-vectorize as noted in comment #21
> gfortran -O2 -march=native -funroll-loops -ffast-math -ftree-vectorize test.f90 ; ./a.out
Time for evaluation [s]: 2.694
> gfortran -O3 -march=native -funroll-loops -ffast-math -ftree-vectorize test.f90 ; ./a.out
Time for evaluation [s]: 4.536
More information about the Gcc-bugs
mailing list