[Bug target/27855] reassociation pass produces ~30% slower matrix multiplication code
ubizjak at gmail dot com
gcc-bugzilla@gcc.gnu.org
Mon Jul 9 18:17:00 GMT 2007
------- Comment #15 from ubizjak at gmail dot com 2007-07-09 18:16 -------
New timings on x86_64 core2 (from [1])
The tests were performed on core2 in 64bit mode, using '-DREPS=10000 -O3 -msse3
-march=core2 -ffast-math' flags, with and without newly introduced
-fno-tree-reassoc flag.
The results were _interesting_, showing extreme differences in the run times:
w/o -fno-tree-reassoc:
ALGORITHM NB REPS TIME MFLOPS
========= ===== ===== ========== ==========
-DTYPE=float: atlasmm 60 10000 2.000 2159.87
-DTYPE=double: atlasmm 60 10000 2.500 1727.89
w/ -fno-tree-reassoc:
ALGORITHM NB REPS TIME MFLOPS
========= ===== ===== ========== ==========
-DTYPE=float: atlasmm 60 10000 0.932 4634.90
-DTYPE=double: atlasmm 60 10000 1.520 2841.93
[1] http://gcc.gnu.org/ml/gcc-patches/2007-07/msg00849.html
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=27855
More information about the Gcc-bugs
mailing list