[Bug target/31897] [4.3 Regression] 30% speed regression with -m32 on Opteron with rnflow

ubizjak at gmail dot com gcc-bugzilla@gcc.gnu.org
Fri Oct 19 17:53:00 GMT 2007



------- Comment #7 from ubizjak at gmail dot com  2007-10-19 17:53 -------
(In reply to comment #6)
> It is little bit sick, but what about implying -mfpmath=sse when
> -ftree-vectorize is used and SSE is available?

Then you will hit Core2 Duo, that shows the opposite in 32bit and 64bit mode:

-O3 -ffast-math -ftree-vectorize -funroll-loops -msse3 -m32 -mfpmath=387

user 0m22.785s

-O3 -ffast-math -ftree-vectorize -funroll-loops -msse3 -m32 -mfpmath=sse

user 0.27.886s

-O3 -ffast-math -ftree-vectorize -funroll-loops -msse3 -m64 -mfpmath=387

user 0m20.473s

-O3 -ffast-math -ftree-vectorize -funroll-loops -msse3 -m64 -mfpmath=sse

user 0.25.046s


-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=31897



More information about the Gcc-bugs mailing list