This is the mail archive of the gcc@gcc.gnu.org mailing list for the GCC project.
sqrt via SSE2 registers
- From: "Tom Womack" <tom at womack dot net>
- To: <gcc at gcc dot gnu dot org>
- Cc: "Jan Hubicka" <jh at suse dot cz>
- Date: Sat, 16 Feb 2002 15:37:43 -0000
- Subject: sqrt via SSE2 registers
Is there any way of telling gcc that I care so little about precision that
I'm prepared to have it compute the square roots of values held in SSE2
registers by using SQRTSD, rather than by storing the register to memory,
loading it onto the x87 FP stack and executing fsqrt?
I've tried (on the principle of sticking in anything that might have an
effect)
-O3 -march=pentium4 -msse2 -mfpmath=sse{,387} -ffast-math \
-mno-fancy-math-387
but nonetheless always get the worst-of-both-worlds behavior described
above.
Am I just being too optimistic about mainline gcc's current level of support
for the P4? There is a sqrtdf2_1 insn pattern in i386.md which looks as if it
should do the right thing.
Tom