[PATCH][libgcc-math] Vectorized intrinsics for x86_64

Richard Guenther rguenther@suse.de
Fri Apr 7 07:32:00 GMT 2006


On Thu, 6 Apr 2006, Richard Henderson wrote:

> On Tue, Apr 04, 2006 at 05:40:07PM +0200, Richard Guenther wrote:
> > Here's an updated patch with your suggestions applied (but with .align 16,
> > to have it cacheline aligned), like so
> > 
> > .section .rodata.cst16, "M", @progbits, 16
> > .align 16
> 
> Not ok for 8 byte data.

I believe it's all 16byte data, i.e. always full sse registers with
either 2 doubles or 4 floats.  Matz did the annotation, so I guess
he knows better if there are any partial loads from these data (it
might be f.i. the .L__two_to_jby32_table - which is seemingly accessed
in 4 byte quantities f.i.)

> And you didn't attach the patch.

Yeah - it was in the followup :/

Richard.



More information about the Gcc-patches mailing list