This is the mail archive of the
mailing list for the GCC project.
Re: upcoming SSE/SSE2 support in 3.1
- From: Geert Bosch <bosch at gnat dot com>
- To: Jim Wray <wray at rivit dot cs dot byu dot edu>
- Cc: gcc at gcc dot gnu dot org
- Date: Thu, 9 May 2002 20:06:10 -0400
- Subject: Re: upcoming SSE/SSE2 support in 3.1
On Thursday, May 9, 2002, at 05:07 , Jim Wray wrote:
> In other words, a typical operation would be to
> iteratively go through data that is consecutive in memory with two
> and a destination. Is this likely to get SSE(2) code generated
> automatically, or should I spend time looking at writing custom
In order to make it possible to have any vectorization, you should start
to make sure all your data is properly aligned and that the compiler
about this. Also you should take care to write the code such that it is
clear (for the compiler) that there can be no possible aliasing issues.
This is not trivial, but will likely result in speedups already without
using specific SSE/SSE2 instructions.
The last step of actually using the vector instructions is relatively
easy, and can be done using Asm inserts for now, or automatically by the
compiler later. The step of laying out your data and designing your
meet the aliasing and alignment requirements as described above is
that compilers in general will not be able to do for you however.