This is the mail archive of the gcc-help@gcc.gnu.org mailing list for the GCC project.


Re: -falign-loops=16 on apple gcc still gives loops not aligned to 16 byte address boundaries -why?

From: David Edelsohn <dje at watson dot ibm dot com>
To: Dan White <dan at chalkie dot org dot uk>
Cc: gcc-help at gcc dot gnu dot org
Date: Wed, 08 Mar 2006 14:43:10 -0500
Subject: Re: -falign-loops=16 on apple gcc still gives loops not aligned to 16 byte address boundaries -why?
References: <07BD2B23-D191-469E-9A51-7DF1BE1945A6@chalkie.org.uk>

> one loop running from a VTK lib
> which takes up much processor time
> has not been aligned to 16 byte boundary

	The -falign-loops option is a suggestion, not a requirement.  Not
all loops are aligned to the value specified.  GCC uses various heuristics
to determine whether a loop should be aligned, and only aligns a loop if
doing so will not require more than a certain number of nops.  Compiling
with profiling can help GCC determine better heuristics.
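
	To make the nop limit concrete, here is a rough sketch of the kind of
check involved.  This is not GCC's actual code; the function names and the
max_skip parameter are made up for illustration, and the real limit is
decided by the compiler and the target.

```c
#include <stdbool.h>
#include <stdint.h>

/* Bytes of padding needed to move addr up to the next multiple of
   align.  align must be a power of two (e.g. 16). */
uint64_t padding_needed(uint64_t addr, uint64_t align)
{
    return (align - (addr & (align - 1))) & (align - 1);
}

/* Hypothetical model of the heuristic: the loop head is aligned only
   if the padding does not exceed max_skip bytes.  max_skip is an
   assumed parameter for this sketch, not a GCC internal. */
bool would_align(uint64_t addr, uint64_t align, uint64_t max_skip)
{
    return padding_needed(addr, align) <= max_skip;
}
```

	Under this model, a loop head at 0x1004 needs 12 bytes of padding to
reach a 16-byte boundary, so with a limit of 7 bytes it would be left
unaligned, while a loop head at 0x100c needs only 4 bytes and would be
aligned.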

> shark also tells me this loop contains a single-precision floating
> point computation that could be speeded up using altivec
> -fast also turns on -maltivec 

	-maltivec is not the same as auto-vectorization; it only enables the
AltiVec instructions and intrinsics.  One can try auto-vectorization or
manually convert the loop to use AltiVec intrinsics.
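
	As an illustration only (this is not the VTK loop in question, and
the function name is made up), a single-precision loop of the following
shape is the sort of thing the vectorizer has a chance with.  The
restrict qualifiers tell GCC that the two arrays do not overlap.

```c
#include <stddef.h>

/* y[i] = a * x[i] + y[i] for i in [0, n).  A simple counted loop over
   non-overlapping float arrays, with no calls or branches in the body. */
void saxpy(float *restrict y, const float *restrict x, float a, size_t n)
{
    for (size_t i = 0; i < n; i++)
        y[i] = a * x[i] + y[i];
}
```

	With mainline GCC 4.0 or later one would try compiling this with
something like -O2 -ftree-vectorize -maltivec.  Whether the loop is
actually vectorized depends on the compiler version and the target, so
the generated assembly needs to be checked.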

David
