This is the mail archive of the gcc@gcc.gnu.org mailing list for the GCC project.

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]
Other format:	[Raw text]

Re: RFC: ARM Cortex-A8 and floating point performance

From: Ramana Radhakrishnan <ramana dot radhakrishnan at arm dot com>
To: Siarhei Siamashka <siarhei dot siamashka at gmail dot com>
Cc: gcc at gcc dot gnu dot org
Date: Wed, 16 Jun 2010 16:22:32 +0100
Subject: Re: RFC: ARM Cortex-A8 and floating point performance
References: <201006161552.43706.siarhei.siamashka@gmail.com>
Reply-to: ramana dot radhakrishnan at arm dot com

On Wed, 2010-06-16 at 15:52 +0000, Siarhei Siamashka wrote:
> Hello,
> 
> Currently gcc (at least version 4.5.0) does a very poor job generating single 
> precision floating point code for ARM Cortex-A8.
> 
> The source of this problem is the use of VFP instructions which are run on a 
> slow nonpipelined VFP Lite unit in Cortex-A8. Even turning on RunFast mode 
> (flush denormals to zero, disable exceptions) just provides a relatively minor 
> performance gain.

> The right solution seems to be the use of NEON instructions for doing most of
> the single precision calculations.

Only in situations that the user is aware about -ffast-math. I will
point out that single precision floating point operations on NEON are
not completely IEEE compliant. 

cheers
Ramana

Follow-Ups:
- Re: RFC: ARM Cortex-A8 and floating point performance
  - From: Siarhei Siamashka

References:
- RFC: ARM Cortex-A8 and floating point performance
  - From: Siarhei Siamashka

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]