This is the mail archive of the gcc@gcc.gnu.org mailing list for the GCC project.

Re: gcc will become the best optimizing x86 compiler

From: Denys Vlasenko <vda dot linux at googlemail dot com>
To: Agner Fog <agner at agner dot org>
Cc: Raksit Ashok <raksit at google dot com>, dclarke at opensolaris dot org, gcc at gcc dot gnu dot org, TimothyPrince at sbcglobal dot net
Date: Wed, 30 Jul 2008 20:22:25 +0200
Subject: Re: gcc will become the best optimizing x86 compiler
References: <2E073B3ABB3F664DBA1D1C4D5FB47EF40EBDAD8E@NT-IRVA-0752.brcm.ad.broadcom.com> <1158166a0807300908o5dcc101dqd39c1fc1ef477806@mail.gmail.com> <4890A184.6050709@agner.org>

On Wednesday 30 July 2008 19:14, Agner Fog wrote:
> I agree that the OpenSolaris memcpy is bigger than necessary. However, 
> it is necessary to have 16 branches for covering all possible alignments 
> modulo 16. This is because, unfortunately, there is no XMM shift 
> instruction with a variable count, only with a constant count, so we 
> need one branch for each value of the shift count. Since only one of the 
> branches is used, it doesn't take much space in the code cache. The 
> speed is improved by a factor 4-5 by this 16-branch algorithm, so it is 
> certainly worth the extra complexity.

I doubt that large memcpys with odd-byte alignment are anywhere
near typical. malloc and mmap both return well-aligned buffers
(say, at least 8-byte aligned). Static and on-stack objects are also
at least word-aligned 99% of the time.

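memcpy can just use "relatively simple" code for copies in which
either src or dst is not word-aligned. When both are word-aligned, the
offset of src relative to dst modulo 16 is a multiple of the word size,
which cuts the possibilities down from 16 to 4 (or even 2?).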
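
The counting argument can be checked with a small sketch. It assumes a copy
loop that first advances dst to a 16-byte boundary and then needs one code
path per value of the src offset modulo 16. The helper names shift_count and
distinct_shifts are hypothetical, made up for this illustration; they are not
taken from the OpenSolaris memcpy or from any library.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical helper: the byte shift a 16-byte-block copy loop would need,
   i.e. the source offset modulo 16 once the destination has been advanced
   to a 16-byte boundary. */
static unsigned shift_count(uintptr_t src, uintptr_t dst)
{
    /* Bytes copied one at a time before dst reaches a 16-byte boundary. */
    uintptr_t lead = (16 - (dst & 15)) & 15;
    return (unsigned)((src + lead) & 15);
}

/* Count how many distinct shift values can occur when both src and dst
   are multiples of `align` (assumed to be 1, 2, 4, 8 or 16). */
static unsigned distinct_shifts(unsigned align)
{
    unsigned seen = 0;   /* bit i is set once shift value i has been observed */
    unsigned count = 0;

    assert(align == 1 || align == 2 || align == 4 ||
           align == 8 || align == 16);

    /* Only the low four address bits matter, so 0..15 covers every case. */
    for (uintptr_t s = 0; s < 16; s += align)
        for (uintptr_t d = 0; d < 16; d += align)
            seen |= 1u << shift_count(s, d);

    for (unsigned i = 0; i < 16; i++)
        count += (seen >> i) & 1u;
    return count;
}
```

Under these assumptions, distinct_shifts(1) gives 16 (arbitrary alignment),
distinct_shifts(4) gives 4 (both pointers 4-byte aligned), and
distinct_shifts(8) gives 2 (both pointers 8-byte aligned).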
--
vda

