This is the mail archive of the gcc-help@gcc.gnu.org mailing list for the GCC project.

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]
Other format:	[Raw text]

Re: bitwise & optimization

From: Vincent Diepeveen <diep at xs4all dot nl>
To: Fisnik Kastrati <kastrati at informatik dot uni-mannheim dot de>
Cc: gcc-help at gcc dot gnu dot org
Date: Tue, 9 Jun 2015 17:44:22 +0200 (CEST)
Subject: Re: bitwise & optimization
Authentication-results: sourceware.org; auth=none
References: <5576E63B dot 8030408 at informatik dot uni-mannheim dot de>

Hi!

Very similar to my post around june 2007 and when Linus Thorvalds posted6 months later something similar around 2007, i remember one of the GCCteam members showing the middlefinger that they simply wanted to keep intel ahead ofAMD in terms of speed and take care that GCC couldn't rival othercompilers in terms of speed (the implication of not doing thisoptimization in branchy codes).

For my chessprogram Diep i've posted even more horrible optimizations -GCC has the tendency to also put such branches where i know myself thatfall through is gonna give lots of mispredicted branches (as the totalnumber of branches is too much for the processors memory), GCC managed tomess up at other pieces even further:

causing it to generate a jump to the end of the function andthen back, and it also was instruction wise outside of the AMD instructionlook ahead - which really is slower than generating a few CMOV typeinstructions or using less branches.

Not rewriting this ugly part of the GCC compiler is the reason why intelc++ is roughly 10-15% faster than GCC, especially in 64 bits, and whycode generated runs faster on intel than on AMD processorsas the instruction lookahead is larger, whereas OBJECTIVELY the codegenerated is a lot SLOWER.

Ideally you really want that some statistics generated with whatever thereis at GCC nowadays like -fgenerate, that really every branch can getparameterized.

Yet a lot of ways to mess up GCC seems to do before such optimizationscan take part.

When it would parameterize that - it would be a compiler that can generatecode that's really objectively fast - whereas it's duck slow right now forbranchy codes.


Kind Regards,

Vincent Diepeveen
The Netherlands


On Tue, 9 Jun 2015, Fisnik Kastrati wrote:

To whom it may concern,
I'm turning to you with regards to an unwanted optimization that g++ (v.4.8.2) is generating, see the code in the following link:
http://goo.gl/3NVjyc
The assembly code generated for both methods "amp", "ampamp" is thepractically the same, when using the optimization flag "-O3". However, I'minterested to have a single jump for the code in the method "amp", as branchmisprediction penalty is very high otherwise. Is there any optmization flagthat I should set, in order to avoid this feature when using "-O3"? I.e., I'dlike a generated code similar to icc 13.
Thank you in advance

Follow-Ups:
- Re: bitwise & optimization
  - From: Manuel López-Ibáñez

References:
- bitwise & optimization
  - From: Fisnik Kastrati

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]