This is the mail archive of the gcc@gcc.gnu.org mailing list for the GCC project.

Re: [EXT] Re: GCC missing -flto optimizations? SPEC lbm benchmark

From: Steve Ellcey <sellcey at marvell dot com>
To: "amker dot cheng at gmail dot com" <amker dot cheng at gmail dot com>, "majun4950646 at gmail dot com" <majun4950646 at gmail dot com>
Cc: "gcc at gcc dot gnu dot org" <gcc at gcc dot gnu dot org>
Date: Fri, 15 Feb 2019 17:53:24 +0000
Subject: Re: [EXT] Re: GCC missing -flto optimizations? SPEC lbm benchmark
References: <92bfe075168981ee45e525875ac6a15f5e318034.camel@marvell.com> <CAHFci2_tSRtnA38KJjG+kWDDh387NGvY2owyUrmfZjS03def0Q@mail.gmail.com> <CABT63J4+=ihYHkEWy6aZwawYu5Z6Y4wErCmZVLkzLBLv3tVE9w@mail.gmail.com>

On Fri, 2019-02-15 at 17:48 +0800, Jun Ma wrote:
> 
> ICC is doing much more than GCC in ipo, especially memory layout 
> optimizations. See https://software.intel.com/en-us/node/522667.
> ICC is more aggressive in array transposition/structure splitting
> /field reordering. However, these optimizations have been removed
> from GCC a long time ago.
> As for lbm_r, IIRC the most time-consuming part is a loop whose memory
> accesses have a stride of 20.  ICC will optimize the array (maybe the
> structure?) and vectorize the loop under ipo.
>  
> Thanks
> Jun

Interesting.  I tried using '-qno-opt-mem-layout-trans' on ICC
along with '-Ofast -ipo' and that had no effect on the speed.  I also
tried '-no-vec' and that had no effect either.  The only thing that
slowed down ICC was '-ip-no-inlining' or '-fno-inline'.  I see that
'-Ofast -ipo' resulted in everything (except libc functions) getting
inlined into the main program when using ICC.  GCC did not do that, but
if I forced it to by using the always_inline attribute, GCC could
inline everything into main the way ICC does.  But that did not speed
up the GCC executable.
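
For anyone who wants to repeat the forced-inlining experiment, here is a
minimal sketch of what I mean by the always_inline attribute.  It is not
code from lbm: the names load_field, sum_field and N_CELL_ENTRIES are
made up for illustration, and the layout of 20 doubles per cell is only
an assumption chosen to mirror the stride of 20 mentioned above.

```c
#include <assert.h>
#include <stddef.h>

/* Assumed layout: 20 doubles per grid cell (hypothetical, mirrors the
   stride of 20 discussed in the thread). */
enum { N_CELL_ENTRIES = 20 };

/* always_inline asks GCC to inline this function at every call site,
   overriding its usual inlining heuristics. */
static inline __attribute__((always_inline))
double load_field(const double *grid, size_t cell, size_t field)
{
    return grid[cell * N_CELL_ENTRIES + field];
}

/* Sums one field over all cells.  Consecutive loads are N_CELL_ENTRIES
   doubles apart, i.e. a strided access pattern. */
double sum_field(const double *grid, size_t n_cells, size_t field)
{
    assert(field < N_CELL_ENTRIES);
    double total = 0.0;
    for (size_t c = 0; c < n_cells; ++c)
        total += load_field(grid, c, field);
    return total;
}
```

Building this with something like 'gcc -O3 -flto' and looking at the
generated assembly should show no remaining call to load_field.  As
noted above, getting the inlining to happen is a separate question from
whether it makes the executable any faster.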

Steve Ellcey
sellcey@marvell.com

Follow-Ups:
- Re: [EXT] Re: GCC missing -flto optimizations? SPEC lbm benchmark
  - From: Jun Ma

References:
- GCC missing -flto optimizations? SPEC lbm benchmark
  - From: Steve Ellcey
- Re: GCC missing -flto optimizations? SPEC lbm benchmark
  - From: Bin.Cheng
- Re: GCC missing -flto optimizations? SPEC lbm benchmark
  - From: Jun Ma
