This is the mail archive of the gcc-patches@gcc.gnu.org mailing list for the GCC project.

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]
Other format:	[Raw text]

Re: RFC: LRA for x86/x86-64 [0/9]

From: Steven Bosscher <stevenb dot gcc at gmail dot com>
To: Jakub Jelinek <jakub at redhat dot com>
Cc: Vladimir Makarov <vmakarov at redhat dot com>, Richard Guenther <richard dot guenther at gmail dot com>, GCC Patches <gcc-patches at gcc dot gnu dot org>
Date: Mon, 1 Oct 2012 11:55:14 +0200
Subject: Re: RFC: LRA for x86/x86-64 [0/9]
References: <CABu31nPaws_d+QoBdO_PxSXjHsZ8Kb10aOM_9GeWQKhoyG6_mA@mail.gmail.com> <5065C066.4040600@redhat.com> <5066486B.70205@redhat.com> <CABu31nP-19ZPkW6twoWZbPqThsY6zhyvhiW445Co+Gr_CykL=A@mail.gmail.com> <CAFiYyc3yCW-BGt0quQq=Ge3KTs9=Xyp25uneB8_zgQWtFeJzaA@mail.gmail.com> <CABu31nMw_VfLkcrqwUY=kYPH+G=Kv-OjwOJF0L5xbRjYfCXkJQ@mail.gmail.com> <CAFiYyc24pkv18XaviZRbOjYS4YOA1oyEQi5a5iWD=xvhkRDUKA@mail.gmail.com> <5068CCCA.3060206@redhat.com> <20121001054816.GD1787@tucnak.redhat.com> <CABu31nOkBs=G5Ug4mX0=cVQt15wgD8d7BGm4b9smP7pZmTL3Rw@mail.gmail.com> <20121001071653.GE1787@tucnak.redhat.com>

On Mon, Oct 1, 2012 at 9:16 AM, Jakub Jelinek <jakub@redhat.com> wrote:
> On Mon, Oct 01, 2012 at 08:47:13AM +0200, Steven Bosscher wrote:
>> The test case compiles just fine at -O2, only VRP has trouble with it.
>> Let's try to stick with facts, not speculation.
>
> I was talking about the other PR, PR26854, which from what I remember when
> trying it myself and even the latest -O3 time reports from the reduced
> testcase show that IRA/reload aren't there very significant (for -O3 IRA
> takes ~ 6% and reload ~ 1%).

OK, but what does LRA take? Vlad's numbers for 64-bits and looking at user time:

Reload: 503.26user
LRA: 598.70user

So if reload is ~1% of 503s then that'd be ~5s. And the only
difference between the two timings is LRA instead of reload, so LRA
takes ~100s, or 20%.

>> I've put a lot of hard work into it to fix almost all scalability problems
>> on this PR for gcc 4.8. LRA undoes all of that work. I understand it is
>> painful for some people to hear, but I remain of opinion that LRA cannot be
>> considered "ready" if it scales so much worse than everything else in the
>> compiler.
>
> Judging the whole implementation from just these corner cases and not how it
> performs on other testcases (SPEC, rebuild a distro, ...) is IMHO not the
> right thing, if Vlad thinks the corner cases are fixable during stage3; IMHO
> we should allow LRA in, worst case it can be disabled by default even for
> i?86/x86_64.

I'd be asked to do a guest lecture on compiler construction (to be
clear: I'd be highly surprised if anyone would ask me to, but for sake
of argument, bear with me ;-) then I'd start by stating that
algorithms should be designed for the corner cases, because the
devil's always in the details.

But more to the point regarding stage3: It will already be a busy
stage3 if the other, probably even more significant, scalability
issues have to be fixed, i.e. var-tracking and macro expansion. And
there's also the symtab work that's bound to cause some interesting
bugs still to be shaken out. With all due respect to Vlad, and,
seriously, hats off to Vlad for tacking reload and coming up with a
much easier to understand and nicely phase-split replacement, I just
don't believe that these scalability issues can be addressed in
stage3.

It's now very late stage1, and LRA was originally scheduled for GCC
4.9. Why the sudden hurrying? Did I miss the 2 minute warning?

Ciao!
Steven

References:
- Re: RFC: LRA for x86/x86-64 [0/9]
  - From: Jakub Jelinek
- Re: RFC: LRA for x86/x86-64 [0/9]
  - From: Jakub Jelinek

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]