This is the mail archive of the gcc-patches@gcc.gnu.org mailing list for the GCC project.


Re: [GCC RFC]A new and simple pass merging paired load store instructions

From: Jeff Law <law at redhat dot com>
To: Mike Stump <mikestump at comcast dot net>, "bin.cheng" <bin dot cheng at arm dot com>
Cc: gcc-patches at gcc dot gnu dot org
Date: Thu, 15 May 2014 11:13:01 -0600
Subject: Re: [GCC RFC]A new and simple pass merging paired load store instructions
Authentication-results: sourceware.org; auth=none
References: <004d01cf700e$ef1e30e0$cd5a92a0$ at arm dot com> <32B4330F-1D0F-4D4E-BF7A-2E5B2148B893 at comcast dot net>

On 05/15/14 10:51, Mike Stump wrote:

On May 15, 2014, at 12:26 AM, bin.cheng <bin.cheng@arm.com> wrote:

Here is a new GCC pass that looks through each basic block and
merges paired loads and stores even when they are not adjacent to
each other.


So I have a target that has load and store multiple support that
supports a large number of registers (2-n registers), and I added a
sched0 pass that is a light copy of the regular scheduling pass that
uses a different cost function which arranges all loads first, then
all stores, then everything else.  Within a group of loads or stores
the secondary key is the base register, the next key is the offset.
The net result is that all loads off the same base register are
sorted in increasing offset order.
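
The ordering described above can be sketched as a sort key on a toy instruction model. This is only an illustration, not code from the sched0 pass: the `Insn` record, `sched0_key`, and `sched0_order` are hypothetical names, and the sketch ignores data dependencies, which a real scheduler must respect when reordering.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Insn:
    """Toy instruction record (hypothetical; not a GCC data structure)."""
    kind: str                    # "load", "store", or "other"
    base: Optional[str] = None   # base register for memory accesses
    offset: int = 0              # byte offset from the base register
    text: str = ""               # free-form label for non-memory insns


# Primary key: loads first, then stores, then everything else.
_KIND_RANK = {"load": 0, "store": 1}


def sched0_key(insn):
    rank = _KIND_RANK.get(insn.kind, 2)
    if rank == 2:
        # Non-memory insns share one key, so the stable sort keeps
        # them in their original relative order.
        return (rank, "", 0)
    # Secondary key: base register; tertiary key: offset.
    return (rank, insn.base, insn.offset)


def sched0_order(block):
    """Return the block reordered by the three-level key.

    Assumes every insn in the block is free to move; a real pass
    would only reorder within dependency constraints.
    """
    return sorted(block, key=sched0_key)
```

With this key, loads off the same base register end up adjacent and in increasing offset order, which is the shape a load-multiple matcher would want to see.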

Glad to see someone else stumble on (ab)using the scheduler to do this.

I've poked at the scheduler several times to do similar stuff, but was never really satisfied with the results and never tried to polish those prototypes into something worth submitting.

One example I've poked at was discovery of stores which then feed into a load from the same location, which obviously we'd prefer to turn into a store + copy (subject to a mess of constraints). There's a handful of these kinds of transformations that seem to naturally drop out of this kind of work.
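
A minimal sketch of the store-feeding-a-load case, on a toy tuple-based instruction list rather than real RTL. The tuple shapes and the `forward_stores` and `_kill` helpers are assumptions made for illustration, and the "mess of constraints" is reduced to two conservative rules: any store is assumed to alias every tracked location, and any write to a register invalidates entries that use it as base or source.

```python
def _kill(available, reg):
    """Drop tracked locations whose base or source register is `reg`."""
    for loc in [l for l, src in available.items() if reg in (l[0], src)]:
        del available[loc]


def forward_stores(block):
    """Rewrite loads that read a just-stored location into register copies.

    Toy insn shapes (hypothetical):
      ("store", src, base, off)   mem[base + off] = src
      ("load",  dst, base, off)   dst = mem[base + off]
      ("def",   reg)              any other write to reg
    """
    available = {}  # (base, off) -> register still holding the stored value
    out = []
    for insn in block:
        op = insn[0]
        if op == "store":
            _, src, base, off = insn
            # Conservatively assume this store may alias anything tracked.
            available.clear()
            available[(base, off)] = src
            out.append(insn)
        elif op == "load":
            _, dst, base, off = insn
            src = available.get((base, off))
            out.append(("copy", dst, src) if src is not None else insn)
            _kill(available, dst)  # dst was just overwritten
        else:
            out.append(insn)
            _kill(available, insn[1])
    return out
```

The store itself is kept; only the load is replaced, which matches the "store + copy" shape. Volatile accesses, differing access sizes, and real alias analysis are all outside this sketch.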

Similarly a post-reload pass could be used to promote single word loads/stores to double-word operations.
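
As a rough illustration of that promotion, the sketch below pairs neighbouring single-word loads in a list that is already sorted by base register and offset. The 4-byte word size, the double-word alignment rule, and the `pair_loads` name are assumptions for the example; real targets add their own constraints, such as register pairing rules.

```python
WORD = 4  # assumed word size in bytes


def pair_loads(loads):
    """Merge adjacent single-word loads into double-word loads.

    `loads` is a list of (dst, base, offset) tuples, assumed to be
    sorted by (base, offset) already. Two loads are paired when they
    share a base, sit at consecutive word offsets, start on an
    (assumed) double-word boundary, and write distinct registers.
    """
    out = []
    i = 0
    while i < len(loads):
        if i + 1 < len(loads):
            d0, b0, o0 = loads[i]
            d1, b1, o1 = loads[i + 1]
            if (b0 == b1 and o1 == o0 + WORD
                    and o0 % (2 * WORD) == 0 and d0 != d1):
                out.append(("load2", (d0, d1), b0, o0))
                i += 2
                continue
        out.append(("load1",) + loads[i])
        i += 1
    return out
```

The same shape works for stores; the greedy left-to-right scan is a simplification and does not try to find the best pairing.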

If anyone cared about PA 1.1 code generation, it'd be a much cleaner way to support the non-fused fmpyadd fmpysub insns.

Anyway, if you want to move forward with the idea, I'd certainly support doing so.


jeff

Follow-Ups:
- Re: [GCC RFC]A new and simple pass merging paired load store instructions
  - From: Mike Stump
- Re: [GCC RFC]A new and simple pass merging paired load store instructions
  - From: Bin.Cheng

References:
- [GCC RFC]A new and simple pass merging paired load store instructions
  - From: bin.cheng
- Re: [GCC RFC]A new and simple pass merging paired load store instructions
  - From: Mike Stump
