This is the mail archive of the gcc-patches@gcc.gnu.org mailing list for the GCC project.

Re: [PATCH] PR 62173, re-shuffle insns for RTL loop invariant hoisting

From: Jeff Law <law at redhat dot com>
To: Jiong Wang <jiong dot wang at arm dot com>
Cc: Steven Bosscher <stevenb dot gcc at gmail dot com>, "gcc-patches at gcc dot gnu dot org" <gcc-patches at gcc dot gnu dot org>, Kenneth Zadeck <zadeck at naturalbridge dot com>
Date: Thu, 14 May 2015 16:21:11 -0600
Subject: Re: [PATCH] PR 62173, re-shuffle insns for RTL loop invariant hoisting
Authentication-results: sourceware.org; auth=none
References: <54803EBE dot 2060607 at arm dot com> <5480B6D6 dot 2020201 at arm dot com> <548EFE0D dot 1070808 at arm dot com> <548EFE55 dot 6090901 at arm dot com> <CAFiYyc3oYRsYkQwivE+T4A4mysDBe0gjZqjroQ8B2p1J6sakQg at mail dot gmail dot com> <54930811 dot 1020003 at arm dot com> <20141218220908 dot GA20720 at gate dot crashing dot org> <CAHFci28ajc8KqKEvyYYvQHbhYkZ-ExV8ixJ+SNuqV8bg3n7JJQ at mail dot gmail dot com> <CAAfDdZ0EZ6EVN_wYFFuh81ptL2c_Em-Ub-2s4GO7Vp0QKjd-=Q at mail dot gmail dot com> <CAFiYyc32CJJTjakxMLjkCQAJLrv1u0PSjifTs=A4V4q4nOFTKg at mail dot gmail dot com> <5494426A dot 9010209 at naturalbridge dot com> <CAAfDdZ2xrfRYoD8eO1L+8StWh53OhFNBy4ZMRt-K4xSj6r64eA at mail dot gmail dot com> <54DB6587 dot 1020207 at naturalbridge dot com> <54DB9CDB dot 5090304 at arm dot com> <CAAfDdZ29jHnFFGCpi8Adgf4hXk80QQH-vCrV=m0wdZNkT0x84A at mail dot gmail dot com> <CABu31nPMZ5ZCx+frisV+AT9pmC+DumN+Sjt=UscZ48kze4_3YQ at mail dot gmail dot com> <552D4D61 dot 9040100 at redhat dot com> <CAAfDdZ1G_0A0k0RRF2XO_5W6xN13BoA_18+s-Z68P1YTS! 32mMA at mail dot gmail dot com> <n997ft1zcce dot fsf at arm dot com> <5554FCBC dot 50809 at redhat dot com> <n99siayal1i dot fsf at arm dot com>

On 05/14/2015 03:13 PM, Jiong Wang wrote:


Jeff Law writes:

For all kinds of reassociation we have to concern ourselves with adding
overflow where it didn't already occur.  Assuming a 32 bit architecture
we could get overflow if a is 0x7fffffff, b is -4 and c is 3:

0x7fffffff + -4 = 0x7ffffffb
0x7ffffffb + 3 = 0x7ffffffe


If you make the transformation you're suggesting we get

0x7fffffff + 3 = 0x80000002  OVERFLOW
0x80000002 - 4 = 0x7ffffffe

Now if you always know pointers are unsigned, then the overflow is
defined and you'd be OK.  But that's a property of the target and one
that's not well modeled within GCC (we have POINTER_EXTEND_UNSIGNED
which kind of tells us something in this space).
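
As a minimal sketch of the arithmetic above (the helper names are hypothetical and this is not GCC code), the following C fragment evaluates a 32-bit signed sum in a wider type to detect whether the first addition of a chain leaves the signed range. It assumes the intermediate result is simply the wrapped 32-bit bit pattern.

```c
#include <stdint.h>

/* Hypothetical helper: add x and y as 32-bit signed values.
   Stores the wrapped 32-bit bit pattern in *bits and returns 1 if the
   mathematical sum does not fit in int32_t, 0 otherwise.  */
static int add_s32_overflows(int32_t x, int32_t y, uint32_t *bits)
{
    int64_t wide = (int64_t)x + (int64_t)y;
    *bits = (uint32_t)wide;              /* well-defined modular conversion */
    return wide < INT32_MIN || wide > INT32_MAX;
}

/* Hypothetical helper: evaluate (x + y) + z.  Returns 1 if the first
   addition overflows as a signed 32-bit operation; *bits receives the
   final wrapped 32-bit bit pattern.  */
static int first_step_overflows(int32_t x, int32_t y, int32_t z,
                                uint32_t *bits)
{
    uint32_t first;
    int overflowed = add_s32_overflows(x, y, &first);
    *bits = first + (uint32_t)z;         /* unsigned arithmetic wraps */
    return overflowed;
}
```

With a = 0x7fffffff, b = -4 and c = 3, first_step_overflows(a, b, c, ...) returns 0 while first_step_overflows(a, c, b, ...) returns 1, and both orders leave the same final bit pattern 0x7ffffffe.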


I see, understood, cool! Thanks for such detailed explanation.

The above scenario may indeed happen for general pointer arithmetic
reassociation.

One thing may make life easier: my reassociation is restricted to the
frame pointer. The "(plus (plus fp, index_reg) + const_off)" pattern is
used to address a variable on the stack; index_reg and const_off are both
parts of the variable's stack offset. Reassociating them means reordering
the two parts of the stack offset. There may be a way to prove the
transformation will not add extra overflow risk, especially when
index_reg is unsigned.
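
To make the pattern concrete, here is a small C sketch of the reordering (plain C rather than RTL, not taken from the patch; the function and parameter names are hypothetical). It assumes every intermediate address stays inside the same stack object, which is exactly the property that would have to be proven in general.

```c
#include <stddef.h>
#include <stdint.h>

/* Original association: (fp + index) + const_off, recomputed in full
   on every iteration.  */
static uint32_t sum_original(const uint8_t *fp, const size_t *index,
                             size_t n, size_t const_off)
{
    uint32_t sum = 0;
    for (size_t i = 0; i < n; i++)
        sum += *((fp + index[i]) + const_off);
    return sum;
}

/* Reassociated: (fp + const_off) + index.  The first part no longer
   depends on the loop, so it can be computed once outside it.  */
static uint32_t sum_reassociated(const uint8_t *fp, const size_t *index,
                                 size_t n, size_t const_off)
{
    const uint8_t *base = fp + const_off;   /* loop-invariant part */
    uint32_t sum = 0;
    for (size_t i = 0; i < n; i++)
        sum += *(base + index[i]);
    return sum;
}
```

Both functions read the same bytes as long as fp + index[i], fp + const_off and their sum all stay within the object fp points into.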

I understand that for general pointer arithmetic reassociation there is
a big risk: the operands involved largely come from unrelated instructions,
there is no relationship between their values, and we can deduce nothing.

Given the special status of SP, FP and ARGP and a known constant part,
we can probably do something here.  More below...


In addition to worrying about overflow, you have to worry about
segmented architectures with implicit segment selection -- especially if
the segment selection comes from the base register rather than the entire
effective address.


Hmm, understood!

This reminds me of something as dark as the x86 segment descriptors in
protected mode...

Possibly, I've actually never studied the segmented aspects of the x86.
But I'm painfully familiar with the others mentioned :(

My recollection for the segmented stuff on the PA is we only had a
single guard page at both ends of the segment.  So we only allowed an
offset of +-4k when doing address reassociations in legitimize_address.
This was possible because we had callouts from the right places in the
RTL generators/optimizers to allow targets to rewrite address
arithmetic.  So we could naturally bury the target details away from the
code generator/optimizers.

So we could possibly parameterize the transformation around similar
concepts.  The design issue here is it's introducing more target
dependencies in places where we've really wanted to avoid them.  In
theory the gimple optimizers are supposed to be target independent.
Reality is some stuff bleeds into them (the one that's mentioned the
most often is branch costing, but there's others).

*If* we decide to go forward with using some target hooks here, I'd be
tempted to do 2.  One that's effectively a tri-state: full reassociation,
limited reassociation, no reassociation.  The second would bound the
constants in the limited reassociation case.
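
A rough sketch of what that pair of hooks might look like, purely for discussion: none of these names exist in GCC, and the inclusive 4096 bound used for a PA-like target is only an assumption standing in for the "+-4k" limit mentioned above.

```c
/* Hypothetical tri-state answer from the first hook.  */
enum reassoc_kind {
    REASSOC_NONE,      /* never reassociate address arithmetic */
    REASSOC_LIMITED,   /* reassociate only with a bounded constant */
    REASSOC_FULL       /* reassociate freely */
};

/* Hypothetical per-target description combining both hooks.  */
struct reassoc_target {
    enum reassoc_kind kind;
    long max_abs_offset;   /* second hook; only used for REASSOC_LIMITED */
};

/* Returns 1 if a constant offset may take part in address
   reassociation on the given target, 0 otherwise.  */
static int reassoc_allowed(const struct reassoc_target *t, long const_off)
{
    switch (t->kind) {
    case REASSOC_FULL:
        return 1;
    case REASSOC_LIMITED:
        return const_off >= -t->max_abs_offset
               && const_off <= t->max_abs_offset;
    case REASSOC_NONE:
    default:
        return 0;
    }
}
```

A PA-like target would then describe itself as { REASSOC_LIMITED, 4096 }, so an offset of 100 is accepted and an offset of -5000 is rejected, while the target-independent code only ever sees the yes/no answer.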


Thoughts?

Jeff
