This is the mail archive of the gcc@gcc.gnu.org mailing list for the GCC project.


Re: Project Ranger

From: Andrew MacLeod <amacleod at redhat dot com>
To: Eric Botcazou <ebotcazou at adacore dot com>
Cc: gcc at gcc dot gnu dot org, Aldy Hernandez <aldyh at redhat dot com>, Jeff Law <law at redhat dot com>
Date: Wed, 30 May 2018 10:03:09 -0400
Subject: Re: Project Ranger
References: <5607b582-639b-7517-e052-014fabfe0ad4@redhat.com> <1946549.HoKu4gN0qI@polaris>

On 05/30/2018 03:41 AM, Eric Botcazou wrote:

> > The Ranger is far enough along now that we have confidence in both its
> > approach and ability to perform, and would like to solicit feedback on
> > what you think of it, any questions, possible uses, as well as
> > potential requirements to integrate with trunk later this stage.

> The PDF document mentions that you first intended to support symbolic ranges
> but eventually dropped them as "too complex, and ultimately not necessary".
>
> I don't entirely disagree with the former part, but I'm curious about the
> latter part: how do you intend to deal in the long term with cases that do
> require symbolic information to optimize things?  The TODO page seems to
> acknowledge the loophole but only mentions a plan to deal with equivalences,
> which is not sufficient in the general case (as acknowledged too on the page).

First, we'll collect the cases that demonstrate a unique situation we
care about. I have 4 very specific cases that show current
shortcomings, not just with the Ranger, but a couple we don't handle
with VRP today. I'll eventually get those put onto the wiki so the
list can be updated.

I think most of these cases that care about symbolics are not so much
range related, but rather an algorithmic layer on top. Any follow-on
optimization to either enhance or replace VRP or anything similar will
simply use the ranger as a client. If it turns out there are cases
where we *have* to remember the end point of a range as a symbolic, then
the algorithm can track that symbolic along with the range, and request
a re-evaluation of the range when the value of that symbolic changes.

That's the high-level view. I'm not convinced the symbolic has to be in
the range in order to solve problems, for 2 reasons:

1) The Ranger maintains some definition chains internally and has a
clear idea of what ssa_names can affect the outcome of a range. It
tracks all these dependencies on a per-bb basis in the gori-map
structure as imports and exports. The iterative approach I mentioned
in the document would use this info to decide that ranges in a block
need to be re-evaluated because an input to this block has changed.
This is similar to the way VRP and other passes iterate until nothing
changes. If we get a better range for an ssa_name than we had before,
push it on the stack to look for potential re-evaluation, and keep going.
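
The push-and-re-evaluate loop described above can be sketched as follows. This is only an illustration, not GCC code: `Range`, `Stmt` and `propagate` are hypothetical names, statements are restricted to the toy form `dst = src + constant`, and "better" is simplified to "different from what was recorded before".

```cpp
#include <cassert>
#include <map>
#include <string>
#include <vector>

// Closed integer interval [lo, hi]; a toy stand-in for a real range class.
struct Range {
  long lo, hi;
  bool operator== (const Range &o) const { return lo == o.lo && hi == o.hi; }
};

// Toy statement of the form: dst = src + addend.
struct Stmt {
  std::string dst, src;
  long addend;
};

// CHANGED is a name whose range was just updated and is present in RANGES.
// Re-evaluate every statement that reads a changed name, and keep going
// until no range changes. Assumes the statements form no cycles (no PHIs)
// and that the additions do not overflow.
void propagate (const std::vector<Stmt> &stmts,
                std::map<std::string, Range> &ranges,
                const std::string &changed)
{
  std::vector<std::string> stack{changed};
  while (!stack.empty ())
    {
      std::string name = stack.back ();
      stack.pop_back ();
      for (const Stmt &s : stmts)
        {
          if (s.src != name)
            continue;
          const Range &in = ranges.at (name);
          Range out{in.lo + s.addend, in.hi + s.addend};
          auto it = ranges.find (s.dst);
          if (it == ranges.end () || !(it->second == out))
            {
              ranges[s.dst] = out;      // record the new range
              stack.push_back (s.dst);  // its users must be revisited too
            }
        }
    }
}
```

Seeding the map with a range for one name and calling `propagate` on it updates everything downstream. A second call with an unchanged range records nothing new and terminates after a single pass over the statements.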

This is what I referred to as the Level 4 ranger in the document. I kind
of glossed over it because I didn't want to get into the full-blown
design I originally had, nor to rework it based on the current
incarnation of the Ranger. I wanted to primarily focus on what we
currently have that is working so we can move forward with it.

I don't think we need to track the symbolic name in the range because
the Ranger tracks these dependencies for each ssa-name and can indicate
when we may need to reevaluate them. There is an exported routine from
the block ranger:

      tree single_import (tree name);

If there is a sole import, it will return the ssa-name that NAME is
dependent on that can affect the range of NAME. We added that API so
Aldy's new threading code could utilize this ability to a small degree.

Bottom line: The ranger has information that a pass can use to decide
that a range could benefit from being reevaluated. This identifies any
symbolic component of a range from that block.
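
To make the idea of a sole import concrete, here is a toy model of one way such a lookup could behave. It is not the Ranger's implementation: `Def`, `Block` and `toy_single_import` are hypothetical, and every definition in the block is assumed to have exactly one SSA operand plus a constant.

```cpp
#include <cassert>
#include <map>
#include <string>

// Toy definition inside one basic block: name = operand OP constant.
struct Def {
  std::string operand;  // the single SSA operand
  char op;              // '+' or '/', kept only for illustration
  long constant;
};

// All definitions in the block, keyed by the name they define.
using Block = std::map<std::string, Def>;

// Follow single-operand definitions backwards until reaching a name that
// is not defined in the block; that name is the import. Returns an empty
// string when NAME itself is not defined in the block.
std::string toy_single_import (const Block &bb, const std::string &name)
{
  auto it = bb.find (name);
  if (it == bb.end ())
    return "";
  std::string cur = it->second.operand;
  while ((it = bb.find (cur)) != bb.end ())
    cur = it->second.operand;
  return cur;
}
```

For a block containing `j_1 = q_6 / 2` and `i_2 = j_1 + 3`, asking this toy routine about `i_2` yields `q_6`, which is the one name a client would need to watch.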

2) This entire approach is modeled on walking the IL to evaluate a
range. If we put symbolics and expressions in the range, we are really
duplicating information that is already in the IL, and have to make a
choice of exactly what and how we do it.

BB3:
   j_1 = q_6 / 2
   i_2 = j_1 + 3
   if ( i_2 < k_4)

we could store the range of k_4 as [i_2 + 1, MAX] (which seems the
obvious one)

we could also store it as [j_1 + 4, MAX]

or even [q_6 / 2 + 4, MAX]. But we have to decide in advance, and we
have extra work to do if it turns out to be one of the other names we
ended up wanting.

At some point later on we decide we either don't know anything about i_2
(or j_1, or q_6), or we have found a range for it, and now need to take
that value and evaluate the expression stashed in the range in order to
get the final result. Note that whatever algorithm is doing this must
also keep track of this range somehow in order to use it later.

With the Ranger model, the same algorithm gets a range, and if it thinks
it might need to be re-evaluated for whatever reason, it can just track
an extra bit of info (like i_2 for instance) alongside the range
(rather than in it). If it thinks the range needs to be re-evaluated,
it can simply request a new range from the ranger.

You also don't have to decide whether to track the range with i_2 or j_1
(or even q_6). The Ranger can tell you that the range it gives you for
k_4 is accurate unless you get a new value for q_6. That is really what
you want to track. You might later want to reevaluate the range only if
q_6 changes. If it doesn't, you are done.
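
A minimal sketch of a client keeping the import beside the range, using the BB3 example. All names here (`Range`, `Tracked`, `k4_on_true_edge`, `refresh`) are hypothetical, the block is hard-coded rather than walked from real IL, and the arithmetic is assumed not to overflow.

```cpp
#include <cassert>
#include <climits>
#include <string>

// Closed integer interval [lo, hi].
struct Range { long lo, hi; };

// BB3, evaluated for the true edge of "if (i_2 < k_4)":
//   j_1 = q_6 / 2;  i_2 = j_1 + 3;
// On that edge k_4 > i_2, so k_4 lies in [min (i_2) + 1, MAX].
Range k4_on_true_edge (Range q6)
{
  Range j1{q6.lo / 2, q6.hi / 2};  // dividing by 2 preserves ordering
  Range i2{j1.lo + 3, j1.hi + 3};
  return Range{i2.lo + 1, LONG_MAX};
}

// What the client stores: the numeric range, with the import beside it
// rather than inside it.
struct Tracked {
  Range range;
  std::string import;
};

// Re-evaluate only when the name that changed is the tracked import.
// Returns true if a re-evaluation happened.
bool refresh (Tracked &t, const std::string &changed, Range new_value)
{
  if (changed != t.import)
    return false;
  t.range = k4_on_true_edge (new_value);
  return true;
}
```

A change to any name other than `q_6` leaves the tracked range untouched. A new range for `q_6` triggers a fresh evaluation, with no expression stored in the range itself.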

Bottom line: The ranger indicates what the symbolic aspect of the range
is with the import. The net effect is the symbolic expression using that
import is also the longest possible expression in the block
available. It just picks it up from the IL rather than storing it in
the range.

I would also note that we track multiple imports, they just aren't
really exposed as yet since they aren't really being used by anyone.
k_4 is also tagged as an import to that block, and if you ask for the
range of i_2, you'd get a range, and k_4 would be listed as the import.

Also note more complexity is available. Once we hit statements with
multiple ssa_names, we stop tracking currently, but we do note the
imports at that point:

BB4:
  z_4 = a_3 + c_2
  z_5 = z_4 + 3
  if (  q_8 < z_5)

we can get a range for q_8, and the ranger does know that a_3 and c_2
are both imports to defining z_5. By using the import information, we
effectively get a "symbolic" range of

[MIN, a_3 + c_2 + 2] for q_8 in this case (the comparison is strict, so
the inclusive bound is z_5 - 1).
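
The two-import case can be sketched numerically as below. Again this is an illustration under assumptions, not Ranger code: `Result` and `q8_on_true_edge` are hypothetical, BB4 is hard-coded, and overflow is ignored. Because the comparison `q_8 < z_5` is strict, the inclusive numeric bound comes out one below the largest possible z_5.

```cpp
#include <cassert>
#include <climits>
#include <string>
#include <vector>

// Closed integer interval [lo, hi].
struct Range { long lo, hi; };

// A range together with the names it was computed from.
struct Result {
  Range range;
  std::vector<std::string> imports;
};

// BB4, evaluated for the true edge of "if (q_8 < z_5)":
//   z_4 = a_3 + c_2;  z_5 = z_4 + 3;
// On that edge q_8 <= max (z_5) - 1.
Result q8_on_true_edge (Range a3, Range c2)
{
  Range z4{a3.lo + c2.lo, a3.hi + c2.hi};
  Range z5{z4.lo + 3, z4.hi + 3};
  return Result{Range{LONG_MIN, z5.hi - 1}, {"a_3", "c_2"}};
}
```

A client holding this result knows the bound needs another look only if `a_3` or `c_2` gets a new range.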

Which means I think the import approach of the Ranger has the benefit of
being simpler in many ways, yet more powerful should we wish to explore
that route.

The one place this falls down is if you get a range back from a call and
you have no idea where it came from, but want to be able to re-evaluate
it later. I am not sure what this use case looks like (if it exists
:-), but I would be surprised if it wasn't something that could be
handled with an algorithm change. I know in discussions with Aldy and
Jeff as we went through various use cases, this model does sometimes
require a bit of rethinking of how you approach using the information,
since a lot of things we're used to worrying about just happen under the
covers.

Does that help? If it does, I'll add this to the coverage in the wiki
page.


Andrew

References:
- Project Ranger
  - From: Andrew MacLeod
- Re: Project Ranger
  - From: Eric Botcazou
