This is the mail archive of the gcc@gcc.gnu.org mailing list for the GCC project.

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]
Other format:	[Raw text]

On-Demand range technology [3/5] - The Prototype

From: Andrew MacLeod <amacleod at redhat dot com>
To: GCC <gcc at gcc dot gnu dot org>
Cc: Jeff Law <law at redhat dot com>, Aldy Hernandez <aldyh at redhat dot com>
Date: Wed, 22 May 2019 21:29:01 -0400
Subject: On-Demand range technology [3/5] - The Prototype

There is a functioning prototype in branch “ssa-range” which is a proofof concept that the approach is functional as well as quick, and can beused to answer questions which come up regarding what it can and can’tdo. Our last merge was on April 13th, so it's fairly up to date.

We have implemented a flexible range class (irange) which allows formultiple subranges to represent a range, and which can be extended inthe future to types other than integral. We use this throughout, but itcould be replaced in the ranger with any similar API. Conversionroutines are also provided to convert from irange to value_range andvalue_range to irange.

A full set of tree_code range-op routines are implemented. We havecommoned as much code as possible with the existing VRP range extractioncode. Also, we have added additional code for calculating the otheroperands from a known result in numerous cases.


The code base in VRP has been modified (via a flag) to
    - Work completely with the native value_range like it does today.

- Use irange and the range-ops component under the covers toextract ranges. Requests in VRP are then converted from value_ranges toirange, called into the range-op routines, and then converted back tovalue_range for VRP/EVRP’s use. - Do operations both ways and compare the results to make sure bothagree on the range, and trap if they do not.

The branch defaults to the compare and trap mode to ensure everything isworking correctly. This has been our correctness model for statementrange extraction and was active during the Fedora package builds. Theonly time we disabled it was to do performance runs vs RVRP, and werelooking at both branch and trunk times for EVRP and VRP.

Of note, we drop all symbolics in ranges to VARYING on everything exceptPLUS and MINUS, which we leave as native calculations if there aresymbolics present. More on symbolics later.


A VRP like pass called RVRP has been implemented.

- The vr-values statement simplification code has been factored outto be range agnostic, meaning that these routines can operate on eithervalue_range or irange. Thus, we are using a common code base to performstatement simplification as well. - For complete compatibility with EVRP, the RVRP pass buildsdominators and instantiates the SCEV loop information so we have looprange info available. RVRP does not need this info to run, but wouldmiss some of the test cases which depend on loop ranges. - RVRP is set up to demonstrate it can process the IL in multipledirections and bootstraps/passes all tests in all directions.

        * Dominator order
        * Post-dominator order
        * BB1 thru BBn
        * BBn thru BB1

* branch-only mode where only branches at the end of each BBare examined for folding opportunities


4 additional passes have been converted to use the ranger model:
    - sprintf - removed the dominator building/walking

- warn alloca - replaced calls to get global ranges with calls thatnow return context sensitive ranges.

    - warn restrict - just replaced EVRP range calls with ranger calls.

- backwards threader - enhanced to use contextual range informationto make additional threading decisions.



Symbolic Ranges

One big concern last year expressed was my decision to abolish symbolicranges.

I continue to maintain that there is no need to track the range of x_2as [y_3 + 5, MAX] for x_2 = y_3 + 5. All you need to do is look at thedefinition of x_2, and the same information is exposed right there inthe IL. If one requires the symbolic information, the same on-demandprocess could lookup that information as well. This in turn, makes thecode for ranges much simpler, easier to maintain, and less likely tointroduce bugs.

We have found through our prototype that symbolics in ranges are notnearly as prevalent as one would think. During the work to share acommon code base with VRP, we found that symbolic ranges are irrelevantfor everything except PLUS_EXPR and MINUS_EXPR. The shared code in ourbranch drops symbolics to varying immediately for everything else, andit has no impact on EVRP, or VRP or any tests we can find anywhere. Furthermore, we never trapped while comparing ranges generated by VRPversus generating them with range-ops which drops symbolics to varying.

We tried modifying VRP such that we don’t even create symbolicendpoints, but rather revert to VARYING always. We can find no testcase that fails because a range is not calculated properly due toresolving these endpoints.

There are a few that fail due to the symbolic being used to help trackrelationals.. Ie


     x_2 = y_3 + 5
    If (x_2 > y_3)     // can be folded since we know x_2 must be < y_3

VRP generates a range for x of [ y_3+5, MAX ] and at various pointsuses that to infer a relational or equivalence. Ie, it becomes easy totell that the condition must always be true since the lower bound of therange is y_3 + 5.

I argue this is not a range question, but rather a different problemwhich VRP has chosen to solve by piggybacking on the rangerepresentation. This leads to complications/complexity when trying toevaluate ranges because they must constantly be on the lookout forsymbolics. This information is then carried around for the life of thepass, even if it is never used. It also forces anyrelational/equivalency queries to be handled within the context of theVRP pass.

This aspect of symbolics would be handled by a relational/equivalenceprocessing engine that would be follow on work. Using the same basicmodel as ranges, each tree code is taught to understand the relationbetween its operands, and then we can answer equivalency and relationalaccurately as well. It would be available for any pass to useindependent of ranges. I will expound upon that a bit in the futuredirections section.


Comments and feedback always welcome!
Thanks
Andrew

Follow-Ups:
- Re: On-Demand range technology [3/5] - The Prototype
  - From: Richard Biener

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]