This is the mail archive of the
gcc-patches@gcc.gnu.org
mailing list for the GCC project.
Re: [PATCH][0/n] Merge from match-and-simplify
- From: Richard Biener <rguenther at suse dot de>
- To: Sebastian Pop <sebpop at gmail dot com>
- Cc: gcc-patches at gcc dot gnu dot org, marc dot glisse at inria dot fr
- Date: Mon, 20 Oct 2014 13:42:28 +0200 (CEST)
- Subject: Re: [PATCH][0/n] Merge from match-and-simplify
- Authentication-results: sourceware.org; auth=none
- References: <alpine dot LSU dot 2 dot 11 dot 1410151450430 dot 20733 at zhemvz dot fhfr dot qr> <20141016203852 dot GB29134 at f1 dot c dot bardezibar dot internal> <alpine dot LSU dot 2 dot 11 dot 1410170951450 dot 9891 at zhemvz dot fhfr dot qr> <20141017163558 dot GD29134 at f1 dot c dot bardezibar dot internal> <20141017182811 dot GA14499 at f1 dot c dot bardezibar dot internal>
On Fri, 17 Oct 2014, Sebastian Pop wrote:
> Sebastian Pop wrote:
> > Richard Biener wrote:
> > > looks like
> > > RTL issues and/or IVOPTs issues?
> >
> > I should have posted the first diff between the compilers with -fdump-tree-all:
> > that would expose the problem at its root.
>
> Looks like this is caused by the fwprop pass:
>
> diff -u -r ./foo.i.087t.forwprop3 ../mas/foo.i.087t.forwprop3
> --- ./foo.i.087t.forwprop3 2014-10-17 13:17:29.985327000 -0500
> +++ ../mas/foo.i.087t.forwprop3 2014-10-17 13:17:29.308814000 -0500
> @@ -5,6 +5,8 @@
> Pass statistics:
> ----------------
>
> +Applying pattern match-comparison.pd:43, gimple-match.c:11747
> +gimple_simplified to if (i_20 != 99)
>
> Pass statistics:
> ----------------
> @@ -60,7 +62,7 @@
> i_17 = i_20 + 1;
> # DEBUG iD.2450 => i_17
> # DEBUG iD.2450 => i_17
> - if (i_17 != 100)
> + if (i_20 != 99)
> goto <bb 3>;
> else
> goto <bb 4>;
Ok, so this is one effect of the thing Marc pointed out - currently
no pattern (well, all but one) guards itself with has_single_use
predicates.
That was a conscious decision and the idea was that the caller should
do this via its lattice valueization function which could look like
  tree
  valueize (tree t)
  {
    if (TREE_CODE (t) == SSA_NAME
        && !has_single_use (t))
      return NULL_TREE;
    return t;
  }
But of course doing that unconditionally would also pessimize code.
Generally we'd like to avoid un-CSEing stuff in a way that cannot
be CSEd again. That's a more complex condition than what can be
implemented with has_single_use. You might also consider a
stmt doing a_1 + a_1 where a_1 has two uses now.
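To make the behaviour of such a hook concrete, here is a minimal
self-contained sketch. It does not use GCC's real data structures:
struct toy_name, toy_has_single_use and toy_valueize are hypothetical
stand-ins that only model a use count, assuming the hook's contract is
"return NULL to stop the matcher from looking through this name".

```c
#include <assert.h>
#include <stddef.h>

/* Toy stand-in for an SSA name (hypothetical, not GCC's tree):
   only the identifier and the number of uses are modeled.  */
struct toy_name
{
  const char *ident;
  int num_uses;
};

/* Hypothetical analogue of has_single_use.  */
static int
toy_has_single_use (const struct toy_name *t)
{
  return t->num_uses == 1;
}

/* Valueization hook: refuse to look through the definition of a
   name that has more than one use, otherwise hand the name back.  */
static const struct toy_name *
toy_valueize (const struct toy_name *t)
{
  assert (t != NULL);
  if (!toy_has_single_use (t))
    return NULL;
  return t;
}
```

Note that a name used twice by the same statement, as in a_1 + a_1,
has num_uses == 2 and is rejected by this check as well, which is one
reason a plain use count is too coarse a condition.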
For Sebastian's case above the issue is that we are apparently
bad at optimizing post-increment exit tests. But if you consider
code like
  i_2 = i_1 + 1;
  b1_3 = i_2 < 100;
  b2_4 = i_2 > 50;
  if (b1_3 && b2_4)
    ...
then it is profitable to remove i_2 by changing the two comparisons
to i_1 <= 98 and i_1 > 49.
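The arithmetic behind that rewrite can be checked mechanically. The
sketch below assumes the rewritten comparisons test the pre-increment
value i_1 and that i_1 + 1 does not overflow; the function names are
made up for illustration and are not part of GCC.

```c
#include <assert.h>
#include <limits.h>
#include <stdbool.h>

/* Original form: both comparisons use i_2 = i_1 + 1.  */
static bool
exit_test_original (int i_1)
{
  int i_2 = i_1 + 1;
  bool b1_3 = i_2 < 100;
  bool b2_4 = i_2 > 50;
  return b1_3 && b2_4;
}

/* Simplified form: the increment is folded into the constants,
   so i_2 is no longer needed.  */
static bool
exit_test_simplified (int i_1)
{
  bool b1_3 = i_1 <= 98;
  bool b2_4 = i_1 > 49;
  return b1_3 && b2_4;
}

/* Return true if both forms agree for every i_1 in [lo, hi].
   hi must be below INT_MAX so that neither i_1 + 1 nor the loop
   increment overflows.  */
static bool
forms_agree (int lo, int hi)
{
  assert (lo <= hi && hi < INT_MAX);
  for (int i_1 = lo; i_1 <= hi; i_1++)
    if (exit_test_original (i_1) != exit_test_simplified (i_1))
      return false;
  return true;
}
```

Calling forms_agree over a range that covers both boundaries (49/50
and 98/99) exercises the cases where an off-by-one in the folded
constants would show up.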
I thought about doing all simplifications first without committing
any simplified sequence to the IL, then scanning over the result,
pruning out cases that end up pessimizing code (how exactly isn't
yet clear to me).
So I'm not sure what we want to do here now. I don't very much like
doing things explicitly in the pattern description (nor using the
"has_single_use" predicate).
I suppose for the gimple_build () stuff we could restrict simplifications
to the expression we are building (not simplifying with SSA defs in the
IL), more exactly mimicking fold_buildN behavior.
I suppose for forwprop we could use the above valueize hook (but then
regress because not all patterns as implemented in forwprop guard
their def stmt lookup with has_single_use...).
Any opinion on this? Any idea of a "simple" cost function if
you have the function's IL before and after simplification (but
without any DCE/CSE applied)?
Thanks,
Richard.