Bug 49390 - [4.6/4.7 Regression] GCSE miscompilation
Summary: [4.6/4.7 Regression] GCSE miscompilation
Alias: None
Product: gcc
Classification: Unclassified
Component: rtl-optimization (show other bugs)
Version: 4.6.0
: P3 normal
Target Milestone: 4.6.1
Assignee: Jakub Jelinek
Keywords: wrong-code
Depends on:
Reported: 2011-06-13 11:25 UTC by Jakub Jelinek
Modified: 2011-06-14 22:17 UTC (History)
2 users (show)

See Also:
Target: x86_64-linux
Known to work: 4.4.6, 4.5.2
Known to fail: 4.6.0, 4.7.0
Last reconfirmed: 2011-06-14 12:29:44

rh712480.c (631 bytes, text/plain)
2011-06-13 11:25 UTC, Jakub Jelinek
gcc46-pr49390.patch (2.33 KB, patch)
2011-06-13 15:35 UTC, Jakub Jelinek
Details | Diff

Note You need to log in before you can comment on or make changes to this bug.
Description Jakub Jelinek 2011-06-13 11:25:52 UTC
Created attachment 24507 [details]

The attached testcase is miscompiled, starting with
at -O2, -O2 -fno-gcse fixes it.
Comment 1 Jakub Jelinek 2011-06-13 12:36:58 UTC
Blindly ignoring MEM_EXPR or other attributes looks very wrong to me.
Guess in some cases it could return true even when MEM_ATTRS aren't identical,
but they'd need to have the same behavior during alias analysis at least, or
one of the attrs would need to be a superset of the other ones (but then it would need to be ensured that the callers pick up the superset instead of
the other MEM_ATTRS).

On this particular testcase, exp_equiv_p returns 1 for_gcse on
MEM_EXPRs c_4(D)->s1+0 and c_1->s1+0 (other attributes are the same).
c_1 is:
# c_1 = PHI <[rh712480.i : 41:7] [rh712480.i : 41] &e(2), c_4(D)(4), c_4(D)(5)>
and e is an automatic variable.  If GCSE thinks those two MEMs are
equivalent (one, c_4(D) based is present e.g. in the else branch of the if (c == 0) stmt, c_1 based afterwards) and merges those two into the one that has c_4(D)->s1 in it instead of c_1->s1, then aliasing oracle will see c_4(D)->s1
MEM_EXPR and e.s1 MEM_EXPR and will say that they can't alias (while for c_1->s1 and e.s1 they could and do).
I think such a change was ok in 4.4 and earlier era, where alias oracle hasn't been used to disambiguate RTL MEMs, but there also it would be NULL -> s1
rather than c_1 -> s1 etc.
Comment 2 Jakub Jelinek 2011-06-13 13:53:23 UTC
Perhaps we should have some exceptions where we allow different MEM_ATTRS, but they need to be carefully chosen.  E.g. if both refs are indirect refs and are similar, with the same points-to info, it would be ok.

    return 0;

  if (MEM_ATTRS (x) != MEM_ATTRS (y) && mem_attrs_equiv_p (x, y))
    return 0;

where mem_attrs_equiv_p would call ao_ref_from_mem on both x and y,
if at least one of them returns false, fail, compare all integer fields for
equality (alias sets using ao_ref_base_alias_set/ao_ref_alias_set) and compare base kinds, for indirect we could check for same type and same points-to info (couldn't find a points-to set comparison function, e.g. for pt->vars can just pointer equality be tested or does it need bitmap_equal_p?).
Comment 3 Jakub Jelinek 2011-06-13 14:28:31 UTC
Perhaps, if the tests are more expensive, case MEM: if (for_gcse) could
first do the cheap tests, then
  if (!exp_equiv_p (XEXP (x, 0), XEXP (y, 0), validate, 1))
    return 0;
then do the more expensive tests and then just return 1; instead of falling
through into the generic code.

Case which I agree would be nice to say exp_equiv_p is true is e.g.:
  if (c->s1 == 0)
    foo (1, 0, c->s1, c->s2);
  foo (2, 0, c->s1, c->s2);
Here e.g. c_4(D)->s1 and c_1->s1 aren't equal MEM_EXPRs, but they have the same SSA_VAR_NAME (not sure if we need to check it, perhaps for restrict?), pt->vars
and all other pt members are the same too, so I think both MEM_ATTRS should behave the same in the alias oracle.
Comment 4 Jakub Jelinek 2011-06-13 15:35:13 UTC
Created attachment 24510 [details]

Untested patch.  Richard, what do you think about it?
Bernd, do you still have some testcases around which got suboptimal code generated before your reversion?
Comment 5 Jakub Jelinek 2011-06-14 14:59:55 UTC
Author: jakub
Date: Tue Jun 14 14:59:52 2011
New Revision: 175023

URL: http://gcc.gnu.org/viewcvs?root=gcc&view=rev&rev=175023
	PR rtl-optimization/49390
	2010-06-29  Bernd Schmidt  <bernds@codesourcery.com>

	* cse.c (exp_equiv_p): For MEMs, if for_gcse, only compare

	* gcc.c-torture/execute/pr49390.c: New test.

Comment 6 Jakub Jelinek 2011-06-14 15:01:15 UTC
Author: jakub
Date: Tue Jun 14 15:01:10 2011
New Revision: 175024

URL: http://gcc.gnu.org/viewcvs?root=gcc&view=rev&rev=175024
	PR rtl-optimization/49390
	2010-06-29  Bernd Schmidt  <bernds@codesourcery.com>

	* cse.c (exp_equiv_p): For MEMs, if for_gcse, only compare

	* gcc.c-torture/execute/pr49390.c: New test.

Comment 7 Jakub Jelinek 2011-06-14 22:17:21 UTC