This is the mail archive of the gcc-patches@gcc.gnu.org mailing list for the GCC project.

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]
Other format:	[Raw text]

Re: [PATCH][combine][RFC][2/2] PR rtl-optimization/68796: Perfer zero_extract comparison against zero rather than unsupported shorter modes

From: Jeff Law <law at redhat dot com>
To: Kyrill Tkachov <kyrylo dot tkachov at foss dot arm dot com>, Bernd Schmidt <bschmidt at redhat dot com>, GCC Patches <gcc-patches at gcc dot gnu dot org>
Cc: Segher Boessenkool <segher at kernel dot crashing dot org>
Date: Thu, 17 Dec 2015 10:33:26 -0700
Subject: Re: [PATCH][combine][RFC][2/2] PR rtl-optimization/68796: Perfer zero_extract comparison against zero rather than unsupported shorter modes
Authentication-results: sourceware.org; auth=none
References: <5672D68F dot 3030408 at foss dot arm dot com> <5672DB97 dot 7090800 at redhat dot com> <5672DE74 dot 7080802 at foss dot arm dot com> <5672DEE0 dot 8000509 at redhat dot com> <5672E249 dot 6030602 at foss dot arm dot com> <5672E9EE dot 4010207 at redhat dot com> <5672EB36 dot 8040601 at foss dot arm dot com>

On 12/17/2015 10:04 AM, Kyrill Tkachov wrote:

In this case, I'm expecting a QImode compare with zero to map down to
the aarch64 TST reg, #255 instruction which
definitely zeroes out any bits outside of QImode (as it is a bitwise AND
with a bitmask),
so zero_extract is the more correct expression here, no?

It's more about the semantics of the code and how it interacts with RTLgeneration, optimization and analysis than it is with the final assemblygenerated by the backend that drives SUBREG vs zero_extract.

The backend assembly code generator is free to implement strictersemantics (such as defining all the bits for a paradoxical subreg), butthe rest of the compiler can not depend on those stricter semantics.

The easiest way to think about the subreg case here is that it's usedwhen we've got a narrow object that we want to view in a wider mode, butwe don't actually care about the upper bits. The widening is merely tomake the mode match another operand.

zero_extract is still the canonical form. subreg is a specialized formfor cases where the upper bits are "don't care" values. This shouldprobably be documented as the current state of the world.

I think it's an open question whether or not to drop the subreg form andalways use zero-extract. I've certainly seen cases where the former is*supposed* to allow better code generation, but in fact actually gets inthe way resulting in poorer code generation.


Jeff

References:
- [PATCH][combine][RFC][2/2] PR rtl-optimization/68796: Perfer zero_extract comparison against zero rather than unsupported shorter modes
  - From: Kyrill Tkachov
- Re: [PATCH][combine][RFC][2/2] PR rtl-optimization/68796: Perfer zero_extract comparison against zero rather than unsupported shorter modes
  - From: Bernd Schmidt
- Re: [PATCH][combine][RFC][2/2] PR rtl-optimization/68796: Perfer zero_extract comparison against zero rather than unsupported shorter modes
  - From: Kyrill Tkachov
- Re: [PATCH][combine][RFC][2/2] PR rtl-optimization/68796: Perfer zero_extract comparison against zero rather than unsupported shorter modes
  - From: Bernd Schmidt
- Re: [PATCH][combine][RFC][2/2] PR rtl-optimization/68796: Perfer zero_extract comparison against zero rather than unsupported shorter modes
  - From: Kyrill Tkachov
- Re: [PATCH][combine][RFC][2/2] PR rtl-optimization/68796: Perfer zero_extract comparison against zero rather than unsupported shorter modes
  - From: Jeff Law
- Re: [PATCH][combine][RFC][2/2] PR rtl-optimization/68796: Perfer zero_extract comparison against zero rather than unsupported shorter modes
  - From: Kyrill Tkachov

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]