This is the mail archive of the
gcc-patches@gcc.gnu.org
mailing list for the GCC project.
Re: [PATCH] Fix up Yr constraint
- From: Jakub Jelinek <jakub at redhat dot com>
- To: Uros Bizjak <ubizjak at gmail dot com>
- Cc: Kirill Yukhin <kirill dot yukhin at gmail dot com>, Ilya Enkovich <ilya dot enkovich at intel dot com>, "gcc-patches at gcc dot gnu dot org" <gcc-patches at gcc dot gnu dot org>
- Date: Tue, 24 May 2016 21:02:51 +0200
- Subject: Re: [PATCH] Fix up Yr constraint
- References: <20160524165527 dot GM28550 at tucnak dot redhat dot com> <CAFULd4aJDh3UU6YZ+zpP_hEa4zneQUfhX9Gaw-cZya4o2Pde=Q at mail dot gmail dot com>
- Reply-to: Jakub Jelinek <jakub at redhat dot com>
On Tue, May 24, 2016 at 08:35:12PM +0200, Uros Bizjak wrote:
> On Tue, May 24, 2016 at 6:55 PM, Jakub Jelinek <jakub@redhat.com> wrote:
> > Hi!
> >
> > Contrary to what was said when it was submitted, the Yr constraint is
> > actually always NO_REX_SSE_REGS or NO_REGS, never ALL_SSE_REGS, so
> > the RA restriction to only the first 8 regs is applied no matter what we
> > tune for.
> >
> > This is because we test X86_TUNE_AVOID_4BYTE_PREFIXES, which is an enum
> > value (59) and therefore always nonzero, rather than actually checking
> > the tune flag.
> >
> > Bootstrapped/regtested on x86_64-linux and i686-linux, ok for trunk?
> >
> > 2016-05-24 Jakub Jelinek <jakub@redhat.com>
> >
> > * config/i386/i386.h (TARGET_AVOID_4BYTE_PREFIXES): Define.
> > * config/i386/constraints.md (Yr): Test TARGET_AVOID_4BYTE_PREFIXES
> > rather than X86_TUNE_AVOID_4BYTE_PREFIXES.
>
> Uh, another brown-paper bag bug...
>
> OK everywhere.
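The bug described in the quoted message can be reproduced outside GCC. The following is only a minimal sketch: the enum, the register-class values, the feature array and the helper set_tune_flag are hypothetical stand-ins, and only the identifier names and the value 59 come from the message. It shows that the old condition ignores the tune flag entirely, while the new macro reads it.

```c
#include <assert.h>

/* Hypothetical stand-ins for the GCC definitions; only the names and the
   value 59 are taken from the message above.  */
enum x86_tune_index
{
  X86_TUNE_AVOID_4BYTE_PREFIXES = 59,
  X86_TUNE_LAST
};

enum reg_class { NO_REGS, NO_REX_SSE_REGS, ALL_SSE_REGS };

static unsigned char ix86_tune_features[X86_TUNE_LAST];

/* Same shape as the macro added to i386.h by the patch.  */
#define TARGET_AVOID_4BYTE_PREFIXES \
  ix86_tune_features[X86_TUNE_AVOID_4BYTE_PREFIXES]

/* Helper for this sketch only: switch one tune flag on or off.  */
static void
set_tune_flag (enum x86_tune_index idx, int on)
{
  assert (idx < X86_TUNE_LAST);
  ix86_tune_features[idx] = on ? 1 : 0;
}

/* Old condition: tests the enum constant itself (59), which is always
   nonzero, so ALL_SSE_REGS can never be selected.  */
static enum reg_class
yr_before_patch (int target_sse)
{
  return target_sse
	 ? (X86_TUNE_AVOID_4BYTE_PREFIXES ? NO_REX_SSE_REGS : ALL_SSE_REGS)
	 : NO_REGS;
}

/* New condition: indexes the feature array, so the result follows the
   tune flag.  */
static enum reg_class
yr_after_patch (int target_sse)
{
  return target_sse
	 ? (TARGET_AVOID_4BYTE_PREFIXES ? NO_REX_SSE_REGS : ALL_SSE_REGS)
	 : NO_REGS;
}
```

With the flag cleared, yr_before_patch (1) still yields NO_REX_SSE_REGS, whereas yr_after_patch (1) yields ALL_SSE_REGS.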
I fear it might be too dangerous for -mavx512* on the branches; I went
through all the Yr uses on the trunk, but not on the branches.
Would you be ok with using
"TARGET_SSE ? (TARGET_AVOID_4BYTE_PREFIXES ? NO_REX_SSE_REGS : SSE_REGS) : NO_REGS"
on the branches instead?
Or I guess we could use it on the trunk too; it should make no difference there
(because on the trunk it is only used when !TARGET_AVX).
Or maybe even
"TARGET_SSE ? ((TARGET_AVOID_4BYTE_PREFIXES && !TARGET_AVX) ? NO_REX_SSE_REGS : SSE_REGS) : NO_REGS"
(again, should make zero difference on the trunk, but might be better for
the branches).
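To make the difference between the two branch candidates concrete, here is a rough sketch that models both expressions as plain C functions. The enum and the int parameters are hypothetical stand-ins for the real register classes and target macros; the sketch only compares which class each expression selects, not what the RA does with it.

```c
/* Hypothetical stand-in for the register classes named in the two
   expressions above.  */
enum reg_class { NO_REGS, NO_REX_SSE_REGS, SSE_REGS };

/* First candidate: restrict whenever the tune flag is set.  */
static enum reg_class
yr_branch_first (int target_sse, int avoid_4byte_prefixes, int target_avx)
{
  (void) target_avx;	/* Not consulted by this variant.  */
  return target_sse
	 ? (avoid_4byte_prefixes ? NO_REX_SSE_REGS : SSE_REGS)
	 : NO_REGS;
}

/* Second candidate: restrict only when the tune flag is set and AVX is
   not enabled.  */
static enum reg_class
yr_branch_second (int target_sse, int avoid_4byte_prefixes, int target_avx)
{
  return target_sse
	 ? ((avoid_4byte_prefixes && !target_avx) ? NO_REX_SSE_REGS : SSE_REGS)
	 : NO_REGS;
}
```

The two functions agree for every input with target_avx == 0, which matches the remark that it makes no difference where Yr is only used when !TARGET_AVX. They differ only when both the tune flag and AVX are on: the first returns NO_REX_SSE_REGS, the second SSE_REGS.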
> > --- gcc/config/i386/i386.h.jj 2016-05-24 10:56:02.000000000 +0200
> > +++ gcc/config/i386/i386.h 2016-05-24 15:13:05.715906018 +0200
> > @@ -465,6 +465,8 @@ extern unsigned char ix86_tune_features[
> > ix86_tune_features[X86_TUNE_SLOW_PSHUFB]
> > #define TARGET_VECTOR_PARALLEL_EXECUTION \
> > ix86_tune_features[X86_TUNE_VECTOR_PARALLEL_EXECUTION]
> > +#define TARGET_AVOID_4BYTE_PREFIXES \
> > + ix86_tune_features[X86_TUNE_AVOID_4BYTE_PREFIXES]
> > #define TARGET_FUSE_CMP_AND_BRANCH_32 \
> > ix86_tune_features[X86_TUNE_FUSE_CMP_AND_BRANCH_32]
> > #define TARGET_FUSE_CMP_AND_BRANCH_64 \
> > --- gcc/config/i386/constraints.md.jj 2016-05-12 10:29:41.000000000 +0200
> > +++ gcc/config/i386/constraints.md 2016-05-24 15:14:21.647914550 +0200
> > @@ -142,7 +142,7 @@ (define_register_constraint "Yf"
> > "@internal Any x87 register when 80387 FP arithmetic is enabled.")
> >
> > (define_register_constraint "Yr"
> > - "TARGET_SSE ? (X86_TUNE_AVOID_4BYTE_PREFIXES ? NO_REX_SSE_REGS : ALL_SSE_REGS) : NO_REGS"
> > + "TARGET_SSE ? (TARGET_AVOID_4BYTE_PREFIXES ? NO_REX_SSE_REGS : ALL_SSE_REGS) : NO_REGS"
> > "@internal Lower SSE register when avoiding REX prefix and all SSE registers otherwise.")
> >
> > (define_register_constraint "Yv"
Jakub