This is the mail archive of the gcc-patches@gcc.gnu.org mailing list for the GCC project.


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]
Other format: [Raw text]

Re: [PATCH] Improve vec_concatv?sf*


On Thu, May 12, 2016 at 04:39:52PM +0300, Kirill Yukhin wrote:
> > --- gcc/config/i386/sse.md.jj	2016-05-04 14:36:08.000000000 +0200
> > +++ gcc/config/i386/sse.md	2016-05-04 15:16:44.180894303 +0200
> > @@ -6415,12 +6415,12 @@ (define_insn "avx512f_vec_dup<mode>_1"
> >  ;; unpcklps with register source since it is shorter.
> >  (define_insn "*vec_concatv2sf_sse4_1"
> >    [(set (match_operand:V2SF 0 "register_operand"
> > -	  "=Yr,*x,x,Yr,*x,x,x,*y ,*y")
> > +	  "=Yr,*x,v,Yr,*x,v,v,*y ,*y")
> >  	(vec_concat:V2SF
> >  	  (match_operand:SF 1 "nonimmediate_operand"
> > -	  "  0, 0,x, 0,0, x,m, 0 , m")
> > +	  "  0, 0,v, 0,0, v,m, 0 , m")
> >  	  (match_operand:SF 2 "vector_move_operand"
> > -	  " Yr,*x,x, m,m, m,C,*ym, C")))]
> > +	  " Yr,*x,v, m,m, m,C,*ym, C")))]
> >    "TARGET_SSE4_1 && !(MEM_P (operands[1]) && MEM_P (operands[2]))"
> >    "@
> >     unpcklps\t{%2, %0|%0, %2}
> Looks like we were wrong here.
> We need to use Yv constraint for vunpcklps since this
> insn is available for AVX-512VL only.
> 
> Like this:
> diff --git a/gcc/config/i386/sse.md b/gcc/config/i386/sse.md
> index d77227a..7d71640 100644
> --- a/gcc/config/i386/sse.md
> +++ b/gcc/config/i386/sse.md
> @@ -6546,12 +6546,12 @@
>  ;; unpcklps with register source since it is shorter.
>  (define_insn "*vec_concatv2sf_sse4_1"
>    [(set (match_operand:V2SF 0 "register_operand"
> -         "=Yr,*x,v,Yr,*x,v,v,*y ,*y")
> +         "=Yr,*x,Yv,Yr,*x,v,v,*y ,*y")
>         (vec_concat:V2SF
>           (match_operand:SF 1 "nonimmediate_operand"
> -         "  0, 0,v, 0,0, v,m, 0 , m")
> +         "  0, 0,Yv, 0,0, v,m, 0 , m")
>           (match_operand:SF 2 "vector_move_operand"
> -         " Yr,*x,v, m,m, m,C,*ym, C")))]
> +         " Yr,*x,Yv, m,m, m,C,*ym, C")))]
>    "TARGET_SSE4_1 && !(MEM_P (operands[1]) && MEM_P (operands[2]))"
>    "@
>     unpcklps\t{%2, %0|%0, %2}
> 
> Will check in to main trunk after bootstrap/regtest.

I'm not sure about the Yv on the operand 0, I think without AVX512VL
HARD_REGNO_MODE_OK will disallow V2SFmode regs in XMM16+ (but, this
is MMX-ish mode, so maybe we don't allow it ever in XMM16+).
On the SFmode operands side, you're right, HARD_REGNO_MODE_OK allows
SFmode in XMM16+ even for only AVX512F.

	Jakub


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]