This is the mail archive of the gcc-patches@gcc.gnu.org mailing list for the GCC project.



Re: [PATCH] Emit vperm2[if]128 $0x12/$0x20 as vinsert[if]128 $0/$1


On Mon, Nov 7, 2011 at 10:20 PM, Jakub Jelinek <jakub@redhat.com> wrote:

> I think it is at least more readable and perhaps for some CPUs could
> be faster (for SandyBridge it is the same speed) if we emit a more
> specialized insn over a more generic one.
>
> Bootstrapped/regtested on x86_64-linux and i686-linux, ok for trunk?
>
> In the attachment is my first attempt to do this, in the expander,
> unfortunately that turned out to be pessimizing - seems like IRA or
> reload has issues with the subregs and on

No, it is by design. Please see the comment in
ix86_cannot_change_mode_class for why we prohibit all nonparadoxical
subregs that change size for the SSE/MMX classes.


> 2011-11-07  Jakub Jelinek  <jakub@redhat.com>
>
> 	* config/i386/sse.md (*avx_vperm2f128<mode>3_nozero): Emit mask
> 	0x12 and 0x20 as vinsert[fi]128 instead of vperm2[fi]128.

OK.

Thanks,
Uros.
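
For readers following the thread, here is a small sketch of why the two
masks named in the ChangeLog can be emitted as an insert. It is a plain C
model of the lane selection, not the sse.md pattern from the patch, and it
does not use AVX intrinsics. The types and function names (reg256,
model_vperm2f128, model_vinsertf128, masks_match_vinsert) are hypothetical
and exist only for this illustration. The model assumes the documented
selector encoding for the non-zeroing form (bits 1:0 choose the low result
lane, bits 5:4 choose the high result lane, 0/1 = low/high lane of the
first source, 2/3 = low/high lane of the second source) and ignores how
GCC orders the operands in the emitted assembly.

```c
#include <stdint.h>
#include <string.h>

/* Hypothetical model: a 256-bit register as two 128-bit lanes,
   lane[0] being the low half. */
typedef struct { uint64_t q[2]; } lane128;
typedef struct { lane128 lane[2]; } reg256;

/* Resolve a 2-bit selector: 0 = a.low, 1 = a.high, 2 = b.low, 3 = b.high. */
static lane128 select_lane(reg256 a, reg256 b, unsigned sel)
{
    return (sel & 2) ? b.lane[sel & 1] : a.lane[sel & 1];
}

/* Model of vperm2[fi]128 with the zeroing bits (3 and 7) clear. */
static reg256 model_vperm2f128(reg256 a, reg256 b, unsigned imm)
{
    reg256 r;
    r.lane[0] = select_lane(a, b, imm & 3);        /* low result lane  */
    r.lane[1] = select_lane(a, b, (imm >> 4) & 3); /* high result lane */
    return r;
}

/* Model of vinsert[fi]128: copy a, then overwrite lane idx with x. */
static reg256 model_vinsertf128(reg256 a, lane128 x, unsigned idx)
{
    a.lane[idx & 1] = x;
    return a;
}

/* Returns 1 if, in this model, mask 0x12 matches an insert at index 0
   and mask 0x20 matches an insert at index 1; returns 0 otherwise. */
int masks_match_vinsert(void)
{
    reg256 a = {{{{1, 2}}, {{3, 4}}}};
    reg256 b = {{{{5, 6}}, {{7, 8}}}};

    reg256 p12 = model_vperm2f128(a, b, 0x12);
    reg256 i0  = model_vinsertf128(a, b.lane[0], 0);

    reg256 p20 = model_vperm2f128(a, b, 0x20);
    reg256 i1  = model_vinsertf128(a, b.lane[0], 1);

    return memcmp(&p12, &i0, sizeof p12) == 0
        && memcmp(&p20, &i1, sizeof p20) == 0;
}
```

In both cases one lane of the result comes unchanged from the first source
and the other lane is the low half of the second source, which is what an
insert produces. Calling masks_match_vinsert() from a test driver is enough
to check the model; it says nothing about instruction timing.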

