[PATCH] aarch64: Use RTL builtins for [su]mlal_high intrinsics

Kyrylo Tkachov Kyrylo.Tkachov@arm.com
Wed Feb 3 12:30:10 GMT 2021



> -----Original Message-----
> From: Jonathan Wright <Jonathan.Wright@arm.com>
> Sent: 03 February 2021 12:20
> To: gcc-patches@gcc.gnu.org
> Cc: Kyrylo Tkachov <Kyrylo.Tkachov@arm.com>
> Subject: [PATCH] aarch64: Use RTL builtins for [su]mlal_high intrinsics
> 
> Hi,
> 
> As subject, this patch rewrites [su]mlal_high Neon intrinsics to use RTL
> builtins rather than inline assembly code, allowing for better scheduling
> and optimization.
> 
> Regression tested and bootstrapped on aarch64-none-linux-gnu - no
> issues.
> 
> Ok for master?

Ok.
Thanks,
Kyrill

> 
> Thanks,
> Jonathan
> 
> ---
> 
> gcc/ChangeLog:
> 
> 2021-01-27  Jonathan Wright  <jonathan.wright@arm.com>
> 
> * config/aarch64/aarch64-simd-builtins.def: Add RTL builtin
> generator macros.
> * config/aarch64/aarch64-simd.md (*aarch64_<su>mlal_hi<mode>):
> Rename to...
> (aarch64_<su>mlal_hi<mode>_insn): This.
> (aarch64_<su>mlal_hi<mode>): Define.
> * config/aarch64/arm_neon.h (vmlal_high_s8): Use RTL builtin
> instead of inline asm.
> (vmlal_high_s16): Likewise.
> (vmlal_high_s32): Likewise.
> (vmlal_high_u8): Likewise.
> (vmlal_high_u16): Likewise.
> (vmlal_high_u32): Likewise.



More information about the Gcc-patches mailing list