[PATCH] aarch64: Use RTL builtins for [su]mlal_high_n intrinsics
Kyrylo Tkachov
Kyrylo.Tkachov@arm.com
Wed Feb 3 12:31:45 GMT 2021
> -----Original Message-----
> From: Jonathan Wright <Jonathan.Wright@arm.com>
> Sent: 03 February 2021 12:29
> To: gcc-patches@gcc.gnu.org
> Cc: Kyrylo Tkachov <Kyrylo.Tkachov@arm.com>
> Subject: [PATCH] aarch64: Use RTL builtins for [su]mlal_high_n intrinsics
>
> Hi,
>
> As subject, this patch rewrites [su]mlal_high_n Neon intrinsics to use RTL
> builtins rather than inline assembly code, allowing for better scheduling and
> optimization.
>
> Regression tested and bootstrapped on aarch64-none-linux-gnu and
> aarch64_be-none-elf - no issues.
>
> Ok for master?
Ok.
Thanks,
Kyrill
>
> Thanks,
> Jonathan
>
> --
>
> gcc/ChangeLog:
>
> 2021-01-27 Jonathan Wright <jonathan.wright@arm.com>
>
> * config/aarch64/aarch64-simd-builtins.def: Add [su]mlal_hi_n
> builtin generator macros.
> * config/aarch64/aarch64-simd.md (aarch64_<su>mlal_hi_n<mode>_insn):
> Define.
> (aarch64_<su>mlal_hi_n<mode>): Define.
> * config/aarch64/arm_neon.h (vmlal_high_n_s16): Use RTL builtin
> instead of inline asm.
> (vmlal_high_n_s32): Likewise.
> (vmlal_high_n_u16): Likewise.
> (vmlal_high_n_u32): Likewise.
More information about the Gcc-patches
mailing list