ACLE intrinsics: BFloat16 store (vst<n>{q}_bf16) intrinsics for AArch32

Delia Burduv Delia.Burduv@arm.com
Tue Jan 28 17:12:00 GMT 2020


Ping.
________________________________
From: Delia Burduv <delia.burduv@arm.com>
Sent: 22 January 2020 17:29
To: gcc-patches@gcc.gnu.org <gcc-patches@gcc.gnu.org>
Cc: nickc@redhat.com <nickc@redhat.com>; Richard Earnshaw <Richard.Earnshaw@arm.com>; Kyrylo Tkachov <Kyrylo.Tkachov@arm.com>; Ramana Radhakrishnan <Ramana.Radhakrishnan@arm.com>
Subject: Re: ACLE intrinsics: BFloat16 store (vst<n>{q}_bf16) intrinsics for AArch32

Ping.

I will change the tests to use the exact input and output registers as
Richard Sandiford suggested for the AArch64 patches.

On 12/20/19 6:46 PM, Delia Burduv wrote:
> This patch adds the ARMv8.6 ACLE BFloat16 store intrinsics
> vst<n>{q}_bf16 as part of the BFloat16 extension.
> (https://developer.arm.com/architectures/instruction-sets/simd-isas/neon/intrinsics)
>
> The intrinsics are declared in arm_neon.h .
> A new test is added to check assembler output.
>
> This patch depends on the Arm back-end patche.
> (https://gcc.gnu.org/ml/gcc-patches/2019-12/msg01448.html)
>
> Tested for regression on arm-none-eabi and armeb-none-eabi. I don't have
> commit rights, so if this is ok can someone please commit it for me?
>
> gcc/ChangeLog:
>
> 2019-11-14  Delia Burduv  <delia.burduv@arm.com>
>
>      * config/arm/arm_neon.h (bfloat16_t): New typedef.
>          (bfloat16x4x2_t): New typedef.
>          (bfloat16x8x2_t): New typedef.
>          (bfloat16x4x3_t): New typedef.
>          (bfloat16x8x3_t): New typedef.
>          (bfloat16x4x4_t): New typedef.
>          (bfloat16x8x4_t): New typedef.
>          (vst2_bf16): New.
>      (vst2q_bf16): New.
>      (vst3_bf16): New.
>      (vst3q_bf16): New.
>      (vst4_bf16): New.
>      (vst4q_bf16): New.
>          * config/arm/arm-builtins.c (E_V2BFmode): New mode.
>          (VAR13): New.
>          (arm_simd_types[Bfloat16x2_t]):New type.
>          * config/arm/arm-modes.def (V2BF): New mode.
>          * config/arm/arm-simd-builtin-types.def
>          (Bfloat16x2_t): New entry.
>          * config/arm/arm_neon_builtins.def
>          (vst2): Changed to VAR13 and added v4bf, v8bf
>          (vst3): Changed to VAR13 and added v4bf, v8bf
>          (vst4): Changed to VAR13 and added v4bf, v8bf
>          * config/arm/iterators.md (VDXBF): New iterator.
>          (VQ2BF): New iterator.
>          (V_elem): Added V4BF, V8BF.
>          (V_sz_elem): Added V4BF, V8BF.
>          (V_mode_nunits): Added V4BF, V8BF.
>          (q): Added V4BF, V8BF.
>          *config/arm/neon.md (vst2): Used new iterators.
>          (vst3): Used new iterators.
>          (vst3qa): Used new iterators.
>          (vst3qb): Used new iterators.
>          (vst4): Used new iterators.
>          (vst4qa): Used new iterators.
>          (vst4qb): Used new iterators.
>
>
> gcc/testsuite/ChangeLog:
>
> 2019-11-14  Delia Burduv  <delia.burduv@arm.com>
>
>      * gcc.target/arm/simd/bf16_vstn_1.c: New test.



More information about the Gcc-patches mailing list