This is the mail archive of the
gcc-patches@gcc.gnu.org
mailing list for the GCC project.
Re: ACLE intrinsics: BFloat16 load intrinsics for AArch32
- From: Delia Burduv <Delia dot Burduv at arm dot com>
- To: "gcc-patches at gcc dot gnu dot org" <gcc-patches at gcc dot gnu dot org>
- Cc: "nickc at redhat dot com" <nickc at redhat dot com>, Richard Earnshaw <Richard dot Earnshaw at arm dot com>, Kyrylo Tkachov <Kyrylo dot Tkachov at arm dot com>, Ramana Radhakrishnan <Ramana dot Radhakrishnan at arm dot com>
- Date: Wed, 22 Jan 2020 17:31:31 +0000
- Subject: Re: ACLE intrinsics: BFloat16 load intrinsics for AArch32
- Arc-authentication-results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=arm.com; dmarc=pass action=none header.from=arm.com; dkim=pass header.d=arm.com; arc=none
- Arc-message-signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=2FjgJiDfJ9asavUTHHXWdrmO6UpuAg9TS8og1Vb0Oac=; b=DKAh3fFGZsfv2408X+7hkJYBYxdZdPPZpnDFzNomxyZ0x8wUxsYMDVz0bQne2tJwvUyT3fjOyp/gRDYXpDrLAJC3n986O+9rllnUiARakIfNlUxc449oX0jv8Ln4iqadHzfHYjGGaRY1ekh10IMja42yjT823UvboIfz6oAuUv7AdFo6jr5M73yHWbyj7Hc5IpxYYI9lo8OSQ9Oc2Fn5uFckLchJoo5s1gSlfIiWGVeQ7OcOPHNZpalKNNjDElWK3PGLBHXBuxhcbydWjJxXf3ZyWLlVUGgkSdK3LN6ERIYdsiB0GRYSM7WJYBJAUbk7SCJWgCuweqb/R0lucLXcwA==
- Arc-seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=YOaZ62U6/+bjo4jLFSBwC8ezB0WEL2qo3gDRebkt30N2/g5t33Y1PV+3pidklSybRVnj9rnliuss6X2tfuVzFy1gOZLOvOA13N9irg+Z9pw+sjKyW8NVkXJmqi/ld5+mxYTkfKHLpR0nuraMNl8pjP/+mBbXJ1JPONU0VSLnbNfWhkA6D8BkNoN6A02v+TgFUWc08sSneCxK0LSAofrnhJDI5GZy6dOdJKsG+BP4gdflXH0ZBy3Bjz2XtNfH5R7U+fyCPVHNHchNFYnFppdQtv4osJnvtUPsTqxRWq15oqkJImPf+5705NaFQURacqJssR/lGvGQglL840RHnj+x1Q==
- Original-authentication-results: spf=none (sender IP is ) smtp.mailfrom=Delia dot Burduv at arm dot com;
- References: <03fd9393-a25d-c1fb-535b-c4f39ea7decb@arm.com>
Ping.
I will change the tests to use the exact input and output registers as
Richard Sandiford suggested for the AArch64 patches.
On 12/20/19 6:48 PM, Delia Burduv wrote:
> This patch adds the ARMv8.6 ACLE BFloat16 load intrinsics vld<n>{q}_bf16
> as part of the BFloat16 extension.
> (https://developer.arm.com/architectures/instruction-sets/simd-isas/neon/intrinsics)
>
> The intrinsics are declared in arm_neon.h .
> A new test is added to check assembler output.
>
> This patch depends on the Arm back-end patche.
> (https://gcc.gnu.org/ml/gcc-patches/2019-12/msg01448.html)
>
> Tested for regression on arm-none-eabi and armeb-none-eabi. I don't have
> commit rights, so if this is ok can someone please commit it for me?
>
> gcc/ChangeLog:
>
> 2019-11-14 Delia Burduv <delia.burduv@arm.com>
>
> * config/arm/arm_neon.h (bfloat16_t): New typedef.
> (bfloat16x4x2_t): New typedef.
> (bfloat16x8x2_t): New typedef.
> (bfloat16x4x3_t): New typedef.
> (bfloat16x8x3_t): New typedef.
> (bfloat16x4x4_t): New typedef.
> (bfloat16x8x4_t): New typedef.
> (vld2_bf16): New.
> (vld2q_bf16): New.
> (vld3_bf16): New.
> (vld3q_bf16): New.
> (vld4_bf16): New.
> (vld4q_bf16): New.
> (vld2_dup_bf16): New.
> (vld2q_dup_bf16): New.
> (vld3_dup_bf16): New.
> (vld3q_dup_bf16): New.
> (vld4_dup_bf16): New.
> (vld4q_dup_bf16): New.
> * config/arm/arm-builtins.c (E_V2BFmode): New mode.
> (VAR13): New.
> (arm_simd_types[Bfloat16x2_t]):New type.
> * config/arm/arm-modes.def (V2BF): New mode.
> * config/arm/arm-simd-builtin-types.def
> (Bfloat16x2_t): New entry.
> * config/arm/arm_neon_builtins.def
> (vld2): Changed to VAR13 and added v4bf, v8bf
> (vld2_dup): Changed to VAR8 and added v4bf, v8bf
> (vld3): Changed to VAR13 and added v4bf, v8bf
> (vld3_dup): Changed to VAR8 and added v4bf, v8bf
> (vld4): Changed to VAR13 and added v4bf, v8bf
> (vld4_dup): Changed to VAR8 and added v4bf, v8bf
> * config/arm/iterators.md (VDXBF): New iterator.
> (VQ2BF): New iterator.
> (V_elem): Added V4BF, V8BF.
> (V_sz_elem): Added V4BF, V8BF.
> (V_mode_nunits): Added V4BF, V8BF.
> (q): Added V4BF, V8BF.
> *config/arm/neon.md (vld2): Used new iterators.
> (vld2_dup<mode>): Used new iterators.
> (vld2_dupv8bf): New.
> (vst3): Used new iterators.
> (vst3qa): Used new iterators.
> (vst3qb): Used new iterators.
> (vld3_dup<mode>): Used new iterators.
> (vld3_dupv8bf): New.
> (vst4): Used new iterators.
> (vst4qa): Used new iterators.
> (vst4qb): Used new iterators.
> (vld4_dup<mode>): Used new iterators.
> (vld4_dupv8bf): New.
>
>
> gcc/testsuite/ChangeLog:
>
> 2019-11-14 Delia Burduv <delia.burduv@arm.com>
>
> * gcc.target/arm/simd/bf16_vldn_1.c: New test.