[PATCH 8/20] aarch64: Use RTL builtins for v[q]tbl intrinsics

Jonathan Wright Jonathan.Wright@arm.com
Wed Apr 28 14:22:19 GMT 2021


Hi,

As subject, this patch rewrites the v[q]tbl Neon intrinsics to use RTL
builtins rather than inline assembly code, allowing for better scheduling
and optimization.

Regression tested and bootstrapped on aarch64-none-linux-gnu - no
issues.

Ok for master?

Thanks,
Jonathan

---

gcc/ChangeLog:

2021-02-12  Jonathan Wright  <jonathan.wright@arm.com>

	* config/aarch64/aarch64-simd-builtins.def: Add tbl1 builtin
	generator macros.
	* config/aarch64/arm_neon.h (vqtbl1_p8): Use RTL builtin
	instead of inline asm.
	(vqtbl1_s8): Likewise.
	(vqtbl1_u8): Likewise.
	(vqtbl1q_p8): Likewise.
	(vqtbl1q_s8): Likewise.
	(vqtbl1q_u8): Likewise.
	(vtbl1_s8): Likewise.
	(vtbl1_u8): Likewise.
	(vtbl1_p8): Likewise.
	(vtbl2_s8): Likewise.
	(vtbl2_u8): Likewise.
	(vtbl2_p8): Likewise.
-------------- next part --------------
A non-text attachment was scrubbed...
Name: rb14154.patch
Type: application/octet-stream
Size: 7101 bytes
Desc: rb14154.patch
URL: <https://gcc.gnu.org/pipermail/gcc-patches/attachments/20210428/e30c53d6/attachment.obj>


More information about the Gcc-patches mailing list