Enable BF16 support (Please ignore my former email)

Liu, Hongtao hongtao.liu@intel.com
Fri Apr 12 07:10:00 GMT 2019


Hi :
    This patch is about to enable support for bfloat16 which will be in Future Cooper Lake, Please refer to https://software.intel.com/en-us/download/intel-architecture-instruction-set-extensions-programming-reference
for more details about BF16.

There are 3 instructions for AVX512BF16: VCVTNE2PS2BF16, VCVTNEPS2BF16 and DPBF16PS instructions, which are Vector Neural Network Instructions supporting:

-	VCVTNE2PS2BF16: Convert Two Packed Single Data to One Packed BF16 Data.
-	VCVTNEPS2BF16: Convert Packed Single Data to Packed BF16 Data.
-	VDPBF16PS: Dot Product of BF16 Pairs Accumulated into Packed Single Precision.

Since only BF16 intrinsics are supported, we treat it as HI for simplicity.

Bootstrap and regression test for x86/i386 backend are ok.

Changelog:

2019-04-07        Wei Xiao<wei3.xiao@inte.com>
gcc/:
              * common/config/i386/i386-common.c (OPTION_MASK_ISA_AVX512BF16_SET,
              OPTION_MASK_ISA_AVX512BF16_UNSET,
              OPTION_MASK_ISA2_AVX512BW_UNSET ): New.
              (OPTION_MASK_ISA2_AVX512F_UNSET): Add OPTION_MASK_ISA_AVX512BF16_UNSET.
              (ix86_handle_option): Handle -mavx512bf16.
              * config.gcc: Add avx512bf16vlintrin.h and avx512bf16intrin.h
              to extra_headers.
              * config/i386/avx512bf16vlintrin.h: New.
              * config/i386/avx512bf16intrin.h: New.
              * config/i386/cpuid.h (bit_AVX512BF16): New.
              * config/i386/driver-i386.c (host_detect_local_cpu): Detect BF16.
              * config/i386/i386-builtin-types.def: Add new types.
              * config/i386/i386-builtin.def: Add new builtins.
              * config/i386/i386-c.c (ix86_target_macros_internal): Define
              __AVX512BF16__.
              * config/i386/i386.c (ix86_target_string): Add -mavx512bf16.
              (ix86_option_override_internal): Handle BF16.
              (ix86_valid_target_attribute_inner_p): Ditto.
              (fold_builtin_cpu): Ditto.
              (ix86_expand_args_builtin): Ditto.
              * config/i386/i386.h (TARGET_AVX512BF16, TARGET_AVX512BF16_P): New.
              (PTA_AVX512BF16): Ditto.
              * config/i386/i386.opt: Add -mavx512bf16.
              * config/i386/immintrin.h: Include avx512bf16intrin.h
              and avx512bf16vlintrin.h.
              * config/i386/sse.md (avx512f_cvtne2ps2bf16_<mode><mask_name>,
              avx512f_cvtneps2bf16_<mode><mask_name>,
              avx512f_dpbf16ps_<mode><mask_half_name>): New define_insn patterns.
              * config/i386/subst.md (mask_half): Add new subst.
              * doc/invoke.texi: Document -mavx512bf16.

gcc/testsuite/:
              * gcc.target/i386/avx512bf16-vcvtne2ps2bf16-1.c: New test.
              * gcc.target/i386/avx512bf16-vcvtneps2bf16-1.c: New test.
              * gcc.target/i386/avx512bf16-vdpbf16ps-1.c: New test.
              * gcc.target/i386/avx512bf16vl-vcvtne2ps2bf16-1.c: New test.
              * gcc.target/i386/avx512bf16vl-vcvtneps2bf16-1.c: New test.
              * gcc.target/i386/avx512bf16vl-vdpbf16ps-1.c: New test.
              * gcc.target/i386/sse-12.c: Add -mavx512bf16.
              * gcc.target/i386/sse-13.c: Ditto.
              * gcc.target/i386/sse-14.c: Ditto.
              * gcc.target/i386/sse-22.c: Ditto.
              * gcc.target/i386/sse-23.c: Ditto.
              * g++.dg/other/i386-2.C: Ditto.
              * gcc.target/i386/avx-1.c: Ditto.
              * gcc.target/i386/avx-2.c: Ditto.
              * g++.dg/other/i386-3.C: Add avx512bf16.

Regards
Hongtao Liu


-------------- next part --------------
A non-text attachment was scrubbed...
Name: 0001-Enable-BF16-support.patch
Type: application/octet-stream
Size: 64873 bytes
Desc: 0001-Enable-BF16-support.patch
URL: <http://gcc.gnu.org/pipermail/gcc-patches/attachments/20190412/31e66009/attachment.obj>


More information about the Gcc-patches mailing list