[PATCH 1/4][ARM Intrinsics]float16x4_t intrinsics: vget_lane, vset_lane, vcreate, vdup_n, vdup_lane, vld1_lane, vld1_dup, vreinterpret

Alan Lawrence alan.lawrence@arm.com
Fri Jan 16 17:36:00 GMT 2015


This adds a number of new float16x4_t intrinsics, implemented with GCC vector
extensions to maximise mid-end optimization (the same approach as AArch64).
Note that, unlike AArch64, no attempt is made to support big-endian.
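
For readers unfamiliar with the approach, the following is only a rough
sketch of what "implemented with GCC vector extensions" can look like; it is
not the contents of the attached patch. All names (sketch_h16x4_t,
sketch_get_lane, and so on) are hypothetical, and uint16_t lanes stand in for
__fp16 so that the example builds on targets without half-precision support.
Lane access is plain vector subscripting, which the mid-end can see through,
rather than an opaque builtin call.

```c
#include <stdint.h>
#include <string.h>

/* Hypothetical stand-in for float16x4_t: four 16-bit lanes in 64 bits.
   uint16_t is an assumption made for portability; the real type uses __fp16.  */
typedef uint16_t sketch_h16x4_t __attribute__ ((vector_size (8)));
typedef uint8_t sketch_u8x8_t __attribute__ ((vector_size (8)));

/* Analogue of vget_lane: read one lane by subscripting the vector.  */
static inline uint16_t
sketch_get_lane (sketch_h16x4_t v, int lane)
{
  return v[lane];
}

/* Analogue of vset_lane: write one lane of a copy and return the copy.  */
static inline sketch_h16x4_t
sketch_set_lane (uint16_t x, sketch_h16x4_t v, int lane)
{
  v[lane] = x;
  return v;
}

/* Analogue of vdup_n: broadcast a scalar to every lane.  */
static inline sketch_h16x4_t
sketch_dup_n (uint16_t x)
{
  return (sketch_h16x4_t) { x, x, x, x };
}

/* Analogue of vld1_dup: load one element and broadcast it.  */
static inline sketch_h16x4_t
sketch_ld1_dup (const uint16_t *p)
{
  return sketch_dup_n (*p);
}

/* Analogue of vcreate: reuse the 64 bits of an integer as the vector.
   Which lane receives which bits depends on the target's byte order.  */
static inline sketch_h16x4_t
sketch_create (uint64_t bits)
{
  sketch_h16x4_t v;
  memcpy (&v, &bits, sizeof v);
  return v;
}

/* Analogue of vreinterpret: a cast between same-sized vector types keeps
   the bits and changes only the element type.  */
static inline sketch_u8x8_t
sketch_reinterpret_u8 (sketch_h16x4_t v)
{
  return (sketch_u8x8_t) v;
}
```

Because each helper is ordinary C on a vector type, a call such as
sketch_get_lane (sketch_dup_n (x), 2) is something the compiler can in
principle fold back to x, which is the kind of optimization the vector
extension approach is meant to enable.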

gcc/ChangeLog:

     * config/arm/arm_neon.h (vcreate_f16, vdup_lane_f16, vld1_lane_f16,
     vld1_dup_f16, vreinterpret_p8_f16, vreinterpret_p16_f16,
     vreinterpret_f16_p8, vreinterpret_f16_p16, vreinterpret_f16_f32,
     vreinterpret_f16_p64, vreinterpret_f16_s64, vreinterpret_f16_u64,
     vreinterpret_f16_s8, vreinterpret_f16_s16, vreinterpret_f16_s32,
     vreinterpret_f16_u8, vreinterpret_f16_u16, vreinterpret_f16_u32,
     vreinterpret_f32_f16, vreinterpret_p64_f16, vreinterpret_s64_f16,
     vreinterpret_u64_f16, vreinterpret_s8_f16, vreinterpret_s16_f16,
     vreinterpret_s32_f16, vreinterpret_u8_f16, vreinterpret_u16_f16,
     vreinterpret_u32_f16): New.

-------------- next part --------------
A non-text attachment was scrubbed...
Name: f16x4_vec_ext.patch
Type: text/x-patch
Size: 11211 bytes
Desc: not available
URL: <http://gcc.gnu.org/pipermail/gcc-patches/attachments/20150116/b7a78f28/attachment.bin>

