[PATCH] rs6000: Add Power10 optimization for most _mm_movemask*

Segher Boessenkool segher@kernel.crashing.org
Fri Nov 19 18:15:13 GMT 2021


On Thu, Oct 21, 2021 at 12:22:12PM -0500, Paul A. Clarke wrote:
> Power10 ISA added `vextract*` instructions which are realized in the
> `vec_extractm` intrinsic.
> 
> Use `vec_extractm` for `_mm_movemask_ps`, `_mm_movemask_pd`, and
> `_mm_movemask_epi8` compatibility intrinsics, when `_ARCH_PWR10`.
> 
> 2021-10-21  Paul A. Clarke  <pc@us.ibm.com>
> 
> gcc
> 	* config/rs6000/xmmintrin.h (_mm_movemask_ps): Use vec_extractm
> 	when _ARCH_PWR10.
> 	* config/rs6000/emmintrin.h (_mm_movemask_pd): Likewise.
> 	(_mm_movemask_epi8): Likewise.

Okay for trunk.  Thanks!
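
For readers unfamiliar with the intrinsic, the change to `_mm_movemask_ps` described in the quoted patch can be sketched roughly as below. This is an illustration under stated assumptions, not the text of the patch: `movemask_ps_sketch` is a hypothetical name, the Power10 path is assumed to apply only on little-endian targets, and the scalar loop merely stands in for whatever pre-Power10 code the headers already contain. Only `vec_extractm` and `_ARCH_PWR10` come from the patch description.

```c
#include <stdint.h>
#include <string.h>

#if defined(_ARCH_PWR10) && defined(__LITTLE_ENDIAN__)
#include <altivec.h>
#endif

/* Hypothetical helper (not from the patch): gather the sign bit of each
   of four floats into the low four bits of the result, lane 0 in bit 0,
   which is the behavior x86 documents for _mm_movemask_ps.  */
static inline int movemask_ps_sketch(const float v[4])
{
#if defined(_ARCH_PWR10) && defined(__LITTLE_ENDIAN__)
    /* Power10 path: vec_extractm collects the most significant bit of
       every element.  Assumes little-endian element order.  */
    __vector float vf;
    memcpy(&vf, v, sizeof vf);
    return (int) vec_extractm((__vector unsigned int) vf);
#else
    /* Portable stand-in for the pre-Power10 path: read each sign bit.  */
    int mask = 0;
    for (int i = 0; i < 4; i++) {
        uint32_t bits;
        memcpy(&bits, &v[i], sizeof bits);   /* avoid aliasing issues */
        mask |= (int) (bits >> 31) << i;
    }
    return mask;
#endif
}
```

The same pattern would presumably carry over to `_mm_movemask_pd` and `_mm_movemask_epi8` with two doubleword and sixteen byte elements respectively.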


Segher

