[Bug target/63678] __mm256_blend_epi16 only accepts 8-bit masks (should accept 16-bit)
peter.bumbulis at ianywhere dot com
gcc-bugzilla@gcc.gnu.org
Wed Oct 29 18:37:00 GMT 2014
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=63678
--- Comment #4 from Peter Bumbulis <peter.bumbulis at ianywhere dot com> ---
(In reply to Peter Bumbulis from comment #2)
> The referenced web page is incorrect. Look in the instruction set reference
> manual (https://software.intel.com/sites/default/files/managed/c6/a9/319433-020.pdf,
> search for VPBLENDMW) or the intrinsics guide
> (https://software.intel.com/sites/landingpage/IntrinsicsGuide/).
>
> These instructions blend 16-bit quantities: you can fit 16 of these in a
> 256-bit register. For AVX512 it's a 32-bit constant.
My mistake: it looks like the generated code only uses the low 8 bits of the
mask. Sorry for any wasted bandwidth.
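
For anyone landing here later, a minimal scalar sketch of why an 8-bit mask is
enough, as I understand the documented behaviour of the 256-bit VPBLENDW form
behind _mm256_blend_epi16: the same 8-bit immediate is applied to each 128-bit
lane, so element j is selected by mask bit (j mod 8). The function name
blend_epi16_model is hypothetical and exists only for illustration; it is not
part of GCC or of any intrinsics header.

```c
#include <stdint.h>

/* Hypothetical scalar model (not a real intrinsic) of the selection rule
   used by the 256-bit blend of 16-bit elements.
   a, b and dst each hold sixteen 16-bit elements (256 bits in total). */
void blend_epi16_model(const uint16_t a[16], const uint16_t b[16],
                       uint8_t imm8, uint16_t dst[16])
{
    for (int j = 0; j < 16; j++) {
        /* Elements 0..7 and 8..15 both index the same 8 mask bits,
           i.e. the immediate is reused for the upper 128-bit lane. */
        int bit = (imm8 >> (j % 8)) & 1;
        dst[j] = bit ? b[j] : a[j];
    }
}
```

With a mask of 0x05, this model takes elements 0, 2, 8 and 10 from b and all
other elements from a, so the two lanes cannot be given different selection
patterns through the immediate.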