[Bug target/99228] blend/shuffle

g.peterhoff@t-online.de gcc-bugzilla@gcc.gnu.org
Tue Mar 2 11:43:03 GMT 2021


https://gcc.gnu.org/bugzilla/show_bug.cgi?id=99228

--- Comment #5 from g.peterhoff@t-online.de ---
Here is a better test case. https://godbolt.org/z/3Gq783
I've found:

sgn_complex
- always inefficient code, TYPE and SIZE do not matter, even with -Ofast or
-fast-math

for TYPE=double
SIZE=1
- abs/mul/div/pow2_complex ok
- zero_complex not vectorized, also with -Ofast or -ffast-math

SIZE=2
- abs/mul/div/pow2/zero_complex only with scalar operations, never vectorized

SIZE=4 and larger
- abs/mul/div/pow2/zero_complex ok


for TYPE=float
SIZE=1
- abs/mul/pow2_complex ok
- div/zero_complex not vectorized, also with -Ofast or -ffast-math

SIZE=2
- abs/mul/div/pow2/zero_complex only with scalar operations, never vectorized

SIZE=4
- abs/pow2/zero_complex ok
- mul_complex inefficient, xmm instead of ymm, also with -Ofast or -ffast-math
- div_complex ok with O3, but with Ofast/fast-math only xmm instead of ymm

SIZE=8 and larger
- abs/mul/div/pow2_complex ok


More information about the Gcc-bugs mailing list