[Bug target/104122] On Zen3, 510.parest_r (built with -Ofast) is faster with generic than with native ISA
jamborm at gcc dot gnu.org
gcc-bugzilla@gcc.gnu.org
Thu Jan 20 09:40:55 GMT 2022
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104122
--- Comment #3 from Martin Jambor <jamborm at gcc dot gnu.org> ---
(In reply to Richard Biener from comment #2)
> It's ISA, not tuning.
You are of course correct, unfortunately I am too accustomed to
using the wrong term.
> I suppose -march=native -mtune=generic is still bad?
I don't know, I'd have to manually check.
> I wonder if you tried the obvious -mprefer-avx128?
I hope that is equivalent to -mprefer-vector-width=128
If it is, -march=native -mtune=native -mprefer-vector-width=128 is
even quite a bit slower than -march=native -mtune=native.
More information about the Gcc-bugs
mailing list