[Bug target/88510] GCC generates inefficient U64x2/v2di scalar multiply for NEON32

husseydevin at gmail dot com gcc-bugzilla@gcc.gnu.org
Thu Jan 3 17:39:00 GMT 2019


https://gcc.gnu.org/bugzilla/show_bug.cgi?id=88510

--- Comment #2 from Devin Hussey <husseydevin at gmail dot com> ---
Update: I did the calculations, and twomul has the same cycle count as
goodmul_sse. vmul.i32 with 128-bit operands takes 4 cycles (I assumed it was
two), so just like goodmul_sse, it takes 11 cycles.


More information about the Gcc-bugs mailing list