[Bug target/96789] x264: sub4x4_dct() improves when vectorization is disabled

linkw at gcc dot gnu.org gcc-bugzilla@gcc.gnu.org
Thu Sep 17 05:06:03 GMT 2020


https://gcc.gnu.org/bugzilla/show_bug.cgi?id=96789

--- Comment #12 from Kewen Lin <linkw at gcc dot gnu.org> ---

> Thanks for the explanation! I'll look at it after checking 2). IIUC, the
> advantage to eliminate stores here looks able to get those things which is
> fed to stores and stores' consumers bundled, then get more things SLP-ed if
> available?

Hmm, I think I was wrong, if both the feeding chain and consuming chain of the
stores are SLP-ed, later FRE would be able to fuse them.


More information about the Gcc-bugs mailing list