[Bug tree-optimization/91201] [7/8/9/10 Regression] SIMD not generated for horizontal sum of bytes in array

glisse at gcc dot gnu.org gcc-bugzilla@gcc.gnu.org
Tue Jul 30 13:45:00 GMT 2019


https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91201

--- Comment #10 from Marc Glisse <glisse at gcc dot gnu.org> ---
For AVX512, I wonder if we could use vpsadbw to compute the sums for each
64-bit part, then vcompressb to collect them in the lower 64 bits, then vpsadbw
to conclude. Or whatever other faster variant (is Peter Cordes around?). But
that's not required for this patch.


More information about the Gcc-bugs mailing list