https://gcc.gnu.org/bugzilla/show_bug.cgi?id=79151 --- Comment #3 from Richard Biener <rguenth at gcc dot gnu.org> --- The question is of course whether vector division has comparable latency / throughput as the scalar one.