[Bug tree-optimization/95747] New: [OpenMP/Builtin] nontemporal store support
jed at 59A2 dot org
gcc-bugzilla@gcc.gnu.org
Thu Jun 18 17:30:31 GMT 2020
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=95747
Bug ID: 95747
Summary: [OpenMP/Builtin] nontemporal store support
Product: gcc
Version: 10.1.0
Status: UNCONFIRMED
Severity: normal
Priority: P3
Component: tree-optimization
Assignee: unassigned at gcc dot gnu.org
Reporter: jed at 59A2 dot org
Target Milestone: ---
My understanding is that GCC currently has no analog of Clang's
__builtin_nontemporal_store
(https://clang.llvm.org/docs/LanguageExtensions.html#non-temporal-load-store-builtins)
and I don't see a ticket to add support for OpenMP-5's pragma omp simd
nontemporal. This leaves developers stuck with needing intrinsics to generate
vmovntpd and related instructions. Proper use of nontemporal stores can make
circa 50% performance improvement in bandwidth-limited workloads.
More information about the Gcc-bugs
mailing list