[Bug tree-optimization/95747] New: [OpenMP/Builtin] nontemporal store support

jed at 59A2 dot org gcc-bugzilla@gcc.gnu.org
Thu Jun 18 17:30:31 GMT 2020


https://gcc.gnu.org/bugzilla/show_bug.cgi?id=95747

            Bug ID: 95747
           Summary: [OpenMP/Builtin] nontemporal store support
           Product: gcc
           Version: 10.1.0
            Status: UNCONFIRMED
          Severity: normal
          Priority: P3
         Component: tree-optimization
          Assignee: unassigned at gcc dot gnu.org
          Reporter: jed at 59A2 dot org
  Target Milestone: ---

My understanding is that GCC currently has no analog of Clang's
__builtin_nontemporal_store
(https://clang.llvm.org/docs/LanguageExtensions.html#non-temporal-load-store-builtins)
and I don't see a ticket to add support for OpenMP-5's pragma omp simd
nontemporal. This leaves developers stuck with needing intrinsics to generate
vmovntpd and related instructions. Proper use of nontemporal stores can make
circa 50% performance improvement in bandwidth-limited workloads.


More information about the Gcc-bugs mailing list