This is the mail archive of the
gcc-bugs@gcc.gnu.org
mailing list for the GCC project.
[Bug c++/80859] Performance Problems with OpenMP 4.5 support
- From: "thorstenkurth at me dot com" <gcc-bugzilla at gcc dot gnu dot org>
- To: gcc-bugs at gcc dot gnu dot org
- Date: Wed, 24 May 2017 16:29:02 +0000
- Subject: [Bug c++/80859] Performance Problems with OpenMP 4.5 support
- Auto-submitted: auto-generated
- References: <bug-80859-4@http.gcc.gnu.org/bugzilla/>
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80859
--- Comment #13 from Thorsten Kurth <thorstenkurth at me dot com> ---
Hello Jakub,
the compiler options are just -fopenmp. I am sure it does not have to do
anything with vectorization as I compare the code runtime with and without the
target directives and thus vectorization should be the same between them. The
remaining OpenMP sections are the same. In our work we have not seen 10x
because of insufficient vectorization, it is usually because of cache locality
but that is the same for OMP 4.5 and OMP 3 because the loops are not touched.
I do not specify an ISA choice, but I will try specifying KNL now and will tell
you what the compiler is going to do.
Best
Thorsten