This is the mail archive of the
gcc-patches@gcc.gnu.org
mailing list for the GCC project.
[PATCH 00/10, OpenACC] Rework handling of OpenACC kernels regions
- From: Kwok Cheung Yeung <kcy at codesourcery dot com>
- To: <gcc-patches at gcc dot gnu dot org>, Jakub Jelinek <jakub at redhat dot com>
- Cc: Thomas Schwinge <thomas at codesourcery dot com>
- Date: Wed, 17 Jul 2019 22:02:18 +0100
- Subject: [PATCH 00/10, OpenACC] Rework handling of OpenACC kernels regions
This series of patches reworks the way that OpenACC kernels regions are
processed by GCC. Instead of relying on the parloops pass for
auto-parallelisation of the kernel region, the contents of the region are
transformed into a sequence of offloaded regions, which are then processed
individually.
Tested on an x86_64 host, with offloading to a Nvidia Tesla K20c card.
Okay for trunk?
Thanks
Kwok
- Follow-Ups:
- [PATCH 01/10, OpenACC] Use "-fopenacc-kernels=parloops" to document "parloops" test cases
- [PATCH 02/10, OpenACC] Add OpenACC target kinds for decomposed kernels regions
- [PATCH 03/10, OpenACC] Separate OpenACC kernels regions in data and parallel parts
- [PATCH 04/10, OpenACC] Turn OpenACC kernels regions into a sequence of, parallel regions
- [PATCH 05/10, OpenACC] Handle conditional execution of loops in OpenACC, kernels regions
- [PATCH 06/10, OpenACC] Adjust parallelism of loops in gang-single parts of OpenACC kernels regions
- [PATCH 07/10, OpenACC] Launch kernels asynchronously in OpenACC kernels regions
- [PATCH 08/10, OpenACC] New OpenACC kernels region decompose algorithm
- [PATCH 09/10, OpenACC] Avoid introducing 'create' mapping clauses for loop index variables in kernels regions
- [PATCH 10/10, OpenACC] Make new OpenACC kernels conversion the default; adjust and add tests
- Re: [PATCH 00/10, OpenACC] Rework handling of OpenACC kernels regions