This is the mail archive of the gcc@gcc.gnu.org mailing list for the GCC project.
Re: Offloading GSOC 2015
- From: Kirill Yukhin <kirill dot yukhin at gmail dot com>
- To: guray ozen <guray dot ozen at gmail dot com>
- Cc: Thomas Schwinge <thomas at codesourcery dot com>, tobias dot burnus at physik dot fu-berlin dot de, gcc at gcc dot gnu dot org, Jakub Jelinek <jakub at redhat dot com>, Ilya Verbin <iverbin at gmail dot com>
- Date: Fri, 20 Mar 2015 18:47:47 +0400
- Subject: Re: Offloading GSOC 2015
- Authentication-results: sourceware.org; auth=none
- References: <CA+ga0G7z+xsO8LB8oc0yv9VHFPpryaH1T2rHOudky-it3Wnu3Q at mail dot gmail dot com> <87wq2n66gj dot fsf at kepler dot schwinge dot homeip dot net> <CA+ga0G6Y60g5rhOoj310TPp4EZgUpvmv7AT1Y1UHWkKfTp-ZOQ at mail dot gmail dot com> <CA+ga0G5WuGOrAUQ_Sq_LBt6=C6uEfFt4+3c7eDwu8D5ruhMRQg at mail dot gmail dot com>
Hello Güray,
On 20 Mar 12:14, guray ozen wrote:
> I've started to prepare my gsoc proposal for gcc's openmp for gpus.
I think there is a wide range for exploration here. As you know, OpenMP 4
contains vectorization pragmas (`#pragma omp simd') which are not a perfect
fit for GPGPUs.
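For readers unfamiliar with the construct, here is a minimal sketch of what `#pragma omp simd' looks like on a host loop. The function name saxpy_simd is made up for illustration and is not taken from GCC or any test suite. The pragma asks the compiler to vectorize the loop across SIMD lanes, which is a CPU-oriented model and does not translate directly into GPU threads.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical example: y[i] = a * x[i] + y[i].
 * The caller must guarantee that x and y do not overlap. */
void saxpy_simd(size_t n, float a, const float *x, float *y)
{
    assert(n == 0 || (x != NULL && y != NULL));

    /* Request vectorization of the loop below. The pragma is simply
     * ignored when OpenMP support is not enabled at compile time. */
    #pragma omp simd
    for (size_t i = 0; i < n; ++i)
        y[i] = a * x[i] + y[i];
}
```

With GCC the pragma takes effect when building with -fopenmp or -fopenmp-simd; without either flag the loop still compiles and runs as ordinary scalar code.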
Another problem is how to create threads dynamically on a GPGPU. As far as
we understand, there are two possible solutions:
1. Use the dynamic parallelism available in recent APIs (launch a new kernel
from the target)
2. Estimate the maximum number of threads on the host and start them all from
the host, leaving unused threads busy-waiting
Both approaches have been investigated; see [1] and [2].
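As a rough illustration of the second option, the sketch below launches a host-estimated upper bound of threads in one offloaded region and lets the surplus ones do nothing. All names (launch_all, max_threads, active) are hypothetical, and the "work" is a stand-in. It is also a simplification: in a real implementation the surplus threads would spin until work is handed to them, whereas here they only record that they were idle.

```c
#include <assert.h>

/* Hypothetical sketch: start max_threads logical threads at once,
 * of which only the first `active` have real work to do.
 * out[tid] receives tid*tid for working threads and -1 for idle ones. */
void launch_all(int max_threads, int active, int *out)
{
    assert(0 <= active && active <= max_threads);

    /* One offloaded region covering the whole upper bound. Without
     * OpenMP enabled, or without an offload device, this runs on the host. */
    #pragma omp target teams distribute parallel for \
            map(from: out[0:max_threads])
    for (int tid = 0; tid < max_threads; ++tid) {
        if (tid < active)
            out[tid] = tid * tid;  /* stand-in for real work */
        else
            out[tid] = -1;         /* surplus thread: nothing to do */
    }
}
```

The trade-off compared with the first option is that no kernel launch from the target is needed, at the cost of occupying device resources with threads that may never receive work.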
> However i'm little bit confused about which ideas, i mentioned last my
> mail, should i propose or which one of them is interesting for gcc.
> I'm willing to work on data clauses to enhance performance of shared
> memory. Or maybe it might be interesting to work on OpenMP 4.1 draft
> version. How do you think i should propose idea?
We're going to work on OpenMP 4.1 offloading features.
[1] - http://openmp.org/sc14/Booth-Sam-IBM.pdf
[2] - http://dl.acm.org/citation.cfm?id=2688364
--
Thanks, K