This is the mail archive of the gcc-patches@gcc.gnu.org mailing list for the GCC project.



Re: [gomp] Move openacc vector & worker single handling to RTL


On 07/07/15 10:22, Jakub Jelinek wrote:
On Tue, Jul 07, 2015 at 10:12:56AM -0400, Nathan Sidwell wrote:

Wouldn't function attributes be better for that case, and just use the internal
functions for the case when the mode is being changed in the middle of
function?

It may be. I've been thinking about how the top-level offloaded function (kernel) should be marked to specify gang/worker/vector dimensions, to allow a less device-specific launch mechanism. I suspect that kernels and routines will end up with similar solutions.
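For context, here is a minimal sketch of the source-level OpenACC constructs that carry this information: the launch dimensions on an offloaded region and the parallelism level on a routine. It only shows the user-visible directives, not how the compiler should mark the outlined functions internally, which is the open question here. The function names and the clause values (4, 8, 32) are made up for illustration.

```c
/* Routine callable from offloaded code.  The "vector" clause declares
   the level of parallelism the routine may use internally.  */
#pragma acc routine vector
void scale_row (float *row, int n, float k)
{
#pragma acc loop vector
  for (int i = 0; i < n; i++)
    row[i] *= k;
}

/* Offloaded region.  The gang/worker/vector dimensions are requested by
   clauses on the parallel construct (values are illustrative only).  */
void scale_matrix (float *m, int rows, int cols, float k)
{
#pragma acc parallel num_gangs(4) num_workers(8) vector_length(32) copy(m[0:rows*cols])
  {
#pragma acc loop gang worker
    for (int r = 0; r < rows; r++)
      scale_row (m + r * cols, cols, k);
  }
}
```

Built without OpenACC support the pragmas are ignored and the code runs serially, giving the same result.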

I agree that fork/join might be less confusing.

BTW, where do you plan to lower the internal functions for non-PTX?
Doing it in RTL mach reorg is too late for those; we shouldn't be writing it
for every single target, as for non-PTX (perhaps non-HSA) I bet the behavior
is the same.

I suspect other devices can add a new device-specific lowering pass somewhere soon after the LTO readback. I think we're going to need that pass for some other pieces of PTX.

FWIW on a device that has a PTX-like architecture, I think this specific piece should be done as late as possible. Perhaps pieces of the PTX mach-dep-reorg can be abstracted for general use?

nathan

