This is the mail archive of the gcc-patches@gcc.gnu.org mailing list for the GCC project.

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]
Other format:	[Raw text]

Re: [PATCH,nvptx] Use CUDA driver API to select default runtime launch, geometry

From: Cesar Philippidis <cesar at codesourcery dot com>
To: Tom de Vries <tdevries at suse dot de>
Cc: "gcc-patches at gcc dot gnu dot org" <gcc-patches at gcc dot gnu dot org>, Thomas Schwinge <thomas at codesourcery dot com>
Date: Wed, 1 Aug 2018 12:11:05 -0700
Subject: Re: [PATCH,nvptx] Use CUDA driver API to select default runtime launch, geometry
References: <791625c9-911f-972d-ed4e-746dc5fe5f43@codesourcery.com> <a3f73478-259c-8bcf-1ae2-1e7710767f02@suse.de> <dde5a4cd-cf1d-c541-e0a6-396dfffe8f65@codesourcery.com> <a70c1249-07d9-1865-e177-01fd76374f7c@suse.de>

On 08/01/2018 07:12 AM, Tom de Vries wrote:

>>>> +	      gangs = grids * (blocks / warp_size);
>>>
>>> So, we launch with gangs == grids * workers ? Is that intentional?
>>
>> Yes. At least that's what I've been using in og8. Setting num_gangs =
>> grids alone caused significant slow downs.
>>
> 
> Well, what you're saying here is: increasing num_gangs increases
> performance.
> 
> You don't explain why you multiply with workers specifically.

I set it that way because I think the occupancy calculator is
determining the occupancy of a single multiprocessor unit, rather than
the entire GPU. Looking at the og8 code again, I had

   num_gangs = 2 * threads_per_sm / warp_size * dev_size

which corresponds to

   2 * grids * blocks / warp_size

Because blocks is generally smaller than threads_per_block, the driver
occupancy calculator ends up launching fewer gangs.

I don't have a firm position with this default behavior. Perhaps we
should just set

  gang = grids

That's probably an improvement over what's there now.

Cesar

Follow-Ups:
- Re: [PATCH,nvptx] Use CUDA driver API to select default runtime launch, geometry
  - From: Tom de Vries

References:
- Re: [PATCH,nvptx] Use CUDA driver API to select default runtime launch, geometry
  - From: Tom de Vries
- Re: [PATCH,nvptx] Use CUDA driver API to select default runtime launch, geometry
  - From: Cesar Philippidis
- Re: [PATCH,nvptx] Use CUDA driver API to select default runtime launch, geometry
  - From: Tom de Vries

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]