This is the mail archive of the gcc-patches@gcc.gnu.org mailing list for the GCC project.


Re: [AArch64] Add precision choices for the reciprocal square root approximation

From: Evandro Menezes <e dot menezes at samsung dot com>
To: James Greenhalgh <james dot greenhalgh at arm dot com>, Wilco Dijkstra <Wilco dot Dijkstra at arm dot com>
Cc: GCC Patches <gcc-patches at gcc dot gnu dot org>, Andrew Pinski <pinskia at gmail dot com>, nd <nd at arm dot com>
Date: Fri, 01 Apr 2016 09:53:56 -0500
Subject: Re: [AArch64] Add precision choices for the reciprocal square root approximation
Authentication-results: sourceware.org; auth=none
References: <56EB2BDC dot 30209 at samsung dot com> <AM3PR08MB00883C48B491A1BA92CD0783838C0 at AM3PR08MB0088 dot eurprd08 dot prod dot outlook dot com> <56EC2A91 dot 2030604 at samsung dot com> <AM3PR08MB0088D90F31B84E852FF3100C838C0 at AM3PR08MB0088 dot eurprd08 dot prod dot outlook dot com> <56EC8870 dot 1030108 at samsung dot com> <56FDA338 dot 4050108 at samsung dot com> <AM3PR08MB00889651F672A4F0157BDE17839A0 at AM3PR08MB0088 dot eurprd08 dot prod dot outlook dot com> <20160401140626 dot GA24744 at arm dot com>

On 04/01/16 09:06, James Greenhalgh wrote:

On Fri, Apr 01, 2016 at 02:47:05PM +0100, Wilco Dijkstra wrote:

Evandro Menezes wrote:

Ping^1

I haven't seen a newer version that incorporates my feedback. To recap what
I'd like to see is a more general way to select approximations based on mode.
I don't believe that looking at the inner mode works in general, and it
doesn't make sense to add internal tune flags for all possible combinations.

Agreed. I don't think that a flag for each of the cartesian product of
{rsqrt,sqrt,div} X {SF,DF,V2SF,V4SF,V2DF} is a scalable solution - that's
at least 15 flags we'll need.

As I said earlier in the discussion, this particular split (between SF and
DF mode) seems strange to me. I'd expect the V4SF vs. SF would also be
interesting, and that a distinction between vector modes and scalar
modes would be more likely to be useful.

To give an idea what I mean, it would be easiest to add a single field to the
CPU tuning structure that contains a mask for all the combinations. Then we
call a single function with the approximation kind, i.e. sqrt, rsqrt, div (x/y),
recip (1/x) and mode which uses the CPU tuning field to decide whether it
should be inlined.
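
For illustration only, the single-mask idea described above might be sketched as follows. Every name here (approx_kind, approx_mode, cpu_tune, approx_mask, use_approximation, example_tune) is hypothetical and not taken from the GCC sources or from any posted patch; the sketch assumes one bit per (kind, mode) pair, which for four kinds and five modes needs 20 bits.

```c
#include <stdbool.h>

/* Hypothetical sketch: none of these names come from GCC itself. */

/* Approximation kinds named in the thread. */
enum approx_kind { APPROX_SQRT, APPROX_RSQRT, APPROX_DIV, APPROX_RECIP };

/* Modes named in the thread. MODE_COUNT must stay last. */
enum approx_mode { MODE_SF, MODE_DF, MODE_V2SF, MODE_V4SF, MODE_V2DF,
                   MODE_COUNT };

/* One bit per (kind, mode) pair: 4 kinds x 5 modes = 20 bits. */
#define APPROX_BIT(kind, mode) (1u << ((kind) * MODE_COUNT + (mode)))

/* Stand-in for a per-CPU tuning structure with a single mask field. */
struct cpu_tune
{
  const char *name;
  unsigned int approx_mask;
};

/* Single decision function: should this kind be approximated inline
   for this mode on the given CPU?  */
static bool
use_approximation (const struct cpu_tune *tune,
                   enum approx_kind kind, enum approx_mode mode)
{
  return (tune->approx_mask & APPROX_BIT (kind, mode)) != 0;
}

/* Made-up example tuning: approximate rsqrt for the single-precision
   scalar and vector modes only.  */
static const struct cpu_tune example_tune =
{
  "example-cpu",
  APPROX_BIT (APPROX_RSQRT, MODE_SF)
  | APPROX_BIT (APPROX_RSQRT, MODE_V2SF)
  | APPROX_BIT (APPROX_RSQRT, MODE_V4SF)
};
```

With this layout, use_approximation (&example_tune, APPROX_RSQRT, MODE_V4SF) is true while the same query for MODE_DF or for APPROX_SQRT is false, so a new combination costs one bit in a tuning entry rather than a new flag.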

I like the idea of a single cost function.


I'll go with it.

These patches are well and truly on my radar for GCC 7, but as we're still
in bugfixing mode (and there's still plenty to do!), I'm not going to get
round to giving them a more detailed review until after the release. Feel
free to ping them again once GCC 6 has shipped.

I've been proposing this change for a few months now and I'd really like to have it in 6. I'd appreciate it if you'd consider this request, all things considered.


Thank you,

--
Evandro Menezes

References:
- Re: [AArch64] Add precision choices for the reciprocal square root approximation
  - From: Wilco Dijkstra
- Re: [AArch64] Add precision choices for the reciprocal square root approximation
  - From: James Greenhalgh
