This is the mail archive of the gcc-patches@gcc.gnu.org mailing list for the GCC project.

Re: Add support for reductions in fully-masked loops

From: James Greenhalgh <james dot greenhalgh at arm dot com>
To: Jeff Law <law at redhat dot com>
Cc: "gcc-patches at gcc dot gnu dot org" <gcc-patches at gcc dot gnu dot org>, "richard dot sandiford at linaro dot org" <richard dot sandiford at linaro dot org>, <nd at arm dot com>
Date: Sun, 7 Jan 2018 20:35:32 +0000
Subject: Re: Add support for reductions in fully-masked loops

On Wed, Dec 13, 2017 at 04:34:34PM +0000, Jeff Law wrote:
> On 11/17/2017 07:59 AM, Richard Sandiford wrote:
> > This patch removes the restriction that fully-masked loops cannot
> > have reductions.  The key thing here is to make sure that the
> > reduction accumulator doesn't include any values associated with
> > inactive lanes; the patch adds a bunch of conditional binary
> > operations for doing that.
> > 
> > Tested on aarch64-linux-gnu (with and without SVE), x86_64-linux-gnu
> > and powerpc64le-linux-gnu.
> > 
> > Richard
> > 
> > 
> > 2017-11-17  Richard Sandiford  <richard.sandiford@linaro.org>
> > 	    Alan Hayward  <alan.hayward@arm.com>
> > 	    David Sherwood  <david.sherwood@arm.com>
> > 
> > gcc/
> > 	* doc/md.texi (cond_add@var{mode}, cond_sub@var{mode})
> > 	(cond_and@var{mode}, cond_ior@var{mode}, cond_xor@var{mode})
> > 	(cond_smin@var{mode}, cond_smax@var{mode}, cond_umin@var{mode})
> > 	(cond_umax@var{mode}): Document.
> > 	* optabs.def (cond_add_optab, cond_sub_optab, cond_and_optab)
> > 	(cond_ior_optab, cond_xor_optab, cond_smin_optab, cond_smax_optab)
> > 	(cond_umin_optab, cond_umax_optab): New optabs.
> > 	* internal-fn.def (COND_ADD, COND_SUB, COND_SMIN, COND_SMAX)
> > 	(COND_UMIN, COND_UMAX, COND_AND, COND_IOR, COND_XOR): New internal
> > 	functions.
> > 	* internal-fn.h (get_conditional_internal_fn): Declare.
> > 	* internal-fn.c (cond_binary_direct): New macro.
> > 	(expand_cond_binary_optab_fn): Likewise.
> > 	(direct_cond_binary_optab_supported_p): Likewise.
> > 	(get_conditional_internal_fn): New function.
> > 	* tree-vect-loop.c (vectorizable_reduction): Handle fully-masked loops.
> > 	Cope with reduction statements that are vectorized as calls rather
> > 	than assignments.
> > 	* config/aarch64/aarch64-sve.md (cond_<optab><mode>): New insns.
> > 	* config/aarch64/iterators.md (UNSPEC_COND_ADD, UNSPEC_COND_SUB)
> > 	(UNSPEC_COND_SMAX, UNSPEC_COND_UMAX, UNSPEC_COND_SMIN)
> > 	(UNSPEC_COND_UMIN, UNSPEC_COND_AND, UNSPEC_COND_ORR)
> > 	(UNSPEC_COND_EOR): New unspecs.
> > 	(optab): Add mappings for them.
> > 	(SVE_COND_INT_OP, SVE_COND_FP_OP): New int iterators.
> > 	(sve_int_op, sve_fp_op): New int attributes.
> > 
> > gcc/testsuite/
> > 	* gcc.dg/vect/pr60482.c: Remove XFAIL for variable-length vectors.
> > 	* gcc.target/aarch64/sve_reduc_1.c: Expect the loop operations
> > 	to be predicated.
> > 	* gcc.target/aarch64/sve_slp_5.c: Check for a fully-masked loop.
> > 	* gcc.target/aarch64/sve_slp_7.c: Likewise.
> > 	* gcc.target/aarch64/sve_reduc_5.c: New test.
> > 	* gcc.target/aarch64/sve_slp_13.c: Likewise.
> > 	* gcc.target/aarch64/sve_slp_13_run.c: Likewise.
> I didn't walk through the aarch64-specific bits here.  The generic bits
> are OK.

As are the AArch64 bits.

OK.

James
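
For readers following the thread, the point in the quoted patch description (the reduction accumulator must not pick up values from inactive lanes) can be illustrated with a small scalar model in C. This is only a sketch under stated assumptions, not the patch's implementation: `VL`, `cond_add` and `masked_sum` are hypothetical names chosen for the example, the vector length is fixed at 4 for simplicity (SVE's is not known at compile time), and the conditional add is modelled as "active lanes add, inactive lanes keep the accumulator unchanged".

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

#define VL 4  /* hypothetical fixed vector length, for illustration only */

/* Scalar model of a conditional add: active lanes accumulate x,
   inactive lanes leave the accumulator untouched.  */
static void
cond_add (const bool mask[VL], int acc[VL], const int x[VL])
{
  for (int lane = 0; lane < VL; ++lane)
    if (mask[lane])
      acc[lane] += x[lane];
}

/* Sum x[0..n-1] with a fully-masked loop: there is no scalar epilogue,
   so the final iteration may run with some lanes inactive.  */
int
masked_sum (const int *x, size_t n)
{
  assert (x != NULL || n == 0);

  int acc[VL] = { 0 };
  for (size_t i = 0; i < n; i += VL)
    {
      bool mask[VL];
      int vec[VL];
      for (int lane = 0; lane < VL; ++lane)
        {
          /* A lane is active only while its index is within bounds.  */
          mask[lane] = i + lane < n;
          /* Inactive lanes hold an arbitrary value; 12345 stands in for
             whatever a masked load would leave there.  */
          vec[lane] = mask[lane] ? x[i + lane] : 12345;
        }
      cond_add (mask, acc, vec);
    }

  /* Final cross-lane reduction of the per-lane partial sums.  */
  int sum = 0;
  for (int lane = 0; lane < VL; ++lane)
    sum += acc[lane];
  return sum;
}
```

With an unconditional add in place of `cond_add`, the placeholder values in the inactive lanes of the last iteration would leak into the result whenever `n` is not a multiple of `VL`.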

Follow-Ups:
- Re: Add support for reductions in fully-masked loops
  - From: Christophe Lyon
