This is the mail archive of the gcc-patches@gcc.gnu.org mailing list for the GCC project.

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]
Other format:	[Raw text]

Re: V2 [PATCH] i386: Add pass_remove_partial_avx_dependency

From: Jan Hubicka <hubicka at ucw dot cz>
To: Jeff Law <law at redhat dot com>
Cc: "H.J. Lu" <hjl dot tools at gmail dot com>, GCC Patches <gcc-patches at gcc dot gnu dot org>, "Pandey, Sunil K" <sunil dot k dot pandey at intel dot com>, Uros Bizjak <ubizjak at gmail dot com>
Date: Wed, 28 Nov 2018 21:21:03 +0100
Subject: Re: V2 [PATCH] i386: Add pass_remove_partial_avx_dependency
References: <CAMe9rOryQB=OT=RAff6BkLbbOrx9tDB1by6M_tnuYPBxmXQ5mQ@mail.gmail.com> <20181105142107.lxacbz25a7gm6767@kam.mff.cuni.cz> <2afa9f24-e018-beb6-2ef1-2f7d4bcea294@redhat.com> <20181105152904.2zzvyyhtiymfcrld@kam.mff.cuni.cz> <CAMe9rOpB3nj4LQ_yuFpB25bHhiv=1YEUL210f0s9DH6D3nS-EA@mail.gmail.com> <4c17a937-a770-c809-102d-d789ef0d842e@redhat.com>

> On 11/28/18 12:48 PM, H.J. Lu wrote:
> > On Mon, Nov 5, 2018 at 7:29 AM Jan Hubicka <hubicka@ucw.cz> wrote:
> >>
> >>> On 11/5/18 7:21 AM, Jan Hubicka wrote:
> >>>>>
> >>>>> Did you mean "the nearest common dominator"?
> >>>>
> >>>> If the nearest common dominator appears in the loop while all uses are
> >>>> out of loops, this will result in suboptimal xor placement.
> >>>> In this case you want to split edges out of the loop.
> >>>>
> >>>> In general this is what the LCM framework will do for you if the problem
> >>>> is modelled siimlar way as in mode_swtiching.  At entry function mode is
> >>>> "no zero register needed" and all conversions need mode "zero register
> >>>> needed".  Mode switching should then do the correct placement decisions
> >>>> (reaching minimal number of executions of xor).
> >>>>
> >>>> Jeff, whan is your optinion on the approach taken by the patch?
> >>>> It seems like a special case of more general issue, but I do not see
> >>>> very elegant way to solve it at least in the GCC 9 horisont, so if
> >>>> the placement is correct we can probalby go either with new pass or
> >>>> making this part of mode swithcing (which is anyway run by x86 backend)
> >>> So I haven't followed this discussion at all, but did touch on this
> >>> issue with some patch a month or two ago with a target patch that was
> >>> trying to avoid the partial stalls.
> >>>
> >>> My assumption is that we're trying to find one or more places to
> >>> initialize the upper half of an avx register so as to avoid partial
> >>> register stall at existing sites that set the upper half.
> >>>
> >>> This sounds like a classic PRE/LCM style problem (of which mode
> >>> switching is just another variant).   A common-dominator approach is
> >>> closer to a classic GCSE and is going to result is more initializations
> >>> at sub-optimal points than a PRE/LCM style.
> >>
> >> yes, it is usual code placement problem. It is special case because the
> >> zero register is not modified by the conversion (just we need to have
> >> zero somewhere).  So basically we do not have kills to the zero except
> >> for entry block.
> >>
> > 
> > Do you have  testcase to show thatf the nearest common dominator
> > in the loop, while all uses areout of loops, leads to suboptimal xor
> > placement?
> I don't have a testcase, but it's all but certain nearest common
> dominator is going to be a suboptimal placement.  That's going to create
> paths where you're going to emit the xor when it's not used.
> 
> The whole point of the LCM algorithms is they are optimal in terms of
> expression evaluations.

i think testcase should be something like

test()
{
  while (true)
  {
     if (cond1)
       {
       	 do_one_conversion;
	 return;
       }
     if (cond2)
       {
       	 do_other_conversion;
	 return;
       }
  }
}

Honza
> 
> jeff
> > 
>

References:
- Re: V2 [PATCH] i386: Add pass_remove_partial_avx_dependency
  - From: Jan Hubicka
- Re: V2 [PATCH] i386: Add pass_remove_partial_avx_dependency
  - From: Jeff Law
- Re: V2 [PATCH] i386: Add pass_remove_partial_avx_dependency
  - From: Jan Hubicka
- Re: V2 [PATCH] i386: Add pass_remove_partial_avx_dependency
  - From: H.J. Lu
- Re: V2 [PATCH] i386: Add pass_remove_partial_avx_dependency
  - From: Jeff Law

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]