This is the mail archive of the gcc-patches@gcc.gnu.org mailing list for the GCC project.

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]
Other format:	[Raw text]

Re: [Patch PR 44576]: Reduce the computation cost in compute_miss_rate for prefetching loop arrays

From: Sebastian Pop <sebpop at gmail dot com>
To: Richard Guenther <richard dot guenther at gmail dot com>
Cc: Zdenek Dvorak <rakdver at kam dot mff dot cuni dot cz>, "Fang, Changpeng" <Changpeng dot Fang at amd dot com>, Christian Borntraeger <borntraeger at de dot ibm dot com>, "gcc-patches at gcc dot gnu dot org" <gcc-patches at gcc dot gnu dot org>, "uweigand at de dot ibm dot com" <uweigand at de dot ibm dot com>
Date: Wed, 30 Jun 2010 13:10:49 -0500
Subject: Re: [Patch PR 44576]: Reduce the computation cost in compute_miss_rate for prefetching loop arrays
References: <D4C76825A6780047854A11E93CDE84D02F7759@SAUSEXMBP01.amd.com> <AANLkTimL0po55_PnkbcCe2lbJNlaeWmcLeRqy48kTBen@mail.gmail.com> <D4C76825A6780047854A11E93CDE84D02F775B@SAUSEXMBP01.amd.com> <AANLkTil2GYzA5OPqbqPwWRAhfdz_5vzl_moDpIHf4ole@mail.gmail.com> <20100630180140.GA32646@kam.mff.cuni.cz> <AANLkTinq0MP5fhFhmomp7E9NDzMzd5lukS32RxT7lYwm@mail.gmail.com>

On Wed, Jun 30, 2010 at 13:05, Richard Guenther
<richard.guenther@gmail.com> wrote:
> On Wed, Jun 30, 2010 at 8:01 PM, Zdenek Dvorak <rakdver@kam.mff.cuni.cz> wrote:
>> Hi,
>>
>>> On Wed, Jun 30, 2010 at 7:34 PM, Fang, Changpeng <Changpeng.Fang@amd.com> wrote:
>>> >> FOR_EACH_LOOP (li, loop, LI_ONLY_INNERMOST)
>>> >
>>> >>does that make a difference?
>>> >
>>> > This doesn't help, because "compute_all_dependences" was called the same number of time
>>> > as before.
>>> >
>>> > (BTW, should we limit prefetching only to the innermost one?)
>>> >
>>> > In this test case, there are 6 large loops, where each loop has 729 memory reference.
>>> > It takes 4~5 seconds to "compute_all_dependence" for one such loop.
>>>
>>> It shouldn't take that long. ?Can you gather a more detailed profile?
>>
>> actually, compute_all_dependences is quadratic in the number of memory
>> references, and in more complicated cases, it can perform rather complex
>> computations, so 5 seconds on 729 references does seem like a realistic time.
>> Of course, we need to either speed up the analysis or add an cut-off to avoid
>> it on loops with too many memory references (or both),
>
> Well, but at -O3 the vectorizer computes dependences as well, and it
> doesn't take that much of time, ?So there must be something obvious
> going wrong.
>

The dependence analysis in the vectorizer is done only in the innermost
loop that is vectorized, whereas prefetch does the analysis of data deps
for every loop.

Sebastian

Follow-Ups:
- Re: [Patch PR 44576]: Reduce the computation cost in compute_miss_rate for prefetching loop arrays
  - From: Richard Guenther

References:
- [Patch PR 44576]: Reduce the computation cost in compute_miss_rate for prefetching loop arrays
  - From: Fang, Changpeng
- Re: [Patch PR 44576]: Reduce the computation cost in compute_miss_rate for prefetching loop arrays
  - From: Richard Guenther
- RE: [Patch PR 44576]: Reduce the computation cost in compute_miss_rate for prefetching loop arrays
  - From: Fang, Changpeng
- Re: [Patch PR 44576]: Reduce the computation cost in compute_miss_rate for prefetching loop arrays
  - From: Richard Guenther
- Re: [Patch PR 44576]: Reduce the computation cost in compute_miss_rate for prefetching loop arrays
  - From: Zdenek Dvorak
- Re: [Patch PR 44576]: Reduce the computation cost in compute_miss_rate for prefetching loop arrays
  - From: Richard Guenther

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]