This is the mail archive of the
gcc-patches@gcc.gnu.org
mailing list for the GCC project.
[Patch PR 44576]: improvement in compute_miss_rate for prefetching loop arrays
- From: "Fang, Changpeng" <Changpeng dot Fang at amd dot com>
- To: Richard Guenther <richard dot guenther at gmail dot com>, Christian Borntraeger <borntraeger at de dot ibm dot com>, "gcc-patches at gcc dot gnu dot org" <gcc-patches at gcc dot gnu dot org>
- Cc: "uweigand at de dot ibm dot com" <uweigand at de dot ibm dot com>, Zdenek Dvorak <rakdver at kam dot mff dot cuni dot cz>, "sebpop at gmail dot com" <sebpop at gmail dot com>
- Date: Mon, 28 Jun 2010 19:01:54 -0500
- Subject: [Patch PR 44576]: improvement in compute_miss_rate for prefetching loop arrays
Hi,
The attached patch partially fixes bug 44576: testsuite/gfortran.dg/zero_sized_1.f90 takes a huge amount of
compile time with prefetching + peeling.
This patch avoids a useless miss rate computation: if delta (the address difference) is greater than or equal to
the cache line size, the two references will never hit the same cache line, so every access is a miss.
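To illustrate the reasoning, here is a minimal stand-alone sketch in C. It is not the GCC source: the function name miss_rate_sketch, the plain long types, and the use_shortcut flag are assumptions made for this example. It counts, over all alignments and iterations, how often two addresses that are delta bytes apart fall in different cache lines, and reports the result as misses per 1000 accesses.

```c
#include <assert.h>

/* Hypothetical, simplified sketch; not the code in tree-ssa-loop-prefetch.c.
   Returns the number of misses per 1000 accesses for two references whose
   addresses differ by DELTA bytes.  USE_SHORTCUT enables the early exit
   described above.  */
static int
miss_rate_sketch (long cache_line_size, long step, long delta,
                  long distinct_iters, long align_unit, int use_shortcut)
{
  long total_positions = 0, miss_positions = 0;
  long align, iter;

  /* Assumed preconditions for this sketch.  */
  assert (cache_line_size > 0 && align_unit > 0 && distinct_iters > 0);
  assert (step >= 0 && delta >= 0);

  /* Two addresses at least one cache line apart never share a line,
     so the loops below would count a miss at every position.  */
  if (use_shortcut && delta >= cache_line_size)
    return 1000;

  for (align = 0; align < cache_line_size; align += align_unit)
    for (iter = 0; iter < distinct_iters; iter++)
      {
        long address1 = align + step * iter;
        long address2 = address1 + delta;

        total_positions++;
        if (address1 / cache_line_size != address2 / cache_line_size)
          miss_positions++;
      }

  return (int) (1000 * miss_positions / total_positions);
}
```

With a 64-byte line and delta = 64, the sketch returns 1000 whether or not the shortcut is enabled; the shortcut only skips the two nested loops.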
This patch reduces the compile time of the test case from 5m30s to 1m20s on an amd-linux64 system.
Note that without -fprefetch-loop-arrays, the compile time on the same system is 30s, and I am still
working on reducing the complexity of reuse analysis and miss rate computation.
The patch passed bootstrap and regression tests.
Is this patch OK to commit?
Thanks,
Changpeng
From b29f8edf2b1a068ab7271746e8c621446e342dc1 Mon Sep 17 00:00:00 2001
From: Changpeng Fang <chfang@pathscale.(none)>
Date: Mon, 28 Jun 2010 10:23:36 -0700
Subject: [PATCH 4/4] pr 44576: miss rate computation improvement for prefetching loop arrays.
* tree-ssa-loop-prefetch.c (compute_miss_rate): Return 1000 (out of 1000)
for miss rate if the address difference is greater than or equal to the
cache line size (the two references will never hit the same cache
line).
---
gcc/tree-ssa-loop-prefetch.c | 5 +++++
1 files changed, 5 insertions(+), 0 deletions(-)
diff --git a/gcc/tree-ssa-loop-prefetch.c b/gcc/tree-ssa-loop-prefetch.c
index 548c3e4..27e2b42 100644
--- a/gcc/tree-ssa-loop-prefetch.c
+++ b/gcc/tree-ssa-loop-prefetch.c
@@ -654,6 +654,11 @@ compute_miss_rate (unsigned HOST_WIDE_INT cache_line_size,
int total_positions, miss_positions, miss_rate;
int address1, address2, cache_line1, cache_line2;
+ /* It always misses if delta is greater than or equal to the cache
+ line size. */
+ if (delta >= cache_line_size)
+ return 1000;
+
total_positions = 0;
miss_positions = 0;
--
1.6.3.3