optimization/2001: Inordinately long compile times in reload CSE regs
Wolfgang Bangerth
bangerth@ticam.utexas.edu
Mon Nov 25 15:09:00 GMT 2002
The following reply was made to PR optimization/2001; it has been noted by GNATS.
From: Wolfgang Bangerth <bangerth@ticam.utexas.edu>
To: gcc-gnats@gcc.gnu.org
Cc:
Subject: Re: optimization/2001: Inordinately long compile times in reload
CSE regs
Date: Tue, 19 Nov 2002 08:58:33 -0600 (CST)
---------- Forwarded message ----------
Date: Mon, 18 Nov 2002 23:37:37 -0500 (EST)
From: Brad Lucier <lucier@math.purdue.edu>
To: bangerth@dealii.org
Cc: gcc-bugs@gcc.gnu.org, gcc-prs@gcc.gnu.org, lucier@math.purdue.edu,
nobody@gcc.gnu.org
Subject: Re: optimization/2001: Inordinately long compile times in reload
CSE regs
>
> Synopsis: Inordinately long compile times in reload CSE regs
>
> State-Changed-From-To: open->feedback
> State-Changed-By: bangerth
> State-Changed-When: Mon Nov 18 15:22:46 2002
> State-Changed-Why:
> Brad, can you confirm whether this problem still applies? If
> it is a regression w.r.t. 2.95, then we may want to raise
> its priority.
>
> Thanks
> Wolfgang
>
> http://gcc.gnu.org/cgi-bin/gnatsweb.pl?cmd=view%20audit-trail&database=gcc&pr=2001
>
It still happens with today's CVS mainline. The timings for cc1 (checking
disabled, no profiling) are below. The input file is at
http://www.math.purdue.edu/~lucier/GNATS/GNATS-6/pi.i.gz
popov-222% /export/home/lucier/local/lib/gcc-lib/alphaev6-unknown-linux-gnu/3.3/cc1 -fPIC -fno-math-errno -O2 -mcpu=ev6 pi.i
__copysignf copysignf __copysign copysign __fabsf fabsf __fabs fabs __floorf __floor floorf floor ___H__20_pi {GC 6136k -> 2125k} {GC 27774k -> 14090k} {GC 20667k -> 10672k} {GC 20718k -> 17582k} {GC 48525k -> 18420k} {GC 33107k -> 24477k} ___init_proc ____20_pi
Execution times (seconds)
garbage collection : 1.24 ( 0%) usr 0.01 ( 2%) sys 1.00 ( 0%) wall
cfg construction : 0.26 ( 0%) usr 0.01 ( 2%) sys 0.50 ( 0%) wall
cfg cleanup : 0.94 ( 0%) usr 0.00 ( 0%) sys 1.00 ( 0%) wall
trivially dead code : 1.21 ( 0%) usr 0.00 ( 0%) sys 0.50 ( 0%) wall
life analysis : 12.97 ( 4%) usr 0.00 ( 1%) sys 13.00 ( 4%) wall
life info update : 1.97 ( 1%) usr 0.00 ( 0%) sys 2.50 ( 1%) wall
preprocessing : 0.06 ( 0%) usr 0.02 ( 4%) sys 0.00 ( 0%) wall
lexical analysis : 0.09 ( 0%) usr 0.05 ( 9%) sys 0.00 ( 0%) wall
parser : 0.26 ( 0%) usr 0.04 ( 6%) sys 0.50 ( 0%) wall
expand : 0.08 ( 0%) usr 0.00 ( 1%) sys 0.00 ( 0%) wall
varconst : 0.02 ( 0%) usr 0.00 ( 0%) sys 0.00 ( 0%) wall
integration : 0.03 ( 0%) usr 0.00 ( 0%) sys 0.00 ( 0%) wall
jump : 0.61 ( 0%) usr 0.00 ( 0%) sys 0.50 ( 0%) wall
CSE : 7.91 ( 2%) usr 0.01 ( 1%) sys 8.00 ( 2%) wall
global CSE : 4.73 ( 1%) usr 0.21 (35%) sys 5.00 ( 1%) wall
loop analysis : 0.14 ( 0%) usr 0.00 ( 0%) sys 0.50 ( 0%) wall
CSE 2 : 3.79 ( 1%) usr 0.00 ( 0%) sys 4.00 ( 1%) wall
branch prediction : 3.85 ( 1%) usr 0.00 ( 1%) sys 4.00 ( 1%) wall
flow analysis : 0.25 ( 0%) usr 0.00 ( 0%) sys 0.50 ( 0%) wall
combiner : 0.65 ( 0%) usr 0.00 ( 0%) sys 0.50 ( 0%) wall
if-conversion : 0.15 ( 0%) usr 0.00 ( 0%) sys 0.00 ( 0%) wall
regmove : 0.58 ( 0%) usr 0.00 ( 0%) sys 0.50 ( 0%) wall
scheduling : 159.40 (44%) usr 0.04 ( 6%) sys 159.50 (44%) wall
local alloc : 1.27 ( 0%) usr 0.00 ( 0%) sys 1.50 ( 0%) wall
global alloc : 4.20 ( 1%) usr 0.05 ( 8%) sys 4.00 ( 1%) wall
reload CSE regs : 109.40 (30%) usr 0.07 (12%) sys 109.50 (30%) wall
flow 2 : 0.60 ( 0%) usr 0.00 ( 0%) sys 0.50 ( 0%) wall
if-conversion 2 : 0.08 ( 0%) usr 0.00 ( 0%) sys 0.00 ( 0%) wall
peephole 2 : 0.26 ( 0%) usr 0.00 ( 0%) sys 0.00 ( 0%) wall
rename registers : 1.71 ( 0%) usr 0.00 ( 0%) sys 2.00 ( 1%) wall
scheduling 2 : 44.54 (12%) usr 0.02 ( 3%) sys 44.50 (12%) wall
reorder blocks : 0.04 ( 0%) usr 0.00 ( 0%) sys 0.00 ( 0%) wall
shorten branches : 0.38 ( 0%) usr 0.01 ( 1%) sys 0.50 ( 0%) wall
final : 1.39 ( 0%) usr 0.02 ( 4%) sys 1.50 ( 0%) wall
rest of compilation : 0.33 ( 0%) usr 0.00 ( 0%) sys 0.00 ( 0%) wall
TOTAL : 365.42 0.59 366.00
popov-223% ll pi.s
-rw-r--r-- 1 lucier users 1829841 Nov 18 23:30 pi.s
popov-224% /export/home/lucier/local/lib/gcc-lib/alphaev6-unknown-linux-gnu/3.3/cc1 -fPIC -fno-math-errno -O2 -fno-gcse -mcpu=ev6 pi.i
__copysignf copysignf __copysign copysign __fabsf fabsf __fabs fabs __floorf __floor floorf floor ___H__20_pi {GC 6136k -> 2125k} {GC 6575k -> 2138k} ___init_proc ____20_pi
Execution times (seconds)
garbage collection : 0.04 ( 0%) usr 0.00 ( 0%) sys 0.00 ( 0%) wall
cfg construction : 0.22 ( 2%) usr 0.01 ( 8%) sys 0.50 ( 5%) wall
cfg cleanup : 0.74 ( 8%) usr 0.00 ( 0%) sys 0.50 ( 5%) wall
trivially dead code : 0.06 ( 1%) usr 0.00 ( 0%) sys 0.00 ( 0%) wall
life analysis : 0.57 ( 6%) usr 0.00 ( 0%) sys 0.50 ( 5%) wall
life info update : 0.17 ( 2%) usr 0.00 ( 0%) sys 0.00 ( 0%) wall
preprocessing : 0.08 ( 1%) usr 0.02 (12%) sys 0.00 ( 0%) wall
lexical analysis : 0.08 ( 1%) usr 0.03 (21%) sys 0.00 ( 0%) wall
parser : 0.29 ( 3%) usr 0.03 (25%) sys 0.50 ( 5%) wall
expand : 0.09 ( 1%) usr 0.00 ( 1%) sys 0.00 ( 0%) wall
varconst : 0.02 ( 0%) usr 0.00 ( 0%) sys 0.00 ( 0%) wall
integration : 0.02 ( 0%) usr 0.00 ( 1%) sys 0.00 ( 0%) wall
jump : 0.46 ( 5%) usr 0.01 ( 4%) sys 0.50 ( 5%) wall
CSE : 0.14 ( 2%) usr 0.00 ( 0%) sys 0.00 ( 0%) wall
loop analysis : 0.01 ( 0%) usr 0.00 ( 0%) sys 0.00 ( 0%) wall
CSE 2 : 0.13 ( 1%) usr 0.00 ( 0%) sys 0.00 ( 0%) wall
branch prediction : 3.79 (42%) usr 0.01 ( 8%) sys 4.00 (42%) wall
flow analysis : 0.04 ( 0%) usr 0.00 ( 0%) sys 0.00 ( 0%) wall
combiner : 0.16 ( 2%) usr 0.00 ( 0%) sys 0.00 ( 0%) wall
if-conversion : 0.08 ( 1%) usr 0.00 ( 0%) sys 0.50 ( 5%) wall
regmove : 0.04 ( 0%) usr 0.00 ( 0%) sys 0.00 ( 0%) wall
scheduling : 0.66 ( 7%) usr 0.00 ( 1%) sys 1.00 (11%) wall
local alloc : 0.15 ( 2%) usr 0.00 ( 0%) sys 0.00 ( 0%) wall
global alloc : 0.18 ( 2%) usr 0.01 ( 9%) sys 0.00 ( 0%) wall
reload CSE regs : 0.21 ( 2%) usr 0.00 ( 3%) sys 0.50 ( 5%) wall
flow 2 : 0.04 ( 0%) usr 0.00 ( 0%) sys 0.00 ( 0%) wall
if-conversion 2 : 0.06 ( 1%) usr 0.00 ( 0%) sys 0.00 ( 0%) wall
peephole 2 : 0.01 ( 0%) usr 0.00 ( 0%) sys 0.00 ( 0%) wall
rename registers : 0.09 ( 1%) usr 0.00 ( 1%) sys 0.00 ( 0%) wall
scheduling 2 : 0.16 ( 2%) usr 0.00 ( 1%) sys 0.50 ( 5%) wall
reorder blocks : 0.04 ( 0%) usr 0.00 ( 0%) sys 0.00 ( 0%) wall
shorten branches : 0.01 ( 0%) usr 0.00 ( 0%) sys 0.00 ( 0%) wall
final : 0.08 ( 1%) usr 0.00 ( 1%) sys 0.00 ( 0%) wall
rest of compilation : 0.13 ( 1%) usr 0.00 ( 0%) sys 0.50 ( 5%) wall
TOTAL : 9.08 0.13 9.50
popov-225% ll pi.s
-rw-r--r-- 1 lucier users 113392 Nov 18 23:31 pi.s
More information about the Gcc-prs
mailing list