This is the mail archive of the gcc@gcc.gnu.org mailing list for the GCC project.


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]
Other format: [Raw text]

Re: Compilation time has more than doubled on some Polyhedron tests


On an AMD, the 20060105 build gives

[scala] gfortran/2006> time irun/bin/gfortran -ftime-report -O3 -ffast-math -funroll-loops induct.f90

Execution times (seconds)
 garbage collection    :   0.39 ( 2%) usr   0.01 ( 2%) sys   0.40 ( 2%) wall       0 kB ( 0%) ggc
 callgraph construction:   0.09 ( 0%) usr   0.00 ( 0%) sys   0.06 ( 0%) wall    1107 kB ( 1%) ggc
 callgraph optimization:   0.00 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall     351 kB ( 0%) ggc
 ipa reference         :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall       1 kB ( 0%) ggc
 ipa type escape       :   0.02 ( 0%) usr   0.01 ( 2%) sys   0.02 ( 0%) wall       0 kB ( 0%) ggc
 cfg construction      :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall     110 kB ( 0%) ggc
 cfg cleanup           :   0.08 ( 0%) usr   0.00 ( 0%) sys   0.05 ( 0%) wall     443 kB ( 0%) ggc
 CFG verifier          :   0.22 ( 1%) usr   0.00 ( 0%) sys   0.26 ( 1%) wall       0 kB ( 0%) ggc
 trivially dead code   :   0.09 ( 0%) usr   0.00 ( 0%) sys   0.07 ( 0%) wall       0 kB ( 0%) ggc
 life analysis         :   0.20 ( 1%) usr   0.00 ( 0%) sys   0.20 ( 1%) wall     597 kB ( 0%) ggc
 life info update      :   0.06 ( 0%) usr   0.00 ( 0%) sys   0.06 ( 0%) wall      94 kB ( 0%) ggc
 alias analysis        :   0.09 ( 0%) usr   0.00 ( 0%) sys   0.09 ( 0%) wall    2569 kB ( 2%) ggc
 register scan         :   0.04 ( 0%) usr   0.00 ( 0%) sys   0.07 ( 0%) wall       1 kB ( 0%) ggc
 rebuild jump labels   :   0.04 ( 0%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall       0 kB ( 0%) ggc
 parser                :   0.17 ( 1%) usr   0.01 ( 2%) sys   0.39 ( 2%) wall    6866 kB ( 5%) ggc
 tree gimplify         :   0.04 ( 0%) usr   0.00 ( 0%) sys   0.08 ( 0%) wall    1671 kB ( 1%) ggc
 tree CFG construction :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall    2866 kB ( 2%) ggc
 tree CFG cleanup      :   0.04 ( 0%) usr   0.00 ( 0%) sys   0.03 ( 0%) wall     441 kB ( 0%) ggc
 tree VRP              :   0.12 ( 1%) usr   0.02 ( 5%) sys   0.12 ( 1%) wall    2473 kB ( 2%) ggc
 tree copy propagation :   0.44 ( 2%) usr   0.03 ( 8%) sys   0.49 ( 2%) wall     292 kB ( 0%) ggc
 tree store copy prop  :   0.09 ( 0%) usr   0.00 ( 0%) sys   0.10 ( 1%) wall      41 kB ( 0%) ggc
 tree find ref. vars   :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall     597 kB ( 0%) ggc
 tree PTA              :   0.28 ( 1%) usr   0.00 ( 0%) sys   0.28 ( 1%) wall     351 kB ( 0%) ggc
 tree alias analysis   :   0.20 ( 1%) usr   0.01 ( 3%) sys   0.26 ( 1%) wall     484 kB ( 0%) ggc
 tree PHI insertion    :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 0%) wall     411 kB ( 0%) ggc
 tree SSA rewrite      :   0.45 ( 2%) usr   0.02 ( 5%) sys   0.36 ( 2%) wall   35265 kB (27%) ggc
 tree SSA other        :   0.02 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall       0 kB ( 0%) ggc
 tree SSA incremental  :   0.71 ( 4%) usr   0.02 ( 5%) sys   0.77 ( 4%) wall    6145 kB ( 5%) ggc
 tree operand scan     :   0.44 ( 2%) usr   0.07 (18%) sys   0.55 ( 3%) wall   17385 kB (13%) ggc
 dominator optimization:   0.22 ( 1%) usr   0.00 ( 0%) sys   0.23 ( 1%) wall    5640 kB ( 4%) ggc
 tree SRA              :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall      22 kB ( 0%) ggc
 tree STORE-CCP        :   0.08 ( 0%) usr   0.00 ( 0%) sys   0.07 ( 0%) wall      69 kB ( 0%) ggc
 tree CCP              :   0.06 ( 0%) usr   0.01 ( 2%) sys   0.10 ( 1%) wall      25 kB ( 0%) ggc
 tree split crit edges :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall    1059 kB ( 1%) ggc
 tree reassociation    :   0.02 ( 0%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall      79 kB ( 0%) ggc
 tree PRE              :   0.20 ( 1%) usr   0.01 ( 3%) sys   0.19 ( 1%) wall    2221 kB ( 2%) ggc
 tree FRE              :   0.07 ( 0%) usr   0.00 ( 0%) sys   0.08 ( 0%) wall    1901 kB ( 1%) ggc
 tree conservative DCE :   0.23 ( 1%) usr   0.00 ( 0%) sys   0.19 ( 1%) wall       0 kB ( 0%) ggc
 tree aggressive DCE   :   0.08 ( 0%) usr   0.00 ( 0%) sys   0.07 ( 0%) wall       0 kB ( 0%) ggc
 tree DSE              :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall      58 kB ( 0%) ggc
 PHI merge             :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall     764 kB ( 1%) ggc
 tree loop bounds      :   0.02 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall     357 kB ( 0%) ggc
 loop invariant motion :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.04 ( 0%) wall      18 kB ( 0%) ggc
 tree canonical iv     :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall     169 kB ( 0%) ggc
 scev constant prop    :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall      94 kB ( 0%) ggc
 complete unrolling    :   0.13 ( 1%) usr   0.01 ( 3%) sys   0.12 ( 1%) wall     435 kB ( 0%) ggc
 tree iv optimization  :   0.04 ( 0%) usr   0.01 ( 3%) sys   0.05 ( 0%) wall    1968 kB ( 2%) ggc
 tree loop init        :   0.04 ( 0%) usr   0.00 ( 0%) sys   0.04 ( 0%) wall       5 kB ( 0%) ggc
 tree copy headers     :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall     826 kB ( 1%) ggc
 tree SSA uncprop      :   0.02 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall       0 kB ( 0%) ggc
 tree SSA to normal    :   0.14 ( 1%) usr   0.00 ( 0%) sys   0.14 ( 1%) wall    2007 kB ( 2%) ggc
 tree rename SSA copies:   0.04 ( 0%) usr   0.05 (12%) sys   0.06 ( 0%) wall       0 kB ( 0%) ggc
 tree SSA verifier     :   9.55 (50%) usr   0.09 (23%) sys   9.62 (48%) wall      19 kB ( 0%) ggc
 tree STMT verifier    :   1.56 ( 8%) usr   0.00 ( 0%) sys   1.61 ( 8%) wall       0 kB ( 0%) ggc
 callgraph verifier    :   0.02 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 0%) wall       0 kB ( 0%) ggc
 dominance frontiers   :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall       0 kB ( 0%) ggc
 expand                :   0.39 ( 2%) usr   0.00 ( 0%) sys   0.46 ( 2%) wall    9703 kB ( 8%) ggc
 CSE                   :   0.22 ( 1%) usr   0.00 ( 0%) sys   0.25 ( 1%) wall     742 kB ( 1%) ggc
 loop analysis         :   0.10 ( 1%) usr   0.01 ( 3%) sys   0.11 ( 1%) wall    2339 kB ( 2%) ggc
 global CSE            :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall       0 kB ( 0%) ggc
 CPROP 1               :   0.02 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 0%) wall     839 kB ( 1%) ggc
 PRE                   :   0.02 ( 0%) usr   0.00 ( 0%) sys   0.04 ( 0%) wall     391 kB ( 0%) ggc
 CPROP 2               :   0.05 ( 0%) usr   0.00 ( 0%) sys   0.03 ( 0%) wall     220 kB ( 0%) ggc
 bypass jumps          :   0.04 ( 0%) usr   0.00 ( 0%) sys   0.04 ( 0%) wall     316 kB ( 0%) ggc
 web                   :   0.02 ( 0%) usr   0.00 ( 0%) sys   0.03 ( 0%) wall      73 kB ( 0%) ggc
 CSE 2                 :   0.16 ( 1%) usr   0.00 ( 0%) sys   0.16 ( 1%) wall     451 kB ( 0%) ggc
 branch prediction     :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 0%) wall     291 kB ( 0%) ggc
 combiner              :   0.11 ( 1%) usr   0.00 ( 0%) sys   0.12 ( 1%) wall    1481 kB ( 1%) ggc
 if-conversion         :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall      36 kB ( 0%) ggc
 regmove               :   0.07 ( 0%) usr   0.00 ( 0%) sys   0.07 ( 0%) wall      12 kB ( 0%) ggc
 local alloc           :   0.11 ( 1%) usr   0.00 ( 0%) sys   0.10 ( 1%) wall     487 kB ( 0%) ggc
 global alloc          :   0.38 ( 2%) usr   0.01 ( 3%) sys   0.37 ( 2%) wall    4316 kB ( 3%) ggc
 reload CSE regs       :   0.14 ( 1%) usr   0.00 ( 0%) sys   0.15 ( 1%) wall    1897 kB ( 1%) ggc
 load CSE after reload :   0.02 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 0%) wall      21 kB ( 0%) ggc
 flow 2                :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall     381 kB ( 0%) ggc
 if-conversion 2       :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall      10 kB ( 0%) ggc
 peephole 2            :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall      55 kB ( 0%) ggc
 rename registers      :   0.13 ( 1%) usr   0.00 ( 0%) sys   0.10 ( 1%) wall     279 kB ( 0%) ggc
 scheduling 2          :   0.19 ( 1%) usr   0.00 ( 0%) sys   0.20 ( 1%) wall    4698 kB ( 4%) ggc
 machine dep reorg     :   0.03 ( 0%) usr   0.00 ( 0%) sys   0.04 ( 0%) wall      23 kB ( 0%) ggc
 reorder blocks        :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 0%) wall     235 kB ( 0%) ggc
 final                 :   0.07 ( 0%) usr   0.00 ( 0%) sys   0.04 ( 0%) wall     333 kB ( 0%) ggc
 TOTAL                 :  19.26             0.40            19.91             129144 kB
Extra diagnostic checks enabled; compiler may run slowly.
Configure with --disable-checking to disable checks.
19.326u 0.428s 0:20.21 97.6%    0+0k 0+0io 0pf+0w

for induct.f90, and

[scala] gfortran/2006> time irun/bin/gfortran -ftime-report -O3 -ffast-math -funroll-loops air.f90 

Execution times (seconds)
 garbage collection    :   0.20 ( 1%) usr   0.00 ( 0%) sys   0.19 ( 1%) wall       0 kB ( 0%) ggc
 callgraph construction:   0.02 ( 0%) usr   0.01 ( 4%) sys   0.03 ( 0%) wall     378 kB ( 0%) ggc
 ipa reference         :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall       2 kB ( 0%) ggc
 ipa type escape       :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall       0 kB ( 0%) ggc
 cfg cleanup           :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.04 ( 0%) wall     222 kB ( 0%) ggc
 CFG verifier          :   0.10 ( 1%) usr   0.00 ( 0%) sys   0.07 ( 0%) wall       0 kB ( 0%) ggc
 trivially dead code   :   0.07 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 0%) wall       0 kB ( 0%) ggc
 life analysis         :   0.10 ( 1%) usr   0.00 ( 0%) sys   0.07 ( 0%) wall     387 kB ( 0%) ggc
 life info update      :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 0%) wall      58 kB ( 0%) ggc
 alias analysis        :   0.06 ( 0%) usr   0.00 ( 0%) sys   0.07 ( 0%) wall    1181 kB ( 1%) ggc
 register scan         :   0.02 ( 0%) usr   0.00 ( 0%) sys   0.03 ( 0%) wall       0 kB ( 0%) ggc
 rebuild jump labels   :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall       0 kB ( 0%) ggc
 parser                :   0.07 ( 0%) usr   0.00 ( 0%) sys   0.08 ( 1%) wall    2893 kB ( 3%) ggc
 tree gimplify         :   0.03 ( 0%) usr   0.00 ( 0%) sys   0.03 ( 0%) wall    1547 kB ( 2%) ggc
 tree eh               :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall       0 kB ( 0%) ggc
 tree CFG construction :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall    1563 kB ( 2%) ggc
 tree CFG cleanup      :   0.03 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 0%) wall      80 kB ( 0%) ggc
 tree VRP              :   0.08 ( 1%) usr   0.01 ( 4%) sys   0.09 ( 1%) wall    1305 kB ( 1%) ggc
 tree copy propagation :   0.45 ( 3%) usr   0.03 (12%) sys   0.48 ( 3%) wall     582 kB ( 1%) ggc
 tree store copy prop  :   0.08 ( 1%) usr   0.01 ( 4%) sys   0.09 ( 1%) wall     100 kB ( 0%) ggc
 tree find ref. vars   :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall     263 kB ( 0%) ggc
 tree PTA              :   0.18 ( 1%) usr   0.00 ( 0%) sys   0.15 ( 1%) wall     114 kB ( 0%) ggc
 tree alias analysis   :   0.13 ( 1%) usr   0.00 ( 0%) sys   0.19 ( 1%) wall     159 kB ( 0%) ggc
 tree SSA rewrite      :   0.50 ( 3%) usr   0.03 (12%) sys   0.50 ( 3%) wall   27572 kB (29%) ggc
 tree SSA other        :   0.01 ( 0%) usr   0.01 ( 4%) sys   0.01 ( 0%) wall       0 kB ( 0%) ggc
 tree SSA incremental  :   0.80 ( 5%) usr   0.00 ( 0%) sys   0.83 ( 5%) wall    5076 kB ( 5%) ggc
 tree operand scan     :   0.35 ( 2%) usr   0.03 (12%) sys   0.37 ( 2%) wall   12479 kB (13%) ggc
 dominator optimization:   0.17 ( 1%) usr   0.00 ( 0%) sys   0.19 ( 1%) wall    1234 kB ( 1%) ggc
 tree STORE-CCP        :   0.07 ( 0%) usr   0.01 ( 4%) sys   0.07 ( 0%) wall      21 kB ( 0%) ggc
 tree CCP              :   0.08 ( 1%) usr   0.00 ( 0%) sys   0.08 ( 1%) wall      42 kB ( 0%) ggc
 tree split crit edges :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall    1404 kB ( 1%) ggc
 tree reassociation    :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 0%) wall      65 kB ( 0%) ggc
 tree PRE              :   0.13 ( 1%) usr   0.01 ( 4%) sys   0.14 ( 1%) wall    1061 kB ( 1%) ggc
 tree FRE              :   0.04 ( 0%) usr   0.00 ( 0%) sys   0.03 ( 0%) wall    1501 kB ( 2%) ggc
 tree conservative DCE :   0.17 ( 1%) usr   0.00 ( 0%) sys   0.18 ( 1%) wall       0 kB ( 0%) ggc
 tree aggressive DCE   :   0.06 ( 0%) usr   0.00 ( 0%) sys   0.06 ( 0%) wall       0 kB ( 0%) ggc
 tree DSE              :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 0%) wall     110 kB ( 0%) ggc
 tree loop bounds      :   0.02 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 0%) wall     673 kB ( 1%) ggc
 loop invariant motion :   0.05 ( 0%) usr   0.00 ( 0%) sys   0.05 ( 0%) wall      43 kB ( 0%) ggc
 scev constant prop    :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall     168 kB ( 0%) ggc
 complete unrolling    :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall      55 kB ( 0%) ggc
 tree iv optimization  :   0.19 ( 1%) usr   0.00 ( 0%) sys   0.20 ( 1%) wall   10657 kB (11%) ggc
 tree loop init        :   0.03 ( 0%) usr   0.00 ( 0%) sys   0.04 ( 0%) wall       0 kB ( 0%) ggc
 tree copy headers     :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall    2022 kB ( 2%) ggc
 tree SSA uncprop      :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall       0 kB ( 0%) ggc
 tree SSA to normal    :   0.08 ( 1%) usr   0.00 ( 0%) sys   0.08 ( 1%) wall     578 kB ( 1%) ggc
 tree rename SSA copies:   0.03 ( 0%) usr   0.01 ( 4%) sys   0.03 ( 0%) wall       0 kB ( 0%) ggc
 tree SSA verifier     :   8.13 (55%) usr   0.08 (32%) sys   8.18 (54%) wall       0 kB ( 0%) ggc
 tree STMT verifier    :   0.95 ( 6%) usr   0.00 ( 0%) sys   0.99 ( 7%) wall       0 kB ( 0%) ggc
 callgraph verifier    :   0.02 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 0%) wall       0 kB ( 0%) ggc
 expand                :   0.32 ( 2%) usr   0.00 ( 0%) sys   0.32 ( 2%) wall    3161 kB ( 3%) ggc
 CSE                   :   0.07 ( 0%) usr   0.00 ( 0%) sys   0.06 ( 0%) wall    2024 kB ( 2%) ggc
 loop analysis         :   0.10 ( 1%) usr   0.00 ( 0%) sys   0.13 ( 1%) wall    3069 kB ( 3%) ggc
 global CSE            :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall       0 kB ( 0%) ggc
 CPROP 1               :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall     228 kB ( 0%) ggc
 PRE                   :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.04 ( 0%) wall     163 kB ( 0%) ggc
 CPROP 2               :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall     117 kB ( 0%) ggc
 bypass jumps          :   0.02 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 0%) wall     118 kB ( 0%) ggc
 web                   :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 0%) wall      86 kB ( 0%) ggc
 CSE 2                 :   0.07 ( 0%) usr   0.00 ( 0%) sys   0.06 ( 0%) wall    1340 kB ( 1%) ggc
 branch prediction     :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 0%) wall      96 kB ( 0%) ggc
 flow analysis         :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall       0 kB ( 0%) ggc
 combiner              :   0.09 ( 1%) usr   0.00 ( 0%) sys   0.07 ( 0%) wall    1748 kB ( 2%) ggc
 if-conversion         :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall       8 kB ( 0%) ggc
 regmove               :   0.03 ( 0%) usr   0.00 ( 0%) sys   0.04 ( 0%) wall       7 kB ( 0%) ggc
 local alloc           :   0.07 ( 0%) usr   0.00 ( 0%) sys   0.06 ( 0%) wall     271 kB ( 0%) ggc
 global alloc          :   0.20 ( 1%) usr   0.00 ( 0%) sys   0.20 ( 1%) wall    1570 kB ( 2%) ggc
 reload CSE regs       :   0.05 ( 0%) usr   0.00 ( 0%) sys   0.04 ( 0%) wall     881 kB ( 1%) ggc
 load CSE after reload :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall       8 kB ( 0%) ggc
 flow 2                :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall     242 kB ( 0%) ggc
 peephole 2            :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall      20 kB ( 0%) ggc
 rename registers      :   0.06 ( 0%) usr   0.00 ( 0%) sys   0.06 ( 0%) wall     217 kB ( 0%) ggc
 scheduling 2          :   0.07 ( 0%) usr   0.01 ( 4%) sys   0.09 ( 1%) wall    1939 kB ( 2%) ggc
 machine dep reorg     :   0.03 ( 0%) usr   0.00 ( 0%) sys   0.03 ( 0%) wall      43 kB ( 0%) ggc
 reorder blocks        :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall     113 kB ( 0%) ggc
 reg stack             :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall      11 kB ( 0%) ggc
 final                 :   0.03 ( 0%) usr   0.00 ( 0%) sys   0.03 ( 0%) wall     157 kB ( 0%) ggc
 TOTAL                 :  14.91             0.25            15.17              94901 kB
Extra diagnostic checks enabled; compiler may run slowly.
Configure with --disable-checking to disable checks.
14.953u 0.277s 0:15.40 98.8%    0+0k 0+0io 0pf+0w

for air.f90. The 20060106 build gives

[scala] gfortran/2006> time irun/bin/gfortran -ftime-report -O3 -ffast-math -funroll-loops induct.f90

Execution times (seconds)
 garbage collection    :   0.38 ( 1%) usr   0.02 ( 5%) sys   0.40 ( 1%) wall       0 kB ( 0%) ggc
 callgraph construction:   0.04 ( 0%) usr   0.01 ( 3%) sys   0.06 ( 0%) wall    1107 kB ( 1%) ggc
 ipa reference         :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 0%) wall       1 kB ( 0%) ggc
 ipa type escape       :   0.02 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 0%) wall       0 kB ( 0%) ggc
 cfg construction      :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall     110 kB ( 0%) ggc
 cfg cleanup           :   0.05 ( 0%) usr   0.00 ( 0%) sys   0.03 ( 0%) wall     453 kB ( 0%) ggc
 CFG verifier          :   0.17 ( 1%) usr   0.00 ( 0%) sys   0.22 ( 1%) wall       0 kB ( 0%) ggc
 trivially dead code   :   0.07 ( 0%) usr   0.00 ( 0%) sys   0.05 ( 0%) wall       0 kB ( 0%) ggc
 life analysis         :   0.15 ( 0%) usr   0.00 ( 0%) sys   0.25 ( 1%) wall     550 kB ( 0%) ggc
 life info update      :   0.04 ( 0%) usr   0.00 ( 0%) sys   0.05 ( 0%) wall      90 kB ( 0%) ggc
 alias analysis        :   0.15 ( 0%) usr   0.00 ( 0%) sys   0.11 ( 0%) wall    2513 kB ( 1%) ggc
 register scan         :   0.06 ( 0%) usr   0.00 ( 0%) sys   0.06 ( 0%) wall       4 kB ( 0%) ggc
 rebuild jump labels   :   0.03 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall       0 kB ( 0%) ggc
 parser                :   0.12 ( 0%) usr   0.01 ( 3%) sys   0.44 ( 1%) wall    6866 kB ( 4%) ggc
 tree gimplify         :   0.09 ( 0%) usr   0.00 ( 0%) sys   0.10 ( 0%) wall    1671 kB ( 1%) ggc
 tree CFG construction :   0.02 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall    2866 kB ( 1%) ggc
 tree CFG cleanup      :   0.04 ( 0%) usr   0.01 ( 3%) sys   0.09 ( 0%) wall     449 kB ( 0%) ggc
 tree VRP              :   0.21 ( 1%) usr   0.01 ( 3%) sys   0.20 ( 1%) wall    4112 kB ( 2%) ggc
 tree copy propagation :   1.07 ( 3%) usr   0.04 (11%) sys   1.06 ( 3%) wall     859 kB ( 0%) ggc
 tree store copy prop  :   0.20 ( 1%) usr   0.02 ( 5%) sys   0.22 ( 1%) wall     150 kB ( 0%) ggc
 tree find ref. vars   :   0.02 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 0%) wall     597 kB ( 0%) ggc
 tree PTA              :   0.50 ( 1%) usr   0.00 ( 0%) sys   0.46 ( 1%) wall     355 kB ( 0%) ggc
 tree alias analysis   :   0.52 ( 2%) usr   0.01 ( 3%) sys   0.55 ( 2%) wall    3380 kB ( 2%) ggc
 tree PHI insertion    :   0.03 ( 0%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall    1011 kB ( 1%) ggc
 tree SSA rewrite      :   0.93 ( 3%) usr   0.03 ( 8%) sys   1.08 ( 3%) wall   65009 kB (33%) ggc
 tree SSA other        :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall       0 kB ( 0%) ggc
 tree SSA incremental  :   1.84 ( 5%) usr   0.02 ( 5%) sys   1.87 ( 5%) wall   13262 kB ( 7%) ggc
 tree operand scan     :   0.88 ( 3%) usr   0.05 (13%) sys   0.97 ( 3%) wall   29929 kB (15%) ggc
 dominator optimization:   0.43 ( 1%) usr   0.00 ( 0%) sys   0.41 ( 1%) wall    5778 kB ( 3%) ggc
 tree STORE-CCP        :   0.18 ( 1%) usr   0.00 ( 0%) sys   0.14 ( 0%) wall      72 kB ( 0%) ggc
 tree CCP              :   0.13 ( 0%) usr   0.00 ( 0%) sys   0.18 ( 1%) wall      36 kB ( 0%) ggc
 tree split crit edges :   0.03 ( 0%) usr   0.00 ( 0%) sys   0.03 ( 0%) wall    2540 kB ( 1%) ggc
 tree reassociation    :   0.02 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 0%) wall      77 kB ( 0%) ggc
 tree PRE              :   0.27 ( 1%) usr   0.01 ( 3%) sys   0.29 ( 1%) wall    2153 kB ( 1%) ggc
 tree FRE              :   0.06 ( 0%) usr   0.00 ( 0%) sys   0.08 ( 0%) wall    1860 kB ( 1%) ggc
 tree code sinking     :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall      16 kB ( 0%) ggc
 tree linearize phis   :   0.02 ( 0%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall       7 kB ( 0%) ggc
 tree forward propagate:   0.02 ( 0%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall       7 kB ( 0%) ggc
 tree conservative DCE :   0.41 ( 1%) usr   0.00 ( 0%) sys   0.42 ( 1%) wall       0 kB ( 0%) ggc
 tree aggressive DCE   :   0.11 ( 0%) usr   0.00 ( 0%) sys   0.12 ( 0%) wall       0 kB ( 0%) ggc
 tree DSE              :   0.04 ( 0%) usr   0.00 ( 0%) sys   0.03 ( 0%) wall     148 kB ( 0%) ggc
 PHI merge             :   0.03 ( 0%) usr   0.00 ( 0%) sys   0.04 ( 0%) wall    2456 kB ( 1%) ggc
 tree loop bounds      :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall     354 kB ( 0%) ggc
 loop invariant motion :   0.03 ( 0%) usr   0.00 ( 0%) sys   0.03 ( 0%) wall      16 kB ( 0%) ggc
 tree canonical iv     :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall     167 kB ( 0%) ggc
 scev constant prop    :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall      93 kB ( 0%) ggc
 tree loop unswitching :   0.00 ( 0%) usr   0.01 ( 3%) sys   0.01 ( 0%) wall       0 kB ( 0%) ggc
 complete unrolling    :   0.05 ( 0%) usr   0.00 ( 0%) sys   0.05 ( 0%) wall     455 kB ( 0%) ggc
 tree iv optimization  :   0.04 ( 0%) usr   0.00 ( 0%) sys   0.04 ( 0%) wall    1939 kB ( 1%) ggc
 tree loop init        :   0.07 ( 0%) usr   0.00 ( 0%) sys   0.08 ( 0%) wall       1 kB ( 0%) ggc
 tree copy headers     :   0.02 ( 0%) usr   0.01 ( 3%) sys   0.02 ( 0%) wall    3783 kB ( 2%) ggc
 tree SSA uncprop      :   0.02 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall       0 kB ( 0%) ggc
 tree SSA to normal    :   0.18 ( 1%) usr   0.03 ( 8%) sys   0.19 ( 1%) wall    2111 kB ( 1%) ggc
 tree rename SSA copies:   0.05 ( 0%) usr   0.01 ( 3%) sys   0.06 ( 0%) wall       0 kB ( 0%) ggc
 tree SSA verifier     :  18.53 (55%) usr   0.05 (13%) sys  18.54 (54%) wall      19 kB ( 0%) ggc
 tree STMT verifier    :   2.36 ( 7%) usr   0.00 ( 0%) sys   2.28 ( 7%) wall       0 kB ( 0%) ggc
 callgraph verifier    :   0.02 ( 0%) usr   0.00 ( 0%) sys   0.03 ( 0%) wall       0 kB ( 0%) ggc
 expand                :   0.97 ( 3%) usr   0.01 ( 3%) sys   1.01 ( 3%) wall   13943 kB ( 7%) ggc
 CSE                   :   0.22 ( 1%) usr   0.00 ( 0%) sys   0.22 ( 1%) wall     782 kB ( 0%) ggc
 loop analysis         :   0.11 ( 0%) usr   0.01 ( 3%) sys   0.07 ( 0%) wall    2393 kB ( 1%) ggc
 CPROP 1               :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.05 ( 0%) wall     835 kB ( 0%) ggc
 PRE                   :   0.05 ( 0%) usr   0.00 ( 0%) sys   0.04 ( 0%) wall     398 kB ( 0%) ggc
 CPROP 2               :   0.03 ( 0%) usr   0.00 ( 0%) sys   0.05 ( 0%) wall     222 kB ( 0%) ggc
 bypass jumps          :   0.02 ( 0%) usr   0.00 ( 0%) sys   0.03 ( 0%) wall     200 kB ( 0%) ggc
 web                   :   0.04 ( 0%) usr   0.00 ( 0%) sys   0.04 ( 0%) wall      79 kB ( 0%) ggc
 CSE 2                 :   0.18 ( 1%) usr   0.00 ( 0%) sys   0.15 ( 0%) wall     452 kB ( 0%) ggc
 branch prediction     :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.04 ( 0%) wall     287 kB ( 0%) ggc
 flow analysis         :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall       0 kB ( 0%) ggc
 combiner              :   0.13 ( 0%) usr   0.00 ( 0%) sys   0.11 ( 0%) wall    1577 kB ( 1%) ggc
 regmove               :   0.04 ( 0%) usr   0.00 ( 0%) sys   0.07 ( 0%) wall      12 kB ( 0%) ggc
 local alloc           :   0.14 ( 0%) usr   0.01 ( 3%) sys   0.11 ( 0%) wall     493 kB ( 0%) ggc
 global alloc          :   0.42 ( 1%) usr   0.00 ( 0%) sys   0.39 ( 1%) wall    4390 kB ( 2%) ggc
 reload CSE regs       :   0.11 ( 0%) usr   0.00 ( 0%) sys   0.17 ( 0%) wall    1905 kB ( 1%) ggc
 load CSE after reload :   0.02 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall      21 kB ( 0%) ggc
 flow 2                :   0.02 ( 0%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall     370 kB ( 0%) ggc
 if-conversion 2       :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall      10 kB ( 0%) ggc
 peephole 2            :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall      54 kB ( 0%) ggc
 rename registers      :   0.16 ( 0%) usr   0.00 ( 0%) sys   0.14 ( 0%) wall     282 kB ( 0%) ggc
 scheduling 2          :   0.20 ( 1%) usr   0.00 ( 0%) sys   0.22 ( 1%) wall    4615 kB ( 2%) ggc
 machine dep reorg     :   0.03 ( 0%) usr   0.00 ( 0%) sys   0.03 ( 0%) wall      22 kB ( 0%) ggc
 reorder blocks        :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall     235 kB ( 0%) ggc
 final                 :   0.06 ( 0%) usr   0.00 ( 0%) sys   0.06 ( 0%) wall     333 kB ( 0%) ggc
 TOTAL                 :  33.82             0.38            34.64             194932 kB
Extra diagnostic checks enabled; compiler may run slowly.
Configure with --disable-checking to disable checks.
33.901u 0.407s 0:35.47 96.7%    0+0k 0+0io 28pf+0w

for induct.f90 and

[scala] gfortran/2006> time irun/bin/gfortran -ftime-report -O3 -ffast-math -funroll-loops air.f90

Execution times (seconds)
 garbage collection    :   0.20 ( 5%) usr   0.00 ( 0%) sys   0.21 ( 6%) wall       0 kB ( 0%) ggc
 callgraph construction:   0.03 ( 1%) usr   0.00 ( 0%) sys   0.02 ( 1%) wall     378 kB ( 1%) ggc
 ipa reference         :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall       2 kB ( 0%) ggc
 ipa type escape       :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall       0 kB ( 0%) ggc
 cfg cleanup           :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall     217 kB ( 0%) ggc
 CFG verifier          :   0.09 ( 2%) usr   0.00 ( 0%) sys   0.08 ( 2%) wall       0 kB ( 0%) ggc
 trivially dead code   :   0.04 ( 1%) usr   0.00 ( 0%) sys   0.03 ( 1%) wall       0 kB ( 0%) ggc
 life analysis         :   0.09 ( 2%) usr   0.00 ( 0%) sys   0.11 ( 3%) wall     524 kB ( 1%) ggc
 life info update      :   0.04 ( 1%) usr   0.00 ( 0%) sys   0.02 ( 1%) wall      52 kB ( 0%) ggc
 alias analysis        :   0.08 ( 2%) usr   0.00 ( 0%) sys   0.07 ( 2%) wall    1276 kB ( 3%) ggc
 register scan         :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.04 ( 1%) wall       0 kB ( 0%) ggc
 rebuild jump labels   :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.03 ( 1%) wall       0 kB ( 0%) ggc
 parser                :   0.06 ( 2%) usr   0.00 ( 0%) sys   0.05 ( 1%) wall    2893 kB ( 6%) ggc
 tree gimplify         :   0.04 ( 1%) usr   0.00 ( 0%) sys   0.05 ( 1%) wall    1547 kB ( 3%) ggc
 tree eh               :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall       0 kB ( 0%) ggc
 tree CFG construction :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 1%) wall    1563 kB ( 3%) ggc
 tree CFG cleanup      :   0.02 ( 1%) usr   0.00 ( 0%) sys   0.04 ( 1%) wall     112 kB ( 0%) ggc
 tree VRP              :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 1%) wall     730 kB ( 2%) ggc
 tree copy propagation :   0.02 ( 1%) usr   0.00 ( 0%) sys   0.03 ( 1%) wall     144 kB ( 0%) ggc
 tree store copy prop  :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall      12 kB ( 0%) ggc
 tree find ref. vars   :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall     263 kB ( 1%) ggc
 tree PTA              :   0.05 ( 1%) usr   0.00 ( 0%) sys   0.05 ( 1%) wall     115 kB ( 0%) ggc
 tree alias analysis   :   0.03 ( 1%) usr   0.02 (33%) sys   0.02 ( 1%) wall     533 kB ( 1%) ggc
 tree PHI insertion    :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall     504 kB ( 1%) ggc
 tree SSA rewrite      :   0.04 ( 1%) usr   0.00 ( 0%) sys   0.05 ( 1%) wall    1897 kB ( 4%) ggc
 tree SSA other        :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall       0 kB ( 0%) ggc
 tree SSA incremental  :   0.05 ( 1%) usr   0.00 ( 0%) sys   0.06 ( 2%) wall      79 kB ( 0%) ggc
 tree operand scan     :   0.11 ( 3%) usr   0.02 (33%) sys   0.10 ( 3%) wall    1246 kB ( 3%) ggc
 dominator optimization:   0.07 ( 2%) usr   0.00 ( 0%) sys   0.06 ( 2%) wall    1335 kB ( 3%) ggc
 tree STORE-CCP        :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall      26 kB ( 0%) ggc
 tree CCP              :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 1%) wall      42 kB ( 0%) ggc
 tree split crit edges :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall     421 kB ( 1%) ggc
 tree reassociation    :   0.02 ( 1%) usr   0.00 ( 0%) sys   0.02 ( 1%) wall      71 kB ( 0%) ggc
 tree PRE              :   0.05 ( 1%) usr   0.00 ( 0%) sys   0.06 ( 2%) wall    1119 kB ( 2%) ggc
 tree FRE              :   0.03 ( 1%) usr   0.00 ( 0%) sys   0.05 ( 1%) wall    1547 kB ( 3%) ggc
 tree forward propagate:   0.00 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall       7 kB ( 0%) ggc
 tree conservative DCE :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall       0 kB ( 0%) ggc
 tree loop bounds      :   0.03 ( 1%) usr   0.00 ( 0%) sys   0.03 ( 1%) wall     553 kB ( 1%) ggc
 loop invariant motion :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall      53 kB ( 0%) ggc
 tree iv optimization  :   0.08 ( 2%) usr   0.01 (17%) sys   0.09 ( 2%) wall    4992 kB (11%) ggc
 tree loop init        :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall       0 kB ( 0%) ggc
 tree copy headers     :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall     431 kB ( 1%) ggc
 tree SSA to normal    :   0.02 ( 1%) usr   0.00 ( 0%) sys   0.04 ( 1%) wall     751 kB ( 2%) ggc
 tree SSA verifier     :   0.50 (14%) usr   0.00 ( 0%) sys   0.57 (15%) wall       0 kB ( 0%) ggc
 tree STMT verifier    :   0.58 (16%) usr   0.00 ( 0%) sys   0.46 (12%) wall       0 kB ( 0%) ggc
 expand                :   0.11 ( 3%) usr   0.00 ( 0%) sys   0.10 ( 3%) wall    3735 kB ( 8%) ggc
 CSE                   :   0.07 ( 2%) usr   0.01 (17%) sys   0.07 ( 2%) wall    1705 kB ( 4%) ggc
 loop analysis         :   0.13 ( 4%) usr   0.00 ( 0%) sys   0.13 ( 3%) wall    3581 kB ( 8%) ggc
 CPROP 1               :   0.03 ( 1%) usr   0.00 ( 0%) sys   0.02 ( 1%) wall     273 kB ( 1%) ggc
 PRE                   :   0.02 ( 1%) usr   0.00 ( 0%) sys   0.02 ( 1%) wall     178 kB ( 0%) ggc
 CPROP 2               :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall     133 kB ( 0%) ggc
 bypass jumps          :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall     218 kB ( 0%) ggc
 web                   :   0.04 ( 1%) usr   0.00 ( 0%) sys   0.02 ( 1%) wall     244 kB ( 1%) ggc
 CSE 2                 :   0.08 ( 2%) usr   0.00 ( 0%) sys   0.08 ( 2%) wall    1678 kB ( 4%) ggc
 branch prediction     :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall      94 kB ( 0%) ggc
 combiner              :   0.08 ( 2%) usr   0.00 ( 0%) sys   0.11 ( 3%) wall    1495 kB ( 3%) ggc
 regmove               :   0.03 ( 1%) usr   0.00 ( 0%) sys   0.05 ( 1%) wall      13 kB ( 0%) ggc
 local alloc           :   0.06 ( 2%) usr   0.00 ( 0%) sys   0.05 ( 1%) wall     274 kB ( 1%) ggc
 global alloc          :   0.16 ( 4%) usr   0.00 ( 0%) sys   0.15 ( 4%) wall    1559 kB ( 3%) ggc
 reload CSE regs       :   0.06 ( 2%) usr   0.00 ( 0%) sys   0.05 ( 1%) wall     886 kB ( 2%) ggc
 load CSE after reload :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall       8 kB ( 0%) ggc
 flow 2                :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall     241 kB ( 1%) ggc
 if-conversion 2       :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall       0 kB ( 0%) ggc
 peephole 2            :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall      17 kB ( 0%) ggc
 rename registers      :   0.08 ( 2%) usr   0.00 ( 0%) sys   0.05 ( 1%) wall     229 kB ( 1%) ggc
 scheduling 2          :   0.10 ( 3%) usr   0.00 ( 0%) sys   0.10 ( 3%) wall    2012 kB ( 4%) ggc
 machine dep reorg     :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 1%) wall      51 kB ( 0%) ggc
 reorder blocks        :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall     114 kB ( 0%) ggc
 final                 :   0.04 ( 1%) usr   0.00 ( 0%) sys   0.04 ( 1%) wall     161 kB ( 0%) ggc
 TOTAL                 :   3.68             0.06             3.75              45840 kB
Extra diagnostic checks enabled; compiler may run slowly.
Configure with --disable-checking to disable checks.
3.727u 0.081s 0:03.99 95.2%     0+0k 0+0io 0pf+0w

for air.f90.

Not knowing what to look for, I'll need further directives about
what to do next.

Dominique


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]