This is the mail archive of the gcc-regression@gcc.gnu.org mailing list for the GCC project.



A recent patch increased GCC's memory consumption!


Hi,

I am a friendly script caring about memory consumption in GCC.  Please
contact jh@suse.cz if something is going wrong.

Comparing memory consumption on compilation of combine.i, insn-attrtab.i,
and generate-3.4.ii I got:


comparing empty function compilation at -O0 level:
    Overall memory needed: 8208k
    Peak memory use before GGC: 1291k
    Peak memory use after GGC: 1217k
    Maximum of released memory in single GGC run: 134k
    Garbage: 218k
    Leak: 1221k
    Overhead: 136k
    GGC runs: 4
    Pre-IPA-Garbage: 207k
    Pre-IPA-Leak: 1224k
    Pre-IPA-Overhead: 135k
    Post-IPA-Garbage: 207k
    Post-IPA-Leak: 1224k
    Post-IPA-Overhead: 135k

comparing empty function compilation at -O0 -g level:
    Overall memory needed: 8452k
    Peak memory use before GGC: 1319k
    Peak memory use after GGC: 1245k
    Maximum of released memory in single GGC run: 133k
    Garbage: 220k
    Leak: 1254k
    Overhead: 141k
    GGC runs: 4
    Pre-IPA-Garbage: 207k
    Pre-IPA-Leak: 1224k
    Pre-IPA-Overhead: 135k
    Post-IPA-Garbage: 207k
    Post-IPA-Leak: 1224k
    Post-IPA-Overhead: 135k

comparing empty function compilation at -O1 level:
    Overall memory needed: 8236k
    Peak memory use before GGC: 1291k
    Peak memory use after GGC: 1217k
    Maximum of released memory in single GGC run: 134k
    Garbage: 221k
    Leak: 1221k
    Overhead: 137k
    GGC runs: 4
    Pre-IPA-Garbage: 207k
    Pre-IPA-Leak: 1224k
    Pre-IPA-Overhead: 135k
    Post-IPA-Garbage: 207k
    Post-IPA-Leak: 1224k
    Post-IPA-Overhead: 135k

comparing empty function compilation at -O2 level:
    Overall memory needed: 8452k
    Peak memory use before GGC: 1291k
    Peak memory use after GGC: 1218k
    Maximum of released memory in single GGC run: 135k
    Garbage: 226k
    Leak: 1221k
    Overhead: 138k
    GGC runs: 4
    Pre-IPA-Garbage: 207k
    Pre-IPA-Leak: 1224k
    Pre-IPA-Overhead: 135k
    Post-IPA-Garbage: 207k
    Post-IPA-Leak: 1224k
    Post-IPA-Overhead: 135k

comparing empty function compilation at -O3 level:
    Overall memory needed: 8456k
    Peak memory use before GGC: 1291k
    Peak memory use after GGC: 1218k
    Maximum of released memory in single GGC run: 135k
    Garbage: 226k
    Leak: 1221k
    Overhead: 138k
    GGC runs: 4
    Pre-IPA-Garbage: 207k
    Pre-IPA-Leak: 1224k
    Pre-IPA-Overhead: 135k
    Post-IPA-Garbage: 207k
    Post-IPA-Leak: 1224k
    Post-IPA-Overhead: 135k

comparing combine.c compilation at -O0 level:
    Overall memory needed: 31988k
    Peak memory use before GGC: 18018k
    Peak memory use after GGC: 17801k
    Maximum of released memory in single GGC run: 1839k
    Garbage: 39406k
    Leak: 5800k
    Overhead: 5220k
    GGC runs: 337
    Pre-IPA-Garbage: 12408k
    Pre-IPA-Leak: 19349k
    Pre-IPA-Overhead: 2559k
    Post-IPA-Garbage: 12408k
    Post-IPA-Leak: 19349k
    Post-IPA-Overhead: 2559k

comparing combine.c compilation at -O0 -g level:
    Overall memory needed: 34008k
    Peak memory use before GGC: 19938k
    Peak memory use after GGC: 19657k
    Maximum of released memory in single GGC run: 1849k
    Garbage: 39703k
    Leak: 9082k
    Overhead: 6038k
    GGC runs: 321
    Pre-IPA-Garbage: 12507k
    Pre-IPA-Leak: 21623k
    Pre-IPA-Overhead: 3049k
    Post-IPA-Garbage: 12507k
    Post-IPA-Leak: 21623k
    Post-IPA-Overhead: 3049k

comparing combine.c compilation at -O1 level:
    Overall memory needed: 30752k
    Peak memory use before GGC: 15682k
    Peak memory use after GGC: 15507k
    Maximum of released memory in single GGC run: 1340k
    Garbage: 46801k
    Leak: 5780k
    Overhead: 6014k
    GGC runs: 402
    Pre-IPA-Garbage: 13147k
    Pre-IPA-Leak: 16845k
    Pre-IPA-Overhead: 2472k
    Post-IPA-Garbage: 13147k
    Post-IPA-Leak: 16845k
    Post-IPA-Overhead: 2472k

comparing combine.c compilation at -O2 level:
    Overall memory needed: 31340k -> 31192k
    Peak memory use before GGC: 15823k
    Peak memory use after GGC: 15661k
    Maximum of released memory in single GGC run: 1355k
    Garbage: 60528k
    Leak: 5809k
    Overhead: 8023k
    GGC runs: 469
    Pre-IPA-Garbage: 13310k
    Pre-IPA-Leak: 16927k
    Pre-IPA-Overhead: 2491k
    Post-IPA-Garbage: 13310k
    Post-IPA-Leak: 16927k
    Post-IPA-Overhead: 2491k

comparing combine.c compilation at -O3 level:
    Overall memory needed: 31968k -> 31808k
    Peak memory use before GGC: 15911k
    Peak memory use after GGC: 15752k
    Maximum of released memory in single GGC run: 1629k
    Garbage: 72955k
    Leak: 7136k
    Overhead: 9459k
    GGC runs: 496
    Pre-IPA-Garbage: 13310k
    Pre-IPA-Leak: 16927k
    Pre-IPA-Overhead: 2491k
    Post-IPA-Garbage: 13310k
    Post-IPA-Leak: 16927k
    Post-IPA-Overhead: 2491k

comparing insn-attrtab.c compilation at -O0 level:
    Overall memory needed: 155236k
    Peak memory use before GGC: 65230k
    Peak memory use after GGC: 53275k
    Maximum of released memory in single GGC run: 27424k
    Garbage: 130437k
    Leak: 8497k
    Overhead: 15723k
    GGC runs: 263
    Pre-IPA-Garbage: 38215k
    Pre-IPA-Leak: 55487k
    Pre-IPA-Overhead: 8223k
    Post-IPA-Garbage: 38215k
    Post-IPA-Leak: 55487k
    Post-IPA-Overhead: 8223k

comparing insn-attrtab.c compilation at -O0 -g level:
    Overall memory needed: 156508k
    Peak memory use before GGC: 66504k
    Peak memory use after GGC: 54546k
    Maximum of released memory in single GGC run: 27425k
    Garbage: 130915k
    Leak: 10147k
    Overhead: 16179k
    GGC runs: 255
    Pre-IPA-Garbage: 38272k
    Pre-IPA-Leak: 57029k
    Pre-IPA-Overhead: 8558k
    Post-IPA-Garbage: 38272k
    Post-IPA-Leak: 57029k
    Post-IPA-Overhead: 8558k

comparing insn-attrtab.c compilation at -O1 level:
    Overall memory needed: 133328k
    Peak memory use before GGC: 50200k
    Peak memory use after GGC: 43294k
    Maximum of released memory in single GGC run: 22951k
    Garbage: 180978k
    Leak: 7873k
    Overhead: 24528k
    GGC runs: 301
    Pre-IPA-Garbage: 43193k
    Pre-IPA-Leak: 43086k
    Pre-IPA-Overhead: 7642k
    Post-IPA-Garbage: 43193k
    Post-IPA-Leak: 43086k
    Post-IPA-Overhead: 7642k

comparing insn-attrtab.c compilation at -O2 level:
    Overall memory needed: 148824k
    Peak memory use before GGC: 50204k
    Peak memory use after GGC: 45013k
    Maximum of released memory in single GGC run: 17966k
    Garbage: 204660k
    Leak: 15536k
    Overhead: 30014k
    GGC runs: 326
    Pre-IPA-Garbage: 43265k
    Pre-IPA-Leak: 43092k
    Pre-IPA-Overhead: 7651k
    Post-IPA-Garbage: 43265k
    Post-IPA-Leak: 43092k
    Post-IPA-Overhead: 7651k

comparing insn-attrtab.c compilation at -O3 level:
    Overall memory needed: 164772k
    Peak memory use before GGC: 61826k
    Peak memory use after GGC: 58725k
    Maximum of released memory in single GGC run: 23617k
    Garbage: 242519k
    Leak: 7899k
    Overhead: 33464k
    GGC runs: 339
    Pre-IPA-Garbage: 43265k
    Pre-IPA-Leak: 43092k
    Pre-IPA-Overhead: 7651k
    Post-IPA-Garbage: 43265k
    Post-IPA-Leak: 43092k
    Post-IPA-Overhead: 7651k

comparing Gerald's testcase PR8361 compilation at -O0 level:
    Overall memory needed: 151215k -> 151227k
    Peak memory use before GGC: 82970k
    Peak memory use after GGC: 82148k
    Maximum of released memory in single GGC run: 14702k
    Garbage: 205239k
    Leak: 52062k
    Overhead: 26925k
    GGC runs: 415
    Pre-IPA-Garbage: 111129k
    Pre-IPA-Leak: 88390k
    Pre-IPA-Overhead: 14820k
    Post-IPA-Garbage: 111129k
    Post-IPA-Leak: 88390k
    Post-IPA-Overhead: 14820k

comparing Gerald's testcase PR8361 compilation at -O0 -g level:
    Overall memory needed: 169155k
    Peak memory use before GGC: 96595k
    Peak memory use after GGC: 95639k
    Maximum of released memory in single GGC run: 15130k
    Garbage: 210902k
    Leak: 78624k
    Overhead: 33589k
    GGC runs: 387
    Pre-IPA-Garbage: 111751k
    Pre-IPA-Leak: 104905k
    Pre-IPA-Overhead: 18323k
    Post-IPA-Garbage: 111751k
    Post-IPA-Leak: 104905k
    Post-IPA-Overhead: 18323k

comparing Gerald's testcase PR8361 compilation at -O1 level:
    Overall memory needed: 111223k -> 111219k
    Peak memory use before GGC: 84225k
    Peak memory use after GGC: 83386k
    Maximum of released memory in single GGC run: 14982k
    Garbage: 282306k -> 282306k
    Leak: 49364k
    Overhead: 31646k -> 31646k
    GGC runs: 503
    Pre-IPA-Garbage: 159802k
    Pre-IPA-Leak: 88086k
    Pre-IPA-Overhead: 19886k
    Post-IPA-Garbage: 159802k
    Post-IPA-Leak: 88086k
    Post-IPA-Overhead: 19886k

comparing Gerald's testcase PR8361 compilation at -O2 level:
    Overall memory needed: 112647k -> 112631k
    Peak memory use before GGC: 85989k
    Peak memory use after GGC: 85136k
    Maximum of released memory in single GGC run: 14965k
    Garbage: 337301k
    Leak: 49420k
    Overhead: 38215k
    GGC runs: 570
    Pre-IPA-Garbage: 163792k
    Pre-IPA-Leak: 88452k
    Pre-IPA-Overhead: 20364k
    Post-IPA-Garbage: 163792k
    Post-IPA-Leak: 88452k
    Post-IPA-Overhead: 20364k

comparing Gerald's testcase PR8361 compilation at -O3 level:
    Overall memory needed: 113715k
    Peak memory use before GGC: 86611k
    Peak memory use after GGC: 85750k
    Maximum of released memory in single GGC run: 14965k
    Garbage: 368780k
    Leak: 49422k
    Overhead: 41456k
    GGC runs: 601
    Pre-IPA-Garbage: 163872k
    Pre-IPA-Leak: 89110k
    Pre-IPA-Overhead: 20416k
    Post-IPA-Garbage: 163872k
    Post-IPA-Leak: 89110k
    Post-IPA-Overhead: 20416k

comparing PR rtl-optimization/28071 testcase compilation at -O0 level:
    Overall memory needed: 361915k -> 361911k
    Peak memory use before GGC: 78518k
    Peak memory use after GGC: 49453k
    Maximum of released memory in single GGC run: 38186k
    Garbage: 144651k
    Leak: 7110k
    Overhead: 24889k
    GGC runs: 87
    Pre-IPA-Garbage: 12561k
    Pre-IPA-Leak: 20190k
    Pre-IPA-Overhead: 2241k
    Post-IPA-Garbage: 12561k
    Post-IPA-Leak: 20190k
    Post-IPA-Overhead: 2241k

comparing PR rtl-optimization/28071 testcase compilation at -O0 -g level:
    Overall memory needed: 362703k
    Peak memory use before GGC: 79215k
    Peak memory use after GGC: 50149k
    Maximum of released memory in single GGC run: 38170k
    Garbage: 144752k
    Leak: 9152k
    Overhead: 25473k
    GGC runs: 93
    Pre-IPA-Garbage: 12569k
    Pre-IPA-Leak: 20439k
    Pre-IPA-Overhead: 2295k
    Post-IPA-Garbage: 12569k
    Post-IPA-Leak: 20439k
    Post-IPA-Overhead: 2295k

comparing PR rtl-optimization/28071 testcase compilation at -O1 level:
  Amount of produced GGC garbage increased from 221628k to 222492k, overall 0.39%
    Overall memory needed: 242332k -> 242108k
    Peak memory use before GGC: 73624k
    Peak memory use after GGC: 66143k
    Maximum of released memory in single GGC run: 34735k
    Garbage: 221628k -> 222492k
    Leak: 7551k -> 7551k
    Overhead: 30480k -> 30652k
    GGC runs: 97
    Pre-IPA-Garbage: 48348k
    Pre-IPA-Leak: 63005k
    Pre-IPA-Overhead: 8797k
    Post-IPA-Garbage: 48348k
    Post-IPA-Leak: 63005k
    Post-IPA-Overhead: 8797k

comparing PR rtl-optimization/28071 testcase compilation at -O2 level:
    Overall memory needed: 369256k -> 369068k
    Peak memory use before GGC: 73624k
    Peak memory use after GGC: 66143k
    Maximum of released memory in single GGC run: 36061k
    Garbage: 251372k -> 251005k
    Leak: 7553k
    Overhead: 36845k -> 36772k
    GGC runs: 106
    Pre-IPA-Garbage: 107058k
    Pre-IPA-Leak: 75901k
    Pre-IPA-Overhead: 14919k
    Post-IPA-Garbage: 107058k
    Post-IPA-Leak: 75901k
    Post-IPA-Overhead: 14919k

comparing PR rtl-optimization/28071 testcase compilation at -O3 -fno-tree-pre -fno-tree-fre level:
    Overall memory needed: 1026608k
    Peak memory use before GGC: 141898k
    Peak memory use after GGC: 129175k
    Maximum of released memory in single GGC run: 62763k
    Garbage: 365075k -> 364901k
    Leak: 9099k
    Overhead: 44856k -> 44821k
    GGC runs: 103
    Pre-IPA-Garbage: 107058k
    Pre-IPA-Leak: 75901k
    Pre-IPA-Overhead: 14919k
    Post-IPA-Garbage: 107058k
    Post-IPA-Leak: 75901k
    Post-IPA-Overhead: 14919k
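
The "old -> new" lines above can be turned into relative changes, which is how a figure like the 0.39% garbage increase for the PR 28071 testcase at -O1 is obtained. The following is only a minimal sketch, not the testing script itself; the function name `relative_change` and the regular expression are assumptions made for illustration.

```python
import re

# Matches report lines such as "    Garbage: 221628k -> 222492k".
# Hypothetical pattern, written for the line format shown in this report.
CHANGE = re.compile(
    r"^\s*(?P<name>[^:]+):\s*(?P<old>\d+)k\s*->\s*(?P<new>\d+)k\s*$"
)


def relative_change(line):
    """Return (metric name, percent change), or None if the line has no change."""
    m = CHANGE.match(line)
    if m is None:
        return None
    old, new = int(m["old"]), int(m["new"])
    if old == 0:
        return None  # percent change is undefined for a zero baseline
    return m["name"].strip(), 100.0 * (new - old) / old


name, pct = relative_change("    Garbage: 221628k -> 222492k")
print(f"{name}: {pct:+.2f}%")
```

Lines that carry a single value (no "->") return None, so the same function can be run over the whole report to pick out only the metrics that changed.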

Head of the ChangeLog is:

--- /usr/src/SpecTests/sandbox-britten-memory/x86_64/mem-result/ChangeLog	2008-11-10 20:51:37.000000000 +0000
+++ /usr/src/SpecTests/sandbox-britten-memory/gcc/gcc/ChangeLog	2008-11-11 14:22:47.000000000 +0000
@@ -1,3 +1,64 @@
+2008-11-10  Catherine Moore  <clm@codesourcery.com>
+
+	* config.gcc (mips64vrel-*-elf*): Include the tm_file
+	prior to vr.h.
+	* config/mips/linux.h (LINUX_DRIVER_SELF_SPECS): New.
+	(BASE_DRIVER_SELF_SPECS): Remove.
+	(DRIVER_SELF_SPECS): New definition.
+	* config/mips/elfoabi.h: (DRIVER_SELF_SPECS): Include
+	BASE_DRIVER_SELF_SPECS.
+	* config/mips/sde.h: Likewise.
+	* config/mips/iris6.h: Likewise.
+	* config/mips/vr.h: Likewise.
+	* config/mips/mips.h (BASE_DRIVER_SELF_SPECS): New. 
+
+2008-11-10  Vladimir Makarov  <vmakarov@redhat.com>
+	    
+	PR rtl-optimizations/37948
+	* ira-int.h (struct ira_allocno_copy): New member constraint_p.
+	(ira_create_copy, ira_add_allocno_copy): New parameter.
+
+	* ira-conflicts.c (process_regs_for_copy): New parameter.  Pass it
+	to ira_add_allocno_copy.
+	(process_reg_shuffles, add_insn_allocno_copies): Pass a new
+	parameter to process_regs_for_copy.
+	(propagate_copies): Pass a new parameter to ira_add_allocno_copy.
+	Fix typo in passing second allocno to ira_add_allocno_copy.
+
+	* ira-color.c (update_conflict_hard_regno_costs): Use head of
+	coalesced allocnos list.
+	(assign_hard_reg): Ditto.  Check that assigned allocnos are not in
+	the graph.
+	(add_ira_allocno_to_bucket): Rename to add_allocno_to_bucket.
+	(add_ira_allocno_to_ordered_bucket): Rename to
+	add_allocno_to_ordered_bucket.
+	(push_ira_allocno_to_stack): Rename to push_allocno_to_stack.  Use
+	head of coalesced allocnos list.
+	(push_allocnos_to_stack): Remove calculation of ALLOCNO_TEMP.
+	Check that it is aready calculated.
+	(push_ira_allocno_to_spill): Rename to push_ira_allocno_to_spill.
+	(setup_allocno_left_conflicts_num): Use head of coalesced allocnos
+	list.
+	(coalesce_allocnos): Do extended coalescing too.
+
+	* ira-emit.c (add_range_and_copies_from_move_list): Pass a new
+	parameter to ira_add_allocno_copy.
+
+	* ira-build.c (ira_create_copy, ira_add_allocno_copy): Add a new
+	parameter.
+	(print_copy): Print copy origination too.
+
+	* ira-costs.c (scan_one_insn): Use alloc_pref for load from
+	equivalent memory.
+	
+2008-11-10  Kaz Kojima  <kkojima@gcc.gnu.org>
+
+	PR rtl-optimization/37514
+	* config/sh/sh.h (OPTIMIZATION_OPTIONS): Set
+	flag_ira_share_spill_slots to 2 if it's already non-zero.
+	(OVERRIDE_OPTIONS): Clear flag_ira_share_spill_slots if
+	flag_ira_share_spill_slots is 2.
+
 2008-11-10  Kevin Buettner  <kevinb@redhat.com>
 
 	* config/m32c/prologue.md (prologue_enter_16): Set FB to SP - 2.


The results can be reproduced by building a compiler with

--enable-gather-detailed-mem-stats targeting x86-64

and compiling preprocessed combine.c or the testcase from PR8361 with:

-fmem-report --param=ggc-min-heapsize=1024 --param=ggc-min-expand=1 -Ox -Q

The memory consumption summary appears in the dump after the detailed
listing of the places where memory is allocated.  Peak memory consumption
is computed by looking for the maximal value in the {GC XXXX -> YYYY} reports.
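
The peak computation described above could be sketched roughly as follows. This is an illustration under stated assumptions, not the script's actual code: it assumes each collection is reported as "{GC XXXXk -> YYYYk}", with XXXX the size before the collection and YYYY the size after it, and the name `peak_memory` is hypothetical.

```python
import re

# Assumed report format: "{GC 1291k -> 1217k}" (size before -> size after).
GC_REPORT = re.compile(r"\{GC (\d+)k -> (\d+)k\}")


def peak_memory(dump_text):
    """Return (peak before GGC, peak after GGC) in kilobytes, or None."""
    pairs = [(int(before), int(after))
             for before, after in GC_REPORT.findall(dump_text)]
    if not pairs:
        return None  # no collection was reported in this dump
    peak_before = max(before for before, _ in pairs)
    peak_after = max(after for _, after in pairs)
    return peak_before, peak_after


sample = "main {GC 1100k -> 1000k} foo {GC 1291k -> 1217k}"
print(peak_memory(sample))
```

The two maxima are taken independently, so they need not come from the same collection.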

Your testing script.

