This is the mail archive of the gcc@gcc.gnu.org mailing list for the GCC project.

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]
Other format:	[Raw text]

Re: If you had a month to improve gcc build parallelization, where would you begin?

From: Segher Boessenkool <segher at kernel dot crashing dot org>
To: Geert Bosch <bosch at adacore dot com>
Cc: Joern Rennecke <joern dot rennecke at embecosm dot com>, Jeff Law <law at redhat dot com>, David Fang <fang at csl dot cornell dot edu>, Simon Baldwin <simonb at google dot com>, gcc at gcc dot gnu dot org
Date: Wed, 10 Apr 2013 04:19:29 +0200
Subject: Re: If you had a month to improve gcc build parallelization, where would you begin?
References: <CAPTY64o0UBQBwnq_GMNOBRmdBV4QTc+En3Q7pLn6iR1aKXKQTA at mail dot gmail dot com> <43ABCE7D-03A4-4534-9A3C-79360A7AEC75 at adacore dot com> <Pine dot LNX dot 4 dot 64 dot 1304031751290 dot 19270 at hal-00 dot csl dot cornell dot edu> <20130403195359 dot dxngsssuo8cgsww4-nzlynne at webmail dot spamcop dot net> <515CDC53 dot 5000009 at redhat dot com> <20130403234402 dot or5qa8khmsgs8k40-nzlynne at webmail dot spamcop dot net> <AF3D89AD-6378-4BDD-913E-C1343804E25F at adacore dot com>

How does that work?
The binaries have to get the all the machines of the clusterssomewhere.Does this assume you are using NFS or similar for your builddirectory?Won't the overhead of using that instead of local disk kill mostof the
parallelization benefit of a cluster over a single SMP machine?
This will be true regardless of communication method. There is solittle
opportunity for parallelism that anything more than 4-8 local cores is
pretty much wasted. On a 4-core machine, more than 50% of the walltimeis spent on things that will not use more than those 4 coresregardless.
If the other 40-50% or so can be cut by a factor 4 compared to 4-core
execution, we still are talking about at most a 30% improvement on the
total wall time. Even a small serial overhead for communicatingsources
and binaries will still reduce this 30%.

We need to improve the Makefiles before it makes sense to use more
parallelism.  Otherwise we'll just keep running into Amdahl's law.


Some numbers, 16-core 64-thread POWER7, c,c++,fortran bootstrap:

-j6:
real    57m32.245s
user    205m51.480s
sys     6m24.043s

-j10:
real    45m55.034s
user    211m37.833s
sys     6m33.305s

-j15:
real    41m51.061s
user    237m26.174s
sys     7m2.341s

-j60:
real    38m18.583s
user    336m12.393s
sys     11m26.717s


Segher

Follow-Ups:
- Re: If you had a month to improve gcc build parallelization, where would you begin?
  - From: Geert Bosch

References:
- If you had a month to improve gcc build parallelization, where would you begin?
  - From: Simon Baldwin
- Re: If you had a month to improve gcc build parallelization, where would you begin?
  - From: Geert Bosch
- Re: If you had a month to improve gcc build parallelization, where would you begin?
  - From: David Fang
- Re: If you had a month to improve gcc build parallelization, where would you begin?
  - From: Joern Rennecke
- Re: If you had a month to improve gcc build parallelization, where would you begin?
  - From: Jeff Law
- Re: If you had a month to improve gcc build parallelization, where would you begin?
  - From: Joern Rennecke
- Re: If you had a month to improve gcc build parallelization, where would you begin?
  - From: Geert Bosch

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]