This is the mail archive of the gcc@gcc.gnu.org mailing list for the GCC project.

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]

Re: The new scheduler and x86 CPUs

To: Vladimir Makarov <vmakarov at redhat dot com>
Subject: Re: The new scheduler and x86 CPUs
From: Bernd Schmidt <bernds at redhat dot com>
Date: Tue, 28 Aug 2001 23:39:12 +0100 (BST)
cc: <jh at suse dot cz>, <gcc at gcc dot gnu dot org>

On Tue, 28 Aug 2001, Vladimir Makarov wrote:
> > Why is OOO a dead-end approach?  If there's parallelism to extract
> > in a program, then if a compiler can find it, an OOO core can find it
> > just as well.
>
>   I see
> (http://www.intel.com/eBusiness/products/ia64/overview/bm012101.htm#5)
> that Itanium has the same SPECint as UltrasparcIII practically on the
> same frequency.

Is Ultrasparc III out-of-order?  I thought it was in-order too.  There's
nothing in
  http://www.sun.com/sparc/UltraSPARC-III/USIIITech.html
that suggests it's OOO.  So it isn't all that surprising to me that it's
slow as well.

>   OOO has constraints on speculation (branches to look through, look
> ahead buffer).  Of course you could increase this characteristics, but
> in this case the penalty of mispredictions is bigger.  The control logic
> to implement OOO is very big (even now as I remember it achieves 1/3-1/2
> of all control logic).  Therefore maximal issue rate for them is 6-8.

How much parallelism are you likely to extract out of a typical integer
program, anyway?  IMO, 6-8 is quite near or possibly over the limit for
typical programs.

> There is no other way to achieve potential (fine grain)
> parallelism which even for SPECint achieve 30-40 insns per cycle.

I don't believe this.  Can you back up this number?  Even if it were
realistic, is there any known strategy for a compiler to get anywhere
close?  And, is it feasible that actual hardware could be built to make
use of it?

> Also
> this approach is simpler with hardware point of view (the logic were
> used for OOO control could be used for more useful functional units as
> ALUs), the typical example of this for embedded systems (with low power
> consumption) could be Transmeta processors.

Current ia64 systems don't seem to be very low power designs.

Bernd

References:
- Re: The new scheduler and x86 CPUs
  - From: Vladimir Makarov

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]