This is the mail archive of the
gcc@gcc.gnu.org
mailing list for the GCC project.
Re: g77 performance on ALPHA
- To: Kevin Maguire <K dot Maguire at dl dot ac dot uk>
- Subject: Re: g77 performance on ALPHA
- From: Martin Kahlert <martin dot kahlert at provi dot de>
- Date: Tue, 31 Aug 1999 19:06:56 +0200
- Cc: egcs at egcs dot cygnus dot com
- References: <199908281130.NAA00385@keksy.linux.provi.de> <Pine.LNX.4.10.9908310007400.16919-100000@tca1.dl.ac.uk>
- Reply-To: martin dot kahlert at provi dot de
Quoting Kevin Maguire (K.Maguire@dl.ac.uk):
> Hi Martin
>
> Excuse my ignorance but ....
> ||(The handcoded asm-daxpy of Mr. Goto gets 378.55 MFlops)
>
> Where can I get the hnandcoded BLAS? Mr. Goto?? Is this the Compaq
> CXML? Is it better?
It is better than CXML. Kazushige Goto from Japan spent some time and effort
and coded some blas routines in asm. E.g. the ?axpy ?gemm, ?dot routines.
The dgemm routine for example get about 950MFlops on an 600MHz
21164 (ev5!). I don't know, how good these routines are for 21264 cpus,
but they should be even faster there. The only problem is, i can't reach the
URL any more. It used to be http://www.neuro.uni-oldenburg.de/~joe/math
but the server refuses any connects, now.
>
> On my 500MHz Alpha/Linux (ev6) box I get:
>
> % g77 -O3 blas1.f
> % repeat 5 ./a.out
> 288.39993 MFlops
> 288.450704 MFlops
> 288.045032 MFlops
> 288.045032 MFlops
> 287.640449 MFlops
> % fort -O -fast -tune ev6 -arch ev6 -O3 blas1.f
> % repeat 5 ./a.out
> 393.846153846154 MFlops
> 393.846153846154 MFlops
> 393.846153846154 MFlops
> 393.846153846154 MFlops
> 393.846153846154 MFlops
> % fort -O -fast -tune ev6 -arch ev6 -O3 blas1-cxml.f -lcxml
> % repeat 5 ./a.out
> 461.261261261261 MFlops
> 460.095352622334 MFlops
> 460.224655977789 MFlops
> 460.224655977789 MFlops
> 462.302419375394 MFlops
> % g77 -O3 blas1-cxml.f -lcxml
> % repeat 5 ./a.out
> 461.261261 MFlops
> 460.224782 MFlops
> 458.165486 MFlops
> 461.261261 MFlops
> 461.261261 MFlops
Impressive, i think this comes from the out of order
capabilities of the 21164.
Bye,
Martin.
--
Your mouse has moved, Windows must be restarted for changes
to take affect - restart now?