I just tried running a piece of code I'm writing compiled with gfortran
and with ifort, and the difference is worrying.
ifort: 23 s
gfortran: 165 s
gfortran -O3 -ffast-math -funroll-loops -ftree-vectorize
-march=athlon-xp -msse -mfpmath=sse,387: 164 s
The code has nothing really special, it just converts some coordinates
and quaternions into transformed coordinates, and the main part I guess
is generating the rotation matrices and applying them (with MATMUL).
There is also some (maybe too much) disk I/O)