Processor | CPU/Bus Speed |
MFLOPS
on small / large vector loops daxpy: y(i) = y(i) + a*x(i) for vector lengths n=100 or 107
small vectors fit in cache and measure CPU speed large vector problems measure memory bandwidth speed benchmark source files |
STREAM Memory Bandwidth triad loop: y(i) = w(i) + a*x(i) for large vectors > 2M words http://www.cs.virginia.edu/stream/ my version of stream benchmark |
||
Intel ifc Fortran95 compiler |
Lahey lf95 Fortran95 compiler |
||||
Pentium4/Xeon petrel002 |
1.7 GHz / 400 MHz |
1523 / 176 Mflops (for small / large vectors) |
671 / 173 Mflops |
1600 MB/s |
200 MW/s (64 bit words) |
AMD Athlon MP1900+ petrel025 |
1.6 GHz / 266 MHz |
1032 / 70 Mflops |
1011 / 85 Mflops |
800 MB/s |
100 MW/s |
Cray C-90 (Circa 1991) |
9500 MB/s/proc |
1187 MW/s/proc |
|||
NEC SX-6 |
4000 MW/s/proc |