I have benchmmark the simd instruction with 2 cpu: atom email@example.comGHz and i3 firstname.lastname@example.orgGHz
testing on linux 2.6.35 and linux 2.6.38 (fedora 14 and fedora 15) with gcc 4.5.1
auto-vectoriztion by gcc ( with -msse4.2 /-mssse3 and -mavx)
my result (unit: Mflops):
atom 330 i3-2310m
float 133.32 1141.61
float + sse 402.93 (302%) 1825.28 (160%)
float + avx 2291.78 ( 200%)
double 132.96 1151.64
double + sse 233.94(176%) 1208.82 (105%)
double + avx(*) 2407.44 ( 209%)
* Compiled by icc with -mavx, becouse the gcc got slow down (550 MFlops) by -mavx
as you see, the sse set on atom is much better than it on i3-2310m
I have check my compile flag much times...
the result on atom is very reasonable, but on i3 is much unreasonable....
I would know to know why atom is much better than i3-2310m on sse set....