processor peak vs. effective performance on parallel platforms question

processor peak vs. effective performance on parallel platforms question

Hi,

Knowing the peak floating point operations rate of a processor, is there any rule of thumb for predicting the 'effective' FLOPs? In another words, how many percent of the peak FLOPs, in general, is the effective FLOPs?

Thanks for any insight.

3 帖子 / 0 全新
最新文章
如需更全面地了解编译器优化,请参阅优化注意事项
James Tullos (Intel)的头像

Hi notahoo,

Actual performance will be heavily dependent on the application. I would recommend reading the article at http://software.intel.com/en-us/articles/estimating-flops-using-event-based-sampling-ebs/. This should give you some ideas of how you can estimate your application's performance.

Sincerely,
James Tullos
Technical Consulting Engineer
Intel Cluster Tools

In the Top500 supercomputer ratings, an "efficiency" is quoted as the ratio of actual performance to the peak flops rating. This rating can be achieved only where it is possible to use the vendor's optimized BLAS (MKL dgemm, in the case of Intel clusters) and adjust the problem size to maximize efficiency, and the memory bandwidth is not a significant limitation, as it would be for many real applications.

登陆并发表评论。