processor peak vs. effective performance on parallel platforms question

processor peak vs. effective performance on parallel platforms question

Hi,

Knowing the peak floating point operations rate of a processor, is there any rule of thumb for predicting the 'effective' FLOPs? In another words, how many percent of the peak FLOPs, in general, is the effective FLOPs?

Thanks for any insight.

3 posts / 0 new
Last post
For more complete information about compiler optimizations, see our Optimization Notice.

Hi notahoo,

Actual performance will be heavily dependent on the application. I would recommend reading the article at http://software.intel.com/en-us/articles/estimating-flops-using-event-based-sampling-ebs/. This should give you some ideas of how you can estimate your application's performance.

Sincerely,
James Tullos
Technical Consulting Engineer
Intel Cluster Tools

In the Top500 supercomputer ratings, an "efficiency" is quoted as the ratio of actual performance to the peak flops rating. This rating can be achieved only where it is possible to use the vendor's optimized BLAS (MKL dgemm, in the case of Intel clusters) and adjust the problem size to maximize efficiency, and the memory bandwidth is not a significant limitation, as it would be for many real applications.

Leave a Comment

Please sign in to add a comment. Not a member? Join today