Intel optimised Linpack scaling comparision

Clay Breshears (Intel)
Total Points:
15,225
Status Points:
15,225
Black Belt
January 3, 2006 5:34 PM PST
Rate
 
#1

Bala -

At only 1000 rows and columns, the workload is likely too small to sustain good speedup for 16 threads (just over 60 rows per thread).  System and threading overheads are likely taking a relatively larger fraction of time to the work being done, which reduces Gflops.

What kind of speeds do you get for larger values of n (e.g., 5000, 10000, 20000)?

--clay



Intel Software Network Forums Statistics

8473 users have contributed to 31605 threads and 100654 posts to date.
In the past 24 hours, we have 30 new thread(s) 110 new posts(s), and 160 new user(s).

In the past 3 days, the most popular thread for everyone has been gemm(A,A,A) like possible? The most posts were made to gemm(A,A,A) like possible? The post with the most views is Dear Steve, excuse me for a d

Please welcome our newest member Kevin Johnson