| Thread Tools | Search this thread |
|---|
babysam
| August 19, 2009 4:05 AM PDT Problems related to Intel Architecture Code Analyzer | ||||
Hello everyone! I have used the IACA for nearly a month and it is great! It help me to solve many problems that affects the performance of my code... (and it is simple to use as long as you have the source code in hand) However, I have noticed something strange... Even though the analyzer has correctly identified the code for rcpss/rsqrtss, they are treated as the same as the divss/sqrtss respectively.(As I know, both of the approximation instructions should run faster than the accurate ones. However, the analyzer shows they are blocking the divider port for 14 cycles) Is it a bug or something else? Thank you for your attention! | |||||
|
|||||||||||||
|
|||||||||||||
|
|||||||||||||
|
|||||||||||||
|
|||||||||||||
|
|||||||||||||
| 8478 users have contributed to 31609 threads and 100661 posts to date. |
|---|
| In the past 24 hours, we have 30 new thread(s) 108 new posts(s), and 167 new user(s). In the past 3 days, the most popular thread for everyone has been gemm(A,A,A) like possible? The most posts were made to gemm(A,A,A) like possible? The post with the most views is Dear Steve, excuse me for a d Please welcome our newest member zhpn |