Intel® VTune™ Amplifier XE

Congratulations to user sachini1@gmail.com!!

Congratulations to user sachini1@gmail.com, who is the fourth winner of a copy of the newly released "VTune Performance Analyzer Essentials," by James Reinders.

Learn how you can win, too!

Thanks,

David Anderson and Jeff Gallagher

Intel VTune analyzer Discussion Forum Hosts

Message Edited by jdgallag on 10-05-2005 02:10 PM

CONGRATULATIONS to user manojs2k!!

Congratulations to user manojs2k, who is the fourth winner of a copy of the newly released "VTune Performance Analyzer Essentials," by James Reinders.

Learn how you can win, too!

Thanks,

David Anderson and Jeff Gallagher

Intel VTune analyzer Discussion Forum Hosts

Message Edited by jdgallag on 10-05-2005 02:09 PM

Message Edited by jdgallag on 10-05-2005 02:23 PM

Message Edited by jdgallag on 10-05-2005 02:24 PM

PageWalkDTLBAllMisses Performance impact

VTune has reported a very high value for the PageWalkDTLBAllMisses performance impact when running an application. What does this impact tell me about the memory access pattern of this application? I read somewhere that a high value does not necessarily mean a high L1 cache miss rate. What is the next step to investigate?

Thanks.

store forwarding impact

I ran VTune to collect the store forwarding performance impact for the application. The source view showed the following source and assembly code with a high value for the performance impact:

Address   Line  Source                                       MOB Loads Replay Retired
0x39219   100   if ( ((long)*s & 3L) == 0) {                 1183
0x39219   100   CollectGarb+158: mov esi, DWORD PTR [ecx]    152
0x39219   100   mov edx, esi                                 736
0x39219   100   and edx, 0x3h                                122
0x39219   100   jnz CollectGarb+3a7                          173
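
For readers unfamiliar with the event, here is a minimal sketch of one access pattern that is commonly associated with store-forwarding problems: a wide load that reads bytes written by several earlier, narrower stores. It is not taken from the application above. The function names and the buffer are hypothetical, and whether this pattern is what actually happens at line 100 is only an assumption.

```c
#include <stdint.h>
#include <string.h>

/* Hypothetical illustration: four 1-byte stores followed by one 4-byte load
   of the same location. On many x86 cores the load cannot be satisfied from
   a single store-buffer entry, so it may have to wait for the stores to
   reach the cache. */
uint32_t narrow_stores_then_wide_load(unsigned char *slot, unsigned char v)
{
    uint32_t word;

    slot[0] = v;
    slot[1] = v;
    slot[2] = v;
    slot[3] = v;
    memcpy(&word, slot, sizeof word);   /* wide load spanning four stores */
    return word & 3u;                   /* same low-bits test as the source line */
}

/* Same result, but the data is written with one store of the same width as
   the later load, which is the shape that forwarding usually handles well. */
uint32_t wide_store_then_wide_load(unsigned char *slot, unsigned char v)
{
    uint32_t word = v * 0x01010101u;    /* replicate v into all four bytes */
    uint32_t readback;

    memcpy(slot, &word, sizeof word);   /* one 4-byte store */
    memcpy(&readback, slot, sizeof readback);
    return readback & 3u;
}
```

An optimizing compiler may merge or remove these stores, so the generated assembly should be checked before drawing conclusions from a measurement.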

Half cache size

Hi,
I read somewhere on your site that when the distance between siblings in a structure is less than half the cache size, the chances of a cache hit are higher. Can you please send me the link? I saw it in one of the Flash slides. Any reference to a paper would also be appreciated.

thanks

I can't use rmtsvr. I think it may be the result of a different compiler version

We use the VTune analyzer on our target (PXA255 + Linux 2.4.19). In the past, we used arm-linux-gcc 2.95.3 to build our kernel and modules. The rmtsvr and vtlxsc you gave me were also built with 2.95.3, and they worked well. But recently we changed our arm-linux-gcc to 3.3.2, and I rebuilt vtune_drv.o. When I run rmtsvr on my target, it shows:
#/root>./rmtsvr
Copyright (C) 2001-2003 Intel Corporation. All rights reserved.
VTune Performance Analyzer Update for Intel XScale Techonlogy, PXA Linux*
Version: 7.1.6.215
Server is Starting

measuring remote misses

hi,
I am running VTune for Linux on a system that has two hyperthreaded Pentium Xeon processors. This means that there are four virtual processors. I want to run a parallel application and measure the coherence traffic. Specifically, I want to measure the following:

a) The percentage of L2 read misses that are satisfied by reading data from another processor's cache, rather than reading it from main memory. Basically the number of remote misses and the number of main memory misses.
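
To make the quantity in (a) concrete, here is a small sketch of the arithmetic, assuming the two counts can be obtained somehow. The struct and its field names are hypothetical placeholders, not VTune event names.

```c
/* Hypothetical counts for one run; the field names are placeholders and do
   not correspond to real performance-counter events. */
typedef struct {
    unsigned long long remote_cache_reads; /* L2 read misses served by another processor's cache */
    unsigned long long memory_reads;       /* L2 read misses served by main memory */
} l2_miss_counts;

/* Fraction of L2 read misses satisfied by a remote cache (0.0 to 1.0).
   Returns 0.0 when no misses were recorded, to avoid dividing by zero. */
double remote_miss_fraction(l2_miss_counts c)
{
    unsigned long long total = c.remote_cache_reads + c.memory_reads;

    if (total == 0) {
        return 0.0;
    }
    return (double)c.remote_cache_reads / (double)total;
}
```

Multiplying the result by 100 gives the percentage. For example, 25 remote reads and 75 memory reads give a fraction of 0.25.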

Any help is appreciated.
thanks
smruti
