pushq cost too many cpu times

pushq cost too many cpu times

hi all

when we use vtune to collect light-weight hotspots, we found this

Address Line Assembly CPU Time Instructions Retired

0xdd6d70 499 pushq %rbp 0.573s 1,184,000,000

how can a pushq cost such many cpu times? this make a function cpi = 2, and we want to fix it, but dont know how.

any suggestion is appretiate. Thanks.

2 posts / 0 new
Last post
For more complete information about compiler optimizations, see our Optimization Notice.


In hotspot analysis, the instruction identified is the active Instruction Pointer value when the event counter overflew (in this case INSTR_RETIRED).

Typically, this points to the application function which was statistically responsible for generating lots of this event/time spent.

Typically, the latency inducing instructions are occuring prior to the identified event. if this event is in a loop with some loads for example, these maybe high latency loads.

Hope this helps,

Note: there's a forum dedicated for Vtune Amplifier analysis and issues.

Leave a Comment

Please sign in to add a comment. Not a member? Join today