when we use vtune to collect light-weight hotspots, we found this
Address Line Assembly CPU Time Instructions Retired
0xdd6d70 499 pushq %rbp 0.573s 1,184,000,000
how can a pushq cost such many cpu times? this make a function cpi = 2, and we want to fix it, but dont know how.
any suggestion is appretiate. Thanks.