Hi every one,
I found that the no-fill mode(cr0.CD=1 cr0.NW=0) did not work in my prosessor(i7 2600). Specifically, when I access a memory region that is surely in the L1 Dcache after entering the no-fill mode, the speed suffers significantly(*1000+). But according to table 11-5 in intel SDM vol 3A, read hit should access cache. And I am sure that I have the correct memory type(WB in the experiment, both is mtrr and pat) and the process is restricted in a single core with intertrupt disabled.
In fact, when in the no-fill mode, everything becomes slower. In my experiment, simply movl %eax,%ebx takes a lot of cpu cycles. Can any one give me some explaination?