I am puzzled by the fact that the machine has 4 QPI packages and is not NUMA (Windows reports single NUMA node). How did you achieve this? Is it some kind of BIOS setting that effectively blends memory topology and makes a NUMA system to look like a UMA system?
Is it intended? I think that it's much more beneficial for educational purposes to setup the machine as NUMA system. Future concurrent hardware is going to be nonuniform. That's the only purpose of my login to MTL - to test some things on a NUMA system... and it turned out that the beast is not NUMA. It's a pity.
Earlier I tried Intel Parallel Universe, but it features silly Windows Server 2003 w/o any support for NUMA (the system itself is NUMA, though).
Not NUMA???

