I need to specify the kind of cluster that suits my requirement. I do molecular dynamics simulations. The software would generate threads that require*large amount* of communication between themselves. Each threads requires to send as well as recieve information from every other thread.
In such a scenario, if I stick to a 8-node processor cluster (as the s/w runs only on 2^n processors), which of the following would be more suitable:
1. A 2-processor dual core xeon processors with HT capability.
2. Four HT-enabled P4 processors.
3. Two P4 Extreme processor machines.
4. Four P4 Dual core (HT-less/disabled) processors.
5. Eight P4 processors.
We can choose an appropriate switch/hub for any of them.
I understand that only the actual benchmarks will make the things clear, but I would like to know in general what is expected to be the best.