Since two commits my submission fail on the 40 cores machine.
Here is the message I have :
error on a 40-cores HT machine :
couldn't unzip submission -> timeout of command execution, >2000 ms.
Reportez-vous à notre Notice d'optimisation pour plus d'informations sur les choix et l'optimisation des performances dans les produits logiciels Intel.
I have another question : can we assume that our program will not be tested on small inputs (less diagonals than the max number of threads)?
It would avoid me to handle special cases.
I wouldn't rely on this assumption. Your program should always give the same results regardlessly of the selected number of threads. Why shouldn't someone try to run your program with 1000 threads?