A Matrix Multiplication Routine that Updates Only the Upper or Lower Triangular Part of the Result Matrix
Background
Intel® MKL provides the general purpose BLAS* matrix multiply routines ?GEMM defined as follows:
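For orientation, the standard BLAS ?GEMM operation computes C := alpha*op(A)*op(B) + beta*C, where op(X) is X or its transpose. The following is only a minimal reference sketch, in plain Python, of what it means to update just one triangle of C. The function name `gemm_triangular` and its argument order are hypothetical and are not the Intel MKL interface; the sketch assumes square C, no transposition, and row-major nested lists.

```python
def gemm_triangular(uplo, alpha, a, b, beta, c):
    """Update one triangle of C in place: C := alpha*A*B + beta*C.

    Hypothetical helper for illustration only (not an MKL routine).
    uplo: "U" updates the upper triangle (including the diagonal),
          anything else updates the lower triangle (including the diagonal).
    a: n x k matrix, b: k x n matrix, c: n x n matrix (lists of lists).
    Entries of C outside the selected triangle are left untouched.
    """
    n = len(c)
    k = len(b)
    for i in range(n):
        # Pick the column range that belongs to the requested triangle.
        cols = range(i, n) if uplo == "U" else range(0, i + 1)
        for j in cols:
            acc = 0.0
            for p in range(k):
                acc += a[i][p] * b[p][j]
            c[i][j] = alpha * acc + beta * c[i][j]
    return c


if __name__ == "__main__":
    a = [[1.0, 2.0], [3.0, 4.0]]
    b = [[5.0, 6.0], [7.0, 8.0]]
    c = [[1.0, 1.0], [1.0, 1.0]]
    print(gemm_triangular("U", 1.0, a, b, 0.0, c))
```

Skipping the other triangle roughly halves the arithmetic, which is the motivation when the product is known to be symmetric.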
Run-to-Run Reproducibility of Floating-Point Calculations for Applications on Intel® Xeon Phi™ Coprocessors (and Intel® Xeon® Processors)
The Issue
If I rerun the identical program on the identical input data on an identical processor, will I get an identical result?
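One general reason the answer is not automatically "yes" is that floating-point addition is not associative, so anything that changes the order of operations between runs (for example, a different reduction order in a parallel sum) can change the rounded result. The sketch below illustrates only that general property with two hypothetical helper functions; it is not taken from the article and does not model any specific processor or runtime.

```python
def sum_left_to_right(values):
    """Accumulate strictly in order, as a simple serial loop would."""
    total = 0.0
    for v in values:
        total += v
    return total


def sum_pairwise(values):
    """Accumulate by recursive halving, as a tree-shaped reduction might."""
    if len(values) == 1:
        return values[0]
    mid = len(values) // 2
    return sum_pairwise(values[:mid]) + sum_pairwise(values[mid:])


if __name__ == "__main__":
    data = [1e16, 1.0, -1e16, 1.0]  # exact mathematical sum is 2.0
    print("left to right:", sum_left_to_right(data))
    print("pairwise:     ", sum_pairwise(data))
```

Both functions add the same four numbers, yet they return different values, and neither equals the exact sum, because each intermediate result is rounded to double precision.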
The latest Intel® Architecture Instruction Set Extensions Programming Reference includes the definition of Intel® Advanced Vector Extensions 512 (Intel® AV