I report an a simple experiment because it took me some time to figure how to use multi-threading (OPT_ARBB_LEVEL=O3) with a simple kernel
which applies exp function to all elements of a vector.
The following vector kernel :
void exp_kernel(arbb::dense& X)
X = exp(X);
does not provide MT acceleration (with ARBB_OPT_LEVEL=O3) while the following one does:
void elementary_exp_kernel(T & Xi)
Xi = exp(Xi);