Intel® MKL VML Training Material

This article contains training material (in PDF format) on Intel® MKL Vector Math (VML), which includes details of VML features and performance, examples and its application in Finance.
  • Linux*
  • Microsoft Windows* (XP, Vista, 7)
  • C/C++
  • Intermédiaire
  • Intel® Composer XE
  • Bibliothèque Intel® Math Kernel Library
  • Knights Corner Architecture


    I have a question about the architecture of knights corner. I wonder how many Vector Processing Unit (VPU) within one physical core? Because the hardware can support 4 threads within one core, does it that mean there are four VPUs within one core? or there is only one VPU within one core, and four threads share one VPU? I am not familiar with this, Can someone answer me ?


    Thank you!


    configure infiniband connection(rdma) for 2 mics

    Hi, I'm trying to set up infiniband connection between host and mic, mic and mic. Host is showing this on ifconfig:

    mic0:ib: flags=67<UP,BROADCAST,RUNNING>  mtu 64512
            inet  netmask  broadcast
            ether 4c:79:ba:20:06:63  txqueuelen 1000  (Ethernet)

    and two: mic0 and mic1 interfaces.

    I have two coprocessors installed.  I can run ib_read_bw between host and mic0, but not host and mic1 or mic0 and mic1.  Getting error:  

    Received 10 times ADDR_ERROR
    Unable to perform rdma_client function.

    Stepwise calculation for large datasets in Fortran


    I am trying to do stepwise calculations, for each day, by using an equation in a Fortran program, using the datasets, as in the attached (each dataset has 18262 values corresponding to 18262 days). The output of the program would be another new dataset in which the two datasets are combined per the equation (as an example 0.5*TC1+ 7*TC2), for each day.

    Intel LEO multiple copies of the same variable on MIC

    Dear all,

    I am extending SNU NPB OpenMP version to use LEO. I found a problem while converting the IS application. Considering that I do not have more than a couple of weeks of practice on LEO, I was wondering if this is a compiler bug, or a missing feature, etc. The problem is that when offloading pragma is used in/out/inout are not always respected depending on the underneath code (I am speaking about the C version): multiple copies of the same variable are created on MIC and those copies are not consistent. Here an example code:

    Temperature monitoring

    Hi all,

    I'm trying to figure out how to read the temperature sensor(s) on the PHI from a program running on the embedded linux.

    I installed MPSS version 3.4.3 and the board SKU is B1PRQ-5110P/5120D.

    The Intel system software developers guide for the PHI mentions, in section, that "The processor implements internal MSRs (IA32_THERM_STATUS, IA32_THERM_INTERRUPT, IA32_CLOCK_MODULATION)".

    unable to link offloaded MIC

    hi all,

    I had added an offload section to a file named atm_comp_mct.F90.

    it does compile well.

    however later it is being used to link to a file named cesm.exe and it throws errors as undefined reference.

    undefined reference to `__offload_target_acquire'
    undefined reference to `__offload_offload'

    here are the atm_comp_mct.F90 and the buildlog for cesm.exe



    S’abonner à Linux*