线程

Inconsistent Speedup

Hi,

I'm new in using OpenMP. I would like to ask about speedup ratio.

I running C source code with OpenMP added with Intel core i5-2410M.

Based on my understanding, speedup = execution time of code using one thread/execution time of code using N threads 

The execution time recorded is time_diff in the attached code.

offload error: unexpected embedded target binary type, ...

Hello,
Could someone help me with my software installation issue with Xeon Phi? I'm trying to compile my program on the host for future offload developments. In the case I don't include any offload functions in my codes, my program as well as some external libraries (Arpack, Petsc, Libmesh, etc.) are compiled with Intel MPI & MKL and running well on the host. However, if I put any testing offload codes, then it shows long messages during compilation and linking as shown below. An executable is created anyways but it's not running.

IPP multi-threaded libraries are not installed - static link

hello,

my error is 

...v110\ImportBefore\Intel.Libs.IPP.v110.targets(92,5): error : IPP multi-threaded libraries are not installed.

i have one computer which I compiled a project with IPP. and linked the lib which is created from this project with another project. on this computer I have Intel parallel studio 2015 installed.

my goal is to link the IPP project into the other project without having to install IPP for all the the other developers on my team.

the error i'm getting is that probably IPP is not installed on the other computer.

Rebuild modules for newer kernels fails

Hi, the systems hosting the MICs have been updated and hence the mpss must be updated.

Information:

Linux compute-19-17.local 3.19.1-1.el6.elrepo.x86_64 #1 SMP

mpss-3.5-linux.tar

gcc version 4.4.7 20120313 (Red Hat 4.4.7-11) (GCC)

Trying to rebuild the modules fails (it works fine with 2.6 type kernels) :

rpmbuild --rebuild mpss-modules-3.5-1.src.rpm

<cut ....>

Basic OMP Parallelized Program Not Scaling As Expected

#include <iostream>
#include <vector>
#include <stdexcept>
#include <sstream>
#include <omp.h>

std::vector<int> col_sums(std::vector<std::vector<short>>& data) {
    unsigned int height = data.size(), width = data[0].size();
    std::vector<int> totalSums(width, 0), threadSums(width, 0);

    #pragma omp parallel firstprivate(threadSums)
    {
        #pragma omp parallel for
        for (unsigned int i = 0; i < height; i++) {
            threadSums.data()[0:width] += data[i].data()[0:width];
        }
订阅 线程