Threading on Intel® Parallel Architectures

Parallel compression library version 2.15 ...

Hello,

My ParallelCompression library version 2.15 is here. I have added
the Parallel LZ algorithm, and I have also added a testfile() method
to test whether your compressed file is OK.

Description:

The Parallel compression library implements the Parallel Gzip, Parallel Bzip,
Parallel LZMA and Parallel LZ algorithms using my Thread Pool Engine.

You can download my ParallelCompression library from:

Stress testing of Intel OpenMP library - More than 18,600 OpenMP threads created in a parallel region

Hi everybody,

I'd like to share the results of stress testing the Intel OpenMP library for a 32-bit platform:

libiomp5md.dll

Intel OMP Performance Library
version 5.0.20111219 for IA-32 architecture
built on 2011-12-19 17:12:07 UTC

The purpose of the stress test was as follows:

Create as many OpenMP threads as possible in a parallel region on a one-CPU system.

Details:

HyperThreading getting in the way of performance

I am testing on Core i7 2600K here (4 physical cores x 2 logical cores).

My code (video processing plugin for VirtualDub) is threaded using OpenMP.

- With 8 threads I have lower than single-threaded performance.
- With 4 threads I have 3.98x single-threaded performance.
- With 4 threads I also have some periodic slowdowns (when a thread does not run on the same logical core as before).

It is obvious that HyperThreading is the problem for this particular algorithm.

What is not obvious is how to control execution such that:

SSE4.2 STTNI 'equal each' instruction customization

Hi, I need to write a C program where the STTNI 'equal each' instruction seems to be useful because I want to achieve parallelism. Here src1 should be equal to src2 for an array of 16. At the same time, my requirement is that instead of comparing the exact value, src1 should lie between (src2-(src2/10)) and (src2 + (src2/10)). I searched but did not find anything I can use directly. Can I have some C source code equivalent to the 'equal each' function where I can put my range condition? Thanks AT
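
The range condition above can be written as a scalar reference in plain C. This is only a sketch, not the STTNI instruction itself: 'equal each' tests exact equality per element, so the tolerance has to be computed separately. The function name range_equal_each is made up for illustration, and the element type (16 unsigned bytes) and the result layout (one mask bit per element) are assumptions, since the post does not state them.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper, not part of any Intel API.
 * Assumes 16 unsigned 8-bit elements per operand.
 * Returns a 16-bit mask: bit i is set when
 *   src2[i] - src2[i]/10 <= src1[i] <= src2[i] + src2[i]/10
 * using integer division, as written in the post. */
uint16_t range_equal_each(const uint8_t src1[16], const uint8_t src2[16])
{
    assert(src1 != NULL && src2 != NULL);

    uint16_t mask = 0;
    for (int i = 0; i < 16; ++i) {
        /* Work in int so src2 + tol cannot wrap around at 255. */
        int tol = src2[i] / 10;
        int lo  = src2[i] - tol;
        int hi  = src2[i] + tol;
        if (src1[i] >= lo && src1[i] <= hi)
            mask |= (uint16_t)(1u << i);
    }
    return mask;
}
```

A result of 0xFFFF then means that all 16 elements are within the 10% band. The loop has no dependencies between iterations, so a compiler may be able to vectorize it, and it can serve as a correctness check for a hand-written SIMD version.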

Beginning With Parallel Programming

Hello everyone,

I am new to parallel programming. So far I have learned about CUDA (a parallel programming language for NVIDIA GPUs) and OpenCL (for AMD streams, NVIDIA and Intel). I have installed the AMD APP SDK on Windows and Fedora 16. But since I do not have Visual Studio for Windows, and there is some graphics incompatibility with Fedora, I am trying OpenCL with an Intel CPU.

Spawn-and-forget multitasking

Hello, I am developing my game engine and trying to find the best-fit multitasking/multithreading model to make use of the power of multicore processors. I've already implemented threads and multitasking (based on the spawn-and-wait model). I have also heard about spawn-and-forget multitasking, but how does it work? If someone can, please share an implementation of such a multitasking model. Searching the Internet didn't give me anything, because there is not much helpful information about it (maybe I typed the wrong query into the search engine's "Find" field).
