OpenCL*

Get Intel GPU if not used as desktop (Windows 8.1)?

Hello,

I have two GPUs: the HD4600 and an external one. I want to use the HD4600 for OpenCL calculations, but I have only one monitor. The best solution would be to disable the desktop on the second monitor. Unfortunately, as soon as I do that, clGetDeviceIDs no longer returns the HD4600. The only way to keep the desktop enabled in Windows 8 is to 'extend' it, but then my mouse cursor can move off my monitor.

Is it a limitation of Windows, or is it driver-specific, that a GPU can only be used if it also displays a desktop?

GPU-Quicksort in OpenCL 2.0: Nested Parallelism and Work-Group Scan Functions

Please take a look at my OpenCL 2.0 tutorial on the use of enqueue_kernel and work-group scan functions. It also has a very cool algorithm, GPU-Quicksort, implemented in both OpenCL 1.2 and 2.0.

https://software.intel.com/en-us/articles/gpu-quicksort-in-opencl-20-usi...

Let me know what you think!

 

OpenCL vs Intel Cilk Plus Issues, Differences and Capabilities

I am curious about the differences between OpenCL and Intel Cilk Plus. Both are parallel programming paradigms that are receiving wide recognition, but technically speaking, is one better than the other, or are they simply different? Also, what yardstick should I use when choosing between the two for an embarrassingly parallel problem? I would appreciate any answers.

Thanks!

Yaknan

Gen 8 and fp64

The document GVCS001-The Compute Architecture of Intel Processor Graphics Gen8.pdf states:

"Finally, one of the FPUs provides extended math capability to support high-throughput transcendental math functions and double precision 64-bit floating-point."

Does this mean it is possible for OpenCL on Intel GPUs to fully support cl_khr_fp64 one day? :)
Does the FPU also provide high-throughput double-precision transcendental math functions?
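
Whether a given device already reports cl_khr_fp64 can be checked from its extension string, which clGetDeviceInfo returns for CL_DEVICE_EXTENSIONS as a space-separated list of names. The sketch below only covers the string check. The helper name has_extension is hypothetical (it is not part of OpenCL), and the extension string is assumed to have been queried already.

```c
#include <stdbool.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical helper, not an OpenCL API.
 * Returns true if `name` appears as a whole space-separated token in
 * `ext_list`, e.g. the string obtained from
 * clGetDeviceInfo(device, CL_DEVICE_EXTENSIONS, ...). */
bool has_extension(const char *ext_list, const char *name)
{
    size_t len = strlen(name);
    if (len == 0) {
        return false;
    }

    const char *p = ext_list;
    while ((p = strstr(p, name)) != NULL) {
        /* Reject partial matches such as "cl_khr_fp64_extra". */
        bool starts_token = (p == ext_list) || (p[-1] == ' ');
        bool ends_token = (p[len] == '\0') || (p[len] == ' ');
        if (starts_token && ends_token) {
            return true;
        }
        p += len;
    }
    return false;
}
```

A caller would pass the queried string and the name "cl_khr_fp64". Matching whole tokens rather than substrings avoids a false positive when one extension name is a prefix of another.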

Another kernel crash, with reproducer

This one is very simple - just reading in blocks of an image and storing in LDS.

Crashes with access violation on read.

Windows 7, latest SDK, CPU device.

//////////////////////////////////////////////////////////////////////////////////////////////////////////

// image is of dimension 512 x 512
// size_t local_work_size[3] = {32, 32/4, 1};
// size_t global_work_size[3] = {512, 512/4, 1};

The kernel crashes with the latest SDK, reporting "access violation reading location ......"

Here is the offending kernel. The task is to find the maximum number of bits in a block of pixels.

 

 

/////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////

#define CODEBLOCKX  32

#define CODEBLOCKY 32

CONSTANT sampler_t sampler = CLK_NORMALIZED_COORDS_FALSE  | CLK_FILTER_NEAREST;
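
For comparing against the kernel's output, a plain CPU reference of the stated task can help. This is a minimal sketch, not the poster's kernel, and it rests on assumptions: "number of bits" is taken to mean the number of significant bits of a pixel value (0 for 0, 8 for 255), pixels are unsigned 32-bit integers in a row-major array, and the image dimensions are multiples of the block size. The function names are hypothetical.

```c
#include <stddef.h>
#include <stdint.h>

#define CODEBLOCKX 32
#define CODEBLOCKY 32

/* Number of significant bits in v: 0 for v == 0, 8 for v == 255. */
static unsigned bit_length(uint32_t v)
{
    unsigned n = 0;
    while (v != 0) {
        ++n;
        v >>= 1;
    }
    return n;
}

/* For every CODEBLOCKX x CODEBLOCKY block, store the largest bit length
 * found among its pixels. `out` must hold
 * (width / CODEBLOCKX) * (height / CODEBLOCKY) entries, row-major.
 * Assumes width and height are multiples of the block size. */
void max_bits_per_block(const uint32_t *pixels, size_t width, size_t height,
                        unsigned *out)
{
    size_t blocks_x = width / CODEBLOCKX;
    size_t blocks_y = height / CODEBLOCKY;

    for (size_t by = 0; by < blocks_y; ++by) {
        for (size_t bx = 0; bx < blocks_x; ++bx) {
            unsigned best = 0;
            for (size_t y = 0; y < CODEBLOCKY; ++y) {
                for (size_t x = 0; x < CODEBLOCKX; ++x) {
                    size_t row = by * CODEBLOCKY + y;
                    size_t col = bx * CODEBLOCKX + x;
                    unsigned b = bit_length(pixels[row * width + col]);
                    if (b > best) {
                        best = b;
                    }
                }
            }
            out[by * blocks_x + bx] = best;
        }
    }
}
```

For the 512 x 512 image mentioned in the post, this would produce a 16 x 16 array of per-block maxima.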

Debugging issues with Visual Studio 2012 on Windows 8

I have written an OpenCL program for vector addition for the Intel HD Graphics processor. The code builds, but during debugging many of the files can't be found. The message window shows the following:

'OpenCLProject2.exe' (Win32): Loaded 'C:\Users\lenovo PC\Dropbox\Daily Work\Fundamental matrix test code\c to mex\OpenCLProject2\x64\Debug\OpenCLProject2.exe'. Symbols loaded.
