Author's Blogs
Posted Juni 3rd 20110
A high-level overview of general algorithms included in Intel® TBB to let you know what's possible: parallel_reduce, parallel_do, parallel_for_each: parallel_invoke, parallel_pipeline, parallel_sort and parallel_scan
Posted Juni 3rd 20111
A high-level overview of some generic parallel algorithms included in Intel® TBB to let you know what's possible: parallel_reduce, parallel_do, parallel_for_each
Posted Mai 27th 20110
In the last blog, I explained how to “build” a parallelized for out of templatized components. Today I’m going to show you an easier way to implement the Intel® Threading Building Blocks (Intel® TBB) parallel_for.
The Final Draft International Standard (FDIS) for C++11, also known as C++0x came …
Posted Mai 24th 20110
In the last 6 blog posts, I've explained thecilk_for and getting vectorization insidereductions to prevent data races on the shared datacreating your own custom vectorized functions
What's great about those is that they are "quick and dirty" - you can get a whole lot of parallelism that you …
Posted Mai 4th 20110
In the last 5 blogs, I’ve explained various ways you can have the compiler generate vectorized code for you. If you understand and master each of the different ways Array Notations can vectorize your code, there is one last* recommended step before you start optimizing the “meat” of your ‘for’ …
Posted Mai 2nd 20110
In the previous blog, I explained two mini-kernels, the scatter and gather, which can be written up quickly and still have the benefits of compiler vectorization with Array Notations. There’s also a variety of run-of-the-mill functions, primarily in the “raw plug and chug math” category that you …
Posted April 29th 20110
100 blogs and 100 videos in 100 days about PBB
#6: Vectorized Parallel Patterns inside a Cilk ‘for’
“Educate the compiler a little on what you’re trying to do, and it will vectorize a ton for you.”
In the previous blog, I explained the syntax and behavior of basic Array Notations …
Posted April 29th 20111
100 blogs and 100 videos in 100 days about PBB #6: Vectorized Parallel Patterns inside a Cilk ‘for’ “Educate the compiler a little on what you’re trying to do, and it will vectorize a ton for you.” In the previous blog, I explained the syntax and behavior of basic Array Notations vectorization …
Posted April 28th 20110
In the previous blog, I explained the rationale behind vectorizing with Array Notations inside of the Cilk ‘for’. There are lots of ways to vectorize inside a for loop once you’ve used the Cilk ‘for’, but using this colon/bracket style syntax has been the easiest way for me to get vectorization …
Posted April 27th 20110
In the previous blog, I gave a very fast crash course on the Cilk ‘for’. There are a few other ways you can manipulate that for loop, but I will go over that another time. The key is to understand what is possible first, get the basics running and parallelized, and use the more expert features …
