Interview with Anatoliy Kuznetsov, the author of BitMagic C++ Library
In this article, Anatoliy Kuznetsov answers the questions and tells us about the open BitMagic C++ Library. Author: Andrey Karpov Type: Technical Article |
11/20/2009
|
Implementing Scalable Atomic Locks for Multi-Core Intel® EM64T and IA32 Architectures
by Michael Chynoweth and Mary R. Lee
Abstract
There are several different methods of atomically locking code and data on a Microsoft Windows platform. The primary purpose of this whitepaper is to giv ... Author: Michael Chynoweth (Intel),Mary Lee (Intel) Type: Technical Article |
11/09/2009
|
Performance Insights to Intel® Hyper-Threading Technology
Executive Summary
Intel® Hyper-Threading Technology (Intel® HT Technology)¹ is a hardware feature supported in many Intel® architecture-based server and client platforms that enables one processor c ... Author: Antonio Valles (Intel),Matt Gillespie (Intel),Garrett Drysdale (Intel) Type: Technical Article |
11/09/2009
|
Intel® Hyper-Threading Technology: Analysis of the HT Effects on a Server Transactional Workload
Executive Summary
Intel® Hyper-Threading Technology (Intel® HT Technology)¹ is a hardware feature supported in many Intel® architecture-based server and client platforms that enables one processor ... Author: Matt Gillespie (Intel),Garrett Drysdale (Intel) Type: Technical Article |
11/09/2009
|
Barnes-Hut Algorithm Implementation in Parallel Programming World
A new load balancing algorithm for parallel computation, by anology whose efficiency lies between ORB and costzones. Author: vchaitanya,ravindrapai34,jsangeetha Type: Technical Article |
11/02/2009
|
Miser – A Dynamically Loadable Memory Allocator for Multi-Threaded Applications
by Barry Tannenbaum(This is a follow-up to our earlier post on multicore storage allocation.)
While working with an early Cilk++ adopter, it quickly became apparent that the default memory allocato ... Author: Stephen Lewin-berlin (Intel) Type: Technical Article |
10/28/2009
|
Multicore Storage Allocation
by Charles LeisersonWhen multicore-enabling a C/C++ application, it's common to discover that malloc()(or new) is a bottleneck that limits the speedup your parallelized application can obtain. This ar ... Author: Stephen Lewin-berlin (Intel) Type: Technical Article |
10/28/2009
|
Four Reasons Why Parallel Programs Should Have Serial Semantics
by Steve Lewin-Berlin Some parallel programming environments require the developer to relearn the fundamentals of programming in order to think in parallel. Cilk++ takes a different approach. One basic ... Author: Stephen Lewin-berlin (Intel) Type: Technical Article |
10/28/2009
|
Finding Performance Bottlenecks & Data Races
by Ilya MirmanAt this point, we have several dozen organizations worldwide exploring Cilk++. When embarking on a multithreading project, the first question many folks ask is, "Where do I start?"
Be ... Author: Stephen Lewin-berlin (Intel) Type: Technical Article |
10/28/2009
|
Making Your Cache Go Further in These Troubled Times
by Will LeisersonOne of our summer interns, Matthew Steele, suggested a matrix-multiplication algorithm that more effectively used the cache than an algorithm that might be more intuitive to a mathemat ... Author: Stephen Lewin-berlin (Intel) Type: Technical Article |
10/28/2009
|
Multicore-enabling the N-Queens Problem Using Cilk++
by Justin ZhangI was an intern at CilkArts this summer and my first engagement with Cilk++ was to solve the N-Queens problem with it. The N-Queens problem asks this question: given an n-by-n chessboard ... Author: Stephen Lewin-berlin (Intel) Type: Technical Article |
10/28/2009
|
A Tale of Two Algorithms: Multithreading Matrix Multiplication
by Matthew Steele I've been interning at Cilk Arts this summer, working mainly on designing and implementing a very interesting analysis tool for Cilk++ programs (more on that later!). I have also, of ... Type: Technical Article |
10/28/2009
|
Global Variable Reconsidered
by Charles Leiserson In a widely applauded article published in 1973 and entitled, “Global variable considered harmful,” Bill Wulf and Mary Shaw argued, “We claim that the non-local variable is a major ... Type: Technical Article |
10/28/2009
|
What the $#@! is Parallelism, Anyhow?
by Charles LeisersonI'm constantly amazed how many seemingly well-educated computer technologists bandy about the word parallelism without really knowing what they're talking about. I can't tell you ho ... Author: Stephen Lewin-berlin (Intel) Type: Technical Article |
10/28/2009
|
Concepts in Multicore Programming - Lecture 3: Analysis of Multithreaded Algorithms
Posted by Ilya Mirman originally on www.cilk.com on Wed, Jul 20, 2009
We recently teamed up with MIT's Professional Development Program on a 2-day workshop focused on multicore programming.
(Here ar ... Author: Stephen Lewin-berlin (Intel) Type: Technical Article |
10/28/2009
|
Concepts in Multicore Programming - Lecture 2: Parallelism & Scheduling Theory
Posted by Ilya Mirman originally on www.cilk.com on Wed, Jul 15, 2009
We recently teamed up with MIT's Professional Development Program on a 2-day workshop focused on multicore programming.
(Here's ... Author: Stephen Lewin-berlin (Intel) Type: Technical Article |
10/28/2009
|
Multicore Programming Workshop - Lecture 1
Posted by Ilya Mirman originally on www.cilk.com on Wed, Jul 15, 2009
We recently teamed up with MIT's Professional Development Program on a 2-day workshop focused on multicore programming.
The firs ... Author: Stephen Lewin-berlin (Intel) Type: Technical Article |
10/28/2009
|
Visualizing Parallel Speedup with Cilkview
Posted by Will Leiserson originally on www.cilk.com on Tue, Jun 30, 2009
Along with our upcoming release of Cilk++ v.1.1 we are including a new tool to help you visualize application performance: Cil ... Author: Stephen Lewin-berlin (Intel) Type: Technical Article |
10/28/2009
|
Multicore-enabling FP-tree Algorithm for Frequent Pattern Mining
Posted by Yuxiong He originally on www.cilk.com on Thu, Jun 25, 2009
In data mining, association rule mining is a popular and well-researched method for discovering interesting relations between var ... Author: Stephen Lewin-berlin (Intel) Type: Technical Article |
10/28/2009
|
Reducers and Other Cilk++ Hyperobjects: Peeking Under the Hood
Posted by Ilya Mirman originally on www.cilk.com on Wed, Jun 17, 2009
We have written extensively about the challenges global variables pose for multithreaded applications - inhibiting parallelism by ... Author: Stephen Lewin-berlin (Intel) Type: Technical Article |
10/28/2009
|
Multicore-enabling Dense Polynomial Multiplication
Posted by Ilya Mirman originally on www.cilk.com on Tue, Jun 02, 2009
Marc Moreno Maza Ontario Research Centre for Computer Algebra, University of Western Ontario moreno@csd.uwo.ca Yuzhen Xie Compute ... Type: Technical Article |
10/28/2009
|
Multicore-enabling a Binary Decision Diagram algorithm
Posted by Yuxiong He originally on www.cilk.com on Fri, May 29, 2009
Reduced and Ordered Binary Decision Diagram (a.k.a. ROBDD, though I will refer to it as BDD in this post for simplicity) is an acy ... Author: Stephen Lewin-berlin (Intel) Type: Technical Article |
10/28/2009
|
A cute technique for avoiding certain race conditions
Posted by Matteo Frigo originally on www.cilk.com on Thu, May 07, 2009
The following problem, slightly rephrased, was posed in the Intel TBB forum . You have N particles. Particle j exerts force f(i, ... Author: Stephen Lewin-berlin (Intel) Type: Technical Article |
10/28/2009
|
First Impressions of the Fortress Language
Posted by Pablo Halpern on Fri, May 08, 2009
I was privileged recently to attend a one-day hands-on introduction to Fortress lead by Sukyoung Ryu and Jan-Willem Maessen of Sun Microsystems and ho ... Author: Stephen Lewin-berlin (Intel) Type: Technical Article |
10/28/2009
|
Multicore-enabling the Murphi Verification Tool
Posted by Yuxiong He originally on www.cilk.com on Tue, Apr 28, 2009
Murphi is a popular finite-state machine verification tool, used widely in the design of cache coherence algorithms and protocols, ... Author: Stephen Lewin-berlin (Intel) Type: Technical Article |
10/28/2009
|