Become familiar with SSE3 Instructions. SSE3 for the 32-bit Intel® architecture is a set of 13 new instructions that accelerate performance of Streaming SIMD Extensions (SSE) technology, Streaming SIMD Extensions 2 (SSE2) technology, and 87-FP math capabilities. The new technology is compatible with existing software written for Intel architecture microprocessors, and existing software should continue to run correctly, without modification, on microprocessors that incorporate SSE3 instructions.
Implement SSE3 single-precision and double-precision packed ADDSUBxx instructions. ADDSUBPS has two 128-bit operands. The instruction performs single-precision addition on the second and fourth pairs of 32-bit data elements within the operands and single-precision subtraction on the first and third pairs. This instruction is effective at evaluating complex products on packed single-precision data.