SSE4.1 Hack

SSE4.1 adds many SIMD integer instructions. It is available on the E8400 (a Penryn-generation Core 2 Duo) and on Nehalem-core CPUs.

So, let’s see what we can do with the instructions: http://en.wikipedia.org/wiki/SSE4#SSE4.1

The boring samples would be Fibonacci numbers or computing the constant e.
A BigInteger implementation in C++ might be a good candidate.
PMULLD (or PMULDQ) and MPSADBW look like the powerful ones to use.
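
As a first taste of the packed integer multiply, here is a minimal sketch using the `_mm_mullo_epi32` intrinsic, which compiles to PMULLD. The function name `mul_lanes_u32` is made up for illustration, and the `target` attribute is an assumption that the compiler is GCC or Clang and is not invoked with `-msse4.1`. This is not a BigInteger multiply: PMULLD keeps only the low 32 bits of each product, so a real implementation would still have to recover the high halves or carries some other way.

```cpp
#include <smmintrin.h>  // SSE4.1 intrinsics (_mm_mullo_epi32)
#include <cstdint>

// Hypothetical helper: multiplies four 32-bit lanes at once.
// out[i] = low 32 bits of a[i] * b[i], for i = 0..3.
#if defined(__GNUC__) || defined(__clang__)
__attribute__((target("sse4.1")))  // allow SSE4.1 here without -msse4.1
#endif
void mul_lanes_u32(const std::uint32_t* a, const std::uint32_t* b,
                   std::uint32_t* out) {
    // Unaligned loads, so the caller does not need 16-byte aligned arrays.
    __m128i va = _mm_loadu_si128(reinterpret_cast<const __m128i*>(a));
    __m128i vb = _mm_loadu_si128(reinterpret_cast<const __m128i*>(b));
    // PMULLD: four 32x32 multiplies, keeping the low 32 bits of each result.
    __m128i prod = _mm_mullo_epi32(va, vb);
    _mm_storeu_si128(reinterpret_cast<__m128i*>(out), prod);
}
```

Calling it with `{1, 2, 3, 0x10000}` and `{5, 6, 7, 0x10000}` shows the truncation: the last lane overflows 32 bits and comes back as 0.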


Do you Assembly?

MULPS — Packed Single-Precision Floating-Point Multiply

I have started looking at assembly to see how Single-Instruction Multiple-Data (SIMD) works, and ended up with PPC’s AltiVec and Intel’s SSE. I have successfully optimized C++ source code using SIMD techniques.
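
To give an idea of what this looks like from C++, here is a minimal sketch built on the `_mm_mul_ps` intrinsic, which maps to MULPS and multiplies four floats per instruction. The helper name `multiply_arrays` is hypothetical, and the unaligned loads and stores are chosen only to keep the example simple; this is an illustration, not the code I actually optimized.

```cpp
#include <xmmintrin.h>  // SSE intrinsics (_mm_loadu_ps, _mm_mul_ps, _mm_storeu_ps)
#include <cstddef>

// Hypothetical helper: out[i] = a[i] * b[i] for i in [0, n).
void multiply_arrays(const float* a, const float* b, float* out,
                     std::size_t n) {
    std::size_t i = 0;
    // Main loop: one MULPS handles four single-precision floats.
    for (; i + 4 <= n; i += 4) {
        __m128 va = _mm_loadu_ps(a + i);
        __m128 vb = _mm_loadu_ps(b + i);
        _mm_storeu_ps(out + i, _mm_mul_ps(va, vb));
    }
    // Scalar tail for the last n % 4 elements.
    for (; i < n; ++i) {
        out[i] = a[i] * b[i];
    }
}
```

The scalar loop at the end handles lengths that are not a multiple of four, so the function can be called on arrays of any size.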

I will look into SSE4.1 next, because I am expecting to receive a new E8400 CPU.