On 5/17/2011 3:08 PM, Klonuo Umom wrote:
>> I get 67ms/loop with plain ATLAS.
>
> What about Octave?
>
> As I wrote, PC I run the test is very low-end

You should compare to Octave's a*b, not dot(a,b).

Christoph
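
To illustrate why the two timings are not comparable: for two equally sized matrices, Octave's dot(a,b) takes one dot product per column, whereas NumPy's dot(a,b) on 2-D arrays is the full matrix product, which Octave writes as a*b. A minimal pure-Python sketch of the difference follows. The helper names matrix_product and columnwise_dot are made up for illustration and are not NumPy or Octave functions.

```python
# Sketch only: pure-Python stand-ins, not the actual NumPy or Octave code.

def matrix_product(a, b):
    """Full matrix product: what NumPy's dot(a, b) does for 2-D inputs,
    and what Octave writes as a*b."""
    cols_b = list(zip(*b))  # columns of b
    return [[sum(x * y for x, y in zip(row, col)) for col in cols_b]
            for row in a]

def columnwise_dot(a, b):
    """One dot product per column: what Octave's dot(a, b) does for two
    equally sized matrices, i.e. sum(a .* b)."""
    return [sum(x * y for x, y in zip(col_a, col_b))
            for col_a, col_b in zip(zip(*a), zip(*b))]

a = [[1, 2], [3, 4]]
b = [[5, 6], [7, 8]]

product = matrix_product(a, b)     # n x n result, roughly n^3 multiplications
per_column = columnwise_dot(a, b)  # length-n result, roughly n^2 multiplications

print(product)     # [[19, 22], [43, 50]]
print(per_column)  # [26, 44]
```

The column-wise version does far less work for large matrices, so timing it against NumPy's dot would make Octave look much faster than it is for the same operation.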