Wrapper round x86 Assembler

Josiah Carlson jcarlson at uci.edu
Sat Apr 10 16:13:25 EDT 2004


> On modern CPUs, unless you really know what you're doing, hand-written
> assembly code is likely to be slower than code generated by a good
> optimizing C/C++ compiler. If your assembly code is at all straightforward,
> it will definitely be slower than compiled code, because you'll be stalling
> the CPU all over the place when one instruction waits for the results of a
> previous one.

Yes and no.  With the existance of the Tomasulo algorithm for register 
renaming and out-of-order execution, you can pick up quite a bit of the 
ILP without even programming it that way.  Of course, there is only so 
much the Tomasulo algorithm can do for crappy assembly.

  - Josiah




More information about the Python-list mailing list