[Python-checkins] python/nondist/peps pep-0339.txt, NONE, 1.1 pep-0000.txt, 1.299, 1.300

bcannon at users.sourceforge.net bcannon at users.sourceforge.net
Sat Feb 12 23:02:07 CET 2005


Update of /cvsroot/python/python/nondist/peps
In directory sc8-pr-cvs1.sourceforge.net:/tmp/cvs-serv7649

Modified Files:
	pep-0000.txt 
Added Files:
	pep-0339.txt 
Log Message:
Added PEP 339, "How to Change CPython's Bytecode", by Brett Cannon.


--- NEW FILE: pep-0339.txt ---
PEP: 339
Title: How to Change CPython's Bytecode
Version: $Revision: 1.1 $
Last-Modified: $Date: 2005/02/12 22:02:04 $
Author: Brett Cannon <brett at python.org>
Status: Active
Type: Informational
Content-Type: text/x-rst
Created: 02-Feb-2005
Post-History: 02-Feb-2005


Abstract
========

Python source code is compiled down to something called bytecode.  This
bytecode must implement enough semantics to perform the actions required by the
Language Reference [#lang_ref]_.  As such, knowing how to add, remove, or change
the bytecode is important to do properly when changing the abilities of the
Python language.
This PEP covers how to accomplish this in the CPython implementation of the
language (referred to as simply "Python" for the rest of this PEP).

.. warning::
     The guidelines outlined in this PEP apply to Python 2.4 and earlier.
     Current plans for Python 2.5 will lead to a significant change in how
     Python's bytecode is handled.
     This PEP will be updated once these planned changes are committed into
     CVS.


Rationale
=========

While changing Python's bytecode is not a frequent occurence, it still happens.
Having the required steps documented in a single location should make
experimentation with the bytecode easier since it is not necessarily obvious
what the steps are to change the bytecode.

This PEP, paired with PEP 306 [#PEP-306]_, should provide enough basic
guidelines for handling any changes performed to the Python language itself in
terms of syntactic changes that introduce new semantics.


Checklist
=========

This is a rough checklist of what files need to change and how they are
involved with the bytecode.  All paths are given from the viewpoint of
``/cvsroot/python/dist/src`` from CVS).  This list should not be considered
exhaustive nor to cover all possible situations.

- ``Include/opcode.h``
	This include file lists all known opcodes and associates each opcode
	name with
	a unique number.  When adding a new opcode it is important to take note
	of the ``HAVE_ARGUMENT`` value.  This ``#define``'s value specifies the
	value at which all opcodes greater than ``HAVE_ARGUMENT`` are expected
	to take an argument to the opcode.

- ``Lib/opcode.py``
	Lists all of the opcodes and their associated value.  Used by the dis
	module [#dis]_ to map bytecode values to their names.

- ``Python/ceval.c``
	Contains the main interpreter loop.  Code to handle the evalution of an
	opcode goes here.

- ``Python/compile.c``
	To make sure an opcode is actually used, this file must be altered.
	The emitting of all bytecode occurs here.

- ``Lib/compiler/pyassem.py``, ``Lib/compiler/pycodegen.py``
	The 'compiler' package [#compiler]_ needs to be altered to also reflect
	any changes to the bytecode.

- ``Doc/lib/libdis.tex``
	The documentation [#opcode_list]_ for the dis module contains a complete
	list of all the opcodes.

- ``Python/import.c``
	Defines the magic word (named ``MAGIC``) used in .pyc files to detect if
	the bytecode used matches the one used by the version of Python running.
	This number needs to be changed to make sure that the running
	interpreter does not try to execute bytecode that it does not know
	about.


Suggestions for bytecode development
====================================

A few things can be done to make sure that development goes smoothly when
experimenting with Python's bytecode.  One is to delete all .py(c|o) files
after each semantic change to Python/compile.c .  That way all files will use
any bytecode changes.

Make sure to run the entire testing suite [#test-suite]_.  Since the
``regrtest.py`` driver recompiles all source code before a test is run it acts
a good test to make sure that no existing semantics are broken.

Running parrotbench [#parrotbench]_ is also a good way to make sure existing
semantics are not broken; this benchmark is practically a compliance test.


Previous experiments
====================
This section lists known bytecode experiments that have not gone into Python.

Skip Montanaro presented a paper at a Python workshop on a peephole optimizer
[#skip-peephole]_.

Michael Hudson has a non-active SourceForge project named Bytecodehacks
[#Bytecodehacks]_ that provides functionality for playing with bytecode
directly.

An opcode to combine the functionality of LOAD_ATTR/CALL_FUNCTION was created
named CALL_ATTR [#CALL_ATTR]_.  Currently only works for classic classes and
for new-style classes rough benchmarking showed an actual slowdown thanks to
having to support both classic and new-style classes.


References
==========

.. [#lang_ref] Python Language Reference, van Rossum & Drake
   (http://docs.python.org/ref/ref.html)

.. [#PEP-306] PEP 306, How to Change Python's Grammar, Hudson
   (http://www.python.org/peps/pep-0306.html)

.. [#dis] dis Module
   (http://docs.python.org/lib/module-dis.html)

.. [#compiler] 'compiler' Package
   (http://docs.python.org/lib/module-compiler.html)

.. [#test-suite] 'test' Package
   (http://docs.python.org/lib/module-test.html)

.. [#opcode_list] Python Byte Code Instructions
   (http://docs.python.org/lib/bytecodes.html)

.. [#parrotbench] Parrotbench
   (ftp://ftp.python.org/pub/python/parrotbench/parrotbench.tgz,
   http://mail.python.org/pipermail/python-dev/2003-December/041527.html)

.. [#skip-peephole] Skip Montanaro's Peephole Optimizer Paper
   (http://www.foretec.com/python/workshops/1998-11/proceedings/papers/montanaro/montanaro.html)

.. [#Bytecodehacks] Bytecodehacks Project
   (http://bytecodehacks.sourceforge.net/bch-docs/bch/index.html)

.. [#CALL_ATTR] CALL_ATTR opcode
   (http://www.python.org/sf/709744)


Copyright
=========

This document has been placed in the public domain.



..
   Local Variables:
   mode: indented-text
   indent-tabs-mode: nil
   sentence-end-double-space: t
   fill-column: 70
   End:

Index: pep-0000.txt
===================================================================
RCS file: /cvsroot/python/python/nondist/peps/pep-0000.txt,v
retrieving revision 1.299
retrieving revision 1.300
diff -u -d -r1.299 -r1.300
--- pep-0000.txt	30 Jan 2005 03:08:02 -0000	1.299
+++ pep-0000.txt	12 Feb 2005 22:02:05 -0000	1.300
@@ -58,6 +58,7 @@
  I   291  Backward Compatibility for Standard Library  Norwitz
  I   306  How to Change Python's Grammar               Hudson
  I   333  Python Web Server Gateway Interface v1.0     Eby
+ I   339  How to Change CPython's Bytecode             Cannon
  I  3000  Python 3.0 Plans                             Kuchling, Cannon
 
  Accepted PEPs (accepted; may not be implemented yet)
@@ -372,6 +373,7 @@
  S   336  Make None Callable                           McClelland
  S   337  Logging Usage in the Standard Library        Dubner
  S   338  Executing modules inside packages with '-m'  Coghlan
+ I   339  How to Change CPython's Bytecode             Cannon
  SR  666  Reject Foolish Indentation                   Creighton
  S   754  IEEE 754 Floating Point Special Values       Warnes
  I  3000  Python 3.0 Plans                             Kuchling, Cannon
@@ -401,7 +403,7 @@
     Batista, Facundo         facundo at taniquetil.com.ar
     Baxter, Anthony          anthony at interlink.com.au
     Bellman, Thomas          bellman+pep-divmod at lysator.liu.se
-    Cannon, Brett            drifty at alum.berkeley.edu
+    Cannon, Brett            brett at python.org
     Carlson, Josiah          jcarlson at uci.edu
     Carroll, W Isaac         icarroll at pobox.com
     Coghlan, Nick            ncoghlan at email.com



More information about the Python-checkins mailing list