[New-bugs-announce] [issue18348] Additional code pages for EBCDIC

roskakori report at bugs.python.org
Tue Jul 2 19:47:11 CEST 2013


New submission from roskakori:

Currently Python includes a codec for EBCDIC international (cp500) but seems to be missing any further EBCDIC codecs. These encodings are widly used on mainframe platforms, popular in finance and insurance.

Descriptions of these codepages are available from IBM: <http://www-01.ibm.com/software/globalization/cp/cp_cpgid.html>. These descriptions also include mapping files although not in a format that can readily be processed by gencodec.py.

So instead I used the codecs included with Java 1.7 to generate mappings for gencodec.py. You can find them in the attached ZIP archive. As Java also runs on mainframe platforms, IBM should be interested in the Java codecs to be correct and complete.

The converter is available from <https://github.com/roskakori/CodecMapper>. To build the cp*.txt for EBCDIC, simply run:

$ git clone https://github.com/roskakori/CodecMapper.git
$ cd CodecMapper
$ ant ebcdic

IBM lists a large number of EBCDIC codepages, I only attached the ones listed in the German Wikipedia: <http://de.wikipedia.org/wiki/EBCDIC>. This also includes cp500 for comparison with your current cp500. And it lacks EDF03DRV because even Java does not support it.

Currently Java 1.7 supports 43 variants. To get a list of them, use:

$ ant list | grep -i ' ibm'

This would also fix issue 1097797: Encoding for Code Page 273 used by EBCDIC Germany Austria.

----------
components: Unicode
files: cp_ebcdic.zip
messages: 192214
nosy: ezio.melotti, lemburg, roskakori
priority: normal
severity: normal
status: open
title: Additional code pages for EBCDIC
type: enhancement
Added file: http://bugs.python.org/file30751/cp_ebcdic.zip

_______________________________________
Python tracker <report at bugs.python.org>
<http://bugs.python.org/issue18348>
_______________________________________


More information about the New-bugs-announce mailing list