unicode data tables

John MaxSkaller skaller@maxtal.com.au
Tue, 27 Jul 99 15:22:31 GMT


I have posted the unicode (and other) encoding subdocument of
interscript v1.0a9 to the Web. At

	http://www.xenon.triode.net.au/~skaller/unicode/index.html

you will find documented Python source for conversion of various character
sets, including ISO-8859-X, Wansung, Johab, GB, Big5, ShiftJis, and various
Microsoft/IBM proprietary code pages.

The document contains a character by character description of the Unicode
subset of ISO-10646, routines for converting UCS-2, USC2le, UCS-4, UCS4-le
to UTF-8 (the native encoding used by interscript), and compact tables of all
the glyphs (for unicode and each of the foreign character sets).

If you're interested in internationalisation, you may find this web useful.
You can browse online, or download it as a tarball (using http).
You will need a browser with appropiate fonts to see Chinese, Japanese,
Korean, Arabic and other glyphs.

[Note: the table of contents uses a folding tree under MSIE4-5, it
takes some time for my ECMA script to process the table. It's not
broken, just slow!]


John Max Skaller                ph:61-2-96600850              
mailto:skaller@maxtal.com.au       10/1 Toxteth Rd 
http://www.maxtal.com.au/~skaller  Glebe 2037 NSW AUSTRALIA

<P><A HREF="http://www.xenon.triode.net.au/~skaller/unicode/index.html">
unicode data tables</A> - documented Python source for conversion of
various character sets, including ISO-8859-X, Wansung, Johab, GB, Big5,
ShiftJis, and various Microsoft/IBM proprietary code pages.  (25-Jul-99)

-- 
----------- comp.lang.python.announce (moderated) ----------
Article Submission Address:  python-announce@python.org
Python Language Home Page:   http://www.python.org/
Python Quick Help Index:     http://www.python.org/Help.html
------------------------------------------------------------