unicode data tables
John MaxSkaller
skaller@maxtal.com.au
Tue, 27 Jul 99 15:22:31 GMT
I have posted the unicode (and other) encoding subdocument of
interscript v1.0a9 to the Web. At
http://www.xenon.triode.net.au/~skaller/unicode/index.html
you will find documented Python source for conversion of various character
sets, including ISO-8859-X, Wansung, Johab, GB, Big5, ShiftJis, and various
Microsoft/IBM proprietary code pages.
The document contains a character by character description of the Unicode
subset of ISO-10646, routines for converting UCS-2, USC2le, UCS-4, UCS4-le
to UTF-8 (the native encoding used by interscript), and compact tables of all
the glyphs (for unicode and each of the foreign character sets).
If you're interested in internationalisation, you may find this web useful.
You can browse online, or download it as a tarball (using http).
You will need a browser with appropiate fonts to see Chinese, Japanese,
Korean, Arabic and other glyphs.
[Note: the table of contents uses a folding tree under MSIE4-5, it
takes some time for my ECMA script to process the table. It's not
broken, just slow!]
John Max Skaller ph:61-2-96600850
mailto:skaller@maxtal.com.au 10/1 Toxteth Rd
http://www.maxtal.com.au/~skaller Glebe 2037 NSW AUSTRALIA
<P><A HREF="http://www.xenon.triode.net.au/~skaller/unicode/index.html">
unicode data tables</A> - documented Python source for conversion of
various character sets, including ISO-8859-X, Wansung, Johab, GB, Big5,
ShiftJis, and various Microsoft/IBM proprietary code pages. (25-Jul-99)
--
----------- comp.lang.python.announce (moderated) ----------
Article Submission Address: python-announce@python.org
Python Language Home Page: http://www.python.org/
Python Quick Help Index: http://www.python.org/Help.html
------------------------------------------------------------