[Python-bugs-list] [ python-Bugs-646408 ] old UnicodeData.txt

noreply@sourceforge.net noreply@sourceforge.net
Sun, 01 Dec 2002 11:37:42 -0800


Bugs item #646408, was opened at 2002-12-01 12:54
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=105470&aid=646408&group_id=5470

Category: Unicode
Group: Python 2.3
>Status: Closed
Resolution: None
Priority: 5
Submitted By: Matthias Klose (doko)
Assigned to: M.-A. Lemburg (lemburg)
Summary: old UnicodeData.txt

Initial Comment:
[submitted at http://bugs.debian.org/171061]

The version mentioned is CVS 021121 HEAD

unicodedata.so is obviously built with older
UnicodeData.txt file,
and does not match new characters introduced in Unicode
3.2.
To fix it, I copied UnicodeData.txt (conveniently
provided by perl-modules
in /usr/share/perl/5.8.0/unicore/UnicodeData.txt) to
the top of
unpacked python2.3 source package, renamed it to
UnicodeData-Latest.txt,
ran "python Tools/unicode/makeunicodedata.py" and then
recompiled
python2.3 package. This should probably be addressed
upstream as well.

before:
>>> import unicodedata
>>> unicodedata.name(u'\u20b0')
Traceback (most recent call last):
  File "<stdin>", line 1, in ?
  ValueError: no such name

after:
>>> import unicodedata
>>> unicodedata.name(u'\u20b0')
'GERMAN PENNY SIGN'





----------------------------------------------------------------------

>Comment By: Matthias Klose (doko)
Date: 2002-12-01 19:37

Message:
Logged In: YES 
user_id=60903

The CVS version (as mentioned) was 021121 HEAD. Closing the
report, as this has changed six days ago.

Frederik: Debian currently has 1.5, 2.1, 2.2 and 2.3, 2.1
beeing the default in the released (stable) distribution,
and 2.2 the default in the "unstable" distribution. I did
put 2.3 in unstable to ease building third party modules
using the new version.

----------------------------------------------------------------------

Comment By: Martin v. Löwis (loewis)
Date: 2002-12-01 13:30

Message:
Logged In: YES 
user_id=21627

Are you sure you are using the current CVS? In my copy of 
the CVS, renaming to UnicodeData-Latest is not necessary, 
and GERMANY PENNY SIGN is included in the database.

To verify that you use the current CVS, please report the 
value of unicodedata.unidata_version.

Fredrik, Debian has the Python 2.3 package only in 
its "unstable" (and "testing") distribution, see

http://packages.debian.org/unstable/interpreters/python2.3.ht
ml

It is common to provide Debian packages for CVS versions of 
software in "unstable", so that the Debian developers can 
analyse effects of upcoming versions on their software.

----------------------------------------------------------------------

Comment By: Fredrik Lundh (effbot)
Date: 2002-12-01 13:18

Message:
Logged In: YES 
user_id=38376

Why is Debian shipping a "python2.3", when
Python 2.3 hasn't been released yet (it's not
even in alpha)?

(the Unicode database in the current Python
CVS has already been updated...)

</F>


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=105470&aid=646408&group_id=5470