[pypy-issue] [issue1132] Bugs in Unicode data

Carl Friedrich Bolz tracker at bugs.pypy.org
Tue Apr 17 16:33:56 CEST 2012


New submission from Carl Friedrich Bolz <cfbolz at gmx.de>:

PyPy seems to have a bug in the unicodedata names:


Python 2.7.1 (r271:86832, Jun 16 2011, 16:59:05) 
[GCC 4.2.1 (Based on Apple Inc. build 5658) (LLVM build 2335.15.00)] on darwin
Type "help", "copyright", "credits" or "license" for more information.
>>> import unicodedata
>>> unicodedata.name(40900)
Traceback (most recent call last):
  File "<console>", line 1, in <module>
TypeError: must be unicode, not int
>>> unicodedata.name(unichr(40900))
'CJK UNIFIED IDEOGRAPH-9FC4'
>>> 
david at Yate ~/Code/pypy/pypy/module/unicodedata % pypy
Python 2.7.2 (0f03693b05ac, Feb 18 2012, 15:27:02)
[PyPy 1.8.1-dev0] on darwin
Type "help", "copyright", "credits" or "license" for more information.
Traceback (most recent call last):
  File "app_main.py", line 51, in run_toplevel
  File "app_main.py", line 576, in run_it
  File "/Users/david/.pystartup", line 21, in <module>
    import fancycompleter
ImportError: No module named fancycompleter
And now for something completely different: ``"All problems in PyPy can be
solved by another level of interpretation"''
>>>> import unicodedata
>>>> unicodedata.name(unichr(40900))
Traceback (most recent call last):
  File "<console>", line 1, in <module>
ValueError: no such name
>>>> 


There's even a test that would have caught this, but it's skipped on Python2.6
(which the tannit buildbot is using):

pypy/module/unicodedata/test/test_unicodedata.py test_random_charnames

----------
messages: 4267
nosy: cfbolz, pypy-issue
priority: bug
status: unread
title: Bugs in Unicode data

________________________________________
PyPy bug tracker <tracker at bugs.pypy.org>
<https://bugs.pypy.org/issue1132>
________________________________________


More information about the pypy-issue mailing list