[Python-bugs-list] [Bug #122587] xmllib unable to parse "german scharfes ß" in UTF8 format
noreply@sourceforge.net
noreply@sourceforge.net
Thu, 16 Nov 2000 10:58:03 -0800
Bug #122587, was updated on 2000-Nov-16 06:10
Here is a current snapshot of the bug.
Project: Python
Category: XML
Status: Open
Resolution: None
Bug Group: None
Priority: 5
Summary: xmllib unable to parse "german scharfes ß" in UTF8 format
Details: The xmllib.XMLParser seems to be unable to parse
an XML file that contains 0xc3 0x9f (UTF8 representation
of the german ß).
Python 2.0 (Linux i386) always give me the following
traceback:
suxlap@/tmp/xx(2)% python test.py test.xml
<?xml version="1.0" encoding="UTF-8" ?>
<test>ßüöä</test>
Traceback (most recent call last):
File "test.py", line 20, in ?
pp.feed(data)
File "/opt/python-2.0/lib/python2.0/xmllib.py", line 165, in feed
self.goahead(0)
File "/opt/python-2.0/lib/python2.0/xmllib.py", line 261, in goahead
self.syntax_error('illegal character in content')
File "/opt/python-2.0/lib/python2.0/xmllib.py", line 786, in syntax_error
raise RuntimeError, 'Syntax error at line %d: %s' % (self.lineno, message)
RuntimeError: Syntax error at line 3: illegal character in content
Other UTF8 characters seem to work
Follow-Ups:
Date: 2000-Nov-16 10:58
By: Nobody
Comment:
works with accept_utf=1 as additional parameter,
but using xml package instead.
-------------------------------------------------------
For detailed info, follow this link:
http://sourceforge.net/bugs/?func=detailbug&bug_id=122587&group_id=5470