SAXParseException: not well-formed (invalid token)
Pablo Rey
prey at cesga.es
Thu Aug 30 07:46:47 EDT 2007
Dear Colleagues,
I am getting the following error with a XML page:
> File "/home/prey/RAL-CESGA/bin/voms2users/voms2users.py", line 69, in getItems
> d = minidom.parseString(xml.read())
> File "/usr/lib/python2.2/site-packages/_xmlplus/dom/minidom.py", line 967, in parseString
> return _doparse(pulldom.parseString, args, kwargs)
> File "/usr/lib/python2.2/site-packages/_xmlplus/dom/minidom.py", line 954, in _doparse
> toktype, rootNode = events.getEvent()
> File "/usr/lib/python2.2/site-packages/_xmlplus/dom/pulldom.py", line 265, in getEvent
> self.parser.feed(buf)
> File "/usr/lib/python2.2/site-packages/_xmlplus/sax/expatreader.py", line 208, in feed
> self._err_handler.fatalError(exc)
> File "/usr/lib/python2.2/site-packages/_xmlplus/sax/handler.py", line 38, in fatalError
> raise exception
> xml.sax._exceptions.SAXParseException: <unknown>:553:48: not well-formed (invalid token)
> def getItems(page):
> opener =urllib.URLopener(key_file=HOSTKEY,cert_file=HOSTCERT) ;
> try:
> xml = opener.open(page)
> except:
> return []
>
> d = minidom.parseString(xml.read())
> items = d.getElementsByTagName('item')
> data = []
> for i in items:
> data.append(getText(i.childNodes))
>
> return data
The page is
https://lcg-voms.cern.ch:8443/voms/cms/services/VOMSCompatibility?method=getGridmapUsers
and the line with the invalid character is (the invalid character is the
final é of Université):
<item>/C=BE/O=BEGRID/OU=Physique/OU=Univesité Catholique de
Louvain/CN=Roberfroid</item>
I have tried several options but I am not able to avoid this problem.
Any idea?.
I am starting to work with Python so I am sorry if this problem is trivial.
Thanks for your time.
Pablo Rey
More information about the Python-list
mailing list