SAXParseException: not well-formed (invalid token)
Marc 'BlackJack' Rintsch
bj_666 at gmx.net
Thu Aug 30 09:35:39 EDT 2007
On Thu, 30 Aug 2007 15:31:58 +0200, Pablo Rey wrote:
> On 30/08/2007 14:35, Marc 'BlackJack' Rintsch wrote:
>
>> Maybe you can download that XML file and use `xmllint` to check if it
>> is well formed XML!?
>
> This is the output of the xmllint command:
>
> [prey at www3 voms2users]$ xmllint cms.xml cms.xml:553: error: Input is not
> proper UTF-8, indicate encoding !
> <item>/C=BE/O=BEGRID/OU=Physique/OU=Univesité Catholique de
> Louvain/CN=Roberfroi
> ^
> cms.xml:553: error: Bytes: 0xE9 0x20 0x43 0x61
> <item>/C=BE/O=BEGRID/OU=Physique/OU=Univesité Catholique de
> Louvain/CN=Roberfroi
>
> […]
>
> <?xml version="1.0" encoding="UTF-8" ?>
So the XML says it is encoded in UTF-8 but it contains at least one
character that seems to be encoded in ISO-8859-1.
Tell the authors/creators of that document there XML is broken.
Ciao,
Marc 'BlackJack' Rintsch
More information about the Python-list
mailing list