Converting html character codes to utf-8 text

Johann Spies johann.spies at gmail.com
Tue Jun 19 06:10:53 EDT 2012


I am trying the following:

Change data like this:

Bien Donné : agri tourism

to this:

Bien Donné agri tourism

I am using the 'unescape' function published on
http://effbot.org/zone/re-sub.htm#unescape-html but working through a file
I get the following error:

UnicodeDecodeError: 'ascii' codec can't decode byte 0xc3 in position 519:
ordinal not in range(128)

and I do not now how to solve this problem.

Any solution will be very appriciated.

Regards
Johann

-- 
Because experiencing your loyal love is better than life itself,
my lips will praise you.  (Psalm 63:3)
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.python.org/pipermail/python-list/attachments/20120619/ee05122e/attachment.html>


More information about the Python-list mailing list