Web page special characters encoding
Rami Chowdhury
rami.chowdhury at gmail.com
Sat Jul 10 13:07:33 EDT 2010
On Jul 10, 2010, at 09:24 , mattia wrote:
> Hi all, I'm using py3k and the urllib package to download web pages. Can
> you suggest me a package that can translate reserved characters in html
> like "è", "ò", "é" in the corresponding correct
> encoding?
It won't do the whole job for you but you may find the 'name2codepoint' mapping in the 'html.entities' module useful.
HTH,
Rami
More information about the Python-list
mailing list