[Tutor] Error with incorrect encoding

Oleg Oltar oltarasenko at gmail.com
Tue Apr 15 17:20:53 CEST 2008


I am trying to parse an html page. Have following error while doing that


 src = sel.get_html_source()
        links = re.findall(r'<a class="al4"[^<]*</a>', src)
        for link in links:
            print link



======================================================================
ERROR: test_new (__main__.NewTest)
----------------------------------------------------------------------
Traceback (most recent call last):
  File "<stdin>", line 19, in test_new
UnicodeEncode    Error: 'ascii' codec can't encode character u'\xae' in
position 90: ordinal not in range(128)

----------------------------------------------------------------------
Ran 1 test in 6.345s
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/tutor/attachments/20080415/91c85a1a/attachment.htm 


More information about the Tutor mailing list