HTMLParser.HTMLParseError: EOF in middle of construct

Sérgio Monteiro Basto sergio at sergiomb.no-ip.org
Tue Jun 19 16:21:08 EDT 2007


First, sorry about the mess; let's see if Kontact works better with
newsgroups.

Marc 'BlackJack' Rintsch wrote:
>> OK, but my problem is that I don't understand what the specific problem
>> at line 1173 is
> 
> You can't just look at that line and ignore the rest.  There are 604 (!)
> errors, some about table rows, before this line.  So the parser may be
> confused at this point and be already in an internal state that sees that
> line in a completely different light than you do.

OK, but it is one single error that blocks the parse.
I finally found it. The failing tag is:
<td colspan="2"align="center"
If I add the missing space between the two attributes:
<td colspan="2" align="center"
for example with:

import re
p = re.compile('"align')
content = p.sub('" align', content)

then I can parse the HTML.
I don't know whether this is a bug in HTMLParser.
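
The substitution above only covers the align attribute. A slightly more
general sketch is below: it inserts a space after any quoted attribute
value that is immediately followed by another attribute name. The names
add_missing_attribute_spaces and TagCollector are made up for
illustration, and the code assumes Python 3, where the parser lives in
html.parser (this thread is about the older HTMLParser module). The regex
is a heuristic and could also match similar-looking text outside of tags.

```python
import re
from html.parser import HTMLParser

# A quoted attribute value (="...") directly followed by the first
# character of another attribute name, with no whitespace in between.
_MISSING_SPACE = re.compile(r'(=\s*"[^"<>]*")(?=[A-Za-z_:])')


def add_missing_attribute_spaces(content):
    """Hypothetical helper: put a space after each such quoted value."""
    return _MISSING_SPACE.sub(r'\1 ', content)


class TagCollector(HTMLParser):
    """Hypothetical parser that records every start tag and its attributes."""

    def __init__(self):
        super().__init__()
        self.seen = []

    def handle_starttag(self, tag, attrs):
        self.seen.append((tag, attrs))


# Repair the tag from this message, then feed it to the parser.
fixed = add_missing_attribute_spaces('<td colspan="2"align="center">')
collector = TagCollector()
collector.feed(fixed)
collector.close()
```

Anchoring the match on ="..." keeps attribute values that themselves
contain an equals sign, such as href="foo=bar", unchanged.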

Thanks for the help.

Best regards,
--
Sérgio M. B. 


