issues with htmlparser.getpos

Steve Holden steve at holdenweb.com
Wed Jul 4 08:34:32 EDT 2007


dysmas wrote:
> Hi,
> 
> 
> Im having an issue with HTMLParser, the getpos() funtion sometimes
> returns things like :
> 
> (1, 1247)
> (1, 2114)
> (1, 2168)
> (1, 2228)
> (1, 2295)
> (1, 2382)
> (1, 2441)
> (1, 2963)
> (1, 3040)
> 
> i guess this is because the HTMLParser has not correctly parsed the
> newline characters in the string fed to it... is there a workaround
> for this, without checking the string every time i feed it some data?
> 
Have you verified that these results aren't correct? There is no 
requirements for newlines in HTML, and some computer-generated pages 
don't bother to insert them.

regards
  Steve
-- 
Steve Holden        +1 571 484 6266   +1 800 494 3119
Holden Web LLC/Ltd           http://www.holdenweb.com
Skype: holdenweb      http://del.icio.us/steve.holden
--------------- Asciimercial ------------------
Get on the web: Blog, lens and tag the Internet
Many services currently offer free registration
----------- Thank You for Reading -------------




More information about the Python-list mailing list