Taking data from a text file to parse html page

Larry Bates larry.bates at websafe.com
Thu Aug 24 09:19:55 EDT 2006


DH wrote:
> Hi,
> 
> I'm trying to strip the html and other useless junk from a html page..
> Id like to create something like an automated text editor, where it
> takes the keywords from a txt file and removes them from the html page
> (replace the words in the html page with blank space) I'm new to python
> and could use a little push in the right direction, any ideas on how to
> implement this?
> 
> Thanks!
> 
See Beautiful Soup: http://www.crummy.com/software/BeautifulSoup/
it will parse even badly formed HTML and allow you to extract/change
information as you wish.

-Larry Bates



More information about the Python-list mailing list