Taking data from a text file to parse html page
Larry Bates
larry.bates at websafe.com
Thu Aug 24 09:19:55 EDT 2006
DH wrote:
> Hi,
>
> I'm trying to strip the html and other useless junk from a html page..
> Id like to create something like an automated text editor, where it
> takes the keywords from a txt file and removes them from the html page
> (replace the words in the html page with blank space) I'm new to python
> and could use a little push in the right direction, any ideas on how to
> implement this?
>
> Thanks!
>
See Beautiful Soup: http://www.crummy.com/software/BeautifulSoup/
it will parse even badly formed HTML and allow you to extract/change
information as you wish.
-Larry Bates
More information about the Python-list
mailing list