[Tutor] htmllib vs re question

->Terry<- tvbare at socket.net
Fri Mar 10 00:18:46 CET 2006


I want to parse some text from an HTML file that contains
blocks of pre-formatted text. All I'm after is what's between
the <pre> and </pre> tags.

My first thought was to use re for this, but looking through
the Library Reference, I see the htmllib module. Is htmllib
overkill for this job?
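
For comparison, here is a minimal sketch of the two options weighed
above. It makes some assumptions: it targets Python 3, where htmllib
no longer exists, so the standard library's html.parser.HTMLParser
stands in for it; the names pre_blocks_re, PreExtractor and
pre_blocks_parser are made up for illustration; and <pre> blocks are
assumed not to be nested.

```python
import re
from html.parser import HTMLParser

# Option 1: regular expression.
# Non-greedy match across newlines; tolerates attributes and upper case.
PRE_RE = re.compile(r"<pre\b[^>]*>(.*?)</pre\s*>", re.IGNORECASE | re.DOTALL)


def pre_blocks_re(html):
    """Return the raw text between each <pre> and </pre> pair."""
    return PRE_RE.findall(html)


# Option 2: a real HTML parser (html.parser stands in for htmllib here).
class PreExtractor(HTMLParser):
    """Collect the text content of every <pre> element."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.blocks = []      # finished <pre> blocks
        self._in_pre = False  # True while inside a <pre> element
        self._buf = []        # text pieces of the current block

    def handle_starttag(self, tag, attrs):
        # Tag names arrive lower-cased from HTMLParser.
        if tag == "pre":
            self._in_pre = True
            self._buf = []

    def handle_endtag(self, tag):
        if tag == "pre" and self._in_pre:
            self._in_pre = False
            self.blocks.append("".join(self._buf))

    def handle_data(self, data):
        # Text may arrive in several pieces, so accumulate it.
        if self._in_pre:
            self._buf.append(data)


def pre_blocks_parser(html):
    """Return the text of each <pre> block, with entities decoded."""
    parser = PreExtractor()
    parser.feed(html)
    parser.close()
    return parser.blocks
```

The regular expression is shorter, but it returns the markup verbatim,
so entities such as &lt; and any tags inside the block are left as they
are. The parser version decodes entities and drops nested tags, at the
cost of a small class.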

The HTML file size varies, but I don't expect the size to exceed
150-200k. Speed is not a big concern.

What is the Pythonic way and why?

Any recommendations or comments?

Thanks,
-- 
     Terry     <tvbareATsocketDOTnet>



