[Tutor] htmllib vs re question
->Terry<-
tvbare at socket.net
Fri Mar 10 00:18:46 CET 2006
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1
I want to parse some text from an HTML file that contains
blocks of pre-formatted text. All I'm after is what's between
the <pre> and </pre> tags.
My first thought was to use re for this, but looking through
the Library Reference, I see the htmllib module. Is htmllib
overkill for this job?
The HTML file size varies, but I don't expect the size to exceed
150-200k. Speed is not a bug concern.
What is the Pythonic way and why?
Any recommendations or comments?
Thanks,
- --
Terry <tvbareATsocketDOTnet>
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.7 (GNU/Linux)
iD8DBQFEELfcQvSnsfFzkV0RAreaAJ9qvD5GoA5a0qD15Wr0hJ4XLLNhiQCeKd1R
XIqBMZWoIY66y8r5Rtgevqc=
=cUhn
-----END PGP SIGNATURE-----
More information about the Tutor
mailing list