Regex Help
Support Desk
support.desk.ipg at gmail.com
Wed Sep 24 10:22:32 EDT 2008
Thanks for the reply, I found out the problem was occurring later on in the
script. The regexp works well.
-----Original Message-----
From: Lawrence D'Oliveiro [mailto:ldo at geek-central.gen.new_zealand]
Sent: Tuesday, September 23, 2008 6:51 PM
To: python-list at python.org
Subject: Re: Regex Help
In message <mailman.1369.1222101506.3487.python-list at python.org>, Support
Desk wrote:
> Anybody know of a good regex to parse html links from html code? The one I
> am currently using seems to be cutting off the last letter of some links,
> and returning links like
>
> http://somesite.co
>
> or http://somesite.ph
>
> the code I am using is
>
>
> regex = r'<a href=["|\']([^"|\']+)["|\']>'
Can you post some example HTML sequences that this regexp is not handling
correctly?
More information about the Python-list
mailing list