Screen scraper to get all 'a title' elements

Grobu snailcoder at retrosite.invalid
Wed Nov 25 18:44:55 EST 2015


On 26/11/15 00:06, Chris Angelico wrote:
> On Thu, Nov 26, 2015 at 9:48 AM, ryguy7272 <ryanshuell at gmail.com> wrote:
>> Thanks!!  Is that regex?  Can you explain exactly what it is doing?
>> Also, it seems to pick up a lot more than just the list I wanted, but that's ok, I can see why it does that.
>>
>> Can you just please explain what it's doing???
>
> It's a trap!
>
> Don't use a regex to parse HTML, unless you're deliberately trying to
> entice young and innocent programmers to the dark side.
>
> ChrisA
>

Sorry, I wasn't aware that regexes were on the dark side :-)
Now that you mention it, I suppose their complexity and tendency to
invite mistakes can all too easily lead to broken code, which is hard to
justify when a reliable, ready-made solution like BeautifulSoup exists.
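
For what it's worth, here is a rough sketch of the parser-based approach.
I'm assuming that "a title" in the subject means <a> tags carrying a
title attribute. To keep it runnable without installing anything, it uses
the standard library's html.parser rather than BeautifulSoup; the names
TitleCollector, extract_link_titles and SAMPLE are made up for the
example, and the sample markup is invented, not taken from the page
discussed in this thread.

```python
from html.parser import HTMLParser


class TitleCollector(HTMLParser):
    """Collect the title attribute of every <a> tag that has one."""

    def __init__(self):
        super().__init__()
        self.titles = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs; HTMLParser lowercases
        # both the tag name and the attribute names before calling us.
        if tag == "a":
            for name, value in attrs:
                if name == "title" and value is not None:
                    self.titles.append(value)


def extract_link_titles(html):
    """Return the title of each <a title=...> tag, in document order."""
    parser = TitleCollector()
    parser.feed(html)
    parser.close()
    return parser.titles


# Invented sample markup (an assumption, for illustration only).
SAMPLE = """
<p>
  <a href="/one" title="First link">one</a>
  <a href='/two' TITLE='Second "quoted" link'>two</a>
  <a href="/three">no title here</a>
  <span title="not a link">ignored</span>
</p>
"""

if __name__ == "__main__":
    print(extract_link_titles(SAMPLE))
```

With BeautifulSoup installed, soup.find_all('a', title=True) should give
the matching tags in much the same way. Either way the parser copes with
mixed case and either quoting style for you, which is exactly where a
hand-rolled regex tends to pick up more than you wanted.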



