Python Web Scraping: Within href, read only those values that have http in them

shahsn11 at gmail.com shahsn11 at gmail.com
Fri Jan 13 23:42:47 EST 2017


I am trying to scrape a webpage just for learning. The page contains multiple "a" tags. Consider the HTML below:

<a href='\abc\def\jkl'> Something </a>

<a href ='http:\\www.google.com'> Something</a>


Now I want to read only those href values that contain http. My current code is:

# soup is the BeautifulSoup object for the page; this prints every href, relative or absolute
for link in soup.find_all("a"):
    print(link.get("href"))

I would like to change it so that it reads only the http links.
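
One possible way to do the filtering, as a minimal sketch rather than a definitive answer: test each href with str.startswith before printing it. To keep the example runnable without installing anything, it collects the links with the standard library's html.parser instead of BeautifulSoup. The names is_http_link and LinkCollector are hypothetical helpers made up for this illustration, and the sample HTML assumes the backslashes in the snippet above were meant to be forward slashes.

```python
from html.parser import HTMLParser


def is_http_link(href):
    """Return True when href is a non-empty string that starts with 'http'."""
    # An <a> tag without an href attribute yields None, so guard against that first.
    # 'http' as a prefix also matches https links.
    return bool(href) and href.lower().startswith("http")


class LinkCollector(HTMLParser):
    """Hypothetical helper: records the href of every <a> tag it parses."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            # attrs is a list of (name, value) pairs; href may be missing
            self.hrefs.append(dict(attrs).get("href"))


# Assumed sample page, modelled on the two tags in the question
sample = """
<a href='/abc/def/jkl'> Something </a>
<a href='http://www.google.com'> Something</a>
<a> No href at all </a>
"""

collector = LinkCollector()
collector.feed(sample)

# Keep only the links that pass the prefix check
http_links = [h for h in collector.hrefs if is_http_link(h)]
print(http_links)
```

With BeautifulSoup, the same check should fit inside the existing loop: call link.get("href") once, and print the result only if is_http_link returns True for it.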


