How to read webpage

koranthala koranthala at gmail.com
Sat Aug 1 11:19:28 EDT 2009


On Aug 1, 6:52 pm, MRAB <pyt... at mrabarnett.plus.com> wrote:
> tarun wrote:
> > Dear All,
> > I want to read a webpage and copy the contents of it in word file. I
> > tried to write following code:
>
> > import urllib2
> > urllib2.urlopen("http://www.rediff.com/")
>
> > *Error:-*
>
> >     urllib2.urlopen("http://www.icicibank.com/")
> >   File "C:\Python25\lib\urllib2.py", line 121, in urlopen
> >     return _opener.open(url, data)
> >   File "C:\Python25\lib\urllib2.py", line 374, in open
> >     response = self._open(req, data)
> >   File "C:\Python25\lib\urllib2.py", line 392, in _open
> >     '_open', req)
> >   File "C:\Python25\lib\urllib2.py", line 353, in _call_chain
> >     result = func(*args)
> >   File "C:\Python25\lib\urllib2.py", line 1100, in http_open
> >     return self.do_open(httplib.HTTPConnection, req)
> >   File "C:\Python25\lib\urllib2.py", line 1075, in do_open
> >     raise URLError(err)
> > urllib2.URLError: <urlopen error (11001, 'getaddrinfo failed')>
>
> I've just tried it. I didn't get an exception, so your problem must be
> elsewhere.

Is it that the website expects a valid browser?
In that case, spoof a browser and try to get the site.



More information about the Python-list mailing list