urllib2 - iteration over non-sequence

Erik Max Francis max at alcyone.com
Sun Jun 10 01:56:38 EDT 2007


Paul Rubin wrote:

> If you know in advance that the page you're retrieving will be
> reasonable in size, then using readlines is fine.  If you don't know
> in advance what you're retrieving (e.g. you're working on a crawler)
> you have to assume that you'll hit some very large pages with
> difficult construction.

And that's before you even mention the point that, depending on the 
application, it could easily open yourself up to a DOS attack.

There's premature optimization, and then there's premature completely 
obvious and pointless waste.  This falls in the latter category.

Besides, someone was asking for/needing an older equivalent to iterating 
over a file.  That's obviously .xreadlines, not .readlines.

-- 
Erik Max Francis && max at alcyone.com && http://www.alcyone.com/max/
  San Jose, CA, USA && 37 20 N 121 53 W && AIM, Y!M erikmaxfrancis
   The more violent the love, the more violent the anger.
    -- _Burmese Proverbs_ (tr. Hla Pe)



More information about the Python-list mailing list