[Web-SIG] So what's missing?

John J Lee jjl at pobox.com
Sat Oct 25 08:51:05 EDT 2003


On Fri, 24 Oct 2003, Bill Janssen wrote:

[...]
> * CSS parser.  I can't really do visual interpretation of Web pages
>   without understanding their layout.

Does anybody other than Bill want this?


> * post-multipart (both http and https).

Everybody agrees that this is needed.
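
To make the request concrete, here is a rough sketch of what building a multipart/form-data body involves. It is only an illustration, written against modern Python 3: the function name encode_multipart, its argument layout, and the use of uuid for the boundary are my own assumptions, not an existing or proposed library API.

```python
import uuid


def encode_multipart(fields, files, boundary=None):
    """Build a multipart/form-data request body (hypothetical helper).

    fields: dict mapping form field name -> str value
    files:  list of (field name, filename, content type, bytes) tuples
    Returns (content_type_header_value, body_bytes).
    """
    if boundary is None:
        # Any token that does not occur in the payload will do.
        boundary = uuid.uuid4().hex
    delimiter = b"--" + boundary.encode("ascii")

    lines = []
    for name, value in fields.items():
        lines.append(delimiter)
        lines.append(
            ('Content-Disposition: form-data; name="%s"' % name).encode("utf-8")
        )
        lines.append(b"")  # blank line separates part headers from part body
        lines.append(value.encode("utf-8"))
    for name, filename, content_type, data in files:
        lines.append(delimiter)
        lines.append(
            ('Content-Disposition: form-data; name="%s"; filename="%s"'
             % (name, filename)).encode("utf-8")
        )
        lines.append(("Content-Type: %s" % content_type).encode("ascii"))
        lines.append(b"")
        lines.append(data)
    lines.append(delimiter + b"--")  # closing delimiter
    lines.append(b"")

    body = b"\r\n".join(lines)
    return "multipart/form-data; boundary=%s" % boundary, body
```

The returned content type goes in the Content-Type request header and the bytes go in the POST body; the same body works over http and https, since the encoding is independent of the transport. The sketch does not escape quotes or newlines in field names and filenames, which a real implementation would have to handle.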


> * Asynchronous fetch.  When working over the Plucker distiller, which
[...]

Nice, but not easy.  Would it not introduce a lot of new code?  There used
to be asynchttp and asyncurl libraries, I think, built on top of asyncore.
First (obviously) somebody would need to actually put the work in here.
Second, would it be possible to do this without a lot of code duplication
between the current urllib{2,} / httplib libraries and the new stuff?  Is
it worth it, when you can use threads instead?
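
As a rough illustration of the threads alternative, something like the following fetches several pages concurrently using only the existing blocking fetch code. It is a minimal sketch in modern Python 3 terms (urllib.request is the Python 3 successor to urllib2, and concurrent.futures is a later addition to the standard library); the names fetch, fetch_all, fetch_one, and max_workers are hypothetical, chosen for this example.

```python
from concurrent.futures import ThreadPoolExecutor
from urllib.request import urlopen


def fetch(url, timeout=10):
    """Blocking fetch of one URL; returns the response body as bytes."""
    with urlopen(url, timeout=timeout) as response:
        return response.read()


def fetch_all(urls, fetch_one=fetch, max_workers=8):
    """Fetch many URLs in parallel threads.

    Returns a dict mapping each URL to its body, or to the exception
    raised while fetching it, so one bad URL does not abort the rest.
    """
    results = {}
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # Submit every fetch up front; the pool runs them concurrently.
        futures = {url: pool.submit(fetch_one, url) for url in urls}
        for url, future in futures.items():
            try:
                results[url] = future.result()
            except Exception as exc:
                results[url] = exc
    return results
```

The point of the sketch is that no HTTP code is duplicated: the threads just call the ordinary blocking fetch. The cost is one thread per in-flight request, which is what an asyncore-style design would avoid.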


> * Connection caching.  Again, when pulling lots of pages from lots of
[...]

That would be nice.  Are you volunteering?


John


