[Catalog-sig] simple index and urls exracted from metadata text fields

Fri Sep 11 15:13:29 CEST 2009

Hello

Right now in a package registered at pypi, there are no distinction
between urls located in free text metadata (like description)
and metadata that are supposed to be urls.

This leads to some problems when scripts like easy_install scans the index page:
it might try to visit urls the author just put there in his
description text with no particular
intent of making it viewable.

Plus, old urls that don't work anymore are not removed, leading to
easy_install timeouts.

1. what's the purpose of having them in there ?
2. if there's a purpose, what about adding an attribute to each <a>
tag to identify from which metadata field it was extracted from ?

Cheers
Tarek

-- 
Tarek Ziadé | http://ziade.org | オープンソースの岩！