[Catalog-sig] Extra links on the PyPI /simple index package pages

P.J. Eby pje at telecommunity.com
Mon Jun 21 16:52:06 CEST 2010


At 12:57 PM 6/21/2010 +0200, M.-A. Lemburg wrote:
>So in summary, the /simple index page doesn't need to include
>any URLs from the long_description that do not have a rel
>attribute set, or end with one of the fixed set of archive extensions
>or with "#egg=...".

Such links are ignored, yes.  (The 'rel' links are only generated by 
PyPI, btw, not from the long_description.)

OTOH, I'm not sure what benefit there is to adding code that would 
specifically filter things down to just those URLs, since adding code 
always adds the potential for bugs, and the presence of those links 
is currently harmless.

(Unless of course you're so bandwidth starved that an extra few 
hundred bytes of link text is a problem... in which case, you could 
likely save even *more* bytes by stripping off the '<a' tags and 
their contents, and just serve up a text file with a series of lines 
reading 'href="..."', since setuptools is actually only looking for 
href attributes, not the tags that contain them.  That would shave a 
significant chunk of bytes off every page, not just the ones with extra links!)



More information about the Catalog-SIG mailing list