From noreply at sourceforge.net  Mon May  1 05:44:00 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Sun, 30 Apr 2006 20:44:00 -0700
Subject: [Patches] [ python-Patches-1473132 ] Improve docs for tp_clear and
	tp_traverse
Message-ID: <E1FaPKK-00085o-NP@sc8-sf-web5.sourceforge.net>

Patches item #1473132, was opened at 2006-04-19 13:43
Message generated for change (Comment added) made by collinwinter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1473132&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Documentation
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Collin Winter (collinwinter)
Assigned to: Nobody/Anonymous (nobody)
Summary: Improve docs for tp_clear and tp_traverse

Initial Comment:
The attached patch greatly enhances the documentation
for the tp_clear and tp_traverse functions. The patch
is against Doc/api/newtypes.tex, r45562.

----------------------------------------------------------------------

>Comment By: Collin Winter (collinwinter)
Date: 2006-04-30 23:43

Message:
Logged In: YES 
user_id=1344176

I've enhanced the patch per Tim Peters' comment.

----------------------------------------------------------------------

Comment By: Tim Peters (tim_one)
Date: 2006-04-22 23:30

Message:
Logged In: YES 
user_id=31435

I agree the additional info is helpful (thanks!).

Alas, there's more to it, and it's hard to know when to stop
:-(.

For example, an author of a type may _want_ to visit, e.g.,
contained strings in tp_traverse, because they want
gc.get_referents() to return the contained strings
(typically as a debugging aid).

The issues wrt to tp_clear are subtler.  The real
requirement is that the aggregate of all tp_clears called
break all possible cycles.  For one thing, that means
there's no real reason for a tp_clear to touch a member
that's known to be a Python string or integer (since such an
object can't be in a cycle, clearing it can't help to break
a cycle).  It's only tp_dealloc that _must_ drop references
to all containees.

Subtler is that a gc'ed container type may choose not to
implement tp_clear at all.  If you look, you'll see that
Python's tuple type in fact leaves its tp_clear slot empty.
 This isn't a problem because it's impossible to have a
cycle composed _solely_ of tuples (that may not be obvious,
but it's true -- it derives from that tuples are immutable).
 Any cycle a tuple may be in will be broken if the non-tuple
objects in the cycle clear their containees, so there's no
actually need for tuples to have a tp_clear.

The possibility should be mentioned, although it's fine to
recommend playing it safe.  Indeed, I don't think it buys
anything worth having for tuples not to have an obvious
tp_clear implementation.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1473132&group_id=5470

From noreply at sourceforge.net  Mon May  1 08:58:24 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Sun, 30 Apr 2006 23:58:24 -0700
Subject: [Patches] [ python-Patches-1479611 ] speed up function calls
Message-ID: <E1FaSMS-0004yD-Td@sc8-sf-web1.sourceforge.net>

Patches item #1479611, was opened at 2006-04-30 23:58
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1479611&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Neal Norwitz (nnorwitz)
Assigned to: Nobody/Anonymous (nobody)
Summary: speed up function calls

Initial Comment:
Results:  2.86% for 1 arg (len), 11.8% for 2 args
(min), and 1.6% for pybench.

trunk-speed$ ./python.exe -m timeit 'for x in
xrange(10000): len([])'
100 loops, best of 3: 4.74 msec per loop
trunk-speed$ ./python.exe -m timeit 'for x in
xrange(10000): min(1,2)'
100 loops, best of 3: 8.03 msec per loop

trunk-clean$ ./python.exe -m timeit 'for x in
xrange(10000): len([])'
100 loops, best of 3: 4.88 msec per loop
trunk-clean$ ./python.exe -m timeit 'for x in
xrange(10000): min(1,2)'
100 loops, best of 3: 9.09 msec per loop

pybench goes from 5688.00 down to 5598.00


Details about the patch:

There are 2 unrelated changes.  They both seem to
provide equal benefits for calling varargs C.  One is
very simple and just inlines calling a varargs C
function rather than calling PyCFunction_Call() which
does extra checks that are already known.  This moves
meth and self up one block. and breaks the C_TRACE into
2.  (When looking at the patch, this will make sense I
hope.)

The other change is more dangerous.  It modifies
load_args() to hold on to tuples so they aren't
allocated and deallocated.  The initialization is done
one time in the new func _PyEval_Init().

It allocates 64 tuples of size 8 that are never
deallocated.  The idea is that there won't be usually
be more than 64 frames with 8 or less parameters active
on the stack at any one time (stack depth).  There are
cases where this can degenerate, but for the most part,
it should only be marginally slower, but generally this
should be a fair amount faster by skipping the alloc
and dealloc and some extra work.  My decrementing the
_last_index inside the needs_free blocks, that could
improve behaviour.

This really needs comments added to the code.  But I'm
not gonna get there tonight.  I'd be interested in
comments about the code.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1479611&group_id=5470

From noreply at sourceforge.net  Mon May  1 09:08:03 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 01 May 2006 00:08:03 -0700
Subject: [Patches] [ python-Patches-1479611 ] speed up function calls
Message-ID: <E1FaSVn-0002BQ-Ae@sc8-sf-web4-b.sourceforge.net>

Patches item #1479611, was opened at 2006-04-30 23:58
Message generated for change (Comment added) made by nnorwitz
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1479611&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Neal Norwitz (nnorwitz)
Assigned to: Nobody/Anonymous (nobody)
Summary: speed up function calls

Initial Comment:
Results:  2.86% for 1 arg (len), 11.8% for 2 args
(min), and 1.6% for pybench.

trunk-speed$ ./python.exe -m timeit 'for x in
xrange(10000): len([])'
100 loops, best of 3: 4.74 msec per loop
trunk-speed$ ./python.exe -m timeit 'for x in
xrange(10000): min(1,2)'
100 loops, best of 3: 8.03 msec per loop

trunk-clean$ ./python.exe -m timeit 'for x in
xrange(10000): len([])'
100 loops, best of 3: 4.88 msec per loop
trunk-clean$ ./python.exe -m timeit 'for x in
xrange(10000): min(1,2)'
100 loops, best of 3: 9.09 msec per loop

pybench goes from 5688.00 down to 5598.00


Details about the patch:

There are 2 unrelated changes.  They both seem to
provide equal benefits for calling varargs C.  One is
very simple and just inlines calling a varargs C
function rather than calling PyCFunction_Call() which
does extra checks that are already known.  This moves
meth and self up one block. and breaks the C_TRACE into
2.  (When looking at the patch, this will make sense I
hope.)

The other change is more dangerous.  It modifies
load_args() to hold on to tuples so they aren't
allocated and deallocated.  The initialization is done
one time in the new func _PyEval_Init().

It allocates 64 tuples of size 8 that are never
deallocated.  The idea is that there won't be usually
be more than 64 frames with 8 or less parameters active
on the stack at any one time (stack depth).  There are
cases where this can degenerate, but for the most part,
it should only be marginally slower, but generally this
should be a fair amount faster by skipping the alloc
and dealloc and some extra work.  My decrementing the
_last_index inside the needs_free blocks, that could
improve behaviour.

This really needs comments added to the code.  But I'm
not gonna get there tonight.  I'd be interested in
comments about the code.

----------------------------------------------------------------------

>Comment By: Neal Norwitz (nnorwitz)
Date: 2006-05-01 00:08

Message:
Logged In: YES 
user_id=33168

I should note the numbers 64 and 8 are total guesses.  It
might be good to try and determine values based on empirical
data.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1479611&group_id=5470

From noreply at sourceforge.net  Mon May  1 10:27:45 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 01 May 2006 01:27:45 -0700
Subject: [Patches] [ python-Patches-1479611 ] speed up function calls
Message-ID: <E1FaTkv-0001P4-L0@sc8-sf-web4-b.sourceforge.net>

Patches item #1479611, was opened at 2006-05-01 08:58
Message generated for change (Comment added) made by loewis
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1479611&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Neal Norwitz (nnorwitz)
Assigned to: Nobody/Anonymous (nobody)
Summary: speed up function calls

Initial Comment:
Results:  2.86% for 1 arg (len), 11.8% for 2 args
(min), and 1.6% for pybench.

trunk-speed$ ./python.exe -m timeit 'for x in
xrange(10000): len([])'
100 loops, best of 3: 4.74 msec per loop
trunk-speed$ ./python.exe -m timeit 'for x in
xrange(10000): min(1,2)'
100 loops, best of 3: 8.03 msec per loop

trunk-clean$ ./python.exe -m timeit 'for x in
xrange(10000): len([])'
100 loops, best of 3: 4.88 msec per loop
trunk-clean$ ./python.exe -m timeit 'for x in
xrange(10000): min(1,2)'
100 loops, best of 3: 9.09 msec per loop

pybench goes from 5688.00 down to 5598.00


Details about the patch:

There are 2 unrelated changes.  They both seem to
provide equal benefits for calling varargs C.  One is
very simple and just inlines calling a varargs C
function rather than calling PyCFunction_Call() which
does extra checks that are already known.  This moves
meth and self up one block. and breaks the C_TRACE into
2.  (When looking at the patch, this will make sense I
hope.)

The other change is more dangerous.  It modifies
load_args() to hold on to tuples so they aren't
allocated and deallocated.  The initialization is done
one time in the new func _PyEval_Init().

It allocates 64 tuples of size 8 that are never
deallocated.  The idea is that there won't be usually
be more than 64 frames with 8 or less parameters active
on the stack at any one time (stack depth).  There are
cases where this can degenerate, but for the most part,
it should only be marginally slower, but generally this
should be a fair amount faster by skipping the alloc
and dealloc and some extra work.  My decrementing the
_last_index inside the needs_free blocks, that could
improve behaviour.

This really needs comments added to the code.  But I'm
not gonna get there tonight.  I'd be interested in
comments about the code.

----------------------------------------------------------------------

>Comment By: Martin v. L??wis (loewis)
Date: 2006-05-01 10:27

Message:
Logged In: YES 
user_id=21627

The tuples should get deallocated when Py_Finalize is called.

It would be good if there was (conditional) statistical
analysis, showing how often no tuple was found because the
number of arguments was too large, and how often no tuple
was found because the candidate was in use.

I think it should be more stack-like, starting off with no
tuples allocated, then returning them inside the needs_free
blocks only if the refcount is 1 (or 2?). This would avoid
degeneralized cases where some function holds onto its
argument tuple indefinitely, thus consuming all 64 tuples.

For the other part, I think it would make the code more
readable if it inlined PyCFunction_Call even more: the test
for NOARGS|O could be integrated into the switch statement
(one case for each), VARARGS and VARARGS|KEYWORDS would both
load the arguments, then call the function directly
(possibly with NULL keywords). OLDARGS should goto either
METH_NOARGS, METH_O, or METH_VARARGS depending on na (if you
don't like goto, modifying flags would work as well).

----------------------------------------------------------------------

Comment By: Neal Norwitz (nnorwitz)
Date: 2006-05-01 09:08

Message:
Logged In: YES 
user_id=33168

I should note the numbers 64 and 8 are total guesses.  It
might be good to try and determine values based on empirical
data.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1479611&group_id=5470

From noreply at sourceforge.net  Mon May  1 11:24:43 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 01 May 2006 02:24:43 -0700
Subject: [Patches] [ python-Patches-1473132 ] Improve docs for tp_clear and
	tp_traverse
Message-ID: <E1FaUe3-00060f-RR@sc8-sf-web5.sourceforge.net>

Patches item #1473132, was opened at 2006-04-19 19:43
Message generated for change (Comment added) made by twouters
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1473132&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Documentation
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Collin Winter (collinwinter)
Assigned to: Nobody/Anonymous (nobody)
Summary: Improve docs for tp_clear and tp_traverse

Initial Comment:
The attached patch greatly enhances the documentation
for the tp_clear and tp_traverse functions. The patch
is against Doc/api/newtypes.tex, r45562.

----------------------------------------------------------------------

>Comment By: Thomas Wouters (twouters)
Date: 2006-05-01 11:24

Message:
Logged In: YES 
user_id=34209

As Tim said, there is more to it :) I think this is a fine
start, though. One minor point: the use of Py_CLEAR() can do
with some extra explanation. It obviously isn't enough to
just 'NULL out' members, since that would leak references,
but the docs should also explain that it is in fact
important to set the actual member to NULL *before*
DECREFing the reference, and then point out that the
Py_CLEAR macro is a convenient way of doing that. That kind
of tweak can happen after it's checked in, though
(preferably by someone who can build documentation and see
that the result looks okay ;)


----------------------------------------------------------------------

Comment By: Collin Winter (collinwinter)
Date: 2006-05-01 05:43

Message:
Logged In: YES 
user_id=1344176

I've enhanced the patch per Tim Peters' comment.

----------------------------------------------------------------------

Comment By: Tim Peters (tim_one)
Date: 2006-04-23 05:30

Message:
Logged In: YES 
user_id=31435

I agree the additional info is helpful (thanks!).

Alas, there's more to it, and it's hard to know when to stop
:-(.

For example, an author of a type may _want_ to visit, e.g.,
contained strings in tp_traverse, because they want
gc.get_referents() to return the contained strings
(typically as a debugging aid).

The issues wrt to tp_clear are subtler.  The real
requirement is that the aggregate of all tp_clears called
break all possible cycles.  For one thing, that means
there's no real reason for a tp_clear to touch a member
that's known to be a Python string or integer (since such an
object can't be in a cycle, clearing it can't help to break
a cycle).  It's only tp_dealloc that _must_ drop references
to all containees.

Subtler is that a gc'ed container type may choose not to
implement tp_clear at all.  If you look, you'll see that
Python's tuple type in fact leaves its tp_clear slot empty.
 This isn't a problem because it's impossible to have a
cycle composed _solely_ of tuples (that may not be obvious,
but it's true -- it derives from that tuples are immutable).
 Any cycle a tuple may be in will be broken if the non-tuple
objects in the cycle clear their containees, so there's no
actually need for tuples to have a tp_clear.

The possibility should be mentioned, although it's fine to
recommend playing it safe.  Indeed, I don't think it buys
anything worth having for tuples not to have an obvious
tp_clear implementation.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1473132&group_id=5470

From noreply at sourceforge.net  Mon May  1 13:18:26 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 01 May 2006 04:18:26 -0700
Subject: [Patches] [ python-Patches-1474907 ] detect %zd format for
	PY_FORMAT_SIZE_T
Message-ID: <E1FaWQ6-0006hm-Te@sc8-sf-web4-b.sourceforge.net>

Patches item #1474907, was opened at 2006-04-23 08:18
Message generated for change (Settings changed) made by loewis
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1474907&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Build
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Brett Cannon (bcannon)
>Assigned to: Brett Cannon (bcannon)
Summary: detect %zd format for PY_FORMAT_SIZE_T

Initial Comment:
The patch modifies configure.in to add PY_FORMAT_SIZE_T
to configure.in (meaning you need to run autoheader on
configure.in) so that if %zd is supported for size_t it
sets PY_FORMAT_SIZE_T to "z", otherwise it goes
undefined and the preprocessor trickery in
Include/pyport.h kicks in.

This fix removes compiler warnings on OS X 10.4.6 with
gcc 4.0.1 thanks to PY_FORMAT_SIZE_T being set to "".

Initially assigned to Martin v. Loewis since he said
this would be good to do and the Py_ssize_t stuff is
his invention.

----------------------------------------------------------------------

Comment By: Brett Cannon (bcannon)
Date: 2006-04-28 06:51

Message:
Logged In: YES 
user_id=357491

Yeah, I tried to use a string constant as a stack value, but
that didn't work.  =)  My brain just was not thinking in C
when I first came up with the patch.

I have a new version that uses a char array as the buffer. 
I am on vacation so I don't have the time to apply it and
break buildbot, so I will hold off on applying if no one
finds problems with this version.

----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2006-04-27 07:29

Message:
Logged In: YES 
user_id=21627

Looks fine to me, although it has "unusual" style of C:

- sizeof(char) is guaranteed to be 1 by the C standard. The
C standard defines "char" and "byte" as synonyms, even if
that means that "byte" has more than 8 bits. sizeof gives
the number of bytes, so for char, it is always 1.

- for a fixed-size array, people would normally make this an
automatic (stack) variable, instead of bothering with
explicit memory allocation, i.e.

  char str_buffer[4]

Just out of fear of buffer overruns, many people would also
add some horrendous overallocation, such as str_buffer[1024] :-)


----------------------------------------------------------------------

Comment By: Brett Cannon (bcannon)
Date: 2006-04-27 07:16

Message:
Logged In: YES 
user_id=357491

Realized there is a better way: just strncmp() for the
expected result.  Uploaded a new version.

----------------------------------------------------------------------

Comment By: Brett Cannon (bcannon)
Date: 2006-04-27 06:59

Message:
Logged In: YES 
user_id=357491

OK, uploaded a new version that uses strchr to check for
'%', 'z', and 'd'.  If it looks reasonable I will apply it
and hope I don't break the buildbot.  =)

----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2006-04-26 18:15

Message:
Logged In: YES 
user_id=21627

The patch seems to rely on printf returning <0 for the
unrecognized format. That seems unreliable: atleast on
Linux, printf just outputs the format as-is for unrecognized
formats. Instead, I think it should use sprintf, and then
check whether the result is the string "0" (in addition to
checking whether the printf call itself failed).

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1474907&group_id=5470

From noreply at sourceforge.net  Mon May  1 21:50:49 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 01 May 2006 12:50:49 -0700
Subject: [Patches] [ python-Patches-1479977 ] Heavy revisions to urllib2
	howto
Message-ID: <E1FaePx-0001ER-Ms@sc8-sf-web2.sourceforge.net>

Patches item #1479977, was opened at 2006-05-01 20:50
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1479977&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Documentation
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: John J Lee (jjlee)
Assigned to: Nobody/Anonymous (nobody)
Summary: Heavy revisions to urllib2 howto

Initial Comment:
Lots of people have been complaining about lack of
urllib2 docs (though I'm never quite sure what people
are looking for, being too familiar with all the
details), so a tutorial may well be a useful addition.
 I'm sure you'll understand that my brutal criticism
:-) is intended to make it even more useful.

Michael: feel free to make further revisions, but
unless you have major objections I suggest that this is
checked in first, then we make any further changes
after that by uploading patches on SF for review (I
haven't stepped back and re-read it with a fresh mind,
and no doubt would be useful for somebody to do that).
 Editing this took me quite a while, and if I can help
it I don't want to go through too many revisions or
argue about the details before anything gets fixed!-).
 I've taken the liberty of mentioning myself as a
reviewer somewhere at the end of the document :-)

Important: I reformatted paragraphs to max 70 character
width (it's conventional, and plain-text diffs are
especially painful to read otherwise, though admittedly
diffs are never great for paragraphs anyway... I hope
emacs didn't muck up any ReST syntax).  I've uploaded
just that formatting change as reformatted.rst (which
also removes trailing whitespace from all lines).  This
should be done in a separate initial commit of course.
 For this reason, I've uploaded the whole document for
both reformatted (reformatted.rst) and edited versions
(edited.rst) rather than using patches.

I've made all of the changes I discuss below, *with the
exception of* the missing example of GET with
urlencoded data that's really needed (search for XXX in
the comments below) -- that should just need a few lines.

BTW, it would be a really fantastic idea to turn the
whole document into a valid doctest (I know I'm myself
almost incapable of writing correct examples unless I
do something like that).  All that would require of
course is adding a few >>>s and ...s and running it
through doctest.testfile until it stops complaining ;-)


Now a list explaining and justifying the changes I made:


Spelling / paragraph structure etc. fixes.  I won't
list these.


Most importantly, you seem a bit unsure who your
audience is.  For example, on headers -- you explain
that "HTTP is based on requests and responses", but
dive into User-Agent without actually mentioning what a
header is.  In my changes, I ended up adding brief
explanations of the concepts for people new to or fuzzy
about HTTP, but didn't go into details of
implementation.  For example, introducing the concept
of "HTTP header", but not explaining how HTTP
implements them "on the wire" (though in fact I think
it would be a good thing to add one example that showed
an HTTP request and pointed out the request line, the
headers and the data, since that makes everything very
concrete and easy to grasp for newbies).


Removed link to external howto on cookie handling. 
Despite the description ("How to handle cookies, when
fetching web pages with Python."), this actually spends
most of its time discussing what conditional imports
are needed if you want to be maximally compatible
across libraries and older versions of Python.  While
that is certainly useful for people who need that, I
think this is rather obscure and distracting detail
that seems out of place being referenced from the
Python 2.5 documentation, even in a howto.  Perhaps
some general statement that further tutorials are
available on your site?  Referencing your basic auth
tutorial seems fine.


You limit mention of urllib2.urlopen(url) to a
footnote, and in the text of the tutorial itself, you
say: """urllib2 mirrors this by having you form a
``Request``""" .  That's not true: a string URL is
fine, as you explain in the footnote.  That seems an
innaccuracy with no obvious didactic payoff.  In the
footnote, you say:

"""You *can* fetch URLs directly with urlopen, without
using a request object. It's more explicit, and
therefore more Pythonic, to use ``urllib2.Request``
though. It also makes it easier to add headers to your
request.

I find that bizarre!  Why is urlopen(url) unpythonic??
 On the contrary, using an extra object for no reason
*does* seem unpythonic to me.  I rewrote this a bit.


You needlessly assign the_url = "http:...", then
request = Request(the_url) -- why not a single line? 
Where it's useful to do that (i.e. in the more
complicated examples), I've s/the_url/url/, since I
object to chaff like "the_" in variable names ;-)


Your discussion of Request implies that it only
represents HTTP requests.  Fixed that.


Use of the word "handle" to talk about response objects
is unfortunate for two reasons: First, many objects in
Python are "handles" in some sense ("object reference"
semantics), so it's too vague to be a helpful name. 
Second, it's particularly unfortunate to use the word
"handle" when urllib2 makes heavy use of "handler"
objects that "handle" requests.  The fact that methods
on these handlers often return your "handles" only
makes things more confusing!  s/handle/response/


"""Sometimes you want to **POST** data to a CGI (Common
Gateway Interface) [#]_ or other web application"""

It's clear to us old hands what you mean here, but in a
tutorial at the level you seem to have picked we
probably shouldn't expect the reader to have all these
concepts straight, so being sloppy here is bad.

 - By "a CGI" I'm guessing you mean "a CGI
script/program".  Also, the whole sentence is unclear
whether you're talking about a web application in the
abstract, or some concrete CGI script.  I certainly
remember being very confused about this kind of thing
as a newbie.

 - "...or other web application" implies that all POSTs
go to web applications.  That's using "web application"
in a broader sense than it's usually understood.

 - You introduce "POST" without explanation.  Would be
nice to say "send data" instead of "POST", then explain
POST.

I rewrote this bit to try to address those points.


Re POST: """This is what your browser does when you
fill in a FORM on the web"""

Thats needed qualifying: form submission can also
result in a GET.


I added a bit on side-effects and GET/POST.


"""You may be mimicking a FORM submission, or
transmitting data to your own application."""

This reads oddly to me.  I know what you're getting at
(forms are not part of HTTP), but surely if you are
submitting form data you're not "mimicking" form
submission, you *are* submitting a form.  And in an
English sentence the "or" reads as an "exclusive or";
with that in mind: In what sense does form submission
*not* involve "transmitting data to your own
application"?  Reworded and s/FORM/HTML form/, since
we're talking about the abstract thing rather than
specifically about the HTML element.


"""In either case the data needs to be encoded for safe
transmission over HTTP"""

Arbitrary binary data does not need to be URL-encoded.
 Rephrased.


"""The encoding is done using a function from the
``urllib`` library *not* from ``urllib2``. ::"""

This is not true in general even for HTML forms.  For
example, HTML form file upload data is not encoded in
this way.  There are more obscure cases, too.  Noted this.


The quoted User-Agent string was out-of-date.  Fixed,
noting that it changes with each minor Python version.


Headers / data : I added a bit of explanatory context
to tell people what we're about to explain, and break
up paragraphs / add sections to clarify the structure.
 Also explained the concept of "HTTP header", as I
noted above.


XXX example needed on GET with urlencoded data (as it's
written ATM, this would go immediately before the
"Headers" section).


"""Coping With Errors"""

"Handling exceptions" seems more accurate.  Not all
HTTP status codes for which urllib2 raises an exception
involve HTTP error responses.  The text is also
confused on this point, so I rewrote it.


Errors: I believe urlopen can still actually raise
socket.error.  This is a bug, but I haven't dared to
submit a patch to fix it, fearing
backwards-compatibility issues.  I guess it should
probably be documented :-( But I suppose we should
discuss that in a separate tracker item, rather than
adding it to your howto straight away.


You mention IOError.  Without a motivating use case I
don't know why you mention this.  Since I'm not really
sure what the use case for this subclassing was ever
intended to be :-) I removed this example: feel free to
add it back if you know of a use or can get Jeremy
Hylton to explain it to you ;-)


Re URLError : you imply that the only reason for
URLError to be raised is failure to connect to the
server.  This is often the cause, but certainly not always.


For HTTP status codes, you refer to a document that
states "This is a historic document and is not accurate
anymore".  RFC 2616 is authoritative, and IMHO fairly
readable on error codes.  Removed the reference to the
other document.


"""As of Python 2.5 a dictionary like this one has
become part of ``urllib2``."""

In fact, this was moved to httplib.  The reference to
"HTTPBaseServer" (sic) is interesting: I think the copy
in httplib should be removed, since it's already there
in BaseHTTPServer (albeit missing 306, but that is
unused) -- would you mind filing a patch, Michael?

Your listing differed from BaseHTTPServer and from RFC
2616, so I replaced it with the BaseHTTPServer copy.


"""shows all the defined response codes"""

These are only those defined by RFC 2616 of course:
other standards can and do define other response status
codes (e.g. DAV).  Clarified this.


"""When an error is raised the server responds by
returning an http error code *and* an error page."""

This is sloppy: HTTP doesn't define "raising" an error,
so it can't respond to one.  Fixed.


httplib.HTTPMessage

Reworded to avoid impling it's *always* going to be
this concrete class.


"""In versions of Python prior to 2.3.4 it wasn't safe
to iterate over the object directly, so you should
iterate over the list returned by ``msg.keys()``
instead."""

Is this appropriate advice in the 2.5 docs?  I removed
this (am I too harsh on this point?).


"""Openers and handlers are slightly esoteric parts of
**urllib2**."""

I don't want to scare people off: they're easy to use
(if not to write).  Removed this.


I added a tiny bit more on what handlers do.


Changed the text to avoid implying that build_opener()
is the only way to create openers.


Don't refer to ``opener`` in those typewriter-font ReST
backticks, since that seems a little misleading: it's
not a Python class name (unfortunately the class is
named OpenerDirector, which rather clashes with the use
of the name "opener" of course, but personally I'm with
you in preferring "opener").


Wrote a bit more about opener construction.


Changed realm name to make it clear it may contain spaces.


Changed references to URI to URL in discussion of
authentication -- seems an irrelevant and distracting
distinction here.


I edited the basic auth description a little.


Comments conventionally come *before* code it refers
to, not after.  Fixed that, removed an over-obvious
comment or two (even in docs, "create the handler"
seems redundant if that's *all* it says), and the fixed
the curious line breaks.


"""The only reason to explicitly supply these to
``build_opener`` (which chains handlers provided as a
list), would be to change the order they appear in the
chain."""

I don't know of a use case for that in the case of the
handlers you list.  Also, that doesn't actually work:
handler ordering is determined by sorting.  Removed this.


"""One thing not to get bitten by is that the
``top_level_url`` in the code above *must not* contain
the protocol - the ``http://`` part. So if the URL we
are trying to access is"""

This is not correct usage (though I can see why it
worked); removed it.  Admittedly, urllib2 auth was the
subject of a quite a few bug fixes recently (I seem to
have just found yet another one five minutes ago, in
fact :-( ), so the situation pre-2.5 was certainly
messy.  However, I advise against trying to document
the old bugs!  Note that I haven't given examples of
"sub-URLs" since the RFC (2617) isn't clear to me on
this point, and I haven't yet tested whether urllib2
gets it right according to de-facto standards (as
defined by browsers, Apache, etc.)  for "sub-URLs" of
the one passed to .add_password().  It's on the list...


In your note explaining that HTTPS proxies are not
supported, you use "caution" rather than "note", which
conveys the strange implication to me that this lack of
support is somehow a consequence of using your previous
recipe for switching off proxy handling (or am I weird
in reading it that way??).  s/caution/note/


""".. [#] Possibly some of this tutorial will make it
into the standard library docs for versions of Python
after 2.4.1."""

Removed this.


Whew!


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1479977&group_id=5470

From noreply at sourceforge.net  Mon May  1 21:59:20 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 01 May 2006 12:59:20 -0700
Subject: [Patches] [ python-Patches-1479977 ] Heavy revisions to urllib2
	howto
Message-ID: <E1FaeYC-0007mD-7j@sc8-sf-web3.sourceforge.net>

Patches item #1479977, was opened at 2006-05-01 20:50
Message generated for change (Comment added) made by jjlee
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1479977&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Documentation
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: John J Lee (jjlee)
Assigned to: Nobody/Anonymous (nobody)
Summary: Heavy revisions to urllib2 howto

Initial Comment:
Lots of people have been complaining about lack of
urllib2 docs (though I'm never quite sure what people
are looking for, being too familiar with all the
details), so a tutorial may well be a useful addition.
 I'm sure you'll understand that my brutal criticism
:-) is intended to make it even more useful.

Michael: feel free to make further revisions, but
unless you have major objections I suggest that this is
checked in first, then we make any further changes
after that by uploading patches on SF for review (I
haven't stepped back and re-read it with a fresh mind,
and no doubt would be useful for somebody to do that).
 Editing this took me quite a while, and if I can help
it I don't want to go through too many revisions or
argue about the details before anything gets fixed!-).
 I've taken the liberty of mentioning myself as a
reviewer somewhere at the end of the document :-)

Important: I reformatted paragraphs to max 70 character
width (it's conventional, and plain-text diffs are
especially painful to read otherwise, though admittedly
diffs are never great for paragraphs anyway... I hope
emacs didn't muck up any ReST syntax).  I've uploaded
just that formatting change as reformatted.rst (which
also removes trailing whitespace from all lines).  This
should be done in a separate initial commit of course.
 For this reason, I've uploaded the whole document for
both reformatted (reformatted.rst) and edited versions
(edited.rst) rather than using patches.

I've made all of the changes I discuss below, *with the
exception of* the missing example of GET with
urlencoded data that's really needed (search for XXX in
the comments below) -- that should just need a few lines.

BTW, it would be a really fantastic idea to turn the
whole document into a valid doctest (I know I'm myself
almost incapable of writing correct examples unless I
do something like that).  All that would require of
course is adding a few >>>s and ...s and running it
through doctest.testfile until it stops complaining ;-)


Now a list explaining and justifying the changes I made:


Spelling / paragraph structure etc. fixes.  I won't
list these.


Most importantly, you seem a bit unsure who your
audience is.  For example, on headers -- you explain
that "HTTP is based on requests and responses", but
dive into User-Agent without actually mentioning what a
header is.  In my changes, I ended up adding brief
explanations of the concepts for people new to or fuzzy
about HTTP, but didn't go into details of
implementation.  For example, introducing the concept
of "HTTP header", but not explaining how HTTP
implements them "on the wire" (though in fact I think
it would be a good thing to add one example that showed
an HTTP request and pointed out the request line, the
headers and the data, since that makes everything very
concrete and easy to grasp for newbies).


Removed link to external howto on cookie handling. 
Despite the description ("How to handle cookies, when
fetching web pages with Python."), this actually spends
most of its time discussing what conditional imports
are needed if you want to be maximally compatible
across libraries and older versions of Python.  While
that is certainly useful for people who need that, I
think this is rather obscure and distracting detail
that seems out of place being referenced from the
Python 2.5 documentation, even in a howto.  Perhaps
some general statement that further tutorials are
available on your site?  Referencing your basic auth
tutorial seems fine.


You limit mention of urllib2.urlopen(url) to a
footnote, and in the text of the tutorial itself, you
say: """urllib2 mirrors this by having you form a
``Request``""" .  That's not true: a string URL is
fine, as you explain in the footnote.  That seems an
innaccuracy with no obvious didactic payoff.  In the
footnote, you say:

"""You *can* fetch URLs directly with urlopen, without
using a request object. It's more explicit, and
therefore more Pythonic, to use ``urllib2.Request``
though. It also makes it easier to add headers to your
request.

I find that bizarre!  Why is urlopen(url) unpythonic??
 On the contrary, using an extra object for no reason
*does* seem unpythonic to me.  I rewrote this a bit.


You needlessly assign the_url = "http:...", then
request = Request(the_url) -- why not a single line? 
Where it's useful to do that (i.e. in the more
complicated examples), I've s/the_url/url/, since I
object to chaff like "the_" in variable names ;-)


Your discussion of Request implies that it only
represents HTTP requests.  Fixed that.


Use of the word "handle" to talk about response objects
is unfortunate for two reasons: First, many objects in
Python are "handles" in some sense ("object reference"
semantics), so it's too vague to be a helpful name. 
Second, it's particularly unfortunate to use the word
"handle" when urllib2 makes heavy use of "handler"
objects that "handle" requests.  The fact that methods
on these handlers often return your "handles" only
makes things more confusing!  s/handle/response/


"""Sometimes you want to **POST** data to a CGI (Common
Gateway Interface) [#]_ or other web application"""

It's clear to us old hands what you mean here, but in a
tutorial at the level you seem to have picked we
probably shouldn't expect the reader to have all these
concepts straight, so being sloppy here is bad.

 - By "a CGI" I'm guessing you mean "a CGI
script/program".  Also, the whole sentence is unclear
whether you're talking about a web application in the
abstract, or some concrete CGI script.  I certainly
remember being very confused about this kind of thing
as a newbie.

 - "...or other web application" implies that all POSTs
go to web applications.  That's using "web application"
in a broader sense than it's usually understood.

 - You introduce "POST" without explanation.  Would be
nice to say "send data" instead of "POST", then explain
POST.

I rewrote this bit to try to address those points.


Re POST: """This is what your browser does when you
fill in a FORM on the web"""

Thats needed qualifying: form submission can also
result in a GET.


I added a bit on side-effects and GET/POST.


"""You may be mimicking a FORM submission, or
transmitting data to your own application."""

This reads oddly to me.  I know what you're getting at
(forms are not part of HTTP), but surely if you are
submitting form data you're not "mimicking" form
submission, you *are* submitting a form.  And in an
English sentence the "or" reads as an "exclusive or";
with that in mind: In what sense does form submission
*not* involve "transmitting data to your own
application"?  Reworded and s/FORM/HTML form/, since
we're talking about the abstract thing rather than
specifically about the HTML element.


"""In either case the data needs to be encoded for safe
transmission over HTTP"""

Arbitrary binary data does not need to be URL-encoded.
 Rephrased.


"""The encoding is done using a function from the
``urllib`` library *not* from ``urllib2``. ::"""

This is not true in general even for HTML forms.  For
example, HTML form file upload data is not encoded in
this way.  There are more obscure cases, too.  Noted this.


The quoted User-Agent string was out-of-date.  Fixed,
noting that it changes with each minor Python version.


Headers / data : I added a bit of explanatory context
to tell people what we're about to explain, and break
up paragraphs / add sections to clarify the structure.
 Also explained the concept of "HTTP header", as I
noted above.


XXX example needed on GET with urlencoded data (as it's
written ATM, this would go immediately before the
"Headers" section).


"""Coping With Errors"""

"Handling exceptions" seems more accurate.  Not all
HTTP status codes for which urllib2 raises an exception
involve HTTP error responses.  The text is also
confused on this point, so I rewrote it.


Errors: I believe urlopen can still actually raise
socket.error.  This is a bug, but I haven't dared to
submit a patch to fix it, fearing
backwards-compatibility issues.  I guess it should
probably be documented :-( But I suppose we should
discuss that in a separate tracker item, rather than
adding it to your howto straight away.


You mention IOError.  Without a motivating use case I
don't know why you mention this.  Since I'm not really
sure what the use case for this subclassing was ever
intended to be :-) I removed this example: feel free to
add it back if you know of a use or can get Jeremy
Hylton to explain it to you ;-)


Re URLError : you imply that the only reason for
URLError to be raised is failure to connect to the
server.  This is often the cause, but certainly not always.


For HTTP status codes, you refer to a document that
states "This is a historic document and is not accurate
anymore".  RFC 2616 is authoritative, and IMHO fairly
readable on error codes.  Removed the reference to the
other document.


"""As of Python 2.5 a dictionary like this one has
become part of ``urllib2``."""

In fact, this was moved to httplib.  The reference to
"HTTPBaseServer" (sic) is interesting: I think the copy
in httplib should be removed, since it's already there
in BaseHTTPServer (albeit missing 306, but that is
unused) -- would you mind filing a patch, Michael?

Your listing differed from BaseHTTPServer and from RFC
2616, so I replaced it with the BaseHTTPServer copy.


"""shows all the defined response codes"""

These are only those defined by RFC 2616 of course:
other standards can and do define other response status
codes (e.g. DAV).  Clarified this.


"""When an error is raised the server responds by
returning an http error code *and* an error page."""

This is sloppy: HTTP doesn't define "raising" an error,
so it can't respond to one.  Fixed.


httplib.HTTPMessage

Reworded to avoid impling it's *always* going to be
this concrete class.


"""In versions of Python prior to 2.3.4 it wasn't safe
to iterate over the object directly, so you should
iterate over the list returned by ``msg.keys()``
instead."""

Is this appropriate advice in the 2.5 docs?  I removed
this (am I too harsh on this point?).


"""Openers and handlers are slightly esoteric parts of
**urllib2**."""

I don't want to scare people off: they're easy to use
(if not to write).  Removed this.


I added a tiny bit more on what handlers do.


Changed the text to avoid implying that build_opener()
is the only way to create openers.


Don't refer to ``opener`` in those typewriter-font ReST
backticks, since that seems a little misleading: it's
not a Python class name (unfortunately the class is
named OpenerDirector, which rather clashes with the use
of the name "opener" of course, but personally I'm with
you in preferring "opener").


Wrote a bit more about opener construction.


Changed realm name to make it clear it may contain spaces.


Changed references to URI to URL in discussion of
authentication -- seems an irrelevant and distracting
distinction here.


I edited the basic auth description a little.


Comments conventionally come *before* code it refers
to, not after.  Fixed that, removed an over-obvious
comment or two (even in docs, "create the handler"
seems redundant if that's *all* it says), and the fixed
the curious line breaks.


"""The only reason to explicitly supply these to
``build_opener`` (which chains handlers provided as a
list), would be to change the order they appear in the
chain."""

I don't know of a use case for that in the case of the
handlers you list.  Also, that doesn't actually work:
handler ordering is determined by sorting.  Removed this.


"""One thing not to get bitten by is that the
``top_level_url`` in the code above *must not* contain
the protocol - the ``http://`` part. So if the URL we
are trying to access is"""

This is not correct usage (though I can see why it
worked); removed it.  Admittedly, urllib2 auth was the
subject of a quite a few bug fixes recently (I seem to
have just found yet another one five minutes ago, in
fact :-( ), so the situation pre-2.5 was certainly
messy.  However, I advise against trying to document
the old bugs!  Note that I haven't given examples of
"sub-URLs" since the RFC (2617) isn't clear to me on
this point, and I haven't yet tested whether urllib2
gets it right according to de-facto standards (as
defined by browsers, Apache, etc.)  for "sub-URLs" of
the one passed to .add_password().  It's on the list...


In your note explaining that HTTPS proxies are not
supported, you use "caution" rather than "note", which
conveys the strange implication to me that this lack of
support is somehow a consequence of using your previous
recipe for switching off proxy handling (or am I weird
in reading it that way??).  s/caution/note/


""".. [#] Possibly some of this tutorial will make it
into the standard library docs for versions of Python
after 2.4.1."""

Removed this.


Whew!


----------------------------------------------------------------------

>Comment By: John J Lee (jjlee)
Date: 2006-05-01 20:59

Message:
Logged In: YES 
user_id=261020

(I guess if I had any sense in me, I would have uploaded
those comments as an attachment instead of pasting them into
the summary -- sorry.)

I'm uploading the revised document now.


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1479977&group_id=5470

From noreply at sourceforge.net  Mon May  1 22:08:14 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 01 May 2006 13:08:14 -0700
Subject: [Patches] [ python-Patches-1479988 ] weakref dict methods
Message-ID: <E1Faego-0000GH-DQ@sc8-sf-web4-b.sourceforge.net>

Patches item #1479988, was opened at 2006-05-01 16:08
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1479988&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Fred L. Drake, Jr. (fdrake)
Assigned to: Tim Peters (tim_one)
Summary: weakref dict methods

Initial Comment:
The WeakKeyDictionary and WeakValueDictionary don't
provide any API to get just the weakrefs out, instead
of the usual mapping API.  This can be desirable when
you want to get a list of everything without creating
new references to the underlying objects at that moment.

This patch adds methods to make the references
themselves accessible using the API, avoiding requiring
client code to have to depend on the implementation. 
The WeakKeyDictionary gains the .iterkeyrefs() and
.keyrefs() methods, and the WeakValueDictionary gains
the .itervaluerefs() and .valuerefs() methods.

The patch includes tests and docs.


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1479988&group_id=5470

From noreply at sourceforge.net  Mon May  1 22:17:47 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 01 May 2006 13:17:47 -0700
Subject: [Patches] [ python-Patches-1411097 ] httplib patch to make
	_read_chunked() more robust
Message-ID: <E1Faeq3-0007GE-6V@sc8-sf-web6.sourceforge.net>

Patches item #1411097, was opened at 2006-01-20 20:26
Message generated for change (Settings changed) made by gbrandl
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1411097&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
>Category: None
>Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: John J Lee (jjlee)
Assigned to: Nobody/Anonymous (nobody)
Summary: httplib patch to make _read_chunked() more robust

Initial Comment:
To reproduce:

import urllib2
print urllib2.urlopen("http://66.117.37.13/").read()


The attached patch "fixes" the hang, but that patch is
not acceptable because it also removes the .readline()
and .readlines() methods on the response object
returned by urllib2.urlopen().

The patch seems to demonstrate that the problem is
caused by the (ab)use of socket._fileobject in
urllib2.AbstractHTTPHandler (I believe this hack was
introduced when urllib2 switched to using
httplib.HTTPConnection).

Not sure yet what the actual problem is...


----------------------------------------------------------------------

Comment By: John J Lee (jjlee)
Date: 2006-02-06 21:20

Message:
Logged In: YES 
user_id=261020

Please ignore last comment (posted to wrong tracker item).

----------------------------------------------------------------------

Comment By: John J Lee (jjlee)
Date: 2006-02-06 21:18

Message:
Logged In: YES 
user_id=261020

Conservative or not, I see no utility in changing the
default, and several major harmful effects: old code breaks,
and people have to pore over the specs to figure out why
"urlopen() doesn't work".


----------------------------------------------------------------------

Comment By: John J Lee (jjlee)
Date: 2006-02-06 20:36

Message:
Logged In: YES 
user_id=261020

I missed the fact that, if the connection will not close at
the end of the transaction, the behaviour should not change
from what's currently in SVN (we should not assume that the
chunked response has ended unless we see the proper
terminating CRLF).  I intend to upload a slightly modified
patch that tests for self._will_close, and behaves accordingly.


----------------------------------------------------------------------

Comment By: John J Lee (jjlee)
Date: 2006-02-06 01:24

Message:
Logged In: YES 
user_id=261020

Oops, fixed chunk.patch to .strip() before comparing to "".

----------------------------------------------------------------------

Comment By: John J Lee (jjlee)
Date: 2006-02-06 00:38

Message:
Logged In: YES 
user_id=261020

First, expanding a bit on what I wrote on 2006-01-21: The
problem does relate to chunked encoding, and is unrelated to
urllib2's use of _fileobject.  My hack to remove use of
socket._fileobject from urllib2 merely breaks handling of
chunked encoding by cutting httplib.HTTPResponse out of the
picture.  The problem is seen in urllib2 in recent Pythons
thanks to urllib2 switching to use of httplib.HTTPConnection
and HTTP/1.1, hence chunked encoding is allowed.  urllib
still uses httplib.HTTP, hence HTTP/1.0, so is unaffected.
To reproduce with httplib:
import httplib
conn = httplib.HTTPConnection("66.117.37.13")
conn.request("GET", "/", headers={"Connection": "close"})
r1 = conn.getresponse()
print r1.read()
The Connection: close is required -- if it's not there the
server doesn't use chunked transfer-encoding.
I verified with a packet sniffer that the problem is that
this server does not send the final trailing CRLF required
by section 3.6.1 of RFC 2616.  However, that section also
says that trailers (trailing HTTP headers) MUST NOT be sent
by the server unless either a TE header was present and
indicated that trailers are acceptable (httplib does not
send the TE header), or the trailers are optional metadata
and may be discarded by the client.  So, I propose the
attached patch to httplib (chunk.patch) as a work-around.


----------------------------------------------------------------------

Comment By: John J Lee (jjlee)
Date: 2006-01-21 22:10

Message:
Logged In: YES 
user_id=261020

In fact the commit message for rev 36871 states the real
reason _fileobject is used (handling chunked encoding),
showing my workaround is even more harmful than I thought. 
Moreover, doing a urlopen on 66.117.37.13 shows the response
*is* chunked.

The problem seems to be caused by httplib failing to find a
CRLF at the end of the chunked response, so the loop at the
end of _read_chunked() never terminates.  Haven't looked in
detail yet, but I'm guessing a) it's the server's fault and
b) httplib should work around it.


Here's the commit message from 36871:


Fix urllib2.urlopen() handling of chunked content encoding.

The change to use the newer httplib interface admitted the
possibility
that we'd get an HTTP/1.1 chunked response, but the code
didn't handle
it correctly.  The raw socket object can't be pass to
addinfourl(),
because it would read the undecoded response.  Instead,
addinfourl()
must call HTTPResponse.read(), which will handle the decoding.

One extra wrinkle is that the HTTPReponse object can't be
passed to
addinfourl() either, because it doesn't implement readline() or
readlines().  As a quick hack, use socket._fileobject(), which
implements those methods on top of a read buffer. 
(suggested by mwh)

Finally, add some tests based on test_urllibnet.

Thanks to Andrew Sawyers for originally reporting the
chunked problem.


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1411097&group_id=5470

From noreply at sourceforge.net  Mon May  1 23:20:35 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 01 May 2006 14:20:35 -0700
Subject: [Patches] [ python-Patches-1479988 ] weakref dict methods
Message-ID: <E1Fafop-0002zH-NI@sc8-sf-web3.sourceforge.net>

Patches item #1479988, was opened at 2006-05-01 16:08
Message generated for change (Comment added) made by fdrake
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1479988&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Fred L. Drake, Jr. (fdrake)
Assigned to: Tim Peters (tim_one)
Summary: weakref dict methods

Initial Comment:
The WeakKeyDictionary and WeakValueDictionary don't
provide any API to get just the weakrefs out, instead
of the usual mapping API.  This can be desirable when
you want to get a list of everything without creating
new references to the underlying objects at that moment.

This patch adds methods to make the references
themselves accessible using the API, avoiding requiring
client code to have to depend on the implementation. 
The WeakKeyDictionary gains the .iterkeyrefs() and
.keyrefs() methods, and the WeakValueDictionary gains
the .itervaluerefs() and .valuerefs() methods.

The patch includes tests and docs.


----------------------------------------------------------------------

>Comment By: Fred L. Drake, Jr. (fdrake)
Date: 2006-05-01 17:20

Message:
Logged In: YES 
user_id=3066

Tim noted in email:

http://mail.python.org/pipermail/python-dev/2006-May/064751.html

that the implementation could and probably should be
simplified.  This second version of the patch does that, and
updates the documentation to note the liveness issues of the
references, as well as avoid repetition.


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1479988&group_id=5470

From noreply at sourceforge.net  Tue May  2 00:35:47 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 01 May 2006 15:35:47 -0700
Subject: [Patches] [ python-Patches-1480067 ] urllib2 digest auth
	redirection bug causes 400 error
Message-ID: <E1Fagzb-0003cf-KO@sc8-sf-web5.sourceforge.net>

Patches item #1480067, was opened at 2006-05-01 23:35
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1480067&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: John J Lee (jjlee)
Assigned to: Nobody/Anonymous (nobody)
Summary: urllib2 digest auth redirection bug causes 400 error

Initial Comment:
urllib2 redirects HTTP digest authorisation
credentials, which is never useful (because the
redirection will change the digest), and may cause a
400 error if for example the handler finds credentials
for an initial request, but fails to finds credentials
for a redirected request.  In that case a stale
Authorization or Proxy-authorization header will get
returned to the server, causing a 400 error.

I've verified this makes the 400 go away for example in
the case where http://localhost/foo gets 301 redirected
to http://127.0.0.1/foo/ (i.e. with a slash on the
end), where I've only added username/password for
"localhost" and not "127.0.0.1".

The fix is trivial.

2.4 backport candidate.


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1480067&group_id=5470

From noreply at sourceforge.net  Tue May  2 03:22:17 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 01 May 2006 18:22:17 -0700
Subject: [Patches] [ python-Patches-1216942 ] Suggested Additional Material
	for urllib2 docs
Message-ID: <E1Fajaj-0004bP-UW@sc8-sf-web2.sourceforge.net>

Patches item #1216942, was opened at 2005-06-08 10:41
Message generated for change (Comment added) made by jjlee
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1216942&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Documentation
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Mike Foord (mjfoord)
Assigned to: Fred L. Drake, Jr. (fdrake)
Summary: Suggested Additional Material for urllib2 docs

Initial Comment:
This is some suggested additional material for the 
urllib2 docs.

Particularly the part about error codes and the 
reason/code attributes of error objects is *missing* from 
the manual and needed.

Also the example showing basic Authentication using 
password manager/handler/opener may help avoid some 
confusion.

Alternatively you can link to my online tutorials at 
http://www.voidspace.org.uk/python/articles.shtml#http

:-)


----------------------------------------------------------------------

Comment By: John J Lee (jjlee)
Date: 2006-05-02 02:22

Message:
Logged In: YES 
user_id=261020

Andrew Kuchling checked in Michael's tutorial as a howto, so
I guess this can be closed.

----------------------------------------------------------------------

Comment By: John J Lee (jjlee)
Date: 2005-12-22 22:07

Message:
Logged In: YES 
user_id=261020

Just to shout it out again: no need for said patches to
contain TeX markup!-)  Plain text / reST pasted into the
existing docs is ok (though making it clear by some means
what is a heading and what isn't &c. is obviously
desirable).  I only want a patch because that would make it
clear how the additions are intended to be integrated with
the existing docs. 


----------------------------------------------------------------------

Comment By: John J Lee (jjlee)
Date: 2005-12-22 22:01

Message:
Logged In: YES 
user_id=261020

Fred, what will you TeXify?  Are you waiting for Mike to
reply, or were you saying that you'll TeXify what he already
submitted?

Personally, I'm not happy with the original as-is, foremost
because it's not clear how it is intended to fit with the
existing docs (there are certainly other problems with the
suggested additions, but not much point going into detail
before there's a patch).  I would be happy to review / edit
at least some of the content it if it were presented as
patch(es).


----------------------------------------------------------------------

Comment By: Fred L. Drake, Jr. (fdrake)
Date: 2005-12-22 14:53

Message:
Logged In: YES 
user_id=3066

I'll TeXify.  I agree with John about reproducing the
response code listing; that's a good place to simply defer
to the HTTP spec.

----------------------------------------------------------------------

Comment By: John J Lee (jjlee)
Date: 2005-12-04 20:01

Message:
Logged In: YES 
user_id=261020

I'm sure doc improvements are welcome here, so thank you :)

However, I think you need to

1) split this up into small patches that address very
specific issues, and briefly justify each change in the
patch submission note on the SF patch tracker

2) present the patches by editing the original .tex source
files from src/Doc/lib and then running 'diff -u' or 'svn
diff'  (it doesn't matter if you can't compile the docs or
get the TeX markup wrong, just as long as everybody can see
exactly what the intended changes to the text are)

Also, one thing that caught my eye on a very brief scan was
that the actual response code->name mapping (rather than a
note to document the existence of that mapping) shouldn't be
reproduced in the docs, I think.


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1216942&group_id=5470

From noreply at sourceforge.net  Tue May  2 06:43:29 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 01 May 2006 21:43:29 -0700
Subject: [Patches] [ python-Patches-1479181 ] Split open() and file()
Message-ID: <E1FamjR-0001TU-Cd@sc8-sf-web6.sourceforge.net>

Patches item #1479181, was opened at 2006-04-29 21:12
Message generated for change (Comment added) made by nnorwitz
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1479181&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.5
>Status: Closed
>Resolution: Accepted
Priority: 5
Submitted By: Aahz (aahz)
Assigned to: Neal Norwitz (nnorwitz)
Summary: Split open() and file()

Initial Comment:
Make open() a factory function instead of an alias to file().  Includes
doc patches and a bugfix to Lib/test/test_subprocess.py (which was
relying on open() being an alias to file()).  There were no other
changes to the test suite -- this appears to be a completely transparent
fix.


----------------------------------------------------------------------

>Comment By: Neal Norwitz (nnorwitz)
Date: 2006-05-01 21:43

Message:
Logged In: YES 
user_id=33168

Minor doc mods.

Committed revision 45850.


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1479181&group_id=5470

From noreply at sourceforge.net  Tue May  2 08:13:42 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 01 May 2006 23:13:42 -0700
Subject: [Patches] [ python-Patches-1479988 ] weakref dict methods
Message-ID: <E1Fao8k-0003mf-LC@sc8-sf-web3.sourceforge.net>

Patches item #1479988, was opened at 2006-05-01 16:08
Message generated for change (Comment added) made by tim_one
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1479988&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.5
Status: Open
>Resolution: Accepted
Priority: 5
Submitted By: Fred L. Drake, Jr. (fdrake)
>Assigned to: Fred L. Drake, Jr. (fdrake)
Summary: weakref dict methods

Initial Comment:
The WeakKeyDictionary and WeakValueDictionary don't
provide any API to get just the weakrefs out, instead
of the usual mapping API.  This can be desirable when
you want to get a list of everything without creating
new references to the underlying objects at that moment.

This patch adds methods to make the references
themselves accessible using the API, avoiding requiring
client code to have to depend on the implementation. 
The WeakKeyDictionary gains the .iterkeyrefs() and
.keyrefs() methods, and the WeakValueDictionary gains
the .itervaluerefs() and .valuerefs() methods.

The patch includes tests and docs.


----------------------------------------------------------------------

>Comment By: Tim Peters (tim_one)
Date: 2006-05-02 02:13

Message:
Logged In: YES 
user_id=31435

Looks good to me, Fred, and thanks!  Marked Accepted and
back to you.

----------------------------------------------------------------------

Comment By: Fred L. Drake, Jr. (fdrake)
Date: 2006-05-01 17:20

Message:
Logged In: YES 
user_id=3066

Tim noted in email:

http://mail.python.org/pipermail/python-dev/2006-May/064751.html

that the implementation could and probably should be
simplified.  This second version of the patch does that, and
updates the documentation to note the liveness issues of the
references, as well as avoid repetition.


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1479988&group_id=5470

From noreply at sourceforge.net  Tue May  2 08:55:25 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 01 May 2006 23:55:25 -0700
Subject: [Patches] [ python-Patches-1479988 ] weakref dict methods
Message-ID: <E1Faon7-00022O-Fb@sc8-sf-web6.sourceforge.net>

Patches item #1479988, was opened at 2006-05-01 16:08
Message generated for change (Comment added) made by fdrake
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1479988&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.5
>Status: Closed
Resolution: Accepted
Priority: 5
Submitted By: Fred L. Drake, Jr. (fdrake)
Assigned to: Fred L. Drake, Jr. (fdrake)
Summary: weakref dict methods

Initial Comment:
The WeakKeyDictionary and WeakValueDictionary don't
provide any API to get just the weakrefs out, instead
of the usual mapping API.  This can be desirable when
you want to get a list of everything without creating
new references to the underlying objects at that moment.

This patch adds methods to make the references
themselves accessible using the API, avoiding requiring
client code to have to depend on the implementation. 
The WeakKeyDictionary gains the .iterkeyrefs() and
.keyrefs() methods, and the WeakValueDictionary gains
the .itervaluerefs() and .valuerefs() methods.

The patch includes tests and docs.


----------------------------------------------------------------------

>Comment By: Fred L. Drake, Jr. (fdrake)
Date: 2006-05-02 02:55

Message:
Logged In: YES 
user_id=3066

Committed in revision 45853.

----------------------------------------------------------------------

Comment By: Tim Peters (tim_one)
Date: 2006-05-02 02:13

Message:
Logged In: YES 
user_id=31435

Looks good to me, Fred, and thanks!  Marked Accepted and
back to you.

----------------------------------------------------------------------

Comment By: Fred L. Drake, Jr. (fdrake)
Date: 2006-05-01 17:20

Message:
Logged In: YES 
user_id=3066

Tim noted in email:

http://mail.python.org/pipermail/python-dev/2006-May/064751.html

that the implementation could and probably should be
simplified.  This second version of the patch does that, and
updates the documentation to note the liveness issues of the
references, as well as avoid repetition.


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1479988&group_id=5470

From noreply at sourceforge.net  Tue May  2 13:40:30 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Tue, 02 May 2006 04:40:30 -0700
Subject: [Patches] [ python-Patches-1455898 ] patch for mbcs codecs
Message-ID: <E1FatF0-0006Nn-2Y@sc8-sf-web2.sourceforge.net>

Patches item #1455898, was opened at 2006-03-22 16:31
Message generated for change (Comment added) made by ocean-city
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1455898&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Windows
Group: Python 2.4
Status: Open
Resolution: None
Priority: 7
Submitted By: Hirokazu Yamamoto (ocean-city)
Assigned to: Walter D?rwald (doerwalter)
Summary: patch for mbcs codecs

Initial Comment:
Hello.

I have noticed mbcs codecs sometimes generates broken
string. I'm using Windows(Japanese) so mbcs is mapped
to cp932 (close to shift_jis)

When I run the attached script "a.zip", the entry
"Error 00007"'s message becomes broken like attached
file "b.txt".

I think this happens because the string passed to
PyUnicode_DecodeMBCS() sometimes terminates with
leading byte, and MultiByteToWideChar() counts it for
size of result string.buffer size.

I hope attached patch "mbcs.patch" may fix the problem.
It would be nice if this bug will be fixed in 2.4.3...
Thank you.


----------------------------------------------------------------------

>Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-05-02 20:40

Message:
Logged In: YES 
user_id=1200846

I updated the patch. (I copy and pasted "int final = 0" from
above code (ex: utf_16_ex_decode), maybe they also should be
changed for consistency?)

And one more thing, I noticed "errors" is ignored now. We
can detect invalid character if we set MB_ERR_INVALID_CHARS
flag when calling MultiByteToWideChar, but we cannot tell
where is the position of invalid character, and MSDN saids
this flag is available Win2000SP4 or later (I don't know
why)
http://msdn.microsoft.com/library/default.asp?url=/library/en-us/intl/unicode_17si.asp
So I didn't make the patch for it.


----------------------------------------------------------------------

Comment By: Walter D?rwald (doerwalter)
Date: 2006-04-26 02:22

Message:
Logged In: YES 
user_id=89016

I think the default value for final in mbcs_decode() should
be true, so that the stateless decoder detects incomplete
byte sequences too.

----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-04-07 18:10

Message:
Logged In: YES 
user_id=1200846

I have sent contributor form via postal mail. Probably you
can get it after 10 days.

----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-03-28 17:16

Message:
Logged In: YES 
user_id=1200846

You are right. I've updated the patch. (mbcs5.patch)

>>> import codecs
[20198 refs]
>>> d = codecs.getincrementaldecoder("mbcs")()
[20198 refs]
>>> d.decode('\x82\xa0\x82')
u'\u3042'
[20198 refs]
>>> d.decode('')
u''
[20198 refs]
>>> d.decode('', final=True)
u'\x00'
[20198 refs]


----------------------------------------------------------------------

Comment By: Walter D?rwald (doerwalter)
Date: 2006-03-28 01:06

Message:
Logged In: YES 
user_id=89016

_buffer_decode() in the IncrementalDecoder ignores the final
argument. IncrementalDecoder._buffer_decode() should pass on
its final argument to _codecsmodules.c::mbcs_decode(), which
should be extended to accept the final argument. Also
PyUnicode_DecodeMBCSStateful() must handle consumed == NULL
correctly (with your patch it drops trailing lead bytes even
if consumed == NULL)

----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-03-27 16:41

Message:
Logged In: YES 
user_id=1200846

I replaced tests. Probably this is better instead of
comparing the two string generated by same decoder.

----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-03-27 14:44

Message:
Logged In: YES 
user_id=1200846

My real name is Hirokazu Yamamoto. But sorry, I don't have
FAX. (It's needed to send contributor form, isn't it?)

I'll attach the patch updated for trunk. And I'll attach the
tests.

----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2006-03-27 06:05

Message:
Logged In: YES 
user_id=21627

I have reservations against this patch because of the
quasi-anonymous nature of the submission. ocean-city, can
you please state your real name? Would you also be willing
to fill out a contributor form, as shown on

http://www.python.org/psf/contrib-form.html

----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-03-24 23:02

Message:
Logged In: YES 
user_id=1200846

OK, I'll try.

----------------------------------------------------------------------

Comment By: Walter D?rwald (doerwalter)
Date: 2006-03-24 06:44

Message:
Logged In: YES 
user_id=89016

This isn't a bugfix in the strictest sense, so IMHO this
patch shouldn't go into 2.4. 

If the patch goes into 2.5, it would need the appropriate
changes to encodings/mbcs.py (i.e. the IncrementalDecoder
would have to be changed (inheriting from
BufferedIncrementalDecoder).

I realize that this patch might be hard to test, as results
are dependent on locale. Nevertheless at least some tests
would be good (even if they are only run or do something
useful on a certain locale and are skipped otherwise).

ocean-city, can you update the patch for the trunk and add
tests?


----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-03-23 11:51

Message:
Logged In: YES 
user_id=1200846

Hello. This is my final patch. (mbcs4.patch)

 - mbcs3a.patch: _mbsbtype depends on locale not system ANSI
code page. so probably it's not good to use it with
MultiByteToWideChar.

 - mbcs3b.patch: CharNext may cause buffer overflow. and
this patch always calls CharPrev but it's not needed if
string is not terminated with "potensial" lead byte.

I hope this is stable enough to commit on repositry. Thank you.


----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-03-22 23:36

Message:
Logged In: YES 
user_id=1200846

Sorry, I was stupid.

MSDN
(http://msdn.microsoft.com/library/default.asp?url=/library/en-us/intl/unicode_0o2t.asp)
saids,

> IsDBCSLeadByte can only indicate a potential lead byte value. 

IsDBCSLeadByte was returning 1 for some trail byte (ex: "???"[1])

The patch "mbcs3a.patch" worked for me, but _mbsbtype is
probably compiler specific. Is that OK?

The patch "mbcs3b.patch" also worked for me and it only uses
Win32API, but I don't have enough faith on this
implementation...


----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-03-22 19:31

Message:
Logged In: YES 
user_id=1200846

Sorry, I found problem when tried more long text file...
Please wait. I'll investigate more intensibly.

----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-03-22 19:13

Message:
Logged In: YES 
user_id=1200846

Thank you for reply. How about this? (I'm a newbie, I hope
this is right tex format but... can you confirm this? I
created this patch by copy & paste from
PyUnicode_DecodeUTF16Stateful and some modification)


----------------------------------------------------------------------

Comment By: M.-A. Lemburg (lemburg)
Date: 2006-03-22 18:12

Message:
Logged In: YES 
user_id=38388

One more nit: the doc patch is missing. Please add a patch
for the API docs.


----------------------------------------------------------------------

Comment By: M.-A. Lemburg (lemburg)
Date: 2006-03-22 18:11

Message:
Logged In: YES 
user_id=38388

As I understand your comment, the mbcs codec will have a
problem if the input string terminates with a lead byte.

Could you add a comment regarding this to the patch ?!

I can't test the patch, since I don't have a Japanese
Windows to check on, but from looking at the patch, it seems OK.


----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-03-22 16:42

Message:
Logged In: YES 
user_id=1200846

I forgot to mention this. "mbcs.patch" is for
release24-maint branch.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1455898&group_id=5470

From noreply at sourceforge.net  Wed May  3 07:05:19 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Tue, 02 May 2006 22:05:19 -0700
Subject: [Patches] [ python-Patches-1480067 ] urllib2 digest auth
	redirection bug causes 400 error
Message-ID: <E1Fb9Y7-00088e-DA@sc8-sf-web4-b.sourceforge.net>

Patches item #1480067, was opened at 2006-05-01 22:35
Message generated for change (Settings changed) made by gbrandl
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1480067&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.5
>Status: Closed
>Resolution: Accepted
Priority: 5
Submitted By: John J Lee (jjlee)
Assigned to: Nobody/Anonymous (nobody)
Summary: urllib2 digest auth redirection bug causes 400 error

Initial Comment:
urllib2 redirects HTTP digest authorisation
credentials, which is never useful (because the
redirection will change the digest), and may cause a
400 error if for example the handler finds credentials
for an initial request, but fails to finds credentials
for a redirected request.  In that case a stale
Authorization or Proxy-authorization header will get
returned to the server, causing a 400 error.

I've verified this makes the 400 go away for example in
the case where http://localhost/foo gets 301 redirected
to http://127.0.0.1/foo/ (i.e. with a slash on the
end), where I've only added username/password for
"localhost" and not "127.0.0.1".

The fix is trivial.

2.4 backport candidate.


----------------------------------------------------------------------

>Comment By: Georg Brandl (gbrandl)
Date: 2006-05-03 05:05

Message:
Logged In: YES 
user_id=849994

Committed as rev. 45879, 45880 (2.4).

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1480067&group_id=5470

From noreply at sourceforge.net  Wed May  3 07:17:48 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Tue, 02 May 2006 22:17:48 -0700
Subject: [Patches] [ python-Patches-1411097 ] httplib patch to make
	_read_chunked() more robust
Message-ID: <E1Fb9kC-0000qV-S7@sc8-sf-web1.sourceforge.net>

Patches item #1411097, was opened at 2006-01-20 20:26
Message generated for change (Comment added) made by gbrandl
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1411097&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: None
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: John J Lee (jjlee)
Assigned to: Nobody/Anonymous (nobody)
Summary: httplib patch to make _read_chunked() more robust

Initial Comment:
To reproduce:

import urllib2
print urllib2.urlopen("http://66.117.37.13/").read()


The attached patch "fixes" the hang, but that patch is
not acceptable because it also removes the .readline()
and .readlines() methods on the response object
returned by urllib2.urlopen().

The patch seems to demonstrate that the problem is
caused by the (ab)use of socket._fileobject in
urllib2.AbstractHTTPHandler (I believe this hack was
introduced when urllib2 switched to using
httplib.HTTPConnection).

Not sure yet what the actual problem is...


----------------------------------------------------------------------

>Comment By: Georg Brandl (gbrandl)
Date: 2006-05-03 05:17

Message:
Logged In: YES 
user_id=849994

Are you still working on your slightly modified patch?

----------------------------------------------------------------------

Comment By: John J Lee (jjlee)
Date: 2006-02-06 21:20

Message:
Logged In: YES 
user_id=261020

Please ignore last comment (posted to wrong tracker item).

----------------------------------------------------------------------

Comment By: John J Lee (jjlee)
Date: 2006-02-06 21:18

Message:
Logged In: YES 
user_id=261020

Conservative or not, I see no utility in changing the
default, and several major harmful effects: old code breaks,
and people have to pore over the specs to figure out why
"urlopen() doesn't work".


----------------------------------------------------------------------

Comment By: John J Lee (jjlee)
Date: 2006-02-06 20:36

Message:
Logged In: YES 
user_id=261020

I missed the fact that, if the connection will not close at
the end of the transaction, the behaviour should not change
from what's currently in SVN (we should not assume that the
chunked response has ended unless we see the proper
terminating CRLF).  I intend to upload a slightly modified
patch that tests for self._will_close, and behaves accordingly.


----------------------------------------------------------------------

Comment By: John J Lee (jjlee)
Date: 2006-02-06 01:24

Message:
Logged In: YES 
user_id=261020

Oops, fixed chunk.patch to .strip() before comparing to "".

----------------------------------------------------------------------

Comment By: John J Lee (jjlee)
Date: 2006-02-06 00:38

Message:
Logged In: YES 
user_id=261020

First, expanding a bit on what I wrote on 2006-01-21: The
problem does relate to chunked encoding, and is unrelated to
urllib2's use of _fileobject.  My hack to remove use of
socket._fileobject from urllib2 merely breaks handling of
chunked encoding by cutting httplib.HTTPResponse out of the
picture.  The problem is seen in urllib2 in recent Pythons
thanks to urllib2 switching to use of httplib.HTTPConnection
and HTTP/1.1, hence chunked encoding is allowed.  urllib
still uses httplib.HTTP, hence HTTP/1.0, so is unaffected.
To reproduce with httplib:
import httplib
conn = httplib.HTTPConnection("66.117.37.13")
conn.request("GET", "/", headers={"Connection": "close"})
r1 = conn.getresponse()
print r1.read()
The Connection: close is required -- if it's not there the
server doesn't use chunked transfer-encoding.
I verified with a packet sniffer that the problem is that
this server does not send the final trailing CRLF required
by section 3.6.1 of RFC 2616.  However, that section also
says that trailers (trailing HTTP headers) MUST NOT be sent
by the server unless either a TE header was present and
indicated that trailers are acceptable (httplib does not
send the TE header), or the trailers are optional metadata
and may be discarded by the client.  So, I propose the
attached patch to httplib (chunk.patch) as a work-around.


----------------------------------------------------------------------

Comment By: John J Lee (jjlee)
Date: 2006-01-21 22:10

Message:
Logged In: YES 
user_id=261020

In fact the commit message for rev 36871 states the real
reason _fileobject is used (handling chunked encoding),
showing my workaround is even more harmful than I thought. 
Moreover, doing a urlopen on 66.117.37.13 shows the response
*is* chunked.

The problem seems to be caused by httplib failing to find a
CRLF at the end of the chunked response, so the loop at the
end of _read_chunked() never terminates.  Haven't looked in
detail yet, but I'm guessing a) it's the server's fault and
b) httplib should work around it.


Here's the commit message from 36871:


Fix urllib2.urlopen() handling of chunked content encoding.

The change to use the newer httplib interface admitted the
possibility
that we'd get an HTTP/1.1 chunked response, but the code
didn't handle
it correctly.  The raw socket object can't be pass to
addinfourl(),
because it would read the undecoded response.  Instead,
addinfourl()
must call HTTPResponse.read(), which will handle the decoding.

One extra wrinkle is that the HTTPReponse object can't be
passed to
addinfourl() either, because it doesn't implement readline() or
readlines().  As a quick hack, use socket._fileobject(), which
implements those methods on top of a read buffer. 
(suggested by mwh)

Finally, add some tests based on test_urllibnet.

Thanks to Andrew Sawyers for originally reporting the
chunked problem.


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1411097&group_id=5470

From noreply at sourceforge.net  Wed May  3 13:59:13 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 03 May 2006 04:59:13 -0700
Subject: [Patches] [ python-Patches-1481032 ] patch smtplib:when
	SMTPDataError, rset crashes with sslerror
Message-ID: <E1FbG0f-0006vM-NH@sc8-sf-web6.sourceforge.net>

Patches item #1481032, was opened at 2006-05-03 13:59
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1481032&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.4
Status: Open
Resolution: None
Priority: 5
Submitted By: kxroberto (kxroberto)
Assigned to: Nobody/Anonymous (nobody)
Summary: patch smtplib:when SMTPDataError, rset crashes with sslerror

Initial Comment:


  File "smtplib.pyc", line 690, in sendmail
  File "smtplib.pyc", line 445, in rset
  File "smtplib.pyc", line 370, in docmd
  File "smtplib.pyc", line 344, in getreply
  File "smtplib.pyc", line 159, in readline
sslerror: (8, 'EOF occurred in violation of protocol')

traced from a py2.3 - yet unchanged.  

=> hides original error SMTPDataError. such
SMTPDataError may have forced a disconnect of server. 

patch for py2.3,py2.4,..

( I have this patch in my MUST-DO-PATCHes after any
Python installation. )

--

the patch passes on any error in this rset() location.
it also patches a error in PLAIN authentication

could cleanly catch socket.sslerror / socket.error /
EnvironmentError, yet ssl not always there ... 
rset() is tested otherwise, so the nacked except is ok?

(there should maybe be special a common base Exception
class "IOBaseError" for :   EnvironmentError(IOError,
OSError),  EOFError, socket.error, socket.sslerror,
ftplib.all_errors, etc.  Nice as it and not all IO
sublibs have to be imported to catch such errors.)

---

the same problem is with smtp.quit() on many SSL'ed
connections (without any other errors occuring): a
final socket.sslerror is raised during quit(). 
There may be a problem in the termination code of
ssl-FakeSockets or ssl.c . Or the same type of error
catch (on IOBaseError) should be applied. I am not sure
 - in my apps I (must) catch on smtp.quit() generally. 

(Compare also bug #978833 / shutdown(2)-remedy in
httplib's SSL FakeSocket - this shutdown(2) remedy
patch of #978833 I still have it on my MUST list
(py2.3/py2.4 installations), otherwise this FakeSocket
doesn't close fully in a FTPS application (where
termination on data channel is crucial for getting
response on the control channel) - and most likely puts
tremendous connection load on HTTPS servers because of
stale unterminated HTTPS connections while the bug may
not be obvious in casual usage. I'm not completely
clear about the nature of this error. Thus, what I say
is based on trial-and-error.

-robert


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1481032&group_id=5470

From noreply at sourceforge.net  Wed May  3 15:01:15 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 03 May 2006 06:01:15 -0700
Subject: [Patches] [ python-Patches-1481079 ] Support HTTP_REFERER in
	CGIHTTPServer.py
Message-ID: <E1FbGyh-0001V4-Gy@sc8-sf-web4-b.sourceforge.net>

Patches item #1481079, was opened at 2006-05-03 15:01
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1481079&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: S??bastien Martini (ookoi)
Assigned to: Nobody/Anonymous (nobody)
Summary: Support HTTP_REFERER in CGIHTTPServer.py

Initial Comment:
In CGIHTTPServer.py simply put the referer's value
(obtained from headers) in os.env associated to the key
'HTTP_REFERER'.


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1481079&group_id=5470

From noreply at sourceforge.net  Wed May  3 15:59:26 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 03 May 2006 06:59:26 -0700
Subject: [Patches] [ python-Patches-1481112 ] Python long option support
Message-ID: <E1FbHt0-0006X6-7a@sc8-sf-web5.sourceforge.net>

Patches item #1481112, was opened at 2006-05-03 15:59
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1481112&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.4
Status: Open
Resolution: None
Priority: 5
Submitted By: Heiko Wundram (hwundram)
Assigned to: Nobody/Anonymous (nobody)
Summary: Python long option support

Initial Comment:
The attached patch implements long option support for
Python. It changes the optstring found in
Modules/main.c, specifying brackets for the long name
of a corresponding option name. The patch is backward
compatible in that it doesn't change the behaviour of
_PyOS_GetOpt for any old format string, except on
[:()], which are explicitly excluded for matching an
option. This shouldn't break any code, though.

The patch is against Python 2.4.3.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1481112&group_id=5470

From noreply at sourceforge.net  Wed May  3 17:34:12 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 03 May 2006 08:34:12 -0700
Subject: [Patches] [ python-Patches-1481079 ] Support of HTTP_REFERER in
	CGIHTTPServer.py
Message-ID: <E1FbJMi-0002M6-Jc@sc8-sf-web5.sourceforge.net>

Patches item #1481079, was opened at 2006-05-03 15:01
Message generated for change (Settings changed) made by ookoi
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1481079&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: S??bastien Martini (ookoi)
Assigned to: Nobody/Anonymous (nobody)
>Summary: Support of HTTP_REFERER in CGIHTTPServer.py

Initial Comment:
In CGIHTTPServer.py simply put the referer's value
(obtained from headers) in os.env associated to the key
'HTTP_REFERER'.


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1481079&group_id=5470

From noreply at sourceforge.net  Wed May  3 19:47:22 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 03 May 2006 10:47:22 -0700
Subject: [Patches] [ python-Patches-1143695 ] Fix to allow urllib2 digest
	auth to talk to livejournal.com
Message-ID: <E1FbLRa-0003Xm-V3@sc8-sf-web3.sourceforge.net>

Patches item #1143695, was opened at 2005-02-18 11:14
Message generated for change (Settings changed) made by gbrandl
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1143695&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.4
>Status: Closed
>Resolution: Fixed
Priority: 5
Submitted By: Benno Rice (benno)
Assigned to: Nobody/Anonymous (nobody)
Summary: Fix to allow urllib2 digest auth to talk to livejournal.com

Initial Comment:
When trying to use feedparser.py to deal with RSS feeds
from livejournal.com using digest auth (needed to
access locked posts), urllib2 would report a digest
auth failure.  The solution appears to be to always
specify the algorithm, even when it's MD5.

----------------------------------------------------------------------

Comment By: John J Lee (jjlee)
Date: 2006-04-15 14:49

Message:
Logged In: YES 
user_id=261020

This was fixed in revision 38092.

----------------------------------------------------------------------

Comment By: John J Lee (jjlee)
Date: 2005-02-20 02:01

Message:
Logged In: YES 
user_id=261020

Patch appears correct, and RFC 2617 allows always sending
the algorithm.  Haven't tested the patch, or verified that
real browsers do indeed always send algorithm even when it'd
MD5.


----------------------------------------------------------------------

Comment By: Benno Rice (benno)
Date: 2005-02-19 09:02

Message:
Logged In: YES 
user_id=9925

SourceForge is teh suxx0r.

----------------------------------------------------------------------

Comment By: Anthony Baxter (anthonybaxter)
Date: 2005-02-19 08:57

Message:
Logged In: YES 
user_id=29957

There's no uploaded file!  You have to check the
checkbox labeled "Check to Upload & Attach File"
when you upload a file. In addition, even if you
*did* check this checkbox, a bug in SourceForge
prevents attaching a file when *creating* an issue.

Please try again.

(This is a SourceForge annoyance that we can do
nothing about. :-( )

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1143695&group_id=5470

From noreply at sourceforge.net  Wed May  3 20:13:58 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 03 May 2006 11:13:58 -0700
Subject: [Patches] [ python-Patches-1472184 ] pdb: fix for #1472191('clear'
	command bug)
Message-ID: <E1FbLrK-0000AT-Qf@sc8-sf-web6.sourceforge.net>

Patches item #1472184, was opened at 2006-04-18 09:38
Message generated for change (Comment added) made by gbrandl
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1472184&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.5
>Status: Closed
>Resolution: None
Priority: 6
Submitted By: Kuba Ko??czyk (jakamkon)
Assigned to: Nobody/Anonymous (nobody)
Summary: pdb: fix for #1472191('clear' command bug)

Initial Comment:
Pdb 'clear x' command doesn't clear selected breakpoints
that are already set:

$ ./python -m pdb ../test.py
> /home/xyz/python/test.py(3)<module>()
-> def t(x):
(Pdb) break 5
Breakpoint 1 at /home/xyz/python/test.py:5
(Pdb) break
Num Type         Disp Enb   Where
1   breakpoint   keep yes   at /home/xyz/python/test.py:5
(Pdb) clear 1
No breakpoint numbered 1
(Pdb)                    
...
  for i in numberlist:
*  if not (0 <= i < len(bdb.Breakpoint.bpbynumber)): 
    print 'No breakpoint numbered', i
...

Each i is a string and it's compared to 0 and len(...),
so condition * is always True. 
The fix is trivial:
*  if not (0 <= int(i) < len(bdb.Breakpoint.bpbynumber)):


----------------------------------------------------------------------

>Comment By: Georg Brandl (gbrandl)
Date: 2006-05-03 18:13

Message:
Logged In: YES 
user_id=849994

Problem was fixed in rev. 45891, 45892(2.4), including
better error handling.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1472184&group_id=5470

From noreply at sourceforge.net  Wed May  3 20:37:42 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 03 May 2006 11:37:42 -0700
Subject: [Patches] [ python-Patches-1481112 ] Python long option support
Message-ID: <E1FbMEI-0002eX-Hn@sc8-sf-web2.sourceforge.net>

Patches item #1481112, was opened at 2006-05-03 15:59
Message generated for change (Comment added) made by hwundram
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1481112&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.4
Status: Open
Resolution: None
Priority: 5
Submitted By: Heiko Wundram (hwundram)
Assigned to: Nobody/Anonymous (nobody)
Summary: Python long option support

Initial Comment:
The attached patch implements long option support for
Python. It changes the optstring found in
Modules/main.c, specifying brackets for the long name
of a corresponding option name. The patch is backward
compatible in that it doesn't change the behaviour of
_PyOS_GetOpt for any old format string, except on
[:()], which are explicitly excluded for matching an
option. This shouldn't break any code, though.

The patch is against Python 2.4.3.

----------------------------------------------------------------------

>Comment By: Heiko Wundram (hwundram)
Date: 2006-05-03 20:37

Message:
Logged In: YES 
user_id=791932

The attached patch is against the current subversion trunk,
and implements long options with possible arguments after an
=-sign. It has partial matching for options, erroring out on
ambiguous matches of the command line arguments.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1481112&group_id=5470

From noreply at sourceforge.net  Wed May  3 20:46:10 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 03 May 2006 11:46:10 -0700
Subject: [Patches] [ python-Patches-1481304 ] Cleaned up 16x16px icons for
	windows.
Message-ID: <E1FbMMU-0002h9-Gp@sc8-sf-web1.sourceforge.net>

Patches item #1481304, was opened at 2006-05-03 20:46
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1481304&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Windows
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: goxe (goxe)
Assigned to: Nobody/Anonymous (nobody)
Summary: Cleaned up 16x16px icons for windows.

Initial Comment:

Since the currently distributed icon files only 
include 32x32px images, Windows resizes them where 
16x16px is needed. With the predictable result that 
they look blurred and dark.

The attached icons include 16x16px versions of the 
current icons. It's the same friendly-snake-icon as 
always, just prettier in small sizes.


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1481304&group_id=5470

From noreply at sourceforge.net  Wed May  3 20:53:38 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 03 May 2006 11:53:38 -0700
Subject: [Patches] [ python-Patches-1481112 ] Python long option support
Message-ID: <E1FbMTi-0007Ml-Ae@sc8-sf-web3.sourceforge.net>

Patches item #1481112, was opened at 2006-05-03 13:59
Message generated for change (Comment added) made by gbrandl
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1481112&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.4
Status: Open
Resolution: None
Priority: 5
Submitted By: Heiko Wundram (hwundram)
>Assigned to: Martin v. L??wis (loewis)
Summary: Python long option support

Initial Comment:
The attached patch implements long option support for
Python. It changes the optstring found in
Modules/main.c, specifying brackets for the long name
of a corresponding option name. The patch is backward
compatible in that it doesn't change the behaviour of
_PyOS_GetOpt for any old format string, except on
[:()], which are explicitly excluded for matching an
option. This shouldn't break any code, though.

The patch is against Python 2.4.3.

----------------------------------------------------------------------

>Comment By: Georg Brandl (gbrandl)
Date: 2006-05-03 18:53

Message:
Logged In: YES 
user_id=849994

Please update the patch to follow Python C style guidelines
(PEP 7), especially use 8-space tabs to indent.

Also, it might be good to send a copyright assignment to the
PSF for a patch of this magnitude.

Otherwise, I think that this is a desirable feature, at
least for --help and --version.

----------------------------------------------------------------------

Comment By: Heiko Wundram (hwundram)
Date: 2006-05-03 18:37

Message:
Logged In: YES 
user_id=791932

The attached patch is against the current subversion trunk,
and implements long options with possible arguments after an
=-sign. It has partial matching for options, erroring out on
ambiguous matches of the command line arguments.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1481112&group_id=5470

From noreply at sourceforge.net  Wed May  3 20:57:35 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 03 May 2006 11:57:35 -0700
Subject: [Patches] [ python-Patches-1411097 ] httplib patch to make
	_read_chunked() more robust
Message-ID: <E1FbMXX-0000d3-0M@sc8-sf-web3.sourceforge.net>

Patches item #1411097, was opened at 2006-01-20 20:26
Message generated for change (Comment added) made by jjlee
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1411097&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: None
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: John J Lee (jjlee)
Assigned to: Nobody/Anonymous (nobody)
Summary: httplib patch to make _read_chunked() more robust

Initial Comment:
To reproduce:

import urllib2
print urllib2.urlopen("http://66.117.37.13/").read()


The attached patch "fixes" the hang, but that patch is
not acceptable because it also removes the .readline()
and .readlines() methods on the response object
returned by urllib2.urlopen().

The patch seems to demonstrate that the problem is
caused by the (ab)use of socket._fileobject in
urllib2.AbstractHTTPHandler (I believe this hack was
introduced when urllib2 switched to using
httplib.HTTPConnection).

Not sure yet what the actual problem is...


----------------------------------------------------------------------

>Comment By: John J Lee (jjlee)
Date: 2006-05-03 19:57

Message:
Logged In: YES 
user_id=261020

I *hope* to get back to it soon.  But if anybody beats me to
it, that's fine :-)

One problem: I don't understand the need for
HTTPConnection._safe_read(), rather than checking for an
EINTR resulting from the recv() call (or WSAEINTR on
Windows).  Can anybody explain that?


----------------------------------------------------------------------

Comment By: Georg Brandl (gbrandl)
Date: 2006-05-03 06:17

Message:
Logged In: YES 
user_id=849994

Are you still working on your slightly modified patch?

----------------------------------------------------------------------

Comment By: John J Lee (jjlee)
Date: 2006-02-06 21:20

Message:
Logged In: YES 
user_id=261020

Please ignore last comment (posted to wrong tracker item).

----------------------------------------------------------------------

Comment By: John J Lee (jjlee)
Date: 2006-02-06 21:18

Message:
Logged In: YES 
user_id=261020

Conservative or not, I see no utility in changing the
default, and several major harmful effects: old code breaks,
and people have to pore over the specs to figure out why
"urlopen() doesn't work".


----------------------------------------------------------------------

Comment By: John J Lee (jjlee)
Date: 2006-02-06 20:36

Message:
Logged In: YES 
user_id=261020

I missed the fact that, if the connection will not close at
the end of the transaction, the behaviour should not change
from what's currently in SVN (we should not assume that the
chunked response has ended unless we see the proper
terminating CRLF).  I intend to upload a slightly modified
patch that tests for self._will_close, and behaves accordingly.


----------------------------------------------------------------------

Comment By: John J Lee (jjlee)
Date: 2006-02-06 01:24

Message:
Logged In: YES 
user_id=261020

Oops, fixed chunk.patch to .strip() before comparing to "".

----------------------------------------------------------------------

Comment By: John J Lee (jjlee)
Date: 2006-02-06 00:38

Message:
Logged In: YES 
user_id=261020

First, expanding a bit on what I wrote on 2006-01-21: The
problem does relate to chunked encoding, and is unrelated to
urllib2's use of _fileobject.  My hack to remove use of
socket._fileobject from urllib2 merely breaks handling of
chunked encoding by cutting httplib.HTTPResponse out of the
picture.  The problem is seen in urllib2 in recent Pythons
thanks to urllib2 switching to use of httplib.HTTPConnection
and HTTP/1.1, hence chunked encoding is allowed.  urllib
still uses httplib.HTTP, hence HTTP/1.0, so is unaffected.
To reproduce with httplib:
import httplib
conn = httplib.HTTPConnection("66.117.37.13")
conn.request("GET", "/", headers={"Connection": "close"})
r1 = conn.getresponse()
print r1.read()
The Connection: close is required -- if it's not there the
server doesn't use chunked transfer-encoding.
I verified with a packet sniffer that the problem is that
this server does not send the final trailing CRLF required
by section 3.6.1 of RFC 2616.  However, that section also
says that trailers (trailing HTTP headers) MUST NOT be sent
by the server unless either a TE header was present and
indicated that trailers are acceptable (httplib does not
send the TE header), or the trailers are optional metadata
and may be discarded by the client.  So, I propose the
attached patch to httplib (chunk.patch) as a work-around.


----------------------------------------------------------------------

Comment By: John J Lee (jjlee)
Date: 2006-01-21 22:10

Message:
Logged In: YES 
user_id=261020

In fact the commit message for rev 36871 states the real
reason _fileobject is used (handling chunked encoding),
showing my workaround is even more harmful than I thought. 
Moreover, doing a urlopen on 66.117.37.13 shows the response
*is* chunked.

The problem seems to be caused by httplib failing to find a
CRLF at the end of the chunked response, so the loop at the
end of _read_chunked() never terminates.  Haven't looked in
detail yet, but I'm guessing a) it's the server's fault and
b) httplib should work around it.


Here's the commit message from 36871:


Fix urllib2.urlopen() handling of chunked content encoding.

The change to use the newer httplib interface admitted the
possibility
that we'd get an HTTP/1.1 chunked response, but the code
didn't handle
it correctly.  The raw socket object can't be pass to
addinfourl(),
because it would read the undecoded response.  Instead,
addinfourl()
must call HTTPResponse.read(), which will handle the decoding.

One extra wrinkle is that the HTTPReponse object can't be
passed to
addinfourl() either, because it doesn't implement readline() or
readlines().  As a quick hack, use socket._fileobject(), which
implements those methods on top of a read buffer. 
(suggested by mwh)

Finally, add some tests based on test_urllibnet.

Thanks to Andrew Sawyers for originally reporting the
chunked problem.


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1411097&group_id=5470

From noreply at sourceforge.net  Wed May  3 20:59:59 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 03 May 2006 11:59:59 -0700
Subject: [Patches] [ python-Patches-1481112 ] Python long option support
Message-ID: <E1FbMZr-00022Y-Pa@sc8-sf-web5.sourceforge.net>

Patches item #1481112, was opened at 2006-05-03 15:59
Message generated for change (Comment added) made by hwundram
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1481112&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.4
Status: Open
Resolution: None
Priority: 5
Submitted By: Heiko Wundram (hwundram)
>Assigned to: Nobody/Anonymous (nobody)
Summary: Python long option support

Initial Comment:
The attached patch implements long option support for
Python. It changes the optstring found in
Modules/main.c, specifying brackets for the long name
of a corresponding option name. The patch is backward
compatible in that it doesn't change the behaviour of
_PyOS_GetOpt for any old format string, except on
[:()], which are explicitly excluded for matching an
option. This shouldn't break any code, though.

The patch is against Python 2.4.3.

----------------------------------------------------------------------

>Comment By: Heiko Wundram (hwundram)
Date: 2006-05-03 20:59

Message:
Logged In: YES 
user_id=791932

I'll redo the patch with vim now, emacs doesn't like doing 8
spaces indents, at least as far as I can get it configured...

Anyway, I assign the copyright to any code contained in this
patch to the PSF.

----------------------------------------------------------------------

Comment By: Georg Brandl (gbrandl)
Date: 2006-05-03 20:53

Message:
Logged In: YES 
user_id=849994

Please update the patch to follow Python C style guidelines
(PEP 7), especially use 8-space tabs to indent.

Also, it might be good to send a copyright assignment to the
PSF for a patch of this magnitude.

Otherwise, I think that this is a desirable feature, at
least for --help and --version.

----------------------------------------------------------------------

Comment By: Heiko Wundram (hwundram)
Date: 2006-05-03 20:37

Message:
Logged In: YES 
user_id=791932

The attached patch is against the current subversion trunk,
and implements long options with possible arguments after an
=-sign. It has partial matching for options, erroring out on
ambiguous matches of the command line arguments.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1481112&group_id=5470

From noreply at sourceforge.net  Wed May  3 21:15:22 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 03 May 2006 12:15:22 -0700
Subject: [Patches] [ python-Patches-1481112 ] Python long option support
Message-ID: <E1FbMok-0008Jd-3A@sc8-sf-web5.sourceforge.net>

Patches item #1481112, was opened at 2006-05-03 15:59
Message generated for change (Settings changed) made by hwundram
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1481112&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.4
Status: Open
Resolution: None
Priority: 5
Submitted By: Heiko Wundram (hwundram)
>Assigned to: Martin v. L??wis (loewis)
Summary: Python long option support

Initial Comment:
The attached patch implements long option support for
Python. It changes the optstring found in
Modules/main.c, specifying brackets for the long name
of a corresponding option name. The patch is backward
compatible in that it doesn't change the behaviour of
_PyOS_GetOpt for any old format string, except on
[:()], which are explicitly excluded for matching an
option. This shouldn't break any code, though.

The patch is against Python 2.4.3.

----------------------------------------------------------------------

Comment By: Heiko Wundram (hwundram)
Date: 2006-05-03 20:59

Message:
Logged In: YES 
user_id=791932

I'll redo the patch with vim now, emacs doesn't like doing 8
spaces indents, at least as far as I can get it configured...

Anyway, I assign the copyright to any code contained in this
patch to the PSF.

----------------------------------------------------------------------

Comment By: Georg Brandl (gbrandl)
Date: 2006-05-03 20:53

Message:
Logged In: YES 
user_id=849994

Please update the patch to follow Python C style guidelines
(PEP 7), especially use 8-space tabs to indent.

Also, it might be good to send a copyright assignment to the
PSF for a patch of this magnitude.

Otherwise, I think that this is a desirable feature, at
least for --help and --version.

----------------------------------------------------------------------

Comment By: Heiko Wundram (hwundram)
Date: 2006-05-03 20:37

Message:
Logged In: YES 
user_id=791932

The attached patch is against the current subversion trunk,
and implements long options with possible arguments after an
=-sign. It has partial matching for options, erroring out on
ambiguous matches of the command line arguments.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1481112&group_id=5470

From noreply at sourceforge.net  Wed May  3 21:32:31 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 03 May 2006 12:32:31 -0700
Subject: [Patches] [ python-Patches-1481112 ] Python long option support
Message-ID: <E1FbN5L-0002Ch-6n@sc8-sf-web6.sourceforge.net>

Patches item #1481112, was opened at 2006-05-03 15:59
Message generated for change (Comment added) made by hwundram
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1481112&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.4
Status: Open
Resolution: None
Priority: 5
Submitted By: Heiko Wundram (hwundram)
Assigned to: Martin v. L??wis (loewis)
Summary: Python long option support

Initial Comment:
The attached patch implements long option support for
Python. It changes the optstring found in
Modules/main.c, specifying brackets for the long name
of a corresponding option name. The patch is backward
compatible in that it doesn't change the behaviour of
_PyOS_GetOpt for any old format string, except on
[:()], which are explicitly excluded for matching an
option. This shouldn't break any code, though.

The patch is against Python 2.4.3.

----------------------------------------------------------------------

>Comment By: Heiko Wundram (hwundram)
Date: 2006-05-03 21:32

Message:
Logged In: YES 
user_id=791932

Final patch which should conform to PEP-7 completely.

----------------------------------------------------------------------

Comment By: Heiko Wundram (hwundram)
Date: 2006-05-03 20:59

Message:
Logged In: YES 
user_id=791932

I'll redo the patch with vim now, emacs doesn't like doing 8
spaces indents, at least as far as I can get it configured...

Anyway, I assign the copyright to any code contained in this
patch to the PSF.

----------------------------------------------------------------------

Comment By: Georg Brandl (gbrandl)
Date: 2006-05-03 20:53

Message:
Logged In: YES 
user_id=849994

Please update the patch to follow Python C style guidelines
(PEP 7), especially use 8-space tabs to indent.

Also, it might be good to send a copyright assignment to the
PSF for a patch of this magnitude.

Otherwise, I think that this is a desirable feature, at
least for --help and --version.

----------------------------------------------------------------------

Comment By: Heiko Wundram (hwundram)
Date: 2006-05-03 20:37

Message:
Logged In: YES 
user_id=791932

The attached patch is against the current subversion trunk,
and implements long options with possible arguments after an
=-sign. It has partial matching for options, erroring out on
ambiguous matches of the command line arguments.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1481112&group_id=5470

From noreply at sourceforge.net  Wed May  3 23:02:27 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 03 May 2006 14:02:27 -0700
Subject: [Patches] [ python-Patches-1477281 ] __init__.py'less package
	import warnings
Message-ID: <E1FbOUN-0003XJ-FT@sc8-sf-web1.sourceforge.net>

Patches item #1477281, was opened at 2006-04-26 22:36
Message generated for change (Settings changed) made by gbrandl
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1477281&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: None
>Status: Closed
>Resolution: Accepted
Priority: 5
Submitted By: Thomas Wouters (twouters)
Assigned to: Guido van Rossum (gvanrossum)
Summary: __init__.py'less package import warnings

Initial Comment:
New! Industrial strength pitchfork-repellant. Just
sprinkle onto 2.5 and watch the pitchforks melt away,
to be replaced by hearthfelt grouphugs.

This patch (which probably needs some reformatting)
adds warnings when Python would have imported a module,
if only there had been an __init__.py. The text is
currently helpful, but it can easily be changed into a
FutureWarning with a warning that it'll change in 2.6
(or 'might change in the future'.)


----------------------------------------------------------------------

>Comment By: Georg Brandl (gbrandl)
Date: 2006-05-03 21:02

Message:
Logged In: YES 
user_id=849994

Looks like this got checked in in rev. 45770.

----------------------------------------------------------------------

Comment By: Thomas Wouters (twouters)
Date: 2006-04-26 22:54

Message:
Logged In: YES 
user_id=34209

Let's just check it into Python 2.5 instead (it's still an
hour and 7 minutes until trunk freeze, I think? :>)


----------------------------------------------------------------------

Comment By: Guido van Rossum (gvanrossum)
Date: 2006-04-26 22:47

Message:
Logged In: YES 
user_id=6380

The patch doesn't check for errors coming out of
PyErr_Warn() -- remember, a warning can always be turned
into an exception.

I'll see if we can patch Google's Python like this.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1477281&group_id=5470

From noreply at sourceforge.net  Thu May  4 03:42:06 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 03 May 2006 18:42:06 -0700
Subject: [Patches] [ python-Patches-1481530 ] imputil "from" os.path import
	bug
Message-ID: <E1FbSr0-00050P-4v@sc8-sf-web3.sourceforge.net>

Patches item #1481530, was opened at 2006-05-03 18:42
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1481530&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.4
Status: Open
Resolution: None
Priority: 5
Submitted By: Eric Huss (ehuss)
Assigned to: Nobody/Anonymous (nobody)
Summary: imputil "from" os.path import bug

Initial Comment:
The following idiom appears to not work when using imputil:

from os.path import join

This patch should fix that problem.


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1481530&group_id=5470

From noreply at sourceforge.net  Thu May  4 06:56:30 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 03 May 2006 21:56:30 -0700
Subject: [Patches] [ python-Patches-1481112 ] Python long option support
Message-ID: <E1FbVt8-0004oS-Fa@sc8-sf-web4-b.sourceforge.net>

Patches item #1481112, was opened at 2006-05-03 15:59
Message generated for change (Comment added) made by loewis
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1481112&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.4
Status: Open
Resolution: None
Priority: 5
Submitted By: Heiko Wundram (hwundram)
Assigned to: Martin v. L??wis (loewis)
Summary: Python long option support

Initial Comment:
The attached patch implements long option support for
Python. It changes the optstring found in
Modules/main.c, specifying brackets for the long name
of a corresponding option name. The patch is backward
compatible in that it doesn't change the behaviour of
_PyOS_GetOpt for any old format string, except on
[:()], which are explicitly excluded for matching an
option. This shouldn't break any code, though.

The patch is against Python 2.4.3.

----------------------------------------------------------------------

>Comment By: Martin v. L??wis (loewis)
Date: 2006-05-04 06:56

Message:
Logged In: YES 
user_id=21627

I actually wonder what the rationale for this patch is. The
command line options of Python seem very clear to me; I
don't see the need for long options.

This should be discussed on python-dev.

----------------------------------------------------------------------

Comment By: Heiko Wundram (hwundram)
Date: 2006-05-03 21:32

Message:
Logged In: YES 
user_id=791932

Final patch which should conform to PEP-7 completely.

----------------------------------------------------------------------

Comment By: Heiko Wundram (hwundram)
Date: 2006-05-03 20:59

Message:
Logged In: YES 
user_id=791932

I'll redo the patch with vim now, emacs doesn't like doing 8
spaces indents, at least as far as I can get it configured...

Anyway, I assign the copyright to any code contained in this
patch to the PSF.

----------------------------------------------------------------------

Comment By: Georg Brandl (gbrandl)
Date: 2006-05-03 20:53

Message:
Logged In: YES 
user_id=849994

Please update the patch to follow Python C style guidelines
(PEP 7), especially use 8-space tabs to indent.

Also, it might be good to send a copyright assignment to the
PSF for a patch of this magnitude.

Otherwise, I think that this is a desirable feature, at
least for --help and --version.

----------------------------------------------------------------------

Comment By: Heiko Wundram (hwundram)
Date: 2006-05-03 20:37

Message:
Logged In: YES 
user_id=791932

The attached patch is against the current subversion trunk,
and implements long options with possible arguments after an
=-sign. It has partial matching for options, erroring out on
ambiguous matches of the command line arguments.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1481112&group_id=5470

From noreply at sourceforge.net  Thu May  4 07:08:37 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 03 May 2006 22:08:37 -0700
Subject: [Patches] [ python-Patches-1481530 ] imputil "from" os.path import
	bug
Message-ID: <E1FbW4r-0003HW-Am@sc8-sf-web1.sourceforge.net>

Patches item #1481530, was opened at 2006-05-04 01:42
Message generated for change (Comment added) made by gbrandl
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1481530&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.4
>Status: Closed
>Resolution: Fixed
Priority: 5
Submitted By: Eric Huss (ehuss)
Assigned to: Nobody/Anonymous (nobody)
Summary: imputil "from" os.path import bug

Initial Comment:
The following idiom appears to not work when using imputil:

from os.path import join

This patch should fix that problem.


----------------------------------------------------------------------

>Comment By: Georg Brandl (gbrandl)
Date: 2006-05-04 05:08

Message:
Logged In: YES 
user_id=849994

Thanks for the patch, committed as rev. 45895, 45896(2.4).

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1481530&group_id=5470

From noreply at sourceforge.net  Thu May  4 07:51:45 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 03 May 2006 22:51:45 -0700
Subject: [Patches] [ python-Patches-1475845 ] IndentationError for
	unexpected indent
Message-ID: <E1FbWkb-000732-Ue@sc8-sf-web2.sourceforge.net>

Patches item #1475845, was opened at 2006-04-25 01:12
Message generated for change (Comment added) made by loewis
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1475845&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Parser/Compiler
Group: Python 2.5
>Status: Closed
>Resolution: Accepted
Priority: 5
Submitted By: Roger Miller (rcmiller)
Assigned to: Martin v. L??wis (loewis)
Summary: IndentationError for unexpected indent

Initial Comment:
This patch raises an IndentationError rather than a
generic "invalid syntax" error for unexpected
indentation.  Code to do this was already in
pythonrun.c:err_input() but was not being reached due
to a failure to pass the INDENT token in the perrdetail
structure.  The patch also adds tests for the 3 kinds
of indentation errors (unexpected indent, no indent
where required, invalid outdent level) to test_syntax.py .


----------------------------------------------------------------------

>Comment By: Martin v. L??wis (loewis)
Date: 2006-05-04 07:51

Message:
Logged In: YES 
user_id=21627

IndentationError is already raised for bad indentation, e.g.
for

"def f():\nreturn"

or

if 1:\nfoo()" (which is the test_no_indent)

However, the patch is right in filling the token in this
case, also; I accepted it as r45897. As it changes the
exceptio behaviour, I don't think it should be backported.

----------------------------------------------------------------------

Comment By: Georg Brandl (gbrandl)
Date: 2006-04-30 13:20

Message:
Logged In: YES 
user_id=849994

Martin, do we want to change this? I myself have always
wondered what IndentationError was for if it was not raised
in these cases.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1475845&group_id=5470

From noreply at sourceforge.net  Thu May  4 08:06:03 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 03 May 2006 23:06:03 -0700
Subject: [Patches] [ python-Patches-1481112 ] Python long option support
Message-ID: <E1FbWyR-0005lX-D7@sc8-sf-web2.sourceforge.net>

Patches item #1481112, was opened at 2006-05-03 15:59
Message generated for change (Comment added) made by hwundram
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1481112&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.4
Status: Open
Resolution: None
Priority: 5
Submitted By: Heiko Wundram (hwundram)
Assigned to: Martin v. L??wis (loewis)
Summary: Python long option support

Initial Comment:
The attached patch implements long option support for
Python. It changes the optstring found in
Modules/main.c, specifying brackets for the long name
of a corresponding option name. The patch is backward
compatible in that it doesn't change the behaviour of
_PyOS_GetOpt for any old format string, except on
[:()], which are explicitly excluded for matching an
option. This shouldn't break any code, though.

The patch is against Python 2.4.3.

----------------------------------------------------------------------

>Comment By: Heiko Wundram (hwundram)
Date: 2006-05-04 08:06

Message:
Logged In: YES 
user_id=791932

The rationale behind this patch is to enable python to
answer to --version and --help, which are pretty much
standard command-line options with GNU utilities, and
increasingly common amongst BSD utilities. I developed this
patch, answering to a request on c.l.p where people were
asking for Python to answer to --version, and thought I
could generalize it a bit so that long options can also be
used for other arguments.

----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2006-05-04 06:56

Message:
Logged In: YES 
user_id=21627

I actually wonder what the rationale for this patch is. The
command line options of Python seem very clear to me; I
don't see the need for long options.

This should be discussed on python-dev.

----------------------------------------------------------------------

Comment By: Heiko Wundram (hwundram)
Date: 2006-05-03 21:32

Message:
Logged In: YES 
user_id=791932

Final patch which should conform to PEP-7 completely.

----------------------------------------------------------------------

Comment By: Heiko Wundram (hwundram)
Date: 2006-05-03 20:59

Message:
Logged In: YES 
user_id=791932

I'll redo the patch with vim now, emacs doesn't like doing 8
spaces indents, at least as far as I can get it configured...

Anyway, I assign the copyright to any code contained in this
patch to the PSF.

----------------------------------------------------------------------

Comment By: Georg Brandl (gbrandl)
Date: 2006-05-03 20:53

Message:
Logged In: YES 
user_id=849994

Please update the patch to follow Python C style guidelines
(PEP 7), especially use 8-space tabs to indent.

Also, it might be good to send a copyright assignment to the
PSF for a patch of this magnitude.

Otherwise, I think that this is a desirable feature, at
least for --help and --version.

----------------------------------------------------------------------

Comment By: Heiko Wundram (hwundram)
Date: 2006-05-03 20:37

Message:
Logged In: YES 
user_id=791932

The attached patch is against the current subversion trunk,
and implements long options with possible arguments after an
=-sign. It has partial matching for options, erroring out on
ambiguous matches of the command line arguments.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1481112&group_id=5470

From noreply at sourceforge.net  Thu May  4 08:14:36 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 03 May 2006 23:14:36 -0700
Subject: [Patches] [ python-Patches-1481112 ] Python long option support
Message-ID: <E1FbX6i-0002Mk-63@sc8-sf-web1.sourceforge.net>

Patches item #1481112, was opened at 2006-05-03 15:59
Message generated for change (Comment added) made by hwundram
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1481112&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.4
Status: Open
Resolution: None
Priority: 5
Submitted By: Heiko Wundram (hwundram)
Assigned to: Martin v. L??wis (loewis)
Summary: Python long option support

Initial Comment:
The attached patch implements long option support for
Python. It changes the optstring found in
Modules/main.c, specifying brackets for the long name
of a corresponding option name. The patch is backward
compatible in that it doesn't change the behaviour of
_PyOS_GetOpt for any old format string, except on
[:()], which are explicitly excluded for matching an
option. This shouldn't break any code, though.

The patch is against Python 2.4.3.

----------------------------------------------------------------------

>Comment By: Heiko Wundram (hwundram)
Date: 2006-05-04 08:14

Message:
Logged In: YES 
user_id=791932

I just posted a mail to python-dev explaining my rationale
behind this patch. Maybe you could answer there...

----------------------------------------------------------------------

Comment By: Heiko Wundram (hwundram)
Date: 2006-05-04 08:06

Message:
Logged In: YES 
user_id=791932

The rationale behind this patch is to enable python to
answer to --version and --help, which are pretty much
standard command-line options with GNU utilities, and
increasingly common amongst BSD utilities. I developed this
patch, answering to a request on c.l.p where people were
asking for Python to answer to --version, and thought I
could generalize it a bit so that long options can also be
used for other arguments.

----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2006-05-04 06:56

Message:
Logged In: YES 
user_id=21627

I actually wonder what the rationale for this patch is. The
command line options of Python seem very clear to me; I
don't see the need for long options.

This should be discussed on python-dev.

----------------------------------------------------------------------

Comment By: Heiko Wundram (hwundram)
Date: 2006-05-03 21:32

Message:
Logged In: YES 
user_id=791932

Final patch which should conform to PEP-7 completely.

----------------------------------------------------------------------

Comment By: Heiko Wundram (hwundram)
Date: 2006-05-03 20:59

Message:
Logged In: YES 
user_id=791932

I'll redo the patch with vim now, emacs doesn't like doing 8
spaces indents, at least as far as I can get it configured...

Anyway, I assign the copyright to any code contained in this
patch to the PSF.

----------------------------------------------------------------------

Comment By: Georg Brandl (gbrandl)
Date: 2006-05-03 20:53

Message:
Logged In: YES 
user_id=849994

Please update the patch to follow Python C style guidelines
(PEP 7), especially use 8-space tabs to indent.

Also, it might be good to send a copyright assignment to the
PSF for a patch of this magnitude.

Otherwise, I think that this is a desirable feature, at
least for --help and --version.

----------------------------------------------------------------------

Comment By: Heiko Wundram (hwundram)
Date: 2006-05-03 20:37

Message:
Logged In: YES 
user_id=791932

The attached patch is against the current subversion trunk,
and implements long options with possible arguments after an
=-sign. It has partial matching for options, erroring out on
ambiguous matches of the command line arguments.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1481112&group_id=5470

From noreply at sourceforge.net  Thu May  4 21:16:09 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Thu, 04 May 2006 12:16:09 -0700
Subject: [Patches] [ python-Patches-1481112 ] Python long option support
Message-ID: <E1FbjJ3-0006ei-NM@sc8-sf-web3.sourceforge.net>

Patches item #1481112, was opened at 2006-05-03 15:59
Message generated for change (Comment added) made by hwundram
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1481112&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.4
Status: Open
Resolution: None
Priority: 5
Submitted By: Heiko Wundram (hwundram)
Assigned to: Martin v. L??wis (loewis)
Summary: Python long option support

Initial Comment:
The attached patch implements long option support for
Python. It changes the optstring found in
Modules/main.c, specifying brackets for the long name
of a corresponding option name. The patch is backward
compatible in that it doesn't change the behaviour of
_PyOS_GetOpt for any old format string, except on
[:()], which are explicitly excluded for matching an
option. This shouldn't break any code, though.

The patch is against Python 2.4.3.

----------------------------------------------------------------------

>Comment By: Heiko Wundram (hwundram)
Date: 2006-05-04 21:16

Message:
Logged In: YES 
user_id=791932

The latest patch takes into account all constructive ideas
that have been proposed on python-dev for this enhancement.

It implements /<opt> support on Windows for long options,
and adds ? as a possible option to get help.

----------------------------------------------------------------------

Comment By: Heiko Wundram (hwundram)
Date: 2006-05-04 08:14

Message:
Logged In: YES 
user_id=791932

I just posted a mail to python-dev explaining my rationale
behind this patch. Maybe you could answer there...

----------------------------------------------------------------------

Comment By: Heiko Wundram (hwundram)
Date: 2006-05-04 08:06

Message:
Logged In: YES 
user_id=791932

The rationale behind this patch is to enable python to
answer to --version and --help, which are pretty much
standard command-line options with GNU utilities, and
increasingly common amongst BSD utilities. I developed this
patch, answering to a request on c.l.p where people were
asking for Python to answer to --version, and thought I
could generalize it a bit so that long options can also be
used for other arguments.

----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2006-05-04 06:56

Message:
Logged In: YES 
user_id=21627

I actually wonder what the rationale for this patch is. The
command line options of Python seem very clear to me; I
don't see the need for long options.

This should be discussed on python-dev.

----------------------------------------------------------------------

Comment By: Heiko Wundram (hwundram)
Date: 2006-05-03 21:32

Message:
Logged In: YES 
user_id=791932

Final patch which should conform to PEP-7 completely.

----------------------------------------------------------------------

Comment By: Heiko Wundram (hwundram)
Date: 2006-05-03 20:59

Message:
Logged In: YES 
user_id=791932

I'll redo the patch with vim now, emacs doesn't like doing 8
spaces indents, at least as far as I can get it configured...

Anyway, I assign the copyright to any code contained in this
patch to the PSF.

----------------------------------------------------------------------

Comment By: Georg Brandl (gbrandl)
Date: 2006-05-03 20:53

Message:
Logged In: YES 
user_id=849994

Please update the patch to follow Python C style guidelines
(PEP 7), especially use 8-space tabs to indent.

Also, it might be good to send a copyright assignment to the
PSF for a patch of this magnitude.

Otherwise, I think that this is a desirable feature, at
least for --help and --version.

----------------------------------------------------------------------

Comment By: Heiko Wundram (hwundram)
Date: 2006-05-03 20:37

Message:
Logged In: YES 
user_id=791932

The attached patch is against the current subversion trunk,
and implements long options with possible arguments after an
=-sign. It has partial matching for options, erroring out on
ambiguous matches of the command line arguments.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1481112&group_id=5470

From noreply at sourceforge.net  Thu May  4 22:21:38 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Thu, 04 May 2006 13:21:38 -0700
Subject: [Patches] [ python-Patches-1481112 ] Python long option support
Message-ID: <E1FbkKQ-00050E-Dv@sc8-sf-web6.sourceforge.net>

Patches item #1481112, was opened at 2006-05-03 15:59
Message generated for change (Comment added) made by hwundram
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1481112&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.4
Status: Open
Resolution: None
Priority: 5
Submitted By: Heiko Wundram (hwundram)
Assigned to: Martin v. L??wis (loewis)
Summary: Python long option support

Initial Comment:
The attached patch implements long option support for
Python. It changes the optstring found in
Modules/main.c, specifying brackets for the long name
of a corresponding option name. The patch is backward
compatible in that it doesn't change the behaviour of
_PyOS_GetOpt for any old format string, except on
[:()], which are explicitly excluded for matching an
option. This shouldn't break any code, though.

The patch is against Python 2.4.3.

----------------------------------------------------------------------

>Comment By: Heiko Wundram (hwundram)
Date: 2006-05-04 22:21

Message:
Logged In: YES 
user_id=791932

Small extension to completely conform to Microsoft long-opt
semantics:

--<name>=<arg>

is equivalent to:

/<name>:<arg>

under Windows with the latest patch.

----------------------------------------------------------------------

Comment By: Heiko Wundram (hwundram)
Date: 2006-05-04 21:16

Message:
Logged In: YES 
user_id=791932

The latest patch takes into account all constructive ideas
that have been proposed on python-dev for this enhancement.

It implements /<opt> support on Windows for long options,
and adds ? as a possible option to get help.

----------------------------------------------------------------------

Comment By: Heiko Wundram (hwundram)
Date: 2006-05-04 08:14

Message:
Logged In: YES 
user_id=791932

I just posted a mail to python-dev explaining my rationale
behind this patch. Maybe you could answer there...

----------------------------------------------------------------------

Comment By: Heiko Wundram (hwundram)
Date: 2006-05-04 08:06

Message:
Logged In: YES 
user_id=791932

The rationale behind this patch is to enable python to
answer to --version and --help, which are pretty much
standard command-line options with GNU utilities, and
increasingly common amongst BSD utilities. I developed this
patch, answering to a request on c.l.p where people were
asking for Python to answer to --version, and thought I
could generalize it a bit so that long options can also be
used for other arguments.

----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2006-05-04 06:56

Message:
Logged In: YES 
user_id=21627

I actually wonder what the rationale for this patch is. The
command line options of Python seem very clear to me; I
don't see the need for long options.

This should be discussed on python-dev.

----------------------------------------------------------------------

Comment By: Heiko Wundram (hwundram)
Date: 2006-05-03 21:32

Message:
Logged In: YES 
user_id=791932

Final patch which should conform to PEP-7 completely.

----------------------------------------------------------------------

Comment By: Heiko Wundram (hwundram)
Date: 2006-05-03 20:59

Message:
Logged In: YES 
user_id=791932

I'll redo the patch with vim now, emacs doesn't like doing 8
spaces indents, at least as far as I can get it configured...

Anyway, I assign the copyright to any code contained in this
patch to the PSF.

----------------------------------------------------------------------

Comment By: Georg Brandl (gbrandl)
Date: 2006-05-03 20:53

Message:
Logged In: YES 
user_id=849994

Please update the patch to follow Python C style guidelines
(PEP 7), especially use 8-space tabs to indent.

Also, it might be good to send a copyright assignment to the
PSF for a patch of this magnitude.

Otherwise, I think that this is a desirable feature, at
least for --help and --version.

----------------------------------------------------------------------

Comment By: Heiko Wundram (hwundram)
Date: 2006-05-03 20:37

Message:
Logged In: YES 
user_id=791932

The attached patch is against the current subversion trunk,
and implements long options with possible arguments after an
=-sign. It has partial matching for options, erroring out on
ambiguous matches of the command line arguments.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1481112&group_id=5470

From noreply at sourceforge.net  Fri May  5 08:55:50 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Thu, 04 May 2006 23:55:50 -0700
Subject: [Patches] [ python-Patches-1481032 ] patch smtplib:when
	SMTPDataError, rset crashes with sslerror
Message-ID: <E1FbuEA-00089b-4Y@sc8-sf-web4-b.sourceforge.net>

Patches item #1481032, was opened at 2006-05-03 13:59
Message generated for change (Comment added) made by hwundram
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1481032&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.4
Status: Open
Resolution: None
Priority: 5
Submitted By: kxroberto (kxroberto)
Assigned to: Nobody/Anonymous (nobody)
Summary: patch smtplib:when SMTPDataError, rset crashes with sslerror

Initial Comment:


  File "smtplib.pyc", line 690, in sendmail
  File "smtplib.pyc", line 445, in rset
  File "smtplib.pyc", line 370, in docmd
  File "smtplib.pyc", line 344, in getreply
  File "smtplib.pyc", line 159, in readline
sslerror: (8, 'EOF occurred in violation of protocol')

traced from a py2.3 - yet unchanged.  

=> hides original error SMTPDataError. such
SMTPDataError may have forced a disconnect of server. 

patch for py2.3,py2.4,..

( I have this patch in my MUST-DO-PATCHes after any
Python installation. )

--

the patch passes on any error in this rset() location.
it also patches a error in PLAIN authentication

could cleanly catch socket.sslerror / socket.error /
EnvironmentError, yet ssl not always there ... 
rset() is tested otherwise, so the nacked except is ok?

(there should maybe be special a common base Exception
class "IOBaseError" for :   EnvironmentError(IOError,
OSError),  EOFError, socket.error, socket.sslerror,
ftplib.all_errors, etc.  Nice as it and not all IO
sublibs have to be imported to catch such errors.)

---

the same problem is with smtp.quit() on many SSL'ed
connections (without any other errors occuring): a
final socket.sslerror is raised during quit(). 
There may be a problem in the termination code of
ssl-FakeSockets or ssl.c . Or the same type of error
catch (on IOBaseError) should be applied. I am not sure
 - in my apps I (must) catch on smtp.quit() generally. 

(Compare also bug #978833 / shutdown(2)-remedy in
httplib's SSL FakeSocket - this shutdown(2) remedy
patch of #978833 I still have it on my MUST list
(py2.3/py2.4 installations), otherwise this FakeSocket
doesn't close fully in a FTPS application (where
termination on data channel is crucial for getting
response on the control channel) - and most likely puts
tremendous connection load on HTTPS servers because of
stale unterminated HTTPS connections while the bug may
not be obvious in casual usage. I'm not completely
clear about the nature of this error. Thus, what I say
is based on trial-and-error.

-robert


----------------------------------------------------------------------

Comment By: Heiko Wundram (hwundram)
Date: 2006-05-05 08:55

Message:
Logged In: YES 
user_id=791932

Where is the patch? :-)

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1481032&group_id=5470

From noreply at sourceforge.net  Fri May  5 10:27:33 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Fri, 05 May 2006 01:27:33 -0700
Subject: [Patches] [ python-Patches-1479611 ] speed up function calls
Message-ID: <E1Fbvev-0003aY-QA@sc8-sf-web1.sourceforge.net>

Patches item #1479611, was opened at 2006-04-30 23:58
Message generated for change (Comment added) made by nnorwitz
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1479611&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Neal Norwitz (nnorwitz)
Assigned to: Nobody/Anonymous (nobody)
Summary: speed up function calls

Initial Comment:
Results:  2.86% for 1 arg (len), 11.8% for 2 args
(min), and 1.6% for pybench.

trunk-speed$ ./python.exe -m timeit 'for x in
xrange(10000): len([])'
100 loops, best of 3: 4.74 msec per loop
trunk-speed$ ./python.exe -m timeit 'for x in
xrange(10000): min(1,2)'
100 loops, best of 3: 8.03 msec per loop

trunk-clean$ ./python.exe -m timeit 'for x in
xrange(10000): len([])'
100 loops, best of 3: 4.88 msec per loop
trunk-clean$ ./python.exe -m timeit 'for x in
xrange(10000): min(1,2)'
100 loops, best of 3: 9.09 msec per loop

pybench goes from 5688.00 down to 5598.00


Details about the patch:

There are 2 unrelated changes.  They both seem to
provide equal benefits for calling varargs C.  One is
very simple and just inlines calling a varargs C
function rather than calling PyCFunction_Call() which
does extra checks that are already known.  This moves
meth and self up one block. and breaks the C_TRACE into
2.  (When looking at the patch, this will make sense I
hope.)

The other change is more dangerous.  It modifies
load_args() to hold on to tuples so they aren't
allocated and deallocated.  The initialization is done
one time in the new func _PyEval_Init().

It allocates 64 tuples of size 8 that are never
deallocated.  The idea is that there won't be usually
be more than 64 frames with 8 or less parameters active
on the stack at any one time (stack depth).  There are
cases where this can degenerate, but for the most part,
it should only be marginally slower, but generally this
should be a fair amount faster by skipping the alloc
and dealloc and some extra work.  My decrementing the
_last_index inside the needs_free blocks, that could
improve behaviour.

This really needs comments added to the code.  But I'm
not gonna get there tonight.  I'd be interested in
comments about the code.

----------------------------------------------------------------------

>Comment By: Neal Norwitz (nnorwitz)
Date: 2006-05-05 01:27

Message:
Logged In: YES 
user_id=33168

v2 attached.  You might not want to review yet.  I mostly
did the first part of your suggest (stats, _Fini, and
stack-like if I understood you correctly).  I didn't do
anything on the second part about inlinting Function_Call.

perf seems to be about the same.  I'm not entirely sure the
patch is correct yet. I found one or two problems in the
original.  I added some more comments. 

----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2006-05-01 01:27

Message:
Logged In: YES 
user_id=21627

The tuples should get deallocated when Py_Finalize is called.

It would be good if there was (conditional) statistical
analysis, showing how often no tuple was found because the
number of arguments was too large, and how often no tuple
was found because the candidate was in use.

I think it should be more stack-like, starting off with no
tuples allocated, then returning them inside the needs_free
blocks only if the refcount is 1 (or 2?). This would avoid
degeneralized cases where some function holds onto its
argument tuple indefinitely, thus consuming all 64 tuples.

For the other part, I think it would make the code more
readable if it inlined PyCFunction_Call even more: the test
for NOARGS|O could be integrated into the switch statement
(one case for each), VARARGS and VARARGS|KEYWORDS would both
load the arguments, then call the function directly
(possibly with NULL keywords). OLDARGS should goto either
METH_NOARGS, METH_O, or METH_VARARGS depending on na (if you
don't like goto, modifying flags would work as well).

----------------------------------------------------------------------

Comment By: Neal Norwitz (nnorwitz)
Date: 2006-05-01 00:08

Message:
Logged In: YES 
user_id=33168

I should note the numbers 64 and 8 are total guesses.  It
might be good to try and determine values based on empirical
data.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1479611&group_id=5470

From noreply at sourceforge.net  Fri May  5 12:22:53 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Fri, 05 May 2006 03:22:53 -0700
Subject: [Patches] [ python-Patches-1481032 ] patch smtplib:when
	SMTPDataError, rset crashes with sslerror
Message-ID: <E1FbxSX-0000ll-SS@sc8-sf-web5.sourceforge.net>

Patches item #1481032, was opened at 2006-05-03 13:59
Message generated for change (Comment added) made by kxroberto
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1481032&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.4
Status: Open
Resolution: None
Priority: 5
Submitted By: kxroberto (kxroberto)
Assigned to: Nobody/Anonymous (nobody)
Summary: patch smtplib:when SMTPDataError, rset crashes with sslerror

Initial Comment:


  File "smtplib.pyc", line 690, in sendmail
  File "smtplib.pyc", line 445, in rset
  File "smtplib.pyc", line 370, in docmd
  File "smtplib.pyc", line 344, in getreply
  File "smtplib.pyc", line 159, in readline
sslerror: (8, 'EOF occurred in violation of protocol')

traced from a py2.3 - yet unchanged.  

=> hides original error SMTPDataError. such
SMTPDataError may have forced a disconnect of server. 

patch for py2.3,py2.4,..

( I have this patch in my MUST-DO-PATCHes after any
Python installation. )

--

the patch passes on any error in this rset() location.
it also patches a error in PLAIN authentication

could cleanly catch socket.sslerror / socket.error /
EnvironmentError, yet ssl not always there ... 
rset() is tested otherwise, so the nacked except is ok?

(there should maybe be special a common base Exception
class "IOBaseError" for :   EnvironmentError(IOError,
OSError),  EOFError, socket.error, socket.sslerror,
ftplib.all_errors, etc.  Nice as it and not all IO
sublibs have to be imported to catch such errors.)

---

the same problem is with smtp.quit() on many SSL'ed
connections (without any other errors occuring): a
final socket.sslerror is raised during quit(). 
There may be a problem in the termination code of
ssl-FakeSockets or ssl.c . Or the same type of error
catch (on IOBaseError) should be applied. I am not sure
 - in my apps I (must) catch on smtp.quit() generally. 

(Compare also bug #978833 / shutdown(2)-remedy in
httplib's SSL FakeSocket - this shutdown(2) remedy
patch of #978833 I still have it on my MUST list
(py2.3/py2.4 installations), otherwise this FakeSocket
doesn't close fully in a FTPS application (where
termination on data channel is crucial for getting
response on the control channel) - and most likely puts
tremendous connection load on HTTPS servers because of
stale unterminated HTTPS connections while the bug may
not be obvious in casual usage. I'm not completely
clear about the nature of this error. Thus, what I say
is based on trial-and-error.

-robert


----------------------------------------------------------------------

>Comment By: kxroberto (kxroberto)
Date: 2006-05-05 12:22

Message:
Logged In: YES 
user_id=972995

here (i forgot the upload checkbox?)

----------------------------------------------------------------------

Comment By: Heiko Wundram (hwundram)
Date: 2006-05-05 08:55

Message:
Logged In: YES 
user_id=791932

Where is the patch? :-)

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1481032&group_id=5470

From noreply at sourceforge.net  Sat May  6 20:16:18 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Sat, 06 May 2006 11:16:18 -0700
Subject: [Patches] [ python-Patches-1457736 ] patch for building trunk with
	VC6
Message-ID: <E1FcRKE-0000i2-IJ@sc8-sf-web4-b.sourceforge.net>

Patches item #1457736, was opened at 2006-03-24 21:40
Message generated for change (Comment added) made by infidel
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1457736&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Windows
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Hirokazu Yamamoto (ocean-city)
Assigned to: Raymond Hettinger (rhettinger)
Summary: patch for building trunk with VC6

Initial Comment:
Hello. I tried to build trunk with VC6, but failed.
The reasons are

 - _W64 is not defined on VC6. (PC/pyconfig.h)

 - intptr_t and uintptr_t are not decleared on VC6.
(should use Py_intptr_t and Py_uintptr_t respectively)

I'll submit the patch for these two issues as
"build_trunk_for_vc6.patch".

And more two issues.

 - zlib was make built into pythoncore, but
PC/VC6/pythoncore.dsp is not updated for it yet.

I'll submit the file itself.

 - long long cannot be used on VC6, so 0xFFFFULL is
failed to compile with "invalid suffix" error.

I workarounded this replaced ULL with UI64 (_int64's
suffix) but I don't know how to make the patch. maybe
can this tequnique be used?

  #define Py_ULL(x) x##ULL /* non VC6 */

  #define Py_ULL(x) x##UI64 /* VC6 */

  Py_ULL(0xFFFFFFFFFFFFFFFF) instead of 0xFFF...FULL


----------------------------------------------------------------------

Comment By: Luke Dunstan (infidel)
Date: 2006-05-07 02:16

Message:
Logged In: YES 
user_id=30442

Is there anything preventing this patch from being 
applied? It would help me with building the trunk using 
both VC6 and Microsoft eMbedded Visual C++ 4.0 (for 
Windows CE).


----------------------------------------------------------------------

Comment By: Neal Norwitz (nnorwitz)
Date: 2006-03-27 01:02

Message:
Logged In: YES 
user_id=33168

Raymond, maybe this will help get VC6 building?

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1457736&group_id=5470

From noreply at sourceforge.net  Sun May  7 07:37:37 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Sat, 06 May 2006 22:37:37 -0700
Subject: [Patches] [ python-Patches-1457736 ] patch for building trunk with
	VC6
Message-ID: <E1FcbxZ-0006pJ-U0@sc8-sf-web2.sourceforge.net>

Patches item #1457736, was opened at 2006-03-24 22:40
Message generated for change (Comment added) made by ocean-city
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1457736&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Windows
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Hirokazu Yamamoto (ocean-city)
Assigned to: Raymond Hettinger (rhettinger)
Summary: patch for building trunk with VC6

Initial Comment:
Hello. I tried to build trunk with VC6, but failed.
The reasons are

 - _W64 is not defined on VC6. (PC/pyconfig.h)

 - intptr_t and uintptr_t are not decleared on VC6.
(should use Py_intptr_t and Py_uintptr_t respectively)

I'll submit the patch for these two issues as
"build_trunk_for_vc6.patch".

And more two issues.

 - zlib was make built into pythoncore, but
PC/VC6/pythoncore.dsp is not updated for it yet.

I'll submit the file itself.

 - long long cannot be used on VC6, so 0xFFFFULL is
failed to compile with "invalid suffix" error.

I workarounded this replaced ULL with UI64 (_int64's
suffix) but I don't know how to make the patch. maybe
can this tequnique be used?

  #define Py_ULL(x) x##ULL /* non VC6 */

  #define Py_ULL(x) x##UI64 /* VC6 */

  Py_ULL(0xFFFFFFFFFFFFFFFF) instead of 0xFFF...FULL


----------------------------------------------------------------------

>Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-05-07 14:37

Message:
Logged In: YES 
user_id=1200846

Hello. I updated the patch. (Probably this is better)

  - defined ULL() macro locally in Modules/sha512module.c
      maybe it's better to declare Py_ULL or something
      globally, but I don't know how to do it.

 - more patch for zlib builtin (ie: PC/VC6/Readme.txt)

I cannot try this patch on VC7 or later, but
I confirmed lib/test/testall.py passed on VC6.

----------------------------------------------------------------------

Comment By: Luke Dunstan (infidel)
Date: 2006-05-07 03:16

Message:
Logged In: YES 
user_id=30442

Is there anything preventing this patch from being 
applied? It would help me with building the trunk using 
both VC6 and Microsoft eMbedded Visual C++ 4.0 (for 
Windows CE).


----------------------------------------------------------------------

Comment By: Neal Norwitz (nnorwitz)
Date: 2006-03-27 02:02

Message:
Logged In: YES 
user_id=33168

Raymond, maybe this will help get VC6 building?

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1457736&group_id=5470

From noreply at sourceforge.net  Sun May  7 07:40:31 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Sat, 06 May 2006 22:40:31 -0700
Subject: [Patches] [ python-Patches-1457736 ] patch for building trunk with
	VC6
Message-ID: <E1Fcc0N-0005Un-FQ@sc8-sf-web3.sourceforge.net>

Patches item #1457736, was opened at 2006-03-24 22:40
Message generated for change (Comment added) made by ocean-city
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1457736&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Windows
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Hirokazu Yamamoto (ocean-city)
Assigned to: Raymond Hettinger (rhettinger)
Summary: patch for building trunk with VC6

Initial Comment:
Hello. I tried to build trunk with VC6, but failed.
The reasons are

 - _W64 is not defined on VC6. (PC/pyconfig.h)

 - intptr_t and uintptr_t are not decleared on VC6.
(should use Py_intptr_t and Py_uintptr_t respectively)

I'll submit the patch for these two issues as
"build_trunk_for_vc6.patch".

And more two issues.

 - zlib was make built into pythoncore, but
PC/VC6/pythoncore.dsp is not updated for it yet.

I'll submit the file itself.

 - long long cannot be used on VC6, so 0xFFFFULL is
failed to compile with "invalid suffix" error.

I workarounded this replaced ULL with UI64 (_int64's
suffix) but I don't know how to make the patch. maybe
can this tequnique be used?

  #define Py_ULL(x) x##ULL /* non VC6 */

  #define Py_ULL(x) x##UI64 /* VC6 */

  Py_ULL(0xFFFFFFFFFFFFFFFF) instead of 0xFFF...FULL


----------------------------------------------------------------------

>Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-05-07 14:40

Message:
Logged In: YES 
user_id=1200846

Oops, I forgot to upload the file.

  - Apply x.patch.

  - Replace pythoncore.dsp and pcbuild.dsw in PC/VC6 with
    attached files.

 - Remove PC/VC6/zlib.dsp


----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-05-07 14:37

Message:
Logged In: YES 
user_id=1200846

Hello. I updated the patch. (Probably this is better)

  - defined ULL() macro locally in Modules/sha512module.c
      maybe it's better to declare Py_ULL or something
      globally, but I don't know how to do it.

 - more patch for zlib builtin (ie: PC/VC6/Readme.txt)

I cannot try this patch on VC7 or later, but
I confirmed lib/test/testall.py passed on VC6.

----------------------------------------------------------------------

Comment By: Luke Dunstan (infidel)
Date: 2006-05-07 03:16

Message:
Logged In: YES 
user_id=30442

Is there anything preventing this patch from being 
applied? It would help me with building the trunk using 
both VC6 and Microsoft eMbedded Visual C++ 4.0 (for 
Windows CE).


----------------------------------------------------------------------

Comment By: Neal Norwitz (nnorwitz)
Date: 2006-03-27 02:02

Message:
Logged In: YES 
user_id=33168

Raymond, maybe this will help get VC6 building?

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1457736&group_id=5470

From noreply at sourceforge.net  Sun May  7 14:43:06 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Sun, 07 May 2006 05:43:06 -0700
Subject: [Patches] [ python-Patches-982340 ] applesingle endianness issue
Message-ID: <E1FcibK-0000VY-PF@sc8-sf-web1.sourceforge.net>

Patches item #982340, was opened at 2004-06-30 00:20
Message generated for change (Settings changed) made by gbrandl
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=982340&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Macintosh
Group: Python 2.3
>Status: Closed
>Resolution: Fixed
Priority: 5
Submitted By: Bob Ippolito (etrepum)
Assigned to: Jack Jansen (jackjansen)
Summary: applesingle endianness issue

Initial Comment:
the struct formats in applesingle.py do not declare endianness and 
thus won't work on little endian architectures.  This patch adds the 
endian declarations to the struct formats.
(from http://www.opensource.apple.com/darwinsource/
WWDC2004/)

----------------------------------------------------------------------

Comment By: Ronald Oussoren (ronaldoussoren)
Date: 2006-04-17 13:41

Message:
Logged In: YES 
user_id=580910

Fixed in revision 45487 on the trunk. Please confirm and close this patch.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=982340&group_id=5470

From noreply at sourceforge.net  Sun May  7 15:33:45 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Sun, 07 May 2006 06:33:45 -0700
Subject: [Patches] [ python-Patches-1483325 ] Patch fixing #1481770 (wrong
	shared lib ext on hpux ia64)
Message-ID: <E1FcjOL-0001pS-Bh@sc8-sf-web2.sourceforge.net>

Patches item #1483325, was opened at 2006-05-07 07:33
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1483325&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.4
Status: Open
Resolution: None
Priority: 5
Submitted By: David Everly (deckrider)
Assigned to: Nobody/Anonymous (nobody)
Summary: Patch fixing #1481770 (wrong shared lib ext on hpux ia64)

Initial Comment:
(configure and pyconfig.h.in must be regenerated after
applying this patch)

Not heavily tested, since I only have Linux (i686) at home.

Will test on HPUX ia64 tomorrow and report back.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1483325&group_id=5470

From noreply at sourceforge.net  Sun May  7 18:03:28 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Sun, 07 May 2006 09:03:28 -0700
Subject: [Patches] [ python-Patches-1483395 ] Add new top-level domains to
	cookielib
Message-ID: <E1FcljE-0005HJ-Fc@sc8-sf-web2.sourceforge.net>

Patches item #1483395, was opened at 2006-05-07 17:03
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1483395&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: John J Lee (jjlee)
Assigned to: Nobody/Anonymous (nobody)
Summary: Add new top-level domains to cookielib

Initial Comment:
IANA introduced some new top-level domains in addition
to the original seven.  This adds them to cookielib,
and adds a test for the relevant behaviour
(blacklisting of some "country-code TLDs").

2.4 backport candidate.


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1483395&group_id=5470

From noreply at sourceforge.net  Sun May  7 19:13:25 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Sun, 07 May 2006 10:13:25 -0700
Subject: [Patches] [ python-Patches-1479977 ] Heavy revisions to urllib2
	howto
Message-ID: <E1Fcmov-0000oJ-Le@sc8-sf-web1.sourceforge.net>

Patches item #1479977, was opened at 2006-05-01 15:50
Message generated for change (Comment added) made by akuchling
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1479977&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Documentation
Group: Python 2.5
>Status: Closed
>Resolution: Accepted
Priority: 5
Submitted By: John J Lee (jjlee)
>Assigned to: A.M. Kuchling (akuchling)
Summary: Heavy revisions to urllib2 howto

Initial Comment:
Lots of people have been complaining about lack of
urllib2 docs (though I'm never quite sure what people
are looking for, being too familiar with all the
details), so a tutorial may well be a useful addition.
 I'm sure you'll understand that my brutal criticism
:-) is intended to make it even more useful.

Michael: feel free to make further revisions, but
unless you have major objections I suggest that this is
checked in first, then we make any further changes
after that by uploading patches on SF for review (I
haven't stepped back and re-read it with a fresh mind,
and no doubt would be useful for somebody to do that).
 Editing this took me quite a while, and if I can help
it I don't want to go through too many revisions or
argue about the details before anything gets fixed!-).
 I've taken the liberty of mentioning myself as a
reviewer somewhere at the end of the document :-)

Important: I reformatted paragraphs to max 70 character
width (it's conventional, and plain-text diffs are
especially painful to read otherwise, though admittedly
diffs are never great for paragraphs anyway... I hope
emacs didn't muck up any ReST syntax).  I've uploaded
just that formatting change as reformatted.rst (which
also removes trailing whitespace from all lines).  This
should be done in a separate initial commit of course.
 For this reason, I've uploaded the whole document for
both reformatted (reformatted.rst) and edited versions
(edited.rst) rather than using patches.

I've made all of the changes I discuss below, *with the
exception of* the missing example of GET with
urlencoded data that's really needed (search for XXX in
the comments below) -- that should just need a few lines.

BTW, it would be a really fantastic idea to turn the
whole document into a valid doctest (I know I'm myself
almost incapable of writing correct examples unless I
do something like that).  All that would require of
course is adding a few >>>s and ...s and running it
through doctest.testfile until it stops complaining ;-)


Now a list explaining and justifying the changes I made:


Spelling / paragraph structure etc. fixes.  I won't
list these.


Most importantly, you seem a bit unsure who your
audience is.  For example, on headers -- you explain
that "HTTP is based on requests and responses", but
dive into User-Agent without actually mentioning what a
header is.  In my changes, I ended up adding brief
explanations of the concepts for people new to or fuzzy
about HTTP, but didn't go into details of
implementation.  For example, introducing the concept
of "HTTP header", but not explaining how HTTP
implements them "on the wire" (though in fact I think
it would be a good thing to add one example that showed
an HTTP request and pointed out the request line, the
headers and the data, since that makes everything very
concrete and easy to grasp for newbies).


Removed link to external howto on cookie handling. 
Despite the description ("How to handle cookies, when
fetching web pages with Python."), this actually spends
most of its time discussing what conditional imports
are needed if you want to be maximally compatible
across libraries and older versions of Python.  While
that is certainly useful for people who need that, I
think this is rather obscure and distracting detail
that seems out of place being referenced from the
Python 2.5 documentation, even in a howto.  Perhaps
some general statement that further tutorials are
available on your site?  Referencing your basic auth
tutorial seems fine.


You limit mention of urllib2.urlopen(url) to a
footnote, and in the text of the tutorial itself, you
say: """urllib2 mirrors this by having you form a
``Request``""" .  That's not true: a string URL is
fine, as you explain in the footnote.  That seems an
innaccuracy with no obvious didactic payoff.  In the
footnote, you say:

"""You *can* fetch URLs directly with urlopen, without
using a request object. It's more explicit, and
therefore more Pythonic, to use ``urllib2.Request``
though. It also makes it easier to add headers to your
request.

I find that bizarre!  Why is urlopen(url) unpythonic??
 On the contrary, using an extra object for no reason
*does* seem unpythonic to me.  I rewrote this a bit.


You needlessly assign the_url = "http:...", then
request = Request(the_url) -- why not a single line? 
Where it's useful to do that (i.e. in the more
complicated examples), I've s/the_url/url/, since I
object to chaff like "the_" in variable names ;-)


Your discussion of Request implies that it only
represents HTTP requests.  Fixed that.


Use of the word "handle" to talk about response objects
is unfortunate for two reasons: First, many objects in
Python are "handles" in some sense ("object reference"
semantics), so it's too vague to be a helpful name. 
Second, it's particularly unfortunate to use the word
"handle" when urllib2 makes heavy use of "handler"
objects that "handle" requests.  The fact that methods
on these handlers often return your "handles" only
makes things more confusing!  s/handle/response/


"""Sometimes you want to **POST** data to a CGI (Common
Gateway Interface) [#]_ or other web application"""

It's clear to us old hands what you mean here, but in a
tutorial at the level you seem to have picked we
probably shouldn't expect the reader to have all these
concepts straight, so being sloppy here is bad.

 - By "a CGI" I'm guessing you mean "a CGI
script/program".  Also, the whole sentence is unclear
whether you're talking about a web application in the
abstract, or some concrete CGI script.  I certainly
remember being very confused about this kind of thing
as a newbie.

 - "...or other web application" implies that all POSTs
go to web applications.  That's using "web application"
in a broader sense than it's usually understood.

 - You introduce "POST" without explanation.  Would be
nice to say "send data" instead of "POST", then explain
POST.

I rewrote this bit to try to address those points.


Re POST: """This is what your browser does when you
fill in a FORM on the web"""

Thats needed qualifying: form submission can also
result in a GET.


I added a bit on side-effects and GET/POST.


"""You may be mimicking a FORM submission, or
transmitting data to your own application."""

This reads oddly to me.  I know what you're getting at
(forms are not part of HTTP), but surely if you are
submitting form data you're not "mimicking" form
submission, you *are* submitting a form.  And in an
English sentence the "or" reads as an "exclusive or";
with that in mind: In what sense does form submission
*not* involve "transmitting data to your own
application"?  Reworded and s/FORM/HTML form/, since
we're talking about the abstract thing rather than
specifically about the HTML element.


"""In either case the data needs to be encoded for safe
transmission over HTTP"""

Arbitrary binary data does not need to be URL-encoded.
 Rephrased.


"""The encoding is done using a function from the
``urllib`` library *not* from ``urllib2``. ::"""

This is not true in general even for HTML forms.  For
example, HTML form file upload data is not encoded in
this way.  There are more obscure cases, too.  Noted this.


The quoted User-Agent string was out-of-date.  Fixed,
noting that it changes with each minor Python version.


Headers / data : I added a bit of explanatory context
to tell people what we're about to explain, and break
up paragraphs / add sections to clarify the structure.
 Also explained the concept of "HTTP header", as I
noted above.


XXX example needed on GET with urlencoded data (as it's
written ATM, this would go immediately before the
"Headers" section).


"""Coping With Errors"""

"Handling exceptions" seems more accurate.  Not all
HTTP status codes for which urllib2 raises an exception
involve HTTP error responses.  The text is also
confused on this point, so I rewrote it.


Errors: I believe urlopen can still actually raise
socket.error.  This is a bug, but I haven't dared to
submit a patch to fix it, fearing
backwards-compatibility issues.  I guess it should
probably be documented :-( But I suppose we should
discuss that in a separate tracker item, rather than
adding it to your howto straight away.


You mention IOError.  Without a motivating use case I
don't know why you mention this.  Since I'm not really
sure what the use case for this subclassing was ever
intended to be :-) I removed this example: feel free to
add it back if you know of a use or can get Jeremy
Hylton to explain it to you ;-)


Re URLError : you imply that the only reason for
URLError to be raised is failure to connect to the
server.  This is often the cause, but certainly not always.


For HTTP status codes, you refer to a document that
states "This is a historic document and is not accurate
anymore".  RFC 2616 is authoritative, and IMHO fairly
readable on error codes.  Removed the reference to the
other document.


"""As of Python 2.5 a dictionary like this one has
become part of ``urllib2``."""

In fact, this was moved to httplib.  The reference to
"HTTPBaseServer" (sic) is interesting: I think the copy
in httplib should be removed, since it's already there
in BaseHTTPServer (albeit missing 306, but that is
unused) -- would you mind filing a patch, Michael?

Your listing differed from BaseHTTPServer and from RFC
2616, so I replaced it with the BaseHTTPServer copy.


"""shows all the defined response codes"""

These are only those defined by RFC 2616 of course:
other standards can and do define other response status
codes (e.g. DAV).  Clarified this.


"""When an error is raised the server responds by
returning an http error code *and* an error page."""

This is sloppy: HTTP doesn't define "raising" an error,
so it can't respond to one.  Fixed.


httplib.HTTPMessage

Reworded to avoid impling it's *always* going to be
this concrete class.


"""In versions of Python prior to 2.3.4 it wasn't safe
to iterate over the object directly, so you should
iterate over the list returned by ``msg.keys()``
instead."""

Is this appropriate advice in the 2.5 docs?  I removed
this (am I too harsh on this point?).


"""Openers and handlers are slightly esoteric parts of
**urllib2**."""

I don't want to scare people off: they're easy to use
(if not to write).  Removed this.


I added a tiny bit more on what handlers do.


Changed the text to avoid implying that build_opener()
is the only way to create openers.


Don't refer to ``opener`` in those typewriter-font ReST
backticks, since that seems a little misleading: it's
not a Python class name (unfortunately the class is
named OpenerDirector, which rather clashes with the use
of the name "opener" of course, but personally I'm with
you in preferring "opener").


Wrote a bit more about opener construction.


Changed realm name to make it clear it may contain spaces.


Changed references to URI to URL in discussion of
authentication -- seems an irrelevant and distracting
distinction here.


I edited the basic auth description a little.


Comments conventionally come *before* code it refers
to, not after.  Fixed that, removed an over-obvious
comment or two (even in docs, "create the handler"
seems redundant if that's *all* it says), and the fixed
the curious line breaks.


"""The only reason to explicitly supply these to
``build_opener`` (which chains handlers provided as a
list), would be to change the order they appear in the
chain."""

I don't know of a use case for that in the case of the
handlers you list.  Also, that doesn't actually work:
handler ordering is determined by sorting.  Removed this.


"""One thing not to get bitten by is that the
``top_level_url`` in the code above *must not* contain
the protocol - the ``http://`` part. So if the URL we
are trying to access is"""

This is not correct usage (though I can see why it
worked); removed it.  Admittedly, urllib2 auth was the
subject of a quite a few bug fixes recently (I seem to
have just found yet another one five minutes ago, in
fact :-( ), so the situation pre-2.5 was certainly
messy.  However, I advise against trying to document
the old bugs!  Note that I haven't given examples of
"sub-URLs" since the RFC (2617) isn't clear to me on
this point, and I haven't yet tested whether urllib2
gets it right according to de-facto standards (as
defined by browsers, Apache, etc.)  for "sub-URLs" of
the one passed to .add_password().  It's on the list...


In your note explaining that HTTPS proxies are not
supported, you use "caution" rather than "note", which
conveys the strange implication to me that this lack of
support is somehow a consequence of using your previous
recipe for switching off proxy handling (or am I weird
in reading it that way??).  s/caution/note/


""".. [#] Possibly some of this tutorial will make it
into the standard library docs for versions of Python
after 2.4.1."""

Removed this.


Whew!


----------------------------------------------------------------------

>Comment By: A.M. Kuchling (akuchling)
Date: 2006-05-07 13:13

Message:
Logged In: YES 
user_id=11375

Edited.rst has been committed; thanks!


----------------------------------------------------------------------

Comment By: John J Lee (jjlee)
Date: 2006-05-01 15:59

Message:
Logged In: YES 
user_id=261020

(I guess if I had any sense in me, I would have uploaded
those comments as an attachment instead of pasting them into
the summary -- sorry.)

I'm uploading the revised document now.


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1479977&group_id=5470

From noreply at sourceforge.net  Sun May  7 22:45:17 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Sun, 07 May 2006 13:45:17 -0700
Subject: [Patches] [ python-Patches-1483395 ] Add new top-level domains to
	cookielib
Message-ID: <E1Fcq7x-0001QX-L1@sc8-sf-web2.sourceforge.net>

Patches item #1483395, was opened at 2006-05-07 16:03
Message generated for change (Comment added) made by gbrandl
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1483395&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.5
>Status: Closed
>Resolution: Accepted
Priority: 5
Submitted By: John J Lee (jjlee)
Assigned to: Nobody/Anonymous (nobody)
Summary: Add new top-level domains to cookielib

Initial Comment:
IANA introduced some new top-level domains in addition
to the original seven.  This adds them to cookielib,
and adds a test for the relevant behaviour
(blacklisting of some "country-code TLDs").

2.4 backport candidate.


----------------------------------------------------------------------

>Comment By: Georg Brandl (gbrandl)
Date: 2006-05-07 20:45

Message:
Logged In: YES 
user_id=849994

Committed in rev. 45934. Note that the patch contained a new
test which imported a module "mechanize", which doesn't
belong to the stdlib. I removed that test.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1483395&group_id=5470

From noreply at sourceforge.net  Sun May  7 23:23:07 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Sun, 07 May 2006 14:23:07 -0700
Subject: [Patches] [ python-Patches-1483395 ] Add new top-level domains to
	cookielib
Message-ID: <E1FcqiZ-0004KU-37@sc8-sf-web3.sourceforge.net>

Patches item #1483395, was opened at 2006-05-07 17:03
Message generated for change (Comment added) made by jjlee
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1483395&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.5
>Status: Open
Resolution: Accepted
Priority: 5
Submitted By: John J Lee (jjlee)
Assigned to: Nobody/Anonymous (nobody)
Summary: Add new top-level domains to cookielib

Initial Comment:
IANA introduced some new top-level domains in addition
to the original seven.  This adds them to cookielib,
and adds a test for the relevant behaviour
(blacklisting of some "country-code TLDs").

2.4 backport candidate.


----------------------------------------------------------------------

>Comment By: John J Lee (jjlee)
Date: 2006-05-07 22:23

Message:
Logged In: YES 
user_id=261020

Oops, could you commit the test with that "mechanize"
replaced by "cookielib"?

I just ran the tests with that change and it passes.


----------------------------------------------------------------------

Comment By: Georg Brandl (gbrandl)
Date: 2006-05-07 21:45

Message:
Logged In: YES 
user_id=849994

Committed in rev. 45934. Note that the patch contained a new
test which imported a module "mechanize", which doesn't
belong to the stdlib. I removed that test.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1483395&group_id=5470

From noreply at sourceforge.net  Sun May  7 23:33:08 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Sun, 07 May 2006 14:33:08 -0700
Subject: [Patches] [ python-Patches-1483395 ] Add new top-level domains to
	cookielib
Message-ID: <E1FcqsG-00053Q-47@sc8-sf-web4-b.sourceforge.net>

Patches item #1483395, was opened at 2006-05-07 17:03
Message generated for change (Settings changed) made by jjlee
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1483395&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.5
Status: Open
Resolution: Accepted
Priority: 5
Submitted By: John J Lee (jjlee)
>Assigned to: Georg Brandl (gbrandl)
Summary: Add new top-level domains to cookielib

Initial Comment:
IANA introduced some new top-level domains in addition
to the original seven.  This adds them to cookielib,
and adds a test for the relevant behaviour
(blacklisting of some "country-code TLDs").

2.4 backport candidate.


----------------------------------------------------------------------

Comment By: John J Lee (jjlee)
Date: 2006-05-07 22:23

Message:
Logged In: YES 
user_id=261020

Oops, could you commit the test with that "mechanize"
replaced by "cookielib"?

I just ran the tests with that change and it passes.


----------------------------------------------------------------------

Comment By: Georg Brandl (gbrandl)
Date: 2006-05-07 21:45

Message:
Logged In: YES 
user_id=849994

Committed in rev. 45934. Note that the patch contained a new
test which imported a module "mechanize", which doesn't
belong to the stdlib. I removed that test.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1483395&group_id=5470

From noreply at sourceforge.net  Mon May  8 00:27:09 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Sun, 07 May 2006 15:27:09 -0700
Subject: [Patches] [ python-Patches-972322 ] urllib2 handler naming
	convention collision
Message-ID: <E1FcriX-0002Bd-BV@sc8-sf-web3.sourceforge.net>

Patches item #972322, was opened at 2004-06-14 00:16
Message generated for change (Comment added) made by jjlee
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=972322&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.4
Status: Open
Resolution: None
Priority: 5
Submitted By: John J Lee (jjlee)
Assigned to: Jeremy Hylton (jhylton)
Summary: urllib2 handler naming convention collision

Initial Comment:
The method naming conventions of *_open and *_request
in urllib2 are accidentally met by the following methods:

AbstractHTTPHandler.do_open()
ProxyHandler.proxy_open()
AbstractHTTPHandler.redirect_request()

So URLs like do://example.com/ are regarded as having a
handler, and urllib2.urlopen("do://python.org/") causes
a TypeError.

I think *something* should be done about this, but I'm
willing to provide a different patch if this one is
frowned upon.  The alternative would be to rename
do_open and proxy_open, and leave the redirect_request
case unchanged (see below for why).

The first two methods are undocumented, so could in
theory be renamed.  However, people will likely be
overriding them anyway, so perhaps it's better to apply
this ugly patch than rename them.

redirect_request is documented, so can't be renamed,
but it will never be accidentally called unless
somebody actually adds a handler with a method named
"redirect_open".


----------------------------------------------------------------------

>Comment By: John J Lee (jjlee)
Date: 2006-05-07 23:27

Message:
Logged In: YES 
user_id=261020

OK, I see a slightly less ugly fix, don't apply this.  I
intend to upload a better one later.

----------------------------------------------------------------------

Comment By: John J Lee (jjlee)
Date: 2006-03-31 22:16

Message:
Logged In: YES 
user_id=261020

Here's an updated patch (collision.patch) that applies
against SVN HEAD.  I also made the test a little clearer. 
collision.patch supercedes both urllib2.py.patch and
test_urllib2.py.patch

----------------------------------------------------------------------

Comment By: John J Lee (jjlee)
Date: 2005-05-19 21:53

Message:
Logged In: YES 
user_id=261020

Since nobody seems to mind the slightly uglified code
required to fix these bugs in a backwards-compatible way,
could somebody please apply this patch?


----------------------------------------------------------------------

Comment By: Michael Chermside (mcherm)
Date: 2004-10-22 17:36

Message:
Logged In: YES 
user_id=99874

I have reviewed this patch and I recomend applying it.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=972322&group_id=5470

From noreply at sourceforge.net  Mon May  8 01:23:40 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Sun, 07 May 2006 16:23:40 -0700
Subject: [Patches] [ python-Patches-1483545 ] Wave.py support for ulaw and
	alaw audio
Message-ID: <E1FcsbE-0005k8-0T@sc8-sf-web3.sourceforge.net>

Patches item #1483545, was opened at 2006-05-07 19:23
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1483545&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Eric Woudenberg (ewoudenberg)
Assigned to: Nobody/Anonymous (nobody)
Summary: Wave.py support for ulaw and alaw audio

Initial Comment:
Dear Python Patch Center:

This is my first Python patch submission. Apologies for
any errors of protocol. I have been using these
submitted changes for several years and have even given
it out (I mentioned it a few years back on a python
mailing list.) I am confident to the best of my ability
that these changes are solid.

Unfortunately I don't have the capability of rebuilding
the documentation, but the changes to the documentation
are outlined below.

Please do not hesitate to contact me for further
information or assistance. I would be honored to have
these changes become part of some Python revision, be
it 2.5 or something further in the future.

Thank you,
Eric Woudenberg
eaw at connact.com

>From my version of the wave.py file:

These changes allow .wav files containing u-law and
a-law data to be read and written. The user visible
changes are:

1) After a .wav file containing mu-law or a-law data is
opened for reading, a call to getcomptype() returns
'ULAW' (resp. 'ALAW') and a call to getcompname()
returns 'CCITT G.711 u-law' (resp. 'CCITT G.711 a-law').

2) After a wave object is created for writing,
setcomptype() can be called with the arguments ('ULAW',
'CCITT G.711 u-law') (resp. 'ALAW', 'CCITT G.711
a-law'). The second argument (text description) is ignored.

3) The comptype 'PCM' is now a synonym for 'NONE'.
PCM-containing wave files will return 'PCM' instead of
'NONE' for their comptype.
   
Note that this module does not do any u-law or a-law
format conversion to PCM, it simply allows users to
read or write u-law/a-law data from/to .wav files that
have conforming headers. For audio conversion of PCM
data to or from u-law, use the audioop module.


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1483545&group_id=5470

From noreply at sourceforge.net  Mon May  8 19:29:00 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 08 May 2006 10:29:00 -0700
Subject: [Patches] [ python-Patches-1483395 ] Add new top-level domains to
	cookielib
Message-ID: <E1Fd9XY-0003iR-Ju@sc8-sf-web5.sourceforge.net>

Patches item #1483395, was opened at 2006-05-07 16:03
Message generated for change (Comment added) made by gbrandl
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1483395&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.5
>Status: Closed
Resolution: Accepted
Priority: 5
Submitted By: John J Lee (jjlee)
Assigned to: Georg Brandl (gbrandl)
Summary: Add new top-level domains to cookielib

Initial Comment:
IANA introduced some new top-level domains in addition
to the original seven.  This adds them to cookielib,
and adds a test for the relevant behaviour
(blacklisting of some "country-code TLDs").

2.4 backport candidate.


----------------------------------------------------------------------

>Comment By: Georg Brandl (gbrandl)
Date: 2006-05-08 17:29

Message:
Logged In: YES 
user_id=849994

Done in 45938.

----------------------------------------------------------------------

Comment By: John J Lee (jjlee)
Date: 2006-05-07 21:23

Message:
Logged In: YES 
user_id=261020

Oops, could you commit the test with that "mechanize"
replaced by "cookielib"?

I just ran the tests with that change and it passes.


----------------------------------------------------------------------

Comment By: Georg Brandl (gbrandl)
Date: 2006-05-07 20:45

Message:
Logged In: YES 
user_id=849994

Committed in rev. 45934. Note that the patch contained a new
test which imported a module "mechanize", which doesn't
belong to the stdlib. I removed that test.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1483395&group_id=5470

From noreply at sourceforge.net  Mon May  8 19:36:30 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 08 May 2006 10:36:30 -0700
Subject: [Patches] [ python-Patches-1479302 ] Make urllib2 digest auth and
	basic auth play together
Message-ID: <E1Fd9eo-0006Bk-NP@sc8-sf-web5.sourceforge.net>

Patches item #1479302, was opened at 2006-04-30 13:15
Message generated for change (Comment added) made by gbrandl
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1479302&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: None
>Status: Closed
>Resolution: Accepted
Priority: 5
Submitted By: John J Lee (jjlee)
Assigned to: Nobody/Anonymous (nobody)
Summary: Make urllib2 digest auth and basic auth play together

Initial Comment:
urllib2.HTTPDigestAuthHandler breaks urllib2's handler
scheme by raising an exception instead of returning
None to indicate another handler might handle the
response.  This stops everything in its tracks (the
exception is not caught by urllib2) and prevents
urllib2.HTTPBasicAuthHandler from handling basic auth
scheme 40* responses.

The patch simply removes the raise statement, so that
the .http_error_auth_reqed(), and therefore
.http_error_40*(), returns None.

There is also a unit test.

(will upload patch in a sec when I have the tracker ID
to insert in the test)

2.4 backport candidate.


----------------------------------------------------------------------

>Comment By: Georg Brandl (gbrandl)
Date: 2006-05-08 17:36

Message:
Logged In: YES 
user_id=849994

Applied as rev. 45939.

----------------------------------------------------------------------

Comment By: John J Lee (jjlee)
Date: 2006-04-30 14:37

Message:
Logged In: YES 
user_id=261020

Argh, posted to the wrong tracker item for that last
comment, too many bugs on the go at once, sorry.


----------------------------------------------------------------------

Comment By: John J Lee (jjlee)
Date: 2006-04-30 14:36

Message:
Logged In: YES 
user_id=261020

(...and the new patch makes a tiny fix to a
slightly-inaccurate statement in the module docstring)


----------------------------------------------------------------------

Comment By: John J Lee (jjlee)
Date: 2006-04-30 13:42

Message:
Logged In: YES 
user_id=261020

Hmm, on second thoughts: use of module logging only solves
the debugging problem.  People may want to programatically
handle failure of authentication (and, say, report to the
user "authentication failed, you entered the wrong username
or password", or "authentication failed: hash algorithm YYY
not implemented").

That doesn't make applying this patch a bad idea, because
the HTTPDigestAuthHandler ValueError is not useful for that
purpose.  People wanting to handle this at run time can
(already) and should catch the HTTPError that will
eventually be raised when no handler handles the 40*
reponse.  (although the bug addressed by this patch breaks
that in one very specific case, of course: where both digest
+ basic handlers are present, and a basic auth challenge is
received)

In summary, this patch should be applied, but we should also
, as an additional feature, think up some way of allowing
auth failure information to be reported by these handlers
(probably by stuffing the info into the HTTPError).


----------------------------------------------------------------------

Comment By: John J Lee (jjlee)
Date: 2006-04-30 13:25

Message:
Logged In: YES 
user_id=261020

Just a note that an XXX comment at the top of the code
comments that:

"""
If an authentication error handler that tries to perform
authentication for some reason but fails, how should the
error be signalled?  The client needs to know the HTTP error
code.  But if the handler knows that the problem was, e.g.,
that it didn't know that hash algo that requested in the
challenge, it would be good to pass that information along
to the client, too.
"""

I think this problem should be handled using module logging,
similarly to how module cookielib logs its reasoning for
accepting and returning cookies.

Do people agree?  If so, I'll file another patch to add that.


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1479302&group_id=5470

From noreply at sourceforge.net  Mon May  8 19:48:22 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 08 May 2006 10:48:22 -0700
Subject: [Patches] [ python-Patches-1478993 ] Take advantage of
	BaseException/Exception split in cookielib
Message-ID: <E1Fd9qH-0002bQ-DG@sc8-sf-web1.sourceforge.net>

Patches item #1478993, was opened at 2006-04-29 16:43
Message generated for change (Comment added) made by gbrandl
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1478993&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.5
>Status: Closed
>Resolution: Accepted
Priority: 5
Submitted By: John J Lee (jjlee)
Assigned to: Nobody/Anonymous (nobody)
Summary: Take advantage of BaseException/Exception split in cookielib

Initial Comment:
The patch takes advantage of the exception hierarchy
reorganisation to remove some ugly code in cookielib. 
It clarifies a couple of exception messages, too.


----------------------------------------------------------------------

>Comment By: Georg Brandl (gbrandl)
Date: 2006-05-08 17:48

Message:
Logged In: YES 
user_id=849994

Applied in rev. 45940.

----------------------------------------------------------------------

Comment By: John J Lee (jjlee)
Date: 2006-04-30 14:38

Message:
Logged In: YES 
user_id=261020

(...and the new patch makes a tiny fix to a
slightly-inaccurate statement in the module docstring)

----------------------------------------------------------------------

Comment By: John J Lee (jjlee)
Date: 2006-04-30 14:35

Message:
Logged In: YES 
user_id=261020

<shrug> I still think there should be documented guidelines
on this for both stdlib users and contributors (though they
should not live in the cookielib docs, of course!).

I've added a new patch to this tracker item,
cookielib_baseexception_2.patch, that adds an __all__ and
uses the name _warn_unhandled_exception (in addition to the
original patch contents).  I verified that the set of
documented module globals is identical to those listed in
the new __all__.  Let me know if I should also rename all
other non-public module globals defined in cookielib to have
initial underscores.

Just for the record, here is the list of all module globals
NOT listed in the new __all__ (as determined by doing
dir(cookielib) prior to applying the patch, and removing the
items I didn't add to __all__, so includes some noise from
things like 'import sys'):

['Absent', 'DAYS', 'DEFAULT_HTTP_PORT', 'EPOCH_YEAR',
'ESCAPED_CHAR_RE', 'HEADER_ESCAPE_RE',
'HEADER_JOIN_ESCAPE_RE', 'HEADER_QUOTED_VALUE_RE',
'HEADER_TOKEN_RE', 'HEADER_VALUE_RE', 'HTTP_PATH_SAFE',
'IPV4_RE', 'ISO_DATE_RE', 'LOOSE_HTTP_DATE_RE',
'MISSING_FILENAME_TEXT', 'MONTHS', 'MONTHS_LOWER',
'STRICT_DATE_RE', 'TIMEZONE_RE', 'UTC_ZONES', 'WEEKDAY_RE',
'__builtins__', '__doc__', '__file__', '__name__',
'_str2time', '_threading', '_timegm', 'copy', 'cut_port_re',
'debug', 'deepvalues', 'domain_match', 'eff_request_host',
'escape_path', 'http2time', 'httplib', 'is_HDN',
'is_third_party', 'iso2time', 'join_header_words',
'liberal_is_HDN', 'logging', 'lwp_cookie_str', 'month',
'offset_from_tz_string', 'parse_ns_headers', 're', 'reach',
'request_host', 'request_path', 'request_port',
'split_header_words', 'sys', 'time', 'time2isoz',
'time2netscape', 'timegm', 'unmatched',
'uppercase_escaped_char', 'urllib', 'urlparse',
'user_domain_match', 'vals_sorted_by_key',
'warn_unhandled_exception']


----------------------------------------------------------------------

Comment By: Brett Cannon (bcannon)
Date: 2006-04-30 02:23

Message:
Logged In: YES 
user_id=357491

You should use an underscore if you don't have an __all__
defined.  This is mostly for protection for ``from cookielib
import *`` code.  But if you define an __all__ I think you
are fine.

You do not need to document that undocumented globals should
not be relied upon.  Yes, people should know better (I
personally got nailed by the Debian folk for an undocumented
function in site.py that I changed the parameters of).  But
the suggestions Neal and I are making are to protect people
who do their doc checking from the command-line and thus
just do ``import cookielib; dir(cookielib)``.  I know Python
is for use by adults, but sometimes going a small step to
protect the adolescents is also okay.  =)

----------------------------------------------------------------------

Comment By: John J Lee (jjlee)
Date: 2006-04-30 01:31

Message:
Logged In: YES 
user_id=261020

Hmm, perhaps by "do no worse" you simply meant not to rename
the function in this tracker item to a name not beginning
with an initial underscore (since that would introduce a new
non-public module global that does not begin with an
underscore).

In which case, sorry for the rant. :-)

My questions after the rant still stand. ;-)


----------------------------------------------------------------------

Comment By: John J Lee (jjlee)
Date: 2006-04-30 01:26

Message:
Logged In: YES 
user_id=261020

Bleh.

e). Stdlib users should assume all undocumented module
globals are not part of the public API (I guess this should
be go somewhere near the library reference introduction)


----------------------------------------------------------------------

Comment By: John J Lee (jjlee)
Date: 2006-04-30 01:22

Message:
Logged In: YES 
user_id=261020

So, you (Neil) agree with my three numbered action points
below?  You repeat my suggestion that we document this as if
it were a new suggestion; did you read my comment?

Sigh, sorry for being a little grumpy about this, but it's
hard to "do no worse" for a project if that project doesn't
seem itself to be very sure what it considers "worse":

While I must say I *agree* with you that such practices are
not good, if I as somebody apparently unusually inclined to
heavy use of underscores (even in most of my module names,
in library code) actually thought, however foolishly, that I
was *following stdlib conventions* by using *fewer*
underscores (for reasons I'll try to refrain from debating
further here), it does indeed seem pretty clear we're in
need of explicit documentation on this!  So your advice to
"do no worse" is a little annoying at this point... :-)

OK, so, what should get documented, specifically?  And where
should documentation for module authors go?

a). Stdlib module authors should always use underscores for
non-public module globals.

b). Don't know about this one: should non-legacy stdlib
modules (viz, those that follow rule a)) define __all__? 
(perhaps a point against doing this is that it may encourage
import * ?).

c). Stdlib packages should use __init__ to export public names.

d). Any discrepancy between __all__ and the API
documentation is a bug.

e). Stdlib users should assume all non- (I guess this should
be go somewhere near the library reference introduction)

Finally, how about my point 2?  Should I add underscores to
cookielib module globals I consider non-public (== all
undocumented module globals), or not?

Thanks for the feedback, both of you!


----------------------------------------------------------------------

Comment By: Neal Norwitz (nnorwitz)
Date: 2006-04-30 00:37

Message:
Logged In: YES 
user_id=33168

John, at this point (2.x) we should at least do no worse. 
Don't export unnecessary vars in any new code.  We should
also start documenting the situation and work towards
improving it.  For 3k, we should do better and solidify the
rules and do massive cleanup (module by module).  This will
probably involve some arm twisting of Guido. :-)

----------------------------------------------------------------------

Comment By: John J Lee (jjlee)
Date: 2006-04-30 00:27

Message:
Logged In: YES 
user_id=261020

I changed the exception detail strings to use %r to get
quotes around the filename and quoted "bad" line read from
the file.  That makes it clearer what is part of the
explanatory English text and what is part of the filename
(or part of the quoted bad line, as the case may be). 
Filenames can and do contain spaces, commas, etc.

Your other point stunned me a bit.  I don't think it had
ever even really *occurred* to me that stdlib users might
consider stdlib module globals that are not documented as
public.  Ironically, I think that's because the code from
which cookielib derives is much stricter about this, all
modules starting with '_' and package __init__ exporting a
short list of names -- I guess I thought I was following
stdlib conventions by *not* adding initial underscores all
over the place.  Looking at some other stdlib code, I see
that underscores would have been more conventional after all.

Searching for reassurance, I discovered this from one of
your old python-dev summaries that confirms that
undocumented stdlib module globals are not considered part
of the module public interface:

http://www.python.org/dev/summary/2004-07-16_2004-07-31/#use-the-docs-to-know-what-the-public-api-is-people

e.g. from Tim Peters:

"""
As you noted later, it wasn't part of keyword's documented
interface, and you *always* act at your own risk when you go
beyond the docs.
"""

However, I don't see that this is explicitly documented,
which seems unfortunate to me (even though Tim's statement
is true regardless of any convention Python might have).

So, I guess I should:

1. Write something explicit about this (along the lines of
"Use undocumented module globals at your own risk") for the
stdlib library docs -- perhaps starting from Tim's post --
and submit that as a doc patch.

2. Leave all module global names in cookielib unchanged (so
people using those functions don't suffer gratuitous
breakage, even though any such people are asking for trouble
in the long run).  However, in the thread above, Michael
Hudson disagrees with that, and suggests all such module
globals be renamed.  So suggestions are welcome here on the
best course of action.

3. As you suggest, submit a patch to add an __all__ to
cookielib.


----------------------------------------------------------------------

Comment By: Brett Cannon (bcannon)
Date: 2006-04-29 21:12

Message:
Logged In: YES 
user_id=357491

Overall the patch looks fine (on vacation so not up for
applying and handling any possible failures so not going to
assign to myself).  But a question and a suggestion.

Why were the error strings changed to use the repr instead
of the string representation?  What does it buy you?

And if you are going to be changing the function name, you
might want to consider using a leading underscore to prevent
people from using it or getting exported.  Otherwise I would
define __all__ for the module.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1478993&group_id=5470

From noreply at sourceforge.net  Tue May  9 06:13:33 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 08 May 2006 21:13:33 -0700
Subject: [Patches] [ python-Patches-1483325 ] Patch fixing #1481770 (wrong
	shared lib ext on hpux ia64)
Message-ID: <E1FdJbJ-0007S5-Cc@sc8-sf-web3.sourceforge.net>

Patches item #1483325, was opened at 2006-05-07 07:33
Message generated for change (Comment added) made by deckrider
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1483325&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.4
Status: Open
Resolution: None
Priority: 5
Submitted By: David Everly (deckrider)
Assigned to: Nobody/Anonymous (nobody)
Summary: Patch fixing #1481770 (wrong shared lib ext on hpux ia64)

Initial Comment:
(configure and pyconfig.h.in must be regenerated after
applying this patch)

Not heavily tested, since I only have Linux (i686) at home.

Will test on HPUX ia64 tomorrow and report back.

----------------------------------------------------------------------

>Comment By: David Everly (deckrider)
Date: 2006-05-08 22:13

Message:
Logged In: YES 
user_id=1113403

I tested against hpux ia64 today, and found that the
original patch required correction.  Here is the result.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1483325&group_id=5470

From noreply at sourceforge.net  Tue May  9 15:51:18 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Tue, 09 May 2006 06:51:18 -0700
Subject: [Patches] [ python-Patches-1484695 ] tarfile.py fix for #1471427
	and updates
Message-ID: <E1FdScQ-0001rv-6T@sc8-sf-web1.sourceforge.net>

Patches item #1484695, was opened at 2006-05-09 15:51
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1484695&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Lars Gust?bel (gustaebel)
Assigned to: Nobody/Anonymous (nobody)
Summary: tarfile.py fix for #1471427 and updates

Initial Comment:
I have assembled a patch that adds some features from
my own development path of tarfile.py
(http://www.gustaebel.de/lars/tarfile/) and fixes
#1471427 which made some restructuring necessary. The
patch affects Lib/tarfile.py, Lib/test/test_tarfile.py
and Doc/lib/libtarfile.tex.

The changes the patch makes are as follows:

Sets the version to 0.8.0.

Support for base256 encoding of number fields (nti()
and itn()). Up to now this was hardcoded for the
filesize field to allow filesizes greater than 8 GB but
it is applicable to all number fields.

TarInfo.tobuf() has a boolean argument "posix" which
controls how number fields are written (base256 is
non-posix).

Both unsigned and signed (Sun and NeXT) checksums are
calculated. Header validation moves from TarFile.next()
to TarInfo.frombuf(). A header is valid only if its
checksum is okay, in the past the checksum was
calculated but ignored.

The TarFile.next() method was rearranged which makes
header processing clearer and more abstract and fixes
bug #1471427. However, this change breaks the interface
for subclassing in order to implement custom member
types but makes it much easier at the same time. The
mapping TYPE_METH was removed.

A new test ReadGNULongTest was added to test_tarfile.py
and testtar.tar was updated to be able to test the GNU
extensions LONGNAME and LONGLINK.


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1484695&group_id=5470

From noreply at sourceforge.net  Tue May  9 15:52:23 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Tue, 09 May 2006 06:52:23 -0700
Subject: [Patches] [ python-Patches-1484695 ] tarfile.py fix for #1471427
	and updates
Message-ID: <E1FdSdT-00028V-Bk@sc8-sf-web1.sourceforge.net>

Patches item #1484695, was opened at 2006-05-09 15:51
Message generated for change (Comment added) made by gustaebel
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1484695&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Lars Gust?bel (gustaebel)
Assigned to: Nobody/Anonymous (nobody)
Summary: tarfile.py fix for #1471427 and updates

Initial Comment:
I have assembled a patch that adds some features from
my own development path of tarfile.py
(http://www.gustaebel.de/lars/tarfile/) and fixes
#1471427 which made some restructuring necessary. The
patch affects Lib/tarfile.py, Lib/test/test_tarfile.py
and Doc/lib/libtarfile.tex.

The changes the patch makes are as follows:

Sets the version to 0.8.0.

Support for base256 encoding of number fields (nti()
and itn()). Up to now this was hardcoded for the
filesize field to allow filesizes greater than 8 GB but
it is applicable to all number fields.

TarInfo.tobuf() has a boolean argument "posix" which
controls how number fields are written (base256 is
non-posix).

Both unsigned and signed (Sun and NeXT) checksums are
calculated. Header validation moves from TarFile.next()
to TarInfo.frombuf(). A header is valid only if its
checksum is okay, in the past the checksum was
calculated but ignored.

The TarFile.next() method was rearranged which makes
header processing clearer and more abstract and fixes
bug #1471427. However, this change breaks the interface
for subclassing in order to implement custom member
types but makes it much easier at the same time. The
mapping TYPE_METH was removed.

A new test ReadGNULongTest was added to test_tarfile.py
and testtar.tar was updated to be able to test the GNU
extensions LONGNAME and LONGLINK.


----------------------------------------------------------------------

>Comment By: Lars Gust?bel (gustaebel)
Date: 2006-05-09 15:52

Message:
Logged In: YES 
user_id=642936

Here is testtar.tar to replace Lib/test/testtar.tar.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1484695&group_id=5470

From noreply at sourceforge.net  Tue May  9 17:14:20 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Tue, 09 May 2006 08:14:20 -0700
Subject: [Patches] [ python-Patches-1484758 ] cookielib: reduce (fatal)
	dependency on "beta" logging?
Message-ID: <E1FdTum-0002MN-Lu@sc8-sf-web3.sourceforge.net>

Patches item #1484758, was opened at 2006-05-09 17:14
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1484758&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.4
Status: Open
Resolution: None
Priority: 5
Submitted By: kxroberto (kxroberto)
Assigned to: Nobody/Anonymous (nobody)
Summary: cookielib: reduce (fatal) dependency on "beta" logging?

Initial Comment:
The logging package is tagged "beta". Yet cookielib (as
the ONLY module in the std. lib !?) uses Logger.debug()
very excessively.

I got occasional nasty crash traces (from users) when
using cookielib Processors through urllib2
(multi-threaded usage) - see below.  The causes are not
errors in cookielib, but upon simple calls to
Logger.debug() : varying AttributeError's in logging,
which on the first glance seem to be impossible, as
those attributes are set in the related __init__()'s
but there are strange complex things going on with
roots/hierarchies/copy etc. so....  thread/lock
problems I'd guess.

the patch uncomments several debug() calls in cookielib
in import. only one's in important high-frequency
execution flow path (not ones upon errors and
exceptional states). And 2 minor fixes on pychecker
warnings.

After applying that, the nasty crash reports disappeared.

I do not understand completely why the cookielib
production code has to use the logging package
(expensive) at all. At least for the high-frq used
add_cookie_header its unnecessary. There could be some
simpler (detached) test code for testing purposes.
Importing the logging and setup is time consuming etc.
(see other patch for urllib2 import optimization. )

I'd recommend: At least as far as logging is "beta" and
cookielib NOT, all these debug()'s should be
uncommented, or at least called ONLY upon a dispatching
global 'use_logging' variable in cookielib, in case the
test code cannot be externalized nicely.


2 example error traces:

...File "cookielib.pyo",
line 1303, in add_cookie_header\\n\', \'  File
"logging\\\\__init__.pyo", line 878, in debug\\n\',
\'  File "logging\\\\__init__.pyo", line 1056, in
getEffectiveLevel\\n\', "AttributeError: Logger
instance has no attribute \'level\'\\n


...in http_request\\n\', \'  File "cookielib.pyo", line
1303, in add_cookie_header\\n\', \'  File
"logging\\\\__init__.pyo", line 876, in debug\\n\',
"AttributeError: Manager instance has no attribute
\'disable\'\\n


-robert

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1484758&group_id=5470

From noreply at sourceforge.net  Tue May  9 17:59:48 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Tue, 09 May 2006 08:59:48 -0700
Subject: [Patches] [ python-Patches-1484793 ] urllib2: resolves extremly
	slow import (of "everything")
Message-ID: <E1FdUcm-0005Eq-KF@sc8-sf-web2.sourceforge.net>

Patches item #1484793, was opened at 2006-05-09 17:59
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1484793&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.4
Status: Open
Resolution: None
Priority: 5
Submitted By: kxroberto (kxroberto)
Assigned to: Nobody/Anonymous (nobody)
Summary: urllib2: resolves extremly slow import (of "everything")

Initial Comment:
This superseeds the old patch #1053150 (for an older
Python; it was stopped: "Jeremy doesn't like the idea")
in order to import the expensive modules behind urllib2
late.

I'm recommending now again to do this, as things are
almost unacceptable meanwhile.

In Py24, simply importing original urllib2 costs upto
to a second on my slower machines. the startup time of
some of my bigger apps/scripts goes mainly to importing
urllib2. More than half of the time goes into importing
cookielib (regarding profiler runs). Its almost
unusable so now in CGI scripts.

New modules were added to urllib2 meanwhile, and worst
of all the cookielib was inserted into urllib2 the same
old style "import everything on top of the file in a
kind of C-#include manner". 

Python offers best dynamic modularization of code. That
should be exploited for such an expensive
virtualization module like urllib2. There are usually
only very locations, where the sub-modules are referenced. 
This patch also enables to strip off unnecessary
modules (down to _MozillaCookieJar!) for
cx_freeze/py2exe distribution. 

( Since long I have this patch on my list, which I
apply after each Python installation regularly. )

--

As a side effect of this import-all practice a lazy
cookielib dependency came into normal Request
constructor code:
"origin_req_host = cookielib.request_host(self)"

I'd recommend, to copy/move this simple tool function
request_host into urllib2 in order to resolve the
cookielib dependency completely. (not done so far in
the patch)


-robert


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1484793&group_id=5470

From mersch.bernhard at osnanet.de  Tue May  9 20:33:07 2006
From: mersch.bernhard at osnanet.de (mersch.bernhard)
Date: Tue, 9 May 2006 20:33:07 +0200
Subject: [Patches]  Nastiest of fresh girls.
Message-ID: <NBEPIBAANCMJOJJPIEMLIEBKCAAA.mersch.bernhard@osnanet.de>


From noreply at sourceforge.net  Wed May 10 04:48:51 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Tue, 09 May 2006 19:48:51 -0700
Subject: [Patches] [ python-Patches-1478292 ] Fix doctest nit.
Message-ID: <E1Fdekt-0006Hc-DG@sc8-sf-web1.sourceforge.net>

Patches item #1478292, was opened at 2006-04-28 05:54
Message generated for change (Comment added) made by tim_one
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1478292&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.5
>Status: Closed
>Resolution: Fixed
Priority: 5
Submitted By: Thomas Heller (theller)
>Assigned to: Nobody/Anonymous (nobody)
Summary: Fix doctest nit.

Initial Comment:
I was puzzled by this behaviour:

C:\>py25
Python 2.5a2 (r25a2:45740, Apr 27 2006, 06:31:19) [MSC
v.1310 32 bit (Intel)] on win32
Type "help", "copyright", "credits" or "license" for
more information.
>>> from doctest import register_optionflag
>>> print register_optionflag("SPAM")
1024
>>> print register_optionflag("SPAM")
2048
>>> print register_optionflag("SPAM")
2048
>>>

I suggest that register_optionflags does not
re-register already registered flags.

----------------------------------------------------------------------

>Comment By: Tim Peters (tim_one)
Date: 2006-05-09 22:48

Message:
Logged In: YES 
user_id=31435

I agree that this behavior wasn't intended.  Fixed in a
simpler way, and added a test to ensure it stays fixed, in
rev  45944 on the trunk and rev 45945 on the 2.4 branch. 
Thanks!

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1478292&group_id=5470

From noreply at sourceforge.net  Wed May 10 18:26:31 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 10 May 2006 09:26:31 -0700
Subject: [Patches] [ python-Patches-1484695 ] tarfile.py fix for #1471427
	and updates
Message-ID: <E1FdrWB-0006Be-4f@sc8-sf-web3.sourceforge.net>

Patches item #1484695, was opened at 2006-05-09 13:51
Message generated for change (Comment added) made by gbrandl
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1484695&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.5
>Status: Closed
>Resolution: Accepted
Priority: 5
Submitted By: Lars Gust?bel (gustaebel)
Assigned to: Nobody/Anonymous (nobody)
Summary: tarfile.py fix for #1471427 and updates

Initial Comment:
I have assembled a patch that adds some features from
my own development path of tarfile.py
(http://www.gustaebel.de/lars/tarfile/) and fixes
#1471427 which made some restructuring necessary. The
patch affects Lib/tarfile.py, Lib/test/test_tarfile.py
and Doc/lib/libtarfile.tex.

The changes the patch makes are as follows:

Sets the version to 0.8.0.

Support for base256 encoding of number fields (nti()
and itn()). Up to now this was hardcoded for the
filesize field to allow filesizes greater than 8 GB but
it is applicable to all number fields.

TarInfo.tobuf() has a boolean argument "posix" which
controls how number fields are written (base256 is
non-posix).

Both unsigned and signed (Sun and NeXT) checksums are
calculated. Header validation moves from TarFile.next()
to TarInfo.frombuf(). A header is valid only if its
checksum is okay, in the past the checksum was
calculated but ignored.

The TarFile.next() method was rearranged which makes
header processing clearer and more abstract and fixes
bug #1471427. However, this change breaks the interface
for subclassing in order to implement custom member
types but makes it much easier at the same time. The
mapping TYPE_METH was removed.

A new test ReadGNULongTest was added to test_tarfile.py
and testtar.tar was updated to be able to test the GNU
extensions LONGNAME and LONGLINK.


----------------------------------------------------------------------

>Comment By: Georg Brandl (gbrandl)
Date: 2006-05-10 16:26

Message:
Logged In: YES 
user_id=849994

Thanks for the patch, applied as rev. 45954.

----------------------------------------------------------------------

Comment By: Lars Gust?bel (gustaebel)
Date: 2006-05-09 13:52

Message:
Logged In: YES 
user_id=642936

Here is testtar.tar to replace Lib/test/testtar.tar.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1484695&group_id=5470

From noreply at sourceforge.net  Wed May 10 19:13:45 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 10 May 2006 10:13:45 -0700
Subject: [Patches] [ python-Patches-721464 ] Remote debugging with pdb.py
Message-ID: <E1FdsFt-0004OQ-7P@sc8-sf-web2.sourceforge.net>

Patches item #721464, was opened at 2003-04-14 23:02
Message generated for change (Comment added) made by gbrandl
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=721464&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.5
>Status: Closed
>Resolution: Accepted
Priority: 5
Submitted By: Laurent Pelecq (lpelecq)
Assigned to: Raymond Hettinger (rhettinger)
Summary: Remote debugging with pdb.py

Initial Comment:
With this patch, instances of pdb.Pdb can read and
write from arbitrary file objects. It is based on
similar changes that have been made to cmd.py. It
basically consists of replacing print statement with
calls to self.stdout.write.

So it is possible for example to control the debugger
from another terminal to debug curses-based
applications or CGI scripts.

I can provide a basic client/server debugger.

This patch has been tested on Mandrake Linux 9.1 with
the current CVS version.


----------------------------------------------------------------------

>Comment By: Georg Brandl (gbrandl)
Date: 2006-05-10 17:13

Message:
Logged In: YES 
user_id=849994

I committed a version of the patch using output redirection
as rev. 45955.

----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2006-04-14 16:04

Message:
Logged In: YES 
user_id=21627

lpelecq, would you be willing to redo that patch for 2.5?
Using print redirection (instead of .write calls) might be
the easiest way to do it.

rhettinger, do you want to come back to this patch now? If
not, please unassign.

----------------------------------------------------------------------

Comment By: Raymond Hettinger (rhettinger)
Date: 2003-06-22 17:15

Message:
Logged In: YES 
user_id=80475

I think this is a good idea.
It is past the the time for being added to 2.3.
Unassigning, but will come back to it for 2.4.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=721464&group_id=5470

From noreply at sourceforge.net  Thu May 11 05:50:54 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 10 May 2006 20:50:54 -0700
Subject: [Patches] [ python-Patches-1429539 ] pdb: fix for 1326406 (import
	__main__ pdb failure)
Message-ID: <E1Fe2CU-0000fN-Vo@sc8-sf-web2.sourceforge.net>

Patches item #1429539, was opened at 2006-02-10 19:34
Message generated for change (Comment added) made by isandler
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1429539&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Ilya Sandler (isandler)
Assigned to: Nobody/Anonymous (nobody)
Summary: pdb: fix for  1326406 (import  __main__ pdb failure)

Initial Comment:
The patch allows pdb to debug program which 
import from __main__


----------------------------------------------------------------------

>Comment By: Ilya Sandler (isandler)
Date: 2006-05-10 20:50

Message:
Logged In: YES 
user_id=971153

I'm attaching an alternative patch: the program stil runs in
__main__ namespace, but pdb gets imported first:

  import pdb
  pdb.main()

So the main program cann't accidentally stomp on pdb
internals (e.g by doing help=None)

(there is still a bit of namespace pollution in the main
program)


----------------------------------------------------------------------

Comment By: Ilya Sandler (isandler)
Date: 2006-04-23 11:10

Message:
Logged In: YES 
user_id=971153

> 1. Could you give some code examples for that?

Do you mean examples of intentional interference with
debugger? Well, you could just traverse the stack and check
whether the program runs under debugger and then do anything
you want... But why do you think intentional interference
would ever be an issue? After all python is not a language
to write debugger-resistant applications ;-) 

Anyway, here are some examples of unintentional interference:

1. If you need a custom version of std module, you can
modify sys.path and then import the module.. Which works by
itself. But if pdb is loaded first and imports the module,
then it does not work...

2. Similar problem with any application which changes
sys.stdout/sys.stdin (there is actually a SF bug for that)

3.  Also I don't see how pdb in its current form can control
any program which needs a full-screen control of the terminal...

4. Any program which tries to do any magic with stack and
assumes that top level stack frame is the main application
will not work under pdb (where top level stack frame is pdb)

---------------------------------------------------
And there is a whole separate bunch of intereference issues
when pdb restarts the program.

---------------------------------------------------

When a program does run in pdb's namespace (as would be the
case if this patch is applied), pdb could save copies of all
module global symbols which it needs and thus become immune
to the accidental overwriting of those symbols in the main
program...

There could be a better way...


----------------------------------------------------------------------

Comment By: Kuba Ko??czyk (jakamkon)
Date: 2006-04-21 08:28

Message:
Logged In: YES 
user_id=1491175

Sorry I forget to login in;)The comment below is from me.

----------------------------------------------------------------------

Comment By: Nobody/Anonymous (nobody)
Date: 2006-04-21 08:25

Message:
Logged In: NO 

1. Could you give some code examples for that?
2,3. Did you notice that google search for "from __main__
import" give hits similar to: 
  t = Timer("test()", "from __main__ import test")
in most situations?
I think it's hard to value uses of "from..." based on google
search or similar method.Maybe we shoud ask on python-list
what are the others opinions?

>As a middle ground it might be a good idea to expand the
>patch to reduce pdb's dependency on module global symbols
I'am interesting how would you do that?
 

----------------------------------------------------------------------

Comment By: Ilya Sandler (isandler)
Date: 2006-04-20 19:39

Message:
Logged In: YES 
user_id=971153

I do see your point (In fact it was me who submitted the
patch #896011 which separated pdb namespace from the
program's -- and thus broke imports from __main__ ;-))..

I do want to bring a couple of points:

1. I don't think it matters whether a program can
intentionally interfere with pdb...Even when pdb's namespace
is separated, it's easy for the program to  interfere with
debugger.. (Or delete your home directory for that matter)

2. Importing from __main_ may not be common in the std lib,
but that's simply because stdlib doesn't contain that many
executable hence there are very few places where there is
__main__ to import from. 

google search for "from __main__ import" results in about 1M
hits.


3. Just for the record, profile module does not separate its
 namespace from programs's either...

So, basically, it boils down to this: what's worse breaking
imports from __main__ or risking accidental interference
between pdb and the program (e.g if your program redefines a
help symbol)...

As a middle ground it might be a good idea to expand the
patch to reduce pdb's dependency on module global symbols
and thus reducing the risk of interference.

What do you think?


----------------------------------------------------------------------

Comment By: Kuba Ko??czyk (jakamkon)
Date: 2006-04-20 04:17

Message:
Logged In: YES 
user_id=1491175

I think that exposing pdb's namespaces for debugged code is
dangerous.When debugged code have this kind of access he can
dynamic change pdb's behaviour without your control:

y.py:
die = """\
def destroy(x,y):
        print 'Iam crashing your HOME and deleting your FILES'

Pdb.__dict__['do_break'] = destroy # pdb's break = destroy
"""
x.py:
# innocently looking code;)
import y
exec(y.puff)
print "X"

with your patch:
$ python2.5 -m pdb x.py
> /home/xyz/python/x.py(1)<module>()
-> import y
(Pdb) Pdb.__dict__['do_break']
<function do_break at 0xb7cafdf4>
(Pdb) break
(Pdb) n
> /home/xyz/python/x.py(2)<module>()
-> exec(y.puff)
(Pdb) n
> /home/xyz/python/x.py(3)<module>()
-> print "X"
(Pdb) Pdb.__dict__['do_break']
<function destroy at 0xb7cb81b4>
(Pdb) break
Iam crashing your HOME and deleting your FILES

I think that this patch can't be accepted due to above
reason.According to my advanced reaserch;) ( find Lib/ -name
'*.py' -exec grep 'from __main__ import' {} -ls \; ) 'from
__main__' is rare case so maybe it will be reasonable to
simply handle ImportError and print something like
'** 'from __main__ import' not supported' message.What do  
you think?       
  

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1429539&group_id=5470

From noreply at sourceforge.net  Thu May 11 06:31:11 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 10 May 2006 21:31:11 -0700
Subject: [Patches] [ python-Patches-1053150 ] urllib2: better import ftplib
	and gopherlib etc late
Message-ID: <E1Fe2pT-0004R5-29@sc8-sf-web2.sourceforge.net>

Patches item #1053150, was opened at 2004-10-24 13:50
Message generated for change (Comment added) made by loewis
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1053150&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: None
Group: None
>Status: Closed
>Resolution: Out of Date
Priority: 5
Submitted By: kxroberto (kxroberto)
Assigned to: Nobody/Anonymous (nobody)
Summary: urllib2: better import ftplib and gopherlib etc late

Initial Comment:
importing those libs like (ftplib, gopherlib, ..)
unconditionally on top of urllib2 slows down and
hinders distributing small app packages (py2exe'd,
mcm.installer, ...).
simple patch in attachment

----------------------------------------------------------------------

>Comment By: Martin v. L??wis (loewis)
Date: 2006-05-11 06:31

Message:
Logged In: YES 
user_id=21627

Closing this as out-of-date; it is replaced by #1484793.

----------------------------------------------------------------------

Comment By: John J Lee (jjlee)
Date: 2005-05-19 22:38

Message:
Logged In: YES 
user_id=261020

Since Jeremy doesn't like the idea (see tracker item ref.
below), this should probably be closed, but:

Robert originally submitted this as bug 1046077.


----------------------------------------------------------------------

Comment By: John J Lee (jjlee)
Date: 2005-02-05 00:10

Message:
Logged In: YES 
user_id=261020

Looks good to me.


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1053150&group_id=5470

From noreply at sourceforge.net  Thu May 11 06:33:28 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 10 May 2006 21:33:28 -0700
Subject: [Patches] [ python-Patches-1483325 ] Patch fixing #1481770 (wrong
	shared lib ext on hpux ia64)
Message-ID: <E1Fe2rg-0005u5-JB@sc8-sf-web4-b.sourceforge.net>

Patches item #1483325, was opened at 2006-05-07 15:33
Message generated for change (Comment added) made by loewis
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1483325&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.4
>Status: Closed
>Resolution: Out of Date
Priority: 5
Submitted By: David Everly (deckrider)
Assigned to: Nobody/Anonymous (nobody)
Summary: Patch fixing #1481770 (wrong shared lib ext on hpux ia64)

Initial Comment:
(configure and pyconfig.h.in must be regenerated after
applying this patch)

Not heavily tested, since I only have Linux (i686) at home.

Will test on HPUX ia64 tomorrow and report back.

----------------------------------------------------------------------

>Comment By: Martin v. L??wis (loewis)
Date: 2006-05-11 06:33

Message:
Logged In: YES 
user_id=21627

This is being tracked in #1481770; closing it here as
out-of-date.

----------------------------------------------------------------------

Comment By: David Everly (deckrider)
Date: 2006-05-09 06:13

Message:
Logged In: YES 
user_id=1113403

I tested against hpux ia64 today, and found that the
original patch required correction.  Here is the result.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1483325&group_id=5470

From noreply at sourceforge.net  Thu May 11 07:15:20 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 10 May 2006 22:15:20 -0700
Subject: [Patches] [ python-Patches-1474907 ] detect %zd format for
	PY_FORMAT_SIZE_T
Message-ID: <E1Fe3WC-0006tO-9H@sc8-sf-web5.sourceforge.net>

Patches item #1474907, was opened at 2006-04-22 23:18
Message generated for change (Settings changed) made by bcannon
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1474907&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Build
Group: None
>Status: Closed
>Resolution: Fixed
Priority: 5
Submitted By: Brett Cannon (bcannon)
Assigned to: Brett Cannon (bcannon)
Summary: detect %zd format for PY_FORMAT_SIZE_T

Initial Comment:
The patch modifies configure.in to add PY_FORMAT_SIZE_T
to configure.in (meaning you need to run autoheader on
configure.in) so that if %zd is supported for size_t it
sets PY_FORMAT_SIZE_T to "z", otherwise it goes
undefined and the preprocessor trickery in
Include/pyport.h kicks in.

This fix removes compiler warnings on OS X 10.4.6 with
gcc 4.0.1 thanks to PY_FORMAT_SIZE_T being set to "".

Initially assigned to Martin v. Loewis since he said
this would be good to do and the Py_ssize_t stuff is
his invention.

----------------------------------------------------------------------

>Comment By: Brett Cannon (bcannon)
Date: 2006-05-10 22:15

Message:
Logged In: YES 
user_id=357491

Applied in r45960 .

----------------------------------------------------------------------

Comment By: Brett Cannon (bcannon)
Date: 2006-04-27 21:51

Message:
Logged In: YES 
user_id=357491

Yeah, I tried to use a string constant as a stack value, but
that didn't work.  =)  My brain just was not thinking in C
when I first came up with the patch.

I have a new version that uses a char array as the buffer. 
I am on vacation so I don't have the time to apply it and
break buildbot, so I will hold off on applying if no one
finds problems with this version.

----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2006-04-26 22:29

Message:
Logged In: YES 
user_id=21627

Looks fine to me, although it has "unusual" style of C:

- sizeof(char) is guaranteed to be 1 by the C standard. The
C standard defines "char" and "byte" as synonyms, even if
that means that "byte" has more than 8 bits. sizeof gives
the number of bytes, so for char, it is always 1.

- for a fixed-size array, people would normally make this an
automatic (stack) variable, instead of bothering with
explicit memory allocation, i.e.

  char str_buffer[4]

Just out of fear of buffer overruns, many people would also
add some horrendous overallocation, such as str_buffer[1024] :-)


----------------------------------------------------------------------

Comment By: Brett Cannon (bcannon)
Date: 2006-04-26 22:16

Message:
Logged In: YES 
user_id=357491

Realized there is a better way: just strncmp() for the
expected result.  Uploaded a new version.

----------------------------------------------------------------------

Comment By: Brett Cannon (bcannon)
Date: 2006-04-26 21:59

Message:
Logged In: YES 
user_id=357491

OK, uploaded a new version that uses strchr to check for
'%', 'z', and 'd'.  If it looks reasonable I will apply it
and hope I don't break the buildbot.  =)

----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2006-04-26 09:15

Message:
Logged In: YES 
user_id=21627

The patch seems to rely on printf returning <0 for the
unrecognized format. That seems unreliable: atleast on
Linux, printf just outputs the format as-is for unrecognized
formats. Instead, I think it should use sprintf, and then
check whether the result is the string "0" (in addition to
checking whether the printf call itself failed).

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1474907&group_id=5470

From noreply at sourceforge.net  Thu May 11 09:43:05 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Thu, 11 May 2006 00:43:05 -0700
Subject: [Patches] [ python-Patches-1479611 ] speed up function calls
Message-ID: <E1Fe5pB-00040B-SW@sc8-sf-web5.sourceforge.net>

Patches item #1479611, was opened at 2006-04-30 23:58
Message generated for change (Comment added) made by nnorwitz
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1479611&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Neal Norwitz (nnorwitz)
Assigned to: Nobody/Anonymous (nobody)
Summary: speed up function calls

Initial Comment:
Results:  2.86% for 1 arg (len), 11.8% for 2 args
(min), and 1.6% for pybench.

trunk-speed$ ./python.exe -m timeit 'for x in
xrange(10000): len([])'
100 loops, best of 3: 4.74 msec per loop
trunk-speed$ ./python.exe -m timeit 'for x in
xrange(10000): min(1,2)'
100 loops, best of 3: 8.03 msec per loop

trunk-clean$ ./python.exe -m timeit 'for x in
xrange(10000): len([])'
100 loops, best of 3: 4.88 msec per loop
trunk-clean$ ./python.exe -m timeit 'for x in
xrange(10000): min(1,2)'
100 loops, best of 3: 9.09 msec per loop

pybench goes from 5688.00 down to 5598.00


Details about the patch:

There are 2 unrelated changes.  They both seem to
provide equal benefits for calling varargs C.  One is
very simple and just inlines calling a varargs C
function rather than calling PyCFunction_Call() which
does extra checks that are already known.  This moves
meth and self up one block. and breaks the C_TRACE into
2.  (When looking at the patch, this will make sense I
hope.)

The other change is more dangerous.  It modifies
load_args() to hold on to tuples so they aren't
allocated and deallocated.  The initialization is done
one time in the new func _PyEval_Init().

It allocates 64 tuples of size 8 that are never
deallocated.  The idea is that there won't be usually
be more than 64 frames with 8 or less parameters active
on the stack at any one time (stack depth).  There are
cases where this can degenerate, but for the most part,
it should only be marginally slower, but generally this
should be a fair amount faster by skipping the alloc
and dealloc and some extra work.  My decrementing the
_last_index inside the needs_free blocks, that could
improve behaviour.

This really needs comments added to the code.  But I'm
not gonna get there tonight.  I'd be interested in
comments about the code.

----------------------------------------------------------------------

>Comment By: Neal Norwitz (nnorwitz)
Date: 2006-05-11 00:43

Message:
Logged In: YES 
user_id=33168

This version actually works (in both normal and debug
builds).  It adds some stats which are useful and updates
Misc/SpecialBuilds.txt.

I modified to not preallocate and only hold a ref when the
function didn't keep a ref.

I still need to inline more of PyCFunction_Call.  Speed is
still the same as before.

I'm not sure if I'll finish this before the sprint next
week.  Anyone there feel free to check this in if you finish it.

----------------------------------------------------------------------

Comment By: Neal Norwitz (nnorwitz)
Date: 2006-05-05 01:27

Message:
Logged In: YES 
user_id=33168

v2 attached.  You might not want to review yet.  I mostly
did the first part of your suggest (stats, _Fini, and
stack-like if I understood you correctly).  I didn't do
anything on the second part about inlinting Function_Call.

perf seems to be about the same.  I'm not entirely sure the
patch is correct yet. I found one or two problems in the
original.  I added some more comments. 

----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2006-05-01 01:27

Message:
Logged In: YES 
user_id=21627

The tuples should get deallocated when Py_Finalize is called.

It would be good if there was (conditional) statistical
analysis, showing how often no tuple was found because the
number of arguments was too large, and how often no tuple
was found because the candidate was in use.

I think it should be more stack-like, starting off with no
tuples allocated, then returning them inside the needs_free
blocks only if the refcount is 1 (or 2?). This would avoid
degeneralized cases where some function holds onto its
argument tuple indefinitely, thus consuming all 64 tuples.

For the other part, I think it would make the code more
readable if it inlined PyCFunction_Call even more: the test
for NOARGS|O could be integrated into the switch statement
(one case for each), VARARGS and VARARGS|KEYWORDS would both
load the arguments, then call the function directly
(possibly with NULL keywords). OLDARGS should goto either
METH_NOARGS, METH_O, or METH_VARARGS depending on na (if you
don't like goto, modifying flags would work as well).

----------------------------------------------------------------------

Comment By: Neal Norwitz (nnorwitz)
Date: 2006-05-01 00:08

Message:
Logged In: YES 
user_id=33168

I should note the numbers 64 and 8 are total guesses.  It
might be good to try and determine values based on empirical
data.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1479611&group_id=5470

From noreply at sourceforge.net  Thu May 11 19:19:37 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Thu, 11 May 2006 10:19:37 -0700
Subject: [Patches] [ python-Patches-1486713 ] HTMLParser : A auto-tolerant
	parsing mode
Message-ID: <E1FeEp7-0008ER-1L@sc8-sf-web3.sourceforge.net>

Patches item #1486713, was opened at 2006-05-11 19:19
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1486713&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.3
Status: Open
Resolution: None
Priority: 5
Submitted By: kxroberto (kxroberto)
Assigned to: Nobody/Anonymous (nobody)
Summary: HTMLParser : A auto-tolerant parsing mode

Initial Comment:
Changes:

* Now allows missing spaces between attributes as its
often seen on the web like this :

<script type="text/javascript"language="JavaScript1.1">

That like broke the whole parsing before.


* A fully auto-tolerant mode (HTMLParser.tolerant=1)
was added. It should hopefully NEVER break HTML parsing
on the level of HTMLParser, but recover and continue
the parsing smartly. The mode was tested extensively
with complex pages. The tolerant mode is guaranted to
finish all HTML stuff only during HTMLParser.close() /
goahead(end=True)  - yet that was the same (stucking)
policy before.
Maybe steep: I have  switched ON the tolerant mode by
default, as this is, what in 99.9% of cases one wants
to have.
(I've maybe 20 applications for HTMLParser - None like
the unrecoverable breaks with Exceptions)
During tolerant mode the virtual .warning(message,i,k)
is called instead of error - by default this just
counts .warning_count up. This framework should even
enable to write po HTML checkers

* The patch was generated against py2.3 (still the
"good/base" Python for me) and also fixes a regexp-bug
(which already was fixed in py2.4.2). Yet the patch
works also against py2.4/2.5 - 2 locations where py24
trivially changed to %r/repr may grumble.


-robert


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1486713&group_id=5470

From noreply at sourceforge.net  Fri May 12 01:47:00 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Thu, 11 May 2006 16:47:00 -0700
Subject: [Patches] [ python-Patches-1486962 ] Patches and enhancements to
	turtle.py
Message-ID: <E1FeKs0-0007Bx-JZ@sc8-sf-web4-b.sourceforge.net>

Patches item #1486962, was opened at 2006-05-11 18:47
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1486962&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Tkinter
Group: Python 2.4
Status: Open
Resolution: None
Priority: 5
Submitted By: Vern Ceder (vceder)
Assigned to: Martin v. L??wis (loewis)
Summary: Patches and enhancements to turtle.py

Initial Comment:
Several bugfixes and enhancements (from several
teachers who use Python in secondary and post-secondary
classes) to improve usability in the classroom:

 * docstrings added to methods (Toby Donaldson)

 * added methods to control speed, window geometry and
  window title. (Vern Ceder)

 * added Turtle as alias for Pen - students can now
create Turtle objects (Toby Donaldson)

 * default window now larger and centered (Vern Ceder)

 * added done() function to start main event loop after
drawing (handy when running programs in IDLE) (Vern
Ceder/Chris Smith)

 * fixed bug where filled polygons are lowered (Atanas
Radenski)

 * fixed bug in circle() method to use self._fullcircle
/ 4.0 instead of 90.0 to determine start (Chris Smith)

 * removed several redundant assignments (Chris Smith)

 * added second demo which uses new features (Gregor Lindl)


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1486962&group_id=5470

From noreply at sourceforge.net  Fri May 12 03:59:31 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Thu, 11 May 2006 18:59:31 -0700
Subject: [Patches] [ python-Patches-1473132 ] Improve docs for tp_clear and
	tp_traverse
Message-ID: <E1FeMwF-0005UT-Ib@sc8-sf-web1.sourceforge.net>

Patches item #1473132, was opened at 2006-04-19 13:43
Message generated for change (Comment added) made by tim_one
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1473132&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Documentation
Group: Python 2.5
>Status: Closed
>Resolution: Accepted
Priority: 5
Submitted By: Collin Winter (collinwinter)
Assigned to: Nobody/Anonymous (nobody)
Summary: Improve docs for tp_clear and tp_traverse

Initial Comment:
The attached patch greatly enhances the documentation
for the tp_clear and tp_traverse functions. The patch
is against Doc/api/newtypes.tex, r45562.

----------------------------------------------------------------------

>Comment By: Tim Peters (tim_one)
Date: 2006-05-11 21:59

Message:
Logged In: YES 
user_id=31435

Thanks, Collin!  I applied the patch, beefed up the
explanations, and committed as revision 45970 on the trunk,
affecting files:

Doc/api/newtypes.tex
Misc/ACKS
Misc/NEWS


----------------------------------------------------------------------

Comment By: Thomas Wouters (twouters)
Date: 2006-05-01 05:24

Message:
Logged In: YES 
user_id=34209

As Tim said, there is more to it :) I think this is a fine
start, though. One minor point: the use of Py_CLEAR() can do
with some extra explanation. It obviously isn't enough to
just 'NULL out' members, since that would leak references,
but the docs should also explain that it is in fact
important to set the actual member to NULL *before*
DECREFing the reference, and then point out that the
Py_CLEAR macro is a convenient way of doing that. That kind
of tweak can happen after it's checked in, though
(preferably by someone who can build documentation and see
that the result looks okay ;)


----------------------------------------------------------------------

Comment By: Collin Winter (collinwinter)
Date: 2006-04-30 23:43

Message:
Logged In: YES 
user_id=1344176

I've enhanced the patch per Tim Peters' comment.

----------------------------------------------------------------------

Comment By: Tim Peters (tim_one)
Date: 2006-04-22 23:30

Message:
Logged In: YES 
user_id=31435

I agree the additional info is helpful (thanks!).

Alas, there's more to it, and it's hard to know when to stop
:-(.

For example, an author of a type may _want_ to visit, e.g.,
contained strings in tp_traverse, because they want
gc.get_referents() to return the contained strings
(typically as a debugging aid).

The issues wrt to tp_clear are subtler.  The real
requirement is that the aggregate of all tp_clears called
break all possible cycles.  For one thing, that means
there's no real reason for a tp_clear to touch a member
that's known to be a Python string or integer (since such an
object can't be in a cycle, clearing it can't help to break
a cycle).  It's only tp_dealloc that _must_ drop references
to all containees.

Subtler is that a gc'ed container type may choose not to
implement tp_clear at all.  If you look, you'll see that
Python's tuple type in fact leaves its tp_clear slot empty.
 This isn't a problem because it's impossible to have a
cycle composed _solely_ of tuples (that may not be obvious,
but it's true -- it derives from that tuples are immutable).
 Any cycle a tuple may be in will be broken if the non-tuple
objects in the cycle clear their containees, so there's no
actually need for tuples to have a tp_clear.

The possibility should be mentioned, although it's fine to
recommend playing it safe.  Indeed, I don't think it buys
anything worth having for tuples not to have an obvious
tp_clear implementation.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1473132&group_id=5470

From noreply at sourceforge.net  Sat May 13 20:27:04 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Sat, 13 May 2006 11:27:04 -0700
Subject: [Patches] [ python-Patches-1481304 ] Cleaned up 16x16px icons for
	windows.
Message-ID: <E1FeypU-0001zf-KL@sc8-sf-web2.sourceforge.net>

Patches item #1481304, was opened at 2006-05-03 11:46
Message generated for change (Comment added) made by josiahcarlson
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1481304&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Windows
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: goxe (goxe)
Assigned to: Nobody/Anonymous (nobody)
Summary: Cleaned up 16x16px icons for windows.

Initial Comment:

Since the currently distributed icon files only 
include 32x32px images, Windows resizes them where 
16x16px is needed. With the predictable result that 
they look blurred and dark.

The attached icons include 16x16px versions of the 
current icons. It's the same friendly-snake-icon as 
always, just prettier in small sizes.


----------------------------------------------------------------------

Comment By: Josiah Carlson (josiahcarlson)
Date: 2006-05-13 11:27

Message:
Logged In: YES 
user_id=341410

They are lighter in color, though I would prefer if Python
on Windows used the smallest versions of the Mac icons
(preview available here:
http://www.doxdesk.com/img/software/py/icons.png ).

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1481304&group_id=5470

From noreply at sourceforge.net  Sat May 13 21:40:05 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Sat, 13 May 2006 12:40:05 -0700
Subject: [Patches] [ python-Patches-1481304 ] Cleaned up 16x16px icons for
	windows.
Message-ID: <E1Fezy9-0002N5-34@sc8-sf-web1.sourceforge.net>

Patches item #1481304, was opened at 2006-05-03 13:46
Message generated for change (Comment added) made by montanaro
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1481304&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Windows
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: goxe (goxe)
Assigned to: Nobody/Anonymous (nobody)
Summary: Cleaned up 16x16px icons for windows.

Initial Comment:

Since the currently distributed icon files only 
include 32x32px images, Windows resizes them where 
16x16px is needed. With the predictable result that 
they look blurred and dark.

The attached icons include 16x16px versions of the 
current icons. It's the same friendly-snake-icon as 
always, just prettier in small sizes.


----------------------------------------------------------------------

>Comment By: Skip Montanaro (montanaro)
Date: 2006-05-13 14:40

Message:
Logged In: YES 
user_id=44345

I agree with Josiah.  I'd like the various icons to be the same across platforms.

----------------------------------------------------------------------

Comment By: Josiah Carlson (josiahcarlson)
Date: 2006-05-13 13:27

Message:
Logged In: YES 
user_id=341410

They are lighter in color, though I would prefer if Python
on Windows used the smallest versions of the Mac icons
(preview available here:
http://www.doxdesk.com/img/software/py/icons.png ).

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1481304&group_id=5470

From noreply at sourceforge.net  Sat May 13 23:13:56 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Sat, 13 May 2006 14:13:56 -0700
Subject: [Patches] [ python-Patches-1488098 ] MacOSX: distutils support for
	-arch and -isysroot flags
Message-ID: <E1Ff1Qy-0008DA-8R@sc8-sf-web1.sourceforge.net>

Patches item #1488098, was opened at 2006-05-13 23:13
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1488098&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Distutils and setup.py
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Ronald Oussoren (ronaldoussoren)
Assigned to: Nobody/Anonymous (nobody)
Summary: MacOSX: distutils support for -arch and -isysroot flags

Initial Comment:
This flag adds specific support for the -arch and -isysroot flags of GCC 
on MacOSX 10.4 or later.

The patch consists of two parts:

1) Remove these flags (and their arguments) from the base CFLAGS/
LDFLAGS when compiling extensions on OSX 10.3 or earlier because GCC 
doesn't support those arguments in the version of GCC that is shipped 
what the version of the OS.

2) Strip -arch and -isysroot (again including their arguments) from the 
base CFLAGS/LDFLAGS when the user has specified new values for them 
in the extra_compile_args and extra_link args.

The second part is needed because -isysroot can only be specified once 
and the -arch option is incremental, without this patch you cannot 
compile using a different SDK or for fewer architectures.

A reason for wanting to do the latter is software like psyco that is only 
fully supported on one of the architectures for OSX.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1488098&group_id=5470

From noreply at sourceforge.net  Sun May 14 15:33:13 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Sun, 14 May 2006 06:33:13 -0700
Subject: [Patches] [ python-Patches-1488312 ] Memory alignment fix on SPARC
Message-ID: <E1FfGif-0002Ka-87@sc8-sf-web3.sourceforge.net>

Patches item #1488312, was opened at 2006-05-14 15:33
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1488312&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.4
Status: Open
Resolution: None
Priority: 5
Submitted By: Jan Palus (atler_)
Assigned to: Nobody/Anonymous (nobody)
Summary: Memory alignment fix on SPARC

Initial Comment:
test_codecscallback fails on sparc becasue of line (in
Objects/unicodeobject.c):

*p = *(Py_UNICODE *)s;

which may break memory alignment rules. Attached patch
fixes the problem by using memcpy instead.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1488312&group_id=5470

From noreply at sourceforge.net  Mon May 15 09:22:36 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 15 May 2006 00:22:36 -0700
Subject: [Patches] [ python-Patches-1488312 ] Memory alignment fix on SPARC
Message-ID: <E1FfXPY-0004H4-RC@sc8-sf-web4-b.sourceforge.net>

Patches item #1488312, was opened at 2006-05-14 06:33
Message generated for change (Comment added) made by nnorwitz
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1488312&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.4
>Status: Closed
>Resolution: Accepted
Priority: 5
Submitted By: Jan Palus (atler_)
>Assigned to: Neal Norwitz (nnorwitz)
Summary: Memory alignment fix on SPARC

Initial Comment:
test_codecscallback fails on sparc becasue of line (in
Objects/unicodeobject.c):

*p = *(Py_UNICODE *)s;

which may break memory alignment rules. Attached patch
fixes the problem by using memcpy instead.

----------------------------------------------------------------------

>Comment By: Neal Norwitz (nnorwitz)
Date: 2006-05-15 00:22

Message:
Logged In: YES 
user_id=33168

Thanks!

Committed revision 46001.
Committed revision 46002.


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1488312&group_id=5470

From noreply at sourceforge.net  Mon May 15 09:41:41 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 15 May 2006 00:41:41 -0700
Subject: [Patches] [ python-Patches-1488098 ] MacOSX: distutils support for
	-arch and -isysroot flags
Message-ID: <E1FfXi1-0006sJ-Qo@sc8-sf-web2.sourceforge.net>

Patches item #1488098, was opened at 2006-05-13 14:13
Message generated for change (Comment added) made by nnorwitz
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1488098&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Distutils and setup.py
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Ronald Oussoren (ronaldoussoren)
Assigned to: Nobody/Anonymous (nobody)
Summary: MacOSX: distutils support for -arch and -isysroot flags

Initial Comment:
This flag adds specific support for the -arch and -isysroot flags of GCC 
on MacOSX 10.4 or later.

The patch consists of two parts:

1) Remove these flags (and their arguments) from the base CFLAGS/
LDFLAGS when compiling extensions on OSX 10.3 or earlier because GCC 
doesn't support those arguments in the version of GCC that is shipped 
what the version of the OS.

2) Strip -arch and -isysroot (again including their arguments) from the 
base CFLAGS/LDFLAGS when the user has specified new values for them 
in the extra_compile_args and extra_link args.

The second part is needed because -isysroot can only be specified once 
and the -arch option is incremental, without this patch you cannot 
compile using a different SDK or for fewer architectures.

A reason for wanting to do the latter is software like psyco that is only 
fully supported on one of the architectures for OSX.

----------------------------------------------------------------------

>Comment By: Neal Norwitz (nnorwitz)
Date: 2006-05-15 00:41

Message:
Logged In: YES 
user_id=33168

I don't see any obvious problems with the patch.  I have
some nits though:

 * This is pretty complex: int(os.uname()[2].split('.')[0])
   I would prefer if it was broken up and use local
variables to explain better what's going on (or at least a
comment that shows the expected format).
  - same with '.'.join(m.group(1).split('.')[:2])

 * Remove double blank lines at first line of patch in
util.py and the last 3 lines (the pass is not needed).

 * unixcompiler.py, use True/False instead of 1/0.  I forget
what the compatibility of distutils is, but I see other uses
of True and False

   - same comment about getting the kernel with a complex expr

   - I prefer index instead of idx (I don't like abbrevs,
particularly for foreign speakers)

Instead of: 
+        if '-arch' in cc_args:
+            stripArch = 1

just set it:  stripArch = '-arch' in cc_args

Same for stripSysroot

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1488098&group_id=5470

From noreply at sourceforge.net  Mon May 15 16:18:25 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 15 May 2006 07:18:25 -0700
Subject: [Patches] [ python-Patches-1488881 ] tarfile.py: support for
	file-objects and bz2 (cp. #1488634)
Message-ID: <E1Ffdtx-0002yi-DK@sc8-sf-web2.sourceforge.net>

Patches item #1488881, was opened at 2006-05-15 16:18
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1488881&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Lars Gust?bel (gustaebel)
Assigned to: Nobody/Anonymous (nobody)
Summary: tarfile.py: support for file-objects and bz2 (cp. #1488634)

Initial Comment:
This patch adds support for file(-like) objects and
bzip2 compression to tarfile.py. It works around the
limitation of the bz2 module that you cannot create a
BZ2File object from a file or file-like object but from
a filename only.  
Bug #1488634 reminded me that I had this workaround in
my development version of tarfile.py since last year. I
think it would generally be a good addition for
stdlib's tarfile.py, and would solve the OP's problem
as a side-effect.

The patch adds a class _BZ2Proxy to Lib/tarfile.py and
adds tests for this feature to Lib/test/test_tarfile.py.


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1488881&group_id=5470

From noreply at sourceforge.net  Mon May 15 21:31:08 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 15 May 2006 12:31:08 -0700
Subject: [Patches] [ python-Patches-1488881 ] tarfile.py: support for
	file-objects and bz2 (cp. #1488634)
Message-ID: <E1Ffima-0002qn-0L@sc8-sf-web1.sourceforge.net>

Patches item #1488881, was opened at 2006-05-15 14:18
Message generated for change (Comment added) made by gbrandl
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1488881&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.5
>Status: Closed
>Resolution: Accepted
Priority: 5
Submitted By: Lars Gust?bel (gustaebel)
Assigned to: Nobody/Anonymous (nobody)
Summary: tarfile.py: support for file-objects and bz2 (cp. #1488634)

Initial Comment:
This patch adds support for file(-like) objects and
bzip2 compression to tarfile.py. It works around the
limitation of the bz2 module that you cannot create a
BZ2File object from a file or file-like object but from
a filename only.  
Bug #1488634 reminded me that I had this workaround in
my development version of tarfile.py since last year. I
think it would generally be a good addition for
stdlib's tarfile.py, and would solve the OP's problem
as a side-effect.

The patch adds a class _BZ2Proxy to Lib/tarfile.py and
adds tests for this feature to Lib/test/test_tarfile.py.


----------------------------------------------------------------------

>Comment By: Georg Brandl (gbrandl)
Date: 2006-05-15 19:31

Message:
Logged In: YES 
user_id=849994

Committed in rev. 46005.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1488881&group_id=5470

From noreply at sourceforge.net  Tue May 16 09:38:46 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Tue, 16 May 2006 00:38:46 -0700
Subject: [Patches] [ python-Patches-1435422 ] Add copy() method to zlib's
	compress and decompress objects
Message-ID: <E1Ffu8k-0007HX-Jg@sc8-sf-web3.sourceforge.net>

Patches item #1435422, was opened at 2006-02-20 20:17
Message generated for change (Comment added) made by gbrandl
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1435422&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Modules
Group: Python 2.5
>Status: Closed
>Resolution: Accepted
Priority: 5
Submitted By: Chris AtLee (catlee)
Assigned to: Nobody/Anonymous (nobody)
Summary: Add copy() method to zlib's compress and decompress objects

Initial Comment:
The attached patch adds a copy() method to zlib's
compressobj and decompressobj.  Copying a
(de)compression object allows a developer to store the
state of the (de)compressor at a certain point of the
input stream in order to more efficiently compress data
sharing some identical header, or to more efficiently
seek inside compressed data.

Doc/lib/libzlib.tex is updated with descriptions for
the new methods.

Lib/test/test_zlib.py is updated to test the new
functionality.

The patch is against revision 42524 in
http://svn.python.org/projects/python/trunk


----------------------------------------------------------------------

>Comment By: Georg Brandl (gbrandl)
Date: 2006-05-16 07:38

Message:
Logged In: YES 
user_id=849994

Corrected the patch and applied in rev. 46012.

----------------------------------------------------------------------

Comment By: Chris AtLee (catlee)
Date: 2006-03-27 21:46

Message:
Logged In: YES 
user_id=186532

Patch for the unflush() docs is uploaded as #1459631

----------------------------------------------------------------------

Comment By: Neal Norwitz (nnorwitz)
Date: 2006-03-27 18:56

Message:
Logged In: YES 
user_id=33168

Yes, please fix any docs you find lacking.  Also, please
create a new patch.  Thanks!

----------------------------------------------------------------------

Comment By: Chris AtLee (catlee)
Date: 2006-03-27 18:52

Message:
Logged In: YES 
user_id=186532

New patch attached with the mentioned changes made.

I noticed that PyZlib_unflush() takes an argument, but that
its use is not documented.  Should the docs be updated to
explain what that argument is for?

----------------------------------------------------------------------

Comment By: Neal Norwitz (nnorwitz)
Date: 2006-03-24 06:02

Message:
Logged In: YES 
user_id=33168

You need to check the return result of newcompobject(). 
This would crash if it returns NULL.

You also need to change METH_VARARGS to METH_NOARGS since
these methods don't take any arguments.

The doc should contain \versionadded{2.5} before the end
markers for new methods.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1435422&group_id=5470

From noreply at sourceforge.net  Tue May 16 09:39:44 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Tue, 16 May 2006 00:39:44 -0700
Subject: [Patches] [ python-Patches-1442927 ] PyLong_FromString optimization
Message-ID: <E1Ffu9g-0007fx-3U@sc8-sf-web3.sourceforge.net>

Patches item #1442927, was opened at 2006-03-04 06:21
Message generated for change (Settings changed) made by gbrandl
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1442927&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Alan McIntyre (alanmcintyre)
>Assigned to: Tim Peters (tim_one)
Summary: PyLong_FromString optimization

Initial Comment:
The current implementation of PyLong_FromString in
Python 2.5 uses muladd1 to add each digit of the input
string into the final number.  Because muladd1 creates
a new long to hold the result on every call, an
intermediate long object is created/destroyed for each
digit in the input string.  

This patch improves on the current implementation of
PyLong_FromString in 3 main ways:

1. Creates and manipulates (in-place) a single long
object to hold the result, skipping the creation of all
those intermediate long objects.

2. Multiple digits from the input string are
consolidated into a single long digit before adding
them into the long integer object.  This greatly
reduces the number of "multiply/add" cycles required to
push all the digits into the long object.

3. Three chunks of code like "if (ch <= '9') k = ch -
'0'" in longobject.c are replaced by a digit value
lookup vector.  I'm not irreversibly stuck on this
idea; it doesn't measurably add to performance, but it
just seems (to me, anyway) to make the code in
long_from_binary_base and PyLong_FromString a little
less cluttered.  This is the same lookup table from
patch 1335972 (an optimization for int()).  I expect if
both patches get accepted it would be best to make them
both reference a single instance of this table; if it
looks like that's what will happen I'll tweak one or
both patches as necessary.


My cheezy test results (included in the attached file
in an OpenOffice spreadsheet) show that the patch makes
long() about 50% faster than the existing
implementation for decimal input strings of about 10
characters.   Longer input strings show even better
performance improvement, leveling off around 3x faster
for very long strings.

This patch passes regression tests on my machine
(WinXP, Visual C++ .net Standard 2003).  I plan to try
out the tests on my Linux box this weekend just to make
sure the performance boost still remains when Python
gets compiled by a C compiler that isn't neutered
(standard .net 2003 doesn't appear to allow any
optimizations).

The test and test data generation scripts I used for
this performance comparison are included in the
attached zip file. 

At the moment I don't have any added tests; if somebody
can suggest some things that ought to be tested I'll
gladly write some tests.


----------------------------------------------------------------------

Comment By: Alan McIntyre (alanmcintyre)
Date: 2006-03-07 03:05

Message:
Logged In: YES 
user_id=1115903

Version #3 is attached; it has an across-the-board
improvement of ~10% over version 2.  The performance hit for
calling long() on 9-digit numbers is now only about -10%,
breakeven happens somewhere around 11 digits, and the best
performance is about +282% in the vicinity of 1000 digits.

Sorry to keep commenting on my own patch. :)  I think I'm
done now.

----------------------------------------------------------------------

Comment By: Alan McIntyre (alanmcintyre)
Date: 2006-03-06 22:33

Message:
Logged In: YES 
user_id=1115903

Version #2 is attached.  I made a couple of tweaks and
tested the patch out on Linux just to make sure the
performace is still as good with compiler optimizations. 
For short numbers (numbers that would fit into an int),
long() is 10-30% *slower* than before applying the patch. 
For longer numbers, long() is up to 249% faster, with the
peak occurring around 1000 digits.

If the negative performance impact for int-sized digits is
unacceptable, I will see if I can do something about it. 
However, one always has the option of using int() on very
long strings anyway, and it will automatically fall through
to PyLong_FromString if the number is too long.  The
performance impact on int() for small numbers is so small as
to be negligible (<5%), which is to be expected since the
modified code isn't called when using int() on input strings
< 2**32. 

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1442927&group_id=5470

From noreply at sourceforge.net  Tue May 16 09:40:02 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Tue, 16 May 2006 00:40:02 -0700
Subject: [Patches] [ python-Patches-1442927 ] PyLong_FromString optimization
Message-ID: <E1Ffu9y-0007nL-NL@sc8-sf-web3.sourceforge.net>

Patches item #1442927, was opened at 2006-03-04 06:21
Message generated for change (Comment added) made by gbrandl
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1442927&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Alan McIntyre (alanmcintyre)
Assigned to: Tim Peters (tim_one)
Summary: PyLong_FromString optimization

Initial Comment:
The current implementation of PyLong_FromString in
Python 2.5 uses muladd1 to add each digit of the input
string into the final number.  Because muladd1 creates
a new long to hold the result on every call, an
intermediate long object is created/destroyed for each
digit in the input string.  

This patch improves on the current implementation of
PyLong_FromString in 3 main ways:

1. Creates and manipulates (in-place) a single long
object to hold the result, skipping the creation of all
those intermediate long objects.

2. Multiple digits from the input string are
consolidated into a single long digit before adding
them into the long integer object.  This greatly
reduces the number of "multiply/add" cycles required to
push all the digits into the long object.

3. Three chunks of code like "if (ch <= '9') k = ch -
'0'" in longobject.c are replaced by a digit value
lookup vector.  I'm not irreversibly stuck on this
idea; it doesn't measurably add to performance, but it
just seems (to me, anyway) to make the code in
long_from_binary_base and PyLong_FromString a little
less cluttered.  This is the same lookup table from
patch 1335972 (an optimization for int()).  I expect if
both patches get accepted it would be best to make them
both reference a single instance of this table; if it
looks like that's what will happen I'll tweak one or
both patches as necessary.


My cheezy test results (included in the attached file
in an OpenOffice spreadsheet) show that the patch makes
long() about 50% faster than the existing
implementation for decimal input strings of about 10
characters.   Longer input strings show even better
performance improvement, leveling off around 3x faster
for very long strings.

This patch passes regression tests on my machine
(WinXP, Visual C++ .net Standard 2003).  I plan to try
out the tests on my Linux box this weekend just to make
sure the performance boost still remains when Python
gets compiled by a C compiler that isn't neutered
(standard .net 2003 doesn't appear to allow any
optimizations).

The test and test data generation scripts I used for
this performance comparison are included in the
attached zip file. 

At the moment I don't have any added tests; if somebody
can suggest some things that ought to be tested I'll
gladly write some tests.


----------------------------------------------------------------------

>Comment By: Georg Brandl (gbrandl)
Date: 2006-05-16 07:40

Message:
Logged In: YES 
user_id=849994

Assigned to Tim. Perhaps something for Iceland?

----------------------------------------------------------------------

Comment By: Alan McIntyre (alanmcintyre)
Date: 2006-03-07 03:05

Message:
Logged In: YES 
user_id=1115903

Version #3 is attached; it has an across-the-board
improvement of ~10% over version 2.  The performance hit for
calling long() on 9-digit numbers is now only about -10%,
breakeven happens somewhere around 11 digits, and the best
performance is about +282% in the vicinity of 1000 digits.

Sorry to keep commenting on my own patch. :)  I think I'm
done now.

----------------------------------------------------------------------

Comment By: Alan McIntyre (alanmcintyre)
Date: 2006-03-06 22:33

Message:
Logged In: YES 
user_id=1115903

Version #2 is attached.  I made a couple of tweaks and
tested the patch out on Linux just to make sure the
performace is still as good with compiler optimizations. 
For short numbers (numbers that would fit into an int),
long() is 10-30% *slower* than before applying the patch. 
For longer numbers, long() is up to 249% faster, with the
peak occurring around 1000 digits.

If the negative performance impact for int-sized digits is
unacceptable, I will see if I can do something about it. 
However, one always has the option of using int() on very
long strings anyway, and it will automatically fall through
to PyLong_FromString if the number is too long.  The
performance impact on int() for small numbers is so small as
to be negligible (<5%), which is to be expected since the
modified code isn't called when using int() on input strings
< 2**32. 

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1442927&group_id=5470

From noreply at sourceforge.net  Tue May 16 09:40:40 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Tue, 16 May 2006 00:40:40 -0700
Subject: [Patches] [ python-Patches-1446489 ] zipfile: support for ZIP64
Message-ID: <E1FfuAa-00082P-OE@sc8-sf-web3.sourceforge.net>

Patches item #1446489, was opened at 2006-03-09 14:58
Message generated for change (Settings changed) made by gbrandl
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1446489&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Ronald Oussoren (ronaldoussoren)
>Assigned to: Ronald Oussoren (ronaldoussoren)
Summary: zipfile: support for ZIP64

Initial Comment:
The attached patch implements support for ZIP64, that is zipfiles 
containing very large (>4GByte) files and zipfiles that are larger than
4GByte themselves. 

The output of this patch can be read by pkzip (see below for the actual 
version I used for testing).


----------------------------------------------------------------------

Comment By: Ronald Oussoren (ronaldoussoren)
Date: 2006-04-02 19:13

Message:
Logged In: YES 
user_id=580910

The "don't use the ZIP64 extension" flag is a good idea, zipfiles that use this 
extension aren't readable by the infozip tools (zip and unzip on most unix 
systems).

I'll add tests and documentation in the near future.

The version of zipfile that I'm currently using also contains a patch for 
speeding up the opening of zipfiles, for the type of files I'm dealing with 
(about 11GByte large with tens of thousands of files) the speedup is very 
significant. I suppose it's better to file that as a separate patch after this has 
been approved.

----------------------------------------------------------------------

Comment By: Anthony Baxter (anthonybaxter)
Date: 2006-04-02 05:02

Message:
Logged In: YES 
user_id=29957

I'd like to see a testcase and possibly a note for the
documentation about the new semantics. Also, should it be
possible to say "don't use the ZIP64 extension, instead
raise an Error" for people who don't want to generate these?
 

----------------------------------------------------------------------

Comment By: Ronald Oussoren (ronaldoussoren)
Date: 2006-03-09 15:28

Message:
Logged In: YES 
user_id=580910

Oops, I've uploaded the wrong file. zipfile-zip64.patch is the correct one.

I've tested the correctness of created archives using this version of pkzip:

pkzipc -version
PKZIP(R) Server  Version 8  ZIP Compression Utility for Linux X86
Copyright (C) 1989-2005 PKWARE, Inc.  All Rights Reserved. Evaluation 
Version
PKZIP Reg. U.S. Pat. and Tm. Off.  Patent No. 5,051,745
Patent Pending

Version 8.40.66


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1446489&group_id=5470

From noreply at sourceforge.net  Tue May 16 09:41:24 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Tue, 16 May 2006 00:41:24 -0700
Subject: [Patches] [ python-Patches-1446489 ] zipfile: support for ZIP64
Message-ID: <E1FfuBI-0008I0-6u@sc8-sf-web3.sourceforge.net>

Patches item #1446489, was opened at 2006-03-09 14:58
Message generated for change (Comment added) made by gbrandl
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1446489&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Ronald Oussoren (ronaldoussoren)
Assigned to: Ronald Oussoren (ronaldoussoren)
Summary: zipfile: support for ZIP64

Initial Comment:
The attached patch implements support for ZIP64, that is zipfiles 
containing very large (>4GByte) files and zipfiles that are larger than
4GByte themselves. 

The output of this patch can be read by pkzip (see below for the actual 
version I used for testing).


----------------------------------------------------------------------

>Comment By: Georg Brandl (gbrandl)
Date: 2006-05-16 07:41

Message:
Logged In: YES 
user_id=849994

Since 2.5 beta is coming close, have you made progress on
the tests/docs?

----------------------------------------------------------------------

Comment By: Ronald Oussoren (ronaldoussoren)
Date: 2006-04-02 19:13

Message:
Logged In: YES 
user_id=580910

The "don't use the ZIP64 extension" flag is a good idea, zipfiles that use this 
extension aren't readable by the infozip tools (zip and unzip on most unix 
systems).

I'll add tests and documentation in the near future.

The version of zipfile that I'm currently using also contains a patch for 
speeding up the opening of zipfiles, for the type of files I'm dealing with 
(about 11GByte large with tens of thousands of files) the speedup is very 
significant. I suppose it's better to file that as a separate patch after this has 
been approved.

----------------------------------------------------------------------

Comment By: Anthony Baxter (anthonybaxter)
Date: 2006-04-02 05:02

Message:
Logged In: YES 
user_id=29957

I'd like to see a testcase and possibly a note for the
documentation about the new semantics. Also, should it be
possible to say "don't use the ZIP64 extension, instead
raise an Error" for people who don't want to generate these?
 

----------------------------------------------------------------------

Comment By: Ronald Oussoren (ronaldoussoren)
Date: 2006-03-09 15:28

Message:
Logged In: YES 
user_id=580910

Oops, I've uploaded the wrong file. zipfile-zip64.patch is the correct one.

I've tested the correctness of created archives using this version of pkzip:

pkzipc -version
PKZIP(R) Server  Version 8  ZIP Compression Utility for Linux X86
Copyright (C) 1989-2005 PKWARE, Inc.  All Rights Reserved. Evaluation 
Version
PKZIP Reg. U.S. Pat. and Tm. Off.  Patent No. 5,051,745
Patent Pending

Version 8.40.66


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1446489&group_id=5470

From noreply at sourceforge.net  Tue May 16 09:44:08 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Tue, 16 May 2006 00:44:08 -0700
Subject: [Patches] [ python-Patches-1442927 ] PyLong_FromString optimization
Message-ID: <E1FfuDw-0000rV-7P@sc8-sf-web3.sourceforge.net>

Patches item #1442927, was opened at 2006-03-04 01:21
Message generated for change (Comment added) made by tim_one
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1442927&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Alan McIntyre (alanmcintyre)
Assigned to: Tim Peters (tim_one)
Summary: PyLong_FromString optimization

Initial Comment:
The current implementation of PyLong_FromString in
Python 2.5 uses muladd1 to add each digit of the input
string into the final number.  Because muladd1 creates
a new long to hold the result on every call, an
intermediate long object is created/destroyed for each
digit in the input string.  

This patch improves on the current implementation of
PyLong_FromString in 3 main ways:

1. Creates and manipulates (in-place) a single long
object to hold the result, skipping the creation of all
those intermediate long objects.

2. Multiple digits from the input string are
consolidated into a single long digit before adding
them into the long integer object.  This greatly
reduces the number of "multiply/add" cycles required to
push all the digits into the long object.

3. Three chunks of code like "if (ch <= '9') k = ch -
'0'" in longobject.c are replaced by a digit value
lookup vector.  I'm not irreversibly stuck on this
idea; it doesn't measurably add to performance, but it
just seems (to me, anyway) to make the code in
long_from_binary_base and PyLong_FromString a little
less cluttered.  This is the same lookup table from
patch 1335972 (an optimization for int()).  I expect if
both patches get accepted it would be best to make them
both reference a single instance of this table; if it
looks like that's what will happen I'll tweak one or
both patches as necessary.


My cheezy test results (included in the attached file
in an OpenOffice spreadsheet) show that the patch makes
long() about 50% faster than the existing
implementation for decimal input strings of about 10
characters.   Longer input strings show even better
performance improvement, leveling off around 3x faster
for very long strings.

This patch passes regression tests on my machine
(WinXP, Visual C++ .net Standard 2003).  I plan to try
out the tests on my Linux box this weekend just to make
sure the performance boost still remains when Python
gets compiled by a C compiler that isn't neutered
(standard .net 2003 doesn't appear to allow any
optimizations).

The test and test data generation scripts I used for
this performance comparison are included in the
attached zip file. 

At the moment I don't have any added tests; if somebody
can suggest some things that ought to be tested I'll
gladly write some tests.


----------------------------------------------------------------------

>Comment By: Tim Peters (tim_one)
Date: 2006-05-16 03:44

Message:
Logged In: YES 
user_id=31435

Thanks for reminding me, Georg!  This is a good possiblity
for the Iceland sprint.

----------------------------------------------------------------------

Comment By: Georg Brandl (gbrandl)
Date: 2006-05-16 03:40

Message:
Logged In: YES 
user_id=849994

Assigned to Tim. Perhaps something for Iceland?

----------------------------------------------------------------------

Comment By: Alan McIntyre (alanmcintyre)
Date: 2006-03-06 22:05

Message:
Logged In: YES 
user_id=1115903

Version #3 is attached; it has an across-the-board
improvement of ~10% over version 2.  The performance hit for
calling long() on 9-digit numbers is now only about -10%,
breakeven happens somewhere around 11 digits, and the best
performance is about +282% in the vicinity of 1000 digits.

Sorry to keep commenting on my own patch. :)  I think I'm
done now.

----------------------------------------------------------------------

Comment By: Alan McIntyre (alanmcintyre)
Date: 2006-03-06 17:33

Message:
Logged In: YES 
user_id=1115903

Version #2 is attached.  I made a couple of tweaks and
tested the patch out on Linux just to make sure the
performace is still as good with compiler optimizations. 
For short numbers (numbers that would fit into an int),
long() is 10-30% *slower* than before applying the patch. 
For longer numbers, long() is up to 249% faster, with the
peak occurring around 1000 digits.

If the negative performance impact for int-sized digits is
unacceptable, I will see if I can do something about it. 
However, one always has the option of using int() on very
long strings anyway, and it will automatically fall through
to PyLong_FromString if the number is too long.  The
performance impact on int() for small numbers is so small as
to be negligible (<5%), which is to be expected since the
modified code isn't called when using int() on input strings
< 2**32. 

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1442927&group_id=5470

From noreply at sourceforge.net  Tue May 16 09:46:16 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Tue, 16 May 2006 00:46:16 -0700
Subject: [Patches] [ python-Patches-1442927 ] PyLong_FromString optimization
Message-ID: <E1FfuG0-0001fu-CN@sc8-sf-web3.sourceforge.net>

Patches item #1442927, was opened at 2006-03-04 06:21
Message generated for change (Comment added) made by gbrandl
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1442927&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Alan McIntyre (alanmcintyre)
Assigned to: Tim Peters (tim_one)
Summary: PyLong_FromString optimization

Initial Comment:
The current implementation of PyLong_FromString in
Python 2.5 uses muladd1 to add each digit of the input
string into the final number.  Because muladd1 creates
a new long to hold the result on every call, an
intermediate long object is created/destroyed for each
digit in the input string.  

This patch improves on the current implementation of
PyLong_FromString in 3 main ways:

1. Creates and manipulates (in-place) a single long
object to hold the result, skipping the creation of all
those intermediate long objects.

2. Multiple digits from the input string are
consolidated into a single long digit before adding
them into the long integer object.  This greatly
reduces the number of "multiply/add" cycles required to
push all the digits into the long object.

3. Three chunks of code like "if (ch <= '9') k = ch -
'0'" in longobject.c are replaced by a digit value
lookup vector.  I'm not irreversibly stuck on this
idea; it doesn't measurably add to performance, but it
just seems (to me, anyway) to make the code in
long_from_binary_base and PyLong_FromString a little
less cluttered.  This is the same lookup table from
patch 1335972 (an optimization for int()).  I expect if
both patches get accepted it would be best to make them
both reference a single instance of this table; if it
looks like that's what will happen I'll tweak one or
both patches as necessary.


My cheezy test results (included in the attached file
in an OpenOffice spreadsheet) show that the patch makes
long() about 50% faster than the existing
implementation for decimal input strings of about 10
characters.   Longer input strings show even better
performance improvement, leveling off around 3x faster
for very long strings.

This patch passes regression tests on my machine
(WinXP, Visual C++ .net Standard 2003).  I plan to try
out the tests on my Linux box this weekend just to make
sure the performance boost still remains when Python
gets compiled by a C compiler that isn't neutered
(standard .net 2003 doesn't appear to allow any
optimizations).

The test and test data generation scripts I used for
this performance comparison are included in the
attached zip file. 

At the moment I don't have any added tests; if somebody
can suggest some things that ought to be tested I'll
gladly write some tests.


----------------------------------------------------------------------

>Comment By: Georg Brandl (gbrandl)
Date: 2006-05-16 07:46

Message:
Logged In: YES 
user_id=849994

If someone ;) created a new tracker category, I'd go through
the patches and flag all I can find for the sprint.

----------------------------------------------------------------------

Comment By: Tim Peters (tim_one)
Date: 2006-05-16 07:44

Message:
Logged In: YES 
user_id=31435

Thanks for reminding me, Georg!  This is a good possiblity
for the Iceland sprint.

----------------------------------------------------------------------

Comment By: Georg Brandl (gbrandl)
Date: 2006-05-16 07:40

Message:
Logged In: YES 
user_id=849994

Assigned to Tim. Perhaps something for Iceland?

----------------------------------------------------------------------

Comment By: Alan McIntyre (alanmcintyre)
Date: 2006-03-07 03:05

Message:
Logged In: YES 
user_id=1115903

Version #3 is attached; it has an across-the-board
improvement of ~10% over version 2.  The performance hit for
calling long() on 9-digit numbers is now only about -10%,
breakeven happens somewhere around 11 digits, and the best
performance is about +282% in the vicinity of 1000 digits.

Sorry to keep commenting on my own patch. :)  I think I'm
done now.

----------------------------------------------------------------------

Comment By: Alan McIntyre (alanmcintyre)
Date: 2006-03-06 22:33

Message:
Logged In: YES 
user_id=1115903

Version #2 is attached.  I made a couple of tweaks and
tested the patch out on Linux just to make sure the
performace is still as good with compiler optimizations. 
For short numbers (numbers that would fit into an int),
long() is 10-30% *slower* than before applying the patch. 
For longer numbers, long() is up to 249% faster, with the
peak occurring around 1000 digits.

If the negative performance impact for int-sized digits is
unacceptable, I will see if I can do something about it. 
However, one always has the option of using int() on very
long strings anyway, and it will automatically fall through
to PyLong_FromString if the number is too long.  The
performance impact on int() for small numbers is so small as
to be negligible (<5%), which is to be expected since the
modified code isn't called when using int() on input strings
< 2**32. 

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1442927&group_id=5470

From noreply at sourceforge.net  Tue May 16 09:55:25 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Tue, 16 May 2006 00:55:25 -0700
Subject: [Patches] [ python-Patches-1446489 ] zipfile: support for ZIP64
Message-ID: <E1FfuOr-0004yZ-NO@sc8-sf-web3.sourceforge.net>

Patches item #1446489, was opened at 2006-03-09 15:58
Message generated for change (Comment added) made by ronaldoussoren
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1446489&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Ronald Oussoren (ronaldoussoren)
Assigned to: Ronald Oussoren (ronaldoussoren)
Summary: zipfile: support for ZIP64

Initial Comment:
The attached patch implements support for ZIP64, that is zipfiles 
containing very large (>4GByte) files and zipfiles that are larger than
4GByte themselves. 

The output of this patch can be read by pkzip (see below for the actual 
version I used for testing).


----------------------------------------------------------------------

>Comment By: Ronald Oussoren (ronaldoussoren)
Date: 2006-05-16 09:55

Message:
Logged In: YES 
user_id=580910

I haven't had time to work on this, all time I had to work on python related stuff 
has been eaten by finishing PyObjC's port to intel macs and universal binary 
patches.

The former is now done, the latter almost so I'll have some time to work on this 
again especially because I'm using this patch at work and might be able to claim 
some time to work on this during work-hours.

----------------------------------------------------------------------

Comment By: Georg Brandl (gbrandl)
Date: 2006-05-16 09:41

Message:
Logged In: YES 
user_id=849994

Since 2.5 beta is coming close, have you made progress on
the tests/docs?

----------------------------------------------------------------------

Comment By: Ronald Oussoren (ronaldoussoren)
Date: 2006-04-02 21:13

Message:
Logged In: YES 
user_id=580910

The "don't use the ZIP64 extension" flag is a good idea, zipfiles that use this 
extension aren't readable by the infozip tools (zip and unzip on most unix 
systems).

I'll add tests and documentation in the near future.

The version of zipfile that I'm currently using also contains a patch for 
speeding up the opening of zipfiles, for the type of files I'm dealing with 
(about 11GByte large with tens of thousands of files) the speedup is very 
significant. I suppose it's better to file that as a separate patch after this has 
been approved.

----------------------------------------------------------------------

Comment By: Anthony Baxter (anthonybaxter)
Date: 2006-04-02 07:02

Message:
Logged In: YES 
user_id=29957

I'd like to see a testcase and possibly a note for the
documentation about the new semantics. Also, should it be
possible to say "don't use the ZIP64 extension, instead
raise an Error" for people who don't want to generate these?
 

----------------------------------------------------------------------

Comment By: Ronald Oussoren (ronaldoussoren)
Date: 2006-03-09 16:28

Message:
Logged In: YES 
user_id=580910

Oops, I've uploaded the wrong file. zipfile-zip64.patch is the correct one.

I've tested the correctness of created archives using this version of pkzip:

pkzipc -version
PKZIP(R) Server  Version 8  ZIP Compression Utility for Linux X86
Copyright (C) 1989-2005 PKWARE, Inc.  All Rights Reserved. Evaluation 
Version
PKZIP Reg. U.S. Pat. and Tm. Off.  Patent No. 5,051,745
Patent Pending

Version 8.40.66


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1446489&group_id=5470

From noreply at sourceforge.net  Tue May 16 17:25:29 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Tue, 16 May 2006 08:25:29 -0700
Subject: [Patches] [ python-Patches-1457736 ] patch for building trunk with
	VC6
Message-ID: <E1Fg1QP-0008Ko-8W@sc8-sf-web3.sourceforge.net>

Patches item #1457736, was opened at 2006-03-24 08:40
Message generated for change (Settings changed) made by rhettinger
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1457736&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Windows
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Hirokazu Yamamoto (ocean-city)
>Assigned to: Nobody/Anonymous (nobody)
Summary: patch for building trunk with VC6

Initial Comment:
Hello. I tried to build trunk with VC6, but failed.
The reasons are

 - _W64 is not defined on VC6. (PC/pyconfig.h)

 - intptr_t and uintptr_t are not decleared on VC6.
(should use Py_intptr_t and Py_uintptr_t respectively)

I'll submit the patch for these two issues as
"build_trunk_for_vc6.patch".

And more two issues.

 - zlib was make built into pythoncore, but
PC/VC6/pythoncore.dsp is not updated for it yet.

I'll submit the file itself.

 - long long cannot be used on VC6, so 0xFFFFULL is
failed to compile with "invalid suffix" error.

I workarounded this replaced ULL with UI64 (_int64's
suffix) but I don't know how to make the patch. maybe
can this tequnique be used?

  #define Py_ULL(x) x##ULL /* non VC6 */

  #define Py_ULL(x) x##UI64 /* VC6 */

  Py_ULL(0xFFFFFFFFFFFFFFFF) instead of 0xFFF...FULL


----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-05-07 00:40

Message:
Logged In: YES 
user_id=1200846

Oops, I forgot to upload the file.

  - Apply x.patch.

  - Replace pythoncore.dsp and pcbuild.dsw in PC/VC6 with
    attached files.

 - Remove PC/VC6/zlib.dsp


----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-05-07 00:37

Message:
Logged In: YES 
user_id=1200846

Hello. I updated the patch. (Probably this is better)

  - defined ULL() macro locally in Modules/sha512module.c
      maybe it's better to declare Py_ULL or something
      globally, but I don't know how to do it.

 - more patch for zlib builtin (ie: PC/VC6/Readme.txt)

I cannot try this patch on VC7 or later, but
I confirmed lib/test/testall.py passed on VC6.

----------------------------------------------------------------------

Comment By: Luke Dunstan (infidel)
Date: 2006-05-06 13:16

Message:
Logged In: YES 
user_id=30442

Is there anything preventing this patch from being 
applied? It would help me with building the trunk using 
both VC6 and Microsoft eMbedded Visual C++ 4.0 (for 
Windows CE).


----------------------------------------------------------------------

Comment By: Neal Norwitz (nnorwitz)
Date: 2006-03-26 12:02

Message:
Logged In: YES 
user_id=33168

Raymond, maybe this will help get VC6 building?

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1457736&group_id=5470

From noreply at sourceforge.net  Tue May 16 21:09:19 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Tue, 16 May 2006 12:09:19 -0700
Subject: [Patches] [ python-Patches-1489771 ] Updates to syntax rules in
	reference manual
Message-ID: <E1Fg4v1-0003Xr-AY@sc8-sf-web2.sourceforge.net>

Patches item #1489771, was opened at 2006-05-16 21:09
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1489771&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Documentation
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: ?iga Seilnacht (zseil)
Assigned to: Nobody/Anonymous (nobody)
Summary: Updates to syntax rules in reference manual

Initial Comment:
I tried to update the reference manual to the current
Python syntax. Some things are still missing, most
notably the yield expression. Detailed description
of changes below. I can also attach the
generated webpages, if someone is interested.

Expressions
===========

List Displays
-------------
Reordered the rules so that the style is
consistent with the rest of the manual. Separated
listmaker into expression_list and
list_comprehension, for better readability.
Replaced "expression_list" between "for" and "in"
with "target_list". See this thread for details:
http://mail.python.org/pipermail/python-dev/2006-April/064264.html

The only thing missing is old_lambdadef.

Generator Expressions
---------------------
Simmilar as above.

Calls
-----
Fixed the latex syntax (somebody forgot to remove
a line when generators were introduced). Replaced
test with expression. Fixed allowed positions for
commas (func(*args,) is not allowed).

Boolean operations
------------------
Restructured the new conditional expression so
that it is more readable.


Simple Statements
=================

Augmented assignment statements
-------------------------------
Removed comments from "productionlist" macro,
since they broke the generated grammar.txt file.
Removed empty groups that are not needed anymore,
since automatic conversion to guillemets was
disabled. Unfortunately the escaped operator
characters would still need manual fixing in the
grammar.txt file.

The print statement
-------------------
Removed all uses of the "optional" macro and
replaced them with sqare brackets, since it broke
the generated grammar.txt file.

The import statement
--------------------
Replaced all invalid uses of name with identifier.
Added relative import notation to the grammar
section.

Description of relative imports is still needed.

The exec statement
------------------
Corrected a minor mistake, since

exec "a = 1" or "a = 2"

is not valid Python syntax.
Added a (commented out) section about a strange feature
(you can already treat exec as a function) that should
IMHO be included in documentation and its use encouraged
over the current notation.


Compound statements
===================

The with statement
------------------
Added missing macro.

Function definition
-------------------
Cleaned up "parameter_list" so that it is correct
and expresses all the restrictions, but is still
easier to understand (I hope).


Still needed
------------
Yield became an expression in version 2.5, but this
is not documented.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1489771&group_id=5470

From noreply at sourceforge.net  Tue May 16 21:34:58 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Tue, 16 May 2006 12:34:58 -0700
Subject: [Patches] [ python-Patches-1489784 ] Patch for the urllib2 HOWTO
Message-ID: <E1Fg5Jq-00049M-I0@sc8-sf-web4-b.sourceforge.net>

Patches item #1489784, was opened at 2006-05-16 19:34
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1489784&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Documentation
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Mike Foord (mjfoord)
Assigned to: Nobody/Anonymous (nobody)
Summary: Patch for the urllib2 HOWTO

Initial Comment:
This is a minor unified diff for edits of the new
urllib2 HOWTO.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1489784&group_id=5470

From noreply at sourceforge.net  Wed May 17 03:11:50 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Tue, 16 May 2006 18:11:50 -0700
Subject: [Patches] [ python-Patches-1429539 ] pdb: fix for 1326406 (import
	__main__ pdb failure)
Message-ID: <E1FgAZq-0005OO-5o@sc8-sf-web3.sourceforge.net>

Patches item #1429539, was opened at 2006-02-10 19:34
Message generated for change (Comment added) made by isandler
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1429539&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Ilya Sandler (isandler)
Assigned to: Nobody/Anonymous (nobody)
Summary: pdb: fix for  1326406 (import  __main__ pdb failure)

Initial Comment:
The patch allows pdb to debug program which 
import from __main__


----------------------------------------------------------------------

>Comment By: Ilya Sandler (isandler)
Date: 2006-05-16 18:11

Message:
Logged In: YES 
user_id=971153

Another iteration of the patch

Now __main__ namespace is explicitly initialized before the
program (re)starts. The new patch should both support
imports from __main__ AND separate program's and pdb's
namespaces..

As a side effect, __file__ will now be set correctly in the
main program


----------------------------------------------------------------------

Comment By: Ilya Sandler (isandler)
Date: 2006-05-10 20:50

Message:
Logged In: YES 
user_id=971153

I'm attaching an alternative patch: the program stil runs in
__main__ namespace, but pdb gets imported first:

  import pdb
  pdb.main()

So the main program cann't accidentally stomp on pdb
internals (e.g by doing help=None)

(there is still a bit of namespace pollution in the main
program)


----------------------------------------------------------------------

Comment By: Ilya Sandler (isandler)
Date: 2006-04-23 11:10

Message:
Logged In: YES 
user_id=971153

> 1. Could you give some code examples for that?

Do you mean examples of intentional interference with
debugger? Well, you could just traverse the stack and check
whether the program runs under debugger and then do anything
you want... But why do you think intentional interference
would ever be an issue? After all python is not a language
to write debugger-resistant applications ;-) 

Anyway, here are some examples of unintentional interference:

1. If you need a custom version of std module, you can
modify sys.path and then import the module.. Which works by
itself. But if pdb is loaded first and imports the module,
then it does not work...

2. Similar problem with any application which changes
sys.stdout/sys.stdin (there is actually a SF bug for that)

3.  Also I don't see how pdb in its current form can control
any program which needs a full-screen control of the terminal...

4. Any program which tries to do any magic with stack and
assumes that top level stack frame is the main application
will not work under pdb (where top level stack frame is pdb)

---------------------------------------------------
And there is a whole separate bunch of intereference issues
when pdb restarts the program.

---------------------------------------------------

When a program does run in pdb's namespace (as would be the
case if this patch is applied), pdb could save copies of all
module global symbols which it needs and thus become immune
to the accidental overwriting of those symbols in the main
program...

There could be a better way...


----------------------------------------------------------------------

Comment By: Kuba Ko??czyk (jakamkon)
Date: 2006-04-21 08:28

Message:
Logged In: YES 
user_id=1491175

Sorry I forget to login in;)The comment below is from me.

----------------------------------------------------------------------

Comment By: Nobody/Anonymous (nobody)
Date: 2006-04-21 08:25

Message:
Logged In: NO 

1. Could you give some code examples for that?
2,3. Did you notice that google search for "from __main__
import" give hits similar to: 
  t = Timer("test()", "from __main__ import test")
in most situations?
I think it's hard to value uses of "from..." based on google
search or similar method.Maybe we shoud ask on python-list
what are the others opinions?

>As a middle ground it might be a good idea to expand the
>patch to reduce pdb's dependency on module global symbols
I'am interesting how would you do that?
 

----------------------------------------------------------------------

Comment By: Ilya Sandler (isandler)
Date: 2006-04-20 19:39

Message:
Logged In: YES 
user_id=971153

I do see your point (In fact it was me who submitted the
patch #896011 which separated pdb namespace from the
program's -- and thus broke imports from __main__ ;-))..

I do want to bring a couple of points:

1. I don't think it matters whether a program can
intentionally interfere with pdb...Even when pdb's namespace
is separated, it's easy for the program to  interfere with
debugger.. (Or delete your home directory for that matter)

2. Importing from __main_ may not be common in the std lib,
but that's simply because stdlib doesn't contain that many
executable hence there are very few places where there is
__main__ to import from. 

google search for "from __main__ import" results in about 1M
hits.


3. Just for the record, profile module does not separate its
 namespace from programs's either...

So, basically, it boils down to this: what's worse breaking
imports from __main__ or risking accidental interference
between pdb and the program (e.g if your program redefines a
help symbol)...

As a middle ground it might be a good idea to expand the
patch to reduce pdb's dependency on module global symbols
and thus reducing the risk of interference.

What do you think?


----------------------------------------------------------------------

Comment By: Kuba Ko??czyk (jakamkon)
Date: 2006-04-20 04:17

Message:
Logged In: YES 
user_id=1491175

I think that exposing pdb's namespaces for debugged code is
dangerous.When debugged code have this kind of access he can
dynamic change pdb's behaviour without your control:

y.py:
die = """\
def destroy(x,y):
        print 'Iam crashing your HOME and deleting your FILES'

Pdb.__dict__['do_break'] = destroy # pdb's break = destroy
"""
x.py:
# innocently looking code;)
import y
exec(y.puff)
print "X"

with your patch:
$ python2.5 -m pdb x.py
> /home/xyz/python/x.py(1)<module>()
-> import y
(Pdb) Pdb.__dict__['do_break']
<function do_break at 0xb7cafdf4>
(Pdb) break
(Pdb) n
> /home/xyz/python/x.py(2)<module>()
-> exec(y.puff)
(Pdb) n
> /home/xyz/python/x.py(3)<module>()
-> print "X"
(Pdb) Pdb.__dict__['do_break']
<function destroy at 0xb7cb81b4>
(Pdb) break
Iam crashing your HOME and deleting your FILES

I think that this patch can't be accepted due to above
reason.According to my advanced reaserch;) ( find Lib/ -name
'*.py' -exec grep 'from __main__ import' {} -ls \; ) 'from
__main__' is rare case so maybe it will be reasonable to
simply handle ImportError and print something like
'** 'from __main__ import' not supported' message.What do  
you think?       
  

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1429539&group_id=5470

From noreply at sourceforge.net  Wed May 17 13:43:53 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 17 May 2006 04:43:53 -0700
Subject: [Patches] [ python-Patches-1490189 ] fix typo in os.utime()
	docstring
Message-ID: <E1FgKRV-0007tS-3C@sc8-sf-web1.sourceforge.net>

Patches item #1490189, was opened at 2006-05-17 07:43
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1490189&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Modules
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: M. Levinson (levinsm)
Assigned to: Nobody/Anonymous (nobody)
Summary: fix typo in os.utime() docstring

Initial Comment:
typo in os.utime() docstring: modification time should be mtime, not utime


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1490189&group_id=5470

From noreply at sourceforge.net  Wed May 17 13:45:34 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 17 May 2006 04:45:34 -0700
Subject: [Patches] [ python-Patches-1490190 ] add os.chflags() and
	os.lchflags() where available
Message-ID: <E1FgKT8-00080f-LG@sc8-sf-web3.sourceforge.net>

Patches item #1490190, was opened at 2006-05-17 07:45
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1490190&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Modules
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: M. Levinson (levinsm)
Assigned to: Nobody/Anonymous (nobody)
Summary: add os.chflags() and os.lchflags() where available

Initial Comment:
The return value from os.stat() includes st_flags on some systems, but
currently there's not much that can be done with the value; this patch aims
to make st_flags useful by adding some associated constants to stat.py and
the corresponding chflags() and lchflags() functions in posixmodule. For
completeness, shutil.copystat() is also updated to call os.chflags() where
it's available.


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1490190&group_id=5470

From noreply at sourceforge.net  Wed May 17 14:48:18 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 17 May 2006 05:48:18 -0700
Subject: [Patches] [ python-Patches-1490224 ] time.altzone does not include
	DST offset on Cygwin
Message-ID: <E1FgLRq-000646-Qw@sc8-sf-web1.sourceforge.net>

Patches item #1490224, was opened at 2006-05-17 14:48
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1490224&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.4
Status: Open
Resolution: None
Priority: 5
Submitted By: Christian Franke (chrfranke)
Assigned to: Nobody/Anonymous (nobody)
Summary: time.altzone does not include DST offset on Cygwin

Initial Comment:
On Cygwin (python-2.4.1-1) time.altzone is always set
equal to time.timezone.

Steps to reproduce:
$ python -c 'import time; time.tzset(); \
  print time.ctime(), time.daylight, \
  time.timezone, time.altzone'

Actual result (for CEST):
Sun May 14 13:46:55 2006 1 -3600 -3600

Expected result:
Sun May 14 13:46:55 2006 1 -3600 -7200

This causes failure of time conversions in e.g.
rdiff-backup-1.1.5

The attached patch should fix this for most timezones.
The function already uses the same heuristics in the
(!__CYGWIN__ &&  !HAVE_STRUCT_TM_TM_ZONE) case.


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1490224&group_id=5470

From noreply at sourceforge.net  Wed May 17 16:11:50 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 17 May 2006 07:11:50 -0700
Subject: [Patches] [ python-Patches-1489784 ] Patch for the urllib2 HOWTO
Message-ID: <E1FgMkg-00016y-AL@sc8-sf-web1.sourceforge.net>

Patches item #1489784, was opened at 2006-05-16 19:34
Message generated for change (Settings changed) made by gbrandl
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1489784&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Documentation
Group: Python 2.5
>Status: Closed
>Resolution: Accepted
Priority: 5
Submitted By: Mike Foord (mjfoord)
Assigned to: Nobody/Anonymous (nobody)
Summary: Patch for the urllib2 HOWTO

Initial Comment:
This is a minor unified diff for edits of the new
urllib2 HOWTO.

----------------------------------------------------------------------

>Comment By: Georg Brandl (gbrandl)
Date: 2006-05-17 14:11

Message:
Logged In: YES 
user_id=849994

Applied as rev. 46024.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1489784&group_id=5470

From noreply at sourceforge.net  Wed May 17 16:18:42 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 17 May 2006 07:18:42 -0700
Subject: [Patches] [ python-Patches-1490189 ] fix typo in os.utime()
	docstring
Message-ID: <E1FgMrK-00035l-8e@sc8-sf-web1.sourceforge.net>

Patches item #1490189, was opened at 2006-05-17 11:43
Message generated for change (Settings changed) made by gbrandl
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1490189&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Modules
Group: Python 2.5
>Status: Closed
>Resolution: Accepted
Priority: 5
Submitted By: M. Levinson (levinsm)
Assigned to: Nobody/Anonymous (nobody)
Summary: fix typo in os.utime() docstring

Initial Comment:
typo in os.utime() docstring: modification time should be mtime, not utime


----------------------------------------------------------------------

>Comment By: Georg Brandl (gbrandl)
Date: 2006-05-17 14:18

Message:
Logged In: YES 
user_id=849994

Applied in rev. 46025.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1490189&group_id=5470

From noreply at sourceforge.net  Wed May 17 16:23:35 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 17 May 2006 07:23:35 -0700
Subject: [Patches] [ python-Patches-1490190 ] add os.chflags() and
	os.lchflags() where available
Message-ID: <E1FgMw3-0004Ik-Np@sc8-sf-web1.sourceforge.net>

Patches item #1490190, was opened at 2006-05-17 11:45
Message generated for change (Settings changed) made by gbrandl
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1490190&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Modules
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: M. Levinson (levinsm)
>Assigned to: Neal Norwitz (nnorwitz)
Summary: add os.chflags() and os.lchflags() where available

Initial Comment:
The return value from os.stat() includes st_flags on some systems, but
currently there's not much that can be done with the value; this patch aims
to make st_flags useful by adding some associated constants to stat.py and
the corresponding chflags() and lchflags() functions in posixmodule. For
completeness, shutil.copystat() is also updated to call os.chflags() where
it's available.


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1490190&group_id=5470

From noreply at sourceforge.net  Wed May 17 16:24:05 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 17 May 2006 07:24:05 -0700
Subject: [Patches] [ python-Patches-1490190 ] add os.chflags() and
	os.lchflags() where available
Message-ID: <E1FgMwX-0004Qu-Dq@sc8-sf-web1.sourceforge.net>

Patches item #1490190, was opened at 2006-05-17 11:45
Message generated for change (Comment added) made by gbrandl
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1490190&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Modules
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: M. Levinson (levinsm)
Assigned to: Neal Norwitz (nnorwitz)
Summary: add os.chflags() and os.lchflags() where available

Initial Comment:
The return value from os.stat() includes st_flags on some systems, but
currently there's not much that can be done with the value; this patch aims
to make st_flags useful by adding some associated constants to stat.py and
the corresponding chflags() and lchflags() functions in posixmodule. For
completeness, shutil.copystat() is also updated to call os.chflags() where
it's available.


----------------------------------------------------------------------

>Comment By: Georg Brandl (gbrandl)
Date: 2006-05-17 14:24

Message:
Logged In: YES 
user_id=849994

Patch looks good. Do we want to include it?

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1490190&group_id=5470

From noreply at sourceforge.net  Wed May 17 16:27:10 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 17 May 2006 07:27:10 -0700
Subject: [Patches] [ python-Patches-1490224 ] time.altzone does not include
	DST offset on Cygwin
Message-ID: <E1FgMzW-0005GK-DW@sc8-sf-web1.sourceforge.net>

Patches item #1490224, was opened at 2006-05-17 12:48
Message generated for change (Comment added) made by gbrandl
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1490224&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.4
>Status: Closed
>Resolution: Accepted
Priority: 5
Submitted By: Christian Franke (chrfranke)
Assigned to: Nobody/Anonymous (nobody)
Summary: time.altzone does not include DST offset on Cygwin

Initial Comment:
On Cygwin (python-2.4.1-1) time.altzone is always set
equal to time.timezone.

Steps to reproduce:
$ python -c 'import time; time.tzset(); \
  print time.ctime(), time.daylight, \
  time.timezone, time.altzone'

Actual result (for CEST):
Sun May 14 13:46:55 2006 1 -3600 -3600

Expected result:
Sun May 14 13:46:55 2006 1 -3600 -7200

This causes failure of time conversions in e.g.
rdiff-backup-1.1.5

The attached patch should fix this for most timezones.
The function already uses the same heuristics in the
(!__CYGWIN__ &&  !HAVE_STRUCT_TM_TM_ZONE) case.


----------------------------------------------------------------------

>Comment By: Georg Brandl (gbrandl)
Date: 2006-05-17 14:27

Message:
Logged In: YES 
user_id=849994

Accepted in rev. 46026.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1490224&group_id=5470

From noreply at sourceforge.net  Wed May 17 16:46:22 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 17 May 2006 07:46:22 -0700
Subject: [Patches] [ python-Patches-1484758 ] cookielib: reduce (fatal)
	dependency on "beta" logging?
Message-ID: <E1FgNI6-0005wy-70@sc8-sf-web4-b.sourceforge.net>

Patches item #1484758, was opened at 2006-05-09 15:14
Message generated for change (Comment added) made by gbrandl
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1484758&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.4
>Status: Closed
>Resolution: Fixed
Priority: 5
Submitted By: kxroberto (kxroberto)
Assigned to: Nobody/Anonymous (nobody)
Summary: cookielib: reduce (fatal) dependency on "beta" logging?

Initial Comment:
The logging package is tagged "beta". Yet cookielib (as
the ONLY module in the std. lib !?) uses Logger.debug()
very excessively.

I got occasional nasty crash traces (from users) when
using cookielib Processors through urllib2
(multi-threaded usage) - see below.  The causes are not
errors in cookielib, but upon simple calls to
Logger.debug() : varying AttributeError's in logging,
which on the first glance seem to be impossible, as
those attributes are set in the related __init__()'s
but there are strange complex things going on with
roots/hierarchies/copy etc. so....  thread/lock
problems I'd guess.

the patch uncomments several debug() calls in cookielib
in import. only one's in important high-frequency
execution flow path (not ones upon errors and
exceptional states). And 2 minor fixes on pychecker
warnings.

After applying that, the nasty crash reports disappeared.

I do not understand completely why the cookielib
production code has to use the logging package
(expensive) at all. At least for the high-frq used
add_cookie_header its unnecessary. There could be some
simpler (detached) test code for testing purposes.
Importing the logging and setup is time consuming etc.
(see other patch for urllib2 import optimization. )

I'd recommend: At least as far as logging is "beta" and
cookielib NOT, all these debug()'s should be
uncommented, or at least called ONLY upon a dispatching
global 'use_logging' variable in cookielib, in case the
test code cannot be externalized nicely.


2 example error traces:

...File "cookielib.pyo",
line 1303, in add_cookie_header\\n\', \'  File
"logging\\\\__init__.pyo", line 878, in debug\\n\',
\'  File "logging\\\\__init__.pyo", line 1056, in
getEffectiveLevel\\n\', "AttributeError: Logger
instance has no attribute \'level\'\\n


...in http_request\\n\', \'  File "cookielib.pyo", line
1303, in add_cookie_header\\n\', \'  File
"logging\\\\__init__.pyo", line 876, in debug\\n\',
"AttributeError: Manager instance has no attribute
\'disable\'\\n


-robert

----------------------------------------------------------------------

>Comment By: Georg Brandl (gbrandl)
Date: 2006-05-17 14:46

Message:
Logged In: YES 
user_id=849994

Resolved with rev. 46027 by introducing a global "debug"
flag, like other libraries do.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1484758&group_id=5470

From noreply at sourceforge.net  Wed May 17 16:56:21 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 17 May 2006 07:56:21 -0700
Subject: [Patches] [ python-Patches-1486962 ] Patches and enhancements to
	turtle.py
Message-ID: <E1FgNRl-00072D-Qc@sc8-sf-web2.sourceforge.net>

Patches item #1486962, was opened at 2006-05-11 23:47
Message generated for change (Comment added) made by gbrandl
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1486962&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Tkinter
Group: Python 2.4
>Status: Closed
>Resolution: Accepted
Priority: 5
Submitted By: Vern Ceder (vceder)
Assigned to: Martin v. L??wis (loewis)
Summary: Patches and enhancements to turtle.py

Initial Comment:
Several bugfixes and enhancements (from several
teachers who use Python in secondary and post-secondary
classes) to improve usability in the classroom:

 * docstrings added to methods (Toby Donaldson)

 * added methods to control speed, window geometry and
  window title. (Vern Ceder)

 * added Turtle as alias for Pen - students can now
create Turtle objects (Toby Donaldson)

 * default window now larger and centered (Vern Ceder)

 * added done() function to start main event loop after
drawing (handy when running programs in IDLE) (Vern
Ceder/Chris Smith)

 * fixed bug where filled polygons are lowered (Atanas
Radenski)

 * fixed bug in circle() method to use self._fullcircle
/ 4.0 instead of 90.0 to determine start (Chris Smith)

 * removed several redundant assignments (Chris Smith)

 * added second demo which uses new features (Gregor Lindl)


----------------------------------------------------------------------

>Comment By: Georg Brandl (gbrandl)
Date: 2006-05-17 14:56

Message:
Logged In: YES 
user_id=849994

Thanks for the patch! Applied as rev. 46028.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1486962&group_id=5470

From noreply at sourceforge.net  Wed May 17 17:01:30 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 17 May 2006 08:01:30 -0700
Subject: [Patches] [ python-Patches-1489771 ] Updates to syntax rules in
	reference manual
Message-ID: <E1FgNWk-00084u-1e@sc8-sf-web2.sourceforge.net>

Patches item #1489771, was opened at 2006-05-16 19:09
Message generated for change (Comment added) made by gbrandl
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1489771&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Documentation
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: ?iga Seilnacht (zseil)
Assigned to: Nobody/Anonymous (nobody)
Summary: Updates to syntax rules in reference manual

Initial Comment:
I tried to update the reference manual to the current
Python syntax. Some things are still missing, most
notably the yield expression. Detailed description
of changes below. I can also attach the
generated webpages, if someone is interested.

Expressions
===========

List Displays
-------------
Reordered the rules so that the style is
consistent with the rest of the manual. Separated
listmaker into expression_list and
list_comprehension, for better readability.
Replaced "expression_list" between "for" and "in"
with "target_list". See this thread for details:
http://mail.python.org/pipermail/python-dev/2006-April/064264.html

The only thing missing is old_lambdadef.

Generator Expressions
---------------------
Simmilar as above.

Calls
-----
Fixed the latex syntax (somebody forgot to remove
a line when generators were introduced). Replaced
test with expression. Fixed allowed positions for
commas (func(*args,) is not allowed).

Boolean operations
------------------
Restructured the new conditional expression so
that it is more readable.


Simple Statements
=================

Augmented assignment statements
-------------------------------
Removed comments from "productionlist" macro,
since they broke the generated grammar.txt file.
Removed empty groups that are not needed anymore,
since automatic conversion to guillemets was
disabled. Unfortunately the escaped operator
characters would still need manual fixing in the
grammar.txt file.

The print statement
-------------------
Removed all uses of the "optional" macro and
replaced them with sqare brackets, since it broke
the generated grammar.txt file.

The import statement
--------------------
Replaced all invalid uses of name with identifier.
Added relative import notation to the grammar
section.

Description of relative imports is still needed.

The exec statement
------------------
Corrected a minor mistake, since

exec "a = 1" or "a = 2"

is not valid Python syntax.
Added a (commented out) section about a strange feature
(you can already treat exec as a function) that should
IMHO be included in documentation and its use encouraged
over the current notation.


Compound statements
===================

The with statement
------------------
Added missing macro.

Function definition
-------------------
Cleaned up "parameter_list" so that it is correct
and expresses all the restrictions, but is still
easier to understand (I hope).


Still needed
------------
Yield became an expression in version 2.5, but this
is not documented.

----------------------------------------------------------------------

>Comment By: Georg Brandl (gbrandl)
Date: 2006-05-17 15:01

Message:
Logged In: YES 
user_id=849994

I think the token names in the reference should not be
different from those in python/Grammar/Grammar. Aside from
this, the patch is fine.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1489771&group_id=5470

From noreply at sourceforge.net  Wed May 17 17:17:17 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 17 May 2006 08:17:17 -0700
Subject: [Patches] [ python-Patches-1484793 ] urllib2: resolves extremly
	slow import (of "everything")
Message-ID: <E1FgNm1-0002QU-J7@sc8-sf-web2.sourceforge.net>

Patches item #1484793, was opened at 2006-05-09 15:59
Message generated for change (Comment added) made by gbrandl
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1484793&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.4
>Status: Closed
>Resolution: Fixed
Priority: 5
Submitted By: kxroberto (kxroberto)
Assigned to: Nobody/Anonymous (nobody)
Summary: urllib2: resolves extremly slow import (of "everything")

Initial Comment:
This superseeds the old patch #1053150 (for an older
Python; it was stopped: "Jeremy doesn't like the idea")
in order to import the expensive modules behind urllib2
late.

I'm recommending now again to do this, as things are
almost unacceptable meanwhile.

In Py24, simply importing original urllib2 costs upto
to a second on my slower machines. the startup time of
some of my bigger apps/scripts goes mainly to importing
urllib2. More than half of the time goes into importing
cookielib (regarding profiler runs). Its almost
unusable so now in CGI scripts.

New modules were added to urllib2 meanwhile, and worst
of all the cookielib was inserted into urllib2 the same
old style "import everything on top of the file in a
kind of C-#include manner". 

Python offers best dynamic modularization of code. That
should be exploited for such an expensive
virtualization module like urllib2. There are usually
only very locations, where the sub-modules are referenced. 
This patch also enables to strip off unnecessary
modules (down to _MozillaCookieJar!) for
cx_freeze/py2exe distribution. 

( Since long I have this patch on my list, which I
apply after each Python installation regularly. )

--

As a side effect of this import-all practice a lazy
cookielib dependency came into normal Request
constructor code:
"origin_req_host = cookielib.request_host(self)"

I'd recommend, to copy/move this simple tool function
request_host into urllib2 in order to resolve the
cookielib dependency completely. (not done so far in
the patch)


-robert


----------------------------------------------------------------------

>Comment By: Georg Brandl (gbrandl)
Date: 2006-05-17 15:17

Message:
Logged In: YES 
user_id=849994

Fixed in rev. 46029.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1484793&group_id=5470

From noreply at sourceforge.net  Wed May 17 17:51:38 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 17 May 2006 08:51:38 -0700
Subject: [Patches] [ python-Patches-1180296 ] great improvement for
	locale.py formatting functions
Message-ID: <E1FgOJG-0006DM-B0@sc8-sf-web1.sourceforge.net>

Patches item #1180296, was opened at 2005-04-10 18:00
Message generated for change (Comment added) made by gbrandl
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1180296&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.5
>Status: Closed
>Resolution: Accepted
Priority: 5
Submitted By: Georg Brandl (gbrandl)
>Assigned to: Georg Brandl (gbrandl)
Summary: great improvement for locale.py formatting functions

Initial Comment:
This is a patch that adds two new functions to the
locale module.

The first, format_string(), can be used like str %
values, but takes the locale into account.
format() cannot be used for this since its grouping
feature currently does not work for arbitrary format
strings.

At the same time, this patch enhances format() so that
the user is notified by an exception that she should
only give one '%char' specification and nothing else.
The docs are also corrected.

The second function, currency(), formats a number
according to current currency locale settings.

The patch is complete with doc and test changes.
Please, test and comment!

Also corrects minor mistakes in doc.

----------------------------------------------------------------------

>Comment By: Georg Brandl (gbrandl)
Date: 2006-05-17 15:51

Message:
Logged In: YES 
user_id=849994

Slightly revised version committed as rev. 46030.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1180296&group_id=5470

From noreply at sourceforge.net  Wed May 17 17:54:46 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 17 May 2006 08:54:46 -0700
Subject: [Patches] [ python-Patches-1346238 ] A constant folding
	optimization pass for the AST
Message-ID: <E1FgOMI-0007HS-GT@sc8-sf-web1.sourceforge.net>

Patches item #1346238, was opened at 2005-11-02 18:49
Message generated for change (Comment added) made by gbrandl
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1346238&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Parser/Compiler
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Rune Holm (titanstar)
Assigned to: Neal Norwitz (nnorwitz)
Summary: A constant folding optimization pass for the AST

Initial Comment:
This patch adds the following: A visitor interface
generalized from the existing ast pass code in order to
make it easy to write ast passes that only care about
specific node types. A constant folding pass that looks
for operations involving number or string literals, and
calculates these at compile time. Example code snippets
that this pass will optimize:

3 + 4 + x => 7 + x

2 ** 2 ** 2 => 16

4 and 5 and x and 6 => x and 6

4 or 5 or x => 4

4 and 5 and ~6 => -7


When combined with patch 1346214, the compiler will
also optimize statements like

if 2**2**2 - 16: expensive_computation() => nothing

The patch adds two new files: Include/optimize.h and
Python.optimize.c. This was done because I anticipate
adding more AST optimizations later using the same
visitor interface, and Python/compile.c is already very
crowded with byte code generation and bytecode
optimization. If new files aren't desired, I could
easily change the pass to add the extra code to compile.c

This patch combined with patch 1346214 passes the unit
tests on all the platforms I've tested it on, namely:
macos 10.3/ppc
linux/x86
linux/amd64
linux/ppc
linux/ia64

valgrind on linux/x86 does not reveal any additional
leaks or uninitialized accesses that aren't already in
the svn head.


----------------------------------------------------------------------

>Comment By: Georg Brandl (gbrandl)
Date: 2006-05-17 15:54

Message:
Logged In: YES 
user_id=849994

Candidate for Iceland?

----------------------------------------------------------------------

Comment By: Raymond Hettinger (rhettinger)
Date: 2006-02-19 17:12

Message:
Logged In: YES 
user_id=80475

I'm +1 on the idea, but won't have an opportunity to 
review the patch in detail (to check for possible semantic 
changes).  Neal, what do you think?

----------------------------------------------------------------------

Comment By: Rune Holm (titanstar)
Date: 2006-02-19 13:35

Message:
Logged In: YES 
user_id=858364

It avoids generating constant objects with sizes above 20 (in a similar fashion 
as the bytecode peepholer), and checks whether the operand of unary minus 
is non-zero in order to avoid changing -0.0.

As for the bytecode peephole optimizer, this AST constant folder performs 
quite similar optimizations, but optimizes partially constant and/or and 
comparative expressions in addition. This patch should however not be seen 
as a replacement for the bytecode constant folder, but rather as a 
complement. An optimizing compiler typically contains many forms of 
constant folding in the different phases of compilation, since many later 
optimizations benefit from constant folding (warranting early constant 
folding), and some optimizations might emit code that benefit from constant 
folding again (warranting late constant folding). For an example of the 
former, consider the statement

if 1-1: some_code()

both passes are able to transform this into

if 0: some_code()

but since the AST constant folder is run before the dead code eliminator at 
<http://python.org/sf/1346214>, these two together are able to optimize 
the if statement away altogether.

Note that this patch probably won't apply cleanly anymore, since it was 
written three months ago and the AST code has undergone quite a few 
changes since then. But if there is interest in applying this patch, I'll gladly 
update it for the current trunk.


----------------------------------------------------------------------

Comment By: Raymond Hettinger (rhettinger)
Date: 2006-02-19 10:09

Message:
Logged In: YES 
user_id=80475

This should be compared to the constant folding already 
added to Py2.5 via the peepholer:
   dis.dis(compile('x=2+3', '', 'exec'))

Also, make sure it doesn't go over the top consuming 
memory for the likes of:

  '-' * 100
  (None,)*2000

Both of those should not be optimized away at compile-time.

Also, be sure not optimize away -0.0.  Thet is not the 
same as +0.0.  The distinction is important for branch 
cuts in cmath.


----------------------------------------------------------------------

Comment By: Georg Brandl (birkenfeld)
Date: 2006-02-19 09:41

Message:
Logged In: YES 
user_id=1188172

Neal, what do you think of this?

----------------------------------------------------------------------

Comment By: Rune Holm (titanstar)
Date: 2005-11-06 20:42

Message:
Logged In: YES 
user_id=858364

Sorry, I'm new to the sourceforge patch tracker. The patch should be 
attached now.

----------------------------------------------------------------------

Comment By: Simon Dahlbacka (sdahlbac)
Date: 2005-11-06 19:10

Message:
Logged In: YES 
user_id=750513

the actual patch is missing...

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1346238&group_id=5470

From noreply at sourceforge.net  Wed May 17 18:54:49 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 17 May 2006 09:54:49 -0700
Subject: [Patches] [ python-Patches-1489771 ] Updates to syntax rules in
	reference manual
Message-ID: <E1FgPIP-0004EM-1m@sc8-sf-web4-b.sourceforge.net>

Patches item #1489771, was opened at 2006-05-16 21:09
Message generated for change (Comment added) made by zseil
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1489771&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Documentation
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: ?iga Seilnacht (zseil)
Assigned to: Nobody/Anonymous (nobody)
Summary: Updates to syntax rules in reference manual

Initial Comment:
I tried to update the reference manual to the current
Python syntax. Some things are still missing, most
notably the yield expression. Detailed description
of changes below. I can also attach the
generated webpages, if someone is interested.

Expressions
===========

List Displays
-------------
Reordered the rules so that the style is
consistent with the rest of the manual. Separated
listmaker into expression_list and
list_comprehension, for better readability.
Replaced "expression_list" between "for" and "in"
with "target_list". See this thread for details:
http://mail.python.org/pipermail/python-dev/2006-April/064264.html

The only thing missing is old_lambdadef.

Generator Expressions
---------------------
Simmilar as above.

Calls
-----
Fixed the latex syntax (somebody forgot to remove
a line when generators were introduced). Replaced
test with expression. Fixed allowed positions for
commas (func(*args,) is not allowed).

Boolean operations
------------------
Restructured the new conditional expression so
that it is more readable.


Simple Statements
=================

Augmented assignment statements
-------------------------------
Removed comments from "productionlist" macro,
since they broke the generated grammar.txt file.
Removed empty groups that are not needed anymore,
since automatic conversion to guillemets was
disabled. Unfortunately the escaped operator
characters would still need manual fixing in the
grammar.txt file.

The print statement
-------------------
Removed all uses of the "optional" macro and
replaced them with sqare brackets, since it broke
the generated grammar.txt file.

The import statement
--------------------
Replaced all invalid uses of name with identifier.
Added relative import notation to the grammar
section.

Description of relative imports is still needed.

The exec statement
------------------
Corrected a minor mistake, since

exec "a = 1" or "a = 2"

is not valid Python syntax.
Added a (commented out) section about a strange feature
(you can already treat exec as a function) that should
IMHO be included in documentation and its use encouraged
over the current notation.


Compound statements
===================

The with statement
------------------
Added missing macro.

Function definition
-------------------
Cleaned up "parameter_list" so that it is correct
and expresses all the restrictions, but is still
easier to understand (I hope).


Still needed
------------
Yield became an expression in version 2.5, but this
is not documented.

----------------------------------------------------------------------

>Comment By: ?iga Seilnacht (zseil)
Date: 2006-05-17 18:54

Message:
Logged In: YES 
user_id=1326842

Token names in reference manual already differ
from those in Grammar file. I only added
new tokens where it helps readability:

 - I've split listmaker into expression_list
   (already present in reference manual and
   often used) and list_comprehension.
 - I added conditional_expression, because
   I thought it helps readability.
 - Differences in function definiton and call
   syntax can't be avoided since Grammar file
   doesn't express all the limitations.
 - Same goes for target_list; Grammar uses
   testlist, but that was one of the problems
   raised in the thread mentioned above.

The biggest problem is that what is known as
"test" in Grammar/Grammar, is "expression"
in the reference manual, and I think I fixed
all parts that didn't take this in cosideration.

I'm attaching two new patches.
reference_manual_updated.diff contains another
fix (removed unneeded markup and comments), but
is otherwise the same as the previous patch.

reference_manual_conservative.diff also contains
this fix, but removes tokens list_comprehension
and conditional_expression.

This means that there are still large differences
between Grammar file and reference manual, most
notably:

Grammar        manual
---------------------------------------------
NAME           identifier
expr           or_expr
test           expression
old_test       test
testlist       expression_list | target_list
testlist_safe  testlist

All of these differences were already present
before my changes. Let me know if you want
them fixed.

----------------------------------------------------------------------

Comment By: Georg Brandl (gbrandl)
Date: 2006-05-17 17:01

Message:
Logged In: YES 
user_id=849994

I think the token names in the reference should not be
different from those in python/Grammar/Grammar. Aside from
this, the patch is fine.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1489771&group_id=5470

From noreply at sourceforge.net  Wed May 17 18:59:50 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 17 May 2006 09:59:50 -0700
Subject: [Patches] [ python-Patches-1490384 ] PC new-logo-based icon set
Message-ID: <E1FgPNG-0002Xd-SM@sc8-sf-web2.sourceforge.net>

Patches item #1490384, was opened at 2006-05-17 16:59
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1490384&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Windows
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Andrew Clover (bobince)
Assigned to: Nobody/Anonymous (nobody)
Summary: PC new-logo-based icon set

Initial Comment:
Following positive discussion on -dev, here's the
updated version of the PC/py*.ico files I hacked up a
while ago.

The attachment is a ZIP, not a patch, as it contains
only binaries. Also available as tgz:

  http://doxdesk.com/img/software/py/win32-icons.tar.gz

Also possibly of interest:

  http://doxdesk.com/img/software/py/icons3.zip

This attachment contains only the simple replacement
files; the icons3 ZIP also contains:

  - source
  - versions including Windows Vista large icons
    (probably not worth including at this point as they're
    quite sizable and no-one is using Vista yet)
  - an egg icon
    (there is currently no installer/shell support for
eggs,
    but could be worth adding in future)
  - a new installer side banner
    (this has not currently seen any discussion on -dev,
    but may be worth considering if the intention is to
    leave behind the purple/green snake branding)


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1490384&group_id=5470

From noreply at sourceforge.net  Wed May 17 23:34:05 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 17 May 2006 14:34:05 -0700
Subject: [Patches] [ python-Patches-1484758 ] cookielib: reduce (fatal)
	dependency on "beta" logging?
Message-ID: <E1FgTef-00007B-0j@sc8-sf-web3.sourceforge.net>

Patches item #1484758, was opened at 2006-05-09 11:14
Message generated for change (Comment added) made by jimjjewett
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1484758&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.4
Status: Closed
Resolution: Fixed
Priority: 5
Submitted By: kxroberto (kxroberto)
Assigned to: Nobody/Anonymous (nobody)
Summary: cookielib: reduce (fatal) dependency on "beta" logging?

Initial Comment:
The logging package is tagged "beta". Yet cookielib (as
the ONLY module in the std. lib !?) uses Logger.debug()
very excessively.

I got occasional nasty crash traces (from users) when
using cookielib Processors through urllib2
(multi-threaded usage) - see below.  The causes are not
errors in cookielib, but upon simple calls to
Logger.debug() : varying AttributeError's in logging,
which on the first glance seem to be impossible, as
those attributes are set in the related __init__()'s
but there are strange complex things going on with
roots/hierarchies/copy etc. so....  thread/lock
problems I'd guess.

the patch uncomments several debug() calls in cookielib
in import. only one's in important high-frequency
execution flow path (not ones upon errors and
exceptional states). And 2 minor fixes on pychecker
warnings.

After applying that, the nasty crash reports disappeared.

I do not understand completely why the cookielib
production code has to use the logging package
(expensive) at all. At least for the high-frq used
add_cookie_header its unnecessary. There could be some
simpler (detached) test code for testing purposes.
Importing the logging and setup is time consuming etc.
(see other patch for urllib2 import optimization. )

I'd recommend: At least as far as logging is "beta" and
cookielib NOT, all these debug()'s should be
uncommented, or at least called ONLY upon a dispatching
global 'use_logging' variable in cookielib, in case the
test code cannot be externalized nicely.


2 example error traces:

...File "cookielib.pyo",
line 1303, in add_cookie_header\\n\', \'  File
"logging\\\\__init__.pyo", line 878, in debug\\n\',
\'  File "logging\\\\__init__.pyo", line 1056, in
getEffectiveLevel\\n\', "AttributeError: Logger
instance has no attribute \'level\'\\n


...in http_request\\n\', \'  File "cookielib.pyo", line
1303, in add_cookie_header\\n\', \'  File
"logging\\\\__init__.pyo", line 876, in debug\\n\',
"AttributeError: Manager instance has no attribute
\'disable\'\\n


-robert

----------------------------------------------------------------------

Comment By: Jim Jewett (jimjjewett)
Date: 2006-05-17 17:34

Message:
Logged In: YES 
user_id=764593

(1)  I don't think logging should be removed from the 
stdlib.  At the very least, the reasoning should be added 
to PEP 337, which says to *add* logging to the standard 
library.  http://www.python.org/dev/peps/pep-0337/  (There 
will probably be a Summer Of Code student funded to do 
this; if it is a problem, lets fix the problem in the 
logging module.)

(2)  Logging isn't really as unstable as you seem to think 
Beta implies; it is probably more stable than the newer 
cookielib, let alone the combination of cookielib, urllib2, 
and Processors.  (John Lee has been making long-overdue 
fixes to urllib2 -- and processors in particular -- because 
he was the first to really understand it well enough; these 
fixes are generally triggered by immediate problems and may 
not be complete fixes.)

I will agree that it might make sense to remove the beta 
marker from the version of logging that is distributed in 
the stdlib.

(3)  What else was shipped with those applications which 
caused this?  Which version of logging did you have?

Both tracebacks could be caused if the root logger were not 
a normal logger (and its manager therefore not a normal 
manager).  Vinay has taken some steps to allow 3rd party 
libraries to override the class of even the root logger, 
but doing it *right* is fairly subtle.

Another possibility is that you got burned by threads 
allowing access to half-constructed loggers or managers, or 
by broken PlaceHolders/fixups in the manager.  Again, this 
can't happen unless someone is doing at least two dangerous 
things, but ... it has triggered a few of the changelog 
entries.


----------------------------------------------------------------------

Comment By: Georg Brandl (gbrandl)
Date: 2006-05-17 10:46

Message:
Logged In: YES 
user_id=849994

Resolved with rev. 46027 by introducing a global "debug"
flag, like other libraries do.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1484758&group_id=5470

From noreply at sourceforge.net  Thu May 18 00:25:34 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 17 May 2006 15:25:34 -0700
Subject: [Patches] [ python-Patches-1484695 ] tarfile.py fix for #1471427
	and updates
Message-ID: <E1FgUSU-0002km-D0@sc8-sf-web3.sourceforge.net>

Patches item #1484695, was opened at 2006-05-09 09:51
Message generated for change (Comment added) made by jimjjewett
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1484695&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.5
Status: Closed
Resolution: Accepted
Priority: 5
Submitted By: Lars Gust?bel (gustaebel)
Assigned to: Nobody/Anonymous (nobody)
Summary: tarfile.py fix for #1471427 and updates

Initial Comment:
I have assembled a patch that adds some features from
my own development path of tarfile.py
(http://www.gustaebel.de/lars/tarfile/) and fixes
#1471427 which made some restructuring necessary. The
patch affects Lib/tarfile.py, Lib/test/test_tarfile.py
and Doc/lib/libtarfile.tex.

The changes the patch makes are as follows:

Sets the version to 0.8.0.

Support for base256 encoding of number fields (nti()
and itn()). Up to now this was hardcoded for the
filesize field to allow filesizes greater than 8 GB but
it is applicable to all number fields.

TarInfo.tobuf() has a boolean argument "posix" which
controls how number fields are written (base256 is
non-posix).

Both unsigned and signed (Sun and NeXT) checksums are
calculated. Header validation moves from TarFile.next()
to TarInfo.frombuf(). A header is valid only if its
checksum is okay, in the past the checksum was
calculated but ignored.

The TarFile.next() method was rearranged which makes
header processing clearer and more abstract and fixes
bug #1471427. However, this change breaks the interface
for subclassing in order to implement custom member
types but makes it much easier at the same time. The
mapping TYPE_METH was removed.

A new test ReadGNULongTest was added to test_tarfile.py
and testtar.tar was updated to be able to test the GNU
extensions LONGNAME and LONGLINK.


----------------------------------------------------------------------

Comment By: Jim Jewett (jimjjewett)
Date: 2006-05-17 18:25

Message:
Logged In: YES 
user_id=764593

(1)  Why change the exception style?

When raising an instance, the style guide (PEP-8, http://
www.python.org/dev/peps/pep-0008/) prefers to construct 
that instance; the older form is left over from String 
exceptions and will be removed in Python 3.

I could understand leaving them as they were, but if you're 
going to change them to make them consistent, why not use 
the current format?

(2)  Why get rid of the debug messages (such as the 
checksum check) entirely?  Guarding them in if self.debug, 
I would understand.

(3)  I wouldn't count on str(e) (where e is any ValueError 
instance) being as meaningful as the (current version's) 
ReadError("empty, unreadable or compressed file")


----------------------------------------------------------------------

Comment By: Georg Brandl (gbrandl)
Date: 2006-05-10 12:26

Message:
Logged In: YES 
user_id=849994

Thanks for the patch, applied as rev. 45954.

----------------------------------------------------------------------

Comment By: Lars Gust?bel (gustaebel)
Date: 2006-05-09 09:52

Message:
Logged In: YES 
user_id=642936

Here is testtar.tar to replace Lib/test/testtar.tar.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1484695&group_id=5470

From noreply at sourceforge.net  Thu May 18 00:44:15 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 17 May 2006 15:44:15 -0700
Subject: [Patches] [ python-Patches-1484793 ] urllib2: resolves extremly
	slow import (of "everything")
Message-ID: <E1FgUkZ-0006l3-M1@sc8-sf-web2.sourceforge.net>

Patches item #1484793, was opened at 2006-05-09 11:59
Message generated for change (Comment added) made by jimjjewett
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1484793&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.4
Status: Closed
Resolution: Fixed
Priority: 5
Submitted By: kxroberto (kxroberto)
Assigned to: Nobody/Anonymous (nobody)
Summary: urllib2: resolves extremly slow import (of "everything")

Initial Comment:
This superseeds the old patch #1053150 (for an older
Python; it was stopped: "Jeremy doesn't like the idea")
in order to import the expensive modules behind urllib2
late.

I'm recommending now again to do this, as things are
almost unacceptable meanwhile.

In Py24, simply importing original urllib2 costs upto
to a second on my slower machines. the startup time of
some of my bigger apps/scripts goes mainly to importing
urllib2. More than half of the time goes into importing
cookielib (regarding profiler runs). Its almost
unusable so now in CGI scripts.

New modules were added to urllib2 meanwhile, and worst
of all the cookielib was inserted into urllib2 the same
old style "import everything on top of the file in a
kind of C-#include manner". 

Python offers best dynamic modularization of code. That
should be exploited for such an expensive
virtualization module like urllib2. There are usually
only very locations, where the sub-modules are referenced. 
This patch also enables to strip off unnecessary
modules (down to _MozillaCookieJar!) for
cx_freeze/py2exe distribution. 

( Since long I have this patch on my list, which I
apply after each Python installation regularly. )

--

As a side effect of this import-all practice a lazy
cookielib dependency came into normal Request
constructor code:
"origin_req_host = cookielib.request_host(self)"

I'd recommend, to copy/move this simple tool function
request_host into urllib2 in order to resolve the
cookielib dependency completely. (not done so far in
the patch)


-robert


----------------------------------------------------------------------

Comment By: Jim Jewett (jimjjewett)
Date: 2006-05-17 18:44

Message:
Logged In: YES 
user_id=764593

Note that lazy importing can interact very badly with 
threads.

Why did you change the signature of OpenenDirector._open?  
The base class ignores the data, but subclasses may not.

Removing the SSL guard "if hasattr(httplib, 'HTTPS')" is 
questionable, since the ssl library is external and must be 
compiled separately, and therefore may not exist on some 
platforms even without other source customizations.


----------------------------------------------------------------------

Comment By: Georg Brandl (gbrandl)
Date: 2006-05-17 11:17

Message:
Logged In: YES 
user_id=849994

Fixed in rev. 46029.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1484793&group_id=5470

From noreply at sourceforge.net  Thu May 18 00:55:28 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 17 May 2006 15:55:28 -0700
Subject: [Patches] [ python-Patches-1489771 ] Updates to syntax rules in
	reference manual
Message-ID: <E1FgUvQ-0003Gw-KA@sc8-sf-web2.sourceforge.net>

Patches item #1489771, was opened at 2006-05-16 15:09
Message generated for change (Comment added) made by jimjjewett
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1489771&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Documentation
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: ?iga Seilnacht (zseil)
Assigned to: Nobody/Anonymous (nobody)
Summary: Updates to syntax rules in reference manual

Initial Comment:
I tried to update the reference manual to the current
Python syntax. Some things are still missing, most
notably the yield expression. Detailed description
of changes below. I can also attach the
generated webpages, if someone is interested.

Expressions
===========

List Displays
-------------
Reordered the rules so that the style is
consistent with the rest of the manual. Separated
listmaker into expression_list and
list_comprehension, for better readability.
Replaced "expression_list" between "for" and "in"
with "target_list". See this thread for details:
http://mail.python.org/pipermail/python-dev/2006-April/064264.html

The only thing missing is old_lambdadef.

Generator Expressions
---------------------
Simmilar as above.

Calls
-----
Fixed the latex syntax (somebody forgot to remove
a line when generators were introduced). Replaced
test with expression. Fixed allowed positions for
commas (func(*args,) is not allowed).

Boolean operations
------------------
Restructured the new conditional expression so
that it is more readable.


Simple Statements
=================

Augmented assignment statements
-------------------------------
Removed comments from "productionlist" macro,
since they broke the generated grammar.txt file.
Removed empty groups that are not needed anymore,
since automatic conversion to guillemets was
disabled. Unfortunately the escaped operator
characters would still need manual fixing in the
grammar.txt file.

The print statement
-------------------
Removed all uses of the "optional" macro and
replaced them with sqare brackets, since it broke
the generated grammar.txt file.

The import statement
--------------------
Replaced all invalid uses of name with identifier.
Added relative import notation to the grammar
section.

Description of relative imports is still needed.

The exec statement
------------------
Corrected a minor mistake, since

exec "a = 1" or "a = 2"

is not valid Python syntax.
Added a (commented out) section about a strange feature
(you can already treat exec as a function) that should
IMHO be included in documentation and its use encouraged
over the current notation.


Compound statements
===================

The with statement
------------------
Added missing macro.

Function definition
-------------------
Cleaned up "parameter_list" so that it is correct
and expresses all the restrictions, but is still
easier to understand (I hope).


Still needed
------------
Yield became an expression in version 2.5, but this
is not documented.

----------------------------------------------------------------------

Comment By: Jim Jewett (jimjjewett)
Date: 2006-05-17 18:55

Message:
Logged In: YES 
user_id=764593

I agree that it would be better if they were consistent.

But does the manual have better names?  If so, alpha is a 
good time to fix the grammar file.


----------------------------------------------------------------------

Comment By: ?iga Seilnacht (zseil)
Date: 2006-05-17 12:54

Message:
Logged In: YES 
user_id=1326842

Token names in reference manual already differ
from those in Grammar file. I only added
new tokens where it helps readability:

 - I've split listmaker into expression_list
   (already present in reference manual and
   often used) and list_comprehension.
 - I added conditional_expression, because
   I thought it helps readability.
 - Differences in function definiton and call
   syntax can't be avoided since Grammar file
   doesn't express all the limitations.
 - Same goes for target_list; Grammar uses
   testlist, but that was one of the problems
   raised in the thread mentioned above.

The biggest problem is that what is known as
"test" in Grammar/Grammar, is "expression"
in the reference manual, and I think I fixed
all parts that didn't take this in cosideration.

I'm attaching two new patches.
reference_manual_updated.diff contains another
fix (removed unneeded markup and comments), but
is otherwise the same as the previous patch.

reference_manual_conservative.diff also contains
this fix, but removes tokens list_comprehension
and conditional_expression.

This means that there are still large differences
between Grammar file and reference manual, most
notably:

Grammar        manual
---------------------------------------------
NAME           identifier
expr           or_expr
test           expression
old_test       test
testlist       expression_list | target_list
testlist_safe  testlist

All of these differences were already present
before my changes. Let me know if you want
them fixed.

----------------------------------------------------------------------

Comment By: Georg Brandl (gbrandl)
Date: 2006-05-17 11:01

Message:
Logged In: YES 
user_id=849994

I think the token names in the reference should not be
different from those in python/Grammar/Grammar. Aside from
this, the patch is fine.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1489771&group_id=5470

From noreply at sourceforge.net  Thu May 18 05:46:11 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 17 May 2006 20:46:11 -0700
Subject: [Patches] [ python-Patches-1489771 ] Updates to syntax rules in
	reference manual
Message-ID: <E1FgZSl-0002yX-68@sc8-sf-web2.sourceforge.net>

Patches item #1489771, was opened at 2006-05-16 21:09
Message generated for change (Comment added) made by zseil
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1489771&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Documentation
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: ?iga Seilnacht (zseil)
Assigned to: Nobody/Anonymous (nobody)
Summary: Updates to syntax rules in reference manual

Initial Comment:
I tried to update the reference manual to the current
Python syntax. Some things are still missing, most
notably the yield expression. Detailed description
of changes below. I can also attach the
generated webpages, if someone is interested.

Expressions
===========

List Displays
-------------
Reordered the rules so that the style is
consistent with the rest of the manual. Separated
listmaker into expression_list and
list_comprehension, for better readability.
Replaced "expression_list" between "for" and "in"
with "target_list". See this thread for details:
http://mail.python.org/pipermail/python-dev/2006-April/064264.html

The only thing missing is old_lambdadef.

Generator Expressions
---------------------
Simmilar as above.

Calls
-----
Fixed the latex syntax (somebody forgot to remove
a line when generators were introduced). Replaced
test with expression. Fixed allowed positions for
commas (func(*args,) is not allowed).

Boolean operations
------------------
Restructured the new conditional expression so
that it is more readable.


Simple Statements
=================

Augmented assignment statements
-------------------------------
Removed comments from "productionlist" macro,
since they broke the generated grammar.txt file.
Removed empty groups that are not needed anymore,
since automatic conversion to guillemets was
disabled. Unfortunately the escaped operator
characters would still need manual fixing in the
grammar.txt file.

The print statement
-------------------
Removed all uses of the "optional" macro and
replaced them with sqare brackets, since it broke
the generated grammar.txt file.

The import statement
--------------------
Replaced all invalid uses of name with identifier.
Added relative import notation to the grammar
section.

Description of relative imports is still needed.

The exec statement
------------------
Corrected a minor mistake, since

exec "a = 1" or "a = 2"

is not valid Python syntax.
Added a (commented out) section about a strange feature
(you can already treat exec as a function) that should
IMHO be included in documentation and its use encouraged
over the current notation.


Compound statements
===================

The with statement
------------------
Added missing macro.

Function definition
-------------------
Cleaned up "parameter_list" so that it is correct
and expresses all the restrictions, but is still
easier to understand (I hope).


Still needed
------------
Yield became an expression in version 2.5, but this
is not documented.

----------------------------------------------------------------------

>Comment By: ?iga Seilnacht (zseil)
Date: 2006-05-18 05:46

Message:
Logged In: YES 
user_id=1326842

I created another patch and updated the previous
two with the following fixes:

 - Disabled an example of the EBNF notation
   in introduction to prevent name clashes
   and inclusion into the generated grammar file.
 - yield can be a bare statement in 2.5.
 - Reintroduced "name" in import statements,
   so that explanation can stay the same.
 - Reformated __future__ import statement,
   but left the latex syntax broken so that
   it still won't be included into grammar.txt.
 - Fixed a paragraph about valid __future__
   features.

The new patch is more of an example how hard it
would be to synchronise the names. It is more
or less consistent with the Grammar file, but as
a consequence is completely out of sync with the
surrounding text.
While I would like to see less differences betwen
Grammar and Reference manual, I don't think it is
that easy, since someone would also have to check
the rest of the text and fix all incorrect
references.
I would guess that the same holds for changing
the Grammar file; you would simply have to change
too much code.
I think that the simplest solution is to add the
table from my previous comment to the PEP 306.


----------------------------------------------------------------------

Comment By: Jim Jewett (jimjjewett)
Date: 2006-05-18 00:55

Message:
Logged In: YES 
user_id=764593

I agree that it would be better if they were consistent.

But does the manual have better names?  If so, alpha is a 
good time to fix the grammar file.


----------------------------------------------------------------------

Comment By: ?iga Seilnacht (zseil)
Date: 2006-05-17 18:54

Message:
Logged In: YES 
user_id=1326842

Token names in reference manual already differ
from those in Grammar file. I only added
new tokens where it helps readability:

 - I've split listmaker into expression_list
   (already present in reference manual and
   often used) and list_comprehension.
 - I added conditional_expression, because
   I thought it helps readability.
 - Differences in function definiton and call
   syntax can't be avoided since Grammar file
   doesn't express all the limitations.
 - Same goes for target_list; Grammar uses
   testlist, but that was one of the problems
   raised in the thread mentioned above.

The biggest problem is that what is known as
"test" in Grammar/Grammar, is "expression"
in the reference manual, and I think I fixed
all parts that didn't take this in cosideration.

I'm attaching two new patches.
reference_manual_updated.diff contains another
fix (removed unneeded markup and comments), but
is otherwise the same as the previous patch.

reference_manual_conservative.diff also contains
this fix, but removes tokens list_comprehension
and conditional_expression.

This means that there are still large differences
between Grammar file and reference manual, most
notably:

Grammar        manual
---------------------------------------------
NAME           identifier
expr           or_expr
test           expression
old_test       test
testlist       expression_list | target_list
testlist_safe  testlist

All of these differences were already present
before my changes. Let me know if you want
them fixed.

----------------------------------------------------------------------

Comment By: Georg Brandl (gbrandl)
Date: 2006-05-17 17:01

Message:
Logged In: YES 
user_id=849994

I think the token names in the reference should not be
different from those in python/Grammar/Grammar. Aside from
this, the patch is fine.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1489771&group_id=5470

From noreply at sourceforge.net  Thu May 18 07:54:51 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 17 May 2006 22:54:51 -0700
Subject: [Patches] [ python-Patches-1484793 ] urllib2: resolves extremly
	slow import (of "everything")
Message-ID: <E1FgbTH-0006r6-JX@sc8-sf-web3.sourceforge.net>

Patches item #1484793, was opened at 2006-05-09 15:59
Message generated for change (Comment added) made by gbrandl
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1484793&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.4
Status: Closed
Resolution: Fixed
Priority: 5
Submitted By: kxroberto (kxroberto)
Assigned to: Nobody/Anonymous (nobody)
Summary: urllib2: resolves extremly slow import (of "everything")

Initial Comment:
This superseeds the old patch #1053150 (for an older
Python; it was stopped: "Jeremy doesn't like the idea")
in order to import the expensive modules behind urllib2
late.

I'm recommending now again to do this, as things are
almost unacceptable meanwhile.

In Py24, simply importing original urllib2 costs upto
to a second on my slower machines. the startup time of
some of my bigger apps/scripts goes mainly to importing
urllib2. More than half of the time goes into importing
cookielib (regarding profiler runs). Its almost
unusable so now in CGI scripts.

New modules were added to urllib2 meanwhile, and worst
of all the cookielib was inserted into urllib2 the same
old style "import everything on top of the file in a
kind of C-#include manner". 

Python offers best dynamic modularization of code. That
should be exploited for such an expensive
virtualization module like urllib2. There are usually
only very locations, where the sub-modules are referenced. 
This patch also enables to strip off unnecessary
modules (down to _MozillaCookieJar!) for
cx_freeze/py2exe distribution. 

( Since long I have this patch on my list, which I
apply after each Python installation regularly. )

--

As a side effect of this import-all practice a lazy
cookielib dependency came into normal Request
constructor code:
"origin_req_host = cookielib.request_host(self)"

I'd recommend, to copy/move this simple tool function
request_host into urllib2 in order to resolve the
cookielib dependency completely. (not done so far in
the patch)


-robert


----------------------------------------------------------------------

>Comment By: Georg Brandl (gbrandl)
Date: 2006-05-18 05:54

Message:
Logged In: YES 
user_id=849994

Jim: Note that I didn't apply the patch from here, but only
added lazy-loading of ftplib, cookielib and mimetypes.

----------------------------------------------------------------------

Comment By: Jim Jewett (jimjjewett)
Date: 2006-05-17 22:44

Message:
Logged In: YES 
user_id=764593

Note that lazy importing can interact very badly with 
threads.

Why did you change the signature of OpenenDirector._open?  
The base class ignores the data, but subclasses may not.

Removing the SSL guard "if hasattr(httplib, 'HTTPS')" is 
questionable, since the ssl library is external and must be 
compiled separately, and therefore may not exist on some 
platforms even without other source customizations.


----------------------------------------------------------------------

Comment By: Georg Brandl (gbrandl)
Date: 2006-05-17 15:17

Message:
Logged In: YES 
user_id=849994

Fixed in rev. 46029.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1484793&group_id=5470

From noreply at sourceforge.net  Thu May 18 08:11:54 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 17 May 2006 23:11:54 -0700
Subject: [Patches] [ python-Patches-1484695 ] tarfile.py fix for #1471427
	and updates
Message-ID: <E1Fgbjm-0005sH-2k@sc8-sf-web4-b.sourceforge.net>

Patches item #1484695, was opened at 2006-05-09 13:51
Message generated for change (Comment added) made by gbrandl
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1484695&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.5
Status: Closed
Resolution: Accepted
Priority: 5
Submitted By: Lars Gust?bel (gustaebel)
Assigned to: Nobody/Anonymous (nobody)
Summary: tarfile.py fix for #1471427 and updates

Initial Comment:
I have assembled a patch that adds some features from
my own development path of tarfile.py
(http://www.gustaebel.de/lars/tarfile/) and fixes
#1471427 which made some restructuring necessary. The
patch affects Lib/tarfile.py, Lib/test/test_tarfile.py
and Doc/lib/libtarfile.tex.

The changes the patch makes are as follows:

Sets the version to 0.8.0.

Support for base256 encoding of number fields (nti()
and itn()). Up to now this was hardcoded for the
filesize field to allow filesizes greater than 8 GB but
it is applicable to all number fields.

TarInfo.tobuf() has a boolean argument "posix" which
controls how number fields are written (base256 is
non-posix).

Both unsigned and signed (Sun and NeXT) checksums are
calculated. Header validation moves from TarFile.next()
to TarInfo.frombuf(). A header is valid only if its
checksum is okay, in the past the checksum was
calculated but ignored.

The TarFile.next() method was rearranged which makes
header processing clearer and more abstract and fixes
bug #1471427. However, this change breaks the interface
for subclassing in order to implement custom member
types but makes it much easier at the same time. The
mapping TYPE_METH was removed.

A new test ReadGNULongTest was added to test_tarfile.py
and testtar.tar was updated to be able to test the GNU
extensions LONGNAME and LONGLINK.


----------------------------------------------------------------------

>Comment By: Georg Brandl (gbrandl)
Date: 2006-05-18 06:11

Message:
Logged In: YES 
user_id=849994

Jim: I agree with your comments and have committed an
improved version.

----------------------------------------------------------------------

Comment By: Jim Jewett (jimjjewett)
Date: 2006-05-17 22:25

Message:
Logged In: YES 
user_id=764593

(1)  Why change the exception style?

When raising an instance, the style guide (PEP-8, http://
www.python.org/dev/peps/pep-0008/) prefers to construct 
that instance; the older form is left over from String 
exceptions and will be removed in Python 3.

I could understand leaving them as they were, but if you're 
going to change them to make them consistent, why not use 
the current format?

(2)  Why get rid of the debug messages (such as the 
checksum check) entirely?  Guarding them in if self.debug, 
I would understand.

(3)  I wouldn't count on str(e) (where e is any ValueError 
instance) being as meaningful as the (current version's) 
ReadError("empty, unreadable or compressed file")


----------------------------------------------------------------------

Comment By: Georg Brandl (gbrandl)
Date: 2006-05-10 16:26

Message:
Logged In: YES 
user_id=849994

Thanks for the patch, applied as rev. 45954.

----------------------------------------------------------------------

Comment By: Lars Gust?bel (gustaebel)
Date: 2006-05-09 13:52

Message:
Logged In: YES 
user_id=642936

Here is testtar.tar to replace Lib/test/testtar.tar.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1484695&group_id=5470

From noreply at sourceforge.net  Thu May 18 08:21:18 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 17 May 2006 23:21:18 -0700
Subject: [Patches] [ python-Patches-1484758 ] cookielib: reduce (fatal)
	dependency on "beta" logging?
Message-ID: <E1Fgbss-0001ig-6c@sc8-sf-web4-b.sourceforge.net>

Patches item #1484758, was opened at 2006-05-09 15:14
Message generated for change (Comment added) made by gbrandl
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1484758&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.4
Status: Closed
Resolution: Fixed
Priority: 5
Submitted By: kxroberto (kxroberto)
Assigned to: Nobody/Anonymous (nobody)
Summary: cookielib: reduce (fatal) dependency on "beta" logging?

Initial Comment:
The logging package is tagged "beta". Yet cookielib (as
the ONLY module in the std. lib !?) uses Logger.debug()
very excessively.

I got occasional nasty crash traces (from users) when
using cookielib Processors through urllib2
(multi-threaded usage) - see below.  The causes are not
errors in cookielib, but upon simple calls to
Logger.debug() : varying AttributeError's in logging,
which on the first glance seem to be impossible, as
those attributes are set in the related __init__()'s
but there are strange complex things going on with
roots/hierarchies/copy etc. so....  thread/lock
problems I'd guess.

the patch uncomments several debug() calls in cookielib
in import. only one's in important high-frequency
execution flow path (not ones upon errors and
exceptional states). And 2 minor fixes on pychecker
warnings.

After applying that, the nasty crash reports disappeared.

I do not understand completely why the cookielib
production code has to use the logging package
(expensive) at all. At least for the high-frq used
add_cookie_header its unnecessary. There could be some
simpler (detached) test code for testing purposes.
Importing the logging and setup is time consuming etc.
(see other patch for urllib2 import optimization. )

I'd recommend: At least as far as logging is "beta" and
cookielib NOT, all these debug()'s should be
uncommented, or at least called ONLY upon a dispatching
global 'use_logging' variable in cookielib, in case the
test code cannot be externalized nicely.


2 example error traces:

...File "cookielib.pyo",
line 1303, in add_cookie_header\\n\', \'  File
"logging\\\\__init__.pyo", line 878, in debug\\n\',
\'  File "logging\\\\__init__.pyo", line 1056, in
getEffectiveLevel\\n\', "AttributeError: Logger
instance has no attribute \'level\'\\n


...in http_request\\n\', \'  File "cookielib.pyo", line
1303, in add_cookie_header\\n\', \'  File
"logging\\\\__init__.pyo", line 876, in debug\\n\',
"AttributeError: Manager instance has no attribute
\'disable\'\\n


-robert

----------------------------------------------------------------------

>Comment By: Georg Brandl (gbrandl)
Date: 2006-05-18 06:21

Message:
Logged In: YES 
user_id=849994

As long as only one standard module uses logging, it's quite
useless. And, its use doesn't even comply to PEP 337 ("py."
prefix). So if a student wants to implement PEP 337 in SoC,
he/she is welcome to do this consistently, and any obscure
logging bugs will certainly show up soon after that.

----------------------------------------------------------------------

Comment By: Jim Jewett (jimjjewett)
Date: 2006-05-17 21:34

Message:
Logged In: YES 
user_id=764593

(1)  I don't think logging should be removed from the 
stdlib.  At the very least, the reasoning should be added 
to PEP 337, which says to *add* logging to the standard 
library.  http://www.python.org/dev/peps/pep-0337/  (There 
will probably be a Summer Of Code student funded to do 
this; if it is a problem, lets fix the problem in the 
logging module.)

(2)  Logging isn't really as unstable as you seem to think 
Beta implies; it is probably more stable than the newer 
cookielib, let alone the combination of cookielib, urllib2, 
and Processors.  (John Lee has been making long-overdue 
fixes to urllib2 -- and processors in particular -- because 
he was the first to really understand it well enough; these 
fixes are generally triggered by immediate problems and may 
not be complete fixes.)

I will agree that it might make sense to remove the beta 
marker from the version of logging that is distributed in 
the stdlib.

(3)  What else was shipped with those applications which 
caused this?  Which version of logging did you have?

Both tracebacks could be caused if the root logger were not 
a normal logger (and its manager therefore not a normal 
manager).  Vinay has taken some steps to allow 3rd party 
libraries to override the class of even the root logger, 
but doing it *right* is fairly subtle.

Another possibility is that you got burned by threads 
allowing access to half-constructed loggers or managers, or 
by broken PlaceHolders/fixups in the manager.  Again, this 
can't happen unless someone is doing at least two dangerous 
things, but ... it has triggered a few of the changelog 
entries.


----------------------------------------------------------------------

Comment By: Georg Brandl (gbrandl)
Date: 2006-05-17 14:46

Message:
Logged In: YES 
user_id=849994

Resolved with rev. 46027 by introducing a global "debug"
flag, like other libraries do.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1484758&group_id=5470

From noreply at sourceforge.net  Thu May 18 09:38:02 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Thu, 18 May 2006 00:38:02 -0700
Subject: [Patches] [ python-Patches-1484758 ] cookielib: reduce (fatal)
	dependency on "beta" logging?
Message-ID: <E1Fgd58-0004c1-7o@sc8-sf-web2.sourceforge.net>

Patches item #1484758, was opened at 2006-05-09 15:14
Message generated for change (Comment added) made by vsajip
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1484758&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.4
Status: Closed
Resolution: Fixed
Priority: 5
Submitted By: kxroberto (kxroberto)
Assigned to: Nobody/Anonymous (nobody)
Summary: cookielib: reduce (fatal) dependency on "beta" logging?

Initial Comment:
The logging package is tagged "beta". Yet cookielib (as
the ONLY module in the std. lib !?) uses Logger.debug()
very excessively.

I got occasional nasty crash traces (from users) when
using cookielib Processors through urllib2
(multi-threaded usage) - see below.  The causes are not
errors in cookielib, but upon simple calls to
Logger.debug() : varying AttributeError's in logging,
which on the first glance seem to be impossible, as
those attributes are set in the related __init__()'s
but there are strange complex things going on with
roots/hierarchies/copy etc. so....  thread/lock
problems I'd guess.

the patch uncomments several debug() calls in cookielib
in import. only one's in important high-frequency
execution flow path (not ones upon errors and
exceptional states). And 2 minor fixes on pychecker
warnings.

After applying that, the nasty crash reports disappeared.

I do not understand completely why the cookielib
production code has to use the logging package
(expensive) at all. At least for the high-frq used
add_cookie_header its unnecessary. There could be some
simpler (detached) test code for testing purposes.
Importing the logging and setup is time consuming etc.
(see other patch for urllib2 import optimization. )

I'd recommend: At least as far as logging is "beta" and
cookielib NOT, all these debug()'s should be
uncommented, or at least called ONLY upon a dispatching
global 'use_logging' variable in cookielib, in case the
test code cannot be externalized nicely.


2 example error traces:

...File "cookielib.pyo",
line 1303, in add_cookie_header\\n\', \'  File
"logging\\\\__init__.pyo", line 878, in debug\\n\',
\'  File "logging\\\\__init__.pyo", line 1056, in
getEffectiveLevel\\n\', "AttributeError: Logger
instance has no attribute \'level\'\\n


...in http_request\\n\', \'  File "cookielib.pyo", line
1303, in add_cookie_header\\n\', \'  File
"logging\\\\__init__.pyo", line 876, in debug\\n\',
"AttributeError: Manager instance has no attribute
\'disable\'\\n


-robert

----------------------------------------------------------------------

>Comment By: Vinay Sajip (vsajip)
Date: 2006-05-18 07:38

Message:
Logged In: YES 
user_id=308438

I've updated the status of the logging package in Subversion 
from "beta" to "production". This seems reasonable, since 
the package has been part of Python since 2.3 ;-)

I would agree with Jim Jewett that the problems observed are 
likely to be general threading problems rather than bugs in 
logging - the latter are unlikely to present with symptoms 
such as those described.

----------------------------------------------------------------------

Comment By: Georg Brandl (gbrandl)
Date: 2006-05-18 06:21

Message:
Logged In: YES 
user_id=849994

As long as only one standard module uses logging, it's quite
useless. And, its use doesn't even comply to PEP 337 ("py."
prefix). So if a student wants to implement PEP 337 in SoC,
he/she is welcome to do this consistently, and any obscure
logging bugs will certainly show up soon after that.

----------------------------------------------------------------------

Comment By: Jim Jewett (jimjjewett)
Date: 2006-05-17 21:34

Message:
Logged In: YES 
user_id=764593

(1)  I don't think logging should be removed from the 
stdlib.  At the very least, the reasoning should be added 
to PEP 337, which says to *add* logging to the standard 
library.  http://www.python.org/dev/peps/pep-0337/  (There 
will probably be a Summer Of Code student funded to do 
this; if it is a problem, lets fix the problem in the 
logging module.)

(2)  Logging isn't really as unstable as you seem to think 
Beta implies; it is probably more stable than the newer 
cookielib, let alone the combination of cookielib, urllib2, 
and Processors.  (John Lee has been making long-overdue 
fixes to urllib2 -- and processors in particular -- because 
he was the first to really understand it well enough; these 
fixes are generally triggered by immediate problems and may 
not be complete fixes.)

I will agree that it might make sense to remove the beta 
marker from the version of logging that is distributed in 
the stdlib.

(3)  What else was shipped with those applications which 
caused this?  Which version of logging did you have?

Both tracebacks could be caused if the root logger were not 
a normal logger (and its manager therefore not a normal 
manager).  Vinay has taken some steps to allow 3rd party 
libraries to override the class of even the root logger, 
but doing it *right* is fairly subtle.

Another possibility is that you got burned by threads 
allowing access to half-constructed loggers or managers, or 
by broken PlaceHolders/fixups in the manager.  Again, this 
can't happen unless someone is doing at least two dangerous 
things, but ... it has triggered a few of the changelog 
entries.


----------------------------------------------------------------------

Comment By: Georg Brandl (gbrandl)
Date: 2006-05-17 14:46

Message:
Logged In: YES 
user_id=849994

Resolved with rev. 46027 by introducing a global "debug"
flag, like other libraries do.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1484758&group_id=5470

From online.dept at bancorpsouthonline.com  Mon May 15 02:25:56 2006
From: online.dept at bancorpsouthonline.com (BancorpSouth)
Date: Mon, 15 May 2006 01:25:56 +0100
Subject: [Patches] BancorpSouth's Online Access website has been upgraded
Message-ID: <E1FfQuK-0006qi-00@palmspirit.vm.bytemark.co.uk>

An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/patches/attachments/20060515/8cabeba2/attachment-0001.htm 

From noreply at sourceforge.net  Thu May 18 12:00:40 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Thu, 18 May 2006 03:00:40 -0700
Subject: [Patches] [ python-Patches-1484695 ] tarfile.py fix for #1471427
	and updates
Message-ID: <E1FgfJA-0005Mx-Qo@sc8-sf-web3.sourceforge.net>

Patches item #1484695, was opened at 2006-05-09 15:51
Message generated for change (Comment added) made by gustaebel
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1484695&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.5
Status: Closed
Resolution: Accepted
Priority: 5
Submitted By: Lars Gust?bel (gustaebel)
Assigned to: Nobody/Anonymous (nobody)
Summary: tarfile.py fix for #1471427 and updates

Initial Comment:
I have assembled a patch that adds some features from
my own development path of tarfile.py
(http://www.gustaebel.de/lars/tarfile/) and fixes
#1471427 which made some restructuring necessary. The
patch affects Lib/tarfile.py, Lib/test/test_tarfile.py
and Doc/lib/libtarfile.tex.

The changes the patch makes are as follows:

Sets the version to 0.8.0.

Support for base256 encoding of number fields (nti()
and itn()). Up to now this was hardcoded for the
filesize field to allow filesizes greater than 8 GB but
it is applicable to all number fields.

TarInfo.tobuf() has a boolean argument "posix" which
controls how number fields are written (base256 is
non-posix).

Both unsigned and signed (Sun and NeXT) checksums are
calculated. Header validation moves from TarFile.next()
to TarInfo.frombuf(). A header is valid only if its
checksum is okay, in the past the checksum was
calculated but ignored.

The TarFile.next() method was rearranged which makes
header processing clearer and more abstract and fixes
bug #1471427. However, this change breaks the interface
for subclassing in order to implement custom member
types but makes it much easier at the same time. The
mapping TYPE_METH was removed.

A new test ReadGNULongTest was added to test_tarfile.py
and testtar.tar was updated to be able to test the GNU
extensions LONGNAME and LONGLINK.


----------------------------------------------------------------------

>Comment By: Lars Gust?bel (gustaebel)
Date: 2006-05-18 12:00

Message:
Logged In: YES 
user_id=642936

On (1): agreed.

On (2): There is still a debug message emitted for a bad
checksum: In TarInfo.frombuf() at the bottom a ValueError is
raised if they don't match and is passed on to
TarFile.next() where it is put out as a debug message using
the _dbg() method in the except clause. The debug message
where it is now (r46040) is senseless because the try-block
will be left when TarInfo.frombuf() fails .

On (3): You're right, I attached a patch that adds another
Exception HeaderError which is raised in TarInfo.frombuf()
instead of ValueError in case of a bad header. I hope that
is acceptable.

----------------------------------------------------------------------

Comment By: Georg Brandl (gbrandl)
Date: 2006-05-18 08:11

Message:
Logged In: YES 
user_id=849994

Jim: I agree with your comments and have committed an
improved version.

----------------------------------------------------------------------

Comment By: Jim Jewett (jimjjewett)
Date: 2006-05-18 00:25

Message:
Logged In: YES 
user_id=764593

(1)  Why change the exception style?

When raising an instance, the style guide (PEP-8, http://
www.python.org/dev/peps/pep-0008/) prefers to construct 
that instance; the older form is left over from String 
exceptions and will be removed in Python 3.

I could understand leaving them as they were, but if you're 
going to change them to make them consistent, why not use 
the current format?

(2)  Why get rid of the debug messages (such as the 
checksum check) entirely?  Guarding them in if self.debug, 
I would understand.

(3)  I wouldn't count on str(e) (where e is any ValueError 
instance) being as meaningful as the (current version's) 
ReadError("empty, unreadable or compressed file")


----------------------------------------------------------------------

Comment By: Georg Brandl (gbrandl)
Date: 2006-05-10 18:26

Message:
Logged In: YES 
user_id=849994

Thanks for the patch, applied as rev. 45954.

----------------------------------------------------------------------

Comment By: Lars Gust?bel (gustaebel)
Date: 2006-05-09 15:52

Message:
Logged In: YES 
user_id=642936

Here is testtar.tar to replace Lib/test/testtar.tar.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1484695&group_id=5470

From noreply at sourceforge.net  Thu May 18 16:33:45 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Thu, 18 May 2006 07:33:45 -0700
Subject: [Patches] [ python-Patches-1490989 ] Describe Py_DEBUG and
	friends...
Message-ID: <E1FgjZR-0007wf-95@sc8-sf-web4-b.sourceforge.net>

Patches item #1490989, was opened at 2006-05-18 09:33
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1490989&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Documentation
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Skip Montanaro (montanaro)
Assigned to: Nobody/Anonymous (nobody)
Summary: Describe Py_DEBUG and friends...

Initial Comment:
Here's a minimal first cut at describing Py_DEBUG and
friends.
Hopefully the description is detailed enough to be
useful but not so
detailed as to always be out-of-date.  If nothing else,
perhaps it
will spur someone with better knowledge of these
details to correct my
mistakes and flesh out detail.

As is usual for me, I have not run this through LaTeX.
 (I lack it at
work.)  It will need to be checked for correctness.

Skip


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1490989&group_id=5470

From noreply at sourceforge.net  Thu May 18 17:13:44 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Thu, 18 May 2006 08:13:44 -0700
Subject: [Patches] [ python-Patches-1490989 ] Describe Py_DEBUG and
	friends...
Message-ID: <E1FgkC8-0006ub-3b@sc8-sf-web1.sourceforge.net>

Patches item #1490989, was opened at 2006-05-18 15:33
Message generated for change (Comment added) made by mwh
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1490989&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Documentation
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Skip Montanaro (montanaro)
Assigned to: Nobody/Anonymous (nobody)
Summary: Describe Py_DEBUG and friends...

Initial Comment:
Here's a minimal first cut at describing Py_DEBUG and
friends.
Hopefully the description is detailed enough to be
useful but not so
detailed as to always be out-of-date.  If nothing else,
perhaps it
will spur someone with better knowledge of these
details to correct my
mistakes and flesh out detail.

As is usual for me, I have not run this through LaTeX.
 (I lack it at
work.)  It will need to be checked for correctness.

Skip


----------------------------------------------------------------------

>Comment By: Michael Hudson (mwh)
Date: 2006-05-18 16:13

Message:
Logged In: YES 
user_id=6656

You know about Misc/SpecialBuilds.txt?

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1490989&group_id=5470

From noreply at sourceforge.net  Thu May 18 17:48:28 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Thu, 18 May 2006 08:48:28 -0700
Subject: [Patches] [ python-Patches-1490989 ] Describe Py_DEBUG and
	friends...
Message-ID: <E1Fgkjk-00029A-Pw@sc8-sf-web4-b.sourceforge.net>

Patches item #1490989, was opened at 2006-05-18 09:33
Message generated for change (Comment added) made by montanaro
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1490989&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Documentation
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Skip Montanaro (montanaro)
Assigned to: Nobody/Anonymous (nobody)
Summary: Describe Py_DEBUG and friends...

Initial Comment:
Here's a minimal first cut at describing Py_DEBUG and
friends.
Hopefully the description is detailed enough to be
useful but not so
detailed as to always be out-of-date.  If nothing else,
perhaps it
will spur someone with better knowledge of these
details to correct my
mistakes and flesh out detail.

As is usual for me, I have not run this through LaTeX.
 (I lack it at
work.)  It will need to be checked for correctness.

Skip


----------------------------------------------------------------------

>Comment By: Skip Montanaro (montanaro)
Date: 2006-05-18 10:48

Message:
Logged In: YES 
user_id=44345

If I did I forgot.  I was grepping around trying to figure
out what Py_DEBUG did exactly and saw the XXX comment
in Doc/api/api.tex.  I naively assumed at that point that
Py_DEBUG and friends weren't documented.  In any case, I
think it makes sense to have some mention of the available
options in api.tex, simply because it's the "official"
documentation.  It should reference Misc/SpecialBuilds.txt
as well.

Skip


----------------------------------------------------------------------

Comment By: Michael Hudson (mwh)
Date: 2006-05-18 10:13

Message:
Logged In: YES 
user_id=6656

You know about Misc/SpecialBuilds.txt?

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1490989&group_id=5470

From noreply at sourceforge.net  Thu May 18 19:13:47 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Thu, 18 May 2006 10:13:47 -0700
Subject: [Patches] [ python-Patches-1473257 ] Add a gi_code attr to
	generators
Message-ID: <E1Fgm4J-0003Dy-MM@sc8-sf-web4-b.sourceforge.net>

Patches item #1473257, was opened at 2006-04-19 17:39
Message generated for change (Comment added) made by collinwinter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1473257&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Collin Winter (collinwinter)
Assigned to: Phillip J. Eby (pje)
Summary: Add a gi_code attr to generators

Initial Comment:
In the test suite for one of my packages, I've used
something like gen.gi_frame.f_code.co_name to help make
human-readable assertions about when certain generators
are run deep inside the application. This was possible
because Python 2.4 guaranteed that gi_frame was always
a frame instance, even after the generator exhausted
itself. In Python 2.5, however, gi_frame is None when
the generator has run till exhaustion, meaning that I
can't always get to f_code.co_name.

I'd like to add a gi_code attribute to generators that
would allow users to access the code object behind the
generator, even when gi_frame is None. This attribute
would be read-only and would follow this rule:

>>> def f():
...     yield 5
...
>>> g = f()
>>> g.gi_code is f.func_code
True
>>>

The attached patch (against r45570) implements the
proposed attribute (in Include/genobject.h and
Objects/genobject.c) and adds test cases to
Lib/test/test_generators.py for this attribute.

----------------------------------------------------------------------

>Comment By: Collin Winter (collinwinter)
Date: 2006-05-18 13:13

Message:
Logged In: YES 
user_id=1344176

In addition to updating the main patch to r46040, I've
included a diff against Misc/NEWS to make mention of the
gi_code attribute.

Any thoughts on getting this into 2.5a3?

----------------------------------------------------------------------

Comment By: Georg Brandl (gbrandl)
Date: 2006-04-30 07:17

Message:
Logged In: YES 
user_id=849994

Phillip, do you have an opinion on this one?

----------------------------------------------------------------------

Comment By: Collin Winter (collinwinter)
Date: 2006-04-29 17:11

Message:
Logged In: YES 
user_id=1344176

I've updated the patch; it's now against SVN r45808.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1473257&group_id=5470

From noreply at sourceforge.net  Fri May 19 09:31:36 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Fri, 19 May 2006 00:31:36 -0700
Subject: [Patches] [ python-Patches-1490190 ] add os.chflags() and
	os.lchflags() where available
Message-ID: <E1FgzSS-0004mu-BN@sc8-sf-web3.sourceforge.net>

Patches item #1490190, was opened at 2006-05-17 04:45
Message generated for change (Comment added) made by nnorwitz
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1490190&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Modules
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: M. Levinson (levinsm)
Assigned to: Neal Norwitz (nnorwitz)
Summary: add os.chflags() and os.lchflags() where available

Initial Comment:
The return value from os.stat() includes st_flags on some systems, but
currently there's not much that can be done with the value; this patch aims
to make st_flags useful by adding some associated constants to stat.py and
the corresponding chflags() and lchflags() functions in posixmodule. For
completeness, shutil.copystat() is also updated to call os.chflags() where
it's available.


----------------------------------------------------------------------

>Comment By: Neal Norwitz (nnorwitz)
Date: 2006-05-19 00:31

Message:
Logged In: YES 
user_id=33168

What operating systems is this available on?  The only one
I've found is OS X.  The man page says it's from BSD 4.4.  I
tried on Linux of various flavors (4+), Solaris, and Tru64.
 None of them had chflags.  I also could only find some of
the flags in my sys/stat.h that were added to stat.py. 
stat.h didn't have UF_NOUNLINK, SF_NOUNLINK, SF_SNAPSHOT.

As far as the patch itself, it looks good.  There are a
couple of changes if this should be accepted though:  doc
needs \versionadded{2.5}, I would prefer flags as the var
name rather than i in posixmodule.c (btw you shouldn't need
to init path).

Also would need to update Misc/NEWS and ACKS if accepted.

----------------------------------------------------------------------

Comment By: Georg Brandl (gbrandl)
Date: 2006-05-17 07:24

Message:
Logged In: YES 
user_id=849994

Patch looks good. Do we want to include it?

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1490190&group_id=5470

From noreply at sourceforge.net  Fri May 19 13:22:41 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Fri, 19 May 2006 04:22:41 -0700
Subject: [Patches] [ python-Patches-1490384 ] PC new-logo-based icon set
Message-ID: <E1Fh345-0004nd-Uk@sc8-sf-web2.sourceforge.net>

Patches item #1490384, was opened at 2006-05-17 18:59
Message generated for change (Comment added) made by loewis
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1490384&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Windows
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Andrew Clover (bobince)
Assigned to: Nobody/Anonymous (nobody)
Summary: PC new-logo-based icon set

Initial Comment:
Following positive discussion on -dev, here's the
updated version of the PC/py*.ico files I hacked up a
while ago.

The attachment is a ZIP, not a patch, as it contains
only binaries. Also available as tgz:

  http://doxdesk.com/img/software/py/win32-icons.tar.gz

Also possibly of interest:

  http://doxdesk.com/img/software/py/icons3.zip

This attachment contains only the simple replacement
files; the icons3 ZIP also contains:

  - source
  - versions including Windows Vista large icons
    (probably not worth including at this point as they're
    quite sizable and no-one is using Vista yet)
  - an egg icon
    (there is currently no installer/shell support for
eggs,
    but could be worth adding in future)
  - a new installer side banner
    (this has not currently seen any discussion on -dev,
    but may be worth considering if the intention is to
    leave behind the purple/green snake branding)


----------------------------------------------------------------------

>Comment By: Martin v. L??wis (loewis)
Date: 2006-05-19 13:22

Message:
Logged In: YES 
user_id=21627

Thanks! Are you willing to contribute them to the PSF, under
the terms of the contributor agreement at

http://www.python.org/psf/contrib/contrib-form/

?

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1490384&group_id=5470

From noreply at sourceforge.net  Fri May 19 13:47:00 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Fri, 19 May 2006 04:47:00 -0700
Subject: [Patches] [ python-Patches-1490190 ] add os.chflags() and
	os.lchflags() where available
Message-ID: <E1Fh3Rc-0006e4-9n@sc8-sf-web2.sourceforge.net>

Patches item #1490190, was opened at 2006-05-17 07:45
Message generated for change (Comment added) made by levinsm
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1490190&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Modules
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: M. Levinson (levinsm)
Assigned to: Neal Norwitz (nnorwitz)
Summary: add os.chflags() and os.lchflags() where available

Initial Comment:
The return value from os.stat() includes st_flags on some systems, but
currently there's not much that can be done with the value; this patch aims
to make st_flags useful by adding some associated constants to stat.py and
the corresponding chflags() and lchflags() functions in posixmodule. For
completeness, shutil.copystat() is also updated to call os.chflags() where
it's available.


----------------------------------------------------------------------

>Comment By: M. Levinson (levinsm)
Date: 2006-05-19 07:47

Message:
Logged In: YES 
user_id=1522893

In addition to MacOS, chflags(2) is available on FreeBSD,
OpenBSD, and
NetBSD. The flags in Lib/stat.py are the full set of
available values
although, as you noted, MacOS hasn't (yet) implemented
several of them.

Thanks for the comments - I've attached an updated version
of the patch
incorporating your suggestions.


----------------------------------------------------------------------

Comment By: Neal Norwitz (nnorwitz)
Date: 2006-05-19 03:31

Message:
Logged In: YES 
user_id=33168

What operating systems is this available on?  The only one
I've found is OS X.  The man page says it's from BSD 4.4.  I
tried on Linux of various flavors (4+), Solaris, and Tru64.
 None of them had chflags.  I also could only find some of
the flags in my sys/stat.h that were added to stat.py. 
stat.h didn't have UF_NOUNLINK, SF_NOUNLINK, SF_SNAPSHOT.

As far as the patch itself, it looks good.  There are a
couple of changes if this should be accepted though:  doc
needs \versionadded{2.5}, I would prefer flags as the var
name rather than i in posixmodule.c (btw you shouldn't need
to init path).

Also would need to update Misc/NEWS and ACKS if accepted.

----------------------------------------------------------------------

Comment By: Georg Brandl (gbrandl)
Date: 2006-05-17 10:24

Message:
Logged In: YES 
user_id=849994

Patch looks good. Do we want to include it?

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1490190&group_id=5470

From noreply at sourceforge.net  Fri May 19 16:21:36 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Fri, 19 May 2006 07:21:36 -0700
Subject: [Patches] [ python-Patches-1490384 ] PC new-logo-based icon set
Message-ID: <E1Fh5rE-0007ac-DM@sc8-sf-web1.sourceforge.net>

Patches item #1490384, was opened at 2006-05-17 16:59
Message generated for change (Comment added) made by bobince
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1490384&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Windows
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Andrew Clover (bobince)
Assigned to: Nobody/Anonymous (nobody)
Summary: PC new-logo-based icon set

Initial Comment:
Following positive discussion on -dev, here's the
updated version of the PC/py*.ico files I hacked up a
while ago.

The attachment is a ZIP, not a patch, as it contains
only binaries. Also available as tgz:

  http://doxdesk.com/img/software/py/win32-icons.tar.gz

Also possibly of interest:

  http://doxdesk.com/img/software/py/icons3.zip

This attachment contains only the simple replacement
files; the icons3 ZIP also contains:

  - source
  - versions including Windows Vista large icons
    (probably not worth including at this point as they're
    quite sizable and no-one is using Vista yet)
  - an egg icon
    (there is currently no installer/shell support for
eggs,
    but could be worth adding in future)
  - a new installer side banner
    (this has not currently seen any discussion on -dev,
    but may be worth considering if the intention is to
    leave behind the purple/green snake branding)


----------------------------------------------------------------------

>Comment By: Andrew Clover (bobince)
Date: 2006-05-19 14:21

Message:
Logged In: YES 
user_id=311085

Sure, no worries. I'll fax over the -python version since I
have ancient contributions to cover too.


----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2006-05-19 11:22

Message:
Logged In: YES 
user_id=21627

Thanks! Are you willing to contribute them to the PSF, under
the terms of the contributor agreement at

http://www.python.org/psf/contrib/contrib-form/

?

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1490384&group_id=5470

From noreply at sourceforge.net  Fri May 19 16:34:22 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Fri, 19 May 2006 07:34:22 -0700
Subject: [Patches] [ python-Patches-1490384 ] PC new-logo-based icon set
Message-ID: <E1Fh63a-0001It-Ly@sc8-sf-web5.sourceforge.net>

Patches item #1490384, was opened at 2006-05-17 18:59
Message generated for change (Settings changed) made by loewis
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1490384&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Windows
Group: Python 2.5
Status: Open
Resolution: None
>Priority: 8
Submitted By: Andrew Clover (bobince)
>Assigned to: Martin v. L??wis (loewis)
Summary: PC new-logo-based icon set

Initial Comment:
Following positive discussion on -dev, here's the
updated version of the PC/py*.ico files I hacked up a
while ago.

The attachment is a ZIP, not a patch, as it contains
only binaries. Also available as tgz:

  http://doxdesk.com/img/software/py/win32-icons.tar.gz

Also possibly of interest:

  http://doxdesk.com/img/software/py/icons3.zip

This attachment contains only the simple replacement
files; the icons3 ZIP also contains:

  - source
  - versions including Windows Vista large icons
    (probably not worth including at this point as they're
    quite sizable and no-one is using Vista yet)
  - an egg icon
    (there is currently no installer/shell support for
eggs,
    but could be worth adding in future)
  - a new installer side banner
    (this has not currently seen any discussion on -dev,
    but may be worth considering if the intention is to
    leave behind the purple/green snake branding)


----------------------------------------------------------------------

Comment By: Andrew Clover (bobince)
Date: 2006-05-19 16:21

Message:
Logged In: YES 
user_id=311085

Sure, no worries. I'll fax over the -python version since I
have ancient contributions to cover too.


----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2006-05-19 13:22

Message:
Logged In: YES 
user_id=21627

Thanks! Are you willing to contribute them to the PSF, under
the terms of the contributor agreement at

http://www.python.org/psf/contrib/contrib-form/

?

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1490384&group_id=5470

From noreply at sourceforge.net  Fri May 19 19:39:43 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Fri, 19 May 2006 10:39:43 -0700
Subject: [Patches] [ python-Patches-1491759 ] IDLE L&F on MacOSX
Message-ID: <E1Fh8wx-0000aV-7G@sc8-sf-web1.sourceforge.net>

Patches item #1491759, was opened at 2006-05-19 19:39
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1491759&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: IDLE
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Ronald Oussoren (ronaldoussoren)
Assigned to: Nobody/Anonymous (nobody)
Summary: IDLE L&F on MacOSX 

Initial Comment:
The attached patch fixes some L&F issues on MacOSX:

- IDLE now reacts to file-open AppleEvents, which means that if a user 
associates IDLE.app with .py files IDLE will open .py files when the user 
double-clicks on them

- Hide the tcl/tk console window that gets opened by default when IDLE is 
in an application bundle (that's a misfeature of aquatk)

- Patch the menu's to make sure they better conform to the HIG.

- PyShell/EditorWindow  status_bar no longer overlaps with the resize 
widget in the lower-left corner of the window

Open issues:

- When you double-click on a file and IDLE is not yet open the file will be 
opened, but IDLE will open the default shell window just above it :-(

- I'm not terribly happy with the code changes that implement the 
updated menu structure.

- The default keybindings on OSX are the windows keybindings. I haven't 
checked yet if that can be fixed programmaticly, I also haven't verified if 
the macos keybindings are fully correct for OSX.

- The general L&F is still wrong, but that isn't really IDLE's fault: tcl/tk 
doesn't fully conform to the HIG yet (dialogs without title bars, wrong 
default dinwos background, wrong widget for tabbed windows, ...).

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1491759&group_id=5470

From noreply at sourceforge.net  Fri May 19 21:19:08 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Fri, 19 May 2006 12:19:08 -0700
Subject: [Patches] [ python-Patches-1491804 ] Simple slice support for
	list.sort() and .reverse()
Message-ID: <E1FhAVA-0006gS-T3@sc8-sf-web2.sourceforge.net>

Patches item #1491804, was opened at 2006-05-19 21:19
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1491804&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Heiko Wundram (hwundram)
Assigned to: Nobody/Anonymous (nobody)
Summary: Simple slice support for list.sort() and .reverse()

Initial Comment:
As requested per

http://groups.google.de/group/comp.lang.python/browse_thread/thread/6feadf8170900e53/aa621eed0fe14050?hl=de#aa621eed0fe14050

list.sort() should support extra keyword arguments
start and stop, which specify a slice of the whole list
to sort inplace.

The attached patch implements this functionality, and
extends the sorted() builtin to also offer these
keyword arguments, and additionally implements slice
support (also with start, stop) for list.reverse().

The patch updates the list object methods and the
sorted builtin, and also updates the testsuite to check
for the new keyword arguments and updates the
documentation to list them.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1491804&group_id=5470

From noreply at sourceforge.net  Fri May 19 21:23:34 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Fri, 19 May 2006 12:23:34 -0700
Subject: [Patches] [ python-Patches-1488098 ] MacOSX: distutils support for
	-arch and -isysroot flags
Message-ID: <E1FhAZS-0005Xu-Kg@sc8-sf-web1.sourceforge.net>

Patches item #1488098, was opened at 2006-05-13 23:13
Message generated for change (Comment added) made by ronaldoussoren
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1488098&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Distutils and setup.py
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Ronald Oussoren (ronaldoussoren)
Assigned to: Nobody/Anonymous (nobody)
Summary: MacOSX: distutils support for -arch and -isysroot flags

Initial Comment:
This flag adds specific support for the -arch and -isysroot flags of GCC 
on MacOSX 10.4 or later.

The patch consists of two parts:

1) Remove these flags (and their arguments) from the base CFLAGS/
LDFLAGS when compiling extensions on OSX 10.3 or earlier because GCC 
doesn't support those arguments in the version of GCC that is shipped 
what the version of the OS.

2) Strip -arch and -isysroot (again including their arguments) from the 
base CFLAGS/LDFLAGS when the user has specified new values for them 
in the extra_compile_args and extra_link args.

The second part is needed because -isysroot can only be specified once 
and the -arch option is incremental, without this patch you cannot 
compile using a different SDK or for fewer architectures.

A reason for wanting to do the latter is software like psyco that is only 
fully supported on one of the architectures for OSX.

----------------------------------------------------------------------

>Comment By: Ronald Oussoren (ronaldoussoren)
Date: 2006-05-19 21:23

Message:
Logged In: YES 
user_id=580910

I've updated the patch with the stylistic changes you've requested.

BTW. I don't think idx is confusing, although I suppose it helps that the Dutch 
term for index is index :-)

BTW. Distutils.archive_util claims it should be kept 2.1 compatible, although I 
don't know if that request covers all of distutils. PEP 291 doesn't mention 
distutils.

----------------------------------------------------------------------

Comment By: Neal Norwitz (nnorwitz)
Date: 2006-05-15 09:41

Message:
Logged In: YES 
user_id=33168

I don't see any obvious problems with the patch.  I have
some nits though:

 * This is pretty complex: int(os.uname()[2].split('.')[0])
   I would prefer if it was broken up and use local
variables to explain better what's going on (or at least a
comment that shows the expected format).
  - same with '.'.join(m.group(1).split('.')[:2])

 * Remove double blank lines at first line of patch in
util.py and the last 3 lines (the pass is not needed).

 * unixcompiler.py, use True/False instead of 1/0.  I forget
what the compatibility of distutils is, but I see other uses
of True and False

   - same comment about getting the kernel with a complex expr

   - I prefer index instead of idx (I don't like abbrevs,
particularly for foreign speakers)

Instead of: 
+        if '-arch' in cc_args:
+            stripArch = 1

just set it:  stripArch = '-arch' in cc_args

Same for stripSysroot

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1488098&group_id=5470

From noreply at sourceforge.net  Fri May 19 23:27:32 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Fri, 19 May 2006 14:27:32 -0700
Subject: [Patches] [ python-Patches-1491866 ] Complex representation
Message-ID: <E1FhCVQ-0004Nu-7U@sc8-sf-web2.sourceforge.net>

Patches item #1491866, was opened at 2006-05-19 23:27
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1491866&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Heiko Wundram (hwundram)
Assigned to: Nobody/Anonymous (nobody)
Summary: Complex representation

Initial Comment:
As per request on c.l.p:

http://groups.google.de/group/comp.lang.python/browse_thread/thread/26c93fefefd3a100/bf1924ce28fac1ac?hl=de#bf1924ce28fac1ac

I've implemented a small patch to change the output of
repr(x) for complex variables, so that complex(repr(x))
works for any complex x. This changes the output of
repr(x) to

'<r>+<i>j'

without brackets, but leaves the string output
untouched. This change of behaviour would be in line
with int(repr(x)) and float(repr(x)) being defined for
any int or float x, repectively.

I don't know whether this patch is sensible, and
whether it breaks any current code, because (for example)

eval("5*%r" % (1+2j,))

won't work properly anymore, or whether it'd be more
sensible to change the complex constructor to also
accept a bracketed expression. I'll attach a patch to
do the latter later.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1491866&group_id=5470

From noreply at sourceforge.net  Sat May 20 00:28:12 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Fri, 19 May 2006 15:28:12 -0700
Subject: [Patches] [ python-Patches-1491866 ] Complex representation
Message-ID: <E1FhDS8-0006Dw-FP@sc8-sf-web5.sourceforge.net>

Patches item #1491866, was opened at 2006-05-19 23:27
Message generated for change (Comment added) made by hwundram
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1491866&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Heiko Wundram (hwundram)
Assigned to: Nobody/Anonymous (nobody)
Summary: Complex representation

Initial Comment:
As per request on c.l.p:

http://groups.google.de/group/comp.lang.python/browse_thread/thread/26c93fefefd3a100/bf1924ce28fac1ac?hl=de#bf1924ce28fac1ac

I've implemented a small patch to change the output of
repr(x) for complex variables, so that complex(repr(x))
works for any complex x. This changes the output of
repr(x) to

'<r>+<i>j'

without brackets, but leaves the string output
untouched. This change of behaviour would be in line
with int(repr(x)) and float(repr(x)) being defined for
any int or float x, repectively.

I don't know whether this patch is sensible, and
whether it breaks any current code, because (for example)

eval("5*%r" % (1+2j,))

won't work properly anymore, or whether it'd be more
sensible to change the complex constructor to also
accept a bracketed expression. I'll attach a patch to
do the latter later.

----------------------------------------------------------------------

>Comment By: Heiko Wundram (hwundram)
Date: 2006-05-20 00:28

Message:
Logged In: YES 
user_id=791932

The second patch (python-complex-constructor.diff) changes
the constructor to accept bracketed complex numbers which
are enclosed in a single bracket. I'd rather say this is the
better appropach to have complex(repr(x)) work, but I leave
both patches attached to this bug.

The latter patch also creates test cases testing for
formatting errors with bracketed expressions.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1491866&group_id=5470

From noreply at sourceforge.net  Sat May 20 03:17:33 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Fri, 19 May 2006 18:17:33 -0700
Subject: [Patches] [ python-Patches-1491939 ] Fix for bug #1486663 mutable
	types check kwargs in tp_new
Message-ID: <E1FhG61-00063T-Pb@sc8-sf-web3.sourceforge.net>

Patches item #1491939, was opened at 2006-05-20 03:17
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1491939&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: None
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: ?iga Seilnacht (zseil)
Assigned to: Nobody/Anonymous (nobody)
Summary: Fix for bug #1486663 mutable types check kwargs in tp_new

Initial Comment:
set and deque check that they are not called with
keyword arguments in their tp_new method, although
they are mutable. This makes them harder to subclass.
See the bug report for more details.

Patch contains tests and fixes for both of them.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1491939&group_id=5470

From noreply at sourceforge.net  Sat May 20 04:06:45 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Fri, 19 May 2006 19:06:45 -0700
Subject: [Patches] [ python-Patches-1491939 ] Fix for bug #1486663 mutable
	types check kwargs in tp_new
Message-ID: <E1FhGrd-0008Vf-Cs@sc8-sf-web5.sourceforge.net>

Patches item #1491939, was opened at 2006-05-20 03:17
Message generated for change (Settings changed) made by zseil
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1491939&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
>Category: Core (C code)
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: ?iga Seilnacht (zseil)
Assigned to: Nobody/Anonymous (nobody)
Summary: Fix for bug #1486663 mutable types check kwargs in tp_new

Initial Comment:
set and deque check that they are not called with
keyword arguments in their tp_new method, although
they are mutable. This makes them harder to subclass.
See the bug report for more details.

Patch contains tests and fixes for both of them.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1491939&group_id=5470

From noreply at sourceforge.net  Sat May 20 07:14:48 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Fri, 19 May 2006 22:14:48 -0700
Subject: [Patches] [ python-Patches-1491804 ] Simple slice support for
	list.sort() and .reverse()
Message-ID: <E1FhJnc-0001lG-Oh@sc8-sf-web2.sourceforge.net>

Patches item #1491804, was opened at 2006-05-19 15:19
Message generated for change (Comment added) made by tjreedy
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1491804&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Heiko Wundram (hwundram)
Assigned to: Nobody/Anonymous (nobody)
Summary: Simple slice support for list.sort() and .reverse()

Initial Comment:
As requested per

http://groups.google.de/group/comp.lang.python/browse_thread/thread/6feadf8170900e53/aa621eed0fe14050?hl=de#aa621eed0fe14050

list.sort() should support extra keyword arguments
start and stop, which specify a slice of the whole list
to sort inplace.

The attached patch implements this functionality, and
extends the sorted() builtin to also offer these
keyword arguments, and additionally implements slice
support (also with start, stop) for list.reverse().

The patch updates the list object methods and the
sorted builtin, and also updates the testsuite to check
for the new keyword arguments and updates the
documentation to list them.

----------------------------------------------------------------------

>Comment By: Terry J. Reedy (tjreedy)
Date: 2006-05-20 01:14

Message:
Logged In: YES 
user_id=593130

Having thought about submitting an RFE for start/stop for 
reverse, I support the enhancement.  Please do the same 
for array.reverse(, so list and array.reverse continue to 
have the same signature.  Two uses: one way to swap 
partitions in place is reverse each partition and then the 
whole sequence; probably more useful is the slice reversal 
in one standard method for sequentially generating 
permutations in lexicographical order.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1491804&group_id=5470

From noreply at sourceforge.net  Sat May 20 07:24:20 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Fri, 19 May 2006 22:24:20 -0700
Subject: [Patches] [ python-Patches-1491866 ] Complex representation
Message-ID: <E1FhJwq-0002Dd-6z@sc8-sf-web4-b.sourceforge.net>

Patches item #1491866, was opened at 2006-05-19 17:27
Message generated for change (Comment added) made by tjreedy
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1491866&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Heiko Wundram (hwundram)
Assigned to: Nobody/Anonymous (nobody)
Summary: Complex representation

Initial Comment:
As per request on c.l.p:

http://groups.google.de/group/comp.lang.python/browse_thread/thread/26c93fefefd3a100/bf1924ce28fac1ac?hl=de#bf1924ce28fac1ac

I've implemented a small patch to change the output of
repr(x) for complex variables, so that complex(repr(x))
works for any complex x. This changes the output of
repr(x) to

'<r>+<i>j'

without brackets, but leaves the string output
untouched. This change of behaviour would be in line
with int(repr(x)) and float(repr(x)) being defined for
any int or float x, repectively.

I don't know whether this patch is sensible, and
whether it breaks any current code, because (for example)

eval("5*%r" % (1+2j,))

won't work properly anymore, or whether it'd be more
sensible to change the complex constructor to also
accept a bracketed expression. I'll attach a patch to
do the latter later.

----------------------------------------------------------------------

>Comment By: Terry J. Reedy (tjreedy)
Date: 2006-05-20 01:24

Message:
Logged In: YES 
user_id=593130

(The current behavior is not a bug, nor is this patch 
submission a bug report, so let us omit that word.)
I think your example suggests why complexes are printed in 
parens, so I think enhancing complex() to accept such is 
the better approach if any change is to be made.

----------------------------------------------------------------------

Comment By: Heiko Wundram (hwundram)
Date: 2006-05-19 18:28

Message:
Logged In: YES 
user_id=791932

The second patch (python-complex-constructor.diff) changes
the constructor to accept bracketed complex numbers which
are enclosed in a single bracket. I'd rather say this is the
better appropach to have complex(repr(x)) work, but I leave
both patches attached to this bug.

The latter patch also creates test cases testing for
formatting errors with bracketed expressions.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1491866&group_id=5470

From noreply at sourceforge.net  Sat May 20 13:18:03 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Sat, 20 May 2006 04:18:03 -0700
Subject: [Patches] [ python-Patches-1491804 ] Simple slice support for
	list.sort() and .reverse()
Message-ID: <E1FhPT9-0007mC-25@sc8-sf-web3.sourceforge.net>

Patches item #1491804, was opened at 2006-05-19 21:19
Message generated for change (Comment added) made by hwundram
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1491804&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Heiko Wundram (hwundram)
Assigned to: Nobody/Anonymous (nobody)
Summary: Simple slice support for list.sort() and .reverse()

Initial Comment:
As requested per

http://groups.google.de/group/comp.lang.python/browse_thread/thread/6feadf8170900e53/aa621eed0fe14050?hl=de#aa621eed0fe14050

list.sort() should support extra keyword arguments
start and stop, which specify a slice of the whole list
to sort inplace.

The attached patch implements this functionality, and
extends the sorted() builtin to also offer these
keyword arguments, and additionally implements slice
support (also with start, stop) for list.reverse().

The patch updates the list object methods and the
sorted builtin, and also updates the testsuite to check
for the new keyword arguments and updates the
documentation to list them.

----------------------------------------------------------------------

>Comment By: Heiko Wundram (hwundram)
Date: 2006-05-20 13:18

Message:
Logged In: YES 
user_id=791932

The attached patch implements all of the old patch, and adds
the specified logic for array.reverse(), updates the
documentation for the array module and whatsnew25, and will
speed up the methods a slight little bit in the absense of
optimization when compiling Python.

----------------------------------------------------------------------

Comment By: Terry J. Reedy (tjreedy)
Date: 2006-05-20 07:14

Message:
Logged In: YES 
user_id=593130

Having thought about submitting an RFE for start/stop for 
reverse, I support the enhancement.  Please do the same 
for array.reverse(, so list and array.reverse continue to 
have the same signature.  Two uses: one way to swap 
partitions in place is reverse each partition and then the 
whole sequence; probably more useful is the slice reversal 
in one standard method for sequentially generating 
permutations in lexicographical order.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1491804&group_id=5470

From noreply at sourceforge.net  Sat May 20 13:20:57 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Sat, 20 May 2006 04:20:57 -0700
Subject: [Patches] [ python-Patches-1491866 ] Complex representation
Message-ID: <E1FhPVx-0003h7-MD@sc8-sf-web1.sourceforge.net>

Patches item #1491866, was opened at 2006-05-19 23:27
Message generated for change (Comment added) made by hwundram
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1491866&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Heiko Wundram (hwundram)
Assigned to: Nobody/Anonymous (nobody)
Summary: Complex representation

Initial Comment:
As per request on c.l.p:

http://groups.google.de/group/comp.lang.python/browse_thread/thread/26c93fefefd3a100/bf1924ce28fac1ac?hl=de#bf1924ce28fac1ac

I've implemented a small patch to change the output of
repr(x) for complex variables, so that complex(repr(x))
works for any complex x. This changes the output of
repr(x) to

'<r>+<i>j'

without brackets, but leaves the string output
untouched. This change of behaviour would be in line
with int(repr(x)) and float(repr(x)) being defined for
any int or float x, repectively.

I don't know whether this patch is sensible, and
whether it breaks any current code, because (for example)

eval("5*%r" % (1+2j,))

won't work properly anymore, or whether it'd be more
sensible to change the complex constructor to also
accept a bracketed expression. I'll attach a patch to
do the latter later.

----------------------------------------------------------------------

>Comment By: Heiko Wundram (hwundram)
Date: 2006-05-20 13:20

Message:
Logged In: YES 
user_id=791932

The attached patch is a revised version of the patch to the
complex constructor to accept bracketed string expressions,
which also adds documentation changes (whatsnew25).

Anyway, I personally also find this to be the "better" way,
so I've removed the repr-changing patch. And, calling it a
bug was by accident: I should've rather called it tracker
item, which is pretty synonymous for me.

----------------------------------------------------------------------

Comment By: Terry J. Reedy (tjreedy)
Date: 2006-05-20 07:24

Message:
Logged In: YES 
user_id=593130

(The current behavior is not a bug, nor is this patch 
submission a bug report, so let us omit that word.)
I think your example suggests why complexes are printed in 
parens, so I think enhancing complex() to accept such is 
the better approach if any change is to be made.

----------------------------------------------------------------------

Comment By: Heiko Wundram (hwundram)
Date: 2006-05-20 00:28

Message:
Logged In: YES 
user_id=791932

The second patch (python-complex-constructor.diff) changes
the constructor to accept bracketed complex numbers which
are enclosed in a single bracket. I'd rather say this is the
better appropach to have complex(repr(x)) work, but I leave
both patches attached to this bug.

The latter patch also creates test cases testing for
formatting errors with bracketed expressions.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1491866&group_id=5470

From noreply at sourceforge.net  Sat May 20 18:15:57 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Sat, 20 May 2006 09:15:57 -0700
Subject: [Patches] [ python-Patches-1491804 ] Simple slice support for
	list.sort() and .reverse()
Message-ID: <E1FhU7R-0002Os-GE@sc8-sf-web5.sourceforge.net>

Patches item #1491804, was opened at 2006-05-19 21:19
Message generated for change (Comment added) made by hwundram
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1491804&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Heiko Wundram (hwundram)
Assigned to: Nobody/Anonymous (nobody)
Summary: Simple slice support for list.sort() and .reverse()

Initial Comment:
As requested per

http://groups.google.de/group/comp.lang.python/browse_thread/thread/6feadf8170900e53/aa621eed0fe14050?hl=de#aa621eed0fe14050

list.sort() should support extra keyword arguments
start and stop, which specify a slice of the whole list
to sort inplace.

The attached patch implements this functionality, and
extends the sorted() builtin to also offer these
keyword arguments, and additionally implements slice
support (also with start, stop) for list.reverse().

The patch updates the list object methods and the
sorted builtin, and also updates the testsuite to check
for the new keyword arguments and updates the
documentation to list them.

----------------------------------------------------------------------

>Comment By: Heiko Wundram (hwundram)
Date: 2006-05-20 18:15

Message:
Logged In: YES 
user_id=791932

By the way, I've just become aware of the fact that this
patch changes the semantics of list.sort() somewhat, because
of an optimization I did to the code. The DSU-function isn't
called anymore when there are less than two items to sort,
i.e. the list or the slice to sort is one item long. This means:

---
def test(k):
    k.append(4)
    return k[0]
x = [[1,2,3]]
x.sort(key=test)
print x
---

will now print [[1,2,3]] (with the patch applied), whereas
Python 2.4 would've printed [[1,2,3,4]], but not have called
timsort either.

I don't know whether this breaks anything (or the old
behaviour was sensible); at least it should have to be
documented. I'd like feedback before I start either taking
out the optimization or documenting this.

----------------------------------------------------------------------

Comment By: Heiko Wundram (hwundram)
Date: 2006-05-20 13:18

Message:
Logged In: YES 
user_id=791932

The attached patch implements all of the old patch, and adds
the specified logic for array.reverse(), updates the
documentation for the array module and whatsnew25, and will
speed up the methods a slight little bit in the absense of
optimization when compiling Python.

----------------------------------------------------------------------

Comment By: Terry J. Reedy (tjreedy)
Date: 2006-05-20 07:14

Message:
Logged In: YES 
user_id=593130

Having thought about submitting an RFE for start/stop for 
reverse, I support the enhancement.  Please do the same 
for array.reverse(, so list and array.reverse continue to 
have the same signature.  Two uses: one way to swap 
partitions in place is reverse each partition and then the 
whole sequence; probably more useful is the slice reversal 
in one standard method for sequentially generating 
permutations in lexicographical order.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1491804&group_id=5470

From noreply at sourceforge.net  Sat May 20 19:03:51 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Sat, 20 May 2006 10:03:51 -0700
Subject: [Patches] [ python-Patches-1492147 ] Minor Correction to urllib2
	HOWTO
Message-ID: <E1FhUrn-00078j-2H@sc8-sf-web5.sourceforge.net>

Patches item #1492147, was opened at 2006-05-20 17:03
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1492147&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Documentation
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Mike Foord (mjfoord)
Assigned to: Nobody/Anonymous (nobody)
Summary: Minor Correction to urllib2 HOWTO

Initial Comment:
Minor patch - fixes a broken link.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1492147&group_id=5470

From noreply at sourceforge.net  Sat May 20 20:09:25 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Sat, 20 May 2006 11:09:25 -0700
Subject: [Patches] [ python-Patches-1492147 ] Minor Correction to urllib2
	HOWTO
Message-ID: <E1FhVtF-0002cN-Qg@sc8-sf-web5.sourceforge.net>

Patches item #1492147, was opened at 2006-05-21 02:03
Message generated for change (Comment added) made by quiver
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1492147&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Documentation
Group: Python 2.5
>Status: Closed
Resolution: None
Priority: 5
Submitted By: Mike Foord (mjfoord)
Assigned to: Nobody/Anonymous (nobody)
Summary: Minor Correction to urllib2 HOWTO

Initial Comment:
Minor patch - fixes a broken link.

----------------------------------------------------------------------

>Comment By: George Yoshida (quiver)
Date: 2006-05-21 03:09

Message:
Logged In: YES 
user_id=671362

Committed in r46058.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1492147&group_id=5470

From noreply at sourceforge.net  Sat May 20 22:43:10 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Sat, 20 May 2006 13:43:10 -0700
Subject: [Patches] [ python-Patches-1492218 ] None missing from keyword
	module
Message-ID: <E1FhYI2-0000Yr-Gp@sc8-sf-web3.sourceforge.net>

Patches item #1492218, was opened at 2006-05-20 22:43
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1492218&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: ?iga Seilnacht (zseil)
Assigned to: Nobody/Anonymous (nobody)
Summary: None missing from keyword module

Initial Comment:
None became a keyword in Python 2.4, but this is
not evident from the Python/gramminit.c file. As
a consequence, None is not included in the
keyword module when you regenerate it.

This patch also includes documentation fixes (None
was missing from keywords section in reference manual)
and fixes for syntax highliting for Idle and Vim.
python-mode.el already treats None, True and False
differently, so I didn't try to change it.


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1492218&group_id=5470

From noreply at sourceforge.net  Sat May 20 23:39:03 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Sat, 20 May 2006 14:39:03 -0700
Subject: [Patches] [ python-Patches-1492240 ] Socket-object convenience
	function: getpeercred().
Message-ID: <E1FhZA7-0004kf-TH@sc8-sf-web4-b.sourceforge.net>

Patches item #1492240, was opened at 2006-05-20 23:39
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1492240&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Heiko Wundram (hwundram)
Assigned to: Nobody/Anonymous (nobody)
Summary: Socket-object convenience function: getpeercred().

Initial Comment:
The attached patch implements a convenience function
called getpeercred() which internally calls
getsockopt(SO_PEERCRED) to retrieve the credentials
(pid, uid and gid) of the remote process a socket is
attached to, in case the remote end is local.

This currently (AFAIK) only works (properly) on Linux
2.4+, but might work on BSD-style systems too.

The returned data is wrapped in a new ucred type, which
is subclassable to implement additional convenience
functions in Lib/socket.py.

The patch updates the socket module, the test suite,
the documentation (including whatsnew), and adds a
configure check for the definition of struct ucred in
sys/socket.h, which is the default place for struct
ucred if it is available.

If struct ucred is not available on the current system,
getpeercred() is made a dummy method, which returns a
python-defined ucred type which contains pid=0,
uid=gid=-1, which are the default values returned under
Linux when the call fails because there is no
credentials data associated with the socket.

The decision to move the data to a separate type was
made with respect to the ability to use struct ucred
under other systems in a SCM_CREDENTIALS sendmsg()
call. I'll post the implementation of sendmsg() and
recvmsg() as a separate tracker item, but the latter
patch will rely on the inclusion of this patch.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1492240&group_id=5470

From noreply at sourceforge.net  Sun May 21 00:20:46 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Sat, 20 May 2006 15:20:46 -0700
Subject: [Patches] [ python-Patches-1492255 ] urllib2 HOWTO - Further
	(minor) Corrections
Message-ID: <E1FhZoU-0000hW-72@sc8-sf-web4-b.sourceforge.net>

Patches item #1492255, was opened at 2006-05-20 22:20
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1492255&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Documentation
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Mike Foord (mjfoord)
Assigned to: Nobody/Anonymous (nobody)
Summary: urllib2 HOWTO - Further (minor) Corrections

Initial Comment:
Further minor changes to urllib2 HOWTO. (Hopefully the
last, oops...)

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1492255&group_id=5470

From noreply at sourceforge.net  Sun May 21 06:43:51 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Sat, 20 May 2006 21:43:51 -0700
Subject: [Patches] [ python-Patches-1492255 ] urllib2 HOWTO - Further
	(minor) Corrections
Message-ID: <E1FhfnD-0000j1-J9@sc8-sf-web2.sourceforge.net>

Patches item #1492255, was opened at 2006-05-21 07:20
Message generated for change (Comment added) made by quiver
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1492255&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Documentation
Group: Python 2.5
>Status: Closed
Resolution: None
Priority: 5
Submitted By: Mike Foord (mjfoord)
Assigned to: Nobody/Anonymous (nobody)
Summary: urllib2 HOWTO - Further (minor) Corrections

Initial Comment:
Further minor changes to urllib2 HOWTO. (Hopefully the
last, oops...)

----------------------------------------------------------------------

>Comment By: George Yoshida (quiver)
Date: 2006-05-21 13:43

Message:
Logged In: YES 
user_id=671362

Committed in 46062.


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1492255&group_id=5470

From noreply at sourceforge.net  Sun May 21 09:15:22 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Sun, 21 May 2006 00:15:22 -0700
Subject: [Patches] [ python-Patches-1492356 ] Windows CE support (part 1)
Message-ID: <E1Fhi9q-0002qa-0g@sc8-sf-web2.sourceforge.net>

Patches item #1492356, was opened at 2006-05-21 15:15
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1492356&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Luke Dunstan (infidel)
Assigned to: Nobody/Anonymous (nobody)
Summary: Windows CE support (part 1)

Initial Comment:
This patch contains part of the changes necessary to 
build Python trunk for Windows CE 4.x using the 
freely downloadable Microsoft eMbedded Visual C++ 
4.0. I will submit more patches later.

The changes are:

- Replace use of intptr_t with Py_intptr_t 
(Py_intptr_t already exists)

- Created a macro to support 64-bit integer literals 
using the I64 suffix

- Guard #include <errno.h> with #ifndef 
DONT_HAVE_ERRNO_H (this macro was already used in a 
few places)

- Guard #include <fcntl.h> with #ifdef HAVE_FCNTL_H 
(this macro was already in pyconfig.h)

- Various small changes to PC/pyconfig.h, mostly to 
cater for header files that are not available for 
Windows CE

I have tested that this doesn't break anything by 
building the patched Python on Linux and running the 
test suite.


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1492356&group_id=5470

From noreply at sourceforge.net  Sun May 21 17:06:46 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Sun, 21 May 2006 08:06:46 -0700
Subject: [Patches] [ python-Patches-1492509 ] Unification of list-comp and
	for syntax
Message-ID: <E1FhpW2-0003py-VN@sc8-sf-web3.sourceforge.net>

Patches item #1492509, was opened at 2006-05-21 17:06
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1492509&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Heiko Wundram (hwundram)
Assigned to: Nobody/Anonymous (nobody)
Summary: Unification of list-comp and for syntax

Initial Comment:
The following patch adds the ability for:

for <expr> in <expr> if <expr>:
    <do something>

to the Python core. This unifies the syntax of
list/generator comprehensions and the for statement
somewhat, because both now accept conditions which
produce an immediate continue.

I've posted a PEP to python-dev, which details the
changes this patch makes (which are all
backwards-compatible).

The patch doesn't try to address more than the actual
code required to make this feature work yet (except for
changes to Modules/parsermodule.c and Doc/ref/ref7.tex,
which details the for statement). If there's consensus
on this feature, I'll gladly produce more documentation.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1492509&group_id=5470

From online.dept at bancorpsouthonline.com  Mon May 22 01:10:29 2006
From: online.dept at bancorpsouthonline.com (BancorpSouth)
Date: 21 May 2006 23:10:29 -0000
Subject: [Patches] BancorpSouth Online Department Notice
Message-ID: <20060521231029.38291.qmail@be25.masterhost.ru>

An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/patches/attachments/20060521/d4cbef14/attachment.htm 

From noreply at sourceforge.net  Mon May 22 05:08:36 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Sun, 21 May 2006 20:08:36 -0700
Subject: [Patches] [ python-Patches-1492704 ] distinct error type from
	shutil.move()
Message-ID: <E1Fi0ma-0007tB-Sd@sc8-sf-web1.sourceforge.net>

Patches item #1492704, was opened at 2006-05-22 03:08
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1492704&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Zooko O'Whielacronx (zooko)
Assigned to: Nobody/Anonymous (nobody)
Summary: distinct error type from shutil.move()

Initial Comment:
I need to call shutil.move() and be able to tell the
difference between an error such as access denied and
an error due to the two arguments being the same file.


--- old-dw/src/Lib/shutil.py    2006-05-22
00:06:02.000000000 -0300
+++ new-dw/src/Lib/shutil.py    2006-05-22
00:06:02.000000000 -0300
@@ -16,6 +16,9 @@
 class Error(exceptions.EnvironmentError):
     pass

+class SameFileError(Error):
+    pass
+
 def copyfileobj(fsrc, fdst, length=16*1024):
     """copy data from file-like object fsrc to
file-like object fdst"""
     while 1:
@@ -39,7 +42,7 @@
 def copyfile(src, dst):
     """Copy data from src to dst"""
     if _samefile(src, dst):
-        raise Error, "`%s` and `%s` are the same file"
% (src, dst)
+        raise SameFileError, "`%s` and `%s` are the
same file" % (src, dst)

     fsrc = None
     fdst = None


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1492704&group_id=5470

From noreply at sourceforge.net  Mon May 22 10:52:01 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 22 May 2006 01:52:01 -0700
Subject: [Patches] [ python-Patches-731328 ] AssertionError when building
	rpm under RedHat 9.1
Message-ID: <E1Fi68v-0003ez-KI@sc8-sf-web1.sourceforge.net>

Patches item #731328, was opened at 2003-05-02 12:56
Message generated for change (Comment added) made by jafo
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=731328&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Distutils and setup.py
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Ricardo Niederberger Cabral (niederberger)
>Assigned to: Sean Reifschneider (jafo)
Summary: AssertionError when building rpm under RedHat 9.1

Initial Comment:
When trying to build an rpm on RH 9:

>From distutils __version__ = &quot;1.0.3&quot;:

  File &quot;distutils/command/bdist_rpm.py&quot;, line 316, in run
    assert len(rpms) == 1, \
AssertionError: unexpected number of RPM files found: ['build/bdist.
linux-i686/rpm/RPMS/i386/imgSeek-0.7-1.i386.rpm', 'build/bdist.
linux-i686/rpm/RPMS/i386/imgSeek-debuginfo-0.7-1.i386.rpm']

I had to remove the assert statement on bdist_rpm.py:316 in order to 
build my rpm since rpmbuild from RH always seems to generate this 
extra  -debuginfo rpm.

So attached is a patch (cvs rev 1.37) to simply copy all generated 
RPM's to the dist/ directory.

----------------------------------------------------------------------

>Comment By: Sean Reifschneider (jafo)
Date: 2006-05-22 08:52

Message:
Logged In: YES 
user_id=81797

Is 1.0.3 from the Red Hat RPMs, the python.org RPMs (which
version), or directly downloaded from the distutils?  If
this is an issue in Distutils, I'd be interested in still
fixing it.  However, there hasn't been any activity on this
in 3 years, so I'm planning on closing this unless I hear
something further over, say, the next few weeks.

Red Hat 9.1 is extremely legacy now, of course.  Not at all
supported for errata by either Red Hat or the other Legacy
sites, so I'm inclined to feel the same.

----------------------------------------------------------------------

Comment By: Georg Brandl (birkenfeld)
Date: 2005-03-03 14:36

Message:
Logged In: YES 
user_id=1188172

Does anyone still care about this issue?

Or, other way round, does anything speak against applying
and so copying all RPMs?

----------------------------------------------------------------------

Comment By: Ricardo Niederberger Cabral (niederberger)
Date: 2003-05-28 01:59

Message:
Logged In: YES 
user_id=354686

Sorry for not replying faster and providing more info. SF tracker didn't email 
me about your comment, and I don't have the RH system at hand right now.
Anyway, it generates:
foo-version.i386.rpm
foo-version.src.rpm
foo-debuginfo-version.i386.rpm
instead of only the binary and src rpm's I would get on Mandrake 9 for 
example, which is what bdist_rpm.py currently expects.

I don't know exactly what goes inside this debug-info rpm, but i guess it's 
probably the binary one compiled with debug symbols on. I can provide 
more info if necessary in a few days.

----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2003-05-03 11:01

Message:
Logged In: YES 
user_id=21627

Can you provide more information? What rpm gets generated,
and what files does it contain?

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=731328&group_id=5470

From noreply at sourceforge.net  Mon May 22 10:56:12 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 22 May 2006 01:56:12 -0700
Subject: [Patches] [ python-Patches-1490384 ] PC new-logo-based icon set
Message-ID: <E1Fi6Cy-0005Ot-Og@sc8-sf-web2.sourceforge.net>

Patches item #1490384, was opened at 2006-05-17 18:59
Message generated for change (Comment added) made by loewis
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1490384&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Windows
Group: Python 2.5
Status: Open
>Resolution: Accepted
Priority: 8
Submitted By: Andrew Clover (bobince)
Assigned to: Martin v. L??wis (loewis)
Summary: PC new-logo-based icon set

Initial Comment:
Following positive discussion on -dev, here's the
updated version of the PC/py*.ico files I hacked up a
while ago.

The attachment is a ZIP, not a patch, as it contains
only binaries. Also available as tgz:

  http://doxdesk.com/img/software/py/win32-icons.tar.gz

Also possibly of interest:

  http://doxdesk.com/img/software/py/icons3.zip

This attachment contains only the simple replacement
files; the icons3 ZIP also contains:

  - source
  - versions including Windows Vista large icons
    (probably not worth including at this point as they're
    quite sizable and no-one is using Vista yet)
  - an egg icon
    (there is currently no installer/shell support for
eggs,
    but could be worth adding in future)
  - a new installer side banner
    (this has not currently seen any discussion on -dev,
    but may be worth considering if the intention is to
    leave behind the purple/green snake branding)


----------------------------------------------------------------------

>Comment By: Martin v. L??wis (loewis)
Date: 2006-05-22 10:56

Message:
Logged In: YES 
user_id=21627

Thanks for the patch. I have committed it as r46063. I put a
demo installer containing them at

http://www.dcl.hpi.uni-potsdam.de/home/loewis/python-2.5.13290.msi

I would also like to add the source files, but I have
difficulties figuring out what they are. There is a source
directory; with:

- baselogo.svg; I assume this is a source file
- icons.svgz; can't figure out what this is
- source.xar; not sure either
- a directory called png, with many png file - I expect
  that these aren't source files, are they?

----------------------------------------------------------------------

Comment By: Andrew Clover (bobince)
Date: 2006-05-19 16:21

Message:
Logged In: YES 
user_id=311085

Sure, no worries. I'll fax over the -python version since I
have ancient contributions to cover too.


----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2006-05-19 13:22

Message:
Logged In: YES 
user_id=21627

Thanks! Are you willing to contribute them to the PSF, under
the terms of the contributor agreement at

http://www.python.org/psf/contrib/contrib-form/

?

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1490384&group_id=5470

From noreply at sourceforge.net  Mon May 22 11:17:40 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 22 May 2006 02:17:40 -0700
Subject: [Patches] [ python-Patches-1492356 ] Windows CE support (part 1)
Message-ID: <E1Fi6Xk-0003l0-2r@sc8-sf-web1.sourceforge.net>

Patches item #1492356, was opened at 2006-05-21 09:15
Message generated for change (Comment added) made by loewis
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1492356&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.5
>Status: Closed
>Resolution: Accepted
Priority: 5
Submitted By: Luke Dunstan (infidel)
Assigned to: Nobody/Anonymous (nobody)
Summary: Windows CE support (part 1)

Initial Comment:
This patch contains part of the changes necessary to 
build Python trunk for Windows CE 4.x using the 
freely downloadable Microsoft eMbedded Visual C++ 
4.0. I will submit more patches later.

The changes are:

- Replace use of intptr_t with Py_intptr_t 
(Py_intptr_t already exists)

- Created a macro to support 64-bit integer literals 
using the I64 suffix

- Guard #include <errno.h> with #ifndef 
DONT_HAVE_ERRNO_H (this macro was already used in a 
few places)

- Guard #include <fcntl.h> with #ifdef HAVE_FCNTL_H 
(this macro was already in pyconfig.h)

- Various small changes to PC/pyconfig.h, mostly to 
cater for header files that are not available for 
Windows CE

I have tested that this doesn't break anything by 
building the patched Python on Linux and running the 
test suite.


----------------------------------------------------------------------

>Comment By: Martin v. L??wis (loewis)
Date: 2006-05-22 11:17

Message:
Logged In: YES 
user_id=21627

Thanks for the patch, committed as r46064.

Notice that your changes to the dsw/dsp files are not
included, as these files are marked as binary.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1492356&group_id=5470

From noreply at sourceforge.net  Mon May 22 11:26:34 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 22 May 2006 02:26:34 -0700
Subject: [Patches] [ python-Patches-1492218 ] None missing from keyword
	module
Message-ID: <E1Fi6gM-0006nY-R1@sc8-sf-web1.sourceforge.net>

Patches item #1492218, was opened at 2006-05-20 22:43
Message generated for change (Comment added) made by loewis
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1492218&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: ?iga Seilnacht (zseil)
Assigned to: Nobody/Anonymous (nobody)
Summary: None missing from keyword module

Initial Comment:
None became a keyword in Python 2.4, but this is
not evident from the Python/gramminit.c file. As
a consequence, None is not included in the
keyword module when you regenerate it.

This patch also includes documentation fixes (None
was missing from keywords section in reference manual)
and fixes for syntax highliting for Idle and Vim.
python-mode.el already treats None, True and False
differently, so I didn't try to change it.


----------------------------------------------------------------------

>Comment By: Martin v. L??wis (loewis)
Date: 2006-05-22 11:26

Message:
Logged In: YES 
user_id=21627

None is not a keyword. Watch this:

>>> def None():pass
SyntaxError: assignment to None
>>> def while():pass
SyntaxError: invalid syntax
>>> 

None remains an identifier, but assignments to None are not
allowed.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1492218&group_id=5470

From noreply at sourceforge.net  Mon May 22 11:37:47 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 22 May 2006 02:37:47 -0700
Subject: [Patches] [ python-Patches-1481304 ] Cleaned up 16x16px icons for
	windows.
Message-ID: <E1Fi6rD-0002Ar-NT@sc8-sf-web2.sourceforge.net>

Patches item #1481304, was opened at 2006-05-03 20:46
Message generated for change (Comment added) made by loewis
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1481304&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Windows
Group: None
>Status: Closed
>Resolution: Rejected
Priority: 5
Submitted By: goxe (goxe)
Assigned to: Nobody/Anonymous (nobody)
Summary: Cleaned up 16x16px icons for windows.

Initial Comment:

Since the currently distributed icon files only 
include 32x32px images, Windows resizes them where 
16x16px is needed. With the predictable result that 
they look blurred and dark.

The attached icons include 16x16px versions of the 
current icons. It's the same friendly-snake-icon as 
always, just prettier in small sizes.


----------------------------------------------------------------------

>Comment By: Martin v. L??wis (loewis)
Date: 2006-05-22 11:37

Message:
Logged In: YES 
user_id=21627

I just committed #1490384, so this is obsolete now. Thanks
for the work, anyway.

----------------------------------------------------------------------

Comment By: Skip Montanaro (montanaro)
Date: 2006-05-13 21:40

Message:
Logged In: YES 
user_id=44345

I agree with Josiah.  I'd like the various icons to be the same across platforms.

----------------------------------------------------------------------

Comment By: Josiah Carlson (josiahcarlson)
Date: 2006-05-13 20:27

Message:
Logged In: YES 
user_id=341410

They are lighter in color, though I would prefer if Python
on Windows used the smallest versions of the Mac icons
(preview available here:
http://www.doxdesk.com/img/software/py/icons.png ).

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1481304&group_id=5470

From noreply at sourceforge.net  Mon May 22 12:15:51 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 22 May 2006 03:15:51 -0700
Subject: [Patches] [ python-Patches-1492828 ] Improvements to ceval.c
Message-ID: <E1Fi7S3-0006vd-5k@sc8-sf-web1.sourceforge.net>

Patches item #1492828, was opened at 2006-05-22 03:15
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1492828&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: mrjbq7 (mrjbq7)
Assigned to: Nobody/Anonymous (nobody)
Summary: Improvements to ceval.c

Initial Comment:
>From Raymond Hettinger, submitting here to keep track of for 
NeedForSpeed sprint.

Here are some customizations to your Python build:
 
First, make sure that WITH_TSC and WITH_THREAD are not defined in the 
build.

Then, attached diff to disable the tracing code, remove NOPs, speed-up 
absolute jumps, and increase the signal check interval.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1492828&group_id=5470

From noreply at sourceforge.net  Mon May 22 12:16:12 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 22 May 2006 03:16:12 -0700
Subject: [Patches] [ python-Patches-1005461 ] property to get the docstring
	from fget
Message-ID: <E1Fi7SO-00072T-S4@sc8-sf-web1.sourceforge.net>

Patches item #1005461, was opened at 2004-08-08 11:08
Message generated for change (Comment added) made by gbrandl
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1005461&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Modules
Group: None
>Status: Closed
>Resolution: Out of Date
Priority: 5
Submitted By: Dima Dorfman (ddorfman)
Assigned to: Nobody/Anonymous (nobody)
Summary: property to get the docstring from fget

Initial Comment:
Allow property objects to use the fget function's docstring if a 
property docstring wasn't specified. This was suggested by 
someone on python-dev or c.l.py, but I don't remember who it was 
(sorry). Function docstrings are easier to write (read: prettier) than 
explicit doc= arguments, so this should make more properties 
have docstrings, sometimes automagically (I know it will for my 
code, anyway). It gets even better with syntatic sugar for 
decorators; read-only properties can now be defined, including 
docstring, with just one function:

  @property
    def golden(self):
        """The Golden Ratio. (Don't ask why this is a property.)"""
        return (1 + math.sqrt(5)) / 2

The doc part of the patch also improves some markup in the 
vicinity.

----------------------------------------------------------------------

>Comment By: Georg Brandl (gbrandl)
Date: 2006-05-22 10:16

Message:
Logged In: YES 
user_id=849994

This was addressed by patch 1434038.

----------------------------------------------------------------------

Comment By: Dima Dorfman (ddorfman)
Date: 2004-08-09 02:06

Message:
Logged In: YES 
user_id=908995

Oops, now that I'm sufficiently awake, it's pretty obvious that the patch 
is broken--it leaks a reference every time the new code runs. Mea culpa! 
Corrected patch attached.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1005461&group_id=5470

From noreply at sourceforge.net  Mon May 22 13:00:34 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 22 May 2006 04:00:34 -0700
Subject: [Patches] [ python-Patches-1492828 ] Improvements to ceval.c
Message-ID: <E1Fi89K-0005QW-Gr@sc8-sf-web2.sourceforge.net>

Patches item #1492828, was opened at 2006-05-22 03:15
Message generated for change (Comment added) made by mrjbq7
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1492828&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: mrjbq7 (mrjbq7)
Assigned to: Nobody/Anonymous (nobody)
Summary: Improvements to ceval.c

Initial Comment:
>From Raymond Hettinger, submitting here to keep track of for 
NeedForSpeed sprint.

Here are some customizations to your Python build:
 
First, make sure that WITH_TSC and WITH_THREAD are not defined in the 
build.

Then, attached diff to disable the tracing code, remove NOPs, speed-up 
absolute jumps, and increase the signal check interval.

----------------------------------------------------------------------

>Comment By: mrjbq7 (mrjbq7)
Date: 2006-05-22 04:00

Message:
Logged In: YES 
user_id=1172546

Okay, now I checked the box "upload and attach file".  Thats a terrible UI.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1492828&group_id=5470

From noreply at sourceforge.net  Mon May 22 13:04:23 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 22 May 2006 04:04:23 -0700
Subject: [Patches] [ python-Patches-1492218 ] None missing from keyword
	module
Message-ID: <E1Fi8D1-00074k-3p@sc8-sf-web3.sourceforge.net>

Patches item #1492218, was opened at 2006-05-20 22:43
Message generated for change (Comment added) made by zseil
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1492218&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: ?iga Seilnacht (zseil)
Assigned to: Nobody/Anonymous (nobody)
Summary: None missing from keyword module

Initial Comment:
None became a keyword in Python 2.4, but this is
not evident from the Python/gramminit.c file. As
a consequence, None is not included in the
keyword module when you regenerate it.

This patch also includes documentation fixes (None
was missing from keywords section in reference manual)
and fixes for syntax highliting for Idle and Vim.
python-mode.el already treats None, True and False
differently, so I didn't try to change it.


----------------------------------------------------------------------

>Comment By: ?iga Seilnacht (zseil)
Date: 2006-05-22 13:04

Message:
Logged In: YES 
user_id=1326842

I realise that None is a constant, not a keyword.
Could at least the documentation be changed?
Currently the reference manual says:

"The following identifiers are used as reserved words, or
keywords of the language, and cannot be used as ordinary
identifiers."

A list that doesn't include None follows, but as your
example shows, None also can't be used as an ordinary
identifier.
Later on that page:

"In some future version of Python, the identifier None
will become a keyword."

See:
http://docs.python.org/dev/ref/keywords.html

----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2006-05-22 11:26

Message:
Logged In: YES 
user_id=21627

None is not a keyword. Watch this:

>>> def None():pass
SyntaxError: assignment to None
>>> def while():pass
SyntaxError: invalid syntax
>>> 

None remains an identifier, but assignments to None are not
allowed.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1492218&group_id=5470

From noreply at sourceforge.net  Mon May 22 14:02:52 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 22 May 2006 05:02:52 -0700
Subject: [Patches] [ python-Patches-1479611 ] speed up function calls
Message-ID: <E1Fi97c-00077w-UL@sc8-sf-web3.sourceforge.net>

Patches item #1479611, was opened at 2006-05-01 02:58
Message generated for change (Comment added) made by etrepum
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1479611&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Neal Norwitz (nnorwitz)
Assigned to: Nobody/Anonymous (nobody)
Summary: speed up function calls

Initial Comment:
Results:  2.86% for 1 arg (len), 11.8% for 2 args
(min), and 1.6% for pybench.

trunk-speed$ ./python.exe -m timeit 'for x in
xrange(10000): len([])'
100 loops, best of 3: 4.74 msec per loop
trunk-speed$ ./python.exe -m timeit 'for x in
xrange(10000): min(1,2)'
100 loops, best of 3: 8.03 msec per loop

trunk-clean$ ./python.exe -m timeit 'for x in
xrange(10000): len([])'
100 loops, best of 3: 4.88 msec per loop
trunk-clean$ ./python.exe -m timeit 'for x in
xrange(10000): min(1,2)'
100 loops, best of 3: 9.09 msec per loop

pybench goes from 5688.00 down to 5598.00


Details about the patch:

There are 2 unrelated changes.  They both seem to
provide equal benefits for calling varargs C.  One is
very simple and just inlines calling a varargs C
function rather than calling PyCFunction_Call() which
does extra checks that are already known.  This moves
meth and self up one block. and breaks the C_TRACE into
2.  (When looking at the patch, this will make sense I
hope.)

The other change is more dangerous.  It modifies
load_args() to hold on to tuples so they aren't
allocated and deallocated.  The initialization is done
one time in the new func _PyEval_Init().

It allocates 64 tuples of size 8 that are never
deallocated.  The idea is that there won't be usually
be more than 64 frames with 8 or less parameters active
on the stack at any one time (stack depth).  There are
cases where this can degenerate, but for the most part,
it should only be marginally slower, but generally this
should be a fair amount faster by skipping the alloc
and dealloc and some extra work.  My decrementing the
_last_index inside the needs_free blocks, that could
improve behaviour.

This really needs comments added to the code.  But I'm
not gonna get there tonight.  I'd be interested in
comments about the code.

----------------------------------------------------------------------

>Comment By: Bob Ippolito (etrepum)
Date: 2006-05-22 08:02

Message:
Logged In: YES 
user_id=139309

The performance gain for this patch (as-is) on Mac OS X i386 with a release 
build seems totally negligible. I'm not getting any consistent win with any of the 
timeit or pybench benchmarks. 

----------------------------------------------------------------------

Comment By: Neal Norwitz (nnorwitz)
Date: 2006-05-11 03:43

Message:
Logged In: YES 
user_id=33168

This version actually works (in both normal and debug
builds).  It adds some stats which are useful and updates
Misc/SpecialBuilds.txt.

I modified to not preallocate and only hold a ref when the
function didn't keep a ref.

I still need to inline more of PyCFunction_Call.  Speed is
still the same as before.

I'm not sure if I'll finish this before the sprint next
week.  Anyone there feel free to check this in if you finish it.

----------------------------------------------------------------------

Comment By: Neal Norwitz (nnorwitz)
Date: 2006-05-05 04:27

Message:
Logged In: YES 
user_id=33168

v2 attached.  You might not want to review yet.  I mostly
did the first part of your suggest (stats, _Fini, and
stack-like if I understood you correctly).  I didn't do
anything on the second part about inlinting Function_Call.

perf seems to be about the same.  I'm not entirely sure the
patch is correct yet. I found one or two problems in the
original.  I added some more comments. 

----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2006-05-01 04:27

Message:
Logged In: YES 
user_id=21627

The tuples should get deallocated when Py_Finalize is called.

It would be good if there was (conditional) statistical
analysis, showing how often no tuple was found because the
number of arguments was too large, and how often no tuple
was found because the candidate was in use.

I think it should be more stack-like, starting off with no
tuples allocated, then returning them inside the needs_free
blocks only if the refcount is 1 (or 2?). This would avoid
degeneralized cases where some function holds onto its
argument tuple indefinitely, thus consuming all 64 tuples.

For the other part, I think it would make the code more
readable if it inlined PyCFunction_Call even more: the test
for NOARGS|O could be integrated into the switch statement
(one case for each), VARARGS and VARARGS|KEYWORDS would both
load the arguments, then call the function directly
(possibly with NULL keywords). OLDARGS should goto either
METH_NOARGS, METH_O, or METH_VARARGS depending on na (if you
don't like goto, modifying flags would work as well).

----------------------------------------------------------------------

Comment By: Neal Norwitz (nnorwitz)
Date: 2006-05-01 03:08

Message:
Logged In: YES 
user_id=33168

I should note the numbers 64 and 8 are total guesses.  It
might be good to try and determine values based on empirical
data.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1479611&group_id=5470

From noreply at sourceforge.net  Mon May 22 15:13:17 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 22 May 2006 06:13:17 -0700
Subject: [Patches] [ python-Patches-1492218 ] None missing from keyword
	module
Message-ID: <E1FiADl-0001ur-FP@sc8-sf-web4-b.sourceforge.net>

Patches item #1492218, was opened at 2006-05-20 22:43
Message generated for change (Comment added) made by zseil
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1492218&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
>Category: Documentation
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: ?iga Seilnacht (zseil)
Assigned to: Nobody/Anonymous (nobody)
Summary: None missing from keyword module

Initial Comment:
None became a keyword in Python 2.4, but this is
not evident from the Python/gramminit.c file. As
a consequence, None is not included in the
keyword module when you regenerate it.

This patch also includes documentation fixes (None
was missing from keywords section in reference manual)
and fixes for syntax highliting for Idle and Vim.
python-mode.el already treats None, True and False
differently, so I didn't try to change it.


----------------------------------------------------------------------

>Comment By: ?iga Seilnacht (zseil)
Date: 2006-05-22 15:13

Message:
Logged In: YES 
user_id=1326842

Attaching a new set of patches. Since they only affect
the documentation, I also changed the category. The
patch against the trunk also includes a note that
using "as" and "with" as identifiers will issue a
warning.

----------------------------------------------------------------------

Comment By: ?iga Seilnacht (zseil)
Date: 2006-05-22 13:04

Message:
Logged In: YES 
user_id=1326842

I realise that None is a constant, not a keyword.
Could at least the documentation be changed?
Currently the reference manual says:

"The following identifiers are used as reserved words, or
keywords of the language, and cannot be used as ordinary
identifiers."

A list that doesn't include None follows, but as your
example shows, None also can't be used as an ordinary
identifier.
Later on that page:

"In some future version of Python, the identifier None
will become a keyword."

See:
http://docs.python.org/dev/ref/keywords.html

----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2006-05-22 11:26

Message:
Logged In: YES 
user_id=21627

None is not a keyword. Watch this:

>>> def None():pass
SyntaxError: assignment to None
>>> def while():pass
SyntaxError: invalid syntax
>>> 

None remains an identifier, but assignments to None are not
allowed.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1492218&group_id=5470

From noreply at sourceforge.net  Mon May 22 16:11:56 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 22 May 2006 07:11:56 -0700
Subject: [Patches] [ python-Patches-1281707 ] Speed up gzip.readline (~40%)
Message-ID: <E1FiB8W-0008Ni-ED@sc8-sf-web2.sourceforge.net>

Patches item #1281707, was opened at 2005-09-04 13:53
Message generated for change (Comment added) made by etrepum
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1281707&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: April King (marumari)
Assigned to: Nobody/Anonymous (nobody)
Summary: Speed up gzip.readline (~40%)

Initial Comment:
See bug 849046 for history.  This patch passes both the
regression test and the standard test.  Hopefully the
extra information below won't be too difficult to read.
 I can attach this info to the bug, if need be.

Fixed:
  - Add self.min_readsize to __init__.
    Follows the principal that lines are likely to be
the same length in size,
    and doesn't start over at a minimum length string
every call to readline()
  - Rewriting of assignment for readsize and size at
the beginning of function.
    Eliminates almost all calls to min()
  - Change bufs to a string, and not an array.  No
point in using an array when
    all you do with it is "".join(bufs).  Uses string
addition instead.
  - Remove extra assignments to bufs (in return())
  - Changes readline() to be much more readable (loop
reordering, more comments)

Recommendations:
  - Delete _unread() function.  It is used _only_ by
readline(), and moving its
    functionality into readline() itself saves the
function call overhead.
    _unread() is only 3 lines long.  Testing shows that
removing it speeds
    readline() up by about 3%.  Backwards compatibility
concerns?

Testing results:
test_append (__main__.TestGzip) ... ok
test_many_append (__main__.TestGzip) ... ok
test_mode (__main__.TestGzip) ... ok
test_read (__main__.TestGzip) ... ok
test_readline (__main__.TestGzip) ... ok
test_readlines (__main__.TestGzip) ... ok
test_seek_read (__main__.TestGzip) ... ok
test_seek_write (__main__.TestGzip) ... ok
test_write (__main__.TestGzip) ... ok

----------------------------------------------------------------------
Ran 9 tests in 0.331s

Regression tests:
python regrtest.py -g test_gzip.py
test_gzip
1 test OK.

---

Profiling Results (performed on a common compressed log
file - 200748 lines).

With patch...

         1213961 function calls in 12.188 CPU seconds

   Ordered by: standard name

   ncalls  tottime  percall  cumtime  percall
filename:lineno(function)
        1    0.000    0.000    0.000    0.000 :0(close)
     1159    0.020    0.000    0.020    0.000 :0(crc32)
     1158    0.100    0.000    0.100    0.000
:0(decompress)
        1    0.000    0.000    0.000    0.000
:0(decompressobj)
   200774    0.812    0.000    0.812    0.000 :0(find)
   403865    0.902    0.000    0.902    0.000 :0(len)
     1183    0.000    0.000    0.000    0.000 :0(min)
        2    0.000    0.000    0.000    0.000 :0(ord)
     1173    0.000    0.000    0.000    0.000 :0(read)
       12    0.000    0.000    0.000    0.000 :0(seek)
        1    0.000    0.000    0.000    0.000
:0(setprofile)
       18    0.000    0.000    0.000    0.000 :0(tell)
        2    0.000    0.000    0.000    0.000 :0(unpack)
        1    0.000    0.000   12.188   12.188 <string>:1(?)
        1    0.000    0.000    0.000    0.000
gzip_new.py:156(_init_read)
        1    0.000    0.000    0.000    0.000
gzip_new.py:160(_read_gzip_header)
        3    0.000    0.000    0.000    0.000
gzip_new.py:18(U32)
   200774    2.453    0.000    2.593    0.000
gzip_new.py:207(read)
   200749    2.894    0.000    3.796    0.000
gzip_new.py:239(_unread)
     1166    0.010    0.000    0.140    0.000
gzip_new.py:244(_read)
        1    0.000    0.000    0.000    0.000
gzip_new.py:27(LOWU32)
     1158    0.010    0.000    0.030    0.000
gzip_new.py:294(_add_read_data)
        1    0.000    0.000    0.000    0.000
gzip_new.py:300(_read_eof)
        1    0.000    0.000    0.000    0.000
gzip_new.py:314(close)
        1    0.000    0.000    0.000    0.000
gzip_new.py:327(__del__)
   200749    3.916    0.000   11.117    0.000
gzip_new.py:384(readline)
        2    0.000    0.000    0.000    0.000
gzip_new.py:39(read32)
        1    0.000    0.000    0.000    0.000
gzip_new.py:42(open)
        1    0.000    0.000    0.000    0.000
gzip_new.py:60(__init__)
        1    0.000    0.000   12.188   12.188
profile:0(gunzip_gzip_new_open())
        0    0.000             0.000         
profile:0(profiler)
        1    1.071    1.071   12.188   12.188
test_gzip_speed.py:14(gunzip_gzip_new_open)

Without patch...

         2073328 function calls in 18.597 CPU seconds

   Ordered by: standard name

   ncalls  tottime  percall  cumtime  percall
filename:lineno(function)
   243820    0.735    0.000    0.735    0.000 :0(append)
        1    0.000    0.000    0.000    0.000 :0(close)
     1159    0.040    0.000    0.040    0.000 :0(crc32)
     1158    0.100    0.000    0.100    0.000
:0(decompress)
        1    0.000    0.000    0.000    0.000
:0(decompressobj)
   243820    0.960    0.000    0.960    0.000 :0(find)
   200749    0.801    0.000    0.801    0.000 :0(join)
   489958    1.330    0.000    1.330    0.000 :0(len)
   243820    0.791    0.000    0.791    0.000 :0(min)
        2    0.000    0.000    0.000    0.000 :0(ord)
     1173    0.030    0.000    0.030    0.000 :0(read)
        6    0.000    0.000    0.000    0.000 :0(seek)
        1    0.000    0.000    0.000    0.000
:0(setprofile)
        6    0.000    0.000    0.000    0.000 :0(tell)
        2    0.000    0.000    0.000    0.000 :0(unpack)
        1    0.000    0.000   18.597   18.597 <string>:1(?)
        1    0.000    0.000    0.000    0.000
gzip.py:154(_init_read)
        1    0.000    0.000    0.000    0.000
gzip.py:158(_read_gzip_header)
        3    0.000    0.000    0.000    0.000
gzip.py:18(U32)
   243820    2.711    0.000    2.921    0.000
gzip.py:205(read)
   200749    3.083    0.000    4.143    0.000
gzip.py:237(_unread)
     1160    0.010    0.000    0.210    0.000
gzip.py:242(_read)
        1    0.000    0.000    0.000    0.000
gzip.py:27(LOWU32)
     1158    0.030    0.000    0.070    0.000
gzip.py:292(_add_read_data)
        1    0.000    0.000    0.000    0.000
gzip.py:298(_read_eof)
        1    0.000    0.000    0.000    0.000
gzip.py:312(close)
        1    0.000    0.000    0.000    0.000
gzip.py:325(__del__)
   200749    6.934    0.000   17.555    0.000
gzip.py:379(readline)
        2    0.000    0.000    0.000    0.000
gzip.py:39(read32)
        1    0.000    0.000    0.000    0.000
gzip.py:42(open)
        1    0.000    0.000    0.000    0.000
gzip.py:59(__init__)
        1    0.000    0.000   18.597   18.597
profile:0(gunzip_gzip_open())
        0    0.000             0.000         
profile:0(profiler)
        1    1.042    1.042   18.597   18.597
test_gzip_speed.py:7(gunzip_gzip_open)

Using popen + gunzip -c...

         200754 function calls in 4.338 CPU seconds

   Ordered by: standard name

   ncalls  tottime  percall  cumtime  percall
filename:lineno(function)
        1    0.000    0.000    0.000    0.000 :0(popen)
   200749    3.578    0.000    3.578    0.000 :0(readline)
        1    0.000    0.000    0.000    0.000
:0(setprofile)
        1    0.240    0.240    4.338    4.338 <string>:1(?)
        1    0.000    0.000    4.338    4.338
profile:0(gunzip_popen())
        0    0.000             0.000         
profile:0(profiler)
        1    0.520    0.520    4.098    4.098
test_gzip_speed.py:21(gunzip_popen)

----------------------------------------------------------------------

>Comment By: Bob Ippolito (etrepum)
Date: 2006-05-22 10:11

Message:
Logged In: YES 
user_id=139309

This patch is about a 30% win on Mac OS X i386 using this benchmark:
http://svn.python.org/view/sandbox/trunk/gzipbench/gzipbench.py

I'm going to look to see if there's any other low hanging fruit in there before I 
commit.

----------------------------------------------------------------------

Comment By: April King (marumari)
Date: 2005-09-04 13:57

Message:
Logged In: YES 
user_id=747439

See attached text file for the detailed description (that's
much more readable).

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1281707&group_id=5470

From noreply at sourceforge.net  Mon May 22 16:32:10 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 22 May 2006 07:32:10 -0700
Subject: [Patches] [ python-Patches-1281707 ] Speed up gzip.readline (~40%)
Message-ID: <E1FiBS6-0004Ed-E2@sc8-sf-web2.sourceforge.net>

Patches item #1281707, was opened at 2005-09-04 13:53
Message generated for change (Settings changed) made by etrepum
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1281707&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: None
>Status: Closed
>Resolution: Accepted
Priority: 5
Submitted By: April King (marumari)
Assigned to: Nobody/Anonymous (nobody)
Summary: Speed up gzip.readline (~40%)

Initial Comment:
See bug 849046 for history.  This patch passes both the
regression test and the standard test.  Hopefully the
extra information below won't be too difficult to read.
 I can attach this info to the bug, if need be.

Fixed:
  - Add self.min_readsize to __init__.
    Follows the principal that lines are likely to be
the same length in size,
    and doesn't start over at a minimum length string
every call to readline()
  - Rewriting of assignment for readsize and size at
the beginning of function.
    Eliminates almost all calls to min()
  - Change bufs to a string, and not an array.  No
point in using an array when
    all you do with it is "".join(bufs).  Uses string
addition instead.
  - Remove extra assignments to bufs (in return())
  - Changes readline() to be much more readable (loop
reordering, more comments)

Recommendations:
  - Delete _unread() function.  It is used _only_ by
readline(), and moving its
    functionality into readline() itself saves the
function call overhead.
    _unread() is only 3 lines long.  Testing shows that
removing it speeds
    readline() up by about 3%.  Backwards compatibility
concerns?

Testing results:
test_append (__main__.TestGzip) ... ok
test_many_append (__main__.TestGzip) ... ok
test_mode (__main__.TestGzip) ... ok
test_read (__main__.TestGzip) ... ok
test_readline (__main__.TestGzip) ... ok
test_readlines (__main__.TestGzip) ... ok
test_seek_read (__main__.TestGzip) ... ok
test_seek_write (__main__.TestGzip) ... ok
test_write (__main__.TestGzip) ... ok

----------------------------------------------------------------------
Ran 9 tests in 0.331s

Regression tests:
python regrtest.py -g test_gzip.py
test_gzip
1 test OK.

---

Profiling Results (performed on a common compressed log
file - 200748 lines).

With patch...

         1213961 function calls in 12.188 CPU seconds

   Ordered by: standard name

   ncalls  tottime  percall  cumtime  percall
filename:lineno(function)
        1    0.000    0.000    0.000    0.000 :0(close)
     1159    0.020    0.000    0.020    0.000 :0(crc32)
     1158    0.100    0.000    0.100    0.000
:0(decompress)
        1    0.000    0.000    0.000    0.000
:0(decompressobj)
   200774    0.812    0.000    0.812    0.000 :0(find)
   403865    0.902    0.000    0.902    0.000 :0(len)
     1183    0.000    0.000    0.000    0.000 :0(min)
        2    0.000    0.000    0.000    0.000 :0(ord)
     1173    0.000    0.000    0.000    0.000 :0(read)
       12    0.000    0.000    0.000    0.000 :0(seek)
        1    0.000    0.000    0.000    0.000
:0(setprofile)
       18    0.000    0.000    0.000    0.000 :0(tell)
        2    0.000    0.000    0.000    0.000 :0(unpack)
        1    0.000    0.000   12.188   12.188 <string>:1(?)
        1    0.000    0.000    0.000    0.000
gzip_new.py:156(_init_read)
        1    0.000    0.000    0.000    0.000
gzip_new.py:160(_read_gzip_header)
        3    0.000    0.000    0.000    0.000
gzip_new.py:18(U32)
   200774    2.453    0.000    2.593    0.000
gzip_new.py:207(read)
   200749    2.894    0.000    3.796    0.000
gzip_new.py:239(_unread)
     1166    0.010    0.000    0.140    0.000
gzip_new.py:244(_read)
        1    0.000    0.000    0.000    0.000
gzip_new.py:27(LOWU32)
     1158    0.010    0.000    0.030    0.000
gzip_new.py:294(_add_read_data)
        1    0.000    0.000    0.000    0.000
gzip_new.py:300(_read_eof)
        1    0.000    0.000    0.000    0.000
gzip_new.py:314(close)
        1    0.000    0.000    0.000    0.000
gzip_new.py:327(__del__)
   200749    3.916    0.000   11.117    0.000
gzip_new.py:384(readline)
        2    0.000    0.000    0.000    0.000
gzip_new.py:39(read32)
        1    0.000    0.000    0.000    0.000
gzip_new.py:42(open)
        1    0.000    0.000    0.000    0.000
gzip_new.py:60(__init__)
        1    0.000    0.000   12.188   12.188
profile:0(gunzip_gzip_new_open())
        0    0.000             0.000         
profile:0(profiler)
        1    1.071    1.071   12.188   12.188
test_gzip_speed.py:14(gunzip_gzip_new_open)

Without patch...

         2073328 function calls in 18.597 CPU seconds

   Ordered by: standard name

   ncalls  tottime  percall  cumtime  percall
filename:lineno(function)
   243820    0.735    0.000    0.735    0.000 :0(append)
        1    0.000    0.000    0.000    0.000 :0(close)
     1159    0.040    0.000    0.040    0.000 :0(crc32)
     1158    0.100    0.000    0.100    0.000
:0(decompress)
        1    0.000    0.000    0.000    0.000
:0(decompressobj)
   243820    0.960    0.000    0.960    0.000 :0(find)
   200749    0.801    0.000    0.801    0.000 :0(join)
   489958    1.330    0.000    1.330    0.000 :0(len)
   243820    0.791    0.000    0.791    0.000 :0(min)
        2    0.000    0.000    0.000    0.000 :0(ord)
     1173    0.030    0.000    0.030    0.000 :0(read)
        6    0.000    0.000    0.000    0.000 :0(seek)
        1    0.000    0.000    0.000    0.000
:0(setprofile)
        6    0.000    0.000    0.000    0.000 :0(tell)
        2    0.000    0.000    0.000    0.000 :0(unpack)
        1    0.000    0.000   18.597   18.597 <string>:1(?)
        1    0.000    0.000    0.000    0.000
gzip.py:154(_init_read)
        1    0.000    0.000    0.000    0.000
gzip.py:158(_read_gzip_header)
        3    0.000    0.000    0.000    0.000
gzip.py:18(U32)
   243820    2.711    0.000    2.921    0.000
gzip.py:205(read)
   200749    3.083    0.000    4.143    0.000
gzip.py:237(_unread)
     1160    0.010    0.000    0.210    0.000
gzip.py:242(_read)
        1    0.000    0.000    0.000    0.000
gzip.py:27(LOWU32)
     1158    0.030    0.000    0.070    0.000
gzip.py:292(_add_read_data)
        1    0.000    0.000    0.000    0.000
gzip.py:298(_read_eof)
        1    0.000    0.000    0.000    0.000
gzip.py:312(close)
        1    0.000    0.000    0.000    0.000
gzip.py:325(__del__)
   200749    6.934    0.000   17.555    0.000
gzip.py:379(readline)
        2    0.000    0.000    0.000    0.000
gzip.py:39(read32)
        1    0.000    0.000    0.000    0.000
gzip.py:42(open)
        1    0.000    0.000    0.000    0.000
gzip.py:59(__init__)
        1    0.000    0.000   18.597   18.597
profile:0(gunzip_gzip_open())
        0    0.000             0.000         
profile:0(profiler)
        1    1.042    1.042   18.597   18.597
test_gzip_speed.py:7(gunzip_gzip_open)

Using popen + gunzip -c...

         200754 function calls in 4.338 CPU seconds

   Ordered by: standard name

   ncalls  tottime  percall  cumtime  percall
filename:lineno(function)
        1    0.000    0.000    0.000    0.000 :0(popen)
   200749    3.578    0.000    3.578    0.000 :0(readline)
        1    0.000    0.000    0.000    0.000
:0(setprofile)
        1    0.240    0.240    4.338    4.338 <string>:1(?)
        1    0.000    0.000    4.338    4.338
profile:0(gunzip_popen())
        0    0.000             0.000         
profile:0(profiler)
        1    0.520    0.520    4.098    4.098
test_gzip_speed.py:21(gunzip_popen)

----------------------------------------------------------------------

>Comment By: Bob Ippolito (etrepum)
Date: 2006-05-22 10:32

Message:
Logged In: YES 
user_id=139309

Applied in revision 46070

----------------------------------------------------------------------

Comment By: Bob Ippolito (etrepum)
Date: 2006-05-22 10:11

Message:
Logged In: YES 
user_id=139309

This patch is about a 30% win on Mac OS X i386 using this benchmark:
http://svn.python.org/view/sandbox/trunk/gzipbench/gzipbench.py

I'm going to look to see if there's any other low hanging fruit in there before I 
commit.

----------------------------------------------------------------------

Comment By: April King (marumari)
Date: 2005-09-04 13:57

Message:
Logged In: YES 
user_id=747439

See attached text file for the detailed description (that's
much more readable).

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1281707&group_id=5470

From noreply at sourceforge.net  Mon May 22 17:12:12 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 22 May 2006 08:12:12 -0700
Subject: [Patches] [ python-Patches-1281707 ] Speed up gzip.readline (~40%)
Message-ID: <E1FiC4q-0005R1-Li@sc8-sf-web1.sourceforge.net>

Patches item #1281707, was opened at 2005-09-04 13:53
Message generated for change (Comment added) made by etrepum
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1281707&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: None
>Status: Pending
>Resolution: None
Priority: 5
Submitted By: April King (marumari)
>Assigned to: Bob Ippolito (etrepum)
Summary: Speed up gzip.readline (~40%)

Initial Comment:
See bug 849046 for history.  This patch passes both the
regression test and the standard test.  Hopefully the
extra information below won't be too difficult to read.
 I can attach this info to the bug, if need be.

Fixed:
  - Add self.min_readsize to __init__.
    Follows the principal that lines are likely to be
the same length in size,
    and doesn't start over at a minimum length string
every call to readline()
  - Rewriting of assignment for readsize and size at
the beginning of function.
    Eliminates almost all calls to min()
  - Change bufs to a string, and not an array.  No
point in using an array when
    all you do with it is "".join(bufs).  Uses string
addition instead.
  - Remove extra assignments to bufs (in return())
  - Changes readline() to be much more readable (loop
reordering, more comments)

Recommendations:
  - Delete _unread() function.  It is used _only_ by
readline(), and moving its
    functionality into readline() itself saves the
function call overhead.
    _unread() is only 3 lines long.  Testing shows that
removing it speeds
    readline() up by about 3%.  Backwards compatibility
concerns?

Testing results:
test_append (__main__.TestGzip) ... ok
test_many_append (__main__.TestGzip) ... ok
test_mode (__main__.TestGzip) ... ok
test_read (__main__.TestGzip) ... ok
test_readline (__main__.TestGzip) ... ok
test_readlines (__main__.TestGzip) ... ok
test_seek_read (__main__.TestGzip) ... ok
test_seek_write (__main__.TestGzip) ... ok
test_write (__main__.TestGzip) ... ok

----------------------------------------------------------------------
Ran 9 tests in 0.331s

Regression tests:
python regrtest.py -g test_gzip.py
test_gzip
1 test OK.

---

Profiling Results (performed on a common compressed log
file - 200748 lines).

With patch...

         1213961 function calls in 12.188 CPU seconds

   Ordered by: standard name

   ncalls  tottime  percall  cumtime  percall
filename:lineno(function)
        1    0.000    0.000    0.000    0.000 :0(close)
     1159    0.020    0.000    0.020    0.000 :0(crc32)
     1158    0.100    0.000    0.100    0.000
:0(decompress)
        1    0.000    0.000    0.000    0.000
:0(decompressobj)
   200774    0.812    0.000    0.812    0.000 :0(find)
   403865    0.902    0.000    0.902    0.000 :0(len)
     1183    0.000    0.000    0.000    0.000 :0(min)
        2    0.000    0.000    0.000    0.000 :0(ord)
     1173    0.000    0.000    0.000    0.000 :0(read)
       12    0.000    0.000    0.000    0.000 :0(seek)
        1    0.000    0.000    0.000    0.000
:0(setprofile)
       18    0.000    0.000    0.000    0.000 :0(tell)
        2    0.000    0.000    0.000    0.000 :0(unpack)
        1    0.000    0.000   12.188   12.188 <string>:1(?)
        1    0.000    0.000    0.000    0.000
gzip_new.py:156(_init_read)
        1    0.000    0.000    0.000    0.000
gzip_new.py:160(_read_gzip_header)
        3    0.000    0.000    0.000    0.000
gzip_new.py:18(U32)
   200774    2.453    0.000    2.593    0.000
gzip_new.py:207(read)
   200749    2.894    0.000    3.796    0.000
gzip_new.py:239(_unread)
     1166    0.010    0.000    0.140    0.000
gzip_new.py:244(_read)
        1    0.000    0.000    0.000    0.000
gzip_new.py:27(LOWU32)
     1158    0.010    0.000    0.030    0.000
gzip_new.py:294(_add_read_data)
        1    0.000    0.000    0.000    0.000
gzip_new.py:300(_read_eof)
        1    0.000    0.000    0.000    0.000
gzip_new.py:314(close)
        1    0.000    0.000    0.000    0.000
gzip_new.py:327(__del__)
   200749    3.916    0.000   11.117    0.000
gzip_new.py:384(readline)
        2    0.000    0.000    0.000    0.000
gzip_new.py:39(read32)
        1    0.000    0.000    0.000    0.000
gzip_new.py:42(open)
        1    0.000    0.000    0.000    0.000
gzip_new.py:60(__init__)
        1    0.000    0.000   12.188   12.188
profile:0(gunzip_gzip_new_open())
        0    0.000             0.000         
profile:0(profiler)
        1    1.071    1.071   12.188   12.188
test_gzip_speed.py:14(gunzip_gzip_new_open)

Without patch...

         2073328 function calls in 18.597 CPU seconds

   Ordered by: standard name

   ncalls  tottime  percall  cumtime  percall
filename:lineno(function)
   243820    0.735    0.000    0.735    0.000 :0(append)
        1    0.000    0.000    0.000    0.000 :0(close)
     1159    0.040    0.000    0.040    0.000 :0(crc32)
     1158    0.100    0.000    0.100    0.000
:0(decompress)
        1    0.000    0.000    0.000    0.000
:0(decompressobj)
   243820    0.960    0.000    0.960    0.000 :0(find)
   200749    0.801    0.000    0.801    0.000 :0(join)
   489958    1.330    0.000    1.330    0.000 :0(len)
   243820    0.791    0.000    0.791    0.000 :0(min)
        2    0.000    0.000    0.000    0.000 :0(ord)
     1173    0.030    0.000    0.030    0.000 :0(read)
        6    0.000    0.000    0.000    0.000 :0(seek)
        1    0.000    0.000    0.000    0.000
:0(setprofile)
        6    0.000    0.000    0.000    0.000 :0(tell)
        2    0.000    0.000    0.000    0.000 :0(unpack)
        1    0.000    0.000   18.597   18.597 <string>:1(?)
        1    0.000    0.000    0.000    0.000
gzip.py:154(_init_read)
        1    0.000    0.000    0.000    0.000
gzip.py:158(_read_gzip_header)
        3    0.000    0.000    0.000    0.000
gzip.py:18(U32)
   243820    2.711    0.000    2.921    0.000
gzip.py:205(read)
   200749    3.083    0.000    4.143    0.000
gzip.py:237(_unread)
     1160    0.010    0.000    0.210    0.000
gzip.py:242(_read)
        1    0.000    0.000    0.000    0.000
gzip.py:27(LOWU32)
     1158    0.030    0.000    0.070    0.000
gzip.py:292(_add_read_data)
        1    0.000    0.000    0.000    0.000
gzip.py:298(_read_eof)
        1    0.000    0.000    0.000    0.000
gzip.py:312(close)
        1    0.000    0.000    0.000    0.000
gzip.py:325(__del__)
   200749    6.934    0.000   17.555    0.000
gzip.py:379(readline)
        2    0.000    0.000    0.000    0.000
gzip.py:39(read32)
        1    0.000    0.000    0.000    0.000
gzip.py:42(open)
        1    0.000    0.000    0.000    0.000
gzip.py:59(__init__)
        1    0.000    0.000   18.597   18.597
profile:0(gunzip_gzip_open())
        0    0.000             0.000         
profile:0(profiler)
        1    1.042    1.042   18.597   18.597
test_gzip_speed.py:7(gunzip_gzip_open)

Using popen + gunzip -c...

         200754 function calls in 4.338 CPU seconds

   Ordered by: standard name

   ncalls  tottime  percall  cumtime  percall
filename:lineno(function)
        1    0.000    0.000    0.000    0.000 :0(popen)
   200749    3.578    0.000    3.578    0.000 :0(readline)
        1    0.000    0.000    0.000    0.000
:0(setprofile)
        1    0.240    0.240    4.338    4.338 <string>:1(?)
        1    0.000    0.000    4.338    4.338
profile:0(gunzip_popen())
        0    0.000             0.000         
profile:0(profiler)
        1    0.520    0.520    4.098    4.098
test_gzip_speed.py:21(gunzip_popen)

----------------------------------------------------------------------

>Comment By: Bob Ippolito (etrepum)
Date: 2006-05-22 11:12

Message:
Logged In: YES 
user_id=139309

I'm reopening this patch -- it seems that these changes have made parsing 
Apache style log files MUCH slower (4x on some samples).

----------------------------------------------------------------------

Comment By: Bob Ippolito (etrepum)
Date: 2006-05-22 10:32

Message:
Logged In: YES 
user_id=139309

Applied in revision 46070

----------------------------------------------------------------------

Comment By: Bob Ippolito (etrepum)
Date: 2006-05-22 10:11

Message:
Logged In: YES 
user_id=139309

This patch is about a 30% win on Mac OS X i386 using this benchmark:
http://svn.python.org/view/sandbox/trunk/gzipbench/gzipbench.py

I'm going to look to see if there's any other low hanging fruit in there before I 
commit.

----------------------------------------------------------------------

Comment By: April King (marumari)
Date: 2005-09-04 13:57

Message:
Logged In: YES 
user_id=747439

See attached text file for the detailed description (that's
much more readable).

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1281707&group_id=5470

From noreply at sourceforge.net  Mon May 22 17:29:10 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 22 May 2006 08:29:10 -0700
Subject: [Patches] [ python-Patches-1281707 ] Speed up gzip.readline (~40%)
Message-ID: <E1FiCLG-0006Wn-7y@sc8-sf-web3.sourceforge.net>

Patches item #1281707, was opened at 2005-09-04 13:53
Message generated for change (Comment added) made by etrepum
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1281707&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: None
Status: Pending
Resolution: None
Priority: 5
Submitted By: April King (marumari)
Assigned to: Bob Ippolito (etrepum)
Summary: Speed up gzip.readline (~40%)

Initial Comment:
See bug 849046 for history.  This patch passes both the
regression test and the standard test.  Hopefully the
extra information below won't be too difficult to read.
 I can attach this info to the bug, if need be.

Fixed:
  - Add self.min_readsize to __init__.
    Follows the principal that lines are likely to be
the same length in size,
    and doesn't start over at a minimum length string
every call to readline()
  - Rewriting of assignment for readsize and size at
the beginning of function.
    Eliminates almost all calls to min()
  - Change bufs to a string, and not an array.  No
point in using an array when
    all you do with it is "".join(bufs).  Uses string
addition instead.
  - Remove extra assignments to bufs (in return())
  - Changes readline() to be much more readable (loop
reordering, more comments)

Recommendations:
  - Delete _unread() function.  It is used _only_ by
readline(), and moving its
    functionality into readline() itself saves the
function call overhead.
    _unread() is only 3 lines long.  Testing shows that
removing it speeds
    readline() up by about 3%.  Backwards compatibility
concerns?

Testing results:
test_append (__main__.TestGzip) ... ok
test_many_append (__main__.TestGzip) ... ok
test_mode (__main__.TestGzip) ... ok
test_read (__main__.TestGzip) ... ok
test_readline (__main__.TestGzip) ... ok
test_readlines (__main__.TestGzip) ... ok
test_seek_read (__main__.TestGzip) ... ok
test_seek_write (__main__.TestGzip) ... ok
test_write (__main__.TestGzip) ... ok

----------------------------------------------------------------------
Ran 9 tests in 0.331s

Regression tests:
python regrtest.py -g test_gzip.py
test_gzip
1 test OK.

---

Profiling Results (performed on a common compressed log
file - 200748 lines).

With patch...

         1213961 function calls in 12.188 CPU seconds

   Ordered by: standard name

   ncalls  tottime  percall  cumtime  percall
filename:lineno(function)
        1    0.000    0.000    0.000    0.000 :0(close)
     1159    0.020    0.000    0.020    0.000 :0(crc32)
     1158    0.100    0.000    0.100    0.000
:0(decompress)
        1    0.000    0.000    0.000    0.000
:0(decompressobj)
   200774    0.812    0.000    0.812    0.000 :0(find)
   403865    0.902    0.000    0.902    0.000 :0(len)
     1183    0.000    0.000    0.000    0.000 :0(min)
        2    0.000    0.000    0.000    0.000 :0(ord)
     1173    0.000    0.000    0.000    0.000 :0(read)
       12    0.000    0.000    0.000    0.000 :0(seek)
        1    0.000    0.000    0.000    0.000
:0(setprofile)
       18    0.000    0.000    0.000    0.000 :0(tell)
        2    0.000    0.000    0.000    0.000 :0(unpack)
        1    0.000    0.000   12.188   12.188 <string>:1(?)
        1    0.000    0.000    0.000    0.000
gzip_new.py:156(_init_read)
        1    0.000    0.000    0.000    0.000
gzip_new.py:160(_read_gzip_header)
        3    0.000    0.000    0.000    0.000
gzip_new.py:18(U32)
   200774    2.453    0.000    2.593    0.000
gzip_new.py:207(read)
   200749    2.894    0.000    3.796    0.000
gzip_new.py:239(_unread)
     1166    0.010    0.000    0.140    0.000
gzip_new.py:244(_read)
        1    0.000    0.000    0.000    0.000
gzip_new.py:27(LOWU32)
     1158    0.010    0.000    0.030    0.000
gzip_new.py:294(_add_read_data)
        1    0.000    0.000    0.000    0.000
gzip_new.py:300(_read_eof)
        1    0.000    0.000    0.000    0.000
gzip_new.py:314(close)
        1    0.000    0.000    0.000    0.000
gzip_new.py:327(__del__)
   200749    3.916    0.000   11.117    0.000
gzip_new.py:384(readline)
        2    0.000    0.000    0.000    0.000
gzip_new.py:39(read32)
        1    0.000    0.000    0.000    0.000
gzip_new.py:42(open)
        1    0.000    0.000    0.000    0.000
gzip_new.py:60(__init__)
        1    0.000    0.000   12.188   12.188
profile:0(gunzip_gzip_new_open())
        0    0.000             0.000         
profile:0(profiler)
        1    1.071    1.071   12.188   12.188
test_gzip_speed.py:14(gunzip_gzip_new_open)

Without patch...

         2073328 function calls in 18.597 CPU seconds

   Ordered by: standard name

   ncalls  tottime  percall  cumtime  percall
filename:lineno(function)
   243820    0.735    0.000    0.735    0.000 :0(append)
        1    0.000    0.000    0.000    0.000 :0(close)
     1159    0.040    0.000    0.040    0.000 :0(crc32)
     1158    0.100    0.000    0.100    0.000
:0(decompress)
        1    0.000    0.000    0.000    0.000
:0(decompressobj)
   243820    0.960    0.000    0.960    0.000 :0(find)
   200749    0.801    0.000    0.801    0.000 :0(join)
   489958    1.330    0.000    1.330    0.000 :0(len)
   243820    0.791    0.000    0.791    0.000 :0(min)
        2    0.000    0.000    0.000    0.000 :0(ord)
     1173    0.030    0.000    0.030    0.000 :0(read)
        6    0.000    0.000    0.000    0.000 :0(seek)
        1    0.000    0.000    0.000    0.000
:0(setprofile)
        6    0.000    0.000    0.000    0.000 :0(tell)
        2    0.000    0.000    0.000    0.000 :0(unpack)
        1    0.000    0.000   18.597   18.597 <string>:1(?)
        1    0.000    0.000    0.000    0.000
gzip.py:154(_init_read)
        1    0.000    0.000    0.000    0.000
gzip.py:158(_read_gzip_header)
        3    0.000    0.000    0.000    0.000
gzip.py:18(U32)
   243820    2.711    0.000    2.921    0.000
gzip.py:205(read)
   200749    3.083    0.000    4.143    0.000
gzip.py:237(_unread)
     1160    0.010    0.000    0.210    0.000
gzip.py:242(_read)
        1    0.000    0.000    0.000    0.000
gzip.py:27(LOWU32)
     1158    0.030    0.000    0.070    0.000
gzip.py:292(_add_read_data)
        1    0.000    0.000    0.000    0.000
gzip.py:298(_read_eof)
        1    0.000    0.000    0.000    0.000
gzip.py:312(close)
        1    0.000    0.000    0.000    0.000
gzip.py:325(__del__)
   200749    6.934    0.000   17.555    0.000
gzip.py:379(readline)
        2    0.000    0.000    0.000    0.000
gzip.py:39(read32)
        1    0.000    0.000    0.000    0.000
gzip.py:42(open)
        1    0.000    0.000    0.000    0.000
gzip.py:59(__init__)
        1    0.000    0.000   18.597   18.597
profile:0(gunzip_gzip_open())
        0    0.000             0.000         
profile:0(profiler)
        1    1.042    1.042   18.597   18.597
test_gzip_speed.py:7(gunzip_gzip_open)

Using popen + gunzip -c...

         200754 function calls in 4.338 CPU seconds

   Ordered by: standard name

   ncalls  tottime  percall  cumtime  percall
filename:lineno(function)
        1    0.000    0.000    0.000    0.000 :0(popen)
   200749    3.578    0.000    3.578    0.000 :0(readline)
        1    0.000    0.000    0.000    0.000
:0(setprofile)
        1    0.240    0.240    4.338    4.338 <string>:1(?)
        1    0.000    0.000    4.338    4.338
profile:0(gunzip_popen())
        0    0.000             0.000         
profile:0(profiler)
        1    0.520    0.520    4.098    4.098
test_gzip_speed.py:21(gunzip_popen)

----------------------------------------------------------------------

>Comment By: Bob Ippolito (etrepum)
Date: 2006-05-22 11:29

Message:
Logged In: YES 
user_id=139309

It turns out the performance difference was due to some.. interesting 
characteristics for that particular log file.

>>> import gzip
>>> lengths = [len(line) for line in gzip.GzipFile('TEST.LOG')]
>>> sum(lengths) / float(len(lengths))
45.60349675165147
>>> max(lengths)
117989
>>> min(lengths)
1

The str style buffer in this particular example is going to fail miserably 
reading that one long line.

----------------------------------------------------------------------

Comment By: Bob Ippolito (etrepum)
Date: 2006-05-22 11:12

Message:
Logged In: YES 
user_id=139309

I'm reopening this patch -- it seems that these changes have made parsing 
Apache style log files MUCH slower (4x on some samples).

----------------------------------------------------------------------

Comment By: Bob Ippolito (etrepum)
Date: 2006-05-22 10:32

Message:
Logged In: YES 
user_id=139309

Applied in revision 46070

----------------------------------------------------------------------

Comment By: Bob Ippolito (etrepum)
Date: 2006-05-22 10:11

Message:
Logged In: YES 
user_id=139309

This patch is about a 30% win on Mac OS X i386 using this benchmark:
http://svn.python.org/view/sandbox/trunk/gzipbench/gzipbench.py

I'm going to look to see if there's any other low hanging fruit in there before I 
commit.

----------------------------------------------------------------------

Comment By: April King (marumari)
Date: 2005-09-04 13:57

Message:
Logged In: YES 
user_id=747439

See attached text file for the detailed description (that's
much more readable).

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1281707&group_id=5470

From noreply at sourceforge.net  Mon May 22 17:39:08 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 22 May 2006 08:39:08 -0700
Subject: [Patches] [ python-Patches-1492828 ] Improvements to ceval.c
Message-ID: <E1FiCUu-0002Ea-L9@sc8-sf-web2.sourceforge.net>

Patches item #1492828, was opened at 2006-05-22 06:15
Message generated for change (Comment added) made by tim_one
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1492828&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: mrjbq7 (mrjbq7)
>Assigned to: Raymond Hettinger (rhettinger)
Summary: Improvements to ceval.c

Initial Comment:
>From Raymond Hettinger, submitting here to keep track of for 
NeedForSpeed sprint.

Here are some customizations to your Python build:
 
First, make sure that WITH_TSC and WITH_THREAD are not defined in the 
build.

Then, attached diff to disable the tracing code, remove NOPs, speed-up 
absolute jumps, and increase the signal check interval.

----------------------------------------------------------------------

>Comment By: Tim Peters (tim_one)
Date: 2006-05-22 11:39

Message:
Logged In: YES 
user_id=31435

Assigned to Raymond.  Raymond is there something of general
use here?  As a standalone patch, it sucks ;-)

----------------------------------------------------------------------

Comment By: mrjbq7 (mrjbq7)
Date: 2006-05-22 07:00

Message:
Logged In: YES 
user_id=1172546

Okay, now I checked the box "upload and attach file".  Thats a terrible UI.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1492828&group_id=5470

From noreply at sourceforge.net  Mon May 22 17:39:09 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 22 May 2006 08:39:09 -0700
Subject: [Patches] [ python-Patches-1281707 ] Speed up gzip.readline (~40%)
Message-ID: <E1FiCUv-0003wR-7A@sc8-sf-web1.sourceforge.net>

Patches item #1281707, was opened at 2005-09-04 12:53
Message generated for change (Settings changed) made by marumari
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1281707&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: None
>Status: Open
Resolution: None
Priority: 5
Submitted By: April King (marumari)
Assigned to: Bob Ippolito (etrepum)
Summary: Speed up gzip.readline (~40%)

Initial Comment:
See bug 849046 for history.  This patch passes both the
regression test and the standard test.  Hopefully the
extra information below won't be too difficult to read.
 I can attach this info to the bug, if need be.

Fixed:
  - Add self.min_readsize to __init__.
    Follows the principal that lines are likely to be
the same length in size,
    and doesn't start over at a minimum length string
every call to readline()
  - Rewriting of assignment for readsize and size at
the beginning of function.
    Eliminates almost all calls to min()
  - Change bufs to a string, and not an array.  No
point in using an array when
    all you do with it is "".join(bufs).  Uses string
addition instead.
  - Remove extra assignments to bufs (in return())
  - Changes readline() to be much more readable (loop
reordering, more comments)

Recommendations:
  - Delete _unread() function.  It is used _only_ by
readline(), and moving its
    functionality into readline() itself saves the
function call overhead.
    _unread() is only 3 lines long.  Testing shows that
removing it speeds
    readline() up by about 3%.  Backwards compatibility
concerns?

Testing results:
test_append (__main__.TestGzip) ... ok
test_many_append (__main__.TestGzip) ... ok
test_mode (__main__.TestGzip) ... ok
test_read (__main__.TestGzip) ... ok
test_readline (__main__.TestGzip) ... ok
test_readlines (__main__.TestGzip) ... ok
test_seek_read (__main__.TestGzip) ... ok
test_seek_write (__main__.TestGzip) ... ok
test_write (__main__.TestGzip) ... ok

----------------------------------------------------------------------
Ran 9 tests in 0.331s

Regression tests:
python regrtest.py -g test_gzip.py
test_gzip
1 test OK.

---

Profiling Results (performed on a common compressed log
file - 200748 lines).

With patch...

         1213961 function calls in 12.188 CPU seconds

   Ordered by: standard name

   ncalls  tottime  percall  cumtime  percall
filename:lineno(function)
        1    0.000    0.000    0.000    0.000 :0(close)
     1159    0.020    0.000    0.020    0.000 :0(crc32)
     1158    0.100    0.000    0.100    0.000
:0(decompress)
        1    0.000    0.000    0.000    0.000
:0(decompressobj)
   200774    0.812    0.000    0.812    0.000 :0(find)
   403865    0.902    0.000    0.902    0.000 :0(len)
     1183    0.000    0.000    0.000    0.000 :0(min)
        2    0.000    0.000    0.000    0.000 :0(ord)
     1173    0.000    0.000    0.000    0.000 :0(read)
       12    0.000    0.000    0.000    0.000 :0(seek)
        1    0.000    0.000    0.000    0.000
:0(setprofile)
       18    0.000    0.000    0.000    0.000 :0(tell)
        2    0.000    0.000    0.000    0.000 :0(unpack)
        1    0.000    0.000   12.188   12.188 <string>:1(?)
        1    0.000    0.000    0.000    0.000
gzip_new.py:156(_init_read)
        1    0.000    0.000    0.000    0.000
gzip_new.py:160(_read_gzip_header)
        3    0.000    0.000    0.000    0.000
gzip_new.py:18(U32)
   200774    2.453    0.000    2.593    0.000
gzip_new.py:207(read)
   200749    2.894    0.000    3.796    0.000
gzip_new.py:239(_unread)
     1166    0.010    0.000    0.140    0.000
gzip_new.py:244(_read)
        1    0.000    0.000    0.000    0.000
gzip_new.py:27(LOWU32)
     1158    0.010    0.000    0.030    0.000
gzip_new.py:294(_add_read_data)
        1    0.000    0.000    0.000    0.000
gzip_new.py:300(_read_eof)
        1    0.000    0.000    0.000    0.000
gzip_new.py:314(close)
        1    0.000    0.000    0.000    0.000
gzip_new.py:327(__del__)
   200749    3.916    0.000   11.117    0.000
gzip_new.py:384(readline)
        2    0.000    0.000    0.000    0.000
gzip_new.py:39(read32)
        1    0.000    0.000    0.000    0.000
gzip_new.py:42(open)
        1    0.000    0.000    0.000    0.000
gzip_new.py:60(__init__)
        1    0.000    0.000   12.188   12.188
profile:0(gunzip_gzip_new_open())
        0    0.000             0.000         
profile:0(profiler)
        1    1.071    1.071   12.188   12.188
test_gzip_speed.py:14(gunzip_gzip_new_open)

Without patch...

         2073328 function calls in 18.597 CPU seconds

   Ordered by: standard name

   ncalls  tottime  percall  cumtime  percall
filename:lineno(function)
   243820    0.735    0.000    0.735    0.000 :0(append)
        1    0.000    0.000    0.000    0.000 :0(close)
     1159    0.040    0.000    0.040    0.000 :0(crc32)
     1158    0.100    0.000    0.100    0.000
:0(decompress)
        1    0.000    0.000    0.000    0.000
:0(decompressobj)
   243820    0.960    0.000    0.960    0.000 :0(find)
   200749    0.801    0.000    0.801    0.000 :0(join)
   489958    1.330    0.000    1.330    0.000 :0(len)
   243820    0.791    0.000    0.791    0.000 :0(min)
        2    0.000    0.000    0.000    0.000 :0(ord)
     1173    0.030    0.000    0.030    0.000 :0(read)
        6    0.000    0.000    0.000    0.000 :0(seek)
        1    0.000    0.000    0.000    0.000
:0(setprofile)
        6    0.000    0.000    0.000    0.000 :0(tell)
        2    0.000    0.000    0.000    0.000 :0(unpack)
        1    0.000    0.000   18.597   18.597 <string>:1(?)
        1    0.000    0.000    0.000    0.000
gzip.py:154(_init_read)
        1    0.000    0.000    0.000    0.000
gzip.py:158(_read_gzip_header)
        3    0.000    0.000    0.000    0.000
gzip.py:18(U32)
   243820    2.711    0.000    2.921    0.000
gzip.py:205(read)
   200749    3.083    0.000    4.143    0.000
gzip.py:237(_unread)
     1160    0.010    0.000    0.210    0.000
gzip.py:242(_read)
        1    0.000    0.000    0.000    0.000
gzip.py:27(LOWU32)
     1158    0.030    0.000    0.070    0.000
gzip.py:292(_add_read_data)
        1    0.000    0.000    0.000    0.000
gzip.py:298(_read_eof)
        1    0.000    0.000    0.000    0.000
gzip.py:312(close)
        1    0.000    0.000    0.000    0.000
gzip.py:325(__del__)
   200749    6.934    0.000   17.555    0.000
gzip.py:379(readline)
        2    0.000    0.000    0.000    0.000
gzip.py:39(read32)
        1    0.000    0.000    0.000    0.000
gzip.py:42(open)
        1    0.000    0.000    0.000    0.000
gzip.py:59(__init__)
        1    0.000    0.000   18.597   18.597
profile:0(gunzip_gzip_open())
        0    0.000             0.000         
profile:0(profiler)
        1    1.042    1.042   18.597   18.597
test_gzip_speed.py:7(gunzip_gzip_open)

Using popen + gunzip -c...

         200754 function calls in 4.338 CPU seconds

   Ordered by: standard name

   ncalls  tottime  percall  cumtime  percall
filename:lineno(function)
        1    0.000    0.000    0.000    0.000 :0(popen)
   200749    3.578    0.000    3.578    0.000 :0(readline)
        1    0.000    0.000    0.000    0.000
:0(setprofile)
        1    0.240    0.240    4.338    4.338 <string>:1(?)
        1    0.000    0.000    4.338    4.338
profile:0(gunzip_popen())
        0    0.000             0.000         
profile:0(profiler)
        1    0.520    0.520    4.098    4.098
test_gzip_speed.py:21(gunzip_popen)

----------------------------------------------------------------------

>Comment By: April King (marumari)
Date: 2006-05-22 10:39

Message:
Logged In: YES 
user_id=747439

Actually, the slow speed in that specific circumstance has
nothing to do with the fact that it uses a string style
buffer, which should always be faster than what it used
before (an array of strings that was constantly appended to.)

The problem with that particular file is how the
gzip.readline function auto-optimizes it's read size.

if readsize > self.min_readsize:
  self.min_readsize = readsize

So, it optimizes it's read size to the length of the largest
line that it has seen so far.  The assumption is that
gzipped files are generally going to be a bunch of lines of
similar length, and not wildly differing length.

----------------------------------------------------------------------

Comment By: Bob Ippolito (etrepum)
Date: 2006-05-22 10:29

Message:
Logged In: YES 
user_id=139309

It turns out the performance difference was due to some.. interesting 
characteristics for that particular log file.

>>> import gzip
>>> lengths = [len(line) for line in gzip.GzipFile('TEST.LOG')]
>>> sum(lengths) / float(len(lengths))
45.60349675165147
>>> max(lengths)
117989
>>> min(lengths)
1

The str style buffer in this particular example is going to fail miserably 
reading that one long line.

----------------------------------------------------------------------

Comment By: Bob Ippolito (etrepum)
Date: 2006-05-22 10:12

Message:
Logged In: YES 
user_id=139309

I'm reopening this patch -- it seems that these changes have made parsing 
Apache style log files MUCH slower (4x on some samples).

----------------------------------------------------------------------

Comment By: Bob Ippolito (etrepum)
Date: 2006-05-22 09:32

Message:
Logged In: YES 
user_id=139309

Applied in revision 46070

----------------------------------------------------------------------

Comment By: Bob Ippolito (etrepum)
Date: 2006-05-22 09:11

Message:
Logged In: YES 
user_id=139309

This patch is about a 30% win on Mac OS X i386 using this benchmark:
http://svn.python.org/view/sandbox/trunk/gzipbench/gzipbench.py

I'm going to look to see if there's any other low hanging fruit in there before I 
commit.

----------------------------------------------------------------------

Comment By: April King (marumari)
Date: 2005-09-04 12:57

Message:
Logged In: YES 
user_id=747439

See attached text file for the detailed description (that's
much more readable).

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1281707&group_id=5470

From noreply at sourceforge.net  Mon May 22 17:54:41 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 22 May 2006 08:54:41 -0700
Subject: [Patches] [ python-Patches-1281707 ] Speed up gzip.readline (~40%)
Message-ID: <E1FiCjx-0005mK-D9@sc8-sf-web2.sourceforge.net>

Patches item #1281707, was opened at 2005-09-04 13:53
Message generated for change (Comment added) made by etrepum
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1281707&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: None
>Status: Pending
Resolution: None
Priority: 5
Submitted By: April King (marumari)
Assigned to: Bob Ippolito (etrepum)
Summary: Speed up gzip.readline (~40%)

Initial Comment:
See bug 849046 for history.  This patch passes both the
regression test and the standard test.  Hopefully the
extra information below won't be too difficult to read.
 I can attach this info to the bug, if need be.

Fixed:
  - Add self.min_readsize to __init__.
    Follows the principal that lines are likely to be
the same length in size,
    and doesn't start over at a minimum length string
every call to readline()
  - Rewriting of assignment for readsize and size at
the beginning of function.
    Eliminates almost all calls to min()
  - Change bufs to a string, and not an array.  No
point in using an array when
    all you do with it is "".join(bufs).  Uses string
addition instead.
  - Remove extra assignments to bufs (in return())
  - Changes readline() to be much more readable (loop
reordering, more comments)

Recommendations:
  - Delete _unread() function.  It is used _only_ by
readline(), and moving its
    functionality into readline() itself saves the
function call overhead.
    _unread() is only 3 lines long.  Testing shows that
removing it speeds
    readline() up by about 3%.  Backwards compatibility
concerns?

Testing results:
test_append (__main__.TestGzip) ... ok
test_many_append (__main__.TestGzip) ... ok
test_mode (__main__.TestGzip) ... ok
test_read (__main__.TestGzip) ... ok
test_readline (__main__.TestGzip) ... ok
test_readlines (__main__.TestGzip) ... ok
test_seek_read (__main__.TestGzip) ... ok
test_seek_write (__main__.TestGzip) ... ok
test_write (__main__.TestGzip) ... ok

----------------------------------------------------------------------
Ran 9 tests in 0.331s

Regression tests:
python regrtest.py -g test_gzip.py
test_gzip
1 test OK.

---

Profiling Results (performed on a common compressed log
file - 200748 lines).

With patch...

         1213961 function calls in 12.188 CPU seconds

   Ordered by: standard name

   ncalls  tottime  percall  cumtime  percall
filename:lineno(function)
        1    0.000    0.000    0.000    0.000 :0(close)
     1159    0.020    0.000    0.020    0.000 :0(crc32)
     1158    0.100    0.000    0.100    0.000
:0(decompress)
        1    0.000    0.000    0.000    0.000
:0(decompressobj)
   200774    0.812    0.000    0.812    0.000 :0(find)
   403865    0.902    0.000    0.902    0.000 :0(len)
     1183    0.000    0.000    0.000    0.000 :0(min)
        2    0.000    0.000    0.000    0.000 :0(ord)
     1173    0.000    0.000    0.000    0.000 :0(read)
       12    0.000    0.000    0.000    0.000 :0(seek)
        1    0.000    0.000    0.000    0.000
:0(setprofile)
       18    0.000    0.000    0.000    0.000 :0(tell)
        2    0.000    0.000    0.000    0.000 :0(unpack)
        1    0.000    0.000   12.188   12.188 <string>:1(?)
        1    0.000    0.000    0.000    0.000
gzip_new.py:156(_init_read)
        1    0.000    0.000    0.000    0.000
gzip_new.py:160(_read_gzip_header)
        3    0.000    0.000    0.000    0.000
gzip_new.py:18(U32)
   200774    2.453    0.000    2.593    0.000
gzip_new.py:207(read)
   200749    2.894    0.000    3.796    0.000
gzip_new.py:239(_unread)
     1166    0.010    0.000    0.140    0.000
gzip_new.py:244(_read)
        1    0.000    0.000    0.000    0.000
gzip_new.py:27(LOWU32)
     1158    0.010    0.000    0.030    0.000
gzip_new.py:294(_add_read_data)
        1    0.000    0.000    0.000    0.000
gzip_new.py:300(_read_eof)
        1    0.000    0.000    0.000    0.000
gzip_new.py:314(close)
        1    0.000    0.000    0.000    0.000
gzip_new.py:327(__del__)
   200749    3.916    0.000   11.117    0.000
gzip_new.py:384(readline)
        2    0.000    0.000    0.000    0.000
gzip_new.py:39(read32)
        1    0.000    0.000    0.000    0.000
gzip_new.py:42(open)
        1    0.000    0.000    0.000    0.000
gzip_new.py:60(__init__)
        1    0.000    0.000   12.188   12.188
profile:0(gunzip_gzip_new_open())
        0    0.000             0.000         
profile:0(profiler)
        1    1.071    1.071   12.188   12.188
test_gzip_speed.py:14(gunzip_gzip_new_open)

Without patch...

         2073328 function calls in 18.597 CPU seconds

   Ordered by: standard name

   ncalls  tottime  percall  cumtime  percall
filename:lineno(function)
   243820    0.735    0.000    0.735    0.000 :0(append)
        1    0.000    0.000    0.000    0.000 :0(close)
     1159    0.040    0.000    0.040    0.000 :0(crc32)
     1158    0.100    0.000    0.100    0.000
:0(decompress)
        1    0.000    0.000    0.000    0.000
:0(decompressobj)
   243820    0.960    0.000    0.960    0.000 :0(find)
   200749    0.801    0.000    0.801    0.000 :0(join)
   489958    1.330    0.000    1.330    0.000 :0(len)
   243820    0.791    0.000    0.791    0.000 :0(min)
        2    0.000    0.000    0.000    0.000 :0(ord)
     1173    0.030    0.000    0.030    0.000 :0(read)
        6    0.000    0.000    0.000    0.000 :0(seek)
        1    0.000    0.000    0.000    0.000
:0(setprofile)
        6    0.000    0.000    0.000    0.000 :0(tell)
        2    0.000    0.000    0.000    0.000 :0(unpack)
        1    0.000    0.000   18.597   18.597 <string>:1(?)
        1    0.000    0.000    0.000    0.000
gzip.py:154(_init_read)
        1    0.000    0.000    0.000    0.000
gzip.py:158(_read_gzip_header)
        3    0.000    0.000    0.000    0.000
gzip.py:18(U32)
   243820    2.711    0.000    2.921    0.000
gzip.py:205(read)
   200749    3.083    0.000    4.143    0.000
gzip.py:237(_unread)
     1160    0.010    0.000    0.210    0.000
gzip.py:242(_read)
        1    0.000    0.000    0.000    0.000
gzip.py:27(LOWU32)
     1158    0.030    0.000    0.070    0.000
gzip.py:292(_add_read_data)
        1    0.000    0.000    0.000    0.000
gzip.py:298(_read_eof)
        1    0.000    0.000    0.000    0.000
gzip.py:312(close)
        1    0.000    0.000    0.000    0.000
gzip.py:325(__del__)
   200749    6.934    0.000   17.555    0.000
gzip.py:379(readline)
        2    0.000    0.000    0.000    0.000
gzip.py:39(read32)
        1    0.000    0.000    0.000    0.000
gzip.py:42(open)
        1    0.000    0.000    0.000    0.000
gzip.py:59(__init__)
        1    0.000    0.000   18.597   18.597
profile:0(gunzip_gzip_open())
        0    0.000             0.000         
profile:0(profiler)
        1    1.042    1.042   18.597   18.597
test_gzip_speed.py:7(gunzip_gzip_open)

Using popen + gunzip -c...

         200754 function calls in 4.338 CPU seconds

   Ordered by: standard name

   ncalls  tottime  percall  cumtime  percall
filename:lineno(function)
        1    0.000    0.000    0.000    0.000 :0(popen)
   200749    3.578    0.000    3.578    0.000 :0(readline)
        1    0.000    0.000    0.000    0.000
:0(setprofile)
        1    0.240    0.240    4.338    4.338 <string>:1(?)
        1    0.000    0.000    4.338    4.338
profile:0(gunzip_popen())
        0    0.000             0.000         
profile:0(profiler)
        1    0.520    0.520    4.098    4.098
test_gzip_speed.py:21(gunzip_popen)

----------------------------------------------------------------------

>Comment By: Bob Ippolito (etrepum)
Date: 2006-05-22 11:54

Message:
Logged In: YES 
user_id=139309

The attached patch uses a strategy that provides the same 30%-ish performance 
boost for the benchmark, and also provides a small performance boost (about 
9% or so) for the very strange log file.

The key is to use lists for buffering, and to never allow the default buffer size to 
grow too large (512-ish starting point seems to be a sweet spot). This defends 
against working with large strings more often than necessary.

----------------------------------------------------------------------

Comment By: April King (marumari)
Date: 2006-05-22 11:39

Message:
Logged In: YES 
user_id=747439

Actually, the slow speed in that specific circumstance has
nothing to do with the fact that it uses a string style
buffer, which should always be faster than what it used
before (an array of strings that was constantly appended to.)

The problem with that particular file is how the
gzip.readline function auto-optimizes it's read size.

if readsize > self.min_readsize:
  self.min_readsize = readsize

So, it optimizes it's read size to the length of the largest
line that it has seen so far.  The assumption is that
gzipped files are generally going to be a bunch of lines of
similar length, and not wildly differing length.

----------------------------------------------------------------------

Comment By: Bob Ippolito (etrepum)
Date: 2006-05-22 11:29

Message:
Logged In: YES 
user_id=139309

It turns out the performance difference was due to some.. interesting 
characteristics for that particular log file.

>>> import gzip
>>> lengths = [len(line) for line in gzip.GzipFile('TEST.LOG')]
>>> sum(lengths) / float(len(lengths))
45.60349675165147
>>> max(lengths)
117989
>>> min(lengths)
1

The str style buffer in this particular example is going to fail miserably 
reading that one long line.

----------------------------------------------------------------------

Comment By: Bob Ippolito (etrepum)
Date: 2006-05-22 11:12

Message:
Logged In: YES 
user_id=139309

I'm reopening this patch -- it seems that these changes have made parsing 
Apache style log files MUCH slower (4x on some samples).

----------------------------------------------------------------------

Comment By: Bob Ippolito (etrepum)
Date: 2006-05-22 10:32

Message:
Logged In: YES 
user_id=139309

Applied in revision 46070

----------------------------------------------------------------------

Comment By: Bob Ippolito (etrepum)
Date: 2006-05-22 10:11

Message:
Logged In: YES 
user_id=139309

This patch is about a 30% win on Mac OS X i386 using this benchmark:
http://svn.python.org/view/sandbox/trunk/gzipbench/gzipbench.py

I'm going to look to see if there's any other low hanging fruit in there before I 
commit.

----------------------------------------------------------------------

Comment By: April King (marumari)
Date: 2005-09-04 13:57

Message:
Logged In: YES 
user_id=747439

See attached text file for the detailed description (that's
much more readable).

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1281707&group_id=5470

From noreply at sourceforge.net  Mon May 22 17:59:41 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 22 May 2006 08:59:41 -0700
Subject: [Patches] [ python-Patches-1281707 ] Speed up gzip.readline (~40%)
Message-ID: <E1FiCon-0006xL-Id@sc8-sf-web2.sourceforge.net>

Patches item #1281707, was opened at 2005-09-04 13:53
Message generated for change (Comment added) made by etrepum
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1281707&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: None
>Status: Closed
>Resolution: Accepted
Priority: 5
Submitted By: April King (marumari)
Assigned to: Bob Ippolito (etrepum)
Summary: Speed up gzip.readline (~40%)

Initial Comment:
See bug 849046 for history.  This patch passes both the
regression test and the standard test.  Hopefully the
extra information below won't be too difficult to read.
 I can attach this info to the bug, if need be.

Fixed:
  - Add self.min_readsize to __init__.
    Follows the principal that lines are likely to be
the same length in size,
    and doesn't start over at a minimum length string
every call to readline()
  - Rewriting of assignment for readsize and size at
the beginning of function.
    Eliminates almost all calls to min()
  - Change bufs to a string, and not an array.  No
point in using an array when
    all you do with it is "".join(bufs).  Uses string
addition instead.
  - Remove extra assignments to bufs (in return())
  - Changes readline() to be much more readable (loop
reordering, more comments)

Recommendations:
  - Delete _unread() function.  It is used _only_ by
readline(), and moving its
    functionality into readline() itself saves the
function call overhead.
    _unread() is only 3 lines long.  Testing shows that
removing it speeds
    readline() up by about 3%.  Backwards compatibility
concerns?

Testing results:
test_append (__main__.TestGzip) ... ok
test_many_append (__main__.TestGzip) ... ok
test_mode (__main__.TestGzip) ... ok
test_read (__main__.TestGzip) ... ok
test_readline (__main__.TestGzip) ... ok
test_readlines (__main__.TestGzip) ... ok
test_seek_read (__main__.TestGzip) ... ok
test_seek_write (__main__.TestGzip) ... ok
test_write (__main__.TestGzip) ... ok

----------------------------------------------------------------------
Ran 9 tests in 0.331s

Regression tests:
python regrtest.py -g test_gzip.py
test_gzip
1 test OK.

---

Profiling Results (performed on a common compressed log
file - 200748 lines).

With patch...

         1213961 function calls in 12.188 CPU seconds

   Ordered by: standard name

   ncalls  tottime  percall  cumtime  percall
filename:lineno(function)
        1    0.000    0.000    0.000    0.000 :0(close)
     1159    0.020    0.000    0.020    0.000 :0(crc32)
     1158    0.100    0.000    0.100    0.000
:0(decompress)
        1    0.000    0.000    0.000    0.000
:0(decompressobj)
   200774    0.812    0.000    0.812    0.000 :0(find)
   403865    0.902    0.000    0.902    0.000 :0(len)
     1183    0.000    0.000    0.000    0.000 :0(min)
        2    0.000    0.000    0.000    0.000 :0(ord)
     1173    0.000    0.000    0.000    0.000 :0(read)
       12    0.000    0.000    0.000    0.000 :0(seek)
        1    0.000    0.000    0.000    0.000
:0(setprofile)
       18    0.000    0.000    0.000    0.000 :0(tell)
        2    0.000    0.000    0.000    0.000 :0(unpack)
        1    0.000    0.000   12.188   12.188 <string>:1(?)
        1    0.000    0.000    0.000    0.000
gzip_new.py:156(_init_read)
        1    0.000    0.000    0.000    0.000
gzip_new.py:160(_read_gzip_header)
        3    0.000    0.000    0.000    0.000
gzip_new.py:18(U32)
   200774    2.453    0.000    2.593    0.000
gzip_new.py:207(read)
   200749    2.894    0.000    3.796    0.000
gzip_new.py:239(_unread)
     1166    0.010    0.000    0.140    0.000
gzip_new.py:244(_read)
        1    0.000    0.000    0.000    0.000
gzip_new.py:27(LOWU32)
     1158    0.010    0.000    0.030    0.000
gzip_new.py:294(_add_read_data)
        1    0.000    0.000    0.000    0.000
gzip_new.py:300(_read_eof)
        1    0.000    0.000    0.000    0.000
gzip_new.py:314(close)
        1    0.000    0.000    0.000    0.000
gzip_new.py:327(__del__)
   200749    3.916    0.000   11.117    0.000
gzip_new.py:384(readline)
        2    0.000    0.000    0.000    0.000
gzip_new.py:39(read32)
        1    0.000    0.000    0.000    0.000
gzip_new.py:42(open)
        1    0.000    0.000    0.000    0.000
gzip_new.py:60(__init__)
        1    0.000    0.000   12.188   12.188
profile:0(gunzip_gzip_new_open())
        0    0.000             0.000         
profile:0(profiler)
        1    1.071    1.071   12.188   12.188
test_gzip_speed.py:14(gunzip_gzip_new_open)

Without patch...

         2073328 function calls in 18.597 CPU seconds

   Ordered by: standard name

   ncalls  tottime  percall  cumtime  percall
filename:lineno(function)
   243820    0.735    0.000    0.735    0.000 :0(append)
        1    0.000    0.000    0.000    0.000 :0(close)
     1159    0.040    0.000    0.040    0.000 :0(crc32)
     1158    0.100    0.000    0.100    0.000
:0(decompress)
        1    0.000    0.000    0.000    0.000
:0(decompressobj)
   243820    0.960    0.000    0.960    0.000 :0(find)
   200749    0.801    0.000    0.801    0.000 :0(join)
   489958    1.330    0.000    1.330    0.000 :0(len)
   243820    0.791    0.000    0.791    0.000 :0(min)
        2    0.000    0.000    0.000    0.000 :0(ord)
     1173    0.030    0.000    0.030    0.000 :0(read)
        6    0.000    0.000    0.000    0.000 :0(seek)
        1    0.000    0.000    0.000    0.000
:0(setprofile)
        6    0.000    0.000    0.000    0.000 :0(tell)
        2    0.000    0.000    0.000    0.000 :0(unpack)
        1    0.000    0.000   18.597   18.597 <string>:1(?)
        1    0.000    0.000    0.000    0.000
gzip.py:154(_init_read)
        1    0.000    0.000    0.000    0.000
gzip.py:158(_read_gzip_header)
        3    0.000    0.000    0.000    0.000
gzip.py:18(U32)
   243820    2.711    0.000    2.921    0.000
gzip.py:205(read)
   200749    3.083    0.000    4.143    0.000
gzip.py:237(_unread)
     1160    0.010    0.000    0.210    0.000
gzip.py:242(_read)
        1    0.000    0.000    0.000    0.000
gzip.py:27(LOWU32)
     1158    0.030    0.000    0.070    0.000
gzip.py:292(_add_read_data)
        1    0.000    0.000    0.000    0.000
gzip.py:298(_read_eof)
        1    0.000    0.000    0.000    0.000
gzip.py:312(close)
        1    0.000    0.000    0.000    0.000
gzip.py:325(__del__)
   200749    6.934    0.000   17.555    0.000
gzip.py:379(readline)
        2    0.000    0.000    0.000    0.000
gzip.py:39(read32)
        1    0.000    0.000    0.000    0.000
gzip.py:42(open)
        1    0.000    0.000    0.000    0.000
gzip.py:59(__init__)
        1    0.000    0.000   18.597   18.597
profile:0(gunzip_gzip_open())
        0    0.000             0.000         
profile:0(profiler)
        1    1.042    1.042   18.597   18.597
test_gzip_speed.py:7(gunzip_gzip_open)

Using popen + gunzip -c...

         200754 function calls in 4.338 CPU seconds

   Ordered by: standard name

   ncalls  tottime  percall  cumtime  percall
filename:lineno(function)
        1    0.000    0.000    0.000    0.000 :0(popen)
   200749    3.578    0.000    3.578    0.000 :0(readline)
        1    0.000    0.000    0.000    0.000
:0(setprofile)
        1    0.240    0.240    4.338    4.338 <string>:1(?)
        1    0.000    0.000    4.338    4.338
profile:0(gunzip_popen())
        0    0.000             0.000         
profile:0(profiler)
        1    0.520    0.520    4.098    4.098
test_gzip_speed.py:21(gunzip_popen)

----------------------------------------------------------------------

>Comment By: Bob Ippolito (etrepum)
Date: 2006-05-22 11:59

Message:
Logged In: YES 
user_id=139309

Applied in revision 46075

----------------------------------------------------------------------

Comment By: Bob Ippolito (etrepum)
Date: 2006-05-22 11:54

Message:
Logged In: YES 
user_id=139309

The attached patch uses a strategy that provides the same 30%-ish performance 
boost for the benchmark, and also provides a small performance boost (about 
9% or so) for the very strange log file.

The key is to use lists for buffering, and to never allow the default buffer size to 
grow too large (512-ish starting point seems to be a sweet spot). This defends 
against working with large strings more often than necessary.

----------------------------------------------------------------------

Comment By: April King (marumari)
Date: 2006-05-22 11:39

Message:
Logged In: YES 
user_id=747439

Actually, the slow speed in that specific circumstance has
nothing to do with the fact that it uses a string style
buffer, which should always be faster than what it used
before (an array of strings that was constantly appended to.)

The problem with that particular file is how the
gzip.readline function auto-optimizes it's read size.

if readsize > self.min_readsize:
  self.min_readsize = readsize

So, it optimizes it's read size to the length of the largest
line that it has seen so far.  The assumption is that
gzipped files are generally going to be a bunch of lines of
similar length, and not wildly differing length.

----------------------------------------------------------------------

Comment By: Bob Ippolito (etrepum)
Date: 2006-05-22 11:29

Message:
Logged In: YES 
user_id=139309

It turns out the performance difference was due to some.. interesting 
characteristics for that particular log file.

>>> import gzip
>>> lengths = [len(line) for line in gzip.GzipFile('TEST.LOG')]
>>> sum(lengths) / float(len(lengths))
45.60349675165147
>>> max(lengths)
117989
>>> min(lengths)
1

The str style buffer in this particular example is going to fail miserably 
reading that one long line.

----------------------------------------------------------------------

Comment By: Bob Ippolito (etrepum)
Date: 2006-05-22 11:12

Message:
Logged In: YES 
user_id=139309

I'm reopening this patch -- it seems that these changes have made parsing 
Apache style log files MUCH slower (4x on some samples).

----------------------------------------------------------------------

Comment By: Bob Ippolito (etrepum)
Date: 2006-05-22 10:32

Message:
Logged In: YES 
user_id=139309

Applied in revision 46070

----------------------------------------------------------------------

Comment By: Bob Ippolito (etrepum)
Date: 2006-05-22 10:11

Message:
Logged In: YES 
user_id=139309

This patch is about a 30% win on Mac OS X i386 using this benchmark:
http://svn.python.org/view/sandbox/trunk/gzipbench/gzipbench.py

I'm going to look to see if there's any other low hanging fruit in there before I 
commit.

----------------------------------------------------------------------

Comment By: April King (marumari)
Date: 2005-09-04 13:57

Message:
Logged In: YES 
user_id=747439

See attached text file for the detailed description (that's
much more readable).

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1281707&group_id=5470

From noreply at sourceforge.net  Mon May 22 18:08:42 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 22 May 2006 09:08:42 -0700
Subject: [Patches] [ python-Patches-1281707 ] Speed up gzip.readline (~40%)
Message-ID: <E1FiCxW-0000lv-53@sc8-sf-web3.sourceforge.net>

Patches item #1281707, was opened at 2005-09-04 12:53
Message generated for change (Comment added) made by marumari
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1281707&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: None
>Status: Open
>Resolution: None
Priority: 5
Submitted By: April King (marumari)
Assigned to: Bob Ippolito (etrepum)
Summary: Speed up gzip.readline (~40%)

Initial Comment:
See bug 849046 for history.  This patch passes both the
regression test and the standard test.  Hopefully the
extra information below won't be too difficult to read.
 I can attach this info to the bug, if need be.

Fixed:
  - Add self.min_readsize to __init__.
    Follows the principal that lines are likely to be
the same length in size,
    and doesn't start over at a minimum length string
every call to readline()
  - Rewriting of assignment for readsize and size at
the beginning of function.
    Eliminates almost all calls to min()
  - Change bufs to a string, and not an array.  No
point in using an array when
    all you do with it is "".join(bufs).  Uses string
addition instead.
  - Remove extra assignments to bufs (in return())
  - Changes readline() to be much more readable (loop
reordering, more comments)

Recommendations:
  - Delete _unread() function.  It is used _only_ by
readline(), and moving its
    functionality into readline() itself saves the
function call overhead.
    _unread() is only 3 lines long.  Testing shows that
removing it speeds
    readline() up by about 3%.  Backwards compatibility
concerns?

Testing results:
test_append (__main__.TestGzip) ... ok
test_many_append (__main__.TestGzip) ... ok
test_mode (__main__.TestGzip) ... ok
test_read (__main__.TestGzip) ... ok
test_readline (__main__.TestGzip) ... ok
test_readlines (__main__.TestGzip) ... ok
test_seek_read (__main__.TestGzip) ... ok
test_seek_write (__main__.TestGzip) ... ok
test_write (__main__.TestGzip) ... ok

----------------------------------------------------------------------
Ran 9 tests in 0.331s

Regression tests:
python regrtest.py -g test_gzip.py
test_gzip
1 test OK.

---

Profiling Results (performed on a common compressed log
file - 200748 lines).

With patch...

         1213961 function calls in 12.188 CPU seconds

   Ordered by: standard name

   ncalls  tottime  percall  cumtime  percall
filename:lineno(function)
        1    0.000    0.000    0.000    0.000 :0(close)
     1159    0.020    0.000    0.020    0.000 :0(crc32)
     1158    0.100    0.000    0.100    0.000
:0(decompress)
        1    0.000    0.000    0.000    0.000
:0(decompressobj)
   200774    0.812    0.000    0.812    0.000 :0(find)
   403865    0.902    0.000    0.902    0.000 :0(len)
     1183    0.000    0.000    0.000    0.000 :0(min)
        2    0.000    0.000    0.000    0.000 :0(ord)
     1173    0.000    0.000    0.000    0.000 :0(read)
       12    0.000    0.000    0.000    0.000 :0(seek)
        1    0.000    0.000    0.000    0.000
:0(setprofile)
       18    0.000    0.000    0.000    0.000 :0(tell)
        2    0.000    0.000    0.000    0.000 :0(unpack)
        1    0.000    0.000   12.188   12.188 <string>:1(?)
        1    0.000    0.000    0.000    0.000
gzip_new.py:156(_init_read)
        1    0.000    0.000    0.000    0.000
gzip_new.py:160(_read_gzip_header)
        3    0.000    0.000    0.000    0.000
gzip_new.py:18(U32)
   200774    2.453    0.000    2.593    0.000
gzip_new.py:207(read)
   200749    2.894    0.000    3.796    0.000
gzip_new.py:239(_unread)
     1166    0.010    0.000    0.140    0.000
gzip_new.py:244(_read)
        1    0.000    0.000    0.000    0.000
gzip_new.py:27(LOWU32)
     1158    0.010    0.000    0.030    0.000
gzip_new.py:294(_add_read_data)
        1    0.000    0.000    0.000    0.000
gzip_new.py:300(_read_eof)
        1    0.000    0.000    0.000    0.000
gzip_new.py:314(close)
        1    0.000    0.000    0.000    0.000
gzip_new.py:327(__del__)
   200749    3.916    0.000   11.117    0.000
gzip_new.py:384(readline)
        2    0.000    0.000    0.000    0.000
gzip_new.py:39(read32)
        1    0.000    0.000    0.000    0.000
gzip_new.py:42(open)
        1    0.000    0.000    0.000    0.000
gzip_new.py:60(__init__)
        1    0.000    0.000   12.188   12.188
profile:0(gunzip_gzip_new_open())
        0    0.000             0.000         
profile:0(profiler)
        1    1.071    1.071   12.188   12.188
test_gzip_speed.py:14(gunzip_gzip_new_open)

Without patch...

         2073328 function calls in 18.597 CPU seconds

   Ordered by: standard name

   ncalls  tottime  percall  cumtime  percall
filename:lineno(function)
   243820    0.735    0.000    0.735    0.000 :0(append)
        1    0.000    0.000    0.000    0.000 :0(close)
     1159    0.040    0.000    0.040    0.000 :0(crc32)
     1158    0.100    0.000    0.100    0.000
:0(decompress)
        1    0.000    0.000    0.000    0.000
:0(decompressobj)
   243820    0.960    0.000    0.960    0.000 :0(find)
   200749    0.801    0.000    0.801    0.000 :0(join)
   489958    1.330    0.000    1.330    0.000 :0(len)
   243820    0.791    0.000    0.791    0.000 :0(min)
        2    0.000    0.000    0.000    0.000 :0(ord)
     1173    0.030    0.000    0.030    0.000 :0(read)
        6    0.000    0.000    0.000    0.000 :0(seek)
        1    0.000    0.000    0.000    0.000
:0(setprofile)
        6    0.000    0.000    0.000    0.000 :0(tell)
        2    0.000    0.000    0.000    0.000 :0(unpack)
        1    0.000    0.000   18.597   18.597 <string>:1(?)
        1    0.000    0.000    0.000    0.000
gzip.py:154(_init_read)
        1    0.000    0.000    0.000    0.000
gzip.py:158(_read_gzip_header)
        3    0.000    0.000    0.000    0.000
gzip.py:18(U32)
   243820    2.711    0.000    2.921    0.000
gzip.py:205(read)
   200749    3.083    0.000    4.143    0.000
gzip.py:237(_unread)
     1160    0.010    0.000    0.210    0.000
gzip.py:242(_read)
        1    0.000    0.000    0.000    0.000
gzip.py:27(LOWU32)
     1158    0.030    0.000    0.070    0.000
gzip.py:292(_add_read_data)
        1    0.000    0.000    0.000    0.000
gzip.py:298(_read_eof)
        1    0.000    0.000    0.000    0.000
gzip.py:312(close)
        1    0.000    0.000    0.000    0.000
gzip.py:325(__del__)
   200749    6.934    0.000   17.555    0.000
gzip.py:379(readline)
        2    0.000    0.000    0.000    0.000
gzip.py:39(read32)
        1    0.000    0.000    0.000    0.000
gzip.py:42(open)
        1    0.000    0.000    0.000    0.000
gzip.py:59(__init__)
        1    0.000    0.000   18.597   18.597
profile:0(gunzip_gzip_open())
        0    0.000             0.000         
profile:0(profiler)
        1    1.042    1.042   18.597   18.597
test_gzip_speed.py:7(gunzip_gzip_open)

Using popen + gunzip -c...

         200754 function calls in 4.338 CPU seconds

   Ordered by: standard name

   ncalls  tottime  percall  cumtime  percall
filename:lineno(function)
        1    0.000    0.000    0.000    0.000 :0(popen)
   200749    3.578    0.000    3.578    0.000 :0(readline)
        1    0.000    0.000    0.000    0.000
:0(setprofile)
        1    0.240    0.240    4.338    4.338 <string>:1(?)
        1    0.000    0.000    4.338    4.338
profile:0(gunzip_popen())
        0    0.000             0.000         
profile:0(profiler)
        1    0.520    0.520    4.098    4.098
test_gzip_speed.py:21(gunzip_popen)

----------------------------------------------------------------------

>Comment By: April King (marumari)
Date: 2006-05-22 11:08

Message:
Logged In: YES 
user_id=747439

There was generally a 5-10% speed improvement for using a
string.  This is because the cost of recreating the string
by appending was less than the cost of creating an array,
appending the the array, and then joining it back together.

I would recommend trying leaving it as a string, but
changing this:
if readsize > self.min_readsize:
  self.min_readsize = int(self.min_readsize * 1.25)

(or some kind of scaling factor)

----------------------------------------------------------------------

Comment By: Bob Ippolito (etrepum)
Date: 2006-05-22 10:59

Message:
Logged In: YES 
user_id=139309

Applied in revision 46075

----------------------------------------------------------------------

Comment By: Bob Ippolito (etrepum)
Date: 2006-05-22 10:54

Message:
Logged In: YES 
user_id=139309

The attached patch uses a strategy that provides the same 30%-ish performance 
boost for the benchmark, and also provides a small performance boost (about 
9% or so) for the very strange log file.

The key is to use lists for buffering, and to never allow the default buffer size to 
grow too large (512-ish starting point seems to be a sweet spot). This defends 
against working with large strings more often than necessary.

----------------------------------------------------------------------

Comment By: April King (marumari)
Date: 2006-05-22 10:39

Message:
Logged In: YES 
user_id=747439

Actually, the slow speed in that specific circumstance has
nothing to do with the fact that it uses a string style
buffer, which should always be faster than what it used
before (an array of strings that was constantly appended to.)

The problem with that particular file is how the
gzip.readline function auto-optimizes it's read size.

if readsize > self.min_readsize:
  self.min_readsize = readsize

So, it optimizes it's read size to the length of the largest
line that it has seen so far.  The assumption is that
gzipped files are generally going to be a bunch of lines of
similar length, and not wildly differing length.

----------------------------------------------------------------------

Comment By: Bob Ippolito (etrepum)
Date: 2006-05-22 10:29

Message:
Logged In: YES 
user_id=139309

It turns out the performance difference was due to some.. interesting 
characteristics for that particular log file.

>>> import gzip
>>> lengths = [len(line) for line in gzip.GzipFile('TEST.LOG')]
>>> sum(lengths) / float(len(lengths))
45.60349675165147
>>> max(lengths)
117989
>>> min(lengths)
1

The str style buffer in this particular example is going to fail miserably 
reading that one long line.

----------------------------------------------------------------------

Comment By: Bob Ippolito (etrepum)
Date: 2006-05-22 10:12

Message:
Logged In: YES 
user_id=139309

I'm reopening this patch -- it seems that these changes have made parsing 
Apache style log files MUCH slower (4x on some samples).

----------------------------------------------------------------------

Comment By: Bob Ippolito (etrepum)
Date: 2006-05-22 09:32

Message:
Logged In: YES 
user_id=139309

Applied in revision 46070

----------------------------------------------------------------------

Comment By: Bob Ippolito (etrepum)
Date: 2006-05-22 09:11

Message:
Logged In: YES 
user_id=139309

This patch is about a 30% win on Mac OS X i386 using this benchmark:
http://svn.python.org/view/sandbox/trunk/gzipbench/gzipbench.py

I'm going to look to see if there's any other low hanging fruit in there before I 
commit.

----------------------------------------------------------------------

Comment By: April King (marumari)
Date: 2005-09-04 12:57

Message:
Logged In: YES 
user_id=747439

See attached text file for the detailed description (that's
much more readable).

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1281707&group_id=5470

From noreply at sourceforge.net  Mon May 22 18:40:06 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 22 May 2006 09:40:06 -0700
Subject: [Patches] [ python-Patches-1281707 ] Speed up gzip.readline (~40%)
Message-ID: <E1FiDRu-0001GG-Or@sc8-sf-web2.sourceforge.net>

Patches item #1281707, was opened at 2005-09-04 13:53
Message generated for change (Comment added) made by etrepum
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1281707&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: April King (marumari)
Assigned to: Bob Ippolito (etrepum)
Summary: Speed up gzip.readline (~40%)

Initial Comment:
See bug 849046 for history.  This patch passes both the
regression test and the standard test.  Hopefully the
extra information below won't be too difficult to read.
 I can attach this info to the bug, if need be.

Fixed:
  - Add self.min_readsize to __init__.
    Follows the principal that lines are likely to be
the same length in size,
    and doesn't start over at a minimum length string
every call to readline()
  - Rewriting of assignment for readsize and size at
the beginning of function.
    Eliminates almost all calls to min()
  - Change bufs to a string, and not an array.  No
point in using an array when
    all you do with it is "".join(bufs).  Uses string
addition instead.
  - Remove extra assignments to bufs (in return())
  - Changes readline() to be much more readable (loop
reordering, more comments)

Recommendations:
  - Delete _unread() function.  It is used _only_ by
readline(), and moving its
    functionality into readline() itself saves the
function call overhead.
    _unread() is only 3 lines long.  Testing shows that
removing it speeds
    readline() up by about 3%.  Backwards compatibility
concerns?

Testing results:
test_append (__main__.TestGzip) ... ok
test_many_append (__main__.TestGzip) ... ok
test_mode (__main__.TestGzip) ... ok
test_read (__main__.TestGzip) ... ok
test_readline (__main__.TestGzip) ... ok
test_readlines (__main__.TestGzip) ... ok
test_seek_read (__main__.TestGzip) ... ok
test_seek_write (__main__.TestGzip) ... ok
test_write (__main__.TestGzip) ... ok

----------------------------------------------------------------------
Ran 9 tests in 0.331s

Regression tests:
python regrtest.py -g test_gzip.py
test_gzip
1 test OK.

---

Profiling Results (performed on a common compressed log
file - 200748 lines).

With patch...

         1213961 function calls in 12.188 CPU seconds

   Ordered by: standard name

   ncalls  tottime  percall  cumtime  percall
filename:lineno(function)
        1    0.000    0.000    0.000    0.000 :0(close)
     1159    0.020    0.000    0.020    0.000 :0(crc32)
     1158    0.100    0.000    0.100    0.000
:0(decompress)
        1    0.000    0.000    0.000    0.000
:0(decompressobj)
   200774    0.812    0.000    0.812    0.000 :0(find)
   403865    0.902    0.000    0.902    0.000 :0(len)
     1183    0.000    0.000    0.000    0.000 :0(min)
        2    0.000    0.000    0.000    0.000 :0(ord)
     1173    0.000    0.000    0.000    0.000 :0(read)
       12    0.000    0.000    0.000    0.000 :0(seek)
        1    0.000    0.000    0.000    0.000
:0(setprofile)
       18    0.000    0.000    0.000    0.000 :0(tell)
        2    0.000    0.000    0.000    0.000 :0(unpack)
        1    0.000    0.000   12.188   12.188 <string>:1(?)
        1    0.000    0.000    0.000    0.000
gzip_new.py:156(_init_read)
        1    0.000    0.000    0.000    0.000
gzip_new.py:160(_read_gzip_header)
        3    0.000    0.000    0.000    0.000
gzip_new.py:18(U32)
   200774    2.453    0.000    2.593    0.000
gzip_new.py:207(read)
   200749    2.894    0.000    3.796    0.000
gzip_new.py:239(_unread)
     1166    0.010    0.000    0.140    0.000
gzip_new.py:244(_read)
        1    0.000    0.000    0.000    0.000
gzip_new.py:27(LOWU32)
     1158    0.010    0.000    0.030    0.000
gzip_new.py:294(_add_read_data)
        1    0.000    0.000    0.000    0.000
gzip_new.py:300(_read_eof)
        1    0.000    0.000    0.000    0.000
gzip_new.py:314(close)
        1    0.000    0.000    0.000    0.000
gzip_new.py:327(__del__)
   200749    3.916    0.000   11.117    0.000
gzip_new.py:384(readline)
        2    0.000    0.000    0.000    0.000
gzip_new.py:39(read32)
        1    0.000    0.000    0.000    0.000
gzip_new.py:42(open)
        1    0.000    0.000    0.000    0.000
gzip_new.py:60(__init__)
        1    0.000    0.000   12.188   12.188
profile:0(gunzip_gzip_new_open())
        0    0.000             0.000         
profile:0(profiler)
        1    1.071    1.071   12.188   12.188
test_gzip_speed.py:14(gunzip_gzip_new_open)

Without patch...

         2073328 function calls in 18.597 CPU seconds

   Ordered by: standard name

   ncalls  tottime  percall  cumtime  percall
filename:lineno(function)
   243820    0.735    0.000    0.735    0.000 :0(append)
        1    0.000    0.000    0.000    0.000 :0(close)
     1159    0.040    0.000    0.040    0.000 :0(crc32)
     1158    0.100    0.000    0.100    0.000
:0(decompress)
        1    0.000    0.000    0.000    0.000
:0(decompressobj)
   243820    0.960    0.000    0.960    0.000 :0(find)
   200749    0.801    0.000    0.801    0.000 :0(join)
   489958    1.330    0.000    1.330    0.000 :0(len)
   243820    0.791    0.000    0.791    0.000 :0(min)
        2    0.000    0.000    0.000    0.000 :0(ord)
     1173    0.030    0.000    0.030    0.000 :0(read)
        6    0.000    0.000    0.000    0.000 :0(seek)
        1    0.000    0.000    0.000    0.000
:0(setprofile)
        6    0.000    0.000    0.000    0.000 :0(tell)
        2    0.000    0.000    0.000    0.000 :0(unpack)
        1    0.000    0.000   18.597   18.597 <string>:1(?)
        1    0.000    0.000    0.000    0.000
gzip.py:154(_init_read)
        1    0.000    0.000    0.000    0.000
gzip.py:158(_read_gzip_header)
        3    0.000    0.000    0.000    0.000
gzip.py:18(U32)
   243820    2.711    0.000    2.921    0.000
gzip.py:205(read)
   200749    3.083    0.000    4.143    0.000
gzip.py:237(_unread)
     1160    0.010    0.000    0.210    0.000
gzip.py:242(_read)
        1    0.000    0.000    0.000    0.000
gzip.py:27(LOWU32)
     1158    0.030    0.000    0.070    0.000
gzip.py:292(_add_read_data)
        1    0.000    0.000    0.000    0.000
gzip.py:298(_read_eof)
        1    0.000    0.000    0.000    0.000
gzip.py:312(close)
        1    0.000    0.000    0.000    0.000
gzip.py:325(__del__)
   200749    6.934    0.000   17.555    0.000
gzip.py:379(readline)
        2    0.000    0.000    0.000    0.000
gzip.py:39(read32)
        1    0.000    0.000    0.000    0.000
gzip.py:42(open)
        1    0.000    0.000    0.000    0.000
gzip.py:59(__init__)
        1    0.000    0.000   18.597   18.597
profile:0(gunzip_gzip_open())
        0    0.000             0.000         
profile:0(profiler)
        1    1.042    1.042   18.597   18.597
test_gzip_speed.py:7(gunzip_gzip_open)

Using popen + gunzip -c...

         200754 function calls in 4.338 CPU seconds

   Ordered by: standard name

   ncalls  tottime  percall  cumtime  percall
filename:lineno(function)
        1    0.000    0.000    0.000    0.000 :0(popen)
   200749    3.578    0.000    3.578    0.000 :0(readline)
        1    0.000    0.000    0.000    0.000
:0(setprofile)
        1    0.240    0.240    4.338    4.338 <string>:1(?)
        1    0.000    0.000    4.338    4.338
profile:0(gunzip_popen())
        0    0.000             0.000         
profile:0(profiler)
        1    0.520    0.520    4.098    4.098
test_gzip_speed.py:21(gunzip_popen)

----------------------------------------------------------------------

>Comment By: Bob Ippolito (etrepum)
Date: 2006-05-22 12:40

Message:
Logged In: YES 
user_id=139309

Using a string is over 4x slower (at least on this platform) if the strings get 
large, that's not acceptable. Using a list is a compromise that provides good (but 
not optimal) performance when dealing with lines of arbitrary length.

----------------------------------------------------------------------

Comment By: April King (marumari)
Date: 2006-05-22 12:08

Message:
Logged In: YES 
user_id=747439

There was generally a 5-10% speed improvement for using a
string.  This is because the cost of recreating the string
by appending was less than the cost of creating an array,
appending the the array, and then joining it back together.

I would recommend trying leaving it as a string, but
changing this:
if readsize > self.min_readsize:
  self.min_readsize = int(self.min_readsize * 1.25)

(or some kind of scaling factor)

----------------------------------------------------------------------

Comment By: Bob Ippolito (etrepum)
Date: 2006-05-22 11:59

Message:
Logged In: YES 
user_id=139309

Applied in revision 46075

----------------------------------------------------------------------

Comment By: Bob Ippolito (etrepum)
Date: 2006-05-22 11:54

Message:
Logged In: YES 
user_id=139309

The attached patch uses a strategy that provides the same 30%-ish performance 
boost for the benchmark, and also provides a small performance boost (about 
9% or so) for the very strange log file.

The key is to use lists for buffering, and to never allow the default buffer size to 
grow too large (512-ish starting point seems to be a sweet spot). This defends 
against working with large strings more often than necessary.

----------------------------------------------------------------------

Comment By: April King (marumari)
Date: 2006-05-22 11:39

Message:
Logged In: YES 
user_id=747439

Actually, the slow speed in that specific circumstance has
nothing to do with the fact that it uses a string style
buffer, which should always be faster than what it used
before (an array of strings that was constantly appended to.)

The problem with that particular file is how the
gzip.readline function auto-optimizes it's read size.

if readsize > self.min_readsize:
  self.min_readsize = readsize

So, it optimizes it's read size to the length of the largest
line that it has seen so far.  The assumption is that
gzipped files are generally going to be a bunch of lines of
similar length, and not wildly differing length.

----------------------------------------------------------------------

Comment By: Bob Ippolito (etrepum)
Date: 2006-05-22 11:29

Message:
Logged In: YES 
user_id=139309

It turns out the performance difference was due to some.. interesting 
characteristics for that particular log file.

>>> import gzip
>>> lengths = [len(line) for line in gzip.GzipFile('TEST.LOG')]
>>> sum(lengths) / float(len(lengths))
45.60349675165147
>>> max(lengths)
117989
>>> min(lengths)
1

The str style buffer in this particular example is going to fail miserably 
reading that one long line.

----------------------------------------------------------------------

Comment By: Bob Ippolito (etrepum)
Date: 2006-05-22 11:12

Message:
Logged In: YES 
user_id=139309

I'm reopening this patch -- it seems that these changes have made parsing 
Apache style log files MUCH slower (4x on some samples).

----------------------------------------------------------------------

Comment By: Bob Ippolito (etrepum)
Date: 2006-05-22 10:32

Message:
Logged In: YES 
user_id=139309

Applied in revision 46070

----------------------------------------------------------------------

Comment By: Bob Ippolito (etrepum)
Date: 2006-05-22 10:11

Message:
Logged In: YES 
user_id=139309

This patch is about a 30% win on Mac OS X i386 using this benchmark:
http://svn.python.org/view/sandbox/trunk/gzipbench/gzipbench.py

I'm going to look to see if there's any other low hanging fruit in there before I 
commit.

----------------------------------------------------------------------

Comment By: April King (marumari)
Date: 2005-09-04 13:57

Message:
Logged In: YES 
user_id=747439

See attached text file for the detailed description (that's
much more readable).

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1281707&group_id=5470

From noreply at sourceforge.net  Mon May 22 18:47:58 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 22 May 2006 09:47:58 -0700
Subject: [Patches] [ python-Patches-1281707 ] Speed up gzip.readline (~40%)
Message-ID: <E1FiDZW-00057p-Ng@sc8-sf-web1.sourceforge.net>

Patches item #1281707, was opened at 2005-09-04 12:53
Message generated for change (Comment added) made by marumari
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1281707&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: April King (marumari)
Assigned to: Bob Ippolito (etrepum)
Summary: Speed up gzip.readline (~40%)

Initial Comment:
See bug 849046 for history.  This patch passes both the
regression test and the standard test.  Hopefully the
extra information below won't be too difficult to read.
 I can attach this info to the bug, if need be.

Fixed:
  - Add self.min_readsize to __init__.
    Follows the principal that lines are likely to be
the same length in size,
    and doesn't start over at a minimum length string
every call to readline()
  - Rewriting of assignment for readsize and size at
the beginning of function.
    Eliminates almost all calls to min()
  - Change bufs to a string, and not an array.  No
point in using an array when
    all you do with it is "".join(bufs).  Uses string
addition instead.
  - Remove extra assignments to bufs (in return())
  - Changes readline() to be much more readable (loop
reordering, more comments)

Recommendations:
  - Delete _unread() function.  It is used _only_ by
readline(), and moving its
    functionality into readline() itself saves the
function call overhead.
    _unread() is only 3 lines long.  Testing shows that
removing it speeds
    readline() up by about 3%.  Backwards compatibility
concerns?

Testing results:
test_append (__main__.TestGzip) ... ok
test_many_append (__main__.TestGzip) ... ok
test_mode (__main__.TestGzip) ... ok
test_read (__main__.TestGzip) ... ok
test_readline (__main__.TestGzip) ... ok
test_readlines (__main__.TestGzip) ... ok
test_seek_read (__main__.TestGzip) ... ok
test_seek_write (__main__.TestGzip) ... ok
test_write (__main__.TestGzip) ... ok

----------------------------------------------------------------------
Ran 9 tests in 0.331s

Regression tests:
python regrtest.py -g test_gzip.py
test_gzip
1 test OK.

---

Profiling Results (performed on a common compressed log
file - 200748 lines).

With patch...

         1213961 function calls in 12.188 CPU seconds

   Ordered by: standard name

   ncalls  tottime  percall  cumtime  percall
filename:lineno(function)
        1    0.000    0.000    0.000    0.000 :0(close)
     1159    0.020    0.000    0.020    0.000 :0(crc32)
     1158    0.100    0.000    0.100    0.000
:0(decompress)
        1    0.000    0.000    0.000    0.000
:0(decompressobj)
   200774    0.812    0.000    0.812    0.000 :0(find)
   403865    0.902    0.000    0.902    0.000 :0(len)
     1183    0.000    0.000    0.000    0.000 :0(min)
        2    0.000    0.000    0.000    0.000 :0(ord)
     1173    0.000    0.000    0.000    0.000 :0(read)
       12    0.000    0.000    0.000    0.000 :0(seek)
        1    0.000    0.000    0.000    0.000
:0(setprofile)
       18    0.000    0.000    0.000    0.000 :0(tell)
        2    0.000    0.000    0.000    0.000 :0(unpack)
        1    0.000    0.000   12.188   12.188 <string>:1(?)
        1    0.000    0.000    0.000    0.000
gzip_new.py:156(_init_read)
        1    0.000    0.000    0.000    0.000
gzip_new.py:160(_read_gzip_header)
        3    0.000    0.000    0.000    0.000
gzip_new.py:18(U32)
   200774    2.453    0.000    2.593    0.000
gzip_new.py:207(read)
   200749    2.894    0.000    3.796    0.000
gzip_new.py:239(_unread)
     1166    0.010    0.000    0.140    0.000
gzip_new.py:244(_read)
        1    0.000    0.000    0.000    0.000
gzip_new.py:27(LOWU32)
     1158    0.010    0.000    0.030    0.000
gzip_new.py:294(_add_read_data)
        1    0.000    0.000    0.000    0.000
gzip_new.py:300(_read_eof)
        1    0.000    0.000    0.000    0.000
gzip_new.py:314(close)
        1    0.000    0.000    0.000    0.000
gzip_new.py:327(__del__)
   200749    3.916    0.000   11.117    0.000
gzip_new.py:384(readline)
        2    0.000    0.000    0.000    0.000
gzip_new.py:39(read32)
        1    0.000    0.000    0.000    0.000
gzip_new.py:42(open)
        1    0.000    0.000    0.000    0.000
gzip_new.py:60(__init__)
        1    0.000    0.000   12.188   12.188
profile:0(gunzip_gzip_new_open())
        0    0.000             0.000         
profile:0(profiler)
        1    1.071    1.071   12.188   12.188
test_gzip_speed.py:14(gunzip_gzip_new_open)

Without patch...

         2073328 function calls in 18.597 CPU seconds

   Ordered by: standard name

   ncalls  tottime  percall  cumtime  percall
filename:lineno(function)
   243820    0.735    0.000    0.735    0.000 :0(append)
        1    0.000    0.000    0.000    0.000 :0(close)
     1159    0.040    0.000    0.040    0.000 :0(crc32)
     1158    0.100    0.000    0.100    0.000
:0(decompress)
        1    0.000    0.000    0.000    0.000
:0(decompressobj)
   243820    0.960    0.000    0.960    0.000 :0(find)
   200749    0.801    0.000    0.801    0.000 :0(join)
   489958    1.330    0.000    1.330    0.000 :0(len)
   243820    0.791    0.000    0.791    0.000 :0(min)
        2    0.000    0.000    0.000    0.000 :0(ord)
     1173    0.030    0.000    0.030    0.000 :0(read)
        6    0.000    0.000    0.000    0.000 :0(seek)
        1    0.000    0.000    0.000    0.000
:0(setprofile)
        6    0.000    0.000    0.000    0.000 :0(tell)
        2    0.000    0.000    0.000    0.000 :0(unpack)
        1    0.000    0.000   18.597   18.597 <string>:1(?)
        1    0.000    0.000    0.000    0.000
gzip.py:154(_init_read)
        1    0.000    0.000    0.000    0.000
gzip.py:158(_read_gzip_header)
        3    0.000    0.000    0.000    0.000
gzip.py:18(U32)
   243820    2.711    0.000    2.921    0.000
gzip.py:205(read)
   200749    3.083    0.000    4.143    0.000
gzip.py:237(_unread)
     1160    0.010    0.000    0.210    0.000
gzip.py:242(_read)
        1    0.000    0.000    0.000    0.000
gzip.py:27(LOWU32)
     1158    0.030    0.000    0.070    0.000
gzip.py:292(_add_read_data)
        1    0.000    0.000    0.000    0.000
gzip.py:298(_read_eof)
        1    0.000    0.000    0.000    0.000
gzip.py:312(close)
        1    0.000    0.000    0.000    0.000
gzip.py:325(__del__)
   200749    6.934    0.000   17.555    0.000
gzip.py:379(readline)
        2    0.000    0.000    0.000    0.000
gzip.py:39(read32)
        1    0.000    0.000    0.000    0.000
gzip.py:42(open)
        1    0.000    0.000    0.000    0.000
gzip.py:59(__init__)
        1    0.000    0.000   18.597   18.597
profile:0(gunzip_gzip_open())
        0    0.000             0.000         
profile:0(profiler)
        1    1.042    1.042   18.597   18.597
test_gzip_speed.py:7(gunzip_gzip_open)

Using popen + gunzip -c...

         200754 function calls in 4.338 CPU seconds

   Ordered by: standard name

   ncalls  tottime  percall  cumtime  percall
filename:lineno(function)
        1    0.000    0.000    0.000    0.000 :0(popen)
   200749    3.578    0.000    3.578    0.000 :0(readline)
        1    0.000    0.000    0.000    0.000
:0(setprofile)
        1    0.240    0.240    4.338    4.338 <string>:1(?)
        1    0.000    0.000    4.338    4.338
profile:0(gunzip_popen())
        0    0.000             0.000         
profile:0(profiler)
        1    0.520    0.520    4.098    4.098
test_gzip_speed.py:21(gunzip_popen)

----------------------------------------------------------------------

>Comment By: April King (marumari)
Date: 2006-05-22 11:47

Message:
Logged In: YES 
user_id=747439

Okie dokie.  30% is still a welcome speedup.  :)

----------------------------------------------------------------------

Comment By: Bob Ippolito (etrepum)
Date: 2006-05-22 11:40

Message:
Logged In: YES 
user_id=139309

Using a string is over 4x slower (at least on this platform) if the strings get 
large, that's not acceptable. Using a list is a compromise that provides good (but 
not optimal) performance when dealing with lines of arbitrary length.

----------------------------------------------------------------------

Comment By: April King (marumari)
Date: 2006-05-22 11:08

Message:
Logged In: YES 
user_id=747439

There was generally a 5-10% speed improvement for using a
string.  This is because the cost of recreating the string
by appending was less than the cost of creating an array,
appending the the array, and then joining it back together.

I would recommend trying leaving it as a string, but
changing this:
if readsize > self.min_readsize:
  self.min_readsize = int(self.min_readsize * 1.25)

(or some kind of scaling factor)

----------------------------------------------------------------------

Comment By: Bob Ippolito (etrepum)
Date: 2006-05-22 10:59

Message:
Logged In: YES 
user_id=139309

Applied in revision 46075

----------------------------------------------------------------------

Comment By: Bob Ippolito (etrepum)
Date: 2006-05-22 10:54

Message:
Logged In: YES 
user_id=139309

The attached patch uses a strategy that provides the same 30%-ish performance 
boost for the benchmark, and also provides a small performance boost (about 
9% or so) for the very strange log file.

The key is to use lists for buffering, and to never allow the default buffer size to 
grow too large (512-ish starting point seems to be a sweet spot). This defends 
against working with large strings more often than necessary.

----------------------------------------------------------------------

Comment By: April King (marumari)
Date: 2006-05-22 10:39

Message:
Logged In: YES 
user_id=747439

Actually, the slow speed in that specific circumstance has
nothing to do with the fact that it uses a string style
buffer, which should always be faster than what it used
before (an array of strings that was constantly appended to.)

The problem with that particular file is how the
gzip.readline function auto-optimizes it's read size.

if readsize > self.min_readsize:
  self.min_readsize = readsize

So, it optimizes it's read size to the length of the largest
line that it has seen so far.  The assumption is that
gzipped files are generally going to be a bunch of lines of
similar length, and not wildly differing length.

----------------------------------------------------------------------

Comment By: Bob Ippolito (etrepum)
Date: 2006-05-22 10:29

Message:
Logged In: YES 
user_id=139309

It turns out the performance difference was due to some.. interesting 
characteristics for that particular log file.

>>> import gzip
>>> lengths = [len(line) for line in gzip.GzipFile('TEST.LOG')]
>>> sum(lengths) / float(len(lengths))
45.60349675165147
>>> max(lengths)
117989
>>> min(lengths)
1

The str style buffer in this particular example is going to fail miserably 
reading that one long line.

----------------------------------------------------------------------

Comment By: Bob Ippolito (etrepum)
Date: 2006-05-22 10:12

Message:
Logged In: YES 
user_id=139309

I'm reopening this patch -- it seems that these changes have made parsing 
Apache style log files MUCH slower (4x on some samples).

----------------------------------------------------------------------

Comment By: Bob Ippolito (etrepum)
Date: 2006-05-22 09:32

Message:
Logged In: YES 
user_id=139309

Applied in revision 46070

----------------------------------------------------------------------

Comment By: Bob Ippolito (etrepum)
Date: 2006-05-22 09:11

Message:
Logged In: YES 
user_id=139309

This patch is about a 30% win on Mac OS X i386 using this benchmark:
http://svn.python.org/view/sandbox/trunk/gzipbench/gzipbench.py

I'm going to look to see if there's any other low hanging fruit in there before I 
commit.

----------------------------------------------------------------------

Comment By: April King (marumari)
Date: 2005-09-04 12:57

Message:
Logged In: YES 
user_id=747439

See attached text file for the detailed description (that's
much more readable).

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1281707&group_id=5470

From noreply at sourceforge.net  Mon May 22 19:25:54 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 22 May 2006 10:25:54 -0700
Subject: [Patches] [ python-Patches-876206 ] scary frame speed hacks
Message-ID: <E1FiEAE-0007pk-7l@sc8-sf-web1.sourceforge.net>

Patches item #876206, was opened at 2004-01-14 03:49
Message generated for change (Comment added) made by richard
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=876206&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Michael Hudson (mwh)
Assigned to: Jeremy Hylton (jhylton)
Summary: scary frame speed hacks

Initial Comment:
In ceval.c we find

		/* XXX Perhaps we should create a specialized
		   PyFrame_New() that doesn't take locals, but does
		   take builtins without sanity checking them.
		*/

This patch takes that idea rather further than you
might have expected... it creates a "light" subtype of
frame that assumes certain things about the frame,
gives this type its own free list (so it can assume
more about objects on the freelist) and converts light
frames into "heavy" frames when assumptions stop being
true.

Good for a ~5% improvement on "./python -s 'def f():
pass' 'f()'"; a bit less on pystone.  It also conflicts
slightly with my function reorg patch -- apply that
first, apply this, ignore the reject and edit
func_caller_nofrees in funcobject.c to call
PyFrame_NewLight.

All three patches I just submitted together get ~6% on
pystone.

----------------------------------------------------------------------

Comment By: Richard Jones (richard)
Date: 2006-05-23 03:25

Message:
Logged In: YES 
user_id=6405

Patch modified and applied to python2.5. Mods:

1. updated to python2.5
2. reinstated use of free list

See the "rjones-funccall" branch in SVN.


----------------------------------------------------------------------

Comment By: Georg Brandl (gbrandl)
Date: 2006-02-20 22:44

Message:
Logged In: YES 
user_id=849994

Can this be reviewed for 2.5? The relevant discussion on
python-dev is at
http://mail.python.org/pipermail/python-dev/2004-March/042871.html.

----------------------------------------------------------------------

Comment By: Raymond Hettinger (rhettinger)
Date: 2004-03-02 02:50

Message:
Logged In: YES 
user_id=80475

I think that's a question for python-dev.

----------------------------------------------------------------------

Comment By: Armin Rigo (arigo)
Date: 2004-03-02 00:59

Message:
Logged In: YES 
user_id=4771

On a small recursive example it slows down from 2.64s to 3.26s. 
This is a serious difference (20%).  Is it bad enough to keep the 
freelist ?

----------------------------------------------------------------------

Comment By: Raymond Hettinger (rhettinger)
Date: 2004-03-02 00:25

Message:
Logged In: YES 
user_id=80475

The effect on recursive functions could be mitigated by
restoring the freelist and falling back to it when
code->co_zombieframe == NULL.

I don't know if that is worth it.  The current patch is a
code simplication as well as an optimization.  Adding back
the freelist, adds a lot of clutter.  Python is not
especially friendly to recursive functions anyway.

----------------------------------------------------------------------

Comment By: Michael Hudson (mwh)
Date: 2004-03-01 23:23

Message:
Logged In: YES 
user_id=6656

It slows recursive functions down a noticeable amount (did
this get noted anywhere?  Maybe Armin & I just talked about
it on IRC), so that should be considered before this patch
is applied.  But I think it's probably worth it, FWIW.

----------------------------------------------------------------------

Comment By: Raymond Hettinger (rhettinger)
Date: 2004-03-01 23:20

Message:
Logged In: YES 
user_id=80475

Armin's second patch gives gives the expected speedups on a
Pentium3 running WinME, and the test suite runs without
exception.  I recommend accepting and applying this patch as
is.  Further improvements can be considered separately.

----------------------------------------------------------------------

Comment By: Armin Rigo (arigo)
Date: 2004-02-03 00:04

Message:
Logged In: YES 
user_id=4771

I guess the idea was just in the air, after your published attempts.

Ideally I'd have liked to have the cached frame depend on the globals as well as the code object itself; I considered moving the cache field to function objects.  This way you also save the f_globals and f_builtins initialization.  There were problems but maybe we should try harder.

----------------------------------------------------------------------

Comment By: Michael Hudson (mwh)
Date: 2004-02-02 22:20

Message:
Logged In: YES 
user_id=6656

Did I mention this idea to you or did you come up with it 
independently?  I forget...

I'll try to time stuff on my iBook tomorrow.

----------------------------------------------------------------------

Comment By: Armin Rigo (arigo)
Date: 2004-01-28 04:03

Message:
Logged In: YES 
user_id=4771

Here is yet another try, which seems to perform better on my PentiumIII.  I get the following speed improvements for this patch alone, for a loop calling an empty function:

zombie-frames.diff: 11.4%      (PyStone 3.8%)
scary-frame-hacks.diff: 6.4%   (PyStone 0.85%)

The idea is to get rid of the free_list and instead store the most recently finished ("zombie") frame in an internal field of the code object.  This saves half of the frame creation overhead because half of the fields are already correct when the frame is reused, e.g. f_code, f_nlocals, f_stacksize, f_valuestack...

(you might need to cvs up frameobject.c before you can apply the patch)

----------------------------------------------------------------------

Comment By: Michael Hudson (mwh)
Date: 2004-01-15 22:02

Message:
Logged In: YES 
user_id=6656

I'm fairly sure this made a difference on my iBook; haven't
tried on x86.

It's possible that the correct response to this patch is to
add "... nah, not worth it" to the XXX comment in ceval.c...

----------------------------------------------------------------------

Comment By: Armin Rigo (arigo)
Date: 2004-01-15 04:35

Message:
Logged In: YES 
user_id=4771

(Side note first: I'm not sure 'builtins = back->f_builtins'
is right.)

Is the whole subclassing complexity worth the effort, given
that the invariants of light frames only seem to be that
four specific fields are null?  Changing the type of an
object under Python code's feet is calling for troubles. 
Moreover it is bound to break code that expect
'type(frame)==FrameType', even if such code can be
considered bad style.

Moreover it requires a number of hacks here and there --
e.g. you turn a light frame into a "heavy" frame when
f_trace is set; is it on purpose that you don't do it when
f_locals is set?

I cannot seem to get reliable performance results on my
machine, but maybe you want to compare with the attached
patch which speeds up the regular PyFrame_New by putting
stronger invariants on all the frames in the free_list.


----------------------------------------------------------------------

Comment By: Michael Hudson (mwh)
Date: 2004-01-14 05:23

Message:
Logged In: YES 
user_id=6656

sigh

----------------------------------------------------------------------

Comment By: Jeremy Hylton (jhylton)
Date: 2004-01-14 05:20

Message:
Logged In: YES 
user_id=31392

I don't see any files attached.


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=876206&group_id=5470

From noreply at sourceforge.net  Mon May 22 19:53:20 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 22 May 2006 10:53:20 -0700
Subject: [Patches] [ python-Patches-1493102 ] Allow build without tracing
Message-ID: <E1FiEam-00080G-02@sc8-sf-web1.sourceforge.net>

Patches item #1493102, was opened at 2006-05-22 13:53
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1493102&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Steve Holden (holdenweb)
Assigned to: Nobody/Anonymous (nobody)
Summary: Allow build without tracing

Initial Comment:
This patch allows the tracing code to be conditioned
out by the absence of a definition for the symbol
WITH_TRACING.

This seems to win a worthwhile speed gain.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1493102&group_id=5470

From noreply at sourceforge.net  Mon May 22 22:13:33 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 22 May 2006 13:13:33 -0700
Subject: [Patches] [ python-Patches-1183712 ] package_data chops off first
	char of default package
Message-ID: <E1FiGmT-0006Qn-Be@sc8-sf-web3.sourceforge.net>

Patches item #1183712, was opened at 2005-04-15 14:34
Message generated for change (Comment added) made by calvin
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1183712&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Distutils and setup.py
Group: Python 2.4
Status: Open
Resolution: None
Priority: 5
Submitted By: Wummel (calvin)
Assigned to: Nobody/Anonymous (nobody)
Summary: package_data chops off first char of default package

Initial Comment:
If the package name is an empty string (ie the default
package), all package_data files have the first
character chopped off.
Attached is a test package pytest.tar.gz where running
python2.4 setup.py build_py
produces this error:
running build_py
creating build
creating build/lib
copying __init__.py -> build/lib
error: can't copy 'ATA': doesn't exist or not a regular
file

Also attached is a fix proposal, though I have tested
this only against the test package.


----------------------------------------------------------------------

>Comment By: Wummel (calvin)
Date: 2006-05-22 22:13

Message:
Logged In: YES 
user_id=9205

I found it in another Python program (don't remember which
though). So I did not think of this as an undocumented
feature. I tried it and it worked (except the data file
stuff :).

The patch should not break any currently working setup.py
installation, since src_dir is only empty when using ''
(empty string) as package name.

Perhaps a cleaner approach would be to forbid an empty
package name instead of silently accepting it? I am not
sure. At least the documentation should mention that empty
package names are not allowed.


----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2006-04-15 10:23

Message:
Logged In: YES 
user_id=21627

Why are you using an empty name as the package name? There
is no default package in Python, so this shouldn't work at all.

----------------------------------------------------------------------

Comment By: Herv? Cauwelier (hcauwelier)
Date: 2005-10-05 13:03

Message:
Logged In: YES 
user_id=1216236

The patch worked well for me, thanks for it!

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1183712&group_id=5470

From noreply at sourceforge.net  Tue May 23 07:32:09 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 22 May 2006 22:32:09 -0700
Subject: [Patches] [ python-Patches-1479611 ] speed up function calls
Message-ID: <E1FiPV3-0000Nq-Nw@sc8-sf-web1.sourceforge.net>

Patches item #1479611, was opened at 2006-04-30 23:58
Message generated for change (Comment added) made by nnorwitz
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1479611&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Neal Norwitz (nnorwitz)
Assigned to: Nobody/Anonymous (nobody)
Summary: speed up function calls

Initial Comment:
Results:  2.86% for 1 arg (len), 11.8% for 2 args
(min), and 1.6% for pybench.

trunk-speed$ ./python.exe -m timeit 'for x in
xrange(10000): len([])'
100 loops, best of 3: 4.74 msec per loop
trunk-speed$ ./python.exe -m timeit 'for x in
xrange(10000): min(1,2)'
100 loops, best of 3: 8.03 msec per loop

trunk-clean$ ./python.exe -m timeit 'for x in
xrange(10000): len([])'
100 loops, best of 3: 4.88 msec per loop
trunk-clean$ ./python.exe -m timeit 'for x in
xrange(10000): min(1,2)'
100 loops, best of 3: 9.09 msec per loop

pybench goes from 5688.00 down to 5598.00


Details about the patch:

There are 2 unrelated changes.  They both seem to
provide equal benefits for calling varargs C.  One is
very simple and just inlines calling a varargs C
function rather than calling PyCFunction_Call() which
does extra checks that are already known.  This moves
meth and self up one block. and breaks the C_TRACE into
2.  (When looking at the patch, this will make sense I
hope.)

The other change is more dangerous.  It modifies
load_args() to hold on to tuples so they aren't
allocated and deallocated.  The initialization is done
one time in the new func _PyEval_Init().

It allocates 64 tuples of size 8 that are never
deallocated.  The idea is that there won't be usually
be more than 64 frames with 8 or less parameters active
on the stack at any one time (stack depth).  There are
cases where this can degenerate, but for the most part,
it should only be marginally slower, but generally this
should be a fair amount faster by skipping the alloc
and dealloc and some extra work.  My decrementing the
_last_index inside the needs_free blocks, that could
improve behaviour.

This really needs comments added to the code.  But I'm
not gonna get there tonight.  I'd be interested in
comments about the code.

----------------------------------------------------------------------

>Comment By: Neal Norwitz (nnorwitz)
Date: 2006-05-22 22:32

Message:
Logged In: YES 
user_id=33168

Interesting.  I did the original work for this on an amd64
(gcc 3.4 i think).  And continued work on ppc mac laptop
(gcc 4.0 i think).  Both had improvements.  I assume you
tested with v3?  What about v1?

----------------------------------------------------------------------

Comment By: Bob Ippolito (etrepum)
Date: 2006-05-22 05:02

Message:
Logged In: YES 
user_id=139309

The performance gain for this patch (as-is) on Mac OS X i386 with a release 
build seems totally negligible. I'm not getting any consistent win with any of the 
timeit or pybench benchmarks. 

----------------------------------------------------------------------

Comment By: Neal Norwitz (nnorwitz)
Date: 2006-05-11 00:43

Message:
Logged In: YES 
user_id=33168

This version actually works (in both normal and debug
builds).  It adds some stats which are useful and updates
Misc/SpecialBuilds.txt.

I modified to not preallocate and only hold a ref when the
function didn't keep a ref.

I still need to inline more of PyCFunction_Call.  Speed is
still the same as before.

I'm not sure if I'll finish this before the sprint next
week.  Anyone there feel free to check this in if you finish it.

----------------------------------------------------------------------

Comment By: Neal Norwitz (nnorwitz)
Date: 2006-05-05 01:27

Message:
Logged In: YES 
user_id=33168

v2 attached.  You might not want to review yet.  I mostly
did the first part of your suggest (stats, _Fini, and
stack-like if I understood you correctly).  I didn't do
anything on the second part about inlinting Function_Call.

perf seems to be about the same.  I'm not entirely sure the
patch is correct yet. I found one or two problems in the
original.  I added some more comments. 

----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2006-05-01 01:27

Message:
Logged In: YES 
user_id=21627

The tuples should get deallocated when Py_Finalize is called.

It would be good if there was (conditional) statistical
analysis, showing how often no tuple was found because the
number of arguments was too large, and how often no tuple
was found because the candidate was in use.

I think it should be more stack-like, starting off with no
tuples allocated, then returning them inside the needs_free
blocks only if the refcount is 1 (or 2?). This would avoid
degeneralized cases where some function holds onto its
argument tuple indefinitely, thus consuming all 64 tuples.

For the other part, I think it would make the code more
readable if it inlined PyCFunction_Call even more: the test
for NOARGS|O could be integrated into the switch statement
(one case for each), VARARGS and VARARGS|KEYWORDS would both
load the arguments, then call the function directly
(possibly with NULL keywords). OLDARGS should goto either
METH_NOARGS, METH_O, or METH_VARARGS depending on na (if you
don't like goto, modifying flags would work as well).

----------------------------------------------------------------------

Comment By: Neal Norwitz (nnorwitz)
Date: 2006-05-01 00:08

Message:
Logged In: YES 
user_id=33168

I should note the numbers 64 and 8 are total guesses.  It
might be good to try and determine values based on empirical
data.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1479611&group_id=5470

From noreply at sourceforge.net  Tue May 23 08:23:13 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 22 May 2006 23:23:13 -0700
Subject: [Patches] [ python-Patches-1488098 ] MacOSX: distutils support for
	-arch and -isysroot flags
Message-ID: <E1FiQIT-00063V-15@sc8-sf-web2.sourceforge.net>

Patches item #1488098, was opened at 2006-05-13 14:13
Message generated for change (Comment added) made by nnorwitz
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1488098&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Distutils and setup.py
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Ronald Oussoren (ronaldoussoren)
Assigned to: Nobody/Anonymous (nobody)
Summary: MacOSX: distutils support for -arch and -isysroot flags

Initial Comment:
This flag adds specific support for the -arch and -isysroot flags of GCC 
on MacOSX 10.4 or later.

The patch consists of two parts:

1) Remove these flags (and their arguments) from the base CFLAGS/
LDFLAGS when compiling extensions on OSX 10.3 or earlier because GCC 
doesn't support those arguments in the version of GCC that is shipped 
what the version of the OS.

2) Strip -arch and -isysroot (again including their arguments) from the 
base CFLAGS/LDFLAGS when the user has specified new values for them 
in the extra_compile_args and extra_link args.

The second part is needed because -isysroot can only be specified once 
and the -arch option is incremental, without this patch you cannot 
compile using a different SDK or for fewer architectures.

A reason for wanting to do the latter is software like psyco that is only 
fully supported on one of the architectures for OSX.

----------------------------------------------------------------------

>Comment By: Neal Norwitz (nnorwitz)
Date: 2006-05-22 23:23

Message:
Logged In: YES 
user_id=33168

Heh, I never realized you were Dutch. :-)

It took some time to find.  I thought distutils was in 291,
but as you point out there's nothing there now.  So after
finding some references to distutils and 291, I did an svn
log on the PEP and sure enough:

r1982 | akuchling | 2005-03-20 12:47:01 -0800 (Sun, 20 Mar
2005) | 10 lines

After some discussion at the distutils sprint at PyCon 2005,
it seems that no one really wants to make a new standalone
release of Distutils. Given that, there's no reason for
Distutils code to preserve backward compatibility, so I am
removing the requirement for 2.1 compatibility.

I'm not sure if I'll have time to review this patch soon
(probably over a week).

Is this patch required to get Python working on x86 Macs?  I
know there were a couple of bug reports about x86 Mac.  If
so or you want more testing, it would be better to check
this in sooner rather than later.  I doubt I will find more
on a second review of this patch (and I try to review all
checkins).  So if you want and think it's appropriate go
ahead and check in.  If you want more review, you can try to
wait for me or solicit input from python-dev.

----------------------------------------------------------------------

Comment By: Ronald Oussoren (ronaldoussoren)
Date: 2006-05-19 12:23

Message:
Logged In: YES 
user_id=580910

I've updated the patch with the stylistic changes you've requested.

BTW. I don't think idx is confusing, although I suppose it helps that the Dutch 
term for index is index :-)

BTW. Distutils.archive_util claims it should be kept 2.1 compatible, although I 
don't know if that request covers all of distutils. PEP 291 doesn't mention 
distutils.

----------------------------------------------------------------------

Comment By: Neal Norwitz (nnorwitz)
Date: 2006-05-15 00:41

Message:
Logged In: YES 
user_id=33168

I don't see any obvious problems with the patch.  I have
some nits though:

 * This is pretty complex: int(os.uname()[2].split('.')[0])
   I would prefer if it was broken up and use local
variables to explain better what's going on (or at least a
comment that shows the expected format).
  - same with '.'.join(m.group(1).split('.')[:2])

 * Remove double blank lines at first line of patch in
util.py and the last 3 lines (the pass is not needed).

 * unixcompiler.py, use True/False instead of 1/0.  I forget
what the compatibility of distutils is, but I see other uses
of True and False

   - same comment about getting the kernel with a complex expr

   - I prefer index instead of idx (I don't like abbrevs,
particularly for foreign speakers)

Instead of: 
+        if '-arch' in cc_args:
+            stripArch = 1

just set it:  stripArch = '-arch' in cc_args

Same for stripSysroot

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1488098&group_id=5470

From noreply at sourceforge.net  Tue May 23 08:33:52 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 22 May 2006 23:33:52 -0700
Subject: [Patches] [ python-Patches-1488098 ] MacOSX: distutils support for
	-arch and -isysroot flags
Message-ID: <E1FiQSm-000124-Is@sc8-sf-web4-b.sourceforge.net>

Patches item #1488098, was opened at 2006-05-13 23:13
Message generated for change (Comment added) made by ronaldoussoren
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1488098&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Distutils and setup.py
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Ronald Oussoren (ronaldoussoren)
Assigned to: Nobody/Anonymous (nobody)
Summary: MacOSX: distutils support for -arch and -isysroot flags

Initial Comment:
This flag adds specific support for the -arch and -isysroot flags of GCC 
on MacOSX 10.4 or later.

The patch consists of two parts:

1) Remove these flags (and their arguments) from the base CFLAGS/
LDFLAGS when compiling extensions on OSX 10.3 or earlier because GCC 
doesn't support those arguments in the version of GCC that is shipped 
what the version of the OS.

2) Strip -arch and -isysroot (again including their arguments) from the 
base CFLAGS/LDFLAGS when the user has specified new values for them 
in the extra_compile_args and extra_link args.

The second part is needed because -isysroot can only be specified once 
and the -arch option is incremental, without this patch you cannot 
compile using a different SDK or for fewer architectures.

A reason for wanting to do the latter is software like psyco that is only 
fully supported on one of the architectures for OSX.

----------------------------------------------------------------------

>Comment By: Ronald Oussoren (ronaldoussoren)
Date: 2006-05-23 08:33

Message:
Logged In: YES 
user_id=580910

This patch isn't strictly necessary to get python going on x86, that should 
work just fine as it is right now. The trunk builds fine for me, except for libffi 
but that's a know issue and high on my list.

The primary reason for this patch is to be able to build a universal binary 
distribution of python on 10.4 and then use the result on 10.3. Without this 
patch you won't be able to build extensions on 10.3 in that scenario because 
distutils will use some compiler flags that aren't valid for the compiler that 
ships with 10.3.

I'll check this in.

----------------------------------------------------------------------

Comment By: Neal Norwitz (nnorwitz)
Date: 2006-05-23 08:23

Message:
Logged In: YES 
user_id=33168

Heh, I never realized you were Dutch. :-)

It took some time to find.  I thought distutils was in 291,
but as you point out there's nothing there now.  So after
finding some references to distutils and 291, I did an svn
log on the PEP and sure enough:

r1982 | akuchling | 2005-03-20 12:47:01 -0800 (Sun, 20 Mar
2005) | 10 lines

After some discussion at the distutils sprint at PyCon 2005,
it seems that no one really wants to make a new standalone
release of Distutils. Given that, there's no reason for
Distutils code to preserve backward compatibility, so I am
removing the requirement for 2.1 compatibility.

I'm not sure if I'll have time to review this patch soon
(probably over a week).

Is this patch required to get Python working on x86 Macs?  I
know there were a couple of bug reports about x86 Mac.  If
so or you want more testing, it would be better to check
this in sooner rather than later.  I doubt I will find more
on a second review of this patch (and I try to review all
checkins).  So if you want and think it's appropriate go
ahead and check in.  If you want more review, you can try to
wait for me or solicit input from python-dev.

----------------------------------------------------------------------

Comment By: Ronald Oussoren (ronaldoussoren)
Date: 2006-05-19 21:23

Message:
Logged In: YES 
user_id=580910

I've updated the patch with the stylistic changes you've requested.

BTW. I don't think idx is confusing, although I suppose it helps that the Dutch 
term for index is index :-)

BTW. Distutils.archive_util claims it should be kept 2.1 compatible, although I 
don't know if that request covers all of distutils. PEP 291 doesn't mention 
distutils.

----------------------------------------------------------------------

Comment By: Neal Norwitz (nnorwitz)
Date: 2006-05-15 09:41

Message:
Logged In: YES 
user_id=33168

I don't see any obvious problems with the patch.  I have
some nits though:

 * This is pretty complex: int(os.uname()[2].split('.')[0])
   I would prefer if it was broken up and use local
variables to explain better what's going on (or at least a
comment that shows the expected format).
  - same with '.'.join(m.group(1).split('.')[:2])

 * Remove double blank lines at first line of patch in
util.py and the last 3 lines (the pass is not needed).

 * unixcompiler.py, use True/False instead of 1/0.  I forget
what the compatibility of distutils is, but I see other uses
of True and False

   - same comment about getting the kernel with a complex expr

   - I prefer index instead of idx (I don't like abbrevs,
particularly for foreign speakers)

Instead of: 
+        if '-arch' in cc_args:
+            stripArch = 1

just set it:  stripArch = '-arch' in cc_args

Same for stripSysroot

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1488098&group_id=5470

From noreply at sourceforge.net  Tue May 23 10:45:57 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Tue, 23 May 2006 01:45:57 -0700
Subject: [Patches] [ python-Patches-1479611 ] speed up function calls
Message-ID: <E1FiSWb-00076R-V0@sc8-sf-web4-b.sourceforge.net>

Patches item #1479611, was opened at 2006-05-01 02:58
Message generated for change (Comment added) made by etrepum
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1479611&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Neal Norwitz (nnorwitz)
Assigned to: Nobody/Anonymous (nobody)
Summary: speed up function calls

Initial Comment:
Results:  2.86% for 1 arg (len), 11.8% for 2 args
(min), and 1.6% for pybench.

trunk-speed$ ./python.exe -m timeit 'for x in
xrange(10000): len([])'
100 loops, best of 3: 4.74 msec per loop
trunk-speed$ ./python.exe -m timeit 'for x in
xrange(10000): min(1,2)'
100 loops, best of 3: 8.03 msec per loop

trunk-clean$ ./python.exe -m timeit 'for x in
xrange(10000): len([])'
100 loops, best of 3: 4.88 msec per loop
trunk-clean$ ./python.exe -m timeit 'for x in
xrange(10000): min(1,2)'
100 loops, best of 3: 9.09 msec per loop

pybench goes from 5688.00 down to 5598.00


Details about the patch:

There are 2 unrelated changes.  They both seem to
provide equal benefits for calling varargs C.  One is
very simple and just inlines calling a varargs C
function rather than calling PyCFunction_Call() which
does extra checks that are already known.  This moves
meth and self up one block. and breaks the C_TRACE into
2.  (When looking at the patch, this will make sense I
hope.)

The other change is more dangerous.  It modifies
load_args() to hold on to tuples so they aren't
allocated and deallocated.  The initialization is done
one time in the new func _PyEval_Init().

It allocates 64 tuples of size 8 that are never
deallocated.  The idea is that there won't be usually
be more than 64 frames with 8 or less parameters active
on the stack at any one time (stack depth).  There are
cases where this can degenerate, but for the most part,
it should only be marginally slower, but generally this
should be a fair amount faster by skipping the alloc
and dealloc and some extra work.  My decrementing the
_last_index inside the needs_free blocks, that could
improve behaviour.

This really needs comments added to the code.  But I'm
not gonna get there tonight.  I'd be interested in
comments about the code.

----------------------------------------------------------------------

>Comment By: Bob Ippolito (etrepum)
Date: 2006-05-23 04:45

Message:
Logged In: YES 
user_id=139309

This was v3 on a MacBook Pro running 10.4.6 (gcc 4, of course, since that's the 
only Apple-distributed i386 GCC for OS X).

----------------------------------------------------------------------

Comment By: Neal Norwitz (nnorwitz)
Date: 2006-05-23 01:32

Message:
Logged In: YES 
user_id=33168

Interesting.  I did the original work for this on an amd64
(gcc 3.4 i think).  And continued work on ppc mac laptop
(gcc 4.0 i think).  Both had improvements.  I assume you
tested with v3?  What about v1?

----------------------------------------------------------------------

Comment By: Bob Ippolito (etrepum)
Date: 2006-05-22 08:02

Message:
Logged In: YES 
user_id=139309

The performance gain for this patch (as-is) on Mac OS X i386 with a release 
build seems totally negligible. I'm not getting any consistent win with any of the 
timeit or pybench benchmarks. 

----------------------------------------------------------------------

Comment By: Neal Norwitz (nnorwitz)
Date: 2006-05-11 03:43

Message:
Logged In: YES 
user_id=33168

This version actually works (in both normal and debug
builds).  It adds some stats which are useful and updates
Misc/SpecialBuilds.txt.

I modified to not preallocate and only hold a ref when the
function didn't keep a ref.

I still need to inline more of PyCFunction_Call.  Speed is
still the same as before.

I'm not sure if I'll finish this before the sprint next
week.  Anyone there feel free to check this in if you finish it.

----------------------------------------------------------------------

Comment By: Neal Norwitz (nnorwitz)
Date: 2006-05-05 04:27

Message:
Logged In: YES 
user_id=33168

v2 attached.  You might not want to review yet.  I mostly
did the first part of your suggest (stats, _Fini, and
stack-like if I understood you correctly).  I didn't do
anything on the second part about inlinting Function_Call.

perf seems to be about the same.  I'm not entirely sure the
patch is correct yet. I found one or two problems in the
original.  I added some more comments. 

----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2006-05-01 04:27

Message:
Logged In: YES 
user_id=21627

The tuples should get deallocated when Py_Finalize is called.

It would be good if there was (conditional) statistical
analysis, showing how often no tuple was found because the
number of arguments was too large, and how often no tuple
was found because the candidate was in use.

I think it should be more stack-like, starting off with no
tuples allocated, then returning them inside the needs_free
blocks only if the refcount is 1 (or 2?). This would avoid
degeneralized cases where some function holds onto its
argument tuple indefinitely, thus consuming all 64 tuples.

For the other part, I think it would make the code more
readable if it inlined PyCFunction_Call even more: the test
for NOARGS|O could be integrated into the switch statement
(one case for each), VARARGS and VARARGS|KEYWORDS would both
load the arguments, then call the function directly
(possibly with NULL keywords). OLDARGS should goto either
METH_NOARGS, METH_O, or METH_VARARGS depending on na (if you
don't like goto, modifying flags would work as well).

----------------------------------------------------------------------

Comment By: Neal Norwitz (nnorwitz)
Date: 2006-05-01 03:08

Message:
Logged In: YES 
user_id=33168

I should note the numbers 64 and 8 are total guesses.  It
might be good to try and determine values based on empirical
data.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1479611&group_id=5470

From noreply at sourceforge.net  Tue May 23 13:16:41 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Tue, 23 May 2006 04:16:41 -0700
Subject: [Patches] [ python-Patches-876206 ] scary frame speed hacks
Message-ID: <E1FiUsT-0008GB-U8@sc8-sf-web3.sourceforge.net>

Patches item #876206, was opened at 2004-01-13 11:49
Message generated for change (Settings changed) made by tim_one
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=876206&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.5
>Status: Closed
>Resolution: Accepted
Priority: 5
Submitted By: Michael Hudson (mwh)
>Assigned to: Nobody/Anonymous (nobody)
Summary: scary frame speed hacks

Initial Comment:
In ceval.c we find

		/* XXX Perhaps we should create a specialized
		   PyFrame_New() that doesn't take locals, but does
		   take builtins without sanity checking them.
		*/

This patch takes that idea rather further than you
might have expected... it creates a "light" subtype of
frame that assumes certain things about the frame,
gives this type its own free list (so it can assume
more about objects on the freelist) and converts light
frames into "heavy" frames when assumptions stop being
true.

Good for a ~5% improvement on "./python -s 'def f():
pass' 'f()'"; a bit less on pystone.  It also conflicts
slightly with my function reorg patch -- apply that
first, apply this, ignore the reject and edit
func_caller_nofrees in funcobject.c to call
PyFrame_NewLight.

All three patches I just submitted together get ~6% on
pystone.

----------------------------------------------------------------------

>Comment By: Tim Peters (tim_one)
Date: 2006-05-23 07:16

Message:
Logged In: YES 
user_id=31435

Closed as Accepted.  While re-adding the free list removed
the code simplification benefit, the measurable x-platform
speedup is well worth getting.

----------------------------------------------------------------------

Comment By: Richard Jones (richard)
Date: 2006-05-22 13:25

Message:
Logged In: YES 
user_id=6405

Patch modified and applied to python2.5. Mods:

1. updated to python2.5
2. reinstated use of free list

See the "rjones-funccall" branch in SVN.


----------------------------------------------------------------------

Comment By: Georg Brandl (gbrandl)
Date: 2006-02-20 06:44

Message:
Logged In: YES 
user_id=849994

Can this be reviewed for 2.5? The relevant discussion on
python-dev is at
http://mail.python.org/pipermail/python-dev/2004-March/042871.html.

----------------------------------------------------------------------

Comment By: Raymond Hettinger (rhettinger)
Date: 2004-03-01 10:50

Message:
Logged In: YES 
user_id=80475

I think that's a question for python-dev.

----------------------------------------------------------------------

Comment By: Armin Rigo (arigo)
Date: 2004-03-01 08:59

Message:
Logged In: YES 
user_id=4771

On a small recursive example it slows down from 2.64s to 3.26s. 
This is a serious difference (20%).  Is it bad enough to keep the 
freelist ?

----------------------------------------------------------------------

Comment By: Raymond Hettinger (rhettinger)
Date: 2004-03-01 08:25

Message:
Logged In: YES 
user_id=80475

The effect on recursive functions could be mitigated by
restoring the freelist and falling back to it when
code->co_zombieframe == NULL.

I don't know if that is worth it.  The current patch is a
code simplication as well as an optimization.  Adding back
the freelist, adds a lot of clutter.  Python is not
especially friendly to recursive functions anyway.

----------------------------------------------------------------------

Comment By: Michael Hudson (mwh)
Date: 2004-03-01 07:23

Message:
Logged In: YES 
user_id=6656

It slows recursive functions down a noticeable amount (did
this get noted anywhere?  Maybe Armin & I just talked about
it on IRC), so that should be considered before this patch
is applied.  But I think it's probably worth it, FWIW.

----------------------------------------------------------------------

Comment By: Raymond Hettinger (rhettinger)
Date: 2004-03-01 07:20

Message:
Logged In: YES 
user_id=80475

Armin's second patch gives gives the expected speedups on a
Pentium3 running WinME, and the test suite runs without
exception.  I recommend accepting and applying this patch as
is.  Further improvements can be considered separately.

----------------------------------------------------------------------

Comment By: Armin Rigo (arigo)
Date: 2004-02-02 08:04

Message:
Logged In: YES 
user_id=4771

I guess the idea was just in the air, after your published attempts.

Ideally I'd have liked to have the cached frame depend on the globals as well as the code object itself; I considered moving the cache field to function objects.  This way you also save the f_globals and f_builtins initialization.  There were problems but maybe we should try harder.

----------------------------------------------------------------------

Comment By: Michael Hudson (mwh)
Date: 2004-02-02 06:20

Message:
Logged In: YES 
user_id=6656

Did I mention this idea to you or did you come up with it 
independently?  I forget...

I'll try to time stuff on my iBook tomorrow.

----------------------------------------------------------------------

Comment By: Armin Rigo (arigo)
Date: 2004-01-27 12:03

Message:
Logged In: YES 
user_id=4771

Here is yet another try, which seems to perform better on my PentiumIII.  I get the following speed improvements for this patch alone, for a loop calling an empty function:

zombie-frames.diff: 11.4%      (PyStone 3.8%)
scary-frame-hacks.diff: 6.4%   (PyStone 0.85%)

The idea is to get rid of the free_list and instead store the most recently finished ("zombie") frame in an internal field of the code object.  This saves half of the frame creation overhead because half of the fields are already correct when the frame is reused, e.g. f_code, f_nlocals, f_stacksize, f_valuestack...

(you might need to cvs up frameobject.c before you can apply the patch)

----------------------------------------------------------------------

Comment By: Michael Hudson (mwh)
Date: 2004-01-15 06:02

Message:
Logged In: YES 
user_id=6656

I'm fairly sure this made a difference on my iBook; haven't
tried on x86.

It's possible that the correct response to this patch is to
add "... nah, not worth it" to the XXX comment in ceval.c...

----------------------------------------------------------------------

Comment By: Armin Rigo (arigo)
Date: 2004-01-14 12:35

Message:
Logged In: YES 
user_id=4771

(Side note first: I'm not sure 'builtins = back->f_builtins'
is right.)

Is the whole subclassing complexity worth the effort, given
that the invariants of light frames only seem to be that
four specific fields are null?  Changing the type of an
object under Python code's feet is calling for troubles. 
Moreover it is bound to break code that expect
'type(frame)==FrameType', even if such code can be
considered bad style.

Moreover it requires a number of hacks here and there --
e.g. you turn a light frame into a "heavy" frame when
f_trace is set; is it on purpose that you don't do it when
f_locals is set?

I cannot seem to get reliable performance results on my
machine, but maybe you want to compare with the attached
patch which speeds up the regular PyFrame_New by putting
stronger invariants on all the frames in the free_list.


----------------------------------------------------------------------

Comment By: Michael Hudson (mwh)
Date: 2004-01-13 13:23

Message:
Logged In: YES 
user_id=6656

sigh

----------------------------------------------------------------------

Comment By: Jeremy Hylton (jhylton)
Date: 2004-01-13 13:20

Message:
Logged In: YES 
user_id=31392

I don't see any files attached.


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=876206&group_id=5470

From noreply at sourceforge.net  Tue May 23 14:28:09 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Tue, 23 May 2006 05:28:09 -0700
Subject: [Patches] [ python-Patches-1491759 ] IDLE L&F on MacOSX
Message-ID: <E1FiVzd-0005Xx-BX@sc8-sf-web1.sourceforge.net>

Patches item #1491759, was opened at 2006-05-19 19:39
Message generated for change (Settings changed) made by ronaldoussoren
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1491759&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: IDLE
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Ronald Oussoren (ronaldoussoren)
>Assigned to: Kurt B. Kaiser (kbk)
Summary: IDLE L&F on MacOSX 

Initial Comment:
The attached patch fixes some L&F issues on MacOSX:

- IDLE now reacts to file-open AppleEvents, which means that if a user 
associates IDLE.app with .py files IDLE will open .py files when the user 
double-clicks on them

- Hide the tcl/tk console window that gets opened by default when IDLE is 
in an application bundle (that's a misfeature of aquatk)

- Patch the menu's to make sure they better conform to the HIG.

- PyShell/EditorWindow  status_bar no longer overlaps with the resize 
widget in the lower-left corner of the window

Open issues:

- When you double-click on a file and IDLE is not yet open the file will be 
opened, but IDLE will open the default shell window just above it :-(

- I'm not terribly happy with the code changes that implement the 
updated menu structure.

- The default keybindings on OSX are the windows keybindings. I haven't 
checked yet if that can be fixed programmaticly, I also haven't verified if 
the macos keybindings are fully correct for OSX.

- The general L&F is still wrong, but that isn't really IDLE's fault: tcl/tk 
doesn't fully conform to the HIG yet (dialogs without title bars, wrong 
default dinwos background, wrong widget for tabbed windows, ...).

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1491759&group_id=5470

From noreply at sourceforge.net  Tue May 23 14:30:10 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Tue, 23 May 2006 05:30:10 -0700
Subject: [Patches] [ python-Patches-1488098 ] MacOSX: distutils support for
	-arch and -isysroot flags
Message-ID: <E1FiW1a-0003Kw-3h@sc8-sf-web4-b.sourceforge.net>

Patches item #1488098, was opened at 2006-05-13 23:13
Message generated for change (Comment added) made by ronaldoussoren
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1488098&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Distutils and setup.py
Group: Python 2.5
>Status: Closed
>Resolution: Accepted
Priority: 5
Submitted By: Ronald Oussoren (ronaldoussoren)
Assigned to: Nobody/Anonymous (nobody)
Summary: MacOSX: distutils support for -arch and -isysroot flags

Initial Comment:
This flag adds specific support for the -arch and -isysroot flags of GCC 
on MacOSX 10.4 or later.

The patch consists of two parts:

1) Remove these flags (and their arguments) from the base CFLAGS/
LDFLAGS when compiling extensions on OSX 10.3 or earlier because GCC 
doesn't support those arguments in the version of GCC that is shipped 
what the version of the OS.

2) Strip -arch and -isysroot (again including their arguments) from the 
base CFLAGS/LDFLAGS when the user has specified new values for them 
in the extra_compile_args and extra_link args.

The second part is needed because -isysroot can only be specified once 
and the -arch option is incremental, without this patch you cannot 
compile using a different SDK or for fewer architectures.

A reason for wanting to do the latter is software like psyco that is only 
fully supported on one of the architectures for OSX.

----------------------------------------------------------------------

>Comment By: Ronald Oussoren (ronaldoussoren)
Date: 2006-05-23 14:30

Message:
Logged In: YES 
user_id=580910

Checked in as revision 46104

----------------------------------------------------------------------

Comment By: Ronald Oussoren (ronaldoussoren)
Date: 2006-05-23 08:33

Message:
Logged In: YES 
user_id=580910

This patch isn't strictly necessary to get python going on x86, that should 
work just fine as it is right now. The trunk builds fine for me, except for libffi 
but that's a know issue and high on my list.

The primary reason for this patch is to be able to build a universal binary 
distribution of python on 10.4 and then use the result on 10.3. Without this 
patch you won't be able to build extensions on 10.3 in that scenario because 
distutils will use some compiler flags that aren't valid for the compiler that 
ships with 10.3.

I'll check this in.

----------------------------------------------------------------------

Comment By: Neal Norwitz (nnorwitz)
Date: 2006-05-23 08:23

Message:
Logged In: YES 
user_id=33168

Heh, I never realized you were Dutch. :-)

It took some time to find.  I thought distutils was in 291,
but as you point out there's nothing there now.  So after
finding some references to distutils and 291, I did an svn
log on the PEP and sure enough:

r1982 | akuchling | 2005-03-20 12:47:01 -0800 (Sun, 20 Mar
2005) | 10 lines

After some discussion at the distutils sprint at PyCon 2005,
it seems that no one really wants to make a new standalone
release of Distutils. Given that, there's no reason for
Distutils code to preserve backward compatibility, so I am
removing the requirement for 2.1 compatibility.

I'm not sure if I'll have time to review this patch soon
(probably over a week).

Is this patch required to get Python working on x86 Macs?  I
know there were a couple of bug reports about x86 Mac.  If
so or you want more testing, it would be better to check
this in sooner rather than later.  I doubt I will find more
on a second review of this patch (and I try to review all
checkins).  So if you want and think it's appropriate go
ahead and check in.  If you want more review, you can try to
wait for me or solicit input from python-dev.

----------------------------------------------------------------------

Comment By: Ronald Oussoren (ronaldoussoren)
Date: 2006-05-19 21:23

Message:
Logged In: YES 
user_id=580910

I've updated the patch with the stylistic changes you've requested.

BTW. I don't think idx is confusing, although I suppose it helps that the Dutch 
term for index is index :-)

BTW. Distutils.archive_util claims it should be kept 2.1 compatible, although I 
don't know if that request covers all of distutils. PEP 291 doesn't mention 
distutils.

----------------------------------------------------------------------

Comment By: Neal Norwitz (nnorwitz)
Date: 2006-05-15 09:41

Message:
Logged In: YES 
user_id=33168

I don't see any obvious problems with the patch.  I have
some nits though:

 * This is pretty complex: int(os.uname()[2].split('.')[0])
   I would prefer if it was broken up and use local
variables to explain better what's going on (or at least a
comment that shows the expected format).
  - same with '.'.join(m.group(1).split('.')[:2])

 * Remove double blank lines at first line of patch in
util.py and the last 3 lines (the pass is not needed).

 * unixcompiler.py, use True/False instead of 1/0.  I forget
what the compatibility of distutils is, but I see other uses
of True and False

   - same comment about getting the kernel with a complex expr

   - I prefer index instead of idx (I don't like abbrevs,
particularly for foreign speakers)

Instead of: 
+        if '-arch' in cc_args:
+            stripArch = 1

just set it:  stripArch = '-arch' in cc_args

Same for stripSysroot

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1488098&group_id=5470

From noreply at sourceforge.net  Tue May 23 15:10:01 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Tue, 23 May 2006 06:10:01 -0700
Subject: [Patches] [ python-Patches-1446489 ] zipfile: support for ZIP64
Message-ID: <E1FiWe9-0005BV-38@sc8-sf-web4-b.sourceforge.net>

Patches item #1446489, was opened at 2006-03-09 15:58
Message generated for change (Comment added) made by ronaldoussoren
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1446489&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Ronald Oussoren (ronaldoussoren)
Assigned to: Ronald Oussoren (ronaldoussoren)
Summary: zipfile: support for ZIP64

Initial Comment:
The attached patch implements support for ZIP64, that is zipfiles 
containing very large (>4GByte) files and zipfiles that are larger than
4GByte themselves. 

The output of this patch can be read by pkzip (see below for the actual 
version I used for testing).


----------------------------------------------------------------------

>Comment By: Ronald Oussoren (ronaldoussoren)
Date: 2006-05-23 15:10

Message:
Logged In: YES 
user_id=580910

I've found some time to work on this. I've added zipfile-zip64-
version2.patch, this version:

* Makes zip64 behaviour optional (defaults to off because zip(1) doesn't 
support  zip64)

* Is significantly faster for large zipfiles because it doesn't scan the entire 
zipfile just to check that the file headers are consistent with the central 
directory w.r.t. filename (this check is now done when trying to read a file)

* Updates the reference documentation.

* Adds unittests. There are two sets of tests: one set tests the behaviour of 
zip64 extensions using small files by lowering the zip64 cutoff point and is 
run every time, the other set do tests with huge zipfiles and are run when the 
largefile feature is enabled when running the tests.

There one backward incompatible change: ZipInfo objects no longer have a 
file_offset attribute. That was the other reason for scanning the entire zipfile 
when opening it. IMNSHO this should have been a private attribute and the 
cost of this feature is not worth its *very* limited usefulness. As an indication 
of its cost: I got a 6x speedup when I removed the calculation of the 
file_offset attribute, something that adds up when you are dealing with huge 
zipfiles (I wrote this patch because I'm dealing with 10+GByte zipfiles with 
tens of thousands of files at work).

I noticed that zipfile raises RuntimeError in some places. I've changed one of 
those to zipfile.BadZipfile, but others remain. I don't like this, most of them 
should be replaced by TypeError or ValueError exceptions.

BTW. This patch also supports storing files >4GByte in the zipfile, but that 
feature isn't very useful because zipfile doesn't have an API for reading file 
data incrementally.

----------------------------------------------------------------------

Comment By: Ronald Oussoren (ronaldoussoren)
Date: 2006-05-16 09:55

Message:
Logged In: YES 
user_id=580910

I haven't had time to work on this, all time I had to work on python related stuff 
has been eaten by finishing PyObjC's port to intel macs and universal binary 
patches.

The former is now done, the latter almost so I'll have some time to work on this 
again especially because I'm using this patch at work and might be able to claim 
some time to work on this during work-hours.

----------------------------------------------------------------------

Comment By: Georg Brandl (gbrandl)
Date: 2006-05-16 09:41

Message:
Logged In: YES 
user_id=849994

Since 2.5 beta is coming close, have you made progress on
the tests/docs?

----------------------------------------------------------------------

Comment By: Ronald Oussoren (ronaldoussoren)
Date: 2006-04-02 21:13

Message:
Logged In: YES 
user_id=580910

The "don't use the ZIP64 extension" flag is a good idea, zipfiles that use this 
extension aren't readable by the infozip tools (zip and unzip on most unix 
systems).

I'll add tests and documentation in the near future.

The version of zipfile that I'm currently using also contains a patch for 
speeding up the opening of zipfiles, for the type of files I'm dealing with 
(about 11GByte large with tens of thousands of files) the speedup is very 
significant. I suppose it's better to file that as a separate patch after this has 
been approved.

----------------------------------------------------------------------

Comment By: Anthony Baxter (anthonybaxter)
Date: 2006-04-02 07:02

Message:
Logged In: YES 
user_id=29957

I'd like to see a testcase and possibly a note for the
documentation about the new semantics. Also, should it be
possible to say "don't use the ZIP64 extension, instead
raise an Error" for people who don't want to generate these?
 

----------------------------------------------------------------------

Comment By: Ronald Oussoren (ronaldoussoren)
Date: 2006-03-09 16:28

Message:
Logged In: YES 
user_id=580910

Oops, I've uploaded the wrong file. zipfile-zip64.patch is the correct one.

I've tested the correctness of created archives using this version of pkzip:

pkzipc -version
PKZIP(R) Server  Version 8  ZIP Compression Utility for Linux X86
Copyright (C) 1989-2005 PKWARE, Inc.  All Rights Reserved. Evaluation 
Version
PKZIP Reg. U.S. Pat. and Tm. Off.  Patent No. 5,051,745
Patent Pending

Version 8.40.66


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1446489&group_id=5470

From noreply at sourceforge.net  Tue May 23 16:57:20 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Tue, 23 May 2006 07:57:20 -0700
Subject: [Patches] [ python-Patches-876193 ] reorganize,
	extend function call optimizations
Message-ID: <E1FiYJz-0000N9-Hd@sc8-sf-web5.sourceforge.net>

Patches item #876193, was opened at 2004-01-14 03:35
Message generated for change (Comment added) made by richard
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=876193&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.4
Status: Open
Resolution: None
Priority: 5
Submitted By: Michael Hudson (mwh)
Assigned to: Michael Hudson (mwh)
Summary: reorganize, extend function call optimizations

Initial Comment:
This patch rejigs the optimizations for certain kinds
of function call -- easy Python functions, METH_O
functions and so on.

It also extends the "easy Python function" optimization
to handle default arguments.

It adds a tp_pythoncall field to type objects, with the
signature of the ceval.c local static function do_call
(which is now exported and called PyEval_DoCall).  This
field is filled out in the function and method
constructors appropriately.

What do you think?  Makes little performance difference
(0.5% improvement in pystone on one machine), but I
think I prefer this arrangement.  It generalizes
better, for one thing.

The patch is a little untidy at present -- some code
duplication and it utterly mangles the function call
statistics code -- but these should be shallow.

----------------------------------------------------------------------

Comment By: Richard Jones (richard)
Date: 2006-05-24 00:57

Message:
Logged In: YES 
user_id=6405

This patch is far too out of date to apply to 2.5

----------------------------------------------------------------------

Comment By: Georg Brandl (birkenfeld)
Date: 2006-02-20 22:26

Message:
Logged In: YES 
user_id=1188172

2.5 is nearing, so what to do?

----------------------------------------------------------------------

Comment By: Michael Hudson (mwh)
Date: 2004-11-08 23:19

Message:
Logged In: YES 
user_id=6656

Well, nothing has invalidated it.  I think I still would like to see it 
go in, but don't have strong opinions either way.  What do you 
think of it? :-)

It should certainly wait for 2.5.

----------------------------------------------------------------------

Comment By: Jeremy Hylton (jhylton)
Date: 2004-11-08 01:29

Message:
Logged In: YES 
user_id=31392

Is this patch still relevant?  Should it wait until 2.5?


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=876193&group_id=5470

From noreply at sourceforge.net  Tue May 23 17:11:31 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Tue, 23 May 2006 08:11:31 -0700
Subject: [Patches] [ python-Patches-1486713 ] HTMLParser : A auto-tolerant
	parsing mode
Message-ID: <E1FiYXj-00013O-Gd@sc8-sf-web1.sourceforge.net>

Patches item #1486713, was opened at 2006-05-11 19:19
Message generated for change (Comment added) made by kxroberto
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1486713&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
>Group: Python 2.4
Status: Open
Resolution: None
Priority: 5
Submitted By: kxroberto (kxroberto)
Assigned to: Nobody/Anonymous (nobody)
Summary: HTMLParser : A auto-tolerant parsing mode

Initial Comment:
Changes:

* Now allows missing spaces between attributes as its
often seen on the web like this :

<script type="text/javascript"language="JavaScript1.1">

That like broke the whole parsing before.


* A fully auto-tolerant mode (HTMLParser.tolerant=1)
was added. It should hopefully NEVER break HTML parsing
on the level of HTMLParser, but recover and continue
the parsing smartly. The mode was tested extensively
with complex pages. The tolerant mode is guaranted to
finish all HTML stuff only during HTMLParser.close() /
goahead(end=True)  - yet that was the same (stucking)
policy before.
Maybe steep: I have  switched ON the tolerant mode by
default, as this is, what in 99.9% of cases one wants
to have.
(I've maybe 20 applications for HTMLParser - None like
the unrecoverable breaks with Exceptions)
During tolerant mode the virtual .warning(message,i,k)
is called instead of error - by default this just
counts .warning_count up. This framework should even
enable to write po HTML checkers

* The patch was generated against py2.3 (still the
"good/base" Python for me) and also fixes a regexp-bug
(which already was fixed in py2.4.2). Yet the patch
works also against py2.4/2.5 - 2 locations where py24
trivially changed to %r/repr may grumble.


-robert


----------------------------------------------------------------------

>Comment By: kxroberto (kxroberto)
Date: 2006-05-23 17:11

Message:
Logged In: YES 
user_id=972995

Python 2.4 version of the patch added.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1486713&group_id=5470

From noreply at sourceforge.net  Tue May 23 17:15:55 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Tue, 23 May 2006 08:15:55 -0700
Subject: [Patches] [ python-Patches-1486713 ] HTMLParser : A auto-tolerant
	parsing mode
Message-ID: <E1FiYbz-0002G7-EL@sc8-sf-web4-b.sourceforge.net>

Patches item #1486713, was opened at 2006-05-11 19:19
Message generated for change (Comment added) made by kxroberto
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1486713&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.4
Status: Open
Resolution: None
Priority: 5
Submitted By: kxroberto (kxroberto)
Assigned to: Nobody/Anonymous (nobody)
Summary: HTMLParser : A auto-tolerant parsing mode

Initial Comment:
Changes:

* Now allows missing spaces between attributes as its
often seen on the web like this :

<script type="text/javascript"language="JavaScript1.1">

That like broke the whole parsing before.


* A fully auto-tolerant mode (HTMLParser.tolerant=1)
was added. It should hopefully NEVER break HTML parsing
on the level of HTMLParser, but recover and continue
the parsing smartly. The mode was tested extensively
with complex pages. The tolerant mode is guaranted to
finish all HTML stuff only during HTMLParser.close() /
goahead(end=True)  - yet that was the same (stucking)
policy before.
Maybe steep: I have  switched ON the tolerant mode by
default, as this is, what in 99.9% of cases one wants
to have.
(I've maybe 20 applications for HTMLParser - None like
the unrecoverable breaks with Exceptions)
During tolerant mode the virtual .warning(message,i,k)
is called instead of error - by default this just
counts .warning_count up. This framework should even
enable to write po HTML checkers

* The patch was generated against py2.3 (still the
"good/base" Python for me) and also fixes a regexp-bug
(which already was fixed in py2.4.2). Yet the patch
works also against py2.4/2.5 - 2 locations where py24
trivially changed to %r/repr may grumble.


-robert


----------------------------------------------------------------------

>Comment By: kxroberto (kxroberto)
Date: 2006-05-23 17:15

Message:
Logged In: YES 
user_id=972995

(and works also for Python2.5)

----------------------------------------------------------------------

Comment By: kxroberto (kxroberto)
Date: 2006-05-23 17:11

Message:
Logged In: YES 
user_id=972995

Python 2.4 version of the patch added.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1486713&group_id=5470

From noreply at sourceforge.net  Tue May 23 18:47:34 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Tue, 23 May 2006 09:47:34 -0700
Subject: [Patches] [ python-Patches-1493701 ] Performance enhancements for
	struct module
Message-ID: <E1Fia2g-000850-Uu@sc8-sf-web2.sourceforge.net>

Patches item #1493701, was opened at 2006-05-23 12:47
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1493701&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Performance
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Bob Ippolito (etrepum)
Assigned to: Nobody/Anonymous (nobody)
Summary: Performance enhancements for struct module

Initial Comment:
This patch refactors the struct module to work like the re module: 
compile and cache the format in advance. This seems to yield at least a 
20% performance improvement on Mac OS X i386, depending on the 
length of the format string.

$ ./python-orig/_build/python.exe  -mtimeit -s "import struct; s = '\x00' 
* 16" "struct.unpack('>iId', s); struct.unpack('iId', s); struct.unpack('<iId', 
s)"
100000 loops, best of 3: 4.48 usec per loop

$ ./bippolito-newstruct/_build/python.exe  -mtimeit -s "import struct; s 
= '\x00' * 16" "struct.unpack('>iId', s); struct.unpack('iId', s); 
struct.unpack('<iId', s)"
100000 loops, best of 3: 3.54 usec per loop

It also adds a struct.Struct type, which is even faster to use than the pack/
unpack/calcsize functions since it doesn't need to look anything up in the 
cache.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1493701&group_id=5470

From noreply at sourceforge.net  Tue May 23 20:48:09 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Tue, 23 May 2006 11:48:09 -0700
Subject: [Patches] [ python-Patches-1493701 ] Performance enhancements for
	struct module
Message-ID: <E1FibvN-0004cg-LX@sc8-sf-web4-b.sourceforge.net>

Patches item #1493701, was opened at 2006-05-23 12:47
Message generated for change (Comment added) made by etrepum
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1493701&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Performance
Group: Python 2.5
>Status: Closed
>Resolution: Accepted
Priority: 5
Submitted By: Bob Ippolito (etrepum)
>Assigned to: Bob Ippolito (etrepum)
Summary: Performance enhancements for struct module

Initial Comment:
This patch refactors the struct module to work like the re module: 
compile and cache the format in advance. This seems to yield at least a 
20% performance improvement on Mac OS X i386, depending on the 
length of the format string.

$ ./python-orig/_build/python.exe  -mtimeit -s "import struct; s = '\x00' 
* 16" "struct.unpack('>iId', s); struct.unpack('iId', s); struct.unpack('<iId', 
s)"
100000 loops, best of 3: 4.48 usec per loop

$ ./bippolito-newstruct/_build/python.exe  -mtimeit -s "import struct; s 
= '\x00' * 16" "struct.unpack('>iId', s); struct.unpack('iId', s); 
struct.unpack('<iId', s)"
100000 loops, best of 3: 3.54 usec per loop

It also adds a struct.Struct type, which is even faster to use than the pack/
unpack/calcsize functions since it doesn't need to look anything up in the 
cache.

----------------------------------------------------------------------

>Comment By: Bob Ippolito (etrepum)
Date: 2006-05-23 14:48

Message:
Logged In: YES 
user_id=139309

Applied in revision 46134

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1493701&group_id=5470

From noreply at sourceforge.net  Tue May 23 20:49:08 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Tue, 23 May 2006 11:49:08 -0700
Subject: [Patches] [ python-Patches-1335972 ] Fix for int(string,
	base) wrong answers (take 2)
Message-ID: <E1FibwK-0002d4-6u@sc8-sf-web1.sourceforge.net>

Patches item #1335972, was opened at 2005-10-24 01:33
Message generated for change (Comment added) made by tim_one
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1335972&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: None
>Status: Closed
>Resolution: Accepted
Priority: 5
Submitted By: Alan McIntyre (alanmcintyre)
>Assigned to: Nobody/Anonymous (nobody)
Summary: Fix for int(string, base) wrong answers (take 2)

Initial Comment:
This incorporates patch #1334979, adds test cases for
all the   cases listed in bug #1334662, and adds test
cases for evaluation of 2**32+1.  There seem to be some
minor speed improvements (simplistic stats shown
below). Some simple performance test scripts have been
included in the attached file as well.

A lookup table was added for the maximum number of
digits that can never overflow on a 32-bit ulong for
each base.  Overflow is only checked when this limit is
exceeded by 1, and once the input string is determined
to be too long (2 over the limit), the evaluation is
halted and an overflow indication is returned.  This
appears to help reduce the evaluation time for very
long strings (no time is wasted trying to evaluate all
of it into a 32-bit ulong).

Evaluation of each character has also been replaced by
a lookup table.  I'm not certain of the amount of speed
benefit obtained from this; I added it early on and
haven't had time to go back and test.  It may be that
it's not worth the extra static table.

Baseline Python from CVS:
alan at tarantula:~/python/dist/src# ./python -m timeit
'int("9")'
100000 loops, best of 3: 4 usec per loop
alan at tarantula:~/python/dist/src# ./python -m timeit
'int("999999999")'
100000 loops, best of 3: 5.49 usec per loop
alan at tarantula:~/python/dist/src# ./python -m timeit
'int("999999999999")'
100000 loops, best of 3: 11.8 usec per loop
alan at tarantula:~/python/dist/src# ./python -m timeit
'int("999999999999999")'
100000 loops, best of 3: 13.4 usec per loop
alan at tarantula:~/python/dist/src# ./python -m timeit
'int("1"*600)'
1000 loops, best of 3: 997 usec per loop


Modified:
alan at tarantula:~/python_testint/dist/src# ./python -m
timeit 'int("9")'
100000 loops, best of 3: 3.63 usec per loop
alan at tarantula:~/python_testint/dist/src# ./python -m
timeit 'int("999999999")'
100000 loops, best of 3: 3.93 usec per loop
alan at tarantula:~/python_testint/dist/src# ./python -m
timeit 'int("999999999999")'  
100000 loops, best of 3: 9.79 usec per loop
alan at tarantula:~/python_testint/dist/src# ./python -m
timeit 'int("999999999999999")'
100000 loops, best of 3: 11 usec per loop
alan at tarantula:~/python_testint/dist/src# ./python -m
timeit 'int("1"*600)'
1000 loops, best of 3: 905 usec per loop

10.2% faster for 1-digit int
39.7% faster for 9-digit int
20.5% faster for 12-digit int
21.8% faster for 15-digit int
10.2% faster for 600-digit int

Test program that takes 750k ints from [0, 2**32)
through stdin:
    Baseline: 8.114 sec (best of 5 consecutive runs)
    Modified: 6.774 sec (best of 5 consecutive runs)

19.8% faster

NOTE: This patch causes new errors in test_array and
test_compile, but it seems that these *should* be
failing given the input string for long(), unless I'm
missing something:

======================================================================
ERROR: test_repr (__main__.FloatTest)
----------------------------------------------------------------------
Traceback (most recent call last):
  File "Lib/test/test_array.py", line 187, in test_repr
    self.assertEqual(a, eval(repr(a), {"array":
array.array}))
ValueError: invalid literal for long(): 10000000000.0
 
======================================================================
ERROR: test_repr (__main__.DoubleTest)
----------------------------------------------------------------------
Traceback (most recent call last):
  File "Lib/test/test_array.py", line 187, in test_repr
    self.assertEqual(a, eval(repr(a), {"array":
array.array}))
ValueError: invalid literal for long(): 10000000000.0
 
----------------------------------------------------------------------

test test_compile crashed -- exceptions.ValueError:
invalid literal for long():
90000000000000.
 

----------------------------------------------------------------------

>Comment By: Tim Peters (tim_one)
Date: 2006-05-23 14:49

Message:
Logged In: YES 
user_id=31435

Thank you, Alan!  A variant of the patch was checked in as
revision 46133 for Python 2.5, affecting

Lib/test/test_builtin.py
Misc/ACKS
Misc/NEWS
Python/mystrtoul.c

For future reference, note that C doesn't define whether an
unqualified "char" is signed or unsigned, and your
assumption that it was signed doesn't actually work
everywhere ;-)  Fixing that was the only real pain remaining
here:  thank you for the careful work and testing!

----------------------------------------------------------------------

Comment By: Georg Brandl (birkenfeld)
Date: 2006-02-19 04:43

Message:
Logged In: YES 
user_id=1188172

Tim, if it looks good can it be applied?

----------------------------------------------------------------------

Comment By: Alan McIntyre (alanmcintyre)
Date: 2005-12-20 23:20

Message:
Logged In: YES 
user_id=1115903

I cleaned up the digitlimit vector and test cases now
include all bases on [2, 36].  Uploaded as
python-mystrtoul5.tgz.

----------------------------------------------------------------------

Comment By: Tim Peters (tim_one)
Date: 2005-10-31 19:55

Message:
Logged In: YES 
user_id=31435

This looks pretty good to me -- talk about every trick in the 
book <wink>.

Note that the digitlimit vector is too cautious for bases that 
are powers of 2.  For example, it's obvious that any string of 
32 bits can't overflow an unsigned long, but the table cuts 
base 2 off at 31 instead.  The formula should use log(2**32, 
base) instead:

"N digits can't overflow" iff
base**N-1 < 2**32  iff
base**N < 2**32+1
base**N <= 2**32  iff
N <= log(2**32, base)

Assuming exact calculation of log(2**32, base) then 
(dubious, really), the floor of that is exactly the maximum 
safe N.

The power-of-2 bases, and base 10, should be added to the 
tests.  We really want to check that _all_ supported bases 
work, right?

----------------------------------------------------------------------

Comment By: Alan McIntyre (alanmcintyre)
Date: 2005-10-24 11:26

Message:
Logged In: YES 
user_id=1115903

Thanks, funny_falcon - that corrected the problem with the
literals. I also included the change to the digit lookup
table.  

The new patch is attached as python-mystrtoul4.tgz; it
passes all tests now on my machine.

----------------------------------------------------------------------

Comment By: funny_falcon (funny_falcon)
Date: 2005-10-24 03:07

Message:
Logged In: YES 
user_id=1290388

Instead of:
overflowed:

	/* spool through remaining characters */

	while ((c = Py_CHARMASK(*str)) != '\0')

		str ++;

Shoold be
	while ((c = Py_CHARMASK(*str)) != '\0') {

		c = digitlookup[c];

		if (c < 0 || c >= base) /* non-"digit" character */

			break;

		str++;
	}
And why not
static int digitlookup[] = {

	37, 37, 37 ......
};
and
		if (c >= base)  break;


----------------------------------------------------------------------

Comment By: Alan McIntyre (alanmcintyre)
Date: 2005-10-24 01:41

Message:
Logged In: YES 
user_id=1115903

I forgot to add that these results were obtained on a PIIIm
833MHz running Linux 2.4.2, GCC 3.2.2, with the Python 2.5a0
CVS source from about 8pm EST Oct 23, 2005.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1335972&group_id=5470

From noreply at sourceforge.net  Tue May 23 20:59:58 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Tue, 23 May 2006 11:59:58 -0700
Subject: [Patches] [ python-Patches-1337051 ] remove 4 ints from
	PyFrameObject
Message-ID: <E1Fic6o-0003Zw-IG@sc8-sf-web2.sourceforge.net>

Patches item #1337051, was opened at 2005-10-25 02:18
Message generated for change (Comment added) made by tim_one
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1337051&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.5
>Status: Closed
>Resolution: Accepted
Priority: 5
Submitted By: Neal Norwitz (nnorwitz)
Assigned to: Nobody/Anonymous (nobody)
Summary: remove 4 ints from PyFrameObject

Initial Comment:
Decreases the size of each frame object by 32 bytes. 
The 4 ints are already in the PyCodeObject.  Well, 2
are in there directly (co_nlocals and co_stacksize). 
The other 2 are the tuple lengths of co_cellvars and
co_freevars.

I ran pybench before and after the patch.  With the
patch, the interpreter was .002 seconds slower, ie,
noise.  I get more variability than that with each
recompile.

Mostly the change is from using f->f_... to co->co_...
 ie, no difference in pointer derefs, just deref a
different pointer.  

I don't see a good reason to duplicate the data.

----------------------------------------------------------------------

>Comment By: Tim Peters (tim_one)
Date: 2006-05-23 14:59

Message:
Logged In: YES 
user_id=31435

Yes, it can.  And did!  Thanks be to Richard Jones.

----------------------------------------------------------------------

Comment By: Georg Brandl (birkenfeld)
Date: 2006-02-20 05:37

Message:
Logged In: YES 
user_id=1188172

Can this go into 2.5?

----------------------------------------------------------------------

Comment By: Neal Norwitz (nnorwitz)
Date: 2005-11-02 02:14

Message:
Logged In: YES 
user_id=33168

Heh, my math sucks.  It should be 16 bytes, not 32.  Though
I got rid of 1 more (f_restricted), so it's really 20 bytes
now.  I need to clean up the patch and attach here.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1337051&group_id=5470

From noreply at sourceforge.net  Tue May 23 21:53:33 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Tue, 23 May 2006 12:53:33 -0700
Subject: [Patches] [ python-Patches-1492509 ] Unification of list-comp and
	for syntax
Message-ID: <E1Ficwf-0004LJ-Vi@sc8-sf-web1.sourceforge.net>

Patches item #1492509, was opened at 2006-05-21 11:06
Message generated for change (Comment added) made by jimjjewett
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1492509&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Heiko Wundram (hwundram)
Assigned to: Nobody/Anonymous (nobody)
Summary: Unification of list-comp and for syntax

Initial Comment:
The following patch adds the ability for:

for <expr> in <expr> if <expr>:
    <do something>

to the Python core. This unifies the syntax of
list/generator comprehensions and the for statement
somewhat, because both now accept conditions which
produce an immediate continue.

I've posted a PEP to python-dev, which details the
changes this patch makes (which are all
backwards-compatible).

The patch doesn't try to address more than the actual
code required to make this feature work yet (except for
changes to Modules/parsermodule.c and Doc/ref/ref7.tex,
which details the for statement). If there's consensus
on this feature, I'll gladly produce more documentation.

----------------------------------------------------------------------

Comment By: Jim Jewett (jimjjewett)
Date: 2006-05-23 15:53

Message:
Logged In: YES 
user_id=764593

I'm not loving the interaction with conditional expressions.

for x in (1,2,3) if test else (3,2,1):

I suppose this techically isn't ambiguous because else is a 
keyword.

On the other hand, you could do it now using he if-else

for x in real_seq if test else ():


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1492509&group_id=5470

From noreply at sourceforge.net  Tue May 23 22:14:22 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Tue, 23 May 2006 13:14:22 -0700
Subject: [Patches] [ python-Patches-1492509 ] Unification of list-comp and
	for syntax
Message-ID: <E1FidGo-0001v4-3d@sc8-sf-web1.sourceforge.net>

Patches item #1492509, was opened at 2006-05-21 11:06
Message generated for change (Comment added) made by jimjjewett
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1492509&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Heiko Wundram (hwundram)
Assigned to: Nobody/Anonymous (nobody)
Summary: Unification of list-comp and for syntax

Initial Comment:
The following patch adds the ability for:

for <expr> in <expr> if <expr>:
    <do something>

to the Python core. This unifies the syntax of
list/generator comprehensions and the for statement
somewhat, because both now accept conditions which
produce an immediate continue.

I've posted a PEP to python-dev, which details the
changes this patch makes (which are all
backwards-compatible).

The patch doesn't try to address more than the actual
code required to make this feature work yet (except for
changes to Modules/parsermodule.c and Doc/ref/ref7.tex,
which details the for statement). If there's consensus
on this feature, I'll gladly produce more documentation.

----------------------------------------------------------------------

Comment By: Jim Jewett (jimjjewett)
Date: 2006-05-23 16:14

Message:
Logged In: YES 
user_id=764593

It seems I misread what the intent was -- I was thinking of 
the if as guarding the entire for loop, not just a single 
iteration.

Because of this confusion, I have to be -1.  

Is there a reason you can't just wrap your iterable 
sequence with another iterator?

for x in (candidate for candidate in fullseq if test):


----------------------------------------------------------------------

Comment By: Jim Jewett (jimjjewett)
Date: 2006-05-23 15:53

Message:
Logged In: YES 
user_id=764593

I'm not loving the interaction with conditional expressions.

for x in (1,2,3) if test else (3,2,1):

I suppose this techically isn't ambiguous because else is a 
keyword.

On the other hand, you could do it now using he if-else

for x in real_seq if test else ():


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1492509&group_id=5470

From noreply at sourceforge.net  Wed May 24 08:10:12 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Tue, 23 May 2006 23:10:12 -0700
Subject: [Patches] [ python-Patches-1492509 ] Unification of list-comp and
	for syntax
Message-ID: <E1FimZQ-0007w7-FX@sc8-sf-web1.sourceforge.net>

Patches item #1492509, was opened at 2006-05-21 17:06
Message generated for change (Comment added) made by hwundram
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1492509&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Heiko Wundram (hwundram)
Assigned to: Nobody/Anonymous (nobody)
Summary: Unification of list-comp and for syntax

Initial Comment:
The following patch adds the ability for:

for <expr> in <expr> if <expr>:
    <do something>

to the Python core. This unifies the syntax of
list/generator comprehensions and the for statement
somewhat, because both now accept conditions which
produce an immediate continue.

I've posted a PEP to python-dev, which details the
changes this patch makes (which are all
backwards-compatible).

The patch doesn't try to address more than the actual
code required to make this feature work yet (except for
changes to Modules/parsermodule.c and Doc/ref/ref7.tex,
which details the for statement). If there's consensus
on this feature, I'll gladly produce more documentation.

----------------------------------------------------------------------

>Comment By: Heiko Wundram (hwundram)
Date: 2006-05-24 08:10

Message:
Logged In: YES 
user_id=791932

Sure, you can wrap the iterable, or you can even do:

if x in y:
    if not x:
        continue
    ...

or

if x in y:
    if x:
        ...

without using any form of "iterator magic". Read my PEP-xxx
on py-dev, and my explanation there of why I think this is a
"good thing"(TM), but I won't go explain it here again,
because generally people have told be to drop it.

----------------------------------------------------------------------

Comment By: Jim Jewett (jimjjewett)
Date: 2006-05-23 22:14

Message:
Logged In: YES 
user_id=764593

It seems I misread what the intent was -- I was thinking of 
the if as guarding the entire for loop, not just a single 
iteration.

Because of this confusion, I have to be -1.  

Is there a reason you can't just wrap your iterable 
sequence with another iterator?

for x in (candidate for candidate in fullseq if test):


----------------------------------------------------------------------

Comment By: Jim Jewett (jimjjewett)
Date: 2006-05-23 21:53

Message:
Logged In: YES 
user_id=764593

I'm not loving the interaction with conditional expressions.

for x in (1,2,3) if test else (3,2,1):

I suppose this techically isn't ambiguous because else is a 
keyword.

On the other hand, you could do it now using he if-else

for x in real_seq if test else ():


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1492509&group_id=5470

From noreply at sourceforge.net  Wed May 24 11:26:00 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 24 May 2006 02:26:00 -0700
Subject: [Patches] [ python-Patches-1494140 ] Documentation for new Struct
	object
Message-ID: <E1Fipcu-0004QN-Nf@sc8-sf-web1.sourceforge.net>

Patches item #1494140, was opened at 2006-05-24 05:26
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1494140&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Documentation
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Bob Ippolito (etrepum)
Assigned to: Nobody/Anonymous (nobody)
Summary: Documentation for new Struct object

Initial Comment:
The performance enhancements to the struct module (patch #1493701) 
are implemented by having a Struct object, which is a compiled structure. 
This text file documents these new struct objects.


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1494140&group_id=5470

From noreply at sourceforge.net  Wed May 24 11:27:03 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 24 May 2006 02:27:03 -0700
Subject: [Patches] [ python-Patches-1493701 ] Performance enhancements for
	struct module
Message-ID: <E1Fipdv-0002Zw-67@sc8-sf-web4-b.sourceforge.net>

Patches item #1493701, was opened at 2006-05-23 12:47
Message generated for change (Comment added) made by etrepum
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1493701&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Performance
Group: Python 2.5
Status: Closed
Resolution: Accepted
Priority: 5
Submitted By: Bob Ippolito (etrepum)
Assigned to: Bob Ippolito (etrepum)
Summary: Performance enhancements for struct module

Initial Comment:
This patch refactors the struct module to work like the re module: 
compile and cache the format in advance. This seems to yield at least a 
20% performance improvement on Mac OS X i386, depending on the 
length of the format string.

$ ./python-orig/_build/python.exe  -mtimeit -s "import struct; s = '\x00' 
* 16" "struct.unpack('>iId', s); struct.unpack('iId', s); struct.unpack('<iId', 
s)"
100000 loops, best of 3: 4.48 usec per loop

$ ./bippolito-newstruct/_build/python.exe  -mtimeit -s "import struct; s 
= '\x00' * 16" "struct.unpack('>iId', s); struct.unpack('iId', s); 
struct.unpack('<iId', s)"
100000 loops, best of 3: 3.54 usec per loop

It also adds a struct.Struct type, which is even faster to use than the pack/
unpack/calcsize functions since it doesn't need to look anything up in the 
cache.

----------------------------------------------------------------------

>Comment By: Bob Ippolito (etrepum)
Date: 2006-05-24 05:27

Message:
Logged In: YES 
user_id=139309

Documentation for the new API in this patch is in:
http://python.org/sf/1494140

----------------------------------------------------------------------

Comment By: Bob Ippolito (etrepum)
Date: 2006-05-23 14:48

Message:
Logged In: YES 
user_id=139309

Applied in revision 46134

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1493701&group_id=5470

From noreply at sourceforge.net  Wed May 24 16:54:12 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 24 May 2006 07:54:12 -0700
Subject: [Patches] [ python-Patches-1494140 ] Documentation for new Struct
	object
Message-ID: <E1FiukW-0004oF-Pk@sc8-sf-web4-b.sourceforge.net>

Patches item #1494140, was opened at 2006-05-24 05:26
Message generated for change (Comment added) made by etrepum
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1494140&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Documentation
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Bob Ippolito (etrepum)
Assigned to: Nobody/Anonymous (nobody)
Summary: Documentation for new Struct object

Initial Comment:
The performance enhancements to the struct module (patch #1493701) 
are implemented by having a Struct object, which is a compiled structure. 
This text file documents these new struct objects.


----------------------------------------------------------------------

>Comment By: Bob Ippolito (etrepum)
Date: 2006-05-24 10:54

Message:
Logged In: YES 
user_id=139309

Hold up on this patch, I need to revise it.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1494140&group_id=5470

From noreply at sourceforge.net  Wed May 24 17:34:49 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 24 May 2006 08:34:49 -0700
Subject: [Patches] [ python-Patches-1281707 ] Speed up gzip.readline (~40%)
Message-ID: <E1FivNp-0006ZW-Tg@sc8-sf-web2.sourceforge.net>

Patches item #1281707, was opened at 2005-09-04 13:53
Message generated for change (Settings changed) made by etrepum
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1281707&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: None
>Status: Closed
>Resolution: Accepted
Priority: 5
Submitted By: April King (marumari)
Assigned to: Bob Ippolito (etrepum)
Summary: Speed up gzip.readline (~40%)

Initial Comment:
See bug 849046 for history.  This patch passes both the
regression test and the standard test.  Hopefully the
extra information below won't be too difficult to read.
 I can attach this info to the bug, if need be.

Fixed:
  - Add self.min_readsize to __init__.
    Follows the principal that lines are likely to be
the same length in size,
    and doesn't start over at a minimum length string
every call to readline()
  - Rewriting of assignment for readsize and size at
the beginning of function.
    Eliminates almost all calls to min()
  - Change bufs to a string, and not an array.  No
point in using an array when
    all you do with it is "".join(bufs).  Uses string
addition instead.
  - Remove extra assignments to bufs (in return())
  - Changes readline() to be much more readable (loop
reordering, more comments)

Recommendations:
  - Delete _unread() function.  It is used _only_ by
readline(), and moving its
    functionality into readline() itself saves the
function call overhead.
    _unread() is only 3 lines long.  Testing shows that
removing it speeds
    readline() up by about 3%.  Backwards compatibility
concerns?

Testing results:
test_append (__main__.TestGzip) ... ok
test_many_append (__main__.TestGzip) ... ok
test_mode (__main__.TestGzip) ... ok
test_read (__main__.TestGzip) ... ok
test_readline (__main__.TestGzip) ... ok
test_readlines (__main__.TestGzip) ... ok
test_seek_read (__main__.TestGzip) ... ok
test_seek_write (__main__.TestGzip) ... ok
test_write (__main__.TestGzip) ... ok

----------------------------------------------------------------------
Ran 9 tests in 0.331s

Regression tests:
python regrtest.py -g test_gzip.py
test_gzip
1 test OK.

---

Profiling Results (performed on a common compressed log
file - 200748 lines).

With patch...

         1213961 function calls in 12.188 CPU seconds

   Ordered by: standard name

   ncalls  tottime  percall  cumtime  percall
filename:lineno(function)
        1    0.000    0.000    0.000    0.000 :0(close)
     1159    0.020    0.000    0.020    0.000 :0(crc32)
     1158    0.100    0.000    0.100    0.000
:0(decompress)
        1    0.000    0.000    0.000    0.000
:0(decompressobj)
   200774    0.812    0.000    0.812    0.000 :0(find)
   403865    0.902    0.000    0.902    0.000 :0(len)
     1183    0.000    0.000    0.000    0.000 :0(min)
        2    0.000    0.000    0.000    0.000 :0(ord)
     1173    0.000    0.000    0.000    0.000 :0(read)
       12    0.000    0.000    0.000    0.000 :0(seek)
        1    0.000    0.000    0.000    0.000
:0(setprofile)
       18    0.000    0.000    0.000    0.000 :0(tell)
        2    0.000    0.000    0.000    0.000 :0(unpack)
        1    0.000    0.000   12.188   12.188 <string>:1(?)
        1    0.000    0.000    0.000    0.000
gzip_new.py:156(_init_read)
        1    0.000    0.000    0.000    0.000
gzip_new.py:160(_read_gzip_header)
        3    0.000    0.000    0.000    0.000
gzip_new.py:18(U32)
   200774    2.453    0.000    2.593    0.000
gzip_new.py:207(read)
   200749    2.894    0.000    3.796    0.000
gzip_new.py:239(_unread)
     1166    0.010    0.000    0.140    0.000
gzip_new.py:244(_read)
        1    0.000    0.000    0.000    0.000
gzip_new.py:27(LOWU32)
     1158    0.010    0.000    0.030    0.000
gzip_new.py:294(_add_read_data)
        1    0.000    0.000    0.000    0.000
gzip_new.py:300(_read_eof)
        1    0.000    0.000    0.000    0.000
gzip_new.py:314(close)
        1    0.000    0.000    0.000    0.000
gzip_new.py:327(__del__)
   200749    3.916    0.000   11.117    0.000
gzip_new.py:384(readline)
        2    0.000    0.000    0.000    0.000
gzip_new.py:39(read32)
        1    0.000    0.000    0.000    0.000
gzip_new.py:42(open)
        1    0.000    0.000    0.000    0.000
gzip_new.py:60(__init__)
        1    0.000    0.000   12.188   12.188
profile:0(gunzip_gzip_new_open())
        0    0.000             0.000         
profile:0(profiler)
        1    1.071    1.071   12.188   12.188
test_gzip_speed.py:14(gunzip_gzip_new_open)

Without patch...

         2073328 function calls in 18.597 CPU seconds

   Ordered by: standard name

   ncalls  tottime  percall  cumtime  percall
filename:lineno(function)
   243820    0.735    0.000    0.735    0.000 :0(append)
        1    0.000    0.000    0.000    0.000 :0(close)
     1159    0.040    0.000    0.040    0.000 :0(crc32)
     1158    0.100    0.000    0.100    0.000
:0(decompress)
        1    0.000    0.000    0.000    0.000
:0(decompressobj)
   243820    0.960    0.000    0.960    0.000 :0(find)
   200749    0.801    0.000    0.801    0.000 :0(join)
   489958    1.330    0.000    1.330    0.000 :0(len)
   243820    0.791    0.000    0.791    0.000 :0(min)
        2    0.000    0.000    0.000    0.000 :0(ord)
     1173    0.030    0.000    0.030    0.000 :0(read)
        6    0.000    0.000    0.000    0.000 :0(seek)
        1    0.000    0.000    0.000    0.000
:0(setprofile)
        6    0.000    0.000    0.000    0.000 :0(tell)
        2    0.000    0.000    0.000    0.000 :0(unpack)
        1    0.000    0.000   18.597   18.597 <string>:1(?)
        1    0.000    0.000    0.000    0.000
gzip.py:154(_init_read)
        1    0.000    0.000    0.000    0.000
gzip.py:158(_read_gzip_header)
        3    0.000    0.000    0.000    0.000
gzip.py:18(U32)
   243820    2.711    0.000    2.921    0.000
gzip.py:205(read)
   200749    3.083    0.000    4.143    0.000
gzip.py:237(_unread)
     1160    0.010    0.000    0.210    0.000
gzip.py:242(_read)
        1    0.000    0.000    0.000    0.000
gzip.py:27(LOWU32)
     1158    0.030    0.000    0.070    0.000
gzip.py:292(_add_read_data)
        1    0.000    0.000    0.000    0.000
gzip.py:298(_read_eof)
        1    0.000    0.000    0.000    0.000
gzip.py:312(close)
        1    0.000    0.000    0.000    0.000
gzip.py:325(__del__)
   200749    6.934    0.000   17.555    0.000
gzip.py:379(readline)
        2    0.000    0.000    0.000    0.000
gzip.py:39(read32)
        1    0.000    0.000    0.000    0.000
gzip.py:42(open)
        1    0.000    0.000    0.000    0.000
gzip.py:59(__init__)
        1    0.000    0.000   18.597   18.597
profile:0(gunzip_gzip_open())
        0    0.000             0.000         
profile:0(profiler)
        1    1.042    1.042   18.597   18.597
test_gzip_speed.py:7(gunzip_gzip_open)

Using popen + gunzip -c...

         200754 function calls in 4.338 CPU seconds

   Ordered by: standard name

   ncalls  tottime  percall  cumtime  percall
filename:lineno(function)
        1    0.000    0.000    0.000    0.000 :0(popen)
   200749    3.578    0.000    3.578    0.000 :0(readline)
        1    0.000    0.000    0.000    0.000
:0(setprofile)
        1    0.240    0.240    4.338    4.338 <string>:1(?)
        1    0.000    0.000    4.338    4.338
profile:0(gunzip_popen())
        0    0.000             0.000         
profile:0(profiler)
        1    0.520    0.520    4.098    4.098
test_gzip_speed.py:21(gunzip_popen)

----------------------------------------------------------------------

Comment By: April King (marumari)
Date: 2006-05-22 12:47

Message:
Logged In: YES 
user_id=747439

Okie dokie.  30% is still a welcome speedup.  :)

----------------------------------------------------------------------

Comment By: Bob Ippolito (etrepum)
Date: 2006-05-22 12:40

Message:
Logged In: YES 
user_id=139309

Using a string is over 4x slower (at least on this platform) if the strings get 
large, that's not acceptable. Using a list is a compromise that provides good (but 
not optimal) performance when dealing with lines of arbitrary length.

----------------------------------------------------------------------

Comment By: April King (marumari)
Date: 2006-05-22 12:08

Message:
Logged In: YES 
user_id=747439

There was generally a 5-10% speed improvement for using a
string.  This is because the cost of recreating the string
by appending was less than the cost of creating an array,
appending the the array, and then joining it back together.

I would recommend trying leaving it as a string, but
changing this:
if readsize > self.min_readsize:
  self.min_readsize = int(self.min_readsize * 1.25)

(or some kind of scaling factor)

----------------------------------------------------------------------

Comment By: Bob Ippolito (etrepum)
Date: 2006-05-22 11:59

Message:
Logged In: YES 
user_id=139309

Applied in revision 46075

----------------------------------------------------------------------

Comment By: Bob Ippolito (etrepum)
Date: 2006-05-22 11:54

Message:
Logged In: YES 
user_id=139309

The attached patch uses a strategy that provides the same 30%-ish performance 
boost for the benchmark, and also provides a small performance boost (about 
9% or so) for the very strange log file.

The key is to use lists for buffering, and to never allow the default buffer size to 
grow too large (512-ish starting point seems to be a sweet spot). This defends 
against working with large strings more often than necessary.

----------------------------------------------------------------------

Comment By: April King (marumari)
Date: 2006-05-22 11:39

Message:
Logged In: YES 
user_id=747439

Actually, the slow speed in that specific circumstance has
nothing to do with the fact that it uses a string style
buffer, which should always be faster than what it used
before (an array of strings that was constantly appended to.)

The problem with that particular file is how the
gzip.readline function auto-optimizes it's read size.

if readsize > self.min_readsize:
  self.min_readsize = readsize

So, it optimizes it's read size to the length of the largest
line that it has seen so far.  The assumption is that
gzipped files are generally going to be a bunch of lines of
similar length, and not wildly differing length.

----------------------------------------------------------------------

Comment By: Bob Ippolito (etrepum)
Date: 2006-05-22 11:29

Message:
Logged In: YES 
user_id=139309

It turns out the performance difference was due to some.. interesting 
characteristics for that particular log file.

>>> import gzip
>>> lengths = [len(line) for line in gzip.GzipFile('TEST.LOG')]
>>> sum(lengths) / float(len(lengths))
45.60349675165147
>>> max(lengths)
117989
>>> min(lengths)
1

The str style buffer in this particular example is going to fail miserably 
reading that one long line.

----------------------------------------------------------------------

Comment By: Bob Ippolito (etrepum)
Date: 2006-05-22 11:12

Message:
Logged In: YES 
user_id=139309

I'm reopening this patch -- it seems that these changes have made parsing 
Apache style log files MUCH slower (4x on some samples).

----------------------------------------------------------------------

Comment By: Bob Ippolito (etrepum)
Date: 2006-05-22 10:32

Message:
Logged In: YES 
user_id=139309

Applied in revision 46070

----------------------------------------------------------------------

Comment By: Bob Ippolito (etrepum)
Date: 2006-05-22 10:11

Message:
Logged In: YES 
user_id=139309

This patch is about a 30% win on Mac OS X i386 using this benchmark:
http://svn.python.org/view/sandbox/trunk/gzipbench/gzipbench.py

I'm going to look to see if there's any other low hanging fruit in there before I 
commit.

----------------------------------------------------------------------

Comment By: April King (marumari)
Date: 2005-09-04 13:57

Message:
Logged In: YES 
user_id=747439

See attached text file for the detailed description (that's
much more readable).

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1281707&group_id=5470

From noreply at sourceforge.net  Wed May 24 17:35:46 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 24 May 2006 08:35:46 -0700
Subject: [Patches] [ python-Patches-1494140 ] Documentation for new Struct
	object
Message-ID: <E1FivOk-0006nB-Fq@sc8-sf-web2.sourceforge.net>

Patches item #1494140, was opened at 2006-05-24 05:26
Message generated for change (Comment added) made by etrepum
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1494140&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Documentation
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Bob Ippolito (etrepum)
Assigned to: Nobody/Anonymous (nobody)
Summary: Documentation for new Struct object

Initial Comment:
The performance enhancements to the struct module (patch #1493701) 
are implemented by having a Struct object, which is a compiled structure. 
This text file documents these new struct objects.


----------------------------------------------------------------------

>Comment By: Bob Ippolito (etrepum)
Date: 2006-05-24 11:35

Message:
Logged In: YES 
user_id=139309

New patch attached, fixed unpack documentation, added unpack_from method.

----------------------------------------------------------------------

Comment By: Bob Ippolito (etrepum)
Date: 2006-05-24 10:54

Message:
Logged In: YES 
user_id=139309

Hold up on this patch, I need to revise it.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1494140&group_id=5470

From noreply at sourceforge.net  Wed May 24 20:24:51 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 24 May 2006 11:24:51 -0700
Subject: [Patches] [ python-Patches-1494487 ] PyUnicode_Resize cannot resize
	shared unicode object
Message-ID: <E1Fiy2N-00009N-Se@sc8-sf-web2.sourceforge.net>

Patches item #1494487, was opened at 2006-05-25 03:24
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1494487&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Hirokazu Yamamoto (ocean-city)
Assigned to: Nobody/Anonymous (nobody)
Summary: PyUnicode_Resize cannot resize shared unicode object

Initial Comment:
I found following code fails.

    PyUnicodeObject *v1 = _PyUnicode_New(0);
    PyUnicodeObject *v2 = _PyUnicode_New(0);

    _PyUnicode_Resize(&v1, 1);

    Py_DECREF(v1);
    Py_DECREF(v2);

Error message is...

SystemError:
E:\python-dev\trunk\Objects\unicodeobject.c:335: bad
argument to internal function

This happens because _PyUnicode_New(0) returns
empty_unicode, and its ob_refcnt becomes 2 on second
call. I think refcnt check bellow is not needed. Is
this right fix?

Index: Objects/unicodeobject.c
===================================================================
--- Objects/unicodeobject.c	(revision 46192)
+++ Objects/unicodeobject.c	(working copy)
@@ -331,7 +331,7 @@
 	return -1;
     }
     v = (PyUnicodeObject *)*unicode;
-    if (v == NULL || !PyUnicode_Check(v) ||
v->ob_refcnt != 1 || length < 0) {
+    if (v == NULL || !PyUnicode_Check(v) || length < 0) {
 	PyErr_BadInternalCall();
 	return -1;
     }


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1494487&group_id=5470

From noreply at sourceforge.net  Wed May 24 21:54:25 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 24 May 2006 12:54:25 -0700
Subject: [Patches] [ python-Patches-1494554 ] Numeric characters not
	recognized.
Message-ID: <E1FizR3-0006du-JL@sc8-sf-web4-b.sourceforge.net>

Patches item #1494554, was opened at 2006-05-24 21:54
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1494554&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Anders Chrigstr?m (andersch)
Assigned to: Nobody/Anonymous (nobody)
Summary: Numeric characters not recognized.

Initial Comment:
unicode.isnumeric() and unicodedata.numeric() fails to
recognize a bunch of numeric unicode characters.

The patch fixes this.


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1494554&group_id=5470

From noreply at sourceforge.net  Wed May 24 22:18:36 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 24 May 2006 13:18:36 -0700
Subject: [Patches] [ python-Patches-1455898 ] patch for mbcs codecs
Message-ID: <E1FizoS-0001a0-C1@sc8-sf-web2.sourceforge.net>

Patches item #1455898, was opened at 2006-03-22 16:31
Message generated for change (Comment added) made by ocean-city
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1455898&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Windows
Group: Python 2.4
Status: Open
Resolution: None
Priority: 7
Submitted By: Hirokazu Yamamoto (ocean-city)
Assigned to: Walter D?rwald (doerwalter)
Summary: patch for mbcs codecs

Initial Comment:
Hello.

I have noticed mbcs codecs sometimes generates broken
string. I'm using Windows(Japanese) so mbcs is mapped
to cp932 (close to shift_jis)

When I run the attached script "a.zip", the entry
"Error 00007"'s message becomes broken like attached
file "b.txt".

I think this happens because the string passed to
PyUnicode_DecodeMBCS() sometimes terminates with
leading byte, and MultiByteToWideChar() counts it for
size of result string.buffer size.

I hope attached patch "mbcs.patch" may fix the problem.
It would be nice if this bug will be fixed in 2.4.3...
Thank you.


----------------------------------------------------------------------

>Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-05-25 05:18

Message:
Logged In: YES 
user_id=1200846

I updated the patch.

  - PyUnicode_DecodeMBCS now supports size >= INT_MAX. (I
don't have machine to test such big string, but I have
tested this routine replaced INT_MAX with 2 and 3)

PyUnicode_DecodeMBCS does not support size >= INT_MAX yet,
but probably I'll fix it too.

This patch includes Patch#1494487.


----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-05-02 20:40

Message:
Logged In: YES 
user_id=1200846

I updated the patch. (I copy and pasted "int final = 0" from
above code (ex: utf_16_ex_decode), maybe they also should be
changed for consistency?)

And one more thing, I noticed "errors" is ignored now. We
can detect invalid character if we set MB_ERR_INVALID_CHARS
flag when calling MultiByteToWideChar, but we cannot tell
where is the position of invalid character, and MSDN saids
this flag is available Win2000SP4 or later (I don't know
why)
http://msdn.microsoft.com/library/default.asp?url=/library/en-us/intl/unicode_17si.asp
So I didn't make the patch for it.


----------------------------------------------------------------------

Comment By: Walter D?rwald (doerwalter)
Date: 2006-04-26 02:22

Message:
Logged In: YES 
user_id=89016

I think the default value for final in mbcs_decode() should
be true, so that the stateless decoder detects incomplete
byte sequences too.

----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-04-07 18:10

Message:
Logged In: YES 
user_id=1200846

I have sent contributor form via postal mail. Probably you
can get it after 10 days.

----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-03-28 17:16

Message:
Logged In: YES 
user_id=1200846

You are right. I've updated the patch. (mbcs5.patch)

>>> import codecs
[20198 refs]
>>> d = codecs.getincrementaldecoder("mbcs")()
[20198 refs]
>>> d.decode('\x82\xa0\x82')
u'\u3042'
[20198 refs]
>>> d.decode('')
u''
[20198 refs]
>>> d.decode('', final=True)
u'\x00'
[20198 refs]


----------------------------------------------------------------------

Comment By: Walter D?rwald (doerwalter)
Date: 2006-03-28 01:06

Message:
Logged In: YES 
user_id=89016

_buffer_decode() in the IncrementalDecoder ignores the final
argument. IncrementalDecoder._buffer_decode() should pass on
its final argument to _codecsmodules.c::mbcs_decode(), which
should be extended to accept the final argument. Also
PyUnicode_DecodeMBCSStateful() must handle consumed == NULL
correctly (with your patch it drops trailing lead bytes even
if consumed == NULL)

----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-03-27 16:41

Message:
Logged In: YES 
user_id=1200846

I replaced tests. Probably this is better instead of
comparing the two string generated by same decoder.

----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-03-27 14:44

Message:
Logged In: YES 
user_id=1200846

My real name is Hirokazu Yamamoto. But sorry, I don't have
FAX. (It's needed to send contributor form, isn't it?)

I'll attach the patch updated for trunk. And I'll attach the
tests.

----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2006-03-27 06:05

Message:
Logged In: YES 
user_id=21627

I have reservations against this patch because of the
quasi-anonymous nature of the submission. ocean-city, can
you please state your real name? Would you also be willing
to fill out a contributor form, as shown on

http://www.python.org/psf/contrib-form.html

----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-03-24 23:02

Message:
Logged In: YES 
user_id=1200846

OK, I'll try.

----------------------------------------------------------------------

Comment By: Walter D?rwald (doerwalter)
Date: 2006-03-24 06:44

Message:
Logged In: YES 
user_id=89016

This isn't a bugfix in the strictest sense, so IMHO this
patch shouldn't go into 2.4. 

If the patch goes into 2.5, it would need the appropriate
changes to encodings/mbcs.py (i.e. the IncrementalDecoder
would have to be changed (inheriting from
BufferedIncrementalDecoder).

I realize that this patch might be hard to test, as results
are dependent on locale. Nevertheless at least some tests
would be good (even if they are only run or do something
useful on a certain locale and are skipped otherwise).

ocean-city, can you update the patch for the trunk and add
tests?


----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-03-23 11:51

Message:
Logged In: YES 
user_id=1200846

Hello. This is my final patch. (mbcs4.patch)

 - mbcs3a.patch: _mbsbtype depends on locale not system ANSI
code page. so probably it's not good to use it with
MultiByteToWideChar.

 - mbcs3b.patch: CharNext may cause buffer overflow. and
this patch always calls CharPrev but it's not needed if
string is not terminated with "potensial" lead byte.

I hope this is stable enough to commit on repositry. Thank you.


----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-03-22 23:36

Message:
Logged In: YES 
user_id=1200846

Sorry, I was stupid.

MSDN
(http://msdn.microsoft.com/library/default.asp?url=/library/en-us/intl/unicode_0o2t.asp)
saids,

> IsDBCSLeadByte can only indicate a potential lead byte value. 

IsDBCSLeadByte was returning 1 for some trail byte (ex: "???"[1])

The patch "mbcs3a.patch" worked for me, but _mbsbtype is
probably compiler specific. Is that OK?

The patch "mbcs3b.patch" also worked for me and it only uses
Win32API, but I don't have enough faith on this
implementation...


----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-03-22 19:31

Message:
Logged In: YES 
user_id=1200846

Sorry, I found problem when tried more long text file...
Please wait. I'll investigate more intensibly.

----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-03-22 19:13

Message:
Logged In: YES 
user_id=1200846

Thank you for reply. How about this? (I'm a newbie, I hope
this is right tex format but... can you confirm this? I
created this patch by copy & paste from
PyUnicode_DecodeUTF16Stateful and some modification)


----------------------------------------------------------------------

Comment By: M.-A. Lemburg (lemburg)
Date: 2006-03-22 18:12

Message:
Logged In: YES 
user_id=38388

One more nit: the doc patch is missing. Please add a patch
for the API docs.


----------------------------------------------------------------------

Comment By: M.-A. Lemburg (lemburg)
Date: 2006-03-22 18:11

Message:
Logged In: YES 
user_id=38388

As I understand your comment, the mbcs codec will have a
problem if the input string terminates with a lead byte.

Could you add a comment regarding this to the patch ?!

I can't test the patch, since I don't have a Japanese
Windows to check on, but from looking at the patch, it seems OK.


----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-03-22 16:42

Message:
Logged In: YES 
user_id=1200846

I forgot to mention this. "mbcs.patch" is for
release24-maint branch.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1455898&group_id=5470

From noreply at sourceforge.net  Wed May 24 23:03:48 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 24 May 2006 14:03:48 -0700
Subject: [Patches] [ python-Patches-1494140 ] Documentation for new Struct
	object
Message-ID: <E1Fj0WC-0007FM-5y@sc8-sf-web4-b.sourceforge.net>

Patches item #1494140, was opened at 2006-05-24 05:26
Message generated for change (Comment added) made by jimjjewett
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1494140&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Documentation
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Bob Ippolito (etrepum)
Assigned to: Nobody/Anonymous (nobody)
Summary: Documentation for new Struct object

Initial Comment:
The performance enhancements to the struct module (patch #1493701) 
are implemented by having a Struct object, which is a compiled structure. 
This text file documents these new struct objects.


----------------------------------------------------------------------

Comment By: Jim Jewett (jimjjewett)
Date: 2006-05-24 17:03

Message:
Logged In: YES 
user_id=764593

Shouldn't self.size be the number of bytes required to *pack
* the structure?  The number required to *unpack* seems 
like it ought to include tuple overhead and such...


----------------------------------------------------------------------

Comment By: Bob Ippolito (etrepum)
Date: 2006-05-24 11:35

Message:
Logged In: YES 
user_id=139309

New patch attached, fixed unpack documentation, added unpack_from method.

----------------------------------------------------------------------

Comment By: Bob Ippolito (etrepum)
Date: 2006-05-24 10:54

Message:
Logged In: YES 
user_id=139309

Hold up on this patch, I need to revise it.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1494140&group_id=5470

From noreply at sourceforge.net  Wed May 24 23:45:54 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 24 May 2006 14:45:54 -0700
Subject: [Patches] [ python-Patches-1442927 ] PyLong_FromString optimization
Message-ID: <E1Fj1Aw-0007Ah-Br@sc8-sf-web3.sourceforge.net>

Patches item #1442927, was opened at 2006-03-04 01:21
Message generated for change (Comment added) made by tim_one
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1442927&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.5
>Status: Closed
>Resolution: Accepted
Priority: 5
Submitted By: Alan McIntyre (alanmcintyre)
>Assigned to: Nobody/Anonymous (nobody)
Summary: PyLong_FromString optimization

Initial Comment:
The current implementation of PyLong_FromString in
Python 2.5 uses muladd1 to add each digit of the input
string into the final number.  Because muladd1 creates
a new long to hold the result on every call, an
intermediate long object is created/destroyed for each
digit in the input string.  

This patch improves on the current implementation of
PyLong_FromString in 3 main ways:

1. Creates and manipulates (in-place) a single long
object to hold the result, skipping the creation of all
those intermediate long objects.

2. Multiple digits from the input string are
consolidated into a single long digit before adding
them into the long integer object.  This greatly
reduces the number of "multiply/add" cycles required to
push all the digits into the long object.

3. Three chunks of code like "if (ch <= '9') k = ch -
'0'" in longobject.c are replaced by a digit value
lookup vector.  I'm not irreversibly stuck on this
idea; it doesn't measurably add to performance, but it
just seems (to me, anyway) to make the code in
long_from_binary_base and PyLong_FromString a little
less cluttered.  This is the same lookup table from
patch 1335972 (an optimization for int()).  I expect if
both patches get accepted it would be best to make them
both reference a single instance of this table; if it
looks like that's what will happen I'll tweak one or
both patches as necessary.


My cheezy test results (included in the attached file
in an OpenOffice spreadsheet) show that the patch makes
long() about 50% faster than the existing
implementation for decimal input strings of about 10
characters.   Longer input strings show even better
performance improvement, leveling off around 3x faster
for very long strings.

This patch passes regression tests on my machine
(WinXP, Visual C++ .net Standard 2003).  I plan to try
out the tests on my Linux box this weekend just to make
sure the performance boost still remains when Python
gets compiled by a C compiler that isn't neutered
(standard .net 2003 doesn't appear to allow any
optimizations).

The test and test data generation scripts I used for
this performance comparison are included in the
attached zip file. 

At the moment I don't have any added tests; if somebody
can suggest some things that ought to be tested I'll
gladly write some tests.


----------------------------------------------------------------------

>Comment By: Tim Peters (tim_one)
Date: 2006-05-24 17:45

Message:
Logged In: YES 
user_id=31435

Thanks again, Alan!  I did some major fiddling for
portability and to try to eradicate the penalty for "short"
input strings.  It's still slower on 1-digit inputs, by
about 4-5%, but faster than before with at least 2-digit
inputs.  The peak speedup remains at around 800-1000 decimal
digits, but it's 6x faster there now.  Much of that came
from eliminating code :-)  For example, there was no actual
need for the memset(), and reducing the main loop to:

	for (; pz < pzstop; ++pz) {
		c += (twodigits)*pz * convmult;
		*pz = (digit)(c & MASK);
		c >>= SHIFT;
	}

was a huge win under VC 7.1 (in the patch, it has a branch
testing whether c is 0, and I didn't believe the comment
that said the branch made it faster ;-)).

Anyway, this is checked in now.  The table of digit values
is duplicated for the moment, and I hope to refactor that
soon (given that ints and longs have become increasingly
unified in Python, it's become increasingly confusing to
have two string->int routines -- while they have to remain
for backward compatibility, there's nothing to stop
rewriting the core to use a new unified conversion function).

----------------------------------------------------------------------

Comment By: Georg Brandl (gbrandl)
Date: 2006-05-16 03:46

Message:
Logged In: YES 
user_id=849994

If someone ;) created a new tracker category, I'd go through
the patches and flag all I can find for the sprint.

----------------------------------------------------------------------

Comment By: Tim Peters (tim_one)
Date: 2006-05-16 03:44

Message:
Logged In: YES 
user_id=31435

Thanks for reminding me, Georg!  This is a good possiblity
for the Iceland sprint.

----------------------------------------------------------------------

Comment By: Georg Brandl (gbrandl)
Date: 2006-05-16 03:40

Message:
Logged In: YES 
user_id=849994

Assigned to Tim. Perhaps something for Iceland?

----------------------------------------------------------------------

Comment By: Alan McIntyre (alanmcintyre)
Date: 2006-03-06 22:05

Message:
Logged In: YES 
user_id=1115903

Version #3 is attached; it has an across-the-board
improvement of ~10% over version 2.  The performance hit for
calling long() on 9-digit numbers is now only about -10%,
breakeven happens somewhere around 11 digits, and the best
performance is about +282% in the vicinity of 1000 digits.

Sorry to keep commenting on my own patch. :)  I think I'm
done now.

----------------------------------------------------------------------

Comment By: Alan McIntyre (alanmcintyre)
Date: 2006-03-06 17:33

Message:
Logged In: YES 
user_id=1115903

Version #2 is attached.  I made a couple of tweaks and
tested the patch out on Linux just to make sure the
performace is still as good with compiler optimizations. 
For short numbers (numbers that would fit into an int),
long() is 10-30% *slower* than before applying the patch. 
For longer numbers, long() is up to 249% faster, with the
peak occurring around 1000 digits.

If the negative performance impact for int-sized digits is
unacceptable, I will see if I can do something about it. 
However, one always has the option of using int() on very
long strings anyway, and it will automatically fall through
to PyLong_FromString if the number is too long.  The
performance impact on int() for small numbers is so small as
to be negligible (<5%), which is to be expected since the
modified code isn't called when using int() on input strings
< 2**32. 

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1442927&group_id=5470

From noreply at sourceforge.net  Thu May 25 06:33:34 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 24 May 2006 21:33:34 -0700
Subject: [Patches] [ python-Patches-1457736 ] patch for building trunk with
	VC6
Message-ID: <E1Fj7XS-0001De-O0@sc8-sf-web4-b.sourceforge.net>

Patches item #1457736, was opened at 2006-03-24 22:40
Message generated for change (Comment added) made by ocean-city
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1457736&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Windows
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Hirokazu Yamamoto (ocean-city)
Assigned to: Nobody/Anonymous (nobody)
Summary: patch for building trunk with VC6

Initial Comment:
Hello. I tried to build trunk with VC6, but failed.
The reasons are

 - _W64 is not defined on VC6. (PC/pyconfig.h)

 - intptr_t and uintptr_t are not decleared on VC6.
(should use Py_intptr_t and Py_uintptr_t respectively)

I'll submit the patch for these two issues as
"build_trunk_for_vc6.patch".

And more two issues.

 - zlib was make built into pythoncore, but
PC/VC6/pythoncore.dsp is not updated for it yet.

I'll submit the file itself.

 - long long cannot be used on VC6, so 0xFFFFULL is
failed to compile with "invalid suffix" error.

I workarounded this replaced ULL with UI64 (_int64's
suffix) but I don't know how to make the patch. maybe
can this tequnique be used?

  #define Py_ULL(x) x##ULL /* non VC6 */

  #define Py_ULL(x) x##UI64 /* VC6 */

  Py_ULL(0xFFFFFFFFFFFFFFFF) instead of 0xFFF...FULL


----------------------------------------------------------------------

>Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-05-25 13:33

Message:
Logged In: YES 
user_id=1200846

Thanks to Luke Dunstan, my patch becomes much smaller.

  - Replace *.dsp in PC/VC6 with attached files.

  - Remove PC/VC6/zlib.dsp

  - _sqlite3 and other new packages are not supported

I read core member are not interested in VC6 anymore, so
this is for VC6 guy. I don't want to install VC++2005Express
because it installs .net framework which I don't need. I'm
Java guy. :-)


----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-05-07 14:40

Message:
Logged In: YES 
user_id=1200846

Oops, I forgot to upload the file.

  - Apply x.patch.

  - Replace pythoncore.dsp and pcbuild.dsw in PC/VC6 with
    attached files.

 - Remove PC/VC6/zlib.dsp


----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-05-07 14:37

Message:
Logged In: YES 
user_id=1200846

Hello. I updated the patch. (Probably this is better)

  - defined ULL() macro locally in Modules/sha512module.c
      maybe it's better to declare Py_ULL or something
      globally, but I don't know how to do it.

 - more patch for zlib builtin (ie: PC/VC6/Readme.txt)

I cannot try this patch on VC7 or later, but
I confirmed lib/test/testall.py passed on VC6.

----------------------------------------------------------------------

Comment By: Luke Dunstan (infidel)
Date: 2006-05-07 03:16

Message:
Logged In: YES 
user_id=30442

Is there anything preventing this patch from being 
applied? It would help me with building the trunk using 
both VC6 and Microsoft eMbedded Visual C++ 4.0 (for 
Windows CE).


----------------------------------------------------------------------

Comment By: Neal Norwitz (nnorwitz)
Date: 2006-03-27 02:02

Message:
Logged In: YES 
user_id=33168

Raymond, maybe this will help get VC6 building?

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1457736&group_id=5470

From noreply at sourceforge.net  Thu May 25 08:03:23 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 24 May 2006 23:03:23 -0700
Subject: [Patches] [ python-Patches-1494750 ] BaseWidget.destroy updates
	master's childern too early
Message-ID: <E1Fj8wN-0005vQ-SX@sc8-sf-web2.sourceforge.net>

Patches item #1494750, was opened at 2006-05-24 23:03
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1494750&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Tkinter
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Greg Couch (gregcouch)
Assigned to: Martin v. L??wis (loewis)
Summary: BaseWidget.destroy updates master's childern too early

Initial Comment:
In BaseWidget.destroy(), it removes self from its
master's dict of children before it calls the Tcl
destroy on the widget.  If the widget has a destroy
callback (like Togl, the Tk OpenGL widget), then
nametowidget throws an exception when given the
widget's name even though the Python widget still
exists.  Just reordering the code, so that the Tcl
destroy happens before the updating of the master's
dict of children, fixes the problem.

The bug is present in earlier versions of Tkinter too.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1494750&group_id=5470

From noreply at sourceforge.net  Thu May 25 13:06:01 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Thu, 25 May 2006 04:06:01 -0700
Subject: [Patches] [ python-Patches-1455898 ] patch for mbcs codecs
Message-ID: <E1FjDfF-0002ED-QG@sc8-sf-web1.sourceforge.net>

Patches item #1455898, was opened at 2006-03-22 16:31
Message generated for change (Comment added) made by ocean-city
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1455898&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Windows
Group: Python 2.4
Status: Open
Resolution: None
Priority: 7
Submitted By: Hirokazu Yamamoto (ocean-city)
Assigned to: Walter D?rwald (doerwalter)
Summary: patch for mbcs codecs

Initial Comment:
Hello.

I have noticed mbcs codecs sometimes generates broken
string. I'm using Windows(Japanese) so mbcs is mapped
to cp932 (close to shift_jis)

When I run the attached script "a.zip", the entry
"Error 00007"'s message becomes broken like attached
file "b.txt".

I think this happens because the string passed to
PyUnicode_DecodeMBCS() sometimes terminates with
leading byte, and MultiByteToWideChar() counts it for
size of result string.buffer size.

I hope attached patch "mbcs.patch" may fix the problem.
It would be nice if this bug will be fixed in 2.4.3...
Thank you.


----------------------------------------------------------------------

>Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-05-25 20:06

Message:
Logged In: YES 
user_id=1200846

>PyUnicode_DecodeMBCS does not support size >= INT_MAX yet,
>but probably I'll fix it too.

Done. Attached as "mbcs_win64_support.patch".

Now, total summary...

    - MBCS decoder and encoder now supports 64bit Py_ssize_t
environment. (I don't have such machine, but I checked
routine by defining NEED_RETRY and redefining INT_MAX as 2,
3, 4)

    - Fixed a bug of MBCS incremental decoder which was
originaly reported by me.


----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-05-25 05:18

Message:
Logged In: YES 
user_id=1200846

I updated the patch.

  - PyUnicode_DecodeMBCS now supports size >= INT_MAX. (I
don't have machine to test such big string, but I have
tested this routine replaced INT_MAX with 2 and 3)

PyUnicode_DecodeMBCS does not support size >= INT_MAX yet,
but probably I'll fix it too.

This patch includes Patch#1494487.


----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-05-02 20:40

Message:
Logged In: YES 
user_id=1200846

I updated the patch. (I copy and pasted "int final = 0" from
above code (ex: utf_16_ex_decode), maybe they also should be
changed for consistency?)

And one more thing, I noticed "errors" is ignored now. We
can detect invalid character if we set MB_ERR_INVALID_CHARS
flag when calling MultiByteToWideChar, but we cannot tell
where is the position of invalid character, and MSDN saids
this flag is available Win2000SP4 or later (I don't know
why)
http://msdn.microsoft.com/library/default.asp?url=/library/en-us/intl/unicode_17si.asp
So I didn't make the patch for it.


----------------------------------------------------------------------

Comment By: Walter D?rwald (doerwalter)
Date: 2006-04-26 02:22

Message:
Logged In: YES 
user_id=89016

I think the default value for final in mbcs_decode() should
be true, so that the stateless decoder detects incomplete
byte sequences too.

----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-04-07 18:10

Message:
Logged In: YES 
user_id=1200846

I have sent contributor form via postal mail. Probably you
can get it after 10 days.

----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-03-28 17:16

Message:
Logged In: YES 
user_id=1200846

You are right. I've updated the patch. (mbcs5.patch)

>>> import codecs
[20198 refs]
>>> d = codecs.getincrementaldecoder("mbcs")()
[20198 refs]
>>> d.decode('\x82\xa0\x82')
u'\u3042'
[20198 refs]
>>> d.decode('')
u''
[20198 refs]
>>> d.decode('', final=True)
u'\x00'
[20198 refs]


----------------------------------------------------------------------

Comment By: Walter D?rwald (doerwalter)
Date: 2006-03-28 01:06

Message:
Logged In: YES 
user_id=89016

_buffer_decode() in the IncrementalDecoder ignores the final
argument. IncrementalDecoder._buffer_decode() should pass on
its final argument to _codecsmodules.c::mbcs_decode(), which
should be extended to accept the final argument. Also
PyUnicode_DecodeMBCSStateful() must handle consumed == NULL
correctly (with your patch it drops trailing lead bytes even
if consumed == NULL)

----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-03-27 16:41

Message:
Logged In: YES 
user_id=1200846

I replaced tests. Probably this is better instead of
comparing the two string generated by same decoder.

----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-03-27 14:44

Message:
Logged In: YES 
user_id=1200846

My real name is Hirokazu Yamamoto. But sorry, I don't have
FAX. (It's needed to send contributor form, isn't it?)

I'll attach the patch updated for trunk. And I'll attach the
tests.

----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2006-03-27 06:05

Message:
Logged In: YES 
user_id=21627

I have reservations against this patch because of the
quasi-anonymous nature of the submission. ocean-city, can
you please state your real name? Would you also be willing
to fill out a contributor form, as shown on

http://www.python.org/psf/contrib-form.html

----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-03-24 23:02

Message:
Logged In: YES 
user_id=1200846

OK, I'll try.

----------------------------------------------------------------------

Comment By: Walter D?rwald (doerwalter)
Date: 2006-03-24 06:44

Message:
Logged In: YES 
user_id=89016

This isn't a bugfix in the strictest sense, so IMHO this
patch shouldn't go into 2.4. 

If the patch goes into 2.5, it would need the appropriate
changes to encodings/mbcs.py (i.e. the IncrementalDecoder
would have to be changed (inheriting from
BufferedIncrementalDecoder).

I realize that this patch might be hard to test, as results
are dependent on locale. Nevertheless at least some tests
would be good (even if they are only run or do something
useful on a certain locale and are skipped otherwise).

ocean-city, can you update the patch for the trunk and add
tests?


----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-03-23 11:51

Message:
Logged In: YES 
user_id=1200846

Hello. This is my final patch. (mbcs4.patch)

 - mbcs3a.patch: _mbsbtype depends on locale not system ANSI
code page. so probably it's not good to use it with
MultiByteToWideChar.

 - mbcs3b.patch: CharNext may cause buffer overflow. and
this patch always calls CharPrev but it's not needed if
string is not terminated with "potensial" lead byte.

I hope this is stable enough to commit on repositry. Thank you.


----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-03-22 23:36

Message:
Logged In: YES 
user_id=1200846

Sorry, I was stupid.

MSDN
(http://msdn.microsoft.com/library/default.asp?url=/library/en-us/intl/unicode_0o2t.asp)
saids,

> IsDBCSLeadByte can only indicate a potential lead byte value. 

IsDBCSLeadByte was returning 1 for some trail byte (ex: "???"[1])

The patch "mbcs3a.patch" worked for me, but _mbsbtype is
probably compiler specific. Is that OK?

The patch "mbcs3b.patch" also worked for me and it only uses
Win32API, but I don't have enough faith on this
implementation...


----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-03-22 19:31

Message:
Logged In: YES 
user_id=1200846

Sorry, I found problem when tried more long text file...
Please wait. I'll investigate more intensibly.

----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-03-22 19:13

Message:
Logged In: YES 
user_id=1200846

Thank you for reply. How about this? (I'm a newbie, I hope
this is right tex format but... can you confirm this? I
created this patch by copy & paste from
PyUnicode_DecodeUTF16Stateful and some modification)


----------------------------------------------------------------------

Comment By: M.-A. Lemburg (lemburg)
Date: 2006-03-22 18:12

Message:
Logged In: YES 
user_id=38388

One more nit: the doc patch is missing. Please add a patch
for the API docs.


----------------------------------------------------------------------

Comment By: M.-A. Lemburg (lemburg)
Date: 2006-03-22 18:11

Message:
Logged In: YES 
user_id=38388

As I understand your comment, the mbcs codec will have a
problem if the input string terminates with a lead byte.

Could you add a comment regarding this to the patch ?!

I can't test the patch, since I don't have a Japanese
Windows to check on, but from looking at the patch, it seems OK.


----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-03-22 16:42

Message:
Logged In: YES 
user_id=1200846

I forgot to mention this. "mbcs.patch" is for
release24-maint branch.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1455898&group_id=5470

From noreply at sourceforge.net  Thu May 25 16:25:31 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Thu, 25 May 2006 07:25:31 -0700
Subject: [Patches] [ python-Patches-1492828 ] Improvements to ceval.c
Message-ID: <E1FjGmJ-0002iT-6g@sc8-sf-web4-b.sourceforge.net>

Patches item #1492828, was opened at 2006-05-22 10:15
Message generated for change (Settings changed) made by jafo
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1492828&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
>Category: Performance
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: mrjbq7 (mrjbq7)
Assigned to: Raymond Hettinger (rhettinger)
Summary: Improvements to ceval.c

Initial Comment:
>From Raymond Hettinger, submitting here to keep track of for 
NeedForSpeed sprint.

Here are some customizations to your Python build:
 
First, make sure that WITH_TSC and WITH_THREAD are not defined in the 
build.

Then, attached diff to disable the tracing code, remove NOPs, speed-up 
absolute jumps, and increase the signal check interval.

----------------------------------------------------------------------

Comment By: Tim Peters (tim_one)
Date: 2006-05-22 15:39

Message:
Logged In: YES 
user_id=31435

Assigned to Raymond.  Raymond is there something of general
use here?  As a standalone patch, it sucks ;-)

----------------------------------------------------------------------

Comment By: mrjbq7 (mrjbq7)
Date: 2006-05-22 11:00

Message:
Logged In: YES 
user_id=1172546

Okay, now I checked the box "upload and attach file".  Thats a terrible UI.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1492828&group_id=5470

From noreply at sourceforge.net  Thu May 25 16:26:32 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Thu, 25 May 2006 07:26:32 -0700
Subject: [Patches] [ python-Patches-1359618 ] Speed charmap encoder
Message-ID: <E1FjGnI-0002wd-Ej@sc8-sf-web4-b.sourceforge.net>

Patches item #1359618, was opened at 2005-11-18 08:00
Message generated for change (Settings changed) made by jafo
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1359618&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
>Category: Performance
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Martin v. L??wis (loewis)
Assigned to: M.-A. Lemburg (lemburg)
Summary: Speed charmap encoder

Initial Comment:
This patch speeds up the charmap encoder by a factor of 4 to 5, using a 
trie structure instead of a dictionary; the speedup primarily comes from 
not creating integer objects in the process.

The trie is created by inverting the encoding map; the codec generator is 
changed to drop the encoding dictionary, and instead emit a function call 
to create the trie.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1359618&group_id=5470

From noreply at sourceforge.net  Thu May 25 16:26:56 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Thu, 25 May 2006 07:26:56 -0700
Subject: [Patches] [ python-Patches-1353872 ] a faster Modulefinder
Message-ID: <E1FjGng-000320-4z@sc8-sf-web4-b.sourceforge.net>

Patches item #1353872, was opened at 2005-11-11 11:51
Message generated for change (Settings changed) made by jafo
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1353872&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
>Category: Performance
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Thomas Heller (theller)
Assigned to: Nobody/Anonymous (nobody)
Summary: a faster Modulefinder

Initial Comment:
py2exe uses Python's modulefinder to find the modules
and packages that belong to one or more scripts.

For not too small projects, the runtime of modulefinder
is quite long.  On my system, the time to find all 533
modules my project needs is around 48 seconds.

So, I profiled the Python 2.4 modulefinder, and patched
it for a speedup of a factor of ~2.5 - the time
required to find the modules drops to around 19 seconds.


----------------------------------------------------------------------

Comment By: Thomas Heller (theller)
Date: 2005-11-11 12:02

Message:
Logged In: YES 
user_id=11105

Here is a description of the changes in the patch:

Modulefinder's scan_code method did call ord() on each
character of the co.co_code string, that took the most time,
and it built the argument (again with ord() calls) of each
bytecode that had one, even if it was never used.

The patch changes the code to
- work on the characters of the co.co_code string, avoiding
the calls to ord() altogether
- create the bytecodes argument only when needed,
- create the bytecode with struct.pack which is faster.

I did not stop there, so other changes were that the objects
that scan_code needs most are passed as default arguments to
the functions instead of looking them up in the global
namespace.

This patch will probably be in the next py2exe release, so
it will undergo some testing.

I would appreciate comments on the patch.


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1353872&group_id=5470

From noreply at sourceforge.net  Thu May 25 16:27:32 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Thu, 25 May 2006 07:27:32 -0700
Subject: [Patches] [ python-Patches-1346238 ] A constant folding
	optimization pass for the AST
Message-ID: <E1FjGoG-00039r-A1@sc8-sf-web4-b.sourceforge.net>

Patches item #1346238, was opened at 2005-11-02 18:49
Message generated for change (Settings changed) made by jafo
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1346238&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
>Category: Performance
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Rune Holm (titanstar)
Assigned to: Neal Norwitz (nnorwitz)
Summary: A constant folding optimization pass for the AST

Initial Comment:
This patch adds the following: A visitor interface
generalized from the existing ast pass code in order to
make it easy to write ast passes that only care about
specific node types. A constant folding pass that looks
for operations involving number or string literals, and
calculates these at compile time. Example code snippets
that this pass will optimize:

3 + 4 + x => 7 + x

2 ** 2 ** 2 => 16

4 and 5 and x and 6 => x and 6

4 or 5 or x => 4

4 and 5 and ~6 => -7


When combined with patch 1346214, the compiler will
also optimize statements like

if 2**2**2 - 16: expensive_computation() => nothing

The patch adds two new files: Include/optimize.h and
Python.optimize.c. This was done because I anticipate
adding more AST optimizations later using the same
visitor interface, and Python/compile.c is already very
crowded with byte code generation and bytecode
optimization. If new files aren't desired, I could
easily change the pass to add the extra code to compile.c

This patch combined with patch 1346214 passes the unit
tests on all the platforms I've tested it on, namely:
macos 10.3/ppc
linux/x86
linux/amd64
linux/ppc
linux/ia64

valgrind on linux/x86 does not reveal any additional
leaks or uninitialized accesses that aren't already in
the svn head.


----------------------------------------------------------------------

Comment By: Georg Brandl (gbrandl)
Date: 2006-05-17 15:54

Message:
Logged In: YES 
user_id=849994

Candidate for Iceland?

----------------------------------------------------------------------

Comment By: Raymond Hettinger (rhettinger)
Date: 2006-02-19 17:12

Message:
Logged In: YES 
user_id=80475

I'm +1 on the idea, but won't have an opportunity to 
review the patch in detail (to check for possible semantic 
changes).  Neal, what do you think?

----------------------------------------------------------------------

Comment By: Rune Holm (titanstar)
Date: 2006-02-19 13:35

Message:
Logged In: YES 
user_id=858364

It avoids generating constant objects with sizes above 20 (in a similar fashion 
as the bytecode peepholer), and checks whether the operand of unary minus 
is non-zero in order to avoid changing -0.0.

As for the bytecode peephole optimizer, this AST constant folder performs 
quite similar optimizations, but optimizes partially constant and/or and 
comparative expressions in addition. This patch should however not be seen 
as a replacement for the bytecode constant folder, but rather as a 
complement. An optimizing compiler typically contains many forms of 
constant folding in the different phases of compilation, since many later 
optimizations benefit from constant folding (warranting early constant 
folding), and some optimizations might emit code that benefit from constant 
folding again (warranting late constant folding). For an example of the 
former, consider the statement

if 1-1: some_code()

both passes are able to transform this into

if 0: some_code()

but since the AST constant folder is run before the dead code eliminator at 
<http://python.org/sf/1346214>, these two together are able to optimize 
the if statement away altogether.

Note that this patch probably won't apply cleanly anymore, since it was 
written three months ago and the AST code has undergone quite a few 
changes since then. But if there is interest in applying this patch, I'll gladly 
update it for the current trunk.


----------------------------------------------------------------------

Comment By: Raymond Hettinger (rhettinger)
Date: 2006-02-19 10:09

Message:
Logged In: YES 
user_id=80475

This should be compared to the constant folding already 
added to Py2.5 via the peepholer:
   dis.dis(compile('x=2+3', '', 'exec'))

Also, make sure it doesn't go over the top consuming 
memory for the likes of:

  '-' * 100
  (None,)*2000

Both of those should not be optimized away at compile-time.

Also, be sure not optimize away -0.0.  Thet is not the 
same as +0.0.  The distinction is important for branch 
cuts in cmath.


----------------------------------------------------------------------

Comment By: Georg Brandl (birkenfeld)
Date: 2006-02-19 09:41

Message:
Logged In: YES 
user_id=1188172

Neal, what do you think of this?

----------------------------------------------------------------------

Comment By: Rune Holm (titanstar)
Date: 2005-11-06 20:42

Message:
Logged In: YES 
user_id=858364

Sorry, I'm new to the sourceforge patch tracker. The patch should be 
attached now.

----------------------------------------------------------------------

Comment By: Simon Dahlbacka (sdahlbac)
Date: 2005-11-06 19:10

Message:
Logged In: YES 
user_id=750513

the actual patch is missing...

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1346238&group_id=5470

From noreply at sourceforge.net  Thu May 25 16:28:16 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Thu, 25 May 2006 07:28:16 -0700
Subject: [Patches] [ python-Patches-1243730 ] Big speedup in email message
	parsing
Message-ID: <E1FjGoy-0003LH-Ep@sc8-sf-web4-b.sourceforge.net>

Patches item #1243730, was opened at 2005-07-23 22:07
Message generated for change (Settings changed) made by jafo
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1243730&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
>Category: Performance
Group: Python 2.4
Status: Open
Resolution: None
Priority: 5
Submitted By: L. Peter Deutsch (lpd)
Assigned to: Barry A. Warsaw (bwarsaw)
Summary: Big speedup in email message parsing

Initial Comment:
Python 2.4.1, Red Hat Linux 7.3.

Speeds up message parsing on files with large
attachments by approximately 4x, mostly by replacing
REs by direct string processing.


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1243730&group_id=5470

From noreply at sourceforge.net  Thu May 25 16:29:01 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Thu, 25 May 2006 07:29:01 -0700
Subject: [Patches] [ python-Patches-1243654 ] Faster output if message
	already has a boundary
Message-ID: <E1FjGph-0003Vd-Pu@sc8-sf-web4-b.sourceforge.net>

Patches item #1243654, was opened at 2005-07-23 17:04
Message generated for change (Settings changed) made by jafo
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1243654&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
>Category: Performance
Group: Python 2.4
Status: Open
Resolution: None
Priority: 5
Submitted By: L. Peter Deutsch (lpd)
Assigned to: Barry A. Warsaw (bwarsaw)
Summary: Faster output if message already has a boundary

Initial Comment:
A simple change speeds up Message.as_string by more
than a factor of 2 if the message already has a defined
boundary, by avoiding a time-consuming RE compilation
and search.


----------------------------------------------------------------------

Comment By: L. Peter Deutsch (lpd)
Date: 2005-07-23 22:08

Message:
Logged In: YES 
user_id=8861

Sorry, forgot to enter this with the original submission:
Python 2.4.1, Red Hat Linux 7.3.


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1243654&group_id=5470

From noreply at sourceforge.net  Thu May 25 16:29:25 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Thu, 25 May 2006 07:29:25 -0700
Subject: [Patches] [ python-Patches-1145039 ] Remove some invariant
	conditions and assert in ceval
Message-ID: <E1FjGq4-0003bP-Rz@sc8-sf-web4-b.sourceforge.net>

Patches item #1145039, was opened at 2005-02-20 21:31
Message generated for change (Settings changed) made by jafo
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1145039&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
>Category: Performance
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Neal Norwitz (nnorwitz)
Assigned to: Nobody/Anonymous (nobody)
Summary: Remove some invariant conditions and assert in ceval

Initial Comment:
ISTM that if frame->f_exc_type == NULL then exc_value
and exc_traceback will also be NULL.  I didn't see that
this is documented, perhaps I missed it or there is
some case when this can occur.  If it can occur, we
shoul develop a test for it.

Assuming this condition is invariant, some
simplifications can be made in reset_exc_info which is
called once per eval_frame (on function exit).

Also, I think there is currently an extra Py_INCREF on
Py_None.  This occurs when tstate->exc_type == NULL.

This patch seems to have little to no effect on
performance.  I did measure a 0.3% speed improvement.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1145039&group_id=5470

From noreply at sourceforge.net  Thu May 25 16:29:58 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Thu, 25 May 2006 07:29:58 -0700
Subject: [Patches] [ python-Patches-1107887 ] Speed up function calls/can
	add more introspection info
Message-ID: <E1FjGqc-0001i6-Jt@sc8-sf-web3.sourceforge.net>

Patches item #1107887, was opened at 2005-01-23 18:32
Message generated for change (Settings changed) made by jafo
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1107887&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
>Category: Performance
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Neal Norwitz (nnorwitz)
Assigned to: Nobody/Anonymous (nobody)
Summary: Speed up function calls/can add more introspection info

Initial Comment:
This patch adds a new method type (flags) METH_ARGS
(yeah, the name could be better) that is used in
PyMethodDef.  METH_ARGS means the min and max # of
arguments are specified in the PyMethodDef by adding 2
new fields.  This information can be used in ceval to
call the method.  No tuple packing/unpacking is
required since the C stack is used.

The original patch only modifies Python/bltinmodule.c.
 If the approach is desirable, Objects/*.c should be
modified and so should code in Modules/ (probably).

The benefits are:
 * faster function calls
 * simplify function call machinery by removing
METH_NOARGS, METH_O, and possibly METH_VARARGS
 * more introspection info for C functions (ie, min/max
arg count)

The primary drawback is:
 * the defn of the MethodDef (# args) is separate from
the function defn
 * potentially more error prone to write C methods???

I've measured between 13-22% speed improvement when
doing simple tests like:

  ./python ./Lib/timeit.py -v 'pow(3, 5)'

I think the difference tends to be fairly constant at
about .3 usec per loop.

I'm not sure of the effect on memory usage.  I wouldn't
expect it to be much in either direction.

Note:  This patch does not make the min/max arg count
available to Python code.  If this patch is accepted,
that seems like it should also be done.

It's possible that METH_VARARGS may not be able to go away.

----------------------------------------------------------------------

Comment By: Neal Norwitz (nnorwitz)
Date: 2005-01-24 23:23

Message:
Logged In: YES 
user_id=33168

Martin pointed out that chr(5.3) is mishandled by this patch
and needs to be corrected.

----------------------------------------------------------------------

Comment By: Neal Norwitz (nnorwitz)
Date: 2005-01-23 18:40

Message:
Logged In: YES 
user_id=33168

Also see,
http://mail.python.org/pipermail/python-dev/2005-January/051251.html

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1107887&group_id=5470

From noreply at sourceforge.net  Thu May 25 16:31:23 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Thu, 25 May 2006 07:31:23 -0700
Subject: [Patches] [ python-Patches-936813 ] fast modular exponentiation
Message-ID: <E1FjGrz-0003lL-Dz@sc8-sf-web4-b.sourceforge.net>

Patches item #936813, was opened at 2004-04-17 08:16
Message generated for change (Settings changed) made by jafo
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=936813&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
>Category: Performance
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Trevor Perrin (trevp)
Assigned to: Tim Peters (tim_one)
Summary: fast modular exponentiation

Initial Comment:

For crypto-sized numbers, Python mod-exp is several
times slower than GMP or OpenSSL (6x or more).  Those
libraries do crazy special-case stuff, + assembly,
platform-specific tuning, and so on.

However, there's some low-hanging fruit: this patch has
a few basic optimizations giving a 2-3x speedup for
numbers in the 1000-8000 bit range (that's what I've
mostly tested; but the patch should improve, or at
least not hurt, everything else):

 - x_mul() is special-cased for squaring, which is
almost twice as fast as general multiplication.
 
 - x_mul() uses pointers instead of indices for
iteration, giving ~10% speedup (under VC6).
 
 - the right-to-left square-and-multiply exponentiation
algorithm is replaced with a left-to-right
square-and-multiply, which takes advantage of small bases.
 
 - when the exponent is above a certain size, "k-ary"
exponentiation is used to reduce the number of
multiplications via precalculation.
 
 - when the modulus is odd, Montgomery reduction is used.

 - the Karatsuba cutoff seems too low.  For
multiplicands in the range of 500-5000 bits, Karatsuba
slows multiplication by around ~25% (VC6sp4, Intel P4M
1.7 Ghz).  For larger numbers, the benefits of
Karatsuba are less than they could be.
 
 Currently, the cutoff is 35 digits (525 bits).  I've
tried 70, 140, 280, and 560.  70, 140, and 280 are
roughly the same: they don't slow down small values,
and they have good speedup on large ones.  560 is not
quite as good for large values, but at least it doesn't
hurt small ones.
 
I know this is platform-dependent, but I think we
should err on the side of making the cutoff too high
and losing some optimization, instead of putting it too
low and slowing things down.  I suggest 70.
 

A couple misc. things:

 - Negative exponents with a modulus no longer give an
error, when the base is coprime with the modulus. 
Instead, it calculates the multiplicative inverse of
the base with respect to the modulus, using the
extended euclidean algorithm, and exponentiates that.
 
 Libraries like GMP and LibTomMath work the same way. 
Being able to take inverses mod a number is useful for
cryptography (e.g. RSA, DSA, and Elgamal).
 
 - The diff includes patch 923643, which supports
converting longs to byte-strings.  Ignore the last few
diff entries, if you don't want this.
 
 - I haven't looked into harmonizing with int_pow(). 
Something may have to be done.

----------------------------------------------------------------------

Comment By: Trevor Perrin (trevp)
Date: 2005-09-29 06:29

Message:
Logged In: YES 
user_id=973611

I updated this patch to CVS head, but didn't change it
otherwise.  It's still a bit hairy.  However, it's also
still a big speedup (see benchmarks from 2004-10-03).

If I can do anything to help this make it in 2.5, let me know.

----------------------------------------------------------------------

Comment By: Trevor Perrin (trevp)
Date: 2004-10-05 07:25

Message:
Logged In: YES 
user_id=973611

Montgomery has a fixed cost, so it slows down small
exponents. For example modular squaring is slowed ~5x.  I
added a MONTGOMERY_CUTOFF to take care of this.  Submitting
long_mont4.diff.

----------------------------------------------------------------------

Comment By: Trevor Perrin (trevp)
Date: 2004-10-04 07:48

Message:
Logged In: YES 
user_id=973611


oops.  Good thing for random testing, carry propagation was
buggy.  Submitting long_mont3.diff.

----------------------------------------------------------------------

Comment By: Trevor Perrin (trevp)
Date: 2004-10-04 05:43

Message:
Logged In: YES 
user_id=973611

I did more code review, testing, and timing.  The only
change in this new patch (long_mont2.diff) is a couple
"int"s were changed to "digits"s, and it's against CVS head.

As far as testing, I used the random module and GMPY to
check it on ~3 million random input values.  That's about an
hour of testing.  I'll leave the tests running for a few
days and see if anything crops up.

As far as timing, I updated the benchmarks with a new
machine (OpenBSD):
http://trevp.net/long_pow/
On 3 different machines, Montgomery gives a speedup of 2x,
3x, and 4x.  That dwarfs what we've done so far, so I'm
crossing my fingers for 2.4.  Let me know if I can explain
or improve the code, or anything..  

(The below crypto library comes with a "book" which has an 
explanation of Montgomery I found helpful):
http://math.libtomcrypt.org/download.html

----------------------------------------------------------------------

Comment By: Trevor Perrin (trevp)
Date: 2004-09-13 08:20

Message:
Logged In: YES 
user_id=973611

Here's the 3rd part of the patch (long_mont.diff; Montgomery
Reduction), diff'd against 2.4a3 and cleaned up a bit.

Note that this doesn't include negative exponent handling. 
If this patch is accepted, I'll make a new tracker item for
that, since it's not an optimization, just an "opportunistic
feature" (it builds on one of the helper functions needed
for Montgomery).

----------------------------------------------------------------------

Comment By: Tim Peters (tim_one)
Date: 2004-08-30 02:47

Message:
Logged In: YES 
user_id=31435

Same deal with the 2nd part of the patch (major format 
changes, minor code changes).  Incidentally fixed an old leak 
bug in long_pow() during the review.  Added code to raise a 
compile-time error (C) if SHIFT isn't divisible by 5, and 
removed long_pow's new hardcoded assumption that SHIFT is 
exactly 15.

Include/longintrepr.h 2.16
Misc/NEWS 1.1120
Objects/longobject.c 1.163

This is cool stuff (& thank you!), but I'm sorry to say I can't 
foresee making time for the 3rd part of the patch for weeks.

----------------------------------------------------------------------

Comment By: Tim Peters (tim_one)
Date: 2004-08-29 22:21

Message:
Logged In: YES 
user_id=31435

Checked in the first part of the patch, with major format 
changes (Python's C coding standard is hard 8-column tabs), 
and minor code changes:

Include/longintrepr.h 2.15
Misc/ACKS 1.280
Misc/NEWS 1.1119
Objects/longobject.c 1.162

I don't know whether it's possible for me to get to part 2 of 
the patch before 2.4a3, but I would like to.  It seems plainly 
impossible that I'll be able to get to part 3 before 2.4a3.

----------------------------------------------------------------------

Comment By: Trevor Perrin (trevp)
Date: 2004-07-22 08:39

Message:
Logged In: YES 
user_id=973611

Pragmatics isn't my strong suit... but I get your drift :-).
 I split it into 3 diffs:
 1) x_mul optimizations: (pointers instead of indices,
special-case squaring, changing Karatsuba cutoff)
 2) rewriting long_pow() for left-to-right 5-ary
 3) Montgomery reduction.  This also includes l_invmod(),
since it's necessary for Montgomery.

I've left out the code which exposes l_invmod() to the user
(and associated docs, tests, and intobject changes).  We
could slap that on afterwards or not...

Anyways, these are applied sequentially:
longobject.c + longobject1.diff = longobject1.c
longobject1.c + longobject2.diff = longobject2.c
longobject2.c + longobject2.diff = longobject3.c

Should I open new tracker items for them?

----------------------------------------------------------------------

Comment By: Tim Peters (tim_one)
Date: 2004-07-21 19:29

Message:
Logged In: YES 
user_id=31435

Pragmatics are a real problem here, Trevor.  I don't foresee 
being able to make a solid block of sufficient hours to give to 
reviewing this before Python 2.4 is history (which is why I've 
left this patch unassigned, BTW -- I just can't promise to 
make enough time).  So if nobody else can volunteer to 
review it, that alone is likely to leave the patch sitting here 
unapplied.

But there are several independent changes in this patch, and 
it *could* be broken into several smaller patches.  I tossed 
that bait out before, but you didn't bite.  You should <wink>.

----------------------------------------------------------------------

Comment By: Trevor Perrin (trevp)
Date: 2004-07-19 11:00

Message:
Logged In: YES 
user_id=973611


Tim, thanks for the feedback.  I'm uploading a new patch
against CVS latest that fixes those issues, and adds docs
and tests.  Also, I cleaned up the code quite a bit, and got
it properly handling (I hope) all the varied combinations of
ints/longs, positives/negatives/zeros etc..

Unfortunately, Montgomery is the bulk of the speedup:
http://trevp.net/long_pow/

But I could split out the negative exponent handling into a
separate patch, if that would be easier.

Anyways, I'd like to add more tests for the exponentiation
stuff.  Aside from that, I think the patch is complete.  And
more robust than previously, though I still wouldn't trust
it until another person or two gives it a serious
looking-over....

----------------------------------------------------------------------

Comment By: Tim Peters (tim_one)
Date: 2004-07-17 03:06

Message:
Logged In: YES 
user_id=31435

Notes after a brief eyeball scan:

Note that the expression

a & 1 == 1

groups as

a & (1 == 1)

in C -- comparisons have higher precedence in C than bit-
fiddling operators.  Stuff like that is usually best resolved by 
explicitly parenthesizing any "impure" expression fiddling with 
bits.  In this case, in a boolean expression plain

a & 1

has the hoped-for effect. and is clearer anyway.

Would be better to use "**" than "^" in comments when 
exponentiation is intended, since "^" means xor in both 
Python and C.

Doc changes are needed, because you're changing visible 
semantics in some cases.

Tests are needed, especially for new semantics.

l_invmod can return NULL for more than one reason, but one 
of its callers ignores this, assuming that all NULL returns are 
due to lack of coprimality.  It's unreasonable to, e.g., replace 
a MemoryError with a complaint about coprimality; this needs 
reworking.  l_invmod should probably set an exception in 
the "not coprime" case.  As is, it's a weird function, 
sometimes setting an exception when it returns NULL, but not 
setting one when coprimality doesn't obtain.  That makes life 
difficult for callers (which isn't apparent in the patch, 
because its callers are currently ignoring this issue).

The Montgomery reduction gimmicks grossly complicate this 
patch -- they're long-winded and hard to follow.  That may 
come with the territory, but it's the only part of the patch 
that made me want to vomit <wink>.  I'd be happier if it 
weren't there, for aesthetic, clarity, and maintainability 
reasons.   How much of a speedup does it actually buy?

You're right that int pow must deliver the same results as 
long pow, so code is needed for that too.  "short int" 
versus "unbounded int" is increasingly meant to be an invisible 
internal implementation detail in Python.  I'm also in favor of 
giving this meaning to modular negative exponents, btw, so 
no problem with that.  An easy way would be to change int 
pow to delegate to long pow when this is needed.

Pragmatics:  there's a better chance of making 2.4 if the 
patch were done in bite-size stages.  For example, no doc 
changes are needed to switch to 5-ary left-to-right 
exponentation, and that has no effect on the int 
implementation either, etc.  A patch that did just that much 
probably would have gone in a long time ago.

----------------------------------------------------------------------

Comment By: Trevor Perrin (trevp)
Date: 2004-07-13 08:04

Message:
Logged In: YES 
user_id=973611

Uploading 2nd version of longobject.diff - the only change
is that patch 923643 is removed from this diff.  That was a
diff for converting longs to byte-strings, which I
unnecessarily left in.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=936813&group_id=5470

From noreply at sourceforge.net  Thu May 25 16:32:05 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Thu, 25 May 2006 07:32:05 -0700
Subject: [Patches] [ python-Patches-1494140 ] Documentation for new Struct
	object
Message-ID: <E1FjGsf-0002sO-KN@sc8-sf-web1.sourceforge.net>

Patches item #1494140, was opened at 2006-05-24 05:26
Message generated for change (Comment added) made by etrepum
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1494140&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Documentation
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Bob Ippolito (etrepum)
Assigned to: Nobody/Anonymous (nobody)
Summary: Documentation for new Struct object

Initial Comment:
The performance enhancements to the struct module (patch #1493701) 
are implemented by having a Struct object, which is a compiled structure. 
This text file documents these new struct objects.


----------------------------------------------------------------------

>Comment By: Bob Ippolito (etrepum)
Date: 2006-05-25 10:32

Message:
Logged In: YES 
user_id=139309

That's clearly a typo. I've attached a new version of the patch that removes those 
two letters.

----------------------------------------------------------------------

Comment By: Jim Jewett (jimjjewett)
Date: 2006-05-24 17:03

Message:
Logged In: YES 
user_id=764593

Shouldn't self.size be the number of bytes required to *pack
* the structure?  The number required to *unpack* seems 
like it ought to include tuple overhead and such...


----------------------------------------------------------------------

Comment By: Bob Ippolito (etrepum)
Date: 2006-05-24 11:35

Message:
Logged In: YES 
user_id=139309

New patch attached, fixed unpack documentation, added unpack_from method.

----------------------------------------------------------------------

Comment By: Bob Ippolito (etrepum)
Date: 2006-05-24 10:54

Message:
Logged In: YES 
user_id=139309

Hold up on this patch, I need to revise it.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1494140&group_id=5470

From noreply at sourceforge.net  Thu May 25 16:36:49 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Thu, 25 May 2006 07:36:49 -0700
Subject: [Patches] [ python-Patches-813436 ] Scalable zipfile extension
Message-ID: <E1FjGxF-0002H7-Ve@sc8-sf-web2.sourceforge.net>

Patches item #813436, was opened at 2003-09-27 08:09
Message generated for change (Comment added) made by jafo
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=813436&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
>Category: Performance
Group: Python 2.5
>Status: Closed
Resolution: None
Priority: 5
Submitted By: Marc De Falco (deufeufeu)
Assigned to: Nobody/Anonymous (nobody)
Summary: Scalable zipfile extension

Initial Comment:
Playing around with large zipfiles (&gt; 10000 files),
I've encountered big loading time, even if after having
loaded it I use only 30 files in it.
So I've introduced a differed parameter to the
Zipfile.__init__ in order to load headers on-demand.
As it's not a really good idea to activated it for all
zip it defaults to False.
I've updated the documentation too.

Thx and keep the good work ;)

P.S. : Dunno if it can be added to 2.3 or have to be
included in 2.4, so I've choosed 2.4 group.


----------------------------------------------------------------------

>Comment By: Sean Reifschneider (jafo)
Date: 2006-05-25 14:36

Message:
Logged In: YES 
user_id=81797

There is a summer of code project to re-write the zipfile
module, so this patch is moot.

Sean

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=813436&group_id=5470

From noreply at sourceforge.net  Thu May 25 16:36:51 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Thu, 25 May 2006 07:36:51 -0700
Subject: [Patches] [ python-Patches-1087418 ] long int bitwise ops speedup
	(patch included)
Message-ID: <E1FjGxH-0003yG-NT@sc8-sf-web1.sourceforge.net>

Patches item #1087418, was opened at 2004-12-18 05:22
Message generated for change (Settings changed) made by gbrandl
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1087418&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: None
Group: None
Status: Open
Resolution: None
Priority: 4
Submitted By: Gregory Smith (gregsmith)
Assigned to: Nobody/Anonymous (nobody)
Summary: long int bitwise ops speedup (patch included)

Initial Comment:

The 'inner loop' for applying bitwise ops to longs is quite
inefficient.

The improvement in the attached diff is
 - 'a' is never shorter than 'b' (result: only test 1
   loop index condition instead of 3)
 - each operation ( & | ^ ) has its own loop, instead
   of switch inside loop
- I found that, when this is done, a lot
of things can be simplified, resulting in further speedup,
and the resulting code is not very much longer than
before (my libpython2.4.dll  .text got 140 bytes longer).

Operations on longs of a few thousand bits appear
to be 2 ... 2.5 times faster with this patch.
I'm not 100% sure the code is right, but it passes
test_long.py, anyway.


----------------------------------------------------------------------

Comment By: Gregory Smith (gregsmith)
Date: 2005-02-11 03:45

Message:
Logged In: YES 
user_id=292741

I started by just factoring out the inner switch loop. But then
it becomes evident that when op = '^', you always have
maska == maskb, so there's no point in doing the ^mask at all.
And when op == '|', then maska==maskb==0. So likewise.
And if you put a check in so that len(a) >= len(b), then the
calculation of len_z can be simplified. It also becomes easy
to break the end off the loops, so that, say, or'ing a small
number with a really long becomes mostly a copy. etc.
It's was just a series of small simple changes following
from the refactoring of the loop/switch. 

I see a repeatable 1.5 x speedup at 300 bits, which
I think is significant (I wasn't using negative #s, which
of course have their own extra overhead). The difference
should be even higher on CPUs that don't have several
100 mW of branch-prediction circuitry.

One use case is that you can simulate an array
of hundreds or thousands of simple 1-bit processors
in pure python using long operations, and get very
good performance, even better with this fix. This app
involves all logical ops, with the occasional shift.


IMHO, I don't think the changed code is more complex; it's a
little longer, but it's more explicit in what is really
being done, and it doesn't roll together 3 cases, which
don't really have that much in common, for the sake of
brevity.  It wasn't obvious to
me about the masks being redundant until after I did the
factoring, and this is my point - rolling it together hides
that.
The original author may not have noticed the redundancy.

 I see a lot of effort being expended on very complex
multiply operations, why should the logical ops be left
behind for
the sake of a few lines?


----------------------------------------------------------------------

Comment By: Raymond Hettinger (rhettinger)
Date: 2005-01-07 06:54

Message:
Logged In: YES 
user_id=80475

Patch Review
------------

On Windows using MSC 6.0, I could only reproduce about a
small speedup at around 300 bits. 

While the patch is short, it adds quite a bit of complexity
to the routine.  Its correctness is not self-evident or
certain.  Even if correct, it is likely to encumber future
maintenance.

Unless you have important use cases and feel strongly about
it, I think this one should probably not go in.

An alternative to submit a patch that limits its scope to
factoring  out the innermost switch/case.  I tried that and
found that the speedup is microscopic.  I suspect that that
one unpredictable branch is not much of a bottleneck.  More
time is likely spent on creating z.

----------------------------------------------------------------------

Comment By: Gregory Smith (gregsmith)
Date: 2005-01-03 19:54

Message:
Logged In: YES 
user_id=292741

I originally timed this on a cygwin system, I've since found
that cygwin timings tend to be strange and possibly
misleading. On a RH8 system, I'm seeing speedup of x3.5 with
longs of ~1500 bits and larger, and x1.5 speedup with only
about 300 bits. Times were measured with timeit.Timer(
'a|b', 'a=...; b=...')
Increase in .text size is likewise about 120 bytes.


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1087418&group_id=5470

From noreply at sourceforge.net  Thu May 25 16:38:15 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Thu, 25 May 2006 07:38:15 -0700
Subject: [Patches] [ python-Patches-813436 ] Scalable zipfile extension
Message-ID: <E1FjGyd-00039t-0M@sc8-sf-web3.sourceforge.net>

Patches item #813436, was opened at 2003-09-27 08:09
Message generated for change (Comment added) made by jafo
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=813436&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Performance
Group: Python 2.5
>Status: Open
Resolution: None
Priority: 5
Submitted By: Marc De Falco (deufeufeu)
Assigned to: Nobody/Anonymous (nobody)
Summary: Scalable zipfile extension

Initial Comment:
Playing around with large zipfiles (&gt; 10000 files),
I've encountered big loading time, even if after having
loaded it I use only 30 files in it.
So I've introduced a differed parameter to the
Zipfile.__init__ in order to load headers on-demand.
As it's not a really good idea to activated it for all
zip it defaults to False.
I've updated the documentation too.

Thx and keep the good work ;)

P.S. : Dunno if it can be added to 2.3 or have to be
included in 2.4, so I've choosed 2.4 group.


----------------------------------------------------------------------

>Comment By: Sean Reifschneider (jafo)
Date: 2006-05-25 14:38

Message:
Logged In: YES 
user_id=81797

Actually, we'll leave it open until the Summer of Code
implementation is completed and accepted.

Sean

----------------------------------------------------------------------

Comment By: Sean Reifschneider (jafo)
Date: 2006-05-25 14:36

Message:
Logged In: YES 
user_id=81797

There is a summer of code project to re-write the zipfile
module, so this patch is moot.

Sean

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=813436&group_id=5470

From noreply at sourceforge.net  Thu May 25 16:39:05 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Thu, 25 May 2006 07:39:05 -0700
Subject: [Patches] [ python-Patches-738094 ] for i in range(N) optimization
Message-ID: <E1FjGzR-0004QO-LH@sc8-sf-web1.sourceforge.net>

Patches item #738094, was opened at 2003-05-15 07:14
Message generated for change (Settings changed) made by jafo
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=738094&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
>Category: Performance
Group: Python 2.3
Status: Open
Resolution: Later
Priority: 2
Submitted By: Sebastien Keim (s_keim)
Assigned to: Guido van Rossum (gvanrossum)
Summary: for i in range(N) optimization

Initial Comment:
This patch is intended to special case the built-in
range function in the common &quot;for i in range(...):&quot;
construct. The goal is to make range() return an
iterator instead of creating a real list, and then
being able to depreciate the xrange type.

It has once been suggested to make the compiler aware
of the 
&quot;for i in range(N):&quot; construct and to make it able to
produce optimized bytecode. But this solution is really
hard to achieve because you have to
ensure that the range built-in is not overridden.

The patch take an opposite approach: it let the range
built-in function looks at its execution context, and
return an iterator if the next frame opcode to be
executed is the GET_ITER opcode.

Speed increase for the piece of code &quot;for i in
range(N): pass&quot; : 
 N  (speed gain)
 10 (+ 64%)
 100 (+ 29%)
 1000 (+ 23%)
 10000	(+ 68%)
 100000 (+108%)

Since the patch only affect a small construct of the
language, performance improvements for real
applications are less impressive but they are still
interesting:
pystone.py       (+7%)
test_userstring.py (+8%)
test_datetime.py   (+20%)

Note that the performance loss for &quot;A = range(10)&quot; is
not measurable (less than 1%).

If the patch is accepted, the same recipe may be
applicable in some few other places. So the
Py_IsIterationContext function must probably live
somewhere else (is there a standard location for
byte-code dependent stuff?). Maybe other opcodes (for
sample JUMP_IF_FALSE) could provide other useful
specialization contexts.

----------------------------------------------------------------------

Comment By: Georg Brandl (birkenfeld)
Date: 2005-03-03 14:48

Message:
Logged In: YES 
user_id=1188172

I don't know if I fully understand, but doesn't it suffice
to just use xrange()?

----------------------------------------------------------------------

Comment By: Armin Rigo (arigo)
Date: 2004-01-11 11:07

Message:
Logged In: YES 
user_id=4771

Here is a safer patch.  It adds a keyword argument 'iter' to range(), e.g.:

>>> range(10, iter=True)
<rangeiterator object at xxx>

and using an appropriate METH_XXX flag, the CALL_FUNCTION opcode now inserts a 'iter=True' keyword to the call when it is followed by GET_ITER.

The patch doesn't live up to its performance promizes.  I don't get any improvement at all on any real application.  The only example it accelerates is a set of three nested loops :-(

I still attach it for reference, and if someone else want to play with it.

----------------------------------------------------------------------

Comment By: Guido van Rossum (gvanrossum)
Date: 2003-07-09 16:22

Message:
Logged In: YES 
user_id=6380

In the sake of stability for Python 2.3's accelerated
release schedule, I'm postponing this until after 2.3.

I'm also skeptical that it ca be absolutely correct.
What if there is Python code of the form

    for i in some_function(): ...

where some_function() is a C extension that at some
point invokes range(), directly from C. Then when
range() peeks in the opcode stream, it would believe
that it was being called in the place of some_function().

So maybe I should just reject it as unsafe?


----------------------------------------------------------------------

Comment By: Guido van Rossum (gvanrossum)
Date: 2003-05-18 21:18

Message:
Logged In: YES 
user_id=6380

I'm interested, but have to ponder more, which will have to
wait until I'm back from vacation.

I expect that any hope to deprecate xrange() will prove
naive -- people will want to pass ranges around between
functions or reuse them (e.g. this happens a lot in timing
tests). Maybe in Python 3.0 I can make range() act as an
iterator generator. You'd have to say list(range(N)) to get
an actual list then.

----------------------------------------------------------------------

Comment By: Raymond Hettinger (rhettinger)
Date: 2003-05-17 23:25

Message:
Logged In: YES 
user_id=80475

Assigning to Guido to see whether he is interested because 
it makes xrange less necessary or whether he thinks it is a 
horrendous hack --or maybe both ;-)

----------------------------------------------------------------------

Comment By: Sebastien Keim (s_keim)
Date: 2003-05-15 15:14

Message:
Logged In: YES 
user_id=498191

I have also thought about slicing, map and filter which
could all be replaced by  itertools equivalents , but I have
failed to  find a way to ensure that the argument lists
aren't mutated during the for loop.

Maybe it could be interesting to investigate into copy on
write semantic for lists objects?

----------------------------------------------------------------------

Comment By: Raymond Hettinger (rhettinger)
Date: 2003-05-15 14:33

Message:
Logged In: YES 
user_id=80475

zip() would benefit greatly from your approach.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=738094&group_id=5470

From noreply at sourceforge.net  Thu May 25 16:47:56 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Thu, 25 May 2006 07:47:56 -0700
Subject: [Patches] [ python-Patches-1454481 ] Make thread stack size runtime
	tunable
Message-ID: <E1FjH80-0005O0-Ne@sc8-sf-web1.sourceforge.net>

Patches item #1454481, was opened at 2006-03-20 23:37
Message generated for change (Comment added) made by aimacintyre
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1454481&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Andrew I MacIntyre (aimacintyre)
Assigned to: Nobody/Anonymous (nobody)
Summary: Make thread stack size runtime tunable

Initial Comment:
Platform default thread stack sizes vary considerably.
Some are very generous (Win32: usually 1MB; Linux: 1MB,
sometimes 8MB).  Others are not (FreeBSD: 64k).

Some platforms have restricted virtual address space
OS/2: 512M less overhead) which makes hard coding a
generous default thread stack size problematic.  Some
platforms thread commit stack address space, even
though the memory backing it may not be committed
(Windows, OS/2 at least).

Some applications have a thirst for stack space in
threads (Zope). Some programmers want to be able to use
lots of threads, even in the face of sound advice about
the lack of wisdom in this approach.

The current approach to stack space management in
threads in Python uses a hard coded strategy, relying
on the platform having a useful default or relying on
the system administrator or distribution builder
over-riding the default at compile time.

This patch is intended to allow developers some control
over managing this resource from within Python code by
way of a function in the thread module.  As written, it
is not intended to provide unlimited flexibility; that
would probably require exposing the underlying
mechanism as an option on the creation of each thread.

An alternative approach to providing the functionality
would be to use an environment variable to provide the
information to the thread module.  This has its pros
and cons, in terms of flexibility and ease of use, and
could be complementary to the approach implemented.

The patch has been tested on OS/2 and FreeBSD 4.8.  I
have no means of testing the code on Win32 or Linux,
though Linux is a pthread environment as is FreeBSD. 
Code base is SVN head from a few hours ago. A doc 
update is included.

While I would like to see this functionality in Python
2.5, it is not a critical issue.

Critique of the approach and implementation welcome. 
Something not addressed is the issue of tests,
primarily because I haven't been able to think of a
viable testing strategy - I'm all ears to suggestions
for this.

----------------------------------------------------------------------

>Comment By: Andrew I MacIntyre (aimacintyre)
Date: 2006-05-26 00:47

Message:
Logged In: YES 
user_id=250749

Ok, v3 includes the additions to the threading module, tests
in both test_thread and test_threading and docs in both
thread and threading modules (duplicated as I don't know how
to do the LaTex linking).

If there are no other issues needing to be addressed, I
propose to check these changes in sometime on the weekend of
June 3-4 or thereabouts to get in a bit before the beta release.

----------------------------------------------------------------------

Comment By: Tim Peters (tim_one)
Date: 2006-04-24 06:25

Message:
Logged In: YES 
user_id=31435

Right, this one: "a simple shadow of the function as a
module level function".  If it affects all threads (which it
does), then a module function is a natural place for it.  If
I a saw a method on the Thread class, the most natural (to
me ;-)) assumption is that a_thread.stack_size(N) would set
the stack size for the specific thread `a_thread`, but not
affect other threads.  Part of what makes that "the most
natural" assumption is that Thread has no class or static
methods today.  As a module-level function, no such
confusion is sanely possible.

Sticking "stack_size" in threading.__all__, and adding

from thread import stack_size

to threading.py is all I'm looking for here.  Well, plus
docs and a test case ;-)

----------------------------------------------------------------------

Comment By: Andrew I MacIntyre (aimacintyre)
Date: 2006-04-23 15:35

Message:
Logged In: YES 
user_id=250749

Thanks Tim.

My default action is to try and match the prevailing style,
but  cut'n'paste propagated the flaw.  thread_pthread.h was
clean AFAICS, so I'll do a style normalisation (as a
separate checkin) on thread_nt.py and thread_os2.h when
commit time comes.

As an "implementation detail", I hadn't considered that
exposing it via threading was appropriate.

I can see 2 approaches:
- a simple shadow of the function as a module level function;
or
- a classmethod of the Thread class.

Any hints on which would be the more preferable or natural
approach?

----------------------------------------------------------------------

Comment By: Tim Peters (tim_one)
Date: 2006-04-22 15:46

Message:
Logged In: YES 
user_id=31435

The patch applies cleanly on WinXP, "and works" (I checked
this by setting various stack sizes, spawning a thread doing
nothing but a raw_input(), and looking at the VM size under
Task Manager while the thread was paused waiting for input
-- the VM size went up each time roughly by the stack-size
increase; finally set stack_size to 0 again, and all the
"extra" VM went away).

Note that Python C style for defining functions puts the
function name in the first column.  For example,

"""
static int
_pythread_nt_set_stacksize(size_t size)
"""

instead of

"""
static int _pythread_nt_set_stacksize(size_t size)
"""

The patch isn't consistent about this, and perhaps it's
errenously ;-) aping bad style in surrounding function
definitions.

This should really be exposed via threading.py.  `thread` is
increasingly "just an implementation detail" of `threading`,
and it actually felt weird to me to write a test program
that had to import `thread`.

----------------------------------------------------------------------

Comment By: Andrew I MacIntyre (aimacintyre)
Date: 2006-04-14 22:51

Message:
Logged In: YES 
user_id=250749

I have updated the patch along the lines Martin suggested.

I have omitted OS/2 from the list of supported platforms in
the doc patch as I haven't added OS/2 to anywhere else in
the docs.  My thinging has been that OS/2 is a 2nd tier
platform, and I have kept an extensive port README file in
the build directory (PC/os2emx) documenting port specific
behaviour.

The idea with the environment variable version was that it
would be less "intrusive" a change from the user POV.

----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2006-04-11 01:09

Message:
Logged In: YES 
user_id=21627

re 1) Currently, the usage of the stacksize attribute is
depending on the definition of a THREAD_STACK_SIZE macro. I
don't know where that comes from, but I guess whoever
defines it knows what he is doing, so that the stacksize
attribute is defined on such a system.

re 2) I can accept that Python enforces a minimum above
PTHREAD_STACK_MIN; it shouldn't be possible to set the stack
size below PTHREAD_STACK_MIN, since that *will* fail when a
thread is created.

-1 for an environment variable version. What problem would
that solve? If this patch gets implemented, applications can
define their own environment variables if they think it
helps, and users/admins can put something in
sitecustomize.py if they think there should be an
environment variable controlling the stack size for all
Python applications on the system.

----------------------------------------------------------------------

Comment By: Andrew I MacIntyre (aimacintyre)
Date: 2006-04-11 00:45

Message:
Logged In: YES 
user_id=250749

1) wrt _POSIX_THREAD_ATTR_STACKSIZE, I'll look at that
(though I note its absence from the existing code...)

2) PTHREAD_STACK_MIN on FreeBSD is 1k, which seemed grossly
inadequate for Python (my impression is that 20-32k is a
fairly safe minimum for Python).  In principle I don't have
a problem 
with relying on PTHREAD_STACK_MIN, except for trying to play
it safe.  Any further thoughts on this?

I'm also putting together an environment variable only
version of the patch, with a view to getting that in first,
and reworking this patch to work on top of that.

Thanks for the comments.

----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2006-04-10 23:41

Message:
Logged In: YES 
user_id=21627

Usage of pthread_attr_setstacksize should be conditional on
the definition of _POSIX_THREAD_ATTR_STACKSIZE, according to
POSIX. Errors from pthread_attr_setstacksize should be
reported (POSIX lists EINVAL as a possible error).

I think PTHREAD_STACK_MIN should be considered. 

The documentation should list availibility of the feature,
currently Win32, OS/2, and POSIX threads (with the TSS
option, to be precise). If some platforms have specific
additional requirements on the possible values (eg. must be
a multiple of the page size), these should be documented, as
well.

Apart from that, the patch looks fine.

----------------------------------------------------------------------

Comment By: Andrew I MacIntyre (aimacintyre)
Date: 2006-03-22 19:28

Message:
Logged In: YES 
user_id=250749

Thanks for the comments.

As implemented, the function is both a getter and
(optionally) a setter which makes attempting to use a
"get"/"set" prefix 
awkward.

I chose this approach to make it a little simpler to support
temporary changes.  I did consider using a module
attribute/variable, but it is slightly more unwieldy for
this case:

old_size = thread.stack_size(new_size)
...
thread.stack_size(old_size)

vs

old_size = thread.stack_size
thread.stack_size = new_size
...
thread.stack_size = old_size

or (using get/set accessors)

old_size = thread.get_stacksize()
thread.set_stacksize(new_size)
...
thread.set_stacksize(old_size)

I think an argument can be made for passing on the
"get"/"set" naming consistency based on the guidelines in
PEP 8.  While I have a preference for what I've implemented,
I'm more interested in getting the functionality in than
debating its decor.  If there's a strong view about these 
issues, I'm prepared to revise the patch accordingly.

I don't believe that the functionality belongs anywhere else
than the thread module, except possibly shadowing it in the
threading module, as it is highly specific to thread
support.  The sys module seems more appropriate for general 
knobs, and only for specific knobs when there is no other
choice IMO.  Doing it outside the thread module also
complicates the implementation, which I was trying to keep
as simple as I could.


----------------------------------------------------------------------

Comment By: Hye-Shik Chang (perky)
Date: 2006-03-21 00:58

Message:
Logged In: YES 
user_id=55188

I'm all for this!  The FreeBSD port have maintained a local
patch to bump THREAD_STACK_SIZE.  The patch will lighten
FreeBSD users' burden around thread stack size.

BTW, the naming, "thread.stack_size" seems to miss a verb
while all the other functions on the thread module have it.
 How about set_stack_size() or set_stacksize()?  Or, how
about in sys module?

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1454481&group_id=5470

From noreply at sourceforge.net  Thu May 25 20:47:19 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Thu, 25 May 2006 11:47:19 -0700
Subject: [Patches] [ python-Patches-921466 ] Reduce number of open calls on
	startup
Message-ID: <E1FjKrf-0007n4-FX@sc8-sf-web1.sourceforge.net>

Patches item #921466, was opened at 2004-03-23 00:10
Message generated for change (Comment added) made by gbrandl
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=921466&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Martin v. L??wis (loewis)
Assigned to: Nobody/Anonymous (nobody)
Summary: Reduce number of open calls on startup

Initial Comment:
This patch uses sys.path_importer_cache to reduce the
number of open calls, in the following way:
- if the value in path_importer_cache is None, it stats
the path to find out whether the file exists
- it then puts True/False into path_importer_cache
- if the value in path_importer_cache is False, the
path entry is skipped on all imports
- if the value is True, the stat call is skipped, and
open calls for files in the directory are made.

On Linux, this reduces the number of open calls for an
empty script from 343 to 263. The startup-time (for 100
interpreter invocations) goes down by one percent (from
0.0819s to 0.08113s per invocation).

----------------------------------------------------------------------

>Comment By: Georg Brandl (gbrandl)
Date: 2006-05-25 18:47

Message:
Logged In: YES 
user_id=849994

I reviewed this patch, in in consequence discovered a
problem with the sys.path_hooks machinery, described in
http://mail.python.org/pipermail/python-dev/2006-May/065173.html

This patch fixes the problem and corrects the original patch
to not set any sys.path_importer_cache entry to True or
False when no import hooks are enabled (the p_loader
argument to find_module is NULL then).

----------------------------------------------------------------------

Comment By: Georg Brandl (gbrandl)
Date: 2006-02-25 19:55

Message:
Logged In: YES 
user_id=849994

I'm very much for it. I haven't got too much RAM, and
whenever I start a Python program (emerge being the most
prominent example) after having worked heavily with e.g.
graphics or VMware, I'm hit by the files Python's opening
not being in the file cache anymore.

----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2006-02-20 21:51

Message:
Logged In: YES 
user_id=21627

Not sure. Anybody speaking in favour? against?

----------------------------------------------------------------------

Comment By: Georg Brandl (birkenfeld)
Date: 2006-02-20 10:42

Message:
Logged In: YES 
user_id=1188172

Can this go into 2.5?

----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2004-03-23 15:43

Message:
Logged In: YES 
user_id=21627

It's certainly the case that the system has cached all files needed for 
startup in memory, including the directory contents of all directories 
searched.

OTOH, I assume that is the scenario in which people worry about startup 
time: high-frequency invocations of python. For a single invocation, it 
shouldn't matter much whether it takes 0.04s or 0.08s.

----------------------------------------------------------------------

Comment By: Raymond Hettinger (rhettinger)
Date: 2004-03-23 07:30

Message:
Logged In: YES 
user_id=80475

I am surprised that making 25% fewer open calls doesn't save
more than 1% in startup time.

One other thought, I wonder if the timing of these changes
is affected by the OS keeping recently loaded files in
buffers so that disk access time not included.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=921466&group_id=5470

From noreply at sourceforge.net  Thu May 25 21:54:23 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Thu, 25 May 2006 12:54:23 -0700
Subject: [Patches] [ python-Patches-1087418 ] long int bitwise ops speedup
	(patch included)
Message-ID: <E1FjLuZ-0001GW-7w@sc8-sf-web2.sourceforge.net>

Patches item #1087418, was opened at 2004-12-18 00:22
Message generated for change (Comment added) made by tim_one
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1087418&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
>Category: Performance
Group: None
Status: Open
Resolution: None
Priority: 4
Submitted By: Gregory Smith (gregsmith)
>Assigned to: Tim Peters (tim_one)
Summary: long int bitwise ops speedup (patch included)

Initial Comment:

The 'inner loop' for applying bitwise ops to longs is quite
inefficient.

The improvement in the attached diff is
 - 'a' is never shorter than 'b' (result: only test 1
   loop index condition instead of 3)
 - each operation ( & | ^ ) has its own loop, instead
   of switch inside loop
- I found that, when this is done, a lot
of things can be simplified, resulting in further speedup,
and the resulting code is not very much longer than
before (my libpython2.4.dll  .text got 140 bytes longer).

Operations on longs of a few thousand bits appear
to be 2 ... 2.5 times faster with this patch.
I'm not 100% sure the code is right, but it passes
test_long.py, anyway.


----------------------------------------------------------------------

>Comment By: Tim Peters (tim_one)
Date: 2006-05-25 15:54

Message:
Logged In: YES 
user_id=31435

Assigned to me, changed Category to Performance.

----------------------------------------------------------------------

Comment By: Gregory Smith (gregsmith)
Date: 2005-02-10 22:45

Message:
Logged In: YES 
user_id=292741

I started by just factoring out the inner switch loop. But then
it becomes evident that when op = '^', you always have
maska == maskb, so there's no point in doing the ^mask at all.
And when op == '|', then maska==maskb==0. So likewise.
And if you put a check in so that len(a) >= len(b), then the
calculation of len_z can be simplified. It also becomes easy
to break the end off the loops, so that, say, or'ing a small
number with a really long becomes mostly a copy. etc.
It's was just a series of small simple changes following
from the refactoring of the loop/switch. 

I see a repeatable 1.5 x speedup at 300 bits, which
I think is significant (I wasn't using negative #s, which
of course have their own extra overhead). The difference
should be even higher on CPUs that don't have several
100 mW of branch-prediction circuitry.

One use case is that you can simulate an array
of hundreds or thousands of simple 1-bit processors
in pure python using long operations, and get very
good performance, even better with this fix. This app
involves all logical ops, with the occasional shift.


IMHO, I don't think the changed code is more complex; it's a
little longer, but it's more explicit in what is really
being done, and it doesn't roll together 3 cases, which
don't really have that much in common, for the sake of
brevity.  It wasn't obvious to
me about the masks being redundant until after I did the
factoring, and this is my point - rolling it together hides
that.
The original author may not have noticed the redundancy.

 I see a lot of effort being expended on very complex
multiply operations, why should the logical ops be left
behind for
the sake of a few lines?


----------------------------------------------------------------------

Comment By: Raymond Hettinger (rhettinger)
Date: 2005-01-07 01:54

Message:
Logged In: YES 
user_id=80475

Patch Review
------------

On Windows using MSC 6.0, I could only reproduce about a
small speedup at around 300 bits. 

While the patch is short, it adds quite a bit of complexity
to the routine.  Its correctness is not self-evident or
certain.  Even if correct, it is likely to encumber future
maintenance.

Unless you have important use cases and feel strongly about
it, I think this one should probably not go in.

An alternative to submit a patch that limits its scope to
factoring  out the innermost switch/case.  I tried that and
found that the speedup is microscopic.  I suspect that that
one unpredictable branch is not much of a bottleneck.  More
time is likely spent on creating z.

----------------------------------------------------------------------

Comment By: Gregory Smith (gregsmith)
Date: 2005-01-03 14:54

Message:
Logged In: YES 
user_id=292741

I originally timed this on a cygwin system, I've since found
that cygwin timings tend to be strange and possibly
misleading. On a RH8 system, I'm seeing speedup of x3.5 with
longs of ~1500 bits and larger, and x1.5 speedup with only
about 300 bits. Times were measured with timeit.Timer(
'a|b', 'a=...; b=...')
Increase in .text size is likewise about 120 bytes.


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1087418&group_id=5470

From noreply at sourceforge.net  Fri May 26 00:45:38 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Thu, 25 May 2006 15:45:38 -0700
Subject: [Patches] [ python-Patches-1359618 ] Speed charmap encoder
Message-ID: <E1FjOaI-0007xw-JX@sc8-sf-web5.sourceforge.net>

Patches item #1359618, was opened at 2005-11-18 03:00
Message generated for change (Comment added) made by jackdied
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1359618&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Performance
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Martin v. L??wis (loewis)
Assigned to: M.-A. Lemburg (lemburg)
Summary: Speed charmap encoder

Initial Comment:
This patch speeds up the charmap encoder by a factor of 4 to 5, using a 
trie structure instead of a dictionary; the speedup primarily comes from 
not creating integer objects in the process.

The trie is created by inverting the encoding map; the codec generator is 
changed to drop the encoding dictionary, and instead emit a function call 
to create the trie.

----------------------------------------------------------------------

Comment By: Jack Diederich (jackdied)
Date: 2006-05-25 18:45

Message:
Logged In: YES 
user_id=591932

Updated the patch as part of NeedForSpeed
(mainly Py_ssize_t changes and some rejected chunks)

Because the previous version on the trunk was
broken I'm not sure what to compare the results against ;)


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1359618&group_id=5470

From noreply at sourceforge.net  Fri May 26 00:55:53 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Thu, 25 May 2006 15:55:53 -0700
Subject: [Patches] [ python-Patches-1243730 ] Big speedup in email message
	parsing
Message-ID: <E1FjOkD-0007x3-7R@sc8-sf-web4-b.sourceforge.net>

Patches item #1243730, was opened at 2005-07-23 18:07
Message generated for change (Comment added) made by holdenweb
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1243730&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Performance
Group: Python 2.4
Status: Open
Resolution: None
Priority: 5
Submitted By: L. Peter Deutsch (lpd)
Assigned to: Barry A. Warsaw (bwarsaw)
Summary: Big speedup in email message parsing

Initial Comment:
Python 2.4.1, Red Hat Linux 7.3.

Speeds up message parsing on files with large
attachments by approximately 4x, mostly by replacing
REs by direct string processing.


----------------------------------------------------------------------

Comment By: Steve Holden (holdenweb)
Date: 2006-05-25 18:55

Message:
Logged In: YES 
user_id=88157

A first examinaation reveals no particular speedup on an
email with approximately 30 MB of attachments. Can the OP
perhaps provide some code and test data I could time to
verify the assertions of speedup? Otherwise I can't see much
point in applying the patch.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1243730&group_id=5470

From noreply at sourceforge.net  Fri May 26 04:50:24 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Thu, 25 May 2006 19:50:24 -0700
Subject: [Patches] [ python-Patches-1490384 ] PC new-logo-based icon set
Message-ID: <E1FjSPA-0000fN-BD@sc8-sf-web2.sourceforge.net>

Patches item #1490384, was opened at 2006-05-17 16:59
Message generated for change (Comment added) made by bobince
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1490384&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Windows
Group: Python 2.5
Status: Open
Resolution: Accepted
Priority: 8
Submitted By: Andrew Clover (bobince)
Assigned to: Martin v. L??wis (loewis)
Summary: PC new-logo-based icon set

Initial Comment:
Following positive discussion on -dev, here's the
updated version of the PC/py*.ico files I hacked up a
while ago.

The attachment is a ZIP, not a patch, as it contains
only binaries. Also available as tgz:

  http://doxdesk.com/img/software/py/win32-icons.tar.gz

Also possibly of interest:

  http://doxdesk.com/img/software/py/icons3.zip

This attachment contains only the simple replacement
files; the icons3 ZIP also contains:

  - source
  - versions including Windows Vista large icons
    (probably not worth including at this point as they're
    quite sizable and no-one is using Vista yet)
  - an egg icon
    (there is currently no installer/shell support for
eggs,
    but could be worth adding in future)
  - a new installer side banner
    (this has not currently seen any discussion on -dev,
    but may be worth considering if the intention is to
    leave behind the purple/green snake branding)


----------------------------------------------------------------------

>Comment By: Andrew Clover (bobince)
Date: 2006-05-26 02:50

Message:
Logged In: YES 
user_id=311085

> I put a demo installer containing them

Seems to work OK. The thanks at the end still attributes the
graphic to Erik though; I'm not after an ack there myself,
but changing the text to not imply the current graphic is
his one may be appropriate.

> baselogo.svg; I assume this is a source file

Yes. This is just the Python logo itself (the gradient
version as used on the new website), in vector format.

> icons.svgz; can't figure out what this is

Same as source.xar, but exported as W3C standard SVG format
for wider compatibility [compressed, hence the 'z'].

Unfortunately because SVG cannot reproduce some of effects
used, and because the SVG export path is currently quite
bad, it's not really directly usable, but it might be of use
to anyone who wants to hack on the graphics but doesn't use
Xara.

> source.xar; not sure either

This is the primary vector graphics source of the icons -
the other SVG and PNG files are just there because other
people requested them.

It's in Xara format, a previously proprietary graphics
application which has now gone open-source and is heading
rapidly towards being usable on Linux, but isn't quite there
yet.

> a directory called png, with many png file - I expect
> that these aren't source files, are they?

Nope, they're just exactly the same content as in the
(with-vista) .ico files, just supplied as PNG for anyone who
wants to fiddle with them in a more accessible bitmap format.


----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2006-05-22 08:56

Message:
Logged In: YES 
user_id=21627

Thanks for the patch. I have committed it as r46063. I put a
demo installer containing them at

http://www.dcl.hpi.uni-potsdam.de/home/loewis/python-2.5.13290.msi

I would also like to add the source files, but I have
difficulties figuring out what they are. There is a source
directory; with:

- baselogo.svg; I assume this is a source file
- icons.svgz; can't figure out what this is
- source.xar; not sure either
- a directory called png, with many png file - I expect
  that these aren't source files, are they?

----------------------------------------------------------------------

Comment By: Andrew Clover (bobince)
Date: 2006-05-19 14:21

Message:
Logged In: YES 
user_id=311085

Sure, no worries. I'll fax over the -python version since I
have ancient contributions to cover too.


----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2006-05-19 11:22

Message:
Logged In: YES 
user_id=21627

Thanks! Are you willing to contribute them to the PSF, under
the terms of the contributor agreement at

http://www.python.org/psf/contrib/contrib-form/

?

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1490384&group_id=5470

From noreply at sourceforge.net  Fri May 26 08:38:34 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Thu, 25 May 2006 23:38:34 -0700
Subject: [Patches] [ python-Patches-921466 ] Reduce number of open calls on
	startup
Message-ID: <E1FjVxy-0002li-Gm@sc8-sf-web3.sourceforge.net>

Patches item #921466, was opened at 2004-03-22 16:10
Message generated for change (Comment added) made by nnorwitz
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=921466&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Martin v. L??wis (loewis)
Assigned to: Nobody/Anonymous (nobody)
Summary: Reduce number of open calls on startup

Initial Comment:
This patch uses sys.path_importer_cache to reduce the
number of open calls, in the following way:
- if the value in path_importer_cache is None, it stats
the path to find out whether the file exists
- it then puts True/False into path_importer_cache
- if the value in path_importer_cache is False, the
path entry is skipped on all imports
- if the value is True, the stat call is skipped, and
open calls for files in the directory are made.

On Linux, this reduces the number of open calls for an
empty script from 343 to 263. The startup-time (for 100
interpreter invocations) goes down by one percent (from
0.0819s to 0.08113s per invocation).

----------------------------------------------------------------------

>Comment By: Neal Norwitz (nnorwitz)
Date: 2006-05-25 23:38

Message:
Logged In: YES 
user_id=33168

Without looking at the patch impl, I'm +1 on the idea of
reducing stat/open calls.  On NFS this is a huge time sync.

----------------------------------------------------------------------

Comment By: Georg Brandl (gbrandl)
Date: 2006-05-25 11:47

Message:
Logged In: YES 
user_id=849994

I reviewed this patch, in in consequence discovered a
problem with the sys.path_hooks machinery, described in
http://mail.python.org/pipermail/python-dev/2006-May/065173.html

This patch fixes the problem and corrects the original patch
to not set any sys.path_importer_cache entry to True or
False when no import hooks are enabled (the p_loader
argument to find_module is NULL then).

----------------------------------------------------------------------

Comment By: Georg Brandl (gbrandl)
Date: 2006-02-25 11:55

Message:
Logged In: YES 
user_id=849994

I'm very much for it. I haven't got too much RAM, and
whenever I start a Python program (emerge being the most
prominent example) after having worked heavily with e.g.
graphics or VMware, I'm hit by the files Python's opening
not being in the file cache anymore.

----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2006-02-20 13:51

Message:
Logged In: YES 
user_id=21627

Not sure. Anybody speaking in favour? against?

----------------------------------------------------------------------

Comment By: Georg Brandl (birkenfeld)
Date: 2006-02-20 02:42

Message:
Logged In: YES 
user_id=1188172

Can this go into 2.5?

----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2004-03-23 07:43

Message:
Logged In: YES 
user_id=21627

It's certainly the case that the system has cached all files needed for 
startup in memory, including the directory contents of all directories 
searched.

OTOH, I assume that is the scenario in which people worry about startup 
time: high-frequency invocations of python. For a single invocation, it 
shouldn't matter much whether it takes 0.04s or 0.08s.

----------------------------------------------------------------------

Comment By: Raymond Hettinger (rhettinger)
Date: 2004-03-22 23:30

Message:
Logged In: YES 
user_id=80475

I am surprised that making 25% fewer open calls doesn't save
more than 1% in startup time.

One other thought, I wonder if the timing of these changes
is affected by the OS keeping recently loaded files in
buffers so that disk access time not included.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=921466&group_id=5470

From noreply at sourceforge.net  Fri May 26 10:26:42 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Fri, 26 May 2006 01:26:42 -0700
Subject: [Patches] [ python-Patches-1446489 ] zipfile: support for ZIP64
Message-ID: <E1FjXec-0000Fd-TL@sc8-sf-web4-b.sourceforge.net>

Patches item #1446489, was opened at 2006-03-09 15:58
Message generated for change (Comment added) made by ronaldoussoren
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1446489&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Ronald Oussoren (ronaldoussoren)
Assigned to: Ronald Oussoren (ronaldoussoren)
Summary: zipfile: support for ZIP64

Initial Comment:
The attached patch implements support for ZIP64, that is zipfiles 
containing very large (>4GByte) files and zipfiles that are larger than
4GByte themselves. 

The output of this patch can be read by pkzip (see below for the actual 
version I used for testing).


----------------------------------------------------------------------

>Comment By: Ronald Oussoren (ronaldoussoren)
Date: 2006-05-26 10:26

Message:
Logged In: YES 
user_id=580910

I've attached yet another version, this version reintroduces some functionalitity 
that was unintentionally removed and fixes a lame bug that caused 
test_zipimport to fail.


----------------------------------------------------------------------

Comment By: Ronald Oussoren (ronaldoussoren)
Date: 2006-05-23 15:10

Message:
Logged In: YES 
user_id=580910

I've found some time to work on this. I've added zipfile-zip64-
version2.patch, this version:

* Makes zip64 behaviour optional (defaults to off because zip(1) doesn't 
support  zip64)

* Is significantly faster for large zipfiles because it doesn't scan the entire 
zipfile just to check that the file headers are consistent with the central 
directory w.r.t. filename (this check is now done when trying to read a file)

* Updates the reference documentation.

* Adds unittests. There are two sets of tests: one set tests the behaviour of 
zip64 extensions using small files by lowering the zip64 cutoff point and is 
run every time, the other set do tests with huge zipfiles and are run when the 
largefile feature is enabled when running the tests.

There one backward incompatible change: ZipInfo objects no longer have a 
file_offset attribute. That was the other reason for scanning the entire zipfile 
when opening it. IMNSHO this should have been a private attribute and the 
cost of this feature is not worth its *very* limited usefulness. As an indication 
of its cost: I got a 6x speedup when I removed the calculation of the 
file_offset attribute, something that adds up when you are dealing with huge 
zipfiles (I wrote this patch because I'm dealing with 10+GByte zipfiles with 
tens of thousands of files at work).

I noticed that zipfile raises RuntimeError in some places. I've changed one of 
those to zipfile.BadZipfile, but others remain. I don't like this, most of them 
should be replaced by TypeError or ValueError exceptions.

BTW. This patch also supports storing files >4GByte in the zipfile, but that 
feature isn't very useful because zipfile doesn't have an API for reading file 
data incrementally.

----------------------------------------------------------------------

Comment By: Ronald Oussoren (ronaldoussoren)
Date: 2006-05-16 09:55

Message:
Logged In: YES 
user_id=580910

I haven't had time to work on this, all time I had to work on python related stuff 
has been eaten by finishing PyObjC's port to intel macs and universal binary 
patches.

The former is now done, the latter almost so I'll have some time to work on this 
again especially because I'm using this patch at work and might be able to claim 
some time to work on this during work-hours.

----------------------------------------------------------------------

Comment By: Georg Brandl (gbrandl)
Date: 2006-05-16 09:41

Message:
Logged In: YES 
user_id=849994

Since 2.5 beta is coming close, have you made progress on
the tests/docs?

----------------------------------------------------------------------

Comment By: Ronald Oussoren (ronaldoussoren)
Date: 2006-04-02 21:13

Message:
Logged In: YES 
user_id=580910

The "don't use the ZIP64 extension" flag is a good idea, zipfiles that use this 
extension aren't readable by the infozip tools (zip and unzip on most unix 
systems).

I'll add tests and documentation in the near future.

The version of zipfile that I'm currently using also contains a patch for 
speeding up the opening of zipfiles, for the type of files I'm dealing with 
(about 11GByte large with tens of thousands of files) the speedup is very 
significant. I suppose it's better to file that as a separate patch after this has 
been approved.

----------------------------------------------------------------------

Comment By: Anthony Baxter (anthonybaxter)
Date: 2006-04-02 07:02

Message:
Logged In: YES 
user_id=29957

I'd like to see a testcase and possibly a note for the
documentation about the new semantics. Also, should it be
possible to say "don't use the ZIP64 extension, instead
raise an Error" for people who don't want to generate these?
 

----------------------------------------------------------------------

Comment By: Ronald Oussoren (ronaldoussoren)
Date: 2006-03-09 16:28

Message:
Logged In: YES 
user_id=580910

Oops, I've uploaded the wrong file. zipfile-zip64.patch is the correct one.

I've tested the correctness of created archives using this version of pkzip:

pkzipc -version
PKZIP(R) Server  Version 8  ZIP Compression Utility for Linux X86
Copyright (C) 1989-2005 PKWARE, Inc.  All Rights Reserved. Evaluation 
Version
PKZIP Reg. U.S. Pat. and Tm. Off.  Patent No. 5,051,745
Patent Pending

Version 8.40.66


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1446489&group_id=5470

From noreply at sourceforge.net  Fri May 26 13:34:19 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Fri, 26 May 2006 04:34:19 -0700
Subject: [Patches] [ python-Patches-1346214 ] Better dead code elimination
	for the AST compiler
Message-ID: <E1FjaaB-0004nV-Kq@sc8-sf-web5.sourceforge.net>

Patches item #1346214, was opened at 2005-11-02 18:21
Message generated for change (Comment added) made by gbrandl
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1346214&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Parser/Compiler
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Rune Holm (titanstar)
Assigned to: Neal Norwitz (nnorwitz)
Summary: Better dead code elimination for the AST compiler

Initial Comment:
Here's a patch that adds dead code elimination for if
0: style statements, and improves the current dead code
elimination for while statements by not performing
elimination if the function is a generator.  If the
last yield statement from a generator is removed, the
generator is turned into a regular function, which
changes the semantics of the function.

----------------------------------------------------------------------

>Comment By: Georg Brandl (gbrandl)
Date: 2006-05-26 11:34

Message:
Logged In: YES 
user_id=849994

Attaching new patch which does elimination for if 0, if 1
and if __debug__ correctly (visiting else clauses!) and
correctly recognizes functions mixing "return x" and "yield".

----------------------------------------------------------------------

Comment By: Rune Holm (titanstar)
Date: 2005-11-06 20:41

Message:
Logged In: YES 
user_id=858364

Sorry, I'm new to the sourceforge patch tracker. The patch should be 
attached now.

----------------------------------------------------------------------

Comment By: Simon Dahlbacka (sdahlbac)
Date: 2005-11-06 19:08

Message:
Logged In: YES 
user_id=750513

the actual patch is missing..

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1346214&group_id=5470

From noreply at sourceforge.net  Fri May 26 14:00:32 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Fri, 26 May 2006 05:00:32 -0700
Subject: [Patches] [ python-Patches-1359618 ] Speed charmap encoder
Message-ID: <E1FjazY-0000JW-Bh@sc8-sf-web4-b.sourceforge.net>

Patches item #1359618, was opened at 2005-11-18 09:00
Message generated for change (Comment added) made by lemburg
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1359618&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Performance
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Martin v. L??wis (loewis)
Assigned to: M.-A. Lemburg (lemburg)
Summary: Speed charmap encoder

Initial Comment:
This patch speeds up the charmap encoder by a factor of 4 to 5, using a 
trie structure instead of a dictionary; the speedup primarily comes from 
not creating integer objects in the process.

The trie is created by inverting the encoding map; the codec generator is 
changed to drop the encoding dictionary, and instead emit a function call 
to create the trie.

----------------------------------------------------------------------

>Comment By: M.-A. Lemburg (lemburg)
Date: 2006-05-26 14:00

Message:
Logged In: YES 
user_id=38388

Hi Martin,

I've only had a quick look at the patch, but it looks nice,
so please check it in.

Don't we also have to regenerate all the codecs once this
patch has been applied ?! If so, I can take care of that
using Makefile approach in Tools/unicode/. Please let me know.

Thanks.


----------------------------------------------------------------------

Comment By: Jack Diederich (jackdied)
Date: 2006-05-26 00:45

Message:
Logged In: YES 
user_id=591932

Updated the patch as part of NeedForSpeed
(mainly Py_ssize_t changes and some rejected chunks)

Because the previous version on the trunk was
broken I'm not sure what to compare the results against ;)


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1359618&group_id=5470

From noreply at sourceforge.net  Fri May 26 14:28:31 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Fri, 26 May 2006 05:28:31 -0700
Subject: [Patches] [ python-Patches-1491759 ] IDLE L&F on MacOSX
Message-ID: <E1FjbQd-0002NW-9T@sc8-sf-web4-b.sourceforge.net>

Patches item #1491759, was opened at 2006-05-19 19:39
Message generated for change (Comment added) made by ronaldoussoren
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1491759&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: IDLE
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Ronald Oussoren (ronaldoussoren)
Assigned to: Kurt B. Kaiser (kbk)
Summary: IDLE L&F on MacOSX 

Initial Comment:
The attached patch fixes some L&F issues on MacOSX:

- IDLE now reacts to file-open AppleEvents, which means that if a user 
associates IDLE.app with .py files IDLE will open .py files when the user 
double-clicks on them

- Hide the tcl/tk console window that gets opened by default when IDLE is 
in an application bundle (that's a misfeature of aquatk)

- Patch the menu's to make sure they better conform to the HIG.

- PyShell/EditorWindow  status_bar no longer overlaps with the resize 
widget in the lower-left corner of the window

Open issues:

- When you double-click on a file and IDLE is not yet open the file will be 
opened, but IDLE will open the default shell window just above it :-(

- I'm not terribly happy with the code changes that implement the 
updated menu structure.

- The default keybindings on OSX are the windows keybindings. I haven't 
checked yet if that can be fixed programmaticly, I also haven't verified if 
the macos keybindings are fully correct for OSX.

- The general L&F is still wrong, but that isn't really IDLE's fault: tcl/tk 
doesn't fully conform to the HIG yet (dialogs without title bars, wrong 
default dinwos background, wrong widget for tabbed windows, ...).

----------------------------------------------------------------------

>Comment By: Ronald Oussoren (ronaldoussoren)
Date: 2006-05-26 14:28

Message:
Logged In: YES 
user_id=580910

I've currently worked around the default keybindings issue by copying a mac-
specific copy of config-main.def into the library directory when doing a 
framework install of python. That's obviously not a good solution, but I wouldn't 
know how to do it better.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1491759&group_id=5470

From noreply at sourceforge.net  Fri May 26 15:05:30 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Fri, 26 May 2006 06:05:30 -0700
Subject: [Patches] [ python-Patches-1494140 ] Documentation for new Struct
	object
Message-ID: <E1Fjc0Q-0001o1-UJ@sc8-sf-web3.sourceforge.net>

Patches item #1494140, was opened at 2006-05-24 05:26
Message generated for change (Comment added) made by etrepum
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1494140&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Documentation
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Bob Ippolito (etrepum)
Assigned to: Nobody/Anonymous (nobody)
Summary: Documentation for new Struct object

Initial Comment:
The performance enhancements to the struct module (patch #1493701) 
are implemented by having a Struct object, which is a compiled structure. 
This text file documents these new struct objects.


----------------------------------------------------------------------

>Comment By: Bob Ippolito (etrepum)
Date: 2006-05-26 09:05

Message:
Logged In: YES 
user_id=139309

We're going to need to revise this patch some more to document the new 
pack_to function (for Martin Blais' hotbuf work)

Additionally we'll probably also want to revise the main struct documentation to 
talk about bounds checking and avoiding the creation of long objects.


----------------------------------------------------------------------

Comment By: Bob Ippolito (etrepum)
Date: 2006-05-25 10:32

Message:
Logged In: YES 
user_id=139309

That's clearly a typo. I've attached a new version of the patch that removes those 
two letters.

----------------------------------------------------------------------

Comment By: Jim Jewett (jimjjewett)
Date: 2006-05-24 17:03

Message:
Logged In: YES 
user_id=764593

Shouldn't self.size be the number of bytes required to *pack
* the structure?  The number required to *unpack* seems 
like it ought to include tuple overhead and such...


----------------------------------------------------------------------

Comment By: Bob Ippolito (etrepum)
Date: 2006-05-24 11:35

Message:
Logged In: YES 
user_id=139309

New patch attached, fixed unpack documentation, added unpack_from method.

----------------------------------------------------------------------

Comment By: Bob Ippolito (etrepum)
Date: 2006-05-24 10:54

Message:
Logged In: YES 
user_id=139309

Hold up on this patch, I need to revise it.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1494140&group_id=5470

From noreply at sourceforge.net  Fri May 26 15:10:38 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Fri, 26 May 2006 06:10:38 -0700
Subject: [Patches] [ python-Patches-1476578 ] Add help reference on Mac
Message-ID: <E1Fjc5O-00070J-1i@sc8-sf-web5.sourceforge.net>

Patches item #1476578, was opened at 2006-04-26 03:21
Message generated for change (Comment added) made by ronaldoussoren
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1476578&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: IDLE
Group: Python 2.4
>Status: Pending
>Resolution: Fixed
Priority: 5
Submitted By: Bruce Sherwood (bsherwood)
Assigned to: Nobody/Anonymous (nobody)
Summary: Add help reference on Mac

Initial Comment:
On the Mac, you can't add a help reference in Configure
IDLE by browsing because configHelpSourceEdit.py tries
to assign into a specific entry in a tuple. 

----------------------------------------------------------------------

>Comment By: Ronald Oussoren (ronaldoussoren)
Date: 2006-05-26 15:10

Message:
Logged In: YES 
user_id=580910

I've checked in a simular patch on the trunk and release24-maint.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1476578&group_id=5470

From noreply at sourceforge.net  Fri May 26 17:07:42 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Fri, 26 May 2006 08:07:42 -0700
Subject: [Patches] [ python-Patches-1494487 ] PyUnicode_Resize cannot resize
	shared unicode object
Message-ID: <E1Fjdug-0003X8-Dv@sc8-sf-web1.sourceforge.net>

Patches item #1494487, was opened at 2006-05-24 20:24
Message generated for change (Settings changed) made by doerwalter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1494487&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.5
>Status: Closed
>Resolution: Invalid
Priority: 5
Submitted By: Hirokazu Yamamoto (ocean-city)
Assigned to: Nobody/Anonymous (nobody)
Summary: PyUnicode_Resize cannot resize shared unicode object

Initial Comment:
I found following code fails.

    PyUnicodeObject *v1 = _PyUnicode_New(0);
    PyUnicodeObject *v2 = _PyUnicode_New(0);

    _PyUnicode_Resize(&v1, 1);

    Py_DECREF(v1);
    Py_DECREF(v2);

Error message is...

SystemError:
E:\python-dev\trunk\Objects\unicodeobject.c:335: bad
argument to internal function

This happens because _PyUnicode_New(0) returns
empty_unicode, and its ob_refcnt becomes 2 on second
call. I think refcnt check bellow is not needed. Is
this right fix?

Index: Objects/unicodeobject.c
===================================================================
--- Objects/unicodeobject.c	(revision 46192)
+++ Objects/unicodeobject.c	(working copy)
@@ -331,7 +331,7 @@
 	return -1;
     }
     v = (PyUnicodeObject *)*unicode;
-    if (v == NULL || !PyUnicode_Check(v) ||
v->ob_refcnt != 1 || length < 0) {
+    if (v == NULL || !PyUnicode_Check(v) || length < 0) {
 	PyErr_BadInternalCall();
 	return -1;
     }


----------------------------------------------------------------------

>Comment By: Walter D?rwald (doerwalter)
Date: 2006-05-26 17:07

Message:
Logged In: YES 
user_id=89016

This patch opens the door for hard to detect bugs: What if
PyUnicode_Resize() gets passed an object that is *not* one
of the preallocated size 0 or size 1 strings, but has a
refcount > 1? In this case your code falls through to
unicode_resize() which happily modifies an immutable shared
object.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1494487&group_id=5470

From noreply at sourceforge.net  Fri May 26 17:43:15 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Fri, 26 May 2006 08:43:15 -0700
Subject: [Patches] [ python-Patches-1455898 ] patch for mbcs codecs
Message-ID: <E1FjeT5-0005TX-M9@sc8-sf-web5.sourceforge.net>

Patches item #1455898, was opened at 2006-03-22 08:31
Message generated for change (Comment added) made by doerwalter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1455898&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Windows
>Group: Python 2.5
Status: Open
Resolution: None
Priority: 7
Submitted By: Hirokazu Yamamoto (ocean-city)
>Assigned to: Nobody/Anonymous (nobody)
Summary: patch for mbcs codecs

Initial Comment:
Hello.

I have noticed mbcs codecs sometimes generates broken
string. I'm using Windows(Japanese) so mbcs is mapped
to cp932 (close to shift_jis)

When I run the attached script "a.zip", the entry
"Error 00007"'s message becomes broken like attached
file "b.txt".

I think this happens because the string passed to
PyUnicode_DecodeMBCS() sometimes terminates with
leading byte, and MultiByteToWideChar() counts it for
size of result string.buffer size.

I hope attached patch "mbcs.patch" may fix the problem.
It would be nice if this bug will be fixed in 2.4.3...
Thank you.


----------------------------------------------------------------------

>Comment By: Walter D?rwald (doerwalter)
Date: 2006-05-26 17:43

Message:
Logged In: YES 
user_id=89016

The change to PyUnicode_Resize() should be reverted (or done
in a way that doesn't lead to bugs).

Unfortunately I don't have a Windows where I can test the
patch, so I'm unassigning the bug.

You should probably find someone on python-dev with a
multibyte version of Windows to look at the patch.

----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-05-25 13:06

Message:
Logged In: YES 
user_id=1200846

>PyUnicode_DecodeMBCS does not support size >= INT_MAX yet,
>but probably I'll fix it too.

Done. Attached as "mbcs_win64_support.patch".

Now, total summary...

    - MBCS decoder and encoder now supports 64bit Py_ssize_t
environment. (I don't have such machine, but I checked
routine by defining NEED_RETRY and redefining INT_MAX as 2,
3, 4)

    - Fixed a bug of MBCS incremental decoder which was
originaly reported by me.


----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-05-24 22:18

Message:
Logged In: YES 
user_id=1200846

I updated the patch.

  - PyUnicode_DecodeMBCS now supports size >= INT_MAX. (I
don't have machine to test such big string, but I have
tested this routine replaced INT_MAX with 2 and 3)

PyUnicode_DecodeMBCS does not support size >= INT_MAX yet,
but probably I'll fix it too.

This patch includes Patch#1494487.


----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-05-02 13:40

Message:
Logged In: YES 
user_id=1200846

I updated the patch. (I copy and pasted "int final = 0" from
above code (ex: utf_16_ex_decode), maybe they also should be
changed for consistency?)

And one more thing, I noticed "errors" is ignored now. We
can detect invalid character if we set MB_ERR_INVALID_CHARS
flag when calling MultiByteToWideChar, but we cannot tell
where is the position of invalid character, and MSDN saids
this flag is available Win2000SP4 or later (I don't know
why)
http://msdn.microsoft.com/library/default.asp?url=/library/en-us/intl/unicode_17si.asp
So I didn't make the patch for it.


----------------------------------------------------------------------

Comment By: Walter D?rwald (doerwalter)
Date: 2006-04-25 19:22

Message:
Logged In: YES 
user_id=89016

I think the default value for final in mbcs_decode() should
be true, so that the stateless decoder detects incomplete
byte sequences too.

----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-04-07 11:10

Message:
Logged In: YES 
user_id=1200846

I have sent contributor form via postal mail. Probably you
can get it after 10 days.

----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-03-28 10:16

Message:
Logged In: YES 
user_id=1200846

You are right. I've updated the patch. (mbcs5.patch)

>>> import codecs
[20198 refs]
>>> d = codecs.getincrementaldecoder("mbcs")()
[20198 refs]
>>> d.decode('\x82\xa0\x82')
u'\u3042'
[20198 refs]
>>> d.decode('')
u''
[20198 refs]
>>> d.decode('', final=True)
u'\x00'
[20198 refs]


----------------------------------------------------------------------

Comment By: Walter D?rwald (doerwalter)
Date: 2006-03-27 18:06

Message:
Logged In: YES 
user_id=89016

_buffer_decode() in the IncrementalDecoder ignores the final
argument. IncrementalDecoder._buffer_decode() should pass on
its final argument to _codecsmodules.c::mbcs_decode(), which
should be extended to accept the final argument. Also
PyUnicode_DecodeMBCSStateful() must handle consumed == NULL
correctly (with your patch it drops trailing lead bytes even
if consumed == NULL)

----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-03-27 09:41

Message:
Logged In: YES 
user_id=1200846

I replaced tests. Probably this is better instead of
comparing the two string generated by same decoder.

----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-03-27 07:44

Message:
Logged In: YES 
user_id=1200846

My real name is Hirokazu Yamamoto. But sorry, I don't have
FAX. (It's needed to send contributor form, isn't it?)

I'll attach the patch updated for trunk. And I'll attach the
tests.

----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2006-03-26 23:05

Message:
Logged In: YES 
user_id=21627

I have reservations against this patch because of the
quasi-anonymous nature of the submission. ocean-city, can
you please state your real name? Would you also be willing
to fill out a contributor form, as shown on

http://www.python.org/psf/contrib-form.html

----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-03-24 15:02

Message:
Logged In: YES 
user_id=1200846

OK, I'll try.

----------------------------------------------------------------------

Comment By: Walter D?rwald (doerwalter)
Date: 2006-03-23 22:44

Message:
Logged In: YES 
user_id=89016

This isn't a bugfix in the strictest sense, so IMHO this
patch shouldn't go into 2.4. 

If the patch goes into 2.5, it would need the appropriate
changes to encodings/mbcs.py (i.e. the IncrementalDecoder
would have to be changed (inheriting from
BufferedIncrementalDecoder).

I realize that this patch might be hard to test, as results
are dependent on locale. Nevertheless at least some tests
would be good (even if they are only run or do something
useful on a certain locale and are skipped otherwise).

ocean-city, can you update the patch for the trunk and add
tests?


----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-03-23 03:51

Message:
Logged In: YES 
user_id=1200846

Hello. This is my final patch. (mbcs4.patch)

 - mbcs3a.patch: _mbsbtype depends on locale not system ANSI
code page. so probably it's not good to use it with
MultiByteToWideChar.

 - mbcs3b.patch: CharNext may cause buffer overflow. and
this patch always calls CharPrev but it's not needed if
string is not terminated with "potensial" lead byte.

I hope this is stable enough to commit on repositry. Thank you.


----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-03-22 15:36

Message:
Logged In: YES 
user_id=1200846

Sorry, I was stupid.

MSDN
(http://msdn.microsoft.com/library/default.asp?url=/library/en-us/intl/unicode_0o2t.asp)
saids,

> IsDBCSLeadByte can only indicate a potential lead byte value. 

IsDBCSLeadByte was returning 1 for some trail byte (ex: "???"[1])

The patch "mbcs3a.patch" worked for me, but _mbsbtype is
probably compiler specific. Is that OK?

The patch "mbcs3b.patch" also worked for me and it only uses
Win32API, but I don't have enough faith on this
implementation...


----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-03-22 11:31

Message:
Logged In: YES 
user_id=1200846

Sorry, I found problem when tried more long text file...
Please wait. I'll investigate more intensibly.

----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-03-22 11:13

Message:
Logged In: YES 
user_id=1200846

Thank you for reply. How about this? (I'm a newbie, I hope
this is right tex format but... can you confirm this? I
created this patch by copy & paste from
PyUnicode_DecodeUTF16Stateful and some modification)


----------------------------------------------------------------------

Comment By: M.-A. Lemburg (lemburg)
Date: 2006-03-22 10:12

Message:
Logged In: YES 
user_id=38388

One more nit: the doc patch is missing. Please add a patch
for the API docs.


----------------------------------------------------------------------

Comment By: M.-A. Lemburg (lemburg)
Date: 2006-03-22 10:11

Message:
Logged In: YES 
user_id=38388

As I understand your comment, the mbcs codec will have a
problem if the input string terminates with a lead byte.

Could you add a comment regarding this to the patch ?!

I can't test the patch, since I don't have a Japanese
Windows to check on, but from looking at the patch, it seems OK.


----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-03-22 08:42

Message:
Logged In: YES 
user_id=1200846

I forgot to mention this. "mbcs.patch" is for
release24-maint branch.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1455898&group_id=5470

From noreply at sourceforge.net  Fri May 26 19:02:45 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Fri, 26 May 2006 10:02:45 -0700
Subject: [Patches] [ python-Patches-1490384 ] PC new-logo-based icon set
Message-ID: <E1Fjfi0-0007vb-Eb@sc8-sf-web5.sourceforge.net>

Patches item #1490384, was opened at 2006-05-17 18:59
Message generated for change (Comment added) made by loewis
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1490384&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Windows
Group: Python 2.5
Status: Open
Resolution: Accepted
Priority: 8
Submitted By: Andrew Clover (bobince)
Assigned to: Martin v. L??wis (loewis)
Summary: PC new-logo-based icon set

Initial Comment:
Following positive discussion on -dev, here's the
updated version of the PC/py*.ico files I hacked up a
while ago.

The attachment is a ZIP, not a patch, as it contains
only binaries. Also available as tgz:

  http://doxdesk.com/img/software/py/win32-icons.tar.gz

Also possibly of interest:

  http://doxdesk.com/img/software/py/icons3.zip

This attachment contains only the simple replacement
files; the icons3 ZIP also contains:

  - source
  - versions including Windows Vista large icons
    (probably not worth including at this point as they're
    quite sizable and no-one is using Vista yet)
  - an egg icon
    (there is currently no installer/shell support for
eggs,
    but could be worth adding in future)
  - a new installer side banner
    (this has not currently seen any discussion on -dev,
    but may be worth considering if the intention is to
    leave behind the purple/green snake branding)


----------------------------------------------------------------------

>Comment By: Martin v. L??wis (loewis)
Date: 2006-05-26 19:02

Message:
Logged In: YES 
user_id=21627

Ok, I will then do the following changes still:
- add baselogo.svg and source.xar (ignore all the other files),
- remove the attribution for Erik (sorry for missing that)

----------------------------------------------------------------------

Comment By: Andrew Clover (bobince)
Date: 2006-05-26 04:50

Message:
Logged In: YES 
user_id=311085

> I put a demo installer containing them

Seems to work OK. The thanks at the end still attributes the
graphic to Erik though; I'm not after an ack there myself,
but changing the text to not imply the current graphic is
his one may be appropriate.

> baselogo.svg; I assume this is a source file

Yes. This is just the Python logo itself (the gradient
version as used on the new website), in vector format.

> icons.svgz; can't figure out what this is

Same as source.xar, but exported as W3C standard SVG format
for wider compatibility [compressed, hence the 'z'].

Unfortunately because SVG cannot reproduce some of effects
used, and because the SVG export path is currently quite
bad, it's not really directly usable, but it might be of use
to anyone who wants to hack on the graphics but doesn't use
Xara.

> source.xar; not sure either

This is the primary vector graphics source of the icons -
the other SVG and PNG files are just there because other
people requested them.

It's in Xara format, a previously proprietary graphics
application which has now gone open-source and is heading
rapidly towards being usable on Linux, but isn't quite there
yet.

> a directory called png, with many png file - I expect
> that these aren't source files, are they?

Nope, they're just exactly the same content as in the
(with-vista) .ico files, just supplied as PNG for anyone who
wants to fiddle with them in a more accessible bitmap format.


----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2006-05-22 10:56

Message:
Logged In: YES 
user_id=21627

Thanks for the patch. I have committed it as r46063. I put a
demo installer containing them at

http://www.dcl.hpi.uni-potsdam.de/home/loewis/python-2.5.13290.msi

I would also like to add the source files, but I have
difficulties figuring out what they are. There is a source
directory; with:

- baselogo.svg; I assume this is a source file
- icons.svgz; can't figure out what this is
- source.xar; not sure either
- a directory called png, with many png file - I expect
  that these aren't source files, are they?

----------------------------------------------------------------------

Comment By: Andrew Clover (bobince)
Date: 2006-05-19 16:21

Message:
Logged In: YES 
user_id=311085

Sure, no worries. I'll fax over the -python version since I
have ancient contributions to cover too.


----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2006-05-19 13:22

Message:
Logged In: YES 
user_id=21627

Thanks! Are you willing to contribute them to the PSF, under
the terms of the contributor agreement at

http://www.python.org/psf/contrib/contrib-form/

?

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1490384&group_id=5470

From noreply at sourceforge.net  Fri May 26 19:35:35 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Fri, 26 May 2006 10:35:35 -0700
Subject: [Patches] [ python-Patches-921466 ] Reduce number of open calls on
	startup
Message-ID: <E1FjgDn-0006Wt-Ut@sc8-sf-web2.sourceforge.net>

Patches item #921466, was opened at 2004-03-23 01:10
Message generated for change (Comment added) made by loewis
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=921466&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: None
Status: Open
>Resolution: Accepted
Priority: 5
Submitted By: Martin v. L??wis (loewis)
>Assigned to: Georg Brandl (gbrandl)
Summary: Reduce number of open calls on startup

Initial Comment:
This patch uses sys.path_importer_cache to reduce the
number of open calls, in the following way:
- if the value in path_importer_cache is None, it stats
the path to find out whether the file exists
- it then puts True/False into path_importer_cache
- if the value in path_importer_cache is False, the
path entry is skipped on all imports
- if the value is True, the stat call is skipped, and
open calls for files in the directory are made.

On Linux, this reduces the number of open calls for an
empty script from 343 to 263. The startup-time (for 100
interpreter invocations) goes down by one percent (from
0.0819s to 0.08113s per invocation).

----------------------------------------------------------------------

>Comment By: Martin v. L??wis (loewis)
Date: 2006-05-26 19:35

Message:
Logged In: YES 
user_id=21627

Your revised patch looks fine to me, so please apply.

----------------------------------------------------------------------

Comment By: Neal Norwitz (nnorwitz)
Date: 2006-05-26 08:38

Message:
Logged In: YES 
user_id=33168

Without looking at the patch impl, I'm +1 on the idea of
reducing stat/open calls.  On NFS this is a huge time sync.

----------------------------------------------------------------------

Comment By: Georg Brandl (gbrandl)
Date: 2006-05-25 20:47

Message:
Logged In: YES 
user_id=849994

I reviewed this patch, in in consequence discovered a
problem with the sys.path_hooks machinery, described in
http://mail.python.org/pipermail/python-dev/2006-May/065173.html

This patch fixes the problem and corrects the original patch
to not set any sys.path_importer_cache entry to True or
False when no import hooks are enabled (the p_loader
argument to find_module is NULL then).

----------------------------------------------------------------------

Comment By: Georg Brandl (gbrandl)
Date: 2006-02-25 20:55

Message:
Logged In: YES 
user_id=849994

I'm very much for it. I haven't got too much RAM, and
whenever I start a Python program (emerge being the most
prominent example) after having worked heavily with e.g.
graphics or VMware, I'm hit by the files Python's opening
not being in the file cache anymore.

----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2006-02-20 22:51

Message:
Logged In: YES 
user_id=21627

Not sure. Anybody speaking in favour? against?

----------------------------------------------------------------------

Comment By: Georg Brandl (birkenfeld)
Date: 2006-02-20 11:42

Message:
Logged In: YES 
user_id=1188172

Can this go into 2.5?

----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2004-03-23 16:43

Message:
Logged In: YES 
user_id=21627

It's certainly the case that the system has cached all files needed for 
startup in memory, including the directory contents of all directories 
searched.

OTOH, I assume that is the scenario in which people worry about startup 
time: high-frequency invocations of python. For a single invocation, it 
shouldn't matter much whether it takes 0.04s or 0.08s.

----------------------------------------------------------------------

Comment By: Raymond Hettinger (rhettinger)
Date: 2004-03-23 08:30

Message:
Logged In: YES 
user_id=80475

I am surprised that making 25% fewer open calls doesn't save
more than 1% in startup time.

One other thought, I wonder if the timing of these changes
is affected by the OS keeping recently loaded files in
buffers so that disk access time not included.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=921466&group_id=5470

From noreply at sourceforge.net  Fri May 26 20:07:43 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Fri, 26 May 2006 11:07:43 -0700
Subject: [Patches] [ python-Patches-1494554 ] Numeric characters not
	recognized.
Message-ID: <E1Fjgit-0006t7-HK@sc8-sf-web5.sourceforge.net>

Patches item #1494554, was opened at 2006-05-24 21:54
Message generated for change (Comment added) made by loewis
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1494554&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Anders Chrigstr?m (andersch)
Assigned to: Nobody/Anonymous (nobody)
Summary: Numeric characters not recognized.

Initial Comment:
unicode.isnumeric() and unicodedata.numeric() fails to
recognize a bunch of numeric unicode characters.

The patch fixes this.


----------------------------------------------------------------------

>Comment By: Martin v. L??wis (loewis)
Date: 2006-05-26 20:07

Message:
Logged In: YES 
user_id=21627

Which version of the Unicode database is this based on?

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1494554&group_id=5470

From noreply at sourceforge.net  Fri May 26 20:41:45 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Fri, 26 May 2006 11:41:45 -0700
Subject: [Patches] [ python-Patches-1145039 ] Remove some invariant
	conditions and assert in ceval
Message-ID: <E1FjhFp-0003QJ-AX@sc8-sf-web4-b.sourceforge.net>

Patches item #1145039, was opened at 2005-02-20 16:31
Message generated for change (Comment added) made by tim_one
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1145039&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Performance
Group: Python 2.5
Status: Open
>Resolution: Out of Date
Priority: 5
Submitted By: Neal Norwitz (nnorwitz)
Assigned to: Nobody/Anonymous (nobody)
Summary: Remove some invariant conditions and assert in ceval

Initial Comment:
ISTM that if frame->f_exc_type == NULL then exc_value
and exc_traceback will also be NULL.  I didn't see that
this is documented, perhaps I missed it or there is
some case when this can occur.  If it can occur, we
shoul develop a test for it.

Assuming this condition is invariant, some
simplifications can be made in reset_exc_info which is
called once per eval_frame (on function exit).

Also, I think there is currently an extra Py_INCREF on
Py_None.  This occurs when tstate->exc_type == NULL.

This patch seems to have little to no effect on
performance.  I did measure a 0.3% speed improvement.

----------------------------------------------------------------------

>Comment By: Tim Peters (tim_one)
Date: 2006-05-26 14:41

Message:
Logged In: YES 
user_id=31435

Note that the patch is out of date.

I agree the invariant you deduced should hold, but in fact
it doesn't now, at least due to insane initialization
problems in exceptions.c:

http://mail.python.org/pipermail/python-dev/2006-May/065248.html

I'd like to ensure & exploit a stronger invariant:

http://mail.python.org/pipermail/python-dev/2006-May/065231.html

but that's stuck for now.  I put my work in progress on a
new branch:

svn+ssh://svn.python.org/python/branches/tim-exc_sanity

BTW, I don't agree that the incref on Py_None wasn't needed.
 Py_None is getting assigned to two new pointers
(tstate->exc_type and frame->f_exc_type), so should be
incremented twice.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1145039&group_id=5470

From noreply at sourceforge.net  Fri May 26 20:57:28 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Fri, 26 May 2006 11:57:28 -0700
Subject: [Patches] [ python-Patches-1495675 ] Remove types.InstanceType and
	new.instance
Message-ID: <E1FjhV2-00041z-8A@sc8-sf-web2.sourceforge.net>

Patches item #1495675, was opened at 2006-05-26 14:57
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1495675&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 3000
Status: Open
Resolution: None
Priority: 5
Submitted By: Collin Winter (collinwinter)
Assigned to: Nobody/Anonymous (nobody)
Summary: Remove types.InstanceType and new.instance

Initial Comment:
Remove types.InstanceType and new.instance, since
"instances" are a Python 2.x concept.

This patch is against SVN r46062 and includes doc
patches for Doc/lib/libtypes.tex and Doc/lib/libnew.tex.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1495675&group_id=5470

From noreply at sourceforge.net  Fri May 26 20:58:01 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Fri, 26 May 2006 11:58:01 -0700
Subject: [Patches] [ python-Patches-1495675 ] Remove types.InstanceType and
	new.instance
Message-ID: <E1FjhVZ-0004C9-H8@sc8-sf-web1.sourceforge.net>

Patches item #1495675, was opened at 2006-05-26 14:57
Message generated for change (Settings changed) made by collinwinter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1495675&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 3000
Status: Open
Resolution: None
Priority: 5
Submitted By: Collin Winter (collinwinter)
>Assigned to: Guido van Rossum (gvanrossum)
Summary: Remove types.InstanceType and new.instance

Initial Comment:
Remove types.InstanceType and new.instance, since
"instances" are a Python 2.x concept.

This patch is against SVN r46062 and includes doc
patches for Doc/lib/libtypes.tex and Doc/lib/libnew.tex.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1495675&group_id=5470

From noreply at sourceforge.net  Fri May 26 21:12:39 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Fri, 26 May 2006 12:12:39 -0700
Subject: [Patches] [ python-Patches-1495675 ] Remove types.InstanceType and
	new.instance
Message-ID: <E1Fjhjj-0001TT-6A@sc8-sf-web1.sourceforge.net>

Patches item #1495675, was opened at 2006-05-26 14:57
Message generated for change (Comment added) made by gvanrossum
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1495675&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 3000
>Status: Closed
>Resolution: Duplicate
Priority: 5
Submitted By: Collin Winter (collinwinter)
Assigned to: Guido van Rossum (gvanrossum)
Summary: Remove types.InstanceType and new.instance

Initial Comment:
Remove types.InstanceType and new.instance, since
"instances" are a Python 2.x concept.

This patch is against SVN r46062 and includes doc
patches for Doc/lib/libtypes.tex and Doc/lib/libnew.tex.

----------------------------------------------------------------------

>Comment By: Guido van Rossum (gvanrossum)
Date: 2006-05-26 15:12

Message:
Logged In: YES 
user_id=6380

Thanks - checked in!

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1495675&group_id=5470

From noreply at sourceforge.net  Fri May 26 21:12:49 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Fri, 26 May 2006 12:12:49 -0700
Subject: [Patches] [ python-Patches-1495675 ] Remove types.InstanceType and
	new.instance
Message-ID: <E1Fjhjt-000611-7z@sc8-sf-web5.sourceforge.net>

Patches item #1495675, was opened at 2006-05-26 14:57
Message generated for change (Comment added) made by gvanrossum
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1495675&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 3000
Status: Closed
>Resolution: Accepted
Priority: 5
Submitted By: Collin Winter (collinwinter)
Assigned to: Guido van Rossum (gvanrossum)
Summary: Remove types.InstanceType and new.instance

Initial Comment:
Remove types.InstanceType and new.instance, since
"instances" are a Python 2.x concept.

This patch is against SVN r46062 and includes doc
patches for Doc/lib/libtypes.tex and Doc/lib/libnew.tex.

----------------------------------------------------------------------

>Comment By: Guido van Rossum (gvanrossum)
Date: 2006-05-26 15:12

Message:
Logged In: YES 
user_id=6380

Thanks - checked in!

----------------------------------------------------------------------

Comment By: Guido van Rossum (gvanrossum)
Date: 2006-05-26 15:12

Message:
Logged In: YES 
user_id=6380

Thanks - checked in!

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1495675&group_id=5470

From noreply at sourceforge.net  Fri May 26 21:21:00 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Fri, 26 May 2006 12:21:00 -0700
Subject: [Patches] [ python-Patches-1494554 ] Numeric characters not
	recognized.
Message-ID: <E1Fjhro-00019q-Q7@sc8-sf-web4-b.sourceforge.net>

Patches item #1494554, was opened at 2006-05-24 21:54
Message generated for change (Comment added) made by andersch
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1494554&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Anders Chrigstr?m (andersch)
Assigned to: Nobody/Anonymous (nobody)
Summary: Numeric characters not recognized.

Initial Comment:
unicode.isnumeric() and unicodedata.numeric() fails to
recognize a bunch of numeric unicode characters.

The patch fixes this.


----------------------------------------------------------------------

>Comment By: Anders Chrigstr?m (andersch)
Date: 2006-05-26 21:20

Message:
Logged In: YES 
user_id=621306

The patch makes it match version 4.1.0. Though it didn't match
version 3.2.0 either.


----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2006-05-26 20:07

Message:
Logged In: YES 
user_id=21627

Which version of the Unicode database is this based on?

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1494554&group_id=5470

From noreply at sourceforge.net  Fri May 26 22:17:03 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Fri, 26 May 2006 13:17:03 -0700
Subject: [Patches] [ python-Patches-1492218 ] None missing from keyword
	module
Message-ID: <E1Fjik3-0005h1-6J@sc8-sf-web4-b.sourceforge.net>

Patches item #1492218, was opened at 2006-05-20 20:43
Message generated for change (Comment added) made by gbrandl
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1492218&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Documentation
Group: None
>Status: Closed
>Resolution: Fixed
Priority: 5
Submitted By: ?iga Seilnacht (zseil)
Assigned to: Nobody/Anonymous (nobody)
Summary: None missing from keyword module

Initial Comment:
None became a keyword in Python 2.4, but this is
not evident from the Python/gramminit.c file. As
a consequence, None is not included in the
keyword module when you regenerate it.

This patch also includes documentation fixes (None
was missing from keywords section in reference manual)
and fixes for syntax highliting for Idle and Vim.
python-mode.el already treats None, True and False
differently, so I didn't try to change it.


----------------------------------------------------------------------

>Comment By: Georg Brandl (gbrandl)
Date: 2006-05-26 20:17

Message:
Logged In: YES 
user_id=849994

Committed your patches in rev. 46411, 46412. Note that the
optional text in \versionchanged mustn't have a trailing
period though...

----------------------------------------------------------------------

Comment By: ?iga Seilnacht (zseil)
Date: 2006-05-22 13:13

Message:
Logged In: YES 
user_id=1326842

Attaching a new set of patches. Since they only affect
the documentation, I also changed the category. The
patch against the trunk also includes a note that
using "as" and "with" as identifiers will issue a
warning.

----------------------------------------------------------------------

Comment By: ?iga Seilnacht (zseil)
Date: 2006-05-22 11:04

Message:
Logged In: YES 
user_id=1326842

I realise that None is a constant, not a keyword.
Could at least the documentation be changed?
Currently the reference manual says:

"The following identifiers are used as reserved words, or
keywords of the language, and cannot be used as ordinary
identifiers."

A list that doesn't include None follows, but as your
example shows, None also can't be used as an ordinary
identifier.
Later on that page:

"In some future version of Python, the identifier None
will become a keyword."

See:
http://docs.python.org/dev/ref/keywords.html

----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2006-05-22 09:26

Message:
Logged In: YES 
user_id=21627

None is not a keyword. Watch this:

>>> def None():pass
SyntaxError: assignment to None
>>> def while():pass
SyntaxError: invalid syntax
>>> 

None remains an identifier, but assignments to None are not
allowed.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1492218&group_id=5470

From noreply at sourceforge.net  Fri May 26 22:42:55 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Fri, 26 May 2006 13:42:55 -0700
Subject: [Patches] [ python-Patches-1494554 ] Numeric characters not
	recognized.
Message-ID: <E1Fjj95-0005r4-Cl@sc8-sf-web3.sourceforge.net>

Patches item #1494554, was opened at 2006-05-24 21:54
Message generated for change (Comment added) made by lemburg
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1494554&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Anders Chrigstr?m (andersch)
Assigned to: Nobody/Anonymous (nobody)
Summary: Numeric characters not recognized.

Initial Comment:
unicode.isnumeric() and unicodedata.numeric() fails to
recognize a bunch of numeric unicode characters.

The patch fixes this.


----------------------------------------------------------------------

>Comment By: M.-A. Lemburg (lemburg)
Date: 2006-05-26 22:42

Message:
Logged In: YES 
user_id=38388

Rather than creating a patch for every new version, how
about extracting the relevant data from the Unicode database
using a script and putting that into Tools/unicode/ ?!

Note that the original version was also generated from the
database. Unfortunately, I can't find that script anymore.

One nit with the patch: it should put non-BMP Unicode code
points into #ifdef Py_UNICODE_WIDE ... #endif clauses.


----------------------------------------------------------------------

Comment By: Anders Chrigstr?m (andersch)
Date: 2006-05-26 21:20

Message:
Logged In: YES 
user_id=621306

The patch makes it match version 4.1.0. Though it didn't match
version 3.2.0 either.


----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2006-05-26 20:07

Message:
Logged In: YES 
user_id=21627

Which version of the Unicode database is this based on?

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1494554&group_id=5470

From noreply at sourceforge.net  Fri May 26 23:18:01 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Fri, 26 May 2006 14:18:01 -0700
Subject: [Patches] [ python-Patches-1455898 ] patch for mbcs codecs
Message-ID: <E1Fjjh3-0003YP-OG@sc8-sf-web3.sourceforge.net>

Patches item #1455898, was opened at 2006-03-22 16:31
Message generated for change (Comment added) made by ocean-city
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1455898&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Windows
Group: Python 2.5
Status: Open
Resolution: None
Priority: 7
Submitted By: Hirokazu Yamamoto (ocean-city)
Assigned to: Nobody/Anonymous (nobody)
Summary: patch for mbcs codecs

Initial Comment:
Hello.

I have noticed mbcs codecs sometimes generates broken
string. I'm using Windows(Japanese) so mbcs is mapped
to cp932 (close to shift_jis)

When I run the attached script "a.zip", the entry
"Error 00007"'s message becomes broken like attached
file "b.txt".

I think this happens because the string passed to
PyUnicode_DecodeMBCS() sometimes terminates with
leading byte, and MultiByteToWideChar() counts it for
size of result string.buffer size.

I hope attached patch "mbcs.patch" may fix the problem.
It would be nice if this bug will be fixed in 2.4.3...
Thank you.


----------------------------------------------------------------------

>Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-05-27 06:18

Message:
Logged In: YES 
user_id=1200846

>The change to PyUnicode_Resize() should be reverted (or done
>in a way that doesn't lead to bugs).

Sorry, how about this?

Index: Objects/unicodeobject.c
===================================================================
--- Objects/unicodeobject.c	(revision 46417)
+++ Objects/unicodeobject.c	(working copy)
@@ -326,7 +326,7 @@
 	return -1;
     }
     v = (PyUnicodeObject *)*unicode;
-    if (v == NULL || !PyUnicode_Check(v) || v->ob_refcnt !=
1 || length < 0) {
+    if (v == NULL || !PyUnicode_Check(v) || length < 0) {
 	PyErr_BadInternalCall();
 	return -1;
     }
@@ -335,7 +335,7 @@
        possible since these are being shared. We simply
return a fresh
        copy with the same Unicode content. */
     if (v->length != length &&
-	(v == unicode_empty || v->length == 1)) {
+	(v == unicode_empty || v->length == 1 || v->ob_refcnt != 1)) {
 	PyUnicodeObject *w = _PyUnicode_New(length);
 	if (w == NULL)
 	    return -1;


----------------------------------------------------------------------

Comment By: Walter D?rwald (doerwalter)
Date: 2006-05-27 00:43

Message:
Logged In: YES 
user_id=89016

The change to PyUnicode_Resize() should be reverted (or done
in a way that doesn't lead to bugs).

Unfortunately I don't have a Windows where I can test the
patch, so I'm unassigning the bug.

You should probably find someone on python-dev with a
multibyte version of Windows to look at the patch.

----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-05-25 20:06

Message:
Logged In: YES 
user_id=1200846

>PyUnicode_DecodeMBCS does not support size >= INT_MAX yet,
>but probably I'll fix it too.

Done. Attached as "mbcs_win64_support.patch".

Now, total summary...

    - MBCS decoder and encoder now supports 64bit Py_ssize_t
environment. (I don't have such machine, but I checked
routine by defining NEED_RETRY and redefining INT_MAX as 2,
3, 4)

    - Fixed a bug of MBCS incremental decoder which was
originaly reported by me.


----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-05-25 05:18

Message:
Logged In: YES 
user_id=1200846

I updated the patch.

  - PyUnicode_DecodeMBCS now supports size >= INT_MAX. (I
don't have machine to test such big string, but I have
tested this routine replaced INT_MAX with 2 and 3)

PyUnicode_DecodeMBCS does not support size >= INT_MAX yet,
but probably I'll fix it too.

This patch includes Patch#1494487.


----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-05-02 20:40

Message:
Logged In: YES 
user_id=1200846

I updated the patch. (I copy and pasted "int final = 0" from
above code (ex: utf_16_ex_decode), maybe they also should be
changed for consistency?)

And one more thing, I noticed "errors" is ignored now. We
can detect invalid character if we set MB_ERR_INVALID_CHARS
flag when calling MultiByteToWideChar, but we cannot tell
where is the position of invalid character, and MSDN saids
this flag is available Win2000SP4 or later (I don't know
why)
http://msdn.microsoft.com/library/default.asp?url=/library/en-us/intl/unicode_17si.asp
So I didn't make the patch for it.


----------------------------------------------------------------------

Comment By: Walter D?rwald (doerwalter)
Date: 2006-04-26 02:22

Message:
Logged In: YES 
user_id=89016

I think the default value for final in mbcs_decode() should
be true, so that the stateless decoder detects incomplete
byte sequences too.

----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-04-07 18:10

Message:
Logged In: YES 
user_id=1200846

I have sent contributor form via postal mail. Probably you
can get it after 10 days.

----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-03-28 17:16

Message:
Logged In: YES 
user_id=1200846

You are right. I've updated the patch. (mbcs5.patch)

>>> import codecs
[20198 refs]
>>> d = codecs.getincrementaldecoder("mbcs")()
[20198 refs]
>>> d.decode('\x82\xa0\x82')
u'\u3042'
[20198 refs]
>>> d.decode('')
u''
[20198 refs]
>>> d.decode('', final=True)
u'\x00'
[20198 refs]


----------------------------------------------------------------------

Comment By: Walter D?rwald (doerwalter)
Date: 2006-03-28 01:06

Message:
Logged In: YES 
user_id=89016

_buffer_decode() in the IncrementalDecoder ignores the final
argument. IncrementalDecoder._buffer_decode() should pass on
its final argument to _codecsmodules.c::mbcs_decode(), which
should be extended to accept the final argument. Also
PyUnicode_DecodeMBCSStateful() must handle consumed == NULL
correctly (with your patch it drops trailing lead bytes even
if consumed == NULL)

----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-03-27 16:41

Message:
Logged In: YES 
user_id=1200846

I replaced tests. Probably this is better instead of
comparing the two string generated by same decoder.

----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-03-27 14:44

Message:
Logged In: YES 
user_id=1200846

My real name is Hirokazu Yamamoto. But sorry, I don't have
FAX. (It's needed to send contributor form, isn't it?)

I'll attach the patch updated for trunk. And I'll attach the
tests.

----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2006-03-27 06:05

Message:
Logged In: YES 
user_id=21627

I have reservations against this patch because of the
quasi-anonymous nature of the submission. ocean-city, can
you please state your real name? Would you also be willing
to fill out a contributor form, as shown on

http://www.python.org/psf/contrib-form.html

----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-03-24 23:02

Message:
Logged In: YES 
user_id=1200846

OK, I'll try.

----------------------------------------------------------------------

Comment By: Walter D?rwald (doerwalter)
Date: 2006-03-24 06:44

Message:
Logged In: YES 
user_id=89016

This isn't a bugfix in the strictest sense, so IMHO this
patch shouldn't go into 2.4. 

If the patch goes into 2.5, it would need the appropriate
changes to encodings/mbcs.py (i.e. the IncrementalDecoder
would have to be changed (inheriting from
BufferedIncrementalDecoder).

I realize that this patch might be hard to test, as results
are dependent on locale. Nevertheless at least some tests
would be good (even if they are only run or do something
useful on a certain locale and are skipped otherwise).

ocean-city, can you update the patch for the trunk and add
tests?


----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-03-23 11:51

Message:
Logged In: YES 
user_id=1200846

Hello. This is my final patch. (mbcs4.patch)

 - mbcs3a.patch: _mbsbtype depends on locale not system ANSI
code page. so probably it's not good to use it with
MultiByteToWideChar.

 - mbcs3b.patch: CharNext may cause buffer overflow. and
this patch always calls CharPrev but it's not needed if
string is not terminated with "potensial" lead byte.

I hope this is stable enough to commit on repositry. Thank you.


----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-03-22 23:36

Message:
Logged In: YES 
user_id=1200846

Sorry, I was stupid.

MSDN
(http://msdn.microsoft.com/library/default.asp?url=/library/en-us/intl/unicode_0o2t.asp)
saids,

> IsDBCSLeadByte can only indicate a potential lead byte value. 

IsDBCSLeadByte was returning 1 for some trail byte (ex: "???"[1])

The patch "mbcs3a.patch" worked for me, but _mbsbtype is
probably compiler specific. Is that OK?

The patch "mbcs3b.patch" also worked for me and it only uses
Win32API, but I don't have enough faith on this
implementation...


----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-03-22 19:31

Message:
Logged In: YES 
user_id=1200846

Sorry, I found problem when tried more long text file...
Please wait. I'll investigate more intensibly.

----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-03-22 19:13

Message:
Logged In: YES 
user_id=1200846

Thank you for reply. How about this? (I'm a newbie, I hope
this is right tex format but... can you confirm this? I
created this patch by copy & paste from
PyUnicode_DecodeUTF16Stateful and some modification)


----------------------------------------------------------------------

Comment By: M.-A. Lemburg (lemburg)
Date: 2006-03-22 18:12

Message:
Logged In: YES 
user_id=38388

One more nit: the doc patch is missing. Please add a patch
for the API docs.


----------------------------------------------------------------------

Comment By: M.-A. Lemburg (lemburg)
Date: 2006-03-22 18:11

Message:
Logged In: YES 
user_id=38388

As I understand your comment, the mbcs codec will have a
problem if the input string terminates with a lead byte.

Could you add a comment regarding this to the patch ?!

I can't test the patch, since I don't have a Japanese
Windows to check on, but from looking at the patch, it seems OK.


----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-03-22 16:42

Message:
Logged In: YES 
user_id=1200846

I forgot to mention this. "mbcs.patch" is for
release24-maint branch.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1455898&group_id=5470

From noreply at sourceforge.net  Fri May 26 23:40:07 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Fri, 26 May 2006 14:40:07 -0700
Subject: [Patches] [ python-Patches-1455898 ] patch for mbcs codecs
Message-ID: <E1Fjk2R-0005e5-Lz@sc8-sf-web4-b.sourceforge.net>

Patches item #1455898, was opened at 2006-03-22 16:31
Message generated for change (Comment added) made by ocean-city
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1455898&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Windows
Group: Python 2.5
Status: Open
Resolution: None
Priority: 7
Submitted By: Hirokazu Yamamoto (ocean-city)
Assigned to: Nobody/Anonymous (nobody)
Summary: patch for mbcs codecs

Initial Comment:
Hello.

I have noticed mbcs codecs sometimes generates broken
string. I'm using Windows(Japanese) so mbcs is mapped
to cp932 (close to shift_jis)

When I run the attached script "a.zip", the entry
"Error 00007"'s message becomes broken like attached
file "b.txt".

I think this happens because the string passed to
PyUnicode_DecodeMBCS() sometimes terminates with
leading byte, and MultiByteToWideChar() counts it for
size of result string.buffer size.

I hope attached patch "mbcs.patch" may fix the problem.
It would be nice if this bug will be fixed in 2.4.3...
Thank you.


----------------------------------------------------------------------

>Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-05-27 06:40

Message:
Logged In: YES 
user_id=1200846

I reverted PyUnicode_Resize() patch for now, and recreated
the patch as "mbcs.patch".

>You should probably find someone on python-dev with a
>multibyte version of Windows to look at the patch.

OK, I will.

----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-05-27 06:18

Message:
Logged In: YES 
user_id=1200846

>The change to PyUnicode_Resize() should be reverted (or done
>in a way that doesn't lead to bugs).

Sorry, how about this?

Index: Objects/unicodeobject.c
===================================================================
--- Objects/unicodeobject.c	(revision 46417)
+++ Objects/unicodeobject.c	(working copy)
@@ -326,7 +326,7 @@
 	return -1;
     }
     v = (PyUnicodeObject *)*unicode;
-    if (v == NULL || !PyUnicode_Check(v) || v->ob_refcnt !=
1 || length < 0) {
+    if (v == NULL || !PyUnicode_Check(v) || length < 0) {
 	PyErr_BadInternalCall();
 	return -1;
     }
@@ -335,7 +335,7 @@
        possible since these are being shared. We simply
return a fresh
        copy with the same Unicode content. */
     if (v->length != length &&
-	(v == unicode_empty || v->length == 1)) {
+	(v == unicode_empty || v->length == 1 || v->ob_refcnt != 1)) {
 	PyUnicodeObject *w = _PyUnicode_New(length);
 	if (w == NULL)
 	    return -1;


----------------------------------------------------------------------

Comment By: Walter D?rwald (doerwalter)
Date: 2006-05-27 00:43

Message:
Logged In: YES 
user_id=89016

The change to PyUnicode_Resize() should be reverted (or done
in a way that doesn't lead to bugs).

Unfortunately I don't have a Windows where I can test the
patch, so I'm unassigning the bug.

You should probably find someone on python-dev with a
multibyte version of Windows to look at the patch.

----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-05-25 20:06

Message:
Logged In: YES 
user_id=1200846

>PyUnicode_DecodeMBCS does not support size >= INT_MAX yet,
>but probably I'll fix it too.

Done. Attached as "mbcs_win64_support.patch".

Now, total summary...

    - MBCS decoder and encoder now supports 64bit Py_ssize_t
environment. (I don't have such machine, but I checked
routine by defining NEED_RETRY and redefining INT_MAX as 2,
3, 4)

    - Fixed a bug of MBCS incremental decoder which was
originaly reported by me.


----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-05-25 05:18

Message:
Logged In: YES 
user_id=1200846

I updated the patch.

  - PyUnicode_DecodeMBCS now supports size >= INT_MAX. (I
don't have machine to test such big string, but I have
tested this routine replaced INT_MAX with 2 and 3)

PyUnicode_DecodeMBCS does not support size >= INT_MAX yet,
but probably I'll fix it too.

This patch includes Patch#1494487.


----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-05-02 20:40

Message:
Logged In: YES 
user_id=1200846

I updated the patch. (I copy and pasted "int final = 0" from
above code (ex: utf_16_ex_decode), maybe they also should be
changed for consistency?)

And one more thing, I noticed "errors" is ignored now. We
can detect invalid character if we set MB_ERR_INVALID_CHARS
flag when calling MultiByteToWideChar, but we cannot tell
where is the position of invalid character, and MSDN saids
this flag is available Win2000SP4 or later (I don't know
why)
http://msdn.microsoft.com/library/default.asp?url=/library/en-us/intl/unicode_17si.asp
So I didn't make the patch for it.


----------------------------------------------------------------------

Comment By: Walter D?rwald (doerwalter)
Date: 2006-04-26 02:22

Message:
Logged In: YES 
user_id=89016

I think the default value for final in mbcs_decode() should
be true, so that the stateless decoder detects incomplete
byte sequences too.

----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-04-07 18:10

Message:
Logged In: YES 
user_id=1200846

I have sent contributor form via postal mail. Probably you
can get it after 10 days.

----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-03-28 17:16

Message:
Logged In: YES 
user_id=1200846

You are right. I've updated the patch. (mbcs5.patch)

>>> import codecs
[20198 refs]
>>> d = codecs.getincrementaldecoder("mbcs")()
[20198 refs]
>>> d.decode('\x82\xa0\x82')
u'\u3042'
[20198 refs]
>>> d.decode('')
u''
[20198 refs]
>>> d.decode('', final=True)
u'\x00'
[20198 refs]


----------------------------------------------------------------------

Comment By: Walter D?rwald (doerwalter)
Date: 2006-03-28 01:06

Message:
Logged In: YES 
user_id=89016

_buffer_decode() in the IncrementalDecoder ignores the final
argument. IncrementalDecoder._buffer_decode() should pass on
its final argument to _codecsmodules.c::mbcs_decode(), which
should be extended to accept the final argument. Also
PyUnicode_DecodeMBCSStateful() must handle consumed == NULL
correctly (with your patch it drops trailing lead bytes even
if consumed == NULL)

----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-03-27 16:41

Message:
Logged In: YES 
user_id=1200846

I replaced tests. Probably this is better instead of
comparing the two string generated by same decoder.

----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-03-27 14:44

Message:
Logged In: YES 
user_id=1200846

My real name is Hirokazu Yamamoto. But sorry, I don't have
FAX. (It's needed to send contributor form, isn't it?)

I'll attach the patch updated for trunk. And I'll attach the
tests.

----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2006-03-27 06:05

Message:
Logged In: YES 
user_id=21627

I have reservations against this patch because of the
quasi-anonymous nature of the submission. ocean-city, can
you please state your real name? Would you also be willing
to fill out a contributor form, as shown on

http://www.python.org/psf/contrib-form.html

----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-03-24 23:02

Message:
Logged In: YES 
user_id=1200846

OK, I'll try.

----------------------------------------------------------------------

Comment By: Walter D?rwald (doerwalter)
Date: 2006-03-24 06:44

Message:
Logged In: YES 
user_id=89016

This isn't a bugfix in the strictest sense, so IMHO this
patch shouldn't go into 2.4. 

If the patch goes into 2.5, it would need the appropriate
changes to encodings/mbcs.py (i.e. the IncrementalDecoder
would have to be changed (inheriting from
BufferedIncrementalDecoder).

I realize that this patch might be hard to test, as results
are dependent on locale. Nevertheless at least some tests
would be good (even if they are only run or do something
useful on a certain locale and are skipped otherwise).

ocean-city, can you update the patch for the trunk and add
tests?


----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-03-23 11:51

Message:
Logged In: YES 
user_id=1200846

Hello. This is my final patch. (mbcs4.patch)

 - mbcs3a.patch: _mbsbtype depends on locale not system ANSI
code page. so probably it's not good to use it with
MultiByteToWideChar.

 - mbcs3b.patch: CharNext may cause buffer overflow. and
this patch always calls CharPrev but it's not needed if
string is not terminated with "potensial" lead byte.

I hope this is stable enough to commit on repositry. Thank you.


----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-03-22 23:36

Message:
Logged In: YES 
user_id=1200846

Sorry, I was stupid.

MSDN
(http://msdn.microsoft.com/library/default.asp?url=/library/en-us/intl/unicode_0o2t.asp)
saids,

> IsDBCSLeadByte can only indicate a potential lead byte value. 

IsDBCSLeadByte was returning 1 for some trail byte (ex: "???"[1])

The patch "mbcs3a.patch" worked for me, but _mbsbtype is
probably compiler specific. Is that OK?

The patch "mbcs3b.patch" also worked for me and it only uses
Win32API, but I don't have enough faith on this
implementation...


----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-03-22 19:31

Message:
Logged In: YES 
user_id=1200846

Sorry, I found problem when tried more long text file...
Please wait. I'll investigate more intensibly.

----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-03-22 19:13

Message:
Logged In: YES 
user_id=1200846

Thank you for reply. How about this? (I'm a newbie, I hope
this is right tex format but... can you confirm this? I
created this patch by copy & paste from
PyUnicode_DecodeUTF16Stateful and some modification)


----------------------------------------------------------------------

Comment By: M.-A. Lemburg (lemburg)
Date: 2006-03-22 18:12

Message:
Logged In: YES 
user_id=38388

One more nit: the doc patch is missing. Please add a patch
for the API docs.


----------------------------------------------------------------------

Comment By: M.-A. Lemburg (lemburg)
Date: 2006-03-22 18:11

Message:
Logged In: YES 
user_id=38388

As I understand your comment, the mbcs codec will have a
problem if the input string terminates with a lead byte.

Could you add a comment regarding this to the patch ?!

I can't test the patch, since I don't have a Japanese
Windows to check on, but from looking at the patch, it seems OK.


----------------------------------------------------------------------

Comment By: Hirokazu Yamamoto (ocean-city)
Date: 2006-03-22 16:42

Message:
Logged In: YES 
user_id=1200846

I forgot to mention this. "mbcs.patch" is for
release24-maint branch.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1455898&group_id=5470

From noreply at sourceforge.net  Fri May 26 23:43:01 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Fri, 26 May 2006 14:43:01 -0700
Subject: [Patches] [ python-Patches-1494554 ] Numeric characters not
	recognized.
Message-ID: <E1Fjk5F-0006yX-Tu@sc8-sf-web4-b.sourceforge.net>

Patches item #1494554, was opened at 2006-05-24 21:54
Message generated for change (Comment added) made by loewis
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1494554&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Anders Chrigstr?m (andersch)
Assigned to: Nobody/Anonymous (nobody)
Summary: Numeric characters not recognized.

Initial Comment:
unicode.isnumeric() and unicodedata.numeric() fails to
recognize a bunch of numeric unicode characters.

The patch fixes this.


----------------------------------------------------------------------

>Comment By: Martin v. L??wis (loewis)
Date: 2006-05-26 23:43

Message:
Logged In: YES 
user_id=21627

I agree it should be possible to regenerate that easily (or
perhaps entirely merge it into unicodedata/unicodectype).

andersch, how did you create the patch?

----------------------------------------------------------------------

Comment By: M.-A. Lemburg (lemburg)
Date: 2006-05-26 22:42

Message:
Logged In: YES 
user_id=38388

Rather than creating a patch for every new version, how
about extracting the relevant data from the Unicode database
using a script and putting that into Tools/unicode/ ?!

Note that the original version was also generated from the
database. Unfortunately, I can't find that script anymore.

One nit with the patch: it should put non-BMP Unicode code
points into #ifdef Py_UNICODE_WIDE ... #endif clauses.


----------------------------------------------------------------------

Comment By: Anders Chrigstr?m (andersch)
Date: 2006-05-26 21:20

Message:
Logged In: YES 
user_id=621306

The patch makes it match version 4.1.0. Though it didn't match
version 3.2.0 either.


----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2006-05-26 20:07

Message:
Logged In: YES 
user_id=21627

Which version of the Unicode database is this based on?

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1494554&group_id=5470

From noreply at sourceforge.net  Sat May 27 01:17:24 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Fri, 26 May 2006 16:17:24 -0700
Subject: [Patches] [ python-Patches-1145039 ] Remove some invariant
	conditions and assert in ceval
Message-ID: <E1FjlYa-0008CJ-7V@sc8-sf-web4-b.sourceforge.net>

Patches item #1145039, was opened at 2005-02-20 16:31
Message generated for change (Comment added) made by tim_one
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1145039&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Performance
Group: Python 2.5
>Status: Closed
>Resolution: Accepted
Priority: 5
Submitted By: Neal Norwitz (nnorwitz)
Assigned to: Nobody/Anonymous (nobody)
Summary: Remove some invariant conditions and assert in ceval

Initial Comment:
ISTM that if frame->f_exc_type == NULL then exc_value
and exc_traceback will also be NULL.  I didn't see that
this is documented, perhaps I missed it or there is
some case when this can occur.  If it can occur, we
shoul develop a test for it.

Assuming this condition is invariant, some
simplifications can be made in reset_exc_info which is
called once per eval_frame (on function exit).

Also, I think there is currently an extra Py_INCREF on
Py_None.  This occurs when tstate->exc_type == NULL.

This patch seems to have little to no effect on
performance.  I did measure a 0.3% speed improvement.

----------------------------------------------------------------------

>Comment By: Tim Peters (tim_one)
Date: 2006-05-26 19:17

Message:
Logged In: YES 
user_id=31435

I got a very tiny bit more out of this and added it to the
trunk.  Thanks!  Note that I left in the disputed Py_INCREF.

The more ambitious tim-exc_sanity branch is looking like
more  trouble than it's worth.  

----------------------------------------------------------------------

Comment By: Tim Peters (tim_one)
Date: 2006-05-26 14:41

Message:
Logged In: YES 
user_id=31435

Note that the patch is out of date.

I agree the invariant you deduced should hold, but in fact
it doesn't now, at least due to insane initialization
problems in exceptions.c:

http://mail.python.org/pipermail/python-dev/2006-May/065248.html

I'd like to ensure & exploit a stronger invariant:

http://mail.python.org/pipermail/python-dev/2006-May/065231.html

but that's stuck for now.  I put my work in progress on a
new branch:

svn+ssh://svn.python.org/python/branches/tim-exc_sanity

BTW, I don't agree that the incref on Py_None wasn't needed.
 Py_None is getting assigned to two new pointers
(tstate->exc_type and frame->f_exc_type), so should be
incremented twice.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1145039&group_id=5470

From noreply at sourceforge.net  Sat May 27 05:48:57 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Fri, 26 May 2006 20:48:57 -0700
Subject: [Patches] [ python-Patches-921466 ] Reduce number of open calls on
	startup
Message-ID: <E1FjpnN-0007or-6l@sc8-sf-web1.sourceforge.net>

Patches item #921466, was opened at 2004-03-22 16:10
Message generated for change (Comment added) made by nnorwitz
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=921466&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: None
Status: Open
Resolution: Accepted
Priority: 5
Submitted By: Martin v. L??wis (loewis)
Assigned to: Georg Brandl (gbrandl)
Summary: Reduce number of open calls on startup

Initial Comment:
This patch uses sys.path_importer_cache to reduce the
number of open calls, in the following way:
- if the value in path_importer_cache is None, it stats
the path to find out whether the file exists
- it then puts True/False into path_importer_cache
- if the value in path_importer_cache is False, the
path entry is skipped on all imports
- if the value is True, the stat call is skipped, and
open calls for files in the directory are made.

On Linux, this reduces the number of open calls for an
empty script from 343 to 263. The startup-time (for 100
interpreter invocations) goes down by one percent (from
0.0819s to 0.08113s per invocation).

----------------------------------------------------------------------

>Comment By: Neal Norwitz (nnorwitz)
Date: 2006-05-26 20:48

Message:
Logged In: YES 
user_id=33168

Georg, didn't you check this in or was that a diff patch?

----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2006-05-26 10:35

Message:
Logged In: YES 
user_id=21627

Your revised patch looks fine to me, so please apply.

----------------------------------------------------------------------

Comment By: Neal Norwitz (nnorwitz)
Date: 2006-05-25 23:38

Message:
Logged In: YES 
user_id=33168

Without looking at the patch impl, I'm +1 on the idea of
reducing stat/open calls.  On NFS this is a huge time sync.

----------------------------------------------------------------------

Comment By: Georg Brandl (gbrandl)
Date: 2006-05-25 11:47

Message:
Logged In: YES 
user_id=849994

I reviewed this patch, in in consequence discovered a
problem with the sys.path_hooks machinery, described in
http://mail.python.org/pipermail/python-dev/2006-May/065173.html

This patch fixes the problem and corrects the original patch
to not set any sys.path_importer_cache entry to True or
False when no import hooks are enabled (the p_loader
argument to find_module is NULL then).

----------------------------------------------------------------------

Comment By: Georg Brandl (gbrandl)
Date: 2006-02-25 11:55

Message:
Logged In: YES 
user_id=849994

I'm very much for it. I haven't got too much RAM, and
whenever I start a Python program (emerge being the most
prominent example) after having worked heavily with e.g.
graphics or VMware, I'm hit by the files Python's opening
not being in the file cache anymore.

----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2006-02-20 13:51

Message:
Logged In: YES 
user_id=21627

Not sure. Anybody speaking in favour? against?

----------------------------------------------------------------------

Comment By: Georg Brandl (birkenfeld)
Date: 2006-02-20 02:42

Message:
Logged In: YES 
user_id=1188172

Can this go into 2.5?

----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2004-03-23 07:43

Message:
Logged In: YES 
user_id=21627

It's certainly the case that the system has cached all files needed for 
startup in memory, including the directory contents of all directories 
searched.

OTOH, I assume that is the scenario in which people worry about startup 
time: high-frequency invocations of python. For a single invocation, it 
shouldn't matter much whether it takes 0.04s or 0.08s.

----------------------------------------------------------------------

Comment By: Raymond Hettinger (rhettinger)
Date: 2004-03-22 23:30

Message:
Logged In: YES 
user_id=80475

I am surprised that making 25% fewer open calls doesn't save
more than 1% in startup time.

One other thought, I wonder if the timing of these changes
is affected by the OS keeping recently loaded files in
buffers so that disk access time not included.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=921466&group_id=5470

From noreply at sourceforge.net  Sat May 27 08:56:15 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Fri, 26 May 2006 23:56:15 -0700
Subject: [Patches] [ python-Patches-1494554 ] Numeric characters not
	recognized.
Message-ID: <E1Fjsid-0002SK-5D@sc8-sf-web4-b.sourceforge.net>

Patches item #1494554, was opened at 2006-05-24 21:54
Message generated for change (Comment added) made by andersch
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1494554&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Anders Chrigstr?m (andersch)
Assigned to: Nobody/Anonymous (nobody)
Summary: Numeric characters not recognized.

Initial Comment:
unicode.isnumeric() and unicodedata.numeric() fails to
recognize a bunch of numeric unicode characters.

The patch fixes this.


----------------------------------------------------------------------

>Comment By: Anders Chrigstr?m (andersch)
Date: 2006-05-27 08:56

Message:
Logged In: YES 
user_id=621306

I got the differenced through comparing with _numeric in
http://codespeak.net/svn/pypy/dist/pypy/module/unicodedata/unicodedb.py
which we have generated with
ssh://codespeak.net/svn/pypy/dist/pypy/module/unicodedata/generate_unicodedb.py

If You are looking into generating this from the Unicode
database You might want to fix _PyUnicode_IsLinebreak too.


----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2006-05-26 23:43

Message:
Logged In: YES 
user_id=21627

I agree it should be possible to regenerate that easily (or
perhaps entirely merge it into unicodedata/unicodectype).

andersch, how did you create the patch?

----------------------------------------------------------------------

Comment By: M.-A. Lemburg (lemburg)
Date: 2006-05-26 22:42

Message:
Logged In: YES 
user_id=38388

Rather than creating a patch for every new version, how
about extracting the relevant data from the Unicode database
using a script and putting that into Tools/unicode/ ?!

Note that the original version was also generated from the
database. Unfortunately, I can't find that script anymore.

One nit with the patch: it should put non-BMP Unicode code
points into #ifdef Py_UNICODE_WIDE ... #endif clauses.


----------------------------------------------------------------------

Comment By: Anders Chrigstr?m (andersch)
Date: 2006-05-26 21:20

Message:
Logged In: YES 
user_id=621306

The patch makes it match version 4.1.0. Though it didn't match
version 3.2.0 either.


----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2006-05-26 20:07

Message:
Logged In: YES 
user_id=21627

Which version of the Unicode database is this based on?

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1494554&group_id=5470

From noreply at sourceforge.net  Sat May 27 10:39:35 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Sat, 27 May 2006 01:39:35 -0700
Subject: [Patches] [ python-Patches-1494554 ] Numeric characters not
	recognized.
Message-ID: <E1FjuKd-0003fR-5K@sc8-sf-web4-b.sourceforge.net>

Patches item #1494554, was opened at 2006-05-24 21:54
Message generated for change (Comment added) made by loewis
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1494554&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.5
>Status: Closed
>Resolution: Accepted
Priority: 5
Submitted By: Anders Chrigstr?m (andersch)
Assigned to: Nobody/Anonymous (nobody)
Summary: Numeric characters not recognized.

Initial Comment:
unicode.isnumeric() and unicodedata.numeric() fails to
recognize a bunch of numeric unicode characters.

The patch fixes this.


----------------------------------------------------------------------

>Comment By: Martin v. L??wis (loewis)
Date: 2006-05-27 10:39

Message:
Logged In: YES 
user_id=21627

Thanks for the patch, committed as 46432. I conditionalized
the non-BMP characters on Py_UNICODE_WIDE, and updated
PyUnicode_IsNumeric to recognize U+0F33 as a numeric character.

If anybody wants to contribute a generator for these
functions (or perhaps generate a table in the first place),
please go ahead.

----------------------------------------------------------------------

Comment By: Anders Chrigstr?m (andersch)
Date: 2006-05-27 08:56

Message:
Logged In: YES 
user_id=621306

I got the differenced through comparing with _numeric in
http://codespeak.net/svn/pypy/dist/pypy/module/unicodedata/unicodedb.py
which we have generated with
ssh://codespeak.net/svn/pypy/dist/pypy/module/unicodedata/generate_unicodedb.py

If You are looking into generating this from the Unicode
database You might want to fix _PyUnicode_IsLinebreak too.


----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2006-05-26 23:43

Message:
Logged In: YES 
user_id=21627

I agree it should be possible to regenerate that easily (or
perhaps entirely merge it into unicodedata/unicodectype).

andersch, how did you create the patch?

----------------------------------------------------------------------

Comment By: M.-A. Lemburg (lemburg)
Date: 2006-05-26 22:42

Message:
Logged In: YES 
user_id=38388

Rather than creating a patch for every new version, how
about extracting the relevant data from the Unicode database
using a script and putting that into Tools/unicode/ ?!

Note that the original version was also generated from the
database. Unfortunately, I can't find that script anymore.

One nit with the patch: it should put non-BMP Unicode code
points into #ifdef Py_UNICODE_WIDE ... #endif clauses.


----------------------------------------------------------------------

Comment By: Anders Chrigstr?m (andersch)
Date: 2006-05-26 21:20

Message:
Logged In: YES 
user_id=621306

The patch makes it match version 4.1.0. Though it didn't match
version 3.2.0 either.


----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2006-05-26 20:07

Message:
Logged In: YES 
user_id=21627

Which version of the Unicode database is this based on?

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1494554&group_id=5470

From noreply at sourceforge.net  Sat May 27 11:23:08 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Sat, 27 May 2006 02:23:08 -0700
Subject: [Patches] [ python-Patches-921466 ] Reduce number of open calls on
	startup
Message-ID: <E1Fjv0m-0005Ha-99@sc8-sf-web3.sourceforge.net>

Patches item #921466, was opened at 2004-03-23 00:10
Message generated for change (Settings changed) made by gbrandl
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=921466&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: None
>Status: Closed
Resolution: Accepted
Priority: 5
Submitted By: Martin v. L??wis (loewis)
Assigned to: Georg Brandl (gbrandl)
Summary: Reduce number of open calls on startup

Initial Comment:
This patch uses sys.path_importer_cache to reduce the
number of open calls, in the following way:
- if the value in path_importer_cache is None, it stats
the path to find out whether the file exists
- it then puts True/False into path_importer_cache
- if the value in path_importer_cache is False, the
path entry is skipped on all imports
- if the value is True, the stat call is skipped, and
open calls for files in the directory are made.

On Linux, this reduces the number of open calls for an
empty script from 343 to 263. The startup-time (for 100
interpreter invocations) goes down by one percent (from
0.0819s to 0.08113s per invocation).

----------------------------------------------------------------------

>Comment By: Georg Brandl (gbrandl)
Date: 2006-05-27 09:23

Message:
Logged In: YES 
user_id=849994

Yes, I indeed checked this in, in rev. 46372. Thanks for
reminding me, Neal.

----------------------------------------------------------------------

Comment By: Neal Norwitz (nnorwitz)
Date: 2006-05-27 03:48

Message:
Logged In: YES 
user_id=33168

Georg, didn't you check this in or was that a diff patch?

----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2006-05-26 17:35

Message:
Logged In: YES 
user_id=21627

Your revised patch looks fine to me, so please apply.

----------------------------------------------------------------------

Comment By: Neal Norwitz (nnorwitz)
Date: 2006-05-26 06:38

Message:
Logged In: YES 
user_id=33168

Without looking at the patch impl, I'm +1 on the idea of
reducing stat/open calls.  On NFS this is a huge time sync.

----------------------------------------------------------------------

Comment By: Georg Brandl (gbrandl)
Date: 2006-05-25 18:47

Message:
Logged In: YES 
user_id=849994

I reviewed this patch, in in consequence discovered a
problem with the sys.path_hooks machinery, described in
http://mail.python.org/pipermail/python-dev/2006-May/065173.html

This patch fixes the problem and corrects the original patch
to not set any sys.path_importer_cache entry to True or
False when no import hooks are enabled (the p_loader
argument to find_module is NULL then).

----------------------------------------------------------------------

Comment By: Georg Brandl (gbrandl)
Date: 2006-02-25 19:55

Message:
Logged In: YES 
user_id=849994

I'm very much for it. I haven't got too much RAM, and
whenever I start a Python program (emerge being the most
prominent example) after having worked heavily with e.g.
graphics or VMware, I'm hit by the files Python's opening
not being in the file cache anymore.

----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2006-02-20 21:51

Message:
Logged In: YES 
user_id=21627

Not sure. Anybody speaking in favour? against?

----------------------------------------------------------------------

Comment By: Georg Brandl (birkenfeld)
Date: 2006-02-20 10:42

Message:
Logged In: YES 
user_id=1188172

Can this go into 2.5?

----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2004-03-23 15:43

Message:
Logged In: YES 
user_id=21627

It's certainly the case that the system has cached all files needed for 
startup in memory, including the directory contents of all directories 
searched.

OTOH, I assume that is the scenario in which people worry about startup 
time: high-frequency invocations of python. For a single invocation, it 
shouldn't matter much whether it takes 0.04s or 0.08s.

----------------------------------------------------------------------

Comment By: Raymond Hettinger (rhettinger)
Date: 2004-03-23 07:30

Message:
Logged In: YES 
user_id=80475

I am surprised that making 25% fewer open calls doesn't save
more than 1% in startup time.

One other thought, I wonder if the timing of these changes
is affected by the OS keeping recently loaded files in
buffers so that disk access time not included.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=921466&group_id=5470

From noreply at sourceforge.net  Sat May 27 16:15:31 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Sat, 27 May 2006 07:15:31 -0700
Subject: [Patches] [ python-Patches-1495999 ] Windows CE support (part 2)
Message-ID: <E1FjzZj-0006DT-TZ@sc8-sf-web1.sourceforge.net>

Patches item #1495999, was opened at 2006-05-27 22:15
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1495999&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Luke Dunstan (infidel)
Assigned to: Nobody/Anonymous (nobody)
Summary: Windows CE support (part 2)

Initial Comment:

This patch contains some more changes necessary to 
build Python trunk for Windows CE 4.x. More patches 
to come...

The changes are:

Missing headers: conio.h, direct.h, errno.h, io.h, 
process.h, signal.h, sys/stat.h, sys/types.h

- Change #ifndef DONT_HAVE_*_H to HAVE_*_H

- Add #ifdef guards for many of the #includes for 
these headers

- Add checks for headers to configure.in, 
pyconfig.h.in

- Add HAVE_*_H to manually edited versions of 
pyconfig.h (except for Windows CE)

- NOTE: the following are Windows-specific headers: 
conio.h, direct.h, io.h, process.h 

PC/pyconfig.h:

- define dummy macro implementations of getenv() and 
environ (Windows CE only)

- define macro implementation of GetVersion() 
(Windows CE only)

Modules/socketmodule.c: adjusted _MSC_VER conditional


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1495999&group_id=5470

From noreply at sourceforge.net  Sat May 27 20:22:17 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Sat, 27 May 2006 11:22:17 -0700
Subject: [Patches] [ python-Patches-813436 ] Scalable zipfile extension
Message-ID: <E1Fk3QX-0001DH-MY@sc8-sf-web3.sourceforge.net>

Patches item #813436, was opened at 2003-09-27 10:09
Message generated for change (Comment added) made by ronaldoussoren
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=813436&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Performance
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Marc De Falco (deufeufeu)
Assigned to: Nobody/Anonymous (nobody)
Summary: Scalable zipfile extension

Initial Comment:
Playing around with large zipfiles (&gt; 10000 files),
I've encountered big loading time, even if after having
loaded it I use only 30 files in it.
So I've introduced a differed parameter to the
Zipfile.__init__ in order to load headers on-demand.
As it's not a really good idea to activated it for all
zip it defaults to False.
I've updated the documentation too.

Thx and keep the good work ;)

P.S. : Dunno if it can be added to 2.3 or have to be
included in 2.4, so I've choosed 2.4 group.


----------------------------------------------------------------------

>Comment By: Ronald Oussoren (ronaldoussoren)
Date: 2006-05-27 20:22

Message:
Logged In: YES 
user_id=580910

Patch [1446489 ] zipfile: support for ZIP64 also addresses this as a side-
effect of adding support ZIP64 support (for very big zipfiles).

BTW. I don't quite understand why this patch is put on hold just because a 
rewrite of the zipfile module is planned. 

W.r.t. this patch: why is the on-demand loading optional? Loading the per-file 
headers when the zipfile is opened is not necessary for normal operation, the 
current zipfile module is basically doing a full verify of the zipfile on all 
occassions. This isn't necessary for normal operation and I don't think the 
infozip tools do this (probably because verification is very  expensive).

----------------------------------------------------------------------

Comment By: Sean Reifschneider (jafo)
Date: 2006-05-25 16:38

Message:
Logged In: YES 
user_id=81797

Actually, we'll leave it open until the Summer of Code
implementation is completed and accepted.

Sean

----------------------------------------------------------------------

Comment By: Sean Reifschneider (jafo)
Date: 2006-05-25 16:36

Message:
Logged In: YES 
user_id=81797

There is a summer of code project to re-write the zipfile
module, so this patch is moot.

Sean

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=813436&group_id=5470

From noreply at sourceforge.net  Sat May 27 22:10:25 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Sat, 27 May 2006 13:10:25 -0700
Subject: [Patches] [ python-Patches-1496135 ] Fix test_exceptions.py
Message-ID: <E1Fk57B-00027S-DO@sc8-sf-web5.sourceforge.net>

Patches item #1496135, was opened at 2006-05-27 16:10
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1496135&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Tests
Group: Python 3000
Status: Open
Resolution: None
Priority: 5
Submitted By: Collin Winter (collinwinter)
Assigned to: Nobody/Anonymous (nobody)
Summary: Fix test_exceptions.py

Initial Comment:
The attached patch fixes a bug in
Lib/test/test_exceptions.py related to the
disappearance of apply() in Python 3000.

The patch is against r46491.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1496135&group_id=5470

From noreply at sourceforge.net  Sat May 27 22:16:59 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Sat, 27 May 2006 13:16:59 -0700
Subject: [Patches] [ python-Patches-1496135 ] Fix test_exceptions.py
Message-ID: <E1Fk5DX-00056s-U4@sc8-sf-web5.sourceforge.net>

Patches item #1496135, was opened at 2006-05-27 16:10
Message generated for change (Settings changed) made by collinwinter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1496135&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Tests
Group: Python 3000
Status: Open
Resolution: None
Priority: 5
Submitted By: Collin Winter (collinwinter)
>Assigned to: Guido van Rossum (gvanrossum)
Summary: Fix test_exceptions.py

Initial Comment:
The attached patch fixes a bug in
Lib/test/test_exceptions.py related to the
disappearance of apply() in Python 3000.

The patch is against r46491.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1496135&group_id=5470

From noreply at sourceforge.net  Sun May 28 01:36:05 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Sat, 27 May 2006 16:36:05 -0700
Subject: [Patches] [ python-Patches-1496206 ] urllib2 HTTPPasswordMgr:
	default ports
Message-ID: <E1Fk8KD-0006Lk-91@sc8-sf-web4-b.sourceforge.net>

Patches item #1496206, was opened at 2006-05-28 00:36
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1496206&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: John J Lee (jjlee)
Assigned to: Nobody/Anonymous (nobody)
Summary: urllib2 HTTPPasswordMgr: default ports

Initial Comment:
urllib2 HTTPPasswordMgr had support for ports added
during 2.5 development, but that code doesn't know
about the default HTTP / HTTPS ports.  As a result, for
example, a fetch of "https://example.com:443"
301-redirected to "https://example.com/" (as a local
Apache server did on my linux box) will fail unless you
register both "example.com:443" and "example.com" with
the HTTPPasswordMgr.  I'd call that a bug.

The patch adds a new test and takes care not to break
the case where old code calls add_password for
example.com and then find_user_password is called for
example.com (with no explicit port).

The patch also comments out one test which was testing
something not actually guaranteed by the code at all --
it was passing by fluke.  The code it's trying to test
could do with some review, which is why I left this
test commented out rather than deleting the test (but
that is a long-standing issue unrelated to this patch,
so should not block this patch from being applied).


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1496206&group_id=5470

From noreply at sourceforge.net  Sun May 28 01:42:32 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Sat, 27 May 2006 16:42:32 -0700
Subject: [Patches] [ python-Patches-1496206 ] urllib2 HTTPPasswordMgr:
	default ports
Message-ID: <E1Fk8QS-0001dR-8T@sc8-sf-web5.sourceforge.net>

Patches item #1496206, was opened at 2006-05-28 00:36
Message generated for change (Comment added) made by jjlee
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1496206&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: John J Lee (jjlee)
>Assigned to: Georg Brandl (gbrandl)
Summary: urllib2 HTTPPasswordMgr: default ports

Initial Comment:
urllib2 HTTPPasswordMgr had support for ports added
during 2.5 development, but that code doesn't know
about the default HTTP / HTTPS ports.  As a result, for
example, a fetch of "https://example.com:443"
301-redirected to "https://example.com/" (as a local
Apache server did on my linux box) will fail unless you
register both "example.com:443" and "example.com" with
the HTTPPasswordMgr.  I'd call that a bug.

The patch adds a new test and takes care not to break
the case where old code calls add_password for
example.com and then find_user_password is called for
example.com (with no explicit port).

The patch also comments out one test which was testing
something not actually guaranteed by the code at all --
it was passing by fluke.  The code it's trying to test
could do with some review, which is why I left this
test commented out rather than deleting the test (but
that is a long-standing issue unrelated to this patch,
so should not block this patch from being applied).


----------------------------------------------------------------------

>Comment By: John J Lee (jjlee)
Date: 2006-05-28 00:42

Message:
Logged In: YES 
user_id=261020

Assigning to Georg since he added the default port support.

(Actually, the support for ports added during 2.5
development was specifically for proxies IIRC: ordinary
(non-proxy) Basic Auth for URLs with a port did work prior
to 2.5, I think.  Still, this patch is backwards-compatible.)


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1496206&group_id=5470

From noreply at sourceforge.net  Sun May 28 01:58:48 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Sat, 27 May 2006 16:58:48 -0700
Subject: [Patches] [ python-Patches-972322 ] urllib2 handler naming
	convention collision
Message-ID: <E1Fk8gC-0001OV-0y@sc8-sf-web5.sourceforge.net>

Patches item #972322, was opened at 2004-06-14 00:16
Message generated for change (Comment added) made by jjlee
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=972322&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.4
Status: Open
Resolution: None
Priority: 5
Submitted By: John J Lee (jjlee)
Assigned to: Jeremy Hylton (jhylton)
Summary: urllib2 handler naming convention collision

Initial Comment:
The method naming conventions of *_open and *_request
in urllib2 are accidentally met by the following methods:

AbstractHTTPHandler.do_open()
ProxyHandler.proxy_open()
AbstractHTTPHandler.redirect_request()

So URLs like do://example.com/ are regarded as having a
handler, and urllib2.urlopen("do://python.org/") causes
a TypeError.

I think *something* should be done about this, but I'm
willing to provide a different patch if this one is
frowned upon.  The alternative would be to rename
do_open and proxy_open, and leave the redirect_request
case unchanged (see below for why).

The first two methods are undocumented, so could in
theory be renamed.  However, people will likely be
overriding them anyway, so perhaps it's better to apply
this ugly patch than rename them.

redirect_request is documented, so can't be renamed,
but it will never be accidentally called unless
somebody actually adds a handler with a method named
"redirect_open".


----------------------------------------------------------------------

>Comment By: John J Lee (jjlee)
Date: 2006-05-28 00:58

Message:
Logged In: YES 
user_id=261020

OK, collision_v2.patch is less ugly, and supercedes all the
previous patches attached to this tracker item.

----------------------------------------------------------------------

Comment By: John J Lee (jjlee)
Date: 2006-05-07 23:27

Message:
Logged In: YES 
user_id=261020

OK, I see a slightly less ugly fix, don't apply this.  I
intend to upload a better one later.

----------------------------------------------------------------------

Comment By: John J Lee (jjlee)
Date: 2006-03-31 22:16

Message:
Logged In: YES 
user_id=261020

Here's an updated patch (collision.patch) that applies
against SVN HEAD.  I also made the test a little clearer. 
collision.patch supercedes both urllib2.py.patch and
test_urllib2.py.patch

----------------------------------------------------------------------

Comment By: John J Lee (jjlee)
Date: 2005-05-19 21:53

Message:
Logged In: YES 
user_id=261020

Since nobody seems to mind the slightly uglified code
required to fix these bugs in a backwards-compatible way,
could somebody please apply this patch?


----------------------------------------------------------------------

Comment By: Michael Chermside (mcherm)
Date: 2004-10-22 17:36

Message:
Logged In: YES 
user_id=99874

I have reviewed this patch and I recomend applying it.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=972322&group_id=5470

From noreply at sourceforge.net  Sun May 28 03:12:05 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Sat, 27 May 2006 18:12:05 -0700
Subject: [Patches] [ python-Patches-1243730 ] Big speedup in email message
	parsing
Message-ID: <E1Fk9p7-00042M-Mu@sc8-sf-web3.sourceforge.net>

Patches item #1243730, was opened at 2005-07-23 18:07
Message generated for change (Comment added) made by bwarsaw
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1243730&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Performance
>Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: L. Peter Deutsch (lpd)
Assigned to: Barry A. Warsaw (bwarsaw)
Summary: Big speedup in email message parsing

Initial Comment:
Python 2.4.1, Red Hat Linux 7.3.

Speeds up message parsing on files with large
attachments by approximately 4x, mostly by replacing
REs by direct string processing.


----------------------------------------------------------------------

>Comment By: Barry A. Warsaw (bwarsaw)
Date: 2006-05-27 21:12

Message:
Logged In: YES 
user_id=12800

Here's a slightly better version, cleaned up for style and
applicable to Python 2.5 (which is the only place I'd feel
comfortable applying it).  I've verified that this provides
about a 3x speed up at least for some messages with really
big attachments.

----------------------------------------------------------------------

Comment By: Steve Holden (holdenweb)
Date: 2006-05-25 18:55

Message:
Logged In: YES 
user_id=88157

A first examinaation reveals no particular speedup on an
email with approximately 30 MB of attachments. Can the OP
perhaps provide some code and test data I could time to
verify the assertions of speedup? Otherwise I can't see much
point in applying the patch.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1243730&group_id=5470

From noreply at sourceforge.net  Sun May 28 18:58:17 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Sun, 28 May 2006 09:58:17 -0700
Subject: [Patches] [ python-Patches-1490384 ] PC new-logo-based icon set
Message-ID: <E1FkOan-0006fs-E3@sc8-sf-web5.sourceforge.net>

Patches item #1490384, was opened at 2006-05-17 18:59
Message generated for change (Comment added) made by loewis
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1490384&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Windows
Group: Python 2.5
>Status: Closed
Resolution: Accepted
Priority: 8
Submitted By: Andrew Clover (bobince)
Assigned to: Martin v. L??wis (loewis)
Summary: PC new-logo-based icon set

Initial Comment:
Following positive discussion on -dev, here's the
updated version of the PC/py*.ico files I hacked up a
while ago.

The attachment is a ZIP, not a patch, as it contains
only binaries. Also available as tgz:

  http://doxdesk.com/img/software/py/win32-icons.tar.gz

Also possibly of interest:

  http://doxdesk.com/img/software/py/icons3.zip

This attachment contains only the simple replacement
files; the icons3 ZIP also contains:

  - source
  - versions including Windows Vista large icons
    (probably not worth including at this point as they're
    quite sizable and no-one is using Vista yet)
  - an egg icon
    (there is currently no installer/shell support for
eggs,
    but could be worth adding in future)
  - a new installer side banner
    (this has not currently seen any discussion on -dev,
    but may be worth considering if the intention is to
    leave behind the purple/green snake branding)


----------------------------------------------------------------------

>Comment By: Martin v. L??wis (loewis)
Date: 2006-05-28 18:58

Message:
Logged In: YES 
user_id=21627

Committed the rest as r46503. Thanks again!

----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2006-05-26 19:02

Message:
Logged In: YES 
user_id=21627

Ok, I will then do the following changes still:
- add baselogo.svg and source.xar (ignore all the other files),
- remove the attribution for Erik (sorry for missing that)

----------------------------------------------------------------------

Comment By: Andrew Clover (bobince)
Date: 2006-05-26 04:50

Message:
Logged In: YES 
user_id=311085

> I put a demo installer containing them

Seems to work OK. The thanks at the end still attributes the
graphic to Erik though; I'm not after an ack there myself,
but changing the text to not imply the current graphic is
his one may be appropriate.

> baselogo.svg; I assume this is a source file

Yes. This is just the Python logo itself (the gradient
version as used on the new website), in vector format.

> icons.svgz; can't figure out what this is

Same as source.xar, but exported as W3C standard SVG format
for wider compatibility [compressed, hence the 'z'].

Unfortunately because SVG cannot reproduce some of effects
used, and because the SVG export path is currently quite
bad, it's not really directly usable, but it might be of use
to anyone who wants to hack on the graphics but doesn't use
Xara.

> source.xar; not sure either

This is the primary vector graphics source of the icons -
the other SVG and PNG files are just there because other
people requested them.

It's in Xara format, a previously proprietary graphics
application which has now gone open-source and is heading
rapidly towards being usable on Linux, but isn't quite there
yet.

> a directory called png, with many png file - I expect
> that these aren't source files, are they?

Nope, they're just exactly the same content as in the
(with-vista) .ico files, just supplied as PNG for anyone who
wants to fiddle with them in a more accessible bitmap format.


----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2006-05-22 10:56

Message:
Logged In: YES 
user_id=21627

Thanks for the patch. I have committed it as r46063. I put a
demo installer containing them at

http://www.dcl.hpi.uni-potsdam.de/home/loewis/python-2.5.13290.msi

I would also like to add the source files, but I have
difficulties figuring out what they are. There is a source
directory; with:

- baselogo.svg; I assume this is a source file
- icons.svgz; can't figure out what this is
- source.xar; not sure either
- a directory called png, with many png file - I expect
  that these aren't source files, are they?

----------------------------------------------------------------------

Comment By: Andrew Clover (bobince)
Date: 2006-05-19 16:21

Message:
Logged In: YES 
user_id=311085

Sure, no worries. I'll fax over the -python version since I
have ancient contributions to cover too.


----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2006-05-19 13:22

Message:
Logged In: YES 
user_id=21627

Thanks! Are you willing to contribute them to the PSF, under
the terms of the contributor agreement at

http://www.python.org/psf/contrib/contrib-form/

?

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1490384&group_id=5470

From noreply at sourceforge.net  Sun May 28 22:23:26 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Sun, 28 May 2006 13:23:26 -0700
Subject: [Patches] [ python-Patches-1496206 ] urllib2 HTTPPasswordMgr:
	default ports
Message-ID: <E1FkRnK-0000kH-PK@sc8-sf-web1.sourceforge.net>

Patches item #1496206, was opened at 2006-05-27 23:36
Message generated for change (Comment added) made by gbrandl
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1496206&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.5
>Status: Closed
>Resolution: Accepted
Priority: 5
Submitted By: John J Lee (jjlee)
Assigned to: Georg Brandl (gbrandl)
Summary: urllib2 HTTPPasswordMgr: default ports

Initial Comment:
urllib2 HTTPPasswordMgr had support for ports added
during 2.5 development, but that code doesn't know
about the default HTTP / HTTPS ports.  As a result, for
example, a fetch of "https://example.com:443"
301-redirected to "https://example.com/" (as a local
Apache server did on my linux box) will fail unless you
register both "example.com:443" and "example.com" with
the HTTPPasswordMgr.  I'd call that a bug.

The patch adds a new test and takes care not to break
the case where old code calls add_password for
example.com and then find_user_password is called for
example.com (with no explicit port).

The patch also comments out one test which was testing
something not actually guaranteed by the code at all --
it was passing by fluke.  The code it's trying to test
could do with some review, which is why I left this
test commented out rather than deleting the test (but
that is a long-standing issue unrelated to this patch,
so should not block this patch from being applied).


----------------------------------------------------------------------

>Comment By: Georg Brandl (gbrandl)
Date: 2006-05-28 20:23

Message:
Logged In: YES 
user_id=849994

Thanks! Committed in rev. 46509.

----------------------------------------------------------------------

Comment By: John J Lee (jjlee)
Date: 2006-05-27 23:42

Message:
Logged In: YES 
user_id=261020

Assigning to Georg since he added the default port support.

(Actually, the support for ports added during 2.5
development was specifically for proxies IIRC: ordinary
(non-proxy) Basic Auth for URLs with a port did work prior
to 2.5, I think.  Still, this patch is backwards-compatible.)


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1496206&group_id=5470

From noreply at sourceforge.net  Sun May 28 23:07:33 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Sun, 28 May 2006 14:07:33 -0700
Subject: [Patches] [ python-Patches-1493102 ] Allow build without tracing
Message-ID: <E1FkSU1-0001b5-60@sc8-sf-web3.sourceforge.net>

Patches item #1493102, was opened at 2006-05-22 17:53
Message generated for change (Comment added) made by gbrandl
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1493102&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.5
>Status: Closed
>Resolution: Postponed
Priority: 5
Submitted By: Steve Holden (holdenweb)
Assigned to: Nobody/Anonymous (nobody)
Summary: Allow build without tracing

Initial Comment:
This patch allows the tracing code to be conditioned
out by the absence of a definition for the symbol
WITH_TRACING.

This seems to win a worthwhile speed gain.

----------------------------------------------------------------------

>Comment By: Georg Brandl (gbrandl)
Date: 2006-05-28 21:07

Message:
Logged In: YES 
user_id=849994

As the result of some tests was that the patch even caused
slowdowns on some configurations, it was decided not to
follow this path. Closing this as "Postponed", since someone
might try to do something like this in the future.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1493102&group_id=5470

From noreply at sourceforge.net  Mon May 29 14:44:14 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 29 May 2006 05:44:14 -0700
Subject: [Patches] [ python-Patches-1478788 ] Rename functional to functools
Message-ID: <E1Fkh6U-0003kS-GD@sc8-sf-web4-b.sourceforge.net>

Patches item #1478788, was opened at 2006-04-29 14:35
Message generated for change (Comment added) made by ncoghlan
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1478788&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Modules
Group: Python 2.5
>Status: Closed
>Resolution: Accepted
Priority: 5
Submitted By: Collin Winter (collinwinter)
Assigned to: Nick Coghlan (ncoghlan)
Summary: Rename functional to functools

Initial Comment:
This patch handles the requested renaming of the
functional module to functools.

Due to problems encountered when generating the patch
(svn's diff doesn't handle moved+edited files well and
PC/VC6/pythoncore.dsp was marked as binary), I've had
to upload the new versions of several files with
instructions on how to use them:

1. Apply rename_functional.patch
2. Apply pep-0309.txt.diff to
projects/peps/trunk/pep-0309.txt
3. Delete the following:
   - Doc/lib/libfunctional.tex
   - Lib/test/test_functional.py
   - Modules/functionalmodule.c
4. Add the following:
   - libfunctools.tex -> Doc/lib/libfunctools.tex
   - test_functools.py -> Lib/test/test_functools.py
   - functoolsmodule.c -> Modules/functoolsmodule.c
5. Merge pythoncore.dsp with PC/VC6/pythoncore.dsp


rename_functional.patch changes the following files:
- PCbuild/pythoncore.vcproj
- setup.py
- Misc/NEWS
- PC/config.c
- Doc/whatsnew/whatsnew25.tex
- Doc/lib/lib.tex

All changes to projects/python were based on r45757.
The changes to PEP 309 were based on r45798.

----------------------------------------------------------------------

>Comment By: Nick Coghlan (ncoghlan)
Date: 2006-05-29 22:44

Message:
Logged In: YES 
user_id=1038590

Modified version applied as SVN rev 46520

----------------------------------------------------------------------

Comment By: Collin Winter (collinwinter)
Date: 2006-04-30 02:04

Message:
Logged In: YES 
user_id=1344176

The patch to whatsnew25.tex has been separated out into its
own diff.

The patch to NEWS has been changed as recommended.

I'm not sure what you mean by "the dependencies file for the
Library Reference". grepping my checkout reveals no further
mentions of the old functional name.

----------------------------------------------------------------------

Comment By: Nick Coghlan (ncoghlan)
Date: 2006-04-29 17:09

Message:
Logged In: YES 
user_id=1038590

Inspecting the changes manually (rather than applying them
locally):

The change to NEWS should be an extra entry to say that
functional was renamed to functools for alpha 3 rather than
changing the earlier entry to say functools.

I suggest providing the what's new changes as a separate
patch - AMK may prefer to make his own changes to that area
of the docs.

I believe an update to the dependencies file for the Library
Reference is currently missing.


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1478788&group_id=5470

From noreply at sourceforge.net  Mon May 29 16:04:44 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 29 May 2006 07:04:44 -0700
Subject: [Patches] [ python-Patches-1496952 ] Convert Tkinter to
	METH_VARARGS style
Message-ID: <E1FkiMO-0006av-0y@sc8-sf-web1.sourceforge.net>

Patches item #1496952, was opened at 2006-05-29 14:04
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1496952&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Tkinter
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Georg Brandl (gbrandl)
Assigned to: Martin v. L??wis (loewis)
Summary: Convert Tkinter to METH_VARARGS style

Initial Comment:
Patch attached. Martin, as the maintainer please check.
I also changed some PyTuple_Size and PyTuple_GetItem to
the corresponding macros when tupleness is guaranteed.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1496952&group_id=5470

From noreply at sourceforge.net  Mon May 29 16:10:00 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 29 May 2006 07:10:00 -0700
Subject: [Patches] [ python-Patches-1496957 ] deprecate METH_OLDARGS
Message-ID: <E1FkiRU-0008Pb-Nq@sc8-sf-web1.sourceforge.net>

Patches item #1496957, was opened at 2006-05-29 14:10
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1496957&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Georg Brandl (gbrandl)
Assigned to: Neal Norwitz (nnorwitz)
Summary: deprecate METH_OLDARGS

Initial Comment:
As discussed on python-dev.

Patch includes warning emmitting code in methodobject.c
as well as Doc/api changes.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1496957&group_id=5470

From noreply at sourceforge.net  Mon May 29 17:47:29 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 29 May 2006 08:47:29 -0700
Subject: [Patches] [ python-Patches-1497027 ] urllib2: ensure digest auth
	happens in preference to basic
Message-ID: <E1Fkjxp-0006sk-Fl@sc8-sf-web2.sourceforge.net>

Patches item #1497027, was opened at 2006-05-29 16:47
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1497027&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: John J Lee (jjlee)
Assigned to: Nobody/Anonymous (nobody)
Summary: urllib2: ensure digest auth happens in preference to basic

Initial Comment:
At the moment in urllib2, digest auth is only tried
before basic (as MUST happen according to RFC 2617
section 1.2) only if .add_handler() was called for
HTTPDigestAuthHandler before HTTPBasicAuthHandler. 
This patch ensures that digest is always tried first.

I guess it's unfortunate that sorting of handlers is
done by an attribute rather than by declaring and using
topological sort, since any change to a handler_order
may break code, but that can only be fixed by adding
that as a new feature (probably a new factory).  I
doubt this particular change will catch anybody out,
and it can violate the RFC as-is, so I think this
should go in.  The patch removes the note in the docs
about current values of handler_orders, since that was
already out of date even before this patch.


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1497027&group_id=5470

From noreply at sourceforge.net  Mon May 29 18:39:09 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 29 May 2006 09:39:09 -0700
Subject: [Patches] [ python-Patches-1497027 ] urllib2: ensure digest auth
	happens in preference to basic
Message-ID: <E1Fkklp-0003Ib-L2@sc8-sf-web1.sourceforge.net>

Patches item #1497027, was opened at 2006-05-29 16:47
Message generated for change (Comment added) made by jjlee
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1497027&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: John J Lee (jjlee)
Assigned to: Nobody/Anonymous (nobody)
Summary: urllib2: ensure digest auth happens in preference to basic

Initial Comment:
At the moment in urllib2, digest auth is only tried
before basic (as MUST happen according to RFC 2617
section 1.2) only if .add_handler() was called for
HTTPDigestAuthHandler before HTTPBasicAuthHandler. 
This patch ensures that digest is always tried first.

I guess it's unfortunate that sorting of handlers is
done by an attribute rather than by declaring and using
topological sort, since any change to a handler_order
may break code, but that can only be fixed by adding
that as a new feature (probably a new factory).  I
doubt this particular change will catch anybody out,
and it can violate the RFC as-is, so I think this
should go in.  The patch removes the note in the docs
about current values of handler_orders, since that was
already out of date even before this patch.


----------------------------------------------------------------------

>Comment By: John J Lee (jjlee)
Date: 2006-05-29 17:39

Message:
Logged In: YES 
user_id=261020

Oops, the comment added to
test_basic_and_digest_auth_handlers (which is the second
mention of 1479302 in that test if the patch is applied)
should instead have referred to this tracker item (1497027).

I guess there's no need to upload a new patch just to fix that.


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1497027&group_id=5470

From noreply at sourceforge.net  Mon May 29 18:54:43 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 29 May 2006 09:54:43 -0700
Subject: [Patches] [ python-Patches-1497053 ] Let dicts propagate the
	exceptions in user __eq__
Message-ID: <E1Fkl0t-0001Dr-Ta@sc8-sf-web3.sourceforge.net>

Patches item #1497053, was opened at 2006-05-29 16:54
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1497053&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Armin Rigo (arigo)
Assigned to: Nobody/Anonymous (nobody)
Summary: Let dicts propagate the exceptions in user __eq__

Initial Comment:
Patch for bug #1275608.

Exceptions occurring when the dict lookup code calls a
user-defined __eq__() are silently eaten.  This has
caused me several hours-long debugging session so far,
so I thought that it was time to fix that.

The proposed patch takes an easy route: it doesn't try
to change PyDict_GetItem(), which still has no way to
report exceptions and just eats them.  Instead, it moves
the exception-eating logic into PyDict_GetItem().  So 
all other ways in which dicts are accessed now correctly
report exceptions.  Most importantly, this includes all
operators and methods accessible from Python code,
including the 'x=d[key]' syntax.

The only incompatibility I could imagine from this would
be from code that relies on the fact that dicts were
previouly tolerant about exceptions: an __eq__ could
fail in any way, and the lookup would consider it as a
"not equal" signal and proceed.  I'd say "fix that". 
However it means that the 2.4 patch attached here should
probably not be applied, sadly.  I'd vote to check in
the 2.5 patch as soon as possible.

Note that these patches sneak in another bugfix patch
too (#1456209) because I couldn't be bothered to
maintain two mutually-conflicting patches.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1497053&group_id=5470

From noreply at sourceforge.net  Mon May 29 19:47:29 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 29 May 2006 10:47:29 -0700
Subject: [Patches] [ python-Patches-1497053 ] Let dicts propagate the
	exceptions in user __eq__
Message-ID: <E1Fklpx-0008B5-5f@sc8-sf-web4-b.sourceforge.net>

Patches item #1497053, was opened at 2006-05-29 16:54
Message generated for change (Comment added) made by arigo
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1497053&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Armin Rigo (arigo)
Assigned to: Nobody/Anonymous (nobody)
Summary: Let dicts propagate the exceptions in user __eq__

Initial Comment:
Patch for bug #1275608.

Exceptions occurring when the dict lookup code calls a
user-defined __eq__() are silently eaten.  This has
caused me several hours-long debugging session so far,
so I thought that it was time to fix that.

The proposed patch takes an easy route: it doesn't try
to change PyDict_GetItem(), which still has no way to
report exceptions and just eats them.  Instead, it moves
the exception-eating logic into PyDict_GetItem().  So 
all other ways in which dicts are accessed now correctly
report exceptions.  Most importantly, this includes all
operators and methods accessible from Python code,
including the 'x=d[key]' syntax.

The only incompatibility I could imagine from this would
be from code that relies on the fact that dicts were
previouly tolerant about exceptions: an __eq__ could
fail in any way, and the lookup would consider it as a
"not equal" signal and proceed.  I'd say "fix that". 
However it means that the 2.4 patch attached here should
probably not be applied, sadly.  I'd vote to check in
the 2.5 patch as soon as possible.

Note that these patches sneak in another bugfix patch
too (#1456209) because I couldn't be bothered to
maintain two mutually-conflicting patches.

----------------------------------------------------------------------

>Comment By: Armin Rigo (arigo)
Date: 2006-05-29 17:47

Message:
Logged In: YES 
user_id=4771

Updated the patches; added more tests for 2.5.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1497053&group_id=5470

From noreply at sourceforge.net  Mon May 29 22:53:05 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 29 May 2006 13:53:05 -0700
Subject: [Patches] [ python-Patches-1497027 ] urllib2: ensure digest auth
	happens in preference to basic
Message-ID: <E1FkojZ-0007hU-CK@sc8-sf-web1.sourceforge.net>

Patches item #1497027, was opened at 2006-05-29 15:47
Message generated for change (Comment added) made by gbrandl
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1497027&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.5
>Status: Closed
>Resolution: Accepted
Priority: 5
Submitted By: John J Lee (jjlee)
Assigned to: Nobody/Anonymous (nobody)
Summary: urllib2: ensure digest auth happens in preference to basic

Initial Comment:
At the moment in urllib2, digest auth is only tried
before basic (as MUST happen according to RFC 2617
section 1.2) only if .add_handler() was called for
HTTPDigestAuthHandler before HTTPBasicAuthHandler. 
This patch ensures that digest is always tried first.

I guess it's unfortunate that sorting of handlers is
done by an attribute rather than by declaring and using
topological sort, since any change to a handler_order
may break code, but that can only be fixed by adding
that as a new feature (probably a new factory).  I
doubt this particular change will catch anybody out,
and it can violate the RFC as-is, so I think this
should go in.  The patch removes the note in the docs
about current values of handler_orders, since that was
already out of date even before this patch.


----------------------------------------------------------------------

>Comment By: Georg Brandl (gbrandl)
Date: 2006-05-29 20:53

Message:
Logged In: YES 
user_id=849994

Applied in rev. 46531.

----------------------------------------------------------------------

Comment By: John J Lee (jjlee)
Date: 2006-05-29 16:39

Message:
Logged In: YES 
user_id=261020

Oops, the comment added to
test_basic_and_digest_auth_handlers (which is the second
mention of 1479302 in that test if the patch is applied)
should instead have referred to this tracker item (1497027).

I guess there's no need to upload a new patch just to fix that.


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1497027&group_id=5470

From noreply at sourceforge.net  Mon May 29 22:53:19 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 29 May 2006 13:53:19 -0700
Subject: [Patches] [ python-Patches-972322 ] urllib2 handler naming
	convention collision
Message-ID: <E1Fkojn-0007nV-5O@sc8-sf-web1.sourceforge.net>

Patches item #972322, was opened at 2004-06-13 23:16
Message generated for change (Settings changed) made by gbrandl
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=972322&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.4
>Status: Closed
>Resolution: Accepted
Priority: 5
Submitted By: John J Lee (jjlee)
Assigned to: Jeremy Hylton (jhylton)
Summary: urllib2 handler naming convention collision

Initial Comment:
The method naming conventions of *_open and *_request
in urllib2 are accidentally met by the following methods:

AbstractHTTPHandler.do_open()
ProxyHandler.proxy_open()
AbstractHTTPHandler.redirect_request()

So URLs like do://example.com/ are regarded as having a
handler, and urllib2.urlopen("do://python.org/") causes
a TypeError.

I think *something* should be done about this, but I'm
willing to provide a different patch if this one is
frowned upon.  The alternative would be to rename
do_open and proxy_open, and leave the redirect_request
case unchanged (see below for why).

The first two methods are undocumented, so could in
theory be renamed.  However, people will likely be
overriding them anyway, so perhaps it's better to apply
this ugly patch than rename them.

redirect_request is documented, so can't be renamed,
but it will never be accidentally called unless
somebody actually adds a handler with a method named
"redirect_open".


----------------------------------------------------------------------

>Comment By: Georg Brandl (gbrandl)
Date: 2006-05-29 20:53

Message:
Logged In: YES 
user_id=849994

Applied in rev. 46531.

----------------------------------------------------------------------

Comment By: John J Lee (jjlee)
Date: 2006-05-27 23:58

Message:
Logged In: YES 
user_id=261020

OK, collision_v2.patch is less ugly, and supercedes all the
previous patches attached to this tracker item.

----------------------------------------------------------------------

Comment By: John J Lee (jjlee)
Date: 2006-05-07 22:27

Message:
Logged In: YES 
user_id=261020

OK, I see a slightly less ugly fix, don't apply this.  I
intend to upload a better one later.

----------------------------------------------------------------------

Comment By: John J Lee (jjlee)
Date: 2006-03-31 21:16

Message:
Logged In: YES 
user_id=261020

Here's an updated patch (collision.patch) that applies
against SVN HEAD.  I also made the test a little clearer. 
collision.patch supercedes both urllib2.py.patch and
test_urllib2.py.patch

----------------------------------------------------------------------

Comment By: John J Lee (jjlee)
Date: 2005-05-19 20:53

Message:
Logged In: YES 
user_id=261020

Since nobody seems to mind the slightly uglified code
required to fix these bugs in a backwards-compatible way,
could somebody please apply this patch?


----------------------------------------------------------------------

Comment By: Michael Chermside (mcherm)
Date: 2004-10-22 16:36

Message:
Logged In: YES 
user_id=99874

I have reviewed this patch and I recomend applying it.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=972322&group_id=5470

From noreply at sourceforge.net  Mon May 29 23:05:11 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 29 May 2006 14:05:11 -0700
Subject: [Patches] [ python-Patches-1491939 ] Fix for bug #1486663 mutable
	types check kwargs in tp_new
Message-ID: <E1FkovH-00048a-Or@sc8-sf-web1.sourceforge.net>

Patches item #1491939, was opened at 2006-05-20 01:17
Message generated for change (Settings changed) made by gbrandl
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1491939&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: ?iga Seilnacht (zseil)
>Assigned to: Raymond Hettinger (rhettinger)
Summary: Fix for bug #1486663 mutable types check kwargs in tp_new

Initial Comment:
set and deque check that they are not called with
keyword arguments in their tp_new method, although
they are mutable. This makes them harder to subclass.
See the bug report for more details.

Patch contains tests and fixes for both of them.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1491939&group_id=5470

From noreply at sourceforge.net  Mon May 29 23:07:53 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 29 May 2006 14:07:53 -0700
Subject: [Patches] [ python-Patches-1491939 ] Fix for bug #1486663 mutable
	types check kwargs in tp_new
Message-ID: <E1Fkoxt-0005E5-KP@sc8-sf-web1.sourceforge.net>

Patches item #1491939, was opened at 2006-05-20 01:17
Message generated for change (Comment added) made by gbrandl
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1491939&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: ?iga Seilnacht (zseil)
Assigned to: Raymond Hettinger (rhettinger)
Summary: Fix for bug #1486663 mutable types check kwargs in tp_new

Initial Comment:
set and deque check that they are not called with
keyword arguments in their tp_new method, although
they are mutable. This makes them harder to subclass.
See the bug report for more details.

Patch contains tests and fixes for both of them.

----------------------------------------------------------------------

>Comment By: Georg Brandl (gbrandl)
Date: 2006-05-29 21:07

Message:
Logged In: YES 
user_id=849994

Raymond, please check.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1491939&group_id=5470

From noreply at sourceforge.net  Mon May 29 23:56:41 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Mon, 29 May 2006 14:56:41 -0700
Subject: [Patches] [ python-Patches-1496957 ] deprecate METH_OLDARGS
Message-ID: <E1Fkpj7-0003Tb-6e@sc8-sf-web2.sourceforge.net>

Patches item #1496957, was opened at 2006-05-29 14:10
Message generated for change (Comment added) made by gbrandl
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1496957&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
>Group: Python 3000
>Status: Closed
>Resolution: Later
Priority: 5
Submitted By: Georg Brandl (gbrandl)
>Assigned to: Martin v. L??wis (loewis)
Summary: deprecate METH_OLDARGS

Initial Comment:
As discussed on python-dev.

Patch includes warning emmitting code in methodobject.c
as well as Doc/api changes.

----------------------------------------------------------------------

>Comment By: Georg Brandl (gbrandl)
Date: 2006-05-29 21:56

Message:
Logged In: YES 
user_id=849994

Closing as Later. Won't happen before Py3k.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1496957&group_id=5470

From noreply at sourceforge.net  Tue May 30 15:28:06 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Tue, 30 May 2006 06:28:06 -0700
Subject: [Patches] [ python-Patches-1446489 ] zipfile: support for ZIP64
Message-ID: <E1Fl4GT-0004SW-Nl@sc8-sf-web1.sourceforge.net>

Patches item #1446489, was opened at 2006-03-09 15:58
Message generated for change (Comment added) made by ronaldoussoren
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1446489&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Library (Lib)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Ronald Oussoren (ronaldoussoren)
Assigned to: Ronald Oussoren (ronaldoussoren)
Summary: zipfile: support for ZIP64

Initial Comment:
The attached patch implements support for ZIP64, that is zipfiles 
containing very large (>4GByte) files and zipfiles that are larger than
4GByte themselves. 

The output of this patch can be read by pkzip (see below for the actual 
version I used for testing).


----------------------------------------------------------------------

>Comment By: Ronald Oussoren (ronaldoussoren)
Date: 2006-05-30 15:28

Message:
Logged In: YES 
user_id=580910

I've added some more tests for pre-existing functionality. The unittests are still 
far from comprehensive, but at least touch upon most functionality of zipfile.

Does anyone feel like reviewing this? I'd like to get this into python2.5.

----------------------------------------------------------------------

Comment By: Ronald Oussoren (ronaldoussoren)
Date: 2006-05-26 10:26

Message:
Logged In: YES 
user_id=580910

I've attached yet another version, this version reintroduces some functionalitity 
that was unintentionally removed and fixes a lame bug that caused 
test_zipimport to fail.


----------------------------------------------------------------------

Comment By: Ronald Oussoren (ronaldoussoren)
Date: 2006-05-23 15:10

Message:
Logged In: YES 
user_id=580910

I've found some time to work on this. I've added zipfile-zip64-
version2.patch, this version:

* Makes zip64 behaviour optional (defaults to off because zip(1) doesn't 
support  zip64)

* Is significantly faster for large zipfiles because it doesn't scan the entire 
zipfile just to check that the file headers are consistent with the central 
directory w.r.t. filename (this check is now done when trying to read a file)

* Updates the reference documentation.

* Adds unittests. There are two sets of tests: one set tests the behaviour of 
zip64 extensions using small files by lowering the zip64 cutoff point and is 
run every time, the other set do tests with huge zipfiles and are run when the 
largefile feature is enabled when running the tests.

There one backward incompatible change: ZipInfo objects no longer have a 
file_offset attribute. That was the other reason for scanning the entire zipfile 
when opening it. IMNSHO this should have been a private attribute and the 
cost of this feature is not worth its *very* limited usefulness. As an indication 
of its cost: I got a 6x speedup when I removed the calculation of the 
file_offset attribute, something that adds up when you are dealing with huge 
zipfiles (I wrote this patch because I'm dealing with 10+GByte zipfiles with 
tens of thousands of files at work).

I noticed that zipfile raises RuntimeError in some places. I've changed one of 
those to zipfile.BadZipfile, but others remain. I don't like this, most of them 
should be replaced by TypeError or ValueError exceptions.

BTW. This patch also supports storing files >4GByte in the zipfile, but that 
feature isn't very useful because zipfile doesn't have an API for reading file 
data incrementally.

----------------------------------------------------------------------

Comment By: Ronald Oussoren (ronaldoussoren)
Date: 2006-05-16 09:55

Message:
Logged In: YES 
user_id=580910

I haven't had time to work on this, all time I had to work on python related stuff 
has been eaten by finishing PyObjC's port to intel macs and universal binary 
patches.

The former is now done, the latter almost so I'll have some time to work on this 
again especially because I'm using this patch at work and might be able to claim 
some time to work on this during work-hours.

----------------------------------------------------------------------

Comment By: Georg Brandl (gbrandl)
Date: 2006-05-16 09:41

Message:
Logged In: YES 
user_id=849994

Since 2.5 beta is coming close, have you made progress on
the tests/docs?

----------------------------------------------------------------------

Comment By: Ronald Oussoren (ronaldoussoren)
Date: 2006-04-02 21:13

Message:
Logged In: YES 
user_id=580910

The "don't use the ZIP64 extension" flag is a good idea, zipfiles that use this 
extension aren't readable by the infozip tools (zip and unzip on most unix 
systems).

I'll add tests and documentation in the near future.

The version of zipfile that I'm currently using also contains a patch for 
speeding up the opening of zipfiles, for the type of files I'm dealing with 
(about 11GByte large with tens of thousands of files) the speedup is very 
significant. I suppose it's better to file that as a separate patch after this has 
been approved.

----------------------------------------------------------------------

Comment By: Anthony Baxter (anthonybaxter)
Date: 2006-04-02 07:02

Message:
Logged In: YES 
user_id=29957

I'd like to see a testcase and possibly a note for the
documentation about the new semantics. Also, should it be
possible to say "don't use the ZIP64 extension, instead
raise an Error" for people who don't want to generate these?
 

----------------------------------------------------------------------

Comment By: Ronald Oussoren (ronaldoussoren)
Date: 2006-03-09 16:28

Message:
Logged In: YES 
user_id=580910

Oops, I've uploaded the wrong file. zipfile-zip64.patch is the correct one.

I've tested the correctness of created archives using this version of pkzip:

pkzipc -version
PKZIP(R) Server  Version 8  ZIP Compression Utility for Linux X86
Copyright (C) 1989-2005 PKWARE, Inc.  All Rights Reserved. Evaluation 
Version
PKZIP Reg. U.S. Pat. and Tm. Off.  Patent No. 5,051,745
Patent Pending

Version 8.40.66


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1446489&group_id=5470

From noreply at sourceforge.net  Tue May 30 18:20:58 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Tue, 30 May 2006 09:20:58 -0700
Subject: [Patches] [ python-Patches-1490384 ] PC new-logo-based icon set
Message-ID: <E1Fl6xm-0004Aa-TX@sc8-sf-web1.sourceforge.net>

Patches item #1490384, was opened at 2006-05-17 16:59
Message generated for change (Comment added) made by bobince
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1490384&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Windows
Group: Python 2.5
Status: Closed
Resolution: Accepted
Priority: 8
Submitted By: Andrew Clover (bobince)
Assigned to: Martin v. L??wis (loewis)
Summary: PC new-logo-based icon set

Initial Comment:
Following positive discussion on -dev, here's the
updated version of the PC/py*.ico files I hacked up a
while ago.

The attachment is a ZIP, not a patch, as it contains
only binaries. Also available as tgz:

  http://doxdesk.com/img/software/py/win32-icons.tar.gz

Also possibly of interest:

  http://doxdesk.com/img/software/py/icons3.zip

This attachment contains only the simple replacement
files; the icons3 ZIP also contains:

  - source
  - versions including Windows Vista large icons
    (probably not worth including at this point as they're
    quite sizable and no-one is using Vista yet)
  - an egg icon
    (there is currently no installer/shell support for
eggs,
    but could be worth adding in future)
  - a new installer side banner
    (this has not currently seen any discussion on -dev,
    but may be worth considering if the intention is to
    leave behind the purple/green snake branding)


----------------------------------------------------------------------

>Comment By: Andrew Clover (bobince)
Date: 2006-05-30 16:20

Message:
Logged In: YES 
user_id=311085

No probs and ta!

However...

I've got more altered icons, attached and at
http://doxdesk.com/img/software/py/win32-icons2.zip . Sorry
for the inconvenience - pretty sure these are 'final'.

The problem with the old files? Well it seems there's a bug
in Windows that can cause redraw errors on 32-bit
alpha-blended XP icons. I can't find any doc on this at all,
but from experiment it seems that it can occur when icons
are partially redrawn instead of drawn all at once. This is
most easily provoked by slowly dragging a window on top of
an Explorer window in Tiles/Icons/List mode, to reveal the
icon underneath.

It looks to be an arithmetic overflow in compositing: when a
nearly-transparent white pixel being plotted onto pure white
background in this partial-redraw code, a black pixel can
unexpectedly result, with poor-looking results. (Thanks,
Windows.)

I've hacked the bitmaps to avoid places where they're
white-and-transparent enough to be able to provoke this
aggravating occasional behaviour. Also while I'm at it, I've
removed the 256 colour 48x48 icons, since it saves a few K
and there's almost no practical case where they're of benefit.

cheers,


----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2006-05-28 16:58

Message:
Logged In: YES 
user_id=21627

Committed the rest as r46503. Thanks again!

----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2006-05-26 17:02

Message:
Logged In: YES 
user_id=21627

Ok, I will then do the following changes still:
- add baselogo.svg and source.xar (ignore all the other files),
- remove the attribution for Erik (sorry for missing that)

----------------------------------------------------------------------

Comment By: Andrew Clover (bobince)
Date: 2006-05-26 02:50

Message:
Logged In: YES 
user_id=311085

> I put a demo installer containing them

Seems to work OK. The thanks at the end still attributes the
graphic to Erik though; I'm not after an ack there myself,
but changing the text to not imply the current graphic is
his one may be appropriate.

> baselogo.svg; I assume this is a source file

Yes. This is just the Python logo itself (the gradient
version as used on the new website), in vector format.

> icons.svgz; can't figure out what this is

Same as source.xar, but exported as W3C standard SVG format
for wider compatibility [compressed, hence the 'z'].

Unfortunately because SVG cannot reproduce some of effects
used, and because the SVG export path is currently quite
bad, it's not really directly usable, but it might be of use
to anyone who wants to hack on the graphics but doesn't use
Xara.

> source.xar; not sure either

This is the primary vector graphics source of the icons -
the other SVG and PNG files are just there because other
people requested them.

It's in Xara format, a previously proprietary graphics
application which has now gone open-source and is heading
rapidly towards being usable on Linux, but isn't quite there
yet.

> a directory called png, with many png file - I expect
> that these aren't source files, are they?

Nope, they're just exactly the same content as in the
(with-vista) .ico files, just supplied as PNG for anyone who
wants to fiddle with them in a more accessible bitmap format.


----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2006-05-22 08:56

Message:
Logged In: YES 
user_id=21627

Thanks for the patch. I have committed it as r46063. I put a
demo installer containing them at

http://www.dcl.hpi.uni-potsdam.de/home/loewis/python-2.5.13290.msi

I would also like to add the source files, but I have
difficulties figuring out what they are. There is a source
directory; with:

- baselogo.svg; I assume this is a source file
- icons.svgz; can't figure out what this is
- source.xar; not sure either
- a directory called png, with many png file - I expect
  that these aren't source files, are they?

----------------------------------------------------------------------

Comment By: Andrew Clover (bobince)
Date: 2006-05-19 14:21

Message:
Logged In: YES 
user_id=311085

Sure, no worries. I'll fax over the -python version since I
have ancient contributions to cover too.


----------------------------------------------------------------------

Comment By: Martin v. L??wis (loewis)
Date: 2006-05-19 11:22

Message:
Logged In: YES 
user_id=21627

Thanks! Are you willing to contribute them to the PSF, under
the terms of the contributor agreement at

http://www.python.org/psf/contrib/contrib-form/

?

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1490384&group_id=5470

From noreply at sourceforge.net  Wed May 31 19:11:28 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 31 May 2006 10:11:28 -0700
Subject: [Patches] [ python-Patches-1462361 ] Possible fix to #1334662
	(int() wrong answers)
Message-ID: <E1FlUEC-0006xx-LD@sc8-sf-web4-b.sourceforge.net>

Patches item #1462361, was opened at 2006-03-31 19:23
Message generated for change (Comment added) made by gbrandl
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1462361&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: None
>Status: Closed
>Resolution: Fixed
Priority: 5
Submitted By: Ivan Vilata i Balaguer (ivilata)
Assigned to: Nobody/Anonymous (nobody)
Summary: Possible fix to #1334662 (int() wrong answers)

Initial Comment:
This is the patch I talked about in #1334662.  I think
it fixes int() returning zero for non-zero literals
under some bases.

----------------------------------------------------------------------

>Comment By: Georg Brandl (gbrandl)
Date: 2006-05-31 17:11

Message:
Logged In: YES 
user_id=849994

Bug #1334662 has now been fixed.

----------------------------------------------------------------------

Comment By: Ralf Schmitt (titty)
Date: 2006-03-31 20:55

Message:
Logged In: YES 
user_id=17929

After applying the patch to mystrtoul all tests work.
tests were run on ubuntu dapper amd64


----------------------------------------------------------------------

Comment By: Ralf Schmitt (titty)
Date: 2006-03-31 20:51

Message:
Logged In: YES 
user_id=17929

with patch to tests applied I get:

===============================================
=======================
FAIL: test_int (__main__.BuiltinTest)
-----------------------------------------------------------------
-----
Traceback (most recent call last):
  File "Lib/test/test_builtin.py", line 685, in test_int
    self.assertEqual(int('c9c336o0mlb7eg', 25), max_uint64)
AssertionError: 0 != 18446744073709551616L

-----------------------------------------------------------------
-----
Ran 60 tests in 0.118s

FAILED (failures=1)
Traceback (most recent call last):
  File "Lib/test/test_builtin.py", line 103, in <module>
    class BuiltinTest(unittest.TestCase):
  File "Lib/test/test_builtin.py", line 1587, in test_main
    run_unittest(*test_classes)
  File "/home/ralf/python-trunk/Lib/test/test_support.py", line 300, in 
run_unittest
    run_suite(suite, testclass)
  File "/home/ralf/python-trunk/Lib/test/test_support.py", line 285, in 
run_suite
    raise TestFailed(err)
test.test_support.TestFailed: Traceback (most recent call last):
  File "Lib/test/test_builtin.py", line 685, in test_int
    self.assertEqual(int('c9c336o0mlb7eg', 25), max_uint64)
AssertionError: 0 != 18446744073709551616L


----------------------------------------------------------------------

Comment By: Ivan Vilata i Balaguer (ivilata)
Date: 2006-03-31 20:12

Message:
Logged In: YES 
user_id=1064183

This is a little modification to ``test_builtin.py`` to
check for the bug.  It would be nice for someone to run it
under a 64-bit platform!

----------------------------------------------------------------------

Comment By: Ivan Vilata i Balaguer (ivilata)
Date: 2006-03-31 19:29

Message:
Logged In: YES 
user_id=1064183

I *love* this web interface...  Here you have the patch.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1462361&group_id=5470

From noreply at sourceforge.net  Wed May 31 19:31:43 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 31 May 2006 10:31:43 -0700
Subject: [Patches] [ python-Patches-1498363 ] Improve super() objects
	support for implicit method calls
Message-ID: <E1FlUXn-0004cf-KK@sc8-sf-web3.sourceforge.net>

Patches item #1498363, was opened at 2006-05-31 13:31
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1498363&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Collin Winter (collinwinter)
Assigned to: Nobody/Anonymous (nobody)
Summary: Improve super() objects support for implicit method calls

Initial Comment:
The attached patch lets super() objects pass on
implicit __getitem__, __setitem__, __delitem__, __len__
and __hash__ calls. For example, to use len() with
super() objects, one must currently do something like

super(X, X()).__len__()

Likewise for __getitem__,

super(X, X()).__getitem__(item)

That's ugly.

This patch lets these be spelled as

len(super(X, X())) and super(X, X())[item], respectively.

The patch also includes documentation updates and tests
for the new functionality.

The patch was taken against r46582.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1498363&group_id=5470

From noreply at sourceforge.net  Wed May 31 19:41:10 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 31 May 2006 10:41:10 -0700
Subject: [Patches] [ python-Patches-1498370 ] Improve itertools.starmap
Message-ID: <E1FlUgw-0004Q5-4Y@sc8-sf-web5.sourceforge.net>

Patches item #1498370, was opened at 2006-05-31 13:41
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1498370&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Modules
Group: Python 2.5
Status: Open
Resolution: None
Priority: 5
Submitted By: Collin Winter (collinwinter)
Assigned to: Nobody/Anonymous (nobody)
Summary: Improve itertools.starmap

Initial Comment:
As it currently stands, the iterator argument to
itertools.starmap() must yield tuples, even those any
iterable can be *-expanded in function calls. The
attached patch changes starmap()'s behaviour (as well
as docs and tests) to allow the provided iterator to
return any iterable object.

The patch is against r46582.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1498370&group_id=5470

From noreply at sourceforge.net  Wed May 31 21:30:35 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 31 May 2006 12:30:35 -0700
Subject: [Patches] [ python-Patches-1498441 ] Change *args from a tuple to
	list
Message-ID: <E1FlWOp-0006BH-2w@sc8-sf-web1.sourceforge.net>

Patches item #1498441, was opened at 2006-05-31 15:30
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1498441&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 3000
Status: Open
Resolution: None
Priority: 5
Submitted By: Collin Winter (collinwinter)
Assigned to: Nobody/Anonymous (nobody)
Summary: Change *args from a tuple to list

Initial Comment:
As discussed on python-3000, this patch changes *args
from a tuple to a list. It also includes doc and test
changes.

The patch is against r46582.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1498441&group_id=5470

From noreply at sourceforge.net  Wed May 31 21:33:32 2006
From: noreply at sourceforge.net (SourceForge.net)
Date: Wed, 31 May 2006 12:33:32 -0700
Subject: [Patches] [ python-Patches-1498441 ] Change *args from a tuple to
	list
Message-ID: <E1FlWRg-0007ED-J0@sc8-sf-web1.sourceforge.net>

Patches item #1498441, was opened at 2006-05-31 15:30
Message generated for change (Settings changed) made by collinwinter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1498441&group_id=5470

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Core (C code)
Group: Python 3000
Status: Open
Resolution: None
Priority: 5
Submitted By: Collin Winter (collinwinter)
>Assigned to: Guido van Rossum (gvanrossum)
Summary: Change *args from a tuple to list

Initial Comment:
As discussed on python-3000, this patch changes *args
from a tuple to a list. It also includes doc and test
changes.

The patch is against r46582.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1498441&group_id=5470