; <upper> ; (<condition_list> ;)? # <comment>
# ...

# I and i-dotless; I-dot and i are case pairs in Turkish and Azeri
# The following rules handle those cases.

0130; 0069; 0130; 0130; tr; # LATIN CAPITAL LETTER I WITH DOT ABOVE
0130; 0069; 0130; 0130; az; # LATIN CAPITAL LETTER I WITH DOT ABOVE

# When lowercasing, remove dot_above in the sequence I + dot_above, which will turn into i.
# This matches the behavior of the canonically equivalent I-dot_above

0307; ; 0307; 0307; tr After_I; # COMBINING DOT ABOVE
0307; ; 0307; 0307; az After_I; # COMBINING DOT ABOVE

# When lowercasing, unless an I is before a dot_above, it turns into a dotless i.

0049; 0131; 0049; 0049; tr Not_Before_Dot; # LATIN CAPITAL LETTER I
0049; 0131; 0049; 0049; az Not_Before_Dot; # LATIN CAPITAL LETTER I

# When uppercasing, i turns into a dotted capital I

0069; 0069; 0130; 0130; tr; # LATIN SMALL LETTER I
0069; 0069; 0130; 0130; az; # LATIN SMALL LETTER I

# Note: the following case is already in the UnicodeData file.

# 0131; 0131; 0049; 0049; tr; # LATIN SMALL LETTER DOTLESS I
"""

Note how the context of the usage of the code points matters
when doing case-conversions.

To make things even more complicated, there are so called
language tags which can be embedded into the Unicode string,
so the language can also change within a Unicode string.

http://www.unicode.org/reports/tr7/

To get a feeling of what it takes to do locale aware handling
of Unicode right, have a look at the Locale Data Markup Language
(LDML):

http://www.unicode.org/reports/tr35/

(hey, perhaps Google could contribute support for this to Python ;-)

-- 
Marc-Andre Lemburg
eGenix.com

Professional Python Services directly from the Source (#1, Jul 18 2006)
>>> Python/Zope Consulting and Support ... http://www.egenix.com/
>>> mxODBC.Zope.Database.Adapter ... http://zope.egenix.com/
>>> mxODBC, mxDateTime, mxTextTools ...
http://python.egenix.com/
________________________________________________________________________

::: Try mxODBC.Zope.DA for Windows,Linux,Solaris,FreeBSD for free ! ::::

From martin at v.loewis.de Tue Jul 18 23:15:17 2006
From: martin at v.loewis.de ("Martin v. Löwis")
Date: Tue, 18 Jul 2006 23:15:17 +0200
Subject: [Python-Dev] logging module broken because of locale
In-Reply-To: <44BD4CBA.7040605@egenix.com>
References: <20060717193955.GK3699@abulafia.devel.redhat.com>
    <20060718124019.GB16294@abulafia.devel.redhat.com>
    <44BD2054.4020705@v.loewis.de>
    <9AC7F55A-4106-414F-9C20-64FFFD8C6131@fuhm.net>
    <44BD2A98.7000705@egenix.com> <44BD3757.7030101@v.loewis.de>
    <44BD4CBA.7040605@egenix.com>
Message-ID: <44BD4F65.9040307@v.loewis.de>

M.-A. Lemburg wrote:
> Right. In fact, some case mappings are not available in the Unicode
> database, since that only contains mappings which don't increase or
> decrease the length of the Unicode string. A typical example is the
> German u'ß'. u'ß'.upper() would have to give u'SS', but instead
> returns u'ß'.

Actually, that is in the Unicode database (SpecialCasing.txt):

00DF; 00DF; 0053 0073; 0053 0053; # LATIN SMALL LETTER SHARP S

> However, the point I wanted to make was that these mappings don't depend
> on the locale setting of the C lib - you have to explicitly
> access the mapping in the context of a locale and/or text.

I don't get that point. SpecialCasing.txt is clearly intended to take
locale context into account. Whether this is the "C locale", or some
other locale mechanism, is out of scope of the Unicode specification.
It could be the C locale (and indeed, the C locale implementations
often take the Unicode casing procedure into account these days).
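
As a quick way to check these mappings on a current interpreter, here is a minimal sketch. It assumes Python 3.3 or later (not the 2.4-era builds discussed in this thread), where the str methods apply the unconditional full case mappings and ignore the C locale; the variable names are only illustrative.

```python
# Assumption: Python 3.3+; str case methods do not consult the C locale.
sharp_s = "\u00df"                 # LATIN SMALL LETTER SHARP S
upper = sharp_s.upper()            # upper mapping from SpecialCasing.txt: 0053 0053
title = sharp_s.title()            # title mapping from SpecialCasing.txt: 0053 0073

capital_i = "\u0049"               # LATIN CAPITAL LETTER I
lower_i = capital_i.lower()        # the "tr Not_Before_Dot" rule is not applied

dotted_capital = "\u0130"          # LATIN CAPITAL LETTER I WITH DOT ABOVE
lower_dotted = dotted_capital.lower()  # unconditional mapping: i + COMBINING DOT ABOVE

print(upper, title, lower_i, [hex(ord(c)) for c in lower_dotted])
```

The language-conditional tr/az rules are not used by these methods, so lowercasing a capital I yields a plain i whatever the locale settings are; Turkish-aware casing needs a separate, explicitly locale-aware mechanism.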
Regards, Martin From rhettinger at ewtllc.com Wed Jul 19 00:39:20 2006 From: rhettinger at ewtllc.com (Raymond Hettinger) Date: Tue, 18 Jul 2006 15:39:20 -0700 Subject: [Python-Dev] Strategy for converting the decimal module to C In-Reply-To: <20060718210017.GA7850@panix.com> References: <44BD4690.3070402@ewtllc.com> <20060718210017.GA7850@panix.com> Message-ID: <44BD6318.6010806@ewtllc.com> An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/python-dev/attachments/20060718/b66f8c5d/attachment.htm From tim.peters at gmail.com Wed Jul 19 01:01:37 2006 From: tim.peters at gmail.com (Tim Peters) Date: Tue, 18 Jul 2006 19:01:37 -0400 Subject: [Python-Dev] Strategy for converting the decimal module to C In-Reply-To: <44BD6318.6010806@ewtllc.com> References: <44BD4690.3070402@ewtllc.com> <20060718210017.GA7850@panix.com> <44BD6318.6010806@ewtllc.com> Message-ID: <1f7befae0607181601g65bc3e1ax5f598c82b9de71fb@mail.gmail.com> [Raymond Hettinger] > ... > If the current approach gets in their way, the C implementers should feel free to > make an alternate design choice. I expect they will, eventually. Converting this to C is a big job, and at the NFS sprint we settled on an "incremental" strategy allowing most of the module to remain written in Python, converting methods to C one at a time. Changing the user-visible API is a hard egg to swallow, and it's unfortunate that the Python code used a dict to hold "flags" to begin with. The dict doesn't just record whether an exception has occurred, it also counts how many times the exception occurred. It's possible that someone, somewhere, has latched on to that as "a feature". 
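
To make the compatibility worry concrete, here is a minimal sketch of the counting behaviour being described. `ToyContext`, `raise_signal` and the two signal classes are hypothetical stand-ins, not the decimal module's actual code; only the idea that flags[exception] is incremented on each occurrence comes from the thread.

```python
class Inexact(ArithmeticError):
    """Hypothetical stand-in for a decimal signal class."""


class Rounded(ArithmeticError):
    """Hypothetical stand-in for a second signal class."""


class ToyContext:
    """Toy model of a context whose flags dict counts occurrences."""

    def __init__(self):
        # One integer counter per signal, all starting at zero.
        self.flags = {Inexact: 0, Rounded: 0}

    def raise_signal(self, signal):
        # Counter semantics: increment on every occurrence.
        self.flags[signal] += 1


ctx = ToyContext()
for _ in range(3):
    ctx.raise_signal(Inexact)

# What user code can observe with counters ...
as_counts = dict(ctx.flags)
# ... and what survives if the flags are stored as plain on/off bits.
as_bits = {sig: bool(n) for sig, n in ctx.flags.items()}

print(as_counts[Inexact], as_bits[Inexact], as_bits[Rounded])
```

Any caller that inspects how many times a flag was raised depends on the counter form; a bitfield can only distinguish "never" from "at least once".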
From dalcinl at gmail.com Wed Jul 19 01:15:18 2006 From: dalcinl at gmail.com (Lisandro Dalcin) Date: Tue, 18 Jul 2006 20:15:18 -0300 Subject: [Python-Dev] Strategy for converting the decimal module to C In-Reply-To: <1f7befae0607181601g65bc3e1ax5f598c82b9de71fb@mail.gmail.com> References: <44BD4690.3070402@ewtllc.com> <20060718210017.GA7850@panix.com> <44BD6318.6010806@ewtllc.com> <1f7befae0607181601g65bc3e1ax5f598c82b9de71fb@mail.gmail.com> Message-ID: <e7ba66e40607181615v72ac1b7eud8d5e3e094cc153a@mail.gmail.com> On 7/18/06, Tim Peters <tim.peters at gmail.com> wrote: > [Raymond Hettinger] > > ... > > If the current approach gets in their way, the C implementers should feel free to > > make an alternate design choice. > > I expect they will, eventually. Converting this to C is a big job, > and at the NFS sprint we settled on an "incremental" strategy allowing > most of the module to remain written in Python, converting methods to > C one at a time. Changing the user-visible API is a hard egg to > swallow, and it's unfortunate that the Python code used a dict to hold > "flags" to begin with. The dict doesn't just record whether an > exception has occurred, it also counts how many times the exception > occurred. It's possible that someone, somewhere, has latched on to > that as "a feature". Why not a 'cDecimal' module instead? 
-- 
Lisandro Dalcín
---------------
Centro Internacional de Métodos Computacionales en Ingeniería (CIMEC)
Instituto de Desarrollo Tecnológico para la Industria Química (INTEC)
Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET)
PTLC - Güemes 3450, (3000) Santa Fe, Argentina
Tel/Fax: +54-(0)342-451.1594

From rhettinger at ewtllc.com Wed Jul 19 02:06:06 2006
From: rhettinger at ewtllc.com (Raymond Hettinger)
Date: Tue, 18 Jul 2006 17:06:06 -0700
Subject: [Python-Dev] Strategy for converting the decimal module to C
In-Reply-To: <e7ba66e40607181615v72ac1b7eud8d5e3e094cc153a@mail.gmail.com>
References: <44BD4690.3070402@ewtllc.com> <20060718210017.GA7850@panix.com>
    <44BD6318.6010806@ewtllc.com>
    <1f7befae0607181601g65bc3e1ax5f598c82b9de71fb@mail.gmail.com>
    <e7ba66e40607181615v72ac1b7eud8d5e3e094cc153a@mail.gmail.com>
Message-ID: <44BD776E.1040404@ewtllc.com>

An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/python-dev/attachments/20060718/5d4775ac/attachment.html

From rasky at develer.com Wed Jul 19 02:34:28 2006
From: rasky at develer.com (Giovanni Bajo)
Date: Wed, 19 Jul 2006 02:34:28 +0200
Subject: [Python-Dev] Strategy for converting the decimal module to C
References: <44BD4690.3070402@ewtllc.com> <20060718210017.GA7850@panix.com>
    <44BD6318.6010806@ewtllc.com>
    <1f7befae0607181601g65bc3e1ax5f598c82b9de71fb@mail.gmail.com>
Message-ID: <089a01c6aacb$184cd7d0$d503030a@trilan>

Tim Peters wrote:

> Changing the user-visible API is a hard egg to
> swallow, and it's unfortunate that the Python code used a dict to hold
> "flags" to begin with. The dict doesn't just record whether an
> exception has occurred, it also counts how many times the exception
> occurred. It's possible that someone, somewhere, has latched on to
> that as "a feature".
Especially since it was a documented one:

>>> import decimal
>>> help(decimal.Context)
Help on class Context in module decimal:

class Context(__builtin__.object)
 |  Contains the context for a Decimal instance.
[...]
 |  flags - When an exception is caused, flags[exception] is incremented.
 |          (Whether or not the trap_enabler is set)
 |          Should be reset by user of Decimal instance.
[...]

-- 
Giovanni Bajo

From greg.ewing at canterbury.ac.nz Wed Jul 19 03:17:20 2006
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Wed, 19 Jul 2006 13:17:20 +1200
Subject: [Python-Dev] logging module broken because of locale
In-Reply-To: <9AC7F55A-4106-414F-9C20-64FFFD8C6131@fuhm.net>
References: <20060717193955.GK3699@abulafia.devel.redhat.com>
    <20060718124019.GB16294@abulafia.devel.redhat.com>
    <44BD2054.4020705@v.loewis.de>
    <9AC7F55A-4106-414F-9C20-64FFFD8C6131@fuhm.net>
Message-ID: <44BD8820.9080602@canterbury.ac.nz>

James Y Knight wrote:

> That seems backwards of how it should be ideally: the byte-string
> upper and lower should always do ascii uppering-and-lowering, and the
> unicode ones should do it according to locale. Perhaps that can be
> cleaned up in py3k?

I would expect bytes objects not to have upper() and lower()
methods at all in Py3k.

--
Greg

From g.brandl at gmx.net Wed Jul 19 09:46:59 2006
From: g.brandl at gmx.net (Georg Brandl)
Date: Wed, 19 Jul 2006 09:46:59 +0200
Subject: [Python-Dev] Strategy for converting the decimal module to C
In-Reply-To: <44BD6318.6010806@ewtllc.com>
References: <44BD4690.3070402@ewtllc.com> <20060718210017.GA7850@panix.com>
    <44BD6318.6010806@ewtllc.com>
Message-ID: <e9ko1k$gv1$1@sea.gmane.org>

Raymond Hettinger wrote:
> Aahz wrote:
>> On Tue, Jul 18, 2006, Raymond Hettinger wrote:
>>
>>> P.S. The dictionary approach to context objects should likely be
>>> abandoned for the C version. If the API has to change a bit, then so
>>> be it.
>>>
>>
>> Why do you say that?
The rest I agree with; seems to me that making a >> small wrapper for dict access works well, too. >> > I think it was tripping-up the folks working on the C implementation. > Georg can speak to it more directly. IIRC, the issue was that the > context object exposed a dictionary which a user could update directly > and there was no notification back to the surrounding object so it could > update an underlying bitfield representation. Yes, that's exactly what the problem was. Working with bitfields internally is fine as long as Python code doesn't want to change the dicts exposed as a wrapper. > If the current approach > gets in their way, the C implementers should feel free to make an > alternate design choice. +1. (cDecimal is an ugly name, but a sound concept) I don't know what progress Mateusz' work has made until now, but he wrote at one time that he would "start optimizing as soon as he's certain that it works". Georg From ncoghlan at gmail.com Wed Jul 19 12:31:16 2006 From: ncoghlan at gmail.com (Nick Coghlan) Date: Wed, 19 Jul 2006 20:31:16 +1000 Subject: [Python-Dev] Strategy for converting the decimal module to C In-Reply-To: <e9ko1k$gv1$1@sea.gmane.org> References: <44BD4690.3070402@ewtllc.com> <20060718210017.GA7850@panix.com> <44BD6318.6010806@ewtllc.com> <e9ko1k$gv1$1@sea.gmane.org> Message-ID: <44BE09F4.6000009@gmail.com> Georg Brandl wrote: > Raymond Hettinger wrote: >> If the current approach >> gets in their way, the C implementers should feel free to make an >> alternate design choice. > > +1. (cDecimal is an ugly name, but a sound concept) > > I don't know what progress Mateusz' work has made until now, but he wrote > at one time that he would "start optimizing as soon as he's certain that > it works". dmath (decimal-math, modelled on cmath for complex-math) could work. Cheers, Nick. 
-- 
Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia
---------------------------------------------------------------
http://www.boredomandlaziness.org

From mattjfleming at googlemail.com Wed Jul 19 12:45:05 2006
From: mattjfleming at googlemail.com (Matt Fleming)
Date: Wed, 19 Jul 2006 11:45:05 +0100
Subject: [Python-Dev] [Python-checkins] r50708 - in python/trunk: Lib/test/test_sys.py Misc/NEWS Python/pystate.c
In-Reply-To: <5ff4a1e50607190334x62ae8f56r64dd71bd1c6de261@mail.gmail.com>
References: <20060719000321.36AF31E401E@bag.python.org>
    <20060719010840.GD2540@performancedrivers.com>
    <ee2a432c0607181936l6174016l6cd381678b722abe@mail.gmail.com>
    <e9kqrq$oto$1@sea.gmane.org>
    <5ff4a1e50607190334x62ae8f56r64dd71bd1c6de261@mail.gmail.com>
Message-ID: <5ff4a1e50607190345w21f32050w55cec402f9845972@mail.gmail.com>

On 19/07/06, Thomas Heller <theller at python.net> wrote:
> Neal Norwitz wrote:
> > On 7/18/06, Jack Diederich <jackdied at jackdied.com> wrote:
> >>
> >> were pre-2003 and talking about mod_python. HURD and FreeBSD came up a
> >> couple times. Do we need to add more *BSD buildbots?
> >
> > Yes. We only have OpenBSD now. It would be nice to have {Free,Net}BSD too..
>
> Maybe some of the buildbots should (in addition to the normal build?)
> configure Python with --without-threads?
>

I have an AMD64 NetBSD machine that isn't doing much at the moment, I
can regularly run tests (I submitted a patch not long back to make
regrtest netbsd-3 aware). However, I can't turn it into a buildbot,
sorry.
Matt -- http://mattssanctuary.blogspot.com From rhettinger at ewtllc.com Wed Jul 19 21:24:18 2006 From: rhettinger at ewtllc.com (Raymond Hettinger) Date: Wed, 19 Jul 2006 12:24:18 -0700 Subject: [Python-Dev] Strategy for converting the decimal module to C In-Reply-To: <e9ko1k$gv1$1@sea.gmane.org> References: <44BD4690.3070402@ewtllc.com> <20060718210017.GA7850@panix.com> <44BD6318.6010806@ewtllc.com> <e9ko1k$gv1$1@sea.gmane.org> Message-ID: <44BE86E2.6070802@ewtllc.com> An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/python-dev/attachments/20060719/dc62ed02/attachment.html From g.brandl at gmx.net Wed Jul 19 21:42:26 2006 From: g.brandl at gmx.net (Georg Brandl) Date: Wed, 19 Jul 2006 21:42:26 +0200 Subject: [Python-Dev] Strategy for converting the decimal module to C In-Reply-To: <44BE86E2.6070802@ewtllc.com> References: <44BD4690.3070402@ewtllc.com> <20060718210017.GA7850@panix.com> <44BD6318.6010806@ewtllc.com> <e9ko1k$gv1$1@sea.gmane.org> <44BE86E2.6070802@ewtllc.com> Message-ID: <e9m1va$5pv$1@sea.gmane.org> Raymond Hettinger wrote: > >>> I think it was tripping-up the folks working on the C implementation. >>> Georg can speak to it more directly. IIRC, the issue was that the >>> context object exposed a dictionary which a user could update directly >>> and there was no notification back to the surrounding object so it could >>> update an underlying bitfield representation. >>> >> >> Yes, that's exactly what the problem was. Working with bitfields internally >> is fine as long as Python code doesn't want to change the dicts exposed >> as a wrapper. >> > > If you want to stick with dictionary-like access to traps and flags, > then use a dict subclass that overrides each of the mutating methods. That may be easiest to do in Python code. I'll leave it to Mateusz to decide. > Even then, we need to drop the concept of having the flags as counters > rather than booleans. Yes. 
Given that even Tim couldn't imagine a use case for counting the exceptions, I think it's sensible. Georg From nmm1 at cus.cam.ac.uk Wed Jul 19 22:45:00 2006 From: nmm1 at cus.cam.ac.uk (Nick Maclaren) Date: Wed, 19 Jul 2006 21:45:00 +0100 Subject: [Python-Dev] Strategy for converting the decimal module to C Message-ID: <E1G3Iui-0006Zk-CQ@draco.cus.cam.ac.uk> Georg Brandl <g.brandl at gmx.net> wrote: > > > Even then, we need to drop the concept of having the flags as counters > > rather than booleans. > > Yes. Given that even Tim couldn't imagine a use case for counting the > exceptions, I think it's sensible. Well, I can. There is a traditional, important use - tuning. When such arithmetic is implemented in hardware, it is normal for exceptional cases to be handled by interrupt, and that is VERY expensive - often 100-1,000 times the cost of a single operation, occasionally 10,000 times. It then becomes important to know how many of the things you got, to know whether it is worth putting code in to avoid them or even using a different algorithm. Now, it is perfectly correct to say that this does not apply to emulated arithmetic and that there is no justification for such ghastly implementations. But, regrettably, almost all exception handling on modern systems IS ghastly - at least by the standards of the 1960s. Whether you regard the use of Python for tuning code that is to be run using hardware, where the arithmetic will be performance- critical as important, is a matter of taste. I don't :-) Regards, Nick Maclaren, University of Cambridge Computing Service, New Museums Site, Pembroke Street, Cambridge CB2 3QH, England. 
Email: nmm1 at cam.ac.uk
Tel.: +44 1223 334761 Fax: +44 1223 334679

From misa at redhat.com Wed Jul 19 23:29:19 2006
From: misa at redhat.com (Mihai Ibanescu)
Date: Wed, 19 Jul 2006 17:29:19 -0400
Subject: [Python-Dev] logging module broken because of locale
In-Reply-To: <200607181455.56535.fdrake@acm.org>
References: <20060717193955.GK3699@abulafia.devel.redhat.com>
    <44BD2054.4020705@v.loewis.de>
    <20060718185238.GB363@abulafia.devel.redhat.com>
    <200607181455.56535.fdrake@acm.org>
Message-ID: <20060719212919.GD15030@abulafia.devel.redhat.com>

On Tue, Jul 18, 2006 at 02:55:56PM -0400, Fred L. Drake, Jr. wrote:
> On Tuesday 18 July 2006 14:52, Mihai Ibanescu wrote:
> > Unicode might be a perfectly acceptable suggestion for others too.
>
> Are we still supporting builds that don't include Unicode? If so, that needs
> to be considered in a patch as well.

Good point.

Does the attached patch look reasonable?

Thanks,
Misa
-------------- next part --------------
--- Python-2.4.3/Lib/logging/handlers.py.nolocale 2006-07-19 12:15:46.000000000 -0400
+++ Python-2.4.3/Lib/logging/handlers.py 2006-07-19 12:16:14.000000000 -0400
@@ -44,6 +44,12 @@
 DEFAULT_SOAP_LOGGING_PORT = 9023
 SYSLOG_UDP_PORT = 514
 
+# If python was not built with unicode support, use the str function instead
+# of the unicode type, and hope locale doesn't break things.
+
+if not hasattr(__builtins__, 'unicode'):
+    unicode = str
+
 class BaseRotatingHandler(logging.FileHandler):
     """
     Base class for handlers that rotate log files at a certain point.
@@ -162,7 +168,7 @@
     """
     def __init__(self, filename, when='h', interval=1, backupCount=0, encoding=None):
         BaseRotatingHandler.__init__(self, filename, 'a', encoding)
-        self.when = string.upper(when)
+        self.when = unicode(when).upper()
         self.backupCount = backupCount
         # Calculate the real rollover interval, which is just the number of
         # seconds between rollovers. Also set the filename suffix used when
@@ -642,10 +648,12 @@
         """
         We need to convert record level to lowercase, maybe this will
         change in the future.
+        We convert it to unicode first, to avoid locale from changing the
+        meaning of lower() and upper()
         """
         msg = self.log_format_string % (
             self.encodePriority(self.facility,
-                                string.lower(record.levelname)),
+                                unicode(record.levelname).lower()),
             msg)
         try:
             if self.unixsocket:
@@ -854,7 +862,7 @@
         ("GET" or "POST")
         """
         logging.Handler.__init__(self)
-        method = string.upper(method)
+        method = unicode(method).upper()
         if method not in ["GET", "POST"]:
             raise ValueError, "method must be GET or POST"
         self.host = host

From jeremy at alum.mit.edu Thu Jul 20 00:03:17 2006
From: jeremy at alum.mit.edu (Jeremy Hylton)
Date: Wed, 19 Jul 2006 18:03:17 -0400
Subject: [Python-Dev] Python sprint in NY and CA, Aug. 21-24
Message-ID: <e8bf7a530607191503p6038f538y6decf87bae1a808a@mail.gmail.com>

I'd like to repeat my invitation to spend a week at Google in
California or New York for a Python sprint. We are hosting sprints at
our offices in Mountain View and New York City the week of Aug. 21,
Monday through Thursday.

We're planning to work broadly on Python 2.6 and Python 3000. If
you're interested in the core implementation, the standard library,
the build process, or documentation, they're all in scope. Working on
a test process using large community projects like Twisted and Zope
would be a great activity, too.

There's a wiki page with some more details:
http://wiki.python.org/moin/GoogleSprint

Feel free to follow up with Neal or me if you have questions.

Jeremy

From brett at python.org Thu Jul 20 00:35:45 2006
From: brett at python.org (Brett Cannon)
Date: Wed, 19 Jul 2006 15:35:45 -0700
Subject: [Python-Dev] new security doc using object-capabilities
Message-ID: <bbaeab100607191535p543cb0ddj66410cf985dd9b77@mail.gmail.com>

After various people suggesting object-capabilities, talking with Mark S.
Miller of the E programming language, and the people Mark works with at HP
Labs (who have been giving talks every week during this month here at Google
on object-capabilities), I have decided to go with object-capabilities for
securing interpreters. I have rewritten my design doc from scratch and
deleted the old one. The new doc is named securing_python.txt and can be
found through the svn web interface at
http://svn.python.org/view/python/branches/bcannon-sandboxing/securing_python.txt?rev=50717&view=log.
I have pretty much ignored any concrete API and such and gone more with a
conceptual doc to make sure the API does not get in the way of the core
security model.

Using object-capabilities should make the implementation much cleaner.
There is much less work directly on the interpreter and more of it gets
pushed up to extension modules. I also have the okay of my supervisor to
use this approach in my dissertation so this will get done.

Two things do fall out of all of this which will make development much more
modular and easier. First, the memory cap work just becomes a special build
on its own; no need to tie into the security work. So I will be cleaning up
the bcannon-sandboxing branch code as it stands, and then either create a
separate branch for the object-capabilities work, or create another branch
for the memory cap stuff and shift the changes over there. I will most
likely do the former so as to not lose the history on the checkins.

I also plan to rewrite the import machinery in pure Python. This will make
the code much more maintainable and make creating proxies for the import
machinery much easier. I will be doing that in a directory in the sandbox
initially since it needs to work from what Python has now (and possibly some
new extension module code) before it can be integrated into the interpreter
directly. Anyone who wants to help with that can. I already have some
preliminary notes on the whole thing and I think it will be reasonably
doable.
Anyway, there you go. Here is to hoping I have thought this all through properly. =) -Brett -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/python-dev/attachments/20060719/09a113fc/attachment.html From python-dev at zesty.ca Thu Jul 20 01:11:24 2006 From: python-dev at zesty.ca (Ka-Ping Yee) Date: Wed, 19 Jul 2006 18:11:24 -0500 (CDT) Subject: [Python-Dev] new security doc using object-capabilities In-Reply-To: <bbaeab100607191535p543cb0ddj66410cf985dd9b77@mail.gmail.com> References: <bbaeab100607191535p543cb0ddj66410cf985dd9b77@mail.gmail.com> Message-ID: <Pine.LNX.4.58.0607191810460.31087@server1.LFW.org> On Wed, 19 Jul 2006, Brett Cannon wrote: > I have decided to go with object-capabilities for > securing interpreters. I have rewritten my design doc from scratch and > deleted the old one. The new doc is named securing_python.txt and can be > found through the svn web interface at > http://svn.python.org/view/python/branches/bcannon-sandboxing/securing_python.txt?rev=50717&view=log. This is amazing news!! I'm going off to read your document right now. -- ?!ng From fuzzyman at voidspace.org.uk Thu Jul 20 01:25:26 2006 From: fuzzyman at voidspace.org.uk (Michael Foord) Date: Thu, 20 Jul 2006 00:25:26 +0100 Subject: [Python-Dev] new security doc using object-capabilities In-Reply-To: <bbaeab100607191535p543cb0ddj66410cf985dd9b77@mail.gmail.com> References: <bbaeab100607191535p543cb0ddj66410cf985dd9b77@mail.gmail.com> Message-ID: <44BEBF66.9020102@voidspace.org.uk> Brett Cannon wrote: > After various people suggesting object-capabilities, takling with Mark > S. Miller of the E programming language, and the people Mark works > with at HP Labs (who have been giving talks every week during this > month here at Google on object-capabilities), I have decided to go > with object-capabilities for securing interpreters. I have rewritten > my design doc from scratch and deleted the old one. 
The new doc is > named securing_python.txt and can be found through the svn web > interface at > http://svn.python.org/view/python/branches/bcannon-sandboxing/securing_python.txt?rev=50717&view=log > <http://svn.python.org/view/python/branches/bcannon-sandboxing/securing_python.txt?rev=50717&view=log> > . I have pretty much ignored any concrete API and such and gone more > with a conceptual doc to make sure the API does not get in the way of > the core security model. > This may not be relevant or possible, in which case I apologise, but the .NET model of creating application domains is extremely useful. It allows you to assign domains and run code within those domains. This means, for example, you can create a plugin system and run the plugins in a secure domain. I realise that this was the intent of the original rexec module, and your proposed new design (which is very exciting) overcomes the difficulties in that approach. The only approach using the new system would be interprocess communication (?) with a trusted interpreter communicating with an un-trusted one. Would the communication layer need to be implemented as a C extension, or will a standard Python API be possible ? Hmmm.... maybe I should read your doc. :-) Michael Foord http://www.voidspace.org.uk/python/index.shtml > Using object-capabilities should make the implementation much > cleaner. There is much less work directly on the interpreter and more > of it gets pushed up to extension modules. I also have the okay of my > supervisor to use this approach in my dissertation so this will get done. > > Two things do fall out of all of this which will make development much > more modular and easier. First, the memory cap work just becomes a > special build on its own; no need to tie into the security work. 
So I > will be cleaning up the bcannon-sandboxing branch code as it stands, > and then either create a separate branch for the object-capabilities > work, or create another branch for the memory cap stuff and shift the > changes over there. I will most likely do the former so as to not > lose the history on the checkins. > > I also plan to rewrite the import machinery in pure Python. This will > make the code much more maintainable and make creating proxies for the > import machinery much easier. I will be doing that in a directory in > the sandbox initially since it needs to work from what Python has now > (and possibly some new extension module code) before it can be > integrated into the interpreter directly. Anyone who wants to help > with that can. I already have some perliminary notes on the whole > thing and I think it will be reasonably doable. > > Anyway, there you go. Here is to hoping I have thought this all > through properly. =) > > -Brett > ------------------------------------------------------------------------ > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > http://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: http://mail.python.org/mailman/options/python-dev/fuzzyman%40voidspace.org.uk > From fuzzyman at voidspace.org.uk Thu Jul 20 01:50:33 2006 From: fuzzyman at voidspace.org.uk (Michael Foord) Date: Thu, 20 Jul 2006 00:50:33 +0100 Subject: [Python-Dev] new security doc using object-capabilities In-Reply-To: <44BEBF66.9020102@voidspace.org.uk> References: <bbaeab100607191535p543cb0ddj66410cf985dd9b77@mail.gmail.com> <44BEBF66.9020102@voidspace.org.uk> Message-ID: <44BEC549.5060305@voidspace.org.uk> Michael Foord wrote: > Brett Cannon wrote: > >> After various people suggesting object-capabilities, takling with Mark >> S. 
Miller of the E programming language, and the people Mark works >> with at HP Labs (who have been giving talks every week during this >> month here at Google on object-capabilities), I have decided to go >> with object-capabilities for securing interpreters. I have rewritten >> my design doc from scratch and deleted the old one. The new doc is >> named securing_python.txt and can be found through the svn web >> interface at >> http://svn.python.org/view/python/branches/bcannon-sandboxing/securing_python.txt?rev=50717&view=log >> <http://svn.python.org/view/python/branches/bcannon-sandboxing/securing_python.txt?rev=50717&view=log> >> . I have pretty much ignored any concrete API and such and gone more >> with a conceptual doc to make sure the API does not get in the way of >> the core security model. >> >> > > This may not be relevant or possible, in which case I apologise, but > the .NET model of creating application domains is extremely useful. It > allows you to assign domains and run code within those domains. This > means, for example, you can create a plugin system and run the plugins > in a secure domain. > > I realise that this was the intent of the original rexec module, and > your proposed new design (which is very exciting) overcomes the > difficulties in that approach. The only approach using the new system > would be interprocess communication (?) with a trusted interpreter > communicating with an un-trusted one. Would the communication layer need > to be implemented as a C extension, or will a standard Python API be > possible ? Hmmm.... maybe I should read your doc. :-) > > Ok, started to read the doc - and realise it specifically addresses these issues. My apologies :-) Michael http://www.voidspace.org.uk/python/index.shtml > Michael Foord > http://www.voidspace.org.uk/python/index.shtml > > >> Using object-capabilities should make the implementation much >> cleaner. 
There is much less work directly on the interpreter and more >> of it gets pushed up to extension modules. I also have the okay of my >> supervisor to use this approach in my dissertation so this will get done. >> >> Two things do fall out of all of this which will make development much >> more modular and easier. First, the memory cap work just becomes a >> special build on its own; no need to tie into the security work. So I >> will be cleaning up the bcannon-sandboxing branch code as it stands, >> and then either create a separate branch for the object-capabilities >> work, or create another branch for the memory cap stuff and shift the >> changes over there. I will most likely do the former so as to not >> lose the history on the checkins. >> >> I also plan to rewrite the import machinery in pure Python. This will >> make the code much more maintainable and make creating proxies for the >> import machinery much easier. I will be doing that in a directory in >> the sandbox initially since it needs to work from what Python has now >> (and possibly some new extension module code) before it can be >> integrated into the interpreter directly. Anyone who wants to help >> with that can. I already have some perliminary notes on the whole >> thing and I think it will be reasonably doable. >> >> Anyway, there you go. Here is to hoping I have thought this all >> through properly. 
=) >> >> -Brett >> ------------------------------------------------------------------------ >> >> _______________________________________________ >> Python-Dev mailing list >> Python-Dev at python.org >> http://mail.python.org/mailman/listinfo/python-dev >> Unsubscribe: http://mail.python.org/mailman/options/python-dev/fuzzyman%40voidspace.org.uk >> >> > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > http://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: http://mail.python.org/mailman/options/python-dev/fuzzyman%40voidspace.org.uk > > From tim.peters at gmail.com Thu Jul 20 02:52:11 2006 From: tim.peters at gmail.com (Tim Peters) Date: Wed, 19 Jul 2006 20:52:11 -0400 Subject: [Python-Dev] Strategy for converting the decimal module to C In-Reply-To: <e9m1va$5pv$1@sea.gmane.org> References: <44BD4690.3070402@ewtllc.com> <20060718210017.GA7850@panix.com> <44BD6318.6010806@ewtllc.com> <e9ko1k$gv1$1@sea.gmane.org> <44BE86E2.6070802@ewtllc.com> <e9m1va$5pv$1@sea.gmane.org> Message-ID: <1f7befae0607191752j7b4a828eu99ea6cc28be40f28@mail.gmail.com> ... [Raymond] >> Even then, we need to drop the concept of having the flags as counters >> rather than booleans. [Georg Brandl] > Yes. Given that even Tim couldn't imagine a use case for counting the > exceptions, I think it's sensible. That's not it -- someone will "find a use" for anything. It's unfortunate that we used a dict with counts because the /standard/ we're trying to meet requires no such thing, and clearly had a "pile of on/off bits" model in mind. Extending a standard without strong need creates problems (for example, this one <0.5 wink>). 
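
To make the "pile of on/off bits" reading concrete, here is a minimal sketch against the decimal module. It only ever asks "was this signal raised at least once?", so it behaves the same whether an implementation stores counts or booleans in the flags mapping. The helper name inexact_was_signalled and the choice of dividing by 3 are assumptions made for illustration, not something proposed in the thread.

```python
import decimal


def inexact_was_signalled(numerators, prec=3):
    """Divide each numerator by 3 and report whether Inexact was ever set."""
    # A private context, so the caller's thread-local context is untouched.
    ctx = decimal.Context(prec=prec)
    ctx.clear_flags()
    for n in numerators:
        ctx.divide(decimal.Decimal(n), decimal.Decimal(3))
    # Only the truth value is used: the flag is treated as an on/off bit,
    # never as a count of how many times the condition occurred.
    return bool(ctx.flags[decimal.Inexact])
```

Code written this way would not notice a change from counters to booleans; only code that reads the numeric value of a flag depends on the counting behaviour.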
From greg.ewing at canterbury.ac.nz Thu Jul 20 03:30:50 2006 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Thu, 20 Jul 2006 13:30:50 +1200 Subject: [Python-Dev] Strategy for converting the decimal module to C In-Reply-To: <E1G3Iui-0006Zk-CQ@draco.cus.cam.ac.uk> References: <E1G3Iui-0006Zk-CQ@draco.cus.cam.ac.uk> Message-ID: <44BEDCCA.1070305@canterbury.ac.nz> Nick Maclaren wrote: > When such arithmetic is implemented in hardware, it is normal for > exceptional cases to be handled by interrupt, and that is VERY > expensive ... It then becomes important to know how > many of the things you got, to know whether it is worth putting > code in to avoid them or even using a different algorithm. But couldn't you just put in an interrupt handler that counts the interrupts, for the purpose of measurement? -- Greg From rhettinger at ewtllc.com Thu Jul 20 03:45:51 2006 From: rhettinger at ewtllc.com (Raymond Hettinger) Date: Wed, 19 Jul 2006 18:45:51 -0700 Subject: [Python-Dev] Strategy for converting the decimal module to C In-Reply-To: <1f7befae0607191752j7b4a828eu99ea6cc28be40f28@mail.gmail.com> References: <44BD4690.3070402@ewtllc.com> <20060718210017.GA7850@panix.com> <44BD6318.6010806@ewtllc.com> <e9ko1k$gv1$1@sea.gmane.org> <44BE86E2.6070802@ewtllc.com> <e9m1va$5pv$1@sea.gmane.org> <1f7befae0607191752j7b4a828eu99ea6cc28be40f28@mail.gmail.com> Message-ID: <44BEE04F.9000209@ewtllc.com> An HTML attachment was scrubbed... 
URL: http://mail.python.org/pipermail/python-dev/attachments/20060719/058ab3cb/attachment.html
From t-bruch at microsoft.com  Thu Jul 20 04:29:09 2006
From: t-bruch at microsoft.com (Bruce Christensen)
Date: Wed, 19 Jul 2006 19:29:09 -0700
Subject: [Python-Dev] Pickling objects that return string from reduce
In-Reply-To: <44BBFB5B.2090806@v.loewis.de>
References: <3581AA168D87A2479D88EA319BDF7D32E05276@RED-MSG-80.redmond.corp.microsoft.com>
 <44BBFB5B.2090806@v.loewis.de>
Message-ID: <3581AA168D87A2479D88EA319BDF7D32EF2FC7@RED-MSG-80.redmond.corp.microsoft.com>

"Martin v. Löwis" wrote:
> If obj has no __module__ attribute (or if it is None), pickle
> (didn't check cPickle) also does
>
>    for n, module in sys.modules.items():
>        if "module-ignored": continue
>        if getattr(module, result, None) is obj:
>            break # use n as module name
>
> If obj does have a __module__ attribute, it uses __import__
> to import the module, just to make sure it gets into sys.modules.

What is "module-ignored" above? It's obviously not a literal string...

--Bruce

From kbk at shore.net  Thu Jul 20 06:55:00 2006
From: kbk at shore.net (Kurt B. Kaiser)
Date: Thu, 20 Jul 2006 00:55:00 -0400 (EDT)
Subject: [Python-Dev] Weekly Python Patch/Bug Summary
Message-ID: <200607200455.k6K4t0Vw018820@bayview.thirdcreek.com>

Patch / Bug Summary
___________________

Patches :  398 open ( +5) /  3334 closed (+19) /  3732 total (+24)
Bugs    :  904 open ( -4) /  6011 closed (+36) /  6915 total (+32)
RFE     :  222 open ( -1) /   231 closed ( +2) /   453 total ( +1)

New / Reopened Patches
______________________

Fix for #1513611 and #1511497; xml.sax imports  (2006-07-10)
       http://python.org/sf/1519796  opened by  Žiga Seilnacht

telnetlib.py change to ease option handling.  (2006-07-10)
       http://python.org/sf/1520081  opened by  Ernest ter Kuile

Support for PyGetSetDefs in pydoc, inspect, and types  (2006-07-10)
       http://python.org/sf/1520294  opened by  Barry A. Warsaw

Distutils bugfix: Read $AR from the environment/Makefile.
(2006-07-11) http://python.org/sf/1520877 opened by Douglas Greiman make install change: Allow $DESTDIR to be relative (2006-07-11) http://python.org/sf/1520879 opened by Douglas Greiman Fix tests that assume they can write to Lib/test (2006-07-12) http://python.org/sf/1520904 opened by Douglas Greiman Don't produce core file in test_subprocess.py (2006-07-12) http://python.org/sf/1520905 opened by Douglas Greiman Extra configurability for doctest TestCases (2006-07-12) http://python.org/sf/1521051 opened by Russell Keith-Magee --help and --version long options (2006-07-12) CLOSED http://python.org/sf/1521179 opened by Georg Brandl smtplib login fails with aol smtp server (2006-07-12) http://python.org/sf/1521196 opened by Peter Reintroduce index checking in 1-element ctypes arrays (2006-07-13) CLOSED http://python.org/sf/1521817 opened by Thomas Heller Fix grammatical errors in Doc/howto/doanddont.tex (2006-07-13) CLOSED http://python.org/sf/1521874 opened by Collin Winter Give Cookie.py its own _idmap (2006-07-13) http://python.org/sf/1521882 opened by Collin Winter Add some explication to PEP 3100 (2006-07-13) http://python.org/sf/1522038 opened by Collin Winter Remove operator.truth() and operator.abs() (2006-07-13) http://python.org/sf/1522059 opened by Collin Winter Improve docs for filter() (2006-07-13) CLOSED http://python.org/sf/1522211 opened by Collin Winter irda socket support (2006-07-14) http://python.org/sf/1522400 opened by ?????? Tix.Grid patch (2006-07-14) http://python.org/sf/1522587 opened by klappnase XML Test Runner for unittest module (2006-07-14) http://python.org/sf/1522704 opened by Sebastian Rittau (partial?) fix for Misc/python-config.in (2006-07-16) http://python.org/sf/1523356 opened by M. 
Levinson Var-default patch undid part of backtick to repr() patch (2006-07-18) CLOSED http://python.org/sf/1524429 opened by Graham Horler Fix Tkinter Tcl-commands memory-leaks (2006-07-18) http://python.org/sf/1524639 opened by Graham Horler Fix --without-threads build error (2006-07-18) CLOSED http://python.org/sf/1524724 opened by Matt Fleming ConfigParser: accept leading whitespace on options+comments (2006-07-18) http://python.org/sf/1524825 opened by Ken Lalonde Patches Closed ______________ Patch for bug 1441486: bad unary minus folding in compiler (2006-03-10) http://python.org/sf/1446922 closed by nascheme OpenVMS patches Modules directory (2006-07-04) http://python.org/sf/1516912 closed by nnorwitz Rough documentation for xml.etree.ElementTree (2006-06-10) http://python.org/sf/1504046 closed by nnorwitz turtle.py: correcting begin_fill (2006-07-09) http://python.org/sf/1519566 closed by loewis pdb: fix for #1472251('run/runeval' commands bug) (2006-04-18) http://python.org/sf/1472257 closed by jakamkon urllib2 redirection fix (2006-07-01) http://python.org/sf/1515745 closed by nnorwitz Python long option support (2006-05-03) http://python.org/sf/1481112 closed by gbrandl --help and --version long options (2006-07-12) http://python.org/sf/1521179 closed by gbrandl Reintroduce index checking in 1-element ctypes arrays (2006-07-13) http://python.org/sf/1521817 closed by theller Fix grammatical errors in Doc/howto/doanddont.tex (2006-07-13) http://python.org/sf/1521874 closed by gbrandl Improve docs for filter() (2006-07-13) http://python.org/sf/1522211 closed by rhettinger python / pythonw replacement in C (2004-10-29) http://python.org/sf/1056561 closed by etrepum OS X: Can't use #!/usr/bin/pythonw (2004-10-02) http://python.org/sf/1038911 closed by etrepum update the binhex module for Mach-O (2005-06-14) http://python.org/sf/1220874 closed by etrepum Remove dependencies on the sets module (2006-06-04) http://python.org/sf/1500609 closed by gbrandl 
ColorDelegator - Several bug fixes (2006-04-30) http://python.org/sf/1479219 closed by kbk Var-default patch undid part of backtick to repr() patch (2006-07-18) http://python.org/sf/1524429 closed by loewis zipfile: support for ZIP64 (2006-03-09) http://python.org/sf/1446489 closed by ronaldoussoren Fix --without-threads build error (2006-07-18) http://python.org/sf/1524724 closed by tim_one New / Reopened Bugs ___________________ Cannot use high-numbered sockets in 2.4.3 (2006-05-24) CLOSED http://python.org/sf/1494314 reopened by anthonybaxter Incorrect lineno's in code objects (2006-06-26) CLOSED http://python.org/sf/1512814 reopened by twouters AttributeError in the shelve module (2006-07-09) CLOSED http://python.org/sf/1519786 opened by Martin Kelly Proxy does not work in 2.4.3 (2006-07-10) http://python.org/sf/1519816 opened by Michal Niklas non-uniform behavior in 'startswith' / 'endswith' (2006-07-10) http://python.org/sf/1520176 opened by Milind __missing__ does not get called (2006-07-11) CLOSED http://python.org/sf/1520327 opened by Milind fcntl.ioctl fails to copy back exactly-1024 buffer (2006-07-11) http://python.org/sf/1520818 opened by Mark Eichin urrlib2 max_redirections=0 disables redirects (2006-07-11) http://python.org/sf/1520831 opened by rus_r_orange unpack list of singleton tuples not unpacking (2006-07-11) CLOSED http://python.org/sf/1520864 opened by Anthony Tuininga time.strftime breakage in 2.4/2.5 (2006-07-11) CLOSED http://python.org/sf/1520914 opened by Skip Montanaro ctypes test overwrites /dev/null (2006-07-12) CLOSED http://python.org/sf/1521375 opened by Scot Doyle file.seek() influelce write() when opened with a+ mode (2006-07-12) http://python.org/sf/1521491 opened by Lior isinstance failure in 2.6 Beta 2 (2006-07-13) CLOSED http://python.org/sf/1521726 opened by Nick Maclaren possible bug in mystrtol.c with recent gcc (2006-07-13) http://python.org/sf/1521947 opened by Marien Zwart shlex.split() does not tokenize like the shell 
(2006-07-13) http://python.org/sf/1521950 opened by Dan Christian filter() doesn't use __len__ of str/unicode/tuple subclasses (2006-07-13) CLOSED http://python.org/sf/1522016 opened by Collin Winter RPM build fails for Py2.5b2 (2006-07-13) http://python.org/sf/1522046 opened by Stefan Behnel _threading_local.py logic error in _localbase __new__ (2006-07-13) http://python.org/sf/1522237 opened by Tony Nelson Patch #1388073 is not mentioned in NEWS (2006-07-14) CLOSED http://python.org/sf/1522771 opened by Collin Winter incorrect tcp checksum (2006-07-15) CLOSED http://python.org/sf/1523136 opened by ecir hana threading.Thread Traceback (2006-07-16) CLOSED http://python.org/sf/1523465 opened by roee88 PyArg_ParseTupleAndKeywords potential core dump (2006-07-16) http://python.org/sf/1523610 opened by Eric Huss 2.4.2 file.read caches EOF state (2006-07-17) http://python.org/sf/1523853 opened by Jarkko Torppa logging using the SysLog handler fails if locale is set (2006-07-17) http://python.org/sf/1524081 opened by Mihai Ibanescu os.listdir doesn't check error code from FindNextFile (2006-07-18) http://python.org/sf/1524310 opened by Roger Upole configure --without-threads fails to build (2006-07-18) CLOSED http://python.org/sf/1524317 opened by Ram Bhamidipaty MemoryError with a lot of available memory - gc not called (2006-07-19) http://python.org/sf/1524938 opened by Mark Matusevich Webserver TypeError: expected read buffer, NoneType found (2006-07-19) http://python.org/sf/1525343 opened by jbet Build fails on OS X with case sensitive fs (2006-07-19) http://python.org/sf/1525447 opened by gideon may SimpleXMLRpcServer still uses sys.exc_value and sys.exc_type (2006-07-19) http://python.org/sf/1525469 opened by Russell Warren Document additions from PEP 302 (2006-07-19) http://python.org/sf/1525549 opened by Brett Cannon Malloc, memory error, failmalloc, low memory. 
(2006-07-19) http://python.org/sf/1525589 opened by Rene Dudfield ldap get_option(0) causes segfault (2006-07-19) CLOSED http://python.org/sf/1525590 opened by Sean Burford exec and eval allocate lots of memory and do not free it (2006-07-20) http://python.org/sf/1525678 opened by Connelly Bugs Closed ___________ yet another svn head compiler change (2006-03-02) http://python.org/sf/1441486 closed by nascheme Cannot use high-numbered sockets in 2.4.3 (2006-05-24) http://python.org/sf/1494314 closed by anthonybaxter Incorrect lineno's in code objects (2006-06-26) http://python.org/sf/1512814 closed by nnorwitz Incorrect lineno's in code objects (2006-06-26) http://python.org/sf/1512814 closed by nnorwitz c_void_pointer should accept a long pointer > 0x7fffffff (2006-07-06) http://python.org/sf/1518190 closed by theller ImportWarning should be removed (2006-06-30) http://python.org/sf/1515169 closed by nnorwitz test_ctypes fails on OSX 10.3 (2006-04-10) http://python.org/sf/1467450 closed by theller AttributeError in the shelve module (2006-07-10) http://python.org/sf/1519786 closed by gbrandl inspect.py: still infinite recursion inspecting frames (2006-07-03) http://python.org/sf/1516184 closed by pje AttributeError in upload_file on interrupted connection (2006-03-23) http://python.org/sf/1457312 closed by pje subprocess test cases fail with noexec-mounted /tmp (2006-03-06) http://python.org/sf/1444408 closed by astrand subprocess.CalledProcessError uses errno incorrectly (2005-12-08) http://python.org/sf/1376309 closed by astrand subprocess.py abuse of errno (2005-06-20) http://python.org/sf/1223937 closed by astrand __missing__ does not get called (2006-07-11) http://python.org/sf/1520327 closed by gbrandl unpack list of singleton tuples not unpacking (2006-07-11) http://python.org/sf/1520864 closed by nnorwitz time.strftime breakage in 2.4/2.5 (2006-07-11) http://python.org/sf/1520914 closed by bcannon ctypes test overwrites /dev/null (2006-07-12) 
http://python.org/sf/1521375 closed by theller isinstance failure in 2.5 Beta 2 (2006-07-13) http://python.org/sf/1521726 closed by gbrandl filter() doesn't use __len__ of str/unicode/tuple subclasses (2006-07-13) http://python.org/sf/1522016 closed by rhettinger msvccompiler.py using VC6 with Python 2.5a2 (2006-06-18) http://python.org/sf/1508010 closed by loewis incorrect locale.strcoll() return in Windows (2006-07-08) http://python.org/sf/1519069 closed by loewis Patch #1388073 is not mentioned in NEWS (2006-07-14) http://python.org/sf/1522771 closed by gbrandl IDLE fails to launch after 46272 (2.4 with 2.5 IDLE) (2006-06-27) http://python.org/sf/1513617 closed by kbk ext/win-cookbook.html has a broken link to distutils (2006-07-07) http://python.org/sf/1518772 closed by quiver PIL binary package missing jpeg support (pimp) (2003-07-23) http://python.org/sf/776600 closed by etrepum IDE needs easy access to builtin help() (2003-03-29) http://python.org/sf/711991 closed by etrepum Raise IDE output window over splash screen on early crash (2003-03-26) http://python.org/sf/710374 closed by etrepum IDE stdin doesn't have readlines (2003-03-26) http://python.org/sf/710373 closed by etrepum Mac IDE behaviour (output to console) (2002-06-24) http://python.org/sf/573174 closed by etrepum incorrect tcp checksum (2006-07-15) http://python.org/sf/1523136 closed by loewis threading.Thread Traceback (2006-07-16) http://python.org/sf/1523465 closed by tim_one "as" keyword sometimes highlighted in strings (2005-10-12) http://python.org/sf/1325071 closed by kbk email.Utils.py: "'" in RFC2231 header (2005-06-10) http://python.org/sf/1218081 closed by bwarsaw Incorrect RFC 2231 decoding (2005-01-15) http://python.org/sf/1102973 closed by bwarsaw link path probs on OSX re: Tcl, Tk & fink's /sw (07/13/05) http://python.org/sf/1237697 closed by sf-robot termios.c in qnx4.25 (09/19/05) http://python.org/sf/1295179 closed by sf-robot IDLE bug - changing shortcuts - Shift-Tab broken! 
(2005-04-08)
       http://python.org/sf/1179168  closed by  kbk

expat crash python  (2005-09-20)
       http://python.org/sf/1296433  closed by  bcannon

configure --without-threads fails to build  (2006-07-18)
       http://python.org/sf/1524317  closed by  tim_one

ldap get_option(0) causes segfault  (2006-07-19)
       http://python.org/sf/1525590  closed by  nnorwitz

Getting an error message import site failed -v traceback.  (07/05/06)
       http://python.org/sf/1517370  closed by  sf-robot

New / Reopened RFE
__________________

support all of strftime(3)  (2006-07-11)
       http://python.org/sf/1520662  opened by  Toni Mueller

RFE Closed
__________

Add syntax coloring to Mac IDE  (2002-08-07)
       http://python.org/sf/592047  closed by  etrepum

Enhance PackageManager functionality  (2003-07-28)
       http://python.org/sf/779160  closed by  etrepum

From martin at v.loewis.de  Thu Jul 20 08:47:40 2006
From: martin at v.loewis.de ("Martin v. Löwis")
Date: Thu, 20 Jul 2006 08:47:40 +0200
Subject: [Python-Dev] Pickling objects that return string from reduce
In-Reply-To: <3581AA168D87A2479D88EA319BDF7D32EF2FC7@RED-MSG-80.redmond.corp.microsoft.com>
References: <3581AA168D87A2479D88EA319BDF7D32E05276@RED-MSG-80.redmond.corp.microsoft.com>
 <44BBFB5B.2090806@v.loewis.de>
 <3581AA168D87A2479D88EA319BDF7D32EF2FC7@RED-MSG-80.redmond.corp.microsoft.com>
Message-ID: <44BF270C.90202@v.loewis.de>

Bruce Christensen wrote:
>> If obj has no __module__ attribute (or if it is None), pickle
>> (didn't check cPickle) also does
>>
>>    for n, module in sys.modules.items():
>>        if "module-ignored": continue
>>        if getattr(module, result, None) is obj:
>>            break # use n as module name
>
> What is "module-ignored" above? It's obviously not a literal string...

It's skipped if module is None (skip dummy package entries) or
n=='__main__'.
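
Spelled out, the loop with the "module-ignored" placeholder filled in might look roughly like the sketch below. This is an illustration of the description above, not the actual pickle source: the function name guess_module_name is hypothetical, the fallback to '__main__' when nothing matches is a choice of this sketch, and the broad except clause is a defensive addition for entries in sys.modules that compute attributes lazily.

```python
import sys


def guess_module_name(obj, name):
    """Return the name of the first loaded module that exposes obj as `name`."""
    # Iterate over a copy so that imports triggered elsewhere cannot change
    # the dictionary while we are walking it.
    for modname, module in list(sys.modules.items()):
        # The "module-ignored" test: skip dummy entries (None) and __main__.
        if module is None or modname == "__main__":
            continue
        try:
            candidate = getattr(module, name, None)
        except Exception:
            # Attribute lookup itself failed; treat it as "not found here".
            continue
        if candidate is obj:
            return modname
    # Nothing matched; this sketch falls back to __main__.
    return "__main__"
```

Note that the comparison uses identity ("is"), so a module is only accepted when it holds the very same object under that name, not merely an equal one.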

Regards,
Martin

From l.oluyede at gmail.com  Thu Jul 20 10:17:32 2006
From: l.oluyede at gmail.com (Lawrence Oluyede)
Date: Thu, 20 Jul 2006 10:17:32 +0200
Subject: [Python-Dev] new security doc using object-capabilities
In-Reply-To: <bbaeab100607191535p543cb0ddj66410cf985dd9b77@mail.gmail.com>
References: <bbaeab100607191535p543cb0ddj66410cf985dd9b77@mail.gmail.com>
Message-ID: <9eebf5740607200117r4d4613e2i91665ea211bab46@mail.gmail.com>

That's great. I have just read your draft and I have a few comments, but
first let me say that I like the idea of borrowing concepts from E. I
came across E at the beginning of this year and found it a pot of really
nice ideas (for promises and capabilities). Here are my comments about
the draft:

- It's not really clear to me what the "powerbox" is. I think I got the
  concept of a "super process", but maybe it needs to be clarified? It
  becomes clear in the "threat model" paragraph.

- I hope no Rubyists will read the "Problem of No Private Namespace"
  section, because they have private/protected keywords to enforce this
  stuff :-) Writing proxies in C will slow down the dev process
  (although it may speed up performance), but in the far future someone
  will come up with an alternative closer to the Python level.

- Can you write down a simple example of what you mean by "changing
  something of the built-in objects"? (in "Problem of mutable shared
  state")

- What about the performance issues of the capabilities model overall?

- I know what you meant to say, but the paragraph about pythonicness and
  the security model seems a little "fuzzy" to me. What are the
  boundaries of the allowed changes for the security stuff?

- You don't say anything about networking and networked resources in the
  list for the standard sandboxed interpreter.

- Suppose we have a .py module. Based on your security model we can
  import it, right? When imported it generates a .pyc file. What happens
  the second time we import it? Is the .pyc ignored?
  Is the import not allowed at all? We can't rely on the name file.pyc,
  because an attacker who knows that file.py is secure and that the
  second import is done against file.pyc can replace the "secure"
  file.pyc with an insecure implementation and do some kind of harm to
  the sandbox.

- About "Filesystem information": does the sandboxed interpreter need to
  know all that information about file paths, files and so on? Can't we
  reset those attributes to something arbitrary?

- About the sys module: I think the best way is to have a purged, fake
  sys module with only the stuff you need. pypy has the concept of faked
  modules too (although for a different reason).

- About networking: what do you think about E's model of really safe
  networking, protected remotable objects and safe RPC? Is that model
  applicable to Python in some way? We can't use E's model as a whole
  (asking people to generate a safe key and send it by email is
  unfeasible).

- Is the protected memory model some kind of memory monitor system?

I think that's all for the draft. I wrote these comments while reading
the document. Hope some of these help.

-- 
Lawrence
http://www.oluyede.org/blog

From fuzzyman at voidspace.org.uk  Thu Jul 20 11:12:43 2006
From: fuzzyman at voidspace.org.uk (Michael Foord)
Date: Thu, 20 Jul 2006 10:12:43 +0100
Subject: [Python-Dev] os.utime and os.chmod failures (etc) Python 2.5b2
Message-ID: <44BF490B.1090101@voidspace.org.uk>

Hello all,

There may be a reasonable cause for this (i.e. it is likely to be my
fault) - but it is consistent across two different machines I have tried
it on.

With Python 2.5b2 (from the msi at Python.org), running on Windows XP
Pro SP2, ``os.utime`` and ``os.chmod`` fail with WindowsError. The same
code runs fine on Python 2.3 and Python 2.4.

[err]     shutil.copytree(thisentry, targetdir)
[err]   File "C:\Python25\lib\shutil.py", line 130, in copytree
[err]     copystat(src, dst)
[err]   File "C:\Python25\lib\shutil.py", line 67, in copystat
[err]     os.utime(dst, (st.st_atime, st.st_mtime))
[err] WindowsError: [Error 13] Access is denied: 'lib\\Pmw'
[err]

The script uses ``shutil.copytree`` to copy a directory (using relative
paths). The source code snippet is :

    if os.path.isdir(targetdir):
        shutil.rmtree(targetdir)
    shutil.copytree(thisentry, targetdir)

The code in shutil.py is :

    def copystat(src, dst):
        """Copy all stat info (mode bits, atime and mtime) from src to dst"""
        st = os.stat(src)
        mode = stat.S_IMODE(st.st_mode)
        if hasattr(os, 'utime'):
            os.utime(dst, (st.st_atime, st.st_mtime))
        if hasattr(os, 'chmod'):
            os.chmod(dst, mode)

Additionally, after running a couple of times I get core dumps when
trying to run the interactive interpreter. Oddly IDLE and other programs
still run. However this is after installing py2exe, wxPython and pywin32
(all of which have Python 2.5 builds), so there is possibly some weird
interaction.

My install directory has the following three dlls in it :

    MSVCIRT.dll
    MSVCP60.dll
    MSVCRT.dll

It has none of the msvc7 dlls that I would expect.

I'm now at work and so don't have time to experiment, but tonight I will
try uninstalling all my Python 2.5 stuff and re-installing just 2.5b2. I
will see what dlls are present, run the interpreter a few times, and
test a simple script with shutil.copytree.

All the best,

Michael Foord
http://www.voidspace.org.uk/python/index.shtml

From nmm1 at cus.cam.ac.uk  Thu Jul 20 12:03:41 2006
From: nmm1 at cus.cam.ac.uk (Nick Maclaren)
Date: Thu, 20 Jul 2006 11:03:41 +0100
Subject: [Python-Dev] Strategy for converting the decimal module to C
Message-ID: <E1G3VNd-0003SW-6E@draco.cus.cam.ac.uk>

Greg Ewing <greg.ewing at canterbury.ac.nz> wrote:
>
> But couldn't you just put in an interrupt handler that
> counts the interrupts, for the purpose of measurement?

No, but the reasons are very arcane. The general reason is that taking
an interrupt handler and returning is not transparent, and is often not
possible on modern systems. If that problem is at the hardware level
(as on the Alpha and 3086/7), you are stuffed. But, more often, it is
due to the fact that the architecture means that such handling can only
be done at maximally privileged level.

Now, interrupting into that level has to be transparent, in order to
support TLB misses, clock interrupts, device interrupts, machine-check
interrupts and so on. But the kernels rarely support transparent
callbacks from that state into user code (though they used to); it is
actually VERY hard to do, and even the mainframes had problems. This
very commonly means that such counting breaks other facilities, unless
it is done IN the privileged code.

Of course, a GOOD hardware architecture wouldn't leave the process
state when it gets a floating-point interrupt, but would just invoke an
asynchronous routine call. That used to be done.

As I said, none of this is directly relevant to emulated
implementations, such as the current Python ones, but it IS to the
design of an arithmetic specification. It could become relevant if
Python wants to start to use a hardware implementation, because your
proposal would mean that it would have to try to ensure that such
callbacks are transparent. As one of the few people still working who
has extensive experience with doing that, I can assure you that it is
an order of magnitude fouler than you can imagine. A decimal order of
magnitude :-(

But note that I wasn't saying that such things should be put into the
API, merely that there is a very good reason to do so for hardware
implementations and ones used to tune code for such implementations.
Personally, I wouldn't bother.

Regards,
Nick Maclaren,
University of Cambridge Computing Service,
New Museums Site, Pembroke Street, Cambridge CB2 3QH, England.
Email: nmm1 at cam.ac.uk Tel.: +44 1223 334761 Fax: +44 1223 334679 From skip at pobox.com Thu Jul 20 12:09:38 2006 From: skip at pobox.com (skip at pobox.com) Date: Thu, 20 Jul 2006 05:09:38 -0500 Subject: [Python-Dev] logging module broken because of locale In-Reply-To: <20060719212919.GD15030@abulafia.devel.redhat.com> References: <20060717193955.GK3699@abulafia.devel.redhat.com> <44BD2054.4020705@v.loewis.de> <20060718185238.GB363@abulafia.devel.redhat.com> <200607181455.56535.fdrake@acm.org> <20060719212919.GD15030@abulafia.devel.redhat.com> Message-ID: <17599.22114.395451.103228@montanaro.dyndns.org> Misa> Good point. Does the attached patch look reasonable? ... Misa> - self.when = string.upper(when) Misa> + self.when = unicode(when).upper() ... The use of the string module instead of string methods suggests to me that the logging package attempts to work with older versions of Python. Looking at PEP 291 it looks like 1.5.2 compatibility is desired (no string methods, no unicode). I think a conscious decision by someone (probably Vinay Sajip) to give up that compatibility would be required. Skip From rasky at develer.com Thu Jul 20 12:36:35 2006 From: rasky at develer.com (Giovanni Bajo) Date: Thu, 20 Jul 2006 12:36:35 +0200 Subject: [Python-Dev] new security doc using object-capabilities References: <bbaeab100607191535p543cb0ddj66410cf985dd9b77@mail.gmail.com> Message-ID: <012801c6abe8$5f9524d0$d503030a@trilan> Brett Cannon wrote: >> The new doc is named securing_python.txt and >> can be >> found through the svn web interface at >> http://svn.python.org/view/python/branches/bcannon-sandboxing/securing_python.txt?rev=50717&view=log. How do you plan to handle CPU-hogs? Stuff like execution of a gigantic integer multiplication. 
This recipe for safe_eval: http://aspn.activestate.com/ASPN/Cookbook/Python/Recipe/496746 which is otherwise very cute, does not handle this case as well: it tries to catch and interrupt long-running operations through a secondary thread, but fails on a single long operation because the GIL is not released and the alarm thread does not get its chance to run. -- Giovanni Bajo From nmm1 at cus.cam.ac.uk Thu Jul 20 13:10:13 2006 From: nmm1 at cus.cam.ac.uk (Nick Maclaren) Date: Thu, 20 Jul 2006 12:10:13 +0100 Subject: [Python-Dev] new security doc using object-capabilities Message-ID: <E1G3WQ1-0007C7-Ov@libra.cus.cam.ac.uk> "Giovanni Bajo" <rasky at develer.com> wrote: > > This recipe for safe_eval: > http://aspn.activestate.com/ASPN/Cookbook/Python/Recipe/496746 > which is otherwise very cute, does not handle this case as well: it tries to > catch and interrupt long-running operations through a secondary thread, but > fails on a single long operation because the GIL is not released and the > alarm thread does not get its chance to run. Grin :-) You have put your finger on the Great Myth of such virtualisations, which applies to the system-level ones and even to the hardware-level ones. In practice, there is always some request that a sandbox can make to the hypervisor that can lock out or otherwise affect other sandboxes. The key is, of course, to admit that and to specify what is and is not properly virtualised, so that the consequences can at least be analysed. Regards, Nick Maclaren, University of Cambridge Computing Service, New Museums Site, Pembroke Street, Cambridge CB2 3QH, England. 
Email: nmm1 at cam.ac.uk Tel.: +44 1223 334761 Fax: +44 1223 334679 From rasky at develer.com Thu Jul 20 13:40:06 2006 From: rasky at develer.com (Giovanni Bajo) Date: Thu, 20 Jul 2006 13:40:06 +0200 Subject: [Python-Dev] new security doc using object-capabilities References: <E1G3WQ1-0007C7-Ov@libra.cus.cam.ac.uk> Message-ID: <030001c6abf1$3f231690$d503030a@trilan> Nick Maclaren wrote: >> This recipe for safe_eval: >> http://aspn.activestate.com/ASPN/Cookbook/Python/Recipe/496746 >> which is otherwise very cute, does not handle this case as well: it >> tries to catch and interrupt long-running operations through a >> secondary thread, but fails on a single long operation because the >> GIL is not released and the alarm thread does not get its chance to >> run. > > Grin :-) > > You have put your finger on the Great Myth of such virtualisations, > which applies to the system-level ones and even to the hardware-level > ones. In practice, there is always some request that a sandbox can > make to the hypervisor that can lock out or otherwise affect other > sandboxes. > > The key is, of course, to admit that and to specify what is and is > not properly virtualised, so that the consequences can at least be > analysed. I agree, and in fact Brett's work on a proper security model is greatly welcome. It's just that us mere mortals need to use eval() *now*, and that recipe is good enough for many practice uses. If you can't win, you can at least lose with dignity :) -- Giovanni Bajo From ncoghlan at gmail.com Thu Jul 20 13:43:21 2006 From: ncoghlan at gmail.com (Nick Coghlan) Date: Thu, 20 Jul 2006 21:43:21 +1000 Subject: [Python-Dev] new security doc using object-capabilities In-Reply-To: <bbaeab100607191535p543cb0ddj66410cf985dd9b77@mail.gmail.com> References: <bbaeab100607191535p543cb0ddj66410cf985dd9b77@mail.gmail.com> Message-ID: <44BF6C59.6010707@gmail.com> For code objects, their construction is already commonly written as "compile(source)". 

For type objects, the constructor doesn't let you do anything you can't
already do with a class statement. It doesn't need securing.

For rewriting import.c in Python, the PEP 302 compliant import system
API in pkgutil would be a good starting point.

Your doc also asks about the imp.get_suffixes() list, and wonders where
to set it from Python. As far as I am aware, you can't. get_suffixes()
is built from _PyImport_FileTab, which is a C array. A switch statement
is used to get from the file table entries to the appropriate handler
functions.

Quoting from the suggestions I put to the Py3k list:

Use smarter data structures
---------------------------
Currently, the individual handlers to load a fully identified module
are exposed to Python code in a way that reflects the C-style data
structures used in the current implementation.

Simply switching to more powerful data structures for the file type
handlers (i.e. use a PyTuple for filedescr values, a PyList for
_PyImport_FileTab, and a PyDict instead of a switch statement to go
from filedescr values to module loading/initialisation functions) and
manipulating them all as normal Python objects could make the code in
import.c much easier to follow.

Extensible file type handling
-----------------------------
If the file type handlers are stored in normal Python data structures
as described above, it becomes feasible to make the import system
extensible to different file types as well as to different file
locations.

This could be handled on a per-package basis, e.g. via a __file_types__
special attribute in packages.

Cheers,
Nick.
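
The data-structure suggestion quoted above can be sketched in pure Python as follows. Everything here is hypothetical and chosen only for illustration (the names FILE_TABLE, HANDLERS, register_file_type and load_module, the stand-in loaders, and the ".tmpl" suffix in the usage note); none of it reflects the real layout of import.c.

```python
# Hypothetical sketch: none of these names exist in CPython's import code.
PY_SOURCE, PY_COMPILED = "source", "bytecode"


def load_source(name, path):
    """Stand-in loader; a real one would compile and execute the file."""
    return "loaded %s from source file %s" % (name, path)


def load_compiled(name, path):
    """Stand-in loader; a real one would unmarshal and execute the code."""
    return "loaded %s from bytecode file %s" % (name, path)


# The file table is an ordinary list of (suffix, open mode, kind) tuples,
FILE_TABLE = [(".py", "r", PY_SOURCE), (".pyc", "rb", PY_COMPILED)]
# and a dict takes the place of the switch statement.
HANDLERS = {PY_SOURCE: load_source, PY_COMPILED: load_compiled}


def register_file_type(suffix, mode, kind, handler):
    """Extend the table at run time with a new file type and its handler."""
    FILE_TABLE.append((suffix, mode, kind))
    HANDLERS[kind] = handler


def load_module(name, path):
    """Dispatch to the handler registered for the file's suffix."""
    for suffix, mode, kind in FILE_TABLE:
        if path.endswith(suffix):
            return HANDLERS[kind](name, path)
    raise ImportError("no handler for %r" % path)
```

With the table and the dispatch mapping held as plain objects, adding a file type is a single call such as register_file_type(".tmpl", "r", "template", some_loader), which is the kind of extensibility the second suggestion has in mind.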
-- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- http://www.boredomandlaziness.org From theller at python.net Thu Jul 20 14:26:44 2006 From: theller at python.net (Thomas Heller) Date: Thu, 20 Jul 2006 14:26:44 +0200 Subject: [Python-Dev] os.utime and os.chmod failures (etc) Python 2.5b2 In-Reply-To: <44BF490B.1090101@voidspace.org.uk> References: <44BF490B.1090101@voidspace.org.uk> Message-ID: <e9nsq5$6m6$1@sea.gmane.org> Michael Foord schrieb: > Hello all, > > There may be a reasonable cause for this (i.e. it is likely to be my > fault) - but it is consistent across two different machines I have tried > it on. > > With Python 2.5b2 (from the msi at Python.org), running on Windows XP > Pro SP2, ``os.utime`` and ``os.chmod`` fail with WindowsError. The same > code runs fine on Python 2.3 and Python 2.4. > > [err] shutil.copytree(thisentry, targetdir) > [err] File "C:\Python25\lib\shutil.py", line 130, in copytree > [err] copystat(src, dst) > [err] File "C:\Python25\lib\shutil.py", line 67, in copystat > [err] os.utime(dst, (st.st_atime, st.st_mtime)) > [err] WindowsError: [Error 13] Access is denied: 'lib\\Pmw' > [err] > > The script uses ``shutil.copytree`` to copy a directory (using relative > paths). IMO this is a bug in Python 2.5, on Windows. The problem is that the call to 'copystat(src, dst)' was added to the shutil.copytree function, in svn r38363 probably. It will fail always on Windows, since os.utime does not work on directories (as the docs correctly explain). 
I guess that a patch similar to this one should fix it: Index: shutil.py =================================================================== --- shutil.py (Revision 50710) +++ shutil.py (Arbeitskopie) @@ -127,7 +127,12 @@ # continue with other files except Error, err: errors.extend(err.args[0]) - copystat(src, dst) + try: + copystat(src, dst) + except WindowsError: + pass + except OSError, err: + errors.extend(err.args[0]) if errors: raise Error, errors But you should report this to the bug tracker. Thomas From fuzzyman at voidspace.org.uk Thu Jul 20 14:43:48 2006 From: fuzzyman at voidspace.org.uk (Michael Foord) Date: Thu, 20 Jul 2006 13:43:48 +0100 Subject: [Python-Dev] os.utime and os.chmod failures (etc) Python 2.5b2 In-Reply-To: <e9nsq5$6m6$1@sea.gmane.org> References: <44BF490B.1090101@voidspace.org.uk> <e9nsq5$6m6$1@sea.gmane.org> Message-ID: <44BF7A84.2010908@voidspace.org.uk> Thomas Heller wrote: > Michael Foord schrieb: > >> Hello all, >> >> There may be a reasonable cause for this (i.e. it is likely to be my >> fault) - but it is consistent across two different machines I have tried >> it on. >> >> With Python 2.5b2 (from the msi at Python.org), running on Windows XP >> Pro SP2, ``os.utime`` and ``os.chmod`` fail with WindowsError. The same >> code runs fine on Python 2.3 and Python 2.4. >> >> [err] shutil.copytree(thisentry, targetdir) >> [err] File "C:\Python25\lib\shutil.py", line 130, in copytree >> [err] copystat(src, dst) >> [err] File "C:\Python25\lib\shutil.py", line 67, in copystat >> [err] os.utime(dst, (st.st_atime, st.st_mtime)) >> [err] WindowsError: [Error 13] Access is denied: 'lib\\Pmw' >> [err] >> >> The script uses ``shutil.copytree`` to copy a directory (using relative >> paths). >> > > IMO this is a bug in Python 2.5, on Windows. > The problem is that the call to 'copystat(src, dst)' was added to > the shutil.copytree function, in svn r38363 probably. 
It will fail > always on Windows, since os.utime does not work on directories (as the > docs correctly explain). > > I guess that a patch similar to this one should fix it: > > Index: shutil.py > =================================================================== > --- shutil.py (Revision 50710) > +++ shutil.py (Arbeitskopie) > @@ -127,7 +127,12 @@ > # continue with other files > except Error, err: > errors.extend(err.args[0]) > - copystat(src, dst) > + try: > + copystat(src, dst) > + except WindowsError: > + pass > + except OSError, err: > + errors.extend(err.args[0]) > if errors: > raise Error, errors > > But you should report this to the bug tracker. > Ok, thanks. Michael http://www.voidspace.org.uk/python/index.shtml > Thomas > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > http://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: http://mail.python.org/mailman/options/python-dev/fuzzyman%40voidspace.org.uk > > From mihaiibanescu at yahoo.com Thu Jul 20 15:49:05 2006 From: mihaiibanescu at yahoo.com (Mihai Ibanescu) Date: Thu, 20 Jul 2006 09:49:05 -0400 Subject: [Python-Dev] logging module broken because of locale In-Reply-To: <17599.22114.395451.103228@montanaro.dyndns.org> References: <20060717193955.GK3699@abulafia.devel.redhat.com> <44BD2054.4020705@v.loewis.de> <20060718185238.GB363@abulafia.devel.redhat.com> <200607181455.56535.fdrake@acm.org> <20060719212919.GD15030@abulafia.devel.redhat.com> <17599.22114.395451.103228@montanaro.dyndns.org> Message-ID: <20060720134905.GA25083@abulafia.devel.redhat.com> On Thu, Jul 20, 2006 at 05:09:38AM -0500, skip at pobox.com wrote: > > Misa> Good point. Does the attached patch look reasonable? > > ... > Misa> - self.when = string.upper(when) > Misa> + self.when = unicode(when).upper() > ... > > The use of the string module instead of string methods suggests to me that > the logging package attempts to work with older versions of Python. 
Looking > at PEP 291 it looks like 1.5.2 compatibility is desired (no string methods, > no unicode). I think a conscious decision by someone (probably Vinay Sajip) > to give up that compatibility would be required. Agreed. There is a note that the module should work with python >= 1.5.2 at the top of the module. It's up to Vinay to decide if we want to drop support for 1.5.2 in the module included in newer pythons, or the attached patch would make it work for 1.5.2 as well (as in "it's not more broken than before"). I would like to redo the patch once more to get rid of the try-except and use __builtins__ instead (but for some reason it kept jumping from being a module to being a dictionary and I just wanted the proof of concept). Misa -------------- next part -------------- --- Python-2.4.3/Lib/logging/handlers.py.nolocale 2006-07-19 12:15:46.000000000 -0400 +++ Python-2.4.3/Lib/logging/handlers.py 2006-07-20 09:45:57.000000000 -0400 @@ -162,7 +162,7 @@ """ def __init__(self, filename, when='h', interval=1, backupCount=0, encoding=None): BaseRotatingHandler.__init__(self, filename, 'a', encoding) - self.when = string.upper(when) + self.when = _upper(when) self.backupCount = backupCount # Calculate the real rollover interval, which is just the number of # seconds between rollovers. 
Also set the filename suffix used when @@ -645,7 +645,7 @@ """ msg = self.log_format_string % ( self.encodePriority(self.facility, - string.lower(record.levelname)), + _lower(record.levelname)), msg) try: if self.unixsocket: @@ -854,7 +854,7 @@ ("GET" or "POST") """ logging.Handler.__init__(self) - method = string.upper(method) + method = _upper(method) if method not in ["GET", "POST"]: raise ValueError, "method must be GET or POST" self.host = host @@ -1007,3 +1007,25 @@ self.flush() self.target = None BufferingHandler.close(self) + +def _upper(s): + """A version of upper() that tries to be locale-independent by converting + the string to unicode (which is not subject to case conversion being + locale specific) + """ + try: + ret = str(unicode(s).upper()) + except NameError: + ret = string.upper(s) + return ret + +def _lower(s): + """A version of lower() that tries to be locale-independent by converting + the string to unicode (which is not subject to case conversion being + locale specific) + """ + try: + ret = str(unicode(s).lower()) + except NameError: + ret = string.lower(s) + return ret From jimjjewett at gmail.com Thu Jul 20 17:39:27 2006 From: jimjjewett at gmail.com (Jim Jewett) Date: Thu, 20 Jul 2006 11:39:27 -0400 Subject: [Python-Dev] logging module broken because of locale Message-ID: <fb6fbf560607200839y75e6414eqe0f79c985999db60@mail.gmail.com> Mihai, It does make sense to document this limit for people writing subclasses, or using a Turkic codeset. I'm not sure that logging is the right place to document it, and I don't think changing the base classes is a good idea. TimedRotatingFileHandler and HTTPHandler restrict their input to a tiny subset of ASCII; anything that would be handled differently after these changes will already raise a ValueError unless you subclass. Even SysLogHandler.emit doesn't actually print the string; it is only used as a lookup key for a dictionary whose keys are all lower-case ASCII. 
In theory, you could monkey-patch that dictionary to add additional values, but then you might as well subclass to do the right thing with your new keys. (It *might* make sense to change the base function and dictionary to accept unicode, including undotted-i synonyms.) -jJ From martin at v.loewis.de Thu Jul 20 18:08:05 2006 From: martin at v.loewis.de (=?ISO-8859-1?Q?=22Martin_v=2E_L=F6wis=22?=) Date: Thu, 20 Jul 2006 18:08:05 +0200 Subject: [Python-Dev] logging module broken because of locale In-Reply-To: <20060720134905.GA25083@abulafia.devel.redhat.com> References: <20060717193955.GK3699@abulafia.devel.redhat.com> <44BD2054.4020705@v.loewis.de> <20060718185238.GB363@abulafia.devel.redhat.com> <200607181455.56535.fdrake@acm.org> <20060719212919.GD15030@abulafia.devel.redhat.com> <17599.22114.395451.103228@montanaro.dyndns.org> <20060720134905.GA25083@abulafia.devel.redhat.com> Message-ID: <44BFAA65.5040501@v.loewis.de> Mihai Ibanescu wrote: > It's up to Vinay to decide if we want to drop support for 1.5.2 in the module > included in newer pythons, or the attached patch would make it work for 1.5.2 > as well (as in "it's not more broken than before"). That still wouldn't work with Python 1.5.2, as that version did not support Unicode at all. 
Regards,
Martin

From mihaiibanescu at yahoo.com Thu Jul 20 18:13:14 2006
From: mihaiibanescu at yahoo.com (Mihai Ibanescu)
Date: Thu, 20 Jul 2006 12:13:14 -0400
Subject: [Python-Dev] logging module broken because of locale
In-Reply-To: <44BFAA65.5040501@v.loewis.de>
References: <20060717193955.GK3699@abulafia.devel.redhat.com> <44BD2054.4020705@v.loewis.de> <20060718185238.GB363@abulafia.devel.redhat.com> <200607181455.56535.fdrake@acm.org> <20060719212919.GD15030@abulafia.devel.redhat.com> <17599.22114.395451.103228@montanaro.dyndns.org> <20060720134905.GA25083@abulafia.devel.redhat.com> <44BFAA65.5040501@v.loewis.de>
Message-ID: <20060720161314.GI25083@abulafia.devel.redhat.com>

On Thu, Jul 20, 2006 at 06:08:05PM +0200, "Martin v. Löwis" wrote:
> Mihai Ibanescu wrote:
> > It's up to Vinay to decide if we want to drop support for 1.5.2 in the module
> > included in newer pythons, or the attached patch would make it work for 1.5.2
> > as well (as in "it's not more broken than before").
>
> That still wouldn't work with Python 1.5.2, as that version did not
> support Unicode at all.

Yes, as I said, it won't be more broken than before applying the patch (my first patch was breaking 1.5.2 completely).

If people choose to compile python 2.4.3 without unicode support, it won't work either. But if you choose to disable unicode you probably live in a very constrained environment and you may not be affected by the locale bug at all.

I agree the patch is not perfect :-) but I think it solves the problem for most people.
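
For comparison, another way to get case conversion that cannot depend on the C locale is to map only the ASCII letters explicitly. The following is a sketch in current Python 3 syntax, not the patch discussed in this thread; ascii_upper and ascii_lower are hypothetical helper names.

```python
import string

# Translation tables that touch only the 26 ASCII letters.
_TO_UPPER = str.maketrans(string.ascii_lowercase, string.ascii_uppercase)
_TO_LOWER = str.maketrans(string.ascii_uppercase, string.ascii_lowercase)


def ascii_upper(s):
    """Uppercase a-z only; every other character is left unchanged."""
    return s.translate(_TO_UPPER)


def ascii_lower(s):
    """Lowercase A-Z only; every other character is left unchanged."""
    return s.translate(_TO_LOWER)
```

Since "i" always maps to "I" in these tables, values such as "midnight" or "INFO" convert the same way whatever locale is active, which is all the handlers need for their small ASCII vocabularies.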
Misa From mihaiibanescu at yahoo.com Thu Jul 20 18:19:02 2006 From: mihaiibanescu at yahoo.com (mihaiibanescu at yahoo.com) Date: Thu, 20 Jul 2006 12:19:02 -0400 Subject: [Python-Dev] logging module broken because of locale In-Reply-To: <fb6fbf560607200839y75e6414eqe0f79c985999db60@mail.gmail.com> References: <fb6fbf560607200839y75e6414eqe0f79c985999db60@mail.gmail.com> Message-ID: <20060720161902.GJ25083@abulafia.devel.redhat.com> On Thu, Jul 20, 2006 at 11:39:27AM -0400, Jim Jewett wrote: > Mihai, > > It does make sense to document this limit for people writing > subclasses, or using a Turkic codeset. I'm not sure that logging is > the right place to document it, and I don't think changing the base > classes is a good idea. > > TimedRotatingFileHandler and HTTPHandler restrict their input to a > tiny subset of ASCII; anything that would be handled differently after > these changes will already raise a ValueError unless you subclass. > > Even SysLogHandler.emit doesn't actually print the string; it is only > used as a lookup key for a dictionary whose keys are all lower-case > ASCII. In theory, you could monkey-patch that dictionary to add > additional values, but then you might as well subclass to do the right > thing with your new keys. (It *might* make sense to change the base > function and dictionary to accept unicode, including undotted-i > synonyms.) Hi Jim, I am afraid you lost me. My initial approach was to compute the values of the dictionary based on the current locale - and that doesn't work. The issue is not that the dictionary doesn't accept unicode, it's that there's no reliable way to do lookups in it. If you could sketch an example of what you had in mind with the monkey-patching, please do so, since I don't think I grasped the idea. 
Misa From martin at v.loewis.de Thu Jul 20 18:41:04 2006 From: martin at v.loewis.de (=?ISO-8859-1?Q?=22Martin_v=2E_L=F6wis=22?=) Date: Thu, 20 Jul 2006 18:41:04 +0200 Subject: [Python-Dev] logging module broken because of locale In-Reply-To: <20060720161314.GI25083@abulafia.devel.redhat.com> References: <20060717193955.GK3699@abulafia.devel.redhat.com> <44BD2054.4020705@v.loewis.de> <20060718185238.GB363@abulafia.devel.redhat.com> <200607181455.56535.fdrake@acm.org> <20060719212919.GD15030@abulafia.devel.redhat.com> <17599.22114.395451.103228@montanaro.dyndns.org> <20060720134905.GA25083@abulafia.devel.redhat.com> <44BFAA65.5040501@v.loewis.de> <20060720161314.GI25083@abulafia.devel.redhat.com> Message-ID: <44BFB220.8060802@v.loewis.de> Mihai Ibanescu wrote: > Yes, as I said, it won't be more broken than before applying the patch (my > first patch was breaking 1.5.2 completely). Ah, I didn't notice that it deals with unicode() not being a builtin. That's fine then. Regards, Martin From brett at python.org Thu Jul 20 19:04:43 2006 From: brett at python.org (Brett Cannon) Date: Thu, 20 Jul 2006 10:04:43 -0700 Subject: [Python-Dev] new security doc using object-capabilities In-Reply-To: <012801c6abe8$5f9524d0$d503030a@trilan> References: <bbaeab100607191535p543cb0ddj66410cf985dd9b77@mail.gmail.com> <012801c6abe8$5f9524d0$d503030a@trilan> Message-ID: <bbaeab100607201004j324b3856jb43b3cc3c3ec22c8@mail.gmail.com> On 7/20/06, Giovanni Bajo <rasky at develer.com> wrote: > > Brett Cannon wrote: > > >> The new doc is named securing_python.txt and > >> can be > >> found through the svn web interface at > >> > > http://svn.python.org/view/python/branches/bcannon-sandboxing/securing_python.txt?rev=50717&view=log > . > > How do you plan to handle CPU-hogs? Stuff like execution of a gigantic > integer multiplication. I don't. =) Protecting the CPU is damn hard to do in any form of portable fashion. 
And even getting it to work on an OS you do know the details of leads to probably an interrupt implementation and that doesn't sound fun.

-Brett
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/python-dev/attachments/20060720/4b9938f9/attachment.html

From jimjjewett at gmail.com Thu Jul 20 19:06:32 2006
From: jimjjewett at gmail.com (Jim Jewett)
Date: Thu, 20 Jul 2006 13:06:32 -0400
Subject: [Python-Dev] logging module broken because of locale
In-Reply-To: <20060720161902.GJ25083@abulafia.devel.redhat.com>
References: <fb6fbf560607200839y75e6414eqe0f79c985999db60@mail.gmail.com> <20060720161902.GJ25083@abulafia.devel.redhat.com>
Message-ID: <fb6fbf560607201006k171f810am91b1425b02dfddfd@mail.gmail.com>

On 7/20/06, mihaiibanescu at yahoo.com <mihaiibanescu at yahoo.com> wrote:
> On Thu, Jul 20, 2006 at 11:39:27AM -0400, Jim Jewett wrote:

> > Even SysLogHandler.emit doesn't actually print the string; it is only
> > used as a lookup key for a dictionary whose keys are all lower-case
> > ASCII. In theory, you could monkey-patch that dictionary to add
> > additional values, but then you might as well subclass to do the right
> > thing with your new keys. (It *might* make sense to change the base
> > function and dictionary to accept unicode, including undotted-i
> > synonyms.)

> My initial approach was to compute the values of the dictionary based on the
> current locale - and that doesn't work.

Correct, it would need to be done by hand.

As part of the current class construction,

    SysLogHandler.priority_names["info"] = SysLogHandler.LOG_INFO

You could add other entries, so that

    SysLogHandler.priority_names["my_info"] = SysLogHandler.LOG_INFO

If you also changed SysLogHandler.encodePriority to accept unicode keys, you could even add u"info" and the equivalent with an undotted-i.

I'm not sure this is worth doing, though, since they supposedly represent symbolic constants.
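
The additions described above might look like the following sketch, written against logging.handlers as it ships in current Python 3. The alias names and the add_priority_aliases helper are hypothetical, and patching a class-level dictionary affects every SysLogHandler in the process.

```python
from logging.handlers import SysLogHandler

# Hypothetical aliases, not part of the standard library.
ALIASES = {
    "my_info": "info",
    "\u0131nfo": "info",  # "info" spelled with U+0131, the dotless i
}


def add_priority_aliases(aliases=ALIASES):
    """Register each alias under the numeric value of an existing name."""
    names = SysLogHandler.priority_names
    for alias, existing in aliases.items():
        names[alias] = names[existing]


add_priority_aliases()
```

After this runs, a lookup with the dotless-i spelling finds the same numeric priority as "info". It does nothing for spellings nobody thought to add by hand, which is the limitation raised in the reply.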
-jJ > The issue is not that the dictionary doesn't accept unicode, it's that there's > no reliable way to do lookups in it. If you could sketch an example of what > you had in mind with the monkey-patching, please do so, since I don't think I > grasped the idea. > > Misa > From brett at python.org Thu Jul 20 19:09:07 2006 From: brett at python.org (Brett Cannon) Date: Thu, 20 Jul 2006 10:09:07 -0700 Subject: [Python-Dev] new security doc using object-capabilities In-Reply-To: <44BF6C59.6010707@gmail.com> References: <bbaeab100607191535p543cb0ddj66410cf985dd9b77@mail.gmail.com> <44BF6C59.6010707@gmail.com> Message-ID: <bbaeab100607201009x28a7354elde374c6651b1a322@mail.gmail.com> On 7/20/06, Nick Coghlan <ncoghlan at gmail.com> wrote: > > For code objects, their construction is already commonly written as > "compile(source)". Right, but some people like to construct directly from bytecode. For type objects, the constructor doesn't let you do anything you can't > already do with a class statement. It doesn't need securing. I figured as much, but when I was making the list I was not sure and didn't want to stop my writing momentum to check. For rewriting import.c in Python, the PEP 302 compliant import system API in > pkgutil would be a good starting point. Yep. Plan on looking at all of the various modules in the stdlib that assist with importing, package PEP (I think there is one), and PEP 302. Your doc also asks about the imp.get_suffixes() list, and wonder where to > set > it from Python. > > As far as I am aware, you can't. get_suffixes() is built from > _PyImport_FileTab, which is a C array. A switch statement is used to get > from > the file table entries to the appropriate handler functions. Ah, OK. 
Quoting from the suggestions I put to the Py3k list: > > Use smarter data structures > --------------------------- > Currently, the individual handlers to load a fully identified module are > exposed to Python code in a way that reflects the C-style data structures > used > in the current implementation. > > Simply switching to more powerful data structures for the file type > handlers > (i.e. use a PyTuple for filedescr values, a PyList for _PyImport_FileTab, > and > a PyDict instead of a switch statement to go from filedescr values to > module > loading/initialisation functions) and manipulating them all as normal > Python > objects could make the code in import.c much easier to follow. Yep. I just kind of glanced at the rest of your suggestions, Nick, since I assumed a lot of it would change (or could be changed) if import was redone in as much Python as possible. Extensible file type handling > ----------------------------- > If the file type handlers are stored in normal Python data structures as > described above, it becomes feasible to make the import system extensible > to > different file types as well as to different file locations. Yep. Although I am more interested in restricting than broadening the file types. This could be handled on a per-package basis, e.g. via a __file_types__ > special attribute in packages. Maybe. I don't want to get into introducing new abilities to start, though. -Brett -------------- next part -------------- An HTML attachment was scrubbed... 
URL: http://mail.python.org/pipermail/python-dev/attachments/20060720/32151d26/attachment-0001.htm From brett at python.org Thu Jul 20 19:30:03 2006 From: brett at python.org (Brett Cannon) Date: Thu, 20 Jul 2006 10:30:03 -0700 Subject: [Python-Dev] new security doc using object-capabilities In-Reply-To: <9eebf5740607200117r4d4613e2i91665ea211bab46@mail.gmail.com> References: <bbaeab100607191535p543cb0ddj66410cf985dd9b77@mail.gmail.com> <9eebf5740607200117r4d4613e2i91665ea211bab46@mail.gmail.com> Message-ID: <bbaeab100607201030l2842b416wcfd8007c86c10e4f@mail.gmail.com> On 7/20/06, Lawrence Oluyede <l.oluyede at gmail.com> wrote: > > That's great. I just read your draft but I have little comments to do > but before let me say that I liked the idea to borrow concepts from E. > I've crossed the E's path in the beginning of this year and I found it > a pot of really nice ideas (for promises and capabilities). Here are > my comments about the draft: > > - it's not really clear to me what the "powerbox" is. I think I got > the concept of "super process" but maybe it's to be clarified, isn't > it? It become clear in the "threat model" paragraph The powerbox is the thing that gives your security domains their initial abilities. The OS gives the process its abilities, but it does not directly work with the interpreter. Since the process does, though, it is considered the powerbox and farms out abilities that it has been given by the OS. I have tried to clarify the definition at the start of the doc. - I hope no Rubystas will read the "Problem of No Private Namespace" > section because they have private/protected keywords to enforce this > stuff :-) Writing proxies in C will slow down the dev process (altough > will speed up the performance maybe) but in a far future someone will > come up with an alternative closer to the Python level Maybe. 
As I said in the doc, any changes must be Pythonic and adding private namespaces right now wouldn't be without much more thought and work. And if Ruby ends up with this security model but more thoroughly, more power to them. Their language is different in the right ways to support it. As for coding in C, thems the breaks. I plan in adding stuff to the stdlib for the common case. I might eventually think of a good, generic proxy object that could be used, but as of right now I am not worrying about that since it would be icing on the cake. - Can you write down a simple example of what you mean with "changing > something of the built-in objects"? (in "Problem of mutable shared > state") Done. - What about the performance issues of the capabilities model overall? Should be faster than an IBAC model since certain calls will not need to check the identity of the caller every time. But I am not worrying about performance, I am worrying about correctness, so I did not try to make any performance claims. - I know what you meant to say but the paragraph about pythonicness > and the security model seems a little "fuzzy" to me. Which are the > boundaries of the allowed changes for the security stuff? Being "pythonic" is a fuzzy term in itself and Guido is the only person who can make definitive claims over what is and is not Pythonic. As I have said, this doc was mostly written with python-dev in mind since they are the ones I have to convince to let this into the core and they all know the term. But I have tacked in a sentence on what the term means. - You don't say anything about networking and networked resources in > the list of the standard sandboxed interpreter Nope. Have not started worrying about that yet. Just trying to get the basic model laid out. - Suppose we have a .py module. Based on your security model we can > import it, right? When imported it generates a .pyc file. The second > time we import it what happens? .pyc is ignored? import is not allowed > at all? 
We can't rely on the name of the file.pyc because an attacker > who knows the file.py is secure and the second import is done against > file.pyc can replace the "secure" file.pyc with an implementation not > secure and can do some kind of harm to the sandbox It will be ignored. But I am hoping that through rewriting the import machinery more control over generating .pyc files can be had (see Skip Montanaro's PEP on this; forget the number). This is why exact details were left out of the implementation details. I just wanted people understand the approach to everything, not the concrete details of how it will be coded up. - About "Filesystem information". Does the sandboxed interpreter need > to know all that information about file paths, files and so on? Can't > we reset those attributes to something arbitrary? That is the point. It is not that the sandbox needs to know it, its that it needs to be hidden from the sandbox. - About sys module: I think the best way is to have a purged fake sys > module with only the stuff you need. pypy has the concept of faked > modules too (altough for a different reason) OK. - About networking: what do you think about the E's model of really > safe networking, protected remotable objects and safe RPC? Is that > model applicable to Python's in some way? We can't use the E's model > as a whole (ask people to generate a safe key and send it by email is > unfeasible) I have not looked at it. I am also not trying to build an RPC system *and* a security model for Python. That is just too much work right now. - is the protected memory model a some kind of memory monitor system? Basically. It just keeps a size_t on the memory cap and another on memory usage, and when memory is requested it makes sure that it won't go over the cap. And when memory is freed the usage goes down. It's very rough (hard to account for padding bits, etc. 
in C structs), but it should be good enough to prevent a program from hitting 800 MB when you really just wanted it to have 5 MB.

> I think that's all for the draft. I wrote these comments during the
> reading of the document.
>
> Hope some of these help

Thanks, Lawrence.

-Brett

From rasky at develer.com Thu Jul 20 19:40:46 2006
From: rasky at develer.com (Giovanni Bajo)
Date: Thu, 20 Jul 2006 19:40:46 +0200
Subject: [Python-Dev] new security doc using object-capabilities
References: <bbaeab100607191535p543cb0ddj66410cf985dd9b77@mail.gmail.com> <012801c6abe8$5f9524d0$d503030a@trilan> <bbaeab100607201004j324b3856jb43b3cc3c3ec22c8@mail.gmail.com>
Message-ID: <084e01c6ac23$a1930ec0$d503030a@trilan>

Brett Cannon wrote:

>> http://svn.python.org/view/python/branches/bcannon-sandboxing/securing_python.txt?rev=50717&view=log
>> .
>>
>> How do you plan to handle CPU-hogs? Stuff like execution of a
>> gigantic integer multiplication.
>
>
> I don't. =) Protecting the CPU is damn hard to do in any form of
> portable fashion. And even getting it to work on an OS you do know
> the details of leads to probably an interrupt implementation and
> that doesn't sound fun.

I think the trick used by the safe_eval recipe (a separate thread which interrupts the script through thread.interrupt_main()) shows that, in most cases, it's possible to make sure that an embedded script does not take too long to execute. Do you agree that this use case ("allow me to time out an embedded script") is something which would be a very good start in the right direction?

Now, I wonder, in a restricted execution environment such as that depicted in your document, how many different ways are there to make the Python interpreter enter a long calculation loop which does not release the GIL?
I can think of bignum*bignum, bignum**bignum or similar mathematical operations, but there are really a few. If we could make those release the GIL (or poll some kind of watchdog used to abort them, pretty much like they normally poll CTRL+C), then the same trick used by the recipe could be used. -- Giovanni Bajo From brett at python.org Thu Jul 20 20:10:29 2006 From: brett at python.org (Brett Cannon) Date: Thu, 20 Jul 2006 11:10:29 -0700 Subject: [Python-Dev] new security doc using object-capabilities In-Reply-To: <084e01c6ac23$a1930ec0$d503030a@trilan> References: <bbaeab100607191535p543cb0ddj66410cf985dd9b77@mail.gmail.com> <012801c6abe8$5f9524d0$d503030a@trilan> <bbaeab100607201004j324b3856jb43b3cc3c3ec22c8@mail.gmail.com> <084e01c6ac23$a1930ec0$d503030a@trilan> Message-ID: <bbaeab100607201110i4e4c924fvbc57425a7e4b3a36@mail.gmail.com> On 7/20/06, Giovanni Bajo <rasky at develer.com> wrote: > > Brett Cannon wrote: > > >> > > http://svn.python.org/view/python/branches/bcannon-sandboxing/securing_python.txt?rev=50717&view=log > >> . > >> > >> How do you plan to handle CPU-hogs? Stuff like execution of a > >> gigantic integer multiplication. > > > > > > I don't. =) Protecting the CPU is damn hard to do in any form of > > portable fashion. And even getting it to work on an OS you do know > > the details of leads to probably an interrupt implementation and > > that doesn't sound fun. > > I think the trick used by the safe_eval recipe (a separate thread which > interrupts the script through thread.interrupt_main()) shows that, in most > cases, it's possible to make sure that an embedded script does not take > too > long to execute. Do you agree that this usage case ("allow me to timeout > an > embedded script") is something which would be a very good start in the > right > direction? Probably. I just don't feel like worrying about it right now. 
=)

> Now, I wonder, in a restricted execution environment such as that depicted
> in your document, how many different ways are there to make the Python
> interpreter enter a long calculation loop which does not release the GIL? I
> can think of bignum*bignum, bignum**bignum or similar mathematical
> operations, but there are really a few. If we could make those release the
> GIL (or poll some kind of watchdog used to abort them, pretty much like they
> normally poll CTRL+C), then the same trick used by the recipe could be
> used.

Well, any work that does most of its calculation within C code and that does not touch base with the interpreter on a semi-regular basis would need to release the GIL.

-Brett

From pje at telecommunity.com Thu Jul 20 20:57:07 2006
From: pje at telecommunity.com (Phillip J. Eby)
Date: Thu, 20 Jul 2006 14:57:07 -0400
Subject: [Python-Dev] Undocumented PEP 302 protocol change by need-for-speed sprint
Message-ID: <5.1.1.6.0.20060720144252.02622318@sparrow.telecommunity.com>

While investigating the need to apply http://python.org/sf/1525766 I found that there was a modification to pkgutil during the need-for-speed sprint that affects the PEP 302 protocol in a backwards-incompatible way.

Specifically, PEP 302 documents that path_importer_cache always contains either importer objects or None. Any code written to obtain importer objects is therefore now broken, because import.c is slapping False in for non-existent filesystem paths.

The pkgutil module was then hacked to work around this problem, thereby hiding the breakage from at least the standard library, but not any external libraries that follow the PEP 302 protocol to find importers.

There are several options as to how to proceed:

1. Revert the change
2.
Document the breakage, update PEP 302, and make everybody update their code 3. Make it not break existing code, by using a NonexistentPathImporter or NullImporter type in place of "False" in sys.path_importer_cache. Any thoughts? Personally, the only code I know of that implements the PEP 302 protocol besides the pkgutil module that would be affected is pkg_resources in setuptools, so it's not like I can't fix it for 2.5. However, I don't know if anybody else is using the protocol, and if so, how bad the breakage would be. This should really only affect code that is walking sys.path, because paths with "False" in sys.path_importer_cache by definition cannot have any importable modules associated with them. So, although I don't like option 2 on general principles, it may be an acceptable solution. From tjreedy at udel.edu Thu Jul 20 21:00:45 2006 From: tjreedy at udel.edu (Terry Reedy) Date: Thu, 20 Jul 2006 15:00:45 -0400 Subject: [Python-Dev] new security doc using object-capabilities References: <bbaeab100607191535p543cb0ddj66410cf985dd9b77@mail.gmail.com> <9eebf5740607200117r4d4613e2i91665ea211bab46@mail.gmail.com> Message-ID: <e9ojsu$ui1$1@sea.gmane.org> "Lawrence Oluyede" <l.oluyede at gmail.com> wrote in message news:9eebf5740607200117r4d4613e2i91665ea211bab46 at mail.gmail.com... > - I know what you meant to say but the paragraph about pythonicness > and the security model seems a little "fuzzy" to me. I agree that this paragraph is weak and recommend that it be rewritten. In particular, I think the 'pythonic*' words should go, especially if you expect this document to be read by anyone other than dedicated pythonistas. I would start with something like "It is my goal that my thesis work be incorporated in some future version of the Python distribution. This has two constraints. First, changes to the core must not slow down normal operation. Second, visible changes must not violate the spirit and style of Python that make it a distinctive language." 
This alludes to the fact that your proposal discusses two highly overlapping yet separate projects: write a thesis that gains you a PhD degree; and produce an accepted patch set that give Python a useful security capability it does not now have. They have to be thought of as somewhat separate because you have two sets of 'overseers' and approvers: your thesis advisor and committee for the first; and Guido and other Python developers for the second. I think your thesis should currently be your first priority. Your current paragraph implied to me that you would not follow a promising line of research if you could not see how to make it 'pythonic'. If I were on your thesis committee, I think that would bother me ;-). In any case, I wish you the best with a double project that is obviously not a 'gimme'. Terry Jan Reedy (PhD, though not on any thesis committees) From l.oluyede at gmail.com Thu Jul 20 21:20:17 2006 From: l.oluyede at gmail.com (Lawrence Oluyede) Date: Thu, 20 Jul 2006 21:20:17 +0200 Subject: [Python-Dev] new security doc using object-capabilities In-Reply-To: <bbaeab100607201030l2842b416wcfd8007c86c10e4f@mail.gmail.com> References: <bbaeab100607191535p543cb0ddj66410cf985dd9b77@mail.gmail.com> <9eebf5740607200117r4d4613e2i91665ea211bab46@mail.gmail.com> <bbaeab100607201030l2842b416wcfd8007c86c10e4f@mail.gmail.com> Message-ID: <9eebf5740607201220t3edcb31k4b08d6b27f1546d6@mail.gmail.com> > Should be faster than an IBAC model since certain calls will not need to > check the identity of the caller every time. > > But I am not worrying about performance, I am worrying about correctness, so > I did not try to make any performance claims. Got that. > Nope. Have not started worrying about that yet. Just trying to get the > basic model laid out. Ok sorry to have bothered > That is the point. It is not that the sandbox needs to know it, its that it > needs to be hidden from the sandbox. So I think that's a "simple" step during the importing step. 
> I have not looked at it. I am also not trying to build an RPC system *and* > a security model for Python. That is just too much work right now. Ok sorry :-) > Thanks, Lawrence. Thank you! -- Lawrence http://www.oluyede.org/blog From barry at python.org Thu Jul 20 21:27:39 2006 From: barry at python.org (Barry Warsaw) Date: Thu, 20 Jul 2006 15:27:39 -0400 Subject: [Python-Dev] Support for PyGetSetDefs in pydoc In-Reply-To: <7FE62713-24B0-47CE-BA94-41150DB683F4@python.org> References: <A027CA35-EE77-4658-BA59-E849368C440B@python.org> <EF6E6AF2-654F-4F53-91A8-A6C9B8A80745@python.org> <44B64782.9010706@gmail.com> <7FE62713-24B0-47CE-BA94-41150DB683F4@python.org> Message-ID: <AB1C2209-329E-4F3D-8B7A-2548A40F892A@python.org> -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 On Jul 13, 2006, at 12:12 PM, Barry Warsaw wrote: > I've updated SF patch #1520294 and assigned it back to Georg for > another quick review. Neal commented in the patch that it might help to explain the implementation a bit. I'd like to do that and also explain why I think this patch should go in 2.5. OTOH, I suspect most people just don't care, which is why I've gotten almost no comments on the patch (other than one or two mild nods of approval). As a reminder, this patch is really about providing useful help() at the interactive prompt for two specific types of C implemented descriptors, getset and member descriptors. Both objects can be given docstrings in the C code, but currently nothing in the tool chain pulls out and displays those docstrings, so help() on such objects is not very useful. While certainly a minor point, I think it improves the user experience to be able to display any existing docstrings for these types of objects. Of course, I'm scratching my own itch here because our app is deeply embedded and we have a lot of getset and member descriptors with docstrings. 
I'd like for our users to be able to type "help(someobj.member)" at our
interactive prompt and get our docstring printed back at them.

The implementation is fairly straightforward: I add two new constants to
types.py, a couple of is*() methods to inspect.py, and a bit of extra
support to pydoc.py. Where it gets complicated is that I added a _types
builtin module to provide a getset and member descriptor that could be
turned into *Type constants in the types.py module.

Why did I do this instead of trying to hunt down some existing getset
or member descriptor? For one thing, there really aren't very good
candidates for such objects in the built-in modules. You can't use
objects like datetime.timedelta.days in types.py because datetime is
not importable early enough for types.py to use it. Even if there
were likely candidates, they would be accidents of implementation and
it doesn't seem like a good idea to force some future datetime
maintainer to have to fix types.py when she decides that
datetime.timedelta.days should be implemented in some other way.

A 3rd party extension module doesn't work either because you really
need the tie-in from types.py, inspect.py, and pydoc.py. You
certainly don't want to go poking new values into types.py from the
outside, and besides inspect.py and pydoc.py also need to learn about
these fundamental built-in Python types.

ISTM the most straightforward approach is to provide these types in a
built-in C module, linked into the interpreter so types.py can get
access to it as early as it needs to. Also, because the type that
these descriptors are attached to cannot be instantiated from Python,
they should be quite benign, existing only in order to give type()
something to query.

Neal also questions the reliance on the types.py module and whether
we're moving away from it. My own feeling is that certainly for
common types available as type constructors in the built-in namespace (e.g.
list, set, dict, etc.), no, we certainly don't need types.py. But there are enough other tools in Pythonland that need to introspect about types that aren't readily available, and I see no reason why types.py shouldn't continue to fulfill those needs. How else would you find the type of frames or generators in a simple, documented, consistent way? Neal also asks about other implementations. I believe that I've patched types.py, inspect.py, and pydoc.py in such a way that if the implementation did not have these types available, those modules would not break. For example, GetSetDescriptorType and MemberDescriptorType are not defined if _types can't be imported. If those types aren't defined, the appropriate inspect.py is() methods will always return False. Thus, other implementations can do hasattr () tests on types.py or better yet, inspect.isgetsetdescriptor() and get an answer it can deal with. Further, by providing a _types built-in module, other implementations /could/ expose those types, if they were meaningful. I believe that an _types module provides a good place for those alternative implementations to hide types of their own should the need arise (e.g. if Jython had some implementation-specific JavaSpecialUnicornType, the object would live in Jython's _types.java and be exposed via a type(_type.javaspecialunicorn) in types.py). Could this wait until Python 2.6? Well, it /could/ but IMO it shouldn't. Improving the user experience by providing useful help() shouldn't be postponed when we have the opportunity to do so now, with low risk of breakage. Probably the biggest risk is whether I can modify the Windows build to DTRT with _types.c. I've been building enough Windows s/w lately that I think I can handle it, and if not, I'll just buy the Timbot lunch as payment for his help. I know what he likes to eat. So anyway, that's probably more text than the patch combined. :) I hope it helps. 
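
The fallback behaviour described above (the inspect checks simply answer
False when an implementation does not expose these types) can be sketched
roughly as follows. This is only an illustration, not code from the patch:
it is written for present-day Python 3 rather than the 2.5 beta under
discussion, the names Point and describe are made up for the example, and
the comments about which objects are getset or member descriptors are
assumptions that hold on CPython.

```python
import inspect
import types


class Point(object):
    # On CPython, each __slots__ entry is exposed as a member descriptor.
    __slots__ = ("x", "y")


def describe(obj):
    """Classify obj, degrading gracefully if the checks are unavailable."""
    # Hypothetical helper: if an implementation lacks these inspect
    # functions, fall back to a check that always answers False.
    is_getset = getattr(inspect, "isgetsetdescriptor", lambda o: False)
    is_member = getattr(inspect, "ismemberdescriptor", lambda o: False)
    if is_getset(obj):
        return "getset descriptor"
    if is_member(obj):
        return "member descriptor"
    return "something else"


if __name__ == "__main__":
    # CPython assumption: a function's __code__ attribute is a getset
    # descriptor when looked up on the function type itself.
    print(describe(types.FunctionType.__code__))
    print(describe(Point.x))
    print(describe(42))
```

A caller that only wants to know whether help() has a docstring to show
could combine describe() with inspect.getdoc() on the same object.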
- -Barry -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.3 (Darwin) iQCVAwUBRL/ZL3EjvBPtnXfVAQKEvgQAktmdjKnq5uf1+m5Wc5F1ZFY/LhdM1vQQ A2v6zgXTxdQteqTNds15BktGAUwRNKKB2pU/U2wTcfLx/4rfjvkFG3cRbhEY8jI0 wa8wzPrGs17VTp8Z/LI65F9DC8aMEa9RBr2fXeuhewXlAheiZT9doelds8b5y0T/ Ed9Yl4MJ0es= =Fh1P -----END PGP SIGNATURE----- From brett at python.org Thu Jul 20 21:28:46 2006 From: brett at python.org (Brett Cannon) Date: Thu, 20 Jul 2006 12:28:46 -0700 Subject: [Python-Dev] Undocumented PEP 302 protocol change by need-for-speed sprint In-Reply-To: <5.1.1.6.0.20060720144252.02622318@sparrow.telecommunity.com> References: <5.1.1.6.0.20060720144252.02622318@sparrow.telecommunity.com> Message-ID: <bbaeab100607201228y797b601alac230a13c00c9b59@mail.gmail.com> On 7/20/06, Phillip J. Eby <pje at telecommunity.com> wrote: > > While investigating the need to apply http://python.org/sf/1525766 I found > that there was a modification to pkgutil during the need-for-speed sprint > that affects the PEP 302 protocol in a backwards incompatible way. > > Specifically, PEP 302 documents that path_importer_cache always contains > either importer objects or None. Any code written to obtain importer > objects is therefore now broken, because import.c is slapping False in for > non-existent filesystem paths. > > The pkgutil module was then hacked to work around this problem, thereby > hiding the breakage from at least the standard library, but not any > external libraries that follow the PEP 302 protocol to find importers. > > There are several options as to how to proceed: > > 1. Revert the change > 2. Document the breakage, update PEP 302, and make everybody update their > code > 3. Make it not break existing code, by using a NonexistentPathImporter or > NullImporter type in place of "False" in sys.path_importer_cache. > > Any thoughts? Revert it. Is it really that much of a bonus to use False over None? Both evaluate to false and both are already singleton so you can use 'is' for testing. 
-Brett

From g.brandl at gmx.net Thu Jul 20 21:32:58 2006
From: g.brandl at gmx.net (Georg Brandl)
Date: Thu, 20 Jul 2006 21:32:58 +0200
Subject: [Python-Dev] Support for PyGetSetDefs in pydoc
In-Reply-To: <AB1C2209-329E-4F3D-8B7A-2548A40F892A@python.org>
References: <A027CA35-EE77-4658-BA59-E849368C440B@python.org> <EF6E6AF2-654F-4F53-91A8-A6C9B8A80745@python.org> <44B64782.9010706@gmail.com> <7FE62713-24B0-47CE-BA94-41150DB683F4@python.org> <AB1C2209-329E-4F3D-8B7A-2548A40F892A@python.org>
Message-ID: <e9olpa$4ql$1@sea.gmane.org>

Barry Warsaw wrote:

> Why did I do this instead of trying to hunt down some existing getset
> or member descriptor? For one thing, there really aren't very good
> candidates for such objects in the built-in modules. You can't use
> objects like datetime.timedelta.days in types.py because datetime is
> not importable early enough for types.py to use it. Even if there
> were likely candidates, they would be accidents of implementation and
> it doesn't seem like a good idea to force some future datetime
> maintainer to have to fix types.py when she decides that
> datetime.timedelta.days should be implemented in some other way.
>
> A 3rd party extension module doesn't work either because you really
> need the tie-in from types.py, inspect.py, and pydoc.py. You
> certainly don't want to go poking new values into types.py from the
> outside, and besides inspect.py and pydoc.py also need to learn about
> these fundamental built-in Python types.
>
> ISTM the most straightforward approach is to provide these types in a
> built-in C module, linked into the interpreter so types.py can get
> access to it as early as it needs to.
Also, because the type that > these descriptors are attached to cannot be instantiated from Python, > they should be quite benign, existing only in order to give type() > something to query. Perhaps you could put the objects into _testcapi. That way no new module has to be deployed (is _testcapi installed on every system?) Georg From pje at telecommunity.com Thu Jul 20 21:37:25 2006 From: pje at telecommunity.com (Phillip J. Eby) Date: Thu, 20 Jul 2006 15:37:25 -0400 Subject: [Python-Dev] Undocumented PEP 302 protocol change by need-for-speed sprint In-Reply-To: <bbaeab100607201228y797b601alac230a13c00c9b59@mail.gmail.co m> References: <5.1.1.6.0.20060720144252.02622318@sparrow.telecommunity.com> <5.1.1.6.0.20060720144252.02622318@sparrow.telecommunity.com> Message-ID: <5.1.1.6.0.20060720153329.0202d570@sparrow.telecommunity.com> At 12:28 PM 7/20/2006 -0700, Brett Cannon wrote: >On 7/20/06, Phillip J. Eby ><<mailto:pje at telecommunity.com>pje at telecommunity.com> wrote: >>While investigating the need to apply >><http://python.org/sf/1525766>http://python.org/sf/1525766 I found >>that there was a modification to pkgutil during the need-for-speed sprint >>that affects the PEP 302 protocol in a backwards incompatible way. >> >>Specifically, PEP 302 documents that path_importer_cache always contains >>either importer objects or None. Any code written to obtain importer >>objects is therefore now broken, because import.c is slapping False in for >>non-existent filesystem paths. >> >>The pkgutil module was then hacked to work around this problem, thereby >>hiding the breakage from at least the standard library, but not any >>external libraries that follow the PEP 302 protocol to find importers. >> >>There are several options as to how to proceed: >> >>1. Revert the change >>2. Document the breakage, update PEP 302, and make everybody update their >>code >>3. 
Make it not break existing code, by using a NonexistentPathImporter or >>NullImporter type in place of "False" in sys.path_importer_cache. >> >>Any thoughts? > >Revert it. Is it really that much of a bonus to use False over >None? Both evaluate to false and both are already singleton so you can >use 'is' for testing. The changed code still uses None. PEP 302 defines None as meaning that a sys.path entry does not have an importer. It's just that the need-for-speed patch *adds* the use of True and False. None still means "no importer", but True now means "no importer, path exists" and False now means "no importer, path does not exist". The idea is that import.c can then skip checking the existence of the path when it sees True or False, but it then means that code that gets data from path_importer_cache needs to know about these new special values, or else it will get an attribute error when it tries to call True.find_module(). From brett at python.org Thu Jul 20 21:58:42 2006 From: brett at python.org (Brett Cannon) Date: Thu, 20 Jul 2006 12:58:42 -0700 Subject: [Python-Dev] Undocumented PEP 302 protocol change by need-for-speed sprint In-Reply-To: <5.1.1.6.0.20060720153329.0202d570@sparrow.telecommunity.com> References: <5.1.1.6.0.20060720144252.02622318@sparrow.telecommunity.com> <5.1.1.6.0.20060720153329.0202d570@sparrow.telecommunity.com> Message-ID: <bbaeab100607201258sba22a52u4d713c07f92c4775@mail.gmail.com> On 7/20/06, Phillip J. Eby <pje at telecommunity.com> wrote: > > At 12:28 PM 7/20/2006 -0700, Brett Cannon wrote: > >On 7/20/06, Phillip J. Eby > ><<mailto:pje at telecommunity.com>pje at telecommunity.com> wrote: > >>While investigating the need to apply > >><http://python.org/sf/1525766>http://python.org/sf/1525766 I found > >>that there was a modification to pkgutil during the need-for-speed > sprint > >>that affects the PEP 302 protocol in a backwards incompatible way. 
> >> > >>Specifically, PEP 302 documents that path_importer_cache always contains > >>either importer objects or None. Any code written to obtain importer > >>objects is therefore now broken, because import.c is slapping False in > for > >>non-existent filesystem paths. > >> > >>The pkgutil module was then hacked to work around this problem, thereby > >>hiding the breakage from at least the standard library, but not any > >>external libraries that follow the PEP 302 protocol to find importers. > >> > >>There are several options as to how to proceed: > >> > >>1. Revert the change > >>2. Document the breakage, update PEP 302, and make everybody update > their > >>code > >>3. Make it not break existing code, by using a NonexistentPathImporter > or > >>NullImporter type in place of "False" in sys.path_importer_cache. > >> > >>Any thoughts? > > > >Revert it. Is it really that much of a bonus to use False over > >None? Both evaluate to false and both are already singleton so you can > >use 'is' for testing. > > The changed code still uses None. PEP 302 defines None as meaning that a > sys.path entry does not have an importer. It's just that the > need-for-speed patch *adds* the use of True and False. None still means > "no importer", but True now means "no importer, path exists" and False now > means "no importer, path does not exist". Ah. Sounds like None is not really even needed with the change (although I am not suggesting the removal of None). The idea is that import.c can then skip checking the existence of the path > when it sees True or False, but it then means that code that gets data > from > path_importer_cache needs to know about these new special values, or else > it will get an attribute error when it tries to call True.find_module(). > > Well, I have not played with the PEP 302 stuff so I don't know how helpful they are to have around. But it is definitely a semantic change that either needs to be reverted or documented. 
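
For code that reads sys.path_importer_cache, the compatibility concern in
this exchange can be sketched as below. This is a hedged illustration, not
the pkgutil workaround itself: usable_importer is a hypothetical helper
name, and the treatment of True and False follows only what is described in
this thread for the 2.5 betas.

```python
import sys


def usable_importer(path_entry, cache=None):
    """Return a real importer for path_entry, or None if there is none.

    None, True and False are all treated as "no importer", so the caller
    never ends up invoking find_module() on a bool.
    """
    if cache is None:
        cache = sys.path_importer_cache
    importer = cache.get(path_entry)
    # Assumption from the thread: True means "no importer, path exists"
    # and False means "no importer, path does not exist".
    if importer is None or isinstance(importer, bool):
        return None
    return importer


if __name__ == "__main__":
    # A stand-in cache using the values discussed in the thread.
    marker = object()
    fake_cache = {"/exists": True, "/missing": False,
                  "/plain": None, "/zip": marker}
    for entry in sorted(fake_cache):
        print(entry, usable_importer(entry, fake_cache))
```

Code written this way behaves the same whether the cache holds only None or
also the new boolean markers, which is the property option 3 would give
existing callers without any change on their side.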
-Brett -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/python-dev/attachments/20060720/61ed8b42/attachment.html From glyph at divmod.com Thu Jul 20 22:17:46 2006 From: glyph at divmod.com (glyph at divmod.com) Date: Thu, 20 Jul 2006 16:17:46 -0400 Subject: [Python-Dev] Undocumented PEP 302 protocol change by need-for-speed sprint In-Reply-To: <5.1.1.6.0.20060720144252.02622318@sparrow.telecommunity.com> Message-ID: <20060720201746.29014.684853232.divmod.quotient.46224@ohm> On Thu, 20 Jul 2006 14:57:07 -0400, "Phillip J. Eby" <pje at telecommunity.com> wrote: >While investigating the need to apply http://python.org/sf/1525766 I found >that there was a modification to pkgutil during the need-for-speed sprint >that affects the PEP 302 protocol in a backwards incompatible way. It just so happens that the bug that is reported was probably reported because I'm working on some controversial new functionality in Twisted - controversial because it replicates the functionality that bug is about in pkgutil. This functionality does make some use of PEP 302 functionality :). See <http://twistedmatrix.com/trac/ticket/1940> >Specifically, PEP 302 documents that path_importer_cache always contains >either importer objects or None. Any code written to obtain importer >objects is therefore now broken, because import.c is slapping False in for >non-existent filesystem paths. Oddly, for once I'm going to say I don't care about this change. The code I've written so far doesn't depend on this, and I was pretty careful to be conservative about depending too much on the stuff described in PEP 302. It documents several features which don't exist (get_data, and methods in the "imp" module which don't exist in python2.3 or python2.4, where it was nominally accepted). >There are several options as to how to proceed: >2. 
Document the breakage, update PEP 302, and make everybody update their code

Personally I'd prefer it if PEP 302 were updated for a variety of reasons.
It's very hard to use as a reference for writing actual code because so many
features are "optional" or "open issues", and there's no description in the
PEP of what their status is.

Better yet, this breakage (and other things) should be documented in the
Python reference, and the PEP should link to the documentation for different
versions, which can each describe the PEP's implementation status. The
"importing modules" section of the library reference seems like a natural
place to put it.

From barry at python.org Thu Jul 20 22:45:47 2006
From: barry at python.org (Barry Warsaw)
Date: Thu, 20 Jul 2006 16:45:47 -0400
Subject: [Python-Dev] Support for PyGetSetDefs in pydoc
In-Reply-To: <e9olpa$4ql$1@sea.gmane.org>
References: <A027CA35-EE77-4658-BA59-E849368C440B@python.org> <EF6E6AF2-654F-4F53-91A8-A6C9B8A80745@python.org> <44B64782.9010706@gmail.com> <7FE62713-24B0-47CE-BA94-41150DB683F4@python.org> <AB1C2209-329E-4F3D-8B7A-2548A40F892A@python.org> <e9olpa$4ql$1@sea.gmane.org>
Message-ID: <1A2C07C0-58C4-45F0-90C1-FD0C26B7D9C5@python.org>

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

On Jul 20, 2006, at 3:32 PM, Georg Brandl wrote:

> Perhaps you could put the objects into _testcapi. That way no new
> module has to be deployed (is _testcapi installed on every system?)

That doesn't seem importable in types.py either. You /could/ add that
to Modules/config.c.in but I'm not sure it's a good idea.
- -Barry -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.3 (Darwin) iQCVAwUBRL/rgHEjvBPtnXfVAQLnHwP/Tzy1Iy8nbByuVaYaGrhel9zsdlbRXXlj 8wvLLlmzctmnzBrPBpZKV7aJMdiBd6261SKqPCrDZCj2LEplZWRa7za0zm3O2iZu F1kbWCKTwWeI2/kVNpP1lTkm53u9yYp8grjxe1Q6fniB5nqQH11j79JehAYpBwdj /RsVYoeQxng= =lIEO -----END PGP SIGNATURE----- From tdelaney at avaya.com Fri Jul 21 01:22:25 2006 From: tdelaney at avaya.com (Delaney, Timothy (Tim)) Date: Fri, 21 Jul 2006 09:22:25 +1000 Subject: [Python-Dev] FW: Bug? Certainly a new *behavior* from subprocess in 2.5 on Win32 Message-ID: <2773CAC687FD5F4689F526998C7E4E5FF1E843@au3010avexu1.global.avaya.com> Larry Hastings wrote: > I run the following script: > -- > from subprocess import * > Popen("ls -l") > -- > (yeah, I have ls.exe on Windows) > > Under Python 2.4.2, this simply dumped the results of ls.exe to the > terminal--sorry, to the "command shell". > > Under Python 2.5, both beta 1 and beta 2, it dumps the results to the > command shell, but *also* prints this: > > Exception exceptions.AttributeError: "'NoneType' object has no > attribute 'append'" in <bound method Popen.__del__ of > <subprocess.Popen object at 0x00C04EB0>> ignored > > Calling Popen() with a stdout = subprocess.PIPE does not throw this > exception. I've asked Larry to raise this on SourceForge, but with the SF email problems I thought I'd better forward it here. Looks like there's a bug in Popen.__del__ in 2.5. I'm not in a position to have a look right now. Tim Delaney From bioinformed at gmail.com Fri Jul 21 01:37:49 2006 From: bioinformed at gmail.com (Kevin Jacobs <jacobs@bioinformed.com>) Date: Thu, 20 Jul 2006 19:37:49 -0400 Subject: [Python-Dev] FW: Bug? 
Certainly a new *behavior* from subprocess in 2.5 on Win32 In-Reply-To: <2773CAC687FD5F4689F526998C7E4E5FF1E843@au3010avexu1.global.avaya.com> References: <2773CAC687FD5F4689F526998C7E4E5FF1E843@au3010avexu1.global.avaya.com> Message-ID: <2e1434c10607201637l238635c7kc63cff228c5def82@mail.gmail.com> Reported to the list about a week ago, with analysis. Didn't get a response. Won't use sourceforge. Sorry about the top post. -Kevin On 7/20/06, Delaney, Timothy (Tim) <tdelaney at avaya.com> wrote: > > Larry Hastings wrote: > > > I run the following script: > > -- > > from subprocess import * > > Popen("ls -l") > > -- > > (yeah, I have ls.exe on Windows) > > > > Under Python 2.4.2, this simply dumped the results of ls.exe to the > > terminal--sorry, to the "command shell". > > > > Under Python 2.5, both beta 1 and beta 2, it dumps the results to the > > command shell, but *also* prints this: > > > > Exception exceptions.AttributeError: "'NoneType' object has no > > attribute 'append'" in <bound method Popen.__del__ of > > <subprocess.Popen object at 0x00C04EB0>> ignored > > > > Calling Popen() with a stdout = subprocess.PIPE does not throw this > > exception. > > I've asked Larry to raise this on SourceForge, but with the SF email > problems I thought I'd better forward it here. > > Looks like there's a bug in Popen.__del__ in 2.5. I'm not in a position > to have a look right now. > > Tim Delaney > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > http://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: > http://mail.python.org/mailman/options/python-dev/jacobs%40bioinformed.com > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: http://mail.python.org/pipermail/python-dev/attachments/20060720/a4ae87b3/attachment.htm

From tdelaney at avaya.com Fri Jul 21 02:44:39 2006
From: tdelaney at avaya.com (Delaney, Timothy (Tim))
Date: Fri, 21 Jul 2006 10:44:39 +1000
Subject: [Python-Dev] Behavior change in subprocess.py
Message-ID: <2773CAC687FD5F4689F526998C7E4E5FF1E847@au3010avexu1.global.avaya.com>

Hah - just found it. I even remember reading it...

I'll update the SF tracker (1526203) with your analysis.

Tim Delaney

-----Original Message-----
From: python-dev-bounces+tdelaney=avaya.com at python.org [mailto:python-dev-bounces+tdelaney=avaya.com at python.org] On Behalf Of Kevin Jacobs <jacobs at bioinformed.com>
Sent: Thursday, 13 July 2006 12:33 AM
To: python-dev at python.org
Subject: [Python-Dev] Behavior change in subprocess.py

During my testing of Python 2.5b2, I've found something that may be worthy
of discussion. I suspect that recent GC and finalization changes have
altered the behavior of the Popen object in subprocess.py. I am now getting
many many many finalization warnings in my code like:

Exception exceptions.AttributeError: "'NoneType' object has no attribute
'append'" in <bound method Popen.__del__ of <subprocess.Popen object at
0x2aaaab910950>> ignored

Is this a bug or a feature? Personally, I'd like to see these messages
silenced, since they are being generated during interpreter shutdown. The
following patch does the trick for me:

--- /usr/local/lib/python2.5/subprocess.py 2006-07-11 14:11:59.000000000 -0400
+++ subprocess.py 2006-07-12 10:17:09.000000000 -0400
@@ -613,7 +613,7 @@
             return
         # In case the child hasn't been waited on, check if it's done.
         self.poll(_deadstate=sys.maxint)
-        if self.returncode is None:
+        if self.returncode is None and _active is not None:
             # Child is still running, keep us alive until we can wait on it.
             _active.append(self)

Note that popen.py does something similar, though I am not convinced that
the test is right or if it is doing something more subtle:

    def __del__(self):
        # In case the child hasn't been waited on, check if it's done.
        self.poll(_deadstate=sys.maxint)
        if self.sts < 0:
            if _active:
                # Child is still running, keep us alive until we can wait on it.
                _active.append(self)

Regards,
-Kevin

From greg.ewing at canterbury.ac.nz Fri Jul 21 03:17:43 2006
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Fri, 21 Jul 2006 13:17:43 +1200
Subject: [Python-Dev] Strategy for converting the decimal module to C
In-Reply-To: <E1G3VNd-0003SW-6E@draco.cus.cam.ac.uk>
References: <E1G3VNd-0003SW-6E@draco.cus.cam.ac.uk>
Message-ID: <44C02B37.5080106@canterbury.ac.nz>

Nick Maclaren wrote:

> Now, interrupting into that level has to be transparent, in order to
> support TLB misses, clock interrupts, device interrupts, machine-check
> interrupts and so on.

I thought we were just talking about counting the number of floating
point exceptions that a particular piece of code generates. Surely
that's deterministic, and isn't affected by any of that stuff?

--
Greg

From brett at python.org Fri Jul 21 05:52:34 2006
From: brett at python.org (Brett Cannon)
Date: Thu, 20 Jul 2006 20:52:34 -0700
Subject: [Python-Dev] first draft of bug guidelines for www.python.org/dev/
Message-ID: <bbaeab100607202052t68ac220cs70b32a7c6a3190a2@mail.gmail.com>

Here is a first stab at writing up guidelines for people to follow when
reporting bugs. If this goes well I will also do ones for patches,
committing, and PEPs.

-Brett

---------------------------------------------------

These sets of guidelines are to help you file a bug report for the Python
programming language on SourceForge_.
If your bug is not for the language but for a third-party application,
please report the bug to that third party.

*Please make sure to follow every step as it will make the lives of the
Python developers much easier!!!*

.. contents::

Get a SourceForge account
=======================================

In order to file a bug report, you must have an account_ on SourceForge_.
We realize some people would like to have anonymous bug reporting for
various reasons (anonymity, ease of reporting, etc.). But SourceForge does
not support anonymous reporting. Plus, by registering, you are notified by
email when any action is taken on your report. This can be very important
if a Python developer needs more information from you about the bug.

Start a new bug
=======================================

You must be logged into SourceForge to file a bug! See `Get a SourceForge
account`_ if you do not have one.

Go to the `SourceForge bug page`_ to start a new bug report. There you will
find a link called `Submit New`_. Click on it and it will allow you to fill
out a new bug report.

Once you click on the link, you are presented with a page that has several
fields for you to fill in. Here is what to do for each field:

* Category
    Set this to the area that the bug is related to (e.g., documentation,
    build, etc.).
* Group
    Usually this is set to the major.minor version of Python that you found
    the bug in.
* Assigned To
    Leave this alone
* Priority
    Leave this alone
* Summary
    A one-line summary describing the problem so as to make it easy for
    developers to spot whether they have the expertise needed to work on
    the bug.
* Detailed Description
    Following sections of this document discuss what should go in here.
* Check to Upload and Attach a File
    If you are going to upload a file, you *must* check this box.
* <File Location Field>
    Click the Browse button to upload any file to accompany your bug report
    (usually a succinct way to reproduce the bug).
* File Description
    A one-line summary describing the file; no date info is needed since
    the upload is timestamped.

Specify Python version
=======================================

It is important that we have the most accurate version number of the
interpreter you are using in order to best diagnose the issue. There are
two ways to get us the version information.

If you can run your Python interpreter, execute the following lines at an
interpreter prompt and paste the result into the ``Detailed Description``
field of the bug report::

    >>> import sys
    >>> print sys.version

If you are running a version of Python newer than 2.4 and are working from
a source checkout of Python, then please also report the Subversion
revision number for the root of your checkout::

    python/trunk$ svnversion .

If your bug is preventing you from running the interpreter, execute Python
with the ``-V`` command-line flag and paste the output::

    python/trunk$ python -V

Special settings for your Python interpreter
============================================

Sometimes your environment influences a bug and thus needs to be reported
to help find the problem. This means the following need to be reported:

* Operating System
* Environment Variables
    + PYTHONSTARTUP
        If this is set and might be causing the issue, please either
        upload the file or state what it does.
    + PYTHONCASEOK
        If your bug is on Windows and involves importing, please report
        if this environment variable is set or not.
* site-packages
    If you have any third-party packages installed that may be contributing
    to the bug, please report those.
* Custom Patches
    Any differences between your code and the code the Python developers
    work off of need to be reported.

Sample code to reproduce bug
========================================

If you can, please upload a file that demonstrates the bug. The more
succinct the better! And please do not forget to check the upload checkbox
in the bug report.

Submit!
======================================== At this point you should have a detailed bug report for developers to work off of. Click the ``Submit`` button and read on to see what you should do after the bug is reported. Respond to requests from developers ======================================== No matter how detailed the bug report, there is always the off-chance that a developer will need more information to fix a bug. Please be prompt in replying to requests for information by submitting a reply on the bug report. You may be asked to test out a patch. It is very important that you help with this, especially if the bug is not reproducible by the developer working on it. Write a patch! ======================================== If you are so inclined, patches for your own bug reports are always helpful! Please make sure to reference the tracker item # in the patch details. .. _SourceForge: http://www.sourceforge.net/ .. _SourceForge bug page: http://sourceforge.net/tracker/?group_id=5470&atid=105470 .. _account: http://sourceforge.net/account/newuser_emailverify.php .. _Submit New: http://sourceforge.net/tracker/?func=add&group_id=5470&atid=105470 From nyamatongwe at gmail.com Fri Jul 21 06:10:27 2006 From: nyamatongwe at gmail.com (Neil Hodgson) Date: Fri, 21 Jul 2006 14:10:27 +1000 Subject: [Python-Dev] first draft of bug guidelines for www.python.org/dev/ In-Reply-To: <bbaeab100607202052t68ac220cs70b32a7c6a3190a2@mail.gmail.com> References: <bbaeab100607202052t68ac220cs70b32a7c6a3190a2@mail.gmail.com> Message-ID: <50862ebd0607202110m661a3fc8sfd99b4a78b4db1b8@mail.gmail.com> Brett Cannon: > But SourceForge does not support anonymous reporting. SourceForge does support anonymous reporting. A large proportion of the fault reports I receive for Scintilla are anonymous as indicated by "nobody" in the "Submitted By" column. https://sourceforge.net/tracker/?group_id=2439&atid=102439 Neil From fdrake at acm.org Fri Jul 21 06:25:17 2006 From: fdrake at acm.org (Fred L. 
Drake, Jr.) Date: Fri, 21 Jul 2006 00:25:17 -0400 Subject: [Python-Dev] first draft of bug guidelines for www.python.org/dev/ In-Reply-To: <50862ebd0607202110m661a3fc8sfd99b4a78b4db1b8@mail.gmail.com> References: <bbaeab100607202052t68ac220cs70b32a7c6a3190a2@mail.gmail.com> <50862ebd0607202110m661a3fc8sfd99b4a78b4db1b8@mail.gmail.com> Message-ID: <200607210025.17429.fdrake@acm.org> On Friday 21 July 2006 00:10, Neil Hodgson wrote: > Brett Cannon: > > But SourceForge does not support anonymous reporting. > > SourceForge does support anonymous reporting. A large proportion of > the fault reports I receive for Scintilla are anonymous as indicated > by "nobody" in the "Submitted By" column. SourceForge supports anonymous reporting, but the Python project determined that the management cost of anonymous reports was higher than the value they provided. It might be time to reconsider that decision (though my position hasn't changed). -Fred -- Fred L. Drake, Jr. <fdrake at acm.org> From tjreedy at udel.edu Fri Jul 21 06:42:12 2006 From: tjreedy at udel.edu (Terry Reedy) Date: Fri, 21 Jul 2006 00:42:12 -0400 Subject: [Python-Dev] Support for PyGetSetDefs in pydoc References: <A027CA35-EE77-4658-BA59-E849368C440B@python.org><EF6E6AF2-654F-4F53-91A8-A6C9B8A80745@python.org><44B64782.9010706@gmail.com><7FE62713-24B0-47CE-BA94-41150DB683F4@python.org> <AB1C2209-329E-4F3D-8B7A-2548A40F892A@python.org> Message-ID: <e9plv5$rfe$1@sea.gmane.org> "Barry Warsaw" <barry at python.org> wrote in message news:AB1C2209-329E-4F3D-8B7A-2548A40F892A at python.org... > think this patch should go in 2.5. OTOH, I suspect most people just > don't care, which is why I've gotten almost no comments on the patch > (other than one or two mild nods of approval). I use help(ob) quite a bit, have noticed that somethings don't give anything useful, and definitely approve of making help more useful. tjr From oliphant.travis at ieee.org Fri Jul 21 09:42:51 2006 From: oliphant.travis at ieee.org (Travis E. 
Oliphant)
Date: Fri, 21 Jul 2006 01:42:51 -0600
Subject: [Python-Dev] segfault when using PyGILState_Ensure/Release in Python2.3.4
Message-ID: <e9q0hs$jo4$1@sea.gmane.org>

I'm hoping somebody here can help me with an error I'm getting in Python
2.3.4 but not in Python 2.4.2 when I use PyGILState_Ensure in NumPy on Linux.

Perhaps somebody can point out what I'm doing wrong because while I've
tried to understand the threading API it can be a bit confusing and maybe
I'm doing it wrong. Right now, I've had to disable threading support in
NumPy for Python 2.3 which is a bit annoying.

The problem shows up when I've released the GIL using PyEval_SaveThread()
in one section of code. Then the code calls functions that don't involve
the Python C-API. Then another function sometimes requires use of the C-API
to set a Python Error or issue a warning. So I call:

    _save = PyGILState_Ensure();

    <Use Python C-API to issue an error or warning>

Finally, before exiting this function

    PyGILState_Release(_save);

is called. Later when control returns to the original caller that released
the GIL, PyEval_RestoreThread() is called.

But the segfault seems to be happening on the call to

    PyGILState_Release(_save);

All of this works fine when it runs under Python 2.4.2, but under Python
2.3.4 I get a segfault.

Does anybody have any ideas?

Thanks very much.

From ncoghlan at gmail.com Fri Jul 21 12:08:36 2006 From: ncoghlan at gmail.com (Nick Coghlan) Date: Fri, 21 Jul 2006 20:08:36 +1000 Subject: [Python-Dev] new security doc using object-capabilities In-Reply-To: <bbaeab100607201009x28a7354elde374c6651b1a322@mail.gmail.com> References: <bbaeab100607191535p543cb0ddj66410cf985dd9b77@mail.gmail.com> <44BF6C59.6010707@gmail.com> <bbaeab100607201009x28a7354elde374c6651b1a322@mail.gmail.com> Message-ID: <44C0A7A4.2030209@gmail.com> Brett Cannon wrote: > Extensible file type handling > ----------------------------- > If the file type handlers are stored in normal Python data > structures as > described above, it becomes feasible to make the import system > extensible to > different file types as well as to different file locations. > > > Yep. Although I am more interested in restricting than broadening the > file types. Either way you'd be mutating the list of recognised file types :) Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- http://www.boredomandlaziness.org From ncoghlan at gmail.com Fri Jul 21 12:14:40 2006 From: ncoghlan at gmail.com (Nick Coghlan) Date: Fri, 21 Jul 2006 20:14:40 +1000 Subject: [Python-Dev] FW: Bug? Certainly a new *behavior* from subprocess in 2.5 on Win32 In-Reply-To: <2773CAC687FD5F4689F526998C7E4E5FF1E843@au3010avexu1.global.avaya.com> References: <2773CAC687FD5F4689F526998C7E4E5FF1E843@au3010avexu1.global.avaya.com> Message-ID: <44C0A910.4070002@gmail.com> Delaney, Timothy (Tim) wrote: > Looks like there's a bug in Popen.__del__ in 2.5. I'm not in a position > to have a look right now. For those not watching python-checkins, a check for "is not None" has been added before the offending line in Popen.__del__. (by Georg, IIRC) Cheers, Nick. 
-- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- http://www.boredomandlaziness.org From nmm1 at cus.cam.ac.uk Fri Jul 21 12:18:04 2006 From: nmm1 at cus.cam.ac.uk (Nick Maclaren) Date: Fri, 21 Jul 2006 11:18:04 +0100 Subject: [Python-Dev] Strategy for converting the decimal module to C Message-ID: <E1G3s56-0000pe-M5@draco.cus.cam.ac.uk> Greg Ewing <greg.ewing at canterbury.ac.nz> wrote: > > > Now, interrupting into that level has to be transparent, in order to > > support TLB misses, clock interrupts, device interrupts, machine-check > > interrupts and so on. > > I thought we were just talking about counting the number > of floating point exceptions that a particular piece of > code generates. Surely that's deterministic, and isn't Er, no. Rather fundamentally, on two grounds. Please bear with me, as this IS relevant to Python. See the summary at the end if you like :-) The first is that such things are NOT deterministic, not even on simple CPUs - take a look at the Alpha architecture for an example, and then follow it up with the IA64 one if you have the stomach for it. But that wasn't my main point. It is that modern CPUs have a SINGLE interrupt mechanism (a mistake in itself, but they do), so a CPU may be interrupted when it is running a device driver, other kernel thread or within a system call as much as when running an application. In fact, to some extent, interrupt handlers can themselves be interrupted (let's skip the details). Now, in order to allow the application to run its handler, the state has to be saved, sanitised and converted back to application context; and conversely on return. That is hairy, and is why it is not possible to handle interrupts generated within system calls on many systems. But that is not directly Python's problem. 
What is, is that the code gets interrupted at an unpredictable place, and the registers and other state may not be consistent as the language run-time system and Python are concerned. It is critical (a) that a sane state is restored before calling the handler and (b) that calling the handler neither relies on nor disturbs any of the "in flight" actions in the interrupted code. To cut a long story short, it is impractical for a language run-time system to call user-defined handlers with any degree of reliability unless the compiled code and run-time interoperate carefully - I have been there and done that many times, but few people still working have. On architectures with out-of-order execution (and interrupts), you have to assume that an interrupt may occur anywhere, even when the code does not use the relevant facility. Floating-point overflow in the middle of a list insertion? That's to be expected. It becomes considerably easier if the (run-time system) interrupt handler merely needs to flag or count interrupts, as it can use a minimal handler which is defensive and non-intrusive. Even that is a pretty fair nightmare, as many systems temporarily corrupt critical registers when they think that it is safe. And few think of interrupts when deciding that .... So, in summary, please DON'T produce a design that relies on trapping floating-point exceptions and passing control to a Python function. This is several times harder than implementing fpectl. Regards, Nick Maclaren, University of Cambridge Computing Service, New Museums Site, Pembroke Street, Cambridge CB2 3QH, England. Email: nmm1 at cam.ac.uk Tel.: +44 1223 334761 Fax: +44 1223 334679 From ndbecker2 at gmail.com Fri Jul 21 12:56:33 2006 From: ndbecker2 at gmail.com (Neal Becker) Date: Fri, 21 Jul 2006 06:56:33 -0400 Subject: [Python-Dev] Document performance requirements? Message-ID: <e9qbtf$kk6$1@sea.gmane.org> For a recent project I needed to select a container. 
There are plenty of Python data structures to choose from. It seems that
information on performance is missing (or not easy to find).

I think Python should include performance in the documentation of common
data structures to help users select the appropriate types. Something in
the style of the C++ STL.

From mwh at python.net Fri Jul 21 13:25:12 2006
From: mwh at python.net (Michael Hudson)
Date: Fri, 21 Jul 2006 12:25:12 +0100
Subject: [Python-Dev] Ireland PyPy sprint 21th-27th August 2006
Message-ID: <2mirlryzc7.fsf@starship.python.net>

The next PyPy sprint will happen in the nice city of Limerick in Ireland
from 21st till 27th August. (Most people intend to arrive on 20th August.)
The main focus of the sprint will be on JIT compiler work, various
optimization work, porting extension modules, infrastructure work like a
build tool for PyPy, or extended (distributed) testing. It's also open to
new topics.

If you are a student, consider participating in `Summer of PyPy`_ in order
to get funding for your travels and accommodation.

The sprint is being hosted by the University of Limerick (http://www.ul.ie/)
and is arranged in co-operation with people from our sister project Calibre
(www.calibre.ie). Our contacts at the University are Pär Ågerfalk and Eoin
Oconchuir.

.. _`Summer of PyPy`: http://codespeak.net/pypy/dist/pypy/doc/summer-of-pypy

First day: introduction and workshop (possible to attend only this day)
------------------------------------------------------------------------

During the first day (21st of August) there will be talks on various
subjects related to PyPy:

* A tutorial and technical introduction to the PyPy codebase
  (suited for people interested in getting an overview of PyPy's
  architecture and/or contributing to PyPy)

* a workshop covering more in-depth technical aspects of PyPy and what PyPy
  can do for you. The workshop will also cover methodology, aiming at
  explaining the pros and cons of sprint-driven development.
(suited for sprint attendants, students, staff and other interested parties from/around the University and the local area) The tutorial will be part of the sprint introduction - the workshop will take place if there is enough interest raised before the 21st of August from people planning to attend. You are of course welcome to attend just for this first day of the sprint. If you want to come ... ---------------------------- If you'd like to come, please subscribe to the `pypy-sprint mailing list`_ and drop a note about your interests and post any questions. More organisational information will be send to that list. We'll keep a list of `people`_ which we'll update (which you can do so yourself if you have codespeak commit rights). .. _`Calibre`: http://www.calibre.ie A small disclaimer: There might be people visiting the sprint in order to do research on how open source communities work, organize and communicate. This research might be done via filming, observing or interviewing. But of course you will be able to opt-out of being filmed at the sprint. Logistics -------------------------------------- NOTE: you need a UK style of power adapter (220V). The sprint will be held in the Computer Science Building, room CSG-025, University of Limerick (no 7 on http://www.ul.ie/main/places/campus.shtml). Bus 308 from Limerick city will take you to no 30 (approx.). See http://www.ul.ie/main/places/travel.shtml for more on how to get to UL. We will have access to the sprint facilities from 09:00-19:00 every day (it might be even later than 19:00). Monday-Wednesday, Friday-Sunday are sprint days, Thursday is likely a break day. Food on campus varies in price and quality ;-) : from ca 4 EUR to 7-8 EUR for a lunch. There are of course a lot more food alternatives in down town Limerick. 
Next Airports ------------------ Shannon Airport (SNN) is the nearest airport (Ryanair flies there) - you may check out more information about flights to/from the airport at http://www.shannonairport.com/index.html There are busses from there to downtown Limerick, and busses from Limerick to the UL campus. Taxis are about 35 EUR. Accomodation ----------------- There is a website address for campus accomodation at http://www.ul.ie/conference/accommodation.htm. The rate should be 49 euro for Bed and Breakfast. If you are interested in booking campus accommodation, please contact deborah.tudge at ul ie and make reference to the PyPy workshop and sprint. Please try to book as soon as possible. As an off-campus accommodation alternative you can also try: Castletroy Lodge and Castletroy Inn (Bed and Breakfast) Dublin Road (15 to 20 mins walk to UL) Tel: +353 61 338385 / +353 61 331167 .. _`pypy-sprint mailing list`: http://codespeak.net/mailman/listinfo/pypy-sprint .. _`people`: people.html -- surely, somewhere, somehow, in the history of computing, at least one manual has been written that you could at least remotely attempt to consider possibly glancing at. -- Adam Rixey From ncoghlan at gmail.com Fri Jul 21 13:49:32 2006 From: ncoghlan at gmail.com (Nick Coghlan) Date: Fri, 21 Jul 2006 21:49:32 +1000 Subject: [Python-Dev] Document performance requirements? In-Reply-To: <e9qbtf$kk6$1@sea.gmane.org> References: <e9qbtf$kk6$1@sea.gmane.org> Message-ID: <44C0BF4C.1010408@gmail.com> Neal Becker wrote: > For a recent project I needed to select a container. There are plenty of > python data structures to choose from. It seems that information on > performance is missing (or not easy to find). > > I think Python should include performance in the documentation of common > data structures to help users select the appropriate types. Something in > the style of c++ STL. Do you mean absolute performance, or do you mean algorithmic order guarantees? 
I thought the latter were already documented. . . Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- http://www.boredomandlaziness.org From ncoghlan at gmail.com Fri Jul 21 13:56:20 2006 From: ncoghlan at gmail.com (Nick Coghlan) Date: Fri, 21 Jul 2006 21:56:20 +1000 Subject: [Python-Dev] Support for PyGetSetDefs in pydoc In-Reply-To: <e9olpa$4ql$1@sea.gmane.org> References: <A027CA35-EE77-4658-BA59-E849368C440B@python.org> <EF6E6AF2-654F-4F53-91A8-A6C9B8A80745@python.org> <44B64782.9010706@gmail.com> <7FE62713-24B0-47CE-BA94-41150DB683F4@python.org> <AB1C2209-329E-4F3D-8B7A-2548A40F892A@python.org> <e9olpa$4ql$1@sea.gmane.org> Message-ID: <44C0C0E4.6040403@gmail.com> Georg Brandl wrote: > Perhaps you could put the objects into _testcapi. That way no new module > has to be deployed (is _testcapi installed on every system?) Let's not corrupt _testcapi with something that has an entirely differently purpose. Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- http://www.boredomandlaziness.org From ndbecker2 at gmail.com Fri Jul 21 14:03:26 2006 From: ndbecker2 at gmail.com (Neal Becker) Date: Fri, 21 Jul 2006 08:03:26 -0400 Subject: [Python-Dev] Document performance requirements? References: <e9qbtf$kk6$1@sea.gmane.org> <44C0BF4C.1010408@gmail.com> Message-ID: <e9qfqt$1t3$1@sea.gmane.org> Nick Coghlan wrote: > Neal Becker wrote: >> For a recent project I needed to select a container. There are plenty of >> python data structures to choose from. It seems that information on >> performance is missing (or not easy to find). >> >> I think Python should include performance in the documentation of common >> data structures to help users select the appropriate types. Something in >> the style of c++ STL. > > Do you mean absolute performance, or do you mean algorithmic order > guarantees? 
I thought the latter were already documented. . . > The latter. Where is it documented? From ncoghlan at gmail.com Fri Jul 21 14:25:02 2006 From: ncoghlan at gmail.com (Nick Coghlan) Date: Fri, 21 Jul 2006 22:25:02 +1000 Subject: [Python-Dev] Document performance requirements? In-Reply-To: <200607210759.56595.ndbecker2@gmail.com> References: <e9qbtf$kk6$1@sea.gmane.org> <44C0BF4C.1010408@gmail.com> <200607210759.56595.ndbecker2@gmail.com> Message-ID: <44C0C79E.3020309@gmail.com> Neal Becker wrote: > On Friday 21 July 2006 7:49 am, Nick Coghlan wrote: >> Neal Becker wrote: >>> For a recent project I needed to select a container. There are plenty of >>> python data structures to choose from. It seems that information on >>> performance is missing (or not easy to find). >>> >>> I think Python should include performance in the documentation of common >>> data structures to help users select the appropriate types. Something in >>> the style of c++ STL. >> Do you mean absolute performance, or do you mean algorithmic order >> guarantees? I thought the latter were already documented. . . >> > > The latter. Where are they documented? Just because I think something, it doesn't mean it's true :) The only reference I can actually find is the one in the collections module docs pointing out that collections.deque permits O(1) insertions and removals at the beginning of the sequence, as well as at the end (whereas lists are O(n) for operations at the beginning due to the resulting memory copying). However, I'm also struggling to think of a case other than list vs deque where the choice of a builtin or standard library data structure would be dictated by big-O() concerns. list vs array.array is based on memory efficiency list vs deque is based on whether or not you need O(1) push/pop at both ends list vs set is based on whether or not ordering matters set vs dict is based on whether or not you need to map keys to values Cheers, Nick. 
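
[Editorial illustration of the list vs deque point above.] The following is a minimal sketch, not part of the original message, and it assumes a modern Python 3 interpreter rather than the 2.5-era one under discussion. The helper names `drain_front_list` and `drain_front_deque` are made up for this example. Both functions remove items from the front of a sequence; the list version pays for shifting the remaining items on every `pop(0)`, while `deque.popleft()` does not.

```python
from collections import deque
from timeit import timeit


def drain_front_list(n):
    """Pop every item off the front of a list.

    Each pop(0) shifts the remaining items down, so a single pop is O(n).
    """
    items = list(range(n))
    out = []
    while items:
        out.append(items.pop(0))
    return out


def drain_front_deque(n):
    """Pop every item off the front of a deque; popleft() is O(1)."""
    items = deque(range(n))
    out = []
    while items:
        out.append(items.popleft())
    return out


if __name__ == "__main__":
    n = 20000
    # Timings vary by machine; the gap should widen as n grows.
    print("list :", timeit(lambda: drain_front_list(n), number=1))
    print("deque:", timeit(lambda: drain_front_deque(n), number=1))
```

Both helpers return the same items in the same order, so the choice between them only matters for cost, which is the kind of guarantee the thread suggests documenting.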
-- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- http://www.boredomandlaziness.org From jason.orendorff at gmail.com Fri Jul 21 16:14:20 2006 From: jason.orendorff at gmail.com (Jason Orendorff) Date: Fri, 21 Jul 2006 10:14:20 -0400 Subject: [Python-Dev] Document performance requirements? In-Reply-To: <bb8868b90607210713p4599e8c3ka5fc8c61f8679486@mail.gmail.com> References: <e9qbtf$kk6$1@sea.gmane.org> <44C0BF4C.1010408@gmail.com> <200607210759.56595.ndbecker2@gmail.com> <44C0C79E.3020309@gmail.com> <bb8868b90607210713p4599e8c3ka5fc8c61f8679486@mail.gmail.com> Message-ID: <bb8868b90607210714g2b7dd8e1he8fa00ba842fe3ee@mail.gmail.com> On 7/21/06, Nick Coghlan <ncoghlan at gmail.com> wrote: > However, I'm also struggling to think of a case other than list vs deque where > the choice of a builtin or standard library data structure would be dictated > by big-O() concerns. OK, but that doesn't mean the information is unimportant. +1 on making this something of a priority. People looking for this info should find it in the obvious place. Some are unobvious. (How fast is dict.__eq__ on average? Worst case?) -j From brett at python.org Fri Jul 21 16:40:07 2006 From: brett at python.org (Brett Cannon) Date: Fri, 21 Jul 2006 07:40:07 -0700 Subject: [Python-Dev] first draft of bug guidelines for www.python.org/dev/ In-Reply-To: <200607210025.17429.fdrake@acm.org> References: <bbaeab100607202052t68ac220cs70b32a7c6a3190a2@mail.gmail.com> <50862ebd0607202110m661a3fc8sfd99b4a78b4db1b8@mail.gmail.com> <200607210025.17429.fdrake@acm.org> Message-ID: <bbaeab100607210740pf9e2db5r9f427c8f3184a9f7@mail.gmail.com> On 7/20/06, Fred L. Drake, Jr. <fdrake at acm.org> wrote: > > On Friday 21 July 2006 00:10, Neil Hodgson wrote: > > Brett Cannon: > > > But SourceForge does not support anonymous reporting. > > > > SourceForge does support anonymous reporting. 
A large proportion of > > the fault reports I receive for Scintilla are anonymous as indicated > > by "nobody" in the "Submitted By" column. > > SourceForge supports anonymous reporting, but the Python project > determined > that the management cost of anonymous reports was higher than the value > they > provided. OK. Wording has been changed in my copy. It might be time to reconsider that decision (though my position hasn't > changed). Sure. It can also wait until we begin discussing the transition to our next bug tracker. -Brett -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/python-dev/attachments/20060721/1f9d3087/attachment.html From foom at fuhm.net Fri Jul 21 17:05:51 2006 From: foom at fuhm.net (James Y Knight) Date: Fri, 21 Jul 2006 11:05:51 -0400 Subject: [Python-Dev] Strategy for converting the decimal module to C In-Reply-To: <E1G3s56-0000pe-M5@draco.cus.cam.ac.uk> References: <E1G3s56-0000pe-M5@draco.cus.cam.ac.uk> Message-ID: <8611AEB2-5D94-4C28-96D6-8626BA5B6DFC@fuhm.net> On Jul 21, 2006, at 6:18 AM, Nick Maclaren wrote: > To cut a long story short, it is impractical for a language run-time > system to call user-defined handlers with any degree of reliability > unless the compiled code and run-time interoperate carefully - I have > been there and done that many times, but few people still working > have. > On architectures with out-of-order execution (and interrupts), you > have to assume that an interrupt may occur anywhere, even when the > code does not use the relevant facility. Floating-point overflow > in the middle of a list insertion? That's to be expected. While this _is_ a real problem, is it _not_ a general problem as you are describing it. Processors are perfectly capable of generating precise interrupts, and the inability to do so has nothing to do with the out-of-order execution, etc. Almost all interrupts are precise. 
The only interesting one which is not, on x86 processors, is the x87
floating point exception, which is basically for historical reasons. It has
never been precise, ever since the actual 8087 coprocessor chip for the
8086.

However, all is not lost: the exception cannot occur randomly. It can only
occur on *some* floating point instruction, even if the instruction is not
the one the error actually occurred in. So, unless your list insertion code
uses floating point instructions, you should not get a floating point
exception during your list insertion.

Also, looking forward, the "simd" floating point instructions (i.e.
mmx/sse/sse2/sse3) _do_ generate precise interrupts. And on x86-64, x87
instructions are deprecated and everyone is recommended to use the simd
ones instead (so, for example, gcc defaults to using them).

James

From bioinformed at gmail.com Fri Jul 21 18:43:44 2006
From: bioinformed at gmail.com (Kevin Jacobs <jacobs@bioinformed.com>)
Date: Fri, 21 Jul 2006 12:43:44 -0400
Subject: [Python-Dev] FW: Bug? Certainly a new *behavior* from subprocess in 2.5 on Win32
In-Reply-To: <44C0A910.4070002@gmail.com>
References: <2773CAC687FD5F4689F526998C7E4E5FF1E843@au3010avexu1.global.avaya.com> <44C0A910.4070002@gmail.com>
Message-ID: <2e1434c10607210943p10171eb9n607c5a5821768c7c@mail.gmail.com>

On 7/21/06, Nick Coghlan <ncoghlan at gmail.com> wrote:
>
> Delaney, Timothy (Tim) wrote:
> > Looks like there's a bug in Popen.__del__ in 2.5. I'm not in a position
> > to have a look right now.
>
> For those not watching python-checkins, a check for "is not None" has been
> added before the offending line in Popen.__del__. (by Georg, IIRC)
>

Many thanks for accepting my patch. There remains a potentially related
problem in popen2.py, but it may be a lower priority, since most folks
should be using subprocess.

    def __del__(self):
        # In case the child hasn't been waited on, check if it's done.
        self.poll(_deadstate=sys.maxint)
        if self.sts < 0:
            if _active:
                # Child is still running, keep us alive until we can wait on it.
                _active.append(self)

The "if _active" check, unless it intends to check for either empty or
None, should probably be revised to:

    def __del__(self):
        # In case the child hasn't been waited on, check if it's done.
        self.poll(_deadstate=sys.maxint)
        if self.sts < 0:
            if _active is None:
                # Child is still running, keep us alive until we can wait on it.
                _active.append(self)

However, there may be a clever reason for doing what it is doing that I do
not see.

Thanks again,
-Kevin
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/python-dev/attachments/20060721/f57de211/attachment.html

From rasky at develer.com Fri Jul 21 18:45:12 2006
From: rasky at develer.com (Giovanni Bajo)
Date: Fri, 21 Jul 2006 18:45:12 +0200
Subject: [Python-Dev] Document performance requirements?
References: <e9qbtf$kk6$1@sea.gmane.org> <44C0BF4C.1010408@gmail.com><200607210759.56595.ndbecker2@gmail.com> <44C0C79E.3020309@gmail.com><bb8868b90607210713p4599e8c3ka5fc8c61f8679486@mail.gmail.com> <bb8868b90607210714g2b7dd8e1he8fa00ba842fe3ee@mail.gmail.com>
Message-ID: <054801c6ace5$09478c50$d503030a@trilan>

Jason Orendorff wrote:

>> However, I'm also struggling to think of a case other than list vs
>> deque where the choice of a builtin or standard library data
>> structure would be dictated by big-O() concerns.
>
> OK, but that doesn't mean the information is unimportant. +1 on
> making this something of a priority. People looking for this info
> should find it in the obvious place. Some are unobvious. (How fast is
> dict.__eq__ on average? Worst case?)

I also found out that most people tend to think of Python's lists as a
magical data structure optimized for many operations (like a "rope" or
something complex like that). Documenting that it's just a bare vector
(std::vector in C++) would be of great help.
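
[Editorial illustration of the "bare vector" point above.] This is a hedged sketch, not from the original message, and it assumes Python 3. The `Counted` class and `count_comparisons` helper are hypothetical names invented here. The idea is to count equality comparisons during a membership test: a list has to compare element by element, while a set can jump to the right bucket by hash.

```python
class Counted:
    """Wrap a value and count every equality comparison made on instances."""

    comparisons = 0

    def __init__(self, value):
        self.value = value

    def __eq__(self, other):
        Counted.comparisons += 1
        return isinstance(other, Counted) and self.value == other.value

    def __hash__(self):
        # Needed so instances can live in a set; hashing is not counted.
        return hash(self.value)


def count_comparisons(container, needle):
    """Return (found, number of __eq__ calls) for `needle in container`."""
    Counted.comparisons = 0
    found = needle in container
    return found, Counted.comparisons


if __name__ == "__main__":
    haystack = [Counted(i) for i in range(1000)]
    needle = Counted(999)  # equal to the last element, but a distinct object
    print("list:", count_comparisons(haystack, needle))
    print("set :", count_comparisons(set(haystack), needle))
```

The comparison count for the list grows with the position of the match, which is the linear scan a plain vector implies; the set's count should stay small regardless of size.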
-- Giovanni Bajo From grig.gheorghiu at gmail.com Fri Jul 21 19:04:38 2006 From: grig.gheorghiu at gmail.com (Grig Gheorghiu) Date: Fri, 21 Jul 2006 10:04:38 -0700 Subject: [Python-Dev] Community buildbots -- reprise Message-ID: <3f09d5a00607211004kb45ffe2kcedcf418136e7102@mail.gmail.com> Hi, This message is in response to Glyph's plea (<http://mail.python.org/pipermail/python-dev/2006-July/067366.html> <http://mail.python.org/pipermail/python-dev/2006-July/067366.html%3E>). Here's what Glyph said: "I would like to propose, although I certainly don't have time to implement, a program by which Python-using projects could contribute buildslaves which would run their projects' tests with the latest Python trunk. This would provide two useful incentives: Python code would gain a reputation as generally well-tested (since there is a direct incentive to write tests for your project: get notified when core python changes might break it), and the core developers would have instant feedback when a "small" change breaks more code than it was expected to." I'm volunteering to organize this effort, is there is enough interest on this list. In fact, I've done some prep work already: * got a domain name: pybots.org * got a $47/month Ubuntu-based VPS from JohnCompanies.com (root access and everything); it's available at master.pybots.org, and it's ready to be configured as a buildmaster for the pybots * got a mailing list: pybots at lists2.idyll.org <http://us.f545.mail.yahoo.com/ym/Compose?To=pybots at lists2.idyll.org&YY=8414&order=down&sort=date&pos=0&view=a&head=b> I can start configuring the Ubuntu machine as a buildmaster, and I can also add a buildslave on the same machine that will check out the latest Python trunk code, build it, then run the automated tests for a sample project -- let's say for Twisted, since Glyph was the one requesting this. 
This will also serve as a sample buildslave for other people who will be interested in running buildslaves for their own projects. Apart from the goals stated by Glyph, I see this as a very valuable effort in convincing people of the value of automated tests, Python-related or not. A secondary effect I'd like to see would be for these suites of tests to be invoked in a standard fashion -- maybe 'python setup.py test'. If PSF can contribute some $$$ towards the hosting of the master server, that would be appreciated, but not required. All that's required is enough interest from the community. Please let me know if you're interested. Grig <http://agiletesting.blogspot.com/> -- http://agiletesting.blogspot.com -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/python-dev/attachments/20060721/6ede65ff/attachment.htm From skip at pobox.com Fri Jul 21 19:22:09 2006 From: skip at pobox.com (skip at pobox.com) Date: Fri, 21 Jul 2006 12:22:09 -0500 Subject: [Python-Dev] first draft of bug guidelines for www.python.org/dev/ In-Reply-To: <bbaeab100607210740pf9e2db5r9f427c8f3184a9f7@mail.gmail.com> References: <bbaeab100607202052t68ac220cs70b32a7c6a3190a2@mail.gmail.com> <50862ebd0607202110m661a3fc8sfd99b4a78b4db1b8@mail.gmail.com> <200607210025.17429.fdrake@acm.org> <bbaeab100607210740pf9e2db5r9f427c8f3184a9f7@mail.gmail.com> Message-ID: <17601.3393.544378.871413@montanaro.dyndns.org> Brett> Sure. It can also wait until we begin discussing the transition Brett> to our next bug tracker. Would be kinda nice if the new bug tracker allowed submitters to enter a followup email address without formally logging in. (Of course, email-based submissions would go a long way to minimizing the problem.) 
Skip From exarkun at divmod.com Fri Jul 21 19:27:41 2006 From: exarkun at divmod.com (Jean-Paul Calderone) Date: Fri, 21 Jul 2006 13:27:41 -0400 Subject: [Python-Dev] Community buildbots -- reprise In-Reply-To: <3f09d5a00607211004kb45ffe2kcedcf418136e7102@mail.gmail.com> Message-ID: <20060721172741.29014.1781472149.divmod.quotient.47688@ohm> On Fri, 21 Jul 2006 10:04:38 -0700, Grig Gheorghiu <grig.gheorghiu at gmail.com> wrote: >Hi, > >Apart from the goals stated by Glyph, I see this as a very valuable >effort in convincing people of the value of automated tests, >Python-related or not. A secondary effect I'd like to see would be for >these suites of tests to be invoked in a standard fashion -- maybe >'python setup.py test'. > >If PSF can contribute some $$$ towards the hosting of the master >server, that would be appreciated, but not required. All that's >required is enough interest from the community. > >Please let me know if you're interested. > This is certainly interesting to me. If you need any help setting up the Twisted buildslave, please let me know. Jean-Paul From g.brandl at gmx.net Fri Jul 21 19:39:40 2006 From: g.brandl at gmx.net (Georg Brandl) Date: Fri, 21 Jul 2006 19:39:40 +0200 Subject: [Python-Dev] FW: Bug? Certainly a new *behavior* from subprocess in 2.5 on Win32 In-Reply-To: <2e1434c10607210943p10171eb9n607c5a5821768c7c@mail.gmail.com> References: <2773CAC687FD5F4689F526998C7E4E5FF1E843@au3010avexu1.global.avaya.com> <44C0A910.4070002@gmail.com> <2e1434c10607210943p10171eb9n607c5a5821768c7c@mail.gmail.com> Message-ID: <e9r3ee$4on$1@sea.gmane.org> Kevin Jacobs <jacobs at bioinformed.com> wrote: > On 7/21/06, *Nick Coghlan* <ncoghlan at gmail.com > <mailto:ncoghlan at gmail.com>> wrote: > > Delaney, Timothy (Tim) wrote: > > Looks like there's a bug in Popen.__del__ in 2.5. I'm not in a > position > > to have a look right now. 
>
>     For those not watching python-checkins, a check for "is not None"
>     has been added before the offending line in Popen.__del__.
>     (by Georg, IIRC)
>
> Many thanks for accepting my patch. There remains a potentially related
> problem in popen2.py, but it may be a lower priority, since most folks
> should be using subprocess.
>
>     def __del__(self):
>         # In case the child hasn't been waited on, check if it's done.
>         self.poll(_deadstate=sys.maxint)
>         if self.sts < 0:
>             if _active:
>                 # Child is still running, keep us alive until we can wait on it.
>                 _active.append(self)
>
> The "if _active" check, unless it intends to check for either empty or
> None, should probably be revised to:
>
>     def __del__(self):
>         # In case the child hasn't been waited on, check if it's done.
>         self.poll(_deadstate=sys.maxint)
>         if self.sts < 0:
>             if _active is None:
>                 # Child is still running, keep us alive until we can wait on it.
>                 _active.append(self)
>
> However, there may be a clever reason for doing what it is doing that I do
> not see.

There's no reason (I know since I added the check myself ;).
Thanks for pointing out that obvious bug. (fixed in rev 50759).

Georg

From john at ewtllc.com Fri Jul 21 19:44:02 2006
From: john at ewtllc.com (John Benediktsson)
Date: Fri, 21 Jul 2006 10:44:02 -0700
Subject: [Python-Dev] FW: Bug? Certainly a new *behavior* from subprocess in 2.5 on Win32
Message-ID: <B6FAC926EFE7B348B12F29CF7E4A93D401328DC8@hammer.office.bhtrader.com>

> The "if _active" check, unless it intends to check for either empty or
> None, should probably be revised to:
>
>     def __del__(self):
>         # In case the child hasn't been waited on, check if it's done.
>         self.poll(_deadstate=sys.maxint)
>         if self.sts < 0:
>             if _active is None:
>                 # Child is still running, keep us alive until we can wait on it.
>                 _active.append(self)
>

It probably is obvious, but I think you mean:

    if _active is not None:
        ...

Thanks,
John.

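
[Editorial illustration of the three guards discussed above.] A small sketch, not taken from popen2.py itself, assuming Python 3; the function names are hypothetical and the `active` argument stands in for the module-level `_active` list. It compares the truthiness test, the mistyped `is None` test, and the `is not None` test John suggests, for the two interesting inputs: an empty list and `None` (the value a module global could take during Python 2 interpreter shutdown).

```python
def keep_alive_truthy(child, active):
    """Truthiness guard: skips None, but also skips an empty list."""
    if active:
        active.append(child)
        return True
    return False


def keep_alive_is_none(child, active):
    """Mistyped guard: the branch only runs when active is None."""
    if active is None:
        active.append(child)  # raises AttributeError: None has no append()
        return True
    return False


def keep_alive_not_none(child, active):
    """Corrected guard: appends to any list, even an empty one; skips None."""
    if active is not None:
        active.append(child)
        return True
    return False


if __name__ == "__main__":
    for guard in (keep_alive_truthy, keep_alive_not_none):
        for active in ([], ["other"], None):
            print(guard.__name__, repr(active), "->", guard("child", active))
```

The truthiness guard silently drops the child whenever the list happens to be empty, which is why the explicit `is not None` comparison is the safer spelling when the intent is only to survive the global having been cleared.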
From foom at fuhm.net Fri Jul 21 19:48:28 2006
From: foom at fuhm.net (James Y Knight)
Date: Fri, 21 Jul 2006 13:48:28 -0400
Subject: [Python-Dev] Document performance requirements?
In-Reply-To: <054801c6ace5$09478c50$d503030a@trilan>
References: <e9qbtf$kk6$1@sea.gmane.org> <44C0BF4C.1010408@gmail.com><200607210759.56595.ndbecker2@gmail.com> <44C0C79E.3020309@gmail.com><bb8868b90607210713p4599e8c3ka5fc8c61f8679486@mail.gmail.com> <bb8868b90607210714g2b7dd8e1he8fa00ba842fe3ee@mail.gmail.com> <054801c6ace5$09478c50$d503030a@trilan>
Message-ID: <7458FE96-1A1D-40F3-B585-8E1DA55C4B14@fuhm.net>

On Jul 21, 2006, at 12:45 PM, Giovanni Bajo wrote:

> Jason Orendorff wrote:
>
>>> However, I'm also struggling to think of a case other than list vs
>>> deque where the choice of a builtin or standard library data
>>> structure would be dictated by big-O() concerns.
>>
>> OK, but that doesn't mean the information is unimportant. +1 on
>> making this something of a priority. People looking for this info
>> should find it in the obvious place. Some are unobvious. (How fast is
>> dict.__eq__ on average? Worst case?)
>
> I also found out that most people tend to think of Python's lists as a
> magical data structure optimized for many operations (like a "rope" or
> something complex like that). Documenting that it's just a bare vector
> (std::vector in C++) would be of great help.

Indeed, I was talking to someone a while back who thought that lists
were magically hashed, in that he did something like:

    dictionary = open("/usr/share/dict/words").readlines()

and then expected:

    "word" in dictionary

would be fast. And was very surprised when it turned out to be a slow
linear search of the list.
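
The surprise described here comes from `in` on a list being a linear scan, whereas `in` on a set (or dict) is a hash lookup. A small sketch follows, using a made-up word list in place of /usr/share/dict/words so that it runs anywhere; the names `lines`, `words_list` and `words_set` are mine, not from the thread. Note also that readlines() keeps the trailing newline on each line, so the sketch strips it before testing membership.

```python
# Made-up data standing in for the lines of /usr/share/dict/words.
lines = ["apple\n", "banana\n", "word\n", "zebra\n"]

# As a list, "x in words_list" compares against the elements one by one: O(n).
words_list = [line.strip() for line in lines]

# As a set, "x in words_set" is a hash lookup: O(1) on average.
words_set = set(words_list)

print("word" in words_list)
print("word" in words_set)
print("missing" in words_set)
```

Both containers give the same answers; only the cost of each lookup differs, and that difference grows with the size of the word list.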
:) James From nnorwitz at gmail.com Fri Jul 21 20:31:08 2006 From: nnorwitz at gmail.com (Neal Norwitz) Date: Fri, 21 Jul 2006 11:31:08 -0700 Subject: [Python-Dev] Community buildbots -- reprise In-Reply-To: <20060721172741.29014.1781472149.divmod.quotient.47688@ohm> References: <3f09d5a00607211004kb45ffe2kcedcf418136e7102@mail.gmail.com> <20060721172741.29014.1781472149.divmod.quotient.47688@ohm> Message-ID: <ee2a432c0607211131n67af50a5sa2518d5410a9c8e4@mail.gmail.com> I have a server up and running. I still need to polish some stuff off. I will mail more info when I get a chance. n -- On 7/21/06, Jean-Paul Calderone <exarkun at divmod.com> wrote: > On Fri, 21 Jul 2006 10:04:38 -0700, Grig Gheorghiu <grig.gheorghiu at gmail.com> wrote: > >Hi, > > > >Apart from the goals stated by Glyph, I see this as a very valuable > >effort in convincing people of the value of automated tests, > >Python-related or not. A secondary effect I'd like to see would be for > >these suites of tests to be invoked in a standard fashion -- maybe > >'python setup.py test'. > > > >If PSF can contribute some $$$ towards the hosting of the master > >server, that would be appreciated, but not required. All that's > >required is enough interest from the community. > > > >Please let me know if you're interested. > > > > This is certainly interesting to me. If you need any help setting up > the Twisted buildslave, please let me know. > > Jean-Paul > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > http://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: http://mail.python.org/mailman/options/python-dev/nnorwitz%40gmail.com > From bioinformed at gmail.com Fri Jul 21 20:53:56 2006 From: bioinformed at gmail.com (Kevin Jacobs <jacobs@bioinformed.com>) Date: Fri, 21 Jul 2006 14:53:56 -0400 Subject: [Python-Dev] FW: Bug? 
Certainly a new *behavior* from subprocess in 2.5 on Win32 In-Reply-To: <B6FAC926EFE7B348B12F29CF7E4A93D401328DC8@hammer.office.bhtrader.com> References: <B6FAC926EFE7B348B12F29CF7E4A93D401328DC8@hammer.office.bhtrader.com> Message-ID: <2e1434c10607211153g392d122bj83d705ae82f5fa49@mail.gmail.com> That'll teach me to fire off emails while running out the door. Thanks. -Kevin On 7/21/06, John Benediktsson <john at ewtllc.com> wrote: > > > > The is _active check, unless it intendeds to check for either empty or > > > None, should probably be revised to: > > > > def __del__(self): > > # In case the child hasn't been waited on, check if it's done. > > self.poll(_deadstate=sys.maxint) > > if self.sts < 0: > > if _active is None: > > # Child is still running, keep us alive until we can > > wait on it. > > _active.append(self) > > > > It probably is obvious, but I think you mean: > > if _active is not None: > ... > > Thanks, > John. > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > http://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: > http://mail.python.org/mailman/options/python-dev/jacobs%40bioinformed.com > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: http://mail.python.org/pipermail/python-dev/attachments/20060721/73f4b83d/attachment.html From bioinformed at gmail.com Fri Jul 21 20:56:29 2006 From: bioinformed at gmail.com (Kevin Jacobs <jacobs@bioinformed.com>) Date: Fri, 21 Jul 2006 14:56:29 -0400 Subject: [Python-Dev] first draft of bug guidelines for www.python.org/dev/ In-Reply-To: <17601.3393.544378.871413@montanaro.dyndns.org> References: <bbaeab100607202052t68ac220cs70b32a7c6a3190a2@mail.gmail.com> <50862ebd0607202110m661a3fc8sfd99b4a78b4db1b8@mail.gmail.com> <200607210025.17429.fdrake@acm.org> <bbaeab100607210740pf9e2db5r9f427c8f3184a9f7@mail.gmail.com> <17601.3393.544378.871413@montanaro.dyndns.org> Message-ID: <2e1434c10607211156s582441b1o753f25d2424e94b3@mail.gmail.com> On 7/21/06, skip at pobox.com <skip at pobox.com> wrote: > > > Brett> Sure. It can also wait until we begin discussing the > transition > Brett> to our next bug tracker. > > Would be kinda nice if the new bug tracker allowed submitters to enter a > followup email address without formally logging in. (Of course, > email-based > submissions would go a long way to minimizing the problem.) > It may just be bad karma, but SourceForge tends to lock or go off into lala land whenever I log in. Thus, I would file many bug reports, with a reply-to address, if non-login bug submissions were allowed. My long-term hope is that you toss out SF and get something better. Thanks, -Kevin From ncoghlan at gmail.com Sat Jul 22 04:33:45 2006 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sat, 22 Jul 2006 12:33:45 +1000 Subject: [Python-Dev] Document performance requirements?
In-Reply-To: <bb8868b90607210713p4599e8c3ka5fc8c61f8679486@mail.gmail.com> References: <e9qbtf$kk6$1@sea.gmane.org> <44C0BF4C.1010408@gmail.com> <200607210759.56595.ndbecker2@gmail.com> <44C0C79E.3020309@gmail.com> <bb8868b90607210713p4599e8c3ka5fc8c61f8679486@mail.gmail.com> Message-ID: <44C18E89.5090603@gmail.com> Jason Orendorff wrote: > On 7/21/06, Nick Coghlan <ncoghlan at gmail.com> wrote: >> However, I'm also struggling to think of a case other than list vs >> deque where >> the choice of a builtin or standard library data structure would be >> dictated >> by big-O() concerns. > > OK, but that doesn't mean the information is unimportant. +1 on > making this something of a priority. People looking for this info > should find it in the obvious place. Some are unobvious. (How fast is > dict.__eq__ on average? Worst case?) Agreed, but there's more to doing that than just writing down the O() implied by the current CPython implementation - it's up to Guido to decide which of the constraints are part of the language definition, and which are implementation accidents (e.g. CPython's list.sort() operation was stable for at least one release before GvR made stability part of the definition of the method at the language level). Cheers, Nick. 
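
Sort stability, mentioned above as a guarantee that began life as an implementation detail, means that elements which compare equal keep their original relative order. A small illustration (the records are invented for the example):

```python
records = [("bob", 2), ("alice", 1), ("carol", 2), ("dave", 1)]

# Sort by the number only; entries with the same number keep their input order.
by_number = sorted(records, key=lambda r: r[1])
print(by_number)
```

Here "alice" stays ahead of "dave" and "bob" stays ahead of "carol", because the sort never reorders entries whose keys are equal. Code that relies on this is relying on exactly the kind of documented guarantee the thread is asking for.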
-- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- http://www.boredomandlaziness.org From ncoghlan at gmail.com Sat Jul 22 04:37:37 2006 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sat, 22 Jul 2006 12:37:37 +1000 Subject: [Python-Dev] first draft of bug guidelines for www.python.org/dev/ In-Reply-To: <2e1434c10607211156s582441b1o753f25d2424e94b3@mail.gmail.com> References: <bbaeab100607202052t68ac220cs70b32a7c6a3190a2@mail.gmail.com> <50862ebd0607202110m661a3fc8sfd99b4a78b4db1b8@mail.gmail.com> <200607210025.17429.fdrake@acm.org> <bbaeab100607210740pf9e2db5r9f427c8f3184a9f7@mail.gmail.com> <17601.3393.544378.871413@montanaro.dyndns.org> <2e1434c10607211156s582441b1o753f25d2424e94b3@mail.gmail.com> Message-ID: <44C18F71.6020106@gmail.com> Kevin Jacobs <jacobs at bioinformed.com> wrote: > It may just be bad karma, but SourceForge tends to lock or go off into > lala land whenever I log in. Thus, I would file many bug reports, with > a reply-to address, if non-login bug submissions where allowed. My long > term hope is that you toss out SF and get something better. You're not the only one with that hope. With at least Trac, Jira and Roundup to choose from, the PSF's current tracker shootout should find us that replacement :) Cheers, Nick. 
-- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- http://www.boredomandlaziness.org From arigo at tunes.org Sat Jul 22 09:07:10 2006 From: arigo at tunes.org (Armin Rigo) Date: Sat, 22 Jul 2006 09:07:10 +0200 Subject: [Python-Dev] new security doc using object-capabilities In-Reply-To: <bbaeab100607191535p543cb0ddj66410cf985dd9b77@mail.gmail.com> References: <bbaeab100607191535p543cb0ddj66410cf985dd9b77@mail.gmail.com> Message-ID: <20060722070710.GA22334@code0.codespeak.net> Hi Brett, On Wed, Jul 19, 2006 at 03:35:45PM -0700, Brett Cannon wrote: > I also plan to rewrite the import machinery in pure Python. http://codespeak.net/svn/pypy/dist/pypy/module/__builtin__/importing.py A bientot, Armin From martin at v.loewis.de Sat Jul 22 10:34:08 2006 From: martin at v.loewis.de (=?ISO-8859-1?Q?=22Martin_v=2E_L=F6wis=22?=) Date: Sat, 22 Jul 2006 10:34:08 +0200 Subject: [Python-Dev] Document performance requirements? In-Reply-To: <bb8868b90607210714g2b7dd8e1he8fa00ba842fe3ee@mail.gmail.com> References: <e9qbtf$kk6$1@sea.gmane.org> <44C0BF4C.1010408@gmail.com> <200607210759.56595.ndbecker2@gmail.com> <44C0C79E.3020309@gmail.com> <bb8868b90607210713p4599e8c3ka5fc8c61f8679486@mail.gmail.com> <bb8868b90607210714g2b7dd8e1he8fa00ba842fe3ee@mail.gmail.com> Message-ID: <44C1E300.8050803@v.loewis.de> Jason Orendorff wrote: > On 7/21/06, Nick Coghlan <ncoghlan at gmail.com> wrote: >> However, I'm also struggling to think of a case other than list vs deque where >> the choice of a builtin or standard library data structure would be dictated >> by big-O() concerns. > > OK, but that doesn't mean the information is unimportant. +1 on > making this something of a priority. People looking for this info > should find it in the obvious place. Some are unobvious. (How fast is > dict.__eq__ on average? Worst case?) Contributions are welcome. 
Regards, Martin From martin at v.loewis.de Sat Jul 22 10:39:35 2006 From: martin at v.loewis.de (=?ISO-8859-1?Q?=22Martin_v=2E_L=F6wis=22?=) Date: Sat, 22 Jul 2006 10:39:35 +0200 Subject: [Python-Dev] Community buildbots -- reprise In-Reply-To: <3f09d5a00607211004kb45ffe2kcedcf418136e7102@mail.gmail.com> References: <3f09d5a00607211004kb45ffe2kcedcf418136e7102@mail.gmail.com> Message-ID: <44C1E447.2020404@v.loewis.de> Grig Gheorghiu wrote: > Please let me know if you're interested. As I said earlier: If you need some kind of post-commit trigger on the python repository to trigger a build, just let me know. We currently use a more-or-less plain svn_buildbot.py to trigger our own builds. Regards, Martin From arigo at tunes.org Sat Jul 22 13:19:07 2006 From: arigo at tunes.org (Armin Rigo) Date: Sat, 22 Jul 2006 13:19:07 +0200 Subject: [Python-Dev] Document performance requirements? In-Reply-To: <44C18E89.5090603@gmail.com> References: <e9qbtf$kk6$1@sea.gmane.org> <44C0BF4C.1010408@gmail.com> <200607210759.56595.ndbecker2@gmail.com> <44C0C79E.3020309@gmail.com> <bb8868b90607210713p4599e8c3ka5fc8c61f8679486@mail.gmail.com> <44C18E89.5090603@gmail.com> Message-ID: <20060722111907.GA13782@code0.codespeak.net> Hi, On Sat, Jul 22, 2006 at 12:33:45PM +1000, Nick Coghlan wrote: > Agreed, but there's more to doing that than just writing down the O() implied > by the current CPython implementation - it's up to Guido to decide which of > the constraints are part of the language definition, and which are > implementation accidents I think that O-wise the current CPython situation should be documented as a "minimal requirement" for implementations of the language, with just one exception: the well-documented "don't rely on this" hack in 2.4 to make repeated 'str += str' amortized linear, for which the 2.3 quadratic behavior is considered compliant enough. 
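
For readers unfamiliar with the 'str += str' case: building a string by repeated concatenation may copy the accumulated string on every step (quadratic overall) unless the implementation applies an optimization such as the CPython one referred to above. The usual portable idiom is to collect the pieces and join them once. A sketch, with invented data:

```python
pieces = [str(i) for i in range(1000)]

# Repeated concatenation: may be quadratic on implementations that do not
# special-case it, since each += can copy everything built so far.
s = ""
for p in pieces:
    s += p

# Joining once is expected to be linear in the total length.
joined = "".join(pieces)

print(s == joined)
```

Both approaches produce the same string; the question raised in this thread is only which cost a program is allowed to count on.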
I suppose that allowing implementations to provide better algorithmic complexities than required is fine, although I can think of some problems with that (e.g. nice and efficient user code that would perform horribly badly on CPython). Armin From arigo at tunes.org Sat Jul 22 13:46:15 2006 From: arigo at tunes.org (Armin Rigo) Date: Sat, 22 Jul 2006 13:46:15 +0200 Subject: [Python-Dev] new security doc using object-capabilities In-Reply-To: <bbaeab100607191535p543cb0ddj66410cf985dd9b77@mail.gmail.com> References: <bbaeab100607191535p543cb0ddj66410cf985dd9b77@mail.gmail.com> Message-ID: <20060722114615.GB13782@code0.codespeak.net> Re-hi, On Wed, Jul 19, 2006 at 03:35:45PM -0700, Brett Cannon wrote: > http://svn.python.org/view/python/branches/bcannon-sandboxing/securing_python.txt?rev=50717&view=log. I'm not sure I understand what you propose to fix holes like constructors and __subclasses__: it seems that you want to remove them altogether (and e.g. make factory functions instead). That would completely break all programs, right? I mean, there is no way such changes would go into mainstream CPython. Or do you propose to maintain a CPython branch manually for the foreseeable future? (From experience this is a bad idea...) A bientot, Armin From grig.gheorghiu at gmail.com Sat Jul 22 17:32:07 2006 From: grig.gheorghiu at gmail.com (Grig Gheorghiu) Date: Sat, 22 Jul 2006 08:32:07 -0700 Subject: [Python-Dev] Community buildbots -- reprise In-Reply-To: <44C1E447.2020404@v.loewis.de> References: <3f09d5a00607211004kb45ffe2kcedcf418136e7102@mail.gmail.com> <44C1E447.2020404@v.loewis.de> Message-ID: <3f09d5a00607220832n57f0c5a6m8a193eb8e405846a@mail.gmail.com> On 7/22/06, "Martin v. Löwis" <martin at v.loewis.de> wrote: > > Grig Gheorghiu wrote: > > Please let me know if you're interested. > > As I said earlier: If you need some kind of post-commit > trigger on the python repository to trigger a build, just > let me know.
We currently use a more-or-less plain > svn_buildbot.py to trigger our own builds. Wouldn't that put too much of a burden on the python core build system? It would have to be aware of all the buildslaves running specific projects. I was thinking about having a dedicated buildmaster machine, such as the one Neal says he already has, and configure that machine to coordinate a small army of buildslaves which will be contributed for people interested in this effort. Grig -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/python-dev/attachments/20060722/f68aec48/attachment.htm From martin at v.loewis.de Sat Jul 22 17:53:23 2006 From: martin at v.loewis.de (=?ISO-8859-1?Q?=22Martin_v=2E_L=F6wis=22?=) Date: Sat, 22 Jul 2006 17:53:23 +0200 Subject: [Python-Dev] Community buildbots -- reprise In-Reply-To: <3f09d5a00607220832n57f0c5a6m8a193eb8e405846a@mail.gmail.com> References: <3f09d5a00607211004kb45ffe2kcedcf418136e7102@mail.gmail.com> <44C1E447.2020404@v.loewis.de> <3f09d5a00607220832n57f0c5a6m8a193eb8e405846a@mail.gmail.com> Message-ID: <44C249F3.8000908@v.loewis.de> Grig Gheorghiu wrote: > As I said earlier: If you need some kind of post-commit > trigger on the python repository to trigger a build, just > let me know. We currently use a more-or-less plain > svn_buildbot.py to trigger our own builds. > > Wouldn't that put too much of a burden on the python core build system? > It would have to be aware of all the buildslaves running specific projects. If there is a single "community buildbot", then no. In any case, it's primarily administrative overhead, not so much cycles. python.org does so many things simultaneously, making it trigger an additional build remotely doesn't hurt. 
> I was thinking about having a dedicated buildmaster machine, such as the > one Neal says he already has, and configure that machine to coordinate a > small army of buildslaves which will be contributed for people > interested in this effort. Right. You still need to find out when to rebuild, and getting triggers from the source repositories is likely the easiest solution. Regards, Martin From brett at python.org Sat Jul 22 19:30:23 2006 From: brett at python.org (Brett Cannon) Date: Sat, 22 Jul 2006 10:30:23 -0700 Subject: [Python-Dev] new security doc using object-capabilities In-Reply-To: <20060722114615.GB13782@code0.codespeak.net> References: <bbaeab100607191535p543cb0ddj66410cf985dd9b77@mail.gmail.com> <20060722114615.GB13782@code0.codespeak.net> Message-ID: <bbaeab100607221030v5df16602ya72f58a43e82006b@mail.gmail.com> On 7/22/06, Armin Rigo <arigo at tunes.org> wrote: > Re-hi, > > On Wed, Jul 19, 2006 at 03:35:45PM -0700, Brett Cannon wrote: > > http://svn.python.org/view/python/branches/bcannon-sandboxing/securing_python.txt?rev=50717&view=log . > > I'm not sure I understand what you propose to fix holes like > constructors and __subclasses__: it seems that you want to remove them > altogether (and e.g. make factory functions instead). That would > completely break all programs, right? Not altogether, just constructors on select types who are considered dangerous from a security standpoint. The breakage won't be horrible, but it will be there for advanced Python code. I will try to make the wording more clear when I get back to work on Tuesday. > I mean, there is no way such > changes would go into mainstream CPython. If this has to wait until Py3k then so be it. > Or do you propose to maintain > a CPython branch manually for the foreseeable future? (From experience > this is a bad idea...) > Yeah, not my idea of fun either, but since this is a long term project, I will at least need to for the foreseeable future. 
-Brett -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/python-dev/attachments/20060722/3f1b39e2/attachment.htm From brett at python.org Sat Jul 22 19:33:19 2006 From: brett at python.org (Brett Cannon) Date: Sat, 22 Jul 2006 10:33:19 -0700 Subject: [Python-Dev] new security doc using object-capabilities In-Reply-To: <20060722070710.GA22334@code0.codespeak.net> References: <bbaeab100607191535p543cb0ddj66410cf985dd9b77@mail.gmail.com> <20060722070710.GA22334@code0.codespeak.net> Message-ID: <bbaeab100607221033t229d80c3h35b669009787cc87@mail.gmail.com> On 7/22/06, Armin Rigo <arigo at tunes.org> wrote: > > Hi Brett, > > On Wed, Jul 19, 2006 at 03:35:45PM -0700, Brett Cannon wrote: > > I also plan to rewrite the import machinery in pure Python. > > http://codespeak.net/svn/pypy/dist/pypy/module/__builtin__/importing.py Thanks for the link, Armin. Since you guys don't have the import restrictions the CPython version would have and just have different coding needs for RPython obviously I can't just do a blind copy. But I will definitely take a look as I develop. Maybe you guys can even help to lower the duplication if it makes sense for you. BTW, do you guys happen to have extra tests from import? -Brett -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/python-dev/attachments/20060722/208e6338/attachment.html From grig.gheorghiu at gmail.com Sat Jul 22 20:16:00 2006 From: grig.gheorghiu at gmail.com (Grig Gheorghiu) Date: Sat, 22 Jul 2006 11:16:00 -0700 Subject: [Python-Dev] Community buildbots -- reprise In-Reply-To: <44C249F3.8000908@v.loewis.de> References: <3f09d5a00607211004kb45ffe2kcedcf418136e7102@mail.gmail.com> <44C1E447.2020404@v.loewis.de> <3f09d5a00607220832n57f0c5a6m8a193eb8e405846a@mail.gmail.com> <44C249F3.8000908@v.loewis.de> Message-ID: <3f09d5a00607221116m37854df2kafaa381c9e2f81dc@mail.gmail.com> On 7/22/06, "Martin v. 
Löwis" <martin at v.loewis.de> wrote: > > Grig Gheorghiu wrote: > > As I said earlier: If you need some kind of post-commit > > trigger on the python repository to trigger a build, just > > let me know. We currently use a more-or-less plain > > svn_buildbot.py to trigger our own builds. > > > > Wouldn't that put too much of a burden on the python core build system? > > It would have to be aware of all the buildslaves running specific > projects. > > If there is a single "community buildbot", then no. In any case, it's > primarily administrative overhead, not so much cycles. python.org does > so many things simultaneously, making it trigger an additional build > remotely doesn't hurt. > > > I was thinking about having a dedicated buildmaster machine, such as the > > one Neal says he already has, and configure that machine to coordinate a > > small army of buildslaves which will be contributed for people > > interested in this effort. > > Right. You still need to find out when to rebuild, and getting triggers > from the source repositories is likely the easiest solution. I see... I guess I was thinking about building periodically (every X hours or at time Y) as opposed to getting svn triggers on each check-in. But if, as you're saying, the overhead on python.org is not too great, we can do what you suggested. Grig -- http://agiletesting.blogspot.com -------------- next part -------------- An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/python-dev/attachments/20060722/b7d1a491/attachment.htm From greg.ewing at canterbury.ac.nz Sun Jul 23 03:24:43 2006 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Sun, 23 Jul 2006 13:24:43 +1200 Subject: [Python-Dev] new security doc using object-capabilities In-Reply-To: <20060722114615.GB13782@code0.codespeak.net> References: <bbaeab100607191535p543cb0ddj66410cf985dd9b77@mail.gmail.com> <20060722114615.GB13782@code0.codespeak.net> Message-ID: <44C2CFDB.4060700@canterbury.ac.nz> Armin Rigo wrote: > I'm not sure I understand what you propose to fix holes like > constructors and __subclasses__: it seems that you want to remove them > altogether (and e.g. make factory functions instead). That would > completely break all programs, right? I mean, there is no way such > changes would go into mainstream CPython. How much code is actually out there that uses __subclasses__? It seems like a fairly esoteric corner of the language to me. In any case, I think this approach should certainly be tried, and if it works out, considered for Py3k. -- Greg From david.nospam.hopwood at blueyonder.co.uk Sun Jul 23 03:18:48 2006 From: david.nospam.hopwood at blueyonder.co.uk (David Hopwood) Date: Sun, 23 Jul 2006 02:18:48 +0100 Subject: [Python-Dev] new security doc using object-capabilities In-Reply-To: <20060722114615.GB13782@code0.codespeak.net> References: <bbaeab100607191535p543cb0ddj66410cf985dd9b77@mail.gmail.com> <20060722114615.GB13782@code0.codespeak.net> Message-ID: <44C2CE78.9020802@blueyonder.co.uk> Armin Rigo wrote: > Re-hi, > > On Wed, Jul 19, 2006 at 03:35:45PM -0700, Brett Cannon wrote: > >>http://svn.python.org/view/python/branches/bcannon-sandboxing/securing_python.txt?rev=50717&view=log. > > I'm not sure I understand what you propose to fix holes like > constructors and __subclasses__: it seems that you want to remove them > altogether (and e.g. make factory functions instead). 
That would > completely break all programs, right? I mean, there is no way such > changes would go into mainstream CPython. If I understand correctly, the proposal is that any incompatible changes to the language would apply only in "sandboxed" interpreters. So there is no reason why support for these couldn't go into the main branch. Of course we want to minimize the changes that will need to be made to programs and libraries to make them work in a sandboxed interpreter, but not at the expense of security. Some incompatible changes will be necessary. -- David Hopwood <david.nospam.hopwood at blueyonder.co.uk> From arigo at tunes.org Sun Jul 23 08:54:10 2006 From: arigo at tunes.org (Armin Rigo) Date: Sun, 23 Jul 2006 08:54:10 +0200 Subject: [Python-Dev] new security doc using object-capabilities In-Reply-To: <bbaeab100607221033t229d80c3h35b669009787cc87@mail.gmail.com> References: <bbaeab100607191535p543cb0ddj66410cf985dd9b77@mail.gmail.com> <20060722070710.GA22334@code0.codespeak.net> <bbaeab100607221033t229d80c3h35b669009787cc87@mail.gmail.com> Message-ID: <20060723065409.GC13782@code0.codespeak.net> Hi Brett, On Sat, Jul 22, 2006 at 10:33:19AM -0700, Brett Cannon wrote: > Thanks for the link, Armin. Since you guys don't have the import > restrictions the CPython version would have and just have different coding > needs for RPython obviously I can't just do a blind copy. But I will > definitely take a look as I develop. Maybe you guys can even help to lower > the duplication if it makes sense for you. Yes, it should be possible to abstract the common logic in some way, using some kind of interface for all OS inspection and 'sys.modules' manipulations. > BTW, do you guys happen to have extra tests from import? Yes, there is http://codespeak.net/svn/pypy/dist/pypy/module/__builtin__/test/test_import.py which will also need a bit of rewriting, but that should be straightforward. 
A bientot, Armin From arigo at tunes.org Sun Jul 23 09:08:12 2006 From: arigo at tunes.org (Armin Rigo) Date: Sun, 23 Jul 2006 09:08:12 +0200 Subject: [Python-Dev] new security doc using object-capabilities In-Reply-To: <44C2CE78.9020802@blueyonder.co.uk> References: <bbaeab100607191535p543cb0ddj66410cf985dd9b77@mail.gmail.com> <20060722114615.GB13782@code0.codespeak.net> <44C2CE78.9020802@blueyonder.co.uk> Message-ID: <20060723070811.GD13782@code0.codespeak.net> Hi David, hi Brett, On Sun, Jul 23, 2006 at 02:18:48AM +0100, David Hopwood wrote: > If I understand correctly, the proposal is that any incompatible changes > to the language would apply only in "sandboxed" interpreters. So there is > no reason why support for these couldn't go into the main branch. That's what I originally thought too, but Brett writes: Implementation Details ======================== An important point to keep in mind when reading about the implementation details for the security model is that these are general changes and are not special to any type of interpreter, sandboxed or otherwise. That means if a change to a built-in type is suggested and it does not involve a proxy, that change is meant Python-wide for *all* interpreters. So that's why I'm starting to worry that Brett is proposing to change the regular Python language too. However, Brett, you also say somewhere else that backward compatibility is not an issue. So I'm a bit confused actually... Also, I hate to sound self-centered, but I should point out somewhere that PyPy was started by people who no longer wanted to maintain a fork of CPython, and preferred to work on building CPython-like variants automatically. Many of the security features you list would be quite easier to implement and maintain in PyPy than CPython -- also from a security perspective: it is easier to be sure that some protection is complete, and remains complete over time, if it is systematically generated instead of hand-patched in a dozen places. 
A bientot, Armin From rasky at develer.com Sun Jul 23 15:30:50 2006 From: rasky at develer.com (Giovanni Bajo) Date: Sun, 23 Jul 2006 15:30:50 +0200 Subject: [Python-Dev] Document performance requirements? References: <e9qbtf$kk6$1@sea.gmane.org> <44C0BF4C.1010408@gmail.com><200607210759.56595.ndbecker2@gmail.com><44C0C79E.3020309@gmail.com><bb8868b90607210713p4599e8c3ka5fc8c61f8679486@mail.gmail.com><44C18E89.5090603@gmail.com> <20060722111907.GA13782@code0.codespeak.net> Message-ID: <008201c6ae5c$36b4ca40$d503030a@trilan> Armin Rigo wrote: > I think that O-wise the current CPython situation should be documented > as a "minimal requirement" for implementations of the language, with > just one exception: the well-documented "don't rely on this" hack in > 2.4 to make repeated 'str += str' amortized linear, for which the 2.3 > quadratic behavior is considered compliant enough. > > I suppose that allowing implementations to provide better algorithmic > complexities than required is fine, although I can think of some > problems with that (e.g. nice and efficient user code that would > perform horribly badly on CPython). I'm not sure big-O tells the whole truth. For instance, do we want to allow an implementation to use a hash table as the underlying type for a list? It would match big-O requirements, but would still be slower than a plain array because of the higher overhead of the implementation (higher constant factor). And if this is allowed, I would like to find in CPython tutorials and documentation a simple statement like: "to implement the list and match its requirements, CPython chose a simple array as the underlying data structure".
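
One place where the array-based list shows through is the list vs deque case mentioned earlier in this thread: in CPython, removing from the front of a list shifts every remaining element, while collections.deque is documented as supporting approximately O(1) appends and pops at both ends. A minimal sketch (variable names are mine):

```python
from collections import deque

items = list(range(5))
queue = deque(items)

# list.pop(0) is O(n) in CPython: the remaining elements are shifted down.
first_from_list = items.pop(0)

# deque.popleft() is approximately O(1).
first_from_deque = queue.popleft()

print(first_from_list, first_from_deque)
print(items, list(queue))
```

The results are identical; only the cost differs, which is the kind of information a short statement in the documentation could make explicit.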
--
Giovanni Bajo

From tnelson at onresolve.com Sun Jul 23 19:42:59 2006
From: tnelson at onresolve.com (Trent Nelson)
Date: Sun, 23 Jul 2006 19:42:59 +0200
Subject: [Python-Dev] Python 2.4, VS 2005 & Profile Guided Optmization
Message-ID: <AB1BB4CF81B9214682079EB55A7125EDEFA707@mapibe05.exchange.xchg>

Hi,

Has anyone else built Python with Visual Studio 2005 and played around
with Profile Guided Optimization? I had to build Python from source w/
VS 2005 as I had a few .pyd's built with VS 2005 that I wanted to load;
I ended up playing around with Profile Guided Optimization, running
``python.exe pystone.py'' to collect call-graph data after
python.exe/Python24.dll had been instrumented, then recompiling with the
optimizations fed back in.

Results were interesting: an average speedup of around 33% was
noticeable.

ActiveState 2.4.3 python.exe:

    C:\Python24>python.exe Lib\test\pystone.py
    Pystone(1.1) time for 50000 passes = 0.980119
    This machine benchmarks at 51014.2 pystones/second

The python compiled from branches/release24-maint with VS 2005 + profile
guided optimization:

    C:\Python24>python.exe Lib\test\pystone.py
    Pystone(1.1) time for 50000 passes = 0.73261
    This machine benchmarks at 68249.2 pystones/second

Is there any motivation in the Win32 Python dev camp to switch from VC6
to VS 2005?

FWIW, although there were a shed-load of warnings when compiling python
and pythoncore (and a lot more errors when compiling other modules), I
only had to apply one patch to get it working well enough to run
pystone.py. Without this patch, the VC8 CRT aborts at runtime as soon
as an invalid signal is passed to signal(), which is inevitable given
the current code in the initsignal() function:

    for (i = 1; i < NSIG; i++) {
        void (*t)(int);
        t = PyOS_getsig(i);

Regards,

Trent.
--
http://www.onresolve.com

Index: signalmodule.c
===================================================================
--- signalmodule.c (revision 47196)
+++ signalmodule.c (working copy)
@@ -280,7 +280,21 @@
     {NULL, NULL} /* sentinel */
 };
 
+#define WIN32VS2005HACK
+#ifdef WIN32VS2005HACK
+#include <stdio.h>
+#include <stdlib.h>
+#include <crtdbg.h>
+void dummy_handler(const wchar_t *exp,
+                   const wchar_t *fn,
+                   const wchar_t *file,
+                   unsigned int line,
+                   uintptr_t reserved)
+{
+}
+#endif
+
 PyDoc_STRVAR(module_doc,
 "This module provides mechanisms to use signal handlers in Python.\n\
 \n\
@@ -339,6 +353,12 @@
         goto finally;
     Py_INCREF(IntHandler);
 
+#ifdef WIN32VS2005HACK
+    (void)_set_invalid_parameter_handler(&dummy_handler);
+    _CrtSetReportMode(_CRT_ASSERT, 0);
+#endif
+
+
     Handlers[0].tripped = 0;
     for (i = 1; i < NSIG; i++) {
         void (*t)(int);

From tonynelson at georgeanelson.com Sun Jul 23 19:26:59 2006
From: tonynelson at georgeanelson.com (Tony Nelson)
Date: Sun, 23 Jul 2006 13:26:59 -0400
Subject: [Python-Dev] Socket Timeouts patch 1519025
Message-ID: <v04020a00c0e95c95d9bc@[192.168.123.162]>

I request a review of my patch (1519025) to get socket timeouts to work
properly with errors and signals. I don't expect this patch would make it
into 2.5, but perhaps it could be in 2.5.1, as it fixes a long-standing
bug. I know that people are busy with getting 2.5 out the door, but it
would be helpful for me to know if my current patch is OK before I start
on another patch to make socket timeouts more useful.

There is also a version of the patch for 2.4, which would make yum nicer
in Fedora 4 and 5, and I think that passing a review would make the patch
more acceptable to Fedora's maintainers.

My next patch will, if it works, make socket timeouts easier to use per-thread, allow for the timing of entire operations rather than just timing transaction phases, allow for setting an acceptable rate for file transfers, and should be completely backward compatible, in that old code would be unaffected and new code would work as well as possible now on older unpatched versions. That's my plan, anyway. It would build on my current patch, at least in its principles. ____________________________________________________________________ TonyN.:' <mailto:tonynelson at georgeanelson.com> ' <http://www.georgeanelson.com/> From brett at python.org Sun Jul 23 22:00:06 2006 From: brett at python.org (Brett Cannon) Date: Sun, 23 Jul 2006 13:00:06 -0700 Subject: [Python-Dev] new security doc using object-capabilities In-Reply-To: <20060723070811.GD13782@code0.codespeak.net> References: <bbaeab100607191535p543cb0ddj66410cf985dd9b77@mail.gmail.com> <20060722114615.GB13782@code0.codespeak.net> <44C2CE78.9020802@blueyonder.co.uk> <20060723070811.GD13782@code0.codespeak.net> Message-ID: <bbaeab100607231300w187dfa8dx87ef0a94935e56ce@mail.gmail.com> On 7/23/06, Armin Rigo <arigo at tunes.org> wrote: > > Hi David, hi Brett, > > On Sun, Jul 23, 2006 at 02:18:48AM +0100, David Hopwood wrote: > > If I understand correctly, the proposal is that any incompatible changes > > to the language would apply only in "sandboxed" interpreters. So there > is > > no reason why support for these couldn't go into the main branch. > > That's what I originally thought too, but Brett writes: > > Implementation Details > ======================== > > An important point to keep in mind when reading about the > implementation details for the security model is that these are > general changes and are not special to any type of interpreter, > sandboxed or otherwise. 
That means if a change to a built-in type is > suggested and it does not involve a proxy, that change is meant > Python-wide for *all* interpreters. > > So that's why I'm starting to worry that Brett is proposing to change > the regular Python language too. Yes, I am proposing changing some constructors and methods on some built-in types for the regular languages. That's it. No new keywords or major semantic changes and such. If I make changes just for sandboxed interpreters it changes the general approach of the security model by then requiring an identity check to see if the interpreter is sandboxed or not. However, Brett, you also say somewhere > else that backward compatibility is not an issue. So I'm a bit confused > actually... Since this is my Ph.D. dissertation first and foremost, I am not going to tie my hands in such a way that I have to make too much of a compromise in order for this to work. I obviously don't want to change the feel of Python, but if I have to remove the constructor for code objects to prevent evil bytecode or __subclasses__() from object to prevent poking around stuff, then so be it. For this project, security is trumping backwards-compatibility when the latter is impossible in order to have the former. I will obviously try to minimize it, but something that works at such a basic level of the language is just going to require some changes for it to work. Also, I hate to sound self-centered, but I should point out somewhere > that PyPy was started by people who no longer wanted to maintain a fork > of CPython, and preferred to work on building CPython-like variants > automatically. Many of the security features you list would be quite > easier to implement and maintain in PyPy than CPython -- also from a > security perspective: it is easier to be sure that some protection is > complete, and remains complete over time, if it is systematically > generated instead of hand-patched in a dozen places. It doesn't sound self-centered.
=) Problem is that my knowledge base is obviously all in CPython so my startup costs are much lower than if I tried this in PyPy. Plus there is the point of embedding this into Firefox (possibly) eventually. Does PyPy support embedding yet at the C level? -Brett -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/python-dev/attachments/20060723/229c8fb8/attachment.htm From rasky at develer.com Sun Jul 23 22:41:46 2006 From: rasky at develer.com (Giovanni Bajo) Date: Sun, 23 Jul 2006 22:41:46 +0200 Subject: [Python-Dev] Python 2.4, VS 2005 & Profile Guided Optmization References: <AB1BB4CF81B9214682079EB55A7125EDEFA707@mapibe05.exchange.xchg> Message-ID: <034401c6ae98$6a0da920$d503030a@trilan> Trent Nelson wrote: > Has anyone else built Python with Visual Studio 2005 and played around > with Profile Guided Optimization? Yes, there was some work at the recent Need for Speed sprint. Python 2.5 has a PCBuild8 directory (for VS 2005) with a specific project for PGO. > Results were interesting, an average speedup of around 33% was > noticeable: Yes, they are. > Is there any motivation in the Win32 Python dev camp to switch from > VC6 to VS 2005? I think Martin decided to keep VC71 (Visual Studio .NET 2003) for another release cycle. Given the impressive results of VC8 with PGO, and the fact that Visual Studio Express 2005 is free forever, I would hope as well for the decision to be reconsidered. -- Giovanni Bajo From pje at telecommunity.com Sun Jul 23 22:45:54 2006 From: pje at telecommunity.com (Phillip J. 
Eby) Date: Sun, 23 Jul 2006 16:45:54 -0400 Subject: [Python-Dev] new security doc using object-capabilities In-Reply-To: <bbaeab100607231300w187dfa8dx87ef0a94935e56ce@mail.gmail.com> References: <20060723070811.GD13782@code0.codespeak.net> <bbaeab100607191535p543cb0ddj66410cf985dd9b77@mail.gmail.com> <20060722114615.GB13782@code0.codespeak.net> <44C2CE78.9020802@blueyonder.co.uk> <20060723070811.GD13782@code0.codespeak.net> Message-ID: <5.1.1.6.0.20060723164246.04144298@sparrow.telecommunity.com> At 01:00 PM 7/23/2006 -0700, Brett Cannon wrote: >I obviously don't want to change the feel of Python, but if I have to >remove the constructor for code objects to prevent evil bytecode or >__subclasses__() from object to prevent poking around stuff, then so be >it. For this project, security is trumping backwards-compatibility when >the latter is impossible in order to have the former. I will obviously >try to minimize it, but something that works at such a basic level of the >language is just going to require some changes for it to work. Zope 3's sandboxing machinery manages to handle securing these things without any language changes. So, declaring it "impossible" to manage without backward compatibility seems inappropriate, or at least incorrect. But perhaps there is something I'm missing? From david.nospam.hopwood at blueyonder.co.uk Mon Jul 24 00:07:25 2006 From: david.nospam.hopwood at blueyonder.co.uk (David Hopwood) Date: Sun, 23 Jul 2006 23:07:25 +0100 Subject: [Python-Dev] new security doc using object-capabilities In-Reply-To: <5.1.1.6.0.20060723164246.04144298@sparrow.telecommunity.com> References: <20060723070811.GD13782@code0.codespeak.net> <bbaeab100607191535p543cb0ddj66410cf985dd9b77@mail.gmail.com> <20060722114615.GB13782@code0.codespeak.net> <44C2CE78.9020802@blueyonder.co.uk> <20060723070811.GD13782@code0.codespeak.net> <5.1.1.6.0.20060723164246.04144298@sparrow.telecommunity.com> Message-ID: <44C3F31D.3070201@blueyonder.co.uk> Phillip J.
Eby wrote: > At 01:00 PM 7/23/2006 -0700, Brett Cannon wrote: > >>I obviously don't want to change the feel of Python, but if I have to >>remove the constructor for code objects to prevent evil bytecode or >>__subclasses__() from object to prevent poking around stuff, then so be >>it. For this project, security is [trumping] backwards-compatibility when >>the latter is impossible in order to have the former. I will obviously >>try to minimize it, but something that works at such a basic level of the >>language is just going to require some changes for it to work. > > Zope 3's sandboxing machinery manages to handle securing these things > without any language changes. So, declaring it "impossible" to manage > without backward compatibility seems inappropriate, or at least > incorrect. ... if Zope's sandboxing is secure. I haven't done a security review of it, but your argument assumes that it is. In any case, Zope's sandboxing is not capability-based. -- David Hopwood <david.nospam.hopwood at blueyonder.co.uk> From foom at fuhm.net Mon Jul 24 01:59:08 2006 From: foom at fuhm.net (James Y Knight) Date: Sun, 23 Jul 2006 19:59:08 -0400 Subject: [Python-Dev] Python 2.4, VS 2005 & Profile Guided Optmization In-Reply-To: <034401c6ae98$6a0da920$d503030a@trilan> References: <AB1BB4CF81B9214682079EB55A7125EDEFA707@mapibe05.exchange.xchg> <034401c6ae98$6a0da920$d503030a@trilan> Message-ID: <646666C6-A1DB-4D2C-8054-FA9F7E2129E4@fuhm.net> On Jul 23, 2006, at 4:41 PM, Giovanni Bajo wrote: > I think Martin decided to keep VC71 (Visual Studio .NET 2003) for > another > release cycle. Given the impressive results of VC8 with PGO, and > the fact > that Visual Studio Express 2005 is free forever, I would hope as > well for > the decision to be reconsidered. Wasn't there a "Free Forever" 2003 edition too, which has since completely disappeared? Why do you think that MS won't stop distributing the Free Forever VS 2005 once VS 2005+1 comes out, the same way they did the 2003 one? 
James From david.nospam.hopwood at blueyonder.co.uk Mon Jul 24 01:32:47 2006 From: david.nospam.hopwood at blueyonder.co.uk (David Hopwood) Date: Mon, 24 Jul 2006 00:32:47 +0100 Subject: [Python-Dev] new security doc using object-capabilities In-Reply-To: <bbaeab100607231300w187dfa8dx87ef0a94935e56ce@mail.gmail.com> References: <bbaeab100607191535p543cb0ddj66410cf985dd9b77@mail.gmail.com> <20060722114615.GB13782@code0.codespeak.net> <44C2CE78.9020802@blueyonder.co.uk> <20060723070811.GD13782@code0.codespeak.net> <bbaeab100607231300w187dfa8dx87ef0a94935e56ce@mail.gmail.com> Message-ID: <44C4071F.5050003@blueyonder.co.uk> Brett Cannon wrote: > On 7/23/06, Armin Rigo <arigo at tunes.org> wrote: > >> Hi David, hi Brett, >> >> On Sun, Jul 23, 2006 at 02:18:48AM +0100, David Hopwood wrote: >> > If I understand correctly, the proposal is that any incompatible >> > changes to the language would apply only in "sandboxed" interpreters. >> > So there is no reason why support for these couldn't go into the >> > main branch. >> >> That's what I originally thought too, but Brett writes: >> >> Implementation Details >> ======================== >> >> An important point to keep in mind when reading about the >> implementation details for the security model is that these are >> general changes and are not special to any type of interpreter, >> sandboxed or otherwise. That means if a change to a built-in type is >> suggested and it does not involve a proxy, that change is meant >> Python-wide for *all* interpreters. >> >> So that's why I'm starting to worry that Brett is proposing to change >> the regular Python language too. > > Yes, I am proposing changing some constructors and methods on some built-in > types for the regular languages. That's it. No new keywords or major > semantic changes and such. 
If I make changes just for sandboxed > interpreters it changes the general approach of the security model by then > requiring an identity check to see if the interpreter is sandboxed or not. I assume that the extent of incompatible changes would be limited as much as possible. So the only checks would be in operations that are directly affected by whatever incompatible changes are made. The performance and complexity costs of this are likely to be small -- or at least should not be assumed to be large before having hammered out a more detailed design. Suppose, for the sake of argument, that we introduced private methods and attributes. If an attribute in an existing standard library class was changed to be private, then code depending on it would break. But if there were a notion of a "compatibility private" attribute that acts as private only in a sandboxed interpreter, then no code running in an unprotected interpreter would break. -- David Hopwood <david.nospam.hopwood at blueyonder.co.uk> From tjreedy at udel.edu Mon Jul 24 04:45:27 2006 From: tjreedy at udel.edu (Terry Reedy) Date: Sun, 23 Jul 2006 22:45:27 -0400 Subject: [Python-Dev] Python 2.4, VS 2005 & Profile Guided Optmization References: <AB1BB4CF81B9214682079EB55A7125EDEFA707@mapibe05.exchange.xchg> <034401c6ae98$6a0da920$d503030a@trilan> Message-ID: <ea1c88$aa4$1@sea.gmane.org> "Giovanni Bajo" <rasky at develer.com> wrote in message news:034401c6ae98$6a0da920$d503030a at trilan... > that Visual Studio Express 2005 is free forever, I would hope as well for > the decision to be reconsidered. But is it freely redistributable forever? Or even now? I have the 2003 toolkit sitting on my disk, but I am apparently not supposed to send it to anyone else. 
tjr From nyamatongwe at gmail.com Mon Jul 24 05:03:22 2006 From: nyamatongwe at gmail.com (Neil Hodgson) Date: Mon, 24 Jul 2006 13:03:22 +1000 Subject: [Python-Dev] Python 2.4, VS 2005 & Profile Guided Optmization In-Reply-To: <AB1BB4CF81B9214682079EB55A7125EDEFA707@mapibe05.exchange.xchg> References: <AB1BB4CF81B9214682079EB55A7125EDEFA707@mapibe05.exchange.xchg> Message-ID: <50862ebd0607232003gc800a48r3853bd773d3a7f21@mail.gmail.com> Trent Nelson: > I ended up playing around with Profile Guided Optimization, running > ``python.exe pystones.py'' to collect call-graph data after > python.exe/Python24.dll had been instrumented, then recompiling with the > optimizations fed back in. It'd be an idea to build a larger body of Python code to run the profiling pass on so it doesn't just optimize the sort of code in pystone which is not very representative. Could run the test suite as it would have good coverage but would hit exceptional cases too heavily. Other compilers (Intel?) support profile directed optimization so would also benefit from such a body of code. Neil From pje at telecommunity.com Mon Jul 24 05:30:24 2006 From: pje at telecommunity.com (Phillip J. Eby) Date: Sun, 23 Jul 2006 23:30:24 -0400 Subject: [Python-Dev] new security doc using object-capabilities In-Reply-To: <44C3F31D.3070201@blueyonder.co.uk> References: <5.1.1.6.0.20060723164246.04144298@sparrow.telecommunity.com> <20060723070811.GD13782@code0.codespeak.net> <bbaeab100607191535p543cb0ddj66410cf985dd9b77@mail.gmail.com> <20060722114615.GB13782@code0.codespeak.net> <44C2CE78.9020802@blueyonder.co.uk> <20060723070811.GD13782@code0.codespeak.net> <5.1.1.6.0.20060723164246.04144298@sparrow.telecommunity.com> Message-ID: <5.1.1.6.0.20060723194448.0205fa50@sparrow.telecommunity.com> At 11:07 PM 7/23/2006 +0100, David Hopwood wrote: >Phillip J. 
Eby wrote: > > At 01:00 PM 7/23/2006 -0700, Brett Cannon wrote: > > > >>I obviously don't want to change the feel of Python, but if I have to > >>remove the constructor for code objects to prevent evil bytecode or > >>__subclasses__() from object to prevent poking around stuff, then so be > >>it. For this project, security is [trumping] backwards-compatibility when > >>the latter is impossible in order to have the former. I will obviously > >>try to minimize it, but something that works at such a basic level of the > >>language is just going to require some changes for it to work. > > > > Zope 3's sandboxing machinery manages to handle securing these things > > without any language changes. So, declaring it "impossible" to manage > > without backward compatibility seems inappropriate, or at least > > incorrect. > >... if Zope's sandboxing is secure. I haven't done a security review of it, >but your argument assumes that it is. What argument is that? I'm merely suggesting that coming up with a completely new way to secure Python without a serious consideration of existing practical prior art (with many years' deployment experience on the public internet!) seems ill-advised with respect to achieving practical goals. Brett's securing_python.txt doesn't refer to or cite Zope in any way, but rather relies on broad and unsupported assertions about what can or can't be done with Python. I hope he isn't doing the same in his thesis, as this is rather like writing about one's new theory of how to have a worldwide ball-kicking contest without making any reference as to how one's theory compares with the World Cup. I'm not saying Zope is better or worse. I'm simply saying that in a business context, a failure to compare and contrast a proposed "build" solution to show how it would be better than a well-established available "buy" solution would be called something like "lack of due diligence".
I think in the academic context it might be called something like "failure to cite", but the general idea is the same, i.e., not doing your homework. :) In other words, if the solution being proposed is better than what Zope does, the appropriate thing in business is to show the reasons why, and the appropriate thing in science is to state a hypothesis regarding the differences, and then perform an experiment to either prove or disprove it. >In any case, Zope's sandboxing is not capability-based. You're right: you haven't done a review of it. :) If you had, you'd know that one proxy plus one namechecker equals one capability. In other words, you could take the restricted interpreter, the proxy mechanism, and the namechecker and leave most of the rest alone, and you'd have your capability system. Then you could focus more time and attention on the parts of the problem that Zope *doesn't* solve, instead of reinventing the ones that it already does. Now, if Brett believes that changing the Python language is a *better* way to implement capabilities than using proxies to implement them, then great. His paper should explain why, and (presumably) include experimental results to show that they're either better or worse than Zope's approach based on some criteria. The same information is relevant to Python-Dev as to what is an appropriate approach to support sandboxing in CPython. What are the advantages of a built-in approach versus an add-on approach? Are there interpreter facilities that could be added to shore up any awkward aspects of Zope's approach? (Whatever those might be.) For example, one part of Zope's approach uses a custom compiler and custom builtins in order to redefine how attribute access works in certain cases. Could these customizations be replaced with options built into the Python compiler and interpreter? What improvements would that result in? 
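To make the "one proxy plus one namechecker equals one capability" idea above concrete, here is a minimal sketch in Python. It is an illustration only and not Zope's actual proxy or checker API: the names make_capability, Capability and Account are hypothetical. A pure-Python wrapper like this is also not a real security boundary in stock CPython, since introspection can still reach the wrapped object, which is the kind of hole this thread is debating.

```python
def make_capability(target, allowed_names):
    """Return a wrapper exposing only `allowed_names` on `target`.

    Hypothetical helper for illustration; not part of Zope or CPython.
    The list of names plays the role of the "name checker", and the
    wrapper object plays the role of the "proxy".
    """
    allowed = frozenset(allowed_names)

    class Capability(object):
        __slots__ = ()  # the wrapper itself carries no instance state

        def __getattr__(self, name):
            # Only reached for names not found on the wrapper class itself.
            if name not in allowed:
                raise AttributeError("access to %r is not permitted" % name)
            return getattr(target, name)

        def __setattr__(self, name, value):
            raise AttributeError("capability objects are read-only")

    return Capability()


class Account(object):
    """Hypothetical object we want to hand out limited access to."""

    def __init__(self, balance):
        self.balance = balance

    def deposit(self, amount):
        self.balance += amount

    def withdraw(self, amount):
        self.balance -= amount


account = Account(100)
# The holder of this object can deposit, and do nothing else.
deposit_only = make_capability(account, ["deposit"])
deposit_only.deposit(25)

try:
    deposit_only.withdraw(10)
    withdraw_blocked = False
except AttributeError:
    withdraw_blocked = True
```

Whoever holds deposit_only can call deposit() but cannot reach withdraw() or balance through it; handing out a different name list yields a different capability over the same underlying object. In unmodified CPython the closure is still reachable through introspection, so this only shows the shape of the mechanism, not a secure implementation.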
Simply handwaving all of these questions away, however, with broad assertions of superiority and without even attempting to compare the new work to Zope's existing work is really not acceptable for academia OR Python development. For the record: I have no personal interest in Zope's security system. I didn't develop it and haven't had the need to use it, myself. I once reviewed some of the code and offered some minor suggestions, mainly regarding performance improvement. My only axe to grind in this matter is what I've already stated: I think it would be crazy (in the "monumental waste of resources" sense) to consider putting *any* sandboxing system into CPython without tapping the Zope team's experiences. For example: having implemented such a system, what compiler or interpreter changes would've made the job easier? Meanwhile, what Brett does or doesn't put in his thesis is between him and his advisor, but what gets put into Python shouldn't be based on ignoring the existing field experience and state of the art. From unknown_kev_cat at hotmail.com Mon Jul 24 06:13:50 2006 From: unknown_kev_cat at hotmail.com (Joe Smith) Date: Mon, 24 Jul 2006 00:13:50 -0400 Subject: [Python-Dev] Python 2.4, VS 2005 & Profile Guided Optmization References: <AB1BB4CF81B9214682079EB55A7125EDEFA707@mapibe05.exchange.xchg><034401c6ae98$6a0da920$d503030a@trilan> <646666C6-A1DB-4D2C-8054-FA9F7E2129E4@fuhm.net> Message-ID: <ea1hdl$k0v$1@sea.gmane.org> "James Y Knight" <foom at fuhm.net> wrote in message news:646666C6-A1DB-4D2C-8054-FA9F7E2129E4 at fuhm.net... > On Jul 23, 2006, at 4:41 PM, Giovanni Bajo wrote: >> I think Martin decided to keep VC71 (Visual Studio .NET 2003) for >> another >> release cycle. Given the impressive results of VC8 with PGO, and >> the fact >> that Visual Studio Express 2005 is free forever, I would hope as >> well for >> the decision to be reconsidered. > > Wasn't there a "Free Forever" 2003 edition too, which has since > completely disappeared? 
Why do you think that MS won't stop > distributing the Free Forever VS 2005 once VS 2005+1 comes out, the > same way they did the 2003 one? I am not aware of any fully free offering based on 2003. There were ways to get the command line tools, but the GUIs were not available. Most of the freely available command line utilities are still available from Microsoft if you find the correct page. I will note that according to the FAQ, Profile Guided Optimizations will not be available with Express. Because the compiler is the same, this means the tools to generate a profile are all that is missing. Express should still be able to build using the optimized profile. Microsoft as a general rule, does not go after people distributing products that Microsoft has labeled free, even after Microsoft no longer distributes that product. So the express editions will continue to be available long into the future if 2005+1 does not have a free version. (The logic behind this is that Microsoft would have a hard time explaining to a jury how somebody can "pirate" software that is available at no cost.) From unknown_kev_cat at hotmail.com Mon Jul 24 06:24:23 2006 From: unknown_kev_cat at hotmail.com (Joe Smith) Date: Mon, 24 Jul 2006 00:24:23 -0400 Subject: [Python-Dev] Python 2.4, VS 2005 & Profile Guided Optmization References: <AB1BB4CF81B9214682079EB55A7125EDEFA707@mapibe05.exchange.xchg> <50862ebd0607232003gc800a48r3853bd773d3a7f21@mail.gmail.com> Message-ID: <ea1i1e$l5c$1@sea.gmane.org> "Neil Hodgson" <nyamatongwe at gmail.com> wrote in message news:50862ebd0607232003gc800a48r3853bd773d3a7f21 at mail.gmail.com... > Trent Nelson: > >> I ended up playing around with Profile Guided Optimization, running >> ``python.exe pystones.py'' to collect call-graph data after >> python.exe/Python24.dll had been instrumented, then recompiling with the >> optimizations fed back in.
> > It'd be an idea to build a larger body of Python code to run the > profiling pass on so it doesn't just optimize the sort of code in > pystone which is not very representative. Could run the test suite as > it would have good coverage but would hit exceptional cases too > heavily. Other compilers (Intel?) support profile directed > optimization so would also benefit from such a body of code. > GCC supports profile-guided optimization. One caveat is that the profile-enabled builds use a GPL'd library. Not a problem for Python right now as it is GPL-compatible, but a caveat nonetheless. That does not apply to the final optimized app. Anyway the flags used are: "-fprofile-generate" and "-fprofile-use". From martin at v.loewis.de Mon Jul 24 06:25:46 2006 From: martin at v.loewis.de (=?ISO-8859-1?Q?=22Martin_v=2E_L=F6wis=22?=) Date: Mon, 24 Jul 2006 06:25:46 +0200 Subject: [Python-Dev] Python 2.4, VS 2005 & Profile Guided Optmization In-Reply-To: <ea1hdl$k0v$1@sea.gmane.org> References: <AB1BB4CF81B9214682079EB55A7125EDEFA707@mapibe05.exchange.xchg><034401c6ae98$6a0da920$d503030a@trilan> <646666C6-A1DB-4D2C-8054-FA9F7E2129E4@fuhm.net> <ea1hdl$k0v$1@sea.gmane.org> Message-ID: <44C44BCA.30404@v.loewis.de> Joe Smith wrote: > Microsoft as a general rule, does not go after people distributing > products that Microsoft has labeled free, even after Microsoft no > longer distributes that product. So the express editions will > continue to be available long into the future if 2005+1 does not have > a free version. Interesting. So people can do the same with the free 2003 version.
Regards, Martin From brett at python.org Mon Jul 24 06:27:50 2006 From: brett at python.org (Brett Cannon) Date: Sun, 23 Jul 2006 21:27:50 -0700 Subject: [Python-Dev] new security doc using object-capabilities In-Reply-To: <5.1.1.6.0.20060723194448.0205fa50@sparrow.telecommunity.com> References: <bbaeab100607191535p543cb0ddj66410cf985dd9b77@mail.gmail.com> <20060722114615.GB13782@code0.codespeak.net> <44C2CE78.9020802@blueyonder.co.uk> <20060723070811.GD13782@code0.codespeak.net> <5.1.1.6.0.20060723164246.04144298@sparrow.telecommunity.com> <44C3F31D.3070201@blueyonder.co.uk> <5.1.1.6.0.20060723194448.0205fa50@sparrow.telecommunity.com> Message-ID: <bbaeab100607232127p1fdb401ct12d471925453cb9b@mail.gmail.com> On 7/23/06, Phillip J. Eby <pje at telecommunity.com> wrote: > > At 11:07 PM 7/23/2006 +0100, David Hopwood wrote: > >Phillip J. Eby wrote: [snip] Brett's securing_python.txt doesn't refer to or cite Zope in any way, but > rather relies on broad and unsupported assertions about what can or can't > be done with Python. I hope he isn't doing the same in his thesis, as this > is rather like writing about one's new theory of how to have a worldwide > ball-kicking contest without making any reference as to how one's theory > compares with the World Cup. The design doc is not meant to be taken as any sort of draft of my thesis. I did read that link you sent me, Phillip, but it was hard to follow. So I used Google to find another reference that explained it to me much more clearly. securing_python.txt is meant to explain what I am planning to python-dev so that if someone sees some fatal flaw they can speak up and let me know, not as a thorough comparison of why my approach is better than any other one. I'm not saying Zope is better or worse.
I'm simply saying that in a > business context, a failure to compare and contrast a proposed "build" > solution to show how it would be better than a well-established available > "buy" solution would be called something like "lack of due diligence". I > think in the academic context it might be called something like "failure > to > cite", but the general idea is the same, i.e., not doing your > homework. :) > > In other words, if the solution being proposed is better than what Zope > does, the appropriate thing in business is to show the reasons why, and > the > appropriate thing in science is to state a hypothesis regarding the > differences, and then perform an experiment to either prove or disprove > it. I am not going to write out a blow-by-blow comparison right now. It will come with the thesis. And I am not expecting my approach or code to be checked in blindly anyway. >In any case, Zope's sandboxing is not capability-based. > > You're right: you haven't done a review of it. :) If you had, you'd know > that one proxy plus one namechecker equals one capability. In other > words, > you could take the restricted interpreter, the proxy mechanism, and the > namechecker and leave most of the rest alone, and you'd have your > capability system. Then you could focus more time and attention on the > parts of the problem that Zope *doesn't* solve, instead of reinventing the > ones that it already does. Right, but I am trying to remove the need for a namechecker which makes it an object-capabilities system. Now, if Brett believes that changing the Python language is a *better* way > to implement capabilities than using proxies to implement them, then > great. His paper should explain why, and (presumably) include > experimental > results to show that they're either better or worse than Zope's approach > based on some criteria. The same information is relevant to Python-Dev as > to what is an appropriate approach to support sandboxing in CPython. 
What > are the advantages of a built-in approach versus an add-on approach? Are > there interpreter facilities that could be added to shore up any awkward > aspects of Zope's approach? (Whatever those might be.) I think people are starting to lose sight of the purpose of the doc I wrote. It was to explain what I was doing for people to see if there was any fatal flaw and to keep people updated on what I am planning on doing. It is not meant to convince anyone that my way is the best way yet. I am not even going to attempt that until I have working code. For example, one part of Zope's approach uses a custom compiler and custom > builtins in order to redefine how attribute access works in certain > cases. Could these customizations be replaced with options built into the > Python compiler and interpreter? What improvements would that result in? Part of my point is to help alleviate the need for custom anything. Simply handwaving all of these questions away, however, with broad > assertions of superiority and without even attempting to compare the new > work to Zope's existing work is really not acceptable for academia OR > Python development. > > For the record: I have no personal interest in Zope's security system. I > didn't develop it and haven't had the need to use it, myself. I once > reviewed some of the code and offered some minor suggestions, mainly > regarding performance improvement. My only axe to grind in this matter is > what I've already stated: I think it would be crazy (in the "monumental > waste of resources" sense) to consider putting *any* sandboxing system > into > CPython without tapping the Zope team's experiences. For example: having > implemented such a system, what compiler or interpreter changes would've > made the job easier? > > Meanwhile, what Brett does or doesn't put in his thesis is between him and > his advisor, but what gets put into Python shouldn't be based on ignoring > the existing field experience and state of the art. 
There is no ignoring of anything. I understand their basic approach and I want to try another one. I like the fundamental design difference of object-capabilities and so I am going to give it a shot and see how it works out. If Zope's proxies are better, then fine, get them to contribute to the core so that we have security again. No one is proposing blind acceptance of my approach. But as it stands now, Guido likes my initial ideas (including the possible changes to the language), I like my approach, and my supervisor is willing to let me give it a shot, so that is good enough for me to move forward. There will be a thorough discussion, I am sure, once the code is finished and ready to be proposed for possible inclusion in CPython. Until then, though, just consider it my little experiment that I am just keeping people abreast of. -Brett -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/python-dev/attachments/20060723/bd0f74ea/attachment.htm From greg.ewing at canterbury.ac.nz Mon Jul 24 06:51:23 2006 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Mon, 24 Jul 2006 16:51:23 +1200 Subject: [Python-Dev] Python 2.4, VS 2005 & Profile Guided Optmization In-Reply-To: <ea1hdl$k0v$1@sea.gmane.org> References: <AB1BB4CF81B9214682079EB55A7125EDEFA707@mapibe05.exchange.xchg> <034401c6ae98$6a0da920$d503030a@trilan> <646666C6-A1DB-4D2C-8054-FA9F7E2129E4@fuhm.net> <ea1hdl$k0v$1@sea.gmane.org> Message-ID: <44C451CB.3060607@canterbury.ac.nz> Joe Smith wrote: > Microsoft as a general rule, does not go after people distributing products > that Microsoft has labeled > free, even after Microsoft no longer distributes that product. But if the licence agreement technically forbids redistribution, it doesn't seem like a good idea to rely on Microsoft turning a blind eye to that. -- Greg From pje at telecommunity.com Mon Jul 24 08:10:44 2006 From: pje at telecommunity.com (Phillip J.
Eby) Date: Mon, 24 Jul 2006 02:10:44 -0400 Subject: [Python-Dev] new security doc using object-capabilities In-Reply-To: <bbaeab100607232127p1fdb401ct12d471925453cb9b@mail.gmail.com> References: <5.1.1.6.0.20060723194448.0205fa50@sparrow.telecommunity.com> <bbaeab100607191535p543cb0ddj66410cf985dd9b77@mail.gmail.com> <20060722114615.GB13782@code0.codespeak.net> <44C2CE78.9020802@blueyonder.co.uk> <20060723070811.GD13782@code0.codespeak.net> <5.1.1.6.0.20060723164246.04144298@sparrow.telecommunity.com> <44C3F31D.3070201@blueyonder.co.uk> <5.1.1.6.0.20060723194448.0205fa50@sparrow.telecommunity.com> Message-ID: <5.1.1.6.0.20060724015521.0261d160@sparrow.telecommunity.com> At 09:27 PM 7/23/2006 -0700, Brett Cannon wrote: >On 7/23/06, Phillip J. Eby <pje at telecommunity.com> wrote: >>one proxy plus one namechecker equals one capability. In other words, >>you could take the restricted interpreter, the proxy mechanism, and the >>namechecker and leave most of the rest alone, and you'd have your >>capability system. Then you could focus more time and attention on the >>parts of the problem that Zope *doesn't* solve, instead of reinventing the >>ones that it already does. > >Right, but I am trying to remove the need for a namechecker which makes it >an object-capabilities system. As I said above: a namechecker plus a proxy *equals* an object capability. When I say "name checker" I mean the Zope type that allows you to specify a list of names that are allowed for a given object. This allowing is not based on identity or code signing or anything like that. It's just a list of attribute names: i.e. a capability mask over an existing object. When you create a proxy using this name mask, that proxy becomes a capability that allows access to the given names on the underlying object. >I like the fundamental design difference of object-capabilities It's not a difference at all, let alone a fundamental one.
Zope just happens to allow other kinds of security checking *in addition to* capabilities, if you want them. However, most of its more basic encapsulation features are 100% capability based. Meanwhile, if you want to implement an object-capability system, you will need something that is basically a mask, to allow one piece of code to create capabilities that can be given to another. What you end up with for doing that is going to look almost exactly like a Zope proxy plus a Zope name checker. I hate to harp on this point, but there seems to be a trend that when people have capabilities on their mind, they tend to look at Zope and dismiss it as not being capability-based, when in fact Zope's approach is capabilities *plus* other things. (Of course, most of those "other things" have to do with closing holes like __subclasses__, while improving performance by still allowing lots of common objects not to be proxied.) From brett at python.org Mon Jul 24 09:50:09 2006 From: brett at python.org (Brett Cannon) Date: Mon, 24 Jul 2006 00:50:09 -0700 Subject: [Python-Dev] new security doc using object-capabilities In-Reply-To: <5.1.1.6.0.20060724015521.0261d160@sparrow.telecommunity.com> References: <bbaeab100607191535p543cb0ddj66410cf985dd9b77@mail.gmail.com> <20060722114615.GB13782@code0.codespeak.net> <44C2CE78.9020802@blueyonder.co.uk> <20060723070811.GD13782@code0.codespeak.net> <5.1.1.6.0.20060723164246.04144298@sparrow.telecommunity.com> <44C3F31D.3070201@blueyonder.co.uk> <5.1.1.6.0.20060723194448.0205fa50@sparrow.telecommunity.com> <5.1.1.6.0.20060724015521.0261d160@sparrow.telecommunity.com> Message-ID: <bbaeab100607240050h142e6bbcle137ff393f454684@mail.gmail.com> On 7/23/06, Phillip J. Eby <pje at telecommunity.com> wrote: > > At 09:27 PM 7/23/2006 -0700, Brett Cannon wrote: > > When I say "name checker" I mean the Zope type that allows you to specify > a > list of names that are allowed for a given object. 
This allowing is not > based on identity or code signing or anything like that. It's just a list > of attribute names: i.e. a capability mask over an existing object. > > When you create a proxy using this name mask, that proxy becomes a > capability that allows access to the given names on the underlying object. OK, then using the term "namechecker" threw me off. >I like the fundamental design difference of object-capabilities > > It's not a difference at all, let alone a fundamental one. Zope just > happens to allow other kinds of security checking *in addition to* > capabilities, if you want them. However, most of its more basic > encapsulation features are 100% capability based. > Meanwhile, if you want to implement an object-capability system, you will > need something that is basically a mask, to allow one piece of code to > create capabilities that can be given to another. What you end up with for > doing that is going to look almost exactly like a Zope proxy plus a Zope > name checker. I hate to harp on this point, but there seems to be a trend that when > people have capabilities on their mind, they tend to look at Zope and > dismiss it as not being capability-based, when in fact Zope's approach is > capabilities *plus* other things. Well, Jim said that Zope proxies didn't conform to the strict definition of object-capabilities the last time this all came about: http://mail.python.org/pipermail/python-dev/2003-March/033884.html and http://mail.python.org/pipermail/python-dev/2003-March/033915.html . He said they *could* be made to be what object-capabilities is defined as, but they were not currently structured that way. Those comments are one of the reasons I never considered thinking of Zope proxies as an object-capabilities system. (Of course, most of those "other things" have to do with closing holes like > __subclasses__, while improving performance by still allowing lots of > common objects not to be proxied.)
> > OK, then I need something clarified. If you read http://www.zope.org/Wikis/DevSite/Projects/ComponentArchitecture/TransitionToSecurityProxies, it talks about creating the proxies. I get they restrict attribute access and wrap all returned objects in proxies themselves (unless they are considered safe). But to judge whether an attribute should be returned, it checks the security context. It also mentions how access to the security policy must be available so that proper security checks can be done to either grant or deny access. So what I want to know is if this security context is this global thing that proxies access every time to check whether something is allowed or not. Or is it a per-object specification? And what is the security domain for Zope proxies; objects, interpreter, running Python program, what? -Brett -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/python-dev/attachments/20060724/133649ce/attachment-0001.html From nmm1 at cus.cam.ac.uk Mon Jul 24 11:46:52 2006 From: nmm1 at cus.cam.ac.uk (Nick Maclaren) Date: Mon, 24 Jul 2006 10:46:52 +0100 Subject: [Python-Dev] Strategy for converting the decimal module to C Message-ID: <E1G4x1Y-0005Vy-Js@virgo.cus.cam.ac.uk> James Y Knight <foom at fuhm.net> wrote: > > > To cut a long story short, it is impractical for a language run-time > > system to call user-defined handlers with any degree of reliability > > unless the compiled code and run-time interoperate carefully - I have > > been there and done that many times, but few people still working > > have. > > On architectures with out-of-order execution (and interrupts), you > > have to assume that an interrupt may occur anywhere, even when the > > code does not use the relevant facility. Floating-point overflow > > in the middle of a list insertion? That's to be expected. > > While this _is_ a real problem, is it _not_ a general problem as you > are describing it. 
Processors are perfectly capable of generating > precise interrupts, and the inability to do so has nothing to do with > the out-of-order execution, etc. Almost all interrupts are precise. I am sorry, but this is almost totally wrong, though I agree that you will get that impression upon reading the architecture books unless you are very deeply into that area. Let's skip the hardware issues, as they aren't what I am talking about (though see [*]). I am referring to the interaction between the compiled code, deep library functions and run-time interrupt handler. It is almost universal for some deep library functions and common for compiled code to leave data structures inconsistent in a short window that "cannot possibly fail" - indeed, most system interfaces do this around system calls. If an interrupt occurs then, the run-time system will receive control with those data structures in a state where they must not be accessed. And it is fairly common for such data structures to include ones critical to the functioning of the run-time system. Now, it IS possible to write run-time systems that are safe against this, and still allow asynchronous interrupts, but I am one of three people in the world that I know have done it in the past two decades. There may be as many as six, but I doubt more, and I know of no such implementation on any Unix or Microsoft system. It is even possible to do this for compiled code, but that is where the coordination between the compiler and run-time system comes in. > The only interesting one which is not, on x86 processors, is the x87 > floating point exception, ... Er, no. Try a machine-check in a TLB miss handler. But it is all pretty irrelevant, as the problem arises with asynchronous exceptions (e.g. timer interrupts, signals from other processes), anyway. > Also, looking forward, the "simd" floating point instructions (ie mmx/ > sse/sse2/sse3) _do_ ...
The critical problems with the x87 floating-point exception were resolved in the 80386. [*] Whether or not it is a fundamental problem, it is very much a general problem at present, and it will become more so as more CPUs implement micro-threading. For why it is tied up with out-of-order execution etc., consider a system with 100 operations flying, of which 10 are memory accesses, and then consider what happens when you have combinations of floating-point exceptions, TLB misses, machine-checks (e.g. ECC problems on memory) and device/timer interrupts. Once you add user-defined handlers into that mix, you either start exposing that mess to the program or have to implement them by stopping the CPU, unwinding the pipeline, and rerunning in very, very serial mode until the handler is called. Look at IA64 .... Regards, Nick Maclaren, University of Cambridge Computing Service, New Museums Site, Pembroke Street, Cambridge CB2 3QH, England. Email: nmm1 at cam.ac.uk Tel.: +44 1223 334761 Fax: +44 1223 334679 From ncoghlan at gmail.com Mon Jul 24 13:21:22 2006 From: ncoghlan at gmail.com (Nick Coghlan) Date: Mon, 24 Jul 2006 21:21:22 +1000 Subject: [Python-Dev] new security doc using object-capabilities In-Reply-To: <bbaeab100607231300w187dfa8dx87ef0a94935e56ce@mail.gmail.com> References: <bbaeab100607191535p543cb0ddj66410cf985dd9b77@mail.gmail.com> <20060722114615.GB13782@code0.codespeak.net> <44C2CE78.9020802@blueyonder.co.uk> <20060723070811.GD13782@code0.codespeak.net> <bbaeab100607231300w187dfa8dx87ef0a94935e56ce@mail.gmail.com> Message-ID: <44C4AD32.20309@gmail.com> Brett Cannon wrote: > On 7/23/06, *Armin Rigo* <arigo at tunes.org <mailto:arigo at tunes.org>> wrote: > Also, I hate to sound self-centered, but I should point out somewhere > that PyPy was started by people who no longer wanted to maintain a fork > of CPython, and preferred to work on building CPython-like variants > automatically. 
Many of the security features you list would be quite > easier to implement and maintain in PyPy than CPython -- also from a > security perspective: it is easier to be sure that some protection is > complete, and remains complete over time, if it is systematically > generated instead of hand-patched in a dozen places. > > > It doesn't sound self-centered. =) Problem is that my knowledge base > is obviously all in CPython so my startup costs are much lower than if I > tried this in PyPy. Plus there is the point of embedding this into > Firefox (possibly) eventually. Does PyPy support embedding yet at the C > level? Another rationale for basing the work on CPython is that it should be possible to implement the resulting security model regardless of the implementation language used for the interpreter core (C/Python, Java/Python, C#/Python, RPython/Python). If you can figure out how to do it in C, it should be feasible to do it in the others. Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- http://www.boredomandlaziness.org From arigo at tunes.org Mon Jul 24 14:56:40 2006 From: arigo at tunes.org (Armin Rigo) Date: Mon, 24 Jul 2006 14:56:40 +0200 Subject: [Python-Dev] Document performance requirements? In-Reply-To: <008201c6ae5c$36b4ca40$d503030a@trilan> References: <e9qbtf$kk6$1@sea.gmane.org> <20060722111907.GA13782@code0.codespeak.net> <008201c6ae5c$36b4ca40$d503030a@trilan> Message-ID: <20060724125640.GA10183@code0.codespeak.net> Hi Giovanni, On Sun, Jul 23, 2006 at 03:30:50PM +0200, Giovanni Bajo wrote: > I'm not sure big-O tells the whole truth. For instance, do we want to allow > an implementation to use a hash table as underlying type for a list? It > would match big-O requirements, but would still be slower than a plain array > because of higher overhead of implementation (higher constant factor). 
A big-O difference can make the difference between a program that takes 0.5 seconds or 2 hours to run. This is more important than a constant factor difference, which different implementations are bound to exhibit anyway. > And if this is allowed, I would like to find in CPython tutorials and > documentations a simple statement like: "to implement the list and match its > requirements, CPython choose a simple array as underlying data structure". Yes, the big-O notes don't have to be too technical: the docs should tell people to think about Python lists as simple arrays, and the O requirements follow naturally. A bientot, Armin From amk at amk.ca Mon Jul 24 16:08:19 2006 From: amk at amk.ca (A.M. Kuchling) Date: Mon, 24 Jul 2006 10:08:19 -0400 Subject: [Python-Dev] Python sprint in Arlington July 29/30 Message-ID: <20060724140819.GA9507@rogue.amk.ca> The CanDo developers are sprinting for three days starting on this coming Friday, so there's space available for a Python sprint. I'll try to attend at least on Saturday (Sunday may not be possible for me). Does anyone want to come and work on Python stuff? If yes, please add your name to <http://wiki.python.org/moin/ArlingtonSprint>, which also has directions. Given that Python will still be in a code freeze, we'll have to be careful about code changes. I'm probably going to work on documentation issues. --amk From pje at telecommunity.com Mon Jul 24 18:32:32 2006 From: pje at telecommunity.com (Phillip J. 
Eby) Date: Mon, 24 Jul 2006 12:32:32 -0400 Subject: [Python-Dev] new security doc using object-capabilities In-Reply-To: <bbaeab100607240050h142e6bbcle137ff393f454684@mail.gmail.com> References: <5.1.1.6.0.20060724015521.0261d160@sparrow.telecommunity.com> <bbaeab100607191535p543cb0ddj66410cf985dd9b77@mail.gmail.com> <20060722114615.GB13782@code0.codespeak.net> <44C2CE78.9020802@blueyonder.co.uk> <20060723070811.GD13782@code0.codespeak.net> <5.1.1.6.0.20060723164246.04144298@sparrow.telecommunity.com> <44C3F31D.3070201@blueyonder.co.uk> <5.1.1.6.0.20060723194448.0205fa50@sparrow.telecommunity.com> <5.1.1.6.0.20060724015521.0261d160@sparrow.telecommunity.com> Message-ID: <5.1.1.6.0.20060724110644.02024180@sparrow.telecommunity.com> At 12:50 AM 7/24/2006 -0700, Brett Cannon wrote: >OK, then I need something clarified. If you read >http://www.zope.org/Wikis/DevSite/Projects/ComponentArchitecture/TransitionToSecurityProxies >, it talks about creating the proxies. I get they restrict attribute >access and wrap all returned objects in proxies themselves (unless they >are considered safe). But to judge whether an attribute should be >returned, it checks the security context. That depends on the checker. The proxy implementation delegates all access decisions to a "checker" object. Some checkers check permissions, but a NamesChecker just checks a statically-defined list of names. > It also mentions how access to the security policy must be available so > that proper security checks can be done to either grant or deny access. > >So what I want to know is if this security context is this global thing >that proxies access every time to check whether something is allowed or not. Proxies don't do that; checkers do. The default Checker implementation doesn't even look at a security context if a name is declared public (i.e., it's a NamesChecker).
Look at the zope.security.checker module for details. IOW, to make it a pure capabilities system, you would only *delete* code, not add any, as far as I can tell. > Or is it a per-object specification? Each proxy can have its own checker, but an individual checker instance can be shared between proxies. > And what is the security domain for Zope proxies; objects, interpreter, > running Python program, what? There are restricted eval and exec operations to run restricted code. The primary language limitations imposed are the lack of eval/exec by the restricted code, and lack of support for raise and try/except. Implementing these would require additional compiler hacking to add code to ensure that e.g. tracebacks get wrapped. From eswierk at arastra.com Mon Jul 24 19:20:34 2006 From: eswierk at arastra.com (Ed Swierk) Date: Mon, 24 Jul 2006 17:20:34 +0000 (UTC) Subject: [Python-Dev] setup.py and cross-compiling Message-ID: <loom.20060724T190947-141@post.gmane.org> I'm cross-compiling Python using uClibc buildroot, which installs target include files and libraries to a directory like ~/toolchain rather than /usr. I couldn't figure out any way to convince the top-level python/setup.py to look in ~/toolchain instead of /usr when detecting which modules to build. I decided to hack up setup.py so that an optional root directory (passed via an environment variable) is prepended to all the hardcoded paths like "/usr/include", "/lib", "/lib64", and so on. Am I missing an easier solution to this problem? I'm happy to post my patch to setup.py if anyone is interested.
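The setup.py change described above can be sketched roughly as follows. This is only an illustration of the idea, not the actual patch: the helper name with_sysroot and the environment variable CROSS_ROOT are made up for the example, and rewriting search paths alone does not address distutils taking its configuration from the interpreter that runs setup.py.

```python
import os
import posixpath


def with_sysroot(paths, environ=None):
    """Prefix each absolute search path with an optional target root.

    CROSS_ROOT is a hypothetical variable name chosen for this sketch.
    """
    environ = os.environ if environ is None else environ
    root = environ.get("CROSS_ROOT", "")
    if not root:
        # No target root configured: keep the native search paths.
        return list(paths)
    # Strip the leading "/" so that join() does not discard the root.
    return [posixpath.join(root, p.lstrip("/")) for p in paths]


# Example: the hardcoded directories are redirected into the toolchain tree.
search_dirs = with_sysroot(
    ["/usr/include", "/lib", "/lib64"],
    {"CROSS_ROOT": "/home/user/toolchain"},
)
```

Passing the environment as a parameter keeps the helper easy to test; a real setup.py would presumably read os.environ directly.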
--Ed From unknown_kev_cat at hotmail.com Mon Jul 24 19:35:36 2006 From: unknown_kev_cat at hotmail.com (Joe Smith) Date: Mon, 24 Jul 2006 13:35:36 -0400 Subject: [Python-Dev] Python 2.4, VS 2005 & Profile Guided Optmization References: <AB1BB4CF81B9214682079EB55A7125EDEFA707@mapibe05.exchange.xchg><034401c6ae98$6a0da920$d503030a@trilan><646666C6-A1DB-4D2C-8054-FA9F7E2129E4@fuhm.net><ea1hdl$k0v$1@sea.gmane.org> <44C451CB.3060607@canterbury.ac.nz> Message-ID: <ea30d8$64o$1@sea.gmane.org> "Greg Ewing" <greg.ewing at canterbury.ac.nz> wrote in message news:44C451CB.3060607 at canterbury.ac.nz... > Joe Smith wrote: > >> Microsoft as a general rule, does not go after people distributing >> products >> that Microsoft has labeled >> free, even after Microsoft no longer distributes that product. > > But if the licence agreement technically forbids > redistribution, it doesn't seem like a good idea > to rely on Microsoft turning a blind eye to that. I agree, although it seems very unlikely Microsoft would bother, as they are busy enough with the pirating of their for-cost software. I was a little surprised that they did not allow verbatim duplicates of that software, especially as it has the feel of freeware, or even shareware (the commercial versions have additional features). I would actually not be surprised if somebody could convince Microsoft to allow them to distribute these versions even after Microsoft no longer supports them. That would be the ideal situation.
From david.nospam.hopwood at blueyonder.co.uk Mon Jul 24 18:22:39 2006 From: david.nospam.hopwood at blueyonder.co.uk (David Hopwood) Date: Mon, 24 Jul 2006 17:22:39 +0100 Subject: [Python-Dev] new security doc using object-capabilities In-Reply-To: <5.1.1.6.0.20060723194448.0205fa50@sparrow.telecommunity.com> References: <5.1.1.6.0.20060723164246.04144298@sparrow.telecommunity.com> <20060723070811.GD13782@code0.codespeak.net> <bbaeab100607191535p543cb0ddj66410cf985dd9b77@mail.gmail.com> <20060722114615.GB13782@code0.codespeak.net> <44C2CE78.9020802@blueyonder.co.uk> <20060723070811.GD13782@code0.codespeak.net> <5.1.1.6.0.20060723164246.04144298@sparrow.telecommunity.com> <5.1.1.6.0.20060723194448.0205fa50@sparrow.telecommunity.com> Message-ID: <44C4F3CF.1060104@blueyonder.co.uk> Phillip J. Eby wrote: > At 11:07 PM 7/23/2006 +0100, David Hopwood wrote: >> Phillip J. Eby wrote: >> > At 01:00 PM 7/23/2006 -0700, Brett Cannon wrote: >> > >> >> I obviously don't want to change the feel of Python, but if I have to >> >> remove the constructor for code objects to prevent evil bytecode or >> >> __subclasses__() from object to prevent poking around stuff, then so be >> >> it. For this project, security is [trumping] backwards-compatibility when >> >> the latter is impossible in order to have the former. I will obviously >> >> try to minimize it, but something that works at such a basic level of the >> >> language is just going to require some changes for it to work. >> > >> > Zope 3's sandboxing machinery manages to handle securing these things >> > without any language changes. So, declaring it "impossible" to manage >> > without backward compatibility seems inappropriate, or at least >> > incorrect. >> >> ... if Zope's sandboxing is secure. I haven't done a security review >> of it, but your argument assumes that it is. > > What argument is that? You said "Zope 3's sandboxing machinery manages to handle securing these things without any language changes." 
This assertion assumes that Zope 3's sandboxing machinery is secure. > I'm merely suggesting that coming up with a > completely new way to secure Python without a serious consideration of > existing practical prior art (with many years' deployment experience on > the public internet!) seems ill-advised with respect to achieving > practical goals. > > Brett's securing_python.txt don't refer to or cite Zope in any way, This is indeed an omission that should be corrected, in order to explain why this project is not using Zope or following Zope's approach, and what the differences are. (I've explained some of them below.) > but rather relies on broad and unsupported assertions about what can or > can't be done with Python. I hope he isn't doing the same in his > thesis, as this is rather like writing about one's new theory of how to > have a worldwide ball-kicking contest without making any reference as to > how one's theory compares with the World Cup. > > I'm not saying Zope is better or worse. I'm simply saying that in a > business context, a failure to compare and contrast a proposed "build" > solution to show how it would be better than a well-established > available "buy" solution would be called something like "lack of due > diligence". I think in the academic context it might be called > something like "failure to cite", but the general idea is the same, > i.e., not doing your homework. :) > > In other words, if the solution being proposed is better than what Zope > does, the appropriate thing in business is to show the reasons why, and > the appropriate thing in science is to state a hypothesis regarding the > differences, and then perform an experiment to either prove or disprove it. I completely agree with this. >> In any case, Zope's sandboxing is not capability-based. > > You're right: you haven't done a review of it. :) I haven't done a detailed security review. However, I wouldn't have commented on it without knowing what its basic approach is. 
From <http://www.zope.org/Wikis/DevSite/Projects/ComponentArchitecture/TransitionToSecurityProxies>: # When an object is passed to untrusted code, it is wrapped in a security # proxy unless it is already wrapped. Security proxies mediate all accesses # to the wrapped object. Operations on security proxies return security # proxies as well. Security proxies passed from untrusted code to trusted # code remain wrapped, so untrusted code can't trick trusted code into # performing operations that the untrusted code could not perform. Restricting the actions of "trusted" code when called by "untrusted" code is precisely what a capability system does *not* do. Indeed, capability systems have no binary distinction between "trusted" and "untrusted" code (outside the system TCB, which is as small as possible); the only security distinction between protection domains is in what capabilities they hold. The main reason why capability systems do not have any such restriction is that it reduces the system's ability to support fine-grained delegation of authority. We want to be able to grant an object just the authority it needs for any particular task. For that to work, code that is otherwise untrusted must be able to use any capability it is given, even if it is a very powerful capability, and therefore must be able to call into more-trusted code without restriction. To characterise this as "tricking" the more-trusted code is a misconception: if it is not intended that a particular protection domain should have some authority, then it should not be given a capability for that authority in the first place. Capability-based security design emphasises preventing protection domains from gaining unintended capabilities; it is not about restricting their use once granted (except to the extent needed to support revocation, which is typically done via higher-level patterns rather than in the basic access control mechanisms). 
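One higher-level revocation pattern of the kind mentioned above is often described as a revocable forwarder (or "caretaker"). The following is a minimal sketch in plain Python, offered only to illustrate the shape of the idea; make_revocable and Log are hypothetical names, and stock Python does not make this secure, since closures and attributes can be inspected and a method fetched before revocation stays usable afterwards.

```python
def make_revocable(target):
    """Return (forwarder, revoke) for target; a hypothetical helper."""
    cell = [target]  # the only reference the forwarder uses

    class Forwarder:
        def __getattr__(self, name):
            # Every attribute access goes through the cell, so clearing
            # the cell cuts off all later use of the forwarder.
            if cell[0] is None:
                raise RuntimeError("capability has been revoked")
            return getattr(cell[0], name)

    def revoke():
        cell[0] = None

    return Forwarder(), revoke


class Log:
    def __init__(self):
        self.lines = []

    def append(self, text):
        self.lines.append(text)


log = Log()
log_cap, revoke_log = make_revocable(log)  # hand out log_cap, keep revoke_log
log_cap.append("first")                    # the holder uses it freely
revoke_log()                               # the grantor withdraws the authority
```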
The kind of restrictions that Zope enforces would be a significant impediment to this approach: # With care, trusted code can explicitly unwrap security proxies and gain # additional access. In particular, security proxies cannot be stored in the # Zope object database. If an object wrapped by a security proxy is to be # stored in another object, the security proxy must be removed. This is a severe limitation to expressiveness; it effectively means that references to secure objects are not first-class. In a capability system, they must be first-class. > If you had, you'd know that one proxy plus one namechecker equals one > capability. No, because Zope enforces some restrictions that are not desired in a capability system. (It is absolutely mistaken to think that a system that enforces more restrictions must be more secure, in case anyone is under that impression.) I do not know whether the easiest way to get to a capability-secure version of Python would be by changing the restrictions enforced by Zope, or by adding restrictions to CPython (or another Python implementation). My point is simply that the security model of the Zope sandboxing system as it stands, is quite different from that of a capability system. [...] > Now, if Brett believes that changing the Python language is a *better* > way to implement capabilities than using proxies to implement them, then > great. His paper should explain why, and (presumably) include > experimental results to show that they're either better or worse than > Zope's approach based on some criteria. Requiring experimental results is not appropriate for a proposal that has not been implemented. It is also not a paper; it's an informal, early-stage design outline. 
-- David Hopwood <david.nospam.hopwood at blueyonder.co.uk> From unknown_kev_cat at hotmail.com Mon Jul 24 22:08:43 2006 From: unknown_kev_cat at hotmail.com (Joe Smith) Date: Mon, 24 Jul 2006 16:08:43 -0400 Subject: [Python-Dev] Python 2.4, VS 2005 & Profile Guided Optmization References: <AB1BB4CF81B9214682079EB55A7125EDEFA707@mapibe05.exchange.xchg><50862ebd0607232003gc800a48r3853bd773d3a7f21@mail.gmail.com> <ea1i1e$l5c$1@sea.gmane.org> Message-ID: <ea39ca$7af$1@sea.gmane.org> "Joe Smith" <unknown_kev_cat at hotmail.com> wrote in message news:ea1i1e$l5c$1 at sea.gmane.org... > > "Neil Hodgson" <nyamatongwe at gmail.com> wrote in message > news:50862ebd0607232003gc800a48r3853bd773d3a7f21 at mail.gmail.com... >> Trent Nelson: >> >>> I ended up playing around with Profile Guided Optimization, running >>> ``python.exe pystones.py'' to collect call-graph data after >>> python.exe/Python24.dll had been instrumented, then recompiling with the >>> optimizations fed back in. >> >> It'd be an idea to build a larger body of Python code to run the >> profiling pass on so it doesn't just optimize the sort of code in >> pystone which is not very representative. Could run the test suite as >> it would have good coverage but would hit exceptional cases too >> heavily. Other compilers (Intel?) support profile directed >> optimization so would also benefit from such a body of code. >> > > GCC suppost profiling optimized code. Err... That was supposed to say "supports profile-optimized code". The rest of the message was indeed about Profile Guided Optimization in gcc. 
From martin at v.loewis.de Mon Jul 24 23:25:58 2006 From: martin at v.loewis.de (=?ISO-8859-1?Q?=22Martin_v=2E_L=F6wis=22?=) Date: Mon, 24 Jul 2006 23:25:58 +0200 Subject: [Python-Dev] setup.py and cross-compiling In-Reply-To: <loom.20060724T190947-141@post.gmane.org> References: <loom.20060724T190947-141@post.gmane.org> Message-ID: <44C53AE6.7040105@v.loewis.de> Ed Swierk wrote: > I decided to hack up setup.py so that an optional root directory (passed via an > environment variable) is prepended to all the hardcoded paths like > "/usr/include", "/lib", "/lib64", and so on. I doubt this solves the problem. Distutils just doesn't support cross-compilation, period. The main problem is that it is the host python that runs setup.py, not the target python. Various parts of distutils assume that the information the interpreter provides is correct, yet it is not in a cross-compilation case. I'm uncertain how to solve this; using a target Makefile and pyconfig.h instead of the installed one might be a starting point. One should then review distutils to find out where it accesses host information when target information is required, and then find a way to come up with that information. If no information is available, then distutils should fail instead of returning incorrect information. Regards, Martin From scott+python-dev at scottdial.com Tue Jul 25 00:43:57 2006 From: scott+python-dev at scottdial.com (Scott Dial) Date: Mon, 24 Jul 2006 18:43:57 -0400 Subject: [Python-Dev] Document performance requirements? In-Reply-To: <20060724125640.GA10183@code0.codespeak.net> References: <e9qbtf$kk6$1@sea.gmane.org> <20060722111907.GA13782@code0.codespeak.net> <008201c6ae5c$36b4ca40$d503030a@trilan> <20060724125640.GA10183@code0.codespeak.net> Message-ID: <44C54D2D.3070005@scottdial.com> Between the two of you, I think you have made the case that the language specification is better to not include such details. 
As you both note, it is difficult to capture the essence of what is desired from the performance of the implementation. To tack on another version of this concern, what about Big-O space concerns with things like list.sort? I'm sure there are other things to add as well. It seems reasonable to me that everyone has the same interests in mind when they write a program. Make it good, make it fast, make it small, etc. These sorts of details should work themselves out if they are actually important. All of these algorithms should be treated as implementation accidents. Having the information about CPython's implementation in the docs would be good, and would go most of the way towards having everyone on the same page. -- Scott Dial scott at scottdial.com scodial at indiana.edu From martin at v.loewis.de Tue Jul 25 01:19:25 2006 From: martin at v.loewis.de (=?UTF-8?B?Ik1hcnRpbiB2LiBMw7Z3aXMi?=) Date: Tue, 25 Jul 2006 01:19:25 +0200 Subject: [Python-Dev] setup.py and cross-compiling In-Reply-To: <c1bf1cf0607241519j157a7050l29998cecc4953485@mail.gmail.com> References: <loom.20060724T190947-141@post.gmane.org> <44C53AE6.7040105@v.loewis.de> <c1bf1cf0607241519j157a7050l29998cecc4953485@mail.gmail.com> Message-ID: <44C5557D.4010004@v.loewis.de> Ed Swierk wrote: > Well, it seems buildroot solves this main problem by building another > version of python and pygen that run on the build machine, and hacks > the Makefile to run setup.py with these instead of whatever happens to > be sitting in /usr/bin. If you think it's useful, please submit a patch to sf.net/projects/python. It would be good if you could get other people that attempt to cross-build Python to comment. A detailed howto might help to get them started with your code.
Regards, Martin From greg.ewing at canterbury.ac.nz Tue Jul 25 02:04:44 2006 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Tue, 25 Jul 2006 12:04:44 +1200 Subject: [Python-Dev] new security doc using object-capabilities In-Reply-To: <5.1.1.6.0.20060724015521.0261d160@sparrow.telecommunity.com> References: <5.1.1.6.0.20060723194448.0205fa50@sparrow.telecommunity.com> <bbaeab100607191535p543cb0ddj66410cf985dd9b77@mail.gmail.com> <20060722114615.GB13782@code0.codespeak.net> <44C2CE78.9020802@blueyonder.co.uk> <20060723070811.GD13782@code0.codespeak.net> <5.1.1.6.0.20060723164246.04144298@sparrow.telecommunity.com> <44C3F31D.3070201@blueyonder.co.uk> <5.1.1.6.0.20060723194448.0205fa50@sparrow.telecommunity.com> <5.1.1.6.0.20060724015521.0261d160@sparrow.telecommunity.com> Message-ID: <44C5601C.4070605@canterbury.ac.nz> Phillip J. Eby wrote: > When I say "name checker" I mean the Zope type that allows you to specify a > list of names that are allowed for a given object. This allowing is not > based on identity or code signing or anything like that. It's just a list > of attribute names: i.e. a capability mask over an existing object. But this is backwards from what a true object-capability system should be like if it's properly designed. Instead of starting with too-powerful objects and trying to hide some of their powers, the different powers should be separated into different objects in the first place. It sounds to me like Zope is using the approach it's using because it's having to work with Python as it currently is, not because its approach is the best one. 
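The "separate the powers up front" design described here can be contrasted with masking in a small sketch. This is an illustration under stated assumptions, not anything proposed in the thread: make_counter is a made-up example, and in stock CPython the shared state is still reachable through closure introspection, so the separation is conceptual only.

```python
def make_counter():
    """Return separate (read, increment) capabilities over one hidden count."""
    count = [0]

    def read():
        # Holder of this function can observe the count but not change it.
        return count[0]

    def increment():
        # Holder of this function can change the count but not observe it.
        count[0] += 1

    return read, increment


read, increment = make_counter()
increment()
increment()
current = read()
```

Each power is its own object from the start, so granting "read only" means handing over read and nothing else; no mask over a more powerful object is involved.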
-- Greg From greg.ewing at canterbury.ac.nz Tue Jul 25 02:14:34 2006 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Tue, 25 Jul 2006 12:14:34 +1200 Subject: [Python-Dev] Strategy for converting the decimal module to C In-Reply-To: <E1G4x1Y-0005Vy-Js@virgo.cus.cam.ac.uk> References: <E1G4x1Y-0005Vy-Js@virgo.cus.cam.ac.uk> Message-ID: <44C5626A.8080306@canterbury.ac.nz> Nick Maclaren wrote: > Er, no. Try a machine-check in a TLB miss handler. But it is all > pretty irrelevant, as the problem arises with asynchronous exceptions > (e.g. timer interrupts, signals from other processes), anyway. But we weren't talking about asynchronous exceptions, we were talking about floating point exceptions. Unless your TLB miss handler uses floating point arithmetic, there's no way it can get interrupted by one. (And if it does use floating point arithmetic in a way that can cause an exception, you'd better write it to deal with that!) -- Greg From pje at telecommunity.com Tue Jul 25 03:19:33 2006 From: pje at telecommunity.com (Phillip J. Eby) Date: Mon, 24 Jul 2006 21:19:33 -0400 Subject: [Python-Dev] new security doc using object-capabilities In-Reply-To: <44C5601C.4070605@canterbury.ac.nz> References: <5.1.1.6.0.20060724015521.0261d160@sparrow.telecommunity.com> <5.1.1.6.0.20060723194448.0205fa50@sparrow.telecommunity.com> <bbaeab100607191535p543cb0ddj66410cf985dd9b77@mail.gmail.com> <20060722114615.GB13782@code0.codespeak.net> <44C2CE78.9020802@blueyonder.co.uk> <20060723070811.GD13782@code0.codespeak.net> <5.1.1.6.0.20060723164246.04144298@sparrow.telecommunity.com> <44C3F31D.3070201@blueyonder.co.uk> <5.1.1.6.0.20060723194448.0205fa50@sparrow.telecommunity.com> <5.1.1.6.0.20060724015521.0261d160@sparrow.telecommunity.com> Message-ID: <5.1.1.6.0.20060724211442.0262c008@sparrow.telecommunity.com> At 12:04 PM 7/25/2006 +1200, Greg Ewing wrote: >Phillip J.
Eby wrote: > > > When I say "name checker" I mean the Zope type that allows you to > specify a > > list of names that are allowed for a given object. This allowing is not > > based on identity or code signing or anything like that. It's just a list > > of attribute names: i.e. a capability mask over an existing object. > >But this is backwards from what a true object-capability >system should be like if it's properly designed. Instead >of starting with too-powerful objects and trying to >hide some of their powers, the different powers should >be separated into different objects in the first place. And what about code that needs to pass on a subset of a capability? You need the ability to create such capability-restricted subsets anyway, no matter how "pure" a system you start with. And being able to create capability masks for existing objects means you don't have to redesign every piece of code ever written for Python to make it secure. >It sounds to me like Zope is using the approach it's >using because it's having to work with Python as it >currently is, not because its approach is the best one. Well, that depends a lot on how you define "best". Practicality beats purity, doesn't it? 
;) From david.nospam.hopwood at blueyonder.co.uk Tue Jul 25 02:36:48 2006 From: david.nospam.hopwood at blueyonder.co.uk (David Hopwood) Date: Tue, 25 Jul 2006 01:36:48 +0100 Subject: [Python-Dev] new security doc using object-capabilities In-Reply-To: <5.1.1.6.0.20060724211442.0262c008@sparrow.telecommunity.com> References: <5.1.1.6.0.20060724015521.0261d160@sparrow.telecommunity.com> <5.1.1.6.0.20060723194448.0205fa50@sparrow.telecommunity.com> <bbaeab100607191535p543cb0ddj66410cf985dd9b77@mail.gmail.com> <20060722114615.GB13782@code0.codespeak.net> <44C2CE78.9020802@blueyonder.co.uk> <20060723070811.GD13782@code0.codespeak.net> <5.1.1.6.0.20060723164246.04144298@sparrow.telecommunity.com> <44C3F31D.3070201@blueyonder.co.uk> <5.1.1.6.0.20060723194448.0205fa50@sparrow.telecommunity.com> <5.1.1.6.0.20060724015521.0261d160@sparrow.telecommunity.com> <5.1.1.6.0.20060724211442.0262c008@sparrow.telecommunity.com> Message-ID: <44C567A0.2080906@blueyonder.co.uk> Phillip J. Eby wrote: > At 12:04 PM 7/25/2006 +1200, Greg Ewing wrote: >>Phillip J. Eby wrote: >> >>>When I say "name checker" I mean the Zope type that allows you to specify >>>a list of names that are allowed for a given object. This allowing is not >>>based on identity or code signing or anything like that. It's just a list >>>of attribute names: i.e. a capability mask over an existing object. >> >>But this is backwards from what a true object-capability >>system should be like if it's properly designed. Instead >>of starting with too-powerful objects and trying to >>hide some of their powers, the different powers should >>be separated into different objects in the first place. > > And what about code that needs to pass on a subset of a capability? You > need the ability to create such capability-restricted subsets anyway, no > matter how "pure" a system you start with. That is true, but doing so for every secure object has significant costs. 
A simple wrapper is not sufficient, because it would not prevent a wrapped object from returning a reference to itself, bypassing the wrapper. To solve this problem you need to use the more complex Membrane pattern, which also wraps the results of method calls on a wrapped object, for example. In fact Zope's approach does implement a membrane, but this does not really dent the argument that Greg Ewing was making. A pure capability system incurs the complexity and performance costs of wrappers or membranes only in cases where they are needed, not for every object, and the complexity is only in user code, not in the system's security kernel. -- David Hopwood <david.nospam.hopwood at blueyonder.co.uk> From nnorwitz at gmail.com Tue Jul 25 06:20:40 2006 From: nnorwitz at gmail.com (Neal Norwitz) Date: Mon, 24 Jul 2006 21:20:40 -0700 Subject: [Python-Dev] outstanding bugs to fix for 2.5 Message-ID: <ee2a432c0607242120o5365d6b4w2a8b200253686ea6@mail.gmail.com> There are still a bunch of outstanding bugs. rc1 is about a week away and it would be great to fix these. Many of these are also present in 2.4, but it would be nice to squash them in 2.5. Here's the list from PEP 356: http://python.org/sf/1526585 - SystemError concat long strings (2.4) http://python.org/sf/1523610 - PyArg_ParseTupleAndKeywords potential core dump (2.4) http://python.org/sf/1521947 - mystrtol.c fails with gcc 4.1 (2.4?) 
test_compile.test_unary_minus http://python.org/sf/1519025 - socket timeout crash when receive signal (2.4) http://python.org/sf/1517042 - Fix crashers/gc_inspection.py (2.4) http://python.org/sf/1515471 - stringobject (char buffers) http://python.org/sf/1513611 - XML: xml.sax.expatreader missing http://python.org/sf/1511497 - XML: xml.sax.ParseException issue http://python.org/sf/1475523 - gettext.py bug http://python.org/sf/1467929 - %-formatting and dicts (2.4) http://python.org/sf/1333982 - AST http://python.org/sf/1191458 - AST (test_trace issue mentioned below) It would be great to fix *all* of these. In this list, at least 3 (4?) can cause segfaults, and #1521947 can cause incorrect results. n From nnorwitz at gmail.com Tue Jul 25 07:50:46 2006 From: nnorwitz at gmail.com (Neal Norwitz) Date: Mon, 24 Jul 2006 22:50:46 -0700 Subject: [Python-Dev] remaining issues from Klocwork static analysis Message-ID: <ee2a432c0607242250v5a7ac021p464b68c5c47ec538@mail.gmail.com> I've fixed most of the problems (or determined they weren't problems) from all the warnings issued by Klocwork's static analysis tool. The following are outstanding issues. This first group looks like real problems to me: # 74 Object/funcobject.c:143 Suspicious deref of ptr before NULL check #169 Modules/threadmodule.c:497 Memory Leak # 28 Modules/_sre.c:987 Array Index Out of Bounds Buffer overflow, array index of 'mark' may be outside the bounds. Array 'mark' of size 200 declared at sre.h:77 may use index values 0..536870911. Also there are 3 similar errors on lines 1006, 1225, 1237. (Try limiting mark on line 589?) #174 Modules/unicodedata.c:432 Array Index Out of Bounds Buffer overflow, array index of 'decomp_prefix' may be outside the bounds. Array 'decomp_prefix' of size 18 declared at unicodedata_db.h:529 may use index values 18..255. Also there is one similar error on line 433. # 36 Modules/cPickle.c:3404 Memory Leak Memory leak. 
Dynamic memory stored in 's' allocated through function 'pystrndup' at line 3384 is lost at line 3404. s should not be freed on line 3407, but earlier. PDATA_PUSH can return on error and s will not be freed. # 61 Modules/_sqlite/cursor.c:599 Null pointer may be dereferenced Null pointer 'self->statement' that comes from line 674 may be dereferenced by passing argument 1 to function 'statement_mark_dirty' at line 599. Most of these seem suspect. I'm not so sure about them, but I haven't looked into some at all. Let me know if you want the details for any of these or if you can provide an analysis to demonstrate they are incorrect. Null pointer may be dereferenced Python/ast.c:641 Null pointer may be dereferenced Python/ast.c:656 Ptr will be derefed after it was positively checked for NULL Python/compile.c:3020 Null pointer may be passed to function that may dereference it Python/compile.c:4459 Array Index Out of Bounds Modules/_sre.c:987 Array Index Out of Bounds Object/longobject.c:1787 Array Index Out of Bounds Object/longobject.c:2475 Array Index Out of Bounds Python/sysmodule.c:1016 Array Index Out of Bounds Python/getpath.c:285 Buffer Overflow - Non-null Terminated String Python/getpath.c:432 Unvalidated User Input Buffer Overflow-Non-Null Terminated String Python/getpath.c:431 Unvalidated User Input Buffer Overflow-Non-Null Terminated String Python/getpath.c:496 Unvalidated User Input Buffer Overflow-Non-Null Terminated String Python/getpath.c:497 Let me know if you want more info about any particular report. It would be great to have some help and fix these. 
n From martin at v.loewis.de Tue Jul 25 07:54:24 2006 From: martin at v.loewis.de (=?ISO-8859-1?Q?=22Martin_v=2E_L=F6wis=22?=) Date: Tue, 25 Jul 2006 07:54:24 +0200 Subject: [Python-Dev] outstanding bugs to fix for 2.5 In-Reply-To: <ee2a432c0607242120o5365d6b4w2a8b200253686ea6@mail.gmail.com> References: <ee2a432c0607242120o5365d6b4w2a8b200253686ea6@mail.gmail.com> Message-ID: <44C5B210.6020902@v.loewis.de> Neal Norwitz wrote: > http://python.org/sf/1513611 - XML: xml.sax.expatreader missing > > It would be great to fix *all* of these. In this list, at least 3 > (4?) can cause segfaults, and #1521947 can cause incorrect results. IMO, 1513611 should block the release, since it's a regression from 2.4 that may break many SAX applications, and perhaps also DOM applications. Regards, Martin From martin at v.loewis.de Tue Jul 25 09:40:44 2006 From: martin at v.loewis.de (=?ISO-8859-1?Q?=22Martin_v=2E_L=F6wis=22?=) Date: Tue, 25 Jul 2006 09:40:44 +0200 Subject: [Python-Dev] remaining issues from Klocwork static analysis In-Reply-To: <ee2a432c0607242250v5a7ac021p464b68c5c47ec538@mail.gmail.com> References: <ee2a432c0607242250v5a7ac021p464b68c5c47ec538@mail.gmail.com> Message-ID: <44C5CAFC.80004@v.loewis.de> Neal Norwitz wrote: > # 74 Object/funcobject.c:143 Suspicious deref of ptr before NULL check Not quite sure what it is complaining about, but else if (PyTuple_Check(closure)) { Py_XINCREF(closure); } looks indeed suspicious: Why do we check for NULL (XINCREF) when we know closure can't be NULL (Tuple_Check). Drop the X, and see if the warning goes away > #169 Modules/threadmodule.c:497 Memory Leak Does it say what memory is leaking? Perhaps it complains about boot not being released if ident is not -1, however, in that case, t_bootstrap will release the memory. > # 28 Modules/_sre.c:987 Array Index Out of Bounds > > Buffer overflow, array index of 'mark' may be outside the > bounds. Array 'mark' of size 200 declared at sre.h:77 may use > index values 0..536870911. 
Also there are 3 similar errors on > lines 1006, 1225, 1237. (Try limiting mark on line 589?) ISTM that SRE has a limit of 100 MARK opcodes, meaning a maximum of 100 groups per expression (so you need 200 mark pointers). This can't overrun as sre_compile refuses to compile expressions with more groups. Of course, a malicious application could craft the opcodes itself (bypassing sre_compile), in which case you could get a buffer overrun. The right solution is to have a dynamic marks array. > #174 Modules/unicodedata.c:432 Array Index Out of Bounds > > Buffer overflow, array index of 'decomp_prefix' may be outside the > bounds. Array 'decomp_prefix' of size 18 declared at > unicodedata_db.h:529 may use index values 18..255. Also there is one > similar error on line 433. This limit is enforced by Tools/unicode/makeunicodedata.py. There are only 18 decomposition prefixes at the moment, yet we use 8 bits for the decomposition prefix (makeunicodedata checks that prefix < 256) Looking at the code, I now wonder why decomp_data can't be "unsigned short", instead of "unsigned int" (the upper byte is the decomposition length, and it can't be larger than 256, either). > # 36 Modules/cPickle.c:3404 Memory Leak > > Memory leak. Dynamic memory stored in 's' allocated through > function 'pystrndup' at line 3384 is lost at line 3404. > > s should not be freed on line 3407, but earlier. > PDATA_PUSH can return on error and s will not be freed. Correct. We should not use macros with embedded return statements. > # 61 Modules/_sqlite/cursor.c:599 Null pointer may be dereferenced > > Null pointer 'self->statement' that comes from line 674 may be > dereferenced by passing argument 1 to function > 'statement_mark_dirty' at line 599. Looks like a problem. Maybe a break is missing after line 674? 
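[Editorial sketch] The decomp_prefix packing discussed above can be illustrated in a few lines. This is a rough sketch, assuming (as described in this message) that the low 8 bits of a header word select the decomposition prefix and the bits above them hold the decomposition length. `split_header` and `checked_prefix_index` are hypothetical helper names, not functions in unicodedata.c, and the sample words 257, 514 and 259 are the values quoted from unicodedata_db.h elsewhere in this thread.

```python
# Size of the decomp_prefix table, as quoted in the Klocwork report.
DECOMP_PREFIX_COUNT = 18


def split_header(word):
    """Split a packed header word into (length, prefix_index)."""
    length = word >> 8          # upper part: decomposition length
    prefix_index = word & 255   # low byte: index into decomp_prefix
    return length, prefix_index


def checked_prefix_index(word, table_size=DECOMP_PREFIX_COUNT):
    """Return the prefix index, refusing values beyond the table."""
    _, prefix_index = split_header(word)
    if prefix_index >= table_size:
        raise IndexError(
            "prefix index %d outside table of size %d"
            % (prefix_index, table_size)
        )
    return prefix_index
```

The mask alone allows any value from 0 to 255, which is what the static analyzer sees; the table is only safe because the generator script emits header words whose low byte is below 18. A word such as 32 (a code point, not a header) would fail the check, which is why only header positions may be used as an index.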
Regards, Martin From nmm1 at cus.cam.ac.uk Tue Jul 25 11:06:08 2006 From: nmm1 at cus.cam.ac.uk (Nick Maclaren) Date: Tue, 25 Jul 2006 10:06:08 +0100 Subject: [Python-Dev] Strategy for converting the decimal module to C Message-ID: <E1G5Irg-0003Zi-9b@libra.cus.cam.ac.uk> Greg Ewing <greg.ewing at canterbury.ac.nz> wrote: > > But we weren't talking about asynchronous exceptions, > we were talking about floating point exceptions. Unless > your TLB miss handler uses floating point arithmetic, > there's no way it can get interrupted by one. (And if > it does use floating point arithmetic in a way that > can cause an exception, you'd better write it to deal > with that!) I am really not getting my message across, am I? Yes, that is true - as far as it goes. The trouble is that designing systems based on assuming that IS true as far as it goes means that they don't work when it goes further. And it does. Here are a FEW of the many examples of where the simplistic model is likely to fail in an x86 context: The compiled code has made a data structure temporarily inconsistent because the operation is safe (say, list insertion), and then gets an asynchronous interrupt (e.g. SIGINT). The SIGINT handler does some operation (e.g. I/O) that implicitly uses floating-point, which then interrupts. The x86 architecture is extended to include out-of-order floating-point as it had in the past, many systems have today, and is very likely to happen in the future. It is one of the standard ways to get better performance, after all, and is on the increase. The x86 architecture is extended to support micro-threading. I have not been told by Intel or AMD that either have such plans, but I have very good reason to believe that both have such projects. IBM and Sun certainly do, though I don't know if IBM's is/are relevant. Regards, Nick Maclaren, University of Cambridge Computing Service, New Museums Site, Pembroke Street, Cambridge CB2 3QH, England.
Email: nmm1 at cam.ac.uk Tel.: +44 1223 334761 Fax: +44 1223 334679 From g.brandl at gmx.net Tue Jul 25 11:57:23 2006 From: g.brandl at gmx.net (Georg Brandl) Date: Tue, 25 Jul 2006 11:57:23 +0200 Subject: [Python-Dev] remaining issues from Klocwork static analysis In-Reply-To: <44C5CAFC.80004@v.loewis.de> References: <ee2a432c0607242250v5a7ac021p464b68c5c47ec538@mail.gmail.com> <44C5CAFC.80004@v.loewis.de> Message-ID: <ea4pro$ere$1@sea.gmane.org> Martin v. Löwis wrote: > Neal Norwitz wrote: >> # 74 Object/funcobject.c:143 Suspicious deref of ptr before NULL check > > Not quite sure what it is complaining about, but > > else if (PyTuple_Check(closure)) { > Py_XINCREF(closure); > } > > looks indeed suspicious: Why do we check for NULL (XINCREF) when > we know closure can't be NULL (Tuple_Check). Drop the X, and see > if the warning goes away In comparison, the PyFunction_SetDefaults function does check for NULL, and raises an error in this case. However, since it is a C API function only, passing NULL is an error anyway. Georg From martin at v.loewis.de Tue Jul 25 21:44:12 2006 From: martin at v.loewis.de (=?ISO-8859-1?Q?=22Martin_v=2E_L=F6wis=22?=) Date: Tue, 25 Jul 2006 21:44:12 +0200 Subject: [Python-Dev] More tracker demos online Message-ID: <44C6748C.4060701@v.loewis.de> Currently, we have two running tracker demos online: Roundup: http://efod.se/python-tracker/ Jira: http://jira.python.atlassian.com/secure/Dashboard.jspa These installations are in various forms of demo mode and "pre-release" (meaning that the configuration is still not complete). They both use the sample data that Fredrik Lundh produced at some point, so don't be surprised that they are behind SF wrt. content. While these might not be in the final form of operation, I think users should already try to use them, to find out which one they like best.
Discussions/Comments can be sent to infrastructure at python.org, however, for reports/reviews, please use the Wiki at http://wiki.python.org/moin/CallForTrackers You'll notice that it also lists Trac and Malone, however, it seems that there is no progress on importing SF data into these. Regards, Martin From kiko at async.com.br Tue Jul 25 22:38:36 2006 From: kiko at async.com.br (Christian Robottom Reis) Date: Tue, 25 Jul 2006 17:38:36 -0300 Subject: [Python-Dev] More tracker demos online In-Reply-To: <44C6748C.4060701@v.loewis.de> References: <44C6748C.4060701@v.loewis.de> Message-ID: <20060725203836.GA1542@anthem.async.com.br> On Tue, Jul 25, 2006 at 09:44:12PM +0200, "Martin v. Löwis" wrote: > You'll notice that it also lists Trac and Malone, however, > it seems that there is no progress on importing SF data > into these. Actually, James Henstridge has been working on an import into Launchpad (Malone is the codename for the bugtracker component of it) over the last week. We have a demo site up at: https://demo.launchpad.net/products/python/+bugs Note that we're still ironing out some of the kinks with the import and the code running there, so there will be changes before the official announcement. -- Christian Robottom Reis | http://async.com.br/~kiko/ | [+55 16] 3376 0125 From grig at gheorghiu.net Fri Jul 21 02:20:58 2006 From: grig at gheorghiu.net (Grig Gheorghiu) Date: Thu, 20 Jul 2006 17:20:58 -0700 (PDT) Subject: [Python-Dev] Community buildbots -- reprise Message-ID: <20060721002058.66596.qmail@web54515.mail.yahoo.com> Hi, This message is in response to Glyph's plea (<http://mail.python.org/pipermail/python-dev/2006-July/067366.html>). Here's what Glyph said: "I would like to propose, although I certainly don't have time to implement, a program by which Python-using projects could contribute buildslaves which would run their projects' tests with the latest Python trunk.
This would provide two useful incentives: Python code would gain a reputation as generally well-tested (since there is a direct incentive to write tests for your project: get notified when core python changes might break it), and the core developers would have instant feedback when a "small" change breaks more code than it was expected to." I'm volunteering to organize this effort, if there is enough interest on this list. In fact, I've done some prep work already: * got a domain name: pybots.org * got a $47/month Ubuntu-based VPS from JohnCompanies.com (root access and everything); it's available at master.pybots.org, and it's ready to be configured as a buildmaster for the pybots * got a mailing list: pybots at lists2.idyll.org I can start configuring the Ubuntu machine as a buildmaster, and I can also add a buildslave on the same machine that will check out the latest Python trunk code, build it, then run the automated tests for a sample project -- let's say for Twisted, since Glyph was the one requesting this. This will also serve as a sample buildslave for other people who will be interested in running buildslaves for their own projects. Apart from the goals stated by Glyph, I see this as a very valuable effort in convincing people of the value of automated tests, Python-related or not. A secondary effect I'd like to see would be for these suites of tests to be invoked in a standard fashion -- maybe 'python setup.py test'. If PSF can contribute some $$$ towards the hosting of the master server, that would be appreciated, but not required. All that's required is enough interest from the community. Please let me know if you're interested. Grig ---- http://agiletesting.blogspot.com From ndbecker2 at gmail.com Fri Jul 21 13:59:56 2006 From: ndbecker2 at gmail.com (Neal Becker) Date: Fri, 21 Jul 2006 07:59:56 -0400 Subject: [Python-Dev] Document performance requirements?
In-Reply-To: <44C0BF4C.1010408@gmail.com> References: <e9qbtf$kk6$1@sea.gmane.org> <44C0BF4C.1010408@gmail.com> Message-ID: <200607210759.56595.ndbecker2@gmail.com> On Friday 21 July 2006 7:49 am, Nick Coghlan wrote: > Neal Becker wrote: > > For a recent project I needed to select a container. There are plenty of > > python data structures to choose from. It seems that information on > > performance is missing (or not easy to find). > > > > I think Python should include performance in the documentation of common > > data structures to help users select the appropriate types. Something in > > the style of c++ STL. > > Do you mean absolute performance, or do you mean algorithmic order > guarantees? I thought the latter were already documented. . . > The latter. Where are they documented? From eswierk at arastra.com Tue Jul 25 00:19:46 2006 From: eswierk at arastra.com (Ed Swierk) Date: Mon, 24 Jul 2006 15:19:46 -0700 Subject: [Python-Dev] setup.py and cross-compiling In-Reply-To: <44C53AE6.7040105@v.loewis.de> References: <loom.20060724T190947-141@post.gmane.org> <44C53AE6.7040105@v.loewis.de> Message-ID: <c1bf1cf0607241519j157a7050l29998cecc4953485@mail.gmail.com> On 7/24/06, "Martin v. Löwis" <martin at v.loewis.de> wrote: > The main problem is that it is the host python that runs setup.py, > not the target python. Various parts of distutils assume that the > information the interpreter provides is correct, yet it is not > in a cross-compilation case. Well, it seems buildroot solves this main problem by building another version of python and pygen that run on the build machine, and hacks the Makefile to run setup.py with these instead of whatever happens to be sitting in /usr/bin. Thus the modules that do get built seem to work just fine, but setup.py itself ignores all this careful hackery when determining what modules to build and configuring distutils to search for system libraries in various dynamically-discovered paths.
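[Editorial sketch] The underlying issue, that a build script only sees facts about the interpreter executing it, can be shown with a short example. This is a hedged sketch, not what setup.py or distutils actually does: `describe_running_interpreter` is a hypothetical helper, and it uses the `sysconfig` module from current Python releases, which did not exist when this thread was written.

```python
import sys
import sysconfig


def describe_running_interpreter():
    """Collect facts about the interpreter that is executing this script.

    In a cross-build these describe the build machine, not the target,
    which is why decisions derived from them can be wrong for the target.
    """
    return {
        "executable": sys.executable,
        "platform": sys.platform,
        "build_platform": sysconfig.get_platform(),
        "byteorder": sys.byteorder,
    }


info = describe_running_interpreter()
for key in sorted(info):
    print("%s: %s" % (key, info[key]))
```

Every value printed comes from the machine running the script, so a cross-compiling build has to obtain the target's settings some other way instead of trusting these.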
--Ed From nyamatongwe at gmail.com Wed Jul 26 02:10:07 2006 From: nyamatongwe at gmail.com (Neil Hodgson) Date: Wed, 26 Jul 2006 10:10:07 +1000 Subject: [Python-Dev] More tracker demos online In-Reply-To: <44C6748C.4060701@v.loewis.de> References: <44C6748C.4060701@v.loewis.de> Message-ID: <50862ebd0607251710p454d3cc1s84622aee45774c77@mail.gmail.com> Martin v. Löwis: > Currently, we have two running tracker demos online: After playing with them for 30 minutes, Jira seems to have too busy an interface and finicky behaviour: not liking the back button sometimes (similar to SF) and clicking on diffs wants to download them rather than view them. It's disappointing that Jira and Launchpad use different bug IDs as continuity should be maintained with the SF bug IDs which will be referred to in other areas such as commit messages. They do include the SF bug ID (as a field in Jira and a nickname in Launchpad) but this makes it harder to navigate between related bugs. I mostly looked at "os.startfile() still doesn't work with Unicode filenames" and I would have tagged the patch on SF with a "looks OK to me" if SF was working. The text in Launchpad was a bit sparsely formatted for me so I would like to see whether individual users can choose a different style. The others are OK although Roundup is clearer. Neil From brett at python.org Wed Jul 26 02:32:22 2006 From: brett at python.org (Brett Cannon) Date: Tue, 25 Jul 2006 17:32:22 -0700 Subject: [Python-Dev] More tracker demos online In-Reply-To: <50862ebd0607251710p454d3cc1s84622aee45774c77@mail.gmail.com> References: <44C6748C.4060701@v.loewis.de> <50862ebd0607251710p454d3cc1s84622aee45774c77@mail.gmail.com> Message-ID: <bbaeab100607251732u5771b6fet4b6e5a949297f8f9@mail.gmail.com> On 7/25/06, Neil Hodgson <nyamatongwe at gmail.com> wrote: > > Martin v.
Löwis: > > > Currently, we have two running tracker demos online: > > After playing with them for 30 minutes, Jira seems to have too busy > an interface and finicky behaviour: not liking the back button > sometimes (similar to SF) and clicking on diffs wants to download them > rather than view them. It's disappointing that Jira and Launchpad use > different bug IDs as continuity should be maintained with the SF bug > IDs which will be referred to in other areas such as commit messages. Stuff like continuity in bug numbers and such can be fixed in the official tracker upon launch. More important is interface and general usage. And as Martin said, if you wish to discuss, please do so on the infrastructure mailing list. -Brett From nnorwitz at gmail.com Wed Jul 26 05:46:13 2006 From: nnorwitz at gmail.com (Neal Norwitz) Date: Tue, 25 Jul 2006 20:46:13 -0700 Subject: [Python-Dev] Community buildbots -- reprise In-Reply-To: <20060721002058.66596.qmail@web54515.mail.yahoo.com> References: <20060721002058.66596.qmail@web54515.mail.yahoo.com> Message-ID: <ee2a432c0607252046p7a6c91a3gf1451fd1f2f3ed55@mail.gmail.com> If you want I can send you the build master cfg I setup on python.org and some simple instructions for how to connect to it. I don't have time to focus on this at the moment and probably won't until 2.5 is out. n -- On 7/20/06, Grig Gheorghiu <grig at gheorghiu.net> wrote: > Hi, > > This message is in response to Glyph's plea > (<http://mail.python.org/pipermail/python-dev/2006-July/067366.html>). > > Here's what Glyph said: > > "I would like to propose, although I certainly don't have time to > implement, a program by which Python-using projects could contribute > buildslaves which would run their projects' tests with the latest > Python trunk.
This would provide two useful incentives: Python code > would gain a reputation as generally well-tested (since there is a > direct incentive to write tests for your project: get notified when > core python changes might break it), and the core developers would have > instant feedback when a "small" change breaks more code than it was > expected to." > > > I'm volunteering to organize this effort, if there is enough interest > on this list. In fact, I've done some prep work already: > > * got a domain name: pybots.org > * got a $47/month Ubuntu-based VPS from JohnCompanies.com (root access > and everything); it's available at master.pybots.org, and it's ready to > be configured as a buildmaster for the pybots > * got a mailing list: pybots at lists2.idyll.org > > I can start configuring the Ubuntu machine as a buildmaster, and I can > also add a buildslave on the same machine that will check out the > latest Python trunk code, build it, then run the automated tests for a > sample project -- let's say for Twisted, since Glyph was the one > requesting this. This will also serve as a sample buildslave for other > people who will be interested in running buildslaves for their own > projects. > > Apart from the goals stated by Glyph, I see this as a very valuable > effort in convincing people of the value of automated tests, > Python-related or not. A secondary effect I'd like to see would be for > these suites of tests to be invoked in a standard fashion -- maybe > 'python setup.py test'. > > If PSF can contribute some $$$ towards the hosting of the master > server, that would be appreciated, but not required. All that's > required is enough interest from the community. > > Please let me know if you're interested.
> > Grig > > ---- > http://agiletesting.blogspot.com From nnorwitz at gmail.com Wed Jul 26 06:47:08 2006 From: nnorwitz at gmail.com (Neal Norwitz) Date: Tue, 25 Jul 2006 21:47:08 -0700 Subject: [Python-Dev] remaining issues from Klocwork static analysis In-Reply-To: <44C5CAFC.80004@v.loewis.de> References: <ee2a432c0607242250v5a7ac021p464b68c5c47ec538@mail.gmail.com> <44C5CAFC.80004@v.loewis.de> Message-ID: <ee2a432c0607252147u6e3e4b37i1a77eb8114ac7349@mail.gmail.com> On 7/25/06, "Martin v. Löwis" <martin at v.loewis.de> wrote: > Neal Norwitz wrote: > > # 74 Object/funcobject.c:143 Suspicious deref of ptr before NULL check > > Not quite sure what it is complaining about, but > > else if (PyTuple_Check(closure)) { > Py_XINCREF(closure); > } > > looks indeed suspicious: Why do we check for NULL (XINCREF) when > we know closure can't be NULL (Tuple_Check). Drop the X, and see > if the warning goes away Yes, I definitely think dropping the X would make the warning go away. Do we want to check for a NULL pointer and raise an exception? The docs don't address the issue, so I think if we added a check, i.e.: if (closure && PyTuple_Check(closure)) and got rid of the X that would be fine as well. > > #169 Modules/threadmodule.c:497 Memory Leak > > Does it say what memory is leaking? Perhaps it complains about > boot not being released if ident is not -1, however, in that case, > t_bootstrap will release the memory. I believe you are right, I never traced through t_bootstrap. I think this is a false positive. There is some memory being leaked on thread creation as reported by valgrind IIRC. This doesn't seem to be it though.
> > #174 Modules/unicodedata.c:432 Array Index Out of Bounds > > > > Buffer overflow, array index of 'decomp_prefix' may be outside the > > bounds. Array 'decomp_prefix' of size 18 declared at > > unicodedata_db.h:529 may use index values 18..255. Also there is one > > similar error on line 433. > > This limit is enforced by Tools/unicode/makeunicodedata.py. There are > only 18 decomposition prefixes at the moment, yet we use 8 bits for > the decomposition prefix (makeunicodedata checks that prefix < 256) Just to make sure I understand. The code in question is accessing decomp_prefix like this: decomp_prefix[decomp_data[index] & 255] So decomp_prefix will be accessed with the result of: decomp_data[index] & 255 The first line of data (from unicodedata_db.h) is: static unsigned int decomp_data[] = { 0, 257, 32, 514, 32, 776, 259, 97, 514, 32, 772, 259, 50, 259, 51, 514, If index == 2 (or 3-5, 7-10, etc), we have: decomp_prefix[decomp_data[2] & 255] decomp_prefix[32 & 255] decomp_prefix[32] which is larger than the max size of decomp_prefix (18). But from what I think you stated above, index can't equal those values and the code that prevents it is calculated a few lines above: index = decomp_index1[(code>>DECOMP_SHIFT)]; index = decomp_index2[(index<<DECOMP_SHIFT)+ (code&((1<<DECOMP_SHIFT)-1))]; Is that correct?
If so, would it be correct to add:

    unsigned short prefix_index = decomp_data[index] & 255;
    assert(prefix_index < (sizeof(decomp_prefix)/sizeof(*decomp_prefix)));

n
From nnorwitz at gmail.com  Wed Jul 26 06:49:56 2006
From: nnorwitz at gmail.com (Neal Norwitz)
Date: Tue, 25 Jul 2006 21:49:56 -0700
Subject: [Python-Dev] remaining issues from Klocwork static analysis
In-Reply-To: <ea4pro$ere$1@sea.gmane.org>
References: <ee2a432c0607242250v5a7ac021p464b68c5c47ec538@mail.gmail.com>
 <44C5CAFC.80004@v.loewis.de> <ea4pro$ere$1@sea.gmane.org>
Message-ID: <ee2a432c0607252149t6136b679w1630fc0fb3dcba63@mail.gmail.com>

On 7/25/06, Georg Brandl <g.brandl at gmx.net> wrote:
> Martin v. Löwis wrote:
> > Neal Norwitz wrote:
> >> # 74 Object/funcobject.c:143 Suspicious deref of ptr before NULL check
> >
> > Not quite sure what it is complaining about, but
> >
> >     else if (PyTuple_Check(closure)) {
> >         Py_XINCREF(closure);
> >     }
> >
> > looks indeed suspicious: Why do we check for NULL (XINCREF) when
> > we know closure can't be NULL (Tuple_Check). Drop the X, and see
> > if the warning goes away
>
> In comparison, the PyFunction_SetDefaults function does check for
> NULL, and raises an error in this case. However, since it is a C API function
> only, passing NULL is an error anyway.

Heh, that was me that added it 10 days ago. :-)

Might as well do the same here.
n From grig.gheorghiu at gmail.com Wed Jul 26 06:57:03 2006 From: grig.gheorghiu at gmail.com (Grig Gheorghiu) Date: Tue, 25 Jul 2006 21:57:03 -0700 Subject: [Python-Dev] Community buildbots -- reprise In-Reply-To: <ee2a432c0607252046p7a6c91a3gf1451fd1f2f3ed55@mail.gmail.com> References: <20060721002058.66596.qmail@web54515.mail.yahoo.com> <ee2a432c0607252046p7a6c91a3gf1451fd1f2f3ed55@mail.gmail.com> Message-ID: <3f09d5a00607252157q117c2e7dxd6c6f75ba7e4fd57@mail.gmail.com> On 7/25/06, Neal Norwitz <nnorwitz at gmail.com> wrote: > > If you want I can send you the build master cfg I setup on python.org > and some simple instructions for how to connect to it. I don't have > time to focus on this at the moment and probably won't until 2.5 is > out. > > n > -- Sure. I'm still a bit unclear on whether you want me to coordinate this by adding buildslaves to the build master cfg, adding build steps etc. I'll gladly do it if you need help. I'll need access to the server and proper permissions of course. Grig -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/python-dev/attachments/20060725/dac00dfd/attachment.html From martin at v.loewis.de Wed Jul 26 07:05:44 2006 From: martin at v.loewis.de (=?ISO-8859-1?Q?=22Martin_v=2E_L=F6wis=22?=) Date: Wed, 26 Jul 2006 07:05:44 +0200 Subject: [Python-Dev] remaining issues from Klocwork static analysis In-Reply-To: <ee2a432c0607252147u6e3e4b37i1a77eb8114ac7349@mail.gmail.com> References: <ee2a432c0607242250v5a7ac021p464b68c5c47ec538@mail.gmail.com> <44C5CAFC.80004@v.loewis.de> <ee2a432c0607252147u6e3e4b37i1a77eb8114ac7349@mail.gmail.com> Message-ID: <44C6F828.7020005@v.loewis.de> Neal Norwitz wrote: >> Not quite sure what it is complaining about, but >> >> else if (PyTuple_Check(closure)) { >> Py_XINCREF(closure); >> } >> >> looks indeed suspicious: Why do we check for NULL (XINCREF) when >> we know closure can't be NULL (Tuple_Check). 
Drop the X, and see >> if the warning goes away > > Yes, I definitely think dropping the X would make the warning go away. > Do we want to check for a NULL pointer and raise an exception? The > docs don't address the issue, so I think if we added a check, ie: if > (closure && PyTuple_Check(closure)) and got rid of the X that would be > fine as well. The docs do address the issue: \var{closure} must be \var{Py_None} or a tuple of cell objects. It doesn't allow for NULL, and None indicates that the closure should become NULL. The only caller of it in the core will never pass NULL. If you want to check that this is not NULL on the grounds that somebody may call it incorrectly, then you should also check that op is not NULL, because somebody may call it incorrectly. > The first line of data is (fro unicodedata_db.h) is: > > static unsigned int decomp_data[] = { > 0, 257, 32, 514, 32, 776, 259, 97, 514, 32, 772, 259, 50, 259, 51, 514, Read this as 0: sentinel 257 = 256 | 1: length 1, prefix 1 32: U+0020 514 = 512 | 2: length 2, prefix 2 32: U+0020 776: U+308 ... > Is that correct? If so, would it be correct to add: > > unsigned short prefix_index = decomp_data[index] & 255; > assert(prefix_index < (sizeof(decomp_prefix)/sizeof(*decomp_prefix))); Yes, that would be correct. Regards, Martin From martin at v.loewis.de Wed Jul 26 07:09:47 2006 From: martin at v.loewis.de (=?ISO-8859-1?Q?=22Martin_v=2E_L=F6wis=22?=) Date: Wed, 26 Jul 2006 07:09:47 +0200 Subject: [Python-Dev] More tracker demos online In-Reply-To: <50862ebd0607251710p454d3cc1s84622aee45774c77@mail.gmail.com> References: <44C6748C.4060701@v.loewis.de> <50862ebd0607251710p454d3cc1s84622aee45774c77@mail.gmail.com> Message-ID: <44C6F91B.1010105@v.loewis.de> Neil Hodgson wrote: > Its disappointing that Jira and Launchpad use > different bug IDs as continuity should be maintained with the SF bug > IDs which will be referred to in other areas such as commit messages. 
My plan is to keep the SF redirector alive, so python.org/sf/<bugid> should continue to direct you to the right item in the new tracker. This can only be done when we actually make the switch, since currently the redirector still needs to direct to SF. Regards, Martin From nnorwitz at gmail.com Wed Jul 26 07:12:42 2006 From: nnorwitz at gmail.com (Neal Norwitz) Date: Tue, 25 Jul 2006 22:12:42 -0700 Subject: [Python-Dev] remaining issues from Klocwork static analysis In-Reply-To: <44C6F828.7020005@v.loewis.de> References: <ee2a432c0607242250v5a7ac021p464b68c5c47ec538@mail.gmail.com> <44C5CAFC.80004@v.loewis.de> <ee2a432c0607252147u6e3e4b37i1a77eb8114ac7349@mail.gmail.com> <44C6F828.7020005@v.loewis.de> Message-ID: <ee2a432c0607252212k5276852dlacb0d73dbd984434@mail.gmail.com> On 7/25/06, "Martin v. L?wis" <martin at v.loewis.de> wrote: > > > > Yes, I definitely think dropping the X would make the warning go away. > > Do we want to check for a NULL pointer and raise an exception? The > > docs don't address the issue, so I think if we added a check, ie: if > > (closure && PyTuple_Check(closure)) and got rid of the X that would be > > fine as well. > > The docs do address the issue: > > \var{closure} must be \var{Py_None} or a tuple of cell objects. > > It doesn't allow for NULL, and None indicates that the closure > should become NULL. The only caller of it in the core will never > pass NULL. > > If you want to check that this is not NULL on the grounds that > somebody may call it incorrectly, then you should also check that > op is not NULL, because somebody may call it incorrectly. We never really did address this issue did? A while back we talked about whether to assert vs check and do PyErr_BadInternalCall(). I don't remember a clear resolution (though my memory). I vaguely remember a preference towards asserting, but I don't know if that was in all cases or maybe it was just my preference. :-) I'm happy to assert here too. But it's really a broader question. 
I guess I'm even happy to just remove the X. It would be nice to handle
this consistently going forward.

n
From martin at v.loewis.de  Wed Jul 26 07:21:16 2006
From: martin at v.loewis.de (=?ISO-8859-1?Q?=22Martin_v=2E_L=F6wis=22?=)
Date: Wed, 26 Jul 2006 07:21:16 +0200
Subject: [Python-Dev] More tracker demos online
In-Reply-To: <ea6msv$hgf$1@sea.gmane.org>
References: <44C6748C.4060701@v.loewis.de> <ea6msv$hgf$1@sea.gmane.org>
Message-ID: <44C6FBCC.5080801@v.loewis.de>

Terry Reedy wrote:
> ""Martin v. Löwis"" <martin at v.loewis.de> wrote in message
> news:44C6748C.4060701 at v.loewis.de...
>> Currently, we have two running tracker demos online:
>>
>> Roundup:
>> http://efod.se/python-tracker/
>>
>> Jira:
>> http://jira.python.atlassian.com/secure/Dashboard.jspa
>
> What user name and passwords will they accept, if any?

The roundup installation accepts SF user names, no password; you can then
set the password if you want to.

The Jira installation accepts SF user names; you have to make it send you
a password reminder. The registered email address is the SF one.

If you have problems getting in, please let me know. I (and the other
infrastructure people) have admin privs on these installations, so I can
learn how to administrate them if something goes wrong :-)

Regards,
Martin

From martin at v.loewis.de  Wed Jul 26 07:32:01 2006
From: martin at v.loewis.de (=?ISO-8859-1?Q?=22Martin_v=2E_L=F6wis=22?=)
Date: Wed, 26 Jul 2006 07:32:01 +0200
Subject: [Python-Dev] remaining issues from Klocwork static analysis
In-Reply-To: <ee2a432c0607252212k5276852dlacb0d73dbd984434@mail.gmail.com>
References: <ee2a432c0607242250v5a7ac021p464b68c5c47ec538@mail.gmail.com>
 <44C5CAFC.80004@v.loewis.de>
 <ee2a432c0607252147u6e3e4b37i1a77eb8114ac7349@mail.gmail.com>
 <44C6F828.7020005@v.loewis.de>
 <ee2a432c0607252212k5276852dlacb0d73dbd984434@mail.gmail.com>
Message-ID: <44C6FE51.7060509@v.loewis.de>

Neal Norwitz wrote:
> We never really did address this issue did?
A while back we talked > about whether to assert vs check and do PyErr_BadInternalCall(). I > don't remember a clear resolution (though my memory). I vaguely > remember a preference towards asserting, but I don't know if that was > in all cases or maybe it was just my preference. :-) > > I'm happy to assert here too. But it's really a broader question. I > guess I'm even happy to just remove the X. It would be nice to handle > this consistently going forward. I would just remove the X. If we want to handle it consistently, we would have to check all pointer parameters in all functions; this would be a huge task (and for little value, IMO). In any case, "closure && PyTuple_Check(closure)" would be wrong, since it then goes into PyErr_Format(PyExc_SystemError, "expected tuple for closure, got '%.100s'", closure->ob_type->tp_name); which crashes just the same. Regards, Martin From nnorwitz at gmail.com Wed Jul 26 07:41:14 2006 From: nnorwitz at gmail.com (Neal Norwitz) Date: Tue, 25 Jul 2006 22:41:14 -0700 Subject: [Python-Dev] remaining issues from Klocwork static analysis In-Reply-To: <44C6FE51.7060509@v.loewis.de> References: <ee2a432c0607242250v5a7ac021p464b68c5c47ec538@mail.gmail.com> <44C5CAFC.80004@v.loewis.de> <ee2a432c0607252147u6e3e4b37i1a77eb8114ac7349@mail.gmail.com> <44C6F828.7020005@v.loewis.de> <ee2a432c0607252212k5276852dlacb0d73dbd984434@mail.gmail.com> <44C6FE51.7060509@v.loewis.de> Message-ID: <ee2a432c0607252241i681435e2n7c846b2d46355936@mail.gmail.com> On 7/25/06, "Martin v. L?wis" <martin at v.loewis.de> wrote: > Neal Norwitz wrote: > > We never really did address this issue did? A while back we talked > > about whether to assert vs check and do PyErr_BadInternalCall(). I > > don't remember a clear resolution (though my memory). I vaguely > > remember a preference towards asserting, but I don't know if that was > > in all cases or maybe it was just my preference. :-) > > > > I'm happy to assert here too. But it's really a broader question. 
I > > guess I'm even happy to just remove the X. It would be nice to handle > > this consistently going forward. > > I would just remove the X. I'll do that here since it's the easiest. > If we want to handle it consistently, we would have to check all pointer > parameters in all functions; this would be a huge task (and for little > value, IMO). I'm not suggesting changing existing code, unless we find issues. I agree that it would be a huge task and of little value. I was thinking about for future code. I guess we aren't writing a lot of new C APIs in 2.x, so it really doesn't much there matter. Though for 3k, it would be nice to make it consistent as new APIs are written or old APIs are cleaned up. n From greg.ewing at canterbury.ac.nz Wed Jul 26 08:13:16 2006 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Wed, 26 Jul 2006 18:13:16 +1200 Subject: [Python-Dev] new security doc using object-capabilities In-Reply-To: <44C567A0.2080906@blueyonder.co.uk> References: <5.1.1.6.0.20060724015521.0261d160@sparrow.telecommunity.com> <5.1.1.6.0.20060723194448.0205fa50@sparrow.telecommunity.com> <bbaeab100607191535p543cb0ddj66410cf985dd9b77@mail.gmail.com> <20060722114615.GB13782@code0.codespeak.net> <44C2CE78.9020802@blueyonder.co.uk> <20060723070811.GD13782@code0.codespeak.net> <5.1.1.6.0.20060723164246.04144298@sparrow.telecommunity.com> <44C3F31D.3070201@blueyonder.co.uk> <5.1.1.6.0.20060723194448.0205fa50@sparrow.telecommunity.com> <5.1.1.6.0.20060724015521.0261d160@sparrow.telecommunity.com> <5.1.1.6.0.20060724211442.0262c008@sparrow.telecommunity.com> <44C567A0.2080906@blueyonder.co.uk> Message-ID: <44C707FC.6@canterbury.ac.nz> > Phillip J. Eby wrote: > > > And what about code that needs to pass on a subset of a capability? With one object == one capability, there is no such thing as a subset of a capability -- the capabilities are the atomic units at which you control access. So you need to make them fine-grained enough to begin with. 
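
[Editor's sketch, not part of the original message.] The "one object == one capability" point can be illustrated with closures: rather than handing out one object and trying to expose a subset of it, the authority is split into separate objects up front. The names make_log_capabilities and untrusted_reporter are hypothetical, and closures in CPython can be introspected (for example through __closure__), so this shows only the design idea, not an enforced security boundary.

```python
def make_log_capabilities():
    """Return two separate capabilities for one private log."""
    lines = []  # reachable only through the two closures below

    def read_log():
        # Hand back a copy so readers cannot mutate the real log.
        return list(lines)

    def append_log(line):
        lines.append(line)

    return read_log, append_log


def untrusted_reporter(read_log):
    # This code was given only the read capability, so it has no
    # reference through which it could append to the log.
    return len(read_log())


read_log, append_log = make_log_capabilities()
append_log("started")
print(untrusted_reporter(read_log))
```

Each closure is the atomic unit of access here: code that should only read receives read_log and nothing else.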
-- Greg From g.brandl at gmx.net Wed Jul 26 09:16:25 2006 From: g.brandl at gmx.net (Georg Brandl) Date: Wed, 26 Jul 2006 09:16:25 +0200 Subject: [Python-Dev] remaining issues from Klocwork static analysis In-Reply-To: <ee2a432c0607252147u6e3e4b37i1a77eb8114ac7349@mail.gmail.com> References: <ee2a432c0607242250v5a7ac021p464b68c5c47ec538@mail.gmail.com> <44C5CAFC.80004@v.loewis.de> <ee2a432c0607252147u6e3e4b37i1a77eb8114ac7349@mail.gmail.com> Message-ID: <ea74pt$ign$1@sea.gmane.org> Neal Norwitz wrote: > On 7/25/06, "Martin v. L?wis" <martin at v.loewis.de> wrote: >> Neal Norwitz wrote: >> > # 74 Object/funcobject.c:143 Suspicious deref of ptr before NULL check >> >> Not quite sure what it is complaining about, but >> >> else if (PyTuple_Check(closure)) { >> Py_XINCREF(closure); >> } >> >> looks indeed suspicious: Why do we check for NULL (XINCREF) when >> we know closure can't be NULL (Tuple_Check). Drop the X, and see >> if the warning goes away > > Yes, I definitely think dropping the X would make the warning go away. > Do we want to check for a NULL pointer and raise an exception? The > docs don't address the issue, so I think if we added a check, ie: if > (closure && PyTuple_Check(closure)) and got rid of the X that would be > fine as well. You'll have to do something about the error message, then, since it uses closure->ob_type. Georg From greg.ewing at canterbury.ac.nz Wed Jul 26 09:39:40 2006 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Wed, 26 Jul 2006 19:39:40 +1200 Subject: [Python-Dev] Strategy for converting the decimal module to C In-Reply-To: <E1G5Irg-0003Zi-9b@libra.cus.cam.ac.uk> References: <E1G5Irg-0003Zi-9b@libra.cus.cam.ac.uk> Message-ID: <44C71C3C.4030604@canterbury.ac.nz> Nick Maclaren wrote: > The compiled code has made a data structure temporarily inconsistent > because the operation is safe (say, list insertion), and then gets an > asynchronous interrupt (e.g. SIGINT). The SIGINT handler does some > operation (e.g. 
I/O) that implicitly uses floating-point, which then > interrupts. Well, of course anything can be made to happen asynchronously by calling it from something asynchronous, such as a SIGINT handler. That doesn't change the fact that the floating point operation itself is deterministic, including whether it causes an exception. Well-written programs don't do any more in a signal handler than is absolutely necessary, for reasons which apply equally well whether floating point is involved or not. I'd say the mistake was made right at the beginning by assuming that the data structure in question was safe while allowing a SIGINT to occur to a handler that's not careful enough about what it does. BTW, it seems to me you could get exactly the same problem if FP exceptions were handled entirely in user mode, as you suggest. Not that I don't agree that would be a good idea -- I do -- but it wouldn't prevent this particular kind of mistake. And all of this is getting rather far away from where we started, which was simply instrumenting a piece of code to count floating point exceptions. Such a program isn't going to be doing I/O in SIGINT handlers or installing FP exception handlers that mess with unrelated critical data structures. -- Greg From gh at ghaering.de Wed Jul 26 10:17:04 2006 From: gh at ghaering.de (=?ISO-8859-1?Q?Gerhard_H=E4ring?=) Date: Wed, 26 Jul 2006 10:17:04 +0200 Subject: [Python-Dev] remaining issues from Klocwork static analysis In-Reply-To: <44C5CAFC.80004@v.loewis.de> References: <ee2a432c0607242250v5a7ac021p464b68c5c47ec538@mail.gmail.com> <44C5CAFC.80004@v.loewis.de> Message-ID: <44C72500.9080301@ghaering.de> Martin v. L?wis wrote: > Neal Norwitz wrote: >> # 61 Modules/_sqlite/cursor.c:599 Null pointer may be dereferenced >> >> Null pointer 'self->statement' that comes from line 674 may be >> dereferenced by passing argument 1 to function >> 'statement_mark_dirty' at line 599. > > Looks like a problem. Maybe a break is missing after line 674? 
The code is a bit complicated here, and admittedly not the nicest one, but
I verified it and there is no problem here.

_query_execute() is designed to handle both executemany() and execute().

multiple is a local variable that is 1 if there is a set of tuples of SQL
parameters (executemany) instead of a single tuple of SQL parameters
(execute), in which case it's 0.

Before the while loop, the code makes sure that parameters_iter is an
iterator that returns SQL parameter tuples. So if there is only a single
parameter tuple, a temporary list is created and parameters_iter is an
iterator over it.

So, if !multiple (referenced code in line 674), the while-loop will only be
executed once, and on the second iteration, the while-loop will exit
because then the following code will break:

    parameters = PyIter_Next(parameters_iter);
    if (!parameters) {
        break;
    }

Code in line 599 is thus not executed with self->statement == 0.

-- Gerhard

From mwh at python.net  Wed Jul 26 10:53:53 2006
From: mwh at python.net (Michael Hudson)
Date: Wed, 26 Jul 2006 09:53:53 +0100
Subject: [Python-Dev] remaining issues from Klocwork static analysis
In-Reply-To: <ee2a432c0607252212k5276852dlacb0d73dbd984434@mail.gmail.com>
 (Neal Norwitz's message of "Tue, 25 Jul 2006 22:12:42 -0700")
References: <ee2a432c0607242250v5a7ac021p464b68c5c47ec538@mail.gmail.com>
 <44C5CAFC.80004@v.loewis.de>
 <ee2a432c0607252147u6e3e4b37i1a77eb8114ac7349@mail.gmail.com>
 <44C6F828.7020005@v.loewis.de>
 <ee2a432c0607252212k5276852dlacb0d73dbd984434@mail.gmail.com>
Message-ID: <2mwta0ycf2.fsf@starship.python.net>

"Neal Norwitz" <nnorwitz at gmail.com> writes:

> On 7/25/06, "Martin v. Löwis" <martin at v.loewis.de> wrote:
>> >
>> > Yes, I definitely think dropping the X would make the warning go away.
>> > Do we want to check for a NULL pointer and raise an exception?
The >> > docs don't address the issue, so I think if we added a check, ie: if >> > (closure && PyTuple_Check(closure)) and got rid of the X that would be >> > fine as well. >> >> The docs do address the issue: >> >> \var{closure} must be \var{Py_None} or a tuple of cell objects. >> >> It doesn't allow for NULL, and None indicates that the closure >> should become NULL. The only caller of it in the core will never >> pass NULL. >> >> If you want to check that this is not NULL on the grounds that >> somebody may call it incorrectly, then you should also check that >> op is not NULL, because somebody may call it incorrectly. > > We never really did address this issue did? A while back we talked > about whether to assert vs check and do PyErr_BadInternalCall(). I > don't remember a clear resolution (though my memory). I vaguely > remember a preference towards asserting, but I don't know if that was > in all cases or maybe it was just my preference. :-) > > I'm happy to assert here too. But it's really a broader question. I > guess I'm even happy to just remove the X. It would be nice to handle > this consistently going forward. I think I'm rather in favour of assert()ing this sort of thing. If you're programming in C, you can cause crashes any which way and removing one doesn't seem worth making correct usage pay any kind of (admittedly miniscule) performance penalty. It would be nice if API docs explicitly stated which pointer arguments could be NULL, and then it would be a programming error to pass a NULL pointer argument in any other place. I have no idea how far away from this we are already :-) Cheers, mwh -- Gullible editorial staff continues to post links to any and all articles that vaguely criticize Linux in any way. 
-- Reason #4 for quitting slashdot today, from http://www.cs.washington.edu/homes/klee/misc/slashdot.html From python at rcn.com Wed Jul 26 11:18:24 2006 From: python at rcn.com (Raymond Hettinger) Date: Wed, 26 Jul 2006 02:18:24 -0700 Subject: [Python-Dev] Strategy for converting the decimal module to C References: <E1G5Irg-0003Zi-9b@libra.cus.cam.ac.uk> <44C71C3C.4030604@canterbury.ac.nz> Message-ID: <003c01c6b094$72de56a0$0f10000a@RaymondLaptop1> Greg Ewing > And all of this is getting rather far away from where we > started, which was simply instrumenting a piece of code > to count floating point exceptions. I'm thinking of adding a note to the Py2.5 docs that the counting feature is not part of the standard and should not be expected to work on other implementations of the standard (including a planned CPython extension module). Raymond From facundobatista at gmail.com Wed Jul 26 17:52:48 2006 From: facundobatista at gmail.com (Facundo Batista) Date: Wed, 26 Jul 2006 12:52:48 -0300 Subject: [Python-Dev] Strategy for converting the decimal module to C In-Reply-To: <003c01c6b094$72de56a0$0f10000a@RaymondLaptop1> References: <E1G5Irg-0003Zi-9b@libra.cus.cam.ac.uk> <44C71C3C.4030604@canterbury.ac.nz> <003c01c6b094$72de56a0$0f10000a@RaymondLaptop1> Message-ID: <e04bdf310607260852qeee5239t97bad5a5f1424017@mail.gmail.com> 2006/7/26, Raymond Hettinger <python at rcn.com>: > Greg Ewing > > And all of this is getting rather far away from where we > > started, which was simply instrumenting a piece of code > > to count floating point exceptions. > > I'm thinking of adding a note to the Py2.5 docs that the counting feature is not > part of the standard and should not be expected to work on other implementations > of the standard (including a planned CPython extension module). +1 -- . 
Facundo Blog: http://www.taniquetil.com.ar/plog/ PyAr: http://www.python.org/ar/ From syfou at users.sourceforge.net Wed Jul 26 20:11:53 2006 From: syfou at users.sourceforge.net (Sylvain Fourmanoit) Date: Wed, 26 Jul 2006 14:11:53 -0400 (EDT) Subject: [Python-Dev] New miniconf module Message-ID: <Pine.LNX.4.64.0607261402410.5956@sylvain> I wrote a data persistence module called miniconf, aimed at making easy to create and safely retrieve configuration info from external, human-readable sources using Python syntax. I feel it would eventually make a nice addition to the standard library. The code was only newly refactored in this form, but it as been broadly distributed and used as a part of the adesklets project for over a year by a significant user base on multiple platforms. Here it is, as a patch against Python 2.5 SVN tree[1], or as a stand-alone module hosted on the Python Cheese Shop[2]; any feedback is welcomed. -- Sylvain <syfou at users.sourceforge.net> Hackers are just a migratory lifeform with a tropism for computers. [1]http://sourceforge.net/tracker/index.php?func=detail&aid=1527597&group_id=5470&atid=355470 [2]http://cheeseshop.python.org/pypi?:action=display&name=miniconf&version=1.0.1 From amk at amk.ca Wed Jul 26 19:58:24 2006 From: amk at amk.ca (A.M. Kuchling) Date: Wed, 26 Jul 2006 13:58:24 -0400 Subject: [Python-Dev] 2.5: uses of sys.exc_type, exc_value Message-ID: <20060726175824.GA17956@localhost.localdomain> http://www.python.org/sf/1525469 reports that SimpleXMLRPCServer.py still uses sys.exc_type and sys.exc_value when handling exceptions. These variables aren't thread-safe and sys.exc_info() is the better way. I have a patch attached to the bug that fixes the problem. Question 1: is this worth fixing for 2.5? (It's not really a bugfix, more of a style cleanup.) 
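
[Editor's sketch, not part of the original message.] For readers unfamiliar with the distinction: sys.exc_type and sys.exc_value were process-wide globals, whereas sys.exc_info() returns the exception being handled in the current thread. The helper below is a minimal illustration written for current Python; describe_failure is a hypothetical name and the "%s:%s" format is only illustrative, not the text of the patch attached to the bug.

```python
import sys


def describe_failure(func):
    """Call func() and return 'Type:message' if it raises, else None."""
    try:
        func()
    except Exception:
        # exc_info() reports the exception for the *current* thread,
        # unlike the old module-level sys.exc_type / sys.exc_value.
        exc_type, exc_value, exc_tb = sys.exc_info()
        try:
            return "%s:%s" % (exc_type.__name__, exc_value)
        finally:
            # Drop the traceback reference to avoid a reference cycle.
            del exc_tb
    return None


print(describe_failure(lambda: 1 / 0))
```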
Question 2: I searched for uses of the old variables and found these: Lib/idlelib/WindowList.py: sys.exc_type, ":", sys.exc_value Lib/logging/__init__.py: return sys.exc_traceback.tb_frame.f_back Lib/lib-tk/Tkinter.py: exc, val, tb = sys.exc_type, sys.exc_value, sys.exc_traceback Lib/plat-mac/cfmfile.py: raise Res.Error, "no 'cfrg' resource found", sys.exc_traceback Lib/SocketServer.py: sys.exc_traceback = None # Help garbage collection Plus some references in the test suite, the demos, and faqwizard.py. SocketServer should use sys.exc_clear() instead. Tkinter.py could just call exc_info(), but I wonder if the usage of the variables is intentional here. sys.exc_info() was introduced in Python 1.5, so logging/__init__.py could be fixed without affecting 1.5.2 compatibility. Should the above uses be fixed, too? --amk From guido at python.org Wed Jul 26 20:21:50 2006 From: guido at python.org (Guido van Rossum) Date: Wed, 26 Jul 2006 11:21:50 -0700 Subject: [Python-Dev] 2.5: uses of sys.exc_type, exc_value In-Reply-To: <20060726175824.GA17956@localhost.localdomain> References: <20060726175824.GA17956@localhost.localdomain> Message-ID: <ca471dc20607261121i7bfe7115oa217c36170bddc92@mail.gmail.com> Clearly they should be fixed. Whether in 2.5 or 2.6 I'll leave up to Neal and Anthony. On 7/26/06, A.M. Kuchling <amk at amk.ca> wrote: > http://www.python.org/sf/1525469 reports that SimpleXMLRPCServer.py > still uses sys.exc_type and sys.exc_value when handling exceptions. > These variables aren't thread-safe and sys.exc_info() is the better > way. I have a patch attached to the bug that fixes the problem. > > Question 1: is this worth fixing for 2.5? (It's not really a bugfix, > more of a style cleanup.) 
> > Question 2: I searched for uses of the old variables and found these: > > Lib/idlelib/WindowList.py: sys.exc_type, ":", sys.exc_value > Lib/logging/__init__.py: return sys.exc_traceback.tb_frame.f_back > Lib/lib-tk/Tkinter.py: exc, val, tb = sys.exc_type, sys.exc_value, sys.exc_traceback > Lib/plat-mac/cfmfile.py: raise Res.Error, "no 'cfrg' resource found", sys.exc_traceback > Lib/SocketServer.py: sys.exc_traceback = None # Help garbage collection > > Plus some references in the test suite, the demos, and faqwizard.py. > > SocketServer should use sys.exc_clear() instead. Tkinter.py could > just call exc_info(), but I wonder if the usage of the variables is > intentional here. sys.exc_info() was introduced in Python 1.5, so > logging/__init__.py could be fixed without affecting 1.5.2 > compatibility. > > Should the above uses be fixed, too? > > --amk > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > http://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: http://mail.python.org/mailman/options/python-dev/guido%40python.org > -- --Guido van Rossum (home page: http://www.python.org/~guido/) From pje at telecommunity.com Wed Jul 26 20:40:27 2006 From: pje at telecommunity.com (Phillip J. Eby) Date: Wed, 26 Jul 2006 14:40:27 -0400 Subject: [Python-Dev] Release manager pronouncement needed: PEP 302 Fix Message-ID: <5.1.1.6.0.20060726142456.036d9008@sparrow.telecommunity.com> I posted last week about a need-for-speed patch that broke PEP 302 compliance, and asked if it should be fixed or reverted. I got exactly one response which said "yes, it should be fixed or reverted", which unfortunately didn't answer my question as to which one we should do. :) If we don't revert it, there are two ways to fix it. One is to just change PEP 302 so that the behavior is unbroken by definition. 
:) The other is to actually go ahead and fix it by adding PathImporter and NullImporter types to import.c, along with a factory function on sys.path_hooks to create them. (This would've been the PEP-compliant way to implement the need-for-speed patch.) So, "fix" by documentation, fix by fixing, or fix by reverting? Which should it be? From david.nospam.hopwood at blueyonder.co.uk Wed Jul 26 20:47:30 2006 From: david.nospam.hopwood at blueyonder.co.uk (David Hopwood) Date: Wed, 26 Jul 2006 19:47:30 +0100 Subject: [Python-Dev] New miniconf module In-Reply-To: <Pine.LNX.4.64.0607261402410.5956@sylvain> References: <Pine.LNX.4.64.0607261402410.5956@sylvain> Message-ID: <44C7B8C2.1060904@blueyonder.co.uk> Sylvain Fourmanoit wrote: > I wrote a data persistence module called miniconf, aimed at making > easy to create and safely retrieve configuration info from external, > human-readable sources using Python syntax. I feel it would eventually > make a nice addition to the standard library. >From a security point of view, this is a great improvement on the existing pickle, marshal, and shelve modules. Those modules could not be safely imported from restricted code. miniconf, OTOH, appears to have an interface compatible with capability security. (I have not checked that the compiler.ast module used in its implementation is safe.) However: +Limitations +=========== + +miniconf has a few limitations one should be aware of: [...] +- It is not preemptiple: concurrent calls to dump() or load() will + have unpredictable results and must be avoided. This limitation should be fixed before the module is added to the standard library, IMHO. -- David Hopwood <david.nospam.hopwood at blueyonder.co.uk> From pje at telecommunity.com Wed Jul 26 22:35:43 2006 From: pje at telecommunity.com (Phillip J. 
Eby) Date: Wed, 26 Jul 2006 16:35:43 -0400 Subject: [Python-Dev] New miniconf module In-Reply-To: <44C7B8C2.1060904@blueyonder.co.uk> References: <Pine.LNX.4.64.0607261402410.5956@sylvain> <Pine.LNX.4.64.0607261402410.5956@sylvain> Message-ID: <5.1.1.6.0.20060726162457.02d5a7c0@sparrow.telecommunity.com> At 07:47 PM 7/26/2006 +0100, David Hopwood wrote: >Sylvain Fourmanoit wrote: > > I wrote a data persistence module called miniconf, aimed at making > > easy to create and safely retrieve configuration info from external, > > human-readable sources using Python syntax. I feel it would eventually > > make a nice addition to the standard library. > > >From a security point of view, this is a great improvement on the existing >pickle, marshal, and shelve modules. Those modules could not be safely >imported from restricted code. > >miniconf, OTOH, appears to have an interface compatible with capability >security. (I have not checked that the compiler.ast module used in its >implementation is safe.) However: > >+Limitations >+=========== >+ >+miniconf has a few limitations one should be aware of: >[...] >+- It is not preemptiple: concurrent calls to dump() or load() will >+ have unpredictable results and must be avoided. > >This limitation should be fixed before the module is added to the standard >library, IMHO. It looks like it's trivial to fix; the code uses a strange and unnecessary complication of creating nested classes and nested singleton instances thereof. Getting rid of the singletons to create a new instance for each dump/load call would suffice to make the implementation re-entrant, although de-nesting the classes would also be a good idea. :) The loading code could also be made a lot faster by using a dictionary mapping AST node types to functions, instead of doing string manipulation for each node. Each function could take 'pedantic' as a parameter, which would eliminate the need to have an object at all, let alone a singleton. 
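
[Editor's sketch, not part of the original message.] The dictionary-dispatch idea described above might look roughly like the following. This is not miniconf's code: it assumes the modern ast module (Python 3.8 or later) instead of the compiler.ast package miniconf used, and every function name here is hypothetical. The handlers are plain functions that take 'pedantic' as a parameter, so no shared instance exists and concurrent calls do not interfere with each other.

```python
import ast


def _load_constant(node, pedantic):
    # Numbers, strings, True, False and None are all ast.Constant.
    return node.value


def _load_list(node, pedantic):
    return [_load_node(item, pedantic) for item in node.elts]


def _load_tuple(node, pedantic):
    return tuple(_load_node(item, pedantic) for item in node.elts)


def _load_dict(node, pedantic):
    return {_load_node(k, pedantic): _load_node(v, pedantic)
            for k, v in zip(node.keys, node.values)}


# Dispatch table: AST node type -> handler function.
_HANDLERS = {
    ast.Constant: _load_constant,
    ast.List: _load_list,
    ast.Tuple: _load_tuple,
    ast.Dict: _load_dict,
}


def _load_node(node, pedantic):
    handler = _HANDLERS.get(type(node))
    if handler is None:
        # Anything else (calls, names, operators, ...) is rejected
        # when pedantic, and replaced by None otherwise.
        if pedantic:
            raise ValueError("unsupported construct: %s"
                             % type(node).__name__)
        return None
    return handler(node, pedantic)


def load(source, pedantic=True):
    """Parse 'name = literal' lines into a dict without executing them."""
    result = {}
    for stmt in ast.parse(source).body:
        if (isinstance(stmt, ast.Assign) and len(stmt.targets) == 1
                and isinstance(stmt.targets[0], ast.Name)):
            result[stmt.targets[0].id] = _load_node(stmt.value, pedantic)
        elif pedantic:
            raise ValueError("only simple assignments are allowed")
    return result


print(load("name = 'demo'\nports = [80, 443]\nopts = {'debug': True}"))
```

Because the source is only parsed and never evaluated, a line such as x = open('f') raises ValueError in pedantic mode instead of running anything.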
Finally, there is an interesting characteristic of the code's
interpretation of names: any name other than 'True' is interpreted as
'False'!

On the whole, though, I don't see a lot of difference between this format
and say, JavaScript Object Notation (JSON), which can be parsed and
generated by many other languages as well as multiple Python libraries
already.

From syfou at users.sourceforge.net  Wed Jul 26 22:45:35 2006
From: syfou at users.sourceforge.net (Sylvain Fourmanoit)
Date: Wed, 26 Jul 2006 16:45:35 -0400 (EDT)
Subject: [Python-Dev] New miniconf module
In-Reply-To: <44C7B8C2.1060904@blueyonder.co.uk>
References: <Pine.LNX.4.64.0607261402410.5956@sylvain>
 <44C7B8C2.1060904@blueyonder.co.uk>
Message-ID: <Pine.LNX.4.64.0607261613130.6037@sylvain>

> miniconf, OTOH, appears to have an interface compatible with capability
> security (I have not checked that the compiler.ast module used in its
> implementation is safe.)

I wouldn't be 100% sure either (obviously, I didn't write this nice piece
of code, let alone the underlying parser), but I read it and tried to
abuse it without success (I haven't found obvious buffer overflows and
such)...

As far as I know, the abstract syntax tree generation exposed via
compiler.ast is a safe operation, in the sense that it doesn't allow
execution of code when fed arbitrary strings via compiler.parse(); in the
worst case scenario, it raises a SyntaxError or similar exceptions, as
documented... If anybody knows more on this issue, I will be happy to hear
about it.

> miniconf has a few limitations one should be aware of:
>
> - It is not preemptiple: concurrent calls to dump() or load() will
>   have unpredictable results and must be avoided.
>
> This limitation should be fixed before the module is added to the
> standard library, IMHO.

If this is the general opinion, I will be glad to change this...
The only reason miniconf is not thread-safe for the moment is that I chose to re-use over and over a single instance of each of my two processing classes to reduce resources usage, but this seems pretty pointless (and overly complicated) now that I look at it. Yours, -- Sylvain <syfou at users.sourceforge.net> Your files are now being encrypted and thrown into the bit bucket. EOF From syfou at users.sourceforge.net Wed Jul 26 23:43:21 2006 From: syfou at users.sourceforge.net (Sylvain Fourmanoit) Date: Wed, 26 Jul 2006 17:43:21 -0400 (EDT) Subject: [Python-Dev] New miniconf module In-Reply-To: <Pine.LNX.4.64.0607261613130.6037@sylvain> References: <Pine.LNX.4.64.0607261402410.5956@sylvain> <44C7B8C2.1060904@blueyonder.co.uk> <Pine.LNX.4.64.0607261613130.6037@sylvain> Message-ID: <Pine.LNX.4.64.0607261742160.6037@sylvain> > It looks like it's trivial to fix; the code uses a strange and > unnecessary complication of creating nested classes and nested > singleton instances thereof. Getting rid of the singletons to create a > new instance for each dump/load call would suffice to make the > implementation re-entrant, although de-nesting the classes would also be > a good idea. :) OK then, I will change this. > The loading code could also be made a lot faster by using a dictionary > mapping AST node types to functions, instead of doing string > manipulation for each node. Each function could take 'pedantic' as a > parameter, which would eliminate the need to have an object at all, let > alone a singleton. > I am not convinced the current string manipulation for mapping the nodes types to the methods of the _Load class has such a significant impact on performance, but I will test your suggestion... The only difference with current code is that we use a dynamically computed string as the dictionary key to locate the function instead of the node type themselves as keys. 
> Finally, there is an interesting characteristic of the code's > interpretation of names: any name other than 'True' is interpreted as > 'False'! ;-) It will be corrected in the next release. > On the whole, though, I don't see a lot of difference between this format > and say, JavaScript Object Notation (JSON), which can be parsed and > generated by many other languages as well as multiple Python libraries > already. The difference is that this is Python code, already familiar to all Python coders... Besides, it sits directly on top of the real Python parser, mitigating the need of a new one, and keeping the added code complexity to a strict minimum. But I agree this looks a lot like JSON, since ecmascript syntax for literals looks a lot like the one of Python... For the same reasons there is a need for JSON, I think having something like miniconf in the standard lib would benefit the users. -- Sylvain <syfou at users.sourceforge.net> If you think the system is working, ask someone who's waiting for a prompt. From pje at telecommunity.com Thu Jul 27 00:03:50 2006 From: pje at telecommunity.com (Phillip J. Eby) Date: Wed, 26 Jul 2006 18:03:50 -0400 Subject: [Python-Dev] New miniconf module In-Reply-To: <Pine.LNX.4.64.0607261742160.6037@sylvain> References: <Pine.LNX.4.64.0607261613130.6037@sylvain> <Pine.LNX.4.64.0607261402410.5956@sylvain> <44C7B8C2.1060904@blueyonder.co.uk> <Pine.LNX.4.64.0607261613130.6037@sylvain> Message-ID: <5.1.1.6.0.20060726180017.0262c990@sparrow.telecommunity.com> At 05:43 PM 7/26/2006 -0400, Sylvain Fourmanoit wrote: > > The loading code could also be made a lot faster by using a dictionary > > mapping AST node types to functions, instead of doing string > > manipulation for each node. Each function could take 'pedantic' as a > > parameter, which would eliminate the need to have an object at all, let > > alone a singleton. 
> > >I am not convinced the current string manipulation for mapping the nodes >types to the methods of the _Load class has such a significant impact on >performance, but I will test your suggestion... I haven't tested this with your code specifically, but I know that in the past I have nearly tripled the speed of AST-visiting code by doing this; string manipulation plus attribute lookup is a lot more expensive than direct dictionary lookups. > The only difference with >current code is that we use a dynamically computed string as the >dictionary key to locate the function instead of the node type themselves >as keys. Actually you're doing string manipulation plus an *attribute* lookup, and attribute lookups can involve multiple dictionary lookups. But anyway, test and see what you get. :) >But I agree this looks a lot like JSON, since ecmascript syntax for >literals looks a lot like the one of Python... For the same reasons there >is a need for JSON, I think having something like miniconf in the >standard lib would benefit the users. Actually, I would see more reason to include JSON in the standard library, since it's at least something approaching an internet protocol these days. From jjl at pobox.com Thu Jul 27 00:18:30 2006 From: jjl at pobox.com (John J Lee) Date: Wed, 26 Jul 2006 22:18:30 +0000 (UTC) Subject: [Python-Dev] New miniconf module In-Reply-To: <5.1.1.6.0.20060726180017.0262c990@sparrow.telecommunity.com> References: <Pine.LNX.4.64.0607261613130.6037@sylvain> <Pine.LNX.4.64.0607261402410.5956@sylvain> <44C7B8C2.1060904@blueyonder.co.uk> <Pine.LNX.4.64.0607261613130.6037@sylvain> <5.1.1.6.0.20060726180017.0262c990@sparrow.telecommunity.com> Message-ID: <Pine.LNX.4.64.0607262218060.8397@localhost> On Wed, 26 Jul 2006, Phillip J. Eby wrote: [...] > Actually, I would see more reason to include JSON in the standard library, > since it's at least something approaching an internet protocol these days. 
+1 John From bob at redivi.com Thu Jul 27 00:41:41 2006 From: bob at redivi.com (Bob Ippolito) Date: Wed, 26 Jul 2006 15:41:41 -0700 Subject: [Python-Dev] JSON implementation in Python 2.6 In-Reply-To: <Pine.LNX.4.64.0607262218060.8397@localhost> References: <Pine.LNX.4.64.0607261613130.6037@sylvain> <Pine.LNX.4.64.0607261402410.5956@sylvain> <44C7B8C2.1060904@blueyonder.co.uk> <Pine.LNX.4.64.0607261613130.6037@sylvain> <5.1.1.6.0.20060726180017.0262c990@sparrow.telecommunity.com> <Pine.LNX.4.64.0607262218060.8397@localhost> Message-ID: <661F2C7D-E446-4BB3-B5A7-8237F0850E91@redivi.com> On Jul 26, 2006, at 3:18 PM, John J Lee wrote: > On Wed, 26 Jul 2006, Phillip J. Eby wrote: > [...] >> Actually, I would see more reason to include JSON in the standard >> library, >> since it's at least something approaching an internet protocol >> these days. > > +1 If there's a consensus on that, my simplejson [1] implementation could migrate to the stdlib for 2.6. The API is modeled after marshal and pickle, the code should be PEP 8 compliant, its test suite has pretty good coverage, it's already used by (at least) TurboGears and Django, and it's the implementation currently "endorsed by" json.org. The work that would be required would be: - LaTeX docs (currently reST in docstrings) - Move the tests around and make them run from the suite rather than via nose - Possible module rename (jsonlib?) [1] http://undefined.org/python/#simplejson -bob From steven.bethard at gmail.com Thu Jul 27 01:42:58 2006 From: steven.bethard at gmail.com (Steven Bethard) Date: Wed, 26 Jul 2006 17:42:58 -0600 Subject: [Python-Dev] DRAFT: python-dev summary for 2006-06-16 to 2006-06-30 Message-ID: <d11dcfba0607261642t73095955lc807b4d9a29464a0@mail.gmail.com> Here's the draft for the second half of June. As always, comments and corrections are greatly appreciated. 
============= Announcements ============= ------------------- Python 2.5 schedule ------------------- A number of bugs are being squashed as Python 2.5 moves towards its next release. See `PEP 356`_ for more details and the full schedule. .. _PEP 356: http://www.python.org/dev/peps/pep-0356/ Contributing threads: - `Beta 1 schedule ? (Bug in stringobject?) <http://mail.python.org/pipermail/python-dev/2006-June/066113.html>`__ - `Adding winerror module (Beta 1 schedule ?) <http://mail.python.org/pipermail/python-dev/2006-June/066159.html>`__ - `current 2.5 issues <http://mail.python.org/pipermail/python-dev/2006-June/066204.html>`__ - `TRUNK FREEZE IMMINENT FOR 2.5 BETA 1 - 00:00 UTC, 20-JUNE-2006 <http://mail.python.org/pipermail/python-dev/2006-June/066206.html>`__ - `beta1 coming real soon <http://mail.python.org/pipermail/python-dev/2006-June/066253.html>`__ - `RELEASED Python 2.5 (beta 1) <http://mail.python.org/pipermail/python-dev/2006-June/066318.html>`__ - `TRUNK is UNFROZEN, but in FEATURE FREEZE <http://mail.python.org/pipermail/python-dev/2006-June/066322.html>`__ - `2.5 and beyond <http://mail.python.org/pipermail/python-dev/2006-June/066807.html>`__ ----------------------------------------- Checkins for betas and release candidates ----------------------------------------- Anthony Baxter announced some guidelines for checkins for the beta and release candidate releases. For all beta releases: * All checkins must have an entry for Misc/NEWS, a test and docs * All checkins that add features must have approval from a release manager * All checkins must not break any of the buildbots For all release candidates: * All checkins must have approval from a release manager Approval from a release manager (Anthony or Neal) should preferably be obtained in public (e.g. the python-dev list) and should be noted in the commit message. Contributing threads: - `When to branch release25-maint? 
  <http://mail.python.org/pipermail/python-dev/2006-June/066200.html>`__
- `RFC: trunk checkins between now and 2.5 final
  <http://mail.python.org/pipermail/python-dev/2006-June/066718.html>`__

-------------------------------------------
FishEye on the Python Subversion Repository
-------------------------------------------

FishEye is once again `available for the Python repository`_.

.. _available for the Python repository: http://fisheye3.cenqua.com/browse/python

Contributing thread:

- `FishEye on Python CVS Repository
  <http://mail.python.org/pipermail/python-dev/2006-June/066188.html>`__


=========
Summaries
=========

---------------------------------
PEP 3103: A Switch/Case Statement
---------------------------------

After Thomas Lee provided a `simple patch implementing a switch
statement`_ for Python, there was a massive discussion about it and how
`PEP 275`_ should best be implemented. After much discussion, basically
three camps arose:

* School I: The switch statement should just be syntactic sugar for the
  corresponding if/elif chain.
* School II: The switch statement should dispatch on a precomputed dict
  of values.
* School III: The switch statement should correspond to an if/elif chain
  but require all expressions to be hashable (to allow for better
  optimizations).

School I was primarily concerned with the repetition of the ``x ==`` in
something like::

    if x == ...:
        ...
    elif x == ...:
        ...
    elif x == ...:
        ...
    else:
        ...

School II seemed to feel that just aiding DRY was not enough to introduce
a new construct, and that the switch statement should also be able to
avoid the function definitions in dispatching code like::

    def f(...):
        ...
    def g(...):
        ...
    def h(...):
        ...
    dispatch_dict = {x:f, y:g, z:h}
    dispatch_dict[value](*args)

In order to optimize this kind of code, School II wanted to be able to
compute the dispatch dict ahead of time, so that it wouldn't have to be
recomputed each time the switch statement was executed.
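
To make the difference between the if/elif view and the dispatch-dict view concrete, here is a minimal sketch. The names ``describe_chain``, ``describe_dispatch`` and ``_CASES`` are made up for illustration and come from neither the thread nor PEP 3103; the sketch only mimics the two semantics with existing Python constructs.

```python
def describe_chain(x):
    # School I view: the if/elif chain a switch would be sugar for.
    # Each comparison is evaluated in order, every time the function runs.
    if x == 1:
        return "one"
    elif x == 2:
        return "two"
    elif x == 3:
        return "three"
    else:
        return "many"


# School II view: the case table is built once, here at definition time,
# and is not recomputed on later calls.
_CASES = {
    1: lambda: "one",
    2: lambda: "two",
    3: lambda: "three",
}


def _default():
    return "many"


def describe_dispatch(x):
    # A single hash lookup replaces the chain of comparisons.
    return _CASES.get(x, _default)()
```

The two functions agree for hashable arguments, but not in general: ``describe_chain([1])`` falls through to the ``else`` branch, while ``describe_dispatch([1])`` raises ``TypeError`` because a list cannot be hashed. That gap is roughly what School III's hashability requirement is meant to close.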
There was a lot of discussion as to exactly when this freezing should occur, with some voting for module compilation time (allowing only constants in the cases), some voting for function definition time (allowing only constants and non-local names in the cases) and some voting for the first time the switch statement is executed (allowing only constants and both local and non-local names). Guido put together a thorough summary of the options in `PEP 3103`_. There was some discussion of introducing a ``static`` keyword which would cause an expression to be evaluated at function definition time, so that, for example, the following code would create a list of functions returning each of 0, 1, ... 9:: funcs = [lambda: (static i) for i in xrange(10)] The intention was that switch statement cases would then allow only constants or static expressions. Guido requested a separate PEP on the idea, and `Fredrik Lundh posted a proto-PEP`_, but at the time of this summary, no official PEP had been submitted. In the end, it looked like Guido was leaning towards the switch statement as syntactic sugar for a dispatching dict, with the dict frozen at function definition time (which would mean compile-time for module-level switch statements). However, the introduction of the statement seemed likely to be postponed at least until Python 3.0. .. _simple patch implementing a switch statement: http://bugs.python.org/1504199 .. _PEP 275: http://www.python.org/dev/peps/pep-0275/ .. _PEP 3103: http://www.python.org/dev/peps/pep-3103/ .. _Fredrik Lundh posted a proto-PEP: http://online.effbot.org/2006_06_01_archive.htm#pep-static Contributing threads: - `Switch statement <http://mail.python.org/pipermail/python-dev/2006-June/066086.html>`__ - `An obscene computed goto bytecode hack for "switch" :) <http://mail.python.org/pipermail/python-dev/2006-June/066115.html>`__ - `Simple Switch statement <http://mail.python.org/pipermail/python-dev/2006-June/066499.html>`__ - `Alternatives to switch? 
<http://mail.python.org/pipermail/python-dev/2006-June/066508.html>`__ - `Temporary Constantification <http://mail.python.org/pipermail/python-dev/2006-June/066531.html>`__ - `Simple Switch statementZ <http://mail.python.org/pipermail/python-dev/2006-June/066537.html>`__ - `PEP 3103: A Switch/Case Statement <http://mail.python.org/pipermail/python-dev/2006-June/066570.html>`__ - `School IIb? <http://mail.python.org/pipermail/python-dev/2006-June/066585.html>`__ - `Switch statement - handling errors <http://mail.python.org/pipermail/python-dev/2006-June/066650.html>`__ - `Split switch statement <http://mail.python.org/pipermail/python-dev/2006-June/066652.html>`__ - `once [was: Simple Switch statementZ] <http://mail.python.org/pipermail/python-dev/2006-June/066692.html>`__ ------------------------------ Restricted execution in Python ------------------------------ For his Ph.D. thesis, Brett Cannon is looking into adding facilities for restricted execution to Python, partly with the goal of getting Python into Firefox alongside Javascript. His restricted execution specifications aimed to take advantage of the C-to-Python language barrier to enforce security restrictions. Though there's no real way to get private attributes in pure Python, objects coded in C and exposed to Python can select which attributes are exposed, thus making the non-exposed attributes truly private to Python-level code. His initial draft aimed to hide as many "dangerous" objects as possible, and then cripple objects like ``file`` that would be difficult to hide. A number of people seemed to prefer a hiding-only approach, but comments from Armin Rigo seemed to suggest that plugging all the introspection holes that give access to file objects might be quite difficult. The discussion continued on into the next fortnight. 
Contributing threads: - `doc for new restricted execution design for Python <http://mail.python.org/pipermail/python-dev/2006-June/066344.html>`__ - `Is Lib/test/crashers/recursive_call.py really a crasher? <http://mail.python.org/pipermail/python-dev/2006-June/066627.html>`__ - `For sandboxing: alternative to crippling file() <http://mail.python.org/pipermail/python-dev/2006-June/066792.html>`__ ----------------------------------------------- NaN and infinities in Python float calculations ----------------------------------------------- Nick Maclaren asked about trying to get more platform-independent behavior in Python's floats, so that IEEE 754 values as in `PEP 754`_ would be produced more consistently. Currently, different OSes produce different results when these values are involved:: Python 2.4.2 (#1, May 2 2006, 08:28:01) [GCC 4.1.0 (SUSE Linux)] on linux2 Type "help", "copyright", "credits" or "license" for more information. >>> a = "NaN" >>> b = float(a) >>> c = int(b) >>> d = (b == b) >>> print a, b, c, d NaN nan 0 False Python 2.3.3 (#1, Feb 18 2004, 11:58:04) [GCC 2.8.1] on sunos5 Type "help", "copyright", "credits" or "license" for more information. >>> a = "NaN" >>> b = float(a) >>> c = int(b) >>> d = (b == b) >>> print a, b, c, d NaN NaN 0 True Nick Maclaren suggested either raising an exception for all ambiguous or invalid operations, or returning NaN or infinity as appropriate and then raising exceptions whenever an operation that would lose the error indication was performed. Nick Coghlan explained that the decimal module already does most of this:: >>> from decimal import Decimal as d >>> nan = d('NaN') >>> int(nan) Traceback (most recent call last): ... 
decimal.InvalidOperation >>> >>> from decimal import getcontext, Overflow >>> ctx = getcontext() >>> ctx.traps[Overflow] = False >>> d('1e999999999') * 10 Decimal("Infinity") Nick Maclaren seemed to suggest that he would be working on a PEP and an implementation that would bring some of the decimal module consistencies to Python's floats as well. .. _PEP 754: http://www.python.org/dev/peps/pep-0754/ Contributing threads: - `Numerical robustness, IEEE etc. <http://mail.python.org/pipermail/python-dev/2006-June/066186.html>`__ - `Numerical robustness, IEEE etc. <http://mail.python.org/pipermail/python-dev/2006-June/066192.html>`__ - `Python memory model (low level) <http://mail.python.org/pipermail/python-dev/2006-June/066834.html>`__ ----------------------------------------------- ImportWarnings for directories without __init__ ----------------------------------------------- Ralf W. Grosse-Kunstleve complained that with Python 2.5 he started getting tons of "ImportWarning: Not importing directory" messages. James Y Knight pointed out that running Python in your home directory is quite likely to issue such warnings if you have *any* directories in your home directory that have the same name as a python module (e.g. ``readline``). A number of options for silencing the errors were discussed, including invoking Python like ``python -W'ignore:Not importing directory'`` and including ``warnings.filterwarnings('ignore', 'Not importing directory', ImportWarning)`` in site.py or .pythonrc.py. Two patches were provided that introduce the warning only if the import fails, `one by Shane Hathaway`_ and `one by Sergey A. Lipnevich`_. No final decision had been made at the time of this summary. .. _one by Shane Hathaway: http://bugs.python.org/1515361 .. _one by Sergey A. 
Lipnevich: http://bugs.python.org/1515609 Contributing threads: - `Dropping __init__.py requirement for subpackages <http://mail.python.org/pipermail/python-dev/2006-June/066280.html>`__ - `ImportWarning flood <http://mail.python.org/pipermail/python-dev/2006-June/066345.html>`__ ------------------ Updating turtle.py ------------------ Gregor Lingl proposed replacing turtle.py in the Python standard library with his `new xturtle.py module`_. The xturtle module is backwards compatible with the turtle module and adds a number of enhancements. However, Gregor's request came after Python 2.5's feature freeze, so he was told to propose it again in Python 2.6. There was some discussion about this -- as the stdlib turtle module is poorly tested, some contended that introducing the new APIs of xturtle would not make things any worse. A couple of compromises were offered: mentioning xturtle in the turtle module docs, and putting xturtle in the Tools directory. .. _new xturtle.py module: http://ada.rg16.asn-wien.ac.at/~python/xturtle/ Contributing threads: - `xturtle.py a replacement for turtle.py(!?) <http://mail.python.org/pipermail/python-dev/2006-June/066676.html>`__ - `xturtle.py - a replacement for turtle.py <http://mail.python.org/pipermail/python-dev/2006-June/066677.html>`__ - `xturtle.py a replacement for turtle.py(!?) ATTENTION PLEASE! <http://mail.python.org/pipermail/python-dev/2006-June/066734.html>`__ - `xturtle.py <http://mail.python.org/pipermail/python-dev/2006-June/066742.html>`__ ---------------------------------------------------------- Relative imports and PEP 338: Executing Modules as Scripts ---------------------------------------------------------- Relative imports, as described in `PEP 328`_, introduced problems for `PEP 338`_ which allows modules within packages and zipfiles to be run with the -m command-line switch. The -m switch sets the __name__ of the module to '__main__' so that ``if __name__ == '__main__'`` blocks will get executed. 
However, relative imports use __name__ to determine the parent package, so
if a module that has a relative import is executed using the -m switch,
the relative import will fail. Nick Coghlan suggested adding a
__module_name__ attribute that would not be clobbered by the -m switch,
but people generally seemed to think that it would be simpler to just
require absolute imports in main modules.

.. _PEP 328: http://www.python.org/dev/peps/pep-0328/
.. _PEP 338: http://www.python.org/dev/peps/pep-0338/

Contributing threads:

- `PEP 338 vs PEP 328 - a limitation of the -m switch
  <http://mail.python.org/pipermail/python-dev/2006-June/066161.html>`__
- `PEP 328 and PEP 338, redux
  <http://mail.python.org/pipermail/python-dev/2006-June/066609.html>`__
- `[Python-checkins] r47142 - in python/trunk: Doc/lib/librunpy.tex Lib/runpy.py Lib/test/test_runpy.py
  <http://mail.python.org/pipermail/python-dev/2006-June/066690.html>`__

--------------------------------------------
Importing modules within unicode directories
--------------------------------------------

Kristján V. Jónsson pointed out that currently, Python on Windows cannot
import modules from directories with unicode names, even if the module
names themselves are plain ASCII. Nick Coghlan suggested that this was
likely because import.c was doing something like
``u'c:/tmp/\u814c'.encode('mbcs')``, getting back ``'c:/tmp/?'`` and being
unable to do anything useful with that. Martin v. Löwis suggested using
the 8.3 simplified filename used by DOS, at least until the import
machinery gets reworked to better handle encodings, hopefully for Python
2.6. `Thomas Heller had provided a patch`_ for reworking import.c in this
manner a while back, but it was large enough that no one had reviewed it.

..
_Thomas Heller had provided a patch: http://bugs.python.org/1093253 Contributing thread: - `unicode imports <http://mail.python.org/pipermail/python-dev/2006-June/066103.html>`__ ---------------------------------------- MS VC++ 2003 toolkit no longer available ---------------------------------------- Bill Janssen pointed out that Python 2.4 on Windows expects to be compiled with the MS Visual C++ compiler version 7.1, and that the corresponding MS VC++ 2003 toolkit is no longer available. Fredrik Lundh explained that the compiler is still available in the .net SDK as well as being available to MSDN subscribers. There was again some discussion about moving to the VS 2005 toolkit for compiling Python. It would have made compiling for 64bit architectures somewhat easier, but would have meant that extension writers would have to install three different compilers just to compile extensions for Python 2.3, 2.4 and 2.5, and would also have given problems for MinGW users as MinGW does not yet easily support linking to the msvcr80 runtime library. Contributing threads: - `Python 2.4 extensions require VC 7.1? <http://mail.python.org/pipermail/python-dev/2006-June/066110.html>`__ - `Documentation enhancement: "MS free compiler"? <http://mail.python.org/pipermail/python-dev/2006-June/066182.html>`__ - `Documentation enhancement: "MS free compiler"? <http://mail.python.org/pipermail/python-dev/2006-June/066257.html>`__ --------------------------------- Keeping interned strings in a set --------------------------------- Alexander Belopolsky tried out the new set C API by `replacing the dict of interned strings with a set`_ instead. He had to make two changes to get this to work: there's currently no way to retrieve a single object from a set, and Py_Finalize() needed to be changed to finalize sets after strings (instead of the other way around as it used to be). 
There was some discussion about trying to get rid of PySet_Fini() so the latter problem wouldn't be an issue at all, but with all the other Py*Fini() functions already existing, it didn't seem worth it. The patch had no slowdown and reduced the memory consumption of the interning structure slightly. .. _replacing the dict of interned strings with a set: http://bugs.python.org/1507011 Contributing threads: - `Keeping interned strings in a set <http://mail.python.org/pipermail/python-dev/2006-June/066084.html>`__ - `Keeping interned strings in a set <http://mail.python.org/pipermail/python-dev/2006-June/066088.html>`__ - `setobject code <http://mail.python.org/pipermail/python-dev/2006-June/066116.html>`__ - `Proposal to eliminate PySet_Fini <http://mail.python.org/pipermail/python-dev/2006-June/066645.html>`__ ------------------------- Allowing empty subscripts ------------------------- Guido finally vetoed the proposal to allow ``x[()]`` to be written as ``x[]``. The use-cases were weak, and in most cases the functionality seemed better expressed as attribute access. Contributing threads: - `Pre-PEP: Allow Empty Subscript List Without Parentheses <http://mail.python.org/pipermail/python-dev/2006-June/066099.html>`__ - `Empty Subscript PEP on Wiki - keep or toss? <http://mail.python.org/pipermail/python-dev/2006-June/066848.html>`__ ------------------------------------- Creating range objects at the C level ------------------------------------- Ralf W. Grosse-Kunstleve asked about the removal of the C function ``PyRange_New()`` which had been deprecated in Python 2.4. The right way to create ranges is to call PyRange_Type with the appropriate parameters, e.g. something like ``PyObject_CallFunction((PyObject*) &PyRange_Type, "lll", start, stop, step)``. Ralf was nervous about this alternative because it also appeared to be undocumented, and requested that something like the above be at least put into the What's New document. 
Contributing threads: - `PyRange_New() alternative? <http://mail.python.org/pipermail/python-dev/2006-June/066343.html>`__ - `PyObject* aliasing (Was: PyRange_New() alternative?) <http://mail.python.org/pipermail/python-dev/2006-June/066477.html>`__ ---------------------------------- type(), __class__ and isinstance() ---------------------------------- Martin Maly pointed out that you can't fool isinstance() into thinking your object is not a subclass of its true base class:: >>> class C(object): ... pass ... >>> class D(object): ... __class__ = property(lambda self: C) ... >>> isinstance(D(), D) True >>> isinstance(D(), C) True Phillip J. Eby explained that isinstance() checks both the type() of the object and the __class__ attribute. In essence, you can lie about your __class__ to make isinstance() return True, but you can't lie to make it return False. Guido suggested that these issues, as well as lying about an object's __bases__, should be revisited for Python 3000. Contributing thread: - `Semantic of isinstance <http://mail.python.org/pipermail/python-dev/2006-June/066591.html>`__ ---------------------------------------------------------------- Requiring backward compatibility in the standard library modules ---------------------------------------------------------------- Ka-Ping Yee's uuid module, newly added for Python 2.5, contained a comment "This module works with Python 2.3 or higher". George Yoshida asked if that comment should be interpreted as requiring Python 2.3 compatibility. People generally felt like the list of backwards compatible modules in `PEP 291`_ should be as small as possible so as to keep maintenance as simple as possible. Ka-Ping removed the comment, and submitted the module to PyPI for Python 2.3 and 2.4 users. .. 
_PEP 291: http://www.python.org/dev/peps/pep-0291/

Contributing thread:

- `uuid backward compatibility
  <http://mail.python.org/pipermail/python-dev/2006-June/066153.html>`__

---------------------
Figleaf code coverage
---------------------

`Titus Brown offered some reports`_ from his `figleaf code coverage`_
utility. People seemed particularly interested in trying to get coverage
across multiple platforms, perhaps using a BuildBot extension, and Titus
said he'd try to look into it. Walter Dörwald also pointed to `his own
code coverage module`_.

.. _Titus Brown offered some reports: http://vallista.idyll.org/~t/temp/python2.4-svn/
.. _figleaf code coverage: http://darcs.idyll.org/~t/projects/figleaf-latest.tar.gz
.. _his own code coverage module: http://styx.livinglogic.de/~walter/python/coverage/PythonCodeCoverage.py

Contributing threads:

- `Code coverage reporting.
  <http://mail.python.org/pipermail/python-dev/2006-June/066184.html>`__
- `Code coverage reporting.
  <http://mail.python.org/pipermail/python-dev/2006-June/066193.html>`__

------------------------
Improving error messages
------------------------

Georg Brandl proposed going through abstract.c and modifying error
messages like "object does not support item assignment" to also include
the type of the object. He got little feedback, mainly because everyone
seemed to think it was such an obviously good idea that there was no need
for any. Python 2.5 now incorporates `Georg's better error messages`_.

.. _Georg's better error messages: http://bugs.python.org/1507676

Contributing threads:

- `Improve error msgs?
  <http://mail.python.org/pipermail/python-dev/2006-June/066048.html>`__
- `Improve error msgs?
  <http://mail.python.org/pipermail/python-dev/2006-June/066128.html>`__

-----------------------------------------
Allowing assignments in global statements
-----------------------------------------

Talin proposed allowing a global statement to be combined with an
assignment statement, e.g.::

    global badger = 42

Guido suggested that such a desire was a sure indicator of overuse of
``global``.

Contributing thread:

- `Allow assignments in 'global' statements?
  <http://mail.python.org/pipermail/python-dev/2006-June/066347.html>`__

-----------------------------------------
Splitting Python tests from CPython tests
-----------------------------------------

Frank Wierzbicki volunteered some time to split the CPython-specific
tests out from the Python-the-language tests. Armin Rigo pointed him to
PyPy's `tests modified to be more implementation independent`_.

.. _tests modified to be more implementation independent: http://codespeak.net/svn/pypy/dist/lib-python/modified-2.4.1/test

Contributing thread:

- `Cleanup of test harness for Python
  <http://mail.python.org/pipermail/python-dev/2006-June/066817.html>`__

-----------------------------------------
A multi-dimensional array type for Python
-----------------------------------------

For `Google's Summer of Code`_, Karol Langner will be working on
`implementing a basic multi-dimensional array type`_ for Python core,
based on the numpy_ array struct. He asked for any comments or
suggestions that people had for the project.

.. _Google's Summer of Code: http://code.google.com/summerofcode.html
.. _numpy: http://www.numpy.org/

..
_implementing a basic multi-dimensional array type: http://scipy.org/BaseArray Contributing thread: - `basearray <http://mail.python.org/pipermail/python-dev/2006-June/066516.html>`__ ================== Previous Summaries ================== - `Source control tools <http://mail.python.org/pipermail/python-dev/2006-June/066187.html>`__ - `Dropping externally maintained packages (Was: Please stop changing wsgiref on the trunk) <http://mail.python.org/pipermail/python-dev/2006-June/066195.html>`__ =============== Skipped Threads =============== - `Last-minute curses patch <http://mail.python.org/pipermail/python-dev/2006-June/066095.html>`__ - `Bug in stringobject? <http://mail.python.org/pipermail/python-dev/2006-June/066100.html>`__ - `Fwd: subprocess.Popen(.... stdout=IGNORE, ...) <http://mail.python.org/pipermail/python-dev/2006-June/066111.html>`__ - `About dynamic module loading <http://mail.python.org/pipermail/python-dev/2006-June/066190.html>`__ - `PyString_FromFormat <http://mail.python.org/pipermail/python-dev/2006-June/066213.html>`__ - `Misleading error message from PyObject_GenericSetAttr <http://mail.python.org/pipermail/python-dev/2006-June/066227.html>`__ - `Bug: xml.dom.pulldom never gives you END_DOCUMENT events with an Expat parser <http://mail.python.org/pipermail/python-dev/2006-June/066228.html>`__ - `os.getmtime now returns a float? 
<http://mail.python.org/pipermail/python-dev/2006-June/066252.html>`__ - `XP build failing <http://mail.python.org/pipermail/python-dev/2006-June/066258.html>`__ - `ETree: xml vs xmlcore <http://mail.python.org/pipermail/python-dev/2006-June/066268.html>`__ - `test_ctypes failure on Mac OS X/PowerPC 10.3.9 (Panther) <http://mail.python.org/pipermail/python-dev/2006-June/066282.html>`__ - `Small sqlite3 test suite fix (Python 2.5b1 candidate) <http://mail.python.org/pipermail/python-dev/2006-June/066291.html>`__ - `Weekly Python Patch/Bug Summary <http://mail.python.org/pipermail/python-dev/2006-June/066350.html>`__ - `Things to remember when adding *packages* to stdlib <http://mail.python.org/pipermail/python-dev/2006-June/066353.html>`__ - `Moving the ctypes repository to python.org <http://mail.python.org/pipermail/python-dev/2006-June/066417.html>`__ - `PyObject_CallFunction and 'N' format char <http://mail.python.org/pipermail/python-dev/2006-June/066501.html>`__ - `pypy-0.9.0: stackless, new extension compiler <http://mail.python.org/pipermail/python-dev/2006-June/066512.html>`__ - `[Python-checkins] Things to remember when adding *packages* to stdlib <http://mail.python.org/pipermail/python-dev/2006-June/066515.html>`__ - `Import semantics <http://mail.python.org/pipermail/python-dev/2006-June/066523.html>`__ - `2.5b1 Windows install <http://mail.python.org/pipermail/python-dev/2006-June/066542.html>`__ - `Python-Dev Digest, Vol 35, Issue 143 <http://mail.python.org/pipermail/python-dev/2006-June/066577.html>`__ - `Problems building Python on OSX 10.4.6? <http://mail.python.org/pipermail/python-dev/2006-June/066579.html>`__ - `enhancements for uuid module <http://mail.python.org/pipermail/python-dev/2006-June/066583.html>`__ - `Do we need a bug triage day? <http://mail.python.org/pipermail/python-dev/2006-June/066647.html>`__ - `Oh-why that?? 
Please ignore one of the two <http://mail.python.org/pipermail/python-dev/2006-June/066678.html>`__ - `msvccompiler.py: some remarks <http://mail.python.org/pipermail/python-dev/2006-June/066768.html>`__ - `Joke: Rush Limbaugh (a joke in and of himself) <http://mail.python.org/pipermail/python-dev/2006-June/066780.html>`__ - `PyGIL_ and --without-threads <http://mail.python.org/pipermail/python-dev/2006-June/066784.html>`__ - `document @property? <http://mail.python.org/pipermail/python-dev/2006-June/066787.html>`__ - `Pickle implementation questions <http://mail.python.org/pipermail/python-dev/2006-June/066803.html>`__ - `sys.settrace() in Python 2.3 vs. 2.4 <http://mail.python.org/pipermail/python-dev/2006-June/066820.html>`__ - `how long to wait for expat to incorporate a fix to prevent a crasher? <http://mail.python.org/pipermail/python-dev/2006-June/066829.html>`__ - `LOAD_CONST POP_TOP <http://mail.python.org/pipermail/python-dev/2006-June/066832.html>`__ From tim.peters at gmail.com Thu Jul 27 02:02:37 2006 From: tim.peters at gmail.com (Tim Peters) Date: Wed, 26 Jul 2006 20:02:37 -0400 Subject: [Python-Dev] [Windows, buildbot] kill_python.c mystery Message-ID: <1f7befae0607261702o6d943f52x701be0d402345f65@mail.gmail.com> Rarely I'll be running the Python tests in my sandbox from a DOS box, and the test run will just end. Like so: C:\Code\python\PCbuild>python -E -tt ../lib/test/regrtest.py -uall -rw test_softspace test_codecmaps_kr ... test_float test_userdict C:\Code\python\PCbuild> No indication of success or failure -- the process just vanishes mid-stream. 
Today I noticed this happened when the buildbot started to run tests, and I'm 100% sure it's due to this code in Tools/buildbot/kill_python.c (the buildbot log files showed that kill_python.c killed /some/ Python process, and the Python running release-build tests in my sandbox was the only plausible candidate): if ((strstr(path, "build\\pcbuild\\python_d.exe") != NULL) || (strstr(path, "build\\python.exe") != NULL)) { printf("Terminating %s (pid %d)\n", path, pids[i]); if (!TerminateProcess(hProcess, 1)) { The second clause in the first `if` looks for a substring match on: build\python.exe and that just happens to match a suffix of: C:\Code\python\PCbuild\python.exe which is the release-build Python I happen to be running in my sandbox. Why is the second clause there? That is, are we /trying/ to kill a release-build Python running from the user's sandbox, and if so why? Introducing the second clause was the sole change in rev 46817, and the checkin comment doesn't really explain it: Port cygwin kill_python changes from 2.4 branch. Since I don't know what it's trying to accomplish, I hesitate to change it. It's quite clear what the first clause is trying to accomplish, and that one hasn't caused any problems. From mattjfleming at googlemail.com Thu Jul 27 02:20:19 2006 From: mattjfleming at googlemail.com (Matt Fleming) Date: Thu, 27 Jul 2006 00:20:19 +0000 Subject: [Python-Dev] Improving unit tests for the standard library Message-ID: <5ff4a1e50607261720o2a98572ft4cab275ed91ad344@mail.gmail.com> Hi, after speaking with Neal off-list about writing tests for the pkgutil module, we agreed it would be a good idea to start a page on http://wiki.python.org/moin/ stating any tests for the standard library that either, a) need to be written b) can be improved I've started the page http://wiki.python.org/moin/ImprovingLibTests that lists all the test files I could think of that need to be written. 
I've also included a table for improvements to existing tests, along with a column that allows you to specify exactly what needs improving. I hope this will be of use to people, and I hope people will find time to modify the page appropriately. When I get some spare time from my SoC project, I'll be working my way through the list. Thanks, Matt -- http://mattssanctuary.blogspot.com From ocean at m2.ccsnet.ne.jp Thu Jul 27 03:15:23 2006 From: ocean at m2.ccsnet.ne.jp (H.Yamamoto) Date: Thu, 27 Jul 2006 10:15:23 +0900 Subject: [Python-Dev] patch for mbcs codec (again) Message-ID: <002601c6b11a$22e3fa70$0400a8c0@whiterabc2znlh> Hello. I noticed that the mbcs codec still has a problem when calling StreamReader. Can anyone commit the patch "fix.patch version2" on http://python.org/sf/1455898 ? # Very sorry about this... I thought I checked this, but I definitely looked at something # wrong. From kbk at shore.net Thu Jul 27 04:18:12 2006 From: kbk at shore.net (Kurt B. Kaiser) Date: Wed, 26 Jul 2006 22:18:12 -0400 (EDT) Subject: [Python-Dev] Weekly Python Patch/Bug Summary Message-ID: <200607270218.k6R2ICQ9021704@bayview.thirdcreek.com> Patch / Bug Summary ___________________ Patches : 401 open ( +3) / 3342 closed ( +8) / 3743 total (+11) Bugs : 896 open ( -8) / 6035 closed (+24) / 6931 total (+16) RFE : 224 open ( +2) / 233 closed ( +2) / 457 total ( +4) New / Reopened Patches ______________________ pkgutil.walk_packages ignores onerror arg (2006-07-20) CLOSED http://python.org/sf/1525766 opened by James Y Knight Tkdnd mouse cursor handling patch (2006-07-20) http://python.org/sf/1525806 opened by klappnase calltip awkwardly shortened (2006-07-20) CLOSED http://python.org/sf/1525817 opened by Lingl str.__iter__ and unicode.__iter__ (2006-07-21) http://python.org/sf/1526367 opened by Walter Dörwald Fix socketmodule compile on NetBSD (2006-07-21) http://python.org/sf/1526460 opened by Matt Fleming New module: miniconf (2006-07-24) http://python.org/sf/1527597 opened by S.Fourmanoit 
winsound - probably win9x port is not working (2006-07-24) CLOSED http://python.org/sf/1527744 opened by Hirokazu Yamamoto Expose case-insensitivity of string.Template (2006-07-25) http://python.org/sf/1528167 opened by Chad Whitacre PyShell.recall - fix indentation logic (2006-07-25) http://python.org/sf/1528468 opened by Tal Einat patch for mbcs codecs (2006-03-22) http://python.org/sf/1455898 reopened by ocean-city Patches Closed ______________ pkgutil.walk_packages ignores onerror arg (2006-07-20) http://python.org/sf/1525766 closed by gbrandl calltip awkwardly shortened (2006-07-20) http://python.org/sf/1525817 closed by loewis ParenMatch: workaround for misinterpreting of closing parens (2006-01-16) http://python.org/sf/1407280 closed by kbk Syntax-related improvements to IDLE (2004-02-28) http://python.org/sf/906702 closed by kbk (partial?) fix for Misc/python-config.in (2006-07-16) http://python.org/sf/1523356 closed by gbrandl ConnectRegistry blocks all threads (2006-03-12) http://python.org/sf/1448199 closed by loewis MS Windows - module search path fix (2005-07-04) http://python.org/sf/1232023 closed by loewis winsound - probably win9x port is not working (2006-07-24) http://python.org/sf/1527744 closed by gbrandl 1515163 fix - traceback and str exc (2006-06-30) http://python.org/sf/1515343 closed by gbrandl New / Reopened Bugs ___________________ inspect.py: still infinite recursion inspecting frames (2006-07-03) CLOSED http://python.org/sf/1516184 reopened by nnorwitz Bug in shutil.copytree on Windows (2006-07-20) http://python.org/sf/1525866 opened by Mike Foord email package quoted printable behaviour changed (2006-07-20) http://python.org/sf/1525919 opened by Thomas Arendsen Hein Win32: subprocess.Popen() w/o "pipe" throws an exception (2006-07-21) CLOSED http://python.org/sf/1526203 opened by Larry Hastings Concatenation on a long string breaks (2006-07-21) http://python.org/sf/1526585 opened by Jp Calderone current directory added to sys.path on 
win32 (2006-07-22) CLOSED http://python.org/sf/1526785 opened by John Ehresman unbalanced parentheses from command line crash pdb (2006-07-21) http://python.org/sf/1526834 opened by Ilya Sandler PythonLauncher uses incorrect working directory (2006-07-23) http://python.org/sf/1527397 opened by Bob Ippolito optparse should support arbitrary number of arguments (2006-07-24) http://python.org/sf/1527705 reopened by riteshsarraf optparse should support arbitrary number of arguments (2006-07-24) http://python.org/sf/1527705 opened by Ritesh Raj Sarraf tarfile chokes on ipython archive on Windows (2006-07-24) http://python.org/sf/1527974 opened by Arve Knudsen difflib.SequenceMatcher.find_longest_match() wrong result (2006-07-25) http://python.org/sf/1528074 opened by John Machin urllib2 data argument (2006-07-25) http://python.org/sf/1528258 opened by paul rubin forward in turtle module may cause incorrect display (2006-07-25) http://python.org/sf/1528363 opened by NatureMage IDLE: printing issue on OSX (2006-07-25) http://python.org/sf/1528593 opened by Ronald Oussoren Python 2.5b2 fails to build on Solaris 10 (2006-07-25) http://python.org/sf/1528620 opened by Guido Ostkamp Python 2.5b2 fails to build (GCC) on Solaris 10 (2006-07-26) http://python.org/sf/1529269 opened by Guido Ostkamp Bugs Closed ___________ inspect.py: still infinite recursion inspecting frames (2006-07-03) http://python.org/sf/1516184 closed by pje logging using the SysLog handler fails if locale is set (2006-07-17) http://python.org/sf/1524081 closed by vsajip Win32: subprocess.Popen() w/o "pipe" throws an exception (2006-07-21) http://python.org/sf/1526203 closed by gbrandl Malloc, memory error, failmalloc, low memory. 
(2006-07-19) http://python.org/sf/1525589 closed by nnorwitz current directory added to sys.path on win32 (2006-07-22) http://python.org/sf/1526785 closed by loewis IDLE (macosx): Class and Path browsers show Tk menu (2006-07-06) http://python.org/sf/1517996 closed by ronaldoussoren Column Number is not visible in MacOSX (2003-07-09) http://python.org/sf/768481 closed by kbk Tooltip-window doesn't vanish if... (2003-07-22) http://python.org/sf/775535 closed by kbk failure of test_ossaudiodev; elapsed time .1 sec faster (2006-06-05) http://python.org/sf/1501330 closed by gward Argument missing from calltip for new-style class init (2004-09-13) http://python.org/sf/1027566 closed by kbk length of unicode string changes print behaviour (2006-02-22) http://python.org/sf/1436532 closed by loewis exec and eval allocate lots of memory and do not free it (2006-07-20) http://python.org/sf/1525678 closed by loewis os.listdir doesn't check error code from FindNextFile (2006-07-18) http://python.org/sf/1524310 closed by loewis os.path.abspath() / os.chdir() buggy with unicode paths (2005-09-07) http://python.org/sf/1283895 closed by loewis traceback now masks some string exceptions (2006-06-30) http://python.org/sf/1515163 closed by gbrandl sys.ps1 not protected in EditorWindow.py (2004-08-16) http://python.org/sf/1010370 closed by kbk EditorWindow demo causes attr-error (2006-01-28) http://python.org/sf/1417598 closed by kbk Building 2.5a1 requires python (2006-04-06) http://python.org/sf/1465408 closed by anthonybaxter New / Reopened RFE __________________ New sequences for Unicode classes needed (2006-07-25) http://python.org/sf/1528154 opened by gmarketer RFE Closed __________ Add Windows 9x/ME (lack of) support information to README.TX (2006-06-22) http://python.org/sf/1510853 closed by loewis From collinw at gmail.com Thu Jul 27 04:24:37 2006 From: collinw at gmail.com (Collin Winter) Date: Wed, 26 Jul 2006 22:24:37 -0400 Subject: [Python-Dev] Which version of distutils 
to ship with Python 2.5? Message-ID: <43aa6ff70607261924w65e01325o854f87d4c109026a@mail.gmail.com> Is it intentional that Python 2.5 is (currently) shipping with distutils 2.4.0, while Python 2.4 (at least 2.4.1, 2.4.2 and 2.4.3) shipped with distutils 2.4.1? Judging from my own tests, distutils 2.4.1 fixed several bugs that some of my test suites depend on (the fixes, not the bugs ; ). Thanks, Collin Winter From david.nospam.hopwood at blueyonder.co.uk Thu Jul 27 04:19:40 2006 From: david.nospam.hopwood at blueyonder.co.uk (David Hopwood) Date: Thu, 27 Jul 2006 03:19:40 +0100 Subject: [Python-Dev] Internal namespace proposal In-Reply-To: <bbaeab100607191650j4584de2cge554274e100f125e@mail.gmail.com> References: <Pine.LNX.4.58.0607191811400.31087@server1.LFW.org> <bbaeab100607191650j4584de2cge554274e100f125e@mail.gmail.com> Message-ID: <44C822BC.1070204@blueyonder.co.uk> [This message is cc:d to the e-lang list, but please take any replies to python-dev at python.org.] Brett Cannon wrote: > On 7/19/06, Ka-Ping Yee <cap-talk at zesty.ca> wrote: > >> OMG!!! Is all i can say at the moment. Very excited. This is very encouraging. Thanks to ?!ng, Michael Chermside and others for making the case for capabilities. > Also realize that I am using object-capabilities to secure the interpreter, > not objects. That will be enough of a challenge to do for now. Who knows, > maybe some day Python can support object-capabilities at the object level, > but for now I am just trying to isolate and protect individual interpreters > in the same process. I think that the alternative of providing object-granularity protection domains straight away is more practical than you suggest, and I'd like to at least make sure that this possibility has been thoroughly explored. Below is a first-cut proposal for enforcing namespace restrictions, i.e. support for non-public attributes and methods, on Python objects and modules. 
It is not sufficient by itself to provide capability security, but it could be the basis for doing that at object granularity. (Note that this proposal would only affect sandboxed/restricted interpreters, at least for the time being. The encapsulation it provides is also useful for reasons other than security, and I think there is nothing about it that would be unreasonable to apply to an unrestricted interpreter, but for compatibility, that would have to be enabled by a __future__ option or similar.) Internal namespace proposal =========================== Existing Python code tends to use a convention where the names of attributes and methods intended only for internal use are prefixed by '_'. This convention comes from PEP 8 <http://www.python.org/dev/peps/pep-0008/>, which says: # In addition, the following special forms using leading or trailing # underscores are recognized (these can generally be combined with any case # convention): # # - _single_leading_underscore: weak "internal use" indicator. E.g. "from M # import *" does not import objects whose name starts with an underscore. # # - single_trailing_underscore_: used by convention to avoid conflicts with # Python keyword, e.g. # # Tkinter.Toplevel(master, class_='ClassName') # # - __double_leading_underscore: when naming a class attribute, invokes name # mangling (inside class FooBar, __boo becomes _FooBar__boo; see below). # # - __double_leading_and_trailing_underscore__: "magic" objects or # attributes that live in user-controlled namespaces. E.g. __init__, # __import__ or __file__. Never invent such names; only use them # as documented. I propose that the "internal" status of names beginning with _ (including those beginning with __) should be enforced in restricted interpreters. This is better than introducing a new annotation, because it will do the right thing for existing code that follows this part of PEP 8. 
More precisely: A restricted interpreter refuses access to any object attribute or method with a name beginning with '_' (by throwing a new exception type 'InternalAccessException'), unless the access is from a method and its static target is that method's first argument variable. Also, a restricted interpreter refuses access to any module-global variable or module-global function with a name beginning with '_' (by throwing 'InternalAccessException'), unless the access is statically from the same module. (A method's first argument is usually called 'self', but that's just a convention. By "static target", I mean that to access an internal attribute _foo in a method with first argument 'self', you must write "self._foo"; attempting to access "x._foo" will fail even if 'x' happens to be the same object as 'self'. This allows such accesses to be reported at compile-time, rather than only at run-time.) I am using the term "internal" rather than "private" or "protected", because these semantics are not the same as either "private" or "protected" in C++ or Java. In Python with this change, an object can only access its own internal methods and attributes. In C++ and Java, an object can access private and protected members of other objects of the same class. The rationale for this difference is explained below. The use of _single vs __double underscores encodes a useful distinction that would not change. Ignoring the point in the previous paragraph, a _single underscore is similar to "protected" in languages like C++ and Java, while a __double underscore is similar to "private". This is purely a consequence of the name mangling: if a class X and its subclass Y both name an attribute __foo, then we will end up with two attributes _X__foo and _Y__foo in instances of Y, which is the desired behaviour for private attributes. In the case of an attribute called _foo, OTOH, there can be only one such attribute per object, which is the desired behaviour for protected attributes. 
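The name-mangling behaviour described above can be observed in ordinary, unrestricted Python. The following is only a small sketch: the class names X and Y follow the example in the text, while the attribute values and the helper methods x_foo and y_foo are made up for illustration.

```python
class X:
    def __init__(self):
        self.__foo = "private to X"   # stored on the instance as _X__foo
        self._bar = "set by X"        # stored on the instance as _bar

    def x_foo(self):
        return self.__foo             # compiled as self._X__foo


class Y(X):
    def __init__(self):
        X.__init__(self)
        self.__foo = "private to Y"   # stored as _Y__foo, no clash with X
        self._bar = "set by Y"        # overwrites the single _bar attribute

    def y_foo(self):
        return self.__foo             # compiled as self._Y__foo


y = Y()
# Collect the mangled names that ended up in the instance dictionary.
mangled = sorted(name for name in vars(y) if name.endswith("__foo"))
```

Here the instance carries two separate double-underscore attributes, one per class, but only one _bar, which matches the "private" versus "protected" reading given above.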
The name mangling also ensures that an object will not *accidentally* access a private attribute inherited from a superclass. However, in the same example, an instance of Y can still deliberately access the copy of the attribute inherited from X by specifying _X__foo. There is no security problem here, because Y cannot do anything as a result that it could not have done by copying X's code, rather than inheriting from it. Notice that this is only true because we restrict an object to only accessing its own internal attributes and methods; if we followed C++'s semantics where an object can access protected members of any superclass, this would break security. (Java solves this problem by applying a more complicated access rule for protected members, which I considered to be unintuitive. More details on request.) __dict__ is an internal attribute. This means that an object can only directly reflect on itself. I know that there are other means of reflection (e.g. using the 'inspect' module); blocking these or making them safe is a separate issue. If desired, it would be safe to add a 'publicdict' attribute to each object, or a 'publicdict(object)' built-in. This would return a *read-only* dict, probably created lazily if needed, giving access only to public (non-internal) attributes and methods. __init__ is an internal method. This is as it should be, because it should not be possible to call __init__ on an existing object; only to have __init__ implicitly called when a new object is constructed. __repr__ and __str__ are internal under these rules, and probably shouldn't be. Existing classes may expose private state in the strings returned by __repr__ or __str__, but in principle, there is nothing unsafe about being able to convert the public state of an object to a string. OTOH, this functionality is usually accessed via the built-ins 'repr' and 'str', which we could perhaps allow to access '__repr__' and '__str__' as a special case. 
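The 'publicdict(object)' built-in suggested above does not exist in Python. As a rough sketch of the idea only, the function below is written for current Python and uses types.MappingProxyType for the read-only view. The function name comes from the proposal; the Account class and everything else is an assumption, and the sketch covers instance attributes only (not methods) and returns a snapshot rather than a live view.

```python
from types import MappingProxyType


def publicdict(obj):
    """Return a read-only mapping of obj's instance attributes
    whose names do not start with an underscore."""
    public = {name: value for name, value in vars(obj).items()
              if not name.startswith("_")}
    return MappingProxyType(public)


class Account:
    # Hypothetical class, used only to exercise publicdict().
    def __init__(self):
        self.owner = "alice"
        self._balance = 100
        self.__pin = "0000"   # mangled to _Account__pin


view = publicdict(Account())

# The proxy rejects item assignment, so callers cannot modify it.
try:
    view["owner"] = "mallory"
    writable = True
except TypeError:
    writable = False
```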
-- David Hopwood <david.nospam.hopwood at blueyonder.co.uk> From nnorwitz at gmail.com Thu Jul 27 05:43:38 2006 From: nnorwitz at gmail.com (Neal Norwitz) Date: Wed, 26 Jul 2006 20:43:38 -0700 Subject: [Python-Dev] [Windows, buildbot] kill_python.c mystery In-Reply-To: <1f7befae0607261702o6d943f52x701be0d402345f65@mail.gmail.com> References: <1f7befae0607261702o6d943f52x701be0d402345f65@mail.gmail.com> Message-ID: <ee2a432c0607262043i323ae8d5k8fdc81ca0b7e6b39@mail.gmail.com> On 7/26/06, Tim Peters <tim.peters at gmail.com> wrote: > > Today I noticed this happened when the buildbot started to run tests, > and I'm 100% sure it's due to this code in > Tools/buildbot/kill_python.c (the buildbot log files showed that > kill_python.c killed /some/ Python process, and the Python running > release-build tests in my sandbox was the only plausible candidate): > > if ((strstr(path, "build\\pcbuild\\python_d.exe") != NULL) || > (strstr(path, "build\\python.exe") != NULL)) { > printf("Terminating %s (pid %d)\n", path, pids[i]); > if (!TerminateProcess(hProcess, 1)) { > > The second clause in the first `if` looks for a substring match on: > > build\python.exe > > and that just happens to match a suffix of: > > C:\Code\python\PCbuild\python.exe > > which is the release-build Python I happen to be running in my sandbox. > > Why is the second clause there? That is, are we /trying/ to kill a > release-build Python running from the user's sandbox, and if so why? No, I don't believe that was the intent. The exe on cygwin uses the unix convention, not the Windows convention for the filename. ie, either a debug or release build on cygwin are both called python.exe. So the second clause is there to kill the process when it's running under cygwin. It's interesting that the process appears to be running as ./python.exe, but build shows up in filename. From that I deduce that it must contain the complete path. 
Assuming that is true, we can change the code to ensure that build is a directory since that's what buildbot does (add the leading \\): > (strstr(path, "\\build\\python.exe") != NULL)) { I tested this change with a different path, so I believe it will work fine and not catch PCbuild. I'll check in this change and add some comments. n From nnorwitz at gmail.com Thu Jul 27 06:08:40 2006 From: nnorwitz at gmail.com (Neal Norwitz) Date: Wed, 26 Jul 2006 21:08:40 -0700 Subject: [Python-Dev] Release manager pronouncement needed: PEP 302 Fix In-Reply-To: <5.1.1.6.0.20060726142456.036d9008@sparrow.telecommunity.com> References: <5.1.1.6.0.20060726142456.036d9008@sparrow.telecommunity.com> Message-ID: <ee2a432c0607262108n2599f9aere07bae83535179b1@mail.gmail.com> What is the behaviour that was added which broke compliance? What is the benefit of the behaviour? From your description of fixing the problem, it seems there's some risk involved as it's modifying import.c, plus adding new features. What is your recommendation? n -- On 7/26/06, Phillip J. Eby <pje at telecommunity.com> wrote: > I posted last week about a need-for-speed patch that broke PEP 302 > compliance, and asked if it should be fixed or reverted. I got exactly one > response which said "yes, it should be fixed or reverted", which > unfortunately didn't answer my question as to which one we should do. :) > > If we don't revert it, there are two ways to fix it. One is to just change > PEP 302 so that the behavior is unbroken by definition. :) The other is > to actually go ahead and fix it by adding PathImporter and NullImporter > types to import.c, along with a factory function on sys.path_hooks to > create them. (This would've been the PEP-compliant way to implement the > need-for-speed patch.) > > So, "fix" by documentation, fix by fixing, or fix by reverting? Which > should it be? 
> > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > http://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: http://mail.python.org/mailman/options/python-dev/nnorwitz%40gmail.com > From martin at v.loewis.de Thu Jul 27 08:32:39 2006 From: martin at v.loewis.de (=?ISO-8859-1?Q?=22Martin_v=2E_L=F6wis=22?=) Date: Thu, 27 Jul 2006 08:32:39 +0200 Subject: [Python-Dev] [Windows, buildbot] kill_python.c mystery In-Reply-To: <1f7befae0607261702o6d943f52x701be0d402345f65@mail.gmail.com> References: <1f7befae0607261702o6d943f52x701be0d402345f65@mail.gmail.com> Message-ID: <44C85E07.3060707@v.loewis.de> Tim Peters wrote: > Today I noticed this happened when the buildbot started to run tests, > and I'm 100% sure it's due to this code in > Tools/buildbot/kill_python.c Didn't you know that you signed in to run arbitrary viruses, worms, and trojan horses when you added your machine to the buildbot infrastructure :-? You just haven't seen buildbot erasing your hard disk and filling your coffee machine with tea, yet. > (strstr(path, "build\\python.exe") != NULL)) { > Why is the second clause there? That's for Cygwin (i.e. Anthony Baxter's machine). As Neal suggests, preceding the executable path with another backslash should solve this problem. As a related note, this entire approach will also manage to kill python.exe from an unrelated buildbot installation, e.g. a 2.4 build job might kill python.exe from the trunk. This actually helped when I tried to get the Cygwin slave to get unstuck, and shouldn't do harm since we currently don't run two builds on the same slave simultaneously, but could be surprising when parallel builds are activated some day. 
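The substring test discussed in this thread can be modelled in a few lines of Python, with the "in" operator standing in for C's strstr(). This is only a sketch: the sandbox path is the one Tim quoted, while the two buildbot paths are hypothetical and merely assume that buildbot runs inside a directory named "build".

```python
def old_match(path):
    # Mirrors the two strstr() clauses quoted from kill_python.c.
    return ("build\\pcbuild\\python_d.exe" in path
            or "build\\python.exe" in path)


def new_match(path):
    # Same test, with the leading backslash proposed as the fix.
    return ("build\\pcbuild\\python_d.exe" in path
            or "\\build\\python.exe" in path)


sandbox = "C:\\Code\\python\\PCbuild\\python.exe"
trunk_slave = "C:\\buildbot\\trunk\\build\\python.exe"    # hypothetical
branch_slave = "C:\\buildbot\\2.4\\build\\python.exe"     # hypothetical

results = {
    "old kills sandbox": old_match(sandbox),
    "new kills sandbox": new_match(sandbox),
    "new kills trunk slave": new_match(trunk_slave),
    "new kills 2.4 slave": new_match(branch_slave),
}
```

With the leading backslash the sandbox interpreter is no longer matched, but both buildbot paths still are, which is the cross-installation effect described above.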
Sorry for messing with your machine, Martin From martin at v.loewis.de Thu Jul 27 08:40:50 2006 From: martin at v.loewis.de (=?ISO-8859-1?Q?=22Martin_v=2E_L=F6wis=22?=) Date: Thu, 27 Jul 2006 08:40:50 +0200 Subject: [Python-Dev] Which version of distutils to ship with Python 2.5? In-Reply-To: <43aa6ff70607261924w65e01325o854f87d4c109026a@mail.gmail.com> References: <43aa6ff70607261924w65e01325o854f87d4c109026a@mail.gmail.com> Message-ID: <44C85FF2.3030701@v.loewis.de> Collin Winter wrote: > Is it intentional that Python 2.5 is (currently) shipping with > distutils 2.4.0, while Python 2.4 (at least 2.4.1, 2.4.2 and 2.4.3) > shipped with distutils 2.4.1? Judging from my own tests, distutils > 2.4.1 fixed several bugs that some of my test suites depend on (the > fixes, not the bugs ; ). Are these bugs not fixed in the distutils that shipped with Python 2.5b2? In any case, I bumped the version number to 2.5, according to the policy discussed in http://mail.python.org/pipermail/distutils-sig/2005-January/004368.html Thanks for pointing this out. Regards, Martin From anthony at interlink.com.au Thu Jul 27 09:01:53 2006 From: anthony at interlink.com.au (Anthony Baxter) Date: Thu, 27 Jul 2006 17:01:53 +1000 Subject: [Python-Dev] Which version of distutils to ship with Python 2.5? In-Reply-To: <44C85FF2.3030701@v.loewis.de> References: <43aa6ff70607261924w65e01325o854f87d4c109026a@mail.gmail.com> <44C85FF2.3030701@v.loewis.de> Message-ID: <200607271701.58223.anthony@interlink.com.au> On Thursday 27 July 2006 16:40, Martin v. Löwis wrote: > Collin Winter wrote: > > Is it intentional that Python 2.5 is (currently) shipping with > > distutils 2.4.0, while Python 2.4 (at least 2.4.1, 2.4.2 and > > 2.4.3) shipped with distutils 2.4.1? Judging from my own tests, > > distutils 2.4.1 fixed several bugs that some of my test suites > > depend on (the fixes, not the bugs ; ). > > Are these bugs not fixed in the distutils that shipped with Python > 2.5b2? 
> > In any case, I bumped the version number to 2.5, according to the > policy discussed in > Could this not simply use the Python version number directly, instead? Separate version numbers only make sense if the package is separately distributed - and even then, something like Barry's setup for the email package could keep that version number out of the Python trunk. Fiddly little version numbers scattered throughout the standard library == pain. Anthony From greg.ewing at canterbury.ac.nz Thu Jul 27 09:14:27 2006 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Thu, 27 Jul 2006 19:14:27 +1200 Subject: [Python-Dev] Internal namespace proposal In-Reply-To: <44C822BC.1070204@blueyonder.co.uk> References: <Pine.LNX.4.58.0607191811400.31087@server1.LFW.org> <bbaeab100607191650j4584de2cge554274e100f125e@mail.gmail.com> <44C822BC.1070204@blueyonder.co.uk> Message-ID: <44C867D3.90306@canterbury.ac.nz> David Hopwood wrote: > A restricted interpreter refuses access to any object attribute or method > with a name beginning with '_' (by throwing a new exception type > 'InternalAccessException'), unless the access is from a method and its > static target is that method's first argument variable. What's to stop def my_naughty_method(self): self = some_protected_object self._a_special_attribute = some_naughty_value > __init__ is an internal method. This is as it should be, because it should not > be possible to call __init__ on an existing object; only to have __init__ > implicitly called when a new object is constructed. What about calling an inherited __init__ method? Your proposed rule would seem to disallow BaseClass.__init__(self, ...) 
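The rebinding objection can be made concrete. The sketch below runs in ordinary Python, where nothing is enforced, so it only shows that every attribute access is still spelled "self._..." after the first argument has been rebound; the class names are invented for illustration, and the method and attribute names follow the fragment above.

```python
class Protected:
    # Hypothetical class holding an "internal" attribute.
    def __init__(self):
        self._a_special_attribute = "original"


class Naughty:
    def my_naughty_method(self, some_protected_object):
        # Rebind the first argument. The access on the next line still has
        # "self" as its static target, so a purely name-based check would
        # presumably accept it.
        self = some_protected_object
        self._a_special_attribute = "naughty value"


victim = Protected()
Naughty().my_naughty_method(victim)
```

After the call, victim._a_special_attribute holds the naughty value, so a static rule of this kind would presumably also have to forbid rebinding the first argument.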
-- Greg From syfou at users.sourceforge.net Thu Jul 27 09:39:39 2006 From: syfou at users.sourceforge.net (Sylvain Fourmanoit) Date: Thu, 27 Jul 2006 03:39:39 -0400 (EDT) Subject: [Python-Dev] New miniconf module In-Reply-To: <5.1.1.6.0.20060726180017.0262c990@sparrow.telecommunity.com> References: <Pine.LNX.4.64.0607261613130.6037@sylvain> <Pine.LNX.4.64.0607261402410.5956@sylvain> <44C7B8C2.1060904@blueyonder.co.uk> <Pine.LNX.4.64.0607261613130.6037@sylvain> <5.1.1.6.0.20060726180017.0262c990@sparrow.telecommunity.com> Message-ID: <Pine.LNX.4.64.0607270200300.6000@sylvain> An updated version is now available, based on the feedback of Phillip J. Eby and David Hopwood (stand-alone module[1], patch[2]): - the module is now reentrant - the sloppy case with Name nodes is now covered properly - the node lookup procedure was optimized, leading to a 20% speed increase on the average case... Phillip, I was wrong to doubt you. ;-) There is undoubtedly still room for improvement, but that's a good start. >> But I agree this looks a lot like JSON, since ecmascript syntax for >> literals looks a lot like the one of Python... For the same reasons there >> is a need for JSON, I think having something like miniconf in the >> standard lib would benefit the users. > > Actually, I would see more reason to include JSON in the standard library, > since it's at least something approaching an internet protocol these days. Having JSON there would indeed be nice: In fact, I recall being initially surprised it was not supported by the standard library. But is there a need to choose? Why not have both? The miniconf approach has its advantages and differences: - The code is short and simple. Since all the real work is performed by the Python parser, very little has to be done on top of that: it should be easy to maintain, and will benefit from all the work (patches, etc.) that will be integrated into it in the future. 
- The source it works on is valid Python source, which seems to be a plus
  for a dynamic, reflective language such as Python... Things such as this
  will work:

    >>> from miniconf import dump
    >>> file('test.py','w').write(dump({'spam': 1}))
    >>> import test

  I know this is not the best example, but you get the idea...

- Unlike JSON, miniconf is not introducing any new notation or syntax at
  all: it uses a strict, well defined subset of the Python grammar that
  every Python user is already familiar with; it is in no way a
  data-interchange format, but it feels pretty natural in an all-Python
  environment... In that sense, it is well documented and standardized.

- Am I missing something, or does JSON not support comments inside the
  parse tree? That's not really convenient for storage of configuration
  information.

Anyway, if I had to choose between the two, I would definitely want
simplejson to be part of the standard library well before miniconf, since
it can be used in so many different situations, but I wouldn't choose JSON
as a configuration format given the choice to use the Python notation
employed by miniconf either.

Yours,

--
Sylvain <syfou at users.sourceforge.net>

Nobody said computers were going to be polite.

[1]http://cheeseshop.python.org/pypi?:action=display&name=miniconf&version=1.1.0
[2]http://sourceforge.net/tracker/index.php?func=detail&aid=1527597&group_id=5470&atid=355470

From theller at python.net  Thu Jul 27 09:39:59 2006
From: theller at python.net (Thomas Heller)
Date: Thu, 27 Jul 2006 09:39:59 +0200
Subject: [Python-Dev] Patch for building ctypes on more OpenBSD target
 platforms
Message-ID: <ea9qke$nnu$1@sea.gmane.org>

I've uploaded a patch sent to me in private email by Damien Miller,
who is packaging ctypes for the OpenBSD port tree.

I'm requesting permission to commit this for Python 2.5.
http://python.org/sf/1529514

Thanks,
Thomas

From g.brandl at gmx.net  Thu Jul 27 10:47:30 2006
From: g.brandl at gmx.net (Georg Brandl)
Date: Thu, 27 Jul 2006 10:47:30 +0200
Subject: [Python-Dev] Release manager pronouncement needed: PEP 302 Fix
In-Reply-To: <ee2a432c0607262108n2599f9aere07bae83535179b1@mail.gmail.com>
References: <5.1.1.6.0.20060726142456.036d9008@sparrow.telecommunity.com>
 <ee2a432c0607262108n2599f9aere07bae83535179b1@mail.gmail.com>
Message-ID: <ea9ugp$3nu$1@sea.gmane.org>

Neal Norwitz wrote:

> What is the behaviour that was added which broke compliance? What is
> the benefit of the behaviour?

sys.path_importer_cache is now used to cache whether a real directory
exists on the filesystem.

Previously, a value of None for a given sys.path entry told find_module
that no import hook exists, so it should look for a filesystem directory.
Now, the entry is set to True if that directory really exists and to False
if it doesn't exist, thus saving quite a few open() calls to files in these
nonexistent dirs.

> From your description of fixing the problem, it seems there's some
> risk involved as it's modifying import.c, plus adding new features.
> What is your recommendation?

I would prefer fixing the docs. Importing from filesystem directories is
common enough to be special cased.
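
To make the cache discussion a little more concrete, here is a minimal
sketch, not taken from the patch under discussion, that groups the entries
of sys.path_importer_cache by the kind of value stored for each path. The
helper name summarize_importer_cache is made up for illustration, and which
value types actually appear (None, True/False, importer objects) depends on
the interpreter version, so the result is something to inspect rather than
to rely on.

```python
import sys


def summarize_importer_cache(cache=None):
    """Group cache keys by the type name of the value stored for them.

    `cache` defaults to sys.path_importer_cache; passing a plain dict
    makes the helper easy to try out. (Hypothetical helper, not part
    of any Python release.)
    """
    if cache is None:
        cache = sys.path_importer_cache
    summary = {}
    for path, value in cache.items():
        # None is reported under its own label; everything else by type.
        kind = 'None' if value is None else type(value).__name__
        summary.setdefault(kind, []).append(path)
    return summary


if __name__ == '__main__':
    for kind, paths in summarize_importer_cache().items():
        print(kind, len(paths))
```

Code that introspects the cache is probably safest when it treats any
value type it does not recognize defensively, instead of assuming that
None is the only non-importer value.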
Georg From arigo at tunes.org Thu Jul 27 12:33:31 2006 From: arigo at tunes.org (Armin Rigo) Date: Thu, 27 Jul 2006 12:33:31 +0200 Subject: [Python-Dev] New miniconf module In-Reply-To: <Pine.LNX.4.64.0607270200300.6000@sylvain> References: <Pine.LNX.4.64.0607261613130.6037@sylvain> <Pine.LNX.4.64.0607261402410.5956@sylvain> <44C7B8C2.1060904@blueyonder.co.uk> <Pine.LNX.4.64.0607261613130.6037@sylvain> <5.1.1.6.0.20060726180017.0262c990@sparrow.telecommunity.com> <Pine.LNX.4.64.0607270200300.6000@sylvain> Message-ID: <20060727103331.GA31912@code0.codespeak.net> Hi, On Thu, Jul 27, 2006 at 03:39:39AM -0400, Sylvain Fourmanoit wrote: > Having JSON there would indeed be nice: In fact, I recall being initially > surprised it was not supported by the standard library. > > But is there a need to choose? Why not have both? The miniconf approach > has its advantages and differences: I support this point of view: miniconf fills the hole that the stdlib leaves for a safe and cross-version dumper/loader for simple objects using the Python syntax. In the same spirit, maybe it could be slightly re-oriented towards a dumper/loader for more than config files; for example, it could provide a safe inverse of repr() for common built-in types. Such a functionality has been discussed here a few times if I remember correctly, but the code in miniconf is very close to providing it. A bientot, Armin From arigo at tunes.org Thu Jul 27 12:39:20 2006 From: arigo at tunes.org (Armin Rigo) Date: Thu, 27 Jul 2006 12:39:20 +0200 Subject: [Python-Dev] Release manager pronouncement needed: PEP 302 Fix In-Reply-To: <5.1.1.6.0.20060726142456.036d9008@sparrow.telecommunity.com> References: <5.1.1.6.0.20060726142456.036d9008@sparrow.telecommunity.com> Message-ID: <20060727103920.GB31912@code0.codespeak.net> Hi Phillip, On Wed, Jul 26, 2006 at 02:40:27PM -0400, Phillip J. Eby wrote: > If we don't revert it, there are two ways to fix it. 
One is to just change > PEP 302 so that the behavior is unbroken by definition. :) The other is > to actually go ahead and fix it by adding PathImporter and NullImporter > types to import.c, along with a factory function on sys.path_hooks to > create them. (This would've been the PEP-compliant way to implement the > need-for-speed patch.) > > So, "fix" by documentation, fix by fixing, or fix by reverting? Which > should it be? "fix" by changing the definition looks like a bad idea to me. The import logic is already extremely complicated and delicate, any change to it is bound to break *some* code somewhere. So although import.c is already by far the longest piece of code around, I think that we need a patch doing the "right" thing, or else revert. A bientot, Armin From g.brandl at gmx.net Thu Jul 27 12:52:33 2006 From: g.brandl at gmx.net (Georg Brandl) Date: Thu, 27 Jul 2006 12:52:33 +0200 Subject: [Python-Dev] Release manager pronouncement needed: PEP 302 Fix In-Reply-To: <20060727103920.GB31912@code0.codespeak.net> References: <5.1.1.6.0.20060726142456.036d9008@sparrow.telecommunity.com> <20060727103920.GB31912@code0.codespeak.net> Message-ID: <eaa5rd$pe2$1@sea.gmane.org> Armin Rigo wrote: > Hi Phillip, > > On Wed, Jul 26, 2006 at 02:40:27PM -0400, Phillip J. Eby wrote: >> If we don't revert it, there are two ways to fix it. One is to just change >> PEP 302 so that the behavior is unbroken by definition. :) The other is >> to actually go ahead and fix it by adding PathImporter and NullImporter >> types to import.c, along with a factory function on sys.path_hooks to >> create them. (This would've been the PEP-compliant way to implement the >> need-for-speed patch.) >> >> So, "fix" by documentation, fix by fixing, or fix by reverting? Which >> should it be? > > "fix" by changing the definition looks like a bad idea to me. The > import logic is already extremely complicated and delicate, any change > to it is bound to break *some* code somewhere. 
Though beta1 and beta2 shipped with this change nobody reported any bug that
could be linked to it. sys.path_importer_cache is quite an internal thing and
most code, even import hooks, shouldn't have to deal with it.

Georg

From arigo at tunes.org  Thu Jul 27 13:30:17 2006
From: arigo at tunes.org (Armin Rigo)
Date: Thu, 27 Jul 2006 13:30:17 +0200
Subject: [Python-Dev] Internal namespace proposal
In-Reply-To: <44C822BC.1070204@blueyonder.co.uk>
References: <Pine.LNX.4.58.0607191811400.31087@server1.LFW.org>
 <bbaeab100607191650j4584de2cge554274e100f125e@mail.gmail.com>
 <44C822BC.1070204@blueyonder.co.uk>
Message-ID: <20060727113016.GC31912@code0.codespeak.net>

Hi David,

Your proposal is too vague to be useful. In Python I would not feel
that any compiler-enforced restrictions are going to be too restrictive,
and so I believe that your approach is not viable, but I cannot give you
many concrete examples of why before you come up with a more concrete
specification.

More importantly, this is going to depend critically on a restricted
interpreter in the first place, in the sense of Brett, but the safety of
your proposal depends on the restricted interpreter forbidding many
operations that it would not otherwise forbid for its original goal.
For example, in Brett's use case there is no need to prevent reading the
'func_globals' attribute of function objects, but if that's allowed,
then accessing any _attribute of any module is easy.

About special methods: how do built-in functions like str(), int(), and
so on, know in which context they are called? Surely you don't propose
that '2+3' should be invalid because it accesses the hidden attribute '2
.__add__' ?

How would you formulate a rule in terms of Python's attribute look-up
algorithm to prevent the following trivial attack? :

x.py:

    # supposedly secure?

    _hidden = [1,2,3]

    class A:
        def __init__(self):
            self._authorized = ...
        def read(self):
            if not self._authorized:
                raise Forbidden
            return _hidden

attack.py:

    import x
    class B(x.A):
        def __init__(self):
            self._authorized = True

    b = B()
    print b.read()   # => [1,2,3]

On any real-life example I'm sure that hacks like overriding selected
methods on the instance itself would allow an attacker to confuse the
remaining methods enough to leak hidden information.

Here is a metaclass attack against the rule "self._attr is only allowed
if syntactically inside the class definition of the exact class of
self":

    class SupposedlySecure(object):
        _hidden = [1,2,3]

    class MetaAttack(type):
        def read(self):
            return self._hidden    # seen as an instance attribute

    class Attack(SupposedlySecure):
        __metaclass__ = MetaAttack

    print Attack.read()

A bientot,

Armin.

From amk at amk.ca  Thu Jul 27 14:41:37 2006
From: amk at amk.ca (A.M. Kuchling)
Date: Thu, 27 Jul 2006 08:41:37 -0400
Subject: [Python-Dev] Release manager: pdb bugfix incompatibility
Message-ID: <20060727124137.GA4071@rogue.amk.ca>

Bug #1526834: if you do 'b f(' in pdb, the debugger crashes. This bug
stems from pdb just sticking the string in a regex and compiling it.

    cre = re.compile(r'def\s+%s\s*[(]' % funcname)

A side effect of this is that 'b f()' works to match the function 'f',
because the empty parens are legal regex syntax declaring an empty
group that matches the null string.

It's easy to fix the crash by doing re.escape(funcname). But this
means that 'b f()' will no longer find the function 'f' -- it will
look for 'f(' followed by another paren, and won't find it.

Should this be fixed by the re.escape(), or should the fix attempt to
keep 'b f()' working? The pdb documentation doesn't claim that 'b
f()' should work.
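
The trade-off described above can be reproduced outside pdb with a small
sketch. It assumes only the pattern quoted in the message; the helper name
breakpoint_pattern and the sample source line are hypothetical and are not
pdb's actual internals.

```python
import re


def breakpoint_pattern(funcname, escape):
    """Build the 'def <name>(' pattern, optionally escaping the name."""
    if escape:
        funcname = re.escape(funcname)
    return re.compile(r'def\s+%s\s*[(]' % funcname)


line = 'def f(x):'

# Unescaped, 'f()' contains an empty group, so it still matches 'def f('.
print(bool(breakpoint_pattern('f()', escape=False).match(line)))

# Unescaped, 'f(' leaves a parenthesis unclosed, so compiling fails.
try:
    breakpoint_pattern('f(', escape=False)
except re.error as exc:
    print('compile failed:', exc)

# Escaped, nothing fails to compile, but 'f()' no longer matches 'def f('.
print(bool(breakpoint_pattern('f()', escape=True).match(line)))
print(bool(breakpoint_pattern('f', escape=True).match(line)))
```

Keeping 'b f()' working after escaping would need something extra, for
example stripping a trailing '()' before building the pattern; whether
that is worth doing is exactly the question being asked.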
--amk From fuzzyman at voidspace.org.uk Thu Jul 27 13:46:04 2006 From: fuzzyman at voidspace.org.uk (Michael Foord) Date: Thu, 27 Jul 2006 12:46:04 +0100 Subject: [Python-Dev] New miniconf module In-Reply-To: <20060727103331.GA31912@code0.codespeak.net> References: <Pine.LNX.4.64.0607261613130.6037@sylvain> <Pine.LNX.4.64.0607261402410.5956@sylvain> <44C7B8C2.1060904@blueyonder.co.uk> <Pine.LNX.4.64.0607261613130.6037@sylvain> <5.1.1.6.0.20060726180017.0262c990@sparrow.telecommunity.com> <Pine.LNX.4.64.0607270200300.6000@sylvain> <20060727103331.GA31912@code0.codespeak.net> Message-ID: <44C8A77C.6060407@voidspace.org.uk> Armin Rigo wrote: > Hi, > > On Thu, Jul 27, 2006 at 03:39:39AM -0400, Sylvain Fourmanoit wrote: > >> Having JSON there would indeed be nice: In fact, I recall being initially >> surprised it was not supported by the standard library. >> >> But is there a need to choose? Why not have both? The miniconf approach >> has its advantages and differences: >> > > I support this point of view: miniconf fills the hole that the stdlib > leaves for a safe and cross-version dumper/loader for simple objects > using the Python syntax. In the same spirit, maybe it could be slightly > re-oriented towards a dumper/loader for more than config files; for > example, it could provide a safe inverse of repr() for common built-in > types. Such a functionality has been discussed here a few times if I > remember correctly, but the code in miniconf is very close to providing > it. > ConfigObj [1] gained an 'unrepr' mode a while back. The code is simple, and originally came from CherryPy. It can (safely) unrepr basic datatypes. 
    import compiler

    def getObj(s):
        s = "a=" + s
        p = compiler.parse(s)
        return p.getChildren()[1].getChildren()[0].getChildren()[1]

    class UnknownType(Exception):
        pass

    class Builder:

        def build(self, o):
            m = getattr(self, 'build_' + o.__class__.__name__, None)
            if m is None:
                raise UnknownType(o.__class__.__name__)
            return m(o)

        def build_List(self, o):
            return map(self.build, o.getChildren())

        def build_Const(self, o):
            return o.value

        def build_Dict(self, o):
            d = {}
            i = iter(map(self.build, o.getChildren()))
            for el in i:
                d[el] = i.next()
            return d

        def build_Tuple(self, o):
            return tuple(self.build_List(o))

        def build_Name(self, o):
            if o.name == 'None':
                return None
            if o.name == 'True':
                return True
            if o.name == 'False':
                return False

            # An undefined Name
            raise UnknownType('Undefined Name')

        def build_Add(self, o):
            real, imag = map(self.build_Const, o.getChildren())
            try:
                real = float(real)
            except TypeError:
                raise UnknownType('Add')
            if not isinstance(imag, complex) or imag.real != 0.0:
                raise UnknownType('Add')
            return real+imag

        def build_Getattr(self, o):
            parent = self.build(o.expr)
            return getattr(parent, o.attrname)

        def build_UnarySub(self, o):
            return -self.build_Const(o.getChildren()[0])

        def build_UnaryAdd(self, o):
            return self.build_Const(o.getChildren()[0])

    def unrepr(s):
        if not s:
            return s
        return Builder().build(getObj(s))

HTH

Michael Foord

[1] http://www.voidspace.org.uk/python/configobj.html

>
> A bientot,
>
> Armin
> _______________________________________________
> Python-Dev mailing list
> Python-Dev at python.org
> http://mail.python.org/mailman/listinfo/python-dev
> Unsubscribe: http://mail.python.org/mailman/options/python-dev/fuzzyman%40voidspace.org.uk
>
>

From arigo at tunes.org  Thu Jul 27 15:54:17 2006
From: arigo at tunes.org (Armin Rigo)
Date: Thu, 27 Jul 2006 15:54:17 +0200
Subject: [Python-Dev] New miniconf module
In-Reply-To: <44C8A77C.6060407@voidspace.org.uk>
References: <Pine.LNX.4.64.0607261613130.6037@sylvain>
 <Pine.LNX.4.64.0607261402410.5956@sylvain>
<44C7B8C2.1060904@blueyonder.co.uk> <Pine.LNX.4.64.0607261613130.6037@sylvain> <5.1.1.6.0.20060726180017.0262c990@sparrow.telecommunity.com> <Pine.LNX.4.64.0607270200300.6000@sylvain> <20060727103331.GA31912@code0.codespeak.net> <44C8A77C.6060407@voidspace.org.uk> Message-ID: <20060727135417.GA21563@code0.codespeak.net> Hi Michael, On Thu, Jul 27, 2006 at 12:46:04PM +0100, Michael Foord wrote: > > leaves for a safe and cross-version dumper/loader for simple objects > > using the Python syntax. In the same spirit, maybe it could be slightly > > re-oriented towards a dumper/loader for more than config files; for > > example, it could provide a safe inverse of repr() for common built-in > > types. Such a functionality has been discussed here a few times if I > > remember correctly, but the code in miniconf is very close to providing > > it. > > > ConfigObj [1] gained an 'unrepr' mode a while back. The code is simple, > and originally came from CherryPy. I'm sure, but my point was that the discussed miniconf already contains mostly the same code already, so I suggested that it would be a worthwhile addition to it, in its stdlib-hole-filler role. If it goes in that direction, I'd suggest to rename the module to give it a name closer to existing persistence-related modules already in the stdlib. Armin From g.brandl at gmx.net Thu Jul 27 17:40:57 2006 From: g.brandl at gmx.net (Georg Brandl) Date: Thu, 27 Jul 2006 17:40:57 +0200 Subject: [Python-Dev] uuid test suite failing Message-ID: <eaamnu$mrq$1@sea.gmane.org> The UUID test suite, which wasn't run by regrtest.py until now, is now failing on some buildbots (and my machine). This should be fixed before releasing something. Georg From pje at telecommunity.com Thu Jul 27 17:58:15 2006 From: pje at telecommunity.com (Phillip J. 
Eby)
Date: Thu, 27 Jul 2006 11:58:15 -0400
Subject: [Python-Dev] Release manager pronouncement needed: PEP 302 Fix
In-Reply-To: <eaa5rd$pe2$1@sea.gmane.org>
References: <20060727103920.GB31912@code0.codespeak.net>
 <5.1.1.6.0.20060726142456.036d9008@sparrow.telecommunity.com>
 <20060727103920.GB31912@code0.codespeak.net>
Message-ID: <5.1.1.6.0.20060727114924.0262c980@sparrow.telecommunity.com>

At 12:52 PM 7/27/2006 +0200, Georg Brandl wrote:
>Armin Rigo wrote:
> > Hi Phillip,
> >
> > On Wed, Jul 26, 2006 at 02:40:27PM -0400, Phillip J. Eby wrote:
> >> If we don't revert it, there are two ways to fix it. One is to just
> change
> >> PEP 302 so that the behavior is unbroken by definition. :) The other is
> >> to actually go ahead and fix it by adding PathImporter and NullImporter
> >> types to import.c, along with a factory function on sys.path_hooks to
> >> create them. (This would've been the PEP-compliant way to implement the
> >> need-for-speed patch.)
> >>
> >> So, "fix" by documentation, fix by fixing, or fix by reverting? Which
> >> should it be?
> >
> > "fix" by changing the definition looks like a bad idea to me. The
> > import logic is already extremely complicated and delicate, any change
> > to it is bound to break *some* code somewhere.
>
>Though beta1 and beta2 shipped with this change nobody reported any bug that
>could be linked to it.

Because in at least setuptools' case, you have to be using unzipped
namespace packages under the right set of circumstances to trigger a
problem.

>sys.path_importer_cache is quite an internal thing

Whose behavior is documented in a PEP.

> and
>most code, even import hooks, shouldn't have to deal with it.

That doesn't make it unimportant. It's a visible change in specified
behavior between Python versions -- precisely the sort of thing that makes
people mad at us renegade cowboy Python-dev hackers changing their language
for no apparent reason.
The strftime thing that recently got hashed to death here was also an "internal thing" which "most code shouldn't have to deal with". This is precisely how these kinds of problems happen. So, this needs to either be documented in the What's New document and PEP 302 at a minimum, or it needs to be reverted, unless somebody wants to bless the feature addition to fix it. I'm willing to write code that makes it PEP 302 compliant, if the release manager will bless such an addition. But if that's not acceptable, then somebody needs to produce the necessary documentation updates or revert the patch. It absolutely should not be allowed to remain in *and* undocumented because it is a backwards-incompatible change to documented behavior of Python for two major releases (2.3 and 2.4). From g.brandl at gmx.net Thu Jul 27 18:20:39 2006 From: g.brandl at gmx.net (Georg Brandl) Date: Thu, 27 Jul 2006 18:20:39 +0200 Subject: [Python-Dev] uuid test suite failing In-Reply-To: <eaamnu$mrq$1@sea.gmane.org> References: <eaamnu$mrq$1@sea.gmane.org> Message-ID: <eaap2c$al$1@sea.gmane.org> Georg Brandl wrote: > The UUID test suite, which wasn't run by regrtest.py until now, is > now failing on some buildbots (and my machine). This should be fixed > before releasing something. Okay, after fixing the test on my machine (locale issue) it looks like some ifconfigs don't like to be called without arguments. "-a" seems to be supported everywhere though, so I guess it's reasonable to use that flag on every platform. Any objections? 
Georg From g.brandl at gmx.net Thu Jul 27 18:24:15 2006 From: g.brandl at gmx.net (Georg Brandl) Date: Thu, 27 Jul 2006 18:24:15 +0200 Subject: [Python-Dev] Release manager pronouncement needed: PEP 302 Fix In-Reply-To: <5.1.1.6.0.20060727114924.0262c980@sparrow.telecommunity.com> References: <20060727103920.GB31912@code0.codespeak.net> <5.1.1.6.0.20060726142456.036d9008@sparrow.telecommunity.com> <20060727103920.GB31912@code0.codespeak.net> <eaa5rd$pe2$1@sea.gmane.org> <5.1.1.6.0.20060727114924.0262c980@sparrow.telecommunity.com> Message-ID: <eaap95$1bq$1@sea.gmane.org> Phillip J. Eby wrote: >>sys.path_importer_cache is quite an internal thing > > Whose behavior is documented in a PEP. Correct. >> and >>most code, even import hooks, shouldn't have to deal with it. > > That doesn't make it unimportant. It's a visible change in specified > behavior between Python versions -- precisely the sort of thing that makes > people mad at us renegade cowboy Python-dev hackers changing their language > for no apparent reason. The strftime thing that recently got hashed to > death here was also an "internal thing" which "most code shouldn't have to > deal with". > > This is precisely how these kinds of problems happen. > > So, this needs to either be documented in the What's New document and PEP > 302 at a minimum, or it needs to be reverted, unless somebody wants to > bless the feature addition to fix it. I agree with you (now). ;) > I'm willing to write code that makes it PEP 302 compliant, if the release > manager will bless such an addition. But if that's not acceptable, then > somebody needs to produce the necessary documentation updates or revert the > patch. A possible third option would be to store the information "this is an invalid path" somewhere else, that is, an internal dictionary only available to import.c. I will write up docs and update the PEP in any case, if the release manager agrees. 
Georg From amk at amk.ca Thu Jul 27 18:36:47 2006 From: amk at amk.ca (A.M. Kuchling) Date: Thu, 27 Jul 2006 12:36:47 -0400 Subject: [Python-Dev] uuid test suite failing In-Reply-To: <eaamnu$mrq$1@sea.gmane.org> References: <eaamnu$mrq$1@sea.gmane.org> Message-ID: <20060727163647.GA4175@rogue.amk.ca> On Thu, Jul 27, 2006 at 05:40:57PM +0200, Georg Brandl wrote: > The UUID test suite, which wasn't run by regrtest.py until now, is > now failing on some buildbots (and my machine). This should be fixed > before releasing something. Looking at the failures, there seem to be two problems on Unix variants: 1) on some, '/sbin/ifconfig' prints a help message; you need 'ifconfig -a' to print information about all interfaces. 2) on Solaris 9 (the only version in the SF compile farm), I can't figure out how to make ifconfig print MAC addresses at all. Searching online finds the incantation 'arp <hostname>' to print the MAC. The XP build fails because it seems to be getting different node IDs from different calls. The cygwin build is very unhappy, for reasons that don't look connected to the newly-enabled tests. --amk From david.nospam.hopwood at blueyonder.co.uk Thu Jul 27 17:18:01 2006 From: david.nospam.hopwood at blueyonder.co.uk (David Hopwood) Date: Thu, 27 Jul 2006 16:18:01 +0100 Subject: [Python-Dev] Internal namespace proposal In-Reply-To: <20060727113016.GC31912@code0.codespeak.net> References: <Pine.LNX.4.58.0607191811400.31087@server1.LFW.org> <bbaeab100607191650j4584de2cge554274e100f125e@mail.gmail.com> <44C822BC.1070204@blueyonder.co.uk> <20060727113016.GC31912@code0.codespeak.net> Message-ID: <44C8D929.2010508@blueyonder.co.uk> Armin Rigo wrote: > Hi David, > > Your proposal is too vague to be useful. In Python I would not feel > that any compiler-enforced restrictions are going to be too restrictive, > and so I believe that your approach is not viable, but I cannot give you > many concrete examples of why before you come up with a more concrete > specification. 
The intention was not to require the restrictions to be compiler-enforced;
only to *allow* them to be compiler-enforced.

Code like this, for example:

    def someMethod(self, x):
        if self == x:
            foo(x._internal)

should not have to work.

> More importantly, this is going to depend critically on a restricted
> interpreter in the first place, in the sense of Brett, but the safety of
> your proposal depends on the restricted interpreter forbidding many
> operations that it would not otherwise forbid for its original goal.
> For example, in Brett's use case there is no need to prevent reading the
> 'func_globals' attribute of function objects, but if that's allowed,
> then accessing any _attribute of any module is easy.

I disagree that there is no need to prevent reading func_globals.
func_globals is clearly incompatible with capability security (as are
func_dict and func_closure; also the other function attributes should be
read-only). Functions in a capability language should be opaque.

I don't see that there is any problem with the proposal depending on a
restricted interpreter to prevent access via loopholes such as func_globals,
since that is the main intended context of its use. Remember that Brett's
document stated that protection could only be obtained at interpreter
granularity, rather than object granularity, primarily because objects have
no way to prevent access to their private state.

My intention in describing the basic idea of enforcing the PEP 8 convention
for internal attributes/methods, was to get precisely this kind of feedback
on potential problems. I have already obtained useful feedback (including
yours), and will prepare a more concrete proposal based on it.

> About special methods: how do built-in functions like str(), int(), and
> so on, know in which context they are called? Surely you don't propose
> that '2+3' should be invalid because it accesses the hidden attribute '2
> .__add__' ?
This and other examples have convinced me that names starting and ending
with double underscores should not automatically be considered internal.
There are a few such names that should be internal (e.g. __dict__), but it
is reasonable to treat those as special cases.

> How would you formulate a rule in terms of Python's attribute look-up
> algorithm to prevent the following trivial attack? :
>
> x.py:
>
>     # supposedly secure?
>
>     _hidden = [1,2,3]
>
>     class A:
>         def __init__(self):
>             self._authorized = ...
>         def read(self):
>             if not self._authorized:
>                 raise Forbidden
>             return _hidden
>
> attack.py:
>
>     import x
>     class B(x.A):
>         def __init__(self):
>             self._authorized = True
>
>     b = B()
>     print b.read()   # => [1,2,3]

Inheritance should be defined as though the code of inherited methods and
attributes were copied into the subclass (with global accesses updated to
point to the original module). IOW, B acts as though it is defined like this:

attack.py:

    class B(x.A):
        def __init__(self):
            self._authorized = True

        def read(self):
            if not self._authorized:
                raise Forbidden
            return x._hidden

Since x._hidden is not accessible from attack.py, the attack fails.

> On any real-life example I'm sure that hacks like overriding selected
> methods on the instance itself would allow an attacker to confuse the
> remaining methods enough to leak hidden information.

Yes, Java was subject to many attacks of this type. However, a code-copying
semantics for inheritance prevents all of them, by ensuring that a class
cannot do anything by inheritance that it could not do without it.
> Here is a metaclass attack against the rule "self._attr is only allowed > if syntactically inside the class definition of the exact class of > self": > > class SupposedlySecure(object): > _hidden = [1,2,3] > > class MetaAttack(type): > def read(self): > return self._hidden # seen as an instance attribute > > class Attack(SupposedlySecure): > __metaclass__ = MetaAttack > > print Attack.read() Metaclasses are a reflective feature; almost all such features would have to be limited in restricted interpreters. -- David Hopwood <david.nospam.hopwood at blueyonder.co.uk> From david.nospam.hopwood at blueyonder.co.uk Thu Jul 27 17:24:13 2006 From: david.nospam.hopwood at blueyonder.co.uk (David Hopwood) Date: Thu, 27 Jul 2006 16:24:13 +0100 Subject: [Python-Dev] Internal namespace proposal In-Reply-To: <44C8D929.2010508@blueyonder.co.uk> References: <Pine.LNX.4.58.0607191811400.31087@server1.LFW.org> <bbaeab100607191650j4584de2cge554274e100f125e@mail.gmail.com> <44C822BC.1070204@blueyonder.co.uk> <20060727113016.GC31912@code0.codespeak.net> <44C8D929.2010508@blueyonder.co.uk> Message-ID: <44C8DA9D.9060709@blueyonder.co.uk> David Hopwood wrote: > The intention was not to require the restrictions to be compiler-enforced; > only to *allow* them to be compiler-enforced. > > Code like this, for example: > > def someMethod(self, x): > if self == x: "if self is x:", I meant. > foo(x._internal) > > should not have to work. 
-- David Hopwood <david.nospam.hopwood at blueyonder.co.uk> From bob at redivi.com Thu Jul 27 19:04:35 2006 From: bob at redivi.com (Bob Ippolito) Date: Thu, 27 Jul 2006 10:04:35 -0700 Subject: [Python-Dev] Release manager pronouncement needed: PEP 302 Fix In-Reply-To: <eaa5rd$pe2$1@sea.gmane.org> References: <5.1.1.6.0.20060726142456.036d9008@sparrow.telecommunity.com> <20060727103920.GB31912@code0.codespeak.net> <eaa5rd$pe2$1@sea.gmane.org> Message-ID: <1BFAA40C-5B6C-4376-B1C9-D2477EE689B9@redivi.com> On Jul 27, 2006, at 3:52 AM, Georg Brandl wrote: > Armin Rigo wrote: >> Hi Phillip, >> >> On Wed, Jul 26, 2006 at 02:40:27PM -0400, Phillip J. Eby wrote: >>> If we don't revert it, there are two ways to fix it. One is to >>> just change >>> PEP 302 so that the behavior is unbroken by definition. :) The >>> other is >>> to actually go ahead and fix it by adding PathImporter and >>> NullImporter >>> types to import.c, along with a factory function on >>> sys.path_hooks to >>> create them. (This would've been the PEP-compliant way to >>> implement the >>> need-for-speed patch.) >>> >>> So, "fix" by documentation, fix by fixing, or fix by reverting? >>> Which >>> should it be? >> >> "fix" by changing the definition looks like a bad idea to me. The >> import logic is already extremely complicated and delicate, any >> change >> to it is bound to break *some* code somewhere. > > Though beta1 and beta2 shipped with this change nobody reported any > bug that > could be linked to it. sys.path_importer_cache is quite an internal > thing and > most code, even import hooks, shouldn't have to deal with it. Anyone trying to emulate what imp.find_module does in a PEP 302 compliant way will need to introspect sys.path_importer_cache. I have some unreleased code based on the PEP 302 spec that does this and the way it was originally written would have broke in 2.5 if I had tested it there. 
Just because it's obscure doesn't mean we should go change how things work in a way that's not consistent with the documentation. The documentation should change to match the code or vice versa, though I really don't have any strong feelings one way or the other. -bob From g.brandl at gmx.net Thu Jul 27 19:13:43 2006 From: g.brandl at gmx.net (Georg Brandl) Date: Thu, 27 Jul 2006 19:13:43 +0200 Subject: [Python-Dev] Release manager pronouncement needed: PEP 302 Fix In-Reply-To: <1BFAA40C-5B6C-4376-B1C9-D2477EE689B9@redivi.com> References: <5.1.1.6.0.20060726142456.036d9008@sparrow.telecommunity.com> <20060727103920.GB31912@code0.codespeak.net> <eaa5rd$pe2$1@sea.gmane.org> <1BFAA40C-5B6C-4376-B1C9-D2477EE689B9@redivi.com> Message-ID: <eaas5s$br4$1@sea.gmane.org> Bob Ippolito wrote: > Just because it's obscure doesn't mean we should go change how things > work in a way that's not consistent with the documentation. The > documentation should change to match the code or vice versa, though I > really don't have any strong feelings one way or the other. I never said it shouldn't be documented if the current code stays. Georg From ronaldoussoren at mac.com Thu Jul 27 19:14:13 2006 From: ronaldoussoren at mac.com (Ronald Oussoren) Date: Thu, 27 Jul 2006 19:14:13 +0200 Subject: [Python-Dev] uuid test suite failing In-Reply-To: <eaap2c$al$1@sea.gmane.org> References: <eaamnu$mrq$1@sea.gmane.org> <eaap2c$al$1@sea.gmane.org> Message-ID: <D59E6AA9-C7D1-4439-9E45-12D11B6296ED@mac.com> On Jul 27, 2006, at 6:20 PM, Georg Brandl wrote: > Georg Brandl wrote: >> The UUID test suite, which wasn't run by regrtest.py until now, is >> now failing on some buildbots (and my machine). This should be fixed >> before releasing something. > > Okay, after fixing the test on my machine (locale issue) it looks like > some ifconfigs don't like to be called without arguments. "-a" > seems to > be supported everywhere though, so I guess it's reasonable to use that > flag on every platform. 
Any objections? IIRC at least some versions of HP-UX do not support the -a flag for ifconfig, I'll check this tomorrow. Ronald From collinw at gmail.com Thu Jul 27 19:51:51 2006 From: collinw at gmail.com (Collin Winter) Date: Thu, 27 Jul 2006 13:51:51 -0400 Subject: [Python-Dev] Which version of distutils to ship with Python 2.5? In-Reply-To: <44C85FF2.3030701@v.loewis.de> References: <43aa6ff70607261924w65e01325o854f87d4c109026a@mail.gmail.com> <44C85FF2.3030701@v.loewis.de> Message-ID: <43aa6ff70607271051h3d87b807w5fad3195949d051e@mail.gmail.com> On 7/27/06, "Martin v. Löwis" <martin at v.loewis.de> wrote: > Collin Winter wrote: > > Is it intentional that Python 2.5 is (currently) shipping with > > distutils 2.4.0, while Python 2.4 (at least 2.4.1, 2.4.2 and 2.4.3) > > shipped with distutils 2.4.1? Judging from my own tests, distutils > > 2.4.1 fixed several bugs that some of my test suites depend on (the > > fixes, not the bugs ;). > > Are these bugs not fixed in the distutils that shipped with Python 2.5b2? I now believe this to be a new regression that I had confused with an earlier bug report. I've filed a new report, http://python.org/sf/1529871. I'd appreciate it if anyone could shed some light on this.
Thanks, Collin Winter From david.nospam.hopwood at blueyonder.co.uk Thu Jul 27 19:02:05 2006 From: david.nospam.hopwood at blueyonder.co.uk (David Hopwood) Date: Thu, 27 Jul 2006 18:02:05 +0100 Subject: [Python-Dev] Internal namespace proposal In-Reply-To: <44C867D3.90306@canterbury.ac.nz> References: <Pine.LNX.4.58.0607191811400.31087@server1.LFW.org> <bbaeab100607191650j4584de2cge554274e100f125e@mail.gmail.com> <44C822BC.1070204@blueyonder.co.uk> <44C867D3.90306@canterbury.ac.nz> Message-ID: <44C8F18D.5020503@blueyonder.co.uk> Greg Ewing wrote: > David Hopwood wrote: > >> A restricted interpreter refuses access to any object attribute or >> method with a name beginning with '_' (by throwing a new exception type >> 'InternalAccessException'), unless the access is from a method and its >> static target is that method's first argument variable. > > What's to stop > > def my_naughty_method(self): > self = some_protected_object > self._a_special_attribute = some_naughty_value That's a good point -- I didn't describe the intended restriction correctly. The reason for not just saying "... and its target is the object that was passed as that method's first argument", was that I want it to be possible to reject programs with internal accesses that cannot be statically recognized as legal. (This does not mean that a particular implementation would have to rely on static detection of non-internal accesses, or on a trusted compiler.) How about this: A restricted interpreter shall refuse access to any object attribute or method with a name beginning with '_', unless the access is from a method, and its target is the object that was passed as that method's first argument. If such an access is detected at run-time, then it shall be reported by throwing a new exception type 'InternalAccessException'. 
In addition, a program containing an access to an object attribute or method with a name beginning with '_', where the access is not from a method, or the target of the access is not the method's first argument variable, or there is an assignment to that variable in the method, is an illegal program. >> __init__ is an internal method. This is as it should be, because it >> should not be possible to call __init__ on an existing object ... from outside the object, that is ... >> ; only to have __init__ implicitly called when a new object is constructed. > > What about calling an inherited __init__ method? > Your proposed rule would seem to disallow > > BaseClass.__init__(self, ...) No, this call would be allowed because it is to self. -- David Hopwood <david.nospam.hopwood at blueyonder.co.uk> From mattjfleming at googlemail.com Thu Jul 27 23:21:52 2006 From: mattjfleming at googlemail.com (Matt Fleming) Date: Thu, 27 Jul 2006 21:21:52 +0000 Subject: [Python-Dev] uuid test suite failing In-Reply-To: <5ff4a1e50607271022x35175b0coda2148d9d0d1db4b@mail.gmail.com> References: <eaamnu$mrq$1@sea.gmane.org> <eaap2c$al$1@sea.gmane.org> <D59E6AA9-C7D1-4439-9E45-12D11B6296ED@mac.com> <5ff4a1e50607271022x35175b0coda2148d9d0d1db4b@mail.gmail.com> Message-ID: <5ff4a1e50607271421v71cdb8fbt569b9cc6a339ed0f@mail.gmail.com> > On 27/07/06, Ronald Oussoren <ronaldoussoren at mac.com> wrote: > > IIRC at least some versions of HP-UX do not support the -a flag for > > ifconfig, I'll check this tomorrow. > > > > Ronald > > td192> /usr/sbin/ifconfig usage: ifconfig interface [ af [ address [ dest_addr ] ] [ up ] [ down ][ netmask mask ] ] [ metric n ] [ arp | -arp ] [ plumb | unplumb ] td192> /usr/sbin/ifconfig -a ifconfig: no such interface td192> uname -a HP-UX td192 B.11.11 U 9000/800 1839940656 unlimited-user license Also fixed this test on my NetBSD machine by using 'ifconfig -a' and checking for 'address:' in the output. But as Ronald said, not all platforms support the '-a' flag. 
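[Editor's sketch, not the uuid.py implementation.] One way to cope with platforms that disagree about ifconfig flags is to try several argument variants in turn and take the first output that contains something shaped like a MAC address. Everything here is an assumption for illustration: the names parse_mac and first_mac, the regular expression, and the run_ifconfig callable that the caller supplies to actually run the command.

```python
import re

# Assumed pattern: six colon-separated groups of one or two hex digits.
# Real ifconfig output differs between platforms, so this is illustrative.
MAC_RE = re.compile(r"\b([0-9a-fA-F]{1,2}(?::[0-9a-fA-F]{1,2}){5})\b")


def parse_mac(text):
    """Return the first MAC-like address in text as an int, or None."""
    match = MAC_RE.search(text)
    if match is None:
        return None
    groups = match.group(1).split(":")
    return int("".join(g.zfill(2) for g in groups), 16)


def first_mac(run_ifconfig, arg_variants=("", "-a", "-av")):
    """Try each argument variant until one yields a MAC address.

    run_ifconfig is a caller-supplied callable taking the argument string
    and returning the command's output as text (hypothetical interface).
    """
    for args in arg_variants:
        try:
            output = run_ifconfig(args)
        except OSError:
            continue  # command missing or failed to start; try the next
        mac = parse_mac(output)
        if mac is not None:
            return mac
    return None
```

Passing the runner in as a parameter keeps the parsing logic testable without touching the real network configuration.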
Not sure if this will fix the OpenBSD buildbot, I don't have access to an OpenBSD machine. Matt -- http://mattssanctuary.blogspot.com From pje at telecommunity.com Thu Jul 27 23:23:00 2006 From: pje at telecommunity.com (Phillip J. Eby) Date: Thu, 27 Jul 2006 17:23:00 -0400 Subject: [Python-Dev] Release manager pronouncement needed: PEP 302 Fix In-Reply-To: <eaa5rd$pe2$1@sea.gmane.org> References: <20060727103920.GB31912@code0.codespeak.net> <5.1.1.6.0.20060726142456.036d9008@sparrow.telecommunity.com> <20060727103920.GB31912@code0.codespeak.net> Message-ID: <5.1.1.6.0.20060727171324.0262c318@sparrow.telecommunity.com> At 12:52 PM 7/27/2006 +0200, Georg Brandl wrote: >Though beta1 and beta2 shipped with this change nobody reported any bug that >could be linked to it Actually, that's no longer true. See: http://python.org/sf/1529871 This is a user-reported bug against 2.5b2, and I have just confirmed that it is caused by the need-for-speed PEP 302 breakage. Reverting r46372 fixes the reported problem. Of course, since I'm the author of the affected package, I can certainly fix the problem by updating my code. (And in fact, setuptools 0.7a1 works because it uses 2.5's pkgutil when available, and r46372 included a fix for pkgutil.) At this point, I'm going to wait another day for somebody to step up to fix the documentation, or the release manager to authorize inclusion of a proper fix (which is unlikely to be allowed, since it will mean adding two new types to import.c, and restructuring a bit of pkgutil to subclass from them). (It's a nice performance improvement you've got here, I would hate to see anything happen to it...) Personally, I would prefer to see it properly fixed in 2.5 rather than having to rip it out. It's more work for me to create the proper fix than it is to just work around it in my code, but it seems a more righteous labor, if you know what I mean. 
It also means that already-shipped and distributed versions of my code would work with the 2.5 release. From pje at telecommunity.com Thu Jul 27 23:28:23 2006 From: pje at telecommunity.com (Phillip J. Eby) Date: Thu, 27 Jul 2006 17:28:23 -0400 Subject: [Python-Dev] Which version of distutils to ship with Python 2.5? In-Reply-To: <43aa6ff70607271051h3d87b807w5fad3195949d051e@mail.gmail.com> References: <44C85FF2.3030701@v.loewis.de> <43aa6ff70607261924w65e01325o854f87d4c109026a@mail.gmail.com> <44C85FF2.3030701@v.loewis.de> Message-ID: <5.1.1.6.0.20060727172329.028f1178@sparrow.telecommunity.com> At 01:51 PM 7/27/2006 -0400, Collin Winter wrote: >On 7/27/06, "Martin v. Löwis" <martin at v.loewis.de> wrote: > > Collin Winter wrote: > > > Is it intentional that Python 2.5 is (currently) shipping with > > > distutils 2.4.0, while Python 2.4 (at least 2.4.1, 2.4.2 and 2.4.3) > > > shipped with distutils 2.4.1? Judging from my own tests, distutils > > > 2.4.1 fixed several bugs that some of my test suites depend on (the > > > fixes, not the bugs ;). > > > > Are these bugs not fixed in the distutils that shipped with Python 2.5b2? > >I now believe this to be a new regression that I had confused with an >earlier bug report. I've filed a new report, >http://python.org/sf/1529871. I'd appreciate it if anyone could shed >some light on this. As I noted on the bug itself, the problem is due to r46372, a patch introduced by the need-for-speed sprint and which broke CPython's compliance with PEP 302, by introducing non-None, non-importer values into sys.path_importer_cache. At the present time, I have not received an unequivocal response regarding how the problem should be corrected: 1. Update PEP 302 and the "What's New" documentation to reflect this backwards-incompatible change and require authors to update their code to work with Python 2.5 2.
Fix the patch so it implements the speedup in a way that is conformant with PEP 302 as it currently stands (I have volunteered to do this, if the release manager(s) will authorize it) 3. Revert the patch. No one has volunteered for any of the work required for #1. I believe that #3 must be done if #2 is not approved and no volunteers surface for #1 in a timely manner. From barry at python.org Fri Jul 28 01:54:57 2006 From: barry at python.org (Barry Warsaw) Date: Thu, 27 Jul 2006 19:54:57 -0400 Subject: [Python-Dev] Support for PyGetSetDefs in pydoc In-Reply-To: <e9plv5$rfe$1@sea.gmane.org> References: <A027CA35-EE77-4658-BA59-E849368C440B@python.org><EF6E6AF2-654F-4F53-91A8-A6C9B8A80745@python.org><44B64782.9010706@gmail.com><7FE62713-24B0-47CE-BA94-41150DB683F4@python.org> <AB1C2209-329E-4F3D-8B7A-2548A40F892A@python.org> <e9plv5$rfe$1@sea.gmane.org> Message-ID: <E666C3A3-9C45-41C7-B25A-C9CB244485B5@python.org> -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 Since Anthony didn't speak up, I took his silence as assent and went ahead and committed the changes. r50881 and r50885 for *nix and Windows, just in case the deafening silence turns into a howl of derision :). - -Barry -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.3 (Darwin) iQCVAwUBRMlSV3EjvBPtnXfVAQL3WQP9H2RBIDG3FCEkzHjzmwyRWl4HU467yWMQ bse0/XhUEAQHivwP2nLvAqn+Qrb8XaXIT3n5i9++saMFtxjTdfMJX2ZNBK+0JmVl N+XvhTIXIu9XJy47c4FsZ6tbfHVSKQ3KRaE81sfMYuKQsPCnB9cNskKEJEpaS0Cy F7GmpdE96sM= =T3Ia -----END PGP SIGNATURE----- From tim.peters at gmail.com Fri Jul 28 02:46:30 2006 From: tim.peters at gmail.com (Tim Peters) Date: Thu, 27 Jul 2006 20:46:30 -0400 Subject: [Python-Dev] [Windows, buildbot] kill_python.c mystery In-Reply-To: <44C85E07.3060707@v.loewis.de> References: <1f7befae0607261702o6d943f52x701be0d402345f65@mail.gmail.com> <44C85E07.3060707@v.loewis.de> Message-ID: <1f7befae0607271746y7bca18f2s439e8b5f996f132d@mail.gmail.com> [Martin v.
Löwis] > Didn't you know that you signed in to run arbitrary viruses, worms, and > trojan horses when you added your machine to the buildbot infrastructure > :-? Hey, I signed up for that when I bought a Windows box :-) > You just haven't seen buildbot erasing your hard disk and filling > your coffee machine with tea, yet. Not the buildbot, no, but visiting random web pages does that routinely. >> (strstr(path, "build\\python.exe") != NULL)) { >> Why is the second clause there? > That's for Cygwin (i.e. Anthony Baxter's machine). As Neal suggests, > preceding the executable path with another backslash should solve > this problem. And he checked that in, and I haven't noticed another similar problem since (but then it was rare to begin with). > As a related note, this entire approach will also manage to kill > python.exe from an unrelated buildbot installation, e.g. a 2.4 > build job might kill python.exe from the trunk. This actually helped > when I tried to get the Cygwin slave to get unstuck, and shouldn't > do harm since we currently don't run two builds on the same slave > simultaneously, but could be surprising when parallel builds > are activated some day. I don't think we /can/ yet -- I believe some tests T exist that implicitly assume only one instance of T is running. I don't recall details, although I'm sure we'll bump into them sporadically the instant parallel builds are enabled. As a purely pragmatic matter, I expect my hard drive would quickly be reduced to dust if two instances of test_largefile ran simultaneously (Windows XP writes physical zeroes in the entire multi-GB file, and that takes finite time only if the disk head doesn't have to keep seeking). > Sorry for messing with your machine, No problem!
That's what it's here for :-) From greg.ewing at canterbury.ac.nz Fri Jul 28 02:51:43 2006 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Fri, 28 Jul 2006 12:51:43 +1200 Subject: [Python-Dev] Internal namespace proposal In-Reply-To: <44C8D929.2010508@blueyonder.co.uk> References: <Pine.LNX.4.58.0607191811400.31087@server1.LFW.org> <bbaeab100607191650j4584de2cge554274e100f125e@mail.gmail.com> <44C822BC.1070204@blueyonder.co.uk> <20060727113016.GC31912@code0.codespeak.net> <44C8D929.2010508@blueyonder.co.uk> Message-ID: <44C95F9F.2090307@canterbury.ac.nz> David Hopwood wrote: > Inheritance should be defined as though the code of inherited methods and > attributes were copied into the subclass (with global accesses updated to > point to the original module). You'll have to propose an implementation strategy for that which works without actually copying all the code, though. > Since x._hidden is not accessible from attack.py, the attack fails. But if _hidden were an attribute of the A instance that you were trying to protect, it would succeed. So you can't actually protect any direct attribute of a class that can be subclassed. Which means we're back to the situation of having to prevent access to class objects. 
-- Greg From david.nospam.hopwood at blueyonder.co.uk Fri Jul 28 03:40:53 2006 From: david.nospam.hopwood at blueyonder.co.uk (David Hopwood) Date: Fri, 28 Jul 2006 02:40:53 +0100 Subject: [Python-Dev] Internal namespace proposal In-Reply-To: <44C95F9F.2090307@canterbury.ac.nz> References: <Pine.LNX.4.58.0607191811400.31087@server1.LFW.org> <bbaeab100607191650j4584de2cge554274e100f125e@mail.gmail.com> <44C822BC.1070204@blueyonder.co.uk> <20060727113016.GC31912@code0.codespeak.net> <44C8D929.2010508@blueyonder.co.uk> <44C95F9F.2090307@canterbury.ac.nz> Message-ID: <44C96B25.5000709@blueyonder.co.uk> Greg Ewing wrote: > David Hopwood wrote: > >> Inheritance should be defined as though the code of inherited methods and >> attributes were copied into the subclass (with global accesses updated to >> point to the original module). > > You'll have to propose an implementation strategy for that > which works without actually copying all the code, though. The only difference between the copying semantics and the current semantics, is in the visibility of module-global internal variables and functions. It's sufficient to keep track of whether each class could access a variable or function that is internal to its module. If it does, then it cannot be subclassed from a different module. (It must not be possible to access internal variables/ functions reflectively.) The effect of this is that if a programmer intends a class to be subclassable from outside the module, they must make sure that all of the variables/functions it depends on are public. Anyone performing a security review of the module then does not have to consider inheritance from a different module when deciding which variables/functions might be accessible. There is a slightly more flexible version of this approach that is just as secure, provided that functions are restricted to be stateless. 
If it is possible to prove statically that a class does not access any internal *variables* of a module (regardless of whether it accesses internal functions), then it is safe to allow the class to be subclassed from another module. This is because the subclassing module could have copied all of the code of the original module (assuming it is written purely in Python); the only possible sources of authority in a capability language are from access to variables or primitives, not code. If a class is not written in Python, then we cannot analyse whether it accesses internal variables. In that case the class will be part of the TCB, and we have to trust the class writer to mark whether it can be safely subclassed from another module. >> Since x._hidden is not accessible from attack.py, the attack fails. > > But if _hidden were an attribute of the A instance that > you were trying to protect, it would succeed. No, attack.py could only access a _hidden attribute in an instance of B. This is harmless, because it could just as well define the _hidden attribute of B itself, rather than by subclassing. 
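[Editor's sketch, not part of David's proposal.] The static half of the rule discussed in this thread (an access to a '_'-prefixed attribute is legal only from a method, on that method's first argument variable, and only if that variable is never reassigned) can be approximated with the standard ast module. The function name illegal_internal_accesses is hypothetical, and the check is deliberately simplified: it ignores nested functions and async methods, and it flags BaseClass.__init__(self, ...) even though David says that call should be allowed.

```python
import ast


def illegal_internal_accesses(source):
    """Return sorted (lineno, attr) pairs for '_' attribute accesses that
    this simplified reading of the proposed rule would reject."""
    tree = ast.parse(source)
    allowed = set()  # ids of Attribute nodes judged legal
    for cls in ast.walk(tree):
        if not isinstance(cls, ast.ClassDef):
            continue
        for func in cls.body:
            if not isinstance(func, ast.FunctionDef):
                continue
            params = func.args.posonlyargs + func.args.args
            if not params:
                continue
            first = params[0].arg
            nodes = list(ast.walk(func))
            # Any assignment to (or deletion of) the first argument
            # variable makes every internal access in the method illegal.
            reassigned = any(
                isinstance(n, ast.Name)
                and n.id == first
                and isinstance(n.ctx, (ast.Store, ast.Del))
                for n in nodes
            )
            if reassigned:
                continue
            for n in nodes:
                if (
                    isinstance(n, ast.Attribute)
                    and isinstance(n.value, ast.Name)
                    and n.value.id == first
                ):
                    allowed.add(id(n))
    return sorted(
        (n.lineno, n.attr)
        for n in ast.walk(tree)
        if isinstance(n, ast.Attribute)
        and n.attr.startswith("_")
        and id(n) not in allowed
    )
```

Run against Greg's my_naughty_method example, a check of this kind rejects the access because self is assigned to inside the method, which is the case the revised wording was meant to cover.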
-- David Hopwood <david.nospam.hopwood at blueyonder.co.uk> From david.nospam.hopwood at blueyonder.co.uk Fri Jul 28 03:55:58 2006 From: david.nospam.hopwood at blueyonder.co.uk (David Hopwood) Date: Fri, 28 Jul 2006 02:55:58 +0100 Subject: [Python-Dev] Internal namespace proposal In-Reply-To: <4BE3C1EC-8DE0-4518-A8B6-68F98477433B@commonground.com.au> References: <Pine.LNX.4.58.0607191811400.31087@server1.LFW.org> <bbaeab100607191650j4584de2cge554274e100f125e@mail.gmail.com> <44C822BC.1070204@blueyonder.co.uk> <4BE3C1EC-8DE0-4518-A8B6-68F98477433B@commonground.com.au> Message-ID: <44C96EAE.6040907@blueyonder.co.uk> Richard Jones wrote: > On 27/07/2006, at 12:19 PM, David Hopwood wrote: > >> A restricted interpreter refuses access to any object attribute or >> method with a name beginning with '_' (by throwing a new exception type >> 'InternalAccessException'), unless the access is from a method and its >> static target is that method's first argument variable. >> >> Also, a restricted interpreter refuses access to any module-global >> variable or module-global function with a name beginning with '_' (by >> throwing 'InternalAccessException'), unless the access is statically >> from the same module. > > Note that this is a rule that Zope enforces in its restricted environment. Is that documented anywhere? -- David Hopwood <david.nospam.hopwood at blueyonder.co.uk> From tim.peters at gmail.com Fri Jul 28 05:21:15 2006 From: tim.peters at gmail.com (Tim Peters) Date: Thu, 27 Jul 2006 23:21:15 -0400 Subject: [Python-Dev] how about adding ping's uuid module to the standard lib ? In-Reply-To: <dui1u8$ge1$1@sea.gmane.org> References: <dui1u8$ge1$1@sea.gmane.org> Message-ID: <1f7befae0607272021k17b65a88g356c781c34967cc7@mail.gmail.com> Georg discovered that test_uuid didn't run any tests, and fixed that on Thursday. A number of buildbots have failed that test since then. My XP box appears unique among the Windows buildbots in failing. 
It always fails like so: AssertionError: different sources disagree on node: from source 'getnode1', node was 00038a000015 from source 'getnode2', node was 00038a000015 from source 'ipconfig', node was 001111b2b7bf 0x00038a000015 /is/ the last 6 bytes returned by the Windows UuidCreateSequential() on my box. I confirmed that by writing a C program calling it directly. However, it doesn't appear to correspond to any MAC address of any HW on my box. All documented ways of determining the MAC address of my Ethernet card (using UuidCreateSequential for this appears to be folklore rather than documented behavior) agree that 0x001111b2b7bf is correct on this box; e.g., $ getmac /fo list /v Connection Name: Local Area Connection Network Adapter: Marvell Yukon 88E8050 PCI-E ASF Gigabit Ethernet Controller Physical Address: 00-11-11-B2-B7-BF Transport Name: \Device\Tcpip_... Connection Name: 1394 Connection Network Adapter: 1394 Net Adapter Physical Address: 62-A1-AC-6C-FD-BE Transport Name: \Device\Tcpip_... Connection Name: 1394 Connection 2 Network Adapter: 1394 Net Adapter Physical Address: E2-1F-01-C6-5D-88 Transport Name: \Device\Tcpip_... The last two are for firewire interfaces, and don't match the purported MAC address extracted from UuidCreateSequential's output anyway. So, at least on my box, this comment in uuid.py is incorrect (UuidCreateSequential does not behave as it says): # On Windows prior to 2000, UuidCreate gives a UUID containing the # hardware address. On Windows 2000 and later, UuidCreate makes a # random UUID and UuidCreateSequential gives a UUID containing the # hardware address. ... Unfortunately, uuid.getnode() tries things in this order on Windows: getters = [_windll_getnode, _netbios_getnode, _ipconfig_getnode] It's only the first one that returns the bogus 0x00038a000015; both of the latter return 0x001111B2B7BF. However, there's nothing I can do to that list to make test_uuid pass on this box. 
It wants to insist that all three ways of getting/guessing the MAC address return the same thing, and that's never going to happen here. Given that _windll_getnode's actual behavior appears to have nothing in common with what was expected for it here, best suggestion I can make is to throw its code away. From tim.peters at gmail.com Fri Jul 28 06:27:51 2006 From: tim.peters at gmail.com (Tim Peters) Date: Fri, 28 Jul 2006 00:27:51 -0400 Subject: [Python-Dev] how about adding ping's uuid module to the standard lib ? In-Reply-To: <1f7befae0607272021k17b65a88g356c781c34967cc7@mail.gmail.com> References: <dui1u8$ge1$1@sea.gmane.org> <1f7befae0607272021k17b65a88g356c781c34967cc7@mail.gmail.com> Message-ID: <1f7befae0607272127u544fac8eq9608e95938ca2c55@mail.gmail.com> [Tim] > ... uuid.getnode() tries things in this order on Windows: > > getters = [_windll_getnode, _netbios_getnode, _ipconfig_getnode] > > It's only the first one that returns the bogus 0x00038a000015; both of > the latter return 0x001111B2B7BF [the correct MAC address for my > network card]. That was on my desktop XP Pro SP2 box. On my similar laptop box, it's quite different: _windll_getnode and _ipconfig_getnode return the MAC address of my wireless Ethernet adapter, while _netbios_getnode returns the MAC address of my LAN Ethernet card. > However, there's nothing I can do to that list to make test_uuid pass > on this box. It wants to insist that all three ways of > getting/guessing the MAC address return the same thing, and that's > never going to happen here. Or on my laptop, but for different reasons there. > Given that _windll_getnode's actual behavior appears to have nothing > in common with what was expected for it here, best suggestion I can > make is to throw its code away. Which wouldn't improve things on my laptop. Best next ;-) suggestion is to change test_uuid to stop believing that uuid.py knows multiple ways to find a well-defined MAC address. 
I'm going to make that change -- someone who hates that can revert it after they buy me two new computers that work the way they think computers should work ;-) From nnorwitz at gmail.com Fri Jul 28 06:49:48 2006 From: nnorwitz at gmail.com (Neal Norwitz) Date: Thu, 27 Jul 2006 21:49:48 -0700 Subject: [Python-Dev] Release manager pronouncement needed: PEP 302 Fix In-Reply-To: <5.1.1.6.0.20060727171324.0262c318@sparrow.telecommunity.com> References: <5.1.1.6.0.20060726142456.036d9008@sparrow.telecommunity.com> <20060727103920.GB31912@code0.codespeak.net> <eaa5rd$pe2$1@sea.gmane.org> <5.1.1.6.0.20060727171324.0262c318@sparrow.telecommunity.com> Message-ID: <ee2a432c0607272149xc22baf4u2945da575237a83@mail.gmail.com> On 7/27/06, Phillip J. Eby <pje at telecommunity.com> wrote: > > Personally, I would prefer to see it properly fixed in 2.5 rather than > having to rip it out. It's more work for me to create the proper fix than > it is to just work around it in my code, but it seems a more righteous > labor, if you know what I mean. It also means that already-shipped and > distributed versions of my code would work with the 2.5 release. Based on this comment, is it really acceptable to just document a behaviour change? ISTM there should really only be 2 choices: fix 2.5 properly or revert the change. This seemed to be Armin's position. n From pje at telecommunity.com Fri Jul 28 06:56:41 2006 From: pje at telecommunity.com (Phillip J.
Eby) Date: Fri, 28 Jul 2006 00:56:41 -0400 Subject: [Python-Dev] Release manager pronouncement needed: PEP 302 Fix In-Reply-To: <ee2a432c0607272149xc22baf4u2945da575237a83@mail.gmail.com> References: <5.1.1.6.0.20060727171324.0262c318@sparrow.telecommunity.com> <5.1.1.6.0.20060726142456.036d9008@sparrow.telecommunity.com> <20060727103920.GB31912@code0.codespeak.net> <eaa5rd$pe2$1@sea.gmane.org> <5.1.1.6.0.20060727171324.0262c318@sparrow.telecommunity.com> Message-ID: <5.1.1.6.0.20060728005338.02a25860@sparrow.telecommunity.com> At 09:49 PM 7/27/2006 -0700, Neal Norwitz wrote: >On 7/27/06, Phillip J. Eby <pje at telecommunity.com> wrote: >> >>Personally, I would prefer to see it properly fixed in 2.5 rather than >>having to rip it out. It's more work for me to create the proper fix than >>it is to just work around it in my code, but it seems a more righteous >>labor, if you know what I mean. It also means that already-shipped and >>distributed versions of my code would work with the 2.5 release. > >Based on this comment, is it really acceptable to just document a >behaviour change? ISTM there should really only be 2 choices: fix >2.5 properly or revert the change. This seemed to be Armin's >position. Well, it's a moot question since nobody has volunteered to update the docs. Fixing it and reverting it are the only options unless somebody steps up to do the doc work. I'll happily fix it or revert it, just tell me which one is the approved course of action and I'll get started. :) From rhettinger at ewtllc.com Fri Jul 28 07:32:52 2006 From: rhettinger at ewtllc.com (Raymond Hettinger) Date: Thu, 27 Jul 2006 22:32:52 -0700 Subject: [Python-Dev] Py2.5 release schedule Message-ID: <44C9A184.7030701@ewtllc.com> I suggest that there be a third beta release and that we then wait just a bit before going final. The bugs that were found and fixed in the first two beta releases suggest that Py2.5 is not yet as stable as we would like. 
Over the next few days, I'll try to run it on as much third-party code as possible. That would have detected the recently surfaced grammar error a little bit earlier (the one where "for x, in listOfTuples" would not unpack). The release process itself is going well but I don't think the pervasive AST changes have been fully shaken-out yet. Raymond From martin at v.loewis.de Fri Jul 28 07:33:37 2006 From: martin at v.loewis.de (=?ISO-8859-1?Q?=22Martin_v=2E_L=F6wis=22?=) Date: Fri, 28 Jul 2006 07:33:37 +0200 Subject: [Python-Dev] Release manager pronouncement needed: PEP 302 Fix In-Reply-To: <5.1.1.6.0.20060727114924.0262c980@sparrow.telecommunity.com> References: <20060727103920.GB31912@code0.codespeak.net> <5.1.1.6.0.20060726142456.036d9008@sparrow.telecommunity.com> <20060727103920.GB31912@code0.codespeak.net> <5.1.1.6.0.20060727114924.0262c980@sparrow.telecommunity.com> Message-ID: <44C9A1B1.3040102@v.loewis.de> Phillip J. Eby wrote: > I'm willing to write code that makes it PEP 302 compliant, if the release > manager will bless such an addition. But if that's not acceptable, then > somebody needs to produce the necessary documentation updates or revert the > patch. It absolutely should not be allowed to remain in *and* undocumented > because it is a backwards-incompatible change to documented behavior of > Python for two major releases (2.3 and 2.4). You don't need a release manager pronouncement for that. It's a bug, changing it is a bug fix, you don't need RM permission to fix a bug. Do you have a patch ready that restores path_importer_cache behavior, yet preserves the property that it caches existence of a directory? If not, I will have to produce one. 
Regards, Martin From anthony at interlink.com.au Fri Jul 28 07:39:48 2006 From: anthony at interlink.com.au (Anthony Baxter) Date: Fri, 28 Jul 2006 15:39:48 +1000 Subject: [Python-Dev] Py2.5 release schedule In-Reply-To: <44C9A184.7030701@ewtllc.com> References: <44C9A184.7030701@ewtllc.com> Message-ID: <200607281539.50794.anthony@interlink.com.au> On Friday 28 July 2006 15:32, Raymond Hettinger wrote: > I suggest that there be a third beta release and that we then wait > just a bit before going final. > > The bugs that were found and fixed in the first two beta releases > suggest that Py2.5 is not yet as stable as we would like. Over the > next few days, I'll try to run it on as much third-party code as > possible. That would have detected the recently surfaced grammar > error a little bit earlier (the one where "for x, in listOfTuples" > would not unpack). > > The release process itself is going well but I don't think the > pervasive AST changes have been fully shaken-out yet. I've been thinking the same thing, too. A quick chat to Neal says that he also agrees. There's still a lot more bugs popping up than I'm really comfortable with. I guess this is inevitable - there's a lot of new stuff in 2.5. Does anyone disagree with making the next release beta3? Anthony -- Anthony Baxter <anthony at interlink.com.au> It's never too late to have a happy childhood. From greg at electricrain.com Fri Jul 28 07:54:39 2006 From: greg at electricrain.com (Gregory P. Smith) Date: Thu, 27 Jul 2006 22:54:39 -0700 Subject: [Python-Dev] [Windows, buildbot] kill_python.c mystery In-Reply-To: <44C85E07.3060707@v.loewis.de> References: <1f7befae0607261702o6d943f52x701be0d402345f65@mail.gmail.com> <44C85E07.3060707@v.loewis.de> Message-ID: <20060728055439.GD22921@zot.electricrain.com> > Didn't you know that you signed in to run arbitrary viruses, worms, and > trojan horses when you added your machine to the buildbot infrastructure > :-? 
You just haven't seen buildbot erasing your hard disk and filling > your coffee machine with tea, yet. VMware Server is free. Run buildbots in a VM. (but don't assume a VM protects you from trojans that are designed to break out of it) -g From nnorwitz at gmail.com Fri Jul 28 09:24:52 2006 From: nnorwitz at gmail.com (Neal Norwitz) Date: Fri, 28 Jul 2006 00:24:52 -0700 Subject: [Python-Dev] uuid test suite failing In-Reply-To: <20060727163647.GA4175@rogue.amk.ca> References: <eaamnu$mrq$1@sea.gmane.org> <20060727163647.GA4175@rogue.amk.ca> Message-ID: <ee2a432c0607280024j56bf7facmd9ebdd26aafc013f@mail.gmail.com> On 7/27/06, A.M. Kuchling <amk at amk.ca> wrote: > On Thu, Jul 27, 2006 at 05:40:57PM +0200, Georg Brandl wrote: > > The UUID test suite, which wasn't run by regrtest.py until now, is > > now failing on some buildbots (and my machine). This should be fixed > > before releasing something. > > Looking at the failures, there seem to be two problems on Unix variants: > 1) on some, '/sbin/ifconfig' prints a help message; you need 'ifconfig -a' > to print information about all interfaces. > 2) on Solaris 9 (the only version in the SF compile farm), I can't > figure out how to make ifconfig print MAC addresses at all. > Searching online finds the incantation 'arp <hostname>' to print the > MAC. This is such a mess. There are so many different ways of determining the MAC addr on each flavour of Unix it seems hopeless to try. I fixed _ifconfig_getnode so it should work on at least: Linux, Tru64, Solaris, and HP-UX. Who knows how many more variations there are. This only fixes 1 of the 2 failures in test_uuid. The other one is due to _unixdll_getnode() failing. This is because _uuid_generate_time is None because we couldn't find it in the uuid library. This is just broken, not sure if it's the code or the test though. We should handle the case if _uuid_generate_time and the others are None better. I don't know what to do in this case. 
Since getnode ignores exceptions, maybe it's the test that is broken? n From mwh at python.net Fri Jul 28 11:53:47 2006 From: mwh at python.net (Michael Hudson) Date: Fri, 28 Jul 2006 10:53:47 +0100 Subject: [Python-Dev] Py2.5 release schedule In-Reply-To: <200607281539.50794.anthony@interlink.com.au> (Anthony Baxter's message of "Fri, 28 Jul 2006 15:39:48 +1000") References: <44C9A184.7030701@ewtllc.com> <200607281539.50794.anthony@interlink.com.au> Message-ID: <2mirlixdg4.fsf@starship.python.net> Anthony Baxter <anthony at interlink.com.au> writes: > On Friday 28 July 2006 15:32, Raymond Hettinger wrote: >> I suggest that there be a third beta release and that we then wait >> just a bit before going final. >> >> The bugs that were found and fixed in the first two beta releases >> suggest that Py2.5 is not yet as stable as we would like. Over the >> next few days, I'll try to run it on as much third-party code as >> possible. That would have detected the recently surfaced grammar >> error a little bit earlier (the one where "for x, in listOfTuples" >> would not unpack). >> >> The release process itself is going well but I don't think the >> pervasive AST changes have been fully shaken-out yet. > > I've been thinking the same thing, too. A quick chat to Neal says that > he also agrees. > > There's still a lot more bugs popping up than I'm really comfortable > with. I guess this is inevitable - there's a lot of new stuff in 2.5. > > Does anyone disagree with making the next release beta3? It seems like a good idea to me. I guess this will mean the final release will be pushed back a bit? Cheers, mwh -- Gevalia is undrinkable low-octane see-through only slightly roasted bilge water. Compared to .us coffee it is quite drinkable. 
-- Måns Nilsson, asr

From arigo at tunes.org Fri Jul 28 12:11:33 2006
From: arigo at tunes.org (Armin Rigo)
Date: Fri, 28 Jul 2006 12:11:33 +0200
Subject: [Python-Dev] Bad interaction of __index__ and sequence repeat
Message-ID: <20060728101133.GA339@code0.codespeak.net>

Hi,

There is an oversight in the design of __index__() that only just
surfaced :-(  It is responsible for the following behavior, on a 32-bit
machine with >= 2GB of RAM:

    >>> s = 'x' * (2**100)       # works!
    >>> len(s)
    2147483647

This is because PySequence_Repeat(v, w) works by applying w.__index__ in
order to call v->sq_repeat.  However, __index__ is defined to clip the
result to fit in a Py_ssize_t.  This means that the above problem exists
with all sequences, not just strings, given enough RAM to create such
sequences with 2147483647 items.

For reference, in 2.4 we correctly get an OverflowError.

Argh!  What should be done about it?


A bientot,

Armin.

From barry at python.org Fri Jul 28 14:57:06 2006
From: barry at python.org (Barry Warsaw)
Date: Fri, 28 Jul 2006 08:57:06 -0400
Subject: [Python-Dev] Py2.5 release schedule
In-Reply-To: <200607281539.50794.anthony@interlink.com.au>
References: <44C9A184.7030701@ewtllc.com> <200607281539.50794.anthony@interlink.com.au>
Message-ID: <BBBDED8E-ACDD-4176-985A-144188B52F1E@python.org>

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

On Jul 28, 2006, at 1:39 AM, Anthony Baxter wrote:

> I've been thinking the same thing, too. A quick chat to Neal says that
> he also agrees.
>
> There's still a lot more bugs popping up than I'm really comfortable
> with. I guess this is inevitable - there's a lot of new stuff in 2.5.
>
> Does anyone disagree with making the next release beta3?

+1.  It would give me more time to port and test a few of my
applications to the new version.

FWIW, our commercial app went pretty smoothly, mostly dealing with
Py_ssize_t conversions and adopting the new PySet C API.
I'm still working on Mailman but the most painful thing so far has been the conversion of exceptions to new-style classes, and even that wasn't / too/ painful. I've only done limited testing of both, but I'm encouraged that the porting effort will be minor (probably no more than a day's worth of work for both apps combined). - -Barry -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.3 (Darwin) iQCVAwUBRMoJonEjvBPtnXfVAQJOjAP9HRbakdb39IOtFxGX/wP4QhiXAcNAbXXM hGFIJ6vC0Gp/SSlTVMYPF5oJMIzuDCIDzs4Nrbgysgfj6Ehyphei/ed8W94PHLat nh54Y0N5pvwLelHW6ChJBcIxulU8Fuj0Z9kIZCLiryTOAyXTh+t3+gZPEzWRY/tY v2hd9ERXDl8= =vK7s -----END PGP SIGNATURE----- From tomerfiliba at gmail.com Fri Jul 28 15:35:27 2006 From: tomerfiliba at gmail.com (tomer filiba) Date: Fri, 28 Jul 2006 15:35:27 +0200 Subject: [Python-Dev] patching pydoc? Message-ID: <1d85506f0607280635q3a693682l230c7821dc6f408f@mail.gmail.com> i have a problem with pydoc in rpyc. i wanted help(obj), where obj is a NetProxy object, to work as if it were local. i followed the code starting from site.help to pydoc.doc, which is the ultimate function that generates and prints the text. i expected there would be some function in the middle that prepares the text, and another that writes it to the pager, but to my disappointment pydoc.doc does both. this means i can't transfer the document to my local machine (it's printed directly to the remote console). therefore, i would like to split this behavior into two parts: * render_doc - a function that returns the document text * doc - a function that calls render_doc and sends it to the pager this way no existing code breaks (no existing function signatures are changed) and i gain help on remote objects. i hope people would be in favor, as it's not such a big change anyway. is it possible to add to 2.5? 
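
The split proposed here can be exercised as follows. This is a minimal sketch that assumes a pydoc module exposing render_doc with the signature suggested in this message (later Python releases do ship such a function); greet and fetch_help_text are hypothetical names used only for illustration.

```python
import pydoc


def greet(name):
    """Return a short greeting for name."""
    return "Hello, %s" % name


def fetch_help_text(obj):
    """Return the help text for obj as a string instead of paging it.

    Hypothetical helper: a remote proxy could call this on the far side
    and ship the resulting string back to the local pager.
    """
    # render_doc only builds the text; nothing is written to a console here.
    rendered = pydoc.render_doc(obj)
    # The text renderer marks bold with backspace overstrikes; plain() strips them.
    return pydoc.plain(rendered)


help_text = fetch_help_text(greet)
```

Because the text comes back as an ordinary string, the caller is free to send it over a connection and hand it to whatever pager it likes, which is the use case described above.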
-tomer this is the code of pydoc, starting at line 1457 <<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<< def doc(thing, title='Python Library Documentation: %s', forceload=0): """Display text documentation, given an object or a path to an object.""" try: object, name = resolve(thing, forceload) desc = describe(object) module = inspect.getmodule(object) if name and '.' in name: desc += ' in ' + name[:name.rfind('.')] elif module and module is not object: desc += ' in module ' + module.__name__ if not (inspect.ismodule(object) or inspect.isclass(object) or inspect.isroutine(object) or isinstance(object, property)): # If the passed object is a piece of data or an instance, # document its available methods instead of its value. object = type(object) desc += ' object' pager(title % desc + '\n\n' + text.document(object, name)) except (ImportError, ErrorDuringImport), value: print value >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> this is the suggested code <<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<< def render_doc(thing, title='Python Library Documentation: %s', forceload=0): """generate the text""" object, name = resolve(thing, forceload) desc = describe(object) module = inspect.getmodule(object) if name and '.' in name: desc += ' in ' + name[:name.rfind('.')] elif module and module is not object: desc += ' in module ' + module.__name__ if not (inspect.ismodule(object) or inspect.isclass(object) or inspect.isroutine(object) or isinstance(object, property)): # If the passed object is a piece of data or an instance, # document its available methods instead of its value. 
object = type(object) desc += ' object' return title % desc + '\n\n' + text.document(object, name) def doc(*args, **kwargs): """Display text documentation, given an object or a path to an object.""" try: text = render_doc(*args, **kwargs) pager(text) except (ImportError, ErrorDuringImport), value: print value >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> From skip at pobox.com Fri Jul 28 16:02:10 2006 From: skip at pobox.com (skip at pobox.com) Date: Fri, 28 Jul 2006 09:02:10 -0500 Subject: [Python-Dev] Another uuid problem Message-ID: <17610.6370.422365.622901@montanaro.dyndns.org> I just tried building and testing 2.5 on a Solaris 10 box at work. The uuid test failed with sh: ifconfig: not found sh: ifconfig: not found sh: ifconfig: not found sh: arp: not found sh: ifconfig: not found sh: ifconfig: not found sh: ifconfig: not found sh: arp: not found In our environment at least it's uncommon for /usr/sbin to be in the PATH of non-privileged users: piggy:% type -a ifconfig -bash: type: ifconfig: not found piggy:% PATH=$PATH:/usr/sbin type -a ifconfig ifconfig is /usr/sbin/ifconfig Perhaps test_uuid needs to do a little investigation to find ifconfig and arp. Skip From david.nospam.hopwood at blueyonder.co.uk Fri Jul 28 14:50:51 2006 From: david.nospam.hopwood at blueyonder.co.uk (David Hopwood) Date: Fri, 28 Jul 2006 13:50:51 +0100 Subject: [Python-Dev] Bad interaction of __index__ and sequence repeat In-Reply-To: <20060728101133.GA339@code0.codespeak.net> References: <20060728101133.GA339@code0.codespeak.net> Message-ID: <44CA082B.6030908@blueyonder.co.uk> Armin Rigo wrote: > Hi, > > There is an oversight in the design of __index__() that only just > surfaced :-( It is responsible for the following behavior, on a 32-bit > machine with >= 2GB of RAM: > > >>> s = 'x' * (2**100) # works! > >>> len(s) > 2147483647 > > This is because PySequence_Repeat(v, w) works by applying w.__index__ in > order to call v->sq_repeat. 
However, __index__ is defined to clip the > result to fit in a Py_ssize_t. Clipping the result sounds like it would *never* be a good idea. What was the rationale for that? It should throw an exception. -- David Hopwood <david.nospam.hopwood at blueyonder.co.uk> From grig.gheorghiu at gmail.com Fri Jul 28 16:46:55 2006 From: grig.gheorghiu at gmail.com (Grig Gheorghiu) Date: Fri, 28 Jul 2006 07:46:55 -0700 Subject: [Python-Dev] uuid test suite failing In-Reply-To: <ee2a432c0607280024j56bf7facmd9ebdd26aafc013f@mail.gmail.com> References: <eaamnu$mrq$1@sea.gmane.org> <20060727163647.GA4175@rogue.amk.ca> <ee2a432c0607280024j56bf7facmd9ebdd26aafc013f@mail.gmail.com> Message-ID: <3f09d5a00607280746t57fc51b6m95cc8e1d1c927035@mail.gmail.com> On 7/28/06, Neal Norwitz <nnorwitz at gmail.com> wrote: > > On 7/27/06, A.M. Kuchling <amk at amk.ca> wrote: > > On Thu, Jul 27, 2006 at 05:40:57PM +0200, Georg Brandl wrote: > > > The UUID test suite, which wasn't run by regrtest.py until now, is > > > now failing on some buildbots (and my machine). This should be fixed > > > before releasing something. > > > > Looking at the failures, there seem to be two problems on Unix variants: > > 1) on some, '/sbin/ifconfig' prints a help message; you need 'ifconfig > -a' > > to print information about all interfaces. > > 2) on Solaris 9 (the only version in the SF compile farm), I can't > > figure out how to make ifconfig print MAC addresses at all. > > Searching online finds the incantation 'arp <hostname>' to print the > > MAC. > > This is such a mess. There are so many different ways of determining > the MAC addr on each flavour of Unix it seems hopeless to try. I > fixed _ifconfig_getnode so it should work on at least: Linux, Tru64, > Solaris, and HP-UX. Who knows how many more variations there are. > > This only fixes 1 of the 2 failures in test_uuid. The other one is > due to _unixdll_getnode() failing. 
> This is because _uuid_generate_time is None because we couldn't find it
> in the uuid library.  This is just broken, not sure if it's the code or
> the test though.  We should handle the case if _uuid_generate_time and
> the others are None better.  I don't know what to do in this case.
>
> Since getnode ignores exceptions, maybe it's the test that is broken?

My 2 cents: since there is no POSIX standard for getting a list of
network interfaces, trying to account for all the platform variations in
one central location is hopeless. Instead, I think the onus should be on
whoever is testing this on a particular platform -- in short, on the
buildbot maintainer on that platform. There could be another
regrtest.py-type suite, something like platform_regrtest.py for example,
which could be composed of highly platform-dependent tests such as
test_uuid.py. These tests would have empty methods such as
_ifconfig_getnode, which would then be defined on a per-platform basis by
the buildbot maintainer on that platform. The test would obviously fail
by default, unless those methods are properly defined. Or these methods
could account for just one platform, as an example of what to do on other
platforms.

Grig

From mwh at python.net Fri Jul 28 17:26:37 2006
From: mwh at python.net (Michael Hudson)
Date: Fri, 28 Jul 2006 16:26:37 +0100
Subject: [Python-Dev] Bad interaction of __index__ and sequence repeat
In-Reply-To: <44CA082B.6030908@blueyonder.co.uk> (David Hopwood's message of "Fri, 28 Jul 2006 13:50:51 +0100")
References: <20060728101133.GA339@code0.codespeak.net> <44CA082B.6030908@blueyonder.co.uk>
Message-ID: <2mac6tyclu.fsf@starship.python.net>

David Hopwood <david.nospam.hopwood at blueyonder.co.uk> writes:

> Armin Rigo wrote:
>> Hi,
>>
>> There is an oversight in the design of __index__() that only just
>> surfaced :-(  It is responsible for the following behavior, on a 32-bit
>> machine with >= 2GB of RAM:
>>
>>     >>> s = 'x' * (2**100)       # works!
>>     >>> len(s)
>>     2147483647
>>
>> This is because PySequence_Repeat(v, w) works by applying w.__index__ in
>> order to call v->sq_repeat.  However, __index__ is defined to clip the
>> result to fit in a Py_ssize_t.
>
> Clipping the result sounds like it would *never* be a good idea. What was
> the rationale for that? It should throw an exception.

Why would you expect range(10)[:2**32-1] and range(10)[:2**32] to do
different things?

Cheers,
mwh

-- 
  This makes it possible to pass complex object hierarchies to
  a C coder who thinks computer science has made no worthwhile
  advancements since the invention of the pointer.
-- Gordon McMillan, 30 Jul 1998 From ncoghlan at gmail.com Fri Jul 28 17:29:19 2006 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sat, 29 Jul 2006 01:29:19 +1000 Subject: [Python-Dev] Bad interaction of __index__ and sequence repeat In-Reply-To: <44CA082B.6030908@blueyonder.co.uk> References: <20060728101133.GA339@code0.codespeak.net> <44CA082B.6030908@blueyonder.co.uk> Message-ID: <44CA2D4F.1020601@gmail.com> David Hopwood wrote: > Armin Rigo wrote: >> Hi, >> >> There is an oversight in the design of __index__() that only just >> surfaced :-( It is responsible for the following behavior, on a 32-bit >> machine with >= 2GB of RAM: >> >> >>> s = 'x' * (2**100) # works! >> >>> len(s) >> 2147483647 >> >> This is because PySequence_Repeat(v, w) works by applying w.__index__ in >> order to call v->sq_repeat. However, __index__ is defined to clip the >> result to fit in a Py_ssize_t. > > Clipping the result sounds like it would *never* be a good idea. What was > the rationale for that? It should throw an exception. A simple demonstration of the clipping behaviour that works on machines with limited memory: >>> (2**100).__index__() 2147483647 >>> (-2**100).__index__() -2147483648 PEP 357 doesn't even mention the issue, and the comment on long_index in the code doesn't give a rationale - it just notes that the function clips the result. Neither the PyNumber_AsIndex nor the __index__ documentation mention anything about the possibility of clipping, and there's no test case to verify this behaviour. I'm inclined to call it a bug, too, but I've cc'ed Travis to see if he can shed some light on the question - the implementation of long_index explicitly suppresses the overflow error generated by _long_as_ssize_t, so the current behaviour appears to be deliberate. Cheers, Nick. 
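
The difference between the clipping that long_index performs and the exception that 2.4 raised can be sketched in pure Python. This is only an illustration: clip_to_ssize_t and strict_ssize_t are hypothetical helpers, not CPython functions, and the bounds are hard-coded for a 32-bit Py_ssize_t.

```python
# Assumed bounds of Py_ssize_t on a 32-bit build.
SSIZE_T_MAX = 2**31 - 1
SSIZE_T_MIN = -2**31


def clip_to_ssize_t(value):
    """Mimic the reported behaviour: silently clamp out-of-range values."""
    return max(SSIZE_T_MIN, min(SSIZE_T_MAX, value))


def strict_ssize_t(value):
    """Mimic the behaviour argued for here: refuse values that do not fit."""
    if not SSIZE_T_MIN <= value <= SSIZE_T_MAX:
        raise OverflowError("value does not fit in Py_ssize_t")
    return value


clipped_high = clip_to_ssize_t(2**100)
clipped_low = clip_to_ssize_t(-2**100)

try:
    strict_ssize_t(2**100)
    overflow_raised = False
except OverflowError:
    overflow_raised = True
```

With clipping, a repeat count of 2**100 quietly becomes 2147483647, which is how 'x' * (2**100) ends up "working"; with the strict variant the caller sees an OverflowError instead.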
-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia
---------------------------------------------------------------
             http://www.boredomandlaziness.org

From pje at telecommunity.com Fri Jul 28 17:30:31 2006
From: pje at telecommunity.com (Phillip J. Eby)
Date: Fri, 28 Jul 2006 11:30:31 -0400
Subject: [Python-Dev] Release manager pronouncement needed: PEP 302 Fix
In-Reply-To: <44C9A1B1.3040102@v.loewis.de>
References: <5.1.1.6.0.20060727114924.0262c980@sparrow.telecommunity.com> <20060727103920.GB31912@code0.codespeak.net> <5.1.1.6.0.20060726142456.036d9008@sparrow.telecommunity.com> <20060727103920.GB31912@code0.codespeak.net> <5.1.1.6.0.20060727114924.0262c980@sparrow.telecommunity.com>
Message-ID: <5.1.1.6.0.20060728112800.03a7fb48@sparrow.telecommunity.com>

At 07:33 AM 7/28/2006 +0200, Martin v. Löwis wrote:
>Phillip J. Eby wrote:
> > I'm willing to write code that makes it PEP 302 compliant, if the release
> > manager will bless such an addition.  But if that's not acceptable, then
> > somebody needs to produce the necessary documentation updates or revert the
> > patch.  It absolutely should not be allowed to remain in *and* undocumented
> > because it is a backwards-incompatible change to documented behavior of
> > Python for two major releases (2.3 and 2.4).
>
>You don't need a release manager pronouncement for that. It's a bug,
>changing it is a bug fix, you don't need RM permission to fix a bug.
>
>Do you have a patch ready that restores path_importer_cache behavior,
>yet preserves the property that it caches existence of a directory?
>If not, I will have to produce one.

The issue is that a proper fix that caches existence requires adding new
types to import.c and thus might appear to be more of a feature.  I was
therefore reluctant to embark upon the work without some assurance that it
wouldn't be rejected as adding a last-minute feature.
From nnorwitz at gmail.com Fri Jul 28 17:53:07 2006 From: nnorwitz at gmail.com (Neal Norwitz) Date: Fri, 28 Jul 2006 08:53:07 -0700 Subject: [Python-Dev] Another uuid problem In-Reply-To: <17610.6370.422365.622901@montanaro.dyndns.org> References: <17610.6370.422365.622901@montanaro.dyndns.org> Message-ID: <ee2a432c0607280853w1d5a26d7k47264bb1430be879@mail.gmail.com> It checks for ifconfig, /sbin/ifconfig, and /usr/sbin/ifconfig (same for arp). The problem is the os.pipe command doesn't hide these issues. It doesn't cause the test to fail, but is still broken. The test is presumably failing for the other reason I mentioned (unixdll_getnode). Let me know if you see that when running with -v. n -- On 7/28/06, skip at pobox.com <skip at pobox.com> wrote: > > I just tried building and testing 2.5 on a Solaris 10 box at work. The uuid > test failed with > > sh: ifconfig: not found > sh: ifconfig: not found > sh: ifconfig: not found > sh: arp: not found > sh: ifconfig: not found > sh: ifconfig: not found > sh: ifconfig: not found > sh: arp: not found > > In our environment at least it's uncommon for /usr/sbin to be in the PATH of > non-privileged users: > > piggy:% type -a ifconfig > -bash: type: ifconfig: not found > piggy:% PATH=$PATH:/usr/sbin type -a ifconfig > ifconfig is /usr/sbin/ifconfig > > Perhaps test_uuid needs to do a little investigation to find ifconfig and > arp. 
> > Skip > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > http://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: http://mail.python.org/mailman/options/python-dev/nnorwitz%40gmail.com > From ncoghlan at gmail.com Fri Jul 28 17:54:30 2006 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sat, 29 Jul 2006 01:54:30 +1000 Subject: [Python-Dev] Bad interaction of __index__ and sequence repeat In-Reply-To: <2mac6tyclu.fsf@starship.python.net> References: <20060728101133.GA339@code0.codespeak.net> <44CA082B.6030908@blueyonder.co.uk> <2mac6tyclu.fsf@starship.python.net> Message-ID: <44CA3336.6030807@gmail.com> Michael Hudson wrote: > David Hopwood <david.nospam.hopwood at blueyonder.co.uk> writes: > >> Armin Rigo wrote: >>> Hi, >>> >>> There is an oversight in the design of __index__() that only just >>> surfaced :-( It is responsible for the following behavior, on a 32-bit >>> machine with >= 2GB of RAM: >>> >>> >>> s = 'x' * (2**100) # works! >>> >>> len(s) >>> 2147483647 >>> >>> This is because PySequence_Repeat(v, w) works by applying w.__index__ in >>> order to call v->sq_repeat. However, __index__ is defined to clip the >>> result to fit in a Py_ssize_t. >> Clipping the result sounds like it would *never* be a good idea. What was >> the rationale for that? It should throw an exception. > > Why would you expect range(10)[:2**32-1] and range(10)[:2**32] to do > different things? In that case, I believe it is the slice object that should be suppressing the overflow error (via PyErr_Occurred and PyErr_Matches) when calculating the indices for a given length, rather than having silent clipping be part of the basic implementation of long.__index__(). Cheers, Nick. 
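
The division of labour suggested here, where only the slicing code swallows the overflow, could look roughly like this in Python. It is a sketch under stated assumptions: as_ssize_t, slice_bound and repeat are hypothetical stand-ins for the C-level conversion, _PyEval_SliceIndex and sequence_repeat respectively, and the bounds assume a 32-bit Py_ssize_t.

```python
import operator

# Assumed bounds of Py_ssize_t on a 32-bit build.
SSIZE_T_MAX = 2**31 - 1
SSIZE_T_MIN = -2**31


def as_ssize_t(obj):
    """Strict conversion: use __index__ and raise if the result does not fit."""
    value = operator.index(obj)
    if not SSIZE_T_MIN <= value <= SSIZE_T_MAX:
        raise OverflowError("cannot fit index into Py_ssize_t")
    return value


def slice_bound(obj):
    """Slice-style conversion: an overflow is cleared and the bound clamped."""
    try:
        return as_ssize_t(obj)
    except OverflowError:
        # Only slicing is allowed to turn an overflow into a clamp.
        return SSIZE_T_MAX if operator.index(obj) > 0 else SSIZE_T_MIN


def repeat(seq, count):
    """Repeat-style use: the overflow propagates to the caller."""
    return seq * as_ssize_t(count)


upper = slice_bound(2**100)
lower = slice_bound(-2**100)
doubled = repeat("ab", 2)
```

Under this arrangement range(10)[:2**32-1] and range(10)[:2**32] still behave alike, because both bounds are clamped by the slicing path, while a sequence repeat with an oversized count fails loudly.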
-- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- http://www.boredomandlaziness.org From tim.peters at gmail.com Fri Jul 28 17:55:47 2006 From: tim.peters at gmail.com (Tim Peters) Date: Fri, 28 Jul 2006 11:55:47 -0400 Subject: [Python-Dev] Bad interaction of __index__ and sequence repeat In-Reply-To: <20060728101133.GA339@code0.codespeak.net> References: <20060728101133.GA339@code0.codespeak.net> Message-ID: <1f7befae0607280855l3f71febbo95ff5751fe31d99c@mail.gmail.com> [Armin Rigo] > There is an oversight in the design of __index__() that only just > surfaced :-( It is responsible for the following behavior, on a 32-bit > machine with >= 2GB of RAM: > > >>> s = 'x' * (2**100) # works! > >>> len(s) > 2147483647 > > This is because PySequence_Repeat(v, w) works by applying w.__index__ in > order to call v->sq_repeat. ? I don't see an invocation of __index__ or nb_index in PySequence_Repeat. To the contrary, its /incoming/ `count` argument is constrained to Py_ssize_t from the start: PyObject * PySequence_Repeat(PyObject *o, Py_ssize_t count) ... OK, I think you mean sequence_repeat() in abstract.c. That does invoke nb_index. But, as below, I don't think it should in this case. > However, __index__ is defined to clip the result to fit in a Py_ssize_t. > This means that the above problem exists > with all sequences, not just strings, given enough RAM to create such > sequences with 2147483647 items. > > For reference, in 2.4 we correctly get an OverflowError. > > Argh! What should be done about it? IMO, this is plain wrong. PEP 357 isn't entirely clear, but it is clear the author only had /slicing/ in mind (where clipping makes sense -- and which makes `__index__` a misleading name). Guido pointed out the ambiguity here: http://mail.python.org/pipermail/python-dev/2006-February/060624.html There's also an ambiguity when using simple indexing. 
    When writing x[i] where x is a sequence and i an object that isn't
    int or long but implements __index__, I think i.__index__() should be
    used rather than bailing out. I suspect that you didn't think of this
    because you've already special-cased this in your code -- when a
    non-integer is passed, the mapping API is used (mp_subscript). This
    is done to support extended slicing. The built-in sequences (list,
    str, unicode, tuple for sure, probably more) that implement
    mp_subscript should probe for nb_index before giving up. The generic
    code in PyObject_GetItem should also check for nb_index before giving
    up.

So, e.g., plain a[i] shouldn't use __index__ either if i is already
int or long.  I don't see any justification for invoking nb_index in
sequence_repeat(), although if someone thinks it should, then as for
plain indexing it certainly shouldn't invoke nb_index if the incoming
count is an int or long to begin with.

Ah, fudge.  Contrary to Guido's advice above, I see that
PyObject_GetItem() /also/ unconditionally invokes nb_index (even when
the incoming key is already int or long).  It shouldn't do that either
(according to me).

OTOH, in the long discussion about PEP 357, I'm not sure anyone except
Travis was clear on whether nb_index was meant to apply only to
sequence /slicing/ or was meant to apply "everywhere an object gets
used in an index-like context".  Clipping makes sense only for the
former, but it looks like the implementation treats it more like the
latter.  This was probably exacerbated by:

    http://mail.python.org/pipermail/python-dev/2006-February/060663.html

    [Travis]
    There are other places in Python that check specifically for int objects
    and long integer objects and fail with anything else.  Perhaps all of
    these should also call the __index__ slot.

    [Guido]
    Right, absolutely.
This is a mess :-) From nnorwitz at gmail.com Fri Jul 28 17:59:51 2006 From: nnorwitz at gmail.com (Neal Norwitz) Date: Fri, 28 Jul 2006 08:59:51 -0700 Subject: [Python-Dev] Py2.5 release schedule In-Reply-To: <2mirlixdg4.fsf@starship.python.net> References: <44C9A184.7030701@ewtllc.com> <200607281539.50794.anthony@interlink.com.au> <2mirlixdg4.fsf@starship.python.net> Message-ID: <ee2a432c0607280859r6c293fa5l78c66011829ff466@mail.gmail.com> On 7/28/06, Michael Hudson <mwh at python.net> wrote: > Anthony Baxter <anthony at interlink.com.au> writes: > > > Does anyone disagree with making the next release beta3? > > It seems like a good idea to me. I guess this will mean the final > release will be pushed back a bit? Anthony and I talked about still having b3 on Aug 1. rc1 around Aug 17-18 (just before the Google sprint which Martin, Jeremy and I will be attending). Final around 24-29. We didn't discuss with Martin yet, so these dates are quite tentative. n From guido at python.org Fri Jul 28 18:05:57 2006 From: guido at python.org (Guido van Rossum) Date: Fri, 28 Jul 2006 09:05:57 -0700 Subject: [Python-Dev] Bad interaction of __index__ and sequence repeat In-Reply-To: <44CA2D4F.1020601@gmail.com> References: <20060728101133.GA339@code0.codespeak.net> <44CA082B.6030908@blueyonder.co.uk> <44CA2D4F.1020601@gmail.com> Message-ID: <ca471dc20607280905y7447d005r2ab0ec664a69948@mail.gmail.com> Argh. I also find it a bug. I also feel responsible because I reviewed the patch. :-( In my recollection I tried to avoid this exact behavior. I wanted __index__() to just return the unclipped int or long value, but have a C API that clipped it for use in slice operations. It looks like I failed (the patch went through so many revisions that at some point I must've stopped caring). 
--Guido On 7/28/06, Nick Coghlan <ncoghlan at gmail.com> wrote: > David Hopwood wrote: > > Armin Rigo wrote: > >> Hi, > >> > >> There is an oversight in the design of __index__() that only just > >> surfaced :-( It is responsible for the following behavior, on a 32-bit > >> machine with >= 2GB of RAM: > >> > >> >>> s = 'x' * (2**100) # works! > >> >>> len(s) > >> 2147483647 > >> > >> This is because PySequence_Repeat(v, w) works by applying w.__index__ in > >> order to call v->sq_repeat. However, __index__ is defined to clip the > >> result to fit in a Py_ssize_t. > > > > Clipping the result sounds like it would *never* be a good idea. What was > > the rationale for that? It should throw an exception. > > A simple demonstration of the clipping behaviour that works on machines with > limited memory: > > >>> (2**100).__index__() > 2147483647 > >>> (-2**100).__index__() > -2147483648 > > PEP 357 doesn't even mention the issue, and the comment on long_index in the > code doesn't give a rationale - it just notes that the function clips the result. > > Neither the PyNumber_AsIndex nor the __index__ documentation mention anything > about the possibility of clipping, and there's no test case to verify this > behaviour. > > I'm inclined to call it a bug, too, but I've cc'ed Travis to see if he can > shed some light on the question - the implementation of long_index explicitly > suppresses the overflow error generated by _long_as_ssize_t, so the current > behaviour appears to be deliberate. > > Cheers, > Nick. 
> > -- > Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia > --------------------------------------------------------------- > http://www.boredomandlaziness.org > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > http://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: http://mail.python.org/mailman/options/python-dev/guido%40python.org > -- --Guido van Rossum (home page: http://www.python.org/~guido/) From tomerfiliba at gmail.com Fri Jul 28 18:07:30 2006 From: tomerfiliba at gmail.com (tomer filiba) Date: Fri, 28 Jul 2006 18:07:30 +0200 Subject: [Python-Dev] Fwd: patching pydoc? In-Reply-To: <1d85506f0607280635q3a693682l230c7821dc6f408f@mail.gmail.com> References: <1d85506f0607280635q3a693682l230c7821dc6f408f@mail.gmail.com> Message-ID: <1d85506f0607280907x22589f98od79e925baba24ac1@mail.gmail.com> submitted patch: https://sourceforge.net/tracker/?func=detail&atid=305470&aid=1530482&group_id=5470 -tomer ---------- Forwarded message ---------- From: tomer filiba <tomerfiliba at gmail.com> Date: Jul 28, 2006 3:35 PM Subject: patching pydoc? To: python-dev at python.org i have a problem with pydoc in rpyc. i wanted help(obj), where obj is a NetProxy object, to work as if it were local. i followed the code starting from site.help to pydoc.doc, which is the ultimate function that generates and prints the text. i expected there would be some function in the middle that prepares the text, and another that writes it to the pager, but to my disappointment pydoc.doc does both. this means i can't transfer the document to my local machine (it's printed directly to the remote console). therefore, i would like to split this behavior into two parts: * render_doc - a function that returns the document text * doc - a function that calls render_doc and sends it to the pager this way no existing code breaks (no existing function signatures are changed) and i gain help on remote objects. 
i hope people would be in favor, as it's not such a big change anyway. is it possible to add to 2.5? -tomer this is the code of pydoc, starting at line 1457 <<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<< def doc(thing, title='Python Library Documentation: %s', forceload=0): """Display text documentation, given an object or a path to an object.""" try: object, name = resolve(thing, forceload) desc = describe(object) module = inspect.getmodule(object) if name and '.' in name: desc += ' in ' + name[:name.rfind('.')] elif module and module is not object: desc += ' in module ' + module.__name__ if not (inspect.ismodule(object) or inspect.isclass(object) or inspect.isroutine(object) or isinstance(object, property)): # If the passed object is a piece of data or an instance, # document its available methods instead of its value. object = type(object) desc += ' object' pager(title % desc + '\n\n' + text.document(object, name)) except (ImportError, ErrorDuringImport), value: print value >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> this is the suggested code <<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<< def render_doc(thing, title='Python Library Documentation: %s', forceload=0): """generate the text""" object, name = resolve(thing, forceload) desc = describe(object) module = inspect.getmodule(object) if name and '.' in name: desc += ' in ' + name[:name.rfind('.')] elif module and module is not object: desc += ' in module ' + module.__name__ if not (inspect.ismodule(object) or inspect.isclass(object) or inspect.isroutine(object) or isinstance(object, property)): # If the passed object is a piece of data or an instance, # document its available methods instead of its value. 
object = type(object) desc += ' object' return title % desc + '\n\n' + text.document(object, name) def doc(*args, **kwargs): """Display text documentation, given an object or a path to an object.""" try: text = render_doc(*args, **kwargs) pager(text) except (ImportError, ErrorDuringImport), value: print value >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> From ncoghlan at gmail.com Fri Jul 28 18:33:01 2006 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sat, 29 Jul 2006 02:33:01 +1000 Subject: [Python-Dev] Bad interaction of __index__ and sequence repeat In-Reply-To: <1f7befae0607280855l3f71febbo95ff5751fe31d99c@mail.gmail.com> References: <20060728101133.GA339@code0.codespeak.net> <1f7befae0607280855l3f71febbo95ff5751fe31d99c@mail.gmail.com> Message-ID: <44CA3C3D.90805@gmail.com> Tim Peters wrote: > OTOH, in the long discussion about PEP 357, I'm not sure anyone except > Travis was clear on whether nb_index was meant to apply only to > sequence /slicing/ or was meant to apply "everywhere an object gets > used in an index-like context". Clipping makes sense only for the > former, but it looks like the implementation treats it more like the > latter. This was probably exacerbated by: > > http://mail.python.org/pipermail/python-dev/2006-February/060663.html > > [Travis] > There are other places in Python that check specifically for int objects > and long integer objects and fail with anything else. Perhaps all of > these should aslo call the __index__ slot. > > [Guido] > Right, absolutely. > > This is a mess :-) I've been trawling through the code a bit, and I don't think it's as bad as all that. All I believe is really needed is to: - remove the PyErr_Occurred() check and its body from long_index in longobject.c - add a PyErr_Occurred() check to force a -1 return from PyNumber_Index in abstract.c - add a PyErr_Occurred() and PyErr_ExceptionMatches(PyOverflowError) check to invoke PyErr_Clear() in _PyEval_SliceIndex in ceval.c. 
Add test cases to test_index.py to check that:
   (2**100).__index__() == 2**100
   (-2**100).__index__() == -2**100
   slice(-2**100, 2**100).indices(sys.maxint) == (0, sys.maxint, 1)
   "a" * 2**100 raises OverflowError

Add test cases to test_operator.py to check that:
   operator.index(2**100) == 2**100
   operator.index(-2**100) == -2**100

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia
---------------------------------------------------------------
             http://www.boredomandlaziness.org

From ncoghlan at gmail.com Fri Jul 28 20:19:24 2006
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Sat, 29 Jul 2006 04:19:24 +1000
Subject: [Python-Dev] Bad interaction of __index__ and sequence repeat
In-Reply-To: <44CA3C3D.90805@gmail.com>
References: <20060728101133.GA339@code0.codespeak.net> <1f7befae0607280855l3f71febbo95ff5751fe31d99c@mail.gmail.com> <44CA3C3D.90805@gmail.com>
Message-ID: <44CA552C.9020805@gmail.com>

Nick Coghlan wrote:
> Tim Peters wrote:
>> OTOH, in the long discussion about PEP 357, I'm not sure anyone except
>> Travis was clear on whether nb_index was meant to apply only to
>> sequence /slicing/ or was meant to apply "everywhere an object gets
>> used in an index-like context".  Clipping makes sense only for the
>> former, but it looks like the implementation treats it more like the
>> latter.  This was probably exacerbated by:
>>
>>     http://mail.python.org/pipermail/python-dev/2006-February/060663.html
>>
>>     [Travis]
>>     There are other places in Python that check specifically for int objects
>>     and long integer objects and fail with anything else.  Perhaps all of
>>     these should also call the __index__ slot.
>>
>>     [Guido]
>>     Right, absolutely.
>>
>> This is a mess :-)
>
> I've been trawling through the code a bit, and I don't think it's as bad as
> all that.

Damn, it really is a mess. . . nb_index returns the Py_ssize_t directly, and a
whole heap of the code expects errors to be signalled via returning -1 before
checking PyErr_Occurred().
To get it to work without clipping everywhere, wrap_lenfunc (typeobject.c), _PyEval_SliceIndex (ceval.c), PyNumber_Index (abstract.c) and sequence_repeat (abstract.c) all had to be modified to recognize PY_SSIZE_T_MIN and PY_SSIZE_T_MAX as potential error flags (in order to clear the overflow error for _PyEval_SliceIndex, and in order to propagate the exception for the other three). And using this approach still means that (2**100).__index__() raises an OverflowError. It would probably be cleaner to change the signature of nb_index to return a PyObject *, and let the code that uses it worry about how (or even whether!) to convert PyLong results to a Py_ssize_t. Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- http://www.boredomandlaziness.org From tjreedy at udel.edu Fri Jul 28 20:29:50 2006 From: tjreedy at udel.edu (Terry Reedy) Date: Fri, 28 Jul 2006 14:29:50 -0400 Subject: [Python-Dev] patching pydoc? References: <1d85506f0607280635q3a693682l230c7821dc6f408f@mail.gmail.com> Message-ID: <eadl2v$51q$1@sea.gmane.org> "tomer filiba" <tomerfiliba at gmail.com> wrote in message news:1d85506f0607280635q3a693682l230c7821dc6f408f at mail.gmail.com... ... > therefore, i would like to split this behavior into two parts: > * render_doc - a function that returns the document text > * doc - a function that calls render_doc and sends it to the pager > > this way no existing code breaks (no existing function signatures > are changed) and i gain help on remote objects. > i hope people would be in favor, as it's not such a big change anyway. > is it possible to add to 2.5? Giving the amount of hair-tearing over uuid and __index__, this seems like an especially bad day to ask for a new-feature variance in a time of feature freeze ;-). Some quick questions: * I presume you gain the new functionality by directly calling the factored-out render_doc and printing thru your own pager. 
Does everyone? * Would making pager() a parameter of doc() make sense? * Is pager() the only part of the original doc() that can generate ImportError, ErrorDuringImport? If not, the try/except should be in render_doc also or instead. * Why generalize the doc() signature? Bad calls will be traced as caught in render_doc instead of doc. Couldn't that potentially break a bad_call test? Terry Jan Reedy > this is the code of pydoc, starting at line 1457 > > <<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<< > def doc(thing, title='Python Library Documentation: %s', forceload=0): > """Display text documentation, given an object or a path to an > object.""" > try: > object, name = resolve(thing, forceload) > desc = describe(object) > module = inspect.getmodule(object) > if name and '.' in name: > desc += ' in ' + name[:name.rfind('.')] > elif module and module is not object: > desc += ' in module ' + module.__name__ > if not (inspect.ismodule(object) or > inspect.isclass(object) or > inspect.isroutine(object) or > isinstance(object, property)): > # If the passed object is a piece of data or an instance, > # document its available methods instead of its value. > object = type(object) > desc += ' object' > pager(title % desc + '\n\n' + text.document(object, name)) > except (ImportError, ErrorDuringImport), value: > print value >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> > > this is the suggested code > > <<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<< > def render_doc(thing, title='Python Library Documentation: %s', > forceload=0): > """generate the text""" > object, name = resolve(thing, forceload) > desc = describe(object) > module = inspect.getmodule(object) > if name and '.' 
in name: > desc += ' in ' + name[:name.rfind('.')] > elif module and module is not object: > desc += ' in module ' + module.__name__ > if not (inspect.ismodule(object) or > inspect.isclass(object) or > inspect.isroutine(object) or > isinstance(object, property)): > # If the passed object is a piece of data or an instance, > # document its available methods instead of its value. > object = type(object) > desc += ' object' > return title % desc + '\n\n' + text.document(object, name) > > def doc(*args, **kwargs): > """Display text documentation, given an object or a path to an > object.""" > try: > text = render_doc(*args, **kwargs) > pager(text) > except (ImportError, ErrorDuringImport), value: > print value >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > http://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: > http://mail.python.org/mailman/options/python-dev/python-python-dev%40m.gmane.org > From guido at python.org Fri Jul 28 20:31:09 2006 From: guido at python.org (Guido van Rossum) Date: Fri, 28 Jul 2006 11:31:09 -0700 Subject: [Python-Dev] Bad interaction of __index__ and sequence repeat In-Reply-To: <44CA552C.9020805@gmail.com> References: <20060728101133.GA339@code0.codespeak.net> <1f7befae0607280855l3f71febbo95ff5751fe31d99c@mail.gmail.com> <44CA3C3D.90805@gmail.com> <44CA552C.9020805@gmail.com> Message-ID: <ca471dc20607281131m34f752b4pb1ff933f2d2262d9@mail.gmail.com> On 7/28/06, Nick Coghlan <ncoghlan at gmail.com> wrote: > Nick Coghlan wrote: > > Tim Peters wrote: > >> OTOH, in the long discussion about PEP 357, I'm not sure anyone except > >> Travis was clear on whether nb_index was meant to apply only to > >> sequence /slicing/ or was meant to apply "everywhere an object gets > >> used in an index-like context". Clipping makes sense only for the > >> former, but it looks like the implementation treats it more like the > >> latter. 
This was probably exacerbated by:
> >>
> >> http://mail.python.org/pipermail/python-dev/2006-February/060663.html
> >>
> >> [Travis]
> >> There are other places in Python that check specifically for int objects
> >> and long integer objects and fail with anything else. Perhaps all of
> >> these should also call the __index__ slot.
> >>
> >> [Guido]
> >> Right, absolutely.
> >>
> >> This is a mess :-)
> >
> > I've been trawling through the code a bit, and I don't think it's as bad as
> > all that.
>
> Damn, it really is a mess. . . nb_index returns the Py_ssize_t directly, and a
> whole heap of the code expects errors to be signalled via returning -1 before
> checking PyErr_Occurred().
>
> To get it to work without clipping everywhere, wrap_lenfunc (typeobject.c),
> _PyEval_SliceIndex (ceval.c), PyNumber_Index (abstract.c) and sequence_repeat
> (abstract.c) all had to be modified to recognize PY_SSIZE_T_MIN and
> PY_SSIZE_T_MAX as potential error flags (in order to clear the overflow error
> for _PyEval_SliceIndex, and in order to propagate the exception for the other
> three).
>
> And using this approach still means that (2**100).__index__() raises an
> OverflowError.
>
> It would probably be cleaner to change the signature of nb_index to return a
> PyObject *, and let the code that uses it worry about how (or even whether!)
> to convert PyLong results to a Py_ssize_t.

No time to look through the code here, but IMO it's acceptable (at
least for 2.5) if (2**100).__index__() raises OverflowError, as long
as x[:2**100] silently clips. __index__() is primarily meant to return
a value useful for indexing concrete sequences, and 2**100 isn't.
Certainly the exception is preferable to the silently truncated
result currently returned.

Fortunately there's some extra time since we're now going to do a third beta.
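
To make the clip-versus-raise distinction above concrete, here is a small
sketch. It assumes a current Python 3 interpreter, not the 2.5 betas under
discussion, so it only illustrates the behaviour being argued for: slicing
clips an oversized bound, while sequence repeat refuses it. On current
Python 3, (2**100).__index__() simply returns 2**100 rather than raising.
The helper name repeat_overflows is made up for this illustration.

```python
import operator
import sys

big = 2**100
data = "abc"

# Slicing silently clips an out-of-range bound to the sequence length.
clipped = data[:big]

# slice.indices() clips both ends to the given length.
indices = slice(-big, big).indices(sys.maxsize)

# operator.index() hands back the full integer, unclipped.
exact = operator.index(big)


def repeat_overflows(seq, count):
    """Return True if seq * count is rejected as too large (illustrative helper)."""
    try:
        seq * count
    except (OverflowError, MemoryError):
        return True
    return False
```

Here clipped is the whole string and indices spans the full range, whereas
repeat_overflows("a", big) is true: the repeat count is rejected outright
instead of being clipped to the largest Py_ssize_t.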
-- --Guido van Rossum (home page: http://www.python.org/~guido/) From tim.peters at gmail.com Fri Jul 28 20:39:28 2006 From: tim.peters at gmail.com (Tim Peters) Date: Fri, 28 Jul 2006 14:39:28 -0400 Subject: [Python-Dev] Bad interaction of __index__ and sequence repeat In-Reply-To: <44CA552C.9020805@gmail.com> References: <20060728101133.GA339@code0.codespeak.net> <1f7befae0607280855l3f71febbo95ff5751fe31d99c@mail.gmail.com> <44CA3C3D.90805@gmail.com> <44CA552C.9020805@gmail.com> Message-ID: <1f7befae0607281139ycb89b13n3ac3d829499eeab1@mail.gmail.com> [Tim] >>> ... >>> This is a mess :-) [Nick Coghlan] >> I've been trawling through the code a bit, and I don't think it's as bad as >> all that. [also Nick, but older & wiser ;-)] > Damn, it really is a mess. . . nb_index returns the Pyssize_t directly, Bingo. It's a /conceptual/ mess. Best I can make out, Travis only cared about sequence slicing (not indexing), and then the machinery got hijacked to become a more general "can you faithfully act like an integer?" thing -- but kept a signature that made sense only for the original slicing use (where clipping is fine). > and a whole heap of the code expects errors to be signalled via returning -1 before > checking PyErr_Occurred(). > > To get it to work without clipping everywhere, wrap_lenfunc (typeobject.c), > _PyEval_SliceIndex (ceval.c), PyNumber_Index (abstract.c) and sequence_repeat > (abstract.c) all had to be modified to recognize PY_SSIZE_T_MIN and > PY_SSIZE_T_MAX as potential error flags (in order to clear the overflow error > for _PyEval_SliceIndex, and in order to propagate the exception for the other > three). > > And using this approach still means that (2**100).__index__() raises an > OverflowError. 
> > It would probably be cleaner to change the signature of nb_index to return a > PyObject *, Given that the more-general use is what everyone else either wanted, or simply /assumed/, in the original discussions, I expect it would be, although with the understanding that the PyObject * returned must be NULL (in case of error), or a Python int or long. > and let the code that uses it worry about how (or even whether!) > to convert PyLong results to a Py_ssize_t. A utility function or two could help, like one that converted to Py_ssize_t with clipping, and another that did the same but raised OverflowError if Py_ssize_t isn't big enough (and in the latter case a caller would do the usual business of checking for a -1 return and PyErr_Occurred()).. From syfou at users.sourceforge.net Fri Jul 28 21:24:19 2006 From: syfou at users.sourceforge.net (Sylvain Fourmanoit) Date: Fri, 28 Jul 2006 15:24:19 -0400 (EDT) Subject: [Python-Dev] New miniconf module In-Reply-To: <20060727103331.GA31912@code0.codespeak.net> References: <Pine.LNX.4.64.0607261613130.6037@sylvain> <Pine.LNX.4.64.0607261402410.5956@sylvain> <44C7B8C2.1060904@blueyonder.co.uk> <Pine.LNX.4.64.0607261613130.6037@sylvain> <5.1.1.6.0.20060726180017.0262c990@sparrow.telecommunity.com> <Pine.LNX.4.64.0607270200300.6000@sylvain> <20060727103331.GA31912@code0.codespeak.net> Message-ID: <Pine.LNX.4.64.0607281438120.10450@sylvain> Armin Rigo wrote: > In the same spirit, maybe it could be slightly re-oriented towards a > dumper/loader for more than config files; for example, it could provide > a safe inverse of repr() for common built-in types New version of miniconf (version 1.2.0) is out [1][2], including a unrepr() function; that's the only change this time. Michael Foord wrote: > ConfigObj [3] gained an 'unrepr' mode a while back. The code is simple, > and originally came from CherryPy. Thanks for the link! I completely missed ConfigObj. It indeed shares a lot with my code. 
At the core, it use many of the same ideas and implementation techniques... In many ways, it is also a lot more advanced, specialized than miniconf: for instance, it introduce a new, specialized Config File format, while my effort aimed at keeping things minimal. Armin Rigo wrote: > If it goes in that direction, I'd suggest to rename the module to give > it a name closer to existing persistence-related modules already in the > stdlib. I am not especially fond of the current miniconf name either; I didn't find something more suitable, yet evocative of what it does; I would be glad to hear any suggestion you or the rest of the developers would have. Yours, -- Sylvain <syfou at users.sourceforge.net> The only difference between a car salesman and a computer salesman is that the car salesman knows he's lying. [1]http://cheeseshop.python.org/pypi?:action=display&name=miniconf&version=1.2.0 [2]http://sourceforge.net/tracker/index.php?func=detail&aid=1527597&group_id=5470&atid=355470 [3]http://www.voidspace.org.uk/python/configobj.html P.-S. I am leaving the civilization (where I have some sort of network access) from July the 29th to August the 13th: I will be glad to address any comment, bug report or suggestion Python developers might want to discuss in the meantime as soon as I will be back. 
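
On the unrepr() idea in this thread ("a safe inverse of repr() for common
built-in types"): a minimal sketch of what such a function can look like,
assuming a current Python 3 where the standard library provides
ast.literal_eval. This is not miniconf's or ConfigObj's implementation; the
helper name unrepr and the sample dictionary are hypothetical.

```python
import ast


def unrepr(text):
    """Rebuild a Python literal from its repr() without executing code.

    Hypothetical helper: it delegates to ast.literal_eval, which accepts
    only literals (strings, numbers, tuples, lists, dicts, sets, booleans
    and None) and rejects everything else.
    """
    return ast.literal_eval(text)


# Round-trip a small configuration-like structure through repr().
config = {"name": "demo", "ports": [8080, 8081], "debug": False, "ratio": 0.5}
restored = unrepr(repr(config))

# Anything that is not a plain literal is refused rather than evaluated.
try:
    unrepr("__import__('os').getcwd()")
    rejected = False
except ValueError:
    rejected = True
```

The point of the design is that the input is parsed, never evaluated, so a
config file cannot run arbitrary code when it is loaded.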
From fuzzyman at voidspace.org.uk Fri Jul 28 21:38:34 2006 From: fuzzyman at voidspace.org.uk (Michael Foord) Date: Fri, 28 Jul 2006 20:38:34 +0100 Subject: [Python-Dev] New miniconf module In-Reply-To: <Pine.LNX.4.64.0607281438120.10450@sylvain> References: <Pine.LNX.4.64.0607261613130.6037@sylvain> <Pine.LNX.4.64.0607261402410.5956@sylvain> <44C7B8C2.1060904@blueyonder.co.uk> <Pine.LNX.4.64.0607261613130.6037@sylvain> <5.1.1.6.0.20060726180017.0262c990@sparrow.telecommunity.com> <Pine.LNX.4.64.0607270200300.6000@sylvain> <20060727103331.GA31912@code0.codespeak.net> <Pine.LNX.4.64.0607281438120.10450@sylvain> Message-ID: <44CA67BA.9060803@voidspace.org.uk> Sylvain Fourmanoit wrote: > Armin Rigo wrote: > >> In the same spirit, maybe it could be slightly re-oriented towards a >> dumper/loader for more than config files; for example, it could provide >> a safe inverse of repr() for common built-in types >> > > New version of miniconf (version 1.2.0) is out [1][2], including a > unrepr() function; that's the only change this time. > > Michael Foord wrote: > >> ConfigObj [3] gained an 'unrepr' mode a while back. The code is simple, >> and originally came from CherryPy. >> > > Thanks for the link! I completely missed ConfigObj. It indeed shares a > lot with my code. At the core, it use many of the same ideas and > implementation techniques... In many ways, it is also a lot more > advanced, specialized than miniconf: for instance, it introduce a new, > specialized Config File format, while my effort aimed at keeping things > minimal. > > Cool - I made the post so you could reuse the unrepr code, but I'm glad you like ConfigObj. :-) If miniconf can store and restore instances of classes (which I guess it does if you see it as an improvement of pickle ?) then it does more than ConfigObj unrepr mode - which simply allows you to store basic data-types in config files (amongst all the other things it does...). 
Michael http://www.voidspace.org.uk/python/index.shtml From rhettinger at ewtllc.com Fri Jul 28 21:36:39 2006 From: rhettinger at ewtllc.com (Raymond Hettinger) Date: Fri, 28 Jul 2006 12:36:39 -0700 Subject: [Python-Dev] Py2.5 release schedule In-Reply-To: <ee2a432c0607280859r6c293fa5l78c66011829ff466@mail.gmail.com> References: <44C9A184.7030701@ewtllc.com> <200607281539.50794.anthony@interlink.com.au> <2mirlixdg4.fsf@starship.python.net> <ee2a432c0607280859r6c293fa5l78c66011829ff466@mail.gmail.com> Message-ID: <44CA6747.2090603@ewtllc.com> Neal Norwitz wrote: >Anthony and I talked about still having b3 on Aug 1. rc1 around Aug >17-18 (just before the Google sprint which Martin, Jeremy and I will >be attending). Final around 24-29. We didn't discuss with Martin >yet, so these dates are quite tentative. > > If it doesn't muck-up your workflow, I would like to see all of these dates bumped back by about a week. In particular, it may be too aggressive to expect the __index__ issue be fully resolved by 8/1. Also, I have a few error-traps to add to setobject.c and need to review one of the implementation decisions for str.rpartition(). This weekend, I'll run the head on a few third-party packages to see if their test-suites still pass -- if not, it would be nice to have a bit of time to fix whatever arises. Raymond From fdrake at acm.org Fri Jul 28 22:07:46 2006 From: fdrake at acm.org (Fred L. Drake, Jr.) Date: Fri, 28 Jul 2006 16:07:46 -0400 Subject: [Python-Dev] Release manager pronouncement needed: PEP 302 Fix In-Reply-To: <ee2a432c0607272149xc22baf4u2945da575237a83@mail.gmail.com> References: <5.1.1.6.0.20060726142456.036d9008@sparrow.telecommunity.com> <5.1.1.6.0.20060727171324.0262c318@sparrow.telecommunity.com> <ee2a432c0607272149xc22baf4u2945da575237a83@mail.gmail.com> Message-ID: <200607281607.46709.fdrake@acm.org> On Friday 28 July 2006 00:49, Neal Norwitz wrote: > Based on this comment, is it really acceptable to just document a > behaviour change? 
ISTM there should really only be 2 choices: fix
> 2.5 properly or revert the change. This seemed to be Armin's
> position.

I agree those are the only reasonable solutions. I'd rather see things fixed,
but I don't know how much time Phillip has to work on it. I'll be working on
straightening out the xmlcore issue tonight/tomorrow.

  -Fred

--
Fred L. Drake, Jr. <fdrake at acm.org>

From bob at redivi.com Fri Jul 28 22:35:40 2006
From: bob at redivi.com (Bob Ippolito)
Date: Fri, 28 Jul 2006 13:35:40 -0700
Subject: [Python-Dev] struct module and coercing floats to integers
Message-ID: <9E51A030-B610-414B-98E0-1923F1C3E862@redivi.com>

It seems that the pre-2.5 struct module has some additional
undocumented behavior[1] that didn't percolate into the new version:
http://python.org/sf/1530559

Python 2.4 and earlier coerce floats to integers when necessary,
without any kind of complaint:

$ python2.4 -c "import struct; print repr(struct.pack('>H', 0.9999999999999999))"
'\x00\x00'

Python 2.5 refuses to coerce float to int:

$ python2.5 -c "import struct; print repr(struct.pack('>H', 0.9999999999999999))"
Traceback (most recent call last):
  File "<string>", line 1, in <module>
  File "/Users/bob/src/python/Lib/struct.py", line 63, in pack
    return o.pack(*args)
TypeError: unsupported operand type(s) for &: 'float' and 'long'

The available options are to:

1. Reinstate the pre-2.5 weirdness
2. Reinstate the pre-2.5 weirdness with a DeprecationWarning
3. Break existing code that relies on undocumented behavior (seems
   more like a bug than lack of specification)

Either 2 or 3 seems reasonable to me, with a preference for 3 because
none of my code depends on old bugs in the struct module :)

As far as precedent goes, the array module *used* to coerce floats
silently, but it's had a DeprecationWarning since at least Python 2.3
(but perhaps even earlier). Maybe it's time to promote that warning
to an exception for Python 2.5?
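
For code that relied on the old coercion, the portable fix is to make the
truncation explicit. A minimal sketch, assuming a current Python 3
interpreter, where integer formats reject floats outright; the helper names
pack_u16 and float_is_rejected are hypothetical, and the exception type
caught is deliberately broad because it has differed between versions.

```python
import struct


def pack_u16(value):
    """Pack value as a big-endian unsigned short, truncating explicitly.

    int() makes the float-to-integer conversion visible at the call site
    instead of relying on struct to coerce silently.
    """
    return struct.pack(">H", int(value))


def float_is_rejected(value):
    """Return True if struct refuses to pack value with an integer format."""
    try:
        struct.pack(">H", value)
    except (struct.error, TypeError):
        return True
    return False
```

With these, pack_u16(0.9999999999999999) yields two zero bytes, matching the
2.4 output above, while passing the float straight to struct.pack is refused.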
[1] The pre-2.5 behavior should really be considered a bug, the documentation says "Return a string containing the values v1, v2, ... packed according to the given format. The arguments must match the values required by the format exactly." I wouldn't consider arbitrary floating point numbers to match the value required by an integer format exactly. Floats are not in general interchangeable with integers in Python anyway (e.g. list indexes, etc.). -bob From pje at telecommunity.com Fri Jul 28 22:42:00 2006 From: pje at telecommunity.com (Phillip J. Eby) Date: Fri, 28 Jul 2006 16:42:00 -0400 Subject: [Python-Dev] Release manager pronouncement needed: PEP 302 Fix In-Reply-To: <200607281607.46709.fdrake@acm.org> References: <ee2a432c0607272149xc22baf4u2945da575237a83@mail.gmail.com> <5.1.1.6.0.20060726142456.036d9008@sparrow.telecommunity.com> <5.1.1.6.0.20060727171324.0262c318@sparrow.telecommunity.com> <ee2a432c0607272149xc22baf4u2945da575237a83@mail.gmail.com> Message-ID: <5.1.1.6.0.20060728163317.0377cbf8@sparrow.telecommunity.com> At 04:07 PM 7/28/2006 -0400, Fred L. Drake, Jr. wrote: >On Friday 28 July 2006 00:49, Neal Norwitz wrote: > > Based on this comment, is it really acceptable to just document a > > behaviour change? ISTM there should really only be 2 choices: fix > > 2.5 properly or revert the change. This seemed to be Armin's > > position. > >I agree those are the only reasonable solutions. I'd rather see things >fixed, >but I don't know how much time Phillip has to work on it. I'll be working on >the straigtening out the xmlcore issue tonight/tomorrow. I'm testing a semi-final version of the fix now. It's minimally intrusive in that it only adds an "imp.NullImporter" type to replace the "False" value, and it keeps "None" to mean that the built-in import machinery should be used. 
From martin at v.loewis.de Fri Jul 28 22:47:34 2006 From: martin at v.loewis.de (=?ISO-8859-1?Q?=22Martin_v=2E_L=F6wis=22?=) Date: Fri, 28 Jul 2006 22:47:34 +0200 Subject: [Python-Dev] Py2.5 release schedule In-Reply-To: <ee2a432c0607280859r6c293fa5l78c66011829ff466@mail.gmail.com> References: <44C9A184.7030701@ewtllc.com> <200607281539.50794.anthony@interlink.com.au> <2mirlixdg4.fsf@starship.python.net> <ee2a432c0607280859r6c293fa5l78c66011829ff466@mail.gmail.com> Message-ID: <44CA77E6.7060200@v.loewis.de> Neal Norwitz wrote: > Anthony and I talked about still having b3 on Aug 1. rc1 around Aug > 17-18 (just before the Google sprint which Martin, Jeremy and I will > be attending). Final around 24-29. We didn't discuss with Martin > yet, so these dates are quite tentative. That doesn't work for me. The final release must either happen before Aug 19, or after Sep 9, or somebody else must roll the Windows binaries. Regards, Martin From martin at v.loewis.de Fri Jul 28 22:55:46 2006 From: martin at v.loewis.de (=?ISO-8859-1?Q?=22Martin_v=2E_L=F6wis=22?=) Date: Fri, 28 Jul 2006 22:55:46 +0200 Subject: [Python-Dev] Release manager pronouncement needed: PEP 302 Fix In-Reply-To: <5.1.1.6.0.20060728112800.03a7fb48@sparrow.telecommunity.com> References: <5.1.1.6.0.20060727114924.0262c980@sparrow.telecommunity.com> <20060727103920.GB31912@code0.codespeak.net> <5.1.1.6.0.20060726142456.036d9008@sparrow.telecommunity.com> <20060727103920.GB31912@code0.codespeak.net> <5.1.1.6.0.20060727114924.0262c980@sparrow.telecommunity.com> <5.1.1.6.0.20060728112800.03a7fb48@sparrow.telecommunity.com> Message-ID: <44CA79D2.7000209@v.loewis.de> Phillip J. Eby wrote: > The issue is that a proper fix that caches existence requires adding new > types to import.c and thus might appear to be more of a feature. I was > therefore reluctant to embark upon the work without some assurance that it > wouldn't be rejected as adding a last-minute feature. So do you have a patch, or are going to write one? 
Regards, Martin From fuzzyman at voidspace.org.uk Fri Jul 28 23:20:31 2006 From: fuzzyman at voidspace.org.uk (Michael Foord) Date: Fri, 28 Jul 2006 22:20:31 +0100 Subject: [Python-Dev] Patch Against shutil.copytree Bug Message-ID: <44CA7F9F.3080402@voidspace.org.uk> Hello all, Sourceforge is being anal and won't let me log in. Attached is a patch for the shutil test, which would pickup bug "[1525866] Bug in shutil.copytree on Windows". It passes against a bug-fixed shutil [1] on my XP box, and fails on an un-fixed one. It only tests basic functionality of copytree. I did add tests that copystat had worked, but the mtime results were consistently off on Windows [2] so I removed that test. I would appreciate it if someone could post this patch to sourceforge. IMO this bug needs fixing before 2.5 final, the fix is trivial and shutil.copytree is broken on windows without it. All the best, Michael Foord [1] Bugfix supplied by Thomas Heller and is attached to the sourceforge bug report. [2] And consistently off in a weird way FWIW... -------------- next part -------------- An embedded and charset-unspecified text was scrubbed... Name: test_shutil.diff Url: http://mail.python.org/pipermail/python-dev/attachments/20060728/b83b9152/attachment.diff From gward-1337f07a94b43060ff5c1ea922ed93d6 at python.net Wed Jul 26 04:32:13 2006 From: gward-1337f07a94b43060ff5c1ea922ed93d6 at python.net (Greg Ward) Date: Tue, 25 Jul 2006 22:32:13 -0400 Subject: [Python-Dev] httplib and bad response chunking Message-ID: <20060726023213.GA3106@cthulhu.gerg.ca> So I accidentally discovered the other day that httplib does not handle a particular type of mangled HTTP response very well. In particular, it tends to blow up with an undocumented ValueError when the server screws up "chunked" encoding. I'm not the first to discover this, either: see http://www.python.org/sf/1486335 . 
<digression>
HTTP 1.1 response chunking allows clients to know how many bytes of
response to expect for dynamic content, i.e. when it's not possible to
include a "Content-length" header. A chunked response might look like
this:

  0005\r\nabcd\n\r\n0004\r\nabc\n\r\n0\r\n\r\n

which means:

  0x0005 bytes in first chunk, which is "abcd\n"
  0x0004 bytes in second chunk, which is "abc\n"

Each chunk size is terminated with "\r\n"; each chunk is terminated
with "\r\n"; end of response is indicated by a chunk of 0 bytes, hence
the "\r\n\r\n" at the end.

Details in RFC 2616:

  http://www.w3.org/Protocols/rfc2616/rfc2616-sec3.html#sec3.6.1
</digression>

Anyways, what I discovered in the wild the other day was a response
like this:

  0005\r\nabcd\n\r\n0004\r\nabc\n\r\n\r\n

i.e. the chunk-size for the terminating empty chunk was missing.
This causes httplib.py to blow up with ValueError because it tries to
call

  int(line, 16)

assuming that 'line' contains a hex number, when in fact it is the
empty string. Oops.

IMHO the minimal fix is to turn ValueError into HTTPException (or a
subclass thereof); httplib should not raise ValueError just because some
server sends a bad response. (The server in question was Apache 2.0.52
running PHP 4.3.9 sending a big hairy error page because the database
was down.)

Where I'm getting hung up is how far to test this stuff. I have
discovered other hypothetical cases of bad chunking that cause httplib
to go into an infinite loop or block forever on socket.readline().
Should we worry about those cases as well, despite not having seen them
happen in the wild? More annoying, I can reproduce the "block forever"
case using a real socket, but not using the StringIO-based FakeSocket
class in test_httplib.

Anyways, I've cobbled together a crude hack to test_httplib.py that
exposes the problem:

  http://sourceforge.net/tracker/download.php?group_id=5470&atid=105470&file_id=186245&aid=1486335

Feedback welcome.
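
The failure mode and the proposed minimal fix can be sketched outside httplib.
The decoder below is an illustration only, written for Python 3 and reading
from an in-memory buffer; it is not httplib's code. BadChunkError is a
hypothetical stand-in for an HTTPException subclass, and read_chunked is a
made-up name. Trailer headers after the final chunk are ignored.

```python
import io


class BadChunkError(Exception):
    """Hypothetical stand-in for an HTTPException subclass."""


def read_chunked(fp):
    """Decode a chunked body from a binary file-like object."""
    parts = []
    while True:
        line = fp.readline()
        # Drop any chunk extension (";name=value") and surrounding whitespace.
        size_field = line.split(b";", 1)[0].strip()
        try:
            size = int(size_field, 16)
        except ValueError:
            # An empty or non-hex size line is a protocol error, not a
            # programming error, so report it as such. This also covers
            # end of input, where readline() returns an empty string.
            raise BadChunkError("bad chunk size line: %r" % (line,)) from None
        if size < 0:
            raise BadChunkError("negative chunk size: %r" % (line,))
        if size == 0:
            return b"".join(parts)
        data = fp.read(size)
        if len(data) != size:
            raise BadChunkError("truncated chunk")
        parts.append(data)
        # Every chunk is followed by its own CRLF.
        if fp.read(2) != b"\r\n":
            raise BadChunkError("missing CRLF after chunk")


GOOD = b"0005\r\nabcd\n\r\n0004\r\nabc\n\r\n0\r\n\r\n"
BAD = b"0005\r\nabcd\n\r\n0004\r\nabc\n\r\n\r\n"

body = read_chunked(io.BytesIO(GOOD))

try:
    read_chunked(io.BytesIO(BAD))
    bad_rejected = False
except BadChunkError:
    bad_rejected = True
```

Because an exhausted buffer also produces an empty size line, this loop cannot
spin forever on a truncated response. It does nothing about the "block
forever" case on a real socket, which needs a timeout rather than parsing
changes.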
(Fixing the inadvertent ValueError is trivial, so I'm
concentrating on getting the tests right first.)

Oh yeah, my patch is relative to the 2.4 branch.

        Greg
--
Greg Ward <gward at python.net>                         http://www.gerg.ca/
I don't believe there really IS a GAS SHORTAGE.. I think it's all just
a BIG HOAX on the part of the plastic sign salesmen -- to sell more numbers!!

From richardjones at optushome.com.au Wed Jul 26 06:26:29 2006
From: richardjones at optushome.com.au (Richard Jones)
Date: Wed, 26 Jul 2006 14:26:29 +1000
Subject: [Python-Dev] More tracker demos online
In-Reply-To: <ea6msv$hgf$1@sea.gmane.org>
References: <44C6748C.4060701@v.loewis.de> <ea6msv$hgf$1@sea.gmane.org>
Message-ID: <200607261426.29143.richardjones@optushome.com.au>

On Wednesday 26 July 2006 13:17, Terry Reedy wrote:
> ""Martin v. Löwis"" <martin at v.loewis.de> wrote in message
> news:44C6748C.4060701 at v.loewis.de...
>
> > Currently, we have two running tracker demos online:
> >
> > Roundup:
> > http://efod.se/python-tracker/
> >
> > Jira:
> > http://jira.python.atlassian.com/secure/Dashboard.jspa
>
> What user name and passwords will they accept, if any?

For the Roundup tracker at least you should be able to recover your password
using your sourceforge email address.

     Richard

From oliphant.travis at ieee.org Fri Jul 28 18:15:51 2006
From: oliphant.travis at ieee.org (Travis Oliphant)
Date: Fri, 28 Jul 2006 10:15:51 -0600
Subject: [Python-Dev] Bad interaction of __index__ and sequence repeat
In-Reply-To: <44CA2D4F.1020601@gmail.com>
References: <20060728101133.GA339@code0.codespeak.net>
 <44CA082B.6030908@blueyonder.co.uk> <44CA2D4F.1020601@gmail.com>
Message-ID: <44CA3837.3040506@ieee.org>

Nick Coghlan wrote:
> David Hopwood wrote:
>> Armin Rigo wrote:
>>> Hi,
>>>
>>> There is an oversight in the design of __index__() that only just
>>> surfaced :-( It is responsible for the following behavior, on a 32-bit
>>> machine with >= 2GB of RAM:
>>>
>>> >>> s = 'x' * (2**100) # works!
>>> >>> len(s) >>> 2147483647 >>> >>> This is because PySequence_Repeat(v, w) works by applying >>> w.__index__ in >>> order to call v->sq_repeat. However, __index__ is defined to clip the >>> result to fit in a Py_ssize_t. >> >> Clipping the result sounds like it would *never* be a good idea. What >> was >> the rationale for that? It should throw an exception. > > A simple demonstration of the clipping behaviour that works on > machines with limited memory: > > >>> (2**100).__index__() > 2147483647 > >>> (-2**100).__index__() > -2147483648 > > PEP 357 doesn't even mention the issue, and the comment on long_index > in the code doesn't give a rationale - it just notes that the function > clips the result. I can't think of the rationale so it was probably an unclear one and should be thought of as a bug. The fact that it isn't discussed in the PEP means it wasn't thought about clearly. I think I had the vague idea that .__index_() should always succeed. But, this shows a problem with that notion. > > I'm inclined to call it a bug, too, but I've cc'ed Travis to see if he > can shed some light on the question - the implementation of long_index > explicitly suppresses the overflow error generated by > _long_as_ssize_t, so the current behaviour appears to be deliberate. If it was deliberate, it was a hurried decision and one that should be re-thought and probably changed. I think the idea came from the fact that out-of-bounds slicing returns empty lists and since __index__ was primarily developed to allow integer-like objects to be used in slicing it adopted that behavior. In fact it looks like the comment above _long_index contains words from the comment above _PyEval_SliceIndex showing the direct borrowing of the idea. But, _long_index is clearly the wrong place to handle the situation since it is used by more than just the slicing code. An error return is already handled by the _Eval_SliceIndex code anyway. I say it's a bug that should be fixed. 
Don't clear the error, raise it. -Travis From pje at telecommunity.com Sat Jul 29 00:00:36 2006 From: pje at telecommunity.com (Phillip J. Eby) Date: Fri, 28 Jul 2006 18:00:36 -0400 Subject: [Python-Dev] Release manager pronouncement needed: PEP 302 Fix In-Reply-To: <44CA79D2.7000209@v.loewis.de> References: <5.1.1.6.0.20060728112800.03a7fb48@sparrow.telecommunity.com> <5.1.1.6.0.20060727114924.0262c980@sparrow.telecommunity.com> <20060727103920.GB31912@code0.codespeak.net> <5.1.1.6.0.20060726142456.036d9008@sparrow.telecommunity.com> <20060727103920.GB31912@code0.codespeak.net> <5.1.1.6.0.20060727114924.0262c980@sparrow.telecommunity.com> <5.1.1.6.0.20060728112800.03a7fb48@sparrow.telecommunity.com> Message-ID: <5.1.1.6.0.20060728173543.01b7c120@sparrow.telecommunity.com> At 10:55 PM 7/28/2006 +0200, Martin v. L?wis wrote: >Phillip J. Eby wrote: > > The issue is that a proper fix that caches existence requires adding new > > types to import.c and thus might appear to be more of a feature. I was > > therefore reluctant to embark upon the work without some assurance that it > > wouldn't be rejected as adding a last-minute feature. > >So do you have a patch, or are going to write one? Yes, it's checked in as r50916. It ultimately turned out to be simpler than I thought; only one new type (imp.NullImporter) was required. From martin at v.loewis.de Sat Jul 29 00:44:46 2006 From: martin at v.loewis.de (=?ISO-8859-1?Q?=22Martin_v=2E_L=F6wis=22?=) Date: Sat, 29 Jul 2006 00:44:46 +0200 Subject: [Python-Dev] Bad interaction of __index__ and sequence repeat In-Reply-To: <44CA3837.3040506@ieee.org> References: <20060728101133.GA339@code0.codespeak.net> <44CA082B.6030908@blueyonder.co.uk> <44CA2D4F.1020601@gmail.com> <44CA3837.3040506@ieee.org> Message-ID: <44CA935E.4010901@v.loewis.de> Travis Oliphant wrote: > I say it's a bug that should be fixed. Don't clear the error, raise it. Several people have said this, but I don't think it can work. 
If you raise an OverflowError in __index__, the slicing code cannot know whether this meant as overflow or underflow (in a signed sense). Regards, Martin From jjl at pobox.com Sat Jul 29 00:59:37 2006 From: jjl at pobox.com (John J Lee) Date: Fri, 28 Jul 2006 22:59:37 +0000 (UTC) Subject: [Python-Dev] httplib and bad response chunking In-Reply-To: <20060726023213.GA3106@cthulhu.gerg.ca> References: <20060726023213.GA3106@cthulhu.gerg.ca> Message-ID: <Pine.LNX.4.64.0607282251320.8644@localhost> On Tue, 25 Jul 2006, Greg Ward wrote: [...] > Where I'm getting hung up is how far to test this stuff. Stop when you run out of time ;-) > I have > discovered other hypothetical cases of bad chunking that cause httplib > to go into an infinite loop or block forever on socket.readline(). > Should we worry about those cases as well, despite not having seen them > happen in the wild? More annoying, I can reproduce the "block forever" > case using a real socket, but not using the StringIO-based FakeSocket > class in test_httplib. They have been seen in the wild :-) http://python.org/sf/1411097 The IP address referenced isn't under my control, I don't know if it still provokes the error, but the problem is clear. John From rrr at ronadam.com Sat Jul 29 01:06:02 2006 From: rrr at ronadam.com (Ron Adam) Date: Fri, 28 Jul 2006 18:06:02 -0500 Subject: [Python-Dev] patching pydoc? In-Reply-To: <1d85506f0607280635q3a693682l230c7821dc6f408f@mail.gmail.com> References: <1d85506f0607280635q3a693682l230c7821dc6f408f@mail.gmail.com> Message-ID: <44CA985A.3030408@ronadam.com> tomer filiba wrote: > i have a problem with pydoc in rpyc. i wanted help(obj), where obj > is a NetProxy object, to work as if it were local. > > i followed the code starting from site.help to pydoc.doc, which is the > ultimate function that generates and prints the text. 
i expected there > would be some function in the middle that prepares the text, and > another that writes it to the pager, but to my disappointment pydoc.doc > does both. > > this means i can't transfer the document to my local machine (it's > printed directly to the remote console). > > therefore, i would like to split this behavior into two parts: > * render_doc - a function that returns the document text > * doc - a function that calls render_doc and sends it to the pager > > this way no existing code breaks (no existing function signatures > are changed) and i gain help on remote objects. > i hope people would be in favor, as it's not such a big change anyway. > > > is it possible to add to 2.5? > > -tomer Hi Tomer, This doesn't address your small patch since the following described project is, not complete, will be either be a separate module or package, (if/when it's completed), and because it's more of a complete rewrite than not, would be ready and tested no sooner than 2.6. But you might be interested in taking a look at what I've attempted to do so far. (This is too premature to consider adding to python distribution but maybe after completing, and after it is used in the community it could be a candidate.) I am/was working on cleaning up pydoc by trying to separate the various formating, server, and output routines, because they are currently very much intertwined making enhancing and/or maintaining pydoc more difficult than it could be. This has resulted rewriting most of the top level functions, which after having some success, has been put on hold because I have other things to do that are more immediate at the moment. What I have done (so far) is separate the file pydoc.py into multiple files in order to begin to better organize the code, And rewriting each of those files so they are less interdependent on each other and more complete in them selves with the possibility of putting it back together in a single file or a package at some point. 
(I'm leaning on it being a package myself.) My attempt has focused on having pydoc create an intermediate xml-tree data structure that is then be used as the source to generate text, html, and other outputs. This separates the introspective and formatting functions of pydoc. The xml to html function mostly work. (some dependency link tags still need to be reimplemented.) Another improvement I've made is having the html server prepend an html search bar to each page which eliminates the need to have a separate TK search window. This makes interactively browsing with pydoc very easy and nice. :-) It also uses a style sheet to format the html pages, making it easier to change colors, indents, fonts, and borders, etc... If you are interested in working on this or would just like to take a look, let me know, I can put it in a zip file and send it to you. It's not as well organized as I would like at the moment, but the xml and html generation functions work. No attempt to create plain text output from the xml has been made yet. I intended to write the xml to text (and paging help) functions from scratch once the html output is complete and fully working. (mostly done) I think it will not be difficult to get the output very similar to the existing pydoc help text form. I'm thinking it may be better to share it at this time rather than let it sit on my hard drive and possibly get waylayed, or worse, forgotten indefinitely. If this is the "right approach", and is "desirable", maybe it could make a good community project. Possibly someone could post it and/or manage it someplace? It may need a bit more cleaning up before it's ready for that though. Some additional thoughts of why to do this are: I'm hoping there may be a way to run the html server on a web site to generate documentation directly (pydoc-live?) instead of pre-generating html files and placing them on the web site. (Possible?, Has that already been done?) 
I know pydoc html files are available on line, but I think they are all pregenerated by pydoc first then uploaded. That may be preferred in most cases because it would require less server overhead and be more secure, but live pydocs would give more immediate and dynamic feedback when a project is being updated and changed often. Another far off (and more ambitious 3.0 ?) goal may be to use the pydoc generated xml as a source to insert doc strings and other information directly into the python user manuals. That may improve both the python library manuals as well as the pydoc help information by having them more closely tied together. Inconsistencies would be found sooner. The doc strings could be the brief quick ref documents, which it currently isn't always brief and/or present, and the python manual could include the pydoc output (or selected parts of it) along with the longer explanation and discussion. Having pydoc produce xml as an intermediate format makes these types of things easier to do. Cheers, Ron Adam From tomerfiliba at gmail.com Sat Jul 29 02:11:19 2006 From: tomerfiliba at gmail.com (tomer filiba) Date: Sat, 29 Jul 2006 02:11:19 +0200 Subject: [Python-Dev] patching pydoc? Message-ID: <1d85506f0607281711q1152506bn4c3bc8a2bcadb0ef@mail.gmail.com> > Giving the amount of hair-tearing over uuid and __index__, this seems like > an especially bad day to ask for a new-feature variance in a time of > feature freeze ;-). yeah, i guess so. > * Would making pager() a parameter of doc() make sense? not at all. > * I presume you gain the new functionality by directly calling the > factored-out render_doc and printing thru your own pager. Does everyone? i don't quite get the question, but yes, i plan to call render_doc instead of doc, getting the doc text, and printing it on my own console using a pager/whatever. about the signature, okay, it can easily be fixed. i didn't think about that too much. 
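As an illustration of the render/display split proposed in this thread, here is a minimal sketch written against the pydoc module as it ships in current Python 3 (an assumption; the 2.5-era module under discussion offered only doc()). fetch_help_text is a hypothetical helper name, not part of pydoc: it builds the help text as a string and leaves it to the caller to decide where to page or print it.

```python
import pydoc


def fetch_help_text(obj):
    """Return the help text for obj as a plain string, without paging it.

    Hypothetical helper for illustration; only render_doc and plain
    come from pydoc itself.
    """
    # render_doc builds the document text; its default text renderer
    # marks bold with overstrike sequences (char, backspace, char).
    raw = pydoc.render_doc(obj)
    # plain() strips those overstrikes, so the result can be shipped to
    # another process and displayed there with any pager.
    return pydoc.plain(raw)


help_text = fetch_help_text(len)
print(help_text.splitlines()[0])
```

Because nothing is written to the console on the side that owns the object, this avoids swapping out pydoc.pager globally, which is the thread-safety concern raised in the discussion.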
about the other point -- ImportError etc -- i tried to go "dumb" as much as possible and not make any change that would break something. pydoc is poorly written anyway, and as ron adam noted, there is a need for at least a cleaned-up version of pydoc, or even a redesigned package. and it ought to be a package. it's a module of ~2000 lines. that's way too much for a module. i made the minimal changes needed to separate the "generate text" part from the "generate and display" part. and i kept all the semantics in tact (including who handles the errors) and therefore, i don't expect it to break any tests (after restoring the original doc signature, that is). i'd hate to wait for 2.6 for such a small change. today my solution is to replace pydoc.pager by a fake pager that hands over the result. this is of course not thread safe, as multiple threads calling help() would mix their outputs. not very likely, but i'd love to see this code go away asap. thanks for the comments, -tomer > From: Terry Reedy <tjreedy <at> udel.edu> > Subject: Re: patching pydoc? > Newsgroups: gmane.comp.python.devel > Date: 2006-07-28 18:29:50 GMT (5 hours and 27 minutes ago) > > "tomer filiba" <tomerfiliba <at> gmail.com> wrote in message > news:1d85506f0607280635q3a693682l230c7821dc6f408f <at> mail.gmail.com... > ... > > therefore, i would like to split this behavior into two parts: > > * render_doc - a function that returns the document text > > * doc - a function that calls render_doc and sends it to the pager > > > > this way no existing code breaks (no existing function signatures > > are changed) and i gain help on remote objects. > > i hope people would be in favor, as it's not such a big change anyway. > > is it possible to add to 2.5? > > Giving the amount of hair-tearing over uuid and __index__, this seems like > an especially bad day to ask for a new-feature variance in a time of > feature freeze ;-). 
> > Some quick questions: > * I presume you gain the new functionality by directly calling the > factored-out render_doc and printing thru your own pager. Does everyone? > * Would making pager() a parameter of doc() make sense? > * Is pager() the only part of the original doc() that can generate > ImportError, ErrorDuringImport? If not, the try/except should be in > render_doc also or instead. > * Why generalize the doc() signature? Bad calls will be traced as caught > in render_doc instead of doc. Couldn't that potentially break a bad_call > test? > > Terry Jan Reedy From chrism at plope.com Sat Jul 29 02:02:10 2006 From: chrism at plope.com (Chris McDonough) Date: Fri, 28 Jul 2006 20:02:10 -0400 Subject: [Python-Dev] cgi.FieldStorage DOS (sf bug #1112549) Message-ID: <D38E3187-946F-4649-9E6D-F366E1F7E52C@plope.com> From the initial bugreport (http://sourceforge.net/tracker/index.php? func=detail&aid=1112549&group_id=5470&atid=105470) """ Various parts of cgi.FieldStorage call its "read_lines_to_outerboundary", "read_lines" and "skip_lines" methods. These methods use the "readline" method of the file object that represents an input stream. The input stream is typically data supplied by an untrusted source (such as a user uploading a file from a web browser). The input data is not required by the RFC 822/1521/1522/1867 specifications to contain any newline characters. For example, it is within the bounds of the specification to supply a a multipart/form-data input stream with a "file-data" part that consists of a 2GB string composed entirely of "x" characters (which happens to be something I did that led me to noticing this bug). """ This bug has been around for about a year but I just worked up a patch yesterday that applies OK against current SVN. It's attached to the issue. Would someone be so kind as to check it in? Guido has already reviewed it, I believe. 
- C

From david.nospam.hopwood at blueyonder.co.uk Sat Jul 29 02:14:21 2006
From: david.nospam.hopwood at blueyonder.co.uk (David Hopwood)
Date: Sat, 29 Jul 2006 01:14:21 +0100
Subject: [Python-Dev] Bad interaction of __index__ and sequence repeat
In-Reply-To: <44CA935E.4010901@v.loewis.de>
References: <20060728101133.GA339@code0.codespeak.net>
    <44CA082B.6030908@blueyonder.co.uk> <44CA2D4F.1020601@gmail.com>
    <44CA3837.3040506@ieee.org> <44CA935E.4010901@v.loewis.de>
Message-ID: <44CAA85D.5010907@blueyonder.co.uk>

Martin v. Löwis wrote:
> Travis Oliphant wrote:
>
>>I say it's a bug that should be fixed. Don't clear the error, raise it.
>
> Several people have said this, but I don't think it can work.
>
> If you raise an OverflowError in __index__, the slicing code cannot know
> whether this meant as overflow or underflow (in a signed sense).

Why not use IndexError for an underflow, and OverflowError for an overflow?

-- 
David Hopwood <david.nospam.hopwood at blueyonder.co.uk>

From cvaughn at gmail.com Sat Jul 29 03:34:36 2006
From: cvaughn at gmail.com (Charles Vaughn)
Date: Fri, 28 Jul 2006 20:34:36 -0500
Subject: [Python-Dev] Eliminating loops
Message-ID: <3bcbd0d70607281834g3b21146bi5dfd700cea47d4f2@mail.gmail.com>

I'm looking for a way of modifying the compiler to eliminate any loops and
recursion from code. It's for a high speed data processing application.
The alternative is a custom language that is little more than glorified
assembly. I'd like to be able to use everything else around Python, but we
can't allow the users to create more than O(1) complexity.

From brett at python.org Sat Jul 29 03:40:43 2006
From: brett at python.org (Brett Cannon)
Date: Fri, 28 Jul 2006 18:40:43 -0700
Subject: [Python-Dev] Eliminating loops
In-Reply-To: <3bcbd0d70607281834g3b21146bi5dfd700cea47d4f2@mail.gmail.com>
References: <3bcbd0d70607281834g3b21146bi5dfd700cea47d4f2@mail.gmail.com>
Message-ID: <bbaeab100607281840g6ea3db63y5eb89b11bc1a424@mail.gmail.com>

On 7/28/06, Charles Vaughn <cvaughn at gmail.com> wrote:
>
> I'm looking for a way of modifying the compiler to eliminate any loops and
> recursion from code. It's for a high speed data processing application.
> The alternative is a custom language that is little more than glorified
> assembly. I'd like to be able to use everything else around Python, but we
> can't allow the users to create more than O(1) complexity.
>

Well, the problem is how are you going to infer how many times to unroll the
loop? range() and xrange() can change since they are only built-ins. You
would need type inference on some locally defined variable that was set to a
syntactically defined atomic type in order to know the proper number of times
to unroll the loop and that might not work for your needs (on top of being a
pain to write out a list with however many values you want to have the loop
iterate over).

-Brett

From greg.ewing at canterbury.ac.nz Sat Jul 29 04:50:21 2006
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Sat, 29 Jul 2006 14:50:21 +1200
Subject: [Python-Dev] Bad interaction of __index__ and sequence repeat
In-Reply-To: <20060728101133.GA339@code0.codespeak.net>
References: <20060728101133.GA339@code0.codespeak.net>
Message-ID: <44CACCED.4090307@canterbury.ac.nz>

Armin Rigo wrote:

> This is because PySequence_Repeat(v, w) works by applying w.__index__ in
> order to call v->sq_repeat.

Why does it do that? Shouldn't __index__ only be used for numbers which are
going to be used as an index?

> However, __index__ is defined to clip the
> result to fit in a Py_ssize_t.

Why is it defined to do this instead of raising OverflowError?

--
Greg

From greg.ewing at canterbury.ac.nz Sat Jul 29 05:09:46 2006
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Sat, 29 Jul 2006 15:09:46 +1200
Subject: [Python-Dev] Bad interaction of __index__ and sequence repeat
In-Reply-To: <1f7befae0607280855l3f71febbo95ff5751fe31d99c@mail.gmail.com>
References: <20060728101133.GA339@code0.codespeak.net>
    <1f7befae0607280855l3f71febbo95ff5751fe31d99c@mail.gmail.com>
Message-ID: <44CAD17A.9090902@canterbury.ac.nz>

Tim Peters wrote:

> So, e.g., plain a[i] shouldn't use __index__ either if i is already
> int or long. I don't see any justification for invoking nb_index in
> sequence_repeat(), although if someone thinks it should, then as for
> plain indexing it certainly shouldn't invoke nb_index if the incoming
> count is an int or long to begin with.

Hmmm. So that means anything accepting an integer index needs to do its own
range checking anyway.
So having __index__ do clipping is at best unnecessary and at worst counterproductive, since it could suppress an overflow or range exception that *should* be produced by the code using the index, and would be if it got the equivalent index value as an int or long directly. -- Greg From greg.ewing at canterbury.ac.nz Sat Jul 29 05:11:57 2006 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Sat, 29 Jul 2006 15:11:57 +1200 Subject: [Python-Dev] Bad interaction of __index__ and sequence repeat In-Reply-To: <ca471dc20607280905y7447d005r2ab0ec664a69948@mail.gmail.com> References: <20060728101133.GA339@code0.codespeak.net> <44CA082B.6030908@blueyonder.co.uk> <44CA2D4F.1020601@gmail.com> <ca471dc20607280905y7447d005r2ab0ec664a69948@mail.gmail.com> Message-ID: <44CAD1FD.5090602@canterbury.ac.nz> Guido van Rossum wrote: > In my recollection I tried to avoid this exact behavior. I wanted > __index__() to just return the unclipped int or long value, but have a > C API that clipped it for use in slice operations. So is there still a chance to fix it? -- Greg From murman at gmail.com Sat Jul 29 05:14:58 2006 From: murman at gmail.com (Michael Urman) Date: Fri, 28 Jul 2006 22:14:58 -0500 Subject: [Python-Dev] struct module and coercing floats to integers In-Reply-To: <9E51A030-B610-414B-98E0-1923F1C3E862@redivi.com> References: <9E51A030-B610-414B-98E0-1923F1C3E862@redivi.com> Message-ID: <dcbbbb410607282014w30459351ifb09c56075997be1@mail.gmail.com> On 7/28/06, Bob Ippolito <bob at redivi.com> wrote: > http://python.org/sf/1530559 > > [1] The pre-2.5 behavior should really be considered a bug, the > documentation says "Return a string containing the values v1, v2, ... > packed according to the given format. The arguments must match the > values required by the format exactly." I wouldn't consider arbitrary > floating point numbers to match the value required by an integer > format exactly. Floats are not in general interchangeable with > integers in Python anyway (e.g. 
list indexes, etc.).

While it may be a bug, it's not as hard to run into, nor as illogical as
the presentation here makes it sound. The original code[1] took a float
value between 0 and 2, and wanted to use

    pack('>H', round(value * 32768))

The workaround is a trivial change

    pack('>H', int(round(value * 32768)))

but the timeframe is less than ideal, as working code will suddenly stop
and receive only a mildly helpful error message. The fact that round
returns a float rather than an int, while intentional, does not feature
prominently in one's mind when the first version yielded the expected
results.

I would appreciate option 2 which retains compatibility but warns that
the construct is bad. I will accept any of the options, as it's clear
that floats don't make sense. It's just unfortunate that the previous
implementation let them through in a way the new implementation does
not.

[1] http://www.sacredchao.net/quodlibet/changeset/3706

Michael
-- 
Michael Urman http://www.tortall.net/mu/blog

From greg.ewing at canterbury.ac.nz Sat Jul 29 05:44:33 2006
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Sat, 29 Jul 2006 15:44:33 +1200
Subject: [Python-Dev] Rounding float to int directly (Re: struct module and
    coercing floats to integers)
In-Reply-To: <dcbbbb410607282014w30459351ifb09c56075997be1@mail.gmail.com>
References: <9E51A030-B610-414B-98E0-1923F1C3E862@redivi.com>
    <dcbbbb410607282014w30459351ifb09c56075997be1@mail.gmail.com>
Message-ID: <44CAD9A1.8080000@canterbury.ac.nz>

Michael Urman wrote:
> The fact that
> round returns a float rather than an int, while intentional, does not
> feature prominently in one's mind when the first version yielded the
> expected results.

As an aside, does anyone else think that it would be useful to have a
builtin which rounds and converts to an int in one go? Whenever I use
round(), I almost always want the result as an int, and making me do it
in two steps seems unnecessarily bothersome.
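To make the workaround quoted above concrete, here is a minimal runnable sketch. It assumes a current Python 3 interpreter rather than the 2.5 pre-release under discussion, and pack_ratio is a hypothetical helper name. The explicit int() mirrors the two-step conversion from the thread; in Python 3 a one-argument round() already returns an int, so the extra call is redundant there but harmless.

```python
import struct


def pack_ratio(value):
    """Pack a float in [0, 2) as a big-endian unsigned 16-bit number.

    Hypothetical helper for illustration only.
    """
    # Round first, then convert explicitly, so that struct.pack is
    # handed an integer for the integer format code 'H'.
    scaled = int(round(value * 32768))
    return struct.pack('>H', scaled)


packed = pack_ratio(1.5)   # 1.5 * 32768 == 49152 == 0xC000
print(packed)
```

Passing the float straight through, as in struct.pack('>H', 49152.0), is rejected with struct.error on current interpreters, which is the stricter behaviour the thread is weighing.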
Since automatic float->int coercion is being increasingly disallowed, use cases for this are becoming more and more common. -- Greg From ncoghlan at gmail.com Sat Jul 29 05:49:51 2006 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sat, 29 Jul 2006 13:49:51 +1000 Subject: [Python-Dev] Bad interaction of __index__ and sequence repeat In-Reply-To: <44CA935E.4010901@v.loewis.de> References: <20060728101133.GA339@code0.codespeak.net> <44CA082B.6030908@blueyonder.co.uk> <44CA2D4F.1020601@gmail.com> <44CA3837.3040506@ieee.org> <44CA935E.4010901@v.loewis.de> Message-ID: <44CADADF.1040801@gmail.com> Martin v. L?wis wrote: > Travis Oliphant wrote: >> I say it's a bug that should be fixed. Don't clear the error, raise it. > > Several people have said this, but I don't think it can work. > > If you raise an OverflowError in __index__, the slicing code cannot know > whether this meant as overflow or underflow (in a signed sense). It can actually, but you have to allow 3 possible error return values from nb_index (-1, PY_SSIZE_T_MIN and PY_SSIZE_T_MAX). This is ugly as hell [1], so I'm going to try to work out a patch that changes the signature of nb_index to return PyObject * (leaving the client code in the core to decide whether to clip, overflow, or leave the result alone). Cheers, Nick. 
[1] http://www.python.org/sf/1530738 -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- http://www.boredomandlaziness.org From ncoghlan at gmail.com Sat Jul 29 05:53:21 2006 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sat, 29 Jul 2006 13:53:21 +1000 Subject: [Python-Dev] New miniconf module In-Reply-To: <Pine.LNX.4.64.0607281438120.10450@sylvain> References: <Pine.LNX.4.64.0607261613130.6037@sylvain> <Pine.LNX.4.64.0607261402410.5956@sylvain> <44C7B8C2.1060904@blueyonder.co.uk> <Pine.LNX.4.64.0607261613130.6037@sylvain> <5.1.1.6.0.20060726180017.0262c990@sparrow.telecommunity.com> <Pine.LNX.4.64.0607270200300.6000@sylvain> <20060727103331.GA31912@code0.codespeak.net> <Pine.LNX.4.64.0607281438120.10450@sylvain> Message-ID: <44CADBB1.6090401@gmail.com> Sylvain Fourmanoit wrote: > Armin Rigo wrote: >> If it goes in that direction, I'd suggest to rename the module to give >> it a name closer to existing persistence-related modules already in the >> stdlib. > > I am not especially fond of the current miniconf name either; I didn't > find something more suitable, yet evocative of what it does; I would be > glad to hear any suggestion you or the rest of the developers would have. pyson :) Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- http://www.boredomandlaziness.org From jcarlson at uci.edu Sat Jul 29 10:05:53 2006 From: jcarlson at uci.edu (Josiah Carlson) Date: Sat, 29 Jul 2006 01:05:53 -0700 Subject: [Python-Dev] Eliminating loops In-Reply-To: <3bcbd0d70607281834g3b21146bi5dfd700cea47d4f2@mail.gmail.com> References: <3bcbd0d70607281834g3b21146bi5dfd700cea47d4f2@mail.gmail.com> Message-ID: <20060729004957.E623.JCARLSON@uci.edu> "Charles Vaughn" <cvaughn at gmail.com> wrote: > I'm looking for a way of modifying the compiler to eliminate any loops and > recursion from code. 
It's for a high speed data processing application.
> The alternative is a custom language that is little more than glorified
> assembly. I'd like to be able to use everything else around Python, but we
> can't allow the users to create more than O(1) complexity.

One of the larger, if not largest advances in computer science in the
last 50 years was the design and implementation of looping as a
construct, not just an artifact of structured gotos, but as a method of
implementing algorithms.

With your proposed removal of loops and recursion, you are effectively
saying that users should be given a Turing machine because you are
afraid of them doing foolish things with the language. Well, since the
user is going to do foolish things with the language anyways, about all
you can really do is to test, analyze, and verify. Oh, and educate.

What do I mean? If your users have access to sequences of any type, and
they can't perform 'x in y', then they are going to write their own
contains/index function...

    def index(x,y):
        if x == y[0]:
            return 0
        elif x == y[1]:
            return 1
        elif x == y[2]:
            return 2
        ...

If they aren't given access to sequences, then they will do what used to
be done in QuakeC back in the day...

    def index(x,y):
        if x == y0:
            return 0
        elif x == y1:
            return 1
        elif x == y2:
            return 2
        ...

While algorithm design and analysis isn't something that everyone can do
(some just can't handle the math), the users really should understand at
least a bit about algorithms before they work on the "high speed data
processing" application.

 - Josiah

From g.brandl at gmx.net Sat Jul 29 10:07:42 2006
From: g.brandl at gmx.net (Georg Brandl)
Date: Sat, 29 Jul 2006 10:07:42 +0200
Subject: [Python-Dev] Using Python docs
Message-ID: <eaf4u4$gfc$1@sea.gmane.org>

Regarding bug 469773, I think it would be great to have such a
document "Using Python", containing the manual page and
platform-specific hints on how to invoke the interpreter and scripts
(e.g. explaining the shebang for Unices).
I'd be willing to help write up such a document. Another thing that could be helpful is a list of "frequently needed documentation sections", that is, a list of keywords and respective links to topics that are hard to find for newbies, such as the section "String formatting" or some locations in the reference manual. Georg From arigo at tunes.org Sat Jul 29 10:56:12 2006 From: arigo at tunes.org (Armin Rigo) Date: Sat, 29 Jul 2006 10:56:12 +0200 Subject: [Python-Dev] Bad interaction of __index__ and sequence repeat In-Reply-To: <ca471dc20607281131m34f752b4pb1ff933f2d2262d9@mail.gmail.com> References: <20060728101133.GA339@code0.codespeak.net> <1f7befae0607280855l3f71febbo95ff5751fe31d99c@mail.gmail.com> <44CA3C3D.90805@gmail.com> <44CA552C.9020805@gmail.com> <ca471dc20607281131m34f752b4pb1ff933f2d2262d9@mail.gmail.com> Message-ID: <20060729085612.GA30795@code0.codespeak.net> Hi Guido, On Fri, Jul 28, 2006 at 11:31:09AM -0700, Guido van Rossum wrote: > No time to look through the code here, but IMO it's acceptable (at > least for 2.5) if (2**100).__index__() raises OverflowError, as long > as x[:2**100] silently clips. __index__() is primarily meant to return > a value useful for indexing concrete sequences, and 2**100 isn't. If nb_index keeps returning a Py_ssize_t with clipping, it means that there is no way to write in pure Python an object that emulates a long -- only an int. Sounds inconsistent with the int/long unification trend for pure Python code. It would make it awkward to write, say, pure Python classes that pretend to be very large sequences, because using __index__ in such code wouldn't work. Another example of this is that if places like sequence_repeat are made to use the following pseudo-logic: if isinstance(w, long) and w > sys.maxint: raise OverflowError else: i = w.__index__() then if an object 'l' is an emulated pseudo-long, then "x"*l will still silently clip the pseudo-long to sys.maxint. 
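The "emulated pseudo-long" case described above can be tried out directly. The sketch below shows what a present-day CPython 3 interpreter does (an assumption about the reader's environment; it is not the 2.5 behaviour being debated). PseudoLong is a hypothetical class defined only for this illustration.

```python
import operator


class PseudoLong:
    """Hypothetical integer-like object that is not an int subclass."""

    def __init__(self, value):
        self._value = value

    def __index__(self):
        # Hand back the full value, however large it is.
        return self._value


huge = PseudoLong(2**100)

# Asking for the index value directly: nothing is clipped.
unclipped = operator.index(huge)

# Sequence repetition: record whether the count was clipped or rejected.
try:
    "x" * huge
    repeat_outcome = "clipped"
except OverflowError:
    repeat_outcome = "OverflowError"

# Slicing: an out-of-range bound is clamped silently.
sliced = "abc"[:huge]

print(unclipped == 2**100, repeat_outcome, sliced)
```

On such an interpreter the three operations give three different answers (full value, error, silent clamp), which matches the separation between the slot and the two kinds of C API helper argued for in this message.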
I'm more in favor of changing nb_index to return a PyObject *, since now is our last chance to do so. A pair of API functions can be added to return a Py_ssize_t with either the proper clipping, or the proper OverflowError'ing. A bientot, Armin. From ncoghlan at gmail.com Sat Jul 29 16:06:53 2006 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sun, 30 Jul 2006 00:06:53 +1000 Subject: [Python-Dev] Bad interaction of __index__ and sequence repeat In-Reply-To: <20060728101133.GA339@code0.codespeak.net> References: <20060728101133.GA339@code0.codespeak.net> Message-ID: <44CB6B7D.1040001@gmail.com> Armin Rigo wrote: > Hi, > > There is an oversight in the design of __index__() that only just > surfaced :-( It is responsible for the following behavior, on a 32-bit > machine with >= 2GB of RAM: > > >>> s = 'x' * (2**100) # works! > >>> len(s) > 2147483647 > > This is because PySequence_Repeat(v, w) works by applying w.__index__ in > order to call v->sq_repeat. However, __index__ is defined to clip the > result to fit in a Py_ssize_t. This means that the above problem exists > with all sequences, not just strings, given enough RAM to create such > sequences with 2147483647 items. > > For reference, in 2.4 we correctly get an OverflowError. > > Argh! What should be done about it? I've now got a patch on SF that aims to fix this properly [1]. The gist of the patch: 1. Redesign the PyNumber_Index C API to serve the actual use cases in the interpreter core and the standard library. The PEP 357 abstract C API as written was bypassed by nearly all of the uses in the core and the standard library. The patch redesigns that API to reduce code duplication between the various parts of the code base that were previously calling nb_index directly. 
The principal change is to provide an "is_index" output variable that the various mp_subscript implementations can use to determine whether or not the passed in object was an index or not, rather than having to repeat the type check everywhere. The rationale for doing it this way: a. you may want to try something else (e.g. the mp_subscript implementations in the standard library try indexing before checking to see if the object is a slice object) b. a different error message may be wanted (e.g. the normal indexing related Type Error doesn't make sense for sequence repetition) c. a separate checking function would lead to repeating the check on common code paths (e.g. if an mp_subscript implementation did the type check first, and then PyNumber_Check did it again to see whether or not to raise an error) The output variable solves the problem nicely: either pass in NULL to get the default behaviour of raising a sequence indexing TypeError, or pass in a pointer to a C int in order to be told whether or not the typecheck succeeded without an exception actually being set if it fails (if the typecheck passes, but the actual call fails, the exception state is set as normal). Additionally, PyNumber_Index is redefined to raise an IndexError for values that cannot be represented as a Py_ssize_t. The choice of IndexError was made based on the dominant usage in the standard library (IndexError is the correct error to raise so that an mp_subscript implementation does the right thing). There are only a few places that need to override the IndexError to replace it with OverflowError (the length argument to slice.indices, sequence repetition, the mmap constructor), whereas all of the sequence objects (list, tuple, string, unicode, array), as well as PyObject_Get/Set/DelItem, need it to raise IndexError. 
Raising IndexError also benefits sequences implemented in Python, which can
simply do:

    def __getitem__(self, idx):
        if isinstance(idx, slice):
            return self._get_slice(idx)
        idx = operator.index(idx) # Will raise IndexError on overflow

A second API function PyNumber_SliceIndex is added so that the clipping
semantics are still available where needed and _PyEval_SliceIndex is modified
to use the new public API. This is exposed to Python code as
operator.sliceindex().

With the redesigned C API, the *only* code that calls the nb_index slot
directly is the two functions in abstract.c. Everything else uses one or the
other of those interfaces. Code duplication was significantly reduced as a
result, and it should be much easier for 3rd party C libraries to do what they
need to do (i.e. implementing mp_subscript slots).

2. Redefine nb_index to return a PyObject *

Returning the PyInt/PyLong objects directly from nb_index greatly simplified
the implementation of the nb_index methods for the affected classes. For
classic classes, instance_index could be modified to simply return the result
of calling __index__, as could slot_nb_index for new-style classes. For the
standard library classes, the existing int_int method, and the long_long
method could be used instead of needing new functions.

This convenience should hold true for extension classes - instead of needing
to implement __index__ separately, they should be able to reuse their existing
__int__ or __long__ implementations.

The other benefit is that the logic to downconvert to Py_ssize_t that was
formerly invoked by long's __index__ method is now instead invoked by
PyNumber_Index and PyNumber_SliceIndex.
This means that directly calling an __index__() method allows large long
results to be passed through unaffected, but calling the indexing operator
will raise IndexError if the long is outside the memory address space:

    (2 ** 100).__index__() == (2**100) # This works
    operator.index(2**100) # This raises IndexError

The patch includes additions to test_index.py to cover these limit cases, as
well as the necessary updates to the C API and operator module documentation.

Cheers,
Nick.

[1] http://www.python.org/sf/1530738

--
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia
---------------------------------------------------------------
            http://www.boredomandlaziness.org

From david.nospam.hopwood at blueyonder.co.uk Sat Jul 29 16:09:03 2006
From: david.nospam.hopwood at blueyonder.co.uk (David Hopwood)
Date: Sat, 29 Jul 2006 15:09:03 +0100
Subject: [Python-Dev] New miniconf module
In-Reply-To: <44CADBB1.6090401@gmail.com>
References: <Pine.LNX.4.64.0607261613130.6037@sylvain>
 <Pine.LNX.4.64.0607261402410.5956@sylvain>
 <44C7B8C2.1060904@blueyonder.co.uk>
 <Pine.LNX.4.64.0607261613130.6037@sylvain>
 <5.1.1.6.0.20060726180017.0262c990@sparrow.telecommunity.com>
 <Pine.LNX.4.64.0607270200300.6000@sylvain>
 <20060727103331.GA31912@code0.codespeak.net>
 <Pine.LNX.4.64.0607281438120.10450@sylvain>
 <44CADBB1.6090401@gmail.com>
Message-ID: <44CB6BFF.4010700@blueyonder.co.uk>

Nick Coghlan wrote:
> Sylvain Fourmanoit wrote:
>>Armin Rigo wrote:
>>
>>>If it goes in that direction, I'd suggest to rename the module to give
>>>it a name closer to existing persistence-related modules already in the
>>>stdlib.
>>
>>I am not especially fond of the current miniconf name either; I didn't
>>find something more suitable, yet evocative of what it does; I would be
>>glad to hear any suggestion you or the rest of the developers would have.
>
> pyson :)

Following the pattern of JSON, it would be "PYON" (PYthon Object Notation).
--
David Hopwood <david.nospam.hopwood at blueyonder.co.uk>

From ncoghlan at iinet.net.au Sat Jul 29 16:19:27 2006
From: ncoghlan at iinet.net.au (Nick Coghlan)
Date: Sun, 30 Jul 2006 00:19:27 +1000
Subject: [Python-Dev] Bad interaction of __index__ and sequence repeat
In-Reply-To: <44CB6B7D.1040001@gmail.com>
References: <20060728101133.GA339@code0.codespeak.net>
 <44CB6B7D.1040001@gmail.com>
Message-ID: <44CB6E6F.5010507@iinet.net.au>

Nick Coghlan wrote:
> The other benefit is that the logic to downconvert to Py_ssize_t that
> was formerly invoked by long's __index__ method is now instead invoked
> by PyNumber_Index and PyNumber_SliceIndex. This means that directly
> calling an __index__() method allows large long results to be passed
> through unaffected, but calling the indexing operator will raise
> IndexError if the long is outside the memory address space:
>
> (2 ** 100).__index__() == (2**100) # This works
> operator.index(2**100) # This raises IndexError
>
> The patch includes additions to test_index.py to cover these limit
> cases, as well as the necessary updates to the C API and operator module
> documentation.

I forgot to mention the main benefit of this: when working with a
pseudo-sequence rather than a concrete one, __index__() can be used directly
to ensure you are working with integral data types while still allowing access
to the full range of representable integer values.

operator.index is available for when what you have really is a concrete data
set that is limited to the memory capacity of a single machine, and
operator.sliceindex for when you want to clamp at the memory address space
limits rather than raising an exception.

Cheers,
Nick.
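
As a rough illustration of the pseudo-sequence case (a sketch only: the
Squares class is made up for this example and is not part of the patch), a
container that computes its items on demand can call __index__() directly, so
any integral type is accepted and values beyond the Py_ssize_t range pass
through untouched:

```python
class Squares(object):
    """Hypothetical pseudo-sequence: item i is i*i, nothing is stored."""

    def __getitem__(self, idx):
        try:
            # Call __index__() directly instead of operator.index(), so the
            # value is never limited to what fits in a Py_ssize_t.
            i = idx.__index__()
        except AttributeError:
            # Objects without __index__ (floats, strings, ...) are rejected.
            raise TypeError("indices must be integers")
        return i * i
```

With this, Squares()[2**100] is simply computed, whereas a concrete sequence
going through operator.index would, under the semantics proposed in the
patch, reject such an index.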
--
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia
---------------------------------------------------------------
            http://www.boredomandlaziness.org

From tonynelson at georgeanelson.com Sat Jul 29 17:29:03 2006
From: tonynelson at georgeanelson.com (Tony Nelson)
Date: Sat, 29 Jul 2006 11:29:03 -0400
Subject: [Python-Dev] Testing Socket Timeouts patch 1519025
Message-ID: <v04020a01c0f129ae3d64@[192.168.123.162]>

I'm trying to write a test for my Socket Timeouts patch [1], which fixes
signal handling (notably Ctl-C == SIGINT == KeyboardInterrupt) on socket
operations using a timeout. I don't see a portable way to send a signal,
and asking the test runner to press Ctl-C is a non-starter. A "real"
signal is needed to interrupt the select() (or equivalent) call, because
that's what wasn't being handled correctly. The bug should happen on the
other platforms I don't know how to test on.

Is there a portable way to send a signal? SIGINT would be best, but
another signal (such as SIGALRM) would do, I think.

If not, should I write the test to only work on systems implementing
SIGALRM, the signal I'm using now, or implementing kill(), or what?

[1] <http://www.python.org/sf/1519025>
____________________________________________________________________
TonyN.:'                       <mailto:tonynelson at georgeanelson.com>
      '                              <http://www.georgeanelson.com/>

From ncoghlan at iinet.net.au Sat Jul 29 18:55:13 2006
From: ncoghlan at iinet.net.au (Nick Coghlan)
Date: Sun, 30 Jul 2006 02:55:13 +1000
Subject: [Python-Dev] Bad interaction of __index__ and sequence repeat
In-Reply-To: <44CB6B7D.1040001@gmail.com>
References: <20060728101133.GA339@code0.codespeak.net>
 <44CB6B7D.1040001@gmail.com>
Message-ID: <44CB92F1.9040304@iinet.net.au>

Nick Coghlan wrote:
> Armin Rigo wrote:
>> Hi,
>>
>> There is an oversight in the design of __index__() that only just
>> surfaced :-( It is responsible for the following behavior, on a 32-bit
>> machine with >= 2GB of RAM:
>>
>> >>> s = 'x' * (2**100) # works!
>> >>> len(s)
>> 2147483647
>>
>> This is because PySequence_Repeat(v, w) works by applying w.__index__ in
>> order to call v->sq_repeat. However, __index__ is defined to clip the
>> result to fit in a Py_ssize_t. This means that the above problem exists
>> with all sequences, not just strings, given enough RAM to create such
>> sequences with 2147483647 items.
>>
>> For reference, in 2.4 we correctly get an OverflowError.
>>
>> Argh! What should be done about it?
>
> I've now got a patch on SF that aims to fix this properly [1].

I revised this patch to further reduce the code duplication associated with
the indexing code in the standard library.

The patch now has three new functions in the abstract C API:

  PyNumber_Index (used in a dozen or so places)
    - raises IndexError on overflow
  PyNumber_AsSsize_t (used in 3 places)
    - raises OverflowError on overflow
  PyNumber_AsClippedSsize_t() (used once, by _PyEval_SliceIndex)
    - clips to PY_SSIZE_T_MIN/MAX on overflow

All 3 have an int * output argument allowing type errors to be flagged
directly to the caller rather than through PyErr_Occurred().

Of the 3, only PyNumber_Index is exposed through the operator module.

Probably the most interesting thing now would be for Travis to review it, and
see whether it makes things easier to handle for the Numeric scalar types
(given the amount of code the patch deleted from the builtin and standard
library data types, hopefully the benefits to Numeric will be comparable).

Cheers,
Nick.
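
To make the three overflow policies concrete, here is a pure-Python model of
them. It is only a sketch: the function names are invented for illustration,
the real functions are C API calls, the int * output argument is not modelled,
and sys.maxsize is assumed to stand in for PY_SSIZE_T_MAX.

```python
import sys

# Stand-ins for PY_SSIZE_T_MAX / PY_SSIZE_T_MIN (assumption: sys.maxsize
# matches the platform's Py_ssize_t range).
SSIZE_MAX = sys.maxsize
SSIZE_MIN = -sys.maxsize - 1


def _as_integer(obj):
    """Return obj.__index__(), or raise TypeError if obj is not an index."""
    try:
        return obj.__index__()
    except AttributeError:
        raise TypeError("object cannot be interpreted as an index")


def index_strict(obj):
    """Model of the proposed PyNumber_Index: IndexError when out of range."""
    value = _as_integer(obj)
    if not SSIZE_MIN <= value <= SSIZE_MAX:
        raise IndexError("index does not fit in Py_ssize_t")
    return value


def as_ssize_t(obj):
    """Model of PyNumber_AsSsize_t: OverflowError when out of range."""
    value = _as_integer(obj)
    if not SSIZE_MIN <= value <= SSIZE_MAX:
        raise OverflowError("value does not fit in Py_ssize_t")
    return value


def as_clipped_ssize_t(obj):
    """Model of PyNumber_AsClippedSsize_t: clamp to the Py_ssize_t range."""
    value = _as_integer(obj)
    return max(SSIZE_MIN, min(SSIZE_MAX, value))
```

The only difference between the three is what happens once the integer value
is known to be out of range, which is why sharing the conversion step removes
so much duplicated code.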
[1] http://www.python.org/sf/1530738

--
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia
---------------------------------------------------------------
            http://www.boredomandlaziness.org

From nnorwitz at gmail.com Sat Jul 29 21:31:29 2006
From: nnorwitz at gmail.com (Neal Norwitz)
Date: Sat, 29 Jul 2006 12:31:29 -0700
Subject: [Python-Dev] test_uuid
Message-ID: <ee2a432c0607291231s6ad5a9bdt8a3cfa599a52acb3@mail.gmail.com>

Ping,

I just checked in a change to disable testing 2 uuid functions
(_ifconfig_get_node and unixdll_getnode) that fail on many platforms.
Here's the message:

"""
Disable these tests until they are reliable across platforms. These
problems may mask more important, real problems.

One or both methods are known to fail on: Solaris, OpenBSD, Debian, Ubuntu.
They pass on Windows and some Linux boxes.
"""

Can you fix these issues or at least provide guidance how they should be
fixed?

n

From python-dev at zesty.ca Sat Jul 29 22:00:50 2006
From: python-dev at zesty.ca (Ka-Ping Yee)
Date: Sat, 29 Jul 2006 15:00:50 -0500 (CDT)
Subject: [Python-Dev] uuid test suite failing
In-Reply-To: <ee2a432c0607280024j56bf7facmd9ebdd26aafc013f@mail.gmail.com>
References: <eaamnu$mrq$1@sea.gmane.org> <20060727163647.GA4175@rogue.amk.ca>
 <ee2a432c0607280024j56bf7facmd9ebdd26aafc013f@mail.gmail.com>
Message-ID: <Pine.LNX.4.58.0607291458310.23895@server1.LFW.org>

On Fri, 28 Jul 2006, Neal Norwitz wrote:
> This only fixes 1 of the 2 failures in test_uuid. The other one is
> due to _unixdll_getnode() failing. This is because
> _uuid_generate_time is None because we couldn't find it in the uuid
> library. This is just broken, not sure if it's the code or the test
> though. We should handle the case if _uuid_generate_time and the
> others are None better. I don't know what to do in this case.

The design intention is this: each of the various *_getnode()
functions is supposed to work on the platform for which it was written.
For example, _windll_getnode() is supposed to work on Windows, and
will raise an exception on other platforms; if it raises an exception
on Windows, something is wrong (the code's expectations of the OS are
not met). When uuid_generate_time is unavailable, _unixdll_getnode()
is supposed to fail. The getnode() function is just supposed to get
an available MAC address; that's why it catches any exceptions raised
by the *_getnode() functions.

-- ?!ng

From python-dev at zesty.ca Sat Jul 29 22:05:16 2006
From: python-dev at zesty.ca (Ka-Ping Yee)
Date: Sat, 29 Jul 2006 15:05:16 -0500 (CDT)
Subject: [Python-Dev] test_uuid
In-Reply-To: <ee2a432c0607291231s6ad5a9bdt8a3cfa599a52acb3@mail.gmail.com>
References: <ee2a432c0607291231s6ad5a9bdt8a3cfa599a52acb3@mail.gmail.com>
Message-ID: <Pine.LNX.4.58.0607291501220.23895@server1.LFW.org>

On Sat, 29 Jul 2006, Neal Norwitz wrote:
> I just checked in a change to disable testing 2 uuid functions
> (_ifconfig_get_node and unixdll_getnode) that fail on many platforms.
> Here's the message:
>
> """
> Disable these tests until they are reliable across platforms. These
> problems may mask more important, real problems.
>
> One or both methods are known to fail on: Solaris, OpenBSD, Debian, Ubuntu.
> They pass on Windows and some Linux boxes.
> """

_ifconfig_get_node() should work on all Linuxes. (Thanks for fixing
it to work on more types of Unix.) It's okay for unixdll_getnode to
fail when the necessary shared library is not available.

Ideally, test_uuid should serve as a record of which platforms we
expect these routines to work on. The uuid module as a whole isn't
broken if one of these routines fails; it just means that we don't
have complete platform coverage and/or test_uuid has inaccurate
expectations of which routines work on which platforms.
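
The design described above can be sketched roughly as follows. This is a
simplified illustration, not the code in uuid.py: first_working_node and the
getters passed to it are hypothetical stand-ins for getnode() and the
platform-specific *_getnode() helpers.

```python
def first_working_node(getters):
    """Return the first node value produced by any getter, else None.

    `getters` is a list of zero-argument callables standing in for the
    platform-specific *_getnode() helpers (hypothetical in this sketch).
    """
    for getter in getters:
        try:
            node = getter()
        except Exception:
            # This helper does not apply on the current platform (or its
            # library is missing); that is expected, so try the next one.
            continue
        if node is not None:
            return node
    return None
```

A single helper failing is therefore not an error for the module as a whole;
only the per-platform tests need to know which helpers are expected to work
where.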
-- ?!ng

From jcarlson at uci.edu Sat Jul 29 23:38:38 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Sat, 29 Jul 2006 14:38:38 -0700
Subject: [Python-Dev] Testing Socket Timeouts patch 1519025
In-Reply-To: <v04020a01c0f129ae3d64@[192.168.123.162]>
References: <v04020a01c0f129ae3d64@[192.168.123.162]>
Message-ID: <20060729143048.E630.JCARLSON@uci.edu>

Tony Nelson <tonynelson at georgeanelson.com> wrote:
>
> I'm trying to write a test for my Socket Timeouts patch [1], which fixes
> signal handling (notably Ctl-C == SIGINT == KeyboardInterrupt) on socket
> operations using a timeout. I don't see a portable way to send a signal,
> and asking the test runner to press Ctl-C is a non-starter. A "real"
> signal is needed to interrupt the select() (or equivalent) call, because
> that's what wasn't being handled correctly. The bug should happen on the
> other platforms I don't know how to test on.
>
> Is there a portable way to send a signal? SIGINT would be best, but
> another signal (such as SIGALRM) would do, I think.

According to my (limited) research on signals, Windows signal support is
horrible. I have not been able to have Python send signals of any kind
other than SIGABRT, and then only to the currently running process,
which kills it (regardless of whether you have a signal handler or not).

> If not, should I write the test to only work on systems implementing
> SIGALRM, the signal I'm using now, or implementing kill(), or what?

I think that most non-Windows platforms should have non-braindead signal
support, though the signal module seems to be severely lacking in
sending any signal except for SIGALRM, and the os module has its fingers
on SIGABRT.

If someone is looking for a project for 2.6 that digs into all sorts of
platform-specific nastiness, they could add actual signal sending to the
signal module (at least for unix systems).
 - Josiah

From exarkun at divmod.com Sat Jul 29 23:44:28 2006
From: exarkun at divmod.com (Jean-Paul Calderone)
Date: Sat, 29 Jul 2006 17:44:28 -0400
Subject: [Python-Dev] Testing Socket Timeouts patch 1519025
In-Reply-To: <20060729143048.E630.JCARLSON@uci.edu>
Message-ID: <20060729214428.1717.1630815704.divmod.quotient.6244@ohm>

On Sat, 29 Jul 2006 14:38:38 -0700, Josiah Carlson <jcarlson at uci.edu> wrote:
>
>If someone is looking for a project for 2.6 that digs into all sorts of
>platform-specific nastiness, they could add actual signal sending to the
>signal module (at least for unix systems).
>

Maybe I am missing something obvious, but what is necessary beyond
os.kill()?

What /would/ be useful is a complete sigaction wrapper, but that's a
completely separate topic.

Jean-Paul

From tonynelson at georgeanelson.com Sun Jul 30 00:18:41 2006
From: tonynelson at georgeanelson.com (Tony Nelson)
Date: Sat, 29 Jul 2006 18:18:41 -0400
Subject: [Python-Dev] Testing Socket Timeouts patch 1519025
In-Reply-To: <20060729143048.E630.JCARLSON@uci.edu>
References: <v04020a01c0f129ae3d64@[192.168.123.162]>
 <v04020a01c0f129ae3d64@[192.168.123.162]>
Message-ID: <v04020a01c0f18c283570@[192.168.123.162]>

At 2:38 PM -0700 7/29/06, Josiah Carlson wrote:
>Tony Nelson <tonynelson at georgeanelson.com> wrote:
>>
>> I'm trying to write a test for my Socket Timeouts patch [1], which fixes
>> signal handling (notably Ctl-C == SIGINT == KeyboardInterrupt) on socket
>> operations using a timeout. I don't see a portable way to send a signal,
>> and asking the test runner to press Ctl-C is a non-starter. A "real"
>> signal is needed to interrupt the select() (or equivalent) call, because
>> that's what wasn't being handled correctly. The bug should happen on the
>> other platforms I don't know how to test on.
>>
>> Is there a portable way to send a signal? SIGINT would be best, but
>> another signal (such as SIGALRM) would do, I think.
>
>According to my (limited) research on signals, Windows signal support is
>horrible. I have not been able to have Python send signals of any kind
>other than SIGABRT, and then only to the currently running process,
>which kills it (regardless of whether you have a signal handler or not).

Hmm, OK, darn, thanks. MSWindows does allow users to press Ctl-C to send a
KeyboardInterrupt, so it's just too bad if I can't find a way to test it
from a script.

>> If not, should I write the test to only work on systems implementing
>> SIGALRM, the signal I'm using now, or implementing kill(), or what?
>
>I think that most non-Windows platforms should have non-braindead signal
>support, though the signal module seems to be severely lacking in
>sending any signal except for SIGALRM, and the os module has its fingers
>on SIGABRT.

The test now checks "hasattr(signal, 'alarm')" before proceeding, so at
least it won't die horribly.

>If someone is looking for a project for 2.6 that digs into all sorts of
>platform-specific nastiness, they could add actual signal sending to the
>signal module (at least for unix systems).

Isn't signal sending the province of kill(2) (or os.kill() in Python)?
Not that I know much about it.

BTW, I picked SIGALRM because I could do it all with one thread. Reading
POSIX, ISTM that if I sent the signal from another thread, it would bounce
off that thread to the main thread during the call to kill(), at which
point I got the willies. OTOH, if kill() is more widely available than
alarm(), I'll give it a try, but going by the docs, I'd say it isn't.
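
For readers following along, the single-threaded SIGALRM approach can be
sketched like this. It is an illustration only, not the test from the patch:
the function and exception names are invented, it must run in the main
thread, and it uses a socketpair so that no network access is needed.

```python
import signal
import socket


class AlarmInterrupt(Exception):
    """Raised by the SIGALRM handler (hypothetical name for this sketch)."""


def _on_alarm(signum, frame):
    raise AlarmInterrupt()


def recv_interrupted_by_alarm(timeout=5.0, alarm_after=1):
    """Return True if a pending alarm interrupts a blocking recv().

    Returns None when the platform has no alarm() (the test is skipped),
    and False if the socket timeout fired before the signal was handled.
    """
    if not hasattr(signal, "alarm"):
        return None
    reader, writer = socket.socketpair()
    old_handler = signal.signal(signal.SIGALRM, _on_alarm)
    try:
        reader.settimeout(timeout)
        signal.alarm(alarm_after)      # deliver SIGALRM to this process
        try:
            reader.recv(1)             # nothing is ever sent, so this blocks
        except AlarmInterrupt:
            return True
        except socket.timeout:
            return False
        return False
    finally:
        signal.alarm(0)                # cancel any alarm still pending
        signal.signal(signal.SIGALRM, old_handler)
        reader.close()
        writer.close()
```

The alarm is scheduled for well before the socket timeout, so a correct
implementation should surface the handler's exception rather than wait for
the timeout to expire.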
____________________________________________________________________
TonyN.:'                       <mailto:tonynelson at georgeanelson.com>
      '                              <http://www.georgeanelson.com/>

From martin at v.loewis.de Sun Jul 30 09:42:14 2006
From: martin at v.loewis.de (=?ISO-8859-1?Q?=22Martin_v=2E_L=F6wis=22?=)
Date: Sun, 30 Jul 2006 09:42:14 +0200
Subject: [Python-Dev] Testing Socket Timeouts patch 1519025
In-Reply-To: <v04020a01c0f18c283570@[192.168.123.162]>
References: <v04020a01c0f129ae3d64@[192.168.123.162]>
 <v04020a01c0f129ae3d64@[192.168.123.162]>
 <v04020a01c0f18c283570@[192.168.123.162]>
Message-ID: <44CC62D6.9070703@v.loewis.de>

Tony Nelson schrieb:
> Hmm, OK, darn, thanks. MSWindows does allow users to press Ctl-C to send a
> KeyboardInterrupt, so it's just too bad if I can't find a way to test it
> from a script.

You can use GenerateConsoleCtrlEvent to send Ctrl-C to all processes
that share the console of the calling process.

> BTW, I picked SIGALRM because I could do it all with one thread. Reading
> POSIX, ISTM that if I sent the signal from another thread, it would bounce
> off that thread to the main thread during the call to kill(), at which
> point I got the willies. OTOH, if kill() is more widely available than
> alarm(), I'll give it a try, but going by the docs, I'd say it isn't.

Indeed, alarm should be available on any POSIX system.

Regards,
Martin

From phd at mail2.phd.pp.ru Sun Jul 30 11:29:57 2006
From: phd at mail2.phd.pp.ru (Oleg Broytmann)
Date: Sun, 30 Jul 2006 13:29:57 +0400
Subject: [Python-Dev] first draft of bug guidelines for www.python.org/dev/
In-Reply-To: <bbaeab100607202052t68ac220cs70b32a7c6a3190a2@mail.gmail.com>
References: <bbaeab100607202052t68ac220cs70b32a7c6a3190a2@mail.gmail.com>
Message-ID: <20060730092957.GB29899@phd.pp.ru>

On Thu, Jul 20, 2006 at 08:52:34PM -0700, Brett Cannon wrote:
> * Summary
> A one-line describing the problem so as to make it easy for
> developers to spot whether they have the expertise needed to work on
> the bug.
Summary is also displayed as a title on index and search pages, so it is
important to make the summary right - short and descriptive.

Oleg.
--
Oleg Broytmann http://phd.pp.ru/ phd at phd.pp.ru
Programmers don't die, they just GOSUB without RETURN.

From martin at v.loewis.de Sun Jul 30 15:14:13 2006
From: martin at v.loewis.de (=?ISO-8859-1?Q?=22Martin_v=2E_L=F6wis=22?=)
Date: Sun, 30 Jul 2006 15:14:13 +0200
Subject: [Python-Dev] Which version of distutils to ship with Python 2.5?
In-Reply-To: <200607271701.58223.anthony@interlink.com.au>
References: <43aa6ff70607261924w65e01325o854f87d4c109026a@mail.gmail.com>
 <44C85FF2.3030701@v.loewis.de>
 <200607271701.58223.anthony@interlink.com.au>
Message-ID: <44CCB0A5.30706@v.loewis.de>

Anthony Baxter schrieb:
>> In any case, I bumped the version number to 2.5, according to the
>> policy discussed in
>>
>
> Could this not simply use the Python version number directly, instead?

See the prior discussion at

http://mail.python.org/pipermail/distutils-sig/2005-January/004366.html

Some people still believe (at least, believed in January 2005), that
distutils is developed independently of Python, and thus deserves
its own version number.

Of course, Andrew Kuchling officially declared in r1982 of pep 291
that there won't be any further stand-alone distutils releases,
and therefore, backwards compatibility with 2.1 is not necessary
anymore.

So I changed distutils.__version__ again, to be derived from
sys.version_info.

I left the numerous comments still in distutils that compatibility
with 2.1 is desired. We should remove these after 2.5 is released
(or perhaps even before that).
Regards,
Martin

From jcarlson at uci.edu Sun Jul 30 18:50:36 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Sun, 30 Jul 2006 09:50:36 -0700
Subject: [Python-Dev] Testing Socket Timeouts patch 1519025
In-Reply-To: <20060729214428.1717.1630815704.divmod.quotient.6244@ohm>
References: <20060729143048.E630.JCARLSON@uci.edu>
 <20060729214428.1717.1630815704.divmod.quotient.6244@ohm>
Message-ID: <20060730093610.E634.JCARLSON@uci.edu>

Jean-Paul Calderone <exarkun at divmod.com> wrote:
>
> On Sat, 29 Jul 2006 14:38:38 -0700, Josiah Carlson <jcarlson at uci.edu> wrote:
> >
> >If someone is looking for a project for 2.6 that digs into all sorts of
> >platform-specific nastiness, they could add actual signal sending to the
> >signal module (at least for unix systems).
>
> Maybe I am missing something obvious, but what is necessary beyond
> os.kill()?

I note that os.kill() does everything necessary for posix systems.
I didn't notice that it took an argument for the kind of signal.

A new project for someone: combine all of the methods available to
Windows into a single function.

> What /would/ be useful is a complete sigaction wrapper, but that's a
> completely separate topic.

Like atexit, only a stack per signal?

 - Josiah

From tonynelson at georgeanelson.com Sun Jul 30 19:10:32 2006
From: tonynelson at georgeanelson.com (Tony Nelson)
Date: Sun, 30 Jul 2006 13:10:32 -0400
Subject: [Python-Dev] Testing Socket Timeouts patch 1519025
In-Reply-To: <44CC62D6.9070703@v.loewis.de>
References: <v04020a01c0f18c283570@[192.168.123.162]>
 <v04020a01c0f129ae3d64@[192.168.123.162]>
 <v04020a01c0f129ae3d64@[192.168.123.162]>
 <v04020a01c0f18c283570@[192.168.123.162]>
Message-ID: <v04020a01c0f286dddce0@[192.168.123.162]>

At 9:42 AM +0200 7/30/06, Martin v. Löwis wrote:
>Tony Nelson schrieb:
>> Hmm, OK, darn, thanks. MSWindows does allow users to press Ctl-C to send a
>> KeyboardInterrupt, so it's just too bad if I can't find a way to test it
>> from a script.
>
>You can use GenerateConsoleCtrlEvent to send Ctrl-C to all processes
>that share the console of the calling process.

That looks like it would work, but it seems prone to overkill. To avoid
killing all the processes running from a console, the test would need to be
run in a subprocess in a new process group. If the test simply sends the
event to its own process, all the other processes in its process group would
receive the event as well, and probably die. I would expect that all the
processes sharing the console would die, but even if they didn't when I
tried it, I couldn't be sure that it wouldn't happen elsewhere, say when run
from a .bat file.

Martin, your advice is usually spot-on, but I don't always understand it.
Maybe using it here is just complicated. I expect that
GenerateConsoleCtrlEvent() can be called through the ctypes module, though
that would make backporting the test to 2.4 a bit more difficult. It looks
like the subprocess module can be passed the needed creation flag to make a
new process group. The subprocess can send the event to itself, and could
return the test result in its result code, so that part isn't so bad. To
avoid adding a new file to the distribution, test_socket.test_main() could
be modified to look for a command line argument requesting the particular
test action.

>> BTW, I picked SIGALRM because I could do it all with one thread. Reading
>> POSIX, ISTM that if I sent the signal from another thread, it would bounce
>> off that thread to the main thread during the call to kill(), at which
>> point I got the willies. OTOH, if kill() is more widely available than
>> alarm(), I'll give it a try, but going by the docs, I'd say it isn't.
>
>Indeed, alarm should be available on any POSIX system.

Well, if alarm() is available, then the test will work. If not, it will be
silently skipped, as are some other tests already in test_socket.py. I
can't offhand tell if MSWindows supports alarm(), but RiscOS and OS2 do not.
____________________________________________________________________
TonyN.:'                       <mailto:tonynelson at georgeanelson.com>
      '                              <http://www.georgeanelson.com/>

From exarkun at divmod.com Sun Jul 30 19:20:54 2006
From: exarkun at divmod.com (Jean-Paul Calderone)
Date: Sun, 30 Jul 2006 13:20:54 -0400
Subject: [Python-Dev] Testing Socket Timeouts patch 1519025
In-Reply-To: <20060730093610.E634.JCARLSON@uci.edu>
Message-ID: <20060730172054.1717.102253317.divmod.quotient.7639@ohm>

On Sun, 30 Jul 2006 09:50:36 -0700, Josiah Carlson <jcarlson at uci.edu> wrote:
>
>Jean-Paul Calderone <exarkun at divmod.com> wrote:
>>
>> On Sat, 29 Jul 2006 14:38:38 -0700, Josiah Carlson <jcarlson at uci.edu> wrote:
>> >
>> >If someone is looking for a project for 2.6 that digs into all sorts of
>> >platform-specific nastiness, they could add actual signal sending to the
>> >signal module (at least for unix systems).
>>
>> Maybe I am missing something obvious, but what is necessary beyond
>> os.kill()?
>
>I note that os.kill() does everything necessary for posix systems.
>I didn't notice that it took an argument for the kind of signal.
>
>A new project for someone: combine all of the methods available to
>Windows into a single function.
>
>> What /would/ be useful is a complete sigaction wrapper, but that's a
>> completely separate topic.
>
>Like atexit, only a stack per signal?

I just mean a complete wrapping of sigaction(2). In particular, I need
this functionality to properly install a SIGCHLD handler which does not
interfere with various I/O functions by installing the handler with the
SA_RESTART flag, which is not currently possible using the signal.signal
function.
Jean-Paul

From barry at python.org Sun Jul 30 21:44:08 2006
From: barry at python.org (Barry Warsaw)
Date: Sun, 30 Jul 2006 15:44:08 -0400
Subject: [Python-Dev] Py2.5 release schedule
In-Reply-To: <BBBDED8E-ACDD-4176-985A-144188B52F1E@python.org>
References: <44C9A184.7030701@ewtllc.com>
 <200607281539.50794.anthony@interlink.com.au>
 <BBBDED8E-ACDD-4176-985A-144188B52F1E@python.org>
Message-ID: <3E70BA16-8DAA-487D-9901-B1597D912222@python.org>

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Re: Python 2.5 compatibility

On Jul 28, 2006, at 8:57 AM, Barry Warsaw wrote:

> +1. It would give me more time to port and test a few of my
> applications to the new version.
>
> I'm still working on Mailman but the most painful thing so far has
> been the conversion of exceptions to new-style classes, and even
> that wasn't /too/ painful.

I believe I've finished porting Mailman to Python 2.5. None of the
issues were insurmountable, but here they are FTR:

1) Exceptions are new-style classes but Mailman was doing one
specific test against the type of an object to see if it needed to be
instantiated. This test was written as:

    if isinstance(obj, ClassType)

which fails in Python 2.5. I actually rewrote it like so:

    if isinstance(obj, ClassType) or isinstance(obj, type(type))

in MM2.1 because it has to be backward compatible to Python 2.1.

2) There was one place where I was commonly raising a string and that
generates deprecation warnings, so I changed that to a class. There
are a few other legacy string exceptions defined in the code, but
they are rarely used and should all get rewritten as well.

3) Cookie.py's repr changed so that trailing semicolons are no longer
output on the end of text lines. I understand this change was made
so that Python cookies were more RFC compliant, but it broke some
homegrown cookie text splitting we were doing. I changed this code
to split on lines first.

All in all, not too bad although the Cookie.py change took a while to
track down!
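
For code that does not need to run on Python 2.1 and only deals with
new-style exception classes, the check in item 1 can be reduced to a plain
isinstance(obj, type). A small sketch under those assumptions
(ensure_instance is a made-up helper name, not Mailman code):

```python
def ensure_instance(exc):
    """Return an exception instance, instantiating exc if it is a class."""
    if isinstance(exc, type):
        # exc is a (new-style) class, so create an instance of it.
        return exc()
    # exc is already an instance; hand it back unchanged.
    return exc
```

Classic classes would still need the separate ClassType test shown above.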
- -Barry

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.3 (Darwin)

iQCVAwUBRM0MDnEjvBPtnXfVAQJ9ZAP/VdtM79SXgx7s/X0aEIu4HDZva7TkYyi6
dRzlgAtEV5BN1yYn+vzw8PBCtdy+9N3yYtv/zqdQP54mZDjsaGaNw6MiS0jsETRy
248hj3otL/00WTrKWh8/OvDlLW8KUNQI4MWBOMKJ/TqYW5Es4fJGEMtbO/xqGXXD
/wgWmmLOOAE=
=Mu8m
-----END PGP SIGNATURE-----

From fdrake at acm.org Sun Jul 30 21:55:40 2006
From: fdrake at acm.org (Fred L. Drake, Jr.)
Date: Sun, 30 Jul 2006 15:55:40 -0400
Subject: [Python-Dev] Py2.5 release schedule
In-Reply-To: <3E70BA16-8DAA-487D-9901-B1597D912222@python.org>
References: <44C9A184.7030701@ewtllc.com>
 <BBBDED8E-ACDD-4176-985A-144188B52F1E@python.org>
 <3E70BA16-8DAA-487D-9901-B1597D912222@python.org>
Message-ID: <200607301555.40541.fdrake@acm.org>

On Sunday 30 July 2006 15:44, Barry Warsaw wrote:
 > if isinstance(obj, ClassType) or isinstance(obj, type(type))

Looks like you've got a possible name clash in the second isinstance. ;-)

  -Fred

--
Fred L. Drake, Jr. <fdrake at acm.org>

From g.brandl at gmx.net Sun Jul 30 22:17:41 2006
From: g.brandl at gmx.net (Georg Brandl)
Date: Sun, 30 Jul 2006 22:17:41 +0200
Subject: [Python-Dev] Py2.5 release schedule
In-Reply-To: <3E70BA16-8DAA-487D-9901-B1597D912222@python.org>
References: <44C9A184.7030701@ewtllc.com>
 <200607281539.50794.anthony@interlink.com.au>
 <BBBDED8E-ACDD-4176-985A-144188B52F1E@python.org>
 <3E70BA16-8DAA-487D-9901-B1597D912222@python.org>
Message-ID: <eaj42s$6n0$1@sea.gmane.org>

Barry Warsaw wrote:
> if isinstance(obj, ClassType)
>
> which fails in Python 2.5. I actually rewrote it like so:
>
> if isinstance(obj, ClassType) or isinstance(obj, type(type))

The second "type" seems to be superfluous. ;)

Georg

From fdrake at acm.org Sun Jul 30 22:27:16 2006
From: fdrake at acm.org (Fred L. Drake, Jr.)
Date: Sun, 30 Jul 2006 16:27:16 -0400
Subject: [Python-Dev] Py2.5 release schedule
In-Reply-To: <eaj42s$6n0$1@sea.gmane.org>
References: <44C9A184.7030701@ewtllc.com>
 <3E70BA16-8DAA-487D-9901-B1597D912222@python.org>
 <eaj42s$6n0$1@sea.gmane.org>
Message-ID: <200607301627.17309.fdrake@acm.org>

On Sunday 30 July 2006 16:17, Georg Brandl wrote:
 > The second "type" seems to be superfluous. ;)

I was thinking it suggested there was a local named "type". But if not, yeah.

I get the impression Barry's pretty new to this "Python thing." Wonder what
he's been up to. ;-)

  -Fred

--
Fred L. Drake, Jr. <fdrake at acm.org>

From barry at python.org Sun Jul 30 23:07:22 2006
From: barry at python.org (Barry Warsaw)
Date: Sun, 30 Jul 2006 17:07:22 -0400
Subject: [Python-Dev] Py2.5 release schedule
In-Reply-To: <200607301627.17309.fdrake@acm.org>
References: <44C9A184.7030701@ewtllc.com>
 <3E70BA16-8DAA-487D-9901-B1597D912222@python.org>
 <eaj42s$6n0$1@sea.gmane.org>
 <200607301627.17309.fdrake@acm.org>
Message-ID: <ADB1B68C-7C90-4466-BE43-921FDFA7ECEE@python.org>

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

On Jul 30, 2006, at 4:27 PM, Fred L. Drake, Jr. wrote:

> On Sunday 30 July 2006 16:17, Georg Brandl wrote:
>> The second "type" seems to be superfluous. ;)
>
> I was thinking it suggested there was a local named "type". But if
> not, yeah.
>
> I get the impression Barry's pretty new to this "Python thing."
> Wonder what
> he's been up to. ;-)

As I mentioned, this has to be compatible with Python 2.1:

Python 2.1.3+ (#1, Apr 25 2005, 22:52:02)
[GCC 3.3.5-20050130 (Gentoo Linux 3.3.5.20050130-r1, ssp-3.3.5.20050130-1, pie- on linux2
Type "copyright", "credits" or "license" for more information.
>>> isinstance(Exception, type)
Traceback (most recent call last):
  File "<stdin>", line 1, in ?
TypeError: isinstance() arg 2 must be a class or type
>>> isinstance(Exception, type(type))
0

Python 2.5b2 (trunk:50835, Jul 25 2006, 23:27:51)
[GCC 3.4.6 (Gentoo 3.4.6-r1, ssp-3.4.5-1.0, pie-8.7.9)] on linux2
Type "help", "copyright", "credits" or "license" for more information.
>>> isinstance(Exception, type)
True
>>> isinstance(Exception, type(type))
True

I thought that was rather clever actually. :)

- -Barry

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.3 (Darwin)

iQCVAwUBRM0fj3EjvBPtnXfVAQI1AQP+ID6BSbJ/4TL7cizvMjrxHD6JVjFKBzD6
7FCwXpvELQt7vlDWGrXWi+Lai/93nGqD42VYRSgtHqFP2gMYKEkM+TaLl91YFuSh
B6jO7l5wW7SGlyLQQiibmPBp8uGDG30F1ylM9e9y9c69Hy1LJEa1sG8/AS/FiA+n
fpQ/WlTNRvA=
=uGDZ
-----END PGP SIGNATURE-----

From martin at v.loewis.de Sun Jul 30 23:34:30 2006
From: martin at v.loewis.de (=?ISO-8859-1?Q?=22Martin_v=2E_L=F6wis=22?=)
Date: Sun, 30 Jul 2006 23:34:30 +0200
Subject: [Python-Dev] Py2.5 release schedule
In-Reply-To: <200607301555.40541.fdrake@acm.org>
References: <44C9A184.7030701@ewtllc.com>
 <BBBDED8E-ACDD-4176-985A-144188B52F1E@python.org>
 <3E70BA16-8DAA-487D-9901-B1597D912222@python.org>
 <200607301555.40541.fdrake@acm.org>
Message-ID: <44CD25E6.5040608@v.loewis.de>

Fred L. Drake, Jr. schrieb:
> On Sunday 30 July 2006 15:44, Barry Warsaw wrote:
> > if isinstance(obj, ClassType) or isinstance(obj, type(type))
>
> Looks like you've got a possible name clash in the second isinstance. ;-)

Nah, that's rather an entry to the obfuscated Python contest.

The two occurrences of type really mean to refer to the same thing;
this is the test whether obj _is a_ new-style class. Normally, you
would write isinstance(obj, type), but that gives a TypeError in
2.1 (isinstance() arg 2 must be a class or type). In 2.1, type(type)
is FunctionType, so the test should fail (in the context, as obj
ought to be a string, an exception object, or an exception type).
In 2.2 and later, we have

>>> type(type) is type
1 # sometimes True instead

I think I would have rewritten as

try:
    # Instantiate it if possible and necessary
    exc = exc()
except AttributeError: # no __call__; it's already an object
    pass

(assuming that the mailman exceptions don't have __call__) or as

if not isinstance(exc, Exception):
    exc = exc()

(assuming that string exceptions are gone, but the code below
already assumes that exc should be an object that supports
exc.reason_notice())

Regards,
Martin

From martin at v.loewis.de Sun Jul 30 23:42:54 2006
From: martin at v.loewis.de ("Martin v. Löwis")
Date: Sun, 30 Jul 2006 23:42:54 +0200
Subject: [Python-Dev] Testing Socket Timeouts patch 1519025
In-Reply-To: <v04020a01c0f286dddce0@[192.168.123.162]>
References: <v04020a01c0f18c283570@[192.168.123.162]>
 <v04020a01c0f129ae3d64@[192.168.123.162]>
 <v04020a01c0f129ae3d64@[192.168.123.162]>
 <v04020a01c0f18c283570@[192.168.123.162]>
 <v04020a01c0f286dddce0@[192.168.123.162]>
Message-ID: <44CD27DE.5060804@v.loewis.de>

Tony Nelson wrote:
>> You can use GenerateConsoleCtrlEvent to send Ctrl-C to all processes
>> that share the console of the calling process.
[...]
> Martin, your advice is usually spot-on, but I don't always understand it.
> Maybe using it here is just complicated.

This was really just in response to your remark that you couldn't
find a way to send Ctrl-C programmatically. I researched (in
the C library sources) how SIGINT was *generated* (through
SetConsoleCtrlHandler), and that led me to a way to generate one.

I didn't mean to suggest that you *should* use GenerateConsoleCtrlEvent,
only that you could if you wanted to.

> I expect that
> GenerateConsoleCtrlEvent() can be called through the ctypes module, though
> that would make backporting the test to 2.4 a bit more difficult.

Well, if there was general utility to that API, I would prefer exposing
it in the nt module.
It doesn't quite fit into kill(2), as it doesn't allow to specify a pid of the target process, so perhaps it doesn't have general utility. In any case, that would have to wait for 2.6. Regards, Martin From tonynelson at georgeanelson.com Mon Jul 31 01:23:23 2006 From: tonynelson at georgeanelson.com (Tony Nelson) Date: Sun, 30 Jul 2006 19:23:23 -0400 Subject: [Python-Dev] Testing Socket Timeouts patch 1519025 In-Reply-To: <44CD27DE.5060804@v.loewis.de> References: <v04020a01c0f286dddce0@[192.168.123.162]> <v04020a01c0f18c283570@[192.168.123.162]> <v04020a01c0f129ae3d64@[192.168.123.162]> <v04020a01c0f129ae3d64@[192.168.123.162]> <v04020a01c0f18c283570@[192.168.123.162]> <v04020a01c0f286dddce0@[192.168.123.162]> Message-ID: <v04020a04c0f2da888957@[192.168.123.162]> At 11:42 PM +0200 7/30/06, Martin v. L?wis wrote: >Tony Nelson schrieb: >>> You can use GenerateConsoleCtrlEvent to send Ctrl-C to all processes >>> that share the console of the calling process. >[...] >> Martin, your advice is usually spot-on, but I don't always understand it. >> Maybe using it here is just complicated. > >This was really just in response to your remark that you couldn't >find a way to send Ctrl-C programmatically. I researched (in >the C library sources) how SIGINT was *generated* (through >SetConsoleCtrlHandler), and that let me to a way to generate [one.] Well, fine work there! >I didn't mean to suggest that you *should* use GenerateConsoleCtrlEvent, >only that you could if you wanted to. Hmm. Well, it would make the test possible on MSWindows as well as on OS's implementing alarm(2). If I figure out how to build Python on MSWindows, I might give it a try. I tried to get MSVC 7.1 via the .Net SDK, but it installed VS 8 instead, so I'm not quite sure how to proceed. >> I expect that >> GenerateConsoleCtrlEvent() can be called through the ctypes module, though >> that would make backporting the test to 2.4 a bit more difficult. 
> >Well, if there was general utility to that API, I would prefer exposing >it in the nt module. It doesn't quite fit into kill(2), as it doesn't >allow to specify a pid of the target process, so perhaps it doesn't >have general utility. In any case, that would have to wait for 2.6. A Process Group ID is the PID of the first process put in it, so it's sort of a PID. It just means a collection of processes, probably more than one. It seems to be mostly applicable to MSWindows, and isn't a suitable way to implement a form of kill(2). I hope that the Socket Timeouts patch 1519025 can make it into 2.5, or 2.5.1, as it is a bug fix. As such, it would probably be better to punt the test on MSWindows than to do a tricky fancy test that might have its own issues. ____________________________________________________________________ TonyN.:' <mailto:tonynelson at georgeanelson.com> ' <http://www.georgeanelson.com/> From tonynelson at georgeanelson.com Mon Jul 31 04:04:59 2006 From: tonynelson at georgeanelson.com (Tony Nelson) Date: Sun, 30 Jul 2006 22:04:59 -0400 Subject: [Python-Dev] Testing Socket Timeouts patch 1519025 In-Reply-To: <v04020a04c0f2da888957@[192.168.123.162]> References: <44CD27DE.5060804@v.loewis.de> <v04020a01c0f286dddce0@[192.168.123.162]> <v04020a01c0f18c283570@[192.168.123.162]> <v04020a01c0f129ae3d64@[192.168.123.162]> <v04020a01c0f129ae3d64@[192.168.123.162]> <v04020a01c0f18c283570@[192.168.123.162]> <v04020a01c0f286dddce0@[192.168.123.162]> Message-ID: <v04020a06c0f315064900@[192.168.123.162]> At 7:23 PM -0400 7/30/06, Tony Nelson wrote: ... >...I tried to get MSVC 7.1 via the .Net SDK, but it >installed VS 8 instead, so I'm not quite sure how to proceed. ... David Murmann suggested off-list that I'd probably installed the 2.0 .Net SDK, and that I should install the 1.1 .Net SDK, which is the correct one. Now I can try to build Python on MSWindows. 
____________________________________________________________________ TonyN.:' <mailto:tonynelson at georgeanelson.com> ' <http://www.georgeanelson.com/> From martin at v.loewis.de Mon Jul 31 04:34:03 2006 From: martin at v.loewis.de (=?ISO-8859-1?Q?=22Martin_v=2E_L=F6wis=22?=) Date: Mon, 31 Jul 2006 04:34:03 +0200 Subject: [Python-Dev] Testing Socket Timeouts patch 1519025 In-Reply-To: <v04020a04c0f2da888957@[192.168.123.162]> References: <v04020a01c0f286dddce0@[192.168.123.162]> <v04020a01c0f18c283570@[192.168.123.162]> <v04020a01c0f129ae3d64@[192.168.123.162]> <v04020a01c0f129ae3d64@[192.168.123.162]> <v04020a01c0f18c283570@[192.168.123.162]> <v04020a01c0f286dddce0@[192.168.123.162]> <v04020a04c0f2da888957@[192.168.123.162]> Message-ID: <44CD6C1B.6040401@v.loewis.de> Tony Nelson schrieb: > Hmm. Well, it would make the test possible on MSWindows as well as on OS's > implementing alarm(2). If I figure out how to build Python on MSWindows, I > might give it a try. I tried to get MSVC 7.1 via the .Net SDK, but it > installed VS 8 instead, so I'm not quite sure how to proceed. The .NET SDK (any version) is not suitable to build Python. You really need VS 2003; if you don't have it anymore, you might be able to find a copy of the free version of the VC Toolkit 2003 (VCToolkitSetup.exe) somewhere. Of course, just for testing, you can also install VS Express 2005, and use the PCbuild8 projects directory; these changes should work the same under both versions. 
Regards,
Martin

From tonynelson at georgeanelson.com Mon Jul 31 06:39:07 2006
From: tonynelson at georgeanelson.com (Tony Nelson)
Date: Mon, 31 Jul 2006 00:39:07 -0400
Subject: [Python-Dev] Testing Socket Timeouts patch 1519025
In-Reply-To: <44CD6C1B.6040401@v.loewis.de>
References: <v04020a04c0f2da888957@[192.168.123.162]>
 <v04020a01c0f286dddce0@[192.168.123.162]>
 <v04020a01c0f18c283570@[192.168.123.162]>
 <v04020a01c0f129ae3d64@[192.168.123.162]>
 <v04020a01c0f129ae3d64@[192.168.123.162]>
 <v04020a01c0f18c283570@[192.168.123.162]>
 <v04020a01c0f286dddce0@[192.168.123.162]>
 <v04020a04c0f2da888957@[192.168.123.162]>
Message-ID: <v04020a07c0f320d50fb8@[192.168.123.162]>

At 4:34 AM +0200 7/31/06, Martin v. Löwis wrote:
>Tony Nelson wrote:
>>Hmm. Well, it would make the test possible on MSWindows as well as on
>>OS's implementing alarm(2). If I figure out how to build Python on
>>MSWindows, I might give it a try. I tried to get MSVC 7.1 via the .Net
>>SDK, but it installed VS 8 instead, so I'm not quite sure how to proceed.
>
>The .NET SDK (any version) is not suitable to build Python.

I do see the warning in the instructions about it not being an optimizing
compiler. I've managed to build python.exe and the rt.bat tests mostly
work -- 2 tests fail, test_popen, and test_cmd_line because of popen()
failing.

Hmm, actually, this might be a real problem with the MSWindows version of
posix_popen() in Modules/posixmodule.c. The path to my built python.exe is:

"E:\Documents and Settings\Tony Nelson\My Documents\Python\pydev\trunk\PCBuild\python.exe"

(lots of spaces in it). It seems properly quoted in the test and when I do
it by hand, but in a call to popen() it doesn't work:

popen('"E:\Documents and Settings\Tony Nelson\My Documents\Python\pydev\trunk\PCBuild\python.exe" -c "import sys;sys.version_info"')

The returned file object repr resembles one that does work.
If I just use "python.exe" from within the PCBuild directory: popen('python.exe -c "import sys;sys.version_info"') I get the right version, and that's the only 2.5b2 python I've got, so the built python must be working, but the path, even quoted, isn't accepted by MSWindows XP SP2. Should I report a bug? It may well just be MSWindows weirdness, and not something that posixmodule.c can do anything about. OTOH, it does work from the command line. I'll bet I wouldn't have seen a thing if I'd checked out to "E:\pydev" instead. >You really need VS 2003; if you don't have it anymore, you might be able >to find a copy of the free version of the VC Toolkit 2003 >(VCToolkitSetup.exe) somewhere. I really never had VS 2003. It doesn't appear to be on microsoft.com anymore. I'm reluctant to try to steal a copy. >Of course, just for testing, you can also install VS Express 2005, and >use the PCbuild8 projects directory; these changes should work the >same under both versions. I'll try that if I have any real trouble with the non-optimized python or if you insist that it's necessary. ____________________________________________________________________ TonyN.:' <mailto:tonynelson at georgeanelson.com> ' <http://www.georgeanelson.com/> From greg at electricrain.com Mon Jul 31 06:48:55 2006 From: greg at electricrain.com (Gregory P. Smith) Date: Sun, 30 Jul 2006 21:48:55 -0700 Subject: [Python-Dev] httplib and bad response chunking In-Reply-To: <20060726023213.GA3106@cthulhu.gerg.ca> References: <20060726023213.GA3106@cthulhu.gerg.ca> Message-ID: <20060731044855.GK22921@zot.electricrain.com> On Tue, Jul 25, 2006 at 10:32:13PM -0400, Greg Ward wrote: > > what I discovered in the wild the other day was a response like this: > > 0005\r\nabcd\n\r\n0004\r\nabc\n\r\n\r\n > > i.e. the chunk-size for the terminating empty chunk was missing. 
> This caused httplib.py to blow up with ValueError because it tried to
> call
>
>   int(line, 16)
>
> assuming that 'line' contained a hex number, when in fact it was the
> empty string. Oops.
>
> IMHO the minimal fix is to turn ValueError into HTTPException (or a
> subclass thereof); httplib should not raise ValueError just because some
> server sends a bad response. (The server in question was Apache 2.0.52
> running PHP 4.3.9 sending a big hairy error page because the database
> was down.)

IMNSHO httplib should be fixed and this shouldn't be an error at all, as
it's in the wild and will only show up more and more in the future. Plus
file a bug with the Apache or PHP project as appropriate for having a
non-RFC-compliant response.

This is part of the good old network programming adage of being lenient in
what you accept.

-g

From tonynelson at georgeanelson.com Mon Jul 31 06:58:23 2006
From: tonynelson at georgeanelson.com (Tony Nelson)
Date: Mon, 31 Jul 2006 00:58:23 -0400
Subject: [Python-Dev] Testing Socket Timeouts patch 1519025
In-Reply-To: <v04020a07c0f320d50fb8@[192.168.123.162]>
References: <44CD6C1B.6040401@v.loewis.de>
 <v04020a04c0f2da888957@[192.168.123.162]>
 <v04020a01c0f286dddce0@[192.168.123.162]>
 <v04020a01c0f18c283570@[192.168.123.162]>
 <v04020a01c0f129ae3d64@[192.168.123.162]>
 <v04020a01c0f129ae3d64@[192.168.123.162]>
 <v04020a01c0f18c283570@[192.168.123.162]>
 <v04020a01c0f286dddce0@[192.168.123.162]>
 <v04020a04c0f2da888957@[192.168.123.162]>
Message-ID: <v04020a08c0f33d81cce8@[192.168.123.162]>

At 12:39 AM -0400 7/31/06, Tony Nelson wrote:

> popen('"E:\Documents and Settings\Tony Nelson\My
>Documents\Python\pydev\trunk\PCBuild\python.exe" -c "import
>sys;sys.version_info"')

Ehh, I must admit that I retyped that. Obviously what I typed would not
work, but what I used was:

    python = '"' + sys.executable + '"'
    popen(python + ' -c "import sys;sys.version_info"')

So there wasn't a problem with backslashes. I've also been using raw strings.
And, as I said, the file objects looked OK, with backslashes where they should be. Sorry for the mistyping. ____________________________________________________________________ TonyN.:' <mailto:tonynelson at georgeanelson.com> ' <http://www.georgeanelson.com/> From tonynelson at georgeanelson.com Mon Jul 31 07:14:36 2006 From: tonynelson at georgeanelson.com (Tony Nelson) Date: Mon, 31 Jul 2006 01:14:36 -0400 Subject: [Python-Dev] Testing Socket Timeouts patch 1519025 In-Reply-To: <v04020a08c0f33d81cce8@[192.168.123.162]> References: <v04020a07c0f320d50fb8@[192.168.123.162]> <44CD6C1B.6040401@v.loewis.de> <v04020a04c0f2da888957@[192.168.123.162]> <v04020a01c0f286dddce0@[192.168.123.162]> <v04020a01c0f18c283570@[192.168.123.162]> <v04020a01c0f129ae3d64@[192.168.123.162]> <v04020a01c0f129ae3d64@[192.168.123.162]> <v04020a01c0f18c283570@[192.168.123.162]> <v04020a01c0f286dddce0@[192.168.123.162]> <v04020a04c0f2da888957@[192.168.123.162]> Message-ID: <v04020a09c0f34051763e@[192.168.123.162]> At 12:58 AM -0400 7/31/06, Tony Nelson wrote: >At 12:39 AM -0400 7/31/06, Tony Nelson wrote: > >> popen('"E:\Documents and Settings\Tony Nelson\My >>Documents\Python\pydev\trunk\PCBuild\python.exe" -c "import >>sys;sys.version_info"') > >Ehh, I must admit that I retyped that. Obviously what I typed would not >work, but what I used was: > > python = '"' + sys.executable + '"' > popen(python + ' -c "import sys;sys.version_info"' > >So there wasn't a problem with backslashes. I've also been using raw >strings. And, as I said, the file objects looked OK, with backslashes >where they should be. Sorry for the mistyping. OK, I recognize the bug now. It's that quote parsing bug in MSWindows (which I can find again if you want) which can be worked around by using an extra quote at the front (and maybe also the back): popen('""E:\Documents ... Not really a bug in Python at all. 
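
A minimal sketch of one way to sidestep the quoting issue discussed above, assuming a modern Python with the subprocess module rather than the popen() call used in the thread: passing the command as an argument list means the interpreter path is never re-parsed by the command shell, so spaces in sys.executable need no extra quotes. The helper name run_python_snippet is hypothetical and used only for illustration.

```python
import subprocess
import sys


def run_python_snippet(code):
    """Run `code` in a child interpreter and return its standard output.

    The command is given as a list, so no shell is involved and the
    path in sys.executable may contain spaces without any quoting.
    """
    result = subprocess.run(
        [sys.executable, "-c", code],
        capture_output=True,  # collect stdout and stderr
        text=True,            # decode output to str
        check=True,           # raise CalledProcessError on failure
    )
    return result.stdout


if __name__ == "__main__":
    print(run_python_snippet("import sys; print(sys.version_info[:2])"))
```

This does not change the behaviour of popen() itself; it only avoids the shell's quote parsing by not going through the shell at all.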
____________________________________________________________________ TonyN.:' <mailto:tonynelson at georgeanelson.com> ' <http://www.georgeanelson.com/> From gward-1337f07a94b43060ff5c1ea922ed93d6 at python.net Mon Jul 31 04:09:02 2006 From: gward-1337f07a94b43060ff5c1ea922ed93d6 at python.net (Greg Ward) Date: Sun, 30 Jul 2006 22:09:02 -0400 Subject: [Python-Dev] httplib and bad response chunking In-Reply-To: <Pine.LNX.4.64.0607282251320.8644@localhost> References: <20060726023213.GA3106@cthulhu.gerg.ca> <Pine.LNX.4.64.0607282251320.8644@localhost> Message-ID: <20060731020902.GA2950@cthulhu.gerg.ca> [me, on 25 July] > I have > discovered other hypothetical cases of bad chunking that cause httplib > to go into an infinite loop or block forever on socket.readline(). > Should we worry about those cases as well, despite not having seen them > happen in the wild? More annoying, I can reproduce the "block forever" > case using a real socket, but not using the StringIO-based FakeSocket > class in test_httplib. [John J Lee] > They have been seen in the wild :-) > > http://python.org/sf/1411097 Thanks -- that was really all the encouragement I needed to keep banging away at this bug. Did you look at the crude attempt at testing for this bug that I hacked into test_httplib.py? I posted it to bug #1486335 here: http://sourceforge.net/tracker/download.php?group_id=5470&atid=105470&file_id=186245&aid=1486335 The idea is simple: put various chunked responses into strings and then feed those strings to HTTPConnection. The trouble is that StringIO does not behave the same as a real socket: where HTTPResponse fails one way reading from a real socket (eg. infinite loop), it fails differently (or not at all) reading from a StringIO. Makes testing with the FakeSocket class in test_httplib.py problematic. Maybe the right way to test httplib is to spawn a server process (thread?) 
to listen on some random port, feed various HTTP responses at HTTPConnection/HTTPResponse, and see what happens. I'm not sure how to do that portably, though. Well, I'll see if I can whip up a Unix-y solution and see if anyone knows how to make it portable. Greg -- Greg Ward <gward at python.net> http://www.gerg.ca/ Be careful: sometimes, you're only standing on the shoulders of idiots. From rhettinger at ewtllc.com Mon Jul 31 20:34:26 2006 From: rhettinger at ewtllc.com (Raymond Hettinger) Date: Mon, 31 Jul 2006 11:34:26 -0700 Subject: [Python-Dev] Rounding float to int directly (Re: struct module and coercing floats to integers) In-Reply-To: <44CAD9A1.8080000@canterbury.ac.nz> References: <9E51A030-B610-414B-98E0-1923F1C3E862@redivi.com> <dcbbbb410607282014w30459351ifb09c56075997be1@mail.gmail.com> <44CAD9A1.8080000@canterbury.ac.nz> Message-ID: <44CE4D32.6040909@ewtllc.com> Greg Ewing wrote: >As an aside, does anyone else think that it would be >useful to have a builtin which rounds and converts to >an int in one go? Whenever I use round(), I almost >always want the result as an int, and making me do >it in two steps seems unnecessarily bothersome. > > I think this would harm more than it would help. It more confusing to have several rounding-thingies to choose from than it is have an explicit two-step. BTW, I thought the traditional idiom (for positive numbers) was: int(x+.5) >Since automatic float->int coercion is being increasingly >disallowed, use cases for this are becoming more and more >common. > >-- >Greg >_______________________________________________ >Python-Dev mailing list >Python-Dev at python.org >http://mail.python.org/mailman/listinfo/python-dev >Unsubscribe: http://mail.python.org/mailman/options/python-dev/rhettinger%40ewtllc.com > > From chad at zetaweb.com Mon Jul 31 20:38:49 2006 From: chad at zetaweb.com (Chad Whitacre) Date: Mon, 31 Jul 2006 14:38:49 -0400 Subject: [Python-Dev] Patch submitted, now what? 
Message-ID: <44CE4E39.5010006@zetaweb.com> Dear All, Last week I submitted a patch (my first), and now I'm wondering what my expectations should be. Do I sit around and wait? How long? Do I notify this list? Do I notify a specific person, say, an author or reviewer of the original code I modified? Do I use SF's assignment mechanism? Who do I assign it to? These are the questions I have, unanswered (afaict) by the patch documentation I've found: http://www.python.org/dev/patches/ http://www.python.org/dev/patches/style/ http://www.python.org/patches/ http://www.python.org/patches/style.html http://www.python.org/dev/tools/#patch-tracking Any insight? chad P.S. For the interested, here is my patch: "Expose case-insensitivity of string.Template" http://www.python.org/sf/1528167 From brett at python.org Mon Jul 31 20:53:14 2006 From: brett at python.org (Brett Cannon) Date: Mon, 31 Jul 2006 11:53:14 -0700 Subject: [Python-Dev] Patch submitted, now what? In-Reply-To: <44CE4E39.5010006@zetaweb.com> References: <44CE4E39.5010006@zetaweb.com> Message-ID: <bbaeab100607311153q52753e27yf371e2ad19d61405@mail.gmail.com> On 7/31/06, Chad Whitacre <chad at zetaweb.com> wrote: > > Dear All, > > Last week I submitted a patch (my first), Thanks! and now I'm wondering what my > expectations should be. Do I sit around and wait? How long? Do I notify > this list? Do I notify a specific person, say, an author or reviewer of > the original code I modified? Do I use SF's assignment mechanism? Who do > I assign it to? Let it sit for now. We get email notifications when new patches come in. Since we are all volunteers it can take a little while before we get to it. And don't assign it to anyone. Let us handle that. 
> These are the questions I have, unanswered (afaict) by the patch
> documentation I've found:
>
> http://www.python.org/dev/patches/
> http://www.python.org/dev/patches/style/
> http://www.python.org/patches/
> http://www.python.org/patches/style.html
> http://www.python.org/dev/tools/#patch-tracking

Those docs need to be overhauled. I am planning to consolidate them into a
single patch guidelines doc in a month or so.

> Any insight?
>
> chad
>
> P.S. For the interested, here is my patch:
>
> "Expose case-insensitivity of string.Template"
> http://www.python.org/sf/1528167

From chad at zetaweb.com Mon Jul 31 21:43:20 2006
From: chad at zetaweb.com (Chad Whitacre)
Date: Mon, 31 Jul 2006 15:43:20 -0400
Subject: [Python-Dev] Patch submitted, now what?
In-Reply-To: <bbaeab100607311153q52753e27yf371e2ad19d61405@mail.gmail.com>
References: <44CE4E39.5010006@zetaweb.com>
 <bbaeab100607311153q52753e27yf371e2ad19d61405@mail.gmail.com>
Message-ID: <44CE5D58.6060407@zetaweb.com>

Brett,

Thanks for the helpful reply.

> Let it sit for now. We get email notifications when new patches come in.

Can I ask who "we" are? Is that the seven SF "Project Admins?" Is that
the 68 SF "Developers?"

And is every patch eventually responded to? Or do some simply fall by
the wayside?

> Those docs need to be overhauled. I am planning to consolidate into a
> single patch guidelines doc in a month or so.

Great! I'd be happy to review it when the time comes, if that'd be helpful.

Thanks again.
chad From brett at python.org Mon Jul 31 21:56:58 2006 From: brett at python.org (Brett Cannon) Date: Mon, 31 Jul 2006 12:56:58 -0700 Subject: [Python-Dev] Patch submitted, now what? In-Reply-To: <44CE5D58.6060407@zetaweb.com> References: <44CE4E39.5010006@zetaweb.com> <bbaeab100607311153q52753e27yf371e2ad19d61405@mail.gmail.com> <44CE5D58.6060407@zetaweb.com> Message-ID: <bbaeab100607311256u281af267wd5b4d79b7db7b203@mail.gmail.com> On 7/31/06, Chad Whitacre <chad at zetaweb.com> wrote: > > Brett, > > Thanks for the helpful reply. > > > > Let it sit for now. We get email notifications when new patches come > in. > > Can I ask who "we" are? Is that the seven SF "Project Admins?" Is that > the 68 SF "Developers?" "We" is most of the developers on python-dev. There is a Python patches mailing list that most developers subscribe to. And is every patch eventually responded to? Or do some simply fall by > the wayside? The latter unfortunatley. Since this all relies on people's volunteer efforts the patch usually has to matter to someone to lead to them putting the time and effort into getting it checked in. > Those docs needs to be overhauled. I am planning to consolidate into a > > single patch guidelines doc in a month or so. > > Great! I'd be happy to review it when the time comes, if that'd be > helpful. Yep, it would be. -Brett -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/python-dev/attachments/20060731/67f04a64/attachment.html From skip at pobox.com Mon Jul 31 22:06:47 2006 From: skip at pobox.com (skip at pobox.com) Date: Mon, 31 Jul 2006 15:06:47 -0500 Subject: [Python-Dev] Patch submitted, now what? 
In-Reply-To: <bbaeab100607311256u281af267wd5b4d79b7db7b203@mail.gmail.com> References: <44CE4E39.5010006@zetaweb.com> <bbaeab100607311153q52753e27yf371e2ad19d61405@mail.gmail.com> <44CE5D58.6060407@zetaweb.com> <bbaeab100607311256u281af267wd5b4d79b7db7b203@mail.gmail.com> Message-ID: <17614.25303.255526.960212@montanaro.dyndns.org> Chad> And is every patch eventually responded to? Or do some simply fall Chad> by the wayside? Brett> The latter unfortunatley. Since this all relies on people's Brett> volunteer efforts the patch usually has to matter to someone to Brett> lead to them putting the time and effort into getting it checked Brett> in. It's worth noting that a number of people will look at a patch after the submitter has reviewed five other patches or bug reports (does it look okay to you? does Python build with it applied? do all unit tests pass? if it needs documentation does it have it? etc). Do that, attach comments to each of the five, then send a note here listing the five you reviewed and the patch id of your patch and one of those patch angels will take a look at your patch (if they haven't already). Skip From chad at zetaweb.com Mon Jul 31 22:07:10 2006 From: chad at zetaweb.com (Chad Whitacre) Date: Mon, 31 Jul 2006 16:07:10 -0400 Subject: [Python-Dev] Patch submitted, now what? In-Reply-To: <bbaeab100607311256u281af267wd5b4d79b7db7b203@mail.gmail.com> References: <44CE4E39.5010006@zetaweb.com> <bbaeab100607311153q52753e27yf371e2ad19d61405@mail.gmail.com> <44CE5D58.6060407@zetaweb.com> <bbaeab100607311256u281af267wd5b4d79b7db7b203@mail.gmail.com> Message-ID: <44CE62EE.2010801@zetaweb.com> Brett, > "We" is most of the developers on python-dev. There is a Python patches > mailing list that most developers subscribe to. Helpful, thanks. > The latter unfortunatley. Since this all relies on people's volunteer > efforts the patch usually has to matter to someone to lead to them putting > the time and effort into getting it checked in. 
Hrm, ok, also good to know. So it sounds like I wait another week or so and then maybe approach some folks individually. chad From chad at zetaweb.com Mon Jul 31 22:08:32 2006 From: chad at zetaweb.com (Chad Whitacre) Date: Mon, 31 Jul 2006 16:08:32 -0400 Subject: [Python-Dev] Patch submitted, now what? In-Reply-To: <17614.25303.255526.960212@montanaro.dyndns.org> References: <44CE4E39.5010006@zetaweb.com> <bbaeab100607311153q52753e27yf371e2ad19d61405@mail.gmail.com> <44CE5D58.6060407@zetaweb.com> <bbaeab100607311256u281af267wd5b4d79b7db7b203@mail.gmail.com> <17614.25303.255526.960212@montanaro.dyndns.org> Message-ID: <44CE6340.30505@zetaweb.com> Skip, > It's worth noting that a number of people will look at a patch after the > submitter has reviewed five other patches or bug reports Also helpful, thanks! chad From martin at v.loewis.de Mon Jul 31 22:48:36 2006 From: martin at v.loewis.de (=?ISO-8859-1?Q?=22Martin_v=2E_L=F6wis=22?=) Date: Mon, 31 Jul 2006 22:48:36 +0200 Subject: [Python-Dev] Patch submitted, now what? In-Reply-To: <44CE4E39.5010006@zetaweb.com> References: <44CE4E39.5010006@zetaweb.com> Message-ID: <44CE6CA4.1080908@v.loewis.de> Chad Whitacre schrieb: [I notice that my message comes across pretty negative. In a single sentence: We are all volunteers with limited time, and we contribute to Python because its fun and because it helps us solve our own problems.] > Last week I submitted a patch (my first), and now I'm wondering what my > expectations should be. As a starting point, it would be better if you had provided the patch number (1528167) in this message. That makes it easier to answer the questions. [I didn't read until the end of the message yet] > Do I sit around and wait? How long? It's a new feature, so it can't possibly get into Python 2.5. That means that no action is likely taken before October. It might take years until the patch gets considered. As others have mentioned, there are ways to expedite processing. > Do I notify this list? 
If you want discussion of the patch, you can do that. Depending on how many people care about that aspect of Python, you may or may not get a response. I personally don't see myself using $-interpolation in the next few years, so I doubt I will react in any way (the Python library would just be the same for me if that feature didn't exist, and your patch isn't a bug fix). Others are more interested in the feature (or else it wouldn't have gotten into the standard library), so they might respond. > Do I notify a specific person, say, an author or reviewer of > the original code I modified? You might do that, although some may consider this at the edge of spamming. > Do I use SF's assignment mechanism? Please don't. If your assignment is "incorrect", this is a guarantee that the patch will sit there forever: the person assigned won't react because he doesn't feel responsible if he isn't, and nobody else will feel responsible because the patch is assigned. As Skip explains, I have a 5-for-1-rule for people who really want to push their patches: If you review 5 patches, I will review yours (despite me normally ignoring patches to this part of the library). Regards, Martin From chad at zetaweb.com Mon Jul 31 23:07:23 2006 From: chad at zetaweb.com (Chad Whitacre) Date: Mon, 31 Jul 2006 17:07:23 -0400 Subject: [Python-Dev] Patch submitted, now what? In-Reply-To: <44CE6CA4.1080908@v.loewis.de> References: <44CE4E39.5010006@zetaweb.com> <44CE6CA4.1080908@v.loewis.de> Message-ID: <44CE710B.4010905@zetaweb.com> Martin, Thanks for the reply. > I notice that my message comes across pretty negative. No worries. I'm a volunteer too. :) > It's a new feature, so it can't possibly get into Python 2.5. That means > that no action is likely taken before October. It might take years until > the patch gets considered. Helpful, thanks. 
> As Skip explains, I have a 5-for-1-rule for people who really want > to push their patches: If you review 5 patches, I will review yours Yeah, this hit the bulls-eye for me. The 5-for-1 rule is a great way to specify how newcomers can influence patch workflow w/o becoming spammers. [Brett: +1 on mentioning this in the updated Patch Guidelines] Watch for another post once I've found the time to volunteer to review 5 other patches. :) Thanks again, all! Consider my original questions answered. chad From oliphant.travis at ieee.org Mon Jul 31 20:28:09 2006 From: oliphant.travis at ieee.org (Travis Oliphant) Date: Mon, 31 Jul 2006 12:28:09 -0600 Subject: [Python-Dev] Bad interaction of __index__ and sequence repeat In-Reply-To: <44CB92F1.9040304@iinet.net.au> References: <20060728101133.GA339@code0.codespeak.net> <44CB6B7D.1040001@gmail.com> <44CB92F1.9040304@iinet.net.au> Message-ID: <44CE4BB9.60903@ieee.org> Nick Coghlan wrote: > Nick Coghlan wrote: >> Armin Rigo wrote: >>> Hi, >>> >>> There is an oversight in the design of __index__() that only just >>> surfaced :-( It is responsible for the following behavior, on a 32-bit >>> machine with >= 2GB of RAM: >>> >>> >>> s = 'x' * (2**100) # works! >>> >>> len(s) >>> 2147483647 >>> >>> This is because PySequence_Repeat(v, w) works by applying >>> w.__index__ in >>> order to call v->sq_repeat. However, __index__ is defined to clip the >>> result to fit in a Py_ssize_t. This means that the above problem >>> exists >>> with all sequences, not just strings, given enough RAM to create such >>> sequences with 2147483647 items. >>> >>> For reference, in 2.4 we correctly get an OverflowError. >>> >>> Argh! What should be done about it? >> >> I've now got a patch on SF that aims to fix this properly [1]. > > I revised this patch to further reduce the code duplication associated > with the indexing code in the standard library. 
> > The patch now has three new functions in the abstract C API: > > PyNumber_Index (used in a dozen or so places) > - raises IndexError on overflow > PyNumber_AsSsize_t (used in 3 places) > - raises OverflowError on overflow > PyNumber_AsClippedSsize_t() (used once, by _PyEval_SliceIndex) > - clips to PY_SSIZE_T_MIN/MAX on overflow > > All 3 have an int * output argument allowing type errors to be flagged > directly to the caller rather than through PyErr_Occurred(). > > Of the 3, only PyNumber_Index is exposed through the operator module. > > Probably the most interesting thing now would be for Travis to review > it, and see whether it makes things easier to handle for the Numeric > scalar types (given the amount of code the patch deleted from the > builtin and standard library data types, hopefully the benefits to > Numeric will be comparable). I noticed most of the checks for PyInt where removed in the patch. If I remember correctly, I left these in for "optimization." Other than that, I think the patch is great. As far as helping with NumPy, I think it will help to be able to remove special-checks for all the different integer-types. But, this has not yet been done in the NumPy code. -Travis
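
The distinction discussed in this thread (raise on overflow for a repeat count, clip for a slice bound) can be sketched at the Python level. This is only an illustration under stated assumptions, not the C API from the patch: the Offset class and the repeat_checked helper are hypothetical names, and operator.index and sys.maxsize are used as they exist in current Python.

```python
import operator
import sys


class Offset:
    """Hypothetical integer-like type that opts in through __index__."""

    def __init__(self, value):
        self.value = value

    def __index__(self):
        return self.value


def repeat_checked(seq, count):
    """Repeat seq, refusing counts that cannot fit in a sequence length."""
    n = operator.index(count)  # TypeError for non-integer types such as float
    if n > sys.maxsize:
        # Raise instead of silently clipping the count.
        raise OverflowError("repeat count too large")
    return seq * n


if __name__ == "__main__":
    print(repeat_checked("ab", Offset(2)))
    # Slice bounds, by contrast, are clipped rather than rejected.
    print("abc"[:2**100])
```

The point of the sketch is the asymmetry: an oversized slice bound can be clipped safely, while an oversized repeat count has to be reported as an error.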

From python-dev at zesty.ca Sat Jul 1 00:00:26 2006 From: python-dev at zesty.ca (Ka-Ping Yee) Date: Fri, 30 Jun 2006 17:00:26 -0500 (CDT) Subject: [Python-Dev] 2.5 and beyond In-Reply-To: <001201c69c65$b3869750$6402a8c0@arkdesktop> References: <001201c69c65$b3869750$6402a8c0@arkdesktop> Message-ID: On Fri, 30 Jun 2006, Andrew Koenig wrote: > I saw messages out of sequence and did not realize that this would be a > change in behavior from 2.4. Sigh. Yes, this is not a good time to change it. > I hope Py3000 has lexical scoping a la Scheme... Me too -- that would be really nice. -- ?!ng From python-dev at zesty.ca Sat Jul 1 00:05:25 2006 From: python-dev at zesty.ca (Ka-Ping Yee) Date: Fri, 30 Jun 2006 17:05:25 -0500 (CDT) Subject: [Python-Dev] For sandboxing: alternative to crippling file() In-Reply-To: References: <20060630175205.GA17748@code0.codespeak.net> Message-ID: On Fri, 30 Jun 2006, Brett Cannon wrote: > On 6/30/06, Armin Rigo wrote: > > >>> object.__subclasses__() > > [..., ] > > > > Maybe this one won't work if __subclasses__ is forbidden, but in general > > I think there *will* be a way to find this object. > > Yeah, that's been my (what I thought was paranoid) feeling. Glad I am not > the only one who thinks that hiding file() is near impossible. If you want to do this right, it should be about *making* hiding possible. If you can't hide things, it will be hard to get very far. I realize that may be difficult for Python 2.x, but hiding is pretty essential for security. It would be really good to keep this in mind for the design of Python 3k. (It doesn't mean we can't have introspection, just that we need to agree on some discipline for how to do it.) -- ?!ng From jimjjewett at gmail.com Sat Jul 1 00:13:37 2006 From: jimjjewett at gmail.com (Jim Jewett) Date: Fri, 30 Jun 2006 18:13:37 -0400 Subject: [Python-Dev] traceback regression Message-ID: python.org/sf/1515343 fixes python.org/sf/1515163, which is a new-in-2.5 regression. 
On the one hand, the regression only affects >>> raise "string1", "string2" which is both obscure and deprecated. On the other hand, it is a regression, and it is something I bumped into while working with unittest. [Note that I probably won't be checking email again for a week, so I'm afraid I won't be very responsive to comments.] -jJ From guido at python.org Sat Jul 1 00:24:04 2006 From: guido at python.org (Guido van Rossum) Date: Fri, 30 Jun 2006 15:24:04 -0700 Subject: [Python-Dev] 2.5 and beyond In-Reply-To: References: <001201c69c65$b3869750$6402a8c0@arkdesktop> Message-ID: On 6/30/06, Ka-Ping Yee wrote: > On Fri, 30 Jun 2006, Andrew Koenig wrote: > > I saw messages out of sequence and did not realize that this would be a > > change in behavior from 2.4. Sigh. > > Yes, this is not a good time to change it. > > > I hope Py3000 has lexical scoping a la Scheme... > > Me too -- that would be really nice. That's not a very constructive proposal (especially since I don't know Scheme). Perhaps you could elaborate on what needs to change? -- --Guido van Rossum (home page: http://www.python.org/~guido/) From brett at python.org Sat Jul 1 00:33:20 2006 From: brett at python.org (Brett Cannon) Date: Fri, 30 Jun 2006 15:33:20 -0700 Subject: [Python-Dev] For sandboxing: alternative to crippling file() In-Reply-To: References: <20060630175205.GA17748@code0.codespeak.net> Message-ID: On 6/30/06, Ka-Ping Yee wrote: > > On Fri, 30 Jun 2006, Brett Cannon wrote: > > On 6/30/06, Armin Rigo wrote: > > > >>> object.__subclasses__() > > > [..., ] > > > > > > Maybe this one won't work if __subclasses__ is forbidden, but in > general > > > I think there *will* be a way to find this object. > > > > Yeah, that's been my (what I thought was paranoid) feeling. Glad I am > not > > the only one who thinks that hiding file() is near impossible. > > If you want to do this right, it should be about *making* hiding > possible. If you can't hide things, it will be hard to get very far. 
Well, this is only file() we are worrying about leaking out. Stuff from import are the worry here. I realize that may be difficult for Python 2.x, but hiding is pretty > essential for security. It would be really good to keep this in mind > for the design of Python 3k. (It doesn't mean we can't have > introspection, > just that we need to agree on some discipline for how to do it.) That's fine; I personally have no issue with tweaking the security model for Py3K. But I am worrying about 2.x here. -Brett -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/python-dev/attachments/20060630/172e6d14/attachment-0001.htm From python-dev at zesty.ca Sat Jul 1 01:03:14 2006 From: python-dev at zesty.ca (Ka-Ping Yee) Date: Fri, 30 Jun 2006 18:03:14 -0500 (CDT) Subject: [Python-Dev] Lexical scoping in Python 3k In-Reply-To: References: <001201c69c65$b3869750$6402a8c0@arkdesktop> Message-ID: On Fri, 30 Jun 2006, Guido van Rossum wrote: > On 6/30/06, Ka-Ping Yee wrote: > > On Fri, 30 Jun 2006, Andrew Koenig wrote: > > > I hope Py3000 has lexical scoping a la Scheme... > > > > Me too -- that would be really nice. > > That's not a very constructive proposal (especially since I don't know > Scheme). Perhaps you could elaborate on what needs to change? Sorry! I should have been more clear. Right now, Python supports the existence of nested scopes: a = 3 def f(): a = 4 def g(): a = 5 print a # prints 5 g() print a # prints 4 f() print a # prints 3 The above example shows that there are three distinct scopes, and that each one has a distinct binding named 'a' -- assigning to one doesn't affect the others. a = 3 def f(): b = 4 def g(): c = 5 print a, b, c # i can see all three g() f() The above example shows that all of the scopes can be *read*. But in today's Python, not all of the scopes can be *written*. 

    a = 3
    def f():
        b = 4
        def g():
            c = 5
            a, b, c = 0, 1, 2   # changes local c, not outer a and b
        g()
    f()

The code in g() can affect its own local, 'c', and it can affect the
global variable 'a' if it declares 'global a', but no matter what you
write in g(), it cannot assign to 'b' (or to any other intermediate
scope).

This is a strange limitation and it would be nice to remove it.
The asymmetry comes from Python having one foot in the new paradigm
of nested lexical scopes and one foot still in the older paradigm of
only two scopes, local and global.

Most other languages that support lexical scoping (including Scheme,
JavaScript, Ruby, Perl, E, Java, Smalltalk) provide a uniform way
to read and write to scopes at all levels. This is done by letting
programmers specify the scope in which they want a variable bound
(usually with a keyword like "var" in JavaScript, "my" in Perl, or
"define" in E).

So here are some thoughts on how Python might be adjusted to support
this. I'm not saying these would be the only ways, but at least
they're some ideas to start with.

In JavaScript, the "var" keyword is required whenever you want to
declare a local variable. Anything without "var" is assumed to be
a global name. The cleanest and most consistent solution that
comes to mind would be to adopt exactly this for Python.

Without "var":

    a = 3                       # global
    def f():
        b = 4                   # global
        def g():
            c = 5               # global
            a, b, c = 0, 1, 2   # changes all three globals
        g()
    f()
    print a, b, c               # prints 0, 1, 2

With "var":

    var a = 3                   # global
    def f():
        var b = 4               # local to f
        def g():
            var c = 5           # local to g
            a, b, c = 0, 1, 2   # changes outer a, outer b, and c
            print c             # prints 2
        g()
        print b                 # prints 1
    f()
    print a                     # prints 0
    print b                     # no such name
    print c                     # no such name

But that is a big change. Perhaps it would be too unpythonic to
suddenly require declarations for all local variables. So an
alternative would be to retain the default assumption that
undeclared variables are local.
Here's what we'd get:

Without "var":

    a = 3
    def f():
        b = 4
        def g():
            c = 5
            a, b, c = 0, 1, 2   # changes local c, not outer a and b
        g()
    f()

With "var":

    var a = 3
    def f():
        var b = 4
        def g():
            var c = 5
            a, b, c = 0, 1, 2   # changes outer a, outer b, and c
        g()
    f()

Now i think this is a little bit weird, because the statement
"var b = 4" in an outer scope changes the meaning of "b" in an
inner scope. But it does have the virtue of retaining behaviour
compatible with today's Python, while offering a way to get proper
lexical scopes for those who want to use them.

Thoughts? Other ideas?


-- ?!ng
From ark at acm.org Sat Jul 1 01:04:53 2006
From: ark at acm.org (Andrew Koenig)
Date: Fri, 30 Jun 2006 19:04:53 -0400
Subject: [Python-Dev] 2.5 and beyond
In-Reply-To:
Message-ID: <000d01c69c99$9c023ec0$6402a8c0@arkdesktop>

> That's not a very constructive proposal (especially since I don't know
> Scheme). Perhaps you could elaborate on what needs to change?

The fundamental principle is that the binding of every name is
determined during compilation, not during execution. This property
does not quite apply to Python at present. For example:

    x = 42
    def f():
        y = x
        x = 123
        return y
    f()

This example fails with "local variable 'x' referenced before
assignment" because the compiler sees that f contains an assignment to
x, so it makes x a local variable, and then when you try to assign x to
y during execution, it fails. This behavior is consistent with the
notion of lexical scoping.

However, if I write

    def g():
        return x
    x = 42
    g()

the result is 42. With lexical scoping, I believe it should be
undefined.

The reason is that when the compiler encounters the definition of g,
variable x is not yet bound, and there is nothing in the body of g that
would bind it. Therefore, g requires a binding to exist at the time it
is compiled; because no such binding exists, this example would be an
error (at compile time) under lexical scoping.
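The two cases described above can be reproduced side by side. This is a minimal sketch in Python 3 syntax (an assumption; the thread predates it), and the helper names `result_f` and `result_g`, as well as the use of `z` in place of the second `x`, are illustrative only.

```python
x = 42


def f():
    # The assignment two lines down makes x local to f for the whole body,
    # so this read fails at run time even though a global x exists.
    y = x
    x = 123
    return y


try:
    f()
    result_f = "no error"
except UnboundLocalError as exc:
    result_f = type(exc).__name__


def g():
    # Nothing in g binds z, so the compiler treats it as a global and the
    # lookup is deferred until g is actually called.
    return z


z = 42  # bound after g is defined, but before g is called
result_g = g()

# The compiler's decision is visible on the code objects themselves.
x_is_local_in_f = "x" in f.__code__.co_varnames
z_is_global_in_g = "z" in g.__code__.co_names
```

In both functions the scope of the name is fixed when the function is compiled; what differs is only whether a binding exists in that scope by the time the name is read.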
From scott+python-dev at scottdial.com Sat Jul 1 01:13:58 2006 From: scott+python-dev at scottdial.com (Scott Dial) Date: Fri, 30 Jun 2006 19:13:58 -0400 Subject: [Python-Dev] 2.5 and beyond In-Reply-To: References: <001201c69c65$b3869750$6402a8c0@arkdesktop> Message-ID: <44A5B036.7030905@scottdial.com> Guido van Rossum wrote: > On 6/30/06, Ka-Ping Yee wrote: >> On Fri, 30 Jun 2006, Andrew Koenig wrote: >>> I hope Py3000 has lexical scoping a la Scheme... >> Me too -- that would be really nice. > > That's not a very constructive proposal (especially since I don't know > Scheme). Perhaps you could elaborate on what needs to change? I believe the essence of their request for lexical scope boils down to allowing rebinding. Such code like the following is legal in Scheme: def f(x): def incr(): x = x + 1 return x def decr(): x = x - 1 return x return (incr, decr) (incr, decr) = f(1) print incr() # 2 print incr() # 3 print decr() # 2 print decr() # 1 -- FWIW, the Scheme equivalent would be something like: (define f (lambda (x) (list (lambda () (set! x (+ x 1)) x) (lambda () (set! x (- x 1)) x)))) (let ([fs (f 1)]) (let ([incr (car fs)] [decr (cadr fs)]) (display (incr)) (newline) ; 2 (display (incr)) (newline) ; 3 (display (decr)) (newline) ; 2 (display (decr)) (newline))) ; 1 As a more personal aside, I can't imagine where I would use this in any python program I have ever wrote. I actually never noticed that rebinding wasn't allowed until recently. -- Scott Dial scott at scottdial.com scodial at indiana.edu From tim.peters at gmail.com Sat Jul 1 01:23:56 2006 From: tim.peters at gmail.com (Tim Peters) Date: Fri, 30 Jun 2006 19:23:56 -0400 Subject: [Python-Dev] 2.5 and beyond In-Reply-To: References: <001201c69c65$b3869750$6402a8c0@arkdesktop> Message-ID: <1f7befae0606301623q4072f138vb4b7c24ca7bbff08@mail.gmail.com> [Andrew Koenig] >>> I saw messages out of sequence and did not realize that this would be a >>> change in behavior from 2.4. Sigh. 
[Ka-Ping Yee] >> Yes, this is not a good time to change it. >>> I hope Py3000 has lexical scoping a la Scheme... >> Me too -- that would be really nice. [Guido] > That's not a very constructive proposal (especially since I don't know > Scheme). Perhaps you could elaborate on what needs to change? It's effectively the opposite of Python <0.1 wink>: a name is local to a scope in Scheme if and only if a declaration says it is. For example, the "let" family of forms is often used for this purpose, and (let ((x 2) (y 3)) # declares x and y as local to this `let`, and gives initial values (let ((x 7) (z (+ x y))) # x comes from outer `let` so is 2, and z is 2+3=5 (* z x))) # x comes from inner `let`, so this is 5*7=35 If you use `let*` instead of `let` in the inner one, z picks up the inner x=7, so that z is 7+3=10, and the result is 7*10 = 70 instead. The bindings in a `let` "happen" in an undefined order. In `let*`, a binding is visible "to its right" within the `let*`. Then there's `letrec`, which allows establishing mutually recursive bindings. While the `let` family is entirely about declaration, there are lots of other forms that mix in some declaration as part of their purpose (for example, the names in a lambda's argument list are local to the lambda's body), but they're all explicit in Scheme. I read "a la Scheme" here as "actually nothing like Scheme, except I want a non-tricky way to rebind a name from an enclosing scope within an enclosed scope". In Scheme, the scope a name x belongs to is found by searching enclosing scopes until you hit the first with an explicit "x belongs to me" declaration (OK, there's a hokey fallback to "top level" definitions too). Searching "textually up" always suffices (there's no form of delayed declaration -- a name must be declared before use). Scheme's assignment of assignment: (set! variable expression) has nothing to do with establishing the scope of `variable`. 
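For readers who, like Guido, do not know Scheme, the `let` and `let*` examples above can be approximated in Python with immediately applied lambdas, since a lambda's parameter list is also an explicit declaration of local names. This is only a rough sketch of the scoping rules, not a claim about how Scheme is implemented; the names `let_result` and `let_star_result` are illustrative.

```python
# (let ((x 2) (y 3))
#   (let ((x 7) (z (+ x y)))
#     (* z x)))
# The inner bindings are computed in the OUTER scope, so z sees x = 2.
let_result = (lambda x, y:
              (lambda x, z: z * x)(7, x + y)
              )(2, 3)

# With let* in the inner form, each binding is visible to the ones on its
# right, which corresponds to nesting one lambda per binding: z sees x = 7.
let_star_result = (lambda x, y:
                   (lambda x:
                    (lambda z: z * x)(x + y)
                    )(7)
                   )(2, 3)
```

With these definitions `let_result` is 35 and `let_star_result` is 70, matching the values worked out in the message above.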
From ark at acm.org Sat Jul 1 01:27:06 2006 From: ark at acm.org (Andrew Koenig) Date: Fri, 30 Jun 2006 19:27:06 -0400 Subject: [Python-Dev] 2.5 and beyond In-Reply-To: <5.1.1.6.0.20060630190709.01ef5858@sparrow.telecommunity.com> Message-ID: <001201c69c9c$b6531fd0$6402a8c0@arkdesktop> > That sounds like a bug, not a feature. It's frequently useful to have > forward references in function bodies to names that are not yet globally > bound, e.g. for classes, or mutually-recursive functions. The trouble is that you don't necessarily know in what scope they will be defined, so I think that forcing you to be explicit about it is useful. Can you show me an example of where you think it isn't? Incidentally, I think that lexical scoping would also deal with the problem that people often encounter in which they have to write things like "lambda x=x:" where one would think "lambda x:" would suffice. From ark at acm.org Sat Jul 1 01:29:45 2006 From: ark at acm.org (Andrew Koenig) Date: Fri, 30 Jun 2006 19:29:45 -0400 Subject: [Python-Dev] 2.5 and beyond In-Reply-To: <1f7befae0606301623q4072f138vb4b7c24ca7bbff08@mail.gmail.com> Message-ID: <001401c69c9d$12d53900$6402a8c0@arkdesktop> > I read "a la Scheme" here as "actually nothing like Scheme, except I > want a non-tricky way to rebind a name from an enclosing scope within > an enclosed scope". Almost. What I really want is for it to be possible to determine the binding of every name by inspecting the source text of the program. Right now, it is often possible to do so, but sometimes it isn't. From pje at telecommunity.com Sat Jul 1 01:09:41 2006 From: pje at telecommunity.com (Phillip J. 
Eby) Date: Fri, 30 Jun 2006 19:09:41 -0400 Subject: [Python-Dev] 2.5 and beyond In-Reply-To: <000d01c69c99$9c023ec0$6402a8c0@arkdesktop> References: Message-ID: <5.1.1.6.0.20060630190709.01ef5858@sparrow.telecommunity.com> At 07:04 PM 6/30/2006 -0400, Andrew Koenig wrote: >However, if I write > > def g(): > return x > x = 42 > g() > >the result is 42. With lexical scoping, I believe it should be undefined. > >The reason is that when the compiler encounters the definition of g, >variable x is not yet bound, and there is nothing in the body of g that >would bind it. Therefore, g requires a binding to exist at the time it is >compiled; because no such binding exists, this example would be an error (at >compile time) under lexical scoping. That sounds like a bug, not a feature. It's frequently useful to have forward references in function bodies to names that are not yet globally bound, e.g. for classes, or mutually-recursive functions. From tim.peters at gmail.com Sat Jul 1 01:39:03 2006 From: tim.peters at gmail.com (Tim Peters) Date: Fri, 30 Jun 2006 19:39:03 -0400 Subject: [Python-Dev] 2.5 and beyond In-Reply-To: <001201c69c9c$b6531fd0$6402a8c0@arkdesktop> References: <5.1.1.6.0.20060630190709.01ef5858@sparrow.telecommunity.com> <001201c69c9c$b6531fd0$6402a8c0@arkdesktop> Message-ID: <1f7befae0606301639u6f8c968cr29afce0ee7fbb43b@mail.gmail.com> [Andrew Koenig] > ... > Incidentally, I think that lexical scoping would also deal with the problem > that people often encounter in which they have to write things like "lambda > x=x:" where one would think "lambda x:" would suffice. They _shouldn't_ encounter that at all anymore. For example, >>> def f(x): ... return lambda: x+1 >>> f(3)() 4 works fine in modern Pythons. Earlier Python's had a no-exceptions 3-scope implementation (local, global, builtin), and in those the "x" in the lambda body was "not local" (was either global or builtin, although the compiler couldn't tell which of those two it was). 
In _those_ Pythons people had to write "lambda x=x: x+1" instead, to
suck the binding of the outer x into the lambda body, but if people are
still doing that they're confused. Modern Pythons do have lexical
scoping + global + builtin, although there's no way to spell "rebind a
name local to an outer scope from within an inner scope".
From guido at python.org Sat Jul 1 01:44:02 2006
From: guido at python.org (Guido van Rossum)
Date: Fri, 30 Jun 2006 16:44:02 -0700
Subject: [Python-Dev] 2.5 and beyond
In-Reply-To: <001401c69c9d$12d53900$6402a8c0@arkdesktop>
References: <1f7befae0606301623q4072f138vb4b7c24ca7bbff08@mail.gmail.com>
 <001401c69c9d$12d53900$6402a8c0@arkdesktop>
Message-ID:

On 6/30/06, Andrew Koenig wrote:
> > I read "a la Scheme" here as "actually nothing like Scheme, except I
> > want a non-tricky way to rebind a name from an enclosing scope within
> > an enclosed scope".
>
> Almost. What I really want is for it to be possible to determine the
> binding of every name by inspecting the source text of the program. Right
> now, it is often possible to do so, but sometimes it isn't.

Then your example

    def f():
        return x
    x = 42
    print f()

is entirely well-defined -- x is a global and the compiler in fact
generates code that benefits from knowing that it's not a local.

Python knows which locals there are; also which locals there are in
surrounding function scopes. It *could* also know which globals and
builtins there are, except the language currently allows dynamic
rebinding of module-level variables so that they replace builtins.
E.g.

    def f():
        return len([])
    print f()  # prints 0
    def len(x):
        return "booh"
    print f()  # prints "booh"
    del len
    print f()  # prints 0 again

Worse, instead of explicitly overriding len in the module, it could
have been an assignment to __main__.len in some other module.

We've been thinking on how to deal with this for years, since nobody
really likes it in all its freedom.
-- --Guido van Rossum (home page: http://www.python.org/~guido/) From jcarlson at uci.edu Sat Jul 1 01:56:58 2006 From: jcarlson at uci.edu (Josiah Carlson) Date: Fri, 30 Jun 2006 16:56:58 -0700 Subject: [Python-Dev] Lexical scoping in Python 3k In-Reply-To: References: Message-ID: <20060630163625.10D1.JCARLSON@uci.edu> Ka-Ping Yee wrote: [snip lexical scoping option] > Now i think this is a little bit weird, because the statement > "var b = 4" in an outer scope changes the meaning of "b" in an > inner scope. But it does have the virtue of retaining behaviour > compatible with today's Python, while offering a way to get proper > lexical scopes for those who want to use them. > > Thoughts? Other ideas? Using a keyword in an outer scope to state that a variable could be used in a nested scope is counter to the current method for accessing a parent scope with 'global'. Using 'var' as the equivalent of 'global', only for nested scopes, would be a more reasonable approach. However, I'm -1 on the feature now, for the same reasons I've been -1 on the feature for the last 2 times it has come up. In many of the cases where lexically nested scopes have been used to solve problems in Python, and programmers have run into a 'limitation' where not being able to modify a value in a parent scope has hindered them, the problem could have been better solved with another method that was more readable, more extensible, etc. What I asked before, and what I'd like to ask again, is if there are any _nontrivial uses_ of lexically nested scopes which are made cumbersome by our inability to write to parent scopes. If there aren't, then I'm going to again have to argue against new syntax, keywords, and their use. If there are, then we'll see how compelling such uses are. 
- Josiah From python-dev at zesty.ca Sat Jul 1 01:55:12 2006 From: python-dev at zesty.ca (Ka-Ping Yee) Date: Fri, 30 Jun 2006 18:55:12 -0500 (CDT) Subject: [Python-Dev] 2.5 and beyond In-Reply-To: <000d01c69c99$9c023ec0$6402a8c0@arkdesktop> References: <000d01c69c99$9c023ec0$6402a8c0@arkdesktop> Message-ID: On Fri, 30 Jun 2006, Andrew Koenig wrote: > The fundamental principle is that the binding of every name is determined > during compilation, not during execution. This property does not quite > apply to Python at present. I think this property does apply. In your example: > def g(): > return x > x = 42 > g() > > the result is 42. It is already known at compile time that the "return x" in g() refers to an 'x' in the outer scope. 'x' cannot be a local variable to g() because there are no statements in g() that bind 'x'. Regardless of whether the binding itself exists yet, you (the reader) and the compiler can know which scope to look in for the binding at runtime. Have i understood your desired property correctly? -- ?!ng From tim.peters at gmail.com Sat Jul 1 01:55:38 2006 From: tim.peters at gmail.com (Tim Peters) Date: Fri, 30 Jun 2006 19:55:38 -0400 Subject: [Python-Dev] 2.5 and beyond In-Reply-To: <001401c69c9d$12d53900$6402a8c0@arkdesktop> References: <1f7befae0606301623q4072f138vb4b7c24ca7bbff08@mail.gmail.com> <001401c69c9d$12d53900$6402a8c0@arkdesktop> Message-ID: <1f7befae0606301655i2425c0a1lf8d956cad0f028b7@mail.gmail.com> [Andrew Koenig] > Almost. What I really want is for it to be possible to determine the > binding of every name by inspecting the source text of the program. Right > now, it is often possible to do so, but sometimes it isn't. Local names are always determined at compile-time in Python. What you can't always determine is whether a non-local (to any enclosing scope) name will end up getting resolved from the module globals or from __builtin__. 
The runtime error in:

    def f():
        y = x
        x = 1

    f()

doesn't occur because Python doesn't know "x" is local to "f" at
compile-time (it does know that), it's because Python's compiler
doesn't do any flow analysis to detect potential use-before-definition.
Instead the runtime initializes locals to a special "not bound yet"
value that the LOAD_FAST (really "load local") opcode special-cases.

Note that this is quite unlike Scheme, in which declaration must appear
before use (ignoring fancy letrec cases), and declaration must also
supply an initial binding (Scheme has no "unbound local" problem
because there's no way to create an uninitialized local).
From rasky at develer.com Sat Jul 1 02:01:00 2006
From: rasky at develer.com (Giovanni Bajo)
Date: Sat, 1 Jul 2006 02:01:00 +0200
Subject: [Python-Dev] 2.5 and beyond
References: <5.1.1.6.0.20060630190709.01ef5858@sparrow.telecommunity.com><001201c69c9c$b6531fd0$6402a8c0@arkdesktop>
 <1f7befae0606301639u6f8c968cr29afce0ee7fbb43b@mail.gmail.com>
Message-ID: <020c01c69ca1$754c7310$d1b12997@bagio>

Tim Peters wrote:

>> ...
>> Incidentally, I think that lexical scoping would also deal with the
>> problem that people often encounter in which they have to write
>> things like "lambda x=x:" where one would think "lambda x:" would
>> suffice.
>
> They _shouldn't_ encounter that at all anymore. For example,
>
>>>> def f(x):
> ...     return lambda: x+1
>>>> f(3)()
> 4
>
> works fine in modern Pythons.

Yes but:

>>> a = []
>>> for i in range(10):
...     a.append(lambda: i)
...
>>> print [x() for x in a]
[9, 9, 9, 9, 9, 9, 9, 9, 9, 9]

This subtle semantic of lambda is quite confusing, and still forces
people to use the "i=i" trick.
Giovanni Bajo From tim.peters at gmail.com Sat Jul 1 02:15:51 2006 From: tim.peters at gmail.com (Tim Peters) Date: Fri, 30 Jun 2006 20:15:51 -0400 Subject: [Python-Dev] 2.5 and beyond In-Reply-To: <020c01c69ca1$754c7310$d1b12997@bagio> References: <5.1.1.6.0.20060630190709.01ef5858@sparrow.telecommunity.com> <001201c69c9c$b6531fd0$6402a8c0@arkdesktop> <1f7befae0606301639u6f8c968cr29afce0ee7fbb43b@mail.gmail.com> <020c01c69ca1$754c7310$d1b12997@bagio> Message-ID: <1f7befae0606301715m6fe7044eue4e5c8d74a7c8a22@mail.gmail.com> [Giovanni Bajo] > Yes but: > > >>> a = [] > >>> for i in range(10): > ... a.append(lambda: i) > ... > >>> print [x() for x in a] > [9, 9, 9, 9, 9, 9, 9, 9, 9, 9] > > This subtle semantic of lambda is quite confusing, and still forces people to > use the "i=i" trick. So stay away from excruciating abuses of lexical scoping you don't understand ;-) What do you _expect_ `i` to refer to? "Oh, it should guess that I didn't really mean to defer evaluation of the lambda body at all, but instead evaluate the lambda body at the time I define the lambda and then synthesize some other function that captures the specific outer bindings in effect at lambda-definition time" doesn't really cut it. Try spelling what you think you want here in Scheme. Before it works, you'll probably end up with some equally "atrocious" (let ((i i)) ...) gimmick to force capturing each binding for `i` as it flies by. Else Scheme will also use the outer binding for `i` in effect at the time the lambdas are _executed_. This isn't typical use for lambda, and I don't think it's what Andrew had in mind. 
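The disagreement above comes down to when the name `i` is looked up. A minimal sketch in Python 3 syntax (an assumption; the thread uses 2.x) contrasts the plain closure with the "i=i" default-argument trick; the function names `make_late` and `make_early` are hypothetical.

```python
def make_late():
    funcs = []
    for i in range(3):
        # Each lambda refers to the variable i itself, not to its current
        # value; the lookup happens only when the lambda is called.
        funcs.append(lambda: i)
    return funcs


def make_early():
    funcs = []
    for i in range(3):
        # A default argument is evaluated once, when the lambda is created,
        # so each function keeps the value i had on that iteration.
        funcs.append(lambda i=i: i)
    return funcs


late = [f() for f in make_late()]
early = [f() for f in make_early()]
```

Here `late` holds the final value of `i` three times, while `early` holds one value per iteration. Both results follow from the same scoping rule; only the moment of evaluation differs.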
From bjourne at gmail.com Sat Jul 1 03:11:59 2006
From: bjourne at gmail.com (=?ISO-8859-1?Q?BJ=F6rn_Lindqvist?=)
Date: Sat, 1 Jul 2006 03:11:59 +0200
Subject: [Python-Dev] Lexical scoping in Python 3k
In-Reply-To:
References: <001201c69c65$b3869750$6402a8c0@arkdesktop>
Message-ID: <740c3aec0606301811i6b323bei3abcfb05cde5b747@mail.gmail.com>

> With "var":
>
>     var a = 3
>     def f():
>         var b = 4
>         def g():
>             var c = 5
>             a, b, c = 0, 1, 2   # changes outer a, outer b, and c
>         g()
>     f()
>
> Now i think this is a little bit weird, because the statement
> "var b = 4" in an outer scope changes the meaning of "b" in an
> inner scope. But it does have the virtue of retaining behaviour
> compatible with today's Python, while offering a way to get proper
> lexical scopes for those who want to use them.
>
> Thoughts? Other ideas?

Maybe an object, like self, for referring to enclosing scopes?

    a = 3
    def f():
        b = 4
        def g():
            c = 5
            outer.outer.a, outer.b, c = 0, 1, 2   # changes outer a, outer b, and c
        g()
    f()

Chaining the keyword looks a little weird, but it is not often that
you have to refer to variables in the enclosing scope of the enclosing
scope. I have often wanted something similar to that for global
variables, instead of the global declaration:

    cache = None
    def init():
        if not global.cache:
            global.cache = init_cache()

--
mvh Björn
From janssen at parc.com Sat Jul 1 03:31:12 2006
From: janssen at parc.com (Bill Janssen)
Date: Fri, 30 Jun 2006 18:31:12 PDT
Subject: [Python-Dev] 2.5 and beyond
In-Reply-To: Your message of "Fri, 30 Jun 2006 17:01:00 PDT." <020c01c69ca1$754c7310$d1b12997@bagio>
Message-ID: <06Jun30.183119pdt."58641"@synergy1.parc.xerox.com>

> >>> a = []
> >>> for i in range(10):
> ...     a.append(lambda: i)
> ...
> >>> print [x() for x in a]
> [9, 9, 9, 9, 9, 9, 9, 9, 9, 9]

Isn't this exactly what you'd expect? Maybe I've been writing Python
for too long... :-).
Bill From rasky at develer.com Sat Jul 1 03:43:01 2006 From: rasky at develer.com (Giovanni Bajo) Date: Sat, 1 Jul 2006 03:43:01 +0200 Subject: [Python-Dev] 2.5 and beyond References: <5.1.1.6.0.20060630190709.01ef5858@sparrow.telecommunity.com> <001201c69c9c$b6531fd0$6402a8c0@arkdesktop> <1f7befae0606301639u6f8c968cr29afce0ee7fbb43b@mail.gmail.com> <020c01c69ca1$754c7310$d1b12997@bagio> <1f7befae0606301715m6fe7044eue4e5c8d74a7c8a22@mail.gmail.com> Message-ID: <027f01c69caf$b1c20450$d1b12997@bagio> [Giovanni Bajo] > Yes but: > >>>> a = [] >>>> for i in range(10): > ... a.append(lambda: i) > ... >>>> print [x() for x in a] > [9, 9, 9, 9, 9, 9, 9, 9, 9, 9] > > This subtle semantic of lambda is quite confusing, and still forces people to > use the "i=i" trick. [Tim Peters] > So stay away from excruciating abuses of lexical scoping you don't > understand What do you expect `i` to refer to? "Oh, it should > guess that I didn't really mean to defer evaluation of the lambda body > at all, but instead evaluate the lambda body at the time I define the > lambda and then synthesize some other function that captures the > specific outer bindings in effect at lambda-definition time" doesn't > really cut it. I think I understand what happens, I just don't know whether this can be "fixed" or not. Unless you are saying that the above behaviour is not only a complex side-effect the way things are, but the way things should be. Do you agree that it would be ideal if the above code generated range(10) instead of [9]*10, or you believe that the current behaviour is more sound (and if so, why)? As for actual implementing this change of semantic, the fact that `i` is a local variable in the outer scope (assuming it's all within a function), doesn't make it possible for Python to early-bound it, by realizing that, since `i` is not an argument of the lambda, and it's a local of the outer scope? 
At worst, couldn't Python do the "i=i" trick by itself when it sees that
`i` is a local in the outer scope? Right now I can't think off-hand of a
case in which this would break things.

[Tim Peters]
> This isn't typical use for lambda,

Yes, maybe it's not the most used idiom and Andrew wasn't referring to
this, but it happens quite often to me (where 'often' means 'many times'
among my rare usages of lambda).

For instance, in GUI code, it's common to do things like:

    for b in self.buttons:
        self.setEventCallback(b, "clicked",
            lambda: self.label.setText("I pressed button %r" % b))

... which of course won't work, as written above.

Giovanni Bajo
From guido at python.org Sat Jul 1 04:55:03 2006
From: guido at python.org (Guido van Rossum)
Date: Fri, 30 Jun 2006 19:55:03 -0700
Subject: [Python-Dev] ImportWarning flood
In-Reply-To: <44A5D5DA.2060601@hathawaymix.org>
References: <20060630194331.29014.836424634.divmod.quotient.16780@ohm>
 <44A5D5DA.2060601@hathawaymix.org>
Message-ID:

It's up to the release manager now to decide whether the pitchforks at
Google or the pitchforks in the larger Python community are sharper.
;-)

--Guido (ducks)

On 6/30/06, Shane Hathaway wrote:
> Guido van Rossum wrote:
> > On 6/30/06, Jean-Paul Calderone wrote:
> >> How about if someone grovels through import.c and figures out how to make
> >> the warning information only show up if the import actually fails?
> >
> > That would work I think. But it's not easy.
>
> I just posted a patch intended to solve this. With my patch,
> find_module() collects the warnings in a list and only generates
> ImportWarning if no package is found. It works, but I have not done
> extensive testing.
> > I also discovered and tried to resolve what appear to be memory leaks > involving the "copy" variable. If the ImportWarning patch doesn't fly, > I'll be happy to post a different patch that only fixes the leaks. > > The patch is #1515361: > > https://sourceforge.net/tracker/index.php?func=detail&aid=1515361&group_id=5470&atid=305470 > > Shane > > -- --Guido van Rossum (home page: http://www.python.org/~guido/) From skip at pobox.com Sat Jul 1 05:32:01 2006 From: skip at pobox.com (skip at pobox.com) Date: Fri, 30 Jun 2006 22:32:01 -0500 Subject: [Python-Dev] zlib module build failure on Mac OSX 10.4.7 Message-ID: <17573.60593.759293.549583@montanaro.dyndns.org> Just upgraded my Mac to OSX 10.4.7 yesterday. svn up'd Python trunk, then "make clean ; configure ; make" and I see that building the zlib module fails: gcc -fno-strict-aliasing -Wno-long-double -no-cpp-precomp -mno-fused-madd -DNDEBUG -g -O3 -Wall -Wstrict-prototypes -I. -I/Users/skip/src/python-svn/trunk/./Include -I/Users/skip/src/python-svn/trunk/./Mac/Include -I/Users/skip/local/include -I../Include -I. 
-I/usr/local/include -I/Users/skip/src/python-svn/trunk/Include -I/Users/skip/src/python-svn/trunk/build -c /Users/skip/src/python-svn/trunk/Modules/zlibmodule.c -o build/temp.macosx-10.3-ppc-2.5/Users/skip/src/python-svn/trunk/Modules/zlibmodule.o /Users/skip/src/python-svn/trunk/Modules/zlibmodule.c: In function 'PyZlib_uncopy': /Users/skip/src/python-svn/trunk/Modules/zlibmodule.c:724: warning: implicit declaration of function 'inflateCopy' gcc -bundle -undefined dynamic_lookup build/temp.macosx-10.3-ppc-2.5/Users/skip/src/python-svn/trunk/Modules/zlibmodule.o -L/Users/skip/local/lib -L/usr/local/lib -lz -o build/lib.macosx-10.3-ppc-2.5/zlib.so -Wl,-search_paths_first *** WARNING: renaming "zlib" since importing it failed: dlopen(build/lib.macosx-10.3-ppc-2.5/zlib.so, 2): Symbol not found: _inflateCopy Referenced from: build/lib.macosx-10.3-ppc-2.5/zlib.so Expected in: dynamic lookup Anybody else seen this? I checked the buildbot trunk osx 10.4. It seemed to have no trouble. And what's with the "10.3" bit in the directory names? Skip From greg.ewing at canterbury.ac.nz Sat Jul 1 05:45:02 2006 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Sat, 01 Jul 2006 15:45:02 +1200 Subject: [Python-Dev] doc for new restricted execution design for Python In-Reply-To: <043001c69b45$35742290$100a0a0a@enfoldsystems.local> References: <043001c69b45$35742290$100a0a0a@enfoldsystems.local> Message-ID: <44A5EFBE.9050300@canterbury.ac.nz> Mark Hammond wrote: > that helps "mozilla the platform" more than it helps "firebox the browser" ^^^^^^^ Firebox - the sandfoxed web browser! 
--
Greg

From tjreedy at udel.edu  Sat Jul  1 06:27:16 2006
From: tjreedy at udel.edu (Terry Reedy)
Date: Sat, 1 Jul 2006 00:27:16 -0400
Subject: [Python-Dev] 2.5 and beyond
References: <5.1.1.6.0.20060630190709.01ef5858@sparrow.telecommunity.com><001201c69c9c$b6531fd0$6402a8c0@arkdesktop><1f7befae0606301639u6f8c968cr29afce0ee7fbb43b@mail.gmail.com> <020c01c69ca1$754c7310$d1b12997@bagio>
Message-ID:

"Giovanni Bajo" wrote in message
news:020c01c69ca1$754c7310$d1b12997 at bagio...
> Yes but:
>
>>>> a = []
>>>> for i in range(10):
> ...     a.append(lambda: i)
> ...
>>>> print [x() for x in a]
> [9, 9, 9, 9, 9, 9, 9, 9, 9, 9]
>
> This subtle semantic of lambda is quite confusing, and still forces
> people to use the "i=i" trick.

The 'subtle semantic' has nothing to do with lambda; it is a property of
Python functions in general. The above is exactly equivalent (except for the
different .funcname) to

    a = []
    for i in range(10):
        def f(): return i
        a.append(f)
        del f

That should be equally confusing (or not), and equally requires the 'i=i'
trick (or not).

As is, either function definition is a constant and the loop makes useless
duplicates. Either form would have the same effect if hoisted out of the
loop.

Terry Jan Reedy

From greg.ewing at canterbury.ac.nz  Sat Jul  1 06:59:19 2006
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Sat, 01 Jul 2006 16:59:19 +1200
Subject: [Python-Dev] For sandboxing: alternative to crippling file()
In-Reply-To:
References:
Message-ID: <44A60127.3030706@canterbury.ac.nz>

Brett Cannon wrote:

> 1) Is removing 'file' from the builtins dict in PyInterpreterState (and
> maybe some other things) going to be safe enough to sufficiently hide
> 'file' confidently (short of someone being stupid in their C extension
> module and exposing 'file' directly)?
>
> 2) Changing open() to return C-implemented delegate objects for files
> (and thus won't type check, but this is Python so I am not worried about
> that too much) and delegate socket objects for IP and URL addresses.
My suggestion is to change things so that the constructor
of the file type doesn't open files (at least in restricted
mode). Then it wouldn't matter if untrusted code had real
file objects, as they couldn't use them to get access to
any other files.

--
Greg

From tjreedy at udel.edu  Sat Jul  1 07:00:40 2006
From: tjreedy at udel.edu (Terry Reedy)
Date: Sat, 1 Jul 2006 01:00:40 -0400
Subject: [Python-Dev] 2.5 and beyond
References: <5.1.1.6.0.20060630190709.01ef5858@sparrow.telecommunity.com><001201c69c9c$b6531fd0$6402a8c0@arkdesktop><1f7befae0606301639u6f8c968cr29afce0ee7fbb43b@mail.gmail.com><020c01c69ca1$754c7310$d1b12997@bagio><1f7befae0606301715m6fe7044eue4e5c8d74a7c8a22@mail.gmail.com> <027f01c69caf$b1c20450$d1b12997@bagio>
Message-ID:

"Giovanni Bajo" wrote in message
news:027f01c69caf$b1c20450$d1b12997 at bagio...
> [Giovanni Bajo]
>> Yes but:
>>
>>>>> a = []
>>>>> for i in range(10):
>> ...     a.append(lambda: i)
>> ...
>>>>> print [x() for x in a]
>> [9, 9, 9, 9, 9, 9, 9, 9, 9, 9]
>. Do you agree that it would be ideal if the above code
> generated range(10) instead of [9]*10,

No. You are trying to reify an optical illusion resulting from putting a
constant function definition inside a loop. Making the meaning of
'def f(): return i' depend on the definition-time context by partially and
variably evaluating the body would make code much harder to read and
understand. Consider:

    if a: i=666
    def f(): return i

> At worse, couldn't Python do the "i=i" trick by itself when it sees that
> `i` is a local in the outer scope? Right now I can't think off-hand of a
> case in which this would break things.

It would make code more fragile.

    for i in range(666): print name[i]
    ...
    ...
    def total(num): return cost[item]*num

Now someone decides first loop should have more expressive loop var name and
changes the first line to

    for item in range(666): print name[item]

and the meaning of total is completely changed.
Adding such long-range coupling between language statements strikes me as a poor idea. Terry Jan Reedy From greg.ewing at canterbury.ac.nz Sat Jul 1 07:39:05 2006 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Sat, 01 Jul 2006 17:39:05 +1200 Subject: [Python-Dev] Lexical scoping in Python 3k In-Reply-To: References: <001201c69c65$b3869750$6402a8c0@arkdesktop> Message-ID: <44A60A79.8090802@canterbury.ac.nz> Ka-Ping Yee wrote: > while offering a way to get proper > lexical scopes for those who want to use them. I don't disagree with anything you said, but I think it would be a good idea to avoid using phrases like "proper lexical scopes", which is likely to set people off on a tangent. The issue isn't lexicality, it's writeability. -- Greg From nnorwitz at gmail.com Sat Jul 1 07:53:26 2006 From: nnorwitz at gmail.com (Neal Norwitz) Date: Fri, 30 Jun 2006 22:53:26 -0700 Subject: [Python-Dev] zlib module build failure on Mac OSX 10.4.7 In-Reply-To: <17573.60593.759293.549583@montanaro.dyndns.org> References: <17573.60593.759293.549583@montanaro.dyndns.org> Message-ID: Maybe do a make distclean. There was a problem where old versions of zlib (those without inflateCopy) weren't supported. They are now, but it's a configure check. That coupled with the upgrade and the 10.3 in the pathname, seems like it's just something didn't get cleaned up properly. You could always rm -rf build/ n -- On 6/30/06, skip at pobox.com wrote: > Just upgraded my Mac to OSX 10.4.7 yesterday. svn up'd Python trunk, then > "make clean ; configure ; make" and I see that building the zlib module > fails: > > gcc -fno-strict-aliasing -Wno-long-double -no-cpp-precomp -mno-fused-madd -DNDEBUG -g -O3 -Wall -Wstrict-prototypes -I. -I/Users/skip/src/python-svn/trunk/./Include -I/Users/skip/src/python-svn/trunk/./Mac/Include -I/Users/skip/local/include -I../Include -I. 
-I/usr/local/include -I/Users/skip/src/python-svn/trunk/Include -I/Users/skip/src/python-svn/trunk/build -c /Users/skip/src/python-svn/trunk/Modules/zlibmodule.c -o build/temp.macosx-10.3-ppc-2.5/Users/skip/src/python-svn/trunk/Modules/zlibmodule.o > /Users/skip/src/python-svn/trunk/Modules/zlibmodule.c: In function 'PyZlib_uncopy': > /Users/skip/src/python-svn/trunk/Modules/zlibmodule.c:724: warning: implicit declaration of function 'inflateCopy' > gcc -bundle -undefined dynamic_lookup build/temp.macosx-10.3-ppc-2.5/Users/skip/src/python-svn/trunk/Modules/zlibmodule.o -L/Users/skip/local/lib -L/usr/local/lib -lz -o build/lib.macosx-10.3-ppc-2.5/zlib.so -Wl,-search_paths_first > *** WARNING: renaming "zlib" since importing it failed: dlopen(build/lib.macosx-10.3-ppc-2.5/zlib.so, 2): Symbol not found: _inflateCopy > Referenced from: build/lib.macosx-10.3-ppc-2.5/zlib.so > Expected in: dynamic lookup > > Anybody else seen this? I checked the buildbot trunk osx 10.4. It seemed > to have no trouble. And what's with the "10.3" bit in the directory names? > > Skip > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > http://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: http://mail.python.org/mailman/options/python-dev/nnorwitz%40gmail.com > From greg.ewing at canterbury.ac.nz Sat Jul 1 08:14:28 2006 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Sat, 01 Jul 2006 18:14:28 +1200 Subject: [Python-Dev] 2.5 and beyond In-Reply-To: <001201c69c9c$b6531fd0$6402a8c0@arkdesktop> References: <001201c69c9c$b6531fd0$6402a8c0@arkdesktop> Message-ID: <44A612C4.6020908@canterbury.ac.nz> Andrew Koenig wrote: > Incidentally, I think that lexical scoping would also deal with the problem > that people often encounter in which they have to write things like "lambda > x=x:" where one would think "lambda x:" would suffice. This is another red herring. 
Python's problem here is not because its scoping isn't lexical (it is). It's because Scheme implicitly introduces new scopes in various places where the equivalent Python constructs don't. -- Greg From greg.ewing at canterbury.ac.nz Sat Jul 1 08:14:54 2006 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Sat, 01 Jul 2006 18:14:54 +1200 Subject: [Python-Dev] 2.5 and beyond In-Reply-To: <1f7befae0606301655i2425c0a1lf8d956cad0f028b7@mail.gmail.com> References: <1f7befae0606301623q4072f138vb4b7c24ca7bbff08@mail.gmail.com> <001401c69c9d$12d53900$6402a8c0@arkdesktop> <1f7befae0606301655i2425c0a1lf8d956cad0f028b7@mail.gmail.com> Message-ID: <44A612DE.1080503@canterbury.ac.nz> Tim Peters wrote: > Note that this is quite unlike Scheme, in which declaration must > appear before use (ignoring fancy letrec cases), I think that's overstating things a bit -- mutually recursive functions are quite easy to write in Scheme and don't look at all "fancy" (unless you object for some reason to using (define ...)). > and declaration must > also supply an initial binding (Scheme has no "unbound local" problem > because there's no way to create an uninitialized local). That much is true. -- Greg From greg.ewing at canterbury.ac.nz Sat Jul 1 08:15:04 2006 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Sat, 01 Jul 2006 18:15:04 +1200 Subject: [Python-Dev] Lexical scoping in Python 3k In-Reply-To: <20060630163625.10D1.JCARLSON@uci.edu> References: <20060630163625.10D1.JCARLSON@uci.edu> Message-ID: <44A612E8.8050205@canterbury.ac.nz> Josiah Carlson wrote: > What I asked before, and what I'd like to ask again, is if there are any > _nontrivial uses_ of lexically nested scopes which are made cumbersome > by our inability to write to parent scopes. The trouble with taking that position is that the very cases which would benefit are very *simple* ones, where it would be cumbersome to refactor it to use a class, or mutable object in the outer scope, etc. 
So you've effectively set up your acceptance criteria to be unmeetable. > If there aren't, then I'm > going to again have to argue against new syntax, keywords, and their use. There's one very simple way we could do this in Py3k without requiring any new syntax or keywords: just redefine the meaning of "global" to mean "not local". -- Greg From greg.ewing at canterbury.ac.nz Sat Jul 1 08:15:29 2006 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Sat, 01 Jul 2006 18:15:29 +1200 Subject: [Python-Dev] 2.5 and beyond In-Reply-To: <020c01c69ca1$754c7310$d1b12997@bagio> References: <5.1.1.6.0.20060630190709.01ef5858@sparrow.telecommunity.com> <001201c69c9c$b6531fd0$6402a8c0@arkdesktop> <1f7befae0606301639u6f8c968cr29afce0ee7fbb43b@mail.gmail.com> <020c01c69ca1$754c7310$d1b12997@bagio> Message-ID: <44A61301.2070805@canterbury.ac.nz> Giovanni Bajo wrote: >>>>a = [] >>>>for i in range(10): > > ... a.append(lambda: i) > ... > >>>>print [x() for x in a] > > [9, 9, 9, 9, 9, 9, 9, 9, 9, 9] > > This subtle semantic of lambda is quite confusing, and still forces people to > use the "i=i" trick. This has *nothing* to do with the semantics of lambda! It's because Python's for-loop doesn't put its control variable in a new scope, the way Scheme's equivalent construct does. *That's* what needs to be addressed to fix this problem. I've made a suggestion about that before, but Guido rejected it, so I won't repeat it here. 
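
The behaviour discussed above can be reproduced with a minimal sketch. It assumes a current Python 3 interpreter (the thread itself uses Python 2 syntax), and the names `shared`, `defaults`, and `make_getter` are illustrative only, not taken from any message:

```python
# Late binding: every lambda looks up `i` when it is called,
# and the for-loop rebinds the same variable on each pass.
shared = []
for i in range(10):
    shared.append(lambda: i)
late = [f() for f in shared]

# The "i=i" trick: a default argument is evaluated once,
# at the time the lambda is defined.
defaults = []
for i in range(10):
    defaults.append(lambda i=i: i)
early = [f() for f in defaults]

# A factory function gives each closure its own enclosing scope.
def make_getter(value):
    return lambda: value

getters = [make_getter(n) for n in range(10)]
separate = [g() for g in getters]

print(late)      # all entries equal the last value of i
print(early)     # 0 through 9
print(separate)  # 0 through 9
```

Only the first loop shares a single binding; the other two capture the value at definition time, either through a default argument or through a fresh function scope.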
-- Greg From tim.peters at gmail.com Sat Jul 1 09:28:35 2006 From: tim.peters at gmail.com (Tim Peters) Date: Sat, 1 Jul 2006 03:28:35 -0400 Subject: [Python-Dev] 2.5 and beyond In-Reply-To: <44A612DE.1080503@canterbury.ac.nz> References: <1f7befae0606301623q4072f138vb4b7c24ca7bbff08@mail.gmail.com> <001401c69c9d$12d53900$6402a8c0@arkdesktop> <1f7befae0606301655i2425c0a1lf8d956cad0f028b7@mail.gmail.com> <44A612DE.1080503@canterbury.ac.nz> Message-ID: <1f7befae0607010028q5da09c6ahb2f8c6358dbcd510@mail.gmail.com> [Tim Peters] >> Note that this is quite unlike Scheme, in which declaration must >> appear before use (ignoring fancy letrec cases), [Greg Ewing] > I think that's overstating things a bit -- So do I :-), but I don't really care about Scheme here. > mutually recursive functions are quite easy to write in > Scheme and don't look at all "fancy" (unless you object for > some reason to using (define ...)). In this context, yes, I object to using "define", because the semantics of internal definitions are defined in terms of an equivalent (letrec ...) form. The "fancy" gimmick is that letrec views all its bindings as occurring simultaneously, so strains a natural, linear understanding of "no use before declaration". But none of this appears to have any relevance to Python, so I'm happiest _here_ just calling that "fancy" and ignoring the details. Ditto "top level" definitions, which have unique rules of their own. From anthony at interlink.com.au Sat Jul 1 09:57:59 2006 From: anthony at interlink.com.au (Anthony Baxter) Date: Sat, 1 Jul 2006 17:57:59 +1000 Subject: [Python-Dev] ImportWarning flood In-Reply-To: References: <44A5D5DA.2060601@hathawaymix.org> Message-ID: <200607011758.03756.anthony@interlink.com.au> On Saturday 01 July 2006 12:55, Guido van Rossum wrote: > It's up to the release manager now to decide whether the pitchforks > at Google or the pitchforks in the larger Python community are > sharper. 
;-)

At this point, I think removing the warning code is the prudent
course. If someone wanted to find an easy and safe way to make it
only be triggered when the import fails, it could stay in. I'm not
convinced that _anything_ in import.c is easy and safe.

Anthony
--
Anthony Baxter
It's never too late to have a happy childhood.

From ncoghlan at gmail.com  Sat Jul  1 10:01:29 2006
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Sat, 01 Jul 2006 18:01:29 +1000
Subject: [Python-Dev] ImportWarning flood
In-Reply-To:
References: <20060630194331.29014.836424634.divmod.quotient.16780@ohm> <44A5D5DA.2060601@hathawaymix.org>
Message-ID: <44A62BD9.7000906@gmail.com>

Guido van Rossum wrote:
> It's up to the release manager now to decide whether the pitchforks at
> Google or the pitchforks in the larger Python community are sharper.
> ;-)
>
> --Guido (ducks)

I vaguely recall one of the reasons we went with the warning approach was to
find out whether or not dropping __init__.py would cause serious problems - I
think we have our answer to that question now :)

How does this sound for a way forward?:

2.5b2:
   - ignore ImportWarning by default (like PendingDeprecationWarning)
   - include in What's New instructions to enable it via the command line
     or Python code

2.6:
   - only show ImportWarning if the import ultimately fails
   - enable ImportWarning by default
   - consider allowing a .py extension on a directory name as an alternative
     to an __init__.py file.

Google could then change their sitecustomize.py to enable the warning by
default when they roll out 2.5 :)

Cheers,
Nick.
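
As a rough sketch of what "enable it via the command line or Python code" could look like, the standard `warnings` module can switch the category back on. The warning text below is a placeholder rather than the message import.c actually emits, and the snippet assumes a current Python 3 interpreter:

```python
import warnings

# Ask for ImportWarning to be shown again, once per location.
# On the command line the same filter can be requested with
#   python -W default::ImportWarning script.py
warnings.simplefilter("default", ImportWarning)

# Demonstrate that the category is now delivered, using a
# placeholder message in place of a real import failure.
with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always", ImportWarning)
    warnings.warn("placeholder: directory skipped during import", ImportWarning)

for entry in caught:
    print(entry.category.__name__, entry.message)
```

A site-wide hook such as sitecustomize.py could carry the first call, which is what the last point of the proposal alludes to.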
--
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia
---------------------------------------------------------------
             http://www.boredomandlaziness.org

From anthony at interlink.com.au  Sat Jul  1 10:02:21 2006
From: anthony at interlink.com.au (Anthony Baxter)
Date: Sat, 1 Jul 2006 18:02:21 +1000
Subject: [Python-Dev] 2.5 and beyond
In-Reply-To: <44A57954.7030905@v.loewis.de>
References: <86FF939A-5D9F-42C0-B633-95837FD7C991@fuhm.net> <44A57954.7030905@v.loewis.de>
Message-ID: <200607011802.23996.anthony@interlink.com.au>

On Saturday 01 July 2006 05:19, Martin v. Löwis wrote:
> James Y Knight wrote:
> > I just submitted http://python.org/sf/1515169 for the
> > ImportWarning issue previously discussed here. IMO it's
> > important.
>
> At the moment (i.e. without an acceptable alternative
> implementation) it's primarily a policy issue. There really isn't
> any bug here; (to speak with Microsoft's words): This behavior is
> by design.
>
> Only the release manager or the BDFL could revert the feature, and
> Guido already stated that the warning stays until Python 3, and
> probably even after that. I personally believe the only chance to
> get this changed now is a well-designed alternative implementation
> (although this is no promise that such an alternative would
> actually be accepted).

Given the number of people and ways that this can emit a spurious
warning, I think it should be reverted for 2.5. At _best_ we could
maybe have a new -W switch to make it be generated, but this should
be off by default.

Anthony
--
Anthony Baxter
It's never too late to have a happy childhood.

From tim.peters at gmail.com  Sat Jul  1 10:08:39 2006
From: tim.peters at gmail.com (Tim Peters)
Date: Sat, 1 Jul 2006 04:08:39 -0400
Subject: [Python-Dev] 2.5 and beyond
In-Reply-To: <44A61301.2070805@canterbury.ac.nz>
References: <5.1.1.6.0.20060630190709.01ef5858@sparrow.telecommunity.com> <001201c69c9c$b6531fd0$6402a8c0@arkdesktop> <1f7befae0606301639u6f8c968cr29afce0ee7fbb43b@mail.gmail.com> <020c01c69ca1$754c7310$d1b12997@bagio> <44A61301.2070805@canterbury.ac.nz>
Message-ID: <1f7befae0607010108hb99c574l9e0d7e3cbf378355@mail.gmail.com>

[Giovanni Bajo]
>> >>> a = []
>> >>> for i in range(10):
>>
>> ...     a.append(lambda: i)
>> ...
>>
>> >>> print [x() for x in a]
>>
>> [9, 9, 9, 9, 9, 9, 9, 9, 9, 9]
>>
>> This subtle semantic of lambda is quite confusing, and still forces people to
>> use the "i=i" trick.

[Greg Ewing]
> This has *nothing* to do with the semantics of lambda!
> It's because Python's for-loop doesn't put its control
> variable in a new scope, the way Scheme's equivalent
> construct does.

I don't think I follow that. Scheme has no loops in Python's sense --
things like "do" are shorthand for expressing stylized recursion,
where each conceptual iteration gets a fresh set of "loop variables".
When people talk about giving a Python for-loop variable its own scope,
they generally don't mean a new scope on _each iteration_, they just
mean that, e.g.,

    i = 5
    for i in range(10):
        # do stuff
    print i

prints 5 instead of 9, about the same as creating a nested block with
its own autos in C. The Scheme way is more like:

    i = 5
    def step(i):
        # do stuff
        if i < 9:
            step(i+1)
    step(0)
    print i

except with tail-recursion elimination. That also prints 5, but does
a hell of a lot more than _just_ arrange for that.

> *That's* what needs to be addressed to fix this problem.
> I've made a suggestion about that before, but Guido
> rejected it, so I won't repeat it here.

Don't recall what that was, but creating a new scope on each iteration
sounds hard to explain in Python.
If Giovanni wants the Scheme way ;-), it's available:

"""
a = []
def step(i):
    a.append(lambda: i)
    if i < 9:
        step(i+1)
step(0)
print [x() for x in a]
"""

prints [0, 1, 2, 3, 4, 5, 6, 7, 8, 9], although it's more sanely
written in Python with a loop:

"""
def make_lambda(i):
    return lambda: i
a = []
for i in range(10):
    a.append(make_lambda(i))
print [x() for x in a]
"""

Abusing the default-argument machinery to capture current bindings is
never necessary, and _is_ abuse. Of course I do it too -- but rarely
:-)

From ncoghlan at gmail.com  Sat Jul  1 10:35:02 2006
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Sat, 01 Jul 2006 18:35:02 +1000
Subject: [Python-Dev] 2.5 and beyond
In-Reply-To: <020c01c69ca1$754c7310$d1b12997@bagio>
References: <5.1.1.6.0.20060630190709.01ef5858@sparrow.telecommunity.com><001201c69c9c$b6531fd0$6402a8c0@arkdesktop> <1f7befae0606301639u6f8c968cr29afce0ee7fbb43b@mail.gmail.com> <020c01c69ca1$754c7310$d1b12997@bagio>
Message-ID: <44A633B6.3020403@gmail.com>

Giovanni Bajo wrote:
> Yes but:
>
>>>> a = []
>>>> for i in range(10):
> ...     a.append(lambda: i)
> ...
>>>> print [x() for x in a]
> [9, 9, 9, 9, 9, 9, 9, 9, 9, 9]
>
> This subtle semantic of lambda is quite confusing, and still forces people to
> use the "i=i" trick.

If you'd like each function instance to have a separate closure scope, then
*give* each function a separate closure scope, instead of making them all
share the same one the way you have above:

 >>> def make_f(i):
 ...     def f():
 ...         return i
 ...     return f
 ...
 >>> a = []
 >>> for i in range(10):
 ...     a.append(make_f(i))
 ...
 >>> print [x() for x in a]
 [0, 1, 2, 3, 4, 5, 6, 7, 8, 9]

Cheers,
Nick.
-- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- http://www.boredomandlaziness.org From ncoghlan at gmail.com Sat Jul 1 10:37:37 2006 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sat, 01 Jul 2006 18:37:37 +1000 Subject: [Python-Dev] 2.5 and beyond In-Reply-To: <200607011802.23996.anthony@interlink.com.au> References: <86FF939A-5D9F-42C0-B633-95837FD7C991@fuhm.net> <44A57954.7030905@v.loewis.de> <200607011802.23996.anthony@interlink.com.au> Message-ID: <44A63451.6010909@gmail.com> Anthony Baxter wrote: > given the number of people and ways that this can emit a spurious > warning, I think it should be reverted for 2.5. At _best_ we could > maybe have a new -W switch to make it be generated, but this should > be off by default. Last line of warnings.py Copy, paste, s/PendingDeprecationWarning/ImportWarning -Wd to enable it again :) Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- http://www.boredomandlaziness.org From arigo at tunes.org Sat Jul 1 10:49:00 2006 From: arigo at tunes.org (Armin Rigo) Date: Sat, 1 Jul 2006 10:49:00 +0200 Subject: [Python-Dev] Cleanup of test harness for Python In-Reply-To: <4dab5f760606300705l41c208c8tfb83f09f74badf2e@mail.gmail.com> References: <4dab5f760606300705l41c208c8tfb83f09f74badf2e@mail.gmail.com> Message-ID: <20060701084900.GB17748@code0.codespeak.net> Hi all, On Fri, Jun 30, 2006 at 10:05:14AM -0400, Frank Wierzbicki wrote: > some checks for CPython internal tests that should be excluded from > Jython I know Frank already knows about this, but I take the occasion to remind us that http://codespeak.net/svn/pypy/dist/lib-python/modified-2.4.1/test already shows which tests we had to modify for PyPy to make them less implementation-detail-dependent, and which changes were made. 
A possible first step here would be to find a consistent way to check, in the test, which implementation we are running on top of, so that we can (re-)write the tests accordingly. A bientot, Armin From arigo at tunes.org Sat Jul 1 10:52:21 2006 From: arigo at tunes.org (Armin Rigo) Date: Sat, 1 Jul 2006 10:52:21 +0200 Subject: [Python-Dev] sys.settrace() in Python 2.3 vs. 2.4 In-Reply-To: <20060630131620.10CE.JCARLSON@uci.edu> References: <20060630094140.10C8.JCARLSON@uci.edu> <44A582AE.7010902@v.loewis.de> <20060630131620.10CE.JCARLSON@uci.edu> Message-ID: <20060701085221.GC17748@code0.codespeak.net> Hi Josiah, On Fri, Jun 30, 2006 at 01:27:24PM -0700, Josiah Carlson wrote: > I'll just have to gracefully degrade functionality for older Pythons. More precisely, the bug shows up because in while 1: pass the current line remains on the 'pass' forever. It works for a loop like that: while 1: sys sys but it's admittedly quite obscure. Armin From jcarlson at uci.edu Sat Jul 1 11:01:12 2006 From: jcarlson at uci.edu (Josiah Carlson) Date: Sat, 01 Jul 2006 02:01:12 -0700 Subject: [Python-Dev] Lexical scoping in Python 3k In-Reply-To: <44A612E8.8050205@canterbury.ac.nz> References: <20060630163625.10D1.JCARLSON@uci.edu> <44A612E8.8050205@canterbury.ac.nz> Message-ID: <20060701013710.10D7.JCARLSON@uci.edu> Greg Ewing wrote: > > Josiah Carlson wrote: > > > What I asked before, and what I'd like to ask again, is if there are any > > _nontrivial uses_ of lexically nested scopes which are made cumbersome > > by our inability to write to parent scopes. > > The trouble with taking that position is that the very > cases which would benefit are very *simple* ones, where > it would be cumbersome to refactor it to use a class, > or mutable object in the outer scope, etc. So you've > effectively set up your acceptance criteria to be > unmeetable. If the only code that benefits from such changes are "very *simple*", then I think that says something about its necessity. 
That is, if anything more complicated than those that are "very *simple*"
generally don't benefit, then I don't believe that such a modification
would be beneficial to the language overall.

Further, a simple namespace factory can handle much of the current
issues, without needing to create or change keywords.

    def namespace(**kwds):
        class namespace(object):
            __slots__ = kwds.keys()
            def __init__(self):
                for i,j in kwds.iteritems():
                    setattr(self, i,j)
        return namespace()

    def trivial_counter(start):
        ns = namespace(current=start-1)
        def next():
            ns.current += 1
            return ns.current
        return next

Maybe a variant of the above namespace factory should make it into the
collections module.

> > If there aren't, then I'm
> > going to again have to argue against new syntax, keywords, and their use.
>
> There's one very simple way we could do this in Py3k
> without requiring any new syntax or keywords: just
> redefine the meaning of "global" to mean "not local".

I would probably be a solid -0 on such a proposal; I still don't think
it's really necessary, but I've never used (or really seen) global more
than one level deep, so would guess its impact would be low.

 - Josiah

From g.brandl at gmx.net  Sat Jul  1 11:00:13 2006
From: g.brandl at gmx.net (Georg Brandl)
Date: Sat, 01 Jul 2006 11:00:13 +0200
Subject: [Python-Dev] Bug in stringobject?
In-Reply-To:
References:
Message-ID:

Georg Brandl wrote:
> In string_replace, there is
>
>       if (PyString_Check(from)) {
>         /* Can this be made a '!check' after the Unicode check? */
>       }
> #ifdef Py_USING_UNICODE
>       if (PyUnicode_Check(from))
>               return PyUnicode_Replace((PyObject *)self,
>                                        from, to, count);
> #endif
>       else if (PyObject_AsCharBuffer(from, &tmp_s, &tmp_len))
>               return NULL;
>
> [the same check with "to"]
>
>       return (PyObject *)replace((PyStringObject *) self,
>                                  (PyStringObject *) from,
>                                  (PyStringObject *) to, count);
>
>
> Can this be correct if from or to isn't a string object, but a
> char buffer compatible object?
May I note that this is still unresolved? I can submit a bug report and add it to PEP 356, too... Georg From fredrik at pythonware.com Sat Jul 1 11:29:56 2006 From: fredrik at pythonware.com (Fredrik Lundh) Date: Sat, 01 Jul 2006 11:29:56 +0200 Subject: [Python-Dev] Bug in stringobject? In-Reply-To: References: Message-ID: Georg Brandl wrote: >> Can this be correct if from or to isn't a string object, but a >> char buffer compatible object? > > May I note that this is still unresolved? I can submit a bug report > and add it to PEP 356, too... it's already on my todo list, but that list is full of stuff, so having it on the official todo list is probably a good idea. if you do add it, assign it to me. From g.brandl at gmx.net Sat Jul 1 11:41:53 2006 From: g.brandl at gmx.net (Georg Brandl) Date: Sat, 01 Jul 2006 11:41:53 +0200 Subject: [Python-Dev] Bug in stringobject? In-Reply-To: References: Message-ID: Fredrik Lundh wrote: > Georg Brandl wrote: > >>> Can this be correct if from or to isn't a string object, but a >>> char buffer compatible object? >> >> May I note that this is still unresolved? I can submit a bug report >> and add it to PEP 356, too... > > it's already on my todo list, but that list is full of stuff, so having > it on the official todo list is probably a good idea. if you do add it, > assign it to me. Done. #1515471. Georg From andrewdalke at gmail.com Sat Jul 1 11:59:00 2006 From: andrewdalke at gmail.com (Andrew Dalke) Date: Sat, 1 Jul 2006 11:59:00 +0200 Subject: [Python-Dev] PEP 328 and PEP 338, redux In-Reply-To: <018e01c69b77$a9172e40$d503030a@trilan> References: <44A11EA1.1000605@iinet.net.au> <5.1.1.6.0.20060627120926.02021fe0@sparrow.telecommunity.com> <033001c69a07$44140890$d503030a@trilan> <44A1CE4A.2000900@canterbury.ac.nz> <039201c69ad7$4dc91df0$d503030a@trilan> <44A3B648.2060104@gmail.com> <018e01c69b77$a9172e40$d503030a@trilan> Message-ID: Giovanni Bajo wrote: > Real-world usage case for import __main__? 
Otherwise, I say screw it :)

I have used it as a workaround for timeit.py's requirement
that I pass it strings instead of functions.

>>> def compute():
...     1+1
...
>>> import timeit
>>> t = timeit.Timer("__main__.compute()", "import __main__")
>>> t.timeit()
1.9755008220672607
>>>

You can argue (as many have) that timeit.py needs a better
API for this. That's a different world than the existing
real one.

                                Andrew
                                dalke at dalkescientific.com

From tomerfiliba at gmail.com  Sat Jul  1 15:49:46 2006
From: tomerfiliba at gmail.com (tomer filiba)
Date: Sat, 1 Jul 2006 15:49:46 +0200
Subject: [Python-Dev] weakattr
Message-ID: <1d85506f0607010649y65c9fb24mbc5c636360970e52@mail.gmail.com>

weakattr (weak attributes) are attributes that are weakly referenced
by their containing object. they are very useful for cyclic references --
an object that holds a reference to itself.

when a cyclic reference is found by the GC, the memory may be
freed, but __del__ is not called, because it's impossible to tell which
__del__ to call first. this is an awkward asymmetry with no clean
solution: most such objects provide a "close" or "dispose" method
that must be called explicitly.

weakattrs solve this problem by providing a "magical" attribute
that "disappears" when the attribute is no longer strongly-referenced.

you can find the code, as well as some examples, on this link
http://sebulba.wikispaces.com/recipe+weakattr

since the stdlib already comes with weakref.py, which provides
higher level concepts over the builtin _weakref module, i'd like to
make weakattr a part of it.

it's only ~20 lines of code, and imho saves the trouble of explicitly
releasing the resource of un__del__able objects.

i think it's useful. here's a snippet:

>>> from weakref import weakattr
>>>
>>> class blah(object):
...     yada = weakattr()
...
>>> o1 = blah()
>>> o2 = blah()
>>> o1.yada = o2
>>> o2.yada = o1

o1.yada is a *weakref* to o2, so that when o2 is no longer
strongly-referenced...
>>> del o2
o1.yada "magically" disappears as well.
>>> o1.yada
... AttributeError(...)

since the programmer explicitly defined "yada" as a weakattr, he/she
knows it might "disappear". it might look awkward at first, but that's
exactly the *desired* behavior (otherwise we'd just use the regular
strong attributes).

another thing to note is that weakattrs are likely to be gone only
when the object's __del__ is already invoked, so the only code that
needs to take such precautions is __del__ (which already has some
constraints)

for example:

>>> class blah(object):
...     me = weakattr()
...
...     def __init__(self):
...         self.me = self
...
...     def something(self):
...         # we can rest assured me exists at this stage
...         print self.me
...
...     def __del__(self):
...         # by the time __del__ is called, "me" is removed
...         print "me exists?", hasattr(self, "me")
...
>>> b = blah()
>>> b.me
<__main__.blah object at 0x00C0EC10>
>>> b.something()
<__main__.blah object at 0x00C0EC10>
>>> del b
>>> import gc
>>> gc.collect()
me exists? False
0
>>>

-tomer

From nas at arctrix.com  Sat Jul  1 17:41:29 2006
From: nas at arctrix.com (Neil Schemenauer)
Date: Sat, 1 Jul 2006 15:41:29 +0000 (UTC)
Subject: [Python-Dev] Lexical scoping in Python 3k
References: <001201c69c65$b3869750$6402a8c0@arkdesktop>
Message-ID:

Ka-Ping Yee wrote:
> Most other languages that support lexical scoping (including Scheme,
> JavaScript, Ruby, Perl, E, Java, Smalltalk) provide a uniform way
> to read and write to scopes at all levels. This is done by letting
> programmers specify the scope in which they want a variable bound
> (usually with a keyword like "var" in JavaScript, "my" in Perl, or
> "define" in E).

That's not the Python way, IMO.
I think the right way (assuming we actually want to allow it) is to introduce a pure assignment statement in addition to the assignment/declaration statement that we already have. For example: a = 1 def f(): b = 2 a := 2 def g(): b := 3 print a, b, c g() f() would print "2 3 4". The := would assign but not declare a variable in the current scope. Neil From ronaldoussoren at mac.com Sat Jul 1 18:24:15 2006 From: ronaldoussoren at mac.com (Ronald Oussoren) Date: Sat, 1 Jul 2006 18:24:15 +0200 Subject: [Python-Dev] zlib module build failure on Mac OSX 10.4.7 In-Reply-To: <17573.60593.759293.549583@montanaro.dyndns.org> References: <17573.60593.759293.549583@montanaro.dyndns.org> Message-ID: <130298B1-7336-480A-A90F-97B68327BE4A@mac.com> On Jul 1, 2006, at 5:32 AM, skip at pobox.com wrote: > Just upgraded my Mac to OSX 10.4.7 yesterday. svn up'd Python > trunk, then > "make clean ; configure ; make" and I see that building the zlib > module > fails: > > gcc -fno-strict-aliasing -Wno-long-double -no-cpp-precomp -mno- > fused-madd -DNDEBUG -g -O3 -Wall -Wstrict-prototypes -I. -I/Users/ > skip/src/python-svn/trunk/./Include -I/Users/skip/src/python-svn/ > trunk/./Mac/Include -I/Users/skip/local/include -I../Include -I. 
-I/ > usr/local/include -I/Users/skip/src/python-svn/trunk/Include -I/ > Users/skip/src/python-svn/trunk/build -c /Users/skip/src/python-svn/ > trunk/Modules/zlibmodule.c -o build/temp.macosx-10.3-ppc-2.5/Users/ > skip/src/python-svn/trunk/Modules/zlibmodule.o > /Users/skip/src/python-svn/trunk/Modules/zlibmodule.c: In > function 'PyZlib_uncopy': > /Users/skip/src/python-svn/trunk/Modules/zlibmodule.c:724: > warning: implicit declaration of function 'inflateCopy' > gcc -bundle -undefined dynamic_lookup build/temp.macosx-10.3- > ppc-2.5/Users/skip/src/python-svn/trunk/Modules/zlibmodule.o -L/ > Users/skip/local/lib -L/usr/local/lib -lz -o build/lib.macosx-10.3- > ppc-2.5/zlib.so -Wl,-search_paths_first > *** WARNING: renaming "zlib" since importing it failed: dlopen > (build/lib.macosx-10.3-ppc-2.5/zlib.so, 2): Symbol not found: > _inflateCopy > Referenced from: build/lib.macosx-10.3-ppc-2.5/zlib.so > Expected in: dynamic lookup > > Anybody else seen this? I checked the buildbot trunk osx 10.4. It > seemed > to have no trouble. And what's with the "10.3" bit in the > directory names? Are you sure you're building on a 10.4 box? Both the macosx-10.3 thingy and lack of inflateCopy seem to indicate that you're running on 10.3. Ronald > > Skip > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > http://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: http://mail.python.org/mailman/options/python-dev/ > ronaldoussoren%40mac.com -------------- next part -------------- A non-text attachment was scrubbed... Name: smime.p7s Type: application/pkcs7-signature Size: 2157 bytes Desc: not available Url : http://mail.python.org/pipermail/python-dev/attachments/20060701/f51f75e9/attachment.bin From almann.goo at gmail.com Sat Jul 1 18:42:27 2006 From: almann.goo at gmail.com (Almann T. 
Goo)
Date: Sat, 1 Jul 2006 12:42:27 -0400
Subject: [Python-Dev] Lexical scoping in Python 3k
In-Reply-To: <20060701013710.10D7.JCARLSON@uci.edu>
References: <20060630163625.10D1.JCARLSON@uci.edu>
 <44A612E8.8050205@canterbury.ac.nz>
 <20060701013710.10D7.JCARLSON@uci.edu>
Message-ID: <7e9b97090607010942r599bfed9u4bc02a564becaf2b@mail.gmail.com>

On 7/1/06, Josiah Carlson wrote:
> > There's one very simple way we could do this in Py3k
> > without requiring any new syntax or keywords: just
> > redefine the meaning of "global" to mean "not local".
>
> I would probably be a solid -0 on such a proposal; I still don't think
> it's really necessary, but I've never used (or really seen) global more
> than one level deep, so would guess its impact would be low.
>

This has been discussed at length in the following thread that I started
in February and at least one time before that.

http://mail.python.org/pipermail/python-dev/2006-February/061568.html

I think using the "global" keyword is probably the lowest impact form and
has the least amount of backwards incompatibility.  Below is the part of
the last thread in which I talked about changing the meaning of "global."

http://mail.python.org/pipermail/python-dev/2006-February/061852.html

Having the "global" keyword semantics changed to be "lexically global"
would break in the cases that "global" is used on a name within a
nested scope that has an enclosing scope with the same name.  I would
suppose that actual instances in real code of this would be rare.

Consider:

>>> x = 1
>>> def f() :
...     x = 2
...     def inner() :
...         global x
...         print x
...     inner()
...
>>> f()
1

Under the proposed rules:

>>> f()
2

PEP 227 also had backwards incompatibilities that were similar and I
suggest handling them the same way by issuing a warning in these cases
when the new semantics are not being used (i.e. no "from __future__").
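
The current behaviour in the "Consider:" example can also be checked
outside the interactive prompt. What follows is only a minimal sketch of
today's semantics, not of the proposed change; the name "result" is made
up for illustration, and inner() returns the value instead of printing it.

```python
x = 1

def f():
    x = 2              # local to f; it hides the module-level x inside f

    def inner():
        global x       # under the current rules this is the module-level x
        return x

    return inner()

result = f()
print(result)          # 1 under the current semantics
```

Under the proposed "lexically global" rule the same call would see the
x = 2 bound in f() instead.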
Most people probably think that this is a low impact "wart" on the Python language that not really worth fixing as there are workarounds (i.e. mutable objects) or other ways to express (i.e. use classes) such things, but it does trip people up from time to time as warts typically do--I guess that's why this gets brought up now and again. Best regards, Almann -- Almann T. Goo almann.goo at gmail.com -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.python.org/pipermail/python-dev/attachments/20060701/7f3f113c/attachment.html From skip at pobox.com Sat Jul 1 18:57:34 2006 From: skip at pobox.com (skip at pobox.com) Date: Sat, 1 Jul 2006 11:57:34 -0500 Subject: [Python-Dev] zlib module build failure on Mac OSX 10.4.7 In-Reply-To: <130298B1-7336-480A-A90F-97B68327BE4A@mac.com> References: <17573.60593.759293.549583@montanaro.dyndns.org> <130298B1-7336-480A-A90F-97B68327BE4A@mac.com> Message-ID: <17574.43390.560231.425494@montanaro.dyndns.org> Ronald> Are you sure you're building on a 10.4 box? Both the Ronald> macosx-10.3 thingy and lack of inflateCopy seem to indicate that Ronald> you're running on 10.3. Well, yeah, pretty sure. Let's see. The box with the disk says "Mac OS X Tiger - Version 10.4" on the spine. The "About This Mac" popup says "10.4.7". It used to run 10.3 though. Is there some possibility the update from 10.3 to 10.4 had problems? Note that the compile log on the buildbot 10.4 box also has "10.3" in its directory names. If I remember correctly, it came from Apple with 10.4 installed. Skip From sergey at optimaltec.com Sat Jul 1 18:52:19 2006 From: sergey at optimaltec.com (Sergey A. 
Lipnevich)
Date: Sat, 01 Jul 2006 12:52:19 -0400
Subject: [Python-Dev] ImportWarning flood
In-Reply-To: <44A282E3.1000009@v.loewis.de>
References: <20060628122558.19551.qmail@web31505.mail.mud.yahoo.com>
 <44A282E3.1000009@v.loewis.de>
Message-ID:

All,

I tried to implement Jean-Paul Calderone's idea for the following patch,
plagiarizing Ralf W. Grosse-Kunstleve's error text. It delays import
warning until end of search for modules, but remembers how many
potential modules (candidates without __init__.py) it didn't import. I
didn't really try to analyze any conditions, instead I simply assumed
that wherever ImportWarning would be issued, we have a suitable
candidate, and saved it on the stack. If nothing is found, Python emits
ImportWarning right before ImportError, and explains what happened.
Please let me know if this would work and if anything needs to be done
for this patch to be accepted.

Thank you!

Sergey.
-------------- next part --------------
A non-text attachment was scrubbed...
Name: import_warning.diff
Type: text/x-patch
Size: 1867 bytes
Desc: not available
Url : http://mail.python.org/pipermail/python-dev/attachments/20060701/94ab9c43/attachment-0001.bin

From aahz at pythoncraft.com Sat Jul 1 19:31:00 2006
From: aahz at pythoncraft.com (Aahz)
Date: Sat, 1 Jul 2006 10:31:00 -0700
Subject: [Python-Dev] ImportWarning flood
In-Reply-To:
References: <20060628122558.19551.qmail@web31505.mail.mud.yahoo.com>
 <44A282E3.1000009@v.loewis.de>
Message-ID: <20060701173100.GA22185@panix.com>

On Sat, Jul 01, 2006, Sergey A. Lipnevich wrote:
>
> Please let me know if this would work and if anything needs to be done
> for this patch to be accepted.

The first thing you need to do for ANY patch to be considered is to post
it to SourceForge (or at least post to python-dev explaining that you'll
post to SF as soon as it comes back).
-- Aahz (aahz at pythoncraft.com) <*> http://www.pythoncraft.com/ "I saw `cout' being shifted "Hello world" times to the left and stopped right there." --Steve Gonedes From martin at v.loewis.de Sat Jul 1 19:32:29 2006 From: martin at v.loewis.de (=?ISO-8859-1?Q?=22Martin_v=2E_L=F6wis=22?=) Date: Sat, 01 Jul 2006 19:32:29 +0200 Subject: [Python-Dev] ImportWarning flood In-Reply-To: References: <20060628122558.19551.qmail@web31505.mail.mud.yahoo.com> <44A282E3.1000009@v.loewis.de> Message-ID: <44A6B1AD.6020204@v.loewis.de> Sergey A. Lipnevich wrote: > I tried to implement Jean-Paul Calderone's idea for the following patch, > plagiarizing Ralf W. Grosse-Kunstleve's error text. It delays import > warning until end of search for modules, but remembers how many > potential modules (candidates without __init__.py) it didn't import. I > didn't really try to analyze any conditions, instead I simply assumed > that wherever ImportWarning would be issued, we have a suitable > candidate, and saved it on the stack. If nothing is found, Python emits > ImportWarning right before ImportError, and explains what happened. > Please let me know if this would work and if anything needs to be done > for this patch to be accepted. Please notice that there is also python.org/sf/1515361 I had no time to compare this with your patch, yet. Regards, Martin From ronaldoussoren at mac.com Sat Jul 1 19:45:17 2006 From: ronaldoussoren at mac.com (Ronald Oussoren) Date: Sat, 1 Jul 2006 19:45:17 +0200 Subject: [Python-Dev] zlib module build failure on Mac OSX 10.4.7 In-Reply-To: <17574.43390.560231.425494@montanaro.dyndns.org> References: <17573.60593.759293.549583@montanaro.dyndns.org> <130298B1-7336-480A-A90F-97B68327BE4A@mac.com> <17574.43390.560231.425494@montanaro.dyndns.org> Message-ID: <315DECAC-D582-4692-A0CA-13D3C7BF992C@mac.com> On Jul 1, 2006, at 6:57 PM, skip at pobox.com wrote: > > Ronald> Are you sure you're building on a 10.4 box? 
Both the > Ronald> macosx-10.3 thingy and lack of inflateCopy seem to > indicate that > Ronald> you're running on 10.3. > > Well, yeah, pretty sure. Let's see. The box with the disk says > "Mac OS X > Tiger - Version 10.4" on the spine. The "About This Mac" popup says > "10.4.7". That gets the easy solution out of the way ;-) > It used to run 10.3 though. Is there some possibility the update > from 10.3 to 10.4 had problems? > > Note that the compile log on the buildbot 10.4 box also has "10.3" > in its > directory names. If I remember correctly, it came from Apple with > 10.4 > installed. /me slaps head. Having 10.3 in the directory names is intentional, the version in the directory name is the value of MACOSX_DEPLOYMENT_TARGET, with is defaulted to 10.3 in the configure script. What I don't understand yet is why your copy of libz doesn't have inflateCopy. What does /usr/lib/libz.dylib point to on your system? On my 10.4 box it is a symlink that points to libz.1.2.3.dylib and there is an older version of libz (libz.1.1.3.dylib) in /usr/lib as well. Ronald -------------- next part -------------- A non-text attachment was scrubbed... Name: smime.p7s Type: application/pkcs7-signature Size: 2157 bytes Desc: not available Url : http://mail.python.org/pipermail/python-dev/attachments/20060701/d2da4e4b/attachment.bin From sergey at optimaltec.com Sat Jul 1 20:10:42 2006 From: sergey at optimaltec.com (Sergey A. Lipnevich) Date: Sat, 01 Jul 2006 14:10:42 -0400 Subject: [Python-Dev] ImportWarning flood In-Reply-To: <44A6B1AD.6020204@v.loewis.de> References: <20060628122558.19551.qmail@web31505.mail.mud.yahoo.com> <44A282E3.1000009@v.loewis.de> <44A6B1AD.6020204@v.loewis.de> Message-ID: Martin v. L?wis wrote: > Sergey A. Lipnevich wrote: >> I tried to implement Jean-Paul Calderone's idea for the following patch, >> plagiarizing Ralf W. Grosse-Kunstleve's error text. It delays import ... 
> Please notice that there is also python.org/sf/1515361 > > I had no time to compare this with your patch, yet. Thanks! I made python.org/sf/1515609. The difference (also documented in the description) is that my patch saves memory and some CPU cycles by not trying to collect all directories Python did not import because of missing __init__.py. It only reports how many such directories there are and what is the first one. Sergey. From s.percivall at chello.se Sat Jul 1 20:27:47 2006 From: s.percivall at chello.se (Simon Percivall) Date: Sat, 1 Jul 2006 20:27:47 +0200 Subject: [Python-Dev] Lexical scoping in Python 3k In-Reply-To: <7e9b97090607010942r599bfed9u4bc02a564becaf2b@mail.gmail.com> References: <20060630163625.10D1.JCARLSON@uci.edu> <44A612E8.8050205@canterbury.ac.nz> <20060701013710.10D7.JCARLSON@uci.edu> <7e9b97090607010942r599bfed9u4bc02a564becaf2b@mail.gmail.com> Message-ID: <8B4078DF-F730-4E46-9F2C-04807B178471@chello.se> What about doing something similar to how import was changed? .a = 5 # this scope (self might be too magical ..a = 3 # up one scope ...a # up three Of course, this looks ... perhaps a bit strange. Also, counting is a bother. //Simon From martin at v.loewis.de Sat Jul 1 20:46:44 2006 From: martin at v.loewis.de (=?ISO-8859-1?Q?=22Martin_v=2E_L=F6wis=22?=) Date: Sat, 01 Jul 2006 20:46:44 +0200 Subject: [Python-Dev] zlib module build failure on Mac OSX 10.4.7 In-Reply-To: <315DECAC-D582-4692-A0CA-13D3C7BF992C@mac.com> References: <17573.60593.759293.549583@montanaro.dyndns.org> <130298B1-7336-480A-A90F-97B68327BE4A@mac.com> <17574.43390.560231.425494@montanaro.dyndns.org> <315DECAC-D582-4692-A0CA-13D3C7BF992C@mac.com> Message-ID: <44A6C314.2000709@v.loewis.de> Ronald Oussoren wrote: > What I don't understand yet is why your copy of libz doesn't have > inflateCopy. What I don't understand is that configure does not detect that. 
Regards,
Martin

From ark at acm.org Sat Jul 1 21:03:19 2006
From: ark at acm.org (Andrew Koenig)
Date: Sat, 1 Jul 2006 15:03:19 -0400
Subject: [Python-Dev] 2.5 and beyond
In-Reply-To: <44A61301.2070805@canterbury.ac.nz>
Message-ID: <000601c69d41$073dbf00$6402a8c0@arkdesktop>

> a = []
> for i in range(10):
>     a.append(lambda: i)
> print [x() for x in a]
>
> [9, 9, 9, 9, 9, 9, 9, 9, 9, 9]

Aha! -- Thank you for jogging my memory.

You seem to be right -- the problem is not that Python is lexically
scoped, but that when you define a variable with =, it leaks out into the
surrounding function scope.

Here's an example:

    if True:
        y = 123
    print y

It may be obvious that this should print 123, but that's only because =
combines properties of assignment and definition.  In particular, if we
were to write

    y = 42
    if True:
        y = 123
    print y

it would be very surprising if this example were to print anything but
123.

Here is a corresponding fragment in C++:

    int y = 42;
    if (true) {
        y = 123;
    }
    std::cout << y << "\n";

The "int" in the first line means that the variable y is being defined.
Its lack in the third line means that y refers to a variable defined in an
outer scope.  So both instances of y here refer to the same variable, as
they do in Python.

But because definition and assignment are separated in C++, we can also
write

    int y = 42;
    if (true) {
        int y = 123;
    }
    std::cout << y << "\n";

and the fragment will print 42.  In this example, there are two distinct
variables, both named y.

So the problem, as I see it, is indeed that in Python there are suites
that look to me as if they should define scopes, but don't.  Indeed, if I
write

    if (foo):
        y = 123

I can't even determine by inspecting the program whether y is defined at
all.  I might argue that y is always defined, by virtue of appearing
before = somewhere in this scope, but the compiler tells me "name 'y' is
not defined" if I try it, so I guess that's the right way to treat it.

So here's how I understand what Greg was saying.
Suppose I write

    x = []
    for i in range(10):
        x.append(lambda:i)
    print [f() for f in x]

This example will print [9, 9, 9, 9, 9, 9, 9, 9, 9, 9], which I think is
wildly unintuitive.

My intuition in this matter is partly formed by C++, but it is also formed
by other languages going as far back as Algol 68.  That intuition says
that because the suite controlled by a "for" statement is executed any
number of times, potentially including zero, it should be considered as
its own scope, and any variables defined in that scope should stay there.

In particular, the variable "i" should be defined in the scope of the
"for", which implies that each time through the loop, the name "i" should
be (re)bound to a different object.

What surprises me even more is that if I try to define such a variable
explicitly, it still doesn't work:

    x = []
    for i in range(10):
        j = i
        x.append(lambda:j)
    print [f() for f in x]

This example still prints [9, 9, 9, 9, 9, 9, 9, 9, 9, 9].  If I understand
the reason correctly, it is because even though j is defined only in the
body of the loop, loop bodies are not scopes, so the variable's definition
is hoisted out into the surrounding function scope.

To convince myself of this behavior, I defined an extra function scope,
the purpose of which is to localize j:

    x = []
    for i in range(10):
        def foo():
            j = i
            return lambda:j
        x.append(foo())
    print [f() for f in x]

Indeed, this example prints [0, 1, 2, 3, 4, 5, 6, 7, 8, 9].  The example
also points up the fact that

    x.append(lambda:i)

and

    def foo():
        j = i
        return lambda:j
    x.append(foo())

behave differently, where my intuition (and, I suspect, many other
people's as well) would be that they would be equivalent.
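
The difference between the two forms can be reproduced in a few lines.
This is only a sketch of the behaviour described above; the names "late",
"separate" and "make_getter" are made up for illustration, and the results
are collected in lists so they can be inspected.

```python
# Closures created directly in the loop all share the one variable i.
late = []
for i in range(3):
    late.append(lambda: i)
late_results = [f() for f in late]          # every closure sees the final i

# A factory function introduces a new scope on every call, so each
# closure captures its own j.
def make_getter(value):
    j = value
    return lambda: j

separate = []
for n in range(3):
    separate.append(make_getter(n))
separate_results = [f() for f in separate]  # each closure sees its own j

print(late_results)       # [2, 2, 2]
print(separate_results)   # [0, 1, 2]
```

The loop body is not a scope, so only the function call in the second
half gives each lambda a distinct variable to refer to.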
Finally, I observe that this second example above is also equivalent to

    x.append(lambda i=i: i)

which is what explains the fairly common idiom

    x = []
    for i in range(10):
        x.append(lambda i=i:i)
    print [f() for f in x]

So maybe what I meant when I asked for lexical scopes was two things:

    1) Every indentation level should be a scope;
    2) In general, variable definitions should not leak into
       surrounding scopes.

I realize that (2) is too simplistic.  Someone who writes

    if x < 0:
        y = -x
    else:
        y = x

will expect y to be defined in the scope surrounding the "if" even if it
was not already defined there.  On the other hand, I think that the subtle
pitfalls that come from allowing "for" variables to leak into the
surrounding scopes are much harder to deal with and understand than would
be the consequences of restricting their scopes as outlined above.

From ark at acm.org Sat Jul 1 21:06:38 2006
From: ark at acm.org (Andrew Koenig)
Date: Sat, 1 Jul 2006 15:06:38 -0400
Subject: [Python-Dev] Lexical scoping in Python 3k
In-Reply-To: <8B4078DF-F730-4E46-9F2C-04807B178471@chello.se>
Message-ID: <000701c69d41$7dc7a9b0$6402a8c0@arkdesktop>

> What about doing something similar to how import was changed?
>
> .a = 5 # this scope (self might be too magical
> ..a = 3 # up one scope
> ...a # up three
>
> Of course, this looks ... perhaps a bit strange. Also, counting is a
> bother.

I'd rather see a simpler rule: = never defines a variable in a surrounding
scope.  If you want to affect the binding of such a variable, you have to
define it explicitly in the scope in which you want it.

Example:

    x = 42
    def f():
        x = 123     # rebinds x as defined above
        y = 123     # defines local variable
    f()
    print x         # prints 123
    print y         # error -- y not defined

Yes, I know that rule is too simplistic.  But I think I'd still prefer it
to the way things are now.
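
For comparison, here is a sketch of what the same example does under the
rules as they stand today, where = inside a function always creates a
local name. The variable "y_visible" is not part of the example above and
is only there so the outcome can be checked.

```python
x = 42

def f():
    x = 123   # creates a new local x; the module-level x is untouched
    y = 123   # local to f as well
    return x, y

f()
print(x)      # the module-level x is still 42

# y was only ever bound inside f, so the name does not exist out here.
try:
    y
    y_visible = True
except NameError:
    y_visible = False
print(y_visible)
```

So the only difference from the proposed rule is the first assignment:
today it shadows the outer x instead of rebinding it.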
From ronaldoussoren at mac.com Sat Jul 1 21:12:24 2006 From: ronaldoussoren at mac.com (Ronald Oussoren) Date: Sat, 1 Jul 2006 21:12:24 +0200 Subject: [Python-Dev] zlib module build failure on Mac OSX 10.4.7 In-Reply-To: <44A6C314.2000709@v.loewis.de> References: <17573.60593.759293.549583@montanaro.dyndns.org> <130298B1-7336-480A-A90F-97B68327BE4A@mac.com> <17574.43390.560231.425494@montanaro.dyndns.org> <315DECAC-D582-4692-A0CA-13D3C7BF992C@mac.com> <44A6C314.2000709@v.loewis.de> Message-ID: <0006F35E-FD5F-4BAD-A608-642039BBE7E8@mac.com> On Jul 1, 2006, at 8:46 PM, Martin v. L?wis wrote: > Ronald Oussoren wrote: >> What I don't understand yet is why your copy of libz doesn't have >> inflateCopy. > > What I don't understand is that configure does not detect that. You may be onto something there. Skip, do you have another copy of libz somewhere? Given the link line in your first message either in / usr/local/lib or /Users/skip/local/lib. And if you have, is that a static library (libz.a) instead of a dylib? As background to my question: the linker on OSX behaves slightly different than the one on most other unix-y systems. It first searches the entire linker path for shared libraries (dylibs) before looking for static libraries. I added a flag to the link flags for the zlib extension a while back that changes the search order into a more traditional one: look in every directory on the linker path for either a dylib or static library. The new flag is -Wl,- search_paths_first. If skip does indeed have libz somewhere else we'll either have to make a matching update to configure, or roll back my change. If the latter I'll have to tweak the build script for the binary installer for OSX because I want to link that using a static copy of libz for binary compatibility with OSX 10.3.9. Ronald -------------- next part -------------- A non-text attachment was scrubbed... 
Name: smime.p7s Type: application/pkcs7-signature Size: 2157 bytes Desc: not available Url : http://mail.python.org/pipermail/python-dev/attachments/20060701/e0a2a627/attachment-0001.bin From bob at redivi.com Sat Jul 1 21:19:58 2006 From: bob at redivi.com (Bob Ippolito) Date: Sat, 1 Jul 2006 12:19:58 -0700 Subject: [Python-Dev] zlib module build failure on Mac OSX 10.4.7 In-Reply-To: <315DECAC-D582-4692-A0CA-13D3C7BF992C@mac.com> References: <17573.60593.759293.549583@montanaro.dyndns.org> <130298B1-7336-480A-A90F-97B68327BE4A@mac.com> <17574.43390.560231.425494@montanaro.dyndns.org> <315DECAC-D582-4692-A0CA-13D3C7BF992C@mac.com> Message-ID: On Jul 1, 2006, at 10:45 AM, Ronald Oussoren wrote: > > On Jul 1, 2006, at 6:57 PM, skip at pobox.com wrote: > >> >> Ronald> Are you sure you're building on a 10.4 box? Both the >> Ronald> macosx-10.3 thingy and lack of inflateCopy seem to >> indicate that >> Ronald> you're running on 10.3. >> >> Well, yeah, pretty sure. Let's see. The box with the disk says >> "Mac OS X >> Tiger - Version 10.4" on the spine. The "About This Mac" popup says >> "10.4.7". > > That gets the easy solution out of the way ;-) > >> It used to run 10.3 though. Is there some possibility the update >> from 10.3 to 10.4 had problems? >> >> Note that the compile log on the buildbot 10.4 box also has "10.3" >> in its >> directory names. If I remember correctly, it came from Apple with >> 10.4 >> installed. > > /me slaps head. > > Having 10.3 in the directory names is intentional, the version in > the directory name is the value of MACOSX_DEPLOYMENT_TARGET, with > is defaulted to 10.3 in the configure script. > > What I don't understand yet is why your copy of libz doesn't have > inflateCopy. What does /usr/lib/libz.dylib point to on your system? > On my 10.4 box it is a symlink that points to libz.1.2.3.dylib and > there is an older version of libz (libz.1.1.3.dylib) in /usr/lib as > well. Maybe Skip didn't upgrade to the latest version of Xcode? 
Perhaps he's still got an old SDK? -bob From rasky at develer.com Sat Jul 1 21:24:04 2006 From: rasky at develer.com (Giovanni Bajo) Date: Sat, 1 Jul 2006 21:24:04 +0200 Subject: [Python-Dev] 2.5 and beyond References: <000601c69d41$073dbf00$6402a8c0@arkdesktop> Message-ID: <024901c69d43$ea5a6480$d503030a@trilan> Andrew Koenig wrote: > Suppose I write > > x = [] > for i in range(10): > x.append(lambda:i) > print [f() for f in x] > > This example will print [9, 9, 9, 9, 9, 9, 9, 9, 9, 9], which I think > is wildly unintuitive. That is my point: to me, it's counter-intuitive just like the infamous "except NameError, TypeError". I believe that names in lambdas/nested-functions referring to local names in the outer scope should really be bound at function definition time (much like default arguments are). > What surprises me even more is that if I try to define such a variable > explicitly, it still doesn't work: > > x = [] > for i in range(10): > j = i > x.append(lambda:j) > print [f() for f in x] > > This example still prints [9, 9, 9, 9, 9, 9, 9, 9, 9, 9]. If I > understand the reason correctly, it is because even though j is > defined only in the body of the loop, loop bodies are not scopes, so > the variable's definition is hoisted out into the surrounding > function scope. Yes. And by itself, I like this fact because it's very handy in many cases. And it's also handy that the iteration variable of the for loop is accessible after the for loop is terminated (in fact, this specific behaviour is already listed among the wont-change for Py3k). > On the other hand, I think that > the subtle pitfalls that come from allowing "for" variables to leak > into the surrounding scopes are much harder to deal with and > understand than would be the consequences of restricting their scopes > as outlined above. 
As I said, to me there's nothing wrong with the way Python variables leak
out of the suites; or, in other words, with the fact that Python has only
two namespaces, the function-local and the global namespace. What I don't
like is that the lookup of a lambda's names is fully deferred until
execution time. This behaviour is already not fully followed for local
variables in functions, since:

>>> y = 0
>>> def foo():
...     print y
...     y = 2
...
>>> foo()
Traceback (most recent call last):
  File "<stdin>", line 1, in ?
  File "<stdin>", line 2, in foo
UnboundLocalError: local variable 'y' referenced before assignment

which means that Python users *already* know that a variable is not really
looked up only at run-time, but there's "something" going on even at
function definition time. I don't see anything wrong if lambdas (or nested
scopes) did the same for names provably coming from the outer scope.
--
Giovanni Bajo

From noamraph at gmail.com Sat Jul 1 22:06:28 2006
From: noamraph at gmail.com (Noam Raphael)
Date: Sat, 1 Jul 2006 23:06:28 +0300
Subject: [Python-Dev] Empty Subscript PEP on Wiki - keep or toss?
In-Reply-To:
References: <17573.33939.473091.920283@montanaro.dyndns.org>
Message-ID:

Hello,

I posted it as a "pre-PEP", in the hope that it may become a PEP and
be accepted. As it happened, Guido said "no" at the end, so I stopped
pushing the subject.

I think that the main reason for the "no" was that my use case wasn't
convincing enough - the objections were that this wasn't useful
enough, not that it does anything harmful*. As the one who does think
it's useful, I have the tiniest hope that if, in the future, people
become familiar with the package and see the usefulness of
allowing an empty subscript list, the decision will change. If the only
result of me posting it as a PEP is a final "rejected" status that
will prevent any chance of that happening, I don't think I'll bother
to make it a PEP. If it's not the case, then I'll make it a PEP and
post it.
Have a good week, Noam * Yes, I know that adding unneeded feature to the language can be considered "harmful". You may not agree with my distinction in this case. As it is, I barely consider this as an "added feature" - I would say it's mostly "a small generalization". 2006/6/30, Georg Brandl : > skip at pobox.com wrote: > > Noam Raphael posted an empty subscript PEP on the Python Wiki: > > > > http://wiki.python.org/moin/EmptySubscriptListPEP > > > > It's not linked to by any other pages on the wiki. Is there a reason it > > wasn't added to the peps repository? > > Perhaps the author forgot to submit it to the PEP editor, or he decided > to abandon it after the mostly negative discussion here. > > Georg > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > http://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: http://mail.python.org/mailman/options/python-dev/noamraph%40gmail.com > From pedronis at strakt.com Sat Jul 1 22:10:40 2006 From: pedronis at strakt.com (Samuele Pedroni) Date: Sat, 01 Jul 2006 22:10:40 +0200 Subject: [Python-Dev] doc for new restricted execution design for Python In-Reply-To: References: <44A2BF2E.4020706@activestate.com> Message-ID: <44A6D6C0.70408@strakt.com> Brett Cannon wrote: > > I don't know how JavaScript is doing it yet. The critical thing for me > for this month was trying to come up with a security model. > > And if you don't think it is going to scale, how do you think it should > be done? if I remember correctly, the boundary/granularity of mutual isolation is practically web domains, pages from the same domain can liberally access each other data, javascript state. 
From brett at python.org Sat Jul 1 22:27:51 2006 From: brett at python.org (Brett Cannon) Date: Sat, 1 Jul 2006 13:27:51 -0700 Subject: [Python-Dev] For sandboxing: alternative to crippling file() In-Reply-To: <44A60127.3030706@canterbury.ac.nz> References: <44A60127.3030706@canterbury.ac.nz> Message-ID: On 6/30/06, Greg Ewing wrote: > > Brett Cannon wrote: > > > 1) Is removing 'file' from the builtins dict in PyInterpreterState (and > > maybe some other things) going to be safe enough to sufficiently hide > > 'file' confidently (short of someone being stupid in their C extension > > module and exposing 'file' directly)? > > > > 2) Changing open() to return C-implemented delegate objects for files > > (and thus won't type check, but this is Python so I am not worried about > > that too much) and delegate socket objects for IP and URL addresses. > > My suggestion is to change things so that the constructor > of the file type doesn't open files (at least in restricted > mode). Then it wouldn't matter if untrusted code had real > file objects, as they couldn't use them to get access to > any other files. So require use of open() to open a file and then put the access restrictions in open() while turning off the constructor for file? Seems reasonable. It basically shifts the access restrictions to open() instead of 'file'. For some reason this proposal makes me want to remove the checks in read/write methods as well. That way there is only open() that needs to do the checks and 'file' can have the constructor crippled and that be it. Really minimize the impact of code on 'file' itself. Do people think that having the restriction checks for every read/write method is necessary? 
I originally thought of doing that so that if an open file object leaked
into a restricted interpreter by accident there would still be proper
protections, but perhaps that should not be the responsibility of 'file'
and instead should be more up to modules not passing back exposed 'file'
objects into code.

-Brett

From talin at acm.org Sat Jul 1 22:32:02 2006
From: talin at acm.org (Talin)
Date: Sat, 01 Jul 2006 13:32:02 -0700
Subject: [Python-Dev] More Switch: Explicit freezing
Message-ID: <44A6DBC2.5080302@acm.org>

Here's another stab at the "explicit freezing" school of thought on the
switch semantics. The idea is to borrow the freeze protocol and apply it
to functions.

In this scheme, the default behavior of switch is to rebuild the
dictionary each time the switch is executed. However, by calling
freeze(), you can get a 'frozen' version of the function in which all
switch dictionaries contained within the function are precalculated:

    # Emulating 'freeze at function definition time'
    def myfunc( x ):
       switch y:
          case a:
             ...
          case b:
             ...
    myfunc = freeze( myfunc )

This of course lends itself well to decorator syntax:

    @freeze
    def myfunc( x ):
       switch y:
          case a:
             ...
          case b:
             ...

You can also get 'freeze on first use' via the appropriate decorator
function, although that's a little harder to write (essentially, you need
a way to test if the function is already frozen.)

Each time you call freeze(), you get a new copy of the function object
with the switch dictionaries bound to the values that were in the scope
of the call to 'freeze'. This means that you can call freeze several
times and get several different versions of the function:

    def myfunc( x ):
       switch y:
          case a:
             ...
          case b:
             ...
a = 1 b = 2 f1 = freeze( myfunc ) a = 3 b = 4 f2 = freeze( myfunc ) Now we have two versions of the function, each having a different switch dictionary. Note that 'switch' is still usable without 'freeze', it just won't run as fast. This means that the folks who are interested in a switch statement purely for its improved expressiveness can simply not bother with freezing the function. -- Talin From skip at pobox.com Sun Jul 2 00:26:17 2006 From: skip at pobox.com (skip at pobox.com) Date: Sat, 1 Jul 2006 17:26:17 -0500 Subject: [Python-Dev] zlib module build failure on Mac OSX 10.4.7 In-Reply-To: <315DECAC-D582-4692-A0CA-13D3C7BF992C@mac.com> References: <17573.60593.759293.549583@montanaro.dyndns.org> <130298B1-7336-480A-A90F-97B68327BE4A@mac.com> <17574.43390.560231.425494@montanaro.dyndns.org> <315DECAC-D582-4692-A0CA-13D3C7BF992C@mac.com> Message-ID: <17574.63113.771422.208090@montanaro.dyndns.org> Ronald> What does /usr/lib/libz.dylib point to on your system? % cd /usr/lib % ls -l libz.* lrwxr-xr-x 1 root wheel 12 Feb 12 00:32 libz.1.1.3.dylib -> libz.1.dylib -rwxr-xr-x 1 root wheel 72588 Jun 29 18:36 libz.1.2.3.dylib lrwxr-xr-x 1 root wheel 16 Feb 12 00:32 libz.1.dylib -> libz.1.2.3.dylib lrwxr-xr-x 1 root wheel 16 Feb 12 00:32 libz.dylib -> libz.1.2.3.dylib Looks like everything on my system winds up at 1.2.3. Ronald> What I don't understand yet is why your copy of libz doesn't Ronald> have inflateCopy. 
It appears to: % nm libz.1.2.3.dylib | egrep -i inflate U _inflate U _inflateEnd U _inflateInit2_ U _inflateReset U _inflate U _inflateEnd U _inflateInit_ libz.1.2.3.dylib(inflate.o): 9110ea18 T _inflate 911168cc T _inflateCopy 9110e5d8 T _inflateEnd 91116694 t _inflateGetHeader 9110dc84 T _inflateInit2_ 9110e680 T _inflateInit_ 91116524 t _inflatePrime 9110dddc T _inflateReset 91116584 T _inflateSetDictionary 91116744 T _inflateSync 91116888 T _inflateSyncPoint u _inflate_fast u _inflate_table 91116b90 T _inflateBack 91117a6c T _inflateBackEnd 91116a38 T _inflateBackInit_ u _inflate_fast u _inflate_table 91118334 s _inflate_copyright 9111047c t _inflate_table 91110930 t _inflate_fast However, even though the zlib.so is linked with -lz, there's no evidence of it in the otool -L output: % make case $MAKEFLAGS in \ *-s*) CC='gcc' LDSHARED='gcc -bundle -undefined dynamic_lookup' OPT='-DNDEBUG -g -O3 -Wall -Wstrict-prototypes' ./python.exe -E ../setup.py -q build;; \ *) CC='gcc' LDSHARED='gcc -bundle -undefined dynamic_lookup' OPT='-DNDEBUG -g -O3 -Wall -Wstrict-prototypes' ./python.exe -E ../setup.py build;; \ esac running build running build_ext db.h: found (4, 2) in /sw/include/db4 db lib: using (4, 2) db-4.2 sqlite: found /usr/include/sqlite3.h /usr/include/sqlite3.h: version 3.1.3 building 'zlib' extension gcc -fno-strict-aliasing -Wno-long-double -no-cpp-precomp -mno-fused-madd -DNDEBUG -g -O3 -Wall -Wstrict-prototypes -I. -I/Users/skip/src/python-svn/trunk/./Include -I/Users/skip/src/python-svn/trunk/./Mac/Include -I/Users/skip/local/include -I../Include -I. 
-I/usr/local/include -I/Users/skip/src/python-svn/trunk/Include -I/Users/skip/src/python-svn/trunk/build -c /Users/skip/src/python-svn/trunk/Modules/zlibmodule.c -o build/temp.macosx-10.3-ppc-2.5/Users/skip/src/python-svn/trunk/Modules/zlibmodule.o /Users/skip/src/python-svn/trunk/Modules/zlibmodule.c: In function 'PyZlib_uncopy': /Users/skip/src/python-svn/trunk/Modules/zlibmodule.c:724: warning: implicit declaration of function 'inflateCopy' gcc -bundle -undefined dynamic_lookup build/temp.macosx-10.3-ppc-2.5/Users/skip/src/python-svn/trunk/Modules/zlibmodule.o -L/Users/skip/local/lib -L/usr/local/lib -lz -o build/lib.macosx-10.3-ppc-2.5/zlib.so -Wl,-search_paths_first *** WARNING: renaming "zlib" since importing it failed: dlopen(build/lib.macosx-10.3-ppc-2.5/zlib.so, 2): Symbol not found: _inflateCopy Referenced from: build/lib.macosx-10.3-ppc-2.5/zlib.so Expected in: dynamic lookup running build_scripts montanaro:build% otool -L build/lib.macosx-10.3-ppc-2.5/zlib_failed.so build/lib.macosx-10.3-ppc-2.5/zlib_failed.so: /usr/lib/libmx.A.dylib (compatibility version 1.0.0, current version 93.0.0) /usr/lib/libSystem.B.dylib (compatibility version 1.0.0, current version 88.1.6) Ah, found it! There was an antique libz.a in /usr/local/lib dating from 2003. It's all better now. Skip From skip at pobox.com Sun Jul 2 00:44:33 2006 From: skip at pobox.com (skip at pobox.com) Date: Sat, 1 Jul 2006 17:44:33 -0500 Subject: [Python-Dev] Empty Subscript PEP on Wiki - keep or toss? In-Reply-To: References: <17573.33939.473091.920283@montanaro.dyndns.org> Message-ID: <17574.64209.730168.967635@montanaro.dyndns.org> Noam> If the only result of me posting it as a PEP is a final "rejected" Noam> status that will prevent any chance of that happening, I don't Noam> think I'll bother to make it a PEP. If it's not the case, then Noam> I'll make it a PEP and post it. Even if it's ultimately rejected, it still serves as useful documentation of the process. 
I'd go ahead and update it to reflect the latest discussions, submit it, then let the chips fall where they may. That would also get it out of the wiki, orphaned page that it is. Skip From python-dev at zesty.ca Sun Jul 2 00:50:22 2006 From: python-dev at zesty.ca (Ka-Ping Yee) Date: Sat, 1 Jul 2006 17:50:22 -0500 (CDT) Subject: [Python-Dev] Lexical scoping in Python 3k In-Reply-To: <44A60A79.8090802@canterbury.ac.nz> References: <001201c69c65$b3869750$6402a8c0@arkdesktop> <44A60A79.8090802@canterbury.ac.nz> Message-ID: On Sat, 1 Jul 2006, Greg Ewing wrote: > I don't disagree with anything you said, but I think it > would be a good idea to avoid using phrases like "proper > lexical scopes", which is likely to set people off on > a tangent. The issue isn't lexicality, it's writeability. "Fully functional" lexical scopes, then? Python's scopes are lexical (except for builtins) but currently somewhat hamstrung. -- ?!ng From ark at acm.org Sun Jul 2 01:42:28 2006 From: ark at acm.org (Andrew Koenig) Date: Sat, 1 Jul 2006 19:42:28 -0400 Subject: [Python-Dev] Lexical scoping in Python 3k In-Reply-To: Message-ID: <001301c69d68$070e5fe0$6402a8c0@arkdesktop> > "Fully functional" lexical scopes, then? Fine-grained scopes? From ark at acm.org Sun Jul 2 02:22:46 2006 From: ark at acm.org (Andrew Koenig) Date: Sat, 1 Jul 2006 20:22:46 -0400 Subject: [Python-Dev] 2.5 and beyond In-Reply-To: <1f7befae0607010108hb99c574l9e0d7e3cbf378355@mail.gmail.com> Message-ID: <001401c69d6d$a7cd8d20$6402a8c0@arkdesktop> > Don't recall what that was, but creating a new scope on each iteration > sounds hard to explain in Python. I don't think it's particularly hard to explain. For example, one way to explain it is to say that for i in <>: body is equivalent to for <> in <>: local i = <> body This explanation doesn't need to rest on recursion. 
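For readers following the loop-variable thread, here is a minimal sketch of the behaviour being debated. It uses only plain Python; the names `late`, `bound` and `make` are invented for this illustration and do not appear in the thread. It shows what closures created in a for-loop see today, and the two usual workarounds (a default argument, and a nested function that supplies the extra scope).

```python
# Closures created in a for-loop all share the single loop variable `i`,
# so each one sees the value `i` has when it is finally called.
late = []
for i in range(3):
    late.append(lambda: i)
late_results = [f() for f in late]

# Workaround 1: the default-argument trick captures the current value
# of `i` at the moment each lambda is created.
bound = []
for i in range(3):
    bound.append(lambda i=i: i)
bound_results = [f() for f in bound]

# Workaround 2: a nested function gives every closure its own binding,
# which is the "extra level of scope" a per-iteration scope would add.
def make(value):
    return lambda: value

nested_results = [f() for f in [make(i) for i in range(3)]]

print(late_results, bound_results, nested_results)
```

A per-iteration binding, as proposed in this thread, would presumably make the first loop behave like the other two without any explicit workaround.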
From tim.peters at gmail.com Sun Jul 2 03:08:25 2006 From: tim.peters at gmail.com (Tim Peters) Date: Sat, 1 Jul 2006 21:08:25 -0400 Subject: [Python-Dev] 2.5 and beyond In-Reply-To: <001401c69d6d$a7cd8d20$6402a8c0@arkdesktop> References: <1f7befae0607010108hb99c574l9e0d7e3cbf378355@mail.gmail.com> <001401c69d6d$a7cd8d20$6402a8c0@arkdesktop> Message-ID: <1f7befae0607011808x7f7b2661h7f96a57f4bedc672@mail.gmail.com> [Tim] >> Don't recall what that was, but creating a new scope on each iteration >> sounds hard to explain in Python. [Andrew Koenig] > I don't think it's particularly hard to explain. For example, one way to explain it is > to say that > > for i in <>: > body > > is equivalent to > > for <> in <>: > local i = <> > body > > This explanation doesn't need to rest on recursion. Sorry, but as a Python programmer that explanation makes little sense to me. In effect, it pushes the mystery into what "local" is supposed to mean, but there's nothing _already_ in Python that acts the way you need "local" to act. Scope in Python is defined wrt "blocks", so you need to phrase this in terms of blocks, and there are very few kinds of blocks in Python's execution model: A block is a piece of Python program text that is executed as a unit. The following are blocks: a module, a function body, and a class definition. Each command typed interactively is a block. A script file (a file given as standard input to the interpreter or specified on the interpreter command line the first argument) is a code block. A script command (a command specified on the interpreter command line with the `-c' option) is a code block. The file read by the built-in function execfile() is a code block. The string argument passed to the built-in function eval() and to the exec statement is a code block. The expression read and evaluated by the built-in function input() is a code block. That's from section "Naming and binding" of the Python Reference Manual. 
I expect most Python programmers have "module, function, class ... plus some weird stuff I don't much care about" in mind. Python's execution model also has a one-to-one correspondence between active blocks and execution frames (see the rest of that section), which would need to be snapped to consider a finer-grained notion of block that didn't have its own execution frame. In short, it's only easy to define this in Python, without invoking nested functions, if you don't have Python's execution model in mind to begin with. From fdrake at acm.org Sun Jul 2 04:04:47 2006 From: fdrake at acm.org (Fred L. Drake, Jr.) Date: Sat, 1 Jul 2006 22:04:47 -0400 Subject: [Python-Dev] how long to wait for expat to incorporate a fix to prevent a crasher? In-Reply-To: <44A583A6.6010602@v.loewis.de> References: <44A583A6.6010602@v.loewis.de> Message-ID: <200607012204.47631.fdrake@acm.org> On Friday 30 June 2006 16:03, Martin v. Löwis wrote: > If you have a patch, you should commit it to our copy. Make sure you > activate the test case, so that somebody incorporating the next Expat > release doesn't mistakenly roll back your change. A modified version of Brett's patch has been committed to Expat, along with regression tests for two specific cases that it handles (only one of which is relevant to Python). The patch to xmlparse.c has also been committed to Python's copy, and the crasher test has been moved to the regular xml.parsers.expat tests. > Of course, you might wait a few days to see whether Fred creates another > release that we could incorporate without introducing new features. I'm not ready to push for an Expat release, since I've not had much time to pay attention to that project over the past year. I'm trying to catch up on that project's email, but don't expect it to be quick. Once I've had time to discuss this with the current principal maintainer, it shouldn't be difficult to get a 2.0.1 release out the door.
Once that's done, it'll be time to sync with the Expat release again. -Fred -- Fred L. Drake, Jr. From ncoghlan at gmail.com Sun Jul 2 05:26:48 2006 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sun, 02 Jul 2006 13:26:48 +1000 Subject: [Python-Dev] Lexical scoping in Python 3k In-Reply-To: References: <001201c69c65$b3869750$6402a8c0@arkdesktop> Message-ID: <44A73CF8.2080300@gmail.com> Neil Schemenauer wrote: > The := would assign but not declare a variable > in the current scope. There are other benefits to such a statement, too, since we can make it similar to other augmented assignments by letting the object being assigned to interfere with the process. a := 2 could translate to something like: a = a.__assign__(2) with the default behaviour of __assign__ simply being: def __assign__(rhs) return rhs This gives you: - runtime checking for typos (you can't accidentally declare a new variable with := when you really meant to assign to an existing one) - if/when control flow analysis is added to the AST compiler, it will be picked up as an error at compile time along with the other augmented assignments - the object being assigned to can validate/modify its replacement (e.g. automatically wrapping it in a weakref proxy, or checking that it has the correct type) Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- http://www.boredomandlaziness.org From jcarlson at uci.edu Sun Jul 2 05:56:07 2006 From: jcarlson at uci.edu (Josiah Carlson) Date: Sat, 01 Jul 2006 20:56:07 -0700 Subject: [Python-Dev] sys.settrace() in Python 2.3 vs. 
2.4 In-Reply-To: <20060701085221.GC17748@code0.codespeak.net> References: <20060630131620.10CE.JCARLSON@uci.edu> <20060701085221.GC17748@code0.codespeak.net> Message-ID: <20060701205520.10DD.JCARLSON@uci.edu> Armin Rigo wrote: > > Hi Josiah, > > On Fri, Jun 30, 2006 at 01:27:24PM -0700, Josiah Carlson wrote: > > I'll just have to gracefully degrade functionality for older Pythons. > > More precisely, the bug shows up because in > > while 1: > pass > > the current line remains on the 'pass' forever. It works for a loop > like that: > > while 1: > sys > sys > > but it's admittedly quite obscure. That is good to know, thank you Armin. - Josiah From jcarlson at uci.edu Sun Jul 2 06:50:11 2006 From: jcarlson at uci.edu (Josiah Carlson) Date: Sat, 01 Jul 2006 21:50:11 -0700 Subject: [Python-Dev] weakattr In-Reply-To: <1d85506f0607010649y65c9fb24mbc5c636360970e52@mail.gmail.com> References: <1d85506f0607010649y65c9fb24mbc5c636360970e52@mail.gmail.com> Message-ID: <20060701213745.10E0.JCARLSON@uci.edu> "tomer filiba" wrote: > weakattr (weak attributes) are attributes that are weakly referenced > by their containing object. they are very useful for cyclic references -- > an object that holds a reference to itself. I like the added functionality offered with weakattrs as defined. I'm not terribly in love with the syntax of their creation, and I'm curious as to how it plays with __slots__ (not quite having the time to look at its implementation right now), but it is quite explicit, so I can get past that. It would allow us to say, "stop using __del__, use weakattrs", but I'm not sure how well that would work, generally. Toss it out in python-list, I think some people over there would be able to offer more feedback. 
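As a rough illustration of the idea of a weakly referenced attribute, here is a sketch built on the standard `weakref` module. This is not tomer filiba's weakattr implementation, which is not shown in this thread; the `WeakAttr` descriptor, the `Node` class and the `parent_survives` helper are all hypothetical names made up for the example.

```python
import gc
import weakref


class WeakAttr(object):
    """Hypothetical descriptor: keeps the assigned value as a weak reference."""

    def __init__(self, name):
        self._key = "_weak_" + name  # key used in the instance __dict__

    def __get__(self, obj, objtype=None):
        if obj is None:
            return self
        ref = obj.__dict__.get(self._key)
        target = ref() if ref is not None else None
        if target is None:
            # Either never set, or the referent has already been collected.
            raise AttributeError(self._key)
        return target

    def __set__(self, obj, value):
        obj.__dict__[self._key] = weakref.ref(value)


class Node(object):
    parent = WeakAttr("parent")  # child -> parent link is weak

    def __init__(self, parent=None):
        self.children = []       # parent -> child links stay strong
        if parent is not None:
            self.parent = parent
            parent.children.append(self)


def parent_survives():
    root = Node()
    child = Node(root)
    alive_before = child.parent is root
    del root          # drop the only strong reference to the parent
    gc.collect()      # not required on CPython, harmless elsewhere
    try:
        child.parent
        alive_after = True
    except AttributeError:
        alive_after = False
    return alive_before, alive_after


print(parent_survives())
```

Because the child only holds a weak link back to its parent, no reference cycle is formed. Note that this sketch stores the weak reference in the instance `__dict__`, so as written it would not work on classes that define `__slots__`, which is the interaction asked about above.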
- Josiah From jcarlson at uci.edu Sun Jul 2 07:46:57 2006 From: jcarlson at uci.edu (Josiah Carlson) Date: Sat, 01 Jul 2006 22:46:57 -0700 Subject: [Python-Dev] More Switch: Explicit freezing In-Reply-To: <44A6DBC2.5080302@acm.org> References: <44A6DBC2.5080302@acm.org> Message-ID: <20060701215239.10E3.JCARLSON@uci.edu> Talin wrote: > Here's another stab at the "explicit freezing" school of thought on the > switch semantics. The idea is to borrow the freeze protocol and apply it > to functions. -1 . Freezing was previously questionably useful in the realm of general data structures. Using switch/case as a use-case for this functionality, I think, is a non-starter. - Josiah From arigo at tunes.org Sun Jul 2 13:23:44 2006 From: arigo at tunes.org (Armin Rigo) Date: Sun, 2 Jul 2006 13:23:44 +0200 Subject: [Python-Dev] LOAD_CONST POP_TOP In-Reply-To: References: Message-ID: <20060702112344.GA30481@code0.codespeak.net> Hi Georg, On Fri, Jun 30, 2006 at 08:39:13PM +0200, Georg Brandl wrote: > + case LOAD_CONST: > + cumlc = lastlc + 1; > + /* Skip over LOAD_CONST POP_TOP */ > + if (codestr[i+3] == POP_TOP) { This is missing a ISBASICBLOCK() check. It makes the following example segfault: a = 5 for i in range(1000): a or 4 Attached an updated patch to the SF tracker. A bientot, Armin. From jan-python at maka.demon.nl Sun Jul 2 13:58:55 2006 From: jan-python at maka.demon.nl (jan-python at maka.demon.nl) Date: Sun, 02 Jul 2006 13:58:55 +0200 Subject: [Python-Dev] 2.5 and beyond In-Reply-To: References: Message-ID: <20060702135855.bdk6r8kugw0c0w0o@webmail.nl.demon.net> Hi everyone, Even though I'm new on this list I think I've got something sensible to say on this one. (I've been following this list a bit through the archives) Andrew Koenig wrote: > You seem to be right -- the problem is not that Python is lexically scoped, > but that when you define a variable with =, it leaks out into the > surrounding function scope. 
> So maybe what I meant when I asked for lexical scopes was two things: > > 1) Every indentation level should be a scope; > 2) In general, variable definitions should not leak into > surrounding scopes. > > I realize that (2) is too simplistic. Someone who writes I believe the problem has nothing to do with how many scopes a block/function definition has, but with what the lambda does with the scope it's given. Currently it remembers the block and looks up the necessary variables in it when it's invoked. I think it should have just taken the values of the needed variables and remembered those as its own local variables. So the closed over variables become just local variables initialised to the value they have in the outer scope. Without having any blocks to be confused about, I think this is counterintuitive as well: >>> x = 1 >>> f = lambda: x >>> x = 2 >>> g = lambda: x >>> f() 2 >>> g() 2 I think it should have been: .... >>> f() 1 >>> g() 2 Using the lambda x=x: x trick gives exactly this behaviour because it apparently does copy the value of x. As far as I can see it also solves the >>> for i in range(10): ... a.append(lambda: i) case, and other similar examples. (However, this would probably be too big a change for 2.5) From pje at telecommunity.com Sun Jul 2 16:07:57 2006 From: pje at telecommunity.com (Phillip J. Eby) Date: Sun, 02 Jul 2006 10:07:57 -0400 Subject: [Python-Dev] 2.5 and beyond In-Reply-To: <20060702135855.bdk6r8kugw0c0w0o@webmail.nl.demon.net> References:

Message-ID: <5.1.1.6.0.20060702100646.01f1b878@sparrow.telecommunity.com> At 01:58 PM 7/2/2006 +0200, jan-python at maka.demon.nl wrote: >I believe the problem has nothing to do with how many scopes a block/function >definition has, but with what the lambda does with the scope it's given. >Currently it remembers the block and looks up the necessary variables in it >when it's invoked. I think it should have just taken the values of the >needed variables and remembered those as its own local variables. So >the closed >over variables become just local variables initialised to the value >they have in >the outer scope. That won't work. Consider this code, that's perfectly valid Python today: def foo(): def bar(): print x for x in range(10): bar() From amk at amk.ca Sun Jul 2 18:00:44 2006 From: amk at amk.ca (A.M. Kuchling) Date: Sun, 2 Jul 2006 12:00:44 -0400 Subject: [Python-Dev] Another 2.5 bug candidate? Message-ID: <20060702160044.GA21217@rogue.amk.ca> http://www.python.org/sf/1488934 argues that Python's use of fwrite() has incorrect error checking; this most affects file.write(), but there are other uses of fwrite() in the core. It seems fwrite() can return N bytes written even if an error occurred, and the code needs to also check ferror(f->fp). At the last sprint I tried to assemble a small test case to exhibit the problem but failed. The reporter's test case uses SSH, and I did verify that Python does loop infinitely if executed under SSH, but a test case would need to work without SSH. Should this be fixed in 2.5? I'm nervous about such a change to error handling without a test case to add; maybe it'll cause problems on one of our platforms. --amk From nmm1 at cus.cam.ac.uk Sun Jul 2 20:08:19 2006 From: nmm1 at cus.cam.ac.uk (Nick Maclaren) Date: Sun, 02 Jul 2006 19:08:19 +0100 Subject: [Python-Dev] Another 2.5 bug candidate? Message-ID: "A.M.
Kuchling" wrote: > > http://www.python.org/sf/1488934 argues that Python's use of fwrite() > has incorrect error checking; this most affects file.write(), but > there are other uses of fwrite() in the core. It seems fwrite() can > return N bytes written even if an error occurred, and the code needs > to also check ferror(f->fp). > > At the last sprint I tried to assemble a small test case to exhibit > the problem but failed. The reporter's test case uses SSH, and I did > verify that Python does loop infinitely if executed under SSH, but a > test case would need to work without SSH. > > Should this be fixed in 2.5? I'm nervous about such a change to error > handling without a test case to add; maybe it'll cause problems on one > of our platforms. So would assembling a test case. NOTHING will cause ferror to return True that isn't classed as undefined behaviour, and therefore may fail on some platforms. Regards, Nick Maclaren, University of Cambridge Computing Service, New Museums Site, Pembroke Street, Cambridge CB2 3QH, England. Email: nmm1 at cam.ac.uk Tel.: +44 1223 334761 Fax: +44 1223 334679 From python-dev at zesty.ca Mon Jul 3 00:07:43 2006 From: python-dev at zesty.ca (Ka-Ping Yee) Date: Sun, 2 Jul 2006 17:07:43 -0500 (CDT) Subject: [Python-Dev] Lexical scoping in Python 3k In-Reply-To: <000701c69d41$7dc7a9b0$6402a8c0@arkdesktop> References: <000701c69d41$7dc7a9b0$6402a8c0@arkdesktop> Message-ID: On Sat, 1 Jul 2006, Andrew Koenig wrote: > I'd rather see a simpler rule: = never defines a variable in a surrounding > scope. If you want to affect the binding of such a variable, you have to > define it explicitly in the scope in which you want it. > > Example: > > x = 42 > def f(): > x = 123 # rebinds x as defined above > y = 123 # defines local variable > f() > print x # prints 123 > print y # error -- y not defined > > Yes, I know that rule is too simplistic. But I think I'd still prefer it to > the way things are now. 
I agree with you that this is a nicer and more consistent rule. What do you think of the proposal for a keyword to say "don't rebind"? It would achieve the same distinction you're aiming for above, but without the drastic incompatibility with today's Python. This has been previously discussed as "change the meaning of 'global' to mean 'not local'": http://mail.python.org/pipermail/python-dev/2006-February/061568.html http://mail.python.org/pipermail/python-dev/2006-July/066908.html I support this proposal, though i would prefer a clearer keyword such as "outer x" or "nonlocal x". If we can't agree on another keyword (or can't afford to spend one more keyword), i'm willing to support "global" for this purpose. -- ?!ng From greg.ewing at canterbury.ac.nz Mon Jul 3 03:11:43 2006 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Mon, 03 Jul 2006 13:11:43 +1200 Subject: [Python-Dev] Lexical scoping in Python 3k In-Reply-To: <20060701013710.10D7.JCARLSON@uci.edu> References: <20060630163625.10D1.JCARLSON@uci.edu> <44A612E8.8050205@canterbury.ac.nz> <20060701013710.10D7.JCARLSON@uci.edu> Message-ID: <44A86ECF.4060705@canterbury.ac.nz> Josiah Carlson wrote: > If the only code that benefits from such changes are "very *simple*", > then I think that says something about its necessity. The point is that they're only "very simple" if you can write them using access to an outer scope. Without that ability, they become less simple, less efficient, more convoluted, harder to follow, etc. Also I don't buy the argument that something has to be useful for big, complicated things in order to be worth having in the language. 
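To make the "very simple" case concrete, here is a small sketch of a closure that rebinds a variable in its enclosing function. It assumes an interpreter that actually provides the `nonlocal` spelling floated in this thread (Python 3 does; the Python 2.x versions under discussion do not). The function names `make_counter` and `make_counter_workaround` are illustrative only.

```python
def make_counter():
    count = 0

    def increment():
        nonlocal count   # rebind `count` in the enclosing function's scope
        count += 1
        return count

    return increment


def make_counter_workaround():
    # Without write access to the outer scope, the usual trick is to
    # mutate a container instead of rebinding the name.
    count = [0]

    def increment():
        count[0] += 1
        return count[0]

    return increment


counter = make_counter()
values = [counter(), counter(), counter()]

workaround = make_counter_workaround()
workaround_values = [workaround(), workaround()]

print(values, workaround_values)
```

The second version works in any Python, but it is the kind of less direct, more convoluted code that the argument for writable outer scopes is about.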
-- Greg From greg.ewing at canterbury.ac.nz Mon Jul 3 03:36:31 2006 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Mon, 03 Jul 2006 13:36:31 +1200 Subject: [Python-Dev] 2.5 and beyond In-Reply-To: <024901c69d43$ea5a6480$d503030a@trilan> References: <000601c69d41$073dbf00$6402a8c0@arkdesktop> <024901c69d43$ea5a6480$d503030a@trilan> Message-ID: <44A8749F.2000801@canterbury.ac.nz> Giovanni Bajo wrote: > I believe that names in > lambdas/nested-functions referring to local names in the outer scope should > really be bound at function definition time No, you don't want that, because it would make functions that call each other very awkward to arrange. > And it's also handy that the iteration variable of the for loop is > accessible after the for loop is terminated (in fact, this specific > behaviour is already listed among the wont-change for Py3k). I'd just like to point out that the create-a-new-cell behaviour that I have proposed for loop variables *preserves* this ability! for new i in range(10): ... print i will still print 9. -- Greg From greg.ewing at canterbury.ac.nz Mon Jul 3 03:50:54 2006 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Mon, 03 Jul 2006 13:50:54 +1200 Subject: [Python-Dev] 2.5 and beyond In-Reply-To: <1f7befae0607010108hb99c574l9e0d7e3cbf378355@mail.gmail.com> References: <5.1.1.6.0.20060630190709.01ef5858@sparrow.telecommunity.com> <001201c69c9c$b6531fd0$6402a8c0@arkdesktop> <1f7befae0606301639u6f8c968cr29afce0ee7fbb43b@mail.gmail.com> <020c01c69ca1$754c7310$d1b12997@bagio> <44A61301.2070805@canterbury.ac.nz> <1f7befae0607010108hb99c574l9e0d7e3cbf378355@mail.gmail.com> Message-ID: <44A877FE.6040004@canterbury.ac.nz> Tim Peters wrote: > Scheme has no loops in Python's sense -- > things like "do" are shorthand for expressing stylized recursion But it does have foreach and map, which are the moral equivalent of Python's for-loops and list comprehensions. 
The body is a lambda which takes the loop variable as a parameter, thus providing the extra level of scope. Recursion isn't needed. > When people talk about giving a Python for-loop vrbl its own scope, > they generally don't mean a new scope on _each iteration_, But that's exactly what you *do* need in order for a for-loop with a lambda in it to behave intuitively. If that's not what people mean, it's because they don't fully understand what they really mean to mean. :-) BTW, I'm not suggesting that a new stack frame gets allocated for every iteration -- all you need is a cell. > about the same as creating a nested block with > its own autos in C. Analogies with C aren't very helpful here, because it doesn't have closures, so it's only a matter of visibility, not lifetime. > creating a new scope on each iteration > sounds hard to explain in Python. But is it harder to explain than the reason someone's loop-with-a-lambda doesn't do what they expect? BTW, I wouldn't explain it by saying it creates a new scope, I'd say it creates a new binding on each iteration, or something like that. In my earlier proposal, you would actually say that explicitly, with something like for new i in range(10): ... > Abusing the default-argument machinery to capture current bindings is > never necessary, and _is_ abuse. But it's abuse that still happens, because although scoping has been fixed, other parts of the story are still missing. -- Greg From steven.bethard at gmail.com Mon Jul 3 01:28:08 2006 From: steven.bethard at gmail.com (Steven Bethard) Date: Sun, 2 Jul 2006 17:28:08 -0600 Subject: [Python-Dev] DRAFT: python-dev summary for 2006-06-01 to 2006-06-15 Message-ID: Here's the summary for the first half of June. Thanks in advance for your comments and corrections! ============= Announcements ============= ------------------- Python 2.5 schedule ------------------- Python 2.5 is moving steadily towards its next release. See `PEP 356`_ for more details and the full schedule. .. 
_PEP 356: http://www.python.org/dev/peps/pep-0356/ Contributing threads: - `beta1 coming real soon `__ - `2.5 issues need resolving in a few days `__ ----------------------------------------------- Request for Bug Trackers to replace SourceForge ----------------------------------------------- The Python Software Foundation's Infrastructure committee asked for suggestions for tracker systems that could replace SourceForge. The minimum requirements are: * Can import SourceForge data * Can export data * Has an email interface and if you'd like to suggest a particular tracker system all you need to do is: * Install a test tracker * Import the `SourceForge data dump`_ * Make the `Infrastructure committee members`_ administrators of the tracker * Add your tracker to the `wiki page`_ * Email `the Infrastructure committee`_ Be sure to check the `wiki page`_ for additional information. .. _SourceForge data dump: http://effbot.org/zone/sandbox-sourceforge.htm .. _Infrastructure committee members: http://wiki.python.org/moin/PythonSoftwareFoundationCommittees#infrastructure-committee-ic .. _wiki page: http://wiki.python.org/moin/CallForTrackers .. _the Infrastructure committee: infrastructure at python.org Contributing thread: - `Request for trackers to evaluate as SF replacement for Python development `__ ========= Summaries ========= -------------------------------------------- Getting more comparable results from pybench -------------------------------------------- Skip Montanaro mentioned that the NeedForSpeed_ folks had some trouble with the pybench_ string and unicode tests. In some discussions both on the checkins list and off-list, Fredrik Lundh had concluded that stringbench more reliably reported performance than pybench. There was then a long discussion about how to improve pybench including: * Using time.clock() on Windows and time.time() on Linux. This was accompanied by a long debate about whether to use wall-time or process time. 
Both wall time and process time can see interference from other programs running at the same time; wall time because the time consumed by other programs running at the same time is also counted, and process time because it is sampled so that other processes can charge their time to the running process by using less than a full time slice. In general, the answer was to use the timer with the best resolution. * Using the minimum time rather than the average. Andrew Dalke explained that timing results do not have a Gaussian distribution (they have more of a gamma distribution) and provided some graphs generated on his machine to demonstrate this. Since the slower runs are typically caused by other things running at the same time (which is pretty much unpredictable), it's much better to report the fastest run, which should more consistently approximate the best possible time. * Making sure to use an appropriate warp factor. Marc-Andre Lemburg explained that each testing round of pybench is expected to take around 20-50 seconds. If rounds are much shorter than this, pybench's warp factor should be adjusted until they are long enough. At the end of the thread, Marc-Andre checked in pybench_ 2.0, which included the improvements suggested above. .. _NeedForSpeed: http://wiki.python.org/moin/NeedForSpeed .. _pybench: http://svn.python.org/view/python/trunk/Tools/pybench/ .. _stringbench: http://svn.python.org/view/sandbox/trunk/stringbench/ Contributing threads: - `Python Benchmarks `__ - `Python Benchmarks `__ --------------------------------------- PEP 360: Externally Maintained Packages --------------------------------------- After checking wsgiref into the Python repository, Phillip J. Eby requested in `PEP 360`_ that patches to wsgiref be passed to him before being committed on the trunk. After a number of changes were committed to the trunk and he had to go through a complicated two-way merge, he complained that people were not following the posted procedures. 
Guido suggested that `PEP 360`_ was a mistake, and that whenever possible, development for any module in the stdlib should be done in the Python trunk, not externally. He also requested that the PEP indicate that even for externally maintained modules, bugfixes and ordinary maintenance should be allowed on the trunk so that bugs in external modules don't hold up Python core development.

A number of solutions were discussed for authors of code that is also distributed standalone. Using svn:externals is mostly undesirable because svn is much slower at checking whether or not an svn:externals directory is updated, and because upgrading to a newer version would require making sure that no changes made by Python developers were lost in the new version. Phillip suggested adding an "Externals" directory and modifying Python's setup to invoke all the ``Externals/*/setup.py`` scripts, though this would mean having some Python code that lives outside of the Lib/ subtree. Barry Warsaw explained that for the email package, he maintains a directory in the sandbox with all the distutils and documentation stuff needed for the standalone releases as well as the email package from the Python repository through svn:externals. This means having to create some extra directories (since svn:externals doesn't work with individual files) and having one checkout per version of Python supported, but seemed to work pretty well for Barry.

People seemed to like Phillip's Externals idea (possibly renamed to Packages), but work on that was postponed for Python 2.6. One of the side benefits of these discussions was that Thomas Heller generously offered to move ctypes development fully into the Python repository.

.. _PEP 360: http://www.python.org/dev/peps/pep-0360/

Contributing threads:

- `wsgiref documentation `__
- `wsgiref doc draft; reviews/patches wanted `__
- `[Web-SIG] wsgiref doc draft; reviews/patches wanted `__
- `FYI: wsgiref is now checked in `__
- `Please stop changing wsgiref on the trunk `__
- `Dropping externally maintained packages (Was: Please stop changing wsgiref on the trunk) `__
- `External Package Maintenance (was Re: Please stop changing wsgiref on the trunk) `__
- `External Package Maintenance `__
- `rewording PEP 360 `__
- `Updating packages from external ? `__

--------------------------------------
Universally unique identifiers (UUIDs)
--------------------------------------

Ka-Ping Yee was looking to put his `uuid module`_ into Python 2.5. He addressed a number of requests from the last round of discussions, including making UUIDs immutable, removing curly braces from the UUID string and adding the necessary tests to the test suite. Then he asked about how best to address the fact that ``uuid1()`` required looking up a MAC address, a potentially slow procedure. At the suggestion of Fredrik Lundh, he changed the API to allow a MAC address to be passed in if it was already known. If a MAC address is not passed in to ``uuid1()``, the ``getnode()`` utility function is called, which searches for the MAC address through a variety of routes, including some quicker paths through ctypes that Thomas Heller and others helped Ka-Ping with. The code was checked into the Python trunk.

.. _uuid module: http://zesty.ca/python/uuid.py

Contributing thread:

- `UUID module `__

-------------------------------------
PEP 275: Switching on Multiple Values
-------------------------------------

Thomas Lee offered up a `patch implementing the switch statement`_ from `PEP 275`_. People brought up a number of concerns with the implementation (and the switch statement in general).
The implementation provided no way to map multiple values to the same case (without repeating the code in the case). The implementation also made the switch statement essentially syntactic sugar for a series of if/elif/else statements, and people were concerned that just adding another way to write if/elif/else was not much of a gain for Python. The discussion continued on into the next fortnight.

.. _patch implementing the switch statement: http://bugs.python.org/1504199
.. _PEP 275: http://www.python.org/dev/peps/pep-0275/

Contributing thread:

- `Switch statement `__

---------------------------------------------------------
The period of the random module's random number generator
---------------------------------------------------------

Alex Martelli noticed a note in random.shuffle.__doc__ which said that most permutations of a long sequence would never be generated due to the short period of the random number generator. This turned out to be an artifact from back when Python used the Wichmann-Hill generator instead of the Mersenne Twister generator it uses currently. There was some discussion as to whether the comment should be removed or updated, and Robert Kern pointed out that at sequence lengths of 2081 or greater, the comment was still true. Tim Peters decided it was best to just remove the comment, explaining that "anyone sophisticated enough to *understand* an accurate warning correctly would have no need to be warned".

Contributing thread:

- `a note in random.shuffle.__doc__ ... `__

-------------------------------------------------------
Pre-PEP: Allow Empty Subscript List Without Parentheses
-------------------------------------------------------

Noam Raphael presented a `pre-PEP for empty subscript lists`_ in getitem-style access to objects.
This would allow zero-dimensional arrays to work in a similar manner to all other N-dimensional arrays, and make all of the following equivalences hold::

    x[i, j] <--> x[(i, j)]
    x[i,]   <--> x[(i,)]
    x[i]    <--> x[(i)]
    x[]     <--> x[()]

Most people felt that zero-dimensional arrays were uncommon enough that either they could be replaced with simple names, e.g. ``x``, or could use the currently available syntax, i.e. ``x[()]``. Zero-dimensional arrays are even uncommon in numpy_ where, after `rehashing the issue`_ innumerable times, zero-dimensional arrays have been almost entirely replaced with scalars.

.. _pre-PEP for empty subscript lists: http://wiki.python.org/moin/EmptySubscriptListPEP
.. _numpy: http://numeric.scipy.org/
.. _rehashing the issue: http://projects.scipy.org/scipy/numpy/wiki/ZeroRankArray

Contributing thread:

- `Pre-PEP: Allow Empty Subscript List Without Parentheses `__

----------------------------------------------
PEP 337: Logging Usage in the Standard Library
----------------------------------------------

For the `Google Summer of Code`_, Jackilyn Hoxworth has been working on implementing parts of `PEP 337`_ to use the logging module in parts of the stdlib. When Jim Jewett, who is mentoring her, brought up a few issues, people got concerned that this work was being done at all, given that `PEP 337`_ has not been approved. Jim and A.M. Kuchling clarified that the goal of Jackilyn's work is both to clarify the PEP (e.g. determine exactly which modules would benefit from logging) and to provide an implementation that can be tweaked as necessary if the PEP is accepted. For the first draft at least, it looked like Jackilyn would keep things simple -- using "py." + __name__ for the logger name, not adding any new logging messages, not changing any message formats, and generally aiming only to give stderr and stdout messages across different modules a common choke point.

.. _Google Summer of Code: http://code.google.com/soc/
.. _PEP 337: http://www.python.org/dev/peps/pep-0337/

Contributing thread:

- `Stdlib Logging questions (PEP 337 SoC) `__

-------------------
inspect.isgenerator
-------------------

Michele Simionato asked for a new function in the inspect module that would identify a function as being a generator function. Phillip J. Eby pointed out that any function can return a generator-iterator (though generator functions are of course guaranteed to do so) and suggested that the perceived need for this inspect function was misguided. Michele agreed and withdrew the proposal.

Contributing threads:

- `feature request: inspect.isgenerator `__
- `feature request: inspect.isgenerator `__

--------------------------------
Unescaping entities with sgmllib
--------------------------------

Sam Ruby asked why sgmllib unescapes entities selectively, not all or nothing (which would be easier to work around), and Fred L. Drake, Jr. explained that sgmllib is really only intended as support for htmllib. Sam suggested isolating the code that attempts to resolve character references into a single method so that subclasses could override this behavior as needed. Martin v. Löwis agreed that this seemed reasonable, though he suggested two functions, one for character references and one for entity references. Sam implemented the suggested behavior and provided a `patch to sgmllib`_.

.. _patch to sgmllib: http://bugs.python.org/1504676

Contributing thread:

- `sgmllib Comments `__

-------------------------------------------------------------------------
Scoping vs augmented assignment vs sets (Re: 'fast locals' in Python 2.5)
-------------------------------------------------------------------------

A bug in Python 2.5 that did not detect augmented assignment as creating a local name allowed code like the following to work::

    >>> g = 1
    >>> def f1():
    ...     g += 1
    ...
    >>> f1()
    >>> g
    2

This of course started the usual discussion about giving Python a way to rebind names in enclosing scopes.
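In releases without this bug, the augmented assignment makes ``g`` local to ``f1``, so the call fails rather than rebinding the module-level name. A minimal sketch of that behaviour (the ``outcome`` variable exists only for illustration):

```python
g = 1

def f1():
    # Augmented assignment counts as an assignment, so g is local to f1
    # and is read here before it has ever been bound.
    g += 1

try:
    f1()
except UnboundLocalError:
    outcome = "UnboundLocalError"
else:
    outcome = "rebound the global"

print(outcome)
```

Adding ``global g`` as the first statement of ``f1`` is the existing way to get the rebinding behaviour on purpose.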
Boris Borcic in particular was hoping that the bug could be considered a feature, but Terry Reedy explained that Python was not willing to give up the near equivalence between ``x = x + 1`` and ``x += 1``. Since the former creates a local name, the latter ought to do the same thing. The thread seemed like it might drift on further until Guido cut it off, pronouncing that the behavior of augmented assignments creating local names was not going to change.

Contributing threads:

- `'fast locals' in Python 2.5 `__
- `Scoping vs augmented assignment vs sets (Re: 'fast locals' in Python 2.5) `__
- `Comparing closures and arguments (was Re: Scoping vs augmented assignment vs sets (Re: 'fast locals' in Python 2.5) `__
- `The baby and the bathwater (Re: Scoping, augmented assignment, 'fast locals' - conclusion) `__

---------------------------------------
Checking out an older version of Python
---------------------------------------

Skip Montanaro asked about checking out a particular version of Python. Oleg Broytmann and Tim Peters explained that tags are just directories in Subversion, and you can view all the existing ones and their corresponding revision numbers at http://svn.python.org/projects/python/tags/. Oleg also explained that the difference between::

    svn switch svn+ssh://pythondev@svn.python.org/python/tags/r242

and noting that the r242 tag corresponds to revision 39619 and doing::

    svn up -r 39619

is that with the latter, commits will go to the trunk (assuming the update was performed on a trunk checkout), while with the former, commits will go to the appropriate tag or branch. Giovanni Bajo provided a nice explanation of this, describing Subversion's 2D coordinate system of [url, revision] and Skip added the explanation to the `Development FAQ`_.

.. _Development FAQ: http://www.python.org/dev/faq/

Contributing thread:

- `Subversion repository question - back up to older versions `__

--------------------
Source control tools
--------------------

In the externally maintained packages discussion, Guido suggested offhand that some other version control project might make it easier to resolve some of the issues. Thomas Wouters put forward a number of considerations. On the negative side of changing to one of the newer version control systems:

* Workflow would have to change somewhat to use most of the new branch-oriented systems.
* Everyone would have to download the whole repository (at least once) since with the newer systems everyone usually has their own repository.

But on the positive side:

* History can be preserved for merges of branches (unlike Subversion), which is a big gain for when the trunk is switched to 3.0.

Thomas tried importing the Python repository into a number of different systems, and after playing around with them, concluded that in the short term, none of the other version control systems were quite ready yet, though he seemed optimistic for them in the next few years. He also promised to publish imports of the Python repository into Git, Darcs, Mercurial, Bazaar-NG and Monotone somewhere once he was able to successfully import them all.

Contributing thread:

- `Source control tools `__

----------------------------------------
Is implicit underscore assignment buggy?
----------------------------------------

Raymond Hettinger noted that in the interactive interpreter, an expression that returns None not only suppresses the printing of that None, but also suppresses the assignment to ``_``. Raymond asked if this was intentional as it makes code like the following break::

    >>> import re, string
    >>> re.search('lmnop', string.letters)
    <_sre.SRE_Match object at 0xb6f2c480>
    >>> re.search('pycon', string.letters)
    >>> if _ is not None:
    ...     print _.group()
    lmnop

Fredrik Lundh pointed out that users just need to recognize that ``_`` holds the most recently *printed* result. Guido pronounced that this would not change. Terry Reedy suggested adding some documentation for this behavior to Language Reference 2.3.2 Reserved Classes of Identifiers and/or to Tutorial 2.1.2 Interactive Mode, but it was unclear if any doc changes were committed.

Contributing thread:

- `Is implicit underscore assignment buggy? `__

-----------------------
Removing Mac OS 9 cruft
-----------------------

A number of old Mac OS 9 bits and pieces that are no longer used were removed:

* IDE scripts
* MPW
* Tools/IDE
* Tools/macfreeze
* Unsupported/mactcp/dnrglue.c
* Wastemods

This should solve some problems for Windows checkouts where files with trailing dots are not supported.

Contributing threads:

- `Removing Mac OS 9 cruft `__
- `Mac/wastemodule build failing `__

------------------------------------------
Fixing buffer object's char buffer support
------------------------------------------

Brett Cannon found that ``import array; int(buffer(array.array('c')))`` caused the interpreter to segfault because buffer objects were redirecting tp_as_buffer->bf_getcharbuffer to the wrong tp_as_buffer slot. Brett fixed the bug and updated the docs a bit to clarify what was intended for the implementation, but kept changes pretty minimal as Python 3.0 will ditch buffer for the bytes type anyway.

Contributing threads:

- `How to fix the buffer object's broken char buffer support `__
- `Is "t#" argument format meant to be char buffer, or just read-only? `__

-------------------------------
Importing subpackages in Jython
-------------------------------

In Jython 2.1, importing a module makes all subpackages beneath it available, unlike in regular Python, where subpackages must be imported separately. Samuele Pedroni explained that this was intentional so that imports in Jython would work like imports in Java do.
Guido suggested that having imports work this way in Jython was fine as long as a Java package was being imported, but when a Python package was being imported, Jython should use the Python semantics.

Contributing thread:

- `Import semantics `__

---------------------------------------------
RFC 3986: Uniform Resource Identifiers (URIs)
---------------------------------------------

There was some continued discussion of Paul Jimenez's proposed `uriparse module`_ which more faithfully implements `RFC 3986`_ than the current urlparse module. Nick Coghlan submitted an `alternate implementation`_ that kept all parsed URIs as (scheme, authority, path, query, fragment) tuples by allowing some of these elements to be non-strings, e.g. authority could be a (user, password, host, port) tuple, and path could be a (user, host) tuple. People seemed to like Nick's implementation, but no final decision on the module was made.

.. _uriparse module: http://bugs.python.org/1462525
.. _RFC 3986: http://www.ietf.org/rfc/rfc3986.txt
.. _alternate implementation: http://bugs.python.org/1500504

Contributing thread:

- `Some more comments re new uriparse module, patch 1462525 `__

-----------------------------------------------------
False instead of TypeError for frozenset.__contains__
-----------------------------------------------------

Collin Winter suggested that code like ``{} in frozenset([1, 2, 3])`` should return False instead of raising a TypeError. Guido didn't like the idea because he thought it would mask bugs where, say, a user-defined __hash__() method accidentally raised a TypeError.

Contributing thread:

- `Unhashable objects and __contains__() `__

--------------------------------------------
IOError or ValueError for invalid file modes
--------------------------------------------

Kristján V. Jónsson asked why open()/file() throws an IOError for an invalid mode string instead of a ValueError.
Georg Brandl explained that either an IOError or a ValueError can be raised depending on whether the invalid mode was detected in Python's code or in the OS's fopen call. Guido suggested that this couldn't really be fixed until Python gets rid of its stdio-based implementation in Python 3.0.

Contributing thread:

- `file() `__

---------------------------------------------------------------
[Python-checkins] r46603 - python/trunk/Lib/test/test_struct.py
---------------------------------------------------------------

Martin Blais checked in un-unittestification of test_struct, and a number of people questioned whether that was a wise thing to do. Thomas Wouters suggested that unittest should merge as many features from py.test_ as possible. This would reduce some of the class-based boilerplate currently required, and also allow some nice additional features like test cases generated on the fly. He didn't get much of a response though, so it was unclear what the plans for Python 2.6 were.

.. _py.test: http://codespeak.net/py/current/doc/test.html

Contributing thread:

- `[Python-checkins] r46603 - python/trunk/Lib/test/test_struct.py `__

-----------------------------------------------
Should hex() yield 'L' suffix for long numbers?
-----------------------------------------------

Ka-Ping Yee asked why hex() and oct() still produced an 'L' suffix for long numbers even now that ints and longs have basically been unified. `PEP 237`_ had mentioned the removal of this suffix, but not given it a specific release for removal, so people decided it was best to wait until Python 3.0 when the 'L' suffix will also be removed from repr().

.. _PEP 237: http://www.python.org/dev/peps/pep-0237/

Contributing thread:

- `Should hex() yield 'L' suffix for long numbers? `__

---------------------------------
Adding an index of Python symbols
---------------------------------

Terry Reedy suggested adding a page to the Python Language Reference index that would list each symbol in Python (e.g.
``()``, ``[]`` and ``@``) along with the places in the documentation where it was discussed. Terry promised to submit a plain-text version in time for the Python 2.5 release, so that someone could convert it to LaTeX and merge it into the docs.

Contributing thread:

- `Symbol page for Language Reference Manual Index `__

------------------------------------------
Behavior of searching for empty substrings
------------------------------------------

Fredrik Lundh resolved the issues discussed previously with searching for an empty substring at a position past the end of the string. The current behavior looks like::

    >>> "ab".find("")
    0
    >>> "ab".find("", 1)
    1
    >>> "ab".find("", 2)
    2
    >>> "ab".find("", 3)
    -1

Both Tim Peters and Guido applauded the final resolution.

Contributing thread:

- `Search for empty substrings (was Re: Let's stop eating exceptions in dict lookup) `__

-----------------
subprocess.IGNORE
-----------------

Martin Blais asked about adding subprocess.IGNORE along the lines of subprocess.PIPE which would ignore the child's output without being susceptible to buffer deadlock problems. Under Unix, IGNORE could be implemented as ``open('/dev/null', 'w')``, and on Windows, ``open('nul:', 'w')``. People seemed to think this was a useful feature, but at the time of this summary, no patch had yet been provided.

Contributing thread:

- `subprocess.Popen(.... stdout=IGNORE, ...) `__

================
Deferred Threads
================

- `Improve error msgs? `__
- `Keeping interned strings in a set `__
- `Documentation enhancement: "MS free compiler"? `__
- `Code coverage reporting. `__
- `Numerical robustness, IEEE etc. 
`__ ================== Previous Summaries ================== - `Let's stop eating exceptions in dict lookup `__ - `ssize_t question: longs in header files `__ - `ssize_t: ints in header files `__ - `zlib module doesn't build - inflateCopy() not found `__ =============== Skipped Threads =============== - `Segmentation fault of Python if build on Solaris 9 or10 with Sun Studio 11 `__ - `Possible bug in complexobject.c (still in Python 2.5) `__ - `[Python-checkins] r46300 - in python/trunk: Lib/socket.py Lib/test/test_socket.py Lib/test/test_struct.py Modules/_struct.c Modules/arraymodule.c Modules/socketmodule.c `__ - `test_struct failure on 64 bit platforms `__ - `string inconsistency `__ - `S/390 buildbot URLs problematic `__ - `SF patch #1473257: "Add a gi_code attr to generators" `__ - `test_unicode failure on MIPS `__ - `valgrind report `__ - `test_ctypes failures on ppc64 debian `__ - `Request for patch review `__ - `patch #1454481 vs buildbot `__ - `Seeking Core Developers for Vancouver Python Workshop `__ - `[Python-checkins] Python Regression Test Failures refleak (1) `__ - `Include/structmember.h, Py_ssize_t `__ - `DC Python sprint on July 29th `__ - `tarfile and unicode filenames in windows `__ - `[Python-checkins] buildbot warnings in hppa Ubuntu dapper trunk `__ - `-Wi working for anyone else? `__ - `Inject some tracing ... `__ - `Segmentation fault in collections.defaultdict `__ - `Add pure python PNG writer module to stdlib? `__ - `crash in dict on gc collect `__ - `"can't unpack IEEE 754 special value on non-IEEE platform" `__ - `socket._socketobject.close() doesn't really close sockets `__ - `DRAFT: python-dev summary for 2006-04-16 to 2006-04-30 `__ - `[Python-checkins] r46795 - in python/trunk: Doc/lib/libstdtypes.tex Lib/test/string_tests.py Misc/NEWS Objects/stringobject.c Objects/unicodeobject.c `__ - `xrange vs. 
int.__getslice__ `__ - `request for review: patch 1446489 (zip64 extensions in zipfile) `__ - `DRAFT: python-dev summary for 2006-05-01 to 2006-05-15 `__ - `pychecker warnings in Lib/encodings `__ - `Moving PEP 343 to Final `__ - `Python sprint at Google Aug. 21-24 `__ - `Long options support `__ - `High Level Virtual Machine `__ - `sqlite3 test errors - was : Re: [Python-checkins] r46936 - in python/trunk: Lib/sqlite3/test/regression.py Lib/sqlite3/test/types.py Lib/sqlite3/test/userfunctions.py Modules/_sqlite/connection.c Modules/_sqlite/cursor.c Modules/_sqlite/module.c Modules/_sqlite/module.h `__ - `[Python-checkins] sqlite3 test errors - was : Re: r46936 - in python/trunk: Lib/sqlite3/test/regression.py Lib/sqlite3/test/types.py Lib/sqlite3/test/userfunctions.py Modules/_sqlite/connection.c Modules/_sqlite/cursor.c Modules/_sqlite/module.c Modules/_sqlite/module.h `__ - `[Python-checkins] sqlite3 test errors - was : Re: r46936 - in python/trunk: Lib/sqlite3/test/regression.py Lib/sqlite3/test/types.py Lib/sqlite3/test/userfunctions.py Modules/_sqlite/connection.c Modules/_sqlite/cursor.c Modules/_sqlite/module.c `__ - `[Python-checkins] sqlite3 test errors - was : Re: r46936 - in python/trunk: Lib/sqlite3/test/regression.py Lib/sqlite3/test/types.py Lib/sqlite3/test/userfunctions.py Modules/_sqlite/connection.c Modules/_sqlite/cursor.c Modules/_sql `__ - `Last-minute curses patch `__ - `DRAFT: python-dev summary for 2006-05-16 to 2006-05-31 `__ - `Bug: xml.dom.pulldom never gives you END_DOCUMENT events with an Expat parser `__ - `Misleading error message from PyObject_GenericSetAttr `__ - `About dynamic module loading `__ From jcarlson at uci.edu Mon Jul 3 07:15:39 2006 From: jcarlson at uci.edu (Josiah Carlson) Date: Sun, 02 Jul 2006 22:15:39 -0700 Subject: [Python-Dev] Lexical scoping in Python 3k In-Reply-To: <44A86ECF.4060705@canterbury.ac.nz> References: <20060701013710.10D7.JCARLSON@uci.edu> <44A86ECF.4060705@canterbury.ac.nz> Message-ID: 
<20060702211647.10F1.JCARLSON@uci.edu> Greg Ewing wrote: > Josiah Carlson wrote: > > If the only code that benefits from such changes are "very *simple*", > > then I think that says something about its necessity. > > The point is that they're only "very simple" if you > can write them using access to an outer scope. Without > that ability, they become less simple, less efficient, > more convoluted, harder to follow, etc. As is known and has been stated, assigning to a parent scope can be emulated in various ways, either through an explicit namespace object, or through a namespace list. > Also I don't buy the argument that something has to > be useful for big, complicated things in order to be > worth having in the language. I never claimed that something needed to be useful for "big, complicated things" in order to be worth having in the language. To be explicit, if nontrivial code isn't improved, that doesn't necessarily mean that the feature is useless. However, if the feature is really only useful for generally trivial cases *without* the feature, then making them even more trivial, I think, is a bit of over optimization. - Josiah From guido at python.org Mon Jul 3 07:25:21 2006 From: guido at python.org (Guido van Rossum) Date: Mon, 3 Jul 2006 07:25:21 +0200 Subject: [Python-Dev] Lexical scoping in Python 3k In-Reply-To: <20060702211647.10F1.JCARLSON@uci.edu> References: <20060701013710.10D7.JCARLSON@uci.edu> <44A86ECF.4060705@canterbury.ac.nz> <20060702211647.10F1.JCARLSON@uci.edu> Message-ID: On 7/3/06, Josiah Carlson wrote: > > Greg Ewing wrote: > > Josiah Carlson wrote: > > > If the only code that benefits from such changes are "very *simple*", > > > then I think that says something about its necessity. > > > > The point is that they're only "very simple" if you > > can write them using access to an outer scope. Without > > that ability, they become less simple, less efficient, > > more convoluted, harder to follow, etc. 
> > As is known and has been stated, assigning to a parent scope can be > emulated in various ways, either through an explicit namespace object, or > through a namespace list. And the fact that this desire and need remains, even amongst people who should know better, suggests that it may be worth supporting it more directly, as the current work-arounds ain't pretty. -- --Guido van Rossum (home page: http://www.python.org/~guido/) From jcarlson at uci.edu Mon Jul 3 08:08:33 2006 From: jcarlson at uci.edu (Josiah Carlson) Date: Sun, 02 Jul 2006 23:08:33 -0700 Subject: [Python-Dev] Lexical scoping in Python 3k In-Reply-To: References: <20060702211647.10F1.JCARLSON@uci.edu> Message-ID: <20060702225058.10F4.JCARLSON@uci.edu> "Guido van Rossum" wrote: > > On 7/3/06, Josiah Carlson wrote: > > > > Greg Ewing wrote: > > > Josiah Carlson wrote: > > > > If the only code that benefits from such changes are "very *simple*", > > > > then I think that says something about its necessity. > > > > > > The point is that they're only "very simple" if you > > > can write them using access to an outer scope. Without > > > that ability, they become less simple, less efficient, > > > more convoluted, harder to follow, etc. > > > > As is known and has been stated, assigning to a parent scope can be > > emulated in various ways, either through an explicit namespace object, or > > through a namespace list. > > And the fact that this desire and need remains, even amongst people > who should know better, suggests that it may be worth supporting it > more directly, as the current work-arounds ain't pretty. Perhaps not pretty, but not wholly ugly either. Or expressed another way, it's a wart, but the wart isn't 1" across on a forehead, it's fairly small and tucked away on an elbow. 
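For concreteness, the two emulations mentioned above (an explicit namespace object and a one-element list) could look something like the sketch below. The names ``Namespace``, ``counter_with_namespace`` and ``counter_with_list`` are made up for illustration and do not come from the thread.

```python
class Namespace(object):
    """Empty container; attributes stand in for outer-scope variables."""
    pass

def counter_with_namespace():
    ns = Namespace()
    ns.count = 0
    def increment():
        # Mutates an attribute of ns; the name ns itself is never rebound.
        ns.count += 1
        return ns.count
    return increment

def counter_with_list():
    count = [0]  # a one-element list used as a mutable cell
    def increment():
        # Item assignment mutates the list, so no local name is created.
        count[0] += 1
        return count[0]
    return increment

by_namespace = counter_with_namespace()
by_list = counter_with_list()
print(by_namespace(), by_namespace())
print(by_list(), by_list())
```

Both versions work because the inner function only mutates an object found in the outer scope and never assigns to the outer name directly.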
I had hoped that there would be a response to my second (and I believe more applicable statement); "if the feature is really only useful for generally trivial cases *without* the feature, then making them even more trivial, I think, is a bit of over optimization."

As for a solution, I find the "global means 'not local'" proposition is the least undesirable of the possibilities. It suffers from a change in semantics and potential name masking issues, but I don't believe these are any more serious than normal global masking for the former, and the latter is solvable with a __future__ (at least for 2.6). I'm a solid -0 on this particular proposition, which is far better than the -1 I am on all of the other recent lexical scoping propositions.

 - Josiah

From martin at v.loewis.de Mon Jul 3 08:19:14 2006
From: martin at v.loewis.de (=?ISO-8859-1?Q?=22Martin_v=2E_L=F6wis=22?=)
Date: Mon, 03 Jul 2006 08:19:14 +0200
Subject: [Python-Dev] Proposal to eliminate PySet_Fini
In-Reply-To: <20060627184404.GL10485@performancedrivers.com>
References: <20060627184404.GL10485@performancedrivers.com>
Message-ID: <44A8B6E2.2040003@v.loewis.de>

Jack Diederich wrote:
> PyObject_MALLOC does a good job of reusing small allocations but it
> can't quite manage the same speed as a free list, especially for things that
> have some extra setup involved (tuples have a free list for each length).

I would question that statement, for any practical purposes. The cost of tuple comes from setting the elements to NULL, and that has to be done regardless of whether they were allocated new or came from the list. Likewise, the GC management has to be done regardless. So I expect that the speedup is rather minor, and not worth it.
Regards,
Martin

From talin at acm.org Mon Jul 3 09:27:04 2006
From: talin at acm.org (Talin)
Date: Mon, 03 Jul 2006 00:27:04 -0700
Subject: [Python-Dev] Lexical scoping in Python 3k
In-Reply-To: <20060702225058.10F4.JCARLSON@uci.edu>
References: <20060702211647.10F1.JCARLSON@uci.edu>
    <20060702225058.10F4.JCARLSON@uci.edu>
Message-ID: <44A8C6C8.3030401@acm.org>

Josiah Carlson wrote:
> I had hoped that there would be a response to my second (and I believe
> more applicable statement); "if the feature is really only useful for
> generally trivial cases *without* the feature, then making them even
> more trivial, I think, is a bit of over optimization."

It really depends on how common the trivial case is. In other words, multiply the savings for each occurrence times the number of occurrences. (Unfortunately, I don't know what units to measure said savings in - is there a unit of 'mental disconnect' or unintuitiveness? :)

In an ideal world, the language would allow everything to be said in the most comprehensible way possible. Longer and more verbose ways of stating something are at an inherent disadvantage in this, simply because of the time it takes to scan and absorb the information by the human brain.

However, losing excess syntax has to be done in a way that doesn't also lose information. Highly compressed representations of a concept may require such a level of abstraction that it is as much work to puzzle out their meaning as it would be to read the longer version and more.

To put it another way - I am an advocate of applying Claude Shannon's theory of information to language design. The highest level of compression should be used for expressions that occur the most frequently.

> As for a solution, I find the "global means 'not local'" proposition is
> the least undesirable of the possibilities.
It suffers from a change > in semantics and potential name masking issues, but I don't believe > these are any more serious than normal global masking for the former, > and the latter is solvable with a __future__ (at least for 2.6). I'm a > solid -0 on this particular proposition, which is far better than the -1 > I am on all of the other recent lexical scoping propositions. I'd say that the more common case is where you want global to really mean global - that is, you want to be able to write to some module-level variable, regardless of how deeply nested your function scope is. While being able to access the 'next outer scope' is occasionally useful, it's not all that common. So changing the behavior of 'global' in this case would be both confusing (since it no longer means 'global'), and less useful (because it doesn't match the most common case.) (This assumes that I haven't completely understood the meaning of the phrase 'not local' - I assumed that it means 'not defined in this scope') Of course, the reason why it's not all that common may be because of the fact that it's not as easy to do, and so people tend to (consciously or otherwise) avoid that pattern in their designs. That being said, I don't think that's necessarily such a bad thing. Python isn't Scheme, and the scoping rules of Python are IMHO more oriented towards practicality and common sense than theoretical purity. This is why I'm not bothered by the fact that Python doesn't create a new scope for loop statements and such. Most of the time, this is what you want. It does mean that you need to name all of your variables uniquely, but that's good programming style in any case. The same is true for local variables not needing to be specially declared as 'my' or 'var' - most of the time, a local variable is what you want. On the other hand, the thing about theoretical purity is that it can be so mouth-wateringly powerful at times. 
For example, a language that supports closures is, IMHO, at least twice as powerful as a language that doesn't -- because you can use them in so many different and interesting ways.

OK, so about the lexical scoping issue - let me brainstorm a moment:

One idea would be to introduce the keyword 'local' which would have the effect of capturing any 'global' statements in any enclosed scope. So for example:

    f = 1
    def a():
        local f
        f = 2
        def b():
            global f
            f = 3

So in this case, the 'global' statement, which would normally associate 'f' with the outermost (module-level) scope, would instead associate 'f' with the innermost 'local' declaration of that variable. So in the above example, assigning 3 to f assigns it to the middle scope, but does not affect the module-level definition.

Admittedly, that's a bit confusing and also verbose, considering that you are not only adding an extra keyword, but also using two statements to specify the home of one variable.

Another alternative would be a way to declare an explicitly scoped variable. Let's use the keyword 'my' to indicate this:

    f = 1
    def a():
        my f = 2
        def b():
            f = 3

In this case, what the 'my' statement is doing is indicating that this scope 'owns' the definition of 'f' -- in other words, the definition is hoisted out of any enclosed scopes. So again, in the above example, the innermost assignment will be to the definition of 'f' in the middle scope.

What's interesting about this is that you can use the same method with globals:

    my f = 1
    def a():
        f = 2
    a()
    print f     # prints '2'

So again, you are indicating that the global scope 'owns' the definition of 'f', and any enclosed scopes should use that definition, and not create their own.
Of course, if you really *do* need to have your own version, you can always override the 'my' statement with another 'my' statement:

    my f = 1
    def a():
        my f = 2
    a()
    print f     # prints '1'

The 'my' statement essentially changes the scoping rules for all variables of that name, within the defining scope and all enclosed scopes.

Of course, you can also override this behavior using the 'global' statement, which does exactly what it does now - makes the reference global (i.e. module-level):

    my f = 1
    def a():
        global f
        f = 2
    a()
    print f     # prints '2'

All right, I'm pretty happy with that. Brainstorming done. :)

-- Talin

From tim.peters at gmail.com Mon Jul 3 10:14:42 2006
From: tim.peters at gmail.com (Tim Peters)
Date: Mon, 3 Jul 2006 04:14:42 -0400
Subject: [Python-Dev] Proposal to eliminate PySet_Fini
In-Reply-To: <44A8B6E2.2040003@v.loewis.de>
References: <20060627184404.GL10485@performancedrivers.com> <44A8B6E2.2040003@v.loewis.de>
Message-ID: <1f7befae0607030114h49fb33edwa61bc5e55ec3b569@mail.gmail.com>

[Jack Diederich]
>> PyObject_MALLOC does a good job of reusing small allocations but it
>> can't quite manage the same speed as a free list, especially for things that
>> have some extra setup involved (tuples have a free list for each length).

[Martin v. Löwis]
> I would question that statement, for any practical purposed. The cost of
> tuple comes from setting the elements to NULL, and that has to be done
> regardless of whether they were allocated new or came from the list.

Except _that_ overhead is trivial for small tuples, and small tuples are the only kind the free lists cache. There are many other overheads. If a tuple is taken off a free list, we get to skip integer multiplication and division checking for overflow before calling PyObject_GC_NewVar. We also get to skip the call to PyObject_GC_NewVar. That in turn skips another integer multiplication in the _PyObject_VAR_SIZE macro, and a call to _PyObject_GC_Malloc.
That in turn skips a call to PyObject_MALLOC, and conditionals checking whether it's time to trigger a gc collection. All of that is highly significant compared to the cost of setting at most a handful of slots to NULL inline.

> Likewise, the GC management has to be done regardless.

_PyObject_GC_TRACK expands to 5 inlined simple stores, and a predictable branch, so it is often more expensive than setting the tuple slots to NULL. But, as above, we get to skip three layers of function call and "will it overflow?" arithmetic in the service of _setting up_ an object for gc initially. Only the gc track/untrack pair remains for tuples in a free list.

> So I expect that the speedup is rather minor, and not worth it.

Depends on the app :-) Here's a test case that gets supernatural benefit from small-tuple caching:

"""
def doit():
    N1000 = [None] * 1000
    basetup = (5,)
    for i in N1000:
        tups = []
        push = tups.append
        for j in xrange(10):
            for k in N1000:
                push(basetup * j)

from time import clock as now

times = []
for i in range(3):
    start = now()
    doit()
    finish = now()
    times.append(finish - start)
print sorted(times)
"""

With current trunk that printed

    [2.9363677646013846, 2.9489729031005703, 2.9689538729183949]

After changing

    #define MAXSAVEDTUPLES 2000

to

    #define MAXSAVEDTUPLES 0

the times zoomed to

    [4.5894824930441587, 4.6023111649343242, 4.629560027293957]

That's pretty dramatic.

OTOH, I don't have any apps that do that <0.5 wink>, and there's another downside: on SF recently, someone complained that the 2.5 obmalloc work to release unused arenas wasn't doing much in his (perhaps equally artificial -- don't know) test case. It surprised me too, so I dug into it. The problem turned out to be that piles of arenas were being kept "artificially" alive because obmalloc was the original source of thousands of tuple objects being kept alive (from obmalloc's POV) in tupleobject.c's free lists.
If you're unlucky, it only takes one tiny tuple in one free list to keep an entire 256KB arena alive -- and if you're very unlucky, you manage to allocate objects in such a way that this happens repeatedly. His test also created lots of dicts along the way, and each arena got mostly filled up with dict objects and a relative handful of small tuples. By the time all of this became trash, the tuples were spread over a few hundred arenas, which effectively became immortal.

While it doesn't make any real sense, I've seen repeatedly that users _try_ calling gc.collect() in such cases (doesn't make sense because it has nothing to do with cyclic gc). But that suggests it could be a _pragmatic_ win to add an internal "clear all known free lists" function, which gc could call at times, or even just be forced by the user via a `gc` entry point or gc.collect() option.

The immortal & unbounded int and float free lists don't cause obmalloc arenas to stay alive unduly, because they get their memory in chunks straight from the system malloc(). But they have to be the cause of the most frequent source of complaints from newbies and Zope users ;-), who do things like range(10000000) and then marvel that some 120MB of VM is still being used after nuking the list and doing a gc.collect(). I get weary of explaining that one :-(.
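
A minimal sketch of why gc.collect() is the wrong tool for this, assuming CPython's reference counting: ordinary trash is reclaimed the moment its last reference goes away, and the cyclic collector only matters for objects caught in a reference cycle. The Node class and refcount_vs_cycle() are illustrative names made up for this sketch, not anything from the thread or from CPython itself.

```python
import gc
import weakref


class Node(object):
    """Trivial object that supports weak references."""


def refcount_vs_cycle():
    """Return (freed_by_refcount, cycle_alive_before_gc, cycle_freed_by_gc)."""
    was_enabled = gc.isenabled()
    gc.disable()  # keep the automatic collector out of the picture
    try:
        # Non-cyclic trash: freed immediately by reference counting.
        plain = Node()
        plain_ref = weakref.ref(plain)
        del plain
        freed_by_refcount = plain_ref() is None

        # Cyclic trash: survives until the cyclic collector runs.
        looped = Node()
        looped.me = looped
        looped_ref = weakref.ref(looped)
        del looped
        cycle_alive_before_gc = looped_ref() is not None
        gc.collect()
        cycle_freed_by_gc = looped_ref() is None
    finally:
        if was_enabled:
            gc.enable()
    return freed_by_refcount, cycle_alive_before_gc, cycle_freed_by_gc


print(refcount_vs_cycle())
```

Neither path has anything to do with objects parked on a type-specific free list, which is the situation described above.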
From greg.ewing at canterbury.ac.nz Mon Jul 3 11:19:14 2006 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Mon, 03 Jul 2006 21:19:14 +1200 Subject: [Python-Dev] Lexical scoping in Python 3k In-Reply-To: <20060702225058.10F4.JCARLSON@uci.edu> References: <20060702211647.10F1.JCARLSON@uci.edu> <20060702225058.10F4.JCARLSON@uci.edu> Message-ID: <44A8E112.1050309@canterbury.ac.nz> Josiah Carlson wrote: > I had hoped that there would be a response to my second (and I believe > more applicable statement); "if the feature is really only useful for > generally trivial cases *without* the feature, then making them even > more trivial, I think, is a bit of over optimization." I don't think "trivial" is the right word to use here, since it implies something that's of so little importance that it can be ignored. But the simple cases are precisely the ones where this wart hurts the most, so we can't ignore them. Arguments that a feature is undesirable because this or that workaround exists seem like post-hoc justifications to me. Think about it the other way around -- if writing to outer scopes had been straightforward from the beginning, would you be arguing for the *removal* of that ability? Would it even have occurred to anyone to do such a thing? -- Greg From greg.ewing at canterbury.ac.nz Mon Jul 3 11:50:06 2006 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Mon, 03 Jul 2006 21:50:06 +1200 Subject: [Python-Dev] Lexical scoping in Python 3k In-Reply-To: <44A8C6C8.3030401@acm.org> References: <20060702211647.10F1.JCARLSON@uci.edu> <20060702225058.10F4.JCARLSON@uci.edu> <44A8C6C8.3030401@acm.org> Message-ID: <44A8E84E.2040806@canterbury.ac.nz> Talin wrote: > To put it another way - I am an advocate of applying Claude Shannon's > theory of information to language design. The highest level of > compression should be used for expressions that occur the most frequently. 
I believe the proposal in question would cause no net worsening in this information content, and may actually improve it slightly, due to allowing a few things to be written in a shorter and clearer way, while allowing the vast majority of existing things to be written in exactly the same way.

> I'd say that the more common case is where you want global to really
> mean global - that is, you want to be able to write to some module-level
> variable, regardless of how deeply nested your function scope is.

It would still mean that, except in the (expected to be *extremely* rare) case where you happened to have a variable with the same name assigned in some intermediate scope. Such cases would be easily fixed by renaming the intermediate variable -- using a name of shorter or equal length, if you like, to keep the information content up. :-)

> So changing the behavior of 'global' in this case
> would be both confusing (since it no longer means 'global'),

An alternative would be to change the keyword as well, to something like 'outer', which would better match its semantics. But if that were done, I would argue for the *removal* of the existing 'global' keyword, which would then be almost completely redundant. This would break large amounts of existing code, however, and it's highly dubious whether that would be worth the small increase in pedantic accuracy, even in Py3k. We're not supposed to be *gratuitously* breaking things in Py3k, after all.

> (This assumes that I haven't completely understood the meaning of the
> phrase 'not local' - I assumed that it means 'not defined in this scope')

Yes, the new meaning would be "in the next outermost scope where there is an assignment to this name, or the module scope if you get that far".

> Python isn't Scheme, and the scoping rules of Python are
> IMHO more oriented towards practicality and common sense
> than theoretical purity.
Yes, but I find it hard to regard being *forbidden* from assigning to intermediate scopes as something driven by practical need rather than just being a historical accident. Back when there were strictly two scopes, many people argued themselves blue in the face that this was actually a *good* thing, even if you didn't realise it, and that Python was doing you a favour by enforcing it. Eventually a measure of sanity prevailed, and we got something a lot more like traditional lexical scoping. But one remnant of the old system remained, like a vestigial organ -- the 'global' statement that reaches all the way out to the module scope, regardless of what exists in between. To someone used to lexical scoping in almost any other language that has it, this is a *very* strange and unintuitive thing. Looking back, I think the meaning of 'global' should have been redefined right then at the same time. That would have been the logical and consistent thing to do, and in my opinion would have resulted in a scoping model that was simpler, more useful and no less practical. The most theoretically pure thing would have been to change it to 'outer' at the same time, but that would have broken too much code, and would therefore not have been practical. See, I'm not immune to practicality arguments. :-) > One idea would be to introduce the keyword 'local' which would have the > effect of capturing any 'global' statements in any enclosing scope. That seems unnecessary to me. Or at least not necessary enough to be worth the extra complexity in the scoping model. 
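
For comparison, Python 3 later added a `nonlocal` statement (PEP 3104) that behaves much like the "next outermost scope" rule discussed here, with one difference worth flagging: it stops at enclosing function scopes and never falls through to the module scope. The sketch below is only an illustration of that later statement, with made-up names (outer, middle, inner); it is not something proposed in this thread and needs a Python 3 interpreter.

```python
def outer():
    f = 1                      # binding in the outermost function scope

    def middle():
        f = 2                  # separate binding in the intermediate scope

        def inner():
            nonlocal f         # refers to middle's f, the nearest enclosing one
            f = 3

        inner()
        return f               # middle's f, after inner() rebound it

    return middle(), f         # outer's own f is untouched


print(outer())  # prints (3, 1)
```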
-- Greg

From skip at pobox.com Mon Jul 3 15:01:45 2006
From: skip at pobox.com (skip at pobox.com)
Date: Mon, 3 Jul 2006 08:01:45 -0500
Subject: [Python-Dev] Lexical scoping in Python 3k
In-Reply-To: <20060702225058.10F4.JCARLSON@uci.edu>
References: <20060702211647.10F1.JCARLSON@uci.edu> <20060702225058.10F4.JCARLSON@uci.edu>
Message-ID: <17577.5433.301972.602166@montanaro.dyndns.org>

    Josiah> As for a solution, I find the "global means 'not local'"
    Josiah> proposition is the least undesireable of the possibilities. It
    Josiah> suffers from a change in semantics and potential name masking
    Josiah> issues...

Pychecker and PyLint both already identify cases where builtins are masked by locals or module globals (and may identify cases where locals mask module globals - I don't recall). I suspect both could be generalized in this regard without a huge effort. That's probably the best place for this sort of warning.

Skip

From ark at acm.org Mon Jul 3 17:08:18 2006
From: ark at acm.org (Andrew Koenig)
Date: Mon, 3 Jul 2006 11:08:18 -0400
Subject: [Python-Dev] Lexical scoping in Python 3k
In-Reply-To: <44A8E112.1050309@canterbury.ac.nz>
Message-ID: <003801c69eb2$872dc990$6402a8c0@arkdesktop>

> I don't think "trivial" is the right word to use here,
> since it implies something that's of so little importance
> that it can be ignored. But the simple cases are precisely
> the ones where this wart hurts the most, so we can't
> ignore them.

I'd like to inject an example that might help make this discussion more concrete.

Consider the following function:

    def for_each(seq, f):
        for i in seq:
            f(i)

I'm sure I've seen more than one instance of someone on comp.lang.python trying to do the equivalent of using a function such as this one to compute the sum of the elements of a sequence as follows:

    def sum(seq):
        result = 0
        def accum(i):
            result += i
        for_each(seq, accum)
        return result

and wonder why it doesn't work.
Still odder, why it doesn't work and the following does:

    def sum(seq):
        result = [0]
        def accum(i):
            result[0] += i
        for_each(seq, accum)
        return result[0]

Transforming the first definition of sum above into the second may be trivial, but only if you've encountered the technique before. Moreover, the first version of sum uses a technique that is more than 45 years old (!), as it was available to Algol 60 programmers.

From martin at v.loewis.de Mon Jul 3 17:10:16 2006
From: martin at v.loewis.de ("Martin v. Löwis")
Date: Mon, 03 Jul 2006 17:10:16 +0200
Subject: [Python-Dev] Proposal to eliminate PySet_Fini
In-Reply-To: <1f7befae0607030114h49fb33edwa61bc5e55ec3b569@mail.gmail.com>
References: <20060627184404.GL10485@performancedrivers.com> <44A8B6E2.2040003@v.loewis.de> <1f7befae0607030114h49fb33edwa61bc5e55ec3b569@mail.gmail.com>
Message-ID: <44A93358.3040000@v.loewis.de>

Tim Peters wrote:
> With current trunk that printed
>
> [2.9363677646013846, 2.9489729031005703, 2.9689538729183949]
>
> After changing
>
> #define MAXSAVEDTUPLES 2000
>
> to
>
> #define MAXSAVEDTUPLES 0
>
> the times zoomed to
>
> [4.5894824930441587, 4.6023111649343242, 4.629560027293957]
>
> That's pretty dramatic.

Interesting. I ran this through gprof, and found the following changes to the number of function calls

                      with-cache    without-cache
    PyObject_Malloc        59058         24055245
    tupletraverse          33574         67863194
    visit_decref          131333        197199417
    visit_reachable       131333        197199417
    collect                   17            33006
    (for reference:)
    tuplerepeat         30000000         30000000

According to gprof, these functions (excluding tuplerepeat) together account for 40% of the execution time in the without-cache (i.e. MAXSAVEDTUPLES 0) case.

So it appears that much of the slowdown in disabling the fast tuple allocator is due to the higher frequency of garbage collection in your example. Can you please re-run the example with gc disabled?
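
A rough sketch of such a re-run, assuming a current Python 3 interpreter (the xrange and time.clock used in the original test no longer exist there). time_runs() and its parameters are illustrative names, and the loop size is a parameter so the sketch finishes quickly; absolute numbers will not match the ones quoted above.

```python
import gc
import time


def doit(n=1000):
    """Same shape as the quoted test: build many small tuples of length 0..9."""
    filler = [None] * n
    basetup = (5,)
    for _ in filler:
        tups = []
        push = tups.append
        for j in range(10):
            for _ in filler:
                push(basetup * j)


def time_runs(runs=3, disable_gc=False, n=1000):
    """Time doit() `runs` times, optionally with the cyclic collector off."""
    was_enabled = gc.isenabled()
    if disable_gc:
        gc.disable()
    try:
        times = []
        for _ in range(runs):
            start = time.perf_counter()
            doit(n)
            times.append(time.perf_counter() - start)
    finally:
        if was_enabled:
            gc.enable()  # restore the collector if it was on before
    return sorted(times)


if __name__ == "__main__":
    print("gc enabled: ", time_runs(n=200))
    print("gc disabled:", time_runs(n=200, disable_gc=True))
```

Comparing the two printed lists gives a first impression of how much of the cost is collector work rather than allocation.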
Of course, it's really no surprise that GC is called more often: if the tuples are allocated from the cache, that doesn't count as an allocation wrt. GC. It so happens that your example just triggers gc a few times in its inner loop; I wouldn't attribute that overhead to obmalloc per se.

Regards,
Martin

From guido at python.org Mon Jul 3 17:45:49 2006
From: guido at python.org (Guido van Rossum)
Date: Mon, 3 Jul 2006 17:45:49 +0200
Subject: [Python-Dev] Lexical scoping in Python 3k
In-Reply-To: <003801c69eb2$872dc990$6402a8c0@arkdesktop>
References: <44A8E112.1050309@canterbury.ac.nz> <003801c69eb2$872dc990$6402a8c0@arkdesktop>
Message-ID:

On 7/3/06, Andrew Koenig wrote:
> > I don't think "trivial" is the right word to use here,
> > since it implies something that's of so little importance
> > that it can be ignored. But the simple cases are precisely
> > the ones where this wart hurts the most, so we can't
> > ignore them.
>
> I'd like to inject an example that might help make this discussion more
> concrete.
>
> Consider the following function:
>
>     def for_each(seq, f):
>         for i in seq:
>             f(i)
>
> I'm sure I've seen more than one instance of someone on comp.lang.python
> trying to do the equivalent of using a function such as this one to compute
> the sum of the elements of a sequence as follows:
>
>     def sum(seq):
>         result = 0
>         def accum(i):
>             result += i
>         for_each(seq, accum)
>         return result
>
> and wonder why it doesn't work. Still odder, why it doesn't work and the
> following does:
>
>     def sum(seq):
>         result = [0]
>         def accum(i):
>             result[0] += i
>         for_each(seq, accum)
>         return result[0]
>
> Transforming the first definition of sum above into the second may be
> trivial, but only if you've encountered the technique before. Moreover, the
> first version of sum uses a technique that is more than 45 years old (!), as
> it was available to Algol 60 programmers.

Much though the Algol 60 tickles my nostalgia (it was my first programming language!)
I don't think that it's a particularly strong argument. I like to think that we have better ways these days.

I think you need to come up with a better motivating example; the above is particularly un-idiomatic Python. It starts by defining a higher-order function for_each that has little to offer over writing an explicit for loop, and then uses this to motivate writing a simple operation (result += i) as a function instead so that it fits in the inconvenient for_each() API.

I understand that both for_each() and accum() are just examples of more complicated functions, but I can't help thinking that the problem here only occurs for very *simple* functions in the place of accum(); a more complicated form of accum would likely be a bound method of a class instance which carries the state.

A better way to decompose these kinds of problems is probably by using generators. The equivalent of for_each() would not take a function parameter but *yield* the successive values instead of calling f() with successive values; e.g.:

    def for_each(seq):
        for i in seq:
            yield i

Then the sum() function could be written like this:

    def sum(seq):
        result = 0
        for i in for_each(seq):
            result += i
        return result

--
--Guido van Rossum (home page: http://www.python.org/~guido/)

From ark at acm.org Mon Jul 3 17:56:02 2006
From: ark at acm.org (Andrew Koenig)
Date: Mon, 3 Jul 2006 11:56:02 -0400
Subject: [Python-Dev] Lexical scoping in Python 3k
In-Reply-To:
Message-ID: <006c01c69eb9$323f37a0$6402a8c0@arkdesktop>

> Much though the Algol 60 tickles my nostalgia (it was my first
> programming language!) I don't think that it's a particularly strong
> argument. I like to think that we have better ways these days.

Even if so, that's not the point I was trying to make. The point is that there is a programming technique that is widely used, works in many languages, and has been around for 45 years; and when you try to use it in Python, it fails.
I believe that such failures, even if there are alternative ways of solving the problems that engender them, are barriers to learning that should be removed if it is possible to do so without substantial cost.

From theller at python.net Mon Jul 3 18:19:02 2006
From: theller at python.net (Thomas Heller)
Date: Mon, 03 Jul 2006 18:19:02 +0200
Subject: [Python-Dev] Moving the ctypes repository to python.org
In-Reply-To: <44A57D6D.70701@v.loewis.de>
References: <20060623145602.GB10250@niemeyer.net> <449C5157.5050004@v.loewis.de> <44A57D6D.70701@v.loewis.de>
Message-ID: <44A94376.2030004@python.net>

Martin v. Löwis wrote:
> Thomas Heller wrote:
>> - Do I need special rights to call 'svnadmin load' to import this dumpfile
>> into Python SVN, or are the normal commit rights sufficient?
>
> It's called "svnadmin" for a reason :-)
>
> Neal Norwitz or myself will have to do that; we need to do it on the
> repository machine locally. I would likely take subversion write
> access offline for the time of the import, so that I can rollback
> the entire repository in case of an operator mistake.

Please tell me when you or Neal have time for the import.

>> What exactly is the URL/PATH where it should be imported (some sandbox,
>> I assume)?
>
> My view is that this is the "projects" repository; with ctypes being a
> project, it should go into the root directory (i.e. as a sibling to
> python, peps, distutils, stackless, ...). If you prefer to see it in
> sandbox, this could work as well.

Having it in "projects" is fine, this matches the directory structure that cvs2svn creates (ctypes/trunk, ctypes/tags, ctypes/branches).

>> - What about the Python trunk? Should changes from the sandbox be merged
>> into Modules/_ctypes and Lib/ctypes, or would it be better (or possible at all)
>> to use the external mechanism?
>
> I would prefer to see two-way merges going on, at least until 2.5 is
> released (i.e. no changes to Modules/ctypes except for bug fixes).

Ok.

> Using svn:external is a risky thing wrt. to branching/tagging:
>
> When we tag the Python tree, we want to tag the entire source tree.
> With svn:external, only the external link would be in the tag,
> i.e. later changes to the external link would modify old tags.
> This is undesirable.
>
> This problem could be solved with a versioned external link;
> this would mean that ctypes could not be edited directly, but
> that one would have to go through the original repository
> URL to perform modifications, and then update the external
> link.
>
> So I think I still would prefer two-way merges. There are
> tools to make the merges pretty mechanic.

I have no experience at all with svn:external, so that's fine with me too.

Thanks,
Thomas

From tomerfiliba at gmail.com Mon Jul 3 19:00:09 2006
From: tomerfiliba at gmail.com (tomer filiba)
Date: Mon, 3 Jul 2006 19:00:09 +0200
Subject: [Python-Dev] weakattr
In-Reply-To: <20060701213745.10E0.JCARLSON@uci.edu>
References: <1d85506f0607010649y65c9fb24mbc5c636360970e52@mail.gmail.com> <20060701213745.10E0.JCARLSON@uci.edu>
Message-ID: <1d85506f0607031000q578c8dccka762b13ba3413de7@mail.gmail.com>

> I like the added functionality offered with weakattrs as defined. I'm
> not terribly in love with the syntax of their creation, and I'm curious
> as to how it plays with __slots__

weakattrs are data descriptors, just like properties etc.
they are part of the class, not the instance, so there shouldn't be any trouble with mixing those with __slots__

moreover, adding those to stdlib is very straightforward. we don't even need to introduce a new module. if people show interest, i'll write a bit of a longer doc string and add some unit tests (although there's not much to test :) )

> Toss it out in python-list, I think some people over there would be able
> to offer more feedback.

will do... although i doubt they will offer any

-tomer

On 7/2/06, Josiah Carlson wrote:
>
> "tomer filiba" wrote:
> > weakattr (weak attributes) are attributes that are weakly referenced
> > by their containing object. they are very useful for cyclic references --
> > an object that holds a reference to itself.
>
> I like the added functionality offered with weakattrs as defined. I'm
> not terribly in love with the syntax of their creation, and I'm curious
> as to how it plays with __slots__ (not quite having the time to look at
> its implementation right now), but it is quite explicit, so I can get
> past that. It would allow us to say, "stop using __del__, use weakattrs",
> but I'm not sure how well that would work, generally.
>
> Toss it out in python-list, I think some people over there would be able
> to offer more feedback.
>
> - Josiah
>
>

From facundobatista at gmail.com Mon Jul 3 20:07:20 2006
From: facundobatista at gmail.com (Facundo Batista)
Date: Mon, 3 Jul 2006 15:07:20 -0300
Subject: [Python-Dev] Time-out in URL Open
Message-ID:

I need a timeout in urlopen, just to be able to make:

>>> urllib2.urlopen("http://no.host.org", timeout=2)

This is actually not possible, but I'll make it work.

I want to know, please, if this is useful in general, for me to post a patch in SF.

Regards,

--
. Facundo

Blog: http://www.taniquetil.com.ar/plog/
PyAr: http://www.python.org/ar/

From fdrake at acm.org Mon Jul 3 20:12:22 2006
From: fdrake at acm.org (Fred L. Drake, Jr.)
Date: Mon, 3 Jul 2006 14:12:22 -0400 Subject: [Python-Dev] Time-out in URL Open In-Reply-To: References: Message-ID: <200607031412.22874.fdrake@acm.org> On Monday 03 July 2006 14:07, Facundo Batista wrote: > I want to know, please, if this is useful in general, for me to post a > patch in SF. It seems like something that should be easy, and lots of people need to consider this for applications. -Fred -- Fred L. Drake, Jr. From martin at v.loewis.de Mon Jul 3 22:27:56 2006 From: martin at v.loewis.de (=?ISO-8859-1?Q?=22Martin_v=2E_L=F6wis=22?=) Date: Mon, 03 Jul 2006 22:27:56 +0200 Subject: [Python-Dev] Time-out in URL Open In-Reply-To: References: Message-ID: <44A97DCC.2040202@v.loewis.de> Facundo Batista wrote: >>>> urllib2.urlopen("http://no.host.org", timeout=2) > > This is actually not possible, but I'll make it work. > > I want to know, please, if this is useful in general, for me to post a > patch in SF. While it might be useful, it can only be added to Python 2.6 now. So take your time with that patch. Regards, Martin From guido at python.org Mon Jul 3 23:00:38 2006 From: guido at python.org (Guido van Rossum) Date: Mon, 3 Jul 2006 23:00:38 +0200 Subject: [Python-Dev] Lexical scoping in Python 3k In-Reply-To: <006c01c69eb9$323f37a0$6402a8c0@arkdesktop> References: <006c01c69eb9$323f37a0$6402a8c0@arkdesktop> Message-ID: On 7/3/06, Andrew Koenig wrote: > > Much though the Algol 60 tickles my nostalgia (it was my first > > programming language!) I don't think that it's a particularly strong > > argument. I like to think that we have better ways these days. > > Even if so, that's not the point I was trying to make. The point is that > there is a programming technique that is widely used, works in many > languages, and has been around for 45 years; and when you try to use it in > Python, it fails. That's true for lots of things that have been around for a long time. Can you provide a better example? 
(The use of += is not particularly relevant to the example; it could just as well have said "result = result + i".)

> I believe that such failures, even if there are alternative ways of solving
> the problems that engender them, are barriers to learning that should be
> removed if it is possible to do so without substantial cost.

And that is of course the crucial question.

Probably the only proposal that has any chance of succeeding is to extend the 'global' statement so that it also applies to variables in intermediate outer scopes; or perhaps a new keyword (since "global" is not a very good name for the extended semantics). We would have to decide what this example would do:

    def outer():
        def inner1(x):
            global a
            a = x
        def inner2():
            return a
        return inner1, inner2

    f1, f2 = outer()
    g1, g2 = outer()

    f1(42)
    g1(0)
    print f2()    # Does it print 0 or 42 ???

In current Python this prints 0: there's only one (global) variable a, and the call to g1(0) overrides the value that was stored by f1(42). If global were changed to mean "nonlocal" what should it do? The question the example poses is that a is not initialized except in inner1() -- we somehow have to decide whether this is an error, or whether it chooses some well-defined outer scope, with the choices being the nearest enclosing scope or the outermost (truly global) scope.

We have one guideline: if there is exactly one outer scope that defines a variable named a, we would like it to be referenced (by the 'global a' statement and the variable references governed by it) automatically. Also, if there's more than one such scope, we'd like it to reference the innermost one. But this doesn't have a natural extension to what should happen if there's no such scope!

Perhaps the best solution would be to make it an error if there wasn't a visible variable named a in an outer scope. That would suit me fine because I'd like to migrate towards more static analysis of variables anyway.
If that means equipping modules with traps for attempts to store arbitrary variables into their namespaces, that's fine with me (as long as there's some escape -- and of course instances and classes remain fully dynamic).

--
--Guido van Rossum (home page: http://www.python.org/~guido/)

From guido at python.org Mon Jul 3 23:06:02 2006
From: guido at python.org (Guido van Rossum)
Date: Mon, 3 Jul 2006 23:06:02 +0200
Subject: [Python-Dev] Time-out in URL Open
In-Reply-To:
References:
Message-ID:

To fake things like this, socket.setdefaulttimeout() was added, though I don't know if it actually works. Have you tried that?

--Guido

On 7/3/06, Facundo Batista wrote:
> I need a timeout in urlopen, just to be able to make:
>
> >>> urllib2.urlopen("http://no.host.org", timeout=2)
>
> This is actually not possible, but I'll make it work.
>
> I want to know, please, if this is useful in general, for me to post a
> patch in SF.
>
> Regards,
>
> --
> . Facundo
>
> Blog: http://www.taniquetil.com.ar/plog/
> PyAr: http://www.python.org/ar/
> _______________________________________________
> Python-Dev mailing list
> Python-Dev at python.org
> http://mail.python.org/mailman/listinfo/python-dev
> Unsubscribe: http://mail.python.org/mailman/options/python-dev/guido%40python.org
>

--
--Guido van Rossum (home page: http://www.python.org/~guido/)

From skip at pobox.com Mon Jul 3 23:30:11 2006
From: skip at pobox.com (skip at pobox.com)
Date: Mon, 3 Jul 2006 16:30:11 -0500
Subject: [Python-Dev] Time-out in URL Open
In-Reply-To:
References:
Message-ID: <17577.35939.965788.902096@montanaro.dyndns.org>

    Facundo> I need a timeout in urlopen, just to be able to make:

    >>>> urllib2.urlopen("http://no.host.org", timeout=2)

    Facundo> This is actually not possible, but I'll make it work.

    Facundo> I want to know, please, if this is useful in general, for me to
    Facundo> post a patch in SF.

As others have posted, yes, it would be useful for 2.6.
However, you should consider how that might be applied to the other Internet service modules (ftplib, telnetlib, urllib, etc). Skip From skip at pobox.com Mon Jul 3 23:31:12 2006 From: skip at pobox.com (skip at pobox.com) Date: Mon, 3 Jul 2006 16:31:12 -0500 Subject: [Python-Dev] Time-out in URL Open In-Reply-To: References: Message-ID: <17577.36000.58814.231162@montanaro.dyndns.org> Guido> To fake things like this, socket.setdefaulttimeout() was added, Guido> though I don't know if it actually works. Have you tried that? I'm pretty sure it does, but is a rather blunt instrument for the task, as it affects all socket connections the app might make. Skip From jjl at pobox.com Mon Jul 3 23:57:36 2006 From: jjl at pobox.com (John J Lee) Date: Mon, 3 Jul 2006 21:57:36 +0000 (UTC) Subject: [Python-Dev] Time-out in URL Open In-Reply-To: References: Message-ID: On Mon, 3 Jul 2006, Guido van Rossum wrote: > To fake things like this, socket.setdefaulttimeout() was added, though > I don't know if it actually works. Have you tried that? [...] It works. I think there's some issue with SSL, though (can't seem to find the issue now). Of course, feeding through the timeout to the individual protocol modules would be a good thing.
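
The "blunt instrument" point about socket.setdefaulttimeout() can be seen without touching the network. This is only a minimal sketch, written for present-day Python 3 rather than the 2.x releases discussed in the thread; the variable names are illustrative and not taken from any message:

```python
import socket

# Assumption: Python 3; all variable names here are illustrative only.
original = socket.getdefaulttimeout()    # normally None (blocking sockets)

socket.setdefaulttimeout(2.0)            # process-wide: applies to every new socket
a = socket.socket()
b = socket.socket()
a_timeout = a.gettimeout()
b_timeout = b.gettimeout()

# A per-socket setting overrides the default for that one socket only.
b.settimeout(0.5)
b_override = b.gettimeout()
a_after = a.gettimeout()

a.close()
b.close()
socket.setdefaulttimeout(original)       # put the global state back

print(a_timeout, b_timeout, b_override, a_after)
```

Every socket created after the call picks up the default, which is why passing a timeout down to the individual protocol modules is the finer-grained alternative.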
John From billchi at microsoft.com Tue Jul 4 01:17:36 2006 From: billchi at microsoft.com (Bill Chiles) Date: Mon, 3 Jul 2006 16:17:36 -0700 Subject: [Python-Dev] 2.5 and beyond In-Reply-To: <06Jun30.183119pdt."58641"@synergy1.parc.xerox.com> Message-ID: <9B706ACDA57B23438AAE4822369672AF04062AD1@RED-MSG-10.redmond.corp.microsoft.com> For Common Lispers and probably Schemers, Python has some surprising semantics around scope and lifetime extent of variables. Three that leap out at me are: * function parameters with default values are NOT new bindings for each invocation, so a default value of [] changes if you destructively modify this list object in the function * loop variables are NOT distinct lexical variables. The binding gloms on to a variable in the function's scope, both changing that lexical binding and not creating a new one for the loop (that goes away when the loop's scope ends) * loop variables are NOT distinct bindings per iteration, leading to the surprising results below Bill -----Original Message----- From: python-dev-bounces+billchi=microsoft.com at python.org [mailto:python-dev-bounces+billchi=microsoft.com at python.org] On Behalf Of Bill Janssen Sent: Friday, June 30, 2006 6:31 PM To: Giovanni Bajo Cc: Phillip J. Eby; Ka-Ping Yee; Guido van Rossum; Tim Peters; python-dev at python.org Subject: Re: [Python-Dev] 2.5 and beyond > >>> a = [] > >>> for i in range(10): > ... a.append(lambda: i) > ... > >>> print [x() for x in a] > [9, 9, 9, 9, 9, 9, 9, 9, 9, 9] Isn't this exactly what you'd expect? Maybe I've been writing Python for too long... :-). 
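
Two of the behaviours listed above (shared default values and a single binding for the loop variable) can be reproduced in a few lines. This is a sketch in Python 3 syntax; the function name append_to and the list names are made up for illustration:

```python
# The default list is created once, when the def statement runs,
# so every call that omits `bucket` shares the same object.
def append_to(item, bucket=[]):          # hypothetical helper for illustration
    bucket.append(item)
    return bucket

first = append_to(1)
second = append_to(2)

# Each lambda looks up i when it is called, not when it is created,
# and the loop rebinds one variable rather than making a new one per pass.
late = []
for i in range(10):
    late.append(lambda: i)
late_results = [f() for f in late]

# Capturing the current value as a default argument gives one value per lambda.
early = []
for i in range(10):
    early.append(lambda i=i: i)
early_results = [f() for f in early]

print(second, late_results, early_results)
```

The default-argument idiom in the last loop is the usual workaround when a distinct value per iteration is wanted.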
Bill

From jcarlson at uci.edu Tue Jul 4 02:20:28 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Mon, 03 Jul 2006 17:20:28 -0700
Subject: [Python-Dev] 2.5 and beyond
In-Reply-To: <9B706ACDA57B23438AAE4822369672AF04062AD1@RED-MSG-10.redmond.corp.microsoft.com>
References: <06Jun30.183119pdt."58641"@synergy1.parc.xerox.com> <9B706ACDA57B23438AAE4822369672AF04062AD1@RED-MSG-10.redmond.corp.microsoft.com>
Message-ID: <20060703171546.1108.JCARLSON@uci.edu>

"Bill Chiles" wrote:
>
> For Common Lispers and probably Schemers, Python has some surprising
> semantics around scope and lifetime extent of variables. Three that
> leap out at me are:

One thing to remember is that Python is not Scheme/Lisp. It borrows some ideas from Scheme/Lisp, but that borrowing does not necessitate that it also use a completely equivalent scoping mechanism. From what I have been hearing about Python 2.6, and 3.0, the three "surprises" you describe are not going to be "fixed" (with respect to expected Scheme/Lisp semantics). Feel free to argue as to why they should be "fixed" in Py3k (unless Guido says, "you're dreaming"), but please do so in the py3k list.

 - Josiah

> * function parameters with default values are NOT new bindings for each
> invocation, so a
> default value of [] changes if you destructively modify this list
> object in the function
> * loop variables are NOT distinct lexical variables.
The binding gloms > on to a variable in the > function's scope, both changing that lexical binding and not creating > a new one for the > loop (that goes away when the loop's scope ends) > * loop variables are NOT distinct bindings per iteration, leading to > the surprising results > below From guido at python.org Tue Jul 4 06:56:41 2006 From: guido at python.org (Guido van Rossum) Date: Tue, 4 Jul 2006 06:56:41 +0200 Subject: [Python-Dev] 2.5 and beyond In-Reply-To: <20060703171546.1108.JCARLSON@uci.edu> References: <9B706ACDA57B23438AAE4822369672AF04062AD1@RED-MSG-10.redmond.corp.microsoft.com> <20060703171546.1108.JCARLSON@uci.edu> Message-ID: On 7/4/06, Josiah Carlson wrote: > One thing to remember is that Python is not Scheme/Lisp. It borrows > some ideas from Scheme/Lisp, I can say it stronger. Any resemblance between Python and Scheme or Lisp is purely a coincidence. Neither language is in Python's ancestry, at least not explicitly; I'd never used or tried to learn Scheme when I started Python (still haven't) and my Lisp experience was limited to copying Emacs startup code from friends (still is). -- --Guido van Rossum (home page: http://www.python.org/~guido/) From g.brandl at gmx.net Tue Jul 4 09:03:44 2006 From: g.brandl at gmx.net (Georg Brandl) Date: Tue, 04 Jul 2006 09:03:44 +0200 Subject: [Python-Dev] Time-out in URL Open In-Reply-To: <17577.35939.965788.902096@montanaro.dyndns.org> References: <17577.35939.965788.902096@montanaro.dyndns.org> Message-ID: skip at pobox.com wrote: > Facundo> I need a timeout in urlopen, just to be able to make: > > >>>> urllib2.urlopen("http://no.host.org", timeout=2) > > Facundo> This is actually not possible, but I'll make it work. > > Facundo> I want to know, please, if this is useful in general, for me to > Facundo> post a patch in SF. > > As others have posted, yes, it would be useful for 2.6. 
However, you should
> consider how that might be applied to the other Internet service modules
> (ftplib, telnetlib, urllib, etc).

There was one patch that did this: http://python.org/sf/723312.

Georg

From talin at acm.org Tue Jul 4 09:59:50 2006
From: talin at acm.org (Talin)
Date: Tue, 04 Jul 2006 00:59:50 -0700
Subject: [Python-Dev] Explicit Lexical Scoping (pre-PEP?)
Message-ID: <44AA1FF6.1050501@acm.org>

This is sort of a re-do of an earlier proposal which seems to have gotten lost in the shuffle of the larger debate.

I propose to create a new type of scoping rule, which I will call "explicit" lexical scoping, that will co-exist with the current "implicit" scoping rule that exists in Python today.


Definitions:

Implicit scoping is what we have now - a variable is defined within a scope implicitly by assignment. More specifically, when a name is assigned, the name is defined at the innermost function-level scope from where the assignment took place.

Explicit scoping is where the programmer explicitly specifies which scope the variable should be defined in. Unlike implicit scoping, assignments to a named variable do not automatically redefine that variable within the current scope.


Syntax:

Borrowing from Perl, the keyword 'my' is used to declare an explicitly scoped variable:

    def f1():
       my x = 1
       def f2():
          x = 2       # Does not create a new x

In the above example, the statement 'my x = 1' declares that the scope of the variable 'x' is the outer function f1. Any assignment to x will modify the existing x, rather than creating a new definition.

Note that the 'my' prefix can be combined with an assignment operation. It is anticipated that the 'my' prefix will be used quite frequently (and encouraged), so it makes sense to cut down on the number of statements by combining declaration and assignment.
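
The 'my' keyword above is proposed syntax and does not run in any Python release. Purely for comparison, here is a sketch of the same f1/f2 arrangement using the nonlocal statement that Python 3 later introduced; note that it is a different technique, since it declares the scope where the variable is used rather than where it is defined:

```python
def f1():
    x = 1
    def f2():
        nonlocal x   # declared at the point of use, unlike the proposed 'my'
        x = 2        # rebinds f1's x instead of creating a new local
    f2()
    return x

result = f1()
print(result)
```

Without the nonlocal line, the assignment in f2 would create a new local x and f1 would still return 1.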
Explicitly scoped variables can also be declared at the module level:

    my x = 1
    def f1():
       x = 2       # Modifies the global X

Declaring a module-level variable with an explicit scope eliminates the need for a 'global' statement for that variable.


Nested Scopes:

Each occurrence of the keyword 'my' creates a new scope which hides any outer definitions of the name. So for example:

    my x = 1
    def f1():
       my x = 2    # This is a different 'x' than the global
       def f2():
          x = 3    # This is the 'x' defined within f1()


Interaction between explicit scoping and globals:

The 'global' statement, when used with explicitly scoped variables, means exactly the same as it does with implicitly scoped variables: It allows access to the outermost scope, overriding any intermediate definitions in surrounding scopes:

    x = 1
    def f1():
       my x = 2
       def f2():
          global x
          x = 3    # This is the module-level 'x'


Explicit scoping and code block structure:

Implicitly scoped variables are always defined at the nearest enclosing function scope, even if they are created within a code block.

It might be worth considering allowing explicitly scoped variables to be defined within other scopes. For example, we might choose to allow explicit scope declarations to be limited to the current suite:

    def f1():
       for x in range(0,10):
          my y = x*x  # A new definition of y for each iteration

Note that this is a speculation only, and not a core part of the proposal (so please don't reject the proposal on this one point.)


Formal definition:

When a value is assigned to a local variable name, the rules for determining which scope the variable will be defined in are as follows:

   1) Starting with the current (innermost) scope, examine all of the currently active scopes:
      1a) If the current scope contains a 'global' statement for the given name, then set the result scope to the outermost (module-level) scope.
      1b) If the current scope contains a 'my' statement for the given name, then set the result scope to the scope in which the 'my' statement occurred.
   2) Otherwise, continue until we run out of scopes. If neither a 'global' nor a 'my' declaration was discovered, then use the innermost scope as the result scope.


How is this different from 'outer'?

The explicit scope proposal requires that the scope be specified at the place where the variable is *defined* as opposed to where it is *used*. This definition is inherited by all inner scopes.

This allows a finer degree of control, for less typing, than the 'outer' proposal. With explicit scoping, there is no confusion as to which scope is being considered; and explicit scoping allows a single declaration of a variable to be shared by many different inner scopes, which would otherwise require a separate 'outer' statement for each one.


Explicit scoping and static analysis:

It should be easier to do static analysis of code with explicit scoping, since you always know what scope a variable is defined in (as opposed to implicit scoping, where a variable may switch from global to local as a result of an assignment.)

Note that this implies that the creation of the scope does not occur at the time of the assignment, but rather at the time the function is entered. Thus:

    x = 1
    def f1():
       print x     # Error, unassigned value
       my x = 2

In the above example, even though the 'my' statement occurs after the print, the scope created by the 'my' statement is in effect for the entire function, although the actual *assignment* takes place after the print. The reason for this is that the scope creation is actually done by the compiler.
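
For comparison, implicit scoping already fixes a name's scope for the whole function at compile time, so the existing language shows an analogous error. A minimal sketch in Python 3 syntax (print as a function, and the 'my' line replaced by an ordinary assignment); the variable outcome is illustrative only:

```python
x = 1

def f1():
    print(x)   # x is local to f1 because of the assignment on the next line
    x = 2

try:
    f1()
    outcome = "no error"
except UnboundLocalError:
    # The local x exists for the entire function but has no value yet.
    outcome = "UnboundLocalError"

print(outcome)
```

The module-level x is never consulted inside f1, even though the print comes before the assignment.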
-- Talin From python-dev at zesty.ca Tue Jul 4 12:32:00 2006 From: python-dev at zesty.ca (Ka-Ping Yee) Date: Tue, 4 Jul 2006 05:32:00 -0500 (CDT) Subject: [Python-Dev] doc for new restricted execution design for Python In-Reply-To: References: Message-ID: Hi Brett, Here are some comments on the description of the restricted execution model that you posted. > When referring to the state of an interpreter, it is either "trusted" or > "untrusted". A trusted interpreter has no restrictions imposed upon any > resource. An untrusted interpreter has at least one, possibly more, resource > with a restriction placed upon it. In response to Guido's comment about confusing the words "trusted" and "untrusted", how about "empowered" and "restricted"? > When the Interpreter Is Embedded > ================================ > > Single Untrusted Interpreter > ---------------------------- > > This use case is when an application embeds the interpreter and never has more > than one interpreter running. > > The main security issue to watch out for is not having default abilities be > provided to the interpreter by accident. I'd rather rephrase this in the opposite direction. The onus shouldn't be on the application to hunt down each possible dangerous authority and deactivate them all one by one. The main security issue is to let the application choose which abilities it wants the restricted interpreter to have, and then ensure that the restricted interpreter gets only those abilities. > Multiple Untrusted Interpreters > ------------------------------- > > When multiple interpreters, all untrusted at varying levels, need to be > running within a single application. This is the key use case that this > proposed design is targetted for. > > On top of the security issues from a single untrusted interpreter, > there is one additional worry. Resources cannot end up being leaked > into other interpreters where they are given escalated rights. 
What is your model here for communication between interpreters? If two interpreters can communicate, any attempt to "prevent leakage" of resources is meaningless. When you say "leaked into other interpreters" are you talking about a Python object leaking or something else at a lower level? Suppose for example that the application wants to embed two interpreters, P and Q, and that the application wants P to be able to write files but Q to be restricted against writing files. When you say "leaked" above, that suggests to me that you want to prevent something like # code running in P import spam f = open('/home/doofus/.ssh/authorized_keys', 'a') spam.f = f # code running in Q import spam spam.f.write('blargh') The above example supposes that P and Q can communicate through a shared module, spam, where they can pass Python objects. But notice that even if you prevent them from passing Python objects like open files, any form of communication is sufficient to leak resources: # code running in P def add_key(key): f = open('/home/doofus/.ssh/authorized_keys', 'a') f.write(key + '\n') f.close() import socket s = socket.socket() s.bind(('', 6666)) s.listen(1) ns, addr = s.accept() add_key(ns.recv(100)) # code running in Q import webbrowser webbrowser.open('http://localhost:6666/zebra') As long as P can listen for instructions from Q, it can give Q the power to write to the filesystem. > Filesystem > =================== > > The most obvious facet of a filesystem to protect is reading from it. > One does not want what is stored in ``/etc/passwd`` to get out. And > one also does not want writing to the disk unless explicitly allowed > for basically the same reason; if someone can write ``/etc/passwd`` > then they can set the password for the root account. There's a big difference between modifying (or erasing) an existing file and writing a new file (e.g. for temporary storage). 
If i give you a little filesystem of your own to play in, and it starts out empty, you can put whatever you want in it without violating my secrecy or the integrity of my files. I think you should be talking about this in terms of specifically what abilities you want to be able to allow, based on examples of real-life applications. > Physical Resources > =================== > > Memory should be protected. It is a limited resource on the system > that can have an impact on other running programs if it is exhausted. > Being able to restrict the use of memory would help alleviate issues > from denial-of-service (DoS) attacks. > Networking > =================== > > Networking is somewhat like the filesystem in terms of wanting similar > protections. You do not want to let untrusted code make tons of socket > connections or accept them to do possibly nefarious things (e.g., acting > as a zombie). > > You also want to prevent finding out information about the network you are > connected to. This includes doing DNS resolution since that allows one > to find out what addresses your intranet has or what subnets you use. Again, it's risky to describe only individual cases of things to prevent. What networking abilities are safe or necessary for the kinds of applications you have in mind? Start from nothing and work up from there. > Interpreter > =================== > > One must make sure that the interpreter is not harmed in any way. > There are several ways to possibly do this. One is generating > hostile bytecode. Another is some buffer overflow. In general any > ability to crash the interpreter is unacceptable. This is hard for me to understand. What exactly do you trust and not trust? It seems to me that crashing an interpreter is only a problem if a single interpreter is running both trusted and untrusted code -- then if the untrusted code crashes the interpreter, the trusted code suffers. But there doesn't seem to be any such thing in your model. 
Each interpreter is either trusted or untrusted. If the interpreter is trusted, and the code running in it causes it to crash, i assume you would consider that to be the code's "own fault", right? And if the interpreter is untrusted, and the code running in it causes it to crash, then the code has only harmed itself. It seems to me that we need only be concerned about crashing when the crash of an embedded interpreter will bring down its host application, or there are multiple interpreters embedded at once and one interpreter causes another interpreter to crash. > Resource Hiding > ============================= [...] > This can be viewed as a passive system for security. [...] > Resource Crippling > ============================= > Another approach to security is to provide constant, proactive security > checking of rights to use a resource. I think you have this backwards. Resource hiding is proactive: before untrusted code has a chance to abuse anything, you decide what you want to allow it to do. It defaults to no access, and only gets access to resources you have proactively decided to provide. Resource crippling is the opposite: it begins by giving carte blanche to the untrusted code, then you run around trying to plug holes by stopping everything you don't want. This is a lot more work, and it is also much more dangerous. If you forget to plug even one hole, you're hosed. Back to what you wrote about resource hiding: > This can be viewed as a passive system for security. Once a resource > has been given to code there are no more checks to make sure the > security model is being violated. This last sentence doesn't make any sense. If you decided to give the resource, how is using the resource a violation? Either you want to enable the resource or you don't. If you want to enable it, give it; if you don't, don't give it. 
As a criticism of the resource hiding approach, it's a red herring -- there's no way to interpret this sentence that doesn't make it also an unfalsifiable criticism of any possible security model.

> The most common implementation of resource hiding is capabilities.
> In this type of system a resource's reference acts as a ticket that
> represents the right to use the resource. Once code has a reference
> it is considered to have full use of that resource it represents and
> no further security checks are performed.

Same thing. What "further security checks" are we worried about? Would it check to see whether we've authorized the interpreter to have access to the resource ... which we already know to be true?

> To allow customizable restrictions one can pass references to wrappers of
> resources. This allows one to provide custom security to resources instead of
> requiring an all-or-nothing approach.

The ability to customize security restrictions is an important advantage of the resource hiding approach, since resource crippling requires that the architect of the security model anticipate every possible security restriction that future programmers might need.

Using resource crippling is analogous to removing "def" from the language and requiring Python programmers to only use functions that are provided in the built-in modules instead of writing their own functions.

> To use an analogy, imagine you are providing security for your home.
> With capabilities, security came from not having any way to know
> where your house is without being told where it was; a reference
> to its location. You might be able to ask a guard (e.g., Java's
> ClassLoader) for a map, but if they refuse there is no way for you
> to guess its location without being told. But once you knew where
> it was, you had complete use of the house.

This analogy is only fair if you compare it to the same analogy for the resource crippling approach.
Resource crippling doesn't get you any finer-grained control either! The comparison story is: With resource crippling, security comes from having a guard at the door to your house. When a Python interpreter comes up to the door, the guard checks to see if the interpreter has permission to enter the house, and if it does, then it gets complete use of the house. Why is the granularity of control described as the whole house in the resource-hiding story, but as each door in the house in the resource-crippling story? > And that complete access is an issue with a capability system. > If someone played a little loose with a reference for a resource > then you run the risk of it getting out. Could you be more specific about what you mean by "it getting out"? If you mean getting from a trusted interpreter to an untrusted interpreter -- then how is a resource going to travel between interpreters? Or if not, then are you thinking of a situation in which one piece of code is trusted with the resource, but another piece of code is not, and both are running in the same interpreter? -- ?!ng From fuzzyman at voidspace.org.uk Tue Jul 4 13:08:21 2006 From: fuzzyman at voidspace.org.uk (Fuzzyman) Date: Tue, 04 Jul 2006 12:08:21 +0100 Subject: [Python-Dev] doc for new restricted execution design for Python In-Reply-To: References: Message-ID: <44AA4C25.3020706@voidspace.org.uk> Ka-Ping Yee wrote: >Hi Brett, > >Here are some comments on the description of the restricted execution >model that you posted. > > [snip...] > >>Filesystem >>=================== >> >>The most obvious facet of a filesystem to protect is reading from it. >>One does not want what is stored in ``/etc/passwd`` to get out. And >>one also does not want writing to the disk unless explicitly allowed >>for basically the same reason; if someone can write ``/etc/passwd`` >>then they can set the password for the root account. 
>> >> > >There's a big difference between modifying (or erasing) an existing file >and writing a new file (e.g. for temporary storage). If i give you a >little filesystem of your own to play in, and it starts out empty, you >can put whatever you want in it without violating my secrecy or the >integrity of my files. > >I think you should be talking about this in terms of specifically >what abilities you want to be able to allow, based on examples of >real-life applications. > > > > As an adjunct to this, one of the barriers to Javascript applications is the lack of client-side data persistence. This makes (amongst other things) offline, or entirely clientside, applications very difficult. All sorts of novel ways round this have been found [ http://codinginparadise.org/weblog/2006/04/now-in-browser-near-you-offline-access.html ]. If a 'standard' interpreter running in the browser had sandboxed access to the filesystem, this would be great. Of course Mozilla would probably disable it by default, and only provide horrific means for users to re-enable it [ http://www.mozilla.org/editor/midasdemo/securityprefs.html ]. All the best, Michael Foord http://www.voidspace.org.uk/python/index.shtml > > >-- ?!ng >_______________________________________________ >Python-Dev mailing list >Python-Dev at python.org >http://mail.python.org/mailman/listinfo/python-dev >Unsubscribe: http://mail.python.org/mailman/options/python-dev/fuzzyman%40voidspace.org.uk > > > From theller at python.net Tue Jul 4 13:35:13 2006 From: theller at python.net (Thomas Heller) Date: Tue, 04 Jul 2006 13:35:13 +0200 Subject: [Python-Dev] 2.5 and beyond In-Reply-To: References: Message-ID: Neal Norwitz schrieb: > I'm glad to see Anthony ratcheting down. At this point, we need to be > fixing bugs and improving doc. Maybe Anthony and I should have a > contest to see who can revert the most changes. 
:-)
>

Neal (and/or Anthony),

I would like to ask about the possibility of adding some improvements to ctypes in Python 2.5, although the feature freeze is now in effect. Hopefully former third-party libraries can have the freeze relaxed somewhat;-).

I intend to do these changes, the first is a small and trivial one, but allows a lot of flexibility:

- Remove the restriction that the argtypes attribute of foreign functions must be ctypes types. Instead they are only required to implement a .from_param class method. The advantage is that custom objects can be used as function parameters. One use case is to allow numpy arrays as function parameters without any conversion - this change at least makes it possible to code this in Python. The patch is attached as from_param.patch.

The second one is more involved, and not yet complete. I can post the patch or a link to it for review when it is implemented completely:

- Implement the __array_struct__ attribute as described by the numpy pep at http://numpy.scipy.org/array_interface.html. The properties needed to implement the __array_struct__ attribute could be calculated from a given ctypes array type, however, it would be more efficient to calculate them at type creation time. This requires the StgDictObject that holds information about the ctypes type to grow a few fields: 'int nd' - contains the number of dimensions, 'char typekind' - a struct-like character for the item type, and 'Py_intptr_t *shape' - an array of size 'nd' containing shape information.

Thanks for investigating this,

Thomas

-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: from_param.patch Url: http://mail.python.org/pipermail/python-dev/attachments/20060704/3c0500e4/attachment.pot From ncoghlan at iinet.net.au Tue Jul 4 14:32:38 2006 From: ncoghlan at iinet.net.au (Nick Coghlan) Date: Tue, 04 Jul 2006 22:32:38 +1000 Subject: [Python-Dev] Proposed beta 2 changes (Q for Anthony/Neal) Message-ID: <44AA5FE6.2030801@iinet.net.au> I've got a couple of changes ready to go for beta 2, but need a go ahead from one of the release managers before committing either of them: 1. Finishing the __module_name__ workaround to allow relative imports from the main module when using -m. I'd really like to finish this, because having PEP 328 and 338 not playing well together is a wart that's quite visible to end users. I'd rather not have people's first encounter with the features provided by either PEP involve discovering that they're "broken". The patch to fix this also takes care of adding a couple of paragraphs to the tutorial about explicit relative imports (which aren't currently written up in the main documentation). The patch is attached directly to the beta 1 bug report about the problem [1]. (Guido gave a +1 to the concept, but explicitly deferred to Anthony and Neal as to whether or not the fix should go in for beta 2) 2. Adding an 'ignore' filter for ImportWarning at the end of warnings.py This is a safe and easy fix to silence import warning spam for people that don't want it. I don't believe there are any Pending Deprecation Warnings at the moment, so -Wd at the command line would be sufficient to enable ImportWarning for people that want to see it. Adding the line "warnings.simplefilter('default', ImportWarning)" to sitecustomize.py would be sufficient for organisations to turn the warning on across the board if they so chose. 
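
The two filter settings mentioned above can be tried side by side. This is a minimal sketch for Python 3 that uses warnings.catch_warnings so the process-wide filters are left untouched; the warning messages and the counter names are made up for illustration:

```python
import warnings

with warnings.catch_warnings(record=True) as caught:
    # The proposed default: silence ImportWarning.
    warnings.simplefilter('ignore', ImportWarning)
    warnings.warn("hypothetical message 1", ImportWarning)
    ignored_count = len(caught)

    # What a site could add to sitecustomize.py to turn it back on.
    # simplefilter() inserts at the front, so this takes precedence.
    warnings.simplefilter('default', ImportWarning)
    warnings.warn("hypothetical message 2", ImportWarning)
    shown_count = len(caught)

print(ignored_count, shown_count)
```

Because later simplefilter() calls take precedence over earlier ones, a site-wide setting applied after warnings.py is imported would win over an 'ignore' entry added at the end of that module.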
The reason I haven't checked this in directly is that there's no point if Anthony and/or Neal intend to accept one of the patches that tries to make the import machinery more intelligent about missing __init__.py files.

Cheers,
Nick.

[1] http://www.python.org/sf/1510172

--
Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia
---------------------------------------------------------------
http://www.boredomandlaziness.org

From rasky at develer.com Tue Jul 4 14:43:40 2006
From: rasky at develer.com (Giovanni Bajo)
Date: Tue, 4 Jul 2006 14:43:40 +0200
Subject: [Python-Dev] Explicit Lexical Scoping (pre-PEP?)
References: <44AA1FF6.1050501@acm.org>
Message-ID: <010601c69f67$7a535f90$d503030a@trilan>

Talin wrote:

> This is sort of a re-do of an earlier proposal which seems to have
> gotten lost in the shuffle of the larger debate.
>
> I propose to create a new type of scoping rule, which I will call
> "explicit" lexical scoping, that will co-exist with the current
> "implicit" scoping rule that exists in Python today.

Interesting. What if for-loops implicitly used "my" on the iteration variable? That would solve the binding problem we were discussing and make lambdas "Do The Right Thing"(TM) when used in loops.

--
Giovanni Bajo

From ark at acm.org Tue Jul 4 16:16:59 2006
From: ark at acm.org (Andrew Koenig)
Date: Tue, 4 Jul 2006 10:16:59 -0400
Subject: [Python-Dev] Explicit Lexical Scoping (pre-PEP?)
In-Reply-To: <44AA1FF6.1050501@acm.org>
Message-ID: <001901c69f74$86a28b60$6402a8c0@arkdesktop>

> Borrowing from Perl, the keyword 'my' is used to declare an explicitly
> scoped variable:
>
> def f1():
> my x = 1
> def f2():
> x = 2 # Does not create a new x
>
> In the above example, the statement 'my x = 1' declares that the scope
> of the variable 'x' is the outer function f1. Any assignment to x will
> modify the existing x, rather than creating a new definition.

-1, for this reason:

    def f():
       x = 2       # Does this create a local variable?

Today, the answer is yes.
Under this proposal, you can't answer the question without inspecting the entire context in which f is defined.

For that reason, I would much rather have the first assignment in a block say explicitly whether it is intended to create a local variable:

    def f1():
       x = 1
       def f2():
          global x
          x = 2    # Does not create a new x

This might even be abbreviated:

    def f1():
       x = 1
       def f2():
          global x = 2   # Equivalent to the last example above

From guido at python.org Tue Jul 4 16:28:30 2006
From: guido at python.org (Guido van Rossum)
Date: Tue, 4 Jul 2006 16:28:30 +0200
Subject: [Python-Dev] Explicit Lexical Scoping (pre-PEP?)
In-Reply-To: <44AA1FF6.1050501@acm.org>
References: <44AA1FF6.1050501@acm.org>
Message-ID:

Please move this to the python-3000 list.

Also please explain what problem you are solving before proposing a solution.

I note that we are seeing quite a flurry of language change proposals. I have to recommend restraint; I *don't* want to turn the entire language upside down. That's not a comment on this particular proposal, but on the issue of too many proposals. From actual users of the language I get more complaints about the breakneck speed of Python's evolution than about the brokenness of the current language.

--Guido

On 7/4/06, Talin wrote:
> This is sort of a re-do of an earlier proposal which seems to have
> gotten lost in the shuffle of the larger debate.
>
> I propose to create a new type of scoping rule, which I will call
> "explicit" lexical scoping, that will co-exist with the current
> "implicit" scoping rule that exists in Python today.
>
>
> Definitions:
>
> Implicit scoping is what we have now - a variable is defined within a
> scope implicitly by assignment. More specifically, when a name is
> assigned, the name is defined at the innermost function-level scope from
> where the assignment took place.
>
> Explicit scoping is where the programmer explicitly specifies which
> scope the variable should be defined in.
Unlike implicit scoping, > assignments to a named variable do not automatically redefine that > variable within the current scope. > > > Syntax: > > Borrowing from Perl, the keyword 'my' is used to declare an explicitly > scoped variable: > > def f1(): > my x = 1 > def f2(): > x = 2 # Does not create a new x > > In the above example, the statement 'my x = 1' declares that the scope > of the variable 'x' is the outer function f1. Any assignment to x will > modify the existing x, rather than creating a new definition. > > Note that the 'my' prefix can be combined with an assignment operation. > It is anticipated that the 'my' prefix will be used quite frequently > (and encouraged), so it makes sense to cut down on the number of > statements by combining declaration and assignment. > > Explicitly scoped variables can also be declared at the module level: > > my x = 1 > def f1(): > x = 2 # Modifies the global X > > Declaring a module-level variable with an explicit scope eliminates the > need for a 'global' statement for that variable. > > > Nested Scopes: > > Each occurance of the keyword 'my' creates a new scope which hides any > outer definitions of the name. So for example: > > my x = 1 > def f1(): > my x = 2 # This is a different 'x' than the global > def f2(): > x = 3 # This is the 'x' defined within f1() > > > Interaction between explicit scoping and globals: > > The 'global' statement, when used with explicitly scoped variables, > means exactly the same as it does with implicitly scoped variables: It > allows access to the outermost scope, overriding any intermediate > definitions in surrounding scopes: > > x = 1 > def f1(): > my x = 2 > def f2(): > global x > x = 3 # This is the module-level 'x' > > > Explicit scoping and code block structure: > > Implicitly scoped variables are always defined at the nearest enclosing > function scope, even if they are created within a code block. 
> > It might be worth considering allowing explicitly scoped variables to be > defined within other scopes. For example, we might choose to allow > explicit scope declarations to be limited to the current suite: > > def f1(): > for x in range(0,10): > my y = x*x # A new definition of y for each iteration > > Note that this is a speculation only, and not a core part of the > proposal (so please don't reject the proposal on this one point.) > > > Formal definition: > > When a value is assigned to a local variable name, the rules for > determining which scope the variable will be defined in are as follows: > > 1) Starting with the current (innermost) scope, examine all of the > currently active scopes: > 1a) If the current scope contains a 'global' statement for the > given name, then set the result scope to the outermost (module-level) scope. > 1b) If the current scope contains a 'my' statement for the given > name, then set the result scope to the scope in which the 'my' statement > occurred. > 2) Otherwise, continue until we run out of scopes. If neither a > 'global' or 'my' declaration was discovered, then use the innermost > scope as the result scope. > > > How is this different from 'outer'? > > The explicit scope proposal requires that the scope be specified at the > place where the variable is *defined* as opposed to where it is *used*. > This definition is inherited by all inner scopes. > > This allows a finer degree of control, for less typing, than the 'outer' > proposal. With explicit scoping, there is no confusion as to which scope > is being considered; And explicit scoping allows a single declaration of > a variable to be shared by many different inner scopes, which would > otherwise require a separate 'outer' statement for each one. 
> > > Explicit scoping and static analysis: > > It should be easier to do static analysis of code with explicit scoping, > since you always know what scope a variable is defined in (as opposed to > implicit scoping, where a variable may switch from global to local as a > result of an assignment.) > > Note that this implies that the creation of the scope does not occur at > the time of the assignment, but rather at the time the function is > entered. Thus: > > x = 1 > def f1(): > print x # Error, unassigned value > my x = 2 > > In the above example, even though the 'my' statement occurs after the > print, the scope created by the 'my' statement is in effect for the > entire function, although the actual *assignment* takes place after the > print. The reason for this is that the scope creation is actually done > by the compiler. > > -- Talin > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > http://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: http://mail.python.org/mailman/options/python-dev/guido%40python.org > -- --Guido van Rossum (home page: http://www.python.org/~guido/) From facundobatista at gmail.com Tue Jul 4 17:43:35 2006 From: facundobatista at gmail.com (Facundo Batista) Date: Tue, 4 Jul 2006 12:43:35 -0300 Subject: [Python-Dev] Time-out in URL Open In-Reply-To: References: Message-ID: 2006/7/3, Guido van Rossum : > To fake things like this, socket.setdefaulttimeout() was added, though > I don't know if it actually works. Have you tried that? This affect all the sockets. And I hit the problem when servicing information with a web service (TCPServer), and I need to timeout the connection of the URLOpen *only*. Regards, -- . 
Facundo

Blog: http://www.taniquetil.com.ar/plog/
PyAr: http://www.python.org/ar/
From fperez.net at gmail.com Tue Jul 4 22:14:43 2006
From: fperez.net at gmail.com (Fernando Perez)
Date: Tue, 04 Jul 2006 14:14:43 -0600
Subject: [Python-Dev] 2.5 and beyond
References:
Message-ID:

Thomas Heller wrote:

> I would like to ask about the possibility to add some improvements to
> ctypes
> in Python 2.5, although the feature freeze is now in effect. Hopefully
> former third-party libraries can have the freeze relaxed somewhat;-).
>
> I intend to do these changes, the first is a small and trivial one, but
> allows a lot of flexibility:

[...]

I'd just like to provide a bit of context for Thomas' request (disclaimer: he did NOT ask me to write this, nor did anyone else). I understand the release managers' need to be strict with the freeze, but perhaps knowing what's behind this particular change will help them make a more informed decision.

Numpy (http://numpy.scipy.org/) is the new python array package for numerical computing, which has been developed at enormous effort by Travis Oliphant (with community help) over the last year, as a way to unify the old Numeric package (written by Jim Hugunin, of Jython and IronPython fame) and Numarray (written by the Hubble telescope team). The effect of numpy in the community, even in its current pre-1.0 form, has been tremendous. There is a real, pressing need in the scientific world for open source and technically superior replacements to Matlab and IDL, the proprietary 800-lb gorillas of the field. Many major research institutions across the world are seriously looking at python as fulfilling this role, but the previous situation of a divided library (Numeric/numarray) was keeping a lot of people on the fence.

With Travis' effort and numpy maturing towards a 1.0 release right around the time of python 2.5, a LOT of people have come out of the woodwork to contribute code, ideas, documentation, etc.
There is a real sense that the combination of python2.5 (with better 64-bit and __index__ support) and numpy will provide a significant advancement for scientific computing with modern, high-level tools. In this particular community, the need to interface with low-level existing libraries is probably much more common than in other fields. There are literally millions of lines of C/C++ code for scientific work which we have to use efficiently, and this is an everyday need for us. While there are a number of tools for this (SWIG, Boost::Python, pyrex, scipy.weave,...), very recently people have discovered how useful ctypes can be for this task. One of the Google SoC projects (http://2006.planet-soc.com/blog/140) started trying to wrap libsvm with SWIG and a week of frustrated efforts led nowhere. Albert then discovered ctypes and in a few hours was up and running. This has generated a lot of interest in the numpy crowd for ctypes, and people would really, really like to see python2.5 come 'out of the box' with as solid a support as possible from ctypes for numpy array interfacing. Ultimately the decision is up to the release team, I know that. But at least with this info, I hope you can understand: 1. why this is important to this community 2. why the timing isn't ideal: it is only /very/ recently that the numpy team 'discovered' how much ctypes could truly help with a necessary (and often very unpleasant) task in the numerical/python world. 
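To give a flavor of why ctypes is attractive for this kind of work, here is a minimal sketch (not taken from numpy or from the SoC project) that hands a C array of doubles to the C library's qsort() without writing any wrapper code. It assumes a POSIX system where ctypes.CDLL(None) exposes the C library's symbols and a Python that ships ctypes; sort_doubles and _compare are made-up names for illustration.

```python
import ctypes

# Assumption: POSIX. CDLL(None) gives access to the symbols already loaded
# into the running process, which includes the C library.
libc = ctypes.CDLL(None)

# C signature of the comparison callback: int (*)(const double *, const double *)
CMPFUNC = ctypes.CFUNCTYPE(
    ctypes.c_int,
    ctypes.POINTER(ctypes.c_double),
    ctypes.POINTER(ctypes.c_double),
)

# void qsort(void *base, size_t nmemb, size_t size, int (*compar)(...));
libc.qsort.argtypes = [ctypes.c_void_p, ctypes.c_size_t, ctypes.c_size_t, CMPFUNC]
libc.qsort.restype = None


def _compare(a, b):
    """Hypothetical helper: three-way comparison of the two pointed-to doubles."""
    return (a[0] > b[0]) - (a[0] < b[0])


def sort_doubles(values):
    """Copy `values` into a C array, sort it in place with qsort(), return a list."""
    n = len(values)
    buf = (ctypes.c_double * n)(*values)  # contiguous C array of doubles
    libc.qsort(buf, n, ctypes.sizeof(ctypes.c_double), CMPFUNC(_compare))
    return list(buf)
```

Calling sort_doubles([3.5, -1.0, 2.25]) hands the buffer to C and gets it back sorted in ascending order. The same pattern (declare argtypes and restype, pass a contiguous buffer) is roughly what wrapping an existing numerical library would look like, although real array interfacing involves more than this toy shows.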
Thanks for reading,

f
From nnorwitz at gmail.com Tue Jul 4 23:21:55 2006
From: nnorwitz at gmail.com (Neal Norwitz)
Date: Tue, 4 Jul 2006 14:21:55 -0700
Subject: [Python-Dev] zlib module build failure on Mac OSX 10.4.7
In-Reply-To: <0006F35E-FD5F-4BAD-A608-642039BBE7E8@mac.com>
References: <17573.60593.759293.549583@montanaro.dyndns.org> <130298B1-7336-480A-A90F-97B68327BE4A@mac.com> <17574.43390.560231.425494@montanaro.dyndns.org> <315DECAC-D582-4692-A0CA-13D3C7BF992C@mac.com> <44A6C314.2000709@v.loewis.de> <0006F35E-FD5F-4BAD-A608-642039BBE7E8@mac.com>
Message-ID:

Ronald, Bob,

I know Skip found and fixed his problem, however, is this problem likely to affect other users? Is there anything we can do to help alleviate/diagnose this problem?

n
--

On 7/1/06, Ronald Oussoren wrote:
>
> On Jul 1, 2006, at 8:46 PM, Martin v. Löwis wrote:
>
> > Ronald Oussoren wrote:
> >> What I don't understand yet is why your copy of libz doesn't have
> >> inflateCopy.
> >
> > What I don't understand is that configure does not detect that.
>
> You may be onto something there. Skip, do you have another copy of
> libz somewhere? Given the link line in your first message either in /
> usr/local/lib or /Users/skip/local/lib. And if you have, is that a
> static library (libz.a) instead of a dylib?
>
> As background to my question: the linker on OSX behaves slightly
> different than the one on most other unix-y systems. It first
> searches the entire linker path for shared libraries (dylibs) before
> looking for static libraries. I added a flag to the link flags for
> the zlib extension a while back that changes the search order into a
> more traditional one: look in every directory on the linker path for
> either a dylib or static library. The new flag is -Wl,-
> search_paths_first.
>
> If skip does indeed have libz somewhere else we'll either have to
> make a matching update to configure, or roll back my change.
If the > latter I'll have to tweak the build script for the binary installer > for OSX because I want to link that using a static copy of libz for > binary compatibility with OSX 10.3.9. > > Ronald > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > http://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: http://mail.python.org/mailman/options/python-dev/nnorwitz%40gmail.com > > > > From talin at acm.org Tue Jul 4 23:39:44 2006 From: talin at acm.org (Talin) Date: Tue, 04 Jul 2006 14:39:44 -0700 Subject: [Python-Dev] Explicit Lexical Scoping (pre-PEP?) In-Reply-To: References: <44AA1FF6.1050501@acm.org> Message-ID: <44AAE020.5060204@acm.org> Guido van Rossum wrote: > Please move this to the python-3000 list. > > Also please explain what problem you are solving before proposing a > solution. > > I note that we are seeing quite a flurry of language change proposals. > I have to recommend restraint; I *don't* want to turn the entire > language upside down. That's not a comment on this particular > proposal, but on the issue of too many proposals. From actual users of > the language I get more complaints about the breakneck speed of > Python's evolution than about the brokenness of the current language. > > --Guido Actually, the "problem" I am trying to solve is the debate on the mailing list. That is, I listen to what people are asking for, and what disagreements they have, and then I try to provide a solution that resolves the debate. In this case, there was a lot of discussion about lexical scoping, and various people proposing solutions (such as redefining the behavior of 'global') that I thought were (a) problematic, and (b) not a lot of bang for the buck (i.e. the disruption vs. utility tradeoff was poor IMHO.) 
To be honest, I really have no stake in this proposal, and I don't intend to spend any time defending it other than to correct misperceptions - however, I offer it as a potential starting point for people who are interested in the whole lexical scoping issue. If someone feels that this proposal gives them what they want, then great - otherwise I'll drop it. -- Talin From guido at python.org Tue Jul 4 23:58:04 2006 From: guido at python.org (Guido van Rossum) Date: Tue, 4 Jul 2006 23:58:04 +0200 Subject: [Python-Dev] Time-out in URL Open In-Reply-To: References: Message-ID: On 7/4/06, Facundo Batista wrote: > 2006/7/3, Guido van Rossum : > > > To fake things like this, socket.setdefaulttimeout() was added, though > > I don't know if it actually works. Have you tried that? > > This affect all the sockets. So, assuming your app is single-threaded, set the timeout, call urlopen(), and reset the timeout to None. > And I hit the problem when servicing > information with a web service (TCPServer), and I need to timeout the > connection of the URLOpen *only*. That's not so easy even if you were to have a timeout parameter to urlopen(). You'd have to implement that peculiarity in all the layers (in this case, urllib and httplib; and possibly ftplib, gopherlib etc. :-) -- --Guido van Rossum (home page: http://www.python.org/~guido/) From guido at python.org Wed Jul 5 00:18:13 2006 From: guido at python.org (Guido van Rossum) Date: Wed, 5 Jul 2006 00:18:13 +0200 Subject: [Python-Dev] Explicit Lexical Scoping (pre-PEP?) In-Reply-To: <44AAE020.5060204@acm.org> References: <44AA1FF6.1050501@acm.org> <44AAE020.5060204@acm.org> Message-ID: On 7/4/06, Talin wrote: > Guido van Rossum wrote: > > Also please explain what problem you are solving before proposing a > > solution. > Actually, the "problem" I am trying to solve is the debate on the > mailing list. 
That is, I listen to what people are asking for, and what > disagreements they have, and then I try to provide a solution that > resolves the debate. > > In this case, there was a lot of discussion about lexical scoping, and > various people proposing solutions (such as redefining the behavior of > 'global') that I thought were (a) problematic, and (b) not a lot of bang > for the buck (i.e. the disruption vs. utility tradeoff was poor IMHO.) > > To be honest, I really have no stake in this proposal, and I don't > intend to spend any time defending it other than to correct > misperceptions - however, I offer it as a potential starting point for > people who are interested in the whole lexical scoping issue. If someone > feels that this proposal gives them what they want, then great - > otherwise I'll drop it. Thanks; I appreciate the attempt. I just think that we're not quite ready for more proposals (and certainly not for radical ones). Instead, I'd like to go back to review the needs and desires first. I think the needs are actually pretty simple. Python currently doesn't allow assignment to variables in an outer non-global scope, and people have shown by their behavior that they cannot get used to this (otherwise the debate would have fizzled by now). There are two fundamentally different mechanisms seen in programming languages to control the binding of such variables. The most common approach is to require declaration of variables in the scope to which they belong. But Python doesn't do this, and I think it would be a shame if we had to start doing this now -- the objections against your proposal clearly show the problems if we try to mix this with Python's traditional "assignment is declaration" philosophy. The other approach is an extension of what Python already does for variables in the global scope. ABC did this too (the SHARE command, see http://homepages.cwi.nl/~steven/abc/qr.html#HOWTOs). 
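A small sketch of the limitation, for concreteness (the function and class names are made up for illustration): assigning to an outer function's variable from a nested function just makes that name local to the inner function, so the usual workarounds are to mutate a container instead of rebinding the name, or to hold the state in an object.

```python
def make_counter_broken():
    count = 0

    def inc():
        # The augmented assignment makes 'count' a local of inc(), so the
        # read half of '+=' fails: the local has no value yet.
        count += 1
        return count

    return inc


def make_counter_cell():
    count = [0]  # mutable container: inc() mutates it but never rebinds 'count'

    def inc():
        count[0] += 1
        return count[0]

    return inc


class Counter(object):
    """The same state held explicitly in an object instead of a closure."""

    def __init__(self):
        self.count = 0

    def inc(self):
        self.count += 1
        return self.count
```

Calling make_counter_broken()() raises UnboundLocalError, while the container and class versions count 1, 2, 3, ... as intended.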
I think we have to continue to search for a solution that extends the idea of global declarations. I've proposed extending its meaning to refer to the nearest outer scope where the variable is set; if there is no such scope it's an error. This will break a small number of program but probably not very many; still, it'll require a future statement or waiting until Python 3.0. The downside is that "global" is not a very intuitive word for this new meaning. (Maybe ABC's SHARE would have been better.) We could use a different keyword instead, e.g. 'outer'. I believe I've also seen proposals in the past that used a number to indicate how many scopes to go out; I don't like that at all. I don't see anything else that's attractive. The realistic options are: 1. do nothing 2. extend global's meaning 3. add outer keyword Personally I think I'd vote for (2) since it doesn't require a new keyword. But that's only a slight preference over the other two. Personally it's not a burning need; by the time you start feeling the need to modify variables in an outer scope you should probably consider refactoring using an explicit class to hold the state. But I used the same argument to keep the current form of nested scopes (can we say "closures"? But what exactly is the closure?) out of the door and I lost that argument. -- --Guido van Rossum (home page: http://www.python.org/~guido/) From pje at telecommunity.com Wed Jul 5 00:25:38 2006 From: pje at telecommunity.com (Phillip J. Eby) Date: Tue, 04 Jul 2006 18:25:38 -0400 Subject: [Python-Dev] Explicit Lexical Scoping (pre-PEP?) In-Reply-To: References: <44AAE020.5060204@acm.org> <44AA1FF6.1050501@acm.org> <44AAE020.5060204@acm.org> Message-ID: <5.1.1.6.0.20060704182302.0207e890@sparrow.telecommunity.com> At 12:18 AM 7/5/2006 +0200, Guido van Rossum wrote: >I don't see anything else that's attractive. The realistic options are: > >1. do nothing >2. extend global's meaning >3. 
add outer keyword Did you also consider and reject: * Alternate binding operators (e.g. ":=", ".=", etc.) * Alternate spelling of outer names when binding (e.g. ".x = whatever" to bind an outer x) If so, then these should probably be added to the "rejected alternatives" for Py3K so they don't get rehashed. From nnorwitz at gmail.com Wed Jul 5 01:49:13 2006 From: nnorwitz at gmail.com (Neal Norwitz) Date: Tue, 4 Jul 2006 16:49:13 -0700 Subject: [Python-Dev] User's complaints Message-ID: On 7/4/06, Guido van Rossum wrote: > > From actual users of > the language I get more complaints about the breakneck speed of > Python's evolution than about the brokenness of the current language. Guido, I'm really interested in your perspective here. I assume you hear far more "average" complaints from Joe Random User. Can you help give the rest of us an idea about the top 10 complaints/problems people have? I realize this will be subjective, that's ok. Perhaps we should try to focus our energies on some of these issues. For example, we heard grumblings about the releases coming too often. Once we went to an 18 month release schedule, there was minimal complaining. It should be fairly safe to assume this silence means people think we are doing a good job. What are the things that could be fixed that would silence the most number of user's complaints? n -- PS. One thing I tend to talk to users about is stability of the interpreter. When I talk about crashing the interpreter, the most common first reaction I get is "you can crash the interpreter? How do you do that?" I take that answer as a good sign. :-) From aahz at pythoncraft.com Wed Jul 5 05:04:29 2006 From: aahz at pythoncraft.com (Aahz) Date: Tue, 4 Jul 2006 20:04:29 -0700 Subject: [Python-Dev] 2.5b1 Windows install In-Reply-To: <449F7CA9.9010105@v.loewis.de> References: <20060626004726.GA24988@panix.com> <449F7CA9.9010105@v.loewis.de> Message-ID: <20060705030429.GA18105@panix.com> On Mon, Jun 26, 2006, "Martin v. 
Löwis" wrote:
> Aahz wrote:
>>
>> Has anyone else tried doing an admin install with "compile .py files"
>> checked? It's causing my install to blow up, but I'd prefer to assume
>> it's some weird Windows config/bug unless other people also have it, in
>> which case I'll file an SF report.
>
> It works fine for me. One way for it to fail is if you have uncompilable
> modules in the target directory. Currently, it invokes
>
> [TARGETDIR]python.exe -Wi [TARGETDIR]Lib\compileall.py -f -x
> bad_coding|badsyntax|site-packages [TARGETDIR]Lib
>
> where TARGETDIR is, well, the target directory of the installation.
> You could try to run this after you installed Python without pyc
> compilation, to see whether it succeeds.

Ah-ha! I haven't actually tested this directly, but I bet I know what's going on: this isn't properly quoted and fails with TARGETDIR of "C:\Program Files\Python25" because of the space. I did test to see that it works fine with "C:\Python25".

Shall I file a bug? Or do you want to just document this as a limitation?
--
Aahz (aahz at pythoncraft.com) <*> http://www.pythoncraft.com/

"I saw `cout' being shifted "Hello world" times to the left and stopped right there." --Steve Gonedes
From guido at python.org Wed Jul 5 05:49:10 2006
From: guido at python.org (Guido van Rossum)
Date: Wed, 5 Jul 2006 05:49:10 +0200
Subject: [Python-Dev] Explicit Lexical Scoping (pre-PEP?)
In-Reply-To: <5.1.1.6.0.20060704182302.0207e890@sparrow.telecommunity.com>
References: <44AA1FF6.1050501@acm.org> <44AAE020.5060204@acm.org> <5.1.1.6.0.20060704182302.0207e890@sparrow.telecommunity.com>
Message-ID:

On 7/5/06, Phillip J. Eby wrote:
> At 12:18 AM 7/5/2006 +0200, Guido van Rossum wrote:
> >I don't see anything else that's attractive. The realistic options are:
> >
> >1. do nothing
> >2. extend global's meaning
> >3. add outer keyword
>
> Did you also consider and reject:
>
> * Alternate binding operators (e.g. ":=", ".=", etc.)

Brr.
> * Alternate spelling of outer names when binding (e.g. ".x = whatever" to > bind an outer x) We looked at and rejected "globals.x = whatever". I think the same reasoning applies here. > If so, then these should probably be added to the "rejected alternatives" > for Py3K so they don't get rehashed. Georgbot? -- --Guido van Rossum (home page: http://www.python.org/~guido/) From martin at v.loewis.de Wed Jul 5 07:23:54 2006 From: martin at v.loewis.de (=?ISO-8859-1?Q?=22Martin_v=2E_L=F6wis=22?=) Date: Wed, 05 Jul 2006 07:23:54 +0200 Subject: [Python-Dev] 2.5b1 Windows install In-Reply-To: <20060705030429.GA18105@panix.com> References: <20060626004726.GA24988@panix.com> <449F7CA9.9010105@v.loewis.de> <20060705030429.GA18105@panix.com> Message-ID: <44AB4CEA.9080705@v.loewis.de> Aahz wrote: > Ah-ha! I haven't actually tested this directly, but I bet I know what's > going on: this isn't properly quoted and fails with TARGETDIR of > "C:\Program Files\Python25" because of the space. I did test to see that > it works fine with "C:\Python25" > > Shall I file a bug? Or do you want to just document this as a > limitation? If this is indeed the problem, it should be fixed. Before filing the bug report, please confirm that this actually is a problem. 
Regards,
Martin
From ronaldoussoren at mac.com Wed Jul 5 07:35:25 2006
From: ronaldoussoren at mac.com (Ronald Oussoren)
Date: Wed, 5 Jul 2006 07:35:25 +0200
Subject: [Python-Dev] zlib module build failure on Mac OSX 10.4.7
In-Reply-To:
References: <17573.60593.759293.549583@montanaro.dyndns.org> <130298B1-7336-480A-A90F-97B68327BE4A@mac.com> <17574.43390.560231.425494@montanaro.dyndns.org> <315DECAC-D582-4692-A0CA-13D3C7BF992C@mac.com> <44A6C314.2000709@v.loewis.de> <0006F35E-FD5F-4BAD-A608-642039BBE7E8@mac.com>
Message-ID: <738A121B-3529-411D-89A7-EBD8C6B42A29@mac.com>

On Jul 4, 2006, at 11:21 PM, Neal Norwitz wrote:

> Ronald, Bob,
>
> I know Skip found and fixed his problem, however, is this problem
> likely to affect other users? Is there anything we can do to help
> alleviate/diagnose this problem?

I'll either enhance configure or roll back my change to setup.py. I'd prefer to do the former, but if beta2 gets too close I'll just change setup.py.

Ronald
From pje at telecommunity.com Wed Jul 5 08:26:25 2006
From: pje at telecommunity.com (Phillip J. Eby)
Date: Wed, 05 Jul 2006 02:26:25 -0400
Subject: [Python-Dev] Explicit Lexical Scoping (pre-PEP?)
In-Reply-To:
References: <5.1.1.6.0.20060704182302.0207e890@sparrow.telecommunity.com> <44AA1FF6.1050501@acm.org> <44AAE020.5060204@acm.org> <5.1.1.6.0.20060704182302.0207e890@sparrow.telecommunity.com>
Message-ID: <5.1.1.6.0.20060705020306.0207f890@sparrow.telecommunity.com>

At 05:49 AM 7/5/2006 +0200, Guido van Rossum wrote:
>>* Alternate spelling of outer names when binding (e.g. ".x = whatever" to
>>bind an outer x)
>
>We looked at and rejected "globals.x = whatever". I think the same
>reasoning applies here.

I thought the 'globals.x' proposal required that 'x' always be accessed using 'globals', even if it wasn't being rebound. I don't see a problem with requiring '.x' to be used for both reading and writing of outer-scope names; it just shouldn't be required for an outer-scope name that you don't rebind in the current scope. That symmetry requirement can't be implemented with the 'globals.x' approach unless 'globals' is treated specially by the compiler.

Using the classic nonsense example:

    def counter(num):
        def inc():
            .num += 1
            return .num
        return inc

If inc() only needed to *read* num, it could just use 'num' without the '.', and be nicely backward compatible with today's Python.

(Note: It should be illegal to use both '.num' and 'num' in the same scope, whether writing or reading the value, to prevent readers from becoming confused about what variable you mean. It should also be required that the compiler can see a definition of 'num' in an outer scope if you use the '.num' syntax, so that misspelling a name doesn't create a global variable.)

I personally think this approach could be the overall least-intrusive solution as far as syntax goes. It also allows for dropping the 'global' keyword in 3.x, and it has a nice EIBTI feel to it, as it allows you to highlight closure variables in an inner function by using the '.'. It's often not obvious when an inner function (such as a decorator returned by a decorator factory) is using variables that were defined in the outer scope; the leading '.' would make them stand out, and so could be considered the recommended code style when referring to outer variables.

In addition, there's a nice symmetry between nested functions and top-level functions, e.g. in this global version of the counter example:

    num = 0
    def inc():
        .num += 1
        return .num

The principal downside that I see is that it uses semi-random punctuation in place of keywords.
OTOH, we are already using more-or-less this syntax for relative imports, so reusing it to mean "relative variables" seems to at least avoid creating any entirely new principles. :) Anyway, I won't argue this one further; I just wanted to make sure it had been considered, as I'm not sure that you were reading the thread where it was first brought up (possibly as long as a few months ago). From guido at python.org Wed Jul 5 10:12:53 2006 From: guido at python.org (Guido van Rossum) Date: Wed, 5 Jul 2006 10:12:53 +0200 Subject: [Python-Dev] Import semantics In-Reply-To: <4dab5f760606251334o718323adt409294e2952e954a@mail.gmail.com> References: <448D1F0D.7000405@strakt.com> <4dab5f760606251334o718323adt409294e2952e954a@mail.gmail.com> Message-ID: On 6/25/06, Frank Wierzbicki wrote: > Sorry for the untrimmed conversation, but I've cc'ed jython-dev, my > comments are at the bottom. > > On 6/12/06, Guido van Rossum wrote: > > On 6/12/06, Samuele Pedroni wrote: > > > Fabio Zadrozny wrote: > > > > Python and Jython import semantics differ on how sub-packages should be > > > > accessed after importing some module: > > > > > > > > Jython 2.1 on java1.5.0 (JIT: null) > > > > Type "copyright", "credits" or "license" for more information. > > > > >>> import xml > > > > >>> xml.dom > > > > > > > > > > > > Python 2.4.2 (#67, Sep 28 2005, 12:41:11) [MSC v.1310 32 bit (Intel)] on > > > > win32 > > > > Type "help", "copyright", "credits" or "license" for more information. > > > > >>> import xml > > > > >>> xml.dom > > > > Traceback (most recent call last): > > > > File "", line 1, in ? 
> > > > AttributeError: 'module' object has no attribute 'dom' > > > > >>> from xml.dom import pulldom > > > > >>> xml.dom > > > > > > > > > > > > Note that in Jython importing a module makes all subpackages beneath it > > > > available, whereas in python, only the tokens available in __init__.py > > > > are accessible, but if you do load the module later even if not getting > > > > it directly into the namespace, it gets accessible too -- this seems > > > > more like something unexpected to me -- I would expect it to be > > > > available only if I did some "import xml.dom" at some point. > > > > > > > > My problem is that in Pydev, in static analysis, I would only get the > > > > tokens available for actually imported modules, but that's not true for > > > > Jython, and I'm not sure if the current behaviour in Python was expected. > > > > > > > > So... which would be the right semantics for this? > > > > > > the difference in Jython is deliberate. I think the reason was to mimic > > > more the Java style for this, in java fully qualified names always work. > > > In jython importing the top level packages is enough to get a similar > > > effect. > > > > > > This is unlikely to change for backward compatibility reasons, at least > > > from my POV. > > > > IMO it should do this only if the imported module is really a Java > > package. If it's a Python package it should stick to python semantics > > if possible. > > > > -- > > --Guido van Rossum (home page: http://www.python.org/~guido/) > > This is a tough one since the BDFL and Samuele disagree here. Perhaps > we should document the Java import behavior as permanent, but document > the Python imports in Jython as being deprecated but available until > some future release? I believe we would keep it at least through > Jython 2.3. Hi Frank, Have you and/or the Jython community made up your mind about this? 
The thread seems to have disappeared after you posted (or perhaps it continued only on jython-dev, which I don't read?).

Also, I just realized that you're the new Jython maintainer. Is *that* official? I'd like to offer you my congratulations, and, more importantly, any support you might need. I find Jython an important part of Python's long-term strategy. I'm asked occasionally what the status of Jython is; people point out that the last release was 2.1 many years ago and the website has no news since early 2005; they're afraid that Jython is dying and that it's not a viable choice for new projects. I'm very happy to be able to tell them that soon there will be a 2.3 release and yes there *is* continued support... So if you need anything from me or from the PSF, please let me know!

--
--Guido van Rossum (home page: http://www.python.org/~guido/)
From anthony at interlink.com.au Wed Jul 5 10:17:59 2006
From: anthony at interlink.com.au (Anthony Baxter)
Date: Wed, 5 Jul 2006 18:17:59 +1000
Subject: [Python-Dev] Import semantics
In-Reply-To:
References: <4dab5f760606251334o718323adt409294e2952e954a@mail.gmail.com>
Message-ID: <200607051818.02742.anthony@interlink.com.au>

On Wednesday 05 July 2006 18:12, Guido van Rossum wrote:
> I'm asked
> occasionally what the status of Jython is; people point out that
> the last release was 2.1 many years ago and the website has no news
> since early 2005; they're afraid that Jython is dying and that it's
> not a viable choice for new projects. I'm very happy to be able to
> tell them that soon there will be a 2.3 release and yes there *is*
> continued support... So if you need anything from me or from the
> PSF, please let me know!

In that case, why not post a news item saying this? The website is probably the first place people look...

Anthony
--
Anthony Baxter
It's never too late to have a happy childhood.
From guido at python.org Wed Jul 5 10:21:00 2006
From: guido at python.org (Guido van Rossum)
Date: Wed, 5 Jul 2006 10:21:00 +0200
Subject: [Python-Dev] Explicit Lexical Scoping (pre-PEP?)
In-Reply-To: <5.1.1.6.0.20060705020306.0207f890@sparrow.telecommunity.com>
References: <44AA1FF6.1050501@acm.org> <44AAE020.5060204@acm.org>
    <5.1.1.6.0.20060704182302.0207e890@sparrow.telecommunity.com>
    <5.1.1.6.0.20060705020306.0207f890@sparrow.telecommunity.com>
Message-ID:

On 7/5/06, Phillip J. Eby wrote:
> At 05:49 AM 7/5/2006 +0200, Guido van Rossum wrote:
> >>* Alternate spelling of outer names when binding (e.g. ".x = whatever" to
> >>bind an outer x)
> >
> >We looked at and rejected "globals.x = whatever". I think the same
> >reasoning applies here.
>
> I thought the 'globals.x' proposal required that 'x' always be accessed
> using 'globals', even if it wasn't being rebound. I don't see a problem
> with requiring '.x' to be used for both reading and writing of outer-scope
> names; it just shouldn't be required for an outer-scope name that you don't
> rebind in the current scope. That symmetry requirement can't be
> implemented with the 'globals.x' approach unless 'globals' is treated
> specially by the compiler.
>
> Using the classic nonsense example:
>
>     def counter(num):
>         def inc():
>             .num += 1
>             return .num
>         return inc
>
> If inc() only needed to *read* num, it could just use 'num' without the
> '.', and be nicely backward compatible with today's Python.
>
> (Note: It should be illegal to use both '.num' and 'num' in the same scope,
> whether writing or reading the value, to prevent readers from becoming
> confused about what variable you mean. It should also be required that the
> compiler can see a definition of 'num' in an outer scope if you use the
> '.num' syntax, so that misspelling a name doesn't create a global variable.)
>
> I personally think this approach could be the overall least-intrusive
> solution as far as syntax goes.
> It also allows for dropping the 'global'
> keyword in 3.x, and it has a nice EIBTI feel to it, as it allows you to
> highlight closure variables in an inner function by using the '.'. It's
> often not obvious when an inner function (such as a decorator returned by a
> decorator factory) is using variables that were defined in the outer scope;
> the leading '.' would make them stand out, and so could be considered the
> recommended code style when referring to outer variables.
>
> In addition, there's a nice symmetry between nested functions and top-level
> functions, e.g. in this global version of the counter example:
>
>     num = 0
>     def inc():
>         .num += 1
>         return .num
>
> The principal downside that I see is that it uses semi-random punctuation
> in place of keywords. OTOH, we are already using more-or-less this syntax
> for relative imports, so reusing it to mean "relative variables" seems to
> at least avoid creating any entirely new principles. :)
>
> Anyway, I won't argue this one further; I just wanted to make sure it had
> been considered, as I'm not sure that you were reading the thread where it
> was first brought up (possibly as long as a few months ago).

Thanks for bringing this up. I'm not sure what I think of it yet. One
problem I see is that there might end up being two ways to reference
variables in outer scopes: .num if you plan to assign to it, or just
num if you only reference it. I find that the most disturbing issue so
far; modified global declarations or outer declarations don't have
this problem.

Would this also use ..num to refer to num in an outer scope two levels removed?
--
--Guido van Rossum (home page: http://www.python.org/~guido/)

From guido at python.org Wed Jul 5 10:22:16 2006
From: guido at python.org (Guido van Rossum)
Date: Wed, 5 Jul 2006 10:22:16 +0200
Subject: [Python-Dev] Import semantics
In-Reply-To: <200607051818.02742.anthony@interlink.com.au>
References: <4dab5f760606251334o718323adt409294e2952e954a@mail.gmail.com>
    <200607051818.02742.anthony@interlink.com.au>
Message-ID:

On 7/5/06, Anthony Baxter wrote:
> On Wednesday 05 July 2006 18:12, Guido van Rossum wrote:
> > I'm asked
> > occasionally what the status of Jython is; people point out that
> > the last release was 2.1 many years ago and the website has no news
> > since early 2005; they're afraid that Jython is dying and that it's
> > not a viable choice for new projects. I'm very happy to be able to
> > tell them that soon there will be a 2.3 release and yes there *is*
> > continued support... So if you need anything from me or from the
> > PSF, please let me know!
>
> In that case, why not post a news item saying this? The website is
> probably the first place people look...

I'm all for that; but I don't have webmaster privileges (nor do I want
them :-). Frank?

--
--Guido van Rossum (home page: http://www.python.org/~guido/)

From anthony at interlink.com.au Wed Jul 5 10:25:16 2006
From: anthony at interlink.com.au (Anthony Baxter)
Date: Wed, 5 Jul 2006 18:25:16 +1000
Subject: [Python-Dev] Explicit Lexical Scoping (pre-PEP?)
In-Reply-To:
References: <44AA1FF6.1050501@acm.org>
    <5.1.1.6.0.20060705020306.0207f890@sparrow.telecommunity.com>
Message-ID: <200607051825.17720.anthony@interlink.com.au>

On Wednesday 05 July 2006 18:21, Guido van Rossum wrote:
> Would this also use ..num to refer to num in an outer scope two
> levels removed?

Ew!

I don't want to even think about debugging

...x
vs
....x

Anthony
--
Anthony Baxter
It's never too late to have a happy childhood.
From guido at python.org Wed Jul 5 10:28:45 2006
From: guido at python.org (Guido van Rossum)
Date: Wed, 5 Jul 2006 10:28:45 +0200
Subject: [Python-Dev] Explicit Lexical Scoping (pre-PEP?)
In-Reply-To: <200607051825.17720.anthony@interlink.com.au>
References: <44AA1FF6.1050501@acm.org>
    <5.1.1.6.0.20060705020306.0207f890@sparrow.telecommunity.com>
    <200607051825.17720.anthony@interlink.com.au>
Message-ID:

Sorry, I should have added a ... :-)

On 7/5/06, Anthony Baxter wrote:
> On Wednesday 05 July 2006 18:21, Guido van Rossum wrote:
> > Would this also use ..num to refer to num in an outer scope two
> > levels removed?
>
> Ew!
>
> I don't want to even think about debugging
>
> ...x
> vs
> ....x
>
> Anthony
> --
> Anthony Baxter
> It's never too late to have a happy childhood.
>

--
--Guido van Rossum (home page: http://www.python.org/~guido/)

From g.brandl at gmx.net Wed Jul 5 10:29:27 2006
From: g.brandl at gmx.net (Georg Brandl)
Date: Wed, 05 Jul 2006 10:29:27 +0200
Subject: [Python-Dev] Explicit Lexical Scoping (pre-PEP?)
In-Reply-To:
References: <44AA1FF6.1050501@acm.org> <44AAE020.5060204@acm.org>
    <5.1.1.6.0.20060704182302.0207e890@sparrow.telecommunity.com>
Message-ID:

Guido van Rossum wrote:
> On 7/5/06, Phillip J. Eby wrote:
>> At 12:18 AM 7/5/2006 +0200, Guido van Rossum wrote:
>> >I don't see anything else that's attractive. The realistic options are:
>> >
>> >1. do nothing
>> >2. extend global's meaning
>> >3. add outer keyword
>>
>> Did you also consider and reject:
>>
>> * Alternate binding operators (e.g. ":=", ".=", etc.)
>
> Brr.
>
>> * Alternate spelling of outer names when binding (e.g. ".x = whatever" to
>> bind an outer x)
>
> We looked at and rejected "globals.x = whatever". I think the same
> reasoning applies here.
>
>> If so, then these should probably be added to the "rejected alternatives"
>> for Py3K so they don't get rehashed.
>
> Georgbot?

I added the alternative binding operators. The discussion about ".x" seems
to be still in progress.
Georg

From scott+python-dev at scottdial.com Wed Jul 5 11:40:40 2006
From: scott+python-dev at scottdial.com (Scott Dial)
Date: Wed, 05 Jul 2006 05:40:40 -0400
Subject: [Python-Dev] Explicit Lexical Scoping (pre-PEP?)
In-Reply-To:
References: <44AA1FF6.1050501@acm.org> <44AAE020.5060204@acm.org>
    <5.1.1.6.0.20060704182302.0207e890@sparrow.telecommunity.com>
    <5.1.1.6.0.20060705020306.0207f890@sparrow.telecommunity.com>
Message-ID: <44AB8918.1040308@scottdial.com>

Guido van Rossum wrote:
> Would this also use ..num to refer to num in an outer scope two levels removed?

I realize this was a wink, but it is a valid problem with the
"dot"-proposal.

def foo(n):
    def bar(n):
        def baz():
            return .n

So, which outer 'n' is being referenced? Seems like you need to either
be able to do multiple dots (ugly, hard to read) or only do a
single-step outwards reference. But then that has its own problems: if
I meant the 'n' passed into 'foo', then I have to resort to such
nonsense as:

def foo(n):
    def bar(n):
        foon = .n
        def baz():
            return .foon

It would almost be cute if you could do something like ".foo.n" to get
to the correct variable. If python maintains its current scoping
rules, then it seems like it works out, but I haven't thought this one
all the way through.

def foo(n):
    def bar(n):
        def baz():
            return .foo.n + .bar.n

--
Scott Dial
scott at scottdial.com
scodial at indiana.edu

From and-dev at doxdesk.com Wed Jul 5 11:18:32 2006
From: and-dev at doxdesk.com (Andrew Clover)
Date: Wed, 05 Jul 2006 18:18:32 +0900
Subject: [Python-Dev] Explicit Lexical Scoping (pre-PEP?)
In-Reply-To:
References: <44AA1FF6.1050501@acm.org> <44AAE020.5060204@acm.org>
Message-ID: <44AB83E8.2070303@doxdesk.com>

Guido van Rossum wrote:
> 1. do nothing
> 2. extend global's meaning
> 3. add outer keyword

2.5. extend global syntax to cover both [really global] and
[innermost matching scope]. eg.
  global x, y outer       # trailing non-keyword
  global in x, y          # re-use keyword
  not global x            # ceci n'est pas un global
  ...                     # something less ugly?

> Personally it's not a burning need

Agreed. Inability to write as well as read nested scopes is more of an
aesthetic wart than a practical one IMO.

--
And Clover
mailto:and at doxdesk.com
http://www.doxdesk.com/

From just at letterror.com Wed Jul 5 11:46:50 2006
From: just at letterror.com (Just van Rossum)
Date: Wed, 5 Jul 2006 11:46:50 +0200
Subject: [Python-Dev] Explicit Lexical Scoping (pre-PEP?)
In-Reply-To:
Message-ID:

Guido van Rossum wrote:

> On 7/5/06, Phillip J. Eby wrote:
> > Did you also consider and reject:
> >
> > * Alternate binding operators (e.g. ":=", ".=", etc.)
>
> Brr.

That's too bad :(

I still find a rebinding operator (":=" being my favorite) much, *much*
more appealing than any of the alternative proposals. It's beautifully
symmetrical with "assignment means local". It also pretty much makes the
global statement redundant.

The only downside I see is that it may cause a fairly big shift in
style: I for one would use := for rebinding local names. While I think
that would be an improvement (eg. by catching typos earlier), it's
*different*.

Just

--
Change is bad. We fear change. -- Garth Algar

From guido at python.org Wed Jul 5 12:02:16 2006
From: guido at python.org (Guido van Rossum)
Date: Wed, 5 Jul 2006 12:02:16 +0200
Subject: [Python-Dev] Explicit Lexical Scoping (pre-PEP?)
In-Reply-To:
References: