From cmjohnson.mailinglist at gmail.com Wed Jun 1 06:52:05 2011 From: cmjohnson.mailinglist at gmail.com (Carl M. Johnson) Date: Tue, 31 May 2011 18:52:05 -1000 Subject: [Python-ideas] Why does += trigger UnboundLocalError? Message-ID: We all know that the following code won't work because of UnboundLocalError and that to get around it, one needs to use nonlocal: >>> def accum(): ... x = 0 ... def inner(): ... x += 1 ... return x ... return inner ... >>> inc = accum() >>> inc() Traceback (most recent call last): File "", line 1, in File "", line 4, in inner UnboundLocalError: local variable 'x' referenced before assignment But why does this happen? Let's think about this a little more closely: += is not the same as =. A += can only happen if the left-hand term was already defined. So, why does the compiler treat this as though there were an assignment inside the function? Compare: >>> def accum(): ... x = [] ... def inner(): ... x.append(1) ... return x ... return inner ... >>> inc = accum() >>> inc() [1] >>> inc() [1, 1] >>> inc() [1, 1, 1] So, if I changed += to .append, the code suddenly works fine. Heck, I could also change it to x.__iadd__ if x happens to have that attribute. As we all know, adding an = anywhere to the function bound will cause x to be considered a local. So, for example, we can make the .append example fail by adding some unreachable code: >>> def accum(): ... x = [] ... def inner(): ... x.append(1) ... return x ... x = 0 #Won't ever be reached, but will cause x to be considered a local ... return inner ... >>> inc = accum() >>> inc() Traceback (most recent call last): File "", line 1, in File "", line 4, in inner UnboundLocalError: local variable 'x' referenced before assignment So, my proposal is that += by itself should not cause x to be considered a local variable. There should need to be a normal = assignment for the compiler to count x as a local. If the objection to my proposal is that I'm being "implicit and not explicit" because it would be like there's an implicit "nonlocal," my rebuttal is that we already have "implicit" nonlocals in the case of .append. -- Carl Johnson -------------- next part -------------- An HTML attachment was scrubbed... URL: From g.brandl at gmx.net Wed Jun 1 07:48:19 2011 From: g.brandl at gmx.net (Georg Brandl) Date: Wed, 01 Jun 2011 07:48:19 +0200 Subject: [Python-ideas] Why does += trigger UnboundLocalError? In-Reply-To: References: Message-ID: On 01.06.2011 06:52, Carl M. Johnson wrote: > We all know that the following code won't work because of UnboundLocalError and > that to get around it, one needs to use nonlocal: > >>>> def accum(): > ... x = 0 > ... def inner(): > ... x += 1 > ... return x > ... return inner > ... >>>> inc = accum() >>>> inc() > Traceback (most recent call last): > File "", line 1, in > File "", line 4, in inner > UnboundLocalError: local variable 'x' referenced before assignment > > But why does this happen? Let's think about this a little more closely: += is > not the same as =. A += can only happen if the left-hand term was already > defined. So, why does the compiler treat this as though there were an assignment > inside the function? Because x += y is equivalent to x = x.__iadd__(y) and therefore an assignment is going on here. Therefore, it's only logical to treat it as such when determining scopes. Georg From cmjohnson.mailinglist at gmail.com Wed Jun 1 08:48:55 2011 From: cmjohnson.mailinglist at gmail.com (Carl M. 
Johnson) Date: Tue, 31 May 2011 20:48:55 -1000 Subject: [Python-ideas] Why does += trigger UnboundLocalError? In-Reply-To: References: Message-ID: On Tue, May 31, 2011 at 7:48 PM, Georg Brandl wrote: > Because x += y is equivalent to > > x = x.__iadd__(y) > > and therefore an assignment is going on here. Therefore, it's only logical > to > treat it as such when determining scopes. > > But the difference is that you can only use += if the LHS name already exists and is defined. So, it couldn't possibly be referring to a local name if it's the only assignment-like statement within a function body. How could it refer to a local if it has to refer to something that already exists? -------------- next part -------------- An HTML attachment was scrubbed... URL: From g.brandl at gmx.net Wed Jun 1 09:05:58 2011 From: g.brandl at gmx.net (Georg Brandl) Date: Wed, 01 Jun 2011 09:05:58 +0200 Subject: [Python-ideas] Why does += trigger UnboundLocalError? In-Reply-To: References: Message-ID: On 01.06.2011 08:48, Carl M. Johnson wrote: > > > On Tue, May 31, 2011 at 7:48 PM, Georg Brandl > > wrote: > > Because x += y is equivalent to > > x = x.__iadd__(y) > > and therefore an assignment is going on here. Therefore, it's only logical to > treat it as such when determining scopes. > > > But the difference is that you can only use += if the LHS name already exists > and is defined. So, it couldn't possibly be referring to a local name if it's > the only assignment-like statement within a function body. How could it refer to > a local if it has to refer to something that already exists? Sure, this can only work if the local is assigned somewhere before the augmented assign statement. But this is just like accessing a local before its assignment: in the case of x = 1 def f(): print x x = 2 we also don't treat the first "x" reference as a nonlocal. And the fact remains that augassign *is* an assignment, and the rule is that assignments to out-of-scope names are only allowed when declared using "global" or "nonlocal". Georg From cmjohnson.mailinglist at gmail.com Wed Jun 1 10:26:02 2011 From: cmjohnson.mailinglist at gmail.com (Carl M. Johnson) Date: Tue, 31 May 2011 22:26:02 -1000 Subject: [Python-ideas] Why does += trigger UnboundLocalError? In-Reply-To: References: Message-ID: On Tue, May 31, 2011 at 9:05 PM, Georg Brandl wrote: > Sure, this can only work if the local is assigned somewhere before the > augmented > assign statement. But this is just like accessing a local before its > assignment: in the case of > > x = 1 > def f(): > print x > x = 2 > > we also don't treat the first "x" reference as a nonlocal. > I don't think that's a counterexample to the point I'm trying to make. We all agree that if there's an x= somewhere in the function body, then we have to treat the variable as a local. The only possible way around that would be to solve the halting problem in order to figure out if a particular line of code will be reached or not. Agreed, sure, we have to treat the LHS of = as a local. But += is fundamentally different. You cannot have a += statement unless somewhere out there there is a matching = statement. It cannot exist independently. It never works on its own. So, if there is a += statement in the function body and there isn't an = statement in the function body it cannot work. Ever. All function bodies that have a += but no corresponding = or nonlocal are, as of today, broken code. 
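For comparison, here is a sketch of the only working spelling today -- the nonlocal declaration mentioned at the top of the thread -- showing the behaviour the proposal would make available without the declaration:

>>> def accum():
...     x = 0
...     def inner():
...         nonlocal x
...         x += 1
...         return x
...     return inner
...
>>> inc = accum()
>>> inc()
1
>>> inc()
2
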
So, if we were to change Python to make += not cause a variable to become a local, it wouldn't change how any (working) Python code today functions (it might causes some tests to change if they were counting on the error). This would be a completely backwards compatible change. Or am I missing something? Is there any scenario where you can get away with using += without = or nonlocal? I guess you could do something with locals().update or the stackframe, but my understanding is that those hacks don't count for language purposes. -- Carl -------------- next part -------------- An HTML attachment was scrubbed... URL: From p.f.moore at gmail.com Wed Jun 1 10:51:33 2011 From: p.f.moore at gmail.com (Paul Moore) Date: Wed, 1 Jun 2011 09:51:33 +0100 Subject: [Python-ideas] Why does += trigger UnboundLocalError? In-Reply-To: References: Message-ID: On 1 June 2011 09:26, Carl M. Johnson wrote: > I don't think that's a counterexample to the point I'm trying to make. We > all agree that if there's an x= somewhere in the function body, then we have > to treat the variable as a local. The only possible way around that would be > to solve the halting problem in order to figure out if a particular line of > code will be reached or not. Agreed, sure, we have to treat the LHS of = as > a local. But += is fundamentally different. You cannot have a += statement > unless somewhere out there there is a matching = statement. It cannot exist > independently. It never works on its own. So, if there is a += statement in > the function body and there isn't an = statement in the function body it > cannot work. Ever. All function bodies that have a += but no corresponding = > or nonlocal are, as of today, broken code. So, if we were to change Python > to make += not cause a variable to become a local, it wouldn't change how > any (working) Python code today functions (it might causes some tests to > change if they were counting on the error). This would be a completely > backwards compatible change. > Or am I missing something? Is there any scenario where you can get away with > using += without = or nonlocal? I guess you could do something with > locals().update or the stackframe, but my understanding is that those hacks > don't count for language purposes. The place to start here is section 4.1 of the language reference (Naming and Binding). Specifically, "A scope defines the visibility of a name within a block. If a local variable is defined in a block, its scope includes that block." Your modification of augmented assignment implies that a block can contain 2 different scopes - consider x = 1 def f(): # The next statement uses the global x x += 1 x = 2 # From here, you have a local x That fundamentally changes the language semantics. If you want to push this change, I'd suggest you start by proposing a change to the language reference section I mentioned above to define your proposed new scoping rules. In my view, that would be sufficiently hard that it'd kill this proposal, but if you can manage to do it, then you may have a chance to get your change accepted. Paul. From jh at improva.dk Wed Jun 1 11:09:06 2011 From: jh at improva.dk (Jacob Holm) Date: Wed, 01 Jun 2011 11:09:06 +0200 Subject: [Python-ideas] Why does += trigger UnboundLocalError? In-Reply-To: References: Message-ID: <4DE601B2.20708@improva.dk> I think you missed this statement, even though you quoted it. On 2011-06-01 10:51, Paul Moore wrote: > On 1 June 2011 09:26, Carl M. 
Johnson wrote: >> We >> all agree that if there's an x= somewhere in the function body, then we have >> to treat the variable as a local. > This means that your example: > x = 1 > def f(): > # The next statement uses the global x > x += 1 > x = 2 > # From here, you have a local x > Would behave exactly as it does today under the proposed new semantics. Specifically, the "x = 2" statement (and the lack of a nonlocal statement) forces x to be local throughout the function, and the "x += 1" statement then tries to read the local "x" and fails. > That fundamentally changes the language semantics. I don't think it does. It only makes a difference for functions that contains an augmented assignment to a name without also containing a regular assignment to that name. This case will change from being an error to doing something well-defined and useful. FWIW, I'm +1 on the idea. Best regards - Jacob From cmjohnson.mailinglist at gmail.com Wed Jun 1 11:41:06 2011 From: cmjohnson.mailinglist at gmail.com (Carl M. Johnson) Date: Tue, 31 May 2011 23:41:06 -1000 Subject: [Python-ideas] Why does += trigger UnboundLocalError? In-Reply-To: <4DE601B2.20708@improva.dk> References: <4DE601B2.20708@improva.dk> Message-ID: On Tue, May 31, 2011 at 11:09 PM, Jacob Holm wrote: > > x = 1 > > def f(): > > # The next statement uses the global x > > x += 1 > > x = 2 > > # From here, you have a local x > > > > > Specifically, the "x = 2" statement (and the lack of a nonlocal > statement) forces x to be local throughout the function, and the "x += > 1" statement then tries to read the local "x" and fails. > Yes, Jacob has got exactly what I was proposing. x += 1; x = 2 should continue to fail, since there would be a = statement in the function body in that case. -- Carl -------------- next part -------------- An HTML attachment was scrubbed... URL: From rob.cliffe at btinternet.com Wed Jun 1 12:43:14 2011 From: rob.cliffe at btinternet.com (Rob Cliffe) Date: Wed, 01 Jun 2011 11:43:14 +0100 Subject: [Python-ideas] Why does += trigger UnboundLocalError? In-Reply-To: References: <4DE601B2.20708@improva.dk> Message-ID: <4DE617C2.2020602@btinternet.com> > Yes, Jacob has got exactly what I was proposing. x += 1; x = 2 should > continue to fail, since there would be a = statement in the function > body in that case. > > -- Carl My first reaction was: +1 on the proposed change. It seemed logical. Then I had a reservation: it would widen the semantic difference between x += 1 and x = x + 1 which could trip someone innocently making a "trivial" code change from the former to the latter (x unintentionally becomes a local). So how about going further and say that x is only interpreted as local if there is at least one NON-augmented assignment in which x appears as a target on the LHS but x does NOT appear on the RHS? I.e. x = x + 1 (like "x += 1") does not (by itself) make x local. Or is this getting too hard to explain? Best wishes Rob Cliffe From andreengels at gmail.com Wed Jun 1 12:51:49 2011 From: andreengels at gmail.com (Andre Engels) Date: Wed, 1 Jun 2011 12:51:49 +0200 Subject: [Python-ideas] Why does += trigger UnboundLocalError? In-Reply-To: <4DE617C2.2020602@btinternet.com> References: <4DE601B2.20708@improva.dk> <4DE617C2.2020602@btinternet.com> Message-ID: On Wed, Jun 1, 2011 at 12:43 PM, Rob Cliffe wrote: > My first reaction was: +1 on the proposed change. ?It seemed logical. > > Then I had a reservation: it would widen the semantic difference between > ? ? ? ?x += 1 > and > ? ? ? 
?x = x + 1 > which could trip someone innocently making a "trivial" code change from the > former to the latter (x unintentionally becomes a local). > > So how about going further and say that x is only interpreted as local if > there is at least one NON-augmented assignment in which x appears as a > target on the LHS but x does NOT appear on the RHS? > I.e. > ? ?x = x + 1 > (like "x += 1") does not (by itself) make x local. > > Or is this getting too hard to explain? I think so; it also has the same disadvantage you mention of getting a semantic change from seemingly neutral changes, but for other changes. For example x = 1 if x == 0 else x-1 would keep x global, but changing it to: if x == 0: x = 1 else: x = x-1 would not do so. -- Andr? Engels, andreengels at gmail.com From ethan at stoneleaf.us Wed Jun 1 13:24:30 2011 From: ethan at stoneleaf.us (Ethan Furman) Date: Wed, 01 Jun 2011 04:24:30 -0700 Subject: [Python-ideas] Why does += trigger UnboundLocalError? In-Reply-To: References: Message-ID: <4DE6216E.1000204@stoneleaf.us> Carl M. Johnson wrote: > On Tue, May 31, 2011 at 7:48 PM, Georg Brandl wrote: >> >> Because x += y is equivalent to >> >> x = x.__iadd__(y) >> >> and therefore an assignment is going on here. Therefore, it's only >> logical to treat it as such when determining scopes. > > But the difference is that you can only use += if the LHS name already > exists and is defined. So, it couldn't possibly be referring to a local > name if it's the only assignment-like statement within a function body. > How could it refer to a local if it has to refer to something that > already exists? Two problems. Firstly, what error should be raised here? --> def accum(): ... x = 0 ... def inner(): ... x1 += 1 ... return x ... return inner Secondly, the += operator may or may not be a mutating operator depending on the object it's used on: if the object does not have a __iadd__ method, it's not mutating; even if it does have an __iadd__ method, it may not be mutating -- it's up to the object to decide. --> class ex_int(int): ... def __iadd__(self, other): ... return self + other ... --> x = ex_int(7) --> x.__iadd__(3) 10 --> x 7 --> x = [1, 2, 3] --> x.__iadd__([4]) [1, 2, 3, 4] --> x [1, 2, 3, 4] -1 on changing the semantics. ~Ethan~ From ncoghlan at gmail.com Wed Jun 1 13:50:20 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Wed, 1 Jun 2011 21:50:20 +1000 Subject: [Python-ideas] Adding 'bytes' as alias for 'latin_1' codec. In-Reply-To: <79306.1306858606@parc.com> References: <4DE0481E.7010005@canterbury.ac.nz> <79306.1306858606@parc.com> Message-ID: On Wed, Jun 1, 2011 at 2:16 AM, Bill Janssen wrote: > I like the deprecations you suggest, but I'd prefer to see a more > general solution: the 'str' type extended so that it had two possible > representations for strings, the current format and an "encoded" format, > which would be kept as an array of bytes plus an encoding. ?It would > transcode only as necessary -- for example, the 're' module might > require the current Unicode encoding. ?An explicit method would be added > to allow the user to force transcoding. > > This would complicate life at the C level, to be sure. ?Though, perhaps > not so much, given the proper macrology. See PEP 393 - it is basically this idea (although the encodings are fixed for the various sizes rather than allowing arbitrary encodings in the 8-bit internal format). Cheers, Nick. -- Nick Coghlan?? |?? ncoghlan at gmail.com?? |?? 
Brisbane, Australia From andrew at acooke.org Wed Jun 1 14:29:10 2011 From: andrew at acooke.org (andrew cooke) Date: Wed, 1 Jun 2011 08:29:10 -0400 Subject: [Python-ideas] Why does += trigger UnboundLocalError? In-Reply-To: References: Message-ID: <20110601122910.GA19508@acooke.org> On Wed, Jun 01, 2011 at 07:48:19AM +0200, Georg Brandl wrote: > Because x += y is equivalent to > > x = x.__iadd__(y) > > and therefore an assignment is going on here. Therefore, it's only logical to > treat it as such when determining scopes. > > Georg And it's like this so that immutable classes work correctly, as far as I can see. So one way to answer the original idea is: because of immutable classes, += does not have the same semantics as .append() Andrew From ncoghlan at gmail.com Wed Jun 1 14:43:07 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Wed, 1 Jun 2011 22:43:07 +1000 Subject: [Python-ideas] Why does += trigger UnboundLocalError? In-Reply-To: References: Message-ID: On Wed, Jun 1, 2011 at 2:52 PM, Carl M. Johnson wrote: > We all know that the following code won't work because of UnboundLocalError > and that to get around it, one needs to use nonlocal: There's no fundamental reason this couldn't change, but actually changing it simply isn't worth the hassle, so the status quo wins the stalemate. I elaborated further on this point when the topic came up last year: http://mail.python.org/pipermail/python-ideas/2010-June/007448.html Cheers, Nick. -- Nick Coghlan?? |?? ncoghlan at gmail.com?? |?? Brisbane, Australia From paul-python at svensson.org Wed Jun 1 15:03:40 2011 From: paul-python at svensson.org (Paul Svensson) Date: Wed, 1 Jun 2011 09:03:40 -0400 (EDT) Subject: [Python-ideas] Why does += trigger UnboundLocalError? In-Reply-To: References: Message-ID: <20110601081137.B68837@familjen.svensson.org> On Wed, 1 Jun 2011, Georg Brandl wrote: > On 01.06.2011 06:52, Carl M. Johnson wrote: >> We all know that the following code won't work because of UnboundLocalError and >> that to get around it, one needs to use nonlocal: >> >>>>> def accum(): >> ... x = 0 >> ... def inner(): >> ... x += 1 >> ... return x >> ... return inner >> ... >>>>> inc = accum() >>>>> inc() >> Traceback (most recent call last): >> File "", line 1, in >> File "", line 4, in inner >> UnboundLocalError: local variable 'x' referenced before assignment >> >> But why does this happen? Let's think about this a little more closely: += is >> not the same as =. A += can only happen if the left-hand term was already >> defined. So, why does the compiler treat this as though there were an assignment >> inside the function? > > Because x += y is equivalent to > > x = x.__iadd__(y) > > and therefore an assignment is going on here. Therefore, it's only logical to > treat it as such when determining scopes. Off on a bit of a tangent here - this behaviour always bugged me: --> x = ([],) --> x[0] += ['a'] Traceback (most recent call last): File "", line 1, in TypeError: 'tuple' object does not support item assignment --> x (['a'],) --> I understand by the definition why this happens as it does, but intuitively, I'd expect an operation to either fail and raise, or succeed and not. I see two possible ways to make this behave: we can look before we leap, and raise the exception before calling __iadd__, if the assigment would fail; or, we can change the definition to only perform the assignment if __iadd__ returns something other than self. Both these are, to some extent, incompatible language changes. 
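For concreteness, the current behaviour corresponds roughly to this expansion (a sketch, not the exact bytecode):

    tmp = x[0]                 # getitem on the tuple succeeds
    tmp = tmp.__iadd__(['a'])  # list.__iadd__ mutates the list in place and returns it
    x[0] = tmp                 # setitem on the tuple raises TypeError

so the inner list has already been mutated by the time the assignment fails.
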
Both change how I think about the original proposal: with the first option, it softens the argument about __iadd__ being called before the assignment, so strengthens the case for the status quo; with the second option, the definition of __iadd__ gets more complicated, making me less inclined to dive into this definition to explain the locality of the assigned variable, preferring it to be defined separately, and simply. Back from the tangent, I think Carl's proposal would make Python more difficult to understand rather than less, so -1 from me. /Paul From janssen at parc.com Wed Jun 1 18:34:03 2011 From: janssen at parc.com (Bill Janssen) Date: Wed, 1 Jun 2011 09:34:03 PDT Subject: [Python-ideas] Adding 'bytes' as alias for 'latin_1' codec. In-Reply-To: References: <4DE0481E.7010005@canterbury.ac.nz> <79306.1306858606@parc.com> Message-ID: <98069.1306946043@parc.com> Nick Coghlan wrote: > On Wed, Jun 1, 2011 at 2:16 AM, Bill Janssen wrote: > > I like the deprecations you suggest, but I'd prefer to see a more > > general solution: the 'str' type extended so that it had two possible > > representations for strings, the current format and an "encoded" format, > > which would be kept as an array of bytes plus an encoding. ?It would > > transcode only as necessary -- for example, the 're' module might > > require the current Unicode encoding. ?An explicit method would be added > > to allow the user to force transcoding. > > > > This would complicate life at the C level, to be sure. ?Though, perhaps > > not so much, given the proper macrology. > > See PEP 393 - it is basically this idea Should have realized Martin would have thought of this :-). I'm not sure how I missed it back in January -- high drama at work distracted me, I guess. I might do it a bit differently, with just one pointer, say, "data", and a field which carries the encoding (possibly as a pointer to the appropriate codec). "data" would point to a buffer of the correct type. New strings would by default still be created as UCS-2 or UCS-4 Unicode, just as per today. I'd also allow any encoding which we have a codec for, so that if you are reading from a file containing encoded text, you can carry the exact bytes around unless you need to do something which isn't supported for that encoding -- in which case things get Unicodified behind the scenes. We'd smarten the various string methods over time so that most of them would work so long as the operands matched. str.index, for instance, wouldn't require decoding unless the two strings were of different encodings. Yes, there'd be some "magic" going on, but it wouldn't be worse than the automatic coercions Python does now -- that's just what a HLL does for you. > (although the encodings are > fixed for the various sizes rather than allowing arbitrary encodings > in the 8-bit internal format). IMO, the thing that bit us on the fundament with the 2.x str/unicode divide, and continues to bite us with the 3.x str/bytes divide is that we don't carry the encoding as part of the 2.x 'str' value (or as part of the 3.x 'bytes' value). The key here is to store the encoding internally in the string object, so that it's available to do automatic coercion when necessary, rather than *requiring* all coercions to be done manually by some program code. Bill From ethan at stoneleaf.us Wed Jun 1 18:56:47 2011 From: ethan at stoneleaf.us (Ethan Furman) Date: Wed, 01 Jun 2011 09:56:47 -0700 Subject: [Python-ideas] Why does += trigger UnboundLocalError? 
In-Reply-To: References: Message-ID: <4DE66F4F.5000706@stoneleaf.us> Nick Coghlan wrote: > On Wed, Jun 1, 2011 at 2:52 PM, Carl M. Johnson > wrote: >> We all know that the following code won't work because of UnboundLocalError >> and that to get around it, one needs to use nonlocal: > > There's no fundamental reason this couldn't change, but actually > changing it simply isn't worth the hassle, so the status quo wins the > stalemate. > > I elaborated further on this point when the topic came up last year: > http://mail.python.org/pipermail/python-ideas/2010-June/007448.html Maybe I get to learn something new about Python today. Several times in that thread it was stated that --> a += 1 is a shortcut for --> a.__iadd__(1) It seems to me that this is an implementation detail, and that the actual "longcut" is --> a = a + 1 Likewise, the shortcut of --> some_list[func_with_side_effects()] += some_value is the same as --> index = func_with_side_effects() --> some_list[index] = some_list[index] + some_value Is my understanding correct? ~Ethan~ From tjreedy at udel.edu Wed Jun 1 19:18:50 2011 From: tjreedy at udel.edu (Terry Reedy) Date: Wed, 01 Jun 2011 13:18:50 -0400 Subject: [Python-ideas] Adding 'bytes' as alias for 'latin_1' codec. In-Reply-To: <98069.1306946043@parc.com> References: <4DE0481E.7010005@canterbury.ac.nz> <79306.1306858606@parc.com> <98069.1306946043@parc.com> Message-ID: On 6/1/2011 12:34 PM, Bill Janssen wrote: > IMO, the thing that bit us on the fundament with the 2.x str/unicode > divide, and continues to bite us with the 3.x str/bytes divide is that > we don't carry the encoding as part of the 2.x 'str' value (or as part > of the 3.x 'bytes' value). The key here is to store the encoding > internally in the string object, so that it's available to do automatic > coercion when necessary, rather than *requiring* all coercions to be > done manually by some program code. Some time ago, I posted here a proposal to do just that -- add an encoding field to byte strings (or, I believe, add a new class). It was horribly shot down. Something like 'conceptually wrong, some bytes have 0 or multiple encodings, can just use an attribute or tuple, don't need it'. -- Terry Jan Reedy From bruce at leapyear.org Wed Jun 1 19:32:49 2011 From: bruce at leapyear.org (Bruce Leban) Date: Wed, 1 Jun 2011 10:32:49 -0700 Subject: [Python-ideas] Why does += trigger UnboundLocalError? In-Reply-To: <4DE66F4F.5000706@stoneleaf.us> References: <4DE66F4F.5000706@stoneleaf.us> Message-ID: On Wed, Jun 1, 2011 at 9:56 AM, Ethan Furman wrote: > > Several times in that thread it was stated that > > --> a += 1 > > is a shortcut for > > --> a.__iadd__(1) > > It seems to me that this is an implementation detail, and that the actual > "longcut" is > > --> a = a + 1 > > ... > > ~Ethan~ > a += 1 is not a shortcut for a.__iadd__(1). It's a shortcut for a = a.__iadd(1). Otherwise this wouldn't work: >>> x = (1,) >>> x += (2,) >>> x (1, 2) Note the difference between these two is one opcode: >>> def f(x,y): x += y >>> dis.dis(f) 2 0 LOAD_FAST 0 (x) 3 LOAD_FAST 1 (y) 6 *INPLACE_ADD* 7 STORE_FAST 0 (x) 10 LOAD_CONST 0 (None) 13 RETURN_VALUE >>> def g(x,y): x = x + y >>> dis.dis(g) 2 0 LOAD_FAST 0 (x) 3 LOAD_FAST 1 (y) 6 *BINARY_ADD* 7 STORE_FAST 0 (x) 10 LOAD_CONST 0 (None) 13 RETURN_VALUE --- Bruce Follow me: http://www.twitter.com/Vroo http://www.vroospeak.com Latest tweet: SO disappointed end of the world didn't happen AGAIN! #y2k #rapture Now waiting for 2038! 
#unixrapture -------------- next part -------------- An HTML attachment was scrubbed... URL: From ethan at stoneleaf.us Wed Jun 1 19:51:30 2011 From: ethan at stoneleaf.us (Ethan Furman) Date: Wed, 01 Jun 2011 10:51:30 -0700 Subject: [Python-ideas] Why does += trigger UnboundLocalError? In-Reply-To: References: <4DE66F4F.5000706@stoneleaf.us> Message-ID: <4DE67C22.4000502@stoneleaf.us> Bruce Leban wrote: > > > On Wed, Jun 1, 2011 at 9:56 AM, Ethan Furman wrote: > > > Several times in that thread it was stated that > > --> a += 1 > > is a shortcut for > > --> a.__iadd__(1) > > It seems to me that this is an implementation detail, and that the > actual "longcut" is > > --> a = a + 1 > > ... > > ~Ethan~ > > > a += 1 is not a shortcut for a.__iadd__(1). It's a shortcut for a = > a.__iadd(1). Otherwise this wouldn't work: Right -- typo on my part, sorry. > Note the difference between these two is one opcode: > > >>> def f(x,y): > x += y > >>> dis.dis(f) > 2 0 LOAD_FAST 0 (x) > 3 LOAD_FAST 1 (y) > 6 *INPLACE_ADD* > 7 STORE_FAST 0 (x) > 10 LOAD_CONST 0 (None) > 13 RETURN_VALUE > >>> def g(x,y): > x = x + y > > >>> dis.dis(g) > 2 0 LOAD_FAST 0 (x) > 3 LOAD_FAST 1 (y) > 6 *BINARY_ADD* > 7 STORE_FAST 0 (x) > 10 LOAD_CONST 0 (None) > 13 RETURN_VALUE Note also that INPLACE_ADD will call the the BINARY_ADD method if no __iadd__ method exists. ~Ethan~ From tjreedy at udel.edu Wed Jun 1 19:41:26 2011 From: tjreedy at udel.edu (Terry Reedy) Date: Wed, 01 Jun 2011 13:41:26 -0400 Subject: [Python-ideas] Why does += trigger UnboundLocalError? In-Reply-To: References: Message-ID: On 6/1/2011 12:52 AM, Carl M. Johnson wrote: > So, my proposal is that += by itself should not cause x to be considered > a local variable. Right now, 'augmented assigment' is uniformly what it says: an assignment with augmented behavior. 'expr1 op= expr2' is *defined* as being the same as 'expr1 = expr1 op expr2' except that expr1 is evauluated just once*, and if expr1 evaluates to a mutable, the op can be done in place. Some consider the second exception to be a confusing complication and a mistake. Your proposal would require a rewrite of the definition and would add additional complication. Some would then want another exception for when expr1 evaluates to a mutable within an immutable (see Paul Svensson's post). While I do understand your point, I also value uniformity. -1 * It is actually more complicate than than. Expr1 is partially evaluated just once to an internal reference rather than to an object. That reference is then used once to fetch the existing object and once again to rebind to the new or mutated object. Still, it is one behavior for all occurrences. -- Terry Jan Reedy From ethan at stoneleaf.us Wed Jun 1 19:58:12 2011 From: ethan at stoneleaf.us (Ethan Furman) Date: Wed, 01 Jun 2011 10:58:12 -0700 Subject: [Python-ideas] Adding 'bytes' as alias for 'latin_1' codec. In-Reply-To: References: <4DE0481E.7010005@canterbury.ac.nz> <79306.1306858606@parc.com> <98069.1306946043@parc.com> Message-ID: <4DE67DB4.3070304@stoneleaf.us> Terry Reedy wrote: > On 6/1/2011 12:34 PM, Bill Janssen wrote: > >> IMO, the thing that bit us on the fundament with the 2.x str/unicode >> divide, and continues to bite us with the 3.x str/bytes divide is that >> we don't carry the encoding as part of the 2.x 'str' value (or as part >> of the 3.x 'bytes' value). 
The key here is to store the encoding >> internally in the string object, so that it's available to do automatic >> coercion when necessary, rather than *requiring* all coercions to be >> done manually by some program code. > > Some time ago, I posted here a proposal to do just that -- add an > encoding field to byte strings (or, I believe, add a new class). It was > horribly shot down. Something like 'conceptually wrong, some bytes have > 0 or multiple encodings, can just use an attribute or tuple, don't need > it'. > A byte stream with multiple encodings? Now *that* seems wrong! It could also be handled by having the encoding field set to some special value indicating Unknown. ~Ethan~ From tjreedy at udel.edu Thu Jun 2 00:15:47 2011 From: tjreedy at udel.edu (Terry Reedy) Date: Wed, 01 Jun 2011 18:15:47 -0400 Subject: [Python-ideas] Adding 'bytes' as alias for 'latin_1' codec. In-Reply-To: <4DE67DB4.3070304@stoneleaf.us> References: <4DE0481E.7010005@canterbury.ac.nz> <79306.1306858606@parc.com> <98069.1306946043@parc.com> <4DE67DB4.3070304@stoneleaf.us> Message-ID: On 6/1/2011 1:58 PM, Ethan Furman wrote: > A byte stream with multiple encodings? Now *that* seems wrong! No, it is standard in many protocols. Ascii coded characters and numbers are mixed with binary coded numbers and binary blobs with their own codings. Bytes are not text, so don't think in terms of just text encodings. -- Terry Jan Reedy From tjreedy at udel.edu Thu Jun 2 00:42:03 2011 From: tjreedy at udel.edu (Terry Reedy) Date: Wed, 01 Jun 2011 18:42:03 -0400 Subject: [Python-ideas] Why does += trigger UnboundLocalError? In-Reply-To: References: <4DE66F4F.5000706@stoneleaf.us> Message-ID: On 6/1/2011 1:32 PM, Bruce Leban wrote: > >>> def f(x,y): > x += y > >>> dis.dis(f) > 2 0 LOAD_FAST 0 (x) > 3 LOAD_FAST 1 (y) > 6 *INPLACE_ADD* > 7 STORE_FAST 0 (x) > 10 LOAD_CONST 0 (None) > 13 RETURN_VALUE > >>> def g(x,y): > x = x + y > > >>> dis.dis(g) > 2 0 LOAD_FAST 0 (x) > 3 LOAD_FAST 1 (y) > 6 *BINARY_ADD* > 7 STORE_FAST 0 (x) > 10 LOAD_CONST 0 (None) > 13 RETURN_VALUE (In 3.2, one no longer needs to wrap code in a function to dis it. see below.) To see the 'calculate the source/target just once instead of twice' part, you need a source/target that actually requires calculation. >>> from dis import dis >>> dis('x[i] = x[i] + 1') 1 0 LOAD_NAME 0 (x) 3 LOAD_NAME 1 (i) 6 BINARY_SUBSCR 7 LOAD_CONST 0 (1) 10 BINARY_ADD 11 LOAD_NAME 0 (x) 14 LOAD_NAME 1 (i) 17 STORE_SUBSCR 18 LOAD_CONST 1 (None) 21 RETURN_VALUE >>> dis('x[i] += 1') 1 0 LOAD_NAME 0 (x) 3 LOAD_NAME 1 (i) 6 DUP_TOP_TWO 7 BINARY_SUBSCR 8 LOAD_CONST 0 (1) 11 INPLACE_ADD 12 ROT_THREE 13 STORE_SUBSCR 14 LOAD_CONST 1 (None) 17 RETURN_VALUE Even this does not show much difference as the dup and rotate substitute for two loads but do not actually save any calculation. However, >>> dis('a.b[c+d] = a.b[c+d] + 1') 1 0 LOAD_NAME 0 (a) 3 LOAD_ATTR 1 (b) 6 LOAD_NAME 2 (c) 9 LOAD_NAME 3 (d) 12 BINARY_ADD 13 BINARY_SUBSCR 14 LOAD_CONST 0 (1) 17 BINARY_ADD 18 LOAD_NAME 0 (a) 21 LOAD_ATTR 1 (b) 24 LOAD_NAME 2 (c) 27 LOAD_NAME 3 (d) 30 BINARY_ADD 31 STORE_SUBSCR 32 LOAD_CONST 1 (None) 35 RETURN_VALUE >>> dis('a.b[c+d] += 1') 1 0 LOAD_NAME 0 (a) 3 LOAD_ATTR 1 (b) 6 LOAD_NAME 2 (c) 9 LOAD_NAME 3 (d) 12 BINARY_ADD 13 DUP_TOP_TWO 14 BINARY_SUBSCR 15 LOAD_CONST 0 (1) 18 INPLACE_ADD 19 ROT_THREE 20 STORE_SUBSCR 21 LOAD_CONST 1 (None) 24 RETURN_VALUE The latter has the same dup-rotate in place of a bit more calculation. The same would be true of, for instance, f(a).b. 
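In pure-Python terms, the same thing is roughly (a sketch that ignores the fallback to __add__ when no __iadd__ is defined):

    obj = a.b      # target expression evaluated once
    key = c + d    # subscript evaluated once
    obj[key] = obj[key].__iadd__(1)

i.e. one evaluation of the target, then a paired getitem/setitem around the in-place add, which is what the dup/rotate opcodes above implement.
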
-- Terry Jan Reedy From tjreedy at udel.edu Thu Jun 2 00:56:52 2011 From: tjreedy at udel.edu (Terry Reedy) Date: Wed, 01 Jun 2011 18:56:52 -0400 Subject: [Python-ideas] Why does += trigger UnboundLocalError? In-Reply-To: References: Message-ID: On 6/1/2011 1:41 PM, Terry Reedy wrote: > On 6/1/2011 12:52 AM, Carl M. Johnson wrote: > >> So, my proposal is that += by itself should not cause x to be considered >> a local variable. > While I do understand your point, I also value uniformity. > -1 There is another problem I had not thought of before. Right now, Python has (always had) a simple rule: code in a function CANNOT rebind names in outer scopes unless the function has a global or nonlocal declaration. This simple, uniform rule benefits not only the interpreter but human readers. It should not be broken. def f(): 'doc for f' def g(): 'docstring of g' If g is the only nested function and the body of g does not have a nonlocal declaration (which OUGHT to be at the top if present), then a reader or maintainer of f knows (without reading g in detail) that nothing other that can rebind f's locals. -- Terry Jan Reedy From steve at pearwood.info Thu Jun 2 01:21:12 2011 From: steve at pearwood.info (Steven D'Aprano) Date: Thu, 02 Jun 2011 09:21:12 +1000 Subject: [Python-ideas] Why does += trigger UnboundLocalError? In-Reply-To: References: Message-ID: <4DE6C968.3000301@pearwood.info> Carl M. Johnson wrote: > Agreed, sure, we have to treat the LHS of = as > a local. But += is fundamentally different. No it's not. It is fundamentally the same. Augmented assignment in Python *is* assignment, equivalent to x = x.__iadd__(other). That alone should be enough to kill this proposal stone dead. += is not, except by accident, an in-place addition operator. It is always a re-binding. (Mutable objects are free to mutate in place, if they choose, but the re-binding still takes place.) > You cannot have a += statement > unless somewhere out there there is a matching = statement. It cannot exist > independently. It never works on its own. Neither does *any* attempt to access an unbound local. Python doesn't, and shouldn't, try to guess what you actually intended so as to make it work. If you want x to refer to a nonlocal, or a global, declare it as such. print x; x = 1 will fail unless there is an earlier x = something. x = x+1 will fail unless there is an earlier x = something. x += 1 will fail unless there is an earlier x = something. Why single out x += 1 for changed semantics to the rule that any assignment makes x a local? What if you don't have a non-local x, should Python guess that you wanted a global? Currently, the rule is simple: any assignment tells the compiler to treat x as local. If you want nonlocal or global, you have to declare it as such. Nice and simple. What actual real-world problem are you trying to solve that you want to change this behaviour? -1 on this change. -- Steven From ethan at stoneleaf.us Thu Jun 2 01:47:19 2011 From: ethan at stoneleaf.us (Ethan Furman) Date: Wed, 01 Jun 2011 16:47:19 -0700 Subject: [Python-ideas] Why does += trigger UnboundLocalError? In-Reply-To: <4DE6C968.3000301@pearwood.info> References: <4DE6C968.3000301@pearwood.info> Message-ID: <4DE6CF87.3080604@stoneleaf.us> Steven D'Aprano wrote: > Carl M. Johnson wrote: > >> Agreed, sure, we have to treat the LHS of = as >> a local. But += is fundamentally different. > > > No it's not. It is fundamentally the same. Augmented assignment in > Python *is* assignment, equivalent to x = x.__iadd__(other). 
Or x = x.__add__(other) if no __iadd__ exists for the object. ~Ethan~ From g.brandl at gmx.net Thu Jun 2 07:17:19 2011 From: g.brandl at gmx.net (Georg Brandl) Date: Thu, 02 Jun 2011 07:17:19 +0200 Subject: [Python-ideas] Why does += trigger UnboundLocalError? In-Reply-To: References: Message-ID: On 01.06.2011 10:26, Carl M. Johnson wrote: > > > On Tue, May 31, 2011 at 9:05 PM, Georg Brandl > wrote: > > Sure, this can only work if the local is assigned somewhere before the augmented > assign statement. But this is just like accessing a local before its > assignment: in the case of > > x = 1 > def f(): > print x > x = 2 > > we also don't treat the first "x" reference as a nonlocal. > > > I don't think that's a counterexample to the point I'm trying to make. We all > agree that if there's an x= somewhere in the function body, then we have to > treat the variable as a local. The only possible way around that would be to > solve the halting problem in order to figure out if a particular line of code > will be reached or not. Agreed, sure, we have to treat the LHS of = as a local. > But += is fundamentally different. You keep saying that, but I just can't see how += is fundamentally different from =, given its definition as x = x.__iadd__(y). This is a situation that comes up from time to time, where it seems logical to make a change that satisfies "DWIM" feelings, but makes the languge more inconsistent by introducing special cases. This doesn't feel right to me (and the Zen agrees ;) Georg From cmjohnson.mailinglist at gmail.com Thu Jun 2 07:17:58 2011 From: cmjohnson.mailinglist at gmail.com (Carl M. Johnson) Date: Wed, 1 Jun 2011 19:17:58 -1000 Subject: [Python-ideas] Why does += trigger UnboundLocalError? In-Reply-To: <4DE6C968.3000301@pearwood.info> References: <4DE6C968.3000301@pearwood.info> Message-ID: On Wed, Jun 1, 2011 at 1:21 PM, Steven D'Aprano wrote: > Currently, the rule is simple: any assignment tells the compiler to treat x > as local. If you want nonlocal or global, you have to declare it as such. > Nice and simple. What actual real-world problem are you trying to solve that > you want to change this behaviour? The best counter-arguments I've heard so far are Nick's (it would be a pain to go into the guts and change this, and you also need to think about PyPy, Jython, IronPy, etc., etc.) and this one. In terms of "real world problems" this solves, it makes the solution to the Paul Graham language challenge problem (build a function that returns an accumulator) one line shorter. Which is a bit silly, but so far as I can tell, nonlocal was created just to say we have an answer to the Paul Graham question. ;-) I think the benefit of saving that one line is probably outweighed by the brittleness that this would create (ie. changing x += 1 to x = x + 1 could break code), so I withdraw the proposal, at least for now. One additional problem that I ran into is this: >>> def f(): ... nonlocal count ... return count ... SyntaxError: no binding for nonlocal 'count' found Nonlocal fails at the compilation stage if the variable isn't found. On the other hand, attribute lookup is delayed until runtime, so if by accident you did def f(): count = 0 def g(): cont += 1 #oops typo. return cont return g it's not clear when the function should fail: compile time or runtime. -- Carl -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From ncoghlan at gmail.com Thu Jun 2 07:37:47 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Thu, 2 Jun 2011 15:37:47 +1000 Subject: [Python-ideas] Adding 'bytes' as alias for 'latin_1' codec. In-Reply-To: <4DE67DB4.3070304@stoneleaf.us> References: <4DE0481E.7010005@canterbury.ac.nz> <79306.1306858606@parc.com> <98069.1306946043@parc.com> <4DE67DB4.3070304@stoneleaf.us> Message-ID: On Thu, Jun 2, 2011 at 3:58 AM, Ethan Furman wrote: > A byte stream with multiple encodings? ?Now *that* seems wrong! Unicode encodings are just one serialisation format specific to text data. bytes objects may contain *any* serialisation format (e.g. zip archives, Python pickles, Python marshal files, packed binary data, innumerable wire protocols both standard and proprietary). Cheers, Nick. -- Nick Coghlan?? |?? ncoghlan at gmail.com?? |?? Brisbane, Australia From ncoghlan at gmail.com Thu Jun 2 07:49:58 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Thu, 2 Jun 2011 15:49:58 +1000 Subject: [Python-ideas] Why does += trigger UnboundLocalError? In-Reply-To: References: <4DE6C968.3000301@pearwood.info> Message-ID: On Thu, Jun 2, 2011 at 3:17 PM, Carl M. Johnson wrote: > > > On Wed, Jun 1, 2011 at 1:21 PM, Steven D'Aprano wrote: >> >> Currently, the rule is simple: any assignment tells the compiler to treat >> x as local. If you want nonlocal or global, you have to declare it as such. >> Nice and simple. What actual real-world problem are you trying to solve that >> you want to change this behaviour? > > The best counter-arguments I've heard so far are Nick's (it would be a pain > to go into the guts and change this, and you also need to think about PyPy, > Jython, IronPy, etc., etc.) and this one. > In terms of "real world problems" this solves, it makes the solution to the > Paul Graham language challenge problem (build a function that returns an > accumulator) one line shorter. Which is a bit silly, but so far as I can > tell, nonlocal was created just to say we have an answer to the Paul Graham > question. ;-) Nah, nonlocal was added because the introduction of decorators increased the use of closures, and boxing and unboxing variables manually is a PITA. Note that the "translation" of 'x += y' to 'x = x + y' is and always has been a gross oversimplification (albeit a useful one). Reality is complicated by possible provision of __iadd__ by the assignment target, as well as the need to pair up __getitem__/__setitem__ and __getattr__/__setattr__ appropriately when the target is a subscript operation or attribute access. Cheers, Nick. -- Nick Coghlan?? |?? ncoghlan at gmail.com?? |?? Brisbane, Australia From tjreedy at udel.edu Thu Jun 2 08:30:28 2011 From: tjreedy at udel.edu (Terry Reedy) Date: Thu, 02 Jun 2011 02:30:28 -0400 Subject: [Python-ideas] Adding 'bytes' as alias for 'latin_1' codec. In-Reply-To: References: <4DE0481E.7010005@canterbury.ac.nz> <79306.1306858606@parc.com> <98069.1306946043@parc.com> <4DE67DB4.3070304@stoneleaf.us> Message-ID: On 6/2/2011 1:37 AM, Nick Coghlan wrote: > On Thu, Jun 2, 2011 at 3:58 AM, Ethan Furman wrote: >> A byte stream with multiple encodings? Now *that* seems wrong! > > Unicode encodings are just one serialisation format specific to text > data. bytes objects may contain *any* serialisation format (e.g. zip > archives, Python pickles, Python marshal files, packed binary data, > innumerable wire protocols both standard and proprietary). 
One result of this thread is that I see much better the value of separating the ancient human level concepts of character and text from the (3) decades old computer concept of byte. Numbers, lists, and dicts are other old human concepts. As Nick implies above, bytes (or bits within them) are used to encode all data for computer processing. The confusion of character with byte in the original design of Python both privileged and burdened text processing. -- Terry Jan Reedy From guido at python.org Thu Jun 2 19:58:55 2011 From: guido at python.org (Guido van Rossum) Date: Thu, 2 Jun 2011 10:58:55 -0700 Subject: [Python-ideas] Adding 'bytes' as alias for 'latin_1' codec. In-Reply-To: References: <4DE0481E.7010005@canterbury.ac.nz> <79306.1306858606@parc.com> <98069.1306946043@parc.com> <4DE67DB4.3070304@stoneleaf.us> Message-ID: On Wed, Jun 1, 2011 at 11:30 PM, Terry Reedy wrote: > The confusion of character with byte in the original design of Python both > privileged and burdened text processing. Right. And it wasn't only Python: most languages created around or before that time had the same issues (perhaps starting with C's use of "char" meaning byte). Even most IP protocols developed in the 1990s confuse character set and encoding (witness HTTP's "Content-type: text/plain; charset=utf-8"). I'm glad in Python 3 we undertook to improve the distinction. -- --Guido van Rossum (python.org/~guido) From guido at python.org Thu Jun 2 20:11:09 2011 From: guido at python.org (Guido van Rossum) Date: Thu, 2 Jun 2011 11:11:09 -0700 Subject: [Python-ideas] Minor tweak to PEP 8? In-Reply-To: References: <20110510104754.4689cc5e@bhuda.mired.org> <87y62ejl2j.fsf@benfinney.id.au> Message-ID: FYI, I've submitted this change to PEP 8, with the help of a draft patch by Steven Klass. --Guido On Wed, May 11, 2011 at 7:23 AM, Guido van Rossum wrote: > At Google we use the following rule (from > http://google-styleguide.googlecode.com/svn/trunk/pyguide.html#Indentation): > > Yes:? # Aligned with opening delimiter > ? ? ? foo = long_function_name(var_one, var_two, > ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ?var_three, var_four) > > ? ? ? # 4-space hanging indent; nothing on first line > ? ? ? foo = long_function_name( > ? ? ? ? ? var_one, var_two, var_three, > ? ? ? ? ? var_four) > > No: ? # Stuff on first line forbidden > ? ? ? foo = long_function_name(var_one, var_two, > ? ? ? ? ? var_three, var_four) > > ? ? ? # 2-space hanging indent forbidden > ? ? ? foo = long_function_name( > ? ? ? ? var_one, var_two, var_three, > ? ? ? ? var_four) > > I propose we somehow incorporate these two allowed alternatives into PEP 8. > They both serve a purpose. > > -- > --Guido van Rossum (python.org/~guido) > > -- --Guido van Rossum (python.org/~guido) From barry at python.org Thu Jun 2 21:00:31 2011 From: barry at python.org (Barry Warsaw) Date: Thu, 2 Jun 2011 15:00:31 -0400 Subject: [Python-ideas] Minor tweak to PEP 8? References: <20110510104754.4689cc5e@bhuda.mired.org> <87y62ejl2j.fsf@benfinney.id.au> Message-ID: <20110602150031.2b98524f@neurotica.wooz.org> On Jun 02, 2011, at 11:11 AM, Guido van Rossum wrote: >FYI, I've submitted this change to PEP 8, with the help of a draft >patch by Steven Klass. Thanks Guido. This is probably the right mailing list to follow up to (ignore my python-dev followup to the -checkins message). I agree with the change, except for the recommendation to double-indent. 
Yes, double-indent does look better with Google's 2-space indentation level rule, but it looks excessive (to my eyes anyway) with a 4-space rule. One indentation level looks fine. I posted some examples to python-dev. Is it worth softening the PEP 8 recommendation on double-indents? Cheers, -Barry >On Wed, May 11, 2011 at 7:23 AM, Guido van Rossum wrote: >> At Google we use the following rule (from >> http://google-styleguide.googlecode.com/svn/trunk/pyguide.html#Indentation): >> >> Yes:? # Aligned with opening delimiter >> ? ? ? foo = long_function_name(var_one, var_two, >> ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ?var_three, var_four) >> >> ? ? ? # 4-space hanging indent; nothing on first line >> ? ? ? foo = long_function_name( >> ? ? ? ? ? var_one, var_two, var_three, >> ? ? ? ? ? var_four) >> >> No: ? # Stuff on first line forbidden >> ? ? ? foo = long_function_name(var_one, var_two, >> ? ? ? ? ? var_three, var_four) >> >> ? ? ? # 2-space hanging indent forbidden >> ? ? ? foo = long_function_name( >> ? ? ? ? var_one, var_two, var_three, >> ? ? ? ? var_four) >> >> I propose we somehow incorporate these two allowed alternatives into PEP 8. >> They both serve a purpose. >> >> -- >> --Guido van Rossum (python.org/~guido) >> >> > > > -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 836 bytes Desc: not available URL: From tjreedy at udel.edu Thu Jun 2 22:14:30 2011 From: tjreedy at udel.edu (Terry Reedy) Date: Thu, 02 Jun 2011 16:14:30 -0400 Subject: [Python-ideas] Adding 'bytes' as alias for 'latin_1' codec. In-Reply-To: References: <4DE0481E.7010005@canterbury.ac.nz> <79306.1306858606@parc.com> <98069.1306946043@parc.com> <4DE67DB4.3070304@stoneleaf.us> Message-ID: On 6/2/2011 1:58 PM, Guido van Rossum wrote: > On Wed, Jun 1, 2011 at 11:30 PM, Terry Reedy wrote: >> The confusion of character with byte in the original design of Python both >> privileged and burdened text processing. > > Right. And it wasn't only Python: most languages created around or > before that time had the same issues (perhaps starting with C's use of > "char" meaning byte). Even most IP protocols developed in the 1990s > confuse character set and encoding (witness HTTP's "Content-type: > text/plain; charset=utf-8"). I hold Python to a higher standard. But yes, that is badly confused. > I'm glad in Python 3 we undertook to improve the distinction. I am a bit embarassed that I did not see sooner that characters are for people and bytes for computers. Thus Python produces both character and byte serializations for objects. On the coding front: when I first did statistics on computers (1970s), all data were coded with numbers. For instance, Sex: male = 1; female = 2; unknown = 9. In the 1980s, we could use letters (which became ascii codes): male = 'm'; female = 'f'; unknown = ' '. For a US-only project, this seemed like an advance. So I though then. For a global project, it would have been the opposite. For a Spanish speaker, 'm' might seem to mean 'mujer' (woman). For many others around the world, euro-indic digits are more familiar and easier to read than latin letters. I am less ethnocentric now. I'm glad Python has become more of a global language, even if English based. 
-- Terry Jan Reedy From python-dev at realityexists.net Fri Jun 3 06:07:39 2011 From: python-dev at realityexists.net (Evan Martin) Date: Fri, 03 Jun 2011 14:07:39 +1000 Subject: [Python-ideas] date.datetime() method to convert from date to datetime Message-ID: <4DE85E0B.5090906@realityexists.net> There is a datetime.date() method for converting from a datetime to a date, but no stdlib method to do the opposite conversion. Could a date.datetime() method be added that returns a datetime with the time component set to zero? Alternatively, if the object is already a datetime, it could simply return itself. This isn't exactly hard to do in user code, but the obvious way of doing it is a bit too verbose for such a simple operation. Less verbose ways are not obvious or are error-prone, as this StackOverflow question shows: http://stackoverflow.com/questions/1937622/convert-date-to-datetime-in-python -- Evan Martin From ncoghlan at gmail.com Fri Jun 3 06:40:36 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Fri, 3 Jun 2011 14:40:36 +1000 Subject: [Python-ideas] Adding 'bytes' as alias for 'latin_1' codec. In-Reply-To: References: <4DE0481E.7010005@canterbury.ac.nz> <79306.1306858606@parc.com> <98069.1306946043@parc.com> <4DE67DB4.3070304@stoneleaf.us> Message-ID: On Fri, Jun 3, 2011 at 6:14 AM, Terry Reedy wrote: > I am a bit embarassed that I did not see sooner that characters are for > people and bytes for computers. Thus Python produces both character and byte > serializations for objects. FWIW, even after being involved in the assorted bytes/str design discussions for Py3k, I didn't really "get it" myself until I made the changes to urllib.parse in Python 3.2 to get most of the APIs to accept both str objects and byte sequences. The contrast between my first attempt (which tried to provide a common code path that handled both strings and byte sequences without trashing the encoding of the latter) and my second (which just decodes and reencodes byte sequences using strict ASCII and punts on malformed URLs containing non-ASCII values) was amazing. My original plan was to benchmark them before choosing, but the latter approach was so much simpler and cleaner than the former that it wasn't even a contest. Focusing efforts on things like PEP 393, and perhaps even a memoryview based "strview" is likely to be a more fruitful way forward than trying to shoehorn text-specific concerns into the general binary storage types (and, as noted, the long release cycle means the standard library is the wrong place for that kind of experimentation). Cheers, Nick. -- Nick Coghlan?? |?? ncoghlan at gmail.com?? |?? Brisbane, Australia From ncoghlan at gmail.com Fri Jun 3 06:45:25 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Fri, 3 Jun 2011 14:45:25 +1000 Subject: [Python-ideas] date.datetime() method to convert from date to datetime In-Reply-To: <4DE85E0B.5090906@realityexists.net> References: <4DE85E0B.5090906@realityexists.net> Message-ID: On Fri, Jun 3, 2011 at 2:07 PM, Evan Martin wrote: > There is a datetime.date() method for converting from a datetime to a date, > but no stdlib method to do the opposite conversion. Could a date.datetime() > method be added that returns a datetime with the time component set to zero? > Alternatively, if the object is already a datetime, it could simply return > itself. > > This isn't exactly hard to do in user code, but the obvious way of doing it > is a bit too verbose for such a simple operation. 
Less verbose ways are not > obvious or are error-prone, as this StackOverflow question shows: > http://stackoverflow.com/questions/1937622/convert-date-to-datetime-in-python Alternatively, the second argument to datetime.combine() could be made optional (defaulting to midnight). Cheers, Nick. -- Nick Coghlan?? |?? ncoghlan at gmail.com?? |?? Brisbane, Australia From ben+python at benfinney.id.au Fri Jun 3 08:03:29 2011 From: ben+python at benfinney.id.au (Ben Finney) Date: Fri, 03 Jun 2011 16:03:29 +1000 Subject: [Python-ideas] date.datetime() method to convert from date to datetime References: <4DE85E0B.5090906@realityexists.net> Message-ID: <87wrh34fke.fsf@benfinney.id.au> Evan Martin writes: > There is a datetime.date() method for converting from a datetime to a > date, but no stdlib method to do the opposite conversion. Could a > date.datetime() method be added that returns a datetime with the time > component set to zero? What is ?zero? for a time-of-day? Do you mean ?midnight on that day?? In what timezone? -- \ ?Faith, n. Belief without evidence in what is told by one who | `\ speaks without knowledge, of things without parallel.? ?Ambrose | _o__) Bierce, _The Devil's Dictionary_, 1906 | Ben Finney From python-dev at realityexists.net Fri Jun 3 14:46:35 2011 From: python-dev at realityexists.net (Evan Martin) Date: Fri, 03 Jun 2011 22:46:35 +1000 Subject: [Python-ideas] date.datetime() method to convert from date to datetime In-Reply-To: References: <4DE85E0B.5090906@realityexists.net> Message-ID: <4DE8D7AB.3040902@realityexists.net> Yes, zero means the midnight of the date. No timezone - it's a naive datetime, same as the original date. The method would return exactly the same result as datetime.combine(date, time()) -- Evan Martin From alexander.belopolsky at gmail.com Fri Jun 3 15:11:32 2011 From: alexander.belopolsky at gmail.com (Alexander Belopolsky) Date: Fri, 3 Jun 2011 09:11:32 -0400 Subject: [Python-ideas] date.datetime() method to convert from date to datetime In-Reply-To: <4DE85E0B.5090906@realityexists.net> References: <4DE85E0B.5090906@realityexists.net> Message-ID: On Fri, Jun 3, 2011 at 12:07 AM, Evan Martin wrote: > There is a datetime.date() method for converting from a datetime to a date, > but no stdlib method to do the opposite conversion. Could a date.datetime() > method be added that returns a datetime with the time component set to zero? > Alternatively, if the object is already a datetime, it could simply return > itself. My preferred alternative to this idea would be to allow datetime constructor to take date (or datetime). This would make date/datetime behave similar to int/float. If this is done, I would also like the single-argument constructor to accept str in ISO format. From ben+python at benfinney.id.au Sat Jun 4 02:08:32 2011 From: ben+python at benfinney.id.au (Ben Finney) Date: Sat, 04 Jun 2011 10:08:32 +1000 Subject: [Python-ideas] date.datetime() method to convert from date to datetime References: <4DE85E0B.5090906@realityexists.net> Message-ID: <87oc2e4fwf.fsf@benfinney.id.au> Alexander Belopolsky writes: > My preferred alternative to this idea would be to allow datetime > constructor to take date (or datetime). This would make date/datetime > behave similar to int/float. +1. They are conceptually very similar types, so it makes sense for the constructor to accept each of them. > If this is done, I would also like the single-argument constructor to > accept str in ISO format. 
?1, please don't overload a type's default constructor to the point of accepting all sorts of unrelated types. I think ?datetime.fromstring?, if implemented, should be a separate alternative constructor (by whatever spelling). -- \ ?I knew it was a shocking thing to say, but ? no-one has the | `\ right to spend their life without being offended.? ?Philip | _o__) Pullman, 2010-03-28 | Ben Finney From python-dev at realityexists.net Sat Jun 4 03:10:38 2011 From: python-dev at realityexists.net (Evan Martin) Date: Sat, 04 Jun 2011 11:10:38 +1000 Subject: [Python-ideas] date.datetime() method to convert from date to datetime In-Reply-To: <87oc2e4fwf.fsf@benfinney.id.au> References: <4DE85E0B.5090906@realityexists.net> <87oc2e4fwf.fsf@benfinney.id.au> Message-ID: <4DE9860E.1070003@realityexists.net> On 4/06/2011 10:08 AM, Ben Finney wrote: > Alexander Belopolsky > writes: > >> My preferred alternative to this idea would be to allow datetime >> constructor to take date (or datetime). This would make date/datetime >> behave similar to int/float. > +1. They are conceptually very similar types, so it makes sense for the > constructor to accept each of them. I think that would work well if we had overloaded constructors, but without them it might complicate things too much. What would the signature look like? Also, that would be inconsistent with how a datetime is converted to a date - as a user I would then expect the date constructor to take a datetime, too. If datetime.date() converts a datetime to a date then I naturally think to call date.datetime() to go the other way. -- Evan Martin From ericsnowcurrently at gmail.com Sat Jun 4 08:11:31 2011 From: ericsnowcurrently at gmail.com (Eric Snow) Date: Sat, 4 Jun 2011 00:11:31 -0600 Subject: [Python-ideas] objects aware of being bound to a name Message-ID: When a name is bound to an object, the object doesn't hear about it, usually. The exceptions I could think of are class and function definitions, and imports. Also, you can sneak around it using setattr/descriptors if you control the class... However, objects are otherwise (and generally) blind to their names. This is relevant when you want an object to be aware of the contexts in which it exists. It also relates to DRY issues that people bring up sometimes with regards to descriptors and namedtuple (though I'm not sure its that big a deal). Here are three approaches that satisfy this situation: 1. have assignment automatically pass the name to the __init__ when binding a new instance. 2. bind the name to __name__ on the object before calling __init__. 3. call __bound__(self, name, obj) on the object before __init__ is called (or maybe after). The first is the approach import/class/def take. You can't really generalize that, though, since most classes don't have a name parameter. Both the first and second approach only work when the object to be bound is instantiated. The second seems to work better than the first, but __name__ shouldn't be re-bound on an object every time it is involved in an assignment. It might be a good special-case, though. I like the third option because it could be tried for any name binding, from assignment to function arguments. However, that may be its downfall too. I would guess that name binding happens more than just once or twice during the course of execution . I would also guess that it would kill performance. However, I don't know the ins and outs of the compiler/runtime so I could be pleasantly wrong. 
In addition to __bound__, an __unbound__ could be leveraged (wait for it) to let an object know when it has been unbound from a name (during del or when another object is bound to the name). Of course you get double the performance hit from just __bound__. Like most things, this can already be done, just not cleanly. Here's an example of more or less equivalent code: class Something: def __init__(self): self._names = set() def __bound__(self, name, obj): if (name, obj) in set: return self._names.add((name, obj)) def __unbound__(self, name, obj): self._names.remove((name, obj)) obj = __import__(__file__.rsplit(".py", 1)[0]) something = Something() something.__bound__("something", obj) something.__unbound__("something", obj) something = Something() something.__bound__("something", obj) something.__unbound__("something", obj) del something So you can do it already, explicitly, but it's a mess. I wouldn't be surprised if there was a way to be smarter about when to call __bound__/__unbound__ to alleviate the performance hit, but I don't see it. I also wouldn't be surprised if there was a trivial way to do this, or if no one's brought it up because it's such an obviously bad idea! :) Maybe I just need to get go some sleep. Regardless, this idea hit me suddenly while I was working on something else. I don't remember what prompted the idea, but I at least wanted to float it out there. Even if it's a terrible idea, I think the concept of letting the bound object know how it's bound is an interesting one. It's an angle I had not considered before. Thanks, -eric From guido at python.org Sat Jun 4 18:52:03 2011 From: guido at python.org (Guido van Rossum) Date: Sat, 4 Jun 2011 09:52:03 -0700 Subject: [Python-ideas] Fwd: Minor tweak to PEP 8? In-Reply-To: References: <20110510104754.4689cc5e@bhuda.mired.org> <87y62ejl2j.fsf@benfinney.id.au> <20110602150031.2b98524f@neurotica.wooz.org> <20110602181113.5be70efc@neurotica.wooz.org> <20110603101906.25dbf2f7@neurotica.wooz.org> <20110603140923.0c8fcb90@neurotica.wooz.org> Message-ID: [Correct list address] Yeah, please prepare a patch. Maybe you can send it to Barry so he can check it in. I would use 4-space indent for the "Opt:" exanple at the end though. On Sat, Jun 4, 2011 at 9:03 AM, Steven Klass wrote: > Hey Barry, > You are correct - it was an artifact from the email tool.. > Guido - it appears we are reaching some consensus. Thoughts? Did you want me > to update the repo? > ?? ?Continuation lines should align wrapped elements either vertically > using > ?? ?Python's implicit line joining inside parentheses, brackets and braces, > or > ?? ?or using a hanging indent. ?When using a hanging indent the following > ?? ?considerations should be applied; there should be no arguments on the > ?? ?first line and further indentation should be used to clearly distinguish > ?? ?the itself as a continuation line. > > ?? ?Yes: ?# Aligned with opening delimiter > ?? ? ? ? ?foo = long_function_name(var_one, var_two, > ?? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? var_three, var_four) > ?? ? ? ? ?# More indentation included to distiguish this from the rest. > ?? ? ? ? ?def long_function_name( > ?? ? ? ? ? ? ? ? ?var_one, var_two, var_three, > ?? ? ? ? ? ? ? ? ?var_four): > ?? ? ? ? ? ? ?print(var_one) > ?? ?No: ? # Stuff on first line forbidden when not using vertical alignment > ?? ? ? ? ?foo = long_function_name(var_one, var_two, > ?? ? ? ? ? ? ?var_three, var_four) > ?? ? ? ? ?# Further indentation required as indentation is not > distiguishable > ?? ? ? ? 
?def long_function_name( > ?? ? ? ? ? ? ?var_one, var_two, var_three, > ?? ? ? ? ? ? ?var_four): > ?? ? ? ? ? ? ?print(var_one) > > ?? ?Opt: ?# Extra indentation not necessary. > ?? ? ? ? ?foo = long_function_name( > ?? ? ? ? ? ?var_one, var_two, > ?? ? ? ? ? ?var_three, var_four) > > > > On Fri, Jun 3, 2011 at 11:09 AM, Barry Warsaw wrote: >> >> Hi Steven, >> >> On Jun 03, 2011, at 08:57 AM, Steven Klass wrote: >> >> > ? ?Continuation lines should align wrapped elements either vertically >> > using >> > ? ?Python's implicit line joining inside parentheses, brackets and >> > braces, >> > ? ?or or using a hanging indent. ?When using a hanging indent the >> > following >> > ? ?considerations should be applied; there should be no arguments on the >> > ? ?first line and further indentation should be used to clearly >> > distinguish >> > ? ?the itself as a continuation line. >> > >> > ? ?Yes: ?# Aligned with opening delimiter >> > ? ? ? ? ?foo = long_function_name(var_one, var_two, >> > ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ?var_three, var_four) >> -------------------------------------^ >> I'm sure that's just an email alignment bug, right? >> >> > >> > ? ? ? ? ?# More indentation included to distiguish this from the rest. >> > ? ? ? ? ?def long_function_name( >> > ? ? ? ? ? ? ? ? ?var_one, var_two, var_three, >> > ? ? ? ? ? ? ? ? ?var_four): >> > ? ? ? ? ? ? ?print(var_one) >> > >> > >> > ? ?No: ? # Stuff on first line forbidden when not using vertical >> > alignment >> > ? ? ? ? ?foo = long_function_name(var_one, var_two, >> > ? ? ? ? ? ? ?var_three, var_four) >> > >> > ? ? ? ? ?# Further indentation required as indentation is not >> > distiguishable >> > ? ? ? ? ?def long_function_name( >> > ? ? ? ? ? ? ?var_one, var_two, var_three, >> > ? ? ? ? ? ? ?var_four): >> > ? ? ? ? ? ? ?print(var_one) >> > >> > >> >Thoughts? >> >> That looks great to me, thanks. ?I would add one more example to the 'Yes' >> section to cover the case we've been talking about: >> >> ? ? ? ?# Extra indentation not necessary. >> ? ? ? ?foo = long_function_name( >> ? ? ? ? ? ?var_one, var_two, >> ? ? ? ? ? ?var_three, var_four) >> >> This doesn't say that extra indentation isn't allowed, just that it's not >> necessary, so I think it strikes the right balance. >> >> Cheers, >> -Barry > > > > -- > --- > > Steven M. Klass > > ? 1 (480) 225-1112 > ? sklass at pointcircle.com > -- --Guido van Rossum (python.org/~guido) From tjreedy at udel.edu Sun Jun 5 03:54:42 2011 From: tjreedy at udel.edu (Terry Reedy) Date: Sat, 04 Jun 2011 21:54:42 -0400 Subject: [Python-ideas] objects aware of being bound to a name In-Reply-To: References: Message-ID: On 6/4/2011 2:11 AM, Eric Snow wrote: > When a name is bound to an object, the object doesn't hear about it, Objects are created. Then they may *optionally* be bound to names (possibly in a different namespace) and collection slots. Or they may be used in an expression and be discarded without every being bound to anything. > usually. I believe never. > The exceptions I could > think of are class and function definitions, and imports. Functions, classes, and modules have __name__ attributes (that cannot be deleted, even if they can be replaced). This attribute is set before they are optionally bound, so they also do not hear about the subsequent bindings. This attribute is used for their string representations for humans to read. I cannot think of any other use by the interpreter itself. > Also, you can sneak around it using > setattr/descriptors if you control the class... 
However, objects are > otherwise (and generally) blind to their names. Object (or definition or intrinsic or attribute) names (exactly 1 for certain instances of certain classes) and namespace binding names (0-many for every object) are different concepts. Ojects names live on the object. The binding names live in one of the many namespaces. Object names are not unique among objects. Binding names are unique within each namespace. Objects names do not have to match any of the binding names of the object. A common type of example of this is >>> import itertools as it >>> it.__name__ 'itertools' > This is relevant when you want an object to be aware of the contexts > in which it exists. Python objects exist within an anonymous object space with no structure. They can be used within any namespace context from which they can be accessed via names and slots. Adding a __name__ attribute to instances of a class will not say anything about use context. Modules carry source information in __filename__ for convenience in error reporting. Classes and functions have the *name* of their creation context in __module__ for the same reason. I do not believe that __filename or __module__ have any operational meaning after the object is created. For functions (but not classes) __globals__ *is* needed to function. Instances have __class__, which is used for both attribute lookup and information display. > It also relates to DRY issues that people bring up sometimes > with regards to descriptors and namedtuple (though > I'm not sure its that big a deal). If one wants an object name and initial binding name to be the same, I agree that giving the name once is a convenience. Def, class, and import *statements* allow this for functions, classes, and modules. Each has a syntax peculiar to the class. Lambda expressions, type() calls, __import__ calls (and others in inspect create objects with __name__s but without an automatic binding operation. Ideas about not repeating duplicate object/binding names was discussed here in a thread with several posts within the last year. I believe one idea was a new 'augmented assignment': something like 'x .= y' meaning "y.__name__ = 'x'; x = y". But there was some problem with every proposal. -- Terry Jan Reedy From ncoghlan at gmail.com Sun Jun 5 07:50:30 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sun, 5 Jun 2011 15:50:30 +1000 Subject: [Python-ideas] objects aware of being bound to a name In-Reply-To: References: Message-ID: On Sun, Jun 5, 2011 at 11:54 AM, Terry Reedy wrote: > Functions, classes, and modules have __name__ attributes (that cannot be > deleted, even if they can be replaced). This attribute is set before they > are optionally bound, so they also do not hear about the subsequent > bindings. This attribute is used for their string representations for humans > to read. I cannot think of any other use by the interpreter itself. __name__ attributes are also relevant for serialisation (esp. pickling). However, due to immutable objects, there's no realistic general purpose solution in this space. Cheers, Nick. -- Nick Coghlan?? |?? ncoghlan at gmail.com?? |?? Brisbane, Australia From cmjohnson.mailinglist at gmail.com Sun Jun 5 10:00:55 2011 From: cmjohnson.mailinglist at gmail.com (Carl M. Johnson) Date: Sat, 4 Jun 2011 22:00:55 -1000 Subject: [Python-ideas] objects aware of being bound to a name In-Reply-To: References: Message-ID: This is in principle doable with metaclasses (like everything else :-D)... 
Here's what I whipped up in the console: >>> class MagicDict(dict): ... def __setitem__(self, key, value): ... print("Binding key: {} to value: {}".format(key, value)) ... if hasattr(value, "__bind__"): ... value.__bind__(key) ... super().__setitem__(key, value) ... >>> class Bindable: ... def __bind__(self, key): ... print("{} was bound to {}".format(self, key)) ... >>> class MetaBinding(type): ... @classmethod ... def __prepare__(metacls, name, bases): ... return MagicDict() ... >>> class BindingReports(metaclass=MetaBinding): ... a = 1 ... b = Bindable() ... c = "blah" ... Binding key: __module__ to value: __main__ Binding key: a to value: 1 Binding key: b to value: <__main__.Bindable object at 0x100603b50> <__main__.Bindable object at 0x100603b50> was bound to b Binding key: c to value: blah Not sure how useful this is. I don't like using the term "class" for things where you're not really trying to bundle together methods or create a type, just change the way values get bound. -- Carl Johnson -------------- next part -------------- An HTML attachment was scrubbed... URL: From ericsnowcurrently at gmail.com Mon Jun 6 17:48:07 2011 From: ericsnowcurrently at gmail.com (Eric Snow) Date: Mon, 6 Jun 2011 09:48:07 -0600 Subject: [Python-ideas] objects aware of being bound to a name In-Reply-To: References: Message-ID: On Sat, Jun 4, 2011 at 11:50 PM, Nick Coghlan wrote: > On Sun, Jun 5, 2011 at 11:54 AM, Terry Reedy wrote: >> Functions, classes, and modules have __name__ attributes (that cannot be >> deleted, even if they can be replaced). This attribute is set before they >> are optionally bound, so they also do not hear about the subsequent >> bindings. This attribute is used for their string representations for humans >> to read. I cannot think of any other use by the interpreter itself. > > __name__ attributes are also relevant for serialisation (esp. pickling). > > However, due to immutable objects, there's no realistic general > purpose solution in this space. > Yeah, I think I was hasty on writing this up. It's interesting, but not a great fit, nor very practical. The immutable objects problem is definitely a show-stopper. Thanks for the feedback though. -eric > Cheers, > Nick. > > -- > Nick Coghlan?? |?? ncoghlan at gmail.com?? |?? Brisbane, Australia > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > From andrew at acooke.org Mon Jun 6 18:11:23 2011 From: andrew at acooke.org (andrew cooke) Date: Mon, 6 Jun 2011 12:11:23 -0400 Subject: [Python-ideas] objects aware of being bound to a name In-Reply-To: References: Message-ID: <20110606161123.GE11101@acooke.org> I'm not sure what you're trying to do (ie if there's any practical problem that motivates this), but in Lepl (a parser) I use a little trick that lets me examine variables defined within a "with" scope. That lets me add "debugging" at the application level. There's an example here: http://www.acooke.org/lepl/debugging.html#variable-traces - everything defined inside the "with TraceVariables" is found by inspection of some Python internals doo-hicky, and then modified to produce the debug output (note that the output incldues the variable *names* which is the kind of thing you are trying to do here). Contact me if you want more info. 
Andrew On Mon, Jun 06, 2011 at 09:48:07AM -0600, Eric Snow wrote: > On Sat, Jun 4, 2011 at 11:50 PM, Nick Coghlan wrote: > > On Sun, Jun 5, 2011 at 11:54 AM, Terry Reedy wrote: > >> Functions, classes, and modules have __name__ attributes (that cannot be > >> deleted, even if they can be replaced). This attribute is set before they > >> are optionally bound, so they also do not hear about the subsequent > >> bindings. This attribute is used for their string representations for humans > >> to read. I cannot think of any other use by the interpreter itself. > > > > __name__ attributes are also relevant for serialisation (esp. pickling). > > > > However, due to immutable objects, there's no realistic general > > purpose solution in this space. > > > > Yeah, I think I was hasty on writing this up. It's interesting, but > not a great fit, nor very practical. The immutable objects problem is > definitely a show-stopper. Thanks for the feedback though. > > -eric > > > Cheers, > > Nick. > > > > -- > > Nick Coghlan?? |?? ncoghlan at gmail.com?? |?? Brisbane, Australia > > _______________________________________________ > > Python-ideas mailing list > > Python-ideas at python.org > > http://mail.python.org/mailman/listinfo/python-ideas > > > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > From zuo at chopin.edu.pl Wed Jun 8 13:01:08 2011 From: zuo at chopin.edu.pl (Jan Kaliszewski) Date: Wed, 8 Jun 2011 13:01:08 +0200 Subject: [Python-ideas] Iteritems() function? Message-ID: <20110608110108.GA2703@chopin.edu.pl> Case ==== Quite typical: iterate and do something with some_items -- a collection of 2-element items. for first, second in some_items: ... But for dicts it must use another form: for first, second in some_items.items(): ... We must know it'll be a mapping, and even then quite usual bug is to forget to add that `items()'. But sometimes it may be a dict {first: second, ...} OR a seq [(first, second), ...] and in fact we are not interested in it -- we simply want to iterate over its items... But we are forced to do type/interface check, e.g.: if isinstance(coll, collections.Mapping): for first, second in some_items.items(): ... else: for first, second in some_items: ... Idea ==== A new function: builtins.iteritems() or builtins.iterpairs() or itertools.items() or itertools.pairs() (don't know what name would be the best) -- equivalent to: def (coll): iterable = (coll.items() if isinstance(coll, collections.Mapping) else coll) return iter(iterable) or maybe something like: def (coll): try: iterable = coll.items() except AttributeError: iterable = coll return iter(iterable) Usage ===== Then, in our example case, we'd do simply: for first, second in iteritems(some_items): ... And we don't need to think about some_items type, whether it's a mapping or 2-tuple sequence. All we need to know is that it's a collection of 2-element items. Regards, *j From masklinn at masklinn.net Wed Jun 8 13:25:42 2011 From: masklinn at masklinn.net (Masklinn) Date: Wed, 8 Jun 2011 13:25:42 +0200 Subject: [Python-ideas] Iteritems() function? In-Reply-To: <20110608110108.GA2703@chopin.edu.pl> References: <20110608110108.GA2703@chopin.edu.pl> Message-ID: <518A4DD6-AA95-4D68-9783-5FE993E07B7A@masklinn.net> On 2011-06-08, at 13:01 , Jan Kaliszewski wrote: > But sometimes it may be a dict {first: second, ...} OR a seq [(first, > second), ...] 
and in fact we are not interested in it -- we simply want > to iterate over its items... But we are forced to do type/interface > check, e.g.: > > if isinstance(coll, collections.Mapping): > for first, second in some_items.items(): ... > else: > for first, second in some_items: ? You could just convert everything to a dict: for first, second in dict(some_items).iteritems(): # etc? From phd at phdru.name Wed Jun 8 13:54:49 2011 From: phd at phdru.name (Oleg Broytman) Date: Wed, 8 Jun 2011 15:54:49 +0400 Subject: [Python-ideas] Iteritems() function? In-Reply-To: <20110608110108.GA2703@chopin.edu.pl> References: <20110608110108.GA2703@chopin.edu.pl> Message-ID: <20110608115449.GC21059@iskra.aviel.ru> On Wed, Jun 08, 2011 at 01:01:08PM +0200, Jan Kaliszewski wrote: > Quite typical: iterate and do something with some_items -- a collection > of 2-element items. > > for first, second in some_items: > ... > > But for dicts it must use another form: > > for first, second in some_items.items(): > ... > > We must know it'll be a mapping, and even then quite usual bug is to > forget to add that `items()'. You don't need a special buitin for that. Just call .items(): if hasattr(some_items, 'items'): some_items = some_items.items() for first, second in some_items: ... Oleg. -- Oleg Broytman http://phdru.name/ phd at phdru.name Programmers don't die, they just GOSUB without RETURN. From grosser.meister.morti at gmx.net Wed Jun 8 17:41:28 2011 From: grosser.meister.morti at gmx.net (=?ISO-8859-1?Q?Mathias_Panzenb=F6ck?=) Date: Wed, 08 Jun 2011 17:41:28 +0200 Subject: [Python-ideas] Iteritems() function? In-Reply-To: <20110608115449.GC21059@iskra.aviel.ru> References: <20110608110108.GA2703@chopin.edu.pl> <20110608115449.GC21059@iskra.aviel.ru> Message-ID: <4DEF9828.9070009@gmx.net> On 06/08/2011 01:54 PM, Oleg Broytman wrote: > On Wed, Jun 08, 2011 at 01:01:08PM +0200, Jan Kaliszewski wrote: >> Quite typical: iterate and do something with some_items -- a collection >> of 2-element items. >> >> for first, second in some_items: >> ... >> >> But for dicts it must use another form: >> >> for first, second in some_items.items(): >> ... >> >> We must know it'll be a mapping, and even then quite usual bug is to >> forget to add that `items()'. > > You don't need a special buitin for that. Just call .items(): > > if hasattr(some_items, 'items'): > some_items = some_items.items() > for first, second in some_items: > ... > > Oleg. So basically it would be: def items(sequence): if hasattr(sequence, 'items'): return sequence.items() else: return sequence I don't think that is enough complexity to justify an inclusion in builtins or itertools. Anyway, I would have expected such a function to do this (so it's not even obvious): def items(sequence): if hasattr(sequence, 'items'): return sequence.items() else: return enumerate(sequence) -panzi From ncoghlan at gmail.com Wed Jun 8 18:24:30 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Thu, 9 Jun 2011 02:24:30 +1000 Subject: [Python-ideas] Iteritems() function? In-Reply-To: <4DEF9828.9070009@gmx.net> References: <20110608110108.GA2703@chopin.edu.pl> <20110608115449.GC21059@iskra.aviel.ru> <4DEF9828.9070009@gmx.net> Message-ID: On Thu, Jun 9, 2011 at 1:41 AM, Mathias Panzenb?ck wrote: > I don't think that is enough complexity to justify an inclusion in builtins > or itertools. Anyway, I would have expected such a function to do this (so > it's not even obvious): > > def items(sequence): > ? ? ? ?if hasattr(sequence, 'items'): > ? ? ? ? ? ? ? 
return sequence.items() > else: > return sequence I expect the use case here is to implement APIs like the dict constructor and update() method - you can either pass them a mapping, or else an iterable of key-value pairs. As Oleg noted though, the traditional way of handling that is to ducktype on the "items" method, although an isinstance check against Mapping would indeed be an acceptable substitute these days. Either way, calling items() gets you an iterable of key-value pairs, so you write your algorithm to work on that and do a coercion via items() to handle the mapping case. Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From matt at vazor.com Wed Jun 8 19:57:28 2011 From: matt at vazor.com (Matt Billenstein) Date: Wed, 08 Jun 2011 18:57:28 +0100 Subject: [Python-ideas] Iteritems() function? Message-ID: <4tiv7fp5lkhs.sts845i@elasticemail.net> On Wed, Jun 08, 2011 at 01:25:42PM +0200, Masklinn wrote: > You could just convert everything to a dict: > > for first, second in dict(some_items).iteritems(): > # etc... That option won't necessarily preserve the order of the original sequence where perhaps it matters... m -- Matt Billenstein matt at vazor.com http://www.vazor.com/ From masklinn at masklinn.net Wed Jun 8 21:32:37 2011 From: masklinn at masklinn.net (Masklinn) Date: Wed, 8 Jun 2011 21:32:37 +0200 Subject: [Python-ideas] Iteritems() function? In-Reply-To: <4tiv7fp5lkhs.sts845i@elasticemail.net> References: <4tiv7fp5lkhs.sts845i@elasticemail.net> Message-ID: On 2011-06-08, at 19:57 , Matt Billenstein wrote: > On Wed, Jun 08, 2011 at 01:25:42PM +0200, Masklinn wrote: >> You could just convert everything to a dict: >> >> for first, second in dict(some_items).iteritems(): >> # etc... > > That option won't necessarily preserve the order of the original sequence where > perhaps it matters... > That is what collections.OrderedDict is for. From steve at pearwood.info Thu Jun 9 02:38:23 2011 From: steve at pearwood.info (Steven D'Aprano) Date: Thu, 09 Jun 2011 10:38:23 +1000 Subject: [Python-ideas] Iteritems() function? In-Reply-To: References: <4tiv7fp5lkhs.sts845i@elasticemail.net> Message-ID: <4DF015FF.9050109@pearwood.info> Masklinn wrote: > On 2011-06-08, at 19:57 , Matt Billenstein wrote: >> On Wed, Jun 08, 2011 at 01:25:42PM +0200, Masklinn wrote: >>> You could just convert everything to a dict: >>> >>> for first, second in dict(some_items).iteritems(): >>> # etc... >> That option won't necessarily preserve the order of the original sequence where >> perhaps it matters... >> > That is what collections.OrderedDict is for. But calling *dict* on the items (as shown above), not OrderedDict, doesn't preserve the order. And frankly, I think it's silly to take an arbitrarily big iterable of ordered items, convert it to a dict (ordered or not), only to immediately extract an ordered iterable of items again: OrderedDict(some_items).items() You already have some_items in the right format for iteration, why iterate over it twice instead of once? Better to use a simple helper function: def coerce_to_items(obj): if hasattr(obj, 'items'): return obj.items() return obj which accepts either a mapping (dict or OrderedDict) or an iterable of items, and returns an iterable of items. (I use items() rather than iteritems() because any proposed new functionality must be aimed at Python 3, not 2.) That's simple enough to use in-line, if you use small variable names: for a,b in (pairs.items() if hasattr(pairs, 'items') else pairs): ...
but I think the utility function is better. I don't think it needs to be a built-in. -- Steven From ericsnowcurrently at gmail.com Fri Jun 10 01:54:14 2011 From: ericsnowcurrently at gmail.com (Eric Snow) Date: Thu, 9 Jun 2011 17:54:14 -0600 Subject: [Python-ideas] inheriting docstrings and mutable docstings for classes Message-ID: I noticed that __doc__ for classes is immutable: >>> class X: ... "some doc" ... >>> X.__doc__ 'some doc' >>> X.__doc__ = "another doc" Traceback (most recent call last): File "", line 1, in AttributeError: attribute '__doc__' of 'type' objects is not writable That is on 3.3, but apparently it's the case all the way back to 2.2. I mentioned this on python-list and several people indicated that it should be an unnecessary restriction [1]. Someone else pointed out that docstrings also behave this way for method objects [2]. I want too see if it would be okay to make __doc__ writable for classes. I am not sure about for method objects, since I've never thought to do that, but it is analogous to class instances, where __doc__ is mutable and distinct from the class docstring. I just don't have any use cases that would dictate changing the docstring of the method object, the wrapped function, or neither when changing __doc__. Someone else on the thread indicated that perhaps docstrings should be inherited and got some support [3][4]. It makes sense to me for many, but not necessarily all, cases. However, if you don't want to inherit you can simply set an empty docstring. Docstrings impact help(), doctests, and some DSLs. I'm +1 on having __doc__ be inherited. Thanks, -eric [1] http://mail.python.org/pipermail/python-list/2011-June/1274079.html [2] http://mail.python.org/pipermail/python-list/2011-June/1274080.html [3] http://mail.python.org/pipermail/python-list/2011-June/1274099.html [4] http://mail.python.org/pipermail/python-list/2011-June/1274105.html From ncoghlan at gmail.com Fri Jun 10 03:05:08 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Fri, 10 Jun 2011 11:05:08 +1000 Subject: [Python-ideas] inheriting docstrings and mutable docstings for classes In-Reply-To: References: Message-ID: On Fri, Jun 10, 2011 at 9:54 AM, Eric Snow wrote: > I'm +1 on having __doc__ be inherited. -1. Subclasses are not the same thing as the original class so docstring inheritance should be requested explicitly. Agreed that docstrings should be writeable after the fact, though (e.g. functions already work that way - functools.wraps wouldn't work otherwise). Cheers, Nick. -- Nick Coghlan?? |?? ncoghlan at gmail.com?? |?? Brisbane, Australia From ericsnowcurrently at gmail.com Fri Jun 10 03:45:07 2011 From: ericsnowcurrently at gmail.com (Eric Snow) Date: Thu, 9 Jun 2011 19:45:07 -0600 Subject: [Python-ideas] inheriting docstrings and mutable docstings for classes In-Reply-To: References: Message-ID: On Thu, Jun 9, 2011 at 7:05 PM, Nick Coghlan wrote: > On Fri, Jun 10, 2011 at 9:54 AM, Eric Snow wrote: >> I'm +1 on having __doc__ be inherited. > > -1. Subclasses are not the same thing as the original class so > docstring inheritance should be requested explicitly. > Yeah, this one was mostly auxiliary to my main concern, __doc__ mutability for classes. Other than doctests and documentation/help(), I haven't used docstrings for much so the idea of it did not seem like a big deal. I certainly find myself inheriting docstrings from my abstract base classes explicitly all the time so that help() will show the info that is still applicable. 
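(For illustration, a rough sketch of that kind of explicit copying done with a class decorator; the inherit_docstrings name and the Base/Derived classes here are made up for the example and are not an existing API:)

    import inspect

    def inherit_docstrings(cls):
        # Hypothetical helper: copy docstrings from base classes onto
        # methods of *cls* that were defined without one.
        for name, attr in vars(cls).items():
            if inspect.isfunction(attr) and not attr.__doc__:
                for base in cls.__mro__[1:]:
                    base_attr = getattr(base, name, None)
                    base_doc = getattr(base_attr, '__doc__', None)
                    if base_doc:
                        attr.__doc__ = base_doc
                        break
        return cls

    class Base:
        def frob(self):
            "Frob the widget."

    @inherit_docstrings
    class Derived(Base):
        def frob(self):
            pass

    # help(Derived.frob) now shows "Frob the widget."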
> Agreed that docstrings should be writeable after the fact, though > (e.g. functions already work that way - functools.wraps wouldn't work > otherwise). > Would this be a very controversial change? I ask because it's been this way since 2.2 and no one's changed it. Thanks. -eric > Cheers, > Nick. > > -- > Nick Coghlan?? |?? ncoghlan at gmail.com?? |?? Brisbane, Australia > From greg.ewing at canterbury.ac.nz Fri Jun 10 09:04:32 2011 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Fri, 10 Jun 2011 19:04:32 +1200 Subject: [Python-ideas] inheriting docstrings and mutable docstings for classes In-Reply-To: References: Message-ID: <4DF1C200.9090405@canterbury.ac.nz> Nick Coghlan wrote: > -1. Subclasses are not the same thing as the original class so > docstring inheritance should be requested explicitly. The docstring of the class itself probably shouldn't be inherited automatically. But if you override a method without changing the API or user-visible behaviour, the inherited docstring still applies. Maybe the best thing would be for the inherited docstring to get put into a different property, such as __basedoc__. Then tools that examine docstrings can decide for themselves whether using inherited docstrings makes sense. -- Greg From ncoghlan at gmail.com Fri Jun 10 09:07:47 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Fri, 10 Jun 2011 17:07:47 +1000 Subject: [Python-ideas] inheriting docstrings and mutable docstings for classes In-Reply-To: <4DF1C200.9090405@canterbury.ac.nz> References: <4DF1C200.9090405@canterbury.ac.nz> Message-ID: On Fri, Jun 10, 2011 at 5:04 PM, Greg Ewing wrote: > Nick Coghlan wrote: >> >> -1. Subclasses are not the same thing as the original class so >> docstring inheritance should be requested explicitly. > > The docstring of the class itself probably shouldn't be > inherited automatically. But if you override a method > without changing the API or user-visible behaviour, the > inherited docstring still applies. I believe Eric created a class decorator recipe on the cookbook site that does exactly that for methods without their own docstrings. Cheers, Nick. -- Nick Coghlan?? |?? ncoghlan at gmail.com?? |?? Brisbane, Australia From ncoghlan at gmail.com Fri Jun 10 09:09:41 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Fri, 10 Jun 2011 17:09:41 +1000 Subject: [Python-ideas] inheriting docstrings and mutable docstings for classes In-Reply-To: References: Message-ID: On Fri, Jun 10, 2011 at 11:45 AM, Eric Snow wrote: >> Agreed that docstrings should be writeable after the fact, though >> (e.g. functions already work that way - functools.wraps wouldn't work >> otherwise). >> > > Would this be a very controversial change? ?I ask because it's been > this way since 2.2 and no one's changed it. ?Thanks. More likely a matter of nobody needing the functionality enough to question the behaviour (there are a variety of arbitrary restrictions still floating around that don't really have a *reason*, it's just that it was easier to do it that way initially and nobody has cared enough to change it). Cheers, Nick. -- Nick Coghlan?? |?? ncoghlan at gmail.com?? |?? Brisbane, Australia From arnodel at gmail.com Fri Jun 10 09:28:13 2011 From: arnodel at gmail.com (Arnaud Delobelle) Date: Fri, 10 Jun 2011 08:28:13 +0100 Subject: [Python-ideas] inheriting docstrings and mutable docstings for classes In-Reply-To: <4DF1C200.9090405@canterbury.ac.nz> References: <4DF1C200.9090405@canterbury.ac.nz> Message-ID: On 10 June 2011 08:04, Greg Ewing wrote: [...] 
> Maybe the best thing would be for the inherited docstring > to get put into a different property, such as __basedoc__. > Then tools that examine docstrings can decide for themselves > whether using inherited docstrings makes sense. Given that Python supports multiple inheritance, which parent class's __doc__ would the __basedoc__ contain? Also, what would the __basedoc__ contain if the parent's __doc__ is empty but not its __basedoc__? -- Arnaud From dstanek at dstanek.com Fri Jun 10 12:37:23 2011 From: dstanek at dstanek.com (David Stanek) Date: Fri, 10 Jun 2011 06:37:23 -0400 Subject: [Python-ideas] inheriting docstrings and mutable docstings for classes In-Reply-To: <4DF1C200.9090405@canterbury.ac.nz> References: <4DF1C200.9090405@canterbury.ac.nz> Message-ID: On Fri, Jun 10, 2011 at 3:04 AM, Greg Ewing wrote: > > Maybe the best thing would be for the inherited docstring > to get put into a different property, such as __basedoc__. > Then tools that examine docstrings can decide for themselves > whether using inherited docstrings makes sense. > > How would a tool know if the behavior of a method changed without analyzing the code? I think this could very easily lead to a situation where a project's generated documentation is incorrect. -- David blog: http://www.traceback.org twitter: http://twitter.com/dstanek www: http://dstanek.com -------------- next part -------------- An HTML attachment was scrubbed... URL: From steve at pearwood.info Fri Jun 10 13:22:15 2011 From: steve at pearwood.info (Steven D'Aprano) Date: Fri, 10 Jun 2011 21:22:15 +1000 Subject: [Python-ideas] inheriting docstrings and mutable docstings for classes In-Reply-To: References: <4DF1C200.9090405@canterbury.ac.nz> Message-ID: <4DF1FE67.5060005@pearwood.info> David Stanek wrote: > On Fri, Jun 10, 2011 at 3:04 AM, Greg Ewing wrote: > >> Maybe the best thing would be for the inherited docstring >> to get put into a different property, such as __basedoc__. >> Then tools that examine docstrings can decide for themselves >> whether using inherited docstrings makes sense. >> >> > How would a tool know if the behavior of a method changed without analyzing > the code? I think this could very easily lead to a situation where a > project's generated documentation is incorrect. That's the developer's problem, and no different from any other case where you inherit data without ensuring it is the correct data. class Parrot: colour = 'green' def speak(self): return "Polly wants a cracker." class NorwegianBlue(Parrot): def speak(self): return "I'm pining for the fjords." assert NorwegianBlue().colour == 'blue' # oops! -- Steven From dstanek at dstanek.com Fri Jun 10 13:40:48 2011 From: dstanek at dstanek.com (David Stanek) Date: Fri, 10 Jun 2011 07:40:48 -0400 Subject: [Python-ideas] inheriting docstrings and mutable docstings for classes In-Reply-To: <4DF1FE67.5060005@pearwood.info> References: <4DF1C200.9090405@canterbury.ac.nz> <4DF1FE67.5060005@pearwood.info> Message-ID: On Fri, Jun 10, 2011 at 7:22 AM, Steven D'Aprano wrote: > David Stanek wrote: >> >> How would a tool know if the behavior of a method changed without >> analyzing >> the code? I think this could very easily lead to a situation where a >> project's generated documentation is incorrect. >> > > > That's the developer's problem, and no different from any other case where > you inherit data without ensuring it is the correct data. > This means that a significant amount of existing code may/will have problems with this type of change. 
If Sphinx did this automatically I'm sure there would be lots of incorrect documentation. I think the developer should explicitly carry over and modify the documentation if necessary. -- David blog: http://www.traceback.org twitter: http://twitter.com/dstanek www: http://dstanek.com -------------- next part -------------- An HTML attachment was scrubbed... URL: From steve at pearwood.info Fri Jun 10 14:17:25 2011 From: steve at pearwood.info (Steven D'Aprano) Date: Fri, 10 Jun 2011 22:17:25 +1000 Subject: [Python-ideas] inheriting docstrings and mutable docstings for classes In-Reply-To: References: <4DF1C200.9090405@canterbury.ac.nz> <4DF1FE67.5060005@pearwood.info> Message-ID: <4DF20B55.2080504@pearwood.info> David Stanek wrote: > On Fri, Jun 10, 2011 at 7:22 AM, Steven D'Aprano wrote: > >> David Stanek wrote: >>> How would a tool know if the behavior of a method changed without >>> analyzing >>> the code? I think this could very easily lead to a situation where a >>> project's generated documentation is incorrect. >>> >> >> That's the developer's problem, and no different from any other case where >> you inherit data without ensuring it is the correct data. >> > > This means that a significant amount of existing code may/will have problems > with this type of change. If Sphinx did this automatically I'm sure there > would be lots of incorrect documentation. I think the developer should > explicitly carry over and modify the documentation if necessary. Oh, I'm sorry if I gave the impression that we should inherit docstrings by default. That wasn't my intention. I was merely suggesting that we shouldn't let the risk of misuse discourage us from a potential change. -- Steven From digitalxero at gmail.com Fri Jun 10 15:29:18 2011 From: digitalxero at gmail.com (Dj Gilcrease) Date: Fri, 10 Jun 2011 09:29:18 -0400 Subject: [Python-ideas] inheriting docstrings and mutable docstings for classes In-Reply-To: References: <4DF1C200.9090405@canterbury.ac.nz> Message-ID: On Fri, Jun 10, 2011 at 3:28 AM, Arnaud Delobelle wrote: > Given that Python supports multiple inheritance, which parent class's > __doc__ would the __basedoc__ contain? ?Also, what would the > __basedoc__ contain if the parent's __doc__ is empty but not its > __basedoc__? You would not need a __basedoc__ magic attribute, you can just do class C(object): pass docs = [c.__doc__ for c in C.__mro__] and you get the docs for all bases in the proper mro order Though I guess __basedocs__ mapping to [c.__doc__ for c in C.__mro__ if c != C] could be handy From ericsnowcurrently at gmail.com Fri Jun 10 20:40:40 2011 From: ericsnowcurrently at gmail.com (Eric Snow) Date: Fri, 10 Jun 2011 12:40:40 -0600 Subject: [Python-ideas] inheriting docstrings Message-ID: On Fri, Jun 10, 2011 at 7:29 AM, Dj Gilcrease wrote: > Though I guess __basedocs__ mapping to [c.__doc__ for c in C.__mro__ > if c != C] could be handy Or: next(c.__doc__ for c in C.__mro__[1:] if c.__doc__) or None Right now you could do something like this: def get_basedoc(mro): return next(c.__doc__ for c in mro[1:] if c.__doc__) or None class Meta(type): __basedoc__ = property(lambda cls: get_basedoc(cls.__mro__)) But then instances don't get __basedoc__, since the metaclass is not in the MRO. To get it on instances you could do this: class C: __basedoc__ = property(lambda self: get_basedoc(self.__class__.__mro__)) But then getting that attribute on the class will give you the property object and not the docstring. I'm not sure of a way to resolve that. 
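(To make that snag concrete -- a rough sketch at the interactive prompt, reusing the get_basedoc helper above; Base and C are just example names:)

    >>> class Base:
    ...     "Base docstring"
    ...
    >>> class C(Base):
    ...     __basedoc__ = property(lambda self: get_basedoc(self.__class__.__mro__))
    ...
    >>> C().__basedoc__    # instance access goes through the property
    'Base docstring'
    >>> C.__basedoc__      # class access just returns the property object itself
    <property object at 0x...>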
However, inheriting the docstring between classes is only part of the problem. You'll probably want to have the functions on a class also "inherit" their docstring from the first matching attribute of the bases on the MRO. In that case the function will not have enough information (the class isn't available to the function) to do the lookup. Instead, the class would have to do it. Here's an example: class BasedocMethod: def __init__(self, f, cls): self.f = f self.cls = cls def __getattribute__(self, name): if name == "__basedoc__": return object.__getattribute__(self, "__basedoc__") return getattr(f, name) @property def __basedoc__(self): for base in self.cls.__mro__: basefunc = base.__dict__.get(self.f.__name__) if not basefunc: continue return getattr(basefunc, "__doc__", None) return None class Meta(type): def __init__(cls, name, bases, namespace): for attrname, obj in namespace.items(): if isinstance(obj, FunctionType): setattr(cls, attrname, BasedocMethod(obj, cls)) Obviously not a perfect example, but hopefully demonstrates the idea. The whole point is that it would be nice to have a mechanism built in that makes it easy to inherit docstrings to functions, without necessarily implicitly inheriting them. The__basedoc__ idea works for that. My main motivation for this docstring inheritance is primarily for the case of abstract base classes. My implementations of those rarely have docstrings unique from that of the base class, nor do the methods. I can get around this now with metaclasses and class decorators, but it would be nice to have a little more help from the language. It would always be nice. :) -eric > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > From ericsnowcurrently at gmail.com Fri Jun 10 21:08:08 2011 From: ericsnowcurrently at gmail.com (Eric Snow) Date: Fri, 10 Jun 2011 13:08:08 -0600 Subject: [Python-ideas] inheriting docstrings In-Reply-To: References: Message-ID: On Fri, Jun 10, 2011 at 12:40 PM, Eric Snow wrote: > Right now you could do something like this: > > ? ?def get_basedoc(mro): > ? ? ? ?return next(c.__doc__ for c in mro[1:] if c.__doc__) or None > > ? ?class Meta(type): > ? ? ? ?__basedoc__ = property(lambda cls: get_basedoc(cls.__mro__)) > > But then instances don't get __basedoc__, since the metaclass is not > in the MRO. ?To get it on instances you could do this: > > ? ?class C: > ? ? ? ?__basedoc__ = property(lambda self: get_basedoc(self.__class__.__mro__)) > > But then getting that attribute on the class will give you the > property object and not the docstring. > > I'm not sure of a way to resolve that. Duh, someone just pointed out that you use your own descriptor instead of a property: class Basedoc: def __get__(self, obj, cls): ? ? ? ?return next(c.__doc__ for c in cls.__mro__[1:] if c.__doc__) or None class C: __basedoc__ = Basedoc() Inherit from the class or add it onto the class with a class decorator or metaclass. -eric From ericsnowcurrently at gmail.com Fri Jun 10 23:39:12 2011 From: ericsnowcurrently at gmail.com (Eric Snow) Date: Fri, 10 Jun 2011 15:39:12 -0600 Subject: [Python-ideas] inheriting docstrings In-Reply-To: References: Message-ID: On Fri, Jun 10, 2011 at 1:04 AM, Greg Ewing wrote: > Maybe the best thing would be for the inherited docstring > to get put into a different property, such as __basedoc__. > Then tools that examine docstrings can decide for themselves > whether using inherited docstrings makes sense. 
Another idea that I like, that someone suggested on python-list [1], is using the empty string to indicate that you want a docstring to be inherited. Here's an approximate implementation using a metaclass: class DocDescriptor: # as a non-data descriptor # but how to make it a data descriptor for the class? def __init__(self, docstring): self.docstring = docstring def __get__(self, obj, cls): if self.docstring != '': return self.docstring return next(c.__doc__ for c in cls.__mro__[1:] if c.__doc__) or '' class DocMethod: def __init__(self, f, cls): self.f = f self.cls = cls def __getattribute__(self, name): if name == '__doc__': return object.__getattribute__(self, '__doc__') f = object.__getattribute__(self, 'f') return getattr(f, name) @property def __doc__(self): f = object.__getattribute__(self, 'f') cls = object.__getattribute__(self, 'cls') if f.__doc__ != '': return f.__doc__ for base in cls.__mro__: basefunc = base.__dict__.get(self.f.__name__) if not basefunc: continue docstring = getattr(basefunc, '__doc__', None) if not docstring: continue return docstring return '' @__doc__.setter def __doc__(self, value): object.__getattribute__(self, 'f').__doc__ = value class Meta(type): def __init__(cls, name, bases, namespace): docstring = namespace.get('__doc__') cls.__doc__ = DocDescriptor(docstring) for attrname, obj in namespace.items(): if isinstance(obj, FunctionType): setattr(cls, attrname, DocMethod(obj, cls)) -eric [1] http://mail.python.org/pipermail/python-list/2011-June/1274123.html From zuo at chopin.edu.pl Sat Jun 11 13:02:02 2011 From: zuo at chopin.edu.pl (Jan Kaliszewski) Date: Sat, 11 Jun 2011 13:02:02 +0200 Subject: [Python-ideas] inheriting docstrings In-Reply-To: References: Message-ID: <20110611110202.GA2395@chopin.edu.pl> +1 from me for writable (not mutable of course) class __doc__ -1 from me for all that more or less implicit doc inheritance. Adding some decorator(s) to functools would be much better IMHO, e.g.: class MyDict(dict): @functools.basedoc(dict) def __setitem__(self, key, value): super(dict, self).__setitem__(key, value) ... or: @functools.superdocs # for methods without docstrings class MyDict(dict): def __setitem__(self, key, value): super(dict, self).__setitem__(key, value) ... Cheers. *j From zuo at chopin.edu.pl Sat Jun 11 13:59:23 2011 From: zuo at chopin.edu.pl (Jan Kaliszewski) Date: Sat, 11 Jun 2011 13:59:23 +0200 Subject: [Python-ideas] inheriting docstrings In-Reply-To: <20110611110202.GA2395@chopin.edu.pl> References: <20110611110202.GA2395@chopin.edu.pl> Message-ID: <20110611115923.GB2395@chopin.edu.pl> Jan Kaliszewski dixit (2011-06-11, 13:02): > +1 from me for writable (not mutable of course) class __doc__ > > -1 from me for all that more or less implicit doc inheritance. > > Adding some decorator(s) to functools would be much better IMHO, e.g.: > > class MyDict(dict): > > @functools.basedoc(dict) > def __setitem__(self, key, value): > super(dict, self).__setitem__(key, value) > ... > > or: > > @functools.superdocs # for methods without docstrings > class MyDict(dict): > > def __setitem__(self, key, value): > super(dict, self).__setitem__(key, value) > ... Sorry, s/ super(dict/ super(MyDict/ of course. 
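(For concreteness, a rough sketch of how the first decorator suggested above might behave. functools.basedoc is only a proposed name -- nothing like it exists in functools today -- so the sketch defines a stand-alone basedoc:)

    def basedoc(base):
        # Hypothetical decorator factory (a sketch of the proposal above):
        # copy the docstring of the same-named attribute of *base* onto the
        # decorated function, unless it already has its own docstring.
        def decorator(func):
            if not func.__doc__:
                base_attr = getattr(base, func.__name__, None)
                func.__doc__ = getattr(base_attr, '__doc__', None)
            return func
        return decorator

    class MyDict(dict):

        @basedoc(dict)
        def __setitem__(self, key, value):
            super(MyDict, self).__setitem__(key, value)

    # help(MyDict.__setitem__) now shows dict.__setitem__'s docstring.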
From zuo at chopin.edu.pl Sat Jun 11 14:09:07 2011
From: zuo at chopin.edu.pl (Jan Kaliszewski)
Date: Sat, 11 Jun 2011 14:09:07 +0200
Subject: [Python-ideas] inheriting docstrings
In-Reply-To: <20110611110202.GA2395@chopin.edu.pl>
References: <20110611110202.GA2395@chopin.edu.pl>
Message-ID: <20110611120907.GC2395@chopin.edu.pl>

Probably better names...

> class MyDict(dict):
>
>     @functools.basedoc(dict)
>     def __setitem__(self, key, value):
>         super(dict, self).__setitem__(key, value)
>     ...

'docfrom' instead of 'basedoc' or maybe: 'inheritingdoc' or 'derivdoc'?

> @functools.superdocs  # for methods without docstrings
> class MyDict(dict):
>
>     def __setitem__(self, key, value):
>         super(dict, self).__setitem__(key, value)
>     ...

'docfromsuper' instead of 'superdocs'

Cheers.
*j

From zuo at chopin.edu.pl Sat Jun 11 15:30:29 2011
From: zuo at chopin.edu.pl (Jan Kaliszewski)
Date: Sat, 11 Jun 2011 15:30:29 +0200
Subject: [Python-ideas] 'Injecting' objects as function-local constants
Message-ID: <20110611133028.GD2395@chopin.edu.pl>

== Use cases ==

A quite common practice is 'injecting' objects into a function as its
locals, at def-time, using function arguments with default values...

Sometimes to keep state using a mutable container:

    def do_and_remember(val, verbose=False, mem=collections.Counter()):
        result = do_something(val)
        mem[val] += 1
        if verbose:
            print('Done {} times for {!r}'.format(mem[val], val))

Sometimes, when creating functions dynamically (making use of nested
scopes), e.g. to keep some individual function features (usable within
those functions):

    def make_my_callbacks(callback_params):
        my_callbacks = []
        for params in callback_params:
            def fun1(*args, _params=params, **kwargs):
                "...do something with args and params..."
            def fun2(*args, _params=params, **kwargs):
                "...do something with args and params..."
            def fun3(*args, _fun1=fun1, _fun2=fun2, **kwargs):
                """...do something with args and with functions fun1, fun2,
                for example pass them as callbacks to other functions..."""
            my_callbacks.append((fun1, fun2, fun3))
        return my_callbacks

Sometimes simply to make critical parts of code optimised...

    def do_it_quickly(fields, _len=len, _split=str.split,
                      _sth=something):
        return [(_len(f), _split(f), _sth(f)) for f in fields]

...or even for readability -- keeping function-specific constants within
the function definition:

    def check_value(val,
                    VAL_REGEX=re.compile('^...$'),
                    VAL_MAX_LEN=38):
        return len(val) <= VAL_MAX_LEN and VAL_REGEX.search(val) is not None

In all those cases (and probably some others too) this technique appears
to be quite useful.

== The problem ==

...is that it is not very elegant. We add arguments which:
a) mess up function signatures (both in the code and in auto-generated docs);
b) can be incidentally overridden (especially when a function has an "open"
   signature with **kwargs).

== Proposed solutions ==

I see three possibilities:

1.
To add a new keyword, e.g. `inject':

    def do_and_remember(val, verbose=False):
        inject mem = collections.Counter()
        ...

or maybe:

    def do_and_remember(val, verbose=False):
        inject collections.Counter() as mem
        ...

2. (which personally I would prefer)
To add `dummy' (or `hidden') keyword arguments, defined after **kwargs
(and after bare ** if kwargs are not needed; we already have
keyword-only arguments after *args or bare *):

    def do_and_remember(val, verbose=False, **, mem=collections.Counter()):
        ...

do_and_remember(val, False, mem='something') would raise TypeError and
`mem' should not appear in help() etc. as a function argument.

3.
To provide a special decorator, e.g. functools.within: @functools.within(mem=collections.Counter()) def do_and_remember(val, verbose=False): ... Regards. *j From g.rodola at gmail.com Sat Jun 11 20:51:56 2011 From: g.rodola at gmail.com (=?ISO-8859-1?Q?Giampaolo_Rodol=E0?=) Date: Sat, 11 Jun 2011 20:51:56 +0200 Subject: [Python-ideas] Adding shutil.disk_usage() Message-ID: Hi all, I've just implemented this functionality in psutil for both POSIX and Windows and thought it might be nice to have it in shutil module as well since it's useful when doing system monitoring: http://code.google.com/p/psutil/issues/detail?id=172 The posix implementation is nothing but a wrapper around os.statvfs(): def disk_usage(path): """Return disk usage associated with path.""" st = os.statvfs(path) free = (st.f_bavail * st.f_frsize) total = (st.f_blocks * st.f_frsize) used = (st.f_blocks - st.f_bfree) * st.f_frsize percent = (float(used) / total) * 100 # NB: the percentage is -5% than what shown by df due to # reserved blocks that we are currently not considering: # http://goo.gl/sWGbH return ntuple_diskinfo(total, used, free, round(percent, 1)) ...and reflects what returned by "df /somepath". The Windows implementation requires GetDiskFreeSpaceEx() which is not exposed in python stdlib but can be added as a privade module (Modules/_winutil.c maybe?) or retrieved via ctypes. Thoughts? --- Giampaolo http://code.google.com/p/pyftpdlib http://code.google.com/p/psutil From tjreedy at udel.edu Sat Jun 11 22:09:17 2011 From: tjreedy at udel.edu (Terry Reedy) Date: Sat, 11 Jun 2011 16:09:17 -0400 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: <20110611133028.GD2395@chopin.edu.pl> References: <20110611133028.GD2395@chopin.edu.pl> Message-ID: On 6/11/2011 9:30 AM, Jan Kaliszewski wrote: > == Use cases == > > A quite common practice is 'injecting' objects into a function as its > locals, at def-time, using function arguments with default values... > > Sometimes to keep state using a mutable container: > > def do_and_remember(val, verbose=False, mem=collections.Counter()): > result = do_something(val) > mem[val] += 1 > if verbose: > print('Done {} times for {!r}'.format(mem[val], val)) > > Sometimes, when creating functions dynamically (making use of nested > scopes), e.g. to keep some individual function features (usable within > that functions): > > def make_my_callbacks(callback_params): > my_callbacks = [] > for params in callback_params: > def fun1(*args, _params=params, **kwargs): > "...do something with args and params..." > def fun2(*args, _params=params, **kwargs): > "...do something with args and params..." > def fun3(*args, _fun1=fun1, _fun2=fun2, **kwargs): > """...do something with args and with functions fun1, fun2, > for example pass them as callbacks to other functions..." > my_callbacks.append((fun1, fun2, fun3)) > return my_callbacks > > Sometimes simply to make critical parts of code optimised... > > def do_it_quickly(fields, _len=len, _split=str.split, > _sth=something): > return [_len(f), _split(f), _sth(f) for f in fields] > > ...or even for readability -- keeping function-specific constants within > the function definition: > > def check_value(val, > VAL_REGEX=re.compile('^...$'), > VAL_MAX_LEN=38): > return len(val)<= VAL_MAX_LEN and VAL_RE.search(val) is not None > > In all that cases (and probably some other too) that technique appears > to be quite useful. > > > == The problem == > > ...is that it is not very elegant. 
We add arguments which: > a) mess up function signatures (both in the code and in auto-generated docs); > b) can be incidentally overriden (especially when a function has an "open" > signature with **kwargs). One problem with trying to 'fix' this is that there can be defaulted args which are not intended to be overwritten by users but which are intended to be replaced in recursive calls. > == Proposed solutions == > > I see three possibilities: > > 1. > To add a new keyword, e.g. `inject': > def do_and_remember(val, verbose=False): > inject mem = collections.Counter() > ... > or maybe: > def do_and_remember(val, verbose=False): > inject collections.Counter() as mem The body should all be runtime. Deftime expression should be in the header. > 2. (which personally I would prefer) > To add `dummy' (or `hidden') keyword arguments, defined after **kwargs > (and after bare ** if kwargs are not needed; we have already have > keyword-only arguments after *args or bare *): > > def do_and_remember(val, verbose=False, **, mem=collections.Counter()): > ... I thought of this while reading 'the problem'. It is at least plausible to me. > do_and_remember(val, False, mem='something') would raise TypeError and > `mem' shoudn not appear in help() etc. as a function argument. > > 3. > To provide a special decorator, e.g. functools.within: > @functools.within(mem=collections.Counter()) > def do_and_remember(val, verbose=False): > ... The decorator would have to modify the code object as well as the function objects, probably in ways not currently allowed. -- Terry Jan Reedy From arnodel at gmail.com Sat Jun 11 22:47:59 2011 From: arnodel at gmail.com (Arnaud Delobelle) Date: Sat, 11 Jun 2011 21:47:59 +0100 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: <20110611133028.GD2395@chopin.edu.pl> References: <20110611133028.GD2395@chopin.edu.pl> Message-ID: <4BDDEB96-1EBD-477C-BA08-7750AB42B579@gmail.com> On 11 Jun 2011, at 14:30, Jan Kaliszewski wrote: > == Use cases == > > A quite common practice is 'injecting' objects into a function as its > locals, at def-time, using function arguments with default values... > > Sometimes to keep state using a mutable container: > > def do_and_remember(val, verbose=False, mem=collections.Counter()): > result = do_something(val) > mem[val] += 1 > if verbose: > print('Done {} times for {!r}'.format(mem[val], val)) > > Sometimes, when creating functions dynamically (making use of nested > scopes), e.g. to keep some individual function features (usable within > that functions): > > def make_my_callbacks(callback_params): > my_callbacks = [] > for params in callback_params: > def fun1(*args, _params=params, **kwargs): > "...do something with args and params..." > def fun2(*args, _params=params, **kwargs): > "...do something with args and params..." > def fun3(*args, _fun1=fun1, _fun2=fun2, **kwargs): > """...do something with args and with functions fun1, fun2, > for example pass them as callbacks to other functions..." > my_callbacks.append((fun1, fun2, fun3)) > return my_callbacks > > Sometimes simply to make critical parts of code optimised... 
> > def do_it_quickly(fields, _len=len, _split=str.split, > _sth=something): > return [_len(f), _split(f), _sth(f) for f in fields] > > ...or even for readability -- keeping function-specific constants within > the function definition: > > def check_value(val, > VAL_REGEX=re.compile('^...$'), > VAL_MAX_LEN=38): > return len(val) <= VAL_MAX_LEN and VAL_RE.search(val) is not None > > In all that cases (and probably some other too) that technique appears > to be quite useful. > > > == The problem == > > ...is that it is not very elegant. We add arguments which: > a) mess up function signatures (both in the code and in auto-generated docs); > b) can be incidentally overriden (especially when a function has an "open" > signature with **kwargs). > > > == Proposed solutions == > > I see three possibilities: > > 1. > To add a new keyword, e.g. `inject': > def do_and_remember(val, verbose=False): > inject mem = collections.Counter() > ... > or maybe: > def do_and_remember(val, verbose=False): > inject collections.Counter() as mem > ... > > 2. (which personally I would prefer) > To add `dummy' (or `hidden') keyword arguments, defined after **kwargs > (and after bare ** if kwargs are not needed; we have already have > keyword-only arguments after *args or bare *): > > def do_and_remember(val, verbose=False, **, mem=collections.Counter()): > ... > > do_and_remember(val, False, mem='something') would raise TypeError and > `mem' shoudn not appear in help() etc. as a function argument. > > 3. > To provide a special decorator, e.g. functools.within: > @functools.within(mem=collections.Counter()) > def do_and_remember(val, verbose=False): > ... That's hard to do as (assuming the function is defined at the global scope), mem will be compiled as a global, meaning that you will have to modify the bytecode. Oh but this makes me think about something I wrote a while ago (see below). 4. Use closures. def factory(mem): def do_and_remember(val, verbose=False) result = do_something(val) mem[val] += 1 if verbose: print('Done {} times for {!r}'.format(mem[val], val)) .... return do_and_remember do_and_remember = factory(mem=collections.Counter()) Added bonus: you can create many instances of do_and_remember. ---------- Related to this, here's a "localize" decorator that I wrote some time ago for fun (I think it was from a discussion on this list). It was for python 2.x (could easily be modified for 3.x I think, it's a matter of adapting the attribute names of the function object). It "freezes" all non local variables in the function. It's a hack! It may be possible to adapt it. def new_closure(vals): args = ','.join('x%i' % i for i in range(len(vals))) f = eval("lambda %s:lambda:(%s)" % (args, args)) return f(*vals).func_closure def localize(f): f_globals = dict((n, f.func_globals[n]) for n in f.func_code.co_names) f_closure = ( f.func_closure and new_closure([c.cell_contents for c in f.func_closure]) ) return type(f)(f.func_code, f_globals, f.func_name, f.func_defaults, f_closure) # Examples of how localize works: x, y = 1, 2 @localize def f(): return x + y def test(): acc = [] for i in range(10): @localize def pr(): print i acc.append(pr) return acc def lambdatest(): return [localize(lambda: i) for i in range(10)] # These examples will behave as follows: >>> f() 3 >>> x = 3 >>> f() 3 >>> pr = test() >>> pr[0]() 0 >>> pr[5]() 5 >>> l = lambdatest() >>> l[2]() 2 >>> l[7]() 7 >>> -- Arnaud From greg at krypto.org Sun Jun 12 02:15:08 2011 From: greg at krypto.org (Gregory P. 
Smith) Date: Sat, 11 Jun 2011 17:15:08 -0700 Subject: [Python-ideas] Adding shutil.disk_usage() In-Reply-To: References: Message-ID: On Sat, Jun 11, 2011 at 11:51 AM, Giampaolo Rodol? wrote: > Hi all, > I've just implemented this functionality in psutil for both POSIX and > Windows and thought it might be nice to have it in shutil module as > well since it's useful when doing system monitoring: > http://code.google.com/p/psutil/issues/detail?id=172 > > The posix implementation is nothing but a wrapper around os.statvfs(): > > def disk_usage(path): > """Return disk usage associated with path.""" > st = os.statvfs(path) > free = (st.f_bavail * st.f_frsize) > total = (st.f_blocks * st.f_frsize) > used = (st.f_blocks - st.f_bfree) * st.f_frsize > percent = (float(used) / total) * 100 > # NB: the percentage is -5% than what shown by df due to > # reserved blocks that we are currently not considering: > # http://goo.gl/sWGbH > return ntuple_diskinfo(total, used, free, round(percent, 1)) > > ...and reflects what returned by "df /somepath". > The Windows implementation requires GetDiskFreeSpaceEx() which is not > exposed in python stdlib but can be added as a privade module > (Modules/_winutil.c maybe?) or retrieved via ctypes. > > Thoughts? > Makes sense to me. Though I would personally leave the percent calculation up to the caller or at least leave the rounding to the caller. I'll leave opinion on which implementation to use on windows up to someone more familiar with that platform. Attach your patch(es) implementing it to a feature request issue on bugs.python.org. -gps -------------- next part -------------- An HTML attachment was scrubbed... URL: From brian.curtin at gmail.com Sun Jun 12 03:40:38 2011 From: brian.curtin at gmail.com (Brian Curtin) Date: Sat, 11 Jun 2011 20:40:38 -0500 Subject: [Python-ideas] Adding shutil.disk_usage() In-Reply-To: References: Message-ID: On Sat, Jun 11, 2011 at 13:51, Giampaolo Rodol? wrote: > Hi all, > I've just implemented this functionality in psutil for both POSIX and > Windows and thought it might be nice to have it in shutil module as > well since it's useful when doing system monitoring: > http://code.google.com/p/psutil/issues/detail?id=172 > > The posix implementation is nothing but a wrapper around os.statvfs(): > > def disk_usage(path): > """Return disk usage associated with path.""" > st = os.statvfs(path) > free = (st.f_bavail * st.f_frsize) > total = (st.f_blocks * st.f_frsize) > used = (st.f_blocks - st.f_bfree) * st.f_frsize > percent = (float(used) / total) * 100 > # NB: the percentage is -5% than what shown by df due to > # reserved blocks that we are currently not considering: > # http://goo.gl/sWGbH > return ntuple_diskinfo(total, used, free, round(percent, 1)) > > ...and reflects what returned by "df /somepath". > The Windows implementation requires GetDiskFreeSpaceEx() which is not > exposed in python stdlib but can be added as a privade module > (Modules/_winutil.c maybe?) or retrieved via ctypes. The GetDiskFreeSpaceEx call should just be exposed within Modules/posixmodule.c. See the posix__getfinalpathname function for an example. -------------- next part -------------- An HTML attachment was scrubbed... URL: From efotinis at yahoo.com Sun Jun 12 08:23:41 2011 From: efotinis at yahoo.com (Elias Fotinis) Date: Sun, 12 Jun 2011 09:23:41 +0300 Subject: [Python-ideas] Adding shutil.disk_usage() References: Message-ID: On Sat, 11 Jun 2011 21:51:56 +0300, Giampaolo Rodol? 
wrote: > The posix implementation is nothing but a wrapper around os.statvfs(): [...] > The Windows implementation requires GetDiskFreeSpaceEx() [...] I'd suggest mentioning in the documentation that symbolic links are resolved for the supplied path. I know that's fact for GetDiskFreeSpaceEx() and a quick look suggests it's also the case for statvfs(). It would be nice to be aware of this and the fact that it works the same on both platforms. From ncoghlan at gmail.com Sun Jun 12 17:44:31 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Mon, 13 Jun 2011 01:44:31 +1000 Subject: [Python-ideas] PEP 3150 (statement local namespaces) updated based on April discussion Message-ID: The rationale and proposed semantics sections of PEP 3150 have been heavily modified based on the various discussions on the topic back in April. http://www.python.org/dev/peps/pep-3150/ (Note that the PEP is still officially Deferred - this entire idea still intrigues me, but I'm also still not convinced it's worth the additional language complexity). Cheers, Nick. -- Nick Coghlan?? |?? ncoghlan at gmail.com?? |?? Brisbane, Australia From zuo at chopin.edu.pl Sun Jun 12 18:26:29 2011 From: zuo at chopin.edu.pl (Jan Kaliszewski) Date: Sun, 12 Jun 2011 18:26:29 +0200 Subject: [Python-ideas] PEP 3150 (statement local namespaces) updated based on April discussion In-Reply-To: References: Message-ID: <20110612162629.GB4263@chopin.edu.pl> Nick Coghlan dixit (2011-06-13, 01:44): > The rationale and proposed semantics sections of PEP 3150 have been > heavily modified based on the various discussions on the topic back in > April. > > http://www.python.org/dev/peps/pep-3150/ > > (Note that the PEP is still officially Deferred - this entire idea > still intrigues me, but I'm also still not convinced it's worth the > additional language complexity). Still +3 from me for PEP 3150 :) *j From tjreedy at udel.edu Sun Jun 12 23:20:03 2011 From: tjreedy at udel.edu (Terry Reedy) Date: Sun, 12 Jun 2011 17:20:03 -0400 Subject: [Python-ideas] PEP 3150 (statement local namespaces) updated based on April discussion In-Reply-To: References: Message-ID: On 6/12/2011 11:44 AM, Nick Coghlan wrote: > The rationale and proposed semantics sections of PEP 3150 have been > heavily modified based on the various discussions on the topic back in > April. > > http://www.python.org/dev/peps/pep-3150/ > > (Note that the PEP is still officially Deferred - this entire idea > still intrigues me, but I'm also still not convinced it's worth the > additional language complexity). I tried to read this with an open mind, but I still strongly dislike it. I think it would make Python harder to learn and read and would lead to more confusion and questions on python-list. I think having a different rule for function compilation makes the proposal even worse, as people would come to expect early binding (outside the given context) even more than now. If one does not like default args, one can either use class instances or closures. The latter were explicitly introduced as an alternative to using default args. Do we really need a fourth solution to the same problem? I do not see comprehensions, which are *expressions*, as a precedent for out-of-order *statements*. Nested expressions are not left-to-right either, nor are assignments: "a[3] = b*f(c+d)". 
'Given' or 'where' constructs are sometimes used in mathematical writings, especially formula exposition, but they typically, if not always, are isolated (like the examples in the PEP), and not part of a code sequence. So this is notreally a precedent to me. Example: fv = p * (1 + i/12)**t... , where fv = present value p = principle i = nominal annual interest rate t = time in months -(1+whatever) -- Terry Jan Reedy From zuo at chopin.edu.pl Mon Jun 13 00:22:36 2011 From: zuo at chopin.edu.pl (Jan Kaliszewski) Date: Mon, 13 Jun 2011 00:22:36 +0200 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: <4BDDEB96-1EBD-477C-BA08-7750AB42B579@gmail.com> References: <20110611133028.GD2395@chopin.edu.pl> <4BDDEB96-1EBD-477C-BA08-7750AB42B579@gmail.com> Message-ID: <20110612222236.GA2540@chopin.edu.pl> Terry Reedy dixit (2011-06-11, 16:09): > On 6/11/2011 9:30 AM, Jan Kaliszewski wrote: [...] > >In all that cases (and probably some other too) that technique appears > >to be quite useful. > > > > > >== The problem == > > > >...is that it is not very elegant. We add arguments which: > >a) mess up function signatures (both in the code and in auto-generated docs); > >b) can be incidentally overriden (especially when a function has an "open" > > signature with **kwargs). > > One problem with trying to 'fix' this is that there can be defaulted args > which are not intended to be overwritten by users but which are intended to > be replaced in recursive calls. I think this is another case... Although I can imagine that such 'private' arguments could be specified when calling -- after **{...}/bare **, e.g.: fun(1, b=3, **{'c':3}, my_secret_hidden_arg='xyz') fun(1, b=3, **, my_secret_hidden_arg='xyz') Though at the first sight I don't like this (`after-** args in calls') idea so much (contrary to `after-** args in definitions' idea). [...] > >2. (which personally I would prefer) > >To add `dummy' (or `hidden') keyword arguments, defined after **kwargs > >(and after bare ** if kwargs are not needed; we have already have > >keyword-only arguments after *args or bare *): > > > > def do_and_remember(val, verbose=False, **, mem=collections.Counter()): > > ... > > I thought of this while reading 'the problem'. It is at least > plausible to me. [...] Arnaud Delobelle dixit (2011-06-11, 21:47): > On 11 Jun 2011, at 14:30, Jan Kaliszewski wrote: [...] > > 3. > > To provide a special decorator, e.g. functools.within: > > @functools.within(mem=collections.Counter()) > > def do_and_remember(val, verbose=False): > > ... > > That's hard to do as (assuming the function is defined at the global > scope), mem will be compiled as a global, meaning that you will have Here mem is a keyword argument, not a variable. Though I understand that making it local/closure would need some code/closures hacking... Unless built in to the interpreter. > to modify the bytecode. Oh but this makes me think about something I > wrote a while ago (see below). > > > 4. Use closures. > > def factory(mem): > def do_and_remember(val, verbose=False) > result = do_something(val) > mem[val] += 1 > if verbose: > print('Done {} times for {!r}'.format(mem[val], val)) .... > return do_and_remember > do_and_remember = factory(mem=collections.Counter()) > > Added bonus: you can create many instances of do_and_remember. Yes, but this method makes code longer and more complex. 
And simple is better :) Consider my multi-factory example: def make_my_callbacks(callback_params): my_callbacks = [] for params in callback_params: def fun1(*args, **kwargs, params=params): "...do something with args and params..." def fun2(*args, **kwargs, params=params): "...do something with args and params..." def fun3(*args, **kwargs, fun1=fun1, fun2=fun2): """...do something with args and with functions fun1, fun2, for example pass them as callbacks to other functions..." my_callbacks.append((fun1, fun2, fun3)) return my_callbacks ...compared to: def make_fun1(params): def fun1(*args, **kwargs): "...do something with args and params..." return fun1 def make_fun2(params): def fun2(*args, **kwargs): "...do something with args and params..." return fun2 def make_fun3(fun1, fun2): def fun3(*args, **kwargs): """...do something with args and with functions fun1, fun2, for example pass them as callbacks to other functions..." return fun3 def make_my_callbacks(callback_params): my_callbacks = [] for params in callback_params: fun1 = make_fun1(params) fun2 = make_fun2(params) fun3 = make_fun3(fun1, fun2) my_callbacks.append((fun1, fun2, fun3)) return my_callbacks Though, maybe it'a a matter of individual taste... > Related to this, here's a "localize" decorator that I wrote some time > ago for fun (I think it was from a discussion on this list). It was > for python 2.x (could easily be modified for 3.x I think, it's a > matter of adapting the attribute names of the function object). It > "freezes" all non local variables in the function. It's a hack! It > may be possible to adapt it. > > def new_closure(vals): > args = ','.join('x%i' % i for i in range(len(vals))) > f = eval("lambda %s:lambda:(%s)" % (args, args)) > return f(*vals).func_closure > > def localize(f): > f_globals = dict((n, f.func_globals[n]) for n in f.func_code.co_names) > f_closure = ( f.func_closure and > new_closure([c.cell_contents for c in f.func_closure]) ) > return type(f)(f.func_code, f_globals, f.func_name, > f.func_defaults, f_closure) Nice :) (and, as far as I understand, it could be used to implement the decorator I ment). Best regards. *j From greg.ewing at canterbury.ac.nz Mon Jun 13 00:30:09 2011 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Mon, 13 Jun 2011 10:30:09 +1200 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: <20110612222236.GA2540@chopin.edu.pl> References: <20110611133028.GD2395@chopin.edu.pl> <4BDDEB96-1EBD-477C-BA08-7750AB42B579@gmail.com> <20110612222236.GA2540@chopin.edu.pl> Message-ID: <4DF53DF1.9060804@canterbury.ac.nz> I'm -1 on any proposal that somehow tries to make the default-argument hack more acceptable. The main reason people still feel the need to use it is that the for-loop is broken, insofar as it doesn't create a new binding for each iteration. The right way to address that is to fix the for-loop, IMO. -- Greg From zuo at chopin.edu.pl Mon Jun 13 02:03:32 2011 From: zuo at chopin.edu.pl (Jan Kaliszewski) Date: Mon, 13 Jun 2011 02:03:32 +0200 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: <4DF53DF1.9060804@canterbury.ac.nz> References: <20110611133028.GD2395@chopin.edu.pl> <4BDDEB96-1EBD-477C-BA08-7750AB42B579@gmail.com> <20110612222236.GA2540@chopin.edu.pl> <4DF53DF1.9060804@canterbury.ac.nz> Message-ID: <20110613000332.GA5821@chopin.edu.pl> Greg Ewing dixit (2011-06-13, 10:30): > I'm -1 on any proposal that somehow tries to make the > default-argument hack more acceptable. 
My propositions don't make that hack less acceptable -- proposing an alternative. > The main reason people still feel the need to use it > is that the for-loop is broken, insofar as it doesn't > create a new binding for each iteration. > > The right way to address that is to fix the for-loop, > IMO. Do you mean that each iteration should create separate local scope? Then: j = 0 my_lambdas = [] for i in range(10): print(j) # would raise UnboundLocalError j = i my_lambdas.append(lambda: i) Or that the loop variable should be treated specially? Then: i_lambdas, j_lambdas = [], [] for i in range(10): j = i i_lambdas.append(lambda: i) j_lambdas.append(lambda: j) print(i_lambdas[2]()) # would print 2 print(j_lambdas[2]()) # would print 9 Cheers. *j From zuo at chopin.edu.pl Mon Jun 13 02:21:36 2011 From: zuo at chopin.edu.pl (Jan Kaliszewski) Date: Mon, 13 Jun 2011 02:21:36 +0200 Subject: [Python-ideas] PEP 3150 (statement local namespaces) updated based on April discussion In-Reply-To: References: Message-ID: <20110613002136.GB5821@chopin.edu.pl> There are some bugs in the code example in the PEP's section "Detailed Semantics #1: Early Binding of Variable References": There is: assert seq == ... ...and should be: assert map(lambda x: x(), seq) == ... (3 times) There is: def f(_i=i): return i ...and should be: def f(_i=i): return _i Cheers. *j From zuo at chopin.edu.pl Mon Jun 13 02:22:49 2011 From: zuo at chopin.edu.pl (Jan Kaliszewski) Date: Mon, 13 Jun 2011 02:22:49 +0200 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: <20110613000332.GA5821@chopin.edu.pl> References: <20110611133028.GD2395@chopin.edu.pl> <4BDDEB96-1EBD-477C-BA08-7750AB42B579@gmail.com> <20110612222236.GA2540@chopin.edu.pl> <4DF53DF1.9060804@canterbury.ac.nz> <20110613000332.GA5821@chopin.edu.pl> Message-ID: <20110613002249.GA6020@chopin.edu.pl> Jan Kaliszewski dixit (2011-06-13, 02:03): > My propositions don't make that hack less acceptable -- proposing an > alternative. Sorry, should be: My propositions don't make that hack more acceptable -- in fact they make it less acceptable, proposing an alternative. *j From ncoghlan at gmail.com Mon Jun 13 06:52:45 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Mon, 13 Jun 2011 14:52:45 +1000 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: <4DF53DF1.9060804@canterbury.ac.nz> References: <20110611133028.GD2395@chopin.edu.pl> <4BDDEB96-1EBD-477C-BA08-7750AB42B579@gmail.com> <20110612222236.GA2540@chopin.edu.pl> <4DF53DF1.9060804@canterbury.ac.nz> Message-ID: On Mon, Jun 13, 2011 at 8:30 AM, Greg Ewing wrote: > I'm -1 on any proposal that somehow tries to make the > default-argument hack more acceptable. > > The main reason people still feel the need to use it > is that the for-loop is broken, insofar as it doesn't > create a new binding for each iteration. > > The right way to address that is to fix the for-loop, > IMO. Yikes, now *there's* a radical proposal. -lots on any idea that would make: def f(): i = 0 def g1(): return i i = 1 def g2(): return i return [g1, g2] differ in external behaviour from: def f(): result = [] for i in range(2): def g(): return i result.append(g) return result or: def f(): return [lambda: i for i in range(2)] or: def _inner(): for i in range(2): def g(): return i yield g def f(): return list(_inner()) Cheers, Nick. -- Nick Coghlan?? |?? ncoghlan at gmail.com?? |?? 
Brisbane, Australia From ncoghlan at gmail.com Mon Jun 13 07:07:54 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Mon, 13 Jun 2011 15:07:54 +1000 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: <20110611133028.GD2395@chopin.edu.pl> References: <20110611133028.GD2395@chopin.edu.pl> Message-ID: On Sat, Jun 11, 2011 at 11:30 PM, Jan Kaliszewski wrote: > 1. > To add a new keyword, e.g. `inject': > ? ?def do_and_remember(val, verbose=False): > ? ? ? ?inject mem = collections.Counter() > ? ? ? ?... > or maybe: > ? ?def do_and_remember(val, verbose=False): > ? ? ? ?inject collections.Counter() as mem > ? ? ? ?... This particular alternative to the default argument hack has come up before, as has the "hidden parameters after '**'" approach. (I thought there was a PEP on this, but I can't find anything other than the reference in the description of Option 4 in PEP 3103 - however, there is a thread on the topic that starts as part of the PEP 3103 discussion at http://mail.python.org/pipermail/python-dev/2006-June/066603.html) Institutional-memory'ly yours, Nick. -- Nick Coghlan?? |?? ncoghlan at gmail.com?? |?? Brisbane, Australia From tjreedy at udel.edu Mon Jun 13 08:27:53 2011 From: tjreedy at udel.edu (Terry Reedy) Date: Mon, 13 Jun 2011 02:27:53 -0400 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: <4DF53DF1.9060804@canterbury.ac.nz> References: <20110611133028.GD2395@chopin.edu.pl> <4BDDEB96-1EBD-477C-BA08-7750AB42B579@gmail.com> <20110612222236.GA2540@chopin.edu.pl> <4DF53DF1.9060804@canterbury.ac.nz> Message-ID: On 6/12/2011 6:30 PM, Greg Ewing wrote: > I'm -1 on any proposal that somehow tries to make the > default-argument hack more acceptable. > > The main reason people still feel the need to use it > is that the for-loop is broken, insofar as it doesn't > create a new binding for each iteration. > > The right way to address that is to fix the for-loop, Or use closures, which were partly designed to replace default arg use. This case is quite different from the multiple capture in for-loop case. The OP is simply trying to localize names for speed instead of using module constants, which would otherwise do quite fine and are routinely used in the stdlib. -- Terry Jan Reedy From steve at pearwood.info Mon Jun 13 08:23:38 2011 From: steve at pearwood.info (Steven D'Aprano) Date: Mon, 13 Jun 2011 16:23:38 +1000 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: References: <20110611133028.GD2395@chopin.edu.pl> Message-ID: <4DF5ACEA.7040603@pearwood.info> Terry Reedy wrote: > On 6/11/2011 9:30 AM, Jan Kaliszewski wrote: >> == Use cases == >> >> A quite common practice is 'injecting' objects into a function as its >> locals, at def-time, using function arguments with default values... [...] > One problem with trying to 'fix' this is that there can be defaulted args > which are not intended to be overwritten by users but which are intended to > be replaced in recursive calls. I think any solution to this would have to be backward compatible. A big NO to anything which changes the behaviour of existing code. >> == Proposed solutions == >> >> I see three possibilities: >> >> 1. >> To add a new keyword, e.g. `inject': >> def do_and_remember(val, verbose=False): >> inject mem = collections.Counter() >> ... > The body should all be runtime. Deftime expression should be in the header. That's not even the case now. 
The global and nonlocal keywords are in the body, and they apply at compile-time. I don't like the name inject as shown, but I like the idea of injecting locals into a function from the outside. (Or rather, into a *copy* of the function.) This suggests generalising the idea: take any function, and make a copy of it with the specified names/values defined as locals. The obvious API is a decorator (presumably living in functools). Assume we can write such a decorator, and postpone discussion of any implementation for now. Firstly, this provides a way of setting locals at function definition time without polluting the parameter list and exposing local variables to the caller. Function arguments should be used for arguments, not internal implementation details. @inject(mem=collections.Counter()) def do_and_remember(val, verbose=False): # like do_and_remember(val, verbose=False, mem=...) But more importantly, it has wider applications, like testing, introspection, or adding logging to functions: def my_function(alist): return random.choice(alist) + 1 You might not be able to modify my_function, it may be part of a library you don't control. As written, if you want to test it, you need to monkey-patch the random module, which is a dangerous anti-pattern. Better to do this: class randomchoice_mock: def choice(self, arg): return 0 mock = randomchoice_mock() test_func = inject(random=mock)(my_function) Because test_func is a copy of my_function, you can be sure that you won't break anything. Adding logging is just as easy. This strikes me as the best solution: the decorator is at the head of the function, so it looks like a declaration, and it has its effect at function definition time. But as Terry points out, such a decorator might not be currently possible without language support, or at least messy byte-code hacking. -- Steven From steve at pearwood.info Mon Jun 13 09:11:56 2011 From: steve at pearwood.info (Steven D'Aprano) Date: Mon, 13 Jun 2011 17:11:56 +1000 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: References: <20110611133028.GD2395@chopin.edu.pl> <4BDDEB96-1EBD-477C-BA08-7750AB42B579@gmail.com> <20110612222236.GA2540@chopin.edu.pl> <4DF53DF1.9060804@canterbury.ac.nz> Message-ID: <4DF5B83C.9070206@pearwood.info> Terry Reedy wrote: > On 6/12/2011 6:30 PM, Greg Ewing wrote: >> I'm -1 on any proposal that somehow tries to make the >> default-argument hack more acceptable. >> >> The main reason people still feel the need to use it >> is that the for-loop is broken, insofar as it doesn't >> create a new binding for each iteration. >> >> The right way to address that is to fix the for-loop, > > Or use closures, which were partly designed to replace default arg use. Default args are specifically used in at least one use-case where closures give the wrong result. >>> funcs = [lambda x: x+i for i in range(10)] >>> funcs[0].__closure__ # may be different in Python 2.x (,) >>> funcs[0](42) # should return 42+0 51 The usual solution is to *not* use a closure: >>> funcs = [lambda x, i=i: x+i for i in range(10)] >>> funcs[0].__closure__ is None True >>> funcs[0](42) 42 >>> funcs[9](42) 51 > This case is quite different from the multiple capture in for-loop case. > The OP is simply trying to localize names for speed instead of using > module constants, which would otherwise do quite fine and are routinely > used in the stdlib. > That's just one use-case. Jan gave two others. Optimizations might be common in the stdlib, but it's a hack, and an ugly one. 
Function parameters should be kept for actual arguments, not for optimizing name look-ups. -- Steven From ncoghlan at gmail.com Mon Jun 13 10:05:31 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Mon, 13 Jun 2011 18:05:31 +1000 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: <4DF5B83C.9070206@pearwood.info> References: <20110611133028.GD2395@chopin.edu.pl> <4BDDEB96-1EBD-477C-BA08-7750AB42B579@gmail.com> <20110612222236.GA2540@chopin.edu.pl> <4DF53DF1.9060804@canterbury.ac.nz> <4DF5B83C.9070206@pearwood.info> Message-ID: On Mon, Jun 13, 2011 at 5:11 PM, Steven D'Aprano wrote: > Function parameters should be kept for actual arguments, not for optimizing > name look-ups. Still, the post-** shared state (Jan's option 2) is likely the most obvious way to get early binding for *any* purpose without polluting the externally visible parameter list. Several questions that are unclear in the general case of definition-time code are resolved in obvious ways by that approach: Q. When is the code executed? A. At definition time, just like default argument values Q. Where are the results of the calculation stored? A. On the function object, just like default argument values Q. How does the compiler know to generate local variable lookups for those attributes? A. The names are specified in the function header, just like public parameters Q. What is the advantage over custom classes with __call__ methods? A. Aside from the obvious speed disadvantage, moving from a function with state that is preserved between calls to a stateful class that happens to be callable is a surprisingly large mental shift that may not fit well with the conceptual structure of a piece of code. While *technically* they're the same thing (just expressed in different ways), in reality the difference in relative emphasis of algorithm vs shared state can make one mode of expression far more natural than the other in a given context. class DoAndRemember(): def __init__(self): self.mem = collections.Counter() def __call__(self, val, verbose=False): result = do_something(val) self.mem[val] += 1 if verbose: print('Done {} times for {!r}'.format(self.mem[val], val)) do_and_remember = DoAndRemember() Custom classes also suffer grievously when it comes to supporting introspection (e.g. try help() or inspect.getargspec() on the above) and lack natural support for other features of functions (such as easy decorator compatibility, descriptor protocol support, standard annotations, appropriate __name__ assignment). Q. What is the advantage over using an additional level of closure? A. This is actually the most viable alternative, since the conceptual model is quite a close match and it doesn't break introspection the way a custom class does. The problems with this approach are largely syntactic: def _make_do_and_remember(): mem=collections.Counter() def do_and_remember(val, verbose=False): result = do_something(val) mem[val] += 1 if verbose: print('Done {} times for {!r}'.format(mem[val], val)) return do_and_remember do_and_remember = _make_do_and_remember() 1. The function signature is buried inside "_make_do_and_remember" (the class approach and even PEP 3150 have the same problem) 2. The name of the function in the current namespace and its __name__ attribute have been decoupled, require explicit repetition to keep them the same 3. 
This is basically an unreadable mess I'd actually be far happier with the default argument hack equivalent: def do_and_remember(val, verbose=False, *, _mem=collections.Counter()): result = do_something(val) _mem[val] += 1 if verbose: print('Done {} times for {!r}'.format(_mem[val], val)) All a "persistent state" proposal would do is create an alternative to the default argument hack that doesn't suffer from the same problems: def do_and_remember(val, verbose=False, **, mem=collections.Counter()): result = do_something(val) mem[val] += 1 if verbose: print('Done {} times for {!r}'.format(_mem[val], val)) It seems like the path of least resistance to me - the prevalence of the default argument hack means there's an existing, widespread practice that solves real programming issues, but is flawed in some ways (specifically, messing with the function's signature). Allowing declarations of shared state after the keyword-only arguments seems like a fairly obvious answer. The one potential trap is the classic one with immutable nonlocal variables that haven't been declared as such (this trap also applies to any existing use of the default argument hack): reassignment will *not* modify the shared state, only the name binding in the current invocation. Cheers, Nick. -- Nick Coghlan?? |?? ncoghlan at gmail.com?? |?? Brisbane, Australia From ncoghlan at gmail.com Mon Jun 13 13:57:06 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Mon, 13 Jun 2011 21:57:06 +1000 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: References: <20110611133028.GD2395@chopin.edu.pl> <4BDDEB96-1EBD-477C-BA08-7750AB42B579@gmail.com> <20110612222236.GA2540@chopin.edu.pl> <4DF53DF1.9060804@canterbury.ac.nz> <4DF5B83C.9070206@pearwood.info> Message-ID: On Mon, Jun 13, 2011 at 6:05 PM, Nick Coghlan wrote: > All a "persistent state" proposal would do is create an alternative to > the default argument hack that doesn't suffer from the same problems: > > ? ? ? def do_and_remember(val, verbose=False, **, mem=collections.Counter()): > ? ? ? ? ? result = do_something(val) > ? ? ? ? ? mem[val] += 1 > ? ? ? ? ? if verbose: > ? ? ? ? ? ? ? print('Done {} times for {!r}'.format(_mem[val], val)) As yet another shade for this particular bikeshed, this one just occurred to me: def do_and_remember(val, verbose=False): @def mem=collections.Counter() result = do_something(val) mem[val] += 1 if verbose: print('Done {} times for {!r}'.format(_mem[val], val)) The @def ("at def") statement is just a new flavour of the same proposal that has been made many times before: a way to indicate that a simple assignment statement should be executed once at function definition time rather than repeatedly on every call to the function. Cheers, Nick. -- Nick Coghlan?? |?? ncoghlan at gmail.com?? |?? Brisbane, Australia From arnodel at gmail.com Mon Jun 13 14:22:07 2011 From: arnodel at gmail.com (Arnaud Delobelle) Date: Mon, 13 Jun 2011 13:22:07 +0100 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: References: <20110611133028.GD2395@chopin.edu.pl> <4BDDEB96-1EBD-477C-BA08-7750AB42B579@gmail.com> <20110612222236.GA2540@chopin.edu.pl> <4DF53DF1.9060804@canterbury.ac.nz> <4DF5B83C.9070206@pearwood.info> Message-ID: On 13 June 2011 12:57, Nick Coghlan wrote: > As yet another shade for this particular bikeshed, this one just occurred to me: > > ? ?def do_and_remember(val, verbose=False): > ? ? ? ? ?@def mem=collections.Counter() > ? ? ? ? ?result = do_something(val) > ? ? ? ? 
?mem[val] += 1 > ? ? ? ? ?if verbose: > ? ? ? ? ? ? ?print('Done {} times for {!r}'.format(_mem[val], val)) Or to link this to PEP 3150: given: mem = collections.Counter() def do_and_remember(val, verbose=False): result = do_something(val) mem[val] += 1 if verbose: print('Done {} times for {!r}'.format(_mem[val], val)) (Or the other way around) -- Arnaud From steve at pearwood.info Mon Jun 13 16:33:50 2011 From: steve at pearwood.info (Steven D'Aprano) Date: Tue, 14 Jun 2011 00:33:50 +1000 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: References: <20110611133028.GD2395@chopin.edu.pl> <4BDDEB96-1EBD-477C-BA08-7750AB42B579@gmail.com> <20110612222236.GA2540@chopin.edu.pl> <4DF53DF1.9060804@canterbury.ac.nz> <4DF5B83C.9070206@pearwood.info> Message-ID: <4DF61FCE.4050206@pearwood.info> Nick Coghlan wrote: > On Mon, Jun 13, 2011 at 5:11 PM, Steven D'Aprano wrote: >> Function parameters should be kept for actual arguments, not for optimizing >> name look-ups. > > Still, the post-** shared state (Jan's option 2) is likely the most > obvious way to get early binding for *any* purpose without polluting > the externally visible parameter list. I wouldn't call adding even more complexity to function signatures "obvious", although I grant that it depends on whether you're Dutch :) Another disadvantage is that it uses a symbol instead of a word. Too many symbols, and your code looks like Perl (or APL). It's hard to google for ** to find out what it means. It's harder to talk about a symbol than a word. (In written text you can just write ** but in speech you have to use circumlocutions or made-up names like double-splat.) [...] > It seems like the path of least resistance to me - the prevalence of > the default argument hack means there's an existing, widespread > practice that solves real programming issues, but is flawed in some > ways (specifically, messing with the function's signature). Allowing > declarations of shared state after the keyword-only arguments seems > like a fairly obvious answer. The problem with injecting locals in the parameter list is that it can only happen at write-time. That's useful, but there's a major opportunity being missed: to be able to inject at runtime. You could add test mocks, optimized functions, logging, turn global variables into local constants, and probably things I've never thought of. Here's one use-case to give a flavour of what I have in mind: if you're writing Unix-like scripts, one piece of useful functionality is "verbose mode". Here's one way of doing so: def do_work(args, verbose=False): if verbose: pr = print else: pr = lambda *args: None pr("doing spam") spam() pr("doing ham") ham() # and so on if __name__ == '__main__': verbose = '--verbose' in sys.argv do_work(my_arguments, verbose) But why does do_work take a verbose flag? That isn't part of the API for the do_work function itself, which might be usefully called by other bits of code. The verbose argument is only there to satisfy the needs of the user interface. Using a ** hidden argument would solve that problem, but you then have to specify the value of verbose at write-time, defeating the purpose. Here's an injection solution. 
First, the body of the function needs a generic hook, with a global do-nothing default: def hook(*args): pass def do_work(args): hook("doing spam") spam() hook("doing ham") ham() # and so on if __name__ == '__main__': if '--verbose' in sys.argv: wrap = inject(hook=print) else: wrap = lambda func: func # do nothing # or `inject(hook=hook)` to micro-optimize wrap(do_work)(my_arguments) If you want to add logging, its easy: just add an elif clause with wrap = inject(hook=logger). Because you aren't monkey-patching the hook function (or, heaven help us, monkey-patching builtins.print!) you don't need to fear side-effects. No globals are patched, hence no mysterious action-at-a-distance bugs. And because the injected function is a copy of the original, other parts of the code that use do_work are unaffected. But for this to work, you have to be able to inject at run-time, not just at write-time. -- Steven From debatem1 at gmail.com Mon Jun 13 18:59:55 2011 From: debatem1 at gmail.com (geremy condra) Date: Mon, 13 Jun 2011 09:59:55 -0700 Subject: [Python-ideas] PEP 3150 (statement local namespaces) updated based on April discussion In-Reply-To: References: Message-ID: On Sun, Jun 12, 2011 at 2:20 PM, Terry Reedy wrote: > On 6/12/2011 11:44 AM, Nick Coghlan wrote: >> >> The rationale and proposed semantics sections of PEP 3150 have been >> heavily modified based on the various discussions on the topic back in >> April. >> >> http://www.python.org/dev/peps/pep-3150/ >> >> (Note that the PEP is still officially Deferred - this entire idea >> still intrigues me, but I'm also still not convinced it's worth the >> additional language complexity). > > I tried to read this with an open mind, but I still strongly dislike it. I > think it would make Python harder to learn and read and would lead to more > confusion and questions on python-list. > > I think having a different rule for function compilation makes the proposal > even worse, as people would come to expect early binding (outside the given > context) even more than now. If one does not like default args, one can > either use class instances or closures. The latter were explicitly > introduced as an alternative to using default args. Do we really need a > fourth solution to the same problem? > > I do not see comprehensions, which are *expressions*, as a precedent for > out-of-order *statements*. Nested expressions are not left-to-right either, > nor are assignments: "a[3] = b*f(c+d)". > > 'Given' or 'where' constructs are sometimes used in mathematical writings, > especially formula exposition, but they typically, if not always, are > isolated (like the examples in the PEP), and not part of a code sequence. So > this is notreally a precedent to me. Example: > > ?fv = p * (1 + i/12)**t... , where > ? ?fv = present value > ? ?p ?= principle > ? ?i ?= nominal annual interest rate > ? ?t ?= time in months > > -(1+whatever) I've historically been in favor of this kind of proposal specifically because I'd like to be able to write the above code, but I have to admit that many of the examples given in the PEP terrify me. The point of exposition like this is to enhance readability by making the flow of information (defined by the equation on the first line) extremely clear; tucking functions and classes with their own flow into the out-of-order block just makes it really hard to figure out what's happening where and when. 
Geremy Condra From jimjjewett at gmail.com Mon Jun 13 20:11:52 2011 From: jimjjewett at gmail.com (Jim Jewett) Date: Mon, 13 Jun 2011 14:11:52 -0400 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: <4DF61FCE.4050206@pearwood.info> References: <20110611133028.GD2395@chopin.edu.pl> <4BDDEB96-1EBD-477C-BA08-7750AB42B579@gmail.com> <20110612222236.GA2540@chopin.edu.pl> <4DF53DF1.9060804@canterbury.ac.nz> <4DF5B83C.9070206@pearwood.info> <4DF61FCE.4050206@pearwood.info> Message-ID: On Mon, Jun 13, 2011 at 10:33 AM, Steven D'Aprano wrote: > Nick Coghlan wrote: >> >> On Mon, Jun 13, 2011 at 5:11 PM, Steven D'Aprano >> wrote: >>> Function parameters should be kept for actual arguments, not for >>> optimizing name look-ups. Even the bind-it-now behavior isn't always for optimization; it can also be used as a way of forcing stability in case the global name gets rebound. That is often an anti-pattern in practice, but ... not always. >> Still, the post-** shared state (Jan's option 2) is likely the most >> obvious way to get early binding for *any* purpose without polluting >> the externally visible parameter list. I would say the most obvious place is in a decorator, using the function object (or a copy) as the namespace. Doing this properly would require some variant of PEP 3130, which was rejected largely for insufficient use. > The problem with injecting locals in the parameter list is that it can only > happen at write-time. That's useful, but there's a major opportunity being > missed: to be able to inject at runtime. You could add test mocks, optimized > functions, logging, turn global variables into local constants, and probably > things I've never thought of. Using the function object as a namespace (largely) gets around that, because you can use a with statement to change the settings temporarily. > Here's one use-case to give a flavour of what I have in mind: if you're > writing Unix-like scripts, one piece of useful functionality is "verbose > mode". Here's one way of doing so: [A verbose mode -- full example below, but the new spelling here at the top] Just replace: > def do_work(args): > ? ?hook("doing spam") > ? ?spam() > ? ?hook("doing ham") > ? ?ham() with: def do_work(args): ? ? __function__.hook("doing spam") ? ? spam() ? ? __function__.hook("doing ham") ? ? ham() If you want to change the bindings, just rebind do_work.hook to the correct function. If you are doing this as part of a test, do so within a with statement that sets it back at the end. (The reason this requires a variant of 3130 is that the name do_work may itself be rebound, so do_work.hook isn't a reliable pointer.) -jJ [only quotes below here] > def do_work(args, verbose=False): > ? ?if verbose: > ? ? ? ? pr = print > ? ?else: > ? ? ? ? pr = lambda *args: None > ? ?pr("doing spam") > ? ?spam() > ? ?pr("doing ham") > ? ?ham() > ? ?# and so on > > if __name__ == '__main__': > ? ?verbose = '--verbose' in sys.argv > ? ?do_work(my_arguments, verbose) > > But why does do_work take a verbose flag? That isn't part of the API for the > do_work function itself, which might be usefully called by other bits of > code. The verbose argument is only there to satisfy the needs of the user > interface. Using a ** hidden argument would solve that problem, but you then > have to specify the value of verbose at write-time, defeating the purpose. > > Here's an injection solution. First, the body of the function needs a > generic hook, with a global do-nothing default: > > > def hook(*args): > ? 
?pass > > def do_work(args): > ? ?hook("doing spam") > ? ?spam() > ? ?hook("doing ham") > ? ?ham() > ? ?# and so on > > if __name__ == '__main__': > ? ?if '--verbose' in sys.argv: > ? ? ? ?wrap = inject(hook=print) > ? ?else: > ? ? ? ?wrap = lambda func: func ?# do nothing > ? ? ? ?# or `inject(hook=hook)` to micro-optimize > ? ?wrap(do_work)(my_arguments) > > > If you want to add logging, its easy: just add an elif clause with > wrap = inject(hook=logger). > > Because you aren't monkey-patching the hook function (or, heaven help us, > monkey-patching builtins.print!) you don't need to fear side-effects. No > globals are patched, hence no mysterious action-at-a-distance bugs. And > because the injected function is a copy of the original, other parts of the > code that use do_work are unaffected. > > > But for this to work, you have to be able to inject at run-time, not just at > write-time. From paul at colomiets.name Mon Jun 13 21:14:31 2011 From: paul at colomiets.name (Paul Colomiets) Date: Mon, 13 Jun 2011 22:14:31 +0300 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: <4DF61FCE.4050206@pearwood.info> References: <20110611133028.GD2395@chopin.edu.pl> <4BDDEB96-1EBD-477C-BA08-7750AB42B579@gmail.com> <20110612222236.GA2540@chopin.edu.pl> <4DF53DF1.9060804@canterbury.ac.nz> <4DF5B83C.9070206@pearwood.info> <4DF61FCE.4050206@pearwood.info> Message-ID: On Mon, Jun 13, 2011 at 5:33 PM, Steven D'Aprano wrote: > def hook(*args): > ? ?pass > > def do_work(args): > ? ?hook("doing spam") > ? ?spam() > ? ?hook("doing ham") > ? ?ham() > ? ?# and so on > > if __name__ == '__main__': > ? ?if '--verbose' in sys.argv: > ? ? ? ?wrap = inject(hook=print) > ? ?else: > ? ? ? ?wrap = lambda func: func ?# do nothing > ? ? ? ?# or `inject(hook=hook)` to micro-optimize > ? ?wrap(do_work)(my_arguments) > > > If you want to add logging, its easy: just add an elif clause with > wrap = inject(hook=logger). It's quite promising idea. Currenlty there are notion of cell for closures. What if globals would also use a cell? So that cell cound be either bound to a value or to a name in globals or builtin dictionary. With this in mind it could be possible to either change binding from name to value or vice versa, our to make a copy of the function with another cells. I think this adheres to Python philosophy of having anything modifyable. It will add at most two words of memory for each cell (name and global dict), and probably will not make interpreter slower. Also will probably allow to remove __globals__ attribute from functions in the long term. Then it even be possible to make some modules faster by either from __future__ import fast_bindings or it could be done by some external library like: __super_freezer_allow__ = True ... import sys, super_freezer super_freezer.apply(sys.modules) Probably about 80% modules do not need to rebind globals, so they can run faster. And if you need to monkeypatch them, just either not freeze globals in this module or change the bindings in all its functions. Thoughts? 
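Without touching the interpreter, the "make a copy of the function with other bindings" part can at least be approximated in pure Python today -- essentially a 3.x spelling of the localize() hack posted earlier in the thread. It gives none of the speed benefit that real frozen cells would (every lookup still goes through a dict), and mutable objects are still shared, so treat it only as a sketch of the intended semantics; the helper name is made up:

    import types

    def snapshot_globals(func):
        # illustrative helper: return a copy of func whose global references
        # resolve against a snapshot of the current module globals, so later
        # rebindings of those names are not seen by the copy
        frozen = dict(func.__globals__)          # shallow copy only
        copy = types.FunctionType(func.__code__, frozen, func.__name__,
                                  func.__defaults__, func.__closure__)
        copy.__kwdefaults__ = func.__kwdefaults__
        copy.__dict__.update(func.__dict__)
        return copy

    x = 1
    def f():
        return x

    g = snapshot_globals(f)
    x = 3
    print(f(), g())      # -> 3 1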
-- Paul From tjreedy at udel.edu Mon Jun 13 21:33:24 2011 From: tjreedy at udel.edu (Terry Reedy) Date: Mon, 13 Jun 2011 15:33:24 -0400 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: <4DF5B83C.9070206@pearwood.info> References: <20110611133028.GD2395@chopin.edu.pl> <4BDDEB96-1EBD-477C-BA08-7750AB42B579@gmail.com> <20110612222236.GA2540@chopin.edu.pl> <4DF53DF1.9060804@canterbury.ac.nz> <4DF5B83C.9070206@pearwood.info> Message-ID: On 6/13/2011 3:11 AM, Steven D'Aprano wrote: > Terry Reedy wrote: >> Or use closures, which were partly designed to replace default arg use. > > Default args are specifically used in at least one use-case where > closures give the wrong result. I meant an explicit user-defined closure with a separate cell for each function ... > >>> funcs = [lambda x: x+i for i in range(10)] > >>> funcs[0].__closure__ # may be different in Python 2.x > (,) > >>> funcs[0](42) # should return 42+0 > 51 not this implicit one where each function uses the *same* cell referring to the same int object. >>> funcs[0].__closure__ (,) >>> funcs[9].__closure__ (,) The fundamental problem with this code for funcs is that "lambda x: x+i" is a *constant* equivalent to "def _(x): return x+i". Executing either 10 times creates 10 duplicate functions. The hypnotic effect of 'lambda' is that some do not immediately see the equivalence. > The usual solution is to *not* use a closure: > > >>> funcs = [lambda x, i=i: x+i for i in range(10)] > >>> funcs[0].__closure__ is None > True > >>> funcs[0](42) > 42 > >>> funcs[9](42) > 51 The explicit closure solution intended to replace "lambda x,i=i:x+i" is >>> def makef(j): return lambda x: x+j >>> funcs = [makef(i) for i in range(10)] >>> list(funcs[_](42) for _ in range(10)) [42, 43, 44, 45, 46, 47, 48, 49, 50, 51] >>> funcs[0].__closure__ (,) >>> funcs[9].__closure__ (,) We now have difference cells containing different ints. To get different functions from multiple compilations of one body we need either different defaults for pseudo-parameters or different closure cells. The rationale for adding the latter was partly to be an alternative to the former. Once closure cells were made writable with 'nonlocal', they gained additional uses, or rather, replaced the awkward hack of using mutable 1-element lists as closure contents, with the one elements being the true desired content. -- Terry Jan Reedy From tjreedy at udel.edu Mon Jun 13 21:57:31 2011 From: tjreedy at udel.edu (Terry Reedy) Date: Mon, 13 Jun 2011 15:57:31 -0400 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: <4DF61FCE.4050206@pearwood.info> References: <20110611133028.GD2395@chopin.edu.pl> <4BDDEB96-1EBD-477C-BA08-7750AB42B579@gmail.com> <20110612222236.GA2540@chopin.edu.pl> <4DF53DF1.9060804@canterbury.ac.nz> <4DF5B83C.9070206@pearwood.info> <4DF61FCE.4050206@pearwood.info> Message-ID: On 6/13/2011 10:33 AM, Steven D'Aprano wrote: > def hook(*args): > pass > > def do_work(args): > hook("doing spam") > spam() > hook("doing ham") > ham() Given the expense of function calls, I would write the above as hook = None def do(args): if hook: hook("doing spam") ... if __name__ == '__main__': if '--verbose' in sys.argv: wrap = inject(hook=print) I do not see the point of all this complication. If you are not trying to optimize the function (and adding such hooks is obviously not), hook = print works just fine (in 3.x ;-). 
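Spelled out in full, the simpler arrangement described above would presumably be nothing more than a module-level rebinding (spam, ham and my_arguments as in the earlier example; the usual caveat applies that rebinding the module-level name affects every caller of do_work):

    import sys

    hook = None

    def do_work(args):
        if hook:
            hook("doing spam")
        spam()
        if hook:
            hook("doing ham")
        ham()
        # and so on

    if __name__ == '__main__':
        if '--verbose' in sys.argv:
            hook = print   # module level, so no 'global' declaration needed
        do_work(my_arguments)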
-- Terry Jan Reedy From tjreedy at udel.edu Mon Jun 13 22:09:18 2011 From: tjreedy at udel.edu (Terry Reedy) Date: Mon, 13 Jun 2011 16:09:18 -0400 Subject: [Python-ideas] PEP 3150 (statement local namespaces) updated based on April discussion In-Reply-To: References: Message-ID: On 6/13/2011 12:59 PM, geremy condra wrote: > On Sun, Jun 12, 2011 at 2:20 PM, Terry Reedy wrote: >> 'Given' or 'where' constructs are sometimes used in mathematical writings, >> especially formula exposition, but they typically, if not always, are >> isolated (like the examples in the PEP), and not part of a code sequence. So >> this is notreally a precedent to me. Example: >> >> fv = p * (1 + i/12)**t... , where >> fv = present value >> p = principle >> i = nominal annual interest rate >> t = time in months >> >> -(1+whatever) > > I've historically been in favor of this kind of proposal specifically > because I'd like to be able to write the above code, Part of my point is that the above is not *code* (and hence not an argument for formatting code that way). It is a text definition. Someone who wrote something like the above might very well then write a *code* version the equivalent of p = input("principle: ") i = input("nominal annual interest rate: ") t = input("time in months: ") print("Future value of ${} at {}% after {} months is ${}" .format(p, i, t, p*(1+i/12)**t > but I have to > admit that many of the examples given in the PEP terrify me. The point > of exposition like this is to enhance readability by making the flow > of information (defined by the equation on the first line) extremely > clear; tucking functions and classes with their own flow into the > out-of-order block just makes it really hard to figure out what's > happening where and when. I obviously agree with all this. -- Terry Jan Reedy From debatem1 at gmail.com Mon Jun 13 22:37:03 2011 From: debatem1 at gmail.com (geremy condra) Date: Mon, 13 Jun 2011 13:37:03 -0700 Subject: [Python-ideas] PEP 3150 (statement local namespaces) updated based on April discussion In-Reply-To: References: Message-ID: On Mon, Jun 13, 2011 at 1:09 PM, Terry Reedy wrote: > On 6/13/2011 12:59 PM, geremy condra wrote: >> >> On Sun, Jun 12, 2011 at 2:20 PM, Terry Reedy ?wrote: > >>> 'Given' or 'where' constructs are sometimes used in mathematical >>> writings, >>> especially formula exposition, but they typically, if not always, are >>> isolated (like the examples in the PEP), and not part of a code sequence. >>> So >>> this is notreally a precedent to me. Example: >>> >>> ?fv = p * (1 + i/12)**t... , where >>> ? ?fv = present value >>> ? ?p ?= principle >>> ? ?i ?= nominal annual interest rate >>> ? ?t ?= time in months >>> >>> -(1+whatever) >> >> I've historically been in favor of this kind of proposal specifically >> because I'd like to be able to write the above code, > > Part of my point is that the above is not *code* (and hence not an argument > for formatting code that way). It is a text definition. Someone who wrote > something like the above might very well then write a *code* version the > equivalent of > > p = input("principle: ") > i = input("nominal annual interest rate: ") > t = input("time in months: ") > print("Future value of ${} at {}% after {} months is ${}" > ? ? ?.format(p, i, t, p*(1+i/12)**t Meh, semantics. 
I could, hypothetically, write the following: y = sqrt(z**2 + x**2) given: z = get_adjacent_side_length() x = get_opposite_side_length() and in terms of exposition style it would be basically the same as what you put up earlier, and is (to me) more readable than: z = get_adjacent_side_length() x = get_opposite_side_length() y = sqrt(z**2 + x**2) for the same reasons. The question is whether the (admittedly arguable) gain in readability offered in situations like this is worth the extra complexity of implementation and the risk of seeing code like the examples from the PEP in practice. Also, just to make sure I'm being clear- Nick, I'm not trying to bash your code. I just think that right now this lends itself to some really hard-to-understand constructions. Geremy Condra Geremy Condra From zuo at chopin.edu.pl Tue Jun 14 00:30:27 2011 From: zuo at chopin.edu.pl (Jan Kaliszewski) Date: Tue, 14 Jun 2011 00:30:27 +0200 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: References: <20110611133028.GD2395@chopin.edu.pl> <4BDDEB96-1EBD-477C-BA08-7750AB42B579@gmail.com> <20110612222236.GA2540@chopin.edu.pl> <4DF53DF1.9060804@canterbury.ac.nz> <4DF5B83C.9070206@pearwood.info> Message-ID: <20110613223027.GB3483@chopin.edu.pl> Nick Coghlan dixit (2011-06-13, 21:57): > def do_and_remember(val, verbose=False): > @def mem=collections.Counter() > result = do_something(val) > mem[val] += 1 > if verbose: > print('Done {} times for {!r}'.format(_mem[val], val)) > > The @def ("at def") statement is just a new flavour of the same > proposal that has been made many times before: a way to indicate that > a simple assignment statement should be executed once at function > definition time rather than repeatedly on every call to the function. If using '@' character, I'd rather prefer: @in(mem=collections.Counter()) def do_and_remember(val, verbose=False): result = do_something(val) mem[val] += 1 if verbose: print('Done {} times for {!r}'.format(_mem[val], val)) @in (or @with, or @within, or @withlocal, or...) could be a language syntax construct, not a real decorator, though using -- already well settled -- decorator-like syntax. Important advantage of this variant is IMHO that then it is obvious for everybody that the binding(s) is (are) being done *early*. Regards. 
*j From greg.ewing at canterbury.ac.nz Tue Jun 14 00:37:20 2011 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Tue, 14 Jun 2011 10:37:20 +1200 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: References: <20110611133028.GD2395@chopin.edu.pl> <4BDDEB96-1EBD-477C-BA08-7750AB42B579@gmail.com> <20110612222236.GA2540@chopin.edu.pl> <4DF53DF1.9060804@canterbury.ac.nz> Message-ID: <4DF69120.6090403@canterbury.ac.nz> Nick Coghlan wrote: > -lots on any idea that would make: > > def f(): > i = 0 > def g1(): > return i > i = 1 > def g2(): > return i > return [g1, g2] > > differ in external behaviour from: > > def f(): > result = [] > for i in range(2): > def g(): > return i > result.append(g) > return result One possible variation of my idea wouldn't change the existing behaviour of the for-loop at all, but would require you to explicitly request new-binding behaviour, using something like for new i in range(2): def g(): return i -- Greg From greg.ewing at canterbury.ac.nz Tue Jun 14 00:19:01 2011 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Tue, 14 Jun 2011 10:19:01 +1200 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: <20110613000332.GA5821@chopin.edu.pl> References: <20110611133028.GD2395@chopin.edu.pl> <4BDDEB96-1EBD-477C-BA08-7750AB42B579@gmail.com> <20110612222236.GA2540@chopin.edu.pl> <4DF53DF1.9060804@canterbury.ac.nz> <20110613000332.GA5821@chopin.edu.pl> Message-ID: <4DF68CD5.4070402@canterbury.ac.nz> Jan Kaliszewski wrote: > My propositions don't make that hack less acceptable -- proposing an > alternative. You seem to be proposing yet another feature whose main purpose is to patch over a mismatch between existing features. That's not the path to elegant language design. > Do you mean that each iteration should create separate local scope? No... > Or that the loop variable should be treated specially? Yes, but in a way that you're probably not expecting. :-) My proposal is that, if the loop variable is referenced by an inner function (and is therefore in a cell), a new cell is created on each iteration instead of replacing the contents of the existing cell. This would mean that: * If the loop variable is *not* referenced by an inner function (the vast majority of cases), there would be no change from current semantics and no impact on performance. * In any case, the loop variable can still be referenced after the loop has finished with the expected results. One objection that's been raised is that, as described, it's somewhat CPython-specific, and it's uncertain how other Pythons would get on trying to implement it. > i_lambdas, j_lambdas = [], [] > for i in range(10): > j = i > i_lambdas.append(lambda: i) > j_lambdas.append(lambda: j) > print(i_lambdas[2]()) # would print 2 > print(j_lambdas[2]()) # would print 9 > Yes, that's true. An extension to the idea would be to provide a way of specifying cell-replacement behaviour for any assignment, maybe something like j = new i Then your example would print 2 both times, and the values of both i and j after the loop would be 9. One slightly curly aspect would be that if you *changed* the value of i or j after the loop, the change would be seen by the *last* lambdas created, and not any of the others. :-) But I find it hard to imagine anyone doing this -- if you're capturing variables in a loop, you don't normally expect to have access to the loop variable at all after the loop finishes. 
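To make the semantics under discussion concrete, here is today's behaviour next to the usual workaround (binding the loop variable through a default argument), which gives roughly the per-iteration capture that the proposed "new" binding would provide without the extra parameter:

    i_lambdas = []
    for i in range(10):
        i_lambdas.append(lambda: i)       # late binding: all share one cell
    print(i_lambdas[2]())                 # prints 9

    j_lambdas = []
    for i in range(10):
        j_lambdas.append(lambda i=i: i)   # value captured on each iteration
    print(j_lambdas[2]())                 # prints 2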
-- Greg From steve at pearwood.info Tue Jun 14 01:25:52 2011 From: steve at pearwood.info (Steven D'Aprano) Date: Tue, 14 Jun 2011 09:25:52 +1000 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: References: <20110611133028.GD2395@chopin.edu.pl> <4BDDEB96-1EBD-477C-BA08-7750AB42B579@gmail.com> <20110612222236.GA2540@chopin.edu.pl> <4DF53DF1.9060804@canterbury.ac.nz> <4DF5B83C.9070206@pearwood.info> <4DF61FCE.4050206@pearwood.info> Message-ID: <4DF69C80.9000305@pearwood.info> Terry Reedy wrote: [...] > if __name__ == '__main__': > if '--verbose' in sys.argv: > wrap = inject(hook=print) > > I do not see the point of all this complication. If you are not trying > to optimize the function (and adding such hooks is obviously not), > hook = print > works just fine (in 3.x ;-). You're modifying a global variable. Now any other function that calls do_work() for its own purposes suddenly finds it mysteriously printing. A classic action-at-a-distance bug. For a simple stand-alone script, there's no problem, but once you have more complexity in your app, or a library, things become very different. My apologies, I've been doing a lot of reading about the pros and cons (mostly cons *wink*) of monkey-patching in the Ruby world, the open/closed principle, and various forms of bugs caused by the use of globals. I assumed that the problems would be blindingly obvious. I suppose they were only obvious to me because I'd just immersed myself in them for the last day or so! -- Steven From greg.ewing at canterbury.ac.nz Tue Jun 14 01:29:40 2011 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Tue, 14 Jun 2011 11:29:40 +1200 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: <4DF61FCE.4050206@pearwood.info> References: <20110611133028.GD2395@chopin.edu.pl> <4BDDEB96-1EBD-477C-BA08-7750AB42B579@gmail.com> <20110612222236.GA2540@chopin.edu.pl> <4DF53DF1.9060804@canterbury.ac.nz> <4DF5B83C.9070206@pearwood.info> <4DF61FCE.4050206@pearwood.info> Message-ID: <4DF69D64.7030308@canterbury.ac.nz> Steven D'Aprano wrote: > Because you aren't monkey-patching the hook function (or, heaven help > us, monkey-patching builtins.print!) you don't need to fear > side-effects. It's still rather non-obvious what's going on, though. Copious commenting would be needed to make this style of coding understandable. Also, it doesn't seem to generalise. What if the function in question calls other functions, which call other functions, which themselves need a verbose option? It seems you would need to explicitly wrap all the sub-function calls to pass the hook on to them. And what if there is more than one option to be hooked? You'd rapidly end up with a nightmarish mess. Here's another way to approach the problem: class HookableWorker(object): def hook(self, arg): pass def do_work(self): self.hook("Starting work") ... self.hook("Stopping work") def be_verbose(arg): print arg def main(): worker = HookableWorker() if "--verbose" in sys.argv: worker.hook = be_verbose worker.do_work() Now you can expand the HookableWorker class by adding more methods that all share the same hook, still without anything being global. 
-- Greg From steve at pearwood.info Tue Jun 14 01:55:26 2011 From: steve at pearwood.info (Steven D'Aprano) Date: Tue, 14 Jun 2011 09:55:26 +1000 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: References: <20110611133028.GD2395@chopin.edu.pl> <4BDDEB96-1EBD-477C-BA08-7750AB42B579@gmail.com> <20110612222236.GA2540@chopin.edu.pl> <4DF53DF1.9060804@canterbury.ac.nz> <4DF5B83C.9070206@pearwood.info> <4DF61FCE.4050206@pearwood.info> Message-ID: <4DF6A36E.8090006@pearwood.info> Jim Jewett wrote: > On Mon, Jun 13, 2011 at 10:33 AM, Steven D'Aprano wrote: >> Nick Coghlan wrote: >>> On Mon, Jun 13, 2011 at 5:11 PM, Steven D'Aprano >>> wrote: > >>>> Function parameters should be kept for actual arguments, not for >>>> optimizing name look-ups. > > Even the bind-it-now behavior isn't always for optimization; it can > also be used as a way of forcing stability in case the global name > gets rebound. That is often an anti-pattern in practice, but ... not > always. Acknowledged. But whatever the purpose, my comment still stands: function arguments should be used for arguments, not for their side-effect of injecting a local variable into the function namespace. >> The problem with injecting locals in the parameter list is that it can only >> happen at write-time. That's useful, but there's a major opportunity being >> missed: to be able to inject at runtime. You could add test mocks, optimized >> functions, logging, turn global variables into local constants, and probably >> things I've never thought of. > > Using the function object as a namespace (largely) gets around that, > because you can use a with statement to change the settings > temporarily. You mean something like this? with make_logging_len() as len: x = some_function_that_calls_len() That's fine for some purposes, but you're still modifying global state. If some_function_that_calls_len() calls spam(), and spam() also contains a call to len, you've unexpectedly changed the behaviour of spam. If that's the behaviour that you want, fine, but it probably isn't. There are all sorts of opportunities for breaking things when patching globals, which makes it somewhat of an anti-pattern. Better to make the patched version a local. >> Here's one use-case to give a flavour of what I have in mind: if you're >> writing Unix-like scripts, one piece of useful functionality is "verbose >> mode". Here's one way of doing so: > > [A verbose mode -- full example below, but the new spelling here at the top] > > Just replace: > >> def do_work(args): >> hook("doing spam") >> spam() >> hook("doing ham") >> ham() > > with: > > def do_work(args): > __function__.hook("doing spam") > spam() > __function__.hook("doing ham") > ham() [...] > (The reason this requires a variant of 3130 is that the name do_work > may itself be rebound, so do_work.hook isn't a reliable pointer.) Ah, that's why it doesn't work for me! :) Even if it did work, you're still messing with global state. If two functions are using do_work, and one wants a print hook, and the other wants a logging hook (or whatever), only one can be satisfied. Also this trick can't work for optimizations. A call to do_work.hook requires a global lookup followed by a second lookup in the function object namespace, which is not as fast as using a local. 
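A small, self-contained illustration of the global-state problem being described (all names here are made up for the example): rebinding the module-level hook changes behaviour for every function that reads it, not just the one the caller wanted to instrument.

    def hook(msg):
        pass                  # default: silent

    def do_work():
        hook("doing work")

    def other_task():
        hook("other task")    # innocent bystander

    hook = print              # module-level patch intended for do_work()...
    do_work()                 # prints "doing work"
    other_task()              # ...but this now prints too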
-- Steven From steve at pearwood.info Tue Jun 14 02:21:24 2011 From: steve at pearwood.info (Steven D'Aprano) Date: Tue, 14 Jun 2011 10:21:24 +1000 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: <4DF69D64.7030308@canterbury.ac.nz> References: <20110611133028.GD2395@chopin.edu.pl> <4BDDEB96-1EBD-477C-BA08-7750AB42B579@gmail.com> <20110612222236.GA2540@chopin.edu.pl> <4DF53DF1.9060804@canterbury.ac.nz> <4DF5B83C.9070206@pearwood.info> <4DF61FCE.4050206@pearwood.info> <4DF69D64.7030308@canterbury.ac.nz> Message-ID: <4DF6A984.6090304@pearwood.info> Greg Ewing wrote: > Steven D'Aprano wrote: > >> Because you aren't monkey-patching the hook function (or, heaven help >> us, monkey-patching builtins.print!) you don't need to fear side-effects. > > It's still rather non-obvious what's going on, though. Copious > commenting would be needed to make this style of coding > understandable. I don't think so. The injection happens right at the top of the function. True, you need to know what "inject" does, but that's no different from any other function. Provided you know that "inject" adds a local binding to the function namespace, instead of using a global, it's easy to understand what this does: x = 42 @inject(x=23) def spam(): print(x) Not terribly mysterious. The only tricky thing is that some programmers aren't comfortable with the idea that functions are first class objects, and so: @inject(len=my_len) def spam(arg): return len(arg)+1 will discombobulate them. ("What do you mean, len isn't the built-in len?") But then again, they're likely to be equally put off by global patches too: len=my_len def spam(arg): return len(arg)+1 Doesn't stop us using that technique when appropriate. > Also, it doesn't seem to generalise. What if the function in > question calls other functions, which call other functions, > which themselves need a verbose option? It seems you would > need to explicitly wrap all the sub-function calls to pass > the hook on to them. And what if there is more than one > option to be hooked? You'd rapidly end up with a nightmarish > mess. That's a feature, not a bug! Patches are *local* to the function, not global. If you want to change global state, you can already do it, by monkey-patching the module. We don't need a new magic inject function to do that. This is not meant to be used for making wholesale changes to multiple functions at once, but for localized changes to one function at a time. A scalpel, not a chainsaw. > Here's another way to approach the problem: > > class HookableWorker(object): [...] > Now you can expand the HookableWorker class by adding more methods > that all share the same hook, still without anything being global. Absolutely. And that will still be a viable approach for many things. But... * You can only patch things that are already written as a class. If you want to add a test mock or logging to a function, this strategy doesn't help you because there's nothing to subclass. * There's a performance and (arguably) readability cost to using callable classes instead of functions. * Nor does it clean up the func(arg, len=len) hack. 
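For readers who have not met it, the "func(arg, len=len) hack" mentioned in the last point is the old trick of turning a builtin or global into a fast, definition-time-bound local by smuggling it in as a default argument:

    def spam(arg, len=len):
        # "len" is now a local, bound once when the def runs; the lookup
        # is faster, but the signature is polluted and a caller can
        # accidentally override it -- the very problem under discussion.
        return len(arg) + 1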
-- Steven From ncoghlan at gmail.com Tue Jun 14 02:43:47 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Tue, 14 Jun 2011 10:43:47 +1000 Subject: [Python-ideas] PEP 3150 (statement local namespaces) updated based on April discussion In-Reply-To: References: Message-ID: On Tue, Jun 14, 2011 at 6:37 AM, geremy condra wrote: > Also, just to make sure I'm being clear- Nick, I'm not trying to bash > your code. I just think that right now this lends itself to some > really hard-to-understand constructions. There's a reason PEP 3150 has two sections ("PEP Deferral" and "Key Concern") devoted to explaining why it's current state is Deferred rather than Draft. When *I'm* not convinced it's a good idea, I'm not at all inclined to take offense when people see serious problems with the concept :) With PEP 343, we managed to find a sweet spot that allowed a great deal of flexibility without unduly encouraging obfuscated code. I've yet to find a similar sweet spot with PEP 3150 or related ideas. I feel there's a seed of a useful concept in there somewhere, but mostly it just creates two ways to do too many things. Cheers, Nick. -- Nick Coghlan?? |?? ncoghlan at gmail.com?? |?? Brisbane, Australia From greg.ewing at canterbury.ac.nz Tue Jun 14 02:47:10 2011 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Tue, 14 Jun 2011 12:47:10 +1200 Subject: [Python-ideas] PEP 3150 (statement local namespaces) updated based on April discussion In-Reply-To: References: Message-ID: <4DF6AF8E.3060106@canterbury.ac.nz> Terry Reedy wrote: > Part of my point is that the above is not *code* (and hence not an > argument for formatting code that way). A couple of things about that: 1) Mathematicians often write things informally that, in a program, would need to be spelled out formally in code. They get away with it because they're writing for humans, not computers. 2) Very often they *do* write out the subsequently-defined terms formally. I came across an example just the other day. From "Elementary Numerical Analysis, an Algorithmic Approach" by Samuel D. Conte and Carl de Boor, 3rd edition, page 364: Runge-Kutta method of order 4: y[n+1] = y[n] + (1/6) * (k1 + 2*k2 + 2*k3 + k4) where k1 = h*f(x[n], y[n]) k2 = h*f(x[n + h/2], y[n] + k1 / 2) k3 = h*f(x[n + h/2], y[n] + k2 / 2) k4 = h*f(x[n], y[n] + k3) -- Greg From greg.ewing at canterbury.ac.nz Tue Jun 14 02:52:28 2011 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Tue, 14 Jun 2011 12:52:28 +1200 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: <4DF69C80.9000305@pearwood.info> References: <20110611133028.GD2395@chopin.edu.pl> <4BDDEB96-1EBD-477C-BA08-7750AB42B579@gmail.com> <20110612222236.GA2540@chopin.edu.pl> <4DF53DF1.9060804@canterbury.ac.nz> <4DF5B83C.9070206@pearwood.info> <4DF61FCE.4050206@pearwood.info> <4DF69C80.9000305@pearwood.info> Message-ID: <4DF6B0CC.4030002@canterbury.ac.nz> Steven D'Aprano wrote: > My apologies, I've been doing a lot of reading about the pros and cons > (mostly cons *wink*) of monkey-patching in the Ruby world, the > open/closed principle, and various forms of bugs caused by the use of > globals. I think part of the problem is that for the particular example you chose -- a "verbose" option to a command-line script -- you usually *do* want it to apply to the entire program, so using a global is perfectly adequate in that case. 
-- Greg From ncoghlan at gmail.com Tue Jun 14 03:12:05 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Tue, 14 Jun 2011 11:12:05 +1000 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: <4DF61FCE.4050206@pearwood.info> References: <20110611133028.GD2395@chopin.edu.pl> <4BDDEB96-1EBD-477C-BA08-7750AB42B579@gmail.com> <20110612222236.GA2540@chopin.edu.pl> <4DF53DF1.9060804@canterbury.ac.nz> <4DF5B83C.9070206@pearwood.info> <4DF61FCE.4050206@pearwood.info> Message-ID: On Tue, Jun 14, 2011 at 12:33 AM, Steven D'Aprano wrote: > The problem with injecting locals in the parameter list is that it can only > happen at write-time. That's useful, but there's a major opportunity being > missed: to be able to inject at runtime. You could add test mocks, optimized > functions, logging, turn global variables into local constants, and probably > things I've never thought of. > > Here's one use-case to give a flavour of what I have in mind: if you're > writing Unix-like scripts, one piece of useful functionality is "verbose > mode". Here's one way of doing so: > > > def do_work(args, verbose=False): > ? ?if verbose: > ? ? ? ? pr = print > ? ?else: > ? ? ? ? pr = lambda *args: None > ? ?pr("doing spam") > ? ?spam() > ? ?pr("doing ham") > ? ?ham() > ? ?# and so on > But why does do_work take a verbose flag? That isn't part of the API for the > do_work function itself, which might be usefully called by other bits of > code. The verbose argument is only there to satisfy the needs of the user > interface. Using a ** hidden argument would solve that problem, but you then > have to specify the value of verbose at write-time, defeating the purpose. > > Here's an injection solution. First, the body of the function needs a > generic hook, with a global do-nothing default: > > > def hook(*args): > ? ?pass > > def do_work(args): > ? ?hook("doing spam") > ? ?spam() > ? ?hook("doing ham") > ? ?ham() > ? ?# and so on > > if __name__ == '__main__': > ? ?if '--verbose' in sys.argv: > ? ? ? ?wrap = inject(hook=print) > ? ?else: > ? ? ? ?wrap = lambda func: func ?# do nothing > ? ? ? ?# or `inject(hook=hook)` to micro-optimize > ? ?wrap(do_work)(my_arguments) > > > If you want to add logging, its easy: just add an elif clause with > wrap = inject(hook=logger). > > Because you aren't monkey-patching the hook function (or, heaven help us, > monkey-patching builtins.print!) you don't need to fear side-effects. No > globals are patched, hence no mysterious action-at-a-distance bugs. And > because the injected function is a copy of the original, other parts of the > code that use do_work are unaffected. > > > But for this to work, you have to be able to inject at run-time, not just at > write-time. This is getting deep into major structural changes to the way name lookups work, though. Pre-seeding locals with values that are calculated at run-time is a much simpler concept. A more explicit way to do the same thing might work along the following lines: 1. Add a writeable f_initlocals dict attribute to function objects (None by default) 2. When a function is called, if f_initlocals is not None, use it to initialise the locals() namespace 3. Add a new "local" statement to tell the compiler to treat names as local. Using this statement will create an f_initlocals dict mapping those names to None. 4. 
Add a new decorator to functools that works like the following: def initlocals(**kwargs): def inner(f): new_names = kwargs.keys() - f.f_initlocals.keys() if new_names: raise ValueError("{} are not local variables of {!r}".format(new_names, f)) f.f_initlocals.update(kwargs) return f return inner @functools.initlocals(mem=collections.Counter()) def do_and_remember(val, verbose=False): local mem result = do_something(val) mem[val] += 1 if verbose: print('Done {} times for {!r}'.format(mem[val], val)) You could still inject changes at runtime with that concept, but would need to be careful with thread-safety issues if you only wanted the change to apply to some invocations and not others. Cheers, Nick. -- Nick Coghlan?? |?? ncoghlan at gmail.com?? |?? Brisbane, Australia From zuo at chopin.edu.pl Tue Jun 14 03:12:14 2011 From: zuo at chopin.edu.pl (Jan Kaliszewski) Date: Tue, 14 Jun 2011 03:12:14 +0200 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: <20110613223027.GB3483@chopin.edu.pl> References: <20110611133028.GD2395@chopin.edu.pl> <4BDDEB96-1EBD-477C-BA08-7750AB42B579@gmail.com> <20110612222236.GA2540@chopin.edu.pl> <4DF53DF1.9060804@canterbury.ac.nz> <4DF5B83C.9070206@pearwood.info> <20110613223027.GB3483@chopin.edu.pl> Message-ID: <20110614011214.GA5291@chopin.edu.pl> Jan Kaliszewski dixit (2011-06-14, 00:30): > @in(mem=collections.Counter()) > def do_and_remember(val, verbose=False): > result = do_something(val) > mem[val] += 1 > if verbose: > print('Done {} times for {!r}'.format(_mem[val], val)) > > @in (or @with, or @within, or @withlocal, or...) could be a language > syntax construct On second thought: no. I mean: no -- for a separate syntax construct with limited usage possibilities (see: cases mentioned by Steven); yes -- for language improvements that would make possible one of the solutions: 1. A real decorator: a) quasi-argument-locals-based (names could be used to read injected value and later could be rebound, like arguments); or b) another-level-closure-based (names could not be used to read injected values if rebound later: it's *either* a free variable *or* a local variable). or 2. `after-** hidden pseudo-arguments' (see previous posts...). Now I don't know which of them I'd prefer... And probably any of them would need some core-language modifications... (at least the '2' and '1a' variants) Regards. *j From bruce at leapyear.org Tue Jun 14 03:45:05 2011 From: bruce at leapyear.org (Bruce Leban) Date: Mon, 13 Jun 2011 18:45:05 -0700 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: <20110611133028.GD2395@chopin.edu.pl> References: <20110611133028.GD2395@chopin.edu.pl> Message-ID: It seems to me this discussion is mixing some different issues. (1) having secret parameters that don't show in help(). (2) injecting a value in some way I think these should be thought about separately. *For secret parameters:* help() could have a convention that it doesn't display variables with that start with _ or something like that, but that might have compatibility issues. Alternatively, there could be some new syntax or a decorator like >>> *@help.hide*("x,y") def foo(a, b, x=[], y={}): pass >>> help(foo) Help on function foo in module __main__: f(a, b) *For injecting values:* There are several different cases. I'll use the arbitrary keyword *special* in the snippets below (but note that the keyword means something slightly different in each case): def foo1(): x = *special *[] ... 
x.append(t) Sets x every time the function is called to the same static list. What's special here is that I have one list created that gets reused, not a new list every time. This is how default arguments work and is useful for an accumulating list. def foo2(): *special *y = 0 ... y += t Initializes y once to 0 (at some time before the next line of code is reached). This is how static works in C++. This is what I want if my accumulating variable is a counter since numbers are immutable. This case easily handles the first case. If you never rebind x, you don't need to do anything special, otherwise something like this: def foo1a(): *special *_x = [] x = _x ... # might rebind x x.append(t) It's a bit clumsy to use the first case to handle the second case: def foo2a(): y = *special *[0] ... y[0] += t In addition, there are other use cases being discussed. This creates a new scope for i every time through the loop: def foo3(): result = [] for *special *i in range(10): def z(): return i result.append(z) And this injects a mock to replace a library function: def foo4(): return random.random() w = *special*(random.random=lambda: 0.1) foo4() Just because we might use similar hacks to do these now, doesn't mean that they are necessarily the same and I think the discussion has been going in several different directions simultaneously. I think all these cases have merits but I don't know which are more important. The last case seems to be handled reasonably well by various mock libraries using with, so I'm not particularly worried about it. I would like support for case 1 or 2. I don't like the idea of using a different function argument hack instead of the current one. --- Bruce Follow me: http://www.twitter.com/Vroo http://www.vroospeak.com -------------- next part -------------- An HTML attachment was scrubbed... URL: From stephen at xemacs.org Tue Jun 14 07:02:13 2011 From: stephen at xemacs.org (Stephen J. Turnbull) Date: Tue, 14 Jun 2011 14:02:13 +0900 Subject: [Python-ideas] PEP 3150 (statement local namespaces) updated based on April discussion In-Reply-To: <4DF6AF8E.3060106@canterbury.ac.nz> References: <4DF6AF8E.3060106@canterbury.ac.nz> Message-ID: <87pqmhknu2.fsf@uwakimon.sk.tsukuba.ac.jp> Greg Ewing writes: > 2) Very often [mathematicians] *do* write out the > subsequently-defined terms formally. It strikes me that this is just the tension between the declarative style of coding, and the imperative style of doing the same thing. *The same code ends up being written* (modulo a bit of syntactic sugar like "where:" or "given:"), just the statement order is permuted. I have nothing against the declarative style. But at least for these simple examples including this syntax in Python violates TOOWTDI, and my experience in Lisp is that indeed the extra flexibility hinders readability because whichever style one personally favors, lots of folks use the other one. Python already allows you to (verbosely) use this style, anyway. def behavior_of_experimental_subject (subject): def behavior(now, later): return now + later now = subject.randomness() later = subject.perversity() return behavior(now, later) My feeling here is that if the declarative order is not sufficiently important to justify the local function and the extra return statement, using the imperative order won't be that costly in readability terms. From cmjohnson.mailinglist at gmail.com Tue Jun 14 08:06:49 2011 From: cmjohnson.mailinglist at gmail.com (Carl M. 
Johnson) Date: Mon, 13 Jun 2011 20:06:49 -1000 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: References: <20110611133028.GD2395@chopin.edu.pl> Message-ID: Feels like we're just repeating October 2008: http://mail.python.org/pipermail/python-ideas/2008-October/thread.html Are there any considerations we missed that time around? From zuo at chopin.edu.pl Tue Jun 14 11:49:58 2011 From: zuo at chopin.edu.pl (Jan Kaliszewski) Date: Tue, 14 Jun 2011 11:49:58 +0200 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: References: <20110611133028.GD2395@chopin.edu.pl> Message-ID: <20110614094958.GA3754@chopin.edu.pl> Carl M. Johnson dixit (2011-06-13, 20:06): > Feels like we're just repeating October 2008: > > http://mail.python.org/pipermail/python-ideas/2008-October/thread.html Not exactly. That discussion was only about closure-based solutions and cases. Please note, that e.g. after-**-idea is a bit different. Cheers. *j From zuo at chopin.edu.pl Tue Jun 14 12:17:55 2011 From: zuo at chopin.edu.pl (Jan Kaliszewski) Date: Tue, 14 Jun 2011 12:17:55 +0200 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: References: <20110611133028.GD2395@chopin.edu.pl> Message-ID: <20110614101755.GB3754@chopin.edu.pl> Bruce Leban dixit (2011-06-13, 18:45): > It seems to me this discussion is mixing some different issues. > > (1) having secret parameters that don't show in help(). [...] > I think these should be thought about separately. > > *For secret parameters:* help() could have a convention that it doesn't > display variables with that start with _ or something like that, but that No, the idea is that after-**-constans are not only hidden-in-help- -arguments but that they also cannot be specified/overriden in a function call. So their usage would not be a hack that causes risk of incidental override by caller or that makes function signatures obfuscated (they would have to be defined separately, after **|**kwargs, in the righmost signature part). def compute(num1, num2, **, MAX_CACHE_LEN=100, cache=dict()): try: return cache[(num1, num2)] except KeyError: if len(cache) >= MAX_CACHE_LEN: cache.popitem() cache[(num1, num2)] = result = _compute(num1, num2) return result help(compute) # -> "... compute(num1, num2)" compute(1, 2) # OK compute(1, 2, MAX_CACHE_LEN=3) # would raise TypeError compute(1, 2, cache={}) # would raise TypeError ---- Open question: It's obvious that such a repetition must be prohibited (SyntaxError, at compile time): def sth(my_var, **, my_var): "do something" But in case of: def sth(*args, **kwargs, my_var='foo'): "do something" -- should 'my_var' in kwargs be allowed? (it's a runtime question) There is no real conflict here, so at the first sight I'd say: yes. Regards. 
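For contrast, the nearest spelling in today's Python uses keyword-only arguments after a bare '*'. Unlike the proposed after-'**' constants, they do appear in help() and can be passed (or mistyped) by the caller; the multiplication below is just a stand-in for the _compute() helper assumed in the example above:

    def compute(num1, num2, *, MAX_CACHE_LEN=100, cache={}):
        # the mutable default is deliberate: one cache shared by all calls
        try:
            return cache[(num1, num2)]
        except KeyError:
            if len(cache) >= MAX_CACHE_LEN:
                cache.popitem()
            cache[(num1, num2)] = result = num1 * num2
            return result

    compute(1, 2)                    # OK
    compute(1, 2, MAX_CACHE_LEN=3)   # also accepted today; the proposal would raise TypeError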
*j From ncoghlan at gmail.com Tue Jun 14 12:31:41 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Tue, 14 Jun 2011 20:31:41 +1000 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: <4DF61FCE.4050206@pearwood.info> References: <20110611133028.GD2395@chopin.edu.pl> <4BDDEB96-1EBD-477C-BA08-7750AB42B579@gmail.com> <20110612222236.GA2540@chopin.edu.pl> <4DF53DF1.9060804@canterbury.ac.nz> <4DF5B83C.9070206@pearwood.info> <4DF61FCE.4050206@pearwood.info> Message-ID: On Tue, Jun 14, 2011 at 12:33 AM, Steven D'Aprano wrote: > Nick Coghlan wrote: >> >> On Mon, Jun 13, 2011 at 5:11 PM, Steven D'Aprano >> wrote: >>> >>> Function parameters should be kept for actual arguments, not for >>> optimizing >>> name look-ups. >> >> Still, the post-** shared state (Jan's option 2) is likely the most >> obvious way to get early binding for *any* purpose without polluting >> the externally visible parameter list. > > I wouldn't call adding even more complexity to function signatures > "obvious", although I grant that it depends on whether you're Dutch :) > > Another disadvantage is that it uses a symbol instead of a word. Too many > symbols, and your code looks like Perl (or APL). It's hard to google for ** > to find out what it means. It's harder to talk about a symbol than a word. > (In written text you can just write ** but in speech you have to use > circumlocutions or made-up names like double-splat.) As with *, ** and @, you don't search for them directly, you search for "def" (although redirects from the multiplication and power docs to the def statement docs may not be the worst idea ever). If we hadn't already added keyword-only arguments in Python 3, I'd consider this significantly more obscure. Having the function signature progress from "positional-or-keyword arguments" to "keyword-only arguments" to "implicit arguments", on the other hand, seems a lot cleaner than the status quo without being significantly more complicated. After all, there's no new symbols involved - merely a modification to allow a bare "**" to delimit the start of the implicit arguments when arbitrary keyword arguments are not accepted. Who knows, maybe explicitly teaching that behaviour would make people less likely to fall into the default argument trap. Cheers, Nick. -- Nick Coghlan?? |?? ncoghlan at gmail.com?? |?? Brisbane, Australia From scialexlight at gmail.com Tue Jun 14 19:44:26 2011 From: scialexlight at gmail.com (Alex Light) Date: Tue, 14 Jun 2011 13:44:26 -0400 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: <20110614011214.GA5291@chopin.edu.pl> References: <20110611133028.GD2395@chopin.edu.pl> <4BDDEB96-1EBD-477C-BA08-7750AB42B579@gmail.com> <20110612222236.GA2540@chopin.edu.pl> <4DF53DF1.9060804@canterbury.ac.nz> <4DF5B83C.9070206@pearwood.info> <20110613223027.GB3483@chopin.edu.pl> <20110614011214.GA5291@chopin.edu.pl> Message-ID: On Mon, Jun 13, 2011 at 9:12 PM, Jan Kaliszewski wrote: > > On second thought: no. I mean: no -- for a separate syntax construct > with limited usage possibilities (see: cases mentioned by Steven); > yes -- for language improvements that would make possible one of the > solutions: > > 1. A real decorator: > a) quasi-argument-locals-based > (names could be used to read injected value and later could be > rebound, like arguments); > Forgive me if im wrong but i believe that this is possible without any language changes using pure python. 
this is my attempt at it: >>>from inspect import getfullargspec as argspec >>> >>>def makeLocal(**localArgs): >>> def decorator(func): >>> if (_anyConflicts(func, localArgs)): >>> raise Exception("abiguity in names") >>> >>> def inner(*args, **kwargs): >>> if (any(Larg in kwargs.keys() for Larg in localArgs.keys())): >>> raise NameError("no resetting locals") >>> >>> ## used to restore __globals__ i think this is alright since >>> ## iirc __globals__ only holds references anyway >>> frmglobals = func.__globals__.copy() >>> func.__globals__.update(localArgs) >>> ret= func(*args, **kwargs) >>> func.__globals__.clear() >>> func.__globals__.update(frmglobals) >>> return ret >>> >>> inner.__doc__=func.__doc__ >>> inner.__name__=func.__name__ >>> return inner >>> return decorator >>> >>>def _anyConflicts(func, localArgs): >>> fa = argspec(func) >>> for Larg in localArgs: >>> if (Larg in fa.args or >>> Larg in fa.kwonlyargs or >>> Larg in fa): >>> return True >>> return False this uses a closure to hold the values of the injected values and hides them all test pass exactly as if the values were defined with makeLocals were globals within the function but all act as if they are locals outside of it. this means that if we define this this >>>@makeLocal(aList=list()) >>>def add_and_print(arg): >>> aList.append(arg) >>> print(aList) and run >>>add_and_print(1) #it prints [1] >>>add_and_print(33) #prints [1,33] if we try this >>>print(aList) we get a NameError and if we try this >>>aList=['this','is','a','different','list'] >>>add_and_print(66)#prints [1,33,66] >>>print(aList) ['this','is','a','different','list'] all problems with global values still apply with this however. for example just as >>>anInt=33 >>>def increment(): >>> anInt+=1 >>>foo(22) throws a UnboundLocalError so does >>>@makeLocal(anInt=33) >>>def increment(): >>> anInt+=1 so what do you think? --Alex -------------- next part -------------- An HTML attachment was scrubbed... URL: From zuo at chopin.edu.pl Tue Jun 14 23:27:39 2011 From: zuo at chopin.edu.pl (Jan Kaliszewski) Date: Tue, 14 Jun 2011 23:27:39 +0200 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: References: <4BDDEB96-1EBD-477C-BA08-7750AB42B579@gmail.com> <20110612222236.GA2540@chopin.edu.pl> <4DF53DF1.9060804@canterbury.ac.nz> <4DF5B83C.9070206@pearwood.info> <20110613223027.GB3483@chopin.edu.pl> <20110614011214.GA5291@chopin.edu.pl> Message-ID: <20110614212739.GB2549@chopin.edu.pl> Alex Light dixit (2011-06-14, 13:44): > >>> frmglobals = func.__globals__.copy() > >>> func.__globals__.update(localArgs) > >>> ret= func(*args, **kwargs) > >>> func.__globals__.clear() > >>> func.__globals__.update(frmglobals) Changing global state on each call seems to be both concurrency-and-recurrency-unsafe and inefficient. Though that 1a (closure-based) variant should be possible using techniques like that: http://mail.python.org/pipermail/python-ideas/2008-October/002227.html Retards. 
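One way to avoid mutating shared state on every call (a sketch of an alternative, not part of any proposal here): rebuild the function once, at decoration time, around a private copy of its globals that also contains the injected names. The trade-off is that later rebindings of the module's own globals are no longer visible to the rebuilt function.

    import collections
    import types

    def inject(**consts):
        def decorator(func):
            namespace = dict(func.__globals__)   # private copy, made once
            namespace.update(consts)
            return types.FunctionType(func.__code__, namespace,
                                      func.__name__, func.__defaults__,
                                      func.__closure__)
        return decorator

    @inject(mem=collections.Counter())
    def do_and_remember(val, verbose=False):
        mem[val] += 1        # ordinary global lookup, resolved in the copy
        if verbose:
            print('Done {} times for {!r}'.format(mem[val], val))

    do_and_remember('x')
    do_and_remember('x', verbose=True)   # prints: Done 2 times for 'x'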
*j From zuo at chopin.edu.pl Wed Jun 15 02:28:25 2011 From: zuo at chopin.edu.pl (Jan Kaliszewski) Date: Wed, 15 Jun 2011 02:28:25 +0200 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: <20110614101755.GB3754@chopin.edu.pl> References: <20110611133028.GD2395@chopin.edu.pl> <20110614101755.GB3754@chopin.edu.pl> Message-ID: <20110615002825.GA2217@chopin.edu.pl> Jan Kaliszewski dixit (2011-06-14, 12:17): > It's obvious that such a repetition must be prohibited (SyntaxError, at > compile time): > > def sth(my_var, **, my_var): "do something" Of course, I ment: def sth(my_var, **, my_var='foo'): "do something" > But in case of: > > def sth(*args, **kwargs, my_var='foo'): "do something" > > -- should 'my_var' in kwargs be allowed? (it's a runtime question) > There is no real conflict here, so at the first sight I'd say: yes. On second thought: no, such repetitions also should *not* be allowed. If a programmer, by mistake, would try to specify the argument value in a call, an explicit TypeError should be raised -- otherwise it'd become a trap (especially for beginners and absent-minded programmers). Regards. *j From jh at improva.dk Wed Jun 15 08:21:47 2011 From: jh at improva.dk (Jacob Holm) Date: Wed, 15 Jun 2011 08:21:47 +0200 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: <20110615002825.GA2217@chopin.edu.pl> References: <20110611133028.GD2395@chopin.edu.pl> <20110614101755.GB3754@chopin.edu.pl> <20110615002825.GA2217@chopin.edu.pl> Message-ID: <4DF84F7B.3010700@improva.dk> On 2011-06-15 02:28, Jan Kaliszewski wrote: > Jan Kaliszewski dixit (2011-06-14, 12:17): >> But in case of: >> >> def sth(*args, **kwargs, my_var='foo'): "do something" >> >> -- should 'my_var' in kwargs be allowed? (it's a runtime question) >> There is no real conflict here, so at the first sight I'd say: yes. > > On second thought: no, such repetitions also should *not* be allowed. > If a programmer, by mistake, would try to specify the argument value in > a call, an explicit TypeError should be raised -- otherwise it'd become > a trap (especially for beginners and absent-minded programmers). I disagree. One of the main selling points of this feature for me is that adding a few "hidden parameters" to a function does not change the signature of the function. If you raise a TypeError when the name of a hidden parameter is in kwargs this is a change in signature. - Jacob From steve at pearwood.info Wed Jun 15 13:35:31 2011 From: steve at pearwood.info (Steven D'Aprano) Date: Wed, 15 Jun 2011 21:35:31 +1000 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: <4DF84F7B.3010700@improva.dk> References: <20110611133028.GD2395@chopin.edu.pl> <20110614101755.GB3754@chopin.edu.pl> <20110615002825.GA2217@chopin.edu.pl> <4DF84F7B.3010700@improva.dk> Message-ID: <4DF89903.6030202@pearwood.info> Jacob Holm wrote: > On 2011-06-15 02:28, Jan Kaliszewski wrote: >>> -- should 'my_var' in kwargs be allowed? (it's a runtime question) >>> There is no real conflict here, so at the first sight I'd say: yes. >> On second thought: no, such repetitions also should *not* be allowed. >> If a programmer, by mistake, would try to specify the argument value in >> a call, an explicit TypeError should be raised -- otherwise it'd become >> a trap (especially for beginners and absent-minded programmers). > > I disagree. 
One of the main selling points of this feature for me is > that adding a few "hidden parameters" to a function does not change the > signature of the function. If you raise a TypeError when the name of a > hidden parameter is in kwargs this is a change in signature. This is another reason why function parameters should not be used for something that is not a function parameter! +1 on the ability to inject locals into a function namespace. -1 on having the syntax for that masquerade as function arguments. -- Steven From scialexlight at gmail.com Wed Jun 15 17:48:55 2011 From: scialexlight at gmail.com (Alex Light) Date: Wed, 15 Jun 2011 11:48:55 -0400 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: <20110614212739.GB2549@chopin.edu.pl> References: <4BDDEB96-1EBD-477C-BA08-7750AB42B579@gmail.com> <20110612222236.GA2540@chopin.edu.pl> <4DF53DF1.9060804@canterbury.ac.nz> <4DF5B83C.9070206@pearwood.info> <20110613223027.GB3483@chopin.edu.pl> <20110614011214.GA5291@chopin.edu.pl> <20110614212739.GB2549@chopin.edu.pl> Message-ID: On Tue, Jun 14, 2011 at 5:27 PM, Jan Kaliszewski wrote: > Alex Light dixit (2011-06-14, 13:44): > > > >>> frmglobals = func.__globals__.copy() > > >>> func.__globals__.update(localArgs) > > >>> ret= func(*args, **kwargs) > > >>> func.__globals__.clear() > > >>> func.__globals__.update(frmglobals) > > Changing global state on each call seems to be both > concurrency-and-recurrency-unsafe and inefficient. > well some of your safety concerns can be allayed, i hope, by replacing this snipet: >>> frmglobals = func.__globals__.copy() >>> func.__globals__.update(localArgs) >>> ret= func(*args, **kwargs) >>> func.__globals__.clear() >>> func.__globals__.update(frmglobals) with this one: >>> with _modifyGlobals(func.__globals__, localArgs): >>> ret = func(*args, **kwargs) with _modifyGlobals defined as: >>>from contextlib import contextmanager >>> >>>@contextmanager >>>def _modifyGlobals(glbls, additions): >>> frmglbls = glbls.copy() >>> try: >>> glbls.update(additions) >>> yield >>> finally: >>> glbls.clear() >>> glbls.update(frmglbls) as for performance you are correct that it is more efficient to just use global variables. the dictionary updates add about 1 x 10**-4 (or, if the check for collisions with KWargs is removed, 5 x 10**-5) seconds to the run time of a function, at least on this computer. so not terribly significant just be sure to use sparingly also with the link you mentioned i could not seem to get it to work. Whenever i tried to use any built-in functions it would start throwing NameErrors. also that is only useful if you want to inject all global variables into the function. --Alex -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From tjreedy at udel.edu Wed Jun 15 21:04:03 2011 From: tjreedy at udel.edu (Terry Reedy) Date: Wed, 15 Jun 2011 15:04:03 -0400 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: References: <4BDDEB96-1EBD-477C-BA08-7750AB42B579@gmail.com> <20110612222236.GA2540@chopin.edu.pl> <4DF53DF1.9060804@canterbury.ac.nz> <4DF5B83C.9070206@pearwood.info> <20110613223027.GB3483@chopin.edu.pl> <20110614011214.GA5291@chopin.edu.pl> <20110614212739.GB2549@chopin.edu.pl> Message-ID: On 6/15/2011 11:48 AM, Alex Light wrote: > Alex Light dixit (2011-06-14, 13:44): > > > >>> frmglobals = func.__globals__.copy() > > >>> func.__globals__.update(localArgs) > > >>> ret= func(*args, **kwargs) > > >>> func.__globals__.clear() > > >>> func.__globals__.update(frmglobals) > >>> frmglobals = func.__globals__.copy() > >>> func.__globals__.update(localArgs) > >>> ret= func(*args, **kwargs) > >>> func.__globals__.clear() > >>> func.__globals__.update(frmglobals) > > with this one: > > >>> with _modifyGlobals(func.__globals__, localArgs): > >>> ret = func(*args, **kwargs) > > with _modifyGlobals defined as: > > >>>from contextlib import contextmanager > >>> > >>>@contextmanager > >>>def _modifyGlobals(glbls, additions): > >>> frmglbls = glbls.copy() > >>> try: > >>> glbls.update(additions) > >>> yield > >>> finally: > >>> glbls.clear() > >>> glbls.update(frmglbls) Posting code with all the '>>>' prompts added makes it terribly hard to cut and paste to try it out. -- Terry Jan Reedy From scialexlight at gmail.com Wed Jun 15 21:23:41 2011 From: scialexlight at gmail.com (Alex Light) Date: Wed, 15 Jun 2011 15:23:41 -0400 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: References: <4BDDEB96-1EBD-477C-BA08-7750AB42B579@gmail.com> <20110612222236.GA2540@chopin.edu.pl> <4DF53DF1.9060804@canterbury.ac.nz> <4DF5B83C.9070206@pearwood.info> <20110613223027.GB3483@chopin.edu.pl> <20110614011214.GA5291@chopin.edu.pl> <20110614212739.GB2549@chopin.edu.pl> Message-ID: Here is the source if anyone wants it. --Alex On Wed, Jun 15, 2011 at 3:04 PM, Terry Reedy wrote: > On 6/15/2011 11:48 AM, Alex Light wrote: > > Alex Light dixit (2011-06-14, 13:44): >> >> > >>> frmglobals = func.__globals__.copy() >> > >>> func.__globals__.update(localArgs) >> > >>> ret= func(*args, **kwargs) >> > >>> func.__globals__.clear() >> > >>> func.__globals__.update(frmglobals) >> > > >>> frmglobals = func.__globals__.copy() >> >>> func.__globals__.update(localArgs) >> >>> ret= func(*args, **kwargs) >> >>> func.__globals__.clear() >> >>> func.__globals__.update(frmglobals) >> >> with this one: >> >> >>> with _modifyGlobals(func.__globals__, localArgs): >> >>> ret = func(*args, **kwargs) >> >> with _modifyGlobals defined as: >> >> >>>from contextlib import contextmanager >> >>> >> >>>@contextmanager >> >>>def _modifyGlobals(glbls, additions): >> >>> frmglbls = glbls.copy() >> >>> try: >> >>> glbls.update(additions) >> >>> yield >> >>> finally: >> >>> glbls.clear() >> >>> glbls.update(frmglbls) >> > > Posting code with all the '>>>' prompts added makes it terribly hard to cut > and paste to try it out. > > -- > Terry Jan Reedy > > > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... 
Name: make_local.py Type: application/octet-stream Size: 1776 bytes Desc: not available URL: From zuo at chopin.edu.pl Thu Jun 16 01:15:36 2011 From: zuo at chopin.edu.pl (Jan Kaliszewski) Date: Thu, 16 Jun 2011 01:15:36 +0200 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: <4DF89903.6030202@pearwood.info> References: <20110611133028.GD2395@chopin.edu.pl> <20110614101755.GB3754@chopin.edu.pl> <20110615002825.GA2217@chopin.edu.pl> <4DF84F7B.3010700@improva.dk> <4DF89903.6030202@pearwood.info> Message-ID: <20110615231536.GA2182@chopin.edu.pl> Steven D'Aprano dixit (2011-06-15, 21:35): > This is another reason why function parameters should not be used for > something that is not a function parameter! > > +1 on the ability to inject locals into a function namespace. > > -1 on having the syntax for that masquerade as function arguments. OK, so the decorator or decorator-like syntax (using 'inject', 'within', 'owns' or other decorator name...) seems to be the most promising alternative. If so, next question is: which variant? 1. Decorator function with closure-like injecting (possibly could be implemented using closures): @functools.owns(cache=dict(), MAX_CACHE_LEN=100) def calculate(a, b): result = cache[(a, b)] if result is not None: return result ... # 'cache' identifier cannot be rebound to another object # because it was already used above in the function body # to refer to the injected object functools.owns() would be a real decorator function -- to apply either with @-syntax or dynamically, e.g.: decorated = [functools.owns(func) for func in functions] One question is whether it is technically possible to avoid introducing a new keyword (e.g. staticlocal) explicitly marking injected locals. Using such a keyword would be redundant from user point of view and non-DRY: @functools.owns(cache=dict(), MAX_CACHE_LEN=100) def calculate(a, b): staticlocal cache, MAX_CACHE_LEN # <- redundant and non-DRY :-( result = cache[(a, b)] if result is not None: return result ... 2. Decorator function with argument-like injecting. @functools.owns(cache=dict(), MAX_CACHE_LEN=100) def calculate(a, b): result = cache[(a, b)] if result is not None: return result ... # 'cache' identifier *can* be rebound to another object # than the injected object -- in the same way arguments can functools.owns() would be a real decorator function -- to apply either with @-syntax or dynamically, e.g.: decorated = [functools.owns(func) for func in functions] To implement such variant -- a new function constructor argument(s) and/or function/function code attribute(s) (read-only or writable?) most probably would have to be introduced... 3. Decorator-like language syntax construct: @in(cache=dict(), MAX_CACHE_LEN=100) # or 'owns' or 'inject' or... def calculate(a, b): result = cache[(a, b)] if result is not None: return result ... # 'cache' identifier *can* be rebound to another object # than the injected object -- in the same way arguments can It would not be a real decorator function -- so it would be applicable only using this syntax, and not dynamically, not after function creation. Which do you prefer? (or any other?) Regards. 
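As a point of reference for variant 1, the closure-like behaviour can already be spelled with an explicit factory function today; the cost is an extra level of nesting and a call to build the function. _compute below is a stand-in for the real computation from the earlier example:

    def _compute(a, b):
        return a + b                    # stand-in

    def make_calculate(MAX_CACHE_LEN=100):
        cache = {}
        def calculate(a, b):
            try:
                return cache[(a, b)]
            except KeyError:
                if len(cache) >= MAX_CACHE_LEN:
                    cache.popitem()
                cache[(a, b)] = result = _compute(a, b)
                return result
        return calculate

    calculate = make_calculate()
    calculate(1, 2)    # computed and cached
    calculate(1, 2)    # served from the cache

Note that inside calculate, 'cache' and 'MAX_CACHE_LEN' behave exactly as in variant 1: they can be read freely, but rebinding them would require 'nonlocal'.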
*j From steve at pearwood.info Thu Jun 16 02:46:51 2011 From: steve at pearwood.info (Steven D'Aprano) Date: Thu, 16 Jun 2011 10:46:51 +1000 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: <20110615231536.GA2182@chopin.edu.pl> References: <20110611133028.GD2395@chopin.edu.pl> <20110614101755.GB3754@chopin.edu.pl> <20110615002825.GA2217@chopin.edu.pl> <4DF84F7B.3010700@improva.dk> <4DF89903.6030202@pearwood.info> <20110615231536.GA2182@chopin.edu.pl> Message-ID: <4DF9527B.9060307@pearwood.info> Jan Kaliszewski wrote: > OK, so the decorator or decorator-like syntax (using 'inject', 'within', > 'owns' or other decorator name...) seems to be the most promising > alternative. If so, next question is: which variant? "owns"? I don't see how that testing for ownership describes what the function does. Likewise for "within", which sounds like it should be a synonym for the "in" operator: "if value within range" sort of thing. > 1. Decorator function with closure-like injecting > (possibly could be implemented using closures): > > @functools.owns(cache=dict(), MAX_CACHE_LEN=100) > def calculate(a, b): > result = cache[(a, b)] > if result is not None: > return result > ... > # 'cache' identifier cannot be rebound to another object > # because it was already used above in the function body > # to refer to the injected object Making locals unrebindable is a change of semantics that is far beyond anything I've been discussed here. This will be a big enough change without overloading it with changes that will be even more controversial! (I actually do like the idea of having unrebindable names, but that should be kept as a separate issue and not grafted on to this proposal.) > functools.owns() would be a real decorator function -- to apply either with > @-syntax or dynamically, e.g.: > > decorated = [functools.owns(func) for func in functions] There shouldn't even be a question about that. Decorator syntax is sugar for func = decorator(func). Introducing magic syntax that is recognised by the compiler but otherwise is not usable as a function is completely unacceptable. If func is a pre-existing function: def func(a, b, c): pass then: new_func = functools.inject(x=1, y=2)(func) should be the same as: def new_func(a, b, c): # inject locals into the body of the function x = 1 y = 2 # followed by the body of the original pass except that new_func.__name__ may still reflect the old name "func". * If the original function previously referenced global or nonlocal x and y, the new function must now treat them as local; * Bindings to x and y should occur once, at function definition time, similar to the way default arguments occur once; * The original function (before the decorator applies) must be untouched rather than modified in place. This implies to me that inject must copy the original function and make modifications to the code object. This sounds to me that a proof-of-concept implementation would be doable using a byte-code hack. -- Steven From dstanek at dstanek.com Thu Jun 16 03:04:15 2011 From: dstanek at dstanek.com (David Stanek) Date: Wed, 15 Jun 2011 21:04:15 -0400 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: <20110611133028.GD2395@chopin.edu.pl> References: <20110611133028.GD2395@chopin.edu.pl> Message-ID: On Sat, Jun 11, 2011 at 9:30 AM, Jan Kaliszewski wrote: > > == Proposed solutions == > > I see three possibilities: > > 1. > To add a new keyword, e.g. 
`inject': > def do_and_remember(val, verbose=False): > inject mem = collections.Counter() > ... > or maybe: > def do_and_remember(val, verbose=False): > inject collections.Counter() as mem > ... > > 2. (which personally I would prefer) > To add `dummy' (or `hidden') keyword arguments, defined after **kwargs > (and after bare ** if kwargs are not needed; we have already have > keyword-only arguments after *args or bare *): > > def do_and_remember(val, verbose=False, **, mem=collections.Counter()): > ... > > do_and_remember(val, False, mem='something') would raise TypeError and > `mem' shoudn not appear in help() etc. as a function argument. > > 3. > To provide a special decorator, e.g. functools.within: > @functools.within(mem=collections.Counter()) > def do_and_remember(val, verbose=False): > ... > For these cases I use a class based solution. It's simple and easy to test. class DoAndRemember: def __init__(self): self._mem = collections.Counter() def __call__(self, val, verbose=False): result = do_something(val) self.mem[val] += 1 if verbose: print('Done {} times for {!r}'.format(mem[val], val)) do_and_remember = DoAndRemember() -- David blog: http://www.traceback.org twitter: http://twitter.com/dstanek www: http://dstanek.com -------------- next part -------------- An HTML attachment was scrubbed... URL: From ncoghlan at gmail.com Thu Jun 16 05:41:17 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Thu, 16 Jun 2011 13:41:17 +1000 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: <20110615231536.GA2182@chopin.edu.pl> References: <20110611133028.GD2395@chopin.edu.pl> <20110614101755.GB3754@chopin.edu.pl> <20110615002825.GA2217@chopin.edu.pl> <4DF84F7B.3010700@improva.dk> <4DF89903.6030202@pearwood.info> <20110615231536.GA2182@chopin.edu.pl> Message-ID: On Thu, Jun 16, 2011 at 9:15 AM, Jan Kaliszewski wrote: > One question is whether it is technically possible to avoid introducing > a new keyword (e.g. staticlocal) explicitly marking injected locals. > Using such a keyword would be redundant from user point of view and > non-DRY: There has to be *something* that tells the compiler to generate different bytecode (either a local lookup, a closure lookup or something new). This lands in the same category as "nonlocal" and "global" (and no, a "magic" decorator that the compiler recognises is not a reasonable alternative). That's why I quite liked the @def idea - it would just define a sequence of simple statements that the compiler will run at function *definition* time that is then used to preseed the local namespace at function *call* time (remember, it is intended to be a mnemonic for "at definition time" and the use of '@' also reflects the fact that this code would run just before function decorators are executed). Just like a class body, the @def code itself would be thrown away and only the resulting preseeded locals information would be retained. To allow rebinding to work correctly, this shared state could be implemented via cell variables rather than ordinary locals. 
Possible implementation sketch: Compile time: - @def statements are compiled in the context of the containing scope and stored on a new ASDL sequence attribute in the Function AST - symtable analysis notes explicitly which names are bound in the @def statements and this information is stored on the code object - code generation produces a cell lookup for any names bound in @def statements (even if they are also assigned as ordinary locals) - Raises a SyntaxError if there is a conflict between parameter names and names bound in @def statements Definition time: - @def statements are executed as a suite in the context of the containing scope but using a *copy* of the locals (so the containing scope is not modified) - names bound in the @def statements (as noted on the code object) are linked up to the appropriate cells on the function object Execution time: - Nothing special. The code is executed and references the cell variables precisely as if they came from a closure. An API could be provided in functools to provide a clean way to view (and perhaps modify) the contents of the cells from outside the function. And yes, I'm aware this blurs the line even further between functions and classes, but the core difference between "a specific algorithm with some persistent state" and "persistent state with optional associated algorithms" remains intact. And, to repeat the example of how it would look in practice: def do_and_remember(val, verbose=False): @def mem=collections.Counter() # Algorithm that calculates result given val mem[val] += 1 if verbose: print('Done {} times for {!r}'.format(_mem[val], val)) return result Cheers, Nick. -- Nick Coghlan?? |?? ncoghlan at gmail.com?? |?? Brisbane, Australia From zuo at chopin.edu.pl Thu Jun 16 18:58:03 2011 From: zuo at chopin.edu.pl (Jan Kaliszewski) Date: Thu, 16 Jun 2011 18:58:03 +0200 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: <4DF9527B.9060307@pearwood.info> References: <20110611133028.GD2395@chopin.edu.pl> <20110614101755.GB3754@chopin.edu.pl> <20110615002825.GA2217@chopin.edu.pl> <4DF84F7B.3010700@improva.dk> <4DF89903.6030202@pearwood.info> <20110615231536.GA2182@chopin.edu.pl> <4DF9527B.9060307@pearwood.info> Message-ID: <20110616165803.GA2222@chopin.edu.pl> Steven D'Aprano dixit (2011-06-16, 10:46): > Making locals unrebindable is a change of semantics that is far > beyond anything I've been discussed here. This will be a big enough > change without overloading it with changes that will be even more > controversial! That variant (#1) would be simply shortcut for a closure application -- nothing really new. def factory(n): """Today's Python example.""" closuring = 'foo' def func(m): s = min(n, m) * closuring # here you also cannot rebind 'closuring' because is has # been referenced above > Introducing magic syntax that is recognised by the compiler but > otherwise is not usable as a function is completely unacceptable. Because of?... And it would not be more 'magic' than any other language syntax construct -- def, class, decorating with their @, *, ** arguments etc. The fact that such a new syntax would be similar to something already known and well settled (decorator function application syntax) would be rather an andantage than a drawback. 
> * If the original function previously referenced global or nonlocal x > and y, the new function must now treat them as local; > > * Bindings to x and y should occur once, at function definition time, > similar to the way default arguments occur once; > > * The original function (before the decorator applies) must be > untouched rather than modified in place. > > This implies to me that inject must copy the original function and > make modifications to the code object. This sounds to me that a > proof-of-concept implementation would be doable using a byte-code > hack. That's what I propose as variant #2. But that would need byte code hacking -- or some core language ('magic') modifications. Cheers. *j From zuo at chopin.edu.pl Thu Jun 16 19:15:25 2011 From: zuo at chopin.edu.pl (Jan Kaliszewski) Date: Thu, 16 Jun 2011 19:15:25 +0200 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: References: <20110611133028.GD2395@chopin.edu.pl> <20110614101755.GB3754@chopin.edu.pl> <20110615002825.GA2217@chopin.edu.pl> <4DF84F7B.3010700@improva.dk> <4DF89903.6030202@pearwood.info> <20110615231536.GA2182@chopin.edu.pl> Message-ID: <20110616171525.GB2222@chopin.edu.pl> Nick Coghlan dixit (2011-06-16, 13:41): > On Thu, Jun 16, 2011 at 9:15 AM, Jan Kaliszewski wrote: > > One question is whether it is technically possible to avoid introducing > > a new keyword (e.g. staticlocal) explicitly marking injected locals. > > Using such a keyword would be redundant from user point of view and > > non-DRY: > > There has to be *something* that tells the compiler to generate > different bytecode (either a local lookup, a closure lookup or > something new). This lands in the same category as "nonlocal" and > "global" (and no, a "magic" decorator that the compiler recognises is > not a reasonable alternative). > > That's why I quite liked the @def idea - it would just define a > sequence of simple statements that the compiler will run at function > *definition* time that is then used to preseed the local namespace at > function *call* time (remember, it is intended to be a mnemonic for > "at definition time" and the use of '@' also reflects the fact that > this code would run just before function decorators are executed). > Just like a class body, the @def code itself would be thrown away and > only the resulting preseeded locals information would be retained. To > allow rebinding to work correctly, this shared state could be > implemented via cell variables rather than ordinary locals. > > Possible implementation sketch: > > Compile time: > - @def statements are compiled in the context of the containing > scope and stored on a new ASDL sequence attribute in the Function AST > - symtable analysis notes explicitly which names are bound in the > @def statements and this information is stored on the code object > - code generation produces a cell lookup for any names bound in > @def statements (even if they are also assigned as ordinary locals) > - Raises a SyntaxError if there is a conflict between parameter > names and names bound in @def statements > > Definition time: > - @def statements are executed as a suite in the context of the > containing scope but using a *copy* of the locals (so the containing > scope is not modified) > - names bound in the @def statements (as noted on the code object) > are linked up to the appropriate cells on the function object > > Execution time: > - Nothing special. 
The code is executed and references the cell > variables precisely as if they came from a closure. > > An API could be provided in functools to provide a clean way to view > (and perhaps modify) the contents of the cells from outside the > function. And yes, I'm aware this blurs the line even further between > functions and classes, but the core difference between "a specific > algorithm with some persistent state" and "persistent state with > optional associated algorithms" remains intact. > > And, to repeat the example of how it would look in practice: > > def do_and_remember(val, verbose=False): > @def mem=collections.Counter() > # Algorithm that calculates result given val > mem[val] += 1 > if verbose: > print('Done {} times for {!r}'.format(_mem[val], val)) > return result It is not less 'magic' than what I proposed as variant #3. And, in fact, it is almost the same -- the only important difference is the place: imho placing it *before* definition better emphasizes that the binding is an *early* one. @inject(mem=collections.Counter(), MAX_MEM=1000) def do_and_remember(val, verbose=False): or even (to stress that it is a language syntax construct: @inject mem=collections.Counter(), MAX_MEM=1000 def do_and_remember(val, verbose=False): or: @inject collections.Counter() as mem, 1000 as MAX_MEM def do_and_remember(val, verbose=False): or something similar... Also, such placement is imho more appropriate for @-that-starts-a-line- -syntax (because it would *resemble* decorating syntax anyway -- and what's wrong with that?) + we avoid misleading clusters such as: def do_something(func): @def mem=colletions.Counter # <- looks a bit like another @wraps(func) # decorator for wrapper_func() @my_decorator(mem) def wrapper_func(): ... Regards. *j From scialexlight at gmail.com Thu Jun 16 19:21:15 2011 From: scialexlight at gmail.com (Alex Light) Date: Thu, 16 Jun 2011 13:21:15 -0400 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: <20110616165803.GA2222@chopin.edu.pl> References: <20110611133028.GD2395@chopin.edu.pl> <20110614101755.GB3754@chopin.edu.pl> <20110615002825.GA2217@chopin.edu.pl> <4DF84F7B.3010700@improva.dk> <4DF89903.6030202@pearwood.info> <20110615231536.GA2182@chopin.edu.pl> <4DF9527B.9060307@pearwood.info> <20110616165803.GA2222@chopin.edu.pl> Message-ID: On Thu, Jun 16, 2011 at 12:58 PM, Jan Kaliszewski wrote: > Steven D'Aprano dixit (2011-06-16, 10:46): > > This implies to me that inject (option 1) must copy the original function > and > > make modifications to the code object. This sounds to me that a > > proof-of-concept implementation would be doable using a byte-code > > hack. > > That's what I propose as variant #2. But that would need byte code > hacking -- or some core language ('magic') modifications. I agree with D'Aprano, both option one and two would require modifying the code object, bytecode hacks, or core changes to the language, because, although the result is the same as your factory example the function is already compiled when it is given to the decorator. my best guess is that to implement this we would need to slightly redefine the way that python looks up variables by making it look in a special 'injected' dictionay after looking through locals and before globals. --Alex -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From zuo at chopin.edu.pl Thu Jun 16 23:45:08 2011 From: zuo at chopin.edu.pl (Jan Kaliszewski) Date: Thu, 16 Jun 2011 23:45:08 +0200 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: References: <20110611133028.GD2395@chopin.edu.pl> <20110614101755.GB3754@chopin.edu.pl> <20110615002825.GA2217@chopin.edu.pl> <4DF84F7B.3010700@improva.dk> <4DF89903.6030202@pearwood.info> <20110615231536.GA2182@chopin.edu.pl> <4DF9527B.9060307@pearwood.info> <20110616165803.GA2222@chopin.edu.pl> Message-ID: <20110616214508.GA4133@chopin.edu.pl> Alex Light dixit (2011-06-16, 13:21): > my best guess is that to implement this we would need to slightly > redefine the way that python looks up variables by making it look in a > special 'injected' dictionay after looking through locals and before > globals. But one of the default-argument-hack reasons is to optimize variable access by avoid dictionary loopup (locals are not dictionary-based). I'd rather stick to the implementation sketch described by Nick (changing the syntax a bit; as I said, imho @keyword... should be placed before function def). Cheers. *j From steve at pearwood.info Fri Jun 17 05:21:03 2011 From: steve at pearwood.info (Steven D'Aprano) Date: Fri, 17 Jun 2011 13:21:03 +1000 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: <20110616165803.GA2222@chopin.edu.pl> References: <20110611133028.GD2395@chopin.edu.pl> <20110614101755.GB3754@chopin.edu.pl> <20110615002825.GA2217@chopin.edu.pl> <4DF84F7B.3010700@improva.dk> <4DF89903.6030202@pearwood.info> <20110615231536.GA2182@chopin.edu.pl> <4DF9527B.9060307@pearwood.info> <20110616165803.GA2222@chopin.edu.pl> Message-ID: <4DFAC81F.2070005@pearwood.info> Jan Kaliszewski wrote: > Steven D'Aprano dixit (2011-06-16, 10:46): > >> Making locals unrebindable is a change of semantics that is far >> beyond anything I've been discussed here. This will be a big enough >> change without overloading it with changes that will be even more >> controversial! > > That variant (#1) would be simply shortcut for a closure application > -- nothing really new. > > def factory(n): > """Today's Python example.""" > closuring = 'foo' > def func(m): > s = min(n, m) * closuring > # here you also cannot rebind 'closuring' because is has > # been referenced above The error occurs BEFORE the rebinding attempt. You get UnboundLocalError when you attempt to execute min(n, m), not when rebinding the closure variable. This is a side-effect of the compiler's rule "if you see an assignment to a variable, make it a local", that is all. You can rebind closuring if you tell the compiler that it isn't a local variable: >>> def factory(n): ... closuring = 'foo' ... def func(m): ... nonlocal closuring ... s = min(n, m)*closuring ... closuring = 'spam' ... return s + closuring ... return func ... >>> f = factory(10) >>> f(3) 'foofoofoospam' In that regard, closure variables are no different from globals. You wouldn't say that global are unrebindable because of this: >>> def f(): ... print(x) ... x = 1 ... >>> f() Traceback (most recent call last): File "", line 1, in File "", line 2, in f UnboundLocalError: local variable 'x' referenced before assignment The situation is very similar. >> Introducing magic syntax that is recognised by the compiler but >> otherwise is not usable as a function is completely unacceptable. > > Because of?... 
And it would not be more 'magic' than any other > language syntax construct -- def, class, decorating with their @, *, ** > arguments etc. The fact that such a new syntax would be similar to > something already known and well settled (decorator function application > syntax) would be rather an andantage than a drawback. But the problem is that it is deceptively similar: it only *seems* similar, while the differences are profound. super() is the only magic function I know of in Python, and that change was controversial, hard to implement, and fragile. super() is special cased by the compiler and works in ways that no other function can do. Hence it is magic. I can't imagine that Guido will agree to a second example, at least not without a blindingly obvious benefit. You can't reason about super()'s behaviour like any other function. Things which should work if super() were non-magical break, such as aliasing: my_super = super # Just another name for the same function. class MyList(list): def __init__(self, *args): my_super().__init__(*args) self.attr = None >>> MyList([]) Traceback (most recent call last): File "", line 1, in File "", line 3, in __init__ SystemError: super(): __class__ cell not found And wrapping: _saved_super = super def super(*args, **kwargs): print(args, kwargs) return _saved_super(*args, **kwargs) class MyList(list): def __init__(self, *args): super().__init__(*args) self.attr = None >>> MyList([]) () {} Traceback (most recent call last): File "", line 1, in File "", line 3, in __init__ File "", line 3, in super SystemError: super(): no arguments Only the exact incantation of built-in super() inside a method of a class works. As I said: magic. (Although you can supply all the arguments for super manually, which is tricky to get right but non-magic.) You are proposing that inject should also be magic: only the exact incantation @inject(...) directly above a function will work. We won't be able to wrap inject in another function, or alias it, or use it without the @ syntax. inject() isn't really a decorator, although it superficially looks like one. It's actually a compiler directive. If you want to propose #pragma for Python, do so, but don't call it a decorator! Most importantly, we won't be able to apply it to functions that already exist: list_of_functions = [spam, ham, cheese] # defined elsewhere decorator = inject(a=1) decorated = [decorator(f) for f in list_of_functions] will fail. I consider this completely unacceptable. -- Steven From ncoghlan at gmail.com Fri Jun 17 06:39:50 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Fri, 17 Jun 2011 14:39:50 +1000 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: <20110616171525.GB2222@chopin.edu.pl> References: <20110611133028.GD2395@chopin.edu.pl> <20110614101755.GB3754@chopin.edu.pl> <20110615002825.GA2217@chopin.edu.pl> <4DF84F7B.3010700@improva.dk> <4DF89903.6030202@pearwood.info> <20110615231536.GA2182@chopin.edu.pl> <20110616171525.GB2222@chopin.edu.pl> Message-ID: On Fri, Jun 17, 2011 at 3:15 AM, Jan Kaliszewski wrote: > or even (to stress that it is a language syntax construct: > > ? ?@inject mem=collections.Counter(), MAX_MEM=1000 > ? ?def do_and_remember(val, verbose=False): While that would require the from__future__ dance to make "inject" a new keyword, I like it much better than the looks-like-a-decorator-but-isn't syntax. 
The '@def' keyword would also technically work with that positioning, but @inject provides a better mnemonic for what is going on when the assignments are positioned outside the function. > Also, such placement is imho more appropriate for @-that-starts-a-line- > -syntax (because it would *resemble* decorating syntax anyway -- and > what's wrong with that?) + we avoid misleading clusters such as: > > ? ?def do_something(func): > ? ? ? ?@def mem=colletions.Counter ?# <- looks a bit like another > ? ? ? ?@wraps(func) ? ? ? ? ? ? ? ? # ? ?decorator for wrapper_func() > ? ? ? ?@my_decorator(mem) > ? ? ? ?def wrapper_func(): > ? ? ? ? ? ?... The advantage of putting the '@def' lines *inside* the function is that it makes it clearer which namespace they're affecting. Examples like the above are readily addressed via style rules that say "don't do that, it's hard to read - leave a blank line between the @def code and the subsequent function decorators" Cheers, Nick. -- Nick Coghlan?? |?? ncoghlan at gmail.com?? |?? Brisbane, Australia From steve at pearwood.info Fri Jun 17 07:37:46 2011 From: steve at pearwood.info (Steven D'Aprano) Date: Fri, 17 Jun 2011 15:37:46 +1000 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: References: <20110611133028.GD2395@chopin.edu.pl> <20110614101755.GB3754@chopin.edu.pl> <20110615002825.GA2217@chopin.edu.pl> <4DF84F7B.3010700@improva.dk> <4DF89903.6030202@pearwood.info> <20110615231536.GA2182@chopin.edu.pl> <20110616171525.GB2222@chopin.edu.pl> Message-ID: <4DFAE82A.8040103@pearwood.info> Nick Coghlan wrote: > On Fri, Jun 17, 2011 at 3:15 AM, Jan Kaliszewski wrote: >> or even (to stress that it is a language syntax construct: >> >> @inject mem=collections.Counter(), MAX_MEM=1000 >> def do_and_remember(val, verbose=False): > > While that would require the from__future__ dance to make "inject" a > new keyword, I like it much better than the > looks-like-a-decorator-but-isn't syntax. What benefit is there in making inject a keyword? Even super isn't a keyword. As far as I'm concerned, inject need only be a function in the functools module, not even a built-in, let alone a keyword. Here's a quick and dirty version that comes close to the spirit of inject, as I see it. Thanks to Alex Light's earlier version. # Credit to Alex Light. from contextlib import contextmanager from functools import wraps def inject(**localArgs): def decorator(func): glbs = func.__globals__ @wraps(func) def inner(*args, **kwargs): with _modifyGlobals(glbs, localArgs): ret = func(*args, **kwargs) return ret return inner return decorator @contextmanager def _modifyGlobals(glbls, additions): frmglbls = glbls.copy() try: glbls.update(additions) yield finally: glbls.clear() glbls.update(frmglbls) And demonstrating it in use: >>> def func(obj): ... print(len) ... return len(obj)+1 ... >>> import builtins >>> newfunc = inject(len=lambda o: print(o) or builtins.len(o))(func) >>> >>> func([]) # Original function unchanged, still uses uninjected len. 1 >>> newfunc([]) # New function uses injected len. at 0xb7c2f86c> [] 1 And as a decorator: >>> @inject(a=1) ... def spam(): ... print(a) ... >>> a = 42 >>> spam() 1 Unfortunately, this proof-of-concept inject function doesn't actually inject into locals, hence the "import builtins" work-around. But it demonstrates the intent, and the API. 
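Another way to get the same decorator-style interface without touching the module namespace at all is to rebuild the function around a copied globals mapping. This is only a sketch (the helper name is illustrative, the lookups are still dictionary-based rather than true locals, and the snapshot is frozen at decoration time), but nothing outside the decorated function is affected:

    import functools
    import types

    def inject(**constants):
        def decorator(func):
            # A fresh mapping: a snapshot of the module globals with the
            # injected names layered on top; the module itself is untouched.
            namespace = dict(func.__globals__)
            namespace.update(constants)
            new_func = types.FunctionType(func.__code__, namespace,
                                          func.__name__, func.__defaults__,
                                          func.__closure__)
            new_func.__kwdefaults__ = func.__kwdefaults__
            return functools.wraps(func)(new_func)
        return decorator

    @inject(a=1)
    def spam():
        print(a)

    a = 42
    spam()    # prints 1; the module-level 'a' is still 42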
-- Steven From ericsnowcurrently at gmail.com Fri Jun 17 07:48:25 2011 From: ericsnowcurrently at gmail.com (Eric Snow) Date: Thu, 16 Jun 2011 23:48:25 -0600 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: References: <20110611133028.GD2395@chopin.edu.pl> <20110614101755.GB3754@chopin.edu.pl> <20110615002825.GA2217@chopin.edu.pl> <4DF84F7B.3010700@improva.dk> <4DF89903.6030202@pearwood.info> <20110615231536.GA2182@chopin.edu.pl> <20110616171525.GB2222@chopin.edu.pl> Message-ID: On Thu, Jun 16, 2011 at 10:39 PM, Nick Coghlan wrote: > On Fri, Jun 17, 2011 at 3:15 AM, Jan Kaliszewski wrote: >> or even (to stress that it is a language syntax construct: >> >> ? ?@inject mem=collections.Counter(), MAX_MEM=1000 >> ? ?def do_and_remember(val, verbose=False): > > While that would require the from__future__ dance to make "inject" a > new keyword, I like it much better than the > looks-like-a-decorator-but-isn't syntax. > Even still, at first glance that looks like a decorator. Also, going multiline makes it worse: ? ?@inject (mem=collections.Counter(), MAX_MEM=1000) ? ?def do_and_remember(val, verbose=False): ... > > The advantage of putting the '@def' lines *inside* the function is > that it makes it clearer which namespace they're affecting. Examples > like the above are readily addressed via style rules that say "don't > do that, it's hard to read - leave a blank line between the @def code > and the subsequent function decorators" > In my mind, putting it inside the function is similar to docstrings being inside, especially if the simple statements are evaluated at definition time. Also, I kind of like the idea of combing @ with the keyword since it distinguishes the context. What about when you have several of these and they get long? Could you use parentheses? With the def keyword and parentheses it looks different enough from a function that I don't mind it, but maybe the keyword could be different (I do like that it reuses a keyword): def f(a,b): """Do something... """ @def (x=[name for name in names if name != None], y=something_else) print(a, b) print([y(name) for name in x]) (And a given statement would fit in nicely instead of those parentheses. ;) -eric > Cheers, > Nick. > > -- > Nick Coghlan?? |?? ncoghlan at gmail.com?? |?? Brisbane, Australia > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > From ericsnowcurrently at gmail.com Fri Jun 17 07:49:36 2011 From: ericsnowcurrently at gmail.com (Eric Snow) Date: Thu, 16 Jun 2011 23:49:36 -0600 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: References: <20110611133028.GD2395@chopin.edu.pl> <20110614101755.GB3754@chopin.edu.pl> <20110615002825.GA2217@chopin.edu.pl> <4DF84F7B.3010700@improva.dk> <4DF89903.6030202@pearwood.info> <20110615231536.GA2182@chopin.edu.pl> <20110616171525.GB2222@chopin.edu.pl> Message-ID: > definition time. 
?Also, I kind of like the idea of combing @ with the s/combing/combining/ -eric From ericsnowcurrently at gmail.com Fri Jun 17 08:13:57 2011 From: ericsnowcurrently at gmail.com (Eric Snow) Date: Fri, 17 Jun 2011 00:13:57 -0600 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: <4DFAE82A.8040103@pearwood.info> References: <20110611133028.GD2395@chopin.edu.pl> <20110614101755.GB3754@chopin.edu.pl> <20110615002825.GA2217@chopin.edu.pl> <4DF84F7B.3010700@improva.dk> <4DF89903.6030202@pearwood.info> <20110615231536.GA2182@chopin.edu.pl> <20110616171525.GB2222@chopin.edu.pl> <4DFAE82A.8040103@pearwood.info> Message-ID: On Thu, Jun 16, 2011 at 11:37 PM, Steven D'Aprano wrote: > > What benefit is there in making inject a keyword? Even super isn't a > keyword. > > As far as I'm concerned, inject need only be a function in the functools > module, not even a built-in, let alone a keyword. > If inject is a decorator then it would be used to inject into the locals of any function at runtime. This is in contrast to Nick's proposal where it's strictly tied to definition time and injection is internal to to the subject of the injection, the function body. In the latter case, the resulting code object from the function body could incorporate the injection right there. To accomplish the same thing with a decorator, the function definition would have to know about any possible decoration before the code object is compiled (good luck), or the decorator would replace/modify the compiled code. Seems like that's not far off from what Jan was proposing. The alternative is to have the injection handled externally to the compiled code object, like a co_static on the code object of a __static__ on the function object. Then the execution of the function code object would pull that in. -eric From ncoghlan at gmail.com Fri Jun 17 09:02:52 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Fri, 17 Jun 2011 17:02:52 +1000 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: <4DFAE82A.8040103@pearwood.info> References: <20110611133028.GD2395@chopin.edu.pl> <20110614101755.GB3754@chopin.edu.pl> <20110615002825.GA2217@chopin.edu.pl> <4DF84F7B.3010700@improva.dk> <4DF89903.6030202@pearwood.info> <20110615231536.GA2182@chopin.edu.pl> <20110616171525.GB2222@chopin.edu.pl> <4DFAE82A.8040103@pearwood.info> Message-ID: On Fri, Jun 17, 2011 at 3:37 PM, Steven D'Aprano wrote: > What benefit is there in making inject a keyword? Even super isn't a > keyword. Given the magic effects they have on the compiler, super and __class__ probably *should* be keywords. The only reason they aren't is that their effect (automatically defining __class__ as a local when inside a function in a class scope) is relatively harmless in the event that super has actually been rebound to refer to something other than the builtin: class C: def f(self): print(locals()) def g(self): __class__ print(locals()) def h(self): super print(locals()) >>> C().f() {'self': <__main__.C object at 0xde60d0>} >>> C().g() {'self': <__main__.C object at 0xde60d0>, '__class__': } >>> C().h() {'self': <__main__.C object at 0xde60d0>, '__class__': } > As far as I'm concerned, inject need only be a function in the functools > module, not even a built-in, let alone a keyword. > > Here's a quick and dirty version that comes close to the spirit of inject, > as I see it. Thanks to Alex Light's earlier version. > > > # Credit to Alex Light. 
> from contextlib import contextmanager > from functools import wraps > > def inject(**localArgs): > ? ?def decorator(func): > ? ? ? ?glbs = func.__globals__ > ? ? ? ?@wraps(func) > ? ? ? ?def inner(*args, **kwargs): > ? ? ? ? ? ?with _modifyGlobals(glbs, localArgs): > ? ? ? ? ? ? ? ?ret = func(*args, **kwargs) > ? ? ? ? ? ?return ret > ? ? ? ?return inner > ? ?return decorator Sorry, I meant to point out why this was a bad idea when Alex first posted it. The __globals__ reference on a function object refers to the globals of the module where the function is defined. Modify the contents of that dictionary and you modify the contents of that module. So this "injection" approach not only affects the function being decorated, but every other function in the module. Thread safety is completely non-existent and cannot be handled locally within the decorated function. The reason something like @inject or @def is needed as a language construct is because the object of the exercise is to define a new kind of scope (call it "shared locals" for lack of a better name) and we need the compiler's help to do it properly. Currently, the closest equivalent to a shared locals scope is the default argument namespace, which is why people use it that way: the names are assigned values at function definition time, and they are automatically copied into the frame locals whenever the function is invoked. A shared namespace can also be created explicitly by using a closure or a class, but both of those suffer from serious verbosity (and hence readability) problems when the design intent you are aiming to express is a single algorithm with some persistent state. As noted in Jan's original message, using the default argument namespace has its own flaws (rebinding of immutable targets not working properly, cluttering the function signature on introspection, risk of inadvertent replacement in the call), but if it didn't address a genuine design need, it wouldn't be so popular. Hence the current discussion, which reminds me a lot of the PEP 308 (ternary expressions) discussion. Developers have proven they want this functionality by coming up with a hack that does it, but the hack is inherently flawed. Telling them "don't do that" is never going to work, so the best way to eliminate usage of the hack is to provide a way to do it *right*. (Relating back to PEP 308: how often do you see the and/or hack in modern Python code written by anyone that learned the language post Python 2.4?) The runtime *semantics* of my implementation sketch (an additional set of cells stored on the function object that are known to the compiler and accessed via closure ) are almost certainly the right way to go: it's a solution that cleanly handles rebinding of immutable targets and avoids cluttering the externall visible function signature with additional garbage. The only question is how to tell the compiler about it, and there are three main options for that: 1. Embedded in the function header, modelled on the handling of keyword-only arguments: def example(arg, **, cache=set(), invocations=0): """Record and return arguments seen and count the number of times the function has been invoked""" invocations += 1 cache.add(arg) return arg Pros: no bikeshedding about the keyword for the new syntax, namespace for execution is clearly the same as that for default arguments (i.e. 
the containing namespace) Cons: look like part of the argument namespace (when they really aren't), no mnemonic to assist new users in remembering what they're for, no open questions 2. Inside the function as a new statement type (bikeshed colour options: @def, @shared, shared) def example(arg, **, cache=set(), invocations=0): """Record and return arguments seen and count the number of times the function has been invoked""" @def cache=set(), invocations=0 invocations += 1 cache.add(arg) return arg Pros: implementation detail of shared state is hidden inside the function where it belongs, keyword choice can provide a good mnemonic for functionality Cons: needs new style rules on appropriate placements of @def/shared statements (similar to nonlocal and global), use of containing namespace for execution may be surprising Open Questions: whether to allow only one line with a tuple of assignments or multiple lines, whether to allow simple assignments only or any simple non-flow control statement 3. After the decorators and before the function definition (bikeshed colour options: @def, @inject, @shared) @def cache=set(), invocations=0 def example(arg) """Record and return arguments seen and count the number of times the function has been invoked""" invocations += 1 cache.add(arg) return arg Pros: keyword choice can provide a good mnemonic for functionality, namespace for execution is clearly the same as that for decorator expressions (i.e. the containing namespace) Cons: puts private implementation details ahead of the public signature information, looks too much like an ordinary decorator Open Questions: whether to allow only one line with a tuple of assignments or multiple lines I already have too much on my to-do list to champion a PEP for this, but I'd be happy to help someone else with the mechanics of writing one and getting it published on python.org (hint, hint Jan!). Cheers, Nick. -- Nick Coghlan?? |?? ncoghlan at gmail.com?? |?? Brisbane, Australia From steve at pearwood.info Fri Jun 17 14:12:58 2011 From: steve at pearwood.info (Steven D'Aprano) Date: Fri, 17 Jun 2011 22:12:58 +1000 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: References: <20110611133028.GD2395@chopin.edu.pl> <20110614101755.GB3754@chopin.edu.pl> <20110615002825.GA2217@chopin.edu.pl> <4DF84F7B.3010700@improva.dk> <4DF89903.6030202@pearwood.info> <20110615231536.GA2182@chopin.edu.pl> <20110616171525.GB2222@chopin.edu.pl> <4DFAE82A.8040103@pearwood.info> Message-ID: <4DFB44CA.1050605@pearwood.info> Nick Coghlan wrote: > On Fri, Jun 17, 2011 at 3:37 PM, Steven D'Aprano wrote: >> Here's a quick and dirty version that comes close to the spirit of inject, >> as I see it. Thanks to Alex Light's earlier version. [code snipped] > > Sorry, I meant to point out why this was a bad idea when Alex first posted it. > > The __globals__ reference on a function object refers to the globals > of the module where the function is defined. Modify the contents of > that dictionary and you modify the contents of that module. So this > "injection" approach not only affects the function being decorated, > but every other function in the module. Thread safety is completely > non-existent and cannot be handled locally within the decorated > function. I believe that you may have missed that the _modifyGlobals context manager makes a copy of globals before modifying it. 
But even if you are correct, and the implementation as given is broken, I had written: [quote] Unfortunately, this proof-of-concept inject function DOESN'T ACTUALLY INJECT INTO LOCALS [emphasis added], hence the "import builtins" work-around. But it demonstrates the intent, and the API. [end quote] There was no intention for this to be the working implementation, just to demonstrate the API. As I described earlier, "close to the spirit of inject". I agree with much of the rest of your post (snipped for brevity), with a few additional points below: > The only question is how to tell the compiler > about it, and there are three main options for that: Four actually. > 1. Embedded in the function header, modelled on the handling of > keyword-only arguments: > > def example(arg, **, cache=set(), invocations=0): > Cons: look like part of the argument namespace (when they really > aren't), no mnemonic to assist new users in remembering what they're > for, no open questions Additional con: you can only inject such locals at the time you write the function. Cannot take an existing function and make a runtime modification of it. This is, essentially, a compiler directive masquerading as function parameters. > 2. Inside the function as a new statement type (bikeshed colour > options: @def, @shared, shared) > Pros: implementation detail of shared state is hidden inside the > function where it belongs, keyword choice can provide a good mnemonic > for functionality This proposal shouldn't be just about shared state. That is just one use-case out of a number, and not all use-cases should be hidden. > Cons: needs new style rules on appropriate placements of @def/shared > statements (similar to nonlocal and global), use of containing > namespace for execution may be surprising Additional cons: looks too much like a decorator, particularly if the function contains an inner function; also looks too much like a function definition. Can only be performed when the function is written, and cannot be applied to existing functions. This too is a compiler directive, this time masquerading as a decorator. > Open Questions: whether to allow only one line with a tuple of > assignments or multiple lines, whether to allow simple assignments > only or any simple non-flow control statement Whether the @def line must appear at the start of the function (before or after the docstring), or like globals, can it appear anywhere in the function? > 3. After the decorators and before the function definition (bikeshed > colour options: @def, @inject, @shared) This too is a compiler directive masquerading as a decorator. It too suffers from much the same cons as putting @def inside the function body. You have missed a fourth option, which I have been championing: make inject an ordinary function, available from the functools module. The *implementation* of inject almost certainly will require support from the compiler, but that doesn't mean the interface should! Pros: - "Inject" is the obvious name, because that's what it does: inject the given keyword arguments into a (copy of a) function as locals. - Not a compiler directive, but an ordinary function that operates at runtime like any other function. - Hence it works like ordinary decorators. - Doesn't require a new keyword or new syntax. - Can be applied to any Python function at any time, not just when the function is written. Cons: - The implementation will require unsupported bytecode hacks, or compiler support, but so will any other solution. 
This is only a negative when compared to the alternative "do nothing". - Some people may disagree that "inject" is the obvious name. There may still be room for bikeshedding. Open questions: - Should injected locals go directly into the locals, as if executed in the body of the function, or into a new "shared/injected locals" namespace as suggested by Nick? -- Steven From ncoghlan at gmail.com Fri Jun 17 15:26:43 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Fri, 17 Jun 2011 23:26:43 +1000 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: <4DFB44CA.1050605@pearwood.info> References: <20110611133028.GD2395@chopin.edu.pl> <20110614101755.GB3754@chopin.edu.pl> <20110615002825.GA2217@chopin.edu.pl> <4DF84F7B.3010700@improva.dk> <4DF89903.6030202@pearwood.info> <20110615231536.GA2182@chopin.edu.pl> <20110616171525.GB2222@chopin.edu.pl> <4DFAE82A.8040103@pearwood.info> <4DFB44CA.1050605@pearwood.info> Message-ID: On Fri, Jun 17, 2011 at 10:12 PM, Steven D'Aprano wrote: > You have missed a fourth option, which I have been championing: make inject > an ordinary function, available from the functools module. The > *implementation* of inject almost certainly will require support from the > compiler, but that doesn't mean the interface should! No, I didn't miss it, I left it out on purpose because I think messing with the runtime name lookup semantics is a terrible idea. You and others seem fond of it, but namespace semantics are the heart and soul of why functions are so much faster than module level code and we shouldn't be touching that logic with a 10 foot pole. Adding a new cell-based shared namespace that uses the same runtime lookup semantics as closures to replace *existing* uses of the default argument hack? Sure, that's a reasonable proposal (it may still get rejected due to devils in the details, but it has at least as much going for it as PEP 308 did). Messing with normal locals from outside a function, or providing an officially sanctioned way to convert global references to some other kind of reference *after* the function has already been defined? Hell no, that's a solution looking for a problem and the concept of eliminating the default argument hack shouldn't be burdened with that kind of overreaching. The secret to the speed of functions lies in the fact that the compiler knows all the names at compile time so it can generate appropriate load/store operations for the different scopes (array lookup for locals, cell dereference for closure variables, global-or-builtin lookup for everything else). This benefits not just CPython, but all Python implementations: inside a function, they're allowed to assume that the *only* code changing the state of the locals is the function code itself. Cell dereferencing allows for the fact that closure variables might change (but are still reasonably close to locals in speed, since the *cells* are referenced from an array), and global and builtin lookup is the slowest of all (since it involves actually looking up identifiers in namespace dictionaries). Even a JIT compiler like PyPy can be more aggressive about optimising local and cell access than it can be about the officially shifting sands that are the global and builtin namespaces. This is why the nonlocal and global directives exist: to tell the compiler to change how it treats certain names. Arguments (including the associated default values) are given additional special treatment due to their placement in the function header. 
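The difference is easy to see directly in the bytecode (CPython shown; the exact opcode names vary a little between versions):

    import dis

    g = 10

    def outer():
        c = 20
        def inner(p):
            local = 30
            return p + local + c + g
        return inner

    dis.dis(outer())
    # 'p' and 'local' -> LOAD_FAST   (indexed lookup in the frame's locals array)
    # 'c'             -> LOAD_DEREF  (cell dereference, as for any closure variable)
    # 'g'             -> LOAD_GLOBAL (dict lookup in globals, then builtins)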
If we want to create a new namespace that is given special treatment by the compiler, those are the two options that are even remotely viable: placement in the function header (after the ** entry) or flagged via a new compiler directive (and the precedent of "nonlocal" and "global" suggests that directive should occur inside the function body rather than anywhere else). "@def" is primarily a proposal to avoid having to do the from __future__ dance in defining a new keyword, so I'll modify it to the more explicit "atdef" to avoid confusion with decorators). A new compiler directive is my own preference (due to the major semantic differences between how shared variables will be handled and how default arguments are handled), and I now believe it makes sense to use nonlocal, global and default arguments as the model for how that would work: atdef VAR=EXPR [, VAR=EXPR]* As with nonlocal and global, definition time statements could technically appear anywhere in the function body (with their full effect), but style guidelines would recommend placing them at the beginning of the function, just after the docstring. Parentheses around the var list would not be permitted - use multiple shared statements instead (parentheses would, however, naturally permit the expressions themselves to span multiple lines). Such a statement would readily cover the speed enhancement, early-binding and shared state use cases for the default argument hack (indeed, the compiler could conceivably detect if a shared value was never rebound and simply load the cell contents into each frame as a local variable in that case, avoiding even the cell dereference overhead relative to the speed hack). The 'atdef' phrasing slightly emphasises the early-binding use case, but still seems reasonable for the speed enhancement and shared state use cases. In contrast, a keyword like 'shared' which emphasised the shared state use case, would feel far more out of place when used for speed enhancement or early binding (as well as being far more likely to conflict with existing variables names). Cheers, Nick. -- Nick Coghlan?? |?? ncoghlan at gmail.com?? |?? Brisbane, Australia From scialexlight at gmail.com Fri Jun 17 17:22:42 2011 From: scialexlight at gmail.com (Alex Light) Date: Fri, 17 Jun 2011 11:22:42 -0400 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: References: <20110611133028.GD2395@chopin.edu.pl> <20110614101755.GB3754@chopin.edu.pl> <20110615002825.GA2217@chopin.edu.pl> <4DF84F7B.3010700@improva.dk> <4DF89903.6030202@pearwood.info> <20110615231536.GA2182@chopin.edu.pl> <20110616171525.GB2222@chopin.edu.pl> <4DFAE82A.8040103@pearwood.info> <4DFB44CA.1050605@pearwood.info> Message-ID: On Fri, Jun 17, 2011 at 9:26 AM, Nick Coghlan wrote: > atdef VAR=EXPR [, VAR=EXPR]* Such a statement would readily cover the speed enhancement, > early-binding and shared state use cases for the default argument hack > Doing things this way however removes one of the chief benefits of the inject statement, the ability to create multiple versions of the same function with different sets of shared data. You earlier gave an example similar to this as an example of what the syntax under your proposal would look like. 
def example(arg): """Record and return arguments seen""" atdef cache=set() #do something cache.add(arg) return arg under proposal 4 (inject as full blown function, usable like any decorator) it would look like this instead @inject(cache = set()) def example(arg): """Record and return arguments seen""" #do something cache.add(arg) return arg but what if, later, you decided you needed 10 different 'example' functions each with its own cache as well as the ability to make more. under your proposal we would have to make extensive changes. turning the code into def make_example(): def example(arg): """Record and return arguments seen""" #do something cache.add(arg) return arg return example example_list = [make_example() for i in range(10)] this code is rather difficult to read and it is difficult at first glance to figure out what is actually happening under proposal 4 the result would be much simpler. we could just remove the @inject and put that on in the list comprehension, like so. def _example_func(arg): """Record and return arguments seen""" #do something cache.add(arg) return arg example_list = [inject(cache=set())( _example_func) for i in range(10)] make_example = lambda: inject(cache=set())( _example_func) this is IMO far easier to read and understand. Furthermore it gives the added benefit in that you can chose to run 'example' so that it uses a true global variable, instead of an injected one. --Alex -------------- next part -------------- An HTML attachment was scrubbed... URL: From jimjjewett at gmail.com Fri Jun 17 17:59:01 2011 From: jimjjewett at gmail.com (Jim Jewett) Date: Fri, 17 Jun 2011 11:59:01 -0400 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: References: <20110611133028.GD2395@chopin.edu.pl> <20110614101755.GB3754@chopin.edu.pl> <20110615002825.GA2217@chopin.edu.pl> <4DF84F7B.3010700@improva.dk> <4DF89903.6030202@pearwood.info> <20110615231536.GA2182@chopin.edu.pl> <20110616171525.GB2222@chopin.edu.pl> Message-ID: On Fri, Jun 17, 2011 at 12:39 AM, Nick Coghlan wrote: > On Fri, Jun 17, 2011 at 3:15 AM, Jan Kaliszewski wrote: >> or even (to stress that it is a language syntax construct: >> ? ?@inject mem=collections.Counter(), MAX_MEM=1000 >> ? ?def do_and_remember(val, verbose=False): > While that would require the from__future__ dance to make "inject" a > new keyword, I like it much better than the > looks-like-a-decorator-but-isn't syntax. This is reminding me of the once (or final or static) discussion a few years ago, but I can't seem to find that PEP. 
-jJ From ethan at stoneleaf.us Fri Jun 17 19:31:22 2011 From: ethan at stoneleaf.us (Ethan Furman) Date: Fri, 17 Jun 2011 10:31:22 -0700 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: References: <20110611133028.GD2395@chopin.edu.pl> <20110614101755.GB3754@chopin.edu.pl> <20110615002825.GA2217@chopin.edu.pl> <4DF84F7B.3010700@improva.dk> <4DF89903.6030202@pearwood.info> <20110615231536.GA2182@chopin.edu.pl> <20110616171525.GB2222@chopin.edu.pl> <4DFAE82A.8040103@pearwood.info> <4DFB44CA.1050605@pearwood.info> Message-ID: <4DFB8F6A.50506@stoneleaf.us> Alex Light wrote: > On Fri, Jun 17, 2011 at 9:26 AM, Nick Coghlan wrote: > > atdef VAR=EXPR [, VAR=EXPR]* > > Such a statement would readily cover the speed enhancement, > > early-binding and shared state use cases for the default argument hack > > > Doing things this way however removes one of the chief benefits of the > inject statement, the ability to create multiple versions of the same function with > different sets of shared data. [snip] > Furthermore it gives the added benefit in that you can chose to run > 'example' so that it uses a true global variable, instead of an injected one. That answers my question about why you would want to inject into an already defined function. ~Ethan~ From ericsnowcurrently at gmail.com Fri Jun 17 20:43:28 2011 From: ericsnowcurrently at gmail.com (Eric Snow) Date: Fri, 17 Jun 2011 12:43:28 -0600 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: References: <20110611133028.GD2395@chopin.edu.pl> <20110614101755.GB3754@chopin.edu.pl> <20110615002825.GA2217@chopin.edu.pl> <4DF84F7B.3010700@improva.dk> <4DF89903.6030202@pearwood.info> <20110615231536.GA2182@chopin.edu.pl> <20110616171525.GB2222@chopin.edu.pl> <4DFAE82A.8040103@pearwood.info> <4DFB44CA.1050605@pearwood.info> Message-ID: On Fri, Jun 17, 2011 at 9:22 AM, Alex Light wrote: > > Doing things this way however removes one of the chief benefits of the > inject statement, > the ability to create multiple versions of the same function with different > sets of shared data. The only way I could see runtime injection work is if you limited the injection to names already tied to the locals, in the same way parameters are. This would require one of the 3 solutions that Nick outlined. Let's assume the injection values were stored on the function object, like closures and defaults are, perhaps in an attribute named __atdef__. Then runtime injection could be used to replace __atdef__, like you can with the defaults, if it were not read-only. However, If __atdef__ were read-only, like __closure__, the runtime injection would have to do some trickery, like you have to do if you are going to mess with the closures [1]. This is a hack, in my mind, since being read-only indicates to me that the expectation is you shouldn't touch it! With that said, in that case a runtime injection function would have to generate a new function. The new function would have to have the desired __atdef__, and match any other read-only attribute. Then the injection function would copy into the new function object all the remaining attributes of the old function, including the code object. 
Here's an example of what I mean: def inject(f, *args): if len(args) != len(f.__atdef__): raise TypeError("__atdef__ mismatch") func = FunctionType(f.__code__, f.__globals__, f.__name__, f.__defaults__, f.__closure__, tuple(args)) # copy in the remaining attributes, like __doc__ return func You can already do this with closures and defaults. If the elements of __atdef__ are cells, like with __closure__ then you would have to throw in a little more logic. If you wanted to do kwargs, you would have to introspect the names corresponding to __atdef__. I don't know if this would have a performance impact on the new function, in case __defaults__ or __closure__ are more than just attributes on the function object. Like I said, this is a hack around the read-only attribute, but it shows that with the solutions Nick outlined, you can still have an injection function (could be used as a decorator, I suppose) . The only catch is that the names for __atdef__ would be tied into the function body at definition time, which I think is a good thing. Finally, regardless of if __atdef__ were read-only or not, I think a runtime injection function should return a new function and leave the old one untouched. That seems to meet the use-case that you presented. -eric [1] Here's an example: from types import FunctionType INJECTEDKEY = "injected_{}" OUTERLINE = " outer_{0} = injected_{0}" INNERLINE = " inner_{0} = outer_{0}" SOURCE= ("def not_important():", " def also_not_important():", " return also_not_important") def inject_closure(f, *args): injected = {} source = list(SOURCE) for i in range(len(args)): source.insert(1, OUTERLINE.format(i)) source.insert(-1, INNERLINE.format(i)) injected[INJECTEDKEY.format(i)] = args[i] exec("\n".join(source), injected, injected) closure = injected["not_important"]().__closure__ func = FunctionType(f.__code__, f.__globals__, f.__name__, f.__defaults__, closure) func.__annotations__ = f.__annotations__ func.__doc__ = f.__doc__ func.__kwdefaults__ = f.__kwdefaults__ func.__module__ = f.__module__ return func From ronaldoussoren at mac.com Fri Jun 17 16:30:00 2011 From: ronaldoussoren at mac.com (Ronald Oussoren) Date: Fri, 17 Jun 2011 16:30:00 +0200 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: <4DFAE82A.8040103@pearwood.info> References: <20110611133028.GD2395@chopin.edu.pl> <20110614101755.GB3754@chopin.edu.pl> <20110615002825.GA2217@chopin.edu.pl> <4DF84F7B.3010700@improva.dk> <4DF89903.6030202@pearwood.info> <20110615231536.GA2182@chopin.edu.pl> <20110616171525.GB2222@chopin.edu.pl> <4DFAE82A.8040103@pearwood.info> Message-ID: On 17 Jun, 2011, at 7:37, Steven D'Aprano wrote: > Nick Coghlan wrote: >> On Fri, Jun 17, 2011 at 3:15 AM, Jan Kaliszewski wrote: >>> or even (to stress that it is a language syntax construct: >>> >>> @inject mem=collections.Counter(), MAX_MEM=1000 >>> def do_and_remember(val, verbose=False): >> While that would require the from__future__ dance to make "inject" a >> new keyword, I like it much better than the >> looks-like-a-decorator-but-isn't syntax. > > What benefit is there in making inject a keyword? Even super isn't a keyword. > > As far as I'm concerned, inject need only be a function in the functools module, not even a built-in, let alone a keyword. > > Here's a quick and dirty version that comes close to the spirit of inject, as I see it. Thanks to Alex Light's earlier version. FYI implements a "bind all globals" variant of the injection using byte-code hacks. 
Changing that to only bind specific globals should be easy enough ;-) Is the inject functionality needed by other Python implementations, and in particular by PyPy? I use the keyword arguments hack mostly for two reasons: slightly higher speed in tight loops and a workaround for accessing globals in __del__ methods. Both are primairily CPython hacks, AFAIK the PyPy jit is smart enough to optimize globals access close to the speed of local variable acces. A stdlib function that implements the activestate recipe would be good enough if the functionality is only needed for CPython (with a fallback to an function that doesn't change the function body for the other Python implementations) Ronald From ethan at stoneleaf.us Fri Jun 17 23:00:00 2011 From: ethan at stoneleaf.us (Ethan Furman) Date: Fri, 17 Jun 2011 14:00:00 -0700 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: References: <20110611133028.GD2395@chopin.edu.pl> <20110614101755.GB3754@chopin.edu.pl> <20110615002825.GA2217@chopin.edu.pl> <4DF84F7B.3010700@improva.dk> <4DF89903.6030202@pearwood.info> <20110615231536.GA2182@chopin.edu.pl> <20110616171525.GB2222@chopin.edu.pl> <4DFAE82A.8040103@pearwood.info> <4DFB44CA.1050605@pearwood.info> Message-ID: <4DFBC050.4000906@stoneleaf.us> Alex Light wrote: > Furthermore it gives the added benefit in that you can chose to run > 'example' so that it uses > a true global variable, instead of an injected one. Only if the global is mutable. Rebinding an immutable global requires the 'global' keyword, which would then clash with the injected names. ~Ethan~ From tjreedy at udel.edu Fri Jun 17 22:58:56 2011 From: tjreedy at udel.edu (Terry Reedy) Date: Fri, 17 Jun 2011 16:58:56 -0400 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: References: <20110611133028.GD2395@chopin.edu.pl> <20110614101755.GB3754@chopin.edu.pl> <20110615002825.GA2217@chopin.edu.pl> <4DF84F7B.3010700@improva.dk> <4DF89903.6030202@pearwood.info> <20110615231536.GA2182@chopin.edu.pl> <20110616171525.GB2222@chopin.edu.pl> <4DFAE82A.8040103@pearwood.info> <4DFB44CA.1050605@pearwood.info> Message-ID: On 6/17/2011 9:26 AM, Nick Coghlan wrote: > This is why the nonlocal and global directives exist: to tell the > compiler to change how it treats certain names. Arguments (including > the associated default values) are given additional special treatment > due to their placement in the function header. If we want to create a > new namespace that is given special treatment by the compiler, I do not really want a new namespace and for the purpose of the OP, named local constants (for speed or freezing the meaning of an expression or both), we do not need one. There is already a fourth 'namespace' for constants, a tuple f.__code__.co_consts, whose 'names' are indexes, just as with the locals array. Given def f(a, **, b=1001, len = len): return 2001 # one possible spelling def f(a): # alternate constant b = 1001, len = len return 2001 the compiler should put 1001 and len into co.consts and convert 'b' and 'len' into the corresponding indexes, just like it does with 'a', and use the LOAD_CONST bytecode just as with literal constants like 2001 in the body. Constant names would not go into .co_names and not increment .co_argcount. This would make named constants as fast and def-time frozen as default args without the disadvantages of being included in the signature and over-writable on calls. 
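The machinery for the unnamed half of this already exists, of course: literals are stored in co_consts and fetched with LOAD_CONST, which is exactly the treatment being proposed for the named constants as well. For example:

    import dis

    def f(a):
        return 2001

    print(f.__code__.co_consts)    # 2001 already lives in the constants tuple
    dis.dis(f)                     # ...and is fetched with LOAD_CONST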
-- Terry Jan Reedy From steve at pearwood.info Sat Jun 18 06:03:34 2011 From: steve at pearwood.info (Steven D'Aprano) Date: Sat, 18 Jun 2011 14:03:34 +1000 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: References: <20110611133028.GD2395@chopin.edu.pl> <20110614101755.GB3754@chopin.edu.pl> <20110615002825.GA2217@chopin.edu.pl> <4DF84F7B.3010700@improva.dk> <4DF89903.6030202@pearwood.info> <20110615231536.GA2182@chopin.edu.pl> <20110616171525.GB2222@chopin.edu.pl> <4DFAE82A.8040103@pearwood.info> <4DFB44CA.1050605@pearwood.info> Message-ID: <4DFC2396.1060905@pearwood.info> Nick Coghlan wrote: > On Fri, Jun 17, 2011 at 10:12 PM, Steven D'Aprano wrote: >> You have missed a fourth option, which I have been championing: make inject >> an ordinary function, available from the functools module. The >> *implementation* of inject almost certainly will require support from the >> compiler, but that doesn't mean the interface should! > > No, I didn't miss it, I left it out on purpose because I think messing > with the runtime name lookup semantics is a terrible idea. Isn't changing name lookup semantics at runtime precisely what JIT compilers do? But it doesn't really matter, because that's not what I'm proposing. I'm not suggesting that the lookup semantics should be changed when the function is called. I'm saying that a new function should be created, based on the original function, with the desired semantics. In principle, this could be as simple as: - make a copy of the function object - in the copy, add cells for any injected variable - and modify the copied code object to change the appropriate LOAD_GLOBAL opcodes to LOAD_DEREF (and similarly for rebindings). although I dare say that in practice there'll be a certain amount of book-keeping required to make it work reliably. -- Steven From jh at improva.dk Sat Jun 18 13:04:45 2011 From: jh at improva.dk (Jacob Holm) Date: Sat, 18 Jun 2011 13:04:45 +0200 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: <4DFC2396.1060905@pearwood.info> References: <20110611133028.GD2395@chopin.edu.pl> <20110614101755.GB3754@chopin.edu.pl> <20110615002825.GA2217@chopin.edu.pl> <4DF84F7B.3010700@improva.dk> <4DF89903.6030202@pearwood.info> <20110615231536.GA2182@chopin.edu.pl> <20110616171525.GB2222@chopin.edu.pl> <4DFAE82A.8040103@pearwood.info> <4DFB44CA.1050605@pearwood.info> <4DFC2396.1060905@pearwood.info> Message-ID: <4DFC864D.1000200@improva.dk> On 2011-06-18 06:03, Steven D'Aprano wrote: > Nick Coghlan wrote: >> On Fri, Jun 17, 2011 at 10:12 PM, Steven D'Aprano >> wrote: >>> You have missed a fourth option, which I have been championing: make >>> inject >>> an ordinary function, available from the functools module. The >>> *implementation* of inject almost certainly will require support from >>> the >>> compiler, but that doesn't mean the interface should! >> >> No, I didn't miss it, I left it out on purpose because I think messing >> with the runtime name lookup semantics is a terrible idea. > I think that depends on what you count as part of the runtime name lookup semantics. > Isn't changing name lookup semantics at runtime precisely what JIT > compilers do? But it doesn't really matter, because that's not what I'm > proposing. I'm not suggesting that the lookup semantics should be > changed when the function is called. I'm saying that a new function > should be created, based on the original function, with the desired > semantics. 
> > In principle, this could be as simple as: > > - make a copy of the function object > - in the copy, add cells for any injected variable > - and modify the copied code object to change the appropriate > LOAD_GLOBAL opcodes to LOAD_DEREF (and similarly for rebindings). > > although I dare say that in practice there'll be a certain amount of > book-keeping required to make it work reliably. > If you want the injected values to also affect functions that are defined within the decorated function it gets a lot more complicated. But yes, in theory it could work. One thing that would make it a *lot* easier to write such an "inject" function would be if we could replace the way globals are looked up to use cells as well. I am thinking of a different "kind" of cell that wouldn't hold its value itself but get it from the module globals the way LOAD_GLOBAL does today. This cell would be in the __closures__ of the function and could be replaced using a decorator like Steven proposed. A consequence of this would be that you could optionally allow "nonlocal" to bind global names when there are no suitable nonlocal names to bind to (e.g. in a top-level function). It has always slightly bothered me that you couldn't do that, because it makes it harder to move code between levels. As written, this would probably slow down access to globals a little bit. However I have another idea (basically a more backward-compatible variation of PEP 280) that would let us use cells or cell-like objects for almost all accesses, at the cost of changing .__dict__ to be a dict *subclass*. Best regards - Jacob From cmjohnson.mailinglist at gmail.com Sat Jun 18 23:22:51 2011 From: cmjohnson.mailinglist at gmail.com (Carl M. Johnson) Date: Sat, 18 Jun 2011 11:22:51 -1000 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: <4DFC864D.1000200@improva.dk> References: <20110611133028.GD2395@chopin.edu.pl> <20110614101755.GB3754@chopin.edu.pl> <20110615002825.GA2217@chopin.edu.pl> <4DF84F7B.3010700@improva.dk> <4DF89903.6030202@pearwood.info> <20110615231536.GA2182@chopin.edu.pl> <20110616171525.GB2222@chopin.edu.pl> <4DFAE82A.8040103@pearwood.info> <4DFB44CA.1050605@pearwood.info> <4DFC2396.1060905@pearwood.info> <4DFC864D.1000200@improva.dk> Message-ID: On Sat, Jun 18, 2011 at 1:04 AM, Jacob Holm wrote: > A consequence of this would be that you could optionally allow > "nonlocal" to bind global names when there are no suitable nonlocal > names to bind to (e.g. in a top-level function). ?It has always slightly > bothered me that you couldn't do that, because it makes it harder to > move code between levels. I will say that I was surprised to discover that `nonlocal` can be used only for outer function locals and not for globals, since the name "nonlocal" seems very general, as if it encompassed all things not local. If it could be changed, I think it would be a little more intuitive. In addition, as you mention, it's slightly better for refactoring. 
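For illustration, a small runnable sketch of the asymmetry described above (CPython 3.x; the exact wording of the SyntaxError may differ between versions):

    x = 0

    def bump_global():
        global x    # 'global' lets a function rebind a module-level name
        x += 1

    # 'nonlocal' only binds names from an enclosing *function* scope; with no
    # such scope available the compiler rejects the definition outright:
    try:
        compile("def bump_nonlocal():\n    nonlocal x\n    x += 1\n", "<test>", "exec")
    except SyntaxError as exc:
        print("rejected:", exc)    # e.g. no binding for nonlocal 'x' found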
From tjreedy at udel.edu Sun Jun 19 07:09:20 2011 From: tjreedy at udel.edu (Terry Reedy) Date: Sun, 19 Jun 2011 01:09:20 -0400 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: References: <20110611133028.GD2395@chopin.edu.pl> <20110614101755.GB3754@chopin.edu.pl> <20110615002825.GA2217@chopin.edu.pl> <4DF84F7B.3010700@improva.dk> <4DF89903.6030202@pearwood.info> <20110615231536.GA2182@chopin.edu.pl> <20110616171525.GB2222@chopin.edu.pl> <4DFAE82A.8040103@pearwood.info> <4DFB44CA.1050605@pearwood.info> <4DFC2396.1060905@pearwood.info> <4DFC864D.1000200@improva.dk> Message-ID: On 6/18/2011 5:22 PM, Carl M. Johnson wrote: > I will say that I was surprised to discover that `nonlocal` can be > used only for outer function locals and not for globals, since the > name "nonlocal" seems very general, as if it encompassed all things > not local. We were aware of that when it was selected, after much discussion. Closures were originally read-only partly because it was uncertain how to spell 'write'. > If it could be changed, I think it would be a little more > intuitive. It cannot, I hope for obvious reasons. I should hope that the docs make clear enought that names in a function are *partitioned* into module, closure, and local and that written names are local by default or one of the other 2 if declared. Actually, I think improving the nonlocal doc is part of some issue. -- Terry Jan Reedy From raymond.hettinger at gmail.com Sun Jun 19 07:41:30 2011 From: raymond.hettinger at gmail.com (Raymond Hettinger) Date: Sun, 19 Jun 2011 06:41:30 +0100 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: References: <20110611133028.GD2395@chopin.edu.pl> <20110614101755.GB3754@chopin.edu.pl> <20110615002825.GA2217@chopin.edu.pl> <4DF84F7B.3010700@improva.dk> <4DF89903.6030202@pearwood.info> <20110615231536.GA2182@chopin.edu.pl> <20110616171525.GB2222@chopin.edu.pl> <4DFAE82A.8040103@pearwood.info> <4DFB44CA.1050605@pearwood.info> <4DFC2396.1060905@pearwood.info> <4DFC864D.1000200@improva.dk> Message-ID: On Jun 18, 2011, at 10:22 PM, Carl M. Johnson wrote: > On Sat, Jun 18, 2011 at 1:04 AM, Jacob Holm wrote: > >> A consequence of this would be that you could optionally allow >> "nonlocal" to bind global names when there are no suitable nonlocal >> names to bind to (e.g. in a top-level function). It has always slightly >> bothered me that you couldn't do that, because it makes it harder to >> move code between levels. > > I will say that I was surprised to discover that `nonlocal` can be > used only for outer function locals and not for globals, since the > name "nonlocal" seems very general, as if it encompassed all things > not local. If it could be changed, I think it would be a little more > intuitive. In addition, as you mention, it's slightly better for > refactoring. We should put a stop the notion that any time someone says, "I was surprised" that there needs to be a change to the language. If surprise happens because someone skipped reading the docs and made an incorrect guess about how a keyword behaves (i.e. using a new feature without reading about what it actually does), then "I was surprised" means very little. No matter what was implemented for "nonlocal", someone was going to be surprised that it didn't match their intuition. If nonlocal meant, "first match in the chain of enclosing scopes", then would you expect "nonlocal int" to write into the builtin scope? 
If nonlocal included globals, would it be a surprise that "global x" and "nonlocal x" would do exactly the same thing, but only if x already existed in the global scope? Whatever the answers, the important point is that it is hard to eliminate surprise when surprise is based on someone's guess about how a keyword is implemented. One the Python Weekly URL's quotes of the week last month was: "When did we suddenly come to expect that people could program in a language without actually learning it?" ISTM, it would be much better if language change proposals came in the form of: "change X makes the following code better and is worth the breaking of code Y and making person Z relearn what the feature does." That would get to essentials while taking the unpersuasive "I was surprised" off the table. my-two-cents-ly, Raymond From ncoghlan at gmail.com Sun Jun 19 08:56:30 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sun, 19 Jun 2011 16:56:30 +1000 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: References: <20110611133028.GD2395@chopin.edu.pl> <20110614101755.GB3754@chopin.edu.pl> <20110615002825.GA2217@chopin.edu.pl> <4DF84F7B.3010700@improva.dk> <4DF89903.6030202@pearwood.info> <20110615231536.GA2182@chopin.edu.pl> <20110616171525.GB2222@chopin.edu.pl> <4DFAE82A.8040103@pearwood.info> <4DFB44CA.1050605@pearwood.info> <4DFC2396.1060905@pearwood.info> <4DFC864D.1000200@improva.dk> Message-ID: On Sun, Jun 19, 2011 at 3:41 PM, Raymond Hettinger wrote: > ISTM, it would be much better if language change proposals came > in the form of: ?"change X makes the following code better and is > worth the breaking of code Y and making person Z relearn what > the feature does." ?That would get to essentials while taking the > unpersuasive "I was surprised" off the table. Indeed. If anyone was curious as to why I've been recently trying to steer the discussion back specifically towards Jan's original idea of replacing current (ab)uses of the default argument hack (and nothing more), this is pretty much it. We *know* those use cases exist because there is code in the wild that uses them (including in the standard library). Saying "don't do that" isn't an adequate response as, for anyone that knows about the hack and how it works, the alternatives are far harder to read (in addition to being far more verbose and error prone to write in the first place). Extrapolating to additional functionality that is difficult or impossible in current Python code is sheer speculation that is likely to weigh down any proposal with unnecessary cruft that gets it rejected. What is needed now is one or more volunteers that are willing and able to: 1. Write a PEP that: - distils the core discussion in this thread down into a specific proposal that can be brought up on python-dev - explains *why* the default argument hack is undesirable in general (unintuitive, hard to look up, harmful to introspection, invites errors when calling affected functions) - clearly articulates the uses of the default argument hack that the proposal aims to eliminate (early binding, shared locals, performance) - include real world example of such uses from the standard library (and potentially other code bases) - optionally, also describe the potential "function factories" that could be developed based on the semantics I proposed (see Eric Snow's post for a sketch of how such factories might work once given a template function to work with) 2. 
Create a reference implementation targeting Python 3.3 (with step 1 being significantly more important at this stage - while the implementation can't be called *easy*, it should be a reasonably straightforward combination of the existing code that handles nonlocal statements and that which handles the calculation and storage of default arguments). I'm not going to do it myself (I already have a couple of open PEPs that I need to make the time to follow up on), but I'm more than happy to provide pointers and advice to someone else that steps up to do so. Cheers, Nick. -- Nick Coghlan?? |?? ncoghlan at gmail.com?? |?? Brisbane, Australia From tjreedy at udel.edu Sun Jun 19 20:26:44 2011 From: tjreedy at udel.edu (Terry Reedy) Date: Sun, 19 Jun 2011 14:26:44 -0400 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: References: <20110611133028.GD2395@chopin.edu.pl> <20110614101755.GB3754@chopin.edu.pl> <20110615002825.GA2217@chopin.edu.pl> <4DF84F7B.3010700@improva.dk> <4DF89903.6030202@pearwood.info> <20110615231536.GA2182@chopin.edu.pl> <20110616171525.GB2222@chopin.edu.pl> <4DFAE82A.8040103@pearwood.info> <4DFB44CA.1050605@pearwood.info> <4DFC2396.1060905@pearwood.info> <4DFC864D.1000200@improva.dk> Message-ID: On 6/19/2011 2:56 AM, Nick Coghlan wrote: > Indeed. If anyone was curious as to why I've been recently trying to > steer the discussion back specifically towards Jan's original idea of > replacing current (ab)uses of the default argument hack (and nothing > more), this is pretty much it. Along that line, I posted the following concrete suggestion a couple of days ago, but have seen no response: ''' for the purpose of the OP, named local constants (for speed or freezing the meaning of an expression or both), we do not need [a new namespace]. There is already a fourth 'namespace' for constants, a tuple f.__code__.co_consts, whose 'names' are indexes, just as with the locals array. Given def f(a, **, b=1001, len = len): return 2001 # one possible spelling def f(a): # alternate constant b = 1001, len = len return 2001 the compiler should put 1001 and len into co.consts and convert 'b' and 'len' into the corresponding indexes, just like it does with 'a', and use the LOAD_CONST bytecode just as with literal constants like 2001 in the body. Constant names would not go into .co_names and not increment .co_argcount. This would make named constants as fast and def-time frozen as default args without the disadvantages of being included in the signature and over-writable on calls. ''' Did this not actually go through? > We *know* those use cases exist because there is code in the wild that > uses them (including in the standard library). Saying "don't do that" > isn't an adequate response as, for anyone that knows about the hack > and how it works, the alternatives are far harder to read (in addition > to being far more verbose and error prone to write in the first > place). > > Extrapolating to additional functionality that is difficult or > impossible in current Python code is sheer speculation that is likely > to weigh down any proposal with unnecessary cruft that gets it > rejected. > > What is needed now is one or more volunteers that are willing and able to: > > 1. Write a PEP that: > - distils the core discussion in this thread down into a specific > proposal that can be brought up on python-dev The above is a specific proposal (with two possible syntax spellings) and an outline of a specific implementation. 
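As an aside for readers following the thread, the run-time difference behind all of this can be inspected with the dis module; this is only a sketch and the exact opcode names vary across CPython versions:

    import dis

    def plain():
        return len("abc")     # 'len' looked up by name on every call (LOAD_GLOBAL)

    def hacked(len=len):
        return len("abc")     # 'len' bound at def time, loaded as a local (LOAD_FAST)

    dis.dis(plain)
    dis.dis(hacked)

The proposal above aims for the cheap kind of load (LOAD_FAST or LOAD_CONST) without 'len' appearing in the signature.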
> - explains *why* the default argument hack is undesirable in general > (unintuitive, hard to look up, harmful to introspection, invites > errors when calling affected functions) > - clearly articulates the uses of the default argument hack that the > proposal aims to eliminate (early binding, shared locals, performance) By 'shared locals' do you mean nonlocals? That is not quite what Jan was requesting. > - include real world example of such uses from the standard library > (and potentially other code bases) > - optionally, also describe the potential "function factories" that > could be developed based on the semantics I proposed (see Eric Snow's > post for a sketch of how such factories might work once given a > template function to work with) > 2. Create a reference implementation targeting Python 3.3 > > (with step 1 being significantly more important at this stage - while > the implementation can't be called *easy*, it should be a reasonably > straightforward combination of the existing code that handles nonlocal > statements and that which handles the calculation and storage of > default arguments). I do not see what nonlocals has to do or needs to have to do with *local* constants. My proposal is that to store named constants, we reuse the current code to recognize and calculate named defaulted locals with the current code to store and retrieve anonymous constants, -- Terry Jan Reedy From jimjjewett at gmail.com Sun Jun 19 21:28:52 2011 From: jimjjewett at gmail.com (Jim Jewett) Date: Sun, 19 Jun 2011 15:28:52 -0400 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: References: <20110611133028.GD2395@chopin.edu.pl> <20110614101755.GB3754@chopin.edu.pl> <20110615002825.GA2217@chopin.edu.pl> <4DF84F7B.3010700@improva.dk> <4DF89903.6030202@pearwood.info> <20110615231536.GA2182@chopin.edu.pl> <20110616171525.GB2222@chopin.edu.pl> <4DFAE82A.8040103@pearwood.info> <4DFB44CA.1050605@pearwood.info> Message-ID: On Fri, Jun 17, 2011 at 4:58 PM, Terry Reedy wrote: > On 6/17/2011 9:26 AM, Nick Coghlan wrote: >> This is why the nonlocal and global directives exist: to tell the >> compiler to change how it treats certain names. Arguments (including >> the associated default values) are given additional special treatment >> due to their placement in the function header. If we want to create a >> new namespace that is given special treatment by the compiler, > I do not really want a new namespace and for the purpose of the OP, > named local constants (for speed or freezing the meaning of an > expression or both), we do not need one. There is already a fourth > 'namespace' for constants, a tuple f.__code__.co_consts, whose > 'names' are indexes, just as with the locals array. I really like this idea. The only concerns I can see are losing the name for use in debugging or embedded functions, and I assume that those can be dealt with. 
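For illustration, the name loss mentioned above is visible today: arguments and locals keep their names in the code object, while compile-time constants are only reachable by position (the exact tuples vary by CPython version):

    def f(a, b=1001):
        return a + b + 2001

    print(f.__code__.co_varnames)   # ('a', 'b') -- locals are named
    print(f.__code__.co_consts)     # contains 2001 (and None); constants are indexed, not named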
-jJ From tjreedy at udel.edu Sun Jun 19 23:10:38 2011 From: tjreedy at udel.edu (Terry Reedy) Date: Sun, 19 Jun 2011 17:10:38 -0400 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: References: <20110611133028.GD2395@chopin.edu.pl> <20110614101755.GB3754@chopin.edu.pl> <20110615002825.GA2217@chopin.edu.pl> <4DF84F7B.3010700@improva.dk> <4DF89903.6030202@pearwood.info> <20110615231536.GA2182@chopin.edu.pl> <20110616171525.GB2222@chopin.edu.pl> <4DFAE82A.8040103@pearwood.info> <4DFB44CA.1050605@pearwood.info> Message-ID: On 6/19/2011 3:28 PM, Jim Jewett wrote: > On Fri, Jun 17, 2011 at 4:58 PM, Terry Reedy wrote: >> On 6/17/2011 9:26 AM, Nick Coghlan wrote: > >>> This is why the nonlocal and global directives exist: to tell the >>> compiler to change how it treats certain names. Arguments (including >>> the associated default values) are given additional special treatment >>> due to their placement in the function header. If we want to create a >>> new namespace that is given special treatment by the compiler, > >> I do not really want a new namespace and for the purpose of the OP, >> named local constants (for speed or freezing the meaning of an >> expression or both), we do not need one. There is already a fourth >> 'namespace' for constants, a tuple f.__code__.co_consts, whose >> 'names' are indexes, just as with the locals array. > > I really like this idea. > > The only concerns I can see are losing the name for use in debugging > or embedded functions, and I assume that those can be dealt with. I consider that a secondary detail. If the names are recorded, in a method that distinguishes them from parameter names, then they can be introspected, just like other local names. They could be included in locals(), though I hardly ever use that and am not familiar with the details of its runtime construction. The main problem of my idea as originally conceived is that it only really works everywhere for constant expressions limited to literals and builtin names (as were my examples). While that would be useful and eliminate one category of default arg misuse, it probably is not enough for new syntax. For general expressions, it is only guaranteed to work as intended for top-level def statements in interactive mode, which are compiled immediately before execution. Otherwise, there is a gap between creation of the code-object and first execution of the def statement. So name-resolution may be too early or even impossible (as it is for .pyc files). Or some new mechanism would be needed to patch the code object. This gets to the point that compilation time and hence code-object creation time seems more an implementation detail than part of the language def. CPython creates code objects just once for each function body, and reuses them for each def or lambda invocation, but this may be an optimization rather than language requirement. -- Terry Jan Reedy From zuo at chopin.edu.pl Mon Jun 20 13:34:11 2011 From: zuo at chopin.edu.pl (Jan Kaliszewski) Date: Mon, 20 Jun 2011 13:34:11 +0200 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: References: <4DFAE82A.8040103@pearwood.info> <4DFB44CA.1050605@pearwood.info> <4DFC2396.1060905@pearwood.info> <4DFC864D.1000200@improva.dk> Message-ID: <20110620113411.GA2177@chopin.edu.pl> Nick Coghlan dixit (2011-06-17, 17:02): [...snip...] 
> I already have too much on my to-do list to champion a PEP for this, > but I'd be happy to help someone else with the mechanics of writing > one and getting it published on python.org (hint, hint Jan!). Nick Coghlan dixit (2011-06-19, 16:56): [...snip...] > What is needed now is one or more volunteers that are willing and able to: > > 1. Write a PEP that: > - distils the core discussion in this thread down into a specific > proposal that can be brought up on python-dev > - explains *why* the default argument hack is undesirable in general > (unintuitive, hard to look up, harmful to introspection, invites > errors when calling affected functions) > - clearly articulates the uses of the default argument hack that the > proposal aims to eliminate (early binding, shared locals, performance) > - include real world example of such uses from the standard library > (and potentially other code bases) > - optionally, also describe the potential "function factories" that > could be developed based on the semantics I proposed (see Eric Snow's > post for a sketch of how such factories might work once given a > template function to work with) > 2. Create a reference implementation targeting Python 3.3 > > (with step 1 being significantly more important at this stage - while > the implementation can't be called *easy*, it should be a reasonably > straightforward combination of the existing code that handles nonlocal > statements and that which handles the calculation and storage of > default arguments). > > I'm not going to do it myself (I already have a couple of open PEPs > that I need to make the time to follow up on), but I'm more than happy > to provide pointers and advice to someone else that steps up to do so. I could do it with pleasure, at least step #1 -- i.e. writing a PEP (step #2 seems to be quite non-trivial :), though for sure would be very instructive...). *But* -- only provided that it would be known and accepted that *I could not act urgently nor quickly* at all (I got a new job recently and even don't know yet how much spare time per week would I be able to save up during incoming months). Would that be OK? Cheers. *j From mikegraham at gmail.com Mon Jun 20 16:03:57 2011 From: mikegraham at gmail.com (Mike Graham) Date: Mon, 20 Jun 2011 10:03:57 -0400 Subject: [Python-ideas] Simpler namespace packages Message-ID: I was reading over PEP 382 and am not sure I really understand how it is a worthwhile step forward. Namespace packages are currently a mess and personally if anyone asks me I recommend not using the currently-available approach at all. Would it be possible to allow a more simple definition, for example putting the dot itself in the filename? Where these would be similar? site-packages/ foo/ __init__.py foo.bar.py foo.baz/ __init__.py qux.py foo.spam.eggs.py and site-packages/ foo/ __init__.py bar.py baz/ __init__.py qux.py spam/ __init__.py #empty eggs.py Obviously this is not fully-defined, but would this kind of approach fill the same need and be nicer to use, create, find, and understand then what is proposed in PEP 382? From scialexlight at gmail.com Mon Jun 20 16:26:37 2011 From: scialexlight at gmail.com (Alex Light) Date: Mon, 20 Jun 2011 10:26:37 -0400 Subject: [Python-ideas] Simpler namespace packages In-Reply-To: References: Message-ID: On Mon, Jun 20, 2011 at 10:03 AM, Mike Graham wrote: > Would it be possible to allow a more simple definition, for example > putting the dot itself in the filename? Where these would be similar? 
> > site-packages/ > foo/ > __init__.py > foo.bar.py > foo.baz/ > __init__.py > qux.py > foo.spam.eggs.py > > and > > site-packages/ > foo/ > __init__.py > bar.py > baz/ > __init__.py > qux.py > spam/ > __init__.py #empty > eggs.py > I fail to see how the first example is clearer than the second. Indeed the opposite seems to be true. What i think you need to understand is that the name of the module is the same as the path to it. IMO the current system emphasizes that very strongly and this proposal would only make the concept less clear. --Alex -------------- next part -------------- An HTML attachment was scrubbed... URL: From ncoghlan at gmail.com Mon Jun 20 16:56:34 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Tue, 21 Jun 2011 00:56:34 +1000 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: References: <20110611133028.GD2395@chopin.edu.pl> <20110614101755.GB3754@chopin.edu.pl> <20110615002825.GA2217@chopin.edu.pl> <4DF84F7B.3010700@improva.dk> <4DF89903.6030202@pearwood.info> <20110615231536.GA2182@chopin.edu.pl> <20110616171525.GB2222@chopin.edu.pl> <4DFAE82A.8040103@pearwood.info> <4DFB44CA.1050605@pearwood.info> <4DFC2396.1060905@pearwood.info> <4DFC864D.1000200@improva.dk> Message-ID: On Mon, Jun 20, 2011 at 4:26 AM, Terry Reedy wrote: > Did this not actually go through? It did, I just forgot to reply. Sorry about that. co_consts is not a namespace - it's just a cache where the compiler stashes values (usually literals) that can be calculated at compile time (note, NOT definition time - it happens earlier than that, potentially even prior to the current application invocation if the module was cached in a .pyc file). Using it for mutable state is not possible, and thus would completely miss the point of some of the significant uses of the default argument hack. It is also quite possible for a single code object to be shared amongst multiple function definitions (e.g. when a function declaration is inside a loop). Accordingly, any new definition time state needs to go on the function object, for all the same reasons that current definition time state (i.e. annotations and default parameter values) is stored there. > I do not see what nonlocals has to do or needs to have to do with *local* constants. My proposal is that to store named constants, we reuse the current code to recognize and calculate named defaulted locals with the current code to store and retrieve anonymous constants, The nonlocals handling code is relevant because creating state that is shared between function invocations is what happens with closures, and the nonlocal statement serves to tell the compiler that names that would otherwise be considered local should instead be considered closure references. We want the new statement to do something similar: the nominated names will exist in the new definition time namespace rather than being looked up in any of the existing locations (locals, outer scopes, globals/builtins). And, as noted above, the const calculation code isn't useful because it happens at the wrong time (compilation time instead of definition time) and because code objects may be shared amongst multiple function definitions. Cheers, Nick. -- Nick Coghlan?? |?? ncoghlan at gmail.com?? |?? 
Brisbane, Australia From ncoghlan at gmail.com Mon Jun 20 17:15:51 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Tue, 21 Jun 2011 01:15:51 +1000 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: References: <20110611133028.GD2395@chopin.edu.pl> <20110614101755.GB3754@chopin.edu.pl> <20110615002825.GA2217@chopin.edu.pl> <4DF84F7B.3010700@improva.dk> <4DF89903.6030202@pearwood.info> <20110615231536.GA2182@chopin.edu.pl> <20110616171525.GB2222@chopin.edu.pl> <4DFAE82A.8040103@pearwood.info> <4DFB44CA.1050605@pearwood.info> Message-ID: On Mon, Jun 20, 2011 at 7:10 AM, Terry Reedy wrote: > The main problem of my idea as originally conceived is that it only really > works everywhere for constant expressions limited to literals and builtin > names (as were my examples). While that would be useful and eliminate one > category of default arg misuse, it probably is not enough for new syntax. Oops, I should have finished reading the thread before replying :) > For general expressions, it is only guaranteed to work as intended for > top-level def statements in interactive mode, which are compiled immediately > before execution. Otherwise, there is a gap between creation of the > code-object and first execution of the def statement. So name-resolution may > be too early or even impossible (as it is for .pyc files). Or some new > mechanism would be needed to patch the code object. > > This gets to the point that compilation time and hence code-object creation > time seems more an implementation detail than part of the language def. > CPython creates code objects just once for each function body, and reuses > them for each def or lambda invocation, but this may be an optimization > rather than language requirement. The compilation time/definition time/execution time split is part of the language definition, as is the immutability of code objects (although, interestingly enough, the compile time distinction is mentioned in the language reference, but not well defined - it is mostly implicit in the definition of the compile() builtin). An implementation doesn't *have* to reuse the code objects when multiple function objects share the same definition, but there's no real reason not to. I created an issue pointing out that these semantics should really be clarified in the execution model section of the language reference: http://bugs.python.org/issue12374 Cheers, Nick. -- Nick Coghlan?? |?? ncoghlan at gmail.com?? |?? Brisbane, Australia From ncoghlan at gmail.com Mon Jun 20 17:19:44 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Tue, 21 Jun 2011 01:19:44 +1000 Subject: [Python-ideas] 'Injecting' objects as function-local constants In-Reply-To: <20110620113411.GA2177@chopin.edu.pl> References: <4DFAE82A.8040103@pearwood.info> <4DFB44CA.1050605@pearwood.info> <4DFC2396.1060905@pearwood.info> <4DFC864D.1000200@improva.dk> <20110620113411.GA2177@chopin.edu.pl> Message-ID: On Mon, Jun 20, 2011 at 9:34 PM, Jan Kaliszewski wrote: > I could do it with pleasure, at least step #1 -- i.e. writing a PEP > (step #2 seems to be quite non-trivial :), though for sure would be very > instructive...). > > *But* -- only provided that it would be known and accepted that *I could > not act urgently nor quickly* at all (I got a new job recently and even > don't know yet how much spare time per week would I be able to save up > during incoming months). > > Would that be OK? That's fine, there's still quite a lot of time before the first alpha of 3.3. 
The main goal is to get something captured in the form of a PEP so even if nothing happens for a while, the discussion doesn't have to restart from scratch when the topic comes up again. Cheers, Nick. -- Nick Coghlan?? |?? ncoghlan at gmail.com?? |?? Brisbane, Australia From mikegraham at gmail.com Mon Jun 20 18:24:06 2011 From: mikegraham at gmail.com (Mike Graham) Date: Mon, 20 Jun 2011 12:24:06 -0400 Subject: [Python-ideas] Simpler namespace packages In-Reply-To: References: Message-ID: On Mon, Jun 20, 2011 at 10:26 AM, Alex Light wrote: > On Mon, Jun 20, 2011 at 10:03 AM, Mike Graham wrote: >> >> Would it be possible to allow a more simple definition, for example >> putting the dot itself in the filename? Where these would be similar? >> >> site-packages/ >> ? ?foo/ >> ? ? ? ?__init__.py >> ? ?foo.bar.py >> ? ?foo.baz/ >> ? ? ? ?__init__.py >> ? ? ? ?qux.py >> ? ? foo.spam.eggs.py >> >> and >> >> site-packages/ >> ? ?foo/ >> ? ? ? ?__init__.py >> ? ? ? ?bar.py >> ? ? ? ?baz/ >> ? ? ? ? ? ?__init__.py >> ? ? ? ? ? ?qux.py >> ? ? ? ?spam/ >> ? ? ? ? ? ?__init__.py #empty >> ? ? ? ? ? ?eggs.py > > I fail to see how the first example is clearer than the second. Indeed > the?opposite?seems to be true. > What i think you need to understand is that the name of the module is the > same as the path to it. > IMO the current system emphasizes that very strongly and this proposal would > only make the > concept less clear. > --Alex The first example ISN'T clearer than the second. I think the second is more easily understood to be sure. The first example, I'm saying, may be clearer than other implementations of namespace packages, not clearer than a normal package. A namespace package is one where the top level(s) aren't for a normal package. If foo.bar and foo.baz and foo.qux were distributed completely separately, foo would be a namespace package. Currently, a namespace package might look like site-packages/ foo/ # No __init__.py in foo bar/ __init__.py spam.py foo.bar-1.2.3-py2.7-nspkg.pth # A pth file to tell Python how to import the package foo.bar-1.2.3-py2.7.egg-info/ namespace_packages.txt # This would say "foo" in it ....other stuff and PEP382 tries to improve the situation a bit. I'm hoping we can come up with something that is easily understood. Mike From scialexlight at gmail.com Tue Jun 21 03:24:27 2011 From: scialexlight at gmail.com (Alex Light) Date: Mon, 20 Jun 2011 21:24:27 -0400 Subject: [Python-ideas] Simpler namespace packages In-Reply-To: References: Message-ID: Sorry mixed up first and second examples -alex On Jun 20, 2011 12:24 PM, "Mike Graham" wrote: > On Mon, Jun 20, 2011 at 10:26 AM, Alex Light wrote: >> On Mon, Jun 20, 2011 at 10:03 AM, Mike Graham wrote: >>> >>> Would it be possible to allow a more simple definition, for example >>> putting the dot itself in the filename? Where these would be similar? >>> >>> site-packages/ >>> foo/ >>> __init__.py >>> foo.bar.py >>> foo.baz/ >>> __init__.py >>> qux.py >>> foo.spam.eggs.py >>> >>> and >>> >>> site-packages/ >>> foo/ >>> __init__.py >>> bar.py >>> baz/ >>> __init__.py >>> qux.py >>> spam/ >>> __init__.py #empty >>> eggs.py >> >> I fail to see how the first example is clearer than the second. Indeed >> the opposite seems to be true. >> What i think you need to understand is that the name of the module is the >> same as the path to it. >> IMO the current system emphasizes that very strongly and this proposal would >> only make the >> concept less clear. >> --Alex > > The first example ISN'T clearer than the second. 
I think the second is > more easily understood to be sure. The first example, I'm saying, may > be clearer than other implementations of namespace packages, not > clearer than a normal package. > > A namespace package is one where the top level(s) aren't for a normal > package. If foo.bar and foo.baz and foo.qux were distributed > completely separately, foo would be a namespace package. > > Currently, a namespace package might look like > > site-packages/ > foo/ # No __init__.py in foo > bar/ > __init__.py > spam.py > foo.bar-1.2.3-py2.7-nspkg.pth # A pth file to tell Python how to > import the package > foo.bar-1.2.3-py2.7.egg-info/ > namespace_packages.txt # This would say "foo" in it > ....other stuff > > and PEP382 tries to improve the situation a bit. I'm hoping we can > come up with something that is easily understood. > > > Mike -------------- next part -------------- An HTML attachment was scrubbed... URL: From sven at marnach.net Thu Jun 23 19:53:29 2011 From: sven at marnach.net (Sven Marnach) Date: Thu, 23 Jun 2011 18:53:29 +0100 Subject: [Python-ideas] A few suggestions for the random module Message-ID: <20110623175329.GF4696@pantoffel-wg.de> I'd like to suggest what I consider a few minor improvements to Python's random module. 1. random.expovariate(lambd) The current implementation [1] is random = self.random u = random() while u <= 1e-7: u = random() return -_log(u)/lambd I'd suggest to simplify this to return -log(1.0 - self.random())/lambd self.random() returns a float in the half-open interval [0.0, 1.0), so 1.0 - self.random() is in the half-open interval (0.0, 1.0], which is exactly what we need here. Even if the random number gets as close to 1.0 as possible within the limits of double precision, taking the logarithm won't be any problem (the lowest occuring value of log(...) being roughly -36.7). The suggested implementation is not only simpler and faster, it is also more correct. The current implementation will never return 0.0, although this is a perfectly valid return value, and it chooses the arbitrary cut-off value 1e7, while the suggested implementation is only limited by floating point precision and the range of random.random(). [1]: http://hg.python.org/cpython/file/54fb77e0762c/Lib/random.py#l392 2. random.sample(population, k) The current implementation [2] chooses one of two algorithms, depending on whether the number of samples k is relatively small compared to the size n of the population. I suggest to add reservoir sampling [3] as a third algorithm, used in either of the following cases: * k is "close" to n (for example k > .75 * n) * population is not a sequence (but of course an iterable) Reservoir sampling would only use O(1) additional memory, compared to O(n) for the current algorithm for relativly large values of k. It would also facilitate to select samples from iterables without storing a complete list of all items in memory at any time. It could also be used for smaller values of k, trading reduced memory usage for increased execution time. Example implementation: it = iter(population) result = list(_islice(it, k)) self.shuffle(result) for i, x in enumerate(it, k + 1): j = randbelow(i) if j < k: result[j] = x We need to call self.shuffle(result) here to ensure that all subslices are valid random samples again. An alternative to adding this algorithm to the current sample() function is to add a new function sample_iterable() or similar. (This new function probably shouldn't call shuffle(), since the caller can always do this if required.) 
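(For readers who want to experiment outside the Random class, below is a self-contained rendering of the same reservoir idea using the module-level API; sample_iterable is a made-up name, and unlike the sketch above it does not shuffle the initial block, so only membership -- not order -- is uniformly random.)

    import random
    from itertools import islice

    def sample_iterable(iterable, k):
        it = iter(iterable)
        reservoir = list(islice(it, k))      # the first k items fill the reservoir
        if len(reservoir) < k:
            raise ValueError("sample larger than population")
        for i, x in enumerate(it, k):        # i is the 0-based position of x in the stream
            j = random.randrange(i + 1)      # uniform in [0, i]
            if j < k:
                reservoir[j] = x             # accepted with probability k/(i+1)
        return reservoir

    sample_iterable(range(10**6), 5)         # keeps only 5 items in memory at a time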
[2]: http://hg.python.org/cpython/file/54fb77e0762c/Lib/random.py#l268 [3]: http://en.wikipedia.org/wiki/Reservoir_sampling 3. random.choice(seq) Similarly to the above explanation, random.choice() could be generalised to arbitrary iterables. Of course the current O(1) alogrithm should be used if a sequence is passed. In case of positive feedback on one or more of those points, I'd be happy to prepare a patch. -- Sven From dickinsm at gmail.com Thu Jun 23 22:43:32 2011 From: dickinsm at gmail.com (Mark Dickinson) Date: Thu, 23 Jun 2011 22:43:32 +0200 Subject: [Python-ideas] A few suggestions for the random module In-Reply-To: <20110623175329.GF4696@pantoffel-wg.de> References: <20110623175329.GF4696@pantoffel-wg.de> Message-ID: On Thu, Jun 23, 2011 at 7:53 PM, Sven Marnach wrote: > I'd like to suggest what I consider a few minor improvements to > Python's random module. > [...] Agreed on point 1 (implementation of random.expovariate). For all three issues, I suggest filing a report in the issue tracker. (Probably one separate report per item, though perhaps items 2 and 3 could be combined.) Mark From stutzbach at google.com Thu Jun 23 23:21:19 2011 From: stutzbach at google.com (Daniel Stutzbach) Date: Thu, 23 Jun 2011 14:21:19 -0700 Subject: [Python-ideas] A few suggestions for the random module In-Reply-To: <20110623175329.GF4696@pantoffel-wg.de> References: <20110623175329.GF4696@pantoffel-wg.de> Message-ID: +1 on all three suggestions. As Mark said, please file these ideas at http://bugs.python.org so we can keep track of them. I think the second and third suggestions can be combined as one issue. It would also be helpful if you could supply a patch for the changes (and for the second and third suggestions, unit tests to exercise the new functionality). -- Daniel Stutzbach -------------- next part -------------- An HTML attachment was scrubbed... URL: From raymond.hettinger at gmail.com Fri Jun 24 00:02:20 2011 From: raymond.hettinger at gmail.com (Raymond Hettinger) Date: Fri, 24 Jun 2011 00:02:20 +0200 Subject: [Python-ideas] A few suggestions for the random module In-Reply-To: <20110623175329.GF4696@pantoffel-wg.de> References: <20110623175329.GF4696@pantoffel-wg.de> Message-ID: <346EAC9B-7A22-4EAA-B15A-F8C33F280E97@gmail.com> On Jun 23, 2011, at 7:53 PM, Sven Marnach wrote: > I'd like to suggest what I consider a few minor improvements to > Python's random module. You can create a feature request on the bug tracker and assign to me. > > 1. random.expovariate(lambd) This seems reasonable > > 2. random.sample(population, k) This may be a unnecessary optimization (not worth the complexity), but I will look at it further. > > 3. random.choice(seq) It could be generalized to arbitrary iterables (Bentley provides an example of how to do this) but it is fragile (i.e. falls apart badly with weak random number generators) and doesn't correspond well with real use cases. Raymond -------------- next part -------------- An HTML attachment was scrubbed... URL: From sven at marnach.net Sat Jun 25 20:37:16 2011 From: sven at marnach.net (Sven Marnach) Date: Sat, 25 Jun 2011 19:37:16 +0100 Subject: [Python-ideas] A few suggestions for the random module In-Reply-To: <346EAC9B-7A22-4EAA-B15A-F8C33F280E97@gmail.com> References: <20110623175329.GF4696@pantoffel-wg.de> <346EAC9B-7A22-4EAA-B15A-F8C33F280E97@gmail.com> Message-ID: <20110625183716.GA3377@pantoffel-wg.de> Raymond Hettinger schrieb am Fr, 24. Jun 2011, um 00:02:20 +0200: > 1. 
random.expovariate(lambd) > > This seems reasonable

When I started to prepare a patch, I noticed you already implemented this one. Thanks! > 2. random.sample(population, k) > > This may be a unnecessary optimization (not worth the complexity), > but I will look at it further. The main point was the generalization to arbitrary iterables, the reduced memory usage being just a positive side effect. I'm not really convinced of the idea any more since it doesn't really fit well in the current `sample()` implementation. If someone wants to save some memory in the case that k is not much less than n, she can just copy the population, `shuffle()` it and delete the unneeded part. The only question that remains is if it is worthwhile to introduce a new function for this purpose. While there definitely are use cases for the algorithm, I simply don't know if they are common enough to justify a function in the standard library. > 3. random.choice(seq) > > It could be generalized to arbitrary iterables (Bentley provides an > example of how to do this) but it is fragile (i.e. falls apart badly with > weak random number generators) and doesn't correspond well with real use > cases. Again, there definitely are real world use cases. If the above-mentioned function were introduced, this one would simply be the special case k = 1. -- Sven From g.rodola at gmail.com Wed Jun 29 19:00:01 2011 From: g.rodola at gmail.com (Giampaolo Rodolà) Date: Wed, 29 Jun 2011 19:00:01 +0200 Subject: [Python-ideas] Adding shutil.disk_usage() In-Reply-To: References: Message-ID: This is now tracked at http://bugs.python.org/issue12442 Regards, --- Giampaolo http://code.google.com/p/pyftpdlib/ http://code.google.com/p/psutil/ From sturla at molden.no Thu Jun 30 13:38:07 2011 From: sturla at molden.no (Sturla Molden) Date: Thu, 30 Jun 2011 13:38:07 +0200 Subject: [Python-ideas] dir with a glob? Message-ID: <4E0C601F.5010001@molden.no> Often when exploring an object with the 'dir' function, particularly in large packages like SciPy, I find that I need to filter the output. Since a dir reminds me of a dos 'dir' or linux 'ls', a glob feels like the most natural to use. For example, none of these would work:

>>> dir(sp.fft.i*)   # syntax error
>>> dir('sp.fft.i*') # returns the attributes of a string

I believe a new dir function is needed, or a change in the behaviour of the current version. Of course a 'glob aware dir function' can be implemented by monitoring the call stack (cf. sys._getframe().f_back), but I think the problem is general enough to warrant an official solution. Sturla From ben+python at benfinney.id.au Thu Jun 30 14:29:41 2011 From: ben+python at benfinney.id.au (Ben Finney) Date: Thu, 30 Jun 2011 22:29:41 +1000 Subject: [Python-ideas] dir with a glob? References: <4E0C601F.5010001@molden.no> Message-ID: <87liwjlcy2.fsf@benfinney.id.au> Sturla Molden writes: > Often when exploring an object with the 'dir' function, particularly > in large packages like SciPy, I find that I need to filter the output. We have list comprehensions and generator expressions to filter a sequence, and I suspect they would serve you well for this purpose. > Since a dir reminds me of a dos 'dir' or linux 'ls', a glob feels like > the most natural to use. > > For example, none of these would work: > > >>> dir(sp.fft.i*)  # syntax error What would you expect this to return? The 'dir' function is specifically for inspecting *one* object. Do you just want all the attributes of all the matching objects mixed up together, or what?
> I believe a new dir functions is needed, or a change in the behviour > of the current version. This is ?python-ideas?. What is your idea for the desired behaviour? What about using a list comprehension or generator expression to get what you want? -- \ ?A learning experience is one of those things that say, ?You | `\ know that thing you just did? Don't do that.?? ?Douglas Adams, | _o__) 2000-04-05 | Ben Finney From masklinn at masklinn.net Thu Jun 30 14:38:09 2011 From: masklinn at masklinn.net (Masklinn) Date: Thu, 30 Jun 2011 14:38:09 +0200 Subject: [Python-ideas] dir with a glob? In-Reply-To: <87liwjlcy2.fsf@benfinney.id.au> References: <4E0C601F.5010001@molden.no> <87liwjlcy2.fsf@benfinney.id.au> Message-ID: <64A24A6E-634B-46BB-AB68-2F4579B861CA@masklinn.net> On 2011-06-30, at 14:29 , Ben Finney wrote: > Sturla Molden writes: >> I believe a new dir functions is needed, or a change in the behviour >> of the current version. > > This is ?python-ideas?. What is your idea for the desired behaviour? > > What about using a list comprehension or generator expression to get > what you want? Or a good ol' `filter`: filter(methodcaller('startswith', 'i'), dir(sp.fft)) From jsbueno at python.org.br Thu Jun 30 14:41:15 2011 From: jsbueno at python.org.br (Joao S. O. Bueno) Date: Thu, 30 Jun 2011 08:41:15 -0400 Subject: [Python-ideas] dir with a glob? In-Reply-To: <87liwjlcy2.fsf@benfinney.id.au> References: <4E0C601F.5010001@molden.no> <87liwjlcy2.fsf@benfinney.id.au> Message-ID: On Thu, Jun 30, 2011 at 8:29 AM, Ben Finney wrote: > Sturla Molden writes: > >> Often when exploring an object with the 'dir' function, particularly >> in large packages like SciPy, I find that I need to filter the outout. > > We have list comprehensions and generator expressions to filter a > sequence, and I suspect they would serve you well for this purpose. The problem is that dir is generally used on a live section, and a list comprehension to filter dir results is way to verbose to type when you need it. > >> Since a dir reminds me of a dos 'dir' or linux 'ls', a glob feels like >> the most natural to use. >> >> For example, none of these would work: >> >> >>> dir(sp.fft.i*) ?# syntax error > > What would you expect this to return? The ?dir? function is specifically > for inspecting *one* object. Do you just want all the attributes of all > the matching objects mixed up together, or what? > >> I believe a new dir functions is needed, or a change in the behviour >> of the current version. > > This is ?python-ideas?. What is your idea for the desired behaviour? > > What about using a list comprehension or generator expression to get > what you want? I believe a second parameter to dir, being a glob filter string, would be fine. Being able to type: dir (gtk.Window, "set*") , for example instead of [attrib for attrib in dir(gtk.Window) if attrib.startswith("set")] would be indeed a good thing to have. js -><- > -- > ?\ ? ? ? ??A learning experience is one of those things that say, ?You | > ?`\ ? ?know that thing you just did? Don't do that.?? ?Douglas Adams, | > _o__) ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? 
2000-04-05 | > Ben Finney > > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > From grosser.meister.morti at gmx.net Thu Jun 30 15:09:46 2011 From: grosser.meister.morti at gmx.net (=?ISO-8859-1?Q?Mathias_Panzenb=F6ck?=) Date: Thu, 30 Jun 2011 15:09:46 +0200 Subject: [Python-ideas] dir with a glob? In-Reply-To: <4E0C601F.5010001@molden.no> References: <4E0C601F.5010001@molden.no> Message-ID: <4E0C759A.4000904@gmx.net> I would rather add a grep method to regular expression objects: def grep(self,sequence): for s in sequence: if self.search(s): yield s And a global function to the re module: def grep(regex,sequence,flags=0): return re.compile(regex,flags).grep(sequence) Then you could do: re.grep("^i", dir(sp.fft)) Yes, it would be more to type but it would be more general. OT: I think it would be good if regular expressions would be callable with this method: def __call__(self,s): return self.search(s) is not None Then one could use it in a filter expression: filter(re.compile("^i"), dir(sp.fft)) On 06/30/2011 01:38 PM, Sturla Molden wrote: > Often when exploring an object with the 'dir' function, particularly in large packages like SciPy, I > find that I need to filter the outout. Since a dir reminds me of a dos 'dir' or linux 'ls', a glob > feels like the most natural to use. > > For example, none of these would work: > > >>> dir(sp.fft.i*) # syntax error > >>> dir('sp.fft.i*') # returns the attributes a string > > I believe a new dir functions is needed, or a change in the behviour of the current version. > > Of course a 'glob aware dir function' can be implemented by monitoring the call stack (cf. > sys._getframe().f_back), but I think the problem is general enough to warrant an official sultion. > > Sturla From sturla at molden.no Thu Jun 30 15:14:59 2011 From: sturla at molden.no (Sturla Molden) Date: Thu, 30 Jun 2011 15:14:59 +0200 Subject: [Python-ideas] dir with a glob? In-Reply-To: References: <4E0C601F.5010001@molden.no> <87liwjlcy2.fsf@benfinney.id.au> Message-ID: <4E0C76D3.1060606@molden.no> Den 30.06.2011 14:41, skrev Joao S. O. Bueno: > dir (gtk.Window, "set*") Yes, something like that would be conscise enough for an interactive session. Sturla From sturla at molden.no Thu Jun 30 15:27:16 2011 From: sturla at molden.no (Sturla Molden) Date: Thu, 30 Jun 2011 15:27:16 +0200 Subject: [Python-ideas] dir with a glob? In-Reply-To: <87liwjlcy2.fsf@benfinney.id.au> References: <4E0C601F.5010001@molden.no> <87liwjlcy2.fsf@benfinney.id.au> Message-ID: <4E0C79B4.3030209@molden.no> Den 30.06.2011 14:29, skrev Ben Finney: > We have list comprehensions and generator expressions to filter a > sequence, and I suspect they would serve you well for this purpose. Dir is often used to explore objects or modules interactively, which means we need a very short syntax for this to be handy. Perhaps something like this: dir(object, "foo*") Making an utility function like this with glob or regex is of course extremely simple. Anyone can do it for themselves, how ever they like. What I am asking is if the need to filter the output from dir is so common that it could warrant a change to Python? Sturla From mikegraham at gmail.com Thu Jun 30 15:30:29 2011 From: mikegraham at gmail.com (Mike Graham) Date: Thu, 30 Jun 2011 09:30:29 -0400 Subject: [Python-ideas] dir with a glob? 
In-Reply-To: <4E0C79B4.3030209@molden.no> References: <4E0C601F.5010001@molden.no> <87liwjlcy2.fsf@benfinney.id.au> <4E0C79B4.3030209@molden.no> Message-ID: On Thu, Jun 30, 2011 at 9:27 AM, Sturla Molden wrote: > Dir is often used to explore objects or modules interactively, This strikes me as a mistake. If something is made more featureful, it should be help, not dir. From ncoghlan at gmail.com Thu Jun 30 15:34:00 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Thu, 30 Jun 2011 23:34:00 +1000 Subject: [Python-ideas] dir with a glob? In-Reply-To: <4E0C601F.5010001@molden.no> References: <4E0C601F.5010001@molden.no> Message-ID: On Thu, Jun 30, 2011 at 9:38 PM, Sturla Molden wrote: > Often when exploring an object with the 'dir' function, particularly in > large packages like SciPy, I find that I need to filter the outout. Since a > dir reminds me of a dos 'dir' or linux 'ls', a glob feels like the most > natural to use. > > For example, none of these would work: > >>>> dir(sp.fft.i*) ?# syntax error >>>> dir('sp.fft.i*') # returns the attributes a string import fnmatch def glob_dir(obj, pattern): return fnmatch.filter(dir(obj), pattern) >>> print(glob_dir(str, "c*")) ['capitalize', 'center', 'count'] If it's something you use often, drop it into a utility module or PYTHONSTARTUP Cheers, Nick. -- Nick Coghlan?? |?? ncoghlan at gmail.com?? |?? Brisbane, Australia From ncoghlan at gmail.com Thu Jun 30 15:36:39 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Thu, 30 Jun 2011 23:36:39 +1000 Subject: [Python-ideas] dir with a glob? In-Reply-To: <4E0C79B4.3030209@molden.no> References: <4E0C601F.5010001@molden.no> <87liwjlcy2.fsf@benfinney.id.au> <4E0C79B4.3030209@molden.no> Message-ID: On Thu, Jun 30, 2011 at 11:27 PM, Sturla Molden wrote: > What I am asking is if the need to filter the output from dir is so common > that it could warrant a change to Python? No, if an object is complicated enough that pprint(dir(obj)) and help(obj) aren't adequate to explore it, then it is time to go read the documentation (or the source, if the documentation is lacking). The interactive prompt is just one of the available tools for code exploration, it doesn't make sense to try to make it do everything. Cheers, Nick. -- Nick Coghlan?? |?? ncoghlan at gmail.com?? |?? Brisbane, Australia From ben+python at benfinney.id.au Thu Jun 30 15:41:29 2011 From: ben+python at benfinney.id.au (Ben Finney) Date: Thu, 30 Jun 2011 23:41:29 +1000 Subject: [Python-ideas] dir with a glob? References: <4E0C601F.5010001@molden.no> <87liwjlcy2.fsf@benfinney.id.au> <4E0C79B4.3030209@molden.no> Message-ID: <87hb77l9me.fsf@benfinney.id.au> Sturla Molden writes: > dir(object, "foo*") I ask again: what would you expect (give an example) the output of this to be? > What I am asking is if the need to filter the output from dir is so > common that it could warrant a change to Python? Given that we already have ways to filter a sequence built into the language, I doubt the need for a special way to filter the output from some particular function. -- \ ?I used to be a proofreader for a skywriting company.? ?Steven | `\ Wright | _o__) | Ben Finney From p.f.moore at gmail.com Thu Jun 30 15:43:17 2011 From: p.f.moore at gmail.com (Paul Moore) Date: Thu, 30 Jun 2011 14:43:17 +0100 Subject: [Python-ideas] dir with a glob? 
In-Reply-To: <4E0C76D3.1060606@molden.no> References: <4E0C601F.5010001@molden.no> <87liwjlcy2.fsf@benfinney.id.au> <4E0C76D3.1060606@molden.no> Message-ID: On 30 June 2011 14:14, Sturla Molden wrote: > Den 30.06.2011 14:41, skrev Joao S. O. Bueno: >> >> dir (gtk.Window, "set*") > > Yes, something like that would be conscise enough for an interactive > session. Sounds like what you're really after is a richer interactive environment. Have you looked at IPython? (Disclaimer: I don't use it myself, but I know it adds *lots* of features to make the interactive experience better and more productive). Paul. From __peter__ at web.de Thu Jun 30 16:54:03 2011 From: __peter__ at web.de (Peter Otten) Date: Thu, 30 Jun 2011 16:54:03 +0200 Subject: [Python-ideas] dir with a glob? References: <4E0C601F.5010001@molden.no> <87liwjlcy2.fsf@benfinney.id.au> <4E0C79B4.3030209@molden.no> <87hb77l9me.fsf@benfinney.id.au> Message-ID: Ben Finney wrote: > Sturla Molden writes: > >> dir(object, "foo*") > > I ask again: what would you expect (give an example) the output of this > to be? If I were to guess: >>> import __builtin__, fnmatch >>> def dir(obj, glob=None): ... names = __builtin__.dir(obj) ... if glob is not None: ... names = fnmatch.filter(names, glob) ... return names ... Example usage: what was the name of the function to remove a directory and all empty parents again? >>> import os >>> dir(os) [snip list with more than 200 names] >>> dir(os, "r*d*") ['read', 'readlink', 'removedirs', 'rmdir'] Ah, I think I remember it now... >> What I am asking is if the need to filter the output from dir is so >> common that it could warrant a change to Python? At the moment I'm writing list comprehensions in cases like the above, but I'd welcome the addition of a glob or regex argument to dir(). > Given that we already have ways to filter a sequence built into the > language, I doubt the need for a special way to filter the output from > some particular function. From steve at pearwood.info Thu Jun 30 17:54:29 2011 From: steve at pearwood.info (Steven D'Aprano) Date: Fri, 01 Jul 2011 01:54:29 +1000 Subject: [Python-ideas] dir with a glob? In-Reply-To: References: <4E0C601F.5010001@molden.no> <87liwjlcy2.fsf@benfinney.id.au> <4E0C79B4.3030209@molden.no> Message-ID: <4E0C9C35.4080405@pearwood.info> Nick Coghlan wrote: > On Thu, Jun 30, 2011 at 11:27 PM, Sturla Molden wrote: >> What I am asking is if the need to filter the output from dir is so common >> that it could warrant a change to Python? > > No, if an object is complicated enough that pprint(dir(obj)) and > help(obj) aren't adequate to explore it, then it is time to go read > the documentation (or the source, if the documentation is lacking). Do you think that *reading the source* is to be preferred over a simple tool like running a filter over the output of dir()? I'm not exactly sure what point you're trying to make, but I don't think it's a good one. > The interactive prompt is just one of the available tools for code > exploration, it doesn't make sense to try to make it do everything. But the interactive prompt already does everything. Anything you can do in Python can be done at the prompt. The question we are asking is not should users be able to filter the output of dir, because of course they can. The question we are asking is, should there be One Obvious Way to filter the output of dir, rather than the status quo (people do without, reinvent the wheel, or struggle with non-obvious, inconvenient and verbose ways). I'm +1 on adding a glob filter to dir. 
-- 
Steven

From steve at pearwood.info  Thu Jun 30 18:31:38 2011
From: steve at pearwood.info (Steven D'Aprano)
Date: Fri, 01 Jul 2011 02:31:38 +1000
Subject: [Python-ideas] dir with a glob?
In-Reply-To: <87hb77l9me.fsf@benfinney.id.au>
References: <4E0C601F.5010001@molden.no> <87liwjlcy2.fsf@benfinney.id.au> <4E0C79B4.3030209@molden.no> <87hb77l9me.fsf@benfinney.id.au>
Message-ID: <4E0CA4EA.2010705@pearwood.info>

Ben Finney wrote:
> Sturla Molden writes:
>
>> dir(object, "foo*")
>
> I ask again: what would you expect (give an example) the output of this
> to be?

That specific example should return an empty list, since object has no attributes that match the glob foo*.

>> What I am asking is if the need to filter the output from dir is so
>> common that it could warrant a change to Python?
>
> Given that we already have ways to filter a sequence built into the
> language, I doubt the need for a special way to filter the output from
> some particular function.

There is precedent, though.

Many string methods have special ways to limit their output to some subset of results, rather than relying on a generic, built-in way to do the same thing:

    mystr.find("spam", 23, 42)  vs  mystr[23:42].find("spam")

dir itself already duplicates functionality available elsewhere:

    >>> dir() == sorted(globals())
    True

dir with an argument is a little trickier, but it's not far off a one-liner:

    dir(d) == sorted(
        set(sum((o.__dict__.keys() for o in ((d,)+type(d).__mro__)), []))
        )

(at least for new-style objects with no __slots__).

dir is a convenience function, designed for interactive use. The docs make it explicit:

[quote]
Because dir() is supplied primarily as a convenience for use at an interactive prompt, it tries to supply an interesting set of names more than it tries to supply a rigorously or consistently defined set of names...

http://docs.python.org/library/functions.html#dir

Given that, I see no downside to making dir more convenient for interactive use. That's what it's for.

-- 
Steven

From g.brandl at gmx.net  Thu Jun 30 19:22:39 2011
From: g.brandl at gmx.net (Georg Brandl)
Date: Thu, 30 Jun 2011 19:22:39 +0200
Subject: [Python-ideas] dir with a glob?
In-Reply-To: <4E0CA4EA.2010705@pearwood.info>
References: <4E0C601F.5010001@molden.no> <87liwjlcy2.fsf@benfinney.id.au> <4E0C79B4.3030209@molden.no> <87hb77l9me.fsf@benfinney.id.au> <4E0CA4EA.2010705@pearwood.info>
Message-ID: 

On 30.06.2011 18:31, Steven D'Aprano wrote:
> Ben Finney wrote:
>> Sturla Molden writes:
>>
>>> dir(object, "foo*")
>>
>> I ask again: what would you expect (give an example) the output of this
>> to be?
>
> That specific example should return an empty list, since object has no
> attributes that match the glob foo*.
>
>
>>> What I am asking is if the need to filter the output from dir is so
>>> common that it could warrant a change to Python?
>>
>> Given that we already have ways to filter a sequence built into the
>> language, I doubt the need for a special way to filter the output from
>> some particular function.
>
> There is precedent, though.
>
> Many string methods have special ways to limit their output to some
> subset of results, rather than relying on a generic, built-in way to do
> the same thing:
>
>     mystr.find("spam", 23, 42)  vs  mystr[23:42].find("spam")
>
>
> dir itself already duplicates functionality available elsewhere:
>
>     >>> dir() == sorted(globals())
>     True
>
> dir with an argument is a little trickier, but it's not far off a one-liner:
>
>     dir(d) == sorted(
>         set(sum((o.__dict__.keys() for o in ((d,)+type(d).__mro__)), []))
>         )
>
> (at least for new-style objects with no __slots__).

And don't forget about __dir__...

> dir is a convenience function, designed for interactive use. The docs
> make it explicit:
>
> [quote]
> Because dir() is supplied primarily as a convenience for use at an
> interactive prompt, it tries to supply an interesting set of names more
> than it tries to supply a rigorously or consistently defined set of names...
>
> http://docs.python.org/library/functions.html#dir
>
>
> Given that, I see no downside to making dir more convenient for
> interactive use. That's what it's for.

I agree. I am often looking for a specific member that I know exists, but don't recall the exact name (and in particular, not what the name starts with: at least dir() output is sorted). Searching through one screenful of members isn't pretty.

So, +1 for the second argument.

Georg

From g.brandl at gmx.net  Thu Jun 30 19:27:12 2011
From: g.brandl at gmx.net (Georg Brandl)
Date: Thu, 30 Jun 2011 19:27:12 +0200
Subject: [Python-ideas] dir with a glob?
In-Reply-To: <4E0C759A.4000904@gmx.net>
References: <4E0C601F.5010001@molden.no> <4E0C759A.4000904@gmx.net>
Message-ID: 

On 30.06.2011 15:09, Mathias Panzenböck wrote:
> I would rather add a grep method to regular expression objects:
>
>     def grep(self,sequence):
>         for s in sequence:
>             if self.search(s):
>                 yield s
>
> And a global function to the re module:
>
>     def grep(regex,sequence,flags=0):
>         return re.compile(regex,flags).grep(sequence)
>
>
> Then you could do:
>
>     re.grep("^i", dir(sp.fft))
>
> Yes, it would be more to type but it would be more general.

No, you couldn't, since this would return a generator -- an explicit list() would be required.

> OT: I think it would be good if regular expressions would be callable with this method:
>
>     def __call__(self,s):
>         return self.search(s) is not None
>
> Then one could use it in a filter expression:
>
>     filter(re.compile("^i"), dir(sp.fft))

What's wrong with

    filter(re.compile("^i").search, dir(sp.fft))

? And for non-interactive use, you almost always have a compiled regex object already, so that this becomes

    filter(rex.search, seq)

which is not sufficiently harder than

    rex.grep(seq)

to justify a new regex method. (Also, people won't be able to remember if grep() uses match() or search().)

Georg
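For reference, a minimal sketch that pulls together the two idioms discussed in this thread -- Peter Otten's glob-filtering wrapper around dir() and the compiled-regex filter() spelling Georg suggests. The helper names dir_glob and dir_regex are illustrative only, not proposed builtins, and the sketch assumes Python 3 naming (builtins rather than __builtin__):

    import builtins
    import fnmatch
    import re

    def dir_glob(obj, pattern=None):
        # dir() with an optional fnmatch-style glob, in the spirit of
        # Peter Otten's example above.
        names = builtins.dir(obj)
        if pattern is not None:
            names = fnmatch.filter(names, pattern)
        return names

    def dir_regex(obj, pattern):
        # The filter()-plus-compiled-regex idiom; list() because filter()
        # returns an iterator in Python 3.
        return list(filter(re.compile(pattern).search, dir(obj)))

    # Example usage (exact output depends on the Python version):
    import os
    print(dir_glob(os, "r*d*"))    # e.g. ['read', 'readlink', 'removedirs', 'rmdir']
    print(dir_regex(os, "^r.*d"))  # a regex spelling of a similar search

Either helper keeps the filtering at the prompt to a single call; the glob form reads more naturally for quick lookups, while the regex form is more general.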