From cmjohnson.mailinglist at gmail.com Wed Sep 2 12:34:49 2009 From: cmjohnson.mailinglist at gmail.com (Carl Johnson) Date: Wed, 2 Sep 2009 00:34:49 -1000 Subject: [Python-ideas] [Python-Dev] Decorator syntax In-Reply-To: References: Message-ID: <3bdda690909020334m4c81cecdm8f57dce54c2da398@mail.gmail.com> Crossposting to Python-ideas, I asked for the same change to the grammar a couple months back on python-ideas. See http://mail.python.org/pipermail/python-ideas/2009-February/thread.html#2787 I'm all for it, but you'll have to convince Guido that this won't result in confusing to read code. My own examples, unfortunately did not advance your cause, as Guido explained, "My brain hurts trying to understand all this. I don't think this bodes well as a use case for a proposed feature." :-D The trouble is that I was using lambdas upon lambdas to do all kinds of Ruby block-esque tricks. OTOH, if you come up with some simple, clear use cases though, and I think he might still be persuadable to make a simple change to the grammar. ? Carl Johnson Rob Cliffe wrote: > Hi All, > This is my first post to python-dev so I will briefly introduce myself:? My > name is Rob Cliffe and I am a commercial programmer living in London, UK.? I > have some 30 years of programming experience but have only been using Python > for a couple of years. > First I want to say what a fantastic language Python is.? It is THE best > language for development in my opinion, and a joy to use. > > My specific issue: > I eventually got my head round decorator syntax and realised that what came > after the '@' was (basically) a function that took a function as argument > and returned a function as result. > However it seems to me?unPythonesque (i.e. an exception to Python's normal > consistency) that the syntax of what follows the '@' should be restricted to > either a single (function) identifier or a single (function) identifier with > an argument list. > The example I tried, which seems not an unreasonable sort of thing to do, > was along the lines of: > > def deco1(func): > ??? > def deco2(func): > ??? > > DecoList = [deco1, deco2] > > @DecoList[0]??? # NO - CAUSES SYNTAX ERROR > def foo(): > ??? pass > > I am sure other guys have their own examples. > > I am of course not the first person to raise this issue, and I see that > Guido has a "gut feeling" against allowing a general expression after the > '@'. > > BUT - a general expression can be "smuggled in" very easily as a function > argument: > > def Identity(x): return x > > @Identity(DecoList[0])??? # THIS WORKS > def foo(): > ??? pass > > So - the syntax restriction seems not only?inconsistent, but?pointless; it > doesn't forbid anything, but merely means we have to do it in a slightly > convoluted (unPythonesque) way.? So please, Guido, will you reconsider? > > Best wishes > Rob Cliffe From fuzzyman at gmail.com Wed Sep 2 12:49:31 2009 From: fuzzyman at gmail.com (Michael Foord) Date: Wed, 2 Sep 2009 11:49:31 +0100 Subject: [Python-ideas] [Python-Dev] Decorator syntax In-Reply-To: <3bdda690909020334m4c81cecdm8f57dce54c2da398@mail.gmail.com> References: <3bdda690909020334m4c81cecdm8f57dce54c2da398@mail.gmail.com> Message-ID: <6f4025010909020349h31573e63if2c112ba471b8872@mail.gmail.com> I actually encountered this for the first time yesterday and didn't realise that the decorator syntax was limited in this way (I was mentally preparing a blog entry when these emails arrived). What I needed to do was turn a Python function into a .NET event handler in IronPython. The simple case is this: from System import EventHandler @EventHandler def on_event(sender, event): # do stuff... This works fine of course, but then I needed to use the 'typed' form which is like this: @EventHandler[HtmlEventArgs] def on_event(sender, event): # do stuff... I didn't realise this was invalid syntax - nor the neat trick with the identity function to bypass the limitation. Michael 2009/9/2 Carl Johnson > Crossposting to Python-ideas, > > I asked for the same change to the grammar a couple months back on > python-ideas. > > See > http://mail.python.org/pipermail/python-ideas/2009-February/thread.html#2787 > > I'm all for it, but you'll have to convince Guido that this won't > result in confusing to read code. My own examples, unfortunately did > not advance your cause, as Guido explained, "My brain hurts trying to > understand all this. I don't think this bodes well as a use case for a > proposed feature." :-D The trouble is that I was using lambdas upon > lambdas to do all kinds of Ruby block-esque tricks. OTOH, if you come > up with some simple, clear use cases though, and I think he might > still be persuadable to make a simple change to the grammar. > > > ? Carl Johnson > > Rob Cliffe wrote: > > > Hi All, > > This is my first post to python-dev so I will briefly introduce myself: > My > > name is Rob Cliffe and I am a commercial programmer living in London, > UK. I > > have some 30 years of programming experience but have only been using > Python > > for a couple of years. > > First I want to say what a fantastic language Python is. It is THE best > > language for development in my opinion, and a joy to use. > > > > My specific issue: > > I eventually got my head round decorator syntax and realised that what > came > > after the '@' was (basically) a function that took a function as argument > > and returned a function as result. > > However it seems to me unPythonesque (i.e. an exception to Python's > normal > > consistency) that the syntax of what follows the '@' should be restricted > to > > either a single (function) identifier or a single (function) identifier > with > > an argument list. > > The example I tried, which seems not an unreasonable sort of thing to do, > > was along the lines of: > > > > def deco1(func): > > > > def deco2(func): > > > > > > DecoList = [deco1, deco2] > > > > @DecoList[0] # NO - CAUSES SYNTAX ERROR > > def foo(): > > pass > > > > I am sure other guys have their own examples. > > > > I am of course not the first person to raise this issue, and I see that > > Guido has a "gut feeling" against allowing a general expression after the > > '@'. > > > > BUT - a general expression can be "smuggled in" very easily as a function > > argument: > > > > def Identity(x): return x > > > > @Identity(DecoList[0]) # THIS WORKS > > def foo(): > > pass > > > > So - the syntax restriction seems not only inconsistent, but pointless; > it > > doesn't forbid anything, but merely means we have to do it in a slightly > > convoluted (unPythonesque) way. So please, Guido, will you reconsider? > > > > Best wishes > > Rob Cliffe > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > -- http://www.ironpythoninaction.com/ -------------- next part -------------- An HTML attachment was scrubbed... URL: From ncoghlan at gmail.com Wed Sep 2 13:18:57 2009 From: ncoghlan at gmail.com (Nick Coghlan) Date: Wed, 02 Sep 2009 21:18:57 +1000 Subject: [Python-ideas] [Python-Dev] Decorator syntax In-Reply-To: <126B4B7C-38AF-45DF-B5F2-DB146E7146C8@masklinn.net> References: <126B4B7C-38AF-45DF-B5F2-DB146E7146C8@masklinn.net> Message-ID: <4A9E54A1.4060408@gmail.com> Xavier Morel wrote: > On 2 Sep 2009, at 12:15 , Rob Cliffe wrote: >> >> @Identity(DecoList[0]) # THIS WORKS >> def foo(): >> pass > For what it's worth, you don't need an id function, you can simply write > > @itemgetter(0)(decorators) > def foo(): > 'whatever' > or > > @decorators.__getitem__(0) > def foo(): > 'whatever' To be honest, I'd forgotten the restriction was even there. So +0 on removing it and relying on "consenting adults" and style guides to keep people from getting to obscure with their decorators. However, any such change should also be accompanied by an update to PEP 8 (recommending the current syntactic restrictions as style rules for the standard library). Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- From guido at python.org Wed Sep 2 16:15:22 2009 From: guido at python.org (Guido van Rossum) Date: Wed, 2 Sep 2009 07:15:22 -0700 Subject: [Python-ideas] [Python-Dev] Decorator syntax In-Reply-To: <4A9E54A1.4060408@gmail.com> References: <126B4B7C-38AF-45DF-B5F2-DB146E7146C8@masklinn.net> <4A9E54A1.4060408@gmail.com> Message-ID: On Wed, Sep 2, 2009 at 4:18 AM, Nick Coghlan wrote: > Xavier Morel wrote: >> On 2 Sep 2009, at 12:15 , Rob Cliffe wrote: >>> >>> @Identity(DecoList[0]) ? ?# THIS WORKS >>> def foo(): >>> ? ?pass >> For what it's worth, you don't need an id function, you can simply write >> >> ? ? @itemgetter(0)(decorators) >> ? ? def foo(): >> ? ? ? ? 'whatever' >> or >> >> ? ? @decorators.__getitem__(0) >> ? ? def foo(): >> ? ? ? ? 'whatever' > > To be honest, I'd forgotten the restriction was even there. So +0 on > removing it and relying on "consenting adults" and style guides to keep > people from getting to obscure with their decorators. > > However, any such change should also be accompanied by an update to PEP > 8 (recommending the current syntactic restrictions as style rules for > the standard library). To be honest, I'm still -0 on allowing full expression syntax, but I'm fine allowing @foo[expr]. Decorators are syntactic sugar that make existing functionality more readable. I don't think that allowing complex expressions in the decorator furthers that goal. After all the solution is always only one line away: helper = @helper def func(args): body -- --Guido van Rossum (home page: http://www.python.org/~guido/) From g.brandl at gmx.net Thu Sep 3 01:09:33 2009 From: g.brandl at gmx.net (Georg Brandl) Date: Wed, 02 Sep 2009 23:09:33 +0000 Subject: [Python-ideas] Decorator syntax In-Reply-To: <4A9E9F26.8030608@mrabarnett.plus.com> References:

<4A9E9F26.8030608@mrabarnett.plus.com> Message-ID: MRAB schrieb: > James Y Knight wrote: >> On Sep 2, 2009, at 6:15 AM, Rob Cliffe wrote: >> >>> So - the syntax restriction seems not only inconsistent, but >>> pointless; it doesn't forbid anything, but merely means we have to do >>> it in a slightly convoluted (unPythonesque) way. So please, Guido, >>> will you reconsider? >> >> Indeed, it's a silly inconsistent restriction. When it was first added I >> too suggested that any expression be allowed after the @, rather than >> having a uniquely special restricted syntax. I argued from consistency >> of grammar standpoint. But Guido was not persuaded. Good luck to you. :) >> > [snip] > I can see no syntactic reason to restrict what can appear after the @. > If someone chooses to abuse it then that's unPythonic, but not illegal. I do see a reason. I have no problems with @foo.bar @foo.bar[baz] @foo.bar(baz) But this is ugly to me: @a + b def foo(): pass As is this: @a or (c and d) def foo(): pass Having the decorator expression "opened" by @ but not "closed" feels bad. However, this looks better to me: @(a + b) @(a or (c and d)) So, in terms of Grammar/Grammar, what about decorator: '@' atom trailer* NEWLINE [x-post to ideas list] Georg -- Thus spake the Lord: Thou shalt indent with four spaces. No more, no less. Four shall be the number of spaces thou shalt indent, and the number of thy indenting shall be four. Eight shalt thou not indent, nor either indent thou two, excepting that thou then proceed to four. Tabs are right out. From python at mrabarnett.plus.com Wed Sep 2 23:42:33 2009 From: python at mrabarnett.plus.com (MRAB) Date: Wed, 02 Sep 2009 22:42:33 +0100 Subject: [Python-ideas] Decorator syntax In-Reply-To: References:

<4A9E9F26.8030608@mrabarnett.plus.com> Message-ID: <4A9EE6C9.4050808@mrabarnett.plus.com> Georg Brandl wrote: > MRAB schrieb: >> James Y Knight wrote: >>> On Sep 2, 2009, at 6:15 AM, Rob Cliffe wrote: >>> >>>> So - the syntax restriction seems not only inconsistent, but >>>> pointless; it doesn't forbid anything, but merely means we have to do >>>> it in a slightly convoluted (unPythonesque) way. So please, Guido, >>>> will you reconsider? >>> Indeed, it's a silly inconsistent restriction. When it was first added I >>> too suggested that any expression be allowed after the @, rather than >>> having a uniquely special restricted syntax. I argued from consistency >>> of grammar standpoint. But Guido was not persuaded. Good luck to you. :) >>> >> [snip] >> I can see no syntactic reason to restrict what can appear after the @. >> If someone chooses to abuse it then that's unPythonic, but not illegal. > > I do see a reason. I have no problems with > > @foo.bar > @foo.bar[baz] > @foo.bar(baz) > > But this is ugly to me: > > @a + b > def foo(): pass > Ugly, yes. > As is this: > > @a or (c and d) > def foo(): pass > Agreed. > Having the decorator expression "opened" by @ but not "closed" feels bad. > But: @foo isn't "closed" either. > However, this looks better to me: > > @(a + b) > @(a or (c and d)) > Conditions in 'if' and 'while' statements don't need parentheses, so why do decorators? > So, in terms of Grammar/Grammar, what about > > decorator: '@' atom trailer* NEWLINE > I say "keep it clean", ie no parentheses except where operator priority or clarity require them. IMHO, if a user writes something that's ugly then call it unPythonic; consenting adults and all that. :-) From ncoghlan at gmail.com Thu Sep 3 00:22:46 2009 From: ncoghlan at gmail.com (Nick Coghlan) Date: Thu, 03 Sep 2009 08:22:46 +1000 Subject: [Python-ideas] Decorator syntax In-Reply-To: <4A9EE6C9.4050808@mrabarnett.plus.com> References:

<4A9E9F26.8030608@mrabarnett.plus.com> <4A9EE6C9.4050808@mrabarnett.plus.com> Message-ID: <4A9EF036.1080404@gmail.com> MRAB wrote: >> However, this looks better to me: >> >> @(a + b) >> @(a or (c and d)) >> > Conditions in 'if' and 'while' statements don't need parentheses, so why > do decorators? Those are already closed by the colon so requiring parentheses would be redundant: if a + b: while a + b: The minimalist tweak would be to follow Guido's preference and just accept subscripting in addition to calls. >> So, in terms of Grammar/Grammar, what about >> >> decorator: '@' atom trailer* NEWLINE >> > I say "keep it clean", ie no parentheses except where operator priority > or clarity require them. I actually agree with Georg that this is a case where clarity favours enforced parentheses for expressions that are otherwise non-atomic. Things like variable references, function calls and subscripting are already atomic so the parentheses would be optional in those cases. However, as long as Guido remains -0 extension to arbitrary expressions isn't going to happen, parentheses or no parentheses. Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- From python at mrabarnett.plus.com Thu Sep 3 00:36:11 2009 From: python at mrabarnett.plus.com (MRAB) Date: Wed, 02 Sep 2009 23:36:11 +0100 Subject: [Python-ideas] Decorator syntax In-Reply-To: <4A9EF036.1080404@gmail.com> References:

<4A9E9F26.8030608@mrabarnett.plus.com> <4A9EE6C9.4050808@mrabarnett.plus.com> <4A9EF036.1080404@gmail.com> Message-ID: <4A9EF35B.7040908@mrabarnett.plus.com> Nick Coghlan wrote: > MRAB wrote: >>> However, this looks better to me: >>> >>> @(a + b) >>> @(a or (c and d)) >>> >> Conditions in 'if' and 'while' statements don't need parentheses, so why >> do decorators? > > Those are already closed by the colon so requiring parentheses would be > redundant: > > if a + b: > while a + b: > I forgot about 'return': return a + b > The minimalist tweak would be to follow Guido's preference and just > accept subscripting in addition to calls. > >>> So, in terms of Grammar/Grammar, what about >>> >>> decorator: '@' atom trailer* NEWLINE >>> >> I say "keep it clean", ie no parentheses except where operator priority >> or clarity require them. > > I actually agree with Georg that this is a case where clarity favours > enforced parentheses for expressions that are otherwise non-atomic. > Things like variable references, function calls and subscripting are > already atomic so the parentheses would be optional in those cases. > > However, as long as Guido remains -0 extension to arbitrary expressions > isn't going to happen, parentheses or no parentheses. > From tjreedy at udel.edu Thu Sep 3 09:28:12 2009 From: tjreedy at udel.edu (Terry Reedy) Date: Thu, 03 Sep 2009 03:28:12 -0400 Subject: [Python-ideas] Decorator syntax In-Reply-To: References:

<4A9E9F26.8030608@mrabarnett.plus.com> Message-ID: Georg Brandl wrote: > @foo.bar > @foo.bar[baz] To me, this is easier to read (conceptualize), because it simply selects a metafunction, than the current > @foo.bar(baz) , which calls a metafunction that creates and returns a metafunction. So I hope it gets added if not too difficult. I have some sympathy for Guido's position that anything more complicated should be split into two lines. That should be mentioned in the docs whether or not @f[i] is added. tjr From g.brandl at gmx.net Thu Sep 3 13:27:05 2009 From: g.brandl at gmx.net (Georg Brandl) Date: Thu, 03 Sep 2009 13:27:05 +0200 Subject: [Python-ideas] Decorator syntax In-Reply-To: <4A9EE6C9.4050808@mrabarnett.plus.com> References:

<4A9E9F26.8030608@mrabarnett.plus.com> <4A9EE6C9.4050808@mrabarnett.plus.com> Message-ID: MRAB schrieb: >> I do see a reason. I have no problems with >> >> @foo.bar >> @foo.bar[baz] >> @foo.bar(baz) >> >> But this is ugly to me: >> >> @a + b >> def foo(): pass >> > Ugly, yes. > >> As is this: >> >> @a or (c and d) >> def foo(): pass >> > Agreed. Good :) >> Having the decorator expression "opened" by @ but not "closed" feels bad. >> > But: > > @foo > > isn't "closed" either. Hmm, the above is probably a bad expression of my "feeling" :) but I think you know what I mean. Georg -- Thus spake the Lord: Thou shalt indent with four spaces. No more, no less. Four shall be the number of spaces thou shalt indent, and the number of thy indenting shall be four. Eight shalt thou not indent, nor either indent thou two, excepting that thou then proceed to four. Tabs are right out. From stefan_ml at behnel.de Fri Sep 4 11:35:57 2009 From: stefan_ml at behnel.de (Stefan Behnel) Date: Fri, 04 Sep 2009 11:35:57 +0200 Subject: [Python-ideas] data structures should have an .any() method Message-ID: Hi, I just had a discussion with a co-worker, and we noticed that there are use cases where you just want the only element in a data structure, or just any of the elements in a data structure because you know that they all contain the same information (with respect to what you are looking for, at least). If you want all items, you can iterate, but if you just want any item or the only item, it's inefficient (and not very explicit code) to create an iterator and take the element out. It's easy to do with ordered data structures such as lists or tuples ("container[0]"), but it's not so obvious for sets (or dicts), which means that you have to know what kind of container you receive to handle it correctly. I know there's .pop() on sets, but that modifies the data structure. It would therefore be nice to have a common ".any()" method on data structures that would just read an arbitrary item from a container. Regarding the special (and probably minor use) case of dicts, I assume it would return any key, so that you could get the value from the dict in a second step if you want. Only returning the value would not easily get you the key itself. Stefan From masklinn at masklinn.net Fri Sep 4 11:48:43 2009 From: masklinn at masklinn.net (Masklinn) Date: Fri, 4 Sep 2009 11:48:43 +0200 Subject: [Python-ideas] data structures should have an .any() method In-Reply-To: References: Message-ID: <9BE99220-8FEB-4314-B0C9-C49D1AA0D99D@masklinn.net> On 4 Sep 2009, at 11:35 , Stefan Behnel wrote: Hi, > > I just had a discussion with a co-worker, and we noticed that there > are use > cases where you just want the only element in a data structure, or > just any > of the elements in a data structure because you know that they all > contain > the same information (with respect to what you are looking for, at > least). > > If you want all items, you can iterate, but if you just want any > item or > the only item, it's inefficient (and not very explicit code) to > create an > iterator and take the element out. It's easy to do with ordered data > structures such as lists or tuples ("container[0]"), but it's not so > obvious for sets (or dicts), which means that you have to know what > kind of > container you receive to handle it correctly. I know there's .pop() on > sets, but that modifies the data structure. > > It would therefore be nice to have a common ".any()" method on data > structures that would just read an arbitrary item from a container. > > Regarding the special (and probably minor use) case of dicts, I > assume it > would return any key, so that you could get the value from the dict > in a > second step if you want. Only returning the value would not easily > get you > the key itself. Given the random value you want from the dict is a key, how about `random.choice(list(container))` (list is needed because choice works on sequences)? >>> l, s, d ([0, 1, 2, 3, 4], set([8, 9, 5, 6, 7]), {10: 15, 11: 16, 12: 17, 13: 18, 14: 19}) >>> choice(list(l)) 4 >>> choice(list(s)) 8 >>> choice(list(d)) 14 From matteodellamico at gmail.com Fri Sep 4 11:55:05 2009 From: matteodellamico at gmail.com (Matteo Dell'Amico) Date: Fri, 04 Sep 2009 11:55:05 +0200 Subject: [Python-ideas] data structures should have an .any() method In-Reply-To: References: Message-ID: <4AA0E3F9.9010904@gmail.com> Stefan Behnel ha scritto: > Hi, > > I just had a discussion with a co-worker, and we noticed that there are use > cases where you just want the only element in a data structure, or just any > of the elements in a data structure because you know that they all contain > the same information (with respect to what you are looking for, at least). > > If you want all items, you can iterate, but if you just want any item or > the only item, it's inefficient (and not very explicit code) to create an > iterator and take the element out. It's easy to do with ordered data > structures such as lists or tuples ("container[0]"), but it's not so > obvious for sets (or dicts), which means that you have to know what kind of > container you receive to handle it correctly. I know there's .pop() on > sets, but that modifies the data structure. You can do next(iter(container)) (or iter(container).next() with python <= 2.5). This works fine with any iterable. matteo From ncoghlan at gmail.com Fri Sep 4 12:03:21 2009 From: ncoghlan at gmail.com (Nick Coghlan) Date: Fri, 04 Sep 2009 20:03:21 +1000 Subject: [Python-ideas] data structures should have an .any() method In-Reply-To: References: Message-ID: <4AA0E5E9.2050202@gmail.com> Stefan Behnel wrote: > It would therefore be nice to have a common ".any()" method on data > structures that would just read an arbitrary item from a container. I'd advise against bare name "any" for this, since we already have the any() builtin with a completely different meaning. "getany" would probably be OK though. I'd also advise against using a method for this, since there is a reasonable default implementation that can be employed: def getany(container) if container: if isinstance(container, collections.Sequence): return container[0] else: for x in container: return x raise ValueError("No items in container") Finally, I'd suggest that any such function would belong in the collections module rather than being made a builtin. Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- From zuo at chopin.edu.pl Fri Sep 4 12:32:05 2009 From: zuo at chopin.edu.pl (Jan Kaliszewski) Date: Fri, 04 Sep 2009 12:32:05 +0200 Subject: [Python-ideas] data structures should have an .any() method In-Reply-To: <4AA0E5E9.2050202@gmail.com> References: <4AA0E5E9.2050202@gmail.com> Message-ID: 04-09-2009 Nick Coghlan wrote: > def getany(container) > if container: > if isinstance(container, collections.Sequence): > return container[0] > else: > for x in container: > return x > raise ValueError("No items in container") or simpler: def getany(container): try: return next(iter(container)) except StopIteration raise ValueError("No items in container") -- Jan Kaliszewski (zuo) From stefan_ml at behnel.de Fri Sep 4 13:12:02 2009 From: stefan_ml at behnel.de (Stefan Behnel) Date: Fri, 04 Sep 2009 13:12:02 +0200 Subject: [Python-ideas] data structures should have an .any() method In-Reply-To: References: <4AA0E5E9.2050202@gmail.com> Message-ID: Jan Kaliszewski wrote: > 04-09-2009 Nick Coghlan > wrote: > >> def getany(container) >> if container: >> if isinstance(container, collections.Sequence): >> return container[0] >> else: >> for x in container: >> return x >> raise ValueError("No items in container") > > or simpler: > > def getany(container): > try: > return next(iter(container)) > except StopIteration > raise ValueError("No items in container") or: def getany(container): for x in container: return x raise ValueError("No items in container") I actually like that, although I find this more readable: def getany(container): for x in container: return x else: raise ValueError("No items in container") Stefan From ncoghlan at gmail.com Fri Sep 4 13:50:11 2009 From: ncoghlan at gmail.com (Nick Coghlan) Date: Fri, 04 Sep 2009 21:50:11 +1000 Subject: [Python-ideas] data structures should have an .any() method In-Reply-To: <4AA0E3F9.9010904@gmail.com> References: <4AA0E3F9.9010904@gmail.com> Message-ID: <4AA0FEF3.2000607@gmail.com> Matteo Dell'Amico wrote: > You can do next(iter(container)) (or iter(container).next() with python > <= 2.5). This works fine with any iterable. This and other responses miss part of Stefan's complaint: that creating an iterator (which isn't always cheap) only to throw it away almost immediately may be a somewhat wasteful operation. The shorthand expression above also suffers from the obscurity that Stefan was complaining about - there is very little to hint that "next(iter(obj))" means "get an arbitrary object out of a container". The StopIteration exception this approach will throw for an empty container is also rather unhelpful. That said, I'm -0 on the idea overall. If someone actually needs it, it isn't particularly hard for them to write their own getany() function. Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- From matteodellamico at gmail.com Fri Sep 4 14:01:28 2009 From: matteodellamico at gmail.com (Matteo Dell'Amico) Date: Fri, 04 Sep 2009 14:01:28 +0200 Subject: [Python-ideas] data structures should have an .any() method In-Reply-To: <4AA0FEF3.2000607@gmail.com> References: <4AA0E3F9.9010904@gmail.com> <4AA0FEF3.2000607@gmail.com> Message-ID: <4AA10198.8040401@gmail.com> Nick Coghlan ha scritto: > This and other responses miss part of Stefan's complaint: that creating > an iterator (which isn't always cheap) only to throw it away almost > immediately may be a somewhat wasteful operation. It's not particularly wasteful for the built-in data structures, though. It seems to me that the cases where the performance of next(iter(obj)) would be an actual issue are quite rare. > The shorthand expression above also suffers from the obscurity that > Stefan was complaining about - there is very little to hint that > "next(iter(obj))" means "get an arbitrary object out of a container". > The StopIteration exception this approach will throw for an empty > container is also rather unhelpful. Why? next(iter(obj)) means, pretty explicitly to me, "iterate on obj and give me one element". matteo From ncoghlan at gmail.com Fri Sep 4 14:17:09 2009 From: ncoghlan at gmail.com (Nick Coghlan) Date: Fri, 04 Sep 2009 22:17:09 +1000 Subject: [Python-ideas] data structures should have an .any() method In-Reply-To: <4AA10198.8040401@gmail.com> References: <4AA0E3F9.9010904@gmail.com> <4AA0FEF3.2000607@gmail.com> <4AA10198.8040401@gmail.com> Message-ID: <4AA10545.30408@gmail.com> Matteo Dell'Amico wrote: >> The shorthand expression above also suffers from the obscurity that >> Stefan was complaining about - there is very little to hint that >> "next(iter(obj))" means "get an arbitrary object out of a container". >> The StopIteration exception this approach will throw for an empty >> container is also rather unhelpful. > > Why? next(iter(obj)) means, pretty explicitly to me, "iterate on obj and > give me one element". Because it overspecifies the semantics of what you're trying to do. It just happens that when the requirement is "get me any object in this container" the design of Python means that the easiest implementation is "get me the first object in this container". The expression form then reflects the implementation rather than the algorithmic intent. That said, this concise way of implementing the desired feature is certainly one of the reasons I am -0 on the idea of adding it to the standard library. Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- From zuo at chopin.edu.pl Fri Sep 4 15:01:02 2009 From: zuo at chopin.edu.pl (Jan Kaliszewski) Date: Fri, 04 Sep 2009 15:01:02 +0200 Subject: [Python-ideas] possible attribute-oriented class In-Reply-To: <8fd67d4b0909031546o423cf066wf7096651bbc46ee0@mail.gmail.com> References: <8fd67d4b0909031546o423cf066wf7096651bbc46ee0@mail.gmail.com> Message-ID: [originally from python-list at python.org, crossposted to python-ideas at python.org] 04-09-2009 o 00:46:01 Ken Newton wrote: > I have created the following class definition with the idea of making > a clean syntax for non-programmers to created structured data within a > python environment. > > I would appreciate comments on this code. First, is something like > this already done? Second, are there reasons for not doing this? If > this seems OK, how could I clean up the string conversion to have > indented format. > > The expected use would have all items in the structure be simple > python types or AttrClass types. Code written in python could walk the > structure in a simple way to locate any desired values. Code in a > C/C++ extension should also be able to walk the structure to use any > value in the structure. > > class AttrClass(object): > """AttrClass lets you freely add attributes in nested manner""" > > def __init__(self): > pass > def __setitem__(self, key, value): > return self.__dict__.__setitem__(key, value) > def __repr__(self): > return "%s(%s)" % (self.__class__.__name__, > self.__dict__.__repr__()) > def __str__(self): > ll = ['{'] > for k,v in self.__dict__.iteritems(): > ll.append("%s : %s" % (k, str(v))) > return '\n'.join(ll) + '}' [snip] I find the idea interesting and close to my own needs in many situations, if I could alter it a bit. Of course, we always can use an empty class ('class MyStruct: pass') or simply use a dict... But both methods are inconvinient in some ways. In the case of dict we are convicted -- even when we need static access -- to mapping notation (obj['member']) which is less convenient and (what's more important) more error-prone than attribute dot-notation. In the case of empty class/object we can use convenient attr dot-notation but dynamic access is less natural... IMHO there could be -- in collections module or even as a built-in factory function -- something (somehow) similar to namedtuple, but mutable and more dict-like. I'am less focused on nesting such structures, and more on making it a namespace-like objects with convenience-and-today-usage features. Please consider the code: class AttrDict(dict): # (or maybe from OrderedDict) "It's only a model. (Shhh!)" def __getattr__(self, name): if name.startswith('_'): raise AttributeError("AttrDict's key can't " "start with underscore") else: return self[name] def __setattr__(self, name, value): self[name] = value def __delattr__(self, name): del self[name] def __repr__(self): return '{0}({1})'.format(self.__class__.__name__, dict.__repr__(self)) def __str__(self): return self._as_str() def _gen_format(self, indwidth, indstate): indst = indstate * ' ' ind = (indstate + indwidth) * ' ' yield ('\n' + indst + '{' if indstate else '{') for key, val in self.items(): valstr = (str(val) if not isinstance(val, AttrDict) else val._as_str(indwidth, indstate + indwidth)) yield '{ind}{key}: {valstr}'.format(ind=ind, key=key, valstr=valstr) yield indst + '}' def _as_str(self, indwidth=4, indstate=0): return '\n'.join(self._gen_format(indwidth, indstate)) def _as_dict(self): return dict.copy(self) # Test code: if __name__ == '__main__': struct = AttrDict() struct.first = 1 struct.second = 2.0 struct.third = '3rd' struct.fourth = [4] print(struct) # output: # { # 'second': 2.0 # 'fourth': [4] # 'third': '3rd' # 'first': 1 # } del struct.fourth print(repr(struct)) # output: # AttrDict({'second': 2.0, 'third': '3rd', 'first': 1}) print(struct.first) # (static access) # output: # 1 for x in ('first', 'second', 'third'): print(struct[x]) # (dynamic access) # output: # 1 # 2.0 # 3rd struct.sub = AttrDict(a=1, b=2, c=89) print(struct._as_dict()) # output: # {'second': 2.0, 'sub': AttrDict({'a': 1, 'c': 89, 'b': 2}),\ # 'third': '3rd', 'first': 1} print(struct._as_str(8)) # output: # { # second: 2.0 # sub: # { # a: 1 # c: 89 # b: 2 # } # third: 3rd # first: 1 # } What do you think about it? Cheers, *j -- Jan Kaliszewski (zuo) From ilya.nikokoshev at gmail.com Fri Sep 4 15:29:30 2009 From: ilya.nikokoshev at gmail.com (ilya) Date: Fri, 4 Sep 2009 17:29:30 +0400 Subject: [Python-ideas] Decorator syntax In-Reply-To: References:

<4A9E9F26.8030608@mrabarnett.plus.com> <4A9EE6C9.4050808@mrabarnett.plus.com> Message-ID: I would say here are two more things to consider: (1) How to colorize expression @a or b and c? My IDE colorizes only @a as decorator (2) How to search for all functions that have been applied decorator `b` (*not* either `a` or `b`)? (bonus) How to test expression of the above form? By definition, you will have only *one* function decorated like that. To test it, you should define it as a separate function anyway. ilya. On Thu, Sep 3, 2009 at 3:27 PM, Georg Brandl wrote: > MRAB schrieb: > >>> I do see a reason. ?I have no problems with >>> >>> ? ?@foo.bar >>> ? ?@foo.bar[baz] >>> ? ?@foo.bar(baz) >>> >>> But this is ugly to me: >>> >>> ? ?@a + b >>> ? ?def foo(): pass >>> >> Ugly, yes. >> >>> As is this: >>> >>> ? ?@a or (c and d) >>> ? ?def foo(): pass >>> >> Agreed. > > Good :) > >>> Having the decorator expression "opened" by @ but not "closed" feels bad. >>> >> But: >> >> ? ? ?@foo >> >> isn't "closed" either. > > Hmm, the above is probably a bad expression of my "feeling" :) but I think > you know what I mean. > > Georg > > -- > Thus spake the Lord: Thou shalt indent with four spaces. No more, no less. > Four shall be the number of spaces thou shalt indent, and the number of thy > indenting shall be four. Eight shalt thou not indent, nor either indent thou > two, excepting that thou then proceed to four. Tabs are right out. > > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > From ilya.nikokoshev at gmail.com Fri Sep 4 15:35:47 2009 From: ilya.nikokoshev at gmail.com (ilya) Date: Fri, 4 Sep 2009 17:35:47 +0400 Subject: [Python-ideas] data structures should have an .any() method In-Reply-To: <4AA10545.30408@gmail.com> References: <4AA0E3F9.9010904@gmail.com> <4AA0FEF3.2000607@gmail.com> <4AA10198.8040401@gmail.com> <4AA10545.30408@gmail.com> Message-ID: I think dict.popitem() does something close to what the original post wanted. http://docs.python.org/3.1/library/stdtypes.html#dict.popitem On Fri, Sep 4, 2009 at 4:17 PM, Nick Coghlan wrote: > Matteo Dell'Amico wrote: >>> The shorthand expression above also suffers from the obscurity that >>> Stefan was complaining about - there is very little to hint that >>> "next(iter(obj))" means "get an arbitrary object out of a container". >>> The StopIteration exception this approach will throw for an empty >>> container is also rather unhelpful. >> >> Why? next(iter(obj)) means, pretty explicitly to me, "iterate on obj and >> give me one element". > > Because it overspecifies the semantics of what you're trying to do. It > just happens that when the requirement is "get me any object in this > container" the design of Python means that the easiest implementation is > "get me the first object in this container". > > The expression form then reflects the implementation rather than the > algorithmic intent. > > That said, this concise way of implementing the desired feature is > certainly one of the reasons I am -0 on the idea of adding it to the > standard library. > > Cheers, > Nick. > > -- > Nick Coghlan ? | ? ncoghlan at gmail.com ? | ? Brisbane, Australia > --------------------------------------------------------------- > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > From gerald.britton at gmail.com Fri Sep 4 15:47:54 2009 From: gerald.britton at gmail.com (Gerald Britton) Date: Fri, 4 Sep 2009 09:47:54 -0400 Subject: [Python-ideas] data structures should have an .any() method In-Reply-To: References: <4AA0E3F9.9010904@gmail.com> <4AA0FEF3.2000607@gmail.com> <4AA10198.8040401@gmail.com> <4AA10545.30408@gmail.com> Message-ID: <5d1a32000909040647u20889e1do296d31d8c3f45ac4@mail.gmail.com> Though IIRC the OP doesn't want to delete the item, so you would need something like: key, data = dict.popitem(); dict[key] = data On Fri, Sep 4, 2009 at 9:35 AM, ilya wrote: > I think dict.popitem() does something close to what the original post wanted. > > http://docs.python.org/3.1/library/stdtypes.html#dict.popitem > > On Fri, Sep 4, 2009 at 4:17 PM, Nick Coghlan wrote: >> Matteo Dell'Amico wrote: >>>> The shorthand expression above also suffers from the obscurity that >>>> Stefan was complaining about - there is very little to hint that >>>> "next(iter(obj))" means "get an arbitrary object out of a container". >>>> The StopIteration exception this approach will throw for an empty >>>> container is also rather unhelpful. >>> >>> Why? next(iter(obj)) means, pretty explicitly to me, "iterate on obj and >>> give me one element". >> >> Because it overspecifies the semantics of what you're trying to do. It >> just happens that when the requirement is "get me any object in this >> container" the design of Python means that the easiest implementation is >> "get me the first object in this container". >> >> The expression form then reflects the implementation rather than the >> algorithmic intent. >> >> That said, this concise way of implementing the desired feature is >> certainly one of the reasons I am -0 on the idea of adding it to the >> standard library. >> >> Cheers, >> Nick. >> >> -- >> Nick Coghlan ? | ? ncoghlan at gmail.com ? | ? Brisbane, Australia >> --------------------------------------------------------------- >> _______________________________________________ >> Python-ideas mailing list >> Python-ideas at python.org >> http://mail.python.org/mailman/listinfo/python-ideas >> > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > -- Gerald Britton From ncoghlan at gmail.com Fri Sep 4 16:59:32 2009 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sat, 05 Sep 2009 00:59:32 +1000 Subject: [Python-ideas] possible attribute-oriented class In-Reply-To: References: <8fd67d4b0909031546o423cf066wf7096651bbc46ee0@mail.gmail.com> Message-ID: <4AA12B54.3070604@gmail.com> Jan Kaliszewski wrote: > What do you think about it? It reminds me a bit of the old (short-lived) namespaces module: http://web.archive.org/web/20060216094030/http://namespace.python-hosting.com/ Steven's draft PEP on the topic is still available in the python-list archives: http://mail.python.org/pipermail/python-list/2005-February/307235.html The problem we found with it was that the basic solutions (empty class and now named_tuple) were good enough that it wasn't worth the hassle involved in grabbing an extra library for it. Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- From guido at python.org Fri Sep 4 20:04:34 2009 From: guido at python.org (Guido van Rossum) Date: Fri, 4 Sep 2009 11:04:34 -0700 Subject: [Python-ideas] data structures should have an .any() method In-Reply-To: References: Message-ID: On Fri, Sep 4, 2009 at 2:35 AM, Stefan Behnel wrote: > I just had a discussion with a co-worker, and we noticed that there are use > cases where you just want the only element in a data structure, or just any > of the elements in a data structure because you know that they all contain > the same information (with respect to what you are looking for, at least). > > If you want all items, you can iterate, but if you just want any item or > the only item, it's inefficient (and not very explicit code) to create an > iterator and take the element out. I assure you it's not slow. next(iter(x)) is probably as good as it gets -- I don't think we need another way to say that in fewer words. -- --Guido van Rossum (home page: http://www.python.org/~guido/) From stefan_ml at behnel.de Fri Sep 4 21:36:57 2009 From: stefan_ml at behnel.de (Stefan Behnel) Date: Fri, 04 Sep 2009 21:36:57 +0200 Subject: [Python-ideas] data structures should have an .any() method In-Reply-To: References: Message-ID: Guido van Rossum wrote: > On Fri, Sep 4, 2009 at 2:35 AM, Stefan Behnel wrote: >> I just had a discussion with a co-worker, and we noticed that there are use >> cases where you just want the only element in a data structure, or just any >> of the elements in a data structure because you know that they all contain >> the same information (with respect to what you are looking for, at least). >> >> If you want all items, you can iterate, but if you just want any item or >> the only item, it's inefficient (and not very explicit code) to create an >> iterator and take the element out. > > I assure you it's not slow. Not in absolute numbers, but certainly slower than necessary: $ python2.6 -m timeit -s 'l=[1]' 'l[0]' 10000000 loops, best of 3: 0.0977 usec per loop $ python2.6 -m timeit -s 'l=[1]' 'next(iter(l))' 1000000 loops, best of 3: 0.523 usec per loop > next(iter(x)) is probably as good as it > gets -- I don't think we need another way to say that in fewer words. I'm fine with such a decision, given that it's trivial to wrap this into your own function. That doesn't make it much faster: $ python2.6 -m timeit -s 'l=[1]' -s 'def getany(l):' -s ' return l[0]' 'getany(l)' 1000000 loops, best of 3: 0.34 usec per loop $ python2.6 -m timeit -s 'l=[1]' -s 'def getany(l):' \ -s ' for x in l:' \ -s ' return x' \ 'getany(l)' 1000000 loops, best of 3: 0.454 usec per loop $ python2.6 -m timeit -s 'l=[1]' -s 'def getany(l):' \ -s ' return next(iter(l))' \ 'getany(l)' 1000000 loops, best of 3: 0.743 usec per loop but, admittedly, that's still not slow in absolute numbers. Stefan From gerald.britton at gmail.com Fri Sep 4 21:56:19 2009 From: gerald.britton at gmail.com (Gerald Britton) Date: Fri, 4 Sep 2009 15:56:19 -0400 Subject: [Python-ideas] data structures should have an .any() method In-Reply-To: References: Message-ID: <5d1a32000909041256k6af5c87fv9a42e0655e7a4155@mail.gmail.com> Interesting that your return(next()) example runs about 60% slower than the "for x in l: return x" example. Must be the function call overhead. On Fri, Sep 4, 2009 at 3:36 PM, Stefan Behnel wrote: > Guido van Rossum wrote: >> On Fri, Sep 4, 2009 at 2:35 AM, Stefan Behnel wrote: >>> I just had a discussion with a co-worker, and we noticed that there are use >>> cases where you just want the only element in a data structure, or just any >>> of the elements in a data structure because you know that they all contain >>> the same information (with respect to what you are looking for, at least). >>> >>> If you want all items, you can iterate, but if you just want any item or >>> the only item, it's inefficient (and not very explicit code) to create an >>> iterator and take the element out. >> >> I assure you it's not slow. > > Not in absolute numbers, but certainly slower than necessary: > > $ python2.6 -m timeit -s 'l=[1]' 'l[0]' > 10000000 loops, best of 3: 0.0977 usec per loop > > $ python2.6 -m timeit -s 'l=[1]' 'next(iter(l))' > 1000000 loops, best of 3: 0.523 usec per loop > > >> next(iter(x)) is probably as good as it >> gets -- I don't think we need another way to say that in fewer words. > > I'm fine with such a decision, given that it's trivial to wrap this into > your own function. That doesn't make it much faster: > > > $ python2.6 -m timeit -s 'l=[1]' -s 'def getany(l):' > ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? -s ' ?return l[0]' > ? ? ? ? ? ? ? ? ? ? ?'getany(l)' > 1000000 loops, best of 3: 0.34 usec per loop > > $ python2.6 -m timeit -s 'l=[1]' -s 'def getany(l):' \ > ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? -s ' ?for x in l:' \ > ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? -s ' ? ?return x' \ > ? ? ? ? ? ? ? ? ? ? ?'getany(l)' > 1000000 loops, best of 3: 0.454 usec per loop > > $ python2.6 -m timeit -s 'l=[1]' -s 'def getany(l):' \ > ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? -s ' ?return next(iter(l))' \ > ? ? ? ? ? ? ? ? ? ? ?'getany(l)' > 1000000 loops, best of 3: 0.743 usec per loop > > > but, admittedly, that's still not slow in absolute numbers. > > Stefan > > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > -- Gerald Britton From gerald.britton at gmail.com Fri Sep 4 22:01:14 2009 From: gerald.britton at gmail.com (Gerald Britton) Date: Fri, 4 Sep 2009 16:01:14 -0400 Subject: [Python-ideas] data structures should have an .any() method In-Reply-To: <5d1a32000909041256k6af5c87fv9a42e0655e7a4155@mail.gmail.com> References: <5d1a32000909041256k6af5c87fv9a42e0655e7a4155@mail.gmail.com> Message-ID: <5d1a32000909041301h52dd7e17se53bb3da05376218@mail.gmail.com> Also interesting that: def getany(x): k,d = x.popitem() x[k] = d return k outperforms the next(iter()) approach, though not by much (< 10% in my case) On Fri, Sep 4, 2009 at 3:56 PM, Gerald Britton wrote: > Interesting that your return(next()) example runs about 60% slower > than the "for x in l: return x" example. ?Must be the function call > overhead. > > On Fri, Sep 4, 2009 at 3:36 PM, Stefan Behnel wrote: >> Guido van Rossum wrote: >>> On Fri, Sep 4, 2009 at 2:35 AM, Stefan Behnel wrote: >>>> I just had a discussion with a co-worker, and we noticed that there are use >>>> cases where you just want the only element in a data structure, or just any >>>> of the elements in a data structure because you know that they all contain >>>> the same information (with respect to what you are looking for, at least). >>>> >>>> If you want all items, you can iterate, but if you just want any item or >>>> the only item, it's inefficient (and not very explicit code) to create an >>>> iterator and take the element out. >>> >>> I assure you it's not slow. >> >> Not in absolute numbers, but certainly slower than necessary: >> >> $ python2.6 -m timeit -s 'l=[1]' 'l[0]' >> 10000000 loops, best of 3: 0.0977 usec per loop >> >> $ python2.6 -m timeit -s 'l=[1]' 'next(iter(l))' >> 1000000 loops, best of 3: 0.523 usec per loop >> >> >>> next(iter(x)) is probably as good as it >>> gets -- I don't think we need another way to say that in fewer words. >> >> I'm fine with such a decision, given that it's trivial to wrap this into >> your own function. That doesn't make it much faster: >> >> >> $ python2.6 -m timeit -s 'l=[1]' -s 'def getany(l):' >> ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? -s ' ?return l[0]' >> ? ? ? ? ? ? ? ? ? ? ?'getany(l)' >> 1000000 loops, best of 3: 0.34 usec per loop >> >> $ python2.6 -m timeit -s 'l=[1]' -s 'def getany(l):' \ >> ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? -s ' ?for x in l:' \ >> ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? -s ' ? ?return x' \ >> ? ? ? ? ? ? ? ? ? ? ?'getany(l)' >> 1000000 loops, best of 3: 0.454 usec per loop >> >> $ python2.6 -m timeit -s 'l=[1]' -s 'def getany(l):' \ >> ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? -s ' ?return next(iter(l))' \ >> ? ? ? ? ? ? ? ? ? ? ?'getany(l)' >> 1000000 loops, best of 3: 0.743 usec per loop >> >> >> but, admittedly, that's still not slow in absolute numbers. >> >> Stefan >> >> _______________________________________________ >> Python-ideas mailing list >> Python-ideas at python.org >> http://mail.python.org/mailman/listinfo/python-ideas >> > > > > -- > Gerald Britton > -- Gerald Britton From zuo at chopin.edu.pl Fri Sep 4 22:37:15 2009 From: zuo at chopin.edu.pl (Jan Kaliszewski) Date: Fri, 04 Sep 2009 22:37:15 +0200 Subject: [Python-ideas] possible attribute-oriented class In-Reply-To: <8fd67d4b0909040955k590b8d6fv9012165d847b02d9@mail.gmail.com> References: <8fd67d4b0909031546o423cf066wf7096651bbc46ee0@mail.gmail.com> <8fd67d4b0909040955k590b8d6fv9012165d847b02d9@mail.gmail.com> Message-ID: 04-09-2009 Ken Newton wrote: > I like this version very much. I'm ready to put this into practice to see > how it works in practice. [snip] Not only you (Ken) and me. :-) It appears that the idea is quite old. Nick Coghlan replied at python-ideas at python.org: > Jan Kaliszewski wrote: >> What do you think about it? > > It reminds me a bit of the old (short-lived) namespaces module: > > http://web.archive.org/web/20060216094030/http://namespace.python-hosting.com/ > > Steven's draft PEP on the topic is still available in the python-list > archives: > > http://mail.python.org/pipermail/python-list/2005-February/307235.html > > The problem we found with it was that the basic solutions (empty class > and now named_tuple) were good enough that it wasn't worth the hassle > involved in grabbing an extra library for it. Named tuples (which indeed are really very nice) are read-only, but the approach they represent could (and IMHO should) be extended to some kind of mutable objects. The old discussion, the above link points to, shows that such a dot-accessible dict-like class is something that many people need and repeatedly implemet it (more or less perfectly) for themselves. Maybe that past proposition (to add a separate namespace module which a number types for viewing, chaining and so on) was too sophisticated? Most common use cases could be covered with one attr-dict-like type, that could be placed in collections module (or even, in time, as a built-in factory function, together with namedtuple?). Cheers, *j From george.sakkis at gmail.com Fri Sep 4 23:04:55 2009 From: george.sakkis at gmail.com (George Sakkis) Date: Fri, 4 Sep 2009 17:04:55 -0400 Subject: [Python-ideas] possible attribute-oriented class In-Reply-To: References: <8fd67d4b0909031546o423cf066wf7096651bbc46ee0@mail.gmail.com> <8fd67d4b0909040955k590b8d6fv9012165d847b02d9@mail.gmail.com> Message-ID: <91ad5bf80909041404t486a69dejf75ffd73a4d4a3ea@mail.gmail.com> On Fri, Sep 4, 2009 at 4:37 PM, Jan Kaliszewski wrote: > 04-09-2009 Ken Newton wrote: > >> I like this version very much. I'm ready to put this into practice to see >> how it works in practice. > > [snip] > > Not only you (Ken) and me. :-) It appears that the idea is quite old. Nick > Coghlan replied at python-ideas at python.org: > >> Jan Kaliszewski wrote: >>> >>> What do you think about it? >> >> It reminds me a bit of the old (short-lived) namespaces module: >> >> >> http://web.archive.org/web/20060216094030/http://namespace.python-hosting.com/ >> >> Steven's draft PEP on the topic is still available in the python-list >> archives: >> >> http://mail.python.org/pipermail/python-list/2005-February/307235.html >> >> The problem we found with it was that the basic solutions (empty class >> and now named_tuple) were good enough that it wasn't worth the hassle >> involved in grabbing an extra library for it. > > Named tuples (which indeed are really very nice) are read-only, but the > approach they represent could (and IMHO should) be extended to some kind > of mutable objects. Maybe something like http://code.activestate.com/recipes/576555/ ? George From guido at python.org Fri Sep 4 23:08:03 2009 From: guido at python.org (Guido van Rossum) Date: Fri, 4 Sep 2009 14:08:03 -0700 Subject: [Python-ideas] data structures should have an .any() method In-Reply-To: <5d1a32000909041256k6af5c87fv9a42e0655e7a4155@mail.gmail.com> References: <5d1a32000909041256k6af5c87fv9a42e0655e7a4155@mail.gmail.com> Message-ID: More likely the cost of the two dynamic lookups of builtin functions is your cost. On Fri, Sep 4, 2009 at 12:56 PM, Gerald Britton wrote: > Interesting that your return(next()) example runs about 60% slower > than the "for x in l: return x" example. ?Must be the function call > overhead. > > On Fri, Sep 4, 2009 at 3:36 PM, Stefan Behnel wrote: >> Guido van Rossum wrote: >>> On Fri, Sep 4, 2009 at 2:35 AM, Stefan Behnel wrote: >>>> I just had a discussion with a co-worker, and we noticed that there are use >>>> cases where you just want the only element in a data structure, or just any >>>> of the elements in a data structure because you know that they all contain >>>> the same information (with respect to what you are looking for, at least). >>>> >>>> If you want all items, you can iterate, but if you just want any item or >>>> the only item, it's inefficient (and not very explicit code) to create an >>>> iterator and take the element out. >>> >>> I assure you it's not slow. >> >> Not in absolute numbers, but certainly slower than necessary: >> >> $ python2.6 -m timeit -s 'l=[1]' 'l[0]' >> 10000000 loops, best of 3: 0.0977 usec per loop >> >> $ python2.6 -m timeit -s 'l=[1]' 'next(iter(l))' >> 1000000 loops, best of 3: 0.523 usec per loop >> >> >>> next(iter(x)) is probably as good as it >>> gets -- I don't think we need another way to say that in fewer words. >> >> I'm fine with such a decision, given that it's trivial to wrap this into >> your own function. That doesn't make it much faster: >> >> >> $ python2.6 -m timeit -s 'l=[1]' -s 'def getany(l):' >> ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? -s ' ?return l[0]' >> ? ? ? ? ? ? ? ? ? ? ?'getany(l)' >> 1000000 loops, best of 3: 0.34 usec per loop >> >> $ python2.6 -m timeit -s 'l=[1]' -s 'def getany(l):' \ >> ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? -s ' ?for x in l:' \ >> ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? -s ' ? ?return x' \ >> ? ? ? ? ? ? ? ? ? ? ?'getany(l)' >> 1000000 loops, best of 3: 0.454 usec per loop >> >> $ python2.6 -m timeit -s 'l=[1]' -s 'def getany(l):' \ >> ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? -s ' ?return next(iter(l))' \ >> ? ? ? ? ? ? ? ? ? ? ?'getany(l)' >> 1000000 loops, best of 3: 0.743 usec per loop >> >> >> but, admittedly, that's still not slow in absolute numbers. >> >> Stefan >> >> _______________________________________________ >> Python-ideas mailing list >> Python-ideas at python.org >> http://mail.python.org/mailman/listinfo/python-ideas >> > > > > -- > Gerald Britton > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > -- --Guido van Rossum (home page: http://www.python.org/~guido/) From solipsis at pitrou.net Sat Sep 5 04:45:13 2009 From: solipsis at pitrou.net (Antoine Pitrou) Date: Sat, 5 Sep 2009 02:45:13 +0000 (UTC) Subject: [Python-ideas] data structures should have an .any() method References: <4AA0E3F9.9010904@gmail.com> <4AA0FEF3.2000607@gmail.com> <4AA10198.8040401@gmail.com> <4AA10545.30408@gmail.com> Message-ID: Nick Coghlan writes: > > Because it overspecifies the semantics of what you're trying to do. It > just happens that when the requirement is "get me any object in this > container" the design of Python means that the easiest implementation is > "get me the first object in this container". I don't agree. Since iteration is such a frequent operation, any container which doesn't provide cheap iteration could be considered badly designed and/or badly implemented. Therefore it makes sense to rely on iteration when implementing other primitives. People worrying that it expresses implementation rather than intent can write the trivial abstraction by themselves: def any_item(x): return next(iter(x)) Regards Antoine. From steve at pearwood.info Sat Sep 5 07:17:05 2009 From: steve at pearwood.info (Steven D'Aprano) Date: Sat, 5 Sep 2009 15:17:05 +1000 Subject: [Python-ideas] data structures should have an .any() method In-Reply-To: <4AA0E5E9.2050202@gmail.com> References: <4AA0E5E9.2050202@gmail.com> Message-ID: <200909051517.06511.steve@pearwood.info> On Fri, 4 Sep 2009 08:03:21 pm Nick Coghlan wrote: > Stefan Behnel wrote: > > It would therefore be nice to have a common ".any()" method on data > > structures that would just read an arbitrary item from a container. > > I'd advise against bare name "any" for this, since we already have > the any() builtin with a completely different meaning. "getany" would > probably be OK though. > > I'd also advise against using a method for this, since there is a > reasonable default implementation that can be employed: > > def getany(container) > if container: > if isinstance(container, collections.Sequence): > return container[0] > else: > for x in container: > return x > raise ValueError("No items in container") > > Finally, I'd suggest that any such function would belong in the > collections module rather than being made a builtin. Given the above implementation, repeated calls to getany(sequence) will return the same item each time. Is that what people will expect by a function that claims to return "any" element of a collection? I suspect that users will be evenly divided into those who say Yes, those who say No, and those who say "It depends on what I'm trying to do". Should it return an arbitrary item, or a random item? Is "the first item" arbitrary enough? It should be for dicts, which are unordered, but may not be for lists. I think the answers to these questions are too application-specific for any solution or solutions to go into the standard library. It probably belongs in the cookbook as a handful of related recipes. -- Steven D'Aprano From steve at pearwood.info Sat Sep 5 11:47:46 2009 From: steve at pearwood.info (Steven D'Aprano) Date: Sat, 5 Sep 2009 19:47:46 +1000 Subject: [Python-ideas] data structures should have an .any() method In-Reply-To: <4AA10198.8040401@gmail.com> References: <4AA0FEF3.2000607@gmail.com> <4AA10198.8040401@gmail.com> Message-ID: <200909051947.48197.steve@pearwood.info> On Fri, 4 Sep 2009 10:01:28 pm Matteo Dell'Amico wrote: > Why? next(iter(obj)) means, pretty explicitly to me, "iterate on obj > and give me one element". To me, it says "give me the first element", not "give me any (an arbitrary) element" or "give me a random element". Does anyone have a use-case for retrieving a single arbitrary element of an arbitrary sequence, without caring about any other elements? Is this really such a common operation that we need to consider it part of the interface for all collections? I doubt it. -- Steven D'Aprano From g.brandl at gmx.net Sat Sep 5 12:53:11 2009 From: g.brandl at gmx.net (Georg Brandl) Date: Sat, 05 Sep 2009 12:53:11 +0200 Subject: [Python-ideas] data structures should have an .any() method In-Reply-To: References: <4AA0E3F9.9010904@gmail.com> <4AA0FEF3.2000607@gmail.com> <4AA10198.8040401@gmail.com> <4AA10545.30408@gmail.com> Message-ID: Antoine Pitrou schrieb: > Nick Coghlan writes: >> >> Because it overspecifies the semantics of what you're trying to do. It >> just happens that when the requirement is "get me any object in this >> container" the design of Python means that the easiest implementation is >> "get me the first object in this container". > > I don't agree. > Since iteration is such a frequent operation, any container which doesn't > provide cheap iteration could be considered badly designed and/or badly > implemented. Therefore it makes sense to rely on iteration when implementing > other primitives. > > People worrying that it expresses implementation rather than intent can write > the trivial abstraction by themselves: > > def any_item(x): > return next(iter(x)) or any_item = compose(next, iter) ;) Georg -- Thus spake the Lord: Thou shalt indent with four spaces. No more, no less. Four shall be the number of spaces thou shalt indent, and the number of thy indenting shall be four. Eight shalt thou not indent, nor either indent thou two, excepting that thou then proceed to four. Tabs are right out. From p.f.moore at gmail.com Sat Sep 5 14:02:18 2009 From: p.f.moore at gmail.com (Paul Moore) Date: Sat, 5 Sep 2009 13:02:18 +0100 Subject: [Python-ideas] data structures should have an .any() method In-Reply-To: <200909051947.48197.steve@pearwood.info> References: <4AA0FEF3.2000607@gmail.com> <4AA10198.8040401@gmail.com> <200909051947.48197.steve@pearwood.info> Message-ID: <79990c6b0909050502g55581ac1x5296b21593ae757f@mail.gmail.com> 2009/9/5 Steven D'Aprano : > On Fri, 4 Sep 2009 10:01:28 pm Matteo Dell'Amico wrote: > >> Why? next(iter(obj)) means, pretty explicitly to me, "iterate on obj >> and give me one element". > > To me, it says "give me the first element", not "give me any (an > arbitrary) element" or "give me a random element". The original use as described was for picking an "arbitrary" element, because all of the elements were effectively the same - "any of the elements in a data structure because you know that they all contain the same information". For a random element, use random.choice, for the first, use next(iter()). Either option satisfies the original requirement - but the first is bound to be faster, so it seems appropriate. > Does anyone have a use-case for retrieving a single arbitrary element of > an arbitrary sequence, without caring about any other elements? Is this > really such a common operation that we need to consider it part of the > interface for all collections? I doubt it. Agreed. Paul. From gerald.britton at gmail.com Sat Sep 5 15:26:34 2009 From: gerald.britton at gmail.com (Gerald Britton) Date: Sat, 5 Sep 2009 09:26:34 -0400 Subject: [Python-ideas] data structures should have an .any() method In-Reply-To: References: <4AA0E3F9.9010904@gmail.com> <4AA0FEF3.2000607@gmail.com> <4AA10198.8040401@gmail.com> <4AA10545.30408@gmail.com> Message-ID: <5d1a32000909050626i7de52eb1qc1503372643970c@mail.gmail.com> compose? Where'd you find that? On Sat, Sep 5, 2009 at 6:53 AM, Georg Brandl wrote: > Antoine Pitrou schrieb: >> Nick Coghlan writes: >>> >>> Because it overspecifies the semantics of what you're trying to do. It >>> just happens that when the requirement is "get me any object in this >>> container" the design of Python means that the easiest implementation is >>> "get me the first object in this container". >> >> I don't agree. >> Since iteration is such a frequent operation, any container which doesn't >> provide cheap iteration could be considered badly designed and/or badly >> implemented. Therefore it makes sense to rely on iteration when implementing >> other primitives. >> >> People worrying that it expresses implementation rather than intent can write >> the trivial abstraction by themselves: >> >> def any_item(x): >> ? ? return next(iter(x)) > > or > > any_item = compose(next, iter) > > ;) > > Georg > > -- > Thus spake the Lord: Thou shalt indent with four spaces. No more, no less. > Four shall be the number of spaces thou shalt indent, and the number of thy > indenting shall be four. Eight shalt thou not indent, nor either indent thou > two, excepting that thou then proceed to four. Tabs are right out. > > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > -- Gerald Britton From g.brandl at gmx.net Sat Sep 5 15:31:40 2009 From: g.brandl at gmx.net (Georg Brandl) Date: Sat, 05 Sep 2009 15:31:40 +0200 Subject: [Python-ideas] data structures should have an .any() method In-Reply-To: <5d1a32000909050626i7de52eb1qc1503372643970c@mail.gmail.com> References: <4AA0E3F9.9010904@gmail.com> <4AA0FEF3.2000607@gmail.com> <4AA10198.8040401@gmail.com> <4AA10545.30408@gmail.com> <5d1a32000909050626i7de52eb1qc1503372643970c@mail.gmail.com> Message-ID: Gerald Britton schrieb: > compose? Where'd you find that? That was another recent discussion here. Georg -- Thus spake the Lord: Thou shalt indent with four spaces. No more, no less. Four shall be the number of spaces thou shalt indent, and the number of thy indenting shall be four. Eight shalt thou not indent, nor either indent thou two, excepting that thou then proceed to four. Tabs are right out. From gerald.britton at gmail.com Sat Sep 5 15:47:37 2009 From: gerald.britton at gmail.com (Gerald Britton) Date: Sat, 5 Sep 2009 09:47:37 -0400 Subject: [Python-ideas] data structures should have an .any() method In-Reply-To: References: <4AA0E3F9.9010904@gmail.com> <4AA0FEF3.2000607@gmail.com> <4AA10198.8040401@gmail.com> <4AA10545.30408@gmail.com> <5d1a32000909050626i7de52eb1qc1503372643970c@mail.gmail.com> Message-ID: <5d1a32000909050647j1d5d730q355f9715b48a32c5@mail.gmail.com> Ah -- so not a real function then (yet)? Though something we could borrow from Haskell, I suppose, even though: compose(foo,bar) == lambda x: foo(bar(x)) no? On Sat, Sep 5, 2009 at 9:31 AM, Georg Brandl wrote: > Gerald Britton schrieb: >> compose? ?Where'd you find that? > > That was another recent discussion here. > > Georg > > -- > Thus spake the Lord: Thou shalt indent with four spaces. No more, no less. > Four shall be the number of spaces thou shalt indent, and the number of thy > indenting shall be four. Eight shalt thou not indent, nor either indent thou > two, excepting that thou then proceed to four. Tabs are right out. > > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > -- Gerald Britton From masklinn at masklinn.net Sat Sep 5 16:15:47 2009 From: masklinn at masklinn.net (Masklinn) Date: Sat, 5 Sep 2009 16:15:47 +0200 Subject: [Python-ideas] data structures should have an .any() method In-Reply-To: <5d1a32000909050647j1d5d730q355f9715b48a32c5@mail.gmail.com> References: <4AA0E3F9.9010904@gmail.com> <4AA0FEF3.2000607@gmail.com> <4AA10198.8040401@gmail.com> <4AA10545.30408@gmail.com> <5d1a32000909050626i7de52eb1qc1503372643970c@mail.gmail.com> <5d1a32000909050647j1d5d730q355f9715b48a32c5@mail.gmail.com> Message-ID: <6751FF3A-7DFE-4BB1-B391-2EE9A2DE04BA@masklinn.net> On 5 Sep 2009, at 15:47 , Gerald Britton wrote: Ah -- so not a real function then (yet)? Though something we could > borrow from Haskell, I suppose, even though: > > compose(foo,bar) == lambda x: foo(bar(x)) > > no? > Yeah but you could leverage Python's *args to get a compositor of more than two functions e.g. def compose(*funcs): return reduce(lambda f1, f2: lambda *args, **kwargs: f1(f2(*args, **kwargs)), funcs) From masklinn at masklinn.net Sat Sep 5 16:39:14 2009 From: masklinn at masklinn.net (Masklinn) Date: Sat, 5 Sep 2009 16:39:14 +0200 Subject: [Python-ideas] Collect **kw arguments as an ordered dictionary Message-ID: <6813C5F4-6B5B-413B-AEE4-17F050D38019@masklinn.net> Response to a fairly old (April) thread, but there's another use case to **kwargs being collected in an ordered dictionary: interaction with Objective-C or Smalltalk (but mainly obj-c/Cocoa): since **kwargs is not an ordered dict, Python/Cocoa interop can't use them to emulate ObjC's compound message names (as order is significant), so PyObjC merges the message subparts into a single method and tacks the arguments at the end, transforming calls like [NSString stringWithContentsOfFile:@"/usr/share/dict/propernames" encoding:NSASCIIStringEncoding error:&error]; into NSString.stringWithContentsOfFile_encoding_error_( "/usr/share/dict/propernames", NSASCIIStringEncoding) while the verbosity is similar, the loss in readability is tremendous. With the availability of ordered **kwargs, the bridge could simply have a send method (akin to the ObjC performSelector:) taking a bunch of kwargs and sending the corresponding message to the underlying ObjC object: NSString.send(stringWithContentOfFile="/usr/share/dict/ propernames", encoding=NSASCIIStringEncoding, error=errors) Or variable method names with a bit of massaging e.g. using the first message part as the method name ? la MacRuby: NSString.stringWithContentOfFile("/usr/share/dict/propernames", encoding=NSASCIIStringEncoding, error=errors) or splitting that first part between the method name and the first kwarg: NSString.stringWith(ContentOfFile="/usr/share/dict/propernames", encoding=NSASCIIStringEncoding, error=errors) From zuo at chopin.edu.pl Sat Sep 5 18:20:44 2009 From: zuo at chopin.edu.pl (Jan Kaliszewski) Date: Sat, 05 Sep 2009 18:20:44 +0200 Subject: [Python-ideas] possible attribute-oriented class In-Reply-To: <02b225c8$0$17565$c3e8da3@news.astraweb.com> References: <8fd67d4b0909031546o423cf066wf7096651bbc46ee0@mail.gmail.com> <8fd67d4b0909040955k590b8d6fv9012165d847b02d9@mail.gmail.com> <02b1dd77$0$17565$c3e8da3@news.astraweb.com> <8fd67d4b0909042247h167226edv6693c314ae5a16eb@mail.gmail.com> <02b225c8$0$17565$c3e8da3@news.astraweb.com> Message-ID: 05-09-2009 Steven D'Aprano wrote: > On Fri, 04 Sep 2009 22:37:15 +0200, Jan Kaliszewski wrote: > >> Named tuples (which indeed are really very nice) are read-only, but the >> approach they represent could (and IMHO should) be extended to some kind >> of mutable objects. [snip] > What sort of extensions did you have in mind? Two useful (from my point of view) concepts have appeared (or been linked to) in this thread -- on python-list and python-ideas: * the namespace/AttrDict concept (see Ken Newton's, Nick Coghlan's and my posts). * record concept (see George Sakkis post). >> The old discussion, the above link points to, shows that such a >> dot-accessible dict-like class is something that many people need and >> repeatedly implemet it (more or less perfectly) for themselves. > > I think it's something which people copy from other languages because > that's what they're used to, not because they need it. I don't think so, especially if we say about the former. IMHO it is simply useful in practice, especially for scripting (but not only) -- being more convenient than using empty class. It offers (in compact way, without additional efforts and verbose syntax -- once you have got such a tool implemented) three things at the same time, without necessity to choose between them: comfortable static attribute access, flexible dict-like dynamic access when needed and possibility of iteration. > It's just a change in syntax. Whether you write x.key or x['key'] is a > matter of convenience. Attribute access is optimized for when you know > the key names at compile time, key access is optimized for when you don't > know the names until runtime. Exactly. It is a matter of *convenience* (as well as large areas of Python) and that's the point. I suppose that that is the reason for people to repeatedly implement it for themselves. 05-09-2009 Steven D'Aprano wrote: > On Fri, 04 Sep 2009 22:51:39 -0700, Ken Newton wrote: [snip] >> I would think this is much more than just copy from other language >> styles or 'just' a syntax change -- the apparent widespread use would >> hint at a deeper need. > > "Apparent" is the key word there. There are lots of people who *say* this > this useful functionality, but how many of them *actually* use it? And of > those who do use it, how many of them know what they're doing? There are > an awful lot of bad programmers out there. > > If you do need such functionality, it's easy to implement. Here's one: Neither you nor me have hard evidence about popularity/unpopularity of the idea (number of places where you can find similar, more or less successful, attempts to implement it seems to testify in favour of the idea) -- nor about how it is used or abused. Obviously there are a lot of bad programmers who are able to use globals instead of function arguments etc.... Thats the fate of every language feature. But it's not the reason to resign from a feature that has particular common and proper use-cases. Even official Python tutorial mentions a case that is typical for the matter: http://docs.python.org/3.1/tutorial/classes.html#odds-and-ends > As a > general rule, if obj.x is an attribute, then every valid obj should have > an attribute x. But if obj['x'] is a key/value, then it is data-specific: > some instances will have an 'x' key, and some won't. It's often true but not always (see e.g. the above example in docs). Cheers, *j -- Jan Kaliszewski (zuo) From ubershmekel at gmail.com Sat Sep 5 18:45:44 2009 From: ubershmekel at gmail.com (Yuvgoog Greenle) Date: Sat, 5 Sep 2009 19:45:44 +0300 Subject: [Python-ideas] data structures should have an .any() method In-Reply-To: <6751FF3A-7DFE-4BB1-B391-2EE9A2DE04BA@masklinn.net> References: <4AA0FEF3.2000607@gmail.com> <4AA10198.8040401@gmail.com> <4AA10545.30408@gmail.com> <5d1a32000909050626i7de52eb1qc1503372643970c@mail.gmail.com> <5d1a32000909050647j1d5d730q355f9715b48a32c5@mail.gmail.com> <6751FF3A-7DFE-4BB1-B391-2EE9A2DE04BA@masklinn.net> Message-ID: <9d153b7c0909050945p729bc82y27f5a52c66b637c6@mail.gmail.com> I can't stand map/reduce/lambda... *instant readable makeover*def minicomp(f1, f2): def comped(*args, **kwargs): return f1(f2(*args, **kwargs)) return comped def compose(*funcs): total = funcs[0] for f in funcs[1:]: total = minicomp(total, f) return total On Sat, Sep 5, 2009 at 5:15 PM, Masklinn wrote: > On 5 Sep 2009, at 15:47 , Gerald Britton wrote: > Ah -- so not a real function then (yet)? Though something we could > >> borrow from Haskell, I suppose, even though: >> >> compose(foo,bar) == lambda x: foo(bar(x)) >> >> no? >> >> > Yeah but you could leverage Python's *args to get a compositor of more than > two functions e.g. > > def compose(*funcs): > return reduce(lambda f1, f2: > lambda *args, **kwargs: > f1(f2(*args, **kwargs)), > funcs) > > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > -- Yuv hzk.co.il -------------- next part -------------- An HTML attachment was scrubbed... URL: From g.brandl at gmx.net Sat Sep 5 18:53:07 2009 From: g.brandl at gmx.net (Georg Brandl) Date: Sat, 05 Sep 2009 18:53:07 +0200 Subject: [Python-ideas] data structures should have an .any() method In-Reply-To: <9d153b7c0909050945p729bc82y27f5a52c66b637c6@mail.gmail.com> References: <4AA0FEF3.2000607@gmail.com> <4AA10198.8040401@gmail.com> <4AA10545.30408@gmail.com> <5d1a32000909050626i7de52eb1qc1503372643970c@mail.gmail.com> <5d1a32000909050647j1d5d730q355f9715b48a32c5@mail.gmail.com> <6751FF3A-7DFE-4BB1-B391-2EE9A2DE04BA@masklinn.net> <9d153b7c0909050945p729bc82y27f5a52c66b637c6@mail.gmail.com> Message-ID: Yuvgoog Greenle schrieb: > I can't stand map/reduce/lambda... *instant readable makeover* > def minicomp(f1, f2): > def comped(*args, **kwargs): > return f1(f2(*args, **kwargs)) > return comped > > def compose(*funcs): > total = funcs[0] > for f in funcs[1:]: > total = minicomp(total, f) > return total Please, if this must be discussed again, do it in a new thread. cheers, Georg -- Thus spake the Lord: Thou shalt indent with four spaces. No more, no less. Four shall be the number of spaces thou shalt indent, and the number of thy indenting shall be four. Eight shalt thou not indent, nor either indent thou two, excepting that thou then proceed to four. Tabs are right out. From ubershmekel at gmail.com Sun Sep 6 01:49:43 2009 From: ubershmekel at gmail.com (RunThePun) Date: Sat, 5 Sep 2009 16:49:43 -0700 (PDT) Subject: [Python-ideas] Pyopt - an attempt at a pythonic optparse Message-ID: <123377d1-0b16-4deb-b540-ca9cee02e216@s39g2000yqj.googlegroups.com> I put some sweat into this one so I was hoping to see if you guys like it or have any ideas for improvement. http://code.google.com/p/pyopt/ Currently there are 2 modes of operation, 1. keyword command-line functions which create switches (-m, -r etc). 2. positional command-line functions which simply translate positional arguments from the command-line to a function. possibly in the future I'll implement a mixed keyword/positional arguments behaviour. At the moment annotations are mandatory for explicitness and here's an example usage: from pyopt import CmdPos from pyopt import parse_cmds @CmdPos def possy(archer:str, boulder:float, magic:int=42): """Shows an example positional command-line function. archer - is a str boulder - should be a float magic - a number that is magical""" print(repr(archer), repr(boulder), repr(magic)) if __name__ == "__main__": parse_cmds() Notice 4 things: * an import * a decorator * a parse_cmds() * type-annotations for casting The following functionality is exposed: C:\>example.py -h Usage: example.py archer boulder [magic] Shows an example positional command-line function. archer - is a str boulder - should be a float magic - a number that is magical C:\>example.py 1 2 3 '1' 2.0 3 C:\>example.py 1 2 '1' 2.0 42 C:\>example.py 13 2 arguments required, got only 1. Run with ? or -h for more help. From ben+python at benfinney.id.au Sun Sep 6 03:59:11 2009 From: ben+python at benfinney.id.au (Ben Finney) Date: Sun, 06 Sep 2009 11:59:11 +1000 Subject: [Python-ideas] Pyopt - an attempt at a pythonic optparse References: <123377d1-0b16-4deb-b540-ca9cee02e216@s39g2000yqj.googlegroups.com> Message-ID: <8763bw4yvk.fsf@benfinney.id.au> RunThePun writes: > I put some sweat into this one so I was hoping to see if you guys like > it or have any ideas for improvement. > http://code.google.com/p/pyopt/ Thanks for your work on this. > from pyopt import CmdPos > from pyopt import parse_cmds My main complaint at this point is the chosen names. Within Python code, I don't need to be reminded that I'm writing Python. The module names should not be ?pyopt?; you should choose a namespace that better describes what the module is for, without the ?py?. Also, please name the classes and functions so they're not needlessly CprsdWrds. Instead, choose names that contain whole words, or at least very-commonly-used abbreviations with little ambiguity. That way the names will be both more descriptive and easier to pronounce, and thus easier to remember correctly. -- \ ?[I]t is impossible for anyone to begin to learn that which he | `\ thinks he already knows.? ?Epictetus, _Discourses_ | _o__) | Ben Finney From greg.ewing at canterbury.ac.nz Sun Sep 6 07:45:34 2009 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Sun, 06 Sep 2009 17:45:34 +1200 Subject: [Python-ideas] data structures should have an .any() method In-Reply-To: References: Message-ID: <4AA34C7E.7030803@canterbury.ac.nz> Stefan Behnel wrote: > It would therefore be nice to have a common ".any()" method on data > structures that would just read an arbitrary item from a container. Rather than add a method to every container implementation, it would be easier to provide a function: def first(obj): return iter(ob).next() possibly with some embellishments to handle StopIteration, allow for a default value, etc. -- Greg From greg.ewing at canterbury.ac.nz Sun Sep 6 07:51:43 2009 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Sun, 06 Sep 2009 17:51:43 +1200 Subject: [Python-ideas] data structures should have an .any() method In-Reply-To: <4AA0FEF3.2000607@gmail.com> References: <4AA0E3F9.9010904@gmail.com> <4AA0FEF3.2000607@gmail.com> Message-ID: <4AA34DEF.4070606@canterbury.ac.nz> Nick Coghlan wrote: > That said, I'm -0 on the idea overall. If someone actually needs it, it > isn't particularly hard for them to write their own getany() function. There's a situation where the need to do this kind of thing actually arises fairly frequently -- retrieving things from a relational database. Often you're expecting exactly one result from a query, but the API always gives you a sequence, which you then have to get the first item from. Doing that over and over again gets rather tedious. -- Greg From ubershmekel at gmail.com Sun Sep 6 09:34:10 2009 From: ubershmekel at gmail.com (Yuvgoog Greenle) Date: Sun, 6 Sep 2009 10:34:10 +0300 Subject: [Python-ideas] Pyopt - an attempt at a pythonic optparse In-Reply-To: <8763bw4yvk.fsf@benfinney.id.au> References: <123377d1-0b16-4deb-b540-ca9cee02e216@s39g2000yqj.googlegroups.com> <8763bw4yvk.fsf@benfinney.id.au> Message-ID: <9d153b7c0909060034s4de0fd4an526cc6aae9157a31@mail.gmail.com> How about these names for the module:1. optionparse 2. shlopt (for shell options, sounds kinda cute albeit less descriptive) 3. shelloptions 4. ? And the decorator Names: 1. shell_expose_kwargs and shell_expose_args 2. expose_keywords and expose_arguments 3. expose and expose_keywords? 4. ? On Sun, Sep 6, 2009 at 4:59 AM, Ben Finney > wrote: > RunThePun writes: > > > I put some sweat into this one so I was hoping to see if you guys like > > it or have any ideas for improvement. > > http://code.google.com/p/pyopt/ > > Thanks for your work on this. > > > from pyopt import CmdPos > > from pyopt import parse_cmds > > My main complaint at this point is the chosen names. > > Within Python code, I don't need to be reminded that I'm writing Python. > The module names should not be ?pyopt?; you should choose a namespace > that better describes what the module is for, without the ?py?. > > Also, please name the classes and functions so they're not needlessly > CprsdWrds. Instead, choose names that contain whole words, or at least > very-commonly-used abbreviations with little ambiguity. That way the > names will be both more descriptive and easier to pronounce, and thus > easier to remember correctly. > > -- > \ ?[I]t is impossible for anyone to begin to learn that which he | > `\ thinks he already knows.? ?Epictetus, _Discourses_ | > _o__) | > Ben Finney > > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > -- Yuv hzk.co.il -------------- next part -------------- An HTML attachment was scrubbed... URL: From jafo at tummy.com Sun Sep 6 10:14:00 2009 From: jafo at tummy.com (Sean Reifschneider) Date: Sun, 06 Sep 2009 02:14:00 -0600 Subject: [Python-ideas] Pyopt - an attempt at a pythonic optparse In-Reply-To: <9d153b7c0909060034s4de0fd4an526cc6aae9157a31@mail.gmail.com> References: <123377d1-0b16-4deb-b540-ca9cee02e216@s39g2000yqj.googlegroups.com> <8763bw4yvk.fsf@benfinney.id.au> <9d153b7c0909060034s4de0fd4an526cc6aae9157a31@mail.gmail.com> Message-ID: <4AA36F48.5040109@tummy.com> On 09/06/2009 01:34 AM, Yuvgoog Greenle wrote: > How about these names for the module: > 1. optionparse > 2. shlopt (for shell options, sounds kinda cute albeit less descriptive) It shouldn't really IMHO be called anything having to do with "opt" because that typically means the "-" (or in Windows "/") options (as in "getopt" and "optik" which is now "optparse"). Maybe "shellargs" or "argparse"? I think this is a really good start, thanks. Sean -- Sean Reifschneider, Member of Technical Staff tummy.com, ltd. - Linux Consulting since 1995: Ask me about High Availability -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 252 bytes Desc: OpenPGP digital signature URL: From g.brandl at gmx.net Sun Sep 6 10:18:53 2009 From: g.brandl at gmx.net (Georg Brandl) Date: Sun, 06 Sep 2009 10:18:53 +0200 Subject: [Python-ideas] data structures should have an .any() method In-Reply-To: <4AA34DEF.4070606@canterbury.ac.nz> References: <4AA0E3F9.9010904@gmail.com> <4AA0FEF3.2000607@gmail.com> <4AA34DEF.4070606@canterbury.ac.nz> Message-ID: Greg Ewing schrieb: > Nick Coghlan wrote: > >> That said, I'm -0 on the idea overall. If someone actually needs it, it >> isn't particularly hard for them to write their own getany() function. > > There's a situation where the need to do this kind of > thing actually arises fairly frequently -- retrieving > things from a relational database. Often you're > expecting exactly one result from a query, but the > API always gives you a sequence, which you then have > to get the first item from. Doing that over and > over again gets rather tedious. But if it's a sequence, you can simply do s[0], can't you? Georg -- Thus spake the Lord: Thou shalt indent with four spaces. No more, no less. Four shall be the number of spaces thou shalt indent, and the number of thy indenting shall be four. Eight shalt thou not indent, nor either indent thou two, excepting that thou then proceed to four. Tabs are right out. From steve at pearwood.info Sun Sep 6 10:21:30 2009 From: steve at pearwood.info (Steven D'Aprano) Date: Sun, 6 Sep 2009 18:21:30 +1000 Subject: [Python-ideas] Pyopt - an attempt at a pythonic optparse In-Reply-To: <4AA36F48.5040109@tummy.com> References: <123377d1-0b16-4deb-b540-ca9cee02e216@s39g2000yqj.googlegroups.com> <9d153b7c0909060034s4de0fd4an526cc6aae9157a31@mail.gmail.com> <4AA36F48.5040109@tummy.com> Message-ID: <200909061821.32540.steve@pearwood.info> On Sun, 6 Sep 2009 06:14:00 pm Sean Reifschneider wrote: > On 09/06/2009 01:34 AM, Yuvgoog Greenle wrote: > > How about these names for the module: > > 1. optionparse > > 2. shlopt (for shell options, sounds kinda cute albeit less > > descriptive) > > It shouldn't really IMHO be called anything having to do with "opt" > because that typically means the "-" (or in Windows "/") options (as > in "getopt" and "optik" which is now "optparse"). But handling the dash command line options is exactly what the module is about. Perhaps the author can explain why this module is better or more pythonic than the two existing solutions already in the standard library. More importantly, while I'm sure the author is excited by his project, why has it been announced on this list? It seems to be off-topic to my mind. -- Steven D'Aprano From steve at pearwood.info Sun Sep 6 10:23:23 2009 From: steve at pearwood.info (Steven D'Aprano) Date: Sun, 6 Sep 2009 18:23:23 +1000 Subject: [Python-ideas] data structures should have an .any() method In-Reply-To: <4AA34DEF.4070606@canterbury.ac.nz> References: <4AA0FEF3.2000607@gmail.com> <4AA34DEF.4070606@canterbury.ac.nz> Message-ID: <200909061823.23915.steve@pearwood.info> On Sun, 6 Sep 2009 03:51:43 pm Greg Ewing wrote: > Nick Coghlan wrote: > > That said, I'm -0 on the idea overall. If someone actually needs > > it, it isn't particularly hard for them to write their own getany() > > function. > > There's a situation where the need to do this kind of > thing actually arises fairly frequently -- retrieving > things from a relational database. Often you're > expecting exactly one result from a query, but the > API always gives you a sequence, which you then have > to get the first item from. Doing that over and > over again gets rather tedious. If you're expecting "exactly one result", then surely it should be an error to receive more than one result? Rather than ask for "any" result and ignoring any unexpected extra items, I think it would be better to have a helper function that verifies you have got exactly one result. -- Steven D'Aprano From rob.cliffe at btinternet.com Sun Sep 6 18:41:33 2009 From: rob.cliffe at btinternet.com (Rob Cliffe) Date: Sun, 6 Sep 2009 17:41:33 +0100 Subject: [Python-ideas] Decorator syntax restriction Message-ID: <4E8F67D5F8CC4562A71B1516F7957D7A@robslaptop> Can I make another plea for the syntax following '@' to be an unrestricted expression? Guido has said he has a 'gut feeling' against this but has not as far as I know rationalised it. 1) It is inconsistent with Python in general (unPythonic) to impose arbitrary restrictions in one particular place, and hard to explain to someone learning the language. 2) The restriction is in any case more apparent than real, as @ # disallowed, SyntaxError can be implemented, albeit in a more verbose aka less Pythonic was, as: AnyExpr = @AnyExpr or as def Identity(x): return x ... @Identity( ) # smuggle in as func arg 3) I propose the following as plausible use cases (I know other people will have their own): 3.1) @DecoratorList[index] 3.2) @DecoratorDictionary[key] 3.3) @Decorator1 if else Decorator2 # Special case of the last one: def Identity(x): return x @Decorator if __debug__ else Identity Xavier Morel has pointed out that 3.1) can be implemented now as @DecoratorList.__getitem__[index] but this doesn't seem a good reason for forbidding the simpler syntax; after all Python allows the simpler syntax in other contexts. Similarly 3.2) can be written as @DecoratorDictionary.get(key) (As an aside, perhaps a decorator that evaluates to None could be treated at run-time the same as no decorator, i.e. equivalent to the Identity function in the above examples. Currently it naturally raises TypeError: 'NoneType' object is not callable. Just a thought.) Finally, sorry if I have not sent this e-mail to the right place (I wanted to attach it to the 'allow lambdas as decorators' thread but don't yet know how to do this). Also sorry that this partly duplicates a message I sent to python-dev. I am still finding my way round the Python mailing lists. Best wishes Rob Cliffe -------------- next part -------------- An HTML attachment was scrubbed... URL: From fetchinson at googlemail.com Sun Sep 6 21:30:34 2009 From: fetchinson at googlemail.com (Daniel Fetchinson) Date: Sun, 6 Sep 2009 12:30:34 -0700 Subject: [Python-ideas] Decorator syntax restriction In-Reply-To: <4E8F67D5F8CC4562A71B1516F7957D7A@robslaptop> References: <4E8F67D5F8CC4562A71B1516F7957D7A@robslaptop> Message-ID: > Can I make another plea for the syntax following '@' to be an unrestricted > expression? Guido has said he has a 'gut feeling' against this but has not > as far as I know rationalised it. > > 1) It is inconsistent with Python in general (unPythonic) to impose > arbitrary restrictions in one particular place, and hard to explain to > someone learning the language. > > 2) The restriction is in any case more apparent than real, > as > @ # disallowed, SyntaxError > can be implemented, albeit in a more verbose aka less Pythonic was, as: What makes you think that if something is 'more verbose' it is 'less pythonic'? I actually like the fact that python doesn't try condensing everything into one-liners and special symbols. I never really understood this need for being not verbose, but it does periodically come up on this list (and pretty much on every other programming list). Your fingers get tired? It takes too long to read an extra line? You are running out of space on your harddrive? It takes too long to transfer the source file over the network because of the extra line? Honestly, why do some people set for themselves the goal of "let's have as few characters in a source file as possible"? Cheers, Daniel > AnyExpr = > @AnyExpr > or as > > def Identity(x): return x > ... > @Identity( ) # smuggle in as func arg > > 3) I propose the following as plausible use cases (I know other people will > have their own): > > 3.1) > @DecoratorList[index] > > 3.2) > @DecoratorDictionary[key] > > 3.3) > @Decorator1 if else Decorator2 > # Special case of the last one: > def Identity(x): return x > @Decorator if __debug__ else Identity > > Xavier Morel has pointed out that 3.1) can be implemented now as > @DecoratorList.__getitem__[index] > but this doesn't seem a good reason for forbidding the simpler syntax; after > all Python allows the simpler syntax in other contexts. Similarly 3.2) can > be written as > @DecoratorDictionary.get(key) > > (As an aside, perhaps a decorator that evaluates to None could be treated at > run-time the same as no decorator, i.e. equivalent to the Identity function > in the above examples. Currently it naturally raises TypeError: 'NoneType' > object is not callable. Just a thought.) > > Finally, sorry if I have not sent this e-mail to the right place (I wanted > to attach it to the 'allow lambdas as decorators' thread but don't yet know > how to do this). Also sorry that this partly duplicates a message I sent to > python-dev. I am still finding my way round the Python mailing lists. > > Best wishes > Rob Cliffe -- Psss, psss, put it down! - http://www.cafepress.com/putitdown From jafo at tummy.com Sun Sep 6 21:39:41 2009 From: jafo at tummy.com (Sean Reifschneider) Date: Sun, 06 Sep 2009 13:39:41 -0600 Subject: [Python-ideas] Pyopt - an attempt at a pythonic optparse In-Reply-To: <200909061821.32540.steve@pearwood.info> References: <123377d1-0b16-4deb-b540-ca9cee02e216@s39g2000yqj.googlegroups.com> <9d153b7c0909060034s4de0fd4an526cc6aae9157a31@mail.gmail.com> <4AA36F48.5040109@tummy.com> <200909061821.32540.steve@pearwood.info> Message-ID: <4AA40FFD.1010601@tummy.com> On 09/06/2009 02:21 AM, Steven D'Aprano wrote: > But handling the dash command line options is exactly what the module is > about. The example at the referenced page looks like it's all about handling non-option arguments. I must be missing something. Sean -- Sean Reifschneider, Member of Technical Staff tummy.com, ltd. - Linux Consulting since 1995: Ask me about High Availability -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 252 bytes Desc: OpenPGP digital signature URL: From mwm-keyword-python.b4bdba at mired.org Sun Sep 6 22:07:07 2009 From: mwm-keyword-python.b4bdba at mired.org (Mike Meyer) Date: Sun, 6 Sep 2009 16:07:07 -0400 Subject: [Python-ideas] Decorator syntax restriction In-Reply-To: References: <4E8F67D5F8CC4562A71B1516F7957D7A@robslaptop> Message-ID: <20090906160707.7883dac5@bhuda.mired.org> On Sun, 6 Sep 2009 12:30:34 -0700 Daniel Fetchinson wrote: > What makes you think that if something is 'more verbose' it is 'less > pythonic'? I actually like the fact that python doesn't try condensing > everything into one-liners and special symbols. Agreed. Readability, not succinctness, is what's pythonic. Being succinct usually - but not always - improves readability. > I never really understood this need for being not verbose, but it does > periodically come up on this list (and pretty much on every other > programming list). Your fingers get tired? It takes too long to read > an extra line? You are running out of space on your harddrive? It > takes too long to transfer the source file over the network because of > the extra line? > > Honestly, why do some people set for themselves the goal of "let's > have as few characters in a source file as possible"? Paul Graham (generally a very sharp guy) summarizes most of the reasons in http://www.paulgraham.com/power.html. I provide my attempt at a counterargument in http://www.mired.org/home/mwm/papers/readability.html. http://www.mired.org/consulting.html Independent Network/Unix/Perforce consultant, email for more information. O< ascii ribbon campaign - stop html mail - www.asciiribbon.org From brett at python.org Sun Sep 6 22:18:51 2009 From: brett at python.org (Brett Cannon) Date: Sun, 6 Sep 2009 13:18:51 -0700 Subject: [Python-ideas] Decorator syntax restriction In-Reply-To: <4E8F67D5F8CC4562A71B1516F7957D7A@robslaptop> References: <4E8F67D5F8CC4562A71B1516F7957D7A@robslaptop> Message-ID: On Sun, Sep 6, 2009 at 09:41, Rob Cliffe wrote: > Can I make another plea for the syntax following '@' to be an unrestricted > expression?? Guido has said he has a 'gut feeling' against this but has not > as far as I know rationalised it. > When it comes to Guido's gut, a rationalization isn't needed. Perk of being BDFL. Plus his gut is right so often it tends to not be questioned. > 1) It is inconsistent with Python in general (unPythonic) to impose > arbitrary restrictions in one particular place, and hard to explain to > someone learning the language. > It's not difficult to explain; decorators can only be a dotted name w/ an optional method call and its corresponding arguments. It keeps the syntax simple and clean, IMO. Decorators add a mental overhead of having to think about what they will do to a function when reading the code. If I then also have to figure out what an arbitrary expression evaluates to in order to figure that out that is more mental effort than needed. Yes, you can do whatever with the decorator you are passing in, but hopefully you are not so evil/stupid as to make a decorator that copmlicated. Give people the power of full expressions and that will happen more often. > 2) The restriction is in any case more apparent than real, > as > ??? @ # disallowed, SyntaxError > can be implemented, albeit in a more verbose aka less Pythonic was, as: > > ??? AnyExpr = > ??? @AnyExpr > > or as > > ??? def Identity(x): return x > ???? ... > ????@Identity( ) # smuggle in as func arg > And we almost ditched lambdas in Python 3 because you can implement them in the same way. The only reason they got to stick around was they were already in use and people threw a fit over them. > 3) I propose the following as plausible use cases (I know other people will > have their own): > > 3.1) > ??? @DecoratorList[index] > > 3.2) > ??? @DecoratorDictionary[key] > > 3.3) > ??? @Decorator1 if? else Decorator2 > #?? Special case of?the last one: > ??? def Identity(x): return x > ??? @Decorator if __debug__ else Identity > Plausible does not equal useful. You need to show that this actually comes up in normal coding for a decent amount of Python code to warrant tweaking the language over. > Xavier Morel has pointed out that 3.1) can be implemented now as > ??? @DecoratorList.__getitem__[index] > but this doesn't seem a good reason for forbidding the simpler syntax; after > all Python allows the simpler syntax in other contexts.? Similarly 3.2) can > be written as > ??? @DecoratorDictionary.get(key) > > (As an aside, perhaps a decorator that evaluates to None could be treated at > run-time the same as no decorator, i.e. equivalent to the Identity function > in the above examples.? Currently?it naturally raises TypeError: 'NoneType' > object is not callable.? Just a thought.) That's not going to happen. =) Complicates the bytecode unnecessarily. Once again, this needs to actually come up in regular usage to warrant even considering the change. > > Finally, sorry if I have not sent this e-mail to the right place (I wanted > to attach it to the 'allow lambdas as decorators' thread but don't yet know > how to do this).? Also sorry that this partly duplicates a message I sent to > python-dev.? I am still finding my way round the Python mailing lists. No, this is the place to send thought out proposals for changing Python before they get promoted to hitting python-dev. -Brett From ncoghlan at gmail.com Sun Sep 6 22:44:42 2009 From: ncoghlan at gmail.com (Nick Coghlan) Date: Mon, 07 Sep 2009 06:44:42 +1000 Subject: [Python-ideas] Decorator syntax restriction In-Reply-To: References: <4E8F67D5F8CC4562A71B1516F7957D7A@robslaptop> Message-ID: <4AA41F3A.1070808@gmail.com> Brett Cannon wrote: > On Sun, Sep 6, 2009 at 09:41, Rob Cliffe wrote: >> Can I make another plea for the syntax following '@' to be an unrestricted >> expression? Guido has said he has a 'gut feeling' against this but has not >> as far as I know rationalised it. >> > > When it comes to Guido's gut, a rationalization isn't needed. Perk of > being BDFL. Plus his gut is right so often it tends to not be > questioned. > >> 1) It is inconsistent with Python in general (unPythonic) to impose >> arbitrary restrictions in one particular place, and hard to explain to >> someone learning the language. >> > > It's not difficult to explain; decorators can only be a dotted name w/ > an optional method call and its corresponding arguments. >From the last discussion, I believe Guido was actually amenable to the idea of extending this to allow a subscript operation as well, so a decorator could be pulled from a sequence or map of decorators without requiring an otherwise unnecessary function call. So what's needed at this point is for someone that is bothered by the restriction to come up with a patch to loosen the restriction without getting rid of it entirely. Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- From solipsis at pitrou.net Sun Sep 6 23:45:49 2009 From: solipsis at pitrou.net (Antoine Pitrou) Date: Sun, 6 Sep 2009 21:45:49 +0000 (UTC) Subject: [Python-ideas] data structures should have an .any() method References: <4AA0FEF3.2000607@gmail.com> <4AA34DEF.4070606@canterbury.ac.nz> <200909061823.23915.steve@pearwood.info> Message-ID: Steven D'Aprano writes: > > If you're expecting "exactly one result", then surely it should be an > error to receive more than one result? Rather than ask for "any" result > and ignoring any unexpected extra items, I think it would be better to > have a helper function that verifies you have got exactly one result. Why do you need a helper function? Simply write: x, = db.query("SELECT blah...") and you'll get a ValueError if there isn't exactly one item in the sequence. Regards Antoine. From solipsis at pitrou.net Sun Sep 6 23:46:53 2009 From: solipsis at pitrou.net (Antoine Pitrou) Date: Sun, 6 Sep 2009 21:46:53 +0000 (UTC) Subject: [Python-ideas] data structures should have an .any() method References: <4AA0E3F9.9010904@gmail.com> <4AA0FEF3.2000607@gmail.com> <4AA10198.8040401@gmail.com> <4AA10545.30408@gmail.com> Message-ID: Georg Brandl writes: > > or > > any_item = compose(next, iter) Endly, an use case for compose! From zuo at chopin.edu.pl Mon Sep 7 01:37:35 2009 From: zuo at chopin.edu.pl (Jan Kaliszewski) Date: Mon, 07 Sep 2009 01:37:35 +0200 Subject: [Python-ideas] possible attribute-oriented class In-Reply-To: <4AA3FD65.7000403@stoneleaf.us> References: <8fd67d4b0909031546o423cf066wf7096651bbc46ee0@mail.gmail.com> <8fd67d4b0909040955k590b8d6fv9012165d847b02d9@mail.gmail.com> <02b1dd77$0$17565$c3e8da3@news.astraweb.com> <8fd67d4b0909042247h167226edv6693c314ae5a16eb@mail.gmail.com> <02b225c8$0$17565$c3e8da3@news.astraweb.com> <4AA3FD65.7000403@stoneleaf.us> Message-ID: 06-09-2009 o 20:20:21 Ethan Furman wrote: > In the dbf module I wrote, I use both the attribute access and the key > lookup. The attribute access is great for interactive use, and for all > the routines that play with the tables we have at work, where all the > field names are indeed known at compile (aka coding) time. On the other > hand, some routines don't know which fields they'll mucking about with, > and so the key access is vital for them. > > Of course, I could have done the whole thing using key access, and I did > have to impose some restrictions on method names so they wouldn't clash > with possible field names, but I love being able to type > > current_record.full_name == last_record.full_name > > instead of > > current_record['full_name'] == last_record['full_name'] Me too, and I suppose many people too... The latter: * makes your code less readable if there is high density of such expressions; * makes typing much more strenuous/irritating -- what is not very important in case of advanced development (when time of typing is short in relation to time of thinking/reading/testing) but becomes quite important in case of scripting (which is still important area of Python usage). -- Jan Kaliszewski (zuo) From greg.ewing at canterbury.ac.nz Mon Sep 7 02:32:44 2009 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Mon, 07 Sep 2009 12:32:44 +1200 Subject: [Python-ideas] Pyopt - an attempt at a pythonic optparse In-Reply-To: <4AA36F48.5040109@tummy.com> References: <123377d1-0b16-4deb-b540-ca9cee02e216@s39g2000yqj.googlegroups.com> <8763bw4yvk.fsf@benfinney.id.au> <9d153b7c0909060034s4de0fd4an526cc6aae9157a31@mail.gmail.com> <4AA36F48.5040109@tummy.com> Message-ID: <4AA454AC.60709@canterbury.ac.nz> Sean Reifschneider wrote: > Maybe "shellargs" or "argparse"? I don't think it should have "shell" in it, because the module doesn't really have anything to do with the shell. The shell is not the only way of launching a program and passing args to it. -- Greg From ubershmekel at gmail.com Mon Sep 7 02:34:52 2009 From: ubershmekel at gmail.com (Yuvgoog Greenle) Date: Mon, 7 Sep 2009 03:34:52 +0300 Subject: [Python-ideas] Pyopt - an attempt at a pythonic optparse In-Reply-To: <4AA40FFD.1010601@tummy.com> References: <123377d1-0b16-4deb-b540-ca9cee02e216@s39g2000yqj.googlegroups.com> <9d153b7c0909060034s4de0fd4an526cc6aae9157a31@mail.gmail.com> <4AA36F48.5040109@tummy.com> <200909061821.32540.steve@pearwood.info> <4AA40FFD.1010601@tummy.com> Message-ID: <9d153b7c0909061734x615a97fao6e974110744173ff@mail.gmail.com> Currently the module handles positional arguments with one decorator and keyword arguments (with '-' switches) using another decorator. Maybe I should have linked to this more complete examples page: http://code.google.com/p/pyopt/wiki/Examples Steven: Forgive me, I am new to this list and now after checking I'm guessing you mean "stdlib-sig" is more fitting. So I'll move this discussion there. Just to clarify why I feel getopt/optparse aren't as pythonic - to me they feel clunky in that I need alot of ultra-explicit, extra-long lines to do some very basic things. The python I'm used to allows me to use open('filename').read() with reasonable default parameters. Now that I think of it, actually wrapping optparse might have been an easier implementation route for some of the functionality... Yuv On Sun, Sep 6, 2009 at 10:39 PM, Sean Reifschneider wrote: > On 09/06/2009 02:21 AM, Steven D'Aprano wrote: > > But handling the dash command line options is exactly what the module is > > about. > > The example at the referenced page looks like it's all about handling > non-option arguments. I must be missing something. > > Sean > -- > Sean Reifschneider, Member of Technical Staff > tummy.com, ltd. - Linux Consulting since 1995: Ask me about High > Availability > > > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > > -- Yuv hzk.co.il -------------- next part -------------- An HTML attachment was scrubbed... URL: From greg.ewing at canterbury.ac.nz Mon Sep 7 02:35:14 2009 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Mon, 07 Sep 2009 12:35:14 +1200 Subject: [Python-ideas] data structures should have an .any() method In-Reply-To: References: <4AA0E3F9.9010904@gmail.com> <4AA0FEF3.2000607@gmail.com> <4AA34DEF.4070606@canterbury.ac.nz> Message-ID: <4AA45542.1080102@canterbury.ac.nz> Georg Brandl wrote: > But if it's a sequence, you can simply do s[0], can't you? Not if it's an iterator, which it probably will be in the case of a DB API. -- Greg From greg.ewing at canterbury.ac.nz Mon Sep 7 02:39:28 2009 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Mon, 07 Sep 2009 12:39:28 +1200 Subject: [Python-ideas] data structures should have an .any() method In-Reply-To: <200909061823.23915.steve@pearwood.info> References: <4AA0FEF3.2000607@gmail.com> <4AA34DEF.4070606@canterbury.ac.nz> <200909061823.23915.steve@pearwood.info> Message-ID: <4AA45640.4020605@canterbury.ac.nz> Steven D'Aprano wrote: > If you're expecting "exactly one result", then surely it should be an > error to receive more than one result? Rather than ask for "any" result > and ignoring any unexpected extra items, I think it would be better to > have a helper function that verifies you have got exactly one result. Yes, that could be useful. Maybe call it "one"? -- Greg From fetchinson at googlemail.com Mon Sep 7 02:48:21 2009 From: fetchinson at googlemail.com (Daniel Fetchinson) Date: Sun, 6 Sep 2009 17:48:21 -0700 Subject: [Python-ideas] Decorator syntax restriction In-Reply-To: <20090906160707.7883dac5@bhuda.mired.org> References: <4E8F67D5F8CC4562A71B1516F7957D7A@robslaptop> <20090906160707.7883dac5@bhuda.mired.org> Message-ID: >> What makes you think that if something is 'more verbose' it is 'less >> pythonic'? I actually like the fact that python doesn't try condensing >> everything into one-liners and special symbols. > > Agreed. Readability, not succinctness, is what's pythonic. Being > succinct usually - but not always - improves readability. > >> I never really understood this need for being not verbose, but it does >> periodically come up on this list (and pretty much on every other >> programming list). Your fingers get tired? It takes too long to read >> an extra line? You are running out of space on your harddrive? It >> takes too long to transfer the source file over the network because of >> the extra line? >> >> Honestly, why do some people set for themselves the goal of "let's >> have as few characters in a source file as possible"? > > Paul Graham (generally a very sharp guy) summarizes most of the > reasons in http://www.paulgraham.com/power.html. Thanks, this answers my question why people think this way. Although I'm still totally convinced that guys like Paul Graham, or anybody else who believes in shorter code, are misguided. > I provide my attempt at a counterargument in > http://www.mired.org/home/mwm/papers/readability.html. Yep, I more-or-less agree with you. Cheers, Daniel -- Psss, psss, put it down! - http://www.cafepress.com/putitdown From ubershmekel at gmail.com Mon Sep 7 03:07:34 2009 From: ubershmekel at gmail.com (Yuvgoog Greenle) Date: Mon, 7 Sep 2009 04:07:34 +0300 Subject: [Python-ideas] Pyopt - an attempt at a pythonic optparse In-Reply-To: <4AA454AC.60709@canterbury.ac.nz> References: <123377d1-0b16-4deb-b540-ca9cee02e216@s39g2000yqj.googlegroups.com> <8763bw4yvk.fsf@benfinney.id.au> <9d153b7c0909060034s4de0fd4an526cc6aae9157a31@mail.gmail.com> <4AA36F48.5040109@tummy.com> <4AA454AC.60709@canterbury.ac.nz> Message-ID: <9d153b7c0909061807g19016cfcmd96f1ce93a7dbf54@mail.gmail.com> What do you think of the name "optionize" for the module then? @optionize.positional @optionize.keyword or maybe @optionize.args @optionize.kwargs On Mon, Sep 7, 2009 at 3:32 AM, Greg Ewing wrote: > Sean Reifschneider wrote: > > Maybe "shellargs" or "argparse"? >> > > I don't think it should have "shell" in it, because the > module doesn't really have anything to do with the shell. > The shell is not the only way of launching a program and > passing args to it. > > -- > Greg > > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > -- Yuv hzk.co.il -------------- next part -------------- An HTML attachment was scrubbed... URL: From greg.ewing at canterbury.ac.nz Mon Sep 7 03:52:50 2009 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Mon, 07 Sep 2009 13:52:50 +1200 Subject: [Python-ideas] Pyopt - an attempt at a pythonic optparse In-Reply-To: <9d153b7c0909061807g19016cfcmd96f1ce93a7dbf54@mail.gmail.com> References: <123377d1-0b16-4deb-b540-ca9cee02e216@s39g2000yqj.googlegroups.com> <8763bw4yvk.fsf@benfinney.id.au> <9d153b7c0909060034s4de0fd4an526cc6aae9157a31@mail.gmail.com> <4AA36F48.5040109@tummy.com> <4AA454AC.60709@canterbury.ac.nz> <9d153b7c0909061807g19016cfcmd96f1ce93a7dbf54@mail.gmail.com> Message-ID: <4AA46772.5050405@canterbury.ac.nz> Yuvgoog Greenle wrote: > What do you think of the name "optionize" for the module then? -1, too clever and not informative enough. -- Greg From tleeuwenburg at gmail.com Mon Sep 7 03:54:59 2009 From: tleeuwenburg at gmail.com (Tennessee Leeuwenburg) Date: Mon, 7 Sep 2009 11:54:59 +1000 Subject: [Python-ideas] Pyopt - an attempt at a pythonic optparse In-Reply-To: <4AA46772.5050405@canterbury.ac.nz> References: <123377d1-0b16-4deb-b540-ca9cee02e216@s39g2000yqj.googlegroups.com> <8763bw4yvk.fsf@benfinney.id.au> <9d153b7c0909060034s4de0fd4an526cc6aae9157a31@mail.gmail.com> <4AA36F48.5040109@tummy.com> <4AA454AC.60709@canterbury.ac.nz> <9d153b7c0909061807g19016cfcmd96f1ce93a7dbf54@mail.gmail.com> <4AA46772.5050405@canterbury.ac.nz> Message-ID: <43c8685c0909061854t40ca8467j39a37f5a2827826c@mail.gmail.com> Maybe opthandler? optparse2? Of course, not worth letting naming get in the way of progress... -------------- next part -------------- An HTML attachment was scrubbed... URL: From eric at trueblade.com Mon Sep 7 03:29:24 2009 From: eric at trueblade.com (Eric Smith) Date: Sun, 06 Sep 2009 21:29:24 -0400 Subject: [Python-ideas] Pyopt - an attempt at a pythonic optparse In-Reply-To: <4AA454AC.60709@canterbury.ac.nz> References: <123377d1-0b16-4deb-b540-ca9cee02e216@s39g2000yqj.googlegroups.com> <8763bw4yvk.fsf@benfinney.id.au> <9d153b7c0909060034s4de0fd4an526cc6aae9157a31@mail.gmail.com> <4AA36F48.5040109@tummy.com> <4AA454AC.60709@canterbury.ac.nz> Message-ID: <4AA461F4.8010804@trueblade.com> Greg Ewing wrote: > Sean Reifschneider wrote: > >> Maybe "shellargs" or "argparse"? > > I don't think it should have "shell" in it, because the > module doesn't really have anything to do with the shell. > The shell is not the only way of launching a program and > passing args to it. And the name "argparse" is already used by a popular package: http://code.google.com/p/argparse/ From stephen at xemacs.org Mon Sep 7 04:13:09 2009 From: stephen at xemacs.org (Stephen J. Turnbull) Date: Mon, 07 Sep 2009 11:13:09 +0900 Subject: [Python-ideas] Decorator syntax restriction In-Reply-To: References: <4E8F67D5F8CC4562A71B1516F7957D7A@robslaptop> <20090906160707.7883dac5@bhuda.mired.org> Message-ID: <87k50b8pu2.fsf@uwakimon.sk.tsukuba.ac.jp> Daniel Fetchinson writes: > >> What makes you think that if something is 'more verbose' it is 'less > >> pythonic'? I actually like the fact that python doesn't try condensing > >> everything into one-liners and special symbols. Yes. > > Agreed. Readability, not succinctness, is what's pythonic. Being > > succinct usually - but not always - improves readability. Yes. > >> I never really understood this need for being not verbose, but it does > >> periodically come up on this list (and pretty much on every other > >> programming list). > >> Your fingers get tired? Yes. See Jan Kaliszewski's post in the "possible attribute-oriented class" thread. His reasoning is valid, though I don't sympathize with it personally. > >> It takes too long to read an extra line? Yes, when "too long" has the semantics "I read this repeatedly in a short space and don't need to see the whole thing over and over again. In fact, it gets in my way when reading an 'array' of the sme idiom." This is what Paul Graham means by (expressive) power, I believe. He mentions metrics like number of characters or lines, but he says what he really wants is something like the number of leaves in the AST. If the "this" is something local, then you use a function (or sometimes a macro if available) at that level of locality. But if the idiom appears across many programs, then it may be a good idea to turn it into a standard builtin, or even syntax. I believe this is the gist of Graham's argument, and it's very close to the criteria for adding syntax in the Zen (actually, the apocrypha, stuff like "not every three-line function needs to be a builtin" aren't canonized). > >> You are running out of space on your harddrive? > >> It takes too long to transfer the source file over the network > >> because of the extra line? Both of those are silly. If you use compression, it will work out about the same anyway. > >> Honestly, why do some people set for themselves the goal of "let's > >> have as few characters in a source file as possible"? Mostly the ones who show up on Python lists don't have such a goal. They just want the ache in their hands and arms to go away, one unnecessary character at a time. > > Paul Graham (generally a very sharp guy) summarizes most of the > > reasons in http://www.paulgraham.com/power.html. > > Thanks, this answers my question why people think this way. Although > I'm still totally convinced that guys like Paul Graham, or anybody > else who believes in shorter code, are misguided. > > > I provide my attempt at a counterargument in > > http://www.mired.org/home/mwm/papers/readability.html. > > Yep, I more-or-less agree with you. But Paul Graham does, too, AFAICS. ISTM that what Paul G. doesn't get is that Paul P.'s epigram is more along the lines of Emerson's epigram. To put it in the same style, "A bogus succinctness is the hobgoblin of L2-cache-deprived minds (and RSI-hobbled wrists)." To me, the argument on "mired" seems quite complementary to the argument Graham makes, in that it shows how Python actually is succinct in the sense that Graham proposes, despite not minimizing character, token, or line counts. From fetchinson at googlemail.com Mon Sep 7 04:53:55 2009 From: fetchinson at googlemail.com (Daniel Fetchinson) Date: Sun, 6 Sep 2009 19:53:55 -0700 Subject: [Python-ideas] Decorator syntax restriction In-Reply-To: <2DBF7599EA49409F938951A0D3953635@robslaptop> References: <4E8F67D5F8CC4562A71B1516F7957D7A@robslaptop> <2DBF7599EA49409F938951A0D3953635@robslaptop> Message-ID: > I happen to think that the longer forms in all the relevant examples that I > gave are harder to understand, because they introduce an extra step that is > not relevant to the job to be done; rather, it is a distraction that adds an > extra 'kink' in the flow of thought. I actually agree with you, in the case of decorators, and for some of the cases you discussed (for example allowing @decorator[5] syntax), it is true that the short forms are readable and I don't see any problem with them. My only concern was the general statement 'more verbose = unpythonic'. > You may disagree - fine - but it should be up to the judgement of the > programmer, within reason, how concise or how verbose to be. I don't fully agree. Some obfuscated, hard-to-read, etc forms should I think be explicitly forbidden. Luckily, python does forbid lots of constructs which would be very hard to follow. > In this case, > the language should not force me to go the extra mile with an arbitrary > restriction, when there is no reason to (no difficulty of implementation, as > I understand it). Yes, again, I fully agree with you on this particular case. Cheers, Daniel > >>> Can I make another plea for the syntax following '@' to be an >>> unrestricted >>> expression? Guido has said he has a 'gut feeling' against this but has >>> not >>> as far as I know rationalised it. >>> >>> 1) It is inconsistent with Python in general (unPythonic) to impose >>> arbitrary restrictions in one particular place, and hard to explain to >>> someone learning the language. >>> >>> 2) The restriction is in any case more apparent than real, >>> as >>> @ # disallowed, SyntaxError >>> can be implemented, albeit in a more verbose aka less Pythonic was, as: >> >> What makes you think that if something is 'more verbose' it is 'less >> pythonic'? I actually like the fact that python doesn't try condensing >> everything into one-liners and special symbols. >> >> I never really understood this need for being not verbose, but it does >> periodically come up on this list (and pretty much on every other >> programming list). Your fingers get tired? It takes too long to read >> an extra line? You are running out of space on your harddrive? It >> takes too long to transfer the source file over the network because of >> the extra line? >> >> Honestly, why do some people set for themselves the goal of "let's >> have as few characters in a source file as possible"? >> >> Cheers, >> Daniel >> >> >>> AnyExpr = >>> @AnyExpr >>> or as >>> >>> def Identity(x): return x >>> ... >>> @Identity( ) # smuggle in as func arg >>> >>> 3) I propose the following as plausible use cases (I know other people >>> will >>> have their own): >>> >>> 3.1) >>> @DecoratorList[index] >>> >>> 3.2) >>> @DecoratorDictionary[key] >>> >>> 3.3) >>> @Decorator1 if else Decorator2 >>> # Special case of the last one: >>> def Identity(x): return x >>> @Decorator if __debug__ else Identity >>> >>> Xavier Morel has pointed out that 3.1) can be implemented now as >>> @DecoratorList.__getitem__[index] >>> but this doesn't seem a good reason for forbidding the simpler syntax; >>> after >>> all Python allows the simpler syntax in other contexts. Similarly 3.2) >>> can >>> be written as >>> @DecoratorDictionary.get(key) >>> >>> (As an aside, perhaps a decorator that evaluates to None could be treated >>> >>> at >>> run-time the same as no decorator, i.e. equivalent to the Identity >>> function >>> in the above examples. Currently it naturally raises TypeError: >>> 'NoneType' >>> object is not callable. Just a thought.) >>> >>> Finally, sorry if I have not sent this e-mail to the right place (I >>> wanted >>> to attach it to the 'allow lambdas as decorators' thread but don't yet >>> know >>> how to do this). Also sorry that this partly duplicates a message I sent >>> >>> to >>> python-dev. I am still finding my way round the Python mailing lists. >>> >>> Best wishes >>> Rob Cliffe >> >> >> -- >> Psss, psss, put it down! - http://www.cafepress.com/putitdown >> _______________________________________________ >> Python-ideas mailing list >> Python-ideas at python.org >> http://mail.python.org/mailman/listinfo/python-ideas >> > > > -- Psss, psss, put it down! - http://www.cafepress.com/putitdown From ben+python at benfinney.id.au Mon Sep 7 05:05:59 2009 From: ben+python at benfinney.id.au (Ben Finney) Date: Mon, 07 Sep 2009 13:05:59 +1000 Subject: [Python-ideas] Pyopt - an attempt at a pythonic optparse References: <123377d1-0b16-4deb-b540-ca9cee02e216@s39g2000yqj.googlegroups.com> <8763bw4yvk.fsf@benfinney.id.au> <9d153b7c0909060034s4de0fd4an526cc6aae9157a31@mail.gmail.com> <4AA36F48.5040109@tummy.com> <4AA454AC.60709@canterbury.ac.nz> <4AA461F4.8010804@trueblade.com> Message-ID: <87k50b3148.fsf@benfinney.id.au> Eric Smith writes: > And the name "argparse" is already used by a popular package: > http://code.google.com/p/argparse/ Which raises the question: Would the original poster do the free software community a service by, instead of writing a new library from scratch, try improving to the existing libraries that are already in use? -- \ ?As we enjoy great advantages from the inventions of others, we | `\ should be glad to serve others by any invention of ours; and | _o__) this we should do freely and generously.? ?Benjamin Franklin | Ben Finney From mwm-keyword-python.b4bdba at mired.org Mon Sep 7 09:05:32 2009 From: mwm-keyword-python.b4bdba at mired.org (Mike Meyer) Date: Mon, 7 Sep 2009 03:05:32 -0400 Subject: [Python-ideas] Decorator syntax restriction In-Reply-To: <87k50b8pu2.fsf@uwakimon.sk.tsukuba.ac.jp> References: <4E8F67D5F8CC4562A71B1516F7957D7A@robslaptop> <20090906160707.7883dac5@bhuda.mired.org> <87k50b8pu2.fsf@uwakimon.sk.tsukuba.ac.jp> Message-ID: <20090907030532.56270cee@bhuda.mired.org> This appears to be veering way off topic... Except we're looking at what makes an idea "good" in python terms, vs. what makes them "not good". Basically, trying to define "pythonic". I don't know that that can be done, but there seem to be some broad points that can be agreed on.... > > >> Honestly, why do some people set for themselves the goal of "let's > > >> have as few characters in a source file as possible"? > Mostly the ones who show up on Python lists don't have such a goal. > They just want the ache in their hands and arms to go away, one > unnecessary character at a time. > > > > Paul Graham (generally a very sharp guy) summarizes most of the > > > reasons in http://www.paulgraham.com/power.html. > > > > Thanks, this answers my question why people think this way. Although > > I'm still totally convinced that guys like Paul Graham, or anybody > > else who believes in shorter code, are misguided. > > > > > I provide my attempt at a counterargument in > > > http://www.mired.org/home/mwm/papers/readability.html. I think my choice of "counterargument" here is a bit off. It's not all that argumentative. > > Yep, I more-or-less agree with you. > But Paul Graham does, too, AFAICS. > > ISTM that what Paul G. doesn't get is that Paul P.'s epigram is more > along the lines of Emerson's epigram. To put it in the same style, "A > bogus succinctness is the hobgoblin of L2-cache-deprived minds (and > RSI-hobbled wrists)." To me, the argument on "mired" seems quite > complementary to the argument Graham makes, in that it shows how > Python actually is succinct in the sense that Graham proposes, despite > not minimizing character, token, or line counts. What I was attempting to do was point out that succinctness for the sake of succinctness isn't necessarily a good thing. Python indeed tries to be succinct, but balances that against the need for the results to still be readable. I'd say that the mired.org document supplements what Paul G. had to say rather than complements it, as the mired.org document discusses areas where readability matters, which Paul G. ignored. http://www.mired.org/consulting.html Independent Network/Unix/Perforce consultant, email for more information. O< ascii ribbon campaign - stop html mail - www.asciiribbon.org From stefan_ml at behnel.de Mon Sep 7 09:22:58 2009 From: stefan_ml at behnel.de (Stefan Behnel) Date: Mon, 07 Sep 2009 09:22:58 +0200 Subject: [Python-ideas] data structures should have an .any() method In-Reply-To: <4AA45640.4020605@canterbury.ac.nz> References: <4AA0FEF3.2000607@gmail.com> <4AA34DEF.4070606@canterbury.ac.nz> <200909061823.23915.steve@pearwood.info> <4AA45640.4020605@canterbury.ac.nz> Message-ID: Greg Ewing wrote: > Steven D'Aprano wrote: > >> If you're expecting "exactly one result", then surely it should be an >> error to receive more than one result? Rather than ask for "any" >> result and ignoring any unexpected extra items, I think it would be >> better to have a helper function that verifies you have got exactly >> one result. > > Yes, that could be useful. Maybe call it "one"? I think "one" fits two of the proposed three use cases pretty nicely. The only remaining use case is where you actually have more than one item but only want any one out of them. But I think in that case you can actually roll your own anyway, as there may be other constrains on exactly how 'equal' all the items are. Stefan From ubershmekel at gmail.com Mon Sep 7 10:51:07 2009 From: ubershmekel at gmail.com (Yuvgoog Greenle) Date: Mon, 7 Sep 2009 11:51:07 +0300 Subject: [Python-ideas] Pyopt - an attempt at a pythonic optparse In-Reply-To: <87k50b3148.fsf@benfinney.id.au> References: <123377d1-0b16-4deb-b540-ca9cee02e216@s39g2000yqj.googlegroups.com> <8763bw4yvk.fsf@benfinney.id.au> <9d153b7c0909060034s4de0fd4an526cc6aae9157a31@mail.gmail.com> <4AA36F48.5040109@tummy.com> <4AA454AC.60709@canterbury.ac.nz> <4AA461F4.8010804@trueblade.com> <87k50b3148.fsf@benfinney.id.au> Message-ID: <9d153b7c0909070151i183f9b9bh84f4a3c31aa9f1c5@mail.gmail.com> To me the most awesome goal for this project would be to make it into the standard library. The concept is to allow a minimal-syntax decorator to expose regular functions as opposed to building an entire function just for parsing all the options. I don't mind where I implement this. =D On Mon, Sep 7, 2009 at 6:05 AM, Ben Finney > wrote: > Eric Smith writes: > > > And the name "argparse" is already used by a popular package: > > http://code.google.com/p/argparse/ > > Which raises the question: Would the original poster do the free > software community a service by, instead of writing a new library from > scratch, try improving to the existing libraries that are already in > use? > > -- > \ ?As we enjoy great advantages from the inventions of others, we | > `\ should be glad to serve others by any invention of ours; and | > _o__) this we should do freely and generously.? ?Benjamin Franklin | > Ben Finney > > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > -- Yuv hzk.co.il -------------- next part -------------- An HTML attachment was scrubbed... URL: From ncoghlan at gmail.com Mon Sep 7 13:29:52 2009 From: ncoghlan at gmail.com (Nick Coghlan) Date: Mon, 07 Sep 2009 21:29:52 +1000 Subject: [Python-ideas] Pyopt - an attempt at a pythonic optparse In-Reply-To: <9d153b7c0909061734x615a97fao6e974110744173ff@mail.gmail.com> References: <123377d1-0b16-4deb-b540-ca9cee02e216@s39g2000yqj.googlegroups.com> <9d153b7c0909060034s4de0fd4an526cc6aae9157a31@mail.gmail.com> <4AA36F48.5040109@tummy.com> <200909061821.32540.steve@pearwood.info> <4AA40FFD.1010601@tummy.com> <9d153b7c0909061734x615a97fao6e974110744173ff@mail.gmail.com> Message-ID: <4AA4EEB0.6080608@gmail.com> Yuvgoog Greenle wrote: > Just to clarify why I feel getopt/optparse aren't as pythonic - to me > they feel clunky in that I need alot of ultra-explicit, extra-long lines > to do some very basic things. The python I'm used to allows me to use > open('filename').read() with reasonable default parameters. Now that I > think of it, actually wrapping optparse might have been an easier > implementation route for some of the functionality... Something to think about is the possibility of redesigning your API proposal to function as a convenience wrapper around the existing optparse implementation. Convenience wrappers have a much lower hurdle to clear than complete alternative APIs (since the full power of the original API remains available by dropping back to the lower level). It's still no guarantee of course - there still needs to be a python-ideas (and then python-dev) consensus that the proposed wrappers actually are an improvement. There are definitely some things about the basic concepts behind your API that bother me as it currently stands: 1. One of the major features of optparse is that it encourages a data driven approach to option definition. Going back to a largely procedural approach as in your examples is not a step forward. 2. The use of a single global parser is a fairly questionable feature. 3. Losing the options object makes it more difficult to pass options around to code that may only care about some of the options A potentially valuable addition to optparse might just focus on your "CmdPos" idea and add the ability to add commands to the option parser: import optparse parser = optparse.parser() @parser.add_command def main(options, *args): """Command line help info goes here""" # Main body goes here # Adding more than one command would result in the first argument # being used to select between them by name as with CmdPos if __name__ == "__main__": parser.run_command() The other thing I find somewhat tedious with optparse is having to do lots of procedural checking of option constraints in order to provide helpful error messages. Being able to add independent constraint checks would help a great deal with that: @parser.add_constraint def check_args(parser, options, *args): if len(args) != 2: parser.error("Exactly 2 arguments required") Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- From solipsis at pitrou.net Mon Sep 7 13:48:48 2009 From: solipsis at pitrou.net (Antoine Pitrou) Date: Mon, 7 Sep 2009 11:48:48 +0000 (UTC) Subject: [Python-ideas] Pyopt - an attempt at a pythonic optparse References: <123377d1-0b16-4deb-b540-ca9cee02e216@s39g2000yqj.googlegroups.com> Message-ID: RunThePun writes: > > possibly in the future I'll implement a mixed keyword/positional > arguments behaviour. It is certainly necessary if you want it to be useful. Regards Antoine. From steve at pearwood.info Tue Sep 8 02:15:10 2009 From: steve at pearwood.info (Steven D'Aprano) Date: Tue, 8 Sep 2009 10:15:10 +1000 Subject: [Python-ideas] possible attribute-oriented class In-Reply-To: References: <8fd67d4b0909031546o423cf066wf7096651bbc46ee0@mail.gmail.com> <4AA3FD65.7000403@stoneleaf.us> Message-ID: <200909081015.10767.steve@pearwood.info> On Mon, 7 Sep 2009 09:37:35 am Jan Kaliszewski wrote: > 06-09-2009 o 20:20:21 Ethan Furman wrote: > > ... I love being able to type > > > > current_record.full_name == last_record.full_name > > > > instead of > > > > current_record['full_name'] == last_record['full_name'] > > Me too, and I suppose many people too... > > The latter: > > * makes your code less readable if there is high density of such > expressions; > > * makes typing much more strenuous/irritating -- what is not very > important in case of advanced development (when time of typing is > short in relation to time of thinking/reading/testing) but becomes > quite important in case of scripting (which is still important > area of Python usage). If you have a large number of such expressions, what's wrong with this? FNAME = "full_name" # Define the string in one place only. current_record[FNAME] == last_record[FNAME] # Use it in many places. Not only is it shorter to use, but it's easy to change the key "full_name" to (say) "complete_name" or "volledige_naam" with one edit, and without mistakenly changing some other string which just happens to match the key. (I don't know about others, but when I'm first working on a piece of code, and before I settle on an API or database schema, I often change field names two or three times before I settle in on the final version.) In any case, while I accept that this is sometimes useful, I also think that it's a something which is simple enough to add to your classes when necessary with just a few lines -- all you really need are the __*attr__ methods, everything else is superfluous. If you're doing this a lot, avoid boilerplate with a class decorator. Here's an untested minimalistic version which probably does everything necessary: def add_attr(cls): """Class decorator which adds attribute access to mappings.""" def __getattr__(self, name): return self[name] def __setattr__(self, name, value): self[name] = value def __delattr__(self, name): del self[name] for func in (__getattr__, __setattr__, __delattr__): setattr(cls, func.__name__, func) return cls Fields of an object (attributes) and keys of a mapping are generally for different purposes, and I'm not sure we should encourage people to conflate the two. I think this belongs in the cookbook, not the standard library. -- Steven D'Aprano From zuo at chopin.edu.pl Tue Sep 8 03:02:25 2009 From: zuo at chopin.edu.pl (Jan Kaliszewski) Date: Tue, 08 Sep 2009 03:02:25 +0200 Subject: [Python-ideas] possible attribute-oriented class In-Reply-To: <200909081015.10767.steve@pearwood.info> References: <8fd67d4b0909031546o423cf066wf7096651bbc46ee0@mail.gmail.com> <4AA3FD65.7000403@stoneleaf.us> <200909081015.10767.steve@pearwood.info> Message-ID: 08-09-2009 o 02:15:10 Steven D'Aprano wrote: > On Mon, 7 Sep 2009 09:37:35 am Jan Kaliszewski wrote: >> 06-09-2009 o 20:20:21 Ethan Furman wrote: > >> > ... I love being able to type >> > >> > current_record.full_name == last_record.full_name >> > >> > instead of >> > >> > current_record['full_name'] == last_record['full_name'] >> >> Me too, and I suppose many people too... >> >> The latter: >> >> * makes your code less readable if there is high density of such >> expressions; >> >> * makes typing much more strenuous/irritating -- what is not very >> important in case of advanced development (when time of typing is >> short in relation to time of thinking/reading/testing) but becomes >> quite important in case of scripting (which is still important >> area of Python usage). > > If you have a large number of such expressions, what's wrong with this? a['xyz'] = something['blablabla'] + somethingelse['foobar'] b['ababababa'] += afun(bobo['dodo']['kookoo'] * pofopofo['gh'][0]['a']) cupu['abc'] = (kukumunu['bo'], kukumunu['kuu'].mbmbmb['lalala']) a.xyz = something.blablabla + somethingelse.foobar b.ababababa += afun(bobo.dodo.kookoo * pofopofo.gh[0].a) cupu.abc = (kukumunu.bo, kukumunu.kuu.mbmbmb.lalala) For me the latter is definitely easier to read and understand. > FNAME = "full_name" # Define the string in one place only. > current_record[FNAME] == last_record[FNAME] # Use it in many places. > > Not only is it shorter to use, but it's easy to change the > key "full_name" to (say) "complete_name" or "volledige_naam" with one > edit, and without mistakenly changing some other string which just > happens to match the key. You are right, but it's a bit different story... I don't say that attr access is always better than key access -- but only that sometimes it is. > (I don't know about others, but when I'm > first working on a piece of code, and before I settle on an API or > database schema, I often change field names two or three times before I > settle in on the final version.) Me too! :) > In any case, while I accept that this is sometimes useful, I also think > that it's a something which is simple enough to add to your classes > when necessary with just a few lines -- all you really need are the > __*attr__ methods, everything else is superfluous. If you're doing this > a lot, avoid boilerplate with a class decorator. Here's an untested > minimalistic version which probably does everything necessary: > > def add_attr(cls): > """Class decorator which adds attribute access to mappings.""" > def __getattr__(self, name): > return self[name] > def __setattr__(self, name, value): > self[name] = value > def __delattr__(self, name): > del self[name] > for func in (__getattr__, __setattr__, __delattr__): > setattr(cls, func.__name__, func) > return cls I'd add to it also dict-like iteration (__iter__(), _keys(), _values(), _items()) and __str__ adjusted to nice nested representation (like in some posts in this thread, e.g. my proposition). > Fields of an object (attributes) and keys of a mapping are generally for > different purposes, and I'm not sure we should encourage people to > conflate the two. I think this belongs in the cookbook, not the > standard library. I think it depends how often people need to implement such boiler-plate code for themselves. Now I see that this thread is not very popular, so indeed maybe you are right... Though it'd be nice to have OOTB such a factory in `collections` module... Cheers, *j -- Jan Kaliszewski (zuo) From stephen at xemacs.org Tue Sep 8 05:06:51 2009 From: stephen at xemacs.org (Stephen J. Turnbull) Date: Tue, 08 Sep 2009 12:06:51 +0900 Subject: [Python-ideas] possible attribute-oriented class In-Reply-To: <200909081015.10767.steve@pearwood.info> References: <8fd67d4b0909031546o423cf066wf7096651bbc46ee0@mail.gmail.com> <4AA3FD65.7000403@stoneleaf.us> <200909081015.10767.steve@pearwood.info> Message-ID: <877hwannhw.fsf@uwakimon.sk.tsukuba.ac.jp> Steven D'Aprano writes: > Fields of an object (attributes) and keys of a mapping are generally for > different purposes, and I'm not sure we should encourage people to > conflate the two. +1 on discouraging conflation. From jimjjewett at gmail.com Tue Sep 8 16:53:57 2009 From: jimjjewett at gmail.com (Jim Jewett) Date: Tue, 8 Sep 2009 10:53:57 -0400 Subject: [Python-ideas] possible attribute-oriented class In-Reply-To: References: <8fd67d4b0909031546o423cf066wf7096651bbc46ee0@mail.gmail.com> <4AA3FD65.7000403@stoneleaf.us> <200909081015.10767.steve@pearwood.info> Message-ID: On Mon, Sep 7, 2009 at 9:02 PM, Jan Kaliszewski wrote: > 08-09-2009 o 02:15:10 Steven D'Aprano wrote: >> ... what's wrong with this? > a['xyz'] = something['blablabla'] + somethingelse['foobar'] > b['ababababa'] += afun(bobo['dodo']['kookoo'] * pofopofo['gh'][0]['a']) > cupu['abc'] = (kukumunu['bo'], kukumunu['kuu'].mbmbmb['lalala']) > a.xyz = something.blablabla + somethingelse.foobar > b.ababababa += afun(bobo.dodo.kookoo * pofopofo.gh[0].a) > cupu.abc = (kukumunu.bo, kukumunu.kuu.mbmbmb.lalala) > For me the latter is definitely easier to read and understand. I would describe it as "less difficult" rather than "easier". My biggest problem is that at that stage, I'm still typing raw, and inclined to make typos. The difference between fname and fnam won't be caught either way, but field access at least keeps me from forgetting quotes, or forgetting them at one end. >> ... I often change field names two or three times >> before I settle in on the final version. And often because of an ambiguity with another field that I hadn't originally thought to name. Neither solution fixes this, but attribute access is slightly easier to change. >> [recipe to simplify attr-access] > I think it depends how often people need to > implement such boiler-plate code for themselves. Attribute access is clearly better -- except for one thing. While I'm doing this, I'm still in exploratory mode, and I *will* need to clean up the API if I ever want better than quick-and-dirty. If the quick-and-dirty is already using attribute access, that makes the transition a bit trickier. If the quick-and-dirty is using dict access, at least I have a clear marker. -jJ From lie.1296 at gmail.com Tue Sep 8 16:56:26 2009 From: lie.1296 at gmail.com (Lie Ryan) Date: Wed, 09 Sep 2009 00:56:26 +1000 Subject: [Python-ideas] data structures should have an .any() method In-Reply-To: <4AA34C7E.7030803@canterbury.ac.nz> References: <4AA34C7E.7030803@canterbury.ac.nz> Message-ID: Greg Ewing wrote: > Stefan Behnel wrote: > >> It would therefore be nice to have a common ".any()" method on data >> structures that would just read an arbitrary item from a container. > > Rather than add a method to every container implementation, > it would be easier to provide a function: > > def first(obj): > return iter(ob).next() > > possibly with some embellishments to handle StopIteration, > allow for a default value, etc. > Which would imply a request for last(). From Scott.Daniels at Acm.Org Tue Sep 8 23:05:05 2009 From: Scott.Daniels at Acm.Org (Scott David Daniels) Date: Tue, 08 Sep 2009 14:05:05 -0700 Subject: [Python-ideas] data structures should have an .any() method In-Reply-To: <200909061823.23915.steve@pearwood.info> References: <4AA0FEF3.2000607@gmail.com> <4AA34DEF.4070606@canterbury.ac.nz> <200909061823.23915.steve@pearwood.info> Message-ID: Steven D'Aprano wrote: > On Sun, 6 Sep 2009 03:51:43 pm Greg Ewing wrote: >> Nick Coghlan wrote: >>> That said, I'm -0 on the idea overall. If someone actually needs >>> it, it isn't particularly hard for them to write their own getany() >>> function. >> There's a situation where the need to do this kind of >> thing actually arises fairly frequently -- retrieving >> things from a relational database. Often you're >> expecting exactly one result from a query, but the >> API always gives you a sequence, which you then have >> to get the first item from. Doing that over and >> over again gets rather tedious. > > If you're expecting "exactly one result", then surely it should be an > error to receive more than one result? Rather than ask for "any" result > and ignoring any unexpected extra items, I think it would be better to > have a helper function that verifies you have got exactly one result. Well, some queries return results without duplicate elimination, even though they are defined to return sets. If you really want to limit things in databases queries, the "LIMIT 1" clause is your friend, as the query optimizer knows it can stop as soon as its found something. Of course I don't know which query optimizers around now _use_ that knowledge to pick a query plan, but that leaves the info there if the next rev becomes limit-capable. Often when I just want to pick a single value from a column I use MIN or MAX (and fairly often when I need two distinct values I use both MIN and MAX). One trick to seeing a column is exactly a singleton is: SELECT MIN(something) FROM ... WHERE MIN(something) = MAX(something) --Scott David Daniels Scott.Daniels at Acm.Org From Scott.Daniels at Acm.Org Tue Sep 8 23:46:09 2009 From: Scott.Daniels at Acm.Org (Scott David Daniels) Date: Tue, 08 Sep 2009 14:46:09 -0700 Subject: [Python-ideas] Decorator syntax restriction In-Reply-To: <20090907030532.56270cee@bhuda.mired.org> References: <4E8F67D5F8CC4562A71B1516F7957D7A@robslaptop> <20090906160707.7883dac5@bhuda.mired.org> <87k50b8pu2.fsf@uwakimon.sk.tsukuba.ac.jp> <20090907030532.56270cee@bhuda.mired.org> Message-ID: Mike Meyer wrote: ... > What I was attempting to do was point out that succinctness for the > sake of succinctness isn't necessarily a good thing. Python indeed > tries to be succinct, but balances that against the need for the > results to still be readable. I'm quite interested in this as well. I think that a Pythonic succinctness is very DRY (don't repeat yourself), rather than short. What I want when I read code is to be reading ideas, not typing or pasting. In Python, when I find I'm doing something several times I look for ways to combine tables and code, so what varies shows up clearly, and what is in common shows in the loop. APL was one language that battered me over the head with the proof that shorter was not necessarily clearer. --Scott David Daniels Scott.Daniels at Acm.Org From anfedorov at gmail.com Wed Sep 9 03:07:53 2009 From: anfedorov at gmail.com (Andrey Fedorov) Date: Tue, 8 Sep 2009 21:07:53 -0400 Subject: [Python-ideas] Function to apply superset of arguments to a function Message-ID: <7659cab30909081807v6f59aed3p5e2b30a3a4d343d0@mail.gmail.com> Hi all, I've written a function [1] called apply_some which takes a set of keywords arguments, filters only those a function is expecting, and calls the function with only those arguments. This is meant to suppress TypeErrors - a way to abstract the logic which checks what arguments a passed-in function accepts. For example: def foo(x=1, y=2): return (x,y) apply_some(foo, y=0, z="hi") // calls foo(y=0) -> (1,0) I'd like to expand this to fill undefined arguments with None, but before I do, does anyone know of any packages/libraries which either do something similar or would make this code cleaner? Cheers, Andrey 1. http://gist.github.com/183375 -------------- next part -------------- An HTML attachment was scrubbed... URL: From cmjohnson.mailinglist at gmail.com Wed Sep 9 12:52:52 2009 From: cmjohnson.mailinglist at gmail.com (Carl Johnson) Date: Wed, 9 Sep 2009 00:52:52 -1000 Subject: [Python-ideas] Decorator syntax restriction In-Reply-To: References: <4E8F67D5F8CC4562A71B1516F7957D7A@robslaptop> <20090906160707.7883dac5@bhuda.mired.org> <87k50b8pu2.fsf@uwakimon.sk.tsukuba.ac.jp> <20090907030532.56270cee@bhuda.mired.org> Message-ID: <3bdda690909090352v4668bd47g1bb7cde924c0d986@mail.gmail.com> On 2009/9/8 Scott David Daniels wrote: > APL was one language that battered me over the head with the > proof that shorter was not necessarily clearer. Ah yes, more proof of the adage: "if you can't be a good example, be a terrible warning." :-D ? Carl Johnson From steve at pearwood.info Wed Sep 9 13:26:36 2009 From: steve at pearwood.info (Steven D'Aprano) Date: Wed, 9 Sep 2009 21:26:36 +1000 Subject: [Python-ideas] Decorator syntax restriction In-Reply-To: References: <4E8F67D5F8CC4562A71B1516F7957D7A@robslaptop> <20090907030532.56270cee@bhuda.mired.org> Message-ID: <200909092126.37270.steve@pearwood.info> On Wed, 9 Sep 2009 07:46:09 am Scott David Daniels wrote: > APL was one language that battered me over the head with the > proof that shorter was not necessarily clearer. Conway's Game of Life in one line: http://www.catpad.net/michael/apl/ -- Steven D'Aprano From ncoghlan at gmail.com Wed Sep 9 14:36:42 2009 From: ncoghlan at gmail.com (Nick Coghlan) Date: Wed, 09 Sep 2009 22:36:42 +1000 Subject: [Python-ideas] data structures should have an .any() method In-Reply-To: References: <4AA34C7E.7030803@canterbury.ac.nz> Message-ID: <4AA7A15A.8090202@gmail.com> Lie Ryan wrote: > Greg Ewing wrote: >> Stefan Behnel wrote: >> >>> It would therefore be nice to have a common ".any()" method on data >>> structures that would just read an arbitrary item from a container. >> >> Rather than add a method to every container implementation, >> it would be easier to provide a function: >> >> def first(obj): >> return iter(ob).next() >> >> possibly with some embellishments to handle StopIteration, >> allow for a default value, etc. >> > > Which would imply a request for last(). Not really - every iterator in Python is guaranteed to either have a first value or throw an exception when you try to retrieve it via next(). There's no such guarantee that every iterator will terminate and hence have a "last" value (cf. itertools.count). Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- From lie.1296 at gmail.com Wed Sep 9 16:50:18 2009 From: lie.1296 at gmail.com (Lie Ryan) Date: Thu, 10 Sep 2009 00:50:18 +1000 Subject: [Python-ideas] Decorator syntax restriction In-Reply-To: <200909092126.37270.steve@pearwood.info> References: <4E8F67D5F8CC4562A71B1516F7957D7A@robslaptop> <20090907030532.56270cee@bhuda.mired.org> <200909092126.37270.steve@pearwood.info> Message-ID: Steven D'Aprano wrote: > On Wed, 9 Sep 2009 07:46:09 am Scott David Daniels wrote: >> APL was one language that battered me over the head with the >> proof that shorter was not necessarily clearer. > > Conway's Game of Life in one line: > > http://www.catpad.net/michael/apl/ Here is a hypothesis: "The length of a code is inversely proportional to the length of documentation required to explain the code" The APL Conway's Game of Life requires a full page of documentation to explain how it works. Most implementations of the same game have much less documentation and much longer code. Prove or disprove the hypothesis. If proven true, the hypothesis may lead to: The net worth of having a short, succinct code may be outweighed by the amount of documentation needed to explain the code. From george.sakkis at gmail.com Wed Sep 9 17:23:57 2009 From: george.sakkis at gmail.com (George Sakkis) Date: Wed, 9 Sep 2009 11:23:57 -0400 Subject: [Python-ideas] Decorator syntax restriction In-Reply-To: References: <4E8F67D5F8CC4562A71B1516F7957D7A@robslaptop> <20090907030532.56270cee@bhuda.mired.org> <200909092126.37270.steve@pearwood.info> Message-ID: <91ad5bf80909090823t52d50a03la941bb22b554e571@mail.gmail.com> On Wed, Sep 9, 2009 at 10:50 AM, Lie Ryan wrote: > Steven D'Aprano wrote: >> >> On Wed, 9 Sep 2009 07:46:09 am Scott David Daniels wrote: >>> >>> APL was one language that battered me over the head with the >>> proof that shorter was not necessarily clearer. >> >> Conway's Game of Life in one line: >> >> http://www.catpad.net/michael/apl/ > > > Here is a hypothesis: > "The length of a code is inversely proportional to the length of > documentation required to explain the code" > > The APL Conway's Game of Life requires a full page of documentation to > explain how it works. Most implementations of the same game have much less > documentation and much longer code. > > Prove or disprove the hypothesis. > > If proven true, the hypothesis may lead to: > The net worth of having a short, succinct code may be outweighed by the > amount of documentation needed to explain the code. Any such proof or even discussion should take into account what the primitives (atoms and allowed operations) are. If not, here is a solution that is short both in code and documentation: game_of_life().solve() ;-) George From g.brandl at gmx.net Wed Sep 9 18:03:44 2009 From: g.brandl at gmx.net (Georg Brandl) Date: Wed, 09 Sep 2009 18:03:44 +0200 Subject: [Python-ideas] data structures should have an .any() method In-Reply-To: References: <4AA0FEF3.2000607@gmail.com> <4AA34DEF.4070606@canterbury.ac.nz> <200909061823.23915.steve@pearwood.info> Message-ID: Antoine Pitrou schrieb: > Steven D'Aprano writes: >> >> If you're expecting "exactly one result", then surely it should be an >> error to receive more than one result? Rather than ask for "any" result >> and ignoring any unexpected extra items, I think it would be better to >> have a helper function that verifies you have got exactly one result. > > Why do you need a helper function? > Simply write: > > x, = db.query("SELECT blah...") > > and you'll get a ValueError if there isn't exactly one item in the sequence. Amen! Georg -- Thus spake the Lord: Thou shalt indent with four spaces. No more, no less. Four shall be the number of spaces thou shalt indent, and the number of thy indenting shall be four. Eight shalt thou not indent, nor either indent thou two, excepting that thou then proceed to four. Tabs are right out. From brett at python.org Wed Sep 9 18:19:50 2009 From: brett at python.org (Brett Cannon) Date: Wed, 9 Sep 2009 09:19:50 -0700 Subject: [Python-ideas] Function to apply superset of arguments to a function In-Reply-To: <7659cab30909081807v6f59aed3p5e2b30a3a4d343d0@mail.gmail.com> References: <7659cab30909081807v6f59aed3p5e2b30a3a4d343d0@mail.gmail.com> Message-ID: This is the wrong list to ask for help with something like this. comp.lang.python/python-list is the proper place to try to get help. On Tue, Sep 8, 2009 at 18:07, Andrey Fedorov wrote: > Hi all, > > I've written a function [1] called apply_some which takes a set of keywords > arguments, filters only those a function is expecting, and calls the > function with only those arguments. This is meant to suppress TypeErrors - a > way to abstract the logic which checks what arguments a passed-in function > accepts. > > For example: > > def foo(x=1, y=2): > ??? return (x,y) > > apply_some(foo, y=0, z="hi") // calls foo(y=0) > -> (1,0) > > I'd like to expand this to fill undefined arguments with None, but before I > do, does anyone know of any packages/libraries which either do something > similar or would make this code cleaner? > > Cheers, > Andrey > > 1. http://gist.github.com/183375 > > > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > > From gerald.britton at gmail.com Wed Sep 9 13:45:00 2009 From: gerald.britton at gmail.com (Gerald Britton) Date: Wed, 9 Sep 2009 07:45:00 -0400 Subject: [Python-ideas] Function to apply superset of arguments to a function In-Reply-To: <7659cab30909081807v6f59aed3p5e2b30a3a4d343d0@mail.gmail.com> References: <7659cab30909081807v6f59aed3p5e2b30a3a4d343d0@mail.gmail.com> Message-ID: <5d1a32000909090445r37c40a24v949823d2bd2e98ac@mail.gmail.com> functools.partial On Tue, Sep 8, 2009 at 9:07 PM, Andrey Fedorov wrote: > Hi all, > > I've written a function [1] called apply_some which takes a set of keywords > arguments, filters only those a function is expecting, and calls the > function with only those arguments. This is meant to suppress TypeErrors - a > way to abstract the logic which checks what arguments a passed-in function > accepts. > > For example: > > def foo(x=1, y=2): > ??? return (x,y) > > apply_some(foo, y=0, z="hi") // calls foo(y=0) > -> (1,0) > > I'd like to expand this to fill undefined arguments with None, but before I > do, does anyone know of any packages/libraries which either do something > similar or would make this code cleaner? > > Cheers, > Andrey > > 1. http://gist.github.com/183375 > > > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > > -- Gerald Britton From greg.ewing at canterbury.ac.nz Thu Sep 10 02:52:44 2009 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Thu, 10 Sep 2009 12:52:44 +1200 Subject: [Python-ideas] Decorator syntax restriction In-Reply-To: <91ad5bf80909090823t52d50a03la941bb22b554e571@mail.gmail.com> References: <4E8F67D5F8CC4562A71B1516F7957D7A@robslaptop> <20090907030532.56270cee@bhuda.mired.org> <200909092126.37270.steve@pearwood.info> <91ad5bf80909090823t52d50a03la941bb22b554e571@mail.gmail.com> Message-ID: <4AA84DDC.6070102@canterbury.ac.nz> George Sakkis wrote: > Any such proof or even discussion should take into account what the > primitives (atoms and allowed operations) are. Probably you should include the size of the documentation of the primitives used in your programming language manual, any other well-known literature they implicitly refer to, etc. > If not, here is a > solution that is short both in code and documentation: > > game_of_life().solve() But then you need to go and find a paper describing the game of life and the algorithm being used to solve it and include its length! -- Greg From ubershmekel at gmail.com Thu Sep 10 03:40:58 2009 From: ubershmekel at gmail.com (Yuvgoog Greenle) Date: Thu, 10 Sep 2009 04:40:58 +0300 Subject: [Python-ideas] Decorator syntax restriction In-Reply-To: <4AA84DDC.6070102@canterbury.ac.nz> References: <4E8F67D5F8CC4562A71B1516F7957D7A@robslaptop> <20090907030532.56270cee@bhuda.mired.org> <200909092126.37270.steve@pearwood.info> <91ad5bf80909090823t52d50a03la941bb22b554e571@mail.gmail.com> <4AA84DDC.6070102@canterbury.ac.nz> Message-ID: <9d153b7c0909091840k2133ce11l7b258fbfaae0759a@mail.gmail.com> You can drop the game of life manual, it would be needed in any implementation so it doesn't help in describing the function documentation_length(code_length)... Aside from a constant value. d - documentatin_length the amount of words or symbols. c - code_length in words or symbols. d = K*c + A Lets approximate: brainfuck K = 30 APL K = 20 perl K = 2 python K = 0.5 K is the obfuscation factor (anti-readability). The constant A is needed because if you didn't write any code you're still gonna have some explaining to do. For a given algorithm with a complexity of L logical nodes, here's the inverse relation Ryan was talking about.: L = d * c * Rd * Rc As you can see for a given algorithm with a constant L, as d grows, c shrinks. R is the richness of the language, if we have more words in our language, less words are needed to describe things. Rd is the richness of the documentation language (english) and Rc is the richness of the code-language. I think there might be some unit problems so I'm gonna let someone else clean up and complete the equations. Good night. On Thu, Sep 10, 2009 at 3:52 AM, Greg Ewing wrote: > George Sakkis wrote: > > Any such proof or even discussion should take into account what the >> primitives (atoms and allowed operations) are. >> > > Probably you should include the size of the documentation > of the primitives used in your programming language > manual, any other well-known literature they implicitly > refer to, etc. > > If not, here is a >> solution that is short both in code and documentation: >> >> game_of_life().solve() >> > > But then you need to go and find a paper describing > the game of life and the algorithm being used to > solve it and include its length! > > -- > Greg > > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > -- Yuv hzk.co.il -------------- next part -------------- An HTML attachment was scrubbed... URL: From anfedorov at gmail.com Thu Sep 10 21:50:18 2009 From: anfedorov at gmail.com (Andrey Fedorov) Date: Thu, 10 Sep 2009 15:50:18 -0400 Subject: [Python-ideas] Options parsing in the Tornado Web Server Message-ID: <7659cab30909101250h46ee269ch28c5b21e9fa9c9d5@mail.gmail.com> I remember seeing a bit of discussion about improving Python's default options parsing... Tornado's seems very impressive [1]: > from tornado.options import define, options > > define("port", default=8888, help="run on the given port", type=int) > > print options.port > > What does everyone think? - Andrey 1. http://github.com/facebook/tornado/blob/9a8bd2fb6fd6279be16d6f0a2e57e49fe1b98f8f/demos/chat/chatdemo.py -------------- next part -------------- An HTML attachment was scrubbed... URL: From guido at python.org Thu Sep 10 22:44:09 2009 From: guido at python.org (Guido van Rossum) Date: Thu, 10 Sep 2009 13:44:09 -0700 Subject: [Python-ideas] Options parsing in the Tornado Web Server In-Reply-To: <7659cab30909101250h46ee269ch28c5b21e9fa9c9d5@mail.gmail.com> References: <7659cab30909101250h46ee269ch28c5b21e9fa9c9d5@mail.gmail.com> Message-ID: That's not particularly more expressive than what optparse gives you today, except for the use of a function with a side effect on a magic global (how the heck did options.port suddenly get a value?!) and the potential for confusing tornado.options with tornado.options.options. On Thu, Sep 10, 2009 at 12:50 PM, Andrey Fedorov wrote: > I remember seeing a bit of discussion about improving Python's default > options parsing... Tornado's seems very impressive [1]: >> >> from tornado.options import define, options >> >> define("port", default=8888, help="run on the given port", type=int) >> >> print options.port > > What does everyone think? > > - Andrey > > 1. > http://github.com/facebook/tornado/blob/9a8bd2fb6fd6279be16d6f0a2e57e49fe1b98f8f/demos/chat/chatdemo.py -- --Guido van Rossum (home page: http://www.python.org/~guido/) From anfedorov at gmail.com Thu Sep 10 22:52:24 2009 From: anfedorov at gmail.com (Andrey Fedorov) Date: Thu, 10 Sep 2009 16:52:24 -0400 Subject: [Python-ideas] Options parsing in the Tornado Web Server In-Reply-To: References: <7659cab30909101250h46ee269ch28c5b21e9fa9c9d5@mail.gmail.com> Message-ID: <7659cab30909101352u14554ca1i1a64dd07ccb8c29b@mail.gmail.com> Well, options.*port* corresponds to define("*port*", ...). Is it considered unPythonic to equate variable names and strings? I didn't think it was, since scopes are dictionaries... Good point about options.options. - Andrey On Thu, Sep 10, 2009 at 4:44 PM, Guido van Rossum wrote: > That's not particularly more expressive than what optparse gives you > today, except for the use of a function with a side effect on a magic > global (how the heck did options.port suddenly get a value?!) and the > potential for confusing tornado.options with tornado.options.options. > > On Thu, Sep 10, 2009 at 12:50 PM, Andrey Fedorov > wrote: > > I remember seeing a bit of discussion about improving Python's default > > options parsing... Tornado's seems very impressive [1]: > >> > >> from tornado.options import define, options > >> > >> define("port", default=8888, help="run on the given port", type=int) > >> > >> print options.port > > > > What does everyone think? > > > > - Andrey > > > > 1. > > > http://github.com/facebook/tornado/blob/9a8bd2fb6fd6279be16d6f0a2e57e49fe1b98f8f/demos/chat/chatdemo.py > > -- > --Guido van Rossum (home page: http://www.python.org/~guido/ > ) > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ncoghlan at gmail.com Thu Sep 10 23:11:54 2009 From: ncoghlan at gmail.com (Nick Coghlan) Date: Fri, 11 Sep 2009 07:11:54 +1000 Subject: [Python-ideas] Options parsing in the Tornado Web Server In-Reply-To: <7659cab30909101352u14554ca1i1a64dd07ccb8c29b@mail.gmail.com> References: <7659cab30909101250h46ee269ch28c5b21e9fa9c9d5@mail.gmail.com> <7659cab30909101352u14554ca1i1a64dd07ccb8c29b@mail.gmail.com> Message-ID: <4AA96B9A.6020505@gmail.com> Andrey Fedorov wrote: > Well, options._port_ corresponds to define("_port_", ...). Is it > considered unPythonic to equate variable names and strings? I didn't > think it was, since scopes are dictionaries... It's the fact that there is an options global in the module at all which can be surprising. Application global objects like that aren't necessarily bad, but they aren't necessarily good either. optparse uses independent parsers by default, leaving applications free to put the options information wherever they want (e.g. merging it with settings coming from system and per-user configuration files and storing the results in a myapp.settings module) An approach like the tornado example that provides its own global parser better also have its own mechanism for producing additional independent parsers if it ever hopes to match the features of optparse. Aside from the presence of that global parser, I'm not seeing a lot difference between options.define and parser.add_option though. Anyone wanting to replace/compete with optparse (particularly with goals for latter standard library inclusion) would do well to better articulate what they don't like about optparse though. I acknowledge that using optparse the first couple of times can have something of a learning curve, but that's because it is rather powerful. And if there are features that appear to be missing, then why not suggest those as optparse enhancements rather than trying to replace the module wholesale? Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- From brett at python.org Thu Sep 10 23:16:17 2009 From: brett at python.org (Brett Cannon) Date: Thu, 10 Sep 2009 14:16:17 -0700 Subject: [Python-ideas] Options parsing in the Tornado Web Server In-Reply-To: References: <7659cab30909101250h46ee269ch28c5b21e9fa9c9d5@mail.gmail.com> Message-ID: Just so people know, we started discussing over on the stdlib-sig the idea of trying to convince Steven Bethard to contribute argparse to the standard library as a way to improve the argument parsing situation in the standard library. On Thu, Sep 10, 2009 at 13:44, Guido van Rossum wrote: > That's not particularly more expressive than what optparse gives you > today, except for the use of a function with a side effect on a magic > global (how the heck did options.port suddenly get a value?!) and the > potential for confusing tornado.options with tornado.options.options. > > On Thu, Sep 10, 2009 at 12:50 PM, Andrey Fedorov wrote: >> I remember seeing a bit of discussion about improving Python's default >> options parsing... Tornado's seems very impressive [1]: >>> >>> from tornado.options import define, options >>> >>> define("port", default=8888, help="run on the given port", type=int) >>> >>> print options.port >> >> What does everyone think? >> >> - Andrey >> >> 1. >> http://github.com/facebook/tornado/blob/9a8bd2fb6fd6279be16d6f0a2e57e49fe1b98f8f/demos/chat/chatdemo.py > > -- > --Guido van Rossum (home page: http://www.python.org/~guido/) > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > From btaylor at gmail.com Thu Sep 10 23:29:38 2009 From: btaylor at gmail.com (Bret Taylor) Date: Thu, 10 Sep 2009 14:29:38 -0700 (PDT) Subject: [Python-ideas] Options parsing in the Tornado Web Server In-Reply-To: References: <7659cab30909101250h46ee269ch28c5b21e9fa9c9d5@mail.gmail.com> Message-ID: <69f2d5df-bb82-4dad-a43a-0f886df4b965@m7g2000prd.googlegroups.com> I am Bret from FriendFeed, author of a lot of Tornado. We modeled it after the Google option parsing (http://code.google.com/ p/google-gflags/). The main distinction is that every module declares its own options, so your main() function doesn't need to be aware of all of the options used by the transitive closure of modules in your server. As anyone who has worked on large systems knows, passing around options and defaults becomes a big pain after your number of modules increases above 100 or so. That said, I think our options parsing works well for individual projects, but you would get lots of naming conflicts if it were adopted in any official capacity by Python given the options all have global scope, so I agree with Guido that optparse is probably better as an official module. It certainly was much more useful to us than optparse from an operational standpoint, though. Bret On Sep 10, 1:44?pm, Guido van Rossum wrote: > That's not particularly more expressive than what optparse gives you > today, except for the use of a function with a side effect on a magic > global (how the heck did options.port suddenly get a value?!) and the > potential for confusing tornado.options with tornado.options.options. > > > > On Thu, Sep 10, 2009 at 12:50 PM, Andrey Fedorov wrote: > > I remember seeing a bit of discussion about improving Python's default > > options parsing... Tornado's seems very impressive [1]: > > >> from tornado.options import define, options > > >> define("port", default=8888, help="run on the given port", type=int) > > >> print options.port > > > What does everyone think? > > > - Andrey > > > 1. > >http://github.com/facebook/tornado/blob/9a8bd2fb6fd6279be16d6f0a2e57e... > > -- > --Guido van Rossum (home page:http://www.python.org/~guido/) > _______________________________________________ > Python-ideas mailing list > Python-id... at python.orghttp://mail.python.org/mailman/listinfo/python-ideas From guido at python.org Thu Sep 10 23:42:21 2009 From: guido at python.org (Guido van Rossum) Date: Thu, 10 Sep 2009 14:42:21 -0700 Subject: [Python-ideas] Options parsing in the Tornado Web Server In-Reply-To: <69f2d5df-bb82-4dad-a43a-0f886df4b965@m7g2000prd.googlegroups.com> References: <7659cab30909101250h46ee269ch28c5b21e9fa9c9d5@mail.gmail.com> <69f2d5df-bb82-4dad-a43a-0f886df4b965@m7g2000prd.googlegroups.com> Message-ID: Hey Bret! I still see your name in our code base a lot... :-) The Google flags code has a fundamentally different use case than the typical argument parsing -- thanks for pointing this out. In fact, the two use cases are so different that there is barely any overlap. (What Google does with flags is more typically done with environment variables, although I totally understand that that didn't work for you.) Maybe the discussion about flags parsing (which is apparently happening on a list I'm not on :-) is helped by clearly distinguishing the two styles. --Guido On Thu, Sep 10, 2009 at 2:29 PM, Bret Taylor wrote: > I am Bret from FriendFeed, author of a lot of Tornado. > > We modeled it after the Google option parsing (http://code.google.com/ > p/google-gflags/). The main distinction is that every module declares > its own options, so your main() function doesn't need to be aware of > all of the options used by the transitive > closure of modules in your server. As anyone who has worked on large > systems knows, passing around options and defaults becomes a big pain > after your number of modules increases above 100 or so. > > That said, I think our options parsing works well for individual > projects, but you would get lots of naming conflicts if it were > adopted in any official capacity by Python given the options all have > global scope, so I agree with Guido that optparse is probably better > as an official module. It certainly was much more useful to us than > optparse from an operational standpoint, though. > > Bret > > On Sep 10, 1:44?pm, Guido van Rossum wrote: >> That's not particularly more expressive than what optparse gives you >> today, except for the use of a function with a side effect on a magic >> global (how the heck did options.port suddenly get a value?!) and the >> potential for confusing tornado.options with tornado.options.options. >> >> >> >> On Thu, Sep 10, 2009 at 12:50 PM, Andrey Fedorov wrote: >> > I remember seeing a bit of discussion about improving Python's default >> > options parsing... Tornado's seems very impressive [1]: >> >> >> from tornado.options import define, options >> >> >> define("port", default=8888, help="run on the given port", type=int) >> >> >> print options.port >> >> > What does everyone think? >> >> > - Andrey >> >> > 1. >> >http://github.com/facebook/tornado/blob/9a8bd2fb6fd6279be16d6f0a2e57e... >> >> -- >> --Guido van Rossum (home page:http://www.python.org/~guido/) >> _______________________________________________ >> Python-ideas mailing list >> Python-id... at python.orghttp://mail.python.org/mailman/listinfo/python-ideas > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > -- --Guido van Rossum (home page: http://www.python.org/~guido/) From collinw at gmail.com Thu Sep 10 23:58:35 2009 From: collinw at gmail.com (Collin Winter) Date: Thu, 10 Sep 2009 18:58:35 -0300 Subject: [Python-ideas] Options parsing in the Tornado Web Server In-Reply-To: References: <7659cab30909101250h46ee269ch28c5b21e9fa9c9d5@mail.gmail.com> <69f2d5df-bb82-4dad-a43a-0f886df4b965@m7g2000prd.googlegroups.com> Message-ID: <43aa6ff70909101458k4ece6909hf2cf411de8926f88@mail.gmail.com> On Thu, Sep 10, 2009 at 6:42 PM, Guido van Rossum wrote: > Hey Bret! > > I still see your name in our code base a lot... :-) > > The Google flags code has a fundamentally different use case than the > typical argument parsing -- thanks for pointing this out. In fact, the > two use cases are so different that there is barely any overlap. (What > Google does with flags is more typically done with environment > variables, although I totally understand that that didn't work for > you.) Other useful notes about the Google flag systems (most of which sounds like it applies to Tornado's system as well), for those who haven't used it: - Google's flag system is used primarily for configuring binaries, rather than command-line option parsing; it just happens to take the form of command-line options. - Accordingly, having a single global options dict is useful for configuring all the different libraries that get linked into a single binary. Most of the flags a given binary exposes come from these libraries, or libraries used by other libraries, etc. - When defining flags in libraries, you have to manually namespace them (mylibrary_rpc_deadline_secs, yourlibrary_rpc_deadline_secs, etc) to avoid collisions. - The flags system is designed to operate across languages: a Python application can define some flags, and that application may use a C++ extension module which defines more flags, and both are configured in the same place. Based on that, I'm not sure that a gflags-like system would be a good replacement for command-line parsing. In fact, gflags sometimes requires unexpected/unusual command-line ordering if you try to use it like a general option parser. Other people who've used gflags may have a different perspective. Collin Winter From eric at trueblade.com Fri Sep 11 10:59:26 2009 From: eric at trueblade.com (Eric Smith) Date: Fri, 11 Sep 2009 04:59:26 -0400 Subject: [Python-ideas] Options parsing in the Tornado Web Server In-Reply-To: <4AA96B9A.6020505@gmail.com> References: <7659cab30909101250h46ee269ch28c5b21e9fa9c9d5@mail.gmail.com> <7659cab30909101352u14554ca1i1a64dd07ccb8c29b@mail.gmail.com> <4AA96B9A.6020505@gmail.com> Message-ID: <4AAA116E.9080504@trueblade.com> Nick Coghlan wrote: > Anyone wanting to replace/compete with optparse (particularly with goals > for latter standard library inclusion) would do well to better > articulate what they don't like about optparse though. I acknowledge > that using optparse the first couple of times can have something of a > learning curve, but that's because it is rather powerful. Note that argparse does provide that justification: http://argparse.googlecode.com/svn/trunk/doc/argparse-vs-optparse.html In particular: http://argparse.googlecode.com/svn/trunk/doc/argparse-vs-optparse.html#upgrading-optparse-code has a short blurb on why Steven didn't just extend optparse. He tried to, but decided it was easier if he didn't. Eric. From eric at trueblade.com Fri Sep 11 11:10:22 2009 From: eric at trueblade.com (Eric Smith) Date: Fri, 11 Sep 2009 05:10:22 -0400 Subject: [Python-ideas] Options parsing in the Tornado Web Server In-Reply-To: <4AAA116E.9080504@trueblade.com> References: <7659cab30909101250h46ee269ch28c5b21e9fa9c9d5@mail.gmail.com> <7659cab30909101352u14554ca1i1a64dd07ccb8c29b@mail.gmail.com> <4AA96B9A.6020505@gmail.com> <4AAA116E.9080504@trueblade.com> Message-ID: <4AAA13FE.7050002@trueblade.com> Eric Smith wrote: > Nick Coghlan wrote: >> Anyone wanting to replace/compete with optparse (particularly with goals >> for latter standard library inclusion) would do well to better >> articulate what they don't like about optparse though. I acknowledge >> that using optparse the first couple of times can have something of a >> learning curve, but that's because it is rather powerful. > > Note that argparse does provide that justification: > http://argparse.googlecode.com/svn/trunk/doc/argparse-vs-optparse.html > > In particular: > http://argparse.googlecode.com/svn/trunk/doc/argparse-vs-optparse.html#upgrading-optparse-code > > has a short blurb on why Steven didn't just extend optparse. He tried > to, but decided it was easier if he didn't. And I notice now that this has already been posted to stdlib-sig, which I've joined. I'll continue this over there. From ubershmekel at gmail.com Fri Sep 11 17:16:50 2009 From: ubershmekel at gmail.com (Yuvgoog Greenle) Date: Fri, 11 Sep 2009 18:16:50 +0300 Subject: [Python-ideas] Add a builtin method to 'int' for base/radix conversion In-Reply-To: <4A9BF08C.9060607@mrabarnett.plus.com> References: <9d153b7c0908301745n51a9751avbaf16d0e155fec7e@mail.gmail.com> <4A9BC96B.2010207@gmail.com> <1AD8D421-A250-421D-A6ED-4992234A5B96@masklinn.net> <4A9BF08C.9060607@mrabarnett.plus.com> Message-ID: <9d153b7c0909110816k93a3b91n161cb58572bf1151@mail.gmail.com> Does anybody have any more use cases, ideas or suggestions? I'm getting the feeling this suggestion is +0 to most people and +1 for the rest. I'm pretty new to these mailing lists so does that mean a yes or a no? On Mon, Aug 31, 2009 at 6:47 PM, MRAB wrote: > Masklinn wrote: > >> On 31 Aug 2009, at 15:00 , Nick Coghlan wrote: >> Yuvgoog Greenle wrote: >> >>> I believe int(s, base) needs an inverse function to allow string >>>> representation with different bases. An example use case is 'hashing' a >>>> counter like video ID's on youtube, you could use a regular int >>>> internally and publish a shorter base-62 id >>>> for links. >>>> >>>> This subject was discussed 2.5 years ago: >>>> http://mail.python.org/pipermail/python-dev/2006-January/059789.html >>>> >>>> I opened a feature request ticket: >>>> http://bugs.python.org/issue6783 >>>> >>>> Some of the questions that remain: >>>> 1. Whether this should be a method for int or a regular function in a >>>> standard library module like math. >>>> 2. What should the method/function be called? (base_convert, radix, etc) >>>> >>>> What do you guys think? >>>> >>> >>> This has been coming up for years and always gets bogged down in a >>> spelling argument (a method on int, a function in the math module and an >>> update to the str.format mini language would be the current contenders). >>> >>> However, most of the actual real use cases for bases between 2 and 36 >>> were dealt with by the addition of binary and octal output to string >>> formatting so the impetus to do anything about it is now a lot lower. >>> >>> As far as bases between 37 and 62 go, that would involve first getting >>> agreement on extending int() to handle those bases by allowing case >>> sensitive digit parsing. Presumably that would use string lexical >>> ordering so that int('a', 37) > int('A', 37) and int('b', 37) would >>> raise an exception. >>> >>> That would only be intuitive to someone that knows how ASCII based >>> alphanumeric ordering works though. >>> >> > ASCII? Surely it should be Unicode! :-) > > Or it could be handled via a translation table (needed both ways of >> course) mapping n indexes to n characters (with n the base you're working >> with), defaulting to something sane. >> >> The default could cover only bases 2 to 36. Any base > 36 would require > a user-supplied translation table. > > Though I'm not sure this is of much interest really: even Erlang (which >> provides pretty good base conversion tools: it supports literal integers of >> any base between 2 and 36) doesn't natively support bases beyond 36. A >> library would probably be better for those more conflictual (or less >> intuitive) ranges. >> >> It could permit a dict as the translation table when 'decoding' so that > both 'A' and 'a' could be mapped to 10, if necessary. > > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ncoghlan at gmail.com Fri Sep 11 18:08:28 2009 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sat, 12 Sep 2009 02:08:28 +1000 Subject: [Python-ideas] Add a builtin method to 'int' for base/radix conversion In-Reply-To: <9d153b7c0909110816k93a3b91n161cb58572bf1151@mail.gmail.com> References: <9d153b7c0908301745n51a9751avbaf16d0e155fec7e@mail.gmail.com> <4A9BC96B.2010207@gmail.com> <1AD8D421-A250-421D-A6ED-4992234A5B96@masklinn.net> <4A9BF08C.9060607@mrabarnett.plus.com> <9d153b7c0909110816k93a3b91n161cb58572bf1151@mail.gmail.com> Message-ID: <4AAA75FC.4000001@gmail.com> Yuvgoog Greenle wrote: > Does anybody have any more use cases, ideas or suggestions? I'm getting > the feeling this suggestion is +0 to most people and +1 for the rest. > I'm pretty new to these mailing lists so does that mean a yes or a no? A generally lukewarm response means a maybe :) A positive response on python-ideas is still a maybe until the idea has subsequently also run the gauntlet of python-dev with actual code to back it up. In this case, the status quo is: str -> int (arbitrary base up to 36) via int() constructor (base "0" meaning Python literal format). int -> str via str() (for decimal output), hex(), oct(), bin() and string formatting So the currently unsupported use cases are limited to outputting numbers in bases between 3 and 36 that are not 8, 10 or 16. You're probably going to have a hard time convincing anyone that those additional use cases are worth putting much effort into supporting (and even then, they're probably better off as a 3rd party library that can add things like support for integers in bases up to 62). Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- From python at mrabarnett.plus.com Fri Sep 11 21:51:55 2009 From: python at mrabarnett.plus.com (MRAB) Date: Fri, 11 Sep 2009 20:51:55 +0100 Subject: [Python-ideas] Add a builtin method to 'int' for base/radix conversion In-Reply-To: <4AAA75FC.4000001@gmail.com> References: <9d153b7c0908301745n51a9751avbaf16d0e155fec7e@mail.gmail.com> <4A9BC96B.2010207@gmail.com> <1AD8D421-A250-421D-A6ED-4992234A5B96@masklinn.net> <4A9BF08C.9060607@mrabarnett.plus.com> <9d153b7c0909110816k93a3b91n161cb58572bf1151@mail.gmail.com> <4AAA75FC.4000001@gmail.com> Message-ID: <4AAAAA5B.2060500@mrabarnett.plus.com> Nick Coghlan wrote: > Yuvgoog Greenle wrote: >> Does anybody have any more use cases, ideas or suggestions? I'm getting >> the feeling this suggestion is +0 to most people and +1 for the rest. >> I'm pretty new to these mailing lists so does that mean a yes or a no? > > A generally lukewarm response means a maybe :) > > A positive response on python-ideas is still a maybe until the idea has > subsequently also run the gauntlet of python-dev with actual code to > back it up. > > In this case, the status quo is: > > str -> int (arbitrary base up to 36) via int() constructor (base "0" > meaning Python literal format). > > int -> str via str() (for decimal output), hex(), oct(), bin() and > string formatting > > So the currently unsupported use cases are limited to outputting numbers > in bases between 3 and 36 that are not 8, 10 or 16. > > You're probably going to have a hard time convincing anyone that those > additional use cases are worth putting much effort into supporting (and > even then, they're probably better off as a 3rd party library that can > add things like support for integers in bases up to 62). > It's one of those ideas: Q: Do you think it's a good idea? A: Yes. Q: Do you think you'd use it? A: Probably not. :-) From dickinsm at gmail.com Sun Sep 13 20:06:22 2009 From: dickinsm at gmail.com (Mark Dickinson) Date: Sun, 13 Sep 2009 19:06:22 +0100 Subject: [Python-ideas] Add a builtin method to 'int' for base/radix conversion In-Reply-To: <9d153b7c0909110816k93a3b91n161cb58572bf1151@mail.gmail.com> References: <9d153b7c0908301745n51a9751avbaf16d0e155fec7e@mail.gmail.com> <4A9BC96B.2010207@gmail.com> <1AD8D421-A250-421D-A6ED-4992234A5B96@masklinn.net> <4A9BF08C.9060607@mrabarnett.plus.com> <9d153b7c0909110816k93a3b91n161cb58572bf1151@mail.gmail.com> Message-ID: <5c6f2a5d0909131106x4e6f49aep56b47eb5675ab894@mail.gmail.com> On Fri, Sep 11, 2009 at 4:16 PM, Yuvgoog Greenle wrote: > Does anybody have any more use cases, ideas or suggestions? I'm getting the > feeling this suggestion is +0 to most people and +1 for the rest. I'm pretty > new to these mailing lists so does that mean a yes or a no? Just out of curiosity, I did a Google code search[*] for uses of the inverse operation: int( ,n). I found a good handful of uses of int(s, 36), almost all apparently to do with turning integers into suitable id strings; there was also evidence that people have implemented the reverse 'integer -> base 36 string' conversion at least twice. I found no meaningful uses of any bases other than 2, 8, 10, 16, and 36. So the main use case seems to be serialization and deserialization of integers into some 'suitably nice' alphabet, and that alphabet is likely to be application-dependent. -0 for int.to_base(n) (2 <= n <= 36) or equivalent functionality in the core. +0 for a pair of library functions converting to and from base n, with explicitly given translation table. I agree with MRAB that an implicit digit set should only be allowed for 2 <= base <= 36, if at all. By the way, _PyLong_Format in Objects/longobject.c *does* contain code for general integer -> base b conversions, 2 <= b <= 36, but that code is currently unused (as far as I can tell). Mark [*] http://www.google.com/codesearch?hl=en&lr=&q=%5CWint%5Cs*%5C%28.*%5C%2C%5Cs*36%5Cs*%5C%29+lang%3Apython&sbtn=Search From gerald.britton at gmail.com Sun Sep 13 20:10:45 2009 From: gerald.britton at gmail.com (Gerald Britton) Date: Sun, 13 Sep 2009 14:10:45 -0400 Subject: [Python-ideas] set.add(x) slower than if x in set:set.add(x) Message-ID: <5d1a32000909131110k636f0d6v2413fb190e5a3c38@mail.gmail.com> Hi -- This is maybe the wrong list for this question. If would someone please redirect me? I stumbled across a performance anomaly wrt the set.add method. My idea was that if I try to add something via set.add, the method has to first check if the new item is already in the set, since set items are supposed to be unique. Then, on a whim, I stuck an "if x in set" condition in front of it. I was surprised to learn that this latter approach runs faster! Here are some results: $ python -m timeit -n 1000000 -s 'with open("/usr/share/dict/words") as f: s = set(w.strip("\n") for w in f)' 's.add("mother")' 1000000 loops, best of 3: 0.292 usec per loop britton at TheBrittons:~$ python -m timeit -n 1000000 -s 'with open("/usr/share/dict/words") as f: s = set(w.strip("\n") for w in f)' 'if "mother" not in s:s.add("mother")' 1000000 loops, best of 3: 0.185 usec per loop the second example beats the first by about 36% Is the timing difference just the cost of the method lookup for s.add, or is something else happening that I'm not seeing? -- Gerald Britton From python at rcn.com Sun Sep 13 21:37:07 2009 From: python at rcn.com (Raymond Hettinger) Date: Sun, 13 Sep 2009 12:37:07 -0700 Subject: [Python-ideas] set.add(x) slower than if x in set:set.add(x) In-Reply-To: <5d1a32000909131110k636f0d6v2413fb190e5a3c38@mail.gmail.com> References: <5d1a32000909131110k636f0d6v2413fb190e5a3c38@mail.gmail.com> Message-ID: <440A5B08-A86C-4F86-A03E-57C55D7B241C@rcn.com> On Sep 13, 2009, at 11:10 AM, Gerald Britton wrote: > Here are some results: > > $ python -m timeit -n 1000000 -s 'with open("/usr/share/dict/words") > as f: s = set(w.strip("\n") for w in f)' 's.add("mother")' > 1000000 loops, best of 3: 0.292 usec per loop > > britton at TheBrittons:~$ python -m timeit -n 1000000 -s 'with > open("/usr/share/dict/words") as f: s = set(w.strip("\n") for w in f)' > 'if "mother" not in s:s.add("mother")' > 1000000 loops, best of 3: 0.185 usec per loop > > the second example beats the first by about 36% > > Is the timing difference just the cost of the method lookup for s.add, > or is something else happening that I'm not seeing? It is the something else you're not seeing ;-) On the first pass of the 1000000 loops, "mother" gets added. On the remaining passes the 'if "mother" not in set' test fails and the set.add() never gets executed. That latter operation is a bit more expensive than the contains-test because it includes the time to lookup and bind the add method. Raymond From bernie at codewiz.org Sun Sep 13 23:25:23 2009 From: bernie at codewiz.org (Bernie Innocenti) Date: Sun, 13 Sep 2009 17:25:23 -0400 Subject: [Python-ideas] IDEA: do not alter default SIGINT handling Message-ID: <1252877123.19979.43.camel@giskard> On startup, the Python interpreter changes the default behavior of SIGINT, which results in many Python programs to ignore the keyboard interrupt exactly in the situations when users are most likely to use it (i.e.: when the program becomes unresponsive). Minimal testcase: $ echo "void foo() { for(;;) {} }" >foo.c $ gcc -shared -o foo.so foo.c $ python -c 'import ctypes;ctypes.CDLL("./foo.so").foo()' ^C^C^C ^C ^C DAMN! ^C This scenario mimics a Python program calling some blocking library function. It can also happen with IO-bound functions if they loop on read() and don't abort on short reads. One might be tempted to say "this behavior of the Python intepreter is by design" and suggest users to use CTRL-\ instead of CTRL-C. However, this non-standard behavior is very annoying for users who expect ^C to work on UNIX systems. In fact, no other compiled or interpreted language I know of behaves this way, and Python should not be the only exception. While I see the usefulness of KeyboardInterrupt from the programmer point of view, only a minority of programs actually need to trap SIGINT and do something with it. Other language runtimes require the programmer to manually trap SIGINT when needed. The Python interpreter could maintain backwards compatibility by enabling automatic SIGINT trapping when entering a "try" block that would intercept KeyboardInterrupt. For 2 years, I've been using this workaround in my /usr/lib64/python2.6/sitecustomize.py: ----cut----- import signal signal.signal(signal.SIGINT, signal.SIG_DFL) ----cut----- CTRL-C has been working perfectly ever since. So far, I have not yet found a single Python program where restoring the default behavior of SIGINT causes real issues, but there may certainly be a few. Granted, this is just a kludge, not a perfect fix, but from a user perspective, it already improves upon the current behavior (i.e. more pros than cons). At least, this is my personal experience. If you're skeptical, please try the above workaround yourself for a few months and let me know what breaks for you. If we could break the syntax of "print" statements, I'm sure we can also find a satisfactory compromise for CTRL-C handling that won't affect more than 0.1% of existing Python programs. -- // Bernie Innocenti - http://codewiz.org/ \X/ Sugar Labs - http://sugarlabs.org/ From pyideas at rebertia.com Sun Sep 13 23:32:29 2009 From: pyideas at rebertia.com (Chris Rebert) Date: Sun, 13 Sep 2009 14:32:29 -0700 Subject: [Python-ideas] IDEA: do not alter default SIGINT handling In-Reply-To: <1252877123.19979.43.camel@giskard> References: <1252877123.19979.43.camel@giskard> Message-ID: <50697b2c0909131432w387cb34fl311d934d4680039f@mail.gmail.com> On Sun, Sep 13, 2009 at 2:25 PM, Bernie Innocenti wrote: > If we could break the syntax of "print" statements, I'm sure we > can also find a satisfactory compromise for CTRL-C handling that > won't affect more than 0.1% of existing Python programs. What does the print syntax have to do with this? You know it became a regular function in Python 3, right? Cheers, Chris -- http://blog.rebertia.com From benjamin at python.org Sun Sep 13 23:35:56 2009 From: benjamin at python.org (Benjamin Peterson) Date: Sun, 13 Sep 2009 21:35:56 +0000 (UTC) Subject: [Python-ideas] IDEA: do not alter default SIGINT handling References: <1252877123.19979.43.camel@giskard> Message-ID: Bernie Innocenti writes: > If we could break the syntax of "print" statements, I'm sure we > can also find a satisfactory compromise for CTRL-C handling that > won't affect more than 0.1% of existing Python programs. It would actually be a huge compatibility break because finally statments would no longer be garunteed to execute. From bernie at codewiz.org Mon Sep 14 00:34:12 2009 From: bernie at codewiz.org (Bernie Innocenti) Date: Sun, 13 Sep 2009 18:34:12 -0400 Subject: [Python-ideas] IDEA: do not alter default SIGINT handling In-Reply-To: <50697b2c0909131432w387cb34fl311d934d4680039f@mail.gmail.com> References: <1252877123.19979.43.camel@giskard> <50697b2c0909131432w387cb34fl311d934d4680039f@mail.gmail.com> Message-ID: <1252881252.19979.99.camel@giskard> El Sun, 13-09-2009 a las 14:32 -0700, Chris Rebert escribi?: > On Sun, Sep 13, 2009 at 2:25 PM, Bernie Innocenti wrote: > > > If we could break the syntax of "print" statements, I'm sure we > > can also find a satisfactory compromise for CTRL-C handling that > > won't affect more than 0.1% of existing Python programs. > > What does the print syntax have to do with this? You know it became a > regular function in Python 3, right? Yes, I do. What I meant is that changing the behavior of SIGINT would introduce negligible incompatibilities compared to the things that were changed in Python 3.0. By this, I'm not implying that SIGINT handling must absolutely be changed NOW. It can certainly wait until the next major revision of the language, if one is planned. -- // Bernie Innocenti - http://codewiz.org/ \X/ Sugar Labs - http://sugarlabs.org/ From pyideas at rebertia.com Mon Sep 14 00:36:22 2009 From: pyideas at rebertia.com (Chris Rebert) Date: Sun, 13 Sep 2009 15:36:22 -0700 Subject: [Python-ideas] IDEA: do not alter default SIGINT handling In-Reply-To: <1252881252.19979.99.camel@giskard> References: <1252877123.19979.43.camel@giskard> <50697b2c0909131432w387cb34fl311d934d4680039f@mail.gmail.com> <1252881252.19979.99.camel@giskard> Message-ID: <50697b2c0909131536q528674f5ued4a12ea584809ac@mail.gmail.com> On Sun, Sep 13, 2009 at 3:34 PM, Bernie Innocenti wrote: > El Sun, 13-09-2009 a las 14:32 -0700, Chris Rebert escribi?: >> On Sun, Sep 13, 2009 at 2:25 PM, Bernie Innocenti wrote: >> >> > If we could break the syntax of "print" statements, I'm sure we >> > can also find a satisfactory compromise for CTRL-C handling that >> > won't affect more than 0.1% of existing Python programs. >> >> What does the print syntax have to do with this? You know it became a >> regular function in Python 3, right? > > Yes, I do. ?What I meant is that changing the behavior of SIGINT would > introduce negligible incompatibilities compared to the things that were > changed in Python 3.0. Ah, my apologies, I misread that sentence in your original missive. Cheers, Chris From mwm-keyword-python.b4bdba at mired.org Mon Sep 14 01:38:47 2009 From: mwm-keyword-python.b4bdba at mired.org (Mike Meyer) Date: Sun, 13 Sep 2009 19:38:47 -0400 Subject: [Python-ideas] IDEA: do not alter default SIGINT handling In-Reply-To: References: <1252877123.19979.43.camel@giskard> Message-ID: <20090913193847.1efca525@bhuda.mired.org> On Sun, 13 Sep 2009 21:35:56 +0000 (UTC) Benjamin Peterson wrote: > Bernie Innocenti writes: > > If we could break the syntax of "print" statements, I'm sure we > > can also find a satisfactory compromise for CTRL-C handling that > > won't affect more than 0.1% of existing Python programs. > > It would actually be a huge compatibility break because finally statments would > no longer be garunteed to execute. They aren't now. os._exit() skips unwinding the try/except chain before exiting. http://www.mired.org/consulting.html Independent Network/Unix/Perforce consultant, email for more information. O< ascii ribbon campaign - stop html mail - www.asciiribbon.org From python at mrabarnett.plus.com Mon Sep 14 01:53:22 2009 From: python at mrabarnett.plus.com (MRAB) Date: Mon, 14 Sep 2009 00:53:22 +0100 Subject: [Python-ideas] IDEA: do not alter default SIGINT handling In-Reply-To: References: <1252877123.19979.43.camel@giskard> Message-ID: <4AAD85F2.9080609@mrabarnett.plus.com> Benjamin Peterson wrote: > Bernie Innocenti writes: >> If we could break the syntax of "print" statements, I'm sure we >> can also find a satisfactory compromise for CTRL-C handling that >> won't affect more than 0.1% of existing Python programs. > > It would actually be a huge compatibility break because finally statments would > no longer be garunteed to execute. > Perhaps CTRL-C could be left as-is but also have a 'stronger' version like SHIFT-CTRL-C. From mwm-keyword-python.b4bdba at mired.org Mon Sep 14 02:05:09 2009 From: mwm-keyword-python.b4bdba at mired.org (Mike Meyer) Date: Sun, 13 Sep 2009 20:05:09 -0400 Subject: [Python-ideas] IDEA: do not alter default SIGINT handling In-Reply-To: <4AAD85F2.9080609@mrabarnett.plus.com> References: <1252877123.19979.43.camel@giskard> <4AAD85F2.9080609@mrabarnett.plus.com> Message-ID: <20090913200509.439ca1b6@bhuda.mired.org> On Mon, 14 Sep 2009 00:53:22 +0100 MRAB wrote: > Benjamin Peterson wrote: > > Bernie Innocenti writes: > >> If we could break the syntax of "print" statements, I'm sure we > >> can also find a satisfactory compromise for CTRL-C handling that > >> won't affect more than 0.1% of existing Python programs. > > > > It would actually be a huge compatibility break because finally statments would > > no longer be garunteed to execute. > > > Perhaps CTRL-C could be left as-is but also have a 'stronger' version > like SHIFT-CTRL-C. Possibly that will work on Windows. It won't work on Posix-compliant systems. The tty driver handles turning characters into signals to the controlling process, and does this in the kernel. To get a keystroke to generate some signal, you have to chose a signal the TTY driver can generate. Further, you need to use one that's not already used for other things. Needing to emulate SIGINFO on a system that doesn't support it, I wind up choosing SIGQUIT, as we don't get much use from python core dumps from production, whereas everything else I can't see the users giving up readily. http://www.mired.org/consulting.html Independent Network/Unix/Perforce consultant, email for more information. O< ascii ribbon campaign - stop html mail - www.asciiribbon.org From python at mrabarnett.plus.com Mon Sep 14 02:11:35 2009 From: python at mrabarnett.plus.com (MRAB) Date: Mon, 14 Sep 2009 01:11:35 +0100 Subject: [Python-ideas] IDEA: do not alter default SIGINT handling In-Reply-To: <20090913200509.439ca1b6@bhuda.mired.org> References: <1252877123.19979.43.camel@giskard> <4AAD85F2.9080609@mrabarnett.plus.com> <20090913200509.439ca1b6@bhuda.mired.org> Message-ID: <4AAD8A37.8080504@mrabarnett.plus.com> Mike Meyer wrote: > On Mon, 14 Sep 2009 00:53:22 +0100 > MRAB wrote: > >> Benjamin Peterson wrote: >>> Bernie Innocenti writes: >>>> If we could break the syntax of "print" statements, I'm sure we >>>> can also find a satisfactory compromise for CTRL-C handling that >>>> won't affect more than 0.1% of existing Python programs. >>> It would actually be a huge compatibility break because finally statments would >>> no longer be garunteed to execute. >>> >> Perhaps CTRL-C could be left as-is but also have a 'stronger' version >> like SHIFT-CTRL-C. > > Possibly that will work on Windows. It won't work on Posix-compliant > systems. The tty driver handles turning characters into signals to the > controlling process, and does this in the kernel. To get a keystroke > to generate some signal, you have to chose a signal the TTY driver can > generate. Further, you need to use one that's not already used for > other things. > > Needing to emulate SIGINFO on a system that doesn't support it, I wind > up choosing SIGQUIT, as we don't get much use from python core dumps > from production, whereas everything else I can't see the users giving > up readily. > How about 2 (or 3?) in rapid succession ("here's a CTRL-C, and here's another one to show I mean it!" :-)). From steve at pearwood.info Mon Sep 14 02:16:21 2009 From: steve at pearwood.info (Steven D'Aprano) Date: Mon, 14 Sep 2009 10:16:21 +1000 Subject: [Python-ideas] IDEA: do not alter default SIGINT handling In-Reply-To: References: <1252877123.19979.43.camel@giskard> Message-ID: <200909141016.21945.steve@pearwood.info> On Mon, 14 Sep 2009 07:35:56 am Benjamin Peterson wrote: > Bernie Innocenti writes: > > If we could break the syntax of "print" statements, I'm sure we > > can also find a satisfactory compromise for CTRL-C handling that > > won't affect more than 0.1% of existing Python programs. > > It would actually be a huge compatibility break because finally > statments would no longer be garunteed to execute. Guarantee not a guarantee :) Finally statements are only guaranteed to execute if the Python process isn't killed or otherwise interrupted. You can kill an unresponsive Python program from the outside, (say) with ctrl-\, and the finally clause never executes: >>> try: ... t = time.time() ... while 1: ... pass ... finally: ... print "finished infinite loop in %f seconds" % (time.time() - t) ... Quit [steve] $ I think it's worth considering what the OP's real complaint is: namely, that (some? all?) C extensions can't be interrupted by ctrl-C as the user would expect. Is that a valid issue, or is it expected that Python has no control over what happens inside the C extension? -- Steven D'Aprano From stephen at xemacs.org Mon Sep 14 02:30:30 2009 From: stephen at xemacs.org (Stephen J. Turnbull) Date: Mon, 14 Sep 2009 09:30:30 +0900 Subject: [Python-ideas] IDEA: do not alter default SIGINT handling In-Reply-To: References: <1252877123.19979.43.camel@giskard> Message-ID: <87vdjmv049.fsf@uwakimon.sk.tsukuba.ac.jp> Benjamin Peterson writes: > It would actually be a huge compatibility break because finally > statments would no longer be garunteed to execute. Hey, kill -9 trumps any guarantee you can give, and that's what the user resorts to in the use case at hand. This is a bad thing, even from the point of view of those who use "finally" a lot. From greg.ewing at canterbury.ac.nz Mon Sep 14 02:37:51 2009 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Mon, 14 Sep 2009 12:37:51 +1200 Subject: [Python-ideas] IDEA: do not alter default SIGINT handling In-Reply-To: <4AAD85F2.9080609@mrabarnett.plus.com> References: <1252877123.19979.43.camel@giskard> <4AAD85F2.9080609@mrabarnett.plus.com> Message-ID: <4AAD905F.1090507@canterbury.ac.nz> MRAB wrote: > Perhaps CTRL-C could be left as-is but also have a 'stronger' version > like SHIFT-CTRL-C. There's already a stronger version, it's Ctrl-\. -- Greg From python at mrabarnett.plus.com Mon Sep 14 02:57:50 2009 From: python at mrabarnett.plus.com (MRAB) Date: Mon, 14 Sep 2009 01:57:50 +0100 Subject: [Python-ideas] IDEA: do not alter default SIGINT handling In-Reply-To: <4AAD905F.1090507@canterbury.ac.nz> References: <1252877123.19979.43.camel@giskard> <4AAD85F2.9080609@mrabarnett.plus.com> <4AAD905F.1090507@canterbury.ac.nz> Message-ID: <4AAD950E.7000205@mrabarnett.plus.com> Greg Ewing wrote: > MRAB wrote: > >> Perhaps CTRL-C could be left as-is but also have a 'stronger' version >> like SHIFT-CTRL-C. > > There's already a stronger version, it's Ctrl-\. > OK, and as far as I can tell it's ctrl-break on Windows. From greg.ewing at canterbury.ac.nz Mon Sep 14 03:12:30 2009 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Mon, 14 Sep 2009 13:12:30 +1200 Subject: [Python-ideas] IDEA: do not alter default SIGINT handling In-Reply-To: <200909141016.21945.steve@pearwood.info> References: <1252877123.19979.43.camel@giskard> <200909141016.21945.steve@pearwood.info> Message-ID: <4AAD987E.1070900@canterbury.ac.nz> Steven D'Aprano wrote: > You can kill an unresponsive Python program from the outside, (say) > with ctrl-\, and the finally clause never executes: Yes, but you expect that signal to kill the process immediately without bothering with any cleanup. Ctrl-C, on the other hand, is meant to be a graceful request to terminate cleanly. In the context of Python, one can reasonably expect that to include execution of finally blocks. -- Greg From greg.ewing at canterbury.ac.nz Mon Sep 14 03:15:58 2009 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Mon, 14 Sep 2009 13:15:58 +1200 Subject: [Python-ideas] IDEA: do not alter default SIGINT handling In-Reply-To: <200909141016.21945.steve@pearwood.info> References: <1252877123.19979.43.camel@giskard> <200909141016.21945.steve@pearwood.info> Message-ID: <4AAD994E.8040908@canterbury.ac.nz> Steven D'Aprano wrote: > I think it's worth considering what the OP's real complaint is: namely, > that (some? all?) C extensions can't be interrupted by ctrl-C as the > user would expect. Is that a valid issue, or is it expected that Python > has no control over what happens inside the C extension? I think it's an unavoidable consequence of the desire to be able to unwind cleanly in the event of a Ctrl-C. If it happens in the middle of arbitrary C code, you can't just jump out of it because that could leave things in an unpredictable state, and execution of further Python code wouldn't be safe. -- Greg From stephen at xemacs.org Mon Sep 14 03:23:14 2009 From: stephen at xemacs.org (Stephen J. Turnbull) Date: Mon, 14 Sep 2009 10:23:14 +0900 Subject: [Python-ideas] IDEA: do not alter default SIGINT handling In-Reply-To: <4AAD8A37.8080504@mrabarnett.plus.com> References: <1252877123.19979.43.camel@giskard> <4AAD85F2.9080609@mrabarnett.plus.com> <20090913200509.439ca1b6@bhuda.mired.org> <4AAD8A37.8080504@mrabarnett.plus.com> Message-ID: <87r5uauxod.fsf@uwakimon.sk.tsukuba.ac.jp> MRAB writes: > How about 2 (or 3?) in rapid succession ("here's a CTRL-C, and here's > another one to show I mean it!" :-)). That's more complexity than you want to put in a signal handler. The way Emacs handles this is that the signal handler just enqueues a quit event, and the event loop checks for it and handles it. In other places (such as looping functions) more complex QUIT processing (that checks for repeated signals and throws to the innermost QUIT catcher) is done, but this can only be done in "safe" places, not in the signal handler itself. I imagine Python works the same way and it works fine in pure Python programs, too. The problem here is that when you return from the signal handler, you're trapped inside a poorly-written (for this purpose) C extension, and Python never gets to check for the first quit, let alone repeated ones. In Emacs it's pretty rare to get those, because Emacs is quite hostile to third-party C extensions, so C code QUIT-ified by the maintainers before it's allowed to be added. Python has lots of C extensions, and some are going to need QUIT occasionally -- but not often enough for the extension maintainer to notice and handle it. :-P From mwm-keyword-python.b4bdba at mired.org Mon Sep 14 04:41:54 2009 From: mwm-keyword-python.b4bdba at mired.org (Mike Meyer) Date: Sun, 13 Sep 2009 22:41:54 -0400 Subject: [Python-ideas] IDEA: do not alter default SIGINT handling In-Reply-To: <200909141016.21945.steve@pearwood.info> References: <1252877123.19979.43.camel@giskard> <200909141016.21945.steve@pearwood.info> Message-ID: <20090913224154.189bd9cb@bhuda.mired.org> On Mon, 14 Sep 2009 10:16:21 +1000 Steven D'Aprano wrote: > I think it's worth considering what the OP's real complaint is: namely, > that (some? all?) C extensions can't be interrupted by ctrl-C as the > user would expect. Is that a valid issue, or is it expected that Python > has no control over what happens inside the C extension? I think it's valid. It's expected that the USER will have some control over what's going on in a program - in particular, that they should be able to interrupt it pretty much any time barring a bug of some sort. The interaction between Python's default SIGINT handling and C extensions breaks that expectation. It's not clear what a good solution would be, though. This problem exists for pretty much all signal handlers - they don't get a chance to run if there's a misbehaving C extension executing. The simplest solution is to just not handle SIGINT by default, which raises the objection that try/finally doesn't "work". I don't think this will matter in most cases; the finally block is usually freeing up some resource that is going to be freed by the process exiting as part of default SIGINT behavior. We've certainly had similarly subtle changes that are much harder to deal with in the 2.X line. To me, the nasty part of such a change is the loss of traceback on SIGINT by default. During development, hitting C-C when the python code is in an infinite loop pretty reliably provides a traceback that nails down the problem. Yes, I could just boilerplate that into every program, but part of python's attraction is that such things are so seldom needed. I suspect this might be best dealt with by documenting how C extensions should behave with respect to signals - particularly SIGINT - and providing sample code for doing so. http://www.mired.org/consulting.html Independent Network/Unix/Perforce consultant, email for more information. O< ascii ribbon campaign - stop html mail - www.asciiribbon.org From ubershmekel at gmail.com Mon Sep 14 04:51:29 2009 From: ubershmekel at gmail.com (Yuvgoog Greenle) Date: Mon, 14 Sep 2009 05:51:29 +0300 Subject: [Python-ideas] Add a builtin method to 'int' for base/radix conversion In-Reply-To: <5c6f2a5d0909131106x4e6f49aep56b47eb5675ab894@mail.gmail.com> References: <9d153b7c0908301745n51a9751avbaf16d0e155fec7e@mail.gmail.com> <4A9BC96B.2010207@gmail.com> <1AD8D421-A250-421D-A6ED-4992234A5B96@masklinn.net> <4A9BF08C.9060607@mrabarnett.plus.com> <9d153b7c0909110816k93a3b91n161cb58572bf1151@mail.gmail.com> <5c6f2a5d0909131106x4e6f49aep56b47eb5675ab894@mail.gmail.com> Message-ID: <9d153b7c0909131951m585f3dcev953de169ffbf3b95@mail.gmail.com> Btw, when you say translation table, do you mean just a string? Because a translation table would need to be continuous from 0 to the base so a real dicitionary-esque table may be overkill. The only advantage of a table might be to convert certain digits into multiple bytes (some sort of ad-hoc unicode use case?). --yuv On Sun, Sep 13, 2009 at 9:06 PM, Mark Dickinson wrote: > On Fri, Sep 11, 2009 at 4:16 PM, Yuvgoog Greenle > wrote: > > Does anybody have any more use cases, ideas or suggestions? I'm getting > the > > feeling this suggestion is +0 to most people and +1 for the rest. I'm > pretty > > new to these mailing lists so does that mean a yes or a no? > > Just out of curiosity, I did a Google code search[*] for uses of the > inverse operation: int( ,n). I found a good handful of > uses of int(s, 36), almost all apparently to do with turning integers > into suitable id strings; there was also evidence that people have > implemented the reverse 'integer -> base 36 string' conversion at > least twice. I found no meaningful uses of any bases other than > 2, 8, 10, 16, and 36. So the main use case seems to be > serialization and deserialization of integers into some 'suitably nice' > alphabet, and that alphabet is likely to be application-dependent. > > -0 for int.to_base(n) (2 <= n <= 36) or equivalent functionality in the > core. > > +0 for a pair of library functions converting to and from base n, with > explicitly given translation table. I agree with MRAB that an implicit > digit set should only be allowed for 2 <= base <= 36, if at all. > > By the way, _PyLong_Format in Objects/longobject.c *does* contain > code for general integer -> base b conversions, 2 <= b <= 36, > but that code is currently unused (as far as I can tell). > > Mark > > [*] > http://www.google.com/codesearch?hl=en&lr=&q=%5CWint%5Cs*%5C%28.*%5C%2C%5Cs*36%5Cs*%5C%29+lang%3Apython&sbtn=Search > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ubershmekel at gmail.com Mon Sep 14 04:56:05 2009 From: ubershmekel at gmail.com (Yuvgoog Greenle) Date: Mon, 14 Sep 2009 05:56:05 +0300 Subject: [Python-ideas] set.add(x) slower than if x in set:set.add(x) In-Reply-To: <440A5B08-A86C-4F86-A03E-57C55D7B241C@rcn.com> References: <5d1a32000909131110k636f0d6v2413fb190e5a3c38@mail.gmail.com> <440A5B08-A86C-4F86-A03E-57C55D7B241C@rcn.com> Message-ID: <9d153b7c0909131956u42e1e77aufdb6fa9e16b020c5@mail.gmail.com> So this pattern is a valid python optimization? Funky... Sadly, there's no way around it unless the interpreter somehow did it magically for you. On Sun, Sep 13, 2009 at 10:37 PM, Raymond Hettinger wrote: > > On Sep 13, 2009, at 11:10 AM, Gerald Britton wrote: > >> Here are some results: >> >> $ python -m timeit -n 1000000 -s 'with open("/usr/share/dict/words") >> as f: s = set(w.strip("\n") for w in f)' 's.add("mother")' >> 1000000 loops, best of 3: 0.292 usec per loop >> >> britton at TheBrittons:~$ python -m timeit -n 1000000 -s 'with >> open("/usr/share/dict/words") as f: s = set(w.strip("\n") for w in f)' >> 'if "mother" not in s:s.add("mother")' >> 1000000 loops, best of 3: 0.185 usec per loop >> >> the second example beats the first by about 36% >> >> Is the timing difference just the cost of the method lookup for s.add, >> or is something else happening that I'm not seeing? >> > > It is the something else you're not seeing ;-) > > On the first pass of the 1000000 loops, "mother" gets added. > On the remaining passes the 'if "mother" not in set' test fails > and the set.add() never gets executed. That latter operation > is a bit more expensive than the contains-test because it > includes the time to lookup and bind the add method. > > > Raymond > > > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > -------------- next part -------------- An HTML attachment was scrubbed... URL: From stephen at xemacs.org Mon Sep 14 05:49:10 2009 From: stephen at xemacs.org (Stephen J. Turnbull) Date: Mon, 14 Sep 2009 12:49:10 +0900 Subject: [Python-ideas] IDEA: do not alter default SIGINT handling In-Reply-To: <1afaf6160909131726y444532c7h1b3dae0d3744667a@mail.gmail.com> References: <1252877123.19979.43.camel@giskard> <87vdjmv049.fsf@uwakimon.sk.tsukuba.ac.jp> <1afaf6160909131726y444532c7h1b3dae0d3744667a@mail.gmail.com> Message-ID: <87ljkiuqx5.fsf@uwakimon.sk.tsukuba.ac.jp> Benjamin Peterson writes: > 2009/9/13 Stephen J. Turnbull : > > Benjamin Peterson writes: > > > > ?> It would actually be a huge compatibility break because finally > > ?> statments would no longer be garunteed to execute. > > > > Hey, kill -9 trumps any guarantee you can give, and that's what the > > user resorts to in the use case at hand. ?This is a bad thing, even > > from the point of view of those who use "finally" a lot. > > Naturally, users should be able to control their programs absolutely, > but C-c is supposed to tell the program to clean up all its messes > then exit as gracefully as possible. Sure. But setting things up so that C-c doesn't even get through to the program doesn't help that cause. Since the OP claims it works for him, maybe the "IDEA" has merit in some more limited form, for example, by default uninstalling the Python graceful_SIGINT_handler when calling into a C module. (Probably this would have to be introduced in stages to allow extensions to update their functions as needed -- I suppose this would be a performance hit.) Then you could provide a with-signal-handler form of the standard macro for defining a Python function, with the semantics that the signal handler change gets optimized away if you specify the standard handler. And document that in this form the function needs to handle the SIGINT_signaled flag (or however it works) itself, or accept the possibility of an uninterruptible hang. From tjreedy at udel.edu Mon Sep 14 08:53:43 2009 From: tjreedy at udel.edu (Terry Reedy) Date: Mon, 14 Sep 2009 02:53:43 -0400 Subject: [Python-ideas] set.add(x) slower than if x in set:set.add(x) In-Reply-To: <9d153b7c0909131956u42e1e77aufdb6fa9e16b020c5@mail.gmail.com> References: <5d1a32000909131110k636f0d6v2413fb190e5a3c38@mail.gmail.com> <440A5B08-A86C-4F86-A03E-57C55D7B241C@rcn.com> <9d153b7c0909131956u42e1e77aufdb6fa9e16b020c5@mail.gmail.com> Message-ID: Yuvgoog Greenle wrote: > So this pattern is a valid python optimization? That exactly not how I interpreted Raymond > > On Sun, Sep 13, 2009 at 10:37 PM, Raymond Hettinger > > wrote: > > > On Sep 13, 2009, at 11:10 AM, Gerald Britton wrote: > > Here are some results: > > $ python -m timeit -n 1000000 -s 'with > open("/usr/share/dict/words") > as f: s = set(w.strip("\n") for w in f)' 's.add("mother")' > 1000000 loops, best of 3: 0.292 usec per loop Try looking up and binding the method just once, any any experienced Python programmer might do if doing repeated 'additions'. -s'...: sadd=set(w.strip("\n") for w in f).add' 'sadd("mother") > britton at TheBrittons:~$ python -m timeit -n 1000000 -s 'with > open("/usr/share/dict/words") as f: s = set(w.strip("\n") for w > in f)' > 'if "mother" not in s:s.add("mother")' > 1000000 loops, best of 3: 0.185 usec per loop Add 'sadd = s.add' at end of setup, followed by 'if "mother" not in s: sadd("mother") I doubt second will still be faster. > the second example beats the first by about 36% > > Is the timing difference just the cost of the method lookup for > s.add, > or is something else happening that I'm not seeing? > > > It is the something else you're not seeing ;-) > > On the first pass of the 1000000 loops, "mother" gets added. > On the remaining passes the 'if "mother" not in set' test fails > and the set.add() never gets executed. That latter operation > is a bit more expensive than the contains-test because it > includes the time to lookup and bind the add method. My suggested alteration removes the repeated lookup and bind. tjr From g.brandl at gmx.net Mon Sep 14 11:23:11 2009 From: g.brandl at gmx.net (Georg Brandl) Date: Mon, 14 Sep 2009 09:23:11 +0000 Subject: [Python-ideas] set.add(x) slower than if x in set:set.add(x) In-Reply-To: References: <5d1a32000909131110k636f0d6v2413fb190e5a3c38@mail.gmail.com> <440A5B08-A86C-4F86-A03E-57C55D7B241C@rcn.com> <9d153b7c0909131956u42e1e77aufdb6fa9e16b020c5@mail.gmail.com> Message-ID: Terry Reedy schrieb: >> britton at TheBrittons:~$ python -m timeit -n 1000000 -s 'with >> open("/usr/share/dict/words") as f: s = set(w.strip("\n") for w >> in f)' >> 'if "mother" not in s:s.add("mother")' >> 1000000 loops, best of 3: 0.185 usec per loop > > Add 'sadd = s.add' at end of setup, followed by > 'if "mother" not in s: sadd("mother") > > I doubt second will still be faster. Well, the method also has to be *called* (think argument parsing, but in this case that should be trivial). Georg -- Thus spake the Lord: Thou shalt indent with four spaces. No more, no less. Four shall be the number of spaces thou shalt indent, and the number of thy indenting shall be four. Eight shalt thou not indent, nor either indent thou two, excepting that thou then proceed to four. Tabs are right out. From g.brandl at gmx.net Mon Sep 14 11:28:56 2009 From: g.brandl at gmx.net (Georg Brandl) Date: Mon, 14 Sep 2009 09:28:56 +0000 Subject: [Python-ideas] IDEA: do not alter default SIGINT handling In-Reply-To: <4AAD85F2.9080609@mrabarnett.plus.com> References: <1252877123.19979.43.camel@giskard> <4AAD85F2.9080609@mrabarnett.plus.com> Message-ID: MRAB schrieb: > Benjamin Peterson wrote: >> Bernie Innocenti writes: >>> If we could break the syntax of "print" statements, I'm sure we >>> can also find a satisfactory compromise for CTRL-C handling that >>> won't affect more than 0.1% of existing Python programs. >> >> It would actually be a huge compatibility break because finally statments would >> no longer be garunteed to execute. >> > Perhaps CTRL-C could be left as-is but also have a 'stronger' version > like SHIFT-CTRL-C. Isn't it as easy as signal.signal(signal.SIGINT, signal.SIG_DFL) if you don't like the current handler? Georg -- Thus spake the Lord: Thou shalt indent with four spaces. No more, no less. Four shall be the number of spaces thou shalt indent, and the number of thy indenting shall be four. Eight shalt thou not indent, nor either indent thou two, excepting that thou then proceed to four. Tabs are right out. From lie.1296 at gmail.com Mon Sep 14 14:36:43 2009 From: lie.1296 at gmail.com (Lie Ryan) Date: Mon, 14 Sep 2009 22:36:43 +1000 Subject: [Python-ideas] Add a builtin method to 'int' for base/radix conversion In-Reply-To: <9d153b7c0909131951m585f3dcev953de169ffbf3b95@mail.gmail.com> References: <9d153b7c0908301745n51a9751avbaf16d0e155fec7e@mail.gmail.com> <4A9BC96B.2010207@gmail.com> <1AD8D421-A250-421D-A6ED-4992234A5B96@masklinn.net> <4A9BF08C.9060607@mrabarnett.plus.com> <9d153b7c0909110816k93a3b91n161cb58572bf1151@mail.gmail.com> <5c6f2a5d0909131106x4e6f49aep56b47eb5675ab894@mail.gmail.com> <9d153b7c0909131951m585f3dcev953de169ffbf3b95@mail.gmail.com> Message-ID: Yuvgoog Greenle wrote: > Btw, when you say translation table, do you mean just a string? Because > a translation table would need to be continuous from 0 to the base so a > real dicitionary-esque table may be overkill. The only advantage of a > table might be to convert certain digits into multiple bytes (some sort > of ad-hoc unicode use case?). > If the translation table is limited to a string, the function would be very limited. For example, it might be useful to use base-change function to convert between an IPv4 address and integer. The common representation of IPv4 address uses base-255 "number" (0.0.0.0-255.255.255.255) From ncoghlan at gmail.com Mon Sep 14 15:20:27 2009 From: ncoghlan at gmail.com (Nick Coghlan) Date: Mon, 14 Sep 2009 23:20:27 +1000 Subject: [Python-ideas] set.add(x) slower than if x in set:set.add(x) In-Reply-To: <9d153b7c0909131956u42e1e77aufdb6fa9e16b020c5@mail.gmail.com> References: <5d1a32000909131110k636f0d6v2413fb190e5a3c38@mail.gmail.com> <440A5B08-A86C-4F86-A03E-57C55D7B241C@rcn.com> <9d153b7c0909131956u42e1e77aufdb6fa9e16b020c5@mail.gmail.com> Message-ID: <4AAE431B.2020404@gmail.com> Yuvgoog Greenle wrote: > So this pattern is a valid python optimization? Funky... > > Sadly, there's no way around it unless the interpreter somehow did it > magically for you. The interpreter has no way of knowing a priori that the branch won't be taken 999,999 times out of 1,000,000. Think about what you are actually comparing here: Average speed of 1 million calls to s.add(x) Average speed of 1 million calls to "x in s" + one call to s.add(x) Now think about the fact that s.add(x) includes a containment test plus function call and name lookup overhead even in the case that the item is already present in the set. All overhead included: $ python -m timeit -s "s = set()" "s.add(1)" 1000000 loops, best of 3: 0.197 usec per loop Lose the attribute lookup: $ python -m timeit -s "s = set()" -s "sadd = s.add" "sadd(1)" 10000000 loops, best of 3: 0.146 usec per loop Skip the function call altogether most of the time: $ python -m timeit -s "s = set()" "if 1 not in s: s.add(1)" 10000000 loops, best of 3: 0.101 usec per loop Just do the containment test: $ python -m timeit -s "s = set([1])" "1 not in s" 10000000 loops, best of 3: 0.1 usec per loop Now then, lets also look at the absolute numbers we're discussing here (on my machine, anyway). Is the fastest version twice as fast as the slowest version? Yes it is. But that difference is only 97 *nano*seconds. And relative to the recommended approach of caching the attribute looking, we're only saving 45 nanoseconds. And in more realistic use cases where some items are already in the set and some aren't, the "I'll check first" implementation can become a pessimisation instead of an optimisation. To emphasise the point, we'll go to the other extreme where the item is added to the set every time: All the overhead: $ python -m timeit -s "s = set()" "s.add(1)" "s.clear()" 1000000 loops, best of 3: 0.374 usec per loop The "optimised" approach: $ python -m timeit -s "s = set()" "if 1 not in s: s.add(1)" "s.clear()" 1000000 loops, best of 3: 0.444 usec per loop Oops, looks like the approach that saves us 45 nanoseconds when the item is already in the set may cost us up to *70* nanoseconds when it turns out we need to add the item after all. Caching attribute lookups before time critical loops is a good optimisation technique that most experienced Python programmers learn. Pre-checking a condition that a called function is just going to check again and bail out quickly in less than 50% of cases? Usually a bad idea - the extra checks made when the function is invoked anyway will often cancel out any gains you make from avoiding the function call overhead. That said, as with any micro-optimisation: time it on a range of typical and end-case data sets. If you avoid the function call overhead often enough, doing a pre-check may be a net win for some inner loops. Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- From ncoghlan at gmail.com Mon Sep 14 15:23:29 2009 From: ncoghlan at gmail.com (Nick Coghlan) Date: Mon, 14 Sep 2009 23:23:29 +1000 Subject: [Python-ideas] IDEA: do not alter default SIGINT handling In-Reply-To: <20090913193847.1efca525@bhuda.mired.org> References: <1252877123.19979.43.camel@giskard> <20090913193847.1efca525@bhuda.mired.org> Message-ID: <4AAE43D1.1030707@gmail.com> Mike Meyer wrote: > On Sun, 13 Sep 2009 21:35:56 +0000 (UTC) > Benjamin Peterson wrote: > >> Bernie Innocenti writes: >>> If we could break the syntax of "print" statements, I'm sure we >>> can also find a satisfactory compromise for CTRL-C handling that >>> won't affect more than 0.1% of existing Python programs. >> It would actually be a huge compatibility break because finally statments would >> no longer be garunteed to execute. > > They aren't now. os._exit() skips unwinding the try/except chain > before exiting. An application calling an underscore prefixed function (in the os module no less) is significantly different to a user pressing Ctrl-C while the application is running normally. If a user wants to kill it ungracefully, that's what Ctrl-Break is for. Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- From python at mrabarnett.plus.com Mon Sep 14 15:37:20 2009 From: python at mrabarnett.plus.com (MRAB) Date: Mon, 14 Sep 2009 14:37:20 +0100 Subject: [Python-ideas] Add a builtin method to 'int' for base/radix conversion In-Reply-To: References: <9d153b7c0908301745n51a9751avbaf16d0e155fec7e@mail.gmail.com> <4A9BC96B.2010207@gmail.com> <1AD8D421-A250-421D-A6ED-4992234A5B96@masklinn.net> <4A9BF08C.9060607@mrabarnett.plus.com> <9d153b7c0909110816k93a3b91n161cb58572bf1151@mail.gmail.com> <5c6f2a5d0909131106x4e6f49aep56b47eb5675ab894@mail.gmail.com> <9d153b7c0909131951m585f3dcev953de169ffbf3b95@mail.gmail.com> Message-ID: <4AAE4710.1060301@mrabarnett.plus.com> Lie Ryan wrote: > Yuvgoog Greenle wrote: >> Btw, when you say translation table, do you mean just a string? >> Because a translation table would need to be continuous from 0 to the >> base so a real dicitionary-esque table may be overkill. The only >> advantage of a table might be to convert certain digits into multiple >> bytes (some sort of ad-hoc unicode use case?). >> > > If the translation table is limited to a string, the function would be > very limited. For example, it might be useful to use base-change > function to convert between an IPv4 address and integer. The common > representation of IPv4 address uses base-255 "number" > (0.0.0.0-255.255.255.255) > If the translation table is a dict then you can decode both 'A' and 'a' to 10. You could, of course, permit either a string or a dict, with a string being converted to a dict. From tjreedy at udel.edu Mon Sep 14 16:16:58 2009 From: tjreedy at udel.edu (Terry Reedy) Date: Mon, 14 Sep 2009 10:16:58 -0400 Subject: [Python-ideas] IDEA: do not alter default SIGINT handling In-Reply-To: <4AAE43D1.1030707@gmail.com> References: <1252877123.19979.43.camel@giskard> <20090913193847.1efca525@bhuda.mired.org> <4AAE43D1.1030707@gmail.com> Message-ID: Nick Coghlan wrote: > Mike Meyer wrote: >> On Sun, 13 Sep 2009 21:35:56 +0000 (UTC) >> Benjamin Peterson wrote: >> >>> Bernie Innocenti writes: >>>> If we could break the syntax of "print" statements, I'm sure we >>>> can also find a satisfactory compromise for CTRL-C handling that >>>> won't affect more than 0.1% of existing Python programs. >>> It would actually be a huge compatibility break because finally statments would >>> no longer be garunteed to execute. >> They aren't now. os._exit() skips unwinding the try/except chain >> before exiting. > > An application calling an underscore prefixed function (in the os module > no less) is significantly different to a user pressing Ctrl-C while the > application is running normally. > > If a user wants to kill it ungracefully, that's what Ctrl-Break is for. I was not aware of that. Perhaps a deficiency in my Windows knowledge. "Using Python" only mentions ctrl-d on unix and ctrl-z on windows. Perhaps more could be added. From gerald.britton at gmail.com Mon Sep 14 16:19:06 2009 From: gerald.britton at gmail.com (Gerald Britton) Date: Mon, 14 Sep 2009 10:19:06 -0400 Subject: [Python-ideas] set.add(x) slower than if x in set:set.add(x) In-Reply-To: <4AAE431B.2020404@gmail.com> References: <5d1a32000909131110k636f0d6v2413fb190e5a3c38@mail.gmail.com> <440A5B08-A86C-4F86-A03E-57C55D7B241C@rcn.com> <9d153b7c0909131956u42e1e77aufdb6fa9e16b020c5@mail.gmail.com> <4AAE431B.2020404@gmail.com> Message-ID: <5d1a32000909140719k19dfcd0clac348667feae60f5@mail.gmail.com> Interesting discussion here! I did a little analysis: let a = cost of "item in set" let b = cost of set.add let c = cost of sadd = set.add if I do x+y additions to my set where x is number of items already in the set and y is the number not in the set, using my original approach I get a cost of: a.(x+y) + b.y # we always test for set membership, but only add new items for the "if item not in set: set.add(item)" approach, or a.(x+y) + c.y, if I substitute sadd for set.add If I always add the item to the set without checking first, the cost is b.(x+y) (or c.(x+y) using sadd) assuming that set.add() takes close to the same time whether the item is in the set or not (in reality I suppose that it needs to play with a few pointers if the item is not in the set and may need to rebalance the tree, if it is a red-black tree or something similar -- what is it, actually?) That means that break-even should happen when: a.(x+y) +b.y = b.(x+y) ax + ay + by = bx + by ay = (b-a)x y = ((b-a)/a)x plugging in the numbers from earlier in this thread, we have: a = .276 uS b = .489 uS c = .298uS (b-a)/a = .772 (c-a)/a = .08 so the approach "if item not in set: set.add(item)" wins when the number of items not added to the set (thus failing the "if" statement) >= 23% of those that are added. Setting sadd = set.add beforehand increases that break-even point to about 92% (almost no advantage). Lesson learned: If you know (or can reasonably test) your sample data, you can choose the better method. If your data is unknown (random from your viewpoint), you could assume that about half the items to be added are already in the set, which is < 92%, so there is no point in doing the "if item in set" test beforehand. YMMV On Mon, Sep 14, 2009 at 9:20 AM, Nick Coghlan wrote: > Yuvgoog Greenle wrote: >> So this pattern is a valid python optimization? Funky... >> >> Sadly, there's no way around it unless the interpreter somehow did it >> magically for you. > > The interpreter has no way of knowing a priori that the branch won't be > taken 999,999 times out of 1,000,000. > > Think about what you are actually comparing here: > > Average speed of 1 million calls to s.add(x) > > Average speed of 1 million calls to "x in s" + one call to s.add(x) > > Now think about the fact that s.add(x) includes a containment test plus > function call and name lookup overhead even in the case that the item is > already present in the set. > > All overhead included: > $ python -m timeit -s "s = set()" "s.add(1)" > 1000000 loops, best of 3: 0.197 usec per loop > > Lose the attribute lookup: > $ python -m timeit -s "s = set()" -s "sadd = s.add" "sadd(1)" > 10000000 loops, best of 3: 0.146 usec per loop > > Skip the function call altogether most of the time: > $ python -m timeit -s "s = set()" "if 1 not in s: s.add(1)" > 10000000 loops, best of 3: 0.101 usec per loop > > Just do the containment test: > $ python -m timeit -s "s = set([1])" "1 not in s" > 10000000 loops, best of 3: 0.1 usec per loop > > Now then, lets also look at the absolute numbers we're discussing here > (on my machine, anyway). Is the fastest version twice as fast as the > slowest version? Yes it is. But that difference is only 97 > *nano*seconds. And relative to the recommended approach of caching the > attribute looking, we're only saving 45 nanoseconds. > > And in more realistic use cases where some items are already in the set > and some aren't, the "I'll check first" implementation can become a > pessimisation instead of an optimisation. To emphasise the point, we'll > go to the other extreme where the item is added to the set every time: > > All the overhead: > $ python -m timeit -s "s = set()" "s.add(1)" "s.clear()" > 1000000 loops, best of 3: 0.374 usec per loop > > The "optimised" approach: > $ python -m timeit -s "s = set()" "if 1 not in s: s.add(1)" "s.clear()" > 1000000 loops, best of 3: 0.444 usec per loop > > Oops, looks like the approach that saves us 45 nanoseconds when the item > is already in the set may cost us up to *70* nanoseconds when it turns > out we need to add the item after all. > > Caching attribute lookups before time critical loops is a good > optimisation technique that most experienced Python programmers learn. > Pre-checking a condition that a called function is just going to check > again and bail out quickly in less than 50% of cases? Usually a bad idea > - the extra checks made when the function is invoked anyway will often > cancel out any gains you make from avoiding the function call overhead. > > That said, as with any micro-optimisation: time it on a range of typical > and end-case data sets. If you avoid the function call overhead often > enough, doing a pre-check may be a net win for some inner loops. > > Cheers, > Nick. > > -- > Nick Coghlan ? | ? ncoghlan at gmail.com ? | ? Brisbane, Australia > --------------------------------------------------------------- > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > -- Gerald Britton From daniel at stutzbachenterprises.com Mon Sep 14 16:26:04 2009 From: daniel at stutzbachenterprises.com (Daniel Stutzbach) Date: Mon, 14 Sep 2009 09:26:04 -0500 Subject: [Python-ideas] set.add(x) slower than if x in set:set.add(x) In-Reply-To: <5d1a32000909140719k19dfcd0clac348667feae60f5@mail.gmail.com> References: <5d1a32000909131110k636f0d6v2413fb190e5a3c38@mail.gmail.com> <440A5B08-A86C-4F86-A03E-57C55D7B241C@rcn.com> <9d153b7c0909131956u42e1e77aufdb6fa9e16b020c5@mail.gmail.com> <4AAE431B.2020404@gmail.com> <5d1a32000909140719k19dfcd0clac348667feae60f5@mail.gmail.com> Message-ID: On Mon, Sep 14, 2009 at 9:19 AM, Gerald Britton wrote: > assuming that set.add() takes close to the same time whether the item > is in the set or not (in reality I suppose that it needs to play with > a few pointers if the item is not in the set and may need to rebalance > the tree, if it is a red-black tree or something similar -- what is > it, actually?) > Under the hood, the set type uses a hash table. -- Daniel Stutzbach, Ph.D. President, Stutzbach Enterprises, LLC -------------- next part -------------- An HTML attachment was scrubbed... URL: From stephen at xemacs.org Mon Sep 14 16:57:21 2009 From: stephen at xemacs.org (Stephen J. Turnbull) Date: Mon, 14 Sep 2009 23:57:21 +0900 Subject: [Python-ideas] IDEA: do not alter default SIGINT handling In-Reply-To: <4AAE43D1.1030707@gmail.com> References: <1252877123.19979.43.camel@giskard> <20090913193847.1efca525@bhuda.mired.org> <4AAE43D1.1030707@gmail.com> Message-ID: <8763blvajy.fsf@uwakimon.sk.tsukuba.ac.jp> Nick Coghlan writes: > If a user wants to kill it ungracefully, that's what Ctrl-Break is for. Discussion of how to spell SIGKILL looks like bikeshed painting to me, but the question of how to extend graceful interrupt behavior to C modules looks worthy of an idea to me. Is that too hard or too rare a need? From gerald.britton at gmail.com Mon Sep 14 17:03:15 2009 From: gerald.britton at gmail.com (Gerald Britton) Date: Mon, 14 Sep 2009 11:03:15 -0400 Subject: [Python-ideas] set.add(x) slower than if x in set:set.add(x) In-Reply-To: References: <5d1a32000909131110k636f0d6v2413fb190e5a3c38@mail.gmail.com> <440A5B08-A86C-4F86-A03E-57C55D7B241C@rcn.com> <9d153b7c0909131956u42e1e77aufdb6fa9e16b020c5@mail.gmail.com> <4AAE431B.2020404@gmail.com> <5d1a32000909140719k19dfcd0clac348667feae60f5@mail.gmail.com>