From greg.ewing at canterbury.ac.nz Sat Sep 1 02:03:06 2012 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Sat, 01 Sep 2012 12:03:06 +1200 Subject: [Python-ideas] Unpack of sequences In-Reply-To: <50400D71.7000705@pearwood.info> References: <503E1FCA.7050309@nedbatchelder.com> <503ECF2E.6060400@canterbury.ac.nz> <5040027C.2000001@canterbury.ac.nz> <50400D71.7000705@pearwood.info> Message-ID: <504150BA.7020004@canterbury.ac.nz> Steven D'Aprano wrote: > For sequences: > > a, b, c, d = really_long_sequence[:4] Sure, there are any number of ways of doing it, but they all involve some degree of convolution or DRY-violation. The most direct and obvious way is denied to us, because there is no way to tell the unpacking operation *not* to check for the end of the sequence afterwards. Because that unwanted check is forced on us, we have to perform even more useless work to get around it. That seems perverse to me. -- Greg From greg at krypto.org Sat Sep 1 02:03:54 2012 From: greg at krypto.org (Gregory P. Smith) Date: Fri, 31 Aug 2012 17:03:54 -0700 Subject: [Python-ideas] Add a "hasmethod()" builtin? In-Reply-To: <20120830042825.GA13314@ando> References: <20120830042825.GA13314@ando> Message-ID: On Wed, Aug 29, 2012 at 9:28 PM, Steven D'Aprano wrote: > On Wed, Aug 29, 2012 at 03:48:01PM -0700, Guido van Rossum wrote: > > There's a concept that's sometimes useful when explaining behavior of > > certain Python operations in terms of simpler ones, and it is "does > > the class of x define a method m?". > > It's not just methods where this is useful. For example, the help() > quasi-builtin ignores instance attribute x.__doc__ and instead uses > type(x).__doc__. > > I'm not sure that needing this is common enough to justify builtins, but > I think it would be useful to have hastypeattr and friends (get*, set* > and del*) in the operator module. > +1. 
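For concreteness, the operator-module helper proposed above could look something like this (a hypothetical sketch; hastypeattr is not an existing stdlib function, and the get*/set*/del* variants would follow the same pattern):

```python
# Hypothetical sketch of the proposed operator.hastypeattr -- not an
# existing stdlib function, just an illustration of the idea.
_sentinel = object()

def hastypeattr(obj, name):
    """True if type(obj) (or a base class) defines the attribute,
    the way implicit special-method lookup does."""
    return getattr(type(obj), name, _sentinel) is not _sentinel

class C:
    def method(self):
        pass

c = C()
c.extra = lambda: None           # instance attribute, invisible to the type
print(hastypeattr(c, "method"))  # True
print(hastypeattr(c, "extra"))   # False, even though hasattr(c, "extra") is True
```

The point of going through type(obj) is that instance attributes are skipped, matching how the interpreter looks up special methods.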
I wouldn't call it hasmethod as described either, as that name implies to most readers the much simpler "it has a callable attribute with this name" check rather than getting into the differences between an arbitrary callable attribute, function, or an actual method on the type, which most people do not need to know about (*). -gps (*) unless mocking: i ran into a test failure because a 2.6 test that stubbed out a __special__ method with a mock no longer worked in 2.7. but that test was better all around when refactored to not need to be that crazy. :) -------------- next part -------------- An HTML attachment was scrubbed... URL: From mikegraham at gmail.com Sat Sep 1 02:35:56 2012 From: mikegraham at gmail.com (Mike Graham) Date: Fri, 31 Aug 2012 20:35:56 -0400 Subject: [Python-ideas] Verbose traceback formatting In-Reply-To: References: Message-ID: On Wed, Aug 29, 2012 at 5:24 PM, Terry Reedy wrote: > Part of the problem is in the overly skimpy exception instances > themselves. They should contain the needed runtime info that one > cannot find in the code. I would rather you push for more such > changes.
> >>> ...
> >>> x = 16
> >>> y = 0
> >>> ...
> >>> ZeroDivisionError: integer division or modulo by zero
> This could and, imo, should be changed to include the numerator, > which is the main extra info included in the verbose traceback. > Most of the rest strikes me as noise. I think you have read the example too narrowly. Having the locals of each frame is _very_ useful--just because I know the terms of the division does not mean that I can easily connect that information back five calls ago when I passed the wrong thing. Currently you have to spin up the debugger (or insert print statements or similar) to get this information, which is more work than it is to read it and requires another run of your code (and getting to the same point could be expensive or tricky).
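As a rough sketch of the kind of verbose output being argued for here, a hook can already walk the traceback and dump each frame's locals (the stdlib's cgitb module does something similar; the exact format below is invented for illustration):

```python
import sys
import traceback

def print_verbose_tb(exc_type, exc_value, tb):
    # Standard traceback first, then the locals of every frame in it.
    traceback.print_exception(exc_type, exc_value, tb)
    while tb is not None:
        frame = tb.tb_frame
        print("  locals of %s:" % frame.f_code.co_name)
        for name, value in sorted(frame.f_locals.items()):
            print("    %s = %r" % (name, value))
        tb = tb.tb_next

def divide(x, y):
    return x // y

try:
    divide(16, 0)
except ZeroDivisionError:
    print_verbose_tb(*sys.exc_info())
```

With this, the ZeroDivisionError frame shows x = 16 and y = 0 directly, without spinning up a debugger.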
Mike From techtonik at gmail.com Sat Sep 1 08:27:25 2012 From: techtonik at gmail.com (anatoly techtonik) Date: Sat, 1 Sep 2012 09:27:25 +0300 Subject: [Python-ideas] itertools.chunks(iterable, size, fill=None) In-Reply-To: References: Message-ID: I've run into the necessity of implementing chunks() again. Here is the code I've made from scratch.

def chunks(seq, size):
    '''Cut sequence into chunks of given size. If `seq` length is
    not divisible by `size` without remainder, last chunk will
    have length less than size.

    >>> list( chunks([1,2,3,4,5,6,7], 3) )
    [[1, 2, 3], [4, 5, 6], [7]]
    '''
    endlen = len(seq)//size
    for i in range(endlen):
        yield [seq[i*size+n] for n in range(size)]
    if len(seq) % size:
        yield seq[endlen*size:]

-- anatoly t. On Fri, Jun 29, 2012 at 11:32 PM, Georg Brandl wrote: > On 26.06.2012 10:03, anatoly techtonik wrote: >> >> Now that Python 3 is all about iterators (which is a user killer >> feature for Python according to StackOverflow - >> http://stackoverflow.com/questions/tagged/python) would it be nice to >> introduce more first class functions to work with them? One function >> to be exact to split string into chunks. >> >> itertools.chunks(iterable, size, fill=None) >> >> Which is the 33rd most voted Python question on SO - >> >> http://stackoverflow.com/questions/312443/how-do-you-split-a-list-into-evenly-sized-chunks-in-python/312464 >> >> P.S. CC'ing to python-dev@ to notify about the thread in python-ideas. >> > > Anatoly, so far there were no negative votes -- would you care to go > another step and propose a patch?
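For sliceable sequences, the chunks() above can also be written as a single slicing loop; this is an equivalent sketch, not code from the thread:

```python
def chunks(seq, size):
    """Yield size-sized slices of seq; the last chunk may be shorter."""
    if size < 1:
        raise ValueError("chunk size must be at least 1")
    # range() handles the partial final chunk: slicing past the end is safe.
    for i in range(0, len(seq), size):
        yield seq[i:i + size]

print(list(chunks([1, 2, 3, 4, 5, 6, 7], 3)))  # [[1, 2, 3], [4, 5, 6], [7]]
```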
> > > Georg > > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas From techtonik at gmail.com Sat Sep 1 08:29:37 2012 From: techtonik at gmail.com (anatoly techtonik) Date: Sat, 1 Sep 2012 09:29:37 +0300 Subject: [Python-ideas] sequence.apply(function) Message-ID: Idea: Apply function to every element of a sequence and return new sequence. It's more pythonic than map(), because clearly works only as a list method. -- anatoly t. From steve at pearwood.info Sat Sep 1 08:56:21 2012 From: steve at pearwood.info (Steven D'Aprano) Date: Sat, 01 Sep 2012 16:56:21 +1000 Subject: [Python-ideas] sequence.apply(function) In-Reply-To: References: Message-ID: <5041B195.7070708@pearwood.info> On 01/09/12 16:29, anatoly techtonik wrote: > Idea: Apply function to every element of a sequence and return new > sequence. It's more pythonic than map(), because clearly works only as > a list method. I think you mean "less pythonic". -1 We already have map, and it works lazily on any iterable. Why do we need something less efficient and more limited? -- Steven From ned at nedbatchelder.com Sat Sep 1 14:00:21 2012 From: ned at nedbatchelder.com (Ned Batchelder) Date: Sat, 01 Sep 2012 08:00:21 -0400 Subject: [Python-ideas] sequence.apply(function) In-Reply-To: References: Message-ID: <5041F8D5.6040904@nedbatchelder.com> On 9/1/2012 2:29 AM, anatoly techtonik wrote: > Idea: Apply function to every element of a sequence and return new > sequence. It's more pythonic than map(), because clearly works only as > a list method. Python 2 has itertools.imap, and Python 3 has map, both of which do exactly what you want. --Ned. > -- > anatoly t. 
> _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > From miki.tebeka at gmail.com Sat Sep 1 16:42:19 2012 From: miki.tebeka at gmail.com (Miki Tebeka) Date: Sat, 1 Sep 2012 07:42:19 -0700 (PDT) Subject: [Python-ideas] itertools.chunks(iterable, size, fill=None) In-Reply-To: References: Message-ID: See the "grouper" example in http://docs.python.org/library/itertools.html On Friday, August 31, 2012 11:28:33 PM UTC-7, anatoly techtonik wrote: > > I've run into the necessity of implementing chunks() again. Here is > the code I've made from scratch. > > def chunks(seq, size): > '''Cut sequence into chunks of given size. If `seq` length is > not divisible by `size` without reminder, last chunk will > have length less than size. > > >>> list( chunks([1,2,3,4,5,6,7], 3) ) > [[1, 2, 3], [4, 5, 6], [7]] > ''' > endlen = len(seq)//size > for i in range(endlen): > yield [seq[i*size+n] for n in range(size)] > if len(seq) % size: > yield seq[endlen*size:] > > -- > anatoly t. > > > On Fri, Jun 29, 2012 at 11:32 PM, Georg Brandl > > wrote: > > On 26.06.2012 10:03, anatoly techtonik wrote: > >> > >> Now that Python 3 is all about iterators (which is a user killer > >> feature for Python according to StackOverflow - > >> http://stackoverflow.com/questions/tagged/python) would it be nice to > >> introduce more first class functions to work with them? One function > >> to be exact to split string into chunks. > >> > >> itertools.chunks(iterable, size, fill=None) > >> > >> Which is the 33th most voted Python question on SO - > >> > >> > http://stackoverflow.com/questions/312443/how-do-you-split-a-list-into-evenly-sized-chunks-in-python/312464 > >> > >> P.S. CC'ing to python-dev@ to notify about the thread in python-ideas. > >> > > > > Anatoly, so far there were no negative votes -- would you care to go > > another step and propose a patch? 
> > > Georg > > _______________________________________________ > > Python-ideas mailing list > > Python... at python.org > > http://mail.python.org/mailman/listinfo/python-ideas > _______________________________________________ > Python-ideas mailing list > Python... at python.org > http://mail.python.org/mailman/listinfo/python-ideas > -------------- next part -------------- An HTML attachment was scrubbed... URL: From python at mrabarnett.plus.com Sat Sep 1 18:39:50 2012 From: python at mrabarnett.plus.com (MRAB) Date: Sat, 01 Sep 2012 17:39:50 +0100 Subject: [Python-ideas] itertools.chunks(iterable, size, fill=None) In-Reply-To: References: Message-ID: <50423A56.5070903@mrabarnett.plus.com> On 01/09/2012 07:27, anatoly techtonik wrote: > I've run into the necessity of implementing chunks() again. Here is > the code I've made from scratch.
>
> def chunks(seq, size):
>     '''Cut sequence into chunks of given size. If `seq` length is
>     not divisible by `size` without remainder, last chunk will
>     have length less than size.
>
>     >>> list( chunks([1,2,3,4,5,6,7], 3) )
>     [[1, 2, 3], [4, 5, 6], [7]]
>     '''
>     endlen = len(seq)//size
>     for i in range(endlen):
>         yield [seq[i*size+n] for n in range(size)]
>     if len(seq) % size:
>         yield seq[endlen*size:]
>
Here's a lazy version:

def chunks(seq, size):
    '''Cut sequence into chunks of given size. If `seq` length is
    not divisible by `size` without remainder, last chunk will
    have length less than size.

    >>> list( chunks([1,2,3,4,5,6,7], 3) )
    [[1, 2, 3], [4, 5, 6], [7]]
    '''
    if size < 1:
        raise ValueError("chunk size less than 1")
    it = iter(seq)
    try:
        while True:
            chunk = []
            for _ in range(size):
                chunk.append(next(it))
            yield chunk
    except StopIteration:
        if chunk:
            yield chunk

From guido at python.org Sat Sep 1 19:06:48 2012 From: guido at python.org (Guido van Rossum) Date: Sat, 1 Sep 2012 19:06:48 +0200 Subject: [Python-ideas] sequence.apply(function) In-Reply-To: References: Message-ID: On Sat, Sep 1, 2012 at 8:29 AM, anatoly techtonik wrote: > Idea: Apply function to every element of a sequence and return new > sequence. It's more pythonic than map(), because clearly works only as > a list method. It's less Pythonic, because every sequence-like type (not just list) would have to reimplement it. Similar things get proposed for iterators (e.g. it1 + it2, it[:n], it[n:]) regularly and they are (and should be) rejected for the same reason. -- --Guido van Rossum (python.org/~guido) From michelelacchia at gmail.com Sat Sep 1 19:16:36 2012 From: michelelacchia at gmail.com (Michele Lacchia) Date: Sat, 1 Sep 2012 19:16:36 +0200 Subject: [Python-ideas] itertools.chunks(iterable, size, fill=None) In-Reply-To: <50423A56.5070903@mrabarnett.plus.com> References: <50423A56.5070903@mrabarnett.plus.com> Message-ID: + 1 for the lazy version. Why not using itertools.islice instead of the innermost for loop? -------------- next part -------------- An HTML attachment was scrubbed... URL: From python at mrabarnett.plus.com Sat Sep 1 21:02:20 2012 From: python at mrabarnett.plus.com (MRAB) Date: Sat, 01 Sep 2012 20:02:20 +0100 Subject: [Python-ideas] itertools.chunks(iterable, size, fill=None) In-Reply-To: References: <50423A56.5070903@mrabarnett.plus.com> Message-ID: <50425BBC.8060409@mrabarnett.plus.com> On 01/09/2012 18:16, Michele Lacchia wrote: > + 1 for the lazy version. Why not using itertools.islice instead of the > innermost for loop?
>
OK, here's a lazy version using islice:

from itertools import islice

def chunks(seq, size):
    '''Cut sequence into chunks of given size. If `seq` length is
    not divisible by `size` without remainder, last chunk will
    have length less than size.

    >>> list( chunks([1,2,3,4,5,6,7], 3) )
    [[1, 2, 3], [4, 5, 6], [7]]
    '''
    if size < 1:
        raise ValueError("chunk size less than 1")
    it = iter(seq)
    while True:
        chunk = list(islice(it, 0, size))
        if not chunk:
            break
        yield chunk

From ubershmekel at gmail.com Sat Sep 1 23:55:39 2012 From: ubershmekel at gmail.com (Yuval Greenfield) Date: Sun, 2 Sep 2012 00:55:39 +0300 Subject: [Python-ideas] sequence.apply(function) In-Reply-To: References: Message-ID: On Sat, Sep 1, 2012 at 8:06 PM, Guido van Rossum wrote: > It's less Pythonic, because every sequence-like type (not just list) > would have to reimplement it. > > Similar things get proposed for iterators (e.g. it1 + it2, it[:n], > it[n:]) regularly and they are (and should be) rejected for the same > reason. > > Python causes some confusion because some things are methods and others builtins. Is there a PEP or rationale that defines what goes where? Yuval Greenfield -------------- next part -------------- An HTML attachment was scrubbed... URL: From solipsis at pitrou.net Sun Sep 2 00:02:11 2012 From: solipsis at pitrou.net (Antoine Pitrou) Date: Sun, 2 Sep 2012 00:02:11 +0200 Subject: [Python-ideas] sequence.apply(function) References: Message-ID: <20120902000211.1ae5ae0b@pitrou.net> On Sun, 2 Sep 2012 00:55:39 +0300 Yuval Greenfield wrote: > On Sat, Sep 1, 2012 at 8:06 PM, Guido van Rossum wrote: > > > It's less Pythonic, because every sequence-like type (not just list) > > would have to reimplement it. > > > > Similar things get proposed for iterators (e.g. it1 + it2, it[:n], > > it[n:]) regularly and they are (and should be) rejected for the same > > reason. > > > > > Python causes some confusion because some things are methods and others > > builtins.
Is there a PEP or rationale that defines what goes where? When something only applies to a single type or a couple of types, it is a method. When it is generic enough, it is a builtin. Of course there are grey areas but that's the basic idea. Regards Antoine. -- Software development and contracting: http://pro.pitrou.net From ncoghlan at gmail.com Sun Sep 2 04:14:50 2012 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sun, 2 Sep 2012 12:14:50 +1000 Subject: [Python-ideas] sequence.apply(function) In-Reply-To: <20120902000211.1ae5ae0b@pitrou.net> References: <20120902000211.1ae5ae0b@pitrou.net> Message-ID: On Sun, Sep 2, 2012 at 8:02 AM, Antoine Pitrou wrote: > On Sun, 2 Sep 2012 00:55:39 +0300 > Yuval Greenfield > wrote: >> On Sat, Sep 1, 2012 at 8:06 PM, Guido van Rossum wrote: >> >> > It's less Pythonic, because every sequence-like type (not just list) >> > would have to reimplement it. >> > >> > Similar things get proposed for iterators (e.g. it1 + it2, it[:n], >> > it[n:]) regularly and they are (and should be) rejected for the same >> > reason. >> > >> > >> Python causes some confusion because some things are methods and others >> builtins. Is there a PEP or rationale that defines what goes where? > > When something only applies to a single type or a couple of types, it is > a method. When it is generic enough, it is a builtin. > Of course there are grey areas but that's the basic idea. Yes, it comes down to the fact that we are *very* reluctant to impose required base classes (I believe the only ones currently enforced anywhere are object, BaseException and str - everything else should fall back to a protocol method, ABC or interface specific registration mechanism. Most interfaces that used to require actual integer objects are now using operator.index, or one of its C API equivalents). 
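That last point is easy to demonstrate with operator.index: any object whose type implements __index__ is accepted where an index is required, with no integer base class involved (PageNumber is an invented example type):

```python
import operator

class PageNumber:
    # A hypothetical non-int type: usable anywhere an index is required,
    # because its type implements __index__ -- no integer base class needed.
    def __init__(self, value):
        self.value = value
    def __index__(self):
        return self.value

p = PageNumber(2)
print(operator.index(p))           # 2
print(["a", "b", "c", "d"][p])     # c
print(list(range(PageNumber(3))))  # [0, 1, 2]
```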
In Python, we also actively discourage "reopening" classes to add new methods (this is mostly a cultural thing, though - the language doesn't actually contain any mechanism to stop you by default, although it's possible to add such enforcement via metaclasses) Thus, protocols are born which define "has this behaviour", rather than "is one of these". That's why we have the len() builtin and associated __len__() protocol to say "taking the length of this object is a meaningful operation" rather than mandatory inheritance from a Container class that has a ".len()" method. They're most obviously beneficial when there are *multiple* protocols that can be used to implement a particular behaviour. For example, with iter(), the __iter__ protocol is only the first option tried. If that fails, then it will instead check for __getitem__ and if that exists, return a standard sequence iterator instead. Similarly, reversed() checks for __reversed__ first, and then checks for __len__ and __getitem__, producing a reverse sequence iterator in the latter case. Similarly, next() was moved from a standard method to a builtin function in 3.x? Why? Mainly to add the "if not found, return this default value" behaviour. That kind of thing is much easier to add when the object is only handling a piece of the behaviour, with additional standard mechanisms around it (in this case, optionally returning a default value when StopIteration is thrown by the iterator). Generators are another good illustration of the principle: For iter() and next(), they follow the standard protocol and rely on the corresponding builtins. However, g.send() and g.throw() require deep integration with the interpreter's eval loop. There's currently no way to implement either of those behaviours as an ordinary type, thus they're exposed as ordinary methods, since they're genuinely generator specific. As to *why* this is a good thing: procedural APIs encourage low coupling. 
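The iter() fallback and the next() default described above can both be seen with a class that defines only __getitem__ (Squares is an invented example):

```python
class Squares:
    # No __iter__ defined: iter() falls back to the old sequence protocol,
    # calling __getitem__ with 0, 1, 2, ... until IndexError is raised.
    def __init__(self, n):
        self.n = n
    def __getitem__(self, i):
        if not 0 <= i < self.n:
            raise IndexError(i)
        return i * i

print(list(Squares(4)))                 # [0, 1, 4, 9]
print(next(iter(Squares(0)), "empty"))  # empty
```

The second line shows next()'s builtin-level default: the IndexError from the exhausted sequence iterator becomes StopIteration, which next() swallows in favour of the default.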
Yes, object oriented programming is a good way to scale an application architecture up to more complicated problems. The issue is with fetishising OOP to the point where you disallow the creation of procedural APIs that hide the OOP details. That approach sets a minimum floor to the complexity of your implementations, as even if you don't *need* the power of OOP, you're forced to deal with it because the language doesn't offer anything else, and that way lies Java. There's a reason Java is significantly more popular on large enterprise projects than it is in small teams - it takes a certain, rather high, level of complexity for the reasons behind any of that boilerplate to start to become clear :) Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From ubershmekel at gmail.com Sun Sep 2 10:50:07 2012 From: ubershmekel at gmail.com (Yuval Greenfield) Date: Sun, 2 Sep 2012 11:50:07 +0300 Subject: [Python-ideas] sequence.apply(function) In-Reply-To: References: <20120902000211.1ae5ae0b@pitrou.net> Message-ID: On Sun, Sep 2, 2012 at 5:14 AM, Nick Coghlan wrote: > On Sun, Sep 2, 2012 at 8:02 AM, Antoine Pitrou > wrote: > > On Sun, 2 Sep 2012 00:55:39 +0300 > > Yuval Greenfield > > wrote: > >> On Sat, Sep 1, 2012 at 8:06 PM, Guido van Rossum > wrote: > >> > >> > It's less Pythonic, because every sequence-like type (not just list) > >> > would have to reimplement it. > >> > > >> > Similar things get proposed for iterators (e.g. it1 + it2, it[:n], > >> > it[n:]) regularly and they are (and should be) rejected for the same > >> > reason. > >> > > >> > > >> Python causes some confusion because some things are methods and others > >> builtins. Is there a PEP or rationale that defines what goes where? > > > > When something only applies to a single type or a couple of types, it is > > a method. When it is generic enough, it is a builtin. > > Of course there are grey areas but that's the basic idea. 
> > Yes, it comes down to the fact that we are *very* reluctant to impose > required base classes (I believe the only ones currently enforced > anywhere are object, BaseException and str - everything else should > fall back to a protocol method, ABC or interface specific registration > mechanism. Most interfaces that used to require actual integer objects > are now using operator.index, or one of its C API equivalents). > > In Python, we also actively discourage "reopening" classes to add new > methods (this is mostly a cultural thing, though - the language > doesn't actually contain any mechanism to stop you by default, > although it's possible to add such enforcement via metaclasses) > > Thus, protocols are born which define "has this behaviour", rather > than "is one of these". That's why we have the len() builtin and > associated __len__() protocol to say "taking the length of this object > is a meaningful operation" rather than mandatory inheritance from a > Container class that has a ".len()" method. > > They're most obviously beneficial when there are *multiple* protocols > that can be used to implement a particular behaviour. For example, > with iter(), the __iter__ protocol is only the first option tried. If > that fails, then it will instead check for __getitem__ and if that > exists, return a standard sequence iterator instead. Similarly, > reversed() checks for __reversed__ first, and then checks for __len__ > and __getitem__, producing a reverse sequence iterator in the latter > case. > > Similarly, next() was moved from a standard method to a builtin > function in 3.x? Why? Mainly to add the "if not found, return this > default value" behaviour. That kind of thing is much easier to add > when the object is only handling a piece of the behaviour, with > additional standard mechanisms around it (in this case, optionally > returning a default value when StopIteration is thrown by the > iterator). 
> > Generators are another good illustration of the principle: For iter() > and next(), they follow the standard protocol and rely on the > corresponding builtins. However, g.send() and g.throw() require deep > integration with the interpreter's eval loop. There's currently no way > to implement either of those behaviours as an ordinary type, thus > they're exposed as ordinary methods, since they're genuinely generator > specific. > > As to *why* this is a good thing: procedural APIs encourage low > coupling. Yes, object oriented programming is a good way to scale an > application architecture up to more complicated problems. The issue is > with fetishising OOP to the point where you disallow the creation of > procedural APIs that hide the OOP details. That approach sets a > minimum floor to the complexity of your implementations, as even if > you don't *need* the power of OOP, you're forced to deal with it > because the language doesn't offer anything else, and that way lies > Java. There's a reason Java is significantly more popular on large > enterprise projects than it is in small teams - it takes a certain, > rather high, level of complexity for the reasons behind any of that > boilerplate to start to become clear :) > > Cheers, > Nick. > > Thanks, that's some interesting reasoning. Maybe I'm old fashioned but I like running dir(x) to find out what an object can do, and the wall of double underscores is hard to read. Perhaps we could add to the inspect module a "dirprotocols" function which returns a list of builtins that can be used on an object. I see that the builtins are listed in e.g. help([]) but on user defined classes it might be less obvious. Maybe we could just add a dictionary: inspect.special_methods = {'__len__': len, '__getitem__': 'x.__getitem__(y) <==> x[y]', '__iter__': iter, ... } and then dirprotocols would be easy to implement. Yuval -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From techtonik at gmail.com Sun Sep 2 13:36:59 2012 From: techtonik at gmail.com (anatoly techtonik) Date: Sun, 2 Sep 2012 14:36:59 +0300 Subject: [Python-ideas] itertools.chunks(iterable, size, fill=None) In-Reply-To: References: Message-ID: On Sat, Sep 1, 2012 at 5:42 PM, Miki Tebeka wrote: > See the "grouper" example in http://docs.python.org/library/itertools.html As was discussed before, the problem is visibility of the solution, not the implementation. If we can divide core Python API into levels where 0 is the less important and 10 is more, then `chunks` should be level above than it is now. -- anatoly t. From stephen at xemacs.org Mon Sep 3 11:01:44 2012 From: stephen at xemacs.org (Stephen J. Turnbull) Date: Mon, 03 Sep 2012 18:01:44 +0900 Subject: [Python-ideas] itertools.chunks(iterable, size, fill=None) In-Reply-To: References: Message-ID: <87k3wbtr7r.fsf@uwakimon.sk.tsukuba.ac.jp> anatoly techtonik writes: > On Sat, Sep 1, 2012 at 5:42 PM, Miki Tebeka wrote: > > See the "grouper" example in http://docs.python.org/library/itertools.html > > As was discussed before, the problem is visibility of the solution, Well, no, it's apparently not. You should be well aware of the solution since you were one of the most ardent posters in this thread the last time it came up.[1] Yet you say "I had to *re*implement chunks". IOW, the implementations which you were already aware of were inappropriate for your situation. That suggests that no, there are no generic solutions suitable for the stdlib yet, and you personally aren't convinced that any of the implementations belong in your own private library, either. You really need to get over those humps before you have a case for a "higher-visibility" placement of any particular implementation. Footnotes: [1] And I think you cross-posted that time, too, but that's another issue. In any case, please stop cross-posting. Pick one or the other. 
(IMHO, this discussion belongs here on -ideas (or maybe on python-list), not on python-dev. Or submit an issue and a patch and discuss it there.) From alexander.belopolsky at gmail.com Mon Sep 3 17:21:50 2012 From: alexander.belopolsky at gmail.com (Alexander Belopolsky) Date: Mon, 3 Sep 2012 11:21:50 -0400 Subject: [Python-ideas] Memoryview tolist() method is misleading Message-ID: Consider this:

>>> memoryview(b'x').cast('B', ()).tolist()
120

The return value of tolist() is an int, not a list. I suggest to deprecate memoryview.tolist() and .tobytes() methods (soft deprecation - in documentation only) and recommend using list(m) and bytes(m) instead. For the multidimensional (and 0-dimensional) views, I suggest adding an unpack([depth]) method that would unpack a view into a nested list of tuples or subviews. For example a single-byte scalar should unpack as follows:

>>> m = memoryview(b'x').cast('B', ())
>>> m.unpack()
(120,)

consistent with

>>> struct.unpack_from(m.format, m)
(120,)

From solipsis at pitrou.net Mon Sep 3 18:35:08 2012 From: solipsis at pitrou.net (Antoine Pitrou) Date: Mon, 3 Sep 2012 18:35:08 +0200 Subject: [Python-ideas] 0-dimensional memoryviews? References: Message-ID: <20120903183508.585c0e46@pitrou.net> On Mon, 3 Sep 2012 11:21:50 -0400 Alexander Belopolsky wrote: > Consider this:
>
> >>> memoryview(b'x').cast('B', ()).tolist()
> 120
>
> The return value of tolist() is an int, not a list. > > I suggest to deprecate memoryview.tolist() and .tobytes() methods > (soft deprecation - in documentation only) and recommend using list(m) > and bytes(m) instead. > > For the multidimensional (and 0-dimensional) views, I suggest adding > an unpack([depth]) method that would unpack a view into a nested list > of tuples or subviews. Is there any point in 0-dimensional views? Wouldn't it be clearer if we simply mandated a strictly positive number of dimensions? Regards Antoine.
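The behaviour being discussed is easy to reproduce; this just restates the example from the thread:

```python
m = memoryview(b'x').cast('B', ())  # a 0-dimensional "scalar" view
print(m.ndim)      # 0
print(m.shape)     # ()
print(m.tolist())  # 120 -- an int, despite the method's name
```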
-- Software development and contracting: http://pro.pitrou.net From alexander.belopolsky at gmail.com Mon Sep 3 18:55:32 2012 From: alexander.belopolsky at gmail.com (Alexander Belopolsky) Date: Mon, 3 Sep 2012 12:55:32 -0400 Subject: [Python-ideas] 0-dimensional memoryviews? In-Reply-To: <20120903183508.585c0e46@pitrou.net> References: <20120903183508.585c0e46@pitrou.net> Message-ID: On Mon, Sep 3, 2012 at 12:35 PM, Antoine Pitrou wrote: > Is there any point in 0-dimensional views? Wouldn't it be clearer if we > simply mandated a strictly positive number of dimensions? 0-d arrays (scalars) are very important in numpy and it took a significant effort to get the semantics right. I would argue that in nonnumeric contexts, 0-d case is even more important than 1d. For example, users of ctypes are rarely exposed to arrays, but often see nontrivial scalars. From Athelopuce at gmx.fr Thu Sep 6 14:30:04 2012 From: Athelopuce at gmx.fr (Alexandre Bosc) Date: Thu, 06 Sep 2012 14:30:04 +0200 Subject: [Python-ideas] Add annotations to global statement Message-ID: <5048974C.6080901@gmx.fr> I think the annotations of parameters and return value of a function, a useful practice for the user of the function. As a function can modify or create global variables, and as it's important for the end user, I would appreciate to add annotations in the global statement. An annotation syntax similar to that of parameters could be employed : global var : expression global var1 : expression1, var2 : expression2,... 
cheers, Alex (geoscience modeler) From rob.cliffe at btinternet.com Thu Sep 6 15:17:03 2012 From: rob.cliffe at btinternet.com (Rob Cliffe) Date: Thu, 06 Sep 2012 14:17:03 +0100 Subject: [Python-ideas] Add annotations to global statement In-Reply-To: <5048974C.6080901@gmx.fr> References: <5048974C.6080901@gmx.fr> Message-ID: <5048A24F.9090101@btinternet.com> On 06/09/2012 13:30, Alexandre Bosc wrote: > I think the annotations of parameters and return value of a function, > a useful practice for the user of the function. > As a function can modify or create global variables, and as it's > important for the end user, I would appreciate to add annotations in > the global statement. > > An annotation syntax similar to that of parameters could be employed : > > global var : expression > global var1 : expression1, var2 : expression2,... > > > cheers, > > Alex (geoscience modeler) > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > > Just to clarify: Are you proposing that global var : expression is equivalent to global var var = expression ? If so, wouldn't the syntax global var = expression be more readable? Rob Cliffe From mikegraham at gmail.com Thu Sep 6 15:26:16 2012 From: mikegraham at gmail.com (Mike Graham) Date: Thu, 6 Sep 2012 09:26:16 -0400 Subject: [Python-ideas] Add annotations to global statement In-Reply-To: <5048A24F.9090101@btinternet.com> References: <5048974C.6080901@gmx.fr> <5048A24F.9090101@btinternet.com> Message-ID: On Thu, Sep 6, 2012 at 8:30 AM, Alexandre Bosc wrote: > I think the annotations of parameters and return value of a function, a > useful practice for the user of the function. > As a function can modify or create global variables, and as it's important > for the end user, I would appreciate to add annotations in the global > statement. 
> An annotation syntax similar to that of parameters could be employed : > > global var : expression > global var1 : expression1, var2 : expression2,... > > > cheers, > > Alex (geoscience modeler) I cannot think of a case where having this feature would be of real help making a well-designed Python program. On Thu, Sep 6, 2012 at 9:17 AM, Rob Cliffe wrote: > Just to clarify: > > Are you proposing that > global var : expression > is equivalent to > global var > var = expression > ? > > If so, wouldn't the syntax > global var = expression > be more readable? > Rob Cliffe I believe the suggestion is that the expression becomes metadata associated with the name var. Are you familiar with function annotations, with which the parallel was drawn? http://www.python.org/dev/peps/pep-3107/ Mike From Athelopuce at gmx.fr Thu Sep 6 15:29:57 2012 From: Athelopuce at gmx.fr (Alexandre Bosc) Date: Thu, 06 Sep 2012 15:29:57 +0200 Subject: [Python-ideas] Add annotations to global statement In-Reply-To: <5048A24F.9090101@btinternet.com> References: <5048974C.6080901@gmx.fr> <5048A24F.9090101@btinternet.com> Message-ID: <5048A555.6080707@gmx.fr> Le 06. 09. 12 15:17, Rob Cliffe a écrit : > > On 06/09/2012 13:30, Alexandre Bosc wrote: >> I think the annotations of parameters and return value of a function, >> a useful practice for the user of the function. >> As a function can modify or create global variables, and as it's >> important for the end user, I would appreciate to add annotations >> in the global statement. >> >> An annotation syntax similar to that of parameters could be employed : >> >> global var : expression >> global var1 : expression1, var2 : expression2,...
>> >> >> cheers, >> >> Alex (geoscience modeler) >> _______________________________________________ >> Python-ideas mailing list >> Python-ideas at python.org >> http://mail.python.org/mailman/listinfo/python-ideas >> >> > Just to clarify: > > Are you proposing that > global var : expression > is equivalent to > global var > var = expression > ? > > If so, wouldn't the syntax > global var = expression > be more readable? > Rob Cliffe > > > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas Hi Rob, No what I propose is : def f(...): global var: expression .... is equivalent to def f(...): ... f.__annotations__['var']=expression is it clear ? Alex, From mikegraham at gmail.com Fri Sep 7 21:51:21 2012 From: mikegraham at gmail.com (Mike Graham) Date: Fri, 7 Sep 2012 15:51:21 -0400 Subject: [Python-ideas] reprs of recursive datastructures. Message-ID: With the Python 3 loosening of where ... can occur, this somewhat suboptimal behaviour occurs >>> x = [] >>> x.append(x) >>> x [[...]] >>> eval(repr(x)) [[Ellipsis]] Is this something that can be improved? Is it something worth improving? Mike From tjreedy at udel.edu Fri Sep 7 23:57:31 2012 From: tjreedy at udel.edu (Terry Reedy) Date: Fri, 07 Sep 2012 17:57:31 -0400 Subject: [Python-ideas] reprs of recursive datastructures. In-Reply-To: References: Message-ID: On 9/7/2012 3:51 PM, Mike Graham wrote: > With the Python 3 loosening of where ... can occur, this somewhat > suboptimal behaviour occurs > >>>> x = [] >>>> x.append(x) >>>> x > [[...]] >>>> eval(repr(x)) > [[Ellipsis]] I believe ... was used for representations before it became Ellipsis literal. In any case, the representation is now ambiguous. It is not possible to reliably invert a many-to-one function. > Is this something that can be improved? Change the recursive substitution so there is no ambiguity. 
For instance, use the unicode ellipsis character instead of '...'. Since the output is unicode and may contain non-ascii chars anyway, that might be considered. >>> '\u2026' '…' >>> [[...]] [[Ellipsis]] >>> [[…]] SyntaxError: invalid character in identifier If not that, pick anything else giving a syntax error. >>> [[,,,]] SyntaxError: invalid syntax >>> [[. . .]] SyntaxError: invalid syntax > Is it something worth improving? I think so. Ambiguity is bad, and the substituted representation is something of a fib, so it should not mimic something that is valid. eval(representation of recursive structure) should either correctly evaluate by re-creating the recursive structure represented* or it should raise an error. * That would mean that the same expression should be valid in code also. An interesting idea, and a deep can of worms. I believe it would require that '. . .' or whatever be recognizable syntax but not a named object, as the latter would re-introduce the same ambiguity. -- Terry Jan Reedy From tjreedy at udel.edu Sat Sep 8 00:07:37 2012 From: tjreedy at udel.edu (Terry Reedy) Date: Fri, 07 Sep 2012 18:07:37 -0400 Subject: [Python-ideas] reprs of recursive datastructures. In-Reply-To: References: Message-ID: On 9/7/2012 5:57 PM, Terry Reedy wrote: > On 9/7/2012 3:51 PM, Mike Graham wrote: >> With the Python 3 loosening of where ... can occur, this somewhat >> suboptimal behaviour occurs >> >>>>> x = [] >>>>> x.append(x) >>>>> x >> [[...]] >>>>> eval(repr(x)) >> [[Ellipsis]] > > I believe ... was used for representations before it became Ellipsis > literal. In any case, the representation is now ambiguous. It is not > possible to reliably invert a many-to-one function. > >> Is this something that can be improved? > > Change the recursive substitution so there is no ambiguity. For > instance, use the unicode ellipsis character instead of '...'. Since the > output is unicode and may contain non-ascii chars anyway, that might be > considered.
> >>> '\u2026' > '…' > >>> [[...]] > [[Ellipsis]] > >>> [[…]] > SyntaxError: invalid character in identifier > > If not that, pick anything else giving a syntax error. > >>> [[,,,]] > SyntaxError: invalid syntax > >>> [[. . .]] > SyntaxError: invalid syntax Or probably the simplest, just use 4 periods >>> [[....]] SyntaxError: invalid syntax > >> Is it something worth improving? > > I think so. Ambiguity is bad, and the substituted representation is > something of a fib, so it should not mimic something that is valid. > eval(representation of recursive structure) should either correctly > evaluate by re-creating the recursive structure represented* or it > should raise an error. > > * That would mean that the same expression should be valid in code also. > An interesting idea, and a deep can of worms. I believe it would require > that '. . .' or whatever be recognizable syntax but not a named object, > as the latter would re-introduce the same ambiguity. > -- Terry Jan Reedy From alexander.belopolsky at gmail.com Sat Sep 8 04:55:36 2012 From: alexander.belopolsky at gmail.com (Alexander Belopolsky) Date: Fri, 7 Sep 2012 22:55:36 -0400 Subject: [Python-ideas] reprs of recursive datastructures. In-Reply-To: References: Message-ID: On Fri, Sep 7, 2012 at 6:07 PM, Terry Reedy wrote: > Or probably the simplest, just use 4 periods >>>> [[....]] or two: [[..]] From guido at python.org Sat Sep 8 08:27:41 2012 From: guido at python.org (Guido van Rossum) Date: Sat, 8 Sep 2012 08:27:41 +0200 Subject: [Python-ideas] reprs of recursive datastructures. In-Reply-To: References: Message-ID: Can someone explain what problem we are trying to solve? I fail to understand what's wrong with the current behavior... -- Sent from Gmail Mobile -------------- next part -------------- An HTML attachment was scrubbed...
URL: From tjreedy at udel.edu Sat Sep 8 09:23:14 2012 From: tjreedy at udel.edu (Terry Reedy) Date: Sat, 08 Sep 2012 03:23:14 -0400 Subject: [Python-ideas] reprs of recursive datastructures. In-Reply-To: References: Message-ID: On 9/8/2012 2:27 AM, Guido van Rossum wrote: > Can someone explain what problem we are trying to solve? I fail to > understand what's wrong with the current behavior... Pairs of different things have the same representation, making the representation ambiguous to both people and the interpreter. Moreover, the interpreter's guess is usually wrong. In particular, the representations of recursive lists use what is now the Ellipsis literal '...', so they are also valid list displays for a non-recursive nested list containing Ellipsis. The interpreter always reads ... as the Ellipsis literal, which is nearly always not what is meant. It would be trivial to tweak the representations of recursive lists so they are not valid list displays. -- Terry Jan Reedy From steve at pearwood.info Sat Sep 8 09:45:54 2012 From: steve at pearwood.info (Steven D'Aprano) Date: Sat, 08 Sep 2012 17:45:54 +1000 Subject: [Python-ideas] reprs of recursive datastructures. In-Reply-To: References: Message-ID: <504AF7B2.7090703@pearwood.info> On 08/09/12 16:27, Guido van Rossum wrote: > Can someone explain what problem we are trying to solve? I fail to > understand what's wrong with the current behavior... I believe that some people think that if you eval the repr of a recursive list, the result should be an equally recursive list. But it isn't: py> x = [1, 2, 3] py> x.append(x) py> eval(repr(x)) == x False I think they are misguided in their expectation. There is no way to write a single expression using list literals which generates a recursive list, so why would you expect eval to produce one? Furthermore, list reprs of recursive lists have been ambiguous for years.
This code works identically in 2.4 and 3.2: py> a = []; a.append(a) py> b = []; b.append(b) py> x = [[], []]; x[0].append(x); x[1].append(x) py> y = [a, b] py> x == y False py> repr(x) == repr(y) True eval(repr(x)) == x is not a guaranteed invariant, it is a "nice to have". -1 on trying to fix this. -- Steven From steve at pearwood.info Sat Sep 8 10:06:41 2012 From: steve at pearwood.info (Steven D'Aprano) Date: Sat, 08 Sep 2012 18:06:41 +1000 Subject: [Python-ideas] reprs of recursive datastructures. In-Reply-To: References: Message-ID: <504AFC91.4050707@pearwood.info> On 08/09/12 17:23, Terry Reedy wrote: > In particular, the representations of recursive lists use what is >now the Ellipsis literal '...', so they are also valid list displays >for a non-recursive nested list containing Ellipsis. The interpreter >always reads ... as the Ellipsis literal, which is nearly always >not what is meant. I'm not sure that you are right to assume that recursive lists are more common than lists containing Ellipsis. Neither are exactly common, and at least a few people use Ellipsis as a ready-made sentinel value that isn't None. > It would be trivial to tweak the representations of recursive lists >so they are not valid list displays. Ah, I had not realised that you wanted eval(repr(x)) to fail if x was recursive. That's more reasonable than expecting it to generate x. Changing the repr of recursive lists will break doctests. And frankly, my aesthetic sense would be hurt if the repr of a recursive list used something other than ... for the part not displayed. An ellipsis is the right symbol to use when skipping part of the display, and an ellipsis is three dots, not two or four. A unicode … would be acceptable, except I understand that builtins must be ASCII. I don't think this is genuinely enough of a problem that it needs fixing.
-- Steven From ncoghlan at gmail.com Sat Sep 8 10:16:23 2012 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sat, 8 Sep 2012 18:16:23 +1000 Subject: [Python-ideas] reprs of recursive datastructures. In-Reply-To: <504AF7B2.7090703@pearwood.info> References: <504AF7B2.7090703@pearwood.info> Message-ID: On Sat, Sep 8, 2012 at 5:45 PM, Steven D'Aprano wrote: > On 08/09/12 16:27, Guido van Rossum wrote: >> >> Can someone explain what problem we are trying to solve? I fail to >> understand what's wrong with the current behavior... > > > > I believe that some people think that if you eval the repr of a > recursive list, the result should be an equally recursive list. No, the problem is that you get the *wrong answer* instead of an exception. Python 2: >>> x = [] >>> x.append(x) >>> x [[...]] >>> eval(repr(x)) Traceback (most recent call last): File "<stdin>", line 1, in <module> File "<string>", line 1 [[...]] ^ SyntaxError: invalid syntax Python 3: >>> x = [] >>> x.append(x) >>> x [[...]] >>> eval(repr(x)) [[Ellipsis]] As pointed out earlier, this is due to the fact that the previously illegal notation used to indicate the recursive reference is now valid syntax. The simplest fix is to just introduce alternative notation for the self-reference that will reintroduce the desired syntax error, such as "<...>" or "". Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From guido at python.org Sat Sep 8 11:49:29 2012 From: guido at python.org (Guido van Rossum) Date: Sat, 8 Sep 2012 11:49:29 +0200 Subject: [Python-ideas] reprs of recursive datastructures. In-Reply-To: <504AF11A.80700@udel.edu> References: <504AF11A.80700@udel.edu> Message-ID: On Sat, Sep 8, 2012 at 9:17 AM, Terry Reedy wrote: > On 9/8/2012 2:27 AM, Guido van Rossum wrote: >> Can someone explain what problem we are trying to solve? I fail to >> understand what's wrong with the current behavior...
> Pairs of different things have the same representation, making the > representation ambiguous to both people and the interpreter. Well yeah, when designing a repr() we usually have to compromise. E.g. if you render a class instance it often shows the class name but not the module name (e.g. decimal.Decimal.) > Moreover, the interpreter's guess is usually wrong. The requirement that the interpreter can evaluate a repr() and return a similar value is pretty weak, and I'm not sure that in this case the fact that copying the output back into the interpreter returns an object of a different shape matters much to anyone. A subtler but similar bug appears with lists containing multiple references to the same sublist, e.g. >>> a = [1, 2] >>> b = [a, a] >>> b [[1, 2], [1, 2]] >>> b[0].append(3) >>> b [[1, 2, 3], [1, 2, 3]] >>> x = [[1, 2], [1, 2]] >>> x[0].append(3) >>> x [[1, 2, 3], [1, 2]] >>> I don't think we should attempt to fix this particular one -- first of all, the analysis would be tricky (there could be a user-defined object involved) and second of all, I can't think of a solution that still produces a valid expression (except perhaps a very ugly one). > In particular, the representations of recursive lists use what is now the > Ellipsis literal '...', so they are also valid list displays for a > non-recursive nested list containing Ellipsis. The interpreter always reads > ... as the Ellipsis literal, which is nearly always not what is meant. But when does it ever matter? > It would be trivial to tweak the representations of recursive lists so they > are not valid list displays. To what purpose? I still don't understand what the actual use case is where you think that will produce a better experience for the user. -- --Guido van Rossum (python.org/~guido) From ncoghlan at gmail.com Sat Sep 8 12:02:04 2012 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sat, 8 Sep 2012 20:02:04 +1000 Subject: [Python-ideas] reprs of recursive datastructures.
In-Reply-To: References: <504AF11A.80700@udel.edu> Message-ID: On Sat, Sep 8, 2012 at 7:49 PM, Guido van Rossum wrote: > To what purpose? I still don't understand what the actual use case is > where you think that will produce a better experience for the user. The thing I don't like is that the current display flat out lies about the sequence contents - it displays a terminal constant ("..."), rather than a clear marker that a recursive loop was detected. The case of multiple references to a different list is not the same, as then the repr() at least still accurately reflects what you would get when iterating over the data structure. So, my perspective is if attempting to naively flatten the list would create an infinite loop, then I want evaluating the representation to throw a syntax error the way it did in Python 2. Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From tjreedy at udel.edu Sat Sep 8 21:02:23 2012 From: tjreedy at udel.edu (Terry Reedy) Date: Sat, 08 Sep 2012 15:02:23 -0400 Subject: [Python-ideas] reprs of recursive datastructures. In-Reply-To: References: <504AF11A.80700@udel.edu> Message-ID: On 9/8/2012 6:02 AM, Nick Coghlan wrote: > On Sat, Sep 8, 2012 at 7:49 PM, Guido van Rossum wrote: >> To what purpose? I still don't understand what the actual use case is >> where you think that will produce a better experience for the user. > > The thing I don't like is that the current display flat out lies about > the sequence contents - it displays a terminal constant ("..."), > rather than a clear marker that a recursive loop was detected. > > The case of multiple references to a different list is not the same, > as then the repr() at least still accurately reflects what you would > get when iterating over the data structure. > > So, my perspective is if attempting to naively flatten the list would > create an infinite loop, then I want evaluating the representation to > throw a syntax error the way it did in Python 2. 
This expresses what I was trying to say better than I did. When '...' was chosen for recursive structures, it made the result not-legal-code, as it should be. The 3.0 incorporation of '...' as legal syntax created, in a sense, a reversion. So that suggests revising the recursion marker. That said, there is the issue of doctests, so I would only change in 3.4. -- Terry Jan Reedy From alexander.belopolsky at gmail.com Sat Sep 8 21:53:23 2012 From: alexander.belopolsky at gmail.com (Alexander Belopolsky) Date: Sat, 8 Sep 2012 15:53:23 -0400 Subject: [Python-ideas] reprs of recursive datastructures. In-Reply-To: References: <504AF11A.80700@udel.edu> Message-ID: On Sep 8, 2012, at 3:02 PM, Terry Reedy wrote: > > That said, there is the issue of doctests, so I would only change in 3.4. Note that in doctest displays, ellipsis has yet another meaning. I agree that this is 3.4 material, and the solution should probably be something in <>. From guido at python.org Sat Sep 8 21:59:11 2012 From: guido at python.org (Guido van Rossum) Date: Sat, 8 Sep 2012 21:59:11 +0200 Subject: [Python-ideas] reprs of recursive datastructures. In-Reply-To: References: <504AF11A.80700@udel.edu> Message-ID: Ok, I can live with <...>. On Saturday, September 8, 2012, Alexander Belopolsky wrote: > > > On Sep 8, 2012, at 3:02 PM, Terry Reedy > > wrote: > > > > > That said, there is the issue of doctests, so I would only change in 3.4. > > Note that in doctest displays, ellipsis has yet another meaning. I agree > that this is 3.4 material, and the solution should probably be something in > <>. > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > -- Sent from Gmail Mobile -------------- next part -------------- An HTML attachment was scrubbed...
URL: From python at mrabarnett.plus.com Sat Sep 8 23:26:26 2012 From: python at mrabarnett.plus.com (MRAB) Date: Sat, 08 Sep 2012 22:26:26 +0100 Subject: [Python-ideas] reprs of recursive datastructures. In-Reply-To: References: <504AF11A.80700@udel.edu> Message-ID: <504BB802.5050603@mrabarnett.plus.com> On 08/09/2012 20:59, Guido van Rossum wrote: > On Saturday, September 8, 2012, Alexander Belopolsky wrote: > > > > On Sep 8, 2012, at 3:02 PM, Terry Reedy > wrote: > > > > > That said, there is the issue of doctests, so I would only change > in 3.4. > > Note that in doctest displays, ellipsis has yet another meaning. I > agree that this is 3.4 material, and the solution should probably be > something in <>. > > Ok, I can live with <...>. > Should we also guarantee that it'll never have any other meaning in Python so that this will never happen again? From alexander.belopolsky at gmail.com Sun Sep 9 00:06:14 2012 From: alexander.belopolsky at gmail.com (Alexander Belopolsky) Date: Sat, 8 Sep 2012 18:06:14 -0400 Subject: [Python-ideas] reprs of recursive datastructures. In-Reply-To: <504BB802.5050603@mrabarnett.plus.com> References: <504AF11A.80700@udel.edu> <504BB802.5050603@mrabarnett.plus.com> Message-ID: On Sat, Sep 8, 2012 at 5:26 PM, MRAB wrote: >> >> Note that in doctest displays, ellipsis has yet another meaning. I >> agree that this is 3.4 material, and the solution should probably be >> something in <>. >> >> Ok, I can live with <...>. >> > Should we also guarantee that it'll never have any other meaning in > Python so that this will never happen again? I was probably not very clear about the problem of having ellipsis appear as expected output in doctests. The problem is that '...' has a special meaning for doctests: """ When specified, an ellipsis marker (...) in the expected output can match any substring in the actual output. ... 
""" http://docs.python.org/py3k/library/doctest.html#doctest.ELLIPSIS This means that <...> will match any angle bracketed repr. Note that lists are not the only types affected by this issue. Dicts, for example, have the same problem: >>> d = {} >>> d[0] = d >>> d {0: {...}} >>> eval('{0: {...}}') {0: {Ellipsis}} It is possible the other mutable container types are similarly affected. It looks like this problem requires some more though. If we ever decide to allow non-ASCII characters in repr, my vote for repr of recursive list will be >>> "[[\N{ANTICLOCKWISE GAPPED CIRCLE ARROW}]]" '[[?]]' >>> "\N{WHITE SMILING FACE}" '?' From python at mrabarnett.plus.com Sun Sep 9 00:27:24 2012 From: python at mrabarnett.plus.com (MRAB) Date: Sat, 08 Sep 2012 23:27:24 +0100 Subject: [Python-ideas] reprs of recursive datastructures. In-Reply-To: References: <504AF11A.80700@udel.edu> <504BB802.5050603@mrabarnett.plus.com> Message-ID: <504BC64C.1050700@mrabarnett.plus.com> On 08/09/2012 23:06, Alexander Belopolsky wrote: > On Sat, Sep 8, 2012 at 5:26 PM, MRAB wrote: >>> >>> Note that in doctest displays, ellipsis has yet another meaning. I >>> agree that this is 3.4 material, and the solution should probably be >>> something in <>. >>> >>> Ok, I can live with <...>. >>> >> Should we also guarantee that it'll never have any other meaning in >> Python so that this will never happen again? > > I was probably not very clear about the problem of having ellipsis > appear as expected output in doctests. The problem is that '...' has > a special meaning for doctests: > > """ > When specified, an ellipsis marker (...) in the expected output can > match any substring in the actual output. ... > """ http://docs.python.org/py3k/library/doctest.html#doctest.ELLIPSIS > > This means that <...> will match any angle bracketed repr. Note that > lists are not the only types affected by this issue. 
Dicts, for > example, have the same problem: > >>>> d = {} >>>> d[0] = d >>>> d > {0: {...}} >>>> eval('{0: {...}}') > {0: {Ellipsis}} > > It is possible the other mutable container types are similarly > affected. It looks like this problem requires some more thought. > > If we ever decide to allow non-ASCII characters in repr, my vote for > repr of recursive list will be > >>>> "[[\N{ANTICLOCKWISE GAPPED CIRCLE ARROW}]]" > '[[⟲]]' > Or: >>> "[[\N{CLOCKWISE GAPPED CIRCLE ARROW}]]" '[[⟳]]' > >>>> "\N{WHITE SMILING FACE}" > '☺' > From cs at zip.com.au Sun Sep 9 01:20:20 2012 From: cs at zip.com.au (Cameron Simpson) Date: Sun, 9 Sep 2012 09:20:20 +1000 Subject: [Python-ideas] reprs of recursive datastructures. In-Reply-To: <504BC64C.1050700@mrabarnett.plus.com> References: <504BC64C.1050700@mrabarnett.plus.com> Message-ID: <20120908232020.GA2900@cskk.homeip.net> On 08Sep2012 23:27, MRAB wrote: | On 08/09/2012 23:06, Alexander Belopolsky wrote: | > If we ever decide to allow non-ASCII characters in repr, my vote for | > repr of recursive list will be | > | >>>> "[[\N{ANTICLOCKWISE GAPPED CIRCLE ARROW}]]" | > '[[⟲]]' | > | Or: | | >>> "[[\N{CLOCKWISE GAPPED CIRCLE ARROW}]]" | '[[⟳]]' [...] These are sublime! Personally I find the former one more intuitively expressive of a recursion, probably because the arrow points "left" (in my current font, anyway; how variable is this?) and therefore towards the stuff already recited. The latter arrow seems to point "right" or "forwards", not so recursive to my intuition. | >>>> "\N{WHITE SMILING FACE}" | > '☺' Cute but a -1 from me; less intuitive meaning. Cheers, -- Cameron Simpson To understand recursion, you must first understand recursion.
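For user-defined containers, something close to what the thread asks for already exists in the standard library: the reprlib.recursive_repr decorator (Python 3.2+) substitutes a configurable fill value when __repr__ recurses into itself. A minimal sketch — the MyList class and the '<...>' fill value are illustrative choices, not anything adopted by the thread:

```python
# Sketch: marking self-references with a non-evaluable token via the
# stdlib reprlib.recursive_repr decorator (available since Python 3.2).
import reprlib

class MyList(list):
    @reprlib.recursive_repr(fillvalue='<...>')
    def __repr__(self):
        # The decorator returns '<...>' for any item whose repr is
        # already in progress, i.e. a recursive self-reference.
        return '[' + ', '.join(repr(item) for item in self) + ']'

x = MyList()
x.append(x)
print(repr(x))  # [<...>]

# Unlike the builtin list repr, this round-trip fails loudly:
# eval('[<...>]') raises SyntaxError instead of yielding [Ellipsis].
```

This only changes user-defined classes; the builtin list repr discussed above is implemented in C and is not affected by the decorator.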
From dholth at gmail.com Sun Sep 9 05:41:43 2012 From: dholth at gmail.com (Daniel Holth) Date: Sat, 8 Sep 2012 23:41:43 -0400 Subject: [Python-ideas] bdist naming scheme (compatibility tags) PEP In-Reply-To: References: <50236FD5.2060404@egenix.com> Message-ID: Re: http://www.python.org/dev/peps/pep-0425/ "Compatibility tags for built distributions" Progress towards a proper set of rules for generating the tags a Python implementation is likely to support. This system of being willing to install older built distributions is intended to solve the frustrating problem with eggs that you would have to build a new egg for each Python release, even for a pure-Python egg that probably runs fine on a newer Python. In order of preference the tags are: - built for the current implementation and its preferred ABI and architecture - for the current implementation and tagged with just the major version number (explicitly tagged as cross-version compatible) - for the current implementation, and any of the lesser minor revisions (cp26..cp20) - for the current language version (py27) - for the current language major version (py2) - for any of the current language minor versions (py26..py20) Importantly "py2" means "expected to work across minor releases" and is not shorthand for "py20". Practically it means the packager overrode the default tag. For PyPy, I think "pp19" for the current version makes more sense than "pp27" since they add important runtime features without changing the version of the Python language they support (like stackless emulation). I don't know how their versions will work when PyPy for Python 3 is released. Other Python implementations seem to follow the CPython version numbers more closely. For PyPy it may be appropriate to cross major versions when going back to generate the list of older packages one is willing to install. 
For CPython it is a bit overkill to go all the way back to Python 2.0; the "all the way back to the last major revision" rule is really for the 2 - 3 split. List of supported or "willing to install" tags for CPython 3.2 (an mu build): [('cp32', 'cp32mu', 'linux_x86_64'), ('cp3', 'none', 'any'), ('cp31', 'none', 'any'), ('cp30', 'none', 'any'), ('py32', 'none', 'any'), ('py3', 'none', 'any'), ('py31', 'none', 'any'), ('py30', 'none', 'any')] For CPython 2.7: [('cp27', 'none', 'linux_x86_64'), ('cp2', 'none', 'any'), ('cp26', 'none', 'any'), ('cp25', 'none', 'any'), ('cp24', 'none', 'any'), ('cp23', 'none', 'any'), ('cp22', 'none', 'any'), ('cp21', 'none', 'any'), ('cp20', 'none', 'any'), ('py27', 'none', 'any'), ('py2', 'none', 'any'), ('py26', 'none', 'any'), ('py25', 'none', 'any'), ('py24', 'none', 'any'), ('py23', 'none', 'any'), ('py22', 'none', 'any'), ('py21', 'none', 'any'), ('py20', 'none', 'any')] From raymond.hettinger at gmail.com Sun Sep 9 07:04:34 2012 From: raymond.hettinger at gmail.com (Raymond Hettinger) Date: Sun, 9 Sep 2012 01:04:34 -0400 Subject: [Python-ideas] reprs of recursive datastructures. In-Reply-To: References: <504AF11A.80700@udel.edu> Message-ID: On Sep 8, 2012, at 6:02 AM, Nick Coghlan wrote: > The thing I don't like is that the current display flat out lies about > the sequence contents - it displays a terminal constant ("..."), > rather than a clear marker that a recursive loop was detected. It's more of an equivocation than a flat-out lie ;-) It is an equivocation because "..." is legitimately used for multiple purposes (in English text for "and so on ...", in long established use in Python to denote recursive reprs, in doctest as a placeholder for elided result text, and in its newest role as the Ellipsis terminal constant). It seems to me that the first three roles are well-established and are reasonably consistent with one another. Further, each of those roles serves an important task.
In contrast, the new role as a terminal constant for an Ellipsis singleton is brand-new, not very important, and doesn't even have a clear semantic role (what is it "supposed" to mean?). Changing the first three uses just so it won't conflict with the last seems like the tail wagging the dog. I agree with Steven that this isn't a problem worth solving. As Alexander pointed out, the ... punctuation can be used in two distinct ways inside doctests (as part of expected output or as a placeholder for elided content). A consequence is that there won't be a reliable automated way to convert existing doctests for a new notation for recursive reprs. ISTM that changes which break tests are worse than other changes because the process of upgrading from one Python version to the next is so heavily dependent on getting existing tests to pass. The tests are your safety net during upgrades -- breaking them makes upgrading less palatable. Raymond -------------- next part -------------- An HTML attachment was scrubbed... URL: From ncoghlan at gmail.com Sun Sep 9 14:16:24 2012 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sun, 9 Sep 2012 22:16:24 +1000 Subject: [Python-ideas] bdist naming scheme (compatibility tags) PEP In-Reply-To: References: <50236FD5.2060404@egenix.com> Message-ID: On Sun, Sep 9, 2012 at 1:41 PM, Daniel Holth wrote: > Re: http://www.python.org/dev/peps/pep-0425/ "Compatibility tags for > built distributions" > > Progress towards a proper set of rules for generating the tags a > Python implementation is likely to support. > > This system of being willing to install older built distributions is > intended to solve the frustrating problem with eggs that you would > have to build a new egg for each Python release, even for a > pure-Python egg that probably runs fine on a newer Python. Yep, those rules look sensible to me (and thanks for clarifying the intended semantics of the "py2" and "py3" version markers) Cheers, Nick.
-- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From simon.sapin at kozea.fr Mon Sep 10 11:21:49 2012 From: simon.sapin at kozea.fr (Simon Sapin) Date: Mon, 10 Sep 2012 11:21:49 +0200 Subject: [Python-ideas] issue15824 In-Reply-To: References: Message-ID: <504DB12D.4060506@kozea.fr> Le 30/08/2012 21:51, Serhiy Storchaka a écrit : > On 30.08.12 22:03, Ben Toews wrote: >> I think this is really clunky. I don't see any reason why we should be >> using a type that doesn't support item assignment and needs to be >> casted to another type in order to make changes. > > Mutable urlparse result is backward incompatible. For now this result > can be used as dict key. Note that namedtuples *are* immutable, but they have a _replace method that returns a new namedtuple. This method could also be extended to handle the properties that are computed dynamically (username, password, host, port). Regards, -- Simon Sapin From masklinn at masklinn.net Thu Sep 13 15:15:03 2012 From: masklinn at masklinn.net (Masklinn) Date: Thu, 13 Sep 2012 15:15:03 +0200 Subject: [Python-ideas] Optional kwarg making attrgetter & itemgetter always return a tuple Message-ID: attrgetter and itemgetter are both very useful functions, but both have a significant pitfall if the arguments passed in are validated but not controlled: if receiving the arguments (list of attributes, keys or indexes) from an external source and *-applying it, if the external source passes a sequence of one element both functions will in turn return an element rather than a singleton (1-element tuple). This means such code, for instance code "slicing" a matrix of some sort to get only some columns and getting the slicing information from its caller (in situation where extracting a single column may be perfectly sensible) will have to implement a manual dispatch between a "manual" getitem (or getattr) and an itemgetter (resp. attrgetter) call, e.g.
slicer = (operator.itemgetter(*indices) if len(indices) > 1 else lambda ar: [ar[indices[0]]]) This makes for more verbose and less straightforward code. I think it would be useful in such situations if attrgetter and itemgetter could be forced into always returning a tuple by way of an optional argument: # works the same no matter what len(indices) is slicer = operator.itemgetter(*indices, force_tuple=True) which in the example equivalences[0] would be an override (to False) of the `len` check (`len(items) == 1` would become `len(items) == 1 and not force_tuple`) The argument is backward-compatible as neither function currently accepts any keyword argument. Uncertainty note: whether force_tuple (or whatever its name is) silences the error generated when len(indices) == 0, and returns a null tuple rather than raising a TypeError. [0] http://docs.python.org/dev/library/operator.html#operator.attrgetter From tjreedy at udel.edu Thu Sep 13 21:11:22 2012 From: tjreedy at udel.edu (Terry Reedy) Date: Thu, 13 Sep 2012 15:11:22 -0400 Subject: [Python-ideas] Optional kwarg making attrgetter & itemgetter always return a tuple In-Reply-To: References: Message-ID: On 9/13/2012 9:15 AM, Masklinn wrote: > attrgetter and itemgetter are both very useful functions, but both have > a significant pitfall if the arguments passed in are validated but not > controlled: if receiving the arguments (list of attributes, keys or > indexes) from an external source and *-applying it, if the external > source passes a sequence of one element both functions will in turn > return an element rather than a singleton (1-element tuple). > > This means such code, for instance code "slicing" a matrix of some sort > to get only some columns and getting the slicing information from its > caller (in situation where extracting a single column may be perfectly > sensible) will have to implement a manual dispatch between a "manual" > getitem (or getattr) and an itemgetter (resp. attrgetter) call, e.g.
>
> slicer = (operator.itemgetter(*indices) if len(indices) > 1
>           else lambda ar: [ar[indices[0]]])
>
> This makes for more verbose and less straightforward code, I think it
> would be useful to such situations if attrgetter and itemgetter could be
> forced into always returning a tuple by way of an optional argument:
>
>     # works the same no matter what len(indices) is
>     slicer = operator.itemgetter(*indices, force_tuple=True)
>
> which in the example equivalences[0] would be an override (to False) of
> the `len` check (`len(items) == 1` would become `len(items) == 1 and not
> force_tuple`)
>
> The argument is backward-compatible as neither function currently
> accepts any keyword argument.
>
> Uncertainty note: whether force_tuple (or whatever its name is)
> silences the error generated when len(indices) == 0, and returns
> a null tuple rather than raising a TypeError.
>
> [0] http://docs.python.org/dev/library/operator.html#operator.attrgetter

This seems like a plausible idea. The actual C version requires one
argument. The Python equivalent in the doc does not (hence the different
signature), as it would return an empty tuple for empty *items.

-- Terry Jan Reedy

From poalman at gmail.com  Thu Sep 13 23:05:22 2012
From: poalman at gmail.com (Paul Wiseman)
Date: Thu, 13 Sep 2012 22:05:22 +0100
Subject: [Python-ideas] syntax to continue into the next subsequent except block
Message-ID: 

I think it would be useful if there was a way to skip into the next
except block, perhaps with continue as I think it's currently always
illegal to use in an except block. I don't believe there's currently a
way to do this.

This is my reasoning, often there's multiple reasons for exceptions that
raise the same exception, as an example an IOError might get raised for
lots of different reasons. If you want to handle one or several of these
reasons, you have to catch all exceptions of this type, but there's not
really a way to "put back" the exception if it isn't the type you were
after.
For instance:

    try:
        operation()
    except IOError as err:
        if err.errno == 2:
            do_something()
        else:
            continue  # This would continue the except down to the next check, except Exception
    except Exception as err:
        logger.error("Error performing operation: {}".format(err.message))
        some_clean_up()
        raise

The current alternatives to get this behaviour I don't believe are as
nice, but maybe I'm missing something.

This works, but clearly not as nice with nested try excepts:

    try:
        try:
            operation()
        except IOError as err:
            if err.errno == 2:
                do_something()
            else:
                raise
    except Exception as err:
        logger.error("Error performing operation: {}".format(err.message))
        some_clean_up()
        raise

This is clearly a not very good and un-dry solution:

    try:
        operation()
    except IOError as err:
        if err.errno == 2:
            do_something()
        else:
            logger.error("Error performing operation: {}".format(err.message))
            some_clean_up()
            raise
    except Exception as err:
        logger.error("Error performing operation: {}".format(err.message))
        some_clean_up()
        raise

There's the option of using a context manager, but personally I don't
think it's as explicit or as obvious as a try except block, but maybe
others would disagree:

    class SpecificCaseErrorHandler(object):
        def __enter__(self):
            pass

        def __exit__(self, exc_type, exc_value, tb):
            if exc_type is not None:
                if exc_type is IOError and exc_value.errno == 2:
                    do_something()
                    return True
                logger.error("Error performing operation: {}".format(exc_value.message))
                some_clean_up()

    with SpecificCaseErrorHandler():
        operation()

-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From scott+python-ideas at scottdial.com  Fri Sep 14 00:28:56 2012
From: scott+python-ideas at scottdial.com (Scott Dial)
Date: Thu, 13 Sep 2012 18:28:56 -0400
Subject: [Python-ideas] syntax to continue into the next subsequent except block
In-Reply-To: References: Message-ID: <50525E28.2040409@scottdial.com>

On 9/13/2012 5:05 PM, Paul Wiseman wrote:
> I think it would be useful if there was a way to skip into the next
> except block, perhaps with continue as I think it's currently always
> illegal to use in an except block. I don't believe there's currently a
> way to do this.
>
> This is my reasoning, often there's multiple reasons for exceptions that
> raise the same exception, as an example an IOError might get raised for
> lots of different reasons. If you want to handle one or several of these
> reasons, you have to catch all exceptions of this type, but there's not
> really a way to "put back" the exception if it isn't the type you were
> after. For instance
>
> try:
>     operation()
> except IOError as err:
>     if err.errno == 2:
>         do_something()
>     else:
>         continue  # This would continue the except down to the next check, except Exception
> except Exception as err:
>     logger.error("Error performing operation: {}".format(err.message))
>     some_clean_up()
>     raise

"continue" already has a meaning that would make this ambiguous:

    for i in range(10):
        try:
            raise IOError()
        except IOError as err:
            continue

Also, I would inevitably write what you want as:

    try:
        operation()
    except Exception as err:
        if isinstance(err, IOError) and err.errno == 2:
            do_something()
        else:
            logger.error(
                "Error performing operation: {}".format(err.message))
            some_clean_up()
            raise

-- Scott Dial
scott at scottdial.com

From tjreedy at udel.edu  Fri Sep 14 01:46:05 2012
From: tjreedy at udel.edu (Terry Reedy)
Date: Thu, 13 Sep 2012 19:46:05 -0400
Subject: [Python-ideas] syntax to continue into the next subsequent except block
In-Reply-To: References: Message-ID: 

On 9/13/2012 5:05 PM, Paul Wiseman
wrote:
> I think it would be useful if there was a way to skip into the next
> except block, perhaps with continue as I think it's currently always
> illegal to use in an except block. I don't believe there's currently a
> way to do this.
>
> This is my reasoning, often there's multiple reasons for exceptions that
> raise the same exception, as an example an IOError might get raised for
> lots of different reasons. If you want to handle one or several of these
> reasons, you have to catch all exceptions of this type, but there's not
> really a way to "put back" the exception if it isn't the type you were
> after. For instance
>
> try:
>     operation()
> except IOError as err:
>     if err.errno == 2:
>         do_something()
>     else:
>         continue  # This would continue the except down to the next check, except Exception
> except Exception as err:
>     logger.error("Error performing operation: {}".format(err.message))
>     some_clean_up()
>     raise
>
> The current alternatives to get this behaviour I don't believe are as
> nice, but maybe I'm missing something

As you already know, raise puts the exception back, in a sense:

    try:
        try:
            operation()
        except IOError as err:
            if err.errno == 2:
                do_something()
            else:
                raise
    except Exception as err:
        logger.error("Error performing operation: {}".format(err.message))
        some_clean_up()
        raise

or probably better:

    try:
        operation()
    except Exception as err:
        if isinstance(err, IOError) and err.errno == 2:
            do_something()
        else:
            logger.error("Error performing operation: {}".format(err.message))
            some_clean_up()
            raise

-- Terry Jan Reedy

From steve at pearwood.info  Fri Sep 14 03:20:38 2012
From: steve at pearwood.info (Steven D'Aprano)
Date: Fri, 14 Sep 2012 11:20:38 +1000
Subject: [Python-ideas] Optional kwarg making attrgetter & itemgetter always return a tuple
In-Reply-To: References: Message-ID: <50528666.7090400@pearwood.info>

On 13/09/12 23:15, Masklinn wrote:
> attrgetter and itemgetter are both very useful functions, but both have
> a significant pitfall if the
> arguments passed in are validated but not
> controlled: if receiving the arguments (list of attributes, keys or
> indexes) from an external source and *-applying it, if the external
> source passes a sequence of one element both functions will in turn
> return an element rather than a singleton (1-element tuple).

For those who, like me, had to read this three or four times to work out
what Masklinn is talking about, I think he is referring to the fact that
attrgetter and itemgetter both return a single element if passed a
single index, otherwise they return a tuple of results. If a call
itemgetter(*args)(some_list) returns a tuple, was that tuple a single
element (and args contained a single index) or was the tuple a
collection of individual elements (and args contained multiple indexes)?

    py> itemgetter(*[1])(['a', ('b', 'c'), 'd'])
    ('b', 'c')
    py> itemgetter(*[1, 2])(['a', 'b', 'c', 'd'])
    ('b', 'c')

> This means such code, for instance code "slicing" a matrix of some sort
> to get only some columns and getting the slicing information from its
> caller (in situation where extracting a single column may be perfectly
> sensible) will have to implement a manual dispatch between a "manual"
> getitem (or getattr) and an itemgetter (resp. attrgetter) call, e.g.
>
> slicer = (operator.itemgetter(*indices) if len(indices) > 1
>           else lambda ar: [ar[indices[0]]])

Why is this a problem? If you don't like writing this out in place, write
it once in a helper function. Not every short code snippet needs to be in
the standard library.

> This makes for more verbose and less straightforward code, I think it
> would be useful to such situations if attrgetter and itemgetter could be
> forced into always returning a tuple by way of an optional argument:

-1

There is no need to add extra complexity to itemgetter and attrgetter for
something best solved in your code.
Write a helper:

    def slicer(*indexes):
        getter = itemgetter(*indexes)
        if len(indexes) == 1:
            return lambda seq: (getter(seq), )  # Wrap in a tuple.
        return getter

-- Steven

From stephen at xemacs.org  Fri Sep 14 03:52:39 2012
From: stephen at xemacs.org (Stephen J. Turnbull)
Date: Fri, 14 Sep 2012 10:52:39 +0900
Subject: [Python-ideas] syntax to continue into the next subsequent except block
In-Reply-To: References: Message-ID: <87k3vxqsk8.fsf@uwakimon.sk.tsukuba.ac.jp>

Terry Reedy writes:

> try:
>     try:

Ugh-ugh.<0.5 wink>

> try:
>     operation()
> except Exception as err:
>     if isinstance(err, IOError) and err.errno == 2:

Ugh.<0.5 wink>

Not your fault, but these constructions are pretty ugly IMO, I have to
go with the OP on that.

ISTR there were discussions of "qualified except" clauses here maybe
6mo to 1yr ago? That is, they'd look something like

    try:
        operation()
    except IOError as err if err.errno == 2:
        do_something()
    except Exception:
        logger.error("Error performing operation: {}".format(err.message))
        some_clean_up()
        raise

Again ISTR that this got spiked for some reason, but maybe it will be
of use to the OP in formulating his next idea. Sorry for the lack of
precise reference.

From ncoghlan at gmail.com  Fri Sep 14 04:32:04 2012
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Fri, 14 Sep 2012 12:32:04 +1000
Subject: [Python-ideas] syntax to continue into the next subsequent except block
In-Reply-To: <87k3vxqsk8.fsf@uwakimon.sk.tsukuba.ac.jp>
References: <87k3vxqsk8.fsf@uwakimon.sk.tsukuba.ac.jp> Message-ID: 

On Fri, Sep 14, 2012 at 11:52 AM, Stephen J. Turnbull wrote:
> ISTR there were discussions of "qualified except" clauses here maybe
> 6mo to 1yr ago?
> That is, they'd look something like
>
>     try:
>         operation()
>     except IOError as err if err.errno == 2:
>         do_something()
>     except Exception:
>         logger.error("Error performing operation: {}".format(err.message))
>         some_clean_up()
>         raise
>
> Again ISTR that this got spiked for some reason, but maybe it will be
> of use to the OP in formulating his next idea. Sorry for the lack of
> precise reference.

They were one of the ideas discussed when Antoine was writing PEP 3151.
As I recall, nobody could think of any good use cases that didn't
involve errno checking, and PEP 3151 provides a far more elegant (and
cross-platform) solution to most problems that require errno checking
in versions prior to 3.3.

Cheers, Nick.
-- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia

From greg.ewing at canterbury.ac.nz  Fri Sep 14 02:26:59 2012
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Fri, 14 Sep 2012 12:26:59 +1200
Subject: [Python-ideas] syntax to continue into the next subsequent except block
In-Reply-To: References: Message-ID: <505279D3.9070709@canterbury.ac.nz>

Paul Wiseman wrote:

> try:
>     operation()
> except IOError as err:
>     if err.errno == 2:
>         do_something()
>     else:
>         continue  # This would continue the except down to the next check,

I think a better way to address this would be to allow guard
expressions on the except clauses.

    try:
        operation()
    except IOError as err if err.errno == 2:
        ...
-- Greg From masklinn at masklinn.net Fri Sep 14 09:43:38 2012 From: masklinn at masklinn.net (Masklinn) Date: Fri, 14 Sep 2012 09:43:38 +0200 Subject: [Python-ideas] Optional kwarg making attrgetter & itemgetter always return a tuple In-Reply-To: <50528666.7090400@pearwood.info> References: <50528666.7090400@pearwood.info> Message-ID: <61323E14-01CE-4A1E-9533-292B093F51BC@masklinn.net> On 2012-09-14, at 03:20 , Steven D'Aprano wrote: >> This means such code, for instance code "slicing" a matrix of some sort >> to get only some columns and getting the slicing information from its >> caller (in situation where extracting a single column may be perfectly >> sensible) will have to implement a manual dispatch between a "manual" >> getitem (or getattr) and an itemgetter (resp. attrgetter) call, e.g. >> >> slicer = (operator.itemgetter(*indices) if len(indices)> 1 >> else lambda ar: [ar[indices[0]]) > > > Why is this a problem? Because it adds significant complexity to the code, and that's for the trivial version of itemgetter, attrgetter also does keypath resolution so the code is nowhere near this simple. It's also anything but obvious what this snippet does on its own. > If you don't like writing this out in place, write > it once in a helper function. Not every short code snippet needs to be in > the standard library. It's not really "every short code snippet" in this case, it's a way to avoid a sometimes deleterious special case and irregularity of the stdlib. >> This makes for more verbose and less straightforward code, I think it >> would be useful to such situations if attrgetter and itemgetter could be >> forced into always returning a tuple by way of an optional argument: > > -1 > > There is no need to add extra complexity to itemgetter and attrgetter for > something best solved in your code. 
I don't agree with this statement, the stdlib flag adds very little extra complexity, way less than the original irregularity/special case and way less than necessary to do it outside the stdlib. Furthermore, it makes the solution (to having a regular output behavior for (attr|item)getter) far more obvious and makes the code itself much simpler to read. From steve at pearwood.info Fri Sep 14 11:02:54 2012 From: steve at pearwood.info (Steven D'Aprano) Date: Fri, 14 Sep 2012 19:02:54 +1000 Subject: [Python-ideas] Optional kwarg making attrgetter & itemgetter always return a tuple In-Reply-To: <61323E14-01CE-4A1E-9533-292B093F51BC@masklinn.net> References: <50528666.7090400@pearwood.info> <61323E14-01CE-4A1E-9533-292B093F51BC@masklinn.net> Message-ID: <5052F2BE.5070507@pearwood.info> On 14/09/12 17:43, Masklinn wrote: > On 2012-09-14, at 03:20 , Steven D'Aprano wrote: >>> This means such code, for instance code "slicing" a matrix of some sort >>> to get only some columns and getting the slicing information from its >>> caller (in situation where extracting a single column may be perfectly >>> sensible) will have to implement a manual dispatch between a "manual" >>> getitem (or getattr) and an itemgetter (resp. attrgetter) call, e.g. >>> >>> slicer = (operator.itemgetter(*indices) if len(indices)> 1 >>> else lambda ar: [ar[indices[0]]) >> >> >> Why is this a problem? > > Because it adds significant complexity to the code, I don't consider that to be *significant* complexity. > and that's for the > trivial version of itemgetter, attrgetter also does keypath resolution > so the code is nowhere near this simple. I don't understand what you mean by "keypath resolution". attrgetter simply looks up the attribute(s) by name, just like obj.name would do. It has the same API as itemgetter, except with attribute names instead of item indexes. > It's also anything but obvious what this snippet does on its own. 
Once you get past the ternary if operator, the complexity is pretty much entirely in the call to itemgetter. You don't even use itemgetter in the else clause! Beyond the call to itemgetter, it's trivially simple Python code. slicer = operator.itemgetter(*indices, force_tuple=flag) is equally mysterious to anyone who doesn't know what itemgetter does. >> If you don't like writing this out in place, write >> it once in a helper function. Not every short code snippet needs to be in >> the standard library. > > It's not really "every short code snippet" in this case, it's a way to > avoid a sometimes deleterious special case and irregularity of the stdlib. I disagree that this is a "sometimes deleterious special case". itemgetter and attrgetter have two APIs: itemgetter(index)(L) => element itemgetter(index, index, ...)(L) => tuple of elements and likewise for attrgetter: attrgetter(name)(L) => attribute attrgetter(name, name, ...)(L) => tuple of attributes Perhaps it would have been better if there were four functions rather than two. Or if the second API were: itemgetter(sequence_of_indexes)(L) => tuple of elements attrgetter(sequence_of_names)(L) => tuple of attributes so that the two getters always took a single argument, and dispatched on whether that argument is an atomic value or a sequence. But either way, it is not what I consider a "special case" so much as two related non- special cases. But let's not argue about definitions. Special case or not, can you demonstrate that the situation is not only deleterious, but cannot be reasonably fixed with a helper function? Whenever you call itemgetter, there is no ambiguity because you always know whether you are calling it with a single index or multiple indexes. 
>>> This makes for more verbose and less straightforward code, I think it >>> would be useful to such situations if attrgetter and itemgetter could be >>> forced into always returning a tuple by way of an optional argument: >> >> -1 >> >> There is no need to add extra complexity to itemgetter and attrgetter for >> something best solved in your code. > > I don't agree with this statement, the stdlib flag adds very little > extra complexity, way less than the original irregularity/special case Whether or not it is empirically less than the complexity already there in itemgetter, it would still be adding extra complexity. It simply isn't possible to end up with *less* complexity by *adding* features. (Complexity is not always a bad thing. If we wanted to program in something simple, we would program using a Turing machine.) The reader now has to consider "what does the force_tuple argument do?" which is not necessarily trivial nor obvious. I expect a certain number of beginners who don't read documentation will assume that you have to do this: slicer = itemgetter(1, 2, 3, force_tuple=False) if they want to pass something other than a tuple to slicer. Don't imagine that adding an additional argument will make itemgetter and attrgetter *simpler* to understand. To me, a major red-flag for your suggested API can be seen here: itemgetter(1, 2, 3, 4, force_tuple=False) What should this do? I consider all the alternatives to be less than ideal: - ignore the explicit keyword argument and return a tuple anyway - raise an exception To say nothing of more... imaginative... semantics: - return a list, or a set, anything but a tuple - return a single element instead of four (but which one?) The suggested API is not as straight-forward as you seem to think it is. > and way less than necessary to do it outside the stdlib. 
Furthermore, it > makes the solution (to having a regular output behavior for > (attr|item)getter) far more obvious and makes the code itself much simpler > to read. The only thing I will grant is that it aids in discoverability of a solution: you don't have to think of the (trivial) solution yourself, you just need to read the documentation. But I don't see either the problem or the solution to be great enough to justify adding an argument, writing new documentation, and doubling the number of tests for both itemgetter and attrgetter. -- Steven From masklinn at masklinn.net Fri Sep 14 11:29:47 2012 From: masklinn at masklinn.net (Masklinn) Date: Fri, 14 Sep 2012 11:29:47 +0200 Subject: [Python-ideas] Optional kwarg making attrgetter & itemgetter always return a tuple In-Reply-To: <5052F2BE.5070507@pearwood.info> References: <50528666.7090400@pearwood.info> <61323E14-01CE-4A1E-9533-292B093F51BC@masklinn.net> <5052F2BE.5070507@pearwood.info> Message-ID: On 2012-09-14, at 11:02 , Steven D'Aprano wrote >> and that's for the >> trivial version of itemgetter, attrgetter also does keypath resolution >> so the code is nowhere near this simple. > > I don't understand what you mean by "keypath resolution". attrgetter > simply looks up the attribute(s) by name, just like obj.name would do. It > has the same API as itemgetter, except with attribute names instead of > item indexes. It takes dotted paths, not just attribute names >> It's also anything but obvious what this snippet does on its own. > > Once you get past the ternary if operator, the complexity is pretty much > entirely in the call to itemgetter. You don't even use itemgetter in the > else clause! Beyond the call to itemgetter, it's trivially simple Python > code. > > slicer = operator.itemgetter(*indices, force_tuple=flag) > > is equally mysterious to anyone who doesn't know what itemgetter does. I would expect either foreknowledge or reading up on it to be obvious in the context of its usage. 
>>> If you don't like writing this out in place, write >>> it once in a helper function. Not every short code snippet needs to be in >>> the standard library. >> >> It's not really "every short code snippet" in this case, it's a way to >> avoid a sometimes deleterious special case and irregularity of the stdlib. > > > I disagree that this is a "sometimes deleterious special case". itemgetter > and attrgetter have two APIs: > > itemgetter(index)(L) => element > itemgetter(index, index, ...)(L) => tuple of elements > > and likewise for attrgetter: > > attrgetter(name)(L) => attribute > attrgetter(name, name, ...)(L) => tuple of attributes > > Perhaps it would have been better if there were four functions rather than > two. Or if the second API were: > > itemgetter(sequence_of_indexes)(L) => tuple of elements > attrgetter(sequence_of_names)(L) => tuple of attributes > > so that the two getters always took a single argument, and dispatched on > whether that argument is an atomic value or a sequence. But either way, > it is not what I consider a "special case" so much as two related non- > special cases. Which conflict for a sequence of length 1, which is the very reason why I started this thread. > But let's not argue about definitions. Special case or not, can you > demonstrate that the situation is not only deleterious, but cannot be > reasonably fixed with a helper function? Which as usual hinges on the definition of "reasonably", of course the situation can be "fixed" (with "reasonably" being a wholly personal value judgement) with a helper function or a reimplementation of an (attr|item)getter-like function from scratch. As it can pretty much always be. I don't see that as a very useful benchmark. > Whenever you call itemgetter, there is no ambiguity because you always know > whether you are calling it with a single index or multiple indexes. 
That is not quite correct, even ignoring that you have to call `len` to do so when the indices are provided by a third party, the correct code gets yet more complex as the third party could provide an iterator which would have to be reified before being passed to len(), increasing the complexity of the "helper" yet again. >>>> This makes for more verbose and less straightforward code, I think it >>>> would be useful to such situations if attrgetter and itemgetter could be >>>> forced into always returning a tuple by way of an optional argument: >>> >>> -1 >>> >>> There is no need to add extra complexity to itemgetter and attrgetter for >>> something best solved in your code. >> >> I don't agree with this statement, the stdlib flag adds very little >> extra complexity, way less than the original irregularity/special case > > Whether or not it is empirically less than the complexity already there in > itemgetter, it would still be adding extra complexity. It simply isn't > possible to end up with *less* complexity by *adding* features. At no point did I deny that, as far as I know or can see. > (Complexity is not always a bad thing. If we wanted to program in something > simple, we would program using a Turing machine.) > > The reader now has to consider "what does the force_tuple argument do?" > which is not necessarily trivial nor obvious. I expect a certain number of > beginners who don't read documentation will assume that you have to do this: > > slicer = itemgetter(1, 2, 3, force_tuple=False) > > if they want to pass something other than a tuple to slicer. Don't imagine > that adding an additional argument will make itemgetter and attrgetter > *simpler* to understand. > > > To me, a major red-flag for your suggested API can be seen here: > > itemgetter(1, 2, 3, 4, force_tuple=False) > > What should this do? The exact same as `itemgetter(1, 2, 3, 4)`, since `force_tuple` defaults to False. 
> I consider all the alternatives to be less than
> ideal:
>
> - ignore the explicit keyword argument and return a tuple anyway
> - raise an exception
>
> To say nothing of more... imaginative... semantics:
>
> - return a list, or a set, anything but a tuple
> - return a single element instead of four (but which one?)

I have trouble seeing how such interpretations can be drawn up from
explicitly providing the default value for the argument. Does anyone
really expect dict.get(key, None) to always return None?

> The suggested API is not as straight-forward as you seem to think it is.

It's simply a proposal to fix what I see as an issue (as befits
python-ideas), you're getting way too hung up on something which can
quite trivially be discussed and changed.

>> and way less than necessary to do it outside the stdlib. Furthermore, it
>> makes the solution (to having a regular output behavior for
>> (attr|item)getter) far more obvious and makes the code itself much simpler
>> to read.
>
> The only thing I will grant is that it aids in discoverability of a
> solution

It also aids in the discoverability of the problem in the first place,
and in limiting the surprise when unexpectedly encountering it for the
first time.

From wuwei23 at gmail.com  Fri Sep 14 11:41:43 2012
From: wuwei23 at gmail.com (alex23)
Date: Fri, 14 Sep 2012 02:41:43 -0700 (PDT)
Subject: [Python-ideas] Optional kwarg making attrgetter & itemgetter always return a tuple
In-Reply-To: References: Message-ID: 

On Sep 13, 11:15 pm, Masklinn wrote:
>     # works the same no matter what len(indices) is
>     slicer = operator.itemgetter(*indices, force_tuple=True)

I'd be inclined to write that as:

    slicer = force_tuple(operator.itemgetter(*indices))

With force_tuple then just being another decorator.
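For concreteness, the wrapper alex23 describes could be sketched as follows. `force_tuple` is a hypothetical helper, not part of the operator module, and the isinstance heuristic below is an assumption about how such a wrapper might distinguish the two cases:

```python
import operator


def force_tuple(getter):
    """Wrap a getter so that its result is always a tuple (sketch)."""
    def wrapped(obj):
        result = getter(obj)
        # A getter built from a single key returns a bare element;
        # normalize that case to a 1-tuple.
        if isinstance(result, tuple):
            return result
        return (result,)
    return wrapped


slicer = force_tuple(operator.itemgetter(*[1]))
print(slicer(['a', 'b', 'c']))       # -> ('b',)

slicer = force_tuple(operator.itemgetter(*[1, 2]))
print(slicer(['a', 'b', 'c', 'd']))  # -> ('b', 'c')
```

Note the caveat: this inherits exactly the ambiguity Steven pointed out earlier in the thread. If a single fetched element happens to be a tuple itself, `wrapped` cannot tell it apart from a multi-index result, so a robust version has to know `len(indices)` up front, as in Steven's `slicer` helper.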
From ncoghlan at gmail.com  Fri Sep 14 13:01:04 2012
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Fri, 14 Sep 2012 21:01:04 +1000
Subject: [Python-ideas] Optional kwarg making attrgetter & itemgetter always return a tuple
In-Reply-To: References: Message-ID: 

On Thu, Sep 13, 2012 at 11:15 PM, Masklinn wrote:
> attrgetter and itemgetter are both very useful functions, but both have
> a significant pitfall if the arguments passed in are validated but not
> controlled: if receiving the arguments (list of attributes, keys or
> indexes) from an external source and *-applying it, if the external
> source passes a sequence of one element both functions will in turn
> return an element rather than a singleton (1-element tuple).

Both attrgetter and itemgetter are really designed to be called with
*literal* arguments, not via *args. In particular, they are designed
to be useful as arguments bound to a "key" parameter, where the object
vs singleton tuple distinction doesn't matter.

If that behaviour is not desirable, *write a different function* that
does what you want, and don't use itemgetter or attrgetter at all.
These tools are designed as convenience functions for a particular use
case (specifically sorting, and similar ordering operations). Outside
those use cases, you will need to drop back down to the underlying
building blocks and produce your *own* tool from the same raw
materials. For example:

    def my_itemgetter(*subscripts):
        def f(obj):
            return tuple(obj[x] for x in subscripts)
        return f

I agree attrgetter is slightly more complex due to the fact that it
*also* handles chained lookups, where getattr does not, but that's a
matter of making the case for providing chained lookup (or even
str.format style field value lookup) as a more readily accessible
building block, not for making the attrgetter API more complicated.

Cheers, Nick.
-- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From masklinn at masklinn.net Fri Sep 14 13:36:39 2012 From: masklinn at masklinn.net (Masklinn) Date: Fri, 14 Sep 2012 13:36:39 +0200 Subject: [Python-ideas] Optional kwarg making attrgetter & itemgetter always return a tuple In-Reply-To: References: Message-ID: <93B4E6EE-8556-45E2-9F40-BCF7D6609328@masklinn.net> On 2012-09-14, at 13:01 , Nick Coghlan wrote: > On Thu, Sep 13, 2012 at 11:15 PM, Masklinn wrote: >> attrgetter and itemgetter are both very useful functions, but both have >> a significant pitfall if the arguments passed in are validated but not >> controlled: if receiving the arguments (list of attributes, keys or >> indexes) from an external source and *-applying it, if the external >> source passes a sequence of one element both functions will in turn >> return an element rather than a singleton (1-element tuple). > > Both attrgetter and itemgetter are really designed to be called with > *literal* arguments, not via *args. In particular, they are designed > to be useful as arguments bound to a "key" parameter, where the object > vs singleton tuple distinction doesn't matter. It was my understanding that they are also designed to be useful for mapping (such a usage is shown in itemgetter's examples), which is a superset of the use case outlined here. > If that behaviour is not desirable, *write a different function* that > does what you want, and don't use itemgetter or attrgetter at all. > These tools are designed as convenience functions And save for one stumbling block, they are utilities I love for their convenience and their plain clarity of purpose. 
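Nick's "build your own from raw materials" suggestion extends to the chained (dotted-path) lookups mentioned above. A minimal sketch, where `my_attrgetter` is a hypothetical stand-in rather than a stdlib function, and which always returns a tuple:

```python
from functools import reduce
from types import SimpleNamespace


def my_attrgetter(*paths):
    """Like operator.attrgetter, but always returns a tuple (sketch)."""
    def resolve(obj, path):
        # Chained lookup: 'a.b' -> getattr(getattr(obj, 'a'), 'b')
        return reduce(getattr, path.split('.'), obj)

    def f(obj):
        return tuple(resolve(obj, p) for p in paths)
    return f


p = SimpleNamespace(name='ed', address=SimpleNamespace(city='Brisbane'))
print(my_attrgetter('name')(p))                  # -> ('ed',)
print(my_attrgetter('name', 'address.city')(p))  # -> ('ed', 'Brisbane')
```

Because the length of `paths` is fixed when the getter is built, there is no single-element special case to dispatch on at call time.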
From oscar.j.benjamin at gmail.com Fri Sep 14 15:23:53 2012 From: oscar.j.benjamin at gmail.com (Oscar Benjamin) Date: Fri, 14 Sep 2012 14:23:53 +0100 Subject: [Python-ideas] Optional kwarg making attrgetter & itemgetter always return a tuple In-Reply-To: <93B4E6EE-8556-45E2-9F40-BCF7D6609328@masklinn.net> References: <93B4E6EE-8556-45E2-9F40-BCF7D6609328@masklinn.net> Message-ID: On 14 September 2012 12:36, Masklinn wrote: > On 2012-09-14, at 13:01 , Nick Coghlan wrote: > > On Thu, Sep 13, 2012 at 11:15 PM, Masklinn > wrote: > >> attrgetter and itemgetter are both very useful functions, but both have > >> a significant pitfall if the arguments passed in are validated but not > >> controlled: if receiving the arguments (list of attributes, keys or > >> indexes) from an external source and *-applying it, if the external > >> source passes a sequence of one element both functions will in turn > >> return an element rather than a singleton (1-element tuple). > > > > Both attrgetter and itemgetter are really designed to be called with > > *literal* arguments, not via *args. In particular, they are designed > > to be useful as arguments bound to a "key" parameter, where the object > > vs singleton tuple distinction doesn't matter. > > It was my understanding that they are also designed to be useful for > mapping (such a usage is shown in itemgetter's examples), which is > a superset of the use case outlined here. > > > If that behaviour is not desirable, *write a different function* that > > does what you want, and don't use itemgetter or attrgetter at all. > > These tools are designed as convenience functions > I can see why you would expect different behaviour here, though. I tend not to think of the functions in the operator module as convenience functions but as *efficient* nameable functions referring to operations that are normally invoked with a non-function syntax. 
Which is more convenient out of the following: 1) using operator import operator result = sorted(values, key=operator.attrgetter('name')) 2) using lambda result = sorted(values, key=lambda v: v.name) I don't think that the operator module is convenient and I think that it damages readability in many cases. My primary reason for choosing it in some cases is that it is more efficient than the lambda expression. There is no special syntax for 'get several items as a tuple'. I didn't know about this extended use for attrgetter, itemgetter. I can't see any other functions in the operator module (abs, add, and_, ...) that extend the semantics of the operation they are supposed to represent in this way. In general it is bad to conflate scalar/sequence semantics so that a caller should get a different type of object depending on the length of a sequence. I can see how practicality beats purity in adding this feature for people who want to use these functions for sorting by a couple of elements/attributes. I think it would have been better though to add these as separate functions itemsgetter and attrsgetter that always return tuples. Oscar -------------- next part -------------- An HTML attachment was scrubbed... URL: From stefan at bytereef.org Fri Sep 14 22:00:09 2012 From: stefan at bytereef.org (Stefan Krah) Date: Fri, 14 Sep 2012 22:00:09 +0200 Subject: [Python-ideas] Memoryview tolist() method is misleading Message-ID: <20120914200009.GA29952@sleipnir.bytereef.org> Alexander Belopolsky wrote: > Consider this: > >>> memoryview(b'x').cast('B', ()).tolist() > 120 > > The return value of to list() is an int, not a list. That's because NumPy's tolist() does the same thing: >>> x = numpy.array(120, dtype='B') >>> x array(120, dtype=uint8) >>> x.tolist() 120 If you implement tolist() recursively like in _testbuffer.c and choose the zeroth dimension as the base case, you arrive at single elements. So at least it's not completely unnatural. 
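[Editorial note: the 0-d versus 1-d behaviour Stefan describes is easy to verify with the stdlib alone (Python 3.3+, where memoryview.cast accepts an empty shape).]

```python
m1 = memoryview(b'x')   # 1-d view of one byte
print(m1.tolist())      # [120] -- a list

m0 = m1.cast('B', ())   # 0-d ("scalar") view of the same byte
print(m0.tolist())      # 120   -- a bare int, mirroring NumPy's tolist()
```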
Stefan Krah From jimjjewett at gmail.com Fri Sep 14 23:02:31 2012 From: jimjjewett at gmail.com (Jim Jewett) Date: Fri, 14 Sep 2012 17:02:31 -0400 Subject: [Python-ideas] Optional kwarg making attrgetter & itemgetter always return a tuple In-Reply-To: References: <93B4E6EE-8556-45E2-9F40-BCF7D6609328@masklinn.net> Message-ID: On 9/14/12, Oscar Benjamin wrote: > I can see why you would expect different behaviour here, though. I tend not > to think of the functions in the operator module as convenience functions > but as *efficient* nameable functions referring to operations that are > normally invoked with a non-function syntax. Which is more convenient out > of the following: > 1) using operator > import operator > result = sorted(values, key=operator.attrgetter('name')) I would normally write that as from operator import attrgetter as attr ... # may use it several times result=sorted(values, key=attr('name')) which is about the best I could hope for, without being able to use the dot itself. > 2) using lambda > result = sorted(values, key=lambda v: v.name) And I honestly think that would be worse, even if lambda didn't have a code smell. It focuses attention on the fact that you're creating a callable, instead of on the fact that you're grabbing the name attribute. > In general it is bad to conflate scalar/sequence semantics so that a caller > should get a different type of object depending on the length of a > sequence. Yeah, but that can't really be solved well in python, except maybe by never extending an API to handle sequences. I would personally not consider that an improvement. Part of the problem is that the cleanest way to take a variable number of arguments is to turn them into a sequence under the covers (*args), even if they weren't passed that way. 
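[Editorial note: a tuple-always variant along the lines Oscar suggests is a small wrapper; the name `itemsgetter` is hypothetical and is not in the operator module.]

```python
from operator import itemgetter

def itemsgetter(*items):
    """Like operator.itemgetter, but always returns a tuple."""
    get = itemgetter(*items)
    if len(items) == 1:
        # Wrap the single-key result so callers always see a tuple.
        return lambda obj: (get(obj),)
    return get

row = {"name": "a.txt", "size": 12}
print(itemsgetter("name")(row))          # ('a.txt',)
print(itemsgetter("name", "size")(row))  # ('a.txt', 12)
```

An `attrsgetter` would follow the same pattern over `attrgetter`.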
-jJ From dholth at gmail.com Fri Sep 14 23:24:13 2012 From: dholth at gmail.com (Daniel Holth) Date: Fri, 14 Sep 2012 17:24:13 -0400 Subject: [Python-ideas] Why is there no way to pass PYTHONPATH on the command line? Message-ID: Why is there no way to pass PYTHONPATH on the command line? Oversight or intentional? Given path_item/something.py python -p path_item -c "import something; something.foo()" I am aware that the __main__.py behavior lessens the need for this significantly. From mikegraham at gmail.com Sat Sep 15 00:10:17 2012 From: mikegraham at gmail.com (Mike Graham) Date: Fri, 14 Sep 2012 18:10:17 -0400 Subject: [Python-ideas] Why is there no way to pass PYTHONPATH on the command line? In-Reply-To: References: Message-ID: On Fri, Sep 14, 2012 at 5:24 PM, Daniel Holth wrote: > Why is there no way to pass PYTHONPATH on the command line? Oversight > or intentional? > > Given path_item/something.py > > python -p path_item -c "import something; something.foo()" > > I am aware that the __main__.py behavior lessens the need for this > significantly. PYTHONPATH=whatever python -c "import something; something.foo()" Mike From dholth at gmail.com Sat Sep 15 00:11:29 2012 From: dholth at gmail.com (Daniel Holth) Date: Fri, 14 Sep 2012 18:11:29 -0400 Subject: [Python-ideas] Why is there no way to pass PYTHONPATH on the command line? In-Reply-To: References: Message-ID: On Fri, Sep 14, 2012 at 6:10 PM, Mike Graham wrote: > On Fri, Sep 14, 2012 at 5:24 PM, Daniel Holth wrote: >> Why is there no way to pass PYTHONPATH on the command line? Oversight >> or intentional? >> >> Given path_item/something.py >> >> python -p path_item -c "import something; something.foo()" >> >> I am aware that the __main__.py behavior lessens the need for this >> significantly. > > PYTHONPATH=whatever python -c "import something; something.foo()" > > Mike Yes, I've heard of that one. Does it work on Windows too? 
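[Editorial note: the per-invocation environment-variable spelling differs by shell; the POSIX line below is runnable (assuming a `python3` on PATH), while the cmd.exe and PowerShell lines are the usual Windows workarounds, sketched here and untested.]

```shell
# POSIX shells: a VAR=value prefix scopes the variable to one command only
PYTHONPATH=/tmp python3 -c 'import sys; assert "/tmp" in sys.path'

# Windows cmd.exe has no prefix form; the workaround is per-session:
#   set PYTHONPATH=path_item
#   python -c "import something; something.foo()"

# PowerShell:
#   $env:PYTHONPATH = "path_item"
#   python -c "import something; something.foo()"
```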
From p.f.moore at gmail.com Sat Sep 15 00:40:31 2012 From: p.f.moore at gmail.com (Paul Moore) Date: Fri, 14 Sep 2012 23:40:31 +0100 Subject: [Python-ideas] Why is there no way to pass PYTHONPATH on the command line? In-Reply-To: References: Message-ID: On 14 September 2012 23:11, Daniel Holth wrote: > On Fri, Sep 14, 2012 at 6:10 PM, Mike Graham wrote: >> On Fri, Sep 14, 2012 at 5:24 PM, Daniel Holth wrote: >>> Why is there no way to pass PYTHONPATH on the command line? Oversight >>> or intentional? >>> >>> Given path_item/something.py >>> >>> python -p path_item -c "import something; something.foo()" >>> >>> I am aware that the __main__.py behavior lessens the need for this >>> significantly. >> >> PYTHONPATH=whatever python -c "import something; something.foo()" >> >> Mike > > Yes, I've heard of that one. Does it work on Windows too? No. Sadly. Paul. From timothy.c.delaney at gmail.com Sat Sep 15 00:47:23 2012 From: timothy.c.delaney at gmail.com (Tim Delaney) Date: Sat, 15 Sep 2012 08:47:23 +1000 Subject: [Python-ideas] Why is there no way to pass PYTHONPATH on the command line? In-Reply-To: References: Message-ID: On 15 September 2012 08:40, Paul Moore wrote: > On 14 September 2012 23:11, Daniel Holth wrote: > > On Fri, Sep 14, 2012 at 6:10 PM, Mike Graham > wrote: > >> PYTHONPATH=whatever python -c "import something; something.foo()" > >> > >> Mike > > > > Yes, I've heard of that one. Does it work on Windows too? > > No. Sadly. Unless you're using either cygwin or (my preference) msys < http://www.mingw.org/wiki/MSYS>. Tim Delaney -------------- next part -------------- An HTML attachment was scrubbed... URL: From ericsnowcurrently at gmail.com Sat Sep 15 00:48:15 2012 From: ericsnowcurrently at gmail.com (Eric Snow) Date: Fri, 14 Sep 2012 16:48:15 -0600 Subject: [Python-ideas] Why is there no way to pass PYTHONPATH on the command line? 
In-Reply-To: References: Message-ID: On Fri, Sep 14, 2012 at 3:24 PM, Daniel Holth wrote: > Why is there no way to pass PYTHONPATH on the command line? Oversight > or intentional? > > Given path_item/something.py > > python -p path_item -c "import something; something.foo()" Do you mean something along the lines of http://bugs.python.org/issue13475? This would add --path0 and --nopath0 as commandline options. Of course, this would only be in Python 3.4 (if it makes it in at all). -eric From guido at python.org Sat Sep 15 01:03:39 2012 From: guido at python.org (Guido van Rossum) Date: Fri, 14 Sep 2012 16:03:39 -0700 Subject: [Python-ideas] Why is there no way to pass PYTHONPATH on the command line? In-Reply-To: References: Message-ID: On Fri, Sep 14, 2012 at 3:48 PM, Eric Snow wrote: > On Fri, Sep 14, 2012 at 3:24 PM, Daniel Holth wrote: >> Why is there no way to pass PYTHONPATH on the command line? Oversight >> or intentional? >> >> Given path_item/something.py >> >> python -p path_item -c "import something; something.foo()" > > Do you mean something along the lines of > http://bugs.python.org/issue13475? This would add --path0 and > --nopath0 as commandline options. Of course, this would only be in > Python 3.4 (if it makes it in at all). Why is the proposal --path0 and not --path? (The only utility I know with a command line option spelled as "foo0" is find, where -print0 means "like -print, but append \0 instead of \n".) -- --Guido van Rossum (python.org/~guido) From ericsnowcurrently at gmail.com Sat Sep 15 01:17:46 2012 From: ericsnowcurrently at gmail.com (Eric Snow) Date: Fri, 14 Sep 2012 17:17:46 -0600 Subject: [Python-ideas] Why is there no way to pass PYTHONPATH on the command line? In-Reply-To: References: Message-ID: On Fri, Sep 14, 2012 at 5:03 PM, Guido van Rossum wrote: > On Fri, Sep 14, 2012 at 3:48 PM, Eric Snow wrote: >> ... issue13475 ... > Why is the proposal --path0 and not --path? 
(The only utility I know > with a command line option spelled as "foo0" is find, where -print0 > means "like -print, but append \0 instead of \n".) It's meant to allow you to explicitly manage the sys.path[0] value that gets set implicitly. That it fits Daniel's use case (as I understand it) is not coincidental, but it's not meant as a wholesale substitute for PYTHONPATH. That would be a separate proposal (which I believe exists in the tracker already). -eric From dholth at gmail.com Sat Sep 15 01:34:55 2012 From: dholth at gmail.com (Daniel Holth) Date: Fri, 14 Sep 2012 19:34:55 -0400 Subject: [Python-ideas] Why is there no way to pass PYTHONPATH on the command line? In-Reply-To: References: Message-ID: Surely it should be the same as python path, at the front, and use - E to ignore the environment On Sep 14, 2012 7:17 PM, "Eric Snow" wrote: > On Fri, Sep 14, 2012 at 5:03 PM, Guido van Rossum > wrote: > > On Fri, Sep 14, 2012 at 3:48 PM, Eric Snow > wrote: > >> ... issue13475 ... > > Why is the proposal --path0 and not --path? (The only utility I know > > with a command line option spelled as "foo0" is find, where -print0 > > means "like -print, but append \0 instead of \n".) > > It's meant to allow you to explicitly manage the sys.path[0] value > that gets set implicitly. That it fits Daniel's use case (as I > understand it) is not coincidental, but it's not meant as a wholesale > substitute for PYTHONPATH. That would be a separate proposal (which I > believe exists in the tracker already). > > -eric > -------------- next part -------------- An HTML attachment was scrubbed... URL: From greg at krypto.org Sat Sep 15 08:28:59 2012 From: greg at krypto.org (Gregory P. 
Smith) Date: Fri, 14 Sep 2012 23:28:59 -0700 Subject: [Python-ideas] syntax to continue into the next subsequent except block In-Reply-To: <505279D3.9070709@canterbury.ac.nz> References: <505279D3.9070709@canterbury.ac.nz> Message-ID: On Thu, Sep 13, 2012 at 5:26 PM, Greg Ewing wrote: > Paul Wiseman wrote: > >> try: >> operation() >> except IOError as err: >> if err.errno == 2: >> do_something() >> else: >> continue #This would continue the except down to the next check, >> > > I think a better way to address this would be to allow guard > expressions on the except clauses. > > > try: > operation() > except IOError as err if err.errno == 2: > ... I like that. Granted, what are the use cases beyond the errno one that PEP 3151 largely addresses? > > > -- > Greg > > ______________________________**_________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/**mailman/listinfo/python-ideas > -------------- next part -------------- An HTML attachment was scrubbed... URL: From poalman at gmail.com Sat Sep 15 10:15:53 2012 From: poalman at gmail.com (Paul Wiseman) Date: Sat, 15 Sep 2012 09:15:53 +0100 Subject: [Python-ideas] syntax to continue into the next subsequent except block In-Reply-To: References: <87k3vxqsk8.fsf@uwakimon.sk.tsukuba.ac.jp> Message-ID: On 14 September 2012 03:32, Nick Coghlan wrote: > On Fri, Sep 14, 2012 at 11:52 AM, Stephen J. Turnbull > wrote: > > ISTR there were discussions of "qualified except" clauses here maybe > > 6mo to 1yr ago? That is, they'd look something like > > > > try: > > operation() > > except IOError as err if err.errno == 2: > > do_something() > > except Exception: > > logger.error("Error performing operation: {}".format(err.message)") > > some_clean_up() > > raise > > > > Again ISTR that this got spiked for some reason, but maybe it will be > > of use to the OP in formulating his next idea. Sorry for the lack of > > precise reference. 
> > They were one of the ideas discussed when Antoine was writing PEP > 3151. As I recall, nobody could think of any good use cases that > didn't involve errno checking, and PEP 3151 provides a far more > elegant (and cross-platform) solution to most problems that require > errno checking in versions prior to 3.3. > > Ah I didn't know about that, maybe I chose a bad example with IOError. The reason that got me thinking is I had to handle specific S3ResponseErrors from boto. the S3ResponseError exception class has a code attribute (or errorcode, i forget exactly). Surely in this case and a lot of other modules different exceptions are grouped together that are likely to want to be handle differently? URLError and HTTPError I'm sure fall into this catagory > Cheers, > Nick. > > -- > Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > -------------- next part -------------- An HTML attachment was scrubbed... URL: From poalman at gmail.com Sat Sep 15 10:21:40 2012 From: poalman at gmail.com (Paul Wiseman) Date: Sat, 15 Sep 2012 09:21:40 +0100 Subject: [Python-ideas] syntax to continue into the next subsequent except block In-Reply-To: <87k3vxqsk8.fsf@uwakimon.sk.tsukuba.ac.jp> References: <87k3vxqsk8.fsf@uwakimon.sk.tsukuba.ac.jp> Message-ID: On 14 September 2012 02:52, Stephen J. Turnbull wrote: > Terry Reedy writes: > > > try: > > try: > > Ugh-ugh.<0.5 wink> > > > try: > > operation() > > except Exception as err: > > if isinstance(err, IOError) and err.errno == 2: > > Ugh.<0.5 wink> > > Not your fault, but these constructions are pretty ugly IMO, I have to > go with the OP on that. > > ISTR there were discussions of "qualified except" clauses here maybe > 6mo to 1yr ago? 
> They were one of the ideas discussed when Antoine was writing PEP > 3151. As I recall, nobody could think of any good use cases that > didn't involve errno checking, and PEP 3151 provides a far more > elegant (and cross-platform) solution to most problems that require > errno checking in versions prior to 3.3. > > Ah I didn't know about that, maybe I chose a bad example with IOError. The reason that got me thinking is I had to handle specific S3ResponseErrors from boto. The S3ResponseError exception class has a code attribute (or errorcode, I forget exactly). Surely in this case and a lot of other modules different exceptions are grouped together that are likely to want to be handled differently? URLError and HTTPError I'm sure fall into this category. > Cheers, > Nick. > > -- > Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > -------------- next part -------------- An HTML attachment was scrubbed... URL: From poalman at gmail.com Sat Sep 15 10:21:40 2012 From: poalman at gmail.com (Paul Wiseman) Date: Sat, 15 Sep 2012 09:21:40 +0100 Subject: [Python-ideas] syntax to continue into the next subsequent except block In-Reply-To: <87k3vxqsk8.fsf@uwakimon.sk.tsukuba.ac.jp> References: <87k3vxqsk8.fsf@uwakimon.sk.tsukuba.ac.jp> Message-ID: On 14 September 2012 02:52, Stephen J. Turnbull wrote: > Terry Reedy writes: > > > try: > > try: > > Ugh-ugh.<0.5 wink> > > > try: > > operation() > > except Exception as err: > > if isinstance(err, IOError) and err.errno == 2: > > Ugh.<0.5 wink> > > Not your fault, but these constructions are pretty ugly IMO, I have to > go with the OP on that. > > ISTR there were discussions of "qualified except" clauses here maybe > 6mo to 1yr ago?
That is, they'd look something like > > try: > operation() > except IOError as err if err.errno == 2: > do_something() > except Exception: > logger.error("Error performing operation: {}".format(err.message)") > some_clean_up() > raise > > Again ISTR that this got spiked for some reason, but maybe it will be > of use to the OP in formulating his next idea. Sorry for the lack of > precise reference. > I like that "qualified except". Almost goes without saying it's a much better idea/solution that my idea of a continue (which has already pointed out to be flawed- I'm not sure why now I thought it was always a syntax error) > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > -------------- next part -------------- An HTML attachment was scrubbed... URL: From rob.cliffe at btinternet.com Sat Sep 15 12:20:49 2012 From: rob.cliffe at btinternet.com (Rob Cliffe) Date: Sat, 15 Sep 2012 11:20:49 +0100 Subject: [Python-ideas] syntax to continue into the next subsequent except block In-Reply-To: References: <87k3vxqsk8.fsf@uwakimon.sk.tsukuba.ac.jp> Message-ID: <50545681.5040406@btinternet.com> On 15/09/2012 09:21, Paul Wiseman wrote: > On 14 September 2012 02:52, Stephen J. Turnbull > wrote: > > Terry Reedy writes: > > > try: > > try: > > Ugh-ugh.<0.5 wink> > > > try: > > operation() > > except Exception as err: > > if isinstance(err, IOError) and err.errno == 2: > > Ugh.<0.5 wink> > > Not your fault, but these constructions are pretty ugly IMO, I have to > go with the OP on that. > > ISTR there were discussions of "qualified except" clauses here maybe > 6mo to 1yr ago? 
That is, they'd look something like > > try: > operation() > except IOError as err if err.errno == 2: > do_something() > except Exception: > logger.error("Error performing operation: > {}".format(err.message)") > some_clean_up() > raise > > Again ISTR that this got spiked for some reason, but maybe it will be > of use to the OP in formulating his next idea. Sorry for the lack of > precise reference. > > > I like that "qualified except". Almost goes without saying it's a much > better idea/solution that my idea of a continue (which has already > pointed out to be flawed- I'm not sure why now I thought it was always > a syntax error) I really like this qualified except! It's meaning is intuitively obvious - AND it's useful. Rob Cliffe > > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > > > > > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas -------------- next part -------------- An HTML attachment was scrubbed... URL: From oscar.j.benjamin at gmail.com Sat Sep 15 13:09:12 2012 From: oscar.j.benjamin at gmail.com (Oscar Benjamin) Date: Sat, 15 Sep 2012 12:09:12 +0100 Subject: [Python-ideas] Optional kwarg making attrgetter & itemgetter always return a tuple In-Reply-To: References: <93B4E6EE-8556-45E2-9F40-BCF7D6609328@masklinn.net> Message-ID: On Sep 14, 2012 10:02 PM, "Jim Jewett" wrote: > > On 9/14/12, Oscar Benjamin wrote: > > > I can see why you would expect different behaviour here, though. I tend not > > to think of the functions in the operator module as convenience functions > > but as *efficient* nameable functions referring to operations that are > > normally invoked with a non-function syntax. 
Which is more convenient out > > of the following: > > > 1) using operator > > import operator > > result = sorted(values, key=operator.attrgetter('name')) > > I would normally write that as > > from operator import attrgetter as attr > ... # may use it several times > > result=sorted(values, key=attr('name')) > > which is about the best I could hope for, without being able to use > the dot itself. To be clear, I wasn't complaining about the inconvenience of importing and referring to attrgetter. I was saying that if the obvious alternative (lambda functions) is at least as convenient then it's odd to describe itemgetter/attrgetter as convenience functions. > > 2) using lambda > > result = sorted(values, key=lambda v: v.name) > > And I honestly think that would be worse, even if lambda didn't have a > code smell. It focuses attention on the fact that you're creating a > callable, instead of on the fact that you're grabbing the name > attribute. I disagree here. I find the fact that a lambda function shows me the expression I would normally use to get the quantity I'm interested in makes it easier for me to read. When I look at it I don't see it as a callable function but as an expression that I'm passing for use somewhere else. > > > In general it is bad to conflate scalar/sequence semantics so that a caller > > should get a different type of object depending on the length of a > > sequence. > > Yeah, but that can't really be solved well in python, except maybe by > never extending an API to handle sequences. I would personally not > consider that an improvement. > > Part of the problem is that the cleanest way to take a variable number > of arguments is to turn them into a sequence under the covers (*args), > even if they weren't passed that way. > > -jJ You can extend an API to support sequences by adding a new entry point. This is a common idiom in python: think list.append vs list.extend. 
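[Editorial note: the append/extend split mentioned above is the same idea in miniature -- two entry points instead of one length-dependent result.]

```python
items = [1]
items.append([2, 3])   # one argument, stored as a single element
items.extend([4, 5])   # a sequence, spliced in element by element
print(items)           # [1, [2, 3], 4, 5]
```

Neither method changes behaviour based on what the caller happens to pass; the choice of entry point, not the argument's length, decides the semantics.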
Oscar -------------- next part -------------- An HTML attachment was scrubbed... URL: From ncoghlan at gmail.com Sat Sep 15 14:43:59 2012 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sat, 15 Sep 2012 22:43:59 +1000 Subject: [Python-ideas] Optional kwarg making attrgetter & itemgetter always return a tuple In-Reply-To: <93B4E6EE-8556-45E2-9F40-BCF7D6609328@masklinn.net> References: <93B4E6EE-8556-45E2-9F40-BCF7D6609328@masklinn.net> Message-ID: On Fri, Sep 14, 2012 at 9:36 PM, Masklinn wrote: > On 2012-09-14, at 13:01 , Nick Coghlan wrote: >> Both attrgetter and itemgetter are really designed to be called with >> *literal* arguments, not via *args. In particular, they are designed >> to be useful as arguments bound to a "key" parameter, where the object >> vs singleton tuple distinction doesn't matter. > > It was my understanding that they are also designed to be useful for > mapping (such a usage is shown in itemgetter's examples), which is > a superset of the use case outlined here. The "key" style usage was definitely the primary motivator, which is why the ambiguity in the *args case wasn't noticed. If it *had* been noticed, the multiple argument support likely never would have been added. As it is, the *only* case where the ambiguity causes problems is when you want to use *args with these functions. Since they weren't built with that style of usage in mind, they don't handle it well. Making them even *more* complicated to work around an earlier design mistake doesn't seem like a good idea. Cheers, Nick. 
-- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From cs at zip.com.au Sun Sep 16 04:20:28 2012 From: cs at zip.com.au (Cameron Simpson) Date: Sun, 16 Sep 2012 12:20:28 +1000 Subject: [Python-ideas] syntax to continue into the next subsequent except block In-Reply-To: References: Message-ID: <20120916022028.GA8527@cskk.homeip.net> On 15Sep2012 09:15, Paul Wiseman wrote: | On 14 September 2012 03:32, Nick Coghlan wrote: | > On Fri, Sep 14, 2012 at 11:52 AM, Stephen J. Turnbull | > wrote: | > > ISTR there were discussions of "qualified except" clauses here maybe | > > 6mo to 1yr ago? That is, they'd look something like | > > try: | > > operation() | > > except IOError as err if err.errno == 2: [...] | > > Again ISTR that this got spiked for some reason, but maybe it will be | > > of use to the OP in formulating his next idea. Sorry for the lack of | > > precise reference. | > | > They were one of the ideas discussed when Antoine was writing PEP | > 3151. As I recall, nobody could think of any good use cases that | > didn't involve errno checking, and PEP 3151 provides a far more | > elegant (and cross-platform) solution to most problems that require | > errno checking in versions prior to 3.3. | > | Ah I didn't know about that, maybe I chose a bad example with IOError. | | The reason that got me thinking is I had to handle specific | S3ResponseErrors from boto. | the S3ResponseError exception class has a code attribute (or errorcode, i | forget exactly). I have to say I find this supportive. I think the reason that there were no use cases that don't involve errno is that most exceptions don't provide fine grained failure information. IOError/OSError's errno is the main exception. Personally I think it is a shame that exceptions are generally so uninspectable: raise ValueError("arbitrary prose here") Who here thinks that is useful to a _program_ for inspecting issues? It's the equivalent of the useless bug report "foo is broken!" 
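[Editorial note: a sketch of the kind of inspectable exception being asked for; the `RangeError` class is invented for illustration and is not from the thread or the stdlib.]

```python
class RangeError(ValueError):
    """A ValueError that a program can inspect, not just read."""
    def __init__(self, value, lo, hi):
        super().__init__("%r not in range [%r, %r]" % (value, lo, hi))
        # Structured fields, usable by code -- unlike a prose-only message.
        self.value, self.lo, self.hi = value, lo, hi

try:
    raise RangeError(42, 0, 10)
except RangeError as e:
    print(e.value, e.lo, e.hi)   # 42 0 10
```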
When we're raising an exception for something that should not occur, indicating a programming bug, the above is fine; for a system failure requiring a programmed and appropriate response, the above is pretty vague. OSError and IOError fall pretty clearly into the latter category, and so does S3ResponseError and so (should) a bunch of other exceptions raised by libraries. So in a better quality exception environment, the "if" qualifier would have many more use cases than Antoine encountered while writing his PEP? So I'm a big +1 for the: except foo as bar if wibble: syntax. We do it for list comprehensions, we (kicking and screaming) do it for the ternary choice operator (eschewing C's ?: notation) and I think we should do it here. Cheers, -- Cameron Simpson Outside of a dog, a book is a man's best friend. Inside of a dog, it's too dark to read. - Groucho Marx From yselivanov.ml at gmail.com Sun Sep 16 19:16:34 2012 From: yselivanov.ml at gmail.com (Yury Selivanov) Date: Sun, 16 Sep 2012 13:16:34 -0400 Subject: [Python-ideas] syntax to continue into the next subsequent except block In-Reply-To: <20120916022028.GA8527@cskk.homeip.net> References: <20120916022028.GA8527@cskk.homeip.net> Message-ID: On 2012-09-15, at 10:20 PM, Cameron Simpson wrote: > On 15Sep2012 09:15, Paul Wiseman wrote: > | On 14 September 2012 03:32, Nick Coghlan wrote: > | > On Fri, Sep 14, 2012 at 11:52 AM, Stephen J. Turnbull > | > wrote: > | > > ISTR there were discussions of "qualified except" clauses here maybe > | > > 6mo to 1yr ago? That is, they'd look something like > | > > try: > | > > operation() > | > > except IOError as err if err.errno == 2: > [...] > | > > Again ISTR that this got spiked for some reason, but maybe it will be > | > > of use to the OP in formulating his next idea. Sorry for the lack of > | > > precise reference. > | > > | > They were one of the ideas discussed when Antoine was writing PEP > | > 3151.
As I recall, nobody could think of any good use cases that > | > didn't involve errno checking, and PEP 3151 provides a far more > | > elegant (and cross-platform) solution to most problems that require > | > errno checking in versions prior to 3.3. > | > > | Ah I didn't know about that, maybe I chose a bad example with IOError. > | > | The reason that got me thinking is I had to handle specific > | S3ResponseErrors from boto. > | the S3ResponseError exception class has a code attribute (or errorcode, i > | forget exactly). > > I have to say I find this supportive. I think the reason that there were > no use cases that don't involve errno is that most exceptions don't > provide fine grained failure information. IOError/OSError's errno is > the main exception. > > Personally I think it is a shame that exceptions are generally so > uninspectable: > > raise ValueError("arbitrary prose here") So you want to write code like: except ValueError as ex if 'foo is wrong' in ex.args[0]: ? This thread started with IOError and its errno attribute, and for those exact cases I find 'except .. if' approach quite useful. But now, in 3.3 with PEP 3151 we have a much more granular exceptions tree, so instead of writing except IOError as ex if ex.errno == errno.ENOENT: you will write: except FileNotFoundError as ex: And that's actually how this class of problems should be addressed: instead of adding attributes to exceptions and exception guards to language - just design your exception classes better. We have multiple inheritance after all, the perfect method of classifying objects/exceptions, why should we code information about the exception class to some attribute? If some library unifies all types of exceptions in one 'S3ResponseError' exception - that's the problem of the library design. Big -1. 
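[Editorial note: the two spellings being contrasted can be put side by side; a runnable sketch with a stand-in `operation`. Note that in 3.3+ constructing `IOError` with `errno.ENOENT` already yields the `FileNotFoundError` subclass.]

```python
import errno

def operation():
    raise IOError(errno.ENOENT, "No such file or directory")

handled = []

# Pre-3.3 idiom: catch broadly, re-raise what isn't ours --
# the pattern the proposed "except ... if" guard would replace.
try:
    operation()
except IOError as err:
    if err.errno != errno.ENOENT:
        raise          # not ours -- let it propagate
    handled.append("guard")

# 3.3+ / PEP 3151: the subclass itself carries the errno distinction.
try:
    operation()
except FileNotFoundError:
    handled.append("subclass")

print(handled)   # ['guard', 'subclass']
```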
- Yury From dholth at gmail.com Sun Sep 16 20:17:08 2012 From: dholth at gmail.com (Daniel Holth) Date: Sun, 16 Sep 2012 14:17:08 -0400 Subject: [Python-ideas] bdist naming scheme (compatibility tags) PEP In-Reply-To: <50236FD5.2060404@egenix.com> References: <50236FD5.2060404@egenix.com> Message-ID: >> Platform Tag >> ------------ >> >> The platform tag is simply `distutils.util.get_platform()` with all >> hyphens `-` and periods `.` replaced with underscore `_`. > > This part is going to cause problems. distutils is good at identifying > Linux and Windows and giving them sensible platform names, but it > doesn't do a good job for other OSes. http://www.python.org/dev/peps/pep-0425/ I've updated this part of the PEP with some examples of get_platform(), and have simplified the "list of supported tags" section by removing the "all older versions of Python with the same major version" logic from the PEP. It is still allowed, but it is not needed in the PEP. I would love to expound on the correct implementation of get_platform() for all major platforms. I do not know anything about the other platforms. A BSD and OSX expert will necessarily have to write that part of the specification. Daniel Holth (If you think the list of supported tags is long, go read about the Google spell-correct algorithm that pre-computes every spelling mistake for every word so it can tell you which correctly spelled word is the closest to your typo) From guido at python.org Sun Sep 16 21:04:25 2012 From: guido at python.org (Guido van Rossum) Date: Sun, 16 Sep 2012 12:04:25 -0700 Subject: [Python-ideas] syntax to continue into the next subsequent except block In-Reply-To: References: <20120916022028.GA8527@cskk.homeip.net> Message-ID: The suggestion to add ad-hoc "if " clauses to random parts of the syntax doesn't appeal to me at all. If your code becomes too complex without this you're probably doing something else wrong. 
-- --Guido van Rossum (python.org/~guido) From joshua.landau.ws at gmail.com Sun Sep 16 22:51:41 2012 From: joshua.landau.ws at gmail.com (Joshua Landau) Date: Sun, 16 Sep 2012 21:51:41 +0100 Subject: [Python-ideas] syntax to continue into the next subsequent except block In-Reply-To: References: <20120916022028.GA8527@cskk.homeip.net> Message-ID: Instead of adding syntax, we could make except use isinstance. This means that those people who are using exceptions with poor hierarchies can override the classes with their own. This makes for an ugly setup but a very readable result. If you put the class definitions in another file and import them, it may end up looking quite clean. Please forgive me if I am doing this wrong, as I've never done this sort of thing before.

class SpecificOSErrorType(type):
    def __instancecheck__(cls, instance):
        # 'instance' is the exception object being tested; compare its
        # errno against the errno declared on the class being checked.
        if isinstance(instance, OSError):
            if instance.errno == cls.errno:
                return True
        return False

class FileExistsOSError(OSError, metaclass=SpecificOSErrorType):
    errno = 17

a = OSError(17, None)
b = OSError(10, None)
print(a.errno)  # >> 17
print(b.errno)  # >> 10
print(isinstance(a, FileExistsOSError))  # >> True
print(isinstance(b, FileExistsOSError))  # >> False

try:
    raise FileExistsOSError
except OSError as e:
    print(e.errno)  # >> 17

import os
try:
    os.mkdir("src")
except FileExistsOSError:  # Fails: except clauses don't consult __instancecheck__
    print("Could not make directory: File already exists")

Advantages:
- No syntax change
- Clean result (as would be expected if the hierarchy was right from the start)
- Works with any library, and is backwards compatible
- Would enable forward-compatibility (if this was in 3.2, we could re-implement the 3.3 hierarchy)
- [Half-point] More general use of this may encourage better exception hierarchies

Disadvantages:
- It doesn't work yet [*wink*]
- It may be a bad idea to encourage people to override standard class hierarchies, even if we agree that they are badly designed in 3.2
- Requires use of metaclasses and stuff people shouldn't need to
understand - Setup is extremely ugly and much longer than the replaced code -------------- next part -------------- An HTML attachment was scrubbed... URL: From pyideas at rebertia.com Sun Sep 16 23:32:12 2012 From: pyideas at rebertia.com (Chris Rebert) Date: Sun, 16 Sep 2012 14:32:12 -0700 Subject: [Python-ideas] syntax to continue into the next subsequent except block In-Reply-To: References: <20120916022028.GA8527@cskk.homeip.net> Message-ID: On Sun, Sep 16, 2012 at 1:51 PM, Joshua Landau wrote: > Instead of adding syntax, we could make except use isinstance. > class SpecificOSErrorType(type): > def __instancecheck__(cls, othercls): > if isinstance(othercls, OSError): > class FileExistsOSError(OSError, metaclass=SpecificOSErrorType): > Disadvantages: > - It doesn't work yet [*wink*] There's actually an already-open bug regarding that: "Catching virtual subclasses in except clauses" http://bugs.python.org/issue12029 Cheers, Chris From cs at zip.com.au Mon Sep 17 00:30:08 2012 From: cs at zip.com.au (Cameron Simpson) Date: Mon, 17 Sep 2012 08:30:08 +1000 Subject: [Python-ideas] syntax to continue into the next subsequent except block In-Reply-To: References: Message-ID: <20120916223008.GA19129@cskk.homeip.net> On 16Sep2012 13:16, Yury Selivanov wrote: | On 2012-09-15, at 10:20 PM, Cameron Simpson wrote: | > On 15Sep2012 09:15, Paul Wiseman wrote: | > | The reason that got me thinking is I had to handle specific | > | S3ResponseErrors from boto. | > | the S3ResponseError exception class has a code attribute (or errorcode, i | > | forget exactly). | > | > I have to say I find this supportive. I think the reason that there were | > no use cases that don't involve errno is that most exceptions don't | > provide fine grained failure information. IOError/OSError's errno is | > the main exception. 
| >
| > Personally I think it is a shame that exceptions are generally so
| > uninspectable:
| >
| > raise ValueError("arbitrary prose here")
|
| So you want to write code like:
|
| except ValueError as ex if 'foo is wrong' in ex.args[0]:

No! Not at all. I was moaning about the lack of inspectability of most
of the exceptions, and ValueError is top of the list. Of course, it has
to be since we raise it for almost any sanity check failure; there's no
nice enumeration of the (unbounded) possible ways a value may be
unsuitable.

| This thread started with IOError and its errno attribute, and for those
| exact cases I find 'except .. if' approach quite useful. But now, in
| 3.3 with PEP 3151 we have a much more granular exceptions tree, so instead
| of writing
|
| except IOError as ex if ex.errno == errno.ENOENT:
|
| you will write:
|
| except FileNotFoundError as ex:
|
| And that's actually how this class of problems should be addressed:
| instead of adding attributes to exceptions and exception guards to language -
| just design your exception classes better.

I disagree here. Fairly strongly, actually. For the record I am much
cooler on the except...if notion than I was yesterday, +0 or maybe +0.5.
BUT...

OSError and IOError are generally built on low level OS APIs, and
returning errno is a highly correct thing to do. It passes to us,
_with_ the exception (so it doesn't get masked by another library call,
as the global POSIX error is subject to), the actual OS-level failure
that was reported.

Likewise with the S3 exceptions and probably any other well designed
exception response to a library call with an informative failure code.

| We have multiple inheritance after all, the perfect method of classifying
| objects/exceptions, why should we code information about the exception class
| to some attribute?
| If some library unifies all types of exceptions in one 'S3ResponseError'
| exception - that's the problem of the library design.

No, not necessarily.
Having a ridiculous suite of a billion trite
subclasses to enumerate the return codes from a lower level (or more
"inner") library is just nuts.

The PEP class tree is handy to _group_ an assortment of failure
modes into small groups of the same flavour. But an exact one-to-one
between exception subclasses and errno values? Ghastly. It _doubles_
the cognitive burden on the dev and the code reader, because the
correspondence between the OS-level errno and the subclass names needs
to be kept in mind if the program cares about the OS level failure mode.
Which it does if it is bothering to make a fine grained decision at all.

It is all very well to offer an, um, rich suite of subclasses representing
various library failure modes. But to toss the _actual_ library failure
indicator value in favour of an arbitrary and possibly incomplete class
name list? Bad, really bad. The exception _should_ carry with it the
underlying library failure code if the library has such a thing.

| Big -1.

I'm not +1 any more, but still +.
--
Cameron Simpson

Judging by my employee ID# my employer thinks I am a small filing
cabinet, so I dont think they give a care about my opinions.
- Michael Jones

From joshua.landau.ws at gmail.com  Mon Sep 17 01:02:44 2012
From: joshua.landau.ws at gmail.com (Joshua Landau)
Date: Mon, 17 Sep 2012 00:02:44 +0100
Subject: [Python-ideas] syntax to continue into the next subsequent except block
In-Reply-To: <20120916223008.GA19129@cskk.homeip.net>
References: <20120916223008.GA19129@cskk.homeip.net>
Message-ID:

On 16 September 2012 23:30, Cameron Simpson wrote:

> On 16Sep2012 13:16, Yury Selivanov wrote:
> | On 2012-09-15, at 10:20 PM, Cameron Simpson wrote:
>
> | This thread started with IOError and its errno attribute, and for those
> | exact cases I find 'except .. if' approach quite useful.
But now, in > | 3.3 with PEP 3151 we have a much more granular exceptions tree, so > instead > | of writing > | > | except IOError as ex if ex.errno == errno.ENOENT: > | > | you will write: > | > | except FileNotFoundError as ex: > | > | And that's actually how this class of problems should be addressed: > | instead of adding attributes to exceptions and exception guards to > language - > | just design your exception classes better. > OSErrno and IOError are generally built on low level OS APIs, and > returning errno is a highly correct thing to do. It passes our, _with_ the > exception (so it doesn't get maked by another library call, as the global > POSIX error is subject to), the actual OS-level failure that was reported. > > Likewise with the S3 exceptions and probably any other well designed > exception response to a library call with an informative failure code. "Informative" failure code? FileNotFoundError(...) contains *exactly* the same information as OSError(errno.ENOENT, ....). A number is not an informative error code, and never will be. Additionally, I don't quite follow your first paragraph ("It passes our, _with_ the exception (so it doesn't get maked by another library call, as the global POSIX error is subject to),", but from what I can tell it seems extremely irrelevant to the end-programmer. I don't care what the OS-level failure is in terms of a number, I care in terms of the actual problem and what happened. It's not like we're receiving native data from the OS itself. You have a point that error codes can be informative. They *can* be. When you have a ValueError it could come from almost anywhere. However, it is a rare case for you to need to distinguish between these, and when you do it is normally for a reason specific enough that two subclasses can easily account for it all? 
I wouldn't mind (except that I mind being wrong :P) you showing me where you do have a sort of structure where you need to differentiate between many of the same error class yet cannot split it up, but until you do I don't believe that there's an analog case where this could help. | We have multiple inheritance after all, the perfect method of classifying > | objects/exceptions, why should we code information about the exception > class > | to some attribute? > | If some library unifies all types of exceptions in one 'S3ResponseError' > | exception - that's the problem of the library design. > > No, not necessarily. Having a ridiculous suite of a billion trite > subclasses to enumerate the return codes from a lower level (or more > "inner") library is just nuts. > > The PEP class tree is handy to _group_ an assortment of failure > modes into small groups of the same flavour. But an exact one-to-one > between exception subclasses and errno values? Ghastly. It _doubles_ > the cognitive burden on the dev and the code reader, because the > correspondence between the OS-level errno and the subclass names needs to > kept in mind if the program cares about the OS level failure mode. Which > it does if it is bothering to make a fine grained decision at all. > > It is all very well to offer an, um, rich suite of subclasses representing > various library failure modes. But to toss the _actual_ library failure > indicator value in favour of a arbitrary and possibly incomplete class > name list? Bad, really bad. The exception _should_ carry with it the > underlying library failure code if the library has such a thing. > As said above, how is Library.MathError(5) more arbitrary than Library.UncalculatableMathError()? If the number of errnos is large [n], then the cognitive burden is already large [n]. So if instead you have a large number [n] of error classes, how is the burden less [n == n]? 
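[For concreteness, the two styles being argued over in this thread can be put side by side. This is only an illustrative sketch with made-up names; the probed path is created fresh so it is guaranteed not to exist, and the `FileNotFoundError` branch needs Python 3.3+ / PEP 3151.]

```python
import errno
import os
import tempfile

# A path guaranteed not to exist: a fresh temp dir plus a never-created name.
path = os.path.join(tempfile.mkdtemp(), "missing")

# errno-attribute style (pre-3.3): catch the broad class, inspect the code.
try:
    os.stat(path)
except OSError as e:
    handled_by_errno = (e.errno == errno.ENOENT)

# named-subclass style (3.3+, PEP 3151): the same condition has a name.
try:
    os.stat(path)
except FileNotFoundError:
    handled_by_subclass = True

print(handled_by_errno, handled_by_subclass)  # -> True True
```

Both handlers fire on exactly the same OS-level condition; the difference is purely whether the dispatch key is a number or a class name.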
It doesn't add any real effort on any side as you needed to allocate the numbers anyway, as you need to know the numbers. Yes, if you have an incomplete name list you will suffer. But so what? Just cover all your bases. If you are wrapping a program from a lower-level language, wrap *everything you need*. It's no different to any other aspect of wrapping libraries. > | Big -1. > > I'm not +1 any more, but still +. > I'm negative, but not -1. The problem is: there are bad libraries. I think the stuff I mentioned already is a better solution though, and it seems it's not even my idea :). -------------- next part -------------- An HTML attachment was scrubbed... URL: From greg.ewing at canterbury.ac.nz Mon Sep 17 02:11:14 2012 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Mon, 17 Sep 2012 12:11:14 +1200 Subject: [Python-ideas] syntax to continue into the next subsequent except block In-Reply-To: References: <20120916022028.GA8527@cskk.homeip.net> Message-ID: <50566AA2.5000506@canterbury.ac.nz> Guido van Rossum wrote: > The suggestion to add ad-hoc "if " clauses to random parts > of the syntax doesn't appeal to me at all. I wouldn't call it a random part of the syntax. This is not like the proposals to add if-clauses to while loops, for loops, etc -- they would just be minor syntactic sugar. This proposal addresses something that is quite awkward to express using existing constructs. -- Greg From guido at python.org Mon Sep 17 06:20:55 2012 From: guido at python.org (Guido van Rossum) Date: Sun, 16 Sep 2012 21:20:55 -0700 Subject: [Python-ideas] syntax to continue into the next subsequent except block In-Reply-To: <50566AA2.5000506@canterbury.ac.nz> References: <20120916022028.GA8527@cskk.homeip.net> <50566AA2.5000506@canterbury.ac.nz> Message-ID: On Sun, Sep 16, 2012 at 5:11 PM, Greg Ewing wrote: > Guido van Rossum wrote: >> >> The suggestion to add ad-hoc "if " clauses to random parts >> of the syntax doesn't appeal to me at all. 
>
>
> I wouldn't call it a random part of the syntax. This is
> not like the proposals to add if-clauses to while loops,
> for loops, etc -- they would just be minor syntactic sugar.
> This proposal addresses something that is quite awkward to
> express using existing constructs.

It can address an important use case and still be a random syntax
change. I'm sure there are possible refactorings of the error handling
from the examples that make it a lot less awkward. I don't think I've
ever had a use case in my own code where I found it particularly
awkward that I couldn't jump from one except clause to the next; I do
remember some cases where I could simply write

try:
    <code>
except <SomeException>, err:
    if <condition>:
        raise  # re-raise err

This would possibly be combined with other except clauses but there
would be no need for the 'raise' to transfer to one of these.

--
--Guido van Rossum (python.org/~guido)

From steve at pearwood.info  Mon Sep 17 06:49:42 2012
From: steve at pearwood.info (Steven D'Aprano)
Date: Mon, 17 Sep 2012 14:49:42 +1000
Subject: [Python-ideas] syntax to continue into the next subsequent except block
In-Reply-To: <50566AA2.5000506@canterbury.ac.nz>
References: <20120916022028.GA8527@cskk.homeip.net> <50566AA2.5000506@canterbury.ac.nz>
Message-ID: <5056ABE6.5040206@pearwood.info>

On 17/09/12 10:11, Greg Ewing wrote:
> Guido van Rossum wrote:
>> The suggestion to add ad-hoc "if <condition>" clauses to random parts
>> of the syntax doesn't appeal to me at all.
>
> I wouldn't call it a random part of the syntax. This is
> not like the proposals to add if-clauses to while loops,
> for loops, etc -- they would just be minor syntactic sugar.
> This proposal addresses something that is quite awkward to
> express using existing constructs.

I don't think it is quite awkward.
Instead of the proposed:

try:
    something()
except ValueError as ex if condition(ex):
    spam()
except ValueError:
    # if not condition, fall through to the next except block
    ham()

this can be easily written as:

try:
    something()
except ValueError as ex:
    if condition(ex):
        spam()
    else:
        ham()

which costs you an indent level, which is not a big deal. (If your
code is so deeply nested that it is a big deal, it is already in
desperate need of refactoring.)

Instead of the original proposal to add a "continue" statement to
skip to the next except block:

try:
    something()
except HTTPError as ex:
    if condition(ex): continue  # skip to the next except clause
    spam()
except URLError as ex:
    ham()

we can do:

try:
    something()
except (HTTPError, URLError) as ex:
    # Technically, we don't even need to list HTTPError, since it
    # is a subclass it will be caught by URLError too.
    if type(ex) is URLError or condition(ex):
        ham()
    else:
        spam()

Have I missed any suggested use-cases?

I don't think any of the existing solutions are that awkward or
unpleasant to require new syntax. They're actually quite
straightforward.

--
Steven

From tjreedy at udel.edu  Mon Sep 17 08:29:06 2012
From: tjreedy at udel.edu (Terry Reedy)
Date: Mon, 17 Sep 2012 02:29:06 -0400
Subject: [Python-ideas] syntax to continue into the next subsequent except block
In-Reply-To: <50566AA2.5000506@canterbury.ac.nz>
References: <20120916022028.GA8527@cskk.homeip.net> <50566AA2.5000506@canterbury.ac.nz>
Message-ID:

On 9/16/2012 8:11 PM, Greg Ewing wrote:
> Guido van Rossum wrote:
>> The suggestion to add ad-hoc "if <condition>" clauses to random parts
>> of the syntax doesn't appeal to me at all.
>
> I wouldn't call it a random part of the syntax. This is
> not like the proposals to add if-clauses to while loops,
> for loops, etc -- they would just be minor syntactic sugar.
> This proposal addresses something that is quite awkward to
> express using existing constructs.

I see it as quite similar.
The proposal is to lift conditions out of the body of a compound statement and put them in the header. There is always the option to catch everything in one except statement and conditionally process or re-raise as desired. Having all exceptions be instances of named subclasses of one baseclass makes this easier than when exceptions were strings. -- Terry Jan Reedy From poalman at gmail.com Mon Sep 17 12:43:04 2012 From: poalman at gmail.com (Paul Wiseman) Date: Mon, 17 Sep 2012 11:43:04 +0100 Subject: [Python-ideas] syntax to continue into the next subsequent except block In-Reply-To: <5056ABE6.5040206@pearwood.info> References: <20120916022028.GA8527@cskk.homeip.net> <50566AA2.5000506@canterbury.ac.nz> <5056ABE6.5040206@pearwood.info> Message-ID: On 17 September 2012 05:49, Steven D'Aprano wrote: > On 17/09/12 10:11, Greg Ewing wrote: > >> Guido van Rossum wrote: >> >>> The suggestion to add ad-hoc "if " clauses to random parts >>> of the syntax doesn't appeal to me at all. >>> >> >> I wouldn't call it a random part of the syntax. This is >> not like the proposals to add if-clauses to while loops, >> for loops, etc -- they would just be minor syntactic sugar. >> This proposal addresses something that is quite awkward to >> express using existing constructs. >> > > I don't think it is quite awkward. Instead of the proposed: > > > try: > something() > except ValueError as ex if condition(ex): > spam() > except ValueError: > # if not condition, fall through to the next except block > ham() > > this can be easily written as: > > try: > something() > except ValueError as ex: > if condition(ex): > spam() > else: > ham() > > which costs you an indent level, which is not a big deal. (If your > code is so deeply nested that it is a big deal, it is already in > desperate need of refactoring.) 
>
> Instead of the original proposal to add a "continue" statement to
> skip to the next except block:
>
> try:
>     something()
> except HTTPError as ex:
>     if condition(ex): continue  # skip to the next except clause
>     spam()
> except URLError as ex:
>     ham()
>
> we can do:
>
> try:
>     something()
> except (HTTPError, URLError) as ex:
>     # Technically, we don't even need to list HTTPError, since it
>     # is a subclass it will be caught by URLError too.
>     if type(ex) is URLError or condition(ex):
>         ham()
>     else:
>         spam()
>
> Have I missed any suggested use-cases?

What I think made my original case a bit awkward was a general handler
for a parent exception class, and a specific except clause for a
subclass (it was IOError and Exception originally).

With your ValueError case, say you wanted to call ham on every case of
a parent exception to ValueError (I guess it could only be Exception
for this example), but not ValueError where condition was true. So how
would you write this without the except..if?

try:
    something()
except HTTPError as ex if condition(ex):
    spam()
except URLError:
    # if not condition, fall through to the next except block
    ham()

(I switched ValueError for HTTPError because it has more parent classes
than just Exception, because having 'except Exception' probably isn't
the greatest example)

> I don't think any of the existing solutions are that awkward or
> unpleasant to require new syntax. They're actually quite
> straightforward.
>
> --
> Steven
>
> _______________________________________________
> Python-ideas mailing list
> Python-ideas at python.org
> http://mail.python.org/mailman/listinfo/python-ideas

-------------- next part --------------
An HTML attachment was scrubbed...
URL: From poalman at gmail.com Mon Sep 17 12:51:30 2012 From: poalman at gmail.com (Paul Wiseman) Date: Mon, 17 Sep 2012 11:51:30 +0100 Subject: [Python-ideas] syntax to continue into the next subsequent except block In-Reply-To: References: <20120916022028.GA8527@cskk.homeip.net> <50566AA2.5000506@canterbury.ac.nz> Message-ID: On 17 September 2012 05:20, Guido van Rossum wrote: > On Sun, Sep 16, 2012 at 5:11 PM, Greg Ewing > wrote: > > Guido van Rossum wrote: > >> > >> The suggestion to add ad-hoc "if " clauses to random parts > >> of the syntax doesn't appeal to me at all. > > > > > > I wouldn't call it a random part of the syntax. This is > > not like the proposals to add if-clauses to while loops, > > for loops, etc -- they would just be minor syntactic sugar. > > This proposal addresses something that is quite awkward to > > express using existing constructs. > > It can address an important use case and still be a random syntax > change. I'm sure there are possible refactorings of the error handling > from the examples that make it a lot less awkward. I don't think I've > ever had a use case in my own code where I found it particularly > awkward that I couldn't jump from one except clause to the next; I do > remember some cases where I could simply write > > I conceded that my idea of jumping from one except clause to the next was a bad one, but with the except..if idea there would be no jumping; it would work the same as it currently does where an exception can only enter at most one except block in the same level of try..excepts. > try: > > except , err: > if : > raise # re-raise err > > > This would possibly be combined with other except clauses but the > would be no need for the 'raise' to transfer to one of these. 
>
> --
> --Guido van Rossum (python.org/~guido)
> _______________________________________________
> Python-ideas mailing list
> Python-ideas at python.org
> http://mail.python.org/mailman/listinfo/python-ideas

-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From steve at pearwood.info  Mon Sep 17 13:24:21 2012
From: steve at pearwood.info (Steven D'Aprano)
Date: Mon, 17 Sep 2012 21:24:21 +1000
Subject: [Python-ideas] syntax to continue into the next subsequent except block
In-Reply-To:
References: <20120916022028.GA8527@cskk.homeip.net> <50566AA2.5000506@canterbury.ac.nz> <5056ABE6.5040206@pearwood.info>
Message-ID: <50570865.9040105@pearwood.info>

On 17/09/12 20:43, Paul Wiseman wrote:
> with your ValueError case, say you wanted to call ham on every case of a
> parent exception to ValueError (I guess it could only be Exception for this
> example), but not ValueError where condition was true,
>
> so how would you write this without the except..if?
>
> try:
>     something()
> except HTTPError as ex if condition(ex):
>     spam()
> except URLError:
>     # if not condition, fall through to the next except block
>     ham()

try:
    something()
except URLError as ex:
    # will also catch HTTPError
    # if you want to be explicit, say "except (HTTPError, URLError) as ex"
    if isinstance(ex, HTTPError) and condition(ex):
        spam()
    else:
        ham()

> (I switched ValueError for HTTPError because it has more parent classes
> than just Exception, because having 'except Exception' probably isn't the
> greatest example)

Sure, no worries.
-- Steven From ncoghlan at gmail.com Mon Sep 17 14:13:56 2012 From: ncoghlan at gmail.com (Nick Coghlan) Date: Mon, 17 Sep 2012 22:13:56 +1000 Subject: [Python-ideas] syntax to continue into the next subsequent except block In-Reply-To: References: <20120916022028.GA8527@cskk.homeip.net> <50566AA2.5000506@canterbury.ac.nz> Message-ID: On Mon, Sep 17, 2012 at 8:51 PM, Paul Wiseman wrote: > I conceded that my idea of jumping from one except clause to the next was a > bad one, but with the except..if idea there would be no jumping; it would > work the same as it currently does where an exception can only enter at most > one except block in the same level of try..excepts. The key thing to remember is that the bar for new syntax is *very high*. The reason it was kicked around as a possibility in the PEP 3151 discussions is that checking errno values is really common and (more importantly) often not done correctly (leading to overbroad trapping of errors). PEP 3151 ultimately chose to solve the problem a different way (i.e. modifying the standard exception hierarchy) that didn't require new syntax. In so doing, it also gutted the case for this being a common problem. It *used to be* a common problem, but most cases of errno checking are now better handled by catching more specific OSError subclasses. It's simply not worth increasing the complexity of the except clause definition to deal with the new rare cases that are already amenable to other solutions, in particular just catching the superclass and then interrogating for additional details. Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From mal at egenix.com Mon Sep 17 14:39:13 2012 From: mal at egenix.com (M.-A. 
Lemburg)
Date: Mon, 17 Sep 2012 14:39:13 +0200
Subject: [Python-ideas] bdist naming scheme (compatibility tags) PEP
In-Reply-To:
References: <50236FD5.2060404@egenix.com>
Message-ID: <505719F1.8090405@egenix.com>

Daniel Holth wrote:
>>> Platform Tag
>>> ------------
>>>
>>> The platform tag is simply `distutils.util.get_platform()` with all
>>> hyphens `-` and periods `.` replaced with underscore `_`.
>>
>> This part is going to cause problems. distutils is good at identifying
>> Linux and Windows and giving them sensible platform names, but it
>> doesn't do a good job for other OSes.
>
> http://www.python.org/dev/peps/pep-0425/

I still don't think that referencing the distutils function in the PEP
is a good idea :-) It would be better to create a new helper.

> I've updated this part of the PEP with some examples of
> get_platform(),

The string for x86 Linux platforms usually reads "linux-i686".

Some other get_platform() examples:

macosx-10.4-fat              - Tiger, fat PPC/i386 build of Python
macosx-10.6-x86_64           - Snow Leopard, x64-only build of Python
freebsd-8.3-RELEASE-p3-i386  - FreeBSD 8.3, SP3, x86
freebsd-8.3-RELEASE-p3-amd64 - FreeBSD 8.3, SP3, x64
cygwin-1.7.9-i686            - Cygwin, x86

For Macs and other platforms that support fat builds it would be good
to have some form which allows defining which architectures are
included in the fat build, e.g. i386, ppc, x86_64.

For FreeBSD, the string could be reduced to remove the "RELEASE-p3-" bit.

It would probably be a good idea to develop a binary compatibility
checker package on PyPI first before hard coding these things into the PEP.
Such a package should offer two functions (sketching here): get_binary_platform_string() -> return a binary platform compatibility string for the current platform binary_package_compatible(platform_string) -> return True/False depending on whether the current platform is compatible with the given platform_string The package could then contain all the domain information needed for the various platforms. > and have simplified the "list of supported tags" > section by removing the "all older versions of Python with the same > major version" logic from the PEP. It is still allowed, but it is not > needed in the PEP. One note regarding adding more than one such tag to a file: adding those extra tags using dots (".") will make parsing the file name harder. It's probably better to separate them using a separator such as "_or_". > I would love to expound on the correct implementation of > get_platform() for all major platforms. I do not know anything about > the other platforms. A BSD and OSX expert will necessarily have to > write that part of the specification. See above :-) -- Marc-Andre Lemburg eGenix.com Professional Python Services directly from the Source (#1, Sep 17 2012) >>> Python/Zope Consulting and Support ... http://www.egenix.com/ >>> mxODBC.Zope.Database.Adapter ... http://zope.egenix.com/ >>> mxODBC, mxDateTime, mxTextTools ... http://python.egenix.com/ ________________________________________________________________________ 2012-10-29: PyCon DE 2012, Leipzig, Germany ... 42 days to go 2012-10-23: Python Meeting Duesseldorf ... 36 days to go 2012-08-28: Released mxODBC 3.2.0 ... http://egenix.com/go31 2012-08-20: Released mxODBC.Connect 2.0.0 ... http://egenix.com/go30 eGenix.com Software, Skills and Services GmbH Pastor-Loeh-Str.48 D-40764 Langenfeld, Germany. CEO Dipl.-Math. 
Marc-Andre Lemburg
Registered at Amtsgericht Duesseldorf: HRB 46611
http://www.egenix.com/company/contact/

From storchaka at gmail.com  Mon Sep 17 14:41:23 2012
From: storchaka at gmail.com (Serhiy Storchaka)
Date: Mon, 17 Sep 2012 15:41:23 +0300
Subject: [Python-ideas] Expansion of the range of small integers
Message-ID:

Now in the CPython small integer numbers from -5 up to 256 inclusive
are preallocated at the start. It allows to reduce memory consumption
and time of creation of the integers in this range. In particular this
affects the speed of short enumerations. Increasing the range to the
maximum (from -32767 to 32767 inclusive), we can speed up longer
enumerations.

Microbenchmarks:
./python -m timeit  "for i in range(10000): pass"
./python -m timeit -s "a=[0]*10000"  "for i, x in enumerate(a): pass"
./python -m timeit -s "a=[0]*10000" "i=0"  "for x in a: i+=1"
./python -m timeit -s "a=[0]*10000"  "for i in range(len(a)): x=a[i]"

Results:
  non-patched   patched
  530  usec     337  usec    57%
  1.06 msec     811  usec    31%
  1.34 msec     1.13 msec    19%
  1.42 msec     1.22 msec    16%

Shortcomings:

1) Memory consumption increases by constant 1-1.5 MB. Or half of it if
the range is expanded only in a positive direction. This is not a
problem on most modern computers. But would be better if the parameters
NSMALLPOSINTS and NSMALLNEGINTS have been configurable at build time.

2) A little bit larger Python start time. I was not able to measure
the difference, it is too small.

-------------- next part --------------
A non-text attachment was scrubbed...
Name: larger_small_ints.diff
Type: text/x-patch
Size: 1981 bytes
Desc: not available
URL:

From steve at pearwood.info  Mon Sep 17 15:09:58 2012
From: steve at pearwood.info (Steven D'Aprano)
Date: Mon, 17 Sep 2012 23:09:58 +1000
Subject: [Python-ideas] Expansion of the range of small integers
In-Reply-To:
References:
Message-ID: <50572126.6090900@pearwood.info>

On 17/09/12 22:41, Serhiy Storchaka wrote:
> Now in the CPython small integer numbers from -5 up to 256 inclusive
> are preallocated at the start. It allows to reduce memory consumption
> and time of creation of the integers in this range. In particular this
> affects the speed of short enumerations. Increasing the range to the
> maximum (from -32767 to 32767 inclusive), we can speed up longer
> enumerations.
[...]
> Shortcomings:
>
> 1) Memory consumption increases by constant 1-1.5 MB. Or half of it if
> the range is expanded only in a positive direction. This is not a
> problem on most modern computers. But would be better if the parameters
> NSMALLPOSINTS and NSMALLNEGINTS have been configurable at build time.
>
> 2) A little bit larger Python start time. I was not able to measure the
> difference, it is too small.

There is an additional consequence of this proposed change. I'm not sure
if this counts as an argument for, or against, the change, but beginners
and even some experienced programmers often find the difference between
identity and equality hard to deal with. Caching of small integers
already blurs the distinction:

py> a = 42
py> b = 42
py> a is b
True

Extending that behaviour up to 32767 will further blur the distinction.
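[The point above can be demonstrated in a few lines. This is CPython-specific behaviour, not a language guarantee; `int("257")` is used to defeat the compiler's constant folding so that the two larger values really are built at runtime.]

```python
a, b = 256, 256          # both ends of the cached [-5, 256] range
big_a = int("257")       # constructed at runtime, outside the cache
big_b = int("257")

print(a is b)            # True: both names refer to the one cached 256
print(big_a == big_b)    # True: equal values...
print(big_a is big_b)    # False: ...but two distinct objects
```

Under the proposed patch the third line would start printing True up to 32767, which is exactly the blurring being discussed.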
-- Steven From solipsis at pitrou.net Mon Sep 17 15:29:01 2012 From: solipsis at pitrou.net (Antoine Pitrou) Date: Mon, 17 Sep 2012 15:29:01 +0200 Subject: [Python-ideas] Expansion of the range of small integers References: Message-ID: <20120917152901.21b3ec1b@pitrou.net> On Mon, 17 Sep 2012 15:41:23 +0300 Serhiy Storchaka wrote: > Now in the CPython small integer numbers from -5 up to 256 inclusive are > preallocated at the start. It allows to reduce memory consumption and > time of creation of the integers in this range. In particular this > affects the speed of short enumerations. Increasing the range to the > maximum (from -32767 to 32767 inclusive), we can speed up longer > enumerations. > > Microbenchmarks: > ./python -m timeit "for i in range(10000): pass" > ./python -m timeit -s "a=[0]*10000" "for i, x in enumerate(a): pass" > ./python -m timeit -s "a=[0]*10000" "i=0" "for x in a: i+=1" > ./python -m timeit -s "a=[0]*10000" "for i in range(len(a)): x=a[i]" > > Results: > non-patched patched > 530 usec 337 usec 57% > 1.06 msec 811 usec 31% > 1.34 msec 1.13 msec 19% > 1.42 msec 1.22 msec 16% See also http://bugs.python.org/issue10044 > 1) Memory consumption increases by constant 1-1.5 MB. That sounds a bit annoying. Is it for a 32-bit or 64-bit build? Regards Antoine. -- Software development and contracting: http://pro.pitrou.net From storchaka at gmail.com Mon Sep 17 15:29:44 2012 From: storchaka at gmail.com (Serhiy Storchaka) Date: Mon, 17 Sep 2012 16:29:44 +0300 Subject: [Python-ideas] Expansion of the range of small integers In-Reply-To: References: Message-ID: Oops, sorry, I have included a few extra minor changes in the patch. Here is the correct patch. -------------- next part -------------- A non-text attachment was scrubbed... 
Name: larger_small_ints.diff
Type: text/x-patch
Size: 471 bytes
Desc: not available
URL:

From storchaka at gmail.com  Mon Sep 17 15:40:29 2012
From: storchaka at gmail.com (Serhiy Storchaka)
Date: Mon, 17 Sep 2012 16:40:29 +0300
Subject: [Python-ideas] Expansion of the range of small integers
In-Reply-To: <50572126.6090900@pearwood.info>
References: <50572126.6090900@pearwood.info>
Message-ID:

On 17.09.12 16:09, Steven D'Aprano wrote:
> Extending that behaviour up to 32767 will further blur the distinction.

This is not an argument either for or against. Beginners will always
find something to be discouraged by.

>>> 1000000 is 1000000
True
>>> a = 1000000
>>> b = 1000000
>>> a is b
False

"I think it's better to give users the rope they want than to try and
prevent them from hanging themselves, since otherwise they'll just use
the power cords as ropes and electrocute themselves". (GvR)

From storchaka at gmail.com  Mon Sep 17 18:35:01 2012
From: storchaka at gmail.com (Serhiy Storchaka)
Date: Mon, 17 Sep 2012 19:35:01 +0300
Subject: [Python-ideas] Expansion of the range of small integers
In-Reply-To: <20120917152901.21b3ec1b@pitrou.net>
References: <20120917152901.21b3ec1b@pitrou.net>
Message-ID:

On 17.09.12 16:29, Antoine Pitrou wrote:
> See also http://bugs.python.org/issue10044

This is interesting. But the main trick causes undefined behavior.

>> 1) Memory consumption increases by constant 1-1.5 MB.
>
> That sounds a bit annoying. Is it for a 32-bit or 64-bit build?

For a 32-bit build it is 14*(2**16-257-5) = 913836 B = 0.87 MiB. For a
64-bit build it should be twice as large (1.74 MiB). That is assuming
you want to expand the range to the portable maximum.
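[The arithmetic above can be checked against any given build with `sys.getsizeof`. A rough sketch: 28 bytes per one-digit int is typical of 64-bit CPython 3.x, while the 14 bytes used in the thread corresponds to a 32-bit build; the exact figures vary by build.]

```python
import sys

# Extra ints to preallocate if the cache grows from [-5, 256] to [-32767, 32767].
extra_ints = (2 * 32767 + 1) - (5 + 256 + 1)
per_int = sys.getsizeof(1)              # size of a one-digit int on this build
extra_mib = extra_ints * per_int / 2**20

print(f"{extra_ints} extra cached ints, ~{extra_mib:.2f} MiB on this build")
```

With per_int = 28 this reproduces the ~1.74 MiB figure quoted for 64-bit builds.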
From tjreedy at udel.edu  Mon Sep 17 19:49:07 2012
From: tjreedy at udel.edu (Terry Reedy)
Date: Mon, 17 Sep 2012 13:49:07 -0400
Subject: [Python-ideas] Expansion of the range of small integers
In-Reply-To:
References:
Message-ID:

On 9/17/2012 8:41 AM, Serhiy Storchaka wrote:
> Now in the CPython small integer numbers from -5 up to 256 inclusive are
> preallocated at the start. It allows to reduce memory consumption and
> time of creation of the integers in this range. In particular this
> affects the speed of short enumerations. Increasing the range to the
> maximum (from -32767 to 32767 inclusive), we can speed up longer
> enumerations.

In 2.x before 3.0, the range was about -5 to 10 or so ;-).
It was expanded when bytes were added.

It might be interesting to instrument the int allocator to count
allocations of ints up to say 10000 in real apps.

> Microbenchmarks:
> ./python -m timeit  "for i in range(10000): pass"
> ./python -m timeit -s "a=[0]*10000"  "for i, x in enumerate(a): pass"
> ./python -m timeit -s "a=[0]*10000" "i=0"  "for x in a: i+=1"
> ./python -m timeit -s "a=[0]*10000"  "for i in range(len(a)): x=a[i]"
>
> Results:
>   non-patched   patched
>   530  usec     337  usec    57%
>   1.06 msec     811  usec    31%
>   1.34 msec     1.13 msec    19%
>   1.42 msec     1.22 msec    16%
>
> Shortcomings:
>
> 1) Memory consumption increases by constant 1-1.5 MB. Or half of it if
> the range is expanded only in a positive direction. This is not a
> problem on most modern computers. But would be better if the parameters
> NSMALLPOSINTS and NSMALLNEGINTS have been configurable at build time.

They are -- by patching as you did ;-). The general philosophy seems to
be to discourage user tuning by not making it too easy.

>
> 2) A little bit larger Python start time. I was not able to measure the
> difference, it is too small.

What is hard to guess is the effect on cache hits and misses in real apps.
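[The cache effect being benchmarked can also be probed crudely from pure Python, by timing arithmetic whose results land inside versus outside the cached range. A sketch only: absolute numbers vary by machine and build, and CPython's int freelist keeps the gap small.]

```python
import timeit

# 100 + 100 = 200 stays inside the cached [-5, 256] range, so each result
# is fetched from the cache; 300 + 300 = 600 falls outside it, so a fresh
# int object is created on every iteration.
cached = timeit.timeit("a + b", setup="a, b = 100, 100", number=10**6)
fresh = timeit.timeit("a + b", setup="a, b = 300, 300", number=10**6)

print(f"result cached:    {cached:.3f} s")
print(f"result allocated: {fresh:.3f} s")
```

The same probe run on a patched interpreter with a widened cache should show the second timing move toward the first.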
-- Terry Jan Reedy From christian at python.org Mon Sep 17 20:28:41 2012 From: christian at python.org (Christian Heimes) Date: Mon, 17 Sep 2012 20:28:41 +0200 Subject: [Python-ideas] Expansion of the range of small integers In-Reply-To: References: Message-ID: Am 17.09.2012 19:49, schrieb Terry Reedy: > In 2.x before 3.0, the range was about -5 to 10 or so ;-). > It was expanded when bytes were added. 2.2 had -1 to 99. The numbers grew slowly to -5 to 256 in 2.6. > It might be interesting to instrument the int allocator to count > allocations of ints up to say 10000 in real apps. You can easily test how different settings affect the interpreter with some compiler flags: CFLAGS="-DCOUNT_ALLOCS=1 -DNSMALLPOSINTS=10000 -DNSMALLNEGINTS=10000" ./configure && make COUNT_ALLOCS slows down the interpreter a bit. It prints some stats at shutdown. > What is hard to guess is the effect on cache hits and misses in real apps. Real apps may even save memory when they use lots of ints > 256. Christian From jimjjewett at gmail.com Mon Sep 17 21:33:20 2012 From: jimjjewett at gmail.com (Jim Jewett) Date: Mon, 17 Sep 2012 15:33:20 -0400 Subject: [Python-ideas] syntax to continue into the next subsequent except block In-Reply-To: References: <20120916022028.GA8527@cskk.homeip.net> <50566AA2.5000506@canterbury.ac.nz> Message-ID: On 9/17/12, Guido van Rossum wrote: > I don't think I've > ever had a use case in my own code where I found it particularly > awkward that I couldn't jump from one except clause to the next; I have had cases where I wanted to either (1) Say "oops, not really a match, keep looking at the other except clauses" The except ... if answers this, but I do think virtual subclasses would be a better solution. In my case, I would have created a subclass for file errors that could be fixed automatically. (2) If there is an exception [of such and such a type] do X, but sometimes *also* do Y. 
The continue could answer this, but I'm not convinced it would smell any less than the current workarounds. -jJ From jimjjewett at gmail.com Mon Sep 17 21:39:50 2012 From: jimjjewett at gmail.com (Jim Jewett) Date: Mon, 17 Sep 2012 15:39:50 -0400 Subject: [Python-ideas] syntax to continue into the next subsequent except block In-Reply-To: References: <20120916223008.GA19129@cskk.homeip.net> Message-ID: On 9/16/12, Joshua Landau wrote: > On 16 September 2012 23:30, Cameron Simpson wrote: >> Having a ridiculous suite of a billion trite >> subclasses to enumerate the return codes from a lower level (or more >> "inner") library is just nuts. If the library created them, it is distinguishing between them -- regardless of whether it distinguishes by name or by number. > As said above, how is Library.MathError(5) more arbitrary than > Library.UncalculatableMathError()? Numbers are more likely to get shifted by accident when someone adds a new value. But the point isn't that codes are more arbitrary -- it is that a name is more helpful when debugging. > Yes, if you have an incomplete name list you will suffer. But so what? Just > cover all your bases. If you are wrapping a program from a lower-level > language, wrap *everything you need*. It's no different to any other aspect > of wrapping libraries. And, more to the point, wrap *only* what you need. If you were providing the sole wrapper for a library, then you might have a fairly long translation list. But if you're just using the library, only create virtual subclasses for the conditions that you happen to care about, and name them based on why they matter. 
-jJ From joshua.landau.ws at gmail.com Mon Sep 17 22:00:42 2012 From: joshua.landau.ws at gmail.com (Joshua Landau) Date: Mon, 17 Sep 2012 21:00:42 +0100 Subject: [Python-ideas] syntax to continue into the next subsequent except block In-Reply-To: References: <20120916223008.GA19129@cskk.homeip.net> Message-ID: On 17 September 2012 20:39, Jim Jewett wrote: > On 9/16/12, Joshua Landau wrote: > > On 16 September 2012 23:30, Cameron Simpson wrote: > > As said above, how is Library.MathError(5) more arbitrary than > > Library.UncalculatableMathError()? > > Numbers are more likely to get shifted by accident when someone adds a > new value. > > But the point isn't that codes are more arbitrary -- it is that a name > is more helpful when debugging. And I totally agree. My original post was meant to say "how is Library.UncalculableMathError() more arbitrary than Library.MathError(5)?" as a rebuttal to the claim that they are. Thanks for catching that. > > Yes, if you have an incomplete name list you will suffer. But so what? > Just > > cover all your bases. If you are wrapping a program from a lower-level > > language, wrap *everything you need*. It's no different to any other > aspect > > of wrapping libraries. > > And, more to the point, wrap *only* what you need. If you were > providing the sole wrapper for a library, then you might have a fairly > long translation list. But if you're just using the library, only > create virtual subclasses for the conditions that you happen to care > about, and name them based on why they matter. > -------------- next part -------------- An HTML attachment was scrubbed...
URL: From storchaka at gmail.com Mon Sep 17 22:44:37 2012 From: storchaka at gmail.com (Serhiy Storchaka) Date: Mon, 17 Sep 2012 23:44:37 +0300 Subject: [Python-ideas] Expansion of the range of small integers In-Reply-To: References: Message-ID: On 17.09.12 20:49, Terry Reedy wrote: > It might be interesting to instrument the int allocator to count > allocations of ints up to say 10000 in real apps. Here is a patch. Apply it, rebuild (CFLAGS=-DCOUNT_ALLOCS=1 ./configure && make) and run your favorite Python 3 apps. -------------- next part -------------- A non-text attachment was scrubbed... Name: count_small_int_allocs.diff Type: text/x-patch Size: 2364 bytes Desc: not available URL: From jimjjewett at gmail.com Tue Sep 18 02:14:09 2012 From: jimjjewett at gmail.com (Jim Jewett) Date: Mon, 17 Sep 2012 20:14:09 -0400 Subject: [Python-ideas] Expansion of the range of small integers In-Reply-To: References: Message-ID: On 9/17/12, Serhiy Storchaka wrote: > Now in the CPython small integer numbers from -5 up to 256 inclusive are > preallocated at the start. It allows to reduce memory consumption and > time of creation of the integers in this range. In particular this > affects the speed of short enumerations. Increasing the range to the > maximum (from -32767 to 32767 inclusive), we can speed up longer > enumerations. On the other hand, you'll add to the memory pressure; instead of putting the small integers and some other important objects in a single page that probably stays loaded, there will be several different pages to page in and out if they are useful, and to waste startup time if they aren't. -jJ From stephen at xemacs.org Tue Sep 18 03:08:54 2012 From: stephen at xemacs.org (Stephen J. 
Turnbull) Date: Tue, 18 Sep 2012 10:08:54 +0900 Subject: [Python-ideas] syntax to continue into the next subsequent except block In-Reply-To: <5056ABE6.5040206@pearwood.info> References: <20120916022028.GA8527@cskk.homeip.net> <50566AA2.5000506@canterbury.ac.nz> <5056ABE6.5040206@pearwood.info> Message-ID: <87r4q0p26x.fsf@uwakimon.sk.tsukuba.ac.jp> Steven D'Aprano writes: > I don't think [the existing syntax] is quite awkward. Of course it is. It doesn't fit the semantics, and that is why it is awkward. An exception should be caught by an 'except' clause, not by a conditional statement. As Antoine's refactoring demonstrates, these Exceptions that are differentiated by an internal errno are often conceptually a variety of different exceptions. I agree with the suggestion that really we should fix up other Exceptions that are clearly quite heterogeneous by subclassing more precise exceptions from them. But this isn't always possible, and of course may not be backward compatible. I don't know how to do it Pythonically, but it would be nice if there were some way to "subclass Exceptions on the fly". (other) Steve From ncoghlan at gmail.com Tue Sep 18 08:55:16 2012 From: ncoghlan at gmail.com (Nick Coghlan) Date: Tue, 18 Sep 2012 16:55:16 +1000 Subject: [Python-ideas] syntax to continue into the next subsequent except block In-Reply-To: <87r4q0p26x.fsf@uwakimon.sk.tsukuba.ac.jp> References: <20120916022028.GA8527@cskk.homeip.net> <50566AA2.5000506@canterbury.ac.nz> <5056ABE6.5040206@pearwood.info> <87r4q0p26x.fsf@uwakimon.sk.tsukuba.ac.jp> Message-ID: On Tue, Sep 18, 2012 at 11:08 AM, Stephen J. Turnbull wrote: > I don't know how to do it Pythonically, but it would be nice if there > were some way to "subclass Exceptions on the fly". Most likely, this will mean fixing the bug that means the ABC machinery is currently being ignored by the exception machinery. Then you can do whatever you want to reshape exception hierarchies. Cheers, Nick. 
-- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia From ckaynor at zindagigames.com Tue Sep 18 19:00:06 2012 From: ckaynor at zindagigames.com (Chris Kaynor) Date: Tue, 18 Sep 2012 10:00:06 -0700 Subject: [Python-ideas] syntax to continue into the next subsequent except block In-Reply-To: References: <20120916022028.GA8527@cskk.homeip.net> <50566AA2.5000506@canterbury.ac.nz> <5056ABE6.5040206@pearwood.info> <87r4q0p26x.fsf@uwakimon.sk.tsukuba.ac.jp> Message-ID: On Mon, Sep 17, 2012 at 11:55 PM, Nick Coghlan wrote: > On Tue, Sep 18, 2012 at 11:08 AM, Stephen J. Turnbull > wrote: >> I agree with the suggestion that really we should fix up other Exceptions that are clearly quite heterogeneous by subclassing more precise exceptions from them. >> But this isn't always possible, and of course may not be backward compatible. If done well, it should be fully backwards compatible. There is no reason that you cannot keep the errno (or however it's named on the specific exception class) while still subclassing the exception. You could have issues with pickled exceptions if you add more details to the exceptions, and it is not forward compatible: newer code that gets the older form of exceptions is liable to break, without extreme care to update the exceptions. >> I don't know how to do it Pythonically, but it would be nice if there >> were some way to "subclass Exceptions on the fly". > > Most likely, this will mean fixing the bug that means the ABC > machinery is currently being ignored by the exception machinery. Then > you can do whatever you want to reshape exception hierarchies. One trick we've done at work in a few cases is to put a fairly thin wrapper around the low-level APIs that wraps the exceptions into subclasses. Often, this can be implemented as a simple decorator that then can be used on a series of functions easily. > > Cheers, > Nick.
> > -- > Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas From dholth at gmail.com Tue Sep 18 21:31:27 2012 From: dholth at gmail.com (Daniel Holth) Date: Tue, 18 Sep 2012 15:31:27 -0400 Subject: [Python-ideas] bdist naming scheme (compatibility tags) PEP In-Reply-To: <505719F1.8090405@egenix.com> References: <50236FD5.2060404@egenix.com> <505719F1.8090405@egenix.com> Message-ID: On Mon, Sep 17, 2012 at 8:39 AM, M.-A. Lemburg wrote: > Daniel Holth wrote: >>>> Platform Tag >>>> ------------ >>>> >>>> The platform tag is simply `distutils.util.get_platform()` with all >>>> hyphens `-` and periods `.` replaced with underscore `_`. >>> >>> This part is going to cause problems. distutils is good at identifying >>> Linux and Windows and giving them sensible platform names, but it >>> doesn't do a good job for other OSes. >> >> http://www.python.org/dev/peps/pep-0425/ > > I still don't think that referencing the distutils function > in the PEP is a good idea :-) It would be better to create > a new helper. How about just sysconfig.get_platform()? From mal at egenix.com Wed Sep 19 08:40:53 2012 From: mal at egenix.com (M.-A. Lemburg) Date: Wed, 19 Sep 2012 08:40:53 +0200 Subject: [Python-ideas] bdist naming scheme (compatibility tags) PEP In-Reply-To: References: <50236FD5.2060404@egenix.com> <505719F1.8090405@egenix.com> Message-ID: <505968F5.80702@egenix.com> Daniel Holth wrote: > On Mon, Sep 17, 2012 at 8:39 AM, M.-A. Lemburg wrote: >> Daniel Holth wrote: >>>>> Platform Tag >>>>> ------------ >>>>> >>>>> The platform tag is simply `distutils.util.get_platform()` with all >>>>> hyphens `-` and periods `.` replaced with underscore `_`. >>>> >>>> This part is going to cause problems. 
distutils is good at identifying >>>> Linux and Windows and giving them sensible platform names, but it >>>> doesn't do a good job for other OSes. >>> >>> http://www.python.org/dev/peps/pep-0425/ >> >> I still don't think that referencing the distutils function >> in the PEP is a good idea :-) It would be better to create >> a new helper. > > How about just sysconfig.get_platform()? That's essentially the same function :-) For some reason it's a copy of the one in distutils.util. I guess an oversight when sysconfig was created from various parts of distutils. Both functions are not suitable for the intended purpose, namely providing enough information to detect binary compatibility. Given that such information changes more often than we do Python releases and that this information is domain specific, I think it's better to maintain a pair of functions for creating such a platform string and detecting binary compatibility in a separate PyPI module which can then be pulled in by packaging and installer tools. The right place for the logic would be the platform module which was created in much the same way. Like with the above module, it was crowd-sourced to integrate domain specific knowledge. We added it to the stdlib after it stabilized. This is both good and bad. The good part is that it comes with Python automatically, the bad part that 3rd party code relying on it now has to deal with several different versions (for each Python release) and that platform changes are difficult to get into the module. This is why I think the PEP should just reference such a new module and leave the string format and binary compatibility check details to the module, rather than spell it out in the PEP. -- Marc-Andre Lemburg eGenix.com Professional Python Services directly from the Source (#1, Sep 19 2012) >>> Python/Zope Consulting and Support ... http://www.egenix.com/ >>> mxODBC.Zope.Database.Adapter ... http://zope.egenix.com/ >>> mxODBC, mxDateTime, mxTextTools ... 
http://python.egenix.com/ ________________________________________________________________________ 2012-10-29: PyCon DE 2012, Leipzig, Germany ... 40 days to go 2012-10-23: Python Meeting Duesseldorf ... 34 days to go 2012-09-18: Released mxODBC Zope DA 2.1.0 ... http://egenix.com/go32 2012-08-28: Released mxODBC 3.2.0 ... http://egenix.com/go31 eGenix.com Software, Skills and Services GmbH Pastor-Loeh-Str.48 D-40764 Langenfeld, Germany. CEO Dipl.-Math. Marc-Andre Lemburg Registered at Amtsgericht Duesseldorf: HRB 46611 http://www.egenix.com/company/contact/ From dholth at gmail.com Wed Sep 19 14:46:19 2012 From: dholth at gmail.com (Daniel Holth) Date: Wed, 19 Sep 2012 08:46:19 -0400 Subject: [Python-ideas] bdist naming scheme (compatibility tags) PEP In-Reply-To: <505968F5.80702@egenix.com> References: <50236FD5.2060404@egenix.com> <505719F1.8090405@egenix.com> <505968F5.80702@egenix.com> Message-ID: On Wed, Sep 19, 2012 at 2:40 AM, M.-A. Lemburg wrote: > Daniel Holth wrote: >> On Mon, Sep 17, 2012 at 8:39 AM, M.-A. Lemburg wrote: >>> Daniel Holth wrote: >>>>>> Platform Tag >>>>>> ------------ >>>>>> >>>>>> The platform tag is simply `distutils.util.get_platform()` with all >>>>>> hyphens `-` and periods `.` replaced with underscore `_`. >>>>> >>>>> This part is going to cause problems. distutils is good at identifying >>>>> Linux and Windows and giving them sensible platform names, but it >>>>> doesn't do a good job for other OSes. >>>> >>>> http://www.python.org/dev/peps/pep-0425/ >>> >>> I still don't think that referencing the distutils function >>> in the PEP is a good idea :-) It would be better to create >>> a new helper. >> >> How about just sysconfig.get_platform()? > > That's essentially the same function :-) For some reason it's a copy > of the one in distutils.util. I guess an oversight when sysconfig > was created from various parts of distutils. 
> > Both functions are not suitable for the intended purpose, namely > providing enough information to detect binary compatibility. > > Given that such information changes more often than we do Python > releases and that this information is domain specific, I think it's > better to maintain a pair of functions for creating such a platform > string and detecting binary compatibility in a separate PyPI module > which can then be pulled in by packaging and installer tools. > > The right place for the logic would be the platform module > which was created in much the same way. Like with the above > module, it was crowd-sourced to integrate domain specific > knowledge. We added it to the stdlib after it stabilized. > This is both good and bad. The good part is that it comes > with Python automatically, the bad part that 3rd party code > relying on it now has to deal with several different versions > (for each Python release) and that platform changes are difficult > to get into the module. > > This is why I think the PEP should just reference such a new > module and leave the string format and binary compatibility > check details to the module, rather than spell it out in the > PEP. The current implementation is at https://bitbucket.org/dholth/wheel In wheel/bdist_wheel.py and wheel/util.py From gerald.britton at gmail.com Wed Sep 19 18:36:10 2012 From: gerald.britton at gmail.com (Gerald Britton) Date: Wed, 19 Sep 2012 12:36:10 -0400 Subject: [Python-ideas] Why is there no way to pass PYTHONPATH on the command line? Message-ID: FYI This works in Windows: C:\Python26>set PYTHONPATH=c:\temp & python Python 2.6.4 (r264:75708, Oct 26 2009, 08:23:19) [MSC v.1500 32 bit (Intel)] on win32 Type "help", "copyright", "credits" or "license" for more information. 
>>> import sys >>> sys.path ['', 'c:\\temp', 'C:\\WINDOWS\\system32\\python26.zip', 'C:\\Python26\\DLLs', 'C :\\Python26\\lib', 'C:\\Python26\\lib\\plat-win', 'C:\\Python26\\lib\\lib-tk', ' C:\\Python26', 'C:\\Python26\\lib\\site-packages', 'C:\\Python26\\lib\\site-pack ages\\wx-2.8-msw-unicode'] >>> -- Gerald Britton -------------- next part -------------- An HTML attachment was scrubbed... URL: From p.f.moore at gmail.com Wed Sep 19 20:48:35 2012 From: p.f.moore at gmail.com (Paul Moore) Date: Wed, 19 Sep 2012 19:48:35 +0100 Subject: [Python-ideas] Why is there no way to pass PYTHONPATH on the command line? In-Reply-To: References: Message-ID: On 19 September 2012 17:36, Gerald Britton wrote: > FYI This works in Windows: > > C:\Python26>set PYTHONPATH=c:\temp & python > Python 2.6.4 (r264:75708, Oct 26 2009, 08:23:19) [MSC v.1500 32 bit (Intel)] > on > win32 > Type "help", "copyright", "credits" or "license" for more information. >>>> import sys >>>> sys.path > ['', 'c:\\temp', 'C:\\WINDOWS\\system32\\python26.zip', > 'C:\\Python26\\DLLs', 'C > :\\Python26\\lib', 'C:\\Python26\\lib\\plat-win', > 'C:\\Python26\\lib\\lib-tk', ' > C:\\Python26', 'C:\\Python26\\lib\\site-packages', > 'C:\\Python26\\lib\\site-pack > ages\\wx-2.8-msw-unicode'] But it leaves PYTHONPATH set afterwards, and it overwrites any PYTHONPATH setting you may have had before. D:\Data>set PY PYTHONPATH=C:\Temp D:\Data>(set PYTHONPATH=D:\ & py -3.2) Python 3.2.3 (default, Apr 11 2012, 07:15:24) [MSC v.1500 32 bit (Intel)] on win32 Type "help", "copyright", "credits" or "license" for more information. >>> ^Z D:\Data>set PY PYTHONPATH=D:\ Paul. From gerald.britton at gmail.com Wed Sep 19 21:02:45 2012 From: gerald.britton at gmail.com (Gerald Britton) Date: Wed, 19 Sep 2012 15:02:45 -0400 Subject: [Python-ideas] Why is there no way to pass PYTHONPATH on the command line? 
Message-ID: On 19 September 2012 17:36, Gerald Britton wrote: > FYI This works in Windows: > > C:\Python26>set PYTHONPATH=c:\temp & python > Python 2.6.4 (r264:75708, Oct 26 2009, 08:23:19) [MSC v.1500 32 bit (Intel)] > on > win32 > Type "help", "copyright", "credits" or "license" for more information. >>>> import sys >>>> sys.path > ['', 'c:\\temp', 'C:\\WINDOWS\\system32\\python26.zip', 'C:\\Python26\\DLLs', 'C:\\Python26\\lib', 'C:\\Python26\\lib\\plat-win', 'C:\\Python26\\lib\\lib-tk', 'C:\\Python26', 'C:\\Python26\\lib\\site-packages', 'C:\\Python26\\lib\\site-packages\\wx-2.8-msw-unicode'] > > But it leaves PYTHONPATH set afterwards, and it overwrites any > PYTHONPATH setting you may have had before. Sure, but that's easy to remedy. Just add "& set PYTHONPATH=" to the end. If you want to get fancy, save the current value in a temporary variable and use it later to reset the value. We're really dealing with a limitation of the Windows Shell. Things are somewhat better in PowerShell but it's still not like BASH. Probably better to write a little script that saves the variable PYTHONPATH, sets it as desired, calls Python, then resets it afterwards. -- Gerald Britton -------------- next part -------------- An HTML attachment was scrubbed... URL: From gerald.britton at gmail.com Wed Sep 19 21:13:39 2012 From: gerald.britton at gmail.com (Gerald Britton) Date: Wed, 19 Sep 2012 15:13:39 -0400 Subject: [Python-ideas] Why is there no way to pass PYTHONPATH on the command line? In-Reply-To: References: Message-ID: BTW, this works: PS C:\Python26> powershell {Set-Item -Path env:PYTHONPATH -value C:\temp; c:\python26\python} Python 2.6.4 (r264:75708, Oct 26 2009, 08:23:19) [MSC v.1500 32 bit (Intel)] on win32 Type "help", "copyright", "credits" or "license" for more information.
>>> import sys >>> sys.path ['', 'C:\\temp', 'C:\\WINDOWS\\system32\\python26.zip', 'c:\\python26\\DLLs', 'c:\\python26\\lib', 'c:\\python26\\lib\\plat-win', 'c:\\python26\\lib\\lib-tk', 'c:\\python26', 'c:\\python26\\lib\\site-packages', 'c:\\python26\\lib\\site-packages\\wx-2.8-msw-unicode'] >>> ^Z PS C:\Python26> ls env:PYTHONPATH Get-ChildItem : Cannot find path 'PYTHONPATH' because it does not exist. At line:1 char:3 + ls <<<< env:PYTHONPATH + CategoryInfo : ObjectNotFound: (PYTHONPATH:String) [Get-ChildItem], ItemNotFoundException + FullyQualifiedErrorId : PathNotFound,Microsoft.PowerShell.Commands.GetChildItemCommand PS C:\Python26> On Wed, Sep 19, 2012 at 3:02 PM, Gerald Britton wrote: > On 19 September 2012 17:36, Gerald Britton wrote: > > FYI This works in Windows: > > > > C:\Python26>set PYTHONPATH=c:\temp & python > > Python 2.6.4 (r264:75708, Oct 26 2009, 08:23:19) [MSC v.1500 32 bit (Intel)] > > on > > win32 > > Type "help", "copyright", "credits" or "license" for more information. > >>>> import sys > >>>> sys.path > > ['', 'c:\\temp', 'C:\\WINDOWS\\system32\\python26.zip', 'C:\\Python26\\DLLs', 'C:\\Python26\\lib', 'C:\\Python26\\lib\\plat-win', 'C:\\Python26\\lib\\lib-tk', 'C:\\Python26', 'C:\\Python26\\lib\\site-packages', 'C:\\Python26\\lib\\site-packages\\wx-2.8-msw-unicode'] >> >> But it leaves PYTHONPATH set afterwards, and it overwrites any >> PYTHONPATH setting you may have had before. > > Sure, but that's easy to remedy. Just add "& set PYTHONPATH=" to the end. If you want to get fancy, save the current value in a temporary variable and use it later to reset the value. > > We're really dealing with a limitation of the Windows Shell. Things are somewhat better in PowerShell but it's still not like BASH. > > Probably better to write a little script that saves the variable PYTHONPATH, sets it as desired, calls Python, then resets it afterwards.
> > -- > Gerald Britton > -- Gerald Britton -------------- next part -------------- An HTML attachment was scrubbed... URL: From stephen at xemacs.org Thu Sep 20 11:17:34 2012 From: stephen at xemacs.org (Stephen J. Turnbull) Date: Thu, 20 Sep 2012 18:17:34 +0900 Subject: [Python-ideas] syntax to continue into the next subsequent except block In-Reply-To: References: <20120916022028.GA8527@cskk.homeip.net> <50566AA2.5000506@canterbury.ac.nz> <5056ABE6.5040206@pearwood.info> <87r4q0p26x.fsf@uwakimon.sk.tsukuba.ac.jp> Message-ID: <87a9wloxxt.fsf@uwakimon.sk.tsukuba.ac.jp> Chris Kaynor writes: > On Mon, Sep 17, 2012 at 11:55 PM, Nick Coghlan wrote: > > On Tue, Sep 18, 2012 at 11:08 AM, Stephen J. Turnbull > > wrote: > >> I agree with the suggestion that really we should fix up other > >> Exceptions that are clearly quite heterogeneous by subclassing > >> more precise exceptions from them. But this isn't always > >> possible, and of course may not be backward compatible. > > If done well, it should be fully backwards compatible. There is not > reason that you cannot keep the errno (or however its named on the > specific exception class) while still subclassing the exception. Of course that's backward compatible with old code receiving the new subclassed exceptions. Maybe I misused the term "backward compatible", but what I meant was that it's likely to be the case that the person doing the subclassing will unify groups of error codes he believes unlikely to need distinguishing in Python applications (vs. more low-level languages). Eg, I don't see more than one hundred new Exceptions in PEP 3151, but only a handful (about 15). 
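Concretely, the "more precise subclass that keeps errno" approach discussed in this thread can be sketched as below. FixableFileError and classify() are invented names for illustration, not stdlib proposals; the point is that the new exception is still an OSError and still carries .errno, so old code catching plain OSError keeps working:

```python
import errno

# Sketch of subclassing a precise exception from a heterogeneous one while
# preserving backwards compatibility. Names here are illustrative only.
class FixableFileError(OSError):
    """File errors this application knows how to repair automatically."""

def classify(exc):
    # Re-wrap selected error codes in the precise subclass; the result is
    # still an OSError with .errno set, so "except OSError" handlers and
    # errno-inspecting code keep working. Anything else passes through.
    if isinstance(exc, OSError) and exc.errno in (errno.ENOENT, errno.EEXIST):
        return FixableFileError(exc.errno, exc.strerror or str(exc))
    return exc

try:
    raise OSError(errno.ENOENT, "No such file or directory")
except OSError as exc:
    exc = classify(exc)
    print(type(exc).__name__, exc.errno)  # FixableFileError 2
```

This is also the shape of the "thin wrapper / simple decorator" trick mentioned earlier in the thread: run the low-level call, catch OSError, and re-raise the classified exception.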
From py.vjp at vivekpowell.com Thu Sep 20 17:22:30 2012 From: py.vjp at vivekpowell.com (V James Powell) Date: Thu, 20 Sep 2012 11:22:30 -0400 (EDT) Subject: [Python-ideas] syntax to continue into the next subsequent except block In-Reply-To: References: Message-ID: > I think it would be useful if there was a way to skip into the next except > block, perhaps with continue as I think it's currently always illegal to use > in an except block. I don't believe there's currently a way to do this. I would like to share that the original suggestion inspired me on the following (mis-)adventure. Cheers, V James Powell switch = lambda val: type('', (BaseException,), {'val': val})() case = lambda pred: type('', (type,), {'__subclasscheck__': staticmethod(lambda x: pred(x.val))})('',(BaseException,),{}) default = BaseException #x = '%s' x = 12 try:raise switch(x) except case( lambda x: x % 2 == 0 ): print "It's even!" except case( lambda x: x % 2 == 1 ): print "It's odd!" except default: print "It's a joke!" From chrysn at fsfe.org Sat Sep 22 18:31:06 2012 From: chrysn at fsfe.org (chrysn) Date: Sat, 22 Sep 2012 18:31:06 +0200 Subject: [Python-ideas] asyncore: included batteries don't fit Message-ID: <20120922163106.GA18772@hephaistos.amsuess.com> hello python-ideas, i'd like to start discussion about the state of asyncore/asynchat's adaption in the python standard library, with the intention of finding a roadmap for how to improve things, and of kicking off and coordinating implementations. here's the problem (as previously described in [issue15978] and redirected here, with some additions): the asyncore module would be much more useful if it were well integrated in the standard library. 
in particular, it should be supported by: * subprocess * BaseHTTPServer / http.server (and thus, socketserver) * urllib2 / urllib, http.client * probably many other network libraries except smtpd, which already uses asyncore * third party libraries (if stdlib leads the way, the ecosystem will follow; eg pyserial) without widespread asyncore support, it is not possible to easily integrate different servers and services with each other; with asyncore support, it's just a matter of creating the objects and entering the main loop. (eg, a http server for controlling a serial device, with a telnet-like debugging interface). some examples of the changes required: * the socketserver documents that it would like to have such a framework ("Future work: [...] Standard framework for select-based multiplexing"). due to the nature of socketserver based implementations (blocking reads), we can't just "add glue so it works", but there could be extensions so that implementations can be ported to asynchronous socketservers. i've done if for a particular case (ported SimpleHTTPServer, but it's a mess of monkey-patching and intermediate StringIOs). * for subprocess, there's a bunch of recipies at [1]. * pyserial (not standard library, but might as well become) can be ported quite easily [2]. this touches several modules whose implementations can be handled independently from each other; i'd implement some of them myself. terry.reedy redirected me from the issue tracker to this list, hoping for controversy and alternatives. if you'd like to discuss, throw in questions, and we'll find a solution. if you'd think talk is cheap, i can try to work out first sketches. python already has batteries for nonblocking operation included, and i say it's doing it right -- let's just make sure the batteries fit in the other gadgets! 
yours truly chrysn [1] http://code.activestate.com/recipes/576957-asynchronous-subprocess-using-asyncore/ [2] http://sourceforge.net/tracker/?func=detail&aid=3559321&group_id=46487&atid=446305 [issue15978] http://bugs.python.org/issue15978 -- Es ist nicht deine Schuld, dass die Welt ist, wie sie ist -- es w?r' nur deine Schuld, wenn sie so bleibt. (You are not to blame for the state of the world, but you would be if that state persisted.) -- Die ?rzte -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 836 bytes Desc: Digital signature URL: From phd at phdru.name Sat Sep 22 18:52:53 2012 From: phd at phdru.name (Oleg Broytman) Date: Sat, 22 Sep 2012 20:52:53 +0400 Subject: [Python-ideas] asyncore: included batteries don't fit In-Reply-To: <20120922163106.GA18772@hephaistos.amsuess.com> References: <20120922163106.GA18772@hephaistos.amsuess.com> Message-ID: <20120922165253.GA32520@iskra.aviel.ru> Hi! On Sat, Sep 22, 2012 at 06:31:06PM +0200, chrysn wrote: > the asyncore module would be much more useful if it were well integrated > in the standard library. in particular, it should be supported by: > > * BaseHTTPServer / http.server (and thus, socketserver) > > * urllib2 / urllib, http.client > > * probably many other network libraries except smtpd, which already uses > asyncore It seems you want Twisted, no? Oleg. -- Oleg Broytman http://phdru.name/ phd at phdru.name Programmers don't die, they just GOSUB without RETURN. 
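For a concrete picture of what "creating the objects and entering the main loop" means, here is a minimal select()-based multiplexing sketch of the kind the post above wants the stdlib servers to support. It is a toy, not a proposed API; socketpair() stands in for real network peers so it is self-contained:

```python
import select
import socket

# Two independent channels serviced by a single loop, with no blocking
# reads: select() tells us which sockets actually have data.
a1, b1 = socket.socketpair()
a2, b2 = socket.socketpair()
b1.sendall(b"hello")
b2.sendall(b"world")

pending = {a1: "chan1", a2: "chan2"}   # sockets we still expect data from
received = {}
while pending:
    readable, _, _ = select.select(list(pending), [], [], 1.0)
    if not readable:                   # timeout: give up rather than block
        break
    for sock in readable:
        received[pending.pop(sock)] = sock.recv(1024)

for sock in (a1, b1, a2, b2):
    sock.close()

print(received)  # {'chan1': b'hello', 'chan2': b'world'}
```

A real framework (asyncore, or Twisted's reactor) wraps exactly this pattern behind channel objects and dispatch callbacks, which is what makes mixing an HTTP server, a serial port, and a subprocess in one loop possible.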
From chrysn at fsfe.org Sat Sep 22 20:27:10 2012 From: chrysn at fsfe.org (chrysn) Date: Sat, 22 Sep 2012 20:27:10 +0200 Subject: [Python-ideas] asyncore: included batteries don't fit In-Reply-To: <20120922165253.GA32520@iskra.aviel.ru> References: <20120922163106.GA18772@hephaistos.amsuess.com> <20120922165253.GA32520@iskra.aviel.ru> Message-ID: <20120922182710.GB18772@hephaistos.amsuess.com> On Sat, Sep 22, 2012 at 08:52:53PM +0400, Oleg Broytman wrote: > On Sat, Sep 22, 2012 at 06:31:06PM +0200, chrysn wrote: > > the asyncore module would be much more useful if it were well integrated > > in the standard library. in particular, it should be supported by: > > > > [...] > > It seems you want Twisted, no? if these considerations end in twisted being consecrated as the new asyncore, i'd consider that a valid solution too. then, again, subprocess and the onboard servers should work well with *that* out of the box. best regards chrysn -- To use raw power is to make yourself infinitely vulnerable to greater powers. -- Bene Gesserit axiom -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 836 bytes Desc: Digital signature URL: From solipsis at pitrou.net Sat Sep 22 20:46:06 2012 From: solipsis at pitrou.net (Antoine Pitrou) Date: Sat, 22 Sep 2012 20:46:06 +0200 Subject: [Python-ideas] asyncore: included batteries don't fit References: <20120922163106.GA18772@hephaistos.amsuess.com> Message-ID: <20120922204606.0b206706@pitrou.net> On Sat, 22 Sep 2012 18:31:06 +0200 chrysn wrote: > hello python-ideas, > > i'd like to start discussion about the state of asyncore/asynchat's > adaption in the python standard library, with the intention of finding a > roadmap for how to improve things, and of kicking off and coordinating > implementations. 
> > here's the problem (as previously described in [issue15978] and > redirected here, with some additions): > > the asyncore module would be much more useful if it were well integrated > in the standard library. in particular, it should be supported by: SSL support is also lacking: http://bugs.python.org/issue10084 Regards Antoine. -- Software development and contracting: http://pro.pitrou.net From phd at phdru.name Sat Sep 22 20:52:10 2012 From: phd at phdru.name (Oleg Broytman) Date: Sat, 22 Sep 2012 22:52:10 +0400 Subject: [Python-ideas] asyncore: included batteries don't fit In-Reply-To: <20120922182710.GB18772@hephaistos.amsuess.com> References: <20120922163106.GA18772@hephaistos.amsuess.com> <20120922165253.GA32520@iskra.aviel.ru> <20120922182710.GB18772@hephaistos.amsuess.com> Message-ID: <20120922185210.GA3082@iskra.aviel.ru> On Sat, Sep 22, 2012 at 08:27:10PM +0200, chrysn wrote: > On Sat, Sep 22, 2012 at 08:52:53PM +0400, Oleg Broytman wrote: > > On Sat, Sep 22, 2012 at 06:31:06PM +0200, chrysn wrote: > > > the asyncore module would be much more useful if it were well integrated > > > in the standard library. in particular, it should be supported by: > > > > > > [...] > > > > It seems you want Twisted, no? > > if these considerations end in twisted being consecrated as the new > asyncore, i'd consider that a valid solution too. If you mean that Twisted will be included in the standard library -- then no, I'm sure it will not. Python comes with batteries included, but Twisted is not a battery, it's rather a power plant. I am sure it will always be developed and distributed separately. And developing asyncore to the level of Twisted would be a duplication of effort. > then, again, > subprocess and the onboard servers should work well with *that* out of > the box. If you want subprocess and Twisted to work together -- you know where to send patches. PS. 
In my not so humble opinion what the standard library really lacks in this area is a way to combine a few asynchronous libraries with different mainloops. Think about wxPython+Twisted in one program. But I have no slightest idea how to approach the problem. Oleg. -- Oleg Broytman http://phdru.name/ phd at phdru.name Programmers don't die, they just GOSUB without RETURN. From amauryfa at gmail.com Sat Sep 22 21:50:39 2012 From: amauryfa at gmail.com (Amaury Forgeot d'Arc) Date: Sat, 22 Sep 2012 21:50:39 +0200 Subject: [Python-ideas] asyncore: included batteries don't fit In-Reply-To: <20120922185210.GA3082@iskra.aviel.ru> References: <20120922163106.GA18772@hephaistos.amsuess.com> <20120922165253.GA32520@iskra.aviel.ru> <20120922182710.GB18772@hephaistos.amsuess.com> <20120922185210.GA3082@iskra.aviel.ru> Message-ID: 2012/9/22 Oleg Broytman : > PS. In my not so humble opinion what the standard library really lacks > in this area is a way to combine a few asynchronous libraries with > different mainloops. Think about wxPython+Twisted in one program. But I > have no slightest idea how to approach the problem. Twisted proposes a wxreactor, of course. -- Amaury Forgeot d'Arc From phd at phdru.name Sat Sep 22 22:05:42 2012 From: phd at phdru.name (Oleg Broytman) Date: Sun, 23 Sep 2012 00:05:42 +0400 Subject: [Python-ideas] combine libraries with different mainloops In-Reply-To: References: <20120922163106.GA18772@hephaistos.amsuess.com> <20120922165253.GA32520@iskra.aviel.ru> <20120922182710.GB18772@hephaistos.amsuess.com> <20120922185210.GA3082@iskra.aviel.ru> Message-ID: <20120922200542.GA5006@iskra.aviel.ru> On Sat, Sep 22, 2012 at 09:50:39PM +0200, Amaury Forgeot d'Arc wrote: > 2012/9/22 Oleg Broytman : > > PS. In my not so humble opinion what the standard library really lacks > > in this area is a way to combine a few asynchronous libraries with > > different mainloops. Think about wxPython+Twisted in one program. 
But I > > have no slightest idea how to approach the problem. > > Twisted proposes a wxreactor, of course. And wxPython has a means to extend its main loop. But these are only partial solutions. There are many more libraries with mainloops. D-Bus/GLib, e.g. Oleg. -- Oleg Broytman http://phdru.name/ phd at phdru.name Programmers don't die, they just GOSUB without RETURN. From chrysn at fsfe.org Sat Sep 22 22:16:53 2012 From: chrysn at fsfe.org (chrysn) Date: Sat, 22 Sep 2012 22:16:53 +0200 Subject: [Python-ideas] asyncore: included batteries don't fit In-Reply-To: <20120922185210.GA3082@iskra.aviel.ru> References: <20120922163106.GA18772@hephaistos.amsuess.com> <20120922165253.GA32520@iskra.aviel.ru> <20120922182710.GB18772@hephaistos.amsuess.com> <20120922185210.GA3082@iskra.aviel.ru> Message-ID: <20120922201653.GC18772@hephaistos.amsuess.com> On Sat, Sep 22, 2012 at 10:52:10PM +0400, Oleg Broytman wrote: > If you mean that Twisted will be included in the standard library -- > then no, I'm sure it will not. Python comes with batteries included, but > Twisted is not a battery, it's rather a power plant. I am sure it will > always be developed and distributed separately. > And developing asyncore to the level of Twisted would be a > duplication of effort. > PS. In my not so humble opinion what the standard library really lacks > in this area is a way to combine a few asynchronous libraries with > different mainloops. Think about wxPython+Twisted in one program. But I > have no slightest idea how to approach the problem. well, what about python including a battery and a battery plug, then? asyncore could be the battery, and an interface between asynchronous libraries the battery plug. users could start developing with batteries, and when the project grows, just plug it into a power plant.
less analogy, more technical: the asyncore dispatcher to main loop interface is pretty thin -- there is a (global or explicitly passed) "map" (a dictionary), mapping file descriptors to objects that can be readable or writable (or acceptable, not sure if that detail is really needed that far down). a dispatcher registers to a map, and then the main loop select()s for events on all files and dispatches them accordingly. it won't be as easy as just taking that interface, eg because it lacks timeouts, but i think it can be the "way to combine a few asynchronous libraries". (to avoid asyncore becoming a powerplant itself, it could choose not to implement some features for simplicity. for example, if asyncore chose to still not implement timeouts, registering timeouts to an asyncore based main loop would just result in a NotImplementedError telling the user to get a more powerful main loop.) i don't want to claim i know how that could work in detail or even if it could work at all, but if this is interesting for enough people that it will be used, i'd like to find out. > > then, again, subprocess and the onboard servers should work well > > with *that* out of the box. > > If you want subprocess and Twisted to work together -- you know where > to send patches. no, actually -- for now, it'd be a patch to twisted (who'd reply with "we already have a way of dealing with it"). if asyncore's interface becomes the battery plug, it'd be a patch to subprocess. thanks for sharing your ideas chrysn -- To use raw power is to make yourself infinitely vulnerable to greater powers. -- Bene Gesserit axiom -------------- next part -------------- A non-text attachment was scrubbed...
Name: signature.asc Type: application/pgp-signature Size: 836 bytes Desc: Digital signature URL: From tjreedy at udel.edu Sun Sep 23 01:04:44 2012 From: tjreedy at udel.edu (Terry Reedy) Date: Sat, 22 Sep 2012 19:04:44 -0400 Subject: [Python-ideas] asyncore: included batteries don't fit In-Reply-To: <20120922204606.0b206706@pitrou.net> References: <20120922163106.GA18772@hephaistos.amsuess.com> <20120922204606.0b206706@pitrou.net> Message-ID: On 9/22/2012 2:46 PM, Antoine Pitrou wrote: > On Sat, 22 Sep 2012 18:31:06 +0200 > chrysn wrote: >> hello python-ideas, >> >> i'd like to start discussion about the state of asyncore/asynchat's >> adaption in the python standard library, with the intention of finding a >> roadmap for how to improve things, and of kicking off and coordinating >> implementations. >> >> here's the problem (as previously described in [issue15978] and >> redirected here, with some additions): >> >> the asyncore module would be much more useful if it were well integrated >> in the standard library. in particular, it should be supported by: > > SSL support is also lacking: > http://bugs.python.org/issue10084 chrysn: The issue needs a patch that incorporates Antoine's review. I am sure there are other asyncore issues that could use help too. -- Terry Jan Reedy From dreamingforward at gmail.com Mon Sep 24 23:49:27 2012 From: dreamingforward at gmail.com (Mark Adam) Date: Mon, 24 Sep 2012 16:49:27 -0500 Subject: [Python-ideas] file API Message-ID: For some time now, I've wanted to suggest a better abstraction for the file type in Python. It currently uses an antiquated, low-level C-style interface for moving around in a file, with methods like tell() and seek(). But after attributes were introduced to Python, it seems like it should be re-evaluated. Let file-type have an attribute .pos for position.
Now you can get rid of the seek() and tell() methods and manipulate the file pointer by the more standard and familiar arithmetic operations: >>> file.pos = 0x0ae1 #move file pointer to an absolute address >>> file.pos += 1 #increment the file pointer one byte >>> curr_pos = file.pos #read current file pointer You've now simplified the API by the removal of two obscure legacy methods (where one has to learn the additional concept of "absolute" and "relative" addressing) and replaced them with a more basic one called "position". Thoughts? markj From jsbueno at python.org.br Mon Sep 24 23:55:58 2012 From: jsbueno at python.org.br (Joao S. O. Bueno) Date: Mon, 24 Sep 2012 18:55:58 -0300 Subject: [Python-ideas] file API In-Reply-To: References: Message-ID: On 24 September 2012 18:49, Mark Adam wrote: > For some time now, I've wanted to suggest a better abstraction for the > file type in Python. It currently uses an antiquated, low-level > C-style interface for moving around in a file, with methods like > tell() and seek(). But after attributes were introduced to Python, it > seems like it should be re-evaluated. > > Let file-type have an attribute .pos for position. Now you can get > rid of the seek() and tell() methods and manipulate the file pointer > by the more standard and familiar arithmetic operations: > >>>> file.pos = 0x0ae1 #move file pointer to an absolute address >>>> file.pos += 1 #increment the file pointer one byte >>>> curr_pos = file.pos #read current file pointer > > You've now simplified the API by the removal of two obscure legacy > methods (where one has to learn the additional concept of "absolute" > and "relative" addressing) and replaced them with a more basic one > called "position". > > Thoughts? -1 This is not so distant from what can be achieved trivially by tell and seek. Moreover, even though changes in attributes _can_ be made to have side effects in Python objects, it does not mean it is easier to read and maintain in every case.
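The point that this is close to what tell() and seek() already give can be illustrated with a thin wrapper around any file object. PositionedFile is a hypothetical name invented for this sketch, not an existing class:

```python
import io

class PositionedFile:
    """Hypothetical wrapper showing the proposed .pos attribute
    implemented on top of the existing tell()/seek() API."""

    def __init__(self, fileobj):
        self._f = fileobj

    @property
    def pos(self):
        # reading the attribute is just tell()
        return self._f.tell()

    @pos.setter
    def pos(self, offset):
        # assigning (including += / -=) is an absolute seek()
        self._f.seek(offset)

    def __getattr__(self, name):
        # delegate read(), write(), close(), ... to the wrapped file
        return getattr(self._f, name)

f = PositionedFile(io.BytesIO(b"hello world"))
f.pos = 6         # absolute positioning
print(f.read(5))  # b'world'
f.pos -= 5        # relative positioning via ordinary arithmetic
print(f.pos)      # 6
```

Note that `f.pos += 1` expands to a tell() followed by a seek(), so the "relative" case rides on the absolute one for free.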
What I think we need is a better way of dealing with constants - the "whence" attribute for "seek" takes raw ints for "from start", "from end" and "relative" - but that is an entirely different subject. js -><- > > markj > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas From g.rodola at gmail.com Tue Sep 25 00:31:37 2012 From: g.rodola at gmail.com (=?ISO-8859-1?Q?Giampaolo_Rodol=E0?=) Date: Mon, 24 Sep 2012 15:31:37 -0700 Subject: [Python-ideas] asyncore: included batteries don't fit In-Reply-To: <20120922163106.GA18772@hephaistos.amsuess.com> References: <20120922163106.GA18772@hephaistos.amsuess.com> Message-ID: I still think this proposal is too vaguely defined and any effort towards adding async IO support to existing batteries is premature for different reasons, first of which the inadequacy of asyncore as the base async framework to fulfill the task you're proposing. asyncore is so old and difficult to fix/enhance without breaking backward compatibility (see for example http://bugs.python.org/issue11273#msg156439) that relying on it for any modern work is inevitably a bad idea. From a chronological standpoint I still think the best thing to do in order to fix the "python async problem" once and for all is to first define and possibly implement an "async WSGI interface" describing what a standard async IO loop/reactor should look like (in terms of API) and how to integrate with it, see: http://mail.python.org/pipermail/python-ideas/2012-May/015223.html http://mail.python.org/pipermail/python-ideas/2012-May/015235.html From there the python stdlib *might* grow a new module implementing the "async WSGI interface" (let's call it asyncore2) and some of the stdlib batteries such as socketserver can possibly use it. In my mind this is the ideal long-term scenario but even managing to define an "async WSGI interface" alone would be a big step forward.
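For concreteness, the kind of fd-to-callback registry being discussed in this thread can be sketched in a few lines. This is purely illustrative -- the class name and API are invented here, it only handles readability, and the demo assumes a POSIX socketpair(); it is not the interface proposed in the linked threads:

```python
import select
import socket

class MiniLoop:
    """Illustrative sketch only: a registry mapping file descriptors
    to read callbacks, dispatched with select(), in the spirit of
    asyncore's channel map."""

    def __init__(self):
        self._readers = {}

    def register(self, fileobj, callback):
        # anything with a fileno() can take part
        self._readers[fileobj.fileno()] = (fileobj, callback)

    def run_once(self, timeout=0):
        # one iteration of the main loop: poll, then dispatch
        ready, _, _ = select.select(list(self._readers), [], [], timeout)
        for fd in ready:
            fileobj, callback = self._readers[fd]
            callback(fileobj)

# wire two endpoints together and dispatch one event
a, b = socket.socketpair()
received = []
loop = MiniLoop()
loop.register(b, lambda s: received.append(s.recv(16)))
a.send(b"ping")
loop.run_once(timeout=1)
print(received)  # [b'ping']
```

A real loop would also need write/error interest, timeouts, and unregistration, which is exactly where the design discussion in this thread starts.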
Again, at this point in time what you're proposing looks too vague, ambitious and premature to me. --- Giampaolo http://code.google.com/p/pyftpdlib/ http://code.google.com/p/psutil/ http://code.google.com/p/pysendfile/ 2012/9/22 chrysn > hello python-ideas, > > i'd like to start discussion about the state of asyncore/asynchat's > adaption in the python standard library, with the intention of finding a > roadmap for how to improve things, and of kicking off and coordinating > implementations. > > here's the problem (as previously described in [issue15978] and > redirected here, with some additions): > > the asyncore module would be much more useful if it were well integrated > in the standard library. in particular, it should be supported by: > > * subprocess > > * BaseHTTPServer / http.server (and thus, socketserver) > > * urllib2 / urllib, http.client > > * probably many other network libraries except smtpd, which already uses > asyncore > > * third party libraries (if stdlib leads the way, the ecosystem will > follow; eg pyserial) > > without widespread asyncore support, it is not possible to easily > integrate different servers and services with each other; with asyncore > support, it's just a matter of creating the objects and entering the > main loop. (eg, a http server for controlling a serial device, with a > telnet-like debugging interface). > -------------- next part -------------- An HTML attachment was scrubbed... URL: From josiah.carlson at gmail.com Tue Sep 25 02:02:08 2012 From: josiah.carlson at gmail.com (Josiah Carlson) Date: Mon, 24 Sep 2012 17:02:08 -0700 Subject: [Python-ideas] asyncore: included batteries don't fit In-Reply-To: References: <20120922163106.GA18772@hephaistos.amsuess.com> Message-ID: Temporarily un-lurking to reply to this thread (which I'll actually be reading). Giampaolo and I talked about this for a bit over the weekend, and I have to say that I agree with his perspective. 
In particular, to get something better than asyncore, there must be something minimally better to build upon. I don't really have an opinion on what that minimally better thing should be named, but I do agree that having a simple reactor API that has predictable behavior over the variety of handlers (select, poll, epoll, kqueue, WSAEvent in Windows, etc.) is necessary. Now, let's get to brass tacks... 1. Whatever reactors are available, you need to be able to instantiate multiple of different types of reactors and multiple instances of the same type of reactor simultaneously (to support multiple threads handling different groups of reactors, or different reactors for different types of objects on certain platforms). While this allows for insanity in the worst-case, we're all consenting adults here, so shouldn't be limited by reactor singletons. There should be a default reactor class, which is defined on module/package import (use the "best" one for the platform). 2. The API must be simple. I am not sure that it can get easier than Idea #3 from: http://mail.python.org/pipermail/python-ideas/2012-May/015245.html I personally like it because it offers a simple upgrade path for asyncore users (create your asyncore-derived classes, pass it into the new reactor), while simultaneously defining a relatively easy API for any 3rd party to integrate with. By offering an easy-to-integrate method for 3rd parties (that is also sane), there is the added bonus that 3rd parties are more likely to integrate, rather than replace, which means more use in the "real world", better bug reports, etc. To simplify integration further, make the API register(fd, handler, events=singleton). Passing no events from the caller means "register me for all events", which will help 3rd parties that aren't great with handling read/write registration. 3. I don't have a 3rd tack, you can hang things on the wall with 2 ;) Regards, - Josiah On Mon, Sep 24, 2012 at 3:31 PM, Giampaolo Rodol? 
wrote: > > I still think this proposal is too vaguely defined and any effort towards > adding async IO support to existing batteries is premature for different > reasons, first of which the inadequacy of asyncore as the base async > framework to fulfill the task you're proposing. > > asyncore is so old and difficult to fix/enhance without breaking backward > compatibility (see for example http://bugs.python.org/issue11273#msg156439) > that relying on it for any modern work is inevitably a bad idea. > > From a chronological standpoint I still think the best thing to do in order > to fix the "python async problem" once and for all is to first define and > possibly implement an "async WSGI interface" describing what a standard > async IO loop/reactor should look like (in terms of API) and how to > integrate with it, see: > http://mail.python.org/pipermail/python-ideas/2012-May/015223.html > http://mail.python.org/pipermail/python-ideas/2012-May/015235.html > > From there the python stdlib *might* grow a new module implementing the > "async WSGI interface" (let's call it asyncore2) and some of the stdlib > batteries such as socketserver can possibly use it. > > In my mind this is the ideal long-term scenario but even managing to define > an "async WSGI interface" alone would be a big step forward. > > Again, at this point in time what you're proposing looks too vague, > ambitious and premature to me. > > > --- Giampaolo > http://code.google.com/p/pyftpdlib/ > http://code.google.com/p/psutil/ > http://code.google.com/p/pysendfile/ > > > > 2012/9/22 chrysn >> >> hello python-ideas, >> >> i'd like to start discussion about the state of asyncore/asynchat's >> adaption in the python standard library, with the intention of finding a >> roadmap for how to improve things, and of kicking off and coordinating >> implementations. 
>> >> here's the problem (as previously described in [issue15978] and >> redirected here, with some additions): >> >> the asyncore module would be much more useful if it were well integrated >> in the standard library. in particular, it should be supported by: >> >> * subprocess >> >> * BaseHTTPServer / http.server (and thus, socketserver) >> >> * urllib2 / urllib, http.client >> >> * probably many other network libraries except smtpd, which already uses >> asyncore >> >> * third party libraries (if stdlib leads the way, the ecosystem will >> follow; eg pyserial) >> >> without widespread asyncore support, it is not possible to easily >> integrate different servers and services with each other; with asyncore >> support, it's just a matter of creating the objects and entering the >> main loop. (eg, a http server for controlling a serial device, with a >> telnet-like debugging interface). > > From matt at whoosh.ca Tue Sep 25 18:05:25 2012 From: matt at whoosh.ca (Matt Chaput) Date: Tue, 25 Sep 2012 12:05:25 -0400 Subject: [Python-ideas] Regex support code as std lib functions Message-ID: <5061D645.1070403@whoosh.ca> I'm not sure where regex is in its long march toward replacing re, but I just noticed something interesting: the regex module seems to include implementations of some useful functions to support its regular expression matching, e.g.: - Levenshtein distance - Unicode case folding Both of these would be useful as fast functions in the std lib. If/when regex replaces re, any possibility all the useful functions that support it could be added to the std lib in the appropriate modules as part of integrating it? 
Cheers, Matt From python at mrabarnett.plus.com Tue Sep 25 18:31:44 2012 From: python at mrabarnett.plus.com (MRAB) Date: Tue, 25 Sep 2012 17:31:44 +0100 Subject: [Python-ideas] Regex support code as std lib functions In-Reply-To: <5061D645.1070403@whoosh.ca> References: <5061D645.1070403@whoosh.ca> Message-ID: <5061DC70.4070509@mrabarnett.plus.com> On 2012-09-25 17:05, Matt Chaput wrote: > I'm not sure where regex is in its long march toward replacing re, but I > just noticed something interesting: the regex module seems to include > implementations of some useful functions to support its regular > expression matching, e.g.: > > - Levenshtein distance > > - Unicode case folding > > Both of these would be useful as fast functions in the std lib. If/when > regex replaces re, any possibility all the useful functions that support > it could be added to the std lib in the appropriate modules as part of > integrating it? > Python 3.3 includes case-folding: >>> "\N{LATIN SMALL LETTER SHARP S}".casefold() 'ss' The regex module doesn't support Levenshtein distance as such, instead it supports fuzzy (approximate) matching, where you're concerned not so much about the _minimum_ edit distance as whether there are no more than a certain number of errors when matching a regex pattern. It would be more efficient to implement Levenshtein distance separately. From p.f.moore at gmail.com Tue Sep 25 22:39:31 2012 From: p.f.moore at gmail.com (Paul Moore) Date: Tue, 25 Sep 2012 21:39:31 +0100 Subject: [Python-ideas] bdist naming scheme (compatibility tags) PEP In-Reply-To: References: <50236FD5.2060404@egenix.com> Message-ID: On 9 September 2012 13:16, Nick Coghlan wrote: > On Sun, Sep 9, 2012 at 1:41 PM, Daniel Holth wrote: >> Re: http://www.python.org/dev/peps/pep-0425/ "Compatibility tags for >> built distributions" >> >> Progress towards a proper set of rules for generating the tags a >> Python implementation is likely to support. 
>> >> This system of being willing to install older built distributions is >> intended to solve the frustrating problem with eggs that you would >> have to build a new egg for each Python release, even for a >> pure-Python egg that probably runs fine on a newer Python. > > Yep, those rules look sensible to me (and thanks for clarifying the > intended semantics of the "py2" and "py3" version markers) It's worth noting that there are two somewhat independent cases: Binary built distributions (containing C extensions, typically). This is architecture/ABI dependent, and would generally be tagged as cpXX-abi-arch. Pure Python built distributions. This is architecture/ABI independent, and would be tagged as pyXX-none-any or cpXX-none-any (or maybe cpX-none-any or pyX-none-any). (I'm ignoring other implementations from lack of knowledge, but I suspect a similar distinction will be relevant). Implementations will therefore *only* match built distributions which either: 1. Exactly match implversion-abi-arch (binary built distributions). There's a slight complication for implementations that support multiple ABIs, e.g. the stable ABI, but it's minor. 2. Match implversion in a "fuzzy" manner if abi-arch is none-any (pure python built distributions). The "fuzzy" matching is clearly defined, as an example for CPython 3.4, try (in this order of preference) cp34 cp3 cp33 cp32 cp31 cp30 py34 py3 py33 py32 py31 py30. [I wonder - should py34 be preferred over cp32? That's not what the wheel implementation does] On this basis, implementations should *not* declare none-any combinations, as they can be automatically deduced. One minor question on this, though, is the statement in the PEP "A user could instruct their installer to fall back to building from an sdist more or less often by configuring this list of tags". I don't see what this means - it should probably be either clarified or omitted. 
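The fuzzy preference order above can be written down as a small helper. This is only a sketch of the ordering in the example (the function name is invented; it is not the wheel project's or pip's actual code):

```python
def fuzzy_tag_order(impl, major, minor):
    """Candidate interpreter tags for a pure-Python built distribution,
    most to least preferred, following the cp34 cp3 cp33 ... py30
    example above. 'impl' is the implementation code ('cp' for
    CPython), not a full compatibility-tag triple."""
    tags = []
    for prefix in (impl, "py"):
        tags.append("%s%d%d" % (prefix, major, minor))   # exact version
        tags.append("%s%d" % (prefix, major))            # major only
        for m in range(minor - 1, -1, -1):               # older minors
            tags.append("%s%d%d" % (prefix, major, m))
    return tags

print(fuzzy_tag_order("cp", 3, 4))
# ['cp34', 'cp3', 'cp33', 'cp32', 'cp31', 'cp30',
#  'py34', 'py3', 'py33', 'py32', 'py31', 'py30']
```

Dialing the fallback "down" as suggested later in the thread would simply mean truncating the ranges in this list.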
This isn't particularly a criticism of the PEP, it's just that the wording tends to obfuscate the essentials by hinting at complexities that don't really exist in practice. For example, given the above, the only really meaningful compressed tagset I can imagine is py2.py3-none-any. Apart from this one use case, which admittedly is important, the whole compressed tagset capability is unlikely to ever be needed. Paul. From dholth at gmail.com Tue Sep 25 23:08:59 2012 From: dholth at gmail.com (Daniel Holth) Date: Tue, 25 Sep 2012 17:08:59 -0400 Subject: [Python-ideas] bdist naming scheme (compatibility tags) PEP In-Reply-To: References: <50236FD5.2060404@egenix.com> Message-ID: > It's worth noting that there are two somewhat independent cases: > > Binary built distributions (containing C extensions, typically). This > is architecture/ABI dependent, and would generally be tagged as > cpXX-abi-arch. > Pure Python built distributions. This is architecture/ABI independent, > and would be tagged as pyXX-none-any or cpXX-none-any (or maybe > cpX-none-any or pyX-none-any). > > (I'm ignoring other implementations from lack of knowledge, but I > suspect a similar distinction will be relevant). > > Implementations will therefore *only* match built distributions which either: > > 1. Exactly match implversion-abi-arch (binary built distributions). > There's a slight complication for implementations that support > multiple ABIs, e.g. the stable ABI, but it's minor. > 2. Match implversion in a "fuzzy" manner if abi-arch is none-any (pure > python built distributions). The "fuzzy" matching is clearly defined, > as an example for CPython 3.4, try (in this order of preference) cp34 > cp3 cp33 cp32 cp31 cp30 py34 py3 py33 py32 py31 py30. [I wonder - > should py34 be preferred over cp32? 
That's not what the wheel > implementation does] I don't think the exact order of the less-preferred options is critical, as long as you can make up your mind about whether you prefer packages with or without the C extension. Your Python is not likely to be compatible with competing py34 and cp32 wheels for the same version of a distribution. Most distributions will use either the cpXX style or the pyXX style tags, but not both. > On this basis, implementations should *not* declare none-any > combinations, as they can be automatically deduced. +0. algorithmically at least. It would not be wrong to dial down the "previous versions" logic quite a bit, too, as far as only doing cp33, cp3, py33, py3 which would mean "only use packages that are for our Python or explicitly cross-version". > One minor question on this, though, is the statement in the PEP "A > user could instruct their installer to fall back to building from an > sdist more or less often by configuring this list of tags". I don't > see what this means - it should probably be either clarified or > omitted. In the above "fewer old tags by default" case, if you are on Python 3.3 and don't install cp32 by default, you could say "also install cp32 for this one package that I know works" by adding the cp32 tag to the list. This is to be compatible with lazy human packagers. Similarly when you are a version behind you will sometimes need to install packages built for the next version of Python. Or you could remove all binary package tags of the form *-abi3-linux_x86_64 from the list that your installer uses to consider whether to download a built package or an sdist from pypi. It would still download built pure-Python packages. > This isn't particularly a criticism of the PEP, it's just that the > wording tends to obfuscate the essentials by hinting at complexities > that don't really exist in practice. 
For example, given the above, the > only really meaningful compressed tagset I can imagine is > py2.py3-none-any. Apart from this one use case, which admittedly is > important, the whole compressed tagset capability is unlikely to ever > be needed. Who knows. I imagined bundling a windows and a Linux dll in a single built package, or doing something with OS X fat binaries. If the shared library has the same __pycache__/name.tag.so style naming then this feature gets more interesting, if not difficult to package. It doesn't currently exist in practice. From p.f.moore at gmail.com Tue Sep 25 23:41:01 2012 From: p.f.moore at gmail.com (Paul Moore) Date: Tue, 25 Sep 2012 22:41:01 +0100 Subject: [Python-ideas] bdist naming scheme (compatibility tags) PEP In-Reply-To: References: <50236FD5.2060404@egenix.com> Message-ID: On 25 September 2012 22:08, Daniel Holth wrote: > I don't think the exact order of the less-preferred options is > critical, as long as you can make up your mind about whether you > prefer packages with or without the C extension. Your Python is not > likely to be compatible with competing py34 and cp32 wheels for the > same version of a distribution. Most distributions will use either the > cpXX style or the pyXX style tags, but not both. > >> On this basis, implementations should *not* declare none-any >> combinations, as they can be automatically deduced. > > +0. algorithmically at least. It would not be wrong to dial down the > "previous versions" logic quite a bit, too, as far as only doing cp33, > cp3, py33, py3 which would mean "only use packages that are for our > Python or explicitly cross-version". > >> One minor question on this, though, is the statement in the PEP "A >> user could instruct their installer to fall back to building from an >> sdist more or less often by configuring this list of tags". I don't >> see what this means - it should probably be either clarified or >> omitted. 
> > In the above "fewer old tags by default" case, if you are on Python > 3.3 and don't install cp32 by default, you could say "also install > cp32 for this one package that I know works" by adding the cp32 tag to > the list. This is to be compatible with lazy human packagers. > Similarly when you are a version behind you will sometimes need to > install packages built for the next version of Python. > > Or you could remove all binary package tags of the form > *-abi3-linux_x86_64 from the list that your installer uses to consider > whether to download a built package or an sdist from pypi. It would > still download built pure-Python packages. > >> This isn't particularly a criticism of the PEP, it's just that the >> wording tends to obfuscate the essentials by hinting at complexities >> that don't really exist in practice. For example, given the above, the >> only really meaningful compressed tagset I can imagine is >> py2.py3-none-any. Apart from this one use case, which admittedly is >> important, the whole compressed tagset capability is unlikely to ever >> be needed. > > Who knows. I imagined bundling a windows and a Linux dll in a single > built package, or doing something with OS X fat binaries. If the > shared library has the same __pycache__/name.tag.so style naming then > this feature gets more interesting, if not difficult to package. > > It doesn't currently exist in practice. From p.f.moore at gmail.com Tue Sep 25 23:47:22 2012 From: p.f.moore at gmail.com (Paul Moore) Date: Tue, 25 Sep 2012 22:47:22 +0100 Subject: [Python-ideas] bdist naming scheme (compatibility tags) PEP In-Reply-To: References: <50236FD5.2060404@egenix.com> Message-ID: On 25 September 2012 22:08, Daniel Holth wrote: > I don't think the exact order of the less-preferred options is > critical, as long as you can make up your mind about whether you > prefer packages with or without the C extension. 
Your Python is not > likely to be compatible with competing py34 and cp32 wheels for the > same version of a distribution. Most distributions will use either the > cpXX style or the pyXX style tags, but not both. I think that this is fine, but the PEP needs to be explicit. If it's a user option, the PEP should say "installers should allow the user to specify the list of compatibility tags, and the default should be XXX". If it's static, the PEP should say what it is. Having different installers make different, incompatible assumptions is unpleasant. At present, of course, the only 2 real contenders are the reference wheel implementation and pip. Others like distutils2/packaging may follow. [...] >> This isn't particularly a criticism of the PEP, it's just that the >> wording tends to obfuscate the essentials by hinting at complexities >> that don't really exist in practice. For example, given the above, the >> only really meaningful compressed tagset I can imagine is >> py2.py3-none-any. Apart from this one use case, which admittedly is >> important, the whole compressed tagset capability is unlikely to ever >> be needed. > > Who knows. I imagined bundling a windows and a Linux dll in a single > built package, or doing something with OS X fat binaries. If the > shared library has the same __pycache__/name.tag.so style naming then > this feature gets more interesting, if not difficult to package. > > It doesn't currently exist in practice. The PEP should stick to defining behaviour for things that do exist. Let those who build clever new options like that work out how to integrate with this PEP. (On which note, is the "stable ABI" real yet? On Windows, at least, it talks about a python3.dll, and yet there is no such thing distributed with Python 3.3, so based on that (what's the situation on Linux?) I'd be inclined to say that as of this point, even the stable ABI can be ignored.) Paul.
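The exact-match-against-an-ordered-list behaviour Paul is asking the PEP to pin down can be sketched in a few lines. This is purely illustrative (the function name and the filename-splitting rule are assumptions, not anything the PEP mandates); it treats each wheel's compressed tag set as the cross product of its '.'-separated parts and picks the candidate whose expanded tag ranks earliest in the installer's preference list:

```python
def pick_wheel(filenames, supported):
    # Rank each acceptable tag by its position in the installer's
    # ordered preference list (0 = most preferred).
    rank = {tag: i for i, tag in enumerate(supported)}
    best = None  # (rank, filename) of the best candidate seen so far
    for fn in filenames:
        # Wheel-style filename: name-version-pytag-abitag-plattag.whl
        name, version, pyvers, abis, plats = fn[:-len(".whl")].rsplit("-", 4)
        # A compressed tag set such as py2.py3-none-any expands to the
        # cross product of its '.'-separated parts.
        for py in pyvers.split("."):
            for abi in abis.split("."):
                for plat in plats.split("."):
                    tag = "%s-%s-%s" % (py, abi, plat)
                    if tag in rank and (best is None or rank[tag] < best[0]):
                        best = (rank[tag], fn)
    return best[1] if best else None
```

With a list like ["cp33-cp33m-win32", ..., "py3-none-any"], a cp33 binary wheel beats a py2.py3-none-any pure-Python wheel, and a wheel whose tags never intersect the list falls through to None (i.e. build from sdist) -- which is exactly the behaviour the installers would need to agree on.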
From chrysn at fsfe.org Wed Sep 26 10:17:18 2012 From: chrysn at fsfe.org (chrysn) Date: Wed, 26 Sep 2012 10:17:18 +0200 Subject: [Python-ideas] asyncore: included batteries don't fit In-Reply-To: Message-ID: <20120926081718.GA20843@hephaistos.amsuess.com> On Mon, Sep 24, 2012 at 03:31:37PM -0700, Giampaolo Rodolà wrote: > From a chronological standpoint I still think the best thing to do in order > to fix the "python async problem" once and for all is to first define and > possibly implement an "async WSGI interface" describing what a standard > async IO loop/reactor should look like (in terms of API) and how to > integrate with it, see: > http://mail.python.org/pipermail/python-ideas/2012-May/015223.html > http://mail.python.org/pipermail/python-ideas/2012-May/015235.html i wasn't aware that pep 3153 exists. given that, my original intention of this thread should be re-worded into "let's get pep3153 along!". i'm not convinced by the api suggested in the first mail, as it sounds very unix centric (poll, read/write/error). i rather imagined leaving the details of the callbackable/mainloop interaction to be platform details. (a win32evtlog event source just couldn't possibly register with a select() based main loop). i'd prefer to keep the part that registers with a main loop concentrated to a very lowlevel common denominator. for unix, that'd mean that there is a basic callbackable for "things that receive events because they have a fileno". everything above that, eg the distinction whether a "w" event means that we can write() or that we must accept() could happen above that and wouldn't have to be concerned with the main loop integration any more. in case (pseudo)code gets the idea over better: class UnixFilehandle(object): def __init__(self, fileno): self._fileno = fileno def register_with_main_loop(self, mainloop): # it might happen that the main loop doesn't support unix # filenos.
tough luck, in that case -- the developer should # select a more suitable main loop. mainloop.register_unix_fileno(self._fileno, self) def handle_r_event(self): raise NotImplementedError("Not configured to receive that sort of event") # if you're sure you'd never receive any anyway, you can # not-register them by setting them None in the subclass handle_w_event = handle_e_event = handle_r_event class SocketServer(UnixFilehandle): def __init__(self, socket): self._socket = socket UnixFilehandle.__init__(self, socket.fileno()) def handle_w_event(self): self.handle_accept_event(self.socket.accept()) other interfaces parallel to the file handle interface would, for example, handle unix signals. (built atop of that, like the accept-handling socket server, could be one that deals with child processes.) the interface for android might look different again, because there is no main loop and select never gets called by the application. > From there the python stdlib *might* grow a new module implementing the > "async WSGI interface" (let's call it asyncore2) and some of the stdlib > batteries such as socketserver can possibly use it. > > In my mind this is the ideal long-term scenario but even managing to define > an "async WSGI interface" alone would be a big step forward. i'd welcome such an interface. if asyncore can then be retrofitted to accept that interface too w/o breaking compatibility, it'd be nice, but if not, it's asyncore2, then. > Again, at this point in time what you're proposing looks too vague, > ambitious and premature to me. please don't get me wrong -- i'm not proposing anything for immediate action, i just want to start a thinking process towards a better integrated stdlib. On Mon, Sep 24, 2012 at 05:02:08PM -0700, Josiah Carlson wrote: > 1.
Whatever reactors are available, you need to be able to instantiate > multiple of different types of reactors and multiple instances of the > same type of reactor simultaneously (to support multiple threads > handling different groups of reactors, or different reactors for > different types of objects on certain platforms). While this allows > for insanity in the worst-case, we're all consenting adults here, so > shouldn't be limited by reactor singletons. There should be a default > reactor class, which is defined on module/package import (use the > "best" one for the platform). i think that's already common. with asyncore, you can have different maps (just one is installed globally as default). with the gtk main loop, it's a little tricky (the gtk.main() function doesn't simply take an argument), but the underlying glib can do that afaict. > 2. The API must be simple. I am not sure that it can get easier than > Idea #3 from: > http://mail.python.org/pipermail/python-ideas/2012-May/015245.html it's good that the necessities of call_later and call_every are mentioned here, i'd have forgotten about them. we've talked about many things we'd need in a python asynchronous interface (not implementation), so what are the things we *don't* need? (so we won't start building a framework like twisted). i'll start: * high-level protocol handling (can be extra modules atop of it) * ssl * something like the twisted delayed framework (not sure about that, i guess the twisted people will have good reason to use it, but i don't see compelling reasons for such a thing in a minimal interface from my limited pov) * explicit connection handling (retries, timeouts -- would be up to the user as well, eg urllib might want to set up a timeout and retries for asynchronous url requests) best regards chrysn -- To use raw power is to make yourself infinitely vulnerable to greater powers. -- Bene Gesserit axiom -------------- next part -------------- A non-text attachment was scrubbed... 
Name: signature.asc Type: application/pgp-signature Size: 836 bytes Desc: Digital signature URL: From dholth at gmail.com Wed Sep 26 14:31:26 2012 From: dholth at gmail.com (Daniel Holth) Date: Wed, 26 Sep 2012 08:31:26 -0400 Subject: [Python-ideas] bdist naming scheme (compatibility tags) PEP In-Reply-To: References: <50236FD5.2060404@egenix.com> Message-ID: > I think that this is fine, but the PEP needs to be explicit. If it's a > user option, the PEP should say "installers should allow the user to > specify the list of compatibility tags, and the default should be > XXX". If it's static, the PEP should say what it is. > > Having different installers make different, incompatible assumptions, > is unpleasant. At present, of course, the only 2 real contenders are > the reference wheel implementation and pip. Others like > distutils2/packaging may follow. It might be easier to explain by defining a static list for each version of Python and then say "and you can add previous versions to the ordered set". Then for CPython 3.3, ignoring abi3, with pymalloc giving the cp33m suffix, you could have only cp33-cp33m-win32 cp33-none-win32 cp33-none-any py33-none-any py3-none-any implementation - preferred abi - plat implementation - none - plat implementation - none - any python major minor - none - any python major - none - any The rule for generating the last version's tags ignoring abi3 is that you only keep the none-any tags: cp32-none-any py32-none-any py3-none-any appending the lists without duplicates you get cp33-cp33m-win32 cp33-none-win32 cp33-none-any py33-none-any py3-none-any cp32-none-any py32-none-any I'm not sure what to do with abi3 or whether to use the cp3 (major only) implementation tag. 
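The ordered set Daniel lays out above can be generated mechanically. The sketch below (function name and defaults are mine, and it deliberately ignores abi3, exactly as the worked example does) builds the current version's five tags, then appends only the none-any tags for each earlier version, dropping duplicates in order:

```python
def compatibility_tags(impl="cp", version="33", abi="cp33m", plat="win32",
                       older=("32",)):
    major = version[0]
    tags = [
        "%s%s-%s-%s" % (impl, version, abi, plat),  # implementation - preferred abi - plat
        "%s%s-none-%s" % (impl, version, plat),     # implementation - none - plat
        "%s%s-none-any" % (impl, version),          # implementation - none - any
        "py%s-none-any" % version,                  # python major minor - none - any
        "py%s-none-any" % major,                    # python major - none - any
    ]
    # For each earlier version, keep only the none-any tags.
    for v in older:
        tags += ["%s%s-none-any" % (impl, v),
                 "py%s-none-any" % v,
                 "py%s-none-any" % v[0]]
    # Append without duplicates, preserving order.
    seen, ordered = set(), []
    for t in tags:
        if t not in seen:
            seen.add(t)
            ordered.append(t)
    return ordered
```

With the defaults this reproduces Daniel's seven-tag list for CPython 3.3 on win32 (py3-none-any appears once, since the duplicate from the 3.2 round is dropped).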
From p.f.moore at gmail.com Wed Sep 26 16:23:02 2012 From: p.f.moore at gmail.com (Paul Moore) Date: Wed, 26 Sep 2012 15:23:02 +0100 Subject: [Python-ideas] bdist naming scheme (compatibility tags) PEP In-Reply-To: References: <50236FD5.2060404@egenix.com> Message-ID: On 26 September 2012 13:31, Daniel Holth wrote: >> I think that this is fine, but the PEP needs to be explicit. If it's a >> user option, the PEP should say "installers should allow the user to >> specify the list of compatibility tags, and the default should be >> XXX". If it's static, the PEP should say what it is. >> >> Having different installers make different, incompatible assumptions, >> is unpleasant. At present, of course, the only 2 real contenders are >> the reference wheel implementation and pip. Others like >> distutils2/packaging may follow. > > It might be easier to explain by defining a static list for each > version of Python and then say "and you can add previous versions to > the ordered set". Then for CPython 3.3, ignoring abi3, with pymalloc > giving the cp33m suffix, you could have only > > cp33-cp33m-win32 > cp33-none-win32 > cp33-none-any > py33-none-any > py3-none-any > > implementation - preferred abi - plat > implementation - none - plat > implementation - none - any > python major minor - none - any > python major - none - any > > The rule for generating the last version's tags ignoring abi3 is that > you only keep the none-any tags: > > cp32-none-any > py32-none-any > py3-none-any > > appending the lists without duplicates you get > > cp33-cp33m-win32 > cp33-none-win32 > cp33-none-any > py33-none-any > py3-none-any > cp32-none-any > py32-none-any > > I'm not sure what to do with abi3 or whether to use the cp3 (major > only) implementation tag. Win32 is not a good example here. As far as I know (I've experimented and read docs, but haven't analyzed the code), there is never a declared ABI on Win32. 
In fact, Windows is pretty much trivially simple: cpXY-none-win32 (for distributions with C extensions) pyXY-none-any (for pure-Python distributions) In fact, those two are the only values the bdist_wheel format can generate. Actually, for non-Windows, it's just as simple - bdist_wheel can only generate cpXY-ABI-PLAT (for distributions with C extensions) pyXY-none-any (for pure-Python distributions) ABI is the preferred ABI (the part of SOABI after the '-' from the Python used to build) and PLAT is the platform. (So essentially, Windows follows the standard pattern, but with an ABI of "none"). Eggs and wininst installers, if they used this convention, would be the same. As would bdist_msi, as far as I know. So the question is, what use case is there for anything more complicated than this? The only possibilities I can see are: 1. The stable ABI. At the moment, I don't know how well that's supported - I don't think the build tools detect whether code only uses the stable ABI, so they assume the full ABI. Users could claim to use the stable ABI by manual renaming. But without an agreed and documented convention for the stable ABI, they can't do that, so I think it's premature to worry about that case. It's easy enough to add if needed (see below - it's just another ABI for installers to allow) 2. UCS2 vs UCS4. This is dead for Python 3.3+, so not worth complicating the model for. 3. In theory, if a tool could create "fat" archives containing code for multiple platforms/ABIs, then that might need something more complex. But in the absence of any such tool, I'd call YAGNI on this. 4. Pure-python code which works on multiple versions of Python. This is a real case, and needs to be considered. Code that is (presumed) valid on all Python versions within the current major version can be manually retagged as pyX. And code that supports Python 2 and 3 can be retagged as py2.py3. 
To allow forward compatibility, installers should allow the user to install pyXZ code on Python version X.Y when Z <= Y. But this should be a user option (possibly off by default) and an exact match should always be preferred. [...] References: <50236FD5.2060404@egenix.com> Message-ID: On 9 September 2012 13:16, Nick Coghlan wrote: > Yep, those rules look sensible to me (and thanks for clarifying the > intended semantics of the "py2" and "py3" version markers) One (relatively minor) point: the Python tag isn't easily parseable. To split the implementation and version bits, you can do tag[:2], tag[2:] except for "Other Python implementations should use sys.implementation.name". Or you could use tag[:-2], tag[-2:] except for "py2". So you need to use a regex match to split off the trailing digits, which is a bit excessive. My current approach is the [:2], [2:] one, calling YAGNI on implementations not covered by the 2-letter codes... Paul. From dholth at gmail.com Wed Sep 26 16:53:03 2012 From: dholth at gmail.com (Daniel Holth) Date: Wed, 26 Sep 2012 10:53:03 -0400 Subject: [Python-ideas] bdist naming scheme (compatibility tags) PEP In-Reply-To: References: <50236FD5.2060404@egenix.com> Message-ID: > Win32 is not a good example here. As far as I know (I've experimented > and read docs, but haven't analyzed the code), there is never a > declared ABI on Win32. In fact, Windows is pretty much trivially > simple: > > cpXY-none-win32 (for distributions with C extensions) > pyXY-none-any (for pure-Python distributions) > > In fact, those two are the only values the bdist_wheel format can > generate. Actually, for non-Windows, it's just as simple - bdist_wheel > can only generate bdist_wheel is incomplete too. It should read from setup.cfg for advice on the tags. Does win32 have debug / pymalloc builds? That is why there is a cp33dm ABI. On linux imp.get_suffixes() includes ('.cpython-33m.so', 'rb', 3), ('.abi3.so', 'rb', 3) and the abi tag is just an abbreviation cp33m or abi3.
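The abbreviation Daniel describes -- the '.cpython-33m.so' suffix collapsing to the abi tag cp33m, while abi3 is already a tag -- is simple enough to show. The helper below is a guess at the mapping (the name and the one-entry abbreviation table are mine, not code from any PEP or tool):

```python
def abi_tag(soabi):
    # 'cpython-33m' -> 'cp33m'; 'abi3' has no '-' and is already the tag.
    impl, sep, rest = soabi.partition("-")
    if not sep:
        return soabi
    short = {"cpython": "cp"}.get(impl, impl)
    return short + rest
```

So abi_tag("cpython-33m") yields "cp33m", and a debug/pymalloc build's "cpython-33dm" yields the cp33dm suffix discussed earlier in the thread.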
> cpXY-ABI-PLAT (for distributions with C extensions) > pyXY-none-any (for pure-Python distributions) These are the most important, and the ones bdist_wheel can (should) generate without configuration. > Eggs and wininst installers, if they used this convention, would be > the same. As would bdist_msi, as far as I know. So the question is, > what use case is there for anything more complicated than this? The > only possibilities I can see are: > > 1. The stable ABI. At the moment, I don't know how well that's > supported - I don't think the build tools detect whether code only > uses the stable ABI, so they assume the full ABI. Users could claim to > use the stable ABI by manual renaming. But without an agreed and > documented convention for the stable ABI, they can't do that, so I > think it's premature to worry about that case. It's easy enough to add > if needed (see below - it's just another ABI for installers to allow) > > 2. UCS2 vs UCS4. This is dead for Python 3.3+, so not worth > complicating the model for. Python 2 continues to matter. I do not and can not use Python 3 commercially. > 4. Pure-python code which works on multiple versions of Python. This > is a real case, and needs to be considered. Code that is (presumed) > valid on all Python versions within the current major version can be > manually retagged as pyX. And code that supports Python 2 and 3 can be > retagged as py2.py3. To allow forward compatibility, installers should > allow the user to install pyXZ code on Python version X.Y when Z <= Y. > But this should be a user option (possibly off by default) and an > exact match should always be preferred. > > I'm not aware of any other cases that might matter here. The other > implementations may well add further use cases - for example, PyPy can > load (some) cPython code, I believe. But without details, let's wait > to hear from them rather than speculating. PyPy has source compatibility for some CPython extensions, so it counts as a different ABI.
Sometimes code uses ctypes or cffi instead of the CPython ABI (or even includes an .exe that it calls with subprocess); there was some discussion about using the 'none' abi in that case. > 2. An exact match wins every time. Where there are multiple ABIs, the > best match is based on the preference order supplied. Just let an exact match be the only kind of match. Then there is no parsing. The implementation tag is there because packages may have different requirements based on the implementation and version based on if: statements in setup.py. Maybe you use cp3 or py3 when you have added conditional requirements a-la Requires-Dist: argparse; python_version < 2.6 in PKG-INFO? From p.f.moore at gmail.com Wed Sep 26 17:16:40 2012 From: p.f.moore at gmail.com (Paul Moore) Date: Wed, 26 Sep 2012 16:16:40 +0100 Subject: [Python-ideas] bdist naming scheme (compatibility tags) PEP In-Reply-To: References: <50236FD5.2060404@egenix.com> Message-ID: On 26 September 2012 15:53, Daniel Holth wrote: > bdist_wheel is incomplete too. It should read from setup.cfg for > advice on the tags. I wasn't trying to imply that bdist_wheel was the reference, just that it was the best example "in the wild" that exists at the moment. Using setup.cfg to allow user configuration of the tags sounds reasonable. > Does win32 have debug / pymalloc builds? That is why there is a cp33dm ABI. debug yes. That's represented in the DLL names (a _d suffix). I'm not sure about pymalloc. I don't know where the string "cp33dm" comes from; this is why I think the valid values should be documented in the PEP. > On linux imp.get_suffixes() includes > > ('.cpython-33m.so', 'rb', 3), ('.abi3.so', 'rb', 3) > > and the abi tag is just an abbreviation cp33m or abi3.
On Windows, imp.get_suffixes shows: >>> imp.get_suffixes() [('.pyd', 'rb', 3), ('.py', 'U', 1), ('.pyw', 'U', 1), ('.pyc', 'rb', 2)] I don't have a debug build to hand to check that, but Google tells me that Martin von Loewis said: if sys.executable.endswith("_d.exe"): print "Debug version" If relying on the executable name is too unsafe, you can also look at imp.get_suffixes(), which includes "_d.pyd" in a debug build on Windows. >> 2. UCS2 vs UCS4. This is dead for Python 3.3+, so not worth >> complicating the model for. > > Python 2 continues to matter. I do not and can not use Python 3 commercially. I don't dispute this, but I'm not sure how the PEP should reflect this. Regardless, if distinguishing UCS2 vs UCS4 matters, the PEP should clarify how to do so. >> 2. An exact match wins every time. Where there are multiple ABIs, the >> best match is based on the preference order supplied. > > Just let an exact match be the only kind of match. Then there is no parsing. I can see that argument, but to me it makes documenting (and understanding!) what an implementation/installer is saying when it lists the tags it will accept quite difficult. Maybe I'm just being dense :-) > The implementation tag is there because packages may have different > requirements based on the implementation and version based on if: > statements in setup.py. Maybe you use cp3 or py3 when you have added > conditional requirements a-la Requries-Dist: argparse; python_version > < 2.6 in PKG-INFO? I'm sorry, that doesn't make any sense to me. Paul From dholth at gmail.com Wed Sep 26 17:29:18 2012 From: dholth at gmail.com (Daniel Holth) Date: Wed, 26 Sep 2012 11:29:18 -0400 Subject: [Python-ideas] bdist naming scheme (compatibility tags) PEP In-Reply-To: References: <50236FD5.2060404@egenix.com> Message-ID: On Wed, Sep 26, 2012 at 11:16 AM, Paul Moore wrote: > On 26 September 2012 15:53, Daniel Holth wrote: > I don't dispute this, but I'm not sure how the PEP should reflect > this. 
Regardless, if distinguishing UCS2 vs UCS4 matters, the PEP > should clarify how to do so. ABIs ending with u use UCS4, and the dmu suffixes always appear in that order. Should go into the pep. py27dmu py27du py27mu cp33dm cp33d cp33m >> Just let an exact match be the only kind of match. Then there is no parsing. > > I can see that argument, but to me it makes documenting (and > understanding!) what an implementation/installer is saying when it > lists the tags it will accept quite difficult. Maybe I'm just being > dense :-) Maybe we just need to attach a reference implementation to the PEP. >> The implementation tag is there because packages may have different >> requirements based on the implementation and version based on if: >> statements in setup.py. Maybe you use cp3 or py3 when you have added >> conditional requirements a-la Requires-Dist: argparse; python_version >> < 2.6 in PKG-INFO? > > I'm sorry, that doesn't make any sense to me. When you use the py2 or py3 tags, it would ideally also communicate a promise "this code does not produce a different list of requirements based on the build Python". From josiah.carlson at gmail.com Wed Sep 26 19:02:24 2012 From: josiah.carlson at gmail.com (Josiah Carlson) Date: Wed, 26 Sep 2012 10:02:24 -0700 Subject: [Python-ideas] asyncore: included batteries don't fit In-Reply-To: <20120926081718.GA20843@hephaistos.amsuess.com> References: <20120926081718.GA20843@hephaistos.amsuess.com> Message-ID: On Wed, Sep 26, 2012 at 1:17 AM, chrysn wrote: > On Mon, Sep 24, 2012 at 03:31:37PM -0700, Giampaolo Rodolà
wrote: >> From a chronological standpoint I still think the best thing to do in order >> to fix the "python async problem" once and for all is to first define and >> possibly implement an "async WSGI interface" describing what a standard >> async IO loop/reactor should look like (in terms of API) and how to >> integrate with it, see: >> http://mail.python.org/pipermail/python-ideas/2012-May/015223.html >> http://mail.python.org/pipermail/python-ideas/2012-May/015235.html > > i wasn't aware that pep 3153 exists. given that, my original intention > of this thread should be re-worded into "let's get pep3153 along!". Go ahead and read PEP 3153, we will wait. A careful reading of PEP 3153 will tell you that the intent is to make a "light" version of Twisted built into Python. There isn't any discussion as to *why* this is a good idea, it just lays out the plan of action. Its ideas were gathered from the experience of the Twisted folks. Their experience is substantial, but in the intervening 1.5+ years since Pycon 2011, only the barest of abstract interfaces has been defined (https://github.com/lvh/async-pep/blob/master/async/abstract.py), and no discussion has taken place as to forward migration of the (fairly large) body of existing asyncore code. > i'm not convinced by the api suggested in the first mail, as it sounds > very unix centric (poll, read/write/error). i rather imagined leaving > the details of the callbackable/mainloop interaction to be platform > details. (a win32evtlog event source just couldn't possibly register > with a select() based main loop). i'd prefer to keep the part that Of course not, but then again no one would attempt to do as much. They would use a WSAEvent reactor, because that's the only thing that it would work with. That said, WSAEvent should arguably be the default on Windows, so this issue shouldn't even come up there. 
Also, worrying about platform-specific details like "what if someone uses a source that is relatively uncommon on the platform" is a red-herring; get the interface/api right, build it, and start using it. To the point, Giampaolo already has a reactor that implements the interface (more or less "idea #3" from his earlier message), and it's been used in production (under staggering ftp(s) load). Even better, it offers effectively transparent replacement of the existing asyncore loop, and supports existing asyncore-derived classes. It is available: https://code.google.com/p/pyftpdlib/source/browse/trunk/pyftpdlib/lib/ioloop.py > registers with the a main loop concentrated to a very lowlevel common > denominator. for unix, that'd mean that there is a basic callbackable > for "things that receive events because they have a fileno". everything > above that, eg the distinction whether a "w" event means that we can > write() or that we must accept() could happen above that and wouldn't > have to be concerned with the main loop integration any more. > > in case (pseudo)code gets the idea over better: > > class UnixFilehandle(object): > def __init__(self, fileno): > self._fileno = fileno > > def register_with_main_loop(self, mainloop): > # it might happen that the main loop doesn't support unix > # filenos. tough luck, in that case -- the developer should > # select a more suitable main loop. 
> mainloop.register_unix_fileno(self._fileno, self) > > def handle_r_event(self): raise NotImplementedError("Not configured to receive that sort of event") > # if you're sure you'd never receive any anyway, you can > # not-register them by setting them None in the subclass > handle_w_event = handle_e_event = handle_r_event > > class SocketServer(UnixFilehandle): > def __init__(self, socket): > self._socket = socket > UnixFilehandle.init(socket.fileno()) > > def handle_w_event(self): > self.handle_accept_event(self.socket.accept()) > > other interfaces parallel to the file handle interface would, for > example, handle unix signals. (built atop of that, like the > accept-handling socket server, could be an that deals with child > processes.) the interface for android might look different again, > because there is no main loop and select never gets called by the > application. That is, incidentally, what Giampaolo has implemented already. I encourage you to read the source I linked above. >> From there the python stdlib *might* grow a new module implementing the >> "async WSGI interface" (let's call it asyncore2) and some of the stdlib >> batteries such as socketserver can possibly use it. >> >> In my mind this is the ideal long-term scenario but even managing to define >> an "async WSGI interface" alone would be a big step forward. > > i'd welcome such an interface. if asyncore can then be retrofitted to > accept that interface too w/o breaking compatibility, it'd be nice, but > if not, it's asyncore2, then. Easily done, because it's already been done ;) >> Again, at this point in time what you're proposing looks too vague, >> ambitious and premature to me. > > please don't get me wrong -- i'm not proposing anything for immediate > action, i just want to start a thinking process towards a better > integrated stdlib. I am curious as to what you mean by "a better integrated stdlib". 
A new interface that doesn't allow people to easily migrate from an existing (and long-lived, though flawed) standard library is not better integration. Better integration requires allowing previous users to migrate, while encouraging new users to join in with any later development. That's what Giampaolo's suggested interface offers on the lowest level: something to handle file-handle reactors, combined with a scheduler. From there, whether layers like Twisted are evolved, or more shallow layers (like many existing asyncore-derived classes) is yet to be determined by actual people using it. > On Mon, Sep 24, 2012 at 05:02:08PM -0700, Josiah Carlson wrote: >> 1. Whatever reactors are available, you need to be able to instantiate >> multiple of different types of reactors and multiple instances of the >> same type of reactor simultaneously (to support multiple threads >> handling different groups of reactors, or different reactors for >> different types of objects on certain platforms). While this allows >> for insanity in the worst-case, we're all consenting adults here, so >> shouldn't be limited by reactor singletons. There should be a default >> reactor class, which is defined on module/package import (use the >> "best" one for the platform). > > i think that's already common. with asyncore, you can have different > maps (just one is installed globally as default). with the gtk main > loop, it's a little tricky (the gtk.main() function doesn't simply take > an argument), but the underlying glib can do that afaict. Remember that a reactor isn't just a dictionary of file handles to do stuff on, it's the thing that determines what underlying platform mechanics will be used to multiplex across channels. But that level of detail will be generally unused by most people, as most people will only use one at a time.
The point of offering multiple reactors is to allow people to be flexible if they choose (or to pick from the different reactors if they know that one is faster for their number of expected handles). >> 2. The API must be simple. I am not sure that it can get easier than >> Idea #3 from: >> http://mail.python.org/pipermail/python-ideas/2012-May/015245.html > > it's good that the necessities of call_later and call_every are > mentioned here, i'd have forgotten about them. > > we've talked about many things we'd need in a python asynchronous > interface (not implementation), so what are the things we *don't* need? > (so we won't start building a framework like twisted). i'll start: > > * high-level protocol handling (can be extra modules atop of it) > * ssl > * something like the twisted delayed framework (not sure about that, i > guess the twisted people will have good reason to use it, but i don't > see compelling reasons for such a thing in a minimal interface from my > limited pov) > * explicit connection handling (retries, timeouts -- would be up to the > user as well, eg urllib might want to set up a timeout and retries for > asynchronous url requests) I disagree with the last 3. If you have an IO loop, more often than not you want an opportunity to do something later in the same context. This is commonly the case for bandwidth limiting, connection timeouts, etc., which are otherwise *very* difficult to do at a higher level (which are the reasons why schedulers are built into IO loops). Further, SSL in async can be tricky to get right. Having the 20-line SSL layer as an available class is a good idea, and will save people time by not having them re-invent it (poorly or incorrectly) every time. Regards, - Josiah From mikegraham at gmail.com Wed Sep 26 20:51:22 2012 From: mikegraham at gmail.com (Mike Graham) Date: Wed, 26 Sep 2012 14:51:22 -0400 Subject: [Python-ideas] Deprecate the round builtin Message-ID: The builtin round function is completely useless. 
I've never seen anyone use it constructively. Usually people using it are new programmers who are not comfortable with or aware of string formatting. Sometimes people use it to poorly replicate functionality that's implemented correctly in the decimal module. Mike From guido at python.org Wed Sep 26 20:55:10 2012 From: guido at python.org (Guido van Rossum) Date: Wed, 26 Sep 2012 11:55:10 -0700 Subject: [Python-ideas] Deprecate the round builtin In-Reply-To: References: Message-ID: On Wed, Sep 26, 2012 at 11:51 AM, Mike Graham wrote: > The builtin round function is completely useless. I've never seen > anyone use it constructively. Usually people using it are new > programmers who are not comfortable with or aware of string > formatting. Sometimes people use it to poorly replicate functionality > that's implemented correctly in the decimal module. You're probably right. It has ancient ancestry: it was one of the functions I copied directly from ABC. (It's actually more useful now that floats are printed with minimal digits.) But given the pain of removing a builtin, is it worth it? maybe we can just document the better ways of accomplishing its tasks? -- --Guido van Rossum (python.org/~guido) From dickinsm at gmail.com Wed Sep 26 21:22:19 2012 From: dickinsm at gmail.com (Mark Dickinson) Date: Wed, 26 Sep 2012 20:22:19 +0100 Subject: [Python-ideas] Deprecate the round builtin In-Reply-To: References: Message-ID: On Wed, Sep 26, 2012 at 7:51 PM, Mike Graham wrote: > The builtin round function is completely useless. I've never seen > anyone use it constructively I disagree that it's *completely* useless: the one-argument form (take a float, return the closest integer) is a fundamental and useful mathematics operation, just like floor and ceiling. It would be crazy to get rid of that. I could live with it moving from builtins into the math module, though. Agreed that the two-argument form causes a lot of confusion, though. 
-- Mark From mal at egenix.com Wed Sep 26 21:36:54 2012 From: mal at egenix.com (M.-A. Lemburg) Date: Wed, 26 Sep 2012 21:36:54 +0200 Subject: [Python-ideas] Deprecate the round builtin In-Reply-To: References: Message-ID: <50635956.4050409@egenix.com> Mark Dickinson wrote: > On Wed, Sep 26, 2012 at 7:51 PM, Mike Graham wrote: >> The builtin round function is completely useless. I've never seen >> anyone use it constructively > > I disagree that it's *completely* useless: the one-argument form > (take a float, return the closest integer) is a fundamental and useful > mathematics operation, just like floor and ceiling. It would be crazy > to get rid of that. I could live with it moving from builtins into > the math module, though. > > Agreed that the two-argument form causes a lot of confusion, though. It's actually quite common in finance and time calculations to round to the nearest say basis point, cent or say micro second in calculations (rather than just string formatting). round() is perfect for that and easy to use. -- Marc-Andre Lemburg eGenix.com Professional Python Services directly from the Source (#1, Sep 26 2012) >>> Python/Zope Consulting and Support ... http://www.egenix.com/ >>> mxODBC.Zope.Database.Adapter ... http://zope.egenix.com/ >>> mxODBC, mxDateTime, mxTextTools ... http://python.egenix.com/ ________________________________________________________________________ 2012-10-29: PyCon DE 2012, Leipzig, Germany ... 33 days to go 2012-10-23: Python Meeting Duesseldorf ... 27 days to go 2012-09-26: Released mxODBC.Connect 2.0.1 ... http://egenix.com/go34 2012-09-25: Released mxODBC 3.2.1 ... http://egenix.com/go33 eGenix.com Software, Skills and Services GmbH Pastor-Loeh-Str.48 D-40764 Langenfeld, Germany. CEO Dipl.-Math. 
Marc-Andre Lemburg Registered at Amtsgericht Duesseldorf: HRB 46611 http://www.egenix.com/company/contact/ From joshua.landau.ws at gmail.com Wed Sep 26 21:47:09 2012 From: joshua.landau.ws at gmail.com (Joshua Landau) Date: Wed, 26 Sep 2012 20:47:09 +0100 Subject: [Python-ideas] Deprecate the round builtin In-Reply-To: <50635956.4050409@egenix.com> References: <50635956.4050409@egenix.com> Message-ID: On 26 September 2012 20:36, M.-A. Lemburg wrote: > Mark Dickinson wrote: > > On Wed, Sep 26, 2012 at 7:51 PM, Mike Graham > wrote: > >> The builtin round function is completely useless. I've never seen > >> anyone use it constructively > > > > I disagree that it's *completely* useless: the one-argument form > > (take a float, return the closest integer) is a fundamental and useful > > mathematics operation, just like floor and ceiling. It would be crazy > > to get rid of that. I could live with it moving from builtins into > > the math module, though. > > > > Agreed that the two-argument form causes a lot of confusion, though. > > It's actually quite common in finance and time calculations to > round to the nearest say basis point, cent or say micro second > in calculations (rather than just string formatting). round() > is perfect for that and easy to use. > Agreed. I've rounded before. It's useful. Maybe not as useful as all() or whatever, but I don't see how it's "redundant". -------------- next part -------------- An HTML attachment was scrubbed... URL: From dickinsm at gmail.com Wed Sep 26 22:02:39 2012 From: dickinsm at gmail.com (Mark Dickinson) Date: Wed, 26 Sep 2012 21:02:39 +0100 Subject: [Python-ideas] Deprecate the round builtin In-Reply-To: <50635956.4050409@egenix.com> References: <50635956.4050409@egenix.com> Message-ID: On Wed, Sep 26, 2012 at 8:36 PM, M.-A. 
Lemburg wrote: > It's actually quite common in finance and time calculations to > round to the nearest say basis point, cent or say micro second > in calculations That's exactly where the problems creep in, though. Naive users expect rounding to give 'correct' results for decimal halfway cases, and then are surprised when it doesn't. >>> round(2.675, 2) 2.67 So you end up explaining again and again that computing binary approximations to decimal rounds of binary approximations of decimal halfway cases is a bad idea. Mark From mal at egenix.com Wed Sep 26 22:17:36 2012 From: mal at egenix.com (M.-A. Lemburg) Date: Wed, 26 Sep 2012 22:17:36 +0200 Subject: [Python-ideas] Deprecate the round builtin In-Reply-To: References: <50635956.4050409@egenix.com> Message-ID: <506362E0.3010405@egenix.com> Mark Dickinson wrote: > On Wed, Sep 26, 2012 at 8:36 PM, M.-A. Lemburg wrote: >> It's actually quite common in finance and time calculations to >> round to the nearest say basis point, cent or say micro second >> in calculations > > That's exactly where the problems creep in, though. Naive users > expect rounding to give 'correct' results for decimal halfway cases, > and then are surprised when it doesn't. > >>>> round(2.675, 2) > 2.67 > > So you end up explaining again and again that computing binary > approximations to decimal rounds of binary approximations of decimal > halfway cases is a bad idea. But that's the fault of round(), is it ? ;-) It's more one of educating people of what to expect when working with floats. Your example is a typical case that comes up when people enter examples and wonder why they don't see the expected results. In calculations, statistics, numeric, etc. such corner cases are not all that common, so things are not as bad as they may appear on first sight. -- Marc-Andre Lemburg eGenix.com Professional Python Services directly from the Source (#1, Sep 26 2012) >>> Python/Zope Consulting and Support ... 
http://www.egenix.com/ >>> mxODBC.Zope.Database.Adapter ... http://zope.egenix.com/ >>> mxODBC, mxDateTime, mxTextTools ... http://python.egenix.com/ ________________________________________________________________________ 2012-10-29: PyCon DE 2012, Leipzig, Germany ... 33 days to go 2012-10-23: Python Meeting Duesseldorf ... 27 days to go 2012-09-26: Released mxODBC.Connect 2.0.1 ... http://egenix.com/go34 2012-09-25: Released mxODBC 3.2.1 ... http://egenix.com/go33 eGenix.com Software, Skills and Services GmbH Pastor-Loeh-Str.48 D-40764 Langenfeld, Germany. CEO Dipl.-Math. Marc-Andre Lemburg Registered at Amtsgericht Duesseldorf: HRB 46611 http://www.egenix.com/company/contact/ From mikegraham at gmail.com Wed Sep 26 22:36:28 2012 From: mikegraham at gmail.com (Mike Graham) Date: Wed, 26 Sep 2012 16:36:28 -0400 Subject: [Python-ideas] Deprecate the round builtin In-Reply-To: <50635956.4050409@egenix.com> References: <50635956.4050409@egenix.com> Message-ID: On Wed, Sep 26, 2012 at 3:36 PM, M.-A. Lemburg wrote: > It's actually quite common in finance and time calculations to > round to the nearest say basis point, cent or say micro second > in calculations (rather than just string formatting). round() > is perfect for that and easy to use. Rounding to the nearest cent is an important operation, but it's not one that round really handles well. It certainly can't always round you exactly to the nearest hundredth--it can round you there with some inaccuracy when you're doing it to have an exact number. We have the decimal module (or sometimes just plain int) to handle this in a much more robust, correct way. I'm not personally familiar with the cases where one would want to round time like that for computations, but I can't help but suspect that if you want a quantity in a number of microseconds, you'd be wanting it to be exact as well. round only appears to be useful. Using it to try to get something "in cents" or similar is a strong code smell and probably a bug. 
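(A minimal sketch of the decimal-module approach referred to here: quantize() lands exactly on a cent boundary, which two-argument round() on a binary float cannot do:)

```python
from decimal import Decimal, ROUND_HALF_UP

price = Decimal("2.675")
cents = price.quantize(Decimal("0.01"), rounding=ROUND_HALF_UP)
assert cents == Decimal("2.68")  # the decimal halfway case rounds as expected

# The float 2.675 is really a hair *below* 2.675, so the "same" operation
# on a binary float lands on the other side of the halfway point:
assert round(2.675, 2) == 2.67
```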
Mike From jsbueno at python.org.br Wed Sep 26 23:00:24 2012 From: jsbueno at python.org.br (Joao S. O. Bueno) Date: Wed, 26 Sep 2012 18:00:24 -0300 Subject: [Python-ideas] Deprecate the round builtin In-Reply-To: References: <50635956.4050409@egenix.com> Message-ID: > On Wed, Sep 26, 2012 at 3:36 PM, M.-A. Lemburg wrote: >> It's actually quite common in finance and time calculations to >> round to the nearest say basis point, cent or say micro second >> in calculations (rather than just string formatting). round() >> is perfect for that and easy to use. > On the other hand, having round with a second parameter to return a Decimal would be quite useful. I can't see a way of changing the type returned by the built-in round without crashing half the World --- but maybe it would be possible to deprecate only the two-parameter form of the built-in round, and add a decimal.round Decimal factory? js -><- From dreamingforward at gmail.com Wed Sep 26 23:15:52 2012 From: dreamingforward at gmail.com (Mark Adam) Date: Wed, 26 Sep 2012 16:15:52 -0500 Subject: [Python-ideas] Deprecate the round builtin In-Reply-To: References: <50635956.4050409@egenix.com> Message-ID: > On the other hand, having round with a second parameter to return a Decimal > would be quite usefull. I can't see a way of changing the type returned > by the built-in round without crashing half the World --- but maybe it would > be possible to deprecate only the two-parameter form of the built-in round, > and add a decimal.round Decimal factory? On second thought, the idea of "namespaces" within the interpreter (like Tim Peters was suggesting I think in the doctest module) could put some order to built-ins and globals within the interpreter. round(), hex() and others could be in the namespace "Numbers" (Named by their associated type), similarly for the other type-specific builtins. Create a keyword "expose" to dump a particular namespace into the global scope when you don't want to type "Numbers.round(f)".
Thoughts? markj From phd at phdru.name Wed Sep 26 23:21:27 2012 From: phd at phdru.name (Oleg Broytman) Date: Thu, 27 Sep 2012 01:21:27 +0400 Subject: [Python-ideas] Deprecate the round builtin In-Reply-To: References: <50635956.4050409@egenix.com> Message-ID: <20120926212127.GA9680@iskra.aviel.ru> On Wed, Sep 26, 2012 at 04:15:52PM -0500, Mark Adam wrote: > On second thought, the idea of "namespaces" within the interpreter > (like Tim Peters was suggesting I think in the doctest module) could > put some order to built-ins and globals within the interpreter. > round(), hex() and others could be in the namespace "Numbers" (Named > by their associated type), similarly for the other type-specific > builtins. > > Create a keyword "expose" to dump a particular namespace into the > global scope when you don't want to type "Numbers.round(f)". What is the difference between such namespaces and modules (which *are* namespaces)? Oleg. -- Oleg Broytman http://phdru.name/ phd at phdru.name Programmers don't die, they just GOSUB without RETURN. From dholth at gmail.com Wed Sep 26 23:21:40 2012 From: dholth at gmail.com (Daniel Holth) Date: Wed, 26 Sep 2012 17:21:40 -0400 Subject: [Python-ideas] Deprecate the round builtin In-Reply-To: References: <50635956.4050409@egenix.com> Message-ID: Normally deprecation means you keep it forever but don't mention it much in the docs... On Sep 26, 2012 5:16 PM, "Mark Adam" wrote: > > On the other hand, having round with a second parameter to return a > Decimal > > would be quite usefull. I can't see a way of changing the type returned > > by the built-in round without crashing half the World --- but maybe it > would > > be possible to deprecate only the two-parameter form of the built-in > round, > > and add a decimal.round Decimal factory? 
> > On second thought, the idea of "namespaces" within the interpreter > (like Tim Peters was suggesting I think in the doctest module) could > put some order to built-ins and globals within the interpreter. > round(), hex() and others could be in the namespace "Numbers" (Named > by their associated type), similarly for the other type-specific > builtins. > > Create a keyword "expose" to dump a particular namespace into the > global scope when you don't want to type "Numbers.round(f)". > > Thoughts? > > markj > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > -------------- next part -------------- An HTML attachment was scrubbed... URL: From greg.ewing at canterbury.ac.nz Wed Sep 26 23:29:20 2012 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Thu, 27 Sep 2012 09:29:20 +1200 Subject: [Python-ideas] Deprecate the round builtin In-Reply-To: <506362E0.3010405@egenix.com> References: <50635956.4050409@egenix.com> <506362E0.3010405@egenix.com> Message-ID: <506373B0.2010502@canterbury.ac.nz> M.-A. Lemburg wrote: > In calculations, statistics, numeric, etc. such corner cases are > not all that common, so things are not as bad as they may appear > on first sight. If the corner cases don't matter, I'd say you don't need to round your internal representations in the first place. Rounding on output is sufficient and probably preferable. If they *do* matter -- such as when cents need to add up exactly -- you're much better off using Decimals. Two-argument round() is an attractive nuisance in that case, IMO. 
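(Rounding on output here just means string formatting: keep full precision internally and round only for display. A small sketch:)

```python
total = 2 / 3
display = "{:.2f}".format(total)  # round only when presenting the value
assert display == "0.67"

# Even a float sum carrying accumulated binary error displays cleanly:
assert "{:.2f}".format(0.1 + 0.2) == "0.30"
```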
-- Greg From dreamingforward at gmail.com Wed Sep 26 23:30:33 2012 From: dreamingforward at gmail.com (Mark Adam) Date: Wed, 26 Sep 2012 16:30:33 -0500 Subject: [Python-ideas] Deprecate the round builtin In-Reply-To: <20120926212127.GA9680@iskra.aviel.ru> References: <50635956.4050409@egenix.com> <20120926212127.GA9680@iskra.aviel.ru> Message-ID: On Wed, Sep 26, 2012 at 4:21 PM, Oleg Broytman wrote: > On Wed, Sep 26, 2012 at 04:15:52PM -0500, Mark Adam wrote: >> On second thought, the idea of "namespaces" within the interpreter >> (like Tim Peters was suggesting I think in the doctest module) could >> put some order to built-ins and globals within the interpreter. >> round(), hex() and others could be in the namespace "Numbers" (Named >> by their associated type), similarly for the other type-specific >> builtins. >> >> Create a keyword "expose" to dump a particular namespace into the >> global scope when you don't want to type "Numbers.round(f)". > > What is the difference between such namespaces and modules (which > *are* namespaces)? Modules reside on file and are changeable, Namespaces solely in memory, defined by the Interpreter itself. mark From amauryfa at gmail.com Wed Sep 26 23:32:52 2012 From: amauryfa at gmail.com (Amaury Forgeot d'Arc) Date: Wed, 26 Sep 2012 23:32:52 +0200 Subject: [Python-ideas] Deprecate the round builtin In-Reply-To: <506362E0.3010405@egenix.com> References: <50635956.4050409@egenix.com> <506362E0.3010405@egenix.com> Message-ID: 2012/9/26 M.-A. Lemburg : >>>>> round(2.675, 2) >> 2.67 >> >> So you end up explaining again and again that computing binary >> approximations to decimal rounds of binary approximations of decimal >> halfway cases is a bad idea. > > But that's the fault of round(), is it ? ;-) It's more one of > educating people of what to expect when working with floats. Could we change the round() behavior? The second parameter is the number of decimal places, so the operation could be defined in terms of decimal digits. 
A similar change was already done for float.__str__. -- Amaury Forgeot d'Arc From phd at phdru.name Wed Sep 26 23:51:30 2012 From: phd at phdru.name (Oleg Broytman) Date: Thu, 27 Sep 2012 01:51:30 +0400 Subject: [Python-ideas] Deprecate the round builtin In-Reply-To: References: <50635956.4050409@egenix.com> <20120926212127.GA9680@iskra.aviel.ru> Message-ID: <20120926215130.GA9810@iskra.aviel.ru> On Wed, Sep 26, 2012 at 04:30:33PM -0500, Mark Adam wrote: > On Wed, Sep 26, 2012 at 4:21 PM, Oleg Broytman wrote: > > On Wed, Sep 26, 2012 at 04:15:52PM -0500, Mark Adam wrote: > >> On second thought, the idea of "namespaces" within the interpreter > >> (like Tim Peters was suggesting I think in the doctest module) could > >> put some order to built-ins and globals within the interpreter. > >> round(), hex() and others could be in the namespace "Numbers" (Named > >> by their associated type), similarly for the other type-specific > >> builtins. > >> > >> Create a keyword "expose" to dump a particular namespace into the > >> global scope when you don't want to type "Numbers.round(f)". > > > > What is the difference between such namespaces and modules (which > > *are* namespaces)? > > Modules reside on file and are changeable, Oh, really? What is the file for module sys? thread? zipimport? > Namespaces solely in > memory, defined by the Interpreter itself. Oleg. -- Oleg Broytman http://phdru.name/ phd at phdru.name Programmers don't die, they just GOSUB without RETURN. From python at mrabarnett.plus.com Thu Sep 27 00:07:01 2012 From: python at mrabarnett.plus.com (MRAB) Date: Wed, 26 Sep 2012 23:07:01 +0100 Subject: [Python-ideas] Deprecate the round builtin In-Reply-To: <506362E0.3010405@egenix.com> References: <50635956.4050409@egenix.com> <506362E0.3010405@egenix.com> Message-ID: <50637C85.5060003@mrabarnett.plus.com> On 2012-09-26 21:17, M.-A. Lemburg wrote: > Mark Dickinson wrote: >> On Wed, Sep 26, 2012 at 8:36 PM, M.-A. 
Lemburg wrote: >>> It's actually quite common in finance and time calculations to >>> round to the nearest say basis point, cent or say micro second >>> in calculations >> >> That's exactly where the problems creep in, though. Naive users >> expect rounding to give 'correct' results for decimal halfway cases, >> and then are surprised when it doesn't. >> >>>>> round(2.675, 2) >> 2.67 >> >> So you end up explaining again and again that computing binary >> approximations to decimal rounds of binary approximations of decimal >> halfway cases is a bad idea. > > But that's the fault of round(), is it ? ;-) It's more one of > educating people of what to expect when working with floats. > > Your example is a typical case that comes up when people enter > examples and wonder why they don't see the expected results. > > In calculations, statistics, numeric, etc. such corner cases are > not all that common, so things are not as bad as they may appear > on first sight. > If we're going to move 'round' into 'math' because it sometimes gives results which puzzle naive users, shouldn't we do the same to 'float'? :-) From alexander.belopolsky at gmail.com Thu Sep 27 00:14:34 2012 From: alexander.belopolsky at gmail.com (Alexander Belopolsky) Date: Wed, 26 Sep 2012 18:14:34 -0400 Subject: [Python-ideas] Deprecate the round builtin In-Reply-To: <50637C85.5060003@mrabarnett.plus.com> References: <50635956.4050409@egenix.com> <506362E0.3010405@egenix.com> <50637C85.5060003@mrabarnett.plus.com> Message-ID: <22AB0A86-59B4-436C-92F5-A3CB93C9B7A7@gmail.com> On Sep 26, 2012, at 6:07 PM, MRAB wrote: > If we're going to move 'round' into 'math' because it sometimes gives > results which puzzle naive users, shouldn't we do the same to 'float'? > :-) +1 In Python 4, type(1.5) should be Decimal, not float. 
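(For illustration, the semantic difference behind the half-joking suggestion:)

```python
from decimal import Decimal

# Binary floats cannot represent most decimal literals exactly:
assert 1.1 + 2.2 != 3.3  # the sum is actually 3.3000000000000003

# Decimal arithmetic behaves the way people who write "1.1" expect:
assert Decimal("1.1") + Decimal("2.2") == Decimal("3.3")
```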
From steve at pearwood.info Thu Sep 27 01:07:40 2012 From: steve at pearwood.info (Steven D'Aprano) Date: Thu, 27 Sep 2012 09:07:40 +1000 Subject: [Python-ideas] Deprecate the round builtin In-Reply-To: References: <50635956.4050409@egenix.com> Message-ID: <50638ABC.4030303@pearwood.info> On 27/09/12 06:02, Mark Dickinson wrote: > On Wed, Sep 26, 2012 at 8:36 PM, M.-A. Lemburg wrote: >> It's actually quite common in finance and time calculations to >> round to the nearest say basis point, cent or say micro second >> in calculations > > That's exactly where the problems creep in, though. Naive users > expect rounding to give 'correct' results for decimal halfway cases, > and then are surprised when it doesn't. > >>>> round(2.675, 2) > 2.67 > > So you end up explaining again and again that computing binary > approximations to decimal rounds of binary approximations of decimal > halfway cases is a bad idea. Pretty much *everything* about binary floats is surprising to people who expect decimal semantics. Unless we're going to make Decimal the default floating point type, and shift binary floats to a module, I don't see any way around that, and I particularly don't see any reason to single round() out as more confusing than any of the other float gotchas. Speaking from my experience on the tutor@ and python-list at python.org mailing lists, I believe that problems with round are vanishingly rare, probably an order of magnitude fewer than "why doesn't Python add my two floats correctly?" type questions.
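(The canonical form of that question, along with the usual fix -- math.fsum computes a correctly-rounded sum:)

```python
import math

values = [0.1] * 10
assert sum(values) != 1.0        # naive left-to-right addition: 0.9999999999999999
assert math.fsum(values) == 1.0  # fsum tracks the partial errors and rounds once
```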
-- Steven From raymond.hettinger at gmail.com Thu Sep 27 01:47:57 2012 From: raymond.hettinger at gmail.com (Raymond Hettinger) Date: Wed, 26 Sep 2012 19:47:57 -0400 Subject: [Python-ideas] Deprecate the round builtin In-Reply-To: <50638ABC.4030303@pearwood.info> References: <50635956.4050409@egenix.com> <50638ABC.4030303@pearwood.info> Message-ID: <7AF30372-37AD-4CB9-8709-0119C9EA138D@gmail.com> On Sep 26, 2012, at 7:07 PM, Steven D'Aprano wrote: > Pretty much *everything* about binary floats is surprising to people > who expect decimal semantics. Unless we're going to make Decimal the > default floating point type, and shift binary floats to a module, I > don't see any way around that, and a particularly don't see any reason > to single round() out as more confusing than any of the other float > gotchas. > > Speaking from my experience on the tutor@ and python-list at python.org > mailing lists, I believe that problems with round are vanishingly rare, > probably an order of magnitude fewer than "why doesn't Python add my > two floats correctly?" type questions. My experience teaching and consulting matches your experience on tutor. I agree with your suggestion to leave round() untouched. Raymond -------------- next part -------------- An HTML attachment was scrubbed... URL: From janssen at parc.com Thu Sep 27 02:32:54 2012 From: janssen at parc.com (Bill Janssen) Date: Wed, 26 Sep 2012 17:32:54 -0700 Subject: [Python-ideas] Deprecate the round builtin In-Reply-To: <50637C85.5060003@mrabarnett.plus.com> References: <50635956.4050409@egenix.com> <506362E0.3010405@egenix.com> <50637C85.5060003@mrabarnett.plus.com> Message-ID: <16654.1348705974@parc.com> MRAB wrote: > If we're going to move 'round' into 'math' because it sometimes gives > results which puzzle naive users, shouldn't we do the same to 'float'? > :-) Ah, but float is a type, not a function.
Bill From ben+python at benfinney.id.au Thu Sep 27 02:45:43 2012 From: ben+python at benfinney.id.au (Ben Finney) Date: Thu, 27 Sep 2012 10:45:43 +1000 Subject: [Python-ideas] Deprecate the round builtin References: <50635956.4050409@egenix.com> <506362E0.3010405@egenix.com> Message-ID: <7wipb0e1jc.fsf@benfinney.id.au> "M.-A. Lemburg" writes: > Mark Dickinson wrote: > >>>> round(2.675, 2) > > 2.67 > > > > So you end up explaining again and again that computing binary > > approximations to decimal rounds of binary approximations of decimal > > halfway cases is a bad idea. > > But that's the fault of round(), is it ? ;-) It's more one of > educating people of what to expect when working with floats. It's the fault of the two-parameter ‘round’, yes. It is an attractive nuisance that appears to promise one thing but doesn't deliver, and makes it easy to do the wrong thing by mistake. Where feasible, and where it doesn't unreasonably restrict functionality, IMO Python should make it difficult to do the wrong thing by mistake. > Your example is a typical case that comes up when people enter > examples and wonder why they don't see the expected results. I think this is an argument that deprecating, and eventually removing, the two-parameter form of ‘round’, would be helpful overall. -- \ "Consider the daffodil. And while you're doing that, I'll be | `\ over here, looking through your stuff." --Jack Handey | _o__) | Ben Finney From ironfroggy at gmail.com Thu Sep 27 03:14:32 2012 From: ironfroggy at gmail.com (Calvin Spealman) Date: Wed, 26 Sep 2012 21:14:32 -0400 Subject: [Python-ideas] Deprecate the round builtin In-Reply-To: References: Message-ID: On Wed, Sep 26, 2012 at 2:55 PM, Guido van Rossum wrote: > On Wed, Sep 26, 2012 at 11:51 AM, Mike Graham wrote: >> The builtin round function is completely useless. I've never seen >> anyone use it constructively. Usually people using it are new >> programmers who are not comfortable with or aware of string >> formatting.
Sometimes people use it to poorly replicate functionality >> that's implemented correctly in the decimal module. > > You're probably right. It has ancient ancestry: it was one of the > functions I copied directly from ABC. (It's actually more useful now > that floats are printed with minimal digits.) > > But given the pain of removing a builtin, is it worth it? maybe we can > just document the better ways of accomplishing its tasks? I think it is reasonable with the right justifications, deprecation period, and obvious migration path. Also, I'd be completely in support of dropping round() and agree it gets misused and leads to too much confusion. We should promote the right ways, and some times to show the right path you need to lock another door and throw away the key. > -- > --Guido van Rossum (python.org/~guido) > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas -- Read my blog! I depend on your acceptance of my opinion! I am interesting! http://techblog.ironfroggy.com/ Follow me if you're into that sort of thing: http://www.twitter.com/ironfroggy From steve at pearwood.info Thu Sep 27 03:46:52 2012 From: steve at pearwood.info (Steven D'Aprano) Date: Thu, 27 Sep 2012 11:46:52 +1000 Subject: [Python-ideas] Deprecate the round builtin In-Reply-To: <16654.1348705974@parc.com> References: <50635956.4050409@egenix.com> <506362E0.3010405@egenix.com> <50637C85.5060003@mrabarnett.plus.com> <16654.1348705974@parc.com> Message-ID: <5063B00C.8080207@pearwood.info> On 27/09/12 10:32, Bill Janssen wrote: > MRAB wrote: > >> If we're going to move 'round' into 'math' because it sometimes gives >> results which puzzle naive users, shouldn't we do the same to 'float'? >> :-) > > Ah, but float is a type, not a function. What difference does that make? Floats sometimes give results which puzzle naive users. 
py> L = [1/10]*10 py> print(L) [0.1, 0.1, 0.1, 0.1, 0.1, 0.1, 0.1, 0.1, 0.1, 0.1] py> sum(L) # should be 1.0 0.9999999999999999 "Why doesn't Python add up correctly?" And so we have to explain that the float 0.1 is not actually 0.1 because you can't represent 0.1 as a finite binary fraction, due to some fairly subtle mathematics that goes right past most people. The float 0.1 is actually a tiny bit *larger* than the decimal 0.1, but when you add ten of them together, you end up with a number that is a tiny bit *smaller* than the expected result. Don't you just love binary floating point? Matthew used a smiley there, but I think there is a very strong case for making the default floating point numeric type Decimal rather than float. Decimals behave more like people expect, and they interact better with other numeric types (ints and Fractions) than floats. Obviously this will be a big change, almost certainly requiring a PEP and a long lead time, e.g. Decimal could become a built-in in Python 3.4, then in 3.5 you could do "from __future__ import decimal_floats", and in 3.6 it could be the standard behaviour, or perhaps a runtime flag to switch between binary and decimal floats. I hate to think how much work would be involved. -- Steven From python at mrabarnett.plus.com Thu Sep 27 04:11:06 2012 From: python at mrabarnett.plus.com (MRAB) Date: Thu, 27 Sep 2012 03:11:06 +0100 Subject: [Python-ideas] Deprecate the round builtin In-Reply-To: <16654.1348705974@parc.com> References: <50635956.4050409@egenix.com> <506362E0.3010405@egenix.com> <50637C85.5060003@mrabarnett.plus.com> <16654.1348705974@parc.com> Message-ID: <5063B5BA.90300@mrabarnett.plus.com> On 2012-09-27 01:32, Bill Janssen wrote: > MRAB wrote: > >> If we're going to move 'round' into 'math' because it sometimes gives >> results which puzzle naive users, shouldn't we do the same to 'float'? >> :-) > > Ah, but float is a type, not a function. > They are both (first-class) objects, aren't they?
Why treat them differently? Surely such discrimination is unPythonic? :-) From steve at pearwood.info Thu Sep 27 04:15:41 2012 From: steve at pearwood.info (Steven D'Aprano) Date: Thu, 27 Sep 2012 12:15:41 +1000 Subject: [Python-ideas] Deprecate the round builtin In-Reply-To: References: Message-ID: <5063B6CD.4030405@pearwood.info> On 27/09/12 11:14, Calvin Spealman wrote: > On Wed, Sep 26, 2012 at 2:55 PM, Guido van Rossum wrote: >> On Wed, Sep 26, 2012 at 11:51 AM, Mike Graham wrote: >>> The builtin round function is completely useless. I've never seen >>> anyone use it constructively. Usually people using it are new >>> programmers who are not comfortable with or aware of string >>> formatting. Sometimes people use it to poorly replicate functionality >>> that's implemented correctly in the decimal module. >> >> You're probably right. It has ancient ancestry: it was one of the >> functions I copied directly from ABC. (It's actually more useful now >> that floats are printed with minimal digits.) >> >> But given the pain of removing a builtin, is it worth it? maybe we can >> just document the better ways of accomplishing its tasks? > > I think it is reasonable with the right justifications, deprecation period, and > obvious migration path. > > Also, I'd be completely in support of dropping round() and agree it gets misused > and leads to too much confusion. We should promote the right ways, and some > times to show the right path you need to lock another door and throw > away the key. I don't believe that round gets misused and causes confusion to any significant degree. That belief is based on many years experience on two high-traffic mailing lists with many beginners. I've seen plenty of examples of beginners confused that Python can't add floats correctly, but if I've ever seen somebody confused by round, it was so uncommon and so long ago I've forgotten it. 
It seems to me that this proposal is based on a purely theoretical fear that some people manage to escape being confused by the far more obvious floating point gotchas[1] but can't understand round. There are much more common, and bigger, surprises with binary floats than round, and it seems to me that deprecating it just disrupts those who do use it for no real benefit. So -1 on such a change. I could be convinced to change that to a -0.5 if round were moved to the math module exactly as is, including the second argument. But even that is a disruption for no meaningful benefit. [1] I managed to surprise myself the other week when I "discovered" what was obvious in hindsight, that for sufficiently large values all finite floats are even integer values. I had known the "integer value" part, but was surprised by the "even" part. I shouldn't have been. -- Steven From mikegraham at gmail.com Thu Sep 27 06:10:34 2012 From: mikegraham at gmail.com (Mike Graham) Date: Thu, 27 Sep 2012 00:10:34 -0400 Subject: [Python-ideas] Deprecate the round builtin In-Reply-To: <5063B6CD.4030405@pearwood.info> References: <5063B6CD.4030405@pearwood.info> Message-ID: On Wed, Sep 26, 2012 at 10:15 PM, Steven D'Aprano wrote: > I don't believe that round gets misused and causes confusion to any > significant > degree. That belief is based on many years experience on two high-traffic > mailing > lists with many beginners. I've seen plenty of examples of beginners > confused that > Python can't add floats correctly, but if I've ever seen somebody confused > by round, > it was so uncommon and so long ago I've forgotten it. > > It seems to me that this proposal is based on a purely theoretical fear that > some > people manage to escape being confused by the far more obvious floating > point > gotchas[1] but can't understand round.
There are much more common, and > bigger, > surprises with binary floats than round, and it seems to me that > depreciating it > just disrupts those who do use it for no real benefit. I don't have a "purely theoretical fear" regarding round; I simply have had a different experience and analysis than you. I have long been an active member of the official Python IRC channel and other Python IRC channels and of various other support communities (I beat StackOverflow before they added the new levels :) ). I have seen many, many people have questions about round and they almost always don't want to be using it. Most often, people really want to be using string formatting, but there are all sorts of . I posted here today after yet another person came into #python with a round question where they didn't want round. My problem with round isn't that new people don't understand it--it's that there's nothing worth understanding. (With floats, the situation merely requires education then people can use floats well. With round, when you learn you stop using it.) round(x, n) for n>0 is quite simply not sane code. rounding is an exact operation and it does not make any sense to do base 10 rounding on base 2 numbers in this way. round for n <= 0 is not so criminal, but it sometimes-inconveniently returns a float and it's trivially written shorter/as-easily for n=0 and seldom actually needed for n<0. Mike From greg.ewing at canterbury.ac.nz Thu Sep 27 12:23:20 2012 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Thu, 27 Sep 2012 22:23:20 +1200 Subject: [Python-ideas] Deprecate the round builtin In-Reply-To: References: <50635956.4050409@egenix.com> <20120926212127.GA9680@iskra.aviel.ru> Message-ID: <50642918.1060804@canterbury.ac.nz> Mark Adam wrote: > Modules reside on file and are changeable, Namespaces solely in > memory, defined by the Interpreter itself. 
Presumably they would be implemented as module objects, created automatically at interpreter startup instead of being loaded from a file. In which case "built-in module" might be a better term for them. And their names should start with lower case. Also you wouldn't need new syntax to get names out of them, just the existing import machinery: from numbers import * -- Greg From jimjjewett at gmail.com Thu Sep 27 21:40:27 2012 From: jimjjewett at gmail.com (Jim Jewett) Date: Thu, 27 Sep 2012 15:40:27 -0400 Subject: [Python-ideas] file API In-Reply-To: References: Message-ID: On 9/24/12, Mark Adam wrote: > For some time now, I've wanted to suggest a better abstraction for the <file> > type in Python. It currently uses an antiquated, low-level > C-style interface for moving around in a file, with methods like > tell() and seek(). I agree, but I'm not sure the improvement can be *enough* of an improvement to justify the cost of change. >>>> file.pos = 0x0ae1 #move file pointer to an absolute address >>>> file.pos += 1 #increment the file pointer one byte For text files, I would expect it to be a character count rather than a byte count. So this particular proposal might end up adding as much confusion as it hopes to remove. -jJ From guido at python.org Thu Sep 27 22:00:34 2012 From: guido at python.org (Guido van Rossum) Date: Thu, 27 Sep 2012 13:00:34 -0700 Subject: [Python-ideas] file API In-Reply-To: References: Message-ID: Also you can't express lseek()'s "relative to end of file" mode using the proposed API. -1 on the whole thing. On Thu, Sep 27, 2012 at 12:40 PM, Jim Jewett wrote: > On 9/24/12, Mark Adam wrote: >> For some time now, I've wanted to suggest a better abstraction for the <file> >> type in Python. It currently uses an antiquated, low-level >> C-style interface for moving around in a file, with methods like >> tell() and seek(). > > I agree, but I'm not sure the improvement can be *enough* of an > improvement to justify the cost of change.
> >>>>> file.pos = 0x0ae1 #move file pointer to an absolute address >>>>> file.pos += 1 #increment the file pointer one byte > > For text files, I would expect it to be a character count rather than > a byte count. So this particular proposal might end up adding as much > confusion as it hopes to remove. > > -jJ > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas -- --Guido van Rossum (python.org/~guido) From python at mrabarnett.plus.com Thu Sep 27 22:07:55 2012 From: python at mrabarnett.plus.com (MRAB) Date: Thu, 27 Sep 2012 21:07:55 +0100 Subject: [Python-ideas] file API In-Reply-To: References: Message-ID: <5064B21B.7050406@mrabarnett.plus.com> On 2012-09-27 20:40, Jim Jewett wrote: > On 9/24/12, Mark Adam wrote: >> For some time now, I've wanted to suggest a better abstraction for the <file> >> type in Python. It currently uses an antiquated, low-level >> C-style interface for moving around in a file, with methods like >> tell() and seek(). > > I agree, but I'm not sure the improvement can be *enough* of an > improvement to justify the cost of change. > >>>>> file.pos = 0x0ae1 #move file pointer to an absolute address >>>>> file.pos += 1 #increment the file pointer one byte > > For text files, I would expect it to be a character count rather than > a byte count. So this particular proposal might end up adding as much > confusion as it hopes to remove. > In the talk about how to seek to the end of the file with file.pos, it was suggested that negative positions and None could be used. I wonder whether they could be used with seek. For example: file.seek(-10) # Seek 10 bytes from the end. file.seek(None) # Seek to the end.
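(For comparison, both of those operations can already be spelled today with seek()'s whence argument -- a runnable sketch using an in-memory stream as a stand-in for a binary file:)

```python
import io
import os

f = io.BytesIO(b"0123456789" * 10)  # stand-in for a 100-byte binary file

f.seek(-10, os.SEEK_END)   # today's spelling of "seek 10 bytes from the end"
end_minus_10 = f.tell()

f.seek(0, os.SEEK_END)     # today's spelling of "seek to the end"
at_end = f.tell()
```

The whence constants (os.SEEK_SET, os.SEEK_CUR, os.SEEK_END) are also exported by the io module.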
From pjenvey at underboss.org Thu Sep 27 22:59:08 2012 From: pjenvey at underboss.org (Philip Jenvey) Date: Thu, 27 Sep 2012 13:59:08 -0700 Subject: [Python-ideas] file API In-Reply-To: <5064B21B.7050406@mrabarnett.plus.com> References: <5064B21B.7050406@mrabarnett.plus.com> Message-ID: On Sep 27, 2012, at 1:07 PM, MRAB wrote: > On 2012-09-27 20:40, Jim Jewett wrote: >> On 9/24/12, Mark Adam wrote: >>> For some time now, I've wanted to suggest a better abstraction for the <file> >>> type in Python. It currently uses an antiquated, low-level >>> C-style interface for moving around in a file, with methods like >>> tell() and seek(). >> >> I agree, but I'm not sure the improvement can be *enough* of an >> improvement to justify the cost of change. >> >>>>>> file.pos = 0x0ae1 #move file pointer to an absolute address >>>>>> file.pos += 1 #increment the file pointer one byte >> >> For text files, I would expect it to be a character count rather than >> a byte count. So this particular proposal might end up adding as much >> confusion as it hopes to remove. >> > In the talk about how to seek to the end of the file with file.pos, it > was suggested that negative positions and None could be used. > > I wonder whether they could be used with seek. For example: > > file.seek(-10) # Seek 10 bytes from the end. > file.seek(None) # Seek to the end. file.seek(0, os.SEEK_END) is a lot clearer than file.seek(None). -- Philip Jenvey From jeanpierreda at gmail.com Thu Sep 27 23:39:30 2012 From: jeanpierreda at gmail.com (Devin Jeanpierre) Date: Thu, 27 Sep 2012 17:39:30 -0400 Subject: [Python-ideas] file API In-Reply-To: References: <5064B21B.7050406@mrabarnett.plus.com> Message-ID: On Thu, Sep 27, 2012 at 4:59 PM, Philip Jenvey wrote: > file.seek(0, os.SEEK_END) is a lot clearer than file.seek(None). Could always introduce new constants.
file.seek(os.END_OF_FILE) -- Devin From ironfroggy at gmail.com Thu Sep 27 23:45:11 2012 From: ironfroggy at gmail.com (Calvin Spealman) Date: Thu, 27 Sep 2012 17:45:11 -0400 Subject: [Python-ideas] file API In-Reply-To: References: Message-ID: On Thu, Sep 27, 2012 at 4:00 PM, Guido van Rossum wrote: > Also you can't express lseek()'s "relative to end of file" mode using > the proposed API. -1 on the whole thing. You could use negative indexes, which is consistent with subscript and slice interfaces. I still don't know that this is a good idea, but I'm just saying. If someone wants a more sequence-like interface to files, they should use mmap. > On Thu, Sep 27, 2012 at 12:40 PM, Jim Jewett wrote: >> On 9/24/12, Mark Adam wrote: >>> For some time now, I've wanted to suggest a better abstraction for the <file> >>> type in Python. It currently uses an antiquated, low-level >>> C-style interface for moving around in a file, with methods like >>> tell() and seek(). >> >> I agree, but I'm not sure the improvement can be *enough* of an >> improvement to justify the cost of change. >> >>>>>> file.pos = 0x0ae1 #move file pointer to an absolute address >>>>>> file.pos += 1 #increment the file pointer one byte >> >> For text files, I would expect it to be a character count rather than >> a byte count. So this particular proposal might end up adding as much >> confusion as it hopes to remove. >> >> -jJ >> _______________________________________________ >> Python-ideas mailing list >> Python-ideas at python.org >> http://mail.python.org/mailman/listinfo/python-ideas > > > > -- > --Guido van Rossum (python.org/~guido) > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas -- Read my blog! I depend on your acceptance of my opinion! I am interesting!
http://techblog.ironfroggy.com/ Follow me if you're into that sort of thing: http://www.twitter.com/ironfroggy From steve at pearwood.info Fri Sep 28 01:56:35 2012 From: steve at pearwood.info (Steven D'Aprano) Date: Fri, 28 Sep 2012 09:56:35 +1000 Subject: [Python-ideas] file API In-Reply-To: References: Message-ID: <5064E7B3.4020407@pearwood.info> On 28/09/12 06:00, Guido van Rossum wrote: > Also you can't express lseek()'s "relative to end of file" mode using > the proposed API. -1 on the whole thing. For what it's worth, there was extensive discussion on comp.lang.python that eventually decided that while you could express all the various invocations of seek using file.pos, at best you save two characters of typing and the whole thing isn't worth the change. http://mail.python.org/pipermail/python-list/2012-September/thread.html#631501 Personally, I think the proposal has died a natural death, but if anyone wants to resuscitate it, I encourage them to read the above thread before doing so. -- Steven From greg.ewing at canterbury.ac.nz Fri Sep 28 04:07:25 2012 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Fri, 28 Sep 2012 14:07:25 +1200 Subject: [Python-ideas] file API In-Reply-To: References: Message-ID: <5065065D.1000801@canterbury.ac.nz> Jim Jewett wrote: > For text files, I would expect it to be a character count rather than > a byte count. So this particular proposal might end up adding as much > confusion as it hopes to remove. I'm given to understand that the file positions used by the C standard library are supposed to be treated as opaque tokens -- you're not guaranteed to be able to perform arithmetic on them. 
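Python's text layer follows the same contract: tell() on a text-mode file returns an opaque cookie rather than a character index, and only values obtained from tell() are safe to pass back to seek(). A small demonstration (assuming a writable temp directory):

```python
import os
import tempfile

# Write one multi-byte character followed by an ASCII one.
fd, path = tempfile.mkstemp()
os.close(fd)
with open(path, "w", encoding="utf-8") as f:
    f.write("\u00e9x")          # 'e-acute' is two bytes in UTF-8
with open(path, encoding="utf-8") as f:
    first = f.read(1)           # one *character* read...
    cookie = f.tell()           # ...but the cookie is not the character count 1
    f.seek(cookie)              # only tell() results are valid arguments here
    rest = f.read()
os.unlink(path)
```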
-- Greg From python at mrabarnett.plus.com Fri Sep 28 04:43:15 2012 From: python at mrabarnett.plus.com (MRAB) Date: Fri, 28 Sep 2012 03:43:15 +0100 Subject: [Python-ideas] file API In-Reply-To: <5065065D.1000801@canterbury.ac.nz> References: <5065065D.1000801@canterbury.ac.nz> Message-ID: <50650EC3.6020103@mrabarnett.plus.com> On 2012-09-28 03:07, Greg Ewing wrote: > Jim Jewett wrote: > >> For text files, I would expect it to be a character count rather than >> a byte count. So this particular proposal might end up adding as much >> confusion as it hopes to remove. > > I'm given to understand that the file positions used by > the C standard library are supposed to be treated as > opaque tokens -- you're not guaranteed to be able to > perform arithmetic on them. > Yet you're allowed to do relative seeks? Does that mean that the file position basically works with some undefined units (bytes, characters, whatever)? From amauryfa at gmail.com Fri Sep 28 13:39:32 2012 From: amauryfa at gmail.com (Amaury Forgeot d'Arc) Date: Fri, 28 Sep 2012 13:39:32 +0200 Subject: [Python-ideas] file API In-Reply-To: <50650EC3.6020103@mrabarnett.plus.com> References: <5065065D.1000801@canterbury.ac.nz> <50650EC3.6020103@mrabarnett.plus.com> Message-ID: 2012/9/28 MRAB : > On 2012-09-28 03:07, Greg Ewing wrote: >> >> Jim Jewett wrote: >> >>> For text files, I would expect it to be a character count rather than >>> a byte count. So this particular proposal might end up adding as much >>> confusion as it hopes to remove. >> >> >> I'm given to understand that the file positions used by >> the C standard library are supposed to be treated as >> opaque tokens -- you're not guaranteed to be able to >> perform arithmetic on them. >> > Yet you're allowed to do relative seeks? Does that mean that the file > position basically works with some undefined units (bytes, characters, > whatever)? 
See the documentation: http://docs.python.org/library/io.html#io.TextIOBase.seek With text streams, SEEK_CUR and SEEK_END only accept offset=0 (i.e. no move, or go to EOF) and SEEK_SET accepts a "cookie" which was returned by a previous tell(). This cookie will often look like the absolute file position, but it also has to contain the codec status, which will be nontrivial for variable-length encodings. -- Amaury Forgeot d'Arc From greg at krypto.org Sat Sep 29 21:51:47 2012 From: greg at krypto.org (Gregory P. Smith) Date: Sat, 29 Sep 2012 12:51:47 -0700 Subject: [Python-ideas] make decimal the default non-integer instead of float? Message-ID: -cc: python-dev +cc: python-ideas On Sat, Sep 29, 2012 at 11:39 AM, Chris Angelico wrote: > On Sun, Sep 30, 2012 at 4:26 AM, Brett Cannon wrote: > > Does this mean we want to re-open the discussion about decimal constants? > > Last time this came up I think we decided that we wanted to wait for > > cdecimal (which is obviously here) and work out how to handle contexts, > the > > syntax, etc. > > Just to throw a crazy idea out: How bad a change would it be to make > decimal actually the default? > > (Caveat: I've not worked with decimal/cdecimal to any real extent and > don't know its limitations etc.) > Painful for existing code, unittests and extension modules. Definitely python-ideas territory (thread moved there with an appropriate subject). I'm not surprised at all that a decimal type can be "fast" in an interpreted language due to the already dominant interpreter overhead. I wish all spreadsheets had used decimals from day one rather than binary floating point (blame Lotus?). Think of the trouble that would have saved the world. -gps -------------- next part -------------- An HTML attachment was scrubbed...
URL: From ironfroggy at gmail.com Sat Sep 29 22:34:53 2012 From: ironfroggy at gmail.com (Calvin Spealman) Date: Sat, 29 Sep 2012 16:34:53 -0400 Subject: [Python-ideas] make decimal the default non-integer instead of float? In-Reply-To: References: Message-ID: I like the idea a lot, but I recognize it will get a lot of pushback. I think learning Integer -> Decimal -> Float is a lot more natural than learning Integer -> Float -> Decimal. The Float type represents a specific hardware acceleration with data-loss tradeoffs, and the use should be explicit. I think that as someone learns, the limitations of Decimals will make a lot more sense than those of Floats. +1 On Sat, Sep 29, 2012 at 3:51 PM, Gregory P. Smith wrote: > -cc: python-dev > +cc: python-ideas > > On Sat, Sep 29, 2012 at 11:39 AM, Chris Angelico wrote: >> >> On Sun, Sep 30, 2012 at 4:26 AM, Brett Cannon wrote: >> > Does this mean we want to re-open the discussion about decimal >> > constants? >> > Last time this came up I think we decided that we wanted to wait for >> > cdecimal (which is obviously here) and work out how to handle contexts, >> > the >> > syntax, etc. >> >> Just to throw a crazy idea out: How bad a change would it be to make >> decimal actually the default? >> >> (Caveat: I've not worked with decimal/cdecimal to any real extent and >> don't know its limitations etc.) > > > Painful for existing code, unittests and extension modules. Definitely > python-ideas territory (thread moved there with an appropriate subject). > > I'm not surprised at all that a decimal type can be "fast" in an interpreted > language due to the already dominant interpreter overhead. > > I wish all spreadsheets had used decimals from day one rather than binary > floating point (blame Lotus?). Think of the trouble that would have saved > the world.
> > -gps > > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > -- Read my blog! I depend on your acceptance of my opinion! I am interesting! http://techblog.ironfroggy.com/ Follow me if you're into that sort of thing: http://www.twitter.com/ironfroggy From guido at python.org Sat Sep 29 23:06:44 2012 From: guido at python.org (Guido van Rossum) Date: Sat, 29 Sep 2012 14:06:44 -0700 Subject: [Python-ideas] make decimal the default non-integer instead of float? In-Reply-To: References: Message-ID: On Sat, Sep 29, 2012 at 1:34 PM, Calvin Spealman wrote: > I like the idea a lot, but I recognize it will get a lot pushback. I > think learning Integer -> Decimal -> Float is a lot more natural than > learning Integer -> Float -> Decimal. The Float type represents a > specific hardware accelleration with data-loss tradeoffs, and the use > should be explicit. I think that as someone learns, the limitations of > Decimals will make a lot more sense than those of Floats. Hm. Remember decimals have data loss too: they can't represent 1/3 any more accurately than floats can, and like floats they are limited to a certain number of digits after which they begin dropping precision even if the result *can* be represented exactly. It's just that they can represent 1/5 exactly, which happens to be culturally important to humans, and that the number of digits at which loss of precision happens is configurable. (And the API to configure it may actually make it more complex to learn.) -- --Guido van Rossum (python.org/~guido) From stefan_ml at behnel.de Sun Sep 30 07:30:32 2012 From: stefan_ml at behnel.de (Stefan Behnel) Date: Sun, 30 Sep 2012 07:30:32 +0200 Subject: [Python-ideas] make decimal the default non-integer instead of float? 
In-Reply-To: References: Message-ID: Guido van Rossum, 29.09.2012 23:06: > On Sat, Sep 29, 2012 at 1:34 PM, Calvin Spealman wrote: >> I like the idea a lot, but I recognize it will get a lot of pushback. I >> think learning Integer -> Decimal -> Float is a lot more natural than >> learning Integer -> Float -> Decimal. The Float type represents a >> specific hardware acceleration with data-loss tradeoffs, and the use >> should be explicit. I think that as someone learns, the limitations of >> Decimals will make a lot more sense than those of Floats. > > Hm. Remember decimals have data loss too: they can't represent 1/3 any > more accurately than floats can That would be "fractions" territory. Given that all three have their own area where they shine and a large enough area where they really don't, I can't see a strong enough argument for making any of the three "the default". Also note that current float is a very C friendly thing, whereas the other two are far from it. Stefan anecdotal PS: I recently got caught in a discussion about the impressive ugliness of decimals in Java, the unsuitability of float in a financial context and the lack of a better alternative in the programming languages that most people use in that area. I got a couple of people surprised by the fact that Python has fractions right in its standard library. From oscar.j.benjamin at gmail.com Sun Sep 30 16:00:23 2012 From: oscar.j.benjamin at gmail.com (Oscar Benjamin) Date: Sun, 30 Sep 2012 15:00:23 +0100 Subject: [Python-ideas] Visually confusable unicode characters in identifiers Message-ID: Having just discovered that PEP 3131 [1] enables me to use Greek letters to represent variables in equations, it was pointed out to me that it also allows visually confusable characters in identifiers [2]. When I previously read the PEP I thought that the normalisation process resolved these issues but now I see that the PEP leaves it as an open problem.
I also previously thought that the PEP would be irrelevant if I was using ASCII-only code but now I can see that if a GREEK CAPITAL LETTER ALPHA can sneak into my code (just like those pesky tab characters) I could still have a visually undetectable bug. An example to show how an issue could arise:

"""
#!/usr/bin/env python3
code = '''
{0} = 123
{1} = 456
print('"{0}" == "{1}":', "{0}" == "{1}")
print('{0} == {1}:', {0} == {1})
'''

def test_identifier(identifier1, identifier2):
    exec(code.format(identifier1, identifier2))

test_identifier('\u212b', '\u00c5') # Different Angstrom code points
test_identifier('A', '\u0391')      # LATIN/GREEK CAPITAL A/ALPHA
"""

When I run this I get:

$ ./test.py
"Å" == "Å": False
Å == Å: True
"A" == "Α": False
A == Α: False

Is the proposal mentioned in the PEP (to use something based on Unicode Technical Standard #39 [3]) something that might be implemented at any point? Oscar References: [1] http://www.python.org/dev/peps/pep-3131/#open-issues [2] http://article.gmane.org/gmane.comp.python.tutor/78116 [3] http://unicode.org/reports/tr39/#Confusable_Detection -------------- next part -------------- An HTML attachment was scrubbed... URL: From steve at pearwood.info Sun Sep 30 17:10:18 2012 From: steve at pearwood.info (Steven D'Aprano) Date: Mon, 01 Oct 2012 01:10:18 +1000 Subject: [Python-ideas] Visually confusable unicode characters in identifiers In-Reply-To: References: Message-ID: <506860DA.7060905@pearwood.info> On 01/10/12 00:00, Oscar Benjamin wrote: > Having just discovered that PEP 3131 [1] enables me to use Greek letters to > represent variables in equations, it was pointed out to me that it also > allows visually confusable characters in identifiers [2]. You don't need PEP 3131 to have visually confusable identifiers.
MyObject = My0bject = "many fonts use the same glyph for O and 0"
rn = m = 23  # try reading this in Arial with a small font size
x += l

I don't think it's up to Python to protect you from arbitrarily poor choices in identifiers and typefaces, or against obfuscated code (whether deliberately so or by accident). Use of confusable identifiers is a code-quality issue, little different from any other code-quality issue:

class myfunction:
    def __init__(a, b, c, d, e, f, g, h, i, j, k, l):
        a.b = b-e+k*h
        a.a = i + 1j*j
        a.l = ll + l1 + l
        a.somebodytoldmeishouldusemoredesccriptivevaraiblenames = g+d
        a.somebodytoldmeishouldusemoredesccribtivevaraiblenames = c+f

You surely wouldn't expect Python to protect you from ignorant or obnoxious programmers who wrote code like that. I likewise don't think Python should protect you from programmers who do things like this:

py> A = 42
py> Α = 23
py> A == Α
False

Besides, just because you and I can't distinguish A from Α in my editor, using one particular choice of font, doesn't mean that the author or his intended audience (Greek programmers perhaps?) can't distinguish them, using their editor and a more suitable typeface. The two characters are distinct using Courier or Lucida Typewriter, to mention only two. > Is the proposal mentioned in the PEP (to use something based on Unicode > Technical Standard #39 [3]) something that might be implemented at any > point? > [3] http://unicode.org/reports/tr39/#Confusable_Detection I would welcome "confusable detection" in the standard library, possibly a string method "skeleton" or some other interface to the Confusables file, perhaps in unicodedata. And I would encourage code checkers like PyFlakes, PyLint, PyChecker to check for confusable identifiers. But I do not believe that this should be built into the Python language itself.
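A first approximation is already possible with NFKC normalization -- the same folding the compiler applies to identifiers -- although a real skeleton() would have to consult the TR39 Confusables data (the helper below is hypothetical, not an existing API):

```python
import unicodedata

def possibly_confusable(name1, name2):
    # Hypothetical helper: flags identifiers that are distinct as
    # strings but collide after NFKC normalization, the folding that
    # Python itself applies to identifiers.  A real implementation
    # would consult the Unicode TR39 confusables table instead.
    if name1 == name2:
        return False
    return (unicodedata.normalize("NFKC", name1)
            == unicodedata.normalize("NFKC", name2))

# U+212B ANGSTROM SIGN normalizes to U+00C5, so these collide:
angstroms = possibly_confusable("\u212b", "\u00c5")
# GREEK CAPITAL ALPHA does not normalize to LATIN A, so NFKC
# alone misses exactly the pair Oscar's example demonstrates:
alpha = possibly_confusable("A", "\u0391")
```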
-- Steven From storchaka at gmail.com Sun Sep 30 18:35:39 2012 From: storchaka at gmail.com (Serhiy Storchaka) Date: Sun, 30 Sep 2012 19:35:39 +0300 Subject: [Python-ideas] make decimal the default non-integer instead of float? In-Reply-To: References: Message-ID: Instructive story about fractions: http://python-history.blogspot.com/2009/03/problem-with-integer-division.html . Let's not fall into the same trap. From greg at krypto.org Sun Sep 30 23:38:33 2012 From: greg at krypto.org (Gregory P. Smith) Date: Sun, 30 Sep 2012 14:38:33 -0700 Subject: [Python-ideas] Deprecate the round builtin In-Reply-To: <50642918.1060804@canterbury.ac.nz> References: <50635956.4050409@egenix.com> <20120926212127.GA9680@iskra.aviel.ru> <50642918.1060804@canterbury.ac.nz> Message-ID: Why suggest adding new round-like functions to the math module rather than defining a new round method on all numerical objects? -gps -------------- next part -------------- An HTML attachment was scrubbed... URL: From joshua.landau.ws at gmail.com Sun Sep 30 23:48:21 2012 From: joshua.landau.ws at gmail.com (Joshua Landau) Date: Sun, 30 Sep 2012 22:48:21 +0100 Subject: [Python-ideas] Deprecate the round builtin In-Reply-To: References: <50635956.4050409@egenix.com> <20120926212127.GA9680@iskra.aviel.ru> <50642918.1060804@canterbury.ac.nz> Message-ID: On 30 September 2012 22:38, Gregory P. Smith wrote: > Why suggest adding new round-like functions to the math module rather than > defining a new round method on all numerical objects?

>>> round("")
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
TypeError: type str doesn't define __round__ method

It seems it is done like that already ;) This seems like a problem for the proposal, though: we can't have it in the math library if it's a method! -------------- next part -------------- An HTML attachment was scrubbed...
URL: From joshua.landau.ws at gmail.com Sun Sep 30 23:51:30 2012 From: joshua.landau.ws at gmail.com (Joshua Landau) Date: Sun, 30 Sep 2012 22:51:30 +0100 Subject: [Python-ideas] Deprecate the round builtin In-Reply-To: References: <50635956.4050409@egenix.com> <20120926212127.GA9680@iskra.aviel.ru> <50642918.1060804@canterbury.ac.nz> Message-ID: On 30 September 2012 22:48, Joshua Landau wrote: > This seems like a problem for the proposal, though: we can't have it in > the math library if it's a method! > Now I think about it: yeah, it can be. We just coerce to float/decimal first. *sigh* -------------- next part -------------- An HTML attachment was scrubbed... URL:
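For what it's worth, the traceback quoted above is the __round__ protocol at work: round() dispatches to type(x).__round__, so any numeric type can opt in without anything being added to math (a minimal sketch with a toy type):

```python
class Fixed:
    """Toy fixed-point value storing hundredths as an int."""

    def __init__(self, hundredths):
        self.hundredths = hundredths

    def __round__(self, ndigits=None):
        # round(x) calls this with no ndigits; round(x, n) passes n.
        if ndigits is None:
            # Delegate to float's round-half-to-even behaviour.
            return round(self.hundredths / 100)
        return Fixed(self.hundredths)  # already exact to two places

two = round(Fixed(250))    # 2.50 rounds to 2 (ties go to even)
four = round(Fixed(375))   # 3.75 rounds to 4
```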