From eric at trueblade.com Wed May 2 00:00:28 2012 From: eric at trueblade.com (Eric V. Smith) Date: Tue, 01 May 2012 18:00:28 -0400 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages In-Reply-To: <4F90730D.1040808@trueblade.com> References: <4F90730D.1040808@trueblade.com> Message-ID: <4FA05CFC.6050609@trueblade.com> I'm working on finishing up the PEP 420 work. I think the PEP itself is complete. If you have any comments, please send them to me or this list. The implementation at features/pep-420 has been merged with the recent importlib changes to the 3.3 branch. I've implemented support in the import machinery itself, as well as modified the filesystem finder (FileFinder) and the zipimport finder. About the only question I have is: Is everyone okay with the changes to the finders, described in the PEP? Basically they now return a string in addition to a loader or None. If they return a string, then the string represents the path of a possible namespace package portion. The change is backward compatible: unmodified finders will just be unable to participate in a namespace package. Barry Warsaw, Jason Coombs, and I are sprinting this Thursday. We'll focus on adding tests, and maybe documentation if we have time. If anyone has any concerns I'd like to hear them before then so that we can work on addressing them. The changes themselves are very small. I think the diff is a total of maybe 40 lines of code. Yury Selivanov had mentioned backporting to 3.2 (which I assume would be an unsupported-by-python-dev effort). I actually don't think it would be all that complicated. Eric. From brett at python.org Wed May 2 04:22:03 2012 From: brett at python.org (Brett Cannon) Date: Tue, 1 May 2012 22:22:03 -0400 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages In-Reply-To: <4FA05CFC.6050609@trueblade.com> References: <4F90730D.1040808@trueblade.com> <4FA05CFC.6050609@trueblade.com> Message-ID: On Tue, May 1, 2012 at 6:00 PM, Eric V. Smith wrote: > I'm working on finishing up the PEP 420 work. I think the PEP itself is > complete. If you have any comments, please send them to me or this list. > > The implementation at features/pep-420 has been merged with the recent > importlib changes to the 3.3 branch. I've implemented support in the > import machinery itself, as well as modified the filesystem finder > (FileFinder) and the zipimport finder. > > About the only question I have is: Is everyone okay with the changes to > the finders, described in the PEP? Basically they now return a string in > addition to a loader or None. If they return a string, then the string > represents the path of a possible namespace package portion. The change > is backward compatible: unmodified finders will just be unable to > participate in a namespace package. > I obviously okay with the change. =) So this email is just a +1 in support of this work and a thanks for coding it up and seeing this through! -Brett > > Barry Warsaw, Jason Coombs, and I are sprinting this Thursday. We'll > focus on adding tests, and maybe documentation if we have time. If > anyone has any concerns I'd like to hear them before then so that we can > work on addressing them. > > The changes themselves are very small. I think the diff is a total of > maybe 40 lines of code. Yury Selivanov had mentioned backporting to 3.2 > (which I assume would be an unsupported-by-python-dev effort). I > actually don't think it would be all that complicated. > Ignoring that the classes he would need to access are technically private, backporting should be no more than a subclass and an extra stat call by FileFinder if None is returned. -Brett > > Eric. > > > _______________________________________________ > Import-SIG mailing list > Import-SIG at python.org > http://mail.python.org/mailman/listinfo/import-sig > -------------- next part -------------- An HTML attachment was scrubbed... URL: From martin at v.loewis.de Wed May 2 09:17:00 2012 From: martin at v.loewis.de (=?ISO-8859-1?Q?=22Martin_v=2E_L=F6wis=22?=) Date: Wed, 02 May 2012 09:17:00 +0200 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages In-Reply-To: <4FA05CFC.6050609@trueblade.com> References: <4F90730D.1040808@trueblade.com> <4FA05CFC.6050609@trueblade.com> Message-ID: <4FA0DF6C.4090709@v.loewis.de> > About the only question I have is: Is everyone okay with the changes to > the finders, described in the PEP? It looks good to me. It's a somewhat surprising change, but I can see no flaw in it. Regards, Martin From eric at trueblade.com Wed May 2 12:23:17 2012 From: eric at trueblade.com (Eric V. Smith) Date: Wed, 02 May 2012 06:23:17 -0400 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages In-Reply-To: <4FA0DF6C.4090709@v.loewis.de> References: <4F90730D.1040808@trueblade.com> <4FA05CFC.6050609@trueblade.com> <4FA0DF6C.4090709@v.loewis.de> Message-ID: <4FA10B15.1000302@trueblade.com> On 5/2/2012 3:17 AM, "Martin v. L?wis" wrote: >> About the only question I have is: Is everyone okay with the changes to >> the finders, described in the PEP? > > It looks good to me. It's a somewhat surprising change, but I can see no > flaw in it. Surprising in that any change to find_module is needed, or surprising that it now returns one of {None, loader, str}? If it's the latter: yeah, it's a little strange. But find_module knows something that the caller needs to be told. It seemed easiest to add another possible return type. Any other suggestions? Eric. From pje at telecommunity.com Wed May 2 19:06:27 2012 From: pje at telecommunity.com (PJ Eby) Date: Wed, 2 May 2012 13:06:27 -0400 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages In-Reply-To: <4FA10B15.1000302@trueblade.com> References: <4F90730D.1040808@trueblade.com> <4FA05CFC.6050609@trueblade.com> <4FA0DF6C.4090709@v.loewis.de> <4FA10B15.1000302@trueblade.com> Message-ID: On Wed, May 2, 2012 at 6:23 AM, Eric V. Smith wrote: > If it's the latter: yeah, it's a little strange. But find_module knows > something that the caller needs to be told. It seemed easiest to add > another possible return type. Any other suggestions? > It seems quite elegant to me. I do see one point of concern with the spec, though. At one point it says that finders must return a path without a trailing separator, but at another it says the package __file__ will contain a separator. This strikes me as inconsistent, and also incompatible with non-filesystem-based finder implementations. The import machinery *must not* assume that import path strings are filenames, so it is wrong for the import machinery to add a path separator that the finder did not include. IOW, I don't think the spec can assume or guarantee anything about the strings returned by finders: it MUST treat them as opaque strings. If this means that there can't be any meaningful __file__ for a namespace package, I think we will have to live with that. The only alternative I see is to delegate the string manipulation back to the finders, or to change the return value from a string to a (file, path) tuple, wherein 'file' is the value to be used as __file__, and 'path' is the value to be used in __path__. -------------- next part -------------- An HTML attachment was scrubbed... URL: From eric at trueblade.com Wed May 2 19:24:21 2012 From: eric at trueblade.com (Eric V. Smith) Date: Wed, 02 May 2012 13:24:21 -0400 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages In-Reply-To: References: <4F90730D.1040808@trueblade.com> <4FA05CFC.6050609@trueblade.com> <4FA0DF6C.4090709@v.loewis.de> <4FA10B15.1000302@trueblade.com> Message-ID: <4FA16DC5.1000204@trueblade.com> On 05/02/2012 01:06 PM, PJ Eby wrote: > I do see one point of concern with the spec, though. At one point it > says that finders must return a path without a trailing separator, but > at another it says the package __file__ will contain a separator. > > This strikes me as inconsistent, and also incompatible with > non-filesystem-based finder implementations. The import machinery *must > not* assume that import path strings are filenames, so it is wrong for > the import machinery to add a path separator that the finder did not > include. > > IOW, I don't think the spec can assume or guarantee anything about the > strings returned by finders: it MUST treat them as opaque strings. If > this means that there can't be any meaningful __file__ for a namespace > package, I think we will have to live with that. I've come to the same conclusion myself. I actually had a draft of the PEP that removed the word "directory", at which point it becomes obvious that you're adding a path separator to something that might not be a path name. > The only alternative I see is to delegate the string manipulation back > to the finders, or to change the return value from a string to a (file, > path) tuple, wherein 'file' is the value to be used as __file__, and > 'path' is the value to be used in __path__. I don't see the value of __file__ at all in the case of namespace packages. If it's just a hint that it's a namespace package, I think it would be better to set __file__ to None. That would noisily break some code that isn't likely to work anyway. Eric. From brett at python.org Wed May 2 19:53:44 2012 From: brett at python.org (Brett Cannon) Date: Wed, 2 May 2012 13:53:44 -0400 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages In-Reply-To: <4FA16DC5.1000204@trueblade.com> References: <4F90730D.1040808@trueblade.com> <4FA05CFC.6050609@trueblade.com> <4FA0DF6C.4090709@v.loewis.de> <4FA10B15.1000302@trueblade.com> <4FA16DC5.1000204@trueblade.com> Message-ID: On Wed, May 2, 2012 at 1:24 PM, Eric V. Smith wrote: > On 05/02/2012 01:06 PM, PJ Eby wrote: > > > I do see one point of concern with the spec, though. At one point it > > says that finders must return a path without a trailing separator, but > > at another it says the package __file__ will contain a separator. > > > > This strikes me as inconsistent, and also incompatible with > > non-filesystem-based finder implementations. The import machinery *must > > not* assume that import path strings are filenames, so it is wrong for > > the import machinery to add a path separator that the finder did not > > include. > > > > IOW, I don't think the spec can assume or guarantee anything about the > > strings returned by finders: it MUST treat them as opaque strings. If > > this means that there can't be any meaningful __file__ for a namespace > > package, I think we will have to live with that. > > I've come to the same conclusion myself. I actually had a draft of the > PEP that removed the word "directory", at which point it becomes obvious > that you're adding a path separator to something that might not be a > path name. > > > The only alternative I see is to delegate the string manipulation back > > to the finders, or to change the return value from a string to a (file, > > path) tuple, wherein 'file' is the value to be used as __file__, and > > 'path' is the value to be used in __path__. > > I don't see the value of __file__ at all in the case of namespace > packages. If it's just a hint that it's a namespace package, I think it > would be better to set __file__ to None. That would noisily break some > code that isn't likely to work anyway. Problem is that None for __file__ would be a unique use here. Frozen modules, for instance, typically say "" for __file__. Now part of the reason (I suspect) this is done is that this was the only way to tell how the module was created, but with __loader__ now on all modules this is redundant. So perhaps this fake value for __file__ is just outdated and not worth perpetuating? I vote for using __file__ as None as suggested and having people infer how the module was created from __loader__. -------------- next part -------------- An HTML attachment was scrubbed... URL: From martin at v.loewis.de Wed May 2 20:32:09 2012 From: martin at v.loewis.de (=?ISO-8859-1?Q?=22Martin_v=2E_L=F6wis=22?=) Date: Wed, 02 May 2012 20:32:09 +0200 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages In-Reply-To: <4FA10B15.1000302@trueblade.com> References: <4F90730D.1040808@trueblade.com> <4FA05CFC.6050609@trueblade.com> <4FA0DF6C.4090709@v.loewis.de> <4FA10B15.1000302@trueblade.com> Message-ID: <4FA17DA9.1070207@v.loewis.de> On 02.05.2012 12:23, Eric V. Smith wrote: > On 5/2/2012 3:17 AM, "Martin v. L?wis" wrote: >>> About the only question I have is: Is everyone okay with the changes to >>> the finders, described in the PEP? >> >> It looks good to me. It's a somewhat surprising change, but I can see no >> flaw in it. > > Surprising in that any change to find_module is needed, or surprising > that it now returns one of {None, loader, str}? > Both, actually. I had expected that new API (i.e. a new method of some kind) would be necessary, so it has elegance that this is not required. OTOH, explicit type checking is despised in the OO world, and varying result types are disliked by Guido van Rossum (not sure whether this reservation applies to this case as well, or only to cases where the return type depends on the parameter types). Regards, Martin From barry at python.org Wed May 2 20:50:05 2012 From: barry at python.org (Barry Warsaw) Date: Wed, 2 May 2012 14:50:05 -0400 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages In-Reply-To: <4FA17DA9.1070207@v.loewis.de> References: <4F90730D.1040808@trueblade.com> <4FA05CFC.6050609@trueblade.com> <4FA0DF6C.4090709@v.loewis.de> <4FA10B15.1000302@trueblade.com> <4FA17DA9.1070207@v.loewis.de> Message-ID: <20120502145005.4d0633b4@resist.wooz.org> On May 02, 2012, at 08:32 PM, Martin v. L?wis wrote: >Both, actually. I had expected that new API (i.e. a new method of some kind) >would be necessary, so it has elegance that this is not required. OTOH, >explicit type checking is despised in the OO world, and varying result types >are disliked by Guido van Rossum (not sure whether this reservation applies >to this case as well, or only to cases where the return type depends on the >parameter types). My understanding (and I'm sure Guido will correct me if I'm wrong) is that it's the latter: return type should not depend on function argument values. -Barry From brett at python.org Wed May 2 20:53:17 2012 From: brett at python.org (Brett Cannon) Date: Wed, 2 May 2012 14:53:17 -0400 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages In-Reply-To: <4FA17DA9.1070207@v.loewis.de> References: <4F90730D.1040808@trueblade.com> <4FA05CFC.6050609@trueblade.com> <4FA0DF6C.4090709@v.loewis.de> <4FA10B15.1000302@trueblade.com> <4FA17DA9.1070207@v.loewis.de> Message-ID: On Wed, May 2, 2012 at 2:32 PM, "Martin v. L?wis" wrote: > On 02.05.2012 12:23, Eric V. Smith wrote: > >> On 5/2/2012 3:17 AM, "Martin v. L?wis" wrote: >> >>> About the only question I have is: Is everyone okay with the changes to >>>> the finders, described in the PEP? >>>> >>> >>> It looks good to me. It's a somewhat surprising change, but I can see no >>> flaw in it. >>> >> >> Surprising in that any change to find_module is needed, or surprising >> that it now returns one of {None, loader, str}? >> >> > Both, actually. I had expected that new API (i.e. a new method of some > kind) would be necessary, so it has elegance that this is not required. > OTOH, explicit type checking is despised in the OO world, and varying > result types are disliked by Guido van Rossum (not sure whether this > reservation applies to this case as well, or only to cases where the > return type depends on the parameter types). > You actually don't need to explicitly type-check and instead can rely on duck typing:: if loader is None: continue elif hasattr(loader, 'load_module'): return loader else: namespace.append(loader) continue -------------- next part -------------- An HTML attachment was scrubbed... URL: From eric at trueblade.com Wed May 2 21:28:42 2012 From: eric at trueblade.com (Eric V. Smith) Date: Wed, 02 May 2012 15:28:42 -0400 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages In-Reply-To: References: <4F90730D.1040808@trueblade.com> <4FA05CFC.6050609@trueblade.com> <4FA0DF6C.4090709@v.loewis.de> <4FA10B15.1000302@trueblade.com> <4FA17DA9.1070207@v.loewis.de> Message-ID: <4FA18AEA.9070406@trueblade.com> On 05/02/2012 02:53 PM, Brett Cannon wrote: > You actually don't need to explicitly type-check and instead can rely on > duck typing:: > > if loader is None: continue > elif hasattr(loader, 'load_module'): return loader > else: > namespace.append(loader) > continue While I agree that this accomplishes the job, I don't think it's any more readable than the existing code: if isinstance(loader, str): namespace.append(loader) elif loader: return loader (with the case of None causing the code to loop) But I'm open to changing it. As to the three return types: Given that find_module() has all of the information, I don't think it makes sense to add another method. And for backward compatibility, we need to keep the {None, loader} return types. If you agree that adding another method is wasteful (it will have to do most of the same work as find_module(), or cache its result), then I think adding a str return type makes the most sense. I can't foresee this ever causing an actual problem. No one is going to subclass a loader from str (famous last words, I know!). Eric. From brett at python.org Wed May 2 21:39:47 2012 From: brett at python.org (Brett Cannon) Date: Wed, 2 May 2012 15:39:47 -0400 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages In-Reply-To: <4FA18AEA.9070406@trueblade.com> References: <4F90730D.1040808@trueblade.com> <4FA05CFC.6050609@trueblade.com> <4FA0DF6C.4090709@v.loewis.de> <4FA10B15.1000302@trueblade.com> <4FA17DA9.1070207@v.loewis.de> <4FA18AEA.9070406@trueblade.com> Message-ID: On Wed, May 2, 2012 at 3:28 PM, Eric V. Smith wrote: > On 05/02/2012 02:53 PM, Brett Cannon wrote: > > > You actually don't need to explicitly type-check and instead can rely on > > duck typing:: > > > > if loader is None: continue > > elif hasattr(loader, 'load_module'): return loader > > else: > > namespace.append(loader) > > continue > > While I agree that this accomplishes the job, I don't think it's any > more readable than the existing code: > > if isinstance(loader, str): > namespace.append(loader) > elif loader: > return loader > > (with the case of None causing the code to loop) > > But I'm open to changing it. > > I honestly don't care. I just wanted to point out to Martin that if he wanted a more interface check over type check it's totally doable. > As to the three return types: Given that find_module() has all of the > information, I don't think it makes sense to add another method. And for > backward compatibility, we need to keep the {None, loader} return types. > If you agree that adding another method is wasteful (it will have to do > most of the same work as find_module(), or cache its result), then I > think adding a str return type makes the most sense. > > I can't foresee this ever causing an actual problem. No one is going to > subclass a loader from str (famous last words, I know!). Just as I know PJE is going to point out that your loader test won't work if a loader happens to be false and thus you should do an explicit ``is not None`` check. -------------- next part -------------- An HTML attachment was scrubbed... URL: From brett at python.org Wed May 2 21:40:41 2012 From: brett at python.org (Brett Cannon) Date: Wed, 2 May 2012 15:40:41 -0400 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages In-Reply-To: <20120502145005.4d0633b4@resist.wooz.org> References: <4F90730D.1040808@trueblade.com> <4FA05CFC.6050609@trueblade.com> <4FA0DF6C.4090709@v.loewis.de> <4FA10B15.1000302@trueblade.com> <4FA17DA9.1070207@v.loewis.de> <20120502145005.4d0633b4@resist.wooz.org> Message-ID: On Wed, May 2, 2012 at 2:50 PM, Barry Warsaw wrote: > On May 02, 2012, at 08:32 PM, Martin v. L?wis wrote: > > >Both, actually. I had expected that new API (i.e. a new method of some > kind) > >would be necessary, so it has elegance that this is not required. OTOH, > >explicit type checking is despised in the OO world, and varying result > types > >are disliked by Guido van Rossum (not sure whether this reservation > applies > >to this case as well, or only to cases where the return type depends on > the > >parameter types). > > My understanding (and I'm sure Guido will correct me if I'm wrong) is that > it's the latter: return type should not depend on function argument values. This is how I interpreted Guido's preference (e.g. return bytes or str based on whether an argument(s) is bytes or str). -------------- next part -------------- An HTML attachment was scrubbed... URL: From eric at trueblade.com Wed May 2 21:47:37 2012 From: eric at trueblade.com (Eric V. Smith) Date: Wed, 02 May 2012 15:47:37 -0400 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages In-Reply-To: References: <4F90730D.1040808@trueblade.com> <4FA05CFC.6050609@trueblade.com> <4FA0DF6C.4090709@v.loewis.de> <4FA10B15.1000302@trueblade.com> <4FA17DA9.1070207@v.loewis.de> <4FA18AEA.9070406@trueblade.com> Message-ID: <4FA18F59.5070701@trueblade.com> On 05/02/2012 03:39 PM, Brett Cannon wrote: > I can't foresee this ever causing an actual problem. No one is going to > subclass a loader from str (famous last words, I know!). > > > Just as I know PJE is going to point out that your loader test won't > work if a loader happens to be false and thus you should do an explicit > ``is not None`` check. Good one! I'll make that change. From pje at telecommunity.com Wed May 2 23:05:51 2012 From: pje at telecommunity.com (PJ Eby) Date: Wed, 2 May 2012 17:05:51 -0400 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages In-Reply-To: <4FA16DC5.1000204@trueblade.com> References: <4F90730D.1040808@trueblade.com> <4FA05CFC.6050609@trueblade.com> <4FA0DF6C.4090709@v.loewis.de> <4FA10B15.1000302@trueblade.com> <4FA16DC5.1000204@trueblade.com> Message-ID: On Wed, May 2, 2012 at 1:24 PM, Eric V. Smith wrote: > On 05/02/2012 01:06 PM, PJ Eby wrote: > > > I do see one point of concern with the spec, though. At one point it > > says that finders must return a path without a trailing separator, but > > at another it says the package __file__ will contain a separator. > > > > This strikes me as inconsistent, and also incompatible with > > non-filesystem-based finder implementations. The import machinery *must > > not* assume that import path strings are filenames, so it is wrong for > > the import machinery to add a path separator that the finder did not > > include. > > > > IOW, I don't think the spec can assume or guarantee anything about the > > strings returned by finders: it MUST treat them as opaque strings. If > > this means that there can't be any meaningful __file__ for a namespace > > package, I think we will have to live with that. > > I've come to the same conclusion myself. I actually had a draft of the > PEP that removed the word "directory", at which point it becomes obvious > that you're adding a path separator to something that might not be a > path name. > > > The only alternative I see is to delegate the string manipulation back > > to the finders, or to change the return value from a string to a (file, > > path) tuple, wherein 'file' is the value to be used as __file__, and > > 'path' is the value to be used in __path__. > > I don't see the value of __file__ at all in the case of namespace > packages. If it's just a hint that it's a namespace package, I think it > would be better to set __file__ to None. That would noisily break some > code that isn't likely to work anyway. > Either None or a missing attribute is fine with me. (One advantage to the missing attribute is that it fails at the exact point where the inspecting code needs fixing, whereas the None will get passed on to some other code before the error manifests itsefl.) By the way, I finished reading the rest of the PEP, and with regard to auto-updating paths, I want to mention that it wasn't me who originally brought up issues about auto-update, it was someone on Python-Dev, and the use cases were discussed there. Also, I would challenge the argument about it being a major block to implementation, since the implementation is straightforward (and TONS simpler than setuptools' approach to the problem). More to the point, though, supporting auto-updates *later* is not really an option, since we'd be changing the rules on people, and invalidating whatever workarounds people come up with for manually updating the path. If namespace package __path__ objects start out as some other type than lists, then there's no change to trip anyone up later. I guess my point is that if we're not going to do auto-updates from the start, it's kind of going to rule it out in the long term as well, so if that's the intention it should be explicitly addressed. I don't want to see it just get ruled out by default due to not being done now, and then not being able to be done later. That's why my earlier question was about whether it had been discussed or not -- there was previous discussion on it in the 402 context, and it was left as an open issue pending BDFL comment on the basic idea of 402. Since then, the basic idea of treating init-less directories as namespace packages has been blessed, so now it's time to get the auto-updates yea-or-nay question ruled on as well. The implementation is pretty trivial; see PEP 402 version of it here: http://mail.python.org/pipermail/import-sig/2012-April/000473.html ...and the PEP 420 version is even simpler, since instead of looking for a 'get_subpath()' method on the finders, it should just call find_module() and check for a string return. -------------- next part -------------- An HTML attachment was scrubbed... URL: From eric at trueblade.com Thu May 3 02:58:27 2012 From: eric at trueblade.com (Eric V. Smith) Date: Wed, 02 May 2012 20:58:27 -0400 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages In-Reply-To: References: <4F90730D.1040808@trueblade.com> <4FA05CFC.6050609@trueblade.com> <4FA0DF6C.4090709@v.loewis.de> <4FA10B15.1000302@trueblade.com> <4FA16DC5.1000204@trueblade.com> Message-ID: <4FA1D833.20208@trueblade.com> On 5/2/2012 5:05 PM, PJ Eby wrote: > I don't see the value of __file__ at all in the case of namespace > packages. If it's just a hint that it's a namespace package, I think it > would be better to set __file__ to None. That would noisily break some > code that isn't likely to work anyway. > > > Either None or a missing attribute is fine with me. (One advantage to > the missing attribute is that it fails at the exact point where the > inspecting code needs fixing, whereas the None will get passed on to > some other code before the error manifests itsefl.) I can go either way on this, but would lean toward __file__ not being set. Brett: what's your opinion? > By the way, I finished reading the rest of the PEP, and with regard to > auto-updating paths, I want to mention that it wasn't me who originally > brought up issues about auto-update, it was someone on Python-Dev, and > the use cases were discussed there. Also, I would challenge the > argument about it being a major block to implementation, since the > implementation is straightforward (and TONS simpler than setuptools' > approach to the problem). > > I guess my point is that if we're not going to do auto-updates from the > start, it's kind of going to rule it out in the long term as well, so if > that's the intention it should be explicitly addressed. I don't want to > see it just get ruled out by default due to not being done now, and then > not being able to be done later. Okay. I'll take a look at it tomorrow to see what's involved and if we're backing ourselves into a corner or not. Thanks. Eric. From barry at python.org Thu May 3 03:23:55 2012 From: barry at python.org (Barry Warsaw) Date: Wed, 2 May 2012 21:23:55 -0400 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages In-Reply-To: <4FA1D833.20208@trueblade.com> References: <4F90730D.1040808@trueblade.com> <4FA05CFC.6050609@trueblade.com> <4FA0DF6C.4090709@v.loewis.de> <4FA10B15.1000302@trueblade.com> <4FA16DC5.1000204@trueblade.com> <4FA1D833.20208@trueblade.com> Message-ID: <20120502212355.6bda4cd4@resist.wooz.org> On May 02, 2012, at 08:58 PM, Eric V. Smith wrote: >On 5/2/2012 5:05 PM, PJ Eby wrote: > >> I don't see the value of __file__ at all in the case of namespace >> packages. If it's just a hint that it's a namespace package, I think it >> would be better to set __file__ to None. That would noisily break some >> code that isn't likely to work anyway. >> >> >> Either None or a missing attribute is fine with me. (One advantage to >> the missing attribute is that it fails at the exact point where the >> inspecting code needs fixing, whereas the None will get passed on to >> some other code before the error manifests itsefl.) > >I can go either way on this, but would lean toward __file__ not being >set. Brett: what's your opinion? I rather like __file__ not existing, although I haven't really thought about the practical effects. PJE makes a good argument though. -Barry From pje at telecommunity.com Thu May 3 06:37:25 2012 From: pje at telecommunity.com (PJ Eby) Date: Thu, 3 May 2012 00:37:25 -0400 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages In-Reply-To: <20120502212355.6bda4cd4@resist.wooz.org> References: <4F90730D.1040808@trueblade.com> <4FA05CFC.6050609@trueblade.com> <4FA0DF6C.4090709@v.loewis.de> <4FA10B15.1000302@trueblade.com> <4FA16DC5.1000204@trueblade.com> <4FA1D833.20208@trueblade.com> <20120502212355.6bda4cd4@resist.wooz.org> Message-ID: On Wed, May 2, 2012 at 9:23 PM, Barry Warsaw wrote: > On May 02, 2012, at 08:58 PM, Eric V. Smith wrote: > > >On 5/2/2012 5:05 PM, PJ Eby wrote: > > > >> I don't see the value of __file__ at all in the case of namespace > >> packages. If it's just a hint that it's a namespace package, I > think it > >> would be better to set __file__ to None. That would noisily break > some > >> code that isn't likely to work anyway. > >> > >> > >> Either None or a missing attribute is fine with me. (One advantage to > >> the missing attribute is that it fails at the exact point where the > >> inspecting code needs fixing, whereas the None will get passed on to > >> some other code before the error manifests itsefl.) > > > >I can go either way on this, but would lean toward __file__ not being > >set. Brett: what's your opinion? > > I rather like __file__ not existing, although I haven't really thought > about > the practical effects. PJE makes a good argument though. > There's a counterargument that I realized later: PEP 302 currently requires that __file__ be set, AND that it be a string. "The privilege of not having a __file__ attribute at all is reserved for built-in modules." (Of course, that argues equally against __file__ being None, so I'm not sure it helps any to point that out!) Still, code that expects to do something with a package's __file__ is *going* to break somehow with a namespace package, so it's probably better for it to break sooner rather than later. -------------- next part -------------- An HTML attachment was scrubbed... URL: From ncoghlan at gmail.com Thu May 3 08:23:34 2012 From: ncoghlan at gmail.com (Nick Coghlan) Date: Thu, 3 May 2012 16:23:34 +1000 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages In-Reply-To: References: <4F90730D.1040808@trueblade.com> <4FA05CFC.6050609@trueblade.com> <4FA0DF6C.4090709@v.loewis.de> <4FA10B15.1000302@trueblade.com> <4FA16DC5.1000204@trueblade.com> <4FA1D833.20208@trueblade.com> <20120502212355.6bda4cd4@resist.wooz.org> Message-ID: On Thu, May 3, 2012 at 2:37 PM, PJ Eby wrote: > Still, code that expects to do something with a package's __file__ is > *going* to break somehow with a namespace package, so it's probably better > for it to break sooner rather than later. My own preference is for markers like "", "" and "". They're significantly nicer to deal with when dumping module state for diagnostic purposes. If I get a KeyError on __file__, or an AttributeError on NoneType when all I'm trying to do is display data, it's annoying. Standardising on a pattern also opens up the possibility of doing something meaningful with it in get_data() later. One of the guarantees of PEP 302 if that you should be able to do this: data_ref = os.path.join(__file__, relative_ref) data = __loader__.get_data(data_ref) That should really only blow up in get_data(), *not* on the os.path.join step. Ideally, you should also be able to do this: data_ref = os.path.join(mod.__file__, relative_ref) data = mod.__loader__.get_data(data_ref) I see it as being similar to the mandatory file attribute on code objects - placeholders like "" and "" are a lot more informative when errors occur than just using None, even though neither of them is a valid filesystem path. Cheers, Nick. -- Nick Coghlan?? |?? ncoghlan at gmail.com?? |?? Brisbane, Australia From martin at v.loewis.de Thu May 3 10:37:02 2012 From: martin at v.loewis.de (martin at v.loewis.de) Date: Thu, 03 May 2012 10:37:02 +0200 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages In-Reply-To: <4FA1D833.20208@trueblade.com> References: <4F90730D.1040808@trueblade.com> <4FA05CFC.6050609@trueblade.com> <4FA0DF6C.4090709@v.loewis.de> <4FA10B15.1000302@trueblade.com> <4FA16DC5.1000204@trueblade.com> <4FA1D833.20208@trueblade.com> Message-ID: <20120503103702.Horde.pBsTdLuWis5PokOuiL1VKAA@webmail.df.eu> > I can go either way on this, but would lean toward __file__ not being > set. Brett: what's your opinion? I'd like to recall that we were explicitly discussion this question at PyCon, and (IIRC) I proposed that it be None, and Guido pronounced that it shall be the path to the first portion. So if you now want to change it, you should check with him again. Regards, Martin From eric at trueblade.com Thu May 3 14:28:03 2012 From: eric at trueblade.com (Eric V. Smith) Date: Thu, 03 May 2012 08:28:03 -0400 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages In-Reply-To: <20120503103702.Horde.pBsTdLuWis5PokOuiL1VKAA@webmail.df.eu> References: <4F90730D.1040808@trueblade.com> <4FA05CFC.6050609@trueblade.com> <4FA0DF6C.4090709@v.loewis.de> <4FA10B15.1000302@trueblade.com> <4FA16DC5.1000204@trueblade.com> <4FA1D833.20208@trueblade.com> <20120503103702.Horde.pBsTdLuWis5PokOuiL1VKAA@webmail.df.eu> Message-ID: <4FA279D3.6090701@trueblade.com> On 5/3/2012 4:37 AM, martin at v.loewis.de wrote: >> I can go either way on this, but would lean toward __file__ not being >> set. Brett: what's your opinion? > > I'd like to recall that we were explicitly discussion this question at > PyCon, and (IIRC) I proposed that it be None, and Guido pronounced that > it shall be the path to the first portion. So if you now want to change it, > you should check with him again. I recall that, and I also recall advocating None. I see the process as: - come to a consensus here - update the PEP, documenting this discussion - update the implementation - get Guido to rule on the PEP Eric. From brett at python.org Thu May 3 16:48:43 2012 From: brett at python.org (Brett Cannon) Date: Thu, 3 May 2012 10:48:43 -0400 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages In-Reply-To: References: <4F90730D.1040808@trueblade.com> <4FA05CFC.6050609@trueblade.com> <4FA0DF6C.4090709@v.loewis.de> <4FA10B15.1000302@trueblade.com> <4FA16DC5.1000204@trueblade.com> <4FA1D833.20208@trueblade.com> <20120502212355.6bda4cd4@resist.wooz.org>

Message-ID: On Thu, May 3, 2012 at 2:23 AM, Nick Coghlan wrote: > On Thu, May 3, 2012 at 2:37 PM, PJ Eby wrote: > > Still, code that expects to do something with a package's __file__ is > > *going* to break somehow with a namespace package, so it's probably > better > > for it to break sooner rather than later. > I'm going to roll my replies all into this email to keep things simple. So, to the people not wanting to set __file__, that (probably) won't fly because it has been documented for years that built-in modules are the only things that don't define __file__. Or we at least need to explain to people how to tell the difference in a backwards-compatible fashion (e.g. ``module.__name__ in sys.builtin_module_names``). > > My own preference is for markers like "", "" and > "". > So I would have said that had experience with the stdlib not big me on this. In my situation, the trace module was checking file, and if __file__ didn't contain "" or "'), but I wonder how many people made a similar whitelist approach. And while having __file__ to None or non-existent will take about the same amount of time to fix, it is less prone to silly whitelisting like what the trace module had. > > They're significantly nicer to deal with when dumping module state for > diagnostic purposes. If I get a KeyError on __file__, or an > AttributeError on NoneType when all I'm trying to do is display data, > it's annoying. > > Standardising on a pattern also opens up the possibility of doing > something meaningful with it in get_data() later. One of the > guarantees of PEP 302 if that you should be able to do this: > > data_ref = os.path.join(__file__, relative_ref) > data = __loader__.get_data(data_ref) > > That should really only blow up in get_data(), *not* on the > os.path.join step. Ideally, you should also be able to do this: > > data_ref = os.path.join(mod.__file__, relative_ref) > data = mod.__loader__.get_data(data_ref) > > I see it as being similar to the mandatory file attribute on code > objects - placeholders like "" and "" are a lot more > informative when errors occur than just using None, even though > neither of them is a valid filesystem path. > But that's because there are no other introspection options to tell where the module originated, unlike modules which have __loader__. > > Cheers, > Nick. > > -- > Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia > _______________________________________________ > Import-SIG mailing list > Import-SIG at python.org > http://mail.python.org/mailman/listinfo/import-sig > -------------- next part -------------- An HTML attachment was scrubbed... URL: From eric at trueblade.com Thu May 3 17:00:26 2012 From: eric at trueblade.com (Eric V. Smith) Date: Thu, 03 May 2012 11:00:26 -0400 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages In-Reply-To: References: <4F90730D.1040808@trueblade.com> <4FA05CFC.6050609@trueblade.com> <4FA0DF6C.4090709@v.loewis.de> <4FA10B15.1000302@trueblade.com> <4FA16DC5.1000204@trueblade.com> <4FA1D833.20208@trueblade.com> <20120502212355.6bda4cd4@resist.wooz.org>

Message-ID: <4FA29D8A.7020103@trueblade.com> On 5/3/2012 2:23 AM, Nick Coghlan wrote: > On Thu, May 3, 2012 at 2:37 PM, PJ Eby wrote: >> Still, code that expects to do something with a package's __file__ is >> *going* to break somehow with a namespace package, so it's probably better >> for it to break sooner rather than later. > > My own preference is for markers like "", "" and "". It looks like "" is indeed used, but built in modules do not set __file__. So I don't really see that as a precedent for setting it to something, but I do agree with most of your points below. > They're significantly nicer to deal with when dumping module state for > diagnostic purposes. If I get a KeyError on __file__, or an > AttributeError on NoneType when all I'm trying to do is display data, > it's annoying. > > Standardising on a pattern also opens up the possibility of doing > something meaningful with it in get_data() later. One of the > guarantees of PEP 302 if that you should be able to do this: > > data_ref = os.path.join(__file__, relative_ref) > data = __loader__.get_data(data_ref) > > That should really only blow up in get_data(), *not* on the > os.path.join step. Ideally, you should also be able to do this: > > data_ref = os.path.join(mod.__file__, relative_ref) > data = mod.__loader__.get_data(data_ref) While I embrace the pattern, I don't see how it could ever work for a namespace package. The defining quality is that the namespace package itself doesn't contain any files. And NamespaceLoader doesn't define get_data for this reason. > I see it as being similar to the mandatory file attribute on code > objects - placeholders like "" and "" are a lot more > informative when errors occur than just using None, even though > neither of them is a valid filesystem path. So the 4 options on the table are: 1. Add a (possibly meaningless) trailing slash character. 2. Use None. 3. Do not set it. 4. Set it to "". We'll discuss it today at our sprint. From brett at python.org Thu May 3 17:09:10 2012 From: brett at python.org (Brett Cannon) Date: Thu, 3 May 2012 11:09:10 -0400 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages In-Reply-To: References: <4F90730D.1040808@trueblade.com> <4FA05CFC.6050609@trueblade.com> <4FA0DF6C.4090709@v.loewis.de> <4FA10B15.1000302@trueblade.com> <4FA16DC5.1000204@trueblade.com> <4FA1D833.20208@trueblade.com> <20120502212355.6bda4cd4@resist.wooz.org>

Message-ID: On Thu, May 3, 2012 at 10:48 AM, Brett Cannon wrote: > > > On Thu, May 3, 2012 at 2:23 AM, Nick Coghlan wrote: > >> On Thu, May 3, 2012 at 2:37 PM, PJ Eby wrote: >> > Still, code that expects to do something with a package's __file__ is >> > *going* to break somehow with a namespace package, so it's probably >> better >> > for it to break sooner rather than later. >> > > I'm going to roll my replies all into this email to keep things simple. > > So, to the people not wanting to set __file__, that (probably) won't fly > because it has been documented for years that built-in modules are the only > things that don't define __file__. Or we at least need to explain to people > how to tell the difference in a backwards-compatible fashion (e.g. > ``module.__name__ in sys.builtin_module_names``). > > >> >> My own preference is for markers like "", "" and >> "". >> > > So I would have said that had experience with the stdlib not big me on > this. > That should say "So I would have agreed with that had my experience with the stdlib in bootstrapping importlib not caused me to disagree." Don't try to multi-task at work while in the middle of writing an email is the lesson there. =) -Brett In my situation, the trace module was checking file, and if __file__ didn't > contain "" or " then error out if it couldn't open the file. Now I updated it to > startswith('<') and endswith('>'), but I wonder how many people made a > similar whitelist approach. And while having __file__ to None or > non-existent will take about the same amount of time to fix, it is less > prone to silly whitelisting like what the trace module had. > > >> >> They're significantly nicer to deal with when dumping module state for >> diagnostic purposes. If I get a KeyError on __file__, or an >> AttributeError on NoneType when all I'm trying to do is display data, >> it's annoying. >> >> Standardising on a pattern also opens up the possibility of doing >> something meaningful with it in get_data() later. One of the >> guarantees of PEP 302 if that you should be able to do this: >> >> data_ref = os.path.join(__file__, relative_ref) >> data = __loader__.get_data(data_ref) >> >> That should really only blow up in get_data(), *not* on the >> os.path.join step. Ideally, you should also be able to do this: >> >> data_ref = os.path.join(mod.__file__, relative_ref) >> data = mod.__loader__.get_data(data_ref) >> >> I see it as being similar to the mandatory file attribute on code >> objects - placeholders like "" and "" are a lot more >> informative when errors occur than just using None, even though >> neither of them is a valid filesystem path. >> > > But that's because there are no other introspection options to tell where > the module originated, unlike modules which have __loader__. > > >> >> Cheers, >> Nick. >> >> -- >> Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia >> _______________________________________________ >> Import-SIG mailing list >> Import-SIG at python.org >> http://mail.python.org/mailman/listinfo/import-sig >> > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From pje at telecommunity.com Thu May 3 18:11:00 2012 From: pje at telecommunity.com (PJ Eby) Date: Thu, 3 May 2012 12:11:00 -0400 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages In-Reply-To: References: <4F90730D.1040808@trueblade.com> <4FA05CFC.6050609@trueblade.com> <4FA0DF6C.4090709@v.loewis.de> <4FA10B15.1000302@trueblade.com> <4FA16DC5.1000204@trueblade.com> <4FA1D833.20208@trueblade.com> <20120502212355.6bda4cd4@resist.wooz.org>

Message-ID: On Thu, May 3, 2012 at 2:23 AM, Nick Coghlan wrote: > Standardising on a pattern also opens up the possibility of doing > something meaningful with it in get_data() later. One of the > guarantees of PEP 302 if that you should be able to do this: > > data_ref = os.path.join(__file__, relative_ref) > data = __loader__.get_data(data_ref) > Um, namespace package modules shouldn't have a __loader__ either, should they? -------------- next part -------------- An HTML attachment was scrubbed... URL: From barry at python.org Thu May 3 18:15:41 2012 From: barry at python.org (Barry Warsaw) Date: Thu, 3 May 2012 12:15:41 -0400 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages References: <4F90730D.1040808@trueblade.com> <4FA05CFC.6050609@trueblade.com> <4FA0DF6C.4090709@v.loewis.de> <4FA10B15.1000302@trueblade.com> <4FA16DC5.1000204@trueblade.com> <4FA1D833.20208@trueblade.com> <20120502212355.6bda4cd4@resist.wooz.org>

Message-ID: <20120503121541.6b5ff385@resist.wooz.org> On May 03, 2012, at 10:48 AM, Brett Cannon wrote: >So, to the people not wanting to set __file__, that (probably) won't fly >because it has been documented for years that built-in modules are the only >things that don't define __file__. Okay, but *why* is this the rule, other than that PEP 302 says it? IOW, PEP 302 doesn't give much of a rationale for the rule, and I suspect it just reflected the reality back in 2002. >Or we at least need to explain to people how to tell the difference in a >backwards-compatible fashion. Definitely, and I think that would be fine to include in PEP 420. >So I would have said that had experience with the stdlib not big me on >this. In my situation, the trace module was checking file, and if __file__ >didn't contain "" or "and then error out if it couldn't open the file. Now I updated it to >startswith('<') and endswith('>'), but I wonder how many people made a >similar whitelist approach. And while having __file__ to None or >non-existent will take about the same amount of time to fix, it is less >prone to silly whitelisting like what the trace module had. See what I mean about arbitrary and underdocumented? :) Cheers, -Barry -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 836 bytes Desc: not available URL: From brett at python.org Thu May 3 18:47:39 2012 From: brett at python.org (Brett Cannon) Date: Thu, 3 May 2012 12:47:39 -0400 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages In-Reply-To: References: <4F90730D.1040808@trueblade.com> <4FA05CFC.6050609@trueblade.com> <4FA0DF6C.4090709@v.loewis.de> <4FA10B15.1000302@trueblade.com> <4FA16DC5.1000204@trueblade.com> <4FA1D833.20208@trueblade.com> <20120502212355.6bda4cd4@resist.wooz.org>

Message-ID: On Thu, May 3, 2012 at 12:11 PM, PJ Eby wrote: > On Thu, May 3, 2012 at 2:23 AM, Nick Coghlan wrote: > >> Standardising on a pattern also opens up the possibility of doing >> something meaningful with it in get_data() later. One of the >> guarantees of PEP 302 if that you should be able to do this: >> >> data_ref = os.path.join(__file__, relative_ref) >> data = __loader__.get_data(data_ref) >> > > Um, namespace package modules shouldn't have a __loader__ either, should > they? > No, they should (and PEP 302 now requires that). Namespace modules are loaded by a loader, and thus should have it defined. It's all the other optional interfaces that they don't need to have (e.g. NamespaceLoader should have importlib.abc.Loader and probably none of the other ABCs). > > > _______________________________________________ > Import-SIG mailing list > Import-SIG at python.org > http://mail.python.org/mailman/listinfo/import-sig > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From brett at python.org Thu May 3 18:49:23 2012 From: brett at python.org (Brett Cannon) Date: Thu, 3 May 2012 12:49:23 -0400 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages In-Reply-To: <20120503121541.6b5ff385@resist.wooz.org> References: <4F90730D.1040808@trueblade.com> <4FA05CFC.6050609@trueblade.com> <4FA0DF6C.4090709@v.loewis.de> <4FA10B15.1000302@trueblade.com> <4FA16DC5.1000204@trueblade.com> <4FA1D833.20208@trueblade.com> <20120502212355.6bda4cd4@resist.wooz.org>

<20120503121541.6b5ff385@resist.wooz.org> Message-ID: On Thu, May 3, 2012 at 12:15 PM, Barry Warsaw wrote: > On May 03, 2012, at 10:48 AM, Brett Cannon wrote: > > >So, to the people not wanting to set __file__, that (probably) won't fly > >because it has been documented for years that built-in modules are the > only > >things that don't define __file__. > > Okay, but *why* is this the rule, other than that PEP 302 says it? IOW, > PEP > 302 doesn't give much of a rationale for the rule, and I suspect it just > reflected the reality back in 2002. > Exactly. I am willing to be that historically it's just because that was the only way you could tell what was or was not a built-in module. > > >Or we at least need to explain to people how to tell the difference in a > >backwards-compatible fashion. > > Definitely, and I think that would be fine to include in PEP 420. > > >So I would have said that had experience with the stdlib not big me on > >this. In my situation, the trace module was checking file, and if __file__ > >didn't contain "" or " >and then error out if it couldn't open the file. Now I updated it to > >startswith('<') and endswith('>'), but I wonder how many people made a > >similar whitelist approach. And while having __file__ to None or > >non-existent will take about the same amount of time to fix, it is less > >prone to silly whitelisting like what the trace module had. > > See what I mean about arbitrary and underdocumented? :) > I don't remind me about "arbitrary and underdocumented" when it comes to the import system. =P -Brett > > Cheers, > -Barry > > _______________________________________________ > Import-SIG mailing list > Import-SIG at python.org > http://mail.python.org/mailman/listinfo/import-sig > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ncoghlan at gmail.com Fri May 4 00:20:16 2012 From: ncoghlan at gmail.com (Nick Coghlan) Date: Fri, 4 May 2012 08:20:16 +1000 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages In-Reply-To: References: <4F90730D.1040808@trueblade.com> <4FA05CFC.6050609@trueblade.com> <4FA0DF6C.4090709@v.loewis.de> <4FA10B15.1000302@trueblade.com> <4FA16DC5.1000204@trueblade.com> <4FA1D833.20208@trueblade.com> <20120502212355.6bda4cd4@resist.wooz.org>

Message-ID: I'd still prefer to just officially bless the existing "" convention for non-filesystem imports over encouraging type checks on __loader__ or defining a new introspection interface for loaders. If we say "this is the stdlib convention" people are going to start using the same check as is now used in traceback.py The precedent is there with code objects, and I think it's a good example to follow. Cheers, Nick. -- Sent from my phone, thus the relative brevity :) -------------- next part -------------- An HTML attachment was scrubbed... URL: From guido at python.org Fri May 4 00:43:40 2012 From: guido at python.org (Guido van Rossum) Date: Thu, 3 May 2012 15:43:40 -0700 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages In-Reply-To: References: <4F90730D.1040808@trueblade.com> <4FA05CFC.6050609@trueblade.com> <4FA0DF6C.4090709@v.loewis.de> <4FA10B15.1000302@trueblade.com> <4FA16DC5.1000204@trueblade.com> <4FA1D833.20208@trueblade.com> <20120502212355.6bda4cd4@resist.wooz.org>

Message-ID: +1 On Thu, May 3, 2012 at 3:20 PM, Nick Coghlan wrote: > I'd still prefer to just officially bless the existing "" > convention for non-filesystem imports over encouraging type checks on > __loader__ or defining a new introspection interface for loaders. > > If we say "this is the stdlib convention" people are going to start using > the same check as is now used in traceback.py > > The precedent is there with code objects, and I think it's a good example to > follow. > > Cheers, > Nick. > > -- > Sent from my phone, thus the relative brevity :) > > > _______________________________________________ > Import-SIG mailing list > Import-SIG at python.org > http://mail.python.org/mailman/listinfo/import-sig > -- --Guido van Rossum (python.org/~guido) From pje at telecommunity.com Fri May 4 02:05:15 2012 From: pje at telecommunity.com (PJ Eby) Date: Thu, 3 May 2012 20:05:15 -0400 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages In-Reply-To: References: <4F90730D.1040808@trueblade.com> <4FA05CFC.6050609@trueblade.com> <4FA0DF6C.4090709@v.loewis.de> <4FA10B15.1000302@trueblade.com> <4FA16DC5.1000204@trueblade.com> <4FA1D833.20208@trueblade.com> <20120502212355.6bda4cd4@resist.wooz.org>

Message-ID: On Thu, May 3, 2012 at 6:20 PM, Nick Coghlan wrote: > I'd still prefer to just officially bless the existing "" > convention for non-filesystem imports over encouraging type checks on > __loader__ or defining a new introspection interface for loaders. > > If we say "this is the stdlib convention" people are going to start using > the same check as is now used in traceback.py > > The precedent is there with code objects, and I think it's a good example > to follow. > Note that this messes with the idea of using the first directory as filename -- anybody who joins with os.path.dirname(__file__) is going to get a mess (on regular filesystem paths), which is (I'm guessing) why the trailing separator idea was proposed in the first place. Which kind of brings us full circle on that point. I suppose we could just say screw it, anybody implementing VFS importers had darn well better understand os.path.join and friends, since PEP 302 requires it for get_data anyway. Still seems like a wart, but oh well. OTOH, maybe it's better for people munging __file__ to get a weird error all the time with namespace packages, instead of something that works some of the time, and fails later? -------------- next part -------------- An HTML attachment was scrubbed... URL: From martin at v.loewis.de Fri May 4 02:11:02 2012 From: martin at v.loewis.de (martin at v.loewis.de) Date: Fri, 04 May 2012 02:11:02 +0200 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages In-Reply-To: <20120503121541.6b5ff385@resist.wooz.org> References: <4F90730D.1040808@trueblade.com> <4FA05CFC.6050609@trueblade.com> <4FA0DF6C.4090709@v.loewis.de> <4FA10B15.1000302@trueblade.com> <4FA16DC5.1000204@trueblade.com> <4FA1D833.20208@trueblade.com> <20120502212355.6bda4cd4@resist.wooz.org>

<20120503121541.6b5ff385@resist.wooz.org> Message-ID: <20120504021102.Horde.4iA2c9jz9kRPox6WHie3KUA@webmail.df.eu> Zitat von Barry Warsaw : > On May 03, 2012, at 10:48 AM, Brett Cannon wrote: > >> So, to the people not wanting to set __file__, that (probably) won't fly >> because it has been documented for years that built-in modules are the only >> things that don't define __file__. > > Okay, but *why* is this the rule, other than that PEP 302 says it? I think it predates PEP 302 by a decade or so. You might also ask why the keyword is "def", and not "define" (other than that the Grammar says so). It's a natural thing, also: If the module comes from the file system, it has an __file__ attribute, else it's built-in. Regards, Martin From ncoghlan at gmail.com Fri May 4 03:05:16 2012 From: ncoghlan at gmail.com (Nick Coghlan) Date: Fri, 4 May 2012 11:05:16 +1000 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages In-Reply-To: References: <4F90730D.1040808@trueblade.com> <4FA05CFC.6050609@trueblade.com> <4FA0DF6C.4090709@v.loewis.de> <4FA10B15.1000302@trueblade.com> <4FA16DC5.1000204@trueblade.com> <4FA1D833.20208@trueblade.com> <20120502212355.6bda4cd4@resist.wooz.org>

Message-ID: On Fri, May 4, 2012 at 10:05 AM, PJ Eby wrote: > On Thu, May 3, 2012 at 6:20 PM, Nick Coghlan wrote: >> >> I'd still prefer to just officially bless the existing "" >> convention for non-filesystem imports over encouraging type checks on >> __loader__ or defining a new introspection interface for loaders. >> >> If we say "this is the stdlib convention" people are going to start using >> the same check as is now used in traceback.py >> >> The precedent is there with code objects, and I think it's a good example >> to follow. > > Note that this messes with the idea of using the first directory as filename > -- anybody who joins with os.path.dirname(__file__) is going to get a mess > (on regular filesystem paths), which is (I'm guessing) why the trailing > separator idea was proposed in the first place. > > Which kind of brings us full circle on that point.? I suppose we could just > say screw it, anybody implementing VFS importers had darn well better > understand os.path.join and friends, since PEP 302 requires it for get_data > anyway. Yep. It also means VFS importers are officially free to put all the metadata they want inside the angle brackets, secure in the knowledge that everyone else should be treating it as an opaque blob. It then becomes a way for them to pass necessary info to get_data() *without* having to create distinct loader instances for every module. Arguably, we should also be adding the angle brackets in zipimporter (since those aren't real filesystem paths). > Still seems like a wart, but oh well.? OTOH, maybe it's better for people > munging __file__ to get a weird error all the time with namespace packages, > instead of something that works some of the time, and fails later? Right. Otherwise we'd get layout dependent behaviour where dubious cross-portion references worked if all portions were installed to the same path segment, but then failed if they were split across multiple segments. Cheers, Nick. -- Nick Coghlan?? |?? ncoghlan at gmail.com?? |?? Brisbane, Australia From eric at trueblade.com Fri May 4 03:21:44 2012 From: eric at trueblade.com (Eric V. Smith) Date: Thu, 03 May 2012 21:21:44 -0400 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages In-Reply-To: References: <4F90730D.1040808@trueblade.com> <4FA05CFC.6050609@trueblade.com> <4FA0DF6C.4090709@v.loewis.de> <4FA10B15.1000302@trueblade.com> <4FA16DC5.1000204@trueblade.com> <4FA1D833.20208@trueblade.com> <20120502212355.6bda4cd4@resist.wooz.org>

Message-ID: <4FA32F28.5040200@trueblade.com> On 05/03/2012 09:05 PM, Nick Coghlan wrote: > On Fri, May 4, 2012 at 10:05 AM, PJ Eby wrote: >> Still seems like a wart, but oh well. OTOH, maybe it's better for people >> munging __file__ to get a weird error all the time with namespace packages, >> instead of something that works some of the time, and fails later? > > Right. Otherwise we'd get layout dependent behaviour where dubious > cross-portion references worked if all portions were installed to the > same path segment, but then failed if they were split across multiple > segments. Under no circumstances should anyone be looking at __file__ for a namespace package in order to find a related file. We should do something that causes this to always break. Eric. From barry at python.org Fri May 4 16:34:50 2012 From: barry at python.org (Barry Warsaw) Date: Fri, 4 May 2012 10:34:50 -0400 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages In-Reply-To: References: <4F90730D.1040808@trueblade.com> <4FA05CFC.6050609@trueblade.com> <4FA0DF6C.4090709@v.loewis.de> <4FA10B15.1000302@trueblade.com> <4FA16DC5.1000204@trueblade.com> <4FA1D833.20208@trueblade.com> <20120502212355.6bda4cd4@resist.wooz.org>

Message-ID: <20120504103450.58286b0c@limelight.wooz.org> On May 04, 2012, at 08:20 AM, Nick Coghlan wrote: >I'd still prefer to just officially bless the existing "" >convention for non-filesystem imports over encouraging type checks on >__loader__ or defining a new introspection interface for loaders. The thing is, that convention is at best meaningless and at worst misleading. I also don't think it gives you all the diagnosis support you really want. The PEP 302 rule (reservation of no __file__ only for built-ins) is a historical relic for which no good rationale exists. Forgetting that for a moment, it simply makes no sense for a module that wasn't loaded from a file system path to have an __file__ attribute. It's also not true even today. At our PEP 420 sprint we noticed importlib does something like this to create new modules: >>> type(sys)('foo') That module isn't a built-in and doesn't have an __file__. It also doesn't have an __loader__, but oh well. (BTW, Brett, that's pretty clever. :) It seemed to us that the only reasonable semantics for such modules is that __file__ is None or __file__ is missing. Not setting __file__ is better though because you get appropriate exceptions at the place where you make the initial mistake (i.e. assuming every module has an __file__). If you set __file__ to None, you may instead get cryptic messages in os.path.join() for example. So, what about the "diagnostics" use case? Certainly a very important use case is the repr of module objects. In the case of modules loaded from the file system, I definitely want to know where the file lives, and the repr is a great way to see that. For other modules, you do want to know something about how that module was created, and having a repr that gives a good indication of that is very useful. But you can easily do that without a contrived __file__ (more on that below). What about other introspection use cases? Relying on __file__ programmatically might be a convenient shorthand, but knowing the loader (via __loader__ if available) is more helpful, because that tells you more about how that module actually came into existence. The value of __file__ is really under the purview of the loader anyway. Consider a hypothetical database loader (or even many different third party database loaders). Of what use is an __file__ that says ''? That way leads to uncertainty, and namespace collisions, for example if both a SQLite loader and a PostgreSQL loader wanted to use the '' value. In either case, maybe you'd prefer to know what the database url is, or maybe the query that produced the module, or some combination there of. Overloading all that into a contrived __file__ seems wrong. I would prefer if the requirement were relaxed, and we simply allowed the loaders to set __file__ to whatever they think is appropriate, which would include allowing them to not setting __file__ at all. It's actually easy to give modules a reasonable repr even without __file__. I have a branch in the PEP 420 feature repo which implements the following rules for module object reprs: * Use mod.__file__ if it exists * Otherwise, get the module's __loader__ * If the module has no loader, then just return the module's name. E.g. >>> type(sys)('foo') * Define a new optional method on loaders, called module_repr() that takes the module as an argument. Use whatever this returns as the module's repr. * As a last fallback, just use the repr of the loader as part of the module's repr. I'm not particularly married to this implementation, but it seems reasonably backward compatible, and flexible enough to support useful alternatives. For example, the BuiltinImporter could define its module_repr() like so: @classmethod def module_repr(cls, module): return ''.format(module.__name__) Specifically, my proposed elaboration on PEP 420 is this: * Explicitly leave the assignment of __file__ to the loader. * Allow loaders to not set __file__ * Add an optional API to loaders, module_repr() as defined above. Cheers, -Barry From barry at python.org Fri May 4 16:51:49 2012 From: barry at python.org (Barry Warsaw) Date: Fri, 4 May 2012 10:51:49 -0400 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages In-Reply-To: <20120504021102.Horde.4iA2c9jz9kRPox6WHie3KUA@webmail.df.eu> References: <4F90730D.1040808@trueblade.com> <4FA05CFC.6050609@trueblade.com> <4FA0DF6C.4090709@v.loewis.de> <4FA10B15.1000302@trueblade.com> <4FA16DC5.1000204@trueblade.com> <4FA1D833.20208@trueblade.com> <20120502212355.6bda4cd4@resist.wooz.org>

<20120503121541.6b5ff385@resist.wooz.org> <20120504021102.Horde.4iA2c9jz9kRPox6WHie3KUA@webmail.df.eu> Message-ID: <20120504105149.472a2f61@limelight.wooz.org> On May 04, 2012, at 02:11 AM, martin at v.loewis.de wrote: >I think it predates PEP 302 by a decade or so. You might also ask why >the keyword is "def", and not "define" (other than that the Grammar says >so). It's a natural thing, also: If the module comes from the file system, >it has an __file__ attribute, else it's built-in. Sure, that makes sense in a 2002 world where we didn't have importlib and all the modernization of the import system. Today, it's not only antiquated, it's also not necessarily true. We're already significantly overhauling the import machinery, so I think it's entirely reasonable to relax this constraint. See my previous post for a proposal. -Barry From barry at python.org Fri May 4 16:56:56 2012 From: barry at python.org (Barry Warsaw) Date: Fri, 4 May 2012 10:56:56 -0400 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages In-Reply-To: References: <4F90730D.1040808@trueblade.com> <4FA05CFC.6050609@trueblade.com> <4FA0DF6C.4090709@v.loewis.de> <4FA10B15.1000302@trueblade.com> <4FA16DC5.1000204@trueblade.com> <4FA1D833.20208@trueblade.com> <20120502212355.6bda4cd4@resist.wooz.org>

Message-ID: <20120504105656.11fca0e9@limelight.wooz.org> On May 04, 2012, at 11:05 AM, Nick Coghlan wrote: >Yep. It also means VFS importers are officially free to put all the >metadata they want inside the angle brackets, secure in the knowledge >that everyone else should be treating it as an opaque blob. It then >becomes a way for them to pass necessary info to get_data() *without* >having to create distinct loader instances for every module. Ooh! I can't wait for the __file__ set to a pickle to steganographically communicate secret messages to get_data(). :) -Barry From pje at telecommunity.com Fri May 4 16:56:56 2012 From: pje at telecommunity.com (PJ Eby) Date: Fri, 4 May 2012 10:56:56 -0400 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages In-Reply-To: <20120504103450.58286b0c@limelight.wooz.org> References: <4F90730D.1040808@trueblade.com> <4FA05CFC.6050609@trueblade.com> <4FA0DF6C.4090709@v.loewis.de> <4FA10B15.1000302@trueblade.com> <4FA16DC5.1000204@trueblade.com> <4FA1D833.20208@trueblade.com> <20120502212355.6bda4cd4@resist.wooz.org>

<20120504103450.58286b0c@limelight.wooz.org> Message-ID: On May 4, 2012 10:34 AM, "Barry Warsaw" wrote: > Specifically, my proposed elaboration on PEP 420 is this: > > * Explicitly leave the assignment of __file__ to the loader. > * Allow loaders to not set __file__ > * Add an optional API to loaders, module_repr() as defined above. +1 on all the above, plus getting rid of __file__ for namespace packages. Seems like an elegant solution to the problems involved, and allows DB or other importers to make their own attributes like __dsn__ or __url__, but still have a decent repr. -------------- next part -------------- An HTML attachment was scrubbed... URL: From eric at trueblade.com Fri May 4 17:13:48 2012 From: eric at trueblade.com (Eric V. Smith) Date: Fri, 04 May 2012 11:13:48 -0400 Subject: [Import-SIG] PEP 420 sprint report Message-ID: <4FA3F22C.9090003@trueblade.com> Yesterday Jason Coombs, Barry Warsaw, and I met for about 6 hours of sprinting on PEP 420. We added a test framework and added tests for namespace packages using the filesystem loader and zipimport loader. We flushed out a bug in zipimport's namespace finder support as part of this. We identified the following issues which need to get resolved before the PEP is ruled on: 1. What about __file__? Barry is currently discussing this in the other thread. 2: Parent path modification detection. I'm still thinking this one over. I'm going to look into whipping up a sample implementation. I think these can all be resolved this weekend, so we'll ask that a ruling be made on the PEP next week. Please let me know if you have other PEP (not implementation) concerns. There are also these quality of implementation issues that I don't think need to get addressed before PEP 420 is ruled on: 1. Documentation. 2. More tests. We need to test namespace packages as sub-packages, not just top level. 3. The zipimport finder currently looks for "path/" to detect if a 'directory' exists and could be a namespace portion. However, this is a valid zip file: Archive: namespace_pkgs/missing_directory.zip Length Date Time Name --------- ---------- ----- ---- 0 2012-05-04 04:45 bar/ 35 2012-05-04 04:45 bar/two.py 26 2012-05-04 04:45 foo/one.py --------- ------- 61 3 files The current code will treat "bar" as a possible portion, but not "foo". We discussed a number of ways to address this, but I'm unconvinced they're worth the hassle and runtime expense. But in any event, it's an issue for another day and doesn't affect the PEP's acceptance one way or the other. All of the code is checked in to features/pep-420. Eric. From ncoghlan at gmail.com Fri May 4 17:14:13 2012 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sat, 5 May 2012 01:14:13 +1000 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages In-Reply-To: <20120504103450.58286b0c@limelight.wooz.org> References: <4F90730D.1040808@trueblade.com> <4FA05CFC.6050609@trueblade.com> <4FA0DF6C.4090709@v.loewis.de> <4FA10B15.1000302@trueblade.com> <4FA16DC5.1000204@trueblade.com> <4FA1D833.20208@trueblade.com> <20120502212355.6bda4cd4@resist.wooz.org>

<20120504103450.58286b0c@limelight.wooz.org> Message-ID: On Sat, May 5, 2012 at 12:34 AM, Barry Warsaw wrote: > ?* Explicitly leave the assignment of __file__ to the loader. > ?* Allow loaders to not set __file__ > ?* Add an optional API to loaders, module_repr() as defined above. I can accept that approach on one condition: the PEP 420 implementation comes with the long-overdue migration of the definition of the import system semantics into the language reference. The main sticking point preventing that in the past has been that nobody wanted to document all the caveats and special cases needed to accurately describe CPython's behaviour. For 3.3+, no such caveats are necessary, since Brett's importlib efforts mean that even the default import system follows the rules. The proposed update will require changes to the description of the import semantics, anyway, so rather than making those changes directly in PEP 302, it would be better to document them in the language reference and update PEP 302 with a note to say that, for 3.3+, it is no longer the authoritative source. Cheers, Nick. -- Nick Coghlan?? |?? ncoghlan at gmail.com?? |?? Brisbane, Australia From p.f.moore at gmail.com Fri May 4 17:16:14 2012 From: p.f.moore at gmail.com (Paul Moore) Date: Fri, 4 May 2012 16:16:14 +0100 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages In-Reply-To: <20120504105149.472a2f61@limelight.wooz.org> References: <4F90730D.1040808@trueblade.com> <4FA05CFC.6050609@trueblade.com> <4FA0DF6C.4090709@v.loewis.de> <4FA10B15.1000302@trueblade.com> <4FA16DC5.1000204@trueblade.com> <4FA1D833.20208@trueblade.com> <20120502212355.6bda4cd4@resist.wooz.org>

<20120503121541.6b5ff385@resist.wooz.org> <20120504021102.Horde.4iA2c9jz9kRPox6WHie3KUA@webmail.df.eu> <20120504105149.472a2f61@limelight.wooz.org> Message-ID: On 4 May 2012 15:51, Barry Warsaw wrote: > On May 04, 2012, at 02:11 AM, martin at v.loewis.de wrote: > >>I think it predates PEP 302 by a decade or so. You might also ask why >>the keyword is "def", and not "define" (other than that the Grammar says >>so). It's a natural thing, also: If the module comes from the file system, >>it has an __file__ attribute, else it's built-in. > > Sure, that makes sense in a 2002 world where we didn't have importlib and all > the modernization of the import system. ?Today, it's not only antiquated, it's > also not necessarily true. ?We're already significantly overhauling the import > machinery, so I think it's entirely reasonable to relax this constraint. When we wrote PEP 302, so much code assumed that modules lived in the filesystem that we had very little room for manoeuvre, One of the goals of PEP 302 (in my mind, at least) was to disrupt the mindset that assumed this. Now, Brett's implementation of importlib has made that a reality - code that assumes modules live in a filesystem should have a really good justification for doing so (and document the limitation, ideally). I suspect you'll still break a reasonable amount of code like this, but that's probably OK, as it's less of a breakage, and more of a case of the existing code not anticipating cases that never existed before. > See my previous post for a proposal. +1 and I'd also explicitly allow for loaders to assign other "private" metadata as well as __file__, if only to avoid the spectre of __file__ being a base64-encoded pickled object :-) I wonder whether treating repr specially is the best way, though - maybe have a loader method "code_location" which is defined as being a human-readable, but otherwise unspecified string. The key use case is for repr, but it might be useful elsewhere (IDE tooltips or some such usage spring to mind). Paul. From eric at trueblade.com Fri May 4 17:17:13 2012 From: eric at trueblade.com (Eric V. Smith) Date: Fri, 04 May 2012 11:17:13 -0400 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages In-Reply-To: References: <4F90730D.1040808@trueblade.com> <4FA05CFC.6050609@trueblade.com> <4FA0DF6C.4090709@v.loewis.de> <4FA10B15.1000302@trueblade.com> <4FA16DC5.1000204@trueblade.com> <4FA1D833.20208@trueblade.com> <20120502212355.6bda4cd4@resist.wooz.org>

<20120504103450.58286b0c@limelight.wooz.org> Message-ID: <4FA3F2F9.8020001@trueblade.com> On 05/04/2012 11:14 AM, Nick Coghlan wrote: > On Sat, May 5, 2012 at 12:34 AM, Barry Warsaw wrote: >> * Explicitly leave the assignment of __file__ to the loader. >> * Allow loaders to not set __file__ >> * Add an optional API to loaders, module_repr() as defined above. > > I can accept that approach on one condition: the PEP 420 > implementation comes with the long-overdue migration of the definition > of the import system semantics into the language reference. > > The main sticking point preventing that in the past has been that > nobody wanted to document all the caveats and special cases needed to > accurately describe CPython's behaviour. For 3.3+, no such caveats are > necessary, since Brett's importlib efforts mean that even the default > import system follows the rules. > > The proposed update will require changes to the description of the > import semantics, anyway, so rather than making those changes directly > in PEP 302, it would be better to document them in the language > reference and update PEP 302 with a note to say that, for 3.3+, it is > no longer the authoritative source. We did discuss this yesterday at the sprint. I'm all for it, and I think the others were, too. I'm not keen on tying all of this to PEP 420 acceptance or rejection, but it's not the end of the world. Eric. From p.f.moore at gmail.com Fri May 4 17:23:37 2012 From: p.f.moore at gmail.com (Paul Moore) Date: Fri, 4 May 2012 16:23:37 +0100 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages In-Reply-To: References: <4F90730D.1040808@trueblade.com> <4FA05CFC.6050609@trueblade.com> <4FA0DF6C.4090709@v.loewis.de> <4FA10B15.1000302@trueblade.com> <4FA16DC5.1000204@trueblade.com> <4FA1D833.20208@trueblade.com> <20120502212355.6bda4cd4@resist.wooz.org>

<20120504103450.58286b0c@limelight.wooz.org> Message-ID: On 4 May 2012 16:14, Nick Coghlan wrote: > On Sat, May 5, 2012 at 12:34 AM, Barry Warsaw wrote: >> ?* Explicitly leave the assignment of __file__ to the loader. >> ?* Allow loaders to not set __file__ >> ?* Add an optional API to loaders, module_repr() as defined above. > > I can accept that approach on one condition: the PEP 420 > implementation comes with the long-overdue migration of the definition > of the import system semantics into the language reference. That would be a *very* good idea. Whether PEP 420 should be held hostage to this, I don't know, but I think it should be targeted as a key item for 3.3. Just having a reference to what the language actually guarantees would be immensely useful. I did actually try to do this once, but my head exploded :-) (I'd be willing to help out with it, but I don't know where it would fit in the docs - could anyone suggest a basic location and structure, and I could try to write some words to go into it?) On a somewhat related note, does anyone know how well oddities like jython's ability to import Java classes (and IronPython for .Net classes) fit any such rules? Paul. From fwierzbicki at gmail.com Fri May 4 18:00:52 2012 From: fwierzbicki at gmail.com (fwierzbicki at gmail.com) Date: Fri, 4 May 2012 09:00:52 -0700 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages In-Reply-To: <20120504103450.58286b0c@limelight.wooz.org> References: <4F90730D.1040808@trueblade.com> <4FA05CFC.6050609@trueblade.com> <4FA0DF6C.4090709@v.loewis.de> <4FA10B15.1000302@trueblade.com> <4FA16DC5.1000204@trueblade.com> <4FA1D833.20208@trueblade.com> <20120502212355.6bda4cd4@resist.wooz.org>

<20120504103450.58286b0c@limelight.wooz.org> Message-ID: On Fri, May 4, 2012 at 7:34 AM, Barry Warsaw wrote: > It's also not true even today. ?At our PEP 420 sprint we noticed importlib > does something like this to create new modules: > > ? ?>>> type(sys)('foo') > > That module isn't a built-in and doesn't have an __file__. ?It also > doesn't have an __loader__, but oh well. > > (BTW, Brett, that's pretty clever. :) Too clever for Jython at them moment :) -- which leads me to ask: Should I consider this a a feature of the sys module? It doesn't look too hard to do, and I really want importlib to work when Jython starts on Jython3 (I'm hoping to seriously start that this summer - Jython 2.7 is progressing well). -Frank From brett at python.org Fri May 4 18:21:36 2012 From: brett at python.org (Brett Cannon) Date: Fri, 4 May 2012 12:21:36 -0400 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages In-Reply-To: References: <4F90730D.1040808@trueblade.com> <4FA05CFC.6050609@trueblade.com> <4FA0DF6C.4090709@v.loewis.de> <4FA10B15.1000302@trueblade.com> <4FA16DC5.1000204@trueblade.com> <4FA1D833.20208@trueblade.com> <20120502212355.6bda4cd4@resist.wooz.org>

<20120504103450.58286b0c@limelight.wooz.org> Message-ID: On Fri, May 4, 2012 at 12:00 PM, fwierzbicki at gmail.com < fwierzbicki at gmail.com> wrote: > On Fri, May 4, 2012 at 7:34 AM, Barry Warsaw wrote: > > It's also not true even today. At our PEP 420 sprint we noticed > importlib > > does something like this to create new modules: > > > > >>> type(sys)('foo') > > > > That module isn't a built-in and doesn't have an __file__. It also > > doesn't have an __loader__, but oh well. > > > > (BTW, Brett, that's pretty clever. :) > Too clever for Jython at them moment :) -- which leads me to ask: > Should I consider this a a feature of the sys module? No, this is an ability of types.ModuleType (which I don't have access to in importlib, so I just inlined the call). This works for any module in CPython. > It doesn't look > too hard to do, and I really want importlib to work when Jython starts > on Jython3 (I'm hoping to seriously start that this summer - Jython > 2.7 is progressing well). > I've actually been meaning to email the various VMs to have them look over importlib to see if there are any sticking points that are obvious so we can fix them now instead of waiting until a point release when the first VM other than CPython tries to use importlib. -------------- next part -------------- An HTML attachment was scrubbed... URL: From fwierzbicki at gmail.com Fri May 4 18:28:55 2012 From: fwierzbicki at gmail.com (fwierzbicki at gmail.com) Date: Fri, 4 May 2012 09:28:55 -0700 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages In-Reply-To: References: <4F90730D.1040808@trueblade.com> <4FA05CFC.6050609@trueblade.com> <4FA0DF6C.4090709@v.loewis.de> <4FA10B15.1000302@trueblade.com> <4FA16DC5.1000204@trueblade.com> <4FA1D833.20208@trueblade.com> <20120502212355.6bda4cd4@resist.wooz.org>

<20120504103450.58286b0c@limelight.wooz.org> Message-ID: On Fri, May 4, 2012 at 9:21 AM, Brett Cannon wrote: >> Too clever for Jython at them moment :) -- which leads me to ask: >> Should I consider this a a feature of the sys module? > > > No, this is an ability of types.ModuleType (which I don't have access to in > importlib, so I just inlined the call). This works for any module in > CPython. Ah of course, and our ModuleType works just fine for this. The Jython sys module is fake sadly. Perhaps 3.x will be the time to finally make it a real module... it's been a fake module with a comment at the top to make it a real module for longer than I've been involved. BTW any real module works for us, for example: >>> type(os)('foo') -Frank From pje at telecommunity.com Fri May 4 18:50:11 2012 From: pje at telecommunity.com (PJ Eby) Date: Fri, 4 May 2012 12:50:11 -0400 Subject: [Import-SIG] PEP 420 sprint report In-Reply-To: <4FA3F22C.9090003@trueblade.com> References: <4FA3F22C.9090003@trueblade.com> Message-ID: On Fri, May 4, 2012 at 11:13 AM, Eric V. Smith wrote: > 3. The zipimport finder currently looks for "path/" to detect if a > 'directory' exists and could be a namespace portion. However, this is a > valid zip file: > Archive: namespace_pkgs/missing_directory.zip > Length Date Time Name > --------- ---------- ----- ---- > 0 2012-05-04 04:45 bar/ > 35 2012-05-04 04:45 bar/two.py > 26 2012-05-04 04:45 foo/one.py > --------- ------- > 61 3 files > The current code will treat "bar" as a possible portion, but not "foo". > We discussed a number of ways to address this, but I'm unconvinced > they're worth the hassle and runtime expense. But in any event, it's an > issue for another day and doesn't affect the PEP's acceptance one way or > the other. > FYI, the zip files produced by distutils do not include the empty directory. Actually, I'm not sure when/where I've ever seen an empty directory listed in a zipfile. IMO, the no-explicit-directory case should be handled, if for no other reason than that it shouldn't randomly break depending on which archiving tool you used to create the zipfile with. -------------- next part -------------- An HTML attachment was scrubbed... URL: From eric at trueblade.com Fri May 4 18:57:28 2012 From: eric at trueblade.com (Eric V. Smith) Date: Fri, 04 May 2012 12:57:28 -0400 Subject: [Import-SIG] PEP 420 sprint report In-Reply-To: References: <4FA3F22C.9090003@trueblade.com> Message-ID: <4FA40A78.2020806@trueblade.com> On 05/04/2012 12:50 PM, PJ Eby wrote: > On Fri, May 4, 2012 at 11:13 AM, Eric V. Smith > wrote: > > 3. The zipimport finder currently looks for "path/" to detect if a > 'directory' exists and could be a namespace portion. However, this is a > valid zip file: > Archive: namespace_pkgs/missing_directory.zip > Length Date Time Name > --------- ---------- ----- ---- > 0 2012-05-04 04:45 bar/ > 35 2012-05-04 04:45 bar/two.py > 26 2012-05-04 04:45 foo/one.py > --------- ------- > 61 3 files > The current code will treat "bar" as a possible portion, but not "foo". > We discussed a number of ways to address this, but I'm unconvinced > they're worth the hassle and runtime expense. But in any event, it's an > issue for another day and doesn't affect the PEP's acceptance one way or > the other. > > > FYI, the zip files produced by distutils do not include the empty > directory. Actually, I'm not sure when/where I've ever seen an empty > directory listed in a zipfile. Interesting, thanks for the info. They are created if you use "zip -r" from a Linux box and it recurses into the directory. But it's definitely possible to create them without the empty directory if you explicitly list the files, or of course you can just delete them after the fact (which is what I did here). > IMO, the no-explicit-directory case should be handled, if for no other > reason than that it shouldn't randomly break depending on which > archiving tool you used to create the zipfile with. I agree. It's just that I'm not likely to get to it in the next few weeks. Hopefully I'll delay long enough that someone smarter than me will rewrite zipimport in Python (http://bugs.python.org/issue14678?@ok_message=issue 14678). I started with Python so I wouldn't have to write any more C! Eric. From martin at v.loewis.de Fri May 4 19:00:01 2012 From: martin at v.loewis.de (martin at v.loewis.de) Date: Fri, 04 May 2012 19:00:01 +0200 Subject: [Import-SIG] PEP 420 sprint report In-Reply-To: References: <4FA3F22C.9090003@trueblade.com> Message-ID: <20120504190001.Horde.G7ZKSML8999PpAsRP9YVMTA@webmail.df.eu> > IMO, the no-explicit-directory case should be handled, if for no other > reason than that it shouldn't randomly break depending on which archiving > tool you used to create the zipfile with. I agree. IIRC, the zip importer creates a cached list/dictionary of the zip directory, anyway; while doing so, it could easily synthesize the directory names. Regards, Martin From eric at trueblade.com Fri May 4 19:07:42 2012 From: eric at trueblade.com (Eric V. Smith) Date: Fri, 04 May 2012 13:07:42 -0400 Subject: [Import-SIG] PEP 420 sprint report In-Reply-To: <20120504190001.Horde.G7ZKSML8999PpAsRP9YVMTA@webmail.df.eu> References: <4FA3F22C.9090003@trueblade.com> <20120504190001.Horde.G7ZKSML8999PpAsRP9YVMTA@webmail.df.eu> Message-ID: <4FA40CDE.3080207@trueblade.com> On 05/04/2012 01:00 PM, martin at v.loewis.de wrote: >> IMO, the no-explicit-directory case should be handled, if for no other >> reason than that it shouldn't randomly break depending on which archiving >> tool you used to create the zipfile with. > > I agree. IIRC, the zip importer creates a cached list/dictionary of the > zip directory, anyway; while doing so, it could easily synthesize the > directory names. Correct. It builds a dictionary. It could create another dictionary (or set is all I really need) with all directories, found or synthesized. Eric. From brett at python.org Fri May 4 19:32:53 2012 From: brett at python.org (Brett Cannon) Date: Fri, 4 May 2012 13:32:53 -0400 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages In-Reply-To: References: <4F90730D.1040808@trueblade.com> <4FA05CFC.6050609@trueblade.com> <4FA0DF6C.4090709@v.loewis.de> <4FA10B15.1000302@trueblade.com> <4FA16DC5.1000204@trueblade.com> <4FA1D833.20208@trueblade.com> <20120502212355.6bda4cd4@resist.wooz.org>

<20120504103450.58286b0c@limelight.wooz.org> Message-ID: On Fri, May 4, 2012 at 12:28 PM, fwierzbicki at gmail.com < fwierzbicki at gmail.com> wrote: > On Fri, May 4, 2012 at 9:21 AM, Brett Cannon wrote: > >> Too clever for Jython at them moment :) -- which leads me to ask: > >> Should I consider this a a feature of the sys module? > > > > > > No, this is an ability of types.ModuleType (which I don't have access to > in > > importlib, so I just inlined the call). This works for any module in > > CPython. > Ah of course, and our ModuleType works just fine for this. The Jython > sys module is fake sadly. Perhaps 3.x will be the time to finally make > it a real module... it's been a fake module with a comment at the top > to make it a real module for longer than I've been involved. > > BTW any real module works for us, for example: > > >>> type(os)('foo') > > OK, so of the CPython built-in modules that importlib uses (sys, _imp, _warnings, _io, marshal, builtins, posix/nt), which are an actual module in Jython? -------------- next part -------------- An HTML attachment was scrubbed... URL: From barry at python.org Fri May 4 21:07:37 2012 From: barry at python.org (Barry Warsaw) Date: Fri, 4 May 2012 15:07:37 -0400 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages In-Reply-To: References: <4F90730D.1040808@trueblade.com> <4FA05CFC.6050609@trueblade.com> <4FA0DF6C.4090709@v.loewis.de> <4FA10B15.1000302@trueblade.com> <4FA16DC5.1000204@trueblade.com> <4FA1D833.20208@trueblade.com> <20120502212355.6bda4cd4@resist.wooz.org>

<20120504103450.58286b0c@limelight.wooz.org> Message-ID: <20120504150737.7a5131ab@resist.wooz.org> On May 05, 2012, at 01:14 AM, Nick Coghlan wrote: >On Sat, May 5, 2012 at 12:34 AM, Barry Warsaw wrote: >> ?* Explicitly leave the assignment of __file__ to the loader. >> ?* Allow loaders to not set __file__ >> ?* Add an optional API to loaders, module_repr() as defined above. > >I can accept that approach on one condition: the PEP 420 >implementation comes with the long-overdue migration of the definition >of the import system semantics into the language reference. I think you were listening in our sprint Nick! :) One of the downsides of the PEP process is that sometimes the PEP will end up being the definitive documentation for a new feature. This sucks for many reasons, including that PEPs don't live in the source tree and they end up getting pretty out-of-date as time goes by. PEP 302 suffers quite a bit from historical rot, but also from lots of superfluous text that doesn't make it easy to understand exactly what is going on. At our sprint, we all agreed that it would be much better for there to be documentation about the import system's semantics in the language reference guide. I think "Import System" is important enough to warrant a top-level chapter, probably either before or after "Execution Model". Section 6.11 describes the import statement, but I'd probably refactor large bits of that into the "Import System" chapter, and leave $6.11 to describe the import statement specifically. I mentioned at the sprint that I'd be willing to work on such a document. It's likely more than a one-person-operation, but I'd be happy to take a crack at a first draft once PEP 420 gets accepted. Cheers, -Barry -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 836 bytes Desc: not available URL: From barry at python.org Fri May 4 21:11:05 2012 From: barry at python.org (Barry Warsaw) Date: Fri, 4 May 2012 15:11:05 -0400 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages In-Reply-To: References: <4F90730D.1040808@trueblade.com> <4FA05CFC.6050609@trueblade.com> <4FA0DF6C.4090709@v.loewis.de> <4FA10B15.1000302@trueblade.com> <4FA16DC5.1000204@trueblade.com> <4FA1D833.20208@trueblade.com> <20120502212355.6bda4cd4@resist.wooz.org>

<20120504103450.58286b0c@limelight.wooz.org> Message-ID: <20120504151105.3d953080@resist.wooz.org> On May 04, 2012, at 10:56 AM, PJ Eby wrote: >On May 4, 2012 10:34 AM, "Barry Warsaw" wrote: >> Specifically, my proposed elaboration on PEP 420 is this: >> >> * Explicitly leave the assignment of __file__ to the loader. >> * Allow loaders to not set __file__ >> * Add an optional API to loaders, module_repr() as defined above. > >+1 on all the above, plus getting rid of __file__ for namespace packages. >Seems like an elegant solution to the problems involved, and allows DB or >other importers to make their own attributes like __dsn__ or __url__, but >still have a decent repr. Yes, exactly. It seems like there's general consensus about the basic proposal; I'll update the PEP so Guido has specific language to pronounce on. I want to make one change to what I posted. If m.__loader__.module_repr() exists, I want to give it a first crack at producing the repr. This means that __file__ is used as a fallback, not as the first step. Cheers, -Barry -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 836 bytes Desc: not available URL: From fwierzbicki at gmail.com Fri May 4 21:44:29 2012 From: fwierzbicki at gmail.com (fwierzbicki at gmail.com) Date: Fri, 4 May 2012 12:44:29 -0700 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages In-Reply-To: References: <4F90730D.1040808@trueblade.com> <4FA05CFC.6050609@trueblade.com> <4FA0DF6C.4090709@v.loewis.de> <4FA10B15.1000302@trueblade.com> <4FA16DC5.1000204@trueblade.com> <4FA1D833.20208@trueblade.com> <20120502212355.6bda4cd4@resist.wooz.org>

<20120504103450.58286b0c@limelight.wooz.org>

Message-ID: Sorry for the dup Brett - I still mess up on the new gmail interface sometimes :( On Fri, May 4, 2012 at 10:32 AM, Brett Cannon wrote: > OK, so of the CPython built-in modules that importlib uses (sys, _imp, > _warnings, _io, marshal, builtins, posix/nt), which are an actual module in > Jython? I'll start with the bad: builtins would be hard to turn into a module - however __builtin__ is a module and works well. nt is not likely to get implemented, we pretend nt is a posix with missing bits. The ok: posix is not a currently a true module, but can probably be turned into one without too much trouble -- I will need to investigate. _imp is not exposed as a module, but I think this will be a necessary and acceptable step to integrate with importlib (and I don't think it should be too hard given the benefits). The good: marshal and _io are already true modules. _warnings will be when I get around to implementing it - probably next week :) -- if I run out of time it may end up just being the same as the python version (but that will still make it a true module). -Frank From barry at python.org Fri May 4 21:52:58 2012 From: barry at python.org (Barry Warsaw) Date: Fri, 4 May 2012 15:52:58 -0400 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages In-Reply-To: References: <4F90730D.1040808@trueblade.com> <4FA05CFC.6050609@trueblade.com> <4FA0DF6C.4090709@v.loewis.de> <4FA10B15.1000302@trueblade.com> <4FA16DC5.1000204@trueblade.com> <4FA1D833.20208@trueblade.com> <20120502212355.6bda4cd4@resist.wooz.org>

<20120503121541.6b5ff385@resist.wooz.org> <20120504021102.Horde.4iA2c9jz9kRPox6WHie3KUA@webmail.df.eu> <20120504105149.472a2f61@limelight.wooz.org> Message-ID: <20120504155258.45ea89aa@resist.wooz.org> On May 04, 2012, at 04:16 PM, Paul Moore wrote: >+1 and I'd also explicitly allow for loaders to assign other "private" >metadata as well as __file__, if only to avoid the spectre of __file__ >being a base64-encoded pickled object :-) That's in PEP 420 now too. >I wonder whether treating repr specially is the best way, though - >maybe have a loader method "code_location" which is defined as being a >human-readable, but otherwise unspecified string. The key use case is >for repr, but it might be useful elsewhere (IDE tooltips or some such >usage spring to mind). Maybe, but I think this is the simplest thing possible, which solves an existing use case. :) -Barry -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 836 bytes Desc: not available URL: From barry at python.org Fri May 4 21:56:51 2012 From: barry at python.org (Barry Warsaw) Date: Fri, 4 May 2012 15:56:51 -0400 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages In-Reply-To: <4FA3F2F9.8020001@trueblade.com> References: <4F90730D.1040808@trueblade.com> <4FA05CFC.6050609@trueblade.com> <4FA0DF6C.4090709@v.loewis.de> <4FA10B15.1000302@trueblade.com> <4FA16DC5.1000204@trueblade.com> <4FA1D833.20208@trueblade.com> <20120502212355.6bda4cd4@resist.wooz.org>

<20120504103450.58286b0c@limelight.wooz.org> <4FA3F2F9.8020001@trueblade.com> Message-ID: <20120504155651.1f661364@resist.wooz.org> On May 04, 2012, at 11:17 AM, Eric V. Smith wrote: >I'm not keen on tying all of this to PEP 420 acceptance or rejection, >but it's not the end of the world. I think the PEP should be pronounced on before the documentation is written. If Guido wants to make changes to the spec, it's better not to waste effort. Are there any more open issues? Are we ready to ask Guido to pronounce? I think the feature branch is in pretty good shape, but we can delay merging it to the main trunk (assuming the PEP gets accepted) until we have more tests and a first draft of the import semantics documentation. I don't mind working in the feature branch for a little while longer. Cheers, -Barry From pje at telecommunity.com Fri May 4 23:02:16 2012 From: pje at telecommunity.com (PJ Eby) Date: Fri, 4 May 2012 17:02:16 -0400 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages In-Reply-To: <20120504155651.1f661364@resist.wooz.org> References: <4F90730D.1040808@trueblade.com> <4FA05CFC.6050609@trueblade.com> <4FA0DF6C.4090709@v.loewis.de> <4FA10B15.1000302@trueblade.com> <4FA16DC5.1000204@trueblade.com> <4FA1D833.20208@trueblade.com> <20120502212355.6bda4cd4@resist.wooz.org>

<20120504103450.58286b0c@limelight.wooz.org> <4FA3F2F9.8020001@trueblade.com> <20120504155651.1f661364@resist.wooz.org> Message-ID: On Fri, May 4, 2012 at 3:56 PM, Barry Warsaw wrote: > Are there any more open issues? Maybe not on this particular subproposal, but IIUC, Eric was still looking at the feasibility of doing auto-updates when parent paths change. (Unless I'm mistaken, my sketch for PEP 402 should only need a bit of hacking to allow setting the initial calculated path, so that there's not an extra scan when a namespace package is initialized, and a change to make it use find_module() instead of PEP 402's get_subpath(). Well, that, and renaming "virtual packages" back to "namespace packages" in the error messages and such.) -------------- next part -------------- An HTML attachment was scrubbed... URL: From eric at trueblade.com Fri May 4 23:13:47 2012 From: eric at trueblade.com (Eric V. Smith) Date: Fri, 04 May 2012 17:13:47 -0400 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages In-Reply-To: References: <4F90730D.1040808@trueblade.com> <4FA05CFC.6050609@trueblade.com> <4FA0DF6C.4090709@v.loewis.de> <4FA10B15.1000302@trueblade.com> <4FA16DC5.1000204@trueblade.com> <4FA1D833.20208@trueblade.com> <20120502212355.6bda4cd4@resist.wooz.org>

<20120504103450.58286b0c@limelight.wooz.org> <4FA3F2F9.8020001@trueblade.com> <20120504155651.1f661364@resist.wooz.org> Message-ID: <4FA4468B.7040105@trueblade.com> On 5/4/2012 5:02 PM, PJ Eby wrote: > On Fri, May 4, 2012 at 3:56 PM, Barry Warsaw > wrote: > > Are there any more open issues? > > > Maybe not on this particular subproposal, but IIUC, Eric was still > looking at the feasibility of doing auto-updates when parent paths change. > > (Unless I'm mistaken, my sketch for PEP 402 should only need a bit of > hacking to allow setting the initial calculated path, so that there's > not an extra scan when a namespace package is initialized, and a change > to make it use find_module() instead of PEP 402's get_subpath(). Well, > that, and renaming "virtual packages" back to "namespace packages" in > the error messages and such.) I'm looking at it and have it mostly implemented for PEP 420. I still need to refactor out some code so I can re-use the path-building code that's currently in PathFinder.find_module. It looks simple enough. Eric. From solipsis at pitrou.net Sat May 5 00:47:11 2012 From: solipsis at pitrou.net (Antoine Pitrou) Date: Sat, 5 May 2012 00:47:11 +0200 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages References: <4F90730D.1040808@trueblade.com> Message-ID: <20120505004711.2140afbf@pitrou.net> Hello, On Thu, 19 Apr 2012 16:18:21 -0400 "Eric V. Smith" wrote: > This reflects (I hope!) the discussions at PyCon. My plan is to produce > an implementation based on the importlib code, and then flush out pieces > of the PEP. I don't understand why PEP 382 was rejected. There doesn't seem to be any obvious argument against it. The mechanism is simple, explicit and unambiguous. As PEP 382 points out: ?At the discussion at PyCon DE 2011, people remarked that having an explicit declaration of a directory as contributing to a package is a desirable property, rather than an obstactle. In particular, Jython developers noticed that Jython could easily mistake a directory that is a Java package as being a Python package, if there is no need to declare Python packages.? The "directory.pyp" scheme is highly unlikely to conflict with unrelated uses of a ".pyp" directory extension. It's also easy to use, and avoids oddities in the lookup algorithm such as ?if the scan completes without returning a module or package, and at least one directory was recorded, then a namespace package is created?. On the other hand, PEP 420 provides potential for confusion (for example, if the standard "test" package is not installed, trying to import it could end up importing some other arbitrary "test" directory on the path as a namespace package), without seeming to have any obvious advantage over PEP 382. Unless there are clear advantages over PEP 382, I'm -1 on this PEP, and would like to see PEP 382 revived. Regards Antoine. From ncoghlan at gmail.com Sat May 5 08:27:26 2012 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sat, 5 May 2012 16:27:26 +1000 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages In-Reply-To: <20120505004711.2140afbf@pitrou.net> References: <4F90730D.1040808@trueblade.com> <20120505004711.2140afbf@pitrou.net> Message-ID: On Sat, May 5, 2012 at 8:47 AM, Antoine Pitrou wrote: > Unless there are clear advantages over PEP 382, I'm -1 on this PEP, and > would like to see PEP 382 revived. I raised this question as well, and the PEP as written doesn't do a great job of summarising the thread that addressed it. There were two counterpoints raised that I found compelling: A. Guido simply doesn't like directory extensions. I have to agree with him that using them to handle packaging would be a weird and unusual approach, and, well, he *does* get to play the BDFL card in cases like this. B. Current version control systems are still pretty abysmal when it comes to coping with directory renames, and we want to avoid unnecessary stumbling blocks on the migration path from the current pkgutil.extend_path() based namespace packages to the new native system. With PEP 382, the migration path is: 1. delete all __init__.py files from namespace package portions 2. rename the directories for all namespace package portions to append the ".pyp" extension With PEP 420, the migration path is: 1. delete all __init__.py files from namespace package portions 2. there is no step 2 The extra step required by the PEP 382 approach is exactly the kind of pointless revision history noise that PEP 414's reintroduction of explicit Unicode literals is designed to eliminate from Python 2 to Python 3 migrations. Between "Guido doesn't like directory suffixes" and "version control systems are still fairly bad at handling directory renames", I changed my own opinion on PEP 420 from -1 to +0. If we'd been starting from a clean slate with no language history or migration of existing projects to account for, then my opinion would be different, but given where we are today, I find the pragmatic argument in favour of simply losing the explicit markers compelling. Cheers, Nick. -- Nick Coghlan?? |?? ncoghlan at gmail.com?? |?? Brisbane, Australia From martin at v.loewis.de Sat May 5 09:18:13 2012 From: martin at v.loewis.de (martin at v.loewis.de) Date: Sat, 05 May 2012 09:18:13 +0200 Subject: [Import-SIG] PEP 420 issue: standard namespace packages Message-ID: <20120505091813.Horde.angpY8L8999PpNQ1Vz8hCnA@webmail.df.eu> I'd like the PEP to rule that the standard library may designate some of its packages as namespace packages, and also specifically declare the encodings package as a namespace package. This would allow to install additional encodings just by mere installation, without the need of having a search function registered at startup. Not sure what other packages would be candidates for namespace packages. Regards, Martin From solipsis at pitrou.net Sat May 5 12:33:03 2012 From: solipsis at pitrou.net (Antoine Pitrou) Date: Sat, 5 May 2012 12:33:03 +0200 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages In-Reply-To: References: <4F90730D.1040808@trueblade.com> <20120505004711.2140afbf@pitrou.net> Message-ID: <20120505123303.29c3f4bb@pitrou.net> On Sat, 5 May 2012 16:27:26 +1000 Nick Coghlan wrote: > On Sat, May 5, 2012 at 8:47 AM, Antoine Pitrou wrote: > > Unless there are clear advantages over PEP 382, I'm -1 on this PEP, and > > would like to see PEP 382 revived. > > I raised this question as well, and the PEP as written doesn't do a > great job of summarising the thread that addressed it. > > There were two counterpoints raised that I found compelling: > > A. Guido simply doesn't like directory extensions. I have to agree > with him that using them to handle packaging would be a weird and > unusual approach, and, well, he *does* get to play the BDFL card in > cases like this. Well, I agree that "foo.pyp" isn't very pretty, but that's a pretty minor argument. At least it's explicit. (of course, another marker could have been chosen: for example an empty "foo/__namespace__.py", or whatever else floats our boat of aesthetics) > B. Current version control systems are still pretty abysmal when it > comes to coping with directory renames, and we want to avoid > unnecessary stumbling blocks on the migration path from the current > pkgutil.extend_path() based namespace packages to the new native > system. Isn't that baseless? AFAIU all modern DVCS should cope correctly with a directory rename. Even SVN may be ok. If anything, I'd like to see data points about these "current version control systems" being "pretty abysmal [!] when it comes to coping with directory renames". (preferably something else than a 2007 rant by Mark Shuttleworth in order to justify bzr's existence :-)) > The extra step required by the PEP 382 approach is exactly the kind of > pointless revision history noise that PEP 414's reintroduction of > explicit Unicode literals is designed to eliminate from Python 2 to > Python 3 migrations. Except that noone *has* to migrate to namespace packages. These are fairly rare and only useful for a couple of big projects. (I've only heard about Zope using them; Twisted AFAICT doesn't) Even then, renaming a directory is hardly comparable to the hurdle of migrating unicode literals from Python 2 to Python 3. The analogy sounds melodramatic. > Between "Guido doesn't like directory suffixes" and "version control > systems are still fairly bad at handling directory renames", I changed > my own opinion on PEP 420 from -1 to +0. This doesn't address PEP 420's issues, which will still come to bite us in 10 years: the potential for confusion, the weirdness of the lookup algorithm. > If we'd been starting from a > clean slate with no language history or migration of existing projects > to account for, then my opinion would be different, but given where we > are today, I find the pragmatic argument in favour of simply losing > the explicit markers compelling. The real pragmatic argument would be to avoid creating maintenance and support issues for the future, IMO. Regards Antoine. From ncoghlan at gmail.com Sat May 5 14:12:51 2012 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sat, 5 May 2012 22:12:51 +1000 Subject: [Import-SIG] PEP 420: Implicit Namespace Packages In-Reply-To: <20120505123303.29c3f4bb@pitrou.net> References: <4F90730D.1040808@trueblade.com> <20120505004711.2140afbf@pitrou.net> <20120505123303.29c3f4bb@pitrou.net> Message-ID: