From matti.picus at gmail.com  Sun Sep  1 03:46:47 2019
From: matti.picus at gmail.com (Matti Picus)
Date: Sun, 1 Sep 2019 10:46:47 +0300
Subject: [Numpy-discussion] Allowing Dependabot access to the numpy repo
In-Reply-To: 
References: <51892585-ba6e-94dd-fb73-9d1091231939@gmail.com>
Message-ID: <19a99ca0-fb96-5b08-1838-25452d5e4604@gmail.com>

Discussion has died down; I think the consensus is to use Dependabot. I will
proceed with allowing it access.

Thanks, Matti

On 29/8/19 12:07 pm, Nathaniel Smith wrote:
> AFAICT all these services work by creating branches inside your repo
> and then making a PR from that -- they don't make their own forks.
> (Which makes some sense when you consider they would need tens of
> thousands of forked repos for all the projects they work with.)
>
> I don't think there's any need to worry about giving GitHub Inc. (dba
> Dependabot) write permissions to a GitHub repo, though.
>
> You do maybe want to set up CI so that it doesn't run on these
> branches, since it will also run on the PRs, and running CI twice on
> the same branch is slow and wasteful.
>
> -n
>
> On Thu, Aug 29, 2019, 01:45 Ryan May wrote:
>
>     Hi,
>
>     The answer to why Dependabot needs write permission seems to be that it
>     must be able to work with private repos:
>
>     https://github.com/dependabot/feedback/issues/22
>
>     There doesn't seem to be any way around it... :(
>
>     Ryan
>
>     On Thu, Aug 29, 2019 at 12:04 AM Matti Picus wrote:
>
>         In PR 14378 https://github.com/numpy/numpy/pull/14378 I moved
>         all our python test dependencies to a test_requirements.txt
>         file (for building numpy the only requirement is cython). This
>         is worthwhile since it unifies the different "pip install"
>         commands across the different CI systems we use. Additionally,
>         there are services that monitor the file and will issue a PR
>         if any of those packages have a new release, so we can test
>         out new versions of dependencies in a controlled fashion.
>         Someone suggested Dependabot (thanks Ryan), which turns out to
>         be run by a company bought by GitHub itself.
>
>         When signing up for the service, it asks for permissions:
>         https://pasteboard.co/IuTeWNz.png. The service is in use by
>         other projects like cpython. Does it seem OK to sign up for
>         this service?
>
>         Matti
>
>         _______________________________________________
>         NumPy-Discussion mailing list
>         NumPy-Discussion at python.org
>         https://mail.python.org/mailman/listinfo/numpy-discussion
>
> --
> Ryan May
>
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at python.org
> https://mail.python.org/mailman/listinfo/numpy-discussion

_______________________________________________
NumPy-Discussion mailing list
NumPy-Discussion at python.org
https://mail.python.org/mailman/listinfo/numpy-discussion

From einstein.edison at gmail.com  Mon Sep  2 05:15:15 2019
From: einstein.edison at gmail.com (Hameer Abbasi)
Date: Mon, 2 Sep 2019 09:15:15 +0000
Subject: [Numpy-discussion] =?utf-8?q?NEP_31_=E2=80=94_Context-local_and_?=
 =?utf-8?q?global_overrides_of_the_NumPy_API?=
Message-ID: 

Hello all,

It was recently brought to my attention that my mails to NumPy-discussion
were probably going into the spam folder for many people, so here I am
trying from another email. Probably Google trying to force people onto
their products as usual.

Me, Ralf Gommers and Peter Bell (both cc'd) have come up with a proposal
on how to solve the array creation and duck array problems.
The solution is outlined in NEP-31, currently in the form of a PR [1],
following the high-level discussion in NEP-22 [2]. It would be nice to get
some feedback.

Full-text of the NEP:

============================================================
NEP 31 — Context-local and global overrides of the NumPy API
============================================================

:Author: Hameer Abbasi
:Author: Ralf Gommers
:Author: Peter Bell
:Status: Draft
:Type: Standards Track
:Created: 2019-08-22

Abstract
--------

This NEP proposes to make all of NumPy's public API overridable via an
extensible backend mechanism, using a library called ``uarray`` `[1]`_.

``uarray`` provides global and context-local overrides, as well as a
dispatch mechanism similar to NEP-18 `[2]`_. First experiences with
``__array_function__`` show that it is necessary to be able to override
NumPy functions that *do not take an array-like argument*, and hence
aren't overridable via ``__array_function__``. The most pressing need is
array creation and coercion functions - see e.g. NEP-30 `[9]`_.

This NEP proposes to allow, in an opt-in fashion, overriding any part of
the NumPy API. It is intended as a comprehensive resolution to NEP-22
`[3]`_, and obviates the need to add an ever-growing list of new protocols
for each new type of function or object that needs to become overridable.

Motivation and Scope
--------------------

The motivation behind ``uarray`` is manifold: First, there have been
several attempts to allow dispatch of parts of the NumPy API, including
(most prominently) the ``__array_ufunc__`` protocol in NEP-13 `[4]`_, and
the ``__array_function__`` protocol in NEP-18 `[2]`_, but this has shown
the need for further protocols to be developed, including a protocol for
coercion (see `[5]`_). The reasons these overrides are needed have been
extensively discussed in the references, and this NEP will not attempt to
go into the details of why these are needed.

Another pain point requiring yet another protocol is the duck-array
protocol (see `[9]`_).

This NEP takes a more holistic approach: It assumes that there are parts
of the API that need to be overridable, and that these will grow over
time. It provides a general framework and a mechanism to avoid designing
a new protocol each time this is required.

This NEP proposes the following: That ``unumpy`` `[8]`_ becomes the
recommended override mechanism for the parts of the NumPy API not yet
covered by ``__array_function__`` or ``__array_ufunc__``, and that
``uarray`` is vendored into a new namespace within NumPy to give users
and downstream dependencies access to these overrides. This vendoring
mechanism is similar to what SciPy decided to do for making ``scipy.fft``
overridable (see `[10]`_).
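To make the two kinds of overrides concrete before the details below, here
is a minimal sketch of a backend participating in both modes. It is
illustrative only: it assumes the public ``uarray`` API (``set_backend``
and ``set_global_backend``) and the standalone ``unumpy`` package, and
``MyBackend`` is an invented stand-in for a real array library::

    import uarray as ua
    import unumpy as np  # the standalone package; ``numpy.overridable`` once vendored

    class MyBackend:
        __ua_domain__ = "numpy"

        @staticmethod
        def __ua_function__(func, args, kwargs):
            # A real backend would compute a result here, or return
            # NotImplemented to defer to the next backend in line.
            return (func.__name__, args, kwargs)

    # Context-local: the override is active only inside the ``with`` block.
    with ua.set_backend(MyBackend):
        print(np.zeros((2, 2)))  # dispatches to MyBackend, not NumPy

    # Global: the override stays active for the rest of the program.
    ua.set_global_backend(MyBackend)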
Detailed description
--------------------

**Note:** *This section will not attempt to explain the specifics or the
mechanism of ``uarray``; that is explained in the ``uarray`` documentation.*
`[1]`_ *However, the NumPy community will have input into the design of
``uarray``, and any backward-incompatible changes will be discussed on the
mailing list.*

The way we propose the overrides will be used by end users is::

    import numpy.overridable as np

    with np.set_backend(backend):
        x = np.asarray(my_array, dtype=dtype)

And a library that implements a NumPy-like API will use it in the
following manner (as an example)::

    import numpy.overridable as np

    _ua_implementations = {}

    __ua_domain__ = "numpy"

    def __ua_function__(func, args, kwargs):
        fn = _ua_implementations.get(func, None)
        return fn(*args, **kwargs) if fn is not None else NotImplemented

    def implements(ua_func):
        def inner(func):
            _ua_implementations[ua_func] = func
            return func
        return inner

    @implements(np.asarray)
    def asarray(a, dtype=None, order=None):
        # Code here
        # Either this method or __ua_convert__ must
        # return NotImplemented for unsupported types,
        # or they shouldn't be marked as dispatchable.

    # Provides a default implementation for ones and zeros.
    @implements(np.full)
    def full(shape, fill_value, dtype=None, order='C'):
        # Code here

The only change this NEP proposes at its acceptance is to make ``unumpy``
the officially recommended way to override NumPy. ``unumpy`` will remain a
separate repository/package for the time being (which we propose to vendor
to avoid a hard dependency, and to use the separate ``unumpy`` package only
if it is installed), and will be developed primarily with the input of
duck-array authors and, secondarily, custom dtype authors, via the usual
GitHub workflow. There are a few reasons for this:

* Faster iteration in the case of bugs or issues.
* Faster design changes, in the case of needed functionality.
* ``unumpy`` will work with older versions of NumPy as well.
* The user and library author opt in to the override process, rather than
  breakages happening when it is least expected. In simple terms, bugs in
  ``unumpy`` mean that ``numpy`` remains unaffected.

Advantages of ``unumpy`` over other solutions
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

``unumpy`` offers a number of advantages over the approach of defining a
new protocol for every problem encountered: Whenever there is something
requiring an override, ``unumpy`` will be able to offer a unified API with
very minor changes. For example:

* ``ufunc`` objects can be overridden via their ``__call__``, ``reduce``
  and other methods.
* ``dtype`` objects can be overridden via the dispatch/backend mechanism,
  going as far as to allow ``np.float32`` et al. to be overridden by
  overriding ``__get__``.
* Other functions can be overridden in a similar fashion.
* ``np.asduckarray`` goes away, and becomes ``np.asarray`` with a backend
  set.
* The same holds for array creation functions such as ``np.zeros``,
  ``np.empty`` and so on.

This also holds for the future: Making something overridable would require
only minor changes to ``unumpy``.

Another promise ``unumpy`` holds is one of default implementations. Default
implementations can be provided for any multimethod, in terms of others.
This allows one to override a large part of the NumPy API by defining only
a small part of it.
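As a hedged sketch of what such a default implementation could look like,
``ones`` can be expressed in terms of the overridable ``np.full``, so that
any backend implementing ``full`` gets ``ones`` for free. The
``create_multimethod`` and ``Dispatchable`` names follow the ``uarray``
documentation referenced above, and the ``default=`` keyword is assumed to
be the spelling for a fallback implementation::

    import numpy
    import uarray as ua
    import numpy.overridable as np  # the proposed namespace; ``unumpy`` today

    def ones_argreplacer(args, kwargs, dispatchables):
        # Re-insert the (possibly converted) dtype into the argument list.
        def ones(shape, dtype=None, order='C'):
            return (shape,), dict(dtype=dispatchables[0], order=order)
        return ones(*args, **kwargs)

    def ones_default(shape, dtype=None, order='C'):
        # Fall back to the overridable full(), which itself dispatches to
        # whichever backend is active.
        return np.full(shape, 1, dtype=dtype, order=order)

    @ua.create_multimethod(ones_argreplacer, domain="numpy", default=ones_default)
    def ones(shape, dtype=None, order='C'):
        return (ua.Dispatchable(dtype, numpy.dtype),)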
Such defaults ease the creation of new duck-arrays, by providing default
implementations of the many functions that can be easily expressed in terms
of others, as well as a repository of utility functions that most duck-array
implementations would require.

The last benefit is a clear way to coerce to a given backend, and a protocol
for coercing not only arrays, but also ``dtype`` objects and ``ufunc``
objects, with similar ones from other libraries. This is due to the
existence of actual, third-party dtype packages, and their desire to blend
into the NumPy ecosystem (see `[6]`_). This is a separate issue compared to
the C-level dtype redesign proposed in `[7]`_; it's about allowing
third-party dtype implementations to work with NumPy, much like third-party
array implementations.

Mixing NumPy and ``unumpy`` in the same file
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

Normally, one would want to import only one of ``unumpy`` or ``numpy``, and
would import it as ``np`` for familiarity. However, there may be situations
where one wishes to mix NumPy and the overrides, and there are a few ways to
do this, depending on the user's style::

    import numpy.overridable as unumpy
    import numpy as np

or::

    import numpy as np

    # Use unumpy via np.overridable

Related Work
------------

Previous override mechanisms
^^^^^^^^^^^^^^^^^^^^^^^^^^^^

* NEP-18, the ``__array_function__`` protocol. `[2]`_
* NEP-13, the ``__array_ufunc__`` protocol. `[4]`_

Existing NumPy-like array implementations
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

* Dask: https://dask.org/
* CuPy: https://cupy.chainer.org/
* PyData/Sparse: https://sparse.pydata.org/
* Xnd: https://xnd.readthedocs.io/
* Astropy's Quantity: https://docs.astropy.org/en/stable/units/

Existing and potential consumers of alternative arrays
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

* Dask: https://dask.org/
* scikit-learn: https://scikit-learn.org/
* Xarray: https://xarray.pydata.org/
* TensorLy: http://tensorly.org/

Existing alternate dtype implementations
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

* ``ndtypes``: https://ndtypes.readthedocs.io/en/latest/
* Datashape: https://datashape.readthedocs.io
* Plum: https://plum-py.readthedocs.io/

Implementation
--------------

The implementation of this NEP will require the following steps:

* Implementation of ``uarray`` multimethods corresponding to the NumPy API,
  including classes for overriding ``dtype``, ``ufunc`` and ``array``
  objects, in the ``unumpy`` repository.
* Moving backends from ``unumpy`` into the respective array libraries.

Backward compatibility
----------------------

There are no backward incompatible changes proposed in this NEP.

Alternatives
------------

The current alternative to this problem is NEP-30 plus adding more
protocols (not yet specified) in addition to it. Even then, some parts of
the NumPy API will remain non-overridable, so it's a partial alternative.

The main alternative to vendoring ``unumpy`` is to simply move it into
NumPy completely and not distribute it as a separate package. This would
also achieve the proposed goals; however, we prefer to keep it a separate
package for now, for reasons already stated above.
Discussion
----------

* ``uarray`` blogpost: https://labs.quansight.org/blog/2019/07/uarray-update-api-changes-overhead-and-comparison-to-__array_function__/
* The discussion section of NEP-18: https://numpy.org/neps/nep-0018-array-function-protocol.html#discussion
* NEP-22: https://numpy.org/neps/nep-0022-ndarray-duck-typing-overview.html
* Dask issue #4462: https://github.com/dask/dask/issues/4462
* PR #13046: https://github.com/numpy/numpy/pull/13046
* Dask issue #4883: https://github.com/dask/dask/issues/4883
* Issue #13831: https://github.com/numpy/numpy/issues/13831
* Discussion PR 1: https://github.com/hameerabbasi/numpy/pull/3
* Discussion PR 2: https://github.com/hameerabbasi/numpy/pull/4

References and Footnotes
------------------------

.. _[1]:

[1] uarray, A general dispatch mechanism for Python: https://uarray.readthedocs.io

.. _[2]:

[2] NEP 18 — A dispatch mechanism for NumPy's high level array functions: https://numpy.org/neps/nep-0018-array-function-protocol.html

.. _[3]:

[3] NEP 22 — Duck typing for NumPy arrays — high level overview: https://numpy.org/neps/nep-0022-ndarray-duck-typing-overview.html

.. _[4]:

[4] NEP 13 — A Mechanism for Overriding Ufuncs: https://numpy.org/neps/nep-0013-ufunc-overrides.html

.. _[5]:

[5] Reply to Adding to the non-dispatched implementation of NumPy methods: http://numpy-discussion.10968.n7.nabble.com/Adding-to-the-non-dispatched-implementation-of-NumPy-methods-tp46816p46874.html

.. _[6]:

[6] Custom Dtype/Units discussion: http://numpy-discussion.10968.n7.nabble.com/Custom-Dtype-Units-discussion-td43262.html

.. _[7]:

[7] The epic dtype cleanup plan: https://github.com/numpy/numpy/issues/2899

.. _[8]:

[8] unumpy: NumPy, but implementation-independent: https://unumpy.readthedocs.io

.. _[9]:

[9] NEP 30 — Duck Typing for NumPy Arrays - Implementation: https://www.numpy.org/neps/nep-0030-duck-array-protocol.html

.. _[10]:

[10] http://scipy.github.io/devdocs/fft.html#backend-control

Copyright
---------

This document has been placed in the public domain.

Best regards,
Hameer Abbasi

[1] https://github.com/numpy/numpy/pull/14389
[2] https://numpy.org/neps/nep-0022-ndarray-duck-typing-overview.html

From njs at pobox.com  Mon Sep  2 17:09:02 2019
From: njs at pobox.com (Nathaniel Smith)
Date: Mon, 2 Sep 2019 14:09:02 -0700
Subject: [Numpy-discussion] =?utf-8?q?NEP_31_=E2=80=94_Context-local_and_?=
 =?utf-8?q?global_overrides_of_the_NumPy_API?=
In-Reply-To: 
References: 
Message-ID: 

On Mon, Sep 2, 2019 at 2:15 AM Hameer Abbasi wrote:
> Me, Ralf Gommers and Peter Bell (both cc'd) have come up with a proposal
> on how to solve the array creation and duck array problems. The solution
> is outlined in NEP-31, currently in the form of a PR, [1]

Thanks for putting this together! It'd be great to have more
engagement between uarray and numpy.

> ============================================================
>
> NEP 31 — Context-local and global overrides of the NumPy API
>
> ============================================================

Now that I've read this over, my main feedback is that right now it
seems too vague and high-level to give it a fair evaluation? The idea
of a NEP is to lay out a problem and proposed solution in enough
detail that it can be evaluated and critiqued, but this felt to me
more like it was pointing at some other documents for all the details
and then promising that uarray has solutions for all our problems.
> This NEP takes a more holistic approach: It assumes that there are parts
> of the API that need to be overridable, and that these will grow over
> time. It provides a general framework and a mechanism to avoid a design
> of a new protocol each time this is required.

The idea of a holistic approach makes me nervous, because I'm not sure
we have holistic problems. Sometimes a holistic approach is the right
thing; other times it means sweeping the actual problems under the
rug, so things *look* simple and clean but in fact nothing has been
solved, and they just end up biting us later. And from the NEP as
currently written, I can't tell whether this is the good kind of
holistic or the bad kind of holistic.

Now I'm writing vague handwavey things, so let me follow my own advice
and make it more concrete with an example :-).

When Stephan and I were writing NEP 22, the single thing we spent the
most time discussing was the problem of duck-array coercion, and in
particular what to do about existing code that does
np.asarray(duck_array_obj).

The reason this is challenging is that there's a lot of code written
in Cython/C/C++ that calls np.asarray, and then blindly casts the
return value to a PyArray struct and starts accessing the raw memory
fields. If np.asarray starts returning anything besides a real-actual
np.ndarray object, then this code will start corrupting random memory,
leading to a segfault at best.

Stephan felt strongly that this meant that existing np.asarray calls
*must not* ever return anything besides an np.ndarray object, and
therefore we needed to add a new function np.asduckarray(), or maybe
an explicit opt-in flag like np.asarray(..., allow_duck_array=True).

I agreed that this was a problem, but thought we might be able to get
away with an "opt-out" system, where we add an allow_duck_array= flag,
but make it *default* to True, and document that the Cython/C/C++
users who want to work with a raw np.ndarray object should modify
their code to explicitly call np.asarray(obj, allow_duck_array=False).
This would mean that for a while people who tried to pass duck-arrays
into legacy libraries would get segfaults, but there would be a clear
path for fixing these issues as they were discovered.

Either way, there are also some other details to figure out: how does
this affect the C version of asarray? What about np.asfortranarray --
probably that should default to allow_duck_array=False, even if we did
make np.asarray default to allow_duck_array=True, right?

Now if I understand right, your proposal would be to make it so any
code in any package could arbitrarily change the behavior of
np.asarray for all inputs, e.g. I could just decide that
np.asarray([1, 2, 3]) should return some arbitrary non-np.ndarray
object. It seems like this has a much greater potential for breaking
existing Cython/C/C++ code, and the NEP doesn't currently describe why
this extra power is useful, and it doesn't currently describe how it
plans to mitigate the downsides. (For example, if a caller needs a
real np.ndarray, then is there some way to explicitly request one? The
NEP doesn't say.) Maybe this is all fine and there are solutions to
these issues, but any proposal to address duck array coercion needs to
at least talk about these issues!

And that's just one example... array coercion is a particularly
central and tricky problem, but the numpy API is big, and there are
probably other problems like this. For another example, I don't
understand what the NEP is proposing to do about dtypes at all.
That's why I think the NEP needs to be fleshed out a lot more before it will be possible to evaluate fairly. -n -- Nathaniel J. Smith -- https://vorpus.org From ralf.gommers at gmail.com Tue Sep 3 02:20:36 2019 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Mon, 2 Sep 2019 23:20:36 -0700 Subject: [Numpy-discussion] =?utf-8?q?NEP_31_=E2=80=94_Context-local_and_?= =?utf-8?q?global_overrides_of_the_NumPy_API?= In-Reply-To: References: Message-ID: On Mon, Sep 2, 2019 at 2:09 PM Nathaniel Smith wrote: > On Mon, Sep 2, 2019 at 2:15 AM Hameer Abbasi > wrote: > > Me, Ralf Gommers and Peter Bell (both cc?d) have come up with a proposal > on how to solve the array creation and duck array problems. The solution is > outlined in NEP-31, currently in the form of a PR, [1] > > Thanks for putting this together! It'd be great to have more > engagement between uarray and numpy. > > > ============================================================ > > > > NEP 31 ? Context-local and global overrides of the NumPy API > > > > ============================================================ > > Now that I've read this over, my main feedback is that right now it > seems too vague and high-level to give it a fair evaluation? The idea > of a NEP is to lay out a problem and proposed solution in enough > detail that it can be evaluated and critiqued, but this felt to me > more like it was pointing at some other documents for all the details > and then promising that uarray has solutions for all our problems. > This is fair enough I think. We'll need to put some more thought in where to refer to other NEPs, and where to be more concrete. > > This NEP takes a more holistic approach: It assumes that there are parts > of the API that need to be > > overridable, and that these will grow over time. It provides a general > framework and a mechanism to > > avoid a design of a new protocol each time this is required. > > The idea of a holistic approach makes me nervous, because I'm not sure > we have holistic problems. Sometimes a holistic approach is the right > thing; other times it means sweeping the actual problems under the > rug, so things *look* simple and clean but in fact nothing has been > solved, and they just end up biting us later. And from the NEP as > currently written, I can't tell whether this is the good kind of > holistic or the bad kind of holistic. > > Now I'm writing vague handwavey things, so let me follow my own advice > and make it more concrete with an example :-). > > When Stephan and I were writing NEP 22, the single thing we spent the > most time discussing was the problem of duck-array coercion, and in > particular what to do about existing code that does > np.asarray(duck_array_obj). > > The reason this is challenging is that there's a lot of code written > in Cython/C/C++ that calls np.asarray, Cython code only perhaps? It would surprise me if there's a lot of C/C++ code that explicitly calls into our Python rather than C API. and then blindly casts the > return value to a PyArray struct and starts accessing the raw memory > fields. If np.asarray starts returning anything besides a real-actual > np.ndarray object, then this code will start corrupting random memory, > leading to a segfault at best. > > Stephan felt strongly that this meant that existing np.asarray calls > *must not* ever return anything besides an np.ndarray object, and > therefore we needed to add a new function np.asduckarray(), or maybe > an explicit opt-in flag like np.asarray(..., allow_duck_array=True). 
> I agreed that this was a problem, but thought we might be able to get
> away with an "opt-out" system, where we add an allow_duck_array= flag,
> but make it *default* to True, and document that the Cython/C/C++
> users who want to work with a raw np.ndarray object should modify
> their code to explicitly call np.asarray(obj, allow_duck_array=False).
> This would mean that for a while people who tried to pass duck-arrays
> into legacy libraries would get segfaults, but there would be a clear
> path for fixing these issues as they were discovered.
>
> Either way, there are also some other details to figure out: how does
> this affect the C version of asarray? What about np.asfortranarray --
> probably that should default to allow_duck_array=False, even if we did
> make np.asarray default to allow_duck_array=True, right?
>
> Now if I understand right, your proposal would be to make it so any
> code in any package could arbitrarily change the behavior of
> np.asarray for all inputs, e.g. I could just decide that
> np.asarray([1, 2, 3]) should return some arbitrary non-np.ndarray
> object.

No, definitely not! It's all opt-in, by explicitly importing from
`numpy.overridable` or `unumpy`. No behavior of anything in the existing
numpy namespaces should be affected in any way. I agree with the concerns
below, hence it should stay opt-in.

Cheers,
Ralf

> It seems like this has a much greater potential for breaking
> existing Cython/C/C++ code, and the NEP doesn't currently describe why
> this extra power is useful, and it doesn't currently describe how it
> plans to mitigate the downsides. (For example, if a caller needs a
> real np.ndarray, then is there some way to explicitly request one? The
> NEP doesn't say.) Maybe this is all fine and there are solutions to
> these issues, but any proposal to address duck array coercion needs to
> at least talk about these issues!
>
> And that's just one example... array coercion is a particularly
> central and tricky problem, but the numpy API is big, and there are
> probably other problems like this. For another example, I don't
> understand what the NEP is proposing to do about dtypes at all.
>
> That's why I think the NEP needs to be fleshed out a lot more before
> it will be possible to evaluate fairly.
>
> -n
>
> --
> Nathaniel J. Smith -- https://vorpus.org
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at python.org
> https://mail.python.org/mailman/listinfo/numpy-discussion

From einstein.edison at gmail.com  Tue Sep  3 05:06:38 2019
From: einstein.edison at gmail.com (Hameer Abbasi)
Date: Tue, 3 Sep 2019 11:06:38 +0200
Subject: [Numpy-discussion] =?utf-8?q?NEP_31_=E2=80=94_Context-local_and_?=
 =?utf-8?q?global_overrides_of_the_NumPy_API?=
In-Reply-To: 
References: 
Message-ID: <8bc4cb7c-3334-2a82-ba1b-94b7ed3425dd@gmail.com>

Hi Nathaniel,

On 02.09.19 23:09, Nathaniel Smith wrote:
> On Mon, Sep 2, 2019 at 2:15 AM Hameer Abbasi wrote:
>> Me, Ralf Gommers and Peter Bell (both cc'd) have come up with a proposal
>> on how to solve the array creation and duck array problems. The solution
>> is outlined in NEP-31, currently in the form of a PR, [1]
> Thanks for putting this together! It'd be great to have more
> engagement between uarray and numpy.
>
>> ============================================================
>>
>> NEP 31 — 
Context-local and global overrides of the NumPy API >> >> ============================================================ > Now that I've read this over, my main feedback is that right now it > seems too vague and high-level to give it a fair evaluation? The idea > of a NEP is to lay out a problem and proposed solution in enough > detail that it can be evaluated and critiqued, but this felt to me > more like it was pointing at some other documents for all the details > and then promising that uarray has solutions for all our problems. > >> This NEP takes a more holistic approach: It assumes that there are parts of the API that need to be >> overridable, and that these will grow over time. It provides a general framework and a mechanism to >> avoid a design of a new protocol each time this is required. > The idea of a holistic approach makes me nervous, because I'm not sure > we have holistic problems. The fact that we're having to design more and more protocols for a lot of very similar things is, to me, an indicator that we do have holistic problems that ought to be solved by a single protocol. > Sometimes a holistic approach is the right > thing; other times it means sweeping the actual problems under the > rug, so things *look* simple and clean but in fact nothing has been > solved, and they just end up biting us later. And from the NEP as > currently written, I can't tell whether this is the good kind of > holistic or the bad kind of holistic. > > Now I'm writing vague handwavey things, so let me follow my own advice > and make it more concrete with an example :-). > > When Stephan and I were writing NEP 22, the single thing we spent the > most time discussing was the problem of duck-array coercion, and in > particular what to do about existing code that does > np.asarray(duck_array_obj). > > The reason this is challenging is that there's a lot of code written > in Cython/C/C++ that calls np.asarray, and then blindly casts the > return value to a PyArray struct and starts accessing the raw memory > fields. If np.asarray starts returning anything besides a real-actual > np.ndarray object, then this code will start corrupting random memory, > leading to a segfault at best. > > Stephan felt strongly that this meant that existing np.asarray calls > *must not* ever return anything besides an np.ndarray object, and > therefore we needed to add a new function np.asduckarray(), or maybe > an explicit opt-in flag like np.asarray(..., allow_duck_array=True). > > I agreed that this was a problem, but thought we might be able to get > away with an "opt-out" system, where we add an allow_duck_array= flag, > but make it *default* to True, and document that the Cython/C/C++ > users who want to work with a raw np.ndarray object should modify > their code to explicitly call np.asarray(obj, allow_duck_array=False). > This would mean that for a while people who tried to pass duck-arrays > into legacy library would get segfaults, but there would be a clear > path for fixing these issues as they were discovered. > > Either way, there are also some other details to figure out: how does > this affect the C version of asarray? What about np.asfortranarray ? > probably that should default to allow_duck_array=False, even if we did > make np.asarray default to allow_duck_array=True, right? > > Now if I understand right, your proposal would be to make it so any > code in any package could arbitrarily change the behavior of > np.asarray for all inputs, e.g. 
I could just decide that
> np.asarray([1, 2, 3]) should return some arbitrary non-np.ndarray
> object. It seems like this has a much greater potential for breaking
> existing Cython/C/C++ code, and the NEP doesn't currently describe why
> this extra power is useful, and it doesn't currently describe how it
> plans to mitigate the downsides. (For example, if a caller needs a
> real np.ndarray, then is there some way to explicitly request one? The
> NEP doesn't say.) Maybe this is all fine and there are solutions to
> these issues, but any proposal to address duck array coercion needs to
> at least talk about these issues!

I believe I addressed this in a previous email, but the NEP doesn't suggest
overriding numpy.asarray or numpy.array. It suggests overriding
numpy.overridable.asarray and numpy.overridable.array, so existing code
will continue to work as-is and overrides are opt-in rather than forced on
you. The argument about this kind of code could be applied to return values
from other functions as well.

That said, there is a way to request a NumPy array object explicitly::

    with ua.set_backend(np):
        x = np.asarray(...)

> And that's just one example... array coercion is a particularly
> central and tricky problem, but the numpy API is big, and there are
> probably other problems like this. For another example, I don't
> understand what the NEP is proposing to do about dtypes at all.

Just as there are other kinds of arrays, there may be other kinds of dtypes
that are not NumPy dtypes. They cannot be attached to a NumPy array object
(as Sebastian pointed out to me in last week's Community meeting), but they
can still provide other powerful features.

> That's why I think the NEP needs to be fleshed out a lot more before
> it will be possible to evaluate fairly.
>
> -n

I just pushed a new version of the NEP to my PR, the full-text of which is
below.

============================================================
NEP 31 — Context-local and global overrides of the NumPy API
============================================================

:Author: Hameer Abbasi
:Author: Ralf Gommers
:Author: Peter Bell
:Status: Draft
:Type: Standards Track
:Created: 2019-08-22

Abstract
--------

This NEP proposes to make all of NumPy's public API overridable via an
extensible backend mechanism, using a library called ``uarray`` `[1]`_.

``uarray`` provides global and context-local overrides, as well as a
dispatch mechanism similar to NEP-18 `[2]`_. First experiences with
``__array_function__`` show that it is necessary to be able to override
NumPy functions that *do not take an array-like argument*, and hence
aren't overridable via ``__array_function__``. The most pressing need is
array creation and coercion functions - see e.g. NEP-30 `[9]`_.

This NEP proposes to allow, in an opt-in fashion, overriding any part of
the NumPy API. It is intended as a comprehensive resolution to NEP-22
`[3]`_, and obviates the need to add an ever-growing list of new protocols
for each new type of function or object that needs to become overridable.

Motivation and Scope
--------------------

The motivation behind ``uarray`` is manifold: First, there have been
several attempts to allow dispatch of parts of the NumPy API, including
(most prominently) the ``__array_ufunc__`` protocol in NEP-13 `[4]`_, and
the ``__array_function__`` protocol in NEP-18 `[2]`_, but this has shown
the need for further protocols to be developed, including a protocol for
coercion (see `[5]`_).
The reasons these overrides are needed have been extensively discussed in
the references, and this NEP will not attempt to go into the details of why
these are needed. Another pain point requiring yet another protocol is the
duck-array protocol (see `[9]`_).

This NEP takes a more holistic approach: It assumes that there are parts of
the API that need to be overridable, and that these will grow over time. It
provides a general framework and a mechanism to avoid designing a new
protocol each time this is required.

This NEP proposes the following: That ``unumpy`` `[8]`_ becomes the
recommended override mechanism for the parts of the NumPy API not yet
covered by ``__array_function__`` or ``__array_ufunc__``, and that
``uarray`` is vendored into a new namespace within NumPy to give users and
downstream dependencies access to these overrides. This vendoring mechanism
is similar to what SciPy decided to do for making ``scipy.fft`` overridable
(see `[10]`_).

Detailed description
--------------------

**Note:** *This section will not attempt to go into too much detail about
``uarray``; that is the purpose of the ``uarray`` documentation.* `[1]`_
*However, the NumPy community will have input into the design of
``uarray``, via the issue tracker.*

``uarray`` Primer
^^^^^^^^^^^^^^^^^

Defining backends
~~~~~~~~~~~~~~~~~

``uarray`` consists of two main protocols: ``__ua_convert__`` and
``__ua_function__``, called in that order, along with ``__ua_domain__``,
which is a string defining the domain of the backend. If any of the
protocols return ``NotImplemented``, we fall back to the next backend.

``__ua_convert__`` is for conversion and coercion. It has the signature
``(dispatchables, coerce)``, where ``dispatchables`` is an iterable of
``ua.Dispatchable`` objects and ``coerce`` is a boolean indicating whether
or not to force the conversion. ``ua.Dispatchable`` is a simple class
consisting of three values: ``type``, ``value``, and ``coercible``.
``__ua_convert__`` returns an iterable of the converted values, or
``NotImplemented`` in the case of failure. Returning ``NotImplemented``
here will cause ``uarray`` to move to the next available backend.

``__ua_function__`` has the signature ``(func, args, kwargs)`` and defines
the actual implementation of the function. It receives the function and
its arguments. Returning ``NotImplemented`` will cause a move to the
default implementation of the function if one exists, and failing that,
the next backend. If all backends are exhausted, a
``ua.BackendNotImplementedError`` is raised.

Backends can be registered for permanent use if required.

Defining overridable multimethods
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

To define an overridable function (a multimethod), one needs a few things:

1. A dispatcher that returns an iterable of ``ua.Dispatchable`` objects.
2. A reverse dispatcher that replaces dispatchable values with the
   supplied ones.
3. A domain.
4. Optionally, a default implementation, which can be provided in terms of
   other multimethods.

As an example, consider the following::

    import numpy as np
    import uarray as ua

    def full_argreplacer(args, kwargs, dispatchables):
        def full(shape, fill_value, dtype=None, order='C'):
            return (shape, fill_value), dict(
                dtype=dispatchables[0],
                order=order
            )
        return full(*args, **kwargs)

    @ua.create_multimethod(full_argreplacer, domain="numpy")
    def full(shape, fill_value, dtype=None, order='C'):
        return (ua.Dispatchable(dtype, np.dtype),)
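To see the dispatch in action, here is a hedged sketch that drives the
``full`` multimethod above with a deliberately trivial backend.
``EchoBackend`` is invented for illustration; only the ``uarray`` calls
shown are assumed::

    import uarray as ua

    class EchoBackend:
        __ua_domain__ = "numpy"

        @staticmethod
        def __ua_function__(func, args, kwargs):
            # Instead of computing anything, report which multimethod was
            # called and with what (already argreplaced) arguments.
            return (func.__name__, args, kwargs)

    with ua.set_backend(EchoBackend):
        print(full((2, 3), 0.0))  # a tuple describing the call, not an ndarray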
A large set of examples can be found in the ``unumpy`` repository, `[8]`_.

This simple act of overriding callables allows us to override:

* Methods
* Properties, via ``fget`` and ``fset``
* Entire objects, via ``__get__``

Using overrides
~~~~~~~~~~~~~~~

The way we propose the overrides will be used by end users is::

    import numpy.overridable as np

    with np.set_backend(backend):
        x = np.asarray(my_array, dtype=dtype)

And a library that implements a NumPy-like API will use it in the
following manner (as an example)::

    import numpy.overridable as np

    _ua_implementations = {}

    __ua_domain__ = "numpy"

    def __ua_function__(func, args, kwargs):
        fn = _ua_implementations.get(func, None)
        return fn(*args, **kwargs) if fn is not None else NotImplemented

    def implements(ua_func):
        def inner(func):
            _ua_implementations[ua_func] = func
            return func
        return inner

    @implements(np.asarray)
    def asarray(a, dtype=None, order=None):
        # Code here
        # Either this method or __ua_convert__ must
        # return NotImplemented for unsupported types,
        # or they shouldn't be marked as dispatchable.

    # Provides a default implementation for ones and zeros.
    @implements(np.full)
    def full(shape, fill_value, dtype=None, order='C'):
        # Code here

The only change this NEP proposes at its acceptance is to make ``unumpy``
the officially recommended way to override NumPy. ``unumpy`` will remain a
separate repository/package for the time being (which we propose to vendor
to avoid a hard dependency, and to use the separate ``unumpy`` package only
if it is installed), and will be developed primarily with the input of
duck-array authors and, secondarily, custom dtype authors, via the usual
GitHub workflow. There are a few reasons for this:

* Faster iteration in the case of bugs or issues.
* Faster design changes, in the case of needed functionality.
* ``unumpy`` will work with older versions of NumPy as well.
* The user and library author opt in to the override process, rather than
  breakages happening when it is least expected. In simple terms, bugs in
  ``unumpy`` mean that ``numpy`` remains unaffected.

Duck-array coercion
~~~~~~~~~~~~~~~~~~~

There are inherent problems with returning objects that are not NumPy
arrays from ``numpy.array`` or ``numpy.asarray``, particularly in the
context of C/C++ or Cython code that may get an object with a different
memory layout than the one it expects. However, we believe this problem may
apply not only to these two functions but to all functions that return
NumPy arrays. For this reason, overrides are opt-in for the user, by using
the submodule ``numpy.overridable`` rather than ``numpy``. NumPy will
continue to work unaffected by anything in ``numpy.overridable``.

If the user wishes to obtain a NumPy array, there are two ways of doing it:

1. Use ``numpy.asarray`` (the non-overridable version).
2. Use ``numpy.overridable.asarray`` with the NumPy backend set and
   coercion enabled::

    import numpy.overridable as np

    with ua.set_backend(np):
        x = np.asarray(...)
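As a hedged illustration of that second option, a library function that
must hand real ``ndarray`` memory to C code could pin the NumPy backend for
the duration of the conversion. The ``coerce=True`` keyword follows the
``ua.set_backend`` API described earlier; passing the module itself as the
backend mirrors the snippet above, though the exact object NumPy exposes as
its backend should be treated as an assumption::

    import uarray as ua
    import numpy.overridable as np

    def as_real_ndarray(obj):
        # Inside this context every np.* call dispatches to NumPy itself,
        # and coerce=True forces conversion of coercible inputs.
        with ua.set_backend(np, coerce=True):
            return np.asarray(obj)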
Advantages of ``unumpy`` over other solutions
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

``unumpy`` offers a number of advantages over the approach of defining a
new protocol for every problem encountered: Whenever there is something
requiring an override, ``unumpy`` will be able to offer a unified API with
very minor changes. For example:

* ``ufunc`` objects can be overridden via their ``__call__``, ``reduce``
  and other methods.
* Other functions can be overridden in a similar fashion.
* ``np.asduckarray`` goes away, and becomes ``np.asarray`` with a backend
  set.
* The same holds for array creation functions such as ``np.zeros``,
  ``np.empty`` and so on.

This also holds for the future: Making something overridable would require
only minor changes to ``unumpy``.

Another promise ``unumpy`` holds is one of default implementations. Default
implementations can be provided for any multimethod, in terms of others.
This allows one to override a large part of the NumPy API by defining only
a small part of it. This eases the creation of new duck-arrays, by
providing default implementations of the many functions that can be easily
expressed in terms of others, as well as a repository of utility functions
that most duck-array implementations would require.

The last benefit is a clear way to coerce to a given backend (via the
``coerce`` keyword in ``ua.set_backend``), and a protocol for coercing not
only arrays, but also ``dtype`` objects and ``ufunc`` objects, with similar
ones from other libraries. This is due to the existence of actual,
third-party dtype packages, and their desire to blend into the NumPy
ecosystem (see `[6]`_). This is a separate issue compared to the C-level
dtype redesign proposed in `[7]`_; it's about allowing third-party dtype
implementations to work with NumPy, much like third-party array
implementations. These can provide features such as, for example, units,
jagged arrays or other such features that are outside the scope of NumPy.

Mixing NumPy and ``unumpy`` in the same file
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

Normally, one would want to import only one of ``unumpy`` or ``numpy``, and
would import it as ``np`` for familiarity. However, there may be situations
where one wishes to mix NumPy and the overrides, and there are a few ways
to do this, depending on the user's style::

    import numpy.overridable as unumpy
    import numpy as np

or::

    import numpy as np

    # Use unumpy via np.overridable

Related Work
------------

Previous override mechanisms
^^^^^^^^^^^^^^^^^^^^^^^^^^^^

* NEP-18, the ``__array_function__`` protocol. `[2]`_
* NEP-13, the ``__array_ufunc__`` protocol. `[4]`_

Existing NumPy-like array implementations
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

* Dask: https://dask.org/
* CuPy: https://cupy.chainer.org/
* PyData/Sparse: https://sparse.pydata.org/
* Xnd: https://xnd.readthedocs.io/
* Astropy's Quantity: https://docs.astropy.org/en/stable/units/

Existing and potential consumers of alternative arrays
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

* Dask: https://dask.org/
* scikit-learn: https://scikit-learn.org/
* xarray: https://xarray.pydata.org/
* TensorLy: http://tensorly.org/

Existing alternate dtype implementations
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

* ``ndtypes``: https://ndtypes.readthedocs.io/en/latest/
* Datashape: https://datashape.readthedocs.io
* Plum: https://plum-py.readthedocs.io/

Implementation
--------------

The implementation of this NEP will require the following steps:

* Implementation of ``uarray`` multimethods corresponding to the NumPy API,
  including classes for overriding ``dtype``, ``ufunc`` and ``array``
  objects, in the ``unumpy`` repository.
* Moving backends from ``unumpy`` into the respective array libraries
  (a sketch of what this might look like follows below).
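As a sketch of what that second step might look like, a third-party
duck-array library could register its backend at import time, so that
simply importing the library makes it available for dispatch.
``SparseBackend`` and the ``_implementations`` table are hypothetical;
only ``ua.register_backend`` is assumed from the ``uarray`` API::

    import uarray as ua

    _implementations = {}  # filled in via an ``implements`` decorator, as above

    class SparseBackend:
        __ua_domain__ = "numpy"

        @staticmethod
        def __ua_function__(func, args, kwargs):
            impl = _implementations.get(func)
            return impl(*args, **kwargs) if impl is not None else NotImplemented

    # Typically done in the package's __init__.py.
    ua.register_backend(SparseBackend)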
Backward compatibility
----------------------

There are no backward incompatible changes proposed in this NEP.

Alternatives
------------

The current alternative to this problem is NEP-30 plus adding more
protocols (not yet specified) in addition to it. Even then, some parts of
the NumPy API will remain non-overridable, so it's a partial alternative.

The main alternative to vendoring ``unumpy`` is to simply move it into
NumPy completely and not distribute it as a separate package. This would
also achieve the proposed goals; however, we prefer to keep it a separate
package for now, for reasons already stated above.

Discussion
----------

* ``uarray`` blogpost: https://labs.quansight.org/blog/2019/07/uarray-update-api-changes-overhead-and-comparison-to-__array_function__/
* The discussion section of NEP-18: https://numpy.org/neps/nep-0018-array-function-protocol.html#discussion
* NEP-22: https://numpy.org/neps/nep-0022-ndarray-duck-typing-overview.html
* Dask issue #4462: https://github.com/dask/dask/issues/4462
* PR #13046: https://github.com/numpy/numpy/pull/13046
* Dask issue #4883: https://github.com/dask/dask/issues/4883
* Issue #13831: https://github.com/numpy/numpy/issues/13831
* Discussion PR 1: https://github.com/hameerabbasi/numpy/pull/3
* Discussion PR 2: https://github.com/hameerabbasi/numpy/pull/4

References and Footnotes
------------------------

.. _[1]:

[1] uarray, A general dispatch mechanism for Python: https://uarray.readthedocs.io

.. _[2]:

[2] NEP 18 — A dispatch mechanism for NumPy's high level array functions: https://numpy.org/neps/nep-0018-array-function-protocol.html

.. _[3]:

[3] NEP 22 — Duck typing for NumPy arrays — high level overview: https://numpy.org/neps/nep-0022-ndarray-duck-typing-overview.html

.. _[4]:

[4] NEP 13 — A Mechanism for Overriding Ufuncs: https://numpy.org/neps/nep-0013-ufunc-overrides.html

.. _[5]:

[5] Reply to Adding to the non-dispatched implementation of NumPy methods: http://numpy-discussion.10968.n7.nabble.com/Adding-to-the-non-dispatched-implementation-of-NumPy-methods-tp46816p46874.html

.. _[6]:

[6] Custom Dtype/Units discussion: http://numpy-discussion.10968.n7.nabble.com/Custom-Dtype-Units-discussion-td43262.html

.. _[7]:

[7] The epic dtype cleanup plan: https://github.com/numpy/numpy/issues/2899

.. _[8]:

[8] unumpy: NumPy, but implementation-independent: https://unumpy.readthedocs.io

.. _[9]:

[9] NEP 30 — Duck Typing for NumPy Arrays - Implementation: https://www.numpy.org/neps/nep-0030-duck-array-protocol.html

.. _[10]:

[10] http://scipy.github.io/devdocs/fft.html#backend-control

Copyright
---------

This document has been placed in the public domain.

From warren.weckesser at gmail.com  Tue Sep  3 08:56:23 2019
From: warren.weckesser at gmail.com (Warren Weckesser)
Date: Tue, 3 Sep 2019 08:56:23 -0400
Subject: [Numpy-discussion] NEP 32: Remove the financial functions from NumPy
Message-ID: 

Github issue 2880 ("Get financial functions out of main namespace",
https://github.com/numpy/numpy/issues/2880) has been open since 2013. In a
recent community meeting, it was suggested that we create a NEP to propose
the removal of the financial functions from NumPy. I have submitted
"NEP 32: Remove the financial functions from NumPy" in a pull request at
https://github.com/numpy/numpy/pull/14399. A copy of the latest version of
the NEP is below.
According to the NEP process document, "Once the PR is in place, the NEP
should be announced on the mailing list for discussion (comments on the PR
itself should be restricted to minor editorial and technical fixes)." This
email is the announcement for NEP 32.

The NEP includes a brief summary of the history of the financial functions,
and has links to several relevant mailing list threads, dating back to when
the functions were added to NumPy in 2008. I recommend reviewing those
threads before commenting here.

Warren

-----

==================================================
NEP 32 — Remove the financial functions from NumPy
==================================================

:Author: Warren Weckesser
:Status: Draft
:Type: Standards Track
:Created: 2019-08-30

Abstract
--------

We propose deprecating and ultimately removing the financial functions [1]_
from NumPy. The functions will be moved to an independent repository, and
provided to the community as a separate package with the name
``numpy_financial``.

Motivation and scope
--------------------

The NumPy financial functions [1]_ are the 10 functions ``fv``, ``ipmt``,
``irr``, ``mirr``, ``nper``, ``npv``, ``pmt``, ``ppmt``, ``pv`` and
``rate``. The functions provide elementary financial calculations such as
future value, net present value, etc. These functions were added to NumPy
in 2008 [2]_.

In May, 2009, a request by Joe Harrington to add a function called ``xirr``
to the financial functions triggered a long thread about these functions
[3]_. One important point that came up in that thread is that a "real"
financial library must be able to handle real dates. The NumPy financial
functions do not work with actual dates or calendars. The preference for a
more capable library independent of NumPy was expressed several times in
that thread.

In June, 2009, D. L. Goldsmith expressed concerns about the correctness of
the implementations of some of the financial functions [4]_. It was
suggested then to move the financial functions out of NumPy to an
independent package.

In a GitHub issue in 2013 [5]_, Nathaniel Smith suggested moving the
financial functions from the top-level namespace to ``numpy.financial``. He
also suggested giving the functions better names. Responses at that time
included the suggestion to deprecate them and move them from NumPy to a
separate package. This issue is still open.

Later in 2013 [6]_, it was suggested on the mailing list that these
functions be removed from NumPy.

The arguments for the removal of these functions from NumPy:

* They are too specialized for NumPy.
* They are not actually useful for "real world" financial calculations,
  because they do not handle real dates and calendars.
* The definition of "correctness" for some of these functions seems to be a
  matter of convention, and the current NumPy developers do not have the
  background to judge their correctness.
* There has been little interest among past and present NumPy developers
  in maintaining these functions.

The main arguments for keeping the functions in NumPy are:

* Removing these functions will be disruptive for some users. Current users
  will have to add the new ``numpy_financial`` package to their
  dependencies, and then modify their code to use the new package.
* The functions provided, while not "industrial strength", are apparently
  similar to functions provided by spreadsheets and some calculators.
  Having them available in NumPy makes it easier for some developers to
  migrate their software to Python and NumPy.
It is clear from comments in the mailing list discussions and in the GitHub
issues that many current NumPy developers believe the benefits of removing
the functions outweigh the costs. For example, from [5]_::

    The financial functions should probably be part of a separate package
    -- Charles Harris

    If there's a better package we can point people to we could just
    deprecate them and then remove them entirely... I'd be fine with that
    too...
    -- Nathaniel Smith

    +1 to deprecate them. If no other package exists, it can be created if
    someone feels the need for that.
    -- Ralf Gommers

    I feel pretty strongly that we should deprecate these. If nobody on
    numpy's core team is interested in maintaining them, then it is purely
    a drag on development for NumPy.
    -- Stephan Hoyer

And from the 2013 mailing list discussion, about removing the functions
from NumPy::

    I am +1 as well, I don't think they should have been included in the
    first place.
    -- David Cournapeau

But not everyone was in favor of removal::

    The fin routines are tiny and don't require much maintenance once
    written. If we made an effort (putting up pages with examples of common
    financial calculations and collecting those under a topical web page,
    then linking to that page from various places and talking it up), I
    would think they could attract users looking for a free way to play
    with financial scenarios. [...]
    So, I would say we keep them. If ours are not the best, we should bring
    them up to snuff.
    -- Joe Harrington

For an idea of the maintenance burden of the financial functions, one can
look for all the GitHub issues [7]_ and pull requests [8]_ that have the
tag ``component: numpy.lib.financial``.

One method for measuring the effect of removing these functions is to find
all the packages on GitHub that use them. Such a search can be performed
with the ``python-api-inspect`` service [9]_. A search for all uses of the
NumPy financial functions finds just eight repositories. (See the comments
in [5]_ for the actual SQL query.)

Implementation
--------------

* Create a new Python package, ``numpy_financial``, to be maintained in the
  top-level NumPy GitHub organization. This repository will contain the
  definitions and unit tests for the financial functions. The package will
  be added to PyPI so it can be installed with ``pip``.
* Deprecate the financial functions in the ``numpy`` namespace, beginning
  in NumPy version 1.18. Remove the financial functions from NumPy version
  1.20.

Backward compatibility
----------------------

The removal of these functions breaks backward compatibility, as explained
earlier. The effects are mitigated by providing the ``numpy_financial``
library.
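To make the migration concrete, here is a sketch of the before and after
for one of the affected functions. The ``numpy_financial`` package name
comes from this NEP; the ``fv`` signature shown matches the current NumPy
documentation::

    import numpy as np
    import numpy_financial as npf

    # Today (to be deprecated in NumPy 1.18 and removed in 1.20):
    x = np.fv(rate=0.05 / 12, nper=10 * 12, pmt=-100, pv=-100)

    # After the split, the same calculation through the new package:
    y = npf.fv(rate=0.05 / 12, nper=10 * 12, pmt=-100, pv=-100)

    assert x == y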
Alternatives
------------

The following alternatives were mentioned in [5]_:

* *Maintain the functions as they are (i.e. do nothing).* A review of the
  history makes clear that this is not the preference of many NumPy
  developers. A recurring comment is that the functions simply do not
  belong in NumPy. When that sentiment is combined with the history of bug
  reports and the ongoing questions about the correctness of the functions,
  the conclusion is that the cleanest solution is deprecation and removal.
* *Move the functions from the ``numpy`` namespace to ``numpy.financial``.*
  This was the initial suggestion in [5]_. Such a change does not address
  the maintenance issues, and doesn't change the misfit that many
  developers see between these functions and NumPy. It causes disruption
  for the current users of these functions without addressing what many
  developers see as the fundamental problem.

Discussion
----------

Links to past mailing list discussions, and to relevant GitHub issues and
pull requests, have already been given.

References and footnotes
------------------------

.. [1] Financial functions,
   https://numpy.org/doc/1.17/reference/routines.financial.html

.. [2] Numpy-discussion mailing list, "Simple financial functions for NumPy",
   https://mail.python.org/pipermail/numpy-discussion/2008-April/032353.html

.. [3] Numpy-discussion mailing list, "add xirr to numpy financial functions?",
   https://mail.python.org/pipermail/numpy-discussion/2009-May/042645.html

.. [4] Numpy-discussion mailing list, "Definitions of pv, fv, nper, pmt, and rate",
   https://mail.python.org/pipermail/numpy-discussion/2009-June/043188.html

.. [5] Get financial functions out of main namespace,
   https://github.com/numpy/numpy/issues/2880

.. [6] Numpy-discussion mailing list, "Deprecation of financial routines",
   https://mail.python.org/pipermail/numpy-discussion/2013-August/067409.html

.. [7] ``component: numpy.lib.financial`` issues,
   https://github.com/numpy/numpy/issues?utf8=%E2%9C%93&q=is%3Aissue+label%3A%22component%3A+numpy.lib.financial%22+

.. [8] ``component: numpy.lib.financial`` pull requests,
   https://github.com/numpy/numpy/pulls?utf8=%E2%9C%93&q=is%3Apr+label%3A%22component%3A+numpy.lib.financial%22+

.. [9] Quansight-Labs/python-api-inspect,
   https://github.com/Quansight-Labs/python-api-inspect/

Copyright
---------

This document has been placed in the public domain.

From sebastian at sipsolutions.net  Tue Sep  3 10:33:58 2019
From: sebastian at sipsolutions.net (Sebastian Berg)
Date: Tue, 03 Sep 2019 09:33:58 -0500
Subject: [Numpy-discussion] NumPy Community Meeting Wednesday, Sep. 4
Message-ID: 

Hi all,

There will be a NumPy Community meeting Wednesday September 4 at 11 am
Pacific Time. Everyone is invited to join in and edit the work-in-progress
meeting topics and notes: https://hackmd.io/76o-IxCjQX2mOXO_wwkcpg?both

Best wishes

Sebastian

From sebastian at sipsolutions.net  Tue Sep  3 12:35:45 2019
From: sebastian at sipsolutions.net (Sebastian Berg)
Date: Tue, 03 Sep 2019 11:35:45 -0500
Subject: [Numpy-discussion] NEP 32: Remove the financial functions from NumPy
In-Reply-To: 
References: 
Message-ID: <9067a8f06bc885307d1ec726a55bc5fd906c3c62.camel@sipsolutions.net>

On Tue, 2019-09-03 at 08:56 -0400, Warren Weckesser wrote:
> Github issue 2880 ("Get financial functions out of main namespace",

Very briefly, I am absolutely in favor of this. Keeping the functions in
numpy seems to be more of a liability than a help to anyone. And this push
is more likely to help users by spurring development on a good replacement
than a practically unmaintained corner of NumPy that may seem like it
solves a problem, but probably does so very poorly.

Moving them into a separate pip-installable package seems like the best way
forward until a better replacement, to which we can point users, comes up.

- Sebastian

> https://github.com/numpy/numpy/issues/2880) has been open since 2013.
> In a recent community meeting, it was suggested that we create a NEP
> to propose the removal of the financial functions from NumPy. I have
> submitted "NEP 32: Remove the financial functions from NumPy" in a
> pull request at https://github.com/numpy/numpy/pull/14399. A copy of
> the latest version of the NEP is below.
>
> According to the NEP process document, "Once the PR is in place, the
> NEP should be announced on the mailing list for discussion (comments
> on the PR itself should be restricted to minor editorial and
> technical fixes)." This email is the announcement for NEP 32.
>
> The NEP includes a brief summary of the history of the financial
> functions, and has links to several relevant mailing list threads,
> dating back to when the functions were added to NumPy in 2008. I
> recommend reviewing those threads before commenting here.
>
> Warren
>
> [...]

From Martin.Gfeller at swisscom.com  Wed Sep  4 13:35:28 2019
From: Martin.Gfeller at swisscom.com (Martin.Gfeller at swisscom.com)
Date: Wed, 4 Sep 2019 17:35:28 +0000
Subject: [Numpy-discussion] NEP 32: Remove the financial functions from NumPy
In-Reply-To:
References:
Message-ID:

Dear all

As a user of Numpy in finance, I'm absolutely in favour of removing these
functions. They're too domain-specific, not flexible and general enough for
widespread use, and probably not easy to maintain.

Best regards
Martin

From ilhanpolat at gmail.com  Wed Sep  4 14:10:11 2019
From: ilhanpolat at gmail.com (Ilhan Polat)
Date: Wed, 4 Sep 2019 20:10:11 +0200
Subject: [Numpy-discussion] NEP 32: Remove the financial functions from NumPy
In-Reply-To: <9067a8f06bc885307d1ec726a55bc5fd906c3c62.camel@sipsolutions.net>
References: <9067a8f06bc885307d1ec726a55bc5fd906c3c62.camel@sipsolutions.net>
Message-ID:

+1 on removing them from NumPy. I think there are plenty of alternatives
already; so many, in fact, that we might even consider deprecating them
just like the SciPy misc module, by pointing to the alternatives.

On Tue, Sep 3, 2019 at 6:38 PM Sebastian Berg wrote:

> Very briefly, I am absolutely in favor of this.
>
> Keeping the functions in numpy seems more of a liability than a help to
> anyone. And this push is more likely to help users by spurring
> development on a good replacement than a practically unmaintained corner
> of NumPy that may seem like it solves a problem, but probably does so
> very poorly.
>
> Moving them into a separate pip-installable package seems like the best
> way forward until a better replacement, to which we can point users,
> comes up.
>
> - Sebastian
>
> [...]

From matthew.brett at gmail.com  Wed Sep  4 14:17:01 2019
From: matthew.brett at gmail.com (Matthew Brett)
Date: Wed, 4 Sep 2019 19:17:01 +0100
Subject: [Numpy-discussion] NEP 32: Remove the financial functions from NumPy
In-Reply-To:
References: <9067a8f06bc885307d1ec726a55bc5fd906c3c62.camel@sipsolutions.net>
Message-ID:

Hi,

Maybe worth asking over at the Pandas list? I bet there are more Python /
finance people over there.

Cheers,

Matthew

On Wed, Sep 4, 2019 at 7:11 PM Ilhan Polat wrote:
>
> +1 on removing them from NumPy. I think there are plenty of alternatives
> already; so many, in fact, that we might even consider deprecating them
> just like the SciPy misc module, by pointing to the alternatives.
>
> [...]

From einstein.edison at gmail.com  Thu Sep  5 08:12:04 2019
From: einstein.edison at gmail.com (Hameer Abbasi)
Date: Thu, 5 Sep 2019 14:12:04 +0200
Subject: [Numpy-discussion] NEP 31 — Context-local and global overrides of
 the NumPy API
In-Reply-To:
References:
Message-ID:

Hello everyone,

Thanks to all the feedback from the community, in particular Sebastian Berg,
we have a new draft of NEP-31. Please find the full text quoted below for
discussion and reference. Any feedback and discussion is welcome.

============================================================
NEP 31 — Context-local and global overrides of the NumPy API
============================================================

:Author: Hameer Abbasi
:Author: Ralf Gommers
:Author: Peter Bell
:Status: Draft
:Type: Standards Track
:Created: 2019-08-22

Abstract
--------

This NEP proposes to make all of NumPy's public API overridable via an
extensible backend mechanism. Acceptance of this NEP means NumPy would
provide global and context-local overrides, as well as a dispatch mechanism
similar to NEP-18 [2]_.

First experiences with ``__array_function__`` show that it is necessary to
be able to override NumPy functions that *do not take an array-like
argument*, and hence aren't overridable via ``__array_function__``. The most
pressing need is array creation and coercion functions, such as
``numpy.zeros`` or ``numpy.asarray``; see e.g. NEP-30 [9]_.

This NEP proposes to allow, in an opt-in fashion, overriding any part of the
NumPy API. It is intended as a comprehensive resolution to NEP-22 [3]_, and
obviates the need to add an ever-growing list of new protocols for each new
type of function or object that needs to become overridable.

Motivation and Scope
--------------------

The motivation behind ``uarray`` is manifold: First, there have been several
attempts to allow dispatch of parts of the NumPy API, including (most
prominently) the ``__array_ufunc__`` protocol in NEP-13 [4]_, and the
``__array_function__`` protocol in NEP-18 [2]_, but this has shown the need
for further protocols to be developed, including a protocol for coercion
(see [5]_, [9]_). The reasons these overrides are needed have been
extensively discussed in the references, and this NEP will not attempt to go
into the details of why these are needed; but in short: it is necessary for
library authors to be able to coerce arbitrary objects into arrays of their
own types, such as CuPy needing to coerce to a CuPy array, for example,
instead of a NumPy array.

These kinds of overrides are useful for both the end-user as well as library
authors. End-users may have written or wish to write code that they then
later speed up or move to a different implementation, say PyData/Sparse.
They can do this simply by setting a backend, as sketched in the example
below.
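
For example (an illustrative sketch only: it uses the ``unumpy`` API
described later in this NEP, and assumes that the ``sparse`` module can act
as a backend)::

    import numpy.overridable as unp
    import sparse

    def normalize(x):
        # Plain unumpy code, written once against the NumPy-like API.
        x = unp.asarray(x)
        return x / unp.sum(x)

    # The same function is moved to a sparse implementation simply by
    # setting a backend; the data stays sparse throughout.
    with unp.set_backend(sparse):
        y = normalize(sparse.random((1000, 1000), density=0.01))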

Library authors may also wish to write code that is portable across array
implementations: for example, ``sklearn`` may wish to write the code for a
machine learning algorithm once, in a way that works with any array
implementation, while also using array creation functions.

This NEP takes a holistic approach: It assumes that there are parts of the
API that need to be overridable, and that these will grow over time. It
provides a general framework and a mechanism to avoid designing a new
protocol each time this is required. This was the goal of ``uarray``: to
allow for overrides in an API without needing the design of a new protocol.

This NEP proposes the following: That ``unumpy`` [8]_ becomes the
recommended override mechanism for the parts of the NumPy API not yet
covered by ``__array_function__`` or ``__array_ufunc__``, and that
``uarray`` is vendored into a new namespace within NumPy to give users and
downstream dependencies access to these overrides. This vendoring mechanism
is similar to what SciPy decided to do for making ``scipy.fft`` overridable
(see [10]_).

Detailed description
--------------------

Using overrides
~~~~~~~~~~~~~~~

We propose that end users and libraries use the overrides as follows::

    # On the library side

    import numpy.overridable as unp

    def library_function(array):
        array = unp.asarray(array)
        # Code using unumpy as usual
        return array

    # On the user side:

    import numpy.overridable as unp
    import uarray as ua
    import dask.array as da

    ua.register_backend(da)

    library_function(dask_array)  # works and returns dask_array

    with unp.set_backend(da):
        library_function([1, 2, 3, 4])  # actually returns a Dask array

Here, ``backend`` can be any compatible object defined either by NumPy or an
external library, such as Dask or CuPy. Ideally, it should be the module
``dask.array`` or ``cupy`` itself.

Composing backends
~~~~~~~~~~~~~~~~~~

Some backends may depend on other backends: for example, xarray depending on
``numpy.fft`` to transform a time axis into a frequency axis, or Dask/xarray
holding an array other than a NumPy array inside it. This would be handled
in the following manner inside code::

    with ua.set_backend(cupy), ua.set_backend(dask.array):
        # Code that has distributed GPU arrays here

Proposals
~~~~~~~~~

The only change this NEP proposes at its acceptance is to make ``unumpy``
the officially recommended way to override NumPy. ``unumpy`` will remain a
separate repository/package, which we propose to vendor to avoid a hard
dependency, using the separate ``unumpy`` package only if it is installed,
rather than depending on it, for the time being. In concrete terms,
``numpy.overridable`` becomes an alias for ``unumpy`` if it is available,
with a fallback to a vendored version if not; a sketch of this fallback is
given at the end of this section.

``uarray`` and ``unumpy`` will be developed primarily with the input of
duck-array authors and, secondarily, custom dtype authors, via the usual
GitHub workflow. There are a few reasons for this:

* Faster iteration in the case of bugs or issues.
* Faster design changes, in the case of needed functionality.
* ``unumpy`` will work with older versions of NumPy as well.
* The user and library author opt in to the override process, rather than
  breakages happening when it is least expected. In simple terms, bugs in
  ``unumpy`` mean that ``numpy`` remains unaffected.
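
As a concrete illustration of the aliasing and fallback mentioned above, the
import logic could look roughly like the following sketch (the vendored
module path is hypothetical and not part of this proposal)::

    # numpy/overridable.py -- illustrative only
    try:
        # Prefer the separately installed, up-to-date unumpy package.
        from unumpy import *
    except ImportError:
        # Fall back to the copy vendored into NumPy.
        from numpy._vendored.unumpy import *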

Advantages of ``unumpy`` over other solutions
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

``unumpy`` offers a number of advantages over the approach of defining a new
protocol for every problem encountered: Whenever there is something
requiring an override, ``unumpy`` will be able to offer a unified API with
very minor changes. For example:

* ``ufunc`` objects can be overridden via their ``__call__``, ``reduce`` and
  other methods.
* Other functions can be overridden in a similar fashion.
* ``np.asduckarray`` goes away, and becomes ``np.overridable.asarray`` with
  a backend set.
* The same holds for array creation functions such as ``np.zeros``,
  ``np.empty`` and so on.

This also holds for the future: Making something overridable would require
only minor changes to ``unumpy``.

Another promise ``unumpy`` holds is one of default implementations. Default
implementations can be provided for any multimethod, in terms of others.
This allows one to override a large part of the NumPy API by defining only a
small part of it. This is meant to ease the creation of new duck-arrays, by
providing default implementations of many functions that can be easily
expressed in terms of others, as well as a repository of utility functions
that help in the implementation of duck-arrays that most duck-arrays would
require.

It also allows one to override functions in a manner which
``__array_function__`` simply cannot, such as overriding ``np.einsum`` with
the version from the ``opt_einsum`` package, or Intel MKL overriding FFT,
BLAS or ``ufunc`` objects. Such a library would define a backend with the
appropriate multimethods, and the user would select it via a ``with``
statement, or by registering it as a backend.

The last benefit is a clear way to coerce to a given backend (via the
``coerce`` keyword in ``ua.set_backend``), and a protocol for coercing not
only arrays, but also ``dtype`` objects and ``ufunc`` objects, with similar
ones from other libraries. This is due to the existence of actual,
third-party dtype packages, and their desire to blend into the NumPy
ecosystem (see [6]_). This is a separate issue from the C-level dtype
redesign proposed in [7]_; it's about allowing third-party dtype
implementations to work with NumPy, much like third-party array
implementations. These can provide features such as, for example, units,
jagged arrays or other such features that are outside the scope of NumPy.

Mixing NumPy and ``unumpy`` in the same file
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

Normally, one would want to import only one of ``unumpy`` or ``numpy``, and
would import it as ``np`` for familiarity. However, there may be situations
where one wishes to mix NumPy and the overrides, and there are a few ways to
do this, depending on the user's style::

    from numpy import overridable as unp
    import numpy as np

or::

    import numpy as np

    # Use unumpy via np.overridable

Duck-array coercion
~~~~~~~~~~~~~~~~~~~

There are inherent problems with returning objects that are not NumPy arrays
from ``numpy.array`` or ``numpy.asarray``, particularly in the context of
C/C++ or Cython code that may get an object with a different memory layout
than the one it expects. However, we believe this problem may apply not only
to these two functions but to all functions that return NumPy arrays. For
this reason, overrides are opt-in for the user, achieved by using the
submodule ``numpy.overridable`` rather than ``numpy``. NumPy will continue
to work unaffected by anything in ``numpy.overridable``.
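
For instance (an illustrative sketch, reusing the Dask example from above)::

    import numpy as np
    import numpy.overridable as unp
    import dask.array as da

    # The plain numpy namespace is never affected by backends:
    assert type(np.asarray([1, 2, 3])) is np.ndarray

    # Only code that explicitly uses numpy.overridable opts in:
    with unp.set_backend(da):
        x = unp.asarray([1, 2, 3])  # may return a Dask array, not an ndarray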

If the user wishes to obtain a NumPy array, there are two ways of doing it:

1. Use ``numpy.asarray`` (the non-overridable version).
2. Use ``numpy.overridable.asarray`` with the NumPy backend set and coercion
   enabled.

Related Work
------------

Other override mechanisms
~~~~~~~~~~~~~~~~~~~~~~~~~

* NEP-18, the ``__array_function__`` protocol. [2]_
* NEP-13, the ``__array_ufunc__`` protocol. [4]_
* NEP-30, the ``__duck_array__`` protocol. [9]_

Existing NumPy-like array implementations
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

* Dask: https://dask.org/
* CuPy: https://cupy.chainer.org/
* PyData/Sparse: https://sparse.pydata.org/
* Xnd: https://xnd.readthedocs.io/
* Astropy's Quantity: https://docs.astropy.org/en/stable/units/

Existing and potential consumers of alternative arrays
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

* Dask: https://dask.org/
* scikit-learn: https://scikit-learn.org/
* xarray: https://xarray.pydata.org/
* TensorLy: http://tensorly.org/

Existing alternate dtype implementations
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

* ``ndtypes``: https://ndtypes.readthedocs.io/en/latest/
* Datashape: https://datashape.readthedocs.io
* Plum: https://plum-py.readthedocs.io/

Implementation
--------------

The implementation of this NEP will require the following steps:

* Implementation of ``uarray`` multimethods corresponding to the NumPy API,
  including classes for overriding ``dtype``, ``ufunc`` and ``array``
  objects, in the ``unumpy`` repository.
* Moving backends from ``unumpy`` into the respective array libraries.

``uarray`` Primer
~~~~~~~~~~~~~~~~~

**Note:** *This section will not attempt to go into too much detail about
uarray; that is the purpose of the uarray documentation* [1]_. *However, the
NumPy community will have input into the design of uarray, via the issue
tracker.*

``unumpy`` is the interface that defines a set of overridable functions
(multimethods) compatible with the NumPy API. To do this, it uses the
``uarray`` library. ``uarray`` is a general-purpose tool for creating
multimethods that dispatch to one of multiple different possible backend
implementations. In this sense, it is similar to the ``__array_function__``
protocol, but with the key difference that the backend is explicitly
installed by the end-user and not coupled to the array type.

Decoupling the backend from the array type gives much more flexibility to
end-users and backend authors. For example, it is possible to:

* override functions not taking arrays as arguments
* create backends out of source from the array type
* install multiple backends for the same array type

This decoupling also means that ``uarray`` is not constrained to dispatching
over array-like types. The backend is free to inspect the entire set of
function arguments to determine if it can implement the function, e.g.
``dtype`` parameter dispatching.

Defining backends
^^^^^^^^^^^^^^^^^

``uarray`` consists of two main protocols: ``__ua_convert__`` and
``__ua_function__``, called in that order, along with ``__ua_domain__``.

``__ua_convert__`` is for conversion and coercion. It has the signature
``(dispatchables, coerce)``, where ``dispatchables`` is an iterable of
``ua.Dispatchable`` objects and ``coerce`` is a boolean indicating whether
or not to force the conversion. ``ua.Dispatchable`` is a simple class with
three attributes: ``type``, ``value``, and ``coercible``. ``__ua_convert__``
returns an iterable of the converted values, or ``NotImplemented`` in the
case of failure, as in the sketch below.
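
As a rough sketch, a backend's ``__ua_convert__`` could look like the
following (illustrative only; a real backend would typically handle more
dispatchable types and edge cases)::

    import numpy as np

    __ua_domain__ = "numpy"

    def __ua_convert__(dispatchables, coerce):
        converted = []
        for d in dispatchables:
            if d.type is np.ndarray:
                if isinstance(d.value, np.ndarray):
                    converted.append(d.value)
                elif coerce and d.coercible:
                    # Force the conversion only when coercion is requested.
                    converted.append(np.asarray(d.value))
                else:
                    return NotImplemented
            elif d.type is np.dtype:
                # dtype dispatchables convert via the NumPy dtype constructor.
                converted.append(np.dtype(d.value) if d.value is not None else None)
            else:
                return NotImplemented
        return converted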

``__ua_function__`` has the signature ``(func, args, kwargs)`` and defines
the actual implementation of the function. It receives the function and its
arguments. Returning ``NotImplemented`` will cause a move to the default
implementation of the function if one exists, and failing that, the next
backend.

Here is what will happen when a ``uarray`` multimethod is called:

1. We canonicalise the arguments so any arguments without a default are
   placed in ``*args`` and those with one are placed in ``**kwargs``.
2. We check the list of backends.

   a. If it is empty, we try the default implementation.

3. We check if the backend's ``__ua_convert__`` method exists. If it exists:

   a. We pass it the output of the dispatcher, which is an iterable of
      ``ua.Dispatchable`` objects.
   b. We feed this output, along with the arguments, to the argument
      replacer. ``NotImplemented`` means we move to 3 with the next backend.
   c. We store the replaced arguments as the new arguments.

4. We feed the arguments into ``__ua_function__``, and return the output,
   exiting if it isn't ``NotImplemented``.
5. If the default implementation exists, we try it with the current backend.
6. On failure, we move to 3 with the next backend. If there are no more
   backends, we move to 7.
7. We raise a ``ua.BackendNotImplementedError``.

Defining overridable multimethods
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

To define an overridable function (a multimethod), one needs a few things:

1. A dispatcher that returns an iterable of ``ua.Dispatchable`` objects.
2. A reverse dispatcher that replaces dispatchable values with the supplied
   ones.
3. A domain.
4. Optionally, a default implementation, which can be provided in terms of
   other multimethods.

As an example, consider the following::

    import uarray as ua
    import numpy as np

    def full_argreplacer(args, kwargs, dispatchables):
        def full(shape, fill_value, dtype=None, order='C'):
            return (shape, fill_value), dict(
                dtype=dispatchables[0],
                order=order
            )

        return full(*args, **kwargs)

    @ua.create_multimethod(full_argreplacer, domain="numpy")
    def full(shape, fill_value, dtype=None, order='C'):
        return (ua.Dispatchable(dtype, np.dtype),)

A large set of examples can be found in the ``unumpy`` repository [8]_. This
simple act of overriding callables allows us to override:

* Methods
* Properties, via ``fget`` and ``fset``
* Entire objects, via ``__get__``.

Examples for NumPy
^^^^^^^^^^^^^^^^^^

A library that implements a NumPy-like API will use it in the following
manner (as an example)::

    import numpy.overridable as unp

    _ua_implementations = {}

    __ua_domain__ = "numpy"

    def __ua_function__(func, args, kwargs):
        fn = _ua_implementations.get(func, None)
        return fn(*args, **kwargs) if fn is not None else NotImplemented

    def implements(ua_func):
        def inner(func):
            _ua_implementations[ua_func] = func
            return func

        return inner

    @implements(unp.asarray)
    def asarray(a, dtype=None, order=None):
        # Code here
        # Either this method or __ua_convert__ must
        # return NotImplemented for unsupported types,
        # or they shouldn't be marked as dispatchable.
        ...

    # Provides a default implementation for ones and zeros.
    @implements(unp.full)
    def full(shape, fill_value, dtype=None, order='C'):
        # Code here
        ...

Backward compatibility
----------------------

There are no backward-incompatible changes proposed in this NEP.

Alternatives
------------

The current alternative to this problem is a combination of NEP-18 [2]_,
NEP-13 [4]_ and NEP-30 [9]_ plus adding more protocols (not yet specified)
in addition to it.
Even then, some parts of the NumPy API will remain non-overridable, so it's
only a partial alternative.

The main alternative to vendoring ``unumpy`` is to simply move it into NumPy
completely and not distribute it as a separate package. This would also
achieve the proposed goals; however, we prefer to keep it a separate package
for now, for reasons already stated above.

The third alternative is to move ``unumpy`` into the NumPy organisation and
develop it as a NumPy project. This would also achieve the said goals, and
is also a possibility that can be considered by this NEP. However, the act
of doing an extra ``pip install`` or ``conda install`` may discourage some
users from adopting this method.

Discussion
----------

* ``uarray`` blogpost: https://labs.quansight.org/blog/2019/07/uarray-update-api-changes-overhead-and-comparison-to-__array_function__/
* The discussion section of NEP-18: https://numpy.org/neps/nep-0018-array-function-protocol.html#discussion
* NEP-22: https://numpy.org/neps/nep-0022-ndarray-duck-typing-overview.html
* Dask issue #4462: https://github.com/dask/dask/issues/4462
* PR #13046: https://github.com/numpy/numpy/pull/13046
* Dask issue #4883: https://github.com/dask/dask/issues/4883
* Issue #13831: https://github.com/numpy/numpy/issues/13831
* Discussion PR 1: https://github.com/hameerabbasi/numpy/pull/3
* Discussion PR 2: https://github.com/hameerabbasi/numpy/pull/4
* Discussion PR 3: https://github.com/numpy/numpy/pull/14389

References and Footnotes
------------------------

.. [1] uarray, A general dispatch mechanism for Python:
   https://uarray.readthedocs.io

.. [2] NEP 18 — A dispatch mechanism for NumPy's high level array functions:
   https://numpy.org/neps/nep-0018-array-function-protocol.html

.. [3] NEP 22 — Duck typing for NumPy arrays — high level overview:
   https://numpy.org/neps/nep-0022-ndarray-duck-typing-overview.html

.. [4] NEP 13 — A Mechanism for Overriding Ufuncs:
   https://numpy.org/neps/nep-0013-ufunc-overrides.html

.. [5] Reply to Adding to the non-dispatched implementation of NumPy methods:
   http://numpy-discussion.10968.n7.nabble.com/Adding-to-the-non-dispatched-implementation-of-NumPy-methods-tp46816p46874.html

.. [6] Custom Dtype/Units discussion:
   http://numpy-discussion.10968.n7.nabble.com/Custom-Dtype-Units-discussion-td43262.html

.. [7] The epic dtype cleanup plan:
   https://github.com/numpy/numpy/issues/2899

.. [8] unumpy: NumPy, but implementation-independent:
   https://unumpy.readthedocs.io

.. [9] NEP 30 — Duck Typing for NumPy Arrays - Implementation:
   https://www.numpy.org/neps/nep-0030-duck-array-protocol.html

.. [10] ``scipy.fft`` backend control:
   http://scipy.github.io/devdocs/fft.html#backend-control

Copyright
---------

This document has been placed in the public domain.

From jpivarski at gmail.com  Thu Sep  5 11:51:12 2019
From: jpivarski at gmail.com (Jim Pivarski)
Date: Thu, 5 Sep 2019 10:51:12 -0500
Subject: [Numpy-discussion] Integer array indexing (numpy.take) as function composition
Message-ID:

Hi,

I'm a long-time user of Numpy; I had a question and I didn't know where else
to ask. (It's not a bug — otherwise I would have posted it at
https://github.com/numpy/numpy/issues). Has anyone noticed that indexing an
array with integer arrays (i.e. numpy.take) is a function composition?

For example, suppose you have any two non-negative functions of integers:

    def f(x):
        return x**2 - 5*x + 10

    def g(y):
        return max(0, 2*y - 10) + 3

and you sample them as arrays, as well as their composition g(f(·)):

    F = numpy.array([f(i) for i in range(10)])    # F is f at 10 elements
    G = numpy.array([g(i) for i in range(100)])   # G is g at enough elements
                                                  # to include max(f)
    GoF = numpy.array([g(f(i)) for i in range(10)])  # GoF is g∘f at 10 elements

Indexing G by F (G[F]) returns the same result as the sampled composition
(GoF):

    print("G\u2218F =", G[F])   # integer indexing
    print("g\u2218f =", GoF)    # array of the composed functions

    G∘F = [13 5 3 3 5 13 25 41 61 85]
    g∘f = [13 5 3 3 5 13 25 41 61 85]

This isn't a proof, but I think it's easy to see that it would be true for
any non-negative functions (negative index handling spoils this property).

It might sound like a purely academic point, but I've noticed that I've been
able to optimize and simplify some code by taking advantage of the
associative property of function composition, repeatedly applying numpy.take
on arrays of integers before applying the fully composed index to my data.

As an example of an optimization, if I have to do the same thing to N data
arrays, it helps to prepare a single integer index and apply it to the N
data arrays instead of modifying all N data arrays in multiple steps. As an
example of a simplification, if I need to modify arrays in recursion, it's
easier to reason about the recursion if only the terminal case applies an
index to data, with the non-terminal steps applying indexes to indexes.

This is such a basic property that I bet it has a name, and there's probably
some literature on it, like what you could find if you were interested in
monads in Haskell. But I haven't been able to find the right search
strings — what would you call this property? Is there a literature on it and
its uses?

Thanks!
-- Jim

From charlesr.harris at gmail.com  Thu Sep  5 18:55:30 2019
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Thu, 5 Sep 2019 16:55:30 -0600
Subject: [Numpy-discussion] 1.17.2 release.
Message-ID:

Hi All,

I'm planning to make a 1.17.2 release Friday or Saturday in order to fix
some newly reported regressions. If there is anything that you think
absolutely needs to be in that release, please yell.

Chuck

From grlee77 at gmail.com  Thu Sep  5 19:27:19 2019
From: grlee77 at gmail.com (Gregory Lee)
Date: Thu, 5 Sep 2019 19:27:19 -0400
Subject: [Numpy-discussion] 1.17.2 release.
In-Reply-To:
References:
Message-ID:

Hi Chuck,

It is not critical, but it would be nice if the fft ZeroDivisionError fix
in https://github.com/numpy/numpy/pull/14279 could make it into 1.17.2. It
has an "approved" review and seems to be ready.

Thanks!
Greg
From charlesr.harris at gmail.com Thu Sep 5 20:32:29 2019
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Thu, 5 Sep 2019 18:32:29 -0600
Subject: [Numpy-discussion] 1.17.2 release.
In-Reply-To: References: Message-ID:

On Thu, Sep 5, 2019 at 5:27 PM Gregory Lee wrote:

> Hi Chuck,
>
> It is not critical, but it would be nice if the fft ZeroDivisionError fix
> in https://github.com/numpy/numpy/pull/14279 could make it into 1.17.2.
> It has an "approved" review and seems to be ready.
> Thanks!
>

OK, I put that in and copied `pocketfft.py` from master for the backport.
The main argument was over the naming of the new variable and I think Eric
made a valid point, but we can always switch things around. I also thought
it would be nice to check the `inv_norm` directly rather than through `n`,
but there you go. If you would like to clean it up further, feel free to
do so, but at least 1.17 will no longer be an issue in that regard.

Chuck

On Thu, Sep 5, 2019 at 6:56 PM Charles R Harris wrote:
>
>> Hi All,
>>
>> I'm planning to make a 1.17.2 release Friday or Saturday in order to fix
>> some newly reported regressions. If there is anything that you think
>> absolutely needs to be in that release, please yell.
>>
>> Chuck
>> _______________________________________________
>> NumPy-Discussion mailing list
>> NumPy-Discussion at python.org
>> https://mail.python.org/mailman/listinfo/numpy-discussion
>>
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at python.org
> https://mail.python.org/mailman/listinfo/numpy-discussion
>

From njs at pobox.com Fri Sep 6 03:49:15 2019
From: njs at pobox.com (Nathaniel Smith)
Date: Fri, 6 Sep 2019 00:49:15 -0700
Subject: [Numpy-discussion] =?utf-8?q?NEP_31_=E2=80=94_Context-local_and_?=
 =?utf-8?q?global_overrides_of_the_NumPy_API?=
In-Reply-To: References: Message-ID:

On Mon, Sep 2, 2019 at 11:21 PM Ralf Gommers wrote:
> On Mon, Sep 2, 2019 at 2:09 PM Nathaniel Smith wrote:
>> The reason this is challenging is that there's a lot of code written
>> in Cython/C/C++ that calls np.asarray,
>
> Cython code only perhaps? It would surprise me if there's a lot of C/C++ code that explicitly calls into our Python rather than C API.

I think there's also code written as Python-wrappers-around-C-code where
the Python layer handles the error-checking/coercion, and the C code
trusts it to have done so.

>> Now if I understand right, your proposal would be to make it so any
>> code in any package could arbitrarily change the behavior of
>> np.asarray for all inputs, e.g. I could just decide that
>> np.asarray([1, 2, 3]) should return some arbitrary non-np.ndarray
>> object.
>
> No, definitely not! It's all opt-in, by explicitly importing from `numpy.overridable` or `unumpy`. No behavior of anything in the existing numpy namespaces should be affected in any way.

Ah, whoops, I definitely missed that :-). That does change things!

So one of the major decision points for any duck-array API work, is
whether to modify the numpy semantics "in place", so user code
automatically gets access to the new semantics, or else to make a new
namespace, that users have to switch over to manually.

The major disadvantage of doing changes "in place" is, of course, that
we have to do all this careful work to move incrementally and make
sure that we don't break things.
The major (potential) advantage is that we have a much better chance of
moving the ecosystem with us.

The major advantage of making a new namespace is that it's *much*
easier to experiment, because there's no chance of breaking any
projects that didn't opt in. The major disadvantage is that numpy is
super strongly entrenched, and convincing every project to switch to
something else is incredibly difficult and costly. (I just searched
github for "import numpy" and got 17.7 million hits. That's a lot of
imports to update!) Also, empirically, we've seen multiple projects
try to do this (e.g. DyND), and so far they all failed.

It sounds like unumpy is an interesting approach that hasn't been
tried before - in particular, the promise that you can "just switch
your imports" is a much easier transition than e.g. DyND offered. Of
course, that promise is somewhat undermined by the reality that all
these potential backend libraries *aren't* 100% compatible with numpy,
and can't be... it might turn out that this ends up like asanyarray,
where you can't really use it reliably because the thing that comes
out will generally support *most* of the normal ndarray semantics, but
you don't know which part. Is scipy planning to switch to using this
everywhere, including in C code? If not, then how do you expect
projects like matplotlib to switch, given that matplotlib likes to
pass array objects into scipy functions? Are you planning to take the
opportunity to clean up some of the obscure corners of the numpy API?

But those are general questions about unumpy, and I'm guessing no-one
knows all the answers yet... and these questions actually aren't super
relevant to the NEP. The NEP isn't inventing unumpy. IIUC, the main
thing the NEP proposes is simply to make "numpy.overridable" an
alias for "unumpy".

It's not clear to me what problem this alias is solving. If all
downstream users have to update their imports anyway, then they can
write "import unumpy as np" just as easily as they can write "import
numpy.overridable as np". I guess the main reason this is a NEP is
because the unumpy project is hoping to get an "official stamp of
approval" from numpy? But even that could be accomplished by just
putting something in the docs. And adding the alias has substantial
risks: it makes unumpy tied to the numpy release cycle and
compatibility rules, and it means that we're committing to maintaining
unumpy ~forever even if Hameer or Quansight move onto other things.
That seems like a lot to take on for such vague benefits?

On Tue, Sep 3, 2019 at 2:04 AM Hameer Abbasi wrote:
> The fact that we're having to design more and more protocols for a lot
> of very similar things is, to me, an indicator that we do have holistic
> problems that ought to be solved by a single protocol.

But the reason we've had trouble designing these protocols is that
they're each different :-). If it was just a matter of copying
__array_ufunc__ we'd have been done in a few minutes...

-n

--
Nathaniel J. Smith -- https://vorpus.org

From einstein.edison at gmail.com Fri Sep 6 04:32:25 2019
From: einstein.edison at gmail.com (Hameer Abbasi)
Date: Fri, 6 Sep 2019 10:32:25 +0200
Subject: [Numpy-discussion] =?utf-8?q?NEP_31_=E2=80=94_Context-local_and_?=
 =?utf-8?q?global_overrides_of_the_NumPy_API?=
In-Reply-To: References: Message-ID: <67b837e7-5d3d-337e-49f7-cac078ec4d8f@gmail.com>

That's a lot of very good questions! Let me see if I can answer them
one-by-one.

On 06.09.19 09:49, Nathaniel Smith wrote:
> Ah, whoops, I definitely missed that :-).
That does change things!
> So one of the major decision points for any duck-array API work, is
> whether to modify the numpy semantics "in place", so user code
> automatically gets access to the new semantics, or else to make a new
> namespace, that users have to switch over to manually.
>
> The major disadvantage of doing changes "in place" is, of course, that
> we have to do all this careful work to move incrementally and make
> sure that we don't break things. The major (potential) advantage is
> that we have a much better chance of moving the ecosystem with us.
>
> The major advantage of making a new namespace is that it's *much*
> easier to experiment, because there's no chance of breaking any
> projects that didn't opt in. The major disadvantage is that numpy is
> super strongly entrenched, and convincing every project to switch to
> something else is incredibly difficult and costly. (I just searched
> github for "import numpy" and got 17.7 million hits. That's a lot of
> imports to update!) Also, empirically, we've seen multiple projects
> try to do this (e.g. DyND), and so far they all failed.
>
> It sounds like unumpy is an interesting approach that hasn't been
> tried before - in particular, the promise that you can "just switch
> your imports" is a much easier transition than e.g. DyND offered. Of
> course, that promise is somewhat undermined by the reality that all
> these potential backend libraries *aren't* 100% compatible with numpy,
> and can't be...

This is true; however, with minor adjustments it should be possible to
make your code work across backends, as long as you don't use a few
obscure parts of NumPy.

> it might turn out that this ends up like asanyarray,
> where you can't really use it reliably because the thing that comes
> out will generally support *most* of the normal ndarray semantics, but
> you don't know which part. Is scipy planning to switch to using this
> everywhere, including in C code?

Not at present, I think. However, it should be possible to "re-write"
parts of scipy on top of unumpy in order to make that work, and, where
speed is required and an efficient implementation isn't available in
terms of NumPy functions, to make dispatchable multimethods and allow
library authors to provide those implementations. We'll call this
project uscipy, but that's an endgame at this point. Right now, we're
focusing on unumpy.

> If not, then how do you expect
> projects like matplotlib to switch, given that matplotlib likes to
> pass array objects into scipy functions? Are you planning to take the
> opportunity to clean up some of the obscure corners of the numpy API?

That's a completely different thing, and to answer that question
requires a distinction between uarray and unumpy...

uarray is a backend mechanism, independent of array computing. We hope
that matplotlib will adopt it to switch around its GUI backends, for
example.

> But those are general questions about unumpy, and I'm guessing no-one
> knows all the answers yet... and these questions actually aren't super
> relevant to the NEP. The NEP isn't inventing unumpy. IIUC, the main
> thing the NEP proposes is simply to make "numpy.overridable" an
> alias for "unumpy".
>
> It's not clear to me what problem this alias is solving. If all
> downstream users have to update their imports anyway, then they can
> write "import unumpy as np" just as easily as they can write "import
> numpy.overridable as np". I guess the main reason this is a NEP is
I guess the main reason this is a NEP is > because the unumpy project is hoping to get an "official stamp of > approval" from numpy? That's part of it. The concrete problems it's solving are threefold: 1. Array creation functions can be overridden. 2. Array coercion is now covered. 3. "Default implementations" will allow you to re-write your NumPy array more easily, when such efficient implementations exist in terms of other NumPy functions. That will also help achieve similar semantics, but as I said, they're just "default"... The import numpy.overridable part is meant to help garner adoption, and to prefer the unumpy module if it is available (which will continue to be developed separately). That way it isn't so tightly coupled to the release cycle. One alternative Sebastian Berg mentioned (and I am on board with) is just moving unumpy into the NumPy organisation. What we fear keeping it separate is that the simple act of a pip install unumpy will keep people from using it or trying it out. > But even that could be accomplished by just > putting something in the docs. And adding the alias has substantial > risks: it makes unumpy tied to the numpy release cycle and > compatibility rules, and it means that we're committing to maintaining > unumpy ~forever even if Hameer or Quansight move onto other things. > That seems like a lot to take on for such vague benefits? I can assure you Travis has had the goal of "replatforming SciPy" from as far back as I met him, he's spawned quite a few efforts in that direction along with others from Quansight (and they've led to nice projects). Quansight, as I see it, is unlikely to abandon something like this if it becomes successful (and acceptance of this NEP will be a huge success story). > On Tue, Sep 3, 2019 at 2:04 AM Hameer Abbasi wrote: >> The fact that we're having to design more and more protocols for a lot >> of very similar things is, to me, an indicator that we do have holistic >> problems that ought to be solved by a single protocol. > But the reason we've had trouble designing these protocols is that > they're each different :-). If it was just a matter of copying > __array_ufunc__ we'd have been done in a few minutes... uarray borrows heavily from __array_function__. It allows substituting (for example) __array_ufunc__ by overriding ufunc.__call__, ufunc.reduce and so on. It takes, as I mentioned, a holistic approach: There are callables that need to be overriden, possibly with nothing to dispatch on. And then it builds on top of that, adding coercion/conversion. > -n > > -- > Nathaniel J. Smith --https://vorpus.org > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion -------------- next part -------------- An HTML attachment was scrubbed... URL: From tcaswell at gmail.com Fri Sep 6 14:37:26 2019 From: tcaswell at gmail.com (Thomas Caswell) Date: Fri, 6 Sep 2019 14:37:26 -0400 Subject: [Numpy-discussion] Proposal to accept NEP #29: Recommend Python and Numpy version support as a community policy standard Message-ID: https://numpy.org/neps/nep-0029-deprecation_policy.html The outstanding concern in https://github.com/numpy/numpy/pull/14086 was that some projects want to continue to support additional versions of Python and numpy outside of the minimum support windows. The language has been changed to specify that these are _minimum_ support windows and that projects _should_ not _will_ drop support as they can. 
There is one trivial wording change PR open
(https://github.com/numpy/numpy/pull/14444).

If there are no substantive objections within 7 days from this email,
then the NEP will be accepted; see NEP 0 for more details.

Tom

--
Thomas Caswell
tcaswell at gmail.com

From ralf.gommers at gmail.com Fri Sep 6 14:44:19 2019
From: ralf.gommers at gmail.com (Ralf Gommers)
Date: Fri, 6 Sep 2019 11:44:19 -0700
Subject: [Numpy-discussion] =?utf-8?q?NEP_31_=E2=80=94_Context-local_and_?=
 =?utf-8?q?global_overrides_of_the_NumPy_API?=
In-Reply-To: <67b837e7-5d3d-337e-49f7-cac078ec4d8f@gmail.com>
References: <67b837e7-5d3d-337e-49f7-cac078ec4d8f@gmail.com>
Message-ID:

On Fri, Sep 6, 2019 at 1:32 AM Hameer Abbasi wrote:

> That's a lot of very good questions! Let me see if I can answer them
> one-by-one.
>
> On 06.09.19 09:49, Nathaniel Smith wrote:
>
> But even that could be accomplished by just
> putting something in the docs. And adding the alias has substantial
> risks: it makes unumpy tied to the numpy release cycle and
> compatibility rules, and it means that we're committing to maintaining
> unumpy ~forever even if Hameer or Quansight move onto other things.
> That seems like a lot to take on for such vague benefits?
>
> I can assure you Travis has had the goal of "replatforming SciPy" from as
> far back as I met him; he's spawned quite a few efforts in that direction
> along with others from Quansight (and they've led to nice projects).
> Quansight, as I see it, is unlikely to abandon something like this if it
> becomes successful (and acceptance of this NEP will be a huge success
> story).
>

Let me address this separately, since it's not really a technical concern.

First, this is not what we say for other contributions. E.g. we didn't say
no to Pocketfft because Martin Reinecke may move on, or __array_function__
because Stephan may get other interests at some point, or a whole new
numpy.random, etc.

Second, this is not about Quansight. At Quansight Labs we've been able to
create time for Hameer to build this, and me and others to contribute -
which is very nice, but the two are not tied inextricably together. In the
end it's still individuals submitting this NEP. I have been a NumPy dev for
~10 years before joining Quansight, and my future NumPy contributions are
not dependent on staying at Quansight (not that I plan to go anywhere!).
I'm guessing the same is true for others.

Third, unumpy is a fairly thin layer over uarray, which already has another
user in SciPy.

Cheers,
Ralf

From ralf.gommers at gmail.com Fri Sep 6 14:52:26 2019
From: ralf.gommers at gmail.com (Ralf Gommers)
Date: Fri, 6 Sep 2019 11:52:26 -0700
Subject: [Numpy-discussion] =?utf-8?q?NEP_31_=E2=80=94_Context-local_and_?=
 =?utf-8?q?global_overrides_of_the_NumPy_API?=
In-Reply-To: References: Message-ID:

On Fri, Sep 6, 2019 at 12:53 AM Nathaniel Smith wrote:

> On Mon, Sep 2, 2019 at 11:21 PM Ralf Gommers
> wrote:
> > On Mon, Sep 2, 2019 at 2:09 PM Nathaniel Smith wrote:
>
> On Tue, Sep 3, 2019 at 2:04 AM Hameer Abbasi
> wrote:
> > The fact that we're having to design more and more protocols for a lot
> > of very similar things is, to me, an indicator that we do have holistic
> > problems that ought to be solved by a single protocol.
>
> But the reason we've had trouble designing these protocols is that
> they're each different :-).
If it was just a matter of copying
> __array_ufunc__ we'd have been done in a few minutes...
>

I don't think that argument is correct. That we now have two very similar
protocols is simply a matter of history and limited developer time. NEP 18
discusses in several places that __array_function__ should be brought in
line with __array_ufunc__, and that we can migrate a function from one
protocol to the other. There's no technical reason other than backwards
compat and dev time why we couldn't use __array_function__ for ufuncs also.

Cheers,
Ralf

From ralf.gommers at gmail.com Fri Sep 6 17:45:11 2019
From: ralf.gommers at gmail.com (Ralf Gommers)
Date: Fri, 6 Sep 2019 14:45:11 -0700
Subject: [Numpy-discussion] =?utf-8?q?NEP_31_=E2=80=94_Context-local_and_?=
 =?utf-8?q?global_overrides_of_the_NumPy_API?=
In-Reply-To: <67b837e7-5d3d-337e-49f7-cac078ec4d8f@gmail.com>
References: <67b837e7-5d3d-337e-49f7-cac078ec4d8f@gmail.com>
Message-ID:

On Fri, Sep 6, 2019 at 1:32 AM Hameer Abbasi wrote:

> That's a lot of very good questions! Let me see if I can answer them
> one-by-one.
>
> On 06.09.19 09:49, Nathaniel Smith wrote:
>
> But those are general questions about unumpy, and I'm guessing no-one
> knows all the answers yet... and these questions actually aren't super
> relevant to the NEP. The NEP isn't inventing unumpy. IIUC, the main
> thing the NEP proposes is simply to make "numpy.overridable" an
> alias for "unumpy".
>
> It's not clear to me what problem this alias is solving. If all
> downstream users have to update their imports anyway, then they can
> write "import unumpy as np" just as easily as they can write "import
> numpy.overridable as np". I guess the main reason this is a NEP is
> because the unumpy project is hoping to get an "official stamp of
> approval" from numpy?
>
Also because we have NEP 30 for yet another protocol, and there's likely
another NEP to follow after that for array creation. Those use cases are
covered by unumpy, so it makes sense to have a NEP for that as well, so
they can be considered side-by-side.

> That's part of it. The concrete problems it's solving are threefold:
>
> 1. Array creation functions can be overridden.
> 2. Array coercion is now covered.
> 3. "Default implementations" will allow you to re-write your NumPy
> array more easily, when such efficient implementations exist in terms of
> other NumPy functions. That will also help achieve similar semantics, but
> as I said, they're just "default"...
>
There may be another very concrete one (that's not yet in the NEP):
allowing other libraries that consume ndarrays to use overrides. An
example is numpy.fft: currently both mkl_fft and pyfftw monkeypatch NumPy,
something we don't like all that much (in particular for mkl_fft, because
it's the default in Anaconda). `__array_function__` isn't able to help
here, because it will always choose NumPy's own implementation for ndarray
input. With unumpy you can support multiple libraries that consume
ndarrays.

Another example is einsum: if you want to use opt_einsum for all inputs
(including ndarrays), then you cannot use np.einsum. And yet another is
using bottleneck (https://kwgoodman.github.io/bottleneck-doc/reference.html)
for nan-functions and partition. There's likely more of these.

The point is: sometimes the array protocols are preferred (e.g.
Dask/Xarray-style meta-arrays), sometimes unumpy-style dispatch works
better.
It's also not necessarily an either or, they can be complementary.

Actually, after writing this I just realized something. With 1.17.x we
have:

```
In [1]: import dask.array as da

In [2]: d = da.from_array(np.linspace(0, 1))

In [3]: np.fft.fft(d)
Out[3]: dask.array<..., chunksize=(50,)>
```

In Anaconda `np.fft.fft` *is* `mkl_fft._numpy_fft.fft`, so this won't
work. We have no bug report yet because 1.17.x hasn't landed in conda
defaults yet (perhaps this is a/the reason why?), but it will be a
problem.

> The import numpy.overridable part is meant to help garner adoption, and
> to prefer the unumpy module if it is available (which will continue to be
> developed separately). That way it isn't so tightly coupled to the release
> cycle. One alternative Sebastian Berg mentioned (and I am on board with) is
> just moving unumpy into the NumPy organisation. What we fear keeping it
> separate is that the simple act of a pip install unumpy will keep people
> from using it or trying it out.
>
Note that this is not the most critical aspect. I pushed for vendoring as
numpy.overridable because I want to not derail the comparison with NEP 30
et al. with a "should we add a dependency" discussion. The interesting part
to decide on first is: do we need the unumpy override mechanism? Vendoring
opt-in vs. making it default vs. adding a dependency is of secondary
interest right now.

Cheers,
Ralf

From njs at pobox.com Fri Sep 6 19:50:46 2019
From: njs at pobox.com (Nathaniel Smith)
Date: Fri, 6 Sep 2019 16:50:46 -0700
Subject: [Numpy-discussion] =?utf-8?q?NEP_31_=E2=80=94_Context-local_and_?=
 =?utf-8?q?global_overrides_of_the_NumPy_API?=
In-Reply-To: References: <67b837e7-5d3d-337e-49f7-cac078ec4d8f@gmail.com>
Message-ID:

On Fri, Sep 6, 2019 at 2:45 PM Ralf Gommers wrote:
> There may be another very concrete one (that's not yet in the NEP): allowing other libraries that consume ndarrays to use overrides. An example is numpy.fft: currently both mkl_fft and pyfftw monkeypatch NumPy, something we don't like all that much (in particular for mkl_fft, because it's the default in Anaconda). `__array_function__` isn't able to help here, because it will always choose NumPy's own implementation for ndarray input. With unumpy you can support multiple libraries that consume ndarrays.

unumpy doesn't help with this either though, does it? unumpy is
double-opt-in: the code using np.fft has to switch to using unumpy.fft
instead, and then someone has to enable the backend. But MKL/pyfftw
started out as opt-in - you could `import mkl_fft` or `import pyfftw`
- and the whole reason they switched to monkeypatching is that they
decided that opt-in wasn't good enough for them.

>> The import numpy.overridable part is meant to help garner adoption, and to prefer the unumpy module if it is available (which will continue to be developed separately). That way it isn't so tightly coupled to the release cycle. One alternative Sebastian Berg mentioned (and I am on board with) is just moving unumpy into the NumPy organisation. What we fear keeping it separate is that the simple act of a pip install unumpy will keep people from using it or trying it out.
>
> Note that this is not the most critical aspect. I pushed for vendoring as numpy.overridable because I want to not derail the comparison with NEP 30 et al. with a "should we add a dependency" discussion. The interesting part to decide on first is: do we need the unumpy override mechanism? Vendoring opt-in vs. making it default vs.
adding a dependency is of secondary interest right now.

Wait, but I thought the only reason we would have a dependency is if
we're exporting it as part of the numpy namespace. If we keep the
import as `import unumpy`, then it works just as well, without any
dependency *or* vendoring in numpy, right?

-n

--
Nathaniel J. Smith -- https://vorpus.org

From njs at pobox.com Fri Sep 6 20:16:04 2019
From: njs at pobox.com (Nathaniel Smith)
Date: Fri, 6 Sep 2019 17:16:04 -0700
Subject: [Numpy-discussion] =?utf-8?q?NEP_31_=E2=80=94_Context-local_and_?=
 =?utf-8?q?global_overrides_of_the_NumPy_API?=
In-Reply-To: References: <67b837e7-5d3d-337e-49f7-cac078ec4d8f@gmail.com>
Message-ID:

On Fri, Sep 6, 2019 at 11:44 AM Ralf Gommers wrote:
>
>
>
> On Fri, Sep 6, 2019 at 1:32 AM Hameer Abbasi wrote:
>>
>> That's a lot of very good questions! Let me see if I can answer them one-by-one.
>>
>> On 06.09.19 09:49, Nathaniel Smith wrote:
>>
>> But even that could be accomplished by just
>> putting something in the docs. And adding the alias has substantial
>> risks: it makes unumpy tied to the numpy release cycle and
>> compatibility rules, and it means that we're committing to maintaining
>> unumpy ~forever even if Hameer or Quansight move onto other things.
>> That seems like a lot to take on for such vague benefits?
>>
>> I can assure you Travis has had the goal of "replatforming SciPy" from as far back as I met him; he's spawned quite a few efforts in that direction along with others from Quansight (and they've led to nice projects). Quansight, as I see it, is unlikely to abandon something like this if it becomes successful (and acceptance of this NEP will be a huge success story).
>
>
> Let me address this separately, since it's not really a technical concern.
>
> First, this is not what we say for other contributions. E.g. we didn't say no to Pocketfft because Martin Reinecke may move on, or __array_function__ because Stephan may get other interests at some point, or a whole new numpy.random, etc.
>
> Second, this is not about Quansight. At Quansight Labs we've been able to create time for Hameer to build this, and me and others to contribute - which is very nice, but the two are not tied inextricably together. In the end it's still individuals submitting this NEP. I have been a NumPy dev for ~10 years before joining Quansight, and my future NumPy contributions are not dependent on staying at Quansight (not that I plan to go anywhere!). I'm guessing the same is true for others.
>
> Third, unumpy is a fairly thin layer over uarray, which already has another user in SciPy.

I'm sorry if that came across as some kind of snipe at Quansight
specifically. I didn't mean it that way. It's a much more general
concern: software projects are inherently risky, and often fail;
companies and research labs change focus and funding shifts around.
This is just a general risk that we need to take into account
when making decisions. And when there are proposals to add new
submodules to numpy, we always put them under intense scrutiny,
exactly because of the support commitments.

The new fft and random code are replacing/extending our existing
public APIs that we already committed to, so that's a very different
situation. And __array_function__ was something that couldn't work at
all without being built into numpy, and even then it was controversial
and merged on an experimental basis. It's always about trade-offs.
My concern here is that the NEP is proposing that the numpy maintainers
take on this large commitment, *and* AFAICT there's no compensating
benefit to justify that: everything that can be done with
numpy.overridable can be done just as well with a standalone unumpy
package... right?

-n

--
Nathaniel J. Smith -- https://vorpus.org

From charlesr.harris at gmail.com Fri Sep 6 20:42:04 2019
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Fri, 6 Sep 2019 18:42:04 -0600
Subject: [Numpy-discussion] NumPy 1.17.2 released.
Message-ID:

Hi All,

On behalf of the NumPy team I am pleased to announce that NumPy 1.17.2 has
been released. This release contains fixes for bugs reported against NumPy
1.17.1 along with some documentation improvements. The most important fix
is for lexsort when the keys are of type (u)int8 or (u)int16. If you are
currently using 1.17 you should upgrade.

The Python versions supported in this release are 3.5-3.7; Python 3.8b4
should work with the released source packages, but there are no future
guarantees. Downstream developers should use Cython >= 0.29.13 for Python
3.8 support and OpenBLAS >= 0.3.7 to avoid wrong results on the Skylake
architecture. The NumPy wheels on PyPI are built from the OpenBLAS
development branch in order to avoid those problems. Wheels for this
release can be downloaded from PyPI, source archives and release notes are
available from Github.

*Contributors*

A total of 7 people contributed to this release. People with a "+" by
their names contributed a patch for the first time.

- CakeWithSteak +
- Charles Harris
- Dan Allan
- Hameer Abbasi
- Lars Grueter
- Matti Picus
- Sebastian Berg

*Pull requests merged*

A total of 8 pull requests were merged for this release.

- #14418: BUG: Fix aradixsort indirect indexing.
- #14420: DOC: Fix a minor typo in dispatch documentation.
- #14421: BUG: test, fix regression in converting to ctypes
- #14430: BUG: Do not show Override module in private error classes.
- #14432: BUG: Fixed maximum relative error reporting in assert_allclose.
- #14433: BUG: Fix uint-overflow if padding with linear_ramp and negative...
- #14436: BUG: Update 1.17.x with 1.18.0-dev pocketfft.py.
- #14446: REL: Prepare for NumPy 1.17.2 release.

Cheers,
Charles Harris

From ralf.gommers at gmail.com Sat Sep 7 01:54:08 2019
From: ralf.gommers at gmail.com (Ralf Gommers)
Date: Fri, 6 Sep 2019 22:54:08 -0700
Subject: [Numpy-discussion] =?utf-8?q?NEP_31_=E2=80=94_Context-local_and_?=
 =?utf-8?q?global_overrides_of_the_NumPy_API?=
In-Reply-To: References: <67b837e7-5d3d-337e-49f7-cac078ec4d8f@gmail.com>
Message-ID:

On Fri, Sep 6, 2019 at 5:16 PM Nathaniel Smith wrote:

> On Fri, Sep 6, 2019 at 11:44 AM Ralf Gommers
> wrote:
> >
> >
> >
> > On Fri, Sep 6, 2019 at 1:32 AM Hameer Abbasi
> wrote:
> >>
> >> That's a lot of very good questions! Let me see if I can answer them
> one-by-one.
> >>
> >> On 06.09.19 09:49, Nathaniel Smith wrote:
> >>
> >> But even that could be accomplished by just
> >> putting something in the docs. And adding the alias has substantial
> >> risks: it makes unumpy tied to the numpy release cycle and
> >> compatibility rules, and it means that we're committing to maintaining
> >> unumpy ~forever even if Hameer or Quansight move onto other things.
> >> That seems like a lot to take on for such vague benefits?
> >>
> >> I can assure you Travis has had the goal of "replatforming SciPy" from
> as far back as I met him; he's spawned quite a few efforts in that
> direction along with others from Quansight (and they've led to nice
> projects). Quansight, as I see it, is unlikely to abandon something like
> this if it becomes successful (and acceptance of this NEP will be a huge
> success story).
> >
> >
> > Let me address this separately, since it's not really a technical
> concern.
> >
> > First, this is not what we say for other contributions. E.g. we didn't
> say no to Pocketfft because Martin Reinecke may move on, or
> __array_function__ because Stephan may get other interests at some point,
> or a whole new numpy.random, etc.
> >
> > Second, this is not about Quansight. At Quansight Labs we've been able
> to create time for Hameer to build this, and me and others to contribute -
> which is very nice, but the two are not tied inextricably together. In the
> end it's still individuals submitting this NEP. I have been a NumPy dev for
> ~10 years before joining Quansight, and my future NumPy contributions are
> not dependent on staying at Quansight (not that I plan to go anywhere!).
> I'm guessing the same is true for others.
> >
> > Third, unumpy is a fairly thin layer over uarray, which already has
> another user in SciPy.
>
> I'm sorry if that came across as some kind of snipe at Quansight
> specifically. I didn't mean it that way. It's a much more general
> concern: software projects are inherently risky, and often fail;
> companies and research labs change focus and funding shifts around.
> This is just a general risk that we need to take into account
> when making decisions. And when there are proposals to add new
> submodules to numpy, we always put them under intense scrutiny,
> exactly because of the support commitments.
>

Yes, that's fair, and we should be critical here. All code we accept is
indeed a maintenance burden.

> The new fft and random code are replacing/extending our existing
> public APIs that we already committed to, so that's a very different
> situation. And __array_function__ was something that couldn't work at
> all without being built into numpy, and even then it was controversial
> and merged on an experimental basis. It's always about trade-offs. My
> concern here is that the NEP is proposing that the numpy maintainers
> take on this large commitment,

Again, not just the NumPy maintainers. There really isn't that much in
`unumpy` that's all that complicated. And again, `uarray` has multiple
maintainers (note that Peter is also a SciPy core dev) and has another
user in SciPy.

*and* AFAICT there's no compensating
> benefit to justify that: everything that can be done with
> numpy.overridable can be done just as well with a standalone unumpy
> package... right?
>

True, mostly. But at that point, if we say that it's the way to do array
coercion, and creation (and perhaps some other things as well), we're
saying at the same time that every other package that needs this (e.g.
Dask, CuPy) should take unumpy as a hard dependency. Which is a much
bigger ask than when it comes with NumPy. We can discuss it of course.
The major exception is if we want to make it the default for some
functionality, for example numpy.fft (I'll answer your other email for
that).

Cheers,
Ralf
From ralf.gommers at gmail.com Sat Sep 7 02:04:02 2019
From: ralf.gommers at gmail.com (Ralf Gommers)
Date: Fri, 6 Sep 2019 23:04:02 -0700
Subject: [Numpy-discussion] =?utf-8?q?NEP_31_=E2=80=94_Context-local_and_?=
 =?utf-8?q?global_overrides_of_the_NumPy_API?=
In-Reply-To: References: <67b837e7-5d3d-337e-49f7-cac078ec4d8f@gmail.com>
Message-ID:

On Fri, Sep 6, 2019 at 4:51 PM Nathaniel Smith wrote:

> On Fri, Sep 6, 2019 at 2:45 PM Ralf Gommers
> wrote:
> > There may be another very concrete one (that's not yet in the NEP):
> allowing other libraries that consume ndarrays to use overrides. An example
> is numpy.fft: currently both mkl_fft and pyfftw monkeypatch NumPy,
> something we don't like all that much (in particular for mkl_fft, because
> it's the default in Anaconda). `__array_function__` isn't able to help
> here, because it will always choose NumPy's own implementation for ndarray
> input. With unumpy you can support multiple libraries that consume ndarrays.
>
> unumpy doesn't help with this either though, does it? unumpy is
> double-opt-in: the code using np.fft has to switch to using unumpy.fft
> instead, and then someone has to enable the backend.

Very good point. It would make a lot of sense to at least make unumpy
default on fft/linalg/random, even if we want to keep it opt-in for the
functions in the main namespace.

But MKL/pyfftw
> started out as opt-in - you could `import mkl_fft` or `import pyfftw`
> - and the whole reason they switched to monkeypatching is that they
> decided that opt-in wasn't good enough for them.
>

No, that's not correct. The MKL team has asked for a proper backend
system, so they can plug into numpy rather than monkeypatch it. Oleksey,
Chuck and I discussed that two years ago already at the NumFOCUS Summit
2017. This has been explicitly on the NumPy roadmap for quite a while:
"A backend system for numpy.fft (so that e.g. fft-mkl doesn't need to
monkeypatch numpy)" (see
https://numpy.org/neps/roadmap.html#other-functionality)

And if Anaconda would like to default to it, that's possible - because one
registered backend needs to be chosen as the default, that could be
mkl-fft. That is still a major improvement over the situation today.

> >> The import numpy.overridable part is meant to help garner adoption, and
> to prefer the unumpy module if it is available (which will continue to be
> developed separately). That way it isn't so tightly coupled to the release
> cycle. One alternative Sebastian Berg mentioned (and I am on board with) is
> just moving unumpy into the NumPy organisation. What we fear keeping it
> separate is that the simple act of a pip install unumpy will keep people
> from using it or trying it out.
> >
> > Note that this is not the most critical aspect. I pushed for vendoring
> as numpy.overridable because I want to not derail the comparison with NEP
> 30 et al. with a "should we add a dependency" discussion. The interesting
> part to decide on first is: do we need the unumpy override mechanism?
> Vendoring opt-in vs. making it default vs. adding a dependency is of
> secondary interest right now.
>
> Wait, but I thought the only reason we would have a dependency is if
> we're exporting it as part of the numpy namespace. If we keep the
> import as `import unumpy`, then it works just as well, without any
> dependency *or* vendoring in numpy, right?
>

Vendoring means "include the code". So no dependency on an external
package.
If we don't vendor, it's going to be either unused or end up as a
dependency for the whole SciPy/PyData stack.

Actually, now that we've discussed the fft issue, I'd suggest changing
the NEP to: vendor, and make it the default for fft, random, and linalg.

Cheers,
Ralf

From PeterBell10 at live.co.uk Sat Sep 7 05:32:41 2019
From: PeterBell10 at live.co.uk (Peter Bell)
Date: Sat, 7 Sep 2019 09:32:41 +0000
Subject: [Numpy-discussion] =?windows-1252?q?NEP_31_=97_Context-local_and?=
 =?windows-1252?q?_global_overrides_of_the_NumPy_API?=
In-Reply-To: References: <67b837e7-5d3d-337e-49f7-cac078ec4d8f@gmail.com>
Message-ID:

>> There may be another very concrete one (that's not yet in the NEP):
allowing other libraries that consume ndarrays to use overrides. An
example is numpy.fft: currently both mkl_fft and pyfftw monkeypatch NumPy,
something we don't like all that much (in particular for mkl_fft, because
it's the default in Anaconda). `__array_function__` isn't able to help
here, because it will always choose NumPy's own implementation for ndarray
input. With unumpy you can support multiple libraries that consume
ndarrays.

> unumpy doesn't help with this either though, does it? unumpy is
double-opt-in: the code using np.fft has to switch to using unumpy.fft
instead, and then someone has to enable the backend. But MKL/pyfftw
started out as opt-in - you could `import mkl_fft` or `import pyfftw`
- and the whole reason they switched to monkeypatching is that they
decided that opt-in wasn't good enough for them.

Because numpy functions are used to write many library functions, the end
user isn't always able to opt in by changing imports. So, for library
functions, monkey patching is not simply convenient but actually
necessary.

Take for example scipy.signal.fftconvolve: SciPy can't change to pyfftw
for licensing reasons, so with SciPy < 1.4 your only option is to monkey
patch scipy.fftpack and numpy.fft. However in SciPy >= 1.4, thanks to the
uarray-based backend support in scipy.fft, I can write

import numpy as np
from scipy import fft, signal
import pyfftw.interfaces.scipy_fft as pyfftw_fft

x = np.random.randn(1024, 1024)

with fft.set_backend(pyfftw_fft):
    y = signal.fftconvolve(x, x)  # Calls pyfftw's rfft, irfft

Yes, we had to opt in in the library function (signal moved from
scipy.fftpack to scipy.fft). But because there can be distance between the
set_backend call and the FFT calls, the library is now much more
configurable. Generally speaking, any library written to use unumpy would
be configurable: (i) by the user, (ii) at runtime, (iii) without changing
library code, and (iv) without monkey patching.

In scipy.fft I actually did it slightly differently than unumpy: the
scipy.fft interface itself has the uarray dispatch and I set SciPy's
version of pocketfft as the default global backend. This means that normal
users don't need to set a backend, and thus don't need to opt in in any
way. For NumPy to follow this pattern as well would require more change to
NumPy's code base than the current NEP's suggestion, mainly in separating
the interface from the implementation that would become the default
backend.
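To make that concrete, a backend here is just an object with a domain
string and a dispatch hook, as in the uarray protocol. A minimal sketch
(MyFFTBackend and the print are made up for illustration; returning
NotImplemented defers to the next backend, i.e. SciPy's own pocketfft):

import numpy as np
import scipy.fft

class MyFFTBackend:
    # uarray routes every scipy.fft call in this domain through
    # __ua_function__
    __ua_domain__ = 'numpy.scipy.fft'

    @staticmethod
    def __ua_function__(method, args, kwargs):
        print('intercepted', method.__name__)
        return NotImplemented  # defer to the default backend

x = np.random.randn(64)
with scipy.fft.set_backend(MyFFTBackend):
    y = scipy.fft.fft(x)  # prints 'intercepted fft'; pocketfft computes it

A real backend would return its own result instead of NotImplemented, and
could also implement __ua_convert__ to coerce non-ndarray inputs.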
- Peter

From sebastian at sipsolutions.net Sat Sep 7 16:06:29 2019
From: sebastian at sipsolutions.net (Sebastian Berg)
Date: Sat, 07 Sep 2019 15:06:29 -0500
Subject: [Numpy-discussion] =?utf-8?q?NEP_31_=E2=80=94_Context-local_and_?=
 =?utf-8?q?global_overrides_of_the_NumPy_API?=
In-Reply-To: References: <67b837e7-5d3d-337e-49f7-cac078ec4d8f@gmail.com>
Message-ID:

On Fri, 2019-09-06 at 14:45 -0700, Ralf Gommers wrote:
>
>
> > That's part of it. The concrete problems it's solving are
> > threefold:
> > Array creation functions can be overridden.
> > Array coercion is now covered.
> > "Default implementations" will allow you to re-write your NumPy
> > array more easily, when such efficient implementations exist in
> > terms of other NumPy functions. That will also help achieve similar
> > semantics, but as I said, they're just "default"...
> >
>
> There may be another very concrete one (that's not yet in the NEP):
> allowing other libraries that consume ndarrays to use overrides. An
> example is numpy.fft: currently both mkl_fft and pyfftw monkeypatch
> NumPy, something we don't like all that much (in particular for
> mkl_fft, because it's the default in Anaconda). `__array_function__`
> isn't able to help here, because it will always choose NumPy's own
> implementation for ndarray input. With unumpy you can support
> multiple libraries that consume ndarrays.
>
> Another example is einsum: if you want to use opt_einsum for all
> inputs (including ndarrays), then you cannot use np.einsum. And yet
> another is using bottleneck (
> https://kwgoodman.github.io/bottleneck-doc/reference.html) for nan-
> functions and partition. There's likely more of these.
>
> The point is: sometimes the array protocols are preferred (e.g.
> Dask/Xarray-style meta-arrays), sometimes unumpy-style dispatch works
> better. It's also not necessarily an either or, they can be
> complementary.
>

Let me try to move the discussion from the github issue here (this may
not be the best place). (https://github.com/numpy/numpy/issues/14441
which asked for easier creation functions together with
`__array_function__`).

I think an important note mentioned here is how users interact with
unumpy, vs. __array_function__. The former is an explicit opt-in, while
the latter is implicit choice based on an `array-like` abstract base
class and functional type based dispatching.

To quote NEP 18 on this: "The downsides are that this would require an
explicit opt-in from all existing code, e.g., import numpy.api as np,
and in the long term would result in the maintenance of two separate
NumPy APIs. Also, many functions from numpy itself are already
overloaded (but inadequately), so confusion about high vs. low level
APIs in NumPy would still persist."
(I do think this is a point we should not just ignore, `uarray` is a
thin layer, but it has a big surface area)

Now there are things where explicit opt-in is obvious. And the FFT
example is one of those: there is no way to implicitly choose another
backend (except by just replacing it, i.e. monkeypatching) [1]. And
right now I think these are _very_ different.

Now, for end-users, choosing one array-like over another seems nicer
as an implicit mechanism (why should I not mix sparse, dask and numpy
arrays!?). This is the promise `__array_function__` tries to make.
Unless convinced otherwise, my guess is that most library authors would
strive for implicit support (i.e. sklearn, skimage, scipy).

Circling back to creation and coercion.
In a purely Object type system, these would be classmethods, I guess, but
in NumPy and the libraries above, we are lost.

Solution 1: Create explicit opt-in, e.g. through uarray. (NEP-31)
  * Required end-user opt-in.
  * Seems cleaner in many ways
  * Requires a full copy of the API.

Solution 2: Add some coercion "protocol" (NEP-30) and expose a way to
create new arrays more conveniently. This would practically mean adding
an `array_type=np.ndarray` argument.
  * _Not_ used by end-users! End users should use dask.linspace!
  * Adds "strange" API somewhere in numpy, and possibly a new
    "protocol" (in addition to coercion).[2]

I still feel these solve different issues. The second one is intended
to make array likes work implicitly in libraries (without end users
having to do anything). While the first seems to force the end user to
opt in, sometimes unnecessarily:

def my_library_func(array_like):
    exp = np.exp(array_like)
    idx = np.arange(len(exp))
    return idx, exp

Would have all the information for implicit opt-in/Array-like support,
but cannot do it right now. This is what I have been wondering: whether
uarray/unumpy can in some way help me make this work (even _without_
the end user opting in). The reason is simply that, right now, I am very
clear on the need for this use case, but not sure about the need for end
user opt in, since end users can just use dask.arange().

Cheers,

Sebastian

[1] To be honest, I do think a lot of the "issues" around monkeypatching
exist just as much with backend choosing, the main difference seems to me
that a lot of that:
1. monkeypatching was not done explicitly
   (import mkl_fft; mkl_fft.monkeypatch_numpy())?
2. A backend system allows libraries to prefer one locally?
   (which I think is a big advantage)

[2] There are the options of adding `linspace_like` functions somewhere
in a numpy submodule, or adding `linspace(..., array_type=np.ndarray)`,
or simply inventing a new "protocol" (which is not really a protocol?),
and making it `ndarray.__numpy_like_creation_functions__.arange()`.

> Actually, after writing this I just realized something. With 1.17.x
> we have:
>
> ```
> In [1]: import dask.array as da
>
> In [2]: d = da.from_array(np.linspace(0, 1))
>
> In [3]: np.fft.fft(d)
> Out[3]: dask.array<..., chunksize=(50,)>
> ```
>
> In Anaconda `np.fft.fft` *is* `mkl_fft._numpy_fft.fft`, so this won't
> work. We have no bug report yet because 1.17.x hasn't landed in conda
> defaults yet (perhaps this is a/the reason why?), but it will be a
> problem.
>
> > The import numpy.overridable part is meant to help garner adoption,
> > and to prefer the unumpy module if it is available (which will
> > continue to be developed separately). That way it isn't so tightly
> > coupled to the release cycle. One alternative Sebastian Berg
> > mentioned (and I am on board with) is just moving unumpy into the
> > NumPy organisation. What we fear keeping it separate is that the
> > simple act of a pip install unumpy will keep people from using it
> > or trying it out.
>
> Note that this is not the most critical aspect. I pushed for
> vendoring as numpy.overridable because I want to not derail the
> comparison with NEP 30 et al. with a "should we add a dependency"
> discussion. The interesting part to decide on first is: do we need
> the unumpy override mechanism? Vendoring opt-in vs. making it default
> vs. adding a dependency is of secondary interest right now.
>
> Cheers,
> Ralf
>
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at python.org
> https://mail.python.org/mailman/listinfo/numpy-discussion

From ralf.gommers at gmail.com Sat Sep 7 16:33:35 2019
From: ralf.gommers at gmail.com (Ralf Gommers)
Date: Sat, 7 Sep 2019 13:33:35 -0700
Subject: [Numpy-discussion] =?utf-8?q?NEP_31_=E2=80=94_Context-local_and_?=
 =?utf-8?q?global_overrides_of_the_NumPy_API?=
In-Reply-To: References: <67b837e7-5d3d-337e-49f7-cac078ec4d8f@gmail.com>
Message-ID:

On Sat, Sep 7, 2019 at 1:07 PM Sebastian Berg wrote:

> On Fri, 2019-09-06 at 14:45 -0700, Ralf Gommers wrote:
> >
> >
> > > That's part of it. The concrete problems it's solving are
> > > threefold:
> > > Array creation functions can be overridden.
> > > Array coercion is now covered.
> > > "Default implementations" will allow you to re-write your NumPy
> > > array more easily, when such efficient implementations exist in
> > > terms of other NumPy functions. That will also help achieve similar
> > > semantics, but as I said, they're just "default"...
> > >
> >
> > There may be another very concrete one (that's not yet in the NEP):
> > allowing other libraries that consume ndarrays to use overrides. An
> > example is numpy.fft: currently both mkl_fft and pyfftw monkeypatch
> > NumPy, something we don't like all that much (in particular for
> > mkl_fft, because it's the default in Anaconda). `__array_function__`
> > isn't able to help here, because it will always choose NumPy's own
> > implementation for ndarray input. With unumpy you can support
> > multiple libraries that consume ndarrays.
> >
> > Another example is einsum: if you want to use opt_einsum for all
> > inputs (including ndarrays), then you cannot use np.einsum. And yet
> > another is using bottleneck (
> > https://kwgoodman.github.io/bottleneck-doc/reference.html) for nan-
> > functions and partition. There's likely more of these.
> >
> > The point is: sometimes the array protocols are preferred (e.g.
> > Dask/Xarray-style meta-arrays), sometimes unumpy-style dispatch works
> > better. It's also not necessarily an either or, they can be
> > complementary.
> >
>
> Let me try to move the discussion from the github issue here (this may
> not be the best place). (https://github.com/numpy/numpy/issues/14441
> which asked for easier creation functions together with
> `__array_function__`).
>
> I think an important note mentioned here is how users interact with
> unumpy, vs. __array_function__. The former is an explicit opt-in, while
> the latter is implicit choice based on an `array-like` abstract base
> class and functional type based dispatching.
>
> To quote NEP 18 on this: "The downsides are that this would require an
> explicit opt-in from all existing code, e.g., import numpy.api as np,
> and in the long term would result in the maintenance of two separate
> NumPy APIs. Also, many functions from numpy itself are already
> overloaded (but inadequately), so confusion about high vs. low level
> APIs in NumPy would still persist."
> (I do think this is a point we should not just ignore, `uarray` is a
> thin layer, but it has a big surface area)
>
> Now there are things where explicit opt-in is obvious.
And the FFT
> example is one of those: there is no way to implicitly choose another
> backend (except by just replacing it, i.e. monkeypatching) [1]. And
> right now I think these are _very_ different.
>
> Now, for end-users, choosing one array-like over another seems nicer
> as an implicit mechanism (why should I not mix sparse, dask and numpy
> arrays!?). This is the promise `__array_function__` tries to make.
> Unless convinced otherwise, my guess is that most library authors would
> strive for implicit support (i.e. sklearn, skimage, scipy).
>
> Circling back to creation and coercion. In a purely Object type system,
> these would be classmethods, I guess, but in NumPy and the libraries
> above, we are lost.
>
> Solution 1: Create explicit opt-in, e.g. through uarray. (NEP-31)
>   * Required end-user opt-in.
>   * Seems cleaner in many ways
>   * Requires a full copy of the API.

Bullets 1 and 3 are not required. If we decide to make it default, then
there's no separate namespace.

> Solution 2: Add some coercion "protocol" (NEP-30) and expose a way to
> create new arrays more conveniently. This would practically mean adding
> an `array_type=np.ndarray` argument.
>   * _Not_ used by end-users! End users should use dask.linspace!
>   * Adds "strange" API somewhere in numpy, and possibly a new
>     "protocol" (in addition to coercion).[2]
>
> I still feel these solve different issues. The second one is intended
> to make array likes work implicitly in libraries (without end users
> having to do anything). While the first seems to force the end user to
> opt in, sometimes unnecessarily:
>
> def my_library_func(array_like):
>     exp = np.exp(array_like)
>     idx = np.arange(len(exp))
>     return idx, exp
>
> Would have all the information for implicit opt-in/Array-like support,
> but cannot do it right now.

Can you explain this a bit more? `len(exp)` is a number, so
`np.arange(number)` doesn't really have any information here.

> This is what I have been wondering: whether
> uarray/unumpy can in some way help me make this work (even _without_
> the end user opting in).

Good question. If that needs to work in the absence of the user doing
anything, it should be something like

with unumpy.determine_backend(exp):
    unumpy.arange(len(exp))  # or np.arange if we make unumpy default

to get the equivalent to `np.arange_like(len(exp), array_type=exp)`.

Note: that `determine_backend` thing doesn't exist today.

> The reason is simply that, right now, I am very
> clear on the need for this use case, but not sure about the need for
> end user opt in, since end users can just use dask.arange().

I don't get the last part. The arange is inside a library function, so a
user can't just go in and change things there.

Cheers,
Ralf

> Cheers,
>
> Sebastian
>
> [1] To be honest, I do think a lot of the "issues" around
> monkeypatching exist just as much with backend choosing, the main
> difference seems to me that a lot of that:
> 1. monkeypatching was not done explicitly
>    (import mkl_fft; mkl_fft.monkeypatch_numpy())?
> 2. A backend system allows libraries to prefer one locally?
>    (which I think is a big advantage)
>
> [2] There are the options of adding `linspace_like` functions somewhere
> in a numpy submodule, or adding `linspace(..., array_type=np.ndarray)`,
> or simply inventing a new "protocol" (which is not really a protocol?),
> and making it `ndarray.__numpy_like_creation_functions__.arange()`.
>
> > Actually, after writing this I just realized something.
> > With 1.17.x we have:
> >
> > ```
> > In [1]: import dask.array as da
> >
> > In [2]: d = da.from_array(np.linspace(0, 1))
> >
> > In [3]: np.fft.fft(d)
> > Out[3]: dask.array<..., chunksize=(50,)>
> > ```
> >
> > In Anaconda `np.fft.fft` *is* `mkl_fft._numpy_fft.fft`, so this won't
> > work. We have no bug report yet because 1.17.x hasn't landed in conda
> > defaults yet (perhaps this is a/the reason why?), but it will be a
> > problem.
> >
> > > The import numpy.overridable part is meant to help garner adoption,
> > > and to prefer the unumpy module if it is available (which will
> > > continue to be developed separately). That way it isn't so tightly
> > > coupled to the release cycle. One alternative Sebastian Berg
> > > mentioned (and I am on board with) is just moving unumpy into the
> > > NumPy organisation. What we fear keeping it separate is that the
> > > simple act of a pip install unumpy will keep people from using it
> > > or trying it out.
> >
> > Note that this is not the most critical aspect. I pushed for
> > vendoring as numpy.overridable because I want to not derail the
> > comparison with NEP 30 et al. with a "should we add a dependency"
> > discussion. The interesting part to decide on first is: do we need
> > the unumpy override mechanism? Vendoring opt-in vs. making it default
> > vs. adding a dependency is of secondary interest right now.
> >
> > Cheers,
> > Ralf
> >
> > _______________________________________________
> > NumPy-Discussion mailing list
> > NumPy-Discussion at python.org
> > https://mail.python.org/mailman/listinfo/numpy-discussion
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at python.org
> https://mail.python.org/mailman/listinfo/numpy-discussion

From sebastian at sipsolutions.net Sat Sep 7 17:17:57 2019
From: sebastian at sipsolutions.net (sebastian)
Date: Sat, 07 Sep 2019 16:17:57 -0500
Subject: [Numpy-discussion] =?utf-8?q?NEP_31_=E2=80=94_Context-local_and_?=
 =?utf-8?q?global_overrides_of_the_NumPy_API?=
In-Reply-To: References: <67b837e7-5d3d-337e-49f7-cac078ec4d8f@gmail.com>
Message-ID:

On 2019-09-07 15:33, Ralf Gommers wrote:
> On Sat, Sep 7, 2019 at 1:07 PM Sebastian Berg
> wrote:
>
>> On Fri, 2019-09-06 at 14:45 -0700, Ralf Gommers wrote:
>>>
>>>> That's part of it. The concrete problems it's solving are
>>>> threefold:
>>>> Array creation functions can be overridden.
>>>> Array coercion is now covered.
>>>> "Default implementations" will allow you to re-write your NumPy
>>>> array more easily, when such efficient implementations exist in
>>>> terms of other NumPy functions. That will also help achieve
>> similar
>>>> semantics, but as I said, they're just "default"...
>>>>
>>>
>>> There may be another very concrete one (that's not yet in the
>> NEP):
>>> allowing other libraries that consume ndarrays to use overrides.
>> An
>>> example is numpy.fft: currently both mkl_fft and pyfftw
>> monkeypatch
>>> NumPy, something we don't like all that much (in particular for
>>> mkl_fft, because it's the default in Anaconda).
>> `__array_function__`
>>> isn't able to help here, because it will always choose NumPy's own
>>> implementation for ndarray input. With unumpy you can support
>>> multiple libraries that consume ndarrays.
>>>
>>> Another example is einsum: if you want to use opt_einsum for all
>>> inputs (including ndarrays), then you cannot use np.einsum.
And >> yet >>> another is using bottleneck ( >>> https://kwgoodman.github.io/bottleneck-doc/reference.html) for >> nan- >>> functions and partition. There's likely more of these. >>> >>> The point is: sometimes the array protocols are preferred (e.g. >>> Dask/Xarray-style meta-arrays), sometimes unumpy-style dispatch >> works >>> better. It's also not necessarily an either or, they can be >>> complementary. >>> >> >> Let me try to move the discussion from the github issue here (this >> may >> not be the best place). (https://github.com/numpy/numpy/issues/14441 >> which asked for easier creation functions together with >> `__array_function__`). >> >> I think an important note mentioned here is how users interact with >> unumpy, vs. __array_function__. The former is an explicit opt-in, >> while >> the latter is implicit choice based on an `array-like` abstract base >> class and functional type based dispatching. >> >> To quote NEP 18 on this: "The downsides are that this would require >> an >> explicit opt-in from all existing code, e.g., import numpy.api as >> np, >> and in the long term would result in the maintenance of two separate >> NumPy APIs. Also, many functions from numpy itself are already >> overloaded (but inadequately), so confusion about high vs. low level >> APIs in NumPy would still persist." >> (I do think this is a point we should not just ignore, `uarray` is a >> thin layer, but it has a big surface area) >> >> Now there are things where explicit opt-in is obvious. And the FFT >> example is one of those, there is no way to implicitly choose >> another >> backend (except by just replacing it, i.e. monkeypatching) [1]. And >> right now I think these are _very_ different. >> >> Now for the end-users choosing one array-like over another, seems >> nicer >> as an implicit mechanism (why should I not mix sparse, dask and >> numpy >> arrays!?). This is the promise `__array_function__` tries to make. >> Unless convinced otherwise, my guess is that most library authors >> would >> strive for implicit support (i.e. sklearn, skimage, scipy). >> >> Circling back to creation and coercion. In a purely Object type >> system, >> these would be classmethods, I guess, but in NumPy and the libraries >> above, we are lost. >> >> Solution 1: Create explicit opt-in, e.g. through uarray. (NEP-31) >> * Required end-user opt-in. > >> * Seems cleaner in many ways >> * Requires a full copy of the API. > > bullet 1 and 3 are not required. if we decide to make it default, then > there's no separate namespace It does require explicit opt-in to have any benefits to the user. > >> Solution 2: Add some coercion "protocol" (NEP-30) and expose a way >> to >> create new arrays more conveniently. This would practically mean >> adding >> an `array_type=np.ndarray` argument. >> * _Not_ used by end-users! End users should use dask.linspace! >> * Adds "strange" API somewhere in numpy, and possible a new >> "protocol" (additionally to coercion).[2] >> >> I still feel these solve different issues. The second one is >> intended >> to make array likes work implicitly in libraries (without end users >> having to do anything). While the first seems to force the end user >> to >> opt in, sometimes unnecessarily: >> >> def my_library_func(array_like): >> exp = np.exp(array_like) >> idx = np.arange(len(exp)) >> return idx, exp >> >> Would have all the information for implicit opt-in/Array-like >> support, >> but cannot do it right now. > > Can you explain this a bit more? 
`len(exp)` is a number, so > `np.arange(number)` doesn't really have any information here. > Right, but as a library author, I want a way a way to make it use the same type as `array_like` in this particular function, that is the point! The end-user already signaled they prefer say dask, due to the array that was actually passed in. (but this is just repeating what is below I think). >> This is what I have been wondering, if >> uarray/unumpy, can in some way help me make this work (even >> _without_ >> the end user opting in). > > good question. if that needs to work in the absence of the user doing > anything, it should be something like > > with unumpy.determine_backend(exp): > unumpy.arange(len(exp)) # or np.arange if we make unumpy default > > to get the equivalent to `np.arange_like(len(exp), array_type=exp)`. > > Note, that `determine_backend` thing doesn't exist today. > Exactly, that is what I have been wondering about, there may be more issues around that. If it existed, we may be able to solve the implicit library usage by making libraries use unumpy (or similar). Although, at that point we half replace `__array_function__` maybe. However, the main point is that without such a functionality, NEP 30 and NEP 31 seem to solve slightly different issues with respect to how they interact with the end-user (opt in)? We may decide that we do not want to solve the library users issue of wanting to support implicit opt-in for array like inputs because it is a rabbit hole. But we may need to discuss/argue a bit more that it really is a deep enough rabbit hole that it is not worth the trouble. >> The reason is that simply, right now I am very >> clear on the need for this use case, but not sure about the need for >> end user opt in, since end users can just use dask.arange(). > > I don't get the last part. The arange is inside a library function, so > a user can't just go in and change things there. A "user" here means "end user". An end user writes a script, and they can easily change `arr = np.linspace(10)` to `arr = dask.linspace(10)`, or more likely just use one within one script and the other within another script, while both use the same sklearn functions. (Although using a backend switching may be nicer in some contexts) A library provider (library user of unumpy/numpy) of course cannot just use dask conveniently, unless they write their own `guess_numpy_like_module()` function first. > Cheers, > > Ralf > >> Cheers, >> >> Sebastian >> >> [1] To be honest, I do think a lot of the "issues" around >> monkeypatching exists just as much with backend choosing, the main >> difference seems to me that a lot of that: >> 1. monkeypatching was not done explicit >> (import mkl_fft; mkl_fft.monkeypatch_numpy())? >> 2. A backend system allows libaries to prefer one locally? >> (which I think is a big advantage) >> >> [2] There are the options of adding `linspace_like` functions >> somewhere >> in a numpy submodule, or adding `linspace(..., >> array_type=np.ndarray)`, >> or simply inventing a new "protocl" (which is not really a >> protocol?), >> and make it `ndarray.__numpy_like_creation_functions__.arange()`. >> >>> Actually, after writing this I just realized something. With >> 1.17.x >>> we have: >>> >>> ``` >>> In [1]: import dask.array as da >> >>> >>> >>> In [2]: d = da.from_array(np.linspace(0, 1)) >> >>> >>> >>> In [3]: np.fft.fft(d) >> >>> >>> Out[3]: dask.array>> chunksize=(50,)> >>> ``` >>> >>> In Anaconda `np.fft.fft` *is* `mkl_fft._numpy_fft.fft`, so this >> won't >>> work. 
We have no bug report yet because 1.17.x hasn't landed in >> conda >>> defaults yet (perhaps this is a/the reason why?), but it will be a >>> problem. >>> >>>> The import numpy.overridable part is meant to help garner >> adoption, >>>> and to prefer the unumpy module if it is available (which will >>>> continue to be developed separately). That way it isn't so >> tightly >>>> coupled to the release cycle. One alternative Sebastian Berg >>>> mentioned (and I am on board with) is just moving unumpy into >> the >>>> NumPy organisation. What we fear keeping it separate is that the >>>> simple act of a pip install unumpy will keep people from using >> it >>>> or trying it out. >>>> >>> Note that this is not the most critical aspect. I pushed for >>> vendoring as numpy.overridable because I want to not derail the >>> comparison with NEP 30 et al. with a "should we add a dependency" >>> discussion. The interesting part to decide on first is: do we need >>> the unumpy override mechanism? Vendoring opt-in vs. making it >> default >>> vs. adding a dependency is of secondary interest right now. >>> >>> Cheers, >>> Ralf >>> >>> >>> >>> _______________________________________________ >>> NumPy-Discussion mailing list >>> NumPy-Discussion at python.org >>> https://mail.python.org/mailman/listinfo/numpy-discussion >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at python.org >> https://mail.python.org/mailman/listinfo/numpy-discussion > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion From ralf.gommers at gmail.com Sat Sep 7 17:49:07 2019 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Sat, 7 Sep 2019 14:49:07 -0700 Subject: [Numpy-discussion] =?utf-8?q?NEP_31_=E2=80=94_Context-local_and_?= =?utf-8?q?global_overrides_of_the_NumPy_API?= In-Reply-To: References: <67b837e7-5d3d-337e-49f7-cac078ec4d8f@gmail.com> Message-ID: On Sat, Sep 7, 2019 at 2:18 PM sebastian wrote: > On 2019-09-07 15:33, Ralf Gommers wrote: > > On Sat, Sep 7, 2019 at 1:07 PM Sebastian Berg > > wrote: > > > >> On Fri, 2019-09-06 at 14:45 -0700, Ralf Gommers wrote: > >>> > >>> > >> > >> > >>>> That's part of it. The concrete problems it's solving are > >>>> threefold: > >>>> Array creation functions can be overridden. > >>>> Array coercion is now covered. > >>>> "Default implementations" will allow you to re-write your NumPy > >>>> array more easily, when such efficient implementations exist in > >>>> terms of other NumPy functions. That will also help achieve > >> similar > >>>> semantics, but as I said, they're just "default"... > >>>> > >>> > >>> There may be another very concrete one (that's not yet in the > >> NEP): > >>> allowing other libraries that consume ndarrays to use overrides. > >> An > >>> example is numpy.fft: currently both mkl_fft and pyfftw > >> monkeypatch > >>> NumPy, something we don't like all that much (in particular for > >>> mkl_fft, because it's the default in Anaconda). > >> `__array_function__` > >>> isn't able to help here, because it will always choose NumPy's own > >>> implementation for ndarray input. With unumpy you can support > >>> multiple libraries that consume ndarrays. > >>> > >>> Another example is einsum: if you want to use opt_einsum for all > >>> inputs (including ndarrays), then you cannot use np.einsum. 
And > >> yet > >>> another is using bottleneck ( > >>> https://kwgoodman.github.io/bottleneck-doc/reference.html) for > >> nan- > >>> functions and partition. There's likely more of these. > >>> > >>> The point is: sometimes the array protocols are preferred (e.g. > >>> Dask/Xarray-style meta-arrays), sometimes unumpy-style dispatch > >> works > >>> better. It's also not necessarily an either or, they can be > >>> complementary. > >>> > >> > >> Let me try to move the discussion from the github issue here (this > >> may > >> not be the best place). (https://github.com/numpy/numpy/issues/14441 > >> which asked for easier creation functions together with > >> `__array_function__`). > >> > >> I think an important note mentioned here is how users interact with > >> unumpy, vs. __array_function__. The former is an explicit opt-in, > >> while > >> the latter is implicit choice based on an `array-like` abstract base > >> class and functional type based dispatching. > >> > >> To quote NEP 18 on this: "The downsides are that this would require > >> an > >> explicit opt-in from all existing code, e.g., import numpy.api as > >> np, > >> and in the long term would result in the maintenance of two separate > >> NumPy APIs. Also, many functions from numpy itself are already > >> overloaded (but inadequately), so confusion about high vs. low level > >> APIs in NumPy would still persist." > >> (I do think this is a point we should not just ignore, `uarray` is a > >> thin layer, but it has a big surface area) > >> > >> Now there are things where explicit opt-in is obvious. And the FFT > >> example is one of those, there is no way to implicitly choose > >> another > >> backend (except by just replacing it, i.e. monkeypatching) [1]. And > >> right now I think these are _very_ different. > >> > >> Now for the end-users choosing one array-like over another, seems > >> nicer > >> as an implicit mechanism (why should I not mix sparse, dask and > >> numpy > >> arrays!?). This is the promise `__array_function__` tries to make. > >> Unless convinced otherwise, my guess is that most library authors > >> would > >> strive for implicit support (i.e. sklearn, skimage, scipy). > >> > >> Circling back to creation and coercion. In a purely Object type > >> system, > >> these would be classmethods, I guess, but in NumPy and the libraries > >> above, we are lost. > >> > >> Solution 1: Create explicit opt-in, e.g. through uarray. (NEP-31) > >> * Required end-user opt-in. > > > >> * Seems cleaner in many ways > >> * Requires a full copy of the API. > > > > bullet 1 and 3 are not required. if we decide to make it default, then > > there's no separate namespace > > It does require explicit opt-in to have any benefits to the user. > > > > >> Solution 2: Add some coercion "protocol" (NEP-30) and expose a way > >> to > >> create new arrays more conveniently. This would practically mean > >> adding > >> an `array_type=np.ndarray` argument. > >> * _Not_ used by end-users! End users should use dask.linspace! > >> * Adds "strange" API somewhere in numpy, and possible a new > >> "protocol" (additionally to coercion).[2] > >> > >> I still feel these solve different issues. The second one is > >> intended > >> to make array likes work implicitly in libraries (without end users > >> having to do anything). 
While the first seems to force the end user > >> to > >> opt in, sometimes unnecessarily: > >> > >> def my_library_func(array_like): > >> exp = np.exp(array_like) > >> idx = np.arange(len(exp)) > >> return idx, exp > >> > >> Would have all the information for implicit opt-in/Array-like > >> support, > >> but cannot do it right now. > > > > Can you explain this a bit more? `len(exp)` is a number, so > > `np.arange(number)` doesn't really have any information here. > > > > Right, but as a library author, I want a way a way to make it use the > same type as `array_like` in this particular function, that is the > point! The end-user already signaled they prefer say dask, due to the > array that was actually passed in. (but this is just repeating what is > below I think). > Okay, you meant conceptually:) > >> This is what I have been wondering, if > >> uarray/unumpy, can in some way help me make this work (even > >> _without_ > >> the end user opting in). > > > > good question. if that needs to work in the absence of the user doing > > anything, it should be something like > > > > with unumpy.determine_backend(exp): > > unumpy.arange(len(exp)) # or np.arange if we make unumpy default > > > > to get the equivalent to `np.arange_like(len(exp), array_type=exp)`. > > > > Note, that `determine_backend` thing doesn't exist today. > > > > Exactly, that is what I have been wondering about, there may be more > issues around that. > If it existed, we may be able to solve the implicit library usage by > making libraries use > unumpy (or similar). Although, at that point we half replace > `__array_function__` maybe. > I don't really think so. Libraries can/will still use __array_function__ for most functionality, and just add a `with determine_backend` for the places where __array_function__ doesn't work. > However, the main point is that without such a functionality, NEP 30 and > NEP 31 seem to solve slightly > different issues with respect to how they interact with the end-user > (opt in)? > Yes, I agree with that. Cheers, Ralf > > We may decide that we do not want to solve the library users issue of > wanting to support implicit > opt-in for array like inputs because it is a rabbit hole. But we may > need to discuss/argue a bit > more that it really is a deep enough rabbit hole that it is not worth > the trouble. > > >> The reason is that simply, right now I am very > >> clear on the need for this use case, but not sure about the need for > >> end user opt in, since end users can just use dask.arange(). > > > > I don't get the last part. The arange is inside a library function, so > > a user can't just go in and change things there. > > A "user" here means "end user". An end user writes a script, and they > can easily change > `arr = np.linspace(10)` to `arr = dask.linspace(10)`, or more likely > just use one within one > script and the other within another script, while both use the same > sklearn functions. > (Although using a backend switching may be nicer in some contexts) > > A library provider (library user of unumpy/numpy) of course cannot just > use dask conveniently, > unless they write their own `guess_numpy_like_module()` function first. > > > > Cheers, > > > > Ralf > > > >> Cheers, > >> > >> Sebastian > >> > >> [1] To be honest, I do think a lot of the "issues" around > >> monkeypatching exists just as much with backend choosing, the main > >> difference seems to me that a lot of that: > >> 1. monkeypatching was not done explicit > >> (import mkl_fft; mkl_fft.monkeypatch_numpy())? > >> 2. 
A backend system allows libaries to prefer one locally? > >> (which I think is a big advantage) > >> > >> [2] There are the options of adding `linspace_like` functions > >> somewhere > >> in a numpy submodule, or adding `linspace(..., > >> array_type=np.ndarray)`, > >> or simply inventing a new "protocl" (which is not really a > >> protocol?), > >> and make it `ndarray.__numpy_like_creation_functions__.arange()`. > >> > >>> Actually, after writing this I just realized something. With > >> 1.17.x > >>> we have: > >>> > >>> ``` > >>> In [1]: import dask.array as da > >> > >>> > >>> > >>> In [2]: d = da.from_array(np.linspace(0, 1)) > >> > >>> > >>> > >>> In [3]: np.fft.fft(d) > >> > >>> > >>> Out[3]: dask.array >>> chunksize=(50,)> > >>> ``` > >>> > >>> In Anaconda `np.fft.fft` *is* `mkl_fft._numpy_fft.fft`, so this > >> won't > >>> work. We have no bug report yet because 1.17.x hasn't landed in > >> conda > >>> defaults yet (perhaps this is a/the reason why?), but it will be a > >>> problem. > >>> > >>>> The import numpy.overridable part is meant to help garner > >> adoption, > >>>> and to prefer the unumpy module if it is available (which will > >>>> continue to be developed separately). That way it isn't so > >> tightly > >>>> coupled to the release cycle. One alternative Sebastian Berg > >>>> mentioned (and I am on board with) is just moving unumpy into > >> the > >>>> NumPy organisation. What we fear keeping it separate is that the > >>>> simple act of a pip install unumpy will keep people from using > >> it > >>>> or trying it out. > >>>> > >>> Note that this is not the most critical aspect. I pushed for > >>> vendoring as numpy.overridable because I want to not derail the > >>> comparison with NEP 30 et al. with a "should we add a dependency" > >>> discussion. The interesting part to decide on first is: do we need > >>> the unumpy override mechanism? Vendoring opt-in vs. making it > >> default > >>> vs. adding a dependency is of secondary interest right now. > >>> > >>> Cheers, > >>> Ralf > >>> > >>> > >>> > >>> _______________________________________________ > >>> NumPy-Discussion mailing list > >>> NumPy-Discussion at python.org > >>> https://mail.python.org/mailman/listinfo/numpy-discussion > >> _______________________________________________ > >> NumPy-Discussion mailing list > >> NumPy-Discussion at python.org > >> https://mail.python.org/mailman/listinfo/numpy-discussion > > _______________________________________________ > > NumPy-Discussion mailing list > > NumPy-Discussion at python.org > > https://mail.python.org/mailman/listinfo/numpy-discussion > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Sat Sep 7 19:15:50 2019 From: njs at pobox.com (Nathaniel Smith) Date: Sat, 7 Sep 2019 16:15:50 -0700 Subject: [Numpy-discussion] =?utf-8?q?NEP_31_=E2=80=94_Context-local_and_?= =?utf-8?q?global_overrides_of_the_NumPy_API?= In-Reply-To: References: <67b837e7-5d3d-337e-49f7-cac078ec4d8f@gmail.com> Message-ID: On Fri, Sep 6, 2019 at 11:04 PM Ralf Gommers wrote: > Vendoring means "include the code". So no dependency on an external package. If we don't vendor, it's going to be either unused, or end up as a dependency for the whole SciPy/PyData stack. If we vendor it then it also ends up as a dependency for the whole SciPy/PyData stack... 
> Actually, now that we've discussed the fft issue, I'd suggest to change the NEP to: vendor, and make default for fft, random, and linalg. There's no way we can have an effective discussion of duck arrays, fft backends, random backends, and linalg backends all at once in a single thread. Can you write separate NEPs for each of these? Some questions I'd like to see addressed: For fft: - fft is an entirely self-contained operation, with no interactions with the rest of the system; the only difference between implementations is speed. What problems are caused by monkeypatching, and how is uarray materially different from monkeypatching? For random: - I thought the new random implementation with pluggable generators etc. was supposed to solve this problem already. Why doesn't it? - The biggest issue with MKL monkeypatching random is that it breaks stream stability. How does the uarray approach address this? For linalg: - linalg already support __array_ufunc__ for overrides. Why do we need a second override system? Isn't that redundant? -n -- Nathaniel J. Smith -- https://vorpus.org From ralf.gommers at gmail.com Sat Sep 7 20:07:32 2019 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Sat, 7 Sep 2019 17:07:32 -0700 Subject: [Numpy-discussion] =?utf-8?q?NEP_31_=E2=80=94_Context-local_and_?= =?utf-8?q?global_overrides_of_the_NumPy_API?= In-Reply-To: References: <67b837e7-5d3d-337e-49f7-cac078ec4d8f@gmail.com> Message-ID: On Sat, Sep 7, 2019 at 4:16 PM Nathaniel Smith wrote: > On Fri, Sep 6, 2019 at 11:04 PM Ralf Gommers > wrote: > > Vendoring means "include the code". So no dependency on an external > package. If we don't vendor, it's going to be either unused, or end up as a > dependency for the whole SciPy/PyData stack. > > If we vendor it then it also ends up as a dependency for the whole > SciPy/PyData stack... > It seems you're just using an unusual definition here. Dependency == a package you have to install, is present in pyproject.toml/install_requires, shows up in https://github.com/numpy/numpy/network/dependencies, etc. > > Actually, now that we've discussed the fft issue, I'd suggest to change > the NEP to: vendor, and make default for fft, random, and linalg. > > There's no way we can have an effective discussion of duck arrays, fft > backends, random backends, and linalg backends all at once in a single > thread. > > Can you write separate NEPs for each of these? Some questions I'd like > to see addressed: > > For fft: > - fft is an entirely self-contained operation, with no interactions > with the rest of the system; the only difference between > implementations is speed. What problems are caused by monkeypatching, > It was already explained in this thread, it's been on our roadmap for ~2 years at least, and monkeypatching is pretty much universally understood to be bad. If that's not enough, please search the NumPy issues for "monkeypatching". You'll find issues like https://github.com/numpy/numpy/issues/12374#issuecomment-438725645. At the moment this is very confusing, and hard to diagnose - you have to install a whole new NumPy and then find that the problem is gone (or not). Being able to switch backends in one line of code and re-test would be very valuable. It seems perhaps more useful to have a call so we can communicate with higher bandwidth, rather than lots of writing new NEPs here? 
In preparation, we need to write up in more detail how __array_function__ and unumpy fit together, rather than treat different pieces all separately (because the problems and pros/cons really won't change much between functions and submodules). I'll defer answering your other questions till that's done, so the discussion is hopefully a bit more structured. Cheers, Ralf and how is uarray materially different from monkeypatching? > > For random: > - I thought the new random implementation with pluggable generators > etc. was supposed to solve this problem already. Why doesn't it? > - The biggest issue with MKL monkeypatching random is that it breaks > stream stability. How does the uarray approach address this? > > For linalg: > - linalg already support __array_ufunc__ for overrides. Why do we need > a second override system? Isn't that redundant? > > -n > > -- > Nathaniel J. Smith -- https://vorpus.org > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Sun Sep 8 01:40:46 2019 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Sat, 7 Sep 2019 22:40:46 -0700 Subject: [Numpy-discussion] shipping sdists without generated C sources from Cython code Message-ID: Hi all, There are several open issues about people not being able to compile the latest release with Python 3.8 betas due to our release containing generated C code with a too old version of Cython. This happened for Python 3.7 as well. With the Python packaging system having improved that build dependencies are no longer insane, I think we should stop shipping the generated C sources. We've discussed this a couple of times before on GitHub, but I've now opened a PR for this (https://github.com/numpy/numpy/pull/14453) so I thought it would be good to mention here in case anyone sees an issue with doing this. Cheers, Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Sun Sep 8 02:44:30 2019 From: njs at pobox.com (Nathaniel Smith) Date: Sat, 7 Sep 2019 23:44:30 -0700 Subject: [Numpy-discussion] =?utf-8?q?NEP_31_=E2=80=94_Context-local_and_?= =?utf-8?q?global_overrides_of_the_NumPy_API?= In-Reply-To: References: <67b837e7-5d3d-337e-49f7-cac078ec4d8f@gmail.com> Message-ID: On Sat, Sep 7, 2019 at 5:08 PM Ralf Gommers wrote: > > On Sat, Sep 7, 2019 at 4:16 PM Nathaniel Smith wrote: >> >> On Fri, Sep 6, 2019 at 11:04 PM Ralf Gommers wrote: >> > Vendoring means "include the code". So no dependency on an external package. If we don't vendor, it's going to be either unused, or end up as a dependency for the whole SciPy/PyData stack. >> >> If we vendor it then it also ends up as a dependency for the whole >> SciPy/PyData stack... > > > It seems you're just using an unusual definition here. Dependency == a package you have to install, is present in pyproject.toml/install_requires, shows up in https://github.com/numpy/numpy/network/dependencies, etc. That's a pretty trivial definition though. Surely the complexity of the installed code and its maintainer structure is what matters, not the exact details of how the install happens. >> > Actually, now that we've discussed the fft issue, I'd suggest to change the NEP to: vendor, and make default for fft, random, and linalg. 
>>
>> There's no way we can have an effective discussion of duck arrays, fft
>> backends, random backends, and linalg backends all at once in a single
>> thread.
>>
>> Can you write separate NEPs for each of these? Some questions I'd like
>> to see addressed:
>>
>> For fft:
>> - fft is an entirely self-contained operation, with no interactions
>> with the rest of the system; the only difference between
>> implementations is speed. What problems are caused by monkeypatching,
>
>
> It was already explained in this thread, it's been on our roadmap for ~2 years at least, and monkeypatching is pretty much universally understood to be bad. If that's not enough, please search the NumPy issues for "monkeypatching". You'll find issues like https://github.com/numpy/numpy/issues/12374#issuecomment-438725645. At the moment this is very confusing, and hard to diagnose - you have to install a whole new NumPy and then find that the problem is gone (or not). Being able to switch backends in one line of code and re-test would be very valuable.

Sure, it's not meant as a trick question, I'm just saying you should
write down the reasons and how you solve them in one place. Maybe some
of the reasons monkeypatching is bad don't apply here, or maybe some
of them do apply, but uarray doesn't solve them -- we can't tell
without doing the work.

The link you gave doesn't involve monkeypatching or np.fft, so I'm not
sure how it's relevant...?

> It seems perhaps more useful to have a call so we can communicate with higher bandwidth, rather than lots of writing new NEPs here? In preparation, we need to write up in more detail how __array_function__ and unumpy fit together, rather than treat different pieces all separately (because the problems and pros/cons really won't change much between functions and submodules). I'll defer answering your other questions till that's done, so the discussion is hopefully a bit more structured.

I don't have a lot of time for calls, and you'd still have to write
it up for everyone who isn't on the call...

-n

--
Nathaniel J. Smith -- https://vorpus.org

From njs at pobox.com  Sun Sep  8 03:53:43 2019
From: njs at pobox.com (Nathaniel Smith)
Date: Sun, 8 Sep 2019 00:53:43 -0700
Subject: [Numpy-discussion] =?utf-8?q?NEP_31_=E2=80=94_Context-local_and_?=
	=?utf-8?q?global_overrides_of_the_NumPy_API?=
In-Reply-To: 
References: 
Message-ID: 

On Fri, Sep 6, 2019 at 11:53 AM Ralf Gommers wrote:
> On Fri, Sep 6, 2019 at 12:53 AM Nathaniel Smith wrote:
>> On Tue, Sep 3, 2019 at 2:04 AM Hameer Abbasi wrote:
>> > The fact that we're having to design more and more protocols for a lot
>> > of very similar things is, to me, an indicator that we do have holistic
>> > problems that ought to be solved by a single protocol.
>>
>> But the reason we've had trouble designing these protocols is that
>> they're each different :-). If it was just a matter of copying
>> __array_ufunc__ we'd have been done in a few minutes...
>
> I don't think that argument is correct. That we now have two very similar protocols is simply a matter of history and limited developer time. NEP 18 discusses in several places that __array_ufunc__ should be brought in line with __array_function__, and that we can migrate a function from one protocol to the other. There's no technical reason other than backwards compat and dev time why we couldn't use __array_function__ for ufuncs also.

Huh, that's interesting! Apparently we have a profoundly different
understanding of what we're doing here.
To me, __array_ufunc__ and
__array_function__ are completely different. In fact I'd say
__array_ufunc__ is a good idea and __array_function__ is a bad idea,
and would definitely not be in favor of combining them together.

The key difference is that __array_ufunc__ allows for *generic*
implementations. Most duck array libraries can write a single
implementation of __array_ufunc__ that works for *all* ufuncs, even
new third-party ufuncs that the duck array library has never heard of,
because ufuncs all share the same structure of a loop wrapped around a
core operation, and they can treat the core operation as a black box.
For example:

- Dask can split up the operation across its tiled sub-arrays, and
then for each tile it invokes the core operation.
- xarray can do its label-based axis matching, and then invoke the
core operation.
- bcolz can loop over the array uncompressing one block at a time,
invoking the core operation on each.
- sparse arrays can check the ufunc .identity attribute to find out
whether 0 is an identity, and if so invoke the operation directly on
the non-zero entries; otherwise, it can loop over the array and
densify it in blocks and invoke the core operation on each. (It would
be useful to have a bit more metadata on the ufunc, so e.g.
np.subtract could declare that zero is a right-identity but not a
left-identity, but that's a simple enough extension to make at some
point.)

Result: __array_ufunc__ makes it totally possible to take a ufunc from
scipy.special or a random new one created with numba, and have it
immediately work on an xarray wrapped around dask wrapped around
bcolz, out-of-the-box. That's a clean, generic interface. [1]
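To make the "single generic implementation" point concrete, here is a minimal
sketch for a toy tiled duck array (the `TiledArray` class and its tiling
scheme are invented for illustration; this is not Dask's or any other
library's actual code):

```
import numpy as np

class TiledArray:
    """Toy duck array: a 1-D array stored as a list of ndarray tiles."""

    def __init__(self, tiles):
        self.tiles = list(tiles)

    def __array_ufunc__(self, ufunc, method, *inputs, **kwargs):
        # One implementation covers *every* ufunc, including third-party
        # ones this class has never heard of: loop over the tiles and
        # invoke the core operation as a black box.
        if method != '__call__' or kwargs:
            return NotImplemented  # punt on .reduce, out=, etc.
        if not all(isinstance(x, (TiledArray, int, float)) for x in inputs):
            return NotImplemented
        n = len(next(x for x in inputs if isinstance(x, TiledArray)).tiles)
        return TiledArray(
            ufunc(*[x.tiles[i] if isinstance(x, TiledArray) else x
                    for x in inputs])
            for i in range(n))

a = TiledArray([np.arange(3), np.arange(3, 6)])
b = np.sqrt(a)       # any NumPy ufunc dispatches to __array_ufunc__
c = np.add(a, 1.0)   # and so would scipy.special.erf or a numba ufunc
```

The same dozen lines keep working for a brand-new ufunc published tomorrow,
which is exactly the property being described here.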
OTOH, __array_function__ doesn't allow this kind of simplification: if
we were using __array_function__ for ufuncs, every library would have
to special-case every individual ufunc, which leads to dramatically
more work and more potential for bugs.

To me, the whole point of interfaces is to reduce coupling. When you
have N interacting modules, it's unmaintainable if every change
requires considering every N! combination. From this perspective,
__array_function__ isn't good, but it is still somewhat constrained:
the result of each operation is still determined by the objects
involved, nothing else. In this regard, uarray is even more extreme than
__array_function__, because arbitrary operations can be arbitrarily
changed by arbitrarily distant code. It sort of feels like the
argument for uarray is: well, designing maintainable interfaces is a
lot of work, so forget it, let's just make it easy to monkeypatch
everything and call it a day.

That said, in my replies in this thread I've been trying to stay
productive and focus on narrower concrete issues. I'm pretty sure that
__array_function__ and uarray will turn out to be bad ideas and will
fail, but that's not a proven fact, it's just an informed guess. And
the road that I favor also has lots of risks and uncertainty. So I
don't have a problem with trying both as experiments and learning
more! But hopefully that explains why it's not at all obvious that
uarray solves the protocol design problems we've been talking about.

-n

[1] There are also some cases that __array_ufunc__ doesn't handle as
nicely. One obvious one is that GPU/TPU libraries still need to
special-case individual ufuncs. But that's not a limitation of
__array_ufunc__, it's a limitation of GPUs -- they can't run CPU code,
so they can't use the CPU implementation of the core operations.
Another limitation is that __array_ufunc__ is weak at handling
operations that involve mixed libraries (e.g. np.add(bcolz_array,
sparse_array)) -- to work well, this might require that bcolz have
special-case handling for sparse arrays, or vice-versa, so you still
potentially have some N**2 special cases, though at least here N is
the number of duck array libraries, not the number of ufuncs. I think
this is an interesting target for future work. But in general,
__array_ufunc__ goes a long way to taming the complexity of
interacting libraries and ufuncs.

--
Nathaniel J. Smith -- https://vorpus.org

From einstein.edison at gmail.com  Sun Sep  8 04:03:57 2019
From: einstein.edison at gmail.com (Hameer Abbasi)
Date: Sun, 8 Sep 2019 10:03:57 +0200
Subject: [Numpy-discussion] =?utf-8?q?NEP_31_=E2=80=94_Context-local_and_?=
	=?utf-8?q?global_overrides_of_the_NumPy_API?=
In-Reply-To: 
References: 
Message-ID: 

On 08.09.19 09:53, Nathaniel Smith wrote:
> On Fri, Sep 6, 2019 at 11:53 AM Ralf Gommers
> wrote:
>> On Fri, Sep 6, 2019 at 12:53 AM Nathaniel Smith wrote:
>>> On Tue, Sep 3, 2019 at 2:04 AM Hameer Abbasi
>>> wrote:
>>>> The fact that we're having to design more and more protocols for a lot
>>>> of very similar things is, to me, an indicator that we do have
>>>> holistic
>>>> problems that ought to be solved by a single protocol.
>>> But the reason we've had trouble designing these protocols is that
>>> they're each different :-). If it was just a matter of copying
>>> __array_ufunc__ we'd have been done in a few minutes...
>> I don't think that argument is correct. That we now have two very
>> similar protocols is simply a matter of history and limited developer
>> time. NEP 18 discusses in several places that __array_ufunc__ should
>> be brought in line with __array_function__, and that we can migrate a
>> function from one protocol to the other. There's no technical reason
>> other than backwards compat and dev time why we couldn't use
>> __array_function__ for ufuncs also.
> Huh, that's interesting! Apparently we have a profoundly different
> understanding of what we're doing here. To me, __array_ufunc__ and
> __array_function__ are completely different. In fact I'd say
> __array_ufunc__ is a good idea and __array_function__ is a bad idea,
> and would definitely not be in favor of combining them together.
>
> The key difference is that __array_ufunc__ allows for *generic*
> implementations. Most duck array libraries can write a single
> implementation of __array_ufunc__ that works for *all* ufuncs, even
> new third-party ufuncs that the duck array library has never heard of,
> because ufuncs all share the same structure of a loop wrapped around a
> core operation, and they can treat the core operation as a black box.
> For example:
>
> - Dask can split up the operation across its tiled sub-arrays, and
> then for each tile it invokes the core operation.
> - xarray can do its label-based axis matching, and then invoke the
> core operation.
> - bcolz can loop over the array uncompressing one block at a time,
> invoking the core operation on each.
> - sparse arrays can check the ufunc .identity attribute to find out
> whether 0 is an identity, and if so invoke the operation directly on
> the non-zero entries; otherwise, it can loop over the array and
> densify it in blocks and invoke the core operation on each. (It would
> be useful to have a bit more metadata on the ufunc, so e.g.
> np.subtract could declare that zero is a right-identity but not a
> left-identity, but that's a simple enough extension to make at some
> point.)
>
> Result: __array_ufunc__ makes it totally possible to take a ufunc from
> scipy.special or a random new one created with numba, and have it
> immediately work on an xarray wrapped around dask wrapped around
> bcolz, out-of-the-box. That's a clean, generic interface. [1]
>
> OTOH, __array_function__ doesn't allow this kind of simplification: if
> we were using __array_function__ for ufuncs, every library would have
> to special-case every individual ufunc, which leads to dramatically
> more work and more potential for bugs.

But uarray does allow this kind of simplification. You would do the
following inside a uarray backend:

    def __ua_function__(func, args, kwargs):
        with ua.skip_backend(self_backend):
            ...  # Do code here; dispatches to everything but this backend

This is possible today and is done in the dask backend inside unumpy
for example.
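As a rough usage sketch of how such a backend is then selected
(`MyDaskBackend` is a stand-in name for a real backend object such as the one
unumpy ships for dask, and `x` is any array-like; `ua.set_backend` is
uarray's context-local override):

```
import uarray as ua
import unumpy as np  # the uarray-based mirror of the NumPy API

# MyDaskBackend is a hypothetical stand-in for a real backend object.
with ua.set_backend(MyDaskBackend):
    y = np.sum(x)  # routed through MyDaskBackend.__ua_function__
```

The context-manager form is where the "context-local overrides" in the NEP
title come from; there is also a global registration variant.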
>
> To me, the whole point of interfaces is to reduce coupling. When you
> have N interacting modules, it's unmaintainable if every change
> requires considering every N! combination. From this perspective,
> __array_function__ isn't good, but it is still somewhat constrained:
> the result of each operation is still determined by the objects
> involved, nothing else. In this regard, uarray is even more extreme than
> __array_function__, because arbitrary operations can be arbitrarily
> changed by arbitrarily distant code. It sort of feels like the
> argument for uarray is: well, designing maintainable interfaces is a
> lot of work, so forget it, let's just make it easy to monkeypatch
> everything and call it a day.
>
> That said, in my replies in this thread I've been trying to stay
> productive and focus on narrower concrete issues. I'm pretty sure that
> __array_function__ and uarray will turn out to be bad ideas and will
> fail, but that's not a proven fact, it's just an informed guess. And
> the road that I favor also has lots of risks and uncertainty. So I
> don't have a problem with trying both as experiments and learning
> more! But hopefully that explains why it's not at all obvious that
> uarray solves the protocol design problems we've been talking about.
>
> -n
>
> [1] There are also some cases that __array_ufunc__ doesn't handle as
> nicely. One obvious one is that GPU/TPU libraries still need to
> special-case individual ufuncs. But that's not a limitation of
> __array_ufunc__, it's a limitation of GPUs -- they can't run CPU code,
> so they can't use the CPU implementation of the core operations.
> Another limitation is that __array_ufunc__ is weak at handling
> operations that involve mixed libraries (e.g. np.add(bcolz_array,
> sparse_array)) -- to work well, this might require that bcolz have
> special-case handling for sparse arrays, or vice-versa, so you still
> potentially have some N**2 special cases, though at least here N is
> the number of duck array libraries, not the number of ufuncs. I think
> this is an interesting target for future work. But in general,
> __array_ufunc__ goes a long way to taming the complexity of
> interacting libraries and ufuncs.
>
> --
> Nathaniel J. Smith -- https://vorpus.org
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at python.org
> https://mail.python.org/mailman/listinfo/numpy-discussion

From njs at pobox.com  Sun Sep  8 04:56:15 2019
From: njs at pobox.com (Nathaniel Smith)
Date: Sun, 8 Sep 2019 01:56:15 -0700
Subject: [Numpy-discussion] =?utf-8?q?NEP_31_=E2=80=94_Context-local_and_?=
	=?utf-8?q?global_overrides_of_the_NumPy_API?=
In-Reply-To: 
References: 
Message-ID: 

On Sun, Sep 8, 2019 at 1:04 AM Hameer Abbasi wrote:
>
> On 08.09.19 09:53, Nathaniel Smith wrote:
>> OTOH, __array_function__ doesn't allow this kind of simplification: if
>> we were using __array_function__ for ufuncs, every library would have
>> to special-case every individual ufunc, which leads to dramatically
>> more work and more potential for bugs.
>
> But uarray does allow this kind of simplification. You would do the following inside a uarray backend:
>
>     def __ua_function__(func, args, kwargs):
>         with ua.skip_backend(self_backend):
>             ...  # Do code here; dispatches to everything but this backend

You can dispatch to the underlying operation, sure, but you can't
implement a generic ufunc loop because you don't know that 'func' is
actually a bound ufunc method, or have any way to access the
underlying ufunc object. (E.g. consider the case where 'func' is
'np.add.reduce'.) The critical part of my example was that it's a new
ufunc that none of these libraries have ever heard of before.

Ufuncs have a lot of consistent structure beyond what generic Python
callables have, and the whole point of __array_ufunc__ is that
implementors can rely on that structure. You get to work at a higher
level of abstraction.

A similar but simpler example would be the protocol we've sketched out
for concatenation: the idea would be to capture the core similarity
between np.concatenate/np.hstack/np.vstack/np.dstack/np.column_stack/np.row_stack/any
other variants, so that implementors only have to worry about the
higher-level concept of "concatenation" rather than the raw APIs of
all those individual functions.

-n

--
Nathaniel J. Smith -- https://vorpus.org

From einstein.edison at gmail.com  Sun Sep  8 05:05:32 2019
From: einstein.edison at gmail.com (Hameer Abbasi)
Date: Sun, 8 Sep 2019 11:05:32 +0200
Subject: [Numpy-discussion] =?utf-8?q?NEP_31_=E2=80=94_Context-local_and_?=
	=?utf-8?q?global_overrides_of_the_NumPy_API?=
In-Reply-To: 
References: 
Message-ID: 

On 08.09.19 10:56, Nathaniel Smith wrote:
> On Sun, Sep 8, 2019 at 1:04 AM Hameer Abbasi wrote:
>> On 08.09.19 09:53, Nathaniel Smith wrote:
>>> OTOH, __array_function__ doesn't allow this kind of simplification: if
>>> we were using __array_function__ for ufuncs, every library would have
>>> to special-case every individual ufunc, which leads to dramatically
>>> more work and more potential for bugs.
>> But uarray does allow this kind of simplification. You would do the following inside a uarray backend:
>>
>>     def __ua_function__(func, args, kwargs):
>>         with ua.skip_backend(self_backend):
>>             ...  # Do code here; dispatches to everything but this backend
> You can dispatch to the underlying operation, sure, but you can't
> implement a generic ufunc loop because you don't know that 'func' is
> actually a bound ufunc method, or have any way to access the
> underlying ufunc object. (E.g. consider the case where 'func' is
> 'np.add.reduce'.) The critical part of my example was that it's a new
> ufunc that none of these libraries have ever heard of before.
>
> Ufuncs have a lot of consistent structure beyond what generic Python
> callables have, and the whole point of __array_ufunc__ is that
> implementors can rely on that structure. You get to work at a higher
> level of abstraction.
>
> A similar but simpler example would be the protocol we've sketched out
> for concatenation: the idea would be to capture the core similarity
> between np.concatenate/np.hstack/np.vstack/np.dstack/np.column_stack/np.row_stack/any
> other variants, so that implementors only have to worry about the
> higher-level concept of "concatenation" rather than the raw APIs of
> all those individual functions.

There's a solution for that too: Default implementations. Implement
concatenate, and you've got a default implementation for all of those
you mentioned. Similarly for transpose/swapaxes/moveaxis and family.
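To make the default-implementations idea concrete, here is a minimal sketch
of how two of the stacking variants can be derived from `concatenate`
(illustrative only, not unumpy's actual code; in unumpy the bodies would call
the overridable multimethods, so they dispatch to whichever backend is
active):

```
import numpy as np

def hstack_default(arrays):
    # np.hstack is concatenation along axis 1, except for 1-D inputs,
    # where it concatenates along axis 0.
    arrays = [np.atleast_1d(a) for a in arrays]
    axis = 0 if arrays[0].ndim == 1 else 1
    return np.concatenate(arrays, axis=axis)

def vstack_default(arrays):
    # np.vstack promotes everything to at least 2-D, then concatenates
    # along axis 0.
    return np.concatenate([np.atleast_2d(a) for a in arrays], axis=0)
```

A backend that only implements `concatenate` (plus the reshaping primitives)
then gets all the stacking functions for free.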
>
> -n
>

From ralf.gommers at gmail.com  Sun Sep  8 11:39:38 2019
From: ralf.gommers at gmail.com (Ralf Gommers)
Date: Sun, 8 Sep 2019 08:39:38 -0700
Subject: [Numpy-discussion] =?utf-8?q?NEP_31_=E2=80=94_Context-local_and_?=
	=?utf-8?q?global_overrides_of_the_NumPy_API?=
In-Reply-To: 
References: 
Message-ID: 

On Sun, Sep 8, 2019 at 12:54 AM Nathaniel Smith wrote:
> On Fri, Sep 6, 2019 at 11:53 AM Ralf Gommers wrote:
> > On Fri, Sep 6, 2019 at 12:53 AM Nathaniel Smith wrote:
> >> On Tue, Sep 3, 2019 at 2:04 AM Hameer Abbasi wrote:
> >> > The fact that we're having to design more and more protocols for a lot
> >> > of very similar things is, to me, an indicator that we do have holistic
> >> > problems that ought to be solved by a single protocol.
> >>
> >> But the reason we've had trouble designing these protocols is that
> >> they're each different :-). If it was just a matter of copying
> >> __array_ufunc__ we'd have been done in a few minutes...
> >
> > I don't think that argument is correct. That we now have two very similar protocols is simply a matter of history and limited developer time. NEP 18 discusses in several places that __array_ufunc__ should be brought in line with __array_function__, and that we can migrate a function from one protocol to the other. There's no technical reason other than backwards compat and dev time why we couldn't use __array_function__ for ufuncs also.
>
> Huh, that's interesting! Apparently we have a profoundly different
> understanding of what we're doing here.

That is interesting indeed. We should figure this out first - no point discussing a NEP about plugging the gaps in our override system when we don't have a common understanding of why we wanted/needed an override system in the first place.

> To me, __array_ufunc__ and
> __array_function__ are completely different. In fact I'd say
> __array_ufunc__ is a good idea and __array_function__ is a bad idea,

It's early days, but "customer feedback" certainly has been more enthusiastic for __array_function__. Also from what I've seen so far it works well. Example: at the SciPy sprints someone put together Xarray plus pydata/sparse to use distributed sparse arrays for visualizing some large genetic (I think) data sets. That was made to work in a single day, with impressively little code.

> and would definitely not be in favor of combining them together.

I'm not saying we should. But __array_ufunc__ is basically a slight specialization - knowing that the function that was called is a ufunc can be handy but is usually irrelevant.

> The key difference is that __array_ufunc__ allows for *generic*
> implementations.

Implementations of what?

> Most duck array libraries can write a single
> implementation of __array_ufunc__ that works for *all* ufuncs, even
> new third-party ufuncs that the duck array library has never heard of,

I see where you're going with this. You are thinking of reusing the ufunc implementation to do a computation. That's a minor use case (imho), and I can't remember seeing it used. The original use case was scipy.sparse matrices. The executive summary of NEP 13 talks about this. It's about calling `np.some_ufunc(other_ndarray_like)` and "handing over control" to that object rather than the numpy function starting to execute. Also note that NEP 13 states in the summary "This covers some of the same ground as Travis Oliphant's proposal to retro-fit NumPy with multi-methods" (reminds one of uarray....). For scipy.sparse, the layout of the data doesn't make sense to numpy. All that was desired was that the sparse matrix needs to know what function was called, so it can call its own implementation of that function instead.

> because ufuncs all share the same structure of a loop wrapped around a
> core operation, and they can treat the core operation as a black box.
> For example:
>
> - Dask can split up the operation across its tiled sub-arrays, and
> then for each tile it invokes the core operation.

Works for __array_function__ too. Note, *not* by explicitly reusing the numpy function. Dask anyway has its own functions that mirror the numpy API. Dask's __array_function__ just does the forwarding to its own functions. Also, a Dask array could be a collection of CuPy arrays, and CuPy implements __array_ufunc__. So explicitly reusing the NumPy ufunc implementation on whatever comes in would be, well, not so nice.

> - xarray can do its label-based axis matching, and then invoke the
> core operation.

Could do this with __array_function__ too.

> - bcolz can loop over the array uncompressing one block at a time,
> invoking the core operation on each.

Not sure about this one.

> - sparse arrays can check the ufunc .identity attribute

This is a case where knowing if something is a ufunc helps use a property of it. So there the more specialized nature of __array_ufunc__ helps. Seems niche though, and could probably also be done by checking if a function is an instance of np.ufunc via __array_function__.

> to find out
> whether 0 is an identity, and if so invoke the operation directly on
> the non-zero entries; otherwise, it can loop over the array and
> densify it in blocks and invoke the core operation on each. (It would
> be useful to have a bit more metadata on the ufunc, so e.g.
> np.subtract could declare that zero is a right-identity but not a
> left-identity, but that's a simple enough extension to make at some
> point.)
>
> Result: __array_ufunc__ makes it totally possible to take a ufunc from
> scipy.special or a random new one created with numba, and have it
> immediately work on an xarray wrapped around dask wrapped around
> bcolz, out-of-the-box. That's a clean, generic interface. [1]
>

This last point, using third-party ufuncs, is the interesting one to me. They have to be generated with the NumPy ufunc machinery, so the dispatch mechanism is attached to them. We could do third party functions for __array_function__ too, but that would require making @array_function_dispatch public, which we haven't done (yet?).

> OTOH, __array_function__ doesn't allow this kind of simplification: if
> we were using __array_function__ for ufuncs, every library would have
> to special-case every individual ufunc, which leads to dramatically
> more work and more potential for bugs.
>

This all assumes that "reusing the ufunc's implementation" is the one thing that matters. To me that's a small side benefit, which we haven't seen a whole lot of use of in the 2+ years that __array_ufunc__ was available. I think that what (for example) CuPy does - use __array_ufunc__ to simply take over control, is both the major use case and the original motivation for introducing the protocol.
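For contrast with the generic tiled sketch earlier in the thread, here is a
minimal sketch of that "take over control" pattern (a toy class, not CuPy's
or scipy.sparse's actual code): each handled ufunc is mapped to the object's
own implementation, and everything else fails loudly:

```
import numpy as np

class MyArray:
    """Toy array whose data layout NumPy's inner loops can't touch."""

    def __init__(self, data):
        self.data = np.asarray(data)

    def _add(self, other):
        # "Our own implementation": a real library would run its
        # specialized kernel here instead of NumPy's loop.
        return MyArray(self.data + other.data)

    def __array_ufunc__(self, ufunc, method, *inputs, **kwargs):
        handlers = {np.add: MyArray._add}
        if method != '__call__' or kwargs or ufunc not in handlers:
            return NotImplemented  # unknown ufunc: explicit failure
        return handlers[ufunc](*inputs)

x, y = MyArray([1, 2]), MyArray([3, 4])
z = np.add(x, y)  # control is handed to MyArray._add
```

Note the contrast with the generic approach: here every supported ufunc needs
its own entry in the handler table, which is exactly the per-function
special-casing being debated.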
> To me, the whole point of interfaces is to reduce coupling. When you
> have N interacting modules, it's unmaintainable if every change
> requires considering every N! combination. From this perspective,
> __array_function__ isn't good, but it is still somewhat constrained:
> the result of each operation is still determined by the objects
> involved, nothing else. In this regard, uarray is even more extreme than
> __array_function__, because arbitrary operations can be arbitrarily
> changed by arbitrarily distant code. It sort of feels like the
> argument for uarray is: well, designing maintainable interfaces is a
> lot of work, so forget it, let's just make it easy to monkeypatch
> everything and call it a day.
>
> That said, in my replies in this thread I've been trying to stay
> productive and focus on narrower concrete issues. I'm pretty sure that
> __array_function__ and uarray will turn out to be bad ideas and will
> fail, but that's not a proven fact, it's just an informed guess. And
> the road that I favor also has lots of risks and uncertainty.

But what is that road, and what do you think the goal is? To me it's: separate our API from our implementation. Yours seems to be "reuse our implementations" for __array_ufunc__, but I can't see how that generalizes beyond ufuncs.

> So I don't have a problem with trying both as experiments and learning
> more! But hopefully that explains why it's not at all obvious that
> uarray solves the protocol design problems we've been talking about.
>
> -n
>
> [1] There are also some cases that __array_ufunc__ doesn't handle as
> nicely. One obvious one is that GPU/TPU libraries still need to
> special-case individual ufuncs. But that's not a limitation of
> __array_ufunc__, it's a limitation of GPUs

I think this is an important point. GPUs are massively popular, and will very likely just continue to grow in importance. So anything we do in this space that says "well it works, just not for GPUs" is probably not going to solve our most pressing problems.

> -- they can't run CPU code,
> so they can't use the CPU implementation of the core operations.
> Another limitation is that __array_ufunc__ is weak at handling
> operations that involve mixed libraries (e.g. np.add(bcolz_array,
> sparse_array)) -- to work well, this might require that bcolz have
> special-case handling for sparse arrays, or vice-versa, so you still
> potentially have some N**2 special cases, though at least here N is
> the number of duck array libraries, not the number of ufuncs. I think
> this is an interesting target for future work. But in general,
> __array_ufunc__ goes a long way to taming the complexity of
> interacting libraries and ufuncs.
>

With *only* ufuncs you can't create that many interesting applications, you need the other functions too...

Cheers,
Ralf

-------------- next part --------------
An HTML attachment was scrubbed...
URL: From warren.weckesser at gmail.com Sun Sep 8 11:54:43 2019 From: warren.weckesser at gmail.com (Warren Weckesser) Date: Sun, 8 Sep 2019 11:54:43 -0400 Subject: [Numpy-discussion] NEP 32: Remove the financial functions from NumPy In-Reply-To: References: <9067a8f06bc885307d1ec726a55bc5fd906c3c62.camel@sipsolutions.net> Message-ID: On 9/4/19, Matthew Brett wrote: > Hi, > > Maybe worth asking over at the Pandas list? I bet there are more > Python / finance people over there. OK, I sent a message to the PyData mailing list. Warren > > Cheers, > > Matthew > > On Wed, Sep 4, 2019 at 7:11 PM Ilhan Polat wrote: >> >> +1 on removing them from NumPy. I think there are plenty of alternatives >> already so many that we might even consider deprecating them just like >> SciPy misc module by pointing to alternatives. >> >> On Tue, Sep 3, 2019 at 6:38 PM Sebastian Berg >> wrote: >>> >>> On Tue, 2019-09-03 at 08:56 -0400, Warren Weckesser wrote: >>> > Github issue 2880 ("Get financial functions out of main namespace", >>> >>> Very briefly, I am absolutely in favor of this. >>> >>> Keeping the functions in numpy seems more of a liability than help >>> anyone. And this push is more likely to help users by spurring >>> development on a good replacement, than a practically unmaintained >>> corner of NumPy that may seem like it solves a problem, but probably >>> does so very poorly. >>> >>> Moving them into a separate pip installable package seems like the best >>> way forward until a better replacement, to which we can point users, >>> comes up. >>> >>> - Sebastian >>> >>> >>> > https://github.com/numpy/numpy/issues/2880) has been open since 2013. >>> > In a recent community meeting, it was suggested that we create a NEP >>> > to propose the removal of the financial functions from NumPy. I have >>> > submitted "NEP 32: Remove the financial functions from NumPy" in a >>> > pull request at https://github.com/numpy/numpy/pull/14399. A copy of >>> > the latest version of the NEP is below. >>> > >>> > According to the NEP process document, "Once the PR is in place, the >>> > NEP should be announced on the mailing list for discussion (comments >>> > on the PR itself should be restricted to minor editorial and >>> > technical fixes)." This email is the announcement for NEP 32. >>> > >>> > The NEP includes a brief summary of the history of the financial >>> > functions, and has links to several relevant mailing list threads, >>> > dating back to when the functions were added to NumPy in 2008. I >>> > recommend reviewing those threads before commenting here. >>> > >>> > Warren >>> > >>> > ----- >>> > >>> > ================================================== >>> > NEP 32 ? Remove the financial functions from NumPy >>> > ================================================== >>> > >>> > :Author: Warren Weckesser >>> > :Status: Draft >>> > :Type: Standards Track >>> > :Created: 2019-08-30 >>> > >>> > >>> > Abstract >>> > -------- >>> > >>> > We propose deprecating and ultimately removing the financial >>> > functions [1]_ >>> > from NumPy. The functions will be moved to an independent >>> > repository, >>> > and provided to the community as a separate package with the name >>> > ``numpy_financial``. >>> > >>> > >>> > Motivation and scope >>> > -------------------- >>> > >>> > The NumPy financial functions [1]_ are the 10 functions ``fv``, >>> > ``ipmt``, >>> > ``irr``, ``mirr``, ``nper``, ``npv``, ``pmt``, ``ppmt``, ``pv`` and >>> > ``rate``. 
>>> > The functions provide elementary financial calculations such as >>> > future value, >>> > net present value, etc. These functions were added to NumPy in 2008 >>> > [2]_. >>> > >>> > In May, 2009, a request by Joe Harrington to add a function called >>> > ``xirr`` to >>> > the financial functions triggered a long thread about these functions >>> > [3]_. >>> > One important point that came up in that thread is that a "real" >>> > financial >>> > library must be able to handle real dates. The NumPy financial >>> > functions do >>> > not work with actual dates or calendars. The preference for a more >>> > capable >>> > library independent of NumPy was expressed several times in that >>> > thread. >>> > >>> > In June, 2009, D. L. Goldsmith expressed concerns about the >>> > correctness of the >>> > implementations of some of the financial functions [4]_. It was >>> > suggested then >>> > to move the financial functions out of NumPy to an independent >>> > package. >>> > >>> > In a GitHub issue in 2013 [5]_, Nathaniel Smith suggested moving the >>> > financial >>> > functions from the top-level namespace to ``numpy.financial``. He >>> > also >>> > suggested giving the functions better names. Responses at that time >>> > included >>> > the suggestion to deprecate them and move them from NumPy to a >>> > separate >>> > package. This issue is still open. >>> > >>> > Later in 2013 [6]_, it was suggested on the mailing list that these >>> > functions >>> > be removed from NumPy. >>> > >>> > The arguments for the removal of these functions from NumPy: >>> > >>> > * They are too specialized for NumPy. >>> > * They are not actually useful for "real world" financial >>> > calculations, because >>> > they do not handle real dates and calendars. >>> > * The definition of "correctness" for some of these functions seems >>> > to be a >>> > matter of convention, and the current NumPy developers do not have >>> > the >>> > background to judge their correctness. >>> > * There has been little interest among past and present NumPy >>> > developers >>> > in maintaining these functions. >>> > >>> > The main arguments for keeping the functions in NumPy are: >>> > >>> > * Removing these functions will be disruptive for some users. >>> > Current users >>> > will have to add the new ``numpy_financial`` package to their >>> > dependencies, >>> > and then modify their code to use the new package. >>> > * The functions provided, while not "industrial strength", are >>> > apparently >>> > similar to functions provided by spreadsheets and some >>> > calculators. Having >>> > them available in NumPy makes it easier for some developers to >>> > migrate their >>> > software to Python and NumPy. >>> > >>> > It is clear from comments in the mailing list discussions and in the >>> > GitHub >>> > issues that many current NumPy developers believe the benefits of >>> > removing >>> > the functions outweigh the costs. For example, from [5]_:: >>> > >>> > The financial functions should probably be part of a separate >>> > package >>> > -- Charles Harris >>> > >>> > If there's a better package we can point people to we could just >>> > deprecate >>> > them and then remove them entirely... I'd be fine with that >>> > too... >>> > -- Nathaniel Smith >>> > >>> > +1 to deprecate them. If no other package exists, it can be >>> > created if >>> > someone feels the need for that. >>> > -- Ralf Gommers >>> > >>> > I feel pretty strongly that we should deprecate these. 
If nobody >>> > on numpy?s >>> > core team is interested in maintaining them, then it is purely a >>> > drag on >>> > development for NumPy. >>> > -- Stephan Hoyer >>> > >>> > And from the 2013 mailing list discussion, about removing the >>> > functions from >>> > NumPy:: >>> > >>> > I am +1 as well, I don't think they should have been included in >>> > the first >>> > place. >>> > -- David Cournapeau >>> > >>> > But not everyone was in favor of removal:: >>> > >>> > The fin routines are tiny and don't require much maintenance once >>> > written. If we made an effort (putting up pages with examples of >>> > common >>> > financial calculations and collecting those under a topical web >>> > page, >>> > then linking to that page from various places and talking it up), >>> > I >>> > would think they could attract users looking for a free way to >>> > play with >>> > financial scenarios. [...] >>> > So, I would say we keep them. If ours are not the best, we >>> > should bring >>> > them up to snuff. >>> > -- Joe Harrington >>> > >>> > For an idea of the maintenance burden of the financial functions, one >>> > can >>> > look for all the GitHub issues [7]_ and pull requests [8]_ that have >>> > the tag >>> > ``component: numpy.lib.financial``. >>> > >>> > One method for measuring the effect of removing these functions is to >>> > find >>> > all the packages on GitHub that use them. Such a search can be >>> > performed >>> > with the ``python-api-inspect`` service [9]_. A search for all uses >>> > of the >>> > NumPy financial functions finds just eight repositories. (See the >>> > comments >>> > in [5]_ for the actual SQL query.) >>> > >>> > >>> > Implementation >>> > -------------- >>> > >>> > * Create a new Python package, ``numpy_financial``, to be maintained >>> > in the >>> > top-level NumPy github organization. This repository will contain >>> > the >>> > definitions and unit tests for the financial functions. The >>> > package will >>> > be added to PyPI so it can be installed with ``pip``. >>> > * Deprecate the financial functions in the ``numpy`` namespace, >>> > beginning in >>> > NumPy version 1.18. Remove the financial functions from NumPy >>> > version 1.20. >>> > >>> > >>> > Backward compatibility >>> > ---------------------- >>> > >>> > The removal of these functions breaks backward compatibility, as >>> > explained >>> > earlier. The effects are mitigated by providing the >>> > ``numpy_financial`` >>> > library. >>> > >>> > >>> > Alternatives >>> > ------------ >>> > >>> > The following alternatives were mentioned in [5]_: >>> > >>> > * *Maintain the functions as they are (i.e. do nothing).* >>> > A review of the history makes clear that this is not the preference >>> > of many >>> > NumPy developers. A recurring comment is that the functions simply >>> > do not >>> > belong in NumPy. When that sentiment is combined with the history >>> > of bug >>> > reports and the ongoing questions about the correctness of the >>> > functions, the >>> > conclusion is that the cleanest solution is deprecation and >>> > removal. >>> > * *Move the functions from the ``numpy`` namespace to >>> > ``numpy.financial``.* >>> > This was the initial suggestion in [5]_. Such a change does not >>> > address the >>> > maintenance issues, and doesn't change the misfit that many >>> > developers see >>> > between these functions and NumPy. It causes disruption for the >>> > current >>> > users of these functions without addressing what many developers >>> > see as the >>> > fundamental problem. 
>>> > >>> > >>> > Discussion >>> > ---------- >>> > >>> > Links to past mailing list discussions, and to relevant GitHub issues >>> > and pull >>> > requests, have already been given. >>> > >>> > >>> > References and footnotes >>> > ------------------------ >>> > >>> > .. [1] Financial functions, >>> > https://numpy.org/doc/1.17/reference/routines.financial.html >>> > >>> > .. [2] Numpy-discussion mailing list, "Simple financial functions for >>> > NumPy", >>> > >>> > https://mail.python.org/pipermail/numpy-discussion/2008-April/032353.html >>> > >>> > .. [3] Numpy-discussion mailing list, "add xirr to numpy financial >>> > functions?", >>> > >>> > https://mail.python.org/pipermail/numpy-discussion/2009-May/042645.html >>> > >>> > .. [4] Numpy-discussion mailing list, "Definitions of pv, fv, nper, >>> > pmt, and rate", >>> > >>> > https://mail.python.org/pipermail/numpy-discussion/2009-June/043188.html >>> > >>> > .. [5] Get financial functions out of main namespace, >>> > https://github.com/numpy/numpy/issues/2880 >>> > >>> > .. [6] Numpy-discussion mailing list, "Deprecation of financial >>> > routines", >>> > >>> > https://mail.python.org/pipermail/numpy-discussion/2013-August/067409.html >>> > >>> > .. [7] ``component: numpy.lib.financial`` issues, >>> > >>> > https://github.com/numpy/numpy/issues?utf8=%E2%9C%93&q=is%3Aissue+label%3A%22component%3A+numpy.lib.financial%22+ >>> > >>> > .. [8] ``component: numpy.lib.financial`` pull request, >>> > >>> > https://github.com/numpy/numpy/pulls?utf8=%E2%9C%93&q=is%3Apr+label%3A%22component%3A+numpy.lib.financial%22+ >>> > >>> > .. [9] Quansight-Labs/python-api-inspect, >>> > https://github.com/Quansight-Labs/python-api-inspect/ >>> > >>> > >>> > Copyright >>> > --------- >>> > >>> > This document has been placed in the public domain. >>> > >>> > _______________________________________________ >>> > NumPy-Discussion mailing list >>> > NumPy-Discussion at python.org >>> > https://mail.python.org/mailman/listinfo/numpy-discussion >>> _______________________________________________ >>> NumPy-Discussion mailing list >>> NumPy-Discussion at python.org >>> https://mail.python.org/mailman/listinfo/numpy-discussion >> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at python.org >> https://mail.python.org/mailman/listinfo/numpy-discussion > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > From njs at pobox.com Sun Sep 8 21:26:43 2019 From: njs at pobox.com (Nathaniel Smith) Date: Sun, 8 Sep 2019 18:26:43 -0700 Subject: [Numpy-discussion] =?utf-8?q?NEP_31_=E2=80=94_Context-local_and_?= =?utf-8?q?global_overrides_of_the_NumPy_API?= In-Reply-To: References: Message-ID: On Sun, Sep 8, 2019 at 8:40 AM Ralf Gommers wrote: > > > > On Sun, Sep 8, 2019 at 12:54 AM Nathaniel Smith wrote: >> >> On Fri, Sep 6, 2019 at 11:53 AM Ralf Gommers wrote: >> > On Fri, Sep 6, 2019 at 12:53 AM Nathaniel Smith wrote: >> >> On Tue, Sep 3, 2019 at 2:04 AM Hameer Abbasi wrote: >> >> > The fact that we're having to design more and more protocols for a lot >> >> > of very similar things is, to me, an indicator that we do have holistic >> >> > problems that ought to be solved by a single protocol. >> >> >> >> But the reason we've had trouble designing these protocols is that >> >> they're each different :-). If it was just a matter of copying >> >> __array_ufunc__ we'd have been done in a few minutes... 
>> > >> > I don't think that argument is correct. That we now have two very similar protocols is simply a matter of history and limited developer time. NEP 18 discusses in several places that __array_ufunc__ should be brought in line with __array_ufunc__, and that we can migrate a function from one protocol to the other. There's no technical reason other than backwards compat and dev time why we couldn't use __array_function__ for ufuncs also. >> >> Huh, that's interesting! Apparently we have a profoundly different >> understanding of what we're doing here. > > > That is interesting indeed. We should figure this out first - no point discussing a NEP about plugging the gaps in our override system when we don't have a common understanding of why we wanted/needed an override system in the first place. > >> To me, __array_ufunc__ and >> __array_function__ are completely different. In fact I'd say >> __array_ufunc__ is a good idea and __array_function__ is a bad idea, > > > It's early days, but "customer feedback" certainly has been more enthusiastic for __array_function__. Also from what I've seen so far it works well. Example: at the SciPy sprints someone put together Xarray plus pydata/sparse to use distributed sparse arrays for visualizing some large genetic (I think) data sets. That was made to work in a single day, with impressively little code. Yeah, it's true, and __array_function__ made a bunch of stuff that used to be impossible become possible, I'm not saying it didn't. My prediction is that the longer we live with it, the more limits we'll hit and the more problems we'll have with long-term maintainability. I don't think initial enthusiasm is a good predictor of that either way. >> The key difference is that __array_ufunc__ allows for *generic* >> implementations. > > Implementations of what? Generic in the sense that you can write __array_ufunc__ once and have it work for all ufuncs. >> Most duck array libraries can write a single >> implementation of __array_ufunc__ that works for *all* ufuncs, even >> new third-party ufuncs that the duck array library has never heard of, > > > I see where you're going with this. You are thinking of reusing the ufunc implementation to do a computation. That's a minor use case (imho), and I can't remember seeing it used. I mean, I just looked at dask and xarray, and they're both doing exactly what I said, right now in shipping code. What use cases are you targeting here if you consider dask and xarray out-of-scope? :-) > this is case where knowing if something is a ufunc helps use a property of it. so there the more specialized nature of __array_ufunc__ helps. Seems niche though, and could probably also be done by checking if a function is an instance of np.ufunc via __array_function__ Sparse arrays aren't very niche... and the isinstance trick is possible in some cases, but (a) it's relying on an undocumented implementation detail of __array_function__; according to __array_function__'s API contract, you could just as easily get passed the ufunc's __call__ method instead of the object itself, and (b) it doesn't work at all for ufunc methods like reduce, outer, accumulate. These are both show-stoppers IMO. > This last point, using third-party ufuncs, is the interesting one to me. They have to be generated with the NumPy ufunc machinery, so the dispatch mechanism is attached to them. We could do third party functions for __array_function__ too, but that would require making @array_function_dispatch public, which we haven't done (yet?). 
With __array_function__ it's theoretically possible to do the dispatch on third-party functions, but when someone defines a new function they always have to go update all the duck array libraries to hard-code in some special knowledge of their new function. So in my example, even if we made @array_function_dispatch public, you still couldn't use your nice new numba-created gufunc unless you first convinced dask, xarray, and bcolz to all accept patches to support your new gufunc. With __array_ufunc__, it works out-of-the-box. > But what is that road, and what do you think the goal is? To me it's: separate our API from our implementation. Yours seems to be "reuse our implementations" for __array_ufunc__, but I can't see how that generalizes beyond ufuncs. The road is to define *abstractions* for the operations we expose through our API, so that duck array implementors can work against a contract with well-defined preconditions and postconditions, so they can write code the works reliably even when the surrounding environment changes. That's the only way to keep things maintainable AFAICT. If the API contract is just a vague handwave at the numpy API, then no-one knows which details actually matter, it's impossible to test, implementations will inevitably end up with subtle long-standing bugs, and literally any change in numpy could potentially break duck array users, we don't know. So my motivation is that I like testing, I don't like bugs, and I like being able to maintain things with confidence :-). The principles are much more general than ufuncs; that's just a pertinent example. > I think this is an important point. GPUs are massively popular, and when very likely just continue to grow in importance. So anything we do in this space that says "well it works, just not for GPUs" is probably not going to solve our most pressing problems. I'm not saying "__array_ufunc__ doesn't work for GPUs". I'm saying that when it comes to GPUs, there's an upper bound for how good you can hope to do, and __array_ufunc__ achieves that upper bound. So does __array_function__. So if we only care about GPUs, they're about equally good. But if we also care about dask and xarray and compressed storage and sparse storage and ... then __array_ufunc__ is strictly superior in those cases. So replacing __array_ufunc__ with __array_function__ would be a major backwards step. -n -- Nathaniel J. Smith -- https://vorpus.org From nathan.goldbaum at gmail.com Sun Sep 8 22:29:18 2019 From: nathan.goldbaum at gmail.com (Nathan) Date: Sun, 8 Sep 2019 20:29:18 -0600 Subject: [Numpy-discussion] =?utf-8?q?NEP_31_=E2=80=94_Context-local_and_?= =?utf-8?q?global_overrides_of_the_NumPy_API?= In-Reply-To: References: Message-ID: On Sun, Sep 8, 2019 at 7:27 PM Nathaniel Smith wrote: > On Sun, Sep 8, 2019 at 8:40 AM Ralf Gommers > wrote: > > > > > > > > On Sun, Sep 8, 2019 at 12:54 AM Nathaniel Smith wrote: > >> > >> On Fri, Sep 6, 2019 at 11:53 AM Ralf Gommers > wrote: > >> > On Fri, Sep 6, 2019 at 12:53 AM Nathaniel Smith > wrote: > >> >> On Tue, Sep 3, 2019 at 2:04 AM Hameer Abbasi < > einstein.edison at gmail.com> wrote: > >> >> > The fact that we're having to design more and more protocols for a > lot > >> >> > of very similar things is, to me, an indicator that we do have > holistic > >> >> > problems that ought to be solved by a single protocol. > >> >> > >> >> But the reason we've had trouble designing these protocols is that > >> >> they're each different :-). 
If it was just a matter of copying > >> >> __array_ufunc__ we'd have been done in a few minutes... > >> > > >> > I don't think that argument is correct. That we now have two very > similar protocols is simply a matter of history and limited developer time. > NEP 18 discusses in several places that __array_ufunc__ should be brought > in line with __array_ufunc__, and that we can migrate a function from one > protocol to the other. There's no technical reason other than backwards > compat and dev time why we couldn't use __array_function__ for ufuncs also. > >> > >> Huh, that's interesting! Apparently we have a profoundly different > >> understanding of what we're doing here. > > > > > > That is interesting indeed. We should figure this out first - no point > discussing a NEP about plugging the gaps in our override system when we > don't have a common understanding of why we wanted/needed an override > system in the first place. > > > >> To me, __array_ufunc__ and > >> __array_function__ are completely different. In fact I'd say > >> __array_ufunc__ is a good idea and __array_function__ is a bad idea, > > > > > > It's early days, but "customer feedback" certainly has been more > enthusiastic for __array_function__. Also from what I've seen so far it > works well. Example: at the SciPy sprints someone put together Xarray plus > pydata/sparse to use distributed sparse arrays for visualizing some large > genetic (I think) data sets. That was made to work in a single day, with > impressively little code. > > Yeah, it's true, and __array_function__ made a bunch of stuff that > used to be impossible become possible, I'm not saying it didn't. My > prediction is that the longer we live with it, the more limits we'll > hit and the more problems we'll have with long-term maintainability. I > don't think initial enthusiasm is a good predictor of that either way. > > >> The key difference is that __array_ufunc__ allows for *generic* > >> implementations. > > > > Implementations of what? > > Generic in the sense that you can write __array_ufunc__ once and have > it work for all ufuncs. > > >> Most duck array libraries can write a single > >> implementation of __array_ufunc__ that works for *all* ufuncs, even > >> new third-party ufuncs that the duck array library has never heard of, > > > > > > I see where you're going with this. You are thinking of reusing the > ufunc implementation to do a computation. That's a minor use case (imho), > and I can't remember seeing it used. > > I mean, I just looked at dask and xarray, and they're both doing > exactly what I said, right now in shipping code. What use cases are > you targeting here if you consider dask and xarray out-of-scope? :-) > > > this is case where knowing if something is a ufunc helps use a property > of it. so there the more specialized nature of __array_ufunc__ helps. Seems > niche though, and could probably also be done by checking if a function is > an instance of np.ufunc via __array_function__ > > Sparse arrays aren't very niche... and the isinstance trick is > possible in some cases, but (a) it's relying on an undocumented > implementation detail of __array_function__; according to > __array_function__'s API contract, you could just as easily get passed > the ufunc's __call__ method instead of the object itself, and (b) it > doesn't work at all for ufunc methods like reduce, outer, accumulate. > These are both show-stoppers IMO. > > > This last point, using third-party ufuncs, is the interesting one to me. 
> They have to be generated with the NumPy ufunc machinery, so the dispatch > mechanism is attached to them. We could do third party functions for > __array_function__ too, but that would require making > @array_function_dispatch public, which we haven't done (yet?). > > With __array_function__ it's theoretically possible to do the dispatch > on third-party functions, but when someone defines a new function they > always have to go update all the duck array libraries to hard-code in > some special knowledge of their new function. So in my example, even > if we made @array_function_dispatch public, you still couldn't use > your nice new numba-created gufunc unless you first convinced dask, > xarray, and bcolz to all accept patches to support your new gufunc. > With __array_ufunc__, it works out-of-the-box. > > > But what is that road, and what do you think the goal is? To me it's: > separate our API from our implementation. Yours seems to be "reuse our > implementations" for __array_ufunc__, but I can't see how that generalizes > beyond ufuncs. > > The road is to define *abstractions* for the operations we expose > through our API, so that duck array implementors can work against a > contract with well-defined preconditions and postconditions, so they > can write code the works reliably even when the surrounding > environment changes. That's the only way to keep things maintainable > AFAICT. If the API contract is just a vague handwave at the numpy API, > then no-one knows which details actually matter, it's impossible to > test, implementations will inevitably end up with subtle long-standing > bugs, and literally any change in numpy could potentially break duck > array users, we don't know. So my motivation is that I like testing, I > don't like bugs, and I like being able to maintain things with > confidence :-). The principles are much more general than ufuncs; > that's just a pertinent example. > > > I think this is an important point. GPUs are massively popular, and when > very likely just continue to grow in importance. So anything we do in this > space that says "well it works, just not for GPUs" is probably not going to > solve our most pressing problems. > > I'm not saying "__array_ufunc__ doesn't work for GPUs". I'm saying > that when it comes to GPUs, there's an upper bound for how good you > can hope to do, and __array_ufunc__ achieves that upper bound. So does > __array_function__. So if we only care about GPUs, they're about > equally good. But if we also care about dask and xarray and compressed > storage and sparse storage and ... then __array_ufunc__ is strictly > superior in those cases. So replacing __array_ufunc__ with > __array_function__ would be a major backwards step. One case that hasn't been brought up in this thread is unit-handling. For example, unyt's array_ufunc implementation explicitly handles ufuncs and will bail if someone tries to use a ufunc that unyt doesn't know about. I tried to implement a completely generic solution but ended up concluding I couldn't do that without silently generating answers with incorrect units. I definitely agree with your analysis that this sort of implementation is error-prone; in fact we just had to do a bugfix release to fix clip suddenly not working now that it's a ufunc in numpy 1.17.
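A minimal sketch of that bail-on-unknown-ufuncs pattern -- an __array_ufunc__ that consults an explicit table of known ufuncs and refuses everything else. `Quantity` and the toy unit rules here are invented for illustration and are not unyt's actual implementation; the sketch only handles binary calls on two Quantity operands:

    import numpy as np

    # Explicit table of understood ufuncs, each mapped to a rule for
    # deriving the result's unit; anything absent triggers a bail.
    UNIT_RULES = {
        np.add:      lambda u, v: u if u == v else None,
        np.subtract: lambda u, v: u if u == v else None,
        np.multiply: lambda u, v: "(%s)*(%s)" % (u, v),
    }

    class Quantity:
        """Toy unit-carrying array."""
        def __init__(self, value, unit):
            self.value, self.unit = np.asarray(value), unit

        def __array_ufunc__(self, ufunc, method, *inputs, **kwargs):
            rule = UNIT_RULES.get(ufunc)
            if rule is None or method != "__call__":
                return NotImplemented      # unknown ufunc: refuse to guess units
            x, y = inputs                  # sketch: two Quantity operands only
            unit = rule(x.unit, y.unit)
            if unit is None:
                raise TypeError("incompatible units: %s, %s" % (x.unit, y.unit))
            return Quantity(ufunc(x.value, y.value, **kwargs), unit)

    a = Quantity([1.0, 2.0], "m")
    b = Quantity([3.0, 4.0], "m")
    np.add(a, b).unit   # 'm'
    np.cos(a)           # TypeError: Quantity won't silently drop units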
> -n > > -- > Nathaniel J. Smith -- https://vorpus.org > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: 

From matti.picus at gmail.com Mon Sep 9 04:41:36 2019 From: matti.picus at gmail.com (Matti Picus) Date: Mon, 9 Sep 2019 11:41:36 +0300 Subject: [Numpy-discussion] Using hypothesis in testing Message-ID: An HTML attachment was scrubbed... URL: 

From daniel.knuettel at daknuett.eu Mon Sep 9 05:43:33 2019 From: daniel.knuettel at daknuett.eu (Daniel =?ISO-8859-1?Q?Kn=FCttel?=) Date: Mon, 09 Sep 2019 11:43:33 +0200 Subject: [Numpy-discussion] numpy C-API :: use numpy's random number generator in a ufunc Message-ID: <9370f92917dc8b54c3273eacf2fbabe9b28fc090.camel@daknuett.eu> Hi folks, I currently have a project that requires randomness in a ufunc. In order to keep the ufuncs as reproducible as possible I would like to use numpy's random number generator for that; basically because setting the seed will be more intuitive this way. However I cannot find the documentation of the numpy.random C-API (does it have one?). How would one do that? Cheers, -- Daniel Knüttel 

From dsm054 at gmail.com Mon Sep 9 11:07:59 2019 From: dsm054 at gmail.com (D.S. McNeil) Date: Mon, 9 Sep 2019 08:07:59 -0700 (MST) Subject: [Numpy-discussion] NEP 32: Remove the financial functions from NumPy In-Reply-To: References: <9067a8f06bc885307d1ec726a55bc5fd906c3c62.camel@sipsolutions.net> Message-ID: <1568041679303-0.post@n7.nabble.com> [coming over from the pydata post] I just checked about ~150KLOC of our Python code in a financial context, written by about twenty developers over about four years. Almost every function uses numpy, sometimes directly and sometimes via pandas. It seems like these functions were never used anywhere, and the lead dev on one of the projects responded "never used them; didn't even know they exist". I knew they existed, but even on the rare occasion I need the functionality I need better control over the dates, which means for practical purposes I need something which supports Series natively anyhow. As it is, they also clutter up the namespace in unfriendly ways: if there's going to be a top-level function called np.rate, I don't think this is the one it should be. Admittedly that's more an argument against their current location. Although it wouldn't be useful for us, I could imagine someone finding a package which provides numpy-compatible versions of the many OpenFormula (or whatever the spec is called) functions helpful. Having numpy carry a tiny subset of them doesn't feel productive. +1 for removing them. Doug -- Sent from: http://numpy-discussion.10968.n7.nabble.com/ 

From chunwei.yuan at gmail.com Mon Sep 9 17:27:06 2019 From: chunwei.yuan at gmail.com (Chun-Wei Yuan) Date: Mon, 9 Sep 2019 14:27:06 -0700 Subject: [Numpy-discussion] [JOB] Principal Software Engineer position at IHME Message-ID: *The Institute for Health Metrics and Evaluation (IHME) *has an outstanding opportunity for a full-time *Principal Software Engineer *on our Forecasting/Future Health Scenarios (FHS) team*.* The development arm of the team is responsible for the design and implementation of software to support this effort, and the Principal Software Engineer will lead the development work and supervise engineers on that team. 
IHME?s aim within the FHS portfolio is to create an analytic engine that can model the impact of a wide array of determinants on the trajectory of health outcomes and risks in different countries, projected 25 years into the future, that will allow decision-makers to assess the impact of their potential actions analytically. A recent publication can be found here: https://www.thelancet.com/journals/lancet/article/PIIS0140-6736(18)31694-5/fulltext If you join IHME, you?ll be joining a team of mission-oriented people who are committed to creating a welcoming and diverse workforce that respects and appreciates differences, and embraces collaboration. *Further Information: *See IHME?s website: www.healthdata.org *To Apply and see the whole job description: *Please apply at uw.edu/jobs and search for req 171527 Please direct your questions to Megan at mkmason at uw.edu -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Mon Sep 9 18:19:25 2019 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Mon, 9 Sep 2019 15:19:25 -0700 Subject: [Numpy-discussion] [JOB] Principal Software Engineer position at IHME In-Reply-To: References: Message-ID: On Mon, Sep 9, 2019 at 2:27 PM Chun-Wei Yuan wrote: > *The Institute for Health Metrics and Evaluation (IHME) *has an > outstanding opportunity for a full-time *Principal Software Engineer *on > our Forecasting/Future Health Scenarios (FHS) team*.* The development arm > of the team is responsible for the design and implementation of software to > support this effort, and the Principal Software Engineer will lead the > development work and supervise engineers on that team. IHME?s aim within > the FHS portfolio is to create an analytic engine that can model the impact > of a wide array of determinants on the trajectory of health outcomes and > risks in different countries, projected 25 years into the future, that will > allow decision-makers to assess the impact of their potential actions > analytically. A recent publication can be found here: > https://www.thelancet.com/journals/lancet/article/PIIS0140-6736(18)31694-5/fulltext > > > > If you join IHME, you?ll be joining a team of mission-oriented people who > are committed to creating a welcoming and diverse workforce that respects > and appreciates differences, and embraces collaboration. > > > > *Further Information: *See IHME?s website: www.healthdata.org > > *To Apply and see the whole job description: *Please apply at uw.edu/jobs > > and search for req 171527 > > > Please direct your questions to Megan at mkmason at uw.edu > Hi Chun-Wei, while this seems like an interesting job, it's not clear that it provides an opportunity to contribute back to NumPy or other community projects (that'd be awesome though, and I would encourage you to make that part of this job). For general software job ads (even if they use NumPy), we'd prefer to keep those off this list. Thank you, Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From chunwei.yuan at gmail.com Mon Sep 9 19:14:24 2019 From: chunwei.yuan at gmail.com (Chun-Wei Yuan) Date: Mon, 9 Sep 2019 16:14:24 -0700 Subject: [Numpy-discussion] [JOB] Principal Software Engineer position at IHME In-Reply-To: References: Message-ID: I see. Sorry. I think I misinterpreted "It is okay to post job ads for work involving NumPy/SciPy and related packages if you put [JOB] in the subject". Thanks for the clarification. 
On Mon, Sep 9, 2019 at 3:19 PM Ralf Gommers wrote: > > > On Mon, Sep 9, 2019 at 2:27 PM Chun-Wei Yuan > wrote: > >> *The Institute for Health Metrics and Evaluation (IHME) *has an >> outstanding opportunity for a full-time *Principal Software Engineer *on >> our Forecasting/Future Health Scenarios (FHS) team*.* The development >> arm of the team is responsible for the design and implementation of >> software to support this effort, and the Principal Software Engineer will >> lead the development work and supervise engineers on that team. IHME?s aim >> within the FHS portfolio is to create an analytic engine that can model the >> impact of a wide array of determinants on the trajectory of health outcomes >> and risks in different countries, projected 25 years into the future, that >> will allow decision-makers to assess the impact of their potential actions >> analytically. A recent publication can be found here: >> https://www.thelancet.com/journals/lancet/article/PIIS0140-6736(18)31694-5/fulltext >> >> >> >> If you join IHME, you?ll be joining a team of mission-oriented people who >> are committed to creating a welcoming and diverse workforce that respects >> and appreciates differences, and embraces collaboration. >> >> >> >> *Further Information: *See IHME?s website: www.healthdata.org >> >> *To Apply and see the whole job description: *Please apply at uw.edu/jobs >> >> and search for req 171527 >> >> >> Please direct your questions to Megan at mkmason at uw.edu >> > > Hi Chun-Wei, while this seems like an interesting job, it's not clear that > it provides an opportunity to contribute back to NumPy or other community > projects (that'd be awesome though, and I would encourage you to make that > part of this job). For general software job ads (even if they use NumPy), > we'd prefer to keep those off this list. > > Thank you, > Ralf > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Mon Sep 9 19:26:44 2019 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Mon, 9 Sep 2019 16:26:44 -0700 Subject: [Numpy-discussion] [JOB] Principal Software Engineer position at IHME In-Reply-To: References: Message-ID: On Mon, Sep 9, 2019 at 4:14 PM Chun-Wei Yuan wrote: > I see. Sorry. I think I misinterpreted "It is okay to post job ads for > work involving NumPy/SciPy and related packages if you put [JOB] in the > subject". Thanks for the clarification. > That might be our fault for not updating that page, thanks for pointing that out. That bit of text stems from a time when it was still quite unusual to be able to use NumPy et al. in a job. Luckily these days that's different;) Cheers, Ralf > > On Mon, Sep 9, 2019 at 3:19 PM Ralf Gommers > wrote: > >> >> >> On Mon, Sep 9, 2019 at 2:27 PM Chun-Wei Yuan >> wrote: >> >>> *The Institute for Health Metrics and Evaluation (IHME) *has an >>> outstanding opportunity for a full-time *Principal Software Engineer *on >>> our Forecasting/Future Health Scenarios (FHS) team*.* The development >>> arm of the team is responsible for the design and implementation of >>> software to support this effort, and the Principal Software Engineer will >>> lead the development work and supervise engineers on that team. 
IHME?s aim >>> within the FHS portfolio is to create an analytic engine that can model the >>> impact of a wide array of determinants on the trajectory of health outcomes >>> and risks in different countries, projected 25 years into the future, that >>> will allow decision-makers to assess the impact of their potential actions >>> analytically. A recent publication can be found here: >>> https://www.thelancet.com/journals/lancet/article/PIIS0140-6736(18)31694-5/fulltext >>> >>> >>> >>> If you join IHME, you?ll be joining a team of mission-oriented people >>> who are committed to creating a welcoming and diverse workforce that >>> respects and appreciates differences, and embraces collaboration. >>> >>> >>> >>> *Further Information: *See IHME?s website: www.healthdata.org >>> >>> *To Apply and see the whole job description: *Please apply at >>> uw.edu/jobs >>> >>> and search for req 171527 >>> >>> >>> Please direct your questions to Megan at mkmason at uw.edu >>> >> >> Hi Chun-Wei, while this seems like an interesting job, it's not clear >> that it provides an opportunity to contribute back to NumPy or other >> community projects (that'd be awesome though, and I would encourage you to >> make that part of this job). For general software job ads (even if they use >> NumPy), we'd prefer to keep those off this list. >> >> Thank you, >> Ralf >> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at python.org >> https://mail.python.org/mailman/listinfo/numpy-discussion >> > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Mon Sep 9 21:27:34 2019 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Mon, 9 Sep 2019 18:27:34 -0700 Subject: [Numpy-discussion] =?utf-8?q?NEP_31_=E2=80=94_Context-local_and_?= =?utf-8?q?global_overrides_of_the_NumPy_API?= In-Reply-To: References: Message-ID: On Sun, Sep 8, 2019 at 6:27 PM Nathaniel Smith wrote: > On Sun, Sep 8, 2019 at 8:40 AM Ralf Gommers > wrote: > > > > > > > > On Sun, Sep 8, 2019 at 12:54 AM Nathaniel Smith wrote: > >> > >> On Fri, Sep 6, 2019 at 11:53 AM Ralf Gommers > wrote: > >> > On Fri, Sep 6, 2019 at 12:53 AM Nathaniel Smith > wrote: > >> >> On Tue, Sep 3, 2019 at 2:04 AM Hameer Abbasi < > einstein.edison at gmail.com> wrote: > >> >> > The fact that we're having to design more and more protocols for a > lot > >> >> > of very similar things is, to me, an indicator that we do have > holistic > >> >> > problems that ought to be solved by a single protocol. > >> >> > >> >> But the reason we've had trouble designing these protocols is that > >> >> they're each different :-). If it was just a matter of copying > >> >> __array_ufunc__ we'd have been done in a few minutes... > >> > > >> > I don't think that argument is correct. That we now have two very > similar protocols is simply a matter of history and limited developer time. > NEP 18 discusses in several places that __array_ufunc__ should be brought > in line with __array_ufunc__, and that we can migrate a function from one > protocol to the other. There's no technical reason other than backwards > compat and dev time why we couldn't use __array_function__ for ufuncs also. > >> > >> Huh, that's interesting! Apparently we have a profoundly different > >> understanding of what we're doing here. 
> > > > > > That is interesting indeed. We should figure this out first - no point > discussing a NEP about plugging the gaps in our override system when we > don't have a common understanding of why we wanted/needed an override > system in the first place. > > > >> To me, __array_ufunc__ and > >> __array_function__ are completely different. In fact I'd say > >> __array_ufunc__ is a good idea and __array_function__ is a bad idea, > > > > > > It's early days, but "customer feedback" certainly has been more > enthusiastic for __array_function__. Also from what I've seen so far it > works well. Example: at the SciPy sprints someone put together Xarray plus > pydata/sparse to use distributed sparse arrays for visualizing some large > genetic (I think) data sets. That was made to work in a single day, with > impressively little code. > > Yeah, it's true, and __array_function__ made a bunch of stuff that > used to be impossible become possible, I'm not saying it didn't. My > prediction is that the longer we live with it, the more limits we'll > hit and the more problems we'll have with long-term maintainability. I > don't think initial enthusiasm is a good predictor of that either way. > > >> The key difference is that __array_ufunc__ allows for *generic* > >> implementations. > > > > Implementations of what? > > Generic in the sense that you can write __array_ufunc__ once and have > it work for all ufuncs. > > >> Most duck array libraries can write a single > >> implementation of __array_ufunc__ that works for *all* ufuncs, even > >> new third-party ufuncs that the duck array library has never heard of, > > > > > > I see where you're going with this. You are thinking of reusing the > ufunc implementation to do a computation. That's a minor use case (imho), > and I can't remember seeing it used. > > I mean, I just looked at dask and xarray, and they're both doing > exactly what I said, right now in shipping code. What use cases are > you targeting here if you consider dask and xarray out-of-scope? :-) > I don't think that's the interesting part, or even right. When you call `np.cos(dask_array_of_cupy_arrays)`, it certainly will not reuse the NumPy ufunc np.cos. It will call da.cos, and that will in turn call cupy.cos. Yes it will call np.cos if you feed it a dask array that contains a NumPy ndarray under the hood. But that's equally true of np.mean, which is not a ufunc. The story here is ~95% parallel for __array_ufunc__ and __array_function__. When I said not seeing used, I meant in ways that are fundamentally different between those two protocols. > > this is case where knowing if something is a ufunc helps use a property > of it. so there the more specialized nature of __array_ufunc__ helps. Seems > niche though, and could probably also be done by checking if a function is > an instance of np.ufunc via __array_function__ > > Sparse arrays aren't very niche... and the isinstance trick is > possible in some cases, but (a) it's relying on an undocumented > implementation detail of __array_function__; according to > __array_function__'s API contract, you could just as easily get passed > the ufunc's __call__ method instead of the object itself, That seems to be a matter of making it documented? Currently the dispatcher is only attached to functions, not methods. and (b) it > doesn't work at all for ufunc methods like reduce, outer, accumulate. 
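A quick illustration of how those ufunc methods reach __array_ufunc__ as a `method` string, which is the mechanism at issue here; `LoggingArray` is a toy class invented for the demonstration:

    import numpy as np

    class LoggingArray:
        """Toy duck array that just reports how ufunc calls arrive."""
        def __init__(self, data):
            self.data = np.asarray(data)

        def __array_ufunc__(self, ufunc, method, *inputs, **kwargs):
            print(ufunc.__name__, "method=%r" % method)
            args = [x.data if isinstance(x, LoggingArray) else x for x in inputs]
            return getattr(ufunc, method)(*args, **kwargs)

    x = LoggingArray([1, 2, 3])
    np.add(x, 1)        # prints: add method='__call__'
    np.add.reduce(x)    # prints: add method='reduce'
    np.add.outer(x, x)  # prints: add method='outer'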
No idea without looking in more detail if this can be made to work, but a quick count in the SciPy code base says ~10 uses of .reduce, 2 of .outer and 0 of .accumulate. So hardly showstoppers I'd say. These are both show-stoppers IMO. > > > This last point, using third-party ufuncs, is the interesting one to me. > They have to be generated with the NumPy ufunc machinery, so the dispatch > mechanism is attached to them. We could do third party functions for > __array_function__ too, but that would require making > @array_function_dispatch public, which we haven't done (yet?). > > With __array_function__ it's theoretically possible to do the dispatch > on third-party functions, but when someone defines a new function they > always have to go update all the duck array libraries to hard-code in > some special knowledge of their new function. So in my example, even > if we made @array_function_dispatch public, you still couldn't use > your nice new numba-created gufunc unless you first convinced dask, > xarray, and bcolz to all accept patches to support your new gufunc. > With __array_ufunc__, it works out-of-the-box. > Yep that's true. May still be better than not doing anything though, in some cases. You'll get a TypeError with a clear message for functions that aren't implemented, for something that otherwise likely doesn't work either. > > But what is that road, and what do you think the goal is? To me it's: > separate our API from our implementation. Yours seems to be "reuse our > implementations" for __array_ufunc__, but I can't see how that generalizes > beyond ufuncs. > > The road is to define *abstractions* for the operations we expose > through our API, so that duck array implementors can work against a > contract with well-defined preconditions and postconditions, so they > can write code the works reliably even when the surrounding > environment changes. That's the only way to keep things maintainable > AFAICT. If the API contract is just a vague handwave at the numpy API, > then no-one knows which details actually matter, it's impossible to > test, implementations will inevitably end up with subtle long-standing > bugs, and literally any change in numpy could potentially break duck > array users, we don't know. So my motivation is that I like testing, I > don't like bugs, and I like being able to maintain things with > confidence :-). The principles are much more general than ufuncs; > that's just a pertinent example. > Well, it's hard to argue with that in the abstract. I like all those things too :) The question is, what does that mean concretely? Most of the NumPy API, (g)ufuncs excepted, doesn't have well-defined abstractions, and it's hard to imagine we'll get those even if we could be more liberal with backwards compat. Most functions are just, well, functions. You can dispatch on them, or not. Your preference seems to be the latter, but I have a hard time figuring out how that translates into anything but "do nothing". Do you have a concrete alternative? I think we've chosen to try the former - dispatch on functions so we can reuse the NumPy API. It could work out well, it could give some long-term maintenance issues, time will tell. The question is now if and how to plug the gap that __array_function__ left. Its main limitation is "doesn't work for functions that don't have an array-like input" - that left out ~10-20% of functions. So now we have a proposal for a structural solution to that last 10-20%.
It seems logical to want that gap plugged, rather than go back and say "we shouldn't have gone for the first 80%, so let's go no further". > > I think this is an important point. GPUs are massively popular, and when > very likely just continue to grow in importance. So anything we do in this > space that says "well it works, just not for GPUs" is probably not going to > solve our most pressing problems. > > I'm not saying "__array_ufunc__ doesn't work for GPUs". I'm saying > that when it comes to GPUs, there's an upper bound for how good you > can hope to do, and __array_ufunc__ achieves that upper bound. So does > __array_function__. So if we only care about GPUs, they're about > equally good. Indeed. But if we also care about dask and xarray and compressed > storage and sparse storage and ... then __array_ufunc__ is strictly > superior in those cases. That it's superior is not really interesting though, is it? Their main characteristic (the actual override) is identical, and then ufuncs go a bit further. I think to convince me you're going to have to come up with an actual alternative plan to `__array_ufunc__ + __array_function__ + unumpy-or-alternative-to-it`. And re maintenance worries: I think cleaning up our API surface and namespaces will go *much* further than yes/no on overrides. > So replacing __array_ufunc__ with > __array_function__ would be a major backwards step. > To be 100% clear, no one is actually proposing this. Cheers, Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: 

From shoyer at gmail.com Mon Sep 9 23:32:48 2019 From: shoyer at gmail.com (Stephan Hoyer) Date: Mon, 9 Sep 2019 20:32:48 -0700 Subject: [Numpy-discussion] =?utf-8?q?NEP_31_=E2=80=94_Context-local_and_?= =?utf-8?q?global_overrides_of_the_NumPy_API?= In-Reply-To: References: Message-ID: On Mon, Sep 9, 2019 at 6:27 PM Ralf Gommers wrote: > I think we've chosen to try the former - dispatch on functions so we can > reuse the NumPy API. It could work out well, it could give some long-term > maintenance issues, time will tell. The question is now if and how to plug > the gap that __array_function__ left. It's main limitation is "doesn't work > for functions that don't have an array-like input" - that left out ~10-20% > of functions. So now we have a proposal for a structural solution to that > last 10-20%. It seems logical to want that gap plugged, rather than go back > and say "we shouldn't have gone for the first 80%, so let's go no further". > I'm excited about solving the remaining 10-20% of use cases for flexible array dispatching, but the unumpy interface suggested here (numpy.overridable) feels like a redundant redo of __array_function__ and __array_ufunc__. I would much rather continue to develop specialized protocols for the remaining use cases. Summarizing those I've seen in this thread, these include: 1. Overrides for customizing array creation and coercion. 2. Overrides to implement operations for new dtypes. 3. Overriding implementations of NumPy functions, e.g., FFT and ufuncs with MKL. (1) could mostly be solved by adding np.duckarray() and another function for duck array coercion. There is still the matter of overriding np.zeros and the like, which perhaps justifies another new protocol, but in my experience the use cases for truly creating an array from scratch are quite rare. (2) should be tackled as part of overhauling NumPy's dtype system to better support user defined dtypes. 
But it should definitely be in the form of specialized protocols, e.g., which pass preallocated arrays into ufuncs for a new dtype. By design, new dtypes should not be able to customize the semantics of array *structure*. (3) could potentially motivate a new solution, but it should exist *inside* of select existing NumPy implementations, after checking for overrides with __array_function__. If the only option NumPy provides for overriding np.fft is to implement np.overridable.fft, I doubt that would suffice to keep MKL developers from monkey patching it -- they already decided that a separate namespace is not good enough for them. I also share Nathaniel's concern that the overrides in unumpy are too powerful, by allowing for control from arbitrary function arguments and even *non-local* control (i.e., global variables) from context managers. This level of flexibility can make code very hard to debug, especially in larger codebases. Best, Stephan -------------- next part -------------- An HTML attachment was scrubbed... URL: 

From sebastian at sipsolutions.net Tue Sep 10 00:17:41 2019 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Mon, 09 Sep 2019 21:17:41 -0700 Subject: [Numpy-discussion] =?utf-8?q?NEP_31_=E2=80=94_Context-local_and_?= =?utf-8?q?global_overrides_of_the_NumPy_API?= In-Reply-To: References: Message-ID: On Mon, 2019-09-09 at 20:32 -0700, Stephan Hoyer wrote: > On Mon, Sep 9, 2019 at 6:27 PM Ralf Gommers > wrote: > > I think we've chosen to try the former - dispatch on functions so > > we can reuse the NumPy API. It could work out well, it could give > > some long-term maintenance issues, time will tell. The question is > > now if and how to plug the gap that __array_function__ left. It's > > main limitation is "doesn't work for functions that don't have an > > array-like input" - that left out ~10-20% of functions. So now we > > have a proposal for a structural solution to that last 10-20%. It > > seems logical to want that gap plugged, rather than go back and say > > "we shouldn't have gone for the first 80%, so let's go no further". > > > > I'm excited about solving the remaining 10-20% of use cases for > flexible array dispatching, but the unumpy interface suggested here > (numpy.overridable) feels like a redundant redo of __array_function__ > and __array_ufunc__. > > I would much rather continue to develop specialized protocols for the > remaining usecases. Summarizing those I've seen in this thread, these > include: > 1. Overrides for customizing array creation and coercion. > 2. Overrides to implement operations for new dtypes. > 3. Overriding implementations of NumPy functions, e.g., FFT and > ufuncs with MKL. > > (1) could mostly be solved by adding np.duckarray() and another > function for duck array coercion. There is still the matter of > overriding np.zeros and the like, which perhaps justifies another new > protocol, but in my experience the use-cases for truly an array from > scratch are quite rare. > There is an issue open about adding more functions for that. Made me wonder if giving a method of choosing the duck-array whose `__array_function__` is used could not solve it reasonably. Similar to explicitly choosing a specific template version to call in templated code. In other words `np.arange(100)` (but with a completely different syntax, probably hidden away only for libraries to use). Maybe it is indeed time to write up a list of options to plug that hole, and then see where it brings us. 
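A rough sketch of the kind of explicit, template-style dispatch being described, calling NEP 18's protocol signature __array_function__(func, types, args, kwargs) directly; `arange_like` is a hypothetical helper name, and it assumes the template object implements the protocol (plain ndarrays gained a default implementation with NEP 18):

    import numpy as np

    def arange_like(ref, *args, **kwargs):
        # Route np.arange through the __array_function__ of a chosen
        # "template" object, so array creation lands in ref's library
        # rather than in NumPy itself.
        return ref.__array_function__(np.arange, (type(ref),), args, kwargs)

    # With a plain ndarray as the template this is just np.arange(10);
    # a duck array implementing the protocol would return its own kind.
    arange_like(np.empty(0), 10)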
Best, Sebastian > (2) should be tackled as part of overhauling NumPy's dtype system to > better support user defined dtypes. But it should definitely be in > the form of specialized protocols, e.g., which pass in preallocated > arrays to into ufuncs for a new dtype. By design, new dtypes should > not be able to customize the semantics of array *structure*. > > (3) could potentially motivate a new solution, but it should exist > *inside* of select existing NumPy implementations, after checking for > overrides with __array_function__. If the only option NumPy provides > for overriding np.fft is to implement np.overrideable.fft, I doubt > that would suffice to convince MKL developers from monkey patching it > -- they already decided that a separate namespace is not good enough > for them. > > I also share Nathaniel's concern that the overrides in unumpy are too > powerful, by allowing for control from arbitrary function arguments > and even *non-local* control (i.e., global variables) from context > managers. This level of flexibility can make code very hard to debug, > especially in larger codebases. > > Best, > Stephan > > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 833 bytes Desc: This is a digitally signed message part URL: From wieser.eric+numpy at gmail.com Tue Sep 10 01:26:14 2019 From: wieser.eric+numpy at gmail.com (Eric Wieser) Date: Mon, 9 Sep 2019 22:26:14 -0700 Subject: [Numpy-discussion] =?utf-8?q?NEP_31_=E2=80=94_Context-local_and_?= =?utf-8?q?global_overrides_of_the_NumPy_API?= In-Reply-To: References: Message-ID: > In other words `np.arange(100)` (but with a completely different syntax, probably hidden away only for libraries to use). It sounds an bit like you're describing factory classmethods there. Is the solution to this problem to move (leaving behind aliases) `np.arange` to `ndarray.arange`, `np.zeros` to `ndarray.zeros`, etc - callers then would use `type(duckarray).zeros` if they're trying to generalize. Eric On Mon, Sep 9, 2019, 21:18 Sebastian Berg wrote: > On Mon, 2019-09-09 at 20:32 -0700, Stephan Hoyer wrote: > > On Mon, Sep 9, 2019 at 6:27 PM Ralf Gommers > > wrote: > > > I think we've chosen to try the former - dispatch on functions so > > > we can reuse the NumPy API. It could work out well, it could give > > > some long-term maintenance issues, time will tell. The question is > > > now if and how to plug the gap that __array_function__ left. It's > > > main limitation is "doesn't work for functions that don't have an > > > array-like input" - that left out ~10-20% of functions. So now we > > > have a proposal for a structural solution to that last 10-20%. It > > > seems logical to want that gap plugged, rather than go back and say > > > "we shouldn't have gone for the first 80%, so let's go no further". > > > > > > > I'm excited about solving the remaining 10-20% of use cases for > > flexible array dispatching, but the unumpy interface suggested here > > (numpy.overridable) feels like a redundant redo of __array_function__ > > and __array_ufunc__. > > > > I would much rather continue to develop specialized protocols for the > > remaining usecases. Summarizing those I've seen in this thread, these > > include: > > 1. Overrides for customizing array creation and coercion. > > 2. 
Overrides to implement operations for new dtypes. > > 3. Overriding implementations of NumPy functions, e.g., FFT and > > ufuncs with MKL. > > > > (1) could mostly be solved by adding np.duckarray() and another > > function for duck array coercion. There is still the matter of > > overriding np.zeros and the like, which perhaps justifies another new > > protocol, but in my experience the use-cases for truly an array from > > scratch are quite rare. > > > > There is an issue open about adding more functions for that. Made me > wonder if giving a method of choosing the duck-array whose > `__array_function__` is used, could not solve it reasonably. > Similar to explicitly choosing a specific template version to call in > templated code. In other words `np.arange(100)` (but > with a completely different syntax, probably hidden away only for > libraries to use). > > > Maybe it is indeed time to write up a list of options to plug that > hole, and then see where it brings us. > > Best, > > Sebastian > > > > (2) should be tackled as part of overhauling NumPy's dtype system to > > better support user defined dtypes. But it should definitely be in > > the form of specialized protocols, e.g., which pass in preallocated > > arrays to into ufuncs for a new dtype. By design, new dtypes should > > not be able to customize the semantics of array *structure*. > > > > (3) could potentially motivate a new solution, but it should exist > > *inside* of select existing NumPy implementations, after checking for > > overrides with __array_function__. If the only option NumPy provides > > for overriding np.fft is to implement np.overrideable.fft, I doubt > > that would suffice to convince MKL developers from monkey patching it > > -- they already decided that a separate namespace is not good enough > > for them. > > > > I also share Nathaniel's concern that the overrides in unumpy are too > > powerful, by allowing for control from arbitrary function arguments > > and even *non-local* control (i.e., global variables) from context > > managers. This level of flexibility can make code very hard to debug, > > especially in larger codebases. > > > > Best, > > Stephan > > > > > > > > > > _______________________________________________ > > NumPy-Discussion mailing list > > NumPy-Discussion at python.org > > https://mail.python.org/mailman/listinfo/numpy-discussion > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From pankaj.jangid at gmail.com Tue Sep 10 02:04:35 2019 From: pankaj.jangid at gmail.com (Pankaj Jangid) Date: Tue, 10 Sep 2019 11:34:35 +0530 Subject: [Numpy-discussion] [JOB] Principal Software Engineer position at IHME In-Reply-To: (Ralf Gommers's message of "Mon, 9 Sep 2019 16:26:44 -0700") References: Message-ID: Ralf Gommers writes: > On Mon, Sep 9, 2019 at 4:14 PM Chun-Wei Yuan wrote: >> I see. Sorry. I think I misinterpreted "It is okay to post job ads for >> work involving NumPy/SciPy and related packages if you put [JOB] in the >> subject". Thanks for the clarification. > That might be our fault for not updating that page, thanks for pointing > that out. That bit of text stems from a time when it was still quite > unusual to be able to use NumPy et al. in a job. Luckily these days that's > different;) Times are changing ? 
-- Pankaj Jangid From sebastian at sipsolutions.net Tue Sep 10 02:11:58 2019 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Mon, 09 Sep 2019 23:11:58 -0700 Subject: [Numpy-discussion] =?utf-8?q?NEP_31_=E2=80=94_Context-local_and_?= =?utf-8?q?global_overrides_of_the_NumPy_API?= In-Reply-To: References: Message-ID: On Mon, 2019-09-09 at 22:26 -0700, Eric Wieser wrote: > > In other words `np.arange(100)` (but > with a completely different syntax, probably hidden away only for > libraries to use). > > It sounds a bit like you're describing factory classmethods there. > Is the solution to this problem to move (leaving behind aliases) > `np.arange` to `ndarray.arange`, `np.zeros` to `ndarray.zeros`, etc - > callers then would use `type(duckarray).zeros` if they're trying to > generalize. > Yeah, factory classmethod is probably the better way to describe it. The question is where you hide them away conveniently (and how to access them). And of course if/what completely different alternatives exist. In a sense, `__array_function__` is a bit like a collection of operator dunder methods, I guess. So, we need another collection for classmethods. And that was the quick, possibly silly, idea to also use `__array_function__`. So yeah, there is not much of a point in not simply creating another place for them, or even using individual dunder classmethods. But we still need an "operator"/function to access them nicely, unless we want to force `type(duckarray).?` on library authors. I guess the important thing is mostly what would be convenient to downstream implementers. - Sebastian > Eric > > On Mon, Sep 9, 2019, 21:18 Sebastian Berg > wrote: > > On Mon, 2019-09-09 at 20:32 -0700, Stephan Hoyer wrote: > > > On Mon, Sep 9, 2019 at 6:27 PM Ralf Gommers < > > ralf.gommers at gmail.com> > > > wrote: > > > > I think we've chosen to try the former - dispatch on functions > > so > > > > we can reuse the NumPy API. It could work out well, it could > > give > > > > some long-term maintenance issues, time will tell. The question > > is > > > > now if and how to plug the gap that __array_function__ left. > > It's > > > > main limitation is "doesn't work for functions that don't have > > an > > > > array-like input" - that left out ~10-20% of functions. So now > > we > > > > have a proposal for a structural solution to that last 10-20%. > > It > > > > seems logical to want that gap plugged, rather than go back and > > say > > > > "we shouldn't have gone for the first 80%, so let's go no > > further". > > > > > > > > > > I'm excited about solving the remaining 10-20% of use cases for > > > flexible array dispatching, but the unumpy interface suggested > > here > > > (numpy.overridable) feels like a redundant redo of > > __array_function__ > > > and __array_ufunc__. > > > > > > I would much rather continue to develop specialized protocols for > > the > > > remaining usecases. Summarizing those I've seen in this thread, > > these > > > include: > > > 1. Overrides for customizing array creation and coercion. > > > 2. Overrides to implement operations for new dtypes. > > > 3. Overriding implementations of NumPy functions, e.g., FFT and > > > ufuncs with MKL. > > > > > > (1) could mostly be solved by adding np.duckarray() and another > > > function for duck array coercion. There is still the matter of > > > overriding np.zeros and the like, which perhaps justifies another > > new > > > protocol, but in my experience the use-cases for truly an array > > from > > > scratch are quite rare. 
> > > > > > > There is an issue open about adding more functions for that. Made > > me > > wonder if giving a method of choosing the duck-array whose > > `__array_function__` is used, could not solve it reasonably. > > Similar to explicitly choosing a specific template version to call > > in > > templated code. In other words `np.arange(100)` > > (but > > with a completely different syntax, probably hidden away only for > > libraries to use). > > > > > > Maybe it is indeed time to write up a list of options to plug that > > hole, and then see where it brings us. > > > > Best, > > > > Sebastian > > > > > > > (2) should be tackled as part of overhauling NumPy's dtype system > > to > > > better support user defined dtypes. But it should definitely be > > in > > > the form of specialized protocols, e.g., which pass in > > preallocated > > > arrays to into ufuncs for a new dtype. By design, new dtypes > > should > > > not be able to customize the semantics of array *structure*. > > > > > > (3) could potentially motivate a new solution, but it should > > exist > > > *inside* of select existing NumPy implementations, after checking > > for > > > overrides with __array_function__. If the only option NumPy > > provides > > > for overriding np.fft is to implement np.overrideable.fft, I > > doubt > > > that would suffice to convince MKL developers from monkey > > patching it > > > -- they already decided that a separate namespace is not good > > enough > > > for them. > > > > > > I also share Nathaniel's concern that the overrides in unumpy are > > too > > > powerful, by allowing for control from arbitrary function > > arguments > > > and even *non-local* control (i.e., global variables) from > > context > > > managers. This level of flexibility can make code very hard to > > debug, > > > especially in larger codebases. > > > > > > Best, > > > Stephan > > > > > > > > > > > > > > > _______________________________________________ > > > NumPy-Discussion mailing list > > > NumPy-Discussion at python.org > > > https://mail.python.org/mailman/listinfo/numpy-discussion > > _______________________________________________ > > NumPy-Discussion mailing list > > NumPy-Discussion at python.org > > https://mail.python.org/mailman/listinfo/numpy-discussion > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 833 bytes Desc: This is a digitally signed message part URL: From einstein.edison at gmail.com Tue Sep 10 06:37:30 2019 From: einstein.edison at gmail.com (Hameer Abbasi) Date: Tue, 10 Sep 2019 12:37:30 +0200 Subject: [Numpy-discussion] =?utf-8?q?NEP_31_=E2=80=94_Context-local_and_?= =?utf-8?q?global_overrides_of_the_NumPy_API?= In-Reply-To: References: Message-ID: <56c4a7ef-2d91-5cd9-cde1-382ade6b2d0e@gmail.com> On 08.09.19 10:56, Nathaniel Smith wrote: > On Sun, Sep 8, 2019 at 1:04 AM Hameer Abbasi wrote: >> On 08.09.19 09:53, Nathaniel Smith wrote: >>> OTOH, __array_function__ doesn't allow this kind of simplification: if >>> we were using __array_function__ for ufuncs, every library would have >>> to special-case every individual ufunc, which leads to dramatically >>> more work and more potential for bugs. >> But uarray does allow this kind of simplification. 
You would do the following inside a uarray backend: >> >> def __ua_function__(func, args, kwargs): >> with ua.skip_backend(self_backend): >> # Do code here, dispatches to everything but self_backend > You can dispatch to the underlying operation, sure, but you can't > implement a generic ufunc loop because you don't know that 'func' is > actually a bound ufunc method, or have any way to access the > underlying ufunc object. (E.g. consider the case where 'func' is > 'np.add.reduce'.) The critical part of my example was that it's a new > ufunc that none of these libraries have ever heard of before. You don't get np.add.reduce, you get np.ufunc.reduce with self=np.add. So you can access the underlying ufunc and the method, nothing limiting about that. > Ufuncs have a lot of consistent structure beyond what generic Python > callables have, and the whole point of __array_ufunc__ is that > implementors can rely on that structure. You get to work at a higher > level of abstraction. > > A similar but simpler example would be the protocol we've sketched out > for concatenation: the idea would be to capture the core similarity > between np.concatenate/np.hstack/np.vstack/np.dstack/np.column_stack/np.row_stack/any > other variants, so that implementors only have to worry about the > higher-level concept of "concatenation" rather than the raw APIs of > all those individual functions. > > -n > > -n > From einstein.edison at gmail.com Tue Sep 10 06:48:24 2019 From: einstein.edison at gmail.com (Hameer Abbasi) Date: Tue, 10 Sep 2019 12:48:24 +0200 Subject: [Numpy-discussion] =?utf-8?q?NEP_31_=E2=80=94_Context-local_and_?= =?utf-8?q?global_overrides_of_the_NumPy_API?= In-Reply-To: References: Message-ID: On 09.09.19 03:26, Nathaniel Smith wrote: > [snip] > Generic in the sense that you can write __array_ufunc__ once and have > it work for all ufuncs. You can do that too with __ua_function__, you get np.ufunc.__call__, with self=<the ufunc>. The same holds for say, RandomState objects, once implemented. > >>> Most duck array libraries can write a single >>> implementation of __array_ufunc__ that works for *all* ufuncs, even >>> new third-party ufuncs that the duck array library has never heard of, >> >> I see where you're going with this. You are thinking of reusing the ufunc implementation to do a computation. That's a minor use case (imho), and I can't remember seeing it used. > I mean, I just looked at dask and xarray, and they're both doing > exactly what I said, right now in shipping code. What use cases are > you targeting here if you consider dask and xarray out-of-scope? :-) > >> this is a case where knowing if something is a ufunc helps use a property of it. so there the more specialized nature of __array_ufunc__ helps. Seems niche though, and could probably also be done by checking if a function is an instance of np.ufunc via __array_function__ > Sparse arrays aren't very niche... and the isinstance trick is > possible in some cases, but (a) it's relying on an undocumented > implementation detail of __array_function__; according to > __array_function__'s API contract, you could just as easily get passed > the ufunc's __call__ method instead of the object itself, and (b) it > doesn't work at all for ufunc methods like reduce, outer, accumulate. > These are both show-stoppers IMO. It does work for all ufunc methods. You just get passed in the appropriate method (ufunc.reduce, ufunc.accumulate, ...), with self=<the ufunc>. > [snip]
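To make the fallback pattern sketched in this exchange concrete, here is a minimal, self-contained
illustration. The LoggingBackend class and its printed message are invented for illustration; only
__ua_domain__, __ua_function__, ua.set_backend and ua.skip_backend are taken from the uarray API
discussed in this thread, and it assumes unumpy's default NumPy backend stays registered so the
re-dispatch has somewhere to go:

    # A minimal sketch, under the assumptions stated above.
    import uarray as ua
    import unumpy as unp

    class LoggingBackend:
        __ua_domain__ = "numpy"  # the domain unumpy multimethods dispatch on

        def __ua_function__(self, func, args, kwargs):
            # func is the multimethod being called (e.g. unp.arange)
            print("intercepted:", getattr(func, "__name__", func))
            # Re-dispatch to the remaining backends, skipping this one so
            # the call does not recurse back into __ua_function__.
            with ua.skip_backend(self):
                return func(*args, **kwargs)

    with ua.set_backend(LoggingBackend()):
        unp.arange(5)  # prints "intercepted: arange", then computes normally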
From einstein.edison at gmail.com Tue Sep 10 09:05:55 2019 From: einstein.edison at gmail.com (Hameer Abbasi) Date: Tue, 10 Sep 2019 15:05:55 +0200 Subject: [Numpy-discussion] =?utf-8?q?NEP_31_=E2=80=94_Context-local_and_?= =?utf-8?q?global_overrides_of_the_NumPy_API?= In-Reply-To: References: Message-ID: <93a3cb0b-2669-23da-e273-091128948cf6@gmail.com> On 10.09.19 05:32, Stephan Hoyer wrote: > On Mon, Sep 9, 2019 at 6:27 PM Ralf Gommers > wrote: > > I think we've chosen to try the former - dispatch on functions so > we can reuse the NumPy API. It could work out well, it could give > some long-term maintenance issues, time will tell. The question is > now if and how to plug the gap that __array_function__ left. It's > main limitation is "doesn't work for functions that don't have an > array-like input" - that left out ~10-20% of functions. So now we > have a proposal for a structural solution to that last 10-20%. It > seems logical to want that gap plugged, rather than go back and > say "we shouldn't have gone for the first 80%, so let's go no > further". > > > I'm excited about solving the remaining 10-20% of use cases for > flexible array dispatching, but the unumpy interface suggested here > (numpy.overridable) feels like a redundant redo of __array_function__ > and __array_ufunc__. > > I would much rather continue to develop specialized protocols for the > remaining usecases. Summarizing those I've seen in this thread, these > include: > 1. Overrides for customizing array creation and coercion. > 2. Overrides to implement operations for new dtypes. > 3. Overriding implementations of NumPy functions, e.g., FFT and ufuncs > with MKL. > > (1) could mostly be solved by adding np.duckarray() and another > function for duck array coercion. There is still the matter of > overriding np.zeros and the like, which perhaps justifies another new > protocol, but in my experience the use-cases for truly an array from > scratch are quite rare. While they're rare for libraries like XArray; CuPy, Dask and PyData/Sparse need these. > > (2) should be tackled as part of overhauling NumPy's dtype system to > better support user defined dtypes. But it should definitely be in the > form of specialized protocols, e.g., which pass in preallocated arrays > to into ufuncs for a new dtype. By design, new dtypes should not be > able to customize the semantics of array *structure*. We already have a split in the type system with e.g. Cython's buffers, Numba's parallel type system. This is a different issue altogether, e.g. allowing a unyt dtype to spawn a unyt array, rather than forcing a re-write of unyt to cooperate with NumPy's new dtype system. > > (3) could potentially motivate a new solution, but it should exist > *inside* of select existing NumPy implementations, after checking for > overrides with __array_function__. If the only option NumPy provides > for overriding np.fft is to implement np.overrideable.fft, I doubt > that would suffice to convince MKL developers from monkey patching it > -- they already decided that a separate namespace is not good enough > for them. That has already been addressed by Ralf in another email. We're proposing to merge that into NumPy proper. Also, you're missing a few: 4. Having default implementations that allow overrides of a large part of the API while defining only a small part. This holds for e.g. transpose/concatenate (see the sketch just after this list). 5. Generation of Random numbers (overriding RandomState). CuPy has its own implementation which would be nice to override. 
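As a sketch of point 4: with a small set of primitives, a default implementation can cover a larger
function. The helper name default_vstack below is made up for illustration; atleast_2d and
concatenate are the only operations an array type would need to override via __array_function__ to
get vstack behaviour for free:

    import numpy as np

    def default_vstack(arrays):
        # Coerce everything to at least 2-D, then stack along the first axis.
        arrays = [np.atleast_2d(a) for a in arrays]
        return np.concatenate(arrays, axis=0)

    default_vstack([np.arange(3), np.arange(3)])  # -> array of shape (2, 3)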
> > I also share Nathaniel's concern that the overrides in unumpy are too > powerful, by allowing for control from arbitrary function arguments > and even *non-local* control (i.e., global variables) from context > managers. This level of flexibility can make code very hard to debug, > especially in larger codebases. Backend switching needs global context, in any case. There isn't a good way around that other than the class dundermethods outlined in another thread, which would require rewrites of large amounts of code. > > Best, > Stephan > > > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion -------------- next part -------------- An HTML attachment was scrubbed... URL: From einstein.edison at gmail.com Tue Sep 10 11:28:34 2019 From: einstein.edison at gmail.com (Hameer Abbasi) Date: Tue, 10 Sep 2019 17:28:34 +0200 Subject: [Numpy-discussion] =?utf-8?q?NEP_31_=E2=80=94_Context-local_and_?= =?utf-8?q?global_overrides_of_the_NumPy_API?= In-Reply-To: References: <67b837e7-5d3d-337e-49f7-cac078ec4d8f@gmail.com> Message-ID: <768bf864-3df9-e4d2-a430-06316f374094@gmail.com> On 07.09.19 22:06, Sebastian Berg wrote: > On Fri, 2019-09-06 at 14:45 -0700, Ralf Gommers wrote: > > > > Let me try to move the discussion from the github issue here (this may > not be the best place). (https://github.com/numpy/numpy/issues/14441 > which asked for easier creation functions together with `__array_function__`). > > I think an important note mentioned here is how users interact with > unumpy, vs. __array_function__. The former is an explicit opt-in, while > the latter is implicit choice based on an `array-like` abstract base > class and functional type based dispatching. > > To quote NEP 18 on this: "The downsides are that this would require an > explicit opt-in from all existing code, e.g., import numpy.api as np, > and in the long term would result in the maintenance of two separate > NumPy APIs. Also, many functions from numpy itself are already > overloaded (but inadequately), so confusion about high vs. low level > APIs in NumPy would still persist." > (I do think this is a point we should not just ignore, `uarray` is a > thin layer, but it has a big surface area) > > Now there are things where explicit opt-in is obvious. And the FFT > example is one of those, there is no way to implicitly choose another > backend (except by just replacing it, i.e. monkeypatching) [1]. And > right now I think these are _very_ different. > > > Now for the end-users choosing one array-like over another, seems nicer > as an implicit mechanism (why should I not mix sparse, dask and numpy > arrays!?). This is the promise `__array_function__` tries to make. > Unless convinced otherwise, my guess is that most library authors would > strive for implicit support (i.e. sklearn, skimage, scipy). You can, once you register the backend it becomes implicit, so all backends are tried until one succeeds. Unless you explicitly say "I do not want another backend" (only/coerce=True). > > Circling back to creation and coercion. In a purely Object type system, > these would be classmethods, I guess, but in NumPy and the libraries > above, we are lost. > > Solution 1: Create explicit opt-in, e.g. through uarray. (NEP-31) > * Required end-user opt-in. > * Seems cleaner in many ways > * Requires a full copy of the API. 
> > Solution 2: Add some coercion "protocol" (NEP-30) and expose a way to > create new arrays more conveniently. This would practically mean adding > an `array_type=np.ndarray` argument. > * _Not_ used by end-users! End users should use dask.linspace! > * Adds "strange" API somewhere in numpy, and possible a new > "protocol" (additionally to coercion).[2] > > I still feel these solve different issues. The second one is intended > to make array likes work implicitly in libraries (without end users > having to do anything). While the first seems to force the end user to > opt in, sometimes unnecessarily: > > def my_library_func(array_like): > exp = np.exp(array_like) > idx = np.arange(len(exp)) > return idx, exp > > Would have all the information for implicit opt-in/Array-like support, > but cannot do it right now. This is what I have been wondering, if > uarray/unumpy, can in some way help me make this work (even _without_ > the end user opting in). The reason is that simply, right now I am very > clear on the need for this use case, but not sure about the need for > end user opt in, since end users can just use dask.arange(). Sure, the end user can, but library authors cannot. And end users may > want to easily port code to GPU or between back-ends, just as library > authors might. > > Cheers, > > Sebastian > > > [1] To be honest, I do think a lot of the "issues" around > monkeypatching exists just as much with backend choosing, the main > difference seems to me that a lot of that: > 1. monkeypatching was not done explicit > (import mkl_fft; mkl_fft.monkeypatch_numpy())? > 2. A backend system allows libraries to prefer one locally? > (which I think is a big advantage) > > [2] There are the options of adding `linspace_like` functions somewhere > in a numpy submodule, or adding `linspace(..., array_type=np.ndarray)`, > or simply inventing a new "protocol" (which is not really a protocol?), > and make it `ndarray.__numpy_like_creation_functions__.arange()`. Handling things like RandomState can get complicated here. From sebastian at sipsolutions.net Tue Sep 10 13:51:04 2019 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Tue, 10 Sep 2019 10:51:04 -0700 Subject: [Numpy-discussion] =?utf-8?q?NEP_31_=E2=80=94_Context-local_and_?= =?utf-8?q?global_overrides_of_the_NumPy_API?= In-Reply-To: <768bf864-3df9-e4d2-a430-06316f374094@gmail.com> References: <67b837e7-5d3d-337e-49f7-cac078ec4d8f@gmail.com> <768bf864-3df9-e4d2-a430-06316f374094@gmail.com> Message-ID: On Tue, 2019-09-10 at 17:28 +0200, Hameer Abbasi wrote: > On 07.09.19 22:06, Sebastian Berg wrote: > > On Fri, 2019-09-06 at 14:45 -0700, Ralf Gommers wrote: > > > > > > > > Let me try to move the discussion from the github issue here (this > > may > > not be the best place). ( > > https://github.com/numpy/numpy/issues/14441 > > which asked for easier creation functions together with > > `__array_function__`). > > > > I think an important note mentioned here is how users interact with > > unumpy, vs. __array_function__. The former is an explicit opt-in, > > while > > the latter is implicit choice based on an `array-like` abstract > > base > > class and functional type based dispatching. > > > > To quote NEP 18 on this: "The downsides are that this would require > > an > > explicit opt-in from all existing code, e.g., import numpy.api as > > np, > > and in the long term would result in the maintenance of two > > separate > > NumPy APIs. 
Also, many functions from numpy itself are already > > overloaded (but inadequately), so confusion about high vs. low > > level > > APIs in NumPy would still persist." > > (I do think this is a point we should not just ignore, `uarray` is > > a > > thin layer, but it has a big surface area) > > > > Now there are things where explicit opt-in is obvious. And the FFT > > example is one of those, there is no way to implicitly choose > > another > > backend (except by just replacing it, i.e. monkeypatching) [1]. And > > right now I think these are _very_ different. > > > > > > Now for the end-users choosing one array-like over another, seems > > nicer > > as an implicit mechanism (why should I not mix sparse, dask and > > numpy > > arrays!?). This is the promise `__array_function__` tries to make. > > Unless convinced otherwise, my guess is that most library authors > > would > > strive for implicit support (i.e. sklearn, skimage, scipy). > You can, once you register the backend it becomes implicit, so all > backends are tried until one succeeds. Unless you explicitly say "I > do > not want another backend" (only/coerce=True). The thing here being "once you register the backend". Thus requiring at least in some form an explicit opt-in by the end user. Also, unless you use the with statement (with all the scoping rules attached), you cannot plug the coercion/creation hole left by `__array_function__`. > > Circling back to creation and coercion. In a purely Object type > > system, > > these would be classmethods, I guess, but in NumPy and the > > def my_library_func(array_like): > > exp = np.exp(array_like) > > idx = np.arange(len(exp)) > > return idx, exp > > > > Would have all the information for implicit opt-in/Array-like > > support, > > but cannot do it right now. This is what I have been wondering, if > > uarray/unumpy, can in some way help me make this work (even > > _without_ > > the end user opting in). The reason is that simply, right now I am > > very > > clear on the need for this use case, but not sure about the need > > for > > end user opt in, since end users can just use dask.arange(). > > Sure, the end user can, but library authors cannot. And end users > may > want to easily port code to GPU or between back-ends, just as > library > authors might. Yes, but library authors want to solve the particular thing above right now, and I am still not sure how uarray helps there. If it does, then only with added complexity _and_ (at least currently) explicit end-user opt-in. Now, I am not a particularly good judge for these things, but I have been trying to figure out how things can improve with it and still I am tempted to say that uarray is a giant step in no particular direction at all. Of course it _can_ solve everything, but right now it seems like it might require a py2 -> py3 like transition. And even then it is so powerful, that it probably comes with its own bunch of issues (such as far away side effects due to scoping of with statements). Best, Sebastian > > Cheers, > > > > Sebastian > > > > > > [1] To be honest, I do think a lot of the "issues" around > > monkeypatching exists just as much with backend choosing, the main > > difference seems to me that a lot of that: > > 1. monkeypatching was not done explicit > > (import mkl_fft; mkl_fft.monkeypatch_numpy())? > > 2. A backend system allows libraries to prefer one locally? 
> > (which I think is a big advantage) > > > > [2] There are the options of adding `linspace_like` functions > > somewhere > > in a numpy submodule, or adding `linspace(..., > > array_type=np.ndarray)`, > > or simply inventing a new "protocol" (which is not really a > > protocol?), > > and make it `ndarray.__numpy_like_creation_functions__.arange()`. > > Handling things like RandomState can get complicated here. > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 833 bytes Desc: This is a digitally signed message part URL: From shoyer at gmail.com Tue Sep 10 13:58:39 2019 From: shoyer at gmail.com (Stephan Hoyer) Date: Tue, 10 Sep 2019 10:58:39 -0700 Subject: [Numpy-discussion] =?utf-8?q?NEP_31_=E2=80=94_Context-local_and_?= =?utf-8?q?global_overrides_of_the_NumPy_API?= In-Reply-To: <93a3cb0b-2669-23da-e273-091128948cf6@gmail.com> References: <93a3cb0b-2669-23da-e273-091128948cf6@gmail.com> Message-ID: On Tue, Sep 10, 2019 at 6:06 AM Hameer Abbasi wrote: > On 10.09.19 05:32, Stephan Hoyer wrote: > > On Mon, Sep 9, 2019 at 6:27 PM Ralf Gommers > wrote: > >> I think we've chosen to try the former - dispatch on functions so we can >> reuse the NumPy API. It could work out well, it could give some long-term >> maintenance issues, time will tell. The question is >> now if and how to plug the gap that __array_function__ left. It's main limitation is "doesn't work >> for functions that don't have an array-like input" - that left out ~10-20% >> of functions. So now we have a proposal for a structural solution to that >> last 10-20%. It seems logical to want that gap plugged, rather than go back >> and say "we shouldn't have gone for the first 80%, so let's go no further". >> > > I'm excited about solving the remaining 10-20% of use cases for flexible > array dispatching, but the unumpy interface suggested here > (numpy.overridable) feels like a redundant redo of __array_function__ and > __array_ufunc__. > > I would much rather continue to develop specialized protocols for the > remaining usecases. Summarizing those I've seen in this thread, these > include: > 1. Overrides for customizing array creation and coercion. > 2. Overrides to implement operations for new dtypes. > 3. Overriding implementations of NumPy functions, e.g., FFT and ufuncs > with MKL. > > (1) could mostly be solved by adding np.duckarray() and another > function for duck array coercion. There is still the matter of > overriding np.zeros and the like, which perhaps justifies another new > protocol, but in my experience the use-cases for truly an array from > scratch are quite rare. > > While they're rare for libraries like XArray; CuPy, Dask and PyData/Sparse > need these. > > > (2) should be tackled as part of overhauling NumPy's dtype system to > better support user defined dtypes. But it should definitely be in the form > of specialized protocols, e.g., which pass in preallocated arrays to into > ufuncs for a new dtype. By design, new dtypes should not be able to > customize the semantics of array *structure*. > > We already have a split in the type system with e.g. Cython's buffers, > Numba's parallel type system. This is a different issue altogether, e.g. 
> allowing a unyt dtype to spawn a unyt array, rather than forcing a re-write > of unyt to cooperate with NumPy's new dtype system. > I guess you're proposing that operations like np.sum(numpy_array, dtype=other_dtype) could rely on other_dtype for the implementation and potentially return a non-NumPy array? I'm not sure this is well motivated -- it would be helpful to discuss actual use-cases. The most commonly used NumPy functionality related to dtypes can be found only in methods on np.ndarray, e.g., astype() and view(). But I don't think there's any proposal to change that. > 4. Having default implementations that allow overrides of a large part of > the API while defining only a small part. This holds for e.g. > transpose/concatenate. > I'm not sure how unumpy solves the problems we encountered when trying to do this with __array_function__ -- namely the way that it exposes all of NumPy's internals, or requires rewriting a lot of internal NumPy code to ensure it always casts inputs with asarray(). I think it would be useful to expose default implementations of NumPy operations somewhere to make it easier to implement __array_function__, but it doesn't make much sense to couple this to user facing overrides. These can be exposed as a separate package or numpy module (e.g., numpy.default_implementations) that uses np.duckarray(), which library authors can make use of by calling inside their __array_function__ methods. > 5. Generation of Random numbers (overriding RandomState). CuPy has its > own implementation which would be nice to override. > I'm not sure that NumPy's random state objects make sense for duck arrays. Because these are stateful objects, they are pretty coupled to NumPy's implementation -- you cannot store any additional state on RandomState objects that might be needed for a new implementation. At a bare minimum, you will lose the reproducibility of random seeds, though this may be less of a concern with the new random API. > I also share Nathaniel's concern that the overrides in unumpy are too > powerful, by allowing for control from arbitrary function arguments and > even *non-local* control (i.e., global variables) from context managers. > This level of flexibility can make code very hard to debug, especially in > larger codebases. > > Backend switching needs global context, in any case. There isn't a good > way around that other than the class dundermethods outlined in another > thread, which would require rewrites of large amounts of code. > Do we really need to support robust backend switching in NumPy? I'm not strongly opposed, but what use cases does it actually solve to be able to override np.fft.fft rather than using a new function? At some point, if you want maximum performance you won't be writing the code using NumPy proper anyways. At best you'll be using a system with duck-array support like CuPy. -------------- next part -------------- An HTML attachment was scrubbed... URL: From sebastian at sipsolutions.net Tue Sep 10 15:11:16 2019 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Tue, 10 Sep 2019 12:11:16 -0700 Subject: [Numpy-discussion] NumPy Community Meeting Wednesday, Sep. 11 Message-ID: <889157a2789d64e87d514675fa64e3f72b7b05be.camel@sipsolutions.net> Hi all, There will be a NumPy Community meeting Wednesday September 11 at 11 am Pacific Time. 
Everyone is invited to join in and edit the work-in-progress meeting topics and notes: https://hackmd.io/76o-IxCjQX2mOXO_wwkcpg?both Best wishes Sebastian -------------- next part -------------- A non-text attachment was scrubbed... Name: NumPy_Community_Call.ics Type: text/calendar Size: 3264 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 833 bytes Desc: This is a digitally signed message part URL: From warren.weckesser at gmail.com Wed Sep 11 10:29:39 2019 From: warren.weckesser at gmail.com (Warren Weckesser) Date: Wed, 11 Sep 2019 10:29:39 -0400 Subject: [Numpy-discussion] NEP 32: Remove the financial functions from NumPy In-Reply-To: <1568041679303-0.post@n7.nabble.com> References: <9067a8f06bc885307d1ec726a55bc5fd906c3c62.camel@sipsolutions.net> <1568041679303-0.post@n7.nabble.com> Message-ID: On 9/9/19, D.S. McNeil wrote: > [coming over from the pydata post] > > I just checked about ~150KLOC of our Python code in a financial context, > written by about twenty developers over about four years. Almost every > function uses numpy, sometimes directly and sometimes via pandas. > > It seems like these functions were never used anywhere, and the lead dev on > one of the projects responded "never used them; didn't even know they > exist". I knew they existed, but even on the rare occasion I need the > functionality I need better control over the dates, which means for > practical purposes I need something which supports Series natively anyhow. > > As it is, they also clutter up the namespace in unfriendly ways: if there's > going to be a top-level function called np.rate I don't think this is the > one it should be. Admittedly that's more an argument against their current > location. > > Although it wouldn't be useful for us, I could imagine someone finding a > package which provides numpy-compatible versions of the many OpenFormula > (or > whatever the spec is called) functions helpful. Having numpy carry a tiny > subset of them doesn't feel productive. > > +1 for removing them. > > > Doug Thanks Doug, that's useful feedback. Warren > > > > -- > Sent from: http://numpy-discussion.10968.n7.nabble.com/ > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > From warren.weckesser at gmail.com Wed Sep 11 10:30:55 2019 From: warren.weckesser at gmail.com (Warren Weckesser) Date: Wed, 11 Sep 2019 10:30:55 -0400 Subject: [Numpy-discussion] NEP 32: Remove the financial functions from NumPy In-Reply-To: References: Message-ID: On 9/3/19, Warren Weckesser wrote: > Github issue 2880 ("Get financial functions out of main namespace", > https://github.com/numpy/numpy/issues/2880) has been open since 2013. In a > recent community meeting, it was suggested that we create a NEP to propose > the removal of the financial functions from NumPy. I have submitted "NEP > 32: Remove the financial functions from NumPy" in a pull request at > https://github.com/numpy/numpy/pull/14399. A copy of the latest version of > the NEP is below. FYI, the NEP is now also available at https://numpy.org/neps/nep-0032-remove-financial-functions.html. Warren > > According to the NEP process document, "Once the PR is in place, the NEP > should be announced on the mailing list for discussion (comments on the PR > itself should be restricted to minor editorial and technical fixes)." 
This > email is the announcement for NEP 32. > > The NEP includes a brief summary of the history of the financial functions, > and has links to several relevant mailing list threads, dating back to when > the functions were added to NumPy in 2008. I recommend reviewing those > threads before commenting here. > > Warren > > ----- > > ================================================== > NEP 32 — Remove the financial functions from NumPy > ================================================== > > :Author: Warren Weckesser > :Status: Draft > :Type: Standards Track > :Created: 2019-08-30 > > > Abstract > -------- > > We propose deprecating and ultimately removing the financial functions [1]_ > from NumPy. The functions will be moved to an independent repository, > and provided to the community as a separate package with the name > ``numpy_financial``. > > > Motivation and scope > -------------------- > > The NumPy financial functions [1]_ are the 10 functions ``fv``, ``ipmt``, > ``irr``, ``mirr``, ``nper``, ``npv``, ``pmt``, ``ppmt``, ``pv`` and > ``rate``. > The functions provide elementary financial calculations such as future > value, > net present value, etc. These functions were added to NumPy in 2008 [2]_. > > In May, 2009, a request by Joe Harrington to add a function called ``xirr`` > to > the financial functions triggered a long thread about these functions [3]_. > One important point that came up in that thread is that a "real" financial > library must be able to handle real dates. The NumPy financial functions > do > not work with actual dates or calendars. The preference for a more capable > library independent of NumPy was expressed several times in that thread. > > In June, 2009, D. L. Goldsmith expressed concerns about the correctness of > the > implementations of some of the financial functions [4]_. It was suggested > then > to move the financial functions out of NumPy to an independent package. > > In a GitHub issue in 2013 [5]_, Nathaniel Smith suggested moving the > financial > functions from the top-level namespace to ``numpy.financial``. He also > suggested giving the functions better names. Responses at that time > included > the suggestion to deprecate them and move them from NumPy to a separate > package. This issue is still open. > > Later in 2013 [6]_, it was suggested on the mailing list that these > functions > be removed from NumPy. > > The arguments for the removal of these functions from NumPy: > > * They are too specialized for NumPy. > * They are not actually useful for "real world" financial calculations, > because > they do not handle real dates and calendars. > * The definition of "correctness" for some of these functions seems to be a > matter of convention, and the current NumPy developers do not have the > background to judge their correctness. > * There has been little interest among past and present NumPy developers > in maintaining these functions. > > The main arguments for keeping the functions in NumPy are: > > * Removing these functions will be disruptive for some users. Current > users > will have to add the new ``numpy_financial`` package to their > dependencies, > and then modify their code to use the new package. > * The functions provided, while not "industrial strength", are apparently > similar to functions provided by spreadsheets and some calculators. > Having > them available in NumPy makes it easier for some developers to migrate > their > software to Python and NumPy. 
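A minimal sketch of the kind of spreadsheet-style calculation at issue (the numbers are
illustrative, and the numpy_financial call shows the replacement proposed in this NEP):

    # Future value of saving 100/month for 10 years at 5%/year,
    # compounded monthly, with np.fv as it exists today.
    import numpy as np

    fv = np.fv(rate=0.05 / 12, nper=10 * 12, pmt=-100, pv=0)
    print(round(fv, 2))  # approximately 15528.23

    # After the proposed deprecation, the equivalent call would be:
    #   import numpy_financial as npf
    #   npf.fv(0.05 / 12, 10 * 12, -100, 0)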
> > It is clear from comments in the mailing list discussions and in the GitHub > issues that many current NumPy developers believe the benefits of removing > the functions outweigh the costs. For example, from [5]_:: > > The financial functions should probably be part of a separate package > -- Charles Harris > > If there's a better package we can point people to we could just > deprecate > them and then remove them entirely... I'd be fine with that too... > -- Nathaniel Smith > > +1 to deprecate them. If no other package exists, it can be created if > someone feels the need for that. > -- Ralf Gommers > > I feel pretty strongly that we should deprecate these. If nobody on > numpy's > core team is interested in maintaining them, then it is purely a drag > on > development for NumPy. > -- Stephan Hoyer > > And from the 2013 mailing list discussion, about removing the functions > from > NumPy:: > > I am +1 as well, I don't think they should have been included in the > first > place. > -- David Cournapeau > > But not everyone was in favor of removal:: > > The fin routines are tiny and don't require much maintenance once > written. If we made an effort (putting up pages with examples of > common > financial calculations and collecting those under a topical web page, > then linking to that page from various places and talking it up), I > would think they could attract users looking for a free way to play > with > financial scenarios. [...] > So, I would say we keep them. If ours are not the best, we should > bring > them up to snuff. > -- Joe Harrington > > For an idea of the maintenance burden of the financial functions, one can > look for all the GitHub issues [7]_ and pull requests [8]_ that have the > tag > ``component: numpy.lib.financial``. > > One method for measuring the effect of removing these functions is to find > all the packages on GitHub that use them. Such a search can be performed > with the ``python-api-inspect`` service [9]_. A search for all uses of the > NumPy financial functions finds just eight repositories. (See the comments > in [5]_ for the actual SQL query.) > > > Implementation > -------------- > > * Create a new Python package, ``numpy_financial``, to be maintained in the > top-level NumPy github organization. This repository will contain the > definitions and unit tests for the financial functions. The package will > be added to PyPI so it can be installed with ``pip``. > * Deprecate the financial functions in the ``numpy`` namespace, beginning > in > NumPy version 1.18. Remove the financial functions from NumPy version > 1.20. > > > Backward compatibility > ---------------------- > > The removal of these functions breaks backward compatibility, as explained > earlier. The effects are mitigated by providing the ``numpy_financial`` > library. > > > Alternatives > ------------ > > The following alternatives were mentioned in [5]_: > > * *Maintain the functions as they are (i.e. do nothing).* > A review of the history makes clear that this is not the preference of > many > NumPy developers. A recurring comment is that the functions simply do > not > belong in NumPy. When that sentiment is combined with the history of bug > reports and the ongoing questions about the correctness of the functions, > the > conclusion is that the cleanest solution is deprecation and removal. > * *Move the functions from the ``numpy`` namespace to ``numpy.financial``.* > This was the initial suggestion in [5]_. 
Such a change does not address > the > maintenance issues, and doesn't change the misfit that many developers > see > between these functions and NumPy. It causes disruption for the current > users of these functions without addressing what many developers see as > the > fundamental problem. > > > Discussion > ---------- > > Links to past mailing list discussions, and to relevant GitHub issues and > pull > requests, have already been given. > > > References and footnotes > ------------------------ > > .. [1] Financial functions, > https://numpy.org/doc/1.17/reference/routines.financial.html > > .. [2] Numpy-discussion mailing list, "Simple financial functions for > NumPy", > > https://mail.python.org/pipermail/numpy-discussion/2008-April/032353.html > > .. [3] Numpy-discussion mailing list, "add xirr to numpy financial > functions?", > https://mail.python.org/pipermail/numpy-discussion/2009-May/042645.html > > .. [4] Numpy-discussion mailing list, "Definitions of pv, fv, nper, pmt, > and rate", > https://mail.python.org/pipermail/numpy-discussion/2009-June/043188.html > > .. [5] Get financial functions out of main namespace, > https://github.com/numpy/numpy/issues/2880 > > .. [6] Numpy-discussion mailing list, "Deprecation of financial routines", > > https://mail.python.org/pipermail/numpy-discussion/2013-August/067409.html > > .. [7] ``component: numpy.lib.financial`` issues, > > https://github.com/numpy/numpy/issues?utf8=%E2%9C%93&q=is%3Aissue+label%3A%22component%3A+numpy.lib.financial%22+ > > .. [8] ``component: numpy.lib.financial`` pull requests, > > https://github.com/numpy/numpy/pulls?utf8=%E2%9C%93&q=is%3Apr+label%3A%22component%3A+numpy.lib.financial%22+ > > .. [9] Quansight-Labs/python-api-inspect, > https://github.com/Quansight-Labs/python-api-inspect/ > > > Copyright > --------- > > This document has been placed in the public domain. > From tyler.je.reddy at gmail.com Wed Sep 11 15:43:54 2019 From: tyler.je.reddy at gmail.com (Tyler Reddy) Date: Wed, 11 Sep 2019 13:43:54 -0600 Subject: [Numpy-discussion] Using hypothesis in testing In-Reply-To: References: Message-ID: I think the pros outweigh the cons -- I'll comment briefly on the PR. On Mon, 9 Sep 2019 at 02:41, Matti Picus wrote: > We have discussed using the hypothesis package to generate test cases at a > few meetings informally. At the EuroSciPy sprint, kitchoi took up the > challenge and issued a pull request > https://github.com/numpy/numpy/pull/14440 that actually goes ahead and > does it. While not finding any new failures, the round-trip testing of s = > np.array2string(np.array(s)) shows what hypothesis can do. The new test > runs for about 1/2 a second. In my mind the next step would be to use this > style of testing to expose problems in the np.chararray routines. > > > What do you think? Is the cost of adding a new dependency worth the more > thorough testing? > > Matti > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: 
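For readers unfamiliar with this style of testing, here is a minimal, self-contained sketch of a
hypothesis property test. It illustrates the round-trip pattern with a simpler property than the
array2string test in PR 14440:

    from hypothesis import given, strategies as st
    import numpy as np

    @given(st.lists(st.integers(min_value=-2**31, max_value=2**31 - 1)))
    def test_tolist_roundtrip(xs):
        # Converting to an array and back should preserve the values.
        assert np.array(xs, dtype=np.int64).tolist() == xs

    test_tolist_roundtrip()  # hypothesis generates and checks many cases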
From ralf.gommers at gmail.com Wed Sep 11 18:53:11 2019 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Wed, 11 Sep 2019 15:53:11 -0700 Subject: [Numpy-discussion] =?utf-8?q?NEP_31_=E2=80=94_Context-local_and_?= =?utf-8?q?global_overrides_of_the_NumPy_API?= In-Reply-To: References: <67b837e7-5d3d-337e-49f7-cac078ec4d8f@gmail.com> <768bf864-3df9-e4d2-a430-06316f374094@gmail.com> Message-ID: On Tue, Sep 10, 2019 at 10:53 AM Sebastian Berg wrote: > On Tue, 2019-09-10 at 17:28 +0200, Hameer Abbasi wrote: > > On 07.09.19 22:06, Sebastian Berg wrote: > > > > > > Now for the end-users choosing one array-like over another, seems > > > nicer > > > as an implicit mechanism (why should I not mix sparse, dask and > > > numpy > > > arrays!?). This is the promise `__array_function__` tries to make. > > > Unless convinced otherwise, my guess is that most library authors > > > would > > > strive for implicit support (i.e. sklearn, skimage, scipy). > > You can, once you register the backend it becomes implicit, so all > > backends are tried until one succeeds. Unless you explicitly say "I > > do > > not want another backend" (only/coerce=True). > > The thing here being "once you register the backend". Thus requiring at > least in some form an explicit opt-in by the end user. Also, unless you > use the with statement (with all the scoping rules attached), you > cannot plug the coercion/creation hole left by `__array_function__`. > The need for this is clear I think. We're discussing on the unumpy repo whether this can be done with a minor change to how unumpy works, or by having backends auto-register somehow on import. It should be possible without mandating that the end user has to explicitly do something, but needs some thought. Stay tuned. Cheers, Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Wed Sep 11 19:17:32 2019 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Wed, 11 Sep 2019 16:17:32 -0700 Subject: [Numpy-discussion] =?utf-8?q?NEP_31_=E2=80=94_Context-local_and_?= =?utf-8?q?global_overrides_of_the_NumPy_API?= In-Reply-To: References: <93a3cb0b-2669-23da-e273-091128948cf6@gmail.com> Message-ID: On Tue, Sep 10, 2019 at 10:59 AM Stephan Hoyer wrote: > On Tue, Sep 10, 2019 at 6:06 AM Hameer Abbasi > wrote: > >> On 10.09.19 05:32, Stephan Hoyer wrote: >> >> On Mon, Sep 9, 2019 at 6:27 PM Ralf Gommers >> wrote: >> >>> I think we've chosen to try the former - dispatch on functions so we can >>> reuse the NumPy API. It could work out well, it could give some long-term >>> maintenance issues, time will tell. The question is now if and how to plug >>> the gap that __array_function__ left. It's main limitation is "doesn't work >>> for functions that don't have an array-like input" - that left out ~10-20% >>> of functions. So now we have a proposal for a structural solution to that >>> last 10-20%. It seems logical to want that gap plugged, rather than go back >>> and say "we shouldn't have gone for the first 80%, so let's go no further". >>> >> >> I'm excited about solving the remaining 10-20% of use cases for flexible >> array dispatching, >> >> Great! I think most (but not all) of us are on the same page here. > Actually now that Peter came up with the `like=` keyword idea for array > creation functions I'm very interested in seeing that worked out, feels > like that could be a nice solution for part of that 10-20% that did look > pretty bad before. 
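A rough sketch of how such a `like=` dispatch could work, using only the `__array_function__`
protocol from NEP 18 (the helper name `creation_dispatch` is invented for illustration; it was not
an existing NumPy API at the time of this thread):

    import numpy as np

    def creation_dispatch(numpy_func, *args, like=None, **kwargs):
        if like is not None and not isinstance(like, np.ndarray):
            # Defer to the reference object's __array_function__
            # (signature per NEP 18: func, types, args, kwargs).
            return like.__array_function__(numpy_func, (type(like),), args, kwargs)
        return numpy_func(*args, **kwargs)

    # creation_dispatch(np.arange, 100, like=some_dask_array) would return
    # a dask array (some_dask_array is a stand-in, not defined here);
    # with a plain ndarray, or no like=, it falls through to NumPy.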
> but the unumpy interface suggested here (numpy.overridable) feels like a >> redundant redo of __array_function__ and __array_ufunc__. >> >> A bit of context: a big part of the reason I advocated for numpy.overridable is that library authors can use it *only* for the parts not already covered by the protocols we already have. If there's overlap there's several ways to deal with that, including only including part of the unumpy API surface. It does plug all the holes in one go (although you can then indeed argue it does too much), and there is no other coherent proposal/vision yet that does this. What you wrote below comes closest, and I'd love to see that worked out (e.g. the like= argument for array creation). What I don't like is an ad-hoc plugging of one hole at a time without visibility on how many more protocols and new workaround functions in the API we would need. So hopefully we can come to an apples-to-apples comparison of two design alternatives. Also, we just discussed this whole thread in the community call, and it's clear that it's a complex matter with many different angles. It's very hard to get a full overview. Our conclusion in the call was that this will benefit from an in-person discussion. The sprint in November may be a really good opportunity for that. In the meantime we can of course keep working out ideas/docs. For now I think it's clear that we (the NEP authors) have some homework to do - that may take some time. >> I would much rather continue to develop specialized protocols for the >> remaining usecases. Summarizing those I've seen in this thread, these >> include: >> 1. Overrides for customizing array creation and coercion. >> 2. Overrides to implement operations for new dtypes. >> 3. Overriding implementations of NumPy functions, e.g., FFT and ufuncs >> with MKL. >> >> (1) could mostly be solved by adding np.duckarray() and another function >> for duck array coercion. There is still the matter of overriding np.zeros >> and the like, which perhaps justifies another new protocol, but in my >> experience the use-cases for truly an array from scratch are quite rare. >> >> While they're rare for libraries like XArray; CuPy, Dask and >> PyData/Sparse need these. >> >> >> (2) should be tackled as part of overhauling NumPy's dtype system to >> better support user defined dtypes. But it should definitely be in the form >> of specialized protocols, e.g., which pass in preallocated arrays to into >> ufuncs for a new dtype. By design, new dtypes should not be able to >> customize the semantics of array *structure*. >> >> We already have a split in the type system with e.g. Cython's buffers, >> Numba's parallel type system. This is a different issue altogether, e.g. >> allowing a unyt dtype to spawn a unyt array, rather than forcing a re-write >> of unyt to cooperate with NumPy's new dtype system. >> > > I guess you're proposing that operations like np.sum(numpy_array, > dtype=other_dtype) could rely on other_dtype for the implementation and > potentially return a non-NumPy array? I'm not sure this is well motivated > -- it would be helpful to discuss actual use-cases. > > The most commonly used NumPy functionality related to dtypes can be found > only in methods on np.ndarray, e.g., astype() and view(). But I don't think > there's any proposal to change that. > >> 4. Having default implementations that allow overrides of a large part of >> the API while defining only a small part. This holds for e.g. >> transpose/concatenate. 
>> > I'm not sure how unumpy solve the problems we encountered when trying to > do this with __array_function__ -- namely the way that it exposes all of > NumPy's internals, or requires rewriting a lot of internal NumPy code to > ensure it always casts inputs with asarray(). > > I think it would be useful to expose default implementations of NumPy > operations somewhere to make it easier to implement __array_function__, but > it doesn't make much sense to couple this to user facing overrides. These > can be exposed as a separate package or numpy module (e.g., > numpy.default_implementations) that uses np.duckarray(), which library > authors can make use of by calling inside their __aray_function__ methods. > >> 5. Generation of Random numbers (overriding RandomState). CuPy has its >> own implementation which would be nice to override. >> > I'm not sure that NumPy's random state objects make sense for duck arrays. > Because these are stateful objects, they are pretty coupled to NumPy's > implementation -- you cannot store any additional state on RandomState > objects that might be needed for a new implementation. At a bare minimum, > you will loss the reproducibility of random seeds, though this may be less > of a concern with the new random API. > >> I also share Nathaniel's concern that the overrides in unumpy are too >> powerful, by allowing for control from arbitrary function arguments and >> even *non-local* control (i.e., global variables) from context managers. >> This level of flexibility can make code very hard to debug, especially in >> larger codebases. >> >> Backend switching needs global context, in any case. There isn't a good >> way around that other than the class dundermethods outlined in another >> thread, which would require rewrites of large amounts of code. >> > > Do we really need to support robust backend switching in NumPy? I'm not > strongly opposed, but what use cases does it actually solve to be able to > override np.fft.fft rather than using a new function? > I don't know, but that feels like an odd question. We wanted an FFT backend system. Now applying __array_function__ to numpy.fft happened without a real discussion, but as a backend system I don't think it would have met the criteria. Something that works for CuPy, Dask and Xarray, but not for Pyfftw or mkl_fft is only half a solution. Cheers, Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From shoyer at gmail.com Wed Sep 11 22:03:20 2019 From: shoyer at gmail.com (Stephan Hoyer) Date: Wed, 11 Sep 2019 19:03:20 -0700 Subject: [Numpy-discussion] =?utf-8?q?NEP_31_=E2=80=94_Context-local_and_?= =?utf-8?q?global_overrides_of_the_NumPy_API?= In-Reply-To: References: <93a3cb0b-2669-23da-e273-091128948cf6@gmail.com> Message-ID: On Wed, Sep 11, 2019 at 4:18 PM Ralf Gommers wrote: > > > On Tue, Sep 10, 2019 at 10:59 AM Stephan Hoyer wrote: > >> On Tue, Sep 10, 2019 at 6:06 AM Hameer Abbasi >> wrote: >> >>> On 10.09.19 05:32, Stephan Hoyer wrote: >>> >>> On Mon, Sep 9, 2019 at 6:27 PM Ralf Gommers >>> wrote: >>> >>>> I think we've chosen to try the former - dispatch on functions so we >>>> can reuse the NumPy API. It could work out well, it could give some >>>> long-term maintenance issues, time will tell. The question is now if and >>>> how to plug the gap that __array_function__ left. It's main limitation is >>>> "doesn't work for functions that don't have an array-like input" - that >>>> left out ~10-20% of functions. 
So now we have a proposal for a structural >>>> solution to that last 10-20%. It seems logical to want that gap plugged, >>>> rather than go back and say "we shouldn't have gone for the first 80%, so >>>> let's go no further". >>>> >>> >>> I'm excited about solving the remaining 10-20% of use cases for flexible >>> array dispatching, >>> >>> Great! I think most (but not all) of us are on the same page here. > Actually now that Peter came up with the `like=` keyword idea for array > creation functions I'm very interested in seeing that worked out, feels > like that could be a nice solution for part of that 10-20% that did look > pretty bad before. > >> but the unumpy interface suggested here (numpy.overridable) feels like a >>> redundant redo of __array_function__ and __array_ufunc__. >>> >>> > A bit of context: a big part of the reason I advocated for > numpy.overridable is that library authors can use it *only* for the parts > not already covered by the protocols we already have. If there's overlap > there's several ways to deal with that, including only including part of > the unumpy API surface. It does plug all the holes in one go (although you > can then indeed argue it does too much), and there is no other coherent > proposal/vision yet that does this. What you wrote below comes closest, and > I'd love to see that worked out (e.g. the like= argument for array > creation). What I don't like is an ad-hoc plugging of one hole at a time > without visibility on how many more protocols and new workaround functions > in the API we would need. So hopefully we can come to an apples-to-apples > comparison of two design alternatives. > > Also, we just discussed this whole thread in the community call, and it's > clear that it's a complex matter with many different angles. It's very hard > to get a full overview. Our conclusion in the call was that this will > benefit from an in-person discussion. The sprint in November may be a > really good opportunity for that. > Sounds good, I'm looking forward to the discussion at the November sprint! > In the meantime we can of course keep working out ideas/docs. For now I > think it's clear that we (the NEP authors) have some homework to do - that > may take some time. > > >>> I would much rather continue to develop specialized protocols for the >>> remaining usecases. Summarizing those I've seen in this thread, these >>> include: >>> 1. Overrides for customizing array creation and coercion. >>> 2. Overrides to implement operations for new dtypes. >>> 3. Overriding implementations of NumPy functions, e.g., FFT and ufuncs >>> with MKL. >>> >>> (1) could mostly be solved by adding np.duckarray() and another function >>> for duck array coercion. There is still the matter of overriding np.zeros >>> and the like, which perhaps justifies another new protocol, but in my >>> experience the use-cases for truly an array from scratch are quite rare. >>> >>> While they're rare for libraries like XArray; CuPy, Dask and >>> PyData/Sparse need these. >>> >>> >>> (2) should be tackled as part of overhauling NumPy's dtype system to >>> better support user defined dtypes. But it should definitely be in the form >>> of specialized protocols, e.g., which pass in preallocated arrays to into >>> ufuncs for a new dtype. By design, new dtypes should not be able to >>> customize the semantics of array *structure*. >>> >>> We already have a split in the type system with e.g. Cython's buffers, >>> Numba's parallel type system. This is a different issue altogether, e.g. 
>>> allowing a unyt dtype to spawn a unyt array, rather than forcing a re-write >>> of unyt to cooperate with NumPy's new dtype system. >>> >> >> I guess you're proposing that operations like np.sum(numpy_array, >> dtype=other_dtype) could rely on other_dtype for the implementation and >> potentially return a non-NumPy array? I'm not sure this is well motivated >> -- it would be helpful to discuss actual use-cases. >> >> The most commonly used NumPy functionality related to dtypes can be found >> only in methods on np.ndarray, e.g., astype() and view(). But I don't think >> there's any proposal to change that. >> >>> 4. Having default implementations that allow overrides of a large part >>> of the API while defining only a small part. This holds for e.g. >>> transpose/concatenate. >>> >> I'm not sure how unumpy solve the problems we encountered when trying to >> do this with __array_function__ -- namely the way that it exposes all of >> NumPy's internals, or requires rewriting a lot of internal NumPy code to >> ensure it always casts inputs with asarray(). >> >> I think it would be useful to expose default implementations of NumPy >> operations somewhere to make it easier to implement __array_function__, but >> it doesn't make much sense to couple this to user facing overrides. These >> can be exposed as a separate package or numpy module (e.g., >> numpy.default_implementations) that uses np.duckarray(), which library >> authors can make use of by calling inside their __aray_function__ methods. >> >>> 5. Generation of Random numbers (overriding RandomState). CuPy has its >>> own implementation which would be nice to override. >>> >> I'm not sure that NumPy's random state objects make sense for duck >> arrays. Because these are stateful objects, they are pretty coupled to >> NumPy's implementation -- you cannot store any additional state on >> RandomState objects that might be needed for a new implementation. At a >> bare minimum, you will loss the reproducibility of random seeds, though >> this may be less of a concern with the new random API. >> >>> I also share Nathaniel's concern that the overrides in unumpy are too >>> powerful, by allowing for control from arbitrary function arguments and >>> even *non-local* control (i.e., global variables) from context managers. >>> This level of flexibility can make code very hard to debug, especially in >>> larger codebases. >>> >>> Backend switching needs global context, in any case. There isn't a good >>> way around that other than the class dundermethods outlined in another >>> thread, which would require rewrites of large amounts of code. >>> >> >> Do we really need to support robust backend switching in NumPy? I'm not >> strongly opposed, but what use cases does it actually solve to be able to >> override np.fft.fft rather than using a new function? >> > > I don't know, but that feels like an odd question. We wanted an FFT > backend system. Now applying __array_function__ to numpy.fft happened > without a real discussion, but as a backend system I don't think it would > have met the criteria. Something that works for CuPy, Dask and Xarray, but > not for Pyfftw or mkl_fft is only half a solution. > I agree, __array_function__ is not a backend system. > Cheers, > Ralf > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: 
From sebastian at sipsolutions.net  Thu Sep 12 12:10:27 2019
From: sebastian at sipsolutions.net (Sebastian Berg)
Date: Thu, 12 Sep 2019 09:10:27 -0700
Subject: [Numpy-discussion] NumPy Community Survey
In-Reply-To: 
References: 
Message-ID: <5c334c899d08db072be60d35a2dfedb5f4000f20.camel@sipsolutions.net>

Hi all,

On Thu, 2019-08-29 at 00:41 -0400, Inessa Pawson wrote:
> You know that NumPy is essential to the Python community. The NumPy
> team wants you to know that YOU, our user and developer community,
> are essential to us. That's why we are putting together a team to
> create the inaugural NumPy Community Survey.
> We hope feedback will provide insights that will help us to guide
> better decision-making about the development of NumPy as software and
> community.
> For more information about the proposed survey please refer to
> github.com/numpy/numpy-surveys .
>
> Call for Contributions
> We are looking for volunteers experienced in survey design and
> translating English into Spanish, Portuguese, Russian, Hindi, Chinese
> and other languages.
>
> If you'd like to learn more about these volunteer opportunities, or
> additional ways to support NumPy, feel free to reach out to our
> community coordinators at numpy-team at googlegroups.com or join us on
> Slack numpy-team.slack.com (email to numpy-team at googlegroups.com for
> an invite first).
>

Just a reminder for everyone that the survey planning is ongoing and
happening at https://github.com/numpy/numpy-surveys as well as the Slack
channel above (and our weekly community calls). So if you are interested,
or have always wanted to ask users specific questions, now (or soon) is a
good time to contribute. It is a rare opportunity for us to do such a
survey!

Best,

Sebastian

> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at python.org
> https://mail.python.org/mailman/listinfo/numpy-discussion
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 833 bytes
Desc: This is a digitally signed message part
URL: 
From matti.picus at gmail.com  Fri Sep 13 02:25:11 2019
From: matti.picus at gmail.com (mattip)
Date: Thu, 12 Sep 2019 23:25:11 -0700 (MST)
Subject: [Numpy-discussion] Code review for adding axis argument to
 permutation and shuffle function
In-Reply-To: 
References: 
Message-ID: <1568355911628-0.post@n7.nabble.com>

This proposal to add an axis argument to permutation and shuffle seems to
have garnered no reply. Are people OK with it (for the new random.Generator
only) ?



--
Sent from: http://numpy-discussion.10968.n7.nabble.com/
From jni at fastmail.com  Fri Sep 13 02:47:08 2019
From: jni at fastmail.com (Juan Nunez-Iglesias)
Date: Fri, 13 Sep 2019 16:47:08 +1000
Subject: [Numpy-discussion] Code review for adding axis argument to
 permutation and shuffle function
In-Reply-To: <1568355911628-0.post@n7.nabble.com>
References: <1568355911628-0.post@n7.nabble.com>
Message-ID: 

I don't understand why the proposal would be controversial in any way. It's very natural to have `axis=` keyword arguments in NumPy, and it's the lack of them that is surprising. My only additional suggestion would be to allow tuples of axes, but that can come later.

Juan.

> On 13 Sep 2019, at 4:25 pm, mattip wrote:
>
> This proposal to add an axis argument to permutation and shuffle seems to
> have garnered no reply. Are people OK with it (for the new random.Generator
> only) ?
>
>
>
> --
> Sent from: http://numpy-discussion.10968.n7.nabble.com/
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at python.org
> https://mail.python.org/mailman/listinfo/numpy-discussion
From matthew.brett at gmail.com  Fri Sep 13 04:37:35 2019
From: matthew.brett at gmail.com (Matthew Brett)
Date: Fri, 13 Sep 2019 09:37:35 +0100
Subject: [Numpy-discussion] Code review for adding axis argument to
 permutation and shuffle function
In-Reply-To: 
References: <1568355911628-0.post@n7.nabble.com>
Message-ID: 

Hi,

Thanks - yes - I agree, an axis argument seems like a very sensible idea.

Cheers,

Matthew

On Fri, Sep 13, 2019 at 7:48 AM Juan Nunez-Iglesias wrote:
>
> I don't understand why the proposal would be controversial in any way. It's very natural to have `axis=` keyword arguments in NumPy, and it's the lack of them that is surprising. My only additional suggestion would be to allow tuples of axes, but that can come later.
>
> Juan.
>
> > On 13 Sep 2019, at 4:25 pm, mattip wrote:
> >
> > This proposal to add an axis argument to permutation and shuffle seems to
> > have garnered no reply. Are people OK with it (for the new random.Generator
> > only) ?
> >
> >
> >
> > --
> > Sent from: http://numpy-discussion.10968.n7.nabble.com/
> > _______________________________________________
> > NumPy-Discussion mailing list
> > NumPy-Discussion at python.org
> > https://mail.python.org/mailman/listinfo/numpy-discussion
>
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at python.org
> https://mail.python.org/mailman/listinfo/numpy-discussion
From irvin.probst at ensta-bretagne.fr  Fri Sep 13 06:48:55 2019
From: irvin.probst at ensta-bretagne.fr (Irvin Probst)
Date: Fri, 13 Sep 2019 12:48:55 +0200
Subject: [Numpy-discussion] round / set_printoptions discrepancy
Message-ID: <09ba3019-b523-e123-3874-c0195aa0d503@ensta-bretagne.fr>

Hi,
Is it expected/documented that np.round and np.set_printoptions do not
output the same result on screen ?
I stumbled into this running this code:

import numpy as np
mes = np.array([
    [16.06, 16.13, 16.06, 16.00, 16.06, 16.00, 16.13, 16.00]
])

avg = np.mean(mes, axis=1)
print(np.round(avg, 2))
np.set_printoptions(precision=2)
print(avg)


Which outputs:

[16.06]
[16.05]

Is that worth a bug report or did I miss something ? I've been able to
reproduce this on many windows/linux PCs with python/numpy releases from
2017 up to last week.

Thanks.
From deak.andris at gmail.com  Fri Sep 13 07:23:57 2019
From: deak.andris at gmail.com (Andras Deak)
Date: Fri, 13 Sep 2019 13:23:57 +0200
Subject: [Numpy-discussion] round / set_printoptions discrepancy
In-Reply-To: <09ba3019-b523-e123-3874-c0195aa0d503@ensta-bretagne.fr>
References: <09ba3019-b523-e123-3874-c0195aa0d503@ensta-bretagne.fr>
Message-ID: 

On Fri, Sep 13, 2019 at 12:58 PM Irvin Probst wrote:
>
> Hi,
> Is it expected/documented that np.round and np.set_printoptions do not
> output the same result on screen ?
> I stumbled into this running this code:
>
> import numpy as np
> mes = np.array([
>      [16.06, 16.13, 16.06, 16.00, 16.06, 16.00, 16.13, 16.00]
> ])
>
> avg = np.mean(mes, axis=1)
> print(np.round(avg, 2))
> np.set_printoptions(precision=2)
> print(avg)
>
>
> Which outputs:
>
> [16.06]
> [16.05]
>
> Is that worth a bug report or did I miss something ? I've been able to
> reproduce this on many windows/linux PCs with python/numpy releases from
> 2017 up to last week.
>
> Thanks.
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at python.org
> https://mail.python.org/mailman/listinfo/numpy-discussion

Hi,

I just want to add that you can use literal 16.055 to reproduce this:
>>> import numpy as np
>>> np.set_printoptions(precision=2)
>>> np.array([16.055]).round(2)
array([16.06])
>>> np.array([16.055])
array([16.05])

I would think it has to do with "round to nearest even":
>>> np.array(16.055)
array(16.05)
>>> np.array(16.065)
array(16.07)
>>> np.array(16.065).round(2)
16.07

But it's as if `round` rounded decimal digits upwards (16.055 ->
16.06, 16.065 -> 16.07), whereas the `repr` rounded to the nearest
odd(!) digit (16.055 -> 16.05, 16.065 -> 16.07). Does this make any
sense? I'm on numpy 1.17.2.
(Scalars or 1-length 1d arrays don't seem to make a difference).
Regards,

Andrés
From hodge at stsci.edu  Fri Sep 13 08:05:27 2019
From: hodge at stsci.edu (Philip Hodge)
Date: Fri, 13 Sep 2019 08:05:27 -0400
Subject: [Numpy-discussion] round / set_printoptions discrepancy
In-Reply-To: 
References: <09ba3019-b523-e123-3874-c0195aa0d503@ensta-bretagne.fr>
Message-ID: <3d56f064-3fb3-69a7-4a07-093eb276e21c@stsci.edu>

On 9/13/19 7:23 AM, Andras Deak wrote:
> I just want to add that you can use literal 16.055 to reproduce this:
>>>> import numpy as np
>>>> np.set_printoptions(precision=2)
>>>> np.array([16.055]).round(2)
> array([16.06])
>>>> np.array([16.055])
> array([16.05])
>
> I would think it has to do with "round to nearest even":
>>>> np.array(16.055)
> array(16.05)
>>>> np.array(16.065)
> array(16.07)
>>>> np.array(16.065).round(2)
> 16.07
>
> But it's as if `round` rounded decimal digits upwards (16.055 ->
> 16.06, 16.065 -> 16.07), whereas the `repr` rounded to the nearest
> odd(!) digit (16.055 -> 16.05, 16.065 -> 16.07). Does this make any
> sense? I'm on numpy 1.17.2.
> (Scalars or 1-length 1d arrays don't seem to make a difference).
> Regards,
>
> Andrés

Isn't that just for consistency with Python 3 round()? I agree that
the discrepancy with np.set_printoptions is not necessarily expected,
except possibly for backwards compatibility.

Phil
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
From irvin.probst at ensta-bretagne.fr  Fri Sep 13 08:45:23 2019
From: irvin.probst at ensta-bretagne.fr (Irvin Probst)
Date: Fri, 13 Sep 2019 14:45:23 +0200
Subject: [Numpy-discussion] round / set_printoptions discrepancy
In-Reply-To: <3d56f064-3fb3-69a7-4a07-093eb276e21c@stsci.edu>
References: <09ba3019-b523-e123-3874-c0195aa0d503@ensta-bretagne.fr>
 <3d56f064-3fb3-69a7-4a07-093eb276e21c@stsci.edu>
Message-ID: <9185ae23-8ebd-6e62-5c5d-4288123b1ea6@ensta-bretagne.fr>

On 13/09/2019 14:05, Philip Hodge wrote:
>
> Isn't that just for consistency with Python 3 round()? I agree that
> the discrepancy with np.set_printoptions is not necessarily expected,
> except possibly for backwards compatibility.
>
>

I've just checked and np.set_printoptions behaves as python's round:

>>> round(16.055,2)
16.05
>>> np.round(16.055,2)
16.06

I don't know why round and np.round do not behave the same, actually I
would even dare to say that I don't care :-)
However np.round and np.set_printoptions should provide the same
output, shouldn't they ? This discrepancy is really disturbing whereas
consistency with python's round looks like the icing on the cake but
in no way a required feature.

--
Irvin
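(As a side note on the mechanism, a minimal sketch of where the two
results can come from, assuming the scale-round-rescale approach that
np.round's documentation describes; Python's round() instead works from
the decimal value of the closest double:)

>>> import numpy as np
>>> format(16.055, '.20f')  # the closest double is just below 16.055
'16.05499999999999971578'
>>> 16.055 * 100            # scaling by 10**2 rounds up to an exact .5 tie
1605.5
>>> np.rint(1605.5)         # ties go to the nearest even integer
1606.0
>>> np.rint(1605.5) / 100   # hence 16.06 from np.round(16.055, 2)
16.06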
From hodge at stsci.edu  Fri Sep 13 08:59:17 2019
From: hodge at stsci.edu (Philip Hodge)
Date: Fri, 13 Sep 2019 08:59:17 -0400
Subject: [Numpy-discussion] round / set_printoptions discrepancy
In-Reply-To: <9185ae23-8ebd-6e62-5c5d-4288123b1ea6@ensta-bretagne.fr>
References: <09ba3019-b523-e123-3874-c0195aa0d503@ensta-bretagne.fr>
 <3d56f064-3fb3-69a7-4a07-093eb276e21c@stsci.edu>
 <9185ae23-8ebd-6e62-5c5d-4288123b1ea6@ensta-bretagne.fr>
Message-ID: <53f3db55-1c47-cb8e-1805-718bf2f3a5f2@stsci.edu>

On 9/13/19 8:45 AM, Irvin Probst wrote:
> On 13/09/2019 14:05, Philip Hodge wrote:
>>
>> Isn't that just for consistency with Python 3 round()? I agree that
>> the discrepancy with np.set_printoptions is not necessarily expected,
>> except possibly for backwards compatibility.
>>
>>
>
> I've just checked and np.set_printoptions behaves as python's round:
>
> >>> round(16.055,2)
> 16.05
> >>> np.round(16.055,2)
> 16.06
>
> I don't know why round and np.round do not behave the same, actually I
> would even dare to say that I don't care :-)
> However np.round and np.set_printoptions should provide the same
> output, shouldn't they ? This discrepancy is really disturbing whereas
> consistency with python's round looks like the icing on the cake but
> in no way a required feature.
>

Python round() is supposed to round to the nearest even value, if the
two closest values are equally close. So round(16.055, 2) returning
16.05 was a surprise to me. I checked the documentation and found a
note that explained that this was because "most decimal fractions can't
be represented exactly as a float." round(16.55) returns 16.6.

Phil

From deak.andris at gmail.com  Fri Sep 13 09:19:06 2019
From: deak.andris at gmail.com (Andras Deak)
Date: Fri, 13 Sep 2019 15:19:06 +0200
Subject: [Numpy-discussion] round / set_printoptions discrepancy
In-Reply-To: <53f3db55-1c47-cb8e-1805-718bf2f3a5f2@stsci.edu>
References: <09ba3019-b523-e123-3874-c0195aa0d503@ensta-bretagne.fr>
 <3d56f064-3fb3-69a7-4a07-093eb276e21c@stsci.edu>
 <9185ae23-8ebd-6e62-5c5d-4288123b1ea6@ensta-bretagne.fr>
 <53f3db55-1c47-cb8e-1805-718bf2f3a5f2@stsci.edu>
Message-ID: 

On Fri, Sep 13, 2019 at 2:59 PM Philip Hodge wrote:
>
> On 9/13/19 8:45 AM, Irvin Probst wrote:
> > On 13/09/2019 14:05, Philip Hodge wrote:
> >>
> >> Isn't that just for consistency with Python 3 round()? I agree that
> >> the discrepancy with np.set_printoptions is not necessarily expected,
> >> except possibly for backwards compatibility.
> >>
> >>
> >
> > I've just checked and np.set_printoptions behaves as python's round:
> >
> > >>> round(16.055,2)
> > 16.05
> > >>> np.round(16.055,2)
> > 16.06
> >
> > I don't know why round and np.round do not behave the same, actually I
> > would even dare to say that I don't care :-)
> > However np.round and np.set_printoptions should provide the same
> > output, shouldn't they ? This discrepancy is really disturbing whereas
> > consistency with python's round looks like the icing on the cake but
> > in no way a required feature.
> >
>
> Python round() is supposed to round to the nearest even value, if the
> two closest values are equally close. So round(16.055, 2) returning
> 16.05 was a surprise to me. I checked the documentation and found a
> note that explained that this was because "most decimal fractions can't
> be represented exactly as a float." round(16.55) returns 16.6.
> > Phil
> >
> > _______________________________________________
> > NumPy-Discussion mailing list
> > NumPy-Discussion at python.org
> > https://mail.python.org/mailman/listinfo/numpy-discussion

Ah, of course, endless double-precision shenanigans...
>>> format(16.055, '.30f')
'16.054999999999999715782905695960'

>>> format(16.55, '.30f')
'16.550000000000000710542735760100'

Andrés
From ewm at redtetrahedron.org  Fri Sep 13 09:26:21 2019
From: ewm at redtetrahedron.org (Eric Moore)
Date: Fri, 13 Sep 2019 09:26:21 -0400
Subject: [Numpy-discussion] round / set_printoptions discrepancy
In-Reply-To: 
References: <09ba3019-b523-e123-3874-c0195aa0d503@ensta-bretagne.fr>
 <3d56f064-3fb3-69a7-4a07-093eb276e21c@stsci.edu>
 <9185ae23-8ebd-6e62-5c5d-4288123b1ea6@ensta-bretagne.fr>
 <53f3db55-1c47-cb8e-1805-718bf2f3a5f2@stsci.edu>
Message-ID: 

See the notes section here.
https://numpy.org/devdocs/reference/generated/numpy.around.html.

This note was recently added in https://github.com/numpy/numpy/pull/14392

Eric

On Fri, Sep 13, 2019 at 9:20 AM Andras Deak wrote:

> On Fri, Sep 13, 2019 at 2:59 PM Philip Hodge wrote:
> >
> > On 9/13/19 8:45 AM, Irvin Probst wrote:
> > > On 13/09/2019 14:05, Philip Hodge wrote:
> > >>
> > >> Isn't that just for consistency with Python 3 round()? I agree that
> > >> the discrepancy with np.set_printoptions is not necessarily expected,
> > >> except possibly for backwards compatibility.
> > >>
> > >>
> > >
> > > I've just checked and np.set_printoptions behaves as python's round:
> > >
> > > >>> round(16.055,2)
> > > 16.05
> > > >>> np.round(16.055,2)
> > > 16.06
> > >
> > > I don't know why round and np.round do not behave the same, actually I
> > > would even dare to say that I don't care :-)
> > > However np.round and np.set_printoptions should provide the same
> > > output, shouldn't they ? This discrepancy is really disturbing whereas
> > > consistency with python's round looks like the icing on the cake but
> > > in no way a required feature.
> > >
> >
> > Python round() is supposed to round to the nearest even value, if the
> > two closest values are equally close. So round(16.055, 2) returning
> > 16.05 was a surprise to me. I checked the documentation and found a
> > note that explained that this was because "most decimal fractions can't
> > be represented exactly as a float." round(16.55) returns 16.6.
> >
> > Phil
> >
> > _______________________________________________
> > NumPy-Discussion mailing list
> > NumPy-Discussion at python.org
> > https://mail.python.org/mailman/listinfo/numpy-discussion
>
> Ah, of course, endless double-precision shenanigans...
> >>> format(16.055, '.30f')
> '16.054999999999999715782905695960'
>
> >>> format(16.55, '.30f')
> '16.550000000000000710542735760100'
>
> Andrés
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at python.org
> https://mail.python.org/mailman/listinfo/numpy-discussion
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: From irvin.probst at ensta-bretagne.fr Fri Sep 13 09:34:41 2019 From: irvin.probst at ensta-bretagne.fr (Irvin Probst) Date: Fri, 13 Sep 2019 15:34:41 +0200 Subject: [Numpy-discussion] round / set_printoptions discrepancy In-Reply-To: References: <09ba3019-b523-e123-3874-c0195aa0d503@ensta-bretagne.fr> <3d56f064-3fb3-69a7-4a07-093eb276e21c@stsci.edu> <9185ae23-8ebd-6e62-5c5d-4288123b1ea6@ensta-bretagne.fr> <53f3db55-1c47-cb8e-1805-718bf2f3a5f2@stsci.edu> Message-ID: <46cc64b8-9276-9072-9c1c-d7a3d331824a@ensta-bretagne.fr> On 13/09/2019 15:26, Eric Moore wrote: > See the notes section here. > https://numpy.org/devdocs/reference/generated/numpy.around.html. > > This note was recently added in https://github.com/numpy/numpy/pull/14392 > > Thanks, it indeed explains the discrepancy. From stefano.miccoli at polimi.it Fri Sep 13 11:59:43 2019 From: stefano.miccoli at polimi.it (Stefano Miccoli) Date: Fri, 13 Sep 2019 15:59:43 +0000 Subject: [Numpy-discussion] round / set_printoptions discrepancy Message-ID: In my opinion the problem is that numpy floats break the Liskov substitution principle, >>> pyfloat = 16.055 >>> npfloat = np.float64(pyfloat) >>> isinstance(npfloat, float) True >>> round(pyfloat, 2) 16.05 >>> round(npfloat, 2) 16.06 Since numpy.float64 is a subclass of builtins.float I would expect that >>> round(x, j) == round(np.float64(x), j) is an invariant, but unfortunately this is not the case. Moreover the python3 semantics of the round function require that when the number of digits is None, the return value should be of integral type: >>> round(pyfloat) 16 >>> round(pyfloat, None) 16 >>> round(pyfloat, 0) 16.0 >>> round(npfloat) 16.0 >>> round(npfloat, None) 16.0 >>> round(npfloat, 0) 16.0 see also https://github.com/numpy/numpy/issues/11810 Stefano From warren.weckesser at gmail.com Fri Sep 13 16:18:06 2019 From: warren.weckesser at gmail.com (Warren Weckesser) Date: Fri, 13 Sep 2019 16:18:06 -0400 Subject: [Numpy-discussion] Code review for adding axis argument to permutation and shuffle function In-Reply-To: References: Message-ID: On 7/4/19, Kexuan Sun wrote: > Hi, > > I would like to request a code review. The random.permutation and > random.shuffle functions now can only shuffle along the first axis of a > multi-dimensional array. I propose to add an axis argument for the > functions and allow them to shuffle along a given axis. Here is the link > to the PR (https://github.com/numpy/numpy/pull/13829). Given the current semantics of 'shuffle', the proposed change makes sense. However, I would like to call attention to https://github.com/numpy/numpy/issues/5173 and to the mailing list thread from 2014 that I started here: https://mail.python.org/pipermail/numpy-discussion/2014-October/071340.html The topic of those discussions was that the current behavior of 'shuffle' is often *not* what users want or expect. What is often desired is to shuffle each row (or column, or whatever dimension is specified) *independently* of the others. So if a = np.array([[0, 1, 2, 3, 4], [5, 6, 7, 8, 9], [10, 11, 12, 13, 14]]), then randomly shuffling 'a' along axis=1 should shuffle each row independently of the others, to create something like a = np.array([[2, 4, 0, 3, 1], [8, 6, 9, 7, 5], [11, 12, 10, 14, 13]]) An API for this was discussed (and of course that ran into the second of the two hard problems in computer science, naming things). 
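(As a rough sketch of that per-row behavior, assuming the NumPy >= 1.17
Generator API and the illustrative names `rng` and `idx` -- draw
independent random keys and argsort them along the target axis:)

>>> import numpy as np
>>> rng = np.random.default_rng()
>>> a = np.array([[0, 1, 2, 3, 4],
...               [5, 6, 7, 8, 9],
...               [10, 11, 12, 13, 14]])
>>> idx = rng.random(a.shape).argsort(axis=1)  # one permutation per row
>>> shuffled = a[np.arange(a.shape[0])[:, None], idx]  # rows shuffled independently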
Take a look at those discussions, and check that
https://github.com/numpy/numpy/pull/13829 fits in with the possible
changes mentioned in those discussions.

If we don't use the name 'shuffle' for the new random permutation
function(s), then the change in PR 13829 is a good one. However, if we
want to try to reuse the name 'shuffle' to also allow independent
shuffling along an axis, then we have to be careful with how we
interpret the 'axis' argument.

Warren

>
> Thanks!
>
>
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at python.org
> https://mail.python.org/mailman/listinfo/numpy-discussion
>
From allanhaldane at gmail.com  Fri Sep 13 18:27:51 2019
From: allanhaldane at gmail.com (Allan Haldane)
Date: Fri, 13 Sep 2019 18:27:51 -0400
Subject: [Numpy-discussion] round / set_printoptions discrepancy
In-Reply-To: 
References: <09ba3019-b523-e123-3874-c0195aa0d503@ensta-bretagne.fr>
 <3d56f064-3fb3-69a7-4a07-093eb276e21c@stsci.edu>
 <9185ae23-8ebd-6e62-5c5d-4288123b1ea6@ensta-bretagne.fr>
 <53f3db55-1c47-cb8e-1805-718bf2f3a5f2@stsci.edu>
Message-ID: <58b61af1-3f50-1250-71c6-7ad830c7f19e@gmail.com>

On 9/13/19 9:26 AM, Eric Moore wrote:
> See the notes section here.
> https://numpy.org/devdocs/reference/generated/numpy.around.html.
>
> This note was recently added in https://github.com/numpy/numpy/pull/14392
>
> Eric

Hmm, but this example with 16.055 shows the note still isn't quite
right. The doc suggests that the floating point error only matters for
large values or large `decimals`, but this shows it also happens for
small values. Makes sense now that I see the example. We should tweak
the docstring.

Also, I did make some notes in https://github.com/numpy/numpy/issues/14391
for how we could "fix" this problem efficiently. Unfortunately it's far
from trivial to write a correct rounding algorithm, and I'm not sure it's
worth the effort: The round error is comparable to normal floating-point
error, and I don't think round is heavily used.

Best,
Allan

> On Fri, Sep 13, 2019 at 9:20 AM Andras Deak wrote:
>
> On Fri, Sep 13, 2019 at 2:59 PM Philip Hodge wrote:
> >
> > On 9/13/19 8:45 AM, Irvin Probst wrote:
> > > On 13/09/2019 14:05, Philip Hodge wrote:
> > >>
> > >> Isn't that just for consistency with Python 3 round()? I agree that
> > >> the discrepancy with np.set_printoptions is not necessarily expected,
> > >> except possibly for backwards compatibility.
> > >>
> > >>
> > >
> > > I've just checked and np.set_printoptions behaves as python's round:
> > >
> > > >>> round(16.055,2)
> > > 16.05
> > > >>> np.round(16.055,2)
> > > 16.06
> > >
> > > I don't know why round and np.round do not behave the same, actually I
> > > would even dare to say that I don't care :-)
> > > However np.round and np.set_printoptions should provide the same
> > > output, shouldn't they ? This discrepancy is really disturbing whereas
> > > consistency with python's round looks like the icing on the cake but
> > > in no way a required feature.
> > >
> >
> > Python round() is supposed to round to the nearest even value, if the
> > two closest values are equally close. So round(16.055, 2) returning
> > 16.05 was a surprise to me. I checked the documentation and found a
> > note that explained that this was because "most decimal fractions can't
> > be represented exactly as a float." round(16.55) returns 16.6.
> > > > Phil > > > > _______________________________________________ > > NumPy-Discussion mailing list > > NumPy-Discussion at python.org > > https://mail.python.org/mailman/listinfo/numpy-discussion > > Ah, of course, endless double-precision shenanigans... > >>> format(16.055, '.30f') > '16.054999999999999715782905695960' > > >>> format(16.55, '.30f') > '16.550000000000000710542735760100' > > Andr?s > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > From ralf.gommers at gmail.com Sun Sep 15 20:24:57 2019 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Sun, 15 Sep 2019 17:24:57 -0700 Subject: [Numpy-discussion] keeping all meeting notes and docs in new repo? Message-ID: Hi all, We have had community calls for quite a while, the minutes of which are kept in https://github.com/BIDS-numpy/docs. That's quite hard to discover, it would be better if those lived under the NumPy GitHub org. Also, we have minutes from Season of Docs and website redesign calls, plus occasionally some other docs (e.g. the roadmap drafts were on hackmd.io). Would it make sense to add a new repo to contain all such meeting minutes and docs? Presentations and proposals may make sense to add as well - several people have given presentations or submitted proposals on behalf of the project. Inessa also suggested to enable HackMD Hub (see https://hackmd.io/c/tutorials/%2Fs%2Flink-with-github) so we get automatic versioning for some HackMD documents. I haven't used it before, but it looks good. Thoughts? Cheers, Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From chris.barker at noaa.gov Mon Sep 16 15:09:19 2019 From: chris.barker at noaa.gov (Chris Barker) Date: Mon, 16 Sep 2019 12:09:19 -0700 Subject: [Numpy-discussion] How to Capitalize numpy? In-Reply-To: References: Message-ID: Trivial note: On the subject of naming things (spelling things??) -- should it be: numpy or Numpy or NumPy ? All three are in the draft NEP 30 ( mostly "NumPy", I noticed this when reading/copy editing the NEP) . Is there an "official" capitalization? My preference, would be to use "numpy", and where practicable, use a "computer" font -- i.e. ``numpy`` in RST. But if there is consensus already for anything else, that's fine, I'd just like to know what it is. -CHB On Mon, Aug 12, 2019 at 4:02 AM Peter Andreas Entschev wrote: > Apologies for the late reply. I've opened a new PR > https://github.com/numpy/numpy/pull/14257 with the changes requested > on clarifying the text. After reading the detailed description, I've > decided to add a subsection "Scope" to clarify the scope where NEP-30 > would be useful. I think the inclusion of this new subsection > complements the "Detail description" forming a complete text w.r.t. > motivation of the NEP, but feel free to point out disagreements with > my suggestion. I've also added a new section "Usage" pointing out how > one would use duck array in replacement to np.asarray where relevant. > > Regarding the naming discussion, I must say I like the idea of keeping > the __array_ prefix, but it seems like that is going to be difficult > given that none of the existing ideas so far play very nicely with > that. 
So if the general consensus is to go with __numpy_like__, I > would also update the NEP to reflect that changes. FWIW, I > particularly neither like nor dislike __numpy_like__, but I don't have > any better suggestions than that or keeping the current naming. > > Best, > Peter > > On Thu, Aug 8, 2019 at 3:40 AM Stephan Hoyer wrote: > > > > > > > > On Wed, Aug 7, 2019 at 6:18 PM Charles R Harris < > charlesr.harris at gmail.com> wrote: > >> > >> > >> > >> On Wed, Aug 7, 2019 at 7:10 PM Stephan Hoyer wrote: > >>> > >>> On Wed, Aug 7, 2019 at 5:11 PM Ralf Gommers > wrote: > >>>> > >>>> > >>>> On Mon, Aug 5, 2019 at 6:18 PM Stephan Hoyer > wrote: > >>>>> > >>>>> On Mon, Aug 5, 2019 at 2:48 PM Ralf Gommers > wrote: > >>>>> > >>>>>> > >>>>>> The NEP currently does not say who this is meant for. Would you > expect libraries like SciPy to adopt it for example? > >>>>>> > >>>>>> The NEP also (understandably) punts on the question of when > something is a valid duck array. If you want this to be widely used, that > will need an answer or at least some rough guidance though. For example, we > would expect a duck array to have a mean() method, but probably not a ptp() > method. A library author who wants to use np.duckarray() needs to know, > because she can't test with all existing and future duck array > implementations. > >>>>> > >>>>> > >>>>> I think this is covered in NEP-22 already. > >>>> > >>>> > >>>> It's not really. We discussed this briefly in the community call > today, Peter said he will try to add some text. > >>>> > >>>> We should not add new functions to NumPy without indicating who is > supposed to use this, and what need it fills / problem it solves. It seems > pretty clear to me that it's mostly aimed at library authors rather than > end users. And also that mature libraries like SciPy may not immediately > adopt it, because it's too fuzzy - so it's new libraries first, mature > libraries after the dust has settled a bit (I think). > >>> > >>> > >>> I totally agree -- we definitely should clarify this in the docstring > and elsewhere in the docs. An example in the new doc page on "Writing > custom array containers" ( > https://numpy.org/devdocs/user/basics.dispatch.html) would also probably > be appropriate. > >>> > >>>>> > >>>>> As discussed there, I don't think NumPy is in a good position to > pronounce decisive APIs at this time. I would welcome efforts to try, but I > don't think that's essential for now. > >>>> > >>>> > >>>> There's no need to pronounce a decisive API that fully covers duck > array. Note that RNumPy is an attempt in that direction (not a full one, > but way better than nothing). In the NEP/docs, at least saying something > along the lines of "if you implement this, we recommend the following > strategy: check if a function is present in Dask, CuPy and Sparse. If so, > it's reasonable to expect any duck array to work here. If not, we suggest > you indicate in your docstring what kinds of duck arrays are accepted, or > what properties they need to have". That's a spec by implementation, which > is less than ideal but better than saying nothing. > >>> > >>> > >>> OK, I agree here as well -- some guidance is better than nothing. > >>> > >>> Two other minor notes on this NEP, concerning naming: > >>> 1. We should have a brief note on why we settled on the name "duck > array". Namely, as discussed in NEP-22, we don't love the "duck" jargon, > but we couldn't come up with anything better since NumPy already uses > "array like" and "any array" for different purposes. 
> >>> 2. The protocol should use *something* more clearly namespaced as
> NumPy specific than __duckarray__. All the other special protocols NumPy
> defines start with "__array_". That suggests either __array_duckarray__
> (sounds a little redundant) or __numpy_duckarray__ (which I like the look
> of, but is a different from the existing protocols).
> >>>
> >>
> >> `__numpy_like__` ?
> >
> >
> >
> > This could work, but I think we would also want to rename the NumPy
> function itself to either np.like or np.numpy_like. The later is a little
> redundant but definitely more self-descriptive than "duck array".
> >
> >>
> >> Chuck
> >> _______________________________________________
> >> NumPy-Discussion mailing list
> >> NumPy-Discussion at python.org
> >> https://mail.python.org/mailman/listinfo/numpy-discussion
> >
> > _______________________________________________
> > NumPy-Discussion mailing list
> > NumPy-Discussion at python.org
> > https://mail.python.org/mailman/listinfo/numpy-discussion
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at python.org
> https://mail.python.org/mailman/listinfo/numpy-discussion

--

Christopher Barker, Ph.D.
Oceanographer

Emergency Response Division
NOAA/NOS/OR&R (206) 526-6959 voice
7600 Sand Point Way NE (206) 526-6329 fax
Seattle, WA 98115 (206) 526-6317 main reception

Chris.Barker at noaa.gov
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
From chris.barker at noaa.gov  Mon Sep 16 15:25:35 2019
From: chris.barker at noaa.gov (Chris Barker)
Date: Mon, 16 Sep 2019 12:25:35 -0700
Subject: [Numpy-discussion] NEP 30 - Duck Typing for NumPy Arrays -
 Implementation
In-Reply-To: 
References: 
Message-ID: 

On Mon, Aug 12, 2019 at 4:02 AM Peter Andreas Entschev wrote:

> Apologies for the late reply. I've opened a new PR
> https://github.com/numpy/numpy/pull/14257 with the changes requested

thanks!

I've written a small PR on your PR:

https://github.com/pentschev/numpy/pull/1

Essentially, other than typos and copy editing, I'm suggesting that a
duck-array could choose to implement __array__ if it so chooses -- it
should, of course, return an actual numpy array.

I think this could be useful, as much code does require an actual numpy
array, and only that class itself knows how best to convert to one.

-CHB

--

Christopher Barker, Ph.D.
Oceanographer

Emergency Response Division
NOAA/NOS/OR&R (206) 526-6959 voice
7600 Sand Point Way NE (206) 526-6329 fax
Seattle, WA 98115 (206) 526-6317 main reception

Chris.Barker at noaa.gov
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
From chris.barker at noaa.gov  Mon Sep 16 15:36:00 2019
From: chris.barker at noaa.gov (Chris Barker)
Date: Mon, 16 Sep 2019 12:36:00 -0700
Subject: [Numpy-discussion] Add a total_seconds() method to timedelta64?
Message-ID: 

I just noticed that there is no obvious way to convert a timedelta64 to
seconds (or some other easy unit) as a number. The stdlib
datetime.timedelta has a .total_seconds() method for doing that. I think
it's a handy thing to have.

Looking at StackOverflow (and others), I see people suggesting things like:

a_timedelta.astype(np.float) / 1e6

This seems a really bad idea, as it's assuming the timedelta is storing
microseconds.

The "proper" way to do it also suggested:

a_timedelta / np.timedelta64(1, 's')

This is, in fact, a much better way to do it, and allows you to specify
other units if you like: "ms", "us", etc.
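(For instance, a quick numerical sketch of the division approach, with an
arbitrary value:)

>>> import numpy as np
>>> td = np.timedelta64(5025, 's')
>>> td / np.timedelta64(1, 's')   # total seconds
5025.0
>>> td / np.timedelta64(1, 'm')   # total minutes
83.75
>>> td / np.timedelta64(1, 'h')   # total hours
1.3958333333333333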
There was semi-recently a discussion thread on python-ideas about adding
other methods to timedelta (e.g. .total_hours, .total_minutes). That was
pretty much rejected (or petered out anyway), and some argued that
dividing by a timedelta of the unit you want is the "right" way to do it
anyway (some argued that .total_seconds() never should have been added).

Personally I understand the "correctness" of using division by a unit
timedelta, but "practicality beats purity", and the discoverability of a
method or two really makes it easier on folks.

That being said, if folks don't want to add .total_seconds and the like,
we should add a bit to the docs about this, suggesting using the division
approach.

-CHB

--

Christopher Barker, Ph.D.
Oceanographer

Emergency Response Division
NOAA/NOS/OR&R (206) 526-6959 voice
7600 Sand Point Way NE (206) 526-6329 fax
Seattle, WA 98115 (206) 526-6317 main reception

Chris.Barker at noaa.gov
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
From peter at entschev.com  Mon Sep 16 16:35:33 2019
From: peter at entschev.com (Peter Andreas Entschev)
Date: Mon, 16 Sep 2019 22:35:33 +0200
Subject: [Numpy-discussion] How to Capitalize numpy?
In-Reply-To: 
References: 
Message-ID: 

My answer to that: "NumPy". Reference: logo at the top of
https://numpy.org/neps/index.html .

In NEP-30 [1], I've used "NumPy" everywhere, except for references to
code, repos, etc., where "numpy" is used. I see there's one occurrence
of "Numpy", which was definitely a typo and I had not noticed it until
now, but I will address this in a future update, thanks for pointing
that out.

[1] https://numpy.org/neps/nep-0030-duck-array-protocol.html

On Mon, Sep 16, 2019 at 9:09 PM Chris Barker wrote:
>
> Trivial note:
>
> On the subject of naming things (spelling things??) -- should it be:
>
> numpy
> or
> Numpy
> or
> NumPy
> ?
>
> All three are in the draft NEP 30 ( mostly "NumPy", I noticed this when reading/copy editing the NEP) . Is there an "official" capitalization?
>
> My preference, would be to use "numpy", and where practicable, use a "computer" font -- i.e. ``numpy`` in RST.
>
> But if there is consensus already for anything else, that's fine, I'd just like to know what it is.
>
> -CHB
>
>
>
> On Mon, Aug 12, 2019 at 4:02 AM Peter Andreas Entschev wrote:
>>
>> Apologies for the late reply. I've opened a new PR
>> https://github.com/numpy/numpy/pull/14257 with the changes requested
>> on clarifying the text. After reading the detailed description, I've
>> decided to add a subsection "Scope" to clarify the scope where NEP-30
>> would be useful. I think the inclusion of this new subsection
>> complements the "Detail description" forming a complete text w.r.t.
>> motivation of the NEP, but feel free to point out disagreements with
>> my suggestion. I've also added a new section "Usage" pointing out how
>> one would use duck array in replacement to np.asarray where relevant.
>>
>> Regarding the naming discussion, I must say I like the idea of keeping
>> the __array_ prefix, but it seems like that is going to be difficult
>> given that none of the existing ideas so far play very nicely with
>> that. So if the general consensus is to go with __numpy_like__, I
>> would also update the NEP to reflect that changes. FWIW, I
>> particularly neither like nor dislike __numpy_like__, but I don't have
>> any better suggestions than that or keeping the current naming.
>> >> Best, >> Peter >> >> On Thu, Aug 8, 2019 at 3:40 AM Stephan Hoyer wrote: >> > >> > >> > >> > On Wed, Aug 7, 2019 at 6:18 PM Charles R Harris wrote: >> >> >> >> >> >> >> >> On Wed, Aug 7, 2019 at 7:10 PM Stephan Hoyer wrote: >> >>> >> >>> On Wed, Aug 7, 2019 at 5:11 PM Ralf Gommers wrote: >> >>>> >> >>>> >> >>>> On Mon, Aug 5, 2019 at 6:18 PM Stephan Hoyer wrote: >> >>>>> >> >>>>> On Mon, Aug 5, 2019 at 2:48 PM Ralf Gommers wrote: >> >>>>> >> >>>>>> >> >>>>>> The NEP currently does not say who this is meant for. Would you expect libraries like SciPy to adopt it for example? >> >>>>>> >> >>>>>> The NEP also (understandably) punts on the question of when something is a valid duck array. If you want this to be widely used, that will need an answer or at least some rough guidance though. For example, we would expect a duck array to have a mean() method, but probably not a ptp() method. A library author who wants to use np.duckarray() needs to know, because she can't test with all existing and future duck array implementations. >> >>>>> >> >>>>> >> >>>>> I think this is covered in NEP-22 already. >> >>>> >> >>>> >> >>>> It's not really. We discussed this briefly in the community call today, Peter said he will try to add some text. >> >>>> >> >>>> We should not add new functions to NumPy without indicating who is supposed to use this, and what need it fills / problem it solves. It seems pretty clear to me that it's mostly aimed at library authors rather than end users. And also that mature libraries like SciPy may not immediately adopt it, because it's too fuzzy - so it's new libraries first, mature libraries after the dust has settled a bit (I think). >> >>> >> >>> >> >>> I totally agree -- we definitely should clarify this in the docstring and elsewhere in the docs. An example in the new doc page on "Writing custom array containers" (https://numpy.org/devdocs/user/basics.dispatch.html) would also probably be appropriate. >> >>> >> >>>>> >> >>>>> As discussed there, I don't think NumPy is in a good position to pronounce decisive APIs at this time. I would welcome efforts to try, but I don't think that's essential for now. >> >>>> >> >>>> >> >>>> There's no need to pronounce a decisive API that fully covers duck array. Note that RNumPy is an attempt in that direction (not a full one, but way better than nothing). In the NEP/docs, at least saying something along the lines of "if you implement this, we recommend the following strategy: check if a function is present in Dask, CuPy and Sparse. If so, it's reasonable to expect any duck array to work here. If not, we suggest you indicate in your docstring what kinds of duck arrays are accepted, or what properties they need to have". That's a spec by implementation, which is less than ideal but better than saying nothing. >> >>> >> >>> >> >>> OK, I agree here as well -- some guidance is better than nothing. >> >>> >> >>> Two other minor notes on this NEP, concerning naming: >> >>> 1. We should have a brief note on why we settled on the name "duck array". Namely, as discussed in NEP-22, we don't love the "duck" jargon, but we couldn't come up with anything better since NumPy already uses "array like" and "any array" for different purposes. >> >>> 2. The protocol should use *something* more clearly namespaced as NumPy specific than __duckarray__. All the other special protocols NumPy defines start with "__array_". 
That suggests either __array_duckarray__ (sounds a little redundant) or __numpy_duckarray__ (which I like the look of, but is a different from the existing protocols). >> >>> >> >> >> >> `__numpy_like__` ? >> > >> > >> > >> > This could work, but I think we would also want to rename the NumPy function itself to either np.like or np.numpy_like. The later is a little redundant but definitely more self-descriptive than "duck array". >> > >> >> >> >> Chuck >> >> _______________________________________________ >> >> NumPy-Discussion mailing list >> >> NumPy-Discussion at python.org >> >> https://mail.python.org/mailman/listinfo/numpy-discussion >> > >> > _______________________________________________ >> > NumPy-Discussion mailing list >> > NumPy-Discussion at python.org >> > https://mail.python.org/mailman/listinfo/numpy-discussion >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at python.org >> https://mail.python.org/mailman/listinfo/numpy-discussion > > > > -- > > Christopher Barker, Ph.D. > Oceanographer > > Emergency Response Division > NOAA/NOS/OR&R (206) 526-6959 voice > 7600 Sand Point Way NE (206) 526-6329 fax > Seattle, WA 98115 (206) 526-6317 main reception > > Chris.Barker at noaa.gov > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion From peter at entschev.com Mon Sep 16 16:44:44 2019 From: peter at entschev.com (Peter Andreas Entschev) Date: Mon, 16 Sep 2019 22:44:44 +0200 Subject: [Numpy-discussion] NEP 30 - Duck Typing for NumPy Arrays - Implementation In-Reply-To: References: Message-ID: What would be the use case for a duck-array to implement __array__ and return a NumPy array? Unless I'm missing something, this seems redundant and one should just use array/asarray functions then. This would also prevent error-handling, what if the developer intentionally wants a NumPy-like array (e.g., the original array passed to the duckarray function) or an exception (instead of coercing to a NumPy array)? On Mon, Sep 16, 2019 at 9:25 PM Chris Barker wrote: > > > > On Mon, Aug 12, 2019 at 4:02 AM Peter Andreas Entschev wrote: >> >> Apologies for the late reply. I've opened a new PR >> https://github.com/numpy/numpy/pull/14257 with the changes requested > > > thanks! > > I've written a small PR on your PR: > > https://github.com/pentschev/numpy/pull/1 > > Essentially, other than typos and copy editing, I'm suggesting that a duck-array could choose to implement __array__ if it so chooses -- it should, of course, return an actual numpy array. > > I think this could be useful, as much code does require an actual numpy array, and only that class itself knows how best to convert to one. > > -CHB > > -- > > Christopher Barker, Ph.D. 
> Oceanographer
>
> Emergency Response Division
> NOAA/NOS/OR&R (206) 526-6959 voice
> 7600 Sand Point Way NE (206) 526-6329 fax
> Seattle, WA 98115 (206) 526-6317 main reception
>
> Chris.Barker at noaa.gov
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at python.org
> https://mail.python.org/mailman/listinfo/numpy-discussion
From shoyer at gmail.com  Mon Sep 16 17:25:53 2019
From: shoyer at gmail.com (Stephan Hoyer)
Date: Mon, 16 Sep 2019 14:25:53 -0700
Subject: [Numpy-discussion] NEP 30 - Duck Typing for NumPy Arrays -
 Implementation
In-Reply-To: 
References: 
Message-ID: 

On Mon, Sep 16, 2019 at 1:45 PM Peter Andreas Entschev wrote:

> What would be the use case for a duck-array to implement __array__ and
> return a NumPy array? Unless I'm missing something, this seems
> redundant and one should just use array/asarray functions then. This
> would also prevent error-handling, what if the developer intentionally
> wants a NumPy-like array (e.g., the original array passed to the
> duckarray function) or an exception (instead of coercing to a NumPy
> array)?
>

Dask arrays are a good example. They will want to implement
__duck_array__ (or whatever we call it) because they support duck typed
versions of NumPy operations. They also (already) implement __array__, so
they can be converted into NumPy arrays as a fallback. This is convenient
for moderately sized dask arrays, e.g., so you can pass one into a
matplotlib function.

>
> On Mon, Sep 16, 2019 at 9:25 PM Chris Barker wrote:
> >
> >
> >
> > On Mon, Aug 12, 2019 at 4:02 AM Peter Andreas Entschev <
peter at entschev.com> wrote:
> >>
> >> Apologies for the late reply. I've opened a new PR
> >> https://github.com/numpy/numpy/pull/14257 with the changes requested
> >
> >
> > thanks!
> >
> > I've written a small PR on your PR:
> >
> > https://github.com/pentschev/numpy/pull/1
> >
> > Essentially, other than typos and copy editing, I'm suggesting that a
duck-array could choose to implement __array__ if it so chooses -- it
should, of course, return an actual numpy array.
> >
> > I think this could be useful, as much code does require an actual numpy
array, and only that class itself knows how best to convert to one.
> >
> > -CHB
> >
> > --
> >
> > Christopher Barker, Ph.D.
> > Oceanographer
> >
> > Emergency Response Division
> > NOAA/NOS/OR&R (206) 526-6959 voice
> > 7600 Sand Point Way NE (206) 526-6329 fax
> > Seattle, WA 98115 (206) 526-6317 main reception
> >
> > Chris.Barker at noaa.gov
> > _______________________________________________
> > NumPy-Discussion mailing list
> > NumPy-Discussion at python.org
> > https://mail.python.org/mailman/listinfo/numpy-discussion
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at python.org
> https://mail.python.org/mailman/listinfo/numpy-discussion
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
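(A minimal sketch of that fallback, using a hypothetical container class:
any object defining __array__ can be coerced by np.asarray, which is what
array-consuming code such as matplotlib ultimately relies on:)

>>> import numpy as np
>>> class MyDuck:
...     def __init__(self, data):
...         self._data = list(data)  # stand-in for some non-NumPy payload
...     def __array__(self):
...         # fallback coercion: hand back a real ndarray
...         return np.asarray(self._data)
...
>>> np.asarray(MyDuck([1, 2, 3]))
array([1, 2, 3])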
From jh at physics.ucf.edu  Mon Sep 16 17:32:33 2019
From: jh at physics.ucf.edu (Joe Harrington)
Date: Mon, 16 Sep 2019 23:32:33 +0200
Subject: [Numpy-discussion] How to Capitalize numpy?
In-Reply-To: 
References: 
Message-ID: <8ce61707-b8be-253e-02d2-ff8ebeee2971@physics.ucf.edu>

Here are my thoughts on textual capitalization (at first, I thought you
wanted to raise money!):

We all agree that in code, it is "numpy". If you don't use that, it
throws an error.

If, in text, we keep "numpy" with a forced lower-case letter at the
start, it is just one more oddball to remember. It is even weirder in
titles and the beginnings of sentences. I'd strongly like not to be
weird that way. A few packages are, it's annoying, and it doesn't much
earn them any goodwill. The default among people who are not "in the
know" will be to do what they're used to. Let's give them what they're
used to, a proper noun with initial (at least) capital.

Likewise, I object to preferring a particular font. What fonts to use
for the names of things like software packages is a decision for
publications to make. A journal or manual might make fine distinctions
and demand several different, specific fonts, while a popular
publication might prefer not to do that. Leave the typesetting to the
editors of the publications. We can certainly adopt a standard for our
publications (docs, web pages, etc.), but we should state explicitly
that others can do as they like.

It's not an acronym, so that leaves the options of "Numpy" and "NumPy".
It would be great, easy to remember, consistent for others, etc., if
NumPy and SciPy were capitalized the same way and were pronounced the
same (I still occasionally hear "numpee"). So, I would favor "NumPy" to
go along with "SciPy", and let the context choose the font.

--jh--

On 9/16/19 9:09 PM, Chris Barker wrote:

Trivial note:

On the subject of naming things (spelling things??) -- should it be:

numpy
or
Numpy
or
NumPy
?

All three are in the draft NEP 30 ( mostly "NumPy", I noticed this when
reading/copy editing the NEP) . Is there an "official" capitalization?

My preference, would be to use "numpy", and where practicable, use a
"computer" font -- i.e. ``numpy`` in RST.

But if there is consensus already for anything else, that's fine, I'd
just like to know what it is.

-CHB

On Mon, Aug 12, 2019 at 4:02 AM Peter Andreas Entschev wrote:

Apologies for the late reply. I've opened a new PR
https://github.com/numpy/numpy/pull/14257 with the changes requested
on clarifying the text. After reading the detailed description, I've
decided to add a subsection "Scope" to clarify the scope where NEP-30
would be useful. I think the inclusion of this new subsection
complements the "Detail description" forming a complete text w.r.t.
motivation of the NEP, but feel free to point out disagreements with
my suggestion. I've also added a new section "Usage" pointing out how
one would use duck array in replacement to np.asarray where relevant.

Regarding the naming discussion, I must say I like the idea of keeping
the __array_ prefix, but it seems like that is going to be difficult
given that none of the existing ideas so far play very nicely with
that. So if the general consensus is to go with __numpy_like__, I
would also update the NEP to reflect that changes. FWIW, I
particularly neither like nor dislike __numpy_like__, but I don't have
any better suggestions than that or keeping the current naming.

Best,
Peter

On Thu, Aug 8, 2019 at 3:40 AM Stephan Hoyer wrote:
>
>
>
> On Wed, Aug 7, 2019 at 6:18 PM Charles R Harris wrote:
>>
>>
>>
>> On Wed, Aug 7, 2019 at 7:10 PM Stephan Hoyer wrote:
>>>
>>> On Wed, Aug 7, 2019 at 5:11 PM Ralf Gommers wrote:
>>>>
>>>>
>>>> On Mon, Aug 5, 2019 at 6:18 PM Stephan Hoyer wrote:
>>>>>
>>>>> On Mon, Aug 5, 2019 at 2:48 PM Ralf Gommers wrote:
>>>>>
>>>>>>
>>>>>> The NEP currently does not say who this is meant for. Would you expect libraries like SciPy to adopt it for example?
>>>>>> >>>>>> The NEP also (understandably) punts on the question of when something is a valid duck array. If you want this to be widely used, that will need an answer or at least some rough guidance though. For example, we would expect a duck array to have a mean() method, but probably not a ptp() method. A library author who wants to use np.duckarray() needs to know, because she can't test with all existing and future duck array implementations. >>>>> >>>>> >>>>> I think this is covered in NEP-22 already. >>>> >>>> >>>> It's not really. We discussed this briefly in the community call today, Peter said he will try to add some text. >>>> >>>> We should not add new functions to NumPy without indicating who is supposed to use this, and what need it fills / problem it solves. It seems pretty clear to me that it's mostly aimed at library authors rather than end users. And also that mature libraries like SciPy may not immediately adopt it, because it's too fuzzy - so it's new libraries first, mature libraries after the dust has settled a bit (I think). >>> >>> >>> I totally agree -- we definitely should clarify this in the docstring and elsewhere in the docs. An example in the new doc page on "Writing custom array containers" (https://numpy.org/devdocs/user/basics.dispatch.html) would also probably be appropriate. >>> >>>>> >>>>> As discussed there, I don't think NumPy is in a good position to pronounce decisive APIs at this time. I would welcome efforts to try, but I don't think that's essential for now. >>>> >>>> >>>> There's no need to pronounce a decisive API that fully covers duck array. Note that RNumPy is an attempt in that direction (not a full one, but way better than nothing). In the NEP/docs, at least saying something along the lines of "if you implement this, we recommend the following strategy: check if a function is present in Dask, CuPy and Sparse. If so, it's reasonable to expect any duck array to work here. If not, we suggest you indicate in your docstring what kinds of duck arrays are accepted, or what properties they need to have". That's a spec by implementation, which is less than ideal but better than saying nothing. >>> >>> >>> OK, I agree here as well -- some guidance is better than nothing. >>> >>> Two other minor notes on this NEP, concerning naming: >>> 1. We should have a brief note on why we settled on the name "duck array". Namely, as discussed in NEP-22, we don't love the "duck" jargon, but we couldn't come up with anything better since NumPy already uses "array like" and "any array" for different purposes. >>> 2. The protocol should use *something* more clearly namespaced as NumPy specific than __duckarray__. All the other special protocols NumPy defines start with "__array_". That suggests either __array_duckarray__ (sounds a little redundant) or __numpy_duckarray__ (which I like the look of, but is a different from the existing protocols). >>> >> >> `__numpy_like__` ? > > > > This could work, but I think we would also want to rename the NumPy function itself to either np.like or np.numpy_like. The later is a little redundant but definitely more self-descriptive than "duck array". 
>> Chuck

--
Christopher Barker, Ph.D.
Oceanographer

Emergency Response Division
NOAA/NOS/OR&R            (206) 526-6959   voice
7600 Sand Point Way NE   (206) 526-6329   fax
Seattle, WA  98115       (206) 526-6317   main reception

Chris.Barker at noaa.gov

From ralf.gommers at gmail.com Mon Sep 16 17:40:10 2019
From: ralf.gommers at gmail.com (Ralf Gommers)
Date: Mon, 16 Sep 2019 14:40:10 -0700
Subject: [Numpy-discussion] How to Capitalize numpy?
In-Reply-To:
References:
Message-ID:

On Mon, Sep 16, 2019 at 1:42 PM Peter Andreas Entschev wrote:

> My answer to that: "NumPy". Reference: logo at the top of
> https://numpy.org/neps/index.html .

Yes, NumPy is the right capitalization.

> In NEP-30 [1], I've used "NumPy" everywhere, except for references to
> code, repos, etc., where "numpy" is used. I see there's one occurrence
> of "Numpy", which was definitely a typo and I had not noticed it until
> now, but I will address this in a future update, thanks for pointing
> that out.
>
> [1] https://numpy.org/neps/nep-0030-duck-array-protocol.html

From chris.barker at noaa.gov Mon Sep 16 17:44:46 2019
From: chris.barker at noaa.gov (Chris Barker)
Date: Mon, 16 Sep 2019 14:44:46 -0700
Subject: [Numpy-discussion] How to Capitalize numpy?
In-Reply-To:
References:
Message-ID:

got it, thanks. I've fixed that typo in a PR I'm working on, too.

-CHB

On Mon, Sep 16, 2019 at 2:41 PM Ralf Gommers wrote:

> On Mon, Sep 16, 2019 at 1:42 PM Peter Andreas Entschev wrote:
>> My answer to that: "NumPy". Reference: logo at the top of
>> https://numpy.org/neps/index.html .
>
> Yes, NumPy is the right capitalization.

--
Christopher Barker, Ph.D.
Oceanographer

Emergency Response Division
NOAA/NOS/OR&R            (206) 526-6959   voice
7600 Sand Point Way NE   (206) 526-6329   fax
Seattle, WA  98115       (206) 526-6317   main reception

Chris.Barker at noaa.gov

From chris.barker at noaa.gov Mon Sep 16 17:46:44 2019
From: chris.barker at noaa.gov (Chris Barker)
Date: Mon, 16 Sep 2019 14:46:44 -0700
Subject: [Numpy-discussion] How to Capitalize numpy?
In-Reply-To: <8ce61707-b8be-253e-02d2-ff8ebeee2971@physics.ucf.edu>
References: <8ce61707-b8be-253e-02d2-ff8ebeee2971@physics.ucf.edu>
Message-ID:

Thanks Joe, looks like everyone agrees: In text, NumPy it is.

-CHB

On Mon, Sep 16, 2019 at 2:41 PM Joe Harrington wrote:

> Here are my thoughts on textual capitalization (at first, I thought you
> wanted to raise money!):
>
> We all agree that in code, it is "numpy". If you don't use that, it
> throws an error.
>
> It's not an acronym, so that leaves the options of "Numpy" and "NumPy".
> It would be great, easy to remember, consistent for others, etc., if NumPy
> and SciPy were capitalized the same way and were pronounced the same (I
> still occasionally hear "numpee"). So, I would favor "NumPy" to go along
> with "SciPy", and let the context choose the font.
>
> --jh--

--
Christopher Barker, Ph.D.
Oceanographer

Emergency Response Division
NOAA/NOS/OR&R            (206) 526-6959   voice
7600 Sand Point Way NE   (206) 526-6329   fax
Seattle, WA  98115       (206) 526-6317   main reception

Chris.Barker at noaa.gov

From chris.barker at noaa.gov Mon Sep 16 17:55:20 2019
From: chris.barker at noaa.gov (Chris Barker)
Date: Mon, 16 Sep 2019 14:55:20 -0700
Subject: [Numpy-discussion] NEP 30 - Duck Typing for NumPy Arrays - Implementation
In-Reply-To:
References:
Message-ID:

On Mon, Sep 16, 2019 at 1:46 PM Peter Andreas Entschev wrote:

> What would be the use case for a duck-array to implement __array__ and
> return a NumPy array?

Some users need a genuine, actual numpy array (for passing to Cython code, for example). If __array__ is not implemented, how can they get that from an array-like object? Only the author of the array-like object knows how best to make a numpy array out of it.

> Unless I'm missing something, this seems
> redundant and one should just use array/asarray functions then.

But if the object does not implement __array__, then users can't use the array/asarray functions!
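To make that concrete, here is a minimal sketch (the Wrapper and NoArray classes are hypothetical, made up for illustration):

```python
import numpy as np

class Wrapper:
    """Hypothetical array-like holding a plain list."""
    def __init__(self, data):
        self._data = data

    def __array__(self, dtype=None):
        # Only the author knows how best to turn this into an ndarray.
        return np.asarray(self._data, dtype=dtype)

class NoArray:
    """Hypothetical object without __array__."""

print(np.asarray(Wrapper([1.0, 2.0])))  # [1. 2.] -- built via __array__
print(np.asarray(NoArray()).dtype)      # object -- a useless 0-d object array
```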
> This
> would also prevent error-handling, what if the developer intentionally
> wants a NumPy-like array (e.g., the original array passed to the
> duckarray function) or an exception (instead of coercing to a NumPy
> array)?

I'm really confused now -- if an end-user wants a duck array, they should call duckarray() -- if they want an actual numpy array, they should call np.asarray(). Why would anyone want an Exception? If you don't want an array, then don't call asarray().

If you call duckarray(), and the object has not implemented __duckarray__, then you will get an exception -- which you should. If you call __array__, and __array__ has not been implemented, then you will get an exception.

What is the potential problem here?

Which makes me think -- why should duck arrays ever implement an __array__ method that raises an Exception? Why not just not implement it? (unless you want to add some helpful error message -- which I did for the example in my PR (PR to the numpy repo in progress))

-CHB

--
Christopher Barker, Ph.D.
Oceanographer

Emergency Response Division
NOAA/NOS/OR&R            (206) 526-6959   voice
7600 Sand Point Way NE   (206) 526-6329   fax
Seattle, WA  98115       (206) 526-6317   main reception

Chris.Barker at noaa.gov

From chris.barker at noaa.gov Mon Sep 16 18:11:34 2019
From: chris.barker at noaa.gov (Chris Barker)
Date: Mon, 16 Sep 2019 15:11:34 -0700
Subject: [Numpy-discussion] NEP 30 - Duck Typing for NumPy Arrays - Implementation
In-Reply-To:
References:
Message-ID:

On Mon, Sep 16, 2019 at 2:27 PM Stephan Hoyer wrote:

> On Mon, Sep 16, 2019 at 1:45 PM Peter Andreas Entschev wrote:
>
>> What would be the use case for a duck-array to implement __array__ and
>> return a NumPy array?
>
> Dask arrays are a good example. They will want to implement __duck_array__ (or whatever we call it) because they support duck typed versions of NumPy operations. They also (already) implement __array__, so they can be converted into NumPy arrays as a fallback. This is convenient for moderately sized dask arrays, e.g., so you can pass one into a matplotlib function.

Exactly.

And I have implemented __array__ in classes that are NOT duck arrays at all (an image class, for instance). But I also can see wanting to support both:

use me as a duck array
and
convert me into a proper numpy array.

OK -- looking again at the NEP, I see this suggested implementation:

    def duckarray(array_like):
        if hasattr(array_like, '__duckarray__'):
            return array_like.__duckarray__()
        return np.asarray(array_like)

So I see the point now: if a user wants a duck array, they may not want to accidentally coerce this object to a real array (potentially expensive).

But in this case, asarray() will only get called (and thus __array__ will only get called) if __duckarray__ is not implemented. So the only reason to implement __array__ and raise an Exception is so that users will get that exception if they specifically call asarray() -- why should they get that?
I'm working on a PR with a suggestion for this.

-CHB

--
Christopher Barker, Ph.D.
Oceanographer

Emergency Response Division
NOAA/NOS/OR&R            (206) 526-6959   voice
7600 Sand Point Way NE   (206) 526-6329   fax
Seattle, WA  98115       (206) 526-6317   main reception

Chris.Barker at noaa.gov

From chris.barker at noaa.gov Mon Sep 16 18:23:11 2019
From: chris.barker at noaa.gov (Chris Barker)
Date: Mon, 16 Sep 2019 15:23:11 -0700
Subject: [Numpy-discussion] NEP 30 - Duck Typing for NumPy Arrays - Implementation
In-Reply-To:
References:
Message-ID:

OK -- I *finally* got it:

when you pass an arbitrary object into np.asarray(), it will create an array object scalar with the object in it.

So yes, I can see that you may want to raise a TypeError instead, so that users don't get an object array scalar when they were expecting to get an array-like object.

So it's probably a good idea to recommend that when a class implements __duckarray__ that it also implements __array__, which can either raise an exception or return an ndarray.

-CHB

On Mon, Sep 16, 2019 at 3:11 PM Chris Barker wrote:

> OK -- looking again at the NEP, I see this suggested implementation:
>
>     def duckarray(array_like):
>         if hasattr(array_like, '__duckarray__'):
>             return array_like.__duckarray__()
>         return np.asarray(array_like)
>
> I'm working on a PR with a suggestion for this.

--
Christopher Barker, Ph.D.
Oceanographer

Emergency Response Division
NOAA/NOS/OR&R            (206) 526-6959   voice
7600 Sand Point Way NE   (206) 526-6329   fax
Seattle, WA  98115       (206) 526-6317   main reception

Chris.Barker at noaa.gov

From chris.barker at noaa.gov Mon Sep 16 18:38:53 2019
From: chris.barker at noaa.gov (Chris Barker)
Date: Mon, 16 Sep 2019 15:38:53 -0700
Subject: [Numpy-discussion] NEP 30 - Duck Typing for NumPy Arrays - Implementation
In-Reply-To:
References:
Message-ID:

Here's a PR with a different discussion of __array__:

https://github.com/numpy/numpy/pull/14529

-CHB

On Mon, Sep 16, 2019 at 3:23 PM Chris Barker wrote:

> OK -- I *finally* got it:
>
> when you pass an arbitrary object into np.asarray(), it will create an
> array object scalar with the object in it.
> So yes, I can see that you may want to raise a TypeError instead, so that
> users don't get an object array scalar when they were expecting to get an
> array-like object.
>
> So it's probably a good idea to recommend that when a class implements
> __duckarray__ that it also implements __array__, which can either raise an
> exception or return an ndarray.
>
> -CHB

--
Christopher Barker, Ph.D.
Oceanographer

Emergency Response Division
NOAA/NOS/OR&R            (206) 526-6959   voice
7600 Sand Point Way NE   (206) 526-6329   fax
Seattle, WA  98115       (206) 526-6317   main reception

Chris.Barker at noaa.gov

From pankaj.jangid at gmail.com Mon Sep 16 20:29:30 2019
From: pankaj.jangid at gmail.com (Pankaj Jangid)
Date: Tue, 17 Sep 2019 05:59:30 +0530
Subject: [Numpy-discussion] How to Capitalize numpy?
In-Reply-To: <8ce61707-b8be-253e-02d2-ff8ebeee2971@physics.ucf.edu> (Joe Harrington's message of "Mon, 16 Sep 2019 23:32:33 +0200")
References: <8ce61707-b8be-253e-02d2-ff8ebeee2971@physics.ucf.edu>
Message-ID:

Joe Harrington writes:

> It's not an acronym, so that leaves the options of "Numpy" and
> "NumPy".
> It would be great, easy to remember, consistent for others,
> etc., if NumPy and SciPy were capitalized the same way and were
> pronounced the same (I still occasionally hear "numpee"). So, I would
> favor "NumPy" to go along with "SciPy", and let the context choose the
> font.

"NumPy" is perfect capitalization. It looks beautiful in pure text.

For programming, "numpy" is good. Most of the time I import it as "np".

--
Regards,
Pankaj Jangid

From poh.zijie at gmail.com Tue Sep 17 00:14:53 2019
From: poh.zijie at gmail.com (Zijie Poh)
Date: Mon, 16 Sep 2019 21:14:53 -0700
Subject: [Numpy-discussion] keeping all meeting notes and docs in new repo?
In-Reply-To:
References:
Message-ID:

Hi all,

I like the idea of having a new repo containing meeting minutes and docs.

Regards,
ZJ

On Sun, Sep 15, 2019 at 5:26 PM Ralf Gommers wrote:

> Hi all,
>
> We have had community calls for quite a while, the minutes of which are
> kept in https://github.com/BIDS-numpy/docs. That's quite hard to
> discover, it would be better if those lived under the NumPy GitHub org.
> Also, we have minutes from Season of Docs and website redesign calls, plus
> occasionally some other docs (e.g. the roadmap drafts were on hackmd.io).
>
> Would it make sense to add a new repo to contain all such meeting minutes
> and docs? Presentations and proposals may make sense to add as well -
> several people have given presentations or submitted proposals on behalf of
> the project.
>
> Inessa also suggested to enable HackMD Hub (see
> https://hackmd.io/c/tutorials/%2Fs%2Flink-with-github) so we get
> automatic versioning for some HackMD documents. I haven't used it before,
> but it looks good.
>
> Thoughts?
>
> Cheers,
> Ralf

From peter at entschev.com Tue Sep 17 09:54:21 2019
From: peter at entschev.com (Peter Andreas Entschev)
Date: Tue, 17 Sep 2019 15:54:21 +0200
Subject: [Numpy-discussion] NEP 30 - Duck Typing for NumPy Arrays - Implementation
In-Reply-To:
References:
Message-ID:

I see what you mean now. It was my misunderstanding, I thought you wanted to return a call to __array__ when you call np.duckarray. I agree with your point and understand how the current text may be misleading, so we shall make it clearer in the NEP (as done in https://github.com/numpy/numpy/pull/14529) that both are valid ways:

* Have a genuine implementation of __array__ (like Dask, as pointed out by Stephan); or
* Raise an exception (as CuPy does).

Thanks for opening the PR, I will comment there as well.

From sebastian at sipsolutions.net Tue Sep 17 15:04:51 2019
From: sebastian at sipsolutions.net (Sebastian Berg)
Date: Tue, 17 Sep 2019 12:04:51 -0700
Subject: [Numpy-discussion] NumPy Community Meeting Wednesday, Sep. 18
Message-ID: <6fe2a07608d82ed48434668fc54677ef84d05bba.camel@sipsolutions.net>

Hi all,

There will be a NumPy Community meeting Wednesday September 18 at 11 am Pacific Time. Everyone is invited to join in and edit the work-in-progress meeting topics and notes:

https://hackmd.io/76o-IxCjQX2mOXO_wwkcpg?both

Best wishes

Sebastian
Name: signature.asc Type: application/pgp-signature Size: 833 bytes Desc: This is a digitally signed message part URL: From chris.barker at noaa.gov Tue Sep 17 18:04:01 2019 From: chris.barker at noaa.gov (Chris Barker) Date: Tue, 17 Sep 2019 15:04:01 -0700 Subject: [Numpy-discussion] NEP 30 - Duck Typing for NumPy Arrays - Implementation In-Reply-To: References: Message-ID: On Tue, Sep 17, 2019 at 6:56 AM Peter Andreas Entschev wrote: > I agree with your point and understand how the current text may be > misleading, so we shall make it clearer in the NEP (as done in > https://github.com/numpy/numpy/pull/14529) that both are valid ways: > > * Have a genuine implementation of __array__ (like Dask, as pointed > out by Stephan); or > * Raise an exception (as CuPy does). > great -- sounds like we're all (well three of us anyway) are on teh same page. Just need to sort out the text. -CHB > > Thanks for opening the PR, I will comment there as well. > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov -------------- next part -------------- An HTML attachment was scrubbed... URL: From stefanv at berkeley.edu Tue Sep 17 18:11:37 2019 From: stefanv at berkeley.edu (Stefan van der Walt) Date: Tue, 17 Sep 2019 15:11:37 -0700 Subject: [Numpy-discussion] keeping all meeting notes and docs in new repo? In-Reply-To: References: Message-ID: <16d4147f1a8.27ae.acf34a9c767d7bb498a799333be0433e@fastmail.com> Yes, that would make sense. The notes are there because of historic reasons, the meetings having originated as BIDS updates to the community, but by now it is much more open and community driven (thanks all!) so +1. St?fan On September 16, 2019 21:16:18 Zijie Poh wrote: > Hi all, > > > I like the idea of having a new repo containing meeting minutes and docs. > > > Regards, > ZJ > > > On Sun, Sep 15, 2019 at 5:26 PM Ralf Gommers wrote: > > Hi all, > > > We have had community calls for quite a while, the minutes of which are > kept in https://github.com/BIDS-numpy/docs. That's quite hard to discover, > it would be better if those lived under the NumPy GitHub org. Also, we have > minutes from Season of Docs and website redesign calls, plus occasionally > some other docs (e.g. the roadmap drafts were on hackmd.io). > > > Would it make sense to add a new repo to contain all such meeting minutes > and docs? Presentations and proposals may make sense to add as well - > several people have given presentations or submitted proposals on behalf of > the project. > > > Inessa also suggested to enable HackMD Hub (see > https://hackmd.io/c/tutorials/%2Fs%2Flink-with-github) so we get automatic > versioning for some HackMD documents. I haven't used it before, but it > looks good. > > > > Thoughts? > > > > Cheers, > > Ralf > > > > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion -------------- next part -------------- An HTML attachment was scrubbed... 
From albuscode at gmail.com Wed Sep 18 12:51:45 2019
From: albuscode at gmail.com (Inessa Pawson)
Date: Wed, 18 Sep 2019 12:51:45 -0400
Subject: [Numpy-discussion] User Stories for https://numpy.org
Message-ID:

The NumPy web team has begun redesigning https://numpy.org, determined to transform the website into a welcoming and useful digital hub of all things NumPy. We are inviting all members of our large and diverse community to submit their user stories to help us fulfill our mission.

*What are we looking for?*
In simple, concise terms, a user story describes what a user needs to accomplish while visiting a website. Anyone who reads the user story must be able to understand why the user needs the functionality, and what is required to implement the story. User stories must have acceptance criteria. The shorter the story the better.

*Examples of good user stories*
1. Lotte is a library author who depends on NumPy. She is looking for information about major changes and a release date of the next version of NumPy. She would like to easily find it on the website instead of contacting the core team.
2. Yu Yan was introduced to NumPy in her first week of the Foundations of Data Science class. She is looking for a NumPy tutorial for absolute beginners in Mandarin.
3. Tiago is a software developer. By day, he builds enterprise applications for a Fortune 100 company. By night, he cultivates his academic interests in statistics and computer science using various Python libraries. Tiago has an idea for a new NumPy feature and would like to implement it. He is looking for information on how to contact the person(s) in charge of such decisions.

*Please note* that at this stage of the numpy.org redesign our focus is not on expanding or improving the documentation but, rather, developing high-level content to provide information about the project to a multitude of stakeholders.

--
Every good wish,
*Inessa Pawson*
NumPy Web Team

From sebastian at sipsolutions.net Wed Sep 18 19:34:49 2019
From: sebastian at sipsolutions.net (Sebastian Berg)
Date: Wed, 18 Sep 2019 16:34:49 -0700
Subject: [Numpy-discussion] DType Roadmap/NEP Discussion
Message-ID: <5f00290851aa9418215c4bd0fea3378bf94dcc79.camel@sipsolutions.net>

Hi all,

to try and make some progress towards a decision since the broad design is pretty much settling from my side, I am thinking about making a meeting, and suggest Monday at 11am Pacific Time (I am open to other times though).

My hope is to get everyone interested on board, so that we can make an informed decision about the general direction very soon. So just reach out, or discuss on the mailing list as well.

The current draft for an NEP is here:
https://hackmd.io/kxuh15QGSjueEKft5SaMug?both

There are some design goals that I would like to clear up. I would prefer to avoid deep discussions of some specific issues, since I think the important decision right now is whether my general start is in the right direction. It is not an easy topic, so my plan would be to try and briefly summarize that, then hopefully clarify any questions, and then we can discuss why alternatives are rejected. The most important thing is maybe gathering concerns which need to be clarified before we can go towards accepting the general design ideas.

The main point of the NEP draft is actually captured by the picture in the linked document: DTypes are classes (such as Float64) and what is attached to the array is an instance of that class, "float64". Additionally, we would have AbstractDType classes which cannot be instantiated but define a type hierarchy.

To list the main points:

* DTypes are classes (corresponding to the current type number)
* `arr.dtype` is an instance of its class, allowing it to store additional information such as a physical unit or the string length.
* Most things are defined in special dtype slots similar to Python's type and number slots. They will be hidden and can be set through an init function similar to `PyType_FromSpec` [1].
* Promotion is defined primarily on the DType classes
* Casting from one DType to another DType is defined by a new CastingImpl object (should become a special ufunc) - e.g. for strings, the CastingImpl is in charge of finding the correct string length
* The AbstractDType hierarchy will be used to decide the signature when calling UFuncs.
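To make the class-vs-instance points above concrete, a rough Python sketch (all names here are illustrative only, nothing in it is proposed API):

```python
class DType:
    """Stand-in for the new dtype base: what is a type number today."""

class Float64(DType):
    itemsize = 8

class String(DType):
    def __init__(self, length):
        # Parametric instance state, like today's "S5":
        self.length = length

f64 = Float64()   # arr.dtype would be an *instance* of its DType class
s5 = String(5)    # the class is just String; the length lives on the instance

assert isinstance(s5, String) and issubclass(String, DType)
```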
The main iffier points I can think of are:

* NumPy currently uses value based promotion in some cases, which requires special AbstractDTypes to describe (and some legacy paths). (They are used more like instances than typical classes)
* Casting between flexible dtypes (such as strings) is a multi-step process to figure out the actual output dtype.
  - An example is: `np.can_cast("float64", "S3")` first finding that `Float64->String` is possible in principle and then asking the CastingImpl to find that `float64->S3` is not.
* We have to break ABI compatibility in a very minor, back-portable way. More smaller incompatibilities are likely [2].
* Since it is a major redesign, a lot of code has to be added/touched, although it is possible to channel much of it back into the old machinery.
* A largish amount of new API around new DType type objects and also DTypeMeta type objects, which users can (although usually do not have to) subclass.

However, most other designs will have similar issues. Basically, I currently really think this is "right", even if some details may end up tricky.

Best,

Sebastian

PS: The one thing outside the more general list above that I may want to discuss is how acceptable a global dict/mapping for dtype discovery during `np.array` coercion is (mapping python type -> dtype)...

[1] https://docs.python.org/3/c-api/type.html#c.PyType_FromSpec

[2] One possible issue may be "S0" which is normally used to denote what in the new API would be the `String` DType class.
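For reference, the two-step casting example above corresponds to behavior today's NumPy already exhibits; a small demo, assuming the current default 'safe' casting rules:

```python
import numpy as np

# A float64 -> string cast is possible in principle, but only for string
# instances long enough to hold the value (a float64 repr needs 32 chars):
print(np.can_cast(np.float64, "S3"))   # False: "S3" is too short
print(np.can_cast(np.float64, "S32"))  # True: long enough, so the cast is allowed
```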
From ralf.gommers at gmail.com Thu Sep 19 00:33:56 2019
From: ralf.gommers at gmail.com (Ralf Gommers)
Date: Wed, 18 Sep 2019 21:33:56 -0700
Subject: [Numpy-discussion] DType Roadmap/NEP Discussion
In-Reply-To: <5f00290851aa9418215c4bd0fea3378bf94dcc79.camel@sipsolutions.net>
References: <5f00290851aa9418215c4bd0fea3378bf94dcc79.camel@sipsolutions.net>
Message-ID:

Hi Sebastian,

On Wed, Sep 18, 2019 at 4:35 PM Sebastian Berg wrote:

> Hi all,
>
> to try and make some progress towards a decision since the broad design
> is pretty much settling from my side, I am thinking about making a
> meeting, and suggest Monday at 11am Pacific Time (I am open to other
> times though).
>
> The current draft for an NEP is here:
> https://hackmd.io/kxuh15QGSjueEKft5SaMug?both
>
> There are some design goals that I would like to clear up.

The design itself seems very sensible to me insofar as I understand it. After having read your document again, I think you're still missing the actual goals though. "Structure of class layout" and "type hierarchy" are important, but they're not the goals. You're touching on the real goals in places, but it may be valuable to be much more explicit there. Here are some example goals:

1. Make creating new dtypes via the NumPy C API take >4x fewer lines of code on average (in practice: for rational/quaternion, hard to measure otherwise).
2. Make it possible to create new dtypes with full functionality via the NumPy Python API. Performance may be up to 1-2 orders of magnitude worse than when creating the same dtype via the C API; the main purpose is to allow easier prototyping of new dtypes.
3. Make the NumPy codebase more maintainable by removing special-casing of datetime dtypes in many places.
4. Enable creation of a units library whose arrays *are* numpy arrays rather than a subclass or duck array. This will make such a library work much better with SciPy and other existing libraries that use np.asarray extensively.
5. Hide currently exposed implementation details in the C API so long-term .... (you have this one, but it would be nice to work it out a little more - after all we recently considered reverting the deprecation for direct field access, so how important is this?)
6. Improve casting behavior for external dtypes
7. Make np.char behavior better (you mention fixed length strings work poorly now, but not what would change)

Listing non-goals would also be useful:

1. Performance: no significant performance improvements are expected. We aim for no performance regressions.
2. Introducing new dtypes into NumPy itself
3. Pandas ExtensionArrays? You mention them, but does this dtype redesign help Pandas in any way or not?
4. Changes to NumPy's current casting rules
5. Allow creation of dtypes that don't fit the current NumPy model of what a dtype is (e.g. ref [1]), such as a variable-length string dtype.

Many of those (and there can be more, this is just what came to mind now) can/should be a paragraph or section. In my experience describing these goals and requirements well takes about 15-30% of the length of the design description. Think of, for example, a Pandas or units library maintainer reading this: they should be able to stop reading at where you now have "Overview Graphic" and have a pretty clear high-level understanding of what this whole redesign will mean for them. Same for a NumPy maintainer who wants to get a sense of what the benefits and impacts will be: reading only (the expanded version of) your Abstract, Motivation and Scope, and Backwards Compatibility sections should be enough.

Here's a concrete question, that's the type of thing I'd like to understand without having to understand the whole design in detail:

```
>>> import datetime
>>> import numpy as np
>>> import pandas as pd
>>> dti = pd.to_datetime(['1/1/2018', np.datetime64('2018-01-01'),
...                       datetime.datetime(2018, 1, 1)])
>>> dti.values
array(['2018-01-01T00:00:00.000000000', '2018-01-01T00:00:00.000000000',
       '2018-01-01T00:00:00.000000000'], dtype='datetime64[ns]')
>>> dti.values.dtype
dtype('<M8[ns]')
>>> isinstance(dti.values.dtype, np.dtype)
True
>>> dti.dtype == dti.values.dtype  # okay, that's nice
True
>>> start = pd.to_datetime('2015-02-24')
>>> rng = pd.date_range(start, periods=3)
>>> t = pd.Series(rng)
>>> t_withzone = t.dt.tz_localize('UTC').dt.tz_convert('Asia/Kolkata')
>>> t_withzone
0   2015-02-24 05:30:00+05:30
1   2015-02-25 05:30:00+05:30
2   2015-02-26 05:30:00+05:30
dtype: datetime64[ns, Asia/Kolkata]
>>> t_withzone.dtype
datetime64[ns, Asia/Kolkata]
>>> t_withzone.values.dtype
dtype('<M8[ns]')
>>> t_withzone.dtype == t_withzone.values.dtype  # could this be True in the future?
False
```

So can Pandas create timezone-aware numpy dtypes in the future if they want to, or would they still be better off rolling their own?

Also one question/comment about the design content. When looking at the current external dtypes (e.g. [2]), a large part of the work of implementing a new dtype now deals with ufunc behavior. It's not clear from your document how that changes with the new design, can you add something about that?

Cheers,
Ralf

[1] http://scipy-lectures.org/advanced/advanced_numpy/index.html#the-descriptor
[2] https://github.com/moble/quaternion/blob/master/numpy_quaternion.c
From ralf.gommers at gmail.com Thu Sep 19 05:22:36 2019
From: ralf.gommers at gmail.com (Ralf Gommers)
Date: Thu, 19 Sep 2019 11:22:36 +0200
Subject: [Numpy-discussion] Low-level API for Random
In-Reply-To:
References:
Message-ID:

On Thu, Sep 19, 2019 at 10:28 AM Kevin Sheppard wrote:

> There are some users of the NumPy C code in randomkit. This was never
> officially supported. There has been a long-open issue to provide this
> officially.
>
> When I wrote randomgen I supplied .pxd files that make it simpler to write
> Cython code that uses the components. The lower-level API has not had much
> scrutiny and is in need of a clean-up. I thought this would also
> encourage users to extend the random machinery themselves as part of their
> project or code so as to minimize the requests for new (exotic)
> distributions to be included in Generator.
>
> Most of the generator functions follow the pattern random_DISTRIBUTION.
> Some have a bit more name mangling which can easily be cleaned up (like
> random_gauss_zig, which should become PREFIX_standard_normal).
>
> Ralf Gommers suggested unprefixed names.

I suggested that the names should match the Python API, which I think
isn't quite the same. The Python API doesn't contain things like "gamma",
"t" or "f".

> I tried this in a local branch and it was a bit ugly since some of the
> distributions have common math names (e.g., gamma) and others are very
> short (e.g., t or f). I think a prefix is needed, and after looking
> through the C API docs npy_random_ seemed like a reasonable choice (since
> these live in numpy.random).
>
> Any thoughts on the following questions are welcome (others too):
>
> 1. Should there be a prefix on the C functions?
> 2. If so, what should the prefix be?

Before worrying about naming details, can we start with "what should be
in the C/Cython API"? If I look through the current pxd files, there's a
lot there that looks like it should be private, and what we expose as
Python API is not all present as far as I can tell (which may be fine, if
the only goal is to let people write new generators rather than use the
existing ones from Cython without the Python overhead).

In the end we want to get to a doc section similar to
http://scipy.github.io/devdocs/special.cython_special.html I'd think.

> 3. Should the legacy C functions be part of the API -- these are mostly the
> ones that produce or depend on polar transform normals (Box-Muller). I have
> a feeling no, but there may be reasons to prefer BM since they do not
> depend on rejection sampling.

Even if there would be a couple of users interested, it would be odd
starting to do this after deeming the code "legacy". So I agree with your
"no".

> 4. Should the low-level API be consumable like any other NumPy C API, by
> including the usual header locations and library locations? Right now, the
> pxd simplifies writing Cython, but users have to specify the location of
> the headers and source manually. An alternative would be to provide a
> function like np.get_include() -> np.random.get_include() that would
> specialize in random.

Good question. I'm not sure this is "like any other NumPy C API".
We don't provide a C API for fft, linalg or other functionality further
from core either. It's possible of course, but does it really help
library authors or end users?

Cheers,
Ralf

From kevin.k.sheppard at gmail.com Thu Sep 19 06:40:38 2019
From: kevin.k.sheppard at gmail.com (Kevin Sheppard)
Date: Thu, 19 Sep 2019 11:40:38 +0100
Subject: [Numpy-discussion] Low-level API for Random
In-Reply-To:
References:
Message-ID:

On Thu, Sep 19, 2019 at 10:23 AM Ralf Gommers wrote:

> On Thu, Sep 19, 2019 at 10:28 AM Kevin Sheppard <
> kevin.k.sheppard at gmail.com> wrote:
>
>> There are some users of the NumPy C code in randomkit. This was never
>> officially supported. There has been a long-open issue to provide this
>> officially.
>>
>> When I wrote randomgen I supplied .pxd files that make it simpler to
>> write Cython code that uses the components. The lower-level API has not
>> had much scrutiny and is in need of a clean-up. I thought this would also
>> encourage users to extend the random machinery themselves as part of their
>> project or code so as to minimize the requests for new (exotic)
>> distributions to be included in Generator.
>>
>> Most of the generator functions follow the pattern random_DISTRIBUTION.
>> Some have a bit more name mangling which can easily be cleaned up (like
>> random_gauss_zig, which should become PREFIX_standard_normal).
>>
>> Ralf Gommers suggested unprefixed names.
>
> I suggested that the names should match the Python API, which I think
> isn't quite the same. The Python API doesn't contain things like "gamma",
> "t" or "f".

By gamma and f (I misspoke about t) I mean the names that appear as
Generator methods:

https://docs.scipy.org/doc/numpy/reference/random/generator.html#numpy.random.Generator

If I understand your point (and with reference to the page linked below),
then there would be something like numpy.random.cython_random.gamma
(which is currently called numpy.random.distributions.random_gamma).
Maybe I'm not understanding your point about the Python API though.

>> I tried this in a local branch and it was a bit ugly since some of the
>> distributions have common math names (e.g., gamma) and others are very
>> short (e.g., t or f). I think a prefix is needed, and after looking
>> through the C API docs npy_random_ seemed like a reasonable choice (since
>> these live in numpy.random).
>>
>> Any thoughts on the following questions are welcome (others too):
>>
>> 1. Should there be a prefix on the C functions?
>> 2. If so, what should the prefix be?
>
> Before worrying about naming details, can we start with "what should be
> in the C/Cython API"? If I look through the current pxd files, there's a
> lot there that looks like it should be private, and what we expose as
> Python API is not all present as far as I can tell (which may be fine, if
> the only goal is to let people write new generators rather than use the
> existing ones from Cython without the Python overhead).

From the ground up, for someone who wants to write a new distribution:

1. The bit generators. These currently have no pxd files. These are
always going to be Python objects, and so it isn't absolutely essential
to expose them with a low-level API. All that is needed is the capsule,
which holds the bitgen struct - and that struct is what is really needed.
2. bitgen_t, which is in common.pxd. This is essential since it enables
access to the callables that produce basic pseudo-random values.
3. The distributions, which are in distributions.pxd. The integer
generators are in bounded_integers.pxd.in, which would need to be
processed and then included after processing (same for
bounded_integers.pyx.in).
   a. The legacy distributions, in legacy_distributions.pxd. If the
legacy is included, then aug_bitgen_t needs to also be included, which
is also in legacy_distributions.pxd.
4. The "helpers" which are defined in common.pxd. These simplify
implementing complete distributions which support automatic broadcasting
when needed. They are only provided to match the signatures for the
functions in distributions.pxd. The highest-level ones are cont() and
disc(). Some of the lower-level ones could easily be marked as private.

1, 2 and 3 are pretty important. 4 could be in or out. It could help if
someone wanted to write a fully featured distribution w/ broadcasting,
but I think this use case is less likely than someone, say, wanting to
implement a custom rejection sampler.

For someone who wants to write a new BitGenerator:

1. BitGenerator and SeedSequence in bit_generator.pxd are required. As
is bitgen_t, which is in common.pxd; bitgen_t should probably move to
bit_generators.
2. aligned_malloc: This has been requested on multiple occasions and is
practically important when interfacing with SSE or AVX code. It is
potentially more general than the random module. This lives in
common.pxd.

> In the end we want to get to a doc section similar to
> http://scipy.github.io/devdocs/special.cython_special.html I'd think.
>
>> 3. Should the legacy C functions be part of the API -- these are mostly
>> the ones that produce or depend on polar transform normals (Box-Muller). I
>> have a feeling no, but there may be reasons to prefer BM since they do not
>> depend on rejection sampling.
>
> Even if there would be a couple of users interested, it would be odd
> starting to do this after deeming the code "legacy". So I agree with your
> "no".
>
>> 4. Should the low-level API be consumable like any other NumPy C API, by
>> including the usual header locations and library locations? Right now, the
>> pxd simplifies writing Cython, but users have to specify the location of
>> the headers and source manually. An alternative would be to provide a
>> function like np.get_include() -> np.random.get_include() that would
>> specialize in random.
>
> Good question. I'm not sure this is "like any other NumPy C API". We don't
> provide a C API for fft, linalg or other functionality further from core
> either. It's possible of course, but does it really help library authors
> or end users?

SciPy provides a very useful Cython API to low-level linalg. But there is
little reason to provide C APIs to fft or linalg since they are all
directly available. The code in numpy.random is, AFAICT, one of the more
complete C implementations of functions needed to produce variates from
many distributions (mostly due to its ancestor randomkit, which AFAICT
isn't maintained).

An ideal API would allow projects like
https://github.com/deepmind/torch-randomkit/tree/master/randomkit or
numba to consume the code in NumPy without vendoring it.

Best wishes,
Kevin

> Cheers,
> Ralf
>
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at python.org
> https://mail.python.org/mailman/listinfo/numpy-discussion
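(To make points 1 and 2 above concrete: the bit generators also expose
their bitgen_t callables at the Python level through a ctypes interface,
which gives a rough feel for what Cython consumers would cimport. A
sketch - attribute names follow the NumPy 1.17 random docs, so treat it
as illustrative rather than authoritative:)

```python
# Sketch: touching the bitgen_t callables via the ctypes interface that
# NumPy's bit generators expose. Cython users would instead cimport
# bitgen_t and unpack the "BitGenerator" PyCapsule; this Python version
# only illustrates what the struct provides.
import numpy as np

bg = np.random.PCG64(12345)
iface = bg.ctypes   # fields: state_address, state, next_uint64,
                    #         next_uint32, next_double, bit_generator

raw = iface.next_uint64(iface.state)   # one raw 64-bit unsigned draw
u = iface.next_double(iface.state)     # one double on [0, 1)
print(hex(raw), u)
```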
From evgeny.burovskiy at gmail.com Thu Sep 19 07:10:44 2019
From: evgeny.burovskiy at gmail.com (Evgeni Burovski)
Date: Thu, 19 Sep 2019 14:10:44 +0300
Subject: [Numpy-discussion] Low-level API for Random
In-Reply-To:
References:
Message-ID:

>>> 1. Should there be a prefix on the C functions?
>>> 2. If so, what should the prefix be?

Preferably, yes. I don't have an opinion on the exact prefix, as long as
it allows me to e.g. swap a normal distribution generator in my
Cython/C++ user code without too much mess.

> if the only goal is to let people write new generators rather than use the
> existing ones from Cython without the Python overhead).

Is it the only goal? If possible, it'd be worth IMO supporting something
more like cython_lapack, so that one can use the existing machinery from
a Cython application. Use case: an MC application where drawing random
variates is in a hot loop. Then I can start from a Python prototype and
cythonize it gradually. Sure, I can reach into non-public parts, but I'd
rather not have to.

> In the end we want to get to a doc section similar to
> http://scipy.github.io/devdocs/special.cython_special.html I'd think.
>
>> 3. Should the legacy C functions be part of the API -- these are mostly
>> the ones that produce or depend on polar transform normals (Box-Muller). I
>> have a feeling no, but there may be reasons to prefer BM since they do not
>> depend on rejection sampling.
>
> Even if there would be a couple of users interested, it would be odd
> starting to do this after deeming the code "legacy". So I agree with your
> "no".

Unless it's a big maintenance burden, is there an issue with exposing
both ziggurat_normal and bm_normal? Sure, I can cook up a BM transform
myself (yet again), but I'd rather not.

>> 4. Should the low-level API be consumable like any other NumPy C API, by
>> including the usual header locations and library locations? Right now, the
>> pxd simplifies writing Cython, but users have to specify the location of
>> the headers and source manually. An alternative would be to provide a
>> function like np.get_include() -> np.random.get_include() that would
>> specialize in random.
>
> Good question. I'm not sure this is "like any other NumPy C API". We don't
> provide a C API for fft, linalg or other functionality further from core
> either. It's possible of course, but does it really help library authors
> or end users?

While I gave only anecdotal evidence, not hard data, I suspect that both
Cython and C APIs would be useful. E.g. there are C++ applications which
use boost::random; it would be nice to be able to swap it for
numpy.random. Also reproducibility: it's *much* easier to debug the
compiled app vs. its Python prototype if the random streams are
identical.

Like I said, take all I'm saying with enough salt, as I'm wearing my
user hat here.

Cheers,

Evgeni
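(Evgeni's "prototype in Python, cythonize the hot loop later" workflow
might look like the toy Monte Carlo sketch below - the function is
invented for illustration; the point is that the draws inside the loop
are exactly what one would later pull from the Cython/C interface:)

```python
# Toy Monte Carlo prototype with random draws in a hot loop: estimate
# pi by sampling points in the unit square. In a cythonized version,
# the per-iteration draws would come from the low-level API instead.
import numpy as np

def estimate_pi(n_samples, seed=0):
    rng = np.random.default_rng(seed)
    inside = 0
    for _ in range(n_samples):          # the hot loop to cythonize
        x = rng.uniform(-1.0, 1.0)
        y = rng.uniform(-1.0, 1.0)
        if x * x + y * y <= 1.0:
            inside += 1
    return 4.0 * inside / n_samples

print(estimate_pi(100_000))   # ~3.14
```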
From robert.kern at gmail.com Thu Sep 19 10:52:16 2019
From: robert.kern at gmail.com (Robert Kern)
Date: Thu, 19 Sep 2019 10:52:16 -0400
Subject: [Numpy-discussion] Low-level API for Random
In-Reply-To:
References:
Message-ID:

On Thu, Sep 19, 2019 at 5:24 AM Ralf Gommers wrote:

> On Thu, Sep 19, 2019 at 10:28 AM Kevin Sheppard <
> kevin.k.sheppard at gmail.com> wrote:
>
>> There are some users of the NumPy C code in randomkit. This was never
>> officially supported. There has been a long-open issue to provide this
>> officially.
>>
>> When I wrote randomgen I supplied .pxd files that make it simpler to
>> write Cython code that uses the components. The lower-level API has not
>> had much scrutiny and is in need of a clean-up. I thought this would also
>> encourage users to extend the random machinery themselves as part of their
>> project or code so as to minimize the requests for new (exotic)
>> distributions to be included in Generator.
>>
>> Most of the generator functions follow the pattern random_DISTRIBUTION.
>> Some have a bit more name mangling which can easily be cleaned up (like
>> random_gauss_zig, which should become PREFIX_standard_normal).
>>
>> Ralf Gommers suggested unprefixed names.
>
> I suggested that the names should match the Python API, which I think
> isn't quite the same. The Python API doesn't contain things like "gamma",
> "t" or "f".

As the implementations evolve, they aren't going to match one-to-one
100%. The implementations are shared by the legacy RandomState. When we
update an algorithm, we'll need to make a new function with the better
algorithm for Generator to use; then we'll have two C functions roughly
corresponding to the same method name (albeit on different classes). C
doesn't give us as many namespace options as Python. We could rely on
conventional prefixes to distinguish between the two classes of function
(e.g. legacy_normal vs random_normal). There are times when it would be
nice to be more descriptive about the algorithm difference (e.g.
random_normal_polar vs random_normal_ziggurat), but most of our
algorithm updates will be minor tweaks rather than changing to a new
named algorithm.

>> I tried this in a local branch and it was a bit ugly since some of the
>> distributions have common math names (e.g., gamma) and others are very
>> short (e.g., t or f). I think a prefix is needed, and after looking
>> through the C API docs npy_random_ seemed like a reasonable choice (since
>> these live in numpy.random).
>>
>> Any thoughts on the following questions are welcome (others too):
>>
>> 1. Should there be a prefix on the C functions?
>> 2. If so, what should the prefix be?
>
> Before worrying about naming details, can we start with "what should be
> in the C/Cython API"? If I look through the current pxd files, there's a
> lot there that looks like it should be private, and what we expose as
> Python API is not all present as far as I can tell (which may be fine, if
> the only goal is to let people write new generators rather than use the
> existing ones from Cython without the Python overhead)

Using the existing distributions from Cython was a requested feature and
an explicit goal, yes. There are users waiting for this.

--
Robert Kern

From warren.weckesser at gmail.com Thu Sep 19 11:10:28 2019
From: warren.weckesser at gmail.com (Warren Weckesser)
Date: Thu, 19 Sep 2019 11:10:28 -0400
Subject: [Numpy-discussion] Proposal to accept NEP 32: Remove the
 financial functions from NumPy
Message-ID:

NEP 32 is available at
https://numpy.org/neps/nep-0032-remove-financial-functions.html

Recent timeline:

- 30-Aug-2019 - A pull request with NEP 32 submitted.
- 03-Sep-2019 - Announcement of the NEP 32 pull request on the
NumPy-Discussion mailing list, with the text of the NEP included in the
email.
- 08-Sep-2019 - NEP 32 announced on the PyData mailing list (not standard
procedure, but suggested in a response to the email in NumPy-Discussion).
- 09-Sep-2019 - NEP 32 pull request merged.
- 11-Sep-2019 - Emails sent to the NumPy-Discussion and PyData mailing lists with links to the online version of the NEP. Only one user (speaking for a group of 12 or so) expressed a preference for keeping the functions in NumPy, and that user acknowledged "Probably not a huge inconvenience if we would have to use another library". (The NEP includes a plan to provide an alternative package for the functions.) Several other users were in favor of removing them. Among the current NumPy developers who have expressed an opinion, all are in favor of removing the functions. There have been no additional email responses since the reminder was sent on September 11. In accordance with NEP 0, I propose that the status of NEP 32 be changed to *Accepted*. If there are no substantive objections within 7 days from this email, then the NEP will be accepted; see NEP 0 for more details ( https://numpy.org/neps/nep-0000.html#how-a-nep-becomes-accepted). Warren -------------- next part -------------- An HTML attachment was scrubbed... URL: From sseibert at anaconda.com Thu Sep 19 11:10:36 2019 From: sseibert at anaconda.com (Stanley Seibert) Date: Thu, 19 Sep 2019 10:10:36 -0500 Subject: [Numpy-discussion] Low-level API for Random In-Reply-To: References: Message-ID: Just to chime in: Numba would definitely appreciate C functions to access the random distribution implementations, and have a side-project (numba-scipy) that is making the Cython wrapped functions in SciPy visible to Numba. On Thu, Sep 19, 2019 at 5:41 AM Kevin Sheppard wrote: > > > On Thu, Sep 19, 2019 at 10:23 AM Ralf Gommers > wrote: > >> >> >> On Thu, Sep 19, 2019 at 10:28 AM Kevin Sheppard < >> kevin.k.sheppard at gmail.com> wrote: >> >>> There are some users of the NumPy C code in randomkit. This was never >>> officially supported. There has been a long open issue to provide this >>> officially. >>> >>> When I wrote randomgen I supplied .pdx files that make it simpler to >>> write Cython code that uses the components. The lower-level API has not >>> had much scrutiny and is in need of a clean-up. I thought this would also >>> encourage users to extend the random machinery themselves as part of their >>> project or code so as to minimize the requests for new (exotic) >>> distributions to be included in Generator. >>> >>> Most of the generator functions follow a pattern random_DISTRIBUTION. >>> Some have a bit more name mangling which can easily be cleaned up (like >>> ranomd_gauss_zig, which should become PREFIX_standard_normal). >>> >>> Ralf Gommers suggested unprefixed names. >>> >> >> I suggested that the names should match the Python API, which I think >> isn't quite the same. The Python API doesn't contain things like "gamma", >> "t" or "f". >> > > My gamma and f (I misspoke about t) I mean the names that appear as > Generator methods: > > > https://docs.scipy.org/doc/numpy/reference/random/generator.html#numpy.random.Generator > > > If I understand your point (and with reference with page linked below), > then there would be something like numpy.random.cython_random.gamma (which > is currently called numpy.random.distributions.random_gamma). Maybe I'm not > understanding your point about the Python API though. > > >> >> I tried this in a local branch and it was a bit ugly since some of the >>> distributions have common math names (e.g., gamma) and others are very >>> short (e.g., t or f). 
I think a prefix is needed, and after looking >>> through the C API docs npy_random_ seemed like a reasonable choice (since >>> these live in numpy.random). >>> >>> Any thoughts on the following questions are welcome (others too): >>> >>> 1. Should there be a prefix on the C functions? >>> 2. If so, what should the prefix be? >>> >> >> Before worrying about naming details, can we start with "what should be >> in the C/Cython API"? If I look through the current pxd files, there's a >> lot there that looks like it should be private, and what we expose as >> Python API is not all present as far as I can tell (which may be fine, if >> the only goal is to let people write new generators rather than use the >> existing ones from Cython without the Python overhead). >> > > From the ground up, for someone who want to write a new distribution: > 1. The bit generators. These currently have no pxd files. These are > always going to be Python obects and so it isn't absolutely essential to > expose them with a low-level API. All that is needed is the capsule which > has the bitgen struct, which is what is really needed > 2. bitgen_t which is in common.pxd. This is essential since it enables > access to the callables to produce basic psueod random values. > 3. The distributions, which are in distributions.pdx. The integer > generators are in bounded_integers.pxd.in, which would need to be > processed and then included after processing (same for > bounded_integers.pxd.in). > a. The legacy in legacy_distributions.pxd. If the legacy is > included, then aug_bitgen_t needs to also be included which is also in > legacy_distributions.pxd > 4. The "helpers" which are defined in common.pxd. These simplify > implementing complete distributions which support automatix broadcasting > when needed. They are only provided to match the signatures for the > functions in distributions.pxd. The highest level ones are cont() and > disc(). Some of the lower-level ones could easily be marked as private. > > 1,2 and 3 are pretty important. 4 could be in or out. It could help if > someone wanted to write a fully featured distribution w/ broadcasting, but > I think this use case is less likely than someone say wanting to implement > a custom rejection sampler. > > > For someone who wants to write a new BitGenerator > > 1. BitGenerator and SeedSequence in bit_generato.pxd are required. As is > bitgen_t which is in common. bitgen_t should probably move to > bit_generators. > 2. aligned_malloc: This has been requested on multiple occasions and is > practically important when interfacing with SSE or AVX code. It is > potentially more general than the random module. This lives in common.pxd. > > > >> >> In the end we want to get to a doc section similar to >> http://scipy.github.io/devdocs/special.cython_special.html I'd think. >> >> 3. Should the legacy C functions be part of the API -- these are mostly >>> the ones that produce or depend on polar transform normals (Box-Muller). I >>> have a feeling no, but there may be reasons to prefer BM since they do not >>> depend on rejection sampling. >>> >> >> Even if there would be a couple of users interested, it would be odd >> starting to do this after deeming the code "legacy". So I agree with your >> "no". >> >> >>> 4. Should low-level API be consumable like any other numpy C API by >>> including the usual header locations and library locations? 
Right now, the >>> pxd simplifies writing Cython but users have sp specify the location of the >>> headers and source manually An alternative would be to provide a function >>> like np.get_include() -> np.random.get_include() that would specialize in >>> random. >>> >> >> Good question. I'm not sure this is "like any other NumPy C API". We >> don't provide a C API for fft, linalg or other functionality further from >> core either. It's possible of course, but does it really help library >> authors or end users? >> > > SciPy provides a very useful Cython API to low-level linalg. But there is > little reason to provide C APIs to fft or linalg since they are all > directly available. The code is random is AFAICT, one of the more complete > C implementations of functions needed to produce variates from many > distributions (mostly due to its ancestor randomkit, which AFAICT isn't > maintained). > > An ideal API would allow projects like > https://github.com/deepmind/torch-randomkit/tree/master/randomkit or > numba to consume the code in NumPy without vendoring it. > > Best wishes, > Kevin > > >> Cheers, >> Ralf >> >> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at python.org >> https://mail.python.org/mailman/listinfo/numpy-discussion >> > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From sebastian at sipsolutions.net Thu Sep 19 13:51:15 2019 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Thu, 19 Sep 2019 10:51:15 -0700 Subject: [Numpy-discussion] DType Roadmap/NEP Discussion In-Reply-To: References: <5f00290851aa9418215c4bd0fea3378bf94dcc79.camel@sipsolutions.net> Message-ID: <445fc648fcc51c2893b8f8037bed05651a594ac2.camel@sipsolutions.net> On Wed, 2019-09-18 at 21:33 -0700, Ralf Gommers wrote: > Hi Sebastian, > > > On Wed, Sep 18, 2019 at 4:35 PM Sebastian Berg < > sebastian at sipsolutions.net> wrote: > > Hi all, > > > > to try and make some progress towards a decision since the broad > > design > > is pretty much settling from my side. I am thinking about making a > > meeting, and suggest Monday at 11am Pacific Time (I am open to > > other > > times though). > > > > My hope is to get everyone interested on board, so that we can make > > an > > informed decision about the general direction very soon. So just > > reach > > out, or discuss on the mailing list as well. > > > > The current draft for an NEP is here: > > https://hackmd.io/kxuh15QGSjueEKft5SaMug?both > > > > There are some design goals that I would like to clear up. > > The design itself seems very sensible to me insofar as I understand > it. After having read your document again, I think you're still > missing the actual goals though. "structure of class layout" and > "type hierarchy" are important, but they're not the goals. You're > touching on the real goals in places, but it may be valuable to be > much more explicit there. > Good points, I will try and incorporate some. Had answers to a few, but I do not think it is too helpful here and now; this got a bit longer than expected, but more general... There is a bit of clash of long term vs. mid term goals. My goal is to enable pretty much any conceivable long term goal, but in the mid/short term, that means: 1. 
Convince you (and me) that the proposed API can handle everything we can
think of now and can grow easily (e.g. optimization, new features).
2. Convince everyone that the current state is bad enough that any added
maintenance burden (during the transition phase) is acceptable. I
personally think the maintenance will definitely get better quickly,
even if we reuse a lot of old code. The main issue is the initial
massive set of changes.
3. Convince everyone that any necessary ABI/API breakage that may happen
is acceptable. The DType breakage itself is very limited. Specific
UFuncs may break more, but only in hidden features for which I know only
of astropy as a user (and they are OK with us breaking them); numba
might also be affected, but I think less so.

The main point right now is organizing everything from monolithic ->
operator based, improving long-term maintainability and extensibility.
Dogfooding ourselves for the same reason.

E.g. the AbstractDType hierarchy... it is something we could discuss. I
think it is right, since it replaces `dtype.kind` and makes for powerful
organization of dispatching in ufuncs. But we could limit it initially!
To give one example: Say ora creates many DTypes with different datetime
representations. ora could create an AbstractOraDType, so that you can
do easy isinstance checks. Especially during ufunc dispatch, ora can use
it to write a single function for figuring out promotion:
`OraDType1 + OraDType2 -> OraDType1 + OraDType2.astype(OraDType1)`.

I agree that this is probably missing: UFunc dispatch is a major reason
for the split of "common DType" (class) and "common dtype instance" (of
strings with different length) functionality. I think it is a reasonable
split in any case, but for dispatching the first is sufficient, while
the second is more naturally found after dispatching (only after you
know you have Unit * Unit can you reasonably figure out the actual
output `Unit("m*m")`).

Best,

Sebastian

PS: The only real limitation that I see right now is allowing promotion
to inspect array values (this example is probably not very good). For
example, `int_arr.astype(Categorical)` wants to find
`Categorical(np.unique(int_arr))`. I think not providing that is
acceptable, because Categorical can provide its own function to find the
actual categorical instance. Or implement a Categorical and
FrozenCategorical, so the dtype instance is mutable in that it can add
new categories. (For array coercion from a list of items, the issue is
different, and allowing such things can be provided or added later.)
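(A purely illustrative Python sketch of the AbstractOraDType idea in the
example above - all class names come from Sebastian's hypothetical ora
scenario, and none of this is proposed NumPy API:)

```python
# Sketch of using an abstract parent DType for isinstance checks plus a
# single promotion rule, mimicking the AbstractOraDType example with
# plain Python classes. Entirely hypothetical, not NumPy API.

class AbstractOraDType:
    """Abstract parent: never instantiated, used only for dispatch."""

class OraDType1(AbstractOraDType):
    pass

class OraDType2(AbstractOraDType):
    pass

def common_dtype(a, b):
    # One rule covers every pairing within the ora hierarchy, instead
    # of one rule per concrete (DType, DType) combination. The "first
    # argument wins" rule mirrors the OraDType1 + OraDType2 ->
    # OraDType1 example in the email.
    if isinstance(a, AbstractOraDType) and isinstance(b, AbstractOraDType):
        return type(a)
    raise TypeError("no common DType")

print(common_dtype(OraDType1(), OraDType2()).__name__)  # OraDType1
```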
> Here are some example goals:
>
> 1. Make creating new dtypes via the NumPy C API take at least 4x fewer
> lines of code on average (in practice: measured on rational/quaternion,
> since it is hard to measure otherwise).
>
> 2. Make it possible to create new dtypes with full functionality via
> the NumPy Python API. Performance may be up to 1-2 orders of magnitude
> worse than when creating the same dtype via the C API; the main purpose
> is to allow easier prototyping of new dtypes.
>
> 3. Make the NumPy codebase more maintainable by removing special-casing
> of datetime dtypes in many places.
>
> 4. Enable creation of a units library whose arrays *are* numpy arrays
> rather than a subclass or duck array. This will make such a library
> work much better with SciPy and other existing libraries that use
> np.asarray extensively.
>
> 5. Hide currently exposed implementation details in the C API so
> long-term .... (you have this one, but it would be nice to work it
> out a little more - after all, we recently considered reverting the
> deprecation for direct field access, so how important is this?)
>
> 6. Improve casting behavior for external dtypes.
>
> 7. Make np.char behavior better (you mention that fixed-length strings
> work poorly now, but not what would change).
>
> Listing non-goals would also be useful:
>
> 1. Performance: no significant performance improvements are expected.
> We aim for no performance regressions.
>
> 2. Introducing new dtypes into NumPy itself.
>
> 3. Pandas ExtensionArrays? You mention them, but does this dtype
> redesign help Pandas in any way or not?
>
> 4. Changes to NumPy's current casting rules.
>
> 5. Allowing creation of dtypes that don't fit the current NumPy model
> of what a dtype is (e.g. ref [1]), such as a variable-length string
> dtype.
>
> Many of those (and there can be more, this is just what came to mind
> now) can/should be a paragraph or section. In my experience, describing
> these goals and requirements well takes about 15-30% of the length of
> the design description. Think of, for example, a Pandas or units
> library maintainer reading this: they should be able to stop reading
> where you now have "Overview Graphic" and have a pretty clear
> high-level understanding of what this whole redesign will mean for
> them. Same for a NumPy maintainer who wants to get a sense of what the
> benefits and impacts will be: reading only (the expanded version of)
> your Abstract, Motivation and Scope, and Backwards Compatibility
> sections should be enough.
>
> Here's a concrete question; it's the type of thing I'd like to
> understand without having to understand the whole design in detail:
> ```
> >>> import datetime
> >>> import numpy as np
> >>> import pandas as pd
> >>> dti = pd.to_datetime(['1/1/2018', np.datetime64('2018-01-01'),
> ...                       datetime.datetime(2018, 1, 1)])
> >>>
> >>> dti.values
> array(['2018-01-01T00:00:00.000000000', '2018-01-01T00:00:00.000000000',
>        '2018-01-01T00:00:00.000000000'], dtype='datetime64[ns]')
> >>> dti.values.dtype
> dtype('<M8[ns]')
> >>> isinstance(dti.values.dtype, np.dtype)
> True
> >>> dti.dtype == dti.values.dtype  # okay, that's nice
> True
>
> >>> start = pd.to_datetime('2015-02-24')
> >>> rng = pd.date_range(start, periods=3)
> >>> t = pd.Series(rng)
> >>> t_withzone = t.dt.tz_localize('UTC').dt.tz_convert('Asia/Kolkata')
> >>> t_withzone
> 0   2015-02-24 05:30:00+05:30
> 1   2015-02-25 05:30:00+05:30
> 2   2015-02-26 05:30:00+05:30
> dtype: datetime64[ns, Asia/Kolkata]
> >>> t_withzone.dtype
> datetime64[ns, Asia/Kolkata]
> >>> t_withzone.values.dtype
> dtype('<M8[ns]')
> >>> t_withzone.dtype == t_withzone.values.dtype  # could this be True in the future?
> False
> ```
> So can Pandas create timezone-aware numpy dtypes in the future if
> they want to, or would they still be better off rolling their own?
>
> Also one question/comment about the design content. When looking at
> the current external dtypes (e.g. [2]), a large part of the work of
> implementing a new dtype now deals with ufunc behavior. It's not
> clear from your document how that changes with the new design - can
> you add something about that?
> > Cheers, > Ralf > > [1] > http://scipy-lectures.org/advanced/advanced_numpy/index.html#the-descriptor > [2] > https://github.com/moble/quaternion/blob/master/numpy_quaternion.c > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 833 bytes Desc: This is a digitally signed message part URL: From matti.picus at gmail.com Thu Sep 19 14:35:50 2019 From: matti.picus at gmail.com (Matti Picus) Date: Thu, 19 Sep 2019 21:35:50 +0300 Subject: [Numpy-discussion] DType Roadmap/NEP Discussion In-Reply-To: <5f00290851aa9418215c4bd0fea3378bf94dcc79.camel@sipsolutions.net> References: <5f00290851aa9418215c4bd0fea3378bf94dcc79.camel@sipsolutions.net> Message-ID: <235dc0a0-0ef2-8b06-3a5d-05c2746ae750@gmail.com> On 19/9/19 2:34 am, Sebastian Berg wrote: > Hi all, > > to try and make some progress towards a decision since the broad design > is pretty much settling from my side. I am thinking about making a > meeting, and suggest Monday at 11am Pacific Time (I am open to other > times though). > > My hope is to get everyone interested on board, so that we can make an > informed decision about the general direction very soon. So just reach > out, or discuss on the mailing list as well. > > The current draft for an NEP is here: > https://hackmd.io/kxuh15QGSjueEKft5SaMug?both Mon Sept 23 sounds good. Please reach out to the possible consumers of the API to get wider input. - Pandas - Astropy - Numba - ??? It may be a bit too short notice, but it seems like there is enough to talk about even if only the NumPy community show up. Where/how will the meeting take place? Matti From warren.weckesser at gmail.com Thu Sep 19 19:09:25 2019 From: warren.weckesser at gmail.com (Warren Weckesser) Date: Thu, 19 Sep 2019 19:09:25 -0400 Subject: [Numpy-discussion] DType Roadmap/NEP Discussion In-Reply-To: <5f00290851aa9418215c4bd0fea3378bf94dcc79.camel@sipsolutions.net> References: <5f00290851aa9418215c4bd0fea3378bf94dcc79.camel@sipsolutions.net> Message-ID: On 9/18/19, Sebastian Berg wrote: > Hi all, > > to try and make some progress towards a decision since the broad design > is pretty much settling from my side. I am thinking about making a > meeting, and suggest Monday at 11am Pacific Time (I am open to other > times though). That works for me. Warren > > My hope is to get everyone interested on board, so that we can make an > informed decision about the general direction very soon. So just reach > out, or discuss on the mailing list as well. > > The current draft for an NEP is here: > https://hackmd.io/kxuh15QGSjueEKft5SaMug?both > > There are some design goals that I would like to clear up. I would > prefer to avoid deep discussions of some specific issues, since I think > the important decision right now is that my general start is in the > right direction. > > It is not an easy topic, so my plan would be try and briefly summarize > that and then hopefully clarify any questions and then we can discuss > why alternatives are rejected. The most important thing is maybe > gathering concerns which need to be clarified before we can go towards > accepting the general design ideas. 
> > The main point of the NEP draft is actually captured by the picture in > the linked document: DTypes are classes (such as Float64) and what is > attached to the array is an instance of that class " ">float64". Additionally, we would have AbstractDType classes which > cannot be instantiated but define a type hierarchy. > > To list the main points: > > * DTypes are classes (corresponding to the current type number) > > * `arr.dtype` is an instances of its class, allowing to store > additional information such as a physical unit, the string length. > > * Most things are defined in special dtype slots similar to Pythons > type and number slots. They will be hidden and can be set through > an init function similar to `PyType_FromSpec` [1]. > > * Promotion is defined primarily on the DType classes > > * Casting from one DType to another DType is defined by a new > CastingImpl object (should become a special ufunc) > - e.g. for strings, the CastingImpl is in charge of finding the > correct string length > > * The AbstractDType hierarchy will be used to decide the signature when > calling UFuncs. > > > The main iffier points I can think of are: > > * NumPy currently uses value based promotion in some cases, which > requires special AbstractDTypes to describe (and some legacy > paths). (They are used use more like instances than typical classes) > > * Casting between flexible dtypes (such as strings) is a multi-step > process to figure out the actual output dtype. > - An example is: `np.can_cast("float64", "S3")` first finding > that `Float64->String` is possible in principle and then > asking the CastingImpl to find that `float64->S3` is not. > > * We have to break ABI compatibility in very minor, back-portable > way. More smaller incompatibilities are likely [2]. > > * Since it is a major redesign, a lot of code has to be added/touched, > although it is possible to channel much of it back into the old > machinery. > > * A largish amount of new API around new DType type objects and also > DTypeMeta type objects, which users can (although usually do not have > to) subclass. > > However, most other designs will have similar issues. Basically, I > currently really think this is "right", even if some details may end up > a tricky. > > Best, > > Sebastian > > > PS: The one thing outside the more general list above that I may want > to discuss is how acceptable a global dict/mapping for dtype discovery > during `np.array` coercion is (mapping python type -> dtype)... > > > [1] https://docs.python.org/3/c-api/type.html#c.PyType_FromSpec > [2] One possible issue may be "S0" which is normally used to denote > what in the new API would be the `String` DType class. > From ralf.gommers at gmail.com Thu Sep 19 23:02:46 2019 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Fri, 20 Sep 2019 05:02:46 +0200 Subject: [Numpy-discussion] Low-level API for Random In-Reply-To: References: Message-ID: On Thu, Sep 19, 2019 at 4:53 PM Robert Kern wrote: > On Thu, Sep 19, 2019 at 5:24 AM Ralf Gommers > wrote: > >> >> On Thu, Sep 19, 2019 at 10:28 AM Kevin Sheppard < >> kevin.k.sheppard at gmail.com> wrote: >> >>> There are some users of the NumPy C code in randomkit. This was never >>> officially supported. There has been a long open issue to provide this >>> officially. >>> >>> When I wrote randomgen I supplied .pdx files that make it simpler to >>> write Cython code that uses the components. The lower-level API has not >>> had much scrutiny and is in need of a clean-up. 
I thought this would also >>> encourage users to extend the random machinery themselves as part of their >>> project or code so as to minimize the requests for new (exotic) >>> distributions to be included in Generator. >>> >>> Most of the generator functions follow a pattern random_DISTRIBUTION. >>> Some have a bit more name mangling which can easily be cleaned up (like >>> ranomd_gauss_zig, which should become PREFIX_standard_normal). >>> >>> Ralf Gommers suggested unprefixed names. >>> >> >> I suggested that the names should match the Python API, which I think >> isn't quite the same. The Python API doesn't contain things like "gamma", >> "t" or "f". >> > > As the implementations evolve, they aren't going to match one-to-one 100%. > The implementations are shared by the legacy RandomState. When we update an > algorithm, we'll need to make a new function with the better algorithm for > Generator to use, then we'll have two C functions roughly corresponding to > the same method name (albeit on different classes). C doesn't give us as > many namespace options as Python. We could rely on conventional prefixes to > distinguish between the two classes of function (e.g. legacy_normal vs > random_normal). > That seems simple and clear There are times when it would be nice to be more descriptive about the > algorithm difference (e.g. random_normal_polar vs random_normal_ziggurat), > We decided against versioning algorithms in NEP 19, so an update to an algorithm would mean we'd want to get rid of the older version (unless it's still in use by legacy). So AFAICT we'd never have both random_normal_polar and random_normal_ziggurat present at the same time? I may be missing your point here, but if we have in Python `Generator.normal` and can switch its implementation from polar to ziggurat or vice versa without any deprecation, then why would we want to switch names in the C API? most of our algorithm updates will be minor tweaks rather than changing to > a new named algorithm. > > >> I tried this in a local branch and it was a bit ugly since some of the >>> distributions have common math names (e.g., gamma) and others are very >>> short (e.g., t or f). I think a prefix is needed, and after looking >>> through the C API docs npy_random_ seemed like a reasonable choice (since >>> these live in numpy.random). >>> >>> Any thoughts on the following questions are welcome (others too): >>> >>> 1. Should there be a prefix on the C functions? >>> 2. If so, what should the prefix be? >>> >> >> Before worrying about naming details, can we start with "what should be >> in the C/Cython API"? If I look through the current pxd files, there's a >> lot there that looks like it should be private, and what we expose as >> Python API is not all present as far as I can tell (which may be fine, if >> the only goal is to let people write new generators rather than use the >> existing ones from Cython without the Python overhead) >> > > Using the existing distributions from Cython was a requested feature and > an explicit goal, yes. There are users waiting for this. > Thanks, clear (also from other responses on this thread). Cheers, Ralf -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From robert.kern at gmail.com Thu Sep 19 23:25:21 2019 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 19 Sep 2019 23:25:21 -0400 Subject: [Numpy-discussion] Low-level API for Random In-Reply-To: References: Message-ID: On Thu, Sep 19, 2019 at 11:04 PM Ralf Gommers wrote: > > > On Thu, Sep 19, 2019 at 4:53 PM Robert Kern wrote: > >> On Thu, Sep 19, 2019 at 5:24 AM Ralf Gommers >> wrote: >> >>> >>> On Thu, Sep 19, 2019 at 10:28 AM Kevin Sheppard < >>> kevin.k.sheppard at gmail.com> wrote: >>> >>>> There are some users of the NumPy C code in randomkit. This was never >>>> officially supported. There has been a long open issue to provide this >>>> officially. >>>> >>>> When I wrote randomgen I supplied .pdx files that make it simpler to >>>> write Cython code that uses the components. The lower-level API has not >>>> had much scrutiny and is in need of a clean-up. I thought this would also >>>> encourage users to extend the random machinery themselves as part of their >>>> project or code so as to minimize the requests for new (exotic) >>>> distributions to be included in Generator. >>>> >>>> Most of the generator functions follow a pattern random_DISTRIBUTION. >>>> Some have a bit more name mangling which can easily be cleaned up (like >>>> ranomd_gauss_zig, which should become PREFIX_standard_normal). >>>> >>>> Ralf Gommers suggested unprefixed names. >>>> >>> >>> I suggested that the names should match the Python API, which I think >>> isn't quite the same. The Python API doesn't contain things like "gamma", >>> "t" or "f". >>> >> >> As the implementations evolve, they aren't going to match one-to-one >> 100%. The implementations are shared by the legacy RandomState. When we >> update an algorithm, we'll need to make a new function with the better >> algorithm for Generator to use, then we'll have two C functions roughly >> corresponding to the same method name (albeit on different classes). C >> doesn't give us as many namespace options as Python. We could rely on >> conventional prefixes to distinguish between the two classes of function >> (e.g. legacy_normal vs random_normal). >> > > That seems simple and clear > > There are times when it would be nice to be more descriptive about the >> algorithm difference (e.g. random_normal_polar vs random_normal_ziggurat), >> > > We decided against versioning algorithms in NEP 19, so an update to an > algorithm would mean we'd want to get rid of the older version (unless it's > still in use by legacy). So AFAICT we'd never have both random_normal_polar > and random_normal_ziggurat present at the same time? > Well, we must because one's used by the legacy RandomState and one's used by Generator. :-) > I may be missing your point here, but if we have in Python > `Generator.normal` and can switch its implementation from polar to ziggurat > or vice versa without any deprecation, then why would we want to switch > names in the C API? > I didn't mean to suggest that we'd have an unbounded number of functions as we improve the algorithms, just that we might have 2 once we decide to change something about the algorithm. We need 2 to support both the improved algorithm in Generator and the legacy algorithm in RandomState. The current implementation of the C function would be copied to a new name (`legacy_foo` or whatever), then we'd make RandomState use that frozen copy, then we make the desired modifications to the main function that Generator is referencing (`random_foo`). 
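(For concreteness about what gets frozen here: the "polar" legacy normal
generator referred to throughout this thread is the Marsaglia polar
variant of Box-Muller, which Generator replaced with a ziggurat method.
A rough Python transcription - illustrative only; the real
implementation is C code driven by the bitgen_t callables:)

```python
# Marsaglia polar method: the algorithm behind the legacy normal
# generator. `next_double` stands in for a bitgen_t callable returning
# a uniform double on [0, 1). Illustrative transcription, not the
# actual C source.
import math
import random

def legacy_gauss(next_double):
    while True:                      # rejection loop over the unit disc
        x1 = 2.0 * next_double() - 1.0
        x2 = 2.0 * next_double() - 1.0
        r2 = x1 * x1 + x2 * x2
        if 0.0 < r2 < 1.0:
            f = math.sqrt(-2.0 * math.log(r2) / r2)
            return f * x1            # f * x2 is a second valid variate

print(legacy_gauss(random.random))
```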
Or we could just make those legacy copies now so that people get to use them explicitly under the legacy names, whatever they are, and we can feel more free to modify the main implementations. I suggested this earlier, but convinced myself that it wasn't strictly necessary. But then I admit I was more focused on the Python API stability than any promises about the C/Cython API. We might end up with more than 2 implementations if we need to change something about the function signature, for whatever reason, and we want to retain C/Cython API compatibility with older code. The C functions aren't necessarily going to be one-to-one to the Generator methods. They're just part of the implementation. So for example, if we wanted to, say, precompute some intermediate values from the given scalar parameters so we don't have to recompute them for each element of the `size`-large requested output, we might do that in one C function and pass those intermediate values as arguments to the C function that does the actual sampling. So we'd have two C functions for that one Generator method, and the sampling C function will not have the same signature as it did before the modification that refactored the work into two functions. In that case, I would not be so strict as to require that `Generator.foo` is one to one with `random_foo`. To your point, though, we don't have to use gratuitously different names when there _is_ a one-to-one relationship. `random_gauss_zig` should be `random_normal`. -- Robert Kern -------------- next part -------------- An HTML attachment was scrubbed... URL: From matti.picus at gmail.com Fri Sep 20 01:36:50 2019 From: matti.picus at gmail.com (Matti Picus) Date: Fri, 20 Sep 2019 08:36:50 +0300 Subject: [Numpy-discussion] Low-level API for Random In-Reply-To: References: Message-ID: <045a5563-bc29-c917-d1c8-7da082624cc5@gmail.com> On 20/9/19 6:25 am, Robert Kern wrote: > > Well, we must because one's used by the legacy RandomState and one's > used by Generator. :-) > I would prefer not to create a legacy C-API at all. Are we required to from the NEP? Matti From ralf.gommers at gmail.com Fri Sep 20 06:07:33 2019 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Fri, 20 Sep 2019 12:07:33 +0200 Subject: [Numpy-discussion] Low-level API for Random In-Reply-To: References: Message-ID: On Fri, Sep 20, 2019 at 5:29 AM Robert Kern wrote: > On Thu, Sep 19, 2019 at 11:04 PM Ralf Gommers > wrote: > >> >> >> On Thu, Sep 19, 2019 at 4:53 PM Robert Kern >> wrote: >> >>> On Thu, Sep 19, 2019 at 5:24 AM Ralf Gommers >>> wrote: >>> >>>> >>>> On Thu, Sep 19, 2019 at 10:28 AM Kevin Sheppard < >>>> kevin.k.sheppard at gmail.com> wrote: >>>> >>>>> There are some users of the NumPy C code in randomkit. This was never >>>>> officially supported. There has been a long open issue to provide this >>>>> officially. >>>>> >>>>> When I wrote randomgen I supplied .pdx files that make it simpler to >>>>> write Cython code that uses the components. The lower-level API has not >>>>> had much scrutiny and is in need of a clean-up. I thought this would also >>>>> encourage users to extend the random machinery themselves as part of their >>>>> project or code so as to minimize the requests for new (exotic) >>>>> distributions to be included in Generator. >>>>> >>>>> Most of the generator functions follow a pattern random_DISTRIBUTION. >>>>> Some have a bit more name mangling which can easily be cleaned up (like >>>>> ranomd_gauss_zig, which should become PREFIX_standard_normal). 
>>>>> >>>>> Ralf Gommers suggested unprefixed names. >>>>> >>>> >>>> I suggested that the names should match the Python API, which I think >>>> isn't quite the same. The Python API doesn't contain things like "gamma", >>>> "t" or "f". >>>> >>> >>> As the implementations evolve, they aren't going to match one-to-one >>> 100%. The implementations are shared by the legacy RandomState. When we >>> update an algorithm, we'll need to make a new function with the better >>> algorithm for Generator to use, then we'll have two C functions roughly >>> corresponding to the same method name (albeit on different classes). C >>> doesn't give us as many namespace options as Python. We could rely on >>> conventional prefixes to distinguish between the two classes of function >>> (e.g. legacy_normal vs random_normal). >>> >> >> That seems simple and clear >> >> There are times when it would be nice to be more descriptive about the >>> algorithm difference (e.g. random_normal_polar vs random_normal_ziggurat), >>> >> >> We decided against versioning algorithms in NEP 19, so an update to an >> algorithm would mean we'd want to get rid of the older version (unless it's >> still in use by legacy). So AFAICT we'd never have both random_normal_polar >> and random_normal_ziggurat present at the same time? >> > > Well, we must because one's used by the legacy RandomState and one's used > by Generator. :-) > > >> I may be missing your point here, but if we have in Python >> `Generator.normal` and can switch its implementation from polar to ziggurat >> or vice versa without any deprecation, then why would we want to switch >> names in the C API? >> > > I didn't mean to suggest that we'd have an unbounded number of functions > as we improve the algorithms, just that we might have 2 once we decide to > change something about the algorithm. We need 2 to support both the > improved algorithm in Generator and the legacy algorithm in RandomState. > The current implementation of the C function would be copied to a new name > (`legacy_foo` or whatever), then we'd make RandomState use that frozen > copy, then we make the desired modifications to the main function that > Generator is referencing (`random_foo`). > > Or we could just make those legacy copies now so that people get to use > them explicitly under the legacy names, whatever they are, and we can feel > more free to modify the main implementations. I suggested this earlier, but > convinced myself that it wasn't strictly necessary. But then I admit I was > more focused on the Python API stability than any promises about the > C/Cython API. > > We might end up with more than 2 implementations if we need to change > something about the function signature, for whatever reason, and we want to > retain C/Cython API compatibility with older code. The C functions aren't > necessarily going to be one-to-one to the Generator methods. They're just > part of the implementation. So for example, if we wanted to, say, > precompute some intermediate values from the given scalar parameters so we > don't have to recompute them for each element of the `size`-large requested > output, we might do that in one C function and pass those intermediate > values as arguments to the C function that does the actual sampling. So > we'd have two C functions for that one Generator method, and the sampling C > function will not have the same signature as it did before the modification > that refactored the work into two functions. 
>> In that case, I would not be so strict as to require that
>> `Generator.foo` is one-to-one with `random_foo`.

You're saying "be so strict" as if it were a bad thing, or a major
effort. I understand that in some cases a C API cannot be evolved in the
same way as a Python API, but in the example you're giving here I'd say
you want one function to be public, and one private. Making both public
just exposes more implementation details for no good reason, and will
give us more maintenance issues long-term.

Anyway, this is not an issue today. If we try to keep Python and C APIs
matching, we can deal with possible difficulties with that if and when
they arise - should be infrequent.

Cheers,
Ralf

>> To your point, though, we don't have to use gratuitously different
>> names when there _is_ a one-to-one relationship. `random_gauss_zig`
>> should be `random_normal`.
>>
>> --
>> Robert Kern
>> _______________________________________________
>> NumPy-Discussion mailing list
>> NumPy-Discussion at python.org
>> https://mail.python.org/mailman/listinfo/numpy-discussion
>
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at python.org
> https://mail.python.org/mailman/listinfo/numpy-discussion

From ndbecker2 at gmail.com Fri Sep 20 07:18:38 2019
From: ndbecker2 at gmail.com (Neal Becker)
Date: Fri, 20 Sep 2019 07:18:38 -0400
Subject: [Numpy-discussion] Low-level API for Random
In-Reply-To:
References:
Message-ID:

I have used the C API in the past, and would like to see a convenient
and stable way to do this. Currently I'm using randomgen, but calling
(from C++) into the Python API. The inefficiency is amortized by
generating and caching batches of results.

I thought randomgen was supposed to be the future of numpy random, so
I've based my code on that.

On Fri, Sep 20, 2019 at 6:08 AM Ralf Gommers wrote:
>
> On Fri, Sep 20, 2019 at 5:29 AM Robert Kern wrote:
>>
>> On Thu, Sep 19, 2019 at 11:04 PM Ralf Gommers wrote:
>>>
>>> On Thu, Sep 19, 2019 at 4:53 PM Robert Kern wrote:
>>>>
>>>> On Thu, Sep 19, 2019 at 5:24 AM Ralf Gommers wrote:
>>>>>
>>>>> On Thu, Sep 19, 2019 at 10:28 AM Kevin Sheppard wrote:
>>>>>>
>>>>>> There are some users of the NumPy C code in randomkit. This was never officially supported. There has been a long-open issue to provide this officially.
>>>>>>
>>>>>> When I wrote randomgen I supplied .pxd files that make it simpler to write Cython code that uses the components. The lower-level API has not had much scrutiny and is in need of a clean-up. I thought this would also encourage users to extend the random machinery themselves as part of their project or code so as to minimize the requests for new (exotic) distributions to be included in Generator.
>>>>>>
>>>>>> Most of the generator functions follow the pattern random_DISTRIBUTION. Some have a bit more name mangling which can easily be cleaned up (like random_gauss_zig, which should become PREFIX_standard_normal).
>>>>>>
>>>>>> Ralf Gommers suggested unprefixed names.
>>>>>
>>>>> I suggested that the names should match the Python API, which I think isn't quite the same. The Python API doesn't contain things like "gamma", "t" or "f".
>>>>
>>>> As the implementations evolve, they aren't going to match one-to-one 100%. The implementations are shared by the legacy RandomState. When we update an algorithm, we'll need to make a new function with the better algorithm for Generator to use; then we'll have two C functions roughly corresponding to the same method name (albeit on different classes). C doesn't give us as many namespace options as Python.
We could rely on conventional prefixes to distinguish between the two classes of function (e.g. legacy_normal vs random_normal). >>> >>> >>> That seems simple and clear >>> >>>> There are times when it would be nice to be more descriptive about the algorithm difference (e.g. random_normal_polar vs random_normal_ziggurat), >>> >>> >>> We decided against versioning algorithms in NEP 19, so an update to an algorithm would mean we'd want to get rid of the older version (unless it's still in use by legacy). So AFAICT we'd never have both random_normal_polar and random_normal_ziggurat present at the same time? >> >> >> Well, we must because one's used by the legacy RandomState and one's used by Generator. :-) >> >>> >>> I may be missing your point here, but if we have in Python `Generator.normal` and can switch its implementation from polar to ziggurat or vice versa without any deprecation, then why would we want to switch names in the C API? >> >> >> I didn't mean to suggest that we'd have an unbounded number of functions as we improve the algorithms, just that we might have 2 once we decide to change something about the algorithm. We need 2 to support both the improved algorithm in Generator and the legacy algorithm in RandomState. The current implementation of the C function would be copied to a new name (`legacy_foo` or whatever), then we'd make RandomState use that frozen copy, then we make the desired modifications to the main function that Generator is referencing (`random_foo`). >> >> Or we could just make those legacy copies now so that people get to use them explicitly under the legacy names, whatever they are, and we can feel more free to modify the main implementations. I suggested this earlier, but convinced myself that it wasn't strictly necessary. But then I admit I was more focused on the Python API stability than any promises about the C/Cython API. >> >> We might end up with more than 2 implementations if we need to change something about the function signature, for whatever reason, and we want to retain C/Cython API compatibility with older code. The C functions aren't necessarily going to be one-to-one to the Generator methods. They're just part of the implementation. So for example, if we wanted to, say, precompute some intermediate values from the given scalar parameters so we don't have to recompute them for each element of the `size`-large requested output, we might do that in one C function and pass those intermediate values as arguments to the C function that does the actual sampling. So we'd have two C functions for that one Generator method, and the sampling C function will not have the same signature as it did before the modification that refactored the work into two functions. In that case, I would not be so strict as to require that `Generator.foo` is one to one with `random_foo`. > > > You're saying "be so strict" as if it were a bad thing, or a major effort. I understand that in some cases a C API can not be evolved in the same way as a Python API, but in the example you're giving here I'd say you want one function to be public, and one private. Making both public just exposes more implementation details for no good reason, and will give us more maintenance issues long-term. > > Anyway, this is not an issue today. If we try to keep Python and C APIs matching, we can deal with possible difficulties with that if and when they arise - should be infrequent. 
> > Cheers, > Ralf > >> >> To your point, though, we don't have to use gratuitously different names when there _is_ a one-to-one relationship. `random_gauss_zig` should be `random_normal`. >> >> -- >> Robert Kern >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at python.org >> https://mail.python.org/mailman/listinfo/numpy-discussion > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion -- Those who don't understand recursion are doomed to repeat it From matti.picus at gmail.com Fri Sep 20 09:02:58 2019 From: matti.picus at gmail.com (Matti Picus) Date: Fri, 20 Sep 2019 16:02:58 +0300 Subject: [Numpy-discussion] Low-level API for Random In-Reply-To: References: Message-ID: <492c1843-0713-bcf9-a2c4-919f47bc32f4@gmail.com> On 20/9/19 2:18 pm, Neal Becker wrote: > I have used C-api in the past, and would like to see a convenient and > stable way to do this. Currently I'm using randomgen, but calling > (from c++) > to the python api. The inefficiency is amortized by generating and > caching batches of results. > > I thought randomgen was supposed to be the future of numpy random, so > I've based on that. > It would be good to have actual users tell us what APIs they need. Are you using the BitGenerators or only the higher level Generator functions? From ndbecker2 at gmail.com Fri Sep 20 09:12:15 2019 From: ndbecker2 at gmail.com (Neal Becker) Date: Fri, 20 Sep 2019 09:12:15 -0400 Subject: [Numpy-discussion] Low-level API for Random In-Reply-To: <492c1843-0713-bcf9-a2c4-919f47bc32f4@gmail.com> References: <492c1843-0713-bcf9-a2c4-919f47bc32f4@gmail.com> Message-ID: I'm using the low-level generator. In this example I need to generate small random integers of defined bit widths (e.g., 2 bit). So I get 64-bit uniform random uintegers, and cache the values, returning them n-bits (e.g. 2 bits) at a time to the caller. On Fri, Sep 20, 2019 at 9:03 AM Matti Picus wrote: > > On 20/9/19 2:18 pm, Neal Becker wrote: > > I have used C-api in the past, and would like to see a convenient and > > stable way to do this. Currently I'm using randomgen, but calling > > (from c++) > > to the python api. The inefficiency is amortized by generating and > > caching batches of results. > > > > I thought randomgen was supposed to be the future of numpy random, so > > I've based on that. > > > > It would be good to have actual users tell us what APIs they need. > > Are you using the BitGenerators or only the higher level Generator > functions? > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion -- Those who don't understand recursion are doomed to repeat it From robert.kern at gmail.com Fri Sep 20 10:08:19 2019 From: robert.kern at gmail.com (Robert Kern) Date: Fri, 20 Sep 2019 10:08:19 -0400 Subject: [Numpy-discussion] Low-level API for Random In-Reply-To: References: Message-ID: On Fri, Sep 20, 2019 at 6:09 AM Ralf Gommers wrote: > > > On Fri, Sep 20, 2019 at 5:29 AM Robert Kern wrote: > >> >> We might end up with more than 2 implementations if we need to change >> something about the function signature, for whatever reason, and we want to >> retain C/Cython API compatibility with older code. The C functions aren't >> necessarily going to be one-to-one to the Generator methods. 
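As an aside, a minimal Python sketch of the caching scheme Neal describes above: draw 64-bit words, then peel off n bits at a time. The class and method names are hypothetical; the real version sits in C++ against the bit generator interface.

```
import numpy as np

class BitCache:
    # Illustrative only: the real version would call the bit
    # generator from C++ and amortize the Python-call overhead.
    def __init__(self, seed=None):
        self._rng = np.random.default_rng(seed)
        self._word = 0
        self._bits_left = 0

    def next_bits(self, n):
        if self._bits_left < n:
            # Refill the cache with one 64-bit uniform draw.
            self._word = int(self._rng.integers(0, 2**64, dtype=np.uint64))
            self._bits_left = 64
        value = self._word & ((1 << n) - 1)
        self._word >>= n
        self._bits_left -= n
        return value

cache = BitCache(seed=12345)
two_bit_samples = [cache.next_bits(2) for _ in range(10)]
```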
They're just >> part of the implementation. So for example, if we wanted to, say, >> precompute some intermediate values from the given scalar parameters so we >> don't have to recompute them for each element of the `size`-large requested >> output, we might do that in one C function and pass those intermediate >> values as arguments to the C function that does the actual sampling. So >> we'd have two C functions for that one Generator method, and the sampling C >> function will not have the same signature as it did before the modification >> that refactored the work into two functions. In that case, I would not be >> so strict as to require that `Generator.foo` is one to one with >> `random_foo`. >> > > You're saying "be so strict" as if it were a bad thing, or a major effort. > I am. It's an unnecessary limitation on the C API without a corresponding benefit. Your original complaint is much more directly addressed by a "don't gratuitously name related C functions differently than the Python methods they implement" rule (e.g. "gauss" instead of "normal"). > I understand that in some cases a C API can not be evolved in the same way > as a Python API, but in the example you're giving here I'd say you want one > function to be public, and one private. Making both public just exposes > more implementation details for no good reason, and will give us more > maintenance issues long-term. > Not at all. In this example, neither one of those functions is useful without the other. If one is public, both must be. -- Robert Kern -------------- next part -------------- An HTML attachment was scrubbed... URL: From sebastian at sipsolutions.net Fri Sep 20 16:07:31 2019 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Fri, 20 Sep 2019 13:07:31 -0700 Subject: [Numpy-discussion] DType Roadmap/NEP Discussion In-Reply-To: <235dc0a0-0ef2-8b06-3a5d-05c2746ae750@gmail.com> References: <5f00290851aa9418215c4bd0fea3378bf94dcc79.camel@sipsolutions.net> <235dc0a0-0ef2-8b06-3a5d-05c2746ae750@gmail.com> Message-ID: On Thu, 2019-09-19 at 21:35 +0300, Matti Picus wrote: > On 19/9/19 2:34 am, Sebastian Berg wrote: > > Hi all, > > > > to try and make some progress towards a decision since the broad > > design > > is pretty much settling from my side. I am thinking about making a > > meeting, and suggest Monday at 11am Pacific Time (I am open to > > other > > times though). > > > > My hope is to get everyone interested on board, so that we can make > > an > > informed decision about the general direction very soon. So just > > reach > > out, or discuss on the mailing list as well. > > > > The current draft for an NEP is here: > > https://hackmd.io/kxuh15QGSjueEKft5SaMug?both > > Mon Sept 23 sounds good. Please reach out to the possible consumers > of > the API to get wider input. > > - Pandas > > - Astropy > > - Numba > > - ??? > > > It may be a bit too short notice, but it seems like there is enough > to > talk about even if only the NumPy community show up. > > > > Where/how will the meeting take place? > Lets go for the typical zoom link. I will add a few points later probably, but to be able to update things easily, see: https://hackmd.io/5S3ADAdOSIeaUwFxlvajMA Best, Sebastian > Matti > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- A non-text attachment was scrubbed... 
Name: signature.asc Type: application/pgp-signature Size: 833 bytes Desc: This is a digitally signed message part URL: From ralf.gommers at gmail.com Fri Sep 20 23:32:04 2019 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Fri, 20 Sep 2019 20:32:04 -0700 Subject: [Numpy-discussion] Low-level API for Random In-Reply-To: References: Message-ID: On Fri, Sep 20, 2019 at 7:09 AM Robert Kern wrote: > > > On Fri, Sep 20, 2019 at 6:09 AM Ralf Gommers > wrote: > >> >> >> On Fri, Sep 20, 2019 at 5:29 AM Robert Kern >> wrote: >> >>> >>> We might end up with more than 2 implementations if we need to change >>> something about the function signature, for whatever reason, and we want to >>> retain C/Cython API compatibility with older code. The C functions aren't >>> necessarily going to be one-to-one to the Generator methods. They're just >>> part of the implementation. So for example, if we wanted to, say, >>> precompute some intermediate values from the given scalar parameters so we >>> don't have to recompute them for each element of the `size`-large requested >>> output, we might do that in one C function and pass those intermediate >>> values as arguments to the C function that does the actual sampling. So >>> we'd have two C functions for that one Generator method, and the sampling C >>> function will not have the same signature as it did before the modification >>> that refactored the work into two functions. In that case, I would not be >>> so strict as to require that `Generator.foo` is one to one with >>> `random_foo`. >>> >> >> You're saying "be so strict" as if it were a bad thing, or a major effort. >> > > I am. It's an unnecessary limitation on the C API without a corresponding > benefit. Your original complaint > It's not a "complaint". We're having this discussion because we shipped a partial API in 1.17.0 that we will now have to go back and either take out or clean up in 1.17.3. The PR for the new numpy.random grew so large that we didn't notice or discuss that (such things happen, no big deal - we have limited reviewer bandwidth). So now that we do, it makes sense to actually think about what needs to be in the API. For now I think that's only the parts that are matching the Python API plus what is needed to use them from C/Cython. Future additions require similar review and criteria as adding to the Python API and the existing NumPy C API. To me, your example seems to (a) not deal with API stability, and (b) expose too much implementation detail. To be clear about the actual status, we: - shipped one header file (bitgen.h) - shipped two pxd files (common.pxd, bit_generator.pxd) - removed a header file we used to ship (randomkit.h) - did not ship distributions.pxd, bounded_integers.pxd, legacy_distributions.pxd or related header files bit_generator.pxd looks fine, common.pxd contains parts that shouldn't be there. I think the intent was to ship at least distributions.pxd/h, and perhaps all of those pxd files. is much more directly addressed by a "don't gratuitously name related C > functions differently than the Python methods they implement" rule (e.g. > "gauss" instead of "normal"). > > >> I understand that in some cases a C API can not be evolved in the same >> way as a Python API, but in the example you're giving here I'd say you want >> one function to be public, and one private. Making both public just exposes >> more implementation details for no good reason, and will give us more >> maintenance issues long-term. >> > > Not at all. 
In this example, neither one of those functions is useful > without the other. If one is public, both must be. > If neither one is useful without the other, it sounds like both should be private and the third one that puts them together - the one that didn't change signature and implements `Generator.foo` - is the public one. Cheers, Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From robert.kern at gmail.com Sat Sep 21 00:30:41 2019 From: robert.kern at gmail.com (Robert Kern) Date: Sat, 21 Sep 2019 00:30:41 -0400 Subject: [Numpy-discussion] Low-level API for Random In-Reply-To: References: Message-ID: On Fri, Sep 20, 2019 at 11:33 PM Ralf Gommers wrote: > > > On Fri, Sep 20, 2019 at 7:09 AM Robert Kern wrote: > >> >> >> On Fri, Sep 20, 2019 at 6:09 AM Ralf Gommers >> wrote: >> >>> >>> >>> On Fri, Sep 20, 2019 at 5:29 AM Robert Kern >>> wrote: >>> >>>> >>>> We might end up with more than 2 implementations if we need to change >>>> something about the function signature, for whatever reason, and we want to >>>> retain C/Cython API compatibility with older code. The C functions aren't >>>> necessarily going to be one-to-one to the Generator methods. They're just >>>> part of the implementation. So for example, if we wanted to, say, >>>> precompute some intermediate values from the given scalar parameters so we >>>> don't have to recompute them for each element of the `size`-large requested >>>> output, we might do that in one C function and pass those intermediate >>>> values as arguments to the C function that does the actual sampling. So >>>> we'd have two C functions for that one Generator method, and the sampling C >>>> function will not have the same signature as it did before the modification >>>> that refactored the work into two functions. In that case, I would not be >>>> so strict as to require that `Generator.foo` is one to one with >>>> `random_foo`. >>>> >>> >>> You're saying "be so strict" as if it were a bad thing, or a major >>> effort. >>> >> >> I am. It's an unnecessary limitation on the C API without a corresponding >> benefit. Your original complaint >> > > It's not a "complaint". > Please forgive me. That word choice was not intended to be dismissive. I don't view "complaints" as minor annoyances that the "complainer" should just shut up and deal with, or that the "complainer" is just being annoying, but I can see how it came across that I might. Please continue as if I said "The problem you originally noted...". It's a real problem that needs to be addressed. We just have different thoughts on exactly what is needed to address it. > We're having this discussion because we shipped a partial API in 1.17.0 > that we will now have to go back and either take out or clean up in 1.17.3. > The PR for the new numpy.random grew so large that we didn't notice or > discuss that (such things happen, no big deal - we have limited reviewer > bandwidth). So now that we do, it makes sense to actually think about what > needs to be in the API. For now I think that's only the parts that are > matching the Python API plus what is needed to use them from C/Cython. > Future additions require similar review and criteria as adding to the > Python API and the existing NumPy C API. To me, your example seems to (a) > not deal with API stability, and (b) expose too much implementation detail. 
> > To be clear about the actual status, we: > - shipped one header file (bitgen.h) > - shipped two pxd files (common.pxd, bit_generator.pxd) > - removed a header file we used to ship (randomkit.h) > - did not ship distributions.pxd, bounded_integers.pxd, > legacy_distributions.pxd or related header files > > bit_generator.pxd looks fine, common.pxd contains parts that shouldn't be > there. I think the intent was to ship at least distributions.pxd/h, and > perhaps all of those pxd files. > > is much more directly addressed by a "don't gratuitously name related C >> functions differently than the Python methods they implement" rule (e.g. >> "gauss" instead of "normal"). >> >> >>> I understand that in some cases a C API can not be evolved in the same >>> way as a Python API, but in the example you're giving here I'd say you want >>> one function to be public, and one private. Making both public just exposes >>> more implementation details for no good reason, and will give us more >>> maintenance issues long-term. >>> >> >> Not at all. In this example, neither one of those functions is useful >> without the other. If one is public, both must be. >> > > If neither one is useful without the other, it sounds like both should be > private and the third one that puts them together - the one that didn't > change signature and implements `Generator.foo` - is the public one. > That defeats the point of using the C API in this instance, though. The reason it got split into two (in this plausible hypothetical; I'm thinking of the binomial implementation here, which caches these intermediates in a passed-in struct) is because in C you want to call them in different ways (in this case, the prep function once and the sampling function many times). It's not that you always call them in lockstep pairs: `prep(); sample(); prep(); sample();`. A C function that combines them defeats the efficiency that one wanted to gain by using the C API. The C API has different needs than the Python API, because the Python API has a lot more support from the Python language and numpy data structures to be able to jam a lot of functionality into a single function signature that C just doesn't give us. The purpose of the C API is not just to avoid Python function call overhead. If there's a reason that the Generator method needs the implementation split up into multiple C functions, that's a really strong signal that *other* C code using the C API will need that same split. It's not just an implementation detail; it's a documented use case. Given the prevalence of Cython, it's actually really easy to use the Python API pretty easily in "C", so it's actually a huge waste if the C API matches the Python API too closely. The power and utility of the C API will be in how it *differs* from the Python API. For the distribution methods, this is largely in how it lets you sample one number at a time without bothering with the numpy and broadcasting overhead. That's the driving motivation for having a C API for the distributions, and the algorithms that we choose have consequences for the C API that will best satisfy that motivation. The issue that this raises with API stability is that if you require a one-to-one match between the C API function and the Generator method, we can never change the function signature of the C function. That's going to forbid us from moving from an algorithm that doesn't need any precomputation to one that does. 
That precomputation either requires a 2-function dance or a new argument to keep the cached values (c.f. `random_binomial()`), so it's always going to affect the API. To use such a new algorithm, we'll have to add a new function or two to the C API and document the deprecation of the older API function. We can't just swap it in under the same name, even if the new function is standalone. That's a significant constraint on future development when the main issue that led to the suggestion is that the names were sometimes gratuitously different between the C API and the Python API, which hindered discoverability. We can fix *that* problem easily without constraining the universe of algorithms that we might consider using in the future. -- Robert Kern -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Sat Sep 21 00:47:49 2019 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Fri, 20 Sep 2019 21:47:49 -0700 Subject: [Numpy-discussion] Low-level API for Random In-Reply-To: References: Message-ID: On Fri, Sep 20, 2019 at 9:31 PM Robert Kern wrote: > On Fri, Sep 20, 2019 at 11:33 PM Ralf Gommers > wrote: > >> >> >> On Fri, Sep 20, 2019 at 7:09 AM Robert Kern >> wrote: >> >>> >>> >>> On Fri, Sep 20, 2019 at 6:09 AM Ralf Gommers >>> wrote: >>> >>>> >>>> >>>> On Fri, Sep 20, 2019 at 5:29 AM Robert Kern >>>> wrote: >>>> >>>>> >>>>> We might end up with more than 2 implementations if we need to change >>>>> something about the function signature, for whatever reason, and we want to >>>>> retain C/Cython API compatibility with older code. The C functions aren't >>>>> necessarily going to be one-to-one to the Generator methods. They're just >>>>> part of the implementation. So for example, if we wanted to, say, >>>>> precompute some intermediate values from the given scalar parameters so we >>>>> don't have to recompute them for each element of the `size`-large requested >>>>> output, we might do that in one C function and pass those intermediate >>>>> values as arguments to the C function that does the actual sampling. So >>>>> we'd have two C functions for that one Generator method, and the sampling C >>>>> function will not have the same signature as it did before the modification >>>>> that refactored the work into two functions. In that case, I would not be >>>>> so strict as to require that `Generator.foo` is one to one with >>>>> `random_foo`. >>>>> >>>> >>>> You're saying "be so strict" as if it were a bad thing, or a major >>>> effort. >>>> >>> >>> I am. It's an unnecessary limitation on the C API without a >>> corresponding benefit. Your original complaint >>> >> >> It's not a "complaint". >> > > Please forgive me. That word choice was not intended to be dismissive. I > don't view "complaints" as minor annoyances that the "complainer" should > just shut up and deal with, or that the "complainer" is just being > annoying, but I can see how it came across that I might. Please continue as > if I said "The problem you originally noted...". It's a real problem that > needs to be addressed. We just have different thoughts on exactly what is > needed to address it. > Okay, thank you:) > >> We're having this discussion because we shipped a partial API in 1.17.0 >> that we will now have to go back and either take out or clean up in 1.17.3. >> The PR for the new numpy.random grew so large that we didn't notice or >> discuss that (such things happen, no big deal - we have limited reviewer >> bandwidth). 
So now that we do, it makes sense to actually think about what >> needs to be in the API. For now I think that's only the parts that are >> matching the Python API plus what is needed to use them from C/Cython. >> Future additions require similar review and criteria as adding to the >> Python API and the existing NumPy C API. To me, your example seems to (a) >> not deal with API stability, and (b) expose too much implementation detail. >> >> To be clear about the actual status, we: >> - shipped one header file (bitgen.h) >> - shipped two pxd files (common.pxd, bit_generator.pxd) >> - removed a header file we used to ship (randomkit.h) >> - did not ship distributions.pxd, bounded_integers.pxd, >> legacy_distributions.pxd or related header files >> >> bit_generator.pxd looks fine, common.pxd contains parts that shouldn't be >> there. I think the intent was to ship at least distributions.pxd/h, and >> perhaps all of those pxd files. >> >> is much more directly addressed by a "don't gratuitously name related C >>> functions differently than the Python methods they implement" rule (e.g. >>> "gauss" instead of "normal"). >>> >>> >>>> I understand that in some cases a C API can not be evolved in the same >>>> way as a Python API, but in the example you're giving here I'd say you want >>>> one function to be public, and one private. Making both public just exposes >>>> more implementation details for no good reason, and will give us more >>>> maintenance issues long-term. >>>> >>> >>> Not at all. In this example, neither one of those functions is useful >>> without the other. If one is public, both must be. >>> >> >> If neither one is useful without the other, it sounds like both should be >> private and the third one that puts them together - the one that didn't >> change signature and implements `Generator.foo` - is the public one. >> > > That defeats the point of using the C API in this instance, though. The > reason it got split into two (in this plausible hypothetical; I'm thinking > of the binomial implementation here, which caches these intermediates in a > passed-in struct) is because in C you want to call them in different ways > (in this case, the prep function once and the sampling function many > times). It's not that you always call them in lockstep pairs: `prep(); > sample(); prep(); sample();`. A C function that combines them defeats the > efficiency that one wanted to gain by using the C API. The C API has > different needs than the Python API, because the Python API has a lot more > support from the Python language and numpy data structures to be able to > jam a lot of functionality into a single function signature that C just > doesn't give us. The purpose of the C API is not just to avoid Python > function call overhead. If there's a reason that the Generator method needs > the implementation split up into multiple C functions, that's a really > strong signal that *other* C code using the C API will need that same > split. It's not just an implementation detail; it's a documented use case. > > Given the prevalence of Cython, it's actually really easy to use the > Python API pretty easily in "C", so it's actually a huge waste if the C API > matches the Python API too closely. The power and utility of the C API will > be in how it *differs* from the Python API. For the distribution methods, > this is largely in how it lets you sample one number at a time without > bothering with the numpy and broadcasting overhead. 
That's the driving > motivation for having a C API for the distributions, and the algorithms > that we choose have consequences for the C API that will best satisfy that > motivation. > > The issue that this raises with API stability is that if you require a > one-to-one match between the C API function and the Generator method, we > can never change the function signature of the C function. That's going to > forbid us from moving from an algorithm that doesn't need any > precomputation to one that does. That precomputation either requires a > 2-function dance or a new argument to keep the cached values (c.f. > `random_binomial()`), so it's always going to affect the API. To use such a > new algorithm, we'll have to add a new function or two to the C API and > document the deprecation of the older API function. We can't just swap it > in under the same name, even if the new function is standalone. That's a > significant constraint on future development when the main issue that led > to the suggestion is that the names were sometimes gratuitously different > between the C API and the Python API, which hindered discoverability. We > can fix *that* problem easily without constraining the universe of > algorithms that we might consider using in the future. > Fair enough; now the use case is clear to me. In summary: there may be real reasons to deviate and add more functions; let's do so if and when it makes sense to. Cheers, Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Sat Sep 21 01:29:28 2019 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Fri, 20 Sep 2019 22:29:28 -0700 Subject: [Numpy-discussion] User Stories for https://numpy.org In-Reply-To: References: Message-ID: On Wed, Sep 18, 2019 at 9:52 AM Inessa Pawson wrote: > The NumPy web team has begun redesigning https://numpy.org determined to > transform the website into a welcoming and useful digital hub of all things > NumPy. We are inviting all members of our large and diverse community to > submit their user stories to help us fulfill our mission. > Thanks Inessa. I hope to see some user stories in particular from stakeholder groups that we may not be thinking about yet. Our first focus is probably something like: beginning user, advanced user, contributor. Beyond that there are groups like educators and packagers that I hope we can include specific content for soon. I'm sure we're still missing some groups, would love to hear specific needs or previous unsuccessful/unsatisfactory attempts at engaging with NumPy. Note that we're keeping track of these user stories in https://github.com/numpy/numpy.org/issues/42 Cheers, Ralf > *What are we looking for?* > > In simple, concise terms, a user story describes what a user needs to > accomplish while visiting a website. Anyone who reads the user story must > be able to understand why the user needs the functionality, and what is > required to implement the story. User stories must have acceptance > criteria. The shorter the story the better. > > > *Examples of good user stories* > > 1. Lotte is a library author that depends on NumPy. She is looking for > information about major changes and a release date of the next version of > NumPy. She would like to easily find it on the website instead of > contacting the core team. > > > 2. Yu Yan was introduced to NumPy in her first week of the Foundations of > Data Science class. She is looking for a NumPy tutorial for absolute > beginners in Mandarin. > > > 3.
Tiago is a software developer. By day, he builds enterprise > applications for a Fortune 100 company. By night, he cultivates his > academic interests in statistics and computer science using various Python > libraries. Tiago has an idea for a new NumPy feature and would like to > implement it. He is looking for information on how to contact the person(s) > in charge of such decisions. > > > *Please note* that at this stage of the numpy.org redesign our focus is > not on expanding or improving the documentation but, rather, developing > high-level content to provide information about the project to a multitude > of stakeholders. > > -- > Every good wish, > *Inessa Pawson* > NumPy Web Team > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From warren.weckesser at gmail.com Sun Sep 22 23:44:45 2019 From: warren.weckesser at gmail.com (Warren Weckesser) Date: Sun, 22 Sep 2019 23:44:45 -0400 Subject: [Numpy-discussion] [pydata] NumPy proposal to remove the financial functions. In-Reply-To: <8983e694-7067-44cb-a35f-5c173d44c160@googlegroups.com> References: <8458a06f-ead7-4f7e-b288-4a15a6002482@googlegroups.com> <8983e694-7067-44cb-a35f-5c173d44c160@googlegroups.com> Message-ID: On 9/21/19, Brendan Barnwell wrote: > Hi Warren, > > I'm somewhat late to this discussion but I too have used the financial > functions. I looked at the discussion and the NEP and one thing I don't > understand is how the maintenance burden is alleviated if the functions are > > moved to a separate library. Is the intent of the Numpy devs to just > "dump" these functions into numpy_financial and then not maintain them? If > > not, what is achieved by moving them out of numpy? Brendan, There have been some more recent comments on the github issue that are relevant; take a look: https://github.com/numpy/numpy/issues/2880 It is true that when the functions are moved to numpy_financial, they will receive less attention from the core NumPy developers. Indeed, that is the point of the move. As you can see from the comments in the github issue and those quoted in the NEP, there is no interest among the current developers in maintaining these functions in NumPy. By having a smaller and more focused library that is explicitly for financial functions, it is possible that new developers with greater interest and expertise in that domain will be motivated to contribute. See, for example, Graham Duncan's recent comments in the github issue. It remains to be seen whether we'll end up with a significantly *better* library for financial calculations once the transition is complete. For the most visibility among the NumPy developers, it would be best to continue the conversation in a NumPy venue, either the github issue or the NumPy mailing list. I've cc'ed this email to the NumPy mailing list. Warren > > On Thursday, September 19, 2019 at 8:25:52 AM UTC-7, Warren Weckesser > wrote: >> >> On 9/8/19, Warren Weckesser > wrote: >> > NumPy is considering a NEP (NumPy Enhancement Proposal) that proposes >> the >> > deprecation and ultimate removal of the financial functions from NumPy. >> > >> > The functions would be moved to an independent library. 
The mailing >> list >> > discussion of this proposal is at >> > >> > >> > >> http://numpy-discussion.10968.n7.nabble.com/NEP-32-Remove-the-financial-functions-from-NumPy-tt47456.html >> >> > >> > or >> > >> > >> > >> https://mail.python.org/pipermail/numpy-discussion/2019-September/079965.html >> >> > >> > The first message in that thread includes the proposed NEP. >> > >> > There have been a couple suggestions to ask about this on the Pandas >> > mailing list. Contributions to the thread in the numpy-discussion >> mailing >> > list would be appreciated! >> >> >> FYI: The proposal to accept the NEP to remove the financial functions >> has been made on the NumPy-Discussion mailing list: >> >> https://mail.python.org/pipermail/numpy-discussion/2019-September/080074.html >> >> >> Warren >> >> > >> > Thanks, >> > >> > Warren >> > >> > -- >> > You received this message because you are subscribed to the Google >> Groups >> > "PyData" group. >> > To unsubscribe from this group and stop receiving emails from it, send >> an >> > email to pyd... at googlegroups.com . >> > To view this discussion on the web visit >> > >> https://groups.google.com/d/msgid/pydata/8458a06f-ead7-4f7e-b288-4a15a6002482%40googlegroups.com. >> >> >> > >> > > -- > You received this message because you are subscribed to the Google Groups > "PyData" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to pydata+unsubscribe at googlegroups.com. > To view this discussion on the web visit > https://groups.google.com/d/msgid/pydata/8983e694-7067-44cb-a35f-5c173d44c160%40googlegroups.com. > From anntzer.lee at gmail.com Mon Sep 23 05:56:25 2019 From: anntzer.lee at gmail.com (Antony Lee) Date: Mon, 23 Sep 2019 11:56:25 +0200 Subject: [Numpy-discussion] ANN: mplcairo 0.2 release Message-ID: Dear all, I am pleased to announce the release of mplcairo 0.2. mplcairo is a Matplotlib backend based on the well-known cairo library, supporting output to both raster (including interactively) and vector formats. In other words, it provides the functionality of Matplotlib's {,qt5,gtk3,wx,tk,macos}{agg,cairo}, pdf, ps, and svg backends. Per Matplotlib's standard API, the backend can be selected by calling matplotlib.use("module://mplcairo.qt") or setting your MPLBACKEND environment variable to `module://mplcairo.qt` for Qt5, and similarly for other toolkits. mplcairo 0.2 adds support for cairo 1.17.2's high-precision floating point surfaces, simplifies the use of custom compositing operators (see `examples/operators.py`), a few other features listed in the changelog, as well as the usual bugfixes over 0.1. Enjoy, Antony Lee -------------- next part -------------- An HTML attachment was scrubbed... URL: From tom.w.augspurger at gmail.com Mon Sep 23 07:39:51 2019 From: tom.w.augspurger at gmail.com (Tom Augspurger) Date: Mon, 23 Sep 2019 06:39:51 -0500 Subject: [Numpy-discussion] DType Roadmap/NEP Discussion In-Reply-To: References: <5f00290851aa9418215c4bd0fea3378bf94dcc79.camel@sipsolutions.net> <235dc0a0-0ef2-8b06-3a5d-05c2746ae750@gmail.com> Message-ID: On Fri, Sep 20, 2019 at 3:10 PM Sebastian Berg wrote: > On Thu, 2019-09-19 at 21:35 +0300, Matti Picus wrote: > > On 19/9/19 2:34 am, Sebastian Berg wrote: > > > Hi all, > > > > > > to try and make some progress towards a decision since the broad > > > design > > > is pretty much settling from my side. I am thinking about making a > > > meeting, and suggest Monday at 11am Pacific Time (I am open to > > > other > > > times though). 
> > > > > > My hope is to get everyone interested on board, so that we can make > > > an > > > informed decision about the general direction very soon. So just > > > reach > > > out, or discuss on the mailing list as well. > > > > > > The current draft for an NEP is here: > > > https://hackmd.io/kxuh15QGSjueEKft5SaMug?both > > > > Mon Sept 23 sounds good. Please reach out to the possible consumers > > of > > the API to get wider input. > > > > - Pandas > > > > - Astropy > > > > - Numba > > > > - ??? > > > > > > It may be a bit too short notice, but it seems like there is enough > > to > > talk about even if only the NumPy community show up. > > > > > > > > Where/how will the meeting take place? > > > > Lets go for the typical zoom link. I will add a few points later > probably, but to be able to update things easily, see: > > https://hackmd.io/5S3ADAdOSIeaUwFxlvajMA Is there a time set for this meeting? I'll try to attend from the pandas side of things. > > Best, > > Sebastian > > > > Matti > > > > _______________________________________________ > > NumPy-Discussion mailing list > > NumPy-Discussion at python.org > > https://mail.python.org/mailman/listinfo/numpy-discussion > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Mon Sep 23 07:44:42 2019 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Mon, 23 Sep 2019 13:44:42 +0200 Subject: [Numpy-discussion] DType Roadmap/NEP Discussion In-Reply-To: References: <5f00290851aa9418215c4bd0fea3378bf94dcc79.camel@sipsolutions.net> <235dc0a0-0ef2-8b06-3a5d-05c2746ae750@gmail.com> Message-ID: On Mon, Sep 23, 2019 at 1:40 PM Tom Augspurger wrote: > > > On Fri, Sep 20, 2019 at 3:10 PM Sebastian Berg > wrote: > >> On Thu, 2019-09-19 at 21:35 +0300, Matti Picus wrote: >> > On 19/9/19 2:34 am, Sebastian Berg wrote: >> > > Hi all, >> > > >> > > to try and make some progress towards a decision since the broad >> > > design >> > > is pretty much settling from my side. I am thinking about making a >> > > meeting, and suggest Monday at 11am Pacific Time (I am open to >> > > other >> > > times though). >> > > >> > > My hope is to get everyone interested on board, so that we can make >> > > an >> > > informed decision about the general direction very soon. So just >> > > reach >> > > out, or discuss on the mailing list as well. >> > > >> > > The current draft for an NEP is here: >> > > https://hackmd.io/kxuh15QGSjueEKft5SaMug?both >> > >> > Mon Sept 23 sounds good. Please reach out to the possible consumers >> > of >> > the API to get wider input. >> > >> > - Pandas >> > >> > - Astropy >> > >> > - Numba >> > >> > - ??? >> > >> > >> > It may be a bit too short notice, but it seems like there is enough >> > to >> > talk about even if only the NumPy community show up. >> > >> > >> > >> > Where/how will the meeting take place? >> > >> >> Lets go for the typical zoom link. I will add a few points later >> probably, but to be able to update things easily, see: >> >> https://hackmd.io/5S3ADAdOSIeaUwFxlvajMA > > > Is there a time set for this meeting? I'll try to attend from the pandas > side of things. > The HackMD link above says 11am PST (so ~6 hours from now), and also contains a Zoom link to join the call. Cheers, Ralf -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From sebastian at sipsolutions.net Mon Sep 23 13:43:40 2019 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Mon, 23 Sep 2019 10:43:40 -0700 Subject: [Numpy-discussion] DType Roadmap/NEP Discussion In-Reply-To: References: <5f00290851aa9418215c4bd0fea3378bf94dcc79.camel@sipsolutions.net> <235dc0a0-0ef2-8b06-3a5d-05c2746ae750@gmail.com> Message-ID: <64783a66a2fccedeaff7e8f8a645bcbc1caeee5b.camel@sipsolutions.net> On Mon, 2019-09-23 at 13:44 +0200, Ralf Gommers wrote: > > > On Mon, Sep 23, 2019 at 1:40 PM Tom Augspurger < > tom.w.augspurger at gmail.com> wrote: > > > > On Fri, Sep 20, 2019 at 3:10 PM Sebastian Berg < > > sebastian at sipsolutions.net> wrote: > > > On Thu, 2019-09-19 at 21:35 +0300, Matti Picus wrote: > > > > On 19/9/19 2:34 am, Sebastian Berg wrote: > > > > > Hi all, > > > > > > > > > > to try and make some progress towards a decision since the > > > broad > > > > > design > > > > > is pretty much settling from my side. I am thinking about > > > making a > > > > > meeting, and suggest Monday at 11am Pacific Time (I am open > > > to > > > > > other > > > > > times though). > > > > > > > > > > My hope is to get everyone interested on board, so that we > > > can make > > > > > an > > > > > informed decision about the general direction very soon. So > > > just > > > > > reach > > > > > out, or discuss on the mailing list as well. > > > > > > > > > > The current draft for an NEP is here: > > > > > https://hackmd.io/kxuh15QGSjueEKft5SaMug?both > > > > > > > > Mon Sept 23 sounds good. Please reach out to the possible > > > consumers > > > > of > > > > the API to get wider input. > > > > > > > > - Pandas > > > > > > > > - Astropy > > > > > > > > - Numba > > > > > > > > - ??? > > > > > > > > > > > > It may be a bit too short notice, but it seems like there is > > > enough > > > > to > > > > talk about even if only the NumPy community show up. > > > > > > > > > > > > > > > > Where/how will the meeting take place? > > > > > > > > > > Lets go for the typical zoom link. I will add a few points later > > > probably, but to be able to update things easily, see: > > > > > > https://hackmd.io/5S3ADAdOSIeaUwFxlvajMA > > > > Is there a time set for this meeting? I'll try to attend from the > > pandas side of things. > > > > The HackMD link above says 11am PST (so ~6 hours from now), and also > contains a Zoom link to join the call. > Just to let you know, unfortunately our room is in use, so we will have to use a different zoom link: https://zoom.us/j/6398421986 (the HackMD is updated) Cheers, Sebastian > Cheers, > Ralf > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 833 bytes Desc: This is a digitally signed message part URL: From sebastian at sipsolutions.net Mon Sep 23 16:04:52 2019 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Mon, 23 Sep 2019 13:04:52 -0700 Subject: [Numpy-discussion] DType Roadmap/NEP Discussion In-Reply-To: <64783a66a2fccedeaff7e8f8a645bcbc1caeee5b.camel@sipsolutions.net> References: <5f00290851aa9418215c4bd0fea3378bf94dcc79.camel@sipsolutions.net> <235dc0a0-0ef2-8b06-3a5d-05c2746ae750@gmail.com> <64783a66a2fccedeaff7e8f8a645bcbc1caeee5b.camel@sipsolutions.net> Message-ID: <2d82f2474da837ec438e91c79a3f7b9b1bb2e4e6.camel@sipsolutions.net> Since it probably got lost. 
I am currently developing things at: https://github.com/seberg/numpy/tree/dtypemeta Please do not expect the tidiest code at the moment. The public API is not yet available, and currently mainly at a proof-of-concept stage, that things like: * Promotion * Casting * Array creation ? coercion `np.array(...)` (fairly far along) * AbstractDTypes for value based casting work. Also of course generally having a DTypeMeta class and " On Mon, 2019-09-23 at 13:44 +0200, Ralf Gommers wrote: > > > > On Mon, Sep 23, 2019 at 1:40 PM Tom Augspurger < > > tom.w.augspurger at gmail.com> wrote: > > > On Fri, Sep 20, 2019 at 3:10 PM Sebastian Berg < > > > sebastian at sipsolutions.net> wrote: > > > > On Thu, 2019-09-19 at 21:35 +0300, Matti Picus wrote: > > > > > On 19/9/19 2:34 am, Sebastian Berg wrote: > > > > > > Hi all, > > > > > > > > > > > > to try and make some progress towards a decision since the > > > > broad > > > > > > design > > > > > > is pretty much settling from my side. I am thinking about > > > > making a > > > > > > meeting, and suggest Monday at 11am Pacific Time (I am open > > > > to > > > > > > other > > > > > > times though). > > > > > > > > > > > > My hope is to get everyone interested on board, so that we > > > > can make > > > > > > an > > > > > > informed decision about the general direction very soon. So > > > > just > > > > > > reach > > > > > > out, or discuss on the mailing list as well. > > > > > > > > > > > > The current draft for an NEP is here: > > > > > > https://hackmd.io/kxuh15QGSjueEKft5SaMug?both > > > > > > > > > > Mon Sept 23 sounds good. Please reach out to the possible > > > > consumers > > > > > of > > > > > the API to get wider input. > > > > > > > > > > - Pandas > > > > > > > > > > - Astropy > > > > > > > > > > - Numba > > > > > > > > > > - ??? > > > > > > > > > > > > > > > It may be a bit too short notice, but it seems like there is > > > > enough > > > > > to > > > > > talk about even if only the NumPy community show up. > > > > > > > > > > > > > > > > > > > > Where/how will the meeting take place? > > > > > > > > > > > > > Lets go for the typical zoom link. I will add a few points > > > > later > > > > probably, but to be able to update things easily, see: > > > > > > > > https://hackmd.io/5S3ADAdOSIeaUwFxlvajMA > > > > > > Is there a time set for this meeting? I'll try to attend from the > > > pandas side of things. > > > > > > > The HackMD link above says 11am PST (so ~6 hours from now), and > > also > > contains a Zoom link to join the call. > > > > Just to let you know, unfortunately our room is in use, so we will > have > to use a different zoom link: https://zoom.us/j/6398421986 > (the HackMD is updated) > > Cheers, > > Sebastian > > > > Cheers, > > Ralf > > _______________________________________________ > > NumPy-Discussion mailing list > > NumPy-Discussion at python.org > > https://mail.python.org/mailman/listinfo/numpy-discussion > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion -------------- next part -------------- A non-text attachment was scrubbed... 
From eric at depagne.org Tue Sep 24 10:19:08 2019 From: eric at depagne.org (=?ISO-8859-1?Q?=C9ric?= Depagne) Date: Tue, 24 Sep 2019 16:19:08 +0200 Subject: [Numpy-discussion] Data filtering with np.genfromtxt Message-ID: <2797982.PVNr7dFt1b@portable> Hi all, I am reading a large csv file that has 8.5 million lines and 216 columns, using genfromtxt. I'm not interested in all of the 216 columns, so I filter them out using the "usecols" and "converters" parameters. That works very well, but in my original large file, not all of the columns I extract are filled with values. As expected in these cases, genfromtxt replaces them by nan, and thus, in the final array, there are rows that contain these nans. I'd like to know if there is a way to filter out, at the genfromtxt level, the lines that contain these nans, so that they do not appear in my final array. I'd like to have something like: genfromtxt extracts the line using the parameters I need. If the extracted line contains a NaN, do nothing and process the next line. If it has no NaNs, add it to the output array as usual. I could of course remove from the array created by genfromtxt() all the rows that contain nans (and x[~np.isnan(x).any(axis=1)] does it nicely), but I'd like to be able to fix the size of the output array in advance. The idea is that I can get, for instance, the first 10000 (or any number) lines of the input file that contain all the columns I need, not just the first 10000 lines. I've found a few examples on SO that do some filtering, but the ones I've found do not process the extracted lines. Any help appreciated. Éric. -- An azerty keyboard is worth two ---------------------------------------------------------- Éric Depagne -------------- next part -------------- An HTML attachment was scrubbed... URL: From cimrman3 at ntc.zcu.cz Tue Sep 24 10:23:07 2019 From: cimrman3 at ntc.zcu.cz (Robert Cimrman) Date: Tue, 24 Sep 2019 16:23:07 +0200 Subject: [Numpy-discussion] ANN: SfePy 2019.3 Message-ID: I am pleased to announce release 2019.3 of SfePy. Description ----------- SfePy (simple finite elements in Python) is software for solving systems of coupled partial differential equations by the finite element method or by isogeometric analysis (limited support). It is distributed under the new BSD license. Home page: http://sfepy.org Mailing list: https://mail.python.org/mm3/mailman3/lists/sfepy.python.org/ Git (source) repository, issue tracker: https://github.com/sfepy/sfepy Highlights of this release -------------------------- - interface to eigenvalue problem solvers in SLEPc - new Python 3 enabled Timer class and other Python 3 compatibility fixes For full release notes see [1]. Cheers, Robert Cimrman [1] http://docs.sfepy.org/doc/release_notes.html#id1 --- Contributors to this release in alphabetical order: Robert Cimrman Vladimir Lukes From sebastian at sipsolutions.net Tue Sep 24 14:48:09 2019 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Tue, 24 Sep 2019 11:48:09 -0700 Subject: [Numpy-discussion] NumPy Community Meeting Wednesday, Sep. 25 Message-ID: Hi all, There will be a NumPy Community meeting Wednesday September 25 at 11 am Pacific Time. Everyone is invited to join in and edit the work-in-progress meeting topics and notes: https://hackmd.io/76o-IxCjQX2mOXO_wwkcpg?both Best wishes Sebastian -------------- next part -------------- A non-text attachment was scrubbed... Name: NumPy_Community_Call.ics Type: text/calendar Size: 3264 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 833 bytes Desc: This is a digitally signed message part URL:
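Returning to the genfromtxt question above: one way to get the first N complete rows without loading the whole file is to feed genfromtxt successive chunks of lines and keep only the rows without NaNs. A sketch, assuming a headerless csv and that parameters such as usecols and converters are passed through kwargs:

```
import numpy as np
from itertools import islice

def first_n_complete(fname, n, chunk_lines=100000, **kwargs):
    # Read the file chunk by chunk, drop rows containing NaN, and
    # stop once n complete rows have been collected.
    chunks = []
    count = 0
    with open(fname) as f:
        while count < n:
            lines = list(islice(f, chunk_lines))
            if not lines:
                break
            x = np.atleast_2d(np.genfromtxt(lines, **kwargs))
            x = x[~np.isnan(x).any(axis=1)]
            chunks.append(x)
            count += len(x)
    return np.concatenate(chunks)[:n] if chunks else np.empty((0, 0))

# e.g. rows = first_n_complete("data.csv", 10000, delimiter=",",
#                              usecols=(0, 3, 7))
```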
From stefanv at berkeley.edu Wed Sep 25 13:53:38 2019 From: stefanv at berkeley.edu (Stefan van der Walt) Date: Wed, 25 Sep 2019 10:53:38 -0700 Subject: [Numpy-discussion] Low-level API for Random In-Reply-To: References: Message-ID: <03ba8d91-52d4-43d8-b9bb-5fd9973a96ee@www.fastmail.com> On Fri, Sep 20, 2019, at 21:30, Robert Kern wrote: > Given the prevalence of Cython, it's actually really easy to use the Python API pretty easily in "C", so it's actually a huge waste if the C API matches the Python API too closely. The power and utility of the C API will be in how it *differs* from the Python API. For the distribution methods, this is largely in how it lets you sample one number at a time without bothering with the numpy and broadcasting overhead. That's the driving motivation for having a C API for the distributions, and the algorithms that we choose have consequences for the C API that will best satisfy that motivation. I'd like to clarify what exactly we mean by exposing a C API. Do we have in mind that our random number generators can be used from standalone C code, or via Cython `cimport` like with the current numpy.pxd? It sounds like we want to expose the highest level generators; do we also want to provide access to the bit streams? Best regards, Stéfan -------------- next part -------------- An HTML attachment was scrubbed... URL: From robert.kern at gmail.com Wed Sep 25 14:05:54 2019 From: robert.kern at gmail.com (Robert Kern) Date: Wed, 25 Sep 2019 13:05:54 -0500 Subject: [Numpy-discussion] Low-level API for Random In-Reply-To: <03ba8d91-52d4-43d8-b9bb-5fd9973a96ee@www.fastmail.com> References: <03ba8d91-52d4-43d8-b9bb-5fd9973a96ee@www.fastmail.com> Message-ID: On Wed, Sep 25, 2019, 12:56 PM Stefan van der Walt wrote: > On Fri, Sep 20, 2019, at 21:30, Robert Kern wrote: > Given the prevalence of Cython, it's actually really easy to use the > Python API pretty easily in "C", so it's actually a huge waste if the C API > matches the Python API too closely. The power and utility of the C API will > be in how it *differs* from the Python API. For the distribution methods, > this is largely in how it lets you sample one number at a time without > bothering with the numpy and broadcasting overhead. That's the driving > motivation for having a C API for the distributions, and the algorithms > that we choose have consequences for the C API that will best satisfy that > motivation. > > > I'd like to clarify what exactly we mean by exposing a C API. Do we have > in mind that our random number generators can be used from standalone C > code, or via Cython `cimport` like with the current numpy.pxd? > Cython is the priority. Numba and cffi/ctypes are also desired and relatively easy to do with capsules. Pure C (via #include) is desired, but can be added later because doing that is more annoying. It sounds like we want to expose the highest level generators; do we also > want to provide access to the bit streams? > 100% -------------- next part -------------- An HTML attachment was scrubbed...
URL: From kevin.k.sheppard at gmail.com Wed Sep 25 16:36:44 2019 From: kevin.k.sheppard at gmail.com (Kevin Sheppard) Date: Wed, 25 Sep 2019 21:36:44 +0100 Subject: [Numpy-discussion] Low-level API for Random In-Reply-To: <03ba8d91-52d4-43d8-b9bb-5fd9973a96ee@www.fastmail.com> References: <03ba8d91-52d4-43d8-b9bb-5fd9973a96ee@www.fastmail.com> Message-ID: > > I'd like to clarify what exactly we mean by exposing a C API. Do we have > in mind that our random number generators can be used from standalone C > code, or via Cython `cimport` like with the current numpy.pxd? > > It sounds like we want to expose the highest level generators; do we also > want to provide access to the bit streams? > > What do you mean by standalone C? A Python extension written in C (but not Cython)? Or a C application that doesn't include Python.h? The former is pretty easy since you can use a few PyObjects to simplify initializing the bit generator, and the rest of the code can be directly used in C without any more Python objects. The latter is also doable although the low-level functions needed to initialize the bit generators (which are just C structs) have no standardization. I think the only component in a standalone C application that would need some non-trivial work is SeedSequence (i.e., more than changing function names or reorganizing files). Like Robert, I suspect that Cython users would be the largest immediate beneficiaries of a lower-level API. numba end-users can already consume the bit generators through the exposed CFFI/ctypes interface they provide. These can then be used with the higher-level generators, although end users have to build a shared lib/DLL first. Getting the C API in shape to be used directly by numba is probably a bigger task. Kevin -------------- next part -------------- An HTML attachment was scrubbed... URL: From alan.isaac at gmail.com Fri Sep 27 12:11:38 2019 From: alan.isaac at gmail.com (Alan Isaac) Date: Fri, 27 Sep 2019 12:11:38 -0400 Subject: [Numpy-discussion] error during pip install Message-ID: <2ba1fd4e-b5e0-fc8a-68f5-069ae09729c6@gmail.com> Upgrading numpy with pip on Python 3.8b4 on Win 10 produced: ERROR: Could not install packages due to an EnvironmentError: [WinError 123] The filename, directory name, or volume label syntax is incorrect: '"C:' However, the install appears to have been successful. fwiw, Alan Isaac From warren.weckesser at gmail.com Fri Sep 27 13:54:40 2019 From: warren.weckesser at gmail.com (Warren Weckesser) Date: Fri, 27 Sep 2019 13:54:40 -0400 Subject: [Numpy-discussion] NEP 32 is accepted. Now the work begins... Message-ID: NumPy devs, NEP 32 to remove the financial functions (https://numpy.org/neps/nep-0032-remove-financial-functions.html) has been accepted. The next step is to create the numpy-financial package that will replace them. The repository for the new package is https://github.com/numpy/numpy-financial. I have a work-in-progress pull request there to get the initial structure set up. Reviews of the PR would be helpful, as would contributions to set up Sphinx-based documentation, continuous integration, PyPI packaging, and anything else that goes into setting up a "proper" package. Any help would be greatly appreciated! Warren From sebastian at sipsolutions.net Fri Sep 27 14:50:30 2019 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Fri, 27 Sep 2019 11:50:30 -0700 Subject: [Numpy-discussion] UFunc out argument not forcing high precision loop?
From alan.isaac at gmail.com  Fri Sep 27 12:11:38 2019
From: alan.isaac at gmail.com (Alan Isaac)
Date: Fri, 27 Sep 2019 12:11:38 -0400
Subject: [Numpy-discussion] error during pip install
Message-ID: <2ba1fd4e-b5e0-fc8a-68f5-069ae09729c6@gmail.com>

Upgrading numpy with pip on Python 3.8b4 on Win 10 produced:

    ERROR: Could not install packages due to an EnvironmentError: [WinError 123] The filename, directory name, or volume label syntax is incorrect: '"C:'

However, the install appears to have been successful.

fwiw,
Alan Isaac

From warren.weckesser at gmail.com  Fri Sep 27 13:54:40 2019
From: warren.weckesser at gmail.com (Warren Weckesser)
Date: Fri, 27 Sep 2019 13:54:40 -0400
Subject: [Numpy-discussion] NEP 32 is accepted. Now the work begins...
Message-ID:

NumPy devs,

NEP 32 to remove the financial functions (https://numpy.org/neps/nep-0032-remove-financial-functions.html) has been accepted. The next step is to create the numpy-financial package that will replace them. The repository for the new package is https://github.com/numpy/numpy-financial.

I have a work-in-progress pull request there to get the initial structure set up. Reviews of the PR would be helpful, as would contributions to set up Sphinx-based documentation, continuous integration, PyPI packaging, and anything else that goes into setting up a "proper" package. Any help would be greatly appreciated!

Warren

From sebastian at sipsolutions.net  Fri Sep 27 14:50:30 2019
From: sebastian at sipsolutions.net (Sebastian Berg)
Date: Fri, 27 Sep 2019 11:50:30 -0700
Subject: [Numpy-discussion] UFunc out argument not forcing high precision loop?
Message-ID:

Hi all,

Looking at the ufunc dispatching rules with an `out` argument, I was a bit surprised to realize this little gem is how things work:

```
arr = np.arange(10, dtype=np.uint16) + 2**15
print(arr)
# array([ 0, 2, 4, 6, 8, 10, 12, 14, 16, 18], dtype=uint16)

out = np.zeros(10)

np.add(arr, arr, out=out)
print(repr(out))
# array([ 0.,  2.,  4.,  6.,  8., 10., 12., 14., 16., 18.])
```

This is strictly speaking correct/consistent. What the ufunc tries to ensure is that whatever the loop produces fits into `out`. However, I still find it unexpected that it does not pick the full precision loop.

There is currently only one way to achieve that, and this is by using `dtype=out.dtype` (or similar incarnations) which specify the exact dtype [0].

Of course this is also because I would like to simplify things for a new dispatching system, but I would like to propose to disable the above behaviour. This would mean:

```
# make the call:
np.add(arr, arr, out=out)

# Equivalent to the current [1]:
np.add(arr, arr, out=out, dtype=(None, None, out.dtype))

# Getting the old behaviour requires (assuming inputs have same dtype):
np.add(arr, arr, out=out, dtype=arr.dtype)
```

and thus force the high precision loop. In very rare cases, this could lead to no loop being found.

The main incompatibility is if someone actually makes use of the above (integer over/underflow) behaviour, but wants to store it in a higher precision array.

I personally currently think we should change it, but am curious if we think that we may be able to get away with an accelerated process and not a year-long FutureWarning.

Cheers,

Sebastian

[0] You can also use `casting="no"`, but in all relevant cases that should find no loop, since we typically only have homogeneous loop definitions.

[1] Which is normally the same as the shorter spelling `dtype=out.dtype` of course.
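To see the difference Sebastian is pointing at end to end, here is a short sketch of the behaviour described above (assuming NumPy as it behaved at the time of this thread):

```
import numpy as np

arr = np.arange(10, dtype=np.uint16) + 2**15   # 32768, 32769, ..., 32777
out = np.zeros(10)                             # float64 output

# Default: the uint16 loop is picked, so 32768 + 32768 wraps to 0
# *before* the result is cast into the float64 `out`.
np.add(arr, arr, out=out)
print(out[:3])                                 # [0. 2. 4.]

# The one existing escape hatch: force the loop dtype explicitly.
np.add(arr, arr, out=out, dtype=out.dtype)
print(out[:3])                                 # [65536. 65538. 65540.]
```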
From sebastian at sipsolutions.net  Fri Sep 27 18:02:01 2019
From: sebastian at sipsolutions.net (Sebastian Berg)
Date: Fri, 27 Sep 2019 15:02:01 -0700
Subject: [Numpy-discussion] UFunc out argument not forcing high precision loop?
In-Reply-To:
References:
Message-ID: <4d51e2edad53c2602ba15a9e0f51bed366caa3e8.camel@sipsolutions.net>

On Fri, 2019-09-27 at 11:50 -0700, Sebastian Berg wrote:
> Hi all,
>
> Looking at the ufunc dispatching rules with an `out` argument, I was a bit surprised to realize this little gem is how things work:
>
> ```
> arr = np.arange(10, dtype=np.uint16) + 2**15
> print(arr)
> # array([ 0, 2, 4, 6, 8, 10, 12, 14, 16, 18], dtype=uint16)

Whoops, copied that print wrong of course.

Just to be clear, I personally will consider this an accuracy/precision bug and assume that we can just switch the behaviour fairly unceremoniously at some point (and if someone feels that should be a major release, I do not mind). It seems like one of those things that will definitely fix some bugs but could break the odd system/assumption somewhere. Similar to fixing the memory overlap issues.

- Sebastian

From njs at pobox.com  Fri Sep 27 18:50:38 2019
From: njs at pobox.com (Nathaniel Smith)
Date: Fri, 27 Sep 2019 15:50:38 -0700
Subject: [Numpy-discussion] UFunc out argument not forcing high precision loop?
In-Reply-To: <4d51e2edad53c2602ba15a9e0f51bed366caa3e8.camel@sipsolutions.net>
References: <4d51e2edad53c2602ba15a9e0f51bed366caa3e8.camel@sipsolutions.net>
Message-ID:

It is pretty weird that these two statements don't necessarily produce the same result:

    someufunc(*inputs, out=out_arr)
    out_arr[...] = someufunc(*inputs)

From sebastian at sipsolutions.net  Fri Sep 27 19:11:07 2019
From: sebastian at sipsolutions.net (Sebastian Berg)
Date: Fri, 27 Sep 2019 16:11:07 -0700
Subject: [Numpy-discussion] UFunc out argument not forcing high precision loop?
In-Reply-To:
References: <4d51e2edad53c2602ba15a9e0f51bed366caa3e8.camel@sipsolutions.net>
Message-ID:

On Fri, 2019-09-27 at 15:50 -0700, Nathaniel Smith wrote:
> It is pretty weird that these two statements don't necessarily produce the same result:
>
>     someufunc(*inputs, out=out_arr)
>     out_arr[...] = someufunc(*inputs)

Ooopst, fair point. I am not sure I agree, since currently the (mental) model is typically:

    loop_dtype = np.result_type(*arguments)

and the question now is whether it is arguments or outputs. However, the oops is that I did not realize that right now we do, effectively, ignore the output argument completely for the type resolution. (I.e. I could probably work with that assumption, without actually breaking anything.)

- Sebastian
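Sebastian's "mental model" line can be checked directly; a small sketch, again assuming the behaviour described in this thread:

```
import numpy as np

arr = np.arange(10, dtype=np.uint16) + 2**15
out = np.zeros(10)   # float64

# The loop dtype follows the inputs only; `out` just receives a cast:
print(np.result_type(arr, arr))        # uint16 -- the loop that is picked
print(np.result_type(arr, arr, out))   # float64 -- what an `out`-aware
                                       # resolution would pick instead
```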
From charlesr.harris at gmail.com  Fri Sep 27 19:41:00 2019
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Fri, 27 Sep 2019 17:41:00 -0600
Subject: [Numpy-discussion] error during pip install
In-Reply-To: <2ba1fd4e-b5e0-fc8a-68f5-069ae09729c6@gmail.com>
References: <2ba1fd4e-b5e0-fc8a-68f5-069ae09729c6@gmail.com>
Message-ID:

Is that the pip that comes with Python 3.8b4?

From charlesr.harris at gmail.com  Fri Sep 27 19:43:40 2019
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Fri, 27 Sep 2019 17:43:40 -0600
Subject: [Numpy-discussion] error during pip install
In-Reply-To:
References: <2ba1fd4e-b5e0-fc8a-68f5-069ae09729c6@gmail.com>
Message-ID:

And where did you get NumPy? We don't have any compatible wheels. Was this from source?

Chuck

From alan.isaac at gmail.com  Sat Sep 28 11:44:25 2019
From: alan.isaac at gmail.com (Alan Isaac)
Date: Sat, 28 Sep 2019 11:44:25 -0400
Subject: [Numpy-discussion] error during pip install
In-Reply-To:
References: <2ba1fd4e-b5e0-fc8a-68f5-069ae09729c6@gmail.com>
Message-ID: <497cf6e2-e6de-c639-704e-6e353829bc6d@gmail.com>

On Fri, Sep 27, 2019 at 5:41 PM Charles R Harris wrote:
> Is that the pip that comes with Python 3.8b4?

Yes.

On 9/27/2019 7:43 PM, Charles R Harris wrote:
> And where did you get NumPy? We don't have any compatible wheels. Was this from source?

Umm, ... does `pip` automatically compile from source in this case? (I just used `python38 -m pip install numpy`; I'm afraid I did not specify a log file.)

But I'll take the core message to be: wait for the wheels.

Cheers,
Alan

From charlesr.harris at gmail.com  Sat Sep 28 12:12:56 2019
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Sat, 28 Sep 2019 10:12:56 -0600
Subject: [Numpy-discussion] error during pip install
In-Reply-To: <497cf6e2-e6de-c639-704e-6e353829bc6d@gmail.com>
References: <2ba1fd4e-b5e0-fc8a-68f5-069ae09729c6@gmail.com> <497cf6e2-e6de-c639-704e-6e353829bc6d@gmail.com>
Message-ID:

On Sat, Sep 28, 2019 at 9:45 AM Alan Isaac wrote:
> Umm, ... does `pip` automatically compile from source in this case? (I just used `python38 -m pip install numpy`; I'm afraid I did not specify a log file.)

Yes. I'm actually pleased that the install succeeded on Windows, although you won't have good BLAS/LAPACK, just the numpy C versions of lapack_lite. The warning/error is a bit concerning though; it would be nice to know if it is from Python 3.8's pip or from numpy.

> But I'll take the core message to be: wait for the wheels.

We will need to work on generating 3.8 wheels as soon as Python 3.8 is released. I'd like to try before then, but the simplest attempt failed and I didn't pursue it.

Chuck

From warren.weckesser at gmail.com  Sat Sep 28 13:15:49 2019
From: warren.weckesser at gmail.com (Warren Weckesser)
Date: Sat, 28 Sep 2019 13:15:49 -0400
Subject: [Numpy-discussion] NEP 32 is accepted. Now the work begins...
In-Reply-To:
References:
Message-ID:

On 9/27/19, Warren Weckesser wrote:
> NumPy devs,
>
> NEP 32 to remove the financial functions (https://numpy.org/neps/nep-0032-remove-financial-functions.html) has been accepted.

CI gurus: the web page containing the rendered NEPs, https://numpy.org/neps/, has not updated since the pull request that changed the status of NEP 32 to Accepted was merged (https://github.com/numpy/numpy/pull/14600). Does something else need to be done to get that page to regenerate?

Warren
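For readers following along: the functions moving out are the spreadsheet-style ones (pv, fv, npv, and friends). A sketch of the intended replacement usage, assuming the new package ends up mirroring the old numpy functions under its own namespace — at this point in the thread only the repository skeleton existed, so treat the names as assumptions:

```
# Hypothetical usage of the replacement package described above.
import numpy_financial as npf

# Present value of ten 100-unit payments at 5% per period;
# previously spelled np.pv(0.05, 10, -100).
print(npf.pv(rate=0.05, nper=10, pmt=-100))
```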
From alan.isaac at gmail.com  Sat Sep 28 13:22:47 2019
From: alan.isaac at gmail.com (Alan Isaac)
Date: Sat, 28 Sep 2019 13:22:47 -0400
Subject: [Numpy-discussion] error during pip install
In-Reply-To:
References: <2ba1fd4e-b5e0-fc8a-68f5-069ae09729c6@gmail.com> <497cf6e2-e6de-c639-704e-6e353829bc6d@gmail.com>
Message-ID: <98c93ae3-fa34-c919-ae86-764faa5e9f0a@gmail.com>

On 9/28/2019 12:12 PM, Charles R Harris wrote:
> The warning/error is a bit concerning though; it would be nice to know if it is from Python 3.8's pip or from numpy.

Possibly relevant: https://github.com/numpy/numpy/issues/11451

Alan

From charlesr.harris at gmail.com  Sat Sep 28 13:27:51 2019
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Sat, 28 Sep 2019 11:27:51 -0600
Subject: [Numpy-discussion] error during pip install
In-Reply-To: <98c93ae3-fa34-c919-ae86-764faa5e9f0a@gmail.com>
References: <2ba1fd4e-b5e0-fc8a-68f5-069ae09729c6@gmail.com> <497cf6e2-e6de-c639-704e-6e353829bc6d@gmail.com> <98c93ae3-fa34-c919-ae86-764faa5e9f0a@gmail.com>
Message-ID:

On Sat, Sep 28, 2019 at 11:23 AM Alan Isaac wrote:
> Possibly relevant: https://github.com/numpy/numpy/issues/11451

Yes, thanks, that looks to be the problem.

Chuck

From warren.weckesser at gmail.com  Sat Sep 28 20:47:22 2019
From: warren.weckesser at gmail.com (Warren Weckesser)
Date: Sat, 28 Sep 2019 20:47:22 -0400
Subject: [Numpy-discussion] Forcing gufunc to error with size zero input
Message-ID:

I'm experimenting with gufuncs, and I just created a simple one with signature '(i)->()'. Is there a way to configure the gufunc itself so that an empty array results in an error? Or would I have to create a Python wrapper around the gufunc that does the error checking?

Currently, when passed an empty array, the ufunc loop is called with the core dimension associated with i set to 0. It would be nice if the code didn't get that far, and the ufunc machinery "knew" that this gufunc didn't accept a core dimension that is 0. I'd like to automatically get an error, something like the error produced by `np.max([])`.

Warren

From wieser.eric+numpy at gmail.com  Sat Sep 28 21:03:50 2019
From: wieser.eric+numpy at gmail.com (Eric Wieser)
Date: Sat, 28 Sep 2019 18:03:50 -0700
Subject: [Numpy-discussion] Forcing gufunc to error with size zero input
In-Reply-To:
References:
Message-ID:

Can you just raise an exception in the gufunc's inner loop? Or is there no mechanism to do that today?

I don't think you were proposing that core dimensions should _never_ be allowed to be 0, but if you were I disagree. I spent a fair amount of work enabling that for linalg because it provided some convenient base cases.

We could go down the route of augmenting the gufunc signature syntax to support requiring non-empty dimensions, like we did for optional ones - although IMO we should consider switching from a string minilanguage to a structured object specification if we plan to go too much further with extending it.
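Warren's fallback option, a Python wrapper that does the error checking, is straightforward to sketch. `peaktopeak` here stands in for the '(i)->()' gufunc from his repository, so the import path is an assumption:

```
import numpy as np
from npuff import peaktopeak   # hypothetical import; see the thread

def peaktopeak_checked(x):
    # Reject an empty core dimension before the gufunc loop ever runs.
    x = np.asarray(x)
    if x.shape[-1] == 0:
        raise ValueError("zero-size array passed to peaktopeak")
    return peaktopeak(x)
```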
From sebastian at sipsolutions.net  Sat Sep 28 21:22:00 2019
From: sebastian at sipsolutions.net (Sebastian Berg)
Date: Sat, 28 Sep 2019 18:22:00 -0700
Subject: [Numpy-discussion] NEP 32 is accepted. Now the work begins...
In-Reply-To:
References:
Message-ID: <8825c1fd342b3fe4e342a2b5e55f2bd3a055d361.camel@sipsolutions.net>

On Sat, 2019-09-28 at 13:15 -0400, Warren Weckesser wrote:
> CI gurus: the web page containing the rendered NEPs, https://numpy.org/neps/, has not updated since the pull request that changed the status of NEP 32 to Accepted was merged (https://github.com/numpy/numpy/pull/14600). Does something else need to be done to get that page to regenerate?

I pushed an empty commit to trigger deployment. That should happen automatically (as it does for the devdocs). I do not know why it does not work, and GitHub did not yet answer my service request on it.

- Sebastian
From warren.weckesser at gmail.com  Sat Sep 28 23:03:49 2019
From: warren.weckesser at gmail.com (Warren Weckesser)
Date: Sat, 28 Sep 2019 23:03:49 -0400
Subject: [Numpy-discussion] NEP 32 is accepted. Now the work begins...
In-Reply-To: <8825c1fd342b3fe4e342a2b5e55f2bd3a055d361.camel@sipsolutions.net>
References: <8825c1fd342b3fe4e342a2b5e55f2bd3a055d361.camel@sipsolutions.net>
Message-ID:

On 9/28/19, Sebastian Berg wrote:
> I pushed an empty commit to trigger deployment. That should happen automatically (as it does for the devdocs). I do not know why it does not work, and GitHub did not yet answer my service request on it.

Thanks Sebastian. The NEPs web page is updated now.

Warren

From warren.weckesser at gmail.com  Sun Sep 29 00:20:03 2019
From: warren.weckesser at gmail.com (Warren Weckesser)
Date: Sun, 29 Sep 2019 00:20:03 -0400
Subject: [Numpy-discussion] Forcing gufunc to error with size zero input
In-Reply-To:
References:
Message-ID:

On 9/28/19, Eric Wieser wrote:
> Can you just raise an exception in the gufunc's inner loop? Or is there no mechanism to do that today?

Maybe? I don't know what is the idiomatic way to handle errors detected in an inner loop. And pushing this particular error detection into the inner loop doesn't feel right.

> I don't think you were proposing that core dimensions should _never_ be allowed to be 0,

No, I'm not suggesting that. There are many cases where a length 0 core dimension is fine.

I'm interested in the case where there is not a meaningful definition of the operation on the empty set. The mean is an example. Currently `np.mean([])` generates two warnings (one useful, the other cryptic and apparently incidental), and returns nan. Returning nan is one way to handle such a case; another is to raise an error like `np.amax([])` does. I'd like to raise an error in the example that I'm working on ('peaktopeak' at https://github.com/WarrenWeckesser/npuff). The function is a gufunc, not a reduction of a binary operation, so the 'identity' argument of PyUFunc_FromFuncAndDataAndSignature has no effect.

> but if you were I disagree. I spent a fair amount of work enabling that for linalg because it provided some convenient base cases.
>
> We could go down the route of augmenting the gufunc signature syntax to support requiring non-empty dimensions, like we did for optional ones - although IMO we should consider switching from a string minilanguage to a structured object specification if we plan to go too much further with extending it.

After only a quick glance at that code: one option is to add a '+' after the input names in the signature that must have a length of at least 1. So the signature for functions like `mean` (if you were to reimplement it as a gufunc, and wanted an error instead of nan), `amax`, `ptp`, etc., would be '(i+)->()'.

However, the only meaningful use-cases of this enhancement that I've come up with are these simple reductions. So I don't know if making such a change to the signature is worthwhile. On the other hand, there are many examples of useful 1-d reductions that aren't the reduction of an associative binary operation. It might be worthwhile to have a new convenience function just for the case '(i)->()', maybe something like PyUFunc_OneDReduction_FromFuncAndData (ugh, that's ugly, but I think you get the idea), and that function can have an argument to specify that the length must be at least 1.

I'll see if that is feasible, but I won't be surprised to learn that there are good reasons for *not* doing that.

Warren
From warren.weckesser at gmail.com  Sun Sep 29 00:40:50 2019
From: warren.weckesser at gmail.com (Warren Weckesser)
Date: Sun, 29 Sep 2019 00:40:50 -0400
Subject: [Numpy-discussion] Forcing gufunc to error with size zero input
In-Reply-To:
References:
Message-ID:

On 9/29/19, Warren Weckesser wrote:
> However, the only meaningful use-cases of this enhancement that I've come up with are these simple reductions.

Of course, just minutes after sending the email, I realized I *do* know of other signatures that could benefit from a check on the core dimension size. An implementation of Pearson's correlation coefficient as a gufunc would have signature (i),(i)->(), and the core dimension i must be at least *2* for the calculation to be well defined. Other correlations would also likely require a nonzero core dimension.

Warren
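To see why i must be at least 2: with a single sample, both centered vectors are identically zero and r degenerates to 0/0. A plain-NumPy sketch of the computation such a gufunc would perform:

```
import numpy as np

def pearson_r(x, y):
    # r = cov(x, y) / (std(x) * std(y)); degenerates to 0/0 when i < 2
    x = x - x.mean()
    y = y - y.mean()
    return (x @ y) / np.sqrt((x @ x) * (y @ y))

print(pearson_r(np.array([1., 2., 3.]), np.array([2., 4., 7.])))  # ~0.99
```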
From sebastian at sipsolutions.net  Sun Sep 29 00:43:36 2019
From: sebastian at sipsolutions.net (Sebastian Berg)
Date: Sat, 28 Sep 2019 21:43:36 -0700
Subject: [Numpy-discussion] Forcing gufunc to error with size zero input
In-Reply-To:
References:
Message-ID:

On Sun, 2019-09-29 at 00:20 -0400, Warren Weckesser wrote:
> Maybe? I don't know what is the idiomatic way to handle errors detected in an inner loop. And pushing this particular error detection into the inner loop doesn't feel right.

Basically, since you want to release the GIL, you can grab it and set an error right now. That will work, although grabbing the GIL from the inner loop is not ideal, at least in the sense that it does not work with subinterpreters (but numpy does not currently work with those in any case). We do use this internally, I believe.

Well, even without dtypes, I think we probably want a few extra APIs around UFuncs, and that is setup/teardown (not necessarily as such functions), as well as a return value for the inner loop to signal iteration stop. There was a long discussion about that, for example here: https://github.com/numpy/numpy/issues/12518

There is another use-case: we probably want to allow optimized loop selection (necessary/used in casting).

Note that I believe all of this type of logic should be moved into a UFuncImpl [0] object, so that it can be DType (and especially user DType) specific without bloating up the current UFunc object too much. Although that puts a lot of power out there, so it may be good to limit it a lot initially.

Best,

Sebastian

[0] It was Eric's suggestion/name; I do not know if it came up earlier.

From warren.weckesser at gmail.com  Sun Sep 29 11:02:33 2019
From: warren.weckesser at gmail.com (Warren Weckesser)
Date: Sun, 29 Sep 2019 11:02:33 -0400
Subject: [Numpy-discussion] Error handling in a ufunc inner loop.
Message-ID:

This is a new thread to address the question of error handling in a ufunc loop that was brought up in the thread on handling core dimensions of length zero. I'm attempting to answer my own question about the idiomatic way to handle an error in an inner loop.

The use of the GIL with a ufunc loop is documented at

https://numpy.org/devdocs/reference/internals.code-explanations.html#function-call

So an inner loop is running without the GIL if the macro NPY_ALLOW_THREADS is defined and the loop is not an object-type loop.

If the inner loop is running without the GIL, it must acquire the GIL before calling, say, PyErr_SetString to set an exception. The NumPy macros for acquiring the GIL are documented at

https://docs.scipy.org/doc/numpy/reference/c-api.array.html#group-2

These macros are defined in numpy/core/include/numpy/ndarraytypes.h. If NPY_ALLOW_THREADS is defined, these macros wrap calls to PyGILState_Ensure() and PyGILState_Release() (https://docs.python.org/3/c-api/init.html#non-python-created-threads):

```
#define NPY_ALLOW_C_API_DEF  PyGILState_STATE __save__;
#define NPY_ALLOW_C_API      do {__save__ = PyGILState_Ensure();} while (0);
#define NPY_DISABLE_C_API    do {PyGILState_Release(__save__);} while (0);
```

If NPY_ALLOW_THREADS is not defined, those macros are defined with empty values.

Now suppose I want to change the following inner loop to set an exception instead of returning nan when the input is negative:

```
static void
logfactorial_loop(char **args, npy_intp *dimensions,
                  npy_intp* steps, void* data)
{
    char *in = args[0];
    char *out = args[1];
    npy_intp in_step = steps[0];
    npy_intp out_step = steps[1];

    for (npy_intp i = 0; i < dimensions[0]; ++i, in += in_step, out += out_step) {
        int64_t x = *(int64_t *)in;
        if (x < 0) {
            *((double *)out) = NAN;
        }
        else {
            *((double *)out) = logfactorial(x);
        }
    }
}
```

Based on the documentation linked above, the changed inner loop is simply:

```
static void
logfactorial_loop(char **args, npy_intp *dimensions,
                  npy_intp* steps, void* data)
{
    char *in = args[0];
    char *out = args[1];
    npy_intp in_step = steps[0];
    npy_intp out_step = steps[1];

    for (npy_intp i = 0; i < dimensions[0]; ++i, in += in_step, out += out_step) {
        int64_t x = *(int64_t *)in;
        if (x < 0) {
            NPY_ALLOW_C_API_DEF
            NPY_ALLOW_C_API
            PyErr_SetString(PyExc_ValueError,
                            "math domain error in logfactorial: x < 0");
            NPY_DISABLE_C_API
            return;
        }
        else {
            *((double *)out) = logfactorial(x);
        }
    }
}
```

That worked as expected, but I haven't tried it yet with a NumPy installation where NPY_ALLOW_THREADS is not defined.

Is that change correct? Would that be considered the (or an) idiomatic way to handle errors in an inner loop? Are there any potential problems that I'm missing?

Warren
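From the Python side, the modified loop behaves like this; the extension module name is hypothetical (following the thread's logfactorial example), and the printed values are just log(3!) and log(5!):

```
import numpy as np
from logfactorial import logfactorial   # hypothetical extension module

print(logfactorial(np.array([3, 5], dtype=np.int64)))
# [1.79175947 4.78749174]

logfactorial(np.array([-1], dtype=np.int64))
# ValueError: math domain error in logfactorial: x < 0
```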
I spent a fair amount of work >> enabling that for linalg because it provided some convenient base cases. >> >> We could go down the route of augmenting the gufuncs signature syntax to >> support requiring non-empty dimensions, like we did for optional ones - >> although IMO we should consider switching from a string minilanguage to a >> structured object specification if we plan to go too much further with >> extending it. > > After only a quick glance at that code: one option is to add a '+' > after the input names in the signature that must have a length that is > at least 1. So the signature for functions like `mean` (if you were > to reimplement it as a gufunc, and wanted an error instead of nan), > `amax`, `ptp`, etc, would be '(i+)->()'. > > However, the only meaningful uses-cases of this enhancement that I've > come up with are these simple reductions. So I don't know if making > such a change to the signature is worthwhile. On the other hand, > there are many examples of useful 1-d reductions that aren't the > reduction of an associative binary operation. It might be worthwhile > to have a new convenience function just for the case '(i)->()', maybe > something like PyUFunc_OneDReduction_FromFuncAndData (ugh, that's > ugly, but I think you get the idea), and that function can have an > argument to specify that the length must be at least 1. > > I'll see if that is feasible, but I won't be surprised to learn that > there are good reasons for *not* doing that. > > Warren > > > >> >> On Sat, Sep 28, 2019, 17:47 Warren Weckesser >> wrote: >> >>> I'm experimenting with gufuncs, and I just created a simple one with >>> signature '(i)->()'. Is there a way to configure the gufunc itself so >>> that an empty array results in an error? Or would I have to create a >>> Python wrapper around the gufunc that does the error checking? >>> Currently, when passed an empty array, the ufunc loop is called with >>> the core dimension associated with i set to 0. It would be nice if >>> the code didn't get that far, and the ufunc machinery "knew" that this >>> gufunc didn't accept a core dimension that is 0. I'd like to >>> automatically get an error, something like the error produced by >>> `np.max([])`. >>> >>> Warren >>> _______________________________________________ >>> NumPy-Discussion mailing list >>> NumPy-Discussion at python.org >>> https://mail.python.org/mailman/listinfo/numpy-discussion >>> >> > From warren.weckesser at gmail.com Sun Sep 29 14:17:25 2019 From: warren.weckesser at gmail.com (Warren Weckesser) Date: Sun, 29 Sep 2019 14:17:25 -0400 Subject: [Numpy-discussion] Error handling in a ufunc inner loop. In-Reply-To: References: Message-ID: On 9/29/19, Warren Weckesser wrote: > This is a new thread to address the question of error handling in a ufunc > loop that was brought up in the thread on handling core dimensions of > length zero. I'm attempting to answer my own question about the idiomatic > way to handle an error in an inner loop. > > The use of the GIL with a ufunc loop is documented at > > > https://numpy.org/devdocs/reference/internals.code-explanations.html#function-call > > So an inner loop is running without the GIL if the macro NPY_ALLOW_THREADS > is defined and the loop is not an object-type loop. > > If the inner loop is running without the GIL, it must acquire the GIL > before calling, say, PyErr_SetString to set an exception. 
The NumPy macros > for acquiring the GIL are documented at > > https://docs.scipy.org/doc/numpy/reference/c-api.array.html#group-2 > > These macros are defined in numpy/core/include/numpy/ndarraytypes.h. If > NPY_ALLOW_THREADS is defined, these macros wrap calls to > PyGILState_Ensure() and PyGILState_Release() ( > https://docs.python.org/3/c-api/init.html#non-python-created-threads): > > ``` > #define NPY_ALLOW_C_API_DEF PyGILState_STATE __save__; > #define NPY_ALLOW_C_API do {__save__ = PyGILState_Ensure();} while > (0); > #define NPY_DISABLE_C_API do {PyGILState_Release(__save__);} while (0); > ``` > > If NPY_ALLOW_THREADS is not defined, those macros are defined with empty > values. > > Now suppose I want to change the following inner loop to set an exception > instead of returning nan when the input is negative: > > ``` > static void > logfactorial_loop(char **args, npy_intp *dimensions, > npy_intp* steps, void* data) > { > char *in = args[0]; > char *out = args[1]; > npy_intp in_step = steps[0]; > npy_intp out_step = steps[1]; > > for (npy_intp i = 0; i < dimensions[0]; ++i, in += in_step, out += > out_step) { > int64_t x = *(int64_t *)in; > if (x < 0) { > *((double *)out) = NAN; > } > else { > *((double *)out) = logfactorial(x); > } > } > } > ``` > > Based on the documentation linked above, the changed inner loop is simply: > > ``` > static void > logfactorial_loop(char **args, npy_intp *dimensions, > npy_intp* steps, void* data) > { > char *in = args[0]; > char *out = args[1]; > npy_intp in_step = steps[0]; > npy_intp out_step = steps[1]; > > for (npy_intp i = 0; i < dimensions[0]; ++i, in += in_step, out += > out_step) { > int64_t x = *(int64_t *)in; > if (x < 0) { > NPY_ALLOW_C_API_DEF > NPY_ALLOW_C_API > PyErr_SetString(PyExc_ValueError, "math domain error in > logfactorial: x < 0"); > NPY_DISABLE_C_API > return; > } > else { > *((double *)out) = logfactorial(x); > } > } > } > ``` > > That worked as expected, but I haven't tried it yet with a NumPy > installation where NPY_ALLOW_THREADS is not defined. > > Is that change correct? Would that be considered the (or an) idiomatic way > to handle errors in an inner loop? Are there any potential problems that > I'm missing? Sebastian Berg pointed out to me that exactly this pattern is used in NumPy, for example, https://github.com/numpy/numpy/blob/68bd6e359a6b0863acf39cad637e1444d78eabd0/numpy/core/src/umath/loops.c.src#L913 So I'll take that as a yes, that's the way (or at least a way) to do it. Warren > > Warren >