From sturla.molden at gmail.com Wed Nov 2 18:38:13 2016 From: sturla.molden at gmail.com (Sturla Molden) Date: Wed, 2 Nov 2016 22:38:13 +0000 (UTC) Subject: [Numpy-discussion] missing from contributor list? Message-ID: <66871716499819023.492381sturla.molden-gmail.com@news.gmane.org> Why am I missing from the contributor hist here? https://github.com/numpy/numpy/blob/master/numpy/_build_utils/src/apple_sgemv_fix.c Sturla From robert.kern at gmail.com Wed Nov 2 18:46:01 2016 From: robert.kern at gmail.com (Robert Kern) Date: Wed, 2 Nov 2016 15:46:01 -0700 Subject: [Numpy-discussion] missing from contributor list? In-Reply-To: <66871716499819023.492381sturla.molden-gmail.com@news.gmane.org> References: <66871716499819023.492381sturla.molden-gmail.com@news.gmane.org> Message-ID: Because Github (or maybe git) doesn't track the history of the file through all of the renames. It is only reporting the contributors of changes to the file at its current location. If you go back to the time just prior to the commit that renamed the file, you do show up in the list: https://github.com/numpy/numpy/blob/f179ec92d8ddb0dc5f7445255022be5c4765a704/numpy/build_utils/src/apple_sgemv_fix.c On Wed, Nov 2, 2016 at 3:38 PM, Sturla Molden wrote: > Why am I missing from the contributor hist here? > > https://github.com/numpy/numpy/blob/master/numpy/_ > build_utils/src/apple_sgemv_fix.c > > > Sturla > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > https://mail.scipy.org/mailman/listinfo/numpy-discussion > -- Robert Kern -------------- next part -------------- An HTML attachment was scrubbed... URL: From evgeny.burovskiy at gmail.com Wed Nov 2 18:50:33 2016 From: evgeny.burovskiy at gmail.com (Evgeni Burovski) Date: Thu, 3 Nov 2016 01:50:33 +0300 Subject: [Numpy-discussion] missing from contributor list? In-Reply-To: References: <66871716499819023.492381sturla.molden-gmail.com@news.gmane.org> Message-ID: br at duneyrr:~/repos/numpy/numpy/_build_utils$ git blame -M -C -C apple_accelerate.py 4743f3b4 numpy/_build_utils/apple_accelerate.py (Charles Harris 2015-12-05 19:16:00 -0700 1) from __future__ import division, a 4743f3b4 numpy/_build_utils/apple_accelerate.py (Charles Harris 2015-12-05 19:16:00 -0700 2) cf307769 numpy/build_utils/apple_accelerate.py (Sturla Molden 2014-10-27 07:45:57 +0100 3) import os cf307769 numpy/build_utils/apple_accelerate.py (Sturla Molden 2014-10-27 07:45:57 +0100 4) import sys cf307769 numpy/build_utils/apple_accelerate.py (Sturla Molden 2014-10-27 07:45:57 +0100 5) import re cf307769 numpy/build_utils/apple_accelerate.py (Sturla Molden 2014-10-27 07:45:57 +0100 6) cf307769 numpy/build_utils/apple_accelerate.py (Sturla Molden 2014-10-27 07:45:57 +0100 7) __all__ = ['uses_accelerate_framew cf307769 numpy/build_utils/apple_accelerate.py (Sturla Molden 2014-10-27 07:45:57 +0100 8) cf307769 numpy/build_utils/apple_accelerate.py (Sturla Molden 2014-10-27 07:45:57 +0100 9) def uses_accelerate_framework(info On Thu, Nov 3, 2016 at 1:46 AM, Robert Kern wrote: > Because Github (or maybe git) doesn't track the history of the file through > all of the renames. It is only reporting the contributors of changes to the > file at its current location. If you go back to the time just prior to the > commit that renamed the file, you do show up in the list: > > https://github.com/numpy/numpy/blob/f179ec92d8ddb0dc5f7445255022be5c4765a704/numpy/build_utils/src/apple_sgemv_fix.c > > On Wed, Nov 2, 2016 at 3:38 PM, Sturla Molden > wrote: >> >> Why am I missing from the contributor hist here? >> >> >> https://github.com/numpy/numpy/blob/master/numpy/_build_utils/src/apple_sgemv_fix.c >> >> >> Sturla >> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> https://mail.scipy.org/mailman/listinfo/numpy-discussion > > > > > -- > Robert Kern > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > https://mail.scipy.org/mailman/listinfo/numpy-discussion > From jtaylor.debian at googlemail.com Wed Nov 2 18:52:16 2016 From: jtaylor.debian at googlemail.com (Julian Taylor) Date: Wed, 2 Nov 2016 23:52:16 +0100 Subject: [Numpy-discussion] missing from contributor list? In-Reply-To: <66871716499819023.492381sturla.molden-gmail.com@news.gmane.org> References: <66871716499819023.492381sturla.molden-gmail.com@news.gmane.org> Message-ID: On 02.11.2016 23:38, Sturla Molden wrote: > Why am I missing from the contributor hist here? > > https://github.com/numpy/numpy/blob/master/numpy/_build_utils/src/apple_sgemv_fix.c > > Probably because the file was moved and github does not track this. This is something you should ask gihub about and not this list. fwiw. git blame -C still shows it From charlesr.harris at gmail.com Wed Nov 2 18:59:58 2016 From: charlesr.harris at gmail.com (Charles R Harris) Date: Wed, 2 Nov 2016 16:59:58 -0600 Subject: [Numpy-discussion] missing from contributor list? In-Reply-To: <66871716499819023.492381sturla.molden-gmail.com@news.gmane.org> References: <66871716499819023.492381sturla.molden-gmail.com@news.gmane.org> Message-ID: On Wed, Nov 2, 2016 at 4:38 PM, Sturla Molden wrote: > Why am I missing from the contributor hist here? > > https://github.com/numpy/numpy/blob/master/numpy/_ > build_utils/src/apple_sgemv_fix.c > > > You still show up in the commit log of if you follow the file ``` git log --follow numpy/_build_utils/apple_accelerate.py ``` So I have to agree with others that the problem is on the github end. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From mail at telenczuk.pl Thu Nov 3 06:10:22 2016 From: mail at telenczuk.pl (mail at telenczuk.pl) Date: Thu, 03 Nov 2016 11:10:22 +0100 Subject: [Numpy-discussion] missing from contributor list? In-Reply-To: References: <66871716499819023.492381sturla.molden-gmail.com@news.gmane.org> Message-ID: <581b0d0e1261f_131115123bc5f@Pct-EqAlain-Z30.notmuch> Hi, I had the same problem when I changed my email address. It seems that github matches user profiles to commits using the email address, so you have to make sure that the email you sign your commits with is registered in your github account. Cheers, Bartosz Charles R Harris wrote: > On Wed, Nov 2, 2016 at 4:38 PM, Sturla Molden > wrote: > > > Why am I missing from the contributor hist here? > > > > https://github.com/numpy/numpy/blob/master/numpy/_ > > build_utils/src/apple_sgemv_fix.c > > > > > > > You still show up in the commit log of if you follow the file > ``` > git log --follow numpy/_build_utils/apple_accelerate.py > ``` > > So I have to agree with others that the problem is on the github end. > > Chuck From jtaylor.debian at googlemail.com Thu Nov 3 10:24:24 2016 From: jtaylor.debian at googlemail.com (Julian Taylor) Date: Thu, 3 Nov 2016 15:24:24 +0100 Subject: [Numpy-discussion] missing from contributor list? In-Reply-To: <581b0d0e1261f_131115123bc5f@Pct-EqAlain-Z30.notmuch> References: <66871716499819023.492381sturla.molden-gmail.com@news.gmane.org> <581b0d0e1261f_131115123bc5f@Pct-EqAlain-Z30.notmuch> Message-ID: you can add multiple email addresses to your github settings so commits are properly attributed to your account. On 11/03/2016 11:10 AM, mail at telenczuk.pl wrote: > Hi, > > I had the same problem when I changed my email address. It seems that github matches user profiles to commits using the email address, so you have to make sure that the email you sign your commits with is registered in your github account. > > Cheers, > > Bartosz > > Charles R Harris wrote: >> On Wed, Nov 2, 2016 at 4:38 PM, Sturla Molden >> wrote: >> >>> Why am I missing from the contributor hist here? >>> >>> https://github.com/numpy/numpy/blob/master/numpy/_ >>> build_utils/src/apple_sgemv_fix.c >>> >>> >>> >> You still show up in the commit log of if you follow the file >> ``` >> git log --follow numpy/_build_utils/apple_accelerate.py >> ``` >> >> So I have to agree with others that the problem is on the github end. >> >> Chuck > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > https://mail.scipy.org/mailman/listinfo/numpy-discussion > From yuri.sukhov at gmail.com Thu Nov 3 11:17:26 2016 From: yuri.sukhov at gmail.com (Yuri Sukhov) Date: Thu, 3 Nov 2016 18:17:26 +0300 Subject: [Numpy-discussion] Using numpy.rint() with scalars Message-ID: Hi all, According to the documentation for numpy.rint() (https://docs.scipy.org/doc/ numpy/reference/generated/numpy.rint.html), it's a ufunc that accepts an array-like object as an input. But it also works with scalar inputs. Could anyone clarify if such use case is considered to be common and acceptable? It it just the documentation that does not cover one of the possible scenarios, or is it a side effect and such use case should be avoided? I want to use rint() with scalars as I have not found an alternative with the same behavior in cPython, but if it's not how it should be used, I don't want to rewrite the app when that behavior will change. Thanks! Yuri. -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Thu Nov 3 11:31:18 2016 From: charlesr.harris at gmail.com (Charles R Harris) Date: Thu, 3 Nov 2016 09:31:18 -0600 Subject: [Numpy-discussion] Using numpy.rint() with scalars In-Reply-To: References: Message-ID: On Thu, Nov 3, 2016 at 9:17 AM, Yuri Sukhov wrote: > Hi all, > > According to the documentation for numpy.rint() ( > https://docs.scipy.org/doc/numpy/reference/generated/numpy.rint.html), > it's a ufunc that accepts an array-like object as an input. > > But it also works with scalar inputs. Could anyone clarify if such use > case is considered to be common and acceptable? It it just the > documentation that does not cover one of the possible scenarios, or is it a > side effect and such use case should be avoided? > > I want to use rint() with scalars as I have not found an alternative with > the same behavior in cPython, but if it's not how it should be used, I > don't want to rewrite the app when that behavior will change. > > Scalars are array_like, they can be converted to arrays. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Thu Nov 3 13:04:45 2016 From: charlesr.harris at gmail.com (Charles R Harris) Date: Thu, 3 Nov 2016 11:04:45 -0600 Subject: [Numpy-discussion] Branching NumPy 1.12.x Message-ID: Hi All, I'm thinking that it is time to branch NumPy 1.12.x. I haven't got everything in it that I would have liked, in particular __numpy_ufunc__, but I think there is plenty of material and not branching is holding up some of the more risky stuff. My current thoughts on __numpy_ufunc__ is that it would be best to work it out over the 1.13.0 release cycle, starting with enabling it again right after the branch. Julian's work on avoiding temporary copies and Pauli's overlap handling PR are two other changes I've been putting off but don't want to delay further. There are some other smaller things that I had scheduled for 1.12.0, but would like to spend more time looking at. If there are some things that you think just have to be in 1.12.0, please mention them, but I'd rather aim at getting 1.13.0 out in a timely manner. Thoughts? Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From Nicolas.Rougier at inria.fr Thu Nov 3 14:29:10 2016 From: Nicolas.Rougier at inria.fr (Nicolas P. Rougier) Date: Thu, 3 Nov 2016 19:29:10 +0100 Subject: [Numpy-discussion] Useless but tricky exercise Message-ID: <6652346A-AC95-4A71-82BE-9A0A0F254303@inria.fr> Hi all, Given an array V that is a view of a base B, I was wondering if it is possible to find a (string) index such that `V = B[index]` For example: ``` B = np.arange(8*8).reshape(8,8) V = B[::2,::2] index = find_view(B,V) print(index) "::2,::2" print(np.allclose(V, eval("B[%s]" % index))) True ``` I wrote a solution at https://gist.github.com/rougier/b8c2256434b3a4a4271260cd4cc6cbc7 (not very thoroughly tested) but maybe there are better ways to do that (like a magic numpy call ?) (no use-case at all, only for teaching) Nicolas From ralf.gommers at gmail.com Fri Nov 4 04:05:41 2016 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Fri, 4 Nov 2016 21:05:41 +1300 Subject: [Numpy-discussion] Branching NumPy 1.12.x In-Reply-To: References: Message-ID: On Fri, Nov 4, 2016 at 6:04 AM, Charles R Harris wrote: > Hi All, > > I'm thinking that it is time to branch NumPy 1.12.x. I haven't got > everything in it that I would have liked, in particular __numpy_ufunc__, > but I think there is plenty of material and not branching is holding up > some of the more risky stuff. My current thoughts on __numpy_ufunc__ is > that it would be best to work it out over the 1.13.0 release cycle, > starting with enabling it again right after the branch. Julian's work on > avoiding temporary copies and Pauli's overlap handling PR are two other > changes I've been putting off but don't want to delay further. There are > some other smaller things that I had scheduled for 1.12.0, but would like > to spend more time looking at. If there are some things that you think just > have to be in 1.12.0, please mention them, but I'd rather aim at getting > 1.13.0 out in a timely manner. > > Thoughts? > That's a really good plan I think. Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From ndbecker2 at gmail.com Fri Nov 4 08:06:32 2016 From: ndbecker2 at gmail.com (Neal Becker) Date: Fri, 04 Nov 2016 08:06:32 -0400 Subject: [Numpy-discussion] array comprehension Message-ID: I find I often write: np.array ([some list comprehension]) mainly because list comprehensions are just so sweet. But I imagine this isn't particularly efficient. I wonder if numpy has a "better" way, and if not, maybe it would be a nice addition? From faltet at gmail.com Fri Nov 4 08:26:10 2016 From: faltet at gmail.com (Francesc Alted) Date: Fri, 4 Nov 2016 13:26:10 +0100 Subject: [Numpy-discussion] array comprehension In-Reply-To: References: Message-ID: 2016-11-04 13:06 GMT+01:00 Neal Becker : > I find I often write: > np.array ([some list comprehension]) > > mainly because list comprehensions are just so sweet. > > But I imagine this isn't particularly efficient. > Right. Using a generator and np.fromiter() will avoid the creation of the intermediate list. Something like: np.fromiter((i for i in range(x))) # use xrange for Python 2 > > I wonder if numpy has a "better" way, and if not, maybe it would be a nice > addition? > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > https://mail.scipy.org/mailman/listinfo/numpy-discussion > -- Francesc Alted -------------- next part -------------- An HTML attachment was scrubbed... URL: From ndbecker2 at gmail.com Fri Nov 4 09:36:02 2016 From: ndbecker2 at gmail.com (Neal Becker) Date: Fri, 04 Nov 2016 09:36:02 -0400 Subject: [Numpy-discussion] array comprehension References:

Message-ID: Francesc Alted wrote: > 2016-11-04 13:06 GMT+01:00 Neal Becker : > >> I find I often write: >> np.array ([some list comprehension]) >> >> mainly because list comprehensions are just so sweet. >> >> But I imagine this isn't particularly efficient. >> > > Right. Using a generator and np.fromiter() will avoid the creation of the > intermediate list. Something like: > > np.fromiter((i for i in range(x))) # use xrange for Python 2 > > Does this generalize to >1 dimensions? From faltet at gmail.com Fri Nov 4 10:12:11 2016 From: faltet at gmail.com (Francesc Alted) Date: Fri, 4 Nov 2016 15:12:11 +0100 Subject: [Numpy-discussion] array comprehension In-Reply-To: References:

Message-ID: 2016-11-04 14:36 GMT+01:00 Neal Becker : > Francesc Alted wrote: > > > 2016-11-04 13:06 GMT+01:00 Neal Becker : > > > >> I find I often write: > >> np.array ([some list comprehension]) > >> > >> mainly because list comprehensions are just so sweet. > >> > >> But I imagine this isn't particularly efficient. > >> > > > > Right. Using a generator and np.fromiter() will avoid the creation of > the > > intermediate list. Something like: > > > > np.fromiter((i for i in range(x))) # use xrange for Python 2 > > > > > Does this generalize to >1 dimensions? > A reshape() is not enough? What do you want to do exactly? > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > https://mail.scipy.org/mailman/listinfo/numpy-discussion > -- Francesc Alted -------------- next part -------------- An HTML attachment was scrubbed... URL: From shoyer at gmail.com Fri Nov 4 11:04:13 2016 From: shoyer at gmail.com (Stephan Hoyer) Date: Fri, 4 Nov 2016 08:04:13 -0700 Subject: [Numpy-discussion] array comprehension In-Reply-To: References:

Message-ID: On Fri, Nov 4, 2016 at 7:12 AM, Francesc Alted wrote: > Does this generalize to >1 dimensions? >> > > A reshape() is not enough? What do you want to do exactly? > np.fromiter takes scalar input and only builds a 1D array. So it actually can't combine multiple values at once unless they are flattened out in Python. It could be nice to add support for non-scalar inputs, stacking them similarly to np.array. Likewise, it could be nice to add an axis argument, so it can work similarly to np.stack. More generally, you might want to iterate and rebuild over arbitrary dimension(s) of an array. Something like np.stack([x for x in np.unstack(y, axis)], axis) But, we also don't have an unstack function. This would mostly be syntactic sugar, but I think it would be a nice addition. Such a function actually exists in TensorFlow: https://g3doc.corp.google.com/third_party/tensorflow/g3doc/api_docs/python/array_ops.md?cl=head#unstack -------------- next part -------------- An HTML attachment was scrubbed... URL: From davidmenhur at gmail.com Fri Nov 4 11:11:51 2016 From: davidmenhur at gmail.com (=?UTF-8?B?RGHPgGlk?=) Date: Fri, 4 Nov 2016 16:11:51 +0100 Subject: [Numpy-discussion] array comprehension In-Reply-To: References:

Message-ID: On 4 November 2016 at 16:04, Stephan Hoyer wrote: > > But, we also don't have an unstack function. This would mostly be syntactic > sugar, but I think it would be a nice addition. Such a function actually > exists in TensorFlow: > https://g3doc.corp.google.com/third_party/tensorflow/g3doc/api_docs/python/array_ops.md?cl=head#unstack That link is behind a login wall. This is the public version: https://github.com/tensorflow/tensorflow/blob/master/tensorflow/g3doc/api_docs/python/array_ops.md From rmay31 at gmail.com Fri Nov 4 11:33:22 2016 From: rmay31 at gmail.com (Ryan May) Date: Fri, 4 Nov 2016 09:33:22 -0600 Subject: [Numpy-discussion] array comprehension In-Reply-To: References:

Message-ID: On Fri, Nov 4, 2016 at 9:04 AM, Stephan Hoyer wrote: > On Fri, Nov 4, 2016 at 7:12 AM, Francesc Alted wrote: > >> Does this generalize to >1 dimensions? >>> >> >> A reshape() is not enough? What do you want to do exactly? >> > > np.fromiter takes scalar input and only builds a 1D array. So it actually > can't combine multiple values at once unless they are flattened out in > Python. It could be nice to add support for non-scalar inputs, stacking > them similarly to np.array. Likewise, it could be nice to add an axis > argument, so it can work similarly to np.stack. > itertools.product, itertools.permutation, etc. with np.fromiter (and reshape) is probably also useful here, though it doesn't solve the non-scalar problem. Ryan -- Ryan May -------------- next part -------------- An HTML attachment was scrubbed... URL: From ndbecker2 at gmail.com Fri Nov 4 11:56:39 2016 From: ndbecker2 at gmail.com (Neal Becker) Date: Fri, 04 Nov 2016 11:56:39 -0400 Subject: [Numpy-discussion] array comprehension References:

Message-ID: Francesc Alted wrote: > 2016-11-04 14:36 GMT+01:00 Neal Becker : > >> Francesc Alted wrote: >> >> > 2016-11-04 13:06 GMT+01:00 Neal Becker : >> > >> >> I find I often write: >> >> np.array ([some list comprehension]) >> >> >> >> mainly because list comprehensions are just so sweet. >> >> >> >> But I imagine this isn't particularly efficient. >> >> >> > >> > Right. Using a generator and np.fromiter() will avoid the creation of >> the >> > intermediate list. Something like: >> > >> > np.fromiter((i for i in range(x))) # use xrange for Python 2 >> > >> > >> Does this generalize to >1 dimensions? >> > > A reshape() is not enough? What do you want to do exactly? > I was thinking about: x = np.array ([[L1] L2]) where L1,L2 take the form of a list comprehension, as a means to create a 2-D array (in this example) From robert.kern at gmail.com Fri Nov 4 11:59:23 2016 From: robert.kern at gmail.com (Robert Kern) Date: Fri, 4 Nov 2016 08:59:23 -0700 Subject: [Numpy-discussion] array comprehension In-Reply-To: References:

Message-ID: On Fri, Nov 4, 2016 at 6:36 AM, Neal Becker wrote: > > Francesc Alted wrote: > > > 2016-11-04 13:06 GMT+01:00 Neal Becker : > > > >> I find I often write: > >> np.array ([some list comprehension]) > >> > >> mainly because list comprehensions are just so sweet. > >> > >> But I imagine this isn't particularly efficient. > >> > > > > Right. Using a generator and np.fromiter() will avoid the creation of the > > intermediate list. Something like: > > > > np.fromiter((i for i in range(x))) # use xrange for Python 2 > > > > > Does this generalize to >1 dimensions? No. -- Robert Kern -------------- next part -------------- An HTML attachment was scrubbed... URL: From harrigan.matthew at gmail.com Fri Nov 4 13:11:36 2016 From: harrigan.matthew at gmail.com (Matthew Harrigan) Date: Fri, 4 Nov 2016 13:11:36 -0400 Subject: [Numpy-discussion] ufunc for sum of squared difference Message-ID: I was reading this and got thinking about if a ufunc could compute the sum of squared differences in a single pass without a temporary array. The python code below demonstrates a possible approach. import numpy as np x = np.arange(10) c = 1.0 def add_square_diff(x1, x2): return x1 + (x2-c)**2 ufunc = np.frompyfunc(add_square_diff, 2, 1) print(ufunc.reduce(x) - x[0] + (x[0]-c)**2) print(np.sum(np.square(x-c))) I have (at least) 4 questions: 1. Is it possible to pass run time constants to a ufunc written in C for use in its inner loop, and if so how? 2. Is it possible to pass an initial value to reduce to avoid the clean up required for the first element? 3. Does that ufunc work, or are there special cases which cause it to fall apart? 4. Would a very specialized ufunc such as this be considered for incorporating in numpy since it would help reduce time and memory of functions already in numpy? Thank you, Matt -------------- next part -------------- An HTML attachment was scrubbed... URL: From jorisvandenbossche at gmail.com Fri Nov 4 13:14:55 2016 From: jorisvandenbossche at gmail.com (Joris Van den Bossche) Date: Fri, 4 Nov 2016 18:14:55 +0100 Subject: [Numpy-discussion] ANN: pandas v0.19.1 released! Message-ID: Hi all, I'm pleased to announce the release of pandas 0.19.1. This is a bug-fix release from 0.19.0 and includes some small regression fixes, bug fixes and performance improvements. We recommend that all users upgrade to this version. See the v0.19.1 Whatsnew page for an overview of all bugs that have been fixed in 0.19.1. Thanks to all contributors! Joris --- *How to get it:* Source tarballs and windows/mac/linux wheels are available on PyPI (thanks to Christoph Gohlke for the windows wheels, and to Matthew Brett for setting up the mac/linux wheels). Conda packages are already available via the conda-forge channel (conda install pandas -c conda-forge). It will be available on the main channel shortly. *Issues:* Please report any issues on our issue tracker: https://github.com/pydata/ pandas/issues *Thanks to all the contributors of the 0.19.1 release:* - Adam Chainz - Anthonios Partheniou - Arash Rouhani - Ben Kandel - Brandon M. Burroughs - Chris - chris-b1 - Chris Warth - David Krych - dubourg - gfyoung - Iv?n Vall?s P?rez - Jeff Reback - Joe Jevnik - Jon M. Mease - Joris Van den Bossche - Josh Owen - Keshav Ramaswamy - Larry Ren - mattrijk - Michael Felt - paul-mannino - Piotr Chromiec - Robert Bradshaw - Sinhrks - Thiago Serafim - Tom Bird -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Fri Nov 4 13:24:22 2016 From: njs at pobox.com (Nathaniel Smith) Date: Fri, 4 Nov 2016 10:24:22 -0700 Subject: [Numpy-discussion] array comprehension In-Reply-To: References:

Message-ID: Are you sure fromiter doesn't make an intermediate list or equivalent? It has to collect all the values before it can know the shape or dtype of the array to put them in. On Nov 4, 2016 5:26 AM, "Francesc Alted" wrote: 2016-11-04 13:06 GMT+01:00 Neal Becker : > I find I often write: > np.array ([some list comprehension]) > > mainly because list comprehensions are just so sweet. > > But I imagine this isn't particularly efficient. > Right. Using a generator and np.fromiter() will avoid the creation of the intermediate list. Something like: np.fromiter((i for i in range(x))) # use xrange for Python 2 > > I wonder if numpy has a "better" way, and if not, maybe it would be a nice > addition? > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > https://mail.scipy.org/mailman/listinfo/numpy-discussion > -- Francesc Alted _______________________________________________ NumPy-Discussion mailing list NumPy-Discussion at scipy.org https://mail.scipy.org/mailman/listinfo/numpy-discussion -------------- next part -------------- An HTML attachment was scrubbed... URL: From shoyer at gmail.com Fri Nov 4 13:32:05 2016 From: shoyer at gmail.com (Stephan Hoyer) Date: Fri, 4 Nov 2016 10:32:05 -0700 Subject: [Numpy-discussion] array comprehension In-Reply-To: References:

Message-ID: On Fri, Nov 4, 2016 at 10:24 AM, Nathaniel Smith wrote: > Are you sure fromiter doesn't make an intermediate list or equivalent? It > has to collect all the values before it can know the shape or dtype of the > array to put them in. > fromiter dynamically resizes a NumPy array, like a Python list, except with a growth factor of 1.5 (rather than 1.25): https://github.com/numpy/numpy/blob/bb59409abf5237c155a1dc4c4d5b31e4acf32fbe/numpy/core/src/multiarray/ctors.c#L3721 -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Fri Nov 4 13:36:06 2016 From: njs at pobox.com (Nathaniel Smith) Date: Fri, 4 Nov 2016 10:36:06 -0700 Subject: [Numpy-discussion] array comprehension In-Reply-To: References:

Message-ID: On Nov 4, 2016 10:32 AM, "Stephan Hoyer" wrote: > > On Fri, Nov 4, 2016 at 10:24 AM, Nathaniel Smith wrote: >> >> Are you sure fromiter doesn't make an intermediate list or equivalent? It has to collect all the values before it can know the shape or dtype of the array to put them in. > > fromiter dynamically resizes a NumPy array, like a Python list, except with a growth factor of 1.5 (rather than 1.25): > https://github.com/numpy/numpy/blob/bb59409abf5237c155a1dc4c4d5b31e4acf32fbe/numpy/core/src/multiarray/ctors.c#L3721 Oh, right, and the dtype argument is mandatory, which is what makes this possible. -n -------------- next part -------------- An HTML attachment was scrubbed... URL: From sebastian at sipsolutions.net Fri Nov 4 13:56:48 2016 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Fri, 04 Nov 2016 18:56:48 +0100 Subject: [Numpy-discussion] ufunc for sum of squared difference In-Reply-To: References: Message-ID: <1478282208.861.29.camel@sipsolutions.net> On Fr, 2016-11-04 at 13:11 -0400, Matthew Harrigan wrote: > I was reading this and got thinking about if a ufunc could compute > the sum of squared differences in a single pass without a temporary > array.? The python code below demonstrates a possible approach. > > import numpy as np > x = np.arange(10) > c = 1.0 > def add_square_diff(x1, x2): > ??? return x1 + (x2-c)**2 > ufunc = np.frompyfunc(add_square_diff, 2, 1) > print(ufunc.reduce(x) - x[0] + (x[0]-c)**2) > print(np.sum(np.square(x-c))) > > I have (at least) 4 questions: > 1. Is it possible to pass run time constants to a ufunc written in C > for use in its inner loop, and if so how? I don't think its anticipated, since a ufunc could in most cases use a third argument, but a 3 arg ufunc can't be reduced. Not sure if there might be some trickery possible. > 2. Is it possible to pass an initial value to reduce to avoid the > clean up required for the first element? This is the identity normally. But the identity can only be 0, 1 or -1 right now I think. The identity is what the output array gets initialized with (which effectively makes it the first value passed into the inner loop). > 3. Does that ufunc work, or are there special cases which cause it to > fall apart? > 4. Would a very specialized ufunc such as this be considered for > incorporating in numpy since it would help reduce time and memory of > functions already in numpy? > Might be mixing up things, however, IIRC the single pass approach has a bad numerical accuracy, so that I doubt that it is a good default algorithm. - Sebastian > Thank you, > Matt > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > https://mail.scipy.org/mailman/listinfo/numpy-discussion -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 819 bytes Desc: This is a digitally signed message part URL: From harrigan.matthew at gmail.com Fri Nov 4 15:42:25 2016 From: harrigan.matthew at gmail.com (Matthew Harrigan) Date: Fri, 4 Nov 2016 15:42:25 -0400 Subject: [Numpy-discussion] ufunc for sum of squared difference In-Reply-To: <1478282208.861.29.camel@sipsolutions.net> References: <1478282208.861.29.camel@sipsolutions.net> Message-ID: I didn't notice identity before. Seems like frompyfunc always sets it to None. If it were zero maybe it would work as desired here. In the writing your own ufunc doc, I was wondering if the pointer to data could be used to get a constant at runtime. If not, what could that be used for? static void double_logit(char **args, npy_intp *dimensions, npy_intp* steps, void* data) Why would the numerical accuracy be any different? The subtraction and square operations look identical and I thought np.sum just calls np.add.reduce, so the reduction step uses the same code and would therefore have the same accuracy. Thanks On Fri, Nov 4, 2016 at 1:56 PM, Sebastian Berg wrote: > On Fr, 2016-11-04 at 13:11 -0400, Matthew Harrigan wrote: > > I was reading this and got thinking about if a ufunc could compute > > the sum of squared differences in a single pass without a temporary > > array. The python code below demonstrates a possible approach. > > > > import numpy as np > > x = np.arange(10) > > c = 1.0 > > def add_square_diff(x1, x2): > > return x1 + (x2-c)**2 > > ufunc = np.frompyfunc(add_square_diff, 2, 1) > > print(ufunc.reduce(x) - x[0] + (x[0]-c)**2) > > print(np.sum(np.square(x-c))) > > > > I have (at least) 4 questions: > > 1. Is it possible to pass run time constants to a ufunc written in C > > for use in its inner loop, and if so how? > > I don't think its anticipated, since a ufunc could in most cases use a > third argument, but a 3 arg ufunc can't be reduced. Not sure if there > might be some trickery possible. > > > 2. Is it possible to pass an initial value to reduce to avoid the > > clean up required for the first element? > > This is the identity normally. But the identity can only be 0, 1 or -1 > right now I think. The identity is what the output array gets > initialized with (which effectively makes it the first value passed > into the inner loop). > > > 3. Does that ufunc work, or are there special cases which cause it to > > fall apart? > > 4. Would a very specialized ufunc such as this be considered for > > incorporating in numpy since it would help reduce time and memory of > > functions already in numpy? > > > > Might be mixing up things, however, IIRC the single pass approach has a > bad numerical accuracy, so that I doubt that it is a good default > algorithm. > > - Sebastian > > > > Thank you, > > Matt > > _______________________________________________ > > NumPy-Discussion mailing list > > NumPy-Discussion at scipy.org > > https://mail.scipy.org/mailman/listinfo/numpy-discussion > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > https://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From chris.barker at noaa.gov Fri Nov 4 16:08:44 2016 From: chris.barker at noaa.gov (Chris Barker) Date: Fri, 4 Nov 2016 13:08:44 -0700 Subject: [Numpy-discussion] array comprehension In-Reply-To: References:

Message-ID: On Fri, Nov 4, 2016 at 10:36 AM, Nathaniel Smith wrote: > On Nov 4, 2016 10:32 AM, "Stephan Hoyer" wrote: > > fromiter dynamically resizes a NumPy array, like a Python list, except > with a growth factor of 1.5 > > Oh, right, and the dtype argument is mandatory, which is what makes this > possible. > Couldn't it determine the dtype from the first element, and then barf later if an incompatible one shows up? And then we could adapt this code to np.array() and get nice performance with no extra functions to think about calling... And off the top of my head, I can't think of why it couldn't be generalized to the nd case as well. -CHB -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov -------------- next part -------------- An HTML attachment was scrubbed... URL: From sebastian at sipsolutions.net Fri Nov 4 17:33:05 2016 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Fri, 04 Nov 2016 22:33:05 +0100 Subject: [Numpy-discussion] ufunc for sum of squared difference In-Reply-To: References: <1478282208.861.29.camel@sipsolutions.net> Message-ID: <1478295185.4986.0.camel@sipsolutions.net> On Fr, 2016-11-04 at 15:42 -0400, Matthew Harrigan wrote: > I didn't notice identity before.? Seems like frompyfunc always sets > it to None.? If it were zero maybe it would work as desired here. > > In the writing your own ufunc doc, I was wondering if the pointer to > data could be used to get a constant at runtime.? If not, what could > that be used for? > static void double_logit(char **args, npy_intp *dimensions, > ????????????????????????????npy_intp* steps, void* data) > Why would the numerical accuracy be any different?? The subtraction > and square operations look identical and I thought np.sum just calls > np.add.reduce, so the reduction step uses the same code and would > therefore have the same accuracy. > Sorry, did not read it carefully, I guess `c` is the mean, so you are doing the two pass method. - Sebastian > Thanks > > On Fri, Nov 4, 2016 at 1:56 PM, Sebastian Berg s.net> wrote: > > On Fr, 2016-11-04 at 13:11 -0400, Matthew Harrigan wrote: > > > I was reading this and got thinking about if a ufunc could > > compute > > > the sum of squared differences in a single pass without a > > temporary > > > array.? The python code below demonstrates a possible approach. > > > > > > import numpy as np > > > x = np.arange(10) > > > c = 1.0 > > > def add_square_diff(x1, x2): > > > ??? return x1 + (x2-c)**2 > > > ufunc = np.frompyfunc(add_square_diff, 2, 1) > > > print(ufunc.reduce(x) - x[0] + (x[0]-c)**2) > > > print(np.sum(np.square(x-c))) > > > > > > I have (at least) 4 questions: > > > 1. Is it possible to pass run time constants to a ufunc written > > in C > > > for use in its inner loop, and if so how? > > > > I don't think its anticipated, since a ufunc could in most cases > > use a > > third argument, but a 3 arg ufunc can't be reduced. Not sure if > > there > > might be some trickery possible. > > > > > 2. Is it possible to pass an initial value to reduce to avoid the > > > clean up required for the first element? > > > > This is the identity normally. But the identity can only be 0, 1 or > > -1 > > right now I think. The identity is what the output array gets > > initialized with (which effectively makes it the first value passed > > into the inner loop). > > > > > 3. Does that ufunc work, or are there special cases which cause > > it to > > > fall apart? > > > 4. Would a very specialized ufunc such as this be considered for > > > incorporating in numpy since it would help reduce time and memory > > of > > > functions already in numpy? > > > > > > > Might be mixing up things, however, IIRC the single pass approach > > has a > > bad numerical accuracy, so that I doubt that it is a good default > > algorithm. > > > > - Sebastian > > > > > > > Thank you, > > > Matt > > > _______________________________________________ > > > NumPy-Discussion mailing list > > > NumPy-Discussion at scipy.org > > > https://mail.scipy.org/mailman/listinfo/numpy-discussion > > _______________________________________________ > > NumPy-Discussion mailing list > > NumPy-Discussion at scipy.org > > https://mail.scipy.org/mailman/listinfo/numpy-discussion > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > https://mail.scipy.org/mailman/listinfo/numpy-discussion -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 819 bytes Desc: This is a digitally signed message part URL: From charlesr.harris at gmail.com Sat Nov 5 19:04:06 2016 From: charlesr.harris at gmail.com (Charles R Harris) Date: Sat, 5 Nov 2016 17:04:06 -0600 Subject: [Numpy-discussion] Numpy 1.12.x branched Message-ID: Hi All, Numpy 1.12.x has been branched and the 1.13 development branch is open. It would be helpful if folks could review the release notes as it is likely I've missed something. I'd like to make the first beta release in a couple of days. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From sebastian at sipsolutions.net Sun Nov 6 11:56:12 2016 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Sun, 06 Nov 2016 17:56:12 +0100 Subject: [Numpy-discussion] Numpy 1.12.x branched In-Reply-To: References: Message-ID: <1478451372.3875.5.camel@sipsolutions.net> On Sa, 2016-11-05 at 17:04 -0600, Charles R Harris wrote: > Hi All, > > Numpy 1.12.x has been branched and the 1.13 development branch is > open. It would be helpful if folks could review the release notes as > it is likely I've missed something.? I'd like to make the first beta > release in a couple of days. > Very cool, thanks for all the hard work! - Sebastian > Chuck > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > https://mail.scipy.org/mailman/listinfo/numpy-discussion -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 819 bytes Desc: This is a digitally signed message part URL: From charlesr.harris at gmail.com Sun Nov 6 13:44:04 2016 From: charlesr.harris at gmail.com (Charles R Harris) Date: Sun, 6 Nov 2016 11:44:04 -0700 Subject: [Numpy-discussion] __numpy_ufunc__ Message-ID: Hi All, For those interested in continuing the __numpy_ufunc__ saga, there is a pull request enabling it . Likely we will want to make some changes up front before merging that, so some discussion is in order. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Sun Nov 6 15:10:25 2016 From: charlesr.harris at gmail.com (Charles R Harris) Date: Sun, 6 Nov 2016 13:10:25 -0700 Subject: [Numpy-discussion] __numpy_ufunc__ In-Reply-To: References: Message-ID: On Sun, Nov 6, 2016 at 11:44 AM, Charles R Harris wrote: > Hi All, > > For those interested in continuing the __numpy_ufunc__ saga, there is a pull > request enabling it . Likely we > will want to make some changes up front before merging that, so some > discussion is in order. > > As a first order of business, let's decide whether to remove the index and rename `__numpy_ufunc__`. The motivation for this is discussed in issue #5986. If we decide positive on that (I'm in favor), I would be happy with the proposed name `__array_ufunc__`, although something more descriptive like `__handle_ufunc__` might be better. This is a wonderful opportunity for bike shedding for those so inclined ;) Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From nathan12343 at gmail.com Sun Nov 6 15:21:08 2016 From: nathan12343 at gmail.com (Nathan Goldbaum) Date: Sun, 6 Nov 2016 14:21:08 -0600 Subject: [Numpy-discussion] __numpy_ufunc__ In-Reply-To: References:

Message-ID: I'm interested in this, but was never able to dive in to the lengthy discussions on this. I'm curious if there is a summary of the current state of things somewhere that I could read to catch up. On Sunday, November 6, 2016, Charles R Harris wrote: > > > On Sun, Nov 6, 2016 at 11:44 AM, Charles R Harris < > charlesr.harris at gmail.com > > wrote: > >> Hi All, >> >> For those interested in continuing the __numpy_ufunc__ saga, there is a pull >> request enabling it . Likely >> we will want to make some changes up front before merging that, so some >> discussion is in order. >> >> > As a first order of business, let's decide whether to remove the index and > rename `__numpy_ufunc__`. The motivation for this is discussed in issue > #5986. > If we decide positive on that (I'm in favor), I would be happy with the > proposed name `__array_ufunc__`, although something more descriptive like > `__handle_ufunc__` might be better. This is a wonderful opportunity for > bike shedding for those so inclined ;) > > Chuck > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Mon Nov 7 03:08:54 2016 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Mon, 7 Nov 2016 21:08:54 +1300 Subject: [Numpy-discussion] __numpy_ufunc__ In-Reply-To: References:

Message-ID: On Mon, Nov 7, 2016 at 9:10 AM, Charles R Harris wrote: > > > On Sun, Nov 6, 2016 at 11:44 AM, Charles R Harris < > charlesr.harris at gmail.com> wrote: > >> Hi All, >> >> For those interested in continuing the __numpy_ufunc__ saga, there is a pull >> request enabling it . Likely >> we will want to make some changes up front before merging that, so some >> discussion is in order. >> >> > As a first order of business, let's decide whether to remove the index and > rename `__numpy_ufunc__`. The motivation for this is discussed in issue > #5986. > If we decide positive on that (I'm in favor), > It seems like everyone on that issue is in favor or at least +0. So +1 from me too. > I would be happy with the proposed name `__array_ufunc__`, although > something more descriptive like `__handle_ufunc__` might be better. > > This is a wonderful opportunity for bike shedding for those so inclined ;) > pass :) Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Mon Nov 7 03:19:20 2016 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Mon, 7 Nov 2016 21:19:20 +1300 Subject: [Numpy-discussion] __numpy_ufunc__ In-Reply-To: References:

Message-ID: On Mon, Nov 7, 2016 at 9:21 AM, Nathan Goldbaum wrote: > I'm interested in this, but was never able to dive in to the lengthy > discussions on this. I'm curious if there is a summary of the current > state of things somewhere that I could read to catch up. Old proposal, original implementation: https://github.com/numpy/numpy/blob/master/doc/neps/ufunc-overrides.rst Main issue for discussion: https://github.com/numpy/numpy/issues/5844. That really needs a summary, but I'm not sure there's an up-to-date one. Ralf > On Sunday, November 6, 2016, Charles R Harris > wrote: > >> >> >> On Sun, Nov 6, 2016 at 11:44 AM, Charles R Harris < >> charlesr.harris at gmail.com> wrote: >> >>> Hi All, >>> >>> For those interested in continuing the __numpy_ufunc__ saga, there is a pull >>> request enabling it . Likely >>> we will want to make some changes up front before merging that, so some >>> discussion is in order. >>> >>> >> As a first order of business, let's decide whether to remove the index >> and rename `__numpy_ufunc__`. The motivation for this is discussed in issue >> #5986. >> If we decide positive on that (I'm in favor), I would be happy with the >> proposed name `__array_ufunc__`, although something more descriptive like >> `__handle_ufunc__` might be better. This is a wonderful opportunity for >> bike shedding for those so inclined ;) >> >> Chuck >> >> > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > https://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From matti.picus at gmail.com Mon Nov 7 13:32:54 2016 From: matti.picus at gmail.com (Matti Picus) Date: Mon, 7 Nov 2016 20:32:54 +0200 Subject: [Numpy-discussion] Numpy 1.12.x branched In-Reply-To: References: Message-ID: <638b75a9-7d9b-5ddf-6b3a-fadf5a12c65a@gmail.com> On 07/11/16 10:19, numpy-discussion-request at scipy.org wrote: > Date: Sun, 06 Nov 2016 17:56:12 +0100 > From: Sebastian Berg > To:numpy-discussion at scipy.org > Subject: Re: [Numpy-discussion] Numpy 1.12.x branched > Message-ID:<1478451372.3875.5.camel at sipsolutions.net> > Content-Type: text/plain; charset="utf-8" > > On Sa, 2016-11-05 at 17:04 -0600, Charles R Harris wrote: >> >Hi All, >> > >> >Numpy 1.12.x has been branched and the 1.13 development branch is >> >open. It would be helpful if folks could review the release notes as >> >it is likely I've missed something.? I'd like to make the first beta >> >release in a couple of days. >> > > Very cool, thanks for all the hard work! > > - Sebastian > > >> >Chuck Thanks for managing this. I don't know where, but it would be nice if the release notes could mention the PyPy support - we are down to only a few failures on the test suite, the only real oustanding issue is nditer using UPDATEIFCOPY which depends on refcounting semantics to trigger the copy. Other than that PyPy + NumPy 1.12 is a working thing, we (PyPy devs) will soon try to make it work faster :). Matti From charlesr.harris at gmail.com Mon Nov 7 14:57:29 2016 From: charlesr.harris at gmail.com (Charles R Harris) Date: Mon, 7 Nov 2016 12:57:29 -0700 Subject: [Numpy-discussion] __numpy_ufunc__ In-Reply-To: References:

Message-ID: On Mon, Nov 7, 2016 at 1:08 AM, Ralf Gommers wrote: > > > On Mon, Nov 7, 2016 at 9:10 AM, Charles R Harris < > charlesr.harris at gmail.com> wrote: > >> >> >> On Sun, Nov 6, 2016 at 11:44 AM, Charles R Harris < >> charlesr.harris at gmail.com> wrote: >> >>> Hi All, >>> >>> For those interested in continuing the __numpy_ufunc__ saga, there is a pull >>> request enabling it . Likely >>> we will want to make some changes up front before merging that, so some >>> discussion is in order. >>> >>> >> As a first order of business, let's decide whether to remove the index >> and rename `__numpy_ufunc__`. The motivation for this is discussed in issue >> #5986. >> If we decide positive on that (I'm in favor), >> > > It seems like everyone on that issue is in favor or at least +0. So +1 > from me too. > > >> I would be happy with the proposed name `__array_ufunc__`, although >> something more descriptive like `__handle_ufunc__` might be better. >> > > > >> This is a wonderful opportunity for bike shedding for those so inclined ;) >> > Let me try to break things down an bit. *Uncontroversial* - Rename __numpy_ufunc__ to __array_ufunc__ - Remove index - Out is always a tuple I think this much is useful even if nothing else is done. *Goals* - Deprecate __array_priority__ - Ufuncs should succeed or error, never return NotImplemented - Add __array_ufunc__ stub to ndarray. I don't think these are controversial either, but they are longer term except possibly the last. Note that never returning NotImplemented disentangles ufuncs from ndarray binops, which I think is a good thing. *Binops* Here we come to the crux of the last arguments. The functions used for binops can currently be set dynamically, the method that is currently used to set them when the ufunc module is loaded. I think we want to do away with that at some point and fix a set of ufuncs with which they are implemented. This allows folks to overide the binop behavior using __array_ufunc__. I think that is mostly of interest to people who are subclassing ndarray and with that restriction doesn't bother me except that it entangles ufuncs with binops. However, what I'd like to see as well is an opt out for objects that don't care about ufuncs, but want to override the python numeric operators, something simple like `__me_me_me__`, or, more seriously, `__array_opt_out__` that will only come into play if the defining object is on the right hand side of an instance of ndarray. In that case the binop would return NotImplemented so as to defer to the Python machinery. Note that __array_priority__ is currently (ab)used for this. *Numpy scalars* Numpy scalars default to the corresponding PyArray_Type or PyGenericArrType_Type unless both arguments can be converted to the same c type as the calling scalar, so I don't think there is a problem there. Note that they do check _array_priority_ before trying to convert unknown objects to array scalars in a fallback case. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Mon Nov 7 14:59:48 2016 From: charlesr.harris at gmail.com (Charles R Harris) Date: Mon, 7 Nov 2016 12:59:48 -0700 Subject: [Numpy-discussion] Numpy 1.12.x branched In-Reply-To: <638b75a9-7d9b-5ddf-6b3a-fadf5a12c65a@gmail.com> References: <638b75a9-7d9b-5ddf-6b3a-fadf5a12c65a@gmail.com> Message-ID: On Mon, Nov 7, 2016 at 11:32 AM, Matti Picus wrote: > On 07/11/16 10:19, numpy-discussion-request at scipy.org wrote: > >> Date: Sun, 06 Nov 2016 17:56:12 +0100 >> From: Sebastian Berg >> To:numpy-discussion at scipy.org >> Subject: Re: [Numpy-discussion] Numpy 1.12.x branched >> Message-ID:<1478451372.3875.5.camel at sipsolutions.net> >> Content-Type: text/plain; charset="utf-8" >> >> On Sa, 2016-11-05 at 17:04 -0600, Charles R Harris wrote: >> >>> >Hi All, >>> > >>> >Numpy 1.12.x has been branched and the 1.13 development branch is >>> >open. It would be helpful if folks could review the release notes as >>> >it is likely I've missed something.? I'd like to make the first beta >>> >release in a couple of days. >>> > >>> >> Very cool, thanks for all the hard work! >> >> - Sebastian >> >> >> >Chuck >>> >> Thanks for managing this. I don't know where, but it would be nice if the > release notes could mention the PyPy support - we are down to only a few > failures on the test suite, the only real oustanding issue is nditer using > UPDATEIFCOPY which depends on refcounting semantics to trigger the copy. > Other than that PyPy + NumPy 1.12 is a working thing, we (PyPy devs) will > soon try to make it work faster :). > A PR updating the release notes would be welcome. This might be one of the highlights for those interested in PyPy. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From frederic.bastien at gmail.com Thu Nov 10 11:06:34 2016 From: frederic.bastien at gmail.com (=?UTF-8?B?RnLDqWTDqXJpYyBCYXN0aWVu?=) Date: Thu, 10 Nov 2016 11:06:34 -0500 Subject: [Numpy-discussion] Numpy 1.12.x branched In-Reply-To: References: <638b75a9-7d9b-5ddf-6b3a-fadf5a12c65a@gmail.com> Message-ID: My change about numpy.mean in float16 aren't in the doc. Should I make a PR again numpy master or maintenance/1.12.x? Fred On Mon, Nov 7, 2016 at 2:59 PM, Charles R Harris wrote: > > > On Mon, Nov 7, 2016 at 11:32 AM, Matti Picus > wrote: > >> On 07/11/16 10:19, numpy-discussion-request at scipy.org wrote: >> >>> Date: Sun, 06 Nov 2016 17:56:12 +0100 >>> From: Sebastian Berg >>> To:numpy-discussion at scipy.org >>> Subject: Re: [Numpy-discussion] Numpy 1.12.x branched >>> Message-ID:<1478451372.3875.5.camel at sipsolutions.net> >>> Content-Type: text/plain; charset="utf-8" >>> >>> On Sa, 2016-11-05 at 17:04 -0600, Charles R Harris wrote: >>> >>>> >Hi All, >>>> > >>>> >Numpy 1.12.x has been branched and the 1.13 development branch is >>>> >open. It would be helpful if folks could review the release notes as >>>> >it is likely I've missed something.? I'd like to make the first beta >>>> >release in a couple of days. >>>> > >>>> >>> Very cool, thanks for all the hard work! >>> >>> - Sebastian >>> >>> >>> >Chuck >>>> >>> Thanks for managing this. I don't know where, but it would be nice if >> the release notes could mention the PyPy support - we are down to only a >> few failures on the test suite, the only real oustanding issue is nditer >> using UPDATEIFCOPY which depends on refcounting semantics to trigger the >> copy. Other than that PyPy + NumPy 1.12 is a working thing, we (PyPy devs) >> will soon try to make it work faster :). >> > > A PR updating the release notes would be welcome. This might be one of the > highlights for those interested in PyPy. > > Chuck > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > https://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Thu Nov 10 13:30:13 2016 From: charlesr.harris at gmail.com (Charles R Harris) Date: Thu, 10 Nov 2016 11:30:13 -0700 Subject: [Numpy-discussion] Numpy 1.12.x branched In-Reply-To: References: <638b75a9-7d9b-5ddf-6b3a-fadf5a12c65a@gmail.com>

Message-ID: On Thu, Nov 10, 2016 at 9:06 AM, Fr?d?ric Bastien < frederic.bastien at gmail.com> wrote: > My change about numpy.mean in float16 aren't in the doc. > > Should I make a PR again numpy master or maintenance/1.12.x? > Make it against master. I may cut and paste the content into a bigger PR I will merge before the first beta. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From harrigan.matthew at gmail.com Fri Nov 11 11:25:58 2016 From: harrigan.matthew at gmail.com (Matthew Harrigan) Date: Fri, 11 Nov 2016 11:25:58 -0500 Subject: [Numpy-discussion] ufunc for sum of squared difference In-Reply-To: <1478295185.4986.0.camel@sipsolutions.net> References: <1478282208.861.29.camel@sipsolutions.net> <1478295185.4986.0.camel@sipsolutions.net> Message-ID: I started a ufunc to compute the sum of square differences here . It is about 4x faster and uses half the memory compared to np.sum(np.square(x-c)). This could significantly speed up common operations like std and var (where c=np.mean(x). It faster because its a single pass instead of 3, and also because the inner loop is specialized for the common reduce case, which might be an optimization considering more generally. I think I have answered my question #2&3 above. Can someone please point me to an example where "data" was used in a ufunc inner loop? How can that value be set at runtime? Thanks On Fri, Nov 4, 2016 at 5:33 PM, Sebastian Berg wrote: > On Fr, 2016-11-04 at 15:42 -0400, Matthew Harrigan wrote: > > I didn't notice identity before. Seems like frompyfunc always sets > > it to None. If it were zero maybe it would work as desired here. > > > > In the writing your own ufunc doc, I was wondering if the pointer to > > data could be used to get a constant at runtime. If not, what could > > that be used for? > > static void double_logit(char **args, npy_intp *dimensions, > > npy_intp* steps, void* data) > > Why would the numerical accuracy be any different? The subtraction > > and square operations look identical and I thought np.sum just calls > > np.add.reduce, so the reduction step uses the same code and would > > therefore have the same accuracy. > > > > Sorry, did not read it carefully, I guess `c` is the mean, so you are > doing the two pass method. > > - Sebastian > > > > Thanks > > > > On Fri, Nov 4, 2016 at 1:56 PM, Sebastian Berg > s.net> wrote: > > > On Fr, 2016-11-04 at 13:11 -0400, Matthew Harrigan wrote: > > > > I was reading this and got thinking about if a ufunc could > > > compute > > > > the sum of squared differences in a single pass without a > > > temporary > > > > array. The python code below demonstrates a possible approach. > > > > > > > > import numpy as np > > > > x = np.arange(10) > > > > c = 1.0 > > > > def add_square_diff(x1, x2): > > > > return x1 + (x2-c)**2 > > > > ufunc = np.frompyfunc(add_square_diff, 2, 1) > > > > print(ufunc.reduce(x) - x[0] + (x[0]-c)**2) > > > > print(np.sum(np.square(x-c))) > > > > > > > > I have (at least) 4 questions: > > > > 1. Is it possible to pass run time constants to a ufunc written > > > in C > > > > for use in its inner loop, and if so how? > > > > > > I don't think its anticipated, since a ufunc could in most cases > > > use a > > > third argument, but a 3 arg ufunc can't be reduced. Not sure if > > > there > > > might be some trickery possible. > > > > > > > 2. Is it possible to pass an initial value to reduce to avoid the > > > > clean up required for the first element? > > > > > > This is the identity normally. But the identity can only be 0, 1 or > > > -1 > > > right now I think. The identity is what the output array gets > > > initialized with (which effectively makes it the first value passed > > > into the inner loop). > > > > > > > 3. Does that ufunc work, or are there special cases which cause > > > it to > > > > fall apart? > > > > 4. Would a very specialized ufunc such as this be considered for > > > > incorporating in numpy since it would help reduce time and memory > > > of > > > > functions already in numpy? > > > > > > > > > > Might be mixing up things, however, IIRC the single pass approach > > > has a > > > bad numerical accuracy, so that I doubt that it is a good default > > > algorithm. > > > > > > - Sebastian > > > > > > > > > > Thank you, > > > > Matt > > > > _______________________________________________ > > > > NumPy-Discussion mailing list > > > > NumPy-Discussion at scipy.org > > > > https://mail.scipy.org/mailman/listinfo/numpy-discussion > > > _______________________________________________ > > > NumPy-Discussion mailing list > > > NumPy-Discussion at scipy.org > > > https://mail.scipy.org/mailman/listinfo/numpy-discussion > > > > > _______________________________________________ > > NumPy-Discussion mailing list > > NumPy-Discussion at scipy.org > > https://mail.scipy.org/mailman/listinfo/numpy-discussion > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > https://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From jtaylor.debian at googlemail.com Fri Nov 11 13:38:51 2016 From: jtaylor.debian at googlemail.com (Julian Taylor) Date: Fri, 11 Nov 2016 19:38:51 +0100 Subject: [Numpy-discussion] ufunc for sum of squared difference In-Reply-To: References: <1478282208.861.29.camel@sipsolutions.net> <1478295185.4986.0.camel@sipsolutions.net> Message-ID: here is an old unfinished PR adding max_abs which has a similar purpose as yours: https://github.com/numpy/numpy/pull/5509 So far I know data is set during runtime construction and is part of the ufunc object, so it is not really very suitable to pass in constants on call. Its intended purpose is pass in functions to be called in generic loops like PyUFunc_d_d. Having an option to mark arguments of a ufunc as special in reductions could be useful, e.g. it would allow a potential (fused-)multiply-and-add ufunc to be used to implement a weighted sum. On 11.11.2016 17:25, Matthew Harrigan wrote: > I started a ufunc to compute the sum of square differences here > . It > is about 4x faster and uses half the memory compared to > np.sum(np.square(x-c)). This could significantly speed up common > operations like std and var (where c=np.mean(x). It faster because its > a single pass instead of 3, and also because the inner loop is > specialized for the common reduce case, which might be an optimization > considering more generally. > > I think I have answered my question #2&3 above. > > Can someone please point me to an example where "data" was used in a > ufunc inner loop? How can that value be set at runtime? Thanks > > On Fri, Nov 4, 2016 at 5:33 PM, Sebastian Berg > > wrote: > > On Fr, 2016-11-04 at 15:42 -0400, Matthew Harrigan wrote: > > I didn't notice identity before. Seems like frompyfunc always sets > > it to None. If it were zero maybe it would work as desired here. > > > > In the writing your own ufunc doc, I was wondering if the pointer to > > data could be used to get a constant at runtime. If not, what could > > that be used for? > > static void double_logit(char **args, npy_intp *dimensions, > > npy_intp* steps, void* data) > > Why would the numerical accuracy be any different? The subtraction > > and square operations look identical and I thought np.sum just calls > > np.add.reduce, so the reduction step uses the same code and would > > therefore have the same accuracy. > > > > Sorry, did not read it carefully, I guess `c` is the mean, so you are > doing the two pass method. > > - Sebastian > > > > Thanks > > > > On Fri, Nov 4, 2016 at 1:56 PM, Sebastian Berg > s.net > wrote: > > > On Fr, 2016-11-04 at 13:11 -0400, Matthew Harrigan wrote: > > > > I was reading this and got thinking about if a ufunc could > > > compute > > > > the sum of squared differences in a single pass without a > > > temporary > > > > array. The python code below demonstrates a possible approach. > > > > > > > > import numpy as np > > > > x = np.arange(10) > > > > c = 1.0 > > > > def add_square_diff(x1, x2): > > > > return x1 + (x2-c)**2 > > > > ufunc = np.frompyfunc(add_square_diff, 2, 1) > > > > print(ufunc.reduce(x) - x[0] + (x[0]-c)**2) > > > > print(np.sum(np.square(x-c))) > > > > > > > > I have (at least) 4 questions: > > > > 1. Is it possible to pass run time constants to a ufunc written > > > in C > > > > for use in its inner loop, and if so how? > > > > > > I don't think its anticipated, since a ufunc could in most cases > > > use a > > > third argument, but a 3 arg ufunc can't be reduced. Not sure if > > > there > > > might be some trickery possible. > > > > > > > 2. Is it possible to pass an initial value to reduce to avoid the > > > > clean up required for the first element? > > > > > > This is the identity normally. But the identity can only be 0, 1 or > > > -1 > > > right now I think. The identity is what the output array gets > > > initialized with (which effectively makes it the first value passed > > > into the inner loop). > > > > > > > 3. Does that ufunc work, or are there special cases which cause > > > it to > > > > fall apart? > > > > 4. Would a very specialized ufunc such as this be considered for > > > > incorporating in numpy since it would help reduce time and memory > > > of > > > > functions already in numpy? > > > > > > > > > > Might be mixing up things, however, IIRC the single pass approach > > > has a > > > bad numerical accuracy, so that I doubt that it is a good default > > > algorithm. > > > > > > - Sebastian > > > > > > > > > > Thank you, > > > > Matt > > > > _______________________________________________ > > > > NumPy-Discussion mailing list > > > > NumPy-Discussion at scipy.org > > > > https://mail.scipy.org/mailman/listinfo/numpy-discussion > > > > _______________________________________________ > > > NumPy-Discussion mailing list > > > NumPy-Discussion at scipy.org > > > https://mail.scipy.org/mailman/listinfo/numpy-discussion > > > > > > _______________________________________________ > > NumPy-Discussion mailing list > > NumPy-Discussion at scipy.org > > https://mail.scipy.org/mailman/listinfo/numpy-discussion > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > https://mail.scipy.org/mailman/listinfo/numpy-discussion > > > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > https://mail.scipy.org/mailman/listinfo/numpy-discussion > From harrigan.matthew at gmail.com Fri Nov 11 16:06:15 2016 From: harrigan.matthew at gmail.com (Matthew Harrigan) Date: Fri, 11 Nov 2016 16:06:15 -0500 Subject: [Numpy-discussion] ufunc for sum of squared difference In-Reply-To: References: <1478282208.861.29.camel@sipsolutions.net> <1478295185.4986.0.camel@sipsolutions.net>

Message-ID: My possibly poorly thought out API would be something like np.add_sq_diff.set_c(42.0).reduce(x), basically add a setter method which returned self. Thanks for explaining what data was intended to do. All of this is really just a dance around the requirement that reduce only works on binary functions. Would there be any interest in extending reduce to allow for arbitrary numbers of input and output arguments? On Fri, Nov 11, 2016 at 1:38 PM, Julian Taylor < jtaylor.debian at googlemail.com> wrote: > here is an old unfinished PR adding max_abs which has a similar purpose > as yours: > https://github.com/numpy/numpy/pull/5509 > > So far I know data is set during runtime construction and is part of the > ufunc object, so it is not really very suitable to pass in constants on > call. > Its intended purpose is pass in functions to be called in generic loops > like PyUFunc_d_d. > > Having an option to mark arguments of a ufunc as special in reductions > could be useful, e.g. it would allow a potential > (fused-)multiply-and-add ufunc to be used to implement a weighted sum. > > On 11.11.2016 17:25, Matthew Harrigan wrote: > > I started a ufunc to compute the sum of square differences here > > . > It > > is about 4x faster and uses half the memory compared to > > np.sum(np.square(x-c)). This could significantly speed up common > > operations like std and var (where c=np.mean(x). It faster because its > > a single pass instead of 3, and also because the inner loop is > > specialized for the common reduce case, which might be an optimization > > considering more generally. > > > > I think I have answered my question #2&3 above. > > > > Can someone please point me to an example where "data" was used in a > > ufunc inner loop? How can that value be set at runtime? Thanks > > > > On Fri, Nov 4, 2016 at 5:33 PM, Sebastian Berg > > > wrote: > > > > On Fr, 2016-11-04 at 15:42 -0400, Matthew Harrigan wrote: > > > I didn't notice identity before. Seems like frompyfunc always sets > > > it to None. If it were zero maybe it would work as desired here. > > > > > > In the writing your own ufunc doc, I was wondering if the pointer > to > > > data could be used to get a constant at runtime. If not, what > could > > > that be used for? > > > static void double_logit(char **args, npy_intp *dimensions, > > > npy_intp* steps, void* data) > > > Why would the numerical accuracy be any different? The subtraction > > > and square operations look identical and I thought np.sum just > calls > > > np.add.reduce, so the reduction step uses the same code and would > > > therefore have the same accuracy. > > > > > > > Sorry, did not read it carefully, I guess `c` is the mean, so you are > > doing the two pass method. > > > > - Sebastian > > > > > > > Thanks > > > > > > On Fri, Nov 4, 2016 at 1:56 PM, Sebastian Berg > > > s.net > wrote: > > > > On Fr, 2016-11-04 at 13:11 -0400, Matthew Harrigan wrote: > > > > > I was reading this and got thinking about if a ufunc could > > > > compute > > > > > the sum of squared differences in a single pass without a > > > > temporary > > > > > array. The python code below demonstrates a possible approach. > > > > > > > > > > import numpy as np > > > > > x = np.arange(10) > > > > > c = 1.0 > > > > > def add_square_diff(x1, x2): > > > > > return x1 + (x2-c)**2 > > > > > ufunc = np.frompyfunc(add_square_diff, 2, 1) > > > > > print(ufunc.reduce(x) - x[0] + (x[0]-c)**2) > > > > > print(np.sum(np.square(x-c))) > > > > > > > > > > I have (at least) 4 questions: > > > > > 1. Is it possible to pass run time constants to a ufunc written > > > > in C > > > > > for use in its inner loop, and if so how? > > > > > > > > I don't think its anticipated, since a ufunc could in most cases > > > > use a > > > > third argument, but a 3 arg ufunc can't be reduced. Not sure if > > > > there > > > > might be some trickery possible. > > > > > > > > > 2. Is it possible to pass an initial value to reduce to avoid > the > > > > > clean up required for the first element? > > > > > > > > This is the identity normally. But the identity can only be 0, 1 > or > > > > -1 > > > > right now I think. The identity is what the output array gets > > > > initialized with (which effectively makes it the first value > passed > > > > into the inner loop). > > > > > > > > > 3. Does that ufunc work, or are there special cases which cause > > > > it to > > > > > fall apart? > > > > > 4. Would a very specialized ufunc such as this be considered > for > > > > > incorporating in numpy since it would help reduce time and > memory > > > > of > > > > > functions already in numpy? > > > > > > > > > > > > > Might be mixing up things, however, IIRC the single pass approach > > > > has a > > > > bad numerical accuracy, so that I doubt that it is a good default > > > > algorithm. > > > > > > > > - Sebastian > > > > > > > > > > > > > Thank you, > > > > > Matt > > > > > _______________________________________________ > > > > > NumPy-Discussion mailing list > > > > > NumPy-Discussion at scipy.org > > > > > https://mail.scipy.org/mailman/listinfo/numpy-discussion > > > > > > _______________________________________________ > > > > NumPy-Discussion mailing list > > > > NumPy-Discussion at scipy.org > > > > https://mail.scipy.org/mailman/listinfo/numpy-discussion > > > > > > > > > _______________________________________________ > > > NumPy-Discussion mailing list > > > NumPy-Discussion at scipy.org > > > https://mail.scipy.org/mailman/listinfo/numpy-discussion > > > > > > _______________________________________________ > > NumPy-Discussion mailing list > > NumPy-Discussion at scipy.org > > https://mail.scipy.org/mailman/listinfo/numpy-discussion > > > > > > > > > > > > _______________________________________________ > > NumPy-Discussion mailing list > > NumPy-Discussion at scipy.org > > https://mail.scipy.org/mailman/listinfo/numpy-discussion > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > https://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From oleksandr.pavlyk at intel.com Sat Nov 12 12:00:07 2016 From: oleksandr.pavlyk at intel.com (Pavlyk, Oleksandr) Date: Sat, 12 Nov 2016 17:00:07 +0000 Subject: [Numpy-discussion] Ensuring one can operate on array-like argument in place Message-ID: <4C9EDA7282E297428F3986994EB0FBD387DFB8@ORSMSX110.amr.corp.intel.com> Hi, In my Cython code a function processes it argument x as follows: x_arr = PyArray_CheckFromAny( x, NULL, 0, 0, cnp.NPY_ELEMENTSTRIDES | cnp.NPY_ENSUREARRAY | cnp.NPY_NOTSWAPPED, NULL) if x_arr is not x: in_place = 1 # a copy was made, so we can work in place. The logic is of the last line turns out to be incorrect, because the input x can be a class with an array interface: class FakeArray(object): def __init__(self, data): self._data = data self.__array_interface__ = data.__array_interface__ Feeding my function FakeArray(xx), x_arr will point into the content of xx, resulting in unwarranted content overwrite of xx. I am trying to replace that condition with if x_arr is not x and cnp.PyArray_Check(x): # a copy was made, so we can work in place. in_place = 1 if cnp.PyArray_CHKFLAGS(x_arr, cnp.NPY_WRITEABLE) else 0 I am wondering if I perhaps overlooked some case. Thank you, Sasha -------------- next part -------------- An HTML attachment was scrubbed... URL: From pav at iki.fi Sun Nov 13 09:29:06 2016 From: pav at iki.fi (Pauli Virtanen) Date: Sun, 13 Nov 2016 14:29:06 +0000 (UTC) Subject: [Numpy-discussion] Ensuring one can operate on array-like argument in place References: <4C9EDA7282E297428F3986994EB0FBD387DFB8@ORSMSX110.amr.corp.intel.com> Message-ID: Sat, 12 Nov 2016 17:00:07 +0000, Pavlyk, Oleksandr kirjoitti: [clip] > if x_arr is not x: > in_place = 1 # a copy was made, so we can work in place. > > The logic is of the last line turns out to be incorrect, because the > input x can be a class with an array interface. Please see: https://github.com/scipy/scipy/blob/master/scipy/linalg/misc.py#L169 This probably can be translated to equivalent Numpy C API calls. From Jerome.Kieffer at esrf.fr Mon Nov 14 03:38:09 2016 From: Jerome.Kieffer at esrf.fr (Jerome Kieffer) Date: Mon, 14 Nov 2016 09:38:09 +0100 Subject: [Numpy-discussion] ufunc for sum of squared difference In-Reply-To: References: <1478282208.861.29.camel@sipsolutions.net> <1478295185.4986.0.camel@sipsolutions.net> Message-ID: <20161114093809.6ea2b3a0@lintaillefer.esrf.fr> On Fri, 11 Nov 2016 11:25:58 -0500 Matthew Harrigan wrote: > I started a ufunc to compute the sum of square differences here > . > It is about 4x faster and uses half the memory compared to > np.sum(np.square(x-c)). Hi Matt, Using *blas* you win already a factor two (maybe more depending on you blas implementation): % python -m timeit -s "import numpy as np;x=np.linspace(0,1,int(1e7))" "np.sum(np.square(x-2.))" 10 loops, best of 3: 135 msec per loop % python -m timeit -s "import numpy as np;x=np.linspace(0,1,int(1e7))" "y=x-2.;np.dot(y,y)" 10 loops, best of 3: 70.2 msec per loop Cheers, -- J?r?me Kieffer From e.antero.tammi at gmail.com Mon Nov 14 15:38:25 2016 From: e.antero.tammi at gmail.com (eat) Date: Mon, 14 Nov 2016 22:38:25 +0200 Subject: [Numpy-discussion] ufunc for sum of squared difference In-Reply-To: <20161114093809.6ea2b3a0@lintaillefer.esrf.fr> References: <1478282208.861.29.camel@sipsolutions.net> <1478295185.4986.0.camel@sipsolutions.net> <20161114093809.6ea2b3a0@lintaillefer.esrf.fr> Message-ID: Yeah, but it's not so obvious what's happening "under the hoods". Consider this (with an old Win7 machine): Python 3.5.2 (v3.5.2:4def2a2901a5, Jun 25 2016, 22:18:55) [MSC v.1900 64 bit (AMD64)] np.__version__ '1.11.1' On Mon, Nov 14, 2016 at 10:38 AM, Jerome Kieffer wrote: > On Fri, 11 Nov 2016 11:25:58 -0500 > Matthew Harrigan wrote: > > > I started a ufunc to compute the sum of square differences here > > . > > It is about 4x faster and uses half the memory compared to > > np.sum(np.square(x-c)). > > Hi Matt, > > Using *blas* you win already a factor two (maybe more depending on you > blas implementation): > > % python -m timeit -s "import numpy as np;x=np.linspace(0,1,int(1e7))" > "np.sum(np.square(x-2.))" > 10 loops, best of 3: 135 msec per loop > > % python -m timeit -s "import numpy as np;x=np.linspace(0,1,int(1e7))" > "y=x-2.;np.dot(y,y)" > 10 loops, best of 3: 70.2 msec per loop > x= np.linspace(0, 1, int(1e6)) timeit np.sum(np.square(x- 2.)) 10 loops, best of 3: 23 ms per loop y= x- 2. timeit np.dot(y, y) The slowest run took 18.60 times longer than the fastest. This could mean that an intermediate result is being cached. 1000 loops, best of 3: 1.78 ms per loop timeit np.dot(y, y) 1000 loops, best of 3: 1.73 ms per loop Best, eat > > > Cheers, > -- > J?r?me Kieffer > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > https://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From chris.barker at noaa.gov Mon Nov 14 18:20:53 2016 From: chris.barker at noaa.gov (Chris Barker) Date: Mon, 14 Nov 2016 15:20:53 -0800 Subject: [Numpy-discussion] Ensuring one can operate on array-like argument in place In-Reply-To: <4C9EDA7282E297428F3986994EB0FBD387DFB8@ORSMSX110.amr.corp.intel.com> References: <4C9EDA7282E297428F3986994EB0FBD387DFB8@ORSMSX110.amr.corp.intel.com> Message-ID: I tend to use ndarray.copy() in python code -- no reason you couldn't do the same in Cython. If you want to take any array-like object that may not have a copy() method, you could call asanyarray() first: -CHB On Sat, Nov 12, 2016 at 9:00 AM, Pavlyk, Oleksandr < oleksandr.pavlyk at intel.com> wrote: > Hi, > > > > In my Cython code a function processes it argument x as follows: > > > > x_arr = PyArray_CheckFromAny( > > x, NULL, 0, 0, > > cnp.NPY_ELEMENTSTRIDES | cnp.NPY_ENSUREARRAY | > cnp.NPY_NOTSWAPPED, NULL) > > > > if x_arr is not x: > > in_place = 1 # a copy was made, so we can work in place. > > > > The logic is of the last line turns out to be incorrect, because the input > x can be a class with an array interface: > > > > class FakeArray(object): > > def __init__(self, data): > > self._data = data > > self.__array_interface__ = data.__array_interface__ > > > > Feeding my function FakeArray(xx), x_arr will point into the content of > xx, resulting in unwarranted content > overwrite of xx. > > > > I am trying to replace that condition with > > > > if x_arr is not x and cnp.PyArray_Check(x): > > # a copy was made, so we can work in place. > > in_place = 1 if cnp.PyArray_CHKFLAGS(x_arr, cnp.NPY_WRITEABLE) else > 0 > > > > I am wondering if I perhaps overlooked some case. > > > > Thank you, > > Sasha > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > https://mail.scipy.org/mailman/listinfo/numpy-discussion > > -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov -------------- next part -------------- An HTML attachment was scrubbed... URL: From harrigan.matthew at gmail.com Mon Nov 14 20:40:39 2016 From: harrigan.matthew at gmail.com (Matthew Harrigan) Date: Mon, 14 Nov 2016 20:40:39 -0500 Subject: [Numpy-discussion] ufunc for sum of squared difference In-Reply-To: <20161114093809.6ea2b3a0@lintaillefer.esrf.fr> References: <1478282208.861.29.camel@sipsolutions.net> <1478295185.4986.0.camel@sipsolutions.net> <20161114093809.6ea2b3a0@lintaillefer.esrf.fr> Message-ID: Still slower and worse uses 2x the memory for the intermediate temporary array. I propose allowing implicit reductions with ufuncs. Specifically if out is provided with shape[axis] = 1, then pass it on to the ufunc with a stride of 0. That should allow this to work: x = np.arange(10) def add_square_diff(x1, x2, x3): return x1 + (x2-x3)**2 result =np.zeros(1) np.frompyfunc(add_square_diff, 3, 1)(result, x, np.mean(x), result) Essentially it creates a reduce for a function which isn't binary. I think this would be generally useful. For instance, finding the min and max in one pass would be nice: def minmax(x1, x2, x3): return min(x1,x3), max(x2,x3) minn = np.array([np.inf]) maxx = np.array([-np.inf]) np.frompyfunc(minmax, 3, 2)(minn, maxx, x, minn, maxx) Note it also allows for arbitrary initial values or identity to be specified, possibly determined at run time. I think this would make ufuncs even more universal. On Mon, Nov 14, 2016 at 3:38 AM, Jerome Kieffer wrote: > On Fri, 11 Nov 2016 11:25:58 -0500 > Matthew Harrigan wrote: > > > I started a ufunc to compute the sum of square differences here > > . > > It is about 4x faster and uses half the memory compared to > > np.sum(np.square(x-c)). > > Hi Matt, > > Using *blas* you win already a factor two (maybe more depending on you > blas implementation): > > % python -m timeit -s "import numpy as np;x=np.linspace(0,1,int(1e7))" > "np.sum(np.square(x-2.))" > 10 loops, best of 3: 135 msec per loop > > % python -m timeit -s "import numpy as np;x=np.linspace(0,1,int(1e7))" > "y=x-2.;np.dot(y,y)" > 10 loops, best of 3: 70.2 msec per loop > > > Cheers, > -- > J?r?me Kieffer > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > https://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From shoyer at gmail.com Mon Nov 14 20:49:30 2016 From: shoyer at gmail.com (Stephan Hoyer) Date: Mon, 14 Nov 2016 17:49:30 -0800 Subject: [Numpy-discussion] ufunc for sum of squared difference In-Reply-To: References: <1478282208.861.29.camel@sipsolutions.net> <1478295185.4986.0.camel@sipsolutions.net> <20161114093809.6ea2b3a0@lintaillefer.esrf.fr> Message-ID: On Mon, Nov 14, 2016 at 5:40 PM, Matthew Harrigan < harrigan.matthew at gmail.com> wrote: > Essentially it creates a reduce for a function which isn't binary. I > think this would be generally useful. > NumPy already has a generic enough interface for creating such ufuncs. In fact, it's called a "generalized ufunc": https://docs.scipy.org/doc/numpy/reference/c-api.generalized-ufuncs.html I think you could already write "implicit reductions" using gufuncs? -------------- next part -------------- An HTML attachment was scrubbed... URL: From stuart at stuartreynolds.net Tue Nov 15 12:37:42 2016 From: stuart at stuartreynolds.net (Stuart Reynolds) Date: Tue, 15 Nov 2016 09:37:42 -0800 Subject: [Numpy-discussion] subclassing ndarray and keeping same ufunc behavior Message-ID: I'm trying to subclass an ndarray so that I can add some additional fields. When I do this however, I get new odd behavior when my object is passed to a variety of numpy functions. For example nanmin returns now return an object of the type of my new array class, whereas previously I'd get a float64. Why? Is this a bug with nanmin or my class? import numpy as np class NDArrayWithColumns(np.ndarray): def __new__(cls, obj, columns=None): obj = obj.view(cls) obj.columns = tuple(columns) return obj def __array_finalize__(self, obj): if obj is None: return self.columns = getattr(obj, 'columns', None) NAN = float("nan") r = np.array([1.,0.,1.,0.,1.,0.,1.,0.,NAN, 1., 1.])print "MIN", np.nanmin(r), type(np.nanmin(r)) gives: MIN 0.0 but >>> r = NDArrayWithColumns(r, ["a"])>>> print "MIN", np.nanmin(r), type(np.nanmin(r)) MIN 0.0 >>> print r.shape # ?!(11,) Note the change in type, and also that str(np.nanmin(r)) shows 1 field, not 11 as indicated by its shape. This seems wrong. Is there a way to get my subclass to behave more like an ndarray? I realize from the docs that I can override __array_wrap__, but its not clear me how how to use it to solve this issue. Or whether its the right tool. In case you're interested, I'm subclassing because I'd like to track column names in matrices of a single type. This is pretty common wish in scikit pipelines. Structured arrays and record type arrays allow for varying type. Pandas provides this functionality, but dealing with numpy arrays is easier (and more efficient) when writing cython extensions. Also, I think the structured arrays and record types are unlikely to play nice with cython because they're more freely typed -- I want to deal exclusively with arrays of doubles. Any thoughts of how to subclass ndarray and keep original behavior in ufuncs? -------------- next part -------------- An HTML attachment was scrubbed... URL: From m.h.vankerkwijk at gmail.com Tue Nov 15 13:48:11 2016 From: m.h.vankerkwijk at gmail.com (Marten van Kerkwijk) Date: Tue, 15 Nov 2016 13:48:11 -0500 Subject: [Numpy-discussion] subclassing ndarray and keeping same ufunc behavior In-Reply-To: References: Message-ID: Hi Stuart, It certainly seems correct behaviour to return the subclass you created: after all, you might want to keep the information on `columns` (e.g., consider doing nanmin along a given axis). Indeed, we certainly want to keep the unit in astropy's Quantity (which also is a subclass of ndarray). On the shape: shouldn't that be print(np.nanmin(r).shape)?? Overall, I think it is worth considering very carefully what exactly you try to accomplish; if different elements along a given axis have different meaning, I'm not sure it makes all that much sense to treat them as a single array (e.g., np.sin might be useful for one column, not not another). Even if pandas is slower, the advantage in clarity of what is happening might well be more important in the long run. All the best, Marten p.s. nanmin is not a ufunc; you can find it in numpy/lib/nan_functions.py From nathan12343 at gmail.com Tue Nov 15 13:52:35 2016 From: nathan12343 at gmail.com (Nathan Goldbaum) Date: Tue, 15 Nov 2016 13:52:35 -0500 Subject: [Numpy-discussion] subclassing ndarray and keeping same ufunc behavior In-Reply-To: References:

Message-ID: You might also want to consider writing a wrapper object that contains an ndarray as a (possibly private) attribute and then presents different views or interpretations of that array. Subclassing ndarray is a pit of snakes, it's best to avoid it if you can (I say as the author and maintainer of an ndarray subclass). On Tue, Nov 15, 2016 at 1:48 PM, Marten van Kerkwijk < m.h.vankerkwijk at gmail.com> wrote: > Hi Stuart, > > It certainly seems correct behaviour to return the subclass you > created: after all, you might want to keep the information on > `columns` (e.g., consider doing nanmin along a given axis). Indeed, we > certainly want to keep the unit in astropy's Quantity (which also is a > subclass of ndarray). > > On the shape: shouldn't that be print(np.nanmin(r).shape)?? > > Overall, I think it is worth considering very carefully what exactly > you try to accomplish; if different elements along a given axis have > different meaning, I'm not sure it makes all that much sense to treat > them as a single array (e.g., np.sin might be useful for one column, > not not another). Even if pandas is slower, the advantage in clarity > of what is happening might well be more important in the long run. > > All the best, > > Marten > > p.s. nanmin is not a ufunc; you can find it in numpy/lib/nan_functions.py > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > https://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From stuart at stuartreynolds.net Tue Nov 15 14:01:45 2016 From: stuart at stuartreynolds.net (Stuart Reynolds) Date: Tue, 15 Nov 2016 11:01:45 -0800 Subject: [Numpy-discussion] subclassing ndarray and keeping same ufunc behavior In-Reply-To: References:

Message-ID: Doh! Thanks for that. On Tue, Nov 15, 2016 at 10:48 AM, Marten van Kerkwijk < m.h.vankerkwijk at gmail.com> wrote: > Hi Stuart, > > It certainly seems correct behaviour to return the subclass you > created: after all, you might want to keep the information on > `columns` (e.g., consider doing nanmin along a given axis). Indeed, we > certainly want to keep the unit in astropy's Quantity (which also is a > subclass of ndarray). > > On the shape: shouldn't that be print(np.nanmin(r).shape)?? > > Overall, I think it is worth considering very carefully what exactly > you try to accomplish; if different elements along a given axis have > different meaning, I'm not sure it makes all that much sense to treat > them as a single array (e.g., np.sin might be useful for one column, > not not another). Even if pandas is slower, the advantage in clarity > of what is happening might well be more important in the long run. > > All the best, > > Marten > > p.s. nanmin is not a ufunc; you can find it in numpy/lib/nan_functions.py > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > https://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From Jerome.Kieffer at esrf.fr Wed Nov 16 03:29:24 2016 From: Jerome.Kieffer at esrf.fr (Jerome Kieffer) Date: Wed, 16 Nov 2016 09:29:24 +0100 Subject: [Numpy-discussion] ufunc for sum of squared difference In-Reply-To: References: <1478282208.861.29.camel@sipsolutions.net> <1478295185.4986.0.camel@sipsolutions.net> <20161114093809.6ea2b3a0@lintaillefer.esrf.fr> Message-ID: <20161116092924.79c2d82c@lintaillefer.esrf.fr> On Mon, 14 Nov 2016 22:38:25 +0200 eat wrote: > but it's not so obvious what's happening "under the hoods". Consider this > (with an old Win7 machine): > Python 3.5.2 (v3.5.2:4def2a2901a5, Jun 25 2016, 22:18:55) [MSC v.1900 64 > bit (AMD64)] > np.__version__ > '1.11.1' What matters is the "blas" library used under the hood, hence the options passed to numpy at compile time. I notices 20x differences depending on the blas version. But more importantly: * results are the same (at the limit of the numerical precision) * dot() was always faster than sum(square()), varying from a bit to a lot. I agree this may change in future version of numpy. Cheers, -- J?r?me Kieffer From harrigan.matthew at gmail.com Wed Nov 16 20:18:34 2016 From: harrigan.matthew at gmail.com (Matthew Harrigan) Date: Wed, 16 Nov 2016 20:18:34 -0500 Subject: [Numpy-discussion] ufunc for sum of squared difference In-Reply-To: References: <1478282208.861.29.camel@sipsolutions.net> <1478295185.4986.0.camel@sipsolutions.net> <20161114093809.6ea2b3a0@lintaillefer.esrf.fr>

Message-ID: Thanks for pointing me to that. I think its a great match for a 1D case, like the inner product of two arrays in terms of signature. I don't see how it works with higher dimensional arrays, esp with something like the axis parameter in ufunc.reduce. With what I proposed for an array of shape (M, N) for example, result.shape could be (1,) or (M, 1) or (1, N) for reducing over the flattened array or either axis. Can you do something like that with gufunc or do you need to iterate gufunc calls over slices/views? Thanks again. On Mon, Nov 14, 2016 at 8:49 PM, Stephan Hoyer wrote: > On Mon, Nov 14, 2016 at 5:40 PM, Matthew Harrigan < > harrigan.matthew at gmail.com> wrote: > >> Essentially it creates a reduce for a function which isn't binary. I >> think this would be generally useful. >> > > NumPy already has a generic enough interface for creating such ufuncs. In > fact, it's called a "generalized ufunc": > https://docs.scipy.org/doc/numpy/reference/c-api.generalized-ufuncs.html > > I think you could already write "implicit reductions" using gufuncs? > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > https://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Thu Nov 17 00:47:39 2016 From: charlesr.harris at gmail.com (Charles R Harris) Date: Wed, 16 Nov 2016 22:47:39 -0700 Subject: [Numpy-discussion] NumPy 1.12.0b1 released. Message-ID: Hi All, I'm pleased to annouce the release of NumPy 1.12.0b1. This release supports Python 2.7 and 3.4 - 3.6 and is the result of 388 pull requests submitted by 133 contributors. It is quite sizeable and rather than put the release notes inline I've attached them as a file and they may also be viewed at Github . Zip files and tarballs may also be found the Github link. Wheels and source archives may be downloaded from PyPI, which is the recommended method. This release is a large collection of fixes, enhancements, and improvements and it is difficult to select just a few as highlights. However, the following enhancements may be of particular interest - Order of operations in ``np.einsum`` now can be optimized for large speed improvements. - New ``signature`` argument to ``np.vectorize`` for vectorizing with core dimensions. - The ``keepdims`` argument was added to many functions. - Support for PyPy 2.7 v5.6.0 has been added. While not complete, this is a milestone for PyPy's C-API compatibility layer. Thanks to all, Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: 1.12.0-notes.rst Type: text/prs.fallenstein.rst Size: 64965 bytes Desc: not available URL: From agballen189 at aol.com Thu Nov 17 11:49:29 2016 From: agballen189 at aol.com (Andres Guzman-Ballen) Date: Thu, 17 Nov 2016 11:49:29 -0500 Subject: [Numpy-discussion] The --compilers flag is ignored for build_ext Message-ID: <15873316e96-5cdc-4a06@webprd-a11.mail.aol.com> Hello Numpy community! First time poster here. I was recommended by Charles Harris to present a question I have to you guys regarding using --compiler=intelem being ignored when I use build_ext. I am trying to compile Numpy v1.11.2 that I am getting from https://pypi.python.org/packages/16/f5/b432f028134dd30cfbf6f21b8264a9938e5e0f75204e72453af08d67eb0b/numpy-1.11.2.tar.gz, using Python 2.7.12 on my Ubuntu 10.04 LTS Machine. I specifically want to compile this with the Intel compiler. I have sourced the compilervars.sh so the Intel compiler is in the path. This is the command that is failing: 'python setup.py build_ext --inplace --debug --force --compiler=intelem --fcompiler=intelem'. The reason it fails is because I have added compiler flags to umath's configuration extension code in numpy/core/setup.py as extra_compiler_arg and they are specifically meant for Intel. Since the compiler setup is ignored (unlike in setup.py build, where config_cc is correctly called and takes care of this), it defaults to GCC. This is a snippet of what you get for 'build_ext': running build_clib running build_src build_src building py_modules sources building library "npymath" sources creating build creating build/src.linux-x86_64-2.7 customize Gnu95FCompiler Found executable /usr/bin/gfortran customize Gnu95FCompiler customize Gnu95FCompiler using config This is what you get for using 'build': running build running config_cc unifing config_cc, config, build_clib, build_ext, build commands --compiler options running config_fc unifing config_fc, config, build_clib, build_ext, build commands --fcompiler options running build_src build_src building py_modules sources creating build creating build/src.linux-x86_64-2.7 creating build/src.linux-x86_64-2.7/numpy creating build/src.linux-x86_64-2.7/numpy/distutils building library "npymath" sources Found executable /localdisk/psxe_16/compilers_and_libraries_2016.3.210/linux/bin/intel64/icc Could not locate executable ecc customize IntelEM64TFCompiler Found executable /localdisk/psxe_16/compilers_and_libraries_2016.3.210/linux/bin/intel64/ifort customize IntelEM64TFCompiler using config I do not experience this problem with 'python setup.py build --debug --force --compiler=intelem --fcompiler=intelem'. because the configuration function is correctly called here: https://github.intel.com/SAT/numpy/blob/develop/numpy/distutils/command/build.py#L11 (that is where I got the output from above). However, I do not see anything like this in build_ext: https://github.intel.com/SAT/numpy/blob/develop/numpy/distutils/command/build_ext.py#L34. This of course could be because build_ext is doing something else below in the run command on line 109. This is most likely not the right answer, but what harm does this do? diff --git a/numpy/distutils/command/build_ext.py b/numpy/distutils/command/build_ext.py index 0fa52a2..ffa88ae 100644 --- a/numpy/distutils/command/build_ext.py +++ b/numpy/distutils/command/build_ext.py @@ -79,6 +79,8 @@ class build_ext (old_build_ext): return # Make sure that extension sources are complete. + self.run_command('config_cc') + self.run_command('config_fc') self.run_command('build_src') if self.distribution.has_c_libraries(): I am able to get build_ext to behave properly by adding this snippet, but I'm pretty sure this is the wrong approach because there seems to be code below line 109 in numpy/distutils/command/build_ext.py that sets compiler-related things up. This is the ticket I created under Numpy's issues page: https://github.com/numpy/numpy/issues/8283. Any feedback is very well appreciated and I'd be more than happy to help contribute and get this fixed for the community. I don't want to just throw a problem at you guys and expect someone else to fix it for me. Thanks! Andres -------------- next part -------------- An HTML attachment was scrubbed... URL: From matti.picus at gmail.com Thu Nov 17 18:24:19 2016 From: matti.picus at gmail.com (Matti Picus) Date: Fri, 18 Nov 2016 01:24:19 +0200 Subject: [Numpy-discussion] NumPy 1.12.0b1 released In-Reply-To: References: Message-ID: <85707776-c19f-3b1e-ba75-9ca4ed2b209e@gmail.com> Congrats to all on the release.Two questions: Is there a guide to building standard wheels for NumPy? Assuming I can build standardized PyPy 2.7 wheels for Ubuntu, Win32 and OSX64, how can I get them blessed and uploaded to PyPI? Matti On 17/11/16 07:47, numpy-discussion-request at scipy.org wrote: > Date: Wed, 16 Nov 2016 22:47:39 -0700 > From: Charles R Harris > To: numpy-discussion, SciPy Users List > , SciPy Developers List, > python-announce-list at python.org > Subject: [Numpy-discussion] NumPy 1.12.0b1 released. > > Hi All, > > I'm pleased to annouce the release of NumPy 1.12.0b1. This release > supports Python 2.7 and 3.4 - 3.6 and is the result of 388 pull requests > submitted by 133 contributors. It is quite sizeable and rather than put the > release notes inline I've attached them as a file and they may also be > viewed at Github. > Zip files and tarballs may also be found the Github link. Wheels and source > archives may be downloaded from PyPI, which is the recommended method. > > This release is a large collection of fixes, enhancements, and improvements > and it is difficult to select just a few as highlights. However, the > following enhancements may be of particular interest > > - Order of operations in ``np.einsum`` now can be optimized for large > speed improvements. > - New ``signature`` argument to ``np.vectorize`` for vectorizing with > core dimensions. > - The ``keepdims`` argument was added to many functions. > - Support for PyPy 2.7 v5.6.0 has been added. While not complete, this > is a milestone for PyPy's C-API compatibility layer. > > Thanks to all, > > Chuck From izgurskii at gmail.com Fri Nov 18 02:43:05 2016 From: izgurskii at gmail.com (Dmitrii Izgurskii) Date: Fri, 18 Nov 2016 08:43:05 +0100 Subject: [Numpy-discussion] Status of NAs for integer arrays Message-ID: Hi, I would like to know where I can follow the progress of what's being done on implementing missing values in integer arrays. Some threads I found are from 2012. Is really nothing being done on this issue? -------------- next part -------------- An HTML attachment was scrubbed... URL: From matthew.brett at gmail.com Fri Nov 18 03:08:25 2016 From: matthew.brett at gmail.com (Matthew Brett) Date: Fri, 18 Nov 2016 00:08:25 -0800 Subject: [Numpy-discussion] NumPy 1.12.0b1 released In-Reply-To: <85707776-c19f-3b1e-ba75-9ca4ed2b209e@gmail.com> References: <85707776-c19f-3b1e-ba75-9ca4ed2b209e@gmail.com> Message-ID: Hi, On Thu, Nov 17, 2016 at 3:24 PM, Matti Picus wrote: > Congrats to all on the release.Two questions: > > Is there a guide to building standard wheels for NumPy? I don't think so - there is a repository that we use to build the wheels, that has the Windows, OSX and manyllinux recipes for the standard CPython build: https://github.com/MacPython/numpy-wheelso If you can work out a way to automate the PyPy builds and tests - especially using the same repo - that would be very useful. > Assuming I can build standardized PyPy 2.7 wheels for Ubuntu, Win32 and > OSX64, how can I get them blessed and uploaded to PyPI? If you can automate the build and tests, I'm guessing there will be no objections - but it's not my call... Cheers, Matthew From ralf.gommers at gmail.com Fri Nov 18 03:41:24 2016 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Fri, 18 Nov 2016 21:41:24 +1300 Subject: [Numpy-discussion] Status of NAs for integer arrays In-Reply-To: References: Message-ID: On Fri, Nov 18, 2016 at 8:43 PM, Dmitrii Izgurskii wrote: > Hi, > > I would like to know where I can follow the progress of what's being done > on implementing missing values in integer arrays. Some threads I found are > from 2012. Is really nothing being done on this issue? > Not that I'm aware of, no. Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Fri Nov 18 04:14:45 2016 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Fri, 18 Nov 2016 22:14:45 +1300 Subject: [Numpy-discussion] NumPy 1.12.0b1 released In-Reply-To: References: <85707776-c19f-3b1e-ba75-9ca4ed2b209e@gmail.com> Message-ID: On Fri, Nov 18, 2016 at 9:08 PM, Matthew Brett wrote: > Hi, > > On Thu, Nov 17, 2016 at 3:24 PM, Matti Picus > wrote: > > Congrats to all on the release.Two questions: > > > > Is there a guide to building standard wheels for NumPy? > > I don't think so - there is a repository that we use to build the > wheels, that has the Windows, OSX and manyllinux recipes for the > standard CPython build: > > https://github.com/MacPython/numpy-wheelso > > If you can work out a way to automate the PyPy builds and tests - > especially using the same repo - that would be very useful. > > > Assuming I can build standardized PyPy 2.7 wheels for Ubuntu, Win32 and > > OSX64, how can I get them blessed and uploaded to PyPI? > > If you can automate the build and tests, I'm guessing there will be no > objections - but it's not my call... > I'm in favor, assuming that the wheel tags and PyPy backwards compatibility situation is OK. Can't really find any examples. What I mean is that for CPython wheels contain tags like "cp27" or "cp35". PyPy wheels should have tags "pp". Are the PyPy cpyext layer and the defined such that a new PyPy release won't break older wheels? Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From sylvain.corlay at gmail.com Fri Nov 18 05:34:21 2016 From: sylvain.corlay at gmail.com (Sylvain Corlay) Date: Fri, 18 Nov 2016 11:34:21 +0100 Subject: [Numpy-discussion] ANN: xtensor and xtensor-python Message-ID: Hi all, We are pleased to announce the release of the xtensor library and its python bindings xtensor-python , by Johan Mabille (@JohanMabille) and Sylvain Corlay (@SylvainCorlay). 1. xtensor xtensor is a C++ template library for manipulating multi-dimensional array expressions. It provides - an API following the idioms of the *C++ standard library*. - an extensible expression system enabling lazy broadcasting and universal functions. - python bindings for operating inplace on numpy arrays in your C++ thanks to Python's buffer protocol and pybind. More details on lazy computing with xtensor are available below. 2. xtensor-cookiecutter Besides, xtensor and xtensor-python , we provide a cookiecutter template project to help extension authors get started. The xtensor-cookiecutter generates a simple project for a Python C++ extension with - a working setup.py compiling the extension module - a few example of functions making use of xtensor exposed to python, and the example of a vectorized function exposed to python. - all the boilerplate for the setup of the unit testing and generation of the HTML documentation for these examples. 3. You can try it now! You can try the xtensor live on the project website . The Try it Now button is powered by - The Cling C++ interpreter. - The Jupyter notebook. - The Binder project. ------------------------------ ------------------------------ Getting started xtensor requires a modern C++ compiler supporting C++14. The following C+ compilers are supported: - On Windows platforms, Visual C++ 2015 Update 2, or more recent - On Unix platforms, gcc 4.9 or a recent version of Clang Installation xtensor and xtensor-python are header-only libraries. We provide packages for the conda package manager. conda install -c conda-forge xtensor # installs xtensor conda install -c conda-forge xtensor-python # installs xtensor and xtensor-python Usage Basic Usage Initialize a 2-D array and compute the sum of one of its rows and a 1-D array. #include #include "xtensor/xarray.hpp" #include "xtensor/xio.hpp" xt::xarray arr1 {{1.0, 2.0, 3.0}, {2.0, 5.0, 7.0}, {2.0, 5.0, 7.0}}; xt::xarray arr2 {5.0, 6.0, 7.0}; xt::xarray res = xt::make_xview(arr1, 1) + arr2; std::cout << res; Outputs: {7, 11, 14} Initialize a 1-D array and reshape it inplace. #include #include "xtensor/xarray.hpp" #include "xtensor/xio.hpp" xt::xarray arr {1, 2, 3, 4, 5, 6, 7, 8, 9}; arr.reshape({3, 3}); std::cout << arr; Outputs: {{1, 2, 3}, {4, 5, 6}, {7, 8, 9}} Lazy Broadcasting with xtensor We can operate on arrays of different shapes of dimensions in an elementwise fashion. Broadcasting rules of xtensor are similar to those of numpy and libdynd . Broadcasting rules In an operation involving two arrays of different dimensions, the array with the lesser dimensions is broadcast across the leading dimensions of the other. For example, if A has shape (2, 3), and B has shape (4, 2, 3), the result of a broadcasted operation with A and B has shape (4, 2, 3). (2, 3) # A (4, 2, 3) # B --------- (4, 2, 3) # Result The same rule holds for scalars, which are handled as 0-D expressions. If matched up dimensions of two input arrays are different, and one of them has size 1, it is broadcast to match the size of the other. Let's say B has the shape (4, 2, 1) in the previous example, so the broadcasting happens as follows: (2, 3) # A (4, 2, 1) # B --------- (4, 2, 3) # Result Universal functions, Laziness and Vectorization With xtensor, if x, y and z are arrays of *broadcastable shapes*, the return type of an expression such as x + y * sin(z) is not an array. It is an xexpression object offering the same interface as an N-dimensional array, which does not hold the result. Values are only computed upon access or when the expression is assigned to an xarray object. This allows to operate symbolically on very large arrays and only compute the result for the indices of interest. We provide utilities to vectorize any scalar function (taking multiple scalar arguments) into a function that will perform on xexpressions, applying the lazy broadcasting rules which we just described. These functions are called *xfunction*s. They are xtensor's counterpart to numpy's universal functions. In xtensor, arithmetic operations (+, -, *, /) and all special functions are *xfunction*s. Iterating over xexpressions and Broadcasting Iterators All xexpressions offer two sets of functions to retrieve iterator pairs (and their const counterpart). - begin() and end() provide instances of xiterators which can be used to iterate over all the elements of the expression. The order in which elements are listed is row-major in that the index of last dimension is incremented first. - xbegin(shape) and xend(shape) are similar but take a *broadcasting shape* as an argument. Elements are iterated upon in a row-major way, but certain dimensions are repeated to match the provided shape as per the rules described above. For an expression e, e.xbegin(e.shape()) and e.begin() are equivalent. Fixed-dimension *and* Dynamic dimension Two container classes implementing multi-dimensional arrays are provided: xarray and xtensor. - xarray can be reshaped dynamically to any number of dimensions. It is the container that is the most similar to numpy arrays. - xtensor has a dimension set at compilation time, which enables many optimizations. For example, shapes and strides of xtensor instances are allocated on the stack instead of the heap. xarray and xtensor container are both xexpressions and can be involved and mixed in universal functions, assigned to each other etc... Besides, two access operators are provided: - The variadic template operator() which can take multiple integral arguments or none. - And the operator[] which takes a single multi-index argument, which can be of size determined at runtime. operator[] also supports access with braced initializers. Python bindings The python bindings are built upon the pybind11 library, a lightweight header-only for creating bindings between the Python and C++ programming languages. Example 1: Use an algorithm of the C++ library on a numpy array inplace. C++ code #include // std::accumulate #include "pybind11/pybind11.h" // pybind11 #include "xtensor/xmath.hpp" // C++ universal functions #include "xtensor-python/pyarray.hpp" // numpy bindings double sum_of_sines(xt::pyarray &m) { auto sines = xt::sin(m); // sines does not hold any value return std::accumulate(sines.begin(), sines.end(), 0.0); } PYBIND11_PLUGIN(xtensor_python_test) { pybind11::module m("xtensor_python_test", "Test module for xtensor python bindings"); m.def("sum_of_sines", sum_of_sines, "Return the sum of the sines"); return m.ptr(); } Python Code import numpy as npimport xtensor_python_test as xt a = np.arange(15).reshape(3, 5) s = xt.sum_of_sines(v) s Outputs 1.2853996391883833 Example 2: Create a universal function from a C++ scalar function C++ code #include "pybind11/pybind11.h" #include "xtensor-python/pyvectorize.hpp" #include #include namespace py = pybind11; double scalar_func(double i, double j) { return std::sin(i) - std::cos(j); } PYBIND11_PLUGIN(xtensor_python_test) { py::module m("xtensor_python_test", "Test module for xtensor python bindings"); m.def("vectorized_func", xt::pyvectorize(scalar_func), ""); return m.ptr(); } Python Code import numpy as npimport xtensor_python_test as xt x = np.arange(15).reshape(3, 5) y = [1, 2, 3] z = xt.vectorized_func(x, y) z Outputs [[-1. , 0.30116, 1.32544, 1.13111, -0.10315], [-1.95892, -0.81971, 1.07313, 1.97935, 1.06576], [-1.54402, -1.54029, -0.12042, 1.41016, 1.64425]] -------------- next part -------------- An HTML attachment was scrubbed... URL: