From dg.gmane at thesamovar.net Sun Dec 1 15:53:14 2013
From: dg.gmane at thesamovar.net (Dan Goodman)
Date: Sun, 1 Dec 2013 20:53:14 +0000 (UTC)
Subject: [Numpy-discussion] -ffast-math
References: <5299CCE9.90602@googlemail.com>
Message-ID:

Julian Taylor <jtaylor.debian at googlemail.com> writes:
> can you show the code that is slow in numpy?
> which version of gcc and libc are you using?
> with gcc 4.8 it uses the glibc 2.17 sin/cos with fast-math, so there
> should be no difference.

In trying to write some simple code to demonstrate it, I realised it was weirdly more complicated than I thought. Previously I had been comparing numpy against weave on a complicated expression, namely

    a*sin(2.0*freq*pi*t) + b + v*exp(-dt/tau) + (-a*sin(2.0*freq*pi*t) - b)*exp(-dt/tau)

Doing that with weave and no -ffast-math took approximately the same time as numpy, but with weave and -ffast-math it was about 30x faster. Here only a and v are arrays. Since numpy and weave with no -ffast-math took about the same time, I assumed it wasn't memory bound but had to do with -ffast-math. Here's the demo code (you might need to comment a couple of lines out if you want to actually run it, since it also tests a couple of things that depend on a library): http://bit.ly/IziH8H

However, when I did a simple example that just computed y=sin(x) for arrays x and y, I found that numpy and weave without -ffast-math took about the same time, but weave with -ffast-math was significantly slower than numpy! My take-home message from this: optimisation is weird. Could it be that -ffast-math and -O3 allow SSE instructions, and that there is some overhead to this that makes it worth it for a complex expression but not for a simple one? Here's the code for the simple example (it doesn't have any dependencies): http://bit.ly/18wdCKY

For reference, I'm on a newish 64-bit Windows machine running 32-bit Python 2.7.3, gcc version 4.5.2, and numpy 1.8.0 installed from binaries.
Dan

From jtaylor.debian at googlemail.com Sun Dec 1 16:30:35 2013
From: jtaylor.debian at googlemail.com (Julian Taylor)
Date: Sun, 01 Dec 2013 22:30:35 +0100
Subject: [Numpy-discussion] -ffast-math
In-Reply-To:
References: <5299CCE9.90602@googlemail.com>
Message-ID: <529BAA7B.3070206@googlemail.com>

On 01.12.2013 21:53, Dan Goodman wrote:
> Julian Taylor <jtaylor.debian at googlemail.com> writes:
>> can you show the code that is slow in numpy?
>> which version of gcc and libc are you using?
>> with gcc 4.8 it uses the glibc 2.17 sin/cos with fast-math, so there
>> should be no difference.
>
> In trying to write some simple code to demonstrate it, I realised it was
> weirdly more complicated than I thought. Previously I had been comparing
> numpy against weave on a complicated expression, namely a*sin(2.0*freq*pi*t)
> + b + v*exp(-dt/tau) + (-a*sin(2.0*freq*pi*t) - b)*exp(-dt/tau). Doing that
> with weave and no -ffast-math took the same time as numpy approximately, but
> with weave and -ffast-math it was about 30x faster. Here only a and v are
> arrays. Since numpy and weave with no -ffast-math took about the same time I
> assumed it wasn't memory bound but to do with the -ffast-math.
>

this should be the code:

    int N = _N;
    for(int _idx=0; _idx<N; _idx++) {
        ...
    }

[The generated loop body and the rest of this message were truncated in the archive, which swallowed everything after the '<' in the loop condition; Dan's reply below quotes the key point.]

From dg.gmane at thesamovar.net (Dan Goodman)
Subject: [Numpy-discussion] -ffast-math
References: <5299CCE9.90602@googlemail.com> <529BAA7B.3070206@googlemail.com>
Message-ID:

Julian Taylor <jtaylor.debian at googlemail.com> writes:
> your sin and exp calls are loop invariants, they do not depend on the
> loop iterable.
> This allows to move the expensive functions out of the loop and only
> leave some simple arithmetic in the body.

Ahhhh! I feel extremely stupid for not realising this! Thanks Julian.

Any thoughts on why using -ffast-math it actually goes slower for just doing sin(x)?
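The loop-invariant hoisting Julian describes can be sketched in pure Python (an illustrative example with made-up scalar parameters, not the actual weave-generated code; only a and v are arrays in the original benchmark):

```python
import math

# Made-up placeholder values for the scalar parameters in the thread.
freq, t, dt, tau, b = 2.0, 0.1, 0.001, 0.02, 3.0
pi = math.pi
a = [0.5, 1.5, 2.5]
v = [0.1, 0.2, 0.3]

# Naive: sin and exp are re-evaluated on every iteration even though
# their arguments never change inside the loop.
out_naive = [ai * math.sin(2.0*freq*pi*t) + b + vi * math.exp(-dt/tau)
             + (-ai * math.sin(2.0*freq*pi*t) - b) * math.exp(-dt/tau)
             for ai, vi in zip(a, v)]

# Hoisted: the invariant calls are moved out, leaving only cheap
# arithmetic in the loop body -- the transformation the compiler applies.
sit = math.sin(2.0*freq*pi*t)
ext = math.exp(-dt/tau)
out_hoisted = [ai*sit + b + vi*ext + (-ai*sit - b)*ext
               for ai, vi in zip(a, v)]

assert all(math.isclose(x, y) for x, y in zip(out_naive, out_hoisted))
```

This is consistent with Julian's explanation of the ~30x speedup on the complex expression: once the invariants are hoisted, only simple arithmetic remains per element, whereas the plain sin(x) loop has no invariant work to remove.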
From jtaylor.debian at googlemail.com Sun Dec 1 17:16:40 2013
From: jtaylor.debian at googlemail.com (Julian Taylor)
Date: Sun, 01 Dec 2013 23:16:40 +0100
Subject: [Numpy-discussion] -ffast-math
In-Reply-To:
References: <5299CCE9.90602@googlemail.com> <529BAA7B.3070206@googlemail.com>
Message-ID: <529BB548.6040806@googlemail.com>

On 01.12.2013 22:59, Dan Goodman wrote:
> Julian Taylor <jtaylor.debian at googlemail.com> writes:
>> your sin and exp calls are loop invariants, they do not depend on the
>> loop iterable.
>> This allows to move the expensive functions out of the loop and only
>> leave some simple arithmetic in the body.
>
> Ahhhh! I feel extremely stupid for not realising this! Thanks Julian.
>
> Any thoughts on why using -ffast-math it actually goes slower for just doing
> sin(x)?
>

no, on my linux machine ffast-math is a little faster:

numpy: 311 ms
weave_slow: 291 ms
weave_fast: 262 ms

here is a pure numpy version of your calculation which only performs 3 times worse than weave:

def timefunc_numpy2(a, v):
    ext = exp(-dt/tau)
    sit = sin(2.0*freq*pi*t)
    bs = 20000
    for i in range(0, N, bs):
        ab = a[i:i+bs]
        vb = v[i:i+bs]
        absit = ab*sit + b
        vb *= ext
        vb += absit
        vb -= absit*ext

it works by replacing temporaries with in-place operations and by blocking the operations to be more memory-cache friendly. Using numexpr should give you similar results.

From dg.gmane at thesamovar.net Sun Dec 1 18:01:31 2013
From: dg.gmane at thesamovar.net (Dan Goodman)
Date: Sun, 1 Dec 2013 23:01:31 +0000 (UTC)
Subject: [Numpy-discussion] -ffast-math
References: <5299CCE9.90602@googlemail.com> <529BAA7B.3070206@googlemail.com> <529BB548.6040806@googlemail.com>
Message-ID:

Julian Taylor <jtaylor.debian at googlemail.com> writes:
>
> On 01.12.2013 22:59, Dan Goodman wrote:
> > Julian Taylor <jtaylor.debian at googlemail.com> writes:
> >> your sin and exp calls are loop invariants, they do not depend on the
> >> loop iterable.
> >> This allows to move the expensive functions out of the loop and only
> >> leave some simple arithmetic in the body.
> >
> > Ahhhh! I feel extremely stupid for not realising this! Thanks Julian.
> >
> > Any thoughts on why using -ffast-math it actually goes slower for just doing
> > sin(x)?
> >
>
> no on my linux machine ffast-math is a little faster:
> numpy: 311 ms
> weave_slow: 291 ms
> weave_fast: 262 ms

Maybe something to do with my older version of gcc (4.5)?

> here is a pure numpy version of your calculation which only performs 3
> times worse than weave:
>
> def timefunc_numpy2(a, v):
>     ext = exp(-dt/tau)
>     sit = sin(2.0*freq*pi*t)
>     bs = 20000
>     for i in range(0, N, bs):
>         ab = a[i:i+bs]
>         vb = v[i:i+bs]
>         absit = ab*sit + b
>         vb *= ext
>         vb += absit
>         vb -= absit*ext
>
> it works by replacing temporaries with inplace operations and blocks the
> operations to be more memory cache friendlier.
> using numexpr should give you similar results.

I was working on something similar without the blocking and also got good results. Actually, your version with blocking doesn't give me as good performance on my machine, it's around 6x slower than weave. I tried different sizes for the block size but couldn't improve much on that. Using this unblocked code:

def timefunc_numpy_smart():
    _sin_term = sin(2.0*freq*pi*t)
    _exp_term = exp(-dt/tau)
    _a_term = (_sin_term-_sin_term*_exp_term)
    _v = v
    _v *= _exp_term
    _v += a*_a_term
    _v += -b*_exp_term + b

I got around 5x slower. Using numexpr 'dumbly' (i.e. just putting the expression in directly) was slower than the function above, but doing a hybrid between the two approaches worked well:

def timefunc_numexpr_smart():
    _sin_term = sin(2.0*freq*pi*t)
    _exp_term = exp(-dt/tau)
    _a_term = (_sin_term-_sin_term*_exp_term)
    _const_term = -b*_exp_term + b
    v[:] = numexpr.evaluate('a*_a_term+v*_exp_term+_const_term')
    #numexpr.evaluate('a*_a_term+v*_exp_term+_const_term', out=v)

This was about 3.5x slower than weave. If I used the commented out final line then it was only 1.5x slower than weave, but it also gives wrong results.
I reported this as a bug in numexpr a long time ago but I guess it hasn't been fixed yet (or maybe I didn't upgrade my version recently).

Dan

From dg.gmane at thesamovar.net Sun Dec 1 18:14:57 2013
From: dg.gmane at thesamovar.net (Dan Goodman)
Date: Sun, 1 Dec 2013 23:14:57 +0000 (UTC)
Subject: [Numpy-discussion] -ffast-math
References: <5299CCE9.90602@googlemail.com> <529BAA7B.3070206@googlemail.com> <529BB548.6040806@googlemail.com>
Message-ID:

Dan Goodman <dg.gmane at thesamovar.net> writes:
...
> I got around 5x slower. Using numexpr 'dumbly' (i.e. just putting the
> expression in directly) was slower than the function above, but doing a
> hybrid between the two approaches worked well:
>
> def timefunc_numexpr_smart():
>     _sin_term = sin(2.0*freq*pi*t)
>     _exp_term = exp(-dt/tau)
>     _a_term = (_sin_term-_sin_term*_exp_term)
>     _const_term = -b*_exp_term + b
>     v[:] = numexpr.evaluate('a*_a_term+v*_exp_term+_const_term')
>     #numexpr.evaluate('a*_a_term+v*_exp_term+_const_term', out=v)
>
> This was about 3.5x slower than weave. If I used the commented out final
> line then it was only 1.5x slower than weave, but it also gives wrong
> results. I reported this as a bug in numexpr a long time ago but I guess it
> hasn't been fixed yet (or maybe I didn't upgrade my version recently).

I just upgraded numexpr to 2.2 where they did fix this bug, and now the 'smart' numexpr version runs exactly as fast as weave (so I guess there were some performance enhancements in numexpr as well). Fantastic!
Dan

From mailinglists at xgm.de Mon Dec 2 11:52:07 2013
From: mailinglists at xgm.de (Florian Lindner)
Date: Mon, 02 Dec 2013 17:52:07 +0100
Subject: [Numpy-discussion] Joining lists to an array
Message-ID: <6972186.imR98MDMjN@horus>

Hello,

I have this piece of example code:

import random, numpy as np

y = []
doc_all = []
# da = np.zeros(2)

for i in range(4):
    docs = range(random.randint(1, 10))
    y += [i]*len(docs)
    doc_all += docs
    # np.append(da, np.column_stack((docs, y)), axis=0)

data = np.array([doc_all, y]).transpose()

y and docs are lists that are created in the loop body and all joined together. From these two long lists an array is created at the end; data then has a shape like (28, 2). Is there a way I can do this more elegantly using numpy or scipy tricks? I was working on something like the two lines I commented out, but ...

Thanks,
Florian

From daniele at grinta.net Mon Dec 2 12:44:25 2013
From: daniele at grinta.net (Daniele Nicolodi)
Date: Mon, 02 Dec 2013 18:44:25 +0100
Subject: [Numpy-discussion] math.fsum() like ufunc for numpy
Message-ID: <529CC6F9.2060106@grinta.net>

Hello,

would there be interest in adding a floating-point accurate summation function like Python's math.fsum(), in the form of a ufunc, to NumPy?

I had a look at the algorithm (http://code.activestate.com/recipes/393090-binary-floating-point-summation-accurate-to-full-p/) and it looks quite straightforward to implement. I can try to submit a patch for it.

Cheers,
Daniele

From njs at pobox.com Mon Dec 2 13:07:02 2013
From: njs at pobox.com (Nathaniel Smith)
Date: Mon, 2 Dec 2013 10:07:02 -0800
Subject: [Numpy-discussion] math.fsum() like ufunc for numpy
In-Reply-To: <529CC6F9.2060106@grinta.net>
References: <529CC6F9.2060106@grinta.net>
Message-ID:

I think that would be great. Technically what you'd want is a "gufunc".
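For reference, the exact-summation algorithm in the recipe Daniele links to (Shewchuk's partial sums, the idea behind math.fsum) is short enough to sketch in pure Python — an illustration of the recipe, not the proposed ufunc implementation:

```python
import math

def msum(values):
    """Exact floating-point summation via Shewchuk's partials,
    as in the linked ActiveState recipe."""
    partials = []              # sorted, non-overlapping partial sums
    for x in values:
        i = 0
        for y in partials:
            if abs(x) < abs(y):
                x, y = y, x
            hi = x + y         # rounded sum
            lo = y - (hi - x)  # exact roundoff lost by that addition
            if lo:
                partials[i] = lo
                i += 1
            x = hi
        partials[i:] = [x]
    return sum(partials, 0.0)

vals = [0.1] * 10
print(sum(vals))   # 0.9999999999999999 -- plain summation drifts
print(msum(vals))  # 1.0 -- matches math.fsum(vals)
```

math.fsum implements the same idea in C; a NumPy (g)ufunc version would be the vectorized counterpart being discussed here.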
-n

On Mon, Dec 2, 2013 at 9:44 AM, Daniele Nicolodi wrote:
> Hello,
>
> there would be interest in adding a floating point accurate summation
> function like Python's math.fsum() in the form of an ufunc to NumPy?
>
> I had a look at the algorithm
> (http://code.activestate.com/recipes/393090-binary-floating-point-summation-accurate-to-full-p/)
> and it looks quite straightforward to implement. I can try to submit a
> patch for it.
>
> Cheers,
> Daniele
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion

--
Nathaniel J. Smith
Postdoctoral researcher - Informatics - University of Edinburgh
http://vorpus.org

From jtaylor.debian at googlemail.com Mon Dec 2 13:11:25 2013
From: jtaylor.debian at googlemail.com (Julian Taylor)
Date: Mon, 02 Dec 2013 19:11:25 +0100
Subject: [Numpy-discussion] math.fsum() like ufunc for numpy
In-Reply-To:
References: <529CC6F9.2060106@grinta.net>
Message-ID: <529CCD4D.70506@googlemail.com>

Related: this PR attempts to improve the accuracy of summation:
https://github.com/numpy/numpy/pull/3685

but math.fsum gives the exact result, so it would be a valuable ufunc even when that PR is merged.

python3.4 will have yet another accurate summation in the statistics module:
http://www.python.org/dev/peps/pep-0450/

On 02.12.2013 19:07, Nathaniel Smith wrote:
> I think that would be great. Technically what you'd want is a "gufunc".
>
> -n
>
> On Mon, Dec 2, 2013 at 9:44 AM, Daniele Nicolodi wrote:
>> Hello,
>>
>> there would be interest in adding a floating point accurate summation
>> function like Python's math.fsum() in the form of an ufunc to NumPy?
>>
>> I had a look at the algorithm
>> (http://code.activestate.com/recipes/393090-binary-floating-point-summation-accurate-to-full-p/)
>> and it looks quite straightforward to implement. I can try to submit a
>> patch for it.
>>
>> Cheers,
>> Daniele

From ndbecker2 at gmail.com Mon Dec 2 14:08:52 2013
From: ndbecker2 at gmail.com (Neal Becker)
Date: Mon, 02 Dec 2013 14:08:52 -0500
Subject: [Numpy-discussion] nasty bug in 1.8.0??
Message-ID:

This is np 1.8.0 on fedora x86_64:

In [5]: x =np.array ((1,))

In [6]: x.shape
Out[6]: (1,)

In [7]: x.strides
Out[7]: (9223372036854775807,)

From davidmenhur at gmail.com Mon Dec 2 14:14:13 2013
From: davidmenhur at gmail.com (Daπid)
Date: Mon, 2 Dec 2013 20:14:13 +0100
Subject: [Numpy-discussion] nasty bug in 1.8.0??
In-Reply-To:
References:
Message-ID:

I get:

In [4]: x.strides
Out[4]: (8,)

Same architecture and OS, Numpy installed via Pip on Python 2.7.5.

On 2 December 2013 20:08, Neal Becker wrote:
> This is np 1.8.0 on fedora x86_64:
>
> In [5]: x =np.array ((1,))
>
> In [6]: x.shape
> Out[6]: (1,)
>
> In [7]: x.strides
> Out[7]: (9223372036854775807,)

From ndbecker2 at gmail.com Mon Dec 2 14:18:02 2013
From: ndbecker2 at gmail.com (Neal Becker)
Date: Mon, 02 Dec 2013 14:18:02 -0500
Subject: [Numpy-discussion] nasty bug in 1.8.0??
References:
Message-ID:

I built using:

CFLAGS='-march=native -O3' NPY_RELAXED_STRIDES_CHECKING=1 python3 setup.py install --user

Daπid wrote:
> I get:
>
> In [4]: x.strides
> Out[4]: (8,)
>
> Same architecture and OS, Numpy installed via Pip on Python 2.7.5.
>
> On 2 December 2013 20:08, Neal Becker wrote:
>
>> This is np 1.8.0 on fedora x86_64:
>>
>> In [5]: x =np.array ((1,))
>>
>> In [6]: x.shape
>> Out[6]: (1,)
>>
>> In [7]: x.strides
>> Out[7]: (9223372036854775807,)

From nouiz at nouiz.org Mon Dec 2 14:19:07 2013
From: nouiz at nouiz.org (Frédéric Bastien)
Date: Mon, 2 Dec 2013 14:19:07 -0500
Subject: [Numpy-discussion] nasty bug in 1.8.0??
In-Reply-To:
References:
Message-ID:

There is a way to compile NumPy so that it uses strange strides for dimensions with a shape of 1. This is done to help developers test that their code doesn't rely on them. There was never a guarantee about the value of the strides in those cases. Most of the time it was the same, but in some cases it was different. Using such strange strides will cause a segfault if you actually use them, so it lets you see whether you rely on them.

In Theano, we did some assertions on strides and checked them for optimized calls to BLAS, so we will need to change some code to support this. But I don't think those strange strides should happen in the wild. Did you install NumPy manually?

Fred

On Mon, Dec 2, 2013 at 2:14 PM, Daπid wrote:
> I get:
>
> In [4]: x.strides
> Out[4]: (8,)
>
> Same architecture and OS, Numpy installed via Pip on Python 2.7.5.
>
> On 2 December 2013 20:08, Neal Becker wrote:
>>
>> This is np 1.8.0 on fedora x86_64:
>>
>> In [5]: x =np.array ((1,))
>>
>> In [6]: x.shape
>> Out[6]: (1,)
>>
>> In [7]: x.strides
>> Out[7]: (9223372036854775807,)

From nouiz at nouiz.org Mon Dec 2 14:19:22 2013
From: nouiz at nouiz.org (Frédéric Bastien)
Date: Mon, 2 Dec 2013 14:19:22 -0500
Subject: [Numpy-discussion] nasty bug in 1.8.0??
In-Reply-To:
References:
Message-ID:

It is the NPY_RELAXED_STRIDES_CHECKING=1 flag that caused this.

Fred

On Mon, Dec 2, 2013 at 2:18 PM, Neal Becker wrote:
> I built using:
>
> CFLAGS='-march=native -O3' NPY_RELAXED_STRIDES_CHECKING=1 python3 setup.py
> install --user
>
>
> Daπid wrote:
>
>> I get:
>>
>> In [4]: x.strides
>> Out[4]: (8,)
>>
>> Same architecture and OS, Numpy installed via Pip on Python 2.7.5.
>>
>>
>> On 2 December 2013 20:08, Neal Becker wrote:
>>
>>> This is np 1.8.0 on fedora x86_64:
>>>
>>> In [5]: x =np.array ((1,))
>>>
>>> In [6]: x.shape
>>> Out[6]: (1,)
>>>
>>> In [7]: x.strides
>>> Out[7]: (9223372036854775807,)

From ndbecker2 at gmail.com Mon Dec 2 14:35:36 2013
From: ndbecker2 at gmail.com (Neal Becker)
Date: Mon, 02 Dec 2013 14:35:36 -0500
Subject: [Numpy-discussion] nasty bug in 1.8.0??
References:
Message-ID:

I don't think that behavior is acceptable.

Frédéric Bastien wrote:

> It is the NPY_RELAXED_STRIDES_CHECKING=1 flag that caused this.
>
> Fred
>
> On Mon, Dec 2, 2013 at 2:18 PM, Neal Becker wrote:
>> I built using:
>>
>> CFLAGS='-march=native -O3' NPY_RELAXED_STRIDES_CHECKING=1 python3 setup.py
>> install --user
>>
>>
>> Daπid wrote:
>>
>>> I get:
>>>
>>> In [4]: x.strides
>>> Out[4]: (8,)
>>>
>>> Same architecture and OS, Numpy installed via Pip on Python 2.7.5.
>>>
>>>
>>> On 2 December 2013 20:08, Neal Becker wrote:
>>>
>>>> This is np 1.8.0 on fedora x86_64:
>>>>
>>>> In [5]: x =np.array ((1,))
>>>>
>>>> In [6]: x.shape
>>>> Out[6]: (1,)
>>>>
>>>> In [7]: x.strides
>>>> Out[7]: (9223372036854775807,)

From nouiz at nouiz.org Mon Dec 2 14:44:55 2013
From: nouiz at nouiz.org (Frédéric Bastien)
Date: Mon, 2 Dec 2013 14:44:55 -0500
Subject: [Numpy-discussion] nasty bug in 1.8.0??
In-Reply-To:
References:
Message-ID:

Just don't compile with NPY_RELAXED_STRIDES_CHECKING to get the old behavior, I think (which was not always the same strides, depending on how the array was created; I don't know if they changed that or not).

Does someone else recall the details of this?

Fred

p.s. I didn't do this or ask for it. But it helps you test that your software doesn't depend on the strides when a shape is 1.

On Mon, Dec 2, 2013 at 2:35 PM, Neal Becker wrote:
> I don't think that behavior is acceptable.
>
> Frédéric Bastien wrote:
>
>> It is the NPY_RELAXED_STRIDES_CHECKING=1 flag that caused this.
>>
>> Fred
>>
>> On Mon, Dec 2, 2013 at 2:18 PM, Neal Becker wrote:
>>> I built using:
>>>
>>> CFLAGS='-march=native -O3' NPY_RELAXED_STRIDES_CHECKING=1 python3 setup.py
>>> install --user
>>>
>>>
>>> Daπid wrote:
>>>
>>>> I get:
>>>>
>>>> In [4]: x.strides
>>>> Out[4]: (8,)
>>>>
>>>> Same architecture and OS, Numpy installed via Pip on Python 2.7.5.
>>>>
>>>>
>>>> On 2 December 2013 20:08, Neal Becker wrote:
>>>>
>>>>> This is np 1.8.0 on fedora x86_64:
>>>>>
>>>>> In [5]: x =np.array ((1,))
>>>>>
>>>>> In [6]: x.shape
>>>>> Out[6]: (1,)
>>>>>
>>>>> In [7]: x.strides
>>>>> Out[7]: (9223372036854775807,)

From jtaylor.debian at googlemail.com Mon Dec 2 14:48:42 2013
From: jtaylor.debian at googlemail.com (Julian Taylor)
Date: Mon, 02 Dec 2013 20:48:42 +0100
Subject: [Numpy-discussion] nasty bug in 1.8.0??
In-Reply-To:
References:
Message-ID: <529CE41A.7040602@googlemail.com>

I opened a ticket for it, though thinking about it, it's probably intentional, to find code that assumes it can use the strides to get the itemsize.
https://github.com/numpy/numpy/issues/4091

On 02.12.2013 20:35, Neal Becker wrote:
> I don't think that behavior is acceptable.
>
> Frédéric Bastien wrote:
>
>> It is the NPY_RELAXED_STRIDES_CHECKING=1 flag that caused this.
>>
>> Fred
>>
>> On Mon, Dec 2, 2013 at 2:18 PM, Neal Becker wrote:
>>> I built using:
>>>
>>> CFLAGS='-march=native -O3' NPY_RELAXED_STRIDES_CHECKING=1 python3 setup.py
>>> install --user
>>>
>>>
>>> Daπid wrote:
>>>
>>>> I get:
>>>>
>>>> In [4]: x.strides
>>>> Out[4]: (8,)
>>>>
>>>> Same architecture and OS, Numpy installed via Pip on Python 2.7.5.
>>>>
>>>>
>>>> On 2 December 2013 20:08, Neal Becker wrote:
>>>>
>>>>> This is np 1.8.0 on fedora x86_64:
>>>>>
>>>>> In [5]: x =np.array ((1,))
>>>>>
>>>>> In [6]: x.shape
>>>>> Out[6]: (1,)
>>>>>
>>>>> In [7]: x.strides
>>>>> Out[7]: (9223372036854775807,)

From ndbecker2 at gmail.com Mon Dec 2 14:51:25 2013
From: ndbecker2 at gmail.com (Neal Becker)
Date: Mon, 02 Dec 2013 14:51:25 -0500
Subject: [Numpy-discussion] nasty bug in 1.8.0??
References:
Message-ID:

The software I'm using, which is

https://github.com/ndarray/ndarray

does depend on this. Am I the only one who thinks that this behavior is not desirable?
Frédéric Bastien wrote:

> Just don't compile with NPY_RELAXED_STRIDES_CHECKING to have the old
> behavior I think (which is an not always the same strides depending of
> how it was created, I don't know if they changed that or not).
>
> Do someone else recall the detail of this?
>
> Fred
>
> p.s. I didn't do this or asked for it. But this help test your
> software to don't depend of the strides when shapes is 1.
>
> On Mon, Dec 2, 2013 at 2:35 PM, Neal Becker wrote:
>> I don't think that behavior is acceptable.
>>
>> Frédéric Bastien wrote:
>>
>>> It is the NPY_RELAXED_STRIDES_CHECKING=1 flag that caused this.
>>>
>>> Fred
>>>
>>> On Mon, Dec 2, 2013 at 2:18 PM, Neal Becker wrote:
>>>> I built using:
>>>>
>>>> CFLAGS='-march=native -O3' NPY_RELAXED_STRIDES_CHECKING=1 python3 setup.py
>>>> install --user
>>>>
>>>>
>>>> Daπid wrote:
>>>>
>>>>> I get:
>>>>>
>>>>> In [4]: x.strides
>>>>> Out[4]: (8,)
>>>>>
>>>>> Same architecture and OS, Numpy installed via Pip on Python 2.7.5.
>>>>>
>>>>>
>>>>> On 2 December 2013 20:08, Neal Becker wrote:
>>>>>
>>>>>> This is np 1.8.0 on fedora x86_64:
>>>>>>
>>>>>> In [5]: x =np.array ((1,))
>>>>>>
>>>>>> In [6]: x.shape
>>>>>> Out[6]: (1,)
>>>>>>
>>>>>> In [7]: x.strides
>>>>>> Out[7]: (9223372036854775807,)

From njs at pobox.com Mon Dec 2 14:56:33 2013
From: njs at pobox.com (Nathaniel Smith)
Date: Mon, 2 Dec 2013 11:56:33 -0800
Subject: [Numpy-discussion] nasty bug in 1.8.0??
In-Reply-To:
References:
Message-ID:

On Mon, Dec 2, 2013 at 11:35 AM, Neal Becker wrote:
> I don't think that behavior is acceptable.

That's... too bad? I'm not sure what your objection actually is.

It's an intentional change (though disabled by default in 1.8), and a necessary step toward rationalizing our definition of contiguity and stride handling in general, which has a number of benefits:

http://docs.scipy.org/doc/numpy-dev/release.html#npy-relaxed-strides-checking
http://docs.scipy.org/doc/numpy/reference/arrays.ndarray.html#internal-memory-layout-of-an-ndarray

Why do you care about the stride of an array with only 1 element, where by definition you never use the stride?

--
Nathaniel J. Smith
Postdoctoral researcher - Informatics - University of Edinburgh
http://vorpus.org

From sebastian at sipsolutions.net Mon Dec 2 16:48:21 2013
From: sebastian at sipsolutions.net (Sebastian Berg)
Date: Mon, 02 Dec 2013 22:48:21 +0100
Subject: [Numpy-discussion] nasty bug in 1.8.0??
In-Reply-To:
References:
Message-ID: <1386020901.22572.7.camel@sebastian-laptop>

On Mon, 2013-12-02 at 14:51 -0500, Neal Becker wrote:
> The software I'm using, which is
>
> https://github.com/ndarray/ndarray
>
> does depend on this. Am I the only one who thinks that this
> behavior is not desirable?
>

Well, this is not meant to be the way for a release version of numpy. The rationale was that since arbitrary strides *are* possible for such arrays, creating them with arbitrary strides when you use NPY_RELAXED_STRIDES_CHECKING=1 helps find bugs.
Of course you are right to suppose that a *new* array should generally have nice and clean strides, but if your code does work with arrays you did not yourself create, it cannot make these assumptions. And I think most code does this. And there are some advantages to ignoring such strides for the contiguous flags. If your arrays are contiguous, you don't really need the strides (use the itemsize instead). How is ndarray broken by this?

- Sebastian

> Frédéric Bastien wrote:
>
> > Just don't compile with NPY_RELAXED_STRIDES_CHECKING to have the old
> > behavior I think (which is an not always the same strides depending of
> > how it was created, I don't know if they changed that or not).
> >
> > Do someone else recall the detail of this?
> >
> > Fred
> >
> > p.s. I didn't do this or asked for it. But this help test your
> > software to don't depend of the strides when shapes is 1.
> >
> > On Mon, Dec 2, 2013 at 2:35 PM, Neal Becker wrote:
> >> I don't think that behavior is acceptable.
> >>
> >> Frédéric Bastien wrote:
> >>
> >>> It is the NPY_RELAXED_STRIDES_CHECKING=1 flag that caused this.
> >>>
> >>> Fred
> >>>
> >>> On Mon, Dec 2, 2013 at 2:18 PM, Neal Becker wrote:
> >>>> I built using:
> >>>>
> >>>> CFLAGS='-march=native -O3' NPY_RELAXED_STRIDES_CHECKING=1 python3 setup.py
> >>>> install --user
> >>>>
> >>>>
> >>>> Daπid wrote:
> >>>>
> >>>>> I get:
> >>>>>
> >>>>> In [4]: x.strides
> >>>>> Out[4]: (8,)
> >>>>>
> >>>>> Same architecture and OS, Numpy installed via Pip on Python 2.7.5.
> >>>>>
> >>>>> On 2 December 2013 20:08, Neal Becker wrote:
> >>>>>
> >>>>>> This is np 1.8.0 on fedora x86_64:
> >>>>>>
> >>>>>> In [5]: x =np.array ((1,))
> >>>>>>
> >>>>>> In [6]: x.shape
> >>>>>> Out[6]: (1,)
> >>>>>>
> >>>>>> In [7]: x.strides
> >>>>>> Out[7]: (9223372036854775807,)

From talljimbo at gmail.com Mon Dec 2 18:15:50 2013
From: talljimbo at gmail.com (Jim Bosch)
Date: Mon, 2 Dec 2013 18:15:50 -0500
Subject: [Numpy-discussion] nasty bug in 1.8.0??
In-Reply-To: <1386020901.22572.7.camel@sebastian-laptop>
References: <1386020901.22572.7.camel@sebastian-laptop>
Message-ID:

> If your arrays are contiguous, you don't really need the strides (use the
> itemsize instead). How is ndarray broken by this?
ndarray is broken by this change because it expects the stride to be a multiple of the itemsize (I think; I'm just looking at code here, as I haven't had time to build NumPy 1.8 yet to test this); it has a slightly more restricted model for what data can look like than NumPy has, and it's easier to always just look at the stride for all sizes rather than special-case for size=1. I think that means the bug is ndarray's (indeed, it's probably the kind of bug this new behavior was intended to catch, as I should be handling the case of non-itemsize-multiple strides more gracefully even when size > 1), and I'm working on a fix for it there now.

Thanks, Neal, for bringing this to my attention, and to all the NumPy devs for help in explaining what's going on.

Jim

From njs at pobox.com Mon Dec 2 18:20:48 2013
From: njs at pobox.com (Nathaniel Smith)
Date: Mon, 2 Dec 2013 15:20:48 -0800
Subject: [Numpy-discussion] nasty bug in 1.8.0??
In-Reply-To:
References: <1386020901.22572.7.camel@sebastian-laptop>
Message-ID:

On Mon, Dec 2, 2013 at 3:15 PM, Jim Bosch wrote:
>> If your arrays are contiguous, you don't really need the strides (use the
>> itemsize instead). How is ndarray broken by this?
>
> ndarray is broken by this change because it expects the stride to be a
> multiple of the itemsize (I think; I'm just looking at code here, as I
> haven't had time to build NumPy 1.8 yet to test this); it has a slightly
> more restricted model for what data can look like than NumPy has, and it's
> easier to always just look at the stride for all sizes rather than
> special-case for size=1.

Note that arrays in which any dimension is 0 (i.e., 0 total elements) can also have arbitrary strides with no consequence.

--
Nathaniel J.
Smith Postdoctoral researcher - Informatics - University of Edinburgh http://vorpus.org From sebastian at sipsolutions.net Mon Dec 2 18:37:42 2013 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Tue, 03 Dec 2013 00:37:42 +0100 Subject: [Numpy-discussion] nasty bug in 1.8.0?? In-Reply-To: References: <1386020901.22572.7.camel@sebastian-laptop> Message-ID: <1386027462.23220.9.camel@sebastian-laptop> On Mon, 2013-12-02 at 18:15 -0500, Jim Bosch wrote: > > If your arrays are contiguous, you don't really need the strides > (use the itemsize instead). How is ndarray broken by this? > > ndarray is broken by this change because it expects the stride to be a > multiple of the itemsize (I think; I'm just looking at code here, as I > haven't had time to build NumPy 1.8 yet to test this); it has a > slightly more restricted model for what data can look like than NumPy > has, and it's easier to always just look at the stride for all sizes > rather than special-case for size=1. I think that means the bug is > ndarray's (indeed, it's probably the kind of bug this new behavior was > intended to catch, as I should be handling the case of > non-itemsize-multiple strides more gracefully even when size > 1), and > I'm working on a fix for it there now. > Most bugs I saw were simply assuming: arr.strides[-1] == arr.itemsize when the array is C-contiguous, and could be fixed by just using arr.itemsize... Unless you need to calculate contiguous flags that are compatible with NumPy under NPY_RELAXED_STRIDES_CHECKING (i.e. Cython had this problem, since its memoryview would reject NumPy's contiguous arrays as not contiguous), you should not need to special-case anything. - Sebastian > > Thanks, Neal, for bringing this to my attention, and to all the NumPy > devs for help in explaining what's going on. 
> > > > Jim > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From ndbecker2 at gmail.com Mon Dec 2 19:05:41 2013 From: ndbecker2 at gmail.com (Neal Becker) Date: Mon, 02 Dec 2013 19:05:41 -0500 Subject: [Numpy-discussion] nasty bug in 1.8.0?? References: <1386020901.22572.7.camel@sebastian-laptop> Message-ID: Jim Bosch wrote: >> If your arrays are contiguous, you don't really need the strides (use the > itemsize instead). How is ndarray broken by this? > > ndarray is broken by this change because it expects the stride to be a > multiple of the itemsize (I think; I'm just looking at code here, as I > haven't had time to build NumPy 1.8 yet to test this); it has a slightly > more restricted model for what data can look like than NumPy has, and it's > easier to always just look at the stride for all sizes rather than > special-case for size=1. I think that means the bug is ndarray's (indeed, > it's probably the kind of bug this new behavior was intended to catch, as I > should be handling the case of non-itemsize-multiple strides more > gracefully even when size > 1), and I'm working on a fix for it there now. > > Thanks, Neal, for bringing this to my attention, and to all the NumPy devs > for help in explaining what's going on. > > Jim The problem I encountered is that canonical generic C++ code looks like: template <typename in_t> void F (in_t in) { int size = boost::size (in); ... This fails when "in" is nd::Array. In that case, the iterator is strided_iterator. And here, I find (via gdb), that stride==0. The failure occurs here: StridedIterator.h: template <typename T> int distance_to(StridedIterator<T> const & other) const { return std::distance(_data, other._data) / _stride; } How it happens that stride==0, and how to fix it, I don't know. 
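The workaround Sebastian describes — trust a.itemsize rather than a.strides when the array is C-contiguous — can be sketched in a few lines of NumPy. This is only an illustration (the helper name last_stride is made up, and it is not ndarray's actual fix):

```python
import numpy as np

def last_stride(a):
    # Byte stride of the innermost axis, robust to NumPy 1.8's relaxed
    # strides checking: for a C-contiguous array the innermost step is by
    # definition one element, so the itemsize is used directly and
    # a.strides (which may report an arbitrary value such as the
    # (9223372036854775807,) from the original report when a dimension
    # has length 1) is never consulted.
    if a.flags['C_CONTIGUOUS']:
        return a.itemsize
    return a.strides[-1]

a = np.empty((1,))      # the shape from the original report
b = np.empty((4, 3))    # an ordinary contiguous 2-D array

print(last_stride(a) == a.itemsize)     # True, whatever a.strides says
print(last_stride(b) == b.strides[-1])  # True
```

For Fortran-contiguous data the same reasoning applies to the first axis instead of the last.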
From francesc at continuum.io Tue Dec 3 05:40:55 2013 From: francesc at continuum.io (Francesc Alted) Date: Tue, 03 Dec 2013 11:40:55 +0100 Subject: [Numpy-discussion] -ffast-math In-Reply-To: References: <5299CCE9.90602@googlemail.com> <529BAA7B.3070206@googlemail.com> <529BB548.6040806@googlemail.com> Message-ID: <529DB537.4090103@continuum.io> On 12/2/13, 12:14 AM, Dan Goodman wrote: > Dan Goodman <dg.gmane at thesamovar.net> writes: > ... >> I got around 5x slower. Using numexpr 'dumbly' (i.e. just putting the >> expression in directly) was slower than the function above, but doing a >> hybrid between the two approaches worked well: >> >> def timefunc_numexpr_smart(): >> _sin_term = sin(2.0*freq*pi*t) >> _exp_term = exp(-dt/tau) >> _a_term = (_sin_term-_sin_term*_exp_term) >> _const_term = -b*_exp_term + b >> v[:] = numexpr.evaluate('a*_a_term+v*_exp_term+_const_term') >> #numexpr.evaluate('a*_a_term+v*_exp_term+_const_term', out=v) >> >> This was about 3.5x slower than weave. If I used the commented out final >> line then it was only 1.5x slower than weave, but it also gives wrong >> results. I reported this as a bug in numexpr a long time ago but I guess it >> hasn't been fixed yet (or maybe I didn't upgrade my version recently). > I just upgraded numexpr to 2.2 where they did fix this bug, and now the > 'smart' numexpr version runs exactly as fast as weave (so I guess there were > some performance enhancements in numexpr as well). Err no, there have not been performance improvements in numexpr since 2.0 (that I am aware of). Maybe you are running on a multi-core machine now and you are seeing better speedup because of this? Also, your expressions are made of transcendental functions, so linking numexpr with MKL could accelerate computations a good deal too. 
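The "smart" rewrite quoted above can be checked in plain NumPy, without numexpr: the sin and exp terms depend only on scalars, so they can be hoisted and computed once. The parameter values below are made up for illustration; only the algebraic equivalence matters:

```python
import numpy as np

# Made-up scalar parameters and small arrays, just to verify the rewrite.
freq, t, dt, tau, b = 10.0, 0.5, 1e-4, 1e-2, 0.1
a = np.linspace(0.0, 1.0, 5)
v = np.linspace(-1.0, 1.0, 5)

# Hoisted scalar subexpressions, as in timefunc_numexpr_smart:
_sin_term = np.sin(2.0 * freq * np.pi * t)
_exp_term = np.exp(-dt / tau)
_a_term = _sin_term - _sin_term * _exp_term
_const_term = -b * _exp_term + b
smart = a * _a_term + v * _exp_term + _const_term

# The original expression, evaluated elementwise:
full = (a * np.sin(2.0 * freq * np.pi * t) + b + v * np.exp(-dt / tau)
        + (-a * np.sin(2.0 * freq * np.pi * t) - b) * np.exp(-dt / tau))

print(np.allclose(smart, full))  # True
```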
-- Francesc Alted From cmkleffner at gmail.com Tue Dec 3 07:02:09 2013 From: cmkleffner at gmail.com (Carl Kleffner) Date: Tue, 3 Dec 2013 13:02:09 +0100 Subject: [Numpy-discussion] how to use install_clib with setup.py? Message-ID: I have a question concerning install_clib on windows. What I want to do is to copy a dll (libopenblas.dll) to the numpy/core folder with setup.py install. The path to the dll is given in site.cfg. The dll itself is an external dependency. Somewhere I found a reference to install_clib to copy external DLLs, but how to apply this within setup.py? Regards Carl -------------- next part -------------- An HTML attachment was scrubbed... URL: From nouiz at nouiz.org Tue Dec 3 14:50:37 2013 From: nouiz at nouiz.org (=?ISO-8859-1?Q?Fr=E9d=E9ric_Bastien?=) Date: Tue, 3 Dec 2013 14:50:37 -0500 Subject: [Numpy-discussion] Theano 0.6 released Message-ID: What's New ---------- We recommend that everybody update to this version. Highlights (since 0.6rc5): * Last release with support for Python 2.4 and 2.5. * We will try to release more frequently. * Fix crash/installation problems. * Use less memory for conv3d2d. 0.6rc4 skipped for a technical reason. Highlights (since 0.6rc3): * Python 3.3 compatibility with buildbot test for it. * Full advanced indexing support. * Better Windows 64 bit support. * New profiler. * Better error messages that help debugging. * Better support for newer NumPy versions (remove useless warning/crash). * Faster optimization/compilation for big graph. * Move in Theano the Conv3d2d implementation. * Better SymPy/Theano bridge: Make a Theano op from a SymPy expression and use the SymPy C code generator. * Bug fixes. Change from 0.6rc5: * Fix crash when specifying march in cxxflags Theano flag. (Frederic B., reported by FiReTiTi) * code cleanup (Jorg Bornschein) * Fix Canopy installation on windows when it was installed for all users: Raingo * Fix Theano tests due to a scipy change. (Frederic B.) 
* Work around bug introduced in scipy dev 0.14. (Frederic B.) * Fix Theano tests following bugfix in SciPy. (Frederic B., reported by Ziyuan Lin) * Add Theano flag cublas.lib (Misha Denil) * Make conv3d2d work more inplace (so less memory usage) (Frederic B., reported by Jean-Philippe Ouellet) See https://pypi.python.org/pypi/Theano for more details. Download and Install -------------------- You can download Theano from http://pypi.python.org/pypi/Theano Installation instructions are available at http://deeplearning.net/software/theano/install.html Description ----------- Theano is a Python library that allows you to define, optimize, and efficiently evaluate mathematical expressions involving multi-dimensional arrays. It is built on top of NumPy. Theano features: * tight integration with NumPy: a similar interface to NumPy's. numpy.ndarrays are also used internally in Theano-compiled functions. * transparent use of a GPU: perform data-intensive computations up to 140x faster than on a CPU (support for float32 only). * efficient symbolic differentiation: Theano can compute derivatives for functions of one or many inputs. * speed and stability optimizations: avoid nasty bugs when computing expressions such as log(1 + exp(x)) for large values of x. * dynamic C code generation: evaluate expressions faster. * extensive unit-testing and self-verification: includes tools for detecting and diagnosing bugs and/or potential problems. Theano has been powering large-scale computationally intensive scientific research since 2007, but it is also approachable enough to be used in the classroom (IFT6266 at the University of Montreal). 
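The log(1 + exp(x)) stability point in the feature list can be seen in plain NumPy (a minimal sketch of the kind of problem such a rewrite avoids):

```python
import numpy as np

x = np.array([0.0, 50.0, 1000.0])

# Naive form: exp(1000.) overflows to inf, so the log returns inf.
with np.errstate(over='ignore'):
    naive = np.log(1.0 + np.exp(x))

# Stable form: logaddexp(0, x) computes log(exp(0) + exp(x)) without
# ever forming exp(x), so it stays finite (approximately x for large x).
stable = np.logaddexp(0.0, x)

print(naive[-1])   # inf
print(stable[-1])  # ~1000.0
```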
Resources --------- About Theano: http://deeplearning.net/software/theano/ Theano-related projects: http://github.com/Theano/Theano/wiki/Related-projects About NumPy: http://numpy.scipy.org/ About SciPy: http://www.scipy.org/ Machine Learning Tutorial with Theano on Deep Architectures: http://deeplearning.net/tutorial/ Acknowledgments --------------- I would like to thank all contributors of Theano. For this particular release (since 0.5), many people have helped, notably: Frederic Bastien Pascal Lamblin Ian Goodfellow Olivier Delalleau Razvan Pascanu abalkin Arnaud Bergeron Nicolas Bouchard + Jeremiah Lowin + Matthew Rocklin Eric Larsen + James Bergstra David Warde-Farley John Salvatier + Vivek Kulkarni + Yann N. Dauphin Ludwig Schmidt-Hackenberg + Gabe Schwartz + Rami Al-Rfou' + Guillaume Desjardins Caglar + Sigurd Spieckermann + Steven Pigeon + Bogdan Budescu + Jey Kottalam + Mehdi Mirza + Alexander Belopolsky + Ethan Buchman + Jason Yosinski Nicolas Pinto + Sina Honari + Ben McCann + Graham Taylor Hani Almousli Ilya Dyachenko + Jan Schlüter + Jorg Bornschein + Micky Latowicki + Yaroslav Halchenko + Eric Hunsberger + Amir Elaguizy + Hannes Schulz + Huy Nguyen + Ilan Schnell + Li Yao Misha Denil + Robert Kern + Sebastian Berg + Vincent Dumoulin + Wei Li + XterNalz + A total of 51 people contributed to this release. People with a "+" by their names contributed a patch for the first time. Also, thank you to all NumPy and Scipy developers as Theano builds on their strengths. All questions/comments are always welcome on the Theano mailing-lists ( http://deeplearning.net/software/theano/#community ) From jslavin at cfa.harvard.edu Thu Dec 5 11:04:19 2013 From: jslavin at cfa.harvard.edu (Slavin, Jonathan) Date: Thu, 5 Dec 2013 11:04:19 -0500 Subject: [Numpy-discussion] no more search capability? Message-ID: Hi all, Although I like the look of the newly designed numpy/scipy web pages, I have to say that I really miss the search capability. 
Is there any motion toward restoring that? Jon -- ________________________________________________________ Jonathan D. Slavin Harvard-Smithsonian CfA jslavin at cfa.harvard.edu 60 Garden Street, MS 83 phone: (617) 496-7981 Cambridge, MA 02138-1516 fax: (617) 496-7577 USA ________________________________________________________ -------------- next part -------------- An HTML attachment was scrubbed... URL: From jslavin at cfa.harvard.edu Thu Dec 5 13:30:47 2013 From: jslavin at cfa.harvard.edu (Slavin, Jonathan) Date: Thu, 5 Dec 2013 13:30:47 -0500 Subject: [Numpy-discussion] no more search capability? Message-ID: Answering part of my own question, I see that there is still a search capability on one of the numpy web pages, but it's not where it used to be and, in my opinion, is not easy to find. There used to be a search box on each web page of the numpy docs. Jon On Thu, Dec 5, 2013 at 1:00 PM, wrote: > ---------- Forwarded message ---------- > From: "Slavin, Jonathan" > To: numpy-discussion at scipy.org > Cc: > Date: Thu, 5 Dec 2013 11:04:19 -0500 > Subject: [Numpy-discussion] no more search capability? > Hi all, > > Although I like the look of the newly designed numpy/scipy web pages, I > have to say that I really miss the search capability. Is there any motion > toward restoring that? > > Jon > -- > ________________________________________________________ > Jonathan D. 
Slavin Harvard-Smithsonian CfA > jslavin at cfa.harvard.edu 60 Garden Street, MS 83 > phone: (617) 496-7981 Cambridge, MA 02138-1516 > fax: (617) 496-7577 USA > ________________________________________________________ > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -- ________________________________________________________ Jonathan D. Slavin Harvard-Smithsonian CfA jslavin at cfa.harvard.edu 60 Garden Street, MS 83 phone: (617) 496-7981 Cambridge, MA 02138-1516 fax: (617) 496-7577 USA ________________________________________________________ -------------- next part -------------- An HTML attachment was scrubbed... URL: From sebastian at sipsolutions.net Thu Dec 5 17:37:19 2013 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Thu, 05 Dec 2013 23:37:19 +0100 Subject: [Numpy-discussion] Deprecate boolean math operators? Message-ID: <1386283039.23728.6.camel@sebastian-laptop> Hey, there was a discussion that for numpy booleans math operators +,-,* (and the unary -), while defined, are not very helpful. I have set up a quick PR with start (needs some fixes inside numpy still): https://github.com/numpy/numpy/pull/4105 The idea is to deprecate these, since the binary operators |,^,& (and the unary ~, even if it is weird) behave identically. This would not affect sums of boolean arrays. For the moment I saw one "annoying" change in numpy, and that is `abs(x - y)` being used for allclose and working nicely currently. - Sebastian From ndarray at mac.com Thu Dec 5 18:49:31 2013 From: ndarray at mac.com (Alexander Belopolsky) Date: Thu, 5 Dec 2013 18:49:31 -0500 Subject: [Numpy-discussion] Deprecate boolean math operators? 
In-Reply-To: <1386283039.23728.6.camel@sebastian-laptop> References: <1386283039.23728.6.camel@sebastian-laptop> Message-ID: On Thu, Dec 5, 2013 at 5:37 PM, Sebastian Berg wrote: > For the moment I saw one "annoying" change in > numpy, and that is `abs(x - y)` being used for allclose and working > nicely currently. > It would probably be an improvement if allclose returned all(x == y) unless one of the arguments is inexact. At the moment allclose() fails for char arrays: >>> allclose('abc', 'abc') Traceback (most recent call last): File "<stdin>", line 1, in <module> File "numpy/core/numeric.py", line 2114, in allclose xinf = isinf(x) TypeError: Not implemented for this type -------------- next part -------------- An HTML attachment was scrubbed... URL: From fmmirzaei at gmail.com Thu Dec 5 22:14:01 2013 From: fmmirzaei at gmail.com (Faraz Mirzaei) Date: Thu, 5 Dec 2013 19:14:01 -0800 Subject: [Numpy-discussion] surprising behavior of np.asarray on masked arrays Message-ID: Hi, If I pass a masked array through np.asarray, I get the original unmasked array. Example: test = np.array([[1, 0], [-1, 3]]) testMasked = ma.masked_less_equal(test, 0) print testMasked [[1 --] [-- 3]] print testMasked.fill_value 999999 print np.asarray(testMasked) [[ 1 0] [-1 3]] Is this behavior intentional? How does np.asarray access the original masked values? Shouldn't the masked values be at least filled with fill_value? Thanks, Faraz -------------- next part -------------- An HTML attachment was scrubbed... URL: From josef.pktd at gmail.com Thu Dec 5 22:33:02 2013 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 5 Dec 2013 22:33:02 -0500 Subject: [Numpy-discussion] Deprecate boolean math operators? 
In-Reply-To: <1386283039.23728.6.camel@sebastian-laptop> References: <1386283039.23728.6.camel@sebastian-laptop> Message-ID: On Thu, Dec 5, 2013 at 5:37 PM, Sebastian Berg wrote: > Hey, > > there was a discussion that for numpy booleans math operators +,-,* (and > the unary -), while defined, are not very helpful. I have set up a quick > PR with start (needs some fixes inside numpy still): > > https://github.com/numpy/numpy/pull/4105 > > The idea is to deprecate these, since the binary operators |,^,| (and > the unary ~ even if it is weird) behave identical. This would not affect > sums of boolean arrays. For the moment I saw one "annoying" change in > numpy, and that is `abs(x - y)` being used for allclose and working > nicely currently. I like mask = mask1 * mask2 That's what I learned working my way through scipy.stats.distributions a long time ago. But the main thing is that we use boolean often as 0,1 integer array in the actual calculations, and I only sometimes add the astype(int) x[:, None] * (y[:, None] == np.unique(y)) I always thought booleans *are* just 0, 1 integers, until last time there was the discussion we saw the weird + or - behavior. We also use rescaling to (-1, 1) in statsmodels y = mask * 2 - 1 (but maybe we convert to integer first) My guess is that I only use multiplication heavily, where the boolean is a dummy variable with 0 if male and 1 if female for example. Nothing serious but nice not to have to worry about casting with astype(int) first. x[:, None] * (y[:, None] == np.unique(y)).astype(int) (Is the bracket at the right spot ?) Josef > > - Sebastian > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From josef.pktd at gmail.com Thu Dec 5 22:35:34 2013 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 5 Dec 2013 22:35:34 -0500 Subject: [Numpy-discussion] Deprecate boolean math operators? 
In-Reply-To: References: <1386283039.23728.6.camel@sebastian-laptop> Message-ID: On Thu, Dec 5, 2013 at 10:33 PM, wrote: > On Thu, Dec 5, 2013 at 5:37 PM, Sebastian Berg > wrote: >> Hey, >> >> there was a discussion that for numpy booleans math operators +,-,* (and >> the unary -), while defined, are not very helpful. I have set up a quick >> PR with start (needs some fixes inside numpy still): >> >> https://github.com/numpy/numpy/pull/4105 >> >> The idea is to deprecate these, since the binary operators |,^,| (and >> the unary ~ even if it is weird) behave identical. This would not affect >> sums of boolean arrays. For the moment I saw one "annoying" change in >> numpy, and that is `abs(x - y)` being used for allclose and working >> nicely currently. > > I like mask = mask1 * mask2 > > That's what I learned working my way through scipy.stats.distributions > a long time ago. > > But the main thing is that we use boolean often as 0,1 integer array > in the actual calculations, and I only sometimes add the astype(int) > > x[:, None] * (y[:, None] == np.unique(y)) > > I always thought booleans *are* just 0, 1 integers, until last time > there was the discussion we saw the weird + or - behavior. > > We also use rescaling to (-1, 1) in statsmodels y = mask * 2 - 1 > (but maybe we convert to integer first) > My guess is that I only use multiplication heavily, where the boolean > is a dummy variable with 0 if male and 1 if female for example. > > Nothing serious but nice not to have to worry about casting with > astype(int) first. > > x[:, None] * (y[:, None] == np.unique(y)).astype(int) (Is the > bracket at the right spot ?) what about np.dot, np.dot(mask, x) which is the same as (mask * x).sum(0) ? 
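The asymmetry behind this np.dot question is easy to reproduce: over booleans, * acts as AND and dot() accumulates in bool (an OR-of-ANDs logical matrix product), while sum() upcasts to an integer count. A short sketch of the behavior discussed in this subthread:

```python
import numpy as np

mask = np.array([True, True, False])
x = np.array([True, False, True])

print(np.dot(mask, x))    # True: OR of the elementwise ANDs, stays bool
print((mask * x).sum())   # 1: sum() upcasts the bool products to int

# So on booleans np.dot(mask, x) is NOT the same as (mask * x).sum() --
# dot is a logical matrix product over the (OR, AND) semiring:
A = np.array([[True, False],
              [True, True]])
print(np.dot(A, A).dtype)  # bool
```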
Josef > > Josef > > >> >> - Sebastian >> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion From ndarray at mac.com Thu Dec 5 22:56:48 2013 From: ndarray at mac.com (Alexander Belopolsky) Date: Thu, 5 Dec 2013 22:56:48 -0500 Subject: [Numpy-discussion] Deprecate boolean math operators? In-Reply-To: <1386283039.23728.6.camel@sebastian-laptop> References: <1386283039.23728.6.camel@sebastian-laptop> Message-ID: On Thu, Dec 5, 2013 at 5:37 PM, Sebastian Berg wrote: > there was a discussion that for numpy booleans math operators +,-,* (and > the unary -), while defined, are not very helpful. It has been suggested at the Github that there is an area where it is useful to have linear algebra operations like matrix multiplication to be defined over a semiring: http://en.wikipedia.org/wiki/Logical_matrix This still does not justify having unary or binary -, so I suggest that we first discuss deprecation of those. -------------- next part -------------- An HTML attachment was scrubbed... URL: From ndarray at mac.com Thu Dec 5 23:00:50 2013 From: ndarray at mac.com (Alexander Belopolsky) Date: Thu, 5 Dec 2013 23:00:50 -0500 Subject: [Numpy-discussion] Deprecate boolean math operators? In-Reply-To: References: <1386283039.23728.6.camel@sebastian-laptop> Message-ID: On Thu, Dec 5, 2013 at 10:35 PM, wrote: > what about np.dot, np.dot(mask, x) which is the same as (mask * > x).sum(0) ? I am not sure which way your argument goes, but I don't think you would find the following natural: >>> x = array([True, True]) >>> dot(x,x) True >>> (x*x).sum() 2 >>> (x*x).sum(0) 2 >>> (x*x).sum(False) 2 -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From josef.pktd at gmail.com Thu Dec 5 23:02:34 2013 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 5 Dec 2013 23:02:34 -0500 Subject: [Numpy-discussion] Deprecate boolean math operators? In-Reply-To: References: <1386283039.23728.6.camel@sebastian-laptop> Message-ID: On Thu, Dec 5, 2013 at 10:56 PM, Alexander Belopolsky wrote: > On Thu, Dec 5, 2013 at 5:37 PM, Sebastian Berg > wrote: >> there was a discussion that for numpy booleans math operators +,-,* (and >> the unary -), while defined, are not very helpful. > > It has been suggested at the Github that there is an area where it is useful > to have linear algebra operations like matrix multiplication to be defined > over a semiring: > > http://en.wikipedia.org/wiki/Logical_matrix > > This still does not justify having unary or binary -, so I suggest that we > first discuss deprecation of those. Does it make sense to only remove - and maybe / ? would python sum still work? (I almost never use it.) >>> sum(mask) 2 >>> sum(mask.tolist()) 2 is accumulate the same as sum and would keep working? >>> np.add.accumulate(mask) array([0, 0, 0, 1, 2]) In operation with other dtypes, do they still dominate so these work? >>> x / mask array([0, 0, 0, 3, 4]) >>> x * 1. / mask array([ nan, inf, inf, 3., 4.]) >>> x**mask array([1, 1, 1, 3, 4]) >>> mask - 5 array([-5, -5, -5, -4, -4]) Josef > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From alan.isaac at gmail.com Thu Dec 5 23:05:32 2013 From: alan.isaac at gmail.com (Alan G Isaac) Date: Thu, 05 Dec 2013 23:05:32 -0500 Subject: [Numpy-discussion] Deprecate boolean math operators? In-Reply-To: References: <1386283039.23728.6.camel@sebastian-laptop> Message-ID: <52A14D0C.1060209@gmail.com> For + and * (and thus `dot`), this will "fix" something that is not broken. 
It is in fact in conformance with a large literature on boolean arrays and boolean matrices. That not everyone pays attention to this literature does not constitute a reason to break the extant, correct behavior. I'm sure I cannot be the only one who has for years taught students about Boolean matrices using NumPy, because of this correct behavior of this dtype. (By correct, I mean in conformance with the literature.) Alan Isaac From ndarray at mac.com Thu Dec 5 23:14:52 2013 From: ndarray at mac.com (Alexander Belopolsky) Date: Thu, 5 Dec 2013 23:14:52 -0500 Subject: [Numpy-discussion] Deprecate boolean math operators? In-Reply-To: <52A14D0C.1060209@gmail.com> References: <1386283039.23728.6.camel@sebastian-laptop> <52A14D0C.1060209@gmail.com> Message-ID: On Thu, Dec 5, 2013 at 11:05 PM, Alan G Isaac wrote: > For + and * (and thus `dot`), this will "fix" something that is not broken. + and * are not broken - just redundant given | and &. What is really broken is -, both unary and binary: >>> int(np.bool_(0) - np.bool_(1)) 1 >>> int(-np.bool_(0)) 1 > I'm sure I cannot be the only one who has for years taught students > about Boolean matrices using NumPy (I would not be so sure:-) In that experience, did you find minus to be as useful? -------------- next part -------------- An HTML attachment was scrubbed... URL: From stefan at sun.ac.za Thu Dec 5 23:22:18 2013 From: stefan at sun.ac.za (=?iso-8859-1?Q?St=E9fan?= van der Walt) Date: Fri, 6 Dec 2013 06:22:18 +0200 Subject: [Numpy-discussion] surprising behavior of np.asarray on masked arrays In-Reply-To: References: Message-ID: <20131206042218.GA11321@shinobi> Hi Faraz On Thu, 05 Dec 2013 19:14:01 -0800, Faraz Mirzaei wrote: > If I pass a masked array through np.asarray, I get original unmasked array. `asarray` disregards any information attached to the underlying ndarray by the subclass. To preserve the subclass, you'd need to use `asanyarray`. 
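The difference can be made concrete with the example from the original post (a small sketch; 999999 is simply ma's default integer fill value):

```python
import numpy as np
import numpy.ma as ma

test = np.array([[1, 0], [-1, 3]])
masked = ma.masked_less_equal(test, 0)

# asarray strips the subclass: the raw underlying data come back, with
# mask and fill_value ignored.
print(type(np.asarray(masked)).__name__)   # ndarray

# asanyarray passes the MaskedArray subclass through untouched.
print(isinstance(np.asanyarray(masked), ma.MaskedArray))  # True

# filled() is the explicit way to materialise the fill value.
print(ma.filled(masked, 999999))
```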
The only functions that are aware of masked arrays live inside of `np.ma`, so you can also use `np.ma.asarray`. Which behavior in particular would you like to see, since I presume you can already get hold of the filled array, should you want to? Stéfan From josef.pktd at gmail.com Thu Dec 5 23:28:47 2013 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 5 Dec 2013 23:28:47 -0500 Subject: [Numpy-discussion] Deprecate boolean math operators? In-Reply-To: References: <1386283039.23728.6.camel@sebastian-laptop> Message-ID: On Thu, Dec 5, 2013 at 11:00 PM, Alexander Belopolsky wrote: > > On Thu, Dec 5, 2013 at 10:35 PM, wrote: >> >> what about np.dot, np.dot(mask, x) which is the same as (mask * >> x).sum(0) ? > > > I am not sure which way your argument goes, but I don't think you would find > the following natural: > >>>> x = array([True, True]) >>>> dot(x,x) > True this is weird but I would never do that. maybe I would, but then I would add 1 non-boolean >>>> (x*x).sum() > 2 >>>> (x*x).sum(0) > 2 That sounds right to me >>> (mask**2 == mask).all() True >>>> (x*x).sum(False) > 2 What is axis=False? The way my argument goes: I'm a heavy user of using * pretending the bool behaves like an int, and of sum and accumulate. It would be a pain to lose them. From where I come from (*) a bool is not a boolean, it's just 0, 1, given that numpy casting rules apply and it's sometimes cast back to (0, 1) Does this work as an explanation for the pattern of + and - also. (*) places where the type system is more restricted. What about max? >>> np.maximum(mask, mask) array([False, False, False, True, True], dtype=bool) >>> np.maximum(mask, ~mask) array([ True, True, True, True, True], dtype=bool) >>> mask + mask array([False, False, False, True, True], dtype=bool) >>> mask + ~mask array([ True, True, True, True, True], dtype=bool) first mask is if the wife has a car, second mask is if the husband has a car. The max is if there is a car in the family. 
What's this as logical? Josef > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From efiring at hawaii.edu Fri Dec 6 01:53:48 2013 From: efiring at hawaii.edu (Eric Firing) Date: Thu, 05 Dec 2013 20:53:48 -1000 Subject: [Numpy-discussion] surprising behavior of np.asarray on masked arrays In-Reply-To: References: Message-ID: <52A1747C.1040200@hawaii.edu> On 2013/12/05 5:14 PM, Faraz Mirzaei wrote: > Hi, > > If I pass a masked array through np.asarray, I get original unmasked array. > > Example: > > test = np.array([[1, 0], [-1, 3]]) > > testMasked = ma.masked_less_equal(test, 0) > > > print testMasked > > [[1 --] > > [-- 3]] > > > print testMasked.fill_value > > 999999 > > > print np.asarray(testMasked) > > [[ 1 0] > > [-1 3]] > > > Is this behavior intentional? How does the np.asarray access the > original masked values? Shouldn't the masked values be at least filled > with fill_value? It might be nice, but it's not the way it is. If you want to preserve masked arrays, use np.asanyarray() instead of np.asarray(). If you want to end up with filled ndarrays, use np.ma.filled(). Eric > > > Thanks, > > > Faraz > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From njs at pobox.com Fri Dec 6 03:24:30 2013 From: njs at pobox.com (Nathaniel Smith) Date: Fri, 6 Dec 2013 00:24:30 -0800 Subject: [Numpy-discussion] Deprecate boolean math operators? In-Reply-To: References: <1386283039.23728.6.camel@sebastian-laptop> Message-ID: On Thu, Dec 5, 2013 at 7:33 PM, wrote: > On Thu, Dec 5, 2013 at 5:37 PM, Sebastian Berg > wrote: >> Hey, >> >> there was a discussion that for numpy booleans math operators +,-,* (and >> the unary -), while defined, are not very helpful. 
I have set up a quick >> PR with start (needs some fixes inside numpy still): >> >> https://github.com/numpy/numpy/pull/4105 >> >> The idea is to deprecate these, since the binary operators |,^,| (and >> the unary ~ even if it is weird) behave identical. This would not affect >> sums of boolean arrays. For the moment I saw one "annoying" change in >> numpy, and that is `abs(x - y)` being used for allclose and working >> nicely currently. > > I like mask = mask1 * mask2 > > That's what I learned working my way through scipy.stats.distributions > a long time ago. * is least problematic case, since there numpy and python bools already almost agree. (They return the same values, but numpy returns a bool array instead of an integer array.) On Thu, Dec 5, 2013 at 8:05 PM, Alan G Isaac wrote: > For + and * (and thus `dot`), this will "fix" something that is not broken. > It is in fact in conformance with a large literature on boolean arrays > and boolean matrices. Interesting point! I had assumed that dot() just upcast! But what do you think about the inconsistency between sum() and dot() on bool arrays? -n From sebastian at sipsolutions.net Fri Dec 6 04:39:36 2013 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Fri, 06 Dec 2013 10:39:36 +0100 Subject: [Numpy-discussion] Deprecate boolean math operators? In-Reply-To: References: <1386283039.23728.6.camel@sebastian-laptop> Message-ID: <1386322776.3153.21.camel@sebastian-laptop> On Thu, 2013-12-05 at 23:02 -0500, josef.pktd at gmail.com wrote: > On Thu, Dec 5, 2013 at 10:56 PM, Alexander Belopolsky wrote: > > On Thu, Dec 5, 2013 at 5:37 PM, Sebastian Berg > > wrote: > >> there was a discussion that for numpy booleans math operators +,-,* (and > >> the unary -), while defined, are not very helpful. 
> > > > It has been suggested at the Github that there is an area where it is useful > > to have linear algebra operations like matrix multiplication to be defined > > over a semiring: > > > > http://en.wikipedia.org/wiki/Logical_matrix > > > > This still does not justify having unary or binary -, so I suggest that we > > first discuss deprecation of those. > > Does it make sense to only remove - and maybe / ? > > would python sum still work? (I almost never use it.) > > >>> sum(mask) > 2 > >>> sum(mask.tolist()) > 2 > > is accumulate the same as sum and would keep working? > > >>> np.add.accumulate(mask) > array([0, 0, 0, 1, 2]) > > > In operation with other dtypes, do they still dominate so these work? > Hey, of course the other types will always dominate interpreting bools as 0 and 1. This would only affect operations with only booleans. There is a good point that * is well defined however you define it, though. (Btw. / is not defined for bools, `np.bool_(True)/np.bool_(True)` will upcast to int8 to do the operation) However, while well defined, + is not defined like it is for python bools (which are just ints) so that is the reason to consider deprecation there (if we allow upcast to int8 -- or maybe the default int -- in the future, in-place += and -= operations would not behave differently, since they just cast back...). I suppose python sum works because it first tries using the C-Api number protocol, which also means it is not affected. If you were to write a sum which just uses the `+` operator, it would be affected, but that would seem good to me. - Sebastian > >>> x / mask > array([0, 0, 0, 3, 4]) > >>> x * 1. 
/ mask > array([ nan, inf, inf, 3., 4.]) > >>> x**mask > array([1, 1, 1, 3, 4]) > >>> mask - 5 > array([-5, -5, -5, -4, -4]) > > Josef > > > > > _______________________________________________ > > NumPy-Discussion mailing list > > NumPy-Discussion at scipy.org > > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From josef.pktd at gmail.com Fri Dec 6 09:32:16 2013 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Fri, 6 Dec 2013 09:32:16 -0500 Subject: [Numpy-discussion] Deprecate boolean math operators? In-Reply-To: <1386322776.3153.21.camel@sebastian-laptop> References: <1386283039.23728.6.camel@sebastian-laptop> <1386322776.3153.21.camel@sebastian-laptop> Message-ID: On Fri, Dec 6, 2013 at 4:39 AM, Sebastian Berg wrote: > On Thu, 2013-12-05 at 23:02 -0500, josef.pktd at gmail.com wrote: >> On Thu, Dec 5, 2013 at 10:56 PM, Alexander Belopolsky wrote: >> > On Thu, Dec 5, 2013 at 5:37 PM, Sebastian Berg >> > wrote: >> >> there was a discussion that for numpy booleans math operators +,-,* (and >> >> the unary -), while defined, are not very helpful. >> > >> > It has been suggested at the Github that there is an area where it is useful >> > to have linear algebra operations like matrix multiplication to be defined >> > over a semiring: >> > >> > http://en.wikipedia.org/wiki/Logical_matrix >> > >> > This still does not justify having unary or binary -, so I suggest that we >> > first discuss deprecation of those. >> >> Does it make sense to only remove - and maybe / ? >> >> would python sum still work? (I almost never use it.) >> >> >>> sum(mask) >> 2 >> >>> sum(mask.tolist()) >> 2 >> >> is accumulate the same as sum and would keep working? 
>> >> >>> np.add.accumulate(mask) >> array([0, 0, 0, 1, 2]) >> >> >> In operation with other dtypes, do they still dominate so these work? >> > > Hey, In statistics and econometrics (and economic theory) we just use an indicator function 1_{x=5} which has largely the same properties as a numpy bool array, at least in my code. some of the common operations are *, dot and kron. So far this has worked quite well as intuition, plus numpy casting rules. dot is the main surprise, because I thought that it would upcast. (I always think of dot as a np.linalg.) > > of course the other types will always dominate interpreting bools as 0 > and 1. This would only affect operations with only booleans. My guess is that this would leave then 90% of our (statsmodels) possible usage alone. There is still the case that with * we can calculate the intersection. There is a > good point that * is well defined however you define it, though. (Btw. / > is not defined for bools, `np.bool_(True)/np.bool_(True)` will upcast to > int8 to do the operation) > > However, while well defined, + is not defined like it is for python > bools (which are just ints) so that is the reason to consider > deprecation there (if we allow upcast to int8 -- or maybe the default > int -- in the future, in-place += and -= operations would not behave > differently, since they just cast back...). Actually, I used + once: The calculation in terms of indicator functions is 1_{A} + 1_{B} - 1_{A & B} The last part avoids double counting, which is not necessary if numpy casts back to bool. Nothing that couldn't be replaced by logical operators, but the (linear) algebra is not "logical". In this case I did care about memory because the arrays are (nobs, nobs) (nobs is the number of observations shape[0]) which can be large, and I have a sparse version also. In most other case we use astype(int) already very often, because eventually we still have to cast and memory won't be a big problem. 
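[Editorial note: the inclusion-exclusion identity Josef describes, 1_{A} + 1_{B} - 1_{A & B}, can be written with explicit casts, sidestepping boolean `-` entirely. A sketch with illustrative masks:]

```python
import numpy as np

a = np.array([True, True, False, False])   # indicator of set A
b = np.array([True, False, True, False])   # indicator of set B

# Inclusion-exclusion on integer indicators: 1_{A|B} = 1_A + 1_B - 1_{A&B}.
union = a.astype(int) + b.astype(int) - (a & b).astype(int)
assert np.array_equal(union.astype(bool), a | b)

# sum() upcasts and counts members; + on the bools themselves saturates,
# which is why no double-counting correction is needed in that case.
assert (a & b).sum() == 1
assert np.array_equal(a + b, a | b)
```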
The mental model is set membership and set operations with indicator functions, not "logical", and I don't remember running into problems with this so far, and happily ignored logical_xxx when I do linear algebra instead of just working with masks of booleans. Nevertheless: If I'm forced to, then I will get used to logical_xxx. (*) And the above bool addition hasn't made it into statsmodels yet. I used a simpler version because I thought initially it's too cute. (And I was using an older numpy that couldn't do broadcasted dot.) (*) how do you search in the documentation of `&` or `|`, I cannot find what the other symbols are, if there are any. > > I suppose python sum works because it first tries using the C-Api number > protocol, which also means it is not affected. If you were to write a > sum which just uses the `+` operator, it would be affected, but that > would seem good to me. based on the ticket example, I'm not sure whether `+` should upcast or not. >>> mm.dtype dtype('bool') >>> mm.sum(0) array([48, 45, 56, 47]) >>> mm.sum(0, bool) array([ True, True, True, True], dtype=bool) I would just use any but what happens with logical cumsum >>> mm[:5].cumsum(0, bool) array([[False, True, True, True], [ True, True, True, True], [ True, True, True, True], [ True, True, True, True], [ True, True, True, True]], dtype=bool) same as mm[:5].astype(int).cumsum(0, bool) without casting Josef > > - Sebastian > > >> >>> x / mask >> array([0, 0, 0, 3, 4]) >> >>> x * 1. 
/ mask >> array([ nan, inf, inf, 3., 4.]) >> >>> x**mask >> array([1, 1, 1, 3, 4]) >> >>> mask - 5 >> array([-5, -5, -5, -4, -4]) >> >> Josef >> >> > >> > _______________________________________________ >> > NumPy-Discussion mailing list >> > NumPy-Discussion at scipy.org >> > http://mail.scipy.org/mailman/listinfo/numpy-discussion >> > >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion >> > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From alan.isaac at gmail.com Fri Dec 6 10:32:11 2013 From: alan.isaac at gmail.com (Alan G Isaac) Date: Fri, 06 Dec 2013 10:32:11 -0500 Subject: [Numpy-discussion] Deprecate boolean math operators? In-Reply-To: References: <1386283039.23728.6.camel@sebastian-laptop> Message-ID: <52A1EDFB.6090408@gmail.com> > On Thu, Dec 5, 2013 at 8:05 PM, Alan G Isaac > wrote: >> For + and * (and thus `dot`), this will "fix" something that is not broken. >> It is in fact in conformance with a large literature on boolean arrays >> and boolean matrices. On 12/6/2013 3:24 AM, Nathaniel Smith wrote: > Interesting point! I had assumed that dot() just upcast! But what do > you think about the inconsistency between sum() and dot() on bool > arrays? I don't like the behavior of sum on bool arrays. (I.e., automatic upcasting.) But I do not suggest changing it, as much code is likely to depend on it. Cheers, Alan From josef.pktd at gmail.com Fri Dec 6 11:12:11 2013 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Fri, 6 Dec 2013 11:12:11 -0500 Subject: [Numpy-discussion] Deprecate boolean math operators? 
In-Reply-To: References: <1386283039.23728.6.camel@sebastian-laptop> <1386322776.3153.21.camel@sebastian-laptop> Message-ID: On Fri, Dec 6, 2013 at 9:32 AM, wrote: > On Fri, Dec 6, 2013 at 4:39 AM, Sebastian Berg > wrote: >> On Thu, 2013-12-05 at 23:02 -0500, josef.pktd at gmail.com wrote: >>> On Thu, Dec 5, 2013 at 10:56 PM, Alexander Belopolsky wrote: >>> > On Thu, Dec 5, 2013 at 5:37 PM, Sebastian Berg >>> > wrote: >>> >> there was a discussion that for numpy booleans math operators +,-,* (and >>> >> the unary -), while defined, are not very helpful. >>> > >>> > It has been suggested at the Github that there is an area where it is useful >>> > to have linear algebra operations like matrix multiplication to be defined >>> > over a semiring: >>> > >>> > http://en.wikipedia.org/wiki/Logical_matrix >>> > >>> > This still does not justify having unary or binary -, so I suggest that we >>> > first discuss deprecation of those. >>> >>> Does it make sense to only remove - and maybe / ? >>> >>> would python sum still work? (I almost never use it.) >>> >>> >>> sum(mask) >>> 2 >>> >>> sum(mask.tolist()) >>> 2 >>> >>> is accumulate the same as sum and would keep working? >>> >>> >>> np.add.accumulate(mask) >>> array([0, 0, 0, 1, 2]) >>> >>> >>> In operation with other dtypes, do they still dominate so these work? >>> >> >> Hey, > > > In statistics and econometrics (and economic theory) we just use an > indicator function 1_{x=5} which has largely the same properties as a > numpy bool array, at least in my code. > > some of the common operations are *, dot and kron. > > So far this has worked quite well as intuition, plus numpy casting rules. > > dot is the main surprise, because I thought that it would upcast. (I > always think of dot as a np.linalg.) > > >> >> of course the other types will always dominate interpreting bools as 0 >> and 1. This would only affect operations with only booleans. 
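[Editorial note: the point that other dtypes dominate — so the deprecation would only touch bool-with-bool operations — is easy to demonstrate. A minimal sketch:]

```python
import numpy as np

mask = np.array([False, True, True, False])
x = np.arange(4)

# As soon as a non-boolean dtype is involved, the booleans are read as
# 0 and 1 and the result upcasts to the other operand's dtype.
assert (mask + x).dtype == x.dtype
assert np.array_equal(mask * x, np.array([0, 1, 2, 0]))
assert np.array_equal(mask + x, np.array([0, 2, 3, 3]))
```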
> > My guess is that this would leave then 90% of our (statsmodels) > possible usage alone. > > There is still the case that with * we can calculate the intersection. > > > There is a >> good point that * is well defined however you define it, though. (Btw. / >> is not defined for bools, `np.bool_(True)/np.bool_(True)` will upcast to >> int8 to do the operation) >> >> However, while well defined, + is not defined like it is for python >> bools (which are just ints) so that is the reason to consider >> deprecation there (if we allow upcast to int8 -- or maybe the default >> int -- in the future, in-place += and -= operations would not behave >> differently, since they just cast back...). > > Actually, I used + once: > > The calculation in terms of indicator functions is > > 1_{A} + 1_{B} - 1_{A & B} > > The last part avoids double counting, which is not necessary if numpy > casts back to bool. > Nothing that couldn't be replaced by logical operators, but the > (linear) algebra is not "logical". > > In this case I did care about memory because the arrays are (nobs, > nobs) (nobs is the number of observations shape[0]) which can be > large, and I have a sparse version also. In most other case we use > astype(int) already very often, because eventually we still have to > cast and memory won't be a big problem. > > The mental model is set membership and set operations with indicator > functions, not "logical", and I don't remember running into problems > with this so far, and happily ignored logical_xxx when I do linear > algebra instead of just working with masks of booleans. http://en.wikipedia.org/wiki/Indicator_function with the added advantage that we have also the version where + constrains to (0, 1). However `-` doesn't work properly because >>> np.bool_(-5) True instead of False except in the case `1 - mask`. We really have two kinds of addition: bool sum: for indicating set membership counting sum: for counting number of elements. 
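[Editorial note: the two kinds of addition Josef distinguishes can be put side by side. A minimal sketch with illustrative masks:]

```python
import numpy as np

m1 = np.array([False, True, True, False])
m2 = np.array([False, False, True, True])

# "bool sum": membership in the union, saturating at True.
assert np.array_equal(m1 + m2, np.array([False, True, True, True]))

# "counting sum": cast first, then add, so overlaps count twice.
counts = m1.astype(int) + m2.astype(int)
assert np.array_equal(counts, np.array([0, 1, 2, 1]))
```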
from my viewpoint: I would keep + and * since they work well (bool + and count +) minus - is partially broken and `/` looks useless this casts anyway >>> 1 - m1 array([1, 1, 0, 0, 0]) and I never thought of doing this >>> True - m1 array([ True, True, False, False, False], dtype=bool) (python set defines minus but raises error on plus) Josef > > Nevertheless: If I'm forced to, then I will get used to logical_xxx. (*) > And the above bool addition hasn't made it into statsmodels yet. I > used a simpler version because I thought initially it's too cute. (And > I was using an older numpy that couldn't do broadcasted dot.) > > (*) how do you search in the documentation of `&` or `|`, I cannot > find what the other symbols are, if there are any. > >> >> I suppose python sum works because it first tries using the C-Api number >> protocol, which also means it is not affected. If you were to write a >> sum which just uses the `+` operator, it would be affected, but that >> would seem good to me. > > based on the ticket example, I'm not sure whether `+` should upcast or not. > >>>> mm.dtype > dtype('bool') >>>> mm.sum(0) > array([48, 45, 56, 47]) > >>>> mm.sum(0, bool) > array([ True, True, True, True], dtype=bool) > I would just use any > > but what happens with logical cumsum > >>>> mm[:5].cumsum(0, bool) > array([[False, True, True, True], > [ True, True, True, True], > [ True, True, True, True], > [ True, True, True, True], > [ True, True, True, True]], dtype=bool) > > same as mm[:5].astype(int).cumsum(0, bool) without casting > > Josef > > >> >> - Sebastian >> >> >>> >>> x / mask >>> array([0, 0, 0, 3, 4]) >>> >>> x * 1. 
/ mask >>> array([ nan, inf, inf, 3., 4.]) >>> >>> x**mask >>> array([1, 1, 1, 3, 4]) >>> >>> mask - 5 >>> array([-5, -5, -5, -4, -4]) >>> >>> Josef >>> >>> > >>> > _______________________________________________ >>> > NumPy-Discussion mailing list >>> > NumPy-Discussion at scipy.org >>> > http://mail.scipy.org/mailman/listinfo/numpy-discussion >>> > >>> _______________________________________________ >>> NumPy-Discussion mailing list >>> NumPy-Discussion at scipy.org >>> http://mail.scipy.org/mailman/listinfo/numpy-discussion >>> >> >> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion From alan.isaac at gmail.com Fri Dec 6 11:13:07 2013 From: alan.isaac at gmail.com (Alan G Isaac) Date: Fri, 06 Dec 2013 11:13:07 -0500 Subject: [Numpy-discussion] Deprecate boolean math operators? In-Reply-To: References: <1386283039.23728.6.camel@sebastian-laptop> <52A14D0C.1060209@gmail.com> Message-ID: <52A1F793.3010506@gmail.com> On 12/5/2013 11:14 PM, Alexander Belopolsky wrote: > did you find minus to be as useful? It is also a correct usage. I think a good approach to this is to first realize that there were good reasons for the current behavior. Alan Isaac From josef.pktd at gmail.com Fri Dec 6 11:20:34 2013 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Fri, 6 Dec 2013 11:20:34 -0500 Subject: [Numpy-discussion] Deprecate boolean math operators? In-Reply-To: <52A1F793.3010506@gmail.com> References: <1386283039.23728.6.camel@sebastian-laptop> <52A14D0C.1060209@gmail.com> <52A1F793.3010506@gmail.com> Message-ID: On Fri, Dec 6, 2013 at 11:13 AM, Alan G Isaac wrote: > On 12/5/2013 11:14 PM, Alexander Belopolsky wrote: >> did you find minus to be as useful? > > > It is also a correct usage. > > I think a good approach to this is to first realize that > there were good reasons for the current behavior. What's the meaning of minus? 
I cannot make much sense out of it, or come up with any use case. Josef > > Alan Isaac > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From ndarray at mac.com Fri Dec 6 12:23:23 2013 From: ndarray at mac.com (Alexander Belopolsky) Date: Fri, 6 Dec 2013 12:23:23 -0500 Subject: [Numpy-discussion] Deprecate boolean math operators? In-Reply-To: <52A1F793.3010506@gmail.com> References: <1386283039.23728.6.camel@sebastian-laptop> <52A14D0C.1060209@gmail.com> <52A1F793.3010506@gmail.com> Message-ID: On Fri, Dec 6, 2013 at 11:13 AM, Alan G Isaac wrote: > On 12/5/2013 11:14 PM, Alexander Belopolsky wrote: > > did you find minus to be as useful? > > > It is also a correct usage. > > Can you provide a reference? > I think a good approach to this is to first realize that > there were good reasons for the current behavior. > > Maybe there were, in which case the current behavior should be documented somewhere. What is the rationale for this: >>> -array(True) + array(True) True ? I am not aware of any algebraic system where unary minus denotes anything other than additive inverse. Having bools form a semiring under + and * is a fine (yet somewhat unusual) choice, but once you've made that choice you loose subtraction because True + x = True no longer has a unique solution. -------------- next part -------------- An HTML attachment was scrubbed... URL: From josef.pktd at gmail.com Fri Dec 6 12:47:06 2013 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Fri, 6 Dec 2013 12:47:06 -0500 Subject: [Numpy-discussion] Deprecate boolean math operators? 
In-Reply-To: References: <1386283039.23728.6.camel@sebastian-laptop> <52A14D0C.1060209@gmail.com> <52A1F793.3010506@gmail.com> Message-ID: On Fri, Dec 6, 2013 at 12:23 PM, Alexander Belopolsky wrote: > On Fri, Dec 6, 2013 at 11:13 AM, Alan G Isaac wrote: > >> On 12/5/2013 11:14 PM, Alexander Belopolsky wrote: >> > did you find minus to be as useful? >> >> >> It is also a correct usage. >> > > Can you provide a reference? > > >> >> I think a good approach to this is to first realize that >> there were good reasons for the current behavior. >> > > Maybe there were, in which case the current behavior should be documented > somewhere. > > What is the rationale for this: > >>>> -array(True) + array(True) > True > > ? > > I am not aware of any algebraic system where unary minus denotes anything > other than additive inverse. I would be perfectly happy if numpy would cast (negative) overflow to the smallest value, instead of wrapping around. The same is true for integers. >>> np.array(0, np.int8) - np.array(-128, np.int8) -128 >>> - np.array(-128, np.int8) -128 Josef > > Having bools form a semiring under + and * is a fine (yet somewhat unusual) > choice, but once you've made that choice you loose subtraction because True > + x = True no longer has a unique solution. 
> > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From ralf.gommers at gmail.com Fri Dec 6 13:06:55 2013 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Fri, 6 Dec 2013 19:06:55 +0100 Subject: [Numpy-discussion] distributing wheels & SSE/superpack options Message-ID: Hi all, There are a few discussions on packaging for the scientific Python stack ongoing, on the NumFOCUS and distutils lists: https://groups.google.com/forum/#!topic/numfocus/mVNakFqfpZg https://groups.google.com/forum/#!topic/numfocus/HUcwXTM_jNY http://thread.gmane.org/gmane.comp.python.distutils.devel/20202 http://thread.gmane.org/gmane.comp.python.distutils.devel/20296 One of the things that we should start doing for numpy is distribute releases as wheels. On OS X at least this is quite simple, so I propose to just experiment with it. I can create some to try out and put them on a separate folder on SourceForge. If that works they can be put on PyPi. For Windows things are less simple, because the wheel format doesn't handle the multiple builds (no SSE, SSE2, SSE3) that are in the superpack installers. A problem is that we don't really know how many users still have old CPUs that don't support SSE3. The impact for those users is high, numpy will install but crash (see https://github.com/scipy/scipy/issues/1697). Questions: 1. does anyone have a good idea to obtain statistics? 2. in the absence of statistics, can we do an experiment by putting one wheel up on PyPi which contains SSE3 instructions, for python 3.3 I propose, and seeing for how many (if any) users this goes wrong? Ralf P.S. related question: did anyone check whether the recently merged NPY_HAVE_SSE2_INTRINSIC puts SSE2 instructions into the no-SSE binary? -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From alan.isaac at gmail.com Fri Dec 6 13:16:08 2013 From: alan.isaac at gmail.com (Alan G Isaac) Date: Fri, 06 Dec 2013 13:16:08 -0500 Subject: [Numpy-discussion] Deprecate boolean math operators? In-Reply-To: References: <1386283039.23728.6.camel@sebastian-laptop> <52A14D0C.1060209@gmail.com> <52A1F793.3010506@gmail.com> Message-ID: <52A21468.8010809@gmail.com> On 12/6/2013 12:23 PM, Alexander Belopolsky wrote: > What is the rationale for this: > > >>> -array(True) + array(True) > True The minus is complementation. So you are just writing False or True Alan Isaac From alan.isaac at gmail.com Fri Dec 6 13:31:34 2013 From: alan.isaac at gmail.com (Alan G Isaac) Date: Fri, 06 Dec 2013 13:31:34 -0500 Subject: [Numpy-discussion] Deprecate boolean math operators? In-Reply-To: References: <1386283039.23728.6.camel@sebastian-laptop> <52A14D0C.1060209@gmail.com> <52A1F793.3010506@gmail.com> Message-ID: <52A21806.3080103@gmail.com> >> On 12/5/2013 11:14 PM, Alexander Belopolsky wrote: >>> did you find minus to be as useful? > On Fri, Dec 6, 2013 at 11:13 AM, Alan G Isaac >> It is also a correct usage. On 12/6/2013 12:23 PM, Alexander Belopolsky wrote: > Can you provide a reference? For use of the minus sign, I don't have one at hand, but a quick Google seach comes up with: http://www.csee.umbc.edu/~artola/fall02/BooleanAlgebra.ppt It is more common to use a superscript `c`, but that's just a notational issue. For multiplication, addition, and dot, you can see Ki Hang Kim's Boolean matrix Theory and Applications. Applications are endless and include graph theory and (then naturally) circuit design. Cheers, Alan Isaac From josef.pktd at gmail.com Fri Dec 6 13:35:16 2013 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Fri, 6 Dec 2013 13:35:16 -0500 Subject: [Numpy-discussion] Deprecate boolean math operators? 
In-Reply-To: <52A21468.8010809@gmail.com> References: <1386283039.23728.6.camel@sebastian-laptop> <52A14D0C.1060209@gmail.com> <52A1F793.3010506@gmail.com> <52A21468.8010809@gmail.com> Message-ID: On Fri, Dec 6, 2013 at 1:16 PM, Alan G Isaac wrote: > On 12/6/2013 12:23 PM, Alexander Belopolsky wrote: >> What is the rationale for this: >> >> >>> -array(True) + array(True) >> True > > > The minus is complementation. > So you are just writing > False or True unary versus binary minus >>> m1 + (-m2) array([False, False, True, True, True], dtype=bool) >>> m1 - m2 array([ True, True, False, False, True], dtype=bool) >>> -m2 + m1 array([False, False, True, True, True], dtype=bool) >>> m1 - (-m2) array([False, False, True, True, False], dtype=bool) I'd rather write ~ than unary - if that's what it is. Josef > > Alan Isaac > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From alan.isaac at gmail.com Fri Dec 6 13:46:20 2013 From: alan.isaac at gmail.com (Alan G Isaac) Date: Fri, 06 Dec 2013 13:46:20 -0500 Subject: [Numpy-discussion] Deprecate boolean math operators? In-Reply-To: References: <1386283039.23728.6.camel@sebastian-laptop> <52A14D0C.1060209@gmail.com> <52A1F793.3010506@gmail.com> <52A21468.8010809@gmail.com> Message-ID: <52A21B7C.6000908@gmail.com> On 12/6/2013 1:35 PM, josef.pktd at gmail.com wrote: > unary versus binary minus Oh right; I consider binary `-` broken for Boolean arrays. (Sorry Alexander; I did not see your entire issue.) > I'd rather write ~ than unary - if that's what it is. I agree. So I have no objection to elimination of the `-`. I see it does the subtraction and then a boolean conversion, which is not helpful. Or rather, I do not see how it can be helpful. 
Alan Isaac From jtaylor.debian at googlemail.com Fri Dec 6 13:58:59 2013 From: jtaylor.debian at googlemail.com (Julian Taylor) Date: Fri, 06 Dec 2013 19:58:59 +0100 Subject: [Numpy-discussion] distributing wheels & SSE/superpack options In-Reply-To: References: Message-ID: <52A21E73.5090200@googlemail.com> On 06.12.2013 19:06, Ralf Gommers wrote: > Hi all, > > There are a few discussions on packaging for the scientific Python stack > ongoing, on the NumFOCUS and distutils lists: > https://groups.google.com/forum/#!topic/numfocus/mVNakFqfpZg > https://groups.google.com/forum/#!topic/numfocus/HUcwXTM_jNY > http://thread.gmane.org/gmane.comp.python.distutils.devel/20202 > http://thread.gmane.org/gmane.comp.python.distutils.devel/20296 > > One of the things that we should start doing for numpy is distribute > releases as wheels. On OS X at least this is quite simple, so I propose > to just experiment with it. I can create some to try out and put them on > a separate folder on SourceForge. If that works they can be put on PyPi. > > For Windows things are less simple, because the wheel format doesn't > handle the multiple builds (no SSE, SSE2, SSE3) that are in the > superpack installers. A problem is that we don't really know how many > users still have old CPUs that don't support SSE3. The impact for those > users is high, numpy will install but crash (see > https://github.com/scipy/scipy/issues/1697). Questions: > 1. does anyone have a good idea to obtain statistics? > 2. in the absence of statistics, can we do an experiment by putting one > wheel up on PyPi which contains SSE3 instructions, for python 3.3 I > propose, and seeing for how many (if any) users this goes wrong? why SSE3 and not SSE2? SSE2 is a requirement of the amd64 ABI, so it is present in all 64 bit x86 cpus, so a even majority of windows machines running 32 bit will have it. SSE3 is not mandated by any ABI so it should more likely to find machines without it. 
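[Editorial note: one way to gather the statistics Ralf asks about, or to build an install-time check, is to parse the CPU feature flags. The sketch below is a Linux-specific assumption (it expects `/proc/cpuinfo`-style text, and Linux reports SSE3 under the flag name `pni`, Prescott New Instructions), not a cross-platform solution:]

```python
def sse_levels(cpuinfo_text):
    """Return SSE-related levels found in a /proc/cpuinfo-style dump.

    Linux-specific sketch: SSE3 appears as the flag 'pni', not 'sse3'.
    """
    names = {"sse": "sse", "sse2": "sse2", "pni": "sse3", "ssse3": "ssse3"}
    for line in cpuinfo_text.splitlines():
        if line.startswith("flags"):
            flags = line.split(":", 1)[1].split()
            return sorted({names[f] for f in flags if f in names})
    return []

sample = "processor\t: 0\nflags\t\t: fpu vme de sse sse2 pni\n"
print(sse_levels(sample))  # ['sse', 'sse2', 'sse3']
```

On Windows a real check would need a different source of truth (e.g. the CPUID instruction), which is exactly why the superpack installers carry their own detection.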
to my knowledge SSE3 is not such big a difference to SSE2, only a little better complex arithmetic and horizontal additions, I don't think its worth it. Are there performance comparisons for ATLAS with SSE2 and SSE3 available? > > P.S. related question: did anyone check whether the recently merged > NPY_HAVE_SSE2_INTRINSIC puts SSE2 instructions into the no-SSE binary? > according to https://github.com/numpy/numpy/issues/3760 SSE2 should be off in the binaries created with mingw. but there was also https://github.com/numpy/numpy/issues/3680, but that might have been built with the VSC compiler (I think by Christoph Gohlke) Assuming linux objdump works correctly on windows .pyd files there are indeed sse2 instructions in the win32 build created with VSC. From josef.pktd at gmail.com Fri Dec 6 13:59:31 2013 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Fri, 6 Dec 2013 13:59:31 -0500 Subject: [Numpy-discussion] Deprecate boolean math operators? In-Reply-To: <52A21B7C.6000908@gmail.com> References: <1386283039.23728.6.camel@sebastian-laptop> <52A14D0C.1060209@gmail.com> <52A1F793.3010506@gmail.com> <52A21468.8010809@gmail.com> <52A21B7C.6000908@gmail.com> Message-ID: On Fri, Dec 6, 2013 at 1:46 PM, Alan G Isaac wrote: > On 12/6/2013 1:35 PM, josef.pktd at gmail.com wrote: >> unary versus binary minus > > Oh right; I consider binary `-` broken for > Boolean arrays. (Sorry Alexander; I did not > see your entire issue.) > > >> I'd rather write ~ than unary - if that's what it is. > > I agree. So I have no objection to elimination > of the `-`. I see it does the subtraction and then > a boolean conversion, which is not helpful. > Or rather, I do not see how it can be helpful. 
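[Editorial note: the replacements favoured in this thread — `^` for the binary minus and `~` for the unary minus — look like this in practice (boolean `-` was in fact later removed from NumPy and now raises a TypeError):]

```python
import numpy as np

m1 = np.array([False, True, True, False])
m2 = np.array([False, False, True, True])

# Symmetric difference, in place of binary - on bools.
assert np.array_equal(m1 ^ m2, np.array([False, True, False, True]))

# Complement, in place of unary - on bools.
assert np.array_equal(~m2, np.array([True, True, False, False]))

# Set difference A \ B as masks, without any clipping tricks.
assert np.array_equal(m1 & ~m2, np.array([False, True, False, False]))
```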
What I would or might find useful is if binary `-` subtracts set membership instead of doing xor >>> m1 = np.array([0,0,1,1], bool) >>> m2 = np.array([0,1,0,1], bool) >>> m1 - m2 array([False, True, True, False], dtype=bool) >>> np.logical_xor(m1, m2) array([False, True, True, False], dtype=bool) >>> np.clip(m1.astype(int) - m2.astype(int), 0, 1).astype(bool) array([False, False, True, False], dtype=bool) >>> np.nonzero(_)[0] array([2]) >>> s1 = set(np.arange(4)[m1]) >>> s2 = set(np.arange(4)[m2]) >>> s1 - s2 set([2]) Josef > > Alan Isaac > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From ndarray at mac.com Fri Dec 6 14:55:41 2013 From: ndarray at mac.com (Alexander Belopolsky) Date: Fri, 6 Dec 2013 14:55:41 -0500 Subject: [Numpy-discussion] Deprecate boolean math operators? In-Reply-To: <52A21B7C.6000908@gmail.com> References: <1386283039.23728.6.camel@sebastian-laptop> <52A14D0C.1060209@gmail.com> <52A1F793.3010506@gmail.com> <52A21468.8010809@gmail.com> <52A21B7C.6000908@gmail.com> Message-ID: On Fri, Dec 6, 2013 at 1:46 PM, Alan G Isaac wrote: > On 12/6/2013 1:35 PM, josef.pktd at gmail.com wrote: > > unary versus binary minus > > Oh right; I consider binary `-` broken for > Boolean arrays. (Sorry Alexander; I did not > see your entire issue.) > > > > I'd rather write ~ than unary - if that's what it is. > > I agree. So I have no objection to elimination > of the `-`. It looks like we are close to reaching a consensus on the following points: 1. * is well-defined on boolean arrays and may be used in preference of & in code that is designed to handle 1s and 0s of any dtype in addition to booleans. 2. + is defined consistently with * and the only issue is the absence of additive inverse. This is not a problem as long as presence of - does not suggest otherwise. 3. 
binary and unary minus should be deprecated because its use in expressions where variables can be either boolean or numeric would lead to subtle bugs. For example -x*y would produce different results from -(x*y) depending on whether x is boolean or not. In all situations, ^ is preferable to binary - and ~ is preferable to unary -. 4. changing boolean arithmetics to auto-promotion to int is precluded by a significant use-case of boolean matrices. -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Fri Dec 6 14:59:36 2013 From: njs at pobox.com (Nathaniel Smith) Date: Fri, 6 Dec 2013 11:59:36 -0800 Subject: [Numpy-discussion] Deprecate boolean math operators? In-Reply-To: References: <1386283039.23728.6.camel@sebastian-laptop> <52A14D0C.1060209@gmail.com> <52A1F793.3010506@gmail.com> <52A21468.8010809@gmail.com> <52A21B7C.6000908@gmail.com> Message-ID: On Fri, Dec 6, 2013 at 11:55 AM, Alexander Belopolsky wrote: > > > > On Fri, Dec 6, 2013 at 1:46 PM, Alan G Isaac wrote: >> >> On 12/6/2013 1:35 PM, josef.pktd at gmail.com wrote: >> > unary versus binary minus >> >> Oh right; I consider binary `-` broken for >> Boolean arrays. (Sorry Alexander; I did not >> see your entire issue.) >> >> >> > I'd rather write ~ than unary - if that's what it is. >> >> I agree. So I have no objection to elimination >> of the `-`. > > > It looks like we are close to reaching a consensus on the following points: > > 1. * is well-defined on boolean arrays and may be used in preference of & in > code that is designed to handle 1s and 0s of any dtype in addition to > booleans. > > 2. + is defined consistently with * and the only issue is the absence of > additive inverse. This is not a problem as long as presence of - does not > suggest otherwise. > > 3. binary and unary minus should be deprecated because its use in > expressions where variables can be either boolean or numeric would lead to > subtle bugs. 
For example -x*y would produce different results from -(x*y) > depending on whether x is boolean or not. In all situations, ^ is > preferable to binary - and ~ is preferable to unary -. > > 4. changing boolean arithmetics to auto-promotion to int is precluded by a > significant use-case of boolean matrices. +1 -- Nathaniel J. Smith Postdoctoral researcher - Informatics - University of Edinburgh http://vorpus.org From chris.barker at noaa.gov Fri Dec 6 15:09:55 2013 From: chris.barker at noaa.gov (Chris Barker) Date: Fri, 6 Dec 2013 12:09:55 -0800 Subject: [Numpy-discussion] distributing wheels & SSE/superpack options In-Reply-To: References: Message-ID: On Fri, Dec 6, 2013 at 10:06 AM, Ralf Gommers wrote: > One of the things that we should start doing for numpy is distribute > releases as wheels. On OS X at least this is quite simple, so I propose to > just experiment with it. I can create some to try out and put them on a > separate folder on SourceForge. If that works they can be put on PyPi. > > +1 For Windows things are less simple, because the wheel format doesn't handle > the multiple builds (no SSE, SSE2, SSE3) that are in the superpack > installers. A problem is that we don't really know how many users still > have old CPUs that don't support SSE3. The impact for those users is high, > numpy will install but crash (see > https://github.com/scipy/scipy/issues/1697). > Could we have a run-time check, so at least folks would get a nice error message? 2. in the absence of statistics, can we do an experiment by putting one > wheel up on PyPi which contains SSE3 instructions, for python 3.3 I > propose, and seeing for how many (if any) users this goes wrong? > sounds good -- it looks like SSE3 has been around a good while: http://en.wikipedia.org/wiki/SSE3 8+ years is a pretty long time in computer land! anyone know how long SSE3 has been around? -- Christopher Barker, Ph.D. 
Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov -------------- next part -------------- An HTML attachment was scrubbed... URL: From oscar.j.benjamin at gmail.com Fri Dec 6 15:28:55 2013 From: oscar.j.benjamin at gmail.com (Oscar Benjamin) Date: Fri, 6 Dec 2013 20:28:55 +0000 Subject: [Numpy-discussion] distributing wheels & SSE/superpack options In-Reply-To: References: Message-ID: On 6 December 2013 20:09, Chris Barker wrote: >> 2. in the absence of statistics, can we do an experiment by putting one >> wheel up on PyPi which contains SSE3 instructions, for python 3.3 I propose, >> and seeing for how many (if any) users this goes wrong? > > > sounds good -- it looks like SSE3 has been around a good while: > > http://en.wikipedia.org/wiki/SSE3 > > 8+ years is a pretty long time in computer land! > > anyone know how long SSE3 has been around? I don't have statistics but I do have a couple of data points. Both of the computers I regularly use (my work desktop and my girlfriend's laptop) have SSE2 but not SSE3. Really I'm not sure that releasing a potentially compatible binary - with no install time checks - is such a good idea. What we really want is a situation where you can confidently advise someone to just "pip install numpy" without caveats i.e. a solution that "just works". Oscar From josef.pktd at gmail.com Fri Dec 6 15:30:47 2013 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Fri, 6 Dec 2013 15:30:47 -0500 Subject: [Numpy-discussion] Deprecate boolean math operators? 
In-Reply-To: References: <1386283039.23728.6.camel@sebastian-laptop> <52A14D0C.1060209@gmail.com> <52A1F793.3010506@gmail.com> <52A21468.8010809@gmail.com> <52A21B7C.6000908@gmail.com> Message-ID: On Fri, Dec 6, 2013 at 2:59 PM, Nathaniel Smith wrote: > On Fri, Dec 6, 2013 at 11:55 AM, Alexander Belopolsky wrote: >> >> >> >> On Fri, Dec 6, 2013 at 1:46 PM, Alan G Isaac wrote: >>> >>> On 12/6/2013 1:35 PM, josef.pktd at gmail.com wrote: >>> > unary versus binary minus >>> >>> Oh right; I consider binary `-` broken for >>> Boolean arrays. (Sorry Alexander; I did not >>> see your entire issue.) >>> >>> >>> > I'd rather write ~ than unary - if that's what it is. >>> >>> I agree. So I have no objection to elimination >>> of the `-`. >> >> >> It looks like we are close to reaching a consensus on the following points: >> >> 1. * is well-defined on boolean arrays and may be used in preference of & in >> code that is designed to handle 1s and 0s of any dtype in addition to >> booleans. >> >> 2. + is defined consistently with * and the only issue is the absence of >> additive inverse. This is not a problem as long as presence of - does not >> suggest otherwise. >> >> 3. binary and unary minus should be deprecated because its use in >> expressions where variables can be either boolean or numeric would lead to >> subtle bugs. For example -x*y would produce different results from -(x*y) >> depending on whether x is boolean or not. In all situations, ^ is >> preferable to binary - and ~ is preferable to unary -. >> >> 4. changing boolean arithmetics to auto-promotion to int is precluded by a >> significant use-case of boolean matrices. > > +1 +0.5 (I would still prefer a different binary minus, but it would be inconsistent with a logical unary minus that negates.) 5. `/` is useless 6 `**` follows from 1. Josef > > -- > Nathaniel J. 
Smith > Postdoctoral researcher - Informatics - University of Edinburgh > http://vorpus.org > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From cournape at gmail.com Fri Dec 6 15:37:42 2013 From: cournape at gmail.com (David Cournapeau) Date: Fri, 6 Dec 2013 20:37:42 +0000 Subject: [Numpy-discussion] distributing wheels & SSE/superpack options In-Reply-To: References: Message-ID: On Fri, Dec 6, 2013 at 8:28 PM, Oscar Benjamin wrote: > On 6 December 2013 20:09, Chris Barker wrote: > >> 2. in the absence of statistics, can we do an experiment by putting one > >> wheel up on PyPi which contains SSE3 instructions, for python 3.3 I > propose, > >> and seeing for how many (if any) users this goes wrong? > > > > > > sounds good -- it looks like SSE3 has been around a good while: > > > > http://en.wikipedia.org/wiki/SSE3 > > > > 8+ years is a pretty long time in computer land! > > > > anyone know how long SSE3 has been around? > > I don't have statistics but I do have a couple of data points. Both of > the computers I regularly use (my work desktop and my girlfriend's > laptop) have SSE2 but not SSE3. > > Really I'm not sure that releasing a potentially compatible binary - > with no install time checks - is such a good idea. What we really want > is a situation where you can confidently advise someone to just "pip > install numpy" without caveats i.e. a solution that "just works". > agreed. Also, we should not lie to ourselves: our current ATLAS on windows are most likely not very efficient anyway, SSE or not. Ralf, you mentioned that openblas was problematic on windows ? I could not find any recent discussion on that list. David -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From cgohlke at uci.edu Fri Dec 6 15:38:32 2013 From: cgohlke at uci.edu (Christoph Gohlke) Date: Fri, 06 Dec 2013 12:38:32 -0800 Subject: [Numpy-discussion] distributing wheels & SSE/superpack options In-Reply-To: References: Message-ID: <52A235C8.8030103@uci.edu> On 12/6/2013 10:06 AM, Ralf Gommers wrote: > Hi all, > > There are a few discussions on packaging for the scientific Python stack > ongoing, on the NumFOCUS and distutils lists: > https://groups.google.com/forum/#!topic/numfocus/mVNakFqfpZg > > https://groups.google.com/forum/#!topic/numfocus/HUcwXTM_jNY > > http://thread.gmane.org/gmane.comp.python.distutils.devel/20202 > http://thread.gmane.org/gmane.comp.python.distutils.devel/20296 > > One of the things that we should start doing for numpy is distribute > releases as wheels. On OS X at least this is quite simple, so I propose > to just experiment with it. I can create some to try out and put them on > a separate folder on SourceForge. If that works they can be put on PyPi. > > For Windows things are less simple, because the wheel format doesn't > handle the multiple builds (no SSE, SSE2, SSE3) that are in the > superpack installers. A problem is that we don't really know how many > users still have old CPUs that don't support SSE3. The impact for those > users is high, numpy will install but crash (see > https://github.com/scipy/scipy/issues/1697). Questions: > 1. does anyone have a good idea to obtain statistics? > 2. in the absence of statistics, can we do an experiment by putting one > wheel up on PyPi which contains SSE3 instructions, for python 3.3 I > propose, and seeing for how many (if any) users this goes wrong? > > Ralf > > P.S. related question: did anyone check whether the recently merged > NPY_HAVE_SSE2_INTRINSIC puts SSE2 instructions into the no-SSE binary? > > Has anyone succeeded building wheels for numpy, scipy, and matplotlib? On Windows `bdist_wheel` fails for me. 
It looks like numpy.distutils doesn't know about wheels and wheel 0.22.0 fails to package matplotlib. Pillow, pandas, scikit-image, scikits-learn work. Christoph Numpy 1.8.x ----------- ... running build_scripts creating build\scripts.win-amd64-3.3 Creating build\scripts.win-amd64-3.3\f2py.py adding 'build\scripts.win-amd64-3.3\f2py.py' to scripts Running from numpy source directory. usage: setup.py [global_opts] cmd1 [cmd1_opts] [cmd2 [cmd2_opts] ...] or: setup.py --help [cmd1 cmd2 ...] or: setup.py --help-commands or: setup.py cmd --help error: invalid command 'bdist_wheel' Scipy 0.13.x ------------ ... X:\Python27-x64\lib\distutils\dist.py:267: UserWarning: Unknown distribution option: 'test_suite' warnings.warn(msg) usage: setup.py [global_opts] cmd1 [cmd1_opts] [cmd2 [cmd2_opts] ...] or: setup.py --help [cmd1 cmd2 ...] or: setup.py --help-commands or: setup.py cmd --help error: invalid command 'bdist_wheel' Matplotlib 1.3.x ---------------- ... installing to build\bdist.win32\wheel running install running install_lib copying pylab.py -> build\bdist.win32\wheel\matplotlib-1.3.1.data\.. 
error: build\bdist.win32\wheel\matplotlib-1.3.1.data\..: Cannot create a file when that file already exists From cournape at gmail.com Fri Dec 6 15:40:59 2013 From: cournape at gmail.com (David Cournapeau) Date: Fri, 6 Dec 2013 20:40:59 +0000 Subject: [Numpy-discussion] distributing wheels & SSE/superpack options In-Reply-To: <52A235C8.8030103@uci.edu> References: <52A235C8.8030103@uci.edu> Message-ID: On Fri, Dec 6, 2013 at 8:38 PM, Christoph Gohlke wrote: > On 12/6/2013 10:06 AM, Ralf Gommers wrote: > > Hi all, > > > > There are a few discussions on packaging for the scientific Python stack > > ongoing, on the NumFOCUS and distutils lists: > > https://groups.google.com/forum/#!topic/numfocus/mVNakFqfpZg > > > > https://groups.google.com/forum/#!topic/numfocus/HUcwXTM_jNY > > > > http://thread.gmane.org/gmane.comp.python.distutils.devel/20202 > > http://thread.gmane.org/gmane.comp.python.distutils.devel/20296 > > > > One of the things that we should start doing for numpy is distribute > > releases as wheels. On OS X at least this is quite simple, so I propose > > to just experiment with it. I can create some to try out and put them on > > a separate folder on SourceForge. If that works they can be put on PyPi. > > > > For Windows things are less simple, because the wheel format doesn't > > handle the multiple builds (no SSE, SSE2, SSE3) that are in the > > superpack installers. A problem is that we don't really know how many > > users still have old CPUs that don't support SSE3. The impact for those > > users is high, numpy will install but crash (see > > https://github.com/scipy/scipy/issues/1697). Questions: > > 1. does anyone have a good idea to obtain statistics? > > 2. in the absence of statistics, can we do an experiment by putting one > > wheel up on PyPi which contains SSE3 instructions, for python 3.3 I > > propose, and seeing for how many (if any) users this goes wrong? > > > > Ralf > > > > P.S. 
related question: did anyone check whether the recently merged > > NPY_HAVE_SSE2_INTRINSIC puts SSE2 instructions into the no-SSE binary? > > > > > > Has anyone succeeded building wheels for numpy, scipy, and matplotlib? > I did for numpy and scipy. You had to hack a bit numpy.distutils to make it work for scipy,but nothing that would be too complicated to really fix. In your case, the trick is to use the setupegg file: python setupegg.py bdist_wheel David > On Windows `bdist_wheel` fails for me. It looks like numpy.distutils > doesn't know about wheels and version 0.22.0 fails to package > matplotlib. Pillow, pandas, scikit-image, scikits-learn work. > > Christoph > > > Numpy 1.8.x > ----------- > ... > running build_scripts > creating build\scripts.win-amd64-3.3 > Creating build\scripts.win-amd64-3.3\f2py.py > adding 'build\scripts.win-amd64-3.3\f2py.py' to scripts > Running from numpy source directory. > usage: setup.py [global_opts] cmd1 [cmd1_opts] [cmd2 [cmd2_opts] ...] > or: setup.py --help [cmd1 cmd2 ...] > or: setup.py --help-commands > or: setup.py cmd --help > > error: invalid command 'bdist_wheel' > > > Scipy 0.13.x > ------------ > ... > X:\Python27-x64\lib\distutils\dist.py:267: UserWarning: Unknown > distribution option: 'test_suite' > warnings.warn(msg) > usage: setup.py [global_opts] cmd1 [cmd1_opts] [cmd2 [cmd2_opts] ...] > or: setup.py --help [cmd1 cmd2 ...] > or: setup.py --help-commands > or: setup.py cmd --help > > error: invalid command 'bdist_wheel' > > > Matplotlib 1.3.x > ---------------- > ... > installing to build\bdist.win32\wheel > running install > running install_lib > copying pylab.py -> build\bdist.win32\wheel\matplotlib-1.3.1.data\.. 
> error: build\bdist.win32\wheel\matplotlib-1.3.1.data\..: Cannot create a > file when that file already exists > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From sebastian at sipsolutions.net Fri Dec 6 15:50:42 2013 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Fri, 06 Dec 2013 21:50:42 +0100 Subject: [Numpy-discussion] Deprecate boolean math operators? In-Reply-To: References: <1386283039.23728.6.camel@sebastian-laptop> <52A14D0C.1060209@gmail.com> <52A1F793.3010506@gmail.com> <52A21468.8010809@gmail.com> <52A21B7C.6000908@gmail.com> Message-ID: <1386363042.29623.10.camel@sebastian-laptop> On Fri, 2013-12-06 at 15:30 -0500, josef.pktd at gmail.com wrote: > On Fri, Dec 6, 2013 at 2:59 PM, Nathaniel Smith wrote: > > On Fri, Dec 6, 2013 at 11:55 AM, Alexander Belopolsky wrote: > >> > >> > >> > >> On Fri, Dec 6, 2013 at 1:46 PM, Alan G Isaac wrote: > >>> > >>> On 12/6/2013 1:35 PM, josef.pktd at gmail.com wrote: > >>> > unary versus binary minus > >>> > >>> Oh right; I consider binary `-` broken for > >>> Boolean arrays. (Sorry Alexander; I did not > >>> see your entire issue.) > >>> > >>> > >>> > I'd rather write ~ than unary - if that's what it is. > >>> > >>> I agree. So I have no objection to elimination > >>> of the `-`. > >> > >> > >> It looks like we are close to reaching a consensus on the following points: > >> > >> 1. * is well-defined on boolean arrays and may be used in preference of & in > >> code that is designed to handle 1s and 0s of any dtype in addition to > >> booleans. > >> > >> 2. + is defined consistently with * and the only issue is the absence of > >> additive inverse. This is not a problem as long as presence of - does not > >> suggest otherwise. > >> > >> 3. 
binary and unary minus should be deprecated because its use in > >> expressions where variables can be either boolean or numeric would lead to > >> subtle bugs. For example -x*y would produce different results from -(x*y) > >> depending on whether x is boolean or not. In all situations, ^ is > >> preferable to binary - and ~ is preferable to unary -. > >> > >> 4. changing boolean arithmetics to auto-promotion to int is precluded by a > >> significant use-case of boolean matrices. > > > > +1 > > +0.5 > (I would still prefer a different binary minus, but it would be > inconsistent with a logical unary minus that negates.) > The question is if the current xor behaviour can make sense? It doesn't seem to make much sense mathematically? Which only leaves that `abs(x - y)` is actually what a (python) programmer might expect. I think I would like to deprecate at least the unary one. The ~ kind of behaviour just doesn't fit as far as I can see. > 5. `/` is useless > 6 `**` follows from 1. Both of these are currently not defined, they will just cause upcast to int8. I suppose it would be possible to deprecate that upcast though (same goes for most all other ufuncs/operators in principle). > > Josef > > > > > > -- > > Nathaniel J. 
Smith > > Postdoctoral researcher - Informatics - University of Edinburgh > > http://vorpus.org > > _______________________________________________ > > NumPy-Discussion mailing list > > NumPy-Discussion at scipy.org > > http://mail.scipy.org/mailman/listinfo/numpy-discussion > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From cgohlke at uci.edu Fri Dec 6 16:01:32 2013 From: cgohlke at uci.edu (Christoph Gohlke) Date: Fri, 06 Dec 2013 13:01:32 -0800 Subject: [Numpy-discussion] distributing wheels & SSE/superpack options In-Reply-To: References: <52A235C8.8030103@uci.edu> Message-ID: <52A23B2C.9010807@uci.edu> On 12/6/2013 12:40 PM, David Cournapeau wrote: > > > > On Fri, Dec 6, 2013 at 8:38 PM, Christoph Gohlke > wrote: > > On 12/6/2013 10:06 AM, Ralf Gommers wrote: > > Hi all, > > > > There are a few discussions on packaging for the scientific Python stack > > ongoing, on the NumFOCUS and distutils lists: > >https://groups.google.com/forum/#!topic/numfocus/mVNakFqfpZg > > > > >https://groups.google.com/forum/#!topic/numfocus/HUcwXTM_jNY > > > > >http://thread.gmane.org/gmane.comp.python.distutils.devel/20202 > >http://thread.gmane.org/gmane.comp.python.distutils.devel/20296 > > > > One of the things that we should start doing for numpy is distribute > > releases as wheels. On OS X at least this is quite simple, so I propose > > to just experiment with it. I can create some to try out and put them on > > a separate folder on SourceForge. If that works they can be put on PyPi. > > > > For Windows things are less simple, because the wheel format doesn't > > handle the multiple builds (no SSE, SSE2, SSE3) that are in the > > superpack installers. A problem is that we don't really know how many > > users still have old CPUs that don't support SSE3. 
The impact for those > > users is high, numpy will install but crash (see > >https://github.com/scipy/scipy/issues/1697). Questions: > > 1. does anyone have a good idea to obtain statistics? > > 2. in the absence of statistics, can we do an experiment by putting one > > wheel up on PyPi which contains SSE3 instructions, for python 3.3 I > > propose, and seeing for how many (if any) users this goes wrong? > > > > Ralf > > > > P.S. related question: did anyone check whether the recently merged > > NPY_HAVE_SSE2_INTRINSIC puts SSE2 instructions into the no-SSE binary? > > > > > > Has anyone succeeded building wheels for numpy, scipy, and matplotlib? > > > I did for numpy and scipy. You had to hack a bit numpy.distutils to make > it work for scipy,but nothing that would be too complicated to really fix. > > In your case, the trick is to use the setupegg file: python setupegg.py > bdist_wheel > > David > Thank you. The setupegg.py trick worked. Could the numpy.distutils hack be applied to the numpy 1.8.x and master branches? I'll try to fix the matplotlib issue. Christoph From josef.pktd at gmail.com Fri Dec 6 16:14:13 2013 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Fri, 6 Dec 2013 16:14:13 -0500 Subject: [Numpy-discussion] Deprecate boolean math operators? 
In-Reply-To: <1386363042.29623.10.camel@sebastian-laptop> References: <1386283039.23728.6.camel@sebastian-laptop> <52A14D0C.1060209@gmail.com> <52A1F793.3010506@gmail.com> <52A21468.8010809@gmail.com> <52A21B7C.6000908@gmail.com> <1386363042.29623.10.camel@sebastian-laptop> Message-ID: On Fri, Dec 6, 2013 at 3:50 PM, Sebastian Berg wrote: > On Fri, 2013-12-06 at 15:30 -0500, josef.pktd at gmail.com wrote: >> On Fri, Dec 6, 2013 at 2:59 PM, Nathaniel Smith wrote: >> > On Fri, Dec 6, 2013 at 11:55 AM, Alexander Belopolsky wrote: >> >> >> >> >> >> >> >> On Fri, Dec 6, 2013 at 1:46 PM, Alan G Isaac wrote: >> >>> >> >>> On 12/6/2013 1:35 PM, josef.pktd at gmail.com wrote: >> >>> > unary versus binary minus >> >>> >> >>> Oh right; I consider binary `-` broken for >> >>> Boolean arrays. (Sorry Alexander; I did not >> >>> see your entire issue.) >> >>> >> >>> >> >>> > I'd rather write ~ than unary - if that's what it is. >> >>> >> >>> I agree. So I have no objection to elimination >> >>> of the `-`. >> >> >> >> >> >> It looks like we are close to reaching a consensus on the following points: >> >> >> >> 1. * is well-defined on boolean arrays and may be used in preference of & in >> >> code that is designed to handle 1s and 0s of any dtype in addition to >> >> booleans. >> >> >> >> 2. + is defined consistently with * and the only issue is the absence of >> >> additive inverse. This is not a problem as long as presence of - does not >> >> suggest otherwise. >> >> >> >> 3. binary and unary minus should be deprecated because its use in >> >> expressions where variables can be either boolean or numeric would lead to >> >> subtle bugs. For example -x*y would produce different results from -(x*y) >> >> depending on whether x is boolean or not. In all situations, ^ is >> >> preferable to binary - and ~ is preferable to unary -. >> >> >> >> 4. changing boolean arithmetics to auto-promotion to int is precluded by a >> >> significant use-case of boolean matrices. 
>> > >> > +1 >> >> +0.5 >> (I would still prefer a different binary minus, but it would be >> inconsistent with a logical unary minus that negates.) >> > > The question is if the current xor behaviour can make sense? It doesn't > seem to make much sense mathematically? Which only leaves that `abs(x - > y)` is actually what a (python) programmer might expect. > I think I would like to deprecate at least the unary one. The ~ kind of > behaviour just doesn't fit as far as I can see. I haven't seen any real use cases for xor yet. My impression is that both plus and minus are just overflow accidents and not intentional. plus works in a useful way, minus as xor might be used once per century. I would deprecate both unary and binary minus. (And when nobody is looking in two versions from now, I would add a binary minus that overflows to the clipped version, so I get a set subtraction. :) > >> 5. `/` is useless >> 6 `**` follows from 1. >>> m1 ** m2 array([1, 0, 1, 1], dtype=int8) >>> m1 ** 2 array([False, False, True, True], dtype=bool) >>> m1 ** 3 array([0, 0, 1, 1]) but I'm using python with an old numpy right now >>> np.__version__ '1.6.1' > > Both of these are currently not defined, they will just cause upcast to > int8. I suppose it would be possible to deprecate that upcast though > (same goes for most all other ufuncs/operators in principle). We would have to start the discussion again for all other operators/ufuncs to see if they are useful in some cases. For most treating as int will make sense, I guess. Josef > >> >> Josef >> >> >> > >> > -- >> > Nathaniel J. 
Smith >> > Postdoctoral researcher - Informatics - University of Edinburgh >> > http://vorpus.org >> > _______________________________________________ >> > NumPy-Discussion mailing list >> > NumPy-Discussion at scipy.org >> > http://mail.scipy.org/mailman/listinfo/numpy-discussion >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion >> > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From ralf.gommers at gmail.com Fri Dec 6 16:26:15 2013 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Fri, 6 Dec 2013 22:26:15 +0100 Subject: [Numpy-discussion] distributing wheels & SSE/superpack options In-Reply-To: <52A23B2C.9010807@uci.edu> References: <52A235C8.8030103@uci.edu> <52A23B2C.9010807@uci.edu> Message-ID: On Fri, Dec 6, 2013 at 10:01 PM, Christoph Gohlke wrote: > On 12/6/2013 12:40 PM, David Cournapeau wrote: > > > > > > On Fri, Dec 6, 2013 at 8:38 PM, Christoph Gohlke > > wrote: > > > > Has anyone succeeded building wheels for numpy, scipy, and > matplotlib? > > > > I did for numpy and scipy. You had to hack a bit numpy.distutils to make > > it work for scipy,but nothing that would be too complicated to really > fix. > > > > In your case, the trick is to use the setupegg file: python setupegg.py > > bdist_wheel > > > > David > > > > Thank you. The setupegg.py trick worked. Could the numpy.distutils hack > be applied to the numpy 1.8.x and master branches? I'll try to fix the > matplotlib issue. > This should make ``python setup.py bdist_wheel`` work: https://github.com/numpy/numpy/pull/4110 Ralf -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From alan.isaac at gmail.com Fri Dec 6 17:09:59 2013 From: alan.isaac at gmail.com (Alan G Isaac) Date: Fri, 06 Dec 2013 17:09:59 -0500 Subject: [Numpy-discussion] Deprecate boolean math operators? In-Reply-To: References: <1386283039.23728.6.camel@sebastian-laptop> <52A14D0C.1060209@gmail.com> <52A1F793.3010506@gmail.com> <52A21468.8010809@gmail.com> <52A21B7C.6000908@gmail.com> Message-ID: <52A24B37.7090200@gmail.com> On 12/6/2013 3:30 PM, josef.pktd at gmail.com wrote: > 6 `**` follows from 1. Yes, but what really matters is that linalg.matrix_power give the correct (boolean) result. Alan From alan.isaac at gmail.com Fri Dec 6 17:14:23 2013 From: alan.isaac at gmail.com (Alan G Isaac) Date: Fri, 06 Dec 2013 17:14:23 -0500 Subject: [Numpy-discussion] Deprecate boolean math operators? In-Reply-To: <1386363042.29623.10.camel@sebastian-laptop> References: <1386283039.23728.6.camel@sebastian-laptop> <52A14D0C.1060209@gmail.com> <52A1F793.3010506@gmail.com> <52A21468.8010809@gmail.com> <52A21B7C.6000908@gmail.com> <1386363042.29623.10.camel@sebastian-laptop> Message-ID: <52A24C3F.4060308@gmail.com> On 12/6/2013 3:50 PM, Sebastian Berg wrote: > Both of these are currently not defined, they will just cause upcast to > int8. What does currently mean? `**` works fine for boolean arrays in 1.7.1. (It's useless, but it works.) Alan Isaac From ralf.gommers at gmail.com Fri Dec 6 17:16:37 2013 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Fri, 6 Dec 2013 23:16:37 +0100 Subject: [Numpy-discussion] distributing wheels & SSE/superpack options In-Reply-To: References: Message-ID: On Fri, Dec 6, 2013 at 9:37 PM, David Cournapeau wrote: > > > > On Fri, Dec 6, 2013 at 8:28 PM, Oscar Benjamin > wrote: > >> On 6 December 2013 20:09, Chris Barker wrote: >> >> 2. 
in the absence of statistics, can we do an experiment by putting one >> >> wheel up on PyPi which contains SSE3 instructions, for python 3.3 I >> propose, >> >> and seeing for how many (if any) users this goes wrong? >> > >> > >> > sounds good -- it looks like SSE3 has been around a good while: >> > >> > http://en.wikipedia.org/wiki/SSE3 >> > >> > 8+ years is a pretty long time in computer land! >> > >> > anyone know how long SSE3 has been around? >> >> I don't have statistics but I do have a couple of data points. Both of >> the computers I regularly use (my work desktop and my girlfriend's >> laptop) have SSE2 but not SSE3. >> >> Really I'm not sure that releasing a potentially compatible binary - >> with no install time checks - is such a good idea. What we really want >> is a situation where you can confidently advise someone to just "pip >> install numpy" without caveats i.e. a solution that "just works". >> > > agreed. > > Also, we should not lie to ourselves: our current ATLAS on windows are > most likely not very efficient anyway, SSE or not. > > Ralf, you mentioned that openblas was problematic on windows ? I could not > find any recent discussion on that list. > I didn't mean specifically on Windows. I based that on comments like: https://github.com/numpy/numpy/issues/4007#issuecomment-27688947 http://article.gmane.org/gmane.comp.python.scientific.devel/18098 https://github.com/numpy/numpy/issues/3545 Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From josef.pktd at gmail.com Fri Dec 6 17:29:34 2013 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Fri, 6 Dec 2013 17:29:34 -0500 Subject: [Numpy-discussion] Deprecate boolean math operators? 
In-Reply-To: References: <1386283039.23728.6.camel@sebastian-laptop> <52A14D0C.1060209@gmail.com> <52A1F793.3010506@gmail.com> <52A21468.8010809@gmail.com> <52A21B7C.6000908@gmail.com> <1386363042.29623.10.camel@sebastian-laptop> Message-ID: On Fri, Dec 6, 2013 at 4:14 PM, wrote: > On Fri, Dec 6, 2013 at 3:50 PM, Sebastian Berg > wrote: >> On Fri, 2013-12-06 at 15:30 -0500, josef.pktd at gmail.com wrote: >>> On Fri, Dec 6, 2013 at 2:59 PM, Nathaniel Smith wrote: >>> > On Fri, Dec 6, 2013 at 11:55 AM, Alexander Belopolsky wrote: >>> >> >>> >> >>> >> >>> >> On Fri, Dec 6, 2013 at 1:46 PM, Alan G Isaac wrote: >>> >>> >>> >>> On 12/6/2013 1:35 PM, josef.pktd at gmail.com wrote: >>> >>> > unary versus binary minus >>> >>> >>> >>> Oh right; I consider binary `-` broken for >>> >>> Boolean arrays. (Sorry Alexander; I did not >>> >>> see your entire issue.) >>> >>> >>> >>> >>> >>> > I'd rather write ~ than unary - if that's what it is. >>> >>> >>> >>> I agree. So I have no objection to elimination >>> >>> of the `-`. >>> >> >>> >> >>> >> It looks like we are close to reaching a consensus on the following points: >>> >> >>> >> 1. * is well-defined on boolean arrays and may be used in preference of & in >>> >> code that is designed to handle 1s and 0s of any dtype in addition to >>> >> booleans. >>> >> >>> >> 2. + is defined consistently with * and the only issue is the absence of >>> >> additive inverse. This is not a problem as long as presence of - does not >>> >> suggest otherwise. >>> >> >>> >> 3. binary and unary minus should be deprecated because its use in >>> >> expressions where variables can be either boolean or numeric would lead to >>> >> subtle bugs. For example -x*y would produce different results from -(x*y) >>> >> depending on whether x is boolean or not. In all situations, ^ is >>> >> preferable to binary - and ~ is preferable to unary -. >>> >> >>> >> 4. 
changing boolean arithmetics to auto-promotion to int is precluded by a >>> >> significant use-case of boolean matrices. >>> > >>> > +1 >>> >>> +0.5 >>> (I would still prefer a different binary minus, but it would be >>> inconsistent with a logical unary minus that negates.) >>> >> >> The question is if the current xor behaviour can make sense? It doesn't >> seem to make much sense mathematically? Which only leaves that `abs(x - >> y)` is actually what a (python) programmer might expect. >> I think I would like to deprecate at least the unary one. The ~ kind of >> behaviour just doesn't fit as far as I can see. > > I haven't seen any real use cases for xor yet. > My impression is that both plus and minus are just overflow accidents > and not intentional. plus works in a useful way, minus as xor might be > used once per century. > > I would deprecate both unary and binary minus. > > (And when nobody is looking in two versions from now, I would add a > binary minus that overflows to the clipped version, so I get a set > subtraction. :) Actually minus works as expected if we avoid negative overflow: >>> m1 - m1*m2 array([False, False, True, False], dtype=bool) >>> m1 * ~m2 array([False, False, True, False], dtype=bool) >>> m1 & ~m2 array([False, False, True, False], dtype=bool) I find the first easy to read, but m1 - m2 would be one operation less, and chain more easily m1 - m2 - m3 m1 are mailing list subscribers, take away m2 owners of apples, take away m3 users of Linux = exotic developers Josef > >> >>> 5. `/` is useless >>> 6 `**` follows from 1. > >>>> m1 ** m2 > array([1, 0, 1, 1], dtype=int8) >>>> m1 ** 2 > array([False, False, True, True], dtype=bool) >>>> m1 ** 3 > array([0, 0, 1, 1]) > > but I'm using python with an old numpy right now >>>> np.__version__ > '1.6.1' > >> >> Both of these are currently not defined, they will just cause upcast to >> int8. 
I suppose it would be possible to deprecate that upcast though >> (same goes for most all other ufuncs/operators in principle). > > We would have to start the discussion again for all other > operators/ufuncs to see if they are useful in some cases. > For most treating as int will make sense, I guess. > > Josef > >> >>> >>> Josef >>> >>> >>> > >>> > -- >>> > Nathaniel J. Smith >>> > Postdoctoral researcher - Informatics - University of Edinburgh >>> > http://vorpus.org >>> > _______________________________________________ >>> > NumPy-Discussion mailing list >>> > NumPy-Discussion at scipy.org >>> > http://mail.scipy.org/mailman/listinfo/numpy-discussion >>> _______________________________________________ >>> NumPy-Discussion mailing list >>> NumPy-Discussion at scipy.org >>> http://mail.scipy.org/mailman/listinfo/numpy-discussion >>> >> >> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion From njs at pobox.com Fri Dec 6 17:45:28 2013 From: njs at pobox.com (Nathaniel Smith) Date: Fri, 6 Dec 2013 14:45:28 -0800 Subject: [Numpy-discussion] Deprecate boolean math operators? In-Reply-To: References: <1386283039.23728.6.camel@sebastian-laptop> <52A14D0C.1060209@gmail.com> <52A1F793.3010506@gmail.com> <52A21468.8010809@gmail.com> <52A21B7C.6000908@gmail.com> <1386363042.29623.10.camel@sebastian-laptop> Message-ID: Not sure how much time it's worth spending on coming up with new definitions for boolean subtraction, since even if we deprecate the current behavior now we won't be able to implement any of them for a year+, and then we'll end up having to go through these debates again then anyway. 
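[The replacements recommended in point 3 of the consensus above are easy to check. A minimal sketch; the array values here are illustrative, not taken from the thread, and only the recommended spellings are shown since later NumPy releases raise TypeError for `-` on boolean arrays:]

```python
import numpy as np

x = np.array([True, True, False, False])
y = np.array([True, False, True, False])

# ^ in place of binary -: elementwise xor (symmetric difference of the masks)
assert list(x ^ y) == [False, True, True, False]

# ~ in place of unary -: elementwise logical negation
assert list(~x) == [False, False, True, True]

# josef's "set subtraction" (in x but not in y), without using - at all
assert list(x & ~y) == [False, True, False, False]
```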
-n On Fri, Dec 6, 2013 at 2:29 PM, wrote: > On Fri, Dec 6, 2013 at 4:14 PM, wrote: >> On Fri, Dec 6, 2013 at 3:50 PM, Sebastian Berg >> wrote: >>> On Fri, 2013-12-06 at 15:30 -0500, josef.pktd at gmail.com wrote: >>>> On Fri, Dec 6, 2013 at 2:59 PM, Nathaniel Smith wrote: >>>> > On Fri, Dec 6, 2013 at 11:55 AM, Alexander Belopolsky wrote: >>>> >> >>>> >> >>>> >> >>>> >> On Fri, Dec 6, 2013 at 1:46 PM, Alan G Isaac wrote: >>>> >>> >>>> >>> On 12/6/2013 1:35 PM, josef.pktd at gmail.com wrote: >>>> >>> > unary versus binary minus >>>> >>> >>>> >>> Oh right; I consider binary `-` broken for >>>> >>> Boolean arrays. (Sorry Alexander; I did not >>>> >>> see your entire issue.) >>>> >>> >>>> >>> >>>> >>> > I'd rather write ~ than unary - if that's what it is. >>>> >>> >>>> >>> I agree. So I have no objection to elimination >>>> >>> of the `-`. >>>> >> >>>> >> >>>> >> It looks like we are close to reaching a consensus on the following points: >>>> >> >>>> >> 1. * is well-defined on boolean arrays and may be used in preference of & in >>>> >> code that is designed to handle 1s and 0s of any dtype in addition to >>>> >> booleans. >>>> >> >>>> >> 2. + is defined consistently with * and the only issue is the absence of >>>> >> additive inverse. This is not a problem as long as presence of - does not >>>> >> suggest otherwise. >>>> >> >>>> >> 3. binary and unary minus should be deprecated because its use in >>>> >> expressions where variables can be either boolean or numeric would lead to >>>> >> subtle bugs. For example -x*y would produce different results from -(x*y) >>>> >> depending on whether x is boolean or not. In all situations, ^ is >>>> >> preferable to binary - and ~ is preferable to unary -. >>>> >> >>>> >> 4. changing boolean arithmetics to auto-promotion to int is precluded by a >>>> >> significant use-case of boolean matrices. 
>>>> > >>>> > +1 >>>> >>>> +0.5 >>>> (I would still prefer a different binary minus, but it would be >>>> inconsistent with a logical unary minus that negates.) >>>> >>> >>> The question is if the current xor behaviour can make sense? It doesn't >>> seem to make much sense mathematically? Which only leaves that `abs(x - >>> y)` is actually what a (python) programmer might expect. >>> I think I would like to deprecate at least the unary one. The ~ kind of >>> behaviour just doesn't fit as far as I can see. >> >> I haven't seen any real use cases for xor yet. >> My impression is that both plus and minus are just overflow accidents >> and not intentional. plus works in a useful way, minus as xor might be >> used once per century. >> >> I would deprecate both unary and binary minus. >> >> (And when nobody is looking in two versions from now, I would add a >> binary minus that overflows to the clipped version, so I get a set >> subtraction. :) > > Actually minus works as expected if we avoid negative overflow: > >>>> m1 - m1*m2 > array([False, False, True, False], dtype=bool) >>>> m1 * ~m2 > array([False, False, True, False], dtype=bool) >>>> m1 & ~m2 > array([False, False, True, False], dtype=bool) > > I find the first easy to read, but m1 - m2 would be one operation > less, and chain more easily m1 - m2 - m3 > m1 are mailing list subscribers, take away > m2 owners of apples, take away > m3 users of Linux > = exotic developers > > Josef > > > > > >> >>> >>>> 5. `/` is useless >>>> 6 `**` follows from 1. >> >>>>> m1 ** m2 >> array([1, 0, 1, 1], dtype=int8) >>>>> m1 ** 2 >> array([False, False, True, True], dtype=bool) >>>>> m1 ** 3 >> array([0, 0, 1, 1]) >> >> but I'm using python with an old numpy right now >>>>> np.__version__ >> '1.6.1' >> >>> >>> Both of these are currently not defined, they will just cause upcast to >>> int8. I suppose it would be possible to deprecate that upcast though >>> (same goes for most all other ufuncs/operators in principle). 
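The chained set subtraction described above (subscribers, take away apple owners, take away Linux users) can be spelled today with `&` and `~`, which are not slated for deprecation. A minimal sketch, with membership masks invented for illustration:

```python
import numpy as np

# Hypothetical membership masks over the same five people.
subscribers  = np.array([True, True, True, True, True])     # m1
apple_owners = np.array([False, True, False, True, False])  # m2
linux_users  = np.array([False, False, True, True, False])  # m3

# "m1, take away m2, take away m3" without relying on boolean `-`:
exotic = subscribers & ~apple_owners & ~linux_users
# exotic.tolist() == [True, False, False, False, True]
```

Unlike `m1 - m2 - m3`, this chain cannot silently change meaning if one of the operands later turns out to be an integer array.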
>> >> We would have to start the discussion again for all other >> operators/ufuncs to see if they are useful in some cases. >> For most treating as int will make sense, I guess. >> >> Josef >> >>> >>>> >>>> Josef >>>> >>>> >>>> > >>>> > -- >>>> > Nathaniel J. Smith >>>> > Postdoctoral researcher - Informatics - University of Edinburgh >>>> > http://vorpus.org >>>> > _______________________________________________ >>>> > NumPy-Discussion mailing list >>>> > NumPy-Discussion at scipy.org >>>> > http://mail.scipy.org/mailman/listinfo/numpy-discussion >>>> _______________________________________________ >>>> NumPy-Discussion mailing list >>>> NumPy-Discussion at scipy.org >>>> http://mail.scipy.org/mailman/listinfo/numpy-discussion >>>> >>> >>> >>> _______________________________________________ >>> NumPy-Discussion mailing list >>> NumPy-Discussion at scipy.org >>> http://mail.scipy.org/mailman/listinfo/numpy-discussion > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion -- Nathaniel J. Smith Postdoctoral researcher - Informatics - University of Edinburgh http://vorpus.org From josef.pktd at gmail.com Fri Dec 6 23:19:21 2013 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Fri, 6 Dec 2013 23:19:21 -0500 Subject: [Numpy-discussion] Deprecate boolean math operators? 
In-Reply-To: References: <1386283039.23728.6.camel@sebastian-laptop> <52A14D0C.1060209@gmail.com> <52A1F793.3010506@gmail.com> <52A21468.8010809@gmail.com> <52A21B7C.6000908@gmail.com> <1386363042.29623.10.camel@sebastian-laptop> Message-ID: On Fri, Dec 6, 2013 at 5:45 PM, Nathaniel Smith wrote: > Not sure how much time it's worth spending on coming up with new > definitions for boolean subtraction, since even if we deprecate the > current behavior now we won't be able to implement any of them for a > year+, and then we'll end up having to go through these debates again > then anyway. I didn't argue against deprecation of the boolean minuses. I'm fine with that. Just some early lobbying, and so I can save my examples where I can google them in case I'm still around if or when we can revisit the issue. Once I turn off the python interpreter that I used for the examples, I will forget everything about weird boolean operations. One advantage of this thread is that I had to look up the math for indicator functions, and that I have a better idea where I could use logical operators instead of (linear) algebra. Josef > > -n > > On Fri, Dec 6, 2013 at 2:29 PM, wrote: >> On Fri, Dec 6, 2013 at 4:14 PM, wrote: >>> On Fri, Dec 6, 2013 at 3:50 PM, Sebastian Berg >>> wrote: >>>> On Fri, 2013-12-06 at 15:30 -0500, josef.pktd at gmail.com wrote: >>>>> On Fri, Dec 6, 2013 at 2:59 PM, Nathaniel Smith wrote: >>>>> > On Fri, Dec 6, 2013 at 11:55 AM, Alexander Belopolsky wrote: >>>>> >> >>>>> >> >>>>> >> >>>>> >> On Fri, Dec 6, 2013 at 1:46 PM, Alan G Isaac wrote: >>>>> >>> >>>>> >>> On 12/6/2013 1:35 PM, josef.pktd at gmail.com wrote: >>>>> >>> > unary versus binary minus >>>>> >>> >>>>> >>> Oh right; I consider binary `-` broken for >>>>> >>> Boolean arrays. (Sorry Alexander; I did not >>>>> >>> see your entire issue.) >>>>> >>> >>>>> >>> >>>>> >>> > I'd rather write ~ than unary - if that's what it is. >>>>> >>> >>>>> >>> I agree.
So I have no objection to elimination >>>>> >>> of the `-`. >>>>> >> >>>>> >> >>>>> >> It looks like we are close to reaching a consensus on the following points: >>>>> >> >>>>> >> 1. * is well-defined on boolean arrays and may be used in preference of & in >>>>> >> code that is designed to handle 1s and 0s of any dtype in addition to >>>>> >> booleans. >>>>> >> >>>>> >> 2. + is defined consistently with * and the only issue is the absence of >>>>> >> additive inverse. This is not a problem as long as presence of - does not >>>>> >> suggest otherwise. >>>>> >> >>>>> >> 3. binary and unary minus should be deprecated because its use in >>>>> >> expressions where variables can be either boolean or numeric would lead to >>>>> >> subtle bugs. For example -x*y would produce different results from -(x*y) >>>>> >> depending on whether x is boolean or not. In all situations, ^ is >>>>> >> preferable to binary - and ~ is preferable to unary -. >>>>> >> >>>>> >> 4. changing boolean arithmetics to auto-promotion to int is precluded by a >>>>> >> significant use-case of boolean matrices. >>>>> > >>>>> > +1 >>>>> >>>>> +0.5 >>>>> (I would still prefer a different binary minus, but it would be >>>>> inconsistent with a logical unary minus that negates.) >>>>> >>>> >>>> The question is if the current xor behaviour can make sense? It doesn't >>>> seem to make much sense mathematically? Which only leaves that `abs(x - >>>> y)` is actually what a (python) programmer might expect. >>>> I think I would like to deprecate at least the unary one. The ~ kind of >>>> behaviour just doesn't fit as far as I can see. >>> >>> I haven't seen any real use cases for xor yet. >>> My impression is that both plus and minus are just overflow accidents >>> and not intentional. plus works in a useful way, minus as xor might be >>> used once per century. >>> >>> I would deprecate both unary and binary minus. 
>>> >>> (And when nobody is looking in two versions from now, I would add a >>> binary minus that overflows to the clipped version, so I get a set >>> subtraction. :) >> >> Actually minus works as expected if we avoid negative overflow: >> >>>>> m1 - m1*m2 >> array([False, False, True, False], dtype=bool) >>>>> m1 * ~m2 >> array([False, False, True, False], dtype=bool) >>>>> m1 & ~m2 >> array([False, False, True, False], dtype=bool) >> >> I find the first easy to read, but m1 - m2 would be one operation >> less, and chain more easily m1 - m2 - m3 >> m1 are mailing list subscribers, take away >> m2 owners of apples, take away >> m3 users of Linux >> = exotic developers >> >> Josef >> >> >> >> >> >>> >>>> >>>>> 5. `/` is useless >>>>> 6 `**` follows from 1. >>> >>>>>> m1 ** m2 >>> array([1, 0, 1, 1], dtype=int8) >>>>>> m1 ** 2 >>> array([False, False, True, True], dtype=bool) >>>>>> m1 ** 3 >>> array([0, 0, 1, 1]) >>> >>> but I'm using python with an old numpy right now >>>>>> np.__version__ >>> '1.6.1' >>> >>>> >>>> Both of these are currently not defined, they will just cause upcast to >>>> int8. I suppose it would be possible to deprecate that upcast though >>>> (same goes for most all other ufuncs/operators in principle). >>> >>> We would have to start the discussion again for all other >>> operators/ufuncs to see if they are useful in some cases. >>> For most treating as int will make sense, I guess. >>> >>> Josef >>> >>>> >>>>> >>>>> Josef >>>>> >>>>> >>>>> > >>>>> > -- >>>>> > Nathaniel J. 
Smith >>>>> > Postdoctoral researcher - Informatics - University of Edinburgh >>>>> > http://vorpus.org >>>>> > _______________________________________________ >>>>> > NumPy-Discussion mailing list >>>>> > NumPy-Discussion at scipy.org >>>>> > http://mail.scipy.org/mailman/listinfo/numpy-discussion >>>>> _______________________________________________ >>>>> NumPy-Discussion mailing list >>>>> NumPy-Discussion at scipy.org >>>>> http://mail.scipy.org/mailman/listinfo/numpy-discussion >>>>> >>>> >>>> >>>> _______________________________________________ >>>> NumPy-Discussion mailing list >>>> NumPy-Discussion at scipy.org >>>> http://mail.scipy.org/mailman/listinfo/numpy-discussion >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion > > > > -- > Nathaniel J. Smith > Postdoctoral researcher - Informatics - University of Edinburgh > http://vorpus.org > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From charlesr.harris at gmail.com Fri Dec 6 23:25:00 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Fri, 6 Dec 2013 21:25:00 -0700 Subject: [Numpy-discussion] Deprecate boolean math operators? 
In-Reply-To: References: <1386283039.23728.6.camel@sebastian-laptop> <52A14D0C.1060209@gmail.com> <52A1F793.3010506@gmail.com> <52A21468.8010809@gmail.com> <52A21B7C.6000908@gmail.com> <1386363042.29623.10.camel@sebastian-laptop> Message-ID: On Fri, Dec 6, 2013 at 2:14 PM, wrote: > On Fri, Dec 6, 2013 at 3:50 PM, Sebastian Berg > wrote: > > On Fri, 2013-12-06 at 15:30 -0500, josef.pktd at gmail.com wrote: > >> On Fri, Dec 6, 2013 at 2:59 PM, Nathaniel Smith wrote: > >> > On Fri, Dec 6, 2013 at 11:55 AM, Alexander Belopolsky < > ndarray at mac.com> wrote: > >> >> > >> >> > >> >> > >> >> On Fri, Dec 6, 2013 at 1:46 PM, Alan G Isaac > wrote: > >> >>> > >> >>> On 12/6/2013 1:35 PM, josef.pktd at gmail.com wrote: > >> >>> > unary versus binary minus > >> >>> > >> >>> Oh right; I consider binary `-` broken for > >> >>> Boolean arrays. (Sorry Alexander; I did not > >> >>> see your entire issue.) > >> >>> > >> >>> > >> >>> > I'd rather write ~ than unary - if that's what it is. > >> >>> > >> >>> I agree. So I have no objection to elimination > >> >>> of the `-`. > >> >> > >> >> > >> >> It looks like we are close to reaching a consensus on the following > points: > >> >> > >> >> 1. * is well-defined on boolean arrays and may be used in preference > of & in > >> >> code that is designed to handle 1s and 0s of any dtype in addition to > >> >> booleans. > >> >> > >> >> 2. + is defined consistently with * and the only issue is the > absence of > >> >> additive inverse. This is not a problem as long as presence of - > does not > >> >> suggest otherwise. > >> >> > >> >> 3. binary and unary minus should be deprecated because its use in > >> >> expressions where variables can be either boolean or numeric would > lead to > >> >> subtle bugs. For example -x*y would produce different results from > -(x*y) > >> >> depending on whether x is boolean or not. In all situations, ^ is > >> >> preferable to binary - and ~ is preferable to unary -. > >> >> > >> >> 4. 
changing boolean arithmetics to auto-promotion to int is > precluded by a > >> >> significant use-case of boolean matrices. > >> > > >> > +1 > >> > >> +0.5 > >> (I would still prefer a different binary minus, but it would be > >> inconsistent with a logical unary minus that negates.) > >> > > > > The question is if the current xor behaviour can make sense? It doesn't > > seem to make much sense mathematically? Which only leaves that `abs(x - > > y)` is actually what a (python) programmer might expect. > > I think I would like to deprecate at least the unary one. The ~ kind of > > behaviour just doesn't fit as far as I can see. > > I haven't seen any real use cases for xor yet. > Using it instead of '+' yields a boolean ring instead of semi-ring. Papers from the first quarter of the last century used it pretty often on that account, hence 'sigma-rings', etc. Eventually the simplicity of the inclusive or overcame that tendency. My impression is that both plus and minus are just overflow accidents > and not intentional. plus works in a useful way, minus as xor might be > used once per century. > It's certainly weird given that '+' means the inclusive or. I think '^' is much preferable. Although it makes some sense if one can keep the semantics straight. Complicated, though. > I would deprecate both unary and binary minus. > > (And when nobody is looking in two versions from now, I would add a > binary minus that overflows to the clipped version, so I get a set > subtraction. :) > Where is '\' when you need it? Chuck -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From chris.barker at noaa.gov Sat Dec 7 01:44:32 2013 From: chris.barker at noaa.gov (Chris Barker) Date: Fri, 6 Dec 2013 22:44:32 -0800 Subject: [Numpy-discussion] distributing wheels & SSE/superpack options In-Reply-To: References: Message-ID: On Fri, Dec 6, 2013 at 10:06 AM, Ralf Gommers wrote: > One of the things that we should start doing for numpy is distribute > releases as wheels. On OS X at least this is quite simple, so I propose to > just experiment with it. > OK -- maybe on the wrong list, but an itch of mine is OSX binaries of IPython (and the dependencies required for the notebook, too). There is right now no way for an OS X user without the compiler setup to get iPython without going to Anaconda or Canopy. Yet it's a really great tool for newbies.... So I just sat down and did a simple: pip wheel --wheel-dir=wheelhouse2 ipython[all] Wow -- took a little while, but presto! A pile of wheels, ready to go: $ ls wheelhouse/ Jinja2-2.7.1-py27-none-any.whl pyzmq-14.0.1-cp27-none-macosx_10_6_intel.whl MarkupSafe-0.18-cp27-none-macosx_10_6_intel.whl readline-6.2.4.1-cp27-none-macosx_10_6_intel.whl Pygments-1.6-py27-none-any.whl tornado-3.1.1-py27-none-any.whl ipython-1.1.0-py27-none-any.whl Now, do they work? They do on my machine. Is there somewhere I could put them up so folks could test? -Chris > I can create some to try out and put them on a separate folder on > SourceForge. If that works they can be put on PyPi. > > For Windows things are less simple, because the wheel format doesn't > handle the multiple builds (no SSE, SSE2, SSE3) that are in the superpack > installers. A problem is that we don't really know how many users still > have old CPUs that don't support SSE3. The impact for those users is high, > numpy will install but crash (see > https://github.com/scipy/scipy/issues/1697). Questions: > 1. does anyone have a good idea to obtain statistics? > 2.
in the absence of statistics, can we do an experiment by putting one > wheel up on PyPi which contains SSE3 instructions, for python 3.3 I > propose, and seeing for how many (if any) users this goes wrong? > > Ralf > > P.S. related question: did anyone check whether the recently merged > NPY_HAVE_SSE2_INTRINSIC puts SSE2 instructions into the no-SSE binary? > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Sat Dec 7 07:09:17 2013 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Sat, 7 Dec 2013 13:09:17 +0100 Subject: [Numpy-discussion] distributing wheels & SSE/superpack options In-Reply-To: References: Message-ID: On Sat, Dec 7, 2013 at 7:44 AM, Chris Barker wrote: > On Fri, Dec 6, 2013 at 10:06 AM, Ralf Gommers wrote: > >> One of the things that we should start doing for numpy is distribute >> releases as wheels. On OS X at least this is quite simple, so I propose to >> just experiment with it. >> > > OK -- maybe on the wrong list, but an itch of mine is OSX binaries of > IPython (and the dependencies required for the notebook, too. There is > right no way for an OS_X user without the compiler setup to get iPython > without going to Anaconda or Canopy, > > Yet it's a really great tool for newbies.... > > So I just sat down and did a simple: > > pip wheel --wheel-dir=wheelhouse2 ipython[all] > > Wow -- took a little while, but presto! 
A pile of wheels, ready to go: > > $ ls wheelhouse/ > Jinja2-2.7.1-py27-none-any.whl > pyzmq-14.0.1-cp27-none-macosx_10_6_intel.whl > MarkupSafe-0.18-cp27-none-macosx_10_6_intel.whl > readline-6.2.4.1-cp27-none-macosx_10_6_intel.whl > Pygments-1.6-py27-none-any.whl > tornado-3.1.1-py27-none-any.whl > ipython-1.1.0-py27-none-any.whl > > > Now, do they work? They do on my machine. Is there somewhere I could put > them up so folks could test? > You can't upload that whole stack anywhere pip finds it automatically. Temporarily you can put them on SourceForge or on any public download site. Then people can download and install with wheel. If you send me a link to those files, then I'll put them up together with the numpy wheels on SF. Ralf > > -Chris > > > > > > > I can create some to try out and put them on a separate folder on >> SourceForge. If that works they can be put on PyPi. >> >> For Windows things are less simple, because the wheel format doesn't >> handle the multiple builds (no SSE, SSE2, SSE3) that are in the superpack >> installers. A problem is that we don't really know how many users still >> have old CPUs that don't support SSE3. The impact for those users is high, >> numpy will install but crash (see >> https://github.com/scipy/scipy/issues/1697). Questions: >> 1. does anyone have a good idea to obtain statistics? >> 2. in the absence of statistics, can we do an experiment by putting one >> wheel up on PyPi which contains SSE3 instructions, for python 3.3 I >> propose, and seeing for how many (if any) users this goes wrong? >> >> Ralf >> >> P.S. related question: did anyone check whether the recently merged >> NPY_HAVE_SSE2_INTRINSIC puts SSE2 instructions into the no-SSE binary? >> >> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion >> >> > > > -- > > Christopher Barker, Ph.D. 
> Oceanographer > > Emergency Response Division > NOAA/NOS/OR&R (206) 526-6959 voice > 7600 Sand Point Way NE (206) 526-6329 fax > Seattle, WA 98115 (206) 526-6317 main reception > > Chris.Barker at noaa.gov > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From chris.barker at noaa.gov Sat Dec 7 14:01:38 2013 From: chris.barker at noaa.gov (Chris Barker) Date: Sat, 7 Dec 2013 11:01:38 -0800 Subject: [Numpy-discussion] distributing wheels & SSE/superpack options In-Reply-To: References: Message-ID: On Sat, Dec 7, 2013 at 4:09 AM, Ralf Gommers wrote: > Wow -- took a little while, but presto! A pile of wheels, ready to go: > >> >> $ ls wheelhouse/ >> Jinja2-2.7.1-py27-none-any.whl >> pyzmq-14.0.1-cp27-none-macosx_10_6_intel.whl >> MarkupSafe-0.18-cp27-none-macosx_10_6_intel.whl >> readline-6.2.4.1-cp27-none-macosx_10_6_intel.whl >> Pygments-1.6-py27-none-any.whl >> tornado-3.1.1-py27-none-any.whl >> ipython-1.1.0-py27-none-any.whl >> >> >> Now, do they work? They do on my machine. Is there somewhere I could put >> them up so folks could test? >> > > You can't upload that whole stack anywhere pip finds it automatically. > yeah, that's where I'm still a little confused about pip and a "wheelhouse" -- other than PyPi, is there a way to put a pile of wheels somewhere and point pip to them -- i.e. simple http or ftp server or something? Or are folks going to need to download the whole pile first, then point pip at a local dir? Temporarily you can put them on SourceForge or on any public download site. > Then people can download and install with wheel. If you send me a link to > those files, then I'll put them up together with the numpy wheels on SF. > Thanks -- I'll try to do that later today. 
-Chris > > Ralf > > >> >> -Chris >> >> >> >> >> >> >> I can create some to try out and put them on a separate folder on >>> SourceForge. If that works they can be put on PyPi. >>> >>> For Windows things are less simple, because the wheel format doesn't >>> handle the multiple builds (no SSE, SSE2, SSE3) that are in the superpack >>> installers. A problem is that we don't really know how many users still >>> have old CPUs that don't support SSE3. The impact for those users is high, >>> numpy will install but crash (see >>> https://github.com/scipy/scipy/issues/1697). Questions: >>> 1. does anyone have a good idea to obtain statistics? >>> 2. in the absence of statistics, can we do an experiment by putting one >>> wheel up on PyPi which contains SSE3 instructions, for python 3.3 I >>> propose, and seeing for how many (if any) users this goes wrong? >>> >>> Ralf >>> >>> P.S. related question: did anyone check whether the recently merged >>> NPY_HAVE_SSE2_INTRINSIC puts SSE2 instructions into the no-SSE binary? >>> >>> >>> _______________________________________________ >>> NumPy-Discussion mailing list >>> NumPy-Discussion at scipy.org >>> http://mail.scipy.org/mailman/listinfo/numpy-discussion >>> >>> >> >> >> -- >> >> Christopher Barker, Ph.D. >> Oceanographer >> >> Emergency Response Division >> NOAA/NOS/OR&R (206) 526-6959 voice >> 7600 Sand Point Way NE (206) 526-6329 fax >> Seattle, WA 98115 (206) 526-6317 main reception >> >> Chris.Barker at noaa.gov >> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion >> >> > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -- Christopher Barker, Ph.D. 
Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov -------------- next part -------------- An HTML attachment was scrubbed... URL: From chris.barker at noaa.gov Sat Dec 7 21:24:19 2013 From: chris.barker at noaa.gov (Chris Barker) Date: Sat, 7 Dec 2013 18:24:19 -0800 Subject: [Numpy-discussion] distributing wheels & SSE/superpack options In-Reply-To: References: Message-ID: > > Temporarily you can put them on SourceForge or on any public download >> site. Then people can download and install with wheel. If you send me a >> link to those files, then I'll put them up together with the numpy wheels >> on SF. >> > > Thanks -- I'll try to do that later today.-- > Done: https://www.dropbox.com/sh/pqn6fag18rgewlr/QQdNUwT7Fw/OSX_Wheels Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Sun Dec 8 05:06:55 2013 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Sun, 8 Dec 2013 11:06:55 +0100 Subject: [Numpy-discussion] ANN: Scipy 0.13.2 release Message-ID: Hi, I'm happy to announce the availability of the scipy 0.13.2 release. This is a bugfix only release; it contains fixes for ndimage and optimize, and most importantly was compiled with Cython 0.19.2 to fix memory leaks in code using Cython fused types. Source tarballs, binaries and release notes can be found at http://sourceforge.net/projects/scipy/files/scipy/0.13.2/ Cheers, Ralf ========================== SciPy 0.13.2 Release Notes ========================== SciPy 0.13.2 is a bug-fix release with no new features compared to 0.13.1. 
Issues fixed ------------ - 3096: require Cython 0.19, earlier versions have memory leaks in fused types - 3079: ``ndimage.label`` fix swapped 64-bitness test - 3108: ``optimize.fmin_slsqp`` constraint violation -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Sun Dec 8 05:59:18 2013 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Sun, 8 Dec 2013 11:59:18 +0100 Subject: [Numpy-discussion] distributing wheels & SSE/superpack options In-Reply-To: References: Message-ID: On Sat, Dec 7, 2013 at 8:01 PM, Chris Barker wrote: > On Sat, Dec 7, 2013 at 4:09 AM, Ralf Gommers wrote: > > Wow -- took a little while, but presto! A pile of wheels, ready to go: >> >>> >>> $ ls wheelhouse/ >>> Jinja2-2.7.1-py27-none-any.whl >>> pyzmq-14.0.1-cp27-none-macosx_10_6_intel.whl >>> MarkupSafe-0.18-cp27-none-macosx_10_6_intel.whl >>> readline-6.2.4.1-cp27-none-macosx_10_6_intel.whl >>> Pygments-1.6-py27-none-any.whl >>> tornado-3.1.1-py27-none-any.whl >>> ipython-1.1.0-py27-none-any.whl >>> >>> >>> Now, do they work? They do on my machine. Is there somewhere I could put >>> them up so folks could test? >>> >> >> You can't upload that whole stack anywhere pip finds it automatically. >> > > yeah, that's where I'm still a little confused about pip and a > "wheelhouse" -- other than PyPi, is there a way to put a pile of wheels > somewhere and point pip to them -- i.e. simple http or ftp server or > something? Or are folks going to need to download the whole pile first, > then point pip at a local dir? > I'm under the impression that $ pip install --use-wheel --no-index --find-links=/local_download_dir ipython and $ pip install --use-wheel --no-index --find-links=hosting_url ipython should both work. But I've been running into multiple issues so far - from having to upgrade pip itself and having to manually remove setuptools to having no wheel-2.7 command (when "wheel" is the 2.6 version). 
I've uploaded numpy and scipy wheels plus your set at http://sourceforge.net/projects/numpy/files/wheels_to_test/. I'll start a new thread with a request for testing. Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Sun Dec 8 06:12:07 2013 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Sun, 8 Dec 2013 12:12:07 +0100 Subject: [Numpy-discussion] wheels for OS X: request for testing Message-ID: Hi all, At http://sourceforge.net/projects/numpy/files/wheels_to_test/ you can find a set of wheels, for numpy, scipy and ipython plus its dependencies. These are for the following configuration only: - OSX >= 10.6 - Python 2.7 32/64-bit from python.org It would be great if OS X users could test these. We're interested in whether these wheels work, and also if you run into any issues installing wheels. Since (a) I'm not really sure myself what the recommended and most robust way is to install wheels, and (b) it's interesting to see if the docs and usability of pip/wheel are ready for prime time, I'm not going to give the commands to execute but instead link to the relevant docs: http://www.pip-installer.org/en/latest/index.html http://python-packaging-user-guide.readthedocs.org/en/latest/ http://wheel.readthedocs.org/en/latest/index.html Please try this out and share your experience. Thanks, Ralf P.S.if you're wondering what on earth wheels are, they're the new binary install format for the standard python packaging and distribution tools (pip, setuptools). -------------- next part -------------- An HTML attachment was scrubbed... URL: From josef.pktd at gmail.com Sun Dec 8 09:40:39 2013 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Sun, 8 Dec 2013 09:40:39 -0500 Subject: [Numpy-discussion] Deprecate boolean math operators? 
In-Reply-To: References: <1386283039.23728.6.camel@sebastian-laptop> <52A14D0C.1060209@gmail.com> <52A1F793.3010506@gmail.com> <52A21468.8010809@gmail.com> <52A21B7C.6000908@gmail.com> <1386363042.29623.10.camel@sebastian-laptop> Message-ID: On Fri, Dec 6, 2013 at 11:25 PM, Charles R Harris wrote: > > > > On Fri, Dec 6, 2013 at 2:14 PM, wrote: >> >> On Fri, Dec 6, 2013 at 3:50 PM, Sebastian Berg >> wrote: >> > On Fri, 2013-12-06 at 15:30 -0500, josef.pktd at gmail.com wrote: >> >> On Fri, Dec 6, 2013 at 2:59 PM, Nathaniel Smith wrote: >> >> > On Fri, Dec 6, 2013 at 11:55 AM, Alexander Belopolsky >> >> > wrote: >> >> >> >> >> >> >> >> >> >> >> >> On Fri, Dec 6, 2013 at 1:46 PM, Alan G Isaac >> >> >> wrote: >> >> >>> >> >> >>> On 12/6/2013 1:35 PM, josef.pktd at gmail.com wrote: >> >> >>> > unary versus binary minus >> >> >>> >> >> >>> Oh right; I consider binary `-` broken for >> >> >>> Boolean arrays. (Sorry Alexander; I did not >> >> >>> see your entire issue.) >> >> >>> >> >> >>> >> >> >>> > I'd rather write ~ than unary - if that's what it is. >> >> >>> >> >> >>> I agree. So I have no objection to elimination >> >> >>> of the `-`. >> >> >> >> >> >> >> >> >> It looks like we are close to reaching a consensus on the following >> >> >> points: >> >> >> >> >> >> 1. * is well-defined on boolean arrays and may be used in preference >> >> >> of & in >> >> >> code that is designed to handle 1s and 0s of any dtype in addition >> >> >> to >> >> >> booleans. >> >> >> >> >> >> 2. + is defined consistently with * and the only issue is the >> >> >> absence of >> >> >> additive inverse. This is not a problem as long as presence of - >> >> >> does not >> >> >> suggest otherwise. >> >> >> >> >> >> 3. binary and unary minus should be deprecated because its use in >> >> >> expressions where variables can be either boolean or numeric would >> >> >> lead to >> >> >> subtle bugs. 
For example -x*y would produce different results from >> >> >> -(x*y) >> >> >> depending on whether x is boolean or not. In all situations, ^ is >> >> >> preferable to binary - and ~ is preferable to unary -. >> >> >> >> >> >> 4. changing boolean arithmetics to auto-promotion to int is >> >> >> precluded by a >> >> >> significant use-case of boolean matrices. >> >> > >> >> > +1 >> >> >> >> +0.5 >> >> (I would still prefer a different binary minus, but it would be >> >> inconsistent with a logical unary minus that negates.) >> >> >> > >> > The question is if the current xor behaviour can make sense? It doesn't >> > seem to make much sense mathematically? Which only leaves that `abs(x - >> > y)` is actually what a (python) programmer might expect. >> > I think I would like to deprecate at least the unary one. The ~ kind of >> > behaviour just doesn't fit as far as I can see. >> >> I haven't seen any real use cases for xor yet. > > > Using it instead of '+' yields a boolean ring instead of semi-ring. Papers > from the first quarter of the last century used it pretty often on that > account, hence 'sigma-rings', etc. Eventually the simplicity of the > inclusive or overcame that tendency. > >> My impression is that both plus and minus are just overflow accidents >> and not intentional. plus works in a useful way, minus as xor might be >> used once per century. > > > It's certainly weird given that '+' means the inclusive or. I think '^' is > much preferable. > Although it makes some sense if one can keep the semantics straight. > Complicated, though. I'm looking at the test failure with allclose Looks like - as xor still makes sense in some cases, because it doesn't need special cases for equality checks for example. x - y == 0 iff x == y What happens to np.diff? >>> np.diff(m1) array([False, True, False], dtype=bool) I'm using code like that to get the length of runs in a runstest. But in my current code, I actually have astype(int) and also use floats later on. 
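The run bookkeeping Josef describes can be sketched with an explicit xor, which is all the overflowing boolean minus amounts to. A minimal sketch, where `m1` is a made-up boolean array chosen to match the `np.diff` output quoted above:

```python
import numpy as np

# Hypothetical boolean sequence consistent with the quoted np.diff result.
m1 = np.array([True, True, False, False])

# On NumPy 1.8, binary '-' on booleans overflowed to xor, so np.diff(m1)
# marked positions where consecutive values differ.  Spelling it as '^'
# makes the intent explicit and is independent of any '-' deprecation:
changes = m1[1:] ^ m1[:-1]

# Run lengths fall out of the change points: pad with sentinels at both
# ends, then take differences of the change positions.
boundaries = np.flatnonzero(np.concatenate(([True], changes, [True])))
run_lengths = np.diff(boundaries)
```

With `m1` as above, `changes` marks the single change point and `run_lengths` recovers the two runs of length 2, which is the kind of runs-test bookkeeping described in the message.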
If I read my (incompletely documented) code correctly, I also needed the sign, not just the run length. Just another argument about what minus could be. Josef > >> >> I would deprecate both unary and binary minus. >> >> (And when nobody is looking in two versions from now, I would add a >> binary minus that overflows to the clipped version, so I get a set >> subtraction. :) > > > Where is '\' when you need it? > > > > Chuck > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion >
From chris.barker at noaa.gov Sun Dec 8 12:23:09 2013 From: chris.barker at noaa.gov (Chris Barker) Date: Sun, 8 Dec 2013 09:23:09 -0800 Subject: [Numpy-discussion] distributing wheels & SSE/superpack options In-Reply-To: References: Message-ID: On Sun, Dec 8, 2013 at 2:59 AM, Ralf Gommers wrote: > I'm under the impression that > $ pip install --use-wheel --no-index --find-links=/local_download_dir > ipython > and > $ pip install --use-wheel --no-index --find-links=hosting_url ipython > should both work. > Cool, that _should_ be easy and useful. But I've been running into multiple issues so far - from having to upgrade > pip itself and having to manually remove setuptools to having no wheel-2.7 > command (when "wheel" is the 2.6 version). > oh well -- this just shows how little that has been tested. Which is why it's good we're doing this. I've uploaded numpy and scipy wheels plus your set at > http://sourceforge.net/projects/numpy/files/wheels_to_test/. I'll start a > new thread with a request for testing. > > great, thanks! -Chris > Ralf > > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -- Christopher Barker, Ph.D.
Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov -------------- next part -------------- An HTML attachment was scrubbed... URL:
From chris.barker at noaa.gov Sun Dec 8 12:34:55 2013 From: chris.barker at noaa.gov (Chris Barker) Date: Sun, 8 Dec 2013 09:34:55 -0800 Subject: [Numpy-discussion] wheels for OS X: request for testing In-Reply-To: References: Message-ID: Thanks Ralf! Since (a) I'm not really sure myself what the recommended and most robust way is to install wheels, and (b) it's interesting to see if the docs and usability of pip/wheel are ready for prime time, I'm not going to give the commands to execute but instead link to the relevant docs: Good idea. But I will add one tip: you should be able to install ipython with one command, and have it auto-magically suck in its dependencies (and you probably want to use: ipython[all] So that you get all the deps you need for the notebook -- that's where the challenge lies... -Chris > http://www.pip-installer.org/en/latest/index.html > http://python-packaging-user-guide.readthedocs.org/en/latest/ > http://wheel.readthedocs.org/en/latest/index.html > > Please try this out and share your experience. > > Thanks, > Ralf > > P.S. if you're wondering what on earth wheels are, they're the new binary > install format for the standard python packaging and distribution tools > (pip, setuptools). > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov -------------- next part -------------- An HTML attachment was scrubbed...
URL: From chris.barker at noaa.gov Sun Dec 8 17:04:10 2013 From: chris.barker at noaa.gov (Chris Barker) Date: Sun, 8 Dec 2013 14:04:10 -0800 Subject: [Numpy-discussion] wheels for OS X: request for testing In-Reply-To: References: Message-ID: One more note: These were built on a 10.7 machine, so I'm particularly interested to hear how they work on 10.6, if anyone has an old machine around... -Chris On Sun, Dec 8, 2013 at 9:34 AM, Chris Barker wrote: > Thanks Ralf! > > Since (a) I'm not really sure myself what the recommended and most robust > way is to install wheels, and (b) it's interesting to see if the docs and > usability of pip/wheel are ready for prime time, I'm not going to give the > commands to execute but instead link to the relevant docs: > > Good idea. But I will add one tip: > > you should be able to install ipython with one command, and have it > auto-mgically suck in its dependencies (and you probably want to use: > > ipython[all] > > So that you get all the deps you need for the notebook -- that's where the > challenge lies... > > -Chris > > > > >> http://www.pip-installer.org/en/latest/index.html >> http://python-packaging-user-guide.readthedocs.org/en/latest/ >> http://wheel.readthedocs.org/en/latest/index.html >> >> Please try this out and share your experience. >> >> Thanks, >> Ralf >> >> P.S.if you're wondering what on earth wheels are, they're the new binary >> install format for the standard python packaging and distribution tools >> (pip, setuptools). >> >> >> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion >> >> > > > -- > > Christopher Barker, Ph.D. > Oceanographer > > Emergency Response Division > NOAA/NOS/OR&R (206) 526-6959 voice > 7600 Sand Point Way NE (206) 526-6329 fax > Seattle, WA 98115 (206) 526-6317 main reception > > Chris.Barker at noaa.gov > -- Christopher Barker, Ph.D. 
Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov -------------- next part -------------- An HTML attachment was scrubbed... URL: From cmkleffner at gmail.com Mon Dec 9 18:34:23 2013 From: cmkleffner at gmail.com (Carl Kleffner) Date: Tue, 10 Dec 2013 00:34:23 +0100 Subject: [Numpy-discussion] OpenBLAS based numpy scipy test builds for WIN64 Message-ID: Hi list, I've uploaded on https://code.google.com/p/mingw-w64-static/ numpy/scipy binaries as wheel builds for testing. The binaries have been build with the help of a customized mingw-w64 toolchain and a recent (git) openBLAS. Regards Carl -------------- next part -------------- An HTML attachment was scrubbed... URL: From andrew.collette at gmail.com Mon Dec 9 19:28:46 2013 From: andrew.collette at gmail.com (Andrew Collette) Date: Mon, 9 Dec 2013 17:28:46 -0700 Subject: [Numpy-discussion] ANN: HDF5 for Python 2.2.1 Message-ID: Announcing HDF5 for Python (h5py) 2.2.1 ======================================= The h5py team is happy, in a sense, to announce the availability of h5py 2.2.1. This release fixes a critical bug reported by Jim Parker on December 7th, which affects code using HDF5 compound types. We recommend that all users of h5py 2.2.0 upgrade to avoid crashes or possible data corruption. About h5py, downloads, documentation: http://www.h5py.org Scope of bug ------------ The issue affects a feature introduced in h5py 2.2.0, in which HDF5 compound datasets may be updated in-place, by specifying a field name or names when writing to the dataset: >>> dataset['field_name'] = value Under certain conditions, h5py can supply uninitialized memory to the HDF5 conversion machinery, leading (in the case reported) to a segmentation fault. It is also possible for other fields of the type to be corrupted. 
This issue affects only code which updates a subset of the fields in the compound type. Programs reading from a compound type, writing all fields, or using other datatypes, are not affected; nor are versions of h5py prior to 2.2.0. More information ---------------- Github issue: https://github.com/h5py/h5py/issues/372 Original thread: https://groups.google.com/forum/#!topic/h5py/AbUOZ1MXf3U Thanks also to Christoph Gohlke for making Windows installers available on very short notice, after a glitch in the h5py build system. From pierre.haessig at crans.org Thu Dec 12 09:20:05 2013 From: pierre.haessig at crans.org (Pierre Haessig) Date: Thu, 12 Dec 2013 15:20:05 +0100 Subject: [Numpy-discussion] repeat array in a fake dim without stride_tricks ? Message-ID: <52A9C615.1010909@crans.org> Hello, In order to repeat rows or columns of an array as http://stackoverflow.com/questions/1550130/cloning-row-or-column-vectors I can use np.repeat as suggested by pv. However, looking at the flags of the resulting array, data seems to be copied and actually repeated in memory. This is not applicable if want a 1000x repetition. What are the other options for such a repeat ? On scipy lectures, there is a suggestion to use as_strided : http://scipy-lectures.github.io/advanced/advanced_numpy/#example-fake-dimensions-with-strides Otherwise, I see broadcast_arrays : > N = 3 > data = np.arange(N) > np.broadcast_arrays(data[:,None], np.zeros((1,2)))[0] array([[0, 0], [1, 1], [2, 2]]) This works but it feels like invoking a magic formula. Did I miss a simpler function ? best, Pierre -------------- next part -------------- A non-text attachment was scrubbed... Name: pierre_haessig.vcf Type: text/x-vcard Size: 329 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... 
Name: signature.asc Type: application/pgp-signature Size: 897 bytes Desc: OpenPGP digital signature URL:
From david.jones74 at gmail.com Thu Dec 12 13:58:39 2013 From: david.jones74 at gmail.com (David Jones) Date: Thu, 12 Dec 2013 13:58:39 -0500 Subject: [Numpy-discussion] building 32 bit numpy on 64 bit linux Message-ID: <52AA075F.8060708@gmail.com> I'm trying to compile 32-bit numpy on a 64 bit Centos 6 system, but it fails with the message: "Broken toolchain: cannot link a simple C program" It gets the compile flags right, but not the linker: C compiler: gcc -pthread -fno-strict-aliasing -g -O2 -m32 -DNDEBUG -g -fwrapv -O3 -Wall -Wstrict-prototypes -fPIC compile options: '-Inumpy/core/src/private -Inumpy/core/src -Inumpy/core -Inumpy/core/src/npymath -Inumpy/core/src/multiarray -Inumpy/core/src/umath -Inumpy/core/src/npysort -Inumpy/core/include -I/opt/python/ia32/include/python2.7 -c' gcc: _configtest.c gcc -pthread _configtest.o -o _configtest _configtest.o: could not read symbols: File in wrong format collect2: ld returned 1 exit status I'm building it using a 32-bit python build that I compiled on the same system. I tried: OPT="-m32" FOPT="-m32" python setup.py build and setarch x86_64 -B python setup.py build But with the same results. Someone worked around this by altering ccompiler.py, but I'm trying to find a cleaner solution. See: http://stackoverflow.com/questions/11265057/how-do-i-install-a-32-bit-version-of-numpy Is there a standard way of doing this? Regards, David J.
From jtaylor.debian at googlemail.com Thu Dec 12 15:54:14 2013 From: jtaylor.debian at googlemail.com (Julian Taylor) Date: Thu, 12 Dec 2013 21:54:14 +0100 Subject: [Numpy-discussion] building 32 bit numpy on 64 bit linux In-Reply-To: <52AA075F.8060708@gmail.com> References: <52AA075F.8060708@gmail.com> Message-ID: <52AA2276.8020007@googlemail.com> On 12.12.2013 19:58, David Jones wrote: > I'm trying to compile 32-bit numpy on a 64 bit Centos 6 system, but fails with the message: > > "Broken toolchain: cannot link a simple C program" > > It get's the compile flags right, but not the linker: > > C compiler: gcc -pthread -fno-strict-aliasing -g -O2 -m32 -DNDEBUG -g -fwrapv -O3 -Wall -Wstrict-prototypes -fPIC > compile options: '-Inumpy/core/src/private -Inumpy/core/src -Inumpy/core -Inumpy/core/src/npymath -Inumpy/core/src/multiarray -Inumpy/core/src/umath -Inumpy/core/src/npysort -Inumpy/core/include -I/opt/python/ia32/include/python2.7 -c' > gcc: _configtest.c > gcc -pthread _configtest.o -o _configtest > _configtest.o: could not read symbols: File in wrong format > collect2: ld returned 1 exit status > > I'm bulding it using a 32bit python build that I compiled on the same system. > > I tried: > > OPT="-m32" FOPT="-m32" python setup.py build > > and > > setarch x86_64 -B python setup.py build > > But with the same results. > > > Someone worked around this by altering ccompiler.py, but I'm trying to find a cleaner solution. > > See:http://stackoverflow.com/questions/11265057/how-do-i-install-a-32-bit-version-of-numpy > > Is there a standard way of doing this? 
> this might work: CC="gcc -m32" LDSHARED="gcc -m32" FF="gfortran -m32" linux32 python setup.py build From david.jones74 at gmail.com Thu Dec 12 14:40:06 2013 From: david.jones74 at gmail.com (David Jones) Date: Thu, 12 Dec 2013 14:40:06 -0500 Subject: [Numpy-discussion] building 32 bit numpy on 64 bit linux In-Reply-To: <52AA2276.8020007@googlemail.com> References: <52AA075F.8060708@gmail.com> <52AA2276.8020007@googlemail.com> Message-ID: <52AA1116.6080809@gmail.com> On 12/12/13 15:54, Julian Taylor wrote: > On 12.12.2013 19:58, David Jones wrote: >> I'm trying to compile 32-bit numpy on a 64 bit Centos 6 system, but fails with the message: >> >> "Broken toolchain: cannot link a simple C program" >> >> It get's the compile flags right, but not the linker: >> >> C compiler: gcc -pthread -fno-strict-aliasing -g -O2 -m32 -DNDEBUG -g -fwrapv -O3 -Wall -Wstrict-prototypes -fPIC >> compile options: '-Inumpy/core/src/private -Inumpy/core/src -Inumpy/core -Inumpy/core/src/npymath -Inumpy/core/src/multiarray -Inumpy/core/src/umath -Inumpy/core/src/npysort -Inumpy/core/include -I/opt/python/ia32/include/python2.7 -c' >> gcc: _configtest.c >> gcc -pthread _configtest.o -o _configtest >> _configtest.o: could not read symbols: File in wrong format >> collect2: ld returned 1 exit status >> >> I'm bulding it using a 32bit python build that I compiled on the same system. >> >> I tried: >> >> OPT="-m32" FOPT="-m32" python setup.py build >> >> and >> >> setarch x86_64 -B python setup.py build >> >> But with the same results. >> >> >> Someone worked around this by altering ccompiler.py, but I'm trying to find a cleaner solution. >> >> See:http://stackoverflow.com/questions/11265057/how-do-i-install-a-32-bit-version-of-numpy >> >> Is there a standard way of doing this? 
>> > this might work: > > CC="gcc -m32" LDSHARED="gcc -m32" FF="gfortran -m32" linux32 python > setup.py build > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion That didn't work. It says it can't find executable "gcc -m32". I tried changing ccompiler.py, per the stackoverflow post. I got further that time, but it fails with the message: build/temp.linux-i686-2.7/numpy/linalg/lapack_litemodule.o: could not read symbols: File in wrong format collect2: ld returned 1 exit status build/temp.linux-i686-2.7/numpy/linalg/lapack_litemodule.o: could not read symbols: File in wrong format collect2: ld returned 1 exit status error: Command "/usr/bin/gfortran -Wall -Wall -shared build/temp.linux-i686-2.7/numpy/linalg/lapack_litemodule.o build/temp.linux-i686-2.7/numpy/linalg/lapack_lite/python_xerbla.o -L/usr/lib/atlas -L/opt/python/ia32/lib -Lbuild/temp.linux-i686-2.7 -llapack -lptf77blas -lptcblas -latlas -lpython2.7 -lgfortran -o build/lib.linux-i686-2.7/numpy/linalg/lapack_lite.so" failed with exit status 1 What build options does numpy recognize? The docs ony mention CC, OPT, and FOPT, but don't give any details on how to use them. Can you pass something to setup.py? 
From jtaylor.debian at googlemail.com Thu Dec 12 16:35:10 2013 From: jtaylor.debian at googlemail.com (Julian Taylor) Date: Thu, 12 Dec 2013 22:35:10 +0100 Subject: [Numpy-discussion] building 32 bit numpy on 64 bit linux In-Reply-To: <52AA1116.6080809@gmail.com> References: <52AA075F.8060708@gmail.com> <52AA2276.8020007@googlemail.com> <52AA1116.6080809@gmail.com> Message-ID: <52AA2C0E.2090600@googlemail.com> On 12.12.2013 20:40, David Jones wrote: > On 12/12/13 15:54, Julian Taylor wrote: >> On 12.12.2013 19:58, David Jones wrote: >>> I'm trying to compile 32-bit numpy on a 64 bit Centos 6 system, but fails with the message: >>> >>> "Broken toolchain: cannot link a simple C program" >>> ... >>> >> this might work: >> >> CC="gcc -m32" LDSHARED="gcc -m32" FF="gfortran -m32" linux32 python >> setup.py build >> > > That didn't work. It says it can't find executable "gcc -m32". I tried > changing ccompiler.py, per the stackoverflow post. I got further that > time, but it fails with the message: > weird it works for me with libpython2.7-dev:i386 installed on amd64 Ubuntu 13.10 (multiarch is friggin great :D) But I had to add an additional LDFLAGS="-m32 -shared" CC="gcc -m32" LDSHARED="gcc -m32 -shared" LDFLAGS="-m32 -shared" linux32 python setup.py build you should be able to work around this error message by writing small wrapper scripts for the compiler: cat << EOF > gcc-32 #!/bin/sh gcc -m32 "$@" EOF ... CC=gcc-32 .... 
From jaime.frio at gmail.com Fri Dec 13 02:07:28 2013 From: jaime.frio at gmail.com (=?ISO-8859-1?Q?Jaime_Fern=E1ndez_del_R=EDo?=) Date: Thu, 12 Dec 2013 23:07:28 -0800 Subject: [Numpy-discussion] Using np.partition to extract n largest/smallest items from an array Message-ID: With the new np.partition functionality, there is a more efficient, but also less obvious, way of extracting the n largest (or smallest) elements from an array, i.e.:

def smallest_n(a, n):
    return np.sort(np.partition(a, n)[:n])

def argsmallest_n(a, n):
    ret = np.argpartition(a, n)[:n]
    b = np.take(a, ret)
    return np.take(ret, np.argsort(b))

instead of the usual: np.sort(a)[:n] np.argsort(a)[:n] Are those 4 functions (smallest, argsmallest, largest, arglargest), with adequate axis support, worthy of including in numpy, or is the name space already too cluttered? Jaime -- (\__/) ( O.o) ( > <) This is Conejo. Copy Conejo into your signature and help him with his plans for world domination. -------------- next part -------------- An HTML attachment was scrubbed... URL:
From cournape at gmail.com Fri Dec 13 05:02:05 2013 From: cournape at gmail.com (David Cournapeau) Date: Fri, 13 Dec 2013 10:02:05 +0000 Subject: [Numpy-discussion] [SciPy-Dev] ANN: Scipy 0.13.2 release In-Reply-To: References: Message-ID: Hi Ralf, Thanks a lot for the quick fix release. I can confirm it builds and tests correctly on windows, rhel5 and osx (both 32 and 64 bits). cheers, David On Sun, Dec 8, 2013 at 10:06 AM, Ralf Gommers wrote: > Hi, > > I'm happy to announce the availability of the scipy 0.13.2 release. This > is a bugfix only release; it contains fixes for ndimage and optimize, and > most importantly was compiled with Cython 0.19.2 to fix memory leaks in > code using Cython fused types.
> > Source tarballs, binaries and release notes can be found at > http://sourceforge.net/projects/scipy/files/scipy/0.13.2/ > > Cheers, > Ralf > > > ========================== > SciPy 0.13.2 Release Notes > ========================== > > SciPy 0.13.2 is a bug-fix release with no new features compared to 0.13.1. > > > Issues fixed > ------------ > > - 3096: require Cython 0.19, earlier versions have memory leaks in fused > types > - 3079: ``ndimage.label`` fix swapped 64-bitness test > - 3108: ``optimize.fmin_slsqp`` constraint violation > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From sebastian at sipsolutions.net Fri Dec 13 07:45:37 2013 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Fri, 13 Dec 2013 13:45:37 +0100 Subject: [Numpy-discussion] repeat array in a fake dim without stride_tricks ? In-Reply-To: <52A9C615.1010909@crans.org> References: <52A9C615.1010909@crans.org> Message-ID: <1386938737.2127.1.camel@sebastian-laptop> Hey, On Thu, 2013-12-12 at 15:20 +0100, Pierre Haessig wrote: > Hello, > > In order to repeat rows or columns of an array as > http://stackoverflow.com/questions/1550130/cloning-row-or-column-vectors > I can use np.repeat as suggested by pv. However, looking at the flags of > the resulting array, data seems to be copied and actually repeated in > memory. This is not applicable if want a 1000x repetition. > > What are the other options for such a repeat ? > No, I don't think there are any other options. stride tricks are a bit hidden, since in many cases it is more dangerous than helping. Though with some care you can easily implement such functions using stride_tricks. 
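A short sketch of the `as_strided` route Sebastian mentions (the 1000x repeat count is arbitrary): a zero stride along the fake axis makes every "copy" read the same memory, so nothing is duplicated, but for the same reason the view should be treated as read-only.

```python
import numpy as np
from numpy.lib.stride_tricks import as_strided

data = np.arange(3)

# Repeat each element 1000 times along a fake second axis without copying:
# a stride of 0 re-reads the same element for every column.
fake = as_strided(data, shape=(3, 1000), strides=(data.strides[0], 0))
```

Writing into `fake` would hit all the fake copies at once (they alias the same bytes), which is why this stays hidden behind `stride_tricks` rather than being a plain `repeat`-style function.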
Regards, Sebastian > On scipy lectures, there is a suggestion to use as_strided : > http://scipy-lectures.github.io/advanced/advanced_numpy/#example-fake-dimensions-with-strides > > Otherwise, I see broadcast_arrays : > > > N = 3 > > data = np.arange(N) > > np.broadcast_arrays(data[:,None], np.zeros((1,2)))[0] > array([[0, 0], > [1, 1], > [2, 2]]) > > This works but it feels like invoking a magic formula. Did I miss a > simpler function ? > > best, > Pierre > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From david.jones74 at gmail.com Fri Dec 13 07:50:31 2013 From: david.jones74 at gmail.com (David Jones) Date: Fri, 13 Dec 2013 07:50:31 -0500 Subject: [Numpy-discussion] building 32 bit numpy on 64 bit linux In-Reply-To: <52AA2C0E.2090600@googlemail.com> References: <52AA075F.8060708@gmail.com> <52AA2276.8020007@googlemail.com> <52AA1116.6080809@gmail.com> <52AA2C0E.2090600@googlemail.com> Message-ID: <52AB0297.1030707@gmail.com> On 12/12/13 16:35, Julian Taylor wrote: > On 12.12.2013 20:40, David Jones wrote: >> On 12/12/13 15:54, Julian Taylor wrote: >>> On 12.12.2013 19:58, David Jones wrote: >>>> I'm trying to compile 32-bit numpy on a 64 bit Centos 6 system, but fails with the message: >>>> >>>> "Broken toolchain: cannot link a simple C program" >>>> > ... >>> this might work: >>> >>> CC="gcc -m32" LDSHARED="gcc -m32" FF="gfortran -m32" linux32 python >>> setup.py build >>> >> That didn't work. It says it can't find executable "gcc -m32". I tried >> changing ccompiler.py, per the stackoverflow post. 
I got further that >> time, but it fails with the message: >> > > weird it works for me with libpython2.7-dev:i386 installed on amd64 > Ubuntu 13.10 (multiarch is friggin great :D) > But I had to add an additional LDFLAGS="-m32 -shared" > > CC="gcc -m32" LDSHARED="gcc -m32 -shared" LDFLAGS="-m32 -shared" linux32 > python setup.py build > > you should be able to work around this error message by writing small > wrapper scripts for the compiler: > > cat << EOF > gcc-32 > #!/bin/sh > gcc -m32 "$@" > EOF > ... > CC=gcc-32 .... I actually didn't expect it to work. It just seemed too simple. So, I'm surprised it worked for you. First off, are we using the same version of everything? I'm using Python 2.7.5, built with the following options: CFLAGS="-g -O2 -m32" LDFLAGS="-m32" ./configure --enable-unicode=ucs4 --enable-shared --prefix=/opt/python/ia32 --with-valgrind I'm building Numpy 1.8.0. I'll try the wrapper method. It seems that distutils isn't always passing the build options to the compiler and linker, so spoofing gcc and gfortran may work. It appears that numpy.distutils doesn't actually apply the linker flags. It never calls the link_executable function, except indirectly through it's reimplementation of the distutils config module. Moreover, the config module doesn't expose the necessary arguments of link_executable to set the build flags. I want to look more into the code to see exactly what's happening. It looks like the distutils config module and command package could use a lot of work. It makes sense to pass in all the build options as arguments to setup.py, rather than using environment variable. The compiler modules support this, but they're never getting that information from the config module. Of course, the may have been fixed in Python 3. I don't know. From what I can tell, distutils uses 3 methods to determine the build configuration: 1 - What version is of Python is being used? 32 bit or 64 bit? 2 - What's the architecture of the system? i.e. 
via uname -r. 3 - Environment variables. Unfortunately the behavior is inconsistent and poorly documented. > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From david.jones74 at gmail.com Fri Dec 13 08:38:47 2013 From: david.jones74 at gmail.com (David Jones) Date: Fri, 13 Dec 2013 08:38:47 -0500 Subject: [Numpy-discussion] building 32 bit numpy on 64 bit linux In-Reply-To: <52AA2C0E.2090600@googlemail.com> References: <52AA075F.8060708@gmail.com> <52AA2276.8020007@googlemail.com> <52AA1116.6080809@gmail.com> <52AA2C0E.2090600@googlemail.com> Message-ID: <52AB0DE7.1020503@gmail.com> On 12/12/13 16:35, Julian Taylor wrote: > On 12.12.2013 20:40, David Jones wrote: >> On 12/12/13 15:54, Julian Taylor wrote: >>> On 12.12.2013 19:58, David Jones wrote: >>>> I'm trying to compile 32-bit numpy on a 64 bit Centos 6 system, but fails with the message: >>>> >>>> "Broken toolchain: cannot link a simple C program" >>>> > ... >>> this might work: >>> >>> CC="gcc -m32" LDSHARED="gcc -m32" FF="gfortran -m32" linux32 python >>> setup.py build >>> >> That didn't work. It says it can't find executable "gcc -m32". I tried >> changing ccompiler.py, per the stackoverflow post. I got further that >> time, but it fails with the message: >> > > weird it works for me with libpython2.7-dev:i386 installed on amd64 > Ubuntu 13.10 (multiarch is friggin great :D) > But I had to add an additional LDFLAGS="-m32 -shared" > > CC="gcc -m32" LDSHARED="gcc -m32 -shared" LDFLAGS="-m32 -shared" linux32 > python setup.py build > > you should be able to work around this error message by writing small > wrapper scripts for the compiler: > > cat << EOF > gcc-32 > #!/bin/sh > gcc -m32 "$@" > EOF > ... > CC=gcc-32 .... No luck. 
Here are my results: bash~$ CC=gcc-32 LDSHARED=ld-32 LDFLAGS="-m32 -shared" linux32 python2.7 setup.py build /home/build/bin/ld-32: line 1: !/bin/sh: No such file or directory /usr/bin/ld: cannot find -lpython2.7 collect2: ld returned 1 exit status /home/build/bin/ld-32: line 1: !/bin/sh: No such file or directory /usr/bin/ld: cannot find -lpython2.7 collect2: ld returned 1 exit status error: Command "ld-32 -m32 -shared build/temp.linux-i686-2.7/numpy/core/src/dummymodule.o -L. -Lbuild/temp.linux-i686-2.7 -lm -lpython2.7 -o build/lib.linux-i686-2.7/numpy/core/_dummy.so" failed with exit status 1 This makes no sense to me. The linker is set to 32 bit. My python library path is 32 bit. I know it can find the library, or the build wouldn't have even started. > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From david.jones74 at gmail.com Fri Dec 13 09:02:35 2013 From: david.jones74 at gmail.com (David Jones) Date: Fri, 13 Dec 2013 09:02:35 -0500 Subject: [Numpy-discussion] building 32 bit numpy on 64 bit linux In-Reply-To: <52AA2C0E.2090600@googlemail.com> References: <52AA075F.8060708@gmail.com> <52AA2276.8020007@googlemail.com> <52AA1116.6080809@gmail.com> <52AA2C0E.2090600@googlemail.com> Message-ID: <52AB137B.3000708@gmail.com> On 12/12/13 16:35, Julian Taylor wrote: > On 12.12.2013 20:40, David Jones wrote: >> On 12/12/13 15:54, Julian Taylor wrote: >>> On 12.12.2013 19:58, David Jones wrote: >>>> I'm trying to compile 32-bit numpy on a 64 bit Centos 6 system, but fails with the message: >>>> >>>> "Broken toolchain: cannot link a simple C program" >>>> > ... >>> this might work: >>> >>> CC="gcc -m32" LDSHARED="gcc -m32" FF="gfortran -m32" linux32 python >>> setup.py build >>> >> That didn't work. It says it can't find executable "gcc -m32". I tried >> changing ccompiler.py, per the stackoverflow post. 
I got further that >> time, but it fails with the message: >> So this is what ended up working: CC=gcc-32 LDSHARED=ld-32 LDFLAGS="-m32 -shared -L$LD_LIBRARY_PATH" python2.7 setup.py build Somehow the value of LD_LIBRARY_PATH gets lost along the way. Perhaps distutils re-exports it with a different value, prior to calling the linker. I was getting a gfortran error at one point, but I guess it inherits -m32 from gcc-32, so it doesn't have to be set explicitly. I still want to look into the code, to see what's going on in there. I may just write a custom config module, that allows me to pass in the build flags more cleanly. > > weird it works for me with libpython2.7-dev:i386 installed on amd64 > Ubuntu 13.10 (multiarch is friggin great :D) > But I had to add an additional LDFLAGS="-m32 -shared" > > CC="gcc -m32" LDSHARED="gcc -m32 -shared" LDFLAGS="-m32 -shared" linux32 > python setup.py build > > you should be able to work around this error message by writing small > wrapper scripts for the compiler: > > cat << EOF > gcc-32 > #!/bin/sh > gcc -m32 "$@" > EOF > ... > CC=gcc-32 .... > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion
From charles at crunch.io Fri Dec 13 11:24:49 2013 From: charles at crunch.io (Charles G. Waldman) Date: Fri, 13 Dec 2013 10:24:49 -0600 Subject: [Numpy-discussion] building 32 bit numpy on 64 bit linux In-Reply-To: <52AB137B.3000708@gmail.com> References: <52AA075F.8060708@gmail.com> <52AA2276.8020007@googlemail.com> <52AA1116.6080809@gmail.com> <52AA2C0E.2090600@googlemail.com> <52AB137B.3000708@gmail.com> Message-ID: >>> this might work: >>> >>> CC="gcc -m32" LDSHARED="gcc -m32" FF="gfortran -m32" linux32 python >>> setup.py build > Compiler flags can't be added to CC, they should be in CFLAGS. (and > ld flags go into LDFLAGS). Otherwise you are saying to use a program > called "gcc -m32" (with a space in the name).
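A small sketch of the convention Chuck describes, from the Python side (the `-m32` values are just the flags from this thread; whether a given toolchain honors each variable varies):

```python
import shutil

# CC='gcc -m32' makes build tools look up ONE program whose name contains
# a space -- no such executable exists on the PATH:
found = shutil.which("gcc -m32")   # None: there is no program "gcc -m32"

# The conventional split: CC names a bare compiler, while per-compilation
# and per-link flags go in CFLAGS / FFLAGS / LDFLAGS.
build_env = {
    "CC": "gcc",
    "CFLAGS": "-m32",
    "FFLAGS": "-m32",
    "LDFLAGS": "-m32 -shared",
}
cmdline = " ".join("{}={!r}".format(k, v) for k, v in build_env.items())
```

`cmdline` then renders the environment-variable prefix one would put in front of `python setup.py build`, with no flags hidden inside the compiler name.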
On Fri, Dec 13, 2013 at 8:02 AM, David Jones wrote: > > On 12/12/13 16:35, Julian Taylor wrote: >> On 12.12.2013 20:40, David Jones wrote: >>> On 12/12/13 15:54, Julian Taylor wrote: >>>> On 12.12.2013 19:58, David Jones wrote: >>>>> I'm trying to compile 32-bit numpy on a 64 bit Centos 6 system, but fails with the message: >>>>> >>>>> "Broken toolchain: cannot link a simple C program" >>>>> >> ... >>>> this might work: >>>> >>>> CC="gcc -m32" LDSHARED="gcc -m32" FF="gfortran -m32" linux32 python >>>> setup.py build >>>> >>> That didn't work. It says it can't find executable "gcc -m32". I tried >>> changing ccompiler.py, per the stackoverflow post. I got further that >>> time, but it fails with the message: >>> > So this is what ended of working: > > CC=gcc-32 LDSHARED=ld-32 LDFLAGS="-m32 -shared -L$LD_LIBRARY_PATH" > python2.7 setup.py build > > Somehow the value of LD_LIBRARY_PATH gets lost along the way. Perhaps > distutils re-exports it with a different value, prior to calling the linker. > > I was getting a gfortran error at one point, but I guess it inherits > -m32 from gcc-m32, so it doesn't have to be set explicitly. > > I still want to look into the code, to see what's going on in there. I > may just write a custom config module, that allows me to pass in the > build flags more cleanly. >> >> weird it works for me with libpython2.7-dev:i386 installed on amd64 >> Ubuntu 13.10 (multiarch is friggin great :D) >> But I had to add an additional LDFLAGS="-m32 -shared" >> >> CC="gcc -m32" LDSHARED="gcc -m32 -shared" LDFLAGS="-m32 -shared" linux32 >> python setup.py build >> >> you should be able to work around this error message by writing small >> wrapper scripts for the compiler: >> >> cat << EOF > gcc-32 >> #!/bin/sh >> gcc -m32 "$@" >> EOF >> ... >> CC=gcc-32 .... 
>> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From david.jones74 at gmail.com Fri Dec 13 10:34:35 2013 From: david.jones74 at gmail.com (David Jones) Date: Fri, 13 Dec 2013 10:34:35 -0500 Subject: [Numpy-discussion] building 32 bit numpy on 64 bit linux In-Reply-To: References: <52AA075F.8060708@gmail.com> <52AA2276.8020007@googlemail.com> <52AA1116.6080809@gmail.com> <52AA2C0E.2090600@googlemail.com> <52AB137B.3000708@gmail.com> Message-ID: <52AB290B.3020407@gmail.com> On 12/13/13 11:24, Charles G. Waldman wrote: >>>> this might work: >>>> >>>> CC="gcc -m32" LDSHARED="gcc -m32" FF="gfortran -m32" linux32 python >>>> setup.py build > Compiler flags can't be added to CC, they should be in CFLAGS. (and > ld flags go into LDFLAGS). Otherwise you are saying to use a program > called "gcc -m32" (with a space in the name). I had tried that. But it seems that distutils, or numpy.distutils, doesn't honor those flags consistently. Instead, I used the gcc wrappers, as suggested below. As I said, a cleaner, universally applicable method is definitely needed. > > > > On Fri, Dec 13, 2013 at 8:02 AM, David Jones wrote: >> On 12/12/13 16:35, Julian Taylor wrote: >>> On 12.12.2013 20:40, David Jones wrote: >>>> On 12/12/13 15:54, Julian Taylor wrote: >>>>> On 12.12.2013 19:58, David Jones wrote: >>>>>> I'm trying to compile 32-bit numpy on a 64 bit Centos 6 system, but fails with the message: >>>>>> >>>>>> "Broken toolchain: cannot link a simple C program" >>>>>> >>> ... >>>>> this might work: >>>>> >>>>> CC="gcc -m32" LDSHARED="gcc -m32" FF="gfortran -m32" linux32 python >>>>> setup.py build >>>>> >>>> That didn't work. It says it can't find executable "gcc -m32". 
I tried >>>> changing ccompiler.py, per the stackoverflow post. I got further that >>>> time, but it fails with the message: >>>> >> So this is what ended of working: >> >> CC=gcc-32 LDSHARED=ld-32 LDFLAGS="-m32 -shared -L$LD_LIBRARY_PATH" >> python2.7 setup.py build >> >> Somehow the value of LD_LIBRARY_PATH gets lost along the way. Perhaps >> distutils re-exports it with a different value, prior to calling the linker. >> >> I was getting a gfortran error at one point, but I guess it inherits >> -m32 from gcc-m32, so it doesn't have to be set explicitly. >> >> I still want to look into the code, to see what's going on in there. I >> may just write a custom config module, that allows me to pass in the >> build flags more cleanly. >>> weird it works for me with libpython2.7-dev:i386 installed on amd64 >>> Ubuntu 13.10 (multiarch is friggin great :D) >>> But I had to add an additional LDFLAGS="-m32 -shared" >>> >>> CC="gcc -m32" LDSHARED="gcc -m32 -shared" LDFLAGS="-m32 -shared" linux32 >>> python setup.py build >>> >>> you should be able to work around this error message by writing small >>> wrapper scripts for the compiler: >>> >>> cat << EOF > gcc-32 >>> #!/bin/sh >>> gcc -m32 "$@" >>> EOF >>> ... >>> CC=gcc-32 .... 
>>> >>> _______________________________________________ >>> NumPy-Discussion mailing list >>> NumPy-Discussion at scipy.org >>> http://mail.scipy.org/mailman/listinfo/numpy-discussion >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From david.jones74 at gmail.com Fri Dec 13 12:46:25 2013 From: david.jones74 at gmail.com (David Jones) Date: Fri, 13 Dec 2013 12:46:25 -0500 Subject: [Numpy-discussion] building 32 bit numpy on 64 bit linux In-Reply-To: References: <52AA075F.8060708@gmail.com> <52AA2276.8020007@googlemail.com> <52AA1116.6080809@gmail.com> <52AA2C0E.2090600@googlemail.com> <52AB137B.3000708@gmail.com> Message-ID: <52AB47F1.2040802@gmail.com> On 12/13/13 11:24, Charles G. Waldman wrote: >>>> this might work: >>>> >>>> CC="gcc -m32" LDSHARED="gcc -m32" FF="gfortran -m32" linux32 python >>>> setup.py build > Compiler flags can't be added to CC, they should be in CFLAGS. (and > ld flags go into LDFLAGS). Otherwise you are saying to use a program > called "gcc -m32" (with a space in the name). > > > > On Fri, Dec 13, 2013 at 8:02 AM, David Jones wrote: >> On 12/12/13 16:35, Julian Taylor wrote: >>> On 12.12.2013 20:40, David Jones wrote: >>>> On 12/12/13 15:54, Julian Taylor wrote: >>>>> On 12.12.2013 19:58, David Jones wrote: >>>>>> I'm trying to compile 32-bit numpy on a 64 bit Centos 6 system, but fails with the message: >>>>>> >>>>>> "Broken toolchain: cannot link a simple C program" >>>>>> >>> ... >>>>> this might work: >>>>> >>>>> CC="gcc -m32" LDSHARED="gcc -m32" FF="gfortran -m32" linux32 python >>>>> setup.py build >>>>> >>>> That didn't work. It says it can't find executable "gcc -m32". I tried >>>> changing ccompiler.py, per the stackoverflow post. 
I got further that >>>> time, but it fails with the message: >>>> >> So this is what ended of working: >> >> CC=gcc-32 LDSHARED=ld-32 LDFLAGS="-m32 -shared -L$LD_LIBRARY_PATH" >> python2.7 setup.py build >> >> Somehow the value of LD_LIBRARY_PATH gets lost along the way. Perhaps >> distutils re-exports it with a different value, prior to calling the linker. >> >> I was getting a gfortran error at one point, but I guess it inherits >> -m32 from gcc-m32, so it doesn't have to be set explicitly. >> >> I still want to look into the code, to see what's going on in there. I >> may just write a custom config module, that allows me to pass in the >> build flags more cleanly. >>> weird it works for me with libpython2.7-dev:i386 installed on amd64 >>> Ubuntu 13.10 (multiarch is friggin great :D) >>> But I had to add an additional LDFLAGS="-m32 -shared" >>> >>> CC="gcc -m32" LDSHARED="gcc -m32 -shared" LDFLAGS="-m32 -shared" linux32 >>> python setup.py build >>> >>> you should be able to work around this error message by writing small >>> wrapper scripts for the compiler: >>> >>> cat << EOF > gcc-32 >>> #!/bin/sh >>> gcc -m32 "$@" >>> EOF >>> ... >>> CC=gcc-32 .... >>> Correction. Of course LD_LIBRARY_PATH isn't seen by the compiler. It only applies at run time. How embarrasing:) This isn't the first time I've been bitten by that. I don't mind doing that with manual builds, but what about with pip? Is there a way to avoid explicitly setting the library path every time you call pip, when using a custom python install? 
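As an aside on link-time versus run-time paths: LD_LIBRARY_PATH is consulted only by the dynamic loader at run time, the linker needs -L, and -Wl,-rpath bakes a run-time search path into the binary. In a setup script the distinction looks like this (a hedged sketch; "demo", demo.c and /opt/py32/lib are made-up placeholders, not values from this thread):

```python
from setuptools import Extension

# Hypothetical extension: the two *_dirs settings map onto different
# linker flags and take effect at different times.
ext = Extension(
    "demo",
    sources=["demo.c"],
    library_dirs=["/opt/py32/lib"],          # link time: -L/opt/py32/lib
    runtime_library_dirs=["/opt/py32/lib"],  # run time: -Wl,-rpath,/opt/py32/lib
    libraries=["python2.7"],                 # -lpython2.7
)
```

Setting runtime_library_dirs is the same idea as building Python with the -rpath linker flag, as reported later in this thread.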
From jtaylor.debian at googlemail.com Fri Dec 13 13:48:36 2013 From: jtaylor.debian at googlemail.com (Julian Taylor) Date: Fri, 13 Dec 2013 19:48:36 +0100 Subject: [Numpy-discussion] building 32 bit numpy on 64 bit linux In-Reply-To: <52AB47F1.2040802@gmail.com> References: <52AA075F.8060708@gmail.com> <52AA2276.8020007@googlemail.com> <52AA1116.6080809@gmail.com> <52AA2C0E.2090600@googlemail.com> <52AB137B.3000708@gmail.com> <52AB47F1.2040802@gmail.com> Message-ID: <52AB5684.5050907@googlemail.com> On 13.12.2013 18:46, David Jones wrote: > ... > > Correction. Of course LD_LIBRARY_PATH isn't seen by the compiler. It > only applies at run time. How embarrasing:) This isn't the first time > I've been bitten by that. > > I don't mind doing that with manual builds, but what about with pip? Is > there a way to avoid explicitly setting the library path every time you > call pip, when using a custom python install? > why are you actually doing this? the easiest way to get 32 bit binaries for any program is simply using a 32 bit chroot to build them. This is trivial to do on a debian based systems: pbuilder-dist i386 unstable create pbuilder-dist i386 unstable login # install dependencies and build as usual From david.jones74 at gmail.com Fri Dec 13 13:07:15 2013 From: david.jones74 at gmail.com (David Jones) Date: Fri, 13 Dec 2013 13:07:15 -0500 Subject: [Numpy-discussion] building 32 bit numpy on 64 bit linux In-Reply-To: <52AB5684.5050907@googlemail.com> References: <52AA075F.8060708@gmail.com> <52AA2276.8020007@googlemail.com> <52AA1116.6080809@gmail.com> <52AA2C0E.2090600@googlemail.com> <52AB137B.3000708@gmail.com> <52AB47F1.2040802@gmail.com> <52AB5684.5050907@googlemail.com> Message-ID: <52AB4CD3.5050103@gmail.com> On 12/13/13 13:48, Julian Taylor wrote: > On 13.12.2013 18:46, David Jones wrote: >> ... >> >> Correction. Of course LD_LIBRARY_PATH isn't seen by the compiler. It >> only applies at run time. 
How embarrasing:) This isn't the first time >> I've been bitten by that. >> >> I don't mind doing that with manual builds, but what about with pip? Is >> there a way to avoid explicitly setting the library path every time you >> call pip, when using a custom python install? >> > why are you actually doing this? > > the easiest way to get 32 bit binaries for any program is simply using a > 32 bit chroot to build them. > This is trivial to do on a debian based systems: > > pbuilder-dist i386 unstable create > pbuilder-dist i386 unstable login > # install dependencies and build as usual > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion I'm on Centos 6, but I imagine it's not that hard there either. I am doing all this in a chroot, but it's 64 bit. The purpose is to use python (via cython) for testing libraries built on a 64 bit CentOS system. It's much faster than running all the tests directly in C, because I can do everything interactively in ipython. However, the libraries are built 32 bit. WIth the simpler components I can just rebuild them as 64-bit, but with others that gets pretty complicated. To use a more current version of Python on CentOS I have to build it myself, and the simplest way to keep it up to date is using pip. However, pip installs everything from source. I ran into trouble installing numpy, but I imagine I'd have similar problems with other packages. I figure I can set the necessary environment variables in the virtualenv, and then pip should work. Anyways, thanks for your help. From ndarray at mac.com Sat Dec 14 14:39:50 2013 From: ndarray at mac.com (Alexander Belopolsky) Date: Sat, 14 Dec 2013 14:39:50 -0500 Subject: [Numpy-discussion] Does NumPy support indirect memory views? Message-ID: PEP 3118 [1] allows exposing multi-dimensional data that is organized as array of pointers. 
It appears, however that NumPy cannot consume such memory views. Looking at _array_from_buffer_3118() function [2], I don't see any attempt to process suboffsets. The documentation [3] is also silent on this issue. What is the status of indirect memory views/buffers support in NumPy? [1] http://www.python.org/dev/peps/pep-3118/ [2] https://github.com/numpy/numpy/blob/4050ac73af79ae8cc513648ff02e9a22041501c4/numpy/core/src/multiarray/ctors.c#L1253 [3] http://docs.scipy.org/doc/numpy/reference/arrays.interface.html -------------- next part -------------- An HTML attachment was scrubbed... URL: From cournape at gmail.com Sat Dec 14 14:59:10 2013 From: cournape at gmail.com (David Cournapeau) Date: Sat, 14 Dec 2013 19:59:10 +0000 Subject: [Numpy-discussion] Does NumPy support indirect memory views? In-Reply-To: References: Message-ID: On Sat, Dec 14, 2013 at 7:39 PM, Alexander Belopolsky wrote: > PEP 3118 [1] allows exposing multi-dimensional data that is organized as > array of pointers. It appears, however that NumPy cannot consume such > memory views. > > Looking at _array_from_buffer_3118() function [2], I don't see any attempt > to process suboffsets. The documentation [3] is also silent on this issue. > > What is the status of indirect memory views/buffers support in NumPy? > There is indeed no support in NumPy for this. Unfortunately, fixing this would be a significant amount of work, as buffer management is not really abstracted in NumPy ATM. David > > > [1] http://www.python.org/dev/peps/pep-3118/ > [2] > https://github.com/numpy/numpy/blob/4050ac73af79ae8cc513648ff02e9a22041501c4/numpy/core/src/multiarray/ctors.c#L1253 > [3] http://docs.scipy.org/doc/numpy/reference/arrays.interface.html > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From ndarray at mac.com Sat Dec 14 17:22:31 2013 From: ndarray at mac.com (Alexander Belopolsky) Date: Sat, 14 Dec 2013 17:22:31 -0500 Subject: [Numpy-discussion] Does NumPy support indirect memory views? In-Reply-To: References: Message-ID: On Sat, Dec 14, 2013 at 2:59 PM, David Cournapeau wrote: > There is indeed no support in NumPy for this. Unfortunately, fixing this > would be a significant amount of work, as buffer management is not really > abstracted in NumPy ATM. While providing a full support for indirect buffers as a storage for NumPy ndarrays does look like a daunting task, I think some partial support can be implemented rather easily. When an ndarray from object constructor encounters an object that can only expose its memory as an indirect buffer, the constructor can gather the data into a contiguous buffer. At the very least, _array_from_buffer_3118() should detect non-null suboffsets and bail out with a meaningful message rather than expose pointers as data. -------------- next part -------------- An HTML attachment was scrubbed... URL: From tmp50 at ukr.net Sun Dec 15 06:51:01 2013 From: tmp50 at ukr.net (Dmitrey) Date: Sun, 15 Dec 2013 13:51:01 +0200 Subject: [Numpy-discussion] [ANN] OpenOpt suite v 0.52 Message-ID: <1387108021.728479007.a89e227c@frv46.ukr.net> Hi all, I'm glad to inform you about new? OpenOpt ? Suite release 0.52 (2013-Dec-15): ? ? Minor? interalg ? speedup ? ? oofun expression ? ? MATLAB solvers fmincon and fsolve have been connected ? ? Several MATLAB ODE solvers have been connected ? ? New ODE solvers, parameters abstol and reltol ? ? New GLP solver: direct ? ? Some minor bugfixes and improvements Regards, D. -------------- next part -------------- An HTML attachment was scrubbed... 
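Returning to the PEP 3118 thread above: from Python one can at least verify that NumPy only ever exports direct (suboffset-free) buffers, and that direct buffers round-trip through the buffer protocol without copying (a small sketch; on Python 3, memoryview.suboffsets is an empty tuple for a direct buffer):

```python
import numpy as np

a = np.arange(6, dtype=np.int32).reshape(2, 3)
m = memoryview(a)

# NumPy exports one contiguous block plus strides -- a *direct* buffer
# in PEP 3118 terms, so there are no suboffsets to dereference.
print(m.suboffsets)

# Consuming a direct buffer creates a view, not a copy.
b = np.asarray(m)
b[0, 0] = 42
print(a[0, 0])  # 42 -- b aliases a's memory
```

An indirect (PIL-style) producer would populate suboffsets, and that is exactly the case _array_from_buffer_3118() does not handle.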
URL: From aspire.janani at gmail.com Sun Dec 15 21:03:32 2013 From: aspire.janani at gmail.com (janani padmanabhan) Date: Mon, 16 Dec 2013 07:33:32 +0530 Subject: [Numpy-discussion] request for help Message-ID: Hello Hearty greetings I am a novice who is extremely interested in contributing to NumPy opensource community. How and where do I start? Thanking you -------------- next part -------------- An HTML attachment was scrubbed... URL: From david.jones74 at gmail.com Mon Dec 16 17:25:32 2013 From: david.jones74 at gmail.com (David Jones) Date: Mon, 16 Dec 2013 17:25:32 -0500 Subject: [Numpy-discussion] building 32 bit numpy on 64 bit linux In-Reply-To: <52AB5684.5050907@googlemail.com> References: <52AA075F.8060708@gmail.com> <52AA2276.8020007@googlemail.com> <52AA1116.6080809@gmail.com> <52AA2C0E.2090600@googlemail.com> <52AB137B.3000708@gmail.com> <52AB47F1.2040802@gmail.com> <52AB5684.5050907@googlemail.com> Message-ID: <52AF7DDC.4020706@gmail.com> On 12/13/13 13:48, Julian Taylor wrote: > On 13.12.2013 18:46, David Jones wrote: >> ... >> >> Correction. Of course LD_LIBRARY_PATH isn't seen by the compiler. It >> only applies at run time. How embarrasing:) This isn't the first time >> I've been bitten by that. >> >> I don't mind doing that with manual builds, but what about with pip? Is >> there a way to avoid explicitly setting the library path every time you >> call pip, when using a custom python install? >> > why are you actually doing this? > > the easiest way to get 32 bit binaries for any program is simply using a > 32 bit chroot to build them. > This is trivial to do on a debian based systems: > > pbuilder-dist i386 unstable create > pbuilder-dist i386 unstable login > # install dependencies and build as usual > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion Just a follow-up. 
After building python with the -rpath linker flag set, I can install numpy using pip without having to specify the library path in LDFLAGS. CC=gcc-32 LDSHARED=ld-32 pip install numpy However, this doesn't work for building numpy directly. So it seems that pip is doing some clever magic here. In addition, pip perpetuates the -rpath value from python when building packages, so they have the correct run-time path to libpython. From pierre.haessig at crans.org Tue Dec 17 10:41:37 2013 From: pierre.haessig at crans.org (Pierre Haessig) Date: Tue, 17 Dec 2013 16:41:37 +0100 Subject: [Numpy-discussion] repeat array in a fake dim without stride_tricks ? In-Reply-To: <1386938737.2127.1.camel@sebastian-laptop> References: <52A9C615.1010909@crans.org> <1386938737.2127.1.camel@sebastian-laptop> Message-ID: <52B070B1.60209@crans.org> Le 13/12/2013 13:45, Sebastian Berg a ?crit : >> What are the other options for such a repeat ? > No, I don't think there are any other options. stride tricks are a bit > hidden, since in many cases it is more dangerous than helping. > Though with some care you can easily implement such functions using > stride_tricks. Thanks ! Since it's in a buried helper function, I could use stride tricks. But for now I'll stick with broadcast_arrays because I have it working. Maybe the np.repeat function could be changed to avoid copying (when possible) ? best, Pierre -------------- next part -------------- A non-text attachment was scrubbed... Name: pierre_haessig.vcf Type: text/x-vcard Size: 329 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 897 bytes Desc: OpenPGP digital signature URL: From sebastian at sipsolutions.net Wed Dec 18 07:20:26 2013 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Wed, 18 Dec 2013 13:20:26 +0100 Subject: [Numpy-discussion] repeat array in a fake dim without stride_tricks ? 
In-Reply-To: <52B070B1.60209@crans.org> References: <52A9C615.1010909@crans.org> <1386938737.2127.1.camel@sebastian-laptop> <52B070B1.60209@crans.org> Message-ID: <1387369226.10610.6.camel@sebastian-laptop> On Tue, 2013-12-17 at 16:41 +0100, Pierre Haessig wrote: > Le 13/12/2013 13:45, Sebastian Berg a ?crit : > >> What are the other options for such a repeat ? > > No, I don't think there are any other options. stride tricks are a bit > > hidden, since in many cases it is more dangerous than helping. > > Though with some care you can easily implement such functions using > > stride_tricks. > Thanks ! > Since it's in a buried helper function, I could use stride tricks. But > for now I'll stick with broadcast_arrays because I have it working. > > Maybe the np.repeat function could be changed to avoid copying (when > possible) ? > I doubt it fits repeat well, since repeat also allows for repetition of single elements. The repeats that could be done like this are a few of the many ways repeat can be used. What may make sense is to add one or two thought out functions for common `as_strided` use cases, which always return valid arrays... - Sebastian > best, > Pierre > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From alan.isaac at gmail.com Wed Dec 18 17:23:01 2013 From: alan.isaac at gmail.com (Alan G Isaac) Date: Wed, 18 Dec 2013 17:23:01 -0500 Subject: [Numpy-discussion] polyfit Message-ID: <52B22045.4080901@gmail.com> For teaching it is certainly nice to have numpy.polynomial.polynomial.polyfit providing modern (vs. traditional) parameter order, but - it is rather buried - np.polyfit uses traditional order and has the same name I recall there was some controversy (?) 
over all of this, but might it not be appropriate to have a keyword argument to both specifying whether the parameter order is to be modern or traditional (in both polyfits and polyvals)? fwiw, Alan Isaac From charlesr.harris at gmail.com Wed Dec 18 17:38:13 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Wed, 18 Dec 2013 15:38:13 -0700 Subject: [Numpy-discussion] polyfit In-Reply-To: <52B22045.4080901@gmail.com> References: <52B22045.4080901@gmail.com> Message-ID: On Wed, Dec 18, 2013 at 3:23 PM, Alan G Isaac wrote: > For teaching it is certainly nice to have > numpy.polynomial.polynomial.polyfit > providing modern (vs. traditional) parameter order, but > > - it is rather buried > - np.polyfit uses traditional order and has the same name > > I recall there was some controversy (?) over all of this, > but might it not be appropriate to have a keyword argument to > both specifying whether the parameter order is to be modern > or traditional (in both polyfits and polyvals)? > > It would be messy, as there are a lot of functions in polynomial.polynomial that depend on the coefficient order. For the higher level Polynomial class, there is a routine in ipython that will do a pretty display of Polynomial instances which might be useful. Let's see... look here http://nbviewer.ipython.org/github/ipython/ipython/blob/master/examples/notebooks/Custom%20Display%20Logic.ipynband search the page for Polynomial. Chuck -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From charlesr.harris at gmail.com Wed Dec 18 17:51:40 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Wed, 18 Dec 2013 15:51:40 -0700 Subject: [Numpy-discussion] polyfit In-Reply-To: References: <52B22045.4080901@gmail.com> Message-ID: On Wed, Dec 18, 2013 at 3:38 PM, Charles R Harris wrote: > > > > On Wed, Dec 18, 2013 at 3:23 PM, Alan G Isaac wrote: > >> For teaching it is certainly nice to have >> numpy.polynomial.polynomial.polyfit >> providing modern (vs. traditional) parameter order, but >> >> - it is rather buried >> - np.polyfit uses traditional order and has the same name >> >> I recall there was some controversy (?) over all of this, >> but might it not be appropriate to have a keyword argument to >> both specifying whether the parameter order is to be modern >> or traditional (in both polyfits and polyvals)? >> >> It would be messy, as there are a lot of functions in > polynomial.polynomial that depend on the coefficient order. For the higher > level Polynomial class, there is a routine in ipython that will do a pretty > display of Polynomial instances which might be useful. Let's see... look > here > http://nbviewer.ipython.org/github/ipython/ipython/blob/master/examples/notebooks/Custom%20Display%20Logic.ipynband search the page for Polynomial. > > Although beginning students might find the domain/window concept used in Polynomial confusing... Strictly speaking, the polynomial should be rendered as powers of ((x - off)/scale) if the domain differs from the window. Chuck -------------- next part -------------- An HTML attachment was scrubbed... 
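The coefficient-order difference under discussion is easy to demonstrate (a short sketch, not from the thread):

```python
import numpy as np

x = np.array([0.0, 1.0, 2.0, 3.0])
y = 1.0 + 2.0 * x + 3.0 * x ** 2        # p(x) = 1 + 2x + 3x^2

trad = np.polyfit(x, y, 2)              # traditional order: highest power first
modern = np.polynomial.polynomial.polyfit(x, y, 2)  # modern order: lowest first

print(np.round(trad, 6))    # [3. 2. 1.]
print(np.round(modern, 6))  # [1. 2. 3.]
assert np.allclose(trad[::-1], modern)
```

The same reversal applies to the corresponding polyval functions, which is why a single order-selecting keyword would have to touch many functions at once.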
URL: From ethanbuchman at hotmail.com Wed Dec 18 21:17:11 2013 From: ethanbuchman at hotmail.com (ebuchman) Date: Wed, 18 Dec 2013 18:17:11 -0800 (PST) Subject: [Numpy-discussion] numpy.dot causes segfault after ctypes call to cudaMalloc Message-ID: <1387419431447-35910.post@n7.nabble.com> I'm learning CUDA and decided to use Python with ctypes to call all the CUDA functions, but I'm finding some memory issues. I've boiled it down to the simplest scenario. I use ctypes to call a CUDA function which allocates memory on the device and then frees it. This works fine, but if I then try to use np.dot on a completely unrelated array declared in Python, I get a segmentation fault. Note this only happens if the numpy array is sufficiently large. If I change the cuda mallocs to plain C mallocs, all the problems go away, but that's not really helpful. Any ideas what's going on here? CUDA CODE (debug.cu): #include <stdio.h> #include <cuda_runtime.h> extern "C" { void all_together(size_t N) { void *d; int size = N * sizeof(float); int err; err = cudaMalloc(&d, size); if (err != 0) printf("cuda malloc error: %d\n", err); err = cudaFree(d); if (err != 0) printf("cuda free error: %d\n", err); }} PYTHON CODE (master.py): import numpy as np import ctypes from ctypes import * dll = ctypes.CDLL('./cuda_lib.so', mode=ctypes.RTLD_GLOBAL) def build_all_together_f(dll): func = dll.all_together func.argtypes = [c_size_t] return func __pycu_all_together = build_all_together_f(dll) if __name__ == '__main__': N = 5001 # if this is less, the error doesn't show up a = np.random.randn(N).astype('float32') da = __pycu_all_together(N) # toggle this line on/off to get error #np.dot(a, a) print 'end of python' COMPILE: nvcc -Xcompiler -fPIC -shared -o cuda_lib.so debug.cu RUN: python master.py -- View this message in context: http://numpy-discussion.10968.n7.nabble.com/numpy-dot-causes-segfault-after-ctypes-call-to-cudaMalloc-tp35910.html Sent from the Numpy-discussion mailing list archive at Nabble.com. 
From andrei at ruivo.org Thu Dec 19 07:51:05 2013 From: andrei at ruivo.org (rootspin) Date: Thu, 19 Dec 2013 04:51:05 -0800 (PST) Subject: [Numpy-discussion] Array search considering order Message-ID: <1387457465139-35911.post@n7.nabble.com> Hello, Need some help in searching arrays (Im new to numpy) Is it possible to search a array, using another array considering order/sequence? x = np.array([1,2,3,4,5,6], np.int32) y = np.array([1,4,3,2,6,5], np.int32) query= np.array([1,2,3],np.int32) x versus query True y versus query False Tried with: np.searchsorted(x,query) -------> array([0, 1, 2]) np.searchsorted(y,query) -------> array([0, 1, 4]) Thanks -- View this message in context: http://numpy-discussion.10968.n7.nabble.com/Array-search-considering-order-tp35911.html Sent from the Numpy-discussion mailing list archive at Nabble.com. From oscar.j.benjamin at gmail.com Thu Dec 19 08:51:30 2013 From: oscar.j.benjamin at gmail.com (Oscar Benjamin) Date: Thu, 19 Dec 2013 13:51:30 +0000 Subject: [Numpy-discussion] Array search considering order In-Reply-To: <1387457465139-35911.post@n7.nabble.com> References: <1387457465139-35911.post@n7.nabble.com> Message-ID: On 19 December 2013 12:51, rootspin wrote: > Hello, > > Need some help in searching arrays (Im new to numpy) > Is it possible to search a array, using another array considering > order/sequence? > > x = np.array([1,2,3,4,5,6], np.int32) > > y = np.array([1,4,3,2,6,5], np.int32) > > query= np.array([1,2,3],np.int32) > > x versus query True > y versus query False > > > Tried with: > > np.searchsorted(x,query) -------> array([0, 1, 2]) > np.searchsorted(y,query) -------> array([0, 1, 4]) I'm not sure if I understand your problem do you mean that you want to find subarrays of an array the same way that e.g. 
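One way to attack this kind of ordered-match query (a hedged sketch; find_subarray is a made-up helper built on as_strided, the era-appropriate tool -- modern NumPy also has sliding_window_view): make a zero-copy view of every length-m window and compare each window against the query.

```python
import numpy as np
from numpy.lib.stride_tricks import as_strided

def find_subarray(a, q):
    """Indices i such that a[i:i+len(q)] equals q; empty if no match."""
    n, m = a.size, q.size
    if m > n:
        return np.array([], dtype=np.intp)
    # View of (n - m + 1) overlapping windows of length m -- no copy.
    windows = as_strided(a, shape=(n - m + 1, m),
                         strides=(a.strides[0], a.strides[0]))
    return np.flatnonzero((windows == q).all(axis=1))

x = np.array([1, 2, 3, 4, 5, 6], np.int32)
y = np.array([1, 4, 3, 2, 6, 5], np.int32)
query = np.array([1, 2, 3], np.int32)

print(find_subarray(x, query))  # [0] -- ordered match at index 0
print(find_subarray(y, query))  # []  -- no ordered match
```

Unlike searchsorted, which assumes sorted input and looks each element up independently, this respects the order of the query and reports every occurrence.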
"in" for strings tests for substrings: >>> 'asd' in 'qweasdzxc' True If so then perhaps this SO question has the answer you want: http://stackoverflow.com/questions/7100242/python-numpy-first-occurrence-of-subarray Oscar From andrei at ruivo.org Thu Dec 19 11:49:38 2013 From: andrei at ruivo.org (Andrei Rozanski) Date: Thu, 19 Dec 2013 11:49:38 -0500 Subject: [Numpy-discussion] Array search considering order In-Reply-To: References: <1387457465139-35911.post@n7.nabble.com> Message-ID: <20131219164937.GH25832@jake.ruivo.org> On Thu, Dec 19, 2013 at 01:51:30PM +0000, Oscar Benjamin wrote: > Date: Thu, 19 Dec 2013 13:51:30 +0000 > From: Oscar Benjamin > To: Discussion of Numerical Python > Subject: Re: [Numpy-discussion] Array search considering order > > On 19 December 2013 12:51, rootspin wrote: > > Hello, > > > > Need some help in searching arrays (Im new to numpy) > > Is it possible to search a array, using another array considering > > order/sequence? > > > > x = np.array([1,2,3,4,5,6], np.int32) > > > > y = np.array([1,4,3,2,6,5], np.int32) > > > > query= np.array([1,2,3],np.int32) > > > > x versus query True > > y versus query False > > > > > > Tried with: > > > > np.searchsorted(x,query) -------> array([0, 1, 2]) > > np.searchsorted(y,query) -------> array([0, 1, 4]) > > I'm not sure if I understand your problem do you mean that you want to > find subarrays of an array the same way that e.g. "in" for strings > tests for substrings: > > >>> 'asd' in 'qweasdzxc' > True > > If so then perhaps this SO question has the answer you want: > http://stackoverflow.com/questions/7100242/python-numpy-first-occurrence-of-subarray > > > Oscar > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion Thank you Oscar for your answer. Sorry if I was not clear enough. The SO question is alike what I want. 
However, in my problem, Im not sure if there will be only one occurence. What I do expect is: Given one array (big one), to retrieve indexes for "query array" occurences. Something like that: array1(big one) - 1,2,3,4,5,6,7,1,2,3,4,5,7,8,9,1,2,3,4,5,6,7,8 array2(query) - 1,2,3 array3(another query) - 8,7,6 result (array1 versus array2) - 0,7,15 (position for match) result (array1 versus array3) - no matches Thanks for the help. From sebastian at sipsolutions.net Thu Dec 19 12:26:02 2013 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Thu, 19 Dec 2013 18:26:02 +0100 Subject: [Numpy-discussion] Scalar result for field access -- Fix in minor release? Message-ID: <1387473962.3386.10.camel@sebastian-laptop> Hey, fixing a corner case indexing regression in 1.8, I noticed/fixed accidentally this behavior of returning a scalar when indexing a 0-d array with fields (see also [1]): arr = np.array((1,), dtype=[('a', 'f8')]) arr['a'] # Returns an array arr[['a']] # Currently returns a *scalar* I think no field access should try to convert 0-d arrays to scalars, and it was probably just an oversight. However, if anyone thinks there is even the slightest chance that this creates bugs in production code, we certainly should not change it in a bug-fix only release. Or am I missing something and the old behavior was actually intended? - Sebastian [1] https://github.com/numpy/numpy/issues/4109 From warren.weckesser at gmail.com Fri Dec 20 00:16:14 2013 From: warren.weckesser at gmail.com (Warren Weckesser) Date: Fri, 20 Dec 2013 00:16:14 -0500 Subject: [Numpy-discussion] git tag for version 1.8? Message-ID: Is version 1.8.0 tagged in git? I see tags up to 1.7.1. I suspect the tagging convention has changed in the git repo. How do I checkout v1.8.0? 
Warren

From siegfried.gonzi at ed.ac.uk Fri Dec 20 03:26:22 2013
From: siegfried.gonzi at ed.ac.uk (Siegfried Gonzi)
Date: Fri, 20 Dec 2013 08:26:22 +0000
Subject: [Numpy-discussion] Need help with np.ma.median and np.apply_along_axis
Message-ID: <20131220082622.13573qinn1ii0484@www.staffmail.ed.ac.uk>

Please have a look at version1 and version2. What are my other options here? Do I need to go the Cython route here?

Thanks,
Siegfried

==

My array is as follows (shown here with dummy values; and yes, this kind of array does exist: 150 observations x 8 years x 366 days x 24 hours x 7 model levels):

data = np.random.random((150,8,366,24,7))

My function "np.apply_along_axis(my_moment,4,data)" takes 15 minutes. I thought making use of masked arrays ("my_moment_fast(data,axis=4)") would speed things up, but:

1. It blows up the memory consumption to 6 GB at times, and
2. It also takes... I do not know, as I killed it after 20 minutes (it hangs at the median print statement).

The calculation of the median is the bottleneck here.
==

import numpy as np

def my_moment(data_in,nan_val=-999.0):
    tmp = data_in[np.where(data_in<>nan_val)]
    erg = np.array([np.min(tmp),np.mean(tmp),np.median(tmp),\
                    np.max(tmp),np.std(tmp),np.size(tmp)])
    return erg

def my_moment_fast(data_in,nan_val=-999.0,axis=4):
    import numpy as np
    print 'min max',np.min(data_in),np.max(data_in)
    mx = np.ma.masked_where((data_in<=0.0)&(data_in<=nan_val), data_in)
    print 'shape mx',np.shape(mx),np.min(mx),np.max(mx)
    print 'min'
    tmp_min = np.ma.min(mx,axis=axis)
    print 'max'
    tmp_max = np.ma.max(mx,axis=axis)
    print 'mean'
    tmp_mean = np.ma.mean(mx,axis=axis)
    print 'median'
    #tmp_median = np.ma.sort(mx,axis=axis)
    tmp_median = np.ma.median(mx,axis=axis)
    print 'std'
    tmp_std = np.ma.std(mx,axis=axis)
    print 'N'
    tmp_N = np.ones(np.shape(mx))
    tmp_N[mx.mask] = 0.0e0
    tmp_N = np.ma.sum(tmp_N,axis=axis)
    print 'shape min',np.shape(tmp_min),np.min(tmp_min),np.max(tmp_min)
    print 'shape max',np.shape(tmp_max),np.min(tmp_max),np.max(tmp_max)
    print 'shape mean',np.shape(tmp_mean),np.min(tmp_mean),np.max(tmp_mean)
    print 'shape median', np.shape(tmp_median), np.min(tmp_median), np.max(tmp_median)
    print 'shape std',np.shape(tmp_std),np.min(tmp_std),np.max(tmp_std)
    print 'shape N', np.shape(tmp_N), np.min(tmp_N), np.max(tmp_N), np.shape(mx.mask)
    return np.array([tmp_min,tmp_mean,tmp_median,tmp_max,tmp_std,tmp_N])

data = np.random.random((150,8,366,24,7))
data[134,5,300,:,2] = -999.0
data[14,3,300,:,0] = -999.0

version1 = my_moment_fast(data,axis=4)
exit()
version2 = np.apply_along_axis(my_moment,4,data)

==

What am I doing wrong here? I haven't tested it against Fortran and have got no idea if sorting for fetching the median would be faster.

Thanks,
Siegfried

==

--
The University of Edinburgh is a charitable body, registered in Scotland, with registration number SC005336.
From oscar.j.benjamin at gmail.com Fri Dec 20 06:19:49 2013 From: oscar.j.benjamin at gmail.com (Oscar Benjamin) Date: Fri, 20 Dec 2013 11:19:49 +0000 Subject: [Numpy-discussion] Array search considering order In-Reply-To: <20131219164937.GH25832@jake.ruivo.org> References: <1387457465139-35911.post@n7.nabble.com> <20131219164937.GH25832@jake.ruivo.org> Message-ID: On 19 December 2013 16:49, Andrei Rozanski wrote: > > Sorry if I was not clear enough. > The SO question is alike what I want. However, in my problem, Im not sure if there will be only one occurence. > What I do expect is: > Given one array (big one), to retrieve indexes for "query array" occurences. > Something like that: > > array1(big one) - 1,2,3,4,5,6,7,1,2,3,4,5,7,8,9,1,2,3,4,5,6,7,8 > array2(query) - 1,2,3 > array3(another query) - 8,7,6 > > result (array1 versus array2) - 0,7,15 (position for match) > result (array1 versus array3) - no matches If you look more closely at the SO question you'll see that it does answer your problem. The top-rated answer shows how to get an boolean array indicating where the matches are. In your case that would be [True, False, False, False, False, False, False, True, False, False, False, False, False, False, False, True, False, False, False, False, False] Now just use the numpy.where function to get the indices of the True value from this array. 
Oscar

From andrei at ruivo.org Fri Dec 20 06:41:04 2013
From: andrei at ruivo.org (Andrei Rozanski)
Date: Fri, 20 Dec 2013 06:41:04 -0500
Subject: [Numpy-discussion] Array search considering order
In-Reply-To:
References: <1387457465139-35911.post@n7.nabble.com> <20131219164937.GH25832@jake.ruivo.org>
Message-ID: <20131220114104.GI25832@jake.ruivo.org>

On Fri, Dec 20, 2013 at 11:19:49AM +0000, Oscar Benjamin wrote:
> Date: Fri, 20 Dec 2013 11:19:49 +0000
> From: Oscar Benjamin
> To: Discussion of Numerical Python
> Subject: Re: [Numpy-discussion] Array search considering order
>
> On 19 December 2013 16:49, Andrei Rozanski wrote:
> >
> > Sorry if I was not clear enough.
> > The SO question is alike what I want. However, in my problem, Im not sure if there will be only one occurence.
> > What I do expect is:
> > Given one array (big one), to retrieve indexes for "query array" occurences.
> > Something like that:
> >
> > array1(big one) - 1,2,3,4,5,6,7,1,2,3,4,5,7,8,9,1,2,3,4,5,6,7,8
> > array2(query) - 1,2,3
> > array3(another query) - 8,7,6
> >
> > result (array1 versus array2) - 0,7,15 (position for match)
> > result (array1 versus array3) - no matches
>
> If you look more closely at the SO question you'll see that it does
> answer your problem. The top-rated answer shows how to get an boolean
> array indicating where the matches are. In your case that would be
>
> [True, False, False, False, False, False, False, True, False, False,
> False, False, False, False, False, True, False, False, False, False,
> False]
>
> Now just use the numpy.where function to get the indices of the True
> value from this array.
>
> Oscar
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion

Hi Oscar,

Thank you for the help. Indeed, my lack of knowledge of numpy functions didn't allow me to get that. That works for me.

Best regards.
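For the archive, the shifted-comparison idea from the linked SO answer can be sketched as below. This is an illustration only, not code from the thread; the helper name `find_subarray` is made up.

```python
import numpy as np

def find_subarray(haystack, needle):
    """Return the start indices of every contiguous occurrence of
    `needle` in the 1-D array `haystack`."""
    haystack = np.asarray(haystack)
    needle = np.asarray(needle)
    n, m = haystack.size, needle.size
    if m == 0 or m > n:
        return np.array([], dtype=np.intp)
    # A start position k matches iff haystack[k + j] == needle[j] for
    # every offset j; accumulate that test over m shifted views.
    hits = np.ones(n - m + 1, dtype=bool)
    for j in range(m):
        hits &= haystack[j:n - m + 1 + j] == needle[j]
    return np.where(hits)[0]

array1 = np.array([1,2,3,4,5,6,7,1,2,3,4,5,7,8,9,1,2,3,4,5,6,7,8])
print(find_subarray(array1, [1, 2, 3]))  # [ 0  7 15]
print(find_subarray(array1, [8, 7, 6]))  # []
```

Only the loop over the (short) query runs in Python; the comparisons across the long array stay vectorized.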
From charlesr.harris at gmail.com Fri Dec 20 07:36:29 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Fri, 20 Dec 2013 05:36:29 -0700 Subject: [Numpy-discussion] git tag for version 1.8? In-Reply-To: References: Message-ID: On Thu, Dec 19, 2013 at 10:16 PM, Warren Weckesser < warren.weckesser at gmail.com> wrote: > Is version 1.8.0 tagged in git? I see tags up to 1.7.1. I suspect > the tagging convention has changed in the git repo. How do I checkout > v1.8.0? > > It's tagged. You can see it on github under branches/tags if you hit the tags tab. To get tags from upstream do `git fetch upstream --tags` Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From warren.weckesser at gmail.com Fri Dec 20 09:29:53 2013 From: warren.weckesser at gmail.com (Warren Weckesser) Date: Fri, 20 Dec 2013 09:29:53 -0500 Subject: [Numpy-discussion] git tag for version 1.8? In-Reply-To: References: Message-ID: On 12/20/13, Charles R Harris wrote: > On Thu, Dec 19, 2013 at 10:16 PM, Warren Weckesser < > warren.weckesser at gmail.com> wrote: > >> Is version 1.8.0 tagged in git? I see tags up to 1.7.1. I suspect >> the tagging convention has changed in the git repo. How do I checkout >> v1.8.0? >> >> > It's tagged. You can see it on github under branches/tags if you hit the > tags tab. To get tags from upstream do `git fetch upstream --tags` > Great, thanks. Warren > Chuck > From ralf.gommers at gmail.com Fri Dec 20 14:54:12 2013 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Fri, 20 Dec 2013 20:54:12 +0100 Subject: [Numpy-discussion] request for help In-Reply-To: References: Message-ID: On Mon, Dec 16, 2013 at 3:03 AM, janani padmanabhan wrote: > Hello > Hearty greetings > I am a novice who is extremely interested in contributing to NumPy > opensource community. How and where do I start? > Thanking you > Hi Janani, welcome! 
Here is a description of the mechanics of contributing (setting up a dev environment, using git/GitHub, etc.): http://docs.scipy.org/doc/numpy-dev/dev/. And this text gives some more background on what kind of things you can contribute and what's expected in terms of code quality, testing etc. (written for scipy, but it also applies to numpy): https://github.com/scipy/scipy/blob/master/HACKING.rst.txt.

Here are some issues labeled easy-fix, if you'd like to try out the workflow on something relatively straightforward to start with: https://github.com/numpy/numpy/issues?labels=Easy+Fix&page=1&state=open

Cheers,
Ralf

-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From opensource.numpy at user.fastmail.fm Sat Dec 21 16:16:11 2013
From: opensource.numpy at user.fastmail.fm (Hugo Gagnon)
Date: Sat, 21 Dec 2013 16:16:11 -0500
Subject: [Numpy-discussion] Segmentation fault with pickled numpy float64 arrays since 1.8
Message-ID: <28825B47-2CC7-41DB-A6D8-614098F5D803@user.fastmail.fm>

Hi,

Since I've updated numpy from 1.7 to 1.8 with EPD I get segmentation faults whenever I load back pickled float64 arrays. Here's a minimal example:

"""
import numpy
import cPickle

a = numpy.arange(5, dtype='float64')

with open('test.p', 'wb') as fh:
    cPickle.dump(a, fh)

with open('test.p') as fh:
    a2 = cPickle.load(fh)

print a2
"""

However the above works fine with int32 arrays, i.e. with a = numpy.arange(5).

Does anyone else experience this problem?

Thanks,
--
Hugo Gagnon

-------------- next part --------------
An HTML attachment was scrubbed...
URL: From charlesr.harris at gmail.com Sat Dec 21 17:20:06 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Sat, 21 Dec 2013 15:20:06 -0700 Subject: [Numpy-discussion] Segmentation fault with pickled numpy float64 arrays since 1.8 In-Reply-To: <28825B47-2CC7-41DB-A6D8-614098F5D803@user.fastmail.fm> References: <28825B47-2CC7-41DB-A6D8-614098F5D803@user.fastmail.fm> Message-ID: On Sat, Dec 21, 2013 at 2:16 PM, Hugo Gagnon < opensource.numpy at user.fastmail.fm> wrote: > Hi, > > Since I've updated numpy from 1.7 to 1.8 with EPD I get segmentation > faults whenever I load back pickled float64 arrays. Here's a minimal > example: > > """ > > import numpy > import cPickle > > a = numpy.arange(5, dtype='float64') > > with open('test.p', 'wb') as fh: > cPickle.dump(a, fh) > > with open('test.p') as fh: > a2 = cPickle.load(fh) > > print a2 > > """ > > However the above works fine with int32 arrays, i.e. with a = > numpy.arange(5). > > Does anyone else experience this problem? > > I don't see that here on 64 bit fedora 20, numpy-devel. What OS are you running? Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From opensource.numpy at user.fastmail.fm Sat Dec 21 17:38:39 2013 From: opensource.numpy at user.fastmail.fm (Hugo Gagnon) Date: Sat, 21 Dec 2013 17:38:39 -0500 Subject: [Numpy-discussion] Segmentation fault with pickled numpy float64 arrays since 1.8 In-Reply-To: References: <28825B47-2CC7-41DB-A6D8-614098F5D803@user.fastmail.fm> Message-ID: <6133D4BE-65FF-49A0-9BB1-CDAEB1F2E8BD@user.fastmail.fm> Mac OS 10.8.5 -- Hugo Gagnon On 2013-12-21, at 5:20 PM, Charles R Harris wrote: > > > > On Sat, Dec 21, 2013 at 2:16 PM, Hugo Gagnon wrote: > Hi, > > Since I've updated numpy from 1.7 to 1.8 with EPD I get segmentation faults whenever I load back pickled float64 arrays. 
Here's a minimal example: > > """ > > import numpy > import cPickle > > a = numpy.arange(5, dtype='float64') > > with open('test.p', 'wb') as fh: > cPickle.dump(a, fh) > > with open('test.p') as fh: > a2 = cPickle.load(fh) > > print a2 > > """ > > However the above works fine with int32 arrays, i.e. with a = numpy.arange(5). > > Does anyone else experience this problem? > > > I don't see that here on 64 bit fedora 20, numpy-devel. What OS are you running? > > Chuck > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion -------------- next part -------------- An HTML attachment was scrubbed... URL: From jrocher at enthought.com Sun Dec 22 15:34:29 2013 From: jrocher at enthought.com (Jonathan Rocher) Date: Sun, 22 Dec 2013 14:34:29 -0600 Subject: [Numpy-discussion] Segmentation fault with pickled numpy float64 arrays since 1.8 In-Reply-To: <6133D4BE-65FF-49A0-9BB1-CDAEB1F2E8BD@user.fastmail.fm> References: <28825B47-2CC7-41DB-A6D8-614098F5D803@user.fastmail.fm> <6133D4BE-65FF-49A0-9BB1-CDAEB1F2E8BD@user.fastmail.fm> Message-ID: I am not observing this with Canopy on MacOS 10.8.5 (64bit). I have numpy 1.8.0 and cPickle 1.71. My guess is that it is a problem with your setup. To figure out what is going on, I would have more questions for you so please email me privately a good email address to reach you at. Best, Jonathan On Sat, Dec 21, 2013 at 2:38 PM, Hugo Gagnon < opensource.numpy at user.fastmail.fm> wrote: > Mac OS 10.8.5 > > -- > Hugo Gagnon > > On 2013-12-21, at 5:20 PM, Charles R Harris > wrote: > > > > > On Sat, Dec 21, 2013 at 2:16 PM, Hugo Gagnon < > opensource.numpy at user.fastmail.fm> wrote: > >> Hi, >> >> Since I've updated numpy from 1.7 to 1.8 with EPD I get segmentation >> faults whenever I load back pickled float64 arrays. 
Here's a minimal >> example: >> >> """ >> >> import numpy >> import cPickle >> >> a = numpy.arange(5, dtype='float64') >> >> with open('test.p', 'wb') as fh: >> cPickle.dump(a, fh) >> >> with open('test.p') as fh: >> a2 = cPickle.load(fh) >> >> print a2 >> >> """ >> >> However the above works fine with int32 arrays, i.e. with a = >> numpy.arange(5). >> >> Does anyone else experience this problem? >> >> > I don't see that here on 64 bit fedora 20, numpy-devel. What OS are you > running? > > Chuck > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -- Jonathan Rocher, PhD Scientific software developer Enthought, Inc. jrocher at enthought.com 1-512-536-1057 http://www.enthought.com -------------- next part -------------- An HTML attachment was scrubbed... URL: From matti.picus at gmail.com Sun Dec 22 18:14:03 2013 From: matti.picus at gmail.com (Matti Picus) Date: Mon, 23 Dec 2013 01:14:03 +0200 Subject: [Numpy-discussion] C99 compatible complex number tests fail Message-ID: <52B7723B.7080900@gmail.com> Hi. I started to port the stdlib cmath C99 compatible complex number tests to numpy, after noticing that numpy seems to have different complex number routines than cmath. The work is available on a "retest_complex" branch of numpy https://github.com/mattip/numpy/tree/retest_complex The tests can be run by pulling the branch (no need to rebuild numpy) and running python /numpy/core/tests/test_umath_complex.py > test.log 2>&1 So far it is just a couple of commits that run the tests on numpy, I did not dive into modifying the math routines. 
If I did the work correctly, failures point to some differences, most due to edge cases with inf and nan, but there are a number of failures due to different finite values (for some small definition of different). I guess my first question is "did I do the tests properly". Assuming I did, the next question is "are the inconsistencies intentional" i.e. are they that way in order to be compatible with Matlab or some other non-C99 conformant library? For instance, a comparison between the implementation of cmath's sqrt and numpy's sqrt shows that numpy does not check for subnormals. And I am probably mistaken since I am new to the generator methods of numpy, but could it be that trigonometric functions like acos and acosh are generated in umath/funcs.inc.src, using a very different algorithm than cmathmodule.c? Would there be interest in a pull request that changed the routines to be more compatible with results from cmath? Matti From aspire.janani at gmail.com Tue Dec 24 01:35:49 2013 From: aspire.janani at gmail.com (janani padmanabhan) Date: Tue, 24 Dec 2013 12:05:49 +0530 Subject: [Numpy-discussion] Regression test Message-ID: Hello folks! I am a newbie, and I want to know how to add a regression test. Intuitively, I simply executed the corresponding test.py and got errors, even before having made any changes in the source code. How do I proceed? Thanking you in anticipation Janani (jennystone) -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Tue Dec 24 05:02:27 2013 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Tue, 24 Dec 2013 11:02:27 +0100 Subject: [Numpy-discussion] Regression test In-Reply-To: References: Message-ID: On Tue, Dec 24, 2013 at 7:35 AM, janani padmanabhan wrote: > Hello folks! > I am a newbie, and I want to know how to add a regression test. > Intuitively, I simply executed the corresponding test.py and got errors, > even before having made any changes in the source code. 
How do I proceed? > You run the numpy test suite by: import numpy as np np.test() # has some options, like 'full' and verbose=2 If you're developing it may also be helpful to use: python runtests.py This builds numpy and runs the tests at once. If you change something in the git repo and re-run this command, it's very fast due to only rebuilding the part that changed. After you can run the test, you can add your new test by following the examples that are already there. For any source file filename.py there's a corresponding file tests/test_filename.py Cheers, Ralf Thanking you in anticipation > Janani > (jennystone) > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From argriffi at ncsu.edu Fri Dec 27 04:54:47 2013 From: argriffi at ncsu.edu (alex) Date: Fri, 27 Dec 2013 04:54:47 -0500 Subject: [Numpy-discussion] Need help with np.ma.median and np.apply_along_axis Message-ID: median is faster in version 1.8 From jtaylor.debian at googlemail.com Fri Dec 27 05:30:14 2013 From: jtaylor.debian at googlemail.com (Julian Taylor) Date: Fri, 27 Dec 2013 11:30:14 +0100 Subject: [Numpy-discussion] Need help with np.ma.median and np.apply_along_axis In-Reply-To: References: Message-ID: <52BD56B6.7020406@googlemail.com> On 27.12.2013 10:54, alex wrote: > median is faster in version 1.8 > _______________________________________________ unfortunately that won't help here because masked median uses apply_along_axis again which is very slow, especially if one wants to calculate thousands of medians of 7 elements as in this case. Also masked median has not been changed to use partition yet. 
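For readers hitting the same bottleneck: the kind of workaround implied above can be sketched directly, replacing the per-row Python call with a single vectorized sort along the reduced axis. This is an illustration only, under the assumption that the mask marks a fill value such as -999.0; the function name is made up.

```python
import numpy as np

def median_lastaxis(a, nan_val=-999.0):
    # Turn fill values into NaN; np.sort pushes NaNs to the end of each
    # row, so the valid entries of every cell come first after sorting.
    x = np.sort(np.where(a == nan_val, np.nan, a.astype(float)), axis=-1)
    n = (~np.isnan(x)).sum(axis=-1)    # number of valid entries per cell
    lo, hi = (n - 1) // 2, n // 2      # middle one (odd n) or two (even n)
    idx = tuple(np.indices(n.shape))   # index arrays for all leading axes
    return 0.5 * (x[idx + (lo,)] + x[idx + (hi,)])

data = np.random.random((5, 4, 7))
data[2, 1, :3] = -999.0
print(median_lastaxis(data).shape)     # (5, 4)
```

One full sort replaces the 150 x 8 x 366 x 24 separate Python-level median calls of the apply_along_axis version; cells that are entirely fill values come out as NaN.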
From ndbecker2 at gmail.com Mon Dec 30 11:29:08 2013 From: ndbecker2 at gmail.com (Neal Becker) Date: Mon, 30 Dec 2013 11:29:08 -0500 Subject: [Numpy-discussion] proposal: min, max of complex should give warning Message-ID: I propose the following change: min, max applied to complex should give a warning. The rationale is, when the user applies min or max to complex, it's probably a mistake. From chekir.amira at gmail.com Tue Dec 31 08:13:57 2013 From: chekir.amira at gmail.com (Amira Chekir) Date: Tue, 31 Dec 2013 14:13:57 +0100 Subject: [Numpy-discussion] Loading large NIfTI file -> MemoryError Message-ID: Hello together, I try to load a (large) NIfTI file (DMRI from Human Connectome Project, about 1 GB) with NiBabel. import nibabel as nib img = nib.load("dmri.nii.gz") data = img.get_data() The program crashes during "img.get_data()" with an "MemoryError" (having 4 GB of RAM in my machine). Any suggestions? Best regards, AMIRA -------------- next part -------------- An HTML attachment was scrubbed... URL: From jtaylor.debian at googlemail.com Tue Dec 31 08:29:42 2013 From: jtaylor.debian at googlemail.com (Julian Taylor) Date: Tue, 31 Dec 2013 14:29:42 +0100 Subject: [Numpy-discussion] Loading large NIfTI file -> MemoryError In-Reply-To: References: Message-ID: <52C2C6C6.6070002@googlemail.com> On 31.12.2013 14:13, Amira Chekir wrote: > Hello together, > > I try to load a (large) NIfTI file (DMRI from Human Connectome Project, > about 1 GB) with NiBabel. > > import nibabel as nib > img = nib.load("dmri.nii.gz") > data = img.get_data() > > The program crashes during "img.get_data()" with an "MemoryError" > (having 4 GB of RAM in my machine). > > Any suggestions? are you using a 64 bit operating system? which version of numpy? assuming nibabel uses np.load under the hood you could try it with numpy 1.8 which reduces excess memory usage when loading compressed files. 
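One further angle on the memory question, with the caveat that it does not help for gzipped files directly: if the volume can be stored uncompressed, the raw data can be memory-mapped so that only the slices actually touched are read into RAM (as far as I know, nibabel does something similar itself for uncompressed .nii files). A generic numpy sketch, with a made-up file name and shape:

```python
import os
import tempfile
import numpy as np

# Write a small raw volume to disk to stand in for the decompressed data.
path = os.path.join(tempfile.mkdtemp(), "vol.dat")
vol = np.arange(2 * 3 * 4, dtype=np.float32).reshape(2, 3, 4)
vol.tofile(path)

# Memory-map it read-only: nothing is loaded until it is indexed.
mm = np.memmap(path, dtype=np.float32, mode="r", shape=(2, 3, 4))
slab = np.array(mm[0])  # pulls in just this slab
print(slab.sum())       # 66.0
```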
From tim at cerazone.net Tue Dec 31 08:51:52 2013 From: tim at cerazone.net (Cera, Tim) Date: Tue, 31 Dec 2013 08:51:52 -0500 Subject: [Numpy-discussion] proposal: min, max of complex should give warning In-Reply-To: References: Message-ID: I don't work with complex numbers, but just sampling what others do: Python: no ordering, results in TypeError Matlab: sorts by magnitude http://www.mathworks.com/help/matlab/ref/sort.html R: sorts first by real, then by imaginary http://stat.ethz.ch/R-manual/R-patched/library/base/html/sort.html Numpy: sorts first by real, then by imaginary (the documentation link below calls this sort 'lexicographical' which I don't think is correct) http://docs.scipy.org/doc/numpy/reference/generated/numpy.sort.html I would think that the Matlab sort might be more useful, but easy enough by using the absolute value. I think what Numpy does is normal enough to not justify a warning, but leave this to others because as I pointed out in the beginning I don't work with complex numbers. Kindest regards, Tim From ndbecker2 at gmail.com Tue Dec 31 10:52:47 2013 From: ndbecker2 at gmail.com (Neal Becker) Date: Tue, 31 Dec 2013 10:52:47 -0500 Subject: [Numpy-discussion] proposal: min, max of complex should give warning References: Message-ID: Cera, Tim wrote: > I don't work with complex numbers, but just sampling what others do: > > > Python: no ordering, results in TypeError > > Matlab: sorts by magnitude > http://www.mathworks.com/help/matlab/ref/sort.html > > R: sorts first by real, then by imaginary > http://stat.ethz.ch/R-manual/R-patched/library/base/html/sort.html > > Numpy: sorts first by real, then by imaginary (the documentation link > below calls this sort 'lexicographical' which I don't think is > correct) > http://docs.scipy.org/doc/numpy/reference/generated/numpy.sort.html > > > I would think that the Matlab sort might be more useful, but easy > enough by using the absolute value. 
> > I think what Numpy does is normal enough to not justify a warning, but > leave this to others because as I pointed out in the beginning I don't > work with complex numbers. > > Kindest regards, > Tim But I'm not proposing to change numpy's result, which I'm sure would raise many objections. I'm just asking to give a warning, because I think in most cases this is actually a mistake on the user's part. Just like the warning currently given when complex data are truncated to real part. From ralf.gommers at gmail.com Tue Dec 31 11:24:05 2013 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Tue, 31 Dec 2013 17:24:05 +0100 Subject: [Numpy-discussion] proposal: min, max of complex should give warning In-Reply-To: References: Message-ID: On Tue, Dec 31, 2013 at 4:52 PM, Neal Becker wrote: > Cera, Tim wrote: > > > I don't work with complex numbers, but just sampling what others do: > > > > > > Python: no ordering, results in TypeError > > > > Matlab: sorts by magnitude > > http://www.mathworks.com/help/matlab/ref/sort.html > > > > R: sorts first by real, then by imaginary > > http://stat.ethz.ch/R-manual/R-patched/library/base/html/sort.html > > > > Numpy: sorts first by real, then by imaginary (the documentation link > > below calls this sort 'lexicographical' which I don't think is > > correct) > > http://docs.scipy.org/doc/numpy/reference/generated/numpy.sort.html > > > > > > I would think that the Matlab sort might be more useful, but easy > > enough by using the absolute value. > > > > I think what Numpy does is normal enough to not justify a warning, but > > leave this to others because as I pointed out in the beginning I don't > > work with complex numbers. > > > > Kindest regards, > > Tim > > But I'm not proposing to change numpy's result, which I'm sure would raise > many > objections. I'm just asking to give a warning, because I think in most > cases > this is actually a mistake on the user's part. 
Just like the warning > currently > given when complex data are truncated to real part. > Keep in mind that warnings can be highly annoying. If you're a user who uses this functionality regularly (and you know what you're doing), then you're going to be very unhappy to have to wrap each function call in: olderr = np.seterr(all='ignore') max(...) np.seterr(**olderr) or in: with warnings.catch_warnings(): warnings.filterwarnings('ignore', ...) max(...) The actual behavior isn't documented now it looks like, so that should be done. In the Notes section of max/min probably. As for your proposal, it would be good to know if adding a warning would actually catch any bugs. For the truncation warning it caught several in scipy and other libs IIRC. Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From ndbecker2 at gmail.com Tue Dec 31 11:45:08 2013 From: ndbecker2 at gmail.com (Neal Becker) Date: Tue, 31 Dec 2013 11:45:08 -0500 Subject: [Numpy-discussion] proposal: min, max of complex should give warning References: Message-ID: Ralf Gommers wrote: > On Tue, Dec 31, 2013 at 4:52 PM, Neal Becker wrote: > >> Cera, Tim wrote: >> >> > I don't work with complex numbers, but just sampling what others do: >> > >> > >> > Python: no ordering, results in TypeError >> > >> > Matlab: sorts by magnitude >> > http://www.mathworks.com/help/matlab/ref/sort.html >> > >> > R: sorts first by real, then by imaginary >> > http://stat.ethz.ch/R-manual/R-patched/library/base/html/sort.html >> > >> > Numpy: sorts first by real, then by imaginary (the documentation link >> > below calls this sort 'lexicographical' which I don't think is >> > correct) >> > http://docs.scipy.org/doc/numpy/reference/generated/numpy.sort.html >> > >> > >> > I would think that the Matlab sort might be more useful, but easy >> > enough by using the absolute value. 
>> >
>> > I think what Numpy does is normal enough to not justify a warning, but
>> > leave this to others because as I pointed out in the beginning I don't
>> > work with complex numbers.
>> >
>> > Kindest regards,
>> > Tim
>>
>> But I'm not proposing to change numpy's result, which I'm sure would raise
>> many objections. I'm just asking to give a warning, because I think in most
>> cases this is actually a mistake on the user's part. Just like the warning
>> currently given when complex data are truncated to real part.
>>
> Keep in mind that warnings can be highly annoying. If you're a user who
> uses this functionality regularly (and you know what you're doing), then
> you're going to be very unhappy to have to wrap each function call in:
> olderr = np.seterr(all='ignore')
> max(...)
> np.seterr(**olderr)
> or in:
> with warnings.catch_warnings():
>     warnings.filterwarnings('ignore', ...)
>     max(...)
>
> The actual behavior isn't documented now, it looks like, so that should be
> done. In the Notes section of max/min probably.
>
> As for your proposal, it would be good to know if adding a warning would
> actually catch any bugs. For the truncation warning it caught several in
> scipy and other libs IIRC.
>
> Ralf

I tripped over it yesterday, which is what prompted my suggestion.

From jtaylor.debian at googlemail.com Tue Dec 31 11:57:18 2013
From: jtaylor.debian at googlemail.com (Julian Taylor)
Date: Tue, 31 Dec 2013 17:57:18 +0100
Subject: [Numpy-discussion] ANN: NumPy 1.7.2 release
Message-ID: <52C2F76E.9010509@googlemail.com>

Hello,

I'm happy to announce the release of NumPy 1.7.2. This is a bugfix-only release supporting Python 2.4 - 2.7 and 3.1 - 3.3.

More than 42 issues were fixed, the most important issues are listed in the release notes:
https://github.com/numpy/numpy/blob/v1.7.2/doc/release/1.7.2-notes.rst

Compared to the last release candidate, four additional minor issues have been fixed and compatibility with Python 3.4b1 improved.
Source tarballs, installers and release notes can be found at https://sourceforge.net/projects/numpy/files/NumPy/1.7.2 Cheers, Julian Taylor From charlesr.harris at gmail.com Tue Dec 31 12:47:44 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Tue, 31 Dec 2013 10:47:44 -0700 Subject: [Numpy-discussion] ANN: NumPy 1.7.2 release In-Reply-To: <52C2F76E.9010509@googlemail.com> References: <52C2F76E.9010509@googlemail.com> Message-ID: On Tue, Dec 31, 2013 at 9:57 AM, Julian Taylor < jtaylor.debian at googlemail.com> wrote: > Hello, > > I'm happy to announce the of Numpy 1.7.2. > This is a bugfix only release supporting Python 2.4 - 2.7 and 3.1 - 3.3. > > More than 42 issues were fixed, the most important issues are listed in > the release notes: > https://github.com/numpy/numpy/blob/v1.7.2/doc/release/1.7.2-notes.rst > > Compared to the last release candidate four additional minor issues have > been fixed and compatibility with python 3.4b1 improved. > > Source tarballs, installers and release notes can be found at > https://sourceforge.net/projects/numpy/files/NumPy/1.7.2 > > Congrats on the release. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From d.l.goldsmith at gmail.com Tue Dec 31 14:43:49 2013 From: d.l.goldsmith at gmail.com (David Goldsmith) Date: Tue, 31 Dec 2013 11:43:49 -0800 Subject: [Numpy-discussion] proposal: min, max of complex should give warning (Ralf Gommers) Message-ID: > > As for your proposal, it would be good to know if adding a warning would > actually catch any bugs. For the truncation warning it caught several in > scipy and other libs IIRC. > > Ralf > In light of this, perhaps the pertinent unit tests should be modified (even if the warning suggestion isn't adopted, about which I'm neutral...but I'm a little surprised that there isn't a generic way to globally turn off specific warnings). DG -------------- next part -------------- An HTML attachment was scrubbed... URL:
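For reference, the ordering discussed in this thread can be checked quickly (values picked arbitrarily):

```python
import numpy as np

# NumPy compares complex values by real part first and breaks ties by
# imaginary part, so min/max are well defined but ignore magnitude.
a = np.array([1 + 5j, 2 + 1j, 1 + 2j])

print(np.sort(a))       # 1+2j, 1+5j, 2+1j -- real part first, then imaginary
print(np.max(a))        # (2+1j): the largest real part wins
print(np.abs(a).max())  # ~5.099: Matlab-style "largest magnitude" instead
```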