From thomas.robitaille at gmail.com Tue Oct 1 03:01:40 2013 From: thomas.robitaille at gmail.com (Thomas Robitaille) Date: Tue, 1 Oct 2013 09:01:40 +0200 Subject: [Numpy-discussion] Issue with np.median and array subclasses in 1.8.0rc (worked with 1.7.0) Message-ID: Hi, The behavior for ``np.median`` and array sub-classes has changed in 1.8.0rc, which breaks unit-handling code (such as the ``quantities`` package, or ``astropy.units``): https://github.com/numpy/numpy/issues/3846 This previously worked from Numpy 1.5 (at least) to Numpy 1.7. Is there a new (and better) way to override the ``np.median`` behavior? Cheers, Tom From jensj at fysik.dtu.dk Tue Oct 1 06:00:45 2013 From: jensj at fysik.dtu.dk (=?ISO-8859-1?Q?Jens_J=F8rgen_Mortensen?=) Date: Tue, 1 Oct 2013 12:00:45 +0200 Subject: [Numpy-discussion] 1.8.0rc1 In-Reply-To: References: Message-ID: <524A9D4D.4080106@fysik.dtu.dk> Den 30-09-2013 17:17, Charles R Harris skrev: > Hi All, > > NumPy 1.8.0rc1 is up now on sourceforge > .The > binary builds are included except for Python 3.3 on windows, which > will arrive later. Many thanks to Ralf for the binaries, and to those > who found and fixed the bugs in the last beta. Any remaining bugs are > all my fault ;) I hope this will be the last release before final, so > please test it thoroughly. I seems that solve() doesn't like arrays like zeros((3, 0)). I get this with 1.8.0rc1 on Ubuntu 13.04 and Python 2.7.4: Python 2.7.4 (default, Apr 19 2013, 18:32:33) [GCC 4.7.3] on linux2 Type "help", "copyright", "credits" or "license" for more information. >>> import numpy as np >>> np >>> np.linalg.solve(np.eye(3), np.zeros((3, 0))) Traceback (most recent call last): File "", line 1, in File "/home/jensj/numpy-1.8.0rc1/lib/python/numpy/linalg/linalg.py", line 380, in solve r = gufunc(a, b, signature=signature, extobj=extobj) ValueError: cannot remove a zero-sized axis from an iterator with 1.7.1 it works OK: >>> np >>> np.linalg.solve(np.eye(3), np.zeros((3, 0))) array([], shape=(3, 0), dtype=float64) Jens J?rgen > Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From davidmenhur at gmail.com Tue Oct 1 06:11:12 2013 From: davidmenhur at gmail.com (=?UTF-8?B?RGHPgGlk?=) Date: Tue, 1 Oct 2013 12:11:12 +0200 Subject: [Numpy-discussion] 1.8.0rc1 In-Reply-To: References: Message-ID: On 30 September 2013 17:17, Charles R Harris wrote: > NumPy 1.8.0rc1 is up now on sourceforge.The binary builds are included except for Python 3.3 on windows, which > will arrive later. Many thanks to Ralf for the binaries, and to those who > found and fixed the bugs in the last beta. Any remaining bugs are all my > fault ;) I hope this will be the last release before final, so please test > it thoroughly. 
> I installed it with # python setup.py install But something is wrong there: >>> import numpy as np Traceback (most recent call last): File "", line 1, in File "/usr/lib64/python2.7/site-packages/numpy/__init__.py", line 137, in import add_newdocs File "/usr/lib64/python2.7/site-packages/numpy/add_newdocs.py", line 13, in from numpy.lib import add_newdoc File "/usr/lib64/python2.7/site-packages/numpy/lib/__init__.py", line 4, in from type_check import * File "/usr/lib64/python2.7/site-packages/numpy/lib/type_check.py", line 8, in import numpy.core.numeric as _nx File "/usr/lib64/python2.7/site-packages/numpy/core/__init__.py", line 45, in from numpy.testing import Tester File "/usr/lib64/python2.7/site-packages/numpy/testing/__init__.py", line 10, in import decorators as dec File "/usr/lib64/python2.7/site-packages/numpy/testing/decorators.py", line 19, in from numpy.testing.utils import \ File "/usr/lib64/python2.7/site-packages/numpy/testing/utils.py", line 12, in from .nosetester import import_nose File "/usr/lib64/python2.7/site-packages/numpy/testing/nosetester.py", line 12, in from numpy.compat import basestring ImportError: cannot import name basestring I am using Python27 on Fedora 19. $ gcc --version gcc (GCC) 4.8.1 20130603 (Red Hat 4.8.1-1) -------------- next part -------------- An HTML attachment was scrubbed... URL: From davidmenhur at gmail.com Tue Oct 1 06:18:03 2013 From: davidmenhur at gmail.com (=?UTF-8?B?RGHPgGlk?=) Date: Tue, 1 Oct 2013 12:18:03 +0200 Subject: [Numpy-discussion] 1.8.0rc1 In-Reply-To: References: Message-ID: Disregard that, I had not cleaned the previous installation properly. Sorry for the noise. On 1 October 2013 12:11, Da?id wrote: > > On 30 September 2013 17:17, Charles R Harris wrote: > >> NumPy 1.8.0rc1 is up now on sourceforge.The binary builds are included except for Python 3.3 on windows, which >> will arrive later. Many thanks to Ralf for the binaries, and to those who >> found and fixed the bugs in the last beta. Any remaining bugs are all my >> fault ;) I hope this will be the last release before final, so please test >> it thoroughly. >> > > I installed it with > > # python setup.py install > > But something is wrong there: > > >>> import numpy as np > > Traceback (most recent call last): > File "", line 1, in > File "/usr/lib64/python2.7/site-packages/numpy/__init__.py", line 137, > in > import add_newdocs > File "/usr/lib64/python2.7/site-packages/numpy/add_newdocs.py", line 13, > in > from numpy.lib import add_newdoc > File "/usr/lib64/python2.7/site-packages/numpy/lib/__init__.py", line 4, > in > from type_check import * > File "/usr/lib64/python2.7/site-packages/numpy/lib/type_check.py", line > 8, in > import numpy.core.numeric as _nx > File "/usr/lib64/python2.7/site-packages/numpy/core/__init__.py", line > 45, in > from numpy.testing import Tester > File "/usr/lib64/python2.7/site-packages/numpy/testing/__init__.py", > line 10, in > import decorators as dec > File "/usr/lib64/python2.7/site-packages/numpy/testing/decorators.py", > line 19, in > from numpy.testing.utils import \ > File "/usr/lib64/python2.7/site-packages/numpy/testing/utils.py", line > 12, in > from .nosetester import import_nose > File "/usr/lib64/python2.7/site-packages/numpy/testing/nosetester.py", > line 12, in > from numpy.compat import basestring > ImportError: cannot import name basestring > > > I am using Python27 on Fedora 19. 
> > $ gcc --version > gcc (GCC) 4.8.1 20130603 (Red Hat 4.8.1-1) > -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Tue Oct 1 06:29:01 2013 From: njs at pobox.com (Nathaniel Smith) Date: Tue, 1 Oct 2013 11:29:01 +0100 Subject: [Numpy-discussion] 1.8.0rc1 In-Reply-To: <524A9D4D.4080106@fysik.dtu.dk> References: <524A9D4D.4080106@fysik.dtu.dk> Message-ID: On Tue, Oct 1, 2013 at 11:00 AM, Jens J?rgen Mortensen wrote: > I seems that solve() doesn't like arrays like zeros((3, 0)). I get this > with 1.8.0rc1 on Ubuntu 13.04 and Python 2.7.4: Yep, looks like a simple bug in the new gufunc stuff, thanks for catching it. Filed here: https://github.com/numpy/numpy/issues/3847 -n From robert.kern at gmail.com Tue Oct 1 06:29:44 2013 From: robert.kern at gmail.com (Robert Kern) Date: Tue, 1 Oct 2013 11:29:44 +0100 Subject: [Numpy-discussion] Masked arrays: Rationale for "False convention" In-Reply-To: References: <524A3385.1030101@hawaii.edu> Message-ID: On Tue, Oct 1, 2013 at 3:57 AM, Ond?ej ?ert?k wrote: > I see, that makes sense. So to remember this, the rule is: > > "Specify elements that you want to get masked using True in 'mask'". Yes. This convention dates back at least to the original MA package in Numeric; I don't know if Paul Dubois stole it from any previous software. One way to motivate the convention is to think about doing a binary operation on masked arrays, which is really the most common kind of thing one does with masked arrays. The mask of the result is the logical OR of the two operand masks (barring additional masked elements from domain violations, 0/0, etc.). I assume that the convention was decided mostly on what was most convenient and efficient for the common internal operations for *implementing* the masked arrays and not necessarily matching any particular intuitions when putting data *into* the masked arrays. -- Robert Kern -------------- next part -------------- An HTML attachment was scrubbed... URL: From sebastian at sipsolutions.net Tue Oct 1 06:37:10 2013 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Tue, 01 Oct 2013 12:37:10 +0200 Subject: [Numpy-discussion] 1.8.0rc1 In-Reply-To: <524A9D4D.4080106@fysik.dtu.dk> References: <524A9D4D.4080106@fysik.dtu.dk> Message-ID: <1380623830.5259.2.camel@sebastian-laptop> On Tue, 2013-10-01 at 12:00 +0200, Jens J?rgen Mortensen wrote: > Den 30-09-2013 17:17, Charles R Harris skrev: > > > Hi All, > > > > > > NumPy 1.8.0rc1 is up now on sourceforge .The binary builds are > > included except for Python 3.3 on windows, which will arrive later. > > Many thanks to Ralf for the binaries, and to those who found and > > fixed the bugs in the last beta. Any remaining bugs are all my > > fault ;) I hope this will be the last release before final, so > > please test it thoroughly. > > > > I seems that solve() doesn't like arrays like zeros((3, 0)). I get > this with 1.8.0rc1 on Ubuntu 13.04 and Python 2.7.4: > Hey, can you create an issue on github so I don't forget it? This needs another special case to handle with the new gufunc based code. (Added checks there recently to fix these, but the `K=0` one is missing). - Sebastian > Python 2.7.4 (default, Apr 19 2013, 18:32:33) > [GCC 4.7.3] on linux2 > Type "help", "copyright", "credits" or "license" for more information. 
> >>> import numpy as np > >>> np > '/home/jensj/numpy-1.8.0rc1/lib/python/numpy/__init__.pyc'> > >>> np.linalg.solve(np.eye(3), np.zeros((3, 0))) > Traceback (most recent call last): > File "", line 1, in > File "/home/jensj/numpy-1.8.0rc1/lib/python/numpy/linalg/linalg.py", > line 380, in solve > r = gufunc(a, b, signature=signature, extobj=extobj) > ValueError: cannot remove a zero-sized axis from an iterator > > with 1.7.1 it works OK: > > >>> np > '/usr/lib/python2.7/dist-packages/numpy/__init__.pyc'> > >>> np.linalg.solve(np.eye(3), np.zeros((3, 0))) > array([], shape=(3, 0), dtype=float64) > > Jens J?rgen > > > Chuck > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From njs at pobox.com Tue Oct 1 06:41:26 2013 From: njs at pobox.com (Nathaniel Smith) Date: Tue, 1 Oct 2013 11:41:26 +0100 Subject: [Numpy-discussion] [SciPy-Dev] 1.8.0rc1 In-Reply-To: References: <5249F26B.2040109@uci.edu> <524A0561.4000808@uci.edu> <524A0CE3.3080502@googlemail.com> <524A1393.1040701@uci.edu> Message-ID: On Tue, Oct 1, 2013 at 3:27 AM, Charles R Harris wrote: >>> >> On Mon, Sep 30, 2013 at 5:12 PM, Christoph Gohlke >> >> > wrote: >>> >> > NumPy 1.8.0rc1 looks good. All tests pass on Windows and >>> >> most >>> >> 3rd party >>> >> > packages test OK now. Thank you. >>> >> > >>> >> > A few tests still fail in the following packages when run >>> >> with >>> >> > numpy-MKL-1.8.0rc1-win-amd64-py3.3 compared to >>> >> > numpy-MKL-1.7.1-win-amd64-py3.3: >>> >> > >>> >> > 1) Pandas 0.12.0 >>> >> > >>> >> > ``` >>> >> > >>> >> >>> >> ====================================================================== >>> >> > FAIL: test_nansum_buglet >>> >> (pandas.tests.test_series.TestNanops) >>> >> > >>> >> >>> >> ---------------------------------------------------------------------- >>> >> > Traceback (most recent call last): >>> >> > File >>> >> "X:\Python33\lib\site-packages\pandas\tests\test_series.py", >>> >> > line 254, in test_nansum_buglet >>> >> > assert_almost_equal(result, 1) >>> >> > File >>> >> "X:\Python33\lib\site-packages\pandas\util\testing.py", line >>> >> > 134, in assert_almost_equal >>> >> > np.testing.assert_(isiterable(b)) >>> >> > File >>> >> "D:\Dev\Compile\Test\numpy-build\numpy\testing\utils.py", line >>> >> > 44, in assert_ >>> >> > raise AssertionError(msg) >>> >> > AssertionError >>> >> > ``` >>> >> > >>> >> > Possibly related: >>> >> > >>> >> > ``` >>> >> > >>> import numpy as np >>> >> > >>> from pandas import Series >>> >> > >>> s = Series([0.0]) >>> >> > >>> result = np.nansum(s) >>> >> > >>> print(result) >>> >> > Traceback (most recent call last): >>> >> > File "", line 1, in >>> >> > File >>> >> "X:\Python33\lib\site-packages\pandas\core\base.py", line >>> >> > 19, in >>> >> > __str__ >>> >> > return self.__unicode__() >>> >> > File >>> >> "X:\Python33\lib\site-packages\pandas\core\series.py", line >>> >> > 1115, in __unicode__ >>> >> > length=len(self) > 50, >>> >> > TypeError: len() of unsized object >>> >> > ``` [...] > > The pandas test passes for current pandas dev, so it looks like a bug on > their end that has been taken care of. > > test_nansum_buglet (__main__.TestNanops) ... ok I'm concerned about this. 0.12.0 is currently the latest pandas release, so even if it is a bug on their side, we're going to be converting it from a latent bug to a real bug when we release... CC'ing pydata, do you guys have any insight into what changed here? 
The code is: s = pandas.Series([1.0, np.nan]) result = np.nansum(s) With numpy 1.7.1, 'result' comes out as a np.float64. With numpy maintenance/1.8.x, 'result' comes out as a 0-d Series object. Series is a subclass of ndarray, but it's supposed to always be 1-d, so all kinds of stuff blows up as soon as you have a 0-d Series object. I'm not sure what changed in numpy's nansum; if I try this same test with a simple no-op ndarray subclass: class MyArray(np.ndarray): pass np.nansum(np.array([1.0, np.nan]).view(MyArray)) then 1.7.1 and maintenance/1.8.x both act the same, and both return a 0-d MyArray, so it's not just a question of whether we remembered to handle subclasses at all. -n From njs at pobox.com Tue Oct 1 06:43:50 2013 From: njs at pobox.com (Nathaniel Smith) Date: Tue, 1 Oct 2013 11:43:50 +0100 Subject: [Numpy-discussion] [SciPy-Dev] 1.8.0rc1 In-Reply-To: <5249F26B.2040109@uci.edu> References: <5249F26B.2040109@uci.edu> Message-ID: On Mon, Sep 30, 2013 at 10:51 PM, Christoph Gohlke wrote: > 2) Bottleneck 0.7.0 > > https://github.com/kwgoodman/bottleneck/issues/71#issuecomment-25331701 I can't tell if these are real bugs in numpy, or tests checking that bottleneck is bug-for-bug compatible with old numpy and we just fixed some bugs, or what. It's clearly something to do with the nanarg{max,min} rewrite -- @charris, do you know what's going on here? -n From charlesr.harris at gmail.com Tue Oct 1 08:56:31 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Tue, 1 Oct 2013 06:56:31 -0600 Subject: [Numpy-discussion] [SciPy-Dev] 1.8.0rc1 In-Reply-To: References: <5249F26B.2040109@uci.edu> Message-ID: On Tue, Oct 1, 2013 at 4:43 AM, Nathaniel Smith wrote: > On Mon, Sep 30, 2013 at 10:51 PM, Christoph Gohlke > wrote: > > 2) Bottleneck 0.7.0 > > > > https://github.com/kwgoodman/bottleneck/issues/71#issuecomment-25331701 > > I can't tell if these are real bugs in numpy, or tests checking that > bottleneck is bug-for-bug compatible with old numpy and we just fixed > some bugs, or what. It's clearly something to do with the > nanarg{max,min} rewrite -- @charris, do you know what's going on here? > > Yes ;) The previous behaviour of nanarg for all-nan axis was to cast nan to intp when the result was an array, and return nan when a scalar. The current behaviour is to return the most negative value of intp as an error marker in both cases and raise a warning. It is a change in behavior, but I think one that needs to be made. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From grb at skogoglandskap.no Tue Oct 1 09:17:14 2013 From: grb at skogoglandskap.no (Graeme B. Bell) Date: Tue, 1 Oct 2013 13:17:14 +0000 Subject: [Numpy-discussion] review/approval request In-Reply-To: References: Message-ID: <3942095D-1770-4B51-B8DE-BE4C560A059E@skogoglandskap.no> Regarding 1.8: A few months ago I submitted a patch that fixes some bugs, including a major bug in select() that has been around for years, and optimised the code quite a bit. https://github.com/numpy/numpy/pull/3537 It would be nice to see the patch in 1.8, but it has languished for a couple of months waiting for final approval/merging. The patch is pretty small and the new code is not complicated. Are there any core devs that would have time/interest to help me get this approved/merged into numpy master? I have already adjusted the patch as suggested by njsmith to suit inclusion in master. Graeme. 
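For readers who don't know the function at issue: np.select picks, elementwise, from a list of choices according to the first condition that matches, falling back to a default. A minimal usage sketch (illustrative only, not code from the patch):

```
import numpy as np

x = np.arange(6)
condlist = [x < 2, x > 3]        # evaluated elementwise, first match wins
choicelist = [x, x**2]
np.select(condlist, choicelist, default=0)
# -> array([ 0,  1,  0,  0, 16, 25])
```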
>>> 331701 >> >> I can't tell if these are real bugs in numpy, or tests checking that >> bottleneck is bug-for-bug compatible with old numpy and we just fixed >> some bugs, or what. It's clearly something to do with the >> nanarg{max,min} rewrite -- @charris, do you know what's going on here? >> >> > Yes ;) The previous behaviour of nanarg for all-nan axis was to cast nan to > intp when the result was an array, and return nan when a scalar. The > current behaviour is to return the most negative value of intp as an error > marker in both cases and raise a warning. It is a change in behavior, but I > think one that needs to be made. > > Chuck > -------------- next part -------------- > An HTML attachment was scrubbed... > URL: http://mail.scipy.org/pipermail/numpy-discussion/attachments/20131001/829df70a/attachment.html > > ------------------------------ > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > > End of NumPy-Discussion Digest, Vol 85, Issue 2 > *********************************************** From njs at pobox.com Tue Oct 1 09:25:37 2013 From: njs at pobox.com (Nathaniel Smith) Date: Tue, 1 Oct 2013 14:25:37 +0100 Subject: [Numpy-discussion] [SciPy-Dev] 1.8.0rc1 In-Reply-To: References: <5249F26B.2040109@uci.edu> Message-ID: On Tue, Oct 1, 2013 at 1:56 PM, Charles R Harris wrote: > On Tue, Oct 1, 2013 at 4:43 AM, Nathaniel Smith wrote: >> >> On Mon, Sep 30, 2013 at 10:51 PM, Christoph Gohlke >> wrote: >> > 2) Bottleneck 0.7.0 >> > >> > https://github.com/kwgoodman/bottleneck/issues/71#issuecomment-25331701 >> >> I can't tell if these are real bugs in numpy, or tests checking that >> bottleneck is bug-for-bug compatible with old numpy and we just fixed >> some bugs, or what. It's clearly something to do with the >> nanarg{max,min} rewrite -- @charris, do you know what's going on here? >> > > Yes ;) The previous behaviour of nanarg for all-nan axis was to cast nan to > intp when the result was an array, and return nan when a scalar. The current > behaviour is to return the most negative value of intp as an error marker in > both cases and raise a warning. It is a change in behavior, but I think one > that needs to be made. Ah, okay! I kind of lost track of the nanfunc changes by the end there. So for the bottleneck issue, it sounds like the problem is just that bottleneck is still emulating the old numpy behaviour in this corner case, which isn't really a problem. So we don't really need to worry about that, both behaviours are correct, just maybe out of sync. I'm a little dubious about this "make up some weird value that will *probably* blow up if people try to use it without checking, and also raise a warning" thing, wouldn't it make more sense to just raise an error? That's what exceptions are for? I guess I should have said something earlier though... 
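A minimal sketch of what safely consuming the result looks like under the rc behaviour described above -- check for the marker value and silence the warning -- assuming the all-NaN warning is an ordinary Python warning that the warnings module can filter:

```
import warnings
import numpy as np

a = np.array([[np.nan, np.nan],
              [1.0,    np.nan]])
with warnings.catch_warnings():
    warnings.simplefilter("ignore")     # silence the all-NaN warning for this call only
    idx = np.nanargmax(a, axis=1)
bad = idx == np.iinfo(np.intp).min      # rows whose slice was all-NaN
# idx[~bad] holds the usable results; idx[bad] holds the error markers
```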
-n From njs at pobox.com Tue Oct 1 09:48:05 2013 From: njs at pobox.com (Nathaniel Smith) Date: Tue, 1 Oct 2013 14:48:05 +0100 Subject: [Numpy-discussion] [pydata] Re: [SciPy-Dev] 1.8.0rc1 In-Reply-To: <9c643170-cff1-41a8-9bb3-66aec33a38e3@googlegroups.com> References: <5249F26B.2040109@uci.edu> <524A0561.4000808@uci.edu> <524A0CE3.3080502@googlemail.com> <524A1393.1040701@uci.edu> <9c643170-cff1-41a8-9bb3-66aec33a38e3@googlegroups.com> Message-ID: On Tue, Oct 1, 2013 at 2:39 PM, Jeff wrote: > Just built from numpy master looks ok > > In [7]: pd.__version__ > Out[7]: '0.12.0' > > In [8]: np.__version__ > Out[8]: '1.9.0.dev-74abfa2' > > In [9]: s = pandas.Series([1.0, np.nan]) > > In [10]: np.nansum(s) > Out[10]: 1.0 > > In [11]: type(np.nansum(s)) > Out[11]: numpy.float64 Yeah, I can confirm this... With pandas 0.12.0, this code: np.nansum(pandas.Series([1.0, np.nan])) works correctly using current master (74abfa27), but is broken with current maintenance/1.8.x (f5fe91b9). -n > On Tuesday, October 1, 2013 6:41:26 AM UTC-4, Nathaniel Smith wrote: >> >> On Tue, Oct 1, 2013 at 3:27 AM, Charles R Harris >> wrote: >> >>> >> On Mon, Sep 30, 2013 at 5:12 PM, Christoph Gohlke > >>> >> > wrote: >> >>> >> > NumPy 1.8.0rc1 looks good. All tests pass on Windows and >> >>> >> most >> >>> >> 3rd party >> >>> >> > packages test OK now. Thank you. >> >>> >> > >> >>> >> > A few tests still fail in the following packages when >> >>> >> run >> >>> >> with >> >>> >> > numpy-MKL-1.8.0rc1-win-amd64-py3.3 compared to >> >>> >> > numpy-MKL-1.7.1-win-amd64-py3.3: >> >>> >> > >> >>> >> > 1) Pandas 0.12.0 >> >>> >> > >> >>> >> > ``` >> >>> >> > >> >>> >> >> >>> >> >> >>> >> ====================================================================== >> >>> >> > FAIL: test_nansum_buglet >> >>> >> (pandas.tests.test_series.TestNanops) >> >>> >> > >> >>> >> >> >>> >> >> >>> >> ---------------------------------------------------------------------- >> >>> >> > Traceback (most recent call last): >> >>> >> > File >> >>> >> "X:\Python33\lib\site-packages\pandas\tests\test_series.py", >> >>> >> > line 254, in test_nansum_buglet >> >>> >> > assert_almost_equal(result, 1) >> >>> >> > File >> >>> >> "X:\Python33\lib\site-packages\pandas\util\testing.py", line >> >>> >> > 134, in assert_almost_equal >> >>> >> > np.testing.assert_(isiterable(b)) >> >>> >> > File >> >>> >> "D:\Dev\Compile\Test\numpy-build\numpy\testing\utils.py", line >> >>> >> > 44, in assert_ >> >>> >> > raise AssertionError(msg) >> >>> >> > AssertionError >> >>> >> > ``` >> >>> >> > >> >>> >> > Possibly related: >> >>> >> > >> >>> >> > ``` >> >>> >> > >>> import numpy as np >> >>> >> > >>> from pandas import Series >> >>> >> > >>> s = Series([0.0]) >> >>> >> > >>> result = np.nansum(s) >> >>> >> > >>> print(result) >> >>> >> > Traceback (most recent call last): >> >>> >> > File "", line 1, in >> >>> >> > File >> >>> >> "X:\Python33\lib\site-packages\pandas\core\base.py", line >> >>> >> > 19, in >> >>> >> > __str__ >> >>> >> > return self.__unicode__() >> >>> >> > File >> >>> >> "X:\Python33\lib\site-packages\pandas\core\series.py", line >> >>> >> > 1115, in __unicode__ >> >>> >> > length=len(self) > 50, >> >>> >> > TypeError: len() of unsized object >> >>> >> > ``` >> [...] >> > >> > The pandas test passes for current pandas dev, so it looks like a bug on >> > their end that has been taken care of. >> > >> > test_nansum_buglet (__main__.TestNanops) ... ok >> >> I'm concerned about this. 
0.12.0 is currently the latest pandas >> release, so even if it is a bug on their side, we're going to be >> converting it from a latent bug to a real bug when we release... >> CC'ing pydata, do you guys have any insight into what changed here? >> >> The code is: >> s = pandas.Series([1.0, np.nan]) >> result = np.nansum(s) >> With numpy 1.7.1, 'result' comes out as a np.float64. With numpy >> maintenance/1.8.x, 'result' comes out as a 0-d Series object. Series >> is a subclass of ndarray, but it's supposed to always be 1-d, so all >> kinds of stuff blows up as soon as you have a 0-d Series object. >> >> I'm not sure what changed in numpy's nansum; if I try this same test >> with a simple no-op ndarray subclass: >> class MyArray(np.ndarray): >> pass >> np.nansum(np.array([1.0, np.nan]).view(MyArray)) >> then 1.7.1 and maintenance/1.8.x both act the same, and both return a >> 0-d MyArray, so it's not just a question of whether we remembered to >> handle subclasses at all. >> >> -n > > -- > You received this message because you are subscribed to the Google Groups > "PyData" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to pydata+unsubscribe at googlegroups.com. > For more options, visit https://groups.google.com/groups/opt_out. From charlesr.harris at gmail.com Tue Oct 1 09:52:33 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Tue, 1 Oct 2013 07:52:33 -0600 Subject: [Numpy-discussion] [SciPy-Dev] 1.8.0rc1 In-Reply-To: References: <5249F26B.2040109@uci.edu> Message-ID: On Tue, Oct 1, 2013 at 7:25 AM, Nathaniel Smith wrote: > On Tue, Oct 1, 2013 at 1:56 PM, Charles R Harris > wrote: > > On Tue, Oct 1, 2013 at 4:43 AM, Nathaniel Smith wrote: > >> > >> On Mon, Sep 30, 2013 at 10:51 PM, Christoph Gohlke > >> wrote: > >> > 2) Bottleneck 0.7.0 > >> > > >> > > https://github.com/kwgoodman/bottleneck/issues/71#issuecomment-25331701 > >> > >> I can't tell if these are real bugs in numpy, or tests checking that > >> bottleneck is bug-for-bug compatible with old numpy and we just fixed > >> some bugs, or what. It's clearly something to do with the > >> nanarg{max,min} rewrite -- @charris, do you know what's going on here? > >> > > > > Yes ;) The previous behaviour of nanarg for all-nan axis was to cast nan > to > > intp when the result was an array, and return nan when a scalar. The > current > > behaviour is to return the most negative value of intp as an error > marker in > > both cases and raise a warning. It is a change in behavior, but I think > one > > that needs to be made. > > Ah, okay! I kind of lost track of the nanfunc changes by the end there. > > So for the bottleneck issue, it sounds like the problem is just that > bottleneck is still emulating the old numpy behaviour in this corner > case, which isn't really a problem. So we don't really need to worry > about that, both behaviours are correct, just maybe out of sync. > > I'm a little dubious about this "make up some weird value that will > *probably* blow up if people try to use it without checking, and also > raise a warning" thing, wouldn't it make more sense to just raise an > error? That's what exceptions are for? I guess I should have said > something earlier though... > > I figure the blowup is safe, as we can't allocate arrays big enough that the minimum intp value would be a valid index. I considered raising an error, and if there is a consensus the behavior could be changed. Or we could add a keyword to determine the behavior. 
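A two-line sketch of the "blowup" being relied on -- the marker can never be in bounds for any allocatable array:

```
import numpy as np

a = np.arange(10.0)
a[np.iinfo(np.intp).min]   # raises IndexError; no array can be large enough
```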
Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Tue Oct 1 09:58:24 2013 From: njs at pobox.com (Nathaniel Smith) Date: Tue, 1 Oct 2013 14:58:24 +0100 Subject: [Numpy-discussion] [pydata] Re: [SciPy-Dev] 1.8.0rc1 In-Reply-To: References: <5249F26B.2040109@uci.edu> <524A0561.4000808@uci.edu> <524A0CE3.3080502@googlemail.com> <524A1393.1040701@uci.edu> Message-ID: I bet the difference is: In master, nansum ultimately calls arr.sum(...), which will be intercepted by Series.sum. In 1.8.x, nansum ultimately calls np.add.reduce(...), which can't be intercepted and will return the wrong thing. AFAICT the np.add.reduce(a, ...) call could just be replaced with a.sum(...), but I might be missing something... surely there must have been some reason it was written that way in the first place? On Tue, Oct 1, 2013 at 2:48 PM, Jeff wrote: > so...looks like a backport issue? > > > On Tuesday, October 1, 2013 6:41:26 AM UTC-4, Nathaniel Smith wrote: >> >> On Tue, Oct 1, 2013 at 3:27 AM, Charles R Harris >> wrote: >> >>> >> On Mon, Sep 30, 2013 at 5:12 PM, Christoph Gohlke > >>> >> > wrote: >> >>> >> > NumPy 1.8.0rc1 looks good. All tests pass on Windows and >> >>> >> most >> >>> >> 3rd party >> >>> >> > packages test OK now. Thank you. >> >>> >> > >> >>> >> > A few tests still fail in the following packages when >> >>> >> run >> >>> >> with >> >>> >> > numpy-MKL-1.8.0rc1-win-amd64-py3.3 compared to >> >>> >> > numpy-MKL-1.7.1-win-amd64-py3.3: >> >>> >> > >> >>> >> > 1) Pandas 0.12.0 >> >>> >> > >> >>> >> > ``` >> >>> >> > >> >>> >> >> >>> >> >> >>> >> ====================================================================== >> >>> >> > FAIL: test_nansum_buglet >> >>> >> (pandas.tests.test_series.TestNanops) >> >>> >> > >> >>> >> >> >>> >> >> >>> >> ---------------------------------------------------------------------- >> >>> >> > Traceback (most recent call last): >> >>> >> > File >> >>> >> "X:\Python33\lib\site-packages\pandas\tests\test_series.py", >> >>> >> > line 254, in test_nansum_buglet >> >>> >> > assert_almost_equal(result, 1) >> >>> >> > File >> >>> >> "X:\Python33\lib\site-packages\pandas\util\testing.py", line >> >>> >> > 134, in assert_almost_equal >> >>> >> > np.testing.assert_(isiterable(b)) >> >>> >> > File >> >>> >> "D:\Dev\Compile\Test\numpy-build\numpy\testing\utils.py", line >> >>> >> > 44, in assert_ >> >>> >> > raise AssertionError(msg) >> >>> >> > AssertionError >> >>> >> > ``` >> >>> >> > >> >>> >> > Possibly related: >> >>> >> > >> >>> >> > ``` >> >>> >> > >>> import numpy as np >> >>> >> > >>> from pandas import Series >> >>> >> > >>> s = Series([0.0]) >> >>> >> > >>> result = np.nansum(s) >> >>> >> > >>> print(result) >> >>> >> > Traceback (most recent call last): >> >>> >> > File "", line 1, in >> >>> >> > File >> >>> >> "X:\Python33\lib\site-packages\pandas\core\base.py", line >> >>> >> > 19, in >> >>> >> > __str__ >> >>> >> > return self.__unicode__() >> >>> >> > File >> >>> >> "X:\Python33\lib\site-packages\pandas\core\series.py", line >> >>> >> > 1115, in __unicode__ >> >>> >> > length=len(self) > 50, >> >>> >> > TypeError: len() of unsized object >> >>> >> > ``` >> [...] >> > >> > The pandas test passes for current pandas dev, so it looks like a bug on >> > their end that has been taken care of. >> > >> > test_nansum_buglet (__main__.TestNanops) ... ok >> >> I'm concerned about this. 
0.12.0 is currently the latest pandas >> release, so even if it is a bug on their side, we're going to be >> converting it from a latent bug to a real bug when we release... >> CC'ing pydata, do you guys have any insight into what changed here? >> >> The code is: >> s = pandas.Series([1.0, np.nan]) >> result = np.nansum(s) >> With numpy 1.7.1, 'result' comes out as a np.float64. With numpy >> maintenance/1.8.x, 'result' comes out as a 0-d Series object. Series >> is a subclass of ndarray, but it's supposed to always be 1-d, so all >> kinds of stuff blows up as soon as you have a 0-d Series object. >> >> I'm not sure what changed in numpy's nansum; if I try this same test >> with a simple no-op ndarray subclass: >> class MyArray(np.ndarray): >> pass >> np.nansum(np.array([1.0, np.nan]).view(MyArray)) >> then 1.7.1 and maintenance/1.8.x both act the same, and both return a >> 0-d MyArray, so it's not just a question of whether we remembered to >> handle subclasses at all. >> >> -n > > -- > You received this message because you are subscribed to the Google Groups > "PyData" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to pydata+unsubscribe at googlegroups.com. > For more options, visit https://groups.google.com/groups/opt_out. From njs at pobox.com Tue Oct 1 10:12:26 2013 From: njs at pobox.com (Nathaniel Smith) Date: Tue, 1 Oct 2013 15:12:26 +0100 Subject: [Numpy-discussion] nanargmax failure case (was: Re: [SciPy-Dev] 1.8.0rc1) Message-ID: [switching subject to break out from the giant 1.8.0rc1 thread] On Tue, Oct 1, 2013 at 2:52 PM, Charles R Harris wrote: > > > > On Tue, Oct 1, 2013 at 7:25 AM, Nathaniel Smith wrote: >> >> On Tue, Oct 1, 2013 at 1:56 PM, Charles R Harris >> wrote: >> > On Tue, Oct 1, 2013 at 4:43 AM, Nathaniel Smith wrote: >> >> >> >> On Mon, Sep 30, 2013 at 10:51 PM, Christoph Gohlke >> >> wrote: >> >> > 2) Bottleneck 0.7.0 >> >> > >> >> > >> >> > https://github.com/kwgoodman/bottleneck/issues/71#issuecomment-25331701 >> >> >> >> I can't tell if these are real bugs in numpy, or tests checking that >> >> bottleneck is bug-for-bug compatible with old numpy and we just fixed >> >> some bugs, or what. It's clearly something to do with the >> >> nanarg{max,min} rewrite -- @charris, do you know what's going on here? >> >> >> > >> > Yes ;) The previous behaviour of nanarg for all-nan axis was to cast nan >> > to >> > intp when the result was an array, and return nan when a scalar. The >> > current >> > behaviour is to return the most negative value of intp as an error >> > marker in >> > both cases and raise a warning. It is a change in behavior, but I think >> > one >> > that needs to be made. >> >> Ah, okay! I kind of lost track of the nanfunc changes by the end there. >> >> So for the bottleneck issue, it sounds like the problem is just that >> bottleneck is still emulating the old numpy behaviour in this corner >> case, which isn't really a problem. So we don't really need to worry >> about that, both behaviours are correct, just maybe out of sync. >> >> I'm a little dubious about this "make up some weird value that will >> *probably* blow up if people try to use it without checking, and also >> raise a warning" thing, wouldn't it make more sense to just raise an >> error? That's what exceptions are for? I guess I should have said >> something earlier though... >> > > I figure the blowup is safe, as we can't allocate arrays big enough that the > minimum intp value would be a valid index. 
I considered raising an error, > and if there is a consensus the behavior could be changed. Or we could add a > keyword to determine the behavior. Yeah, the intp value can't be a valid index, so that covers 95% of cases, but I'm worried about that other 5%. It could still pass silently as the endpoint of a slice, or participate in some sort of integer arithmetic calculation, etc. I assume you also share this worry to some extent or you wouldn't have put in the warning ;-). I guess the bigger question is, why would we *not* use the standard method for signaling an exceptional condition here, i.e., exceptions? That way we're 100% guaranteed that if people aren't prepared to handle it then they'll at least know something has gone wrong, and if they are prepared to handle it then it's very easy and standard, just use try/except. Right now I guess you have to check for the special value, but also do something to silence warnings, but just for that one line? Sounds kind of complicated... -n From charlesr.harris at gmail.com Tue Oct 1 10:13:37 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Tue, 1 Oct 2013 08:13:37 -0600 Subject: [Numpy-discussion] [pydata] Re: [SciPy-Dev] 1.8.0rc1 In-Reply-To: References: <5249F26B.2040109@uci.edu> <524A0561.4000808@uci.edu> <524A0CE3.3080502@googlemail.com> <524A1393.1040701@uci.edu> Message-ID: On Tue, Oct 1, 2013 at 7:58 AM, Nathaniel Smith wrote: > I bet the difference is: > > In master, nansum ultimately calls arr.sum(...), which will be > intercepted by Series.sum. > > In 1.8.x, nansum ultimately calls np.add.reduce(...), which can't be > intercepted and will return the wrong thing. > > AFAICT the np.add.reduce(a, ...) call could just be replaced with > a.sum(...), but I might be missing something... surely there must have > been some reason it was written that way in the first place? > No good reason, just a bit more efficient. The change for current master was because of the change in nansum behavior for empty slices. Changing the call to a.sum is not a problem, although I confess that it seems a bit fragile... Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Tue Oct 1 10:18:25 2013 From: njs at pobox.com (Nathaniel Smith) Date: Tue, 1 Oct 2013 15:18:25 +0100 Subject: [Numpy-discussion] [pydata] Re: [SciPy-Dev] 1.8.0rc1 In-Reply-To: References: <5249F26B.2040109@uci.edu> <524A0561.4000808@uci.edu> <524A0CE3.3080502@googlemail.com> <524A1393.1040701@uci.edu> Message-ID: On Tue, Oct 1, 2013 at 3:13 PM, Charles R Harris wrote: > > > > On Tue, Oct 1, 2013 at 7:58 AM, Nathaniel Smith wrote: >> >> I bet the difference is: >> >> In master, nansum ultimately calls arr.sum(...), which will be >> intercepted by Series.sum. >> >> In 1.8.x, nansum ultimately calls np.add.reduce(...), which can't be >> intercepted and will return the wrong thing. >> >> AFAICT the np.add.reduce(a, ...) call could just be replaced with >> a.sum(...), but I might be missing something... surely there must have >> been some reason it was written that way in the first place? > > No good reason, just a bit more efficient. The change for current master was > because of the change in nansum behavior for empty slices. Changing the call > to a.sum is not a problem, although I confess that it seems a bit fragile... Yeah, ndarray subclassing is always fragile :-/. But hopefully __numpy_ufunc__ will solve the problem in 1.9 and going forward...? (I forget if it's implemented for .reduce yet.) 
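A toy sketch of the interception difference described above -- the method call can be overridden by a subclass, while the ufunc reduction goes around it (MyArray here is just an illustrative stand-in, not pandas):

```
import numpy as np

class MyArray(np.ndarray):
    def sum(self, *args, **kwargs):
        # the subclass hooks the method call and returns whatever it likes
        return float(np.asarray(self).sum(*args, **kwargs))

a = np.array([1.0, 2.0]).view(MyArray)
a.sum()            # goes through MyArray.sum
np.add.reduce(a)   # bypasses MyArray.sum entirely
```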
Filed a tracker bug here: https://github.com/numpy/numpy/issues/3849 -n From charlesr.harris at gmail.com Tue Oct 1 10:20:09 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Tue, 1 Oct 2013 08:20:09 -0600 Subject: [Numpy-discussion] nanargmax failure case (was: Re: [SciPy-Dev] 1.8.0rc1) In-Reply-To: References: Message-ID: On Tue, Oct 1, 2013 at 8:12 AM, Nathaniel Smith wrote: > [switching subject to break out from the giant 1.8.0rc1 thread] > > On Tue, Oct 1, 2013 at 2:52 PM, Charles R Harris > wrote: > > > > > > > > On Tue, Oct 1, 2013 at 7:25 AM, Nathaniel Smith wrote: > >> > >> On Tue, Oct 1, 2013 at 1:56 PM, Charles R Harris > >> wrote: > >> > On Tue, Oct 1, 2013 at 4:43 AM, Nathaniel Smith > wrote: > >> >> > >> >> On Mon, Sep 30, 2013 at 10:51 PM, Christoph Gohlke > >> >> wrote: > >> >> > 2) Bottleneck 0.7.0 > >> >> > > >> >> > > >> >> > > https://github.com/kwgoodman/bottleneck/issues/71#issuecomment-25331701 > >> >> > >> >> I can't tell if these are real bugs in numpy, or tests checking that > >> >> bottleneck is bug-for-bug compatible with old numpy and we just fixed > >> >> some bugs, or what. It's clearly something to do with the > >> >> nanarg{max,min} rewrite -- @charris, do you know what's going on > here? > >> >> > >> > > >> > Yes ;) The previous behaviour of nanarg for all-nan axis was to cast > nan > >> > to > >> > intp when the result was an array, and return nan when a scalar. The > >> > current > >> > behaviour is to return the most negative value of intp as an error > >> > marker in > >> > both cases and raise a warning. It is a change in behavior, but I > think > >> > one > >> > that needs to be made. > >> > >> Ah, okay! I kind of lost track of the nanfunc changes by the end there. > >> > >> So for the bottleneck issue, it sounds like the problem is just that > >> bottleneck is still emulating the old numpy behaviour in this corner > >> case, which isn't really a problem. So we don't really need to worry > >> about that, both behaviours are correct, just maybe out of sync. > >> > >> I'm a little dubious about this "make up some weird value that will > >> *probably* blow up if people try to use it without checking, and also > >> raise a warning" thing, wouldn't it make more sense to just raise an > >> error? That's what exceptions are for? I guess I should have said > >> something earlier though... > >> > > > > I figure the blowup is safe, as we can't allocate arrays big enough that > the > > minimum intp value would be a valid index. I considered raising an error, > > and if there is a consensus the behavior could be changed. Or we could > add a > > keyword to determine the behavior. > > Yeah, the intp value can't be a valid index, so that covers 95% of > cases, but I'm worried about that other 5%. It could still pass > silently as the endpoint of a slice, or participate in some sort of > integer arithmetic calculation, etc. I assume you also share this > worry to some extent or you wouldn't have put in the warning ;-). > > I guess the bigger question is, why would we *not* use the standard > method for signaling an exceptional condition here, i.e., exceptions? > That way we're 100% guaranteed that if people aren't prepared to > handle it then they'll at least know something has gone wrong, and if > they are prepared to handle it then it's very easy and standard, just > use try/except. Right now I guess you have to check for the special > value, but also do something to silence warnings, but just for that > one line? Sounds kind of complicated... 
> The main reason was for the case of multiple axis, where some of the results would be valid and others not. The simple thing might be to raise an exception but keep the current return values so that users could determine where the problem occurred. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Tue Oct 1 10:47:14 2013 From: njs at pobox.com (Nathaniel Smith) Date: Tue, 1 Oct 2013 15:47:14 +0100 Subject: [Numpy-discussion] nanargmax failure case (was: Re: [SciPy-Dev] 1.8.0rc1) In-Reply-To: References: Message-ID: On Tue, Oct 1, 2013 at 3:20 PM, Charles R Harris wrote: > > > > On Tue, Oct 1, 2013 at 8:12 AM, Nathaniel Smith wrote: >> >> [switching subject to break out from the giant 1.8.0rc1 thread] >> >> On Tue, Oct 1, 2013 at 2:52 PM, Charles R Harris >> wrote: >> > >> > >> > >> > On Tue, Oct 1, 2013 at 7:25 AM, Nathaniel Smith wrote: >> >> >> >> On Tue, Oct 1, 2013 at 1:56 PM, Charles R Harris >> >> wrote: >> >> > On Tue, Oct 1, 2013 at 4:43 AM, Nathaniel Smith >> >> > wrote: >> >> >> >> >> >> On Mon, Sep 30, 2013 at 10:51 PM, Christoph Gohlke >> >> >> wrote: >> >> >> > 2) Bottleneck 0.7.0 >> >> >> > >> >> >> > >> >> >> > >> >> >> > https://github.com/kwgoodman/bottleneck/issues/71#issuecomment-25331701 >> >> >> >> >> >> I can't tell if these are real bugs in numpy, or tests checking that >> >> >> bottleneck is bug-for-bug compatible with old numpy and we just >> >> >> fixed >> >> >> some bugs, or what. It's clearly something to do with the >> >> >> nanarg{max,min} rewrite -- @charris, do you know what's going on >> >> >> here? >> >> >> >> >> > >> >> > Yes ;) The previous behaviour of nanarg for all-nan axis was to cast >> >> > nan >> >> > to >> >> > intp when the result was an array, and return nan when a scalar. The >> >> > current >> >> > behaviour is to return the most negative value of intp as an error >> >> > marker in >> >> > both cases and raise a warning. It is a change in behavior, but I >> >> > think >> >> > one >> >> > that needs to be made. >> >> >> >> Ah, okay! I kind of lost track of the nanfunc changes by the end there. >> >> >> >> So for the bottleneck issue, it sounds like the problem is just that >> >> bottleneck is still emulating the old numpy behaviour in this corner >> >> case, which isn't really a problem. So we don't really need to worry >> >> about that, both behaviours are correct, just maybe out of sync. >> >> >> >> I'm a little dubious about this "make up some weird value that will >> >> *probably* blow up if people try to use it without checking, and also >> >> raise a warning" thing, wouldn't it make more sense to just raise an >> >> error? That's what exceptions are for? I guess I should have said >> >> something earlier though... >> >> >> > >> > I figure the blowup is safe, as we can't allocate arrays big enough that >> > the >> > minimum intp value would be a valid index. I considered raising an >> > error, >> > and if there is a consensus the behavior could be changed. Or we could >> > add a >> > keyword to determine the behavior. >> >> Yeah, the intp value can't be a valid index, so that covers 95% of >> cases, but I'm worried about that other 5%. It could still pass >> silently as the endpoint of a slice, or participate in some sort of >> integer arithmetic calculation, etc. I assume you also share this >> worry to some extent or you wouldn't have put in the warning ;-). 
>> >> I guess the bigger question is, why would we *not* use the standard >> method for signaling an exceptional condition here, i.e., exceptions? >> That way we're 100% guaranteed that if people aren't prepared to >> handle it then they'll at least know something has gone wrong, and if >> they are prepared to handle it then it's very easy and standard, just >> use try/except. Right now I guess you have to check for the special >> value, but also do something to silence warnings, but just for that >> one line? Sounds kind of complicated... > > > The main reason was for the case of multiple axis, where some of the results > would be valid and others not. The simple thing might be to raise an > exception but keep the current return values so that users could determine > where the problem occurred. Oh, duh, yes, right, now I remember this discussion. Sorry for being slow. In the past we've *always* raised in error in the multiple axis case, right? Has anyone ever complained? Wanting to get all nanargmax/nanargmin results, of which some might be errors, without just writing a loop, seems like a pretty exotic case to me, so I'm not sure we should optimize for it at the expense of returning possibly-misleading results in the scalar case. Like (I think) you say, we could get the best of both worlds by encoding the results in the same way we do right now, but then raise an exception and attach the results to the exception so they can be retrieved if wanted. Kind of cumbersome, but maybe good? This is a more general problem though of course -- we've run into it in the gufunc linalg code too, where there's some question about you do in e.g. chol() if some sub-matrices are positive-definite and some are not. Off the top of my head the general solution might be to define a MultiError exception type that has a standard generic format for describing such things. It'd need a mask saying which values were valid, rather than encoding them directly into the return values -- otherwise we have the problem where nanargmax wants to use INT_MIN, chol wants to use NaN, and maybe the next function along doesn't have any usable flag value available at all. So probably more thought is needed before nailing down exactly how we handle such "partial" errors for vectorized functions. In the short term (i.e., 1.8.0), maybe we should defer this discussion by simply raising a regular ValueError for nanarg functions on all errors? That's not a regression from 1.7, since 1.7 also didn't provide any way to get at partial results in the event of an error, and it leaves us in a good position to solve the more general problem later. -n From robert.kern at gmail.com Tue Oct 1 11:52:06 2013 From: robert.kern at gmail.com (Robert Kern) Date: Tue, 1 Oct 2013 16:52:06 +0100 Subject: [Numpy-discussion] [SciPy-Dev] 1.8.0rc1 In-Reply-To: References: <20131001132847.GA14614@bromo.med.uc.edu> Message-ID: On Tue, Oct 1, 2013 at 4:41 PM, Pauli Virtanen wrote: > > Hi, > > 01.10.2013 16:28, Jack Howarth kirjoitti: > [clip] > > /sw/bin/python2.7 setup.py build > > > > which fails at... 
> > > > /sw/bin/gfortran -Wall -L/sw/lib build/temp.macosx-10.6-x86_64-2.7/numpy/linalg/lapack_litemodule.o build/temp.macosx-10.6-x86_64-2.7/numpy/linalg/lapack_lite/python_xerbla.o -L/sw/lib -L/sw/lib/gcc4.8/lib/gcc/x86_64-apple-darwin10.8.0/4.8.1 -Lbuild/temp.macosx-10.6-x86_64-2.7 -llapack -lptf77blas -lptcblas -latlas -lgfortran -o build/lib.macosx-10.6-x86_64-2.7/numpy/linalg/lapack_lite.so > > Undefined symbols for architecture x86_64: > > "_main", referenced from: > > start in crt1.10.6.o > [clip] > > Something is screwed up in your build environment: the `-shared` flag is > missing from the link command. > > Perhaps you have set one of the the environment variables FFLAGS, > CFLAGS, LDFLAGS? Also the `-undefined dynamic_lookup` flag. -- Robert Kern -------------- next part -------------- An HTML attachment was scrubbed... URL: From robert.kern at gmail.com Tue Oct 1 12:10:39 2013 From: robert.kern at gmail.com (Robert Kern) Date: Tue, 1 Oct 2013 17:10:39 +0100 Subject: [Numpy-discussion] [SciPy-Dev] 1.8.0rc1 In-Reply-To: <20131001160221.GA15735@bromo.med.uc.edu> References: <20131001132847.GA14614@bromo.med.uc.edu> <20131001160221.GA15735@bromo.med.uc.edu> Message-ID: On Tue, Oct 1, 2013 at 5:02 PM, Jack Howarth wrote: > > On Tue, Oct 01, 2013 at 04:52:06PM +0100, Robert Kern wrote: > > On Tue, Oct 1, 2013 at 4:41 PM, Pauli Virtanen wrote: > > > > > > Hi, > > > > > > 01.10.2013 16:28, Jack Howarth kirjoitti: > > > [clip] > > > > /sw/bin/python2.7 setup.py build > > > > > > > > which fails at... > > > > > > > > /sw/bin/gfortran -Wall -L/sw/lib > > build/temp.macosx-10.6-x86_64-2.7/numpy/linalg/lapack_litemodule.o > > build/temp.macosx-10.6-x86_64-2.7/numpy/linalg/lapack_lite/python_xerbla.o > > -L/sw/lib -L/sw/lib/gcc4.8/lib/gcc/x86_64-apple-darwin10.8.0/4.8.1 > > -Lbuild/temp.macosx-10.6-x86_64-2.7 -llapack -lptf77blas -lptcblas -latlas > > -lgfortran -o build/lib.macosx-10.6-x86_64-2.7/numpy/linalg/lapack_lite.so > > > > Undefined symbols for architecture x86_64: > > > > "_main", referenced from: > > > > start in crt1.10.6.o > > > [clip] > > > > > > Something is screwed up in your build environment: the `-shared` flag is > > > missing from the link command. > > > > > > Perhaps you have set one of the the environment variables FFLAGS, > > > CFLAGS, LDFLAGS? > > > > Also the `-undefined dynamic_lookup` flag. > > The consensus of the fink developers is that you are introducing a bug in both > scipy and numpy. The build should be able to pass additional flags on these > variables and the scipy/numpy build should be able to append any additional > flags required. In particular, both MacPorts and fink will want to be able to > pass -L/opt/local/lib or -L/sw/lib via LDFLAGS. The changes added to scipy and > numpy have broken this and now require that these additional flags be manually > patched into the Makefiles of numpy and scipy rather than just passing them > on LDFLAGS as has always worked in the past. Oh no it hasn't. It has been a consistent thorn in our side for a very long time. In the case of Fortran modules built by numpy.distutils, $LDFLAGS has replaced rather than appended flags since time immemorial. It is a compromise solution to work around the fact that the wide variety of Fortran compilers are very finicky about their flags, and distutils is not very accommodating about letting users change the flags to suit their local environments. 
If you think you have a better solution to this problem that does not degrade the existing flexibility, your PR will be cheerfully accepted. No one thinks this is desirable behavior, but it is most certainly not *new* behavior. -- Robert Kern -------------- next part -------------- An HTML attachment was scrubbed... URL: From josef.pktd at gmail.com Tue Oct 1 12:19:14 2013 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Tue, 1 Oct 2013 12:19:14 -0400 Subject: [Numpy-discussion] nanargmax failure case (was: Re: [SciPy-Dev] 1.8.0rc1) In-Reply-To: References: Message-ID: On Tue, Oct 1, 2013 at 10:47 AM, Nathaniel Smith wrote: > On Tue, Oct 1, 2013 at 3:20 PM, Charles R Harris > wrote: >> >> >> >> On Tue, Oct 1, 2013 at 8:12 AM, Nathaniel Smith wrote: >>> >>> [switching subject to break out from the giant 1.8.0rc1 thread] >>> >>> On Tue, Oct 1, 2013 at 2:52 PM, Charles R Harris >>> wrote: >>> > >>> > >>> > >>> > On Tue, Oct 1, 2013 at 7:25 AM, Nathaniel Smith wrote: >>> >> >>> >> On Tue, Oct 1, 2013 at 1:56 PM, Charles R Harris >>> >> wrote: >>> >> > On Tue, Oct 1, 2013 at 4:43 AM, Nathaniel Smith >>> >> > wrote: >>> >> >> >>> >> >> On Mon, Sep 30, 2013 at 10:51 PM, Christoph Gohlke >>> >> >> wrote: >>> >> >> > 2) Bottleneck 0.7.0 >>> >> >> > >>> >> >> > >>> >> >> > >>> >> >> > https://github.com/kwgoodman/bottleneck/issues/71#issuecomment-25331701 >>> >> >> >>> >> >> I can't tell if these are real bugs in numpy, or tests checking that >>> >> >> bottleneck is bug-for-bug compatible with old numpy and we just >>> >> >> fixed >>> >> >> some bugs, or what. It's clearly something to do with the >>> >> >> nanarg{max,min} rewrite -- @charris, do you know what's going on >>> >> >> here? >>> >> >> >>> >> > >>> >> > Yes ;) The previous behaviour of nanarg for all-nan axis was to cast >>> >> > nan >>> >> > to >>> >> > intp when the result was an array, and return nan when a scalar. The >>> >> > current >>> >> > behaviour is to return the most negative value of intp as an error >>> >> > marker in >>> >> > both cases and raise a warning. It is a change in behavior, but I >>> >> > think >>> >> > one >>> >> > that needs to be made. >>> >> >>> >> Ah, okay! I kind of lost track of the nanfunc changes by the end there. >>> >> >>> >> So for the bottleneck issue, it sounds like the problem is just that >>> >> bottleneck is still emulating the old numpy behaviour in this corner >>> >> case, which isn't really a problem. So we don't really need to worry >>> >> about that, both behaviours are correct, just maybe out of sync. >>> >> >>> >> I'm a little dubious about this "make up some weird value that will >>> >> *probably* blow up if people try to use it without checking, and also >>> >> raise a warning" thing, wouldn't it make more sense to just raise an >>> >> error? That's what exceptions are for? I guess I should have said >>> >> something earlier though... >>> >> >>> > >>> > I figure the blowup is safe, as we can't allocate arrays big enough that >>> > the >>> > minimum intp value would be a valid index. I considered raising an >>> > error, >>> > and if there is a consensus the behavior could be changed. Or we could >>> > add a >>> > keyword to determine the behavior. >>> >>> Yeah, the intp value can't be a valid index, so that covers 95% of >>> cases, but I'm worried about that other 5%. It could still pass >>> silently as the endpoint of a slice, or participate in some sort of >>> integer arithmetic calculation, etc. 
I assume you also share this >>> worry to some extent or you wouldn't have put in the warning ;-). >>> >>> I guess the bigger question is, why would we *not* use the standard >>> method for signaling an exceptional condition here, i.e., exceptions? >>> That way we're 100% guaranteed that if people aren't prepared to >>> handle it then they'll at least know something has gone wrong, and if >>> they are prepared to handle it then it's very easy and standard, just >>> use try/except. Right now I guess you have to check for the special >>> value, but also do something to silence warnings, but just for that >>> one line? Sounds kind of complicated... >> >> >> The main reason was for the case of multiple axis, where some of the results >> would be valid and others not. The simple thing might be to raise an >> exception but keep the current return values so that users could determine >> where the problem occurred. > > Oh, duh, yes, right, now I remember this discussion. Sorry for being slow. > > In the past we've *always* raised in error in the multiple axis case, > right? Has anyone ever complained? Wanting to get all > nanargmax/nanargmin results, of which some might be errors, without > just writing a loop, seems like a pretty exotic case to me, so I'm not > sure we should optimize for it at the expense of returning > possibly-misleading results in the scalar case. > > Like (I think) you say, we could get the best of both worlds by > encoding the results in the same way we do right now, but then raise > an exception and attach the results to the exception so they can be > retrieved if wanted. Kind of cumbersome, but maybe good? > > This is a more general problem though of course -- we've run into it > in the gufunc linalg code too, where there's some question about you > do in e.g. chol() if some sub-matrices are positive-definite and some > are not. > > Off the top of my head the general solution might be to define a > MultiError exception type that has a standard generic format for > describing such things. It'd need a mask saying which values were > valid, rather than encoding them directly into the return values -- > otherwise we have the problem where nanargmax wants to use INT_MIN, > chol wants to use NaN, and maybe the next function along doesn't have > any usable flag value available at all. So probably more thought is > needed before nailing down exactly how we handle such "partial" errors > for vectorized functions. > > In the short term (i.e., 1.8.0), maybe we should defer this discussion > by simply raising a regular ValueError for nanarg functions on all > errors? That's not a regression from 1.7, since 1.7 also didn't > provide any way to get at partial results in the event of an error, > and it leaves us in a good position to solve the more general problem > later. Can we make the error optional in these cases? like np.seterr for zerodivision, invalid, or floating point errors that allows ignore and raise np.seterr(linalg='ignore') I don't know about nanarg, but thinking about some applications for gufunc linalg code. In some cases I might require for example invertibility of all matrices and raise if one fails, in other case I would be happy with nans, and just sum the results with nansum for example or replace them by some fill value. 
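For reference, the existing np.seterr machinery referred to above works like this; the linalg keyword is only a suggestion here, not an option that exists today:

```
import numpy as np

old = np.seterr(divide='ignore', invalid='ignore')   # existing floating-point switches
np.array([1.0]) / np.array([0.0])                    # would normally warn; now silent
np.seterr(**old)                                     # restore the previous settings
# a hypothetical np.seterr(linalg='ignore') would extend the same idea to linalg errors
```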
Josef > > -n > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From charlesr.harris at gmail.com Tue Oct 1 12:33:58 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Tue, 1 Oct 2013 10:33:58 -0600 Subject: [Numpy-discussion] nanargmax failure case (was: Re: [SciPy-Dev] 1.8.0rc1) In-Reply-To: References: Message-ID: On Tue, Oct 1, 2013 at 10:19 AM, wrote: > On Tue, Oct 1, 2013 at 10:47 AM, Nathaniel Smith wrote: > > On Tue, Oct 1, 2013 at 3:20 PM, Charles R Harris > > wrote: > >> > >> > >> > >> On Tue, Oct 1, 2013 at 8:12 AM, Nathaniel Smith wrote: > >>> > >>> [switching subject to break out from the giant 1.8.0rc1 thread] > >>> > >>> On Tue, Oct 1, 2013 at 2:52 PM, Charles R Harris > >>> wrote: > >>> > > >>> > > >>> > > >>> > On Tue, Oct 1, 2013 at 7:25 AM, Nathaniel Smith > wrote: > >>> >> > >>> >> On Tue, Oct 1, 2013 at 1:56 PM, Charles R Harris > >>> >> wrote: > >>> >> > On Tue, Oct 1, 2013 at 4:43 AM, Nathaniel Smith > >>> >> > wrote: > >>> >> >> > >>> >> >> On Mon, Sep 30, 2013 at 10:51 PM, Christoph Gohlke < > cgohlke at uci.edu> > >>> >> >> wrote: > >>> >> >> > 2) Bottleneck 0.7.0 > >>> >> >> > > >>> >> >> > > >>> >> >> > > >>> >> >> > > https://github.com/kwgoodman/bottleneck/issues/71#issuecomment-25331701 > >>> >> >> > >>> >> >> I can't tell if these are real bugs in numpy, or tests checking > that > >>> >> >> bottleneck is bug-for-bug compatible with old numpy and we just > >>> >> >> fixed > >>> >> >> some bugs, or what. It's clearly something to do with the > >>> >> >> nanarg{max,min} rewrite -- @charris, do you know what's going on > >>> >> >> here? > >>> >> >> > >>> >> > > >>> >> > Yes ;) The previous behaviour of nanarg for all-nan axis was to > cast > >>> >> > nan > >>> >> > to > >>> >> > intp when the result was an array, and return nan when a scalar. > The > >>> >> > current > >>> >> > behaviour is to return the most negative value of intp as an error > >>> >> > marker in > >>> >> > both cases and raise a warning. It is a change in behavior, but I > >>> >> > think > >>> >> > one > >>> >> > that needs to be made. > >>> >> > >>> >> Ah, okay! I kind of lost track of the nanfunc changes by the end > there. > >>> >> > >>> >> So for the bottleneck issue, it sounds like the problem is just that > >>> >> bottleneck is still emulating the old numpy behaviour in this corner > >>> >> case, which isn't really a problem. So we don't really need to worry > >>> >> about that, both behaviours are correct, just maybe out of sync. > >>> >> > >>> >> I'm a little dubious about this "make up some weird value that will > >>> >> *probably* blow up if people try to use it without checking, and > also > >>> >> raise a warning" thing, wouldn't it make more sense to just raise an > >>> >> error? That's what exceptions are for? I guess I should have said > >>> >> something earlier though... > >>> >> > >>> > > >>> > I figure the blowup is safe, as we can't allocate arrays big enough > that > >>> > the > >>> > minimum intp value would be a valid index. I considered raising an > >>> > error, > >>> > and if there is a consensus the behavior could be changed. Or we > could > >>> > add a > >>> > keyword to determine the behavior. > >>> > >>> Yeah, the intp value can't be a valid index, so that covers 95% of > >>> cases, but I'm worried about that other 5%. 
It could still pass > >>> silently as the endpoint of a slice, or participate in some sort of > >>> integer arithmetic calculation, etc. I assume you also share this > >>> worry to some extent or you wouldn't have put in the warning ;-). > >>> > >>> I guess the bigger question is, why would we *not* use the standard > >>> method for signaling an exceptional condition here, i.e., exceptions? > >>> That way we're 100% guaranteed that if people aren't prepared to > >>> handle it then they'll at least know something has gone wrong, and if > >>> they are prepared to handle it then it's very easy and standard, just > >>> use try/except. Right now I guess you have to check for the special > >>> value, but also do something to silence warnings, but just for that > >>> one line? Sounds kind of complicated... > >> > >> > >> The main reason was for the case of multiple axis, where some of the > results > >> would be valid and others not. The simple thing might be to raise an > >> exception but keep the current return values so that users could > determine > >> where the problem occurred. > > > > Oh, duh, yes, right, now I remember this discussion. Sorry for being > slow. > > > > In the past we've *always* raised in error in the multiple axis case, > > right? Has anyone ever complained? Wanting to get all > > nanargmax/nanargmin results, of which some might be errors, without > > just writing a loop, seems like a pretty exotic case to me, so I'm not > > sure we should optimize for it at the expense of returning > > possibly-misleading results in the scalar case. > > > > Like (I think) you say, we could get the best of both worlds by > > encoding the results in the same way we do right now, but then raise > > an exception and attach the results to the exception so they can be > > retrieved if wanted. Kind of cumbersome, but maybe good? > > > > This is a more general problem though of course -- we've run into it > > in the gufunc linalg code too, where there's some question about you > > do in e.g. chol() if some sub-matrices are positive-definite and some > > are not. > > > > Off the top of my head the general solution might be to define a > > MultiError exception type that has a standard generic format for > > describing such things. It'd need a mask saying which values were > > valid, rather than encoding them directly into the return values -- > > otherwise we have the problem where nanargmax wants to use INT_MIN, > > chol wants to use NaN, and maybe the next function along doesn't have > > any usable flag value available at all. So probably more thought is > > needed before nailing down exactly how we handle such "partial" errors > > for vectorized functions. > > > > In the short term (i.e., 1.8.0), maybe we should defer this discussion > > by simply raising a regular ValueError for nanarg functions on all > > errors? That's not a regression from 1.7, since 1.7 also didn't > > provide any way to get at partial results in the event of an error, > > and it leaves us in a good position to solve the more general problem > > later. > > Can we make the error optional in these cases? > > like np.seterr for zerodivision, invalid, or floating point errors > that allows ignore and raise > np.seterr(linalg='ignore') > > I don't know about nanarg, but thinking about some applications for > gufunc linalg code. 
> > In some cases I might require for example invertibility of all > matrices and raise if one fails, > in other case I would be happy with nans, and just sum the results > with nansum for example or replace them by some fill value. > > I'm thinking warnings might be more flexible than exceptions: with warnings.catch_warnings(): warnings.simplefilter('error') ... Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From ondrej.certik at gmail.com Tue Oct 1 13:23:43 2013 From: ondrej.certik at gmail.com (=?UTF-8?B?T25kxZllaiDEjGVydMOtaw==?=) Date: Tue, 1 Oct 2013 11:23:43 -0600 Subject: [Numpy-discussion] Masked arrays: Rationale for "False convention" In-Reply-To: <524A3BDC.4070007@hawaii.edu> References: <524A3385.1030101@hawaii.edu> <524A3BDC.4070007@hawaii.edu> Message-ID: On Tue, Oct 1, 2013 at 4:29 AM, Robert Kern wrote: > On Tue, Oct 1, 2013 at 3:57 AM, Ond?ej ?ert?k > wrote: > >> I see, that makes sense. So to remember this, the rule is: >> >> "Specify elements that you want to get masked using True in 'mask'". > > Yes. This convention dates back at least to the original MA package in > Numeric; I don't know if Paul Dubois stole it from any previous software. I see, thanks. > > One way to motivate the convention is to think about doing a binary > operation on masked arrays, which is really the most common kind of thing > one does with masked arrays. The mask of the result is the logical OR of the > two operand masks (barring additional masked elements from domain > violations, 0/0, etc.). In the other convention, you just use logical AND, so that seams equally simple, unless I am missing something. > I assume that the convention was decided mostly on > what was most convenient and efficient for the common internal operations > for *implementing* the masked arrays and not necessarily matching any > particular intuitions when putting data *into* the masked arrays. That makes sense. On Mon, Sep 30, 2013 at 9:05 PM, Eric Firing wrote: > On 2013/09/30 4:57 PM, Ond?ej ?ert?k wrote: >> >> But why do I need to invert the mask when I want to see the valid elements: >> >> In [1]: from numpy import ma >> >> In [2]: a = ma.array([1, 2, 3, 4], mask=[False, False, True, False]) >> >> In [3]: a >> Out[3]: >> masked_array(data = [1 2 -- 4], >> mask = [False False True False], >> fill_value = 999999) >> >> >> In [4]: a[~a.mask] >> Out[4]: >> masked_array(data = [1 2 4], >> mask = [False False False], >> fill_value = 999999) >> >> >> I would find natural to write [4] as a[a.mask]. This is when it gets confusing. > > There is no getting around it; each of the two possible conventions has > its advantages. But try this instead: > > In [2]: a = ma.array([1, 2, 3, 4], mask=[False, False, True, False]) > > In [3]: a.compressed() > Out[3]: array([1, 2, 4]) > > > I do occasionally need a "goodmask" which is the inverse of a.mask, but > not very often; and when I do, needing to invert a.mask doesn't bother me. a.compressed() works for getting data out --- but I also use it to assign data in, e.g.: a[~a.mask] = 1 Thanks everybody for the discussion. It sheds some light onto the current convention. 
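For reference, a small sketch of the convention discussed above (True
marks the masked, i.e. invalid, elements, and the mask of a binary
operation is the logical OR of the operand masks):

import numpy as np
from numpy import ma

a = ma.array([1, 2, 3, 4], mask=[False, False, True, False])
b = ma.array([10, 20, 30, 40], mask=[True, False, False, False])

# The result of a binary operation is masked wherever either operand is
# masked, i.e. the masks are OR-ed together.
c = a + b
print(c.mask)            # [ True False  True False]

# compressed() gives just the valid entries as a plain ndarray.
print(a.compressed())    # [1 2 4]

# Assigning into the valid entries still needs the inverted mask.
a[~a.mask] = 0
print(a)                 # [0 0 -- 0]
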
Ondrej From njs at pobox.com Tue Oct 1 16:13:11 2013 From: njs at pobox.com (Nathaniel Smith) Date: Tue, 1 Oct 2013 21:13:11 +0100 Subject: [Numpy-discussion] nanargmax failure case (was: Re: [SciPy-Dev] 1.8.0rc1) In-Reply-To: References: Message-ID: On 1 Oct 2013 17:34, "Charles R Harris" wrote: > > > > > On Tue, Oct 1, 2013 at 10:19 AM, wrote: >> >> On Tue, Oct 1, 2013 at 10:47 AM, Nathaniel Smith wrote: >> > On Tue, Oct 1, 2013 at 3:20 PM, Charles R Harris >> > wrote: >> >> >> >> >> >> >> >> On Tue, Oct 1, 2013 at 8:12 AM, Nathaniel Smith wrote: >> >>> >> >>> [switching subject to break out from the giant 1.8.0rc1 thread] >> >>> >> >>> On Tue, Oct 1, 2013 at 2:52 PM, Charles R Harris >> >>> wrote: >> >>> > >> >>> > >> >>> > >> >>> > On Tue, Oct 1, 2013 at 7:25 AM, Nathaniel Smith wrote: >> >>> >> >> >>> >> On Tue, Oct 1, 2013 at 1:56 PM, Charles R Harris >> >>> >> wrote: >> >>> >> > On Tue, Oct 1, 2013 at 4:43 AM, Nathaniel Smith >> >>> >> > wrote: >> >>> >> >> >> >>> >> >> On Mon, Sep 30, 2013 at 10:51 PM, Christoph Gohlke < cgohlke at uci.edu> >> >>> >> >> wrote: >> >>> >> >> > 2) Bottleneck 0.7.0 >> >>> >> >> > >> >>> >> >> > >> >>> >> >> > >> >>> >> >> > https://github.com/kwgoodman/bottleneck/issues/71#issuecomment-25331701 >> >>> >> >> >> >>> >> >> I can't tell if these are real bugs in numpy, or tests checking that >> >>> >> >> bottleneck is bug-for-bug compatible with old numpy and we just >> >>> >> >> fixed >> >>> >> >> some bugs, or what. It's clearly something to do with the >> >>> >> >> nanarg{max,min} rewrite -- @charris, do you know what's going on >> >>> >> >> here? >> >>> >> >> >> >>> >> > >> >>> >> > Yes ;) The previous behaviour of nanarg for all-nan axis was to cast >> >>> >> > nan >> >>> >> > to >> >>> >> > intp when the result was an array, and return nan when a scalar. The >> >>> >> > current >> >>> >> > behaviour is to return the most negative value of intp as an error >> >>> >> > marker in >> >>> >> > both cases and raise a warning. It is a change in behavior, but I >> >>> >> > think >> >>> >> > one >> >>> >> > that needs to be made. >> >>> >> >> >>> >> Ah, okay! I kind of lost track of the nanfunc changes by the end there. >> >>> >> >> >>> >> So for the bottleneck issue, it sounds like the problem is just that >> >>> >> bottleneck is still emulating the old numpy behaviour in this corner >> >>> >> case, which isn't really a problem. So we don't really need to worry >> >>> >> about that, both behaviours are correct, just maybe out of sync. >> >>> >> >> >>> >> I'm a little dubious about this "make up some weird value that will >> >>> >> *probably* blow up if people try to use it without checking, and also >> >>> >> raise a warning" thing, wouldn't it make more sense to just raise an >> >>> >> error? That's what exceptions are for? I guess I should have said >> >>> >> something earlier though... >> >>> >> >> >>> > >> >>> > I figure the blowup is safe, as we can't allocate arrays big enough that >> >>> > the >> >>> > minimum intp value would be a valid index. I considered raising an >> >>> > error, >> >>> > and if there is a consensus the behavior could be changed. Or we could >> >>> > add a >> >>> > keyword to determine the behavior. >> >>> >> >>> Yeah, the intp value can't be a valid index, so that covers 95% of >> >>> cases, but I'm worried about that other 5%. It could still pass >> >>> silently as the endpoint of a slice, or participate in some sort of >> >>> integer arithmetic calculation, etc. 
I assume you also share this >> >>> worry to some extent or you wouldn't have put in the warning ;-). >> >>> >> >>> I guess the bigger question is, why would we *not* use the standard >> >>> method for signaling an exceptional condition here, i.e., exceptions? >> >>> That way we're 100% guaranteed that if people aren't prepared to >> >>> handle it then they'll at least know something has gone wrong, and if >> >>> they are prepared to handle it then it's very easy and standard, just >> >>> use try/except. Right now I guess you have to check for the special >> >>> value, but also do something to silence warnings, but just for that >> >>> one line? Sounds kind of complicated... >> >> >> >> >> >> The main reason was for the case of multiple axis, where some of the results >> >> would be valid and others not. The simple thing might be to raise an >> >> exception but keep the current return values so that users could determine >> >> where the problem occurred. >> > >> > Oh, duh, yes, right, now I remember this discussion. Sorry for being slow. >> > >> > In the past we've *always* raised in error in the multiple axis case, >> > right? Has anyone ever complained? Wanting to get all >> > nanargmax/nanargmin results, of which some might be errors, without >> > just writing a loop, seems like a pretty exotic case to me, so I'm not >> > sure we should optimize for it at the expense of returning >> > possibly-misleading results in the scalar case. >> > >> > Like (I think) you say, we could get the best of both worlds by >> > encoding the results in the same way we do right now, but then raise >> > an exception and attach the results to the exception so they can be >> > retrieved if wanted. Kind of cumbersome, but maybe good? >> > >> > This is a more general problem though of course -- we've run into it >> > in the gufunc linalg code too, where there's some question about you >> > do in e.g. chol() if some sub-matrices are positive-definite and some >> > are not. >> > >> > Off the top of my head the general solution might be to define a >> > MultiError exception type that has a standard generic format for >> > describing such things. It'd need a mask saying which values were >> > valid, rather than encoding them directly into the return values -- >> > otherwise we have the problem where nanargmax wants to use INT_MIN, >> > chol wants to use NaN, and maybe the next function along doesn't have >> > any usable flag value available at all. So probably more thought is >> > needed before nailing down exactly how we handle such "partial" errors >> > for vectorized functions. >> > >> > In the short term (i.e., 1.8.0), maybe we should defer this discussion >> > by simply raising a regular ValueError for nanarg functions on all >> > errors? That's not a regression from 1.7, since 1.7 also didn't >> > provide any way to get at partial results in the event of an error, >> > and it leaves us in a good position to solve the more general problem >> > later. >> >> Can we make the error optional in these cases? >> >> like np.seterr for zerodivision, invalid, or floating point errors >> that allows ignore and raise >> np.seterr(linalg='ignore') >> >> I don't know about nanarg, but thinking about some applications for >> gufunc linalg code. >> >> In some cases I might require for example invertibility of all >> matrices and raise if one fails, >> in other case I would be happy with nans, and just sum the results >> with nansum for example or replace them by some fill value. 
>> > I'm thinking warnings might be more flexible than exceptions: > > with warnings.catch_warnings(): > warnings.simplefilter('error') > ... Sure. Passing in a callback or just leaving the function out and telling people to implement it themselves would be even more flexible :-). But we have to trade off complexity of usage, complexity of teaching people how to do stuff (nobody knows how to use catch_warnings, we only know because we started writing warning tests just in the last year or so), usefulness in common situations, etc. The warnings api doesn't give you any way to pass results out, you still need a separate channel to say what failed and what succeeded (and maybe for the failures to say what the different failures are). Anyway this back and forth still supprts my main suggestion for *right* now, which is that this is sufficiently nonobvious that with 1.8 breathing down our necks we should start with the safe behaviour and then work up from there. -n -------------- next part -------------- An HTML attachment was scrubbed... URL: From josef.pktd at gmail.com Tue Oct 1 16:37:34 2013 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Tue, 1 Oct 2013 16:37:34 -0400 Subject: [Numpy-discussion] nanargmax failure case (was: Re: [SciPy-Dev] 1.8.0rc1) In-Reply-To: References: Message-ID: On Tue, Oct 1, 2013 at 4:13 PM, Nathaniel Smith wrote: > On 1 Oct 2013 17:34, "Charles R Harris" wrote: >> >> >> >> >> On Tue, Oct 1, 2013 at 10:19 AM, wrote: >>> >>> On Tue, Oct 1, 2013 at 10:47 AM, Nathaniel Smith wrote: >>> > On Tue, Oct 1, 2013 at 3:20 PM, Charles R Harris >>> > wrote: >>> >> >>> >> >>> >> >>> >> On Tue, Oct 1, 2013 at 8:12 AM, Nathaniel Smith wrote: >>> >>> >>> >>> [switching subject to break out from the giant 1.8.0rc1 thread] >>> >>> >>> >>> On Tue, Oct 1, 2013 at 2:52 PM, Charles R Harris >>> >>> wrote: >>> >>> > >>> >>> > >>> >>> > >>> >>> > On Tue, Oct 1, 2013 at 7:25 AM, Nathaniel Smith >>> >>> > wrote: >>> >>> >> >>> >>> >> On Tue, Oct 1, 2013 at 1:56 PM, Charles R Harris >>> >>> >> wrote: >>> >>> >> > On Tue, Oct 1, 2013 at 4:43 AM, Nathaniel Smith >>> >>> >> > wrote: >>> >>> >> >> >>> >>> >> >> On Mon, Sep 30, 2013 at 10:51 PM, Christoph Gohlke >>> >>> >> >> >>> >>> >> >> wrote: >>> >>> >> >> > 2) Bottleneck 0.7.0 >>> >>> >> >> > >>> >>> >> >> > >>> >>> >> >> > >>> >>> >> >> > >>> >>> >> >> > https://github.com/kwgoodman/bottleneck/issues/71#issuecomment-25331701 >>> >>> >> >> >>> >>> >> >> I can't tell if these are real bugs in numpy, or tests checking >>> >>> >> >> that >>> >>> >> >> bottleneck is bug-for-bug compatible with old numpy and we just >>> >>> >> >> fixed >>> >>> >> >> some bugs, or what. It's clearly something to do with the >>> >>> >> >> nanarg{max,min} rewrite -- @charris, do you know what's going >>> >>> >> >> on >>> >>> >> >> here? >>> >>> >> >> >>> >>> >> > >>> >>> >> > Yes ;) The previous behaviour of nanarg for all-nan axis was to >>> >>> >> > cast >>> >>> >> > nan >>> >>> >> > to >>> >>> >> > intp when the result was an array, and return nan when a scalar. >>> >>> >> > The >>> >>> >> > current >>> >>> >> > behaviour is to return the most negative value of intp as an >>> >>> >> > error >>> >>> >> > marker in >>> >>> >> > both cases and raise a warning. It is a change in behavior, but >>> >>> >> > I >>> >>> >> > think >>> >>> >> > one >>> >>> >> > that needs to be made. >>> >>> >> >>> >>> >> Ah, okay! I kind of lost track of the nanfunc changes by the end >>> >>> >> there. 
>>> >>> >> >>> >>> >> So for the bottleneck issue, it sounds like the problem is just >>> >>> >> that >>> >>> >> bottleneck is still emulating the old numpy behaviour in this >>> >>> >> corner >>> >>> >> case, which isn't really a problem. So we don't really need to >>> >>> >> worry >>> >>> >> about that, both behaviours are correct, just maybe out of sync. >>> >>> >> >>> >>> >> I'm a little dubious about this "make up some weird value that >>> >>> >> will >>> >>> >> *probably* blow up if people try to use it without checking, and >>> >>> >> also >>> >>> >> raise a warning" thing, wouldn't it make more sense to just raise >>> >>> >> an >>> >>> >> error? That's what exceptions are for? I guess I should have said >>> >>> >> something earlier though... >>> >>> >> >>> >>> > >>> >>> > I figure the blowup is safe, as we can't allocate arrays big enough >>> >>> > that >>> >>> > the >>> >>> > minimum intp value would be a valid index. I considered raising an >>> >>> > error, >>> >>> > and if there is a consensus the behavior could be changed. Or we >>> >>> > could >>> >>> > add a >>> >>> > keyword to determine the behavior. >>> >>> >>> >>> Yeah, the intp value can't be a valid index, so that covers 95% of >>> >>> cases, but I'm worried about that other 5%. It could still pass >>> >>> silently as the endpoint of a slice, or participate in some sort of >>> >>> integer arithmetic calculation, etc. I assume you also share this >>> >>> worry to some extent or you wouldn't have put in the warning ;-). >>> >>> >>> >>> I guess the bigger question is, why would we *not* use the standard >>> >>> method for signaling an exceptional condition here, i.e., exceptions? >>> >>> That way we're 100% guaranteed that if people aren't prepared to >>> >>> handle it then they'll at least know something has gone wrong, and if >>> >>> they are prepared to handle it then it's very easy and standard, just >>> >>> use try/except. Right now I guess you have to check for the special >>> >>> value, but also do something to silence warnings, but just for that >>> >>> one line? Sounds kind of complicated... >>> >> >>> >> >>> >> The main reason was for the case of multiple axis, where some of the >>> >> results >>> >> would be valid and others not. The simple thing might be to raise an >>> >> exception but keep the current return values so that users could >>> >> determine >>> >> where the problem occurred. >>> > >>> > Oh, duh, yes, right, now I remember this discussion. Sorry for being >>> > slow. >>> > >>> > In the past we've *always* raised in error in the multiple axis case, >>> > right? Has anyone ever complained? Wanting to get all >>> > nanargmax/nanargmin results, of which some might be errors, without >>> > just writing a loop, seems like a pretty exotic case to me, so I'm not >>> > sure we should optimize for it at the expense of returning >>> > possibly-misleading results in the scalar case. >>> > >>> > Like (I think) you say, we could get the best of both worlds by >>> > encoding the results in the same way we do right now, but then raise >>> > an exception and attach the results to the exception so they can be >>> > retrieved if wanted. Kind of cumbersome, but maybe good? >>> > >>> > This is a more general problem though of course -- we've run into it >>> > in the gufunc linalg code too, where there's some question about you >>> > do in e.g. chol() if some sub-matrices are positive-definite and some >>> > are not. 
>>> > >>> > Off the top of my head the general solution might be to define a >>> > MultiError exception type that has a standard generic format for >>> > describing such things. It'd need a mask saying which values were >>> > valid, rather than encoding them directly into the return values -- >>> > otherwise we have the problem where nanargmax wants to use INT_MIN, >>> > chol wants to use NaN, and maybe the next function along doesn't have >>> > any usable flag value available at all. So probably more thought is >>> > needed before nailing down exactly how we handle such "partial" errors >>> > for vectorized functions. >>> > >>> > In the short term (i.e., 1.8.0), maybe we should defer this discussion >>> > by simply raising a regular ValueError for nanarg functions on all >>> > errors? That's not a regression from 1.7, since 1.7 also didn't >>> > provide any way to get at partial results in the event of an error, >>> > and it leaves us in a good position to solve the more general problem >>> > later. >>> >>> Can we make the error optional in these cases? >>> >>> like np.seterr for zerodivision, invalid, or floating point errors >>> that allows ignore and raise >>> np.seterr(linalg='ignore') >>> >>> I don't know about nanarg, but thinking about some applications for >>> gufunc linalg code. >>> >>> In some cases I might require for example invertibility of all >>> matrices and raise if one fails, >>> in other case I would be happy with nans, and just sum the results >>> with nansum for example or replace them by some fill value. >>> >> I'm thinking warnings might be more flexible than exceptions: >> >> with warnings.catch_warnings(): >> warnings.simplefilter('error') >> ... > > Sure. Passing in a callback or just leaving the function out and telling > people to implement it themselves would be even more flexible :-). But we > have to trade off complexity of usage, complexity of teaching people how to > do stuff (nobody knows how to use catch_warnings, we only know because we > started writing warning tests just in the last year or so), usefulness in > common situations, etc. The warnings api doesn't give you any way to pass > results out, you still need a separate channel to say what failed and what > succeeded (and maybe for the failures to say what the different failures > are). Since numpy and scipy just moved to python 2.6, it's time to advertise and support warnings.catch_warnings(). If you want to wait for a "missing value support" in numpy to support this, then this postpones this to .... (numpy 3.0?) while gufuncs seem to be happening now. Josef "from the balcony" 3-dimensional panel data linear algebra without vec and kron ? > > Anyway this back and forth still supprts my main suggestion for *right* now, > which is that this is sufficiently nonobvious that with 1.8 breathing down > our necks we should start with the safe behaviour and then work up from > there. 
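To illustrate the catch_warnings point, something like the following
(assuming 1.8.0rc1 keeps the current behaviour of warning and returning
intp.min for an all-nan column) already gives the raise-on-failure style
at a single call site:

import warnings
import numpy as np

x = np.array([[1.0, np.nan],
              [2.0, np.nan]])

with warnings.catch_warnings():
    warnings.simplefilter('error')   # turn warnings into exceptions
    try:
        idx = np.nanargmax(x, axis=0)
    except Warning:
        # the all-nan column triggered the warning-turned-error
        idx = None
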
> > -n > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From charlesr.harris at gmail.com Tue Oct 1 16:37:51 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Tue, 1 Oct 2013 14:37:51 -0600 Subject: [Numpy-discussion] nanargmax failure case (was: Re: [SciPy-Dev] 1.8.0rc1) In-Reply-To: References: Message-ID: On Tue, Oct 1, 2013 at 2:13 PM, Nathaniel Smith wrote: > On 1 Oct 2013 17:34, "Charles R Harris" wrote: > > > > > > > > > > On Tue, Oct 1, 2013 at 10:19 AM, wrote: > >> > >> On Tue, Oct 1, 2013 at 10:47 AM, Nathaniel Smith wrote: > >> > On Tue, Oct 1, 2013 at 3:20 PM, Charles R Harris > >> > wrote: > >> >> > >> >> > >> >> > >> >> On Tue, Oct 1, 2013 at 8:12 AM, Nathaniel Smith > wrote: > >> >>> > >> >>> [switching subject to break out from the giant 1.8.0rc1 thread] > >> >>> > >> >>> On Tue, Oct 1, 2013 at 2:52 PM, Charles R Harris > >> >>> wrote: > >> >>> > > >> >>> > > >> >>> > > >> >>> > On Tue, Oct 1, 2013 at 7:25 AM, Nathaniel Smith > wrote: > >> >>> >> > >> >>> >> On Tue, Oct 1, 2013 at 1:56 PM, Charles R Harris > >> >>> >> wrote: > >> >>> >> > On Tue, Oct 1, 2013 at 4:43 AM, Nathaniel Smith > > >> >>> >> > wrote: > >> >>> >> >> > >> >>> >> >> On Mon, Sep 30, 2013 at 10:51 PM, Christoph Gohlke < > cgohlke at uci.edu> > >> >>> >> >> wrote: > >> >>> >> >> > 2) Bottleneck 0.7.0 > >> >>> >> >> > > >> >>> >> >> > > >> >>> >> >> > > >> >>> >> >> > > https://github.com/kwgoodman/bottleneck/issues/71#issuecomment-25331701 > >> >>> >> >> > >> >>> >> >> I can't tell if these are real bugs in numpy, or tests > checking that > >> >>> >> >> bottleneck is bug-for-bug compatible with old numpy and we > just > >> >>> >> >> fixed > >> >>> >> >> some bugs, or what. It's clearly something to do with the > >> >>> >> >> nanarg{max,min} rewrite -- @charris, do you know what's going > on > >> >>> >> >> here? > >> >>> >> >> > >> >>> >> > > >> >>> >> > Yes ;) The previous behaviour of nanarg for all-nan axis was > to cast > >> >>> >> > nan > >> >>> >> > to > >> >>> >> > intp when the result was an array, and return nan when a > scalar. The > >> >>> >> > current > >> >>> >> > behaviour is to return the most negative value of intp as an > error > >> >>> >> > marker in > >> >>> >> > both cases and raise a warning. It is a change in behavior, > but I > >> >>> >> > think > >> >>> >> > one > >> >>> >> > that needs to be made. > >> >>> >> > >> >>> >> Ah, okay! I kind of lost track of the nanfunc changes by the end > there. > >> >>> >> > >> >>> >> So for the bottleneck issue, it sounds like the problem is just > that > >> >>> >> bottleneck is still emulating the old numpy behaviour in this > corner > >> >>> >> case, which isn't really a problem. So we don't really need to > worry > >> >>> >> about that, both behaviours are correct, just maybe out of sync. > >> >>> >> > >> >>> >> I'm a little dubious about this "make up some weird value that > will > >> >>> >> *probably* blow up if people try to use it without checking, and > also > >> >>> >> raise a warning" thing, wouldn't it make more sense to just > raise an > >> >>> >> error? That's what exceptions are for? I guess I should have said > >> >>> >> something earlier though... > >> >>> >> > >> >>> > > >> >>> > I figure the blowup is safe, as we can't allocate arrays big > enough that > >> >>> > the > >> >>> > minimum intp value would be a valid index. 
I considered raising an > >> >>> > error, > >> >>> > and if there is a consensus the behavior could be changed. Or we > could > >> >>> > add a > >> >>> > keyword to determine the behavior. > >> >>> > >> >>> Yeah, the intp value can't be a valid index, so that covers 95% of > >> >>> cases, but I'm worried about that other 5%. It could still pass > >> >>> silently as the endpoint of a slice, or participate in some sort of > >> >>> integer arithmetic calculation, etc. I assume you also share this > >> >>> worry to some extent or you wouldn't have put in the warning ;-). > >> >>> > >> >>> I guess the bigger question is, why would we *not* use the standard > >> >>> method for signaling an exceptional condition here, i.e., > exceptions? > >> >>> That way we're 100% guaranteed that if people aren't prepared to > >> >>> handle it then they'll at least know something has gone wrong, and > if > >> >>> they are prepared to handle it then it's very easy and standard, > just > >> >>> use try/except. Right now I guess you have to check for the special > >> >>> value, but also do something to silence warnings, but just for that > >> >>> one line? Sounds kind of complicated... > >> >> > >> >> > >> >> The main reason was for the case of multiple axis, where some of the > results > >> >> would be valid and others not. The simple thing might be to raise an > >> >> exception but keep the current return values so that users could > determine > >> >> where the problem occurred. > >> > > >> > Oh, duh, yes, right, now I remember this discussion. Sorry for being > slow. > >> > > >> > In the past we've *always* raised in error in the multiple axis case, > >> > right? Has anyone ever complained? Wanting to get all > >> > nanargmax/nanargmin results, of which some might be errors, without > >> > just writing a loop, seems like a pretty exotic case to me, so I'm not > >> > sure we should optimize for it at the expense of returning > >> > possibly-misleading results in the scalar case. > >> > > >> > Like (I think) you say, we could get the best of both worlds by > >> > encoding the results in the same way we do right now, but then raise > >> > an exception and attach the results to the exception so they can be > >> > retrieved if wanted. Kind of cumbersome, but maybe good? > >> > > >> > This is a more general problem though of course -- we've run into it > >> > in the gufunc linalg code too, where there's some question about you > >> > do in e.g. chol() if some sub-matrices are positive-definite and some > >> > are not. > >> > > >> > Off the top of my head the general solution might be to define a > >> > MultiError exception type that has a standard generic format for > >> > describing such things. It'd need a mask saying which values were > >> > valid, rather than encoding them directly into the return values -- > >> > otherwise we have the problem where nanargmax wants to use INT_MIN, > >> > chol wants to use NaN, and maybe the next function along doesn't have > >> > any usable flag value available at all. So probably more thought is > >> > needed before nailing down exactly how we handle such "partial" errors > >> > for vectorized functions. > >> > > >> > In the short term (i.e., 1.8.0), maybe we should defer this discussion > >> > by simply raising a regular ValueError for nanarg functions on all > >> > errors? 
That's not a regression from 1.7, since 1.7 also didn't > >> > provide any way to get at partial results in the event of an error, > >> > and it leaves us in a good position to solve the more general problem > >> > later. > >> > >> Can we make the error optional in these cases? > >> > >> like np.seterr for zerodivision, invalid, or floating point errors > >> that allows ignore and raise > >> np.seterr(linalg='ignore') > >> > >> I don't know about nanarg, but thinking about some applications for > >> gufunc linalg code. > >> > >> In some cases I might require for example invertibility of all > >> matrices and raise if one fails, > >> in other case I would be happy with nans, and just sum the results > >> with nansum for example or replace them by some fill value. > >> > > I'm thinking warnings might be more flexible than exceptions: > > > > with warnings.catch_warnings(): > > warnings.simplefilter('error') > > ... > > Sure. Passing in a callback or just leaving the function out and telling > people to implement it themselves would be even more flexible :-). But we > have to trade off complexity of usage, complexity of teaching people how to > do stuff (nobody knows how to use catch_warnings, we only know because we > started writing warning tests just in the last year or so), usefulness in > common situations, etc. The warnings api doesn't give you any way to pass > results out, you still need a separate channel to say what failed and what > succeeded (and maybe for the failures to say what the different failures > are). > > Anyway this back and forth still supprts my main suggestion for *right* > now, which is that this is sufficiently nonobvious that with 1.8 breathing > down our necks we should start with the safe behaviour and then work up > from there. > I'm surely not opposed to raising an exception if there is agreement on that. I think it would also be pretty easy to attach the result to the exception. For the latter it would be good to have an exception type that we could maybe reuse for other parts of Numpy. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Tue Oct 1 16:55:01 2013 From: njs at pobox.com (Nathaniel Smith) Date: Tue, 1 Oct 2013 21:55:01 +0100 Subject: [Numpy-discussion] nanargmax failure case (was: Re: [SciPy-Dev] 1.8.0rc1) In-Reply-To: References: Message-ID: On Tue, Oct 1, 2013 at 9:37 PM, wrote: > Since numpy and scipy just moved to python 2.6, it's time to advertise > and support > warnings.catch_warnings(). warnings.catch_warnings is a very useful tool and this is all fun to talk about, but realistically we're simply not going to merge any change which involves telling people "the way you detect failure in this function is to use the catch_warnings() context manager". > If you want to wait for a "missing value support" in numpy to support > this, then this postpones this to .... (numpy 3.0?) > while gufuncs seem to be happening now. No-one said anything about missing value support :-). I don't see how it would really solve the problem -- we'll probably never allow missing values to magically appear in arbitrary function outputs (e.g. you can't put a bitpattern NA in a regular integer dtype, it's just not possible). On Tue, Oct 1, 2013 at 9:37 PM, Charles R Harris wrote: > I'm surely not opposed to raising an exception if there is agreement on > that. I think it would also be pretty easy to attach the result to the > exception. 
For the latter it would be good to have an exception type that we > could maybe reuse for other parts of Numpy. It is true that no-one's ever objected to the ValueError that nanarg{max,min} have raised in the past, right? That seems like one measure of agreement that it's at least acceptable. Spitball of a proper solution, though some thought would need to go into how to get this out of gufuncs. (And absolutely unsuitable for 1.8!): class VectorizedError(object): pass def vectorized_raise(result, good_mask, exceptions): exc_types = set([type(e) for e in exceptions]) exc_types.add(VectorizedError) exc_type = type.__new__(type, "SubVectorizedError", tuple(exc_types), {}) exc = exc_type(" From njs at pobox.com Tue Oct 1 16:58:23 2013 From: njs at pobox.com (Nathaniel Smith) Date: Tue, 1 Oct 2013 21:58:23 +0100 Subject: [Numpy-discussion] nanargmax failure case (was: Re: [SciPy-Dev] 1.8.0rc1) In-Reply-To: References: Message-ID: whoops, fat-fingered this out while in the middle of typing it On Tue, Oct 1, 2013 at 9:55 PM, Nathaniel Smith wrote: > Spitball of a proper solution, though some thought would need to go > into how to get this out of gufuncs. (And absolutely unsuitable for > 1.8!): > > class VectorizedError(object): > pass > > def vectorized_raise(result, good_mask, exceptions): > exc_types = set([type(e) for e in exceptions]) > exc_types.add(VectorizedError) > exc_type = type.__new__(type, "SubVectorizedError", tuple(exc_types), {}) > exc = exc_type(" # FIXME: use some heuristics to look at the exception messages and try to # say something more useful here exc = exc_type("Multiple errors") exc.result = result exc.good_mask = good_mask exc.exceptions = exceptions raise exc Okay, maybe that's a little ridiculous, but, you know. Discuss :-). (The black magic at the beginning is to ensure that if you have a ValueError and a TypeError consolidated into a single VectorizedError, then the exception that is raised can be caught by code that's looking for a ValueError, a TypeError, *or* a VectorizedError.) -n From cgohlke at uci.edu Wed Oct 2 03:31:49 2013 From: cgohlke at uci.edu (Christoph Gohlke) Date: Wed, 02 Oct 2013 00:31:49 -0700 Subject: [Numpy-discussion] 1.8.0rc1 In-Reply-To: References: Message-ID: <524BCBE5.8030306@uci.edu> On 9/30/2013 8:17 AM, Charles R Harris wrote: > Hi All, > > NumPy 1.8.0rc1 is up now on sourceforge > .The > binary builds are included except for Python 3.3 on windows, which will > arrive later. Many thanks to Ralf for the binaries, and to those who > found and fixed the bugs in the last beta. Any remaining bugs are all my > fault ;) I hope this will be the last release before final, so please > test it thoroughly. > > Chuck > One more: Cython-0.19.1 fails relaxed_strides.test_one_sized with "ValueError: ndarray is not C-contiguous". Tested on numpy-MKL-1.8.0rc1-win-amd64-py2.7 and msvc9. https://github.com/cython/cython/blob/0.19.x/tests/memoryview/relaxed_strides.pyx#L29 The relevant test output is attached. 
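As a rough NumPy-only illustration of what the memoryview is complaining
about (the array below is my own construction, assuming the Cython test
passes in a one-element slice whose stride is larger than the itemsize,
which is what the linked relaxed_strides.pyx appears to exercise):

import numpy as np

a = np.arange(10, dtype=np.double)[::100]   # shape (1,), stride 800 bytes
print(a.shape, a.strides)
# 1.7 always flagged 1-d size-1 arrays as contiguous; 1.8.0rc1 built
# without NPY_RELAXED_STRIDES_CHECKING reports False here, which is what
# trips the "ndarray is not C-contiguous" check in the memoryview.
print(a.flags['C_CONTIGUOUS'])
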
Christoph -------------- next part -------------- ====================================================================== FAIL: test_one_sized (line 29) (relaxed_strides.__test__) Doctest: relaxed_strides.__test__.test_one_sized (line 29) ---------------------------------------------------------------------- Traceback (most recent call last): File "X:\Python27-x64\lib\doctest.py", line 2201, in runTest raise self.failureException(self.format_failure(new.getvalue())) AssertionError: Failed doctest test for relaxed_strides.__test__.test_one_sized (line 29) File "Cython-0.19.1\BUILD\memoryview\c\relaxed_strides\relaxed_strides.pyd", line unknown line number, in test_one_sized (line 29) ---------------------------------------------------------------------- File "Cython-0.19.1\BUILD\memoryview\c\relaxed_strides\relaxed_strides.pyd", line ?, in relaxed_strides.__test__.test_one_sized (line 29) Failed example: test_one_sized(a)[0] Exception raised: Traceback (most recent call last): File "X:\Python27-x64\lib\doctest.py", line 1289, in __run compileflags, 1) in test.globs File "", line 1, in test_one_sized(a)[0] File "relaxed_strides.pyx", line 38, in relaxed_strides.test_one_sized (relaxed_strides.c:1379) File "stringsource", line 619, in View.MemoryView.memoryview_cwrapper (relaxed_strides.c:7145) File "stringsource", line 327, in View.MemoryView.memoryview.__cinit__ (relaxed_strides.c:3555) ValueError: ndarray is not C-contiguous ====================================================================== FAIL: test_one_sized (line 29) (relaxed_strides.__test__) Doctest: relaxed_strides.__test__.test_one_sized (line 29) ---------------------------------------------------------------------- Traceback (most recent call last): File "X:\Python27-x64\lib\doctest.py", line 2201, in runTest raise self.failureException(self.format_failure(new.getvalue())) AssertionError: Failed doctest test for relaxed_strides.__test__.test_one_sized (line 29) File "Cython-0.19.1\BUILD\memoryview\cpp\relaxed_strides\relaxed_strides.pyd", line unknown line number, in test_one_sized (line 29) ---------------------------------------------------------------------- File "Cython-0.19.1\BUILD\memoryview\cpp\relaxed_strides\relaxed_strides.pyd", line ?, in relaxed_strides.__test__.test_one_sized (line 29) Failed example: test_one_sized(a)[0] Exception raised: Traceback (most recent call last): File "X:\Python27-x64\lib\doctest.py", line 1289, in __run compileflags, 1) in test.globs File "", line 1, in test_one_sized(a)[0] File "relaxed_strides.pyx", line 38, in relaxed_strides.test_one_sized (relaxed_strides.cpp:1379) File "stringsource", line 619, in View.MemoryView.memoryview_cwrapper (relaxed_strides.cpp:7145) File "stringsource", line 327, in View.MemoryView.memoryview.__cinit__ (relaxed_strides.cpp:3555) ValueError: ndarray is not C-contiguous From njs at pobox.com Wed Oct 2 05:04:09 2013 From: njs at pobox.com (Nathaniel Smith) Date: Wed, 2 Oct 2013 10:04:09 +0100 Subject: [Numpy-discussion] 1.8.0rc1 In-Reply-To: <524BCBE5.8030306@uci.edu> References: <524BCBE5.8030306@uci.edu> Message-ID: This is a complicated issue to describe but i think the bottom line is that the test is just wonky here. the behaviour it's checking for is: wrong in old numpy, but we do it anyway (bug) wrong in current numpy without RELAXED_STRIDES, and we get it right (I.e. don't do it, fixed bug) correct in current numpy with RELAXED_STRIDES (and I think the test will pass if you build 1.8 this way?) 
(fixed bug again, since the definition of "correct" flipped) correct in future numpys (@seberg, does that look right?) I guess maybe we could go back and make sure it stays broken in 1.8, but it's probably not worth the trouble because this is such a corner case. Nothing real is going to break because of this AFAICT. -n On 2 Oct 2013 08:31, "Christoph Gohlke" wrote: > On 9/30/2013 8:17 AM, Charles R Harris wrote: > >> Hi All, >> >> NumPy 1.8.0rc1 is up now on sourceforge >> > >> .The >> binary builds are included except for Python 3.3 on windows, which will >> arrive later. Many thanks to Ralf for the binaries, and to those who >> found and fixed the bugs in the last beta. Any remaining bugs are all my >> fault ;) I hope this will be the last release before final, so please >> test it thoroughly. >> >> Chuck >> >> > One more: Cython-0.19.1 fails relaxed_strides.test_one_sized with > "ValueError: ndarray is not C-contiguous". Tested on > numpy-MKL-1.8.0rc1-win-amd64-**py2.7 and msvc9. > > https://github.com/cython/**cython/blob/0.19.x/tests/** > memoryview/relaxed_strides.**pyx#L29 > > The relevant test output is attached. > > Christoph > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From sebastian at sipsolutions.net Wed Oct 2 06:54:45 2013 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Wed, 02 Oct 2013 12:54:45 +0200 Subject: [Numpy-discussion] 1.8.0rc1 In-Reply-To: References: <524BCBE5.8030306@uci.edu> Message-ID: <1380711285.15888.3.camel@sebastian-laptop> On Wed, 2013-10-02 at 10:04 +0100, Nathaniel Smith wrote: > This is a complicated issue to describe but i think the bottom line is > that the test is just wonky here. the behaviour it's checking for is: > wrong in old numpy, but we do it anyway (bug) > wrong in current numpy without RELAXED_STRIDES, and we get it right > (I.e. don't do it, fixed bug) > correct in current numpy with RELAXED_STRIDES (and I think the test > will pass if you build 1.8 this way?) (fixed bug again, since the > definition of "correct" flipped) > correct in future numpys > Sounds about right, but I did not yet look at the test exactly. The old numpy had some corner cases where the *not* relaxed strides were not enforced. I could imagine that the test was written to accommodate the relaxed strides, but runs through fine with the old behaviour, too. That said, if this is the case, this must be something like a size 1 array corner case, so I would agree with Nathaniel, that probably the test should just see whether relaxed strides is active or not and change behaviour accordingly. If this has to do with the Buffer protocol change we have to have a closer look if it might be breaking code out there. (even if the buffer protocol change is correct and this should be fixed on cythons end in that case) - Sebastian > (@seberg, does that look right?) > > I guess maybe we could go back and make sure it stays broken in 1.8, > but it's probably not worth the trouble because this is such a corner > case. Nothing real is going to break because of this AFAICT. > > -n > > On 2 Oct 2013 08:31, "Christoph Gohlke" wrote: > On 9/30/2013 8:17 AM, Charles R Harris wrote: > Hi All, > > NumPy 1.8.0rc1 is up now on sourceforge > .The > binary builds are included except for Python 3.3 on > windows, which will > arrive later. 
Many thanks to Ralf for the binaries, > and to those who > found and fixed the bugs in the last beta. Any > remaining bugs are all my > fault ;) I hope this will be the last release before > final, so please > test it thoroughly. > > Chuck > > > One more: Cython-0.19.1 fails relaxed_strides.test_one_sized > with "ValueError: ndarray is not C-contiguous". Tested on > numpy-MKL-1.8.0rc1-win-amd64-py2.7 and msvc9. > > https://github.com/cython/cython/blob/0.19.x/tests/memoryview/relaxed_strides.pyx#L29 > > The relevant test output is attached. > > Christoph > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From sebastian at sipsolutions.net Wed Oct 2 07:51:04 2013 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Wed, 02 Oct 2013 13:51:04 +0200 Subject: [Numpy-discussion] 1.8.0rc1 In-Reply-To: <1380711285.15888.3.camel@sebastian-laptop> References: <524BCBE5.8030306@uci.edu> <1380711285.15888.3.camel@sebastian-laptop> Message-ID: <1380714664.17995.2.camel@sebastian-laptop> On Wed, 2013-10-02 at 12:54 +0200, Sebastian Berg wrote: > On Wed, 2013-10-02 at 10:04 +0100, Nathaniel Smith wrote: > > This is a complicated issue to describe but i think the bottom line is > > that the test is just wonky here. the behaviour it's checking for is: > > wrong in old numpy, but we do it anyway (bug) > > wrong in current numpy without RELAXED_STRIDES, and we get it right > > (I.e. don't do it, fixed bug) > > correct in current numpy with RELAXED_STRIDES (and I think the test > > will pass if you build 1.8 this way?) (fixed bug again, since the > > definition of "correct" flipped) > > correct in future numpys > > OK, I had a look. And it is the 1-dimensional size-1 special case that was always "relaxed", but is not anymore. If anyone thinks this is a problem, we could re-add the special case (rethinking it, it is almost impossible to hurt, since the array has only one element). Otherwise, I think the test should check if numpy was compiled with relaxed strides, for more extensive tests this is necessary anyway. - Sebastian > Sounds about right, but I did not yet look at the test exactly. The old > numpy had some corner cases where the *not* relaxed strides were not > enforced. I could imagine that the test was written to accommodate the > relaxed strides, but runs through fine with the old behaviour, too. > > That said, if this is the case, this must be something like a size 1 > array corner case, so I would agree with Nathaniel, that probably the > test should just see whether relaxed strides is active or not and change > behaviour accordingly. > > If this has to do with the Buffer protocol change we have to have a > closer look if it might be breaking code out there. (even if the buffer > protocol change is correct and this should be fixed on cythons end in > that case) > > - Sebastian > > > > (@seberg, does that look right?) > > > > I guess maybe we could go back and make sure it stays broken in 1.8, > > but it's probably not worth the trouble because this is such a corner > > case. Nothing real is going to break because of this AFAICT. 
> > > > -n > > > > On 2 Oct 2013 08:31, "Christoph Gohlke" wrote: > > On 9/30/2013 8:17 AM, Charles R Harris wrote: > > Hi All, > > > > NumPy 1.8.0rc1 is up now on sourceforge > > .The > > binary builds are included except for Python 3.3 on > > windows, which will > > arrive later. Many thanks to Ralf for the binaries, > > and to those who > > found and fixed the bugs in the last beta. Any > > remaining bugs are all my > > fault ;) I hope this will be the last release before > > final, so please > > test it thoroughly. > > > > Chuck > > > > > > One more: Cython-0.19.1 fails relaxed_strides.test_one_sized > > with "ValueError: ndarray is not C-contiguous". Tested on > > numpy-MKL-1.8.0rc1-win-amd64-py2.7 and msvc9. > > > > https://github.com/cython/cython/blob/0.19.x/tests/memoryview/relaxed_strides.pyx#L29 > > > > The relevant test output is attached. > > > > Christoph > > > > _______________________________________________ > > NumPy-Discussion mailing list > > NumPy-Discussion at scipy.org > > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > > > _______________________________________________ > > NumPy-Discussion mailing list > > NumPy-Discussion at scipy.org > > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From stefan at sun.ac.za Wed Oct 2 11:29:41 2013 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Wed, 2 Oct 2013 17:29:41 +0200 Subject: [Numpy-discussion] [SciPy-Dev] 1.8.0rc1 In-Reply-To: References: <5249F26B.2040109@uci.edu> Message-ID: Hi Chuck On Tue, Oct 1, 2013 at 1:07 AM, Charles R Harris wrote: >> I'll bet the skimage problems come from >> https://github.com/numpy/numpy/pull/3811. They may be doing something >> naughty... >> > > Reverting that commit fixes those skimage failures. However, there are a > number of python2.7 failures that look pretty strange. What is the exact change in behavior with that PR? I'm trying to figure out what skimage does wrong in this case. St?fan From charlesr.harris at gmail.com Wed Oct 2 11:43:42 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Wed, 2 Oct 2013 09:43:42 -0600 Subject: [Numpy-discussion] [SciPy-Dev] 1.8.0rc1 In-Reply-To: References: <5249F26B.2040109@uci.edu> Message-ID: Hi Stefan, On Wed, Oct 2, 2013 at 9:29 AM, St?fan van der Walt wrote: > Hi Chuck > > On Tue, Oct 1, 2013 at 1:07 AM, Charles R Harris > wrote: > >> I'll bet the skimage problems come from > >> https://github.com/numpy/numpy/pull/3811. They may be doing something > >> naughty... > >> > > > > Reverting that commit fixes those skimage failures. However, there are a > > number of python2.7 failures that look pretty strange. > > What is the exact change in behavior with that PR? I'm trying to > figure out what skimage does wrong in this case. > > The current master, and reverted for the 1.8 release only, is stricter about np.bool only taking values 0 or 1. Apparently the convolve returns boolean (I haven't checked) for boolean input, and consequently the check if the return value matches the number of 1 elements in the convolution kernel will fail when that number is greater than one. That is why the proposed fix is to view the boolean as uint8 instead. Note that out=(boolean) will still cause problems. Chuck -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From charlesr.harris at gmail.com Wed Oct 2 12:03:43 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Wed, 2 Oct 2013 10:03:43 -0600 Subject: [Numpy-discussion] Behavior of nan{max, min} and nanarg{max, min} for all-nan slices. Message-ID: Hi All, The question is what to do when all-nan slices are encountered in the nan{max,min} and nanarg{max, min} functions. Currently in 1.8.0, the first returns nan and raises a warning, the second returns intp.min and raises a warning. It is proposed that the nanarg{max, min} functions, and possibly the nan{max, min} also, raise an error instead. Raising errors would be consistent with the behavior of the arg{max, min} and amax/amin functions when they encounter empty arrays. OTOH, now that we no longer support Python 2.4/2.5 the catch_warnings context manager can serve the same purpose by changing the warnings into exceptions. So, what to do? Thoughts? Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Wed Oct 2 12:09:40 2013 From: njs at pobox.com (Nathaniel Smith) Date: Wed, 2 Oct 2013 17:09:40 +0100 Subject: [Numpy-discussion] Behavior of nan{max, min} and nanarg{max, min} for all-nan slices. In-Reply-To: References: Message-ID: +1 to making the nan functions consistent with the non-nan functions. On 2 Oct 2013 17:03, "Charles R Harris" wrote: > Hi All, > > The question is what to do when all-nan slices are encountered in the > nan{max,min} and nanarg{max, min} functions. Currently in 1.8.0, the first > returns nan and raises a warning, the second returns intp.min and raises a > warning. It is proposed that the nanarg{max, min} functions, and possibly > the nan{max, min} also, raise an error instead. Raising errors would be > consistent with the behavior of the arg{max, min} and amax/amin functions > when they encounter empty arrays. OTOH, now that we no longer support > Python 2.4/2.5 the catch_warnings context manager can serve the same > purpose by changing the warnings into exceptions. So, what to do? > > Thoughts? > > Chuck > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ben.root at ou.edu Wed Oct 2 12:12:00 2013 From: ben.root at ou.edu (Benjamin Root) Date: Wed, 2 Oct 2013 12:12:00 -0400 Subject: [Numpy-discussion] [SciPy-Dev] 1.8.0rc1 In-Reply-To: References: <5249F26B.2040109@uci.edu> Message-ID: On Wed, Oct 2, 2013 at 11:43 AM, Charles R Harris wrote: > Hi Stefan, > > > On Wed, Oct 2, 2013 at 9:29 AM, St?fan van der Walt wrote: > >> Hi Chuck >> >> On Tue, Oct 1, 2013 at 1:07 AM, Charles R Harris >> wrote: >> >> I'll bet the skimage problems come from >> >> https://github.com/numpy/numpy/pull/3811. They may be doing something >> >> naughty... >> >> >> > >> > Reverting that commit fixes those skimage failures. However, there are a >> > number of python2.7 failures that look pretty strange. >> >> What is the exact change in behavior with that PR? I'm trying to >> figure out what skimage does wrong in this case. >> >> > The current master, and reverted for the 1.8 release only, is stricter > about np.bool only taking values 0 or 1. 
Apparently the convolve returns > boolean (I haven't checked) for boolean input, and consequently the check > if the return value matches the number of 1 elements in the convolution > kernel will fail when that number is greater than one. That is why the > proposed fix is to view the boolean as uint8 instead. Note that > out=(boolean) will still cause problems. > > Chuck > > So, just to be clear... what would happen if I had an array of floats between 0 and 1 inclusive and I cast that as a boolean using astype()? Ben Root -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Wed Oct 2 12:24:24 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Wed, 2 Oct 2013 10:24:24 -0600 Subject: [Numpy-discussion] [SciPy-Dev] 1.8.0rc1 In-Reply-To: References: <5249F26B.2040109@uci.edu> Message-ID: On Wed, Oct 2, 2013 at 10:12 AM, Benjamin Root wrote: > > > > On Wed, Oct 2, 2013 at 11:43 AM, Charles R Harris < > charlesr.harris at gmail.com> wrote: > >> Hi Stefan, >> >> >> On Wed, Oct 2, 2013 at 9:29 AM, St?fan van der Walt wrote: >> >>> Hi Chuck >>> >>> On Tue, Oct 1, 2013 at 1:07 AM, Charles R Harris >>> wrote: >>> >> I'll bet the skimage problems come from >>> >> https://github.com/numpy/numpy/pull/3811. They may be doing something >>> >> naughty... >>> >> >>> > >>> > Reverting that commit fixes those skimage failures. However, there are >>> a >>> > number of python2.7 failures that look pretty strange. >>> >>> What is the exact change in behavior with that PR? I'm trying to >>> figure out what skimage does wrong in this case. >>> >>> >> The current master, and reverted for the 1.8 release only, is stricter >> about np.bool only taking values 0 or 1. Apparently the convolve returns >> boolean (I haven't checked) for boolean input, and consequently the check >> if the return value matches the number of 1 elements in the convolution >> kernel will fail when that number is greater than one. That is why the >> proposed fix is to view the boolean as uint8 instead. Note that >> out=(boolean) will still cause problems. >> >> Chuck >> >> > So, just to be clear... what would happen if I had an array of floats > between 0 and 1 inclusive and I cast that as a boolean using astype()? > > You get zeros and ones. I don't think that has changed. Of course, you can still mess things up using views ;) Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From stefan at sun.ac.za Wed Oct 2 12:37:50 2013 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Wed, 2 Oct 2013 18:37:50 +0200 Subject: [Numpy-discussion] Behavior of nan{max, min} and nanarg{max, min} for all-nan slices. In-Reply-To: References: Message-ID: On 2 Oct 2013 18:04, "Charles R Harris" wrote: > > The question is what to do when all-nan slices are encountered in the nan{max,min} and nanarg{max, min} functions. Currently in 1.8.0, the first returns nan and raises a warning, the second returns intp.min and raises a warning. It is proposed that the nanarg{max, min} functions, and possibly the nan{max, min} also, raise an error instead. I agree with Nathan; this sounds like more reasonable behaviour to me. St?fan -------------- next part -------------- An HTML attachment was scrubbed... URL: From josef.pktd at gmail.com Wed Oct 2 12:56:19 2013 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Wed, 2 Oct 2013 12:56:19 -0400 Subject: [Numpy-discussion] Behavior of nan{max, min} and nanarg{max, min} for all-nan slices. 
In-Reply-To: References: Message-ID: On Wed, Oct 2, 2013 at 12:37 PM, St?fan van der Walt wrote: > On 2 Oct 2013 18:04, "Charles R Harris" wrote: >> >> The question is what to do when all-nan slices are encountered in the >> nan{max,min} and nanarg{max, min} functions. Currently in 1.8.0, the first >> returns nan and raises a warning, the second returns intp.min and raises a >> warning. It is proposed that the nanarg{max, min} functions, and possibly >> the nan{max, min} also, raise an error instead. > > I agree with Nathan; this sounds like more reasonable behaviour to me. If I understand what you are proposing -1 on raising an error with nan{max, min}, an empty array is empty in all columns an array with nans, might be empty in only some columns. as far as I understand, nan{max, min} only make sense with arrays that can hold a nan, so we can return nans. If a user calls with ints or bool, then there are either no nans or the array is empty, and I don't care. --- aside with nanarg{max, min} I would just return 0 in an all nan column, since the max or min is nan, and one is at zero. (but I'm not arguing) Josef > > St?fan > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From charlesr.harris at gmail.com Wed Oct 2 13:05:28 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Wed, 2 Oct 2013 11:05:28 -0600 Subject: [Numpy-discussion] Behavior of nan{max, min} and nanarg{max, min} for all-nan slices. In-Reply-To: References: Message-ID: On Wed, Oct 2, 2013 at 10:56 AM, wrote: > On Wed, Oct 2, 2013 at 12:37 PM, St?fan van der Walt > wrote: > > On 2 Oct 2013 18:04, "Charles R Harris" > wrote: > >> > >> The question is what to do when all-nan slices are encountered in the > >> nan{max,min} and nanarg{max, min} functions. Currently in 1.8.0, the > first > >> returns nan and raises a warning, the second returns intp.min and > raises a > >> warning. It is proposed that the nanarg{max, min} functions, and > possibly > >> the nan{max, min} also, raise an error instead. > > > > I agree with Nathan; this sounds like more reasonable behaviour to me. > > If I understand what you are proposing > > -1 on raising an error with nan{max, min}, > > an empty array is empty in all columns > an array with nans, might be empty in only some columns. > > as far as I understand, nan{max, min} only make sense with arrays that > can hold a nan, so we can return nans. > That was my original thought. > > If a user calls with ints or bool, then there are either no nans or > the array is empty, and I don't care. > > --- > aside > with nanarg{max, min} I would just return 0 in an all nan column, > since the max or min is nan, and one is at zero. > (but I'm not arguing) > > That is an interesting proposal. I like it. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From ben.root at ou.edu Wed Oct 2 13:13:20 2013 From: ben.root at ou.edu (Benjamin Root) Date: Wed, 2 Oct 2013 13:13:20 -0400 Subject: [Numpy-discussion] Behavior of nan{max, min} and nanarg{max, min} for all-nan slices. 
In-Reply-To: References: Message-ID: On Wed, Oct 2, 2013 at 1:05 PM, Charles R Harris wrote: > > > > On Wed, Oct 2, 2013 at 10:56 AM, wrote: > >> On Wed, Oct 2, 2013 at 12:37 PM, St?fan van der Walt >> wrote: >> > On 2 Oct 2013 18:04, "Charles R Harris" >> wrote: >> >> >> >> The question is what to do when all-nan slices are encountered in the >> >> nan{max,min} and nanarg{max, min} functions. Currently in 1.8.0, the >> first >> >> returns nan and raises a warning, the second returns intp.min and >> raises a >> >> warning. It is proposed that the nanarg{max, min} functions, and >> possibly >> >> the nan{max, min} also, raise an error instead. >> > >> > I agree with Nathan; this sounds like more reasonable behaviour to me. >> >> If I understand what you are proposing >> >> -1 on raising an error with nan{max, min}, >> >> an empty array is empty in all columns >> an array with nans, might be empty in only some columns. >> >> as far as I understand, nan{max, min} only make sense with arrays that >> can hold a nan, so we can return nans. >> > > That was my original thought. > > >> >> If a user calls with ints or bool, then there are either no nans or >> the array is empty, and I don't care. >> >> --- >> aside >> with nanarg{max, min} I would just return 0 in an all nan column, >> since the max or min is nan, and one is at zero. >> (but I'm not arguing) >> >> > That is an interesting proposal. I like it. > > Chuck > > > And it is logically consistent, I think. a[nanargmax(a)] == nanmax(a) (ignoring the silly detail that you can't do an equality on nans). Ben Root -------------- next part -------------- An HTML attachment was scrubbed... URL: From stefan at sun.ac.za Wed Oct 2 14:05:33 2013 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Wed, 2 Oct 2013 20:05:33 +0200 Subject: [Numpy-discussion] Behavior of nan{max, min} and nanarg{max, min} for all-nan slices. In-Reply-To: References: Message-ID: On 2 Oct 2013 19:14, "Benjamin Root" wrote: > > And it is logically consistent, I think. a[nanargmax(a)] == nanmax(a) (ignoring the silly detail that you can't do an equality on nans). Why do you call this a silly detail? It seems to me a fundamental flaw to this approach. St?fan -------------- next part -------------- An HTML attachment was scrubbed... URL: From josef.pktd at gmail.com Wed Oct 2 14:49:24 2013 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Wed, 2 Oct 2013 14:49:24 -0400 Subject: [Numpy-discussion] Behavior of nan{max, min} and nanarg{max, min} for all-nan slices. In-Reply-To: References: Message-ID: On Wed, Oct 2, 2013 at 2:05 PM, St?fan van der Walt wrote: > On 2 Oct 2013 19:14, "Benjamin Root" wrote: >> >> And it is logically consistent, I think. a[nanargmax(a)] == nanmax(a) >> (ignoring the silly detail that you can't do an equality on nans). > > Why do you call this a silly detail? It seems to me a fundamental flaw to > this approach. a nan is a nan is a NaN >>> np.testing.assert_equal([0, np.nan], [0, np.nan]) >>> Josef > > St?fan > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From josef.pktd at gmail.com Wed Oct 2 14:51:31 2013 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Wed, 2 Oct 2013 14:51:31 -0400 Subject: [Numpy-discussion] Behavior of nan{max, min} and nanarg{max, min} for all-nan slices. 
In-Reply-To: References: Message-ID: On Wed, Oct 2, 2013 at 2:49 PM, wrote: > On Wed, Oct 2, 2013 at 2:05 PM, St?fan van der Walt wrote: >> On 2 Oct 2013 19:14, "Benjamin Root" wrote: >>> >>> And it is logically consistent, I think. a[nanargmax(a)] == nanmax(a) >>> (ignoring the silly detail that you can't do an equality on nans). >> >> Why do you call this a silly detail? It seems to me a fundamental flaw to >> this approach. > > a nan is a nan is a NaN > >>>> np.testing.assert_equal([0, np.nan], [0, np.nan]) >>>> and the functions have "nan" in their names nan in - NaN out what about nanmean, nansum, ...? Josef > > Josef > >> >> St?fan >> >> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion >> From ben.root at ou.edu Wed Oct 2 15:04:56 2013 From: ben.root at ou.edu (Benjamin Root) Date: Wed, 2 Oct 2013 15:04:56 -0400 Subject: [Numpy-discussion] Behavior of nan{max, min} and nanarg{max, min} for all-nan slices. In-Reply-To: References: Message-ID: On Wed, Oct 2, 2013 at 2:05 PM, St?fan van der Walt wrote: > On 2 Oct 2013 19:14, "Benjamin Root" wrote: > > > > And it is logically consistent, I think. a[nanargmax(a)] == nanmax(a) > (ignoring the silly detail that you can't do an equality on nans). > > Why do you call this a silly detail? It seems to me a fundamental flaw to > this approach. > > Just saying that it conceptually makes sense, even if the exact code I used wouldn't be perfectly correct. Because these are NaN functions, it means that the users are already aware of the need to handle nans appropriately. Just because you can't actually do equality between two NaNs in the same way as one can do with numbers does not invalidate the concept. Ben Root -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Wed Oct 2 15:19:14 2013 From: njs at pobox.com (Nathaniel Smith) Date: Wed, 2 Oct 2013 20:19:14 +0100 Subject: [Numpy-discussion] Behavior of nan{max, min} and nanarg{max, min} for all-nan slices. In-Reply-To: References: Message-ID: On Wed, Oct 2, 2013 at 7:51 PM, wrote: > On Wed, Oct 2, 2013 at 2:49 PM, wrote: >> On Wed, Oct 2, 2013 at 2:05 PM, St?fan van der Walt wrote: >>> On 2 Oct 2013 19:14, "Benjamin Root" wrote: >>>> >>>> And it is logically consistent, I think. a[nanargmax(a)] == nanmax(a) >>>> (ignoring the silly detail that you can't do an equality on nans). >>> >>> Why do you call this a silly detail? It seems to me a fundamental flaw to >>> this approach. >> >> a nan is a nan is a NaN >> >>>>> np.testing.assert_equal([0, np.nan], [0, np.nan]) >>>>> > > and the functions have "nan" in their names > nan in - NaN out This makes no sense :-). The nan in the names means "pretend the nans aren't there", not "please scatter nans in the output"! These are just vectorized operations that can fail in some cases and not others, there's nothing special about the fact that the function's definition also involves nan. > what about nanmean, nansum, ...? They do the same thing as mean([]), sum([]), etc., which are well-defined. (nan and 0 respectively.) -n From charlesr.harris at gmail.com Wed Oct 2 15:19:27 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Wed, 2 Oct 2013 13:19:27 -0600 Subject: [Numpy-discussion] Behavior of nan{max, min} and nanarg{max, min} for all-nan slices. 
In-Reply-To: References: Message-ID: On Wed, Oct 2, 2013 at 12:51 PM, wrote: > On Wed, Oct 2, 2013 at 2:49 PM, wrote: > > On Wed, Oct 2, 2013 at 2:05 PM, St?fan van der Walt > wrote: > >> On 2 Oct 2013 19:14, "Benjamin Root" wrote: > >>> > >>> And it is logically consistent, I think. a[nanargmax(a)] == nanmax(a) > >>> (ignoring the silly detail that you can't do an equality on nans). > >> > >> Why do you call this a silly detail? It seems to me a fundamental flaw > to > >> this approach. > > > > a nan is a nan is a NaN > > > >>>> np.testing.assert_equal([0, np.nan], [0, np.nan]) > >>>> > > and the functions have "nan" in their names > nan in - NaN out > > what about nanmean, nansum, ...? > nanmean returns nan for empty slices while nansum returns nan in 1.8, consistent with previous behavior, and will return 0 in 1.9. The main problem I had was deciding what arg{max, min} should return as the return value is an integer. I like your suggestion of returning 0. One further possibility is to add a keyword 'raise' to make the behavior selectable. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From tmp50 at ukr.net Wed Oct 2 15:32:41 2013 From: tmp50 at ukr.net (Dmitrey) Date: Wed, 02 Oct 2013 22:32:41 +0300 Subject: [Numpy-discussion] MATLAB fsolve now available in Python Message-ID: <1380742043.471698032.9rc2y0ro@frv43.ukr.net> Hi all, New solver for systems of nonlinear equations ( SNLE ) has been connected to free Python framework OpenOpt: fsolve from? MATLAB ? Optimization Toolbox; uploaded into PYPI in v. 0.5112. As well as fmincon , currently it's available for Python 2 only. Unlike scipy.optimize fsolve, it can handle sparse derivatives, user-supplied or from FuncDesigner automatic differentiation. See the example with 15000 equations. To keep discussion in a single place please use the OpenOpt forum thread. Regards, D. -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Wed Oct 2 15:35:46 2013 From: njs at pobox.com (Nathaniel Smith) Date: Wed, 2 Oct 2013 20:35:46 +0100 Subject: [Numpy-discussion] Behavior of nan{max, min} and nanarg{max, min} for all-nan slices. In-Reply-To: References: Message-ID: On Wed, Oct 2, 2013 at 8:19 PM, Charles R Harris wrote: > > > > On Wed, Oct 2, 2013 at 12:51 PM, wrote: >> >> On Wed, Oct 2, 2013 at 2:49 PM, wrote: >> > On Wed, Oct 2, 2013 at 2:05 PM, St?fan van der Walt >> > wrote: >> >> On 2 Oct 2013 19:14, "Benjamin Root" wrote: >> >>> >> >>> And it is logically consistent, I think. a[nanargmax(a)] == nanmax(a) >> >>> (ignoring the silly detail that you can't do an equality on nans). >> >> >> >> Why do you call this a silly detail? It seems to me a fundamental flaw >> >> to >> >> this approach. >> > >> > a nan is a nan is a NaN >> > >> >>>> np.testing.assert_equal([0, np.nan], [0, np.nan]) >> >>>> >> >> and the functions have "nan" in their names >> nan in - NaN out >> >> what about nanmean, nansum, ...? > > nanmean returns nan for empty slices while nansum returns nan in 1.8, > consistent with previous behavior, and will return 0 in 1.9. > > The main problem I had was deciding what arg{max, min} should return as the > return value is an integer. I like your suggestion of returning 0. I don't understand the justification for returning 0 at all. "nan" is not the max or min or the array. Even if argmax/argmin return nan[1], it's just a special code meaning "undefined", it has no relation to the nans inside the array. 
So returning 0 just feels to me like pure "we have to return *something* and this something! (that we can kind of justify is no-one looks too hard!)". This exactly the impulse that "when in doubt refuse the temptation to guess" is written to counteract... Seriously what user calls nanargmax and *wants* to get pointed to a random nan inside the array? Isn't the whole point of calling nanargmax to avoid exactly this situation? -n [1] It seems clear that we shouldn't mess about with argmax/argmin for 1.8, but my guess is that in the long run we'll come up with a more general convention for signalling partial errors, and eventually want to switch nanargmax/nanargmin to using that. From stefan at sun.ac.za Wed Oct 2 15:43:38 2013 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Wed, 2 Oct 2013 21:43:38 +0200 Subject: [Numpy-discussion] Behavior of nan{max, min} and nanarg{max, min} for all-nan slices. In-Reply-To: References: Message-ID: On 2 Oct 2013 21:19, "Charles R Harris" wrote: > > > The main problem I had was deciding what arg{max, min} should return as the return value is an integer. I like your suggestion of returning 0. This doesn't allow the user to know the difference between valid and" invalid" output, does it? > One further possibility is to add a keyword 'raise' to make the behavior selectable. Preferably just pick a sensible default. St?fan -------------- next part -------------- An HTML attachment was scrubbed... URL: From piet at vanoostrum.org Wed Oct 2 16:31:46 2013 From: piet at vanoostrum.org (Piet van Oostrum) Date: Wed, 02 Oct 2013 16:31:46 -0400 Subject: [Numpy-discussion] 1.8.0rc1 References: Message-ID: Charles R Harris writes: > Hi All, > > NumPy 1.8.0rc1 is up now on sourceforge .The binary builds are included except for Python 3.3 on > windows, which will arrive later. Many thanks to Ralf for the binaries, and to those who found and > fixed the bugs in the last beta. Any remaining bugs are all my fault ;) I hope this will be the > last release before final, so please test it thoroughly. I have installed 1.0.8rc1 on Python 3.3.2 on Mac OS X Snow Leopard (10.6.8) from the binary installer http://ufpr.dl.sourceforge.net/project/numpy/NumPy/1.8.0rc1/numpy-1.8.0rc1-py3.3-python.org-macosx10.6.dmg an the test fails with 20 errors. I have tried it also with installing from source but it also gives these erros (I haven't checked if the errors are the same bit for bit, but they were also 20). Here is the output of the test run. -------------- next part -------------- An embedded and charset-unspecified text was scrubbed... Name: numpy.test URL: -------------- next part -------------- -- Piet van Oostrum WWW: http://pietvanoostrum.com/ PGP key: [8DAE142BE17999C4] From charlesr.harris at gmail.com Wed Oct 2 19:29:28 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Wed, 2 Oct 2013 17:29:28 -0600 Subject: [Numpy-discussion] 1.8.0, nan functions, and pandas Message-ID: Hi All, There was a problem with pandas 0.12 and the numpy nan functions when applying the nansum function to a pandas Series object. We thought we had fixed it by the slight of hand use of `a.sum` instead of `add.reduce(a)` so that the Series object could use its own version of `sum` which overrides the ndarray version. However, the pandas sum method has a different signature, so success was illusory and due to not naming the keyword arguments in the call. The new nan functions nanmean, nanvar, and nanstd also failed, as the actually do name the keyword arguments. 
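A toy stand-in for a Series-like subclass (purely illustrative, not the real pandas code) shows why naming the keyword exposes the mismatch:

import numpy as np

# Hypothetical subclass whose sum() predates the keepdims keyword.
class SeriesLike(np.ndarray):
    def sum(self, axis=None, dtype=None, out=None):   # no keepdims, no **kwargs
        return np.asarray(self).sum(axis=axis, dtype=dtype, out=out)

s = np.arange(4.0).view(SeriesLike)
s.sum(axis=0)                        # fine as long as the extra keyword is never named
try:
    s.sum(axis=0, keepdims=False)    # the kind of call the new nan functions make
except TypeError as err:
    print(err)                       # unexpected keyword argument 'keepdims'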
For pandas development all the functions work as the pandas sum method now include a `**kwargs` which gobbles up the missing keyword `keepdims`. At this point I don't think there is anything we can legitimately do to fix the pandas 0.12 problem. I propose to name all the keyword arguments arguments so that pandas fails, as it should, and simply document that the nan functions will not work with pandas versions < 0.13. Thoughts? Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Wed Oct 2 19:41:00 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Wed, 2 Oct 2013 17:41:00 -0600 Subject: [Numpy-discussion] 1.8.0, nan functions, and pandas In-Reply-To: References: Message-ID: On Wed, Oct 2, 2013 at 5:29 PM, Charles R Harris wrote: > Hi All, > > There was a problem with pandas 0.12 and the numpy nan functions when > applying the nansum function to a pandas Series object. We thought we had > fixed it by the slight of hand use of `a.sum` instead of `add.reduce(a)` so > that the Series object could use its own version of `sum` which overrides > the ndarray version. However, the pandas sum method has a different > signature, so success was illusory and due to not naming the keyword > arguments in the call. The new nan functions nanmean, nanvar, and nanstd > also failed, as the actually do name the keyword arguments. For pandas > development all the functions work as the pandas sum method now include a > `**kwargs` which gobbles up the missing keyword `keepdims`. > > At this point I don't think there is anything we can legitimately do to > fix the pandas 0.12 problem. I propose to name all the keyword arguments > arguments so that pandas fails, as it should, and simply document that the > nan functions will not work with pandas versions < 0.13. > > Thoughts? > > One thing we maybe should do is convert to arrays and wrap the result as we do for the linalg functions. Currently, the nanfunctions happily keep subtypes of ndarray and work with them. But I am not sure the usual wrapping will work for Series. l note that Matrix also fails in the same way as Series as it's sum function doesn't have a keepdims keyword. Hmm... -------------- next part -------------- An HTML attachment was scrubbed... URL: From daverz at gmail.com Wed Oct 2 21:00:06 2013 From: daverz at gmail.com (Dave Cook) Date: Wed, 2 Oct 2013 18:00:06 -0700 Subject: [Numpy-discussion] Products of small float32 values. Message-ID: Can someone explain what is going on here? In [153]: small = ones(1, dtype='float32') In [154]: small Out[154]: array([ 1.], dtype=float32) In [155]: small*1e-45 Out[155]: array([ 1.40129846e-45], dtype=float32) In [156]: small*1e-46 Out[156]: array([ 0.], dtype=float32) I would expect float32 to be able to represent numbers with exponents as small as -127. Thanks, Dave Cook -------------- next part -------------- An HTML attachment was scrubbed... URL: From matthew.brett at gmail.com Wed Oct 2 21:06:08 2013 From: matthew.brett at gmail.com (Matthew Brett) Date: Wed, 2 Oct 2013 18:06:08 -0700 Subject: [Numpy-discussion] Products of small float32 values. In-Reply-To: References: Message-ID: Hi, On Wed, Oct 2, 2013 at 6:00 PM, Dave Cook wrote: > Can someone explain what is going on here? 
> > In [153]: > > small = ones(1, dtype='float32') > > In [154]: > > small > > Out[154]: > > array([ 1.], dtype=float32) > > In [155]: > > small*1e-45 > > Out[155]: > > array([ 1.40129846e-45], dtype=float32) > > In [156]: > > small*1e-46 > > Out[156]: > > array([ 0.], dtype=float32) > > I would expect float32 to be able to represent > numbers with exponents as small as -127. Is it possible you are thinking of 2**-126 rather than 10**-126? In [3]: print np.finfo(np.float32) Machine parameters for float32 --------------------------------------------------------------------- precision= 6 resolution= 1.0000000e-06 machep= -23 eps= 1.1920929e-07 negep = -24 epsneg= 5.9604645e-08 minexp= -126 tiny= 1.1754944e-38 maxexp= 128 max= 3.4028235e+38 nexp = 8 min= -max --------------------------------------------------------------------- In [4]: 2**-126 Out[4]: 1.1754943508222875e-38 So I think your 10**-45 was already 'subnormal' and then 10**-46 went down to 0... Cheers, Matthew From daverz at gmail.com Wed Oct 2 21:20:56 2013 From: daverz at gmail.com (Dave Cook) Date: Wed, 2 Oct 2013 18:20:56 -0700 Subject: [Numpy-discussion] Products of small float32 values. In-Reply-To: References: Message-ID: On Wed, Oct 2, 2013 at 6:06 PM, Matthew Brett wrote: > > > Is it possible you are thinking of 2**-126 rather than 10**-126? > > Yup, brainfart... > In [3]: print np.finfo(np.float32) > Machine parameters for float32 > --------------------------------------------------------------------- > precision= 6 resolution= 1.0000000e-06 > machep= -23 eps= 1.1920929e-07 > negep = -24 epsneg= 5.9604645e-08 > minexp= -126 tiny= 1.1754944e-38 > maxexp= 128 max= 3.4028235e+38 > nexp = 8 min= -max > --------------------------------------------------------------------- > Thanks for that. Dave Cook -------------- next part -------------- An HTML attachment was scrubbed... URL: From daniele at grinta.net Thu Oct 3 06:06:33 2013 From: daniele at grinta.net (Daniele Nicolodi) Date: Thu, 03 Oct 2013 12:06:33 +0200 Subject: [Numpy-discussion] Behavior of nan{max, min} and nanarg{max, min} for all-nan slices. In-Reply-To: References: Message-ID: <524D41A9.6040309@grinta.net> Hello, sorry, I don't know where exactly jump in in the thread, it is getting quite long and articulated... On 02/10/2013 21:19, Charles R Harris wrote: > The main problem I had was deciding what arg{max, min} should return as > the return value is an integer. I like your suggestion of returning 0. What about returning -1? It is still an integer (on my numpy version the return value is a signed integer), it still has the property that a[np.argmin(a)] == nan for a of only nans, but it is easily identifiable as an anomalous return value if needed. > One further possibility is to add a keyword 'raise' to make the behavior > selectable. I don't like function arguments that change the function behavior. Cheers, Daniele From charlesr.harris at gmail.com Thu Oct 3 07:56:31 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Thu, 3 Oct 2013 05:56:31 -0600 Subject: [Numpy-discussion] Behavior of nan{max, min} and nanarg{max, min} for all-nan slices. In-Reply-To: <524D41A9.6040309@grinta.net> References: <524D41A9.6040309@grinta.net> Message-ID: On Thu, Oct 3, 2013 at 4:06 AM, Daniele Nicolodi wrote: > Hello, > > sorry, I don't know where exactly jump in in the thread, it is getting > quite long and articulated... 
> > On 02/10/2013 21:19, Charles R Harris wrote: > > The main problem I had was deciding what arg{max, min} should return as > > the return value is an integer. I like your suggestion of returning 0. > > What about returning -1? It is still an integer (on my numpy version the > return value is a signed integer), it still has the property that > a[np.argmin(a)] == nan for a of only nans, but it is easily identifiable > as an anomalous return value if needed. > The problem is that -1 is a valid index, whereas intp.min will always be out of range and lead to an IndexError if used. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From daniele at grinta.net Thu Oct 3 08:34:39 2013 From: daniele at grinta.net (Daniele Nicolodi) Date: Thu, 03 Oct 2013 14:34:39 +0200 Subject: [Numpy-discussion] Behavior of nan{max, min} and nanarg{max, min} for all-nan slices. In-Reply-To: References: <524D41A9.6040309@grinta.net> Message-ID: <524D645F.3020608@grinta.net> On 03/10/2013 13:56, Charles R Harris wrote: > On Thu, Oct 3, 2013 at 4:06 AM, Daniele Nicolodi > wrote: > > Hello, > > sorry, I don't know where exactly jump in in the thread, it is getting > quite long and articulated... > > On 02/10/2013 21:19, Charles R Harris wrote: > > The main problem I had was deciding what arg{max, min} should > return as > > the return value is an integer. I like your suggestion of returning 0. > > What about returning -1? It is still an integer (on my numpy version the > return value is a signed integer), it still has the property that > a[np.argmin(a)] == nan for a of only nans, but it is easily identifiable > as an anomalous return value if needed. > > > The problem is that -1 is a valid index, whereas intp.min will always be > out of range and lead to an IndexError if used. If the goal is to have an error raised, just do it and do not rely on the fact that using an invalid index will soon or later result in an error. My proposal was a compromise to the proposal of returning 0. 0 is a valid index that cannot be distinguished from a valid return value, -1 is a valid index but can be easily distinguished as a special case. I don't have a strong preference between try: i = np.nanargmin(a) except ValueError: something() and i = np.nanargmin(a) if i < 0: something() but definitely I don't like returning 0: i = np.nanargmin(a) if i == 0: # uhm, wait, is the minimum at index 0 or the array is all nans?!? if np.isnan(a[0]): something() Cheers, Daniele From ben.root at ou.edu Thu Oct 3 09:10:56 2013 From: ben.root at ou.edu (Benjamin Root) Date: Thu, 3 Oct 2013 09:10:56 -0400 Subject: [Numpy-discussion] Behavior of nan{max, min} and nanarg{max, min} for all-nan slices. In-Reply-To: <524D41A9.6040309@grinta.net> References: <524D41A9.6040309@grinta.net> Message-ID: On Thu, Oct 3, 2013 at 6:06 AM, Daniele Nicolodi wrote: > Hello, > > sorry, I don't know where exactly jump in in the thread, it is getting > quite long and articulated... > > On 02/10/2013 21:19, Charles R Harris wrote: > > The main problem I had was deciding what arg{max, min} should return as > > the return value is an integer. I like your suggestion of returning 0. > > What about returning -1? It is still an integer (on my numpy version the > return value is a signed integer), it still has the property that > a[np.argmin(a)] == nan for a of only nans, but it is easily identifiable > as an anomalous return value if needed. > > This actually makes a lot of sense. We would never return -1 for any other reason. 
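For concreteness, here is how the two candidate sentinels behave if they do leak into indexing (plain 1-d array):

import numpy as np

a = np.array([np.nan, np.nan, np.nan])

print(a[-1])                     # -1 is a legal index and silently returns the last element
try:
    a[np.iinfo(np.intp).min]     # the intp.min sentinel is always out of range
except IndexError as err:
    print(err)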
And if it is used for indexing anywhere else, that's (mostly) ok. A problem might occur if the indexes gathered from this function are then used to define slices. But I can't really convince myself that it would be all that terrible in that case, too. Documentation will be paramount here. Ben Root -------------- next part -------------- An HTML attachment was scrubbed... URL: From jsseabold at gmail.com Thu Oct 3 09:57:21 2013 From: jsseabold at gmail.com (Skipper Seabold) Date: Thu, 3 Oct 2013 09:57:21 -0400 Subject: [Numpy-discussion] Behavior of nan{max, min} and nanarg{max, min} for all-nan slices. In-Reply-To: References: <524D41A9.6040309@grinta.net> Message-ID: On Thu, Oct 3, 2013 at 9:10 AM, Benjamin Root wrote: > On Thu, Oct 3, 2013 at 6:06 AM, Daniele Nicolodi wrote: >> >> Hello, >> >> sorry, I don't know where exactly jump in in the thread, it is getting >> quite long and articulated... >> >> On 02/10/2013 21:19, Charles R Harris wrote: >> > The main problem I had was deciding what arg{max, min} should return as >> > the return value is an integer. I like your suggestion of returning 0. >> >> What about returning -1? It is still an integer (on my numpy version the >> return value is a signed integer), it still has the property that >> a[np.argmin(a)] == nan for a of only nans, but it is easily identifiable >> as an anomalous return value if needed. >> > > This actually makes a lot of sense. We would never return -1 for any other > reason. And if it is used for indexing anywhere else, that's (mostly) ok. A > problem might occur if the indexes gathered from this function are then used > to define slices. But I can't really convince myself that it would be all > that terrible in that case, too. Documentation will be paramount here. Please, no. It's another thing to remember and another way to shoot yourself in the foot and introduce casual bugs. FWIW, my vote is to raise an error or return a nan, which will likely eventually raise an error. If I have all nans, it's usually the case that something's off, and I'd like to know sooner rather than later. Skipper From charlesr.harris at gmail.com Thu Oct 3 14:59:27 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Thu, 3 Oct 2013 12:59:27 -0600 Subject: [Numpy-discussion] Behavior of nan{max, min} and nanarg{max, min} for all-nan slices. In-Reply-To: References: <524D41A9.6040309@grinta.net> Message-ID: > Please, no. It's another thing to remember and another way to shoot > yourself in the foot and introduce casual bugs. > > FWIW, my vote is to raise an error or return a nan, which will likely > eventually raise an error. If I have all nans, it's usually the case > that something's off, and I'd like to know sooner rather than later. > > Here is what I have currently implemented. First, define an AllNanError class AllNanError(ValueError): def __init__(self, msg, result): ValueError.__init__(self, msg) self.result = result For nanmax/nanmin/nanargmax/nanargmin this error is raised for all-nan axis and the result is attached. The exception can then be caught and the result examined. A ValueError is what amax, amin return for empty arrays. For nanmax/nanmin the result for an empty slice is nan. For argnanmax/argnanmin the result of an empty slice is -1, which is easier to read and remember than intp.min. A ValueError is what argmin, argmax currently return for empty arrays. Note that both of these functions can give wrong results if they contain some min/max values respectively. That is an old bug and I haven't fixed it. 
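A minimal sketch of how a caller could then use this, assuming the proposed AllNanError above (this is the proposal, not current NumPy behavior):

import numpy as np

a = np.array([[1.0, 2.0],
              [np.nan, np.nan]])        # second row is all-nan

try:
    m = np.nanmax(a, axis=1)
except AllNanError as err:              # the class defined above
    m = err.result                      # partial result: valid in row 0,
                                        # nan for the all-nan row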
The nanmean/nanvar/nanstd functions currently raise a warning for all-nan slices and the result for such is nan. These could also be made to raise an error. Thoughts? Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Thu Oct 3 15:11:37 2013 From: njs at pobox.com (Nathaniel Smith) Date: Thu, 3 Oct 2013 20:11:37 +0100 Subject: [Numpy-discussion] Behavior of nan{max, min} and nanarg{max, min} for all-nan slices. In-Reply-To: References: <524D41A9.6040309@grinta.net> Message-ID: On Thu, Oct 3, 2013 at 7:59 PM, Charles R Harris wrote: > > > >> >> Please, no. It's another thing to remember and another way to shoot >> yourself in the foot and introduce casual bugs. >> >> FWIW, my vote is to raise an error or return a nan, which will likely >> eventually raise an error. If I have all nans, it's usually the case >> that something's off, and I'd like to know sooner rather than later. >> > > Here is what I have currently implemented. First, define an AllNanError > > class AllNanError(ValueError): > def __init__(self, msg, result): > ValueError.__init__(self, msg) > self.result = result > > For nanmax/nanmin/nanargmax/nanargmin this error is raised for all-nan axis > and the result is attached. The exception can then be caught and the result > examined. A ValueError is what amax, amin return for empty arrays. > > For nanmax/nanmin the result for an empty slice is nan. For > argnanmax/argnanmin the result of an empty slice is -1, which is easier to > read and remember than intp.min. A ValueError is what argmin, argmax > currently return for empty arrays. Note that both of these functions can > give wrong results if they contain some min/max values respectively. That is > an old bug and I haven't fixed it. > > The nanmean/nanvar/nanstd functions currently raise a warning for all-nan > slices and the result for such is nan. These could also be made to raise an > error. > > Thoughts? Is this intended for 1.8 or master? -n From charlesr.harris at gmail.com Thu Oct 3 15:40:20 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Thu, 3 Oct 2013 13:40:20 -0600 Subject: [Numpy-discussion] Behavior of nan{max, min} and nanarg{max, min} for all-nan slices. In-Reply-To: References: <524D41A9.6040309@grinta.net> Message-ID: On Thu, Oct 3, 2013 at 1:11 PM, Nathaniel Smith wrote: > On Thu, Oct 3, 2013 at 7:59 PM, Charles R Harris > wrote: > > > > > > > >> > >> Please, no. It's another thing to remember and another way to shoot > >> yourself in the foot and introduce casual bugs. > >> > >> FWIW, my vote is to raise an error or return a nan, which will likely > >> eventually raise an error. If I have all nans, it's usually the case > >> that something's off, and I'd like to know sooner rather than later. > >> > > > > Here is what I have currently implemented. First, define an AllNanError > > > > class AllNanError(ValueError): > > def __init__(self, msg, result): > > ValueError.__init__(self, msg) > > self.result = result > > > > For nanmax/nanmin/nanargmax/nanargmin this error is raised for all-nan > axis > > and the result is attached. The exception can then be caught and the > result > > examined. A ValueError is what amax, amin return for empty arrays. > > > > For nanmax/nanmin the result for an empty slice is nan. For > > argnanmax/argnanmin the result of an empty slice is -1, which is easier > to > > read and remember than intp.min. A ValueError is what argmin, argmax > > currently return for empty arrays. 
Note that both of these functions can > > give wrong results if they contain some min/max values respectively. > That is > > an old bug and I haven't fixed it. > > > > The nanmean/nanvar/nanstd functions currently raise a warning for all-nan > > slices and the result for such is nan. These could also be made to raise > an > > error. > > > > Thoughts? > > Is this intended for 1.8 or master? > I was thinking both. The nanarg* functions are changing behavior anyway, so might as well get it all done in 1.8. I also think there will need to be an rc2 in anycase. the current non-nan aware mean/var/std only raise warnings on insufficient degrees of freedom and return nan. That's a bit of a change, they used to return nans, negative numbers, and other things in that situation. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From daniele at grinta.net Thu Oct 3 15:44:34 2013 From: daniele at grinta.net (Daniele Nicolodi) Date: Thu, 03 Oct 2013 21:44:34 +0200 Subject: [Numpy-discussion] Behavior of nan{max, min} and nanarg{max, min} for all-nan slices. In-Reply-To: References: <524D41A9.6040309@grinta.net> Message-ID: <524DC922.90707@grinta.net> On 03/10/2013 20:59, Charles R Harris wrote: > Here is what I have currently implemented. First, define an AllNanError > > class AllNanError(ValueError): > def __init__(self, msg, result): > ValueError.__init__(self, msg) > self.result = result > > For nanmax/nanmin/nanargmax/nanargmin this error is raised for all-nan axis > and the result is attached. The exception can then be caught and the > result examined. A ValueError is what amax, amin return for empty arrays. > > For nanmax/nanmin the result for an empty slice is nan. For > argnanmax/argnanmin the result of an empty slice is -1, which is easier > to read and remember than intp.min. I think an error in this cases would be less confusing. > A ValueError is what argmin, argmax > currently return for empty arrays. Note that both of these functions can > give wrong results if they contain some min/max values respectively. > That is an old bug and I haven't fixed it. > > The nanmean/nanvar/nanstd functions currently raise a warning for > all-nan slices and the result for such is nan. These could also be made > to raise an error. I think an error in that case would be more consistent. Cheers, Daniele From njs at pobox.com Thu Oct 3 16:38:25 2013 From: njs at pobox.com (Nathaniel Smith) Date: Thu, 3 Oct 2013 21:38:25 +0100 Subject: [Numpy-discussion] Behavior of nan{max, min} and nanarg{max, min} for all-nan slices. In-Reply-To: References: <524D41A9.6040309@grinta.net> Message-ID: On Thu, Oct 3, 2013 at 8:40 PM, Charles R Harris wrote: > > > > On Thu, Oct 3, 2013 at 1:11 PM, Nathaniel Smith wrote: >> >> On Thu, Oct 3, 2013 at 7:59 PM, Charles R Harris >> wrote: >> > >> > >> > >> >> >> >> Please, no. It's another thing to remember and another way to shoot >> >> yourself in the foot and introduce casual bugs. >> >> >> >> FWIW, my vote is to raise an error or return a nan, which will likely >> >> eventually raise an error. If I have all nans, it's usually the case >> >> that something's off, and I'd like to know sooner rather than later. >> >> >> > >> > Here is what I have currently implemented. 
First, define an AllNanError >> > >> > class AllNanError(ValueError): >> > def __init__(self, msg, result): >> > ValueError.__init__(self, msg) >> > self.result = result >> > >> > For nanmax/nanmin/nanargmax/nanargmin this error is raised for all-nan >> > axis >> > and the result is attached. The exception can then be caught and the >> > result >> > examined. A ValueError is what amax, amin return for empty arrays. >> > >> > For nanmax/nanmin the result for an empty slice is nan. For >> > argnanmax/argnanmin the result of an empty slice is -1, which is easier >> > to >> > read and remember than intp.min. A ValueError is what argmin, argmax >> > currently return for empty arrays. Note that both of these functions can >> > give wrong results if they contain some min/max values respectively. >> > That is >> > an old bug and I haven't fixed it. >> > >> > The nanmean/nanvar/nanstd functions currently raise a warning for >> > all-nan >> > slices and the result for such is nan. These could also be made to raise >> > an >> > error. >> > >> > Thoughts? >> >> Is this intended for 1.8 or master? > > I was thinking both. The nanarg* functions are changing behavior anyway, so > might as well get it all done in 1.8. I also think there will need to be an > rc2 in anycase. This is obviously a complicated and contentious enough issue that I think for 1.8 we need to punt rather than try to force something out under time pressure. We obviously need an rc2 anyway with all the other stuff that's come in, but for 1.8 I'm going to suggest again we go with: - leave nanmax/nanmin as is - make nanargmax/nanargmin just raise a simple ValueError on all-nans (so that same as 1.7 for operations on multiple subarrays; the only difference would be that for 1d operations 1.8 would raise where 1.7 returned nan). I'm not saying we should stick with this forever, but it solves the immediately problem that started this whole mess, and I can't see how anyone could object to it as a temporary solution -- it's basically the same as what 1.7 does. And it clears the deck for whatever cleverer thing we come up with in master. Does that make sense? I'm not going to out-right say -1 on doing anything else for 1.8, but the longer this drags on the more tempting it becomes, just because who can really evaluate a more interesting proposal with the release breathing down their neck? > the current non-nan aware mean/var/std only raise warnings on insufficient > degrees of freedom and return nan. That's a bit of a change, they used to > return nans, negative numbers, and other things in that situation. Seems reasonable to me. -n From charlesr.harris at gmail.com Thu Oct 3 17:07:16 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Thu, 3 Oct 2013 15:07:16 -0600 Subject: [Numpy-discussion] Behavior of nan{max, min} and nanarg{max, min} for all-nan slices. In-Reply-To: References: <524D41A9.6040309@grinta.net> Message-ID: On Thu, Oct 3, 2013 at 2:38 PM, Nathaniel Smith wrote: > On Thu, Oct 3, 2013 at 8:40 PM, Charles R Harris > wrote: > > > > > > > > On Thu, Oct 3, 2013 at 1:11 PM, Nathaniel Smith wrote: > >> > >> On Thu, Oct 3, 2013 at 7:59 PM, Charles R Harris > >> wrote: > >> > > >> > > >> > > >> >> > >> >> Please, no. It's another thing to remember and another way to shoot > >> >> yourself in the foot and introduce casual bugs. > >> >> > >> >> FWIW, my vote is to raise an error or return a nan, which will likely > >> >> eventually raise an error. 
If I have all nans, it's usually the case > >> >> that something's off, and I'd like to know sooner rather than later. > >> >> > >> > > >> > Here is what I have currently implemented. First, define an > AllNanError > >> > > >> > class AllNanError(ValueError): > >> > def __init__(self, msg, result): > >> > ValueError.__init__(self, msg) > >> > self.result = result > >> > > >> > For nanmax/nanmin/nanargmax/nanargmin this error is raised for all-nan > >> > axis > >> > and the result is attached. The exception can then be caught and the > >> > result > >> > examined. A ValueError is what amax, amin return for empty arrays. > >> > > >> > For nanmax/nanmin the result for an empty slice is nan. For > >> > argnanmax/argnanmin the result of an empty slice is -1, which is > easier > >> > to > >> > read and remember than intp.min. A ValueError is what argmin, argmax > >> > currently return for empty arrays. Note that both of these functions > can > >> > give wrong results if they contain some min/max values respectively. > >> > That is > >> > an old bug and I haven't fixed it. > >> > > >> > The nanmean/nanvar/nanstd functions currently raise a warning for > >> > all-nan > >> > slices and the result for such is nan. These could also be made to > raise > >> > an > >> > error. > >> > > >> > Thoughts? > >> > >> Is this intended for 1.8 or master? > > > > I was thinking both. The nanarg* functions are changing behavior anyway, > so > > might as well get it all done in 1.8. I also think there will need to be > an > > rc2 in anycase. > > This is obviously a complicated and contentious enough issue that I > think for 1.8 we need to punt rather than try to force something out > under time pressure. We obviously need an rc2 anyway with all the > other stuff that's come in, but for 1.8 I'm going to suggest again we > go with: > - leave nanmax/nanmin as is > - make nanargmax/nanargmin just raise a simple ValueError on all-nans > (so that same as 1.7 for operations on multiple subarrays; the only > difference would be that for 1d operations 1.8 would raise where 1.7 > returned nan). > OK, sounds good to me. I'll leave the warnings in. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From Catherine.M.Moroney at jpl.nasa.gov Thu Oct 3 19:05:53 2013 From: Catherine.M.Moroney at jpl.nasa.gov (Moroney, Catherine M (398D)) Date: Thu, 3 Oct 2013 23:05:53 +0000 Subject: [Numpy-discussion] subsampling arrays without loops Message-ID: <2FF3F79D-A7E2-42C1-AEEB-B5E4A4B37378@jpl.nasa.gov> I know I have a lot yet to learn about array striding tricks, so please pardon the triviality of this question. Here is the problem both in words and "dumb" python: I have a large NxM array that I want to break down into smaller nxn chunks where n divides evenly into both N and M. Then I want to calculate the fraction of pixels in each nxn chunk that meets a certain criteria: say (x > 1) & (x < 2). Here is the "dumb" python code: npix = 4 num_true = numpy.zeros((data.shape[0]/npix, data.shape[1]/npix)) for iline in xrange(0, data.shape[0]/npix): for ismp in xrange(0, data.shape[1]/npix): excerpt = data[iline*npix:(iline+1)*npix, ismp*npix:(ismp+1)*npix] num_true[iline,simp] = numpy.where( (excerpt >= 1) & (excerpt <= 2), True, False).sum() so I'm looping through the number of 4x4 subsets in both dimensions, cutting out a chunk of the data ("excerpt"), and then counting the number of pixels in that excerpt that meet a certain criteria and storing that result for each excerpt. 
I want to avoid all the loops over iline and ismp. What's the best way of doing this in pure python? I could always write a Fortran/C routine for this task, but I want to learn how best to do it with numpy. Thank you for any advice, Catherine From jaime.frio at gmail.com Thu Oct 3 19:17:31 2013 From: jaime.frio at gmail.com (=?ISO-8859-1?Q?Jaime_Fern=E1ndez_del_R=EDo?=) Date: Thu, 3 Oct 2013 16:17:31 -0700 Subject: [Numpy-discussion] subsampling arrays without loops In-Reply-To: <2FF3F79D-A7E2-42C1-AEEB-B5E4A4B37378@jpl.nasa.gov> References: <2FF3F79D-A7E2-42C1-AEEB-B5E4A4B37378@jpl.nasa.gov> Message-ID: On Thu, Oct 3, 2013 at 4:05 PM, Moroney, Catherine M (398D) < Catherine.M.Moroney at jpl.nasa.gov> wrote: > I know I have a lot yet to learn about array striding tricks, so please > pardon the triviality of this question. > > Here is the problem both in words and "dumb" python: > > I have a large NxM array that I want to break down into smaller nxn chunks > where n divides evenly into both N and M. Then I want to calculate the > fraction of pixels in each nxn chunk that meets a certain criteria: say (x > > 1) & (x < 2). > If n divides both N and M, you can simply reshape your array to 4D, then reduce it back to 2D: import numpy as np N, M = 1000, 2000 n = 100 a = np.random.rand(N,M) * 5 a_view = a.reshape(a.shape[0] // n, n, a.shape[1] // n, n) a_fractions = np.sum((a_view > 1) & (a_view < 2), axis=(1, 3)) / (n * n) >>> a_fractions.shape (10L, 20L) >>> a_fractions array([[ 0.1965, 0.1964, 0.202 , 0.1997, 0.1976, 0.1997, 0.2026, 0.1951, 0.2051, 0.1995, 0.1926, 0.2006, 0.1973, 0.1964, 0.2046, 0.1977, 0.2066, 0.2009, 0.2003, 0.2013], ... [ 0.2028, 0.1943, 0.2036, 0.1992, 0.2 , 0.2009, 0.1971, 0.1996, 0.196 , 0.196 , 0.1983, 0.2021, 0.2031, 0.1955, 0.1916, 0.1939, 0.202 , 0.2064, 0.2021, 0.1954]]) Jaime -- (\__/) ( O.o) ( > <) Este es Conejo. Copia a Conejo en tu firma y ay?dale en sus planes de dominaci?n mundial. -------------- next part -------------- An HTML attachment was scrubbed... URL: From tmp50 at ukr.net Fri Oct 4 15:26:38 2013 From: tmp50 at ukr.net (Dmitrey) Date: Fri, 04 Oct 2013 22:26:38 +0300 Subject: [Numpy-discussion] [ANN] MATLAB ODE solvers - now available in Python Message-ID: <1380914598.651925561.b7d4jpuh@frv43.ukr.net> Several MATLAB ODE dy/dt = f(y,t) solvers (ode15s, ode23, ode113, ode23t, ode23tb, ode45, ode23s)? have been connected to free OpenOpt Suite package (possibly with? FuncDesigner ? automatic differentiation) in addition to scipy_lsoda (scipy.integrate.odeint), see the example . Currently only reltol parameter is available; future plans may include abstol, Python3 and PyPy support, solver ode15i for solving f(dy/dt, y, t) = 0, possibility to use single MATLAB session for several ODE probs. Sparse matrices handling is implemented for fmincon and fsolve but not ode solvers yet. ---------------------- Regards, D. http://openopt.org/Dmitrey -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Fri Oct 4 16:37:46 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Fri, 4 Oct 2013 14:37:46 -0600 Subject: [Numpy-discussion] Behavior of nan{max, min} and nanarg{max, min} for all-nan slices. In-Reply-To: References: <524D41A9.6040309@grinta.net> Message-ID: I've made a PR for this. The commit message is Deal with subclasses of ndarray, like pandas.Series and matrix. Subclasses may not define the new keyword keepdims or deal gracefully with ufuncs in all their forms. 
This is solved by throwing the problem onto the np.sum, np.any, etc. functions that have ugly hacks to deal with the problem. Settle handling of all-nan slices. nanmax, nanmin -- Raise warning, return NaN for slice. nanargmax, nanargmin -- Raise ValueError. nansum -- Return 0 for slice. nanmean, nanvar, nanstd -- Raise warning, return NaN for slice. Make NaN functions work for scalar arguments. This may seem silly, but it removes a check for special cases. Update tests Deal with new all-nan handling. Test with matrix class as example of subclass without keepdims. Test with scalar arguments. Note that 1.7 already raises a ValueError for nanarg{max, min} on account of casting rules that don't allow nans to be cast to ints. This just makes the same hold true for scalar returns. If there are no objections, this will go into 1.9 as a starting point and be backported to 1.8. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From ecarlson at eng.ua.edu Fri Oct 4 18:19:25 2013 From: ecarlson at eng.ua.edu (Eric Carlson) Date: Fri, 04 Oct 2013 17:19:25 -0500 Subject: [Numpy-discussion] [ANN] MATLAB ODE solvers - now available in Python In-Reply-To: <1380914598.651925561.b7d4jpuh@frv43.ukr.net> References: <1380914598.651925561.b7d4jpuh@frv43.ukr.net> Message-ID: Hello, Does this require a MATLAB install, or are these equivalent routines? Thanks, Eric From tmp50 at ukr.net Sat Oct 5 04:27:04 2013 From: tmp50 at ukr.net (Dmitrey) Date: Sat, 05 Oct 2013 11:27:04 +0300 Subject: [Numpy-discussion] [ANN] MATLAB ODE solvers - now available in Python In-Reply-To: References: <1380914598.651925561.b7d4jpuh@frv43.ukr.net> Message-ID: <1380961403.991447284.5qt9wlwp@frv43.ukr.net> It requires MATLAB or MATLAB Component Runtime? ( http://www.mathworks.com/products/compiler/mcr/ ) I'm not regular subscriber of the mail list thus you'd better ask openopt forum. ---------------------- Regards, D. http://openopt.org/Dmitrey --- ???????? ????????? --- ?? ????: "Eric Carlson" < ecarlson at eng.ua.edu > ????: 5 ??????? 2013, 01:19:28 Hello, Does this require a MATLAB install, or are these equivalent routines? Thanks, Eric _______________________________________________ NumPy-Discussion mailing list -------------- next part -------------- An HTML attachment was scrubbed... URL: From d.l.goldsmith at gmail.com Sat Oct 5 13:15:29 2013 From: d.l.goldsmith at gmail.com (David Goldsmith) Date: Sat, 5 Oct 2013 10:15:29 -0700 Subject: [Numpy-discussion] [ANN] MATLAB ODE solvers - now available in Python Message-ID: MCR stands for MATLAB Compiler Runtime and if that's all it requires, that's great, 'cause that's free. Look forward to giving this a try; does the distribution come w/ examples? DG Date: Sat, 05 Oct 2013 11:27:04 +0300 > From: Dmitrey > Subject: Re: [Numpy-discussion] [ANN] MATLAB ODE solvers - now > available in Python > To: Discussion of Numerical Python > Cc: scipy-user at scipy.org, numpy-discussion at scipy.org > Message-ID: <1380961403.991447284.5qt9wlwp at frv43.ukr.net> > Content-Type: text/plain; charset="utf-8" > > It requires MATLAB or MATLAB Component Runtime? ( > http://www.mathworks.com/products/compiler/mcr/ ) > I'm not regular subscriber of the mail list thus you'd better ask openopt > forum. > > ---------------------- > Regards, D. http://openopt.org/Dmitrey > --- ???????? ????????? --- > ?? ????: "Eric Carlson" < ecarlson at eng.ua.edu > > ????: 5 ??????? 2013, 01:19:28 > > Hello, > Does this require a MATLAB install, or are these equivalent routines? 
> > Thanks, > Eric > > _______________________________________________ > NumPy-Discussion mailing list > > -------------- next part -------------- > An HTML attachment was scrubbed... > URL: > http://mail.scipy.org/pipermail/numpy-discussion/attachments/20131005/efa7fba5/attachment-0001.html > > ------------------------------ > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > > End of NumPy-Discussion Digest, Vol 85, Issue 16 > ************************************************ > -- >From "A Letter From The Future" in "Peak Everything" by Richard Heinberg: "By the time I was an older teenager, a certain...attitude was developing among the young people...a feeling of utter contempt for anyone over a certain age--maybe 30 or 40. The adults had consumed so many resources, and now there were none left for their own children...when those adults were younger, they [were] just doing what everybody else was doing...they figured it was normal to cut down ancient forests for...phone books, pump every last gallon of oil to power their SUV's...[but] for...my generation all that was just a dim memory...We [grew up] living in darkness, with shortages of food and water, with riots in the streets, with people begging on street corners...for us, the adults were the enemy." Want to *really* understand what's *really* going on? Read "Peak Everything." -------------- next part -------------- An HTML attachment was scrubbed... URL: From tmp50 at ukr.net Sat Oct 5 14:36:48 2013 From: tmp50 at ukr.net (Dmitrey) Date: Sat, 05 Oct 2013 21:36:48 +0300 Subject: [Numpy-discussion] [ANN] MATLAB ODE solvers - now available in Python In-Reply-To: References: Message-ID: <1380997576.559804301.aoynahlj@frv43.ukr.net> ? --- ???????? ????????? --- ?? ????: "David Goldsmith" < d.l.goldsmith at gmail.com > ????: 5 ??????? 2013, 20:15:38 MCR stands for MATLAB Compiler Runtime and if that's all it requires, that's great, 'cause that's free.? Look forward to giving this a try; does the distribution come w/ examples? Seems like using the MATLAB solvers with MCR requires my wrappers containing in several files to be compiled with MATLAB Compiler before. I have no license for MATLAB thus I may have problems if I'll make it done and will spread it with OpenOpt suite code, also, binary files are incompatible with BSD license. On the other hand, IIRC a little bit obsolete MATLAB versions (I don't think difference is essential) have more liberal licenses. As for MATLAB solvers examples, I have already mentioned them in the mail list, you could see them in http://openopt.org/ODE (just replace solver name from scipy_lsoda to ode23s or any other), http://openopt.org/NLP , http://openopt.org/SNLE ---------------------- Regards, D. http://openopt.org/Dmitrey -------------- next part -------------- An HTML attachment was scrubbed... URL: From Bernhard.Spinnler at gmx.net Sun Oct 6 13:30:53 2013 From: Bernhard.Spinnler at gmx.net (Bernhard Spinnler) Date: Sun, 6 Oct 2013 19:30:53 +0200 Subject: [Numpy-discussion] Problem with numpy's array reference assignment? Message-ID: <76136111-2257-4E29-AAD5-EDEC423F8D31@gmx.net> I have problems to get a piece of code to work with a new numpy/scipy version. The code essentially sets up a matrix Ryy and a vector Rya and solves the system of linear equations Ryy*c = Rya for c. Then it checks whether the resulting vector c satisfies the equation: Ryy*c must be equal to Rya. 
While this code runs fine on - python-2.7.5.p1, numpy-1.7.0, scipy-0.12.0.p0 - python-2.7.3, numpy-1.7.1, scipy-0.12.0 it fails on - python 2.7.2, numpy 1.9.0.dev-fde3dee, scipy 0.14.0.dev-4938da3 with error: AssertionError: Arrays are not almost equal to 6 decimals (mismatch 100.0%) x: array([ 9.+0.j, 8.+0.j, 7.+0.j]) y: array([ 7.+0.j, 6.+0.j, 5.+0.j]) The piece of code is: ------------------------------------ import numpy as np from numpy.testing import assert_array_almost_equal lag0 = 5 N = 3 D = 2 corr_ya = np.array([0,1,2,3,4,5,6,7,8,9+0j]) corr_yy = np.array([0,1,2,3,4,5,4,3,2,1]) Rya = corr_ya[lag0+D:lag0+D-N:-1] Ryy = np.zeros((N, N), complex) for i in np.arange(N): Ryy[i,:] = corr_yy[lag0-i:lag0-i+N] c = np.linalg.solve(Ryy, Rya) # check result Ryy_x_c = np.dot(Ryy, c) print "Ryy*c =", repr(Ryy_x_c) print "Rya =", repr(Rya) assert_array_almost_equal(Ryy_x_c, Rya) ------------------------------------ I guess that it has something to do with numpy assigning arrays by reference and not copying them, since the problem goes away when vector Rya is copied before passing it to solve, i.e. doing Rya_copy = copy(Rya) and passing Rya_copy to solve. The problem also does not occur when the initial array corr_ya is an integer array, e.g. by deleting the "+0j" from the last element of corr_ya above. (Normally, my initial arrays are complex. I just used artificially simplified arrays above to show the problem.) I could imagine that this is due to numpy copying the integers into a new complex array instead of creating a reference that is passed to solve. Could it be that a bug has been introduced in recent numpy/scipy version? Or am I misunderstanding something? Thanks, Bernhard From charlesr.harris at gmail.com Sun Oct 6 13:53:20 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Sun, 6 Oct 2013 11:53:20 -0600 Subject: [Numpy-discussion] Problem with numpy's array reference assignment? In-Reply-To: <76136111-2257-4E29-AAD5-EDEC423F8D31@gmx.net> References: <76136111-2257-4E29-AAD5-EDEC423F8D31@gmx.net> Message-ID: On Sun, Oct 6, 2013 at 11:30 AM, Bernhard Spinnler < Bernhard.Spinnler at gmx.net> wrote: > I have problems to get a piece of code to work with a new numpy/scipy > version. The code essentially sets up a matrix Ryy and a vector Rya and > solves the system of linear equations Ryy*c = Rya for c. Then it checks > whether the resulting vector c satisfies the equation: Ryy*c must be equal > to Rya. 
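(A small workaround sketch for anyone hitting the same failure with the development version while the report above is investigated. It assumes, although this is not yet confirmed in the thread, that the trouble is the reverse-sliced, negative-stride view Rya being handed to solve, and simply passes a contiguous copy instead.)

import numpy as np

corr_ya = np.array([0, 1, 2, 3, 4, 5, 6, 7, 8, 9 + 0j])
corr_yy = np.array([0, 1, 2, 3, 4, 5, 4, 3, 2, 1])
lag0, N, D = 5, 3, 2

Rya = corr_ya[lag0 + D:lag0 + D - N:-1]        # negative-stride view into corr_ya
Ryy = np.zeros((N, N), complex)
for i in np.arange(N):
    Ryy[i, :] = corr_yy[lag0 - i:lag0 - i + N]

# workaround: hand solve() a contiguous copy instead of the view
c = np.linalg.solve(Ryy, np.ascontiguousarray(Rya))

np.testing.assert_array_almost_equal(np.dot(Ryy, c), Rya)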
> > While this code runs fine on > - python-2.7.5.p1, numpy-1.7.0, scipy-0.12.0.p0 > - python-2.7.3, numpy-1.7.1, scipy-0.12.0 > > it fails on > - python 2.7.2, numpy 1.9.0.dev-fde3dee, scipy 0.14.0.dev-4938da3 > with error: > > AssertionError: > Arrays are not almost equal to 6 decimals > > (mismatch 100.0%) > x: array([ 9.+0.j, 8.+0.j, 7.+0.j]) > y: array([ 7.+0.j, 6.+0.j, 5.+0.j]) > > The piece of code is: > ------------------------------------ > import numpy as np > from numpy.testing import assert_array_almost_equal > > lag0 = 5 > N = 3 > D = 2 > corr_ya = np.array([0,1,2,3,4,5,6,7,8,9+0j]) > corr_yy = np.array([0,1,2,3,4,5,4,3,2,1]) > > Rya = corr_ya[lag0+D:lag0+D-N:-1] > Ryy = np.zeros((N, N), complex) > for i in np.arange(N): > Ryy[i,:] = corr_yy[lag0-i:lag0-i+N] > > c = np.linalg.solve(Ryy, Rya) > > # check result > Ryy_x_c = np.dot(Ryy, c) > print "Ryy*c =", repr(Ryy_x_c) > print "Rya =", repr(Rya) > assert_array_almost_equal(Ryy_x_c, Rya) > ------------------------------------ > > I guess that it has something to do with numpy assigning arrays by > reference and not copying them, since the problem goes away when vector Rya > is copied before passing it to solve, i.e. doing Rya_copy = copy(Rya) and > passing Rya_copy to solve. > > The problem also does not occur when the initial array corr_ya is an > integer array, e.g. by deleting the "+0j" from the last element of corr_ya > above. (Normally, my initial arrays are complex. I just used artificially > simplified arrays above to show the problem.) I could imagine that this is > due to numpy copying the integers into a new complex array instead of > creating a reference that is passed to solve. > > Could it be that a bug has been introduced in recent numpy/scipy version? > Or am I misunderstanding something? > > It is certainly possible, there have been changes in the linalg routines. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Sun Oct 6 14:09:25 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Sun, 6 Oct 2013 12:09:25 -0600 Subject: [Numpy-discussion] Problem with numpy's array reference assignment? In-Reply-To: References: <76136111-2257-4E29-AAD5-EDEC423F8D31@gmx.net> Message-ID: I've opened an issue . Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From d.l.goldsmith at gmail.com Mon Oct 7 00:16:23 2013 From: d.l.goldsmith at gmail.com (David Goldsmith) Date: Sun, 6 Oct 2013 21:16:23 -0700 Subject: [Numpy-discussion] [ANN] MATLAB ODE solvers - now available in Python (Dmitrey) Message-ID: On Sun, Oct 6, 2013 at 10:00 AM, wrote: > Message: 2 > Date: Sat, 05 Oct 2013 21:36:48 +0300 > From: Dmitrey > Subject: Re: [Numpy-discussion] [ANN] MATLAB ODE solvers - now > available in Python > To: Discussion of Numerical Python > Cc: numpy-discussion at scipy.org > Message-ID: <1380997576.559804301.aoynahlj at frv43.ukr.net> > Content-Type: text/plain; charset="utf-8" > > Seems like using the MATLAB solvers with MCR requires my wrappers > containing in several files to be compiled with MATLAB Compiler before. I > have no license for MATLAB thus I may have problems if I'll make it done > and will spread it with OpenOpt suite code, also, binary files are > incompatible with BSD license. > Darn, knew it was too good to be true. > On the other hand, IIRC a little bit obsolete MATLAB versions (I don't > think difference is essential) have more liberal licenses. 
> As for MATLAB solvers examples, I have already mentioned them in the mail > list, you could see them in http://openopt.org/ODE (just replace solver > name from scipy_lsoda to ode23s or any other), http://openopt.org/NLP , > http://openopt.org/SNLE Oooops, so sorry. :-o DG > > ---------------------- > Regards, D. http://openopt.org/Dmitrey > -------------- next part -------------- > An HTML attachment was scrubbed... > URL: > http://mail.scipy.org/pipermail/numpy-discussion/attachments/20131005/dd6638db/attachment-0001.html > > ------------------------------ > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > > End of NumPy-Discussion Digest, Vol 85, Issue 17 > ************************************************ > -- >From "A Letter From The Future" in "Peak Everything" by Richard Heinberg: "By the time I was an older teenager, a certain...attitude was developing among the young people...a feeling of utter contempt for anyone over a certain age--maybe 30 or 40. The adults had consumed so many resources, and now there were none left for their own children...when those adults were younger, they [were] just doing what everybody else was doing...they figured it was normal to cut down ancient forests for...phone books, pump every last gallon of oil to power their SUV's...[but] for...my generation all that was just a dim memory...We [grew up] living in darkness, with shortages of food and water, with riots in the streets, with people begging on street corners...for us, the adults were the enemy." Want to *really* understand what's *really* going on? Read "Peak Everything." -------------- next part -------------- An HTML attachment was scrubbed... URL: From sgonzi at staffmail.ed.ac.uk Mon Oct 7 13:15:03 2013 From: sgonzi at staffmail.ed.ac.uk (Siegfried Gonzi) Date: Mon, 7 Oct 2013 18:15:03 +0100 Subject: [Numpy-discussion] Equivalent to IDL's help function Message-ID: <765BF3F2-7FE1-414E-A134-F363D841D0DC@staffmail.ed.ac.uk> Hi all What is the equivalent to IDL its help function, e.g. == IDL> a = make_array(23,23,) IDL> help,a will result in: A FLOAT = Array[23, 23] or IDL> a = create_struct('idl',23) IDL> help,a gives: A STRUCT = -> Array[1] == I have been looking for it ever since using numpy. It would make my life so much easier. Thanks, Siegfried -- The University of Edinburgh is a charitable body, registered in Scotland, with registration number SC005336. From aron at ahmadia.net Mon Oct 7 13:18:07 2013 From: aron at ahmadia.net (Aron Ahmadia) Date: Mon, 7 Oct 2013 13:18:07 -0400 Subject: [Numpy-discussion] Equivalent to IDL's help function In-Reply-To: <765BF3F2-7FE1-414E-A134-F363D841D0DC@staffmail.ed.ac.uk> References: <765BF3F2-7FE1-414E-A134-F363D841D0DC@staffmail.ed.ac.uk> Message-ID: If you're using IPython (strongly recommended) a? a?? If you're just using Python help(a) A On Mon, Oct 7, 2013 at 1:15 PM, Siegfried Gonzi wrote: > Hi all > > What is the equivalent to IDL its help function, e.g. > > == > IDL> a = make_array(23,23,) > > IDL> help,a > > will result in: > > A FLOAT = Array[23, 23] > > or > > IDL> a = create_struct('idl',23) > > IDL> help,a > > gives: > > A STRUCT = -> Array[1] > > == > > I have been looking for it ever since using numpy. It would make my life so much easier. > > > Thanks, Siegfried > > > -- > The University of Edinburgh is a charitable body, registered in > Scotland, with registration number SC005336. 
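(A tiny illustration of the introspection suggestions above; nothing here is IDL-specific, it is just what plain Python and numpy report for an array.)

import numpy as np

a = np.zeros((23, 23))
print(type(a))             # the concrete class of a
print(a.shape, a.dtype)    # shape and element type, roughly IDL's "A  FLOAT  = Array[23, 23]"
help(np.zeros)             # full docstring, closer to IDL's online help than to "help, a"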
> > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From ben.root at ou.edu Mon Oct 7 13:18:44 2013 From: ben.root at ou.edu (Benjamin Root) Date: Mon, 7 Oct 2013 13:18:44 -0400 Subject: [Numpy-discussion] Equivalent to IDL's help function In-Reply-To: <765BF3F2-7FE1-414E-A134-F363D841D0DC@staffmail.ed.ac.uk> References: <765BF3F2-7FE1-414E-A134-F363D841D0DC@staffmail.ed.ac.uk> Message-ID: On Mon, Oct 7, 2013 at 1:15 PM, Siegfried Gonzi wrote: > Hi all > > What is the equivalent to IDL its help function, e.g. > > == > IDL> a = make_array(23,23,) > > IDL> help,a > > will result in: > > A FLOAT = Array[23, 23] > > or > > IDL> a = create_struct('idl',23) > > IDL> help,a > > gives: > > A STRUCT = -> Array[1] > > == > > I have been looking for it ever since using numpy. It would make my life > so much easier. > > > Thanks, Siegfried > > help(a) Note that this is an inherent feature of python, not just numpy. So, you can do help(range) or help(np.zeros) or any such thing. Cheers! Ben Root -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Mon Oct 7 13:43:25 2013 From: njs at pobox.com (Nathaniel Smith) Date: Mon, 7 Oct 2013 18:43:25 +0100 Subject: [Numpy-discussion] Equivalent to IDL's help function In-Reply-To: <765BF3F2-7FE1-414E-A134-F363D841D0DC@staffmail.ed.ac.uk> References: <765BF3F2-7FE1-414E-A134-F363D841D0DC@staffmail.ed.ac.uk> Message-ID: The other answers tell you how to access Python's excellent quick reference documentation, but from your examples I'm not sure if that's what you're looking for? If you just want to know what value some variable has at the interactive prompt, just type its name and hit enter. (This is a general feature of the prompt: it shows you the value returned by whatever you type, whether that be a single variable or a complex expression.) Some other useful commands: dir(x), type(x) -n On 7 Oct 2013 18:17, "Siegfried Gonzi" wrote: > Hi all > > What is the equivalent to IDL its help function, e.g. > > == > IDL> a = make_array(23,23,) > > IDL> help,a > > will result in: > > A FLOAT = Array[23, 23] > > or > > IDL> a = create_struct('idl',23) > > IDL> help,a > > gives: > > A STRUCT = -> Array[1] > > == > > I have been looking for it ever since using numpy. It would make my life > so much easier. > > > Thanks, Siegfried > > > -- > The University of Edinburgh is a charitable body, registered in > Scotland, with registration number SC005336. > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From aronne.merrelli at gmail.com Mon Oct 7 14:38:53 2013 From: aronne.merrelli at gmail.com (Aronne Merrelli) Date: Mon, 7 Oct 2013 13:38:53 -0500 Subject: [Numpy-discussion] Equivalent to IDL's help function In-Reply-To: <765BF3F2-7FE1-414E-A134-F363D841D0DC@staffmail.ed.ac.uk> References: <765BF3F2-7FE1-414E-A134-F363D841D0DC@staffmail.ed.ac.uk> Message-ID: There isn't anything quite the same. (I think what you are really asking for is a way to print the essential info about one variable name, at least that is how I would use the IDL "help" procedure). 
In IPython, I use the whos magic to do something similar, although it just prints this info for all variables or all variables of one class, rather than just one variable. I do not think there is a way to do it for just one variable. Here are some examples - you can see this works quite well but it will become unwieldy if your interactive namespace becomes large: In [1]: x = 1; y = 2; z = 3.3; d = {'one':1, 'two':2, 'three':3} In [2]: whos Variable Type Data/Info ----------------------------- d dict n=3 x int 1 y int 2 z float 3.3 In [3]: whos dict Variable Type Data/Info ---------------------------- d dict n=3 In [4]: xa = np.arange(111); ya = np.ones((22,4)) In [5]: whos ndarray Variable Type Data/Info ------------------------------- xa ndarray 111: 111 elems, type `int64`, 888 bytes ya ndarray 22x4: 88 elems, type `float64`, 704 bytes On Mon, Oct 7, 2013 at 12:15 PM, Siegfried Gonzi wrote: > Hi all > > What is the equivalent to IDL its help function, e.g. > > == > IDL> a = make_array(23,23,) > > IDL> help,a > > will result in: > > A FLOAT = Array[23, 23] > > or > > IDL> a = create_struct('idl',23) > > IDL> help,a > > gives: > > A STRUCT = -> Array[1] > > == > > I have been looking for it ever since using numpy. It would make my life > so much easier. > > > Thanks, Siegfried > > > -- > The University of Edinburgh is a charitable body, registered in > Scotland, with registration number SC005336. > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From tmp50 at ukr.net Mon Oct 7 16:22:17 2013 From: tmp50 at ukr.net (Dmitrey) Date: Mon, 07 Oct 2013 23:22:17 +0300 Subject: [Numpy-discussion] [ANN] MATLAB ODE solvers - now available in Python (Dmitrey) In-Reply-To: References: Message-ID: <1381177204.440489225.k74gn9x8@frv43.ukr.net> FYI scipy ODE solvers vode, dopri5, dop853 also have been connected to OpenOpt, possibly with automatic differentiation by FuncDesigner? (dopri5 and dop853 don't use derivatives although). ---------------------- Regards, D. http://openopt.org/Dmitrey --- ???????? ????????? --- ?? ????: "David Goldsmith" < d.l.goldsmith at gmail.com > ????: 7 ??????? 2013, 07:16:33 On Sun, Oct 6, 2013 at 10:00 AM, < numpy-discussion-request at scipy.org > wrote: Message: 2 Date: Sat, 05 Oct 2013 21:36:48 +0300 From: Dmitrey < tmp50 at ukr.net > Subject: Re: [Numpy-discussion] [ANN] MATLAB ODE solvers - now ? ? ? ? available in ? ?Python To: Discussion of Numerical Python < numpy-discussion at scipy.org > Cc: numpy-discussion at scipy.org Message-ID: < 1380997576.559804301.aoynahlj at frv43.ukr.net > Content-Type: text/plain; charset="utf-8" ? Seems like using the MATLAB solvers with MCR requires my wrappers containing in several files to be compiled with MATLAB Compiler before. I have no license for MATLAB thus I may have problems if I'll make it done and will spread it with OpenOpt suite code, also, binary files are incompatible with BSD license. Darn, knew it was too good to be true. ? On the other hand, IIRC a little bit obsolete MATLAB versions (I don't think difference is essential) have more liberal licenses. As for MATLAB solvers examples, I have already mentioned them in the mail list, you could see them in http://openopt.org/ODE (just replace solver name from scipy_lsoda to ode23s or any other), http://openopt.org/NLP , http://openopt.org/SNLE Oooops, so sorry. :-o? DG? 
---------------------- Regards, D. http://openopt.org/Dmitrey -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mail.scipy.org/pipermail/numpy-discussion/attachments/20131005/dd6638db/attachment-0001.html ------------------------------ _______________________________________________ NumPy-Discussion mailing list NumPy-Discussion at scipy.org http://mail.scipy.org/mailman/listinfo/numpy-discussion End of NumPy-Discussion Digest, Vol 85, Issue 17 ************************************************ -- >From "A Letter From The Future" in "Peak Everything" by Richard Heinberg: "By the time I was an older teenager, a certain...attitude was developing among the young people...a feeling of utter contempt for anyone over a certain age--maybe 30 or 40.? The adults had consumed so many resources, and now there were none left for their own children...when those adults were younger, they [were] just doing what everybody else was doing...they figured it was normal to cut down ancient forests for...phone books, pump every last gallon of oil to power their SUV's...[but] for...my generation all that was just a dim memory...We [grew up] living in darkness, with shortages of food and water, with riots in the streets, with people begging on street corners...for us, the adults were the enemy." Want to really understand what's really going on?? Read "Peak Everything." _______________________________________________ NumPy-Discussion mailing list NumPy-Discussion at scipy.org http://mail.scipy.org/mailman/listinfo/numpy-discussion -------------- next part -------------- An HTML attachment was scrubbed... URL: From sgonzi at staffmail.ed.ac.uk Mon Oct 7 16:40:08 2013 From: sgonzi at staffmail.ed.ac.uk (Siegfried Gonzi) Date: Mon, 7 Oct 2013 21:40:08 +0100 Subject: [Numpy-discussion] NumPy-Discussion Digest, Vol 85, Issue 19 In-Reply-To: References: Message-ID: <42BAB22A-EEE7-4A6D-A9B8-85D4A2DCB9F0@staffmail.ed.ac.uk> On 7 Oct 2013, at 21:16, numpy-discussion-request at scipy.org wrote: > > > Message: 5 > Date: Mon, 7 Oct 2013 13:38:53 -0500 > From: Aronne Merrelli > Subject: Re: [Numpy-discussion] Equivalent to IDL's help function > > > There isn't anything quite the same. (I think what you are really asking > for is a way to print the essential info about one variable name, at least > that is how I would use the IDL "help" procedure). In IPython, I use the > whos magic to do something similar, although it just prints this info for > all variables or all variables of one class, rather than just one variable. > I do not think there is a way to do it for just one variable. > > Here are some examples - you can see this works quite well but it will > become unwieldy if your interactive namespace becomes large: > > In [1]: x = 1; y = 2; z = 3.3; d = {'one':1, 'two':2, 'three':3} > > In [2]: whos > Variable Type Data/Info > ----------------------------- > d dict n=3 > x int 1 > y int 2 > z float 3.3 > > In [3]: whos dict > Variable Type Data/Info > ---------------------------- > d dict n=3 > > In [4]: xa = np.arange(111); ya = np.ones((22,4)) > > In [5]: whos ndarray > Variable Type Data/Info > ------------------------------- > xa ndarray 111: 111 elems, type `int64`, 888 bytes > ya ndarray 22x4: 88 elems, type `float64`, 704 bytes > > Hi [I hope I am not screwing up the digest reply function here]. I am after a "whos" which would work in a script. It is not very often that I develop code at the command line. 
I am definitely not one of the best programmers out there but I used "help" a lot in my IDL scripts and code. Our research group is migrating away from IDL towards Python. I think Python's help is not the same than IDL's help. I know copying things from other languages is not always a good idea but one cannot argue that IDL's help comes in rather handy while developing and testing code. > > > On Mon, Oct 7, 2013 at 12:15 PM, Siegfried Gonzi > wrote: > >> Hi all >> >> What is the equivalent to IDL its help function, e.g. >> >> == >> IDL> a = make_array(23,23,) >> >> IDL> help,a >> >> will result in: >> >> A FLOAT = Array[23, 23] >> >> or >> >> IDL> a = create_struct('idl',23) >> >> IDL> help,a >> >> gives: >> >> A STRUCT = -> Array[1] >> >> == >> >> I have been looking for it ever since using numpy. It would make my life >> so much easier. >> >> >> Thanks, Siegfried >> >> >> -- >> The University of Edinburgh is a charitable body, registered in >> Scotland, with registration number SC005336. >> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion >> > -------------- next part -------------- > An HTML attachment was scrubbed... > URL: http://mail.scipy.org/pipermail/numpy-discussion/attachments/20131007/cccb9996/attachment-0001.html > > ------------------------------ > > Message: 6 > Date: Mon, 07 Oct 2013 23:22:17 +0300 > From: Dmitrey > Subject: Re: [Numpy-discussion] [ANN] MATLAB ODE solvers - now > available in Python (Dmitrey) > To: Discussion of Numerical Python > Cc: numpy-discussion at scipy.org > Message-ID: <1381177204.440489225.k74gn9x8 at frv43.ukr.net> > Content-Type: text/plain; charset="utf-8" > > FYI scipy ODE solvers vode, dopri5, dop853 also have been connected to OpenOpt, possibly with automatic differentiation by FuncDesigner? (dopri5 and dop853 don't use derivatives although). > > ---------------------- > Regards, D. http://openopt.org/Dmitrey > > --- ???????? ????????? --- > ?? ????: "David Goldsmith" < d.l.goldsmith at gmail.com > > ????: 7 ??????? 2013, 07:16:33 > > On Sun, Oct 6, 2013 at 10:00 AM, < numpy-discussion-request at scipy.org > wrote: > Message: 2 > Date: Sat, 05 Oct 2013 21:36:48 +0300 > From: Dmitrey < tmp50 at ukr.net > > Subject: Re: [Numpy-discussion] [ANN] MATLAB ODE solvers - now > ? ? ? ? available in ? ?Python > To: Discussion of Numerical Python < numpy-discussion at scipy.org > > Cc: numpy-discussion at scipy.org > Message-ID: < 1380997576.559804301.aoynahlj at frv43.ukr.net > > Content-Type: text/plain; charset="utf-8" > ? Seems like using the MATLAB solvers with MCR requires my wrappers containing in several files to be compiled with MATLAB Compiler before. I have no license for MATLAB thus I may have problems if I'll make it done and will spread it with OpenOpt suite code, also, binary files are incompatible with BSD license. > > Darn, knew it was too good to be true. ? On the other hand, IIRC a little bit obsolete MATLAB versions (I don't think difference is essential) have more liberal licenses. > As for MATLAB solvers examples, I have already mentioned them in the mail list, you could see them in http://openopt.org/ODE (just replace solver name from scipy_lsoda to ode23s or any other), http://openopt.org/NLP , http://openopt.org/SNLE > Oooops, so sorry. :-o? > DG? > ---------------------- > Regards, D. http://openopt.org/Dmitrey > -------------- next part -------------- > An HTML attachment was scrubbed... 
> URL: http://mail.scipy.org/pipermail/numpy-discussion/attachments/20131005/dd6638db/attachment-0001.html > > ------------------------------ > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > > End of NumPy-Discussion Digest, Vol 85, Issue 17 > ************************************************ > > > > -- >> From "A Letter From The Future" in "Peak Everything" by Richard Heinberg: > > "By the time I was an older teenager, a certain...attitude was developing among the young people...a feeling of utter contempt for anyone over a certain age--maybe 30 or 40.? The adults had consumed so many resources, and now there were none left for their own children...when those adults were younger, they [were] just doing what everybody else was doing...they figured it was normal to cut down ancient forests for...phone books, pump every last gallon of oil to power their SUV's...[but] for...my generation all that was just a dim memory...We [grew up] living in darkness, with shortages of food and water, with riots in the streets, with people begging on street corners...for us, the adults were the enemy." > > Want to really understand what's really going on?? Read "Peak Everything." > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > > -------------- next part -------------- > An HTML attachment was scrubbed... > URL: http://mail.scipy.org/pipermail/numpy-discussion/attachments/20131007/20a4c824/attachment.html > > ------------------------------ > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > > End of NumPy-Discussion Digest, Vol 85, Issue 19 > ************************************************ > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- An embedded and charset-unspecified text was scrubbed... Name: not available URL: From njs at pobox.com Mon Oct 7 16:54:35 2013 From: njs at pobox.com (Nathaniel Smith) Date: Mon, 7 Oct 2013 21:54:35 +0100 Subject: [Numpy-discussion] NumPy-Discussion Digest, Vol 85, Issue 19 In-Reply-To: <42BAB22A-EEE7-4A6D-A9B8-85D4A2DCB9F0@staffmail.ed.ac.uk> References: <42BAB22A-EEE7-4A6D-A9B8-85D4A2DCB9F0@staffmail.ed.ac.uk> Message-ID: On Mon, Oct 7, 2013 at 9:40 PM, Siegfried Gonzi wrote: > [I hope I am not screwing up the digest reply function here]. [You did, FYI. It happens...] > I am after a "whos" which would work in a script. It is not very often that > I develop code at the command line. > > I am definitely not one of the best programmers out there but I used "help" > a lot in my IDL scripts and code. Our research group is migrating away from > IDL towards Python. > > I think Python's help is not the same than IDL's help. I know copying things > from other languages is not always a good idea but one cannot argue that > IDL's help comes in rather handy while developing and testing code. 
>From a script try: print(repr(obj)) or print(type(obj)) or if you always want just a 1 line summary: def idl_help(obj): if isinstance(obj, np.ndarray): print("" % (obj.shape, obj.dtype)) else: print(type(obj) idl_help(obj) (Customize to suit) -n From Bernhard.Spinnler at gmx.net Mon Oct 7 17:09:41 2013 From: Bernhard.Spinnler at gmx.net (Bernhard Spinnler) Date: Mon, 7 Oct 2013 23:09:41 +0200 Subject: [Numpy-discussion] Bug in numpy.correlate documentation Message-ID: <847A3113-2EF8-4AE6-84DC-95AB2D9EFEDB@gmx.net> The numpy.correlate documentation says: correlate(a, v) = z[k] = sum_n a[n] * conj(v[n+k]) In [1]: a = [1, 2] In [2]: v = [2, 1j] In [3]: z = correlate(a, v, 'full') In [4]: z Out[4]: array([ 0.-1.j, 2.-2.j, 4.+0.j]) However, according to the documentation, z should be z[-1] = a[1] * conj(v[0]) = 4.+0.j z[0] = a[0] * conj(v[0]) + a[1] * conj(v[1]) = 2.-2.j z[1] = a[0] * conj(v[1]) = 0.-1.j which is the time reversed version of what correlate() calculates. IMHO, the correlate() code is correct. The correct formula in the docs (which is also the correlation formula in standard text books) should be z[k] = sum_n a[n+k] * conj(v[n]) Cheers, Bernhard From fernando.j at inbox.com Mon Oct 7 19:27:55 2013 From: fernando.j at inbox.com (john fernando) Date: Mon, 7 Oct 2013 15:27:55 -0800 Subject: [Numpy-discussion] switching from Numeric to numpy Message-ID: <188EE056906.00000B14fernando.j@inbox.com> I have the following C code which is an extension to my python code. The python and C code use #include what is the equivalent I can use in numpy that causes the minimum code change? I did look through the old messages but didn't really find the answer-any help to a novice just starting out in numpy/python/C is much appreciated. ____________________________________________________________ FREE ONLINE PHOTOSHARING - Share your photos online with your friends and family! Visit http://www.inbox.com/photosharing to find out more! From charlesr.harris at gmail.com Mon Oct 7 20:04:39 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Mon, 7 Oct 2013 18:04:39 -0600 Subject: [Numpy-discussion] switching from Numeric to numpy In-Reply-To: <188EE056906.00000B14fernando.j@inbox.com> References: <188EE056906.00000B14fernando.j@inbox.com> Message-ID: On Mon, Oct 7, 2013 at 5:27 PM, john fernando wrote: > I have the following C code which is an extension to my python code. > > The python and C code use > #include > > what is the equivalent I can use in numpy that causes the minimum code > change? > > I did look through the old messages but didn't really find the answer-any > help to a novice just starting out in numpy/python/C is much appreciated. > I haven't done any porting myself, but you might want to look at numpy/oldnumeric to see how compatibility is done. That module is gone in numpy master, but is included in earlier versions. As to the include, for numpy it is `, but a lot of things have been added/changed/deprecated. There are also two scripts in oldnumeric that help translating numeric to numpy: alter_code1.py and alter_code2.py. I haven't used either. There are probably folks on the list who can give you more specific advice. Chuck -------------- next part -------------- An HTML attachment was scrubbed... 
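(On the Numeric-to-numpy include question above: the backtick-quoted header name in Chuck's reply was eaten by the HTML scrubber; it is presumably numpy/arrayobject.h, with numpy/oldnumeric.h providing the compatibility layer, but take that as my assumption rather than part of the original mail. Below is a hedged sketch of what a small ported extension tends to look like; sum_ext and sum1d are made-up names for illustration, Python 2 style to match the era.)

/* sum_ext.c -- hypothetical minimal numpy extension */
#include <Python.h>
#include <numpy/arrayobject.h>     /* instead of Numeric's arrayobject.h */

static PyObject *
sum1d(PyObject *self, PyObject *args)
{
    PyObject *input;
    PyArrayObject *arr;
    double total = 0.0;
    npy_intp i, n;

    if (!PyArg_ParseTuple(args, "O", &input))
        return NULL;
    /* PyArray_ContiguousFromObject is carried over from the Numeric API */
    arr = (PyArrayObject *)PyArray_ContiguousFromObject(input, NPY_DOUBLE, 1, 1);
    if (arr == NULL)
        return NULL;
    n = PyArray_DIM(arr, 0);
    for (i = 0; i < n; i++)
        total += ((double *)PyArray_DATA(arr))[i];
    Py_DECREF(arr);
    return PyFloat_FromDouble(total);
}

static PyMethodDef methods[] = {
    {"sum1d", sum1d, METH_VARARGS, "Sum a 1-d double array."},
    {NULL, NULL, 0, NULL}
};

PyMODINIT_FUNC
initsum_ext(void)
{
    Py_InitModule("sum_ext", methods);
    import_array();                /* required before any numpy C API call */
}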
URL: From sunk.cs at gmail.com Tue Oct 8 04:06:25 2013 From: sunk.cs at gmail.com (Ke Sun) Date: Tue, 8 Oct 2013 10:06:25 +0200 Subject: [Numpy-discussion] Efficient square distance computation Message-ID: <20131008080625.GA26473@probe.unige.ch> Dear all, I have written the following function to compute the square distances of a large matrix (each sample a row). It compute row by row and print the overall progress. The progress output is important and I didn't use matrix multiplication. I give as input a 70,000x800 matrix. The output should be a 70,000x70,000 matrix. The program runs really slow (16 hours for 1/3 progress). And it eats 36G memory (fortunately I have enough). Could you give some insights on how to modify the code to be efficient and to eat less memory? thanks, Ke Sun def dist2_large( data ): import time if data.ndim != 2: raise RuntimeError( "data should be a matrix" ) N,D = data.shape print 'using the sample-wise implementation' print '%d samples, %d dimensions' % (N,D) start_t = time.time() d2 = np.zeros( [N,N] ) for i in range( N ): print "\r%5d/%d" % (i+1, N), for j in range( N ): d2[i,j] = ((data[i] - data[j])**2).sum() total_t = time.time() - start_t hours = (total_t / 3600) minutes = (total_t % 3600) / 60 print "\nfinished in %2dh%2dm" % (hours, minutes) return d2 From matthew.brett at gmail.com Tue Oct 8 04:49:14 2013 From: matthew.brett at gmail.com (Matthew Brett) Date: Tue, 8 Oct 2013 01:49:14 -0700 Subject: [Numpy-discussion] Efficient square distance computation In-Reply-To: <20131008080625.GA26473@probe.unige.ch> References: <20131008080625.GA26473@probe.unige.ch> Message-ID: Hi, On Tue, Oct 8, 2013 at 1:06 AM, Ke Sun wrote: > Dear all, > > I have written the following function to compute the square distances of a large > matrix (each sample a row). It compute row by row and print the overall progress. > The progress output is important and I didn't use matrix multiplication. > > I give as input a 70,000x800 matrix. The output should be a 70,000x70,000 > matrix. The program runs really slow (16 hours for 1/3 progress). And it eats > 36G memory (fortunately I have enough). That is very slow. As a matter of interest - why didn't you use matrix multiplication? On a machine I had access to it took about 20 minutes. You've got a 70000 by 70000 element output matrix so I think that's 37G already (if the matrix is double precision float). > Could you give some insights on how to modify the code to be efficient and > to eat less memory? You could try using Cython - but I'm guessing that the BLAS routines in numpy will already do this the most efficient way. Best, Matthew From jtaylor.debian at googlemail.com Tue Oct 8 05:01:15 2013 From: jtaylor.debian at googlemail.com (Julian Taylor) Date: Tue, 8 Oct 2013 11:01:15 +0200 Subject: [Numpy-discussion] Efficient square distance computation In-Reply-To: <20131008080625.GA26473@probe.unige.ch> References: <20131008080625.GA26473@probe.unige.ch> Message-ID: your computation is symmetric so you only need to compute the upper or lower triangle which will save both memory and time. On Tue, Oct 8, 2013 at 10:06 AM, Ke Sun wrote: > Dear all, > > I have written the following function to compute the square distances of a > large > matrix (each sample a row). It compute row by row and print the overall > progress. > The progress output is important and I didn't use matrix multiplication. > > I give as input a 70,000x800 matrix. The output should be a 70,000x70,000 > matrix. The program runs really slow (16 hours for 1/3 progress). 
And it > eats > 36G memory (fortunately I have enough). > > Could you give some insights on how to modify the code to be efficient and > to eat less memory? > > thanks, > Ke Sun > > def dist2_large( data ): > import time > if data.ndim != 2: raise RuntimeError( "data should be a matrix" ) > N,D = data.shape > > print 'using the sample-wise implementation' > print '%d samples, %d dimensions' % (N,D) > > start_t = time.time() > d2 = np.zeros( [N,N] ) > for i in range( N ): > print "\r%5d/%d" % (i+1, N), > for j in range( N ): > d2[i,j] = ((data[i] - data[j])**2).sum() > > total_t = time.time() - start_t > hours = (total_t / 3600) > minutes = (total_t % 3600) / 60 > print "\nfinished in %2dh%2dm" % (hours, minutes) > > return d2 > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From heng at cantab.net Tue Oct 8 05:55:58 2013 From: heng at cantab.net (Henry Gomersall) Date: Tue, 08 Oct 2013 10:55:58 +0100 Subject: [Numpy-discussion] Efficient square distance computation In-Reply-To: <20131008080625.GA26473@probe.unige.ch> References: <20131008080625.GA26473@probe.unige.ch> Message-ID: <5253D6AE.1090206@cantab.net> On 08/10/13 09:06, Ke Sun wrote: > I give as input a 70,000x800 matrix. The output should be a 70,000x70,000 > matrix. The program runs really slow (16 hours for 1/3 progress). And it eats > 36G memory (fortunately I have enough). At this stage I'd be asking myself what I'm trying to achieve and why I'm trying to explicitly calculate every element of a 70,000x70,000 matrix and how I might do it otherwise. Most optimisations when it comes to things like this make use of the structure of the matrix and knowledge of the algorithm in which it fits. Cheers, Henry From heng at cantab.net Tue Oct 8 05:56:52 2013 From: heng at cantab.net (Henry Gomersall) Date: Tue, 08 Oct 2013 10:56:52 +0100 Subject: [Numpy-discussion] Efficient square distance computation In-Reply-To: References: <20131008080625.GA26473@probe.unige.ch> Message-ID: <5253D6E4.5040104@cantab.net> On 08/10/13 09:49, Matthew Brett wrote: > On Tue, Oct 8, 2013 at 1:06 AM, Ke Sun wrote: >> >Dear all, >> > >> >I have written the following function to compute the square distances of a large >> >matrix (each sample a row). It compute row by row and print the overall progress. >> >The progress output is important and I didn't use matrix multiplication. >> > >> >I give as input a 70,000x800 matrix. The output should be a 70,000x70,000 >> >matrix. The program runs really slow (16 hours for 1/3 progress). And it eats >> >36G memory (fortunately I have enough). > That is very slow. > > As a matter of interest - why didn't you use matrix multiplication? > On a machine I had access to it took about 20 minutes. Out of interest, how did you do this with matrix multiplication? Henry From jtaylor.debian at googlemail.com Tue Oct 8 07:23:57 2013 From: jtaylor.debian at googlemail.com (Julian Taylor) Date: Tue, 8 Oct 2013 13:23:57 +0200 Subject: [Numpy-discussion] Efficient square distance computation In-Reply-To: <5253D6E4.5040104@cantab.net> References: <20131008080625.GA26473@probe.unige.ch> <5253D6E4.5040104@cantab.net> Message-ID: > Out of interest, how did you do this with matrix multiplication? http://stackoverflow.com/a/4856692 -------------- next part -------------- An HTML attachment was scrubbed... 
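(For readers who do not want to chase the Stack Overflow link above: the trick expands ||x_i - x_j||^2 = ||x_i||^2 + ||x_j||^2 - 2*x_i.x_j, so the whole distance matrix reduces to one matrix product plus broadcasting, and the np.dot call is exactly where an optimized BLAS, as Julian suggests, pays off. A sketch of the idea, my own illustration rather than code from the thread:)

import numpy as np

def squared_distances(X):
    """All pairwise squared Euclidean distances between the rows of X."""
    sq = np.einsum('ij,ij->i', X, X)           # ||x_i||^2 for every row
    d2 = sq[:, None] + sq[None, :] - 2.0 * np.dot(X, X.T)
    np.maximum(d2, 0, out=d2)                  # clip tiny negatives from round-off
    return d2

X = np.random.rand(1000, 800)                  # small stand-in for the 70,000x800 case
D2 = squared_distances(X)
print(D2.shape)                                # (1000, 1000)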
URL: From sunk.cs at gmail.com Tue Oct 8 07:38:28 2013 From: sunk.cs at gmail.com (Ke Sun) Date: Tue, 8 Oct 2013 13:38:28 +0200 Subject: [Numpy-discussion] Efficient square distance computation In-Reply-To: References: <20131008080625.GA26473@probe.unige.ch> Message-ID: <20131008113828.GA31484@probe.unige.ch> On Tue, Oct 08, 2013 at 01:49:14AM -0700, Matthew Brett wrote: > Hi, > > On Tue, Oct 8, 2013 at 1:06 AM, Ke Sun wrote: > > Dear all, > > > > I have written the following function to compute the square distances of a large > > matrix (each sample a row). It compute row by row and print the overall progress. > > The progress output is important and I didn't use matrix multiplication. > > > > I give as input a 70,000x800 matrix. The output should be a 70,000x70,000 > > matrix. The program runs really slow (16 hours for 1/3 progress). And it eats > > 36G memory (fortunately I have enough). > > That is very slow. > > As a matter of interest - why didn't you use matrix multiplication? Because it will cost hours and I want to see the progress and know how far it goes. Another concern is to save memory and compute one sample at a time. > On a machine I had access to it took about 20 minutes. How? I am using matrix multiplication (the same code as http://stackoverflow.com/a/4856692) and it runs for around 18 hours. Best, Ke Sun > > You've got a 70000 by 70000 element output matrix so I think that's > 37G already (if the matrix is double precision float). > > > Could you give some insights on how to modify the code to be efficient and > > to eat less memory? > > You could try using Cython - but I'm guessing that the BLAS routines > in numpy will already do this the most efficient way. From valentin at haenel.co Tue Oct 8 08:23:12 2013 From: valentin at haenel.co (Valentin Haenel) Date: Tue, 8 Oct 2013 14:23:12 +0200 Subject: [Numpy-discussion] Question about typenum Message-ID: <20131008122312.GA13963@kudu.in-berlin.de> Hi, I have a quick question about typenum. Certain functions, like `PyArray_SimpleNewFromData` `PyArray_SimpleNew` take a typeenum argument. Is there any way to go from typeenum to something that can be passed to the dtype constructor, like mapping 12 -> ' References: <20131008080625.GA26473@probe.unige.ch> <20131008113828.GA31484@probe.unige.ch> Message-ID: On Tue, Oct 8, 2013 at 1:38 PM, Ke Sun wrote: > > > On a machine I had access to it took about 20 minutes. > How? I am using matrix multiplication (the same code as > http://stackoverflow.com/a/4856692) and it runs for around 18 hours. > > make sure you are using an optimized BLAS library. e.g. with openblas I could also do this size in about 100 seconds wall time (20 minutes cpu time). -------------- next part -------------- An HTML attachment was scrubbed... URL: From bahtiyor_zohidov at mail.ru Tue Oct 8 09:15:54 2013 From: bahtiyor_zohidov at mail.ru (=?UTF-8?B?SGFwcHltYW4=?=) Date: Tue, 08 Oct 2013 17:15:54 +0400 Subject: [Numpy-discussion] =?utf-8?q?Python_function_for_line_intersectio?= =?utf-8?b?bj8/?= Message-ID: <1381238154.58791984@f77.i.mail.ru> Hello, I am having trouble with solving line intersection problem. I would solve it using standard functions which may already exist in Python library. Let me explain it below: I have sets of lines A(x,y) and B(x,y) and one map as a matrix form, also with its resolution dx, dy which can be changed. I wanted to calculate all intersected points of line with the map depending on dx, dy value. 
I already did it, but two problems: 1) When some coordinate value (x or y) is zero, the program returns a wrong answer. For example, a distance is reported as zero although it is not zero in fact. 2) How can I optimize my code (using the Python standard library), if possible. Or is there a standard Python library for that purpose???

import numpy as np

def intrsctd_pnts(map, dx, dy, A, B, line_number):
    height, width = np.shape(map)
    # slope of each line
    m = (B[:, 1] - A[:, 1]) / (B[:, 0] - A[:, 0])
    b = A[:, 1] - m * A[:, 0]
    lx_min = np.minimum(A[:,0], B[:,0])
    lx_max = np.maximum(A[:,0], B[:,0])
    ly_min = np.minimum(A[:,1], B[:,1])
    ly_max = np.maximum(A[:,1], B[:,1])
    lx_floor = np.ceil(lx_min /dx)*dx
    lx_ceil = np.floor(lx_max /dx)*dx
    ly_floor = np.ceil(ly_min /dy)*dy
    ly_ceil = np.floor(ly_max /dy)*dy
    distance = []
    i_pixel = []
    intersect_point = []
    for i in range(line_number):
        AB = []
        # calculate all intersected x coordinates
        x = np.arange(lx_floor[i], lx_ceil[i]+dx, dx)
        y = m[i] * x + b[i]
        AB = zip(x, y)
        # calculate all intersected y coordinates
        y = np.arange(ly_floor[i], ly_ceil[i]+dy, dy)
        x = (y - b[i]) / m[i]
        AB.extend(zip(x, y))
        AB.append((A[i,0], A[i,1]))
        AB.append((B[i,0], B[i,1]))
        AB.sort()
        AB = np.asarray(AB)
        intersect_point.append(AB)
        # compute distance between two points
        dist_x = np.diff(AB[:,0])
        dist_y = np.diff(AB[:,1])
        distance.append( np.sqrt((dist_x)**2 + (dist_y)**2) )
        # FIND pixel value where line intersects that pixel
        col = (np.floor( min(AB[:,0]) / dx)).astype(int)
        row = (np.floor( min(AB[:,1]) / dy)).astype(int)
        # FIND pixel to which this lines belong
        i_pixel.append(col + width*row)
    return distance, i_pixel, intersect_point

def main():
    # INPUTS:
    map = np.array( [ [ 4.5,  4.5,  4.5,  3.4],
                      [ 3.9,  4.5,  5.2,  4.5],
                      [ 3.9,  3.9,  2.5,  2.2],
                      [ 3.4,  3.9,  2.9,  2.2],
                      [ 2.5,  3.4,  2.2,  1.4],
                      [ 2.5,  2.2,  2.5,  1.2] ] )
    dx = dy = 0.5
    A = np.array( [ [ 0. , 0. ],
                    [ 1.9 , 0.] ])
    B = np.array( [ [ 1.1 , 2.3],
                    [ 1.3 , 1.3] ])
    line_number = len(A)  # or len(B)
    distance, pixels, int_points = intrsctd_pnts(map, dx, dy, A, B, line_number)
    ##  distance should not be zero

if __name__ == '__main__':
    main()
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
From jaime.frio at gmail.com Tue Oct 8 09:42:04 2013
From: jaime.frio at gmail.com (=?ISO-8859-1?Q?Jaime_Fern=E1ndez_del_R=EDo?=)
Date: Tue, 8 Oct 2013 06:42:04 -0700
Subject: Re: [Numpy-discussion] Efficient square distance computation
In-Reply-To: <20131008113828.GA31484@probe.unige.ch>
References: <20131008080625.GA26473@probe.unige.ch> <20131008113828.GA31484@probe.unige.ch>
Message-ID: 

On Tue, Oct 8, 2013 at 4:38 AM, Ke Sun wrote:
> On Tue, Oct 08, 2013 at 01:49:14AM -0700, Matthew Brett wrote:
> > Hi,
> >
> > On Tue, Oct 8, 2013 at 1:06 AM, Ke Sun wrote:
> > > Dear all,
> > >
> > > I have written the following function to compute the square distances
> of a large
> > > matrix (each sample a row). It compute row by row and print the
> overall progress.
> > > The progress output is important and I didn't use matrix
> multiplication.
> > > > > > I give as input a 70,000x800 matrix. The output should be a > 70,000x70,000 > > > matrix. The program runs really slow (16 hours for 1/3 progress). And > it eats > > > 36G memory (fortunately I have enough). > > > > That is very slow. > > > > As a matter of interest - why didn't you use matrix multiplication? > Because it will cost hours and I want to see the progress and > know how far it goes. Another concern is to save memory and > compute one sample at a time. > Another option you may want to consider is to do your calculation in chunks, not one item at a time, e.g.: rows, cols = 70000, 800 data = np.random.rand(rows, cols) chunks = 100 chunk_len = rows // chunks out = np.empty((rows, rows)) for j in xrange(0, rows, chunk_len): chunk_j = data[j: j + chunk_len] for k in xrange(j, rows, chunk_len): chunk_k = data[k: k + chunk_len] out[j: j + chunk_len, k: k + chunk_len] = np.dot(chunk_j, chunk_k.T) if j != k: out[k: k + chunk_len, j: j + chunk_len] = out[j: j + chunk_len, k: k + chunk_len].T q = np.diag(out) out *= -2 out += q out += q[:, np.newaxis] This way you can still gauge progress, use mostly the fast, efficient vectorized approach and probably offset the (relatively small amount of) Python looping by not calculating most of the symmetrical items. Jaime -------------- next part -------------- An HTML attachment was scrubbed... URL: From rhattersley at gmail.com Tue Oct 8 14:51:22 2013 From: rhattersley at gmail.com (Richard Hattersley) Date: Tue, 8 Oct 2013 19:51:22 +0100 Subject: [Numpy-discussion] Question about typenum In-Reply-To: <20131008122312.GA13963@kudu.in-berlin.de> References: <20131008122312.GA13963@kudu.in-berlin.de> Message-ID: Hi Valentin, On 8 October 2013 13:23, Valentin Haenel wrote: > Certain functions, like > `PyArray_SimpleNewFromData` `PyArray_SimpleNew` take a typeenum > Is there any way to go from typeenum to something that can be > passed to the dtype constructor, like mapping 12 -> ' If you just want the corresponding dtype instance (aka PyArray_Descr) then `PyArray_DescrFromType` should be what you're after. But if you really need the ' From valentin at haenel.co Tue Oct 8 14:56:33 2013 From: valentin at haenel.co (Valentin Haenel) Date: Tue, 8 Oct 2013 20:56:33 +0200 Subject: [Numpy-discussion] Question about typenum In-Reply-To: References: <20131008122312.GA13963@kudu.in-berlin.de> Message-ID: <20131008185633.GC13963@kudu.in-berlin.de> Hi Richard, * Richard Hattersley [2013-10-08]: > On 8 October 2013 13:23, Valentin Haenel wrote: > > > Certain functions, like > > `PyArray_SimpleNewFromData` `PyArray_SimpleNew` take a typeenum > > Is there any way to go from typeenum to something that can be > > passed to the dtype constructor, like mapping 12 -> ' > > > If you just want the corresponding dtype instance (aka PyArray_Descr) then > `PyArray_DescrFromType` should be what you're after. > > But if you really need the ' PyArray_Descr and then use the Python API (e.g. PyObject_GetAttrString) to > request the "str" attribute. Under the hood this attribute is implemented > by `arraydescr_protocol_typestr_get` but that's not part of the public API. Thanks for your answer. I ended up using: PyArray_TypeObjectFromType from cython so: np.dtype(cnp.PyArray_TypeObjectFromType(self.ndtype)).str Maybe i can avoid the np.dtype call, when using PyArray_Descr? 
best, V- From rhattersley at gmail.com Tue Oct 8 15:10:41 2013 From: rhattersley at gmail.com (Richard Hattersley) Date: Tue, 8 Oct 2013 20:10:41 +0100 Subject: [Numpy-discussion] Bug in numpy.correlate documentation In-Reply-To: <847A3113-2EF8-4AE6-84DC-95AB2D9EFEDB@gmx.net> References: <847A3113-2EF8-4AE6-84DC-95AB2D9EFEDB@gmx.net> Message-ID: Hi Bernard, Looks like you're on to something - two other people have raised this discrepancy before: https://github.com/numpy/numpy/issues/2588. Unfortunately, when it comes to resolving the discrepancy one of the previous comments takes the opposite view. Namely, that the docstring is correct and the code is wrong. Do different domains use different conventions here? Are there some references to back up one stance or another? But all else being equal, I'm guessing there'll be far more appetite for updating the documentation than the code. Regards, Richard Hattersley On 7 October 2013 22:09, Bernhard Spinnler wrote: > The numpy.correlate documentation says: > > correlate(a, v) = z[k] = sum_n a[n] * conj(v[n+k]) > > In [1]: a = [1, 2] > > In [2]: v = [2, 1j] > > In [3]: z = correlate(a, v, 'full') > > In [4]: z > Out[4]: array([ 0.-1.j, 2.-2.j, 4.+0.j]) > > However, according to the documentation, z should be > > z[-1] = a[1] * conj(v[0]) = 4.+0.j > z[0] = a[0] * conj(v[0]) + a[1] * conj(v[1]) = 2.-2.j > z[1] = a[0] * conj(v[1]) = 0.-1.j > > which is the time reversed version of what correlate() calculates. > > IMHO, the correlate() code is correct. The correct formula in the docs > (which is also the correlation formula in standard text books) should be > > z[k] = sum_n a[n+k] * conj(v[n]) > > Cheers, > Bernhard > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From rhattersley at gmail.com Tue Oct 8 15:21:25 2013 From: rhattersley at gmail.com (Richard Hattersley) Date: Tue, 8 Oct 2013 20:21:25 +0100 Subject: [Numpy-discussion] Question about typenum In-Reply-To: <20131008185633.GC13963@kudu.in-berlin.de> References: <20131008122312.GA13963@kudu.in-berlin.de> <20131008185633.GC13963@kudu.in-berlin.de> Message-ID: On 8 October 2013 19:56, Valentin Haenel wrote: > I ended up using: PyArray_TypeObjectFromType > from cython so: > > np.dtype(cnp.PyArray_TypeObjectFromType(self.ndtype)).str > > Maybe i can avoid the np.dtype call, when using PyArray_Descr? > In short: yes. `PyArray_TypeObjectFromType` first uses `PyArray_DescrFromType` to figure out the dtype from the type number, and then it returns the corresponding array scalar type. Passing this array scalar type to `np.dtype` gets you back to the dtype that had just been looked up inside TypeObjectFromType. Regards, Richard -------------- next part -------------- An HTML attachment was scrubbed... 
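(To make the answer above concrete, a hedged Cython sketch of the shorter route; this is my own illustration with a made-up function name, built against numpy's include directory in the usual way.)

# typenum_str.pyx
cimport numpy as cnp
cnp.import_array()

def typenum_to_typestr(int typenum):
    """Return the array-interface typestring (e.g. '<f8') for a type number."""
    cdef object d = cnp.PyArray_DescrFromType(typenum)   # already a numpy dtype
    return d.str

# e.g. typenum_to_typestr(np.dtype('float64').num) gives '<f8' on a little-endian build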
URL: From valentin at haenel.co Tue Oct 8 15:36:01 2013 From: valentin at haenel.co (Valentin Haenel) Date: Tue, 8 Oct 2013 21:36:01 +0200 Subject: [Numpy-discussion] Question about typenum In-Reply-To: References: <20131008122312.GA13963@kudu.in-berlin.de> <20131008185633.GC13963@kudu.in-berlin.de> Message-ID: <20131008193601.GD13963@kudu.in-berlin.de> * Richard Hattersley [2013-10-08]: > On 8 October 2013 19:56, Valentin Haenel wrote: > > > I ended up using: PyArray_TypeObjectFromType > > from cython so: > > > > np.dtype(cnp.PyArray_TypeObjectFromType(self.ndtype)).str > > > > Maybe i can avoid the np.dtype call, when using PyArray_Descr? > > > > In short: yes. > > `PyArray_TypeObjectFromType` first uses `PyArray_DescrFromType` to figure > out the dtype from the type number, and then it returns the corresponding > array scalar type. Passing this array scalar type to `np.dtype` gets you > back to the dtype that had just been looked up inside TypeObjectFromType. Excellent! Thank you very much! V- From matthew.brett at gmail.com Tue Oct 8 15:44:11 2013 From: matthew.brett at gmail.com (Matthew Brett) Date: Tue, 8 Oct 2013 12:44:11 -0700 Subject: [Numpy-discussion] Efficient square distance computation In-Reply-To: <20131008113828.GA31484@probe.unige.ch> References: <20131008080625.GA26473@probe.unige.ch> <20131008113828.GA31484@probe.unige.ch> Message-ID: Hi, On Tue, Oct 8, 2013 at 4:38 AM, Ke Sun wrote: > On Tue, Oct 08, 2013 at 01:49:14AM -0700, Matthew Brett wrote: >> Hi, >> >> On Tue, Oct 8, 2013 at 1:06 AM, Ke Sun wrote: >> > Dear all, >> > >> > I have written the following function to compute the square distances of a large >> > matrix (each sample a row). It compute row by row and print the overall progress. >> > The progress output is important and I didn't use matrix multiplication. >> > >> > I give as input a 70,000x800 matrix. The output should be a 70,000x70,000 >> > matrix. The program runs really slow (16 hours for 1/3 progress). And it eats >> > 36G memory (fortunately I have enough). >> >> That is very slow. >> >> As a matter of interest - why didn't you use matrix multiplication? > Because it will cost hours and I want to see the progress and > know how far it goes. Another concern is to save memory and > compute one sample at a time. > >> On a machine I had access to it took about 20 minutes. > How? I am using matrix multiplication (the same code as > http://stackoverflow.com/a/4856692) and it runs for around 18 hours. I wonder if you are running into disk swap - the code there does involve a large temporary array. I believe the appended version of the code is correct, and I think it is also memory efficient. On a fast machine with lots of memory, it ran in about 5 minutes. It's using EPD, which might be using multiple cores for the matrix multiply. Does the code also work for you in reasonable time? Another suggestion I saw which only calculates the unique values (say lower diagonal) is scipy.spatial.distance http://docs.scipy.org/doc/scipy/reference/generated/scipy.spatial.distance.pdist.html#scipy.spatial.distance.pdist To a first pass that seems to be slower than the matrix multiply. Cheers, Matthew import datetime import numpy as np def pdista(X): """Squared pairwise distances between all columns of X.""" B = np.dot(X.T, X) q = np.diag(B)[:, None].copy() # copy necessary? 
B *= -2 B += q B += q.T return B M = 70000 N = 800 A = np.random.normal(size=(M, N)) start = datetime.datetime.now() dists = pdista(A.T) elapsed = datetime.datetime.now() - start print(elapsed) From matthew.brett at gmail.com Tue Oct 8 17:11:36 2013 From: matthew.brett at gmail.com (Matthew Brett) Date: Tue, 8 Oct 2013 14:11:36 -0700 Subject: [Numpy-discussion] Efficient square distance computation In-Reply-To: References: <20131008080625.GA26473@probe.unige.ch> <20131008113828.GA31484@probe.unige.ch> Message-ID: Hi, On Tue, Oct 8, 2013 at 12:44 PM, Matthew Brett wrote: > Hi, > > On Tue, Oct 8, 2013 at 4:38 AM, Ke Sun wrote: >> On Tue, Oct 08, 2013 at 01:49:14AM -0700, Matthew Brett wrote: >>> Hi, >>> >>> On Tue, Oct 8, 2013 at 1:06 AM, Ke Sun wrote: >>> > Dear all, >>> > >>> > I have written the following function to compute the square distances of a large >>> > matrix (each sample a row). It compute row by row and print the overall progress. >>> > The progress output is important and I didn't use matrix multiplication. >>> > >>> > I give as input a 70,000x800 matrix. The output should be a 70,000x70,000 >>> > matrix. The program runs really slow (16 hours for 1/3 progress). And it eats >>> > 36G memory (fortunately I have enough). >>> >>> That is very slow. >>> >>> As a matter of interest - why didn't you use matrix multiplication? >> Because it will cost hours and I want to see the progress and >> know how far it goes. Another concern is to save memory and >> compute one sample at a time. >> >>> On a machine I had access to it took about 20 minutes. >> How? I am using matrix multiplication (the same code as >> http://stackoverflow.com/a/4856692) and it runs for around 18 hours. > > I wonder if you are running into disk swap - the code there does > involve a large temporary array. > > I believe the appended version of the code is correct, and I think it > is also memory efficient. > > On a fast machine with lots of memory, it ran in about 5 minutes. > It's using EPD, which might be using multiple cores for the matrix > multiply. > > Does the code also work for you in reasonable time? > > Another suggestion I saw which only calculates the unique values (say > lower diagonal) is scipy.spatial.distance > > http://docs.scipy.org/doc/scipy/reference/generated/scipy.spatial.distance.pdist.html#scipy.spatial.distance.pdist > > To a first pass that seems to be slower than the matrix multiply. On a machine that should have been about as fast: import scipy.spatial.distance as ssd dists = ssd.pdist(A, 'sqeuclidean') on the same size matrix took about 40 minutes. Cheers, Matthew From Roelf.Schreurs at tibra.com Wed Oct 9 09:46:12 2013 From: Roelf.Schreurs at tibra.com (Roelf Schreurs) Date: Wed, 9 Oct 2013 13:46:12 +0000 Subject: [Numpy-discussion] select column based on another column value Message-ID: <39665C187B885C4296F1D80EC22699D680DD3E7B@LCINF16.tibra.com> Hi I have the following array and want to calculate the average per day. 
Import numpy as np from datetime import date today = date(today.year, today.month, 9) yesterday = date(today.year, today.month, 8) twodaysago = date(today.year, today.month, 7) testdata = np.array([[today, "r", 3.2],[today, "r", 4.3],[today, "r", 5.3],[yesterday, "r", 6.3],[yesterday, "r", 9.3],[yesterday, "r", 1.3],[twodaysago, "r", 3.3],[twodaysago, "r", 1.2],[twodaysago, "r", 4.3]]) which produces array([[datetime.date(2013, 10, 9), 'r', 3.2], [datetime.date(2013, 10, 9), 'r', 4.3], [datetime.date(2013, 10, 9), 'r', 5.3], [datetime.date(2013, 10, 8), 'r', 6.3], [datetime.date(2013, 10, 8), 'r', 9.3], [datetime.date(2013, 10, 8), 'r', 1.3], [datetime.date(2013, 10, 7), 'r', 3.3], [datetime.date(2013, 10, 7), 'r', 1.2], [datetime.date(2013, 10, 7), 'r', 4.3]], dtype=object) And I want the output as (numbers aren't correct below): 2013-10-09, 4.54 2013-10-08, 5.43 2013-10-07, 2.76 But not sure how. Thanks Roelf Tibra Trading Europe Limited is Authorised and Regulated by the FCA (Company No. 06061713) Tibra Global Services Pty Limited London Branch (Company No. BR014370) Tibra Equities Europe Limited (Company No. 07283912) Registered in England & Wales - Level 12, 5 Aldermanbury Square London EC2V 7HR Unless otherwise stated, this communication is neither an offer nor solicitation of an offer to sell or purchase any investment The contents of this email including any attachments are confidential. If you have received this email in error, please advise the sender by return email and delete this email. Any unauthorised use of the contents of the email is prohibited and you must not disseminate, copy or distribute the message or use the information contained in the email or its attachments in any way. The views or opinions expressed are the author's own and may not reflect the views or opinions of Tibra. Tibra does not guarantee the integrity of any emails or attached files. E-mails may be interfered with, may contain computer viruses or other defects. Under no circumstances do we accept liability for any loss or damage which may result from your receipt of this message or any attachments. -------------- next part -------------- An HTML attachment was scrubbed... URL: From charanpal at gmail.com Wed Oct 9 10:04:09 2013 From: charanpal at gmail.com (Charanpal Dhanjal) Date: Wed, 09 Oct 2013 16:04:09 +0200 Subject: [Numpy-discussion] Segfault with QR Decomposition Message-ID: <52556259.7030003@gmail.com> I get a segmentation fault upon running the following: import numpy A = numpy.ones((7000000, 8)) Q, R = numpy.linalg.qr(A) on Python 2.7.3, Linux 64-bit using numpy 1.9.0.dev-ec3603f linked against OpenBLAS. If A is a smaller matrix then the QR decomposition works (for example A has shape (4000000, 8)). I haven't quite narrowed down the exact threshold where the crash occurs, however I know that the above A is 448 MB (Q and R are no bigger), and the machine in question has 32GB of RAM. I also tested scipy.linalg.qr (version 0.14.0.dev-ced994c) with the same results. I don't get the same problem on my laptop which is Python 2.7.3, Linux 64-bit but with numpy 1.8.0rc1 linked to OpenBLAS. Both machines have OpenBLAS 0.2.6. Does anyone have some insight into why this problem is occuring? 
Thanks very much for any help, Charanpal From gexarcha1 at gmail.com Wed Oct 9 10:22:12 2013 From: gexarcha1 at gmail.com (Georgios Exarchakis) Date: Wed, 09 Oct 2013 07:22:12 -0700 Subject: [Numpy-discussion] select column based on another column value In-Reply-To: <39665C187B885C4296F1D80EC22699D680DD3E7B@LCINF16.tibra.com> References: <39665C187B885C4296F1D80EC22699D680DD3E7B@LCINF16.tibra.com> Message-ID: <52556694.6070007@gmail.com> I think you need this >>> testdata[testdata[:,0]==today][:,2].mean() 4.266666666666667 >>> testdata[testdata[:,0]==yesterday][:,2].mean() 5.633333333333334 >>> testdata[testdata[:,0]==twodaysago][:,2].mean() 2.9333333333333336 On 10/09/2013 06:46 AM, Roelf Schreurs wrote: > > Hi > > I have the following array and want to calculate the average per day. > > Import numpy as np > > from datetime import date > > today = date(today.year, today.month, 9) > > yesterday = date(today.year, today.month, 8) > > twodaysago = date(today.year, today.month, 7) > > testdata = np.array([[today, "r", 3.2],[today, "r", 4.3],[today, "r", > 5.3],[yesterday, "r", 6.3],[yesterday, "r", 9.3],[yesterday, "r", > 1.3],[twodaysago, "r", 3.3],[twodaysago, "r", 1.2],[twodaysago, "r", > 4.3]]) > > which produces > > array([[datetime.date(2013, 10, 9), 'r', 3.2], > > [datetime.date(2013, 10, 9), 'r', 4.3], > > [datetime.date(2013, 10, 9), 'r', 5.3], > > [datetime.date(2013, 10, 8), 'r', 6.3], > > [datetime.date(2013, 10, 8), 'r', 9.3], > > [datetime.date(2013, 10, 8), 'r', 1.3], > > [datetime.date(2013, 10, 7), 'r', 3.3], > > [datetime.date(2013, 10, 7), 'r', 1.2], > > [datetime.date(2013, 10, 7), 'r', 4.3]], dtype=object) > > And I want the output as (numbers aren't correct below): > > 2013-10-09, 4.54 > > 2013-10-08, 5.43 > > 2013-10-07, 2.76 > > But not sure how. > > Thanks > > *Roelf * > > > > Tibra Trading Europe Limited is Authorised and Regulated by the FCA > (Company No. 06061713) > Tibra Global Services Pty Limited London Branch (Company No. BR014370) > Tibra Equities Europe Limited (Company No. 07283912) > Registered in England & Wales - Level 12, 5 Aldermanbury Square London > EC2V 7HR > > Unless otherwise stated, this communication is neither an offer nor > solicitation of an offer to sell or purchase any investment > > > The contents of this email including any attachments are confidential. > If you have received this email in error, please advise the sender by > return email and delete this email. Any unauthorised use of the > contents of the email is prohibited and you must not disseminate, copy > or distribute the message or use the information contained in the > email or its attachments in any way. > > The views or opinions expressed are the author's own and may not > reflect the views or opinions of Tibra. Tibra does not guarantee the > integrity of any emails or attached files. E-mails may be interfered > with, may contain computer viruses or other defects. Under no > circumstances do we accept liability for any loss or damage which may > result from your receipt of this message or any attachments. > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion -------------- next part -------------- An HTML attachment was scrubbed... 
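A loop-free variant of the same idea (a sketch, not taken from the thread, assuming the testdata array defined above) groups the rows by date with np.unique and averages them with np.bincount:

import numpy as np

# unique days, plus for every row the index of its day in `dates`
dates, inv = np.unique(testdata[:, 0], return_inverse=True)
values = testdata[:, 2].astype(float)  # third column holds the numbers (object dtype -> float)

# per-day sum divided by per-day count gives the per-day mean
means = np.bincount(inv, weights=values) / np.bincount(inv)
for day, mean in zip(dates, means):
    print(day, mean)

This replaces the per-day boolean masks with a single pass over the array, which helps once there are many distinct dates.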
URL: From Roelf.Schreurs at tibra.com Wed Oct 9 10:31:17 2013 From: Roelf.Schreurs at tibra.com (Roelf Schreurs) Date: Wed, 9 Oct 2013 14:31:17 +0000 Subject: [Numpy-discussion] select column based on another column value In-Reply-To: <52556694.6070007@gmail.com> References: <39665C187B885C4296F1D80EC22699D680DD3E7B@LCINF16.tibra.com> <52556694.6070007@gmail.com> Message-ID: <39665C187B885C4296F1D80EC22699D680DD4202@LCINF16.tibra.com> That does work thanks. And in a for loop was my goal, so this works for me. for i in np.unique(testdata[:,0]): print(i, testdata[testdata[:,0]==i][:,2].mean()) (datetime.date(2013, 10, 7), 2.9333333333333336) (datetime.date(2013, 10, 8), 5.633333333333334) (datetime.date(2013, 10, 9), 4.266666666666667) Roelf From: numpy-discussion-bounces at scipy.org [mailto:numpy-discussion-bounces at scipy.org] On Behalf Of Georgios Exarchakis Sent: 09 October 2013 15:22 To: Discussion of Numerical Python Subject: Re: [Numpy-discussion] select column based on another column value I think you need this >>> testdata[testdata[:,0]==today][:,2].mean() 4.266666666666667 >>> testdata[testdata[:,0]==yesterday][:,2].mean() 5.633333333333334 >>> testdata[testdata[:,0]==twodaysago][:,2].mean() 2.9333333333333336 On 10/09/2013 06:46 AM, Roelf Schreurs wrote: Hi I have the following array and want to calculate the average per day. Import numpy as np from datetime import date today = date(today.year, today.month, 9) yesterday = date(today.year, today.month, 8) twodaysago = date(today.year, today.month, 7) testdata = np.array([[today, "r", 3.2],[today, "r", 4.3],[today, "r", 5.3],[yesterday, "r", 6.3],[yesterday, "r", 9.3],[yesterday, "r", 1.3],[twodaysago, "r", 3.3],[twodaysago, "r", 1.2],[twodaysago, "r", 4.3]]) which produces array([[datetime.date(2013, 10, 9), 'r', 3.2], [datetime.date(2013, 10, 9), 'r', 4.3], [datetime.date(2013, 10, 9), 'r', 5.3], [datetime.date(2013, 10, 8), 'r', 6.3], [datetime.date(2013, 10, 8), 'r', 9.3], [datetime.date(2013, 10, 8), 'r', 1.3], [datetime.date(2013, 10, 7), 'r', 3.3], [datetime.date(2013, 10, 7), 'r', 1.2], [datetime.date(2013, 10, 7), 'r', 4.3]], dtype=object) And I want the output as (numbers aren't correct below): 2013-10-09, 4.54 2013-10-08, 5.43 2013-10-07, 2.76 But not sure how. Thanks Roelf Tibra Trading Europe Limited is Authorised and Regulated by the FCA (Company No. 06061713) Tibra Global Services Pty Limited London Branch (Company No. BR014370) Tibra Equities Europe Limited (Company No. 07283912) Registered in England & Wales - Level 12, 5 Aldermanbury Square London EC2V 7HR Unless otherwise stated, this communication is neither an offer nor solicitation of an offer to sell or purchase any investment The contents of this email including any attachments are confidential. If you have received this email in error, please advise the sender by return email and delete this email. Any unauthorised use of the contents of the email is prohibited and you must not disseminate, copy or distribute the message or use the information contained in the email or its attachments in any way. The views or opinions expressed are the author's own and may not reflect the views or opinions of Tibra. Tibra does not guarantee the integrity of any emails or attached files. E-mails may be interfered with, may contain computer viruses or other defects. Under no circumstances do we accept liability for any loss or damage which may result from your receipt of this message or any attachments. 
_______________________________________________ NumPy-Discussion mailing list NumPy-Discussion at scipy.org http://mail.scipy.org/mailman/listinfo/numpy-discussion Tibra Trading Europe Limited is Authorised and Regulated by the FCA (Company No. 06061713) Tibra Global Services Pty Limited London Branch (Company No. BR014370) Tibra Equities Europe Limited (Company No. 07283912) Registered in England & Wales - Level 12, 5 Aldermanbury Square London EC2V 7HR Unless otherwise stated, this communication is neither an offer nor solicitation of an offer to sell or purchase any investment The contents of this email including any attachments are confidential. If you have received this email in error, please advise the sender by return email and delete this email. Any unauthorised use of the contents of the email is prohibited and you must not disseminate, copy or distribute the message or use the information contained in the email or its attachments in any way. The views or opinions expressed are the author's own and may not reflect the views or opinions of Tibra. Tibra does not guarantee the integrity of any emails or attached files. E-mails may be interfered with, may contain computer viruses or other defects. Under no circumstances do we accept liability for any loss or damage which may result from your receipt of this message or any attachments. -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Wed Oct 9 10:40:54 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Wed, 9 Oct 2013 08:40:54 -0600 Subject: [Numpy-discussion] Segfault with QR Decomposition In-Reply-To: <52556259.7030003@gmail.com> References: <52556259.7030003@gmail.com> Message-ID: On Wed, Oct 9, 2013 at 8:04 AM, Charanpal Dhanjal wrote: > I get a segmentation fault upon running the following: > > import numpy > A = numpy.ones((7000000, 8)) > Q, R = numpy.linalg.qr(A) > > on Python 2.7.3, Linux 64-bit using numpy 1.9.0.dev-ec3603f linked > against OpenBLAS. If A is a smaller matrix then the QR decomposition > works (for example A has shape (4000000, 8)). I haven't quite narrowed > down the exact threshold where the crash occurs, however I know that the > above A is 448 MB (Q and R are no bigger), and the machine in question > has 32GB of RAM. I also tested scipy.linalg.qr (version > 0.14.0.dev-ced994c) with the same results. > > I don't get the same problem on my laptop which is Python 2.7.3, Linux > 64-bit but with numpy 1.8.0rc1 linked to OpenBLAS. Both machines have > OpenBLAS 0.2.6. Does anyone have some insight into why this problem is > occuring? > > Works fine here with '1.9.0.dev-7457f15' linked against ATLAS. I suspect problem is in OpenBLAS. What architecture/os do the two machines have? Is OpenBLAS using more than one core? Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From charanpal at gmail.com Wed Oct 9 11:02:57 2013 From: charanpal at gmail.com (Charanpal Dhanjal) Date: Wed, 09 Oct 2013 17:02:57 +0200 Subject: [Numpy-discussion] Segfault with QR Decomposition References: 52556259.7030003@gmail.com Message-ID: <52557021.5060103@gmail.com> > >/ I get a segmentation fault upon running the following: > />/ > />/ import numpy > />/ A = numpy.ones((7000000, 8)) > />/ Q, R = numpy.linalg.qr(A) > />/ > />/ on Python 2.7.3, Linux 64-bit using numpy 1.9.0.dev-ec3603f linked > />/ against OpenBLAS. If A is a smaller matrix then the QR decomposition > />/ works (for example A has shape (4000000, 8)). 
I haven't quite narrowed > />/ down the exact threshold where the crash occurs, however I know that the > />/ above A is 448 MB (Q and R are no bigger), and the machine in question > />/ has 32GB of RAM. I also tested scipy.linalg.qr (version > />/ 0.14.0.dev-ced994c) with the same results. > />/ > />/ I don't get the same problem on my laptop which is Python 2.7.3, Linux > />/ 64-bit but with numpy 1.8.0rc1 linked to OpenBLAS. Both machines have > />/ OpenBLAS 0.2.6. Does anyone have some insight into why this problem is > />/ occuring? > />/ > />/ > /Works fine here with '1.9.0.dev-7457f15' linked against ATLAS. I suspect > problem is in OpenBLAS. What architecture/os do the two machines have? Is > OpenBLAS using more than one core? Thanks for testing the code. On the desktop machine I am using Debian GNU/Linux 7.2 (wheezy) compiled for x86-64 (Intel(R) Xeon(R) CPU E5-1620), and on my laptop I use Ubuntu 13.04 for x86-64 (Intel(R) Core(TM) i7-3740QM). I tried: export OPENBLAS_NUM_THREADS=1 and export OPENBLAS_NUM_THREADS=1 with the same results (a segfault). Charanpal -------------- next part -------------- An HTML attachment was scrubbed... URL: From charanpal at gmail.com Wed Oct 9 11:05:35 2013 From: charanpal at gmail.com (Charanpal Dhanjal) Date: Wed, 09 Oct 2013 17:05:35 +0200 Subject: [Numpy-discussion] Segfault with QR Decomposition Message-ID: <525570BF.7020509@gmail.com> Oops, the second line should have been export OPENBLAS_NUM_THREADS=8 From jtaylor.debian at googlemail.com Wed Oct 9 11:24:30 2013 From: jtaylor.debian at googlemail.com (Julian Taylor) Date: Wed, 9 Oct 2013 17:24:30 +0200 Subject: [Numpy-discussion] Segfault with QR Decomposition In-Reply-To: <525570BF.7020509@gmail.com> References: <525570BF.7020509@gmail.com> Message-ID: yes thats probably openblas fault. Openblas crashes all the time as soon as your matrices get bigger than a couple of megabytes. I'll investigate and report it upstream (as I have already far too often for the exact same reason ...) On Wed, Oct 9, 2013 at 5:05 PM, Charanpal Dhanjal wrote: > Oops, the second line should have been > > export OPENBLAS_NUM_THREADS=8 > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From jtaylor.debian at googlemail.com Wed Oct 9 13:21:51 2013 From: jtaylor.debian at googlemail.com (Julian Taylor) Date: Wed, 09 Oct 2013 19:21:51 +0200 Subject: [Numpy-discussion] Segfault with QR Decomposition In-Reply-To: References: <525570BF.7020509@gmail.com> Message-ID: <525590AF.4090505@googlemail.com> https://github.com/xianyi/OpenBLAS/issues/304 On 09.10.2013 17:24, Julian Taylor wrote: > yes thats probably openblas fault. > Openblas crashes all the time as soon as your matrices get bigger than a > couple of megabytes. > I'll investigate and report it upstream (as I have already far too often > for the exact same reason ...) 
> > > On Wed, Oct 9, 2013 at 5:05 PM, Charanpal Dhanjal > wrote: > > Oops, the second line should have been > > export OPENBLAS_NUM_THREADS=8 > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From Bernhard.Spinnler at gmx.net Wed Oct 9 14:48:41 2013 From: Bernhard.Spinnler at gmx.net (Bernhard Spinnler) Date: Wed, 9 Oct 2013 20:48:41 +0200 Subject: [Numpy-discussion] Bug in numpy.correlate documentation In-Reply-To: References: <847A3113-2EF8-4AE6-84DC-95AB2D9EFEDB@gmx.net> Message-ID: <94AEB34E-F6D7-4833-BE0D-707728481A90@gmx.net> Hi Richard, Ah, I searched the list but didn't find those posts before? I can easily imagine that correlation is defined differently in different disciplines. Both ways are correct and it's just a convention or definition. In my field (Digital Communications, Digital Signal Processing) the vast majority uses the convention implemented by the code. Here are a few examples of prominent text books: - Papoulis, "Probaility, Random Variables, and Stochastic Processes", McGraw-Hill, 2nd ed. - Benvenuto, Cherubini, "Algorithms for Communications Systems and their Applications", Wiley. - Carlson, "Communication Systems" 4th ed. 2002, McGraw-Hill. Last not least, Matlab's xcorr() function behaves exactly like correlate() does right now, see - http://www.mathworks.de/de/help/signal/ref/xcorr.html But, as you say, the most important aspect might be, that most people will probably prefer changing the docs instead of changing the code. Should I file a bug somewhere? Cheers, Bernhard On 08.10.2013, at 21:10, Richard Hattersley wrote: > Hi Bernard, > > Looks like you're on to something - two other people have raised this discrepancy before: https://github.com/numpy/numpy/issues/2588. Unfortunately, when it comes to resolving the discrepancy one of the previous comments takes the opposite view. Namely, that the docstring is correct and the code is wrong. > > Do different domains use different conventions here? Are there some references to back up one stance or another? > > But all else being equal, I'm guessing there'll be far more appetite for updating the documentation than the code. > > Regards, > Richard Hattersley > > > On 7 October 2013 22:09, Bernhard Spinnler wrote: > The numpy.correlate documentation says: > > correlate(a, v) = z[k] = sum_n a[n] * conj(v[n+k]) > > In [1]: a = [1, 2] > > In [2]: v = [2, 1j] > > In [3]: z = correlate(a, v, 'full') > > In [4]: z > Out[4]: array([ 0.-1.j, 2.-2.j, 4.+0.j]) > > However, according to the documentation, z should be > > z[-1] = a[1] * conj(v[0]) = 4.+0.j > z[0] = a[0] * conj(v[0]) + a[1] * conj(v[1]) = 2.-2.j > z[1] = a[0] * conj(v[1]) = 0.-1.j > > which is the time reversed version of what correlate() calculates. > > IMHO, the correlate() code is correct. 
The correct formula in the docs (which is also the correlation formula in standard text books) should be > > z[k] = sum_n a[n+k] * conj(v[n]) > > Cheers, > Bernhard > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion -------------- next part -------------- An HTML attachment was scrubbed... URL: From d.l.goldsmith at gmail.com Wed Oct 9 16:19:05 2013 From: d.l.goldsmith at gmail.com (David Goldsmith) Date: Wed, 9 Oct 2013 13:19:05 -0700 Subject: [Numpy-discussion] Bug in numpy.correlate documentation Message-ID: Looks like Wolfram MathWorld would favor the docstring, but the possibility of a "use-domain" dependency seems plausible (after all, a similar dilemma is observed, e.g., w/ the Fourier Transform)--I guess one discipline's future is another discipline's past. :-) http://mathworld.wolfram.com/Autocorrelation.html DG Date: Tue, 8 Oct 2013 20:10:41 +0100 > From: Richard Hattersley > Subject: Re: [Numpy-discussion] Bug in numpy.correlate documentation > To: Discussion of Numerical Python > Message-ID: > fw at mail.gmail.com> > Content-Type: text/plain; charset="iso-8859-1" > > Hi Bernard, > > Looks like you're on to something - two other people have raised this > discrepancy before: https://github.com/numpy/numpy/issues/2588. > Unfortunately, when it comes to resolving the discrepancy one of the > previous comments takes the opposite view. Namely, that the docstring is > correct and the code is wrong. > > Do different domains use different conventions here? Are there some > references to back up one stance or another? > > But all else being equal, I'm guessing there'll be far more appetite for > updating the documentation than the code. > > Regards, > Richard Hattersley > > > On 7 October 2013 22:09, Bernhard Spinnler >wrote: > > > The numpy.correlate documentation says: > > > > correlate(a, v) = z[k] = sum_n a[n] * conj(v[n+k]) > > > > > [so] according to the documentation, z should be > > > > z[-1] = a[1] * conj(v[0]) = 4.+0.j > > z[0] = a[0] * conj(v[0]) + a[1] * conj(v[1]) = 2.-2.j > > z[1] = a[0] * conj(v[1]) = 0.-1.j > > > > which is the time reversed version of what correlate() calculates. > -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Wed Oct 9 16:54:07 2013 From: njs at pobox.com (Nathaniel Smith) Date: Wed, 9 Oct 2013 21:54:07 +0100 Subject: [Numpy-discussion] Bug in numpy.correlate documentation In-Reply-To: <94AEB34E-F6D7-4833-BE0D-707728481A90@gmx.net> References: <847A3113-2EF8-4AE6-84DC-95AB2D9EFEDB@gmx.net> <94AEB34E-F6D7-4833-BE0D-707728481A90@gmx.net> Message-ID: On Wed, Oct 9, 2013 at 7:48 PM, Bernhard Spinnler wrote: > Hi Richard, > > Ah, I searched the list but didn't find those posts before? > > I can easily imagine that correlation is defined differently in different > disciplines. Both ways are correct and it's just a convention or definition. > In my field (Digital Communications, Digital Signal Processing) the vast > majority uses the convention implemented by the code. Here are a few > examples of prominent text books: > > - Papoulis, "Probaility, Random Variables, and Stochastic Processes", > McGraw-Hill, 2nd ed. > - Benvenuto, Cherubini, "Algorithms for Communications Systems and their > Applications", Wiley. 
> - Carlson, "Communication Systems" 4th ed. 2002, McGraw-Hill. > > Last not least, Matlab's xcorr() function behaves exactly like correlate() > does right now, see > - http://www.mathworks.de/de/help/signal/ref/xcorr.html > > But, as you say, the most important aspect might be, that most people will > probably prefer changing the docs instead of changing the code. Yeah, unless the current behaviour is actually broken or redundant in some way, we're not going to switch from one perfectly good convention to another perfectly good convention and break everyone's code in the process. The most helpful thing would be if you could file a pull request that just changes the docstring to what you think it should be. Extra bonus points if it points out that there is another definition some people might be expecting instead, and explains how those people can use the existing functions to get what they want. :-) -n From lists at hilboll.de Thu Oct 10 08:43:25 2013 From: lists at hilboll.de (Andreas Hilboll) Date: Thu, 10 Oct 2013 14:43:25 +0200 Subject: [Numpy-discussion] Reading from binary file with memmap, with offset Message-ID: <5256A0ED.8070508@hilboll.de> Hi, I have a problem using memmap correctly. I need to read a data file which consists of an ASCII header and appended binary single precision floating point values. memmap complains that the "Size of available data is not a multiple of the data-type size." But as far as I can tell, the size *doas* match the data-type size. The problem is illustrated by the following code: ---8<------- url = "http://www.iup.uni-bremen.de/~hilboll/download/20120204.XD4_N2" localfile = "np_memmap.dat" import os import urllib import numpy as np # download data file if not os.access(localfile, os.R_OK): urllib.urlretrieve(url, localfile) with open(localfile, "r") as fd: # read offset from first line of file offset = int(fd.readlines()[0].split()[-2]) # jump to begin of data block fd.seek(offset) # read until EOF blob = fd.read() print("Size of data blob [bytes]: {}".format(len(blob))) print("This can actually be divided by 4: {} / 4.0 = {}".format( len(blob), len(blob) / 4.0)) # go back to begin of data block fd.seek(offset) print("But it cannot be loaded as np.memmap with dtype float32:") np.memmap(fd, dtype="float32", mode="r") ---8<------- Any help is greatly appreciated :) -- Andreas. From jtaylor.debian at googlemail.com Thu Oct 10 08:57:39 2013 From: jtaylor.debian at googlemail.com (Julian Taylor) Date: Thu, 10 Oct 2013 14:57:39 +0200 Subject: [Numpy-discussion] Reading from binary file with memmap, with offset In-Reply-To: <5256A0ED.8070508@hilboll.de> References: <5256A0ED.8070508@hilboll.de> Message-ID: you have to use the offset keyword argument of np.memmap, else it will always start from the beginning of the file >>> np.memmap(fd, dtype="float32", mode="r", offset=offset) On Thu, Oct 10, 2013 at 2:43 PM, Andreas Hilboll wrote: > Hi, > > I have a problem using memmap correctly. I need to read a data file > which consists of an ASCII header and appended binary single precision > floating point values. memmap complains that the "Size of available data > is not a multiple of the data-type size." But as far as I can tell, the > size *doas* match the data-type size. 
> > The problem is illustrated by the following code: > > ---8<------- > > url = "http://www.iup.uni-bremen.de/~hilboll/download/20120204.XD4_N2" > localfile = "np_memmap.dat" > > import os > import urllib > > import numpy as np > > > # download data file > if not os.access(localfile, os.R_OK): > urllib.urlretrieve(url, localfile) > > with open(localfile, "r") as fd: > # read offset from first line of file > offset = int(fd.readlines()[0].split()[-2]) > # jump to begin of data block > fd.seek(offset) > # read until EOF > blob = fd.read() > print("Size of data blob [bytes]: {}".format(len(blob))) > print("This can actually be divided by 4: {} / 4.0 = {}".format( > len(blob), len(blob) / 4.0)) > # go back to begin of data block > fd.seek(offset) > print("But it cannot be loaded as np.memmap with dtype float32:") > np.memmap(fd, dtype="float32", mode="r") > > ---8<------- > > Any help is greatly appreciated :) > > -- Andreas. > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From joferkington at gmail.com Thu Oct 10 09:07:58 2013 From: joferkington at gmail.com (Joe Kington) Date: Thu, 10 Oct 2013 08:07:58 -0500 Subject: [Numpy-discussion] Reading from binary file with memmap, with offset In-Reply-To: <5256A0ED.8070508@hilboll.de> References: <5256A0ED.8070508@hilboll.de> Message-ID: You just need to supply the "offset" kwarg to "memmap". for example: with open(localfile, "r") as fd: # read offset from first line of file offset = int(next(fd).split()[-2]) np.memmap(fd, dtype="float32", mode="r", offset=offset) Also, there's no need to do things like "offset = int(fd.readlines()[0].split()[-2])" Just do "offset = int(next(fd).split()[-2])" instead. Readlines reads the entire file into memory. You only want the first line. Hope that helps! -Joe On Thu, Oct 10, 2013 at 7:43 AM, Andreas Hilboll wrote: > Hi, > > I have a problem using memmap correctly. I need to read a data file > which consists of an ASCII header and appended binary single precision > floating point values. memmap complains that the "Size of available data > is not a multiple of the data-type size." But as far as I can tell, the > size *doas* match the data-type size. > > The problem is illustrated by the following code: > > ---8<------- > > url = "http://www.iup.uni-bremen.de/~hilboll/download/20120204.XD4_N2" > localfile = "np_memmap.dat" > > import os > import urllib > > import numpy as np > > > # download data file > if not os.access(localfile, os.R_OK): > urllib.urlretrieve(url, localfile) > > with open(localfile, "r") as fd: > # read offset from first line of file > offset = int(fd.readlines()[0].split()[-2]) > # jump to begin of data block > fd.seek(offset) > # read until EOF > blob = fd.read() > print("Size of data blob [bytes]: {}".format(len(blob))) > print("This can actually be divided by 4: {} / 4.0 = {}".format( > len(blob), len(blob) / 4.0)) > # go back to begin of data block > fd.seek(offset) > print("But it cannot be loaded as np.memmap with dtype float32:") > np.memmap(fd, dtype="float32", mode="r") > > ---8<------- > > Any help is greatly appreciated :) > > -- Andreas. 
> _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From lists at hilboll.de Thu Oct 10 09:16:42 2013 From: lists at hilboll.de (Andreas Hilboll) Date: Thu, 10 Oct 2013 15:16:42 +0200 Subject: [Numpy-discussion] Reading from binary file with memmap, with offset In-Reply-To: References: <5256A0ED.8070508@hilboll.de> Message-ID: <5256A8BA.9050203@hilboll.de> On 10.10.2013 14:57, Julian Taylor wrote: > you have to use the offset keyword argument of np.memmap, else it will > always start from the beginning of the file > >>>> np.memmap(fd, dtype="float32", mode="r", offset=offset) Thanks for that! I just proposed an update to the memmap docstring to better reflect this: https://github.com/numpy/numpy/pull/3890 -- Andreas. From lists at hilboll.de Thu Oct 10 09:17:26 2013 From: lists at hilboll.de (Andreas Hilboll) Date: Thu, 10 Oct 2013 15:17:26 +0200 Subject: [Numpy-discussion] Reading from binary file with memmap, with offset In-Reply-To: References: <5256A0ED.8070508@hilboll.de> Message-ID: <5256A8E6.10601@hilboll.de> On 10.10.2013 15:07, Joe Kington wrote: > You just need to supply the "offset" kwarg to "memmap". > > for example: > > with open(localfile, "r") as fd: > # read offset from first line of file > offset = int(next(fd).split()[-2]) > np.memmap(fd, dtype="float32", mode="r", offset=offset) > > Also, there's no need to do things like "offset = > int(fd.readlines()[0].split()[-2])" > > Just do "offset = int(next(fd).split()[-2])" instead. Readlines reads > the entire file into memory. You only want the first line. Thanks, Joe, I actually didn't know `next` yet ... -- Andreas. From sunk.cs at gmail.com Thu Oct 10 10:00:47 2013 From: sunk.cs at gmail.com (Ke Sun) Date: Thu, 10 Oct 2013 16:00:47 +0200 Subject: [Numpy-discussion] Efficient square distance computation In-Reply-To: References: <20131008080625.GA26473@probe.unige.ch> <20131008113828.GA31484@probe.unige.ch> Message-ID: <20131010140047.GA5067@probe.unige.ch> > > > On a machine I had access to it took about 20 minutes. > > How? I am using matrix multiplication (the same code as > > http://stackoverflow.com/a/4856692) and it runs for around 18 hours. > > > > > make sure you are using an optimized BLAS library. > e.g. with openblas I could also do this size in about 100 seconds wall time > (20 minutes cpu time). I recompiled numpy+scipy with intel compiler and MKL. Now it takes around 2 MINUTES on my machine. Horay! Thanks! Best, Ke Sun From d.l.goldsmith at gmail.com Thu Oct 10 13:27:41 2013 From: d.l.goldsmith at gmail.com (David Goldsmith) Date: Thu, 10 Oct 2013 10:27:41 -0700 Subject: [Numpy-discussion] Bug in numpy.correlate documentation Message-ID: Date: Wed, 9 Oct 2013 21:54:07 +0100 > From: Nathaniel Smith > Subject: Re: [Numpy-discussion] Bug in numpy.correlate documentation > To: Discussion of Numerical Python > Message-ID: > z8V-aHUU+85LZ88xYWmAwxgzHk5GhtfuW8HN9A at mail.gmail.com> > Content-Type: text/plain; charset=UTF-8 > > On Wed, Oct 9, 2013 at 7:48 PM, Bernhard Spinnler > wrote: > > Hi Richard, > > > > Ah, I searched the list but didn't find those posts before? > > > > I can easily imagine that correlation is defined differently in different > > disciplines. Both ways are correct and it's just a convention or > definition. 
> > In my field (Digital Communications, Digital Signal Processing) the vast > > majority uses the convention implemented by the code. Here are a few > > examples of prominent text books: > > > > - Papoulis, "Probaility, Random Variables, and Stochastic Processes", > > McGraw-Hill, 2nd ed. > > - Benvenuto, Cherubini, "Algorithms for Communications Systems and their > > Applications", Wiley. > > - Carlson, "Communication Systems" 4th ed. 2002, McGraw-Hill. > > > > Last not least, Matlab's xcorr() function behaves exactly like > correlate() > > does right now, see > > - http://www.mathworks.de/de/help/signal/ref/xcorr.html > > > > But, as you say, the most important aspect might be, that most people > will > > probably prefer changing the docs instead of changing the code. > > Yeah, unless the current behaviour is actually broken or redundant in > some way, we're not going to switch from one perfectly good convention > to another perfectly good convention and break everyone's code in the > process. > > The most helpful thing would be if you could file a pull request that > just changes the docstring to what you think it should be. Extra bonus > points if it points out that there is another definition some people > might be expecting instead, and explains how those people can use the > existing functions to get what they want. :-) > > -n > IMHO, "point[ing] out that there is another definition some people might be expecting instead, and explain[ing] how those people can use the existing functions to get what they want" should be a requirement for the docstring ("Notes" section), not merely worth "extra bonus points." But then I'm not, presently, in a position to edit the docstring myself, so that's just MHO. IAE, I found what appears to me to be another "vote" for the extant docstring: Box & Jenkins, 1976, "Time Series Analysis: Forecasting and Control," Holden-Day, Oakland, pg. 374. Perhaps a "switch" (with a default value that maintains current definition, so that extant uses would not require a code change) c/should be added to the function signature so that users can get easily get what they want? DG -------------- next part -------------- An HTML attachment was scrubbed... URL: From Bernhard.Spinnler at gmx.net Thu Oct 10 15:12:08 2013 From: Bernhard.Spinnler at gmx.net (Bernhard Spinnler) Date: Thu, 10 Oct 2013 21:12:08 +0200 Subject: [Numpy-discussion] Bug in numpy.correlate documentation In-Reply-To: References: Message-ID: <39E54564-4A84-4887-8627-2DC93C4D2143@gmx.net> It seems to me that Wolfram is following yet another path. From http://mathworld.wolfram.com/Autocorrelation.html and more importantly http://mathworld.wolfram.com/Cross-Correlation.html, equation (5): z_mathworld[k] = sum_n conj(a[n]) * v[n+k] = conj( sum_n a[n] * conj(v[n+k]) ) = conj( z_numpyDocstring[k] ) = conj( z_numpyCode[-k] ) is the conjugate of what the numpy docstring says. So, now we have at least three definitions to chose from :-) Cheers, Bernhard On 09.10.2013, at 22:19, David Goldsmith wrote: > Looks like Wolfram MathWorld would favor the docstring, but the possibility of a "use-domain" dependency seems plausible (after all, a similar dilemma is observed, e.g., w/ the Fourier Transform)--I guess one discipline's future is another discipline's past. :-) > > http://mathworld.wolfram.com/Autocorrelation.html > -------------- next part -------------- An HTML attachment was scrubbed... 
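For readers expecting the convention stated in the current docstring (or the Mathworld one), a short sketch shows how to obtain it from the existing function; it reuses the a and v from earlier in the thread and only relies on the relations above (time reversal between the code and the docstring, an extra conjugation for Mathworld):

import numpy as np

a = np.array([1, 2])
v = np.array([2, 1j])

z_code = np.correlate(a, v, 'full')   # what numpy actually computes
# -> array([ 0.-1.j,  2.-2.j,  4.+0.j])

z_doc = z_code[::-1]                  # the docstring/textbook convention: z_doc[k] = z_code[-k]
# -> array([ 4.+0.j,  2.-2.j,  0.-1.j])

# equivalently: swap the arguments and conjugate
assert np.allclose(z_doc, np.correlate(v, a, 'full').conj())

The Mathworld definition is then simply the elementwise complex conjugate of z_doc.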
URL: From Bernhard.Spinnler at gmx.net Thu Oct 10 15:31:12 2013 From: Bernhard.Spinnler at gmx.net (Bernhard Spinnler) Date: Thu, 10 Oct 2013 21:31:12 +0200 Subject: [Numpy-discussion] Bug in numpy.correlate documentation In-Reply-To: References: Message-ID: <09928B8C-1962-42A4-8566-297A2B07C1B1@gmx.net> On 10.10.2013, at 19:27, David Goldsmith wrote: > On Wed, Oct 9, 2013 at 7:48 PM, Bernhard Spinnler > wrote: > > Hi Richard, > > > > Ah, I searched the list but didn't find those posts before? > > > > I can easily imagine that correlation is defined differently in different > > disciplines. Both ways are correct and it's just a convention or definition. > > In my field (Digital Communications, Digital Signal Processing) the vast > > majority uses the convention implemented by the code. Here are a few > > examples of prominent text books: > > > > - Papoulis, "Probaility, Random Variables, and Stochastic Processes", > > McGraw-Hill, 2nd ed. > > - Benvenuto, Cherubini, "Algorithms for Communications Systems and their > > Applications", Wiley. > > - Carlson, "Communication Systems" 4th ed. 2002, McGraw-Hill. > > > > Last not least, Matlab's xcorr() function behaves exactly like correlate() > > does right now, see > > - http://www.mathworks.de/de/help/signal/ref/xcorr.html > > > > But, as you say, the most important aspect might be, that most people will > > probably prefer changing the docs instead of changing the code. > > Yeah, unless the current behaviour is actually broken or redundant in > some way, we're not going to switch from one perfectly good convention > to another perfectly good convention and break everyone's code in the > process. > > The most helpful thing would be if you could file a pull request that > just changes the docstring to what you think it should be. Extra bonus > points if it points out that there is another definition some people > might be expecting instead, and explains how those people can use the > existing functions to get what they want. :-) > > -n > > IMHO, "point[ing] out that there is another definition some people might be expecting instead, and explain[ing] how those people can use the existing functions to get what they want" should be a requirement for the docstring ("Notes" section), not merely worth "extra bonus points." But then I'm not, presently, in a position to edit the docstring myself, so that's just MHO. > > IAE, I found what appears to me to be another "vote" for the extant docstring: Box & Jenkins, 1976, "Time Series Analysis: Forecasting and Control," Holden-Day, Oakland, pg. 374. Perhaps a "switch" (with a default value that maintains current definition, so that extant uses would not require a code change) c/should be added to the function signature so that users can get easily get what they want? > As pointed out in another post in this thread, there are now at least three different definitions of correlation which are in use in different disciplines of science and engineering: Numpy code: z_numpyCode[k] = sum_n a[n+k] * conj(v[n]) Numpy docs: z_numpyDoc[k] = sum_n a[n] * conj(v[n+k]) = sum_n a[n-k] * conj(v[n]) = z_numpyCode[-k] Wolfram Mathworld: z_mmca[k] = sum_n conj(a[n]) * v[n+k] = conj( sum_n a[n] * conj(v[n+k]) ) = conj( z_numpyDoc[k] ) = conj( z_numpyCode[-k] ) I'm sure there are even more if you search long enough. But shouldn't the primary objective be to bring the docs in line with the code (which is definitely not "broken")? 
It took me 2 days of debugging my code recently only to discover that numpy correlate() was calculating a different correlation than the docs said. I can try to come up with a proposal for the docs. Could anyone point me to where I can find the docs? I can clone the numpy repo, however, I'm not a numpy developer. Best wishes, Bernhard -------------- next part -------------- An HTML attachment was scrubbed... URL: From jtaylor.debian at googlemail.com Thu Oct 10 19:19:38 2013 From: jtaylor.debian at googlemail.com (Julian Taylor) Date: Fri, 11 Oct 2013 01:19:38 +0200 Subject: [Numpy-discussion] Bug in numpy.correlate documentation In-Reply-To: <09928B8C-1962-42A4-8566-297A2B07C1B1@gmx.net> References: <09928B8C-1962-42A4-8566-297A2B07C1B1@gmx.net> Message-ID: <5257360A.6010608@googlemail.com> On 10.10.2013 21:31, Bernhard Spinnler wrote: > > On 10.10.2013, at 19:27, David Goldsmith > wrote: >> >> On Wed, Oct 9, 2013 at 7:48 PM, Bernhard Spinnler >> > wrote: >> > Hi Richard, >> > >> > Ah, I searched the list but didn't find those posts before? >> > >> > I can easily imagine that correlation is defined differently in >> different >> > disciplines. Both ways are correct and it's just a convention or >> definition. >> > In my field (Digital Communications, Digital Signal Processing) >> the vast >> > majority uses the convention implemented by the code. Here are a few >> > examples of prominent text books: >> > >> > - Papoulis, "Probaility, Random Variables, and Stochastic >> Processes", >> > McGraw-Hill, 2nd ed. >> > - Benvenuto, Cherubini, "Algorithms for Communications Systems >> and their >> > Applications", Wiley. >> > - Carlson, "Communication Systems" 4th ed. 2002, McGraw-Hill. >> > >> > Last not least, Matlab's xcorr() function behaves exactly like >> correlate() >> > does right now, see >> > - http://www.mathworks.de/de/help/signal/ref/xcorr.html >> > >> > But, as you say, the most important aspect might be, that most >> people will >> > probably prefer changing the docs instead of changing the code. >> >> Yeah, unless the current behaviour is actually broken or redundant in >> some way, we're not going to switch from one perfectly good convention >> to another perfectly good convention and break everyone's code in the >> process. >> >> The most helpful thing would be if you could file a pull request that >> just changes the docstring to what you think it should be. Extra bonus >> points if it points out that there is another definition some people >> might be expecting instead, and explains how those people can use the >> existing functions to get what they want. :-) >> >> -n >> >> >> IMHO, "point[ing] out that there is another definition some people >> might be expecting instead, and explain[ing] how those people can use >> the existing functions to get what they want" should be a requirement >> for the docstring ("Notes" section), not merely worth "extra bonus >> points." But then I'm not, presently, in a position to edit the >> docstring myself, so that's just MHO. >> >> IAE, I found what appears to me to be another "vote" for the extant >> docstring: Box & Jenkins, 1976, "Time Series Analysis: Forecasting and >> Control," Holden-Day, Oakland, pg. 374. Perhaps a "switch" (with a >> default value that maintains current definition, so that extant uses >> would not require a code change) c/should be added to the function >> signature so that users can get easily get what they want? 
>> > > As pointed out in another post in this thread, there are now at least > three different definitions of correlation which are in use in different > disciplines of science and engineering: > > Numpy code: > > z_numpyCode[k] = sum_n a[n+k] * conj(v[n]) > > > Numpy docs: > > z_numpyDoc[k] = sum_n a[n] * conj(v[n+k]) > = sum_n a[n-k] * conj(v[n]) > = z_numpyCode[-k] > > > Wolfram Mathworld: > > z_mmca[k] = sum_n conj(a[n]) * v[n+k] > = conj( sum_n a[n] * conj(v[n+k]) ) > = conj( z_numpyDoc[k] ) > = conj( z_numpyCode[-k] ) > > I'm sure there are even more if you search long enough. But shouldn't > the primary objective be to bring the docs in line with the code (which > is definitely not "broken")? It took me 2 days of debugging my > code recently only to discover that numpy correlate() was calculating a > different correlation than the docs said. > > I can try to come up with a proposal for the docs. Could anyone point me > to where I can find the docs? I can clone the numpy repo, however, I'm > not a numpy developer. > yes we should only change the documentation to match the (hopefully correct) code. the documentation is in the docstring of the correlate function in numpy/core/numeric.py line 819 From stefan at sun.ac.za Fri Oct 11 02:10:21 2013 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Fri, 11 Oct 2013 08:10:21 +0200 Subject: [Numpy-discussion] Python function for line intersection?? In-Reply-To: <1381238154.58791984@f77.i.mail.ru> References: <1381238154.58791984@f77.i.mail.ru> Message-ID: On Tue, Oct 8, 2013 at 3:15 PM, Happyman wrote: > # slope of each line > m = (B[:, 1] - A[:, 1]) / (B[:, 0] - A[:, 0]) > b = A[:, 1] - m * A[:, 0] Calculating the slope of a line leads to difficulties, e.g. when dealing with vertical lines. Rather, parameterize lines as shown here: http://paulbourke.net/geometry/pointlineplane/ St?fan From ralf.gommers at gmail.com Sat Oct 12 01:22:46 2013 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Sat, 12 Oct 2013 07:22:46 +0200 Subject: [Numpy-discussion] ANN: Scipy 0.13.0 release candidate 1 Message-ID: Hi all, Hi all, I'm happy to announce the availability of the first release candidate of Scipy 0.13.0. Please try this RC and report any issues on the scipy-dev mailing list. Source tarballs, binaries and release notes can be found at http://sourceforge.net/projects/scipy/files/scipy/0.13.0rc1/. Thanks to everyone who helped test and fix the beta release. This is shaping up to be a very solid release. Cheers, Ralf ========================== SciPy 0.13.0 Release Notes ========================== .. note:: Scipy 0.13.0 is not released yet! .. contents:: SciPy 0.13.0 is the culmination of 7 months of hard work. It contains many new features, numerous bug-fixes, improved test coverage and better documentation. There have been a number of deprecations and API changes in this release, which are documented below. All users are encouraged to upgrade to this release, as there are a large number of bug-fixes and optimizations. Moreover, our development attention will now shift to bug-fix releases on the 0.13.x branch, and on adding new features on the master branch. This release requires Python 2.6, 2.7 or 3.1-3.3 and NumPy 1.5.1 or greater. 
New features ============ ``scipy.integrate`` improvements -------------------------------- N-dimensional numerical integration ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ A new function `scipy.integrate.nquad`, which provides N-dimensional integration functionality with a more flexible interface than ``dblquad`` and ``tplquad``, has been added. ``dopri*`` improvements ^^^^^^^^^^^^^^^^^^^^^^^ The intermediate results from the ``dopri`` family of ODE solvers can now be accessed by a *solout* callback function. ``scipy.linalg`` improvements ----------------------------- Interpolative decompositions ^^^^^^^^^^^^^^^^^^^^^^^^^^^^ Scipy now includes a new module `scipy.linalg.interpolative` containing routines for computing interpolative matrix decompositions (ID). This feature is based on the ID software package by P.G. Martinsson, V. Rokhlin, Y. Shkolnisky, and M. Tygert, previously adapted for Python in the PymatrixId package by K.L. Ho. Polar decomposition ^^^^^^^^^^^^^^^^^^^ A new function `scipy.linalg.polar`, to compute the polar decomposition of a matrix, was added. BLAS level 3 functions ^^^^^^^^^^^^^^^^^^^^^^ The BLAS functions ``symm``, ``syrk``, ``syr2k``, ``hemm``, ``herk`` and ``her2k`` are now wrapped in `scipy.linalg`. Matrix functions ^^^^^^^^^^^^^^^^ Several matrix function algorithms have been implemented or updated following detailed descriptions in recent papers of Nick Higham and his co-authors. These include the matrix square root (``sqrtm``), the matrix logarithm (``logm``), the matrix exponential (``expm``) and its Frechet derivative (``expm_frechet``), and fractional matrix powers (``fractional_matrix_power``). ``scipy.optimize`` improvements ------------------------------- Trust-region unconstrained minimization algorithms ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ The ``minimize`` function gained two trust-region solvers for unconstrained minimization: ``dogleg`` and ``trust-ncg``. ``scipy.sparse`` improvements ----------------------------- Boolean comparisons and sparse matrices ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ All sparse matrix types now support boolean data, and boolean operations. Two sparse matrices `A` and `B` can be compared in all the expected ways `A < B`, `A >= B`, `A != B`, producing similar results as dense Numpy arrays. Comparisons with dense matrices and scalars are also supported. CSR and CSC fancy indexing ^^^^^^^^^^^^^^^^^^^^^^^^^^ Compressed sparse row and column sparse matrix types now support fancy indexing with boolean matrices, slices, and lists. So where A is a (CSC or CSR) sparse matrix, you can do things like:: >>> A[A > 0.5] = 1 # since Boolean sparse matrices work >>> A[:2, :3] = 2 >>> A[[1,2], 2] = 3 ``scipy.sparse.linalg`` improvements ------------------------------------ The new function ``onenormest`` provides a lower bound of the 1-norm of a linear operator and has been implemented according to Higham and Tisseur (2000). This function is not only useful for sparse matrices, but can also be used to estimate the norm of products or powers of dense matrices without explictly building the intermediate matrix. The multiplicative action of the matrix exponential of a linear operator (``expm_multiply``) has been implemented following the description in Al-Mohy and Higham (2011). Abstract linear operators (`scipy.sparse.linalg.LinearOperator`) can now be multiplied, added to each other, and exponentiated, producing new linear operators. This enables easier construction of composite linear operations. 
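For example (a sketch, not part of the official notes; it assumes dense operands wrapped with ``aslinearoperator`` and uses only the addition, multiplication and power operations described above)::

    >>> import numpy as np
    >>> from scipy.sparse.linalg import aslinearoperator
    >>> A = aslinearoperator(np.random.rand(100, 100))
    >>> B = aslinearoperator(np.random.rand(100, 100))
    >>> C = A * B + A ** 2          # composite operator, evaluated lazily
    >>> y = C.matvec(np.ones(100))  # apply B then A, plus A applied twice; no product matrix is formed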
``scipy.spatial`` improvements ------------------------------ The vertices of a `ConvexHull` can now be accessed via the `vertices` attribute, which gives proper orientation in 2-D. ``scipy.signal`` improvements ----------------------------- The cosine window function `scipy.signal.cosine` was added. ``scipy.special`` improvements ------------------------------ New functions `scipy.special.xlogy` and `scipy.special.xlog1py` were added. These functions can simplify and speed up code that has to calculate ``x * log(y)`` and give 0 when ``x == 0``. ``scipy.io`` improvements ------------------------- Unformatted Fortran file reader ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ The new class `scipy.io.FortranFile` facilitates reading unformatted sequential files written by Fortran code. ``scipy.io.wavfile`` enhancements ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ `scipy.io.wavfile.write` now accepts a file buffer. Previously it only accepted a filename. `scipy.io.wavfile.read` and `scipy.io.wavfile.write` can now handle floating point WAV files. ``scipy.interpolate`` improvements ---------------------------------- B-spline derivatives and antiderivatives ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ `scipy.interpolate.splder` and `scipy.interpolate.splantider` functions for computing B-splines that represent derivatives and antiderivatives of B-splines were added. These functions are also available in the class-based FITPACK interface as ``UnivariateSpline.derivative`` and ``UnivariateSpline.antiderivative``. ``scipy.stats`` improvements ---------------------------- Distributions now allow using keyword parameters in addition to positional parameters in all methods. The function `scipy.stats.power_divergence` has been added for the Cressie-Read power divergence statistic and goodness of fit test. Included in this family of statistics is the "G-test" (http://en.wikipedia.org/wiki/G-test). `scipy.stats.mood` now accepts multidimensional input. An option was added to `scipy.stats.wilcoxon` for continuity correction. `scipy.stats.chisquare` now has an `axis` argument. `scipy.stats.mstats.chisquare` now has `axis` and `ddof` arguments. Deprecated features =================== ``expm2`` and ``expm3`` ----------------------- The matrix exponential functions `scipy.linalg.expm2` and `scipy.linalg.expm3` are deprecated. All users should use the numerically more robust `scipy.linalg.expm` function instead. ``scipy.stats`` functions ------------------------- `scipy.stats.oneway` is deprecated; `scipy.stats.f_oneway` should be used instead. `scipy.stats.glm` is deprecated. `scipy.stats.ttest_ind` is an equivalent function; more full-featured general (and generalized) linear model implementations can be found in statsmodels. `scipy.stats.cmedian` is deprecated; ``numpy.median`` should be used instead. Backwards incompatible changes ============================== LIL matrix assignment --------------------- Assigning values to LIL matrices with two index arrays now works similarly as assigning into ndarrays:: >>> x = lil_matrix((3, 3)) >>> x[[0,1,2],[0,1,2]]=[0,1,2] >>> x.todense() matrix([[ 0., 0., 0.], [ 0., 1., 0.], [ 0., 0., 2.]]) rather than giving the result:: >>> x.todense() matrix([[ 0., 1., 2.], [ 0., 1., 2.], [ 0., 1., 2.]]) Users relying on the previous behavior will need to revisit their code. The previous behavior is obtained by ``x[numpy.ix_([0,1,2],[0,1,2])] = ...`. Deprecated ``radon`` function removed ------------------------------------- The ``misc.radon`` function, which was deprecated in scipy 0.11.0, has been removed. 
Users can find a more full-featured ``radon`` function in scikit-image. Removed deprecated keywords ``xa`` and ``xb`` from ``stats.distributions`` -------------------------------------------------------------------------- The keywords ``xa`` and ``xb``, which were deprecated since 0.11.0, have been removed from the distributions in ``scipy.stats``. Changes to MATLAB file readers / writers ---------------------------------------- The major change is that 1D arrays in numpy now become row vectors (shape 1, N) when saved to a MATLAB 5 format file. Previously 1D arrays saved as column vectors (N, 1). This is to harmonize the behavior of writing MATLAB 4 and 5 formats, and adapt to the defaults of numpy and MATLAB - for example ``np.atleast_2d`` returns 1D arrays as row vectors. Trying to save arrays of greater than 2 dimensions in MATLAB 4 format now raises an error instead of silently reshaping the array as 2D. ``scipy.io.loadmat('afile')`` used to look for `afile` on the Python system path (``sys.path``); now ``loadmat`` only looks in the current directory for a relative path filename. Other changes ============= Security fix: ``scipy.weave`` previously used temporary directories in an insecure manner under certain circumstances. Cython is now required to build *unreleased* versions of scipy. The C files generated from Cython sources are not included in the git repo anymore. They are however still shipped in source releases. The code base received a fairly large PEP8 cleanup. A ``tox pep8`` command has been added; new code should pass this test command. Authors ======= This release contains work by the following people (contributed at least one patch to this release, names in alphabetical order): * Jorge Ca?ardo Alastuey + * Tom Aldcroft + * Max Bolingbroke + * Joseph Jon Booker + * Fran?ois Boulogne * Matthew Brett * Christian Brodbeck + * Per Brodtkorb + * Christian Brueffer + * Lars Buitinck * Evgeni Burovski + * Tim Cera * Lawrence Chan + * David Cournapeau * Draz?en Luc?anin + * Alexander J. Dunlap + * endolith * Andr? Gaul + * Christoph Gohlke * Ralf Gommers * Alex Griffing + * Blake Griffith + * Charles Harris * Bob Helmbold + * Andreas Hilboll * Kat Huang + * Oleksandr (Sasha) Huziy + * Gert-Ludwig Ingold + * Thouis (Ray) Jones * Juan Luis Cano Rodr?guez + * Robert Kern * Andreas Kloeckner + * Sytse Knypstra + * Gustav Larsson + * Denis Laxalde * Christopher Lee * Tim Leslie * Wendy Liu + * Clemens Novak + * Takuya Oshima + * Josef Perktold * Illia Polosukhin + * Przemek Porebski + * Steve Richardson + * Branden Rolston + * Skipper Seabold * Fazlul Shahriar * Leo Singer + * Rohit Sivaprasad + * Daniel B. Smith + * Julian Taylor * Louis Thibault + * Tomas Tomecek + * John Travers * Richard Tsai + * Jacob Vanderplas * Patrick Varilly * Pauli Virtanen * Stefan van der Walt * Warren Weckesser * Pedro Werneck + * Nils Werner + * Michael Wimmer + * Nathan Woods + * Tony S. Yu + A total of 65 people contributed to this release. People with a "+" by their names contributed a patch for the first time. -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From Bernhard.Spinnler at gmx.net Mon Oct 14 14:25:39 2013 From: Bernhard.Spinnler at gmx.net (Bernhard Spinnler) Date: Mon, 14 Oct 2013 20:25:39 +0200 Subject: [Numpy-discussion] Bug in numpy.correlate documentation In-Reply-To: <5257360A.6010608@googlemail.com> References: <09928B8C-1962-42A4-8566-297A2B07C1B1@gmx.net> <5257360A.6010608@googlemail.com> Message-ID: <7BD1CBF3-19A7-4152-9B84-29D42C00C2CC@gmx.net> On 11.10.2013, at 01:19, Julian Taylor wrote: >>> >>> Yeah, unless the current behaviour is actually broken or redundant in >>> some way, we're not going to switch from one perfectly good convention >>> to another perfectly good convention and break everyone's code in the >>> process. >>> >>> The most helpful thing would be if you could file a pull request that >>> just changes the docstring to what you think it should be. Extra bonus >>> points if it points out that there is another definition some people >>> might be expecting instead, and explains how those people can use the >>> existing functions to get what they want. :-) >>> >>> -n >>> >>> >>> IMHO, "point[ing] out that there is another definition some people >>> might be expecting instead, and explain[ing] how those people can use >>> the existing functions to get what they want" should be a requirement >>> for the docstring ("Notes" section), not merely worth "extra bonus >>> points." But then I'm not, presently, in a position to edit the >>> docstring myself, so that's just MHO. >>> >>> IAE, I found what appears to me to be another "vote" for the extant >>> docstring: Box & Jenkins, 1976, "Time Series Analysis: Forecasting and >>> Control," Holden-Day, Oakland, pg. 374. Perhaps a "switch" (with a >>> default value that maintains current definition, so that extant uses >>> would not require a code change) c/should be added to the function >>> signature so that users can get easily get what they want? >>> >> >> As pointed out in another post in this thread, there are now at least >> three different definitions of correlation which are in use in different >> disciplines of science and engineering: >> >> Numpy code: >> >> z_numpyCode[k] = sum_n a[n+k] * conj(v[n]) >> >> >> Numpy docs: >> >> z_numpyDoc[k] = sum_n a[n] * conj(v[n+k]) >> = sum_n a[n-k] * conj(v[n]) >> = z_numpyCode[-k] >> >> >> Wolfram Mathworld: >> >> z_mmca[k] = sum_n conj(a[n]) * v[n+k] >> = conj( sum_n a[n] * conj(v[n+k]) ) >> = conj( z_numpyDoc[k] ) >> = conj( z_numpyCode[-k] ) >> >> I'm sure there are even more if you search long enough. But shouldn't >> the primary objective be to bring the docs in line with the code (which >> is definitely not "broken")? It took me 2 days of debugging my >> code recently only to discover that numpy correlate() was calculating a >> different correlation than the docs said. >> >> I can try to come up with a proposal for the docs. Could anyone point me >> to where I can find the docs? I can clone the numpy repo, however, I'm >> not a numpy developer. >> > > yes we should only change the documentation to match the (hopefully > correct) code. > the documentation is in the docstring of the correlate function in > numpy/core/numeric.py line 819 > _______________________________________________ Ok, corrected the docstring, mentioning one alternative definition of correlation. Pull request filed: https://github.com/numpy/numpy/pull/3913. 
Bernhard From charlesr.harris at gmail.com Mon Oct 14 17:37:22 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Mon, 14 Oct 2013 15:37:22 -0600 Subject: [Numpy-discussion] NumPy 1.8.0rc2 release Message-ID: Hi All, NumPy 1.8.0rc2 is up now on sourceforge. Binary builds are included, except for Python 3.3 on windows. Many thanks to Ralf for the binaries and to those who found and fixed the bugs in rc1. Please test this thoroughly, especially if you have access to one of the less common platforms. Testing of rc1 turned up several bugs that would have been a embarrassment if they had made their way into the release and we are very happy that they were discovered. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From lists at onerussian.com Tue Oct 15 12:13:24 2013 From: lists at onerussian.com (Yaroslav Halchenko) Date: Tue, 15 Oct 2013 12:13:24 -0400 Subject: [Numpy-discussion] RFC: is it worth giving a lightning talk at PyCon 2014 on numpy vbench-marking? Message-ID: <20131015161324.GL27621@onerussian.com> Hi Guys, PyCon 2014 will be just around the corner from where I am, so I decided to attend. Being lazy (or busy) I haven't submitted any big talk but thinking to submit few lightning talks (just 5 min and 400 characters abstract limit), and I think it might be worth letting people know about my little project. I would really appreciate your sincere feedback (e.g. "not worth it" would be valuable too). Here is the title/abstract numpy-vbench -- speed benchmarks for NumPy http://yarikoptic.github.io/numpy-vbench provides collection of speed performance benchmarks for NumPy. Benchmarking of multiple maintenance and current development branches allows not only to timely react to new performance regressions, but also to compare NumPy performance across releases. Your contributions would help to guarantee that your code does not become slower with a new NumPy release. btw -- fresh results are here http://yarikoptic.github.io/numpy-vbench/ . I have tuned benchmarking so it now reflects the best performance across multiple executions of the whole battery, thus eliminating spurious variance if estimate is provided from a single point in time. Eventually I expect many of those curves to become even "cleaner". -- Yaroslav O. Halchenko, Ph.D. http://neuro.debian.net http://www.pymvpa.org http://www.fail2ban.org Senior Research Associate, Psychological and Brain Sciences Dept. Dartmouth College, 419 Moore Hall, Hinman Box 6207, Hanover, NH 03755 Phone: +1 (603) 646-9834 Fax: +1 (603) 646-1419 WWW: http://www.linkedin.com/in/yarik From aron at ahmadia.net Tue Oct 15 12:18:10 2013 From: aron at ahmadia.net (Aron Ahmadia) Date: Tue, 15 Oct 2013 12:18:10 -0400 Subject: [Numpy-discussion] RFC: is it worth giving a lightning talk at PyCon 2014 on numpy vbench-marking? In-Reply-To: <20131015161324.GL27621@onerussian.com> References: <20131015161324.GL27621@onerussian.com> Message-ID: FWIW, I think it's a really cool project. I'm not the target demographic at PyCon, but I would be very surprised if this isn't a well-received talk. On Tue, Oct 15, 2013 at 12:13 PM, Yaroslav Halchenko wrote: > Hi Guys, > > PyCon 2014 will be just around the corner from where I am, so I decided > to attend. Being lazy (or busy) I haven't submitted any big talk but > thinking > to submit few lightning talks (just 5 min and 400 characters abstract > limit), > and I think it might be worth letting people know about my little project. 
> I > would really appreciate your sincere feedback (e.g. "not worth it" would be > valuable too). Here is the title/abstract > > numpy-vbench -- speed benchmarks for NumPy > > http://yarikoptic.github.io/numpy-vbench provides collection of speed > performance benchmarks for NumPy. Benchmarking of multiple > maintenance and current development branches allows not only to timely > react to new performance regressions, but also to compare NumPy > performance across releases. Your contributions would help to > guarantee that your code does not become slower with a new NumPy > release. > > btw -- fresh results are here http://yarikoptic.github.io/numpy-vbench/ . > > I have tuned benchmarking so it now reflects the best performance across > multiple executions of the whole battery, thus eliminating spurious > variance if estimate is provided from a single point in time. Eventually I > expect many of those curves to become even "cleaner". > > -- > Yaroslav O. Halchenko, Ph.D. > http://neuro.debian.net http://www.pymvpa.org http://www.fail2ban.org > Senior Research Associate, Psychological and Brain Sciences Dept. > Dartmouth College, 419 Moore Hall, Hinman Box 6207, Hanover, NH 03755 > Phone: +1 (603) 646-9834 Fax: +1 (603) 646-1419 > WWW: http://www.linkedin.com/in/yarik > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Tue Oct 15 13:00:39 2013 From: njs at pobox.com (Nathaniel Smith) Date: Tue, 15 Oct 2013 18:00:39 +0100 Subject: [Numpy-discussion] RFC: is it worth giving a lightning talk at PyCon 2014 on numpy vbench-marking? In-Reply-To: <20131015161324.GL27621@onerussian.com> References: <20131015161324.GL27621@onerussian.com> Message-ID: On Tue, Oct 15, 2013 at 5:13 PM, Yaroslav Halchenko wrote: > Hi Guys, > > PyCon 2014 will be just around the corner from where I am, so I decided > to attend. Being lazy (or busy) I haven't submitted any big talk but thinking > to submit few lightning talks (just 5 min and 400 characters abstract limit), > and I think it might be worth letting people know about my little project. I > would really appreciate your sincere feedback (e.g. "not worth it" would be > valuable too). Here is the title/abstract > > numpy-vbench -- speed benchmarks for NumPy > > http://yarikoptic.github.io/numpy-vbench provides collection of speed > performance benchmarks for NumPy. Benchmarking of multiple > maintenance and current development branches allows not only to timely > react to new performance regressions, but also to compare NumPy > performance across releases. Your contributions would help to > guarantee that your code does not become slower with a new NumPy > release. What do you have to lose? > btw -- fresh results are here http://yarikoptic.github.io/numpy-vbench/ . > > I have tuned benchmarking so it now reflects the best performance across > multiple executions of the whole battery, thus eliminating spurious > variance if estimate is provided from a single point in time. Eventually I > expect many of those curves to become even "cleaner". On another note, what do you think of moving the vbench benchmarks into the main numpy tree? 
We already require everyone who submits a bug fix to add a test; there are a bunch of speed enhancements coming in these days and it would be nice if we had some way to ask people to submit a benchmark along with each one so that we know that the enhancement stays enhanced... -n From lists at onerussian.com Tue Oct 15 13:20:56 2013 From: lists at onerussian.com (Yaroslav Halchenko) Date: Tue, 15 Oct 2013 13:20:56 -0400 Subject: [Numpy-discussion] RFC: is it worth giving a lightning talk at PyCon 2014 on numpy vbench-marking? In-Reply-To: References: <20131015161324.GL27621@onerussian.com> Message-ID: <20131015172056.GM27621@onerussian.com> On Tue, 15 Oct 2013, Nathaniel Smith wrote: > > and I think it might be worth letting people know about my little project. I > > would really appreciate your sincere feedback (e.g. "not worth it" would be > > valuable too). Here is the title/abstract > > numpy-vbench -- speed benchmarks for NumPy > > http://yarikoptic.github.io/numpy-vbench provides collection of speed > > performance benchmarks for NumPy. Benchmarking of multiple > > maintenance and current development branches allows not only to timely > > react to new performance regressions, but also to compare NumPy > > performance across releases. Your contributions would help to > > guarantee that your code does not become slower with a new NumPy > > release. > What do you have to lose? time? ;) > > btw -- fresh results are here http://yarikoptic.github.io/numpy-vbench/ . > > I have tuned benchmarking so it now reflects the best performance across > > multiple executions of the whole battery, thus eliminating spurious > > variance if estimate is provided from a single point in time. Eventually I > > expect many of those curves to become even "cleaner". > On another note, what do you think of moving the vbench benchmarks > into the main numpy tree? We already require everyone who submits a > bug fix to add a test; there are a bunch of speed enhancements coming > in these days and it would be nice if we had some way to ask people to > submit a benchmark along with each one so that we know that the > enhancement stays enhanced... I would be thrilled to do so, but #1 I would need time I currently do not have to harmonize my setup for such inclusion #2 I am still not sure if my tune ups on top of vbench should get merged as is https://github.com/pydata/vbench/pull/33 #3 vbench itself might go through a considerable RF if someone finds some spare time: https://github.com/pydata/vbench/issues/34 If merge for the PR #2 would happen, and test_perf.py from pandas would get integrated within vbench -- then it might be logical to think about #1 regardless of the ultimate resolution to #3 ;) Whenever I have some time for it I would try to push in that direction, and I do not think it is unfeasible to see all of that done actually by PyCon 2014 (so the talk will have a different url ;) ), I just can't promise ATM. -- Yaroslav O. Halchenko, Ph.D. http://neuro.debian.net http://www.pymvpa.org http://www.fail2ban.org Senior Research Associate, Psychological and Brain Sciences Dept. Dartmouth College, 419 Moore Hall, Hinman Box 6207, Hanover, NH 03755 Phone: +1 (603) 646-9834 Fax: +1 (603) 646-1419 WWW: http://www.linkedin.com/in/yarik From njs at pobox.com Tue Oct 15 13:25:17 2013 From: njs at pobox.com (Nathaniel Smith) Date: Tue, 15 Oct 2013 18:25:17 +0100 Subject: [Numpy-discussion] RFC: is it worth giving a lightning talk at PyCon 2014 on numpy vbench-marking? 
In-Reply-To: <20131015172056.GM27621@onerussian.com> References: <20131015161324.GL27621@onerussian.com> <20131015172056.GM27621@onerussian.com> Message-ID: On Tue, Oct 15, 2013 at 6:20 PM, Yaroslav Halchenko wrote: > > On Tue, 15 Oct 2013, Nathaniel Smith wrote: >> On another note, what do you think of moving the vbench benchmarks >> into the main numpy tree? We already require everyone who submits a >> bug fix to add a test; there are a bunch of speed enhancements coming >> in these days and it would be nice if we had some way to ask people to >> submit a benchmark along with each one so that we know that the >> enhancement stays enhanced... > > I would be thrilled to do so, but > > #1 I would need time I currently do not have to harmonize my setup for > such inclusion > > #2 I am still not sure if my tune ups on top of vbench should get merged > as is https://github.com/pydata/vbench/pull/33 > > #3 vbench itself might go through a considerable RF if someone finds some > spare time: https://github.com/pydata/vbench/issues/34 > > If merge for the PR #2 would happen, and test_perf.py from pandas would > get integrated within vbench -- then it might be logical to think about > #1 regardless of the ultimate resolution to #3 ;) Whenever I have > some time for it I would try to push in that direction, and I do not > think it is unfeasible to see all of that done actually by PyCon 2014 > (so the talk will have a different url ;) ), I just can't promise ATM. Okay, well, keep it in mind :-) -n From lists at onerussian.com Tue Oct 15 13:36:25 2013 From: lists at onerussian.com (Yaroslav Halchenko) Date: Tue, 15 Oct 2013 13:36:25 -0400 Subject: [Numpy-discussion] RFC: is it worth giving a lightning talk at PyCon 2014 on numpy vbench-marking? In-Reply-To: <20131015161324.GL27621@onerussian.com> References: <20131015161324.GL27621@onerussian.com> Message-ID: <20131015173625.GN27621@onerussian.com> ok -- since no negative feedback received -- submitted as is. I will let you know when it gets rejected or accepted. cheers, On Tue, 15 Oct 2013, Yaroslav Halchenko wrote: > Hi Guys, > PyCon 2014 will be just around the corner from where I am, so I decided > to attend. Being lazy (or busy) I haven't submitted any big talk but thinking > to submit few lightning talks (just 5 min and 400 characters abstract limit), > and I think it might be worth letting people know about my little project. I > would really appreciate your sincere feedback (e.g. "not worth it" would be > valuable too). Here is the title/abstract > numpy-vbench -- speed benchmarks for NumPy > http://yarikoptic.github.io/numpy-vbench provides collection of speed > performance benchmarks for NumPy. Benchmarking of multiple > maintenance and current development branches allows not only to timely > react to new performance regressions, but also to compare NumPy > performance across releases. Your contributions would help to > guarantee that your code does not become slower with a new NumPy > release. > btw -- fresh results are here http://yarikoptic.github.io/numpy-vbench/ . > I have tuned benchmarking so it now reflects the best performance across > multiple executions of the whole battery, thus eliminating spurious > variance if estimate is provided from a single point in time. Eventually I > expect many of those curves to become even "cleaner". -- Yaroslav O. Halchenko, Ph.D. http://neuro.debian.net http://www.pymvpa.org http://www.fail2ban.org Senior Research Associate, Psychological and Brain Sciences Dept. 
Dartmouth College, 419 Moore Hall, Hinman Box 6207, Hanover, NH 03755 Phone: +1 (603) 646-9834 Fax: +1 (603) 646-1419 WWW: http://www.linkedin.com/in/yarik From cournape at gmail.com Tue Oct 15 14:46:43 2013 From: cournape at gmail.com (David Cournapeau) Date: Tue, 15 Oct 2013 19:46:43 +0100 Subject: [Numpy-discussion] NumPy 1.8.0rc2 release In-Reply-To: References: Message-ID: It looks better than rc1, thanks for the great work. I have only tested on rh5 for now, but building the following against numpy 1.7.1 and running against 1.8.0 rc2 only give a few failures for the full list of packages supported by Enthought. Bottleneck / larry are caused by numpy, the sklearn may be a bug in numpy or scikit learn or scipy (eigh issue). List of packages: GDAL-1.10.0 MDP-3.3 Pycluster-1.50 ScientificPython-2.9.0 SimPy-2.2 astropy-0.2.4 basemap-1.0.6 biopython-1.59 chaco-4.3.0 enable-4.3.0 fastnumpy-1.0 fwrap-0.1.1 h5py-2.2.0 llvmmath-0.1.1 matplotlib-1.3.0 mayavi-4.3.0 netCDF4-1.0.5 networkx-1.8.1 nltk-2.0.1 numba-0.10.2 opencv-2.4.5 pandas-0.12.0 pyfits-3.0.6 pygarrayimage-0.0.7 pygrib-1.9.2 pyhdf-0.8.3 pysparse-1.2.dev213 pytables-2.4.0 scikits.image-0.8.2 scikits.rsformats-0.1 scikits.timeseries-0.91.3 scimath-4.1.2 scipy-0.12.0 traits-4.3.0 As for the bottleneck/larry failures (for reference): ====================================================================== FAIL: Test nanargmin. ---------------------------------------------------------------------- Traceback (most recent call last): File "/home/vagrant/src/master-env/lib/python2.7/site-packages/nose/case.py", line 197, in runTest self.test(*self.arg) File "/home/vagrant/src/master-env/lib/python2.7/site-packages/bottleneck/tests/func_test.py", line 78, in unit_maker assert_array_equal(actual, desired, err_msg) File "/home/vagrant/src/master-env/lib/python2.7/site-packages/numpy/testing/utils.py", line 718, in assert_array_equal verbose=verbose, header='Arrays are not equal') File "/home/vagrant/src/master-env/lib/python2.7/site-packages/numpy/testing/utils.py", line 644, in assert_array_compare raise AssertionError(msg) AssertionError: Arrays are not equal func nanargmin | input a44 (float32) | shape (4,) | axis -1 Input array: [ nan nan nan nan] (mismatch 100.0%) x: array(nan) y: array('Crashed', dtype='|S7') ====================================================================== FAIL: Test nanargmax. 
---------------------------------------------------------------------- Traceback (most recent call last): File "/home/vagrant/src/master-env/lib/python2.7/site-packages/nose/case.py", line 197, in runTest self.test(*self.arg) File "/home/vagrant/src/master-env/lib/python2.7/site-packages/bottleneck/tests/func_test.py", line 78, in unit_maker assert_array_equal(actual, desired, err_msg) File "/home/vagrant/src/master-env/lib/python2.7/site-packages/numpy/testing/utils.py", line 718, in assert_array_equal verbose=verbose, header='Arrays are not equal') File "/home/vagrant/src/master-env/lib/python2.7/site-packages/numpy/testing/utils.py", line 644, in assert_array_compare raise AssertionError(msg) AssertionError: Arrays are not equal func nanargmax | input a44 (float32) | shape (4,) | axis -1 Input array: [ nan nan nan nan] (mismatch 100.0%) x: array(nan) y: array('Crashed', dtype='|S7') ---------------------------------------------------------------------- Ran 124 tests in 85.714s FAILED (failures=2) FAIL and larry: ====================================================================== ERROR: Failure: IndexError (too many indices) ---------------------------------------------------------------------- Traceback (most recent call last): File "/home/vagrant/src/master-env/lib/python2.7/site-packages/nose/loader.py", line 253, in generate for test in g(): File "/home/vagrant/src/master-env/lib/python2.7/site-packages/la/tests/all_nan_test.py", line 31, in test_all_nan actual = getattr(lar(), method)(*parameters) File "/home/vagrant/src/master-env/lib/python2.7/site-packages/la/deflarry.py", line 3066, in quantile x = quantile(self.x, q, axis=axis) File "/home/vagrant/src/master-env/lib/python2.7/site-packages/la/farray/normalize.py", line 289, in quantile y = np.apply_along_axis(_quantileraw1d, axis, x, q) File "/home/vagrant/src/master-env/lib/python2.7/site-packages/numpy/lib/shape_base.py", line 79, in apply_along_axis res = func1d(arr[tuple(i.tolist())],*args) File "/home/vagrant/src/master-env/lib/python2.7/site-packages/la/farray/normalize.py", line 228, in _quantileraw1d xi = xi[idx,:] IndexError: too many indices ====================================================================== ERROR: larry.quantile_1 ---------------------------------------------------------------------- Traceback (most recent call last): File "/home/vagrant/src/master-env/lib/python2.7/site-packages/la/tests/deflarry_test.py", line 3401, in test_quantile_1 actual = self.l1.quantile(2) File "/home/vagrant/src/master-env/lib/python2.7/site-packages/la/deflarry.py", line 3066, in quantile x = quantile(self.x, q, axis=axis) File "/home/vagrant/src/master-env/lib/python2.7/site-packages/la/farray/normalize.py", line 289, in quantile y = np.apply_along_axis(_quantileraw1d, axis, x, q) File "/home/vagrant/src/master-env/lib/python2.7/site-packages/numpy/lib/shape_base.py", line 79, in apply_along_axis res = func1d(arr[tuple(i.tolist())],*args) File "/home/vagrant/src/master-env/lib/python2.7/site-packages/la/farray/normalize.py", line 228, in _quantileraw1d xi = xi[idx,:] IndexError: too many indices (more similar) On Mon, Oct 14, 2013 at 10:37 PM, Charles R Harris < charlesr.harris at gmail.com> wrote: > Hi All, > > NumPy 1.8.0rc2 is up now on sourceforge. > Binary builds are included, except for Python 3.3 on windows. Many thanks > to Ralf for the binaries and to those who found and fixed the bugs in rc1. > Please test this thoroughly, especially if you have access to one of the > less common platforms. 
Testing of rc1 turned up several bugs that would > have been a embarrassment if they had made their way into the release and > we are very happy that they were discovered. > > Chuck > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Tue Oct 15 15:06:12 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Tue, 15 Oct 2013 13:06:12 -0600 Subject: [Numpy-discussion] NumPy 1.8.0rc2 release In-Reply-To: References: Message-ID: Hi David, On Tue, Oct 15, 2013 at 12:46 PM, David Cournapeau wrote: > It looks better than rc1, thanks for the great work. I have only tested on > rh5 for now, but building the following against numpy 1.7.1 and running > against 1.8.0 rc2 only give a few failures for the full list of packages > supported by Enthought. Bottleneck / larry are caused by numpy, the sklearn > may be a bug in numpy or scikit learn or scipy (eigh issue). > > List of packages: > > GDAL-1.10.0 > MDP-3.3 > Pycluster-1.50 > ScientificPython-2.9.0 > SimPy-2.2 > astropy-0.2.4 > basemap-1.0.6 > biopython-1.59 > chaco-4.3.0 > enable-4.3.0 > fastnumpy-1.0 > fwrap-0.1.1 > h5py-2.2.0 > llvmmath-0.1.1 > matplotlib-1.3.0 > mayavi-4.3.0 > netCDF4-1.0.5 > networkx-1.8.1 > nltk-2.0.1 > numba-0.10.2 > opencv-2.4.5 > pandas-0.12.0 > pyfits-3.0.6 > pygarrayimage-0.0.7 > pygrib-1.9.2 > pyhdf-0.8.3 > pysparse-1.2.dev213 > pytables-2.4.0 > scikits.image-0.8.2 > scikits.rsformats-0.1 > scikits.timeseries-0.91.3 > scimath-4.1.2 > scipy-0.12.0 > traits-4.3.0 > > As for the bottleneck/larry failures (for reference): > > ====================================================================== > FAIL: Test nanargmin. > ---------------------------------------------------------------------- > Traceback (most recent call last): > File > "/home/vagrant/src/master-env/lib/python2.7/site-packages/nose/case.py", > line 197, in runTest > self.test(*self.arg) > File > "/home/vagrant/src/master-env/lib/python2.7/site-packages/bottleneck/tests/func_test.py", > line 78, in unit_maker > assert_array_equal(actual, desired, err_msg) > File > "/home/vagrant/src/master-env/lib/python2.7/site-packages/numpy/testing/utils.py", > line 718, in assert_array_equal > verbose=verbose, header='Arrays are not equal') > File > "/home/vagrant/src/master-env/lib/python2.7/site-packages/numpy/testing/utils.py", > line 644, in assert_array_compare > raise AssertionError(msg) > AssertionError: > Arrays are not equal > > func nanargmin | input a44 (float32) | shape (4,) | axis -1 > > Input array: > [ nan nan nan nan] > > (mismatch 100.0%) > x: array(nan) > y: array('Crashed', > dtype='|S7') > > ====================================================================== > FAIL: Test nanargmax. 
> ---------------------------------------------------------------------- > Traceback (most recent call last): > File > "/home/vagrant/src/master-env/lib/python2.7/site-packages/nose/case.py", > line 197, in runTest > self.test(*self.arg) > File > "/home/vagrant/src/master-env/lib/python2.7/site-packages/bottleneck/tests/func_test.py", > line 78, in unit_maker > assert_array_equal(actual, desired, err_msg) > File > "/home/vagrant/src/master-env/lib/python2.7/site-packages/numpy/testing/utils.py", > line 718, in assert_array_equal > verbose=verbose, header='Arrays are not equal') > File > "/home/vagrant/src/master-env/lib/python2.7/site-packages/numpy/testing/utils.py", > line 644, in assert_array_compare > raise AssertionError(msg) > AssertionError: > Arrays are not equal > > func nanargmax | input a44 (float32) | shape (4,) | axis -1 > > Input array: > [ nan nan nan nan] > > (mismatch 100.0%) > x: array(nan) > y: array('Crashed', > dtype='|S7') > > ---------------------------------------------------------------------- > Ran 124 tests in 85.714s > > FAILED (failures=2) > FAIL > > Not going to fix these, nanarg{max, min} now raises an exception for this case. > and larry: > > ====================================================================== > ERROR: Failure: IndexError (too many indices) > ---------------------------------------------------------------------- > Traceback (most recent call last): > File > "/home/vagrant/src/master-env/lib/python2.7/site-packages/nose/loader.py", > line 253, in generate > for test in g(): > File > "/home/vagrant/src/master-env/lib/python2.7/site-packages/la/tests/all_nan_test.py", > line 31, in test_all_nan > actual = getattr(lar(), method)(*parameters) > File > "/home/vagrant/src/master-env/lib/python2.7/site-packages/la/deflarry.py", > line 3066, in quantile > x = quantile(self.x, q, axis=axis) > File > "/home/vagrant/src/master-env/lib/python2.7/site-packages/la/farray/normalize.py", > line 289, in quantile > y = np.apply_along_axis(_quantileraw1d, axis, x, q) > File > "/home/vagrant/src/master-env/lib/python2.7/site-packages/numpy/lib/shape_base.py", > line 79, in apply_along_axis > res = func1d(arr[tuple(i.tolist())],*args) > File > "/home/vagrant/src/master-env/lib/python2.7/site-packages/la/farray/normalize.py", > line 228, in _quantileraw1d > xi = xi[idx,:] > IndexError: too many indices > > ====================================================================== > ERROR: larry.quantile_1 > ---------------------------------------------------------------------- > Traceback (most recent call last): > File > "/home/vagrant/src/master-env/lib/python2.7/site-packages/la/tests/deflarry_test.py", > line 3401, in test_quantile_1 > actual = self.l1.quantile(2) > File > "/home/vagrant/src/master-env/lib/python2.7/site-packages/la/deflarry.py", > line 3066, in quantile > x = quantile(self.x, q, axis=axis) > File > "/home/vagrant/src/master-env/lib/python2.7/site-packages/la/farray/normalize.py", > line 289, in quantile > y = np.apply_along_axis(_quantileraw1d, axis, x, q) > File > "/home/vagrant/src/master-env/lib/python2.7/site-packages/numpy/lib/shape_base.py", > line 79, in apply_along_axis > res = func1d(arr[tuple(i.tolist())],*args) > File > "/home/vagrant/src/master-env/lib/python2.7/site-packages/la/farray/normalize.py", > line 228, in _quantileraw1d > xi = xi[idx,:] > IndexError: too many indices > > (more similar) > Iarry problem, the proper form here is xi[x,...] Chuck -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From cckidder at gmail.com Wed Oct 16 11:39:43 2013 From: cckidder at gmail.com (Chad Kidder) Date: Wed, 16 Oct 2013 09:39:43 -0600 Subject: [Numpy-discussion] Using array mask swaps array axes Message-ID: Just found what should be a bug in 1.7.1. I'm running python(x,y) on windows here: >>> dataMatrix[ii,:,mask].shape (201, 23) >>> dataMatrix[ii,:,:].shape (23, 201) >>> dataMatrix.shape (24, 23, 201) >>> mask array([ True, True, True, True, True, True, True, True, True, ... True, True, True], dtype=bool) using a mask should not change the order of the dimensions. Is there a reason for this behavior, and if so, how do I avoid it in the future? Thanks --Chad Kidder -------------- next part -------------- An HTML attachment was scrubbed... URL: From ben.root at ou.edu Wed Oct 16 11:50:03 2013 From: ben.root at ou.edu (Benjamin Root) Date: Wed, 16 Oct 2013 11:50:03 -0400 Subject: [Numpy-discussion] Using array mask swaps array axes In-Reply-To: References: Message-ID: On Wed, Oct 16, 2013 at 11:39 AM, Chad Kidder wrote: > Just found what should be a bug in 1.7.1. I'm running python(x,y) on > windows here: > > >>> dataMatrix[ii,:,mask].shape > (201, 23) > >>> dataMatrix[ii,:,:].shape > (23, 201) > >>> dataMatrix.shape > (24, 23, 201) > >>> mask > array([ True, True, True, True, True, True, True, True, True, > ... > True, True, True], dtype=bool) > > using a mask should not change the order of the dimensions. Is there a > reason for this behavior, and if so, how do I avoid it in the future? > Thanks > > --Chad Kidder > > Chad, The issue here is one where there is the mixing of fancy indexing (I presume that is what "ii" is), slicing and boolean indexing. If I remember correctly, the changing of the dimension orders was an inadvertent byproduct of handing all this array accessing methods in one shot. I think this was addressed in 1.8. Sorry for being very brief and vague, hopefully someone else who understands what the resolution was can fill in. Cheers! Ben Root -------------- next part -------------- An HTML attachment was scrubbed... URL: From sebastian at sipsolutions.net Wed Oct 16 12:28:23 2013 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Wed, 16 Oct 2013 18:28:23 +0200 Subject: [Numpy-discussion] Using array mask swaps array axes In-Reply-To: References: Message-ID: <1381940903.19985.15.camel@sebastian-laptop> On Wed, 2013-10-16 at 11:50 -0400, Benjamin Root wrote: > > > > On Wed, Oct 16, 2013 at 11:39 AM, Chad Kidder > wrote: > Just found what should be a bug in 1.7.1. I'm running > python(x,y) on windows here: > > >>> dataMatrix[ii,:,mask].shape > (201, 23) > >>> dataMatrix[ii,:,:].shape > (23, 201) > >>> dataMatrix.shape > (24, 23, 201) > >>> mask > array([ True, True, True, True, True, True, True, True, > True, > ... > True, True, True], dtype=bool) > > > using a mask should not change the order of the dimensions. > Is there a reason for this behavior, and if so, how do I avoid > it in the future? Thanks > > > --Chad Kidder > > > > > Chad, > > The issue here is one where there is the mixing of fancy indexing (I > presume that is what "ii" is), slicing and boolean indexing. If I > remember correctly, the changing of the dimension orders was an > inadvertent byproduct of handing all this array accessing methods in > one shot. I think this was addressed in 1.8. Sorry for being very > brief and vague, hopefully someone else who understands what the > resolution was can fill in. 
> Yes, in fact `ii` can just be a normal integer, since an integer *is* considered an advanced/fancy index (in the sense that it forces transposing, not in the sense that it forces a copy by itself, so integers are *both* advanced and view based indices!). This is how advanced/fancy indexing works, there is `np.ix_` which helps in some cases, but not exactly for your problem. For a more detailed description of fancy indexing check: http://docs.scipy.org/doc/numpy/reference/arrays.indexing.html#advanced-indexing You have a slice between the mask and the integer `arr[1, :, mask]`, which means the `mask` result dimension is transposed to the front. It would not be if it was `arr[:, 1, mask]` (since numpy can actually guess where it came from in that case). Since you seem to always have exactly one advanced (mask) index, in your example, the simplest solution is probably: `dataMatrix[ii,...][:,mask]` (first view based slicing, then the advanced boolean index. Since the first part (if `ii` is an integer) will not copy the data, this will also work for assignments). - Sebastian > Cheers! > Ben Root > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From charlesr.harris at gmail.com Wed Oct 16 12:55:23 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Wed, 16 Oct 2013 10:55:23 -0600 Subject: [Numpy-discussion] NumPy 1.8.0rc2 release In-Reply-To: References: Message-ID: On Tue, Oct 15, 2013 at 1:06 PM, Charles R Harris wrote: > Hi David, > > > On Tue, Oct 15, 2013 at 12:46 PM, David Cournapeau wrote: > >> It looks better than rc1, thanks for the great work. I have only tested >> on rh5 for now, but building the following against numpy 1.7.1 and running >> against 1.8.0 rc2 only give a few failures for the full list of packages >> supported by Enthought. Bottleneck / larry are caused by numpy, the sklearn >> may be a bug in numpy or scikit learn or scipy (eigh issue). >> >> List of packages: >> >> GDAL-1.10.0 >> MDP-3.3 >> Pycluster-1.50 >> ScientificPython-2.9.0 >> SimPy-2.2 >> astropy-0.2.4 >> basemap-1.0.6 >> biopython-1.59 >> chaco-4.3.0 >> enable-4.3.0 >> fastnumpy-1.0 >> fwrap-0.1.1 >> h5py-2.2.0 >> llvmmath-0.1.1 >> matplotlib-1.3.0 >> mayavi-4.3.0 >> netCDF4-1.0.5 >> networkx-1.8.1 >> nltk-2.0.1 >> numba-0.10.2 >> opencv-2.4.5 >> pandas-0.12.0 >> pyfits-3.0.6 >> pygarrayimage-0.0.7 >> pygrib-1.9.2 >> pyhdf-0.8.3 >> pysparse-1.2.dev213 >> pytables-2.4.0 >> scikits.image-0.8.2 >> scikits.rsformats-0.1 >> scikits.timeseries-0.91.3 >> scimath-4.1.2 >> scipy-0.12.0 >> traits-4.3.0 >> >> As for the bottleneck/larry failures (for reference): >> >> ====================================================================== >> FAIL: Test nanargmin. 
>> ---------------------------------------------------------------------- >> Traceback (most recent call last): >> File >> "/home/vagrant/src/master-env/lib/python2.7/site-packages/nose/case.py", >> line 197, in runTest >> self.test(*self.arg) >> File >> "/home/vagrant/src/master-env/lib/python2.7/site-packages/bottleneck/tests/func_test.py", >> line 78, in unit_maker >> assert_array_equal(actual, desired, err_msg) >> File >> "/home/vagrant/src/master-env/lib/python2.7/site-packages/numpy/testing/utils.py", >> line 718, in assert_array_equal >> verbose=verbose, header='Arrays are not equal') >> File >> "/home/vagrant/src/master-env/lib/python2.7/site-packages/numpy/testing/utils.py", >> line 644, in assert_array_compare >> raise AssertionError(msg) >> AssertionError: >> Arrays are not equal >> >> func nanargmin | input a44 (float32) | shape (4,) | axis -1 >> >> Input array: >> [ nan nan nan nan] >> >> (mismatch 100.0%) >> x: array(nan) >> y: array('Crashed', >> dtype='|S7') >> >> ====================================================================== >> FAIL: Test nanargmax. >> ---------------------------------------------------------------------- >> Traceback (most recent call last): >> File >> "/home/vagrant/src/master-env/lib/python2.7/site-packages/nose/case.py", >> line 197, in runTest >> self.test(*self.arg) >> File >> "/home/vagrant/src/master-env/lib/python2.7/site-packages/bottleneck/tests/func_test.py", >> line 78, in unit_maker >> assert_array_equal(actual, desired, err_msg) >> File >> "/home/vagrant/src/master-env/lib/python2.7/site-packages/numpy/testing/utils.py", >> line 718, in assert_array_equal >> verbose=verbose, header='Arrays are not equal') >> File >> "/home/vagrant/src/master-env/lib/python2.7/site-packages/numpy/testing/utils.py", >> line 644, in assert_array_compare >> raise AssertionError(msg) >> AssertionError: >> Arrays are not equal >> >> func nanargmax | input a44 (float32) | shape (4,) | axis -1 >> >> Input array: >> [ nan nan nan nan] >> >> (mismatch 100.0%) >> x: array(nan) >> y: array('Crashed', >> dtype='|S7') >> >> ---------------------------------------------------------------------- >> Ran 124 tests in 85.714s >> >> FAILED (failures=2) >> FAIL >> >> > Not going to fix these, nanarg{max, min} now raises an exception for this > case. 
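A minimal sketch of the 1.8 behaviour being referred to here (illustrative only, not part of the original exchange): an all-NaN input now makes nanargmin/nanargmax raise instead of returning a meaningless index, which is exactly what the old Bottleneck test expectations trip over.

import numpy as np

a = np.array([np.nan, np.nan, np.nan, np.nan], dtype=np.float32)

# Under numpy 1.8 this raises (a ValueError along the lines of
# "All-NaN slice encountered") rather than returning a bogus index.
try:
    print(np.nanargmin(a))
except ValueError as err:
    print("nanargmin raised:", err)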
> > >> and larry: >> >> ====================================================================== >> ERROR: Failure: IndexError (too many indices) >> ---------------------------------------------------------------------- >> Traceback (most recent call last): >> File >> "/home/vagrant/src/master-env/lib/python2.7/site-packages/nose/loader.py", >> line 253, in generate >> for test in g(): >> File >> "/home/vagrant/src/master-env/lib/python2.7/site-packages/la/tests/all_nan_test.py", >> line 31, in test_all_nan >> actual = getattr(lar(), method)(*parameters) >> File >> "/home/vagrant/src/master-env/lib/python2.7/site-packages/la/deflarry.py", >> line 3066, in quantile >> x = quantile(self.x, q, axis=axis) >> File >> "/home/vagrant/src/master-env/lib/python2.7/site-packages/la/farray/normalize.py", >> line 289, in quantile >> y = np.apply_along_axis(_quantileraw1d, axis, x, q) >> File >> "/home/vagrant/src/master-env/lib/python2.7/site-packages/numpy/lib/shape_base.py", >> line 79, in apply_along_axis >> res = func1d(arr[tuple(i.tolist())],*args) >> File >> "/home/vagrant/src/master-env/lib/python2.7/site-packages/la/farray/normalize.py", >> line 228, in _quantileraw1d >> xi = xi[idx,:] >> IndexError: too many indices >> >> ====================================================================== >> ERROR: larry.quantile_1 >> ---------------------------------------------------------------------- >> Traceback (most recent call last): >> File >> "/home/vagrant/src/master-env/lib/python2.7/site-packages/la/tests/deflarry_test.py", >> line 3401, in test_quantile_1 >> actual = self.l1.quantile(2) >> File >> "/home/vagrant/src/master-env/lib/python2.7/site-packages/la/deflarry.py", >> line 3066, in quantile >> x = quantile(self.x, q, axis=axis) >> File >> "/home/vagrant/src/master-env/lib/python2.7/site-packages/la/farray/normalize.py", >> line 289, in quantile >> y = np.apply_along_axis(_quantileraw1d, axis, x, q) >> File >> "/home/vagrant/src/master-env/lib/python2.7/site-packages/numpy/lib/shape_base.py", >> line 79, in apply_along_axis >> res = func1d(arr[tuple(i.tolist())],*args) >> File >> "/home/vagrant/src/master-env/lib/python2.7/site-packages/la/farray/normalize.py", >> line 228, in _quantileraw1d >> xi = xi[idx,:] >> IndexError: too many indices >> >> (more similar) >> > > Iarry problem, the proper form here is xi[x,...] > It has been fixed in larry master Chuck > -------------- next part -------------- An HTML attachment was scrubbed... URL: From thomas_unterthiner at web.de Wed Oct 16 13:04:52 2013 From: thomas_unterthiner at web.de (Thomas Unthiner) Date: Wed, 16 Oct 2013 19:04:52 +0200 (CEST) Subject: [Numpy-discussion] numpy ACML support is kind of broken Message-ID: An HTML attachment was scrubbed... URL: From cckidder at gmail.com Wed Oct 16 18:06:29 2013 From: cckidder at gmail.com (Chad Kidder) Date: Wed, 16 Oct 2013 16:06:29 -0600 Subject: [Numpy-discussion] Using array mask swaps array axes In-Reply-To: <1381940903.19985.15.camel@sebastian-laptop> References: <1381940903.19985.15.camel@sebastian-laptop> Message-ID: Thanks, that works. It will be nice when the original way works also. On Oct 16, 2013 10:28 AM, "Sebastian Berg" wrote: > On Wed, 2013-10-16 at 11:50 -0400, Benjamin Root wrote: > > > > > > > > On Wed, Oct 16, 2013 at 11:39 AM, Chad Kidder > > wrote: > > Just found what should be a bug in 1.7.1. 
I'm running > > python(x,y) on windows here: > > > > >>> dataMatrix[ii,:,mask].shape > > (201, 23) > > >>> dataMatrix[ii,:,:].shape > > (23, 201) > > >>> dataMatrix.shape > > (24, 23, 201) > > >>> mask > > array([ True, True, True, True, True, True, True, True, > > True, > > ... > > True, True, True], dtype=bool) > > > > > > using a mask should not change the order of the dimensions. > > Is there a reason for this behavior, and if so, how do I avoid > > it in the future? Thanks > > > > > > --Chad Kidder > > > > > > > > > > Chad, > > > > The issue here is one where there is the mixing of fancy indexing (I > > presume that is what "ii" is), slicing and boolean indexing. If I > > remember correctly, the changing of the dimension orders was an > > inadvertent byproduct of handing all this array accessing methods in > > one shot. I think this was addressed in 1.8. Sorry for being very > > brief and vague, hopefully someone else who understands what the > > resolution was can fill in. > > > Yes, in fact `ii` can just be a normal integer, since an integer *is* > considered an advanced/fancy index (in the sense that it forces > transposing, not in the sense that it forces a copy by itself, so > integers are *both* advanced and view based indices!). > > This is how advanced/fancy indexing works, there is `np.ix_` which helps > in some cases, but not exactly for your problem. For a more detailed > description of fancy indexing check: > > http://docs.scipy.org/doc/numpy/reference/arrays.indexing.html#advanced-indexing > > You have a slice between the mask and the integer `arr[1, :, mask]`, > which means the `mask` result dimension is transposed to the front. It > would not be if it was `arr[:, 1, mask]` (since numpy can actually guess > where it came from in that case). > > Since you seem to always have exactly one advanced (mask) index, in your > example, the simplest solution is probably: > `dataMatrix[ii,...][:,mask]` > (first view based slicing, then the advanced boolean index. Since the > first part (if `ii` is an integer) will not copy the data, this will > also work for assignments). > > - Sebastian > > > > Cheers! > > Ben Root > > > > > > > > _______________________________________________ > > NumPy-Discussion mailing list > > NumPy-Discussion at scipy.org > > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From d.l.goldsmith at gmail.com Wed Oct 16 19:06:36 2013 From: d.l.goldsmith at gmail.com (David Goldsmith) Date: Wed, 16 Oct 2013 16:06:36 -0700 Subject: [Numpy-discussion] OT: How are SVG data converted into curves Message-ID: Does anyone on this list know how Scalable Vector Graphics C, S, etc. command data are translated into curves (i.e., pixel maps) and might you be willing to answer some questions off-list? Thanks! DG PS: I receive numpy-discussion in digest mode, so if you "qualify," please reply directly to my email. Thanks again. -------------- next part -------------- An HTML attachment was scrubbed... 
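As background for the question above: an SVG "C" command just specifies the control points of a cubic Bézier segment, and sampling that segment into point coordinates is roughly the step a rasterizer performs before producing pixels. A purely illustrative sketch with made-up control points (not from the original message):

import numpy as np

# Control points of one SVG "C x1 y1, x2 y2, x y" segment: the current point
# P0, the two control points P1 and P2, and the end point P3 (made-up values).
P = np.array([[10.0, 10.0],   # P0
              [20.0, 80.0],   # P1
              [60.0, 80.0],   # P2
              [70.0, 10.0]])  # P3

# Sample the cubic Bernstein form
#     B(t) = (1-t)^3 P0 + 3 (1-t)^2 t P1 + 3 (1-t) t^2 P2 + t^3 P3
t = np.linspace(0.0, 1.0, 50)[:, None]
curve = ((1 - t) ** 3 * P[0] + 3 * (1 - t) ** 2 * t * P[1]
         + 3 * (1 - t) * t ** 2 * P[2] + t ** 3 * P[3])

print(curve[0], curve[-1])   # starts at P0, ends at P3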
URL: From daniele at grinta.net Wed Oct 16 19:23:04 2013 From: daniele at grinta.net (Daniele Nicolodi) Date: Thu, 17 Oct 2013 01:23:04 +0200 Subject: [Numpy-discussion] OT: How are SVG data converted into curves In-Reply-To: References: Message-ID: <525F1FD8.6070005@grinta.net> On 17/10/2013 01:06, David Goldsmith wrote: > Does anyone on this list know how Scalable Vector Graphics C, S, etc. > command data are translated into curves (i.e., pixel maps) and might you > be willing to answer some questions off-list? Thanks! Hello, the format is quite clearly documented. For the path data you can see http://www.w3.org/TR/SVG/paths.html. There are several open source libraries that implement SVG rendering, you may look at those to see how the rendering can be implemented in practice. Cheers, Daniele From d.l.goldsmith at gmail.com Wed Oct 16 21:56:54 2013 From: d.l.goldsmith at gmail.com (David Goldsmith) Date: Wed, 16 Oct 2013 18:56:54 -0700 Subject: [Numpy-discussion] On Topic: Faster way to implement Bernstein polys: explicit or recursion? Message-ID: Many thanks to Daniele Nicolodi for pointing me to the Wikipedia article on B?zier curves. Said article gives two formulae for the B?zier curve of degree n: one explicit, one recursive. Using numpy.polynomial.Polynomial as the base class, and its evaluation method for the evaluation in each dimension, which approach is likely to be faster for evaluation at parameter t? Does it depend on the degree, i.e., one approach will likely be faster for low degree while the other will likely be faster for higher degree? Thanks! DG -------------- next part -------------- An HTML attachment was scrubbed... URL: From piet at vanoostrum.org Wed Oct 16 22:37:45 2013 From: piet at vanoostrum.org (Piet van Oostrum) Date: Wed, 16 Oct 2013 22:37:45 -0400 Subject: [Numpy-discussion] NumPy 1.8.0rc2 release References: Message-ID: Charles R Harris writes: > Hi All, > > NumPy 1.8.0rc2 is up now on sourceforge. Binary builds are included, except for Python 3.3 on > windows. Many thanks to Ralf for the binaries and to those who found and fixed the bugs in rc1. > Please test this thoroughly, especially if you have access to one of the less common platforms. > Testing of rc1 turned up several bugs that would have been a embarrassment if they had made their > way into the release and we are very happy that they were discovered. I had to remove the previous numpy installation from site-packages (Python 3.3.2, Mac OS X 10.6.8). Without that removal the tests failed, because a non functioning test file had remained behind. Or maybe some more files. The previous was 1.0.8rc1, IIRC. -- Piet van Oostrum WWW: http://pietvanoostrum.com/ PGP key: [8DAE142BE17999C4] From poojagupta.135 at gmail.com Fri Oct 18 06:33:12 2013 From: poojagupta.135 at gmail.com (Pooja Gupta) Date: Fri, 18 Oct 2013 16:03:12 +0530 Subject: [Numpy-discussion] how to do scoring of random point to avoid overlapping in python Message-ID: I have generated random point around a object and then evaluate each random point on certain criteria. But problem is that every time I am getting new point. How i can resolve this problem so that my result should be uniform. Is any way to evaluate the position of random point. 
for arang in range(1000): # generate 1000 random point around object arang = arang + 1 x,y,z = 9.251, 24.410, 64.133 # coordinates of objects (i have 500 object coordinates) x1,y1,z1 = (uniform(x-3.5,x+3.5),uniform(y-3.5,y+3.5),uniform(z-3.5,z+3.5)) #randompoint pacord = [x1,y1,z1] #random point coordinates dist_pap = euDist(uacoord, pacord) # check distance between object and random points if (dist_pap > 2.5): # if the random point far from obect dist_pap1 = dist_pap vecpw = euvector(uacoord, pacord) # generate vectors b/w objject and random point # angle between angle between object and random point num1 = np.dot (vect1, vecpw) denom1 = np.linalg.norm(vect1) * np.linalg.norm(vecpw) ang1 = rad2deg(np.arccos(num1/denom1)) if 140 > ang1 >100: # check angle ang2= ang1print pacord Queries every time i am getting new result (new positions of the random point). How to fix it. on above basis I want to score each random point and the two random point should be 2.5 distance apart from each other. How I can avoid overlapping of the random points. -- ???? ??????, ?????? ????????? ?????? ??????? ?????? "???? ?? ??? ?? ?????????? ??, ??? ?? ???? ???? ??? ?? ? ?? ???? ?? ?? ???????? ??, ?? ???? ?????? ?? ????? ????" -------------- next part -------------- An HTML attachment was scrubbed... URL: From klemm at phys.ethz.ch Fri Oct 18 10:51:36 2013 From: klemm at phys.ethz.ch (Hanno Klemm) Date: Fri, 18 Oct 2013 16:51:36 +0200 Subject: [Numpy-discussion] how to do scoring of random point to avoid overlapping in python In-Reply-To: References: Message-ID: On 18.10.2013 12:33, Pooja Gupta wrote: > I have generated random point around a object and then evaluate each > random point on certain criteria. But problem is that every time I am > getting new point. How i can resolve this problem so that my result > should be uniform. Is any way to evaluate the position of random > point. > > for arang in range(1000): # generate 1000 random point around object > arang = arang + 1 > x,y,z = 9.251, 24.410, 64.133 # coordinates of objects (i have 500 > object coordinates) > > x1,y1,z1 = > (uniform(x-3.5,x+3.5),uniform(y-3.5,y+3.5),uniform(z-3.5,z+3.5)) > #randompoint > pacord = [x1,y1,z1] #random point coordinates > dist_pap = euDist(uacoord, pacord) # check distance between object > and random points > > if (dist_pap > 2.5): # if the random point far from obect > dist_pap1 = dist_pap > vecpw = euvector(uacoord, pacord) # generate vectors b/w objject and > random point > > # angle between angle between object and random point > num1 = np.dot (vect1, vecpw) > denom1 = np.linalg.norm(vect1) * np.linalg.norm(vecpw) > ang1 = rad2deg(np.arccos(num1/denom1)) > > if 140 > ang1 >100: # check angle > ang2= ang1 > print pacord > > Queries > every time i am getting new result (new positions of the random > point). How to fix it. > on above basis I want to score each random point and the two random > point should be 2.5 distance apart from each other. How I can avoid > overlapping of the random points. > I am not sure if i understand the question correctly but if you always want to get the same random number, every time you run the script, you can fix the random seed by using np.random.seed(). Regarding your second question, when I understand correctly, you somehow want to find two points that have a distance of 2.5. If you already have one of them, I would generate the second one using spherical coordinates and specifying the distance a priori. 
something like: def pt2(pt1, distance=2.5): theta = np.random.rand()*np.pi phi = np.random.rand()*2*np.pi r = distance x = r*np.sin(theta)*np.cos(phi) y = r*np.sin(theta)*np.sin(phi) z = r*np.cos(theta) return pt1 + np.array((x,y,z)) In [367]: pt1 = np.array([0,0,0]) In [368]: pt2(pt1) Out[368]: a = array([-2.29954368, -0.57223342, 0.79664785]) In [369]: np.linalg.norm(a) Out[369]: 2.5 Hope this helps, Hanno From poojagupta.135 at gmail.com Fri Oct 18 10:58:28 2013 From: poojagupta.135 at gmail.com (Pooja Gupta) Date: Fri, 18 Oct 2013 20:28:28 +0530 Subject: [Numpy-discussion] how to do scoring of random point to avoid overlapping in python In-Reply-To: References: Message-ID: Thanks Hanno I got some idea. How about the bin(grid)?????? On Fri, Oct 18, 2013 at 8:21 PM, Hanno Klemm wrote: > On 18.10.2013 12:33, Pooja Gupta wrote: > > I have generated random point around a object and then evaluate each > > random point on certain criteria. But problem is that every time I am > > getting new point. How i can resolve this problem so that my result > > should be uniform. Is any way to evaluate the position of random > > point. > > > > for arang in range(1000): # generate 1000 random point around object > > arang = arang + 1 > > x,y,z = 9.251, 24.410, 64.133 # coordinates of objects (i have 500 > > object coordinates) > > > > x1,y1,z1 = > > (uniform(x-3.5,x+3.5),uniform(y-3.5,y+3.5),uniform(z-3.5,z+3.5)) > > #randompoint > > pacord = [x1,y1,z1] #random point coordinates > > dist_pap = euDist(uacoord, pacord) # check distance between object > > and random points > > > > if (dist_pap > 2.5): # if the random point far from obect > > dist_pap1 = dist_pap > > vecpw = euvector(uacoord, pacord) # generate vectors b/w objject and > > random point > > > > # angle between angle between object and random point > > num1 = np.dot (vect1, vecpw) > > denom1 = np.linalg.norm(vect1) * np.linalg.norm(vecpw) > > ang1 = rad2deg(np.arccos(num1/denom1)) > > > > if 140 > ang1 >100: # check angle > > ang2= ang1 > > print pacord > > > > Queries > > every time i am getting new result (new positions of the random > > point). How to fix it. > > on above basis I want to score each random point and the two random > > point should be 2.5 distance apart from each other. How I can avoid > > overlapping of the random points. > > > > I am not sure if i understand the question correctly but if you always > want to get the same random number, every time you run the script, you > can fix the random seed by using np.random.seed(). Regarding your second > question, when I understand correctly, you somehow want to find two > points that have a distance of 2.5. If you already have one of them, I > would generate the second one using spherical coordinates and specifying > the distance a priori. something like: > > def pt2(pt1, distance=2.5): > theta = np.random.rand()*np.pi > phi = np.random.rand()*2*np.pi > r = distance > x = r*np.sin(theta)*np.cos(phi) > y = r*np.sin(theta)*np.sin(phi) > z = r*np.cos(theta) > return pt1 + np.array((x,y,z)) > > > In [367]: pt1 = np.array([0,0,0]) > > In [368]: pt2(pt1) > Out[368]: a = array([-2.29954368, -0.57223342, 0.79664785]) > > In [369]: np.linalg.norm(a) > Out[369]: 2.5 > > Hope this helps, > Hanno > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -- ???? ??????, ?????? ????????? ?????? ??????? ?????? "???? ?? ??? ?? ?????????? ??, ??? ?? ???? ???? ??? ?? ? ?? ???? ?? ?? 
???????? ??, ?? ???? ?????? ?? ????? ????" -------------- next part -------------- An HTML attachment was scrubbed... URL: From chris.barker at noaa.gov Fri Oct 18 15:55:45 2013 From: chris.barker at noaa.gov (Chris Barker) Date: Fri, 18 Oct 2013 12:55:45 -0700 Subject: [Numpy-discussion] Equivalent to IDL's help function In-Reply-To: <765BF3F2-7FE1-414E-A134-F363D841D0DC@staffmail.ed.ac.uk> References: <765BF3F2-7FE1-414E-A134-F363D841D0DC@staffmail.ed.ac.uk> Message-ID: On Mon, Oct 7, 2013 at 10:15 AM, Siegfried Gonzi wrote: > What is the equivalent to IDL its help function, e.g. > > == > IDL> a = make_array(23,23,) > > IDL> help,a > > will result in: > > A FLOAT = Array[23, 23] am I missing something, or is this what you get when you type a name on the command line: In [15]: a = np.ones((2,3), dtype=np.float32) In [16]: a Out[16]: array([[ 1., 1., 1.], [ 1., 1., 1.]], dtype=float32) or print a etc... -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov From chris.barker at noaa.gov Fri Oct 18 15:59:43 2013 From: chris.barker at noaa.gov (Chris Barker) Date: Fri, 18 Oct 2013 12:59:43 -0700 Subject: [Numpy-discussion] OT: How are SVG data converted into curves In-Reply-To: <525F1FD8.6070005@grinta.net> References: <525F1FD8.6070005@grinta.net> Message-ID: On Wed, Oct 16, 2013 at 4:23 PM, Daniele Nicolodi wrote: > the format is quite clearly documented. For the path data you can see > http://www.w3.org/TR/SVG/paths.html. There are several open source > libraries that implement SVG rendering, you may look at those to see how > the rendering can be implemented in practice. InkScape is particularly nice -- it's an open source desktop vector drawing program, but can also be called on the command line and used as a SVG renderer. Someone wrote a minimal wxPython-based SVG renderer a while back -- not sure what happened to that. -Chris -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov From raul at virtualmaterials.com Fri Oct 18 16:34:44 2013 From: raul at virtualmaterials.com (Raul Cota) Date: Fri, 18 Oct 2013 14:34:44 -0600 Subject: [Numpy-discussion] switching from Numeric to numpy In-Reply-To: <188EE056906.00000B14fernando.j@inbox.com> References: <188EE056906.00000B14fernando.j@inbox.com> Message-ID: <52619B64.3020508@virtualmaterials.com> John, Just noticed this message, We are already cleaning up all of our code to not be numpy based but for porting from Numeric to numpy: In our C code we settled for the following, #define NUMPY #if !defined(NUMPY) #include "arrayobject.h" #else #include "numpy/oldnumeric.h" #endif And this, #ifdef NUMPY pyArray = (PyArrayObject *) PyArray_SimpleNew( 1, dims, PyArray_DOUBLE); #else pyArray = (PyArrayObject *) PyArray_FromDims( 1, dims, PyArray_DOUBLE); #endif And I think that was all the changes we needed and while we were transitioning we could easily compile for Numeric or numpy. Most of our headaches were on the python code because of very subtle things but even there, the bulk of the changes worked out rather nicely using the oldnumeric stuff. Raul On 07/10/2013 5:27 PM, john fernando wrote: > I have the following C code which is an extension to my python code. 
> > The python and C code use > #include > > what is the equivalent I can use in numpy that causes the minimum code change? > > I did look through the old messages but didn't really find the answer-any help to a novice just starting out in numpy/python/C is much appreciated. > > ____________________________________________________________ > FREE ONLINE PHOTOSHARING - Share your photos online with your friends and family! > Visit http://www.inbox.com/photosharing to find out more! > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From robert.kern at gmail.com Sat Oct 19 06:37:09 2013 From: robert.kern at gmail.com (Robert Kern) Date: Sat, 19 Oct 2013 11:37:09 +0100 Subject: [Numpy-discussion] OT: How are SVG data converted into curves In-Reply-To: References: Message-ID: On Thu, Oct 17, 2013 at 12:06 AM, David Goldsmith wrote: > > Does anyone on this list know how Scalable Vector Graphics C, S, etc. command data are translated into curves (i.e., pixel maps) and might you be willing to answer some questions off-list? Thanks! It's not clear if you are asking about how the SVG path command are parsed, normalized, and interpreted into a path composed of vector primitives or how those paths actually get rendered into pixels. If the former, you may be interested in the svg.path package which does exactly that (and only that). Most vector drawing libraries (Agg, the painting API of any reasonable GUI toolkit) will be able to implement those primitives to draw onto a pixel image. https://pypi.python.org/pypi/svg.path/ -- Robert Kern -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Sat Oct 19 17:40:15 2013 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Sat, 19 Oct 2013 23:40:15 +0200 Subject: [Numpy-discussion] ANN: SciPy 0.13.0 release Message-ID: On behalf of the SciPy development team I'm pleased to announce the availability of SciPy 0.13.0. This release contains some interesting new features (see highlights below) and half a year's worth of maintenance work. 65 people contributed to this release. Some of the highlights are: - support for fancy indexing and boolean comparisons with sparse matrices - interpolative decompositions and matrix functions in the linalg module - two new trust-region solvers for unconstrained minimization This release requires Python 2.6, 2.7 or 3.1-3.3 and NumPy 1.5.1 or greater. Support for Python 2.4 and 2.5 has been dropped as of this release. Sources and binaries can be found at http://sourceforge.net/projects/scipy/files/scipy/0.13.0/, release notes are copied below. Enjoy, Ralf ========================== SciPy 0.13.0 Release Notes ========================== .. contents:: SciPy 0.13.0 is the culmination of 7 months of hard work. It contains many new features, numerous bug-fixes, improved test coverage and better documentation. There have been a number of deprecations and API changes in this release, which are documented below. All users are encouraged to upgrade to this release, as there are a large number of bug-fixes and optimizations. Moreover, our development attention will now shift to bug-fix releases on the 0.13.x branch, and on adding new features on the master branch. This release requires Python 2.6, 2.7 or 3.1-3.3 and NumPy 1.5.1 or greater. 
Highlights of this release are: - support for fancy indexing and boolean comparisons with sparse matrices - interpolative decompositions and matrix functions in the linalg module - two new trust-region solvers for unconstrained minimization New features ============ ``scipy.integrate`` improvements -------------------------------- N-dimensional numerical integration ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ A new function `scipy.integrate.nquad`, which provides N-dimensional integration functionality with a more flexible interface than ``dblquad`` and ``tplquad``, has been added. ``dopri*`` improvements ^^^^^^^^^^^^^^^^^^^^^^^ The intermediate results from the ``dopri`` family of ODE solvers can now be accessed by a *solout* callback function. ``scipy.linalg`` improvements ----------------------------- Interpolative decompositions ^^^^^^^^^^^^^^^^^^^^^^^^^^^^ Scipy now includes a new module `scipy.linalg.interpolative` containing routines for computing interpolative matrix decompositions (ID). This feature is based on the ID software package by P.G. Martinsson, V. Rokhlin, Y. Shkolnisky, and M. Tygert, previously adapted for Python in the PymatrixId package by K.L. Ho. Polar decomposition ^^^^^^^^^^^^^^^^^^^ A new function `scipy.linalg.polar`, to compute the polar decomposition of a matrix, was added. BLAS level 3 functions ^^^^^^^^^^^^^^^^^^^^^^ The BLAS functions ``symm``, ``syrk``, ``syr2k``, ``hemm``, ``herk`` and ``her2k`` are now wrapped in `scipy.linalg`. Matrix functions ^^^^^^^^^^^^^^^^ Several matrix function algorithms have been implemented or updated following detailed descriptions in recent papers of Nick Higham and his co-authors. These include the matrix square root (``sqrtm``), the matrix logarithm (``logm``), the matrix exponential (``expm``) and its Frechet derivative (``expm_frechet``), and fractional matrix powers (``fractional_matrix_power``). ``scipy.optimize`` improvements ------------------------------- Trust-region unconstrained minimization algorithms ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ The ``minimize`` function gained two trust-region solvers for unconstrained minimization: ``dogleg`` and ``trust-ncg``. ``scipy.sparse`` improvements ----------------------------- Boolean comparisons and sparse matrices ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ All sparse matrix types now support boolean data, and boolean operations. Two sparse matrices `A` and `B` can be compared in all the expected ways `A < B`, `A >= B`, `A != B`, producing similar results as dense Numpy arrays. Comparisons with dense matrices and scalars are also supported. CSR and CSC fancy indexing ^^^^^^^^^^^^^^^^^^^^^^^^^^ Compressed sparse row and column sparse matrix types now support fancy indexing with boolean matrices, slices, and lists. So where A is a (CSC or CSR) sparse matrix, you can do things like:: >>> A[A > 0.5] = 1 # since Boolean sparse matrices work >>> A[:2, :3] = 2 >>> A[[1,2], 2] = 3 ``scipy.sparse.linalg`` improvements ------------------------------------ The new function ``onenormest`` provides a lower bound of the 1-norm of a linear operator and has been implemented according to Higham and Tisseur (2000). This function is not only useful for sparse matrices, but can also be used to estimate the norm of products or powers of dense matrices without explictly building the intermediate matrix. The multiplicative action of the matrix exponential of a linear operator (``expm_multiply``) has been implemented following the description in Al-Mohy and Higham (2011). 
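A minimal sketch of how these two routines might be called (the random test
matrix and its size are arbitrary, chosen only for illustration)::

    >>> import numpy as np
    >>> from scipy import sparse
    >>> from scipy.sparse.linalg import onenormest, expm_multiply
    >>> A = sparse.rand(1000, 1000, density=1e-3, format='csr')
    >>> onenormest(A)                      # lower bound on the 1-norm, no dense intermediate built
    >>> expm_multiply(A, np.ones(1000))    # action of expm(A) on a vector, expm(A) itself never formed
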
Abstract linear operators (`scipy.sparse.linalg.LinearOperator`) can now be multiplied, added to each other, and exponentiated, producing new linear operators. This enables easier construction of composite linear operations. ``scipy.spatial`` improvements ------------------------------ The vertices of a `ConvexHull` can now be accessed via the `vertices` attribute, which gives proper orientation in 2-D. ``scipy.signal`` improvements ----------------------------- The cosine window function `scipy.signal.cosine` was added. ``scipy.special`` improvements ------------------------------ New functions `scipy.special.xlogy` and `scipy.special.xlog1py` were added. These functions can simplify and speed up code that has to calculate ``x * log(y)`` and give 0 when ``x == 0``. ``scipy.io`` improvements ------------------------- Unformatted Fortran file reader ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ The new class `scipy.io.FortranFile` facilitates reading unformatted sequential files written by Fortran code. ``scipy.io.wavfile`` enhancements ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ `scipy.io.wavfile.write` now accepts a file buffer. Previously it only accepted a filename. `scipy.io.wavfile.read` and `scipy.io.wavfile.write` can now handle floating point WAV files. ``scipy.interpolate`` improvements ---------------------------------- B-spline derivatives and antiderivatives ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ `scipy.interpolate.splder` and `scipy.interpolate.splantider` functions for computing B-splines that represent derivatives and antiderivatives of B-splines were added. These functions are also available in the class-based FITPACK interface as ``UnivariateSpline.derivative`` and ``UnivariateSpline.antiderivative``. ``scipy.stats`` improvements ---------------------------- Distributions now allow using keyword parameters in addition to positional parameters in all methods. The function `scipy.stats.power_divergence` has been added for the Cressie-Read power divergence statistic and goodness of fit test. Included in this family of statistics is the "G-test" (http://en.wikipedia.org/wiki/G-test). `scipy.stats.mood` now accepts multidimensional input. An option was added to `scipy.stats.wilcoxon` for continuity correction. `scipy.stats.chisquare` now has an `axis` argument. `scipy.stats.mstats.chisquare` now has `axis` and `ddof` arguments. Deprecated features =================== ``expm2`` and ``expm3`` ----------------------- The matrix exponential functions `scipy.linalg.expm2` and `scipy.linalg.expm3` are deprecated. All users should use the numerically more robust `scipy.linalg.expm` function instead. ``scipy.stats`` functions ------------------------- `scipy.stats.oneway` is deprecated; `scipy.stats.f_oneway` should be used instead. `scipy.stats.glm` is deprecated. `scipy.stats.ttest_ind` is an equivalent function; more full-featured general (and generalized) linear model implementations can be found in statsmodels. `scipy.stats.cmedian` is deprecated; ``numpy.median`` should be used instead. Backwards incompatible changes ============================== LIL matrix assignment --------------------- Assigning values to LIL matrices with two index arrays now works similarly as assigning into ndarrays:: >>> x = lil_matrix((3, 3)) >>> x[[0,1,2],[0,1,2]]=[0,1,2] >>> x.todense() matrix([[ 0., 0., 0.], [ 0., 1., 0.], [ 0., 0., 2.]]) rather than giving the result:: >>> x.todense() matrix([[ 0., 1., 2.], [ 0., 1., 2.], [ 0., 1., 2.]]) Users relying on the previous behavior will need to revisit their code. 
The previous behavior is obtained by ``x[numpy.ix_([0,1,2],[0,1,2])] = ...`. Deprecated ``radon`` function removed ------------------------------------- The ``misc.radon`` function, which was deprecated in scipy 0.11.0, has been removed. Users can find a more full-featured ``radon`` function in scikit-image. Removed deprecated keywords ``xa`` and ``xb`` from ``stats.distributions`` -------------------------------------------------------------------------- The keywords ``xa`` and ``xb``, which were deprecated since 0.11.0, have been removed from the distributions in ``scipy.stats``. Changes to MATLAB file readers / writers ---------------------------------------- The major change is that 1D arrays in numpy now become row vectors (shape 1, N) when saved to a MATLAB 5 format file. Previously 1D arrays saved as column vectors (N, 1). This is to harmonize the behavior of writing MATLAB 4 and 5 formats, and adapt to the defaults of numpy and MATLAB - for example ``np.atleast_2d`` returns 1D arrays as row vectors. Trying to save arrays of greater than 2 dimensions in MATLAB 4 format now raises an error instead of silently reshaping the array as 2D. ``scipy.io.loadmat('afile')`` used to look for `afile` on the Python system path (``sys.path``); now ``loadmat`` only looks in the current directory for a relative path filename. Other changes ============= Security fix: ``scipy.weave`` previously used temporary directories in an insecure manner under certain circumstances. Cython is now required to build *unreleased* versions of scipy. The C files generated from Cython sources are not included in the git repo anymore. They are however still shipped in source releases. The code base received a fairly large PEP8 cleanup. A ``tox pep8`` command has been added; new code should pass this test command. Scipy cannot be compiled with gfortran 4.1 anymore (at least on RH5), likely due to that compiler version not supporting entry constructs well. Authors ======= This release contains work by the following people (contributed at least one patch to this release, names in alphabetical order): * Jorge Ca?ardo Alastuey + * Tom Aldcroft + * Max Bolingbroke + * Joseph Jon Booker + * Fran?ois Boulogne * Matthew Brett * Christian Brodbeck + * Per Brodtkorb + * Christian Brueffer + * Lars Buitinck * Evgeni Burovski + * Tim Cera * Lawrence Chan + * David Cournapeau * Draz?en Luc?anin + * Alexander J. Dunlap + * endolith * Andr? Gaul + * Christoph Gohlke * Ralf Gommers * Alex Griffing + * Blake Griffith + * Charles Harris * Bob Helmbold + * Andreas Hilboll * Kat Huang + * Oleksandr (Sasha) Huziy + * Gert-Ludwig Ingold + * Thouis (Ray) Jones * Juan Luis Cano Rodr?guez + * Robert Kern * Andreas Kloeckner + * Sytse Knypstra + * Gustav Larsson + * Denis Laxalde * Christopher Lee * Tim Leslie * Wendy Liu + * Clemens Novak + * Takuya Oshima + * Josef Perktold * Illia Polosukhin + * Przemek Porebski + * Steve Richardson + * Branden Rolston + * Skipper Seabold * Fazlul Shahriar * Leo Singer + * Rohit Sivaprasad + * Daniel B. Smith + * Julian Taylor * Louis Thibault + * Tomas Tomecek + * John Travers * Richard Tsai + * Jacob Vanderplas * Patrick Varilly * Pauli Virtanen * Stefan van der Walt * Warren Weckesser * Pedro Werneck + * Nils Werner + * Michael Wimmer + * Nathan Woods + * Tony S. Yu + A total of 65 people contributed to this release. People with a "+" by their names contributed a patch for the first time. -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From charlesr.harris at gmail.com Sun Oct 20 13:30:25 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Sun, 20 Oct 2013 11:30:25 -0600 Subject: [Numpy-discussion] Preparing for 1.7.2 release Message-ID: Hi All, After the release of Numpy 1.8.0 there will be a point release of the long term stable branch, Numpy 1.7.2. There have been some 86 non-merge commits since 1.7.1 was released, see the attached list. If anyone thinks something is missing and needs a backport, please reply on the list. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- commit c5d83e206617f79e576fc3214cb8fd67d11ca85a Author: Christoph Dann Date: Wed Oct 16 15:17:58 2013 -0400 added spaces to literal string concats literal string concatination without spaces not allowed in C++11 anymore commit 0093c8864c43397d918e1ef64782f89bd6aec5ee Author: cgohlke Date: Tue Oct 15 14:17:40 2013 -0700 BUG: close file-like objects returned by urlopen Fix `ResourceWarning: unclosed file` on Python 3 commit bf4c6b2ec0191b9c2ae1129edccb8904a4621d9d Author: Marten van Kerkwijk Date: Sat Oct 12 19:50:43 2013 -0400 Small corrections to tests commit 1b283cf0eae908138daa34b5777784cac64cb115 Author: Marten van Kerkwijk Date: Sat Oct 12 16:06:14 2013 -0400 Add test cases to ensure NotImplemented is passed on commit 72bb6ad851f60ace43ea2470c942e015e4db3231 Author: Marten van Kerkwijk Date: Sat Oct 12 12:11:37 2013 -0400 Ensure NotImplemented is passed on in MaskedArray ufunc's commit 4a4e97b745d2fe6412c3c045b5e0d8fa64440921 Author: Julian Taylor Date: Sat Oct 12 14:06:16 2013 +0200 BUG: Disable check for oldstyle classes in python3 backport of #3885 commit d69cd4e01a41d5e8e1615723df728a054eaff24a Author: Charles Harris Date: Sun Oct 6 10:32:55 2013 -0600 BUG: Include sunmath.h in npy_math.h when __SUNPRO_CC defined. The include is needed for isinf. commit 92adb806f705fed63ba20e0bc5f67fa2f7d500a3 Author: Ralf Gommers Date: Sun May 19 22:15:09 2013 +0200 BLD: fix setuptools-specific easy_install issue. Closes gh-3160. An error is raised by setuptools when trying to write to /dev/null. Was fixed in distribute, but not in setuptools. No multi-arch support with plain setuptools should be OK, because multi-arch is Ubuntu specific (at least for now), and they ship distribute. commit 60f2effbe552dcb5aa021d7871cbf645f2331d56 Author: Jason Madden Date: Sat Sep 28 10:24:37 2013 -0500 Do not fail with NameError if SandboxViolation is raised commit cd3e35bf9f130908bea0bafc0746c82e78e17530 Author: Charles Harris Date: Sat Sep 28 09:47:26 2013 -0600 TST: Remove Python 2.4 test from travis.yml. The tests always fails due to lack of Python 2.4 pip support in current bots. It would be good to test with 2.4 if we could find a way to work around the pip failure, but without that the test is just noise. commit 21f7a535c8d396e5c8a9f4c3125a9b5c2044251b Author: Charles Harris Date: Fri Sep 27 17:10:44 2013 -0600 MAINT: Get rid of a Python 3 resource warning. commit 5b5e01b97d083035098c8d4292ddc97f30659ac5 Author: Charles Harris Date: Fri Sep 27 16:58:44 2013 -0600 BUG: Do not use python with statement. The python with statement is not available in Python 2.4, 2.5. commit 52caf48a1940772b8baeca8d90a41bc7e0a43901 Author: Nathaniel J. Smith Date: Fri Sep 27 20:29:14 2013 +0100 [fix] If we fail to raise an error, at least dump something to the console Conflicts: numpy/core/tests/test_ufunc.py commit 25de8c5d3cff5ba990ff9b47205c74a93fcb14e0 Author: Nathaniel J. 
Smith Date: Fri Sep 27 18:12:52 2013 +0100 [fix] Check for errors in PyArray_CanCastTypeto DEPRECATE If a user had set warnings to raise errors, then this DEPRECATE would leave us with an unpropagated exception and cause havoc downstream. Unfortunately there is no way to propagate an exception from here, so we just have to throw it away :-(. But this is still better than the alternative... Conflicts: numpy/core/tests/test_ufunc.py commit a81fd3f471834b4cda31d93515070d6c636e1502 Author: Julian Taylor Date: Wed Sep 25 23:28:55 2013 +0200 BUG: fix hashing of long integers under python3 python3 long_hash is more complex than int_hash so instead of copying into numpy call it via the Python capi. Same for long long for wich the numpy hash function is not correct with python 2.7 on i386. Will be slower but doesn't need adapting each timy python changes. closes #3793 commit 12498e684c6375e424bc50a43ef0119832f603c8 Author: Christoph Gohlke Date: Sat Sep 14 11:27:22 2013 -0700 BUG: use correct type for hash values As of Python 3.2 hash values are of type Py_hash_t. Previous versions use C long. commit 1e2f9baa8575e721f4d010086ebe0170487653d0 Author: Julian Taylor Date: Thu Sep 12 17:42:55 2013 +0200 BUG: f2py, fix decref on wrong object missing brackets causes decref on an wrong object. shows itself as abort with negative refcount in test_callback using python-dbg. commit 9980d0654383abd8be3d0bedaee309061b34d3e2 Author: Christoph Gohlke Date: Sun Jun 9 12:33:15 2013 -0600 MAINT: use PyOS_snprintf instead of snprintf PyOS_snprintf is portable and more secure than snprintf. commit 8eba2aa1cbfdf927f6332609865adb9e78fc20e3 Author: Pauli Virtanen Date: Sat Apr 13 16:35:57 2013 +0300 BUG: linalg: fix LAPACK error handling in lapack_litemodule If an exception is pending (raised from xerbla), the routines must return NULL. commit 0090a01579510d01f158bcf8ea5d3f5cbb2d2792 Author: Pauli Virtanen Date: Sat Apr 13 16:14:04 2013 +0300 BUG: linalg: do not assume that GIL is enabled in xerbla_ With the new ufunc-based linalg, GIL is released in ufuncs, and needs to be reacquired when raising errors in xerbla_. commit dd266a36968a322a57e86f249956e8edd941325d Author: Christoph Gohlke Date: Sun Sep 8 17:52:03 2013 -0700 TST: add test for Python 3.4 _ast.NameConstant commit 5116a00b08fef7ff1cc156ec64cdfa17b6b404a8 Author: Christoph Gohlke Date: Sun Sep 8 16:29:30 2013 -0700 ENH: add support for Python 3.4 ast.NameConstant commit 61110399b8868fa36e5d847e4f2bb086796b33fb Author: Warren Weckesser Date: Tue Jun 4 14:42:05 2013 -0400 TST: linalg: loosen the requirements of some tests in test_blasdot.py. Addresses part of gh-3399. Closes #3656. commit c4452412f6b72ac6227bb24ed1d99da2a3e724a2 Author: Christoph Gohlke Date: Mon Aug 26 21:41:17 2013 -0700 BLD: restore compatibility with ANSI C commit b8f6adcbac74315b0188c4554527c9a3d82becfb Author: David Cournapeau Date: Sun Aug 25 12:12:11 2013 +0200 REL: fix version in maintenance branch. commit a204d08b8cc7b8ba4e9e26b70de62326be10b700 Author: Nicolas Scheffer Date: Mon Nov 12 12:00:28 2012 -0800 ENH: Remove the need for temporary copies in numpy.dot in numpy core, modified the blas function call to handle C and F order arrays without the need for copy. 
This brings a significant speed up for matrix multiplication Added a full test for matrix multiplication in test_blasdot (cherry picked from commit ea037770e03f23aca1a06274a1a8e8bf0e0e2ee4) commit 73f34e3bd36fc37c11e6ea6776d63501c9d2c3c5 Author: Mark Wiebe Date: Mon Aug 12 17:56:43 2013 -0700 BUG: Fix additional zero-sized index case for ufunc.reduceat commit 571e28b21a6a0d5fe52a690d95b9270789a98295 Author: Mark Wiebe Date: Mon Aug 12 16:56:38 2013 -0700 TST: Add test for additional ufunc.reduceat empty parameter case commit 31dd2bbc644e8542c45fbdf737415319903fb50d Author: Mark Wiebe Date: Thu Jan 17 15:12:26 2013 -0800 TST: Change reduceat empty test to use i4 indices On 32-bit platforms, was failing in casting the test i8 index to i4. It may be desirable to allow this in reduceat, but that change is independent of this pull request. commit ef4b8af9ffc042fe91cdf3db994ad3e44ecc0775 Author: Mark Wiebe Date: Mon Jan 7 10:57:46 2013 -0800 BUG: Fix ufunc.reduceat regression with empty index (gh-2892) commit 2d0487f93f6c93941b9f6a89b2d0d0c65dd1801f Author: Mark Wiebe Date: Mon Jan 7 09:04:18 2013 -0800 TST: Add test for reduceat ufunc regression on zero-sized array commit 8ffdb69633a123883c0a6a5570ba853bb4f411bd Author: Mark Wiebe Date: Mon Jan 7 16:15:57 2013 -0800 BUG: Fix to allow x.fill(tuple) where x is a struct array commit eced0d6c65ae3ab6363e1a32ebe6207df95ebec6 Author: Mark Wiebe Date: Mon Jan 7 15:58:44 2013 -0800 TST: Test for x.fill(tuple) where x is a struct array commit 271cb700cba71e7de04df3175228939fa23d1df5 Author: Sebastian Berg Date: Mon Aug 5 19:36:58 2013 +0200 BUG: Boolean assignment allowed writing to 0-sized array This was because of the assumption that broadcasting works if the dimension is not > 1, but correct is != 1. Adepted from a patch provided by prossahl, backports gh-3575 commit 6ec2a54e52097eacbb150d7e9eb6413028e27109 Author: jeromekelleher Date: Sun Jul 21 20:22:22 2013 +0200 Update halffloat.c Corrected preprocessor form. commit c57c417fd744453bdf8077a04cce580042545347 Author: F?lix Hartmann Date: Wed Jul 3 15:15:55 2013 +0200 TST: add test for negative axis values in np.insert. commit 79b094e851b0cedb7a5731816a16be7c552a9df3 Author: Sebastian Berg Date: Sun Aug 4 19:30:56 2013 +0200 BUG: Fix np.insert with negative axis. In some cases a negative axis argument to np.insert would result in wrong behaviour due to np.rollaxis, add modulo operation to avoid this (an error is still raised due to arr.shape[axis]). Closes gh-3494 commit 8afdd14d612d98cd0c9360cbdf4f70292a7cf497 Author: Charles Harris Date: Tue Jul 2 18:30:06 2013 -0600 BUG: Fix failure to return monic polynomials from roots. This bug affected the various polynomial class methods fromroots due to the ability to specify both window and domain. In that circumstance the roots are mapped from the domain to the window by the substitution `x = off + scl*x`. The polynomial that was being generated was monic in the window before substitution, but if scl was not one it was not monic considered as a function of the variable x in the domain. The fix is to divide the generated coefficients by `scl ** deg` so that the scaling of the highest degree term after substitution is canceled. It might be better to make the scaling optional in the future, but this fix makes the result match the documentation. Closes #3467. 
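As a quick illustration of the fromroots behaviour the commit above restores
(just a sketch, not code from the patch; the roots and the domain/window pair
are arbitrary):

    import numpy as np
    from numpy.polynomial import Polynomial as P

    roots = [1.0, 3.0]
    # domain != window gives scl != 1, which is the case the fix addresses
    p = P.fromroots(roots, domain=[0, 4], window=[-1, 1])
    x = np.linspace(0, 4, 9)
    expected = (x - roots[0]) * (x - roots[1])   # the monic polynomial with these roots
    print(np.allclose(p(x), expected))           # True once the fix is in place
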
commit 2d5fd12eb9b145545f0731ac7c5e0d8ea8f2a787 Author: Christoph Gohlke Date: Tue Jul 2 20:24:13 2013 -0700 BUG: Restore compatibility with ISO C Fixes msvc compile errors commit 854776e16d264a18e41c93b87ce304ff5d061295 Author: Blake Griffith Date: Mon Jul 1 17:13:40 2013 -0500 BUG: Class should inherit from object. commit f66a4063e077af33d32df782815909d950160070 Author: Frederic Date: Thu May 23 11:45:40 2013 -0400 code clean up. Remove code that was never executed. @seberg saw this. commit b97f500364d984485200af14332c3b9653c2813b Author: Frederic Date: Fri May 17 11:47:20 2013 -0400 updated the comment. commit 91cb5e377a95dbd8c63eeb9c04e354fa2621ef3f Author: Frederic Date: Fri May 17 11:45:16 2013 -0400 use assert function instead of python keyword commit 71858ddf54e9da3f9c19a0ecf506553b757d36d6 Author: Frederic Date: Thu May 16 16:51:12 2013 -0400 Move computation inside an if to speed it up. commit f77b43a3eb39b34918ef52ae8103dd78d07c7af5 Author: Frederic Date: Thu May 16 16:38:52 2013 -0400 remove comparison not needed as this is check above and we break in that case. commit 473e3e31413ea80f0c186e345469930fc6dc49b0 Author: Frederic Date: Thu May 16 15:52:54 2013 -0400 fix comment style. commit df388acd385300a03115721beee5892a703508ad Author: Frederic Date: Thu May 16 14:49:43 2013 -0400 reorder code to remove not needed computation in some cases. commit 8fb0dedd61eb508c43b96654b865452a6693161f Author: Frederic Date: Wed May 15 13:00:04 2013 -0400 Add test for the array_priority for comparison function. Conflicts: numpy/core/tests/test_multiarray.py commit 18c4eafcfb979b3661821db17bf7c4e3d8ee3e94 Author: Frederic Date: Wed May 15 12:59:14 2013 -0400 Commit from @seberg to make comparison use respect __array_priority__ commit e2105b2d3c373bbf1968da1a04acb5aeab986fee Author: Frederic Date: Wed May 15 11:59:34 2013 -0400 Revert "Make comparison function (gt, ge, ...) respect __array_priority__." This reverts commit aa7a04741146c44ab1735782a89a60b038028bf2. commit 250c4d66dbdddf41070a2b2cfe02e27acff2dec3 Author: Frederic Date: Fri May 10 16:58:55 2013 -0400 Make comparison function (gt, ge, ...) respect __array_priority__. commit 51ebb86b1e4b361bc8a70f5b2b7818615dc08623 Author: Charles Harris Date: Fri Jun 21 16:20:04 2013 -0600 BUG: Make np.load transfer file ownership to the returned NpzFile. This assures that when the loaded file is closed it also closes the file descriptor, avoiding a resource warning in Python3. Closes #3457. commit 561c8743472866a3dec6ff34a4b502cc3dc9dc8d Author: Julian Taylor Date: Tue Jun 25 19:13:38 2013 +0200 BUG: add module extensions to load_library search list on mac, windows and linux (with >= py3.2) shared libraries and loadable module have different extensions, so check both for all platforms. commit ba29f6a27b7c9182348cf903dc6989b8163f190c Author: Julian Taylor Date: Fri Mar 29 16:40:31 2013 +0100 BUG: hardcode some known shared library extensions the configuration variables are not a reliable mean to get the shared library extension. darwin, windows and debug linux are wrong in these variables. 
SHLIB_SUFFIX is also wrong as of python 3.3.1 closes #3057 commit de7a8f21ac151bea683ebee3f7c6c2373297218c Author: Julian Taylor Date: Wed Dec 5 19:57:38 2012 +0100 BUG: add platform specific python include directories to search paths the generic and platform specific path diverge in python3.3 in ubuntu 13.04 commit c065d6003da4584d1f831d85621f463a260a9ce6 Author: Julian Taylor Date: Fri Dec 14 14:30:05 2012 +0100 BUG: fix a spelling error in mtrand.beta docstring commit 3a587d787b184ff94f970baf6d4b4ee1db9ba96b Author: Julian Taylor Date: Tue Mar 26 19:35:06 2013 +0100 BUG: fix undefined function and add integer divisions in numarray commit ef4a7461bb42164dd45345302be94f1af739e012 Author: Julian Taylor Date: Sat May 25 22:37:07 2013 +0200 BUG: set filename to None for memmaps of unnamed files Closes gh-3302 commit d2a4cc79105defc8a5e32aa90da0adca89d21290 Author: Julian Taylor Date: Mon May 27 20:54:13 2013 +0200 BUG: fix small leak in datetime metadata initialization commit d7a98ece86e52a5f50484df44ab0181fedf8198e Author: Julian Taylor Date: Tue Jun 11 23:50:05 2013 +0200 BUG: fix non-swapping strided copies in GetStridedCopySwap Fixed PyArray_GetStridedCopySwap{Pair,}Fn returns paths that do not swap the data. It never triggered because the alignment requirement for complex data was too low (4-8 bytes) and always fullfiled by the memory allocator. As this is now fixed it caused failures on sparc with 16 byte long doubles. commit 5e71756fd25199d2db5716b11811ee6f3437ca97 Author: Julian Taylor Date: Sun May 26 17:08:27 2013 +0200 ENH: improve performance of byte_bounds for continous arrays reduces the memmap slicing performance issue in gh-3364 to only about a 4 times slowdown compared to 10 times. commit f003b08a8be515c14b5ac4e17f0ab8f1b3126bfc Author: Charles Harris Date: Thu Jun 20 16:49:49 2013 -0600 BUG: Campanion Matrix was scalar, not matrix for degree 1. The companion matrices returned by the various polynomial types was a scalar in the degree one case instead of a 2-D array. Fix that and add a test to check for that result. Closes #3459. commit e41e71be3a5be44da70b9a2cb0151a2f9f935153 Author: Julian Taylor Date: Wed Jun 19 21:53:57 2013 +0200 BUG: fix uninitialized-use on invalid sort kind orign is unset at that point but used in the fail goto, can probably only be triggered by using the C-api wrong. commit 20cee69c17e48ec9a266f6f59a48e83188b8b163 Author: Andrew Horton Date: Sun Jun 16 18:48:58 2013 +0100 Call to byte_swap_vector should use dst, not d1, because this is modified by the call to the strided memcpy and will end up pointing after the end of the destination buffer in that case. commit 0a950e6034a0fb77bef61b448b39fb42ff798d73 Author: Eric Firing Date: Sat Jun 15 15:31:52 2013 -1000 BUG: add __len__ method to ma.mvoid; closes #576 commit 48e2039cf57d737885acfa54af3d032915d2681d Author: Eric Firing Date: Sat Jun 15 16:47:55 2013 -1000 BUG: np.ma.compress treated inputs in wrong order; closes #2495 commit fe62f174ac89673c20ddaf27fa545c08ac70d74a Author: Eric Firing Date: Sat Jun 15 21:33:17 2013 -1000 BUG: field assignment in masked array did not reset mask; closes #2403 The previous behavior when setting a field after indexing to select an element was suitable for the hard mask case, but not for the default soft mask. In addition, the _hardmask value was not being set at all in the mvoid instance. With this changeset, the _hardmask is passed in and __setitem__ takes it into account. 
Conflicts: numpy/ma/core.py commit db3e22900d4392031337f99cbdf1a0c8bd941fcc Author: Warren Weckesser Date: Sat Jun 15 11:22:05 2013 -0400 BUG: ma: ma.average didn't handle complex arrays correctly (issue gh-2684) commit e82362736eb2380faa866ee2eb80b7e73ccebbf5 Author: Warren Weckesser Date: Sat Jun 15 01:42:36 2013 -0400 MAINT: ma: clean up ma/test_extras.py: don't use 'import *'; PEP8 whitespace This commit was cherry-picked from 4a7f27601. The test for the new 'invert' option of 'in1d' was removed to resolve the conflict. Conflicts: numpy/ma/tests/test_extras.py commit ef5fa413e984264d5c78b33eae408b00aaa27ab2 Author: bebert218 Date: Wed May 22 13:58:38 2013 +0900 BUG: The npv function in financial.py was incorrectly implemented. Correct the implementation of the npv function, its documentation, and the mirr function that depends on it. The test_financial.py is also corrected to take into account those modifications The npv function behavior was contrary to what the documentation stated as it summed indexes 1 to M instead of 0 to M-1. The mirr function used a corrective factor to get the correct result in spite of that error so that factor is removed. Closes #649 commit 83e868afd3162c5f5995c3ab1f71023ad6f82de8 Author: mwtoews Date: Sun May 26 23:28:50 2013 +1200 BUG: Handle a version string from a custom-built mingw64. The custom build compiler "GNU Fortran (rubenvb-4.8.0) 4.8.0" was incorrectly parsed as version '-4.8.0' and the flag "-mno-cygwin" was added to the compilation. See http://cens.ioc.ee/pipermail/f2py-users/2010-October/002092.html. commit d45d4232337acbe5c31589aa7338ab40d51f56f0 Author: Julian Taylor Date: Tue May 28 23:00:11 2013 +0200 BUG: include mtrand cython files in sdist the files are required to do a full source rebuild. commit 7e588870a5412feb2ace16156c9a93bdbea051cb Author: Edward Catmur Date: Tue May 7 17:47:38 2013 +0100 TST: add a test for issue #3312 commit 2f1113335f58ab059c68834e3929a673d1e7d57d Author: Edward Catmur Date: Tue May 7 16:49:39 2013 +0100 BUG: fix potentially infinite recursion in VOID_nonzero. Fixes #3312. commit 29cda37dd553993a80864146b376a7a1fffac995 Author: Charles Harris Date: Fri Apr 26 09:59:14 2013 -0600 MAINT: Fix tests for 2to3 and Python < 2.6. Use asbytes instead of 'b' and the 'u' string prefix instead of sixu. commit 216ddba3c011e15009071ca49230cfcaeaffbb9d Author: Jay Bourque Date: Thu Apr 25 17:00:06 2013 -0500 Add tests for strings shorter than 64 characters commit 69e9e6306a1635b41147d3743dac976dc6d4f5cf Author: Jay Bourque Date: Thu Apr 25 13:38:16 2013 -0500 Add support for unicode strings commit ab3aa117a0e11074c99f3b8fb6297fd71860290c Author: Jay Bourque Date: Wed Apr 24 21:42:15 2013 -0500 Fix for astype('S') string truncate issue Calling astype('S') for an array of string objects results in a string array where dtype='S64', even if the original string objects are longer than 64 characters. Add call to GetParamsFromObject() to determine maximum string object length, and use that as string dtype size. 
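A small check that exercises the astype('S') behaviour described just above
(only a sketch; the 100-character string is an arbitrary choice):

    import numpy as np

    a = np.array(['x' * 100], dtype=object)   # string object longer than the old 64-char default
    b = a.astype('S')
    print(b.dtype)   # expect a 100-character string dtype with the fix, rather than truncation to S64
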
commit bd869d0e16c96f9cfdbf3db297bbf84737f4b9a6 Author: Han Genuit Date: Sat Sep 15 17:47:22 2012 +0200 TST: Add test for concatenate with None-axis Conflicts: numpy/core/tests/test_shape_base.py commit f2450236670389bbbf6288bd6b9515d7b6baf3ca Author: Han Genuit Date: Sat Sep 15 16:48:53 2012 +0200 BUG: Concatenate with axis None should work regardless of matching array shapes commit 8e7c6a77df290742071b6499ceafa9f6a9ded571 Author: Jos de Kloe Date: Fri Apr 12 12:18:26 2013 +0200 replace exec by eval to ensure the c variable is defined for all relevant python versions commit 971bab3d51726b95f5afe0c22cbbd7983023f626 Author: Alex Ford Date: Sat Apr 6 11:18:45 2013 -0700 BUG: Backport #3188, Fix numpy.void pickling. During call to PyArray_scalar a PyVoidScalarObject is created, and it's obval field set to a newly allocated block of memory of the correct item size. With a null base member, the subsequent call to copyswap can not determine an item size and returns without copying. Adding direct copy of input data if no base is provided, as no swap is required. Adding regression test for constructor and original pickle repro case. commit dccfdbd1e8ee44925e702d850c3663daaa54d927 Author: Sebastian Berg Date: Mon Mar 18 15:53:10 2013 +0100 TST: test no reuse of non-existing buffers in nditer This slightly changes the test to cover more cases including the one where buffers are sometimes used and sometimes not. commit 4c2469ab6bb0bbb29e0b21b6f5f1abf53d8968d3 Author: Sebastian Berg Date: Mon Mar 18 15:44:12 2013 +0100 BUG: no buffer reuse in nditer, if there was no buffer previously If the buffer is used or not can possible change during iteration. In this case, the buffer cannot be reused if it never existed... commit 5c154dfdde7e143a24fa6849cf7a47f4cc2ecf56 Author: Sebastian Berg Date: Sun Mar 17 16:50:01 2013 +0100 TST: Add (slow) nditer reduce buffer reuse test commit 9899d23f422abb83f84e637400fc2ce23ed0a322 Author: Sebastian Berg Date: Sun Mar 17 16:04:36 2013 +0100 BUG: Do not reuse nditer buffers when not filled enough. This checks if the previous time around, the buffers were filled with as much data as they would be filled this time around. Since This is difficult for the initial loop before reusing is activated because in that case the buffer may be larger then just the first outer reduce dimension. In that case do not allow reuse unless the index along that dimension was 0. When the inner reduce index is not 0, then also the reusing of the buffer is dangerous. From charlesr.harris at gmail.com Sun Oct 20 13:35:28 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Sun, 20 Oct 2013 11:35:28 -0600 Subject: [Numpy-discussion] Numpy 1.8.0 release Message-ID: Hi All, I'm planning on releasing Numpy 1.8.0 next weekend. There have been a few minor fixes since 1.8.0rc2, but nothing that I think warrants another rc release. Please make sure to test the 1.8.0rc2 or maintenance/1.8.x branch with your code, for after next weekend it will be too late. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From jdlmail at gmail.com Sun Oct 20 13:55:43 2013 From: jdlmail at gmail.com (Yaron de Leeuw) Date: Sun, 20 Oct 2013 19:55:43 +0200 Subject: [Numpy-discussion] pull request: organization of the npyio.recfromcsv function arguments Message-ID: Hi all. I've submitted a pull request to organise the recfromcsv arguments. It closes a known bug, and is a bit more sane in my opinion. I would appreciate your feedback. 
https://github.com/numpy/numpy/pull/3830 Thank you, Yaron -------------- next part -------------- An HTML attachment was scrubbed... URL: From juanlu001 at gmail.com Sun Oct 20 17:01:23 2013 From: juanlu001 at gmail.com (Juan Luis Cano) Date: Sun, 20 Oct 2013 23:01:23 +0200 Subject: [Numpy-discussion] ANN: SciPy 0.13.0 release In-Reply-To: References: Message-ID: <526444A3.9090602@gmail.com> On 10/19/2013 11:40 PM, Ralf Gommers wrote: > On behalf of the SciPy development team I'm pleased to announce the > availability of SciPy 0.13.0. This release contains some interesting > new features (see highlights below) and half a year's worth of > maintenance work. 65 people contributed to this release. Congrats! Only a tiny little thing: the docs still point to 0.12.0: http://docs.scipy.org/doc/scipy/reference/ > > Some of the highlights are: > > - support for fancy indexing and boolean comparisons with sparse > matrices > - interpolative decompositions and matrix functions in the linalg module > - two new trust-region solvers for unconstrained minimization > > This release requires Python 2.6, 2.7 or 3.1-3.3 and NumPy 1.5.1 or > greater. Support for Python 2.4 and 2.5 has been dropped as of this > release. > > Sources and binaries can be found at > http://sourceforge.net/projects/scipy/files/scipy/0.13.0/, release > notes are copied below. > > Enjoy, > Ralf > > > > > ========================== > SciPy 0.13.0 Release Notes > ========================== > > .. contents:: > > SciPy 0.13.0 is the culmination of 7 months of hard work. It contains > many new features, numerous bug-fixes, improved test coverage and > better documentation. There have been a number of deprecations and > API changes in this release, which are documented below. All users > are encouraged to upgrade to this release, as there are a large number > of bug-fixes and optimizations. Moreover, our development attention > will now shift to bug-fix releases on the 0.13.x branch, and on adding > new features on the master branch. > > This release requires Python 2.6, 2.7 or 3.1-3.3 and NumPy 1.5.1 or > greater. > Highlights of this release are: > > - support for fancy indexing and boolean comparisons with sparse > matrices > - interpolative decompositions and matrix functions in the linalg module > - two new trust-region solvers for unconstrained minimization > > > New features > ============ > > ``scipy.integrate`` improvements > -------------------------------- > > N-dimensional numerical integration > ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ > > A new function `scipy.integrate.nquad`, which provides N-dimensional > integration functionality with a more flexible interface than > ``dblquad`` and > ``tplquad``, has been added. > > ``dopri*`` improvements > ^^^^^^^^^^^^^^^^^^^^^^^ > > The intermediate results from the ``dopri`` family of ODE solvers can > now be > accessed by a *solout* callback function. > > > ``scipy.linalg`` improvements > ----------------------------- > > Interpolative decompositions > ^^^^^^^^^^^^^^^^^^^^^^^^^^^^ > > Scipy now includes a new module `scipy.linalg.interpolative` > containing routines for computing interpolative matrix decompositions > (ID). This feature is based on the ID software package by > P.G. Martinsson, V. Rokhlin, Y. Shkolnisky, and M. Tygert, previously > adapted for Python in the PymatrixId package by K.L. Ho. > > Polar decomposition > ^^^^^^^^^^^^^^^^^^^ > > A new function `scipy.linalg.polar`, to compute the polar decomposition > of a matrix, was added. 
> > BLAS level 3 functions > ^^^^^^^^^^^^^^^^^^^^^^ > > The BLAS functions ``symm``, ``syrk``, ``syr2k``, ``hemm``, ``herk`` and > ``her2k`` are now wrapped in `scipy.linalg`. > > Matrix functions > ^^^^^^^^^^^^^^^^ > > Several matrix function algorithms have been implemented or updated > following > detailed descriptions in recent papers of Nick Higham and his co-authors. > These include the matrix square root (``sqrtm``), the matrix logarithm > (``logm``), the matrix exponential (``expm``) and its Frechet derivative > (``expm_frechet``), and fractional matrix powers > (``fractional_matrix_power``). > > > ``scipy.optimize`` improvements > ------------------------------- > > Trust-region unconstrained minimization algorithms > ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ > > The ``minimize`` function gained two trust-region solvers for > unconstrained > minimization: ``dogleg`` and ``trust-ncg``. > > > ``scipy.sparse`` improvements > ----------------------------- > > Boolean comparisons and sparse matrices > ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ > > All sparse matrix types now support boolean data, and boolean > operations. Two > sparse matrices `A` and `B` can be compared in all the expected ways > `A < B`, > `A >= B`, `A != B`, producing similar results as dense Numpy arrays. > Comparisons with dense matrices and scalars are also supported. > > CSR and CSC fancy indexing > ^^^^^^^^^^^^^^^^^^^^^^^^^^ > > Compressed sparse row and column sparse matrix types now support fancy > indexing > with boolean matrices, slices, and lists. So where A is a (CSC or CSR) > sparse > matrix, you can do things like:: > > >>> A[A > 0.5] = 1 # since Boolean sparse matrices work > >>> A[:2, :3] = 2 > >>> A[[1,2], 2] = 3 > > > ``scipy.sparse.linalg`` improvements > ------------------------------------ > > The new function ``onenormest`` provides a lower bound of the 1-norm of a > linear operator and has been implemented according to Higham and Tisseur > (2000). This function is not only useful for sparse matrices, but can > also be > used to estimate the norm of products or powers of dense matrices without > explictly building the intermediate matrix. > > The multiplicative action of the matrix exponential of a linear operator > (``expm_multiply``) has been implemented following the description in > Al-Mohy > and Higham (2011). > > Abstract linear operators (`scipy.sparse.linalg.LinearOperator`) can > now be > multiplied, added to each other, and exponentiated, producing new linear > operators. This enables easier construction of composite linear > operations. > > > ``scipy.spatial`` improvements > ------------------------------ > > The vertices of a `ConvexHull` can now be accessed via the `vertices` > attribute, > which gives proper orientation in 2-D. > > > ``scipy.signal`` improvements > ----------------------------- > > The cosine window function `scipy.signal.cosine` was added. > > > ``scipy.special`` improvements > ------------------------------ > > New functions `scipy.special.xlogy` and `scipy.special.xlog1py` were > added. > These functions can simplify and speed up code that has to calculate > ``x * log(y)`` and give 0 when ``x == 0``. > > > ``scipy.io `` improvements > ------------------------- > > Unformatted Fortran file reader > ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ > > The new class `scipy.io.FortranFile` facilitates reading unformatted > sequential files written by Fortran code. 
> > ``scipy.io.wavfile`` enhancements > ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ > > `scipy.io.wavfile.write` now accepts a file buffer. Previously it only > accepted a filename. > > `scipy.io.wavfile.read` and `scipy.io.wavfile.write` can now handle > floating > point WAV files. > > > ``scipy.interpolate`` improvements > ---------------------------------- > > B-spline derivatives and antiderivatives > ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ > > `scipy.interpolate.splder` and `scipy.interpolate.splantider` functions > for computing B-splines that represent derivatives and antiderivatives > of B-splines were added. These functions are also available in the > class-based FITPACK interface as ``UnivariateSpline.derivative`` and > ``UnivariateSpline.antiderivative``. > > > ``scipy.stats`` improvements > ---------------------------- > > Distributions now allow using keyword parameters in addition to > positional parameters in all methods. > > The function `scipy.stats.power_divergence` has been added for the > Cressie-Read power divergence statistic and goodness of fit test. > Included in this family of statistics is the "G-test" > (http://en.wikipedia.org/wiki/G-test). > > `scipy.stats.mood` now accepts multidimensional input. > > An option was added to `scipy.stats.wilcoxon` for continuity correction. > > `scipy.stats.chisquare` now has an `axis` argument. > > `scipy.stats.mstats.chisquare` now has `axis` and `ddof` arguments. > > > Deprecated features > =================== > > ``expm2`` and ``expm3`` > ----------------------- > > The matrix exponential functions `scipy.linalg.expm2` and > `scipy.linalg.expm3` > are deprecated. All users should use the numerically more robust > `scipy.linalg.expm` function instead. > > ``scipy.stats`` functions > ------------------------- > > `scipy.stats.oneway` is deprecated; `scipy.stats.f_oneway` should be used > instead. > > `scipy.stats.glm` is deprecated. `scipy.stats.ttest_ind` is an equivalent > function; more full-featured general (and generalized) linear model > implementations can be found in statsmodels. > > `scipy.stats.cmedian` is deprecated; ``numpy.median`` should be used > instead. > > > Backwards incompatible changes > ============================== > > LIL matrix assignment > --------------------- > Assigning values to LIL matrices with two index arrays now works > similarly as > assigning into ndarrays:: > > >>> x = lil_matrix((3, 3)) > >>> x[[0,1,2],[0,1,2]]=[0,1,2] > >>> x.todense() > matrix([[ 0., 0., 0.], > [ 0., 1., 0.], > [ 0., 0., 2.]]) > > rather than giving the result:: > > >>> x.todense() > matrix([[ 0., 1., 2.], > [ 0., 1., 2.], > [ 0., 1., 2.]]) > > Users relying on the previous behavior will need to revisit their code. > The previous behavior is obtained by ``x[numpy.ix_([0,1,2],[0,1,2])] = > ...`. > > > Deprecated ``radon`` function removed > ------------------------------------- > > The ``misc.radon`` function, which was deprecated in scipy 0.11.0, has > been > removed. Users can find a more full-featured ``radon`` function in > scikit-image. > > > Removed deprecated keywords ``xa`` and ``xb`` from ``stats.distributions`` > -------------------------------------------------------------------------- > > The keywords ``xa`` and ``xb``, which were deprecated since 0.11.0, have > been removed from the distributions in ``scipy.stats``. 
> > Changes to MATLAB file readers / writers > ---------------------------------------- > > The major change is that 1D arrays in numpy now become row vectors > (shape 1, N) > when saved to a MATLAB 5 format file. Previously 1D arrays saved as > column > vectors (N, 1). This is to harmonize the behavior of writing MATLAB 4 > and 5 > formats, and adapt to the defaults of numpy and MATLAB - for example > ``np.atleast_2d`` returns 1D arrays as row vectors. > > Trying to save arrays of greater than 2 dimensions in MATLAB 4 format > now raises > an error instead of silently reshaping the array as 2D. > > ``scipy.io.loadmat('afile')`` used to look for `afile` on the Python > system path > (``sys.path``); now ``loadmat`` only looks in the current directory for a > relative path filename. > > > Other changes > ============= > > Security fix: ``scipy.weave`` previously used temporary directories in an > insecure manner under certain circumstances. > > Cython is now required to build *unreleased* versions of scipy. > The C files generated from Cython sources are not included in the git repo > anymore. They are however still shipped in source releases. > > The code base received a fairly large PEP8 cleanup. A ``tox pep8`` > command has been added; new code should pass this test command. > > Scipy cannot be compiled with gfortran 4.1 anymore (at least on RH5), > likely > due to that compiler version not supporting entry constructs well. > > > Authors > ======= > > This release contains work by the following people (contributed at least > one patch to this release, names in alphabetical order): > > * Jorge Ca?ardo Alastuey + > * Tom Aldcroft + > * Max Bolingbroke + > * Joseph Jon Booker + > * Fran?ois Boulogne > * Matthew Brett > * Christian Brodbeck + > * Per Brodtkorb + > * Christian Brueffer + > * Lars Buitinck > * Evgeni Burovski + > * Tim Cera > * Lawrence Chan + > * David Cournapeau > * Draz?en Luc?anin + > * Alexander J. Dunlap + > * endolith > * Andr? Gaul + > * Christoph Gohlke > * Ralf Gommers > * Alex Griffing + > * Blake Griffith + > * Charles Harris > * Bob Helmbold + > * Andreas Hilboll > * Kat Huang + > * Oleksandr (Sasha) Huziy + > * Gert-Ludwig Ingold + > * Thouis (Ray) Jones > * Juan Luis Cano Rodr?guez + > * Robert Kern > * Andreas Kloeckner + > * Sytse Knypstra + > * Gustav Larsson + > * Denis Laxalde > * Christopher Lee > * Tim Leslie > * Wendy Liu + > * Clemens Novak + > * Takuya Oshima + > * Josef Perktold > * Illia Polosukhin + > * Przemek Porebski + > * Steve Richardson + > * Branden Rolston + > * Skipper Seabold > * Fazlul Shahriar > * Leo Singer + > * Rohit Sivaprasad + > * Daniel B. Smith + > * Julian Taylor > * Louis Thibault + > * Tomas Tomecek + > * John Travers > * Richard Tsai + > * Jacob Vanderplas > * Patrick Varilly > * Pauli Virtanen > * Stefan van der Walt > * Warren Weckesser > * Pedro Werneck + > * Nils Werner + > * Michael Wimmer + > * Nathan Woods + > * Tony S. Yu + > > A total of 65 people contributed to this release. > People with a "+" by their names contributed a patch for the first time. > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From pav at iki.fi Sun Oct 20 18:04:52 2013 From: pav at iki.fi (Pauli Virtanen) Date: Mon, 21 Oct 2013 01:04:52 +0300 Subject: [Numpy-discussion] ANN: SciPy 0.13.0 release In-Reply-To: <526444A3.9090602@gmail.com> References: <526444A3.9090602@gmail.com> Message-ID: 21.10.2013 00:01, Juan Luis Cano kirjoitti: > On 10/19/2013 11:40 PM, Ralf Gommers wrote: >> On behalf of the SciPy development team I'm pleased to announce the >> availability of SciPy 0.13.0. This release contains some interesting >> new features (see highlights below) and half a year's worth of >> maintenance work. 65 people contributed to this release. > > Congrats! > > Only a tiny little thing: the docs still point to 0.12.0: > > http://docs.scipy.org/doc/scipy/reference/ Should be fixed now. -- Pauli Virtanen From stefan at sun.ac.za Mon Oct 21 02:48:41 2013 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Mon, 21 Oct 2013 08:48:41 +0200 Subject: [Numpy-discussion] ANN: scikit-image 0.9 release Message-ID: We're happy to announce the release of scikit-image v0.9.0! scikit-image is an image processing toolbox for SciPy that includes algorithms for segmentation, geometric transformations, color space manipulation, analysis, filtering, morphology, feature detection, and more. For more information, examples, and documentation, please visit our website: http://scikit-image.org New Features ------------ `scikit-image` now runs without translation under both Python 2 and 3. In addition to several bug fixes, speed improvements and examples, the 204 pull requests merged for this release include the following new features (PR number in brackets): Segmentation: - 3D support in SLIC segmentation (#546) - SLIC voxel spacing (#719) - Generalized anisotropic spacing support for random_walker (#775) - Yen threshold method (#686) Transforms and filters: - SART algorithm for tomography reconstruction (#584) - Gabor filters (#371) - Hough transform for ellipses (#597) - Fast resampling of nD arrays (#511) - Rotation axis center for Radon transforms with inverses. (#654) - Reconstruction circle in inverse Radon transform (#567) - Pixelwise image adjustment curves and methods (#505) Feature detection: - [experimental API] BRIEF feature descriptor (#591) - [experimental API] Censure (STAR) Feature Detector (#668) - Octagon structural element (#669) - Add non rotation invariant uniform LBPs (#704) Color and noise: - Add deltaE color comparison and lab2lch conversion (#665) - Isotropic denoising (#653) - Generator to add various types of random noise to images (#625) - Color deconvolution for immunohistochemical images (#441) - Color label visualization (#485) Drawing and visualization: - Wu's anti-aliased circle, line, bezier curve (#709) - Linked image viewers and docked plugins (#575) - Rotated ellipse + bezier curve drawing (#510) - PySide & PyQt4 compatibility in skimage-viewer (#551) Other: - Python 3 support without 2to3. (#620) - 3D Marching Cubes (#469) - Line, Circle, Ellipse total least squares fitting and RANSAC algorithm (#440) - N-dimensional array padding (#577) - Add a wrapper around `scipy.ndimage.gaussian_filter` with useful default behaviors. 
(#712) - Predefined structuring elements for 3D morphology (#484) API changes ----------- The following backward-incompatible API changes were made between 0.8 and 0.9: - No longer wrap ``imread`` output in an ``Image`` class - Change default value of `sigma` parameter in ``skimage.segmentation.slic`` to 0 - ``hough_circle`` now returns a stack of arrays that are the same size as the input image. Set the ``full_output`` flag to True for the old behavior. - The following functions were deprecated over two releases: `skimage.filter.denoise_tv_chambolle`, `skimage.morphology.is_local_maximum`, `skimage.transform.hough`, `skimage.transform.probabilistic_hough`,`skimage.transform.hough_peaks`. Their functionality still exists, but under different names. Contributors to this release ---------------------------- This release was made possible by the collaborative efforts of many contributors, both new and old. They are listed in alphabetical order by surname: - Ankit Agrawal - K.-Michael Aye - Chris Beaumont - Fran?ois Boulogne - Luis Pedro Coelho - Marianne Corvellec - Olivier Debeir - Ferdinand Deger - Kemal Eren - Jostein B? Fl?ystad - Christoph Gohlke - Emmanuelle Gouillart - Christian Horea - Thouis (Ray) Jones - Almar Klein - Xavier Moles Lopez - Alexis Mignon - Juan Nunez-Iglesias - Zachary Pincus - Nicolas Pinto - Davin Potts - Malcolm Reynolds - Umesh Sharma - Johannes Sch?nberger - Chintak Sheth - Kirill Shklovsky - Steven Silvester - Matt Terry - Riaan van den Dool - St?fan van der Walt - Josh Warner - Adam Wisniewski - Yang Zetian - Tony S Yu From mads.ipsen at gmail.com Mon Oct 21 03:20:12 2013 From: mads.ipsen at gmail.com (Mads Ipsen) Date: Mon, 21 Oct 2013 09:20:12 +0200 Subject: [Numpy-discussion] Porting to the new C-API (1.7) Message-ID: <5264D5AC.7020408@gmail.com> Hi, I've been using the numpy-1.6 C-API as part of a large C++ based OpenGL application. The C++ classes are exposed in Python by using SWIG, and utilize numpy arrays both as inputs to methods and method return values to the Python caller. To enable numpy in the SWIG generated Python module, the SWIG generated C++ file define #define PY_ARRAY_UNIQUE_SYMBOL PyArray_API whereas all other C++ files that need access to the numpy C-API contain #define NO_IMPORT_ARRAY #include "numpy/arrayobject.h" I have now updated to numpy-1.7, and receive warnings of the form #warning "Using deprecated NumPy API, disable it by #defining NPY_NO_DEPRECATED_API NPY_1_7_API_VERSION" This is basically fine with me, and I don't mind doing an update of my code to the new C-API. I have a few questions though: 1) Since I am apparently using the old API, where can I find a list of the deprecated things I use? That would make the upgrade easier. 2) Do I still have to use the PY_ARRAY_UNIQUE_SYMBOL approach when using the new C-API. 3) According to some websites you can do something like #define PY_ARRAY_UNIQUE_SYMBOL PyArrayXXX This puzzles me a bit. Is there a doc somewhere where this whole thing is explained in detail. I must admit, its somewhat hard to grasp what's going on. Best regards, Mads -- +-----------------------------------------------------+ | Mads Ipsen | +----------------------+------------------------------+ | G?seb?ksvej 7, 4. 
tv | | | DK-2500 Valby | phone: +45-29716388 | | Denmark | email: mads.ipsen at gmail.com | +----------------------+------------------------------+ From Marc.Poinot at onera.fr Mon Oct 21 07:22:33 2013 From: Marc.Poinot at onera.fr (Marc.Poinot at onera.fr) Date: Mon, 21 Oct 2013 13:22:33 +0200 Subject: [Numpy-discussion] Contiguous memory zone with C API Message-ID: <52650E79.9020601@onera.fr> Hi all, I try to change my old C API numpy code to the 'new' API. I used to hack some internal stuff in Numpy (yes it's bad...) and I wonder now how to change it. Let's take an example: I have ocount numpy arrays with data allocated elsewhere than numpy, the data pointer of the PyArrayObject is set with the malloc'd zone. Now I select one of these array to be the memory base for all of them, then I realloc each data pointer to make sure the ocount of them have a contiguous zone. The code is now 'rejected' by the new API, how can I do that without hacking into the PyArray_Object? first=(PyArrayObject*)context->ctg_obj[0]; psize=PyArray_NBYTES(first); for (j=1;jctg_obj[j]; tsize=PyArray_NBYTES(current); psize+=tsize; ((PyArrayObject*)first)->data=realloc(PyArray_DATA(first),psize); /* *** how to do that with the API ? */ memcpy(PyArray_DATA(first)+psize-tsize,PyArray_DATA(current),tsize); free(PyArray_DATA(current)); ((PyArrayObject*)current)->data=PyArray_DATA(first)+psize-tsize; } I use that trick to make sure that separate numpy each representing a coordinate of a vector can be gather in a single array. I've had a look at PyArray_resize but it requires a own_data flag which I do not have... Any hint, remark? -- -- ---------------------------------------------------------------------------- -- Marc POINOT [ONERA/DSNA] Tel:+33.1.46.73.42.84 Fax:+33.1.46.73.41.66 -- Avertissement/disclaimer http://www.onera.fr/en/emails-terms -- ---------------------------------------------------------------------------- -- From nouiz at nouiz.org Mon Oct 21 11:33:14 2013 From: nouiz at nouiz.org (=?ISO-8859-1?Q?Fr=E9d=E9ric_Bastien?=) Date: Mon, 21 Oct 2013 11:33:14 -0400 Subject: [Numpy-discussion] Numpy 1.8.0 release In-Reply-To: References: Message-ID: Hi, This is to tell that all Theano tests pass with the branch 1.8.x with the commit 397fdec2a2c thanks Fr?d?ric On Sun, Oct 20, 2013 at 1:35 PM, Charles R Harris wrote: > Hi All, > > I'm planning on releasing Numpy 1.8.0 next weekend. There have been a few > minor fixes since 1.8.0rc2, but nothing that I think warrants another rc > release. Please make sure to test the 1.8.0rc2 or maintenance/1.8.x branch > with your code, for after next weekend it will be too late. > > Chuck > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From charlesr.harris at gmail.com Mon Oct 21 13:04:18 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Mon, 21 Oct 2013 11:04:18 -0600 Subject: [Numpy-discussion] Numpy 1.8.0 release In-Reply-To: References: Message-ID: On Mon, Oct 21, 2013 at 9:33 AM, Fr?d?ric Bastien wrote: > Hi, > > This is to tell that all Theano tests pass with the branch 1.8.x with > the commit 397fdec2a2c > > thanks > > Fr?d?ric > Thanks for testing Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From mb at linux.vnet.ibm.com Mon Oct 21 13:53:02 2013 From: mb at linux.vnet.ibm.com (Mounir E. 
Bsaibes) Date: Mon, 21 Oct 2013 12:53:02 -0500 Subject: [Numpy-discussion] Is there a contributors agreement for numypy? Message-ID: <1382377982.24732.12.camel@mblaptop> I am checking whether there is a Contributor's agreement that new contributors have to sign. Or, whether there was one for the documentation marathon. Does anyone knows the answer to this question or can point me to where I can possibly find the answer? Thanks, Mounir From charlesr.harris at gmail.com Mon Oct 21 14:00:42 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Mon, 21 Oct 2013 12:00:42 -0600 Subject: [Numpy-discussion] Is there a contributors agreement for numypy? In-Reply-To: <1382377982.24732.12.camel@mblaptop> References: <1382377982.24732.12.camel@mblaptop> Message-ID: On Mon, Oct 21, 2013 at 11:53 AM, Mounir E. Bsaibes wrote: > I am checking whether there is a Contributor's agreement that new > contributors have to sign. Or, whether there was one for the > documentation marathon. > > Does anyone knows the answer to this question or can point me to where I > can possibly find the answer? > > Thanks, > Mounir > > There is no agreement needed, but all numpy is released under the simplified BSD license and any contributions need to be compatible with that. I don't know that there is any special license for the documentation. Anyone? Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From pav at iki.fi Mon Oct 21 14:23:06 2013 From: pav at iki.fi (Pauli Virtanen) Date: Mon, 21 Oct 2013 21:23:06 +0300 Subject: [Numpy-discussion] Is there a contributors agreement for numypy? In-Reply-To: References: <1382377982.24732.12.camel@mblaptop> Message-ID: 21.10.2013 21:00, Charles R Harris kirjoitti: [clip] > There is no agreement needed, but all numpy is released under the > simplified BSD license and any contributions need to be compatible > with that. I don't know that there is any special license for the > documentation. Anyone? I don't think the documentation has a separate license; also BSD. -- Pauli Virtanen From mb at linux.vnet.ibm.com Mon Oct 21 15:36:11 2013 From: mb at linux.vnet.ibm.com (Mounir E. Bsaibes) Date: Mon, 21 Oct 2013 14:36:11 -0500 Subject: [Numpy-discussion] Is there a contributors agreement for numypy? In-Reply-To: References: <1382377982.24732.12.camel@mblaptop> Message-ID: <1382384171.24732.58.camel@mblaptop> On Mon, 2013-10-21 at 21:23 +0300, Pauli Virtanen wrote: > 21.10.2013 21:00, Charles R Harris kirjoitti: > [clip] > > There is no agreement needed, but all numpy is released under the > > simplified BSD license and any contributions need to be compatible > > with that. I don't know that there is any special license for the > > documentation. Anyone? > > I don't think the documentation has a separate license; also BSD. > How the contributors know that their contributions would be released under BSD ? From pav at iki.fi Mon Oct 21 15:39:53 2013 From: pav at iki.fi (Pauli Virtanen) Date: Mon, 21 Oct 2013 22:39:53 +0300 Subject: [Numpy-discussion] Is there a contributors agreement for numypy? In-Reply-To: <1382384171.24732.58.camel@mblaptop> References: <1382377982.24732.12.camel@mblaptop> <1382384171.24732.58.camel@mblaptop> Message-ID: 21.10.2013 22:36, Mounir E. 
Bsaibes kirjoitti: > On Mon, 2013-10-21 at 21:23 +0300, Pauli Virtanen wrote: >> 21.10.2013 21:00, Charles R Harris kirjoitti: >> [clip] >>> There is no agreement needed, but all numpy is released under the >>> simplified BSD license and any contributions need to be compatible >>> with that. I don't know that there is any special license for the >>> documentation. Anyone? >> >> I don't think the documentation has a separate license; also BSD. > > How the contributors know that their contributions would be released > under BSD ? The project is BSD-licensed. Contributing implies agreement to the license. -- Pauli Virtanen From chris.barker at noaa.gov Mon Oct 21 23:29:52 2013 From: chris.barker at noaa.gov (Chris Barker) Date: Mon, 21 Oct 2013 20:29:52 -0700 Subject: [Numpy-discussion] "official" binaries on web page. Message-ID: If you go to numpy.org, and try to figure out how to install numpy, you are most likely to end up here: http://www.scipy.org/install.html where there is no mention of the binaries built by the numpy project itself, either Windows or Mac. There probably should be. -Chris -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov From Marc.Poinot at onera.fr Tue Oct 22 05:31:05 2013 From: Marc.Poinot at onera.fr (Marc.Poinot at onera.fr) Date: Tue, 22 Oct 2013 11:31:05 +0200 Subject: [Numpy-discussion] Contiguous memory zone with C API In-Reply-To: <52650E79.9020601@onera.fr> References: <52650E79.9020601@onera.fr> Message-ID: <526645D9.7040404@onera.fr> On 10/21/13 13:22, Marc.Poinot at onera.fr wrote: > Hi all, > > I try to change my old C API numpy code to the 'new' API. I used to hack some internal stuff in Numpy (yes it's bad...) and I wonder now how to change it. > Ok, I've used PyDataMem_RENEW to reallocate through the API, but there's no way to change data pointer or ownership flag. I think the API miss with some kind of: PyDataMem_SET(PyArrayObject *object, void *ptr) to set actual data field (hidden now in PyArrayObject_fields) and PyArray_SETOWNDATA(arr) PyArray_UNSETOWNDATA(arr) to set/unset OWN DATA flag if you change memory from external allocation (at my own risks) PyArray_ISOWNDATA(arr) to check it This is more or less what is performed in shape.c or ctors.c for example, I use to operate on arr->data and arr->flags but if you plan to hide it and to move it 'to a private header in a future release' it would be nice to have some macros. -- -- ---------------------------------------------------------------------------- -- Marc POINOT [ONERA/DSNA] Tel:+33.1.46.73.42.84 Fax:+33.1.46.73.41.66 -- Avertissement/disclaimer http://www.onera.fr/en/emails-terms -- ---------------------------------------------------------------------------- -- From pav at iki.fi Tue Oct 22 09:07:17 2013 From: pav at iki.fi (Pauli Virtanen) Date: Tue, 22 Oct 2013 16:07:17 +0300 Subject: [Numpy-discussion] "official" binaries on web page. In-Reply-To: References: Message-ID: 22.10.2013 06:29, Chris Barker kirjoitti: > If you go to numpy.org, and try to figure out how to install numpy, > you are most likely to end up here: > > http://www.scipy.org/install.html > > where there is no mention of the binaries built by the numpy project > itself, either Windows or Mac. 
The links are there: http://www.scipy.org/install.html#custom -- Pauli Virtanen From chris.barker at noaa.gov Tue Oct 22 10:28:54 2013 From: chris.barker at noaa.gov (Chris Barker) Date: Tue, 22 Oct 2013 07:28:54 -0700 Subject: [Numpy-discussion] "official" binaries on web page. In-Reply-To: References: Message-ID: On Tue, Oct 22, 2013 at 6:07 AM, Pauli Virtanen wrote: > 22.10.2013 06:29, Chris Barker kirjoitti: >> If you go to numpy.org, and try to figure out how to install numpy, >> you are most likely to end up here: >> >> http://www.scipy.org/install.html >> >> where there is no mention of the binaries built by the numpy project >> itself, either Windows or Mac. > > The links are there: http://www.scipy.org/install.html#custom Boy! that's subtle -- I literally looked at least 3-4 times and didn't notice. It seems a bit odd that they are under "custom", and kind of as a side note: "projects? sites may also offer official binary packages (e.g. numpy, scipy library)" and "may" -- this is the official site, and we don't _know_ if binaries are provided? Anyway, a lot of effort goes into those, it'd be nice for it to be more prominent. -Chris -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov From jjhelmus at gmail.com Tue Oct 22 10:48:10 2013 From: jjhelmus at gmail.com (Jonathan Helmus) Date: Tue, 22 Oct 2013 09:48:10 -0500 Subject: [Numpy-discussion] "official" binaries on web page. In-Reply-To: References: Message-ID: <5266902A.8040606@gmail.com> On 10/22/2013 09:28 AM, Chris Barker wrote: > On Tue, Oct 22, 2013 at 6:07 AM, Pauli Virtanen wrote: >> 22.10.2013 06:29, Chris Barker kirjoitti: >>> If you go to numpy.org, and try to figure out how to install numpy, >>> you are most likely to end up here: >>> >>> http://www.scipy.org/install.html >>> >>> where there is no mention of the binaries built by the numpy project >>> itself, either Windows or Mac. >> The links are there: http://www.scipy.org/install.html#custom > Boy! that's subtle -- I literally looked at least 3-4 times and didn't notice. > > It seems a bit odd that they are under "custom", and kind of as a side note: > > "projects? sites may also offer official binary packages (e.g. numpy, > scipy library)" > > and "may" -- this is the official site, and we don't _know_ if > binaries are provided? > > Anyway, a lot of effort goes into those, it'd be nice for it to be > more prominent. > > -Chris That page is generated from the rst file https://github.com/scipy/scipy.org/blob/master/www/install.rst. I'm sure a Pull requests against that repository would be welcome. You can even do it with an online editor, https://github.com/scipy/scipy.org/edit/master/www/install.rst! - Jonathan Helmus From ralf.gommers at gmail.com Tue Oct 22 12:36:07 2013 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Tue, 22 Oct 2013 18:36:07 +0200 Subject: [Numpy-discussion] "official" binaries on web page. In-Reply-To: References: Message-ID: On Tue, Oct 22, 2013 at 4:28 PM, Chris Barker wrote: > On Tue, Oct 22, 2013 at 6:07 AM, Pauli Virtanen wrote: > > 22.10.2013 06:29, Chris Barker kirjoitti: > >> If you go to numpy.org, and try to figure out how to install numpy, > >> you are most likely to end up here: > >> > >> http://www.scipy.org/install.html > >> > >> where there is no mention of the binaries built by the numpy project > >> itself, either Windows or Mac. 
> > > > The links are there: http://www.scipy.org/install.html#custom > > Boy! that's subtle -- I literally looked at least 3-4 times and didn't > notice. > > It seems a bit odd that they are under "custom", and kind of as a side > note: > > "projects? sites may also offer official binary packages (e.g. numpy, > scipy library)" > > and "may" -- this is the official site, and we don't _know_ if > binaries are provided? > > Anyway, a lot of effort goes into those, it'd be nice for it to be > more prominent. > You can argue with the exact wording, but the layout of that page is on purpose. scipy.org is split into two parts: (a) a SciPy Stack part, and (b) a numpy & scipy library part. You're looking at the stack part, and the preferred method to install that stack is a Python distribution. If you look at the library part, the binary installers are at the top of the page: http://www.scipy.org/scipylib/download.html Cheers, Ralf > -Chris > > > > > > -- > > Christopher Barker, Ph.D. > Oceanographer > > Emergency Response Division > NOAA/NOS/OR&R (206) 526-6959 voice > 7600 Sand Point Way NE (206) 526-6329 fax > Seattle, WA 98115 (206) 526-6317 main reception > > Chris.Barker at noaa.gov > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From cgohlke at uci.edu Tue Oct 22 16:35:27 2013 From: cgohlke at uci.edu (Christoph Gohlke) Date: Tue, 22 Oct 2013 13:35:27 -0700 Subject: [Numpy-discussion] Numpy 1.8.0 release In-Reply-To: References: Message-ID: <5266E18F.1090503@uci.edu> On 10/20/2013 10:35 AM, Charles R Harris wrote: > Hi All, > > I'm planning on releasing Numpy 1.8.0 next weekend. There have been a > few minor fixes since 1.8.0rc2, but nothing that I think warrants > another rc release. Please make sure to test the 1.8.0rc2 or > maintenance/1.8.x branch with your code, for after next weekend it will > be too late. > > Chuck > Hello, how about backpoting "BUG: fix small issues found with cppcheck" to 1.7.x? Christoph From chris.barker at noaa.gov Tue Oct 22 16:44:51 2013 From: chris.barker at noaa.gov (Chris Barker) Date: Tue, 22 Oct 2013 13:44:51 -0700 Subject: [Numpy-discussion] "official" binaries on web page. In-Reply-To: References: Message-ID: On Tue, Oct 22, 2013 at 9:36 AM, Ralf Gommers wrote: > You can argue with the exact wording, I won't argue, I'll suggest an alternative in a pull request... > but the layout of that page is on > purpose. scipy.org is split into two parts: (a) a SciPy Stack part, and (b) > a numpy & scipy library part. You're looking at the stack part, and the > preferred method to install that stack is a Python distribution. I'm not sure who decided that that's the "preferred" way, but If we take that as a given, maybe that page is fine. However: > If you look at the library part, the binary installers are at the top of the > page: http://www.scipy.org/scipylib/download.html But finding that page is not obvious. Simple usability: I know what the heck I'm looking at -- I've been using numpy, and numeric before it for something like 15 years. But I just went to grab a new binary, and had a heck of a time finding it -- I'm pretty sure I ended up remembering that the downloads are on sourceforge and looking there. And I've been teaching lately, and my students are having trouble finding them, too. 
If you google "numpy" you get the numpy.org page as the first hit If you go to numpy.org, there is a "geting numpy" link, that takes you to: http://www.scipy.org/install.html There is also a "Installing NumPy and SciPy" link that takes you to that same page. There is a link to the sourceforge page further won, but no indication of why you would want to go there. So I still have no idea how I would find: http://www.scipy.org/scipylib/download.html So I submit that we could fix this up a bit.... (off to do a pull request) -Chris > Cheers, > Ralf > > >> >> -Chris >> >> >> >> >> >> -- >> >> Christopher Barker, Ph.D. >> Oceanographer >> >> Emergency Response Division >> NOAA/NOS/OR&R (206) 526-6959 voice >> 7600 Sand Point Way NE (206) 526-6329 fax >> Seattle, WA 98115 (206) 526-6317 main reception >> >> Chris.Barker at noaa.gov >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov From chris.barker at noaa.gov Tue Oct 22 17:14:52 2013 From: chris.barker at noaa.gov (Chris Barker) Date: Tue, 22 Oct 2013 14:14:52 -0700 Subject: [Numpy-discussion] "official" binaries on web page. In-Reply-To: References: Message-ID: On Tue, Oct 22, 2013 at 1:44 PM, Chris Barker wrote: > (off to do a pull request) Done. https://github.com/scipy/scipy.org/pull/30 -Chris -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov From chris.barker at noaa.gov Tue Oct 22 17:18:53 2013 From: chris.barker at noaa.gov (Chris Barker) Date: Tue, 22 Oct 2013 14:18:53 -0700 Subject: [Numpy-discussion] "official" binaries on web page. In-Reply-To: References: Message-ID: By the way: On Tue, Oct 22, 2013 at 1:44 PM, Chris Barker wrote: > If you google "numpy" you get the numpy.org page as the first hit > > If you go to numpy.org, there is a "geting numpy" link, that takes you to: > > http://www.scipy.org/install.html The www.numpy.org page could perhaps use some updating, but I can't find that one on gitHub. -Chris -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov From ralf.gommers at gmail.com Tue Oct 22 17:23:12 2013 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Tue, 22 Oct 2013 23:23:12 +0200 Subject: [Numpy-discussion] "official" binaries on web page. In-Reply-To: References: Message-ID: On Tue, Oct 22, 2013 at 11:18 PM, Chris Barker wrote: > By the way: > > On Tue, Oct 22, 2013 at 1:44 PM, Chris Barker > wrote: > > > If you google "numpy" you get the numpy.org page as the first hit > > > > If you go to numpy.org, there is a "geting numpy" link, that takes you > to: > > > > http://www.scipy.org/install.html > > The www.numpy.org page could perhaps use some updating, but I can't > find that one on gitHub. 
> https://github.com/numpy/numpy.org Agreed that numpy.org is not in great shape. Volunteers welcome I'd say. Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Tue Oct 22 17:23:40 2013 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Tue, 22 Oct 2013 23:23:40 +0200 Subject: [Numpy-discussion] "official" binaries on web page. In-Reply-To: References: Message-ID: On Tue, Oct 22, 2013 at 11:14 PM, Chris Barker wrote: > On Tue, Oct 22, 2013 at 1:44 PM, Chris Barker > wrote: > > (off to do a pull request) > > Done. > > https://github.com/scipy/scipy.org/pull/30 > Thanks Chris! -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Tue Oct 22 17:28:10 2013 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Tue, 22 Oct 2013 23:28:10 +0200 Subject: [Numpy-discussion] "official" binaries on web page. In-Reply-To: References: Message-ID: On Tue, Oct 22, 2013 at 10:44 PM, Chris Barker wrote: > On Tue, Oct 22, 2013 at 9:36 AM, Ralf Gommers > wrote: > > You can argue with the exact wording, > > I won't argue, I'll suggest an alternative in a pull request... > > > but the layout of that page is on > > purpose. scipy.org is split into two parts: (a) a SciPy Stack part, and > (b) > > a numpy & scipy library part. You're looking at the stack part, and the > > preferred method to install that stack is a Python distribution. > > I'm not sure who decided that that's the "preferred" way https://github.com/scipy/scipy.org/pull/3 I'm sure the there mentioned scipy-user and numfocus mailing list discussions from a year ago can be found as well with a bit of searching. Cheers, Ralf , but If we > take that as a given, maybe that page is fine. However: > > > If you look at the library part, the binary installers are at the top of > the > > page: http://www.scipy.org/scipylib/download.html > > But finding that page is not obvious. Simple usability: > > I know what the heck I'm looking at -- I've been using numpy, and > numeric before it for something like 15 years. But I just went to grab > a new binary, and had a heck of a time finding it -- I'm pretty sure I > ended up remembering that the downloads are on sourceforge and looking > there. And I've been teaching lately, and my students are having > trouble finding them, too. > > > If you google "numpy" you get the numpy.org page as the first hit > > If you go to numpy.org, there is a "geting numpy" link, that takes you to: > > http://www.scipy.org/install.html > > There is also a "Installing NumPy and SciPy" link that takes you to > that same page. > > There is a link to the sourceforge page further won, but no indication > of why you would want to go there. > > So I still have no idea how I would find: > > http://www.scipy.org/scipylib/download.html > > > So I submit that we could fix this up a bit.... > > (off to do a pull request) > > -Chris > > > > > > > > > > > > Cheers, > > Ralf > > > > > >> > >> -Chris > >> > >> > >> > >> > >> > >> -- > >> > >> Christopher Barker, Ph.D. 
> >> Oceanographer > >> > >> Emergency Response Division > >> NOAA/NOS/OR&R (206) 526-6959 voice > >> 7600 Sand Point Way NE (206) 526-6329 fax > >> Seattle, WA 98115 (206) 526-6317 main reception > >> > >> Chris.Barker at noaa.gov > >> _______________________________________________ > >> NumPy-Discussion mailing list > >> NumPy-Discussion at scipy.org > >> http://mail.scipy.org/mailman/listinfo/numpy-discussion > > > > > > > > _______________________________________________ > > NumPy-Discussion mailing list > > NumPy-Discussion at scipy.org > > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > > > > > -- > > Christopher Barker, Ph.D. > Oceanographer > > Emergency Response Division > NOAA/NOS/OR&R (206) 526-6959 voice > 7600 Sand Point Way NE (206) 526-6329 fax > Seattle, WA 98115 (206) 526-6317 main reception > > Chris.Barker at noaa.gov > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From chris.barker at noaa.gov Tue Oct 22 17:54:57 2013 From: chris.barker at noaa.gov (Chris Barker) Date: Tue, 22 Oct 2013 14:54:57 -0700 Subject: [Numpy-discussion] "official" binaries on web page. In-Reply-To: References: Message-ID: On Tue, Oct 22, 2013 at 2:23 PM, Ralf Gommers wrote: > https://github.com/numpy/numpy.org > > Agreed that numpy.org is not in great shape. Volunteers welcome I'd say. I just did a pull request to update the "Getting Numpy" link -- but yes, it needs more than that...someday. -Chris -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov From charlesr.harris at gmail.com Tue Oct 22 19:37:26 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Tue, 22 Oct 2013 17:37:26 -0600 Subject: [Numpy-discussion] Numpy 1.8.0 release In-Reply-To: <5266E18F.1090503@uci.edu> References: <5266E18F.1090503@uci.edu> Message-ID: On Tue, Oct 22, 2013 at 2:35 PM, Christoph Gohlke wrote: > On 10/20/2013 10:35 AM, Charles R Harris wrote: > > Hi All, > > > > I'm planning on releasing Numpy 1.8.0 next weekend. There have been a > > few minor fixes since 1.8.0rc2, but nothing that I think warrants > > another rc release. Please make sure to test the 1.8.0rc2 or > > maintenance/1.8.x branch with your code, for after next weekend it will > > be too late. > > > > Chuck > > > > Hello, > > how about backpoting "BUG: fix small issues found with cppcheck" > to 1.7.x? > Backported... Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From matthew.brett at gmail.com Tue Oct 22 19:44:54 2013 From: matthew.brett at gmail.com (Matthew Brett) Date: Tue, 22 Oct 2013 16:44:54 -0700 Subject: [Numpy-discussion] "official" binaries on web page. In-Reply-To: References: Message-ID: Hi, On Tue, Oct 22, 2013 at 2:28 PM, Ralf Gommers wrote: > > > > On Tue, Oct 22, 2013 at 10:44 PM, Chris Barker > wrote: >> >> On Tue, Oct 22, 2013 at 9:36 AM, Ralf Gommers >> wrote: >> > You can argue with the exact wording, >> >> I won't argue, I'll suggest an alternative in a pull request... >> >> > but the layout of that page is on >> > purpose. scipy.org is split into two parts: (a) a SciPy Stack part, and >> > (b) >> > a numpy & scipy library part. 
You're looking at the stack part, and the >> > preferred method to install that stack is a Python distribution. >> >> I'm not sure who decided that that's the "preferred" way > > > https://github.com/scipy/scipy.org/pull/3 > > I'm sure the there mentioned scipy-user and numfocus mailing list > discussions from a year ago can be found as well with a bit of searching. Do you happen to have any searchable phrases or leads for these discussions? That would be very helpful for reference. These issues come up (without resolution) at : http://mail.scipy.org/pipermail/numpy-discussion/2013-February/065339.html Cheers, Matthew From chris.barker at noaa.gov Wed Oct 23 10:51:59 2013 From: chris.barker at noaa.gov (Chris Barker - NOAA Federal) Date: Wed, 23 Oct 2013 07:51:59 -0700 Subject: [Numpy-discussion] "official" binaries on web page. In-Reply-To: References: Message-ID: <7401257806648702858@unknownmsgid> Ralf Gommers wrote: >>>> but the layout of that page is on >>>> purpose. scipy.org is split into two parts: (a) a SciPy Stack part, and >>>> (b) >>>> a numpy & scipy library part. You're looking at the stack part, and the >>>> preferred method to install that stack is a Python distribution. OK, I'm not sure that's a great idea, but if we take that as a given: That page could use some clarification about what the heck the "stack" is, and what its relationship to the scipy and numpy packages is. And I still think it wouldn't hurt to more obviously point people to how to get either numpy or scipy themselves. So maybe my section about the "official" binaries, but lower on the page. I don't like "custom" as a title, as that makes it sound like advanced numpy-fu, not where a newbie looking for just numpy is going to look. But it sounds like the real problem is with the surrounding pages--that's the page you find when you try to figure out how to get numpy--if that page is about the stack, it should not be linked to directly from the numpy.org page without explanation. We do have a branding problem: "scipy" is a package, a "stack" and a ecosystem/community. It should be clear which one is being referred to when. -Chris From jim.vickroy at noaa.gov Wed Oct 23 11:16:06 2013 From: jim.vickroy at noaa.gov (jim vickroy) Date: Wed, 23 Oct 2013 09:16:06 -0600 Subject: [Numpy-discussion] "official" binaries on web page. In-Reply-To: <7401257806648702858@unknownmsgid> References: <7401257806648702858@unknownmsgid> Message-ID: <5267E836.60708@noaa.gov> On 10/23/2013 8:51 AM, Chris Barker - NOAA Federal wrote: > Ralf Gommers wrote: >>>>> but the layout of that page is on >>>>> purpose. scipy.org is split into two parts: (a) a SciPy Stack part, and >>>>> (b) >>>>> a numpy & scipy library part. You're looking at the stack part, and the >>>>> preferred method to install that stack is a Python distribution. > OK, I'm not sure that's a great idea, but if we take that as a given: > > That page could use some clarification about what the heck the "stack" > is, and what its relationship to the scipy and numpy packages is. > > And I still think it wouldn't hurt to more obviously point people to > how to get either numpy or scipy themselves. > > So maybe my section about the "official" binaries, but lower on the > page. I don't like "custom" as a title, as that makes it sound like > advanced numpy-fu, not where a newbie looking for just numpy is going > to look. 
> > But it sounds like the real problem is with the surrounding > pages--that's the page you find when you try to figure out how to get > numpy--if that page is about the stack, it should not be linked to > directly from the numpy.org page without explanation. > > We do have a branding problem: "scipy" is a package, a "stack" and a > ecosystem/community. It should be clear which one is being referred to > when. > > -Chris > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion Chris, thanks for taking this on! You very clearly state all of the confusion I have had with Numpy and Scipy distributions and branding. I also agree that relying on a Python distribution to provide Scipy and Numpy is not a good idea. --jv -------------- next part -------------- An HTML attachment was scrubbed... URL: From matthew.brett at gmail.com Wed Oct 23 13:10:21 2013 From: matthew.brett at gmail.com (Matthew Brett) Date: Wed, 23 Oct 2013 10:10:21 -0700 Subject: [Numpy-discussion] "official" binaries on web page. In-Reply-To: <5267E836.60708@noaa.gov> References: <7401257806648702858@unknownmsgid> <5267E836.60708@noaa.gov> Message-ID: Hi, On Wed, Oct 23, 2013 at 8:16 AM, jim vickroy wrote: > On 10/23/2013 8:51 AM, Chris Barker - NOAA Federal wrote: > > Ralf Gommers wrote: > > but the layout of that page is on > purpose. scipy.org is split into two parts: (a) a SciPy Stack part, and > (b) > a numpy & scipy library part. You're looking at the stack part, and the > preferred method to install that stack is a Python distribution. > > OK, I'm not sure that's a great idea, but if we take that as a given: > > That page could use some clarification about what the heck the "stack" > is, and what its relationship to the scipy and numpy packages is. > > And I still think it wouldn't hurt to more obviously point people to > how to get either numpy or scipy themselves. > > So maybe my section about the "official" binaries, but lower on the > page. I don't like "custom" as a title, as that makes it sound like > advanced numpy-fu, not where a newbie looking for just numpy is going > to look. > > But it sounds like the real problem is with the surrounding > pages--that's the page you find when you try to figure out how to get > numpy--if that page is about the stack, it should not be linked to > directly from the numpy.org page without explanation. > > We do have a branding problem: "scipy" is a package, a "stack" and a > ecosystem/community. It should be clear which one is being referred to > when. > > -Chris > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > Chris, thanks for taking this on! You very clearly state all of the > confusion I have had with Numpy and Scipy distributions and branding. I > also agree that relying on a Python distribution to provide Scipy and Numpy > is not a good idea. --jv I know what you mean. I think there are two groups of users out there: 1) Want software, want it now, with minimum fuss; no beef with installer being primarily based at a single company 2) Like to know how everything got onto my computer, strong preference for community-supported distributions I guess that there's a fairly large group 2 in the Python community. For me, that was a powerful reason to switch from MATLAB. 
Some of this group is just starting, they own Macs or have Windows machines, and they need binary installers. I don't think there's any practical way of shifting people between groups, and trying to do so will likely cause heated arguments. There's no need to prefer one group over the other - we just need to make sure that both groups have instructions and binaries they can recognize as being for their case. As in: (Group 1): The easiest way to get .... (Group 2): You can also install the stack from community-supported binaries, this is more complicated, but possible by ... Cheers, Matthew From pav at iki.fi Wed Oct 23 13:55:19 2013 From: pav at iki.fi (Pauli Virtanen) Date: Wed, 23 Oct 2013 20:55:19 +0300 Subject: [Numpy-discussion] "official" binaries on web page. In-Reply-To: References: <7401257806648702858@unknownmsgid> <5267E836.60708@noaa.gov> Message-ID: 23.10.2013 20:10, Matthew Brett kirjoitti: [clip] > There's no need to prefer one group over the other - we just need to > make sure that both groups have instructions and binaries they can > recognize as being for their case. As in: > > (Group 1): The easiest way to get .... > (Group 2): You can also install the stack from community-supported > binaries, this is more complicated, but possible by ... This is pretty much what scipy.org/install.html page currently says. What can be improved is adding more noticeable links to the binaries. I'm convinced that relying on a Python distribution on Windows and OSX is a good idea, and needs to be emphasized over needs of advanced users, who should have enough patience to read the bottom of the page. -- Pauli Virtanen From pav at iki.fi Wed Oct 23 14:06:36 2013 From: pav at iki.fi (Pauli Virtanen) Date: Wed, 23 Oct 2013 21:06:36 +0300 Subject: [Numpy-discussion] "official" binaries on web page. In-Reply-To: <7401257806648702858@unknownmsgid> References: <7401257806648702858@unknownmsgid> Message-ID: 23.10.2013 17:51, Chris Barker - NOAA Federal kirjoitti: [clip] > But it sounds like the real problem is with the surrounding > pages--that's the page you find when you try to figure out how to get > numpy--if that page is about the stack, it should not be linked to > directly from the numpy.org page without explanation. > > We do have a branding problem: "scipy" is a package, a "stack" and a > ecosystem/community. It should be clear which one is being referred to > when. Yep, the scipy.org website has a navigation structure problem, in that the "scipy library" and "scipy stack and community" parts are not separated clearly enough. The navigation items for both sections are visible at the same time, the graphical style is the same, numpy.org is on a different domain etc., so it's a bit of a mess. Still an improvement over Moinmoin, though. One option would be to separate the navigation tree of the "scipy library" part from the entry page. This would likely make things much clearer. -- Pauli Virtanen From matthew.brett at gmail.com Wed Oct 23 14:10:52 2013 From: matthew.brett at gmail.com (Matthew Brett) Date: Wed, 23 Oct 2013 11:10:52 -0700 Subject: [Numpy-discussion] "official" binaries on web page. In-Reply-To: References: <7401257806648702858@unknownmsgid> <5267E836.60708@noaa.gov> Message-ID: Hi, On Wed, Oct 23, 2013 at 10:55 AM, Pauli Virtanen wrote: > 23.10.2013 20:10, Matthew Brett kirjoitti: > [clip] >> There's no need to prefer one group over the other - we just need to >> make sure that both groups have instructions and binaries they can >> recognize as being for their case. 
As in: >> >> (Group 1): The easiest way to get .... >> (Group 2): You can also install the stack from community-supported >> binaries, this is more complicated, but possible by ... > > This is pretty much what scipy.org/install.html page currently says. > What can be improved is adding more noticeable links to the binaries. Yes, sure. Obviously it's a subtle matter of emphasis, and also obviously, that current situation has some people confused. > I'm convinced that relying on a Python distribution on Windows and OSX > is a good idea, and needs to be emphasized over needs of advanced users, > who should have enough patience to read the bottom of the page. I am arguing that whether it's a good idea or not, according to us, is not going affect the guys and gals in Group 2, and that, unless we want to lose or discourage those people, we'll always need to support (and give mind- and page-space to) community-built binary installers. Cheers, Matthew From dketch at gmail.com Wed Oct 23 14:41:59 2013 From: dketch at gmail.com (David Ketcheson) Date: Wed, 23 Oct 2013 21:41:59 +0300 Subject: [Numpy-discussion] Redirecting f2py output to a logfile (when called via distutils) Message-ID: In Clawpack, we use numpy.distutils to handle compilation and wrapping of Fortran source files via f2py. This generates a huge amount of output (warnings generated by f2py in intermediate steps) that (it seems) can safely be ignored. However, it's problematic to have pages of warnings fly by whenever a user installs the package. I would like to redirect this output to a log file. I have tried adding sys.stdout = open('install.log', 'w') in the setup.py file. This redirects some of the output, but not all of it. Below I've pasted an example of output that still prints to the terminal. How can I force all of this output to go into a log file and not be printed to the screen? An example setup.py file is here: https://github.com/clawpack/pyclaw/blob/master/src/pyclaw/setup.py Some example output that I can't redirect is here: In file included from > build/src.macosx-10.5-x86_64-2.7/clawpack/riemann/euler_4wave_2Dmodule.c:18: > > In file included from > build/src.macosx-10.5-x86_64-2.7/fortranobject.h:13: > > In file included from > /Users/ketch/anaconda/lib/python2.7/site-packages/numpy/core/include/numpy/arrayobject.h:15: > > In file included from > /Users/ketch/anaconda/lib/python2.7/site-packages/numpy/core/include/numpy/ndarrayobject.h:17: > > In file included from > /Users/ketch/anaconda/lib/python2.7/site-packages/numpy/core/include/numpy/ndarraytypes.h:1728: > > > /Users/ketch/anaconda/lib/python2.7/site-packages/numpy/core/include/numpy/npy_deprecated_api.h:11:2: > warning: "Using deprecated NumPy API, disable it by #defining > NPY_NO_DEPRECATED_API NPY_1_7_API_VERSION" [-W#warnings] > > #warning "Using deprecated NumPy API, disable it by #defining > NPY_NO_DEPRECATED_API NPY_1_7_API_VERSION" > > ^ > > > build/src.macosx-10.5-x86_64-2.7/clawpack/riemann/euler_4wave_2Dmodule.c:111:12: > warning: unused function 'f2py_size' [-Wunused-function] > > static int f2py_size(PyArrayObject* var, ...) > > ^ > > 2 warnings generated. 
> > clawpack/riemann/src/rpn2_euler_4wave.f90:2.59: > > > > subroutine > rpn2(ixy,maxm,meqn,mwaves,maux,mbc,mx,ql,qr,auxl,auxr,wave,s,amdq,ap > > 1 > > Warning: Unused dummy argument 'auxl' at (1) > > clawpack/riemann/src/rpn2_euler_4wave.f90:2.64: > > > > subroutine > rpn2(ixy,maxm,meqn,mwaves,maux,mbc,mx,ql,qr,auxl,auxr,wave,s,amdq,ap > > 1 > Warning: Unused dummy argument 'auxr' at (1) > There is a LOT more of it, of course. -------------- next part -------------- An HTML attachment was scrubbed... URL: From pav at iki.fi Wed Oct 23 15:08:57 2013 From: pav at iki.fi (Pauli Virtanen) Date: Wed, 23 Oct 2013 22:08:57 +0300 Subject: [Numpy-discussion] "official" binaries on web page. In-Reply-To: References: <7401257806648702858@unknownmsgid> Message-ID: 23.10.2013 21:06, Pauli Virtanen kirjoitti: > 23.10.2013 17:51, Chris Barker - NOAA Federal kirjoitti: > [clip] >> But it sounds like the real problem is with the surrounding >> pages--that's the page you find when you try to figure out how to get >> numpy--if that page is about the stack, it should not be linked to >> directly from the numpy.org page without explanation. >> >> We do have a branding problem: "scipy" is a package, a "stack" and a >> ecosystem/community. It should be clear which one is being referred to >> when. > > Yep, the scipy.org website has a navigation structure problem, in that > the "scipy library" and "scipy stack and community" parts are not > separated clearly enough. This may help: https://github.com/scipy/scipy.org/pull/31 From argriffi at ncsu.edu Wed Oct 23 15:37:01 2013 From: argriffi at ncsu.edu (alex) Date: Wed, 23 Oct 2013 15:37:01 -0400 Subject: [Numpy-discussion] profiling numpy functions Message-ID: I have a question about numpy profiling. When I run my script with -m cProfile I see timings for lots of functions, but not for some like np.exp(). How do I see these timings? I don't want to have to work around it by writing a lot of functions like def mynumpyexp(A): return np.exp(A). -------------- next part -------------- An HTML attachment was scrubbed... URL: From chris.barker at noaa.gov Wed Oct 23 15:50:41 2013 From: chris.barker at noaa.gov (Chris Barker) Date: Wed, 23 Oct 2013 12:50:41 -0700 Subject: [Numpy-discussion] "official" binaries on web page. In-Reply-To: References: <7401257806648702858@unknownmsgid> <5267E836.60708@noaa.gov> Message-ID: On Wed, Oct 23, 2013 at 11:10 AM, Matthew Brett wrote: > I am arguing that whether it's a good idea or not, according to us, is > not going affect the guys and gals in Group 2, and that, unless we > want to lose or discourage those people, we'll always need to support > (and give mind- and page-space to) community-built binary installers. Indeed, and also web pages that make it easy to find them, and clear what you are or aren't getting in each case. When updating the scipy site, we do need to keep in mind that people often (most often?) come into a particular page via a link from elsewhere or a search -- i.e. they haven't read the home page, etc. So even if a page is about how to install the entire scipy stack, folks that are searching for just numpy, etc should find what they need -- or links to what they need. This makes me think: apparently there is an offical "scipy stack" -- and I even found it with a quick google: http://www.scipy.org/stackspec.html There is not a whole lot there, and all the packages in it either currently provide binary installers (or aren't all that hard to build, or Chris Gohlke has built them). 
So maybe we should have a single point of download for a "community" scipy stack, compatible with the python.org python install. All of the hard work has been done -- we just need to set it up somewhere (and maintain it) -Chris Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov From pav at iki.fi Wed Oct 23 16:24:12 2013 From: pav at iki.fi (Pauli Virtanen) Date: Wed, 23 Oct 2013 23:24:12 +0300 Subject: [Numpy-discussion] "official" binaries on web page. In-Reply-To: References: <7401257806648702858@unknownmsgid> <5267E836.60708@noaa.gov> Message-ID: 23.10.2013 22:50, Chris Barker kirjoitti: [clip] > This makes me think: apparently there is an offical "scipy stack" -- > and I even found it with a quick google: > > http://www.scipy.org/stackspec.html If you click "More information..." on the front page, or "About Scipy" in the sidebar, it takes you to an explanation that says that the scipy exists and what it is. A newcomer may possibly read that. -- Pauli Virtanen From pav at iki.fi Wed Oct 23 16:27:02 2013 From: pav at iki.fi (Pauli Virtanen) Date: Wed, 23 Oct 2013 23:27:02 +0300 Subject: [Numpy-discussion] "official" binaries on web page. In-Reply-To: References: <7401257806648702858@unknownmsgid> <5267E836.60708@noaa.gov> Message-ID: 23.10.2013 23:24, Pauli Virtanen kirjoitti: > 23.10.2013 22:50, Chris Barker kirjoitti: > [clip] >> This makes me think: apparently there is an offical "scipy stack" -- >> and I even found it with a quick google: >> >> http://www.scipy.org/stackspec.html > > If you click "More information..." on the front page, or "About Scipy" > in the sidebar, it takes you to an explanation that says that the scipy > exists and what it is. A newcomer may possibly read that. The reason why it's so obscure is probably that the discussion seems to have mostly been on the Numfocus mailing list, and not here, and I don't remember it being announced at any point. Oh well, -- Pauli Virtanen From chris.barker at noaa.gov Wed Oct 23 16:32:23 2013 From: chris.barker at noaa.gov (Chris Barker) Date: Wed, 23 Oct 2013 13:32:23 -0700 Subject: [Numpy-discussion] "official" binaries on web page. In-Reply-To: References: <7401257806648702858@unknownmsgid> <5267E836.60708@noaa.gov> Message-ID: Folks, After some more discussion here and on the pull request I submitted earlier, here is anew pull request -- I think conforms to what folks suggested, but still makes it a lot easier for folks to find the stand-alone installers if that's what they are looking for. https://github.com/scipy/scipy.org/pull/32 -Chris On Wed, Oct 23, 2013 at 12:50 PM, Chris Barker wrote: > On Wed, Oct 23, 2013 at 11:10 AM, Matthew Brett wrote: > >> I am arguing that whether it's a good idea or not, according to us, is >> not going affect the guys and gals in Group 2, and that, unless we >> want to lose or discourage those people, we'll always need to support >> (and give mind- and page-space to) community-built binary installers. > > Indeed, and also web pages that make it easy to find them, and clear > what you are or aren't getting in each case. > > When updating the scipy site, we do need to keep in mind that people > often (most often?) come into a particular page via a link from > elsewhere or a search -- i.e. they haven't read the home page, etc. 
So > even if a page is about how to install the entire scipy stack, folks > that are searching for just numpy, etc should find what they need -- > or links to what they need. > > This makes me think: apparently there is an offical "scipy stack" -- > and I even found it with a quick google: > > http://www.scipy.org/stackspec.html > > There is not a whole lot there, and all the packages in it either > currently provide binary installers (or aren't all that hard to build, > or Chris Gohlke has built them). So maybe we should have a single > point of download for a "community" scipy stack, compatible with the > python.org python install. > > All of the hard work has been done -- we just need to set it up > somewhere (and maintain it) > > -Chris > > > Christopher Barker, Ph.D. > Oceanographer > > Emergency Response Division > NOAA/NOS/OR&R (206) 526-6959 voice > 7600 Sand Point Way NE (206) 526-6329 fax > Seattle, WA 98115 (206) 526-6317 main reception > > Chris.Barker at noaa.gov -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov From ralf.gommers at gmail.com Wed Oct 23 16:34:38 2013 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Wed, 23 Oct 2013 22:34:38 +0200 Subject: [Numpy-discussion] "official" binaries on web page. In-Reply-To: References: <7401257806648702858@unknownmsgid> <5267E836.60708@noaa.gov> Message-ID: On Wed, Oct 23, 2013 at 10:27 PM, Pauli Virtanen wrote: > 23.10.2013 23:24, Pauli Virtanen kirjoitti: > > 23.10.2013 22:50, Chris Barker kirjoitti: > > [clip] > >> This makes me think: apparently there is an offical "scipy stack" -- > >> and I even found it with a quick google: > >> > >> http://www.scipy.org/stackspec.html > > > > If you click "More information..." on the front page, or "About Scipy" > > in the sidebar, it takes you to an explanation that says that the scipy > > exists and what it is. A newcomer may possibly read that. > > The reason why it's so obscure is probably that the discussion seems to > have mostly been on the Numfocus mailing list, and not here, and I don't > remember it being announced at any point. > Here are (some of?) the relevant threads: http://thread.gmane.org/gmane.comp.python.scientific.user/32928/focus=32939 http://thread.gmane.org/gmane.comp.python.scientific.user/33270/focus=33313 https://groups.google.com/forum/#!searchin/numfocus/scipy$20stack/numfocus/MnRzBhmqXqk/mFGIIbRFWSIJ Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From josef.pktd at gmail.com Wed Oct 23 16:34:55 2013 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Wed, 23 Oct 2013 16:34:55 -0400 Subject: [Numpy-discussion] "official" binaries on web page. In-Reply-To: References: <7401257806648702858@unknownmsgid> <5267E836.60708@noaa.gov> Message-ID: My first stop is always pypi to find the link to binaries. https://pypi.python.org/pypi/scipy however the direct download link points to 0.11 Josef From ralf.gommers at gmail.com Wed Oct 23 16:45:47 2013 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Wed, 23 Oct 2013 22:45:47 +0200 Subject: [Numpy-discussion] "official" binaries on web page. In-Reply-To: References: <7401257806648702858@unknownmsgid> <5267E836.60708@noaa.gov> Message-ID: On Wed, Oct 23, 2013 at 10:34 PM, wrote: > My first stop is always pypi to find the link to binaries. 
> > https://pypi.python.org/pypi/scipy > however the direct download link points to 0.11 > Grrr. IIRC that's because you have to link a page on SF which has the actual binaries, and that link has to be updated by hand which I apparently forgot when making a release. I'd be tempted to replace that unreadable link with http://sourceforge.net/projects/scipy/files/scipy/ and not care about pip/easy_install scanning that page. Binary installers don't work anyway and source bundles are on pypi already, so download_url is mainly for human consumption. Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From chris.barker at noaa.gov Wed Oct 23 17:03:43 2013 From: chris.barker at noaa.gov (Chris Barker) Date: Wed, 23 Oct 2013 14:03:43 -0700 Subject: [Numpy-discussion] "official" binaries on web page. In-Reply-To: References: <7401257806648702858@unknownmsgid> <5267E836.60708@noaa.gov> Message-ID: On Wed, Oct 23, 2013 at 1:45 PM, Ralf Gommers wrote: > I'd be tempted to replace that unreadable link with > http://sourceforge.net/projects/scipy/files/scipy/ and not care about > pip/easy_install scanning that page. Binary installers don't work anyway and > source bundles are on pypi already, so download_url is mainly for human > consumption. until we have binary wheels, then that sounds like a fine idea. -Chris -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov From pwang at continuum.io Wed Oct 23 19:00:45 2013 From: pwang at continuum.io (Peter Wang) Date: Wed, 23 Oct 2013 18:00:45 -0500 Subject: [Numpy-discussion] Announcing Bokeh 0.2: interactive web plotting for Python Message-ID: Hi everyone, I'm excited to announce the v0.2 release of Bokeh, an interactive web plotting library for Python. The long-term vision for Bokeh is to provide rich interactivity, using the full power of Javascript and Canvas, to Python users who don't need to write any JS or learn the DOM. The full blog post announcement is here: http://continuum.io/blog/bokeh02 The project website (with interactive gallery) is at: http://bokeh.pydata.org And the Git repo is: https://github.com/ContinuumIO/bokeh Cheers, Peter -------------- next part -------------- An HTML attachment was scrubbed... URL: From jason-sage at creativetrax.com Thu Oct 24 07:35:05 2013 From: jason-sage at creativetrax.com (Jason Grout) Date: Thu, 24 Oct 2013 06:35:05 -0500 Subject: [Numpy-discussion] Announcing Bokeh 0.2: interactive web plotting for Python In-Reply-To: References: Message-ID: <526905E9.40000@creativetrax.com> On 10/23/13 6:00 PM, Peter Wang wrote: > Hi everyone, > > I'm excited to announce the v0.2 release of Bokeh, an interactive web > plotting library for Python. The long-term vision for Bokeh is to > provide rich interactivity, using the full power of Javascript and > Canvas, to Python users who don't need to write any JS or learn > the DOM. > > The full blog post announcement is here: > http://continuum.io/blog/bokeh02 > > The project website (with interactive gallery) is at: > http://bokeh.pydata.org > > And the Git repo is: > https://github.com/ContinuumIO/bokeh This looks really cool. I was checking out how easy it would be to embed in the Sage Cell Server [1]. I briefly looked at the code, and it appears that the IPython notebook mode does not use nodejs, redis, gevent, etc.? Is that right? 
Thanks, Jason [1] https://sagecell.sagemath.org From jason-sage at creativetrax.com Thu Oct 24 07:36:53 2013 From: jason-sage at creativetrax.com (Jason Grout) Date: Thu, 24 Oct 2013 06:36:53 -0500 Subject: [Numpy-discussion] Announcing Bokeh 0.2: interactive web plotting for Python In-Reply-To: <526905E9.40000@creativetrax.com> References: <526905E9.40000@creativetrax.com> Message-ID: <52690655.1030605@creativetrax.com> On 10/24/13 6:35 AM, Jason Grout wrote: > This looks really cool. I was checking out how easy it would be to > embed in the Sage Cell Server [1]. I briefly looked at the code, and it > appears that the IPython notebook mode does not use nodejs, redis, > gevent, etc.? Is that right? Or maybe the better way to phrase it is: what are the absolute minimum dependencies if all I want to do is to display in the IPython notebook? Thanks, Jason From jason-sage at creativetrax.com Thu Oct 24 08:39:44 2013 From: jason-sage at creativetrax.com (Jason Grout) Date: Thu, 24 Oct 2013 07:39:44 -0500 Subject: [Numpy-discussion] Announcing Bokeh 0.2: interactive web plotting for Python In-Reply-To: References: Message-ID: <52691510.80004@creativetrax.com> On 10/23/13 6:00 PM, Peter Wang wrote: > > The project website (with interactive gallery) is at: > http://bokeh.pydata.org Just a suggestion: could you put the source below each gallery image, like matplotlib does in their gallery? I see lots of pretty plots, but I have to go digging in github or somewhere to see how you made these plots. Since Bokeh is (at least partly) about making beautiful plots easy, showing off the source code is half of the story. Thanks, Jason -- Jason Grout From pwang at continuum.io Thu Oct 24 10:44:47 2013 From: pwang at continuum.io (Peter Wang) Date: Thu, 24 Oct 2013 09:44:47 -0500 Subject: [Numpy-discussion] Announcing Bokeh 0.2: interactive web plotting for Python In-Reply-To: <52691510.80004@creativetrax.com> References: <52691510.80004@creativetrax.com> Message-ID: On Thu, Oct 24, 2013 at 7:39 AM, Jason Grout wrote: > On 10/23/13 6:00 PM, Peter Wang wrote: > > > > The project website (with interactive gallery) is at: > > http://bokeh.pydata.org > > Just a suggestion: could you put the source below each gallery image, > like matplotlib does in their gallery? I see lots of pretty plots, but > I have to go digging in github or somewhere to see how you made these > plots. Since Bokeh is (at least partly) about making beautiful plots > easy, showing off the source code is half of the story. > Thanks for the suggestion - we actually did have that at one point, but experienced some formatting issues and are working on addressing that today. -Peter -------------- next part -------------- An HTML attachment was scrubbed... URL: From pwang at continuum.io Thu Oct 24 10:47:42 2013 From: pwang at continuum.io (Peter Wang) Date: Thu, 24 Oct 2013 09:47:42 -0500 Subject: [Numpy-discussion] Announcing Bokeh 0.2: interactive web plotting for Python In-Reply-To: <52690655.1030605@creativetrax.com> References: <526905E9.40000@creativetrax.com> <52690655.1030605@creativetrax.com> Message-ID: On Thu, Oct 24, 2013 at 6:36 AM, Jason Grout wrote: > On 10/24/13 6:35 AM, Jason Grout wrote: > > This looks really cool. I was checking out how easy it would be to > > embed in the Sage Cell Server [1]. I briefly looked at the code, and it > > appears that the IPython notebook mode does not use nodejs, redis, > > gevent, etc.? Is that right? 
> > Or maybe the better way to phrase it is: what are the absolute minimum > dependencies if all I want to do is to display in the IPython notebook? > You actually should not need nodejs, redis, and gevent if you just want to embed the full source code of bokeh.js into the IPython notebook itself. Also, the data will be baked into the DOM as javascript variables. You will still have interactivity *within* plots inside a single Notebook, but they will not drive events back to the server side. Also, if your data is large, then the notebook will also get pretty big. (We will be working on more efficient encodings in a future release.) -Peter -------------- next part -------------- An HTML attachment was scrubbed... URL: From jason-sage at creativetrax.com Thu Oct 24 11:11:43 2013 From: jason-sage at creativetrax.com (Jason Grout) Date: Thu, 24 Oct 2013 10:11:43 -0500 Subject: [Numpy-discussion] Announcing Bokeh 0.2: interactive web plotting for Python In-Reply-To: References: <526905E9.40000@creativetrax.com> <52690655.1030605@creativetrax.com> Message-ID: <526938AF.5020509@creativetrax.com> On 10/24/13 9:47 AM, Peter Wang wrote: > You will still have interactivity *within* plots inside a single > Notebook, but they will not drive events back to the server side. Also, > if your data is large, then the notebook will also get pretty big. (We > will be working on more efficient encodings in a future release.) > It would be really cool if you could hook into the new IPython comm infrastructure to push events back to the server in IPython (this is not quite merged yet, but probably ready for experimentation like this). The comm infrastructure basically opens up a communication channel between objects on the server and the browser. Messages get sent over the normal IPython channels. The server and browser objects just use either send() or an on_message() handler. See https://github.com/ipython/ipython/pull/4195 Here's a very simple example of the Comm implementation working with matplotlib images in the Sage Cell server (which is built on top of the IPython infrastructure): http://sagecell.sagemath.org/?q=fyjgmk (I'd love to see a bokeh version of this sort of thing :). FYI, here is the javascript code we use for the above example: https://github.com/sagemath/sagecell/blob/master/static/compute_server.js#L768 and the python code is at https://github.com/sagemath/sagecell/blob/master/graphics.py#L399 Thanks, Jason From pwang at continuum.io Thu Oct 24 14:42:23 2013 From: pwang at continuum.io (Peter Wang) Date: Thu, 24 Oct 2013 13:42:23 -0500 Subject: [Numpy-discussion] Announcing Bokeh 0.2: interactive web plotting for Python In-Reply-To: <526938AF.5020509@creativetrax.com> References: <526905E9.40000@creativetrax.com> <52690655.1030605@creativetrax.com> <526938AF.5020509@creativetrax.com> Message-ID: On Thu, Oct 24, 2013 at 10:11 AM, Jason Grout wrote: > It would be really cool if you could hook into the new IPython comm > infrastructure to push events back to the server in IPython (this is not > quite merged yet, but probably ready for experimentation like this). > The comm infrastructure basically opens up a communication channel > between objects on the server and the browser. Messages get sent over > the normal IPython channels. The server and browser objects just use > either send() or an on_message() handler. See > https://github.com/ipython/ipython/pull/4195 Yeah, I think we should definitely look into integrating with this mechanism for when we are embedded in a Notebook. 
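Just to make sure I'm picturing it right, the kernel-side half of that would presumably be something like the sketch below -- untested, and the names (the Comm import path, send/on_msg, the 'bokeh_plot' target and the message payload) are assumptions taken from your description and the PR discussion, so they may well change before it lands:

# hypothetical kernel-side hook, following the comm machinery described
# in ipython/ipython#4195; the import path and names are assumptions
from IPython.kernel.comm import Comm

def open_plot_comm(plot_id):
    # one comm per plot; the notebook frontend registers a handler for the
    # same target_name, and the two sides then exchange JSON messages
    comm = Comm(target_name='bokeh_plot', data={'plot_id': plot_id})

    def on_selection(msg):
        # e.g. bokeh.js pushes back the indices of the selected glyphs
        selected = msg['content']['data'].get('selected', [])
        # ...update the Python-side data source / fire user callbacks...

    comm.on_msg(on_selection)
    return comm

In the Notebook case something like that could stand in for the connection we currently make back to the plot server.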
However, we always want the underlying infrastructure to be independent of IPython Notebook, because we want people to be able to build analytical applications on top of these components. > Here's a very simple example of the Comm implementation working with > matplotlib images in the Sage Cell server (which is built on top of the > IPython infrastructure): http://sagecell.sagemath.org/?q=fyjgmk (I'd > love to see a bokeh version of this sort of thing :). > This is interesting, and introducing widgets is already on the roadmap, tentatively v0.4. When running against a plot server, Bokeh plots already push selections back to the server side. (That's how the linked brushing in e.g. this example works: https://www.wakari.io/sharing/bundle/pwang/cars) Our immediate short-term priorities for 0.3 are improving the layout mechanism, incorporating large data processing into the plot server, and investigating basic interop with Matplotlib objects. -Peter -------------- next part -------------- An HTML attachment was scrubbed... URL: From jason-sage at creativetrax.com Thu Oct 24 15:55:03 2013 From: jason-sage at creativetrax.com (Jason Grout) Date: Thu, 24 Oct 2013 14:55:03 -0500 Subject: [Numpy-discussion] Announcing Bokeh 0.2: interactive web plotting for Python In-Reply-To: References: <526905E9.40000@creativetrax.com> <52690655.1030605@creativetrax.com> <526938AF.5020509@creativetrax.com> Message-ID: <52697B17.6060106@creativetrax.com> On 10/24/13 1:42 PM, Peter Wang wrote: > On Thu, Oct 24, 2013 at 10:11 AM, Jason Grout > > wrote: > > It would be really cool if you could hook into the new IPython comm > infrastructure to push events back to the server in IPython (this is not > quite merged yet, but probably ready for experimentation like this). > The comm infrastructure basically opens up a communication channel > between objects on the server and the browser. Messages get sent over > the normal IPython channels. The server and browser objects just use > either send() or an on_message() handler. See > https://github.com/ipython/ipython/pull/4195 > > > Yeah, I think we should definitely look into integrating with this > mechanism for when we are embedded in a Notebook. However, we always > want the underlying infrastructure to be independent of IPython > Notebook, because we want people to be able to build analytical > applications on top of these components. That makes a lot of sense. And looking at the code, it looks like you are cleanly separating out the session objects controlling communication from the plot machinery. That will hopefully make it much easier to have different transports for the communication. > > Here's a very simple example of the Comm implementation working with > matplotlib images in the Sage Cell server (which is built on top of the > IPython infrastructure): http://sagecell.sagemath.org/?q=fyjgmk (I'd > love to see a bokeh version of this sort of thing :). > > > This is interesting, and introducing widgets is already on the roadmap, > tentatively v0.4. When running against a plot server, Bokeh plots > already push selections back to the server side. (That's how the linked > brushing in e.g. this example works: > https://www.wakari.io/sharing/bundle/pwang/cars) > > Our immediate short-term priorities for 0.3 are improving the layout > mechanism, incorporating large data processing into the plot server, and > investigating basic interop with Matplotlib objects. > > Great to hear. 
Jason From dr.andrew.straw at gmail.com Fri Oct 25 09:07:12 2013 From: dr.andrew.straw at gmail.com (Andrew Straw) Date: Fri, 25 Oct 2013 16:07:12 +0300 Subject: [Numpy-discussion] NumPy 1.8.0rc2 release In-Reply-To: References: Message-ID: Hi, I found an unexpected difference from numpy 1.7.1 and 1.8.0rc2 with Python 3.3.2 on Ubuntu 12.04 (amd64). Here is the test program: import numpy as np print(np.__version__) K = np.array([[ 0. , 0. , 0. , 0. ], [-0.33333333, 0. , 0. , 0. ], [ 0.33333333, -0.33333333, 0. , 0. ], [ 0.33333333, -0.33333333, 0.33333333, 0. ]]) w, V = np.linalg.eigh(K) print('w') print(w) with numpy 1.7.1: 1.7.1 w [-0.33333333 -0.33333333 -0.33333333 0.99999999] with numpy 1.8.0rc2: 1.8.0rc2 w [ 0. 0. 0. 0.] Apologies if this is my mistake! -Andrew -------------- next part -------------- An HTML attachment was scrubbed... URL: From argriffi at ncsu.edu Fri Oct 25 09:32:43 2013 From: argriffi at ncsu.edu (alex) Date: Fri, 25 Oct 2013 09:32:43 -0400 Subject: [Numpy-discussion] NumPy 1.8.0rc2 release In-Reply-To: References: Message-ID: On Fri, Oct 25, 2013 at 9:07 AM, Andrew Straw wrote: > > Hi, > > I found an unexpected difference from numpy 1.7.1 and 1.8.0rc2 with Python 3.3.2 on Ubuntu 12.04 (amd64). Here is the test program: > > import numpy as np > print(np.__version__) > K = np.array([[ 0. , 0. , 0. , 0. ], > [-0.33333333, 0. , 0. , 0. ], > [ 0.33333333, -0.33333333, 0. , 0. ], > [ 0.33333333, -0.33333333, 0.33333333, 0. ]]) > w, V = np.linalg.eigh(K) > print('w') > print(w) > > with numpy 1.7.1: > > 1.7.1 > w > [-0.33333333 -0.33333333 -0.33333333 0.99999999] > > with numpy 1.8.0rc2: > > 1.8.0rc2 > w > [ 0. 0. 0. 0.] > > Apologies if this is my mistake! > -Andrew > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > My first reaction was to think that eigh would not have guarantees because the matrix is not hermitian, but now I see that eigh deliberately uses the upper vs. lower part according to the UPLO arg which is documented as defaulting to 'L', so I see why this difference between numpy versions is an unexpected problem. Alex -------------- next part -------------- An HTML attachment was scrubbed... URL: From matthew.brett at gmail.com Fri Oct 25 10:54:22 2013 From: matthew.brett at gmail.com (Matthew Brett) Date: Fri, 25 Oct 2013 07:54:22 -0700 Subject: [Numpy-discussion] ANN: SciPy 0.13.0 release In-Reply-To: References: Message-ID: Hi, On Sat, Oct 19, 2013 at 2:40 PM, Ralf Gommers wrote: > On behalf of the SciPy development team I'm pleased to announce the > availability of SciPy 0.13.0. This release contains some interesting new > features (see highlights below) and half a year's worth of maintenance work. > 65 people contributed to this release. > > Some of the highlights are: > > - support for fancy indexing and boolean comparisons with sparse matrices > - interpolative decompositions and matrix functions in the linalg module > - two new trust-region solvers for unconstrained minimization > > This release requires Python 2.6, 2.7 or 3.1-3.3 and NumPy 1.5.1 or greater. > Support for Python 2.4 and 2.5 has been dropped as of this release. > > Sources and binaries can be found at > http://sourceforge.net/projects/scipy/files/scipy/0.13.0/, release notes are > copied below. 
Sorry to be slow to the party, but I just wanted to point out: git shortlog -ns v0.12.0..v0.13.0 389 Pauli Virtanen 225 Ralf Gommers 105 alex 104 Blake Griffith 101 Warren Weckesser ... So - to y'all, but in particular to Pauli and Ralf - thank you for this all this great, patient, organized work. A deep bow, Matthew From charlesr.harris at gmail.com Fri Oct 25 12:45:34 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Fri, 25 Oct 2013 10:45:34 -0600 Subject: [Numpy-discussion] NumPy 1.8.0rc2 release In-Reply-To: References: Message-ID: On Fri, Oct 25, 2013 at 7:07 AM, Andrew Straw wrote: > Hi, > > I found an unexpected difference from numpy 1.7.1 and 1.8.0rc2 with Python > 3.3.2 on Ubuntu 12.04 (amd64). Here is the test program: > > import numpy as np > print(np.__version__) > K = np.array([[ 0. , 0. , 0. , 0. ], > [-0.33333333, 0. , 0. , 0. ], > [ 0.33333333, -0.33333333, 0. , 0. ], > [ 0.33333333, -0.33333333, 0.33333333, 0. ]]) > w, V = np.linalg.eigh(K) > print('w') > print(w) > > with numpy 1.7.1: > > 1.7.1 > w > [-0.33333333 -0.33333333 -0.33333333 0.99999999] > > with numpy 1.8.0rc2: > > 1.8.0rc2 > w > [ 0. 0. 0. 0.] > This appears to have gotten fixed since rc2: In [1]: K = np.array([[ 0. , 0. , 0. , 0. ], [-0.33333333, 0. , 0. , 0. ], [ 0.33333333, -0.33333333, 0. , 0. ], [ 0.33333333, -0.33333333, 0.33333333, 0. ]]) In [2]: eigh(K) Out[2]: (array([-0.33333333, -0.33333333, -0.33333333, 0.99999999]), array([[-0.78251031, 0.37104402, 0.00198815, 0.5 ], [-0.29974269, 0.03728557, 0.81164285, -0.5 ], [ 0.54246491, 0.46764373, 0.48686874, 0.5 ], [-0.05969728, -0.80140218, 0.32278596, 0.5 ]])) In [3]: np.__version__ Out[3]: '1.8.0.dev-ced0a94' Could you try the current 1.8.x version on github and see if the problem persists? Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Fri Oct 25 12:52:59 2013 From: njs at pobox.com (Nathaniel Smith) Date: Fri, 25 Oct 2013 17:52:59 +0100 Subject: [Numpy-discussion] NumPy 1.8.0rc2 release In-Reply-To: References: Message-ID: It's py3 only, see the discussion in #3977. On 25 Oct 2013 17:45, "Charles R Harris" wrote: > > > > On Fri, Oct 25, 2013 at 7:07 AM, Andrew Straw wrote: > >> Hi, >> >> I found an unexpected difference from numpy 1.7.1 and 1.8.0rc2 with >> Python 3.3.2 on Ubuntu 12.04 (amd64). Here is the test program: >> >> import numpy as np >> print(np.__version__) >> K = np.array([[ 0. , 0. , 0. , 0. ], >> [-0.33333333, 0. , 0. , 0. ], >> [ 0.33333333, -0.33333333, 0. , 0. ], >> [ 0.33333333, -0.33333333, 0.33333333, 0. ]]) >> w, V = np.linalg.eigh(K) >> print('w') >> print(w) >> >> with numpy 1.7.1: >> >> 1.7.1 >> w >> [-0.33333333 -0.33333333 -0.33333333 0.99999999] >> >> with numpy 1.8.0rc2: >> >> 1.8.0rc2 >> w >> [ 0. 0. 0. 0.] >> > > This appears to have gotten fixed since rc2: > > In [1]: K = np.array([[ 0. , 0. , 0. , 0. ], > [-0.33333333, 0. , 0. , 0. ], > [ 0.33333333, -0.33333333, 0. , 0. ], > [ 0.33333333, -0.33333333, 0.33333333, 0. ]]) > > In [2]: eigh(K) > Out[2]: > (array([-0.33333333, -0.33333333, -0.33333333, 0.99999999]), > array([[-0.78251031, 0.37104402, 0.00198815, 0.5 ], > [-0.29974269, 0.03728557, 0.81164285, -0.5 ], > [ 0.54246491, 0.46764373, 0.48686874, 0.5 ], > [-0.05969728, -0.80140218, 0.32278596, 0.5 ]])) > > In [3]: np.__version__ > Out[3]: '1.8.0.dev-ced0a94' > > Could you try the current 1.8.x version on github and see if the problem > persists? 
> > Chuck > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From dr.andrew.straw at gmail.com Fri Oct 25 14:54:40 2013 From: dr.andrew.straw at gmail.com (Andrew Straw) Date: Fri, 25 Oct 2013 21:54:40 +0300 Subject: [Numpy-discussion] NumPy 1.8.0rc2 release In-Reply-To: References: Message-ID: Yeah, sorry for the confusion. I thought my email didn't go through. I created a bug report on github instead. Now I find the email was indeed sent. Sorry about that. Looking at the github activity (see https://github.com/numpy/numpy/issues/3977 ), I think a fix is almost at hand thanks to Julian Taylor. Best, Andrew On Fri, Oct 25, 2013 at 7:52 PM, Nathaniel Smith wrote: > It's py3 only, see the discussion in #3977. > On 25 Oct 2013 17:45, "Charles R Harris" > wrote: > >> >> >> >> On Fri, Oct 25, 2013 at 7:07 AM, Andrew Straw wrote: >> >>> Hi, >>> >>> I found an unexpected difference from numpy 1.7.1 and 1.8.0rc2 with >>> Python 3.3.2 on Ubuntu 12.04 (amd64). Here is the test program: >>> >>> import numpy as np >>> print(np.__version__) >>> K = np.array([[ 0. , 0. , 0. , 0. ], >>> [-0.33333333, 0. , 0. , 0. ], >>> [ 0.33333333, -0.33333333, 0. , 0. ], >>> [ 0.33333333, -0.33333333, 0.33333333, 0. ]]) >>> w, V = np.linalg.eigh(K) >>> print('w') >>> print(w) >>> >>> with numpy 1.7.1: >>> >>> 1.7.1 >>> w >>> [-0.33333333 -0.33333333 -0.33333333 0.99999999] >>> >>> with numpy 1.8.0rc2: >>> >>> 1.8.0rc2 >>> w >>> [ 0. 0. 0. 0.] >>> >> >> This appears to have gotten fixed since rc2: >> >> In [1]: K = np.array([[ 0. , 0. , 0. , 0. >> ], >> [-0.33333333, 0. , 0. , 0. ], >> [ 0.33333333, -0.33333333, 0. , 0. ], >> [ 0.33333333, -0.33333333, 0.33333333, 0. ]]) >> >> In [2]: eigh(K) >> Out[2]: >> (array([-0.33333333, -0.33333333, -0.33333333, 0.99999999]), >> array([[-0.78251031, 0.37104402, 0.00198815, 0.5 ], >> [-0.29974269, 0.03728557, 0.81164285, -0.5 ], >> [ 0.54246491, 0.46764373, 0.48686874, 0.5 ], >> [-0.05969728, -0.80140218, 0.32278596, 0.5 ]])) >> >> In [3]: np.__version__ >> Out[3]: '1.8.0.dev-ced0a94' >> >> Could you try the current 1.8.x version on github and see if the problem >> persists? >> >> Chuck >> >> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion >> >> > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From jaime.frio at gmail.com Fri Oct 25 15:32:36 2013 From: jaime.frio at gmail.com (=?ISO-8859-1?Q?Jaime_Fern=E1ndez_del_R=EDo?=) Date: Fri, 25 Oct 2013 12:32:36 -0700 Subject: [Numpy-discussion] vectorizing recursive sequences Message-ID: I recently came up with a way of vectorizing some recursive sequence calculations. While it works, I am afraid it is relying on implementation details potentially subject to change. 
The basic idea is illustrated by this function, calculating the first n items of the Fibonacci sequence: def fibonacci(n): fib = np.empty(n, dtype=np.intp) fib[:2] = 1 np.add(fib[:-2], fib[1:-1], out=fib[2:]) return fib >>> fibonacci(10) array([ 1, 1, 2, 3, 5, 8, 13, 21, 34, 55], dtype=int64) I believe that the biggest issue that could break this is if the ufunc decided to buffer the arrays, as this is relying on the inputs and outputs of np.add sharing the same memory. You can use the same idea to do more complicated things, for instance to calculate the items of the sequence: f[0] = a[0] f[n] = a[n] + x * f[n-1] from numpy.lib.stride_tricks import as_strided from numpy.core.umath_tests import inner1d def f(a, x): out = np.array(a, copy=True, dtype=np.double) n = len(a) out_view = as_strided(out, shape=(n-1, 2), strides=out.strides*2) inner1d(out_view, [x, 1], out=out[1:]) return out >>> f(np.arange(10), 0.1) array([ 0. , 1. , 2.1 , 3.21 , 4.321 , 5.4321 , 6.54321 , 7.654321 , 8.7654321 , 9.87654321]) Again, I think buffering is the clearest danger of doing something like this, as the first input and output must share the same memory for this to work. That this is a real concern is easy to see: since `inner1d` only has loops registered for long ints and double floats: >>> inner1d.types ['ll->l', 'dd->d'] the above function `f` doesn't work if the `out` array is created, e.g. as np.float32, since there will be buffering happening because of the type casting. So I have two questions: 1. Is there some other reason, aside from buffering, that could go wrong, or change in a future release? 2. As far as buffering is concerned, I thought of calling `np.setbuffer(1)` before any of these functions, but it complains and requests that the value be a multiple of 16. Is there any other way to ensure that the data is fetched from an updated array in every internal iteration? Thanks! Jaime -- (\__/) ( O.o) ( > <) Este es Conejo. Copia a Conejo en tu firma y ay?dale en sus planes de dominaci?n mundial. -------------- next part -------------- An HTML attachment was scrubbed... URL: From pierre.haessig at crans.org Fri Oct 25 16:43:38 2013 From: pierre.haessig at crans.org (Pierre Haessig) Date: Fri, 25 Oct 2013 22:43:38 +0200 Subject: [Numpy-discussion] vectorizing recursive sequences In-Reply-To: References: Message-ID: <90b66073-de8c-488e-acfb-bcde5934b062@email.android.com> "Jaime Fern?ndez del R?o" a ?crit?: >I recently came up with a way of vectorizing some recursive sequence >calculations. While it works, I am afraid it is relying on >implementation >details potentially subject to change. The basic idea is illustrated by >this function, calculating the first n items of the Fibonacci sequence: > >def fibonacci(n): > > fib = np.empty(n, dtype=np.intp) > > fib[:2] = 1 > > np.add(fib[:-2], fib[1:-1], out=fib[2:]) > > return fib > > >>>> fibonacci(10) > >array([ 1, 1, 2, 3, 5, 8, 13, 21, 34, 55], dtype=int64) > > >I believe that the biggest issue that could break this is if the ufunc >decided to buffer the arrays, as this is relying on the inputs and >outputs >of np.add sharing the same memory. 
> > >You can use the same idea to do more complicated things, for instance >to >calculate the items of the sequence: > > >f[0] = a[0] > >f[n] = a[n] + x * f[n-1] > > >from numpy.lib.stride_tricks import as_strided > >from numpy.core.umath_tests import inner1d > > >def f(a, x): > > out = np.array(a, copy=True, dtype=np.double) > > n = len(a) > > out_view = as_strided(out, shape=(n-1, 2), strides=out.strides*2) > > inner1d(out_view, [x, 1], out=out[1:]) > > return out > > >>>> f(np.arange(10), 0.1) > >array([ 0. , 1. , 2.1 , 3.21 , 4.321 , > > 5.4321 , 6.54321 , 7.654321 , 8.7654321 , 9.87654321]) > >Again, I think buffering is the clearest danger of doing something >like >this, as the first input and output must share the same memory for this >to >work. That this is a real concern is easy to see: since `inner1d` only >has >loops registered for long ints and double floats: > >>>> inner1d.types >['ll->l', 'dd->d'] > >the above function `f` doesn't work if the `out` array is created, e.g. >as >np.float32, since there will be buffering happening because of the type >casting. > >So I have two questions: > >1. Is there some other reason, aside from buffering, that could go >wrong, >or change in a future release? >2. As far as buffering is concerned, I thought of calling >`np.setbuffer(1)` >before any of these functions, but it complains and requests that the >value >be a multiple of 16. Is there any other way to ensure that the data is >fetched from an updated array in every internal iteration? > >Thanks! > >Jaime > >-- >(\__/) >( O.o) >( > <) Este es Conejo. Copia a Conejo en tu firma y ay?dale en sus >planes >de dominaci?n mundial. > > >------------------------------------------------------------------------ > >_______________________________________________ >NumPy-Discussion mailing list >NumPy-Discussion at scipy.org >http://mail.scipy.org/mailman/listinfo/numpy-discussion Hi, If you're recursive computation is always linear, like Fibonacci, your usecase falls in the scope of scipy.signal.lfilter which is compiled code based. For your last example, I think parameters would be : numerator b = [1] and denominator a=[1 -x] (can't check the code on my cell phone sorry) best, Pierre -- Pierre Haessig -------------- next part -------------- An HTML attachment was scrubbed... URL: From chris.barker at noaa.gov Fri Oct 25 20:08:44 2013 From: chris.barker at noaa.gov (Chris Barker) Date: Fri, 25 Oct 2013 17:08:44 -0700 Subject: [Numpy-discussion] [Pythonmac-SIG] Building extension modules with Xcode 5 In-Reply-To: References: <1382658638.14083.YahooMailNeo@web184703.mail.ne1.yahoo.com> <5269C5E5.9020204@codebykevin.com> <1382665677.4566.YahooMailNeo@web184704.mail.ne1.yahoo.com> Message-ID: Ned, I think this fell off the list (I really don't like when reply is not set to the list...) On Fri, Oct 25, 2013 at 4:33 PM, Ned Deily wrote: > > On Oct 25, 2013, at 15:17 , Chris Barker wrote: > > > On Thu, Oct 24, 2013 at 7:29 PM, Ned Deily wrote: > > I'm really confused here. Using the current 2.7.5 64-bit installer > (from > > May) on 10.8.5 with Xcode 5.0.1 works just fine for me. > > > > I'm not on 10.8 or XCode5, but does it make a difference whether you > installed XCode 5 after Xcode 4? i.e. someone may have gcc4.2 around form > XCode 4, but not if they installed XCode 5 on a raw machine. > > Not that I know of. gcc-4.2 is a red herring as far as I can tell. > Distutils has known how to deal with a missing compiler for a couple of > releases now. 
Good to know -- I know I really need to move forward -- I wonder if there is an XCode5 for 10.6? pProbably not -- but for some screwy reason Apple does not provide a free XCode4 for 10.6, either (though you can hack the installer for 10.7 ... > However, I did just realize that I had forgotten about a problem when > using the current 32-bit-only installers with Xcode 5. I've just posted > about that; it's described now in http://bugs.python.org/issue19400 > > Good to fix that, but does anyone need it anymore? Though I use it when I need to force 32 bit, even on a 64 bit system (I'm working with some not-64-bit safe C++ code...) -Chris -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov -------------- next part -------------- An HTML attachment was scrubbed... URL: From chris.barker at noaa.gov Fri Oct 25 20:19:02 2013 From: chris.barker at noaa.gov (Chris Barker - NOAA Federal) Date: Fri, 25 Oct 2013 17:19:02 -0700 Subject: [Numpy-discussion] [Pythonmac-SIG] Building extension modules with Xcode 5 In-Reply-To: References: <1382658638.14083.YahooMailNeo@web184703.mail.ne1.yahoo.com> <5269C5E5.9020204@codebykevin.com> <1382665677.4566.YahooMailNeo@web184704.mail.ne1.yahoo.com> Message-ID: <-4560032494472153313@unknownmsgid> Oops! Wrong list--darn auto complete! Sorry about that, Chris On Oct 25, 2013, at 5:08 PM, Chris Barker wrote: Ned, I think this fell off the list (I really don't like when reply is not set to the list...) On Fri, Oct 25, 2013 at 4:33 PM, Ned Deily wrote: > > On Oct 25, 2013, at 15:17 , Chris Barker wrote: > > > On Thu, Oct 24, 2013 at 7:29 PM, Ned Deily wrote: > > I'm really confused here. Using the current 2.7.5 64-bit installer > (from > > May) on 10.8.5 with Xcode 5.0.1 works just fine for me. > > > > I'm not on 10.8 or XCode5, but does it make a difference whether you > installed XCode 5 after Xcode 4? i.e. someone may have gcc4.2 around form > XCode 4, but not if they installed XCode 5 on a raw machine. > > Not that I know of. gcc-4.2 is a red herring as far as I can tell. > Distutils has known how to deal with a missing compiler for a couple of > releases now. Good to know -- I know I really need to move forward -- I wonder if there is an XCode5 for 10.6? pProbably not -- but for some screwy reason Apple does not provide a free XCode4 for 10.6, either (though you can hack the installer for 10.7 ... > However, I did just realize that I had forgotten about a problem when > using the current 32-bit-only installers with Xcode 5. I've just posted > about that; it's described now in http://bugs.python.org/issue19400 > > Good to fix that, but does anyone need it anymore? Though I use it when I need to force 32 bit, even on a 64 bit system (I'm working with some not-64-bit safe C++ code...) -Chris -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov -------------- next part -------------- An HTML attachment was scrubbed... URL: From mrbago at gmail.com Sun Oct 27 00:33:55 2013 From: mrbago at gmail.com (Bago) Date: Sat, 26 Oct 2013 21:33:55 -0700 Subject: [Numpy-discussion] vectorizing recursive sequences In-Reply-To: References: Message-ID: This behavior seems to depend on the order in which elements of the arrays are processes. 
That seems like a dangerous thing to rely on, the main reason I can thing of that someone would want to change the loop order is to implement parallel ufuncs. Bago On Fri, Oct 25, 2013 at 12:32 PM, Jaime Fern?ndez del R?o < jaime.frio at gmail.com> wrote: > I recently came up with a way of vectorizing some recursive sequence > calculations. While it works, I am afraid it is relying on implementation > details potentially subject to change. The basic idea is illustrated by > this function, calculating the first n items of the Fibonacci sequence: > > def fibonacci(n): > > fib = np.empty(n, dtype=np.intp) > > fib[:2] = 1 > > np.add(fib[:-2], fib[1:-1], out=fib[2:]) > > return fib > > > >>> fibonacci(10) > > array([ 1, 1, 2, 3, 5, 8, 13, 21, 34, 55], dtype=int64) > > > I believe that the biggest issue that could break this is if the ufunc > decided to buffer the arrays, as this is relying on the inputs and outputs > of np.add sharing the same memory. > > > You can use the same idea to do more complicated things, for instance to > calculate the items of the sequence: > > > f[0] = a[0] > > f[n] = a[n] + x * f[n-1] > > > from numpy.lib.stride_tricks import as_strided > > from numpy.core.umath_tests import inner1d > > > def f(a, x): > > out = np.array(a, copy=True, dtype=np.double) > > n = len(a) > > out_view = as_strided(out, shape=(n-1, 2), strides=out.strides*2) > > inner1d(out_view, [x, 1], out=out[1:]) > > return out > > > >>> f(np.arange(10), 0.1) > > array([ 0. , 1. , 2.1 , 3.21 , 4.321 , > > 5.4321 , 6.54321 , 7.654321 , 8.7654321 , 9.87654321]) > > Again, I think buffering is the clearest danger of doing something like > this, as the first input and output must share the same memory for this to > work. That this is a real concern is easy to see: since `inner1d` only has > loops registered for long ints and double floats: > > >>> inner1d.types > ['ll->l', 'dd->d'] > > the above function `f` doesn't work if the `out` array is created, e.g. as > np.float32, since there will be buffering happening because of the type > casting. > > So I have two questions: > > 1. Is there some other reason, aside from buffering, that could go wrong, > or change in a future release? > 2. As far as buffering is concerned, I thought of calling > `np.setbuffer(1)` before any of these functions, but it complains and > requests that the value be a multiple of 16. Is there any other way to > ensure that the data is fetched from an updated array in every internal > iteration? > > Thanks! > > Jaime > > -- > (\__/) > ( O.o) > ( > <) Este es Conejo. Copia a Conejo en tu firma y ay?dale en sus planes > de dominaci?n mundial. > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From mistersheik at gmail.com Sun Oct 27 05:01:47 2013 From: mistersheik at gmail.com (Neil Girdhar) Date: Sun, 27 Oct 2013 05:01:47 -0400 Subject: [Numpy-discussion] A context manager for print options Message-ID: Why not replace get_printoptions/set_printoptions with a context manager accessed using numpy.printoptions in the same way that numpy.errstate exposes a context manager to seterr/geterr? This would make the set method redundant. Also, the context manager returned by numpy.errstate, numpy.printoptions, etc. could expose the dictionary directly. This would make the get methods redundant. 
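A minimal sketch of what the proposed manager could look like, layered on top of the existing ``get_printoptions``/``set_printoptions``. The name ``printoptions`` here is only the proposal, not an API that exists in numpy at the time of this thread, and yielding the options dictionary is just one way of "exposing the dictionary directly":

import contextlib
import numpy as np

@contextlib.contextmanager
def printoptions(**kwargs):
    # Save the current options, apply the requested ones, and restore
    # the originals on exit, even if an exception is raised in the block.
    saved = np.get_printoptions()
    np.set_printoptions(**kwargs)
    try:
        yield np.get_printoptions()
    finally:
        np.set_printoptions(**saved)

It would then be used like:

with printoptions(precision=3, suppress=True):
    print(np.random.rand(4))
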
Best, Neil -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Sun Oct 27 06:48:35 2013 From: njs at pobox.com (Nathaniel Smith) Date: Sun, 27 Oct 2013 10:48:35 +0000 Subject: [Numpy-discussion] A context manager for print options In-Reply-To: References: Message-ID: On 27 Oct 2013 09:02, "Neil Girdhar" wrote: > > Why not replace get_printoptions/set_printoptions with a context manager accessed using numpy.printoptions in the same way that numpy.errstate exposes a context manager to seterr/geterr? This would make the set method redundant. > > Also, the context manager returned by numpy.errstate, numpy.printoptions, etc. could expose the dictionary directly. This would make the get methods redundant. I think the only reason is that no one's written up a patch and submitted a PR. Maybe you'll be the one? ;-) -n -------------- next part -------------- An HTML attachment was scrubbed... URL: From timmichelsen at gmx-topmail.de Sun Oct 27 08:42:39 2013 From: timmichelsen at gmx-topmail.de (Tim Michelsen) Date: Sun, 27 Oct 2013 13:42:39 +0100 Subject: [Numpy-discussion] diferences between frompyfunc & vectorize Message-ID: Hello, what are the differences between the two transformation functions frompyfunc & vectorize * http://docs.scipy.org/doc/numpy/reference/generated/numpy.vectorize.html * http://docs.scipy.org/doc/numpy/reference/generated/numpy.frompyfunc.html I cannot see the which one I shall prefer for what purpose. Thanks in advance, Timmie From freddie at witherden.org Sun Oct 27 14:28:11 2013 From: freddie at witherden.org (Freddie Witherden) Date: Sun, 27 Oct 2013 18:28:11 +0000 Subject: [Numpy-discussion] Robust Sorting of Points Message-ID: <526D5B3B.7030107@witherden.org> Hi all, This is a question which has been bugging me for a while. I have an (N, 3) array where N ~ 16 of points. These points are all unique and separated by a reasonable distance. I wish to sort these points into a canonical order in a fashion which is robust against small perturbations. In other words changing any component of any of the points by an epsilon ~ 1e-12 should not affect the resulting sorted order. Considering a direct application of np.lexsort: In [6]: my_array = np.array([[-0.5, 0, 2**0.5], [0.5, 0, 2**0.5 - 1e-15]]) In [7]: my_array[np.lexsort(my_array.T)] Out[7]: array([[ 0.5 , 0. , 1.41421356], [-0.5 , 0. , 1.41421356]]) however, if the small 1e-15 perturbation is removed the order changes to the 'natural' ordering. Hence, np.lexsort is out. Rounding the array before sorting is not suitable either; just because (a - b) < epsilon does not mean that np.around(a, decimals=x) == np.around(b, decimals=b). I am therefore looking for an efficient (= within a factor of 10 of np.lexsort) solution to the problem. I've looked at writing my own comparison function cmp(x, y) which looks at the next dimension if abs(x[i] - y[i]) < epsilon however using this with sorted is thousands of times slower. Given that I have well over 100,000 of these arrays this is nuisance. My other idea is to therefore find a means of quickly replacing all numbers within 10*epsilon of a given number in an array with that number. This should permit the application of np.lexsort in order to obtain the desired ordering (which is what I'm interesting in). However, I am yet to figure out how to do this efficiently. Before I throw in the towel and drop down to C are there any other neat tricks I am missing? Regards, Freddie. 
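One way of making the "replace values that agree to within epsilon by a common representative, then lexsort" idea concrete in pure numpy is sketched below. The names ``snap`` and ``canonical_order`` and the tolerance are made up for illustration; the sketch assumes, as stated above, that genuinely distinct coordinates differ by far more than the tolerance, and grouping by consecutive gaps can still chain together a long run of nearly-equal values, so it only gives a stable order when the per-coordinate clusters are well separated relative to tol.

import numpy as np

def snap(col, tol=1e-8):
    # Replace entries of a 1-D array that lie within tol of each other
    # by a single representative, so that np.lexsort treats them as ties.
    order = np.argsort(col)
    s = col[order]
    # Start a new group wherever the gap to the previous value exceeds tol.
    group = np.concatenate(([0], np.cumsum(np.diff(s) > tol)))
    # Use the first member of each group as its representative.
    first = np.concatenate(([True], np.diff(group) > 0))
    rep = s[first][group]
    out = np.empty_like(col)
    out[order] = rep
    return out

def canonical_order(pts, tol=1e-8):
    # Snap each coordinate separately, then sort lexicographically,
    # mirroring the my_array[np.lexsort(my_array.T)] usage above.
    snapped = np.column_stack([snap(c, tol) for c in pts.T])
    return pts[np.lexsort(snapped.T)]
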
-------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 836 bytes Desc: OpenPGP digital signature URL: From njs at pobox.com Sun Oct 27 14:35:07 2013 From: njs at pobox.com (Nathaniel Smith) Date: Sun, 27 Oct 2013 18:35:07 +0000 Subject: [Numpy-discussion] Robust Sorting of Points In-Reply-To: <526D5B3B.7030107@witherden.org> References: <526D5B3B.7030107@witherden.org> Message-ID: On Sun, Oct 27, 2013 at 6:28 PM, Freddie Witherden wrote: > Hi all, > > This is a question which has been bugging me for a while. I have an (N, > 3) array where N ~ 16 of points. These points are all unique and > separated by a reasonable distance. > > I wish to sort these points into a canonical order in a fashion which is > robust against small perturbations. In other words changing any > component of any of the points by an epsilon ~ 1e-12 should not affect > the resulting sorted order. I don't understand how this is possible even in principle. Say your points are a = [0, 0, 0] b = [0, 0, 1e-12] According to your criterion, either a or b should go first -- I don't know which. Let's say our canonical ordering decides that a goes first. But if you perturb both of them, then you have a = [0, 0, 1e-12] b = [0, 0, 0] And now your requirement says that a still has to go first. But if a goes first this time, then b had to go first the last time, by symmetry. Thus your criterion is self-contradictory...? -n From freddie at witherden.org Sun Oct 27 14:42:37 2013 From: freddie at witherden.org (Freddie Witherden) Date: Sun, 27 Oct 2013 18:42:37 +0000 Subject: [Numpy-discussion] Robust Sorting of Points In-Reply-To: References: <526D5B3B.7030107@witherden.org> Message-ID: <526D5E9D.4000503@witherden.org> On 27/10/13 18:35, Nathaniel Smith wrote: > On Sun, Oct 27, 2013 at 6:28 PM, Freddie Witherden > wrote: >> Hi all, >> >> This is a question which has been bugging me for a while. I have an (N, >> 3) array where N ~ 16 of points. These points are all unique and >> separated by a reasonable distance. >> >> I wish to sort these points into a canonical order in a fashion which is >> robust against small perturbations. In other words changing any >> component of any of the points by an epsilon ~ 1e-12 should not affect >> the resulting sorted order. > > I don't understand how this is possible even in principle. > > Say your points are > > a = [0, 0, 0] > b = [0, 0, 1e-12] > > According to your criterion, either a or b should go first -- I don't > know which. Let's say our canonical ordering decides that a goes > first. But if you perturb both of them, then you have > > a = [0, 0, 1e-12] > b = [0, 0, 0] > > And now your requirement says that a still has to go first. But if a > goes first this time, then b had to go first the last time, by > symmetry. Thus your criterion is self-contradictory...? Not exactly; in your case the distance between a and b is of the order epislon. However, my points are "all unique and separated by a reasonable distance." This requires at least one of the components of any two points to differ in all instances, permitting an ordering to be defined. (Where if epislon ~ 1e-12 the minimum instance between any two points is of order ~ 1e-6.) Regards, Freddie. -------------- next part -------------- A non-text attachment was scrubbed... 
Name: signature.asc Type: application/pgp-signature Size: 836 bytes Desc: OpenPGP digital signature URL: From daniele at grinta.net Sun Oct 27 14:54:02 2013 From: daniele at grinta.net (Daniele Nicolodi) Date: Sun, 27 Oct 2013 19:54:02 +0100 Subject: [Numpy-discussion] Robust Sorting of Points In-Reply-To: <526D5E9D.4000503@witherden.org> References: <526D5B3B.7030107@witherden.org> <526D5E9D.4000503@witherden.org> Message-ID: <526D614A.9030404@grinta.net> On 27/10/2013 19:42, Freddie Witherden wrote: > On 27/10/13 18:35, Nathaniel Smith wrote: >> On Sun, Oct 27, 2013 at 6:28 PM, Freddie Witherden >> wrote: >>> Hi all, >>> >>> This is a question which has been bugging me for a while. I have an (N, >>> 3) array where N ~ 16 of points. These points are all unique and >>> separated by a reasonable distance. >>> >>> I wish to sort these points into a canonical order in a fashion which is >>> robust against small perturbations. In other words changing any >>> component of any of the points by an epsilon ~ 1e-12 should not affect >>> the resulting sorted order. >> >> I don't understand how this is possible even in principle. >> >> Say your points are >> >> a = [0, 0, 0] >> b = [0, 0, 1e-12] >> >> According to your criterion, either a or b should go first -- I don't >> know which. Let's say our canonical ordering decides that a goes >> first. But if you perturb both of them, then you have >> >> a = [0, 0, 1e-12] >> b = [0, 0, 0] >> >> And now your requirement says that a still has to go first. But if a >> goes first this time, then b had to go first the last time, by >> symmetry. Thus your criterion is self-contradictory...? > > Not exactly; in your case the distance between a and b is of the order > epislon. However, my points are "all unique and separated by a > reasonable distance." This requires at least one of the components of > any two points to differ in all instances, permitting an ordering to be > defined. (Where if epislon ~ 1e-12 the minimum instance between any two > points is of order ~ 1e-6.) Do you mean that all you points are distributed around some fixed points in your space? In this case, it looks like what you are looking for is categorization or clustering and not sorting. Once you perform clustering, you can simply define an arbitrary order in which report the content of each cluster. If this is not the case, the problem that Nathaniel highlishts is still present. Cheers, Daniele From matthew.brett at gmail.com Sun Oct 27 15:17:10 2013 From: matthew.brett at gmail.com (Matthew Brett) Date: Sun, 27 Oct 2013 12:17:10 -0700 Subject: [Numpy-discussion] A context manager for print options In-Reply-To: References: Message-ID: Hi, On Sun, Oct 27, 2013 at 2:01 AM, Neil Girdhar wrote: > Why not replace get_printoptions/set_printoptions with a context manager > accessed using numpy.printoptions in the same way that numpy.errstate > exposes a context manager to seterr/geterr? This would make the set method > redundant. > > Also, the context manager returned by numpy.errstate, numpy.printoptions, > etc. could expose the dictionary directly. This would make the get methods > redundant. Great idea ... 
Matthew From freddie at witherden.org Sun Oct 27 15:22:33 2013 From: freddie at witherden.org (Freddie Witherden) Date: Sun, 27 Oct 2013 19:22:33 +0000 Subject: [Numpy-discussion] Robust Sorting of Points In-Reply-To: <526D614A.9030404@grinta.net> References: <526D5B3B.7030107@witherden.org> <526D5E9D.4000503@witherden.org> <526D614A.9030404@grinta.net> Message-ID: <526D67F9.5020405@witherden.org> On 27/10/13 18:54, Daniele Nicolodi wrote: > On 27/10/2013 19:42, Freddie Witherden wrote: >> On 27/10/13 18:35, Nathaniel Smith wrote: >>> On Sun, Oct 27, 2013 at 6:28 PM, Freddie Witherden >>> wrote: >>>> Hi all, >>>> >>>> This is a question which has been bugging me for a while. I have an (N, >>>> 3) array where N ~ 16 of points. These points are all unique and >>>> separated by a reasonable distance. >>>> >>>> I wish to sort these points into a canonical order in a fashion which is >>>> robust against small perturbations. In other words changing any >>>> component of any of the points by an epsilon ~ 1e-12 should not affect >>>> the resulting sorted order. >>> >>> I don't understand how this is possible even in principle. >>> >>> Say your points are >>> >>> a = [0, 0, 0] >>> b = [0, 0, 1e-12] >>> >>> According to your criterion, either a or b should go first -- I don't >>> know which. Let's say our canonical ordering decides that a goes >>> first. But if you perturb both of them, then you have >>> >>> a = [0, 0, 1e-12] >>> b = [0, 0, 0] >>> >>> And now your requirement says that a still has to go first. But if a >>> goes first this time, then b had to go first the last time, by >>> symmetry. Thus your criterion is self-contradictory...? >> >> Not exactly; in your case the distance between a and b is of the order >> epislon. However, my points are "all unique and separated by a >> reasonable distance." This requires at least one of the components of >> any two points to differ in all instances, permitting an ordering to be >> defined. (Where if epislon ~ 1e-12 the minimum instance between any two >> points is of order ~ 1e-6.) > > Do you mean that all you points are distributed around some fixed points > in your space? In this case, it looks like what you are looking for is > categorization or clustering and not sorting. Once you perform > clustering, you can simply define an arbitrary order in which report the > content of each cluster. If this is not the case, the problem that > Nathaniel highlishts is still present. I am not entirely sure what you mean here. If x is my array of points of size (16, 3) then I am guarenteeing that np.min(scipy.spatial.distance.pdist(x)) >= 1e-6 In this instance I am unsure how the issue highlighted by Nathaniel might arise. Of course it is (very) possible that I am missing something, however, I believe under the terms of this constraint that it is always possible to define an order with which to iterate through the points which is invarient to shuffling of the points and small pertubations of the components. Regards, Freddie. -------------- next part -------------- A non-text attachment was scrubbed... 
Name: signature.asc Type: application/pgp-signature Size: 836 bytes Desc: OpenPGP digital signature URL: From josef.pktd at gmail.com Sun Oct 27 16:22:34 2013 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Sun, 27 Oct 2013 16:22:34 -0400 Subject: [Numpy-discussion] Robust Sorting of Points In-Reply-To: <526D67F9.5020405@witherden.org> References: <526D5B3B.7030107@witherden.org> <526D5E9D.4000503@witherden.org> <526D614A.9030404@grinta.net> <526D67F9.5020405@witherden.org> Message-ID: On Sun, Oct 27, 2013 at 3:22 PM, Freddie Witherden wrote: > On 27/10/13 18:54, Daniele Nicolodi wrote: >> On 27/10/2013 19:42, Freddie Witherden wrote: >>> On 27/10/13 18:35, Nathaniel Smith wrote: >>>> On Sun, Oct 27, 2013 at 6:28 PM, Freddie Witherden >>>> wrote: >>>>> Hi all, >>>>> >>>>> This is a question which has been bugging me for a while. I have an (N, >>>>> 3) array where N ~ 16 of points. These points are all unique and >>>>> separated by a reasonable distance. >>>>> >>>>> I wish to sort these points into a canonical order in a fashion which is >>>>> robust against small perturbations. In other words changing any >>>>> component of any of the points by an epsilon ~ 1e-12 should not affect >>>>> the resulting sorted order. >>>> >>>> I don't understand how this is possible even in principle. >>>> >>>> Say your points are >>>> >>>> a = [0, 0, 0] >>>> b = [0, 0, 1e-12] >>>> >>>> According to your criterion, either a or b should go first -- I don't >>>> know which. Let's say our canonical ordering decides that a goes >>>> first. But if you perturb both of them, then you have >>>> >>>> a = [0, 0, 1e-12] >>>> b = [0, 0, 0] >>>> >>>> And now your requirement says that a still has to go first. But if a >>>> goes first this time, then b had to go first the last time, by >>>> symmetry. Thus your criterion is self-contradictory...? >>> >>> Not exactly; in your case the distance between a and b is of the order >>> epislon. However, my points are "all unique and separated by a >>> reasonable distance." This requires at least one of the components of >>> any two points to differ in all instances, permitting an ordering to be >>> defined. (Where if epislon ~ 1e-12 the minimum instance between any two >>> points is of order ~ 1e-6.) >> >> Do you mean that all you points are distributed around some fixed points >> in your space? In this case, it looks like what you are looking for is >> categorization or clustering and not sorting. Once you perform >> clustering, you can simply define an arbitrary order in which report the >> content of each cluster. If this is not the case, the problem that >> Nathaniel highlishts is still present. > > I am not entirely sure what you mean here. If x is my array of points > of size (16, 3) then I am guarenteeing that > > np.min(scipy.spatial.distance.pdist(x)) >= 1e-6 > > In this instance I am unsure how the issue highlighted by Nathaniel > might arise. Of course it is (very) possible that I am missing > something, however, I believe under the terms of this constraint that it > is always possible to define an order with which to iterate through the > points which is invarient to shuffling of the points and small > pertubations of the components. If the epsilon or scale depends on the column, then, I think, divmod should work to cut off the noise >>> my_array[np.lexsort(divmod(my_array, [1e-1, 1e-12, 1])[0].T)] array([[-0.5 , 0. , 1.41421356], [ 0.5 , 0. , 1.41421356]]) Josef > > Regards, Freddie. 
> > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From josef.pktd at gmail.com Sun Oct 27 16:26:04 2013 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Sun, 27 Oct 2013 16:26:04 -0400 Subject: [Numpy-discussion] Robust Sorting of Points In-Reply-To: References: <526D5B3B.7030107@witherden.org> <526D5E9D.4000503@witherden.org> <526D614A.9030404@grinta.net> <526D67F9.5020405@witherden.org> Message-ID: On Sun, Oct 27, 2013 at 4:22 PM, wrote: > On Sun, Oct 27, 2013 at 3:22 PM, Freddie Witherden > wrote: >> On 27/10/13 18:54, Daniele Nicolodi wrote: >>> On 27/10/2013 19:42, Freddie Witherden wrote: >>>> On 27/10/13 18:35, Nathaniel Smith wrote: >>>>> On Sun, Oct 27, 2013 at 6:28 PM, Freddie Witherden >>>>> wrote: >>>>>> Hi all, >>>>>> >>>>>> This is a question which has been bugging me for a while. I have an (N, >>>>>> 3) array where N ~ 16 of points. These points are all unique and >>>>>> separated by a reasonable distance. >>>>>> >>>>>> I wish to sort these points into a canonical order in a fashion which is >>>>>> robust against small perturbations. In other words changing any >>>>>> component of any of the points by an epsilon ~ 1e-12 should not affect >>>>>> the resulting sorted order. >>>>> >>>>> I don't understand how this is possible even in principle. >>>>> >>>>> Say your points are >>>>> >>>>> a = [0, 0, 0] >>>>> b = [0, 0, 1e-12] >>>>> >>>>> According to your criterion, either a or b should go first -- I don't >>>>> know which. Let's say our canonical ordering decides that a goes >>>>> first. But if you perturb both of them, then you have >>>>> >>>>> a = [0, 0, 1e-12] >>>>> b = [0, 0, 0] >>>>> >>>>> And now your requirement says that a still has to go first. But if a >>>>> goes first this time, then b had to go first the last time, by >>>>> symmetry. Thus your criterion is self-contradictory...? >>>> >>>> Not exactly; in your case the distance between a and b is of the order >>>> epislon. However, my points are "all unique and separated by a >>>> reasonable distance." This requires at least one of the components of >>>> any two points to differ in all instances, permitting an ordering to be >>>> defined. (Where if epislon ~ 1e-12 the minimum instance between any two >>>> points is of order ~ 1e-6.) >>> >>> Do you mean that all you points are distributed around some fixed points >>> in your space? In this case, it looks like what you are looking for is >>> categorization or clustering and not sorting. Once you perform >>> clustering, you can simply define an arbitrary order in which report the >>> content of each cluster. If this is not the case, the problem that >>> Nathaniel highlishts is still present. >> >> I am not entirely sure what you mean here. If x is my array of points >> of size (16, 3) then I am guarenteeing that >> >> np.min(scipy.spatial.distance.pdist(x)) >= 1e-6 >> >> In this instance I am unsure how the issue highlighted by Nathaniel >> might arise. Of course it is (very) possible that I am missing >> something, however, I believe under the terms of this constraint that it >> is always possible to define an order with which to iterate through the >> points which is invarient to shuffling of the points and small >> pertubations of the components. 
> > > If the epsilon or scale depends on the column, then, I think, divmod > should work to cut off the noise > >>>> my_array[np.lexsort(divmod(my_array, [1e-1, 1e-12, 1])[0].T)] > array([[-0.5 , 0. , 1.41421356], > [ 0.5 , 0. , 1.41421356]]) >>> my_array = np.array([[-0.5, 0, 2**0.5], [-0.5, 1e-16, 2**0.5], [-0.5, -1e-16, 2**0.5],[0.5, 0, 2**0.5 - 1e-15]]) >>> my_array[np.lexsort(divmod(my_array, [1e-1, 1e-12, 1])[0].T)] array([[ -5.00000000e-01, -1.00000000e-16, 1.41421356e+00], [ -5.00000000e-01, 0.00000000e+00, 1.41421356e+00], [ -5.00000000e-01, 1.00000000e-16, 1.41421356e+00], [ 5.00000000e-01, 0.00000000e+00, 1.41421356e+00]]) > > Josef > > >> >> Regards, Freddie. >> >> >> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion >> From freddie at witherden.org Sun Oct 27 16:51:19 2013 From: freddie at witherden.org (Freddie Witherden) Date: Sun, 27 Oct 2013 20:51:19 +0000 Subject: [Numpy-discussion] Robust Sorting of Points In-Reply-To: References: <526D5B3B.7030107@witherden.org> <526D5E9D.4000503@witherden.org> <526D614A.9030404@grinta.net> <526D67F9.5020405@witherden.org> Message-ID: <526D7CC7.3080008@witherden.org> On 27/10/13 20:22, josef.pktd at gmail.com wrote: > On Sun, Oct 27, 2013 at 3:22 PM, Freddie Witherden > wrote: >> On 27/10/13 18:54, Daniele Nicolodi wrote: >>> On 27/10/2013 19:42, Freddie Witherden wrote: >>>> On 27/10/13 18:35, Nathaniel Smith wrote: >>>>> On Sun, Oct 27, 2013 at 6:28 PM, Freddie Witherden >>>>> wrote: >>>>>> Hi all, >>>>>> >>>>>> This is a question which has been bugging me for a while. I have an (N, >>>>>> 3) array where N ~ 16 of points. These points are all unique and >>>>>> separated by a reasonable distance. >>>>>> >>>>>> I wish to sort these points into a canonical order in a fashion which is >>>>>> robust against small perturbations. In other words changing any >>>>>> component of any of the points by an epsilon ~ 1e-12 should not affect >>>>>> the resulting sorted order. >>>>> >>>>> I don't understand how this is possible even in principle. >>>>> >>>>> Say your points are >>>>> >>>>> a = [0, 0, 0] >>>>> b = [0, 0, 1e-12] >>>>> >>>>> According to your criterion, either a or b should go first -- I don't >>>>> know which. Let's say our canonical ordering decides that a goes >>>>> first. But if you perturb both of them, then you have >>>>> >>>>> a = [0, 0, 1e-12] >>>>> b = [0, 0, 0] >>>>> >>>>> And now your requirement says that a still has to go first. But if a >>>>> goes first this time, then b had to go first the last time, by >>>>> symmetry. Thus your criterion is self-contradictory...? >>>> >>>> Not exactly; in your case the distance between a and b is of the order >>>> epislon. However, my points are "all unique and separated by a >>>> reasonable distance." This requires at least one of the components of >>>> any two points to differ in all instances, permitting an ordering to be >>>> defined. (Where if epislon ~ 1e-12 the minimum instance between any two >>>> points is of order ~ 1e-6.) >>> >>> Do you mean that all you points are distributed around some fixed points >>> in your space? In this case, it looks like what you are looking for is >>> categorization or clustering and not sorting. Once you perform >>> clustering, you can simply define an arbitrary order in which report the >>> content of each cluster. If this is not the case, the problem that >>> Nathaniel highlishts is still present. 
>> >> I am not entirely sure what you mean here. If x is my array of points >> of size (16, 3) then I am guarenteeing that >> >> np.min(scipy.spatial.distance.pdist(x)) >= 1e-6 >> >> In this instance I am unsure how the issue highlighted by Nathaniel >> might arise. Of course it is (very) possible that I am missing >> something, however, I believe under the terms of this constraint that it >> is always possible to define an order with which to iterate through the >> points which is invarient to shuffling of the points and small >> pertubations of the components. > > > If the epsilon or scale depends on the column, then, I think, divmod > should work to cut off the noise > >>>> my_array[np.lexsort(divmod(my_array, [1e-1, 1e-12, 1])[0].T)] > array([[-0.5 , 0. , 1.41421356], > [ 0.5 , 0. , 1.41421356]]) An interesting proposal. However, it appears to have the same issues as the rounding approach. Consider: In [5]: a, b = 1.0 + 1e-13, 1.0 - 1e-13 In [6]: abs(a - b) < 1e-12 Out[6]: True In [7]: divmod(a, 1e-6)[0] == divmod(b, 1e-6)[0] Out[7]: False Hence should np.lexsort encounter such a pair it will consider a and b to be different even though they are within epsilon of one another. Regards, Freddie. -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 836 bytes Desc: OpenPGP digital signature URL: From jmarch at enthought.com Sun Oct 27 17:05:02 2013 From: jmarch at enthought.com (Jonathan March) Date: Sun, 27 Oct 2013 16:05:02 -0500 Subject: [Numpy-discussion] Robust Sorting of Points In-Reply-To: <526D7CC7.3080008@witherden.org> References: <526D5B3B.7030107@witherden.org> <526D5E9D.4000503@witherden.org> <526D614A.9030404@grinta.net> <526D67F9.5020405@witherden.org> <526D7CC7.3080008@witherden.org> Message-ID: If an "almost always works" solution is good enough, then sort on the distance to some fixed random point that is in the vicinity of your N points. Jonathan On Sun, Oct 27, 2013 at 3:51 PM, Freddie Witherden wrote: > On 27/10/13 20:22, josef.pktd at gmail.com wrote: > > On Sun, Oct 27, 2013 at 3:22 PM, Freddie Witherden > > wrote: > >> On 27/10/13 18:54, Daniele Nicolodi wrote: > >>> On 27/10/2013 19:42, Freddie Witherden wrote: > >>>> On 27/10/13 18:35, Nathaniel Smith wrote: > >>>>> On Sun, Oct 27, 2013 at 6:28 PM, Freddie Witherden > >>>>> wrote: > >>>>>> Hi all, > >>>>>> > >>>>>> This is a question which has been bugging me for a while. I have > an (N, > >>>>>> 3) array where N ~ 16 of points. These points are all unique and > >>>>>> separated by a reasonable distance. > >>>>>> > >>>>>> I wish to sort these points into a canonical order in a fashion > which is > >>>>>> robust against small perturbations. In other words changing any > >>>>>> component of any of the points by an epsilon ~ 1e-12 should not > affect > >>>>>> the resulting sorted order. > >>>>> > >>>>> I don't understand how this is possible even in principle. > >>>>> > >>>>> Say your points are > >>>>> > >>>>> a = [0, 0, 0] > >>>>> b = [0, 0, 1e-12] > >>>>> > >>>>> According to your criterion, either a or b should go first -- I don't > >>>>> know which. Let's say our canonical ordering decides that a goes > >>>>> first. But if you perturb both of them, then you have > >>>>> > >>>>> a = [0, 0, 1e-12] > >>>>> b = [0, 0, 0] > >>>>> > >>>>> And now your requirement says that a still has to go first. But if a > >>>>> goes first this time, then b had to go first the last time, by > >>>>> symmetry. 
Thus your criterion is self-contradictory...? > >>>> > >>>> Not exactly; in your case the distance between a and b is of the order > >>>> epislon. However, my points are "all unique and separated by a > >>>> reasonable distance." This requires at least one of the components of > >>>> any two points to differ in all instances, permitting an ordering to > be > >>>> defined. (Where if epislon ~ 1e-12 the minimum instance between any > two > >>>> points is of order ~ 1e-6.) > >>> > >>> Do you mean that all you points are distributed around some fixed > points > >>> in your space? In this case, it looks like what you are looking for is > >>> categorization or clustering and not sorting. Once you perform > >>> clustering, you can simply define an arbitrary order in which report > the > >>> content of each cluster. If this is not the case, the problem that > >>> Nathaniel highlishts is still present. > >> > >> I am not entirely sure what you mean here. If x is my array of points > >> of size (16, 3) then I am guarenteeing that > >> > >> np.min(scipy.spatial.distance.pdist(x)) >= 1e-6 > >> > >> In this instance I am unsure how the issue highlighted by Nathaniel > >> might arise. Of course it is (very) possible that I am missing > >> something, however, I believe under the terms of this constraint that it > >> is always possible to define an order with which to iterate through the > >> points which is invarient to shuffling of the points and small > >> pertubations of the components. > > > > > > If the epsilon or scale depends on the column, then, I think, divmod > > should work to cut off the noise > > > >>>> my_array[np.lexsort(divmod(my_array, [1e-1, 1e-12, 1])[0].T)] > > array([[-0.5 , 0. , 1.41421356], > > [ 0.5 , 0. , 1.41421356]]) > > An interesting proposal. However, it appears to have the same issues as > the rounding approach. Consider: > > In [5]: a, b = 1.0 + 1e-13, 1.0 - 1e-13 > > In [6]: abs(a - b) < 1e-12 > Out[6]: True > > In [7]: divmod(a, 1e-6)[0] == divmod(b, 1e-6)[0] > Out[7]: False > > Hence should np.lexsort encounter such a pair it will consider a and b > to be different even though they are within epsilon of one another. > > Regards, Freddie. > > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From freddie at witherden.org Sun Oct 27 18:41:23 2013 From: freddie at witherden.org (Freddie Witherden) Date: Sun, 27 Oct 2013 22:41:23 +0000 Subject: [Numpy-discussion] Robust Sorting of Points In-Reply-To: References: <526D5B3B.7030107@witherden.org> <526D5E9D.4000503@witherden.org> <526D614A.9030404@grinta.net> <526D67F9.5020405@witherden.org> <526D7CC7.3080008@witherden.org> Message-ID: <526D9693.8050704@witherden.org> On 27/10/13 21:05, Jonathan March wrote: > If an "almost always works" solution is good enough, then sort on the > distance to some fixed random point that is in the vicinity of your N > points. I had considered this. Unfortunately I need a solution which really does always work. The only pure-Python solution I can envision -- at the moment anyway -- is to do some cleverness with the output of np.unique to identify similar values and replace them with an arbitrarily chosen one. This should permit the output to be passed to np.lexsort without issue. Regards, Freddie. -------------- next part -------------- A non-text attachment was scrubbed... 
Name: signature.asc Type: application/pgp-signature Size: 836 bytes Desc: OpenPGP digital signature URL: From njs at pobox.com Sun Oct 27 18:53:37 2013 From: njs at pobox.com (Nathaniel Smith) Date: Sun, 27 Oct 2013 22:53:37 +0000 Subject: [Numpy-discussion] Robust Sorting of Points In-Reply-To: <526D9693.8050704@witherden.org> References: <526D5B3B.7030107@witherden.org> <526D5E9D.4000503@witherden.org> <526D614A.9030404@grinta.net> <526D67F9.5020405@witherden.org> <526D7CC7.3080008@witherden.org> <526D9693.8050704@witherden.org> Message-ID: On Sun, Oct 27, 2013 at 10:41 PM, Freddie Witherden wrote: > On 27/10/13 21:05, Jonathan March wrote: >> If an "almost always works" solution is good enough, then sort on the >> distance to some fixed random point that is in the vicinity of your N >> points. > > I had considered this. Unfortunately I need a solution which really > does always work. > > The only pure-Python solution I can envision -- at the moment anyway -- > is to do some cleverness with the output of np.unique to identify > similar values and replace them with an arbitrarily chosen one. This > should permit the output to be passed to np.lexsort without issue. # Warning 1: Untested code. # Warning 2: I can't tell if this is clever or horrible. r = np.random.RandomState(0) while True: # This perturbation leaves lexical order unchanged, but will eventually shift # all points away from rounding bin boundarys. shift = r.rand() if np.any(np.around(perturbed + (shift - 1e-11), decimals=10) != np.around(perturbed + (shift + 1e-11))): continue return np.lexsort(np.around(perturbed, decimals=10)) -n From mistersheik at gmail.com Sun Oct 27 18:59:33 2013 From: mistersheik at gmail.com (Neil Girdhar) Date: Sun, 27 Oct 2013 18:59:33 -0400 Subject: [Numpy-discussion] Testing Message-ID: How do I test a patch that I've made locally? I can't seem to import numpy locally: Error importing numpy: you should not try to import numpy from its source directory; please exit the numpy source tree, and relaunch your python intepreter from there. -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Sun Oct 27 19:12:21 2013 From: njs at pobox.com (Nathaniel Smith) Date: Sun, 27 Oct 2013 23:12:21 +0000 Subject: [Numpy-discussion] Testing In-Reply-To: References: Message-ID: On Sun, Oct 27, 2013 at 10:59 PM, Neil Girdhar wrote: > How do I test a patch that I've made locally? I can't seem to import numpy > locally: > > Error importing numpy: you should not try to import numpy from > its source directory; please exit the numpy source tree, and > relaunch > your python intepreter from there. python runtests.py --help -n From charlesr.harris at gmail.com Sun Oct 27 19:12:38 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Sun, 27 Oct 2013 17:12:38 -0600 Subject: [Numpy-discussion] Testing In-Reply-To: References: Message-ID: On Sun, Oct 27, 2013 at 4:59 PM, Neil Girdhar wrote: > How do I test a patch that I've made locally? I can't seem to import > numpy locally: > > Error importing numpy: you should not try to import numpy from > its source directory; please exit the numpy source tree, and > relaunch > your python intepreter from there. > > > If you are running current master do python runtests.py --help Chuck > > -------------- next part -------------- An HTML attachment was scrubbed... 
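For reference, runtests.py builds numpy outside the source tree and runs the test suite against that build, which sidesteps the "should not try to import numpy from its source directory" error quoted above. A couple of typical invocations are sketched below; the -t and --ipython options here are from memory and should be checked against the output of --help before relying on them, and the test file named is only an example path.

python runtests.py                                         # build, then run the full test suite
python runtests.py -t numpy/core/tests/test_arrayprint.py  # run a single test file
python runtests.py --ipython                               # IPython shell with the built numpy importable
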
URL: From mistersheik at gmail.com Sun Oct 27 19:13:42 2013 From: mistersheik at gmail.com (Neil Girdhar) Date: Sun, 27 Oct 2013 19:13:42 -0400 Subject: [Numpy-discussion] Testing In-Reply-To: References: Message-ID: Since I am trying to add a "printoptions" context manager, I would like to test it. Should I add tests, or can I somehow use it from an ipython shell? On Sun, Oct 27, 2013 at 7:12 PM, Charles R Harris wrote: > > > > On Sun, Oct 27, 2013 at 4:59 PM, Neil Girdhar wrote: > >> How do I test a patch that I've made locally? I can't seem to import >> numpy locally: >> >> Error importing numpy: you should not try to import numpy from >> its source directory; please exit the numpy source tree, and >> relaunch >> your python intepreter from there. >> >> >> > If you are running current master do > > python runtests.py --help > > Chuck > >> >> > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From mistersheik at gmail.com Sun Oct 27 19:14:27 2013 From: mistersheik at gmail.com (Neil Girdhar) Date: Sun, 27 Oct 2013 19:14:27 -0400 Subject: [Numpy-discussion] Testing In-Reply-To: References: Message-ID: Ah, sorry, didn't see that I can do that from runtests!! Thanks!! On Sun, Oct 27, 2013 at 7:13 PM, Neil Girdhar wrote: > Since I am trying to add a "printoptions" context manager, I would like to > test it. Should I add tests, or can I somehow use it from an ipython shell? > > > On Sun, Oct 27, 2013 at 7:12 PM, Charles R Harris < > charlesr.harris at gmail.com> wrote: > >> >> >> >> On Sun, Oct 27, 2013 at 4:59 PM, Neil Girdhar wrote: >> >>> How do I test a patch that I've made locally? I can't seem to import >>> numpy locally: >>> >>> Error importing numpy: you should not try to import numpy from >>> its source directory; please exit the numpy source tree, and >>> relaunch >>> your python intepreter from there. >>> >>> >>> >> If you are running current master do >> >> python runtests.py --help >> >> Chuck >> >>> >>> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion >> >> > -------------- next part -------------- An HTML attachment was scrubbed... URL: From mistersheik at gmail.com Sun Oct 27 21:23:51 2013 From: mistersheik at gmail.com (Neil Girdhar) Date: Sun, 27 Oct 2013 21:23:51 -0400 Subject: [Numpy-discussion] Code review request: PrintOptions Message-ID: This is my first code review request, so I may have done some things wrong. I think the following URL should work? https://github.com/MisterSheik/numpy/compare Best, Neil -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Sun Oct 27 21:38:23 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Sun, 27 Oct 2013 19:38:23 -0600 Subject: [Numpy-discussion] Code review request: PrintOptions In-Reply-To: References: Message-ID: On Sun, Oct 27, 2013 at 7:23 PM, Neil Girdhar wrote: > This is my first code review request, so I may have done some things > wrong. I think the following URL should work? > https://github.com/MisterSheik/numpy/compare > > The first thing to do is make a new branch for your work. 
Probably the easiest way from where you are is to make the branch, which will have your changes in it, then go back to master and git reset --hard to the last commit before your work. Working in master is a big no-no. See `doc/source/dev/gitwash/development_workflow.rst`. When you are ready, make a PR for that branch. The code will get reviewed at that point. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From mistersheik at gmail.com Sun Oct 27 21:42:02 2013 From: mistersheik at gmail.com (Neil Girdhar) Date: Sun, 27 Oct 2013 21:42:02 -0400 Subject: [Numpy-discussion] Code review request: PrintOptions In-Reply-To: References: Message-ID: Yeah, I realized that I missed that and figured it wouldn't matter since it was my own master and I don't plan on making other changes to numpy. If you don't mind, how do I move my changelist into a branch? I'm really worried I'm going to lose my changes. On Sun, Oct 27, 2013 at 9:38 PM, Charles R Harris wrote: > > > > On Sun, Oct 27, 2013 at 7:23 PM, Neil Girdhar wrote: > >> This is my first code review request, so I may have done some things >> wrong. I think the following URL should work? >> https://github.com/MisterSheik/numpy/compare >> >> The first thing to do is make a new branch for your work. Probably the > easiest way from where you are is to make the branch, which will have your > changes in it, then go back to master and git reset --hard to the last > commit before your work. Working in master is a big no-no. See > `doc/source/dev/gitwash/development_workflow.rst`. When you are ready, make > a PR for that branch. The code will get reviewed at that point. > > Chuck > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From mistersheik at gmail.com Sun Oct 27 21:45:36 2013 From: mistersheik at gmail.com (Neil Girdhar) Date: Sun, 27 Oct 2013 21:45:36 -0400 Subject: [Numpy-discussion] Code review request: PrintOptions In-Reply-To: References: Message-ID: Is this what I want? https://github.com/numpy/numpy/pull/3987 On Sun, Oct 27, 2013 at 9:42 PM, Neil Girdhar wrote: > Yeah, I realized that I missed that and figured it wouldn't matter since > it was my own master and I don't plan on making other changes to numpy. If > you don't mind, how do I move my changelist into a branch? I'm really > worried I'm going to lose my changes. > > > On Sun, Oct 27, 2013 at 9:38 PM, Charles R Harris < > charlesr.harris at gmail.com> wrote: > >> >> >> >> On Sun, Oct 27, 2013 at 7:23 PM, Neil Girdhar wrote: >> >>> This is my first code review request, so I may have done some things >>> wrong. I think the following URL should work? >>> https://github.com/MisterSheik/numpy/compare >>> >>> The first thing to do is make a new branch for your work. Probably the >> easiest way from where you are is to make the branch, which will have your >> changes in it, then go back to master and git reset --hard to the last >> commit before your work. Working in master is a big no-no. See >> `doc/source/dev/gitwash/development_workflow.rst`. When you are ready, make >> a PR for that branch. The code will get reviewed at that point. 
>> >> Chuck >> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion >> >> > -------------- next part -------------- An HTML attachment was scrubbed... URL: From aron at ahmadia.net Sun Oct 27 21:46:10 2013 From: aron at ahmadia.net (Aron Ahmadia) Date: Sun, 27 Oct 2013 21:46:10 -0400 Subject: [Numpy-discussion] Code review request: PrintOptions In-Reply-To: References: Message-ID: On Sun, Oct 27, 2013 at 9:42 PM, Neil Girdhar wrote: > Yeah, I realized that I missed that and figured it wouldn't matter since > it was my own master and I don't plan on making other changes to numpy. If > you don't mind, how do I move my changelist into a branch? I'm really > worried I'm going to lose my changes. 'master' is just another branch. You can do: git checkout -b new_branch_name In your current repository to create a new branch pointing to where you are now (the master branch which you want to move) Chuck's instructions follow that: git checkout master git reset --hard origin/master Some people advise that you just delete the 'master' branch on your local repository/forks. This can simplify the process of accidentally committing to it, though it's easy to recover from. Cheers, Aron -------------- next part -------------- An HTML attachment was scrubbed... URL: From gexarcha1 at gmail.com Sun Oct 27 22:13:03 2013 From: gexarcha1 at gmail.com (Georgios Exarchakis) Date: Sun, 27 Oct 2013 19:13:03 -0700 Subject: [Numpy-discussion] memmory management question Message-ID: <526DC82F.3070606@gmail.com> Hi, I am using numpy with ipython from anaconda and I observe the following behavior: Python 2.7.5 |Anaconda 1.7.0 (64-bit)| (default, Jun 28 2013, 22:10:09) Type "copyright", "credits" or "license" for more information. IPython 1.0.0 -- An enhanced Interactive Python. ? -> Introduction and overview of IPython's features. %quickref -> Quick reference. help -> Python's own help system. object? -> Details about 'object', use 'object??' for extra details. Using matplotlib backend: Qt4Agg In [1]: a= np.random.rand(500000,1000) In [2]: a = a[:10000] In [3]: c= np.random.rand(500000,1000) In [4]: After In[1] I have an extra 3.7 GB of memory used, but this memory is not released at In[2]. I thought there might be some clever memory management trick so I executted In[3] but that just added an extra 3.7GB of memorry without releasing anything. Is that the right behavior in this case? If yes then how do you release memorry by slicing away parts of an array? Can you give me a description of the numpy internals in this case? Thank you very much for your time, Georgios -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Mon Oct 28 00:13:26 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Sun, 27 Oct 2013 22:13:26 -0600 Subject: [Numpy-discussion] Robust Sorting of Points In-Reply-To: <526D5B3B.7030107@witherden.org> References: <526D5B3B.7030107@witherden.org> Message-ID: On Sun, Oct 27, 2013 at 12:28 PM, Freddie Witherden wrote: > Hi all, > > This is a question which has been bugging me for a while. I have an (N, > 3) array where N ~ 16 of points. These points are all unique and > separated by a reasonable distance. > > I wish to sort these points into a canonical order in a fashion which is > robust against small perturbations. 
In other words changing any > component of any of the points by an epsilon ~ 1e-12 should not affect > the resulting sorted order. > > Considering a direct application of np.lexsort: > > In [6]: my_array = np.array([[-0.5, 0, 2**0.5], > [0.5, 0, 2**0.5 - 1e-15]]) > > In [7]: my_array[np.lexsort(my_array.T)] > Out[7]: array([[ 0.5 , 0. , 1.41421356], > [-0.5 , 0. , 1.41421356]]) > > however, if the small 1e-15 perturbation is removed the order changes to > the 'natural' ordering. Hence, np.lexsort is out. > > Rounding the array before sorting is not suitable either; just because > (a - b) < epsilon does not mean that np.around(a, decimals=x) == > np.around(b, decimals=b). > > I am therefore looking for an efficient (= within a factor of 10 of > np.lexsort) solution to the problem. I've looked at writing my own > comparison function cmp(x, y) which looks at the next dimension if > abs(x[i] - y[i]) < epsilon however using this with sorted is thousands > of times slower. Given that I have well over 100,000 of these arrays > this is nuisance. > > My other idea is to therefore find a means of quickly replacing all > numbers within 10*epsilon of a given number in an array with that > number. This should permit the application of np.lexsort in order to > obtain the desired ordering (which is what I'm interesting in). > However, I am yet to figure out how to do this efficiently. > > Before I throw in the towel and drop down to C are there any other neat > tricks I am missing? > > Sort them as strings? In [17]: a = np.random.random((5,3)) In [18]: a Out[18]: array([[ 0.64734085, 0.71582772, 0.82743219], [ 0.92769057, 0.46880266, 0.73836167], [ 0.11904745, 0.56834934, 0.16144849], [ 0.59186013, 0.90698496, 0.56950572], [ 0.7317681 , 0.93495724, 0.19217244]]) In [19]: a.view('S24').sort(0) In [20]: a Out[20]: array([[ 0.7317681 , 0.93495724, 0.19217244], [ 0.64734085, 0.71582772, 0.82743219], [ 0.59186013, 0.90698496, 0.56950572], [ 0.92769057, 0.46880266, 0.73836167], [ 0.11904745, 0.56834934, 0.16144849]]) Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Mon Oct 28 00:16:37 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Sun, 27 Oct 2013 22:16:37 -0600 Subject: [Numpy-discussion] Robust Sorting of Points In-Reply-To: References: <526D5B3B.7030107@witherden.org> Message-ID: On Sun, Oct 27, 2013 at 10:13 PM, Charles R Harris < charlesr.harris at gmail.com> wrote: > > > > On Sun, Oct 27, 2013 at 12:28 PM, Freddie Witherden > wrote: > >> Hi all, >> >> This is a question which has been bugging me for a while. I have an (N, >> 3) array where N ~ 16 of points. These points are all unique and >> separated by a reasonable distance. >> >> I wish to sort these points into a canonical order in a fashion which is >> robust against small perturbations. In other words changing any >> component of any of the points by an epsilon ~ 1e-12 should not affect >> the resulting sorted order. >> >> Considering a direct application of np.lexsort: >> >> In [6]: my_array = np.array([[-0.5, 0, 2**0.5], >> [0.5, 0, 2**0.5 - 1e-15]]) >> >> In [7]: my_array[np.lexsort(my_array.T)] >> Out[7]: array([[ 0.5 , 0. , 1.41421356], >> [-0.5 , 0. , 1.41421356]]) >> >> however, if the small 1e-15 perturbation is removed the order changes to >> the 'natural' ordering. Hence, np.lexsort is out. 
>> >> Rounding the array before sorting is not suitable either; just because >> (a - b) < epsilon does not mean that np.around(a, decimals=x) == >> np.around(b, decimals=b). >> >> I am therefore looking for an efficient (= within a factor of 10 of >> np.lexsort) solution to the problem. I've looked at writing my own >> comparison function cmp(x, y) which looks at the next dimension if >> abs(x[i] - y[i]) < epsilon however using this with sorted is thousands >> of times slower. Given that I have well over 100,000 of these arrays >> this is nuisance. >> >> My other idea is to therefore find a means of quickly replacing all >> numbers within 10*epsilon of a given number in an array with that >> number. This should permit the application of np.lexsort in order to >> obtain the desired ordering (which is what I'm interesting in). >> However, I am yet to figure out how to do this efficiently. >> >> Before I throw in the towel and drop down to C are there any other neat >> tricks I am missing? >> >> > Sort them as strings? > > In [17]: a = np.random.random((5,3)) > > In [18]: a > Out[18]: > array([[ 0.64734085, 0.71582772, 0.82743219], > [ 0.92769057, 0.46880266, 0.73836167], > [ 0.11904745, 0.56834934, 0.16144849], > [ 0.59186013, 0.90698496, 0.56950572], > [ 0.7317681 , 0.93495724, 0.19217244]]) > > In [19]: a.view('S24').sort(0) > > In [20]: a > Out[20]: > array([[ 0.7317681 , 0.93495724, 0.19217244], > [ 0.64734085, 0.71582772, 0.82743219], > [ 0.59186013, 0.90698496, 0.56950572], > [ 0.92769057, 0.46880266, 0.73836167], > [ 0.11904745, 0.56834934, 0.16144849]]) > > nvm, that won't work. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From davidmenhur at gmail.com Mon Oct 28 08:22:23 2013 From: davidmenhur at gmail.com (=?UTF-8?B?RGHPgGlk?=) Date: Mon, 28 Oct 2013 13:22:23 +0100 Subject: [Numpy-discussion] memmory management question In-Reply-To: <526DC82F.3070606@gmail.com> References: <526DC82F.3070606@gmail.com> Message-ID: On 28 October 2013 03:13, Georgios Exarchakis wrote: > If yes then how do you release memorry by slicing away parts of an array? An array is a single Python object. In your example, there is always one reference pointing to the array (either the whole array or only a view), so the memory cannot be released. In your case, a = a[:10000].copy() releases the memory by creating a new object and pointing a to this new one, so the big array gets unreferenced and thus, garbage collected. The price to pay is that, for some time, you have living in memory both the original array and your new one: if they were very big, you could run out of memory; and that you have to perform a memcopy (again, slow if you have lots of data). /David. -------------- next part -------------- An HTML attachment was scrubbed... URL: From pierre.haessig at crans.org Mon Oct 28 08:37:19 2013 From: pierre.haessig at crans.org (Pierre Haessig) Date: Mon, 28 Oct 2013 13:37:19 +0100 Subject: [Numpy-discussion] memmory management question In-Reply-To: <526DC82F.3070606@gmail.com> References: <526DC82F.3070606@gmail.com> Message-ID: <526E5A7F.6030703@crans.org> Hi, Along the line of what David said, I just looked at the flags : a = np.arange(10) a.flags [...] OWNDATA : True a = a[:3] a.flags [...] OWNDATA : False Indeed, after a=a[:3], a is not the same Python object but still points to the data of the first object. What I didn't find (by quick googling) is how to access the original array. Is it possible to access it (with Python code) ? 
best, Pierre -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 900 bytes Desc: OpenPGP digital signature URL: From robert.kern at gmail.com Mon Oct 28 08:40:22 2013 From: robert.kern at gmail.com (Robert Kern) Date: Mon, 28 Oct 2013 12:40:22 +0000 Subject: [Numpy-discussion] memmory management question In-Reply-To: <526E5A7F.6030703@crans.org> References: <526DC82F.3070606@gmail.com> <526E5A7F.6030703@crans.org> Message-ID: On Mon, Oct 28, 2013 at 12:37 PM, Pierre Haessig wrote: > What I didn't find (by quick googling) is how to access the original > array. Is it possible to access it (with Python code) ? a.base -- Robert Kern -------------- next part -------------- An HTML attachment was scrubbed... URL: From pierre.haessig at crans.org Mon Oct 28 08:44:20 2013 From: pierre.haessig at crans.org (Pierre Haessig) Date: Mon, 28 Oct 2013 13:44:20 +0100 Subject: [Numpy-discussion] Robust Sorting of Points In-Reply-To: <526D5B3B.7030107@witherden.org> References: <526D5B3B.7030107@witherden.org> Message-ID: <526E5C24.4070007@crans.org> Hi, Le 27/10/2013 19:28, Freddie Witherden a ?crit : > I wish to sort these points into a canonical order in a fashion which is > robust against small perturbations. In other words changing any > component of any of the points by an epsilon ~ 1e-12 should not affect > the resulting sorted order. Can you give more precision on what you mean by "canonical order". Since there is no natural order in R^n for n>1, I guess your problem is more about *defining* what is the order you want rather than *implementing* it in C/Python or whatever. best, Pierre -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 900 bytes Desc: OpenPGP digital signature URL: From pierre.haessig at crans.org Mon Oct 28 09:25:45 2013 From: pierre.haessig at crans.org (Pierre Haessig) Date: Mon, 28 Oct 2013 14:25:45 +0100 Subject: [Numpy-discussion] memmory management question In-Reply-To: References: <526DC82F.3070606@gmail.com> <526E5A7F.6030703@crans.org> Message-ID: <526E65D9.4090505@crans.org> Le 28/10/2013 13:40, Robert Kern a ?crit : > > What I didn't find (by quick googling) is how to access the original > > array. Is it possible to access it (with Python code) ? > > a.base Thanks! Is there a specific paragraph I missed in the user guide ? I had googled "numpy access original array" and first result is http://docs.scipy.org/doc/numpy/user/basics.indexing.html In this document, it is mentionned several time that slicing yields "views of the original data", but the .base attribute is not mentionned. Should it be or is it out-of-scope of the Indexing guide ? best, Pierre -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 900 bytes Desc: OpenPGP digital signature URL: From thouis at gmail.com Mon Oct 28 09:41:37 2013 From: thouis at gmail.com (Thouis (Ray) Jones) Date: Mon, 28 Oct 2013 09:41:37 -0400 Subject: [Numpy-discussion] Robust Sorting of Points In-Reply-To: <526D9693.8050704@witherden.org> References: <526D5B3B.7030107@witherden.org> <526D5E9D.4000503@witherden.org> <526D614A.9030404@grinta.net> <526D67F9.5020405@witherden.org> <526D7CC7.3080008@witherden.org> <526D9693.8050704@witherden.org> Message-ID: Always, *always*, or just with high enough probability that you don't realistically have to worry about it failing. 
If the latter, I wonder if you could do something with random projections. Off the top of my head, I wonder if something like the sum of ranks when ordered under a set of random projection directions might be stable for a large enough set of directions. It's been a while since my Randomized Algorithms class, though. Ray Jones On Sun, Oct 27, 2013 at 6:41 PM, Freddie Witherden wrote: > On 27/10/13 21:05, Jonathan March wrote: > > If an "almost always works" solution is good enough, then sort on the > > distance to some fixed random point that is in the vicinity of your N > > points. > > I had considered this. Unfortunately I need a solution which really > does always work. > > The only pure-Python solution I can envision -- at the moment anyway -- > is to do some cleverness with the output of np.unique to identify > similar values and replace them with an arbitrarily chosen one. This > should permit the output to be passed to np.lexsort without issue. > > Regards, Freddie. > > > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From robert.kern at gmail.com Mon Oct 28 09:56:00 2013 From: robert.kern at gmail.com (Robert Kern) Date: Mon, 28 Oct 2013 13:56:00 +0000 Subject: [Numpy-discussion] memmory management question In-Reply-To: <526E65D9.4090505@crans.org> References: <526DC82F.3070606@gmail.com> <526E5A7F.6030703@crans.org> <526E65D9.4090505@crans.org> Message-ID: On Mon, Oct 28, 2013 at 1:25 PM, Pierre Haessig wrote: > > Le 28/10/2013 13:40, Robert Kern a ?crit : > > > What I didn't find (by quick googling) is how to access the original > > > array. Is it possible to access it (with Python code) ? > > > > a.base > Thanks! Is there a specific paragraph I missed in the user guide ? > > I had googled "numpy access original array" and first result is > http://docs.scipy.org/doc/numpy/user/basics.indexing.html > In this document, it is mentionned several time that slicing yields > "views of the original data", but the .base attribute is not mentionned. > Should it be or is it out-of-scope of the Indexing guide ? I think it's out-of-scope for "Numpy basics", certainly. It's mostly an implementation detail. There isn't much that you can usefully do with it as a user. http://docs.scipy.org/doc/numpy/reference/generated/numpy.ndarray.base.html -- Robert Kern -------------- next part -------------- An HTML attachment was scrubbed... URL: From ndbecker2 at gmail.com Mon Oct 28 11:47:13 2013 From: ndbecker2 at gmail.com (Neal Becker) Date: Mon, 28 Oct 2013 11:47:13 -0400 Subject: [Numpy-discussion] is np vector a sequence? Message-ID: isinstance (np.zeros (10), collections.Sequence) Out[36]: False That's unfortunate. From argriffi at ncsu.edu Mon Oct 28 11:52:04 2013 From: argriffi at ncsu.edu (alex) Date: Mon, 28 Oct 2013 11:52:04 -0400 Subject: [Numpy-discussion] is np vector a sequence? In-Reply-To: References: Message-ID: On Mon, Oct 28, 2013 at 11:47 AM, Neal Becker wrote: > > isinstance (np.zeros (10), collections.Sequence) > Out[36]: False > > That's unfortunate. There seems to be a discussion here https://github.com/numpy/numpy/issues/2776 Alex -------------- next part -------------- An HTML attachment was scrubbed... 
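As a possible stopgap on the user side (separate from whatever the linked issue decides), the abc machinery allows ndarray to be registered as a virtual subclass of Sequence so that the isinstance check above passes. This is only a sketch: registration changes what isinstance/issubclass report, it does not add the Sequence mixin methods such as count() or index().

import collections
import numpy as np

# Register ndarray as a virtual subclass of Sequence; this only affects
# isinstance()/issubclass() checks, it does not add any methods.
collections.Sequence.register(np.ndarray)

print(isinstance(np.zeros(10), collections.Sequence))  # now True
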
URL: From sgonzi at staffmail.ed.ac.uk Mon Oct 28 12:34:18 2013 From: sgonzi at staffmail.ed.ac.uk (Siegfried Gonzi) Date: Mon, 28 Oct 2013 16:34:18 +0000 Subject: [Numpy-discussion] Python parallel programming on Mac OS X Maverick Message-ID: <43263CE3-B64E-49A1-811F-8D15C46D8F85@staffmail.ed.ac.uk> Hi all Quick question: What is the minimum RAM requirement for doing parallel programming with Python/Numpy on Mac OS X Maverick? I am about to buy a Macbook Pro 15" and I'd like to know if 8GB RAM (with SSD flash storage) for the Haswell quad core will be enough. I have never done any parallel programming with Python/Numpy but plan to get to grips with it on my new Macbook Pro where memory from now on is being soldered on and non-replaceable. Apple has a 14 days no quibbles refund policy but I am not sure if I can work out what I need within 14 days. Thanks, Siegfried -- The University of Edinburgh is a charitable body, registered in Scotland, with registration number SC005336. From andy.terrel at gmail.com Mon Oct 28 13:11:48 2013 From: andy.terrel at gmail.com (Andy Ray Terrel) Date: Mon, 28 Oct 2013 12:11:48 -0500 Subject: [Numpy-discussion] Python parallel programming on Mac OS X Maverick In-Reply-To: <43263CE3-B64E-49A1-811F-8D15C46D8F85@staffmail.ed.ac.uk> References: <43263CE3-B64E-49A1-811F-8D15C46D8F85@staffmail.ed.ac.uk> Message-ID: Hi Siegfried, Parallel programs, just like serial programs, will consume as much (or as little) memory as they are required to do so. The Python interpreter and Numpy libraries all fit under 20 MB. So even a haswell with its 4 cores is going to be insignificant overhead for running Python in parallel. -- Andy On Mon, Oct 28, 2013 at 11:34 AM, Siegfried Gonzi wrote: > Hi all > > Quick question: What is the minimum RAM requirement for doing parallel programming with Python/Numpy on Mac OS X Maverick? > > I am about to buy a Macbook Pro 15" and I'd like to know if 8GB RAM (with SSD flash storage) for the Haswell quad core will be enough. I have never done any parallel programming with Python/Numpy but plan to get to grips with it on my new Macbook Pro where memory from now on is being soldered on and non-replaceable. > > Apple has a 14 days no quibbles refund policy but I am not sure if I can work out what I need within 14 days. > > > Thanks, > Siegfried > -- > The University of Edinburgh is a charitable body, registered in > Scotland, with registration number SC005336. > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From jmarch at enthought.com Mon Oct 28 15:07:33 2013 From: jmarch at enthought.com (Jonathan March) Date: Mon, 28 Oct 2013 14:07:33 -0500 Subject: [Numpy-discussion] Python parallel programming on Mac OS X Maverick In-Reply-To: <43263CE3-B64E-49A1-811F-8D15C46D8F85@staffmail.ed.ac.uk> References: <43263CE3-B64E-49A1-811F-8D15C46D8F85@staffmail.ed.ac.uk> Message-ID: Siegfried, Re Mavericks and (I)Python: https://github.com/ipython/ipython/issues/4431 https://mail.python.org/pipermail/python-dev/2013-October/129832.html -- Jonathan On Mon, Oct 28, 2013 at 11:34 AM, Siegfried Gonzi wrote: > Hi all > > Quick question: What is the minimum RAM requirement for doing parallel > programming with Python/Numpy on Mac OS X Maverick? > > I am about to buy a Macbook Pro 15" and I'd like to know if 8GB RAM (with > SSD flash storage) for the Haswell quad core will be enough. 
I have never > done any parallel programming with Python/Numpy but plan to get to grips > with it on my new Macbook Pro where memory from now on is being soldered on > and non-replaceable. > > Apple has a 14 days no quibbles refund policy but I am not sure if I can > work out what I need within 14 days. > > > Thanks, > Siegfried > -- > The University of Edinburgh is a charitable body, registered in > Scotland, with registration number SC005336. > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From brett.olsen at gmail.com Mon Oct 28 15:21:15 2013 From: brett.olsen at gmail.com (Brett Olsen) Date: Mon, 28 Oct 2013 14:21:15 -0500 Subject: [Numpy-discussion] Robust Sorting of Points In-Reply-To: <526D9693.8050704@witherden.org> References: <526D5B3B.7030107@witherden.org> <526D5E9D.4000503@witherden.org> <526D614A.9030404@grinta.net> <526D67F9.5020405@witherden.org> <526D7CC7.3080008@witherden.org> <526D9693.8050704@witherden.org> Message-ID: Here's some code implementing the "replace similar values with an arbitrarily chosen one" (in this case the smallest of the similar values). I didn't see any way to do this cleverly with strides, so I just did a simple loop. It's about 100 times slower in pure Python, or a bit under 10 times slower if you're willing to use a bit of Cython. Not sure if this is good enough for your purposes. I imagine you could go a bit faster if you were willing to do the lexical integration by hand (since you've already done the separate sorting of each subarray for value replacement purposes) instead of passing that off to np.lexsort. Note that this approach will only work if your points are not only well-separated in space but also either well-separated or identical in each dimension as well. It's OK to have points with the same, say, x value, but if you have points that have close x values before the noise is added, then the noise can move intermediate points around in the sort order. It works well with the gridded data I used as a sample, but if you're, say, generating random points, this could be a problem: point 1 is (1, 0, 1e-12) point 2 is (0, 1, 0) These are well separated. The algorithm will pool those z values and report 1 as coming before 2. Unless you get jitter like this: point 1: (1, 0, 1.5e-12) point 2: (0, 1, -0.5e-12) Now they won't be pooled any more and we'll get 2 as coming before 1. 
Anyway, here's the code: In [1]: import numpy as np In [2]: def gen_grid(n, d): #Generate a bunch of grid points, n in each dimension of spacing d vals = np.linspace(0, (n-1)*d, n) x, y, z = np.meshgrid(vals, vals, vals) grid = np.empty((n**3, 3)) grid[:,0] = x.flatten() grid[:,1] = y.flatten() grid[:,2] = z.flatten() return grid def jitter(array, epsilon=1e-12): #Add random jitter from a uniform distribution of width epsilon return array + np.random.random(array.shape) * epsilon - epsilon / 2 In [3]: grid = gen_grid(4, 0.1) print np.lexsort(grid.T) print np.lexsort(jitter(grid.T)) [ 0 4 8 12 16 20 24 28 32 36 40 44 48 52 56 60 1 5 9 13 17 21 25 29 33 37 41 45 49 53 57 61 2 6 10 14 18 22 26 30 34 38 42 46 50 54 58 62 3 7 11 15 19 23 27 31 35 39 43 47 51 55 59 63] [60 4 48 32 40 12 36 28 44 56 16 8 24 0 52 20 45 25 49 1 53 29 9 33 5 61 41 37 17 13 21 57 22 50 18 10 2 62 58 54 6 34 26 42 38 46 14 30 3 11 55 63 27 15 35 43 31 39 7 59 47 23 51 19] In [4]: def pool_values(A, epsilon=1e-12): idx = np.argsort(A) for i in range(1, len(A)): if A[idx[i]] - A[idx[i-1]] < epsilon: A[idx[i]] = A[idx[i-1]] return A def stable_sort(grid): return np.lexsort((pool_values(grid[:,0]), pool_values(grid[:,1]), pool_values(grid[:,2]))) In [5]: print stable_sort(grid) print stable_sort(jitter(grid)) [ 0 4 8 12 16 20 24 28 32 36 40 44 48 52 56 60 1 5 9 13 17 21 25 29 33 37 41 45 49 53 57 61 2 6 10 14 18 22 26 30 34 38 42 46 50 54 58 62 3 7 11 15 19 23 27 31 35 39 43 47 51 55 59 63] [ 0 4 8 12 16 20 24 28 32 36 40 44 48 52 56 60 1 5 9 13 17 21 25 29 33 37 41 45 49 53 57 61 2 6 10 14 18 22 26 30 34 38 42 46 50 54 58 62 3 7 11 15 19 23 27 31 35 39 43 47 51 55 59 63] In [6]: %timeit np.lexsort(jitter(grid.T)) 100000 loops, best of 3: 10.4 ?s per loop In [7]: %timeit stable_sort(jitter(grid)) 1000 loops, best of 3: 1.39 ms per loop In [8]: %load_ext cythonmagic In [12]: %%cython import numpy as np cimport numpy as np cdef fast_pool_values(double[:] A, double epsilon=1e-12): cdef long[:] idx = np.argsort(A) cdef int i for i in range(1, len(A)): if A[idx[i]] - A[idx[i-1]] < epsilon: A[idx[i]] = A[idx[i-1]] return A def fast_stable_sort(grid): return np.lexsort((fast_pool_values(grid[:,0]), fast_pool_values(grid[:,1]), fast_pool_values(grid[:,2]))) In [10]: %timeit np.lexsort(jitter(grid.T)) 10000 loops, best of 3: 38.5 ?s per loop In [13]: %timeit fast_stable_sort(jitter(grid)) 1000 loops, best of 3: 309 ?s per loop On Sun, Oct 27, 2013 at 5:41 PM, Freddie Witherden wrote: > On 27/10/13 21:05, Jonathan March wrote: > > If an "almost always works" solution is good enough, then sort on the > > distance to some fixed random point that is in the vicinity of your N > > points. > > I had considered this. Unfortunately I need a solution which really > does always work. > > The only pure-Python solution I can envision -- at the moment anyway -- > is to do some cleverness with the output of np.unique to identify > similar values and replace them with an arbitrarily chosen one. This > should permit the output to be passed to np.lexsort without issue. > > Regards, Freddie. > > > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... 
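For what it's worth, the pooling loop above can probably be vectorised along the lines sketched below, under the same assumption Brett states: within each column, values are either nearly identical or separated by much more than epsilon. The names pool_values_vec and stable_lexsort are just placeholders, not anything from the thread.

import numpy as np

def pool_values_vec(col, eps=1e-12):
    # Sort the column, start a new group wherever the gap to the previous
    # value is at least eps, and replace every member of a group by the
    # group's first (smallest) value.
    order = np.argsort(col, kind='mergesort')
    s = col[order]
    starts = np.concatenate(([True], np.diff(s) >= eps))
    # Index of the first element of each group, propagated across the group.
    first = np.maximum.accumulate(np.where(starts, np.arange(len(s)), 0))
    pooled = np.empty_like(col)
    pooled[order] = s[first]
    return pooled

def stable_lexsort(points, eps=1e-12):
    # Pool each coordinate independently, then lexsort on the pooled keys
    # (np.lexsort treats the last key as the primary one, as in the thread).
    keys = [pool_values_vec(points[:, k], eps) for k in range(points.shape[1])]
    return np.lexsort(keys)
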
URL: From freddie at witherden.org Tue Oct 29 05:21:06 2013 From: freddie at witherden.org (Freddie Witherden) Date: Tue, 29 Oct 2013 09:21:06 +0000 Subject: [Numpy-discussion] Robust Sorting of Points In-Reply-To: <526E5C24.4070007@crans.org> References: <526D5B3B.7030107@witherden.org> <526E5C24.4070007@crans.org> Message-ID: <526F7E02.7080701@witherden.org> -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 On 28/10/2013 12:44, Pierre Haessig wrote: > Hi, > > Le 27/10/2013 19:28, Freddie Witherden a ?crit : >> I wish to sort these points into a canonical order in a fashion >> which is robust against small perturbations. In other words >> changing any component of any of the points by an epsilon ~ 1e-12 >> should not affect the resulting sorted order. > Can you give more precision on what you mean by "canonical order". > Since there is no natural order in R^n for n>1, I guess your > problem is more about *defining* what is the order you want rather > than *implementing* it in C/Python or whatever. The order itself does not need to satisfy any specific properties. Any order which can be defined and implemented in a robust fashion will do. Regards, Freddie. -----BEGIN PGP SIGNATURE----- Version: GnuPG/MacGPG2 v2.0.20 (Darwin) Comment: GPGTools - http://gpgtools.org Comment: Using GnuPG with Thunderbird - http://www.enigmail.net/ iEYEARECAAYFAlJvfgIACgkQ/J9EM/uoqVekVQCgkFxKxBCLUn6InBxyM995FVq3 x88AnjUSopT8YJgXUwIyKalAmLmVznvb =nHMZ -----END PGP SIGNATURE----- From pierre.haessig at crans.org Tue Oct 29 06:37:24 2013 From: pierre.haessig at crans.org (Pierre Haessig) Date: Tue, 29 Oct 2013 11:37:24 +0100 Subject: [Numpy-discussion] Robust Sorting of Points In-Reply-To: <526F7E02.7080701@witherden.org> References: <526D5B3B.7030107@witherden.org> <526E5C24.4070007@crans.org> <526F7E02.7080701@witherden.org> Message-ID: <526F8FE4.30309@crans.org> Hi Freddie, Le 29/10/2013 10:21, Freddie Witherden a ?crit : > The order itself does not need to satisfy any specific properties. I can't agree with you : if there is no specific property, then keeping the list *unchanged* would be a fine solution (and very fast and very very robust) ;-) what about defining a comparison function (in the sense of the old cmp keyword of list.sort) such as : def compare(point, other): delta = point - other argmax = np.abs(delta).argmax() delta_max = delta[argmax] if delta_max > 0: return 1 elif delta_max < 0: return -1 else: return 0 This function returns a comparison of the coordinates with the biggest absolute difference. Of course this doesn't define an *absolute order* (since it doesn't exist). But I think it defines a *relative order* (if this notion exists mathematically !!!!) which is indeed robust. To apply this comparison function, the simple solution I see (for Python <3) is, using your first example ; >>> my_array = np.array([[-0.5, 0, 2**0.5], [0.5, 0, 2**0.5 - 1e-15]]) >>> l = list(my_array) >>> l.sort(cmp = compare) >>> l [array([-0.5 , 0. , 1.41421356]), array([ 0.5 , 0. , 1.41421356])] Now, if this comparison function indeed answers your question, the next step is to plug it either in numpy sorting machinery or in the new Python sorting which uses a "key function" (cf. http://docs.python.org/3.3/howto/sorting.html#the-old-way-using-the-cmp-parameter). best, Pierre -------------- next part -------------- A non-text attachment was scrubbed... 
Name: signature.asc Type: application/pgp-signature Size: 900 bytes Desc: OpenPGP digital signature URL: From pierre.haessig at crans.org Tue Oct 29 06:58:12 2013 From: pierre.haessig at crans.org (Pierre Haessig) Date: Tue, 29 Oct 2013 11:58:12 +0100 Subject: [Numpy-discussion] Robust Sorting of Points In-Reply-To: <526F8FE4.30309@crans.org> References: <526D5B3B.7030107@witherden.org> <526E5C24.4070007@crans.org> <526F7E02.7080701@witherden.org> <526F8FE4.30309@crans.org> Message-ID: <526F94C4.3030901@crans.org> Le 29/10/2013 11:37, Pierre Haessig a ?crit : > def compare(point, other): > delta = point - other > argmax = np.abs(delta).argmax() > delta_max = delta[argmax] > if delta_max > 0: > return 1 > elif delta_max < 0: > return -1 > else: > return 0 > > This function returns a comparison of the coordinates with the biggest > absolute difference. Of course this doesn't define an *absolute order* > (since it doesn't exist). But I think it defines a *relative order* (if > this notion exists mathematically !!!!) which is indeed robust. In fact this comparison is not robust for points whose difference has two coordinates of almost same absolute magnitude but of different sign : p1 = np.array([1,-1,0]) p2 = np.array([1,-1+1e-3,0]) p3 = np.array([1,-1-1e-3,0]) compare(p1,p1) Out[26]: 0 compare(p1,p2) Out[27]: -1 compare(p1,p3) Out[28]: 1 sorry, Pierre -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 900 bytes Desc: OpenPGP digital signature URL: From heng at cantab.net Tue Oct 29 12:47:29 2013 From: heng at cantab.net (Henry Gomersall) Date: Tue, 29 Oct 2013 16:47:29 +0000 Subject: [Numpy-discussion] getting the equivalent complex dtype from a real or int array Message-ID: <526FE6A1.8050809@cantab.net> Is there a way to extract the size of array that would be created by doing 1j*array? The problem I'm having is in creating an empty array to fill with complex values without knowing a priori what the input data type is. For example, I have a real or int array `a`. I want to create an array `b` which can hold values from 1j*a in such a way that I don't need to compute those explicitly (because I only need parts of the array say), without upcasting (or indeed downcasting) the result. So if `a` was dtype 'float32`, `b` would be of dtype `complex64`. If `a` was `int64`, `b` would be of dtype `complex128` etc. Cheers, Henry From heng at cantab.net Tue Oct 29 12:56:05 2013 From: heng at cantab.net (Henry Gomersall) Date: Tue, 29 Oct 2013 16:56:05 +0000 Subject: [Numpy-discussion] getting the equivalent complex dtype from a real or int array In-Reply-To: <526FE6A1.8050809@cantab.net> References: <526FE6A1.8050809@cantab.net> Message-ID: <526FE8A5.4060307@cantab.net> On 29/10/13 16:47, Henry Gomersall wrote: > Is there a way to extract the size of array that would be created by > doing 1j*array? Of course, I mean dtype of the array. Henry From frankliuao at gmail.com Tue Oct 29 13:02:13 2013 From: frankliuao at gmail.com (Ao Liu) Date: Tue, 29 Oct 2013 12:02:13 -0500 Subject: [Numpy-discussion] numpy random uniform seed? Message-ID: Hi, I've been using np.random.uniform and mpi4py. I found that the random number each processor (or rank) generated are the same, so I was wondering how random.uniform chose its seeds. Theoretically, those ranks shouldn't have anything to do with others. 
The only possibility that I can think of is that they all used the computer time as the seed, so the random numbers were the same. Anyone has more ideas and suggestions on how to correct this? Thanks, Best, Ao -- I am Ao.Liu, always hungry and still waiting for what I've always been believing. To know me more, please visit my little tiny personal website: www.frankliuao.com -------------- next part -------------- An HTML attachment was scrubbed... URL: From robert.kern at gmail.com Tue Oct 29 13:02:33 2013 From: robert.kern at gmail.com (Robert Kern) Date: Tue, 29 Oct 2013 17:02:33 +0000 Subject: [Numpy-discussion] getting the equivalent complex dtype from a real or int array In-Reply-To: <526FE6A1.8050809@cantab.net> References: <526FE6A1.8050809@cantab.net> Message-ID: On Tue, Oct 29, 2013 at 4:47 PM, Henry Gomersall wrote: > > Is there a way to extract the size of array that would be created by > doing 1j*array? > > The problem I'm having is in creating an empty array to fill with > complex values without knowing a priori what the input data type is. > > For example, I have a real or int array `a`. > > I want to create an array `b` which can hold values from 1j*a in such a > way that I don't need to compute those explicitly (because I only need > parts of the array say), without upcasting (or indeed downcasting) the > result. > > So if `a` was dtype 'float32`, `b` would be of dtype `complex64`. If `a` > was `int64`, `b` would be of dtype `complex128` etc. Quick and dirty: # Get a tiny array from `a` to test the dtype of its output when multiplied # by a complex float. It must be an array rather than a scalar since the # casting rules are different for array*scalar and scalar*scalar. dt = (a.flat[:2] * 1j).dtype b = np.empty(shape, dtype=dt) -- Robert Kern -------------- next part -------------- An HTML attachment was scrubbed... URL: From robert.kern at gmail.com Tue Oct 29 13:14:06 2013 From: robert.kern at gmail.com (Robert Kern) Date: Tue, 29 Oct 2013 17:14:06 +0000 Subject: [Numpy-discussion] numpy random uniform seed? In-Reply-To: References: Message-ID: On Tue, Oct 29, 2013 at 5:02 PM, Ao Liu wrote: > > Hi, > > I've been using np.random.uniform and mpi4py. > > I found that the random number each processor (or rank) generated are the same, so I was wondering how random.uniform chose its seeds. Theoretically, those ranks shouldn't have anything to do with others. The only possibility that I can think of is that they all used the computer time as the seed, so the random numbers were the same. numpy.random grabs random bytes from /dev/urandom on UNIX-type systems and the equivalent Crypto API on Windows. I suspect that MPI is forking the main process after the global RandomState instance has already been initialized, thus copying the state into each process. You should not use the convenience aliases in numpy.random when doing any kind of parallel programming. Instead, you should instantiate numpy.random.RandomState() in each subprocess and call its methods. If you really don't care about controlling the seed, then numpy.random.RandomState() with no arguments should be sufficient. The OS will take care of giving each of those instantiations unique states. -- Robert Kern -------------- next part -------------- An HTML attachment was scrubbed... 
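Following on from that advice, a minimal per-rank sketch with mpi4py might look like the following. Creating RandomState() with no seed lets the OS supply independent entropy for each process, as described above; the commented-out per-rank seed is just one conventional way to make runs reproducible and is an assumption here, not something prescribed in the thread.

import numpy as np
from mpi4py import MPI

comm = MPI.COMM_WORLD

# One independent generator per MPI rank, seeded from OS entropy.
rng = np.random.RandomState()

# For reproducible runs, a per-rank seed derived from a base seed could be
# used instead, e.g.:
# rng = np.random.RandomState(12345 + comm.rank)

samples = rng.uniform(0.0, 1.0, size=5)
print("rank %d: %s" % (comm.rank, samples))
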
URL: From heng at cantab.net Tue Oct 29 13:36:11 2013 From: heng at cantab.net (Henry Gomersall) Date: Tue, 29 Oct 2013 17:36:11 +0000 Subject: [Numpy-discussion] getting the equivalent complex dtype from a real or int array In-Reply-To: References: <526FE6A1.8050809@cantab.net> Message-ID: <526FF20B.3070104@cantab.net> On 29/10/13 17:02, Robert Kern wrote: > > Quick and dirty: > > # Get a tiny array from `a` to test the dtype of its output when > multiplied > # by a complex float. It must be an array rather than a scalar since the > # casting rules are different for array*scalar and scalar*scalar. > dt = (a.flat[:2] * 1j).dtype > b = np.empty(shape, dtype=dt) Yeah, I thought about that, but it seemed a little *too* dirty. I'm currently doing this, which is fairly elegant, but a bit wordy: _rc_dtype_pairs = {numpy.dtype('float32'): numpy.dtype('complex64'), numpy.dtype('float64'): numpy.dtype('complex128'), numpy.dtype('longdouble'): numpy.dtype('clongdouble'), numpy.dtype('complex64'): numpy.dtype('float32'), numpy.dtype('complex128'): numpy.dtype('float64'), numpy.dtype('clongdouble'): numpy.dtype('longdouble')} _default_dtype = numpy.dtype('float64') if a.dtype not in _rc_dtype_pairs: complex_dtype=_rc_dtype_pairs[_default_dtype] else: complex_dtype=_rc_dtype_pairs[a.dtype] Cheers, Henry From sebastian at sipsolutions.net Tue Oct 29 14:01:05 2013 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Tue, 29 Oct 2013 19:01:05 +0100 Subject: [Numpy-discussion] getting the equivalent complex dtype from a real or int array In-Reply-To: <526FE6A1.8050809@cantab.net> References: <526FE6A1.8050809@cantab.net> Message-ID: <1383069665.11725.1.camel@sebastian-laptop> On Tue, 2013-10-29 at 16:47 +0000, Henry Gomersall wrote: > Is there a way to extract the size of array that would be created by > doing 1j*array? > There is np.result_type. It does the handling of scalars as normal, dtypes will be handled like arrays (scalars are allowed to lose precision). - Sebastian > The problem I'm having is in creating an empty array to fill with > complex values without knowing a priori what the input data type is. > > For example, I have a real or int array `a`. > > I want to create an array `b` which can hold values from 1j*a in such a > way that I don't need to compute those explicitly (because I only need > parts of the array say), without upcasting (or indeed downcasting) the > result. > > So if `a` was dtype 'float32`, `b` would be of dtype `complex64`. If `a` > was `int64`, `b` would be of dtype `complex128` etc. > > Cheers, > > Henry > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From charlesr.harris at gmail.com Tue Oct 29 15:57:44 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Tue, 29 Oct 2013 13:57:44 -0600 Subject: [Numpy-discussion] 1.7.2 Message-ID: Hi All, I'm going to tag 1.7.2 soon. That is, unless someone else would like the experience of making a release. Any volunteers? Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Tue Oct 29 16:00:00 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Tue, 29 Oct 2013 14:00:00 -0600 Subject: [Numpy-discussion] 1.7.2 In-Reply-To: References: Message-ID: On Tue, Oct 29, 2013 at 1:57 PM, Charles R Harris wrote: > Hi All, > > I'm going to tag 1.7.2 soon. 
That is, unless someone else would like the > experience of making a release. Any volunteers? > > Make that 1.7.2rc1. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From jtaylor.debian at googlemail.com Tue Oct 29 18:55:44 2013 From: jtaylor.debian at googlemail.com (Julian Taylor) Date: Tue, 29 Oct 2013 23:55:44 +0100 Subject: [Numpy-discussion] 1.7.2 In-Reply-To: References: Message-ID: <52703CF0.3040605@googlemail.com> On 29.10.2013 21:00, Charles R Harris wrote: > > > > On Tue, Oct 29, 2013 at 1:57 PM, Charles R Harris > > wrote: > > Hi All, > > I'm going to tag 1.7.2 soon. That is, unless someone else would like > the experience of making a release. Any volunteers? > > > Make that 1.7.2rc1. > > Chuck > I got a free day on friday so I can probably help out. From charlesr.harris at gmail.com Tue Oct 29 19:56:59 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Tue, 29 Oct 2013 17:56:59 -0600 Subject: [Numpy-discussion] 1.7.2 In-Reply-To: <52703CF0.3040605@googlemail.com> References: <52703CF0.3040605@googlemail.com> Message-ID: On Tue, Oct 29, 2013 at 4:55 PM, Julian Taylor < jtaylor.debian at googlemail.com> wrote: > On 29.10.2013 21:00, Charles R Harris wrote: > > > > > > > > On Tue, Oct 29, 2013 at 1:57 PM, Charles R Harris > > > wrote: > > > > Hi All, > > > > I'm going to tag 1.7.2 soon. That is, unless someone else would like > > the experience of making a release. Any volunteers? > > > > > > Make that 1.7.2rc1. > > > > Chuck > > > > I got a free day on friday so I can probably help out. > Sounds good. I'll need to give you permissions on sourceforge so you can make folders and upload files. Are you registered there? I also need to update the HOWTO_RELEASE documentation as it is a bit sparse at the moment. For an RC we need binaries also. Ralf can probably help with that when he gets time, or you can try setting up to do it yourself. I haven't tried the latter myself, but someday... I suspect it will take longer than one day but hopefully it will be easier than 1.8.0 Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From d.l.goldsmith at gmail.com Tue Oct 29 22:19:34 2013 From: d.l.goldsmith at gmail.com (David Goldsmith) Date: Tue, 29 Oct 2013 19:19:34 -0700 Subject: [Numpy-discussion] getting the equivalent complex dtype from a real or int array Message-ID: We really ought to have a special page for all of Robert's little gems! DG On Tue, Oct 29, 2013 at 10:00 AM, wrote: > > -----------------------------Message: 5 > Date: Tue, 29 Oct 2013 17:02:33 +0000 > From: Robert Kern > Subject: Re: [Numpy-discussion] getting the equivalent complex dtype > from a real or int array > To: Discussion of Numerical Python > Message-ID: > < > CAF6FJiuYnDbE1Uo9J6OnL1pq+oVZX-ecqKz0Qe9MigyQT69V_g at mail.gmail.com> > Content-Type: text/plain; charset="utf-8" > > On Tue, Oct 29, 2013 at 4:47 PM, Henry Gomersall wrote: > > > > Is there a way to extract the size of array that would be created by > > doing 1j*array? > > > > The problem I'm having is in creating an empty array to fill with > > complex values without knowing a priori what the input data type is. > > > > For example, I have a real or int array `a`. > > > > I want to create an array `b` which can hold values from 1j*a in such a > > way that I don't need to compute those explicitly (because I only need > > parts of the array say), without upcasting (or indeed downcasting) the > > result. 
> > > > So if `a` was dtype 'float32`, `b` would be of dtype `complex64`. If `a` > > was `int64`, `b` would be of dtype `complex128` etc. > > Quick and dirty: > > # Get a tiny array from `a` to test the dtype of its output when multiplied > # by a complex float. It must be an array rather than a scalar since the > # casting rules are different for array*scalar and scalar*scalar. > dt = (a.flat[:2] * 1j).dtype > b = np.empty(shape, dtype=dt) > > -- > Robert Kern > -------------- next part -------------- An HTML attachment was scrubbed... URL: From eraldo.pomponi at gmail.com Wed Oct 30 04:45:15 2013 From: eraldo.pomponi at gmail.com (Eraldo Pomponi) Date: Wed, 30 Oct 2013 09:45:15 +0100 Subject: [Numpy-discussion] getting the equivalent complex dtype from a real or int array In-Reply-To: References: Message-ID: > We really ought to have a special page for all of Robert's little gems! > I'm totally in favor or having that page. In my gmail account almost every Robert's answer gets a star!!! Maybe one day I'll try to put them together. Cheers, EP -------------- next part -------------- An HTML attachment was scrubbed... URL: From heng at cantab.net Wed Oct 30 05:23:11 2013 From: heng at cantab.net (Henry Gomersall) Date: Wed, 30 Oct 2013 09:23:11 +0000 Subject: [Numpy-discussion] getting the equivalent complex dtype from a real or int array In-Reply-To: <1383069665.11725.1.camel@sebastian-laptop> References: <526FE6A1.8050809@cantab.net> <1383069665.11725.1.camel@sebastian-laptop> Message-ID: <5270CFFF.6050505@cantab.net> On 29/10/13 18:01, Sebastian Berg wrote: > On Tue, 2013-10-29 at 16:47 +0000, Henry Gomersall wrote: >> >Is there a way to extract the size of array that would be created by >> >doing 1j*array? >> > > There is np.result_type. It does the handling of scalars as normal, > dtypes will be handled like arrays (scalars are allowed to lose > precision). This is clearly the correct way to do this. Many thanks. Henry From ralf.gommers at gmail.com Wed Oct 30 15:00:47 2013 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Wed, 30 Oct 2013 20:00:47 +0100 Subject: [Numpy-discussion] 1.7.2 In-Reply-To: References: <52703CF0.3040605@googlemail.com> Message-ID: On Wed, Oct 30, 2013 at 12:56 AM, Charles R Harris < charlesr.harris at gmail.com> wrote: > > > > On Tue, Oct 29, 2013 at 4:55 PM, Julian Taylor < > jtaylor.debian at googlemail.com> wrote: > >> On 29.10.2013 21:00, Charles R Harris wrote: >> > >> > >> > >> > On Tue, Oct 29, 2013 at 1:57 PM, Charles R Harris >> > > wrote: >> > >> > Hi All, >> > >> > I'm going to tag 1.7.2 soon. That is, unless someone else would like >> > the experience of making a release. Any volunteers? >> > >> > >> > Make that 1.7.2rc1. >> > >> > Chuck >> > >> >> I got a free day on friday so I can probably help out. >> > > Sounds good. I'll need to give you permissions on sourceforge so you can > make folders and upload files. Are you registered there? I also need to > update the HOWTO_RELEASE documentation as it is a bit sparse at the moment. > For an RC we need binaries also. Ralf can probably help with that when he > gets time > I'll be on holiday for a week starting on Friday.... Ralf > , or you can try setting up to do it yourself. I haven't tried the latter > myself, but someday... 
> > I suspect it will take longer than one day but hopefully it will be easier > than 1.8.0 > > Chuck > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From tanayamitshah at gmail.com Wed Oct 30 16:32:31 2013 From: tanayamitshah at gmail.com (Tanay Shah) Date: Wed, 30 Oct 2013 13:32:31 -0700 (PDT) Subject: [Numpy-discussion] timedelta64 NaT doesn't print correctly when printing an ndarray Message-ID: <1383165151252-35596.post@n7.nabble.com> When printing a timedelta64('NaT') scalar, it converts to string correctly but when you print an ndarray where an element is timedelta64('NaT'), it prints the element as its internal representation (-9223372036854775808) >>> numpy.timedelta64('NaT') numpy.timedelta64('NaT') >>> print numpy.timedelta64('NaT') NaT >>> numpy.array([None], dtype=numpy.timedelta64) array([-9223372036854775808], dtype=timedelta64) >>> print numpy.array([None], dtype=numpy.timedelta64) [-9223372036854775808] Could someone suggest a method by which I can get around this to display the NaT correctly? -- View this message in context: http://numpy-discussion.10968.n7.nabble.com/timedelta64-NaT-doesn-t-print-correctly-when-printing-an-ndarray-tp35596.html Sent from the Numpy-discussion mailing list archive at Nabble.com. From charlesr.harris at gmail.com Wed Oct 30 17:49:56 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Wed, 30 Oct 2013 15:49:56 -0600 Subject: [Numpy-discussion] ANN: NumPy 1.8.0 release. Message-ID: I am pleased to announce the availability of NumPy 1.8.0. This release is the culmination of over a years worth of work by the NumPy team and contains many fixes, enhancements, and new features. Highlights are: - New, no 2to3, Python 2 and Python 3 are supported by a common code base. - New, gufuncs for linear algebra, enabling operations on stacked arrays. - New, inplace fancy indexing for ufuncs with the ``.at`` method. - New, ``partition`` function, partial sorting via selection for fast median. - New, ``nanmean``, ``nanvar``, and ``nanstd`` functions skipping NaNs. - New, ``full`` and ``full_like`` functions to create value initialized arrays. - New, ``PyUFunc_RegisterLoopForDescr``, better ufunc support for user dtypes. - Numerous performance improvements in many areas. This release requires Python 2.6, 2.7 or 3.2-3.3, support for Python 2.4 and 2.5 has been dropped. Sources and binaries can be found at http://sourceforge.net/projects/numpy/files/NumPy/1.8.0/. Some 119 people contributed to this release. This is a remarkable increase and shows that there is still life in this venerable code that had its beginning in Numeric some 18 years ago. Many thanks to you all. Enjoy, Chuck NumPy 1.8.0 Release Notes ************************* This release supports Python 2.6 -2.7 and 3.2 - 3.3. Highlights ========== * New, no 2to3, Python 2 and Python 3 are supported by a common code base. * New, gufuncs for linear algebra, enabling operations on stacked arrays. * New, inplace fancy indexing for ufuncs with the ``.at`` method. * New, ``partition`` function, partial sorting via selection for fast median. * New, ``nanmean``, ``nanvar``, and ``nanstd`` functions skipping NaNs. * New, ``full`` and ``full_like`` functions to create value initialized arrays. * New, ``PyUFunc_RegisterLoopForDescr``, better ufunc support for user dtypes. 
* Numerous performance improvements in many areas. Dropped Support =============== Support for Python versions 2.4 and 2.5 has been dropped. Support for SCons has been removed. Future Changes ============== The Datetime64 type remains experimental in this release. In 1.9 there will probably be some changes to make it more usable. The diagonal method currently returns a new array and raises a FutureWarning. In 1.9 it will return a readonly view. Multiple field selection from an array of structured type currently returns a new array and raises a FutureWarning. In 1.9 it will return a readonly view. The numpy/oldnumeric and numpy/numarray compatibility modules will be removed in 1.9. Compatibility notes =================== The doc/sphinxext content has been moved into its own github repository, and is included in numpy as a submodule. See the instructions in doc/HOWTO_BUILD_DOCS.rst.txt for how to access the content. .. _numpydoc: https://github.com/numpy/numpydoc The hash function of numpy.void scalars has been changed. Previously the pointer to the data was hashed as an integer. Now, the hash function uses the tuple-hash algorithm to combine the hash functions of the elements of the scalar, but only if the scalar is read-only. Numpy has switched its build system to using 'separate compilation' by default. In previous releases this was supported, but not the default. This should produce the same results as the old system, but if you're trying to do something complicated like link numpy statically or use an unusual compiler, then it's possible you will encounter problems. If so, please file a bug and, as a temporary workaround, you can re-enable the old build system by exporting the shell variable NPY_SEPARATE_COMPILATION=0. For the AdvancedNew iterator the ``oa_ndim`` flag should now be -1 to indicate that no ``op_axes`` and ``itershape`` are passed in. The ``oa_ndim == 0`` case now indicates a 0-D iteration with ``op_axes`` being NULL, and the old usage is deprecated. This does not affect the ``NpyIter_New`` or ``NpyIter_MultiNew`` functions. The functions nanargmin and nanargmax now return np.iinfo['intp'].min for the index in all-NaN slices. Previously the functions would raise a ValueError for array returns and NaN for scalar returns. NPY_RELAXED_STRIDES_CHECKING ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ There is a new compile time environment variable ``NPY_RELAXED_STRIDES_CHECKING``. If this variable is set to 1, then numpy will consider more arrays to be C- or F-contiguous -- for example, it becomes possible to have a column vector which is considered both C- and F-contiguous simultaneously. The new definition is more accurate, allows for faster code that makes fewer unnecessary copies, and simplifies numpy's code internally. However, it may also break third-party libraries that make too-strong assumptions about the stride values of C- and F-contiguous arrays. (It is also currently known that this breaks Cython code using memoryviews, which will be fixed in Cython.) THIS WILL BECOME THE DEFAULT IN A FUTURE RELEASE, SO PLEASE TEST YOUR CODE NOW AGAINST NUMPY BUILT WITH:: NPY_RELAXED_STRIDES_CHECKING=1 python setup.py install You can check whether NPY_RELAXED_STRIDES_CHECKING is in effect by running:: np.ones((10, 1), order="C").flags.f_contiguous This will be ``True`` if relaxed strides checking is enabled, and ``False`` otherwise.
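A short illustrative session, assuming numpy was built with ``NPY_RELAXED_STRIDES_CHECKING=1`` as above (a default build prints ``False`` for the second flag)::

    >>> import numpy as np
    >>> a = np.ones((10, 1), order="C")
    >>> a.flags.c_contiguous
    True
    >>> a.flags.f_contiguous
    True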
The typical problem we've seen so far is C code that works with C-contiguous arrays, and assumes that the itemsize can be accessed by looking at the last element in the ``PyArray_STRIDES(arr)`` array. When relaxed strides are in effect, this is not true (and in fact, it never was true in some corner cases). Instead, use ``PyArray_ITEMSIZE(arr)``. For more information check the "Internal memory layout of an ndarray" section in the documentation. Binary operations with non-arrays as second argument ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Binary operations of the form ``<array> * <non-array>`` where ``<non-array>`` declares an ``__array_priority__`` higher than that of ``<array>`` will now unconditionally return *NotImplemented*, giving ``<non-array>`` a chance to handle the operation. Previously, `NotImplemented` would only be returned if ``<non-array>`` actually implemented the reversed operation, and after a (potentially expensive) array conversion of ``<non-array>`` had been attempted. (`bug `_, `pull request `_) Function `median` used with `overwrite_input` only partially sorts array ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ If `median` is used with the `overwrite_input` option, the input array will now only be partially sorted instead of fully sorted. Fix to financial.npv ~~~~~~~~~~~~~~~~~~~~ The npv function had a bug. Contrary to what the documentation stated, it summed from indexes ``1`` to ``M`` instead of from ``0`` to ``M - 1``. The fix changes the returned value. The mirr function called the npv function, but worked around the problem, so that was also fixed and the return value of the mirr function remains unchanged. Runtime warnings when comparing NaN numbers ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Comparing ``NaN`` floating point numbers now raises the ``invalid`` runtime warning. If a ``NaN`` is expected the warning can be ignored using np.errstate. E.g.:: with np.errstate(invalid='ignore'): operation() New Features ============ Support for linear algebra on stacked arrays ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ The gufunc machinery is now used for np.linalg, allowing operations on stacked arrays and vectors. For example:: >>> a array([[[ 1., 1.], [ 0., 1.]], [[ 1., 1.], [ 0., 1.]]]) >>> np.linalg.inv(a) array([[[ 1., -1.], [ 0., 1.]], [[ 1., -1.], [ 0., 1.]]]) In place fancy indexing for ufuncs ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ The function ``at`` has been added to ufunc objects to allow in place ufuncs with no buffering when fancy indexing is used. For example, the following will increment the first and second items in the array, and will increment the third item twice: ``numpy.add.at(arr, [0, 1, 2, 2], 1)`` This is what many have mistakenly thought ``arr[[0, 1, 2, 2]] += 1`` would do, but that does not work as the incremented value of ``arr[2]`` is simply copied into the third slot in ``arr`` twice, not incremented twice. New functions `partition` and `argpartition` ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ New functions to partially sort arrays via a selection algorithm. A ``partition`` by index ``k`` moves the ``k`` smallest elements to the front of an array. All elements before ``k`` are then smaller than or equal to the value in position ``k``, and all elements following ``k`` are then greater than or equal to the value in position ``k``. The ordering of the values within these bounds is undefined. A sequence of indices can be provided to sort all of them into their sorted position at once via iterative partitioning.
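For example (illustrative values; the ordering on either side of position ``k`` is unspecified, so the exact output may differ)::

    >>> a = np.array([7, 2, 9, 1, 5, 3])
    >>> np.partition(a, 2)
    array([2, 1, 3, 9, 5, 7])

In the result, position ``2`` holds the third-smallest value, ``3``; everything before it is smaller or equal and everything after it is greater or equal.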
This can be used to efficiently obtain order statistics like median or percentiles of samples. ``partition`` has a linear time complexity of ``O(n)`` while a full sort has ``O(n log(n))``. New functions `nanmean`, `nanvar` and `nanstd` ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ New nan aware statistical functions are added. In these functions the results are what would be obtained if nan values were omitted from all computations. New functions `full` and `full_like` ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ New convenience functions to create arrays filled with a specific value; complementary to the existing `zeros` and `zeros_like` functions. IO compatibility with large files ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Large NPZ files >2GB can be loaded on 64-bit systems. Building against OpenBLAS ~~~~~~~~~~~~~~~~~~~~~~~~~ It is now possible to build numpy against OpenBLAS by editing site.cfg. New constant ~~~~~~~~~~~~ Euler's constant is now exposed in numpy as euler_gamma. New modes for qr ~~~~~~~~~~~~~~~~ New modes 'complete', 'reduced', and 'raw' have been added to the qr factorization and the old 'full' and 'economic' modes are deprecated. The 'reduced' mode replaces the old 'full' mode and is the default, as was the 'full' mode, so backward compatibility can be maintained by not specifying the mode. The 'complete' mode returns a full dimensional factorization, which can be useful for obtaining a basis for the orthogonal complement of the range space. The 'raw' mode returns arrays that contain the Householder reflectors and scaling factors that can be used in the future to apply q without needing to convert to a matrix. The 'economic' mode is simply deprecated; there isn't much use for it and it isn't any more efficient than the 'raw' mode. New `invert` argument to `in1d` ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ The function `in1d` now accepts an `invert` argument which, when `True`, causes the returned array to be inverted. Advanced indexing using `np.newaxis` ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ It is now possible to use `np.newaxis`/`None` together with index arrays instead of only in simple indices. This means that ``array[np.newaxis, [0, 1]]`` will now work as expected and select the first two rows while prepending a new axis to the array. C-API ~~~~~ New ufuncs can now be registered with builtin input types and a custom output type. Before this change, NumPy wouldn't be able to find the right ufunc loop function when the ufunc was called from Python, because the ufunc loop signature matching logic wasn't looking at the output operand type. Now the correct ufunc loop is found, as long as the user provides an output argument with the correct output type. runtests.py ~~~~~~~~~~~ A simple test runner script ``runtests.py`` was added. It also builds Numpy via ``setup.py build`` and can be used to run tests easily during development. Improvements ============ IO performance improvements ~~~~~~~~~~~~~~~~~~~~~~~~~~~ Performance in reading large files was improved by chunking (see also IO compatibility). Performance improvements to `pad` ~~~~~~~~~~~~~~~~~~~~~ The `pad` function has a new implementation, greatly improving performance for all inputs except `mode=<function>` (retained for backwards compatibility). Scaling with dimensionality is dramatically improved for rank >= 4.
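For reference, a minimal ``pad`` call using one of the built-in string modes (the fast path in the new implementation)::

    >>> np.pad(np.arange(4), (1, 2), mode='constant')
    array([0, 0, 1, 2, 3, 0, 0])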
Performance improvements to `isnan`, `isinf`, `isfinite` and `byteswap` ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ `isnan`, `isinf`, `isfinite` and `byteswap` have been improved to take advantage of compiler builtins to avoid expensive calls to libc. This improves performance of these operations by about a factor of two on gnu libc systems. Performance improvements via SSE2 vectorization ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Several functions have been optimized to make use of SSE2 CPU SIMD instructions. * Float32 and float64: * base math (`add`, `subtract`, `divide`, `multiply`) * `sqrt` * `minimum/maximum` * `absolute` * Bool: * `logical_or` * `logical_and` * `logical_not` This improves performance of these operations up to 4x/2x for float32/float64 and up to 10x for bool depending on the location of the data in the CPU caches. The performance gain is greatest for in-place operations. In order to use the improved functions the SSE2 instruction set must be enabled at compile time. It is enabled by default on x86_64 systems. On x86_32 with a capable CPU it must be enabled by passing the appropriate flag to the CFLAGS build variable (-msse2 with gcc). Performance improvements to `median` ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ `median` is now implemented in terms of `partition` instead of `sort` which reduces its time complexity from O(n log(n)) to O(n). If used with the `overwrite_input` option the array will now only be partially sorted instead of fully sorted. Overrideable operand flags in ufunc C-API ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ When creating a ufunc, the default ufunc operand flags can be overridden via the new op_flags attribute of the ufunc object. For example, to set the operand flag for the first input to read/write: PyObject \*ufunc = PyUFunc_FromFuncAndData(...); ufunc->op_flags[0] = NPY_ITER_READWRITE; This allows a ufunc to perform an operation in place. Also, global nditer flags can be overridden via the new iter_flags attribute of the ufunc object. For example, to set the reduce flag for a ufunc: ufunc->iter_flags = NPY_ITER_REDUCE_OK; Changes ======= General ~~~~~~~ The function np.take now allows 0-d arrays as indices. The separate compilation mode is now enabled by default. Several changes to np.insert and np.delete: * Previously, negative indices and indices that pointed past the end of the array were simply ignored. Now, this will raise a Future or Deprecation Warning. In the future they will be treated like normal indexing treats them -- negative indices will wrap around, and out-of-bound indices will generate an error. * Previously, boolean indices were treated as if they were integers (always referring to either the 0th or 1st item in the array). In the future, they will be treated as masks. In this release, they raise a FutureWarning warning of this coming change. * In Numpy 1.7. np.insert already allowed the syntax `np.insert(arr, 3, [1,2,3])` to insert multiple items at a single position. In Numpy 1.8. this is also possible for `np.insert(arr, [3], [1, 2, 3])`. Padded regions from np.pad are now correctly rounded, not truncated. C-API Array Additions ~~~~~~~~~~~~~~~~~~~~~ Four new functions have been added to the array C-API. * PyArray_Partition * PyArray_ArgPartition * PyArray_SelectkindConverter * PyDataMem_NEW_ZEROED C-API Ufunc Additions ~~~~~~~~~~~~~~~~~~~~~ One new function has been added to the ufunc C-API that allows to register an inner loop for user types using the descr. 
* PyUFunc_RegisterLoopForDescr Deprecations ============ The 'full' and 'economic' modes of qr factorization are deprecated. General ~~~~~~~ The use of non-integer for indices and most integer arguments has been deprecated. Previously float indices and function arguments such as axes or shapes were truncated to integers without warning. For example `arr.reshape(3., -1)` or `arr[0.]` will trigger a deprecation warning in NumPy 1.8., and in some future version of NumPy they will raise an error. Authors ======= This release contains work by the following people who contributed at least one patch to this release. The names are in alphabetical order by first name: * 87 * Adam Ginsburg + * Adam Griffiths + * Alexander Belopolsky + * Alex Barth + * Alex Ford + * Andreas Hilboll + * Andreas Kloeckner + * Andreas Schwab + * Andrew Horton + * argriffing + * Arink Verma + * Bago Amirbekian + * Bartosz Telenczuk + * bebert218 + * Benjamin Root + * Bill Spotz + * Bradley M. Froehle * Carwyn Pelley + * Charles Harris * Chris * Christian Brueffer + * Christoph Dann + * Christoph Gohlke * Dan Hipschman + * Daniel + * Dan Miller + * daveydave400 + * David Cournapeau * David Warde-Farley * Denis Laxalde * dmuellner + * Edward Catmur + * Egor Zindy + * endolith * Eric Firing * Eric Fode * Eric Moore + * Eric Price + * Fazlul Shahriar + * F?lix Hartmann + * Fernando Perez * Frank B + * Frank Breitling + * Frederic * Gabriel * GaelVaroquaux * Guillaume Gay + * Han Genuit * HaroldMills + * hklemm + * jamestwebber + * Jason Madden + * Jay Bourque * jeromekelleher + * Jes?s G?mez + * jmozmoz + * jnothman + * Johannes Sch?nberger + * John Benediktsson + * John Salvatier + * John Stechschulte + * Jonathan Waltman + * Joon Ro + * Jos de Kloe + * Joseph Martinot-Lagarde + * Josh Warner (Mac) + * Jostein B? Fl?ystad + * Juan Luis Cano Rodr?guez + * Julian Taylor + * Julien Phalip + * K.-Michael Aye + * Kumar Appaiah + * Lars Buitinck * Leon Weber + * Luis Pedro Coelho * Marcin Juszkiewicz * Mark Wiebe * Marten van Kerkwijk + * Martin Baeuml + * Martin Spacek * Martin Teichmann + * Matt Davis + * Matthew Brett * Maximilian Albert + * m-d-w + * Michael Droettboom * mwtoews + * Nathaniel J. Smith * Nicolas Scheffer + * Nils Werner + * ochoadavid + * Ond?ej ?ert?k * ovillellas + * Paul Ivanov * Pauli Virtanen * peterjc * Ralf Gommers * Raul Cota + * Richard Hattersley + * Robert Costa + * Robert Kern * Rob Ruana + * Ronan Lamy * Sandro Tosi * Sascha Peilicke + * Sebastian Berg * Skipper Seabold * Stefan van der Walt * Steve + * Takafumi Arakaki + * Thomas Robitaille + * Tomas Tomecek + * Travis E. Oliphant * Valentin Haenel * Vladimir Rutsky + * Warren Weckesser * Yaroslav Halchenko * Yury V. Zaytsev + A total of 119 people contributed to this release. People with a "+" by their names contributed a patch for the first time. -------------- next part -------------- An HTML attachment was scrubbed... URL: From irving at naml.us Wed Oct 30 20:52:15 2013 From: irving at naml.us (Geoffrey Irving) Date: Wed, 30 Oct 2013 17:52:15 -0700 Subject: [Numpy-discussion] char with native integer signedness Message-ID: Is there a standard way in numpy of getting a char with C-native integer signedness? I.e., boost::is_signed::value ? numpy.byte : numpy.ubyte but without nonsensical mixing of languages? 
Thanks, Geoffrey From robert.kern at gmail.com Thu Oct 31 05:08:01 2013 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 31 Oct 2013 09:08:01 +0000 Subject: [Numpy-discussion] char with native integer signedness In-Reply-To: References: Message-ID: On Thu, Oct 31, 2013 at 12:52 AM, Geoffrey Irving wrote: > > Is there a standard way in numpy of getting a char with C-native > integer signedness? I.e., > > boost::is_signed::value ? numpy.byte : numpy.ubyte > > but without nonsensical mixing of languages? This is for interop with a C/C++ extension, right? Do this test in that extension's C/C++ code to expose the right dtype. As far as I know, this is not something determined by the hardware, but the compiler used. Since the compiler of numpy may be different from your extension, only your extension can do that test properly. -- Robert Kern -------------- next part -------------- An HTML attachment was scrubbed... URL: From ndbecker2 at gmail.com Thu Oct 31 08:58:36 2013 From: ndbecker2 at gmail.com (Neal Becker) Date: Thu, 31 Oct 2013 08:58:36 -0400 Subject: [Numpy-discussion] ANN: NumPy 1.8.0 release. References: Message-ID: Thanks for the release! I am having a hard time finding the build instructions. Could you please add this to the announcement? From charlesr.harris at gmail.com Thu Oct 31 09:10:05 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Thu, 31 Oct 2013 07:10:05 -0600 Subject: [Numpy-discussion] ANN: NumPy 1.8.0 release. In-Reply-To: References: Message-ID: On Thu, Oct 31, 2013 at 6:58 AM, Neal Becker wrote: > Thanks for the release! > > I am having a hard time finding the build instructions. Could you please > add > this to the announcement? > What sort of build instructions are you looking for? Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From ndbecker2 at gmail.com Thu Oct 31 09:13:41 2013 From: ndbecker2 at gmail.com (Neal Becker) Date: Thu, 31 Oct 2013 09:13:41 -0400 Subject: [Numpy-discussion] ANN: NumPy 1.8.0 release. References: Message-ID: Charles R Harris wrote: > On Thu, Oct 31, 2013 at 6:58 AM, Neal Becker wrote: > >> Thanks for the release! >> >> I am having a hard time finding the build instructions. Could you please >> add >> this to the announcement? >> > > What sort of build instructions are you looking for? > > Chuck How to build from source, what are some settings for site.cfg. I did get this figured out (wanted to try out openblas), but it could be a small barrier to new users. From charlesr.harris at gmail.com Thu Oct 31 09:26:55 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Thu, 31 Oct 2013 07:26:55 -0600 Subject: [Numpy-discussion] ANN: NumPy 1.8.0 release. In-Reply-To: References: Message-ID: On Thu, Oct 31, 2013 at 7:13 AM, Neal Becker wrote: > Charles R Harris wrote: > > > On Thu, Oct 31, 2013 at 6:58 AM, Neal Becker > wrote: > > > >> Thanks for the release! > >> > >> I am having a hard time finding the build instructions. Could you > please > >> add > >> this to the announcement? > >> > > > > What sort of build instructions are you looking for? > > > > Chuck > > How to build from source, what are some settings for site.cfg. I did get > this > figured out (wanted to try out openblas), but it could be a small barrier > to > new users. > > That should be explained in INSTALL.txt. It may be a bit outdated at this point. If so, could you make a PR adding relevant bits from your experience. Chuck -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From chris.barker at noaa.gov Thu Oct 31 11:52:46 2013 From: chris.barker at noaa.gov (Chris Barker) Date: Thu, 31 Oct 2013 08:52:46 -0700 Subject: [Numpy-discussion] memmory management question In-Reply-To: <526E65D9.4090505@crans.org> References: <526DC82F.3070606@gmail.com> <526E5A7F.6030703@crans.org> <526E65D9.4090505@crans.org> Message-ID: On Mon, Oct 28, 2013 at 6:25 AM, Pierre Haessig wrote: > > a.base > > In this document, it is mentionned several time that slicing yields > "views of the original data", but the .base attribute is not mentionned. > Should it be or is it out-of-scope of the Indexing guide ? > Indeed, that is not the least bit "basic" numpy usage -- the "basic way to do what you want is to keep the original array around in the first place -- i.e. give the slice a different name. -Chris -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov -------------- next part -------------- An HTML attachment was scrubbed... URL: From irving at naml.us Thu Oct 31 12:19:41 2013 From: irving at naml.us (Geoffrey Irving) Date: Thu, 31 Oct 2013 09:19:41 -0700 Subject: [Numpy-discussion] char with native integer signedness In-Reply-To: References: Message-ID: On Thu, Oct 31, 2013 at 2:08 AM, Robert Kern wrote: > On Thu, Oct 31, 2013 at 12:52 AM, Geoffrey Irving wrote: >> >> Is there a standard way in numpy of getting a char with C-native >> integer signedness? I.e., >> >> boost::is_signed::value ? numpy.byte : numpy.ubyte >> >> but without nonsensical mixing of languages? > > This is for interop with a C/C++ extension, right? Do this test in that > extension's C/C++ code to expose the right dtype. As far as I know, this is > not something determined by the hardware, but the compiler used. Since the > compiler of numpy may be different from your extension, only your extension > can do that test properly. It's not determined by the hardware, but I believe it is standardized by each platform's ABI even if it can be adjusted by the compiler. >From the gcc man page: -funsigned-char Let the type "char" be unsigned, like "unsigned char". Each kind of machine has a default for what "char" should be. It is either like "unsigned char" by default or like "signed char" by default. Ideally, a portable program should always use "signed char" or "unsigned char" when it depends on the signedness of an object. But many programs have been written to use plain "char" and expect it to be signed, or expect it to be unsigned, depending on the machines they were written for. This option, and its inverse, let you make such a program work with the opposite default. The type "char" is always a distinct type from each of "signed char" or "unsigned char", even though its behavior is always just like one of those two. Geoffrey From robert.kern at gmail.com Thu Oct 31 12:31:48 2013 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 31 Oct 2013 16:31:48 +0000 Subject: [Numpy-discussion] char with native integer signedness In-Reply-To: References: Message-ID: On Thu, Oct 31, 2013 at 4:19 PM, Geoffrey Irving wrote: > > On Thu, Oct 31, 2013 at 2:08 AM, Robert Kern wrote: > > On Thu, Oct 31, 2013 at 12:52 AM, Geoffrey Irving wrote: > >> > >> Is there a standard way in numpy of getting a char with C-native > >> integer signedness? I.e., > >> > >> boost::is_signed::value ? 
numpy.byte : numpy.ubyte > >> > >> but without nonsensical mixing of languages? > > > > This is for interop with a C/C++ extension, right? Do this test in that > > extension's C/C++ code to expose the right dtype. As far as I know, this is > > not something determined by the hardware, but the compiler used. Since the > > compiler of numpy may be different from your extension, only your extension > > can do that test properly. > > It's not determined by the hardware, but I believe it is standardized > by each platform's ABI even if it can be adjusted by the compiler. > >From the gcc man page: > > -funsigned-char > Let the type "char" be unsigned, like "unsigned char". Fair enough. numpy doesn't distinguish between these cases as it only uses plain 'char' for 'S' arrays, which don't really care about the numerical value assigned the bits. It explicitly uses 'signed char' elsewhere, so this platform setting isn't relevant to it. Consequently, numpy also doesn't expose this platform setting. I think I stand by my recommendation. -- Robert Kern -------------- next part -------------- An HTML attachment was scrubbed... URL: From dineshbvadhia at hotmail.com Thu Oct 31 15:28:59 2013 From: dineshbvadhia at hotmail.com (Dinesh Vadhia) Date: Thu, 31 Oct 2013 12:28:59 -0700 Subject: [Numpy-discussion] ANN: NumPy 1.8.0 release. In-Reply-To: References: Message-ID: Use site.cfg.example as template to create a new site.cfg. For openblas, uncomment: [openblas] library_dirs = /opt/OpenBLAS/lib include_dirs = /opt/OpenBLAS/include Also, uncomment default section: [DEFAULT] library_dirs = /usr/local/lib include_dirs = /usr/local/include That should do it - hopefully. On Thu, Oct 31, 2013 at 7:13 AM, Neal Becker wrote: Charles R Harris wrote: > On Thu, Oct 31, 2013 at 6:58 AM, Neal Becker wrote: > >> Thanks for the release! >> >> I am having a hard time finding the build instructions. Could you please >> add >> this to the announcement? >> > > What sort of build instructions are you looking for? > > Chuck How to build from source, what are some settings for site.cfg. I did get this figured out (wanted to try out openblas), but it could be a small barrier to new users. That should be explained in INSTALL.txt. It may be a bit outdated at this point. If so, could you make a PR adding relevant bits from your experience. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL:
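A quick way to double-check that a rebuilt numpy picked up the site.cfg entries is to ask it what it was built against (illustrative; the exact section names printed depend on what the build found):

    >>> import numpy as np
    >>> np.show_config()   # look for an openblas/lapack section listing the library_dirs given above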
From ndbecker2 at gmail.com Thu Oct 31 19:22:32 2013 From: ndbecker2 at gmail.com (Neal Becker) Date: Thu, 31 Oct 2013 19:22:32 -0400 Subject: [Numpy-discussion] strange behavior of += with object array Message-ID: import numpy as np #from accumulator import stat2nd_double ## Just to make this really clear, I'm making a dummy ## class here that overloads += class stat2nd_double (object): def __iadd__ (self, x): return self m = np.empty ((2,3), dtype=object) m[:,:] = stat2nd_double() m[0,0] += 1.0 <<<< no error here m += np.ones ((2,3)) <<< but this gives an error Traceback (most recent call last): File "test_stat.py", line 13, in m += np.ones ((2,3)) TypeError: unsupported operand type(s) for +: 'stat2nd_double' and 'float'
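For reference, a minimal sketch of the likely cause (an illustration, not part of the original message): ``m[0,0] += 1.0`` is handled by plain Python, which extracts the object and calls its ``__iadd__``, whereas ``m += np.ones((2,3))`` goes through the ``np.add`` ufunc, which evaluates ``element + value`` for each pair and stores the results back, so the element class also needs ``__add__``. Adding ``__add__`` to the dummy class (the only change from the code above) makes both statements run:

    import numpy as np

    class stat2nd_double(object):
        def __iadd__(self, x):      # used when Python handles m[0, 0] += 1.0
            return self
        def __add__(self, x):       # used per element by np.add for m += array
            return self

    m = np.empty((2, 3), dtype=object)
    m[:, :] = stat2nd_double()
    m[0, 0] += 1.0          # works, as before
    m += np.ones((2, 3))    # also works once __add__ is defined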