From chekir.amira at gmail.com Wed Jan 1 10:45:09 2014 From: chekir.amira at gmail.com (Amira Chekir) Date: Wed, 1 Jan 2014 16:45:09 +0100 Subject: [Numpy-discussion] NumPy-Discussion Digest, Vol 87, Issue 35 In-Reply-To: References: Message-ID: Hi, Thanks for your answer. I use ubuntu 12.04 32 bits and python 2.7 I upgrade numpy to 1.8, but the error persists I think that the problem is in gzip.py : max_read_chunk = 10 * 1024 * 1024 # 10Mb What do you think? Best regards, AMIRA 2013/12/31 > Send NumPy-Discussion mailing list submissions to > numpy-discussion at scipy.org > > To subscribe or unsubscribe via the World Wide Web, visit > http://mail.scipy.org/mailman/listinfo/numpy-discussion > or, via email, send a message with subject or body 'help' to > numpy-discussion-request at scipy.org > > You can reach the person managing the list at > numpy-discussion-owner at scipy.org > > When replying, please edit your Subject line so it is more specific > than "Re: Contents of NumPy-Discussion digest..." > > > Today's Topics: > > 1. Loading large NIfTI file -> MemoryError (Amira Chekir) > 2. Re: Loading large NIfTI file -> MemoryError (Julian Taylor) > 3. Re: proposal: min, max of complex should give warning (Cera, > Tim) > 4. Re: proposal: min, max of complex should give warning > (Neal Becker) > 5. Re: proposal: min, max of complex should give warning > (Ralf Gommers) > 6. Re: proposal: min, max of complex should give warning > (Neal Becker) > 7. ANN: NumPy 1.7.2 release (Julian Taylor) > 8. Re: ANN: NumPy 1.7.2 release (Charles R Harris) > > > ---------------------------------------------------------------------- > > Message: 1 > Date: Tue, 31 Dec 2013 14:13:57 +0100 > From: Amira Chekir > Subject: [Numpy-discussion] Loading large NIfTI file -> MemoryError > To: numpy-discussion at scipy.org > Message-ID: > EQ29Zw at mail.gmail.com> > Content-Type: text/plain; charset="iso-8859-1" > > Hello together, > > I try to load a (large) NIfTI file (DMRI from Human Connectome Project, > about 1 GB) with NiBabel. > > import nibabel as nib > img = nib.load("dmri.nii.gz") > data = img.get_data() > > The program crashes during "img.get_data()" with an "MemoryError" (having 4 > GB of RAM in my machine). > > Any suggestions? > > Best regards, > AMIRA > -------------- next part -------------- > An HTML attachment was scrubbed... > URL: > http://mail.scipy.org/pipermail/numpy-discussion/attachments/20131231/b13969b3/attachment-0001.html > > ------------------------------ > > Message: 2 > Date: Tue, 31 Dec 2013 14:29:42 +0100 > From: Julian Taylor > Subject: Re: [Numpy-discussion] Loading large NIfTI file -> > MemoryError > To: Discussion of Numerical Python > Message-ID: <52C2C6C6.6070002 at googlemail.com> > Content-Type: text/plain; charset=ISO-8859-1 > > On 31.12.2013 14:13, Amira Chekir wrote: > > Hello together, > > > > I try to load a (large) NIfTI file (DMRI from Human Connectome Project, > > about 1 GB) with NiBabel. > > > > import nibabel as nib > > img = nib.load("dmri.nii.gz") > > data = img.get_data() > > > > The program crashes during "img.get_data()" with an "MemoryError" > > (having 4 GB of RAM in my machine). > > > > Any suggestions? > > are you using a 64 bit operating system? > which version of numpy? > > assuming nibabel uses np.load under the hood you could try it with numpy > 1.8 which reduces excess memory usage when loading compressed files. 
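A quick way to sanity-check the MemoryError discussed above is to confirm whether the Python build is 32- or 64-bit and to estimate how much memory the decompressed volume needs once it is scaled to float64 (as described later in this archive). The sketch below is only illustrative: the shape, the number of volumes and the on-disk dtype are made-up placeholders, not values read from the actual Human Connectome file.

import sys
import platform
import numpy as np

# A 32-bit process can address only a few GB no matter how much RAM is
# installed, and scaling the raw voxels to float64 multiplies the footprint.
print("Python build:", platform.architecture()[0])
print("64-bit pointers:", sys.maxsize > 2**32)

# Hypothetical diffusion data: 145 x 174 x 145 voxels, 288 volumes, int16 on disk.
n_items = np.prod((145, 174, 145), dtype=np.int64) * 288
raw_gb = n_items * np.dtype(np.int16).itemsize / 1e9
float_gb = n_items * np.dtype(np.float64).itemsize / 1e9
print("raw: %.2f GB, float64 copy: %.2f GB" % (raw_gb, float_gb))
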
> > > ------------------------------ > > Message: 3 > Date: Tue, 31 Dec 2013 08:51:52 -0500 > From: "Cera, Tim" > Subject: Re: [Numpy-discussion] proposal: min, max of complex should > give warning > To: Discussion of Numerical Python > Message-ID: > < > CAO5s+D_m5N6SJgsKoV7O-+yHh5gPnB0_a-ozKgETGRwTgN_axg at mail.gmail.com> > Content-Type: text/plain; charset=ISO-8859-1 > > I don't work with complex numbers, but just sampling what others do: > > > Python: no ordering, results in TypeError > > Matlab: sorts by magnitude > http://www.mathworks.com/help/matlab/ref/sort.html > > R: sorts first by real, then by imaginary > http://stat.ethz.ch/R-manual/R-patched/library/base/html/sort.html > > Numpy: sorts first by real, then by imaginary (the documentation link > below calls this sort 'lexicographical' which I don't think is > correct) > http://docs.scipy.org/doc/numpy/reference/generated/numpy.sort.html > > > I would think that the Matlab sort might be more useful, but easy > enough by using the absolute value. > > I think what Numpy does is normal enough to not justify a warning, but > leave this to others because as I pointed out in the beginning I don't > work with complex numbers. > > Kindest regards, > Tim > > > ------------------------------ > > Message: 4 > Date: Tue, 31 Dec 2013 10:52:47 -0500 > From: Neal Becker > Subject: Re: [Numpy-discussion] proposal: min, max of complex should > give warning > To: numpy-discussion at scipy.org > Message-ID: > Content-Type: text/plain; charset="ISO-8859-1" > > Cera, Tim wrote: > > > I don't work with complex numbers, but just sampling what others do: > > > > > > Python: no ordering, results in TypeError > > > > Matlab: sorts by magnitude > > http://www.mathworks.com/help/matlab/ref/sort.html > > > > R: sorts first by real, then by imaginary > > http://stat.ethz.ch/R-manual/R-patched/library/base/html/sort.html > > > > Numpy: sorts first by real, then by imaginary (the documentation link > > below calls this sort 'lexicographical' which I don't think is > > correct) > > http://docs.scipy.org/doc/numpy/reference/generated/numpy.sort.html > > > > > > I would think that the Matlab sort might be more useful, but easy > > enough by using the absolute value. > > > > I think what Numpy does is normal enough to not justify a warning, but > > leave this to others because as I pointed out in the beginning I don't > > work with complex numbers. > > > > Kindest regards, > > Tim > > But I'm not proposing to change numpy's result, which I'm sure would raise > many > objections. I'm just asking to give a warning, because I think in most > cases > this is actually a mistake on the user's part. Just like the warning > currently > given when complex data are truncated to real part. 
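The snippet below is only an illustration of the behaviour being debated in this thread, with arbitrary sample values: np.max and np.sort compare complex numbers by real part first and imaginary part second, the magnitude-based (Matlab-style) choice needs an explicit np.abs, and casting complex data to real already raises the ComplexWarning that Neal mentions.

import warnings
import numpy as np

z = np.array([1 - 5j, 1 + 2j, -3 + 10j])

# Ordering is by real part, then imaginary part:
print(np.max(z))              # (1+2j), although -3+10j has the largest magnitude
print(np.sort(z))             # [-3.+10.j   1.-5.j   1.+2.j]

# Magnitude-based selection, as Matlab's sort would rank them:
print(z[np.abs(z).argmax()])  # (-3+10j)

# The existing warning for truncating complex values to their real part:
with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    z.astype(np.float64)
print([w.category.__name__ for w in caught])   # ['ComplexWarning']
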
> > > > ------------------------------ > > Message: 5 > Date: Tue, 31 Dec 2013 17:24:05 +0100 > From: Ralf Gommers > Subject: Re: [Numpy-discussion] proposal: min, max of complex should > give warning > To: Discussion of Numerical Python > Message-ID: > < > CABL7CQh9Fc0Uh36W9p16mzAR-oYjJ7_k7rU_Dwq+eZND6YrbDA at mail.gmail.com> > Content-Type: text/plain; charset="iso-8859-1" > > On Tue, Dec 31, 2013 at 4:52 PM, Neal Becker wrote: > > > Cera, Tim wrote: > > > > > I don't work with complex numbers, but just sampling what others do: > > > > > > > > > Python: no ordering, results in TypeError > > > > > > Matlab: sorts by magnitude > > > http://www.mathworks.com/help/matlab/ref/sort.html > > > > > > R: sorts first by real, then by imaginary > > > http://stat.ethz.ch/R-manual/R-patched/library/base/html/sort.html > > > > > > Numpy: sorts first by real, then by imaginary (the documentation link > > > below calls this sort 'lexicographical' which I don't think is > > > correct) > > > http://docs.scipy.org/doc/numpy/reference/generated/numpy.sort.html > > > > > > > > > I would think that the Matlab sort might be more useful, but easy > > > enough by using the absolute value. > > > > > > I think what Numpy does is normal enough to not justify a warning, but > > > leave this to others because as I pointed out in the beginning I don't > > > work with complex numbers. > > > > > > Kindest regards, > > > Tim > > > > But I'm not proposing to change numpy's result, which I'm sure would > raise > > many > > objections. I'm just asking to give a warning, because I think in most > > cases > > this is actually a mistake on the user's part. Just like the warning > > currently > > given when complex data are truncated to real part. > > > > Keep in mind that warnings can be highly annoying. If you're a user who > uses this functionality regularly (and you know what you're doing), then > you're going to be very unhappy to have to wrap each function call in: > olderr = np.seterr(all='ignore') > max(...) > np.seterr(**olderr) > or in: > with warnings.catch_warnings(): > warnings.filterwarnings('ignore', ...) > max(...) > > The actual behavior isn't documented now it looks like, so that should be > done. In the Notes section of max/min probably. > > As for your proposal, it would be good to know if adding a warning would > actually catch any bugs. For the truncation warning it caught several in > scipy and other libs IIRC. > > Ralf > -------------- next part -------------- > An HTML attachment was scrubbed... 
> URL: > http://mail.scipy.org/pipermail/numpy-discussion/attachments/20131231/add729d8/attachment-0001.html > > ------------------------------ > > Message: 6 > Date: Tue, 31 Dec 2013 11:45:08 -0500 > From: Neal Becker > Subject: Re: [Numpy-discussion] proposal: min, max of complex should > give warning > To: numpy-discussion at scipy.org > Message-ID: > Content-Type: text/plain; charset="ISO-8859-1" > > Ralf Gommers wrote: > > > On Tue, Dec 31, 2013 at 4:52 PM, Neal Becker > wrote: > > > >> Cera, Tim wrote: > >> > >> > I don't work with complex numbers, but just sampling what others do: > >> > > >> > > >> > Python: no ordering, results in TypeError > >> > > >> > Matlab: sorts by magnitude > >> > http://www.mathworks.com/help/matlab/ref/sort.html > >> > > >> > R: sorts first by real, then by imaginary > >> > http://stat.ethz.ch/R-manual/R-patched/library/base/html/sort.html > >> > > >> > Numpy: sorts first by real, then by imaginary (the documentation link > >> > below calls this sort 'lexicographical' which I don't think is > >> > correct) > >> > http://docs.scipy.org/doc/numpy/reference/generated/numpy.sort.html > >> > > >> > > >> > I would think that the Matlab sort might be more useful, but easy > >> > enough by using the absolute value. > >> > > >> > I think what Numpy does is normal enough to not justify a warning, but > >> > leave this to others because as I pointed out in the beginning I don't > >> > work with complex numbers. > >> > > >> > Kindest regards, > >> > Tim > >> > >> But I'm not proposing to change numpy's result, which I'm sure would > raise > >> many > >> objections. I'm just asking to give a warning, because I think in most > >> cases > >> this is actually a mistake on the user's part. Just like the warning > >> currently > >> given when complex data are truncated to real part. > >> > > > > Keep in mind that warnings can be highly annoying. If you're a user who > > uses this functionality regularly (and you know what you're doing), then > > you're going to be very unhappy to have to wrap each function call in: > > olderr = np.seterr(all='ignore') > > max(...) > > np.seterr(**olderr) > > or in: > > with warnings.catch_warnings(): > > warnings.filterwarnings('ignore', ...) > > max(...) > > > > The actual behavior isn't documented now it looks like, so that should be > > done. In the Notes section of max/min probably. > > > > As for your proposal, it would be good to know if adding a warning would > > actually catch any bugs. For the truncation warning it caught several in > > scipy and other libs IIRC. > > > > Ralf > > I tripped over it yesterday, which is what prompted my suggestion. > > > > ------------------------------ > > Message: 7 > Date: Tue, 31 Dec 2013 17:57:18 +0100 > From: Julian Taylor > Subject: [Numpy-discussion] ANN: NumPy 1.7.2 release > To: Discussion of Numerical Python , > SciPy Users List , SciPy Developers > List > > Message-ID: <52C2F76E.9010509 at googlemail.com> > Content-Type: text/plain; charset=ISO-8859-1 > > Hello, > > I'm happy to announce the of Numpy 1.7.2. > This is a bugfix only release supporting Python 2.4 - 2.7 and 3.1 - 3.3. > > More than 42 issues were fixed, the most important issues are listed in > the release notes: > https://github.com/numpy/numpy/blob/v1.7.2/doc/release/1.7.2-notes.rst > > Compared to the last release candidate four additional minor issues have > been fixed and compatibility with python 3.4b1 improved. 
> > Source tarballs, installers and release notes can be found at > https://sourceforge.net/projects/numpy/files/NumPy/1.7.2 > > Cheers, > Julian Taylor > > > ------------------------------ > > Message: 8 > Date: Tue, 31 Dec 2013 10:47:44 -0700 > From: Charles R Harris > Subject: Re: [Numpy-discussion] ANN: NumPy 1.7.2 release > To: Discussion of Numerical Python > Message-ID: > abrqm4DNRG7f6-1keU_hPd253O64d0-Yhw at mail.gmail.com> > Content-Type: text/plain; charset="iso-8859-1" > > On Tue, Dec 31, 2013 at 9:57 AM, Julian Taylor < > jtaylor.debian at googlemail.com> wrote: > > > Hello, > > > > I'm happy to announce the of Numpy 1.7.2. > > This is a bugfix only release supporting Python 2.4 - 2.7 and 3.1 - 3.3. > > > > More than 42 issues were fixed, the most important issues are listed in > > the release notes: > > https://github.com/numpy/numpy/blob/v1.7.2/doc/release/1.7.2-notes.rst > > > > Compared to the last release candidate four additional minor issues have > > been fixed and compatibility with python 3.4b1 improved. > > > > Source tarballs, installers and release notes can be found at > > https://sourceforge.net/projects/numpy/files/NumPy/1.7.2 > > > > > Congrats on the release. > > Chuck > -------------- next part -------------- > An HTML attachment was scrubbed... > URL: > http://mail.scipy.org/pipermail/numpy-discussion/attachments/20131231/946abcb9/attachment.html > > ------------------------------ > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > > End of NumPy-Discussion Digest, Vol 87, Issue 35 > ************************************************ > -------------- next part -------------- An HTML attachment was scrubbed... URL: From chekir.amira at gmail.com Wed Jan 1 10:50:11 2014 From: chekir.amira at gmail.com (Amira Chekir) Date: Wed, 1 Jan 2014 16:50:11 +0100 Subject: [Numpy-discussion] NumPy-Discussion Digest, Vol 88, Issue 1 In-Reply-To: References: Message-ID: On 31.12.2013 14:13, Amira Chekir wrote: > > Hello together, > > > > I try to load a (large) NIfTI file (DMRI from Human Connectome Project, > > about 1 GB) with NiBabel. > > > > import nibabel as nib > > img = nib.load("dmri.nii.gz") > > data = img.get_data() > > > > The program crashes during "img.get_data()" with an "MemoryError" > > (having 4 GB of RAM in my machine). > > > > Any suggestions? > > are you using a 64 bit operating system? > which version of numpy? > > assuming nibabel uses np.load under the hood you could try it with numpy > 1.8 which reduces excess memory usage when loading compressed files. Hi, Thanks for your answer. I use ubuntu 12.04 32 bits and python 2.7 I upgrade numpy to 1.8, but the error persists I think that the problem is in gzip.py : max_read_chunk = 10 * 1024 * 1024 # 10Mb What do you think? Best regards, AMIRA 2014/1/1 > Send NumPy-Discussion mailing list submissions to > numpy-discussion at scipy.org > > To subscribe or unsubscribe via the World Wide Web, visit > http://mail.scipy.org/mailman/listinfo/numpy-discussion > or, via email, send a message with subject or body 'help' to > numpy-discussion-request at scipy.org > > You can reach the person managing the list at > numpy-discussion-owner at scipy.org > > When replying, please edit your Subject line so it is more specific > than "Re: Contents of NumPy-Discussion digest..." > > > Today's Topics: > > 1. Re: proposal: min, max of complex should give warning (Ralf > Gommers) (David Goldsmith) > 2. 
Re: NumPy-Discussion Digest, Vol 87, Issue 35 (Amira Chekir) > > > ---------------------------------------------------------------------- > > Message: 1 > Date: Tue, 31 Dec 2013 11:43:49 -0800 > From: David Goldsmith > Subject: Re: [Numpy-discussion] proposal: min, max of complex should > give warning (Ralf Gommers) > To: numpy-discussion at scipy.org > Message-ID: > rWU6EVuBMG+mY-XJdA at mail.gmail.com> > Content-Type: text/plain; charset="iso-8859-1" > > > > > As for your proposal, it would be good to know if adding a warning would > > actually catch any bugs. For the truncation warning it caught several in > > scipy and other libs IIRC. > > > > Ralf > > > > In light of this, perhaps the pertinent unit tests should be modified (even > if the warning suggestion isn't adopted, about which I'm neutral...but I'm > a little surprised that there isn't a generic way to globally turn off > specific warnings). > > DG > -------------- next part -------------- > An HTML attachment was scrubbed... > URL: > http://mail.scipy.org/pipermail/numpy-discussion/attachments/20131231/ac17f43e/attachment-0001.html > > ------------------------------ > > Message: 2 > Date: Wed, 1 Jan 2014 16:45:09 +0100 > From: Amira Chekir > Subject: Re: [Numpy-discussion] NumPy-Discussion Digest, Vol 87, Issue > 35 > To: numpy-discussion at scipy.org > Message-ID: > < > CAB-foYhZMYH+asXUC_SnO6bjCDSOji+d8J6tyqSvucNOv_dyiQ at mail.gmail.com> > Content-Type: text/plain; charset="iso-8859-1" > > Hi, > Thanks for your answer. > I use ubuntu 12.04 32 bits and python 2.7 > I upgrade numpy to 1.8, but the error persists > I think that the problem is in gzip.py : > max_read_chunk = 10 * 1024 * 1024 # 10Mb > What do you think? > > Best regards, > AMIRA > > > 2013/12/31 > > > Send NumPy-Discussion mailing list submissions to > > numpy-discussion at scipy.org > > > > To subscribe or unsubscribe via the World Wide Web, visit > > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > or, via email, send a message with subject or body 'help' to > > numpy-discussion-request at scipy.org > > > > You can reach the person managing the list at > > numpy-discussion-owner at scipy.org > > > > When replying, please edit your Subject line so it is more specific > > than "Re: Contents of NumPy-Discussion digest..." > > > > > > Today's Topics: > > > > 1. Loading large NIfTI file -> MemoryError (Amira Chekir) > > 2. Re: Loading large NIfTI file -> MemoryError (Julian Taylor) > > 3. Re: proposal: min, max of complex should give warning (Cera, > > Tim) > > 4. Re: proposal: min, max of complex should give warning > > (Neal Becker) > > 5. Re: proposal: min, max of complex should give warning > > (Ralf Gommers) > > 6. Re: proposal: min, max of complex should give warning > > (Neal Becker) > > 7. ANN: NumPy 1.7.2 release (Julian Taylor) > > 8. Re: ANN: NumPy 1.7.2 release (Charles R Harris) > > > > > > ---------------------------------------------------------------------- > > > > Message: 1 > > Date: Tue, 31 Dec 2013 14:13:57 +0100 > > From: Amira Chekir > > Subject: [Numpy-discussion] Loading large NIfTI file -> MemoryError > > To: numpy-discussion at scipy.org > > Message-ID: > > > EQ29Zw at mail.gmail.com> > > Content-Type: text/plain; charset="iso-8859-1" > > > > Hello together, > > > > I try to load a (large) NIfTI file (DMRI from Human Connectome Project, > > about 1 GB) with NiBabel. 
> > [...]
> >
> > End of NumPy-Discussion Digest, Vol 87, Issue 35
> > ************************************************
> -------------- next part --------------
> An HTML attachment was scrubbed...
> URL: http://mail.scipy.org/pipermail/numpy-discussion/attachments/20140101/279def51/attachment.html
>
> ------------------------------
>
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion
>
>
> End of NumPy-Discussion Digest, Vol 88, Issue 1
> ***********************************************
-------------- next part --------------
An HTML attachment was scrubbed...
URL: From jtaylor.debian at googlemail.com Wed Jan 1 10:56:14 2014 From: jtaylor.debian at googlemail.com (Julian Taylor) Date: Wed, 01 Jan 2014 16:56:14 +0100 Subject: [Numpy-discussion] NumPy-Discussion Digest, Vol 88, Issue 1 In-Reply-To: References: Message-ID: <52C43A9E.7060403@googlemail.com> On 01.01.2014 16:50, Amira Chekir wrote: > On 31.12.2013 14:13, Amira Chekir wrote: >> > Hello together, >> > >> > I try to load a (large) NIfTI file (DMRI from Human Connectome Project, >> > about 1 GB) with NiBabel. >> > >> > import nibabel as nib >> > img = nib.load("dmri.nii.gz") >> > data = img.get_data() >> > >> > The program crashes during "img.get_data()" with an "MemoryError" >> > (having 4 GB of RAM in my machine). >> > >> > Any suggestions? >> >> are you using a 64 bit operating system? >> which version of numpy? >> >> assuming nibabel uses np.load under the hood you could try it with numpy >> 1.8 which reduces excess memory usage when loading compressed files. > > Hi, > Thanks for your answer. > I use ubuntu 12.04 32 bits and python 2.7 > I upgrade numpy to 1.8, but the error persists > I think that the problem is in gzip.py : > max_read_chunk = 10 * 1024 * 1024 # 10Mb > What do you think? > On a 32 bit system you can only use 2GB of ram (even if you have 4GB). A single copy of your data will already exhaust this and this can be hard to avoid with numpy. Use an 64 bit operating system with more RAM or somehow try to chunk your workload into smaller sizes. From bartbkr at gmail.com Wed Jan 1 14:56:44 2014 From: bartbkr at gmail.com (Bart Baker) Date: Wed, 1 Jan 2014 14:56:44 -0500 Subject: [Numpy-discussion] Altering/initializing NumPy array in C Message-ID: Hello, I'm having issues with performing operations on an array in C and passing it back to Python. The array values seem to become unitialized upon being passed back to Python. My first attempt involved initializing the array in C as so: double a_fin[max_mth]; where max_mth is an int. I fill in the values of a_fin and then, before returning back to Python, I create a NumPy array in C and fill it in using a pointer to the a_fin array: npy_intp a_dims[2] = {max_mth, 1}; a_fin_array = (PyArrayObject *) PyArray_SimpleNewFromData(2, a_dims, NPY_DOUBLE, a_fin); I update the flags as so: PyArray_UpdateFlags(a_fin_array, NPY_OWNDATA); and return using: PyObject *Result = Py_BuildValue("OO", a_fin_array, b_fin_array); Py_DECREF(a_fin_array); (there is another array, b_bin_array that I create in this way and it suffers from the same issues). Immediately upon returning to Python, all of a_fin_array appears unitilized. This only happens in certain situations and sometime only part of the arrary will be unitilized. I check the values of a_dim and a_fin_array in C using gdb and they appear as expected, but are over-written with unitialized values upon returning to Python. I've tried initializing in Python and then passing the NumPy array in instead of initializing in C, but the effects of the calculations in C are still not kept. My feeling is that, with the Numpy C-API, this should be a simple process, but I'm having a lot of trouble with it. Any help would be much appreciated. I didn't want to give too much information in the post, but please let me know what other information would be useful. 
-Bart From njs at pobox.com Wed Jan 1 15:04:52 2014 From: njs at pobox.com (Nathaniel Smith) Date: Wed, 1 Jan 2014 14:04:52 -0600 Subject: [Numpy-discussion] Altering/initializing NumPy array in C In-Reply-To: References: Message-ID: On 1 Jan 2014 13:57, "Bart Baker" wrote: > > Hello, > > I'm having issues with performing operations on an array in C and > passing it back to Python. The array values seem to become unitialized > upon being passed back to Python. My first attempt involved initializing > the array in C as so: > > double a_fin[max_mth]; > > where max_mth is an int. You're stack-allocating your array, so the memory is getting recycled for other uses as soon as your C function returns. You should malloc it instead (but you don't have to worry about free'ing it, numpy will do that when the array object is deconstructed). Any C reference will fill you in on the details of stack versus malloc allocation. -n -------------- next part -------------- An HTML attachment was scrubbed... URL: From bartbkr at gmail.com Thu Jan 2 08:49:11 2014 From: bartbkr at gmail.com (Bart Baker) Date: Thu, 2 Jan 2014 08:49:11 -0500 Subject: [Numpy-discussion] Altering/initializing NumPy array in C In-Reply-To: References: Message-ID: > You're stack-allocating your array, so the memory is getting recycled for > other uses as soon as your C function returns. You should malloc it instead > (but you don't have to worry about free'ing it, numpy will do that when the > array object is deconstructed). Any C reference will fill you in on the > details of stack versus malloc allocation. OK, that makes a lot of sense. I changed them to malloc's of the appropriate size and now things seems to be working well. It also led to a good read on stack vs heap. Thanks a lot, Bart From d.l.goldsmith at gmail.com Thu Jan 2 15:29:42 2014 From: d.l.goldsmith at gmail.com (David Goldsmith) Date: Thu, 2 Jan 2014 12:29:42 -0800 Subject: [Numpy-discussion] Quaternion type @ rosettacode.org Message-ID: Anyone here use/have an opinion about the Quaternion type @ rosettacode.org? Or have an opinion about it having derived the type from collections.namedtuple? Anyone have an open-source, numpy-based alternative? Ditto last question for Octonion and/or general n-basis Grassmann (exterior) and/or Clifford Algebras? (rosettacode appears to have none of these). Thanks! David Goldsmith -------------- next part -------------- An HTML attachment was scrubbed... URL: From scopatz at gmail.com Thu Jan 2 15:44:23 2014 From: scopatz at gmail.com (Anthony Scopatz) Date: Thu, 2 Jan 2014 12:44:23 -0800 Subject: [Numpy-discussion] Quaternion type @ rosettacode.org In-Reply-To: References: Message-ID: Hello David, There is a numpy-quarterion repo that has served me well in the past. I believe this came out of a SciPy 2011 sprint. See https://github.com/martinling/numpy_quaternion. I hope this helps. Be Well Anthony On Thu, Jan 2, 2014 at 12:29 PM, David Goldsmith wrote: > Anyone here use/have an opinion about the Quaternion type @ > rosettacode.org? > Or have an opinion about it having derived the type from > collections.namedtuple? Anyone have an open-source, numpy-based > alternative? Ditto last question for Octonion and/or general n-basis > Grassmann (exterior) and/or Clifford Algebras? (rosettacode appears to > have none of these). Thanks! 
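For readers who only need the arithmetic, a plain ndarray plus a Hamilton product already goes a long way; the sketch below is a minimal illustration of such a numpy-based approach and is not taken from rosettacode or from any of the libraries mentioned elsewhere in the thread.

import numpy as np

def quat_mul(p, q):
    # Hamilton product of quaternions stored as [w, x, y, z] arrays.
    w1, x1, y1, z1 = p
    w2, x2, y2, z2 = q
    return np.array([w1*w2 - x1*x2 - y1*y2 - z1*z2,
                     w1*x2 + x1*w2 + y1*z2 - z1*y2,
                     w1*y2 - x1*z2 + y1*w2 + z1*x2,
                     w1*z2 + x1*y2 - y1*x2 + z1*w2])

i = np.array([0., 1., 0., 0.])
j = np.array([0., 0., 1., 0.])
print(quat_mul(i, j))   # [0. 0. 0. 1.]   -> k
print(quat_mul(j, i))   # [ 0.  0.  0. -1.] -> -k, multiplication is non-commutative
print(quat_mul(i, i))   # [-1.  0.  0.  0.] -> -1
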
> > David Goldsmith > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From paul.leopardi at anu.edu.au Thu Jan 2 16:16:53 2014 From: paul.leopardi at anu.edu.au (Paul Leopardi) Date: Fri, 3 Jan 2014 08:16:53 +1100 Subject: [Numpy-discussion] Quaternion type @ rosettacode.org In-Reply-To: References: Message-ID: <5706837.Fip8dVJlit@linfinit> On Thu, 2 Jan 2014 12:29:42 David Goldsmith wrote: > Anyone here use/have an opinion about the Quaternion type @ > rosettacode.org tions#Python>? Or have an opinion about it having derived the type from > collections.namedtuple? Anyone have an open-source, numpy-based > alternative? Ditto last question for Octonion and/or general n-basis > Grassmann (exterior) and/or Clifford Algebras? (rosettacode appears to > have none of these). Thanks! Hi David, Not Numpy based, but: GluCat http://sourceforge.net/projects/glucat/ is an open source C++ library for calculations in Clifford algebras, based on the C++ Standard Library and Boost uBLAS. It also includes PyClical, a Python extension module coded in Cython. The PyClical tutorials and demos at http://sourceforge.net/p/glucat/git/ci/master/tree/pyclical/demos/ should give you an idea of how PyClical can be used with Numpy, SciPy and the rest of Python. See also http://sourceforge.net/p/glucat/git/ci/master/tree/README If you have compilation problems, try the release_0_7_1-patches branch: http://sourceforge.net/p/glucat/git/ci/release_0_7_1-patches/ I am always open to feedback and criticism of this code. All the best, Paul -- Paul Leopardi http://www.maths.anu.edu.au/~leopardi -------------- next part -------------- An HTML attachment was scrubbed... URL: From matthew.brett at gmail.com Fri Jan 3 05:39:25 2014 From: matthew.brett at gmail.com (Matthew Brett) Date: Fri, 3 Jan 2014 10:39:25 +0000 Subject: [Numpy-discussion] Loading large NIfTI file -> MemoryError In-Reply-To: <52C2C6C6.6070002@googlemail.com> References: <52C2C6C6.6070002@googlemail.com> Message-ID: Hi, On Tue, Dec 31, 2013 at 1:29 PM, Julian Taylor wrote: > > On 31.12.2013 14:13, Amira Chekir wrote: > > Hello together, > > > > I try to load a (large) NIfTI file (DMRI from Human Connectome Project, > > about 1 GB) with NiBabel. > > > > import nibabel as nib > > img = nib.load("dmri.nii.gz") > > data = img.get_data() > > > > The program crashes during "img.get_data()" with an "MemoryError" > > (having 4 GB of RAM in my machine). > > > > Any suggestions? > > are you using a 64 bit operating system? > which version of numpy? I think you want the nipy-devel mailing list for this question : http://nipy.org/nibabel/ I'm guessing that the reader is loading the raw data which is - say - int16 - and then multiplying by the scale factors to make a float64 image, which is 4 times larger. 
We're working on an iterative load API at the moment that might help loading the image slice by slice : https://github.com/nipy/nibabel/pull/211 It should be merged in a week or so - but it would be very helpful if you would try out the proposal to see if it helps, Best, Matthew From freddie at witherden.org Fri Jan 3 07:58:49 2014 From: freddie at witherden.org (Freddie Witherden) Date: Fri, 03 Jan 2014 12:58:49 +0000 Subject: [Numpy-discussion] Padding An Array Along A Single Axis Message-ID: <52C6B409.4090005@witherden.org> Hi all, This should be an easy one but I can not come up with a good solution. Given an ndarray with a shape of (..., X) I wish to zero-pad it to have a shape of (..., X + K), presumably obtaining a new array in the process. My best solution this far is to use np.zeros(curr.shape[:-1] + (curr.shape[-1] + K,)) followed by an assignment. However, this seems needlessly cumbersome. I looked at np.pad but it does not seem to provide a means of just padding a single axis easily. Regards, Freddie. -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 836 bytes Desc: OpenPGP digital signature URL: From joferkington at gmail.com Fri Jan 3 09:02:03 2014 From: joferkington at gmail.com (Joe Kington) Date: Fri, 3 Jan 2014 08:02:03 -0600 Subject: [Numpy-discussion] Padding An Array Along A Single Axis In-Reply-To: <52C6B409.4090005@witherden.org> References: <52C6B409.4090005@witherden.org> Message-ID: You can use np.pad for this: In [1]: import numpy as np In [2]: x = np.ones((3, 3)) In [3]: np.pad(x, [(0, 0), (0, 1)], mode='constant') Out[3]: array([[ 1., 1., 1., 0.], [ 1., 1., 1., 0.], [ 1., 1., 1., 0.]]) Each item of the pad_width (second) argument is a tuple of before, after for each axis. I've only padded the end of the last axis, but if you wanted to pad both "sides" of it: In [4]: np.pad(x, [(0, 0), (1, 1)], mode='constant') Out[4]: array([[ 0., 1., 1., 1., 0.], [ 0., 1., 1., 1., 0.], [ 0., 1., 1., 1., 0.]]) Hope that helps, -Joe On Fri, Jan 3, 2014 at 6:58 AM, Freddie Witherden wrote: > Hi all, > > This should be an easy one but I can not come up with a good solution. > Given an ndarray with a shape of (..., X) I wish to zero-pad it to have > a shape of (..., X + K), presumably obtaining a new array in the process. > > My best solution this far is to use > > np.zeros(curr.shape[:-1] + (curr.shape[-1] + K,)) > > followed by an assignment. However, this seems needlessly cumbersome. > I looked at np.pad but it does not seem to provide a means of just > padding a single axis easily. > > Regards, Freddie. > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From d.l.goldsmith at gmail.com Sat Jan 4 01:00:33 2014 From: d.l.goldsmith at gmail.com (David Goldsmith) Date: Fri, 3 Jan 2014 22:00:33 -0800 Subject: [Numpy-discussion] Quaternion type @ rosettacode.org Message-ID: Thanks Anthony and Paul! OlyDLG -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From Nicolas.Rougier at inria.fr Sat Jan 4 03:50:04 2014 From: Nicolas.Rougier at inria.fr (Nicolas Rougier) Date: Sat, 4 Jan 2014 09:50:04 +0100 Subject: [Numpy-discussion] ArrayList object Message-ID: <6F4DCFB4-813D-4595-BB02-2AC2CA181A57@inria.fr> Hi all, I've coding an ArrayList object based on a regular numpy array. This objects allows to dynamically append/insert/delete/access items. I found it quite convenient since it allows to manipulate an array as if it was a list with elements of different sizes but with same underlying type (=array dtype). # Creation from a nested list L = ArrayList([ [0], [1,2], [3,4,5], [6,7,8,9] ]) # Creation from an array + common item size L = ArrayList(np.ones(1000), 3) # Empty list L = ArrayList(dype=int) # Creation from an array + individual item sizes L = ArrayList(np.ones(10), 1+np.arange(4)) # Access to elements: print L[0], L[1], L[2], L[3] [0] [1 2] [3 4 5] [6 7 8 9] # Operations on elements L[:2] += 1 print L.data [1 2 3 3 4 5 6 7 8 9] Source code is available from: https://github.com/rougier/array-list I wonder is there is any interest in having such object within core numpy (np.list ?) ? Nicolas From ralf.gommers at gmail.com Sat Jan 4 10:45:14 2014 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Sat, 4 Jan 2014 16:45:14 +0100 Subject: [Numpy-discussion] proposal: min, max of complex should give warning In-Reply-To: References: Message-ID: On Tue, Dec 31, 2013 at 5:45 PM, Neal Becker wrote: > Ralf Gommers wrote: > > > On Tue, Dec 31, 2013 at 4:52 PM, Neal Becker > wrote: > > > >> Cera, Tim wrote: > >> > >> > I don't work with complex numbers, but just sampling what others do: > >> > > >> > > >> > Python: no ordering, results in TypeError > >> > > >> > Matlab: sorts by magnitude > >> > http://www.mathworks.com/help/matlab/ref/sort.html > >> > > >> > R: sorts first by real, then by imaginary > >> > http://stat.ethz.ch/R-manual/R-patched/library/base/html/sort.html > >> > > >> > Numpy: sorts first by real, then by imaginary (the documentation link > >> > below calls this sort 'lexicographical' which I don't think is > >> > correct) > >> > http://docs.scipy.org/doc/numpy/reference/generated/numpy.sort.html > >> > > >> > > >> > I would think that the Matlab sort might be more useful, but easy > >> > enough by using the absolute value. > >> > > >> > I think what Numpy does is normal enough to not justify a warning, but > >> > leave this to others because as I pointed out in the beginning I don't > >> > work with complex numbers. > >> > > >> > Kindest regards, > >> > Tim > >> > >> But I'm not proposing to change numpy's result, which I'm sure would > raise > >> many > >> objections. I'm just asking to give a warning, because I think in most > >> cases > >> this is actually a mistake on the user's part. Just like the warning > >> currently > >> given when complex data are truncated to real part. > >> > > > > Keep in mind that warnings can be highly annoying. If you're a user who > > uses this functionality regularly (and you know what you're doing), then > > you're going to be very unhappy to have to wrap each function call in: > > olderr = np.seterr(all='ignore') > > max(...) > > np.seterr(**olderr) > > or in: > > with warnings.catch_warnings(): > > warnings.filterwarnings('ignore', ...) > > max(...) > > > > The actual behavior isn't documented now it looks like, so that should be > > done. In the Notes section of max/min probably. > > > > As for your proposal, it would be good to know if adding a warning would > > actually catch any bugs. 
For the truncation warning it caught several in > > scipy and other libs IIRC. > > > > Ralf > > I tripped over it yesterday, which is what prompted my suggestion. > That I had guessed. I meant: can you try to add this warning and then see if it catches any bugs or displays any incorrect warnings for scipy and some scikits? Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Sat Jan 4 14:14:37 2014 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Sat, 4 Jan 2014 20:14:37 +0100 Subject: [Numpy-discussion] C99 compatible complex number tests fail In-Reply-To: <52B7723B.7080900@gmail.com> References: <52B7723B.7080900@gmail.com> Message-ID: On Mon, Dec 23, 2013 at 12:14 AM, Matti Picus wrote: > Hi. I started to port the stdlib cmath C99 compatible complex number > tests to numpy, after noticing that numpy seems to have different > complex number routines than cmath. The work is available on a > "retest_complex" branch of numpy > https://github.com/mattip/numpy/tree/retest_complex > The tests can be run by pulling the branch (no need to rebuild numpy) > and running > > python /numpy/core/tests/test_umath_complex.py > > test.log 2>&1 > > So far it is just a couple of commits that run the tests on numpy, I > did not dive into modifying the math routines. If I did the work > correctly, failures point to some differences, most due to edge cases > with inf and nan, but there are a number of failures due to different > finite values (for some small definition of different). > I guess my first question is "did I do the tests properly". > They work fine, however you did it in a nonstandard way which makes the output hard to read. Some comments: - the assert_* functions expect "actual" as first input and "desired" next, while you have them reversed. - it would be good to split those tests into multiple cases, for example one per function to be tested. - you shouldn't print anything, just let it fail. If you want to see each individual failure, use generator tests. - the cmathtestcases.txt is a little nonstandard but should be OK to keep it like that. Assuming I did, the next question is "are the inconsistencies > intentional" i.e. are they that way in order to be compatible with > Matlab or some other non-C99 conformant library? > The implementation should conform to IEEE 754. > > For instance, a comparison between the implementation of cmath's sqrt > and numpy's sqrt shows that numpy does not check for subnormals. I suspect no handling for denormals was done on purpose, since that should have a significant performance penalty. I'm not sure about other differences, probably just following a different reference. And I am probably mistaken since I am new to the generator methods of numpy, > but could it be that trigonometric functions like acos and acosh are > generated in umath/funcs.inc.src, using a very different algorithm than > cmathmodule.c? > You're not mistaken. > Would there be interest in a pull request that changed the routines to > be more compatible with results from cmath? > I don't think compatibility with cmath should be a goal, but if you find differences where cmath has a more accurate or faster implementation, then a PR to adopt the cmath algorithm would be very welcome. Ralf -------------- next part -------------- An HTML attachment was scrubbed... 
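A lightweight way to explore the differences discussed here, without asserting which implementation is right, is simply to print numpy and cmath side by side on a few edge-case inputs; the values below are an arbitrary selection including infinities, NaN and a subnormal real part.

import cmath
import numpy as np

cases = [complex(-1.0, 0.0),
         complex(-1.0, -0.0),
         complex(float('inf'), 1.0),
         complex(float('nan'), 0.0),
         complex(1e-320, 0.0)]   # subnormal real part

for z in cases:
    print("%r: cmath=%r numpy=%r" % (z, cmath.sqrt(z), complex(np.sqrt(np.complex128(z)))))
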
URL: From ewm at redtetrahedron.org Sat Jan 4 20:39:03 2014 From: ewm at redtetrahedron.org (Eric Moore) Date: Sat, 4 Jan 2014 20:39:03 -0500 Subject: [Numpy-discussion] C99 compatible complex number tests fail In-Reply-To: References: <52B7723B.7080900@gmail.com> Message-ID: On Saturday, January 4, 2014, Ralf Gommers wrote: > > > > On Mon, Dec 23, 2013 at 12:14 AM, Matti Picus > > wrote: > >> Hi. I started to port the stdlib cmath C99 compatible complex number >> tests to numpy, after noticing that numpy seems to have different >> complex number routines than cmath. The work is available on a >> "retest_complex" branch of numpy >> https://github.com/mattip/numpy/tree/retest_complex >> The tests can be run by pulling the branch (no need to rebuild numpy) >> and running >> >> python /numpy/core/tests/test_umath_complex.py > >> test.log 2>&1 >> >> So far it is just a couple of commits that run the tests on numpy, I >> did not dive into modifying the math routines. If I did the work >> correctly, failures point to some differences, most due to edge cases >> with inf and nan, but there are a number of failures due to different >> finite values (for some small definition of different). >> I guess my first question is "did I do the tests properly". >> > > They work fine, however you did it in a nonstandard way which makes the > output hard to read. Some comments: > - the assert_* functions expect "actual" as first input and "desired" > next, while you have them reversed. > - it would be good to split those tests into multiple cases, for example > one per function to be tested. > - you shouldn't print anything, just let it fail. If you want to see each > individual failure, use generator tests. > - the cmathtestcases.txt is a little nonstandard but should be OK to keep > it like that. > > Assuming I did, the next question is "are the inconsistencies >> intentional" i.e. are they that way in order to be compatible with >> Matlab or some other non-C99 conformant library? >> > > The implementation should conform to IEEE 754. > >> >> For instance, a comparison between the implementation of cmath's sqrt >> and numpy's sqrt shows that numpy does not check for subnormals. > > > I suspect no handling for denormals was done on purpose, since that should > have a significant performance penalty. I'm not sure about other > differences, probably just following a different reference. > > And I am probably mistaken since I am new to the generator methods of >> numpy, >> but could it be that trigonometric functions like acos and acosh are >> generated in umath/funcs.inc.src, using a very different algorithm than >> cmathmodule.c? >> > > You're not mistaken. > > >> Would there be interest in a pull request that changed the routines to >> be more compatible with results from cmath? >> > > I don't think compatibility with cmath should be a goal, but if you find > differences where cmath has a more accurate or faster implementation, then > a PR to adopt the cmath algorithm would be very welcome. > > Ralf > Have you seen https://github.com/numpy/numpy/pull/3010 ? This adds C99 compatible complex functions and tests with build time checking if the system provided functions can pass our tests. I should have some time to get back to it soon, but somemore eyes and tests and input would be good. Especially since it's not clear to me if all of the changes will be accepted. Eric -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From charlesr.harris at gmail.com Tue Jan 7 13:59:59 2014 From: charlesr.harris at gmail.com (Charles R Harris) Date: Tue, 7 Jan 2014 11:59:59 -0700 Subject: [Numpy-discussion] LLVM Message-ID: Has anyone tried using LLVM with Visual Studio? It is supposed to work with Visual Studio >= 2010 and might provide an alternative to MinGw64. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From cournape at gmail.com Tue Jan 7 16:49:45 2014 From: cournape at gmail.com (David Cournapeau) Date: Tue, 7 Jan 2014 21:49:45 +0000 Subject: [Numpy-discussion] LLVM In-Reply-To: References: Message-ID: On Tue, Jan 7, 2014 at 6:59 PM, Charles R Harris wrote: > Has anyone tried using LLVM with Visual Studio? It is supposed to work > with Visual Studio >= 2010 and might provide an alternative to MinGw64. > Yes, I have. It is still pretty painful to use on windows beyond simple examples, though I have not tried the new 3.4 version. See also that discussion I had with one clang dev @ apple a couple of months ago: https://twitter.com/cournape/status/381038514076655618 David > > Chuck > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From jtaylor.debian at googlemail.com Wed Jan 8 13:13:20 2014 From: jtaylor.debian at googlemail.com (Julian Taylor) Date: Wed, 08 Jan 2014 19:13:20 +0100 Subject: [Numpy-discussion] Speedup by avoiding memory alloc twice in scalar array In-Reply-To: References: Message-ID: <52CD9540.3000802@googlemail.com> On 18.07.2013 15:36, Nathaniel Smith wrote: > On Wed, Jul 17, 2013 at 5:57 PM, Fr?d?ric Bastien wrote: >> On Wed, Jul 17, 2013 at 10:39 AM, Nathaniel Smith wrote: >>>> >>>> On Tue, Jul 16, 2013 at 11:55 AM, Nathaniel Smith wrote: >>> It's entirely possible I misunderstood, so let's see if we can work it >>> out. I know that you want to assign to the ->data pointer in a >>> PyArrayObject, right? That's what caused some trouble with the 1.7 API >>> deprecations, which were trying to prevent direct access to this >>> field? Creating a new array given a pointer to a memory region is no >>> problem, and obviously will be supported regardless of any >>> optimizations. But if that's all you were doing then you shouldn't >>> have run into the deprecation problem. Or maybe I'm misremembering! >> >> What is currently done at only 1 place is to create a new PyArrayObject with >> a given ptr. So NumPy don't do the allocation. We later change that ptr to >> another one. > > Hmm, OK, so that would still work. If the array has the OWNDATA flag > set (or you otherwise know where the data came from), then swapping > the data pointer would still work. > > The change would be that in most cases when asking numpy to allocate a > new array from scratch, the OWNDATA flag would not be set. That's > because the OWNDATA flag really means "when this object is > deallocated, call free(self->data)", but if we allocate the array > struct and the data buffer together in a single memory region, then > deallocating the object will automatically cause the data buffer to be > deallocated as well, without the array destructor having to take any > special effort. > >> It is the change to the ptr of the just created PyArrayObject that caused >> problem with the interface deprecation. 
I fixed all other problem releated >> to the deprecation (mostly just rename of function/macro). But I didn't >> fixed this one yet. I would need to change the logic to compute the final >> ptr before creating the PyArrayObject object and create it with the final >> data ptr. But in call cases, NumPy didn't allocated data memory for this >> object, so this case don't block your optimization. > > Right. > >> One thing in our optimization "wish list" is to reuse allocated >> PyArrayObject between Theano function call for intermediate results(so >> completly under Theano control). This could be useful in particular for >> reshape/transpose/subtensor. Those functions are pretty fast and from >> memory, I already found the allocation time was significant. But in those >> cases, it is on PyArrayObject that are views, so the metadata and the data >> would be in different memory region in all cases. >> >> The other cases of optimization "wish list" is if we want to reuse the >> PyArrayObject when the shape isn't the good one (but the number of >> dimensions is the same). If we do that for operation like addition, we will >> need to use PyArray_Resize(). This will be done on PyArrayObject whose data >> memory was allocated by NumPy. So if you do one memory allowcation for >> metadata and data, just make sure that PyArray_Resize() will handle that >> correctly. > > I'm not sure I follow the details here, but it does turn out that a > really surprising amount of time in PyArray_NewFromDescr is spent in > just calculating and writing out the shape and strides buffers, so for > programs that e.g. use hundreds of small 3-element arrays to represent > points in space, re-using even these buffers might be a big win... > >> On the usefulness of doing only 1 memory allocation, on our old gpu ndarray, >> we where doing 2 alloc on the GPU, one for metadata and one for data. I >> removed this, as this was a bottleneck. allocation on the CPU are faster the >> on the GPU, but this is still something that is slow except if you reuse >> memory. Do PyMem_Malloc, reuse previous small allocation? > > Yes, at least in theory PyMem_Malloc is highly-optimized for small > buffer re-use. (For requests >256 bytes it just calls malloc().) And > it's possible to define type-specific freelists; not sure if there's > any value in doing that for PyArrayObjects. See Objects/obmalloc.c in > the Python source tree. > > -n PyMem_Malloc is just a wrapper around malloc, so its only as optimized as the c library is (glibc is not good for small allocations). PyObject_Malloc uses a small object allocator for requests smaller 512 bytes (256 in python2). I filed a pull request [0] replacing a few functions which I think are safe to convert to this API. The nditer allocation which is completely encapsulated and the construction of the scalar and array python objects which are deleted via the tp_free slot (we really should not support third party libraries using PyMem_Free on python objects without checks). This already gives up to 15% improvements for scalar operations compared to glibc 2.17 malloc. Do I understand the discussions here right that we could replace PyDimMem_NEW which is used for strides in PyArray with the small object allocation too? It would still allow swapping the stride buffer, but every application must then delete it with PyDimMem_FREE which should be a reasonable requirement. 
[0] https://github.com/numpy/numpy/pull/4177 From nouiz at nouiz.org Wed Jan 8 14:04:38 2014 From: nouiz at nouiz.org (=?ISO-8859-1?Q?Fr=E9d=E9ric_Bastien?=) Date: Wed, 8 Jan 2014 14:04:38 -0500 Subject: [Numpy-discussion] Speedup by avoiding memory alloc twice in scalar array In-Reply-To: <52CD9540.3000802@googlemail.com> References: <52CD9540.3000802@googlemail.com> Message-ID: Hi, As told, I don't think Theano swap the stride buffer. Most of the time, we allocated with PyArray_empty or zeros. (not sure of the capitals). The only exception I remember have been changed in the last release to use PyArray_NewFromDescr(). Before that, we where allocating the PyArray with the right number of dimensions, then we where manually filling the ptr, shapes and strides. I don't recall any swapping of pointer for shapes and strides in Theano. So I don't see why Theano would prevent doing just one malloc for the struct and the shapes/strides. If it does, tell me and I'll fix Theano:) I don't want Theano to prevent optimization in NumPy. Theano now support completly the new NumPy C-API interface. Nathaniel also told that resizing the PyArray could prevent that. When Theano call PyArray_resize (not sure of the syntax), we always keep the number of dimensions the same. But I don't know if other code do differently. That could be a reason to keep separate alloc. I don't know any software that manually free the strides/shapes pointer to swap it. So I also think your suggestion to change PyDimMem_NEW to call the small allocator is good. The new interface prevent people from doing that anyway I think. Do we need to wait until we completly remove the old interface for this? Fred On Wed, Jan 8, 2014 at 1:13 PM, Julian Taylor wrote: > On 18.07.2013 15:36, Nathaniel Smith wrote: >> On Wed, Jul 17, 2013 at 5:57 PM, Fr?d?ric Bastien wrote: >>> On Wed, Jul 17, 2013 at 10:39 AM, Nathaniel Smith wrote: >>>>> >>>>> On Tue, Jul 16, 2013 at 11:55 AM, Nathaniel Smith wrote: >>>> It's entirely possible I misunderstood, so let's see if we can work it >>>> out. I know that you want to assign to the ->data pointer in a >>>> PyArrayObject, right? That's what caused some trouble with the 1.7 API >>>> deprecations, which were trying to prevent direct access to this >>>> field? Creating a new array given a pointer to a memory region is no >>>> problem, and obviously will be supported regardless of any >>>> optimizations. But if that's all you were doing then you shouldn't >>>> have run into the deprecation problem. Or maybe I'm misremembering! >>> >>> What is currently done at only 1 place is to create a new PyArrayObject with >>> a given ptr. So NumPy don't do the allocation. We later change that ptr to >>> another one. >> >> Hmm, OK, so that would still work. If the array has the OWNDATA flag >> set (or you otherwise know where the data came from), then swapping >> the data pointer would still work. >> >> The change would be that in most cases when asking numpy to allocate a >> new array from scratch, the OWNDATA flag would not be set. That's >> because the OWNDATA flag really means "when this object is >> deallocated, call free(self->data)", but if we allocate the array >> struct and the data buffer together in a single memory region, then >> deallocating the object will automatically cause the data buffer to be >> deallocated as well, without the array destructor having to take any >> special effort. >> >>> It is the change to the ptr of the just created PyArrayObject that caused >>> problem with the interface deprecation. 
I fixed all other problem releated >>> to the deprecation (mostly just rename of function/macro). But I didn't >>> fixed this one yet. I would need to change the logic to compute the final >>> ptr before creating the PyArrayObject object and create it with the final >>> data ptr. But in call cases, NumPy didn't allocated data memory for this >>> object, so this case don't block your optimization. >> >> Right. >> >>> One thing in our optimization "wish list" is to reuse allocated >>> PyArrayObject between Theano function call for intermediate results(so >>> completly under Theano control). This could be useful in particular for >>> reshape/transpose/subtensor. Those functions are pretty fast and from >>> memory, I already found the allocation time was significant. But in those >>> cases, it is on PyArrayObject that are views, so the metadata and the data >>> would be in different memory region in all cases. >>> >>> The other cases of optimization "wish list" is if we want to reuse the >>> PyArrayObject when the shape isn't the good one (but the number of >>> dimensions is the same). If we do that for operation like addition, we will >>> need to use PyArray_Resize(). This will be done on PyArrayObject whose data >>> memory was allocated by NumPy. So if you do one memory allowcation for >>> metadata and data, just make sure that PyArray_Resize() will handle that >>> correctly. >> >> I'm not sure I follow the details here, but it does turn out that a >> really surprising amount of time in PyArray_NewFromDescr is spent in >> just calculating and writing out the shape and strides buffers, so for >> programs that e.g. use hundreds of small 3-element arrays to represent >> points in space, re-using even these buffers might be a big win... >> >>> On the usefulness of doing only 1 memory allocation, on our old gpu ndarray, >>> we where doing 2 alloc on the GPU, one for metadata and one for data. I >>> removed this, as this was a bottleneck. allocation on the CPU are faster the >>> on the GPU, but this is still something that is slow except if you reuse >>> memory. Do PyMem_Malloc, reuse previous small allocation? >> >> Yes, at least in theory PyMem_Malloc is highly-optimized for small >> buffer re-use. (For requests >256 bytes it just calls malloc().) And >> it's possible to define type-specific freelists; not sure if there's >> any value in doing that for PyArrayObjects. See Objects/obmalloc.c in >> the Python source tree. >> >> -n > > PyMem_Malloc is just a wrapper around malloc, so its only as optimized > as the c library is (glibc is not good for small allocations). > PyObject_Malloc uses a small object allocator for requests smaller 512 > bytes (256 in python2). > > I filed a pull request [0] replacing a few functions which I think are > safe to convert to this API. The nditer allocation which is completely > encapsulated and the construction of the scalar and array python objects > which are deleted via the tp_free slot (we really should not support > third party libraries using PyMem_Free on python objects without checks). > > This already gives up to 15% improvements for scalar operations compared > to glibc 2.17 malloc. > Do I understand the discussions here right that we could replace > PyDimMem_NEW which is used for strides in PyArray with the small object > allocation too? > It would still allow swapping the stride buffer, but every application > must then delete it with PyDimMem_FREE which should be a reasonable > requirement. 
> > [0] https://github.com/numpy/numpy/pull/4177 > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From ndbecker2 at gmail.com Wed Jan 8 14:12:28 2014 From: ndbecker2 at gmail.com (Neal Becker) Date: Wed, 08 Jan 2014 14:12:28 -0500 Subject: [Numpy-discussion] an indexing question Message-ID: I have a 1d vector d. I want compute the means of subsets of this vector. The subsets are selected by looking at another vector s or same shape as d. This can be done as: [np.mean (d[s == i]) for i in range (size)] But I think this could be done directly with numpy addressing, without resorting to list comprehension? From jaime.frio at gmail.com Wed Jan 8 14:32:19 2014 From: jaime.frio at gmail.com (=?ISO-8859-1?Q?Jaime_Fern=E1ndez_del_R=EDo?=) Date: Wed, 8 Jan 2014 11:32:19 -0800 Subject: [Numpy-discussion] an indexing question In-Reply-To: References: Message-ID: On Wed, Jan 8, 2014 at 11:12 AM, Neal Becker wrote: > I have a 1d vector d. I want compute the means of subsets of this vector. > The subsets are selected by looking at another vector s or same shape as d. > > This can be done as: > > [np.mean (d[s == i]) for i in range (size)] > > But I think this could be done directly with numpy addressing, without > resorting > to list comprehension? > You could get it done with np.bincount: d_sums = np.bincount(s, weights=d) d_counts = np.bincount(s) d_means = d_sums / d_counts Jaime -- (\__/) ( O.o) ( > <) Este es Conejo. Copia a Conejo en tu firma y ay?dale en sus planes de dominaci?n mundial. -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Wed Jan 8 15:40:26 2014 From: njs at pobox.com (Nathaniel Smith) Date: Wed, 8 Jan 2014 14:40:26 -0600 Subject: [Numpy-discussion] Speedup by avoiding memory alloc twice in scalar array In-Reply-To: <52CD9540.3000802@googlemail.com> References: <52CD9540.3000802@googlemail.com> Message-ID: On Wed, Jan 8, 2014 at 12:13 PM, Julian Taylor wrote: > On 18.07.2013 15:36, Nathaniel Smith wrote: >> On Wed, Jul 17, 2013 at 5:57 PM, Fr?d?ric Bastien wrote: >>> On the usefulness of doing only 1 memory allocation, on our old gpu ndarray, >>> we where doing 2 alloc on the GPU, one for metadata and one for data. I >>> removed this, as this was a bottleneck. allocation on the CPU are faster the >>> on the GPU, but this is still something that is slow except if you reuse >>> memory. Do PyMem_Malloc, reuse previous small allocation? >> >> Yes, at least in theory PyMem_Malloc is highly-optimized for small >> buffer re-use. (For requests >256 bytes it just calls malloc().) And >> it's possible to define type-specific freelists; not sure if there's >> any value in doing that for PyArrayObjects. See Objects/obmalloc.c in >> the Python source tree. > > PyMem_Malloc is just a wrapper around malloc, so its only as optimized > as the c library is (glibc is not good for small allocations). > PyObject_Malloc uses a small object allocator for requests smaller 512 > bytes (256 in python2). Right, I meant PyObject_Malloc of course. > I filed a pull request [0] replacing a few functions which I think are > safe to convert to this API. The nditer allocation which is completely > encapsulated and the construction of the scalar and array python objects > which are deleted via the tp_free slot (we really should not support > third party libraries using PyMem_Free on python objects without checks). 
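Returning briefly to the indexing question earlier in this digest: a runnable sketch of the np.bincount approach suggested there, assuming the labels in s are integers in the range 0..size-1 and that every label occurs at least once (an empty group would otherwise produce a divide-by-zero in the final step). The random test data is only there to make the snippet self-contained.

    import numpy as np

    rng = np.random.RandomState(0)
    size = 5
    d = rng.rand(1000)                    # data values
    s = rng.randint(0, size, d.shape)     # integer group label for each element

    means_loop = [np.mean(d[s == i]) for i in range(size)]

    sums = np.bincount(s, weights=d)      # per-label sums of d
    counts = np.bincount(s)               # per-label element counts
    means_vec = sums / counts             # group means in a single pass over d

    print(np.allclose(means_loop, means_vec))   # True
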
> > This already gives up to 15% improvements for scalar operations compared > to glibc 2.17 malloc. > Do I understand the discussions here right that we could replace > PyDimMem_NEW which is used for strides in PyArray with the small object > allocation too? > It would still allow swapping the stride buffer, but every application > must then delete it with PyDimMem_FREE which should be a reasonable > requirement. That sounds reasonable to me. If we wanted to get even more elaborate, we could by default stick the shape/strides into the same allocation as the PyArrayObject, and then defer allocating a separate buffer until someone actually calls PyArray_Resize. (With a new flag, similar to OWNDATA, that tells us whether we need to free the shape/stride buffer when deallocating the array.) It's got to be a vanishingly small proportion of arrays where PyArray_Resize is actually called, so for most arrays, this would let us skip the allocation entirely, and the only cost would be that for arrays where PyArray_Resize *is* called to add new dimensions, we'd leave the original buffers sitting around until the array was freed, wasting a tiny amount of memory. Given that no-one has noticed that currently *every* array wastes 50% of this much memory (see upthread), I doubt anyone will care... -- Nathaniel J. Smith Postdoctoral researcher - Informatics - University of Edinburgh http://vorpus.org From nouiz at nouiz.org Wed Jan 8 15:44:41 2014 From: nouiz at nouiz.org (=?ISO-8859-1?Q?Fr=E9d=E9ric_Bastien?=) Date: Wed, 8 Jan 2014 15:44:41 -0500 Subject: [Numpy-discussion] Speedup by avoiding memory alloc twice in scalar array In-Reply-To: References: <52CD9540.3000802@googlemail.com> Message-ID: On Wed, Jan 8, 2014 at 3:40 PM, Nathaniel Smith wrote: > On Wed, Jan 8, 2014 at 12:13 PM, Julian Taylor > wrote: >> On 18.07.2013 15:36, Nathaniel Smith wrote: >>> On Wed, Jul 17, 2013 at 5:57 PM, Fr?d?ric Bastien wrote: >>>> On the usefulness of doing only 1 memory allocation, on our old gpu ndarray, >>>> we where doing 2 alloc on the GPU, one for metadata and one for data. I >>>> removed this, as this was a bottleneck. allocation on the CPU are faster the >>>> on the GPU, but this is still something that is slow except if you reuse >>>> memory. Do PyMem_Malloc, reuse previous small allocation? >>> >>> Yes, at least in theory PyMem_Malloc is highly-optimized for small >>> buffer re-use. (For requests >256 bytes it just calls malloc().) And >>> it's possible to define type-specific freelists; not sure if there's >>> any value in doing that for PyArrayObjects. See Objects/obmalloc.c in >>> the Python source tree. >> >> PyMem_Malloc is just a wrapper around malloc, so its only as optimized >> as the c library is (glibc is not good for small allocations). >> PyObject_Malloc uses a small object allocator for requests smaller 512 >> bytes (256 in python2). > > Right, I meant PyObject_Malloc of course. > >> I filed a pull request [0] replacing a few functions which I think are >> safe to convert to this API. The nditer allocation which is completely >> encapsulated and the construction of the scalar and array python objects >> which are deleted via the tp_free slot (we really should not support >> third party libraries using PyMem_Free on python objects without checks). >> >> This already gives up to 15% improvements for scalar operations compared >> to glibc 2.17 malloc. 
>> Do I understand the discussions here right that we could replace >> PyDimMem_NEW which is used for strides in PyArray with the small object >> allocation too? >> It would still allow swapping the stride buffer, but every application >> must then delete it with PyDimMem_FREE which should be a reasonable >> requirement. > > That sounds reasonable to me. > > If we wanted to get even more elaborate, we could by default stick the > shape/strides into the same allocation as the PyArrayObject, and then > defer allocating a separate buffer until someone actually calls > PyArray_Resize. (With a new flag, similar to OWNDATA, that tells us > whether we need to free the shape/stride buffer when deallocating the > array.) It's got to be a vanishingly small proportion of arrays where > PyArray_Resize is actually called, so for most arrays, this would let > us skip the allocation entirely, and the only cost would be that for > arrays where PyArray_Resize *is* called to add new dimensions, we'd > leave the original buffers sitting around until the array was freed, > wasting a tiny amount of memory. Given that no-one has noticed that > currently *every* array wastes 50% of this much memory (see upthread), > I doubt anyone will care... Seam a good plan. When is it planed to remove the old interface? We can't do it before I think. Fred From jtaylor.debian at googlemail.com Wed Jan 8 16:39:07 2014 From: jtaylor.debian at googlemail.com (Julian Taylor) Date: Wed, 08 Jan 2014 22:39:07 +0100 Subject: [Numpy-discussion] adding fused multiply and add to numpy Message-ID: <52CDC57B.6010507@googlemail.com> Hi, Since AMDs bulldozer and Intels Haswell x86 cpus now also support the fused-multiply-and-add operation in hardware. http://en.wikipedia.org/wiki/Multiply?accumulate_operation This operation is interesting for numpy for two reasons: - Only one rounding is involved in two arithmetic operations, this is good reducing rounding errors. - Two operations are done in one memory pass, so it improves the performance if ones operations are bound by the memory bandwidth which is very common in numpy. I have done some experiments using a custom ufunc: https://github.com/juliantaylor/npufuncs See the README.md on how to try it out. It requires a recent GCC compiler, at least 4.7 I think. It contains SSE2, AVX, FMA3 (AVX2), FMA4 and software emulation variants. Edit the file to select which one to use. Note if the cpu does not support the instruction it will just crash. Only the latter three are real FMA operations, the SSE2 and AVX variants just perform two regular operations in one loop. My current machine only supports SSE2, so here are the timings for it: In [25]: a = np.arange(500000.) In [26]: b = np.arange(500000.) In [27]: c = np.arange(500000.) In [28]: %timeit npfma.fma(a, b, c) 100 loops, best of 3: 2.49 ms per loop In [30]: def pure_numpy_fma(a,b,c): ....: return a * b + c In [31]: %timeit pure_numpy_fma(a, b, c) 100 loops, best of 3: 7.36 ms per loop In [32]: def pure_numpy_fma2(a,b,c): ....: tmp = a *b ....: tmp += c ....: return tmp In [33]: %timeit pure_numpy_fma2(a, b, c) 100 loops, best of 3: 3.47 ms per loop As you can see even without real hardware support it is about 30% faster than inplace unblocked numpy due better use of memory bandwidth. Its even more than two times faster than unoptimized numpy. If you have a machine capable of fma instructions give it a spin to see if you get similar or better results. 
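To make the "only one rounding" point above concrete, here is a small pure-Python sketch comparing the two-step a * b + c against the exactly rounded result, which is what a hardware fma would return. The inputs are contrived so that the intermediate product rounds away the entire answer; fractions is used only to compute the exact value.

    from fractions import Fraction

    a = 1.0 + 2.0 ** -30
    b = 1.0 - 2.0 ** -30
    c = -1.0

    two_step = a * b + c                          # a*b rounds to 1.0 first, so this is 0.0
    exact = Fraction(a) * Fraction(b) + Fraction(c)

    print(two_step)                               # 0.0
    print(float(exact))                           # about -8.67e-19, the single-rounding (fused) result
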
Please verify the assembly (objdump -d fma-.o) to check if the compiler properly used the machine fma. An issue is software emulation of real fma. This can be enabled in the test ufunc with npfma.set_type("libc"). This is unfortunately incredibly slow about a factor 300 on my machine without hardware fma. This means we either have a function that is fast on some platforms and slow on others but always gives the same result or we have a fast function that gives better results on some platforms. Given that we are not worth that what numpy currently provides I favor the latter. Any opinions on whether this should go into numpy or maybe stay a third party ufunc? Concerning the interface one should probably add several variants mirroring the FMA3 instruction set: http://en.wikipedia.org/wiki/Multiply?accumulate_operation additionally there is fmaddsub (a0 * b0 + c0, a1 *b1 - c1) which can be used for complex numbers, but they probably don't need an explicit numpy interface. From charlesr.harris at gmail.com Wed Jan 8 17:09:58 2014 From: charlesr.harris at gmail.com (Charles R Harris) Date: Wed, 8 Jan 2014 15:09:58 -0700 Subject: [Numpy-discussion] adding fused multiply and add to numpy In-Reply-To: <52CDC57B.6010507@googlemail.com> References: <52CDC57B.6010507@googlemail.com> Message-ID: On Wed, Jan 8, 2014 at 2:39 PM, Julian Taylor wrote: > Hi, > Since AMDs bulldozer and Intels Haswell x86 cpus now also support the > fused-multiply-and-add operation in hardware. > > http://en.wikipedia.org/wiki/Multiply?accumulate_operation > > This operation is interesting for numpy for two reasons: > - Only one rounding is involved in two arithmetic operations, this is > good reducing rounding errors. > - Two operations are done in one memory pass, so it improves the > performance if ones operations are bound by the memory bandwidth which > is very common in numpy. > > I have done some experiments using a custom ufunc: > https://github.com/juliantaylor/npufuncs > > See the README.md on how to try it out. It requires a recent GCC > compiler, at least 4.7 I think. > > It contains SSE2, AVX, FMA3 (AVX2), FMA4 and software emulation > variants. Edit the file to select which one to use. Note if the cpu does > not support the instruction it will just crash. > Only the latter three are real FMA operations, the SSE2 and AVX variants > just perform two regular operations in one loop. > > My current machine only supports SSE2, so here are the timings for it: > > In [25]: a = np.arange(500000.) > In [26]: b = np.arange(500000.) > In [27]: c = np.arange(500000.) > > In [28]: %timeit npfma.fma(a, b, c) > 100 loops, best of 3: 2.49 ms per loop > > In [30]: def pure_numpy_fma(a,b,c): > ....: return a * b + c > > In [31]: %timeit pure_numpy_fma(a, b, c) > 100 loops, best of 3: 7.36 ms per loop > > > In [32]: def pure_numpy_fma2(a,b,c): > ....: tmp = a *b > ....: tmp += c > ....: return tmp > > In [33]: %timeit pure_numpy_fma2(a, b, c) > 100 loops, best of 3: 3.47 ms per loop > > > As you can see even without real hardware support it is about 30% faster > than inplace unblocked numpy due better use of memory bandwidth. Its > even more than two times faster than unoptimized numpy. > > If you have a machine capable of fma instructions give it a spin to see > if you get similar or better results. Please verify the assembly > (objdump -d fma-.o) to check if the compiler properly used the > machine fma. > > An issue is software emulation of real fma. This can be enabled in the > test ufunc with npfma.set_type("libc"). 
> This is unfortunately incredibly slow about a factor 300 on my machine > without hardware fma. > This means we either have a function that is fast on some platforms and > slow on others but always gives the same result or we have a fast > function that gives better results on some platforms. > Given that we are not worth that what numpy currently provides I favor > the latter. > > Any opinions on whether this should go into numpy or maybe stay a third > party ufunc? > Multiply and add is a standard function that I think would be good to have in numpy. Not only does it save on memory accesses, it saves on temporary arrays. Another function that could be useful is a |a|**2 function, abs2 perhaps. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From ndbecker2 at gmail.com Thu Jan 9 09:35:22 2014 From: ndbecker2 at gmail.com (Neal Becker) Date: Thu, 09 Jan 2014 09:35:22 -0500 Subject: [Numpy-discussion] adding fused multiply and add to numpy References: <52CDC57B.6010507@googlemail.com> Message-ID: Charles R Harris wrote: > On Wed, Jan 8, 2014 at 2:39 PM, Julian Taylor > wrote: > ... > > Another function that could be useful is a |a|**2 function, abs2 perhaps. > > Chuck I use mag_sqr all the time. It should be much faster for complex, if computed via: x.real**2 + x.imag**2 avoiding the sqrt of abs. From freddie at witherden.org Thu Jan 9 09:43:07 2014 From: freddie at witherden.org (Freddie Witherden) Date: Thu, 09 Jan 2014 14:43:07 +0000 Subject: [Numpy-discussion] adding fused multiply and add to numpy In-Reply-To: <52CDC57B.6010507@googlemail.com> References: <52CDC57B.6010507@googlemail.com> Message-ID: <52CEB57B.1090504@witherden.org> On 08/01/14 21:39, Julian Taylor wrote: > An issue is software emulation of real fma. This can be enabled in the > test ufunc with npfma.set_type("libc"). > This is unfortunately incredibly slow about a factor 300 on my machine > without hardware fma. > This means we either have a function that is fast on some platforms and > slow on others but always gives the same result or we have a fast > function that gives better results on some platforms. > Given that we are not worth that what numpy currently provides I favor > the latter. > > Any opinions on whether this should go into numpy or maybe stay a third > party ufunc? My preference would be to initially add an "madd" intrinsic. This can be supported on all platforms and can be documented to permit the use of FMA where available. A 'true' FMA intrinsic function should only be provided when hardware FMA support is available. Many of the more interesting applications of FMA depend on there only being a single rounding step and as such "FMA" should probably mean "a*b + c with only a single rounding". Regards, Freddie. -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 836 bytes Desc: OpenPGP digital signature URL: From nouiz at nouiz.org Thu Jan 9 09:50:55 2014 From: nouiz at nouiz.org (=?ISO-8859-1?Q?Fr=E9d=E9ric_Bastien?=) Date: Thu, 9 Jan 2014 09:50:55 -0500 Subject: [Numpy-discussion] adding fused multiply and add to numpy In-Reply-To: <52CEB57B.1090504@witherden.org> References: <52CDC57B.6010507@googlemail.com> <52CEB57B.1090504@witherden.org> Message-ID: Hi, It happen frequently that NumPy isn't compiled with all instruction that is available where it run. For example in distro. 
So if the decision is made to use the fast version when we don't use the newer instruction, the user need a way to know that. So the library need a function/attribute to tell that. How hard would it be to provide the choise to the user? We could provide 2 functions like: fma_fast() fma_prec() (for precision)? Or this could be a parameter or a user configuration option like for the overflow/underflow error. Fred On Thu, Jan 9, 2014 at 9:43 AM, Freddie Witherden wrote: > On 08/01/14 21:39, Julian Taylor wrote: >> An issue is software emulation of real fma. This can be enabled in the >> test ufunc with npfma.set_type("libc"). >> This is unfortunately incredibly slow about a factor 300 on my machine >> without hardware fma. >> This means we either have a function that is fast on some platforms and >> slow on others but always gives the same result or we have a fast >> function that gives better results on some platforms. >> Given that we are not worth that what numpy currently provides I favor >> the latter. >> >> Any opinions on whether this should go into numpy or maybe stay a third >> party ufunc? > > My preference would be to initially add an "madd" intrinsic. This can > be supported on all platforms and can be documented to permit the use of > FMA where available. > > A 'true' FMA intrinsic function should only be provided when hardware > FMA support is available. Many of the more interesting applications of > FMA depend on there only being a single rounding step and as such "FMA" > should probably mean "a*b + c with only a single rounding". > > Regards, Freddie. > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From davidmenhur at gmail.com Thu Jan 9 09:54:42 2014 From: davidmenhur at gmail.com (=?UTF-8?B?RGHPgGlk?=) Date: Thu, 9 Jan 2014 15:54:42 +0100 Subject: [Numpy-discussion] adding fused multiply and add to numpy In-Reply-To: <52CDC57B.6010507@googlemail.com> References: <52CDC57B.6010507@googlemail.com> Message-ID: On 8 January 2014 22:39, Julian Taylor wrote: > As you can see even without real hardware support it is about 30% faster > than inplace unblocked numpy due better use of memory bandwidth. Its > even more than two times faster than unoptimized numpy. > I have an i5, and AVX crashes, even though it is supported by my CPU. Here are my timings: SSE2: In [24]: %timeit npfma.fma(a, b, c) 100000 loops, best of 3: 15 us per loop In [28]: %timeit npfma.fma(a, b, c) 100 loops, best of 3: 2.36 ms per loop In [29]: %timeit npfma.fms(a, b, c) 100 loops, best of 3: 2.36 ms per loop In [31]: %timeit pure_numpy_fma(a, b, c) 100 loops, best of 3: 7.5 ms per loop In [33]: %timeit pure_numpy_fma2(a, b, c) 100 loops, best of 3: 4.41 ms per loop The model supports all the way to sse4_2 libc: In [24]: %timeit npfma.fma(a, b, c) 1000 loops, best of 3: 883 us per loop In [28]: %timeit npfma.fma(a, b, c) 10 loops, best of 3: 88.7 ms per loop In [29]: %timeit npfma.fms(a, b, c) 10 loops, best of 3: 87.4 ms per loop In [31]: %timeit pure_numpy_fma(a, b, c) 100 loops, best of 3: 7.94 ms per loop In [33]: %timeit pure_numpy_fma2(a, b, c) 100 loops, best of 3: 3.03 ms per loop > If you have a machine capable of fma instructions give it a spin to see > if you get similar or better results. Please verify the assembly > (objdump -d fma-.o) to check if the compiler properly used the > machine fma. 
> Following the instructions in the readme, there is only one compiled file, npfma.so, but no .o. /David. -------------- next part -------------- An HTML attachment was scrubbed... URL: From jtaylor.debian at googlemail.com Thu Jan 9 10:18:03 2014 From: jtaylor.debian at googlemail.com (Julian Taylor) Date: Thu, 9 Jan 2014 16:18:03 +0100 Subject: [Numpy-discussion] adding fused multiply and add to numpy In-Reply-To: References: <52CDC57B.6010507@googlemail.com> Message-ID: On Thu, Jan 9, 2014 at 3:54 PM, Da?id wrote: > > On 8 January 2014 22:39, Julian Taylor wrote: > >> As you can see even without real hardware support it is about 30% faster >> than inplace unblocked numpy due better use of memory bandwidth. Its >> even more than two times faster than unoptimized numpy. >> > > I have an i5, and AVX crashes, even though it is supported by my CPU. > I forgot about the 32 byte alignment avx (as it is used in this code) requires. I pushed a new version that takes care of it. It should now work with avx. > Following the instructions in the readme, there is only one compiled file, > npfma.so, but no .o. > > > the .o files are in the build/ subfolder -------------- next part -------------- An HTML attachment was scrubbed... URL: From jtaylor.debian at googlemail.com Thu Jan 9 10:30:14 2014 From: jtaylor.debian at googlemail.com (Julian Taylor) Date: Thu, 9 Jan 2014 16:30:14 +0100 Subject: [Numpy-discussion] adding fused multiply and add to numpy In-Reply-To: References: <52CDC57B.6010507@googlemail.com> <52CEB57B.1090504@witherden.org> Message-ID: On Thu, Jan 9, 2014 at 3:50 PM, Fr?d?ric Bastien wrote: > Hi, > > It happen frequently that NumPy isn't compiled with all instruction > that is available where it run. For example in distro. So if the > decision is made to use the fast version when we don't use the newer > instruction, the user need a way to know that. So the library need a > function/attribute to tell that. > As these instructions are very new runtime cpu feature detection is required. That way also distribution users get the fast code if their cpu supports it. > > How hard would it be to provide the choise to the user? We could > provide 2 functions like: fma_fast() fma_prec() (for precision)? Or > this could be a parameter or a user configuration option like for the > overflow/underflow error. > I like Freddie Witherden proposal to name the function madd which does not guarantee one rounding operation. This leaves the namespace open for a special fma function with that guarantee. It can use the libc fma function which is very slow sometimes but platform independent. This is assuming apple did not again take shortcuts like they did with their libc hypot implementation, can someone disassemble apple libc to check what they are doing for C99 fma? And it leaves users the possibility to use the faster madd function if they do not need the precision guarantee. Another option would be a precision context manager which tells numpy which variant to use. This would also be useful for other code (like abs/hypot/abs2/sum/reciprocal sqrt) but probably it involves more work. with numpy.precision_mode('fast'): ... # allow no fma, use fast hypot, fast sum, ignore overflow/invalid errors with numpy.precision_mode('precise'): ... 
# require fma, use precise hypot, use exact summation (math.fsum) or at least kahan summation, full overflow/invalid checks etc > > > On Thu, Jan 9, 2014 at 9:43 AM, Freddie Witherden > wrote: > > On 08/01/14 21:39, Julian Taylor wrote: > >> An issue is software emulation of real fma. This can be enabled in the > >> test ufunc with npfma.set_type("libc"). > >> This is unfortunately incredibly slow about a factor 300 on my machine > >> without hardware fma. > >> This means we either have a function that is fast on some platforms and > >> slow on others but always gives the same result or we have a fast > >> function that gives better results on some platforms. > >> Given that we are not worth that what numpy currently provides I favor > >> the latter. > >> > >> Any opinions on whether this should go into numpy or maybe stay a third > >> party ufunc? > > > > My preference would be to initially add an "madd" intrinsic. This can > > be supported on all platforms and can be documented to permit the use of > > FMA where available. > > > > A 'true' FMA intrinsic function should only be provided when hardware > > FMA support is available. Many of the more interesting applications of > > FMA depend on there only being a single rounding step and as such "FMA" > > should probably mean "a*b + c with only a single rounding". > > > > Regards, Freddie. > > > > > > _______________________________________________ > > NumPy-Discussion mailing list > > NumPy-Discussion at scipy.org > > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Thu Jan 9 12:07:00 2014 From: njs at pobox.com (Nathaniel Smith) Date: Thu, 9 Jan 2014 17:07:00 +0000 Subject: [Numpy-discussion] adding fused multiply and add to numpy In-Reply-To: References: <52CDC57B.6010507@googlemail.com> <52CEB57B.1090504@witherden.org> Message-ID: On Thu, Jan 9, 2014 at 3:30 PM, Julian Taylor wrote: > On Thu, Jan 9, 2014 at 3:50 PM, Fr?d?ric Bastien wrote: >> How hard would it be to provide the choise to the user? We could >> provide 2 functions like: fma_fast() fma_prec() (for precision)? Or >> this could be a parameter or a user configuration option like for the >> overflow/underflow error. > > I like Freddie Witherden proposal to name the function madd which does not > guarantee one rounding operation. This leaves the namespace open for a > special fma function with that guarantee. It can use the libc fma function > which is very slow sometimes but platform independent. This is assuming > apple did not again take shortcuts like they did with their libc hypot > implementation, can someone disassemble apple libc to check what they are > doing for C99 fma? > And it leaves users the possibility to use the faster madd function if they > do not need the precision guarantee. If madd doesn't provide any rounding guarantees, then its only reason for existence is that it provides a fused a*b+c loop that better utilizes memory bandwidth, right? I'm guessing that speed-wise it doesn't really matter whether you use the fancy AVX instructions or not, since even the naive implementation is memory bound -- the advantage is just in the fusion? Lack of loop fusion is obviously a major limitation of numpy, but it's a very general problem. 
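As an aside on the loop-fusion point here (and the numexpr suggestion that follows below): the third-party numexpr package already provides fused, blocked evaluation of expressions like a * b + c without adding a dedicated ufunc, though it gives no single-rounding guarantee. A minimal sketch, assuming numexpr is installed:

    import numpy as np
    import numexpr as ne            # third-party package, not part of numpy

    a = np.arange(500000.)
    b = np.arange(500000.)
    c = np.arange(500000.)

    fused = ne.evaluate("a * b + c")        # evaluated in cache-sized blocks, one pass over the inputs
    print(np.allclose(fused, a * b + c))    # True
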
I'm sceptical about whether we want to get into the business of adding functions whose only purpose is to provide pre-fused loops. After madd, what other operations should we provide like this? msub (a*b-c)? add3 (a+b+c)? maddm (a*b+c*d)? mult3 (a*b*c)? How do we decide? Surely it's better to direct people who are hitting memory bottlenecks to much more powerful and general solutions to this problem, like numexpr/cython/numba/theano? (OTOH the verison that gives rounding guarantees is obviously a unique new feature.) -n From jaime.frio at gmail.com Thu Jan 9 13:32:29 2014 From: jaime.frio at gmail.com (=?ISO-8859-1?Q?Jaime_Fern=E1ndez_del_R=EDo?=) Date: Thu, 9 Jan 2014 10:32:29 -0800 Subject: [Numpy-discussion] ENH: add a 'return_counts=' keyword argument to `np.unique` Message-ID: Hi, I have just sent a PR, adding a `return_counts` keyword argument to `np.unique` that does exactly what the name suggests: counting the number of times each unique time comes up in the array. It reuses the `flag` array that is constructed whenever any optional index is requested, extracts the indices of the `True`s in it, and returns their diff. You can check it here: https://github.com/numpy/numpy/pull/4180 Regards, Jaime -- (\__/) ( O.o) ( > <) Este es Conejo. Copia a Conejo en tu firma y ay?dale en sus planes de dominaci?n mundial. -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Thu Jan 9 18:21:17 2014 From: charlesr.harris at gmail.com (Charles R Harris) Date: Thu, 9 Jan 2014 16:21:17 -0700 Subject: [Numpy-discussion] Memory allocation cleanup Message-ID: Apropos Julian's changes to use the PyObject_* allocation suite for some parts of numpy, I posted the following I think numpy memory management is due a cleanup. Currently we have PyDataMem_* PyDimMem_* PyArray_* Plus the malloc, PyMem_*, and PyObject_* interfaces. That is six ways to manage heap allocations. As far as I can tell, PyArray_* is always PyMem_*in practice. We probably need to keep the PyDataMem family as it has a memory tracking option, but PyDimMem just confuses things, I'd rather just use PyMem_* with explicit size. Curiously, the PyObject_Malloc family is not documented apart from some release notes. We should also check for the macro versions of PyMem_* as they are deprecated for extension modules. Nathaniel then suggested that we consider going all Python allocators, especially as new memory tracing tools are coming online in 3.4. Given that these changes could have some impact on current extension writers I thought I'd bring this up on the list to gather opinions. Thoughts? -------------- next part -------------- An HTML attachment was scrubbed... URL: From nouiz at nouiz.org Thu Jan 9 19:35:32 2014 From: nouiz at nouiz.org (=?ISO-8859-1?Q?Fr=E9d=E9ric_Bastien?=) Date: Thu, 9 Jan 2014 19:35:32 -0500 Subject: [Numpy-discussion] Memory allocation cleanup In-Reply-To: References: Message-ID: This shouldn't affect Theano. So I have no objection. Making thing faster and more tracktable is always good. So I think it seam a good idea. Fred On Thu, Jan 9, 2014 at 6:21 PM, Charles R Harris wrote: > Apropos Julian's changes to use the PyObject_* allocation suite for some > parts of numpy, I posted the following > > I think numpy memory management is due a cleanup. Currently we have > > PyDataMem_* > PyDimMem_* > PyArray_* > > Plus the malloc, PyMem_*, and PyObject_* interfaces. That is six ways to > manage heap allocations. 
As far as I can tell, PyArray_* is always PyMem_* > in practice. We probably need to keep the PyDataMem family as it has a > memory tracking option, but PyDimMem just confuses things, I'd rather just > use PyMem_* with explicit size. Curiously, the PyObject_Malloc family is not > documented apart from some release notes. > > We should also check for the macro versions of PyMem_* as they are > deprecated for extension modules. > > Nathaniel then suggested that we consider going all Python allocators, > especially as new memory tracing tools are coming online in 3.4. Given that > these changes could have some impact on current extension writers I thought > I'd bring this up on the list to gather opinions. > > Thoughts? > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From nouiz at nouiz.org Thu Jan 9 19:49:00 2014 From: nouiz at nouiz.org (=?ISO-8859-1?Q?Fr=E9d=E9ric_Bastien?=) Date: Thu, 9 Jan 2014 19:49:00 -0500 Subject: [Numpy-discussion] adding fused multiply and add to numpy In-Reply-To: References: <52CDC57B.6010507@googlemail.com> <52CEB57B.1090504@witherden.org> Message-ID: Good questions where do we stop. I think as you that the fma with guarantees is a good new feature. But if this is made available, people will want to use it for speed. Some people won't like to use another library or dependency. They won't like to have random speed up or slow down. So why not add the ma and fma and trace the line to the operation implemented on the CPU that have an fused version? That will make a sensible limit I think. Anyway, we won't use it directly. This is just my taught. Do you know if those instruction are automatically used by gcc if we use the good architecture parameter? Fred On Thu, Jan 9, 2014 at 12:07 PM, Nathaniel Smith wrote: > On Thu, Jan 9, 2014 at 3:30 PM, Julian Taylor > wrote: >> On Thu, Jan 9, 2014 at 3:50 PM, Fr?d?ric Bastien wrote: >>> How hard would it be to provide the choise to the user? We could >>> provide 2 functions like: fma_fast() fma_prec() (for precision)? Or >>> this could be a parameter or a user configuration option like for the >>> overflow/underflow error. >> >> I like Freddie Witherden proposal to name the function madd which does not >> guarantee one rounding operation. This leaves the namespace open for a >> special fma function with that guarantee. It can use the libc fma function >> which is very slow sometimes but platform independent. This is assuming >> apple did not again take shortcuts like they did with their libc hypot >> implementation, can someone disassemble apple libc to check what they are >> doing for C99 fma? >> And it leaves users the possibility to use the faster madd function if they >> do not need the precision guarantee. > > If madd doesn't provide any rounding guarantees, then its only reason > for existence is that it provides a fused a*b+c loop that better > utilizes memory bandwidth, right? I'm guessing that speed-wise it > doesn't really matter whether you use the fancy AVX instructions or > not, since even the naive implementation is memory bound -- the > advantage is just in the fusion? > > Lack of loop fusion is obviously a major limitation of numpy, but it's > a very general problem. I'm sceptical about whether we want to get > into the business of adding functions whose only purpose is to provide > pre-fused loops. After madd, what other operations should we provide > like this? msub (a*b-c)? 
add3 (a+b+c)? maddm (a*b+c*d)? mult3 (a*b*c)? > How do we decide? Surely it's better to direct people who are hitting > memory bottlenecks to much more powerful and general solutions to this > problem, like numexpr/cython/numba/theano? > > (OTOH the verison that gives rounding guarantees is obviously a unique > new feature.) > > -n > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From jtaylor.debian at googlemail.com Thu Jan 9 20:06:01 2014 From: jtaylor.debian at googlemail.com (Julian Taylor) Date: Fri, 10 Jan 2014 02:06:01 +0100 Subject: [Numpy-discussion] adding fused multiply and add to numpy In-Reply-To: References: <52CDC57B.6010507@googlemail.com> <52CEB57B.1090504@witherden.org> Message-ID: <52CF4779.3050403@googlemail.com> On 10.01.2014 01:49, Fr?d?ric Bastien wrote: > > Do you know if those instruction are automatically used by gcc if we > use the good architecture parameter? > > they are used if you enable -ffp-contract=fast. Do not set it to `on` this is an alias to `off` due to the semantics of C. -ffast-math enables in in gcc 4.7 and 4.8 but not in 4.9 but this might be a bug, I filed one a while ago. Also you need to set the -mfma or -arch=bdver{1,2,3,4}. Its not part of -mavx2 last I checked. But there are not many places in numpy the compiler can use it, only dot comes to mind which goes over blas libraries in the high performance case. From njs at pobox.com Thu Jan 9 21:48:25 2014 From: njs at pobox.com (Nathaniel Smith) Date: Fri, 10 Jan 2014 02:48:25 +0000 Subject: [Numpy-discussion] Memory allocation cleanup In-Reply-To: References: Message-ID: On Thu, Jan 9, 2014 at 11:21 PM, Charles R Harris wrote: > Apropos Julian's changes to use the PyObject_* allocation suite for some > parts of numpy, I posted the following > > I think numpy memory management is due a cleanup. Currently we have > > PyDataMem_* > PyDimMem_* > PyArray_* > > Plus the malloc, PyMem_*, and PyObject_* interfaces. That is six ways to > manage heap allocations. As far as I can tell, PyArray_* is always PyMem_* > in practice. We probably need to keep the PyDataMem family as it has a > memory tracking option, but PyDimMem just confuses things, I'd rather just > use PyMem_* with explicit size. Curiously, the PyObject_Malloc family is not > documented apart from some release notes. > > We should also check for the macro versions of PyMem_* as they are > deprecated for extension modules. > > Nathaniel then suggested that we consider going all Python allocators, > especially as new memory tracing tools are coming online in 3.4. Given that > these changes could have some impact on current extension writers I thought > I'd bring this up on the list to gather opinions. > > Thoughts? After a bit more research, some further points to keep in mind: Currently, PyDimMem_* and PyArray_* are just aliases for malloc/free, and PyDataMem_* is an alias for malloc/free with some extra tracing hooks wrapped around it. (AFAIK, these tracing hooks are not used by anyone anywhere -- at least, if they are I haven't heard about it, and there is no code on github that uses them.) There is one substantial difference between the PyMem_* and PyObject_* interfaces as compared to malloc(), which is that the Py* interfaces require that the GIL be held when they are called. (@Julian -- I think your PR we just merged fulfills this requirement, is that right?) 
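For readers who have not used it, a short sketch of what the Python 3.4 tracing machinery referenced below looks like from the Python side. Whether the 8 MB data buffer actually shows up in the snapshot depends on whether numpy routes its data allocations through a traced allocator, which is exactly the open question in this thread.

    # Requires Python >= 3.4 (or run the interpreter with -X tracemalloc)
    import tracemalloc
    import numpy as np

    tracemalloc.start()
    a = np.ones(10 ** 6)                         # ~8 MB data buffer
    snapshot = tracemalloc.take_snapshot()
    for stat in snapshot.statistics("lineno")[:5]:
        print(stat)                              # the 8 MB allocation is listed only if the
                                                 # buffer went through a traced allocator
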
I strongly suspect that we have PyDataMem_* calls outside of the GIL -- e.g., when allocating ufunc buffers -- and third-party code might as well. Python 3.4's new memory allocation API and tracing stuff is documented here: http://www.python.org/dev/peps/pep-0445/ http://docs.python.org/dev/c-api/memory.html http://docs.python.org/dev/library/tracemalloc.html In particular, 3.4 adds a set of PyRawMem_* functions, which do not require the GIL. Checking the current source code for _tracemalloc.c, it appears that PyRawMem_* functions *are* traced, so that's nice - that means that switching PyDataMem_* to use PyRawMem_* would be both safe and provide benefits. However, PyRawMem_* does not provide the pymalloc optimizations for small allocations. Also, none of the Py* interfaces implement calloc(), which is annoying because it messes up our new optimization of using calloc() for np.zeros. (calloc() is generally faster than malloc()+explicit zeroing, because it can use OS-specific virtual memory tricks to zero out the memory "for free". These same tricks also mean that if you use np.zeros() to allocate a large array, and then only write to a few entries in that array, the total memory used is proportional to the number of non-zero entries, rather than to the actual size of the array, which can be extremely useful in some situations as a kind of "poor man's sparse array".) I'm pretty sure that the vast majority of our allocations do occur with GIL protection, so we might want to switch to using PyObject_* for most cases to take advantage of the small-object optimizations, and use PyRawMem_* for any non-GIL cases (like possibly ufunc buffers), with a compatibility wrapper to replace PyRawMem_* with malloc() on pre-3.4 pythons. Of course this will need some profiling to see if PyObject_* is actually better than malloc() in practice. For calloc(), we could try and convince python-dev to add this, or np.zeros() could explicitly use calloc() even when other code uses Py* interface and then uses an ndarray flag or special .base object to keep track of the fact that we need to use free() to deallocate this memory, or we could give up on the calloc optimization. -n From jtaylor.debian at googlemail.com Fri Jan 10 04:18:05 2014 From: jtaylor.debian at googlemail.com (Julian Taylor) Date: Fri, 10 Jan 2014 10:18:05 +0100 Subject: [Numpy-discussion] Memory allocation cleanup In-Reply-To: References: Message-ID: On Fri, Jan 10, 2014 at 3:48 AM, Nathaniel Smith wrote: > On Thu, Jan 9, 2014 at 11:21 PM, Charles R Harris > wrote: > > [...] > > After a bit more research, some further points to keep in mind: > > Currently, PyDimMem_* and PyArray_* are just aliases for malloc/free, > and PyDataMem_* is an alias for malloc/free with some extra tracing > hooks wrapped around it. (AFAIK, these tracing hooks are not used by > anyone anywhere -- at least, if they are I haven't heard about it, and > there is no code on github that uses them.) > There is one substantial difference between the PyMem_* and PyObject_* > interfaces as compared to malloc(), which is that the Py* interfaces > require that the GIL be held when they are called. (@Julian -- I think > your PR we just merged fulfills this requirement, is that right?) I only replaced object allocation which should always be called under GIL, not sure about nditer construction, but it does uses python exceptions for errors which I think also require the GIL. [...] 
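For readers unfamiliar with the calloc() behaviour described above, a small demonstration. It assumes a 64-bit POSIX system (the resource module is not available on Windows, and ru_maxrss is reported in kilobytes on Linux) and that np.zeros obtains its buffer from calloc() as discussed here; the peak-RSS counter only grows, which is why simple differences are meaningful below. Exact numbers will vary by machine.

    import numpy as np
    import resource                       # POSIX only

    def peak_rss_kb():
        return resource.getrusage(resource.RUSAGE_SELF).ru_maxrss

    base = peak_rss_kb()
    z = np.zeros((8192, 8192))            # 512 MB of address space, obtained via calloc
    print(peak_rss_kb() - base)           # small: untouched zero pages are not resident yet

    z[::64, ::64] = 1.0                   # write to a sparse subset of the pages
    print(peak_rss_kb() - base)           # grows only in proportion to the pages touched

    z += 1.0                              # now every page is written
    print(peak_rss_kb() - base)           # roughly the full 512 MB becomes resident
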
> > Also, none of the Py* interfaces implement calloc(), which is annoying > because it messes up our new optimization of using calloc() for > np.zeros. [...] > Another thing that is not directly implemented in Python is aligned allocation. This is going to get increasingly important with the advent heavily vectorized x86 CPUs (e.g. AVX512 is rolling out now) and the C malloc being optimized for the oldish SSE (16 bytes). I want to change the array buffer allocation to make use of posix_memalign and C11 aligned_malloc if available to avoid some penalties when loading from non 32 byte aligned buffers. I could imagine it might also help coprocessors and gpus to have higher alignments, but I'm not very familiar with that type of hardware. The allocator used by the Python3.4 is plugable, so we could implement our special allocators with the new API, but only when 3.4 is more widespread. For this reason and missing calloc I don't think we should use the Python API for data buffers just yet. Any benefits are relatively small anyway. [...] > > I'm pretty sure that the vast majority of our allocations do occur > with GIL protection, so we might want to switch to using PyObject_* > for most cases to take advantage of the small-object optimizations, > and use PyRawMem_* for any non-GIL cases (like possibly ufunc > buffers), with a compatibility wrapper to replace PyRawMem_* with > malloc() on pre-3.4 pythons. Of course this will need some profiling > to see if PyObject_* is actually better than malloc() in practice. I don't think its required to replace everything with PyObject_* just because it can be faster. We should do it only in places where it really makes a difference and there are not that many of them. -------------- next part -------------- An HTML attachment was scrubbed... URL: From nouiz at nouiz.org Fri Jan 10 09:52:23 2014 From: nouiz at nouiz.org (=?ISO-8859-1?Q?Fr=E9d=E9ric_Bastien?=) Date: Fri, 10 Jan 2014 09:52:23 -0500 Subject: [Numpy-discussion] Memory allocation cleanup In-Reply-To: References: Message-ID: On Fri, Jan 10, 2014 at 4:18 AM, Julian Taylor wrote: > On Fri, Jan 10, 2014 at 3:48 AM, Nathaniel Smith wrote: >> >> On Thu, Jan 9, 2014 at 11:21 PM, Charles R Harris >> wrote: >> > [...] >> >> After a bit more research, some further points to keep in mind: >> >> Currently, PyDimMem_* and PyArray_* are just aliases for malloc/free, >> and PyDataMem_* is an alias for malloc/free with some extra tracing >> hooks wrapped around it. (AFAIK, these tracing hooks are not used by >> anyone anywhere -- at least, if they are I haven't heard about it, and >> there is no code on github that uses them.) >> >> >> There is one substantial difference between the PyMem_* and PyObject_* >> interfaces as compared to malloc(), which is that the Py* interfaces >> require that the GIL be held when they are called. (@Julian -- I think >> your PR we just merged fulfills this requirement, is that right?) > > > I only replaced object allocation which should always be called under GIL, > not sure about nditer construction, but it does uses python exceptions for > errors which I think also require the GIL. > > [...] >> >> >> Also, none of the Py* interfaces implement calloc(), which is annoying >> because it messes up our new optimization of using calloc() for >> np.zeros. [...] > > > Another thing that is not directly implemented in Python is aligned > allocation. This is going to get increasingly important with the advent > heavily vectorized x86 CPUs (e.g. 
AVX512 is rolling out now) and the C > malloc being optimized for the oldish SSE (16 bytes). I want to change the > array buffer allocation to make use of posix_memalign and C11 aligned_malloc > if available to avoid some penalties when loading from non 32 byte aligned > buffers. I could imagine it might also help coprocessors and gpus to have > higher alignments, but I'm not very familiar with that type of hardware. > The allocator used by the Python3.4 is plugable, so we could implement our > special allocators with the new API, but only when 3.4 is more widespread. About the co-processor and GPUs, it could help, but as NumPy is CPU only and that there is other problem in directly using it, I dought that this change would help code around co-processor/GPUs. Fred From njs at pobox.com Fri Jan 10 11:03:11 2014 From: njs at pobox.com (Nathaniel Smith) Date: Fri, 10 Jan 2014 16:03:11 +0000 Subject: [Numpy-discussion] Memory allocation cleanup In-Reply-To: References: Message-ID: On Fri, Jan 10, 2014 at 9:18 AM, Julian Taylor wrote: > On Fri, Jan 10, 2014 at 3:48 AM, Nathaniel Smith wrote: >> >> Also, none of the Py* interfaces implement calloc(), which is annoying >> because it messes up our new optimization of using calloc() for >> np.zeros. [...] > > > Another thing that is not directly implemented in Python is aligned > allocation. This is going to get increasingly important with the advent > heavily vectorized x86 CPUs (e.g. AVX512 is rolling out now) and the C > malloc being optimized for the oldish SSE (16 bytes). I want to change the > array buffer allocation to make use of posix_memalign and C11 aligned_malloc > if available to avoid some penalties when loading from non 32 byte aligned > buffers. I could imagine it might also help coprocessors and gpus to have > higher alignments, but I'm not very familiar with that type of hardware. > The allocator used by the Python3.4 is plugable, so we could implement our > special allocators with the new API, but only when 3.4 is more widespread. > > For this reason and missing calloc I don't think we should use the Python > API for data buffers just yet. Any benefits are relatively small anyway. It really would be nice if our data allocations would all be visible to the tracemalloc library though, somehow. And I doubt we want to patch *all* Python allocations to go through posix_memalign, both because this is rather intrusive and because it would break python -X tracemalloc. How certain are we that we want to switch to aligned allocators in the future? If we don't, then maybe it makes to ask python-dev for a calloc interface; but if we do, then I doubt we can convince them to add aligned allocation interfaces, and we'll need to ask for something else (maybe a "null" allocator, which just notifies the python memory tracking machinery that we allocated something ourselves?). It's not obvious to me why aligning data buffers is useful - can you elaborate? There's no code simplification, because we always have to handle the unaligned case anyway with the standard unaligned startup/cleanup loops. And intuitively, given the existence of such loops, alignment shouldn't matter much in practice, since the most that shifting alignment can do is change the number of elements that need to be handled by such loops by (SIMD alignment value / element size). For doubles, in a buffer that has 16 byte alignment but not 32 byte alignment, this means that worst case, we end up doing 4 unnecessary non-SIMD operations. 
And surely that only matters for very small arrays (for large arrays such constant overhead will amortize out), but for small arrays SIMD doesn't help much anyway? Probably I'm missing something, because you actually know something about SIMD and I'm just hand-waving from first principles :-). But it'd be nice to understand the reasoning for why/whether alignment really helps in the numpy context. -- Nathaniel J. Smith Postdoctoral researcher - Informatics - University of Edinburgh http://vorpus.org From lists at hilboll.de Fri Jan 10 11:03:33 2014 From: lists at hilboll.de (Andreas Hilboll) Date: Fri, 10 Jan 2014 17:03:33 +0100 Subject: [Numpy-discussion] Why do weights in np.polyfit have to be 1D? Message-ID: <52D019D5.4020902@hilboll.de> Hi, in using np.polyfit (in version 1.7.1), I ran accross TypeError: expected a 1-d array for weights when trying to fit k polynomials at once (x.shape = (4, ), y.shape = (4, 136), w.shape = (4, 136)). Is there any specific reason why this is not supported? -- Andreas. From charlesr.harris at gmail.com Fri Jan 10 12:02:01 2014 From: charlesr.harris at gmail.com (Charles R Harris) Date: Fri, 10 Jan 2014 10:02:01 -0700 Subject: [Numpy-discussion] Why do weights in np.polyfit have to be 1D? In-Reply-To: <52D019D5.4020902@hilboll.de> References: <52D019D5.4020902@hilboll.de> Message-ID: On Fri, Jan 10, 2014 at 9:03 AM, Andreas Hilboll wrote: > Hi, > > in using np.polyfit (in version 1.7.1), I ran accross > > TypeError: expected a 1-d array for weights > > when trying to fit k polynomials at once (x.shape = (4, ), y.shape = (4, > 136), w.shape = (4, 136)). Is there any specific reason why this is not > supported? > The weights are applied to the rows of the design matrix, so if you have multiple weight vectors you essentially need to iterate the fit over them. Said differently, for each weight vector there is a generalized inverse and if there is a different weight vector for each column of the rhs, then there is a different generalized inverse for each column. You can't just multiply the rhs from the left by *the* inverse. The problem doesn't vectorize. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From Nicolas.Rougier at inria.fr Fri Jan 10 12:37:38 2014 From: Nicolas.Rougier at inria.fr (Nicolas Rougier) Date: Fri, 10 Jan 2014 18:37:38 +0100 Subject: [Numpy-discussion] Bug in resize of structured array (with initial size = 0) Message-ID: <722576BC-22A0-418F-A039-65F44B835784@inria.fr> Hi, I've tried to resize a record array that was first empty (on purpose, I need it) and I got the following error (while it's working for regular array). Traceback (most recent call last): File "test_resize.py", line 10, in print np.resize(V,2) File "/usr/locaL/Cellar/python/2.7.6/Frameworks/Python.framework/Versions/2.7/lib/python2.7/site-packages/numpy/core/fromnumeric.py", line 1053, in resize if not Na: return mu.zeros(new_shape, a.dtype.char) TypeError: Empty data-type I'm using numpy 1.8.0, python 2.7.6, osx 10.9.1. Can anyone confirm before I submit an issue ? 
Here is the script: V = np.zeros(0, dtype=np.float32) print V.dtype print np.resize(V,2) V = np.zeros(0, dtype=[('a', np.float32, 1)]) print V.dtype print np.resize(V,2) From jtaylor.debian at googlemail.com Fri Jan 10 14:15:26 2014 From: jtaylor.debian at googlemail.com (Julian Taylor) Date: Fri, 10 Jan 2014 20:15:26 +0100 Subject: [Numpy-discussion] Memory allocation cleanup In-Reply-To: References: Message-ID: <52D046CE.4050409@googlemail.com> On 10.01.2014 17:03, Nathaniel Smith wrote: > On Fri, Jan 10, 2014 at 9:18 AM, Julian Taylor > wrote: >> On Fri, Jan 10, 2014 at 3:48 AM, Nathaniel Smith wrote: >>> [...] >> >> For this reason and missing calloc I don't think we should use the Python >> API for data buffers just yet. Any benefits are relatively small anyway. > > It really would be nice if our data allocations would all be visible > to the tracemalloc library though, somehow. And I doubt we want to > patch *all* Python allocations to go through posix_memalign, both > because this is rather intrusive and because it would break python -X > tracemalloc. we can most likely plug aligned allocators into the python allocator to still be able to use tracemalloc but it would be python3.4 only [0], older versions would continue to use our aligned allocators directly with our own tracing. I think thats fine, I doubt the tracemalloc module will be backported to older pythons. An issue is we can't fit calloc in there without abusing one of the domains, but I think it is also not so critical to keep it. The sparseness is neat but you can lose it very quickly again too (basically on any full copy) and its not portable. > > How certain are we that we want to switch to aligned allocators in the > future? If we don't, then maybe it makes to ask python-dev for a > calloc interface; but if we do, then I doubt we can convince them to > add aligned allocation interfaces, and we'll need to ask for something > else (maybe a "null" allocator, which just notifies the python memory > tracking machinery that we allocated something ourselves?). > > It's not obvious to me why aligning data buffers is useful - can you > elaborate? There's no code simplification, because we always have to > handle the unaligned case anyway with the standard unaligned > startup/cleanup loops. And intuitively, given the existence of such > loops, alignment shouldn't matter much in practice, since the most > that shifting alignment can do is change the number of elements that > need to be handled by such loops by (SIMD alignment value / element > size). For doubles, in a buffer that has 16 byte alignment but not 32 > byte alignment, this means that worst case, we end up doing 4 > unnecessary non-SIMD operations. Its relevant when you have multiple buffer inputs. If they do not have the same alignment they can't be all peeled to a correct alignment, some of the inputs will always have be loaded unaligned. It might be that in modern x86 hardware unaligned loads might be cheaper. In Nehalem architectures using unaligned instructions have almost no penalty if the underlying memory is in fact aligned correctly, but there is still a penalty if it is not aligned. I'm not sure how relevant that is in the even newer architectures, the intel docs still recommend aligning memory though. 
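A quick way to see what the stock allocator hands back today is to look at a fresh array's
buffer address modulo 16 and 32. This is only an illustration of the alignment being
discussed (the exact result depends on the platform malloc), not part of any proposed change:

import numpy as np

a = np.zeros(1000)            # freshly allocated data buffer
addr = a.ctypes.data          # raw address of that buffer as an integer
print(addr % 16, addr % 32)   # 16-byte alignment is typical, 32-byte is not guaranteed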
[0] http://www.python.org/dev/peps/pep-0445/

From hedieh.ebrahimi at amphos21.com Wed Jan 15 05:12:42 2014
From: hedieh.ebrahimi at amphos21.com (Hedieh Ebrahimi)
Date: Wed, 15 Jan 2014 11:12:42 +0100
Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array
Message-ID:

Hello,

I am trying to use the following line of code:

fileContent=loadtxt(filePath,dtype=str)

in order to load a text file located at path=filePath into a numpy array called fileContent.

I've simplified my file for the purpose of this question, but the file looks something like this:

file content:

C:\Users\Documents\Project\mytextfile1.txt
C:\Users\Documents\Project\mytextfile2.txt
C:\Users\Documents\Project\mytextfile3.txt

I try to print my fileContent array after I read it and it looks like this:

["b'C:\\\\Users\\\\Documents\\\\Project\\\\mytextfile1.txt'"
 "b'C:\\\\Users\\\\Documents\\\\Project\\\\mytextfile2.txt'"
 "b'C:\\\\Users\\\\Documents\\\\Project\\\\mytextfile3.txt'"]

Why is this happening and how can I prevent it?

Also, if I have a line that starts like this in my file, python will crash on me. How can I fix this?

!--Timestep  ( line in file starting with !-- )

I guess it has to have something to do with the datatype. If I do not define the datatype it will be float by default, which will give me an error, and if I define the datatype as string as I did above, then I get the problems that I mentioned above.

I'd appreciate any help on how to fix this.

Thanks
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From davidmenhur at gmail.com Wed Jan 15 05:25:26 2014
From: davidmenhur at gmail.com (=?UTF-8?B?RGHPgGlk?=)
Date: Wed, 15 Jan 2014 11:25:26 +0100
Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array
In-Reply-To:
References:
Message-ID:

On 15 January 2014 11:12, Hedieh Ebrahimi wrote:
> I try to print my fileContent array after I read it and it looks like this:
>
> ["b'C:\\\\Users\\\\Documents\\\\Project\\\\mytextfile1.txt'"
> "b'C:\\\\Users\\\\Documents\\\\Project\\\\mytextfile2.txt'"
> "b'C:\\\\Users\\\\Documents\\\\Project\\\\mytextfile3.txt'"]
>
> Why is this happening and how can I prevent it?
> Also, if I have a line that starts like this in my file, python will crash
> on me. How can I fix this?

What is wrong with this case? If you are concerned about the multiple
backslashes, they are there because they are special symbols, and so they
have to be escaped (you actually want a backslash, not whatever else they
could mean).

Depending on what else is in the file, you may be better off reading the
file in pure python. Assuming there is nothing else, something like this
would work:

[line.strip() for line in open(filePath, 'r').readlines()]

/David.
-------------- next part --------------
An HTML attachment was scrubbed...
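Spelling that suggestion out a little more (the file name and the UTF-8 choice below are
only examples, and the '!--' filter matches the comment lines mentioned in the question),
one way to end up with ordinary strings in an array is:

import io
import numpy as np

# io.open works the same on python 2 and 3 and takes an explicit encoding
with io.open('pathlist.txt', encoding='utf-8') as f:
    paths = [line.strip() for line in f
             if line.strip() and not line.startswith('!--')]

fileContent = np.array(paths)   # unicode ('U') array, no b'...' reprs in it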
URL: From jtaylor.debian at googlemail.com Wed Jan 15 07:38:57 2014 From: jtaylor.debian at googlemail.com (Julian Taylor) Date: Wed, 15 Jan 2014 13:38:57 +0100 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: References: Message-ID: <52D68161.7090807@googlemail.com> On 01/15/2014 11:25 AM, Da?id wrote: > On 15 January 2014 11:12, Hedieh Ebrahimi > wrote: > > I try to print my fileContent array after I read it and it looks > like this : > > ["b'C:\\\\Users\\\\Documents\\\\Project\\\\mytextfile1.txt'" > "b'C:\\\\Users\\\\Documents\\\\Project\\\\mytextfile2.txt'" > "b'C:\\\\Users\\\\Documents\\\\Project\\\\mytextfile3.txt'"] > > Why is this happening and how can I prevent it ? > Also if I have a line that starts like this in my file, python will > crash on me. how can i fix this ? > > > What is wrong with this case? If you are concerned about the multiple > backslashes, they are there because they are special symbols, and so > they have to be escaped (you actually want a backslash, not whatever > else they could mean). > you have the bytes representation and a duplicate slash in it. Its due to unicode strings in python3. A workaround that only works for ascii is: np.loadtxt(file, dtype=bytes).astype(str) for non ascii I guess you should use python directly as numpy would also require a python loop with explicit decoding. Currently handling strings in python3 with numpy is even worse than before, you always have to go over bytes and do explicit decodes to get python strings out of ascii data. What we might need in numpy is new string xtypes specifying encodings to allow sane conversion to python3 strings without the excessive memory usage of 4 byte unicode (ucs-4). e.g. if its ascii reuse a (which currently maps to bytes) np.loadtxt(file, dtype='a') for utf 8 data: d = np.loadtxt(file, dtype='utf8') so that type(d[0]) is unicode and not bytes as is currently the case if you don't want to store your arrays in 4 bytes per character. From jtaylor.debian at googlemail.com Wed Jan 15 07:43:50 2014 From: jtaylor.debian at googlemail.com (Julian Taylor) Date: Wed, 15 Jan 2014 13:43:50 +0100 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: <52D68161.7090807@googlemail.com> References: <52D68161.7090807@googlemail.com> Message-ID: <52D68286.5060908@googlemail.com> On 01/15/2014 01:38 PM, Julian Taylor wrote: > On 01/15/2014 11:25 AM, Da?id wrote: >> On 15 January 2014 11:12, Hedieh Ebrahimi for utf 8 data: > > d = np.loadtxt(file, dtype='utf8') > ups this is a very bad example as we can't have utf8 as its variable length, but we can have ascii and ucs-2 for lower footprint encodings with proper python string integration. From chris.barker at noaa.gov Wed Jan 15 12:27:28 2014 From: chris.barker at noaa.gov (Chris Barker) Date: Wed, 15 Jan 2014 09:27:28 -0800 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: <52D68161.7090807@googlemail.com> References: <52D68161.7090807@googlemail.com> Message-ID: On Wed, Jan 15, 2014 at 4:38 AM, Julian Taylor < jtaylor.debian at googlemail.com> wrote: > > I try to print my fileContent array after I read it and it looks > > like this : > > > > ["b'C:\\\\Users\\\\Documents\\\\Project\\\\mytextfile1.txt'" > > "b'C:\\\\Users\\\\Documents\\\\Project\\\\mytextfile2.txt'" > > "b'C:\\\\Users\\\\Documents\\\\Project\\\\mytextfile3.txt'"] > > you have the bytes representation and a duplicate slash in it. 
> the duplicate slash confuses me, but I'm not running py3 to test, so... > np.loadtxt(file, dtype=bytes).astype(str) > > for non ascii I guess you should use python directly as numpy would also > require a python loop with explicit decoding. > > Currently handling strings in python3 with numpy is even worse than > before, you always have to go over bytes and do explicit decodes to get > python strings out of ascii data. > There is a MASSIVE set of threads on Python-dev about better support for ASCII and ASCII+binary data in py3 -- but in the meantime, I think we have two issue shere that could be adressed: 1) loadtext behavior -- it's a really, really common case for data files suitable for loadtxt to be ascii, but they also could be another encoding -- so loadtext should have the option to specify the encoding (default to ascii? or ascii-compatible?) The trick here is handling both these cases correctly -- clearly loadtxt is broken on py3 now. This example works fine under py2. It seems to be reading the file as bytes, then passing those bytes off to a unicode string (str in py3), without specifying an encoding (which I think is how that b' ...' junk gets in there. note that: np.loadtxt('pathlist.txt', dtype=unicode) works fine on py2 as well: In [7]: np.loadtxt('pathlist.txt', dtype=unicode) Out[7]: array([u'C:\\Users\\Documents\\Project\\mytextfile1.txt', u'C:\\Users\\Documents\\Project\\mytextfile2.txt', u'C:\\Users\\Documents\\Project\\mytextfile3.txt'], dtype=' From charlesr.harris at gmail.com Wed Jan 15 12:57:51 2014 From: charlesr.harris at gmail.com (Charles R Harris) Date: Wed, 15 Jan 2014 10:57:51 -0700 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: References: <52D68161.7090807@googlemail.com> Message-ID: On Wed, Jan 15, 2014 at 10:27 AM, Chris Barker wrote: > On Wed, Jan 15, 2014 at 4:38 AM, Julian Taylor < > jtaylor.debian at googlemail.com> wrote: > >> > I try to print my fileContent array after I read it and it looks >> > like this : >> > >> > ["b'C:\\\\Users\\\\Documents\\\\Project\\\\mytextfile1.txt'" >> > "b'C:\\\\Users\\\\Documents\\\\Project\\\\mytextfile2.txt'" >> > "b'C:\\\\Users\\\\Documents\\\\Project\\\\mytextfile3.txt'"] >> > > >> you have the bytes representation and a duplicate slash in it. >> > > the duplicate slash confuses me, but I'm not running py3 to test, so... > > >> np.loadtxt(file, dtype=bytes).astype(str) >> >> for non ascii I guess you should use python directly as numpy would also >> require a python loop with explicit decoding. >> >> Currently handling strings in python3 with numpy is even worse than >> before, you always have to go over bytes and do explicit decodes to get >> python strings out of ascii data. >> > > There is a MASSIVE set of threads on Python-dev about better support for > ASCII and ASCII+binary data in py3 -- but in the meantime, I think we have > two issue shere that could be adressed: > > 1) loadtext behavior -- it's a really, really common case for data files > suitable for loadtxt to be ascii, but they also could be another encoding > -- so loadtext should have the option to specify the encoding (default to > ascii? or ascii-compatible?) > > The trick here is handling both these cases correctly -- clearly loadtxt > is broken on py3 now. This example works fine under py2. > > It seems to be reading the file as bytes, then passing those bytes off to > a unicode string (str in py3), without specifying an encoding (which I > think is how that b' ...' > junk gets in there. 
> > note that: np.loadtxt('pathlist.txt', dtype=unicode) works fine on py2 as > well: > > In [7]: np.loadtxt('pathlist.txt', dtype=unicode) > Out[7]: > array([u'C:\\Users\\Documents\\Project\\mytextfile1.txt', > u'C:\\Users\\Documents\\Project\\mytextfile2.txt', > u'C:\\Users\\Documents\\Project\\mytextfile3.txt'], > dtype=' > which is what should happen in py3. So the internal loadtxt code must be > confusing bytes and unicode objects... > > Anyway, this should work, and there should be an obvious way to spell it. > > 2) numpy string types -- it seems numpy already has a both a string type > and unicode type -- perhaps some re-naming or better documentation is in > order: > the string type 'S10', for example, should be clearly defined as 1-byte > per character ascii-compatible. > > I'm not sure how many bytes the unicode type has, but it may make sense to > be abel to choose UCS-2 or UCS-4 -- though memory is cheep, I'd probably go > with UCS-4 and be done with it. > There was a discussion of this long ago and UCS-4 was chosen as the numpy standard. There are just too many complications that arise in supporting both. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From jtaylor.debian at googlemail.com Wed Jan 15 13:25:31 2014 From: jtaylor.debian at googlemail.com (Julian Taylor) Date: Wed, 15 Jan 2014 19:25:31 +0100 Subject: [Numpy-discussion] adding more unicode dtypes In-Reply-To: References: <52D68161.7090807@googlemail.com> Message-ID: <52D6D29B.8020509@googlemail.com> On 15.01.2014 18:57, Charles R Harris wrote: > ... > > There was a discussion of this long ago and UCS-4 was chosen as the > numpy standard. There are just too many complications that arise in > supporting both. > my guess is that that discussion was before python3 and you could still simply treat bytes == string? In python3 you need extra code to deal with arrays containing strings as the S type is interpreted as bytes which is not a string type anymore [0]. Someone on irc (I think Freddie Witherden CC'd) had a use case with huge ascii tables in numpy which now have to be stored as 4 bytes unicode on disk or decode bytes all the time. I personally don't use strings in arrays so I can neither judge the impact nor the use, but it seems to me like at least having an ascii dtype for python2<->python3 compatibility would be useful. [0] https://github.com/numpy/numpy/issues/4162 From chris.barker at noaa.gov Wed Jan 15 14:40:58 2014 From: chris.barker at noaa.gov (Chris Barker) Date: Wed, 15 Jan 2014 11:40:58 -0800 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: References: <52D68161.7090807@googlemail.com> Message-ID: On Wed, Jan 15, 2014 at 9:57 AM, Charles R Harris wrote: > There was a discussion of this long ago and UCS-4 was chosen as the numpy > standard. There are just too many complications that arise in supporting > both. > fair enough -- but loadtxt appears to be broken just the same. Any proposals for that? My proposal: loadtxt accepts an encoding argument. default is ascii -- that's what it's doing now, anyway, yes? If the file is encoded ascii, then a one-byte-per character dtype is used for text data, unless the user specifies otherwise (do they need to specify anyway?) If the file has another encoding, the the default dtype for text is unicode. Not sure about other one-byte per character encodings (e.g. latin-1) The defaults may be moot, if the loadtxt doesn't have auto-detection of text in a filie anyway. 
This all required that there be an obvious way for the user to spell the one-byte-per character dtype -- I think 'S' will do it. Note to OP: what happens if you specify 'S' for your dtype, rather than str - it works for me on py2: In [16]: np.loadtxt('pathlist.txt', dtype='S') Out[16]: array(['C:\\Users\\Documents\\Project\\mytextfile1.txt', 'C:\\Users\\Documents\\Project\\mytextfile2.txt', 'C:\\Users\\Documents\\Project\\mytextfile3.txt'], dtype='|S42') Note: this leaves us with what to pass back to the user when they index into an array of type 'S*' -- a bytes object or a unicode object (decoded as ascii). I think a unicode object, in keeping with proper py3 behavior. This would be like we currently do with, say floating point numbers: We can store/operate with 32 bit floats, but when you pass it back as a python type, you get the native python float -- 64bit. NOTE: another option is to use latin-1 all around, rather than ascii -- you may get garbage from the higher value bytes, but it won't barf on you. -Chris > Chuck > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov -------------- next part -------------- An HTML attachment was scrubbed... URL: From chris.barker at noaa.gov Wed Jan 15 15:07:35 2014 From: chris.barker at noaa.gov (Chris Barker) Date: Wed, 15 Jan 2014 12:07:35 -0800 Subject: [Numpy-discussion] adding more unicode dtypes In-Reply-To: <52D6D29B.8020509@googlemail.com> References: <52D68161.7090807@googlemail.com> <52D6D29B.8020509@googlemail.com> Message-ID: Julian -- beat me to it! On Wed, Jan 15, 2014 at 10:25 AM, Julian Taylor < jtaylor.debian at googlemail.com> wrote: > On 15.01.2014 18:57, Charles R Harris wrote: > > There was a discussion of this long ago and UCS-4 was chosen as the > > numpy standard. There are just too many complications that arise in > > supporting both. > supporting both UCS-4 and UCS-2 would be more pain than it's worth. > In python3 you need extra code to deal with arrays containing strings as > the S type is interpreted as bytes which is not a string type anymore [0]. > ouch! I was just assuming that it still was -- yes, I really think we need a one-byte-per char string type -- probably ascii, but we could do latin-1 and let the buyer beware of the higher value bytes Someone on irc (I think Freddie Witherden CC'd) had a use case with huge > ascii tables in numpy which now have to be stored as 4 bytes unicode on > disk or decode bytes all the time. > and ascii data is not the least bit rare in the science world in particular. > I personally don't use strings in arrays so I can neither judge the > impact nor the use, but it seems to me like at least having an ascii > dtype for python2<->python3 compatibility would be useful. > I think py2<->py3 compatibilty is a separate issue -- we should have this if it's a good thing to have, not because of that. And it is a good thing to have. And since this is a new thread -- regardless of the decision on this, loadtxt is broken -- we certainly should be able to parse ascii text and return something reasonable -- unicode strings would have been fine in the OPs case, if they didn't have the extra bytes to tring crap in them. 
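For ascii-clean data, one possible spelling of that round trip with the existing chararray
helpers (np.char.decode / np.char.encode) is sketched below; the file name and the latin-1
choice are just placeholders, and this is a workaround rather than the new dtype being
discussed:

import numpy as np

raw = np.loadtxt('pathlist.txt', dtype='S')   # compact one-byte-per-char storage
text = np.char.decode(raw, 'latin-1')         # element-wise decode -> 'U' array of real strings
back = np.char.encode(text, 'latin-1')        # and back to 'S' bytes for writing out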
[0] https://github.com/numpy/numpy/issues/4162 from that: The transition towards split string/bytes types in Python 3 has the unfortunate side effect of breaking the following snippet: np.array("Hello", dtype="|S").item() == "Hello" Sorry for not testing in py3, but this makes it look like the "S" dtype is one-byte per char strings, but creates a bytes object, rather than a unicode (py3 str) object. As in my other note, I think it would be better to have it return a unicode string by default. But it looks like you can still use it to store large quantities of ascii data if you want. -Chris -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov -------------- next part -------------- An HTML attachment was scrubbed... URL: From d.l.goldsmith at gmail.com Wed Jan 15 15:15:15 2014 From: d.l.goldsmith at gmail.com (David Goldsmith) Date: Wed, 15 Jan 2014 12:15:15 -0800 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array (Charles R Harris) Message-ID: On Wed, Jan 15, 2014 at 9:52 AM, wrote: > Date: Wed, 15 Jan 2014 10:57:51 -0700 > From: Charles R Harris > Subject: Re: [Numpy-discussion] using loadtxt to load a text file in > to a numpy array > To: Discussion of Numerical Python > Message-ID: > < > CAB6mnxJpvJbsoZzY0Ctk1bk+kDCUDivC9KrzYt1johU33bZOLw at mail.gmail.com> > Content-Type: text/plain; charset="iso-8859-1" > > On Wed, Jan 15, 2014 at 10:27 AM, Chris Barker >wrote: > > There was a discussion of this long ago and UCS-4 was chosen as the numpy > standard. There are just too many complications that arise in supporting > both. > > Chuck > In that case, perhaps another function altogether is called for. DG -------------- next part -------------- An HTML attachment was scrubbed... URL: From chris.barker at noaa.gov Wed Jan 15 18:42:35 2014 From: chris.barker at noaa.gov (Chris Barker) Date: Wed, 15 Jan 2014 15:42:35 -0800 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: References: Message-ID: bump back to the OP: On Wed, Jan 15, 2014 at 2:12 AM, Hedieh Ebrahimi < hedieh.ebrahimi at amphos21.com> wrote: > fileContent=loadtxt(filePath,dtype=str) > do either of these work for you? fileContent=loadtxt(filePath,dtype='S') or fileContent=loadtxt(filePath,dtype=np.unicode) -Chris -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov -------------- next part -------------- An HTML attachment was scrubbed... URL: From jtaylor.debian at googlemail.com Wed Jan 15 18:58:25 2014 From: jtaylor.debian at googlemail.com (Julian Taylor) Date: Thu, 16 Jan 2014 00:58:25 +0100 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: References: Message-ID: <52D720A1.9060100@googlemail.com> On 16.01.2014 00:42, Chris Barker wrote: > bump back to the OP: > On Wed, Jan 15, 2014 at 2:12 AM, Hedieh Ebrahimi > > wrote: > > fileContent=loadtxt(filePath,dtype=str) > > > do either of these work for you? > > fileContent=loadtxt(filePath,dtype='S') this gives you bytes not a string, this can only be fixed by adding new dtypes, see the other thread about that. 
> > or > > fileContent=loadtxt(filePath,dtype=np.unicode) > same as using python str you get the output originally posted, bytes representation with duplicated slashes. This is a bug in loadtxt we need to fix independent of adding new dtypes. It is also independent of the encoding of the text file, loadtxt doesn't seem to be able to open other encodings than ascii/utf8 at all and has no option to tell it what the file is. as mentioned in my earlier mail this works for ascii: np.loadtxt('test.txt',dtype=bytes).astype(str) or of course looping and decoding explicitly. From oscar.j.benjamin at gmail.com Wed Jan 15 19:06:22 2014 From: oscar.j.benjamin at gmail.com (Oscar Benjamin) Date: Thu, 16 Jan 2014 00:06:22 +0000 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: <52D68161.7090807@googlemail.com> References: <52D68161.7090807@googlemail.com> Message-ID: On 15 January 2014 12:38, Julian Taylor wrote: > On 01/15/2014 11:25 AM, Da?id wrote: >> On 15 January 2014 11:12, Hedieh Ebrahimi > > wrote: >> >> I try to print my fileContent array after I read it and it looks >> like this : >> >> ["b'C:\\\\Users\\\\Documents\\\\Project\\\\mytextfile1.txt'" >> "b'C:\\\\Users\\\\Documents\\\\Project\\\\mytextfile2.txt'" >> "b'C:\\\\Users\\\\Documents\\\\Project\\\\mytextfile3.txt'"] >> >> Why is this happening and how can I prevent it ? >> Also if I have a line that starts like this in my file, python will >> crash on me. how can i fix this ? >> >> >> What is wrong with this case? If you are concerned about the multiple >> backslashes, they are there because they are special symbols, and so >> they have to be escaped (you actually want a backslash, not whatever >> else they could mean). >> > > you have the bytes representation and a duplicate slash in it. > Its due to unicode strings in python3. So why does the array store the repr of a bytes string? Surely that's just a loadtxt bug and no one is actually depending on that behaviour. Oscar From chris.barker at noaa.gov Wed Jan 15 20:10:07 2014 From: chris.barker at noaa.gov (Chris Barker) Date: Wed, 15 Jan 2014 17:10:07 -0800 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: <52D720A1.9060100@googlemail.com> References: <52D720A1.9060100@googlemail.com> Message-ID: On Wed, Jan 15, 2014 at 3:58 PM, Julian Taylor < jtaylor.debian at googlemail.com> wrote: > > fileContent=loadtxt(filePath,dtype='S') > > this gives you bytes not a string, this can only be fixed by adding new > dtypes, or changing the behavior or dtype 'S', but yes, the other thread. But the OP's problem was not that s/he got bytes, but that the content was wrong -- he got the repr of bytes in a py3 string. - > same as using python str you get the output originally posted, bytes > representation with duplicated slashes. > This is a bug in loadtxt we need to fix independent of adding new dtypes. > yup. > It is also independent of the encoding of the text file, loadtxt doesn't > seem to be able to open other encodings than ascii/utf8 at all and has > no option to tell it what the file is. > a key missing feature -- and I doubt it does utf-8 right, either. as mentioned in my earlier mail this works for ascii: > > np.loadtxt('test.txt',dtype=bytes).astype(str) > thanks -- I wasn't sure what astype would do for that. and what are you getting then, unicode or ascii? Thanks, -Chris -- Christopher Barker, Ph.D. 
Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov -------------- next part -------------- An HTML attachment was scrubbed... URL: From oscar.j.benjamin at gmail.com Thu Jan 16 05:43:05 2014 From: oscar.j.benjamin at gmail.com (Oscar Benjamin) Date: Thu, 16 Jan 2014 10:43:05 +0000 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: References: <52D68161.7090807@googlemail.com> Message-ID: <20140116104303.GA11119@gmail.com> On Wed, Jan 15, 2014 at 11:40:58AM -0800, Chris Barker wrote: > On Wed, Jan 15, 2014 at 9:57 AM, Charles R Harris > wrote: > > > > There was a discussion of this long ago and UCS-4 was chosen as the numpy > > standard. There are just too many complications that arise in supporting > > both. > > > > fair enough -- but loadtxt appears to be broken just the same. Any > proposals for that? > > My proposal: > > loadtxt accepts an encoding argument. > > default is ascii -- that's what it's doing now, anyway, yes? No it's loading the file reading a line, encoding the line with latin-1, and then putting the repr of the resulting byte-string as a unicode string into a UCS-4 array (dtype=' > If the file is encoded ascii, then a one-byte-per character dtype is used > for text data, unless the user specifies otherwise (do they need to specify > anyway?) > > If the file has another encoding, the the default dtype for text is unicode. That's a silly idea. There's already the dtype='S' for ascii that will give one byte per character. However numpy.loadtxt(dtype='S') doesn't actually use ascii IIUC. It loads the file as text with the default system encoding, encodes the text with latin-1 and stores the resulting bytes into a dtype='S' array. I think it should just open the file in binary read the bytes and store them in the dtype='S' array. The current behaviour strikes me as a hangover from the Python 2.x 8-bit text model. > Not sure about other one-byte per character encodings (e.g. latin-1) > > The defaults may be moot, if the loadtxt doesn't have auto-detection of > text in a filie anyway. > > This all required that there be an obvious way for the user to spell the > one-byte-per character dtype -- I think 'S' will do it. They should use 'S' and not encoding='ascii'. If the user provides an encoding then it should be used to open the file and decode it to unicode resulting in a dtype='U' array. (Python 3 handles this all for you). > Note to OP: what happens if you specify 'S' for your dtype, rather than str > - it works for me on py2: > > In [16]: np.loadtxt('pathlist.txt', dtype='S') > Out[16]: > array(['C:\\Users\\Documents\\Project\\mytextfile1.txt', > 'C:\\Users\\Documents\\Project\\mytextfile2.txt', > 'C:\\Users\\Documents\\Project\\mytextfile3.txt'], > dtype='|S42') It only seems to work because you're using ascii data. On Py3 you'll have byte strings corresponding to the text in the file encoded as latin-1 (regardless of the encoding used in the file). loadtxt doesn't open the file in binary or specify an encoding so the file will be opened with the system default encoding as determined by the standard builtins.open. The resulting text is decoded according to that encoding and then reencoded as latin-1 which will corrupt the binary form of the data if the system encoding is not compatible with latin-1 (e.g. ascii and latin-1 will work but utf-8 will not). 
> > Note: this leaves us with what to pass back to the user when they index > into an array of type 'S*' -- a bytes object or a unicode object (decoded > as ascii). I think a unicode object, in keeping with proper py3 behavior. > This would be like we currently do with, say floating point numbers: > > We can store/operate with 32 bit floats, but when you pass it back as a > python type, you get the native python float -- 64bit. > > NOTE: another option is to use latin-1 all around, rather than ascii -- you > may get garbage from the higher value bytes, but it won't barf on you. I guess you're alluding to the idea that reading/writing files as latin-1 will pretend to seamlessly decode/encode any bytes preserving binary data in any round-trip. This concept is already broken if you intend to do any processing, indexing or slicing of the array. Additionally the current loadtxt behaviour fails to achieve this round-trip even for the 'S' dtype even if you don't do any processing: $ ipython3 Python 3.2.3 (default, Sep 25 2013, 18:22:43) Type "copyright", "credits" or "license" for more information. IPython 0.12.1 -- An enhanced Interactive Python. ? -> Introduction and overview of IPython's features. %quickref -> Quick reference. help -> Python's own help system. object? -> Details about 'object', use 'object??' for extra details. In [1]: with open('tmp.py', 'w') as fout: # Implicitly utf-8 here fout.write('??\n' * 3) ...: In [2]: import numpy In [3]: a = numpy.loadtxt('tmp.py') ValueError: could not convert string to float: b'\xc5\xe5' In [4]: a = numpy.loadtxt('tmp.py', dtype='S') In [5]: a Out[5]: array([b'\xc5\xe5', b'\xc5\xe5', b'\xc5\xe5'], dtype='|S2') In [6]: a.tostring() Out[6]: b'\xc5\xe5\xc5\xe5\xc5\xe5' In [7]: with open('tmp.py', 'rb') as fin: ...: text = fin.read() ...: In [8]: text Out[8]: b'\xc3\x85\xc3\xa5\n\xc3\x85\xc3\xa5\n\xc3\x85\xc3\xa5\n' This is a mess. I don't know about how to handle backwards compatibility but the sensible way to handle this in *both* Python 2 and 3 is that dtype='S' opens the file in binary, reads byte strings, and stores them in an array with dtype='S'. dtype='U' should open the file as text with an encoding argument (or system default if not supplied), decode the bytes and create an array with dtype='U'. The only reasonable difference between Python 2 and 3 is which of these two behaviours dtype=str should do. Oscar From chris.barker at noaa.gov Thu Jan 16 12:08:38 2014 From: chris.barker at noaa.gov (Chris Barker) Date: Thu, 16 Jan 2014 09:08:38 -0800 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: <20140116104303.GA11119@gmail.com> References: <52D68161.7090807@googlemail.com> <20140116104303.GA11119@gmail.com> Message-ID: On Thu, Jan 16, 2014 at 2:43 AM, Oscar Benjamin wrote: > > My proposal: > > > > loadtxt accepts an encoding argument. > > > > default is ascii -- that's what it's doing now, anyway, yes? > > No it's loading the file reading a line, encoding the line with latin-1, > and > then putting the repr of the resulting byte-string as a unicode string > into a > UCS-4 array (dtype=' If the file is encoded ascii, then a one-byte-per character dtype is used > > for text data, unless the user specifies otherwise (do they need to > specify > > anyway?) > > > > If the file has another encoding, the the default dtype for text is > unicode. > > That's a silly idea. There's already the dtype='S' for ascii that will give > one byte per character. 
> Except that 'S' is being translated to a bytes object, and in py3 bytes is not really text -- see the other thread. However numpy.loadtxt(dtype='S') doesn't actually use ascii IIUC. It loads > the file as text with the default system encoding, not such a bad idea in principle, but I think with scientific data files in particular, the file was just as likely generated on a different system, so system settings should be avoided. My guess is that a large fraction of systems have system encodings that are ascii-compatible, so we'll get away with this most of the time, but explicit is better than implicit, and all that. encodes the text with > latin-1 and stores the resulting bytes into a dtype='S' array. I think it > should just open the file in binary read the bytes and store them in the > dtype='S' array. The current behaviour strikes me as a hangover from the > Python 2.x 8-bit text model. > not sure it's even that -- I suspect it's a broken attempt to match the py3 text model... > Not sure about other one-byte per character encodings (e.g. latin-1) The > defaults may be moot, if the loadtxt doesn't have auto-detection of text in > a filie anyway. > I'm not suggesting auto0detection, but I am suggesting the ability to specify an encoding, and in that case, we need a default, and I don't think it should be the system encoding. > This all required that there be an obvious way for the user to spell the > > one-byte-per character dtype -- I think 'S' will do it. > > They should use 'S' and not encoding='ascii'. that is stating implicitly that 'S' is ascii-compatible, but it gets traslated to the py3 bytes type, which the pyton dev folks REALLY want to mean "arbitrary bytes", rather than 'ascii text'. practically, it means you need to decode it to use it as text -- compare with a string, etc... If the user provides an encoding > then it should be used to open the file and decode it to unicode resulting > in > a dtype='U' array. (Python 3 handles this all for you). I think it may be an important use case to pull ansi-compatible text out of a file and put it into a 1-byte per character dtype (i.,e 'S'). Folks don't necessarily want or need 4 bytes per charater. In practice this probably only makes sense it the file is in an ascii-compatible encoding anyway, but I like the idea of keeping the file encoding and the dtype independent. It only seems to work because you're using ascii data. > (or latin-1?) well, yes, but that was the OP's example. though it was file names, so he'd probably ultimately want them as py3 strings... > which will > corrupt the binary form of the data if the system encoding is not > compatible > with latin-1 (e.g. ascii and latin-1 will work but utf-8 will not). a good reason not to use the system default encoding! > NOTE: another option is to use latin-1 all around, rather than ascii -- > you > > may get garbage from the higher value bytes, but it won't barf on you. > > I guess you're alluding to the idea that reading/writing files as latin-1 > will > pretend to seamlessly decode/encode any bytes preserving binary data in any > round-trip. yes, exactly -- a practical common use case is that there is non-ascii compliant bytes in a data stream, but that the use-case doesn't care what they are. If you use ascii, then you get exceptions you don't need to get. > This concept is already broken if you intend to do any processing, > indexing or slicing of the array. 
no it's not -- latin-1 is ascii-compatible (as is utf-8), so a lot of processing will work fine -- splitting on whitespace or whatever, etc. yes, indexing can go to heck if you have utf-8 or, of course, non-ascii compatible encoding -- but that's never going to work without specifying an encoding anyway. > Additionally the current loadtxt behaviour > fails to achieve this round-trip even for the 'S' dtype even if you don't > do > any processing: > right -- I think we agree that it's broken now. This is a mess. I don't know about how to handle backwards compatibility but > the sensible way to handle this in *both* Python 2 and 3 is that dtype='S' > opens the file in binary, reads byte strings, and stores them in an array > with > dtype='S'. dtype='U' should open the file as text with an encoding argument > (or system default if not supplied), decode the bytes and create an array > with > dtype='U'. agreed -- except for the system encoding part.... > The only reasonable difference between Python 2 and 3 is which of > these two behaviours dtype=str should do. well, str is a py3 string in py3 -- so it should be dtype 'U'. Personally, I avoid using the native types for dtype arguemtns anyway, so users should use: dtype=np.unicode or dtype=np.string0 (or np.string_) -- or???? How do you spell the dtype that 'S' give you???? -Chris -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov -------------- next part -------------- An HTML attachment was scrubbed... URL: From shoyer at climate.com Thu Jan 16 19:28:53 2014 From: shoyer at climate.com (Stephan Hoyer) Date: Thu, 16 Jan 2014 16:28:53 -0800 Subject: [Numpy-discussion] Allowing slices as arguments for ndarray.take Message-ID: There was a discussion last year about slicing along specified axes in numpy arrays: http://mail.scipy.org/pipermail/numpy-discussion/2012-April/061632.html I'm finding that slicing along specified axes is a common task for me when writing code to manipulate N-D arrays. The method ndarray.take basically does what I would like, except it cannot take slice objects as argument. In the mean-time, I've written a little helper function: def take(a, indices, axis): index = [slice(None)] * a.ndim index[axis] = indices return a[tuple(index)] Is there support for allowing the `indices` argument to `take` to take Python slice objects as well as arrays? That would alleviate the need for my helper function. Cheers, Stephan -------------- next part -------------- An HTML attachment was scrubbed... URL: From sebastian at sipsolutions.net Fri Jan 17 04:12:00 2014 From: sebastian at sipsolutions.net (sebastian) Date: Fri, 17 Jan 2014 09:12:00 +0000 Subject: [Numpy-discussion] Allowing slices as arguments for ndarray.take In-Reply-To: References: Message-ID: <8639a503927ee75ee1e116b2e7e7b814@sipsolutions.net> On 2014-01-17 00:28, Stephan Hoyer wrote: > There was a discussion last year about slicing along specified axes in > numpy arrays: > http://mail.scipy.org/pipermail/numpy-discussion/2012-April/061632.html > [1] > > I'm finding that slicing along specified axes is a common task for me > when writing code to manipulate N-D arrays. > > The method ndarray.take basically does what I would like, except it > cannot take slice objects as argument. In the mean-time, I've written > a little helper function: > > def take(a, indices, axis): > ? ? index = [slice(None)] * a.ndim > ? ? 
index[axis] = indices > ? ? return a[tuple(index)] > > Is there support for allowing the `indices` argument to `take` to take > Python slice objects as well as arrays? That would alleviate the need > for my helper function. > > Cheers, > Stephan > > > > Links: > ------ > [1] > http://mail.scipy.org/pipermail/numpy-discussion/2012-April/061632.html > > Hey, Personally, I am not sure that generalizing take is the right approach. Take is currently orthogonal to indexing implementation wise and has some smaller differences. Given a good idea for the api, I think a new function maybe better. Since I am not on a computer at the moment I did not check the old discussions though. - Sebastian _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From jtaylor.debian at googlemail.com Fri Jan 17 04:38:15 2014 From: jtaylor.debian at googlemail.com (Julian Taylor) Date: Fri, 17 Jan 2014 10:38:15 +0100 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: References: <52D68161.7090807@googlemail.com> <20140116104303.GA11119@gmail.com> Message-ID: This thread is getting a little out of hand which is my fault for initially mixing different topics in one mail, so let me try to summarize: We have three issues here: - a loadtxt bug when loading strings in python3 this has nothing to do with encodings or dtypes it is a bug that should be fixed. Not more not less. the fix is probably removing a repr() somewhere and converting the data to unicode as the user requested as str == unicode in py3, this is the normal change you must account for when migrating to p3. - no possibility to specify the encoding of a file in loadtxt this is a missing feature, currently it uses the system default which is good and should stay that way. It is only missing an option to tell it to treat it differently. There should be little debate about changing the default, especially not using latin1. The system default exists for a good reason. Note on linux it is UTF-8 which is a good choice. I'm not familiar with windows but all programs should at least have the option to use UTF-8 as output too. This has nothing to do with indexing or any kind of processing of the numpy arrays. The fix should be trivial to do, just add an encoding keyword argument and pass it on to python. The workaround should be passing a file object to loadtxt instead of a file name. Python file objects already have the encoding argument. - inconvenience in dealing with strings in python 3. bytes are not strings in python3 which means ascii data is either a byte array which can be inconvenient to deal with or 4 byte unicode which wastes space. A proposal to fix this would be to add a one or two byte dtype with a specific encoding that behaves similar to bytes but converts to string when outputting to python for comparisons etc. For backward compatibility we *cannot* change S. Maybe we could change the meaning of 'a' but it would be safer to add a new dtype, possibly 'S' can be deprecated in favor of 'B' when we have a specific encoding dtype. The main issue is probably: is it worth it and who does the work? -------------- next part -------------- An HTML attachment was scrubbed... 
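A minimal sketch of the file-object workaround from the second point above; the file name
and the latin-1 encoding are stand-ins, and numeric columns are assumed here since the
string-dtype path still has the repr bug from the first point:

import io
import numpy as np

# the decode happens in the file object, with an explicit encoding
with io.open('table.txt', encoding='latin-1') as f:
    values = np.loadtxt(f)   # loadtxt only iterates over the already-decoded lines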
URL: From pav at iki.fi Fri Jan 17 05:59:27 2014 From: pav at iki.fi (Pauli Virtanen) Date: Fri, 17 Jan 2014 10:59:27 +0000 (UTC) Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array References: <52D68161.7090807@googlemail.com> <20140116104303.GA11119@gmail.com> Message-ID: Julian Taylor googlemail.com> writes: [clip] > - inconvenience in dealing with strings in python 3. > > bytes are not strings in python3 which means ascii data is either a byte > array which can be inconvenient to deal with or 4 byte unicode which > wastes space. > > A proposal to fix this would be to add a one or two byte dtype with a specific > encoding that behaves similar to bytes but converts to string when outputting > to python for comparisons etc. > > For backward compatibility we *cannot* change S. Maybe we could change > the meaning of 'a' but it would be safer to add a new dtype, possibly > 'S' can be deprecated in favor of 'B' when we have a specific encoding dtype. > > The main issue is probably: is it worth it and who does the work? I don't think this is a good idea: the bytes vs. unicode separation in Python 3 exists for a good reason. If unicode is not needed, why not just use the bytes data type throughout the program? (Also, assuming that ASCII is in general good for text-format data is quite US-centric.) Christopher Barker wrote: > > How do you spell the dtype that 'S' give you???? > 'S' is bytes. dtype='S', dtype=bytes, and dtype=np.bytes_ are all equivalent. -- Pauli Virtanen From pav at iki.fi Fri Jan 17 07:17:28 2014 From: pav at iki.fi (Pauli Virtanen) Date: Fri, 17 Jan 2014 12:17:28 +0000 (UTC) Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array References: <52D68161.7090807@googlemail.com> <20140116104303.GA11119@gmail.com> Message-ID: Julian Taylor googlemail.com> writes: [clip] > For backward compatibility we *cannot* change S. > Maybe we could change the meaning of 'a' but it would be safer > to add a new dtype, possibly 'S' can be deprecated in favor > of 'B' when we have a specific encoding dtype. Note that the rename 'S' -> 'B' was not done in the Python 3 port, because 'B' already denotes uint8, >>> np.array([1], dtype='B') array([1], dtype=uint8) -- Pauli Virtanen From josef.pktd at gmail.com Fri Jan 17 07:35:42 2014 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Fri, 17 Jan 2014 07:35:42 -0500 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: References: <52D68161.7090807@googlemail.com> <20140116104303.GA11119@gmail.com> Message-ID: On Fri, Jan 17, 2014 at 5:59 AM, Pauli Virtanen wrote: > Julian Taylor googlemail.com> writes: > [clip] >> - inconvenience in dealing with strings in python 3. >> >> bytes are not strings in python3 which means ascii data is either a byte >> array which can be inconvenient to deal with or 4 byte unicode which >> wastes space. >> >> A proposal to fix this would be to add a one or two byte dtype with a specific >> encoding that behaves similar to bytes but converts to string when outputting >> to python for comparisons etc. >> >> For backward compatibility we *cannot* change S. Maybe we could change >> the meaning of 'a' but it would be safer to add a new dtype, possibly >> 'S' can be deprecated in favor of 'B' when we have a specific encoding dtype. >> >> The main issue is probably: is it worth it and who does the work? > > I don't think this is a good idea: the bytes vs. unicode separation in > Python 3 exists for a good reason. 
If unicode is not needed, why not just > use the bytes data type throughout the program? > > (Also, assuming that ASCII is in general good for text-format data is > quite US-centric.) > > Christopher Barker wrote: >> >> How do you spell the dtype that 'S' give you???? >> > > 'S' is bytes. > > dtype='S', dtype=bytes, and dtype=np.bytes_ are all equivalent. 'S' is bytes, is a feature not a bug, I thought. I didn't pay much attention to the two threads because I don't use loadtxt. But I think the same issue is in genfromtxt, recfromtxt, ... I don't have a lot of experience with python 3, but in the initial python 3 compatibility conversion of statsmodels, I followed numpy's lead and used the numpy helper functions and converted all strings to bytes. Everything loaded by genfromtxt or similar reades bytes, files are opened with "rb". In most places our code doesn't really care, as long as numpy.unique, and similar work either way. But in some cases there were some strange things working with bytes. There are also some weirder cases with non-ASCII "strings", and I also have problems in interactive work when the interpreter encoding interfers. Also maybe related, our Stata data file reader genfromdta handles cyrillic languages (Russian IIRC) in the same way as ascii, I don't know the details but Skipper fixed a bug so it works. I'm pretty sure interaction statsmodels/pandas/patsy has problems/bugs with non-ASCII support in variable names, but my impression is that string data as bytes causes few problems. Josef > > -- > Pauli Virtanen > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From oscar.j.benjamin at gmail.com Fri Jan 17 07:44:16 2014 From: oscar.j.benjamin at gmail.com (Oscar Benjamin) Date: Fri, 17 Jan 2014 12:44:16 +0000 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: References: <52D68161.7090807@googlemail.com> <20140116104303.GA11119@gmail.com> Message-ID: <20140117124414.GA2253@gmail.com> On Fri, Jan 17, 2014 at 10:59:27AM +0000, Pauli Virtanen wrote: > Julian Taylor googlemail.com> writes: > [clip] > > - inconvenience in dealing with strings in python 3. > > > > bytes are not strings in python3 which means ascii data is either a byte > > array which can be inconvenient to deal with or 4 byte unicode which > > wastes space. It doesn't waste that much space in practice. People have been happily using Python 2's 4-byte-per-char unicode string on wide builds (e.g. on Linux) for years in all kinds of text heavy applications. $ python2 Python 2.7.3 (default, Sep 26 2013, 20:03:06) [GCC 4.6.3] on linux2 Type "help", "copyright", "credits" or "license" for more information. >>> import sys >>> sys.getsizeof(u'a' * 1000) 4052 > > For backward compatibility we *cannot* change S. Do you mean to say that loadtxt cannot be changed from decoding using system default, splitting on newlines and whitespace and then encoding the substrings as latin-1? An obvious improvement would be along the lines of what Chris Barker suggested: decode as latin-1, do the processing and then reencode as latin-1. Or just open the file in binary and use the bytes string methods. Either of these has the advantage that it won't corrupt the binary representation of the data - assuming ascii-compatible whitespace and newlines (e.g. utf-8 and most currently used 8-bit encodings). 
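Both of those round-trip-preserving options are easy to sketch for a hypothetical
rectangular, ascii-compatible table (the file name is made up); neither mixes the system
codec with a hard-coded re-encode:

import numpy as np

# (a) treat the file as latin-1 text end to end
with open('table.txt', encoding='latin-1') as f:
    rows = [line.split() for line in f]

# (b) stay in bytes the whole way through
with open('table.txt', 'rb') as f:
    brows = [line.split() for line in f]

arr = np.array(rows)     # '<U..' array
barr = np.array(brows)   # '|S..' array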
In the situations where the current behaviour differs from this the user *definitely* has mojibake. Can anyone possibly be relying on that (except in the sense of having implemented a workaround that would break if it was fixed)? > > Maybe we could change > > the meaning of 'a' but it would be safer to add a new dtype, possibly > > 'S' can be deprecated in favor of 'B' when we have a specific encoding dtype. > > > > The main issue is probably: is it worth it and who does the work? > > I don't think this is a good idea: the bytes vs. unicode separation in > Python 3 exists for a good reason. If unicode is not needed, why not just > use the bytes data type throughout the program? Or on the other hand, why try to use bytes when you're clearly dealing with text data? If you're concerned about memory usage why not use Python strings? As of CPython 3.3 strings consisting only of latin-1 characters are stored with 1 char-per-byte. This is only really sensible for immutable strings with an opaque memory representation though so numpy shouldn't try to copy it. > (Also, assuming that ASCII is in general good for text-format data is > quite US-centric.) Indeed. The original use case in this thread was a text file containing file paths. In most of the world there's a reasonable chance that file paths can contain non-ascii characters. The current behaviour of decoding using one codec and encoding with latin-1 would, in many cases, break if the user tried to e.g. open() a file using a byte-string from the array. Oscar From aldcroft at head.cfa.harvard.edu Fri Jan 17 08:09:00 2014 From: aldcroft at head.cfa.harvard.edu (Aldcroft, Thomas) Date: Fri, 17 Jan 2014 08:09:00 -0500 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: References: <52D68161.7090807@googlemail.com> <20140116104303.GA11119@gmail.com> Message-ID: On Fri, Jan 17, 2014 at 5:59 AM, Pauli Virtanen wrote: > Julian Taylor googlemail.com> writes: > [clip] > > - inconvenience in dealing with strings in python 3. > > > > bytes are not strings in python3 which means ascii data is either a byte > > array which can be inconvenient to deal with or 4 byte unicode which > > wastes space. > > > > A proposal to fix this would be to add a one or two byte dtype with a > specific > > encoding that behaves similar to bytes but converts to string when > outputting > > to python for comparisons etc. > > > > For backward compatibility we *cannot* change S. Maybe we could change > > the meaning of 'a' but it would be safer to add a new dtype, possibly > > 'S' can be deprecated in favor of 'B' when we have a specific encoding > dtype. > > > > The main issue is probably: is it worth it and who does the work? > > I don't think this is a good idea: the bytes vs. unicode separation in > Python 3 exists for a good reason. If unicode is not needed, why not just > use the bytes data type throughout the program? > I've been playing around with porting a stack of analysis libraries to Python 3 and this is a very timely thread and comment. What I discovered right away is that all the string data coming from binary HDF5 files show up (as expected) as 'S' type,, but that trying to make everything actually work in Python 3 without converting to 'U' is a big mess of whack-a-mole. Yes, it's possible to change my libraries to use bytestring literals everywhere, but the Python 3 user experience becomes horrible because to interact with the data all downstream applications need to use bytestring literals everywhere. E.g. 
doing a simple filter like `string_array == 'foo'` doesn't work, and this will break all existing code when trying to run in Python 3. And every time you try to print something it has this horrible "b" in front. Ugly, and it just won't work well in the end. Following the excellent advice at http://nedbatchelder.com/text/unipain.html, I've come to the conclusion that the only way to support Python 3 is to bite the bullet and do the "unicode sandwich". That is to say convert all external bytestring values to 'U' arrays for internal (and user) manipulation, and back to 'S' for delivery to files / network etc. This is a pain and very inefficient, but at least the the Python 3 user experience is natural and pleasant. I figure if you are manipulating anything less than ~Gb of text data then it won't be a disaster. The upshot from this is that I would be very much in favor of solutions that address the inefficiency issue of using 4 bytes / character in the common use-case of pure-ASCII strings. Right now this is the single biggest issue I see for migrating to Python 3. Otherwise making the code python 2 / 3 compatible wasn't too difficult. - Tom > > (Also, assuming that ASCII is in general good for text-format data is > quite US-centric.) > > Christopher Barker wrote: > > > > How do you spell the dtype that 'S' give you???? > > > > 'S' is bytes. > > dtype='S', dtype=bytes, and dtype=np.bytes_ are all equivalent. > > -- > Pauli Virtanen > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From jtaylor.debian at googlemail.com Fri Jan 17 08:10:19 2014 From: jtaylor.debian at googlemail.com (Julian Taylor) Date: Fri, 17 Jan 2014 14:10:19 +0100 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: <20140117124414.GA2253@gmail.com> References: <52D68161.7090807@googlemail.com> <20140116104303.GA11119@gmail.com> <20140117124414.GA2253@gmail.com> Message-ID: On Fri, Jan 17, 2014 at 1:44 PM, Oscar Benjamin wrote: > On Fri, Jan 17, 2014 at 10:59:27AM +0000, Pauli Virtanen wrote: > > Julian Taylor googlemail.com> writes: > > [clip] > > > > For backward compatibility we *cannot* change S. > > Do you mean to say that loadtxt cannot be changed from decoding using > system > default, splitting on newlines and whitespace and then encoding the > substrings > as latin-1? > unicode dtypes have nothing to do with the loadtxt issue. They are not related. > > An obvious improvement would be along the lines of what Chris Barker > suggested: decode as latin-1, do the processing and then reencode as > latin-1. > no, the right solution is to add an encoding argument. Its a 4 line patch for python2 and a 2 line patch for python3 and the issue is solved, I'll file a PR later. No latin1 de/encoding is required for anything, I don't know why you would want do to that in this context. Does opening latin1 files even work with current loadtxt? It currently uses UTF-8 which is to my knowledge not compatible with latin1. -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From freddie at witherden.org Fri Jan 17 08:18:38 2014 From: freddie at witherden.org (Freddie Witherden) Date: Fri, 17 Jan 2014 13:18:38 +0000 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: References: <52D68161.7090807@googlemail.com> <20140116104303.GA11119@gmail.com> Message-ID: <52D92DAE.5020409@witherden.org> On 17/01/14 13:09, Aldcroft, Thomas wrote: > I've been playing around with porting a stack of analysis libraries to > Python 3 and this is a very timely thread and comment. What I > discovered right away is that all the string data coming from binary > HDF5 files show up (as expected) as 'S' type,, but that trying to make > everything actually work in Python 3 without converting to 'U' is a big > mess of whack-a-mole. > > Yes, it's possible to change my libraries to use bytestring literals > everywhere, but the Python 3 user experience becomes horrible because to > interact with the data all downstream applications need to use > bytestring literals everywhere. E.g. doing a simple filter like > `string_array == 'foo'` doesn't work, and this will break all existing > code when trying to run in Python 3. And every time you try to print > something it has this horrible "b" in front. Ugly, and it just won't > work well in the end. In terms of HDF5 it is interesting to look at how h5py -- which has to go between NumPy types and HDF5 conventions -- handles the problem as described here: http://www.h5py.org/docs/topics/strings.html which IMHO got it about right. Regards, Freddie. -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 836 bytes Desc: OpenPGP digital signature URL: From jtaylor.debian at googlemail.com Fri Jan 17 08:31:32 2014 From: jtaylor.debian at googlemail.com (Julian Taylor) Date: Fri, 17 Jan 2014 14:31:32 +0100 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: References: <52D68161.7090807@googlemail.com> <20140116104303.GA11119@gmail.com> <20140117124414.GA2253@gmail.com> Message-ID: On Fri, Jan 17, 2014 at 2:10 PM, Julian Taylor < jtaylor.debian at googlemail.com> wrote: > On Fri, Jan 17, 2014 at 1:44 PM, Oscar Benjamin < > oscar.j.benjamin at gmail.com> wrote:... > ... > No latin1 de/encoding is required for anything, I don't know why you would > want do to that in this context. > Does opening latin1 files even work with current loadtxt? > It currently uses UTF-8 which is to my knowledge not compatible with > latin1. > just tried it, doesn't work so there is nothing we need to keep working: f = codecs.open('test.txt', 'wt', encoding='latin1') f.write(u'??\n') f.close() np.loadtxt('test.txt') ValueError: could not convert string to float: ?? or UnicodeDecodeError: if provided with unicode dtype there are a couple more unicode issues in the test loading (it converts to bytes even if unicode is requested), but they look simple to fix. -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From oscar.j.benjamin at gmail.com Fri Jan 17 08:40:34 2014 From: oscar.j.benjamin at gmail.com (Oscar Benjamin) Date: Fri, 17 Jan 2014 13:40:34 +0000 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: References: <52D68161.7090807@googlemail.com> <20140116104303.GA11119@gmail.com> <20140117124414.GA2253@gmail.com> Message-ID: <20140117134033.GB2253@gmail.com> On Fri, Jan 17, 2014 at 02:10:19PM +0100, Julian Taylor wrote: > On Fri, Jan 17, 2014 at 1:44 PM, Oscar Benjamin > wrote: > > > On Fri, Jan 17, 2014 at 10:59:27AM +0000, Pauli Virtanen wrote: > > > Julian Taylor googlemail.com> writes: > > > [clip] > > > > > > > For backward compatibility we *cannot* change S. > > > > Do you mean to say that loadtxt cannot be changed from decoding using > > system > > default, splitting on newlines and whitespace and then encoding the > > substrings > > as latin-1? > > > > unicode dtypes have nothing to do with the loadtxt issue. They are not > related. I'm talking about what loadtxt does with the 'S' dtype. As I showed earlier, if the file is not encoded as ascii or latin-1 then the byte strings are corrupted (see below). This is because loadtxt opens the file with the default system encoding (by not explicitly specifying an encoding): https://github.com/numpy/numpy/blob/master/numpy/lib/npyio.py#L732 It then processes each line with asbytes() which encodes them as latin-1: https://github.com/numpy/numpy/blob/master/numpy/lib/npyio.py#L784 https://github.com/numpy/numpy/blob/master/numpy/compat/py3k.py#L28 Being an English speaker I don't normally use non-ascii characters in filenames but my system (Ubuntu Linux) still uses utf-8 rather than latin-1 or (and rightly so!). > > > > An obvious improvement would be along the lines of what Chris Barker > > suggested: decode as latin-1, do the processing and then reencode as > > latin-1. > > > > no, the right solution is to add an encoding argument. > Its a 4 line patch for python2 and a 2 line patch for python3 and the issue > is solved, I'll file a PR later. What is the encoding argument for? Is it to be used to decode, process the text and then re-encode it for an array with dtype='S'? Note that there are two encodings: one for reading from the file and one for storing in the array. The former describes the content of the file and the latter will be used if I extract a byte-string from the array and pass it to any Python API. > No latin1 de/encoding is required for anything, I don't know why you would > want do to that in this context. > Does opening latin1 files even work with current loadtxt? It's the only encoding that works for dtype='S'. > It currently uses UTF-8 which is to my knowledge not compatible with latin1. It uses utf-8 (on my system) to read and latin-1 (on any system) to encode and store in the array, corrupting any non-ascii characters. Here's a demonstration: $ ipython3 Python 3.2.3 (default, Sep 25 2013, 18:22:43) Type "copyright", "credits" or "license" for more information. IPython 0.12.1 -- An enhanced Interactive Python. ? -> Introduction and overview of IPython's features. %quickref -> Quick reference. help -> Python's own help system. object? -> Details about 'object', use 'object??' for extra details. 
In [1]: with open('?scar.txt', 'w') as fout: pass In [2]: import os In [3]: os.listdir('.') Out[3]: ['?scar.txt'] In [4]: with open('filenames.txt', 'w') as fout: ...: fout.writelines([f + '\n' for f in os.listdir('.')]) ...: In [5]: with open('filenames.txt') as fin: ...: print(fin.read()) ...: filenames.txt ?scar.txt In [6]: import numpy In [7]: filenames = numpy.loadtxt('filenames.txt') ValueError: could not convert string to float: b'filenames.txt' In [8]: filenames = numpy.loadtxt('filenames.txt', dtype='S') In [9]: filenames Out[9]: array([b'filenames.txt', b'\xd5scar.txt'], dtype='|S13') In [10]: open(filenames[1]) --------------------------------------------------------------------------- IOError Traceback (most recent call last) /users/enojb/.rcs/tmp/ in () ----> 1 open(filenames[1]) IOError: [Errno 2] No such file or directory: '\udcd5scar.txt' In [11]: open('?scar.txt'.encode('utf-8')) Out[11]: <_io.TextIOWrapper name=b'\xc3\x95scar.txt' mode='r' encoding='UTF-8'> Oscar From josef.pktd at gmail.com Fri Jan 17 09:11:22 2014 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Fri, 17 Jan 2014 09:11:22 -0500 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: <20140117134033.GB2253@gmail.com> References: <52D68161.7090807@googlemail.com> <20140116104303.GA11119@gmail.com> <20140117124414.GA2253@gmail.com> <20140117134033.GB2253@gmail.com> Message-ID: On Fri, Jan 17, 2014 at 8:40 AM, Oscar Benjamin wrote: > On Fri, Jan 17, 2014 at 02:10:19PM +0100, Julian Taylor wrote: >> On Fri, Jan 17, 2014 at 1:44 PM, Oscar Benjamin >> wrote: >> >> > On Fri, Jan 17, 2014 at 10:59:27AM +0000, Pauli Virtanen wrote: >> > > Julian Taylor googlemail.com> writes: >> > > [clip] >> > >> >> > > > For backward compatibility we *cannot* change S. >> > >> > Do you mean to say that loadtxt cannot be changed from decoding using >> > system >> > default, splitting on newlines and whitespace and then encoding the >> > substrings >> > as latin-1? >> > >> >> unicode dtypes have nothing to do with the loadtxt issue. They are not >> related. > > I'm talking about what loadtxt does with the 'S' dtype. As I showed earlier, > if the file is not encoded as ascii or latin-1 then the byte strings are > corrupted (see below). > > This is because loadtxt opens the file with the default system encoding (by > not explicitly specifying an encoding): > https://github.com/numpy/numpy/blob/master/numpy/lib/npyio.py#L732 > > It then processes each line with asbytes() which encodes them as latin-1: > https://github.com/numpy/numpy/blob/master/numpy/lib/npyio.py#L784 > https://github.com/numpy/numpy/blob/master/numpy/compat/py3k.py#L28 > > Being an English speaker I don't normally use non-ascii characters in > filenames but my system (Ubuntu Linux) still uses utf-8 rather than latin-1 or > (and rightly so!). > >> > >> > An obvious improvement would be along the lines of what Chris Barker >> > suggested: decode as latin-1, do the processing and then reencode as >> > latin-1. >> > >> >> no, the right solution is to add an encoding argument. >> Its a 4 line patch for python2 and a 2 line patch for python3 and the issue >> is solved, I'll file a PR later. > > What is the encoding argument for? Is it to be used to decode, process the > text and then re-encode it for an array with dtype='S'? > > Note that there are two encodings: one for reading from the file and one for > storing in the array. 
The former describes the content of the file and the > latter will be used if I extract a byte-string from the array and pass it to > any Python API. > >> No latin1 de/encoding is required for anything, I don't know why you would >> want do to that in this context. >> Does opening latin1 files even work with current loadtxt? > > It's the only encoding that works for dtype='S'. > >> It currently uses UTF-8 which is to my knowledge not compatible with latin1. > > It uses utf-8 (on my system) to read and latin-1 (on any system) to encode and > store in the array, corrupting any non-ascii characters. Here's a > demonstration: > > $ ipython3 > Python 3.2.3 (default, Sep 25 2013, 18:22:43) > Type "copyright", "credits" or "license" for more information. > > IPython 0.12.1 -- An enhanced Interactive Python. > ? -> Introduction and overview of IPython's features. > %quickref -> Quick reference. > help -> Python's own help system. > object? -> Details about 'object', use 'object??' for extra details. > > In [1]: with open('?scar.txt', 'w') as fout: pass > > In [2]: import os > > In [3]: os.listdir('.') > Out[3]: ['?scar.txt'] > > In [4]: with open('filenames.txt', 'w') as fout: > ...: fout.writelines([f + '\n' for f in os.listdir('.')]) > ...: > > In [5]: with open('filenames.txt') as fin: > ...: print(fin.read()) > ...: > filenames.txt > ?scar.txt > > > In [6]: import numpy > > In [7]: filenames = numpy.loadtxt('filenames.txt') > > ValueError: could not convert string to float: b'filenames.txt' > > In [8]: filenames = numpy.loadtxt('filenames.txt', dtype='S') > > In [9]: filenames > Out[9]: > array([b'filenames.txt', b'\xd5scar.txt'], > dtype='|S13') > > In [10]: open(filenames[1]) > --------------------------------------------------------------------------- > IOError Traceback (most recent call last) > /users/enojb/.rcs/tmp/ in () > ----> 1 open(filenames[1]) > > IOError: [Errno 2] No such file or directory: '\udcd5scar.txt' > > In [11]: open('?scar.txt'.encode('utf-8')) > Out[11]: <_io.TextIOWrapper name=b'\xc3\x95scar.txt' mode='r' encoding='UTF-8'> Windows seems to use consistent en/decoding throughout (example run in IDLE) Python 3.3.0 (v3.3.0:bd8afb90ebf2, Sep 29 2012, 10:55:48) [MSC v.1600 32 bit (Intel)] on win32 >>> filenames = numpy.loadtxt('filenames.txt', dtype='S') >>> filenames array([b'weighted_kde.py', b'_proportion.log.py', b'__init__.py', b'\xd5scar.txt'], dtype='|S18') >>> fn = open(filenames[-1]) >>> fn.read() '1,2,3,hello\n5,6,7,?scar\n' >>> fn <_io.TextIOWrapper name=b'\xd5scar.txt' mode='r' encoding='cp1252'> Josef > > > Oscar > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From jtaylor.debian at googlemail.com Fri Jan 17 09:12:32 2014 From: jtaylor.debian at googlemail.com (Julian Taylor) Date: Fri, 17 Jan 2014 15:12:32 +0100 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: <20140117134033.GB2253@gmail.com> References: <52D68161.7090807@googlemail.com> <20140116104303.GA11119@gmail.com> <20140117124414.GA2253@gmail.com> <20140117134033.GB2253@gmail.com> Message-ID: On Fri, Jan 17, 2014 at 2:40 PM, Oscar Benjamin wrote: > On Fri, Jan 17, 2014 at 02:10:19PM +0100, Julian Taylor wrote: > > On Fri, Jan 17, 2014 at 1:44 PM, Oscar Benjamin > > wrote: > > > > > On Fri, Jan 17, 2014 at 10:59:27AM +0000, Pauli Virtanen wrote: > > > > Julian Taylor googlemail.com> writes: > > > > [clip] > > > > > > > > > > For 
backward compatibility we *cannot* change S. > > > > > > Do you mean to say that loadtxt cannot be changed from decoding using > > > system > > > default, splitting on newlines and whitespace and then encoding the > > > substrings > > > as latin-1? > > > > > > > unicode dtypes have nothing to do with the loadtxt issue. They are not > > related. > > I'm talking about what loadtxt does with the 'S' dtype. As I showed > earlier, > if the file is not encoded as ascii or latin-1 then the byte strings are > corrupted (see below). > > This is because loadtxt opens the file with the default system encoding (by > not explicitly specifying an encoding): > https://github.com/numpy/numpy/blob/master/numpy/lib/npyio.py#L732 > > It then processes each line with asbytes() which encodes them as latin-1: > https://github.com/numpy/numpy/blob/master/numpy/lib/npyio.py#L784 > https://github.com/numpy/numpy/blob/master/numpy/compat/py3k.py#L28 > wow this is just horrible, it might be the source of the bug. > > Being an English speaker I don't normally use non-ascii characters in > filenames but my system (Ubuntu Linux) still uses utf-8 rather than > latin-1 or > (and rightly so!). > > > > > > > An obvious improvement would be along the lines of what Chris Barker > > > suggested: decode as latin-1, do the processing and then reencode as > > > latin-1. > > > > > > > no, the right solution is to add an encoding argument. > > Its a 4 line patch for python2 and a 2 line patch for python3 and the > issue > > is solved, I'll file a PR later. > > What is the encoding argument for? Is it to be used to decode, process the > text and then re-encode it for an array with dtype='S'? > it is only used to decode the file into text, nothing more. loadtxt is supposed to load text files, it should never have to deal with bytes ever. But I haven't looked into the function deeply yet, there might be ugly surprises. The output of the array is determined by the dtype argument and not by the encoding argument. Lets please let the loadtxt issue go to rest. We know the issue, we know it can be fixed without adding anything complicated to numpy. We just have to use what python already provides us. The technical details of the fix can be discussed in the github issue. (Plan to have a look this weekend, but if someone else wants to do it let me know). -------------- next part -------------- An HTML attachment was scrubbed... URL: From oscar.j.benjamin at gmail.com Fri Jan 17 10:26:05 2014 From: oscar.j.benjamin at gmail.com (Oscar Benjamin) Date: Fri, 17 Jan 2014 15:26:05 +0000 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: References: <52D68161.7090807@googlemail.com> <20140116104303.GA11119@gmail.com> <20140117124414.GA2253@gmail.com> <20140117134033.GB2253@gmail.com> Message-ID: On Fri, Jan 17, 2014 at 03:12:32PM +0100, Julian Taylor wrote: > On Fri, Jan 17, 2014 at 2:40 PM, Oscar Benjamin > wrote: > > > On Fri, Jan 17, 2014 at 02:10:19PM +0100, Julian Taylor wrote: > > > > > > no, the right solution is to add an encoding argument. > > > Its a 4 line patch for python2 and a 2 line patch for python3 and the > > issue > > > is solved, I'll file a PR later. > > > > What is the encoding argument for? Is it to be used to decode, process the > > text and then re-encode it for an array with dtype='S'? > > > > it is only used to decode the file into text, nothing more. > loadtxt is supposed to load text files, it should never have to deal with > bytes ever. 
> But I haven't looked into the function deeply yet, there might be ugly > surprises. > > The output of the array is determined by the dtype argument and not by the > encoding argument. If the dtype is 'S' then the output should be bytes and you therefore need to encode the text; there's no such thing as storing text in bytes without an encoding. Strictly speaking the 'U' dtype uses the encoding 'ucs-4' or 'utf-32' which just happens to be as simple as expressing the corresponding unicode code points as int32 so it's reasonable to think of it as "not encoded" in some sense (although endianness becomes an issue in utf-32). On 17 January 2014 14:11, wrote: > Windows seems to use consistent en/decoding throughout (example run in IDLE) The reason for the Py3k bytes/text overhaul is that there were lots of situations where things *seemed* to work until someone happens to use a character you didn't try. "Seems to" doesn't cut it! :) > Python 3.3.0 (v3.3.0:bd8afb90ebf2, Sep 29 2012, 10:55:48) [MSC v.1600 > 32 bit (Intel)] on win32 > >>>> filenames = numpy.loadtxt('filenames.txt', dtype='S') >>>> filenames > array([b'weighted_kde.py', b'_proportion.log.py', b'__init__.py', > b'\xd5scar.txt'], > dtype='|S18') >>>> fn = open(filenames[-1]) >>>> fn.read() > '1,2,3,hello\n5,6,7,?scar\n' >>>> fn > <_io.TextIOWrapper name=b'\xd5scar.txt' mode='r' encoding='cp1252'> You don't show how you created the file. I think that in your case the content of 'filenames.txt' is correctly encoded latin-1. My guess is that you did the same as me and opened it in text mode and wrote the unicode string allowing Python to encode it for you. Judging by the encoding on fn above I'd say that it wrote the file with cp1252 which is mostly compatible with latin-1. Try it with a byte that is incompatible between cp1252 and latin-1 e.g.: In [3]: b'\x80'.decode('cp1252') Out[3]: '?' In [4]: b'\x80'.decode('latin-1') Out[4]: '\x80' In [5]: b'\x80'.decode('cp1252').encode('latin-1') --------------------------------------------------------------------------- UnicodeEncodeError Traceback (most recent call last) /users/enojb/ in () ----> 1 b'\x80'.decode('cp1252').encode('latin-1') UnicodeEncodeError: 'latin-1' codec can't encode character '\u20ac' in position 0: ordinal not in range(256) Oscar From josef.pktd at gmail.com Fri Jan 17 10:58:25 2014 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Fri, 17 Jan 2014 10:58:25 -0500 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: References: <52D68161.7090807@googlemail.com> <20140116104303.GA11119@gmail.com> <20140117124414.GA2253@gmail.com> <20140117134033.GB2253@gmail.com> Message-ID: On Fri, Jan 17, 2014 at 10:26 AM, Oscar Benjamin wrote: > On Fri, Jan 17, 2014 at 03:12:32PM +0100, Julian Taylor wrote: >> On Fri, Jan 17, 2014 at 2:40 PM, Oscar Benjamin >> wrote: >> >> > On Fri, Jan 17, 2014 at 02:10:19PM +0100, Julian Taylor wrote: >> > > >> > > no, the right solution is to add an encoding argument. >> > > Its a 4 line patch for python2 and a 2 line patch for python3 and the >> > issue >> > > is solved, I'll file a PR later. >> > >> > What is the encoding argument for? Is it to be used to decode, process the >> > text and then re-encode it for an array with dtype='S'? >> > >> >> it is only used to decode the file into text, nothing more. >> loadtxt is supposed to load text files, it should never have to deal with >> bytes ever. >> But I haven't looked into the function deeply yet, there might be ugly >> surprises. 
>> >> The output of the array is determined by the dtype argument and not by the >> encoding argument. > > If the dtype is 'S' then the output should be bytes and you therefore > need to encode the text; there's no such thing as storing text in > bytes without an encoding. > > Strictly speaking the 'U' dtype uses the encoding 'ucs-4' or 'utf-32' > which just happens to be as simple as expressing the corresponding > unicode code points as int32 so it's reasonable to think of it as "not > encoded" in some sense (although endianness becomes an issue in > utf-32). > > On 17 January 2014 14:11, wrote: >> Windows seems to use consistent en/decoding throughout (example run in IDLE) > > The reason for the Py3k bytes/text overhaul is that there were lots of > situations where things *seemed* to work until someone happens to use > a character you didn't try. "Seems to" doesn't cut it! :) > >> Python 3.3.0 (v3.3.0:bd8afb90ebf2, Sep 29 2012, 10:55:48) [MSC v.1600 >> 32 bit (Intel)] on win32 >> >>>>> filenames = numpy.loadtxt('filenames.txt', dtype='S') >>>>> filenames >> array([b'weighted_kde.py', b'_proportion.log.py', b'__init__.py', >> b'\xd5scar.txt'], >> dtype='|S18') >>>>> fn = open(filenames[-1]) >>>>> fn.read() >> '1,2,3,hello\n5,6,7,?scar\n' >>>>> fn >> <_io.TextIOWrapper name=b'\xd5scar.txt' mode='r' encoding='cp1252'> > > You don't show how you created the file. I think that in your case the > content of 'filenames.txt' is correctly encoded latin-1. I had created it with os.listdir but deleted some lines Running the full script again I still get the same correct answer for fn ------------ import os if 1: with open('filenames5.txt', 'w') as fout: fout.writelines([f + '\n' for f in os.listdir('.')]) with open('filenames.txt') as fin: print(fin.read()) import numpy #filenames = numpy.loadtxt('filenames.txt') filenames = numpy.loadtxt('filenames5.txt', dtype='S') fn = open(filenames[-1]) ------------ > > My guess is that you did the same as me and opened it in text mode and > wrote the unicode string allowing Python to encode it for you. Judging > by the encoding on fn above I'd say that it wrote the file with cp1252 > which is mostly compatible with latin-1. Try it with a byte that is > incompatible between cp1252 and latin-1 e.g.: > > In [3]: b'\x80'.decode('cp1252') > Out[3]: '?' > > In [4]: b'\x80'.decode('latin-1') > Out[4]: '\x80' > > In [5]: b'\x80'.decode('cp1252').encode('latin-1') > --------------------------------------------------------------------------- > UnicodeEncodeError Traceback (most recent call last) > /users/enojb/ in () > ----> 1 b'\x80'.decode('cp1252').encode('latin-1') > > UnicodeEncodeError: 'latin-1' codec can't encode character '\u20ac' in > position 0: ordinal not in range(256) I get similar problems when I use a file that someone else has written, however I haven't seen much problems if I do everything on Windows. The main problems I get and where I don't know how it's supposed to work in the best way is when we get "foreign" data. 
some examples I just played with that are closer to what we use in statsmodels but don't have any unit tests >>> filenames1 = numpy.recfromtxt(open('?scar.txt',"rb"), delimiter=',') >>> filenames1 rec.array([(1, 2, 3, b'hello'), (5, 6, 7, b'\xd5scar')], dtype=[('f0', '>> filenames1['f3'][-1] b'\xd5scar' >>> filenames1['f3'] == '?scar' False >>> filenames1['f3'] == '?scar'.encode('cp1252') array([False, True], dtype=bool) >>> filenames1['f3'] == 'hello' False >>> filenames1['f3'] == b'hello' array([ True, False], dtype=bool) >>> filenames1['f3'] == b'\xd5scar' array([False, True], dtype=bool) >>> filenames1['f3'] == np.array(['?scar'.encode('utf8')], 'S5') array([False, False], dtype=bool) Josef > > > Oscar > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From oscar.j.benjamin at gmail.com Fri Jan 17 12:13:05 2014 From: oscar.j.benjamin at gmail.com (Oscar Benjamin) Date: Fri, 17 Jan 2014 17:13:05 +0000 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: References: <20140116104303.GA11119@gmail.com> <20140117124414.GA2253@gmail.com> <20140117134033.GB2253@gmail.com> Message-ID: <20140117171301.GA4168@gmail.com> On Fri, Jan 17, 2014 at 10:58:25AM -0500, josef.pktd at gmail.com wrote: > On Fri, Jan 17, 2014 at 10:26 AM, Oscar Benjamin > wrote: > > On Fri, Jan 17, 2014 at 03:12:32PM +0100, Julian Taylor wrote: > > > > You don't show how you created the file. I think that in your case the > > content of 'filenames.txt' is correctly encoded latin-1. > > I had created it with os.listdir but deleted some lines You used os.listdir to generate the unicode strings that you write to the file. The underlying Win32 API returns filenames encoded as utf-16 but Python takes care of decoding them under the hood so you just get abstract unicode strings here in Python 3. It is the write method of the file object that encodes the unicode strings and hence determines the byte content of 'filenames5.txt'. You can check the fout.encoding attribute to see what encoding it uses by default. > Running the full script again I still get the same correct answer for fn > ------------ > import os > if 1: > with open('filenames5.txt', 'w') as fout: > fout.writelines([f + '\n' for f in os.listdir('.')]) > with open('filenames.txt') as fin: > print(fin.read()) > > import numpy > > #filenames = numpy.loadtxt('filenames.txt') > filenames = numpy.loadtxt('filenames5.txt', dtype='S') > fn = open(filenames[-1]) The question is what do you get when you do: In [1]: with open('tmp.txt', 'w') as fout: ...: print(fout.encoding) ...: UTF-8 I get utf-8 by default if no encoding is specified. This means that when I write to the file like so In [2]: with open('tmp.txt', 'w') as fout: ...: fout.write('?scar') ...: If I read it back in binary I get different bytes from you: In [3]: with open('tmp.txt', 'rb') as fin: ...: print(fin.read()) ...: b'\xc3\x95scar' Numpy.loadtxt will correctly decode those bytes as utf-8: In [5]: b'\xc3\x95scar'.decode('utf-8') Out[5]: '?scar' But then it reencodes them with latin-1 before storing them in the array: In [6]: b'\xc3\x95scar'.decode('utf-8').encode('latin-1') Out[6]: b'\xd5scar' This byte string will not be recognised by my Linux OS (POSIX uses bytes for filenames and an exact match is needed). So if I pass that to open() it will fail. 
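The corrupting step can be reproduced in isolation, without loadtxt; a minimal
sketch (same invented filename as above, assuming a utf-8 locale as in the
session shown earlier):

original = 'Õscar.txt'.encode('utf-8')   # the bytes on disk / in filenames.txt: b'\xc3\x95scar.txt'
decoded = original.decode('utf-8')       # decoded with the (utf-8) system default: 'Õscar.txt'
stored = decoded.encode('latin-1')       # re-encoded a la asbytes(): b'\xd5scar.txt'
assert stored != original                # different bytes, so open(stored) cannot match the file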
> > I get similar problems when I use a file that someone else has > written, however I haven't seen much problems if I do everything on > Windows. If you use a proper explicit encoding then you can savetxt from any system and loadtxt on any other without corruption. > The main problems I get and where I don't know how it's supposed to > work in the best way is when we get "foreign" data. Text data needs to have metadata specifying the encoding. This is something that people who pass data around need to think about. Oscar From pav at iki.fi Fri Jan 17 13:40:41 2014 From: pav at iki.fi (Pauli Virtanen) Date: Fri, 17 Jan 2014 20:40:41 +0200 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: References: <52D68161.7090807@googlemail.com> <20140116104303.GA11119@gmail.com> Message-ID: 17.01.2014 15:09, Aldcroft, Thomas kirjoitti: [clip] > I've been playing around with porting a stack of analysis libraries > to Python 3 and this is a very timely thread and comment. What I > discovered right away is that all the string data coming from > binary HDF5 files show up (as expected) as 'S' type,, but that > trying to make everything actually work in Python 3 without > converting to 'U' is a big mess of whack-a-mole. > > Yes, it's possible to change my libraries to use bytestring > literals everywhere, but the Python 3 user experience becomes > horrible because to interact with the data all downstream > applications need to use bytestring literals everywhere. E.g. > doing a simple filter like `string_array == 'foo'` doesn't work, > and this will break all existing code when trying to run in Python > 3. And every time you try to print something it has this horrible > "b" in front. Ugly, and it just won't work well in the end. [clip] Ok, I see your point. Having additional Unicode data types with smaller widths could be useful. On Python 2, they would then be Unicode strings, right? Thanks to Py2 automatic Unicode encoding/decoding, they might also be usable in interactive etc. use on Py2. Adding new data types in Numpy codebase takes some work, but it's possible to do. There's also an issue (as noted in the Github ticket) that array([u'foo'], dtype=bytes) encodes silently via the ASCII codec. This is probably not how it should be. -- Pauli Virtanen From jtaylor.debian at googlemail.com Fri Jan 17 14:18:47 2014 From: jtaylor.debian at googlemail.com (Julian Taylor) Date: Fri, 17 Jan 2014 20:18:47 +0100 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: References: <52D68161.7090807@googlemail.com> <20140116104303.GA11119@gmail.com> <20140117124414.GA2253@gmail.com> <20140117134033.GB2253@gmail.com> Message-ID: <52D98217.7020000@googlemail.com> On 17.01.2014 15:12, Julian Taylor wrote: > On Fri, Jan 17, 2014 at 2:40 PM, Oscar Benjamin > > wrote: > > On Fri, Jan 17, 2014 at 02:10:19PM +0100, Julian Taylor wrote: > > On Fri, Jan 17, 2014 at 1:44 PM, Oscar Benjamin > > >wrote: > > > > > On Fri, Jan 17, 2014 at 10:59:27AM +0000, Pauli Virtanen wrote: > > > > Julian Taylor googlemail.com > > writes: > > > > [clip] > > > > > > > > > > For backward compatibility we *cannot* change S. > > > > > > Do you mean to say that loadtxt cannot be changed from decoding > using > > > system > > > default, splitting on newlines and whitespace and then encoding the > > > substrings > > > as latin-1? > > > > > > > unicode dtypes have nothing to do with the loadtxt issue. They are not > > related. 
> > I'm talking about what loadtxt does with the 'S' dtype. As I showed > earlier, > if the file is not encoded as ascii or latin-1 then the byte strings are > corrupted (see below). > > This is because loadtxt opens the file with the default system > encoding (by > not explicitly specifying an encoding): > https://github.com/numpy/numpy/blob/master/numpy/lib/npyio.py#L732 > > It then processes each line with asbytes() which encodes them as > latin-1: > https://github.com/numpy/numpy/blob/master/numpy/lib/npyio.py#L784 > https://github.com/numpy/numpy/blob/master/numpy/compat/py3k.py#L28 > > > > wow this is just horrible, it might be the source of the bug. > > > > > Being an English speaker I don't normally use non-ascii characters in > filenames but my system (Ubuntu Linux) still uses utf-8 rather than > latin-1 or > (and rightly so!). > > > > > > > An obvious improvement would be along the lines of what Chris Barker > > > suggested: decode as latin-1, do the processing and then reencode as > > > latin-1. > > > > > > > no, the right solution is to add an encoding argument. > > Its a 4 line patch for python2 and a 2 line patch for python3 and > the issue > > is solved, I'll file a PR later. > > What is the encoding argument for? Is it to be used to decode, > process the > text and then re-encode it for an array with dtype='S'? > > > it is only used to decode the file into text, nothing more. > loadtxt is supposed to load text files, it should never have to deal > with bytes ever. > But I haven't looked into the function deeply yet, there might be ugly > surprises. > > The output of the array is determined by the dtype argument and not by > the encoding argument. > > Lets please let the loadtxt issue go to rest. > We know the issue, we know it can be fixed without adding anything > complicated to numpy. > We just have to use what python already provides us. > The technical details of the fix can be discussed in the github issue. > (Plan to have a look this weekend, but if someone else wants to do it > let me know). > Work in progress PR: https://github.com/numpy/numpy/pull/4208 I also seem to have fixed the original bug, while wasn't even my intention with that PR :) apparently it was indeed one of the broken asbytes calls. if you have applications using loadtxt please give it a try, but genfromtxt is still completely broken (and a much larger fix, asbytes everywhere) From josef.pktd at gmail.com Fri Jan 17 14:58:21 2014 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Fri, 17 Jan 2014 14:58:21 -0500 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: <52D98217.7020000@googlemail.com> References: <52D68161.7090807@googlemail.com> <20140116104303.GA11119@gmail.com> <20140117124414.GA2253@gmail.com> <20140117134033.GB2253@gmail.com> <52D98217.7020000@googlemail.com> Message-ID: On Fri, Jan 17, 2014 at 2:18 PM, Julian Taylor wrote: > On 17.01.2014 15:12, Julian Taylor wrote: >> On Fri, Jan 17, 2014 at 2:40 PM, Oscar Benjamin >> > wrote: >> >> On Fri, Jan 17, 2014 at 02:10:19PM +0100, Julian Taylor wrote: >> > On Fri, Jan 17, 2014 at 1:44 PM, Oscar Benjamin >> > >wrote: >> > >> > > On Fri, Jan 17, 2014 at 10:59:27AM +0000, Pauli Virtanen wrote: >> > > > Julian Taylor googlemail.com >> > writes: >> > > > [clip] >> > > >> > >> > > > > For backward compatibility we *cannot* change S. 
>> > > >> > > Do you mean to say that loadtxt cannot be changed from decoding >> using >> > > system >> > > default, splitting on newlines and whitespace and then encoding the >> > > substrings >> > > as latin-1? >> > > >> > >> > unicode dtypes have nothing to do with the loadtxt issue. They are not >> > related. >> >> I'm talking about what loadtxt does with the 'S' dtype. As I showed >> earlier, >> if the file is not encoded as ascii or latin-1 then the byte strings are >> corrupted (see below). >> >> This is because loadtxt opens the file with the default system >> encoding (by >> not explicitly specifying an encoding): >> https://github.com/numpy/numpy/blob/master/numpy/lib/npyio.py#L732 >> >> It then processes each line with asbytes() which encodes them as >> latin-1: >> https://github.com/numpy/numpy/blob/master/numpy/lib/npyio.py#L784 >> https://github.com/numpy/numpy/blob/master/numpy/compat/py3k.py#L28 >> >> >> >> wow this is just horrible, it might be the source of the bug. >> >> >> >> >> Being an English speaker I don't normally use non-ascii characters in >> filenames but my system (Ubuntu Linux) still uses utf-8 rather than >> latin-1 or >> (and rightly so!). >> >> > > >> > > An obvious improvement would be along the lines of what Chris Barker >> > > suggested: decode as latin-1, do the processing and then reencode as >> > > latin-1. >> > > >> > >> > no, the right solution is to add an encoding argument. >> > Its a 4 line patch for python2 and a 2 line patch for python3 and >> the issue >> > is solved, I'll file a PR later. >> >> What is the encoding argument for? Is it to be used to decode, >> process the >> text and then re-encode it for an array with dtype='S'? >> >> >> it is only used to decode the file into text, nothing more. >> loadtxt is supposed to load text files, it should never have to deal >> with bytes ever. >> But I haven't looked into the function deeply yet, there might be ugly >> surprises. >> >> The output of the array is determined by the dtype argument and not by >> the encoding argument. >> >> Lets please let the loadtxt issue go to rest. >> We know the issue, we know it can be fixed without adding anything >> complicated to numpy. >> We just have to use what python already provides us. >> The technical details of the fix can be discussed in the github issue. >> (Plan to have a look this weekend, but if someone else wants to do it >> let me know). >> > > Work in progress PR: > https://github.com/numpy/numpy/pull/4208 > > I also seem to have fixed the original bug, while wasn't even my > intention with that PR :) > apparently it was indeed one of the broken asbytes calls. > > if you have applications using loadtxt please give it a try, but > genfromtxt is still completely broken (and a much larger fix, asbytes > everywhere) does this still work? 
>>> numpy.loadtxt(open('?scar_3.txt',"rb"), 'S') array([b'1,2,3,hello', b'5,6,7,\xc3\x95scarscar', b'15,2,3,hello', b'20,2,3,\xc3\x95scar'], dtype='|S16') to compare >>> numpy.recfromtxt(open('?scar_3.txt',"r", encoding='utf8'), delimiter=',') Traceback (most recent call last): File "", line 1, in numpy.recfromtxt(open('?scar_3.txt',"r", encoding='utf8'), delimiter=',') File "C:\Programs\Python33\lib\site-packages\numpy\lib\npyio.py", line 1828, in recfromtxt output = genfromtxt(fname, **kwargs) File "C:\Programs\Python33\lib\site-packages\numpy\lib\npyio.py", line 1351, in genfromtxt first_values = split_line(first_line) File "C:\Programs\Python33\lib\site-packages\numpy\lib\_iotools.py", line 207, in _delimited_splitter line = line.split(self.comments)[0] TypeError: Can't convert 'bytes' object to str implicitly >>> numpy.recfromtxt(open('?scar_3.txt',"rb"), delimiter=',') rec.array([(1, 2, 3, b'hello'), (5, 6, 7, b'\xc3\x95scarscar'), (15, 2, 3, b'hello'), (20, 2, 3, b'\xc3\x95scar')], dtype=[('f0', ' _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From chris.barker at noaa.gov Fri Jan 17 15:02:52 2014 From: chris.barker at noaa.gov (Chris Barker) Date: Fri, 17 Jan 2014 12:02:52 -0800 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: References: <52D68161.7090807@googlemail.com> <20140116104303.GA11119@gmail.com> Message-ID: On Fri, Jan 17, 2014 at 1:38 AM, Julian Taylor < jtaylor.debian at googlemail.com> wrote: > > This thread is getting a little out of hand which is my fault for > initially mixing different topics in one mail, > still a bit mixed ;-) -- but I think the loadtxt issue requires a lot less discussion, so we're OK there. There have been a lot of notes here since I last commented, so I'm going stick with the loadtxt issues in this note: - no possibility to specify the encoding of a file in loadtxt > this is a missing feature, currently it uses the system default which is > good and should stay that way. > I disagree -- I think using the "system encoding" is a bad idea for a default -- I certainly am far more likely to get data files from some other system than my own -- and really unlikely to use the "system encoding" for any data files I write, either. And I'm not begin english-centered here -- my data files commonly do have non-ascii code in there, though frankly, they are either a mess or I know the encoding. What should be the default? latin-1 Why? Despite our desire to be non-english-focuses, most of what loadtxt does is parse files for numbers, maybe with a bit of text. Numbers are virtually always ascii-compatible (am I wrong about that? -- if so you'd damn well better know your encoding!). So it should be an ascii-compatible encoding. Why not ascii? -- because then it would barf on non-ascii text in the file -- really bad idea there. Why not utf-8 -- this is being *nic centric -- and utf-8 will wrk fine on ascii, but corrupt non-asci,, non-utf-8 data (i.e. any other encoding.) and may barf on some of ti too (not sure about that). latin-1 will never barf on any binary data, will successfully parse any numeric data (plus spaces, commas, etc.), and will preserve the bytes of an non-ascii content in the file. If you can set the encoding it's not a huge deal what the default is, but I will recommend that everyone always either sets it to a known encoding or uses latin-1 -- never the system encoding. 
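That property is easy to check; a minimal sketch (the byte blob is arbitrary,
chosen only to contain every possible byte value):

blob = bytes(range(256))               # arbitrary binary junk, all 256 byte values
text = blob.decode('latin-1')          # never raises: latin-1 maps each byte to a code point
assert text.encode('latin-1') == blob  # and the original bytes round-trip exactly
try:
    blob.decode('ascii')               # ascii (and utf-8) reject many byte sequences
except UnicodeDecodeError as e:
    print('ascii cannot decode it:', e)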
One more point: on my system right now: In [15]: sys.getdefaultencoding() Out[15]: 'ascii' please don't make loadttxt start barfing on files I've been reading just fine for years.... It is only missing an option to tell it to treat it differently. > There should be little debate about changing the default, especially not > using latin1. The system default exists for a good reason. > Maybe, maybe not, but I submit that whatever that "good reason" is, it does not apply here! This is kin dof like datetime64 using the localle timezone -- makes it useless! > Note on linux it is UTF-8 which is a good choice. I'm not familiar with > windows but all programs should at least have the option to use UTF-8 as > output too. > should, yes, so, maybe, but: a) not all text data files are written recently or by recently updated software. b) This is kind of like saying we should have loadtxt default to utf-8, which wouldn't be the worst idea -- better than system default, but still not as good as latin-1 This is a simple question: Should the exact same file read fine with the exact same code on one machine, but not another? I don't think so. This has nothing to do with indexing or any kind of processing of the numpy > arrays. > agreed. -Chris -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov -------------- next part -------------- An HTML attachment was scrubbed... URL: From chris.barker at noaa.gov Fri Jan 17 15:17:58 2014 From: chris.barker at noaa.gov (Chris Barker) Date: Fri, 17 Jan 2014 12:17:58 -0800 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: References: <52D68161.7090807@googlemail.com> <20140116104303.GA11119@gmail.com> <20140117124414.GA2253@gmail.com> <20140117134033.GB2253@gmail.com> <52D98217.7020000@googlemail.com> Message-ID: >>> numpy.recfromtxt(open('?scar_3.txt',"r", encoding='utf8'), delimiter=',') > Traceback (most recent call last): > File "", line 1, in > numpy.recfromtxt(open('?scar_3.txt',"r", encoding='utf8'), > delimiter=',') > File "C:\Programs\Python33\lib\site-packages\numpy\lib\npyio.py", > line 1828, in recfromtxt > output = genfromtxt(fname, **kwargs) > File "C:\Programs\Python33\lib\site-packages\numpy\lib\npyio.py", > line 1351, in genfromtxt > first_values = split_line(first_line) > File "C:\Programs\Python33\lib\site-packages\numpy\lib\_iotools.py", > line 207, in _delimited_splitter > line = line.split(self.comments)[0] > TypeError: Can't convert 'bytes' object to str implicitly > That's pretty broken -- if you know the encoding, you should certainly be able to get a proper unicode string out of it.. > >>> numpy.recfromtxt(open('?scar_3.txt',"rb"), delimiter=',') > rec.array([(1, 2, 3, b'hello'), (5, 6, 7, b'\xc3\x95scarscar'), > (15, 2, 3, b'hello'), (20, 2, 3, b'\xc3\x95scar')], > dtype=[('f0', ' So the problem here is that recfromtxt is making all "text" bytes objects. ('S' ?) -- which is probably not what you want particularly if you specify an encoding. Though I can't figure out at the moment why the previous one failed -- where did the bytes object come from when the encoding was specified? By the way -- this is apparently a utf-file with some non-ascii text in it. By my proposal, without an encoding specified, it should default to latin-1: In that case, you might get unicode string objects that are incorrectly decoded. 
But: it would not raise an exception you could recover the proper text with: the_text.encode(latin-1).decode('utf-8') On the other hand, if this was as ascii-compatible non-utf8 encoding file, and we tried to read it as utf-8, it could barf on the non-ascii text altogether, and if it didn't the non-ascii text would be corrupted and impossible to recover. I think the issue is that I'm not really proposing latin-1 -- I'm proposing "a ascii compatible encoding that will do the right thing with ascii bytes, and pass through any other bytes untouched" - latin-1, at least as implemented by Python, satisfies that criterium. -Chris -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov -------------- next part -------------- An HTML attachment was scrubbed... URL: From chris.barker at noaa.gov Fri Jan 17 15:30:06 2014 From: chris.barker at noaa.gov (Chris Barker) Date: Fri, 17 Jan 2014 12:30:06 -0800 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: <52D92DAE.5020409@witherden.org> References: <52D68161.7090807@googlemail.com> <20140116104303.GA11119@gmail.com> <52D92DAE.5020409@witherden.org> Message-ID: On Fri, Jan 17, 2014 at 5:18 AM, Freddie Witherden wrote: > In terms of HDF5 it is interesting to look at how h5py -- which has to > go between NumPy types and HDF5 conventions -- handles the problem as > described here: > > http://www.h5py.org/docs/topics/strings.html from that: """All strings in HDF5 hold encoded text. You can?t store arbitrary binary data in HDF5 strings. """ This is actually the same as a py3 string (though the mechanism may be completely different), and the problem with numpy's 'S' - is it text or bytes? Given the name and history, it should be text, but apparently people have been using t for bytes, so we have to keep that meaning/use case. But I suggest, that like Python3 -- we official declare that you should not consider it text, and not do any implicite conversions. Which means we could use a one-byte-per-character text dtype. """At the high-level interface, h5py exposes three kinds of strings. Each maps to a specific type within Python (but see str_py3 below): Fixed-length ASCII (NumPy S type) .... """ This is wrong, or mis-guided, or maybe only a little confusing -- 'S' is not an ASCII string (even though I wish it were...). But clearly the HDF folsk think we need one! """ Fixed-length ASCII These are created when you use numpy.string_: >>> dset.attrs["name"] = numpy.string_("Hello") or the S dtype: >>> dset = f.create_dataset("string_ds", (100,), dtype="S10") """ Pardon my py3 ignorance -- is numpy.string_ the same as 'S' in py3? Form another post, I thought you'd need to use numpy.bytes_ (which is the same on py2) """Variable-length ASCII These are created when you assign a byte string to an attribute: >>> dset.attrs["attr"] = b"Hello" or when you create a dataset with an explicit ?bytes? vlen type: >>> dt = h5py.special_dtype(vlen=bytes) >>> dset = f.create_dataset("name", (100,), dtype=dt) Note that they?re not fully identical to Python byte strings. """ This implies that HDF would be well served by an ascii text type. """ What about NumPy?s U type? NumPy also has a Unicode type, a UTF-32 fixed-width format (4-byte characters). HDF5 has no support for wide characters. Rather than trying to hack around this and ?pretend? 
to support it, h5py will raise an error when attempting to create datasets or attributes of this type. """ Interesting, though I think irrelevant to this conversation but it would be nice if HDFpy would encode/decode to/from utf-8 for these. -Chris > which IMHO got it about right. > > Regards, Freddie. > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov -------------- next part -------------- An HTML attachment was scrubbed... URL: From josef.pktd at gmail.com Fri Jan 17 15:36:12 2014 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Fri, 17 Jan 2014 15:36:12 -0500 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: References: <52D68161.7090807@googlemail.com> <20140116104303.GA11119@gmail.com> <20140117124414.GA2253@gmail.com> <20140117134033.GB2253@gmail.com> <52D98217.7020000@googlemail.com> Message-ID: On Fri, Jan 17, 2014 at 3:17 PM, Chris Barker wrote: > >>> numpy.recfromtxt(open('?scar_3.txt',"r", encoding='utf8'), > delimiter=',') >> >> Traceback (most recent call last): >> File "", line 1, in >> numpy.recfromtxt(open('?scar_3.txt',"r", encoding='utf8'), >> delimiter=',') >> File "C:\Programs\Python33\lib\site-packages\numpy\lib\npyio.py", >> line 1828, in recfromtxt >> output = genfromtxt(fname, **kwargs) >> File "C:\Programs\Python33\lib\site-packages\numpy\lib\npyio.py", >> line 1351, in genfromtxt >> first_values = split_line(first_line) >> File "C:\Programs\Python33\lib\site-packages\numpy\lib\_iotools.py", >> line 207, in _delimited_splitter >> line = line.split(self.comments)[0] >> TypeError: Can't convert 'bytes' object to str implicitly > > > That's pretty broken -- if you know the encoding, you should certainly be > able to get a proper unicode string out of it.. > >> >> >>> numpy.recfromtxt(open('?scar_3.txt',"rb"), delimiter=',') >> rec.array([(1, 2, 3, b'hello'), (5, 6, 7, b'\xc3\x95scarscar'), >> (15, 2, 3, b'hello'), (20, 2, 3, b'\xc3\x95scar')], >> dtype=[('f0', ' > > So the problem here is that recfromtxt is making all "text" bytes objects. > ('S' ?) -- which is probably not what you want particularly if you specify > an encoding. Though I can't figure out at the moment why the previous one > failed -- where did the bytes object come from when the encoding was > specified? Yes, it's a utf-8 file with nonascii. I don't know what I **should** want. For now I do want bytes, because that's how I changed statsmodels in the py3 conversion. This was just based on the fact that recfromtxt doesn't work with strings on python 3, so I switched to using bytes following the lead of numpy. I'm mainly worried about backwards compatibility, since we have been using this for 2 or 3 years. It would be easy to change in statsmodels when gen/recfromtxt is fixed, but I assume there is lots of other code using similar interpretation of S/bytes in numpy. Josef > > By the way -- this is apparently a utf-file with some non-ascii text in it. > By my proposal, without an encoding specified, it should default to latin-1: > > In that case, you might get unicode string objects that are incorrectly > decoded. 
But: > > it would not raise an exception > > you could recover the proper text with: > > the_text.encode(latin-1).decode('utf-8') > > On the other hand, if this was as ascii-compatible non-utf8 encoding file, > and we tried to read it as utf-8, it could barf on the non-ascii text > altogether, and if it didn't the non-ascii text would be corrupted and > impossible to recover. > > I think the issue is that I'm not really proposing latin-1 -- I'm proposing > "a ascii compatible encoding that will do the right thing with ascii bytes, > and pass through any other bytes untouched" - latin-1, at least as > implemented by Python, satisfies that criterium. > > -Chris > > > -- > > Christopher Barker, Ph.D. > Oceanographer > > Emergency Response Division > NOAA/NOS/OR&R (206) 526-6959 voice > 7600 Sand Point Way NE (206) 526-6329 fax > Seattle, WA 98115 (206) 526-6317 main reception > > Chris.Barker at noaa.gov > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From chris.barker at noaa.gov Fri Jan 17 15:56:42 2014 From: chris.barker at noaa.gov (Chris Barker) Date: Fri, 17 Jan 2014 12:56:42 -0800 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: References: <52D68161.7090807@googlemail.com> <20140116104303.GA11119@gmail.com> <52D92DAE.5020409@witherden.org> Message-ID: Small note: Being an English speaker I don't normally use non-ascii characters in > filenames but my system (Ubuntu Linux) still uses utf-8 rather than > latin-1 or > (and rightly so!). just to be really clear -- encoding for filenames and encoding for file content have nothing to do with each-other. sys.getdefaultencoding() is _supposed_ to be a default encoding for file content -- not file names. And of course you need to use the system file name encoding for file names! -Chris -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov -------------- next part -------------- An HTML attachment was scrubbed... URL: From chris.barker at noaa.gov Fri Jan 17 16:20:39 2014 From: chris.barker at noaa.gov (Chris Barker) Date: Fri, 17 Jan 2014 13:20:39 -0800 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: References: <52D68161.7090807@googlemail.com> <20140116104303.GA11119@gmail.com> <20140117124414.GA2253@gmail.com> <20140117134033.GB2253@gmail.com> <52D98217.7020000@googlemail.com> Message-ID: On Fri, Jan 17, 2014 at 12:36 PM, wrote: > > ('S' ?) -- which is probably not what you want particularly if you > specify > > an encoding. Though I can't figure out at the moment why the previous one > > failed -- where did the bytes object come from when the encoding was > > specified? > > Yes, it's a utf-8 file with nonascii. > > I don't know what I **should** want. > well, you **should** want: The numbers parsed out for you (Other wise, why use recfromtxt), and the text as properly decoded unicode strings. Python does very well with unicode -- and you are MUCH happier if you do the encoding/decoding as close to I/O as possible. recfromtxt is, in a way, decoding already, converting ascii representation of numbers to an internal binary representation -- why not handle the text at the same time. 
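In code the pattern looks something like this (a sketch with invented data;
io.StringIO stands in for a real file opened with an explicit encoding):

import io
f = io.StringIO('1,2.5,hello\n5,6.5,Õscar\n')   # like open(path, encoding='utf-8')
rows = [line.rstrip('\n').split(',') for line in f]
parsed = [(int(a), float(b), name) for a, b, name in rows]
print(parsed)   # [(1, 2.5, 'hello'), (5, 6.5, 'Õscar')] -- text stays ordinary str

The numbers get parsed and the text stays unicode throughout; the encoding is
dealt with exactly once, at the I/O boundary.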
There certainly are use cases for keeping the text as encoded bytes, but I'd say those fall into the categories of: 1) Special case 2) You should know what you are doing. So having recfromtxt auto-determine that for you makes little sense. Note that if you don't know the file encoding, this is tricky. My thoughts: 1) don't use the system default encoding!!! (see my other note on that!) 2) Either: a) open as a binary file and use bytes for anything that doesn't parse as text -- this means that the user will need to do the conversion to text themselves b) decode as latin-1: this would work well for ascii and _some_ non-ascii text, and would be recoverable for ALL text. I prefer (b). The point here is that if the user gets bytes, then they will either have to assume ascii, or need to hand-decode it, and if they just want assume ascii, they have a bytes object with limited text functionality so will probably need to decode it anyway (unless they are just passing it through) If the user gets unicode objects that are may not properly decoded, they can either assume it was ascii, and if they only do ascii-compatible things with it, it will work, or they can encode/decode it and get the proper stuff back, but only if they know the encoding, and if that's the case, why did they not specify that in the first place? > For now I do want bytes, because that's how I changed statsmodels in > the py3 conversion. > > This was just based on the fact that recfromtxt doesn't work with > strings on python 3, so I switched to using bytes following the lead > of numpy. > Well, that's really too bad -- it doesn't sound like you wanted bytes, it sounds like you wanted something that didn't crash -- fair enough. But the "proper" solution is for recfromtext to support text.... I'm mainly worried about backwards compatibility, since we have been > using this for 2 or 3 years. It would be easy to change in statsmodels > when gen/recfromtxt is fixed, but I assume there is lots of other code > using similar interpretation of S/bytes in numpy. > well, it does sound like enough folks are using 'S' to mean bytes -- too bad, but what can we do now about that? I'd like a 's' for ascii-stings though. -Chris -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov -------------- next part -------------- An HTML attachment was scrubbed... URL: From josef.pktd at gmail.com Fri Jan 17 16:43:58 2014 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Fri, 17 Jan 2014 16:43:58 -0500 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: References: <52D68161.7090807@googlemail.com> <20140116104303.GA11119@gmail.com> <20140117124414.GA2253@gmail.com> <20140117134033.GB2253@gmail.com> <52D98217.7020000@googlemail.com> Message-ID: On Fri, Jan 17, 2014 at 4:20 PM, Chris Barker wrote: > On Fri, Jan 17, 2014 at 12:36 PM, wrote: >> >> > ('S' ?) -- which is probably not what you want particularly if you >> > specify >> > an encoding. Though I can't figure out at the moment why the previous >> > one >> > failed -- where did the bytes object come from when the encoding was >> > specified? >> >> Yes, it's a utf-8 file with nonascii. >> >> I don't know what I **should** want. > > > well, you **should** want: > > The numbers parsed out for you (Other wise, why use recfromtxt), and the > text as properly decoded unicode strings. 
> > Python does very well with unicode -- and you are MUCH happier if you do the > encoding/decoding as close to I/O as possible. recfromtxt is, in a way, > decoding already, converting ascii representation of numbers to an internal > binary representation -- why not handle the text at the same time. > > There certainly are use cases for keeping the text as encoded bytes, but I'd > say those fall into the categories of: > > 1) Special case > 2) You should know what you are doing. > > So having recfromtxt auto-determine that for you makes little sense. > > Note that if you don't know the file encoding, this is tricky. My thoughts: > > 1) don't use the system default encoding!!! (see my other note on that!) > > 2) Either: > a) open as a binary file and use bytes for anything that doesn't parse > as text -- this means that the user will need to do the conversion to text > themselves > > b) decode as latin-1: this would work well for ascii and _some_ non-ascii > text, and would be recoverable for ALL text. > > I prefer (b). The point here is that if the user gets bytes, then they will > either have to assume ascii, or need to hand-decode it, and if they just > want assume ascii, they have a bytes object with limited text functionality > so will probably need to decode it anyway (unless they are just passing it > through) > > If the user gets unicode objects that are may not properly decoded, they can > either assume it was ascii, and if they only do ascii-compatible things with > it, it will work, or they can encode/decode it and get the proper stuff > back, but only if they know the encoding, and if that's the case, why did > they not specify that in the first place? > >> >> For now I do want bytes, because that's how I changed statsmodels in >> the py3 conversion. >> >> This was just based on the fact that recfromtxt doesn't work with >> strings on python 3, so I switched to using bytes following the lead >> of numpy. > > > Well, that's really too bad -- it doesn't sound like you wanted bytes, it > sounds like you wanted something that didn't crash -- fair enough. But the > "proper" solution is for recfromtext to support text.... But also solution 2a) is fine for most of the code Often it doesn't really matter >>> dta_4 array([(1, 2, 3, b'hello', 'hello'), (5, 6, 7, b'\xc3\x95scarscar', '?scarscar'), (15, 2, 3, b'hello', 'hello'), (20, 2, 3, b'\xc3\x95scar', '?scar')], dtype=[('f0', '>> (dta_4['f3'][:, None] == np.unique(dta_4['f3'])).astype(int) array([[1, 0, 0], [0, 0, 1], [1, 0, 0], [0, 1, 0]]) >>> (dta_4['f4'][:, None] == np.unique(dta_4['f4'])).astype(int) array([[1, 0, 0], [0, 0, 1], [1, 0, 0], [0, 1, 0]]) similar doing a for loop comparing to the uniques. bytes are fine and nobody has to tell me what encoding they are using. It doesn't work so well for pretty printing results, so using there latin-1 as you describe above might be a good solution if users don't decode to text/string Josef > >> I'm mainly worried about backwards compatibility, since we have been >> using this for 2 or 3 years. It would be easy to change in statsmodels >> when gen/recfromtxt is fixed, but I assume there is lots of other code >> using similar interpretation of S/bytes in numpy. > > > well, it does sound like enough folks are using 'S' to mean bytes -- too > bad, but what can we do now about that? > > I'd like a 's' for ascii-stings though. > > -Chris > > -- > > Christopher Barker, Ph.D. 
> Oceanographer > > Emergency Response Division > NOAA/NOS/OR&R (206) 526-6959 voice > 7600 Sand Point Way NE (206) 526-6329 fax > Seattle, WA 98115 (206) 526-6317 main reception > > Chris.Barker at noaa.gov > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From chris.barker at noaa.gov Fri Jan 17 16:55:56 2014 From: chris.barker at noaa.gov (Chris Barker) Date: Fri, 17 Jan 2014 13:55:56 -0800 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: References: <52D68161.7090807@googlemail.com> <20140116104303.GA11119@gmail.com> <20140117124414.GA2253@gmail.com> <20140117134033.GB2253@gmail.com> <52D98217.7020000@googlemail.com> Message-ID: On Fri, Jan 17, 2014 at 1:43 PM, wrote: > > 2) Either: > > a) open as a binary file and use bytes for anything that doesn't > parse > > as text -- this means that the user will need to do the conversion to > text > > themselves > > > > b) decode as latin-1: this would work well for ascii and _some_ > non-ascii > > text, and would be recoverable for ALL text. > > But also solution 2a) is fine for most of the code > Often it doesn't really matter > indeed -- I did list it as an option ;-) > >>> dta_4 > array([(1, 2, 3, b'hello', 'hello'), > (5, 6, 7, b'\xc3\x95scarscar', '?scarscar'), > (15, 2, 3, b'hello', 'hello'), (20, 2, 3, b'\xc3\x95scar', > '?scar')], > dtype=[('f0', ' 'S10'), ('f4', ' > >>> (dta_4['f3'][:, None] == np.unique(dta_4['f3'])).astype(int) > array([[1, 0, 0], > [0, 0, 1], > [1, 0, 0], > [0, 1, 0]]) > >>> (dta_4['f4'][:, None] == np.unique(dta_4['f4'])).astype(int) > array([[1, 0, 0], > [0, 0, 1], > [1, 0, 0], > [0, 1, 0]]) > > similar doing a for loop comparing to the uniques. > bytes are fine and nobody has to tell me what encoding they are using. > and this same operation would work fine if that text was in (possibly improperly decoded) unicode objects. > It doesn't work so well for pretty printing results, so using there > latin-1 as you describe above might be a good solution if users don't > decode to text/string > exactly -- if you really need to work with the text, you need to know the encoding. Period. End of Story. If you don't know the encoding then there is still some stuff you can do with it, so you want something that: a) won't barf on any input b) will preserve the bytes if you need to pass them along, or compare them, or... Either bytes or latin-1 decoded strings will work for that. bytes are better, as it's more explicit that you may not have valid text here. unicode strings are better as you can do stringy things with them. Either way, you'll need to encode or decode to get full functionality. -Chris -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov -------------- next part -------------- An HTML attachment was scrubbed... URL: From chris.barker at noaa.gov Fri Jan 17 17:30:19 2014 From: chris.barker at noaa.gov (Chris Barker) Date: Fri, 17 Jan 2014 14:30:19 -0800 Subject: [Numpy-discussion] A one-byte string dtype? Message-ID: Folks, I've been blathering away on the related threads a lot -- sorry if it's too much. It's gotten a bit tangled up, so I thought I'd start a new one to address this one question (i.e. 
dont bring up genfromtext here): Would it be a good thing for numpy to have a one-byte--per-character string type? We did have that with the 'S' type in py2, but the changes in py3 have made it not quite the right thing. And it appears that enough people use 'S' in py3 to mean 'bytes', so that we can't change that now. The only difference may be that 'S' currently auto translates to a bytes object, resulting in things like: np.array(['some text',], dtype='S')[0] == 'some text' yielding False on Py3. And you can't do all the usual text stuff with the resulting bytes object, either. (and it probably used the default encoding to generate the bytes, so will barf on some inputs, though that may be unavoidable.) So you need to decode the bytes that are given back, and now that I think about it, I have no idea what encoding you'd need to use in the general case. So the correct solution is (particularly on py3) to use the 'U' (unicode) dtype for text in numpy arrays. However, the 'U' dtype is 4 bytes per character, and that may be "too big" for some use-cases. And there is a lot of text in scientific data sets that are pure ascii, or at least some 1-byte-per-character encoding. So, in the spirit of having multiple numeric types that use different amounts of memory, and can hold different ranges of values, a one-byte-per character dtype would be nice: (note, this opens the door for a 2-byte per (UCS-2) dtype too, I personally don't think that's worth it, but maybe that's because I'm an english speaker...) It could use the 's' (lower-case s) type identifier. For passing to/from python built-in objects, it would * Allow either Python bytes objects or Python unicode objects as input a) bytes objects would be passed through as-is b) unicode objects would be encoded as latin-1 [note: I'm not entirely sure that bytes objects should be allowed, but it would provide an nice efficiency in a fairly common case] * It would create python unicode text objects, decoded as latin-1. Could we have a way to specify another encoding? I'm not sure how that would fit into the dtype system. I've explained the latin-1 thing on other threads, but the short version is: - It will work perfectly for ascii text - It will work perfectly for latin-1 text (natch) - It will never give you an UnicodeEncodeError regardless of what arbitrary bytes you pass in. - It will preserve those arbitrary bytes through a encoding/decoding operation. (it still wouldn't allow you to store arbitrary unicode -- but that's the limitation of one-byte per character...) So: Bad idea all around: shut up already! or Fine idea, but who's going to write the code? not me! or We really should do this. (of course, with the options of amending the above not-very-fleshed out proposal) -Chris -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov -------------- next part -------------- An HTML attachment was scrubbed... 
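(A rough sketch of what that proposal would feel like in practice, approximated with today's 'S' dtype plus explicit latin-1 decode/encode at the boundaries -- this is only an emulation, not the proposed 's' dtype itself:

import numpy as np

stored = np.array([b'hello', b'\xc3\x95scar'], dtype='S6')   # one byte per character
text = np.char.decode(stored, 'latin-1')       # a 'U' array; never raises, whatever the bytes
text[0] == 'hello'                             # True -- ordinary str comparisons work again
np.all(np.char.encode(text, 'latin-1') == stored)   # True -- arbitrary bytes round-trip intact

The proposed dtype would simply do that decode/encode implicitly on item access while storing one byte per character throughout.)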
URL: From aldcroft at head.cfa.harvard.edu Fri Jan 17 17:40:47 2014 From: aldcroft at head.cfa.harvard.edu (Aldcroft, Thomas) Date: Fri, 17 Jan 2014 17:40:47 -0500 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: References: <52D68161.7090807@googlemail.com> <20140116104303.GA11119@gmail.com> <20140117124414.GA2253@gmail.com> <20140117134033.GB2253@gmail.com> <52D98217.7020000@googlemail.com> Message-ID: On Fri, Jan 17, 2014 at 4:43 PM, wrote: > On Fri, Jan 17, 2014 at 4:20 PM, Chris Barker > wrote: > > On Fri, Jan 17, 2014 at 12:36 PM, wrote: > >> > >> > ('S' ?) -- which is probably not what you want particularly if you > >> > specify > >> > an encoding. Though I can't figure out at the moment why the previous > >> > one > >> > failed -- where did the bytes object come from when the encoding was > >> > specified? > >> > >> Yes, it's a utf-8 file with nonascii. > >> > >> I don't know what I **should** want. > > > > > > well, you **should** want: > > > > The numbers parsed out for you (Other wise, why use recfromtxt), and the > > text as properly decoded unicode strings. > > > > Python does very well with unicode -- and you are MUCH happier if you do > the > > encoding/decoding as close to I/O as possible. recfromtxt is, in a way, > > decoding already, converting ascii representation of numbers to an > internal > > binary representation -- why not handle the text at the same time. > > > > There certainly are use cases for keeping the text as encoded bytes, but > I'd > > say those fall into the categories of: > > > > 1) Special case > > 2) You should know what you are doing. > > > > So having recfromtxt auto-determine that for you makes little sense. > > > > Note that if you don't know the file encoding, this is tricky. My > thoughts: > > > > 1) don't use the system default encoding!!! (see my other note on that!) > > > > 2) Either: > > a) open as a binary file and use bytes for anything that doesn't > parse > > as text -- this means that the user will need to do the conversion to > text > > themselves > > > > b) decode as latin-1: this would work well for ascii and _some_ > non-ascii > > text, and would be recoverable for ALL text. > > > > I prefer (b). The point here is that if the user gets bytes, then they > will > > either have to assume ascii, or need to hand-decode it, and if they just > > want assume ascii, they have a bytes object with limited text > functionality > > so will probably need to decode it anyway (unless they are just passing > it > > through) > > > > If the user gets unicode objects that are may not properly decoded, they > can > > either assume it was ascii, and if they only do ascii-compatible things > with > > it, it will work, or they can encode/decode it and get the proper stuff > > back, but only if they know the encoding, and if that's the case, why did > > they not specify that in the first place? > > > >> > >> For now I do want bytes, because that's how I changed statsmodels in > >> the py3 conversion. > >> > >> This was just based on the fact that recfromtxt doesn't work with > >> strings on python 3, so I switched to using bytes following the lead > >> of numpy. > > > > > > Well, that's really too bad -- it doesn't sound like you wanted bytes, it > > sounds like you wanted something that didn't crash -- fair enough. But > the > > "proper" solution is for recfromtext to support text.... 
> > But also solution 2a) is fine for most of the code > Often it doesn't really matter > > >>> dta_4 > array([(1, 2, 3, b'hello', 'hello'), > (5, 6, 7, b'\xc3\x95scarscar', '?scarscar'), > (15, 2, 3, b'hello', 'hello'), (20, 2, 3, b'\xc3\x95scar', > '?scar')], > dtype=[('f0', ' 'S10'), ('f4', ' > >>> (dta_4['f3'][:, None] == np.unique(dta_4['f3'])).astype(int) > array([[1, 0, 0], > [0, 0, 1], > [1, 0, 0], > [0, 1, 0]]) > >>> (dta_4['f4'][:, None] == np.unique(dta_4['f4'])).astype(int) > array([[1, 0, 0], > [0, 0, 1], > [1, 0, 0], > [0, 1, 0]]) > > similar doing a for loop comparing to the uniques. > bytes are fine and nobody has to tell me what encoding they are using. > >From my perspective bytes are not fine, at least if you want to use normal string literals in Python 3: In [64]: dat Out[64]: array([(1, 2, 3, b'hello', 'hello'), (5, 6, 7, b'\xc3\x95scarscar', '?scarscar'), (15, 2, 3, b'hello', 'hello'), (20, 2, 3, b'\xc3\x95scar', '?scar')], dtype=[('f0', ' in () ----> 1 'The 3rd element of f3 is "{}"'.format(dat['f3'][3]) RuntimeError: maximum recursion depth exceeded while calling a Python object > > It doesn't work so well for pretty printing results, so using there > latin-1 as you describe above might be a good solution if users don't > decode to text/string > > Josef > > > > >> I'm mainly worried about backwards compatibility, since we have been > >> using this for 2 or 3 years. It would be easy to change in statsmodels > >> when gen/recfromtxt is fixed, but I assume there is lots of other code > >> using similar interpretation of S/bytes in numpy. > > > > > > well, it does sound like enough folks are using 'S' to mean bytes -- too > > bad, but what can we do now about that? > > > > I'd like a 's' for ascii-stings though. > > > > -Chris > > > > -- > > > > Christopher Barker, Ph.D. > > Oceanographer > > > > Emergency Response Division > > NOAA/NOS/OR&R (206) 526-6959 voice > > 7600 Sand Point Way NE (206) 526-6329 fax > > Seattle, WA 98115 (206) 526-6317 main reception > > > > Chris.Barker at noaa.gov > > > > _______________________________________________ > > NumPy-Discussion mailing list > > NumPy-Discussion at scipy.org > > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From aldcroft at head.cfa.harvard.edu Fri Jan 17 18:05:16 2014 From: aldcroft at head.cfa.harvard.edu (Aldcroft, Thomas) Date: Fri, 17 Jan 2014 18:05:16 -0500 Subject: [Numpy-discussion] A one-byte string dtype? In-Reply-To: References: Message-ID: On Fri, Jan 17, 2014 at 5:30 PM, Chris Barker wrote: > Folks, > > I've been blathering away on the related threads a lot -- sorry if it's > too much. It's gotten a bit tangled up, so I thought I'd start a new one to > address this one question (i.e. dont bring up genfromtext here): > > Would it be a good thing for numpy to have a one-byte--per-character > string type? > > We did have that with the 'S' type in py2, but the changes in py3 have > made it not quite the right thing. And it appears that enough people use > 'S' in py3 to mean 'bytes', so that we can't change that now. > > The only difference may be that 'S' currently auto translates to a bytes > object, resulting in things like: > > np.array(['some text',], dtype='S')[0] == 'some text' > > yielding False on Py3. 
And you can't do all the usual text stuff with the > resulting bytes object, either. (and it probably used the default encoding > to generate the bytes, so will barf on some inputs, though that may be > unavoidable.) So you need to decode the bytes that are given back, and now > that I think about it, I have no idea what encoding you'd need to use in > the general case. > > So the correct solution is (particularly on py3) to use the 'U' (unicode) > dtype for text in numpy arrays. > > However, the 'U' dtype is 4 bytes per character, and that may be "too big" > for some use-cases. And there is a lot of text in scientific data sets that > are pure ascii, or at least some 1-byte-per-character encoding. > > So, in the spirit of having multiple numeric types that use different > amounts of memory, and can hold different ranges of values, a one-byte-per > character dtype would be nice: > > (note, this opens the door for a 2-byte per (UCS-2) dtype too, I > personally don't think that's worth it, but maybe that's because I'm an > english speaker...) > > It could use the 's' (lower-case s) type identifier. > > For passing to/from python built-in objects, it would > > * Allow either Python bytes objects or Python unicode objects as input > a) bytes objects would be passed through as-is > b) unicode objects would be encoded as latin-1 > > [note: I'm not entirely sure that bytes objects should be allowed, but it > would provide an nice efficiency in a fairly common case] > > * It would create python unicode text objects, decoded as latin-1. > > Could we have a way to specify another encoding? I'm not sure how that > would fit into the dtype system. > > I've explained the latin-1 thing on other threads, but the short version > is: > > - It will work perfectly for ascii text > - It will work perfectly for latin-1 text (natch) > - It will never give you an UnicodeEncodeError regardless of what > arbitrary bytes you pass in. > - It will preserve those arbitrary bytes through a encoding/decoding > operation. > > (it still wouldn't allow you to store arbitrary unicode -- but that's the > limitation of one-byte per character...) > > So: > > Bad idea all around: shut up already! > > or > > Fine idea, but who's going to write the code? not me! > > or > > We really should do this. > As evident from what I said in the previous thread, YES, this should really be done! One important feature would be changing the dtype from 'S' to 's' without any memory copies, so that conversion would be very cheap. Maybe this would essentially come for free with something like astype('s', copy=False). - Tom > > (of course, with the options of amending the above not-very-fleshed out > proposal) > > -Chris > > -- > > Christopher Barker, Ph.D. > Oceanographer > > Emergency Response Division > NOAA/NOS/OR&R (206) 526-6959 voice > 7600 Sand Point Way NE (206) 526-6329 fax > Seattle, WA 98115 (206) 526-6317 main reception > > Chris.Barker at noaa.gov > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... 
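(For scale, a sketch of the copy that a zero-copy 'S' -> 's' conversion would avoid; the array size is made up purely for illustration and assumes plain-ascii content:

import numpy as np

s = np.array([b'hello'] * 1000000, dtype='S5')
u = s.astype('U5')             # today: a full copy at 4 bytes per character
s.nbytes, u.nbytes             # (5000000, 20000000)
)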
URL: From josef.pktd at gmail.com Fri Jan 17 21:15:51 2014 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Fri, 17 Jan 2014 21:15:51 -0500 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: References: <52D68161.7090807@googlemail.com> <20140116104303.GA11119@gmail.com> <20140117124414.GA2253@gmail.com> <20140117134033.GB2253@gmail.com> <52D98217.7020000@googlemail.com> Message-ID: It looks like both recfromtxt and loadtxt are flexible enough to handle string/bytes en/decoding, - with a bit of work and using enough information >>> dtype=[('f0', '>> data = numpy.recfromtxt(open('?scar_3.txt',"rb"), dtype=dtype, delimiter=',',converters={3:lambda x: x.decode('utf8')}) >>> data['f3'] == '?scar' array([False, False, False, True], dtype=bool) >>> data rec.array([(1, 2, 3, 'hello'), (5, 6, 7, '?scarscar'), (15, 2, 3, 'hello'), (20, 2, 3, '?scar')], dtype=[('f0', '>> data = numpy.loadtxt(open('?scar_3.txt',"rb"), dtype=dtype, delimiter=',',converters={3:lambda x: x.decode('utf8')}) >>> data array([(1, 2, 3, 'hello'), (5, 6, 7, '?scarscar'), (15, 2, 3, 'hello'), (20, 2, 3, '?scar')], dtype=[('f0', '>> Josef From pjrandew at sun.ac.za Sat Jan 18 05:40:28 2014 From: pjrandew at sun.ac.za (Randewijk, PJ, Dr ) Date: Sat, 18 Jan 2014 10:40:28 +0000 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: References: <52D68161.7090807@googlemail.com> <20140116104303.GA11119@gmail.com> <20140117124414.GA2253@gmail.com> <20140117134033.GB2253@gmail.com> <52D98217.7020000@googlemail.com> , Message-ID: Gestuur vanaf my Samsung S3 Mini -------- Original message -------- From: josef.pktd at gmail.com Date: 18/01/2014 04:16 (GMT+02:00) To: Discussion of Numerical Python Subject: Re: [Numpy-discussion] using loadtxt to load a text file in to a numpy array It looks like both recfromtxt and loadtxt are flexible enough to handle string/bytes en/decoding, - with a bit of work and using enough information >>> dtype=[('f0', '>> data = numpy.recfromtxt(open('?scar_3.txt',"rb"), dtype=dtype, delimiter=',',converters={3:lambda x: x.decode('utf8')}) >>> data['f3'] == '?scar' array([False, False, False, True], dtype=bool) >>> data rec.array([(1, 2, 3, 'hello'), (5, 6, 7, '?scarscar'), (15, 2, 3, 'hello'), (20, 2, 3, '?scar')], dtype=[('f0', '>> data = numpy.loadtxt(open('?scar_3.txt',"rb"), dtype=dtype, delimiter=',',converters={3:lambda x: x.decode('utf8')}) >>> data array([(1, 2, 3, 'hello'), (5, 6, 7, '?scarscar'), (15, 2, 3, 'hello'), (20, 2, 3, '?scar')], dtype=[('f0', '>> Josef _______________________________________________ NumPy-Discussion mailing list NumPy-Discussion at scipy.org http://mail.scipy.org/mailman/listinfo/numpy-discussion ________________________________ E-pos vrywaringsklousule Hierdie e-pos mag vertroulike inligting bevat en mag regtens geprivilegeerd wees en is slegs bedoel vir die persoon aan wie dit geadresseer is. Indien u nie die bedoelde ontvanger is nie, word u hiermee in kennis gestel dat u hierdie dokument geensins mag gebruik, versprei of kopieer nie. Stel ook asseblief die sender onmiddellik per telefoon in kennis en vee die e-pos uit. Die Universiteit aanvaar nie aanspreeklikheid vir enige skade, verlies of uitgawe wat voortspruit uit hierdie e-pos en/of die oopmaak van enige l?ers aangeheg by hierdie e-pos nie. E-mail disclaimer This e-mail may contain confidential information and may be legally privileged and is intended only for the person to whom it is addressed. 
If you are not the intended recipient, you are notified that you may not use, distribute or copy this document in any manner whatsoever. Kindly also notify the sender immediately by telephone, and delete the e-mail. The University does not accept liability for any damage, loss or expense arising from this e-mail and/or accessing any files attached to this e-mail. -------------- next part -------------- An HTML attachment was scrubbed... URL: From jenny.stone125 at gmail.com Sat Jan 18 13:48:33 2014 From: jenny.stone125 at gmail.com (jennifer stone) Date: Sun, 19 Jan 2014 00:18:33 +0530 Subject: [Numpy-discussion] (no subject) Message-ID: Hello, This is Jennifer Stupensky. I would like to contribute to NumPy this GSoC. What are the potential projects that can be taken up within the scope of GSoC? Thanks a lot in anticipation Regards Jennifer -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Sat Jan 18 14:07:32 2014 From: charlesr.harris at gmail.com (Charles R Harris) Date: Sat, 18 Jan 2014 12:07:32 -0700 Subject: [Numpy-discussion] (no subject) In-Reply-To: References: Message-ID: Hi Jennifer, On Sat, Jan 18, 2014 at 11:48 AM, jennifer stone wrote: > Hello, > This is Jennifer Stupensky. I would like to contribute to NumPy this GSoC. > What are the potential projects that can be taken up within the scope of > GSoC? Thanks a lot in anticipation > Regards > What are your interests and experience? If you use numpy, are there things you would like to fix, or enhancements you would like to see? Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From oscar.j.benjamin at gmail.com Mon Jan 20 05:11:15 2014 From: oscar.j.benjamin at gmail.com (Oscar Benjamin) Date: Mon, 20 Jan 2014 10:11:15 +0000 Subject: [Numpy-discussion] A one-byte string dtype? In-Reply-To: References: Message-ID: <20140120101113.GA2178@gmail.com> On Fri, Jan 17, 2014 at 02:30:19PM -0800, Chris Barker wrote: > Folks, > > I've been blathering away on the related threads a lot -- sorry if it's too > much. It's gotten a bit tangled up, so I thought I'd start a new one to > address this one question (i.e. dont bring up genfromtext here): > > Would it be a good thing for numpy to have a one-byte--per-character string > type? If you mean a string type that can only hold latin-1 characters then I think that this is a step backwards. If you mean a dtype that holds bytes in a known, specifiable encoding and automatically decodes them to unicode strings when you call .item() and has a friendly repr() then that may be a good idea. So for example you could have dtype='S:utf-8' which would store strings encoded as utf-8 e.g.: >>> text = array(['foo', 'bar'], dtype='S:utf-8') >>> text array(['foo', 'bar'], dtype='|S3:utf-8') >>> print(a) ['foo', 'bar'] >>> a[0] 'foo' >>> a.nbytes 6 > We did have that with the 'S' type in py2, but the changes in py3 have made > it not quite the right thing. And it appears that enough people use 'S' in > py3 to mean 'bytes', so that we can't change that now. It wasn't really the right thing before either. That's why Python 3 has changed all of this. > The only difference may be that 'S' currently auto translates to a bytes > object, resulting in things like: > > np.array(['some text',], dtype='S')[0] == 'some text' > > yielding False on Py3. And you can't do all the usual text stuff with the > resulting bytes object, either. 
(and it probably used the default encoding > to generate the bytes, so will barf on some inputs, though that may be > unavoidable.) So you need to decode the bytes that are given back, and now > that I think about it, I have no idea what encoding you'd need to use in > the general case. You should let the user specify the encoding or otherwise require them to use the 'U' dtype. > So the correct solution is (particularly on py3) to use the 'U' (unicode) > dtype for text in numpy arrays. Absolutely. Embrace the Python 3 text model. Once you understand the how, what and why of it you'll see that it really is a good thing! > However, the 'U' dtype is 4 bytes per character, and that may be "too big" > for some use-cases. And there is a lot of text in scientific data sets that > are pure ascii, or at least some 1-byte-per-character encoding. > > So, in the spirit of having multiple numeric types that use different > amounts of memory, and can hold different ranges of values, a one-byte-per > character dtype would be nice: > > (note, this opens the door for a 2-byte per (UCS-2) dtype too, I personally > don't think that's worth it, but maybe that's because I'm an english > speaker...) You could just use a 2-byte encoding with the S dtype e.g. dtype='S:utf-16-le'. > It could use the 's' (lower-case s) type identifier. > > For passing to/from python built-in objects, it would > > * Allow either Python bytes objects or Python unicode objects as input > a) bytes objects would be passed through as-is > b) unicode objects would be encoded as latin-1 > > [note: I'm not entirely sure that bytes objects should be allowed, but it > would provide an nice efficiency in a fairly common case] I think it would be a bad idea to accept bytes here. There are good reasons that Python 3 creates a barrier between the two worlds of text and bytes. Allowing implicit mixing of bytes and text is a recipe for mojibake. The TypeErrors in Python 3 are used to guard against conceptual errors that lead to data corruption. Attempting to undermine that barrier in numpy would be a backward step. I apologise if this is misplaced but there seems to be an attitude that scientific programming isn't really affected by the issues that have lead to the Python 3 text model. I think that's ridiculous; data corruption is a problem in scientific programming just as it is anywhere else. > * It would create python unicode text objects, decoded as latin-1. Don't try to bless a particular encoding and stop trying to pretend that it's possible to write a sensible system where end users don't need to worry about and specify the encoding of their data. > Could we have a way to specify another encoding? I'm not sure how that > would fit into the dtype system. If the encoding cannot be specified then the whole idea is misguided. > I've explained the latin-1 thing on other threads, but the short version is: > > - It will work perfectly for ascii text > - It will work perfectly for latin-1 text (natch) > - It will never give you an UnicodeEncodeError regardless of what > arbitrary bytes you pass in. > - It will preserve those arbitrary bytes through a encoding/decoding > operation. So what happens if I do: >>> with open('myutf-8-file.txt', 'rb') as fin: ... text = numpy.fromfile(fin, dtype='s') >>> text[0] # Decodes as latin-1 leading to mojibake. I would propose that it's better to be able to do: >>> with open('myutf-8-file.txt', 'rb') as fin: ... 
text = numpy.fromfile(fin, dtype='s:utf-8') There's really no way to get around the fact that users need to specify the encoding of their text files. > (it still wouldn't allow you to store arbitrary unicode -- but that's the > limitation of one-byte per character...) You could if you use 'utf-8'. It would be one-byte-per-char for text that only contains ascii characters. However it would still support every character that the unicode consortium can dream up. The only possible advantage here is as a memory optimisation (potentially having a speed impact too although it could equally be a speed regression). Otherwise it just adds needless complexity to numpy and to the code that uses the new dtype as well as limiting its ability to handle unicode. How significant are the performance issues? Does anyone really use numpy for this kind of text handling? If you really are operating on gigantic text arrays of ascii characters then is it so bad to just use the bytes dtype and handle decoding/encoding at the boundaries? If you're not operating on gigantic text arrays is there really a noticeable problem just using the 'U' dtype? Oscar From aldcroft at head.cfa.harvard.edu Mon Jan 20 10:00:55 2014 From: aldcroft at head.cfa.harvard.edu (Aldcroft, Thomas) Date: Mon, 20 Jan 2014 10:00:55 -0500 Subject: [Numpy-discussion] A one-byte string dtype? In-Reply-To: <20140120101113.GA2178@gmail.com> References: <20140120101113.GA2178@gmail.com> Message-ID: On Mon, Jan 20, 2014 at 5:11 AM, Oscar Benjamin wrote: > On Fri, Jan 17, 2014 at 02:30:19PM -0800, Chris Barker wrote: > > Folks, > > > > I've been blathering away on the related threads a lot -- sorry if it's > too > > much. It's gotten a bit tangled up, so I thought I'd start a new one to > > address this one question (i.e. dont bring up genfromtext here): > > > > Would it be a good thing for numpy to have a one-byte--per-character > string > > type? > > If you mean a string type that can only hold latin-1 characters then I > think > that this is a step backwards. > > If you mean a dtype that holds bytes in a known, specifiable encoding and > automatically decodes them to unicode strings when you call .item() and > has a > friendly repr() then that may be a good idea. > > So for example you could have dtype='S:utf-8' which would store strings > encoded as utf-8 e.g.: > > >>> text = array(['foo', 'bar'], dtype='S:utf-8') > >>> text > array(['foo', 'bar'], dtype='|S3:utf-8') > >>> print(a) > ['foo', 'bar'] > >>> a[0] > 'foo' > >>> a.nbytes > 6 > > > We did have that with the 'S' type in py2, but the changes in py3 have > made > > it not quite the right thing. And it appears that enough people use 'S' > in > > py3 to mean 'bytes', so that we can't change that now. > > It wasn't really the right thing before either. That's why Python 3 has > changed all of this. > > > The only difference may be that 'S' currently auto translates to a bytes > > object, resulting in things like: > > > > np.array(['some text',], dtype='S')[0] == 'some text' > > > > yielding False on Py3. And you can't do all the usual text stuff with the > > resulting bytes object, either. (and it probably used the default > encoding > > to generate the bytes, so will barf on some inputs, though that may be > > unavoidable.) So you need to decode the bytes that are given back, and > now > > that I think about it, I have no idea what encoding you'd need to use in > > the general case. > > You should let the user specify the encoding or otherwise require them to > use > the 'U' dtype. 
> > > So the correct solution is (particularly on py3) to use the 'U' (unicode) > > dtype for text in numpy arrays. > > Absolutely. Embrace the Python 3 text model. Once you understand the how, > what > and why of it you'll see that it really is a good thing! > > > However, the 'U' dtype is 4 bytes per character, and that may be "too > big" > > for some use-cases. And there is a lot of text in scientific data sets > that > > are pure ascii, or at least some 1-byte-per-character encoding. > > > > So, in the spirit of having multiple numeric types that use different > > amounts of memory, and can hold different ranges of values, a > one-byte-per > > character dtype would be nice: > > > > (note, this opens the door for a 2-byte per (UCS-2) dtype too, I > personally > > don't think that's worth it, but maybe that's because I'm an english > > speaker...) > > You could just use a 2-byte encoding with the S dtype e.g. > dtype='S:utf-16-le'. > > > It could use the 's' (lower-case s) type identifier. > > > > For passing to/from python built-in objects, it would > > > > * Allow either Python bytes objects or Python unicode objects as input > > a) bytes objects would be passed through as-is > > b) unicode objects would be encoded as latin-1 > > > > [note: I'm not entirely sure that bytes objects should be allowed, but it > > would provide an nice efficiency in a fairly common case] > > I think it would be a bad idea to accept bytes here. There are good reasons > that Python 3 creates a barrier between the two worlds of text and bytes. > Allowing implicit mixing of bytes and text is a recipe for mojibake. The > TypeErrors in Python 3 are used to guard against conceptual errors that > lead > to data corruption. Attempting to undermine that barrier in numpy would be > a > backward step. > > I apologise if this is misplaced but there seems to be an attitude that > scientific programming isn't really affected by the issues that have lead > to > the Python 3 text model. I think that's ridiculous; data corruption is a > problem in scientific programming just as it is anywhere else. > > > * It would create python unicode text objects, decoded as latin-1. > > Don't try to bless a particular encoding and stop trying to pretend that > it's > possible to write a sensible system where end users don't need to worry > about > and specify the encoding of their data. > > > Could we have a way to specify another encoding? I'm not sure how that > > would fit into the dtype system. > > If the encoding cannot be specified then the whole idea is misguided. > > > I've explained the latin-1 thing on other threads, but the short version > is: > > > > - It will work perfectly for ascii text > > - It will work perfectly for latin-1 text (natch) > > - It will never give you an UnicodeEncodeError regardless of what > > arbitrary bytes you pass in. > > - It will preserve those arbitrary bytes through a encoding/decoding > > operation. > > So what happens if I do: > > >>> with open('myutf-8-file.txt', 'rb') as fin: > ... text = numpy.fromfile(fin, dtype='s') > >>> text[0] # Decodes as latin-1 leading to mojibake. > > I would propose that it's better to be able to do: > > >>> with open('myutf-8-file.txt', 'rb') as fin: > ... text = numpy.fromfile(fin, dtype='s:utf-8') > > There's really no way to get around the fact that users need to specify the > encoding of their text files. > > > (it still wouldn't allow you to store arbitrary unicode -- but that's the > > limitation of one-byte per character...) 
> > You could if you use 'utf-8'. It would be one-byte-per-char for text that > only > contains ascii characters. However it would still support every character > that > the unicode consortium can dream up. > The only possible advantage here is as a memory optimisation (potentially > having a speed impact too although it could equally be a speed regression). > Otherwise it just adds needless complexity to numpy and to the code that > uses > the new dtype as well as limiting its ability to handle unicode. > How significant are the performance issues? Does anyone really use numpy > for > this kind of text handling? If you really are operating on gigantic text > arrays of ascii characters then is it so bad to just use the bytes dtype > and > handle decoding/encoding at the boundaries? If you're not operating on > gigantic text arrays is there really a noticeable problem just using the > 'U' > dtype? > I use numpy for giga-row arrays of short text strings, so memory and performance issues are real. As discussed in the previous parent thread, using the bytes dtype is really a problem because users of a text array want to do things like filtering (`match_rows = text_array == 'match'`), printing, or other manipulations in a natural way without having to continually use bytestring literals or `.decode('ascii')` everywhere. I tried converting a few packages while leaving the arrays as bytestrings and it just ended up as a very big mess. >From my perspective the goal here is to provide a pragmatic way to allow numpy-based applications and end users to use python 3. Something like this proposal seems to be the right direction, maybe not pure and perfect but a sensible step to get us there given the reality of scientific computing. - Tom > > > Oscar > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From oscar.j.benjamin at gmail.com Mon Jan 20 10:40:42 2014 From: oscar.j.benjamin at gmail.com (Oscar Benjamin) Date: Mon, 20 Jan 2014 15:40:42 +0000 Subject: [Numpy-discussion] A one-byte string dtype? In-Reply-To: References: <20140120101113.GA2178@gmail.com> Message-ID: <20140120154039.GD2178@gmail.com> On Mon, Jan 20, 2014 at 10:00:55AM -0500, Aldcroft, Thomas wrote: > On Mon, Jan 20, 2014 at 5:11 AM, Oscar Benjamin > wrote: > > How significant are the performance issues? Does anyone really use numpy > > for > > this kind of text handling? If you really are operating on gigantic text > > arrays of ascii characters then is it so bad to just use the bytes dtype > > and > > handle decoding/encoding at the boundaries? If you're not operating on > > gigantic text arrays is there really a noticeable problem just using the > > 'U' > > dtype? > > > > I use numpy for giga-row arrays of short text strings, so memory and > performance issues are real. > > As discussed in the previous parent thread, using the bytes dtype is really > a problem because users of a text array want to do things like filtering > (`match_rows = text_array == 'match'`), printing, or other manipulations in > a natural way without having to continually use bytestring literals or > `.decode('ascii')` everywhere. I tried converting a few packages while > leaving the arrays as bytestrings and it just ended up as a very big mess. 
> > From my perspective the goal here is to provide a pragmatic way to allow > numpy-based applications and end users to use python 3. Something like > this proposal seems to be the right direction, maybe not pure and perfect > but a sensible step to get us there given the reality of scientific > computing. I don't really see how writing b'match' instead of 'match' is that big a deal. And why are you needing to write .decode('ascii') everywhere? If you really do just want to work with bytes in your own known encoding then why not just read and write in binary mode? I apologise if I'm wrong but I suspect that much of the difficulty in getting the bytes/unicode separation right is down to the fact that a lot of the code you're using (or attempting to support) hasn't yet been ported to a clean text model. When I started using Python 3 it took me quite a few failed attempts at understanding the text model before I got to the point where I understood how it is supposed to be used. The problem was that I had been conflating text and bytes in many places, and that's hard to disentangle. Having fixed most of those problems I now understand why it is such an improvement. In any case I don't see anything wrong with a more efficient dtype for representing text if the user can specify the encoding. The problem is that numpy arrays expose their underlying memory buffer. Allowing them to interact directly with text strings on the one side and binary files on the other breaches Python 3's very good text model unless the user can specify the encoding that is to be used. Or at least if there is to be a blessed encoding then make it unicode-capable utf-8 instead of legacy ascii/latin-1. Oscar From aldcroft at head.cfa.harvard.edu Mon Jan 20 12:12:06 2014 From: aldcroft at head.cfa.harvard.edu (Aldcroft, Thomas) Date: Mon, 20 Jan 2014 12:12:06 -0500 Subject: [Numpy-discussion] A one-byte string dtype? In-Reply-To: <20140120154039.GD2178@gmail.com> References: <20140120101113.GA2178@gmail.com> <20140120154039.GD2178@gmail.com> Message-ID: On Mon, Jan 20, 2014 at 10:40 AM, Oscar Benjamin wrote: > On Mon, Jan 20, 2014 at 10:00:55AM -0500, Aldcroft, Thomas wrote: > > On Mon, Jan 20, 2014 at 5:11 AM, Oscar Benjamin > > wrote: > > > How significant are the performance issues? Does anyone really use > numpy > > > for > > > this kind of text handling? If you really are operating on gigantic > text > > > arrays of ascii characters then is it so bad to just use the bytes > dtype > > > and > > > handle decoding/encoding at the boundaries? If you're not operating on > > > gigantic text arrays is there really a noticeable problem just using > the > > > 'U' > > > dtype? > > > > > > > I use numpy for giga-row arrays of short text strings, so memory and > > performance issues are real. > > > > As discussed in the previous parent thread, using the bytes dtype is > really > > a problem because users of a text array want to do things like filtering > > (`match_rows = text_array == 'match'`), printing, or other manipulations > in > > a natural way without having to continually use bytestring literals or > > `.decode('ascii')` everywhere. I tried converting a few packages while > > leaving the arrays as bytestrings and it just ended up as a very big > mess. > > > > From my perspective the goal here is to provide a pragmatic way to allow > > numpy-based applications and end users to use python 3. 
Something like > > this proposal seems to be the right direction, maybe not pure and perfect > > but a sensible step to get us there given the reality of scientific > > computing. > > I don't really see how writing b'match' instead of 'match' is that big a > deal. > It's a big deal because all your existing python 2 code suddenly breaks on python 3, even after running 2to3. Yes, you can backfix all the python 2 code and use bytestring literals everywhere, but that is very painful and ugly. More importantly it's very fiddly because *sometimes* you'll need to use bytestring literals, and *sometimes* not, depending on the exact dataset you've been handed. That's basically a non-starter. As you say below, the only solution is a proper separation of bytes/unicode where everything internally is unicode. The problem is that the existing 4-byte unicode in numpy is a big performance / memory hit. It's even trickier because libraries will happily deliver a numpy structured array with an 'S'-dtype field (from a binary dataset on disk), and it's a pain to then convert to 'U' since you need to remake the entire structured array. With a one-byte unicode the goal would be an in-place update of 'S' to 's'. > And why are you needing to write .decode('ascii') everywhere? >>> print("The first value is {}".format(bytestring_array[0])) On Python 2 this gives "The first value is string_value", while on Python 3 this gives "The first value is b'string_value'". > If you really > do just want to work with bytes in your own known encoding then why not > just > read and write in binary mode? > > I apologise if I'm wrong but I suspect that much of the difficulty in > getting > the bytes/unicode separation right is down to the fact that a lot of the > code > you're using (or attempting to support) hasn't yet been ported to a clean > text > model. When I started using Python 3 it took me quite a few failed attempts > at understanding the text model before I got to the point where I > understood > how it is supposed to be used. The problem was that I had been conflating > text > and bytes in many places, and that's hard to disentangle. Having fixed > most of > those problems I now understand why it is such an improvement. > > In any case I don't see anything wrong with a more efficient dtype for > representing text if the user can specify the encoding. The problem is that > numpy arrays expose their underlying memory buffer. Allowing them to > interact > directly with text strings on the one side and binary files on the other > breaches Python 3's very good text model unless the user can specify the > encoding that is to be used. Or at least if there is to be a blessed > encoding > then make it unicode-capable utf-8 instead of legacy ascii/latin-1. > > > Oscar > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Mon Jan 20 12:17:21 2014 From: charlesr.harris at gmail.com (Charles R Harris) Date: Mon, 20 Jan 2014 10:17:21 -0700 Subject: [Numpy-discussion] A one-byte string dtype? 
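So on Python 3 the same line ends up needing an explicit decode, either per element or once for the whole column -- a sketch, with `bytestring_array` being the hypothetical array from the example above, assumed to hold plain ascii:

>>> print("The first value is {}".format(bytestring_array[0].decode('ascii')))
The first value is string_value
>>> text_array = np.char.decode(bytestring_array, 'ascii')   # or pay the 4x memory cost once up front

which is exactly the kind of boilerplate a one-byte text dtype would make unnecessary.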
In-Reply-To: References: <20140120101113.GA2178@gmail.com> Message-ID: On Mon, Jan 20, 2014 at 8:00 AM, Aldcroft, Thomas < aldcroft at head.cfa.harvard.edu> wrote: > > > > On Mon, Jan 20, 2014 at 5:11 AM, Oscar Benjamin < > oscar.j.benjamin at gmail.com> wrote: > >> On Fri, Jan 17, 2014 at 02:30:19PM -0800, Chris Barker wrote: >> > Folks, >> > >> > I've been blathering away on the related threads a lot -- sorry if it's >> too >> > much. It's gotten a bit tangled up, so I thought I'd start a new one to >> > address this one question (i.e. dont bring up genfromtext here): >> > >> > Would it be a good thing for numpy to have a one-byte--per-character >> string >> > type? >> >> If you mean a string type that can only hold latin-1 characters then I >> think >> that this is a step backwards. >> >> If you mean a dtype that holds bytes in a known, specifiable encoding and >> automatically decodes them to unicode strings when you call .item() and >> has a >> friendly repr() then that may be a good idea. >> >> So for example you could have dtype='S:utf-8' which would store strings >> encoded as utf-8 e.g.: >> >> >>> text = array(['foo', 'bar'], dtype='S:utf-8') >> >>> text >> array(['foo', 'bar'], dtype='|S3:utf-8') >> >>> print(a) >> ['foo', 'bar'] >> >>> a[0] >> 'foo' >> >>> a.nbytes >> 6 >> >> > We did have that with the 'S' type in py2, but the changes in py3 have >> made >> > it not quite the right thing. And it appears that enough people use 'S' >> in >> > py3 to mean 'bytes', so that we can't change that now. >> >> It wasn't really the right thing before either. That's why Python 3 has >> changed all of this. >> >> > The only difference may be that 'S' currently auto translates to a bytes >> > object, resulting in things like: >> > >> > np.array(['some text',], dtype='S')[0] == 'some text' >> > >> > yielding False on Py3. And you can't do all the usual text stuff with >> the >> > resulting bytes object, either. (and it probably used the default >> encoding >> > to generate the bytes, so will barf on some inputs, though that may be >> > unavoidable.) So you need to decode the bytes that are given back, and >> now >> > that I think about it, I have no idea what encoding you'd need to use in >> > the general case. >> >> You should let the user specify the encoding or otherwise require them to >> use >> the 'U' dtype. >> >> > So the correct solution is (particularly on py3) to use the 'U' >> (unicode) >> > dtype for text in numpy arrays. >> >> Absolutely. Embrace the Python 3 text model. Once you understand the how, >> what >> and why of it you'll see that it really is a good thing! >> >> > However, the 'U' dtype is 4 bytes per character, and that may be "too >> big" >> > for some use-cases. And there is a lot of text in scientific data sets >> that >> > are pure ascii, or at least some 1-byte-per-character encoding. >> > >> > So, in the spirit of having multiple numeric types that use different >> > amounts of memory, and can hold different ranges of values, a >> one-byte-per >> > character dtype would be nice: >> > >> > (note, this opens the door for a 2-byte per (UCS-2) dtype too, I >> personally >> > don't think that's worth it, but maybe that's because I'm an english >> > speaker...) >> >> You could just use a 2-byte encoding with the S dtype e.g. >> dtype='S:utf-16-le'. >> >> > It could use the 's' (lower-case s) type identifier. 
>> > >> > For passing to/from python built-in objects, it would >> > >> > * Allow either Python bytes objects or Python unicode objects as input >> > a) bytes objects would be passed through as-is >> > b) unicode objects would be encoded as latin-1 >> > >> > [note: I'm not entirely sure that bytes objects should be allowed, but >> it >> > would provide an nice efficiency in a fairly common case] >> >> I think it would be a bad idea to accept bytes here. There are good >> reasons >> that Python 3 creates a barrier between the two worlds of text and bytes. >> Allowing implicit mixing of bytes and text is a recipe for mojibake. The >> TypeErrors in Python 3 are used to guard against conceptual errors that >> lead >> to data corruption. Attempting to undermine that barrier in numpy would >> be a >> backward step. >> >> I apologise if this is misplaced but there seems to be an attitude that >> scientific programming isn't really affected by the issues that have lead >> to >> the Python 3 text model. I think that's ridiculous; data corruption is a >> problem in scientific programming just as it is anywhere else. >> >> > * It would create python unicode text objects, decoded as latin-1. >> >> Don't try to bless a particular encoding and stop trying to pretend that >> it's >> possible to write a sensible system where end users don't need to worry >> about >> and specify the encoding of their data. >> >> > Could we have a way to specify another encoding? I'm not sure how that >> > would fit into the dtype system. >> >> If the encoding cannot be specified then the whole idea is misguided. >> >> > I've explained the latin-1 thing on other threads, but the short >> version is: >> > >> > - It will work perfectly for ascii text >> > - It will work perfectly for latin-1 text (natch) >> > - It will never give you an UnicodeEncodeError regardless of what >> > arbitrary bytes you pass in. >> > - It will preserve those arbitrary bytes through a encoding/decoding >> > operation. >> >> So what happens if I do: >> >> >>> with open('myutf-8-file.txt', 'rb') as fin: >> ... text = numpy.fromfile(fin, dtype='s') >> >>> text[0] # Decodes as latin-1 leading to mojibake. >> >> I would propose that it's better to be able to do: >> >> >>> with open('myutf-8-file.txt', 'rb') as fin: >> ... text = numpy.fromfile(fin, dtype='s:utf-8') >> >> There's really no way to get around the fact that users need to specify >> the >> encoding of their text files. >> >> > (it still wouldn't allow you to store arbitrary unicode -- but that's >> the >> > limitation of one-byte per character...) >> >> You could if you use 'utf-8'. It would be one-byte-per-char for text that >> only >> contains ascii characters. However it would still support every character >> that >> the unicode consortium can dream up. > > >> The only possible advantage here is as a memory optimisation (potentially >> having a speed impact too although it could equally be a speed >> regression). >> Otherwise it just adds needless complexity to numpy and to the code that >> uses >> the new dtype as well as limiting its ability to handle unicode. > > >> How significant are the performance issues? Does anyone really use numpy >> for >> this kind of text handling? If you really are operating on gigantic text >> arrays of ascii characters then is it so bad to just use the bytes dtype >> and >> handle decoding/encoding at the boundaries? If you're not operating on >> gigantic text arrays is there really a noticeable problem just using the >> 'U' >> dtype? 
>> > > I use numpy for giga-row arrays of short text strings, so memory and > performance issues are real. > > As discussed in the previous parent thread, using the bytes dtype is > really a problem because users of a text array want to do things like > filtering (`match_rows = text_array == 'match'`), printing, or other > manipulations in a natural way without having to continually use bytestring > literals or `.decode('ascii')` everywhere. I tried converting a few > packages while leaving the arrays as bytestrings and it just ended up as a > very big mess. > > From my perspective the goal here is to provide a pragmatic way to allow > numpy-based applications and end users to use python 3. Something like > this proposal seems to be the right direction, maybe not pure and perfect > but a sensible step to get us there given the reality of scientific > computing. > I think that is right. Not having an effective way to handle these common scientific data sets will block acceptance of Python 3. But we do need to figure out the best way to add this functionality. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Mon Jan 20 12:21:27 2014 From: charlesr.harris at gmail.com (Charles R Harris) Date: Mon, 20 Jan 2014 10:21:27 -0700 Subject: [Numpy-discussion] A one-byte string dtype? In-Reply-To: References: <20140120101113.GA2178@gmail.com> <20140120154039.GD2178@gmail.com> Message-ID: On Mon, Jan 20, 2014 at 10:12 AM, Aldcroft, Thomas < aldcroft at head.cfa.harvard.edu> wrote: > > > > On Mon, Jan 20, 2014 at 10:40 AM, Oscar Benjamin < > oscar.j.benjamin at gmail.com> wrote: > >> On Mon, Jan 20, 2014 at 10:00:55AM -0500, Aldcroft, Thomas wrote: >> > On Mon, Jan 20, 2014 at 5:11 AM, Oscar Benjamin >> > wrote: >> > > How significant are the performance issues? Does anyone really use >> numpy >> > > for >> > > this kind of text handling? If you really are operating on gigantic >> text >> > > arrays of ascii characters then is it so bad to just use the bytes >> dtype >> > > and >> > > handle decoding/encoding at the boundaries? If you're not operating on >> > > gigantic text arrays is there really a noticeable problem just using >> the >> > > 'U' >> > > dtype? >> > > >> > >> > I use numpy for giga-row arrays of short text strings, so memory and >> > performance issues are real. >> > >> > As discussed in the previous parent thread, using the bytes dtype is >> really >> > a problem because users of a text array want to do things like filtering >> > (`match_rows = text_array == 'match'`), printing, or other >> manipulations in >> > a natural way without having to continually use bytestring literals or >> > `.decode('ascii')` everywhere. I tried converting a few packages while >> > leaving the arrays as bytestrings and it just ended up as a very big >> mess. >> > >> > From my perspective the goal here is to provide a pragmatic way to allow >> > numpy-based applications and end users to use python 3. Something like >> > this proposal seems to be the right direction, maybe not pure and >> perfect >> > but a sensible step to get us there given the reality of scientific >> > computing. >> >> I don't really see how writing b'match' instead of 'match' is that big a >> deal. >> > > It's a big deal because all your existing python 2 code suddenly breaks on > python 3, even after running 2to3. Yes, you can backfix all the python 2 > code and use bytestring literals everywhere, but that is very painful and > ugly. 
More importantly it's very fiddly because *sometimes* you'll need to > use bytestring literals, and *sometimes* not, depending on the exact > dataset you've been handed. That's basically a non-starter. > > As you say below, the only solution is a proper separation of > bytes/unicode where everything internally is unicode. The problem is that > the existing 4-byte unicode in numpy is a big performance / memory hit. > It's even trickier because libraries will happily deliver a numpy > structured array with an 'S'-dtype field (from a binary dataset on disk), > and it's a pain to then convert to 'U' since you need to remake the entire > structured array. With a one-byte unicode the goal would be an in-place > update of 'S' to 's'. > > >> And why are you needing to write .decode('ascii') everywhere? > > > >>> print("The first value is {}".format(bytestring_array[0])) > > On Python 2 this gives "The first value is string_value", while on Python > 3 this gives "The first value is b'string_value'". > As Nathaniel has mentioned, this is a known problem with Python 3 and the developers are trying to come up with a solution. Python 3.4 solves some existing problems, but this one remains. It's not just numpy here, it's that python itself needs to provide some help. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From oscar.j.benjamin at gmail.com Mon Jan 20 13:40:32 2014 From: oscar.j.benjamin at gmail.com (Oscar Benjamin) Date: Mon, 20 Jan 2014 18:40:32 +0000 Subject: [Numpy-discussion] A one-byte string dtype? In-Reply-To: References: <20140120101113.GA2178@gmail.com> <20140120154039.GD2178@gmail.com> Message-ID: On Jan 20, 2014 5:21 PM, "Charles R Harris" wrote: > On Mon, Jan 20, 2014 at 10:12 AM, Aldcroft, Thomas < aldcroft at head.cfa.harvard.edu> wrote: >> On Mon, Jan 20, 2014 at 10:40 AM, Oscar Benjamin < oscar.j.benjamin at gmail.com> wrote: >>> On Mon, Jan 20, 2014 at 10:00:55AM -0500, Aldcroft, Thomas wrote: >>> > On Mon, Jan 20, 2014 at 5:11 AM, Oscar Benjamin >>> >>> And why are you needing to write .decode('ascii') everywhere? >> >> >>> print("The first value is {}".format(bytestring_array[0])) >> >> On Python 2 this gives "The first value is string_value", while on Python 3 this gives "The first value is b'string_value'". > > As Nathaniel has mentioned, this is a known problem with Python 3 and the developers are trying to come up with a solution. Python 3.4 solves some existing problems, but this one remains. It's not just numpy here, it's that python itself needs to provide some help. If you think that anything in core Python will change so that you can mix text and bytes as above then I think you are very much mistaken. If you're referring to PEP 460/461 then you have misunderstood the purpose of those PEPs. The authors and reviewers will carefully ensure that nothing changes to make the above work the way that it did in 2.x. Oscar -------------- next part -------------- An HTML attachment was scrubbed... URL: From d.l.goldsmith at gmail.com Mon Jan 20 14:37:02 2014 From: d.l.goldsmith at gmail.com (David Goldsmith) Date: Mon, 20 Jan 2014 11:37:02 -0800 Subject: [Numpy-discussion] A one-byte string dtype? (Charles R Harris) Message-ID: On Mon, Jan 20, 2014 at 9:11 AM, wrote: > I think that is right. Not having an effective way to handle these common > scientific data sets will block acceptance of Python 3. But we do need to > figure out the best way to add this functionality. 
> > Chuck > Sounds like it might be time for some formal data collection, e.g., a wiki-poll of users' use-cases. (I know this wouldn't be exhaustive, but at least it will provide guidance and a "checklist" of situations we should be sure our solution covers.) DG -------------- next part -------------- An HTML attachment was scrubbed... URL: From josef.pktd at gmail.com Mon Jan 20 15:13:08 2014 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Mon, 20 Jan 2014 15:13:08 -0500 Subject: [Numpy-discussion] A one-byte string dtype? In-Reply-To: References: <20140120101113.GA2178@gmail.com> <20140120154039.GD2178@gmail.com> Message-ID: On Mon, Jan 20, 2014 at 12:12 PM, Aldcroft, Thomas wrote: > > > > On Mon, Jan 20, 2014 at 10:40 AM, Oscar Benjamin > wrote: >> >> On Mon, Jan 20, 2014 at 10:00:55AM -0500, Aldcroft, Thomas wrote: >> > On Mon, Jan 20, 2014 at 5:11 AM, Oscar Benjamin >> > wrote: >> > > How significant are the performance issues? Does anyone really use >> > > numpy >> > > for >> > > this kind of text handling? If you really are operating on gigantic >> > > text >> > > arrays of ascii characters then is it so bad to just use the bytes >> > > dtype >> > > and >> > > handle decoding/encoding at the boundaries? If you're not operating on >> > > gigantic text arrays is there really a noticeable problem just using >> > > the >> > > 'U' >> > > dtype? >> > > >> > >> > I use numpy for giga-row arrays of short text strings, so memory and >> > performance issues are real. >> > >> > As discussed in the previous parent thread, using the bytes dtype is >> > really >> > a problem because users of a text array want to do things like filtering >> > (`match_rows = text_array == 'match'`), printing, or other manipulations >> > in >> > a natural way without having to continually use bytestring literals or >> > `.decode('ascii')` everywhere. I tried converting a few packages while >> > leaving the arrays as bytestrings and it just ended up as a very big >> > mess. >> > >> > From my perspective the goal here is to provide a pragmatic way to allow >> > numpy-based applications and end users to use python 3. Something like >> > this proposal seems to be the right direction, maybe not pure and >> > perfect >> > but a sensible step to get us there given the reality of scientific >> > computing. >> >> I don't really see how writing b'match' instead of 'match' is that big a >> deal. > > > It's a big deal because all your existing python 2 code suddenly breaks on > python 3, even after running 2to3. Yes, you can backfix all the python 2 > code and use bytestring literals everywhere, but that is very painful and > ugly. More importantly it's very fiddly because *sometimes* you'll need to > use bytestring literals, and *sometimes* not, depending on the exact dataset > you've been handed. That's basically a non-starter. > > As you say below, the only solution is a proper separation of bytes/unicode > where everything internally is unicode. The problem is that the existing > 4-byte unicode in numpy is a big performance / memory hit. It's even > trickier because libraries will happily deliver a numpy structured array > with an 'S'-dtype field (from a binary dataset on disk), and it's a pain to > then convert to 'U' since you need to remake the entire structured array. > With a one-byte unicode the goal would be an in-place update of 'S' to 's'. > >> >> And why are you needing to write .decode('ascii') everywhere? 
> > >>>> print("The first value is {}".format(bytestring_array[0])) > > On Python 2 this gives "The first value is string_value", while on Python 3 > this gives "The first value is b'string_value'". Unfortunately (?) setprintoptions and set_string_function don't work with numpy scalars AFAICS. If it did then it would be possible to override the string representation. It works for arrays. I didn't find the right key for numpy.bytes_ on python 3.3 so now my interpreter can only print bytes np.set_printoptions(formatter={'all':lambda x: x.decode('ascii',errors="ignore") }) Josef > >> >> If you really >> do just want to work with bytes in your own known encoding then why not >> just >> read and write in binary mode? >> >> I apologise if I'm wrong but I suspect that much of the difficulty in >> getting >> the bytes/unicode separation right is down to the fact that a lot of the >> code >> you're using (or attempting to support) hasn't yet been ported to a clean >> text >> model. When I started using Python 3 it took me quite a few failed >> attempts >> at understanding the text model before I got to the point where I >> understood >> how it is supposed to be used. The problem was that I had been conflating >> text >> and bytes in many places, and that's hard to disentangle. Having fixed >> most of >> those problems I now understand why it is such an improvement. >> >> In any case I don't see anything wrong with a more efficient dtype for >> representing text if the user can specify the encoding. The problem is >> that >> numpy arrays expose their underlying memory buffer. Allowing them to >> interact >> directly with text strings on the one side and binary files on the other >> breaches Python 3's very good text model unless the user can specify the >> encoding that is to be used. Or at least if there is to be a blessed >> encoding >> then make it unicode-capable utf-8 instead of legacy ascii/latin-1. >> >> >> Oscar >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From charlesr.harris at gmail.com Mon Jan 20 15:34:56 2014 From: charlesr.harris at gmail.com (Charles R Harris) Date: Mon, 20 Jan 2014 13:34:56 -0700 Subject: [Numpy-discussion] A one-byte string dtype? In-Reply-To: References: <20140120101113.GA2178@gmail.com> <20140120154039.GD2178@gmail.com> Message-ID: On Mon, Jan 20, 2014 at 11:40 AM, Oscar Benjamin wrote: > > On Jan 20, 2014 5:21 PM, "Charles R Harris" > wrote: > > On Mon, Jan 20, 2014 at 10:12 AM, Aldcroft, Thomas < > aldcroft at head.cfa.harvard.edu> wrote: > >> On Mon, Jan 20, 2014 at 10:40 AM, Oscar Benjamin < > oscar.j.benjamin at gmail.com> wrote: > >>> On Mon, Jan 20, 2014 at 10:00:55AM -0500, Aldcroft, Thomas wrote: > >>> > On Mon, Jan 20, 2014 at 5:11 AM, Oscar Benjamin > >>> > >>> And why are you needing to write .decode('ascii') everywhere? > >> > >> >>> print("The first value is {}".format(bytestring_array[0])) > >> > >> On Python 2 this gives "The first value is string_value", while on > Python 3 this gives "The first value is b'string_value'". > > > > As Nathaniel has mentioned, this is a known problem with Python 3 and > the developers are trying to come up with a solution. Python 3.4 solves > some existing problems, but this one remains. 
It's not just numpy here, > it's that python itself needs to provide some help. > > If you think that anything in core Python will change so that you can mix > text and bytes as above then I think you are very much mistaken. If you're > referring to PEP 460/461 then you have misunderstood the purpose of those > PEPs. The authors and reviewers will carefully ensure that nothing changes > to make the above work the way that it did in 2.x. > I think we may want something like PEP 393. The S datatype may be the wrong place to look, we might want a modification of U instead so as to transparently get the benefit of python strings. Chuck > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From oscar.j.benjamin at gmail.com Mon Jan 20 16:27:48 2014 From: oscar.j.benjamin at gmail.com (Oscar Benjamin) Date: Mon, 20 Jan 2014 21:27:48 +0000 Subject: [Numpy-discussion] A one-byte string dtype? In-Reply-To: References: <20140120101113.GA2178@gmail.com> <20140120154039.GD2178@gmail.com> Message-ID: On Jan 20, 2014 8:35 PM, "Charles R Harris" wrote: > > I think we may want something like PEP 393. The S datatype may be the wrong place to look, we might want a modification of U instead so as to transparently get the benefit of python strings. The approach taken in PEP 393 (the FSR) makes more sense for str than it does for numpy arrays for two reasons: str is immutable and opaque. Since str is immutable the maximum code point in the string can be determined once when the string is created before anything else can get a pointer to the string buffer. Since it is opaque no one can rightly expect it to expose a particular binary format so it is free to choose without compromising any expected semantics. If someone can call buffer on an array then the FSR is a semantic change. If a numpy 'U' array used the FSR and consisted only of ASCII characters then it would have a one byte per char buffer. What then happens if you put a higher code point in? The buffer needs to be resized and the data copied over. But then what happens to any buffer objects or array views? They would be pointing at the old buffer from before the resize. Subsequent modifications to the resized array would not show up in other views and vice versa. I don't think that this can be done transparently since users of a numpy array need to know about the binary representation. That's why I suggest a dtype that has an encoding. Only in that way can it consistently have both a binary and a text interface. Oscar -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Mon Jan 20 17:28:09 2014 From: charlesr.harris at gmail.com (Charles R Harris) Date: Mon, 20 Jan 2014 15:28:09 -0700 Subject: [Numpy-discussion] A one-byte string dtype? In-Reply-To: References: <20140120101113.GA2178@gmail.com> <20140120154039.GD2178@gmail.com> Message-ID: On Mon, Jan 20, 2014 at 2:27 PM, Oscar Benjamin wrote: > > On Jan 20, 2014 8:35 PM, "Charles R Harris" > wrote: > > > > I think we may want something like PEP 393. The S datatype may be the > wrong place to look, we might want a modification of U instead so as to > transparently get the benefit of python strings. > > The approach taken in PEP 393 (the FSR) makes more sense for str than it > does for numpy arrays for two reasons: str is immutable and opaque. 
> > Since str is immutable the maximum code point in the string can be > determined once when the string is created before anything else can get a > pointer to the string buffer. > > Since it is opaque no one can rightly expect it to expose a particular > binary format so it is free to choose without compromising any expected > semantics. > > If someone can call buffer on an array then the FSR is a semantic change. > > If a numpy 'U' array used the FSR and consisted only of ASCII characters > then it would have a one byte per char buffer. What then happens if you put > a higher code point in? The buffer needs to be resized and the data copied > over. But then what happens to any buffer objects or array views? They > would be pointing at the old buffer from before the resize. Subsequent > modifications to the resized array would not show up in other views and > vice versa. > > I don't think that this can be done transparently since users of a numpy > array need to know about the binary representation. That's why I suggest a > dtype that has an encoding. Only in that way can it consistently have both > a binary and a text interface. > I didn't say we should change the S type, but that we should have something, say 's', that appeared to python as a string. I think if we want transparent string interoperability with python together with a compressed representation, and I think we need both, we are going to have to deal with the difficulties of utf-8. That means raising errors if the string doesn't fit in the allotted size, etc. Mind, this is a workaround for the mass of ascii data that is already out there, not a substitute for 'U'. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Mon Jan 20 17:35:12 2014 From: njs at pobox.com (Nathaniel Smith) Date: Mon, 20 Jan 2014 22:35:12 +0000 Subject: [Numpy-discussion] A one-byte string dtype? In-Reply-To: References: <20140120101113.GA2178@gmail.com> <20140120154039.GD2178@gmail.com> Message-ID: On Mon, Jan 20, 2014 at 10:28 PM, Charles R Harris wrote: > > > > On Mon, Jan 20, 2014 at 2:27 PM, Oscar Benjamin > wrote: >> >> >> On Jan 20, 2014 8:35 PM, "Charles R Harris" >> wrote: >> > >> > I think we may want something like PEP 393. The S datatype may be the >> > wrong place to look, we might want a modification of U instead so as to >> > transparently get the benefit of python strings. >> >> The approach taken in PEP 393 (the FSR) makes more sense for str than it >> does for numpy arrays for two reasons: str is immutable and opaque. >> >> Since str is immutable the maximum code point in the string can be >> determined once when the string is created before anything else can get a >> pointer to the string buffer. >> >> Since it is opaque no one can rightly expect it to expose a particular >> binary format so it is free to choose without compromising any expected >> semantics. >> >> If someone can call buffer on an array then the FSR is a semantic change. >> >> If a numpy 'U' array used the FSR and consisted only of ASCII characters >> then it would have a one byte per char buffer. What then happens if you put >> a higher code point in? The buffer needs to be resized and the data copied >> over. But then what happens to any buffer objects or array views? They would >> be pointing at the old buffer from before the resize. Subsequent >> modifications to the resized array would not show up in other views and vice >> versa. 
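The shared-buffer problem described above can be seen directly with the
current 'U' dtype; a small illustration:

import numpy as np

# Views share the array's buffer; a representation that reallocated the
# buffer when a higher code point appeared would leave views like v behind.
a = np.array(['abc', 'def'], dtype='U3')
v = a.view(np.uint32).reshape(2, 3)   # the raw UCS-4 code points, same buffer
a[0] = 'xyz'
print(v[0])                           # [120 121 122] -- the view sees the change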
>> >> I don't think that this can be done transparently since users of a numpy >> array need to know about the binary representation. That's why I suggest a >> dtype that has an encoding. Only in that way can it consistently have both a >> binary and a text interface. > > > I didn't say we should change the S type, but that we should have something, > say 's', that appeared to python as a string. I think if we want transparent > string interoperability with python together with a compressed > representation, and I think we need both, we are going to have to deal with > the difficulties of utf-8. That means raising errors if the string doesn't > fit in the allotted size, etc. Mind, this is a workaround for the mass of > ascii data that is already out there, not a substitute for 'U'. If we're going to be taking that much trouble, I'd suggest going ahead and adding a variable-length string type (where the array itself contains a pointer to a lookaside buffer, maybe with an optimization for stashing short strings directly). The fixed-length requirement is pretty onerous for lots of applications (e.g., pandas always uses dtype="O" for strings -- and that might be a good workaround for some people in this thread for now). The use of a lookaside buffer would also make it practical to resize the buffer when the maximum code point changed, for that matter... Though, IMO any new dtype here would need a cleanup of the dtype code first so that it doesn't require yet more massive special cases all over umath.so. -n -- Nathaniel J. Smith Postdoctoral researcher - Informatics - University of Edinburgh http://vorpus.org From charlesr.harris at gmail.com Mon Jan 20 17:58:26 2014 From: charlesr.harris at gmail.com (Charles R Harris) Date: Mon, 20 Jan 2014 15:58:26 -0700 Subject: [Numpy-discussion] A one-byte string dtype? In-Reply-To: References: <20140120101113.GA2178@gmail.com> <20140120154039.GD2178@gmail.com> Message-ID: On Mon, Jan 20, 2014 at 3:35 PM, Nathaniel Smith wrote: > On Mon, Jan 20, 2014 at 10:28 PM, Charles R Harris > wrote: > > > > > > > > On Mon, Jan 20, 2014 at 2:27 PM, Oscar Benjamin < > oscar.j.benjamin at gmail.com> > > wrote: > >> > >> > >> On Jan 20, 2014 8:35 PM, "Charles R Harris" > >> wrote: > >> > > >> > I think we may want something like PEP 393. The S datatype may be the > >> > wrong place to look, we might want a modification of U instead so as > to > >> > transparently get the benefit of python strings. > >> > >> The approach taken in PEP 393 (the FSR) makes more sense for str than it > >> does for numpy arrays for two reasons: str is immutable and opaque. > >> > >> Since str is immutable the maximum code point in the string can be > >> determined once when the string is created before anything else can get > a > >> pointer to the string buffer. > >> > >> Since it is opaque no one can rightly expect it to expose a particular > >> binary format so it is free to choose without compromising any expected > >> semantics. > >> > >> If someone can call buffer on an array then the FSR is a semantic > change. > >> > >> If a numpy 'U' array used the FSR and consisted only of ASCII characters > >> then it would have a one byte per char buffer. What then happens if you > put > >> a higher code point in? The buffer needs to be resized and the data > copied > >> over. But then what happens to any buffer objects or array views? They > would > >> be pointing at the old buffer from before the resize. 
Subsequent > >> modifications to the resized array would not show up in other views and > vice > >> versa. > >> > >> I don't think that this can be done transparently since users of a numpy > >> array need to know about the binary representation. That's why I > suggest a > >> dtype that has an encoding. Only in that way can it consistently have > both a > >> binary and a text interface. > > > > > > I didn't say we should change the S type, but that we should have > something, > > say 's', that appeared to python as a string. I think if we want > transparent > > string interoperability with python together with a compressed > > representation, and I think we need both, we are going to have to deal > with > > the difficulties of utf-8. That means raising errors if the string > doesn't > > fit in the allotted size, etc. Mind, this is a workaround for the mass of > > ascii data that is already out there, not a substitute for 'U'. > > If we're going to be taking that much trouble, I'd suggest going ahead > and adding a variable-length string type (where the array itself > contains a pointer to a lookaside buffer, maybe with an optimization > for stashing short strings directly). The fixed-length requirement is > pretty onerous for lots of applications (e.g., pandas always uses > dtype="O" for strings -- and that might be a good workaround for some > people in this thread for now). The use of a lookaside buffer would > also make it practical to resize the buffer when the maximum code > point changed, for that matter... > > Though, IMO any new dtype here would need a cleanup of the dtype code > first so that it doesn't require yet more massive special cases all > over umath.so. > Worth thinking about. As another alternative, what is the minimum we need to make a restricted encoding, say latin-1, appear transparently as a unicode string to python? I know the python folks don't like this much, but I suspect something along that line will eventually be required for the http folks. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Mon Jan 20 18:12:20 2014 From: charlesr.harris at gmail.com (Charles R Harris) Date: Mon, 20 Jan 2014 16:12:20 -0700 Subject: [Numpy-discussion] A one-byte string dtype? In-Reply-To: References: <20140120101113.GA2178@gmail.com> <20140120154039.GD2178@gmail.com> Message-ID: On Mon, Jan 20, 2014 at 3:58 PM, Charles R Harris wrote: > > > > On Mon, Jan 20, 2014 at 3:35 PM, Nathaniel Smith wrote: > >> On Mon, Jan 20, 2014 at 10:28 PM, Charles R Harris >> wrote: >> > >> > >> > >> > On Mon, Jan 20, 2014 at 2:27 PM, Oscar Benjamin < >> oscar.j.benjamin at gmail.com> >> > wrote: >> >> >> >> >> >> On Jan 20, 2014 8:35 PM, "Charles R Harris" > > >> >> wrote: >> >> > >> >> > I think we may want something like PEP 393. The S datatype may be the >> >> > wrong place to look, we might want a modification of U instead so as >> to >> >> > transparently get the benefit of python strings. >> >> >> >> The approach taken in PEP 393 (the FSR) makes more sense for str than >> it >> >> does for numpy arrays for two reasons: str is immutable and opaque. >> >> >> >> Since str is immutable the maximum code point in the string can be >> >> determined once when the string is created before anything else can >> get a >> >> pointer to the string buffer. >> >> >> >> Since it is opaque no one can rightly expect it to expose a particular >> >> binary format so it is free to choose without compromising any expected >> >> semantics. 
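The dtype="O" workaround mentioned above is roughly this (a sketch; each
element is an ordinary Python str):

import numpy as np

# Sketch of the dtype="O" workaround: each element is a real Python str,
# so Python 3 text semantics just work, at the cost of a pointer per
# element plus the per-string object overhead.
names = np.array(['alpha', 'beta', 'gamma'], dtype=object)
print(names == 'beta')      # [False  True False]
print(names[0].upper())     # normal str methods on the elements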
>> >> >> >> If someone can call buffer on an array then the FSR is a semantic >> change. >> >> >> >> If a numpy 'U' array used the FSR and consisted only of ASCII >> characters >> >> then it would have a one byte per char buffer. What then happens if >> you put >> >> a higher code point in? The buffer needs to be resized and the data >> copied >> >> over. But then what happens to any buffer objects or array views? They >> would >> >> be pointing at the old buffer from before the resize. Subsequent >> >> modifications to the resized array would not show up in other views >> and vice >> >> versa. >> >> >> >> I don't think that this can be done transparently since users of a >> numpy >> >> array need to know about the binary representation. That's why I >> suggest a >> >> dtype that has an encoding. Only in that way can it consistently have >> both a >> >> binary and a text interface. >> > >> > >> > I didn't say we should change the S type, but that we should have >> something, >> > say 's', that appeared to python as a string. I think if we want >> transparent >> > string interoperability with python together with a compressed >> > representation, and I think we need both, we are going to have to deal >> with >> > the difficulties of utf-8. That means raising errors if the string >> doesn't >> > fit in the allotted size, etc. Mind, this is a workaround for the mass >> of >> > ascii data that is already out there, not a substitute for 'U'. >> >> If we're going to be taking that much trouble, I'd suggest going ahead >> and adding a variable-length string type (where the array itself >> contains a pointer to a lookaside buffer, maybe with an optimization >> for stashing short strings directly). The fixed-length requirement is >> pretty onerous for lots of applications (e.g., pandas always uses >> dtype="O" for strings -- and that might be a good workaround for some >> people in this thread for now). The use of a lookaside buffer would >> also make it practical to resize the buffer when the maximum code >> point changed, for that matter... >> > The more I think about it, the more I think we may need to do that. Note that dynd has ragged arrays and I think they are implemented as pointers to buffers. The easy way for us to do that would be a specialization of object arrays to string types only as you suggest. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From fhaxbox66 at googlemail.com Tue Jan 21 01:34:19 2014 From: fhaxbox66 at googlemail.com (Dr. Leo) Date: Tue, 21 Jan 2014 07:34:19 +0100 Subject: [Numpy-discussion] Creating an ndarray from an iterable over sequences In-Reply-To: References: Message-ID: <52DE14EB.8010305@gmail.com> Hi, I would like to write something like: In [25]: iterable=((i, i**2) for i in range(10)) In [26]: a=np.fromiter(iterable, int32) --------------------------------------------------------------------------- ValueError Traceback (most recent call last) in () ----> 1 a=np.fromiter(iterable, int32) ValueError: setting an array element with a sequence. Is there an efficient way to do this? Creating two 1-dimensional arrays first is costly as one has to iterate twice over the data. So the only way I see is creating an empty [10,2] array and filling it row by row. This is memory-efficient but slow. List comprehension is vice versa. If there is no solution, wouldn't it be possible to rewrite fromiter so as to accept sequences? 
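A minimal sketch of the preallocate-and-fill fallback described above:

import numpy as np

# Sketch: preallocate the [10, 2] result and fill it row by row --
# memory-efficient, but the loop runs in the Python interpreter.
iterable = ((i, i**2) for i in range(10))
a = np.empty((10, 2), dtype=np.int32)
for row, pair in enumerate(iterable):
    a[row] = pair
print(a[:3])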
Leo From oscar.j.benjamin at gmail.com Tue Jan 21 06:13:36 2014 From: oscar.j.benjamin at gmail.com (Oscar Benjamin) Date: Tue, 21 Jan 2014 11:13:36 +0000 Subject: [Numpy-discussion] A one-byte string dtype? In-Reply-To: References: <20140120154039.GD2178@gmail.com> Message-ID: <20140121111334.GC2632@gmail.com> On Mon, Jan 20, 2014 at 04:12:20PM -0700, Charles R Harris wrote: > On Mon, Jan 20, 2014 at 3:58 PM, Charles R Harris > On Mon, Jan 20, 2014 at 3:35 PM, Nathaniel Smith wrote: > >> On Mon, Jan 20, 2014 at 10:28 PM, Charles R Harris wrote: > >> > > >> > I didn't say we should change the S type, but that we should have > >> something, > >> > say 's', that appeared to python as a string. I think if we want > >> transparent > >> > string interoperability with python together with a compressed > >> > representation, and I think we need both, we are going to have to deal > >> with > >> > the difficulties of utf-8. That means raising errors if the string > >> doesn't > >> > fit in the allotted size, etc. Mind, this is a workaround for the mass > >> of > >> > ascii data that is already out there, not a substitute for 'U'. > >> > >> If we're going to be taking that much trouble, I'd suggest going ahead > >> and adding a variable-length string type (where the array itself > >> contains a pointer to a lookaside buffer, maybe with an optimization > >> for stashing short strings directly). The fixed-length requirement is > >> pretty onerous for lots of applications (e.g., pandas always uses > >> dtype="O" for strings -- and that might be a good workaround for some > >> people in this thread for now). The use of a lookaside buffer would > >> also make it practical to resize the buffer when the maximum code > >> point changed, for that matter... > >> > The more I think about it, the more I think we may need to do that. Note > that dynd has ragged arrays and I think they are implemented as pointers to > buffers. The easy way for us to do that would be a specialization of object > arrays to string types only as you suggest. This wouldn't necessarily help for the gigarows of short text strings use case (depending on what "short" means). Also even if it technically saves memory you may have a greater overhead from fragmenting your array all over the heap. On my 64 bit Linux system the size of a Python 3.3 str containing only ASCII characters is 49+N bytes. For the 'U' dtype it's 4N bytes. You get a memory saving over dtype='U' only if the strings are 17 characters or more. To get a 50% saving over dtype='U' you'd need strings of at least 49 characters. If the Numpy array would manage the buffers itself then that per string memory overhead would be eliminated in exchange for an 8 byte pointer and at least 1 byte to represent the length of the string (assuming you can somehow use Pascal strings when short enough - null bytes cannot be used). This gives an overhead of 9 bytes per string (or 5 on 32 bit). In this case you save memory if the strings are more than 3 characters long and you get at least a 50% saving for strings longer than 9 characters. Using utf-8 in the buffers eliminates the need to go around checking maximum code points etc. so I would guess that would be simpler to implement (CPython has now had to triple all of it's code paths that actually access the string buffer). Oscar From njs at pobox.com Tue Jan 21 06:41:30 2014 From: njs at pobox.com (Nathaniel Smith) Date: Tue, 21 Jan 2014 11:41:30 +0000 Subject: [Numpy-discussion] A one-byte string dtype? 
In-Reply-To: <20140121111334.GC2632@gmail.com> References: <20140120154039.GD2178@gmail.com> <20140121111334.GC2632@gmail.com> Message-ID: On 21 Jan 2014 11:13, "Oscar Benjamin" wrote: > If the Numpy array would manage the buffers itself then that per string memory > overhead would be eliminated in exchange for an 8 byte pointer and at least 1 > byte to represent the length of the string (assuming you can somehow use > Pascal strings when short enough - null bytes cannot be used). This gives an > overhead of 9 bytes per string (or 5 on 32 bit). In this case you save memory > if the strings are more than 3 characters long and you get at least a 50% > saving for strings longer than 9 characters. There are various optimisations possible as well. For ASCII strings of up to length 8, one could also use tagged pointers to eliminate the lookaside buffer entirely. (Alignment rules mean that pointers to allocated buffers always have the low bits zero; so you can make a rule that if the low bit is set to one, then this means the "pointer" itself should be interpreted as containing the string data; use the spare bit in the other bytes to encode the length.) In some cases it may also make sense to let identical strings share buffers, though this adds some overhead for reference counting and interning. -n -------------- next part -------------- An HTML attachment was scrubbed... URL: From e.antero.tammi at gmail.com Tue Jan 21 06:55:14 2014 From: e.antero.tammi at gmail.com (eat) Date: Tue, 21 Jan 2014 13:55:14 +0200 Subject: [Numpy-discussion] Creating an ndarray from an iterable over sequences In-Reply-To: <52DE14EB.8010305@gmail.com> References: <52DE14EB.8010305@gmail.com> Message-ID: Hi, On Tue, Jan 21, 2014 at 8:34 AM, Dr. Leo wrote: > Hi, > > I would like to write something like: > > In [25]: iterable=((i, i**2) for i in range(10)) > > In [26]: a=np.fromiter(iterable, int32) > --------------------------------------------------------------------------- > ValueError Traceback (most recent call > last) > in () > ----> 1 a=np.fromiter(iterable, int32) > > ValueError: setting an array element with a sequence. > > > Is there an efficient way to do this? > Perhaps you could just utilize structured arrays ( http://docs.scipy.org/doc/numpy/user/basics.rec.html), like: iterable= ((i, i**2) for i in range(10)) a= np.fromiter(iterable, [('a', int32), ('b', int32)], 10) a.view(int32).reshape(-1, 2) Out[]: array([[ 0, 0], [ 1, 1], [ 2, 4], [ 3, 9], [ 4, 16], [ 5, 25], [ 6, 36], [ 7, 49], [ 8, 64], [ 9, 81]]) My 2 cents, -eat > > Creating two 1-dimensional arrays first is costly as one has to > iterate twice over the data. So the only way I see is creating an > empty [10,2] array and filling it row by row. This is memory-efficient > but slow. List comprehension is vice versa. > > If there is no solution, wouldn't it be possible to rewrite fromiter > so as to accept sequences? > > Leo > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From oscar.j.benjamin at gmail.com Tue Jan 21 07:09:39 2014 From: oscar.j.benjamin at gmail.com (Oscar Benjamin) Date: Tue, 21 Jan 2014 12:09:39 +0000 Subject: [Numpy-discussion] Creating an ndarray from an iterable over sequences In-Reply-To: <52DE14EB.8010305@gmail.com> References: <52DE14EB.8010305@gmail.com> Message-ID: <20140121120937.GE2632@gmail.com> On Tue, Jan 21, 2014 at 07:34:19AM +0100, Dr. Leo wrote: > Hi, > > I would like to write something like: > > In [25]: iterable=((i, i**2) for i in range(10)) > > In [26]: a=np.fromiter(iterable, int32) > --------------------------------------------------------------------------- > ValueError Traceback (most recent call > last) > in () > ----> 1 a=np.fromiter(iterable, int32) > > ValueError: setting an array element with a sequence. > > > Is there an efficient way to do this? > > Creating two 1-dimensional arrays first is costly as one has to > iterate twice over the data. So the only way I see is creating an > empty [10,2] array and filling it row by row. This is memory-efficient > but slow. List comprehension is vice versa. You could use itertools: >>> from itertools import chain >>> g = ((i, i**2) for i in range(10)) >>> import numpy >>> numpy.fromiter(chain.from_iterable(g), numpy.int32).reshape(-1, 2) array([[ 0, 0], [ 1, 1], [ 2, 4], [ 3, 9], [ 4, 16], [ 5, 25], [ 6, 36], [ 7, 49], [ 8, 64], [ 9, 81]], dtype=int32) Oscar From oscar.j.benjamin at gmail.com Tue Jan 21 07:30:08 2014 From: oscar.j.benjamin at gmail.com (Oscar Benjamin) Date: Tue, 21 Jan 2014 12:30:08 +0000 Subject: [Numpy-discussion] A one-byte string dtype? In-Reply-To: References: <20140121111334.GC2632@gmail.com> Message-ID: <20140121123006.GF2632@gmail.com> On Tue, Jan 21, 2014 at 11:41:30AM +0000, Nathaniel Smith wrote: > On 21 Jan 2014 11:13, "Oscar Benjamin" wrote: > > If the Numpy array would manage the buffers itself then that per string > memory > > overhead would be eliminated in exchange for an 8 byte pointer and at > least 1 > > byte to represent the length of the string (assuming you can somehow use > > Pascal strings when short enough - null bytes cannot be used). This gives > an > > overhead of 9 bytes per string (or 5 on 32 bit). In this case you save > memory > > if the strings are more than 3 characters long and you get at least a 50% > > saving for strings longer than 9 characters. > > There are various optimisations possible as well. > > For ASCII strings of up to length 8, one could also use tagged pointers to > eliminate the lookaside buffer entirely. (Alignment rules mean that > pointers to allocated buffers always have the low bits zero; so you can > make a rule that if the low bit is set to one, then this means the > "pointer" itself should be interpreted as containing the string data; use > the spare bit in the other bytes to encode the length.) > > In some cases it may also make sense to let identical strings share > buffers, though this adds some overhead for reference counting and > interning. Would this new dtype have an opaque memory representation? What would happen in the following: >>> a = numpy.array(['CGA', 'GAT'], dtype='s') >>> memoryview(a) >>> with open('file', 'wb') as fout: ... a.tofile(fout) >>> with open('file', 'rb') as fin: ... a = numpy.fromfile(fin, dtype='s') Should there be a different function for creating such an array from reading a text file? Or would you just need to use fromiter: >>> with open('file', encoding='utf-8') as fin: ... 
a = numpy.fromiter(fin, dtype='s') >>> with open('file', encoding='utf-8') as fout: ... fout.writelines(line + '\n' for line in a) (Note that the above would not be reversible if the strings contain newlines) I think it Would be less confusing to use dtype='u' than dtype='U' in order to signify that it is an optimised form of the 'U' dtype as far as access from Python code is concerned? Calling it 's' only really makes sense if there is a plan to deprecate dtype='S'. How would it behave in Python 2? Would it return unicode strings there as well? Oscar From aldcroft at head.cfa.harvard.edu Tue Jan 21 07:54:21 2014 From: aldcroft at head.cfa.harvard.edu (Aldcroft, Thomas) Date: Tue, 21 Jan 2014 07:54:21 -0500 Subject: [Numpy-discussion] A one-byte string dtype? In-Reply-To: References: <20140120101113.GA2178@gmail.com> <20140120154039.GD2178@gmail.com> Message-ID: On Mon, Jan 20, 2014 at 6:12 PM, Charles R Harris wrote: > > > > On Mon, Jan 20, 2014 at 3:58 PM, Charles R Harris < > charlesr.harris at gmail.com> wrote: > >> >> >> >> On Mon, Jan 20, 2014 at 3:35 PM, Nathaniel Smith wrote: >> >>> On Mon, Jan 20, 2014 at 10:28 PM, Charles R Harris >>> wrote: >>> > >>> > >>> > >>> > On Mon, Jan 20, 2014 at 2:27 PM, Oscar Benjamin < >>> oscar.j.benjamin at gmail.com> >>> > wrote: >>> >> >>> >> >>> >> On Jan 20, 2014 8:35 PM, "Charles R Harris" < >>> charlesr.harris at gmail.com> >>> >> wrote: >>> >> > >>> >> > I think we may want something like PEP 393. The S datatype may be >>> the >>> >> > wrong place to look, we might want a modification of U instead so >>> as to >>> >> > transparently get the benefit of python strings. >>> >> >>> >> The approach taken in PEP 393 (the FSR) makes more sense for str than >>> it >>> >> does for numpy arrays for two reasons: str is immutable and opaque. >>> >> >>> >> Since str is immutable the maximum code point in the string can be >>> >> determined once when the string is created before anything else can >>> get a >>> >> pointer to the string buffer. >>> >> >>> >> Since it is opaque no one can rightly expect it to expose a particular >>> >> binary format so it is free to choose without compromising any >>> expected >>> >> semantics. >>> >> >>> >> If someone can call buffer on an array then the FSR is a semantic >>> change. >>> >> >>> >> If a numpy 'U' array used the FSR and consisted only of ASCII >>> characters >>> >> then it would have a one byte per char buffer. What then happens if >>> you put >>> >> a higher code point in? The buffer needs to be resized and the data >>> copied >>> >> over. But then what happens to any buffer objects or array views? >>> They would >>> >> be pointing at the old buffer from before the resize. Subsequent >>> >> modifications to the resized array would not show up in other views >>> and vice >>> >> versa. >>> >> >>> >> I don't think that this can be done transparently since users of a >>> numpy >>> >> array need to know about the binary representation. That's why I >>> suggest a >>> >> dtype that has an encoding. Only in that way can it consistently have >>> both a >>> >> binary and a text interface. >>> > >>> > >>> > I didn't say we should change the S type, but that we should have >>> something, >>> > say 's', that appeared to python as a string. I think if we want >>> transparent >>> > string interoperability with python together with a compressed >>> > representation, and I think we need both, we are going to have to deal >>> with >>> > the difficulties of utf-8. 
That means raising errors if the string >>> doesn't >>> > fit in the allotted size, etc. Mind, this is a workaround for the mass >>> of >>> > ascii data that is already out there, not a substitute for 'U'. >>> >>> If we're going to be taking that much trouble, I'd suggest going ahead >>> and adding a variable-length string type (where the array itself >>> contains a pointer to a lookaside buffer, maybe with an optimization >>> for stashing short strings directly). The fixed-length requirement is >>> pretty onerous for lots of applications (e.g., pandas always uses >>> dtype="O" for strings -- and that might be a good workaround for some >>> people in this thread for now). The use of a lookaside buffer would >>> also make it practical to resize the buffer when the maximum code >>> point changed, for that matter... >>> >> > The more I think about it, the more I think we may need to do that. Note > that dynd has ragged arrays and I think they are implemented as pointers to > buffers. The easy way for us to do that would be a specialization of object > arrays to string types only as you suggest. > Is this approach intended to be in *addition to* the latin-1 "s" type originally proposed by Chris, or *instead of* that? - Tom > > > > Chuck > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Tue Jan 21 08:55:29 2014 From: charlesr.harris at gmail.com (Charles R Harris) Date: Tue, 21 Jan 2014 06:55:29 -0700 Subject: [Numpy-discussion] A one-byte string dtype? In-Reply-To: References: <20140120101113.GA2178@gmail.com> <20140120154039.GD2178@gmail.com> Message-ID: On Tue, Jan 21, 2014 at 5:54 AM, Aldcroft, Thomas < aldcroft at head.cfa.harvard.edu> wrote: > > > > On Mon, Jan 20, 2014 at 6:12 PM, Charles R Harris < > charlesr.harris at gmail.com> wrote: > >> >> >> >> On Mon, Jan 20, 2014 at 3:58 PM, Charles R Harris < >> charlesr.harris at gmail.com> wrote: >> >>> >>> >>> >>> On Mon, Jan 20, 2014 at 3:35 PM, Nathaniel Smith wrote: >>> >>>> On Mon, Jan 20, 2014 at 10:28 PM, Charles R Harris >>>> wrote: >>>> > >>>> > >>>> > >>>> > On Mon, Jan 20, 2014 at 2:27 PM, Oscar Benjamin < >>>> oscar.j.benjamin at gmail.com> >>>> > wrote: >>>> >> >>>> >> >>>> >> On Jan 20, 2014 8:35 PM, "Charles R Harris" < >>>> charlesr.harris at gmail.com> >>>> >> wrote: >>>> >> > >>>> >> > I think we may want something like PEP 393. The S datatype may be >>>> the >>>> >> > wrong place to look, we might want a modification of U instead so >>>> as to >>>> >> > transparently get the benefit of python strings. >>>> >> >>>> >> The approach taken in PEP 393 (the FSR) makes more sense for str >>>> than it >>>> >> does for numpy arrays for two reasons: str is immutable and opaque. >>>> >> >>>> >> Since str is immutable the maximum code point in the string can be >>>> >> determined once when the string is created before anything else can >>>> get a >>>> >> pointer to the string buffer. >>>> >> >>>> >> Since it is opaque no one can rightly expect it to expose a >>>> particular >>>> >> binary format so it is free to choose without compromising any >>>> expected >>>> >> semantics. >>>> >> >>>> >> If someone can call buffer on an array then the FSR is a semantic >>>> change. 
>>>> >> >>>> >> If a numpy 'U' array used the FSR and consisted only of ASCII >>>> characters >>>> >> then it would have a one byte per char buffer. What then happens if >>>> you put >>>> >> a higher code point in? The buffer needs to be resized and the data >>>> copied >>>> >> over. But then what happens to any buffer objects or array views? >>>> They would >>>> >> be pointing at the old buffer from before the resize. Subsequent >>>> >> modifications to the resized array would not show up in other views >>>> and vice >>>> >> versa. >>>> >> >>>> >> I don't think that this can be done transparently since users of a >>>> numpy >>>> >> array need to know about the binary representation. That's why I >>>> suggest a >>>> >> dtype that has an encoding. Only in that way can it consistently >>>> have both a >>>> >> binary and a text interface. >>>> > >>>> > >>>> > I didn't say we should change the S type, but that we should have >>>> something, >>>> > say 's', that appeared to python as a string. I think if we want >>>> transparent >>>> > string interoperability with python together with a compressed >>>> > representation, and I think we need both, we are going to have to >>>> deal with >>>> > the difficulties of utf-8. That means raising errors if the string >>>> doesn't >>>> > fit in the allotted size, etc. Mind, this is a workaround for the >>>> mass of >>>> > ascii data that is already out there, not a substitute for 'U'. >>>> >>>> If we're going to be taking that much trouble, I'd suggest going ahead >>>> and adding a variable-length string type (where the array itself >>>> contains a pointer to a lookaside buffer, maybe with an optimization >>>> for stashing short strings directly). The fixed-length requirement is >>>> pretty onerous for lots of applications (e.g., pandas always uses >>>> dtype="O" for strings -- and that might be a good workaround for some >>>> people in this thread for now). The use of a lookaside buffer would >>>> also make it practical to resize the buffer when the maximum code >>>> point changed, for that matter... >>>> >>> >> The more I think about it, the more I think we may need to do that. Note >> that dynd has ragged arrays and I think they are implemented as pointers to >> buffers. The easy way for us to do that would be a specialization of object >> arrays to string types only as you suggest. >> > > Is this approach intended to be in *addition to* the latin-1 "s" type > originally proposed by Chris, or *instead of* that? > > Well, that's open for discussion. The problem is to have something that is both compact (latin-1) and interoperates transparently with python 3 strings (utf-8). A latin-1 type would be easier to implement and would probably be a better choice for something available in both python 2 and python 3, but unless the python 3 developers come up with something clever I don't see how to make it behave transparently as a string in python 3. OTOH, it's not clear to me how to make utf-8 operate transparently with python 2 strings, especially as the unicode representation choices in python 2 are ucs-2 or ucs-4 and the python 3 work adding utf-16 and utf-8 is unlikely to be backported. The problem may be unsolvable in a completely satisfactory way. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From aldcroft at head.cfa.harvard.edu Tue Jan 21 09:37:11 2014 From: aldcroft at head.cfa.harvard.edu (Aldcroft, Thomas) Date: Tue, 21 Jan 2014 09:37:11 -0500 Subject: [Numpy-discussion] A one-byte string dtype? 
In-Reply-To: References: <20140120101113.GA2178@gmail.com> <20140120154039.GD2178@gmail.com> Message-ID: On Tue, Jan 21, 2014 at 8:55 AM, Charles R Harris wrote: > > > > On Tue, Jan 21, 2014 at 5:54 AM, Aldcroft, Thomas < > aldcroft at head.cfa.harvard.edu> wrote: > >> >> >> >> On Mon, Jan 20, 2014 at 6:12 PM, Charles R Harris < >> charlesr.harris at gmail.com> wrote: >> >>> >>> >>> >>> On Mon, Jan 20, 2014 at 3:58 PM, Charles R Harris < >>> charlesr.harris at gmail.com> wrote: >>> >>>> >>>> >>>> >>>> On Mon, Jan 20, 2014 at 3:35 PM, Nathaniel Smith wrote: >>>> >>>>> On Mon, Jan 20, 2014 at 10:28 PM, Charles R Harris >>>>> wrote: >>>>> > >>>>> > >>>>> > >>>>> > On Mon, Jan 20, 2014 at 2:27 PM, Oscar Benjamin < >>>>> oscar.j.benjamin at gmail.com> >>>>> > wrote: >>>>> >> >>>>> >> >>>>> >> On Jan 20, 2014 8:35 PM, "Charles R Harris" < >>>>> charlesr.harris at gmail.com> >>>>> >> wrote: >>>>> >> > >>>>> >> > I think we may want something like PEP 393. The S datatype may be >>>>> the >>>>> >> > wrong place to look, we might want a modification of U instead so >>>>> as to >>>>> >> > transparently get the benefit of python strings. >>>>> >> >>>>> >> The approach taken in PEP 393 (the FSR) makes more sense for str >>>>> than it >>>>> >> does for numpy arrays for two reasons: str is immutable and opaque. >>>>> >> >>>>> >> Since str is immutable the maximum code point in the string can be >>>>> >> determined once when the string is created before anything else can >>>>> get a >>>>> >> pointer to the string buffer. >>>>> >> >>>>> >> Since it is opaque no one can rightly expect it to expose a >>>>> particular >>>>> >> binary format so it is free to choose without compromising any >>>>> expected >>>>> >> semantics. >>>>> >> >>>>> >> If someone can call buffer on an array then the FSR is a semantic >>>>> change. >>>>> >> >>>>> >> If a numpy 'U' array used the FSR and consisted only of ASCII >>>>> characters >>>>> >> then it would have a one byte per char buffer. What then happens if >>>>> you put >>>>> >> a higher code point in? The buffer needs to be resized and the data >>>>> copied >>>>> >> over. But then what happens to any buffer objects or array views? >>>>> They would >>>>> >> be pointing at the old buffer from before the resize. Subsequent >>>>> >> modifications to the resized array would not show up in other views >>>>> and vice >>>>> >> versa. >>>>> >> >>>>> >> I don't think that this can be done transparently since users of a >>>>> numpy >>>>> >> array need to know about the binary representation. That's why I >>>>> suggest a >>>>> >> dtype that has an encoding. Only in that way can it consistently >>>>> have both a >>>>> >> binary and a text interface. >>>>> > >>>>> > >>>>> > I didn't say we should change the S type, but that we should have >>>>> something, >>>>> > say 's', that appeared to python as a string. I think if we want >>>>> transparent >>>>> > string interoperability with python together with a compressed >>>>> > representation, and I think we need both, we are going to have to >>>>> deal with >>>>> > the difficulties of utf-8. That means raising errors if the string >>>>> doesn't >>>>> > fit in the allotted size, etc. Mind, this is a workaround for the >>>>> mass of >>>>> > ascii data that is already out there, not a substitute for 'U'. 
>>>>> >>>>> If we're going to be taking that much trouble, I'd suggest going ahead >>>>> and adding a variable-length string type (where the array itself >>>>> contains a pointer to a lookaside buffer, maybe with an optimization >>>>> for stashing short strings directly). The fixed-length requirement is >>>>> pretty onerous for lots of applications (e.g., pandas always uses >>>>> dtype="O" for strings -- and that might be a good workaround for some >>>>> people in this thread for now). The use of a lookaside buffer would >>>>> also make it practical to resize the buffer when the maximum code >>>>> point changed, for that matter... >>>>> >>>> >>> The more I think about it, the more I think we may need to do that. Note >>> that dynd has ragged arrays and I think they are implemented as pointers to >>> buffers. The easy way for us to do that would be a specialization of object >>> arrays to string types only as you suggest. >>> >> >> Is this approach intended to be in *addition to* the latin-1 "s" type >> originally proposed by Chris, or *instead of* that? >> >> > Well, that's open for discussion. The problem is to have something that is > both compact (latin-1) and interoperates transparently with python 3 > strings (utf-8). A latin-1 type would be easier to implement and would > probably be a better choice for something available in both python 2 and > python 3, but unless the python 3 developers come up with something clever > I don't see how to make it behave transparently as a string in python 3. > OTOH, it's not clear to me how to make utf-8 operate transparently with > python 2 strings, especially as the unicode representation choices in > python 2 are ucs-2 or ucs-4 and the python 3 work adding utf-16 and utf-8 > is unlikely to be backported. The problem may be unsolvable in a completely > satisfactory way. > Since it's open for discussion, I'll put in my vote for implementing the easier latin-1 version in the short term to facilitate Python 2 / 3 interoperability. This would solve my use-case (giga-rows of short fixed length strings), and presumably allow things like memory mapping of large data files (like for FITS files in astropy.io.fits). I don't have a clue how the current 'U' dtype works under the hood, but from my user perspective it seems to work just fine in terms of interacting with Python 3 strings. Is there a technical problem with doing basically the same thing for an 's' dtype, but using latin-1 instead of UCS-4? Thanks, Tom > > Chuck > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From oscar.j.benjamin at gmail.com Tue Jan 21 09:43:31 2014 From: oscar.j.benjamin at gmail.com (Oscar Benjamin) Date: Tue, 21 Jan 2014 14:43:31 +0000 Subject: [Numpy-discussion] A one-byte string dtype? In-Reply-To: References: Message-ID: <20140121144329.GH2632@gmail.com> On Tue, Jan 21, 2014 at 06:55:29AM -0700, Charles R Harris wrote: > > Well, that's open for discussion. The problem is to have something that is > both compact (latin-1) and interoperates transparently with python 3 > strings (utf-8). A latin-1 type would be easier to implement and would > probably be a better choice for something available in both python 2 and > python 3, but unless the python 3 developers come up with something clever > I don't see how to make it behave transparently as a string in python 3. 
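To make the latin-1 trade-off concrete, a small sketch (with 'S' standing in
for the proposed one-byte 's'): every byte value round-trips through latin-1
without error, and storage drops to a quarter of the 'U' dtype for short
strings:

import numpy as np

# Latin-1 maps every byte value 0-255 to exactly one code point, so it
# never raises and round-trips arbitrary bytes.
raw = bytes(bytearray(range(256)))
assert raw.decode('latin-1').encode('latin-1') == raw

# Memory for short fixed-width strings: 4 bytes/char for 'U' versus
# 1 byte/char for a one-byte dtype ('S' used here as a stand-in).
u = np.array([u'NGC1275'] * 1000, dtype='U7')
s = np.array([b'NGC1275'] * 1000, dtype='S7')
print(u.nbytes, s.nbytes)   # 28000 vs 7000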
> OTOH, it's not clear to me how to make utf-8 operate transparently with > python 2 strings, especially as the unicode representation choices in > python 2 are ucs-2 or ucs-4 On Python 2, unicode strings can operate transparently with byte strings: $ python Python 2.7.3 (default, Sep 26 2013, 20:03:06) [GCC 4.6.3] on linux2 Type "help", "copyright", "credits" or "license" for more information. >>> import numpy as bnp >>> import numpy as np >>> a = np.array([u'\xd5scar'], dtype='U') >>> a array([u'\xd5scar'], dtype='>> a[0] u'\xd5scar' >>> import sys >>> sys.stdout.encoding 'UTF-8' >>> print(a[0]) # Encodes as 'utf-8' ?scar >>> 'My name is %s' % a[0] # Decodes as ASCII u'My name is \xd5scar' >>> print('My name is %s' % a[0]) # Encodes as UTF-8 My name is ?scar This is no better worse than the rest of the Py2 text model. So if the new dtype always returns a unicode string under Py2 it should work (as well as the Py2 text model ever does). > and the python 3 work adding utf-16 and utf-8 > is unlikely to be backported. The problem may be unsolvable in a completely > satisfactory way. What do you mean by this? PEP 393 uses UCS-1/2/4 not utf-8/16/32 i.e. it always uses a fixed-width encoding. You can just use the CPython C-API to create the unicode strings. The simplest way is probably use utf-8 internally and then call PyUnicode_DecodeUTF8 and PyUnicode_EncodeUTF8 at the boundaries. This should work fine on Python 2.x and 3.x. It obviates any need to think about pre-3.3 narrow and wide builds and post-3.3 FSR formats. Unlike Python's str there isn't much need to be able to efficiently slice or index within the string array element. Indexing into the array to get the string requires creating a new object, so you may as well just decode from utf-8 at that point [it's big-O(num chars) either way]. There's no need to constrain it to fixed-width encodings like the FSR in which case utf-8 is clearly the best choice as: 1) It covers the whole unicode spectrum. 2) It uses 1 byte-per-char for ASCII. 3) UTF-8 is a big optimisation target for CPython (so it's fast). Oscar From charlesr.harris at gmail.com Tue Jan 21 09:48:11 2014 From: charlesr.harris at gmail.com (Charles R Harris) Date: Tue, 21 Jan 2014 07:48:11 -0700 Subject: [Numpy-discussion] A one-byte string dtype? In-Reply-To: References: <20140120101113.GA2178@gmail.com> <20140120154039.GD2178@gmail.com> Message-ID: On Tue, Jan 21, 2014 at 7:37 AM, Aldcroft, Thomas < aldcroft at head.cfa.harvard.edu> wrote: > > > > On Tue, Jan 21, 2014 at 8:55 AM, Charles R Harris < > charlesr.harris at gmail.com> wrote: > >> >> >> >> On Tue, Jan 21, 2014 at 5:54 AM, Aldcroft, Thomas < >> aldcroft at head.cfa.harvard.edu> wrote: >> >>> >>> >>> >>> On Mon, Jan 20, 2014 at 6:12 PM, Charles R Harris < >>> charlesr.harris at gmail.com> wrote: >>> >>>> >>>> >>>> >>>> On Mon, Jan 20, 2014 at 3:58 PM, Charles R Harris < >>>> charlesr.harris at gmail.com> wrote: >>>> >>>>> >>>>> >>>>> >>>>> On Mon, Jan 20, 2014 at 3:35 PM, Nathaniel Smith wrote: >>>>> >>>>>> On Mon, Jan 20, 2014 at 10:28 PM, Charles R Harris >>>>>> wrote: >>>>>> > >>>>>> > >>>>>> > >>>>>> > On Mon, Jan 20, 2014 at 2:27 PM, Oscar Benjamin < >>>>>> oscar.j.benjamin at gmail.com> >>>>>> > wrote: >>>>>> >> >>>>>> >> >>>>>> >> On Jan 20, 2014 8:35 PM, "Charles R Harris" < >>>>>> charlesr.harris at gmail.com> >>>>>> >> wrote: >>>>>> >> > >>>>>> >> > I think we may want something like PEP 393. 
The S datatype may >>>>>> be the >>>>>> >> > wrong place to look, we might want a modification of U instead >>>>>> so as to >>>>>> >> > transparently get the benefit of python strings. >>>>>> >> >>>>>> >> The approach taken in PEP 393 (the FSR) makes more sense for str >>>>>> than it >>>>>> >> does for numpy arrays for two reasons: str is immutable and opaque. >>>>>> >> >>>>>> >> Since str is immutable the maximum code point in the string can be >>>>>> >> determined once when the string is created before anything else >>>>>> can get a >>>>>> >> pointer to the string buffer. >>>>>> >> >>>>>> >> Since it is opaque no one can rightly expect it to expose a >>>>>> particular >>>>>> >> binary format so it is free to choose without compromising any >>>>>> expected >>>>>> >> semantics. >>>>>> >> >>>>>> >> If someone can call buffer on an array then the FSR is a semantic >>>>>> change. >>>>>> >> >>>>>> >> If a numpy 'U' array used the FSR and consisted only of ASCII >>>>>> characters >>>>>> >> then it would have a one byte per char buffer. What then happens >>>>>> if you put >>>>>> >> a higher code point in? The buffer needs to be resized and the >>>>>> data copied >>>>>> >> over. But then what happens to any buffer objects or array views? >>>>>> They would >>>>>> >> be pointing at the old buffer from before the resize. Subsequent >>>>>> >> modifications to the resized array would not show up in other >>>>>> views and vice >>>>>> >> versa. >>>>>> >> >>>>>> >> I don't think that this can be done transparently since users of a >>>>>> numpy >>>>>> >> array need to know about the binary representation. That's why I >>>>>> suggest a >>>>>> >> dtype that has an encoding. Only in that way can it consistently >>>>>> have both a >>>>>> >> binary and a text interface. >>>>>> > >>>>>> > >>>>>> > I didn't say we should change the S type, but that we should have >>>>>> something, >>>>>> > say 's', that appeared to python as a string. I think if we want >>>>>> transparent >>>>>> > string interoperability with python together with a compressed >>>>>> > representation, and I think we need both, we are going to have to >>>>>> deal with >>>>>> > the difficulties of utf-8. That means raising errors if the string >>>>>> doesn't >>>>>> > fit in the allotted size, etc. Mind, this is a workaround for the >>>>>> mass of >>>>>> > ascii data that is already out there, not a substitute for 'U'. >>>>>> >>>>>> If we're going to be taking that much trouble, I'd suggest going ahead >>>>>> and adding a variable-length string type (where the array itself >>>>>> contains a pointer to a lookaside buffer, maybe with an optimization >>>>>> for stashing short strings directly). The fixed-length requirement is >>>>>> pretty onerous for lots of applications (e.g., pandas always uses >>>>>> dtype="O" for strings -- and that might be a good workaround for some >>>>>> people in this thread for now). The use of a lookaside buffer would >>>>>> also make it practical to resize the buffer when the maximum code >>>>>> point changed, for that matter... >>>>>> >>>>> >>>> The more I think about it, the more I think we may need to do that. >>>> Note that dynd has ragged arrays and I think they are implemented as >>>> pointers to buffers. The easy way for us to do that would be a >>>> specialization of object arrays to string types only as you suggest. >>>> >>> >>> Is this approach intended to be in *addition to* the latin-1 "s" type >>> originally proposed by Chris, or *instead of* that? >>> >>> >> Well, that's open for discussion. 
The problem is to have something that >> is both compact (latin-1) and interoperates transparently with python 3 >> strings (utf-8). A latin-1 type would be easier to implement and would >> probably be a better choice for something available in both python 2 and >> python 3, but unless the python 3 developers come up with something clever >> I don't see how to make it behave transparently as a string in python 3. >> OTOH, it's not clear to me how to make utf-8 operate transparently with >> python 2 strings, especially as the unicode representation choices in >> python 2 are ucs-2 or ucs-4 and the python 3 work adding utf-16 and utf-8 >> is unlikely to be backported. The problem may be unsolvable in a completely >> satisfactory way. >> > > Since it's open for discussion, I'll put in my vote for implementing the > easier latin-1 version in the short term to facilitate Python 2 / 3 > interoperability. This would solve my use-case (giga-rows of short fixed > length strings), and presumably allow things like memory mapping of large > data files (like for FITS files in astropy.io.fits). > > I don't have a clue how the current 'U' dtype works under the hood, but > from my user perspective it seems to work just fine in terms of interacting > with Python 3 strings. Is there a technical problem with doing basically > the same thing for an 's' dtype, but using latin-1 instead of UCS-4? > I think there is a technical problem. We may be able masquerade latin-1 as utf-8 for some subset of characters or fool python 3 in some other way. But in anycase, I think it needs some research to see what the possibilities are. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From sebastian at sipsolutions.net Tue Jan 21 10:10:01 2014 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Tue, 21 Jan 2014 16:10:01 +0100 Subject: [Numpy-discussion] A one-byte string dtype? In-Reply-To: References: <20140120101113.GA2178@gmail.com> <20140120154039.GD2178@gmail.com> Message-ID: <1390317001.25697.7.camel@sebastian-laptop> On Tue, 2014-01-21 at 07:48 -0700, Charles R Harris wrote: > > > > On Tue, Jan 21, 2014 at 7:37 AM, Aldcroft, Thomas > wrote: > > > > On Tue, Jan 21, 2014 at 8:55 AM, Charles R Harris > wrote: > > > > On Tue, Jan 21, 2014 at 5:54 AM, Aldcroft, Thomas > wrote: > > > > On Mon, Jan 20, 2014 at 6:12 PM, Charles R > Harris wrote: > > > > On Mon, Jan 20, 2014 at 3:58 PM, > Charles R Harris > wrote: > > > > On Mon, Jan 20, 2014 at 3:35 > PM, Nathaniel Smith > wrote: > On Mon, Jan 20, 2014 > at 10:28 PM, Charles R > Harris > wrote: > > > > > > > > On Mon, Jan 20, 2014 > at 2:27 PM, Oscar > Benjamin > > > wrote: > >> > >> > >> On Jan 20, 2014 > 8:35 PM, "Charles R > Harris" > > >> wrote: > >> > > >> > I think we may > want something like > PEP 393. The S > datatype may be the > >> > wrong place to > look, we might want a > modification of U > instead so as to > >> > transparently get > the benefit of python > strings. > >> > >> The approach taken > in PEP 393 (the FSR) > makes more sense for > str than it > >> does for numpy > arrays for two > reasons: str is > immutable and opaque. > >> > >> Since str is > immutable the maximum > code point in the > string can be > >> determined once > when the string is > created before > anything else can get > a > >> pointer to the > string buffer. > >> > >> Since it is opaque > no one can rightly > expect it to expose a > particular > >> binary format so it > is free to choose > without compromising > any expected > >> semantics. 
> >> > >> If someone can call > buffer on an array > then the FSR is a > semantic change. > >> > >> If a numpy 'U' > array used the FSR and > consisted only of > ASCII characters > >> then it would have > a one byte per char > buffer. What then > happens if you put > >> a higher code point > in? The buffer needs > to be resized and the > data copied > >> over. But then what > happens to any buffer > objects or array > views? They would > >> be pointing at the > old buffer from before > the resize. Subsequent > >> modifications to > the resized array > would not show up in > other views and vice > >> versa. > >> > >> I don't think that > this can be done > transparently since > users of a numpy > >> array need to know > about the binary > representation. That's > why I suggest a > >> dtype that has an > encoding. Only in that > way can it > consistently have both > a > >> binary and a text > interface. > > > > > > I didn't say we > should change the S > type, but that we > should have something, > > say 's', that > appeared to python as > a string. I think if > we want transparent > > string > interoperability with > python together with a > compressed > > representation, and > I think we need both, > we are going to have > to deal with > > the difficulties of > utf-8. That means > raising errors if the > string doesn't > > fit in the allotted > size, etc. Mind, this > is a workaround for > the mass of > > ascii data that is > already out there, not > a substitute for 'U'. > > > If we're going to be > taking that much > trouble, I'd suggest > going ahead > and adding a > variable-length string > type (where the array > itself > contains a pointer to > a lookaside buffer, > maybe with an > optimization > for stashing short > strings directly). The > fixed-length > requirement is > pretty onerous for > lots of applications > (e.g., pandas always > uses > dtype="O" for strings > -- and that might be a > good workaround for > some > people in this thread > for now). The use of a > lookaside buffer would > also make it practical > to resize the buffer > when the maximum code > point changed, for > that matter... > > > The more I think about it, the more I > think we may need to do that. Note > that dynd has ragged arrays and I > think they are implemented as pointers > to buffers. The easy way for us to do > that would be a specialization of > object arrays to string types only as > you suggest. > > > > Is this approach intended to be in *addition > to* the latin-1 "s" type originally proposed > by Chris, or *instead of* that? > > > > > Well, that's open for discussion. The problem is to > have something that is both compact (latin-1) and > interoperates transparently with python 3 strings > (utf-8). A latin-1 type would be easier to implement > and would probably be a better choice for something > available in both python 2 and python 3, but unless > the python 3 developers come up with something clever > I don't see how to make it behave transparently as a > string in python 3. OTOH, it's not clear to me how to > make utf-8 operate transparently with python 2 > strings, especially as the unicode representation > choices in python 2 are ucs-2 or ucs-4 and the python > 3 work adding utf-16 and utf-8 is unlikely to be > backported. The problem may be unsolvable in a > completely satisfactory way. > > > > Since it's open for discussion, I'll put in my vote for > implementing the easier latin-1 version in the short term to > facilitate Python 2 / 3 interoperability. 
This would solve my > use-case (giga-rows of short fixed length strings), and > presumably allow things like memory mapping of large data > files (like for FITS files in astropy.io.fits). > > > I don't have a clue how the current 'U' dtype works under the > hood, but from my user perspective it seems to work just fine > in terms of interacting with Python 3 strings. Is there a > technical problem with doing basically the same thing for an > 's' dtype, but using latin-1 instead of UCS-4? > > > I think there is a technical problem. We may be able masquerade > latin-1 as utf-8 for some subset of characters or fool python 3 in > some other way. But in anycase, I think it needs some research to see > what the possibilities are. > I am not quite sure, but shouldn't it be even possible to tag on a possible encoding into the metadata of the string dtype and allow this to be set to all 1-byte wide encodings that python understands. If the metadata is not None, all entry points to and from the array (Object->string, string->Object conversions) would then de- or encode using the usual python string de- and encode. Of course it would still be a lot of work, since the string comparisons would need to know about comparing different encodings and dtype equivalence is wrong and all the conversions need to be carefully checked... Most string tools though probably don't care about encoding as long as it is fixed 1-byte width, though one would have to check that they don't lose the encoding information by creating a new "S" array... - Sebastian > Chuck > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From jenny.stone125 at gmail.com Tue Jan 21 11:26:17 2014 From: jenny.stone125 at gmail.com (jennifer stone) Date: Tue, 21 Jan 2014 21:56:17 +0530 Subject: [Numpy-discussion] (no subject) Message-ID: > >What are your interests and experience? If you use numpy, are there things > >you would like to fix, or enhancements you would like to see? > > Chuck > > I am an undergraduate student with CS as major and have interest in Math and Physics. This has led me to use NumPy and SciPy to work on innumerable cases involving special polynomial functions and polynomials like Legendre polynomials, Bessel Functions and so on. So, The packages are closer known to me from this point of view. I have a* few proposals* in mind. But I don't have any idea if they are acceptable within the scope of GSoC 1. Many special functions and polynomials are neither included in NumPy nor on SciPy.. These include Ellipsoidal Harmonic Functions (lames function), Cylindrical Harmonic function. Scipy at present supports only spherical Harmonic function. Further, why cant we extend SciPy to incorporate* Inverse Laplace Transforms*? At present Matlab has this amazing function *ilaplace* and SymPy does have *Inverse_Laplace_transform* but it would be better to incorporate all in one package. I mean SciPy does have function to evaluate laplace transform After having written this, I feel that this post should have been sent to SciPy but as a majority of contributors are the same I proceed. Please suggest any other possible projects, as I would like to continue with SciPy or NumPy, preferably NumPy as I have been fiddling with its source code for a month now and so am pretty comfortable with it. As for my experience, I have known C for past 4 years and have been a python lover for past 1 year. 
I am pretty new to open source communities, started before a manth and a half. regards Jennifer -------------- next part -------------- An HTML attachment was scrubbed... URL: From stefan at sun.ac.za Tue Jan 21 11:45:31 2014 From: stefan at sun.ac.za (=?iso-8859-1?Q?St=E9fan?= van der Walt) Date: Tue, 21 Jan 2014 17:45:31 +0100 Subject: [Numpy-discussion] (no subject) In-Reply-To: References: Message-ID: <20140121164531.GB21126@gmail.com> On Tue, 21 Jan 2014 21:56:17 +0530, jennifer stone wrote: > I am an undergraduate student with CS as major and have interest in Math > and Physics. This has led me to use NumPy and SciPy to work on innumerable > cases involving special polynomial functions and polynomials like Legendre > polynomials, Bessel Functions and so on. So, The packages are closer known > to me from this point of view. I have a* few proposals* in mind. But I > don't have any idea if they are acceptable within the scope of GSoC > 1. Many special functions and polynomials are neither included in NumPy nor > on SciPy.. These include Ellipsoidal Harmonic Functions (lames function), > Cylindrical Harmonic function. Scipy at present supports only spherical > Harmonic function. SciPy's spherical harmonics are very inefficient if one is only interested in computing one specific order. I'd be so happy if someone would work on that! St?fan From charlesr.harris at gmail.com Tue Jan 21 11:46:36 2014 From: charlesr.harris at gmail.com (Charles R Harris) Date: Tue, 21 Jan 2014 09:46:36 -0700 Subject: [Numpy-discussion] (no subject) In-Reply-To: References: Message-ID: On Tue, Jan 21, 2014 at 9:26 AM, jennifer stone wrote: > > >What are your interests and experience? If you use numpy, are there things >> >you would like to fix, or enhancements you would like to see? >> >> Chuck >> >> > I am an undergraduate student with CS as major and have interest in Math > and Physics. This has led me to use NumPy and SciPy to work on innumerable > cases involving special polynomial functions and polynomials like Legendre > polynomials, Bessel Functions and so on. So, The packages are closer known > to me from this point of view. I have a* few proposals* in mind. But I > don't have any idea if they are acceptable within the scope of GSoC > 1. Many special functions and polynomials are neither included in NumPy > nor on SciPy.. These include Ellipsoidal Harmonic Functions (lames > function), Cylindrical Harmonic function. Scipy at present supports only > spherical Harmonic function. > Further, why cant we extend SciPy to incorporate* Inverse Laplace > Transforms*? At present Matlab has this amazing function *ilaplace* and > SymPy does have *Inverse_Laplace_transform* but it would be better to > incorporate all in one package. I mean SciPy does have function to evaluate > laplace transform > > After having written this, I feel that this post should have been sent to > SciPy > but as a majority of contributors are the same I proceed. > Please suggest any other possible projects, as I would like to continue > with SciPy or NumPy, preferably NumPy as I have been fiddling with its > source code for a month now and so am pretty comfortable with it. > > As for my experience, I have known C for past 4 years and have been a > python lover for past 1 year. I am pretty new to open source communities, > started before a manth and a half. > > It does sound like scipy might be a better match, I don't think anyone would complain if you cross posted. 
Both scipy and numpy require GSOC candidates to have a pull request accepted as part of the application process. I'd suggest implementing a function not currently in scipy that you think would be useful. That would also help in finding a mentor for the summer. I'd also suggest getting familiar with cython. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Tue Jan 21 12:03:02 2014 From: charlesr.harris at gmail.com (Charles R Harris) Date: Tue, 21 Jan 2014 10:03:02 -0700 Subject: [Numpy-discussion] (no subject) In-Reply-To: References: Message-ID: On Tue, Jan 21, 2014 at 9:46 AM, Charles R Harris wrote: > > > > On Tue, Jan 21, 2014 at 9:26 AM, jennifer stone wrote: > >> >> >What are your interests and experience? If you use numpy, are there >>> things >>> >you would like to fix, or enhancements you would like to see? >>> >>> Chuck >>> >>> >> I am an undergraduate student with CS as major and have interest in Math >> and Physics. This has led me to use NumPy and SciPy to work on innumerable >> cases involving special polynomial functions and polynomials like Legendre >> polynomials, Bessel Functions and so on. So, The packages are closer known >> to me from this point of view. I have a* few proposals* in mind. But I >> don't have any idea if they are acceptable within the scope of GSoC >> 1. Many special functions and polynomials are neither included in NumPy >> nor on SciPy.. These include Ellipsoidal Harmonic Functions (lames >> function), Cylindrical Harmonic function. Scipy at present supports only >> spherical Harmonic function. >> Further, why cant we extend SciPy to incorporate* Inverse Laplace >> Transforms*? At present Matlab has this amazing function *ilaplace* and >> SymPy does have *Inverse_Laplace_transform* but it would be better to >> incorporate all in one package. I mean SciPy does have function to evaluate >> laplace transform >> >> After having written this, I feel that this post should have been sent to >> SciPy >> but as a majority of contributors are the same I proceed. >> Please suggest any other possible projects, as I would like to continue >> with SciPy or NumPy, preferably NumPy as I have been fiddling with its >> source code for a month now and so am pretty comfortable with it. >> >> As for my experience, I have known C for past 4 years and have been a >> python lover for past 1 year. I am pretty new to open source communities, >> started before a manth and a half. >> >> > It does sound like scipy might be a better match, I don't think anyone > would complain if you cross posted. Both scipy and numpy require GSOC > candidates to have a pull request accepted as part of the application > process. I'd suggest implementing a function not currently in scipy that > you think would be useful. That would also help in finding a mentor for the > summer. I'd also suggest getting familiar with cython. > > I don't see you on github yet, are you there? If not, you should set up an account to work in. See the developer guide for some pointers. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From d.l.goldsmith at gmail.com Tue Jan 21 12:28:19 2014 From: d.l.goldsmith at gmail.com (David Goldsmith) Date: Tue, 21 Jan 2014 09:28:19 -0800 Subject: [Numpy-discussion] A one-byte string dtype? Message-ID: Am I the only one who feels that this (very important--I'm being sincere, not sarcastic) thread has matured and specialized enough to warrant it's own home on the Wiki? 
DG -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Tue Jan 21 12:35:26 2014 From: njs at pobox.com (Nathaniel Smith) Date: Tue, 21 Jan 2014 17:35:26 +0000 Subject: [Numpy-discussion] A one-byte string dtype? In-Reply-To: References: Message-ID: On 21 Jan 2014 17:28, "David Goldsmith" wrote: > > > Am I the only one who feels that this (very important--I'm being sincere, not sarcastic) thread has matured and specialized enough to warrant it's own home on the Wiki? Sounds plausible, perhaps you could write up such a page? -n -------------- next part -------------- An HTML attachment was scrubbed... URL: From chris.barker at noaa.gov Tue Jan 21 12:46:41 2014 From: chris.barker at noaa.gov (Chris Barker) Date: Tue, 21 Jan 2014 09:46:41 -0800 Subject: [Numpy-discussion] A one-byte string dtype? In-Reply-To: References: Message-ID: On Tue, Jan 21, 2014 at 9:28 AM, David Goldsmith wrote: > > Am I the only one who feels that this (very important--I'm being sincere, > not sarcastic) thread has matured and specialized enough to warrant it's > own home on the Wiki? > Or maybe a NEP? https://github.com/numpy/numpy/tree/master/doc/neps sorry -- really swamped this week, so I won't be writing it... -Chris -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov -------------- next part -------------- An HTML attachment was scrubbed... URL: From d.l.goldsmith at gmail.com Tue Jan 21 12:53:25 2014 From: d.l.goldsmith at gmail.com (David Goldsmith) Date: Tue, 21 Jan 2014 09:53:25 -0800 Subject: [Numpy-discussion] A one-byte string dtype? Message-ID: > Date: Tue, 21 Jan 2014 17:35:26 +0000 > From: Nathaniel Smith > Subject: Re: [Numpy-discussion] A one-byte string dtype? > To: Discussion of Numerical Python > Message-ID: > KE3xLGa2+gz+Qd4F0xS2UBoEYsgedA at mail.gmail.com> > Content-Type: text/plain; charset="utf-8" > > On 21 Jan 2014 17:28, "David Goldsmith" wrote: > > > > > > Am I the only one who feels that this (very important--I'm being sincere, > not sarcastic) thread has matured and specialized enough to warrant it's > own home on the Wiki? > > Sounds plausible, perhaps you could write up such a page? > > -n > I can certainly get one started (but I don't think I can faithfully summarize all this thread's current content, so I apologize in advance for leaving that undone). DG -------------- next part -------------- An HTML attachment was scrubbed... URL: From chris.barker at noaa.gov Tue Jan 21 13:00:19 2014 From: chris.barker at noaa.gov (Chris Barker) Date: Tue, 21 Jan 2014 10:00:19 -0800 Subject: [Numpy-discussion] A one-byte string dtype? In-Reply-To: References: Message-ID: A lot of good discussion here -- to much to comment individually, but it seems we can boil it down to a couple somewhat distinct proposals: 1) a one-byte-per-char dtype: This would provide compact, high efficiency storage for common text for scientific computing. It is analogous to a lower-precision numeric type -- i.e. it could not store any unicode strings -- only the subset that are compatible the suggested encoding. Suggested encoding: latin-1 Other options: - ascii only. - settable to any one-byte per char encoding supported by python I like this IFF it's pretty easy, but it may add significant complications (and overhead) for comparisons, etc.... 
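To make option (1) concrete, here's a rough sketch (not a worked-out design -- the helper names are invented for illustration) of how a latin-1 one-byte text type can be approximated today by keeping encoded bytes in an 'S' array and encoding/decoding at the boundaries:

import numpy as np

# sketch only: emulate a one-byte-per-char latin-1 text dtype on top of 'S'
def latin1_array(strings, width):
    # a UnicodeEncodeError here plays the role of the proposed dtype
    # rejecting characters it cannot represent
    return np.array([s.encode('latin-1') for s in strings], dtype='S%d' % width)

def latin1_tolist(arr):
    # decode on the way out, so users only ever see text
    return [b.decode('latin-1') for b in arr.tolist()]

a = latin1_array([u'hello', u'\xd5scar'], 10)
print(a.dtype)            # |S10 -- one byte per character
print(a.nbytes)           # 20 bytes; the same data as 'U10' would take 80
print(latin1_tolist(a))   # plain python (unicode) strings come back out

A real dtype would do the encode/decode under the hood and keep the result clearly labelled as text rather than bytes; this is just to show the intended semantics.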
NOTE: This is NOT a way to conflate bytes and text, and not a way to "go back to the py2 mojibake hell" -- the goal here is to very clearly have this be text data, and have a clearly defined encoding. Which is why we can't just use 'S' -- or adapt 'S' to do this. Rather is is a way to conveniently and efficiently use numpy for text that is ansi compatible. 2) a utf-8 dtype: NOTE: this CAN NOT be used in place of (1) above. It is not a one-byte per char encoding, so would not snuggly into the numpy data model. It would give compact memory use for mostly-ascii data, so that would be nice. 3) a fully python-3 like ( PEP 393 ) flexible unicode dtype. This would get us the advantages of the new py3 unicode model -- compact and efficient when it can be, but also supporting all of unicode. Honestly, this seems like more work than it's worth to me, at least given the current numpy dtype model -- maybe a nice addition to dynd. YOu can, after all, simply use an object array with py3 strings in it. Though perhaps using the py3 unicode type, but having a dtype that specifically links to that, rather than a generic python object would be a good compromise. Hmm -- I guess despite what I said, I just write the starting pint for a NEP... (or two, actually...) -Chris On Tue, Jan 21, 2014 at 9:46 AM, Chris Barker wrote: > On Tue, Jan 21, 2014 at 9:28 AM, David Goldsmith wrote: > >> >> Am I the only one who feels that this (very important--I'm being sincere, >> not sarcastic) thread has matured and specialized enough to warrant it's >> own home on the Wiki? >> > > Or maybe a NEP? > > https://github.com/numpy/numpy/tree/master/doc/neps > > sorry -- really swamped this week, so I won't be writing it... > > -Chris > > > > > -- > > Christopher Barker, Ph.D. > Oceanographer > > Emergency Response Division > NOAA/NOS/OR&R (206) 526-6959 voice > 7600 Sand Point Way NE (206) 526-6329 fax > Seattle, WA 98115 (206) 526-6317 main reception > > Chris.Barker at noaa.gov > -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Tue Jan 21 13:14:28 2014 From: charlesr.harris at gmail.com (Charles R Harris) Date: Tue, 21 Jan 2014 11:14:28 -0700 Subject: [Numpy-discussion] A one-byte string dtype? In-Reply-To: References: Message-ID: On Tue, Jan 21, 2014 at 11:00 AM, Chris Barker wrote: > A lot of good discussion here -- to much to comment individually, but it > seems we can boil it down to a couple somewhat distinct proposals: > > 1) a one-byte-per-char dtype: > > This would provide compact, high efficiency storage for common text > for scientific computing. It is analogous to a lower-precision numeric type > -- i.e. it could not store any unicode strings -- only the subset that are > compatible the suggested encoding. > Suggested encoding: latin-1 > Other options: > - ascii only. > - settable to any one-byte per char encoding supported by python > I like this IFF it's pretty easy, but it may > add significant complications (and overhead) for comparisons, etc.... > > NOTE: This is NOT a way to conflate bytes and text, and not a way to "go > back to the py2 mojibake hell" -- the goal here is to very clearly have > this be text data, and have a clearly defined encoding. Which is why we > can't just use 'S' -- or adapt 'S' to do this. 
Rather is is a way > to conveniently and efficiently use numpy for text that is ansi compatible. > > 2) a utf-8 dtype: > NOTE: this CAN NOT be used in place of (1) above. It is not a one-byte > per char encoding, so would not snuggly into the numpy data model. > It would give compact memory use for mostly-ascii data, so that would > be nice. > > 3) a fully python-3 like ( PEP 393 ) flexible unicode dtype. > This would get us the advantages of the new py3 unicode model -- compact > and efficient when it can be, but also supporting all of unicode. Honestly, > this seems like more work than it's worth to me, at least given the current > numpy dtype model -- maybe a nice addition to dynd. YOu can, after > all, simply use an object array with py3 strings in it. Though perhaps > using the py3 unicode type, but having a dtype that specifically links to > that, rather than a generic python object would be a good compromise. > > > Hmm -- I guess despite what I said, I just write the starting pint for a > NEP... > > Should also mention the reasons for adding a new data type. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From d.l.goldsmith at gmail.com Tue Jan 21 13:34:38 2014 From: d.l.goldsmith at gmail.com (David Goldsmith) Date: Tue, 21 Jan 2014 10:34:38 -0800 Subject: [Numpy-discussion] A one-byte string dtype? Message-ID: On Tue, Jan 21, 2014 at 10:00 AM, wrote: > Date: Tue, 21 Jan 2014 09:53:25 -0800 > From: David Goldsmith > Subject: Re: [Numpy-discussion] A one-byte string dtype? > To: numpy-discussion at scipy.org > Message-ID: > 7aLTPXMrz4MiujY2XEbyi_fY5WWw at mail.gmail.com> > Content-Type: text/plain; charset="iso-8859-1" > > > Date: Tue, 21 Jan 2014 17:35:26 +0000 > > From: Nathaniel Smith > > Subject: Re: [Numpy-discussion] A one-byte string dtype? > > To: Discussion of Numerical Python > > Message-ID: > > > KE3xLGa2+gz+Qd4F0xS2UBoEYsgedA at mail.gmail.com> > > Content-Type: text/plain; charset="utf-8" > > > > On 21 Jan 2014 17:28, "David Goldsmith" wrote: > > > > > > > > > Am I the only one who feels that this (very important--I'm being > sincere, > > not sarcastic) thread has matured and specialized enough to warrant it's > > own home on the Wiki? > > > > Sounds plausible, perhaps you could write up such a page? > > > > -n > > > > I can certainly get one started (but I don't think I can faithfully > summarize all this thread's current content, so I apologize in advance for > leaving that undone). > > DG > OK, I'm "lost" already: is there general agreement that this should "jump" straight to one or more NEP's? If not (or if there should be a Wiki page for it additionally), should such become part of the NumPy Wiki @ Sourceforge or the SciPy Wiki at the scipy.org site? If the latter, is one's SciPy Wiki login the same as one's mailing list subscriber maintenance login? I guess starting such a page is not as trivial as I had assumed. DG -------------- next part -------------- An HTML attachment was scrubbed... URL: From robert.kern at gmail.com Tue Jan 21 14:20:12 2014 From: robert.kern at gmail.com (Robert Kern) Date: Tue, 21 Jan 2014 19:20:12 +0000 Subject: [Numpy-discussion] A one-byte string dtype? In-Reply-To: References: Message-ID: On Tue, Jan 21, 2014 at 6:34 PM, David Goldsmith wrote: >> I can certainly get one started (but I don't think I can faithfully >> summarize all this thread's current content, so I apologize in advance for >> leaving that undone). 
>> >> DG > > OK, I'm "lost" already: is there general agreement that this should "jump" straight to one or more NEP's? If not (or if there should be a Wiki page for it additionally), should such become part of the NumPy Wiki @ Sourceforge or the SciPy Wiki at the scipy.org site? If the latter, is one's SciPy Wiki login the same as one's mailing list subscriber maintenance login? I guess starting such a page is not as trivial as I had assumed. The wiki is frozen. Please do not add anything to it. It plays no role in our current development workflow. Drafting a NEP or two and iterating on them would be the next step. -- Robert Kern -------------- next part -------------- An HTML attachment was scrubbed... URL: From andrew.collette at gmail.com Tue Jan 21 18:22:50 2014 From: andrew.collette at gmail.com (Andrew Collette) Date: Tue, 21 Jan 2014 16:22:50 -0700 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: References: <52D68161.7090807@googlemail.com> <20140116104303.GA11119@gmail.com> <52D92DAE.5020409@witherden.org> Message-ID: Hi Chris, Just stumbled on this discussion (I'm the lead author of h5py). We would be overjoyed if there were a 1-byte text type available in NumPy. String handling is the source of major pain right now in the HDF5 world. All HDF5 strings are text (opaque types are used for binary data), but we're forced into using the "S" type most of the time because (1) the "U" type doesn't round-trip between HDF5 and NumPy, as there's no fixed-width wide-character string type in HDF5, and (2) "U" takes 4x the space, which is a problem for big scientific datasets. ASCII-only would be preferable, partly for selfish reasons (HDF5's default is ASCII only), and partly to make it possible to copy them into containers labelled "UTF-8" without manually inspecting every value. > """At the high-level interface, h5py exposes three kinds of strings. Each > maps to a specific type within Python (but see str_py3 below): > > Fixed-length ASCII (NumPy S type) > .... > """ > This is wrong, or mis-guided, or maybe only a little confusing -- 'S' is not > an ASCII string (even though I wish it were...). But clearly the HDF folsk > think we need one! Yes, this was intended to state that the HDF5 "Fixed-width ASCII" type maps to NumPy "S" at conversion time, which is obviously a wretched solution on Py3. >>>> dset = f.create_dataset("string_ds", (100,), dtype="S10") > """ > Pardon my py3 ignorance -- is numpy.string_ the same as 'S' in py3? Form > another post, I thought you'd need to use numpy.bytes_ (which is the same on > py2) It does produce an instance of 'numpy.bytes_', although I think the h5py docs should be changed to use bytes_ explicitly. Andrew From chris.barker at noaa.gov Tue Jan 21 19:30:23 2014 From: chris.barker at noaa.gov (Chris Barker) Date: Tue, 21 Jan 2014 16:30:23 -0800 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: References: <52D68161.7090807@googlemail.com> <20140116104303.GA11119@gmail.com> <52D92DAE.5020409@witherden.org> Message-ID: On Tue, Jan 21, 2014 at 3:22 PM, Andrew Collette wrote: > Just stumbled on this discussion (I'm the lead author of h5py). > > We would be overjoyed if there were a 1-byte text type available in > NumPy. cool -- it looks like someone is going to get a draft PEP going -- so stay tuned, and add you comments when there is something to add them too.. String handling is the source of major pain right now in the > HDF5 world. 
All HDF5 strings are text (opaque types are used for > binary data), but we're forced into using the "S" type most of the > time because (1) the "U" type doesn't round-trip between HDF5 and > NumPy, as there's no fixed-width wide-character string type in HDF5, > it looks from here: http://www.hdfgroup.org/HDF5/doc/ADGuide/WhatsNew180.html that HDF uses utf-8 for unicode strings -- so you _could_ roundtrip with a lot of calls to encode/decode -- which could be pretty slow, compared to other ways to dump numpy arrays into HDF-5 -- that may be what you mean by "doesn't round trip". This may be a good case for a numpy utf-8 dtype, I suppose (or an arbitrary encoding dtype, anyway). But: How does HDF handle the fact that utf-8 is not a fixed-length encoding? ASCII-only would be preferable, partly for selfish reasons (HDF5's > default is ASCII only), and partly to make it possible to copy them > into containers labelled "UTF-8" without manually inspecting every > value. > hmm -- ascii does have those advantages, but I'm not sure it's worth the restriction on what can be encoded. But you're quite right, you could dump ascii straight into something expecting utf-8, whereas you could not do that with latin-1, for instance. But you can't go the other way -- does it help much to avoid encoding in one direction? But maybe we can have an any-one-byte-per-char encoding option, in which case h5py could use ascii, but we wouldn't have to everywhere. -Chris -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov -------------- next part -------------- An HTML attachment was scrubbed... URL: From d.l.goldsmith at gmail.com Tue Jan 21 19:58:30 2014 From: d.l.goldsmith at gmail.com (David Goldsmith) Date: Tue, 21 Jan 2014 16:58:30 -0800 Subject: [Numpy-discussion] A one-byte string dtype? Message-ID: Date: Tue, 21 Jan 2014 19:20:12 +0000 > From: Robert Kern > Subject: Re: [Numpy-discussion] A one-byte string dtype? > > The wiki is frozen. Please do not add anything to it. It plays no role in > our current development workflow. Drafting a NEP or two and iterating on > them would be the next step. > > -- > Robert Kern > OK, well that's definitely beyond my level of expertise.
-Chris > > DG > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From andrew.collette at gmail.com Tue Jan 21 20:54:33 2014 From: andrew.collette at gmail.com (Andrew Collette) Date: Tue, 21 Jan 2014 18:54:33 -0700 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: References: <52D68161.7090807@googlemail.com> <20140116104303.GA11119@gmail.com> <52D92DAE.5020409@witherden.org> Message-ID: Hi Chris, > it looks from here: > http://www.hdfgroup.org/HDF5/doc/ADGuide/WhatsNew180.html > > that HDF uses utf-8 for unicode strings -- so you _could_ roundtrip with a > lot of calls to encode/decode -- which could be pretty slow, compared to > other ways to dump numpy arrays into HDF-5 -- that may be waht you mean by > "doesn't round trip". HDF5 does have variable-length string support for UTF-8, so we map that directly to the unicode type (str on Py3) exactly as you describe, by encoding when we write to the file. But there's no way to round-trip with *fixed-width* strings. You can go from e.g. a 10 byte ASCII string to "U10", but going the other way fails if there are characters which take more than 1 byte to represent. We don't always get to choose the destination type, when e.g. writing into an existing dataset, so we can't always write vlen strings. > This may be a good case for a numpy utf-8 dtype, I suppose (or a arbitrary > encoding dtype, anyway). > But: How does hdf handle the fact that utf-8 is not a fixed length encoding? With fixed-width strings it doesn't, really. If you use vlen strings it's fine, but otherwise there's just a fixed-width buffer labelled "UTF-8". Presumably you're supposed to be careful when writing not to chop the string off in the middle of a multibyte character. We could truncate strings on their way to the file, but the risk of data loss/corruption led us to simply not support it at all. > hmm -- ascii does have those advantages, but I'm not sure its worth the > restriction on what can be encoded. But you're quite right, you could dump > asciii straight into something expecting utf-8, whereas you could not do > that with latin-1, for instance. But you can't go the other way -- does it > help much to avoided encoding in one direction? It would help for h5py specifically because most HDF5 strings are labelled "ASCII". But it's a question for the community which is more important: the high-bit characters in latin-1, or write-compatibility with UTF-8. Andrew From fhaxbox66 at googlemail.com Wed Jan 22 01:58:27 2014 From: fhaxbox66 at googlemail.com (Dr. Leo) Date: Wed, 22 Jan 2014 07:58:27 +0100 Subject: [Numpy-discussion] fromiter cannot create array of object - was: Creating an ndarray from an iterable, over sequences In-Reply-To: References: Message-ID: <52DF6C13.8030400@gmail.com> Hi, thanks. Both recarray and itertools.chain work just fine in the example case. However, the real purpose of this is to read strings from a large xml file into a pandas DataFrame. But fromiter cannot create arrays of dtype 'object'. Fixed length strings may be worth trying. But as the xml schema does not guarantee a max. length, and pandas generally uses 'object' arrays for strings, I see no better way than creating the array through list comprehensions and turn it into a DataFrame. Maybe a variable length string/unicode type would help in the long term. 
Leo > > I would like to write something like: > > In [25]: iterable=((i, i**2) for i in range(10)) > > In [26]: a=np.fromiter(iterable, int32) > --------------------------------------------------------------------------- > ValueError Traceback (most recent call > last) > in () > ----> 1 a=np.fromiter(iterable, int32) > > ValueError: setting an array element with a sequence. > > > Is there an efficient way to do this? > Perhaps you could just utilize structured arrays ( http://docs.scipy.org/doc/numpy/user/basics.rec.html), like: iterable= ((i, i**2) for i in range(10)) a= np.fromiter(iterable, [('a', int32), ('b', int32)], 10) a.view(int32).reshape(-1, 2) You could use itertools: >>> from itertools import chain >>> g = ((i, i**2) for i in range(10)) >>> import numpy >>> numpy.fromiter(chain.from_iterable(g), numpy.int32).reshape(-1, 2) From oscar.j.benjamin at gmail.com Wed Jan 22 05:46:49 2014 From: oscar.j.benjamin at gmail.com (Oscar Benjamin) Date: Wed, 22 Jan 2014 10:46:49 +0000 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: References: <20140116104303.GA11119@gmail.com> <52D92DAE.5020409@witherden.org> Message-ID: <20140122104646.GA2555@gmail.com> On Tue, Jan 21, 2014 at 06:54:33PM -0700, Andrew Collette wrote: > Hi Chris, > > > it looks from here: > > http://www.hdfgroup.org/HDF5/doc/ADGuide/WhatsNew180.html > > > > that HDF uses utf-8 for unicode strings -- so you _could_ roundtrip with a > > lot of calls to encode/decode -- which could be pretty slow, compared to > > other ways to dump numpy arrays into HDF-5 -- that may be waht you mean by > > "doesn't round trip". > > HDF5 does have variable-length string support for UTF-8, so we map > that directly to the unicode type (str on Py3) exactly as you > describe, by encoding when we write to the file. But there's no way > to round-trip with *fixed-width* strings. You can go from e.g. a 10 > byte ASCII string to "U10", but going the other way fails if there are > characters which take more than 1 byte to represent. We don't always > get to choose the destination type, when e.g. writing into an existing > dataset, so we can't always write vlen strings. Is it fair to say that people should really be using vlen utf-8 strings for text? Is it problematic because of the need to interface with non-Python libraries using the same hdf5 file? > > This may be a good case for a numpy utf-8 dtype, I suppose (or a arbitrary > > encoding dtype, anyway). That's what I was thinking. A ragged utf-8 array could map to an array of vlen strings. Or am I misunderstanding how hdf5 works? Looking here: http://www.h5py.org/docs/topics/special.html ''' HDF5 supports a few types which have no direct NumPy equivalent. Among the most useful and widely used are variable-length (VL) types, and enumerated types. As of version 1.2, h5py fully supports HDF5 enums, and has partial support for VL types. ''' So that seems to suggests that h5py already has a use for a variable length string dtype. BTW, as much as the fixed-width 'S' dtype doesn't really work for str in Python 3 it's also a poor fit for bytes since it strips trailing nulls: >>> a = np.array(['a\0s\0', 'qwert'], dtype='S') >>> a array([b'a\x00s', b'qwert'], dtype='|S5') >>> a[0] b'a\x00s' > > But: How does hdf handle the fact that utf-8 is not a fixed length encoding? > > With fixed-width strings it doesn't, really. If you use vlen strings > it's fine, but otherwise there's just a fixed-width buffer labelled > "UTF-8". 
Presumably you're supposed to be careful when writing not to > chop the string off in the middle of a multibyte character. We could > truncate strings on their way to the file, but the risk of data > loss/corruption led us to simply not support it at all. Truncating utf-8 is never a good idea. Throwing an error message when it would truncate is okay though. Presumably you already do this when someone tries to assign an ASCII string that's too long right? Oscar From sebastian at sipsolutions.net Wed Jan 22 06:13:00 2014 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Wed, 22 Jan 2014 12:13:00 +0100 Subject: [Numpy-discussion] fromiter cannot create array of object - was: Creating an ndarray from an iterable, over sequences In-Reply-To: <52DF6C13.8030400@gmail.com> References: <52DF6C13.8030400@gmail.com> Message-ID: <1390389180.31254.1.camel@sebastian-laptop> On Wed, 2014-01-22 at 07:58 +0100, Dr. Leo wrote: > Hi, > > thanks. Both recarray and itertools.chain work just fine in the example > case. > > However, the real purpose of this is to read strings from a large xml > file into a pandas DataFrame. But fromiter cannot create arrays of dtype > 'object'. Fixed length strings may be worth trying. But as the xml > schema does not guarantee a max. length, and pandas generally uses > 'object' arrays for strings, I see no better way than creating the array > through list comprehensions and turn it into a DataFrame. If your datatype is object, I doubt that using an intermediate list is a real overhead, since the list will use much less memory then the string objects anyway. - Sebastian > > Maybe a variable length string/unicode type would help in the long term. > > Leo > > > > > > I would like to write something like: > > > > In [25]: iterable=((i, i**2) for i in range(10)) > > > > In [26]: a=np.fromiter(iterable, int32) > > --------------------------------------------------------------------------- > > ValueError Traceback (most recent call > > last) > > in () > > ----> 1 a=np.fromiter(iterable, int32) > > > > ValueError: setting an array element with a sequence. > > > > > > Is there an efficient way to do this? > > > Perhaps you could just utilize structured arrays ( > http://docs.scipy.org/doc/numpy/user/basics.rec.html), like: > iterable= ((i, i**2) for i in range(10)) > a= np.fromiter(iterable, [('a', int32), ('b', int32)], 10) > a.view(int32).reshape(-1, 2) > > You could use itertools: > > >>> from itertools import chain > >>> g = ((i, i**2) for i in range(10)) > >>> import numpy > >>> numpy.fromiter(chain.from_iterable(g), numpy.int32).reshape(-1, 2) > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From Ralf.Juengling at synopsys.com Wed Jan 22 12:23:35 2014 From: Ralf.Juengling at synopsys.com (Ralf Juengling) Date: Wed, 22 Jan 2014 17:23:35 +0000 Subject: [Numpy-discussion] accumulation operation Message-ID: <1150F30B0E0E7844ABF5FB64D0A8FA58240B5D17@US01WEMBX2.internal.synopsys.com> Executing the following code, >>> import numpy as np >>> a = np.zeros((3,)) >>> w = np.array([0, 1, 0, 1, 2]) >>> v = np.array([10.0, 1, 10.0, 2, 9]) >>> a[w] += v I was expecting 'a' to be array([20., 3., 9.]. Instead I get >>> a array([ 10., 2., 9.]) This with numpy version 1.6.1. Is there another way to do the accumulation I want? Thanks, Ralf -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From sebastian at sipsolutions.net Wed Jan 22 12:32:15 2014 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Wed, 22 Jan 2014 18:32:15 +0100 Subject: [Numpy-discussion] accumulation operation In-Reply-To: <1150F30B0E0E7844ABF5FB64D0A8FA58240B5D17@US01WEMBX2.internal.synopsys.com> References: <1150F30B0E0E7844ABF5FB64D0A8FA58240B5D17@US01WEMBX2.internal.synopsys.com> Message-ID: <1390411935.31254.4.camel@sebastian-laptop> On Wed, 2014-01-22 at 17:23 +0000, Ralf Juengling wrote: > Executing the following code, > > > > >>> import numpy as np > > >>> a = np.zeros((3,)) > > >>> w = np.array([0, 1, 0, 1, 2]) > > >>> v = np.array([10.0, 1, 10.0, 2, 9]) > > >>> a[w] += v > > > > I was expecting ?a? to be array([20., 3., 9.]. Instead I get > > > > >>> a > > array([ 10., 2., 9.]) > > > > This with numpy version 1.6.1. > > Is there another way to do the accumulation I want? > > Since you have addition, you should use np.bincount - Sebastian > > Thanks, > Ralf > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From jtaylor.debian at googlemail.com Wed Jan 22 12:32:35 2014 From: jtaylor.debian at googlemail.com (Julian Taylor) Date: Wed, 22 Jan 2014 18:32:35 +0100 Subject: [Numpy-discussion] accumulation operation In-Reply-To: <1150F30B0E0E7844ABF5FB64D0A8FA58240B5D17@US01WEMBX2.internal.synopsys.com> References: <1150F30B0E0E7844ABF5FB64D0A8FA58240B5D17@US01WEMBX2.internal.synopsys.com> Message-ID: <52E000B3.5000900@googlemail.com> On 22.01.2014 18:23, Ralf Juengling wrote: > Executing the following code, > > > >>>> import numpy as np > >>>> a = np.zeros((3,)) > >>>> w = np.array([0, 1, 0, 1, 2]) > >>>> v = np.array([10.0, 1, 10.0, 2, 9]) > >>>> a[w] += v > > > > I was expecting ?a? to be array([20., 3., 9.]. Instead I get > > > >>>> a > > array([ 10., 2., 9.]) > > > > This with numpy version 1.6.1. > > Is there another way to do the accumulation I want? > you want: np.add.at(a, w, v) which is available in numpy 1.8 From andrew.collette at gmail.com Wed Jan 22 12:45:56 2014 From: andrew.collette at gmail.com (Andrew Collette) Date: Wed, 22 Jan 2014 10:45:56 -0700 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: <20140122104646.GA2555@gmail.com> References: <20140116104303.GA11119@gmail.com> <52D92DAE.5020409@witherden.org> <20140122104646.GA2555@gmail.com> Message-ID: Hi Oscar, > Is it fair to say that people should really be using vlen utf-8 strings for > text? Is it problematic because of the need to interface with non-Python > libraries using the same hdf5 file? The general recommendation has been to use fixed-width strings for exactly that reason; FORTRAN programs can't handle vlens, and older versions of IDL would refuse to deal with anything labelled utf-8, even fixed-width. >> > This may be a good case for a numpy utf-8 dtype, I suppose (or a arbitrary >> > encoding dtype, anyway). > > That's what I was thinking. A ragged utf-8 array could map to an array of vlen > strings. Or am I misunderstanding how hdf5 works? Yes, that's exactly how HDF5 works for this; at the moment, we handle vlens with the NumPy object ("O") type storing regular Python strings. A native variable-length NumPy equivalent would also be appreciated, although I suspect it's a lot of work. > Truncating utf-8 is never a good idea. Throwing an error message when it would > truncate is okay though. 
Presumably you already do this when someone tries to > assign an ASCII string that's too long right? We advertise that HDF5 datasets work identically (as closely as practical) to NumPy arrays; in this case, NumPy truncates and doesn't warn, so we do the same. The concern with "U" is more that someone would write a "U10" string into a 10-byte HDF5 buffer and lose data, even though the advertised widths were the same. As an observation, a pure-ASCII NumPy type like the proposed "s" would avoid that completely. With a latin-1 type, it could still happen as certain characters would become 2 UTF-8 bytes. Andrew From chris.barker at noaa.gov Wed Jan 22 15:07:28 2014 From: chris.barker at noaa.gov (Chris Barker) Date: Wed, 22 Jan 2014 12:07:28 -0800 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: <20140122104646.GA2555@gmail.com> References: <20140116104303.GA11119@gmail.com> <52D92DAE.5020409@witherden.org> <20140122104646.GA2555@gmail.com> Message-ID: On Wed, Jan 22, 2014 at 2:46 AM, Oscar Benjamin wrote: > BTW, as much as the fixed-width 'S' dtype doesn't really work for str in > Python 3 it's also a poor fit for bytes since it strips trailing nulls: > > >>> a = np.array(['a\0s\0', 'qwert'], dtype='S') > >>> a > array([b'a\x00s', b'qwert'], > dtype='|S5') > >>> a[0] > b'a\x00s' WHOOA! Good catch, Oscar. This conversation started with me suggesting that 'S' on py3 should mean "ascii string" (or latin-1 string). Then it was pointed out that it was already being used for arbitrary bytes, and thus could not be changed to mean a string without breaking already working code. However, if 'S' is assigning meaning to null bytes, and doing something with that, then it is, indeed being treated as an ANSI string (or the old c string "type", anyway). And any code that is expecting it to be arbitrary bytes is already broken, and in a way that could result in pretty subtle, hard to find bugs in the future. I think we really need a proper bytes dtype (which could be 'S' with the null byte thing removed), and a proper one-byte-per-character string type. Though I still don't know the use case for the fixed-length bytes type that can't be satisfied with the other numeric types, maybe: In [58]: bytes_15 = np.dtype(('B', 15)) though that doesn't in fact do what I expect: In [59]: arr = np.zeros((5,), dtype = bytes_15) In [60]: arr Out[60]: array([[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0], [0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0], [0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0], [0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0], [0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0]], dtype=uint8) shouldn't I get a shape (5,) array, with each element a compound dtype with 15 bytes in it??? How would I spell that? By the way, from the docs for dtypes: http://docs.scipy.org/doc/numpy/reference/arrays.dtypes.html """ The first character specifies the kind of data and the remaining characters specify how many bytes of data. The supported kinds are 'b' Boolean 'i' (signed) integer 'u' unsigned integer 'f' floating-point 'c' complex-floating point 'S', 'a', string 'U' unicode 'V' raw data (void) """ Could we use the 'a' for ascii string? (even though in now mapps directly to 'S') And by the way, the docs clearly say "string" there -- not bytes, so at the very least we need to update the docs... -Chris Christopher Barker, Ph.D. 
Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov -------------- next part -------------- An HTML attachment was scrubbed... URL: From oscar.j.benjamin at gmail.com Wed Jan 22 16:13:32 2014 From: oscar.j.benjamin at gmail.com (Oscar Benjamin) Date: Wed, 22 Jan 2014 21:13:32 +0000 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: References: <52D92DAE.5020409@witherden.org> <20140122104646.GA2555@gmail.com> Message-ID: <20140122211328.GA1938@gmail.com> On Wed, Jan 22, 2014 at 12:07:28PM -0800, Chris Barker wrote: > On Wed, Jan 22, 2014 at 2:46 AM, Oscar Benjamin > wrote: > > > BTW, as much as the fixed-width 'S' dtype doesn't really work for str in > > Python 3 it's also a poor fit for bytes since it strips trailing nulls: > > > > >>> a = np.array(['a\0s\0', 'qwert'], dtype='S') > > >>> a > > array([b'a\x00s', b'qwert'], > > dtype='|S5') > > >>> a[0] > > b'a\x00s' > > > WHOOA! Good catch, Oscar. > > This conversation started with me suggesting that 'S' on py3 should mean > "ascii string" (or latin-1 string). > > Then it was pointed out that it was already being used for arbitrary bytes, > and thus could not be changed to mean a string without breaking already > working code. > > However, if 'S' is assigning meaning to null bytes, and doing something > with that, then it is, indeed being treated as an ANSI string (or the old c > string "type", anyway). And any code that is expecting it to be arbitrary > bytes is already broken, and in a way that could result in pretty subtle, > hard to find bugs in the future. > > I think we really need a proper bytes dtype (which could be 'S' with the > null byte thing removed), and a proper one-byte-per-character string type. It's not safe to stop removing the null bytes. This is how numpy determines the length of the strings in a dtype='S' array. The strings are not "fixed-width" but rather have a maximum width. Aything shorter gets padded with nulls. This is transparent if you index strings from the array: >>> a = np.array(b'a string of different length words'.split(), dtype='S') >>> a array([b'a', b'string', b'of', b'different', b'length', b'words'], dtype='|S9') >>> a[0] b'a' >>> len(a[0]) 1 >>> a.tostring() b'a\x00\x00\x00\x00\x00\x00\x00\x00string\x00\x00\x00of\x00\x00\x00\x00\x00\x00\x00differentlength\x00\x00\x00words\x00\x00\x00\x00'o If the trailing nulls are not removed then you would get: >>> a[0] b'a\x00\x00\x00\x00\x00\x00\x00\x00\x00' >>> len(a[0]) 9 And I'm sure that someone would get upset about that. > Though I still don't know the use case for the fixed-length bytes type that > can't be satisfied with the other numeric types, Having the null bytes removed and a str (on Py2) object returned is precisely the use case that distinguishes it from np.uint8. The other differences are the removal of arithmetic operations. 
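To make that contrast concrete, here is a minimal sketch (an illustration only, assuming a contiguous array): the same buffer viewed as uint8 keeps every padding byte and gets arithmetic back, but loses the string-like scalar access that 'S' provides.

import numpy as np

a = np.array(b'a string of different length words'.split(), dtype='S')
raw = a.view(np.uint8).reshape(len(a), a.dtype.itemsize)

a[0]     # b'a' -- trailing padding nulls stripped, comes back as a bytes scalar
raw[0]   # array([97, 0, 0, 0, 0, 0, 0, 0, 0], dtype=uint8) -- nothing stripped,
         # plain numbers, no text behaviour at all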
Some more oddities: >>> a[0] = 1 >>> a array([b'1', b'string', b'of', b'different', b'length', b'words'], dtype='|S9') >>> a[0] = None >>> a array([b'None', b'string', b'of', b'different', b'length', b'words'], dtype='|S9') >>> a[0] = range(1, 2) Traceback (most recent call last): File "", line 1, in ValueError: cannot set an array element with a sequence >>> a[0] = (x for x in range(2)) >>> a array([b' References: Message-ID: On Tue, Jan 21, 2014 at 5:46 PM, Charles R Harris wrote: > > > > On Tue, Jan 21, 2014 at 9:26 AM, jennifer stone wrote: > >> >> >What are your interests and experience? If you use numpy, are there >>> things >>> >you would like to fix, or enhancements you would like to see? >>> >>> Chuck >>> >>> >> I am an undergraduate student with CS as major and have interest in Math >> and Physics. This has led me to use NumPy and SciPy to work on innumerable >> cases involving special polynomial functions and polynomials like Legendre >> polynomials, Bessel Functions and so on. So, The packages are closer known >> to me from this point of view. I have a* few proposals* in mind. But I >> don't have any idea if they are acceptable within the scope of GSoC >> 1. Many special functions and polynomials are neither included in NumPy >> nor on SciPy.. These include Ellipsoidal Harmonic Functions (lames >> function), Cylindrical Harmonic function. Scipy at present supports only >> spherical Harmonic function. >> > > Further, why cant we extend SciPy to incorporate* Inverse Laplace >> Transforms*? At present Matlab has this amazing function *ilaplace* and >> SymPy does have *Inverse_Laplace_transform* but it would be better to >> incorporate all in one package. I mean SciPy does have function to evaluate >> laplace transform >> > Scipy doesn't have a function for the Laplace transform, it has only a Laplace distribution in scipy.stats and a Laplace filter in scipy.ndimage. An inverse Laplace transform would be very welcome I'd think - it has real world applications, and there's no good implementation in any open source library as far as I can tell. It's probably doable, but not the easiest topic for a GSoC I think. From what I can find, the paper "Numerical Transform Inversion Using Gaussian Quadrature" from den Iseger contains what's considered the current state of the art algorithm. Browsing that gives a reasonable idea of the difficulty of implementing `ilaplace`. > After having written this, I feel that this post should have been sent to >> SciPy >> but as a majority of contributors are the same I proceed. >> Please suggest any other possible projects, >> > You can have a look at https://github.com/scipy/scipy/pull/2908/files for ideas. Most of the things that need improving or we really think we should have in Scipy are listed there. Possible topics are not restricted to that list though - it's more important that you pick something you're interested in and have the required background and coding skills for. Cheers, Ralf as I would like to continue with SciPy or NumPy, preferably NumPy as I have >> been fiddling with its source code for a month now and so am pretty >> comfortable with it. >> >> As for my experience, I have known C for past 4 years and have been a >> python lover for past 1 year. I am pretty new to open source communities, >> started before a manth and a half. >> >> > It does sound like scipy might be a better match, I don't think anyone > would complain if you cross posted. 
Both scipy and numpy require GSOC > candidates to have a pull request accepted as part of the application > process. I'd suggest implementing a function not currently in scipy that > you think would be useful. That would also help in finding a mentor for the > summer. I'd also suggest getting familiar with cython. > > Chuck > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From chris.barker at noaa.gov Wed Jan 22 20:53:26 2014 From: chris.barker at noaa.gov (Chris Barker - NOAA Federal) Date: Wed, 22 Jan 2014 17:53:26 -0800 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: <20140122211328.GA1938@gmail.com> References: <52D92DAE.5020409@witherden.org> <20140122104646.GA2555@gmail.com> <20140122211328.GA1938@gmail.com> Message-ID: <7210200738621770529@unknownmsgid> On Jan 22, 2014, at 1:13 PM, Oscar Benjamin wrote: > > It's not safe to stop removing the null bytes. This is how numpy determines > the length of the strings in a dtype='S' array. The strings are not > "fixed-width" but rather have a maximum width. Exactly--but folks have told us on this list that they want (and are) using the 'S' style for arbitrary bytes, NOT for text. In which case you wouldn't want to remove null bytes. This is more evidence that 'S' was designed to handle c-style one-byte-per-char strings, and NOT arbitrary bytes, and thus not to map directly to the py2 string type (you can store null bytes in a py2 string" Which brings me back to my original proposal: properly map the 'S' type to the py3 data model, and maybe add some kind of fixed width bytes style of there is a use case for that. I still have no idea what the use case might be. > If the trailing nulls are not removed then you would get: > >>>> a[0] > b'a\x00\x00\x00\x00\x00\x00\x00\x00\x00' >>>> len(a[0]) > 9 > > And I'm sure that someone would get upset about that. Only if they are using it for text-which you "should not" do with py3. > Having the null bytes removed and a str (on Py2) object returned is precisely > the use case that distinguishes it from np.uint8. But that was because it was designed to be used with text . And if you want text, then you should use py3 strings, not bytes. And if you really want bytes, then you wouldn't want null bytes removed. > The other differences are the > removal of arithmetic operations. And 'S' is treated as an atomic element, I'm not sure how you can do that cleanly with uint8. > Some more oddities: > >>>> a[0] = 1 >>>> a > array([b'1', b'string', b'of', b'different', b'length', b'words'], > dtype='|S9') >>>> a[0] = None >>>> a > array([b'None', b'string', b'of', b'different', b'length', b'words'], > dtype='|S9') More evidence that this is a text type..... 
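A short sketch of the same asymmetry on Python 3 (illustration only; the 'U' line shows the behaviour the py3 text model expects):

import numpy as np

s = 'a string'
np.array([s], dtype='S')[0] == s   # False: indexing returns bytes, which never compare equal to str
np.array([s], dtype='U')[0] == s   # True: 'U' round-trips Python 3 str cleanly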
-Chris From oscar.j.benjamin at gmail.com Thu Jan 23 05:45:22 2014 From: oscar.j.benjamin at gmail.com (Oscar Benjamin) Date: Thu, 23 Jan 2014 10:45:22 +0000 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: <7210200738621770529@unknownmsgid> References: <52D92DAE.5020409@witherden.org> <20140122104646.GA2555@gmail.com> <20140122211328.GA1938@gmail.com> <7210200738621770529@unknownmsgid> Message-ID: <20140123104520.GA2300@gmail.com> On Wed, Jan 22, 2014 at 05:53:26PM -0800, Chris Barker - NOAA Federal wrote: > On Jan 22, 2014, at 1:13 PM, Oscar Benjamin wrote: > > > > > It's not safe to stop removing the null bytes. This is how numpy determines > > the length of the strings in a dtype='S' array. The strings are not > > "fixed-width" but rather have a maximum width. > > Exactly--but folks have told us on this list that they want (and are) > using the 'S' style for arbitrary bytes, NOT for text. In which case > you wouldn't want to remove null bytes. This is more evidence that 'S' > was designed to handle c-style one-byte-per-char strings, and NOT > arbitrary bytes, and thus not to map directly to the py2 string type > (you can store null bytes in a py2 string" You can store null bytes in a Py2 string but you normally wouldn't if it was supposed to be text. > > Which brings me back to my original proposal: properly map the 'S' > type to the py3 data model, and maybe add some kind of fixed width > bytes style of there is a use case for that. I still have no idea what > the use case might be. > There would definitely be a use case for a fixed-byte-width bytes-representing-text dtype in record arrays to read from a binary file: dt = np.dtype([ ('name', '|b8:utf-8'), ('param1', ' > If the trailing nulls are not removed then you would get: > > > >>>> a[0] > > b'a\x00\x00\x00\x00\x00\x00\x00\x00\x00' > >>>> len(a[0]) > > 9 > > > > And I'm sure that someone would get upset about that. > > Only if they are using it for text-which you "should not" do with py3. But people definitely are using it for text on Python 3. It should be deprecated in favour of something new but breaking it is just gratuitous. Numpy doesn't have the option to make a clean break with Python 3 precisely because it needs to straddle 2.x and 3.x while numpy-based applications are ported to 3.x. > > Some more oddities: > > > >>>> a[0] = 1 > >>>> a > > array([b'1', b'string', b'of', b'different', b'length', b'words'], > > dtype='|S9') > >>>> a[0] = None > >>>> a > > array([b'None', b'string', b'of', b'different', b'length', b'words'], > > dtype='|S9') > > More evidence that this is a text type..... And the big one: $ python3 Python 3.2.3 (default, Sep 25 2013, 18:22:43) [GCC 4.6.3] on linux2 Type "help", "copyright", "credits" or "license" for more information. 
>>> import numpy as np >>> a = np.array(['asd', 'zxc'], dtype='S') # Note unicode strings >>> a array([b'asd', b'zxc'], dtype='|S3') >>> a[0] = 'qwer' # Unicode string again >>> a array([b'qwe', b'zxc'], dtype='|S3') >>> a[0] = '?scar' Traceback (most recent call last): File "", line 1, in UnicodeEncodeError: 'ascii' codec can't encode character '\xd5' in position 0: ordinal not in range(128) The analogous behaviour was very deliberately removed from Python 3: >>> a[0] == 'qwe' False >>> a[0] == b'qwe' True Oscar From njs at pobox.com Thu Jan 23 10:24:37 2014 From: njs at pobox.com (Nathaniel Smith) Date: Thu, 23 Jan 2014 15:24:37 +0000 Subject: [Numpy-discussion] IRR Message-ID: Hey all, We have a PR languishing that fixes np.irr to handle negative rate-of-returns: https://github.com/numpy/numpy/pull/4210 I don't even know what "IRR" stands for, and it seems rather confusing from the discussion there. Anyone who knows something about the issues is invited to speak up... -n -- Nathaniel J. Smith Postdoctoral researcher - Informatics - University of Edinburgh http://vorpus.org From josef.pktd at gmail.com Thu Jan 23 10:37:02 2014 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 23 Jan 2014 10:37:02 -0500 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: <20140123104520.GA2300@gmail.com> References: <52D92DAE.5020409@witherden.org> <20140122104646.GA2555@gmail.com> <20140122211328.GA1938@gmail.com> <7210200738621770529@unknownmsgid> <20140123104520.GA2300@gmail.com> Message-ID: On Thu, Jan 23, 2014 at 5:45 AM, Oscar Benjamin wrote: > On Wed, Jan 22, 2014 at 05:53:26PM -0800, Chris Barker - NOAA Federal wrote: >> On Jan 22, 2014, at 1:13 PM, Oscar Benjamin wrote: >> >> > >> > It's not safe to stop removing the null bytes. This is how numpy determines >> > the length of the strings in a dtype='S' array. The strings are not >> > "fixed-width" but rather have a maximum width. >> >> Exactly--but folks have told us on this list that they want (and are) >> using the 'S' style for arbitrary bytes, NOT for text. In which case >> you wouldn't want to remove null bytes. This is more evidence that 'S' >> was designed to handle c-style one-byte-per-char strings, and NOT >> arbitrary bytes, and thus not to map directly to the py2 string type >> (you can store null bytes in a py2 string" > > You can store null bytes in a Py2 string but you normally wouldn't if it was > supposed to be text. > >> >> Which brings me back to my original proposal: properly map the 'S' >> type to the py3 data model, and maybe add some kind of fixed width >> bytes style of there is a use case for that. I still have no idea what >> the use case might be. >> > > There would definitely be a use case for a fixed-byte-width > bytes-representing-text dtype in record arrays to read from a binary file: > > dt = np.dtype([ > ('name', '|b8:utf-8'), > ('param1', ' ('param2', ' ... > ]) > > with open('binaryfile', 'rb') as fin: > a = np.fromfile(fin, dtype=dt) > > You could also use this for ASCII if desired. I don't think it really matters > that utf-8 uses variable width as long as a too long byte string throws an > error (and does not truncate). > > For non 8-bit encodings there would have to be some way to handle endianness > without a BOM, but otherwise I think that it's always possible to pad with zero > *bytes* (to a sufficiently large multiple of 4 bytes) when encoding and strip > null *characters* after decoding. 
i.e.: > > $ cat tmp.py > import encodings > > def test_encoding(s1, enc): > b = s1.encode(enc).ljust(32, b'\0') > s2 = b.decode(enc) > index = s2.find('\0') > if index != -1: > s2 = s2[:index] > assert s1 == s2, enc > > encodings_set = set(encodings.aliases.aliases.values()) > > for N, enc in enumerate(encodings_set): > try: > test_encoding('qwe', enc) > except LookupError: > pass > > print('Tested %d encodings without error' % N) > $ python3 tmp.py > Tested 88 encodings without error > >> > If the trailing nulls are not removed then you would get: >> > >> >>>> a[0] >> > b'a\x00\x00\x00\x00\x00\x00\x00\x00\x00' >> >>>> len(a[0]) >> > 9 >> > >> > And I'm sure that someone would get upset about that. >> >> Only if they are using it for text-which you "should not" do with py3. > > But people definitely are using it for text on Python 3. It should be > deprecated in favour of something new but breaking it is just gratuitous. > Numpy doesn't have the option to make a clean break with Python 3 precisely > because it needs to straddle 2.x and 3.x while numpy-based applications are > ported to 3.x. > >> > Some more oddities: >> > >> >>>> a[0] = 1 >> >>>> a >> > array([b'1', b'string', b'of', b'different', b'length', b'words'], >> > dtype='|S9') >> >>>> a[0] = None >> >>>> a >> > array([b'None', b'string', b'of', b'different', b'length', b'words'], >> > dtype='|S9') >> >> More evidence that this is a text type..... > > And the big one: > > $ python3 > Python 3.2.3 (default, Sep 25 2013, 18:22:43) > [GCC 4.6.3] on linux2 > Type "help", "copyright", "credits" or "license" for more information. >>>> import numpy as np >>>> a = np.array(['asd', 'zxc'], dtype='S') # Note unicode strings >>>> a > array([b'asd', b'zxc'], > dtype='|S3') >>>> a[0] = 'qwer' # Unicode string again >>>> a > array([b'qwe', b'zxc'], > dtype='|S3') >>>> a[0] = '?scar' > Traceback (most recent call last): > File "", line 1, in > UnicodeEncodeError: 'ascii' codec can't encode character '\xd5' in position 0: ordinal not in range(128) > > The analogous behaviour was very deliberately removed from Python 3: > >>>> a[0] == 'qwe' > False >>>> a[0] == b'qwe' > True > > > Oscar > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From josef.pktd at gmail.com Thu Jan 23 10:41:30 2014 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 23 Jan 2014 10:41:30 -0500 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: <20140123104520.GA2300@gmail.com> References: <52D92DAE.5020409@witherden.org> <20140122104646.GA2555@gmail.com> <20140122211328.GA1938@gmail.com> <7210200738621770529@unknownmsgid> <20140123104520.GA2300@gmail.com> Message-ID: On Thu, Jan 23, 2014 at 5:45 AM, Oscar Benjamin wrote: > On Wed, Jan 22, 2014 at 05:53:26PM -0800, Chris Barker - NOAA Federal wrote: >> On Jan 22, 2014, at 1:13 PM, Oscar Benjamin wrote: >> >> > >> > It's not safe to stop removing the null bytes. This is how numpy determines >> > the length of the strings in a dtype='S' array. The strings are not >> > "fixed-width" but rather have a maximum width. >> >> Exactly--but folks have told us on this list that they want (and are) >> using the 'S' style for arbitrary bytes, NOT for text. In which case >> you wouldn't want to remove null bytes. 
This is more evidence that 'S' >> was designed to handle c-style one-byte-per-char strings, and NOT >> arbitrary bytes, and thus not to map directly to the py2 string type >> (you can store null bytes in a py2 string" > > You can store null bytes in a Py2 string but you normally wouldn't if it was > supposed to be text. > >> >> Which brings me back to my original proposal: properly map the 'S' >> type to the py3 data model, and maybe add some kind of fixed width >> bytes style of there is a use case for that. I still have no idea what >> the use case might be. >> > > There would definitely be a use case for a fixed-byte-width > bytes-representing-text dtype in record arrays to read from a binary file: > > dt = np.dtype([ > ('name', '|b8:utf-8'), > ('param1', ' ('param2', ' ... > ]) > > with open('binaryfile', 'rb') as fin: > a = np.fromfile(fin, dtype=dt) > > You could also use this for ASCII if desired. I don't think it really matters > that utf-8 uses variable width as long as a too long byte string throws an > error (and does not truncate). > > For non 8-bit encodings there would have to be some way to handle endianness > without a BOM, but otherwise I think that it's always possible to pad with zero > *bytes* (to a sufficiently large multiple of 4 bytes) when encoding and strip > null *characters* after decoding. i.e.: > > $ cat tmp.py > import encodings > > def test_encoding(s1, enc): > b = s1.encode(enc).ljust(32, b'\0') > s2 = b.decode(enc) > index = s2.find('\0') > if index != -1: > s2 = s2[:index] > assert s1 == s2, enc > > encodings_set = set(encodings.aliases.aliases.values()) > > for N, enc in enumerate(encodings_set): > try: > test_encoding('qwe', enc) > except LookupError: > pass > > print('Tested %d encodings without error' % N) > $ python3 tmp.py > Tested 88 encodings without error > >> > If the trailing nulls are not removed then you would get: >> > >> >>>> a[0] >> > b'a\x00\x00\x00\x00\x00\x00\x00\x00\x00' >> >>>> len(a[0]) >> > 9 >> > >> > And I'm sure that someone would get upset about that. >> >> Only if they are using it for text-which you "should not" do with py3. > > But people definitely are using it for text on Python 3. It should be > deprecated in favour of something new but breaking it is just gratuitous. > Numpy doesn't have the option to make a clean break with Python 3 precisely > because it needs to straddle 2.x and 3.x while numpy-based applications are > ported to 3.x. > >> > Some more oddities: >> > >> >>>> a[0] = 1 >> >>>> a >> > array([b'1', b'string', b'of', b'different', b'length', b'words'], >> > dtype='|S9') >> >>>> a[0] = None >> >>>> a >> > array([b'None', b'string', b'of', b'different', b'length', b'words'], >> > dtype='|S9') >> >> More evidence that this is a text type..... > > And the big one: > > $ python3 > Python 3.2.3 (default, Sep 25 2013, 18:22:43) > [GCC 4.6.3] on linux2 > Type "help", "copyright", "credits" or "license" for more information. >>>> import numpy as np >>>> a = np.array(['asd', 'zxc'], dtype='S') # Note unicode strings >>>> a > array([b'asd', b'zxc'], > dtype='|S3') >>>> a[0] = 'qwer' # Unicode string again >>>> a > array([b'qwe', b'zxc'], > dtype='|S3') >>>> a[0] = '?scar' > Traceback (most recent call last): > File "", line 1, in > UnicodeEncodeError: 'ascii' codec can't encode character '\xd5' in position 0: ordinal not in range(128) looks mostly like casting rules to me, which looks like ASCII based instead of an arbitrary encoding. 
>>> a = np.array(['asd', 'zxc'], dtype='S') >>> b = a.astype('U') >>> b[0] = '?scar' >>> a[0] = '?scar' Traceback (most recent call last): File "", line 1, in a[0] = '?scar' UnicodeEncodeError: 'ascii' codec can't encode character '\xd5' in position 0: ordinal not in range(128) >>> b array(['?sc', 'zxc'], dtype='>> b.astype('S') Traceback (most recent call last): File "", line 1, in b.astype('S') UnicodeEncodeError: 'ascii' codec can't encode character '\xd5' in position 0: ordinal not in range(128) >>> b.view('S4') array([b'\xd5', b's', b'c', b'z', b'x', b'c'], dtype='|S4') >>> a.astype('U').astype('S') array([b'asd', b'zxc'], dtype='|S3') Josef > > The analogous behaviour was very deliberately removed from Python 3: > >>>> a[0] == 'qwe' > False >>>> a[0] == b'qwe' > True > > > Oscar > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From josef.pktd at gmail.com Thu Jan 23 11:23:09 2014 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 23 Jan 2014 11:23:09 -0500 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: References: <52D92DAE.5020409@witherden.org> <20140122104646.GA2555@gmail.com> <20140122211328.GA1938@gmail.com> <7210200738621770529@unknownmsgid> <20140123104520.GA2300@gmail.com> Message-ID: On Thu, Jan 23, 2014 at 10:41 AM, wrote: > On Thu, Jan 23, 2014 at 5:45 AM, Oscar Benjamin > wrote: >> On Wed, Jan 22, 2014 at 05:53:26PM -0800, Chris Barker - NOAA Federal wrote: >>> On Jan 22, 2014, at 1:13 PM, Oscar Benjamin wrote: >>> >>> > >>> > It's not safe to stop removing the null bytes. This is how numpy determines >>> > the length of the strings in a dtype='S' array. The strings are not >>> > "fixed-width" but rather have a maximum width. >>> >>> Exactly--but folks have told us on this list that they want (and are) >>> using the 'S' style for arbitrary bytes, NOT for text. In which case >>> you wouldn't want to remove null bytes. This is more evidence that 'S' >>> was designed to handle c-style one-byte-per-char strings, and NOT >>> arbitrary bytes, and thus not to map directly to the py2 string type >>> (you can store null bytes in a py2 string" >> >> You can store null bytes in a Py2 string but you normally wouldn't if it was >> supposed to be text. >> >>> >>> Which brings me back to my original proposal: properly map the 'S' >>> type to the py3 data model, and maybe add some kind of fixed width >>> bytes style of there is a use case for that. I still have no idea what >>> the use case might be. >>> >> >> There would definitely be a use case for a fixed-byte-width >> bytes-representing-text dtype in record arrays to read from a binary file: >> >> dt = np.dtype([ >> ('name', '|b8:utf-8'), >> ('param1', '> ('param2', '> ... >> ]) >> >> with open('binaryfile', 'rb') as fin: >> a = np.fromfile(fin, dtype=dt) >> >> You could also use this for ASCII if desired. I don't think it really matters >> that utf-8 uses variable width as long as a too long byte string throws an >> error (and does not truncate). >> >> For non 8-bit encodings there would have to be some way to handle endianness >> without a BOM, but otherwise I think that it's always possible to pad with zero >> *bytes* (to a sufficiently large multiple of 4 bytes) when encoding and strip >> null *characters* after decoding. 
i.e.: >> >> $ cat tmp.py >> import encodings >> >> def test_encoding(s1, enc): >> b = s1.encode(enc).ljust(32, b'\0') >> s2 = b.decode(enc) >> index = s2.find('\0') >> if index != -1: >> s2 = s2[:index] >> assert s1 == s2, enc >> >> encodings_set = set(encodings.aliases.aliases.values()) >> >> for N, enc in enumerate(encodings_set): >> try: >> test_encoding('qwe', enc) >> except LookupError: >> pass >> >> print('Tested %d encodings without error' % N) >> $ python3 tmp.py >> Tested 88 encodings without error >> >>> > If the trailing nulls are not removed then you would get: >>> > >>> >>>> a[0] >>> > b'a\x00\x00\x00\x00\x00\x00\x00\x00\x00' >>> >>>> len(a[0]) >>> > 9 >>> > >>> > And I'm sure that someone would get upset about that. >>> >>> Only if they are using it for text-which you "should not" do with py3. >> >> But people definitely are using it for text on Python 3. It should be >> deprecated in favour of something new but breaking it is just gratuitous. >> Numpy doesn't have the option to make a clean break with Python 3 precisely >> because it needs to straddle 2.x and 3.x while numpy-based applications are >> ported to 3.x. >> >>> > Some more oddities: >>> > >>> >>>> a[0] = 1 >>> >>>> a >>> > array([b'1', b'string', b'of', b'different', b'length', b'words'], >>> > dtype='|S9') >>> >>>> a[0] = None >>> >>>> a >>> > array([b'None', b'string', b'of', b'different', b'length', b'words'], >>> > dtype='|S9') >>> >>> More evidence that this is a text type..... >> >> And the big one: >> >> $ python3 >> Python 3.2.3 (default, Sep 25 2013, 18:22:43) >> [GCC 4.6.3] on linux2 >> Type "help", "copyright", "credits" or "license" for more information. >>>>> import numpy as np >>>>> a = np.array(['asd', 'zxc'], dtype='S') # Note unicode strings >>>>> a >> array([b'asd', b'zxc'], >> dtype='|S3') >>>>> a[0] = 'qwer' # Unicode string again >>>>> a >> array([b'qwe', b'zxc'], >> dtype='|S3') >>>>> a[0] = '?scar' >> Traceback (most recent call last): >> File "", line 1, in >> UnicodeEncodeError: 'ascii' codec can't encode character '\xd5' in position 0: ordinal not in range(128) > > looks mostly like casting rules to me, which looks like ASCII based > instead of an arbitrary encoding. 
> >>>> a = np.array(['asd', 'zxc'], dtype='S') >>>> b = a.astype('U') >>>> b[0] = '?scar' >>>> a[0] = '?scar' > Traceback (most recent call last): > File "", line 1, in > a[0] = '?scar' > UnicodeEncodeError: 'ascii' codec can't encode character '\xd5' in > position 0: ordinal not in range(128) >>>> b > array(['?sc', 'zxc'], > dtype='>>> b.astype('S') > Traceback (most recent call last): > File "", line 1, in > b.astype('S') > UnicodeEncodeError: 'ascii' codec can't encode character '\xd5' in > position 0: ordinal not in range(128) >>>> b.view('S4') > array([b'\xd5', b's', b'c', b'z', b'x', b'c'], > dtype='|S4') > >>>> a.astype('U').astype('S') > array([b'asd', b'zxc'], > dtype='|S3') another curious example, encode utf-8 to latin-1 bytes >>> b array(['?sc', 'zxc'], dtype='>> b[0].encode('utf8') b'\xc3\x95sc' >>> b[0].encode('latin1') b'\xd5sc' >>> b.astype('S') Traceback (most recent call last): File "", line 1, in b.astype('S') UnicodeEncodeError: 'ascii' codec can't encode character '\xd5' in position 0: ordinal not in range(128) >>> c = b.view('S4').astype('S1').view('S3') >>> c array([b'\xd5sc', b'zxc'], dtype='|S3') >>> c[0].decode('latin1') '?sc' -------- The original numpy py3 conversion used latin-1 as default (It's still used in statsmodels, and I haven't looked at the structure under the common py2-3 codebase) if sys.version_info[0] >= 3: import io bytes = bytes unicode = str asunicode = str def asbytes(s): if isinstance(s, bytes): return s return s.encode('latin1') def asstr(s): if isinstance(s, str): return s return s.decode('latin1') -------------- Josef > > Josef > >> >> The analogous behaviour was very deliberately removed from Python 3: >> >>>>> a[0] == 'qwe' >> False >>>>> a[0] == b'qwe' >> True >> >> >> Oscar >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion From oscar.j.benjamin at gmail.com Thu Jan 23 11:43:09 2014 From: oscar.j.benjamin at gmail.com (Oscar Benjamin) Date: Thu, 23 Jan 2014 16:43:09 +0000 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: References: <20140122104646.GA2555@gmail.com> <20140122211328.GA1938@gmail.com> <7210200738621770529@unknownmsgid> <20140123104520.GA2300@gmail.com> Message-ID: <20140123164305.GA5688@gmail.com> On Thu, Jan 23, 2014 at 11:23:09AM -0500, josef.pktd at gmail.com wrote: > > another curious example, encode utf-8 to latin-1 bytes > > >>> b > array(['?sc', 'zxc'], > dtype=' >>> b[0].encode('utf8') > b'\xc3\x95sc' > >>> b[0].encode('latin1') > b'\xd5sc' > >>> b.astype('S') > Traceback (most recent call last): > File "", line 1, in > b.astype('S') > UnicodeEncodeError: 'ascii' codec can't encode character '\xd5' in > position 0: ordinal not in range(128) > >>> c = b.view('S4').astype('S1').view('S3') > >>> c > array([b'\xd5sc', b'zxc'], > dtype='|S3') > >>> c[0].decode('latin1') > '?sc' Okay, so it seems that .view() implicitly uses latin-1 whereas .astype() uses ascii: >>> np.array(['?sc']).astype('S4') Traceback (most recent call last): File "", line 1, in UnicodeEncodeError: 'ascii' codec can't encode character '\xd5' in position 0: ordinal not in range(128) >>> np.array(['?sc']).view('S4') array([b'\xd5', b's', b'c'], dtype='|S4') > -------- > The original numpy py3 conversion used latin-1 as default > (It's still used in statsmodels, and I haven't looked at the structure > under the common py2-3 codebase) > > if sys.version_info[0] >= 3: > import io > 
bytes = bytes > unicode = str > asunicode = str These two functions are an abomination: > def asbytes(s): > if isinstance(s, bytes): > return s > return s.encode('latin1') > def asstr(s): > if isinstance(s, str): > return s > return s.decode('latin1') Oscar From josef.pktd at gmail.com Thu Jan 23 11:58:38 2014 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 23 Jan 2014 11:58:38 -0500 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: <20140123164305.GA5688@gmail.com> References: <20140122104646.GA2555@gmail.com> <20140122211328.GA1938@gmail.com> <7210200738621770529@unknownmsgid> <20140123104520.GA2300@gmail.com> <20140123164305.GA5688@gmail.com> Message-ID: On Thu, Jan 23, 2014 at 11:43 AM, Oscar Benjamin wrote: > On Thu, Jan 23, 2014 at 11:23:09AM -0500, josef.pktd at gmail.com wrote: >> >> another curious example, encode utf-8 to latin-1 bytes >> >> >>> b >> array(['?sc', 'zxc'], >> dtype='> >>> b[0].encode('utf8') >> b'\xc3\x95sc' >> >>> b[0].encode('latin1') >> b'\xd5sc' >> >>> b.astype('S') >> Traceback (most recent call last): >> File "", line 1, in >> b.astype('S') >> UnicodeEncodeError: 'ascii' codec can't encode character '\xd5' in >> position 0: ordinal not in range(128) >> >>> c = b.view('S4').astype('S1').view('S3') >> >>> c >> array([b'\xd5sc', b'zxc'], >> dtype='|S3') >> >>> c[0].decode('latin1') >> '?sc' > > Okay, so it seems that .view() implicitly uses latin-1 whereas .astype() uses > ascii: > >>>> np.array(['?sc']).astype('S4') > Traceback (most recent call last): > File "", line 1, in > UnicodeEncodeError: 'ascii' codec can't encode character '\xd5' in position 0: ordinal not in range(128) >>>> np.array(['?sc']).view('S4') > array([b'\xd5', b's', b'c'], > dtype='|S4') No, a view doesn't change the memory, it just changes the interpretation and there shouldn't be any conversion involved. astype does type conversion, but it goes through ascii encoding which fails. >>> b = np.array(['?sc', 'zxc'], dtype='>> b.tostring() b'\xd5\x00\x00\x00s\x00\x00\x00c\x00\x00\x00z\x00\x00\x00x\x00\x00\x00c\x00\x00\x00' >>> b.view('S12') array([b'\xd5\x00\x00\x00s\x00\x00\x00c', b'z\x00\x00\x00x\x00\x00\x00c'], dtype='|S12') The conversion happens somewhere in the array creation, but I have no idea about the memory encoding for uc2 and the low level layouts. 
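(As a sketch of that low level layout: 'U' stores one 32-bit code point per character in native byte order, so on a little-endian machine the raw buffer is exactly the UTF-32-LE encoding of the text.)

import numpy as np

b = np.array([chr(0xd5) + 'sc', 'zxc'], dtype='U3')
b.view(np.uint32)    # array([213, 115, 99, 122, 120, 99], dtype=uint32) -- the code points
b[:1].tostring() == b[0].encode('utf-32-le')    # True on a little-endian machine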
Josef > >> -------- >> The original numpy py3 conversion used latin-1 as default >> (It's still used in statsmodels, and I haven't looked at the structure >> under the common py2-3 codebase) >> >> if sys.version_info[0] >= 3: >> import io >> bytes = bytes >> unicode = str >> asunicode = str > > These two functions are an abomination: > >> def asbytes(s): >> if isinstance(s, bytes): >> return s >> return s.encode('latin1') >> def asstr(s): >> if isinstance(s, str): >> return s >> return s.decode('latin1') > > > Oscar > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From josef.pktd at gmail.com Thu Jan 23 12:13:55 2014 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 23 Jan 2014 12:13:55 -0500 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: References: <20140122104646.GA2555@gmail.com> <20140122211328.GA1938@gmail.com> <7210200738621770529@unknownmsgid> <20140123104520.GA2300@gmail.com> <20140123164305.GA5688@gmail.com> Message-ID: On Thu, Jan 23, 2014 at 11:58 AM, wrote: > On Thu, Jan 23, 2014 at 11:43 AM, Oscar Benjamin > wrote: >> On Thu, Jan 23, 2014 at 11:23:09AM -0500, josef.pktd at gmail.com wrote: >>> >>> another curious example, encode utf-8 to latin-1 bytes >>> >>> >>> b >>> array(['?sc', 'zxc'], >>> dtype='>> >>> b[0].encode('utf8') >>> b'\xc3\x95sc' >>> >>> b[0].encode('latin1') >>> b'\xd5sc' >>> >>> b.astype('S') >>> Traceback (most recent call last): >>> File "", line 1, in >>> b.astype('S') >>> UnicodeEncodeError: 'ascii' codec can't encode character '\xd5' in >>> position 0: ordinal not in range(128) >>> >>> c = b.view('S4').astype('S1').view('S3') >>> >>> c >>> array([b'\xd5sc', b'zxc'], >>> dtype='|S3') >>> >>> c[0].decode('latin1') >>> '?sc' >> >> Okay, so it seems that .view() implicitly uses latin-1 whereas .astype() uses >> ascii: >> >>>>> np.array(['?sc']).astype('S4') >> Traceback (most recent call last): >> File "", line 1, in >> UnicodeEncodeError: 'ascii' codec can't encode character '\xd5' in position 0: ordinal not in range(128) >>>>> np.array(['?sc']).view('S4') >> array([b'\xd5', b's', b'c'], >> dtype='|S4') > > > No, a view doesn't change the memory, it just changes the > interpretation and there shouldn't be any conversion involved. > astype does type conversion, but it goes through ascii encoding which fails. > >>>> b = np.array(['?sc', 'zxc'], dtype='>>> b.tostring() > b'\xd5\x00\x00\x00s\x00\x00\x00c\x00\x00\x00z\x00\x00\x00x\x00\x00\x00c\x00\x00\x00' >>>> b.view('S12') > array([b'\xd5\x00\x00\x00s\x00\x00\x00c', b'z\x00\x00\x00x\x00\x00\x00c'], > dtype='|S12') > > The conversion happens somewhere in the array creation, but I have no > idea about the memory encoding for uc2 and the low level layouts. 
utf8 encoded bytes >>> a = np.array(['?sc'.encode('utf8'), 'zxc'], dtype='S') >>> a array([b'\xc3\x95sc', b'zxc'], dtype='|S4') >>> a.tostring() b'\xc3\x95sczxc\x00' >>> a.view('S8') array([b'\xc3\x95sczxc'], dtype='|S8') >>> a[0].decode('latin1') '?\x95sc' >>> a[0].decode('utf8') '?sc' Josef > > Josef > >> >>> -------- >>> The original numpy py3 conversion used latin-1 as default >>> (It's still used in statsmodels, and I haven't looked at the structure >>> under the common py2-3 codebase) >>> >>> if sys.version_info[0] >= 3: >>> import io >>> bytes = bytes >>> unicode = str >>> asunicode = str >> >> These two functions are an abomination: >> >>> def asbytes(s): >>> if isinstance(s, bytes): >>> return s >>> return s.encode('latin1') >>> def asstr(s): >>> if isinstance(s, str): >>> return s >>> return s.decode('latin1') >> >> >> Oscar >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion From josef.pktd at gmail.com Thu Jan 23 12:42:13 2014 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 23 Jan 2014 12:42:13 -0500 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: References: <20140122104646.GA2555@gmail.com> <20140122211328.GA1938@gmail.com> <7210200738621770529@unknownmsgid> <20140123104520.GA2300@gmail.com> <20140123164305.GA5688@gmail.com> Message-ID: On Thu, Jan 23, 2014 at 12:13 PM, wrote: > On Thu, Jan 23, 2014 at 11:58 AM, wrote: >> On Thu, Jan 23, 2014 at 11:43 AM, Oscar Benjamin >> wrote: >>> On Thu, Jan 23, 2014 at 11:23:09AM -0500, josef.pktd at gmail.com wrote: >>>> >>>> another curious example, encode utf-8 to latin-1 bytes >>>> >>>> >>> b >>>> array(['?sc', 'zxc'], >>>> dtype='>>> >>> b[0].encode('utf8') >>>> b'\xc3\x95sc' >>>> >>> b[0].encode('latin1') >>>> b'\xd5sc' >>>> >>> b.astype('S') >>>> Traceback (most recent call last): >>>> File "", line 1, in >>>> b.astype('S') >>>> UnicodeEncodeError: 'ascii' codec can't encode character '\xd5' in >>>> position 0: ordinal not in range(128) >>>> >>> c = b.view('S4').astype('S1').view('S3') >>>> >>> c >>>> array([b'\xd5sc', b'zxc'], >>>> dtype='|S3') >>>> >>> c[0].decode('latin1') >>>> '?sc' >>> >>> Okay, so it seems that .view() implicitly uses latin-1 whereas .astype() uses >>> ascii: >>> >>>>>> np.array(['?sc']).astype('S4') >>> Traceback (most recent call last): >>> File "", line 1, in >>> UnicodeEncodeError: 'ascii' codec can't encode character '\xd5' in position 0: ordinal not in range(128) >>>>>> np.array(['?sc']).view('S4') >>> array([b'\xd5', b's', b'c'], >>> dtype='|S4') >> >> >> No, a view doesn't change the memory, it just changes the >> interpretation and there shouldn't be any conversion involved. >> astype does type conversion, but it goes through ascii encoding which fails. >> >>>>> b = np.array(['?sc', 'zxc'], dtype='>>>> b.tostring() >> b'\xd5\x00\x00\x00s\x00\x00\x00c\x00\x00\x00z\x00\x00\x00x\x00\x00\x00c\x00\x00\x00' >>>>> b.view('S12') >> array([b'\xd5\x00\x00\x00s\x00\x00\x00c', b'z\x00\x00\x00x\x00\x00\x00c'], >> dtype='|S12') >> >> The conversion happens somewhere in the array creation, but I have no >> idea about the memory encoding for uc2 and the low level layouts. >>> b = np.array(['?sc', 'zxc'], dtype='>> b[0].tostring() b'\xd5\x00\x00\x00s\x00\x00\x00c\x00\x00\x00' >>> '?sc'.encode('utf-32LE') b'\xd5\x00\x00\x00s\x00\x00\x00c\x00\x00\x00' Is that the encoding for 'U' ? 
--- another sideeffect of null truncation: cannot decode truncated data >>> b.view('S4').tostring() b'\xd5\x00\x00\x00s\x00\x00\x00c\x00\x00\x00z\x00\x00\x00x\x00\x00\x00c\x00\x00\x00' >>> b.view('S4')[0] b'\xd5' >>> b.view('S4')[0].tostring() b'\xd5' >>> b.view('S4')[:1].tostring() b'\xd5\x00\x00\x00' >>> b.view('S4')[0].decode('utf-32LE') Traceback (most recent call last): File "", line 1, in b.view('S4')[0].decode('utf-32LE') File "C:\Programs\Python33\lib\encodings\utf_32_le.py", line 11, in decode return codecs.utf_32_le_decode(input, errors, True) UnicodeDecodeError: 'utf32' codec can't decode byte 0xd5 in position 0: truncated data >>> b.view('S4')[:1].tostring().decode('utf-32LE') '?' numpy arrays need a decode and encode method Josef > > utf8 encoded bytes > >>>> a = np.array(['?sc'.encode('utf8'), 'zxc'], dtype='S') >>>> a > array([b'\xc3\x95sc', b'zxc'], > dtype='|S4') >>>> a.tostring() > b'\xc3\x95sczxc\x00' >>>> a.view('S8') > array([b'\xc3\x95sczxc'], > dtype='|S8') > >>>> a[0].decode('latin1') > '?\x95sc' >>>> a[0].decode('utf8') > '?sc' > > Josef > >> >> Josef >> >>> >>>> -------- >>>> The original numpy py3 conversion used latin-1 as default >>>> (It's still used in statsmodels, and I haven't looked at the structure >>>> under the common py2-3 codebase) >>>> >>>> if sys.version_info[0] >= 3: >>>> import io >>>> bytes = bytes >>>> unicode = str >>>> asunicode = str >>> >>> These two functions are an abomination: >>> >>>> def asbytes(s): >>>> if isinstance(s, bytes): >>>> return s >>>> return s.encode('latin1') >>>> def asstr(s): >>>> if isinstance(s, str): >>>> return s >>>> return s.decode('latin1') >>> >>> >>> Oscar >>> _______________________________________________ >>> NumPy-Discussion mailing list >>> NumPy-Discussion at scipy.org >>> http://mail.scipy.org/mailman/listinfo/numpy-discussion From josef.pktd at gmail.com Thu Jan 23 12:59:23 2014 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 23 Jan 2014 12:59:23 -0500 Subject: [Numpy-discussion] cannot decode 'S' Message-ID: truncating null bytes in 'S' breaks decoding that needs them >>> a = np.array([si.encode('utf-16LE') for si in ['?sc', 'zxc']], dtype='S') >>> a array([b'\xd5\x00s\x00c', b'z\x00x\x00c'], dtype='|S6') >>> [ai.decode('utf-16LE') for ai in a] Traceback (most recent call last): File "", line 1, in [ai.decode('utf-16LE') for ai in a] File "", line 1, in [ai.decode('utf-16LE') for ai in a] File "C:\Programs\Python33\lib\encodings\utf_16_le.py", line 16, in decode return codecs.utf_16_le_decode(input, errors, True) UnicodeDecodeError: 'utf16' codec can't decode byte 0x63 in position 4: truncated data messy workaround (arrays in contrast to scalars are not truncated in `tostring`) >>> [a[i:i+1].tostring().decode('utf-16LE') for i in range(len(a))] ['?sc', 'zxc'] Found while playing with examples in the other thread. 
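A slightly tidier variant of that workaround, as a sketch (assuming a contiguous 1-d array whose elements were all encoded with the same codec): take the full buffer once, split it on the itemsize, and strip the padding after decoding rather than before.

import numpy as np

a = np.array([s.encode('utf-16-le') for s in ('abc', 'zxc')], dtype='S')
width = a.dtype.itemsize
buf = a.tostring()    # the whole buffer, padding nulls included
[buf[i:i + width].decode('utf-16-le').rstrip('\0')
 for i in range(0, len(buf), width)]    # ['abc', 'zxc']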
Josef From oscar.j.benjamin at gmail.com Thu Jan 23 13:36:57 2014 From: oscar.j.benjamin at gmail.com (Oscar Benjamin) Date: Thu, 23 Jan 2014 18:36:57 +0000 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: References: <20140122104646.GA2555@gmail.com> <20140122211328.GA1938@gmail.com> <7210200738621770529@unknownmsgid> <20140123104520.GA2300@gmail.com> <20140123164305.GA5688@gmail.com> Message-ID: On 23 January 2014 17:42, wrote: > On Thu, Jan 23, 2014 at 12:13 PM, wrote: >> On Thu, Jan 23, 2014 at 11:58 AM, wrote: >>> >>> No, a view doesn't change the memory, it just changes the >>> interpretation and there shouldn't be any conversion involved. >>> astype does type conversion, but it goes through ascii encoding which fails. >>> >>>>>> b = np.array(['?sc', 'zxc'], dtype='>>>>> b.tostring() >>> b'\xd5\x00\x00\x00s\x00\x00\x00c\x00\x00\x00z\x00\x00\x00x\x00\x00\x00c\x00\x00\x00' >>>>>> b.view('S12') >>> array([b'\xd5\x00\x00\x00s\x00\x00\x00c', b'z\x00\x00\x00x\x00\x00\x00c'], >>> dtype='|S12') >>> >>> The conversion happens somewhere in the array creation, but I have no >>> idea about the memory encoding for uc2 and the low level layouts. > >>>> b = np.array(['?sc', 'zxc'], dtype='>>> b[0].tostring() > b'\xd5\x00\x00\x00s\x00\x00\x00c\x00\x00\x00' >>>> '?sc'.encode('utf-32LE') > b'\xd5\x00\x00\x00s\x00\x00\x00c\x00\x00\x00' > > Is that the encoding for 'U' ? On a little-endian system, yes. I realise what' happening now. 'U' represents unicode characters as a 32-bit unsigned integer giving the code point of the character. The first 256 code points are exactly the 256 characters representable with latin-1 in the same order. So '?' has the code point 0xd5 and is encoded as the byte 0xd5 in latin-1. As a 32 bit integer the code point is 0x000000d5 but in little-endian format that becomes the 4 bytes 0xd5,0x00,0x00,0x00. So when you reinterpret that as 'S4' it strips the remaining nulls to get the byte string b'\xd5'. Which is the latin-1 encoding for the character. The same will happen for any string of latin-1 characters. However if you do have a code point of 256 or greater then you'll get a byte strings of length 2 or more. On a big-endian system I think you'd get b'\x00\x00\x00\xd5'. > another sideeffect of null truncation: cannot decode truncated data > >>>> b.view('S4').tostring() > b'\xd5\x00\x00\x00s\x00\x00\x00c\x00\x00\x00z\x00\x00\x00x\x00\x00\x00c\x00\x00\x00' >>>> b.view('S4')[0] > b'\xd5' >>>> b.view('S4')[0].tostring() > b'\xd5' >>>> b.view('S4')[:1].tostring() > b'\xd5\x00\x00\x00' > >>>> b.view('S4')[0].decode('utf-32LE') > Traceback (most recent call last): > File "", line 1, in > b.view('S4')[0].decode('utf-32LE') > File "C:\Programs\Python33\lib\encodings\utf_32_le.py", line 11, in decode > return codecs.utf_32_le_decode(input, errors, True) > UnicodeDecodeError: 'utf32' codec can't decode byte 0xd5 in position > 0: truncated data > >>>> b.view('S4')[:1].tostring().decode('utf-32LE') > '?' > > numpy arrays need a decode and encode method I'm not sure that they do. Rather there needs to be a text dtype that knows what encoding to use in order to have a binary interface as exposed by .tostring() and friends and but produce unicode strings when indexed from Python code. Having both a text and a binary interface to the same data implies having an encoding. 
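One rough sketch of that pairing at the Python level, with hypothetical helper names (encode_fixed / decode_fixed are not numpy functions) and utf-8 assumed; it leans on the fact that utf-8 never produces interior null bytes, so the trailing-null stripping of 'S' only ever removes padding:

import numpy as np

def encode_fixed(strings, width, encoding='utf-8'):
    # text in, fixed-width bytes out; refuse to truncate
    data = [s.encode(encoding) for s in strings]
    if max(len(b) for b in data) > width:
        raise ValueError('encoded string longer than %d bytes' % width)
    return np.array(data, dtype='S%d' % width)

def decode_fixed(arr, encoding='utf-8'):
    # bytes in, unicode strings out; safe only for encodings with no
    # interior null bytes (utf-8, latin-1), unlike utf-16/utf-32
    return np.array([x.decode(encoding) for x in arr], dtype='U')

decode_fixed(encode_fixed(['qwe', chr(0xd5) + 'sc'], 8))   # back to a 'U' array of the original text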
Oscar From chris.barker at noaa.gov Thu Jan 23 13:49:42 2014 From: chris.barker at noaa.gov (Chris Barker) Date: Thu, 23 Jan 2014 10:49:42 -0800 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: References: <20140122104646.GA2555@gmail.com> <20140122211328.GA1938@gmail.com> <7210200738621770529@unknownmsgid> <20140123104520.GA2300@gmail.com> <20140123164305.GA5688@gmail.com> Message-ID: Thanks for poking into this all. I've lost track a bit, but I think: The 'S' type is clearly broken on py3 (at least). I think that gives us room to change it, and backward compatibly is less of an issue because it's broken already -- do we need to preserve bug-for-bug compatibility? Maybe, but I suspect in this case, not -- the code the "works fine" on py3 with the 'S' type is probably only lucky that it hasn't encountered the issues yet. And no matter how you slice it, code being ported to py3 needs to deal with text handling issues. But here is where we stand: The 'S' dtype: - was designed for one-byte-per-char text data. - was mapped to the py2 string type. - used the classic C null-terminated approach. - can be used for arbitrary bytes (as the py2 string type can), but not quite, as it truncates null bytes -- so it really a bad idea to use it that way. Under py3: The 'S' type maps to the py3 bytes type, because that's the closest to the py2 string type. But it also does some inconsistent things with encoding, and does treat a lot of other things as text. But the py3 bytes type does not have the same text handling as the py2 string type, so things like: s = 'a string' np.array((s,), dtype='S')[0] == s Gives you False, rather than True on py2. This is because a py3 string is translated to the 'S' type (presumable with the default encoding, another maybe not a good idea, but returns a bytes object, which does not compare true to a py3 string. YOu can work aroudn this with varios calls to encode() and decode, and/or using b'a string', but that is ugly, kludgy, and doesn't work well with the py3 text model. The py2 => py3 transition separated bytes and strings: strings are unicode, and bytes are not to be used for text (directly). While there is some text-related functionality still in bytes, the core devs are quite clear that that is for special cases only, and not for general text processing. I don't think numpy should fight this, but rather embrace the py3 text model. The most natural way to do that is to use the existing 'U' dtype for text. Really the best solution for most cases. (Like the above case) However, there is a use case for a more efficient way to deal with text. There are a couple ways to go about that that have been brought up here: 1: have a more efficient unicode dtype: variable length, multiple encoding options, etc.... - This is a fine idea that would support better text handling in numpy, and _maybe_ better interaction with external libraries (HDF, etc...) 2: Have a one-byte-per-char text dtype: - This would be much easier to implement fit into the current numpy model, and satisfy a lot of common use cases for scientific data sets. We could certainly do both, but I'd like to see (2) get done sooner than later.... A related issue is whether numpy needs a dtype analogous to py3 bytes -- I'm still not sure of the use-case there, so can't comment -- would it need to be fixed length (fitting into the numpy data model better) or variable length, or ??? 
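For what it's worth, the closest existing spelling of "a fixed 15-byte raw element" seems to be a uint8 subarray inside a structured dtype, sketched below; unlike the bare np.dtype(('B', 15)) tried earlier, which broadcasts the 15 into an extra array dimension, this keeps a shape (5,) array with one 15-byte record per element.

import numpy as np

rec = np.dtype([('raw', np.uint8, 15)])
arr = np.zeros(5, dtype=rec)
arr.shape        # (5,)
arr[0]['raw']    # fifteen uint8 values: no null stripping, no text semantics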
Some folks are (apparently) using the current 'S' type in this way, but I think that's ripe for errors, due to the null bytes issue. Though maybe there is a null-bytes-are-special binary format that isn't text -- I have no idea. So what do we do with 'S'? It really is pretty broken, so we have a couple choices: (1) depricate it, so that it stays around for backward compatibility but encourage people to either use 'U' for text, or one of the new dtypes that are yet to be implemented (maybe 's' for a one-byte-per-char dtype), and use either uint8 or the new bytes dtype that is yet to be implemented. (2) fix it -- in this case, I think we need to be clear what it is: -- A one-byte-char-text type? If so, it should map to a py3 string, and have a defined encoding (ascii or latin-1, probably), or even better a settable encoding (but only for one-byte-per-char encodings -- I don't think utf-8 is a good idea here, as a utf-8 encoded string is of unknown length. (there is some room for debate here, as the 'S' type is fixed length and truncates anyway, maybe it's fine for it to truncate utf-8 -- as long as it doesn't partially truncate in teh middle of a charactor) -- a bytes type? in which case, we should clean out all teh automatic conversion to-from text that iare in it now. I vote for it being our one-byte text type -- it almost is already, and it would make the easiest transition for folks from py2 to py3. But backward compatibility is backward compatibility. > numpy arrays need a decode and encode method I'm not sure that they do. Rather there needs to be a text dtype that > knows what encoding to use in order to have a binary interface as > exposed by .tostring() and friends and but produce unicode strings > when indexed from Python code. Having both a text and a binary > interface to the same data implies having an encoding. I agree with Oscar here -- let's not conflate encode and decoded data -- the py3 text model is a fine one, we should work with it as much as practical. UNLESS: if we do add a bytes dtype, then it would be a reasonable use case to use it to store encoded text (just like the py3 bytes types), in which case it would be good to have encode() and decode() methods or ufuncs -- probably ufuncs. But that should be for special purpose, at the I/O interface kind of stuff. -Chris -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov -------------- next part -------------- An HTML attachment was scrubbed... URL: From josef.pktd at gmail.com Thu Jan 23 14:18:20 2014 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 23 Jan 2014 14:18:20 -0500 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: References: <20140122104646.GA2555@gmail.com> <20140122211328.GA1938@gmail.com> <7210200738621770529@unknownmsgid> <20140123104520.GA2300@gmail.com> <20140123164305.GA5688@gmail.com> Message-ID: On Thu, Jan 23, 2014 at 1:49 PM, Chris Barker wrote: > > s = 'a string' > np.array((s,), dtype='S')[0] == s > > Gives you False, rather than True on py2. This is because a py3 string is > translated to the 'S' type (presumable with the default encoding, another > maybe not a good idea, but returns a bytes object, which does not compare > true to a py3 string. 
YOu can work aroudn this with varios calls to encode() > and decode, and/or using b'a string', but that is ugly, kludgy, and doesn't > work well with the py3 text model. I think this is just inconsistent casting rules in numpy, numpy should either refuse to assign the wrong type, instead of using the repr as in some of the earlier examples of Oscar >>> s = np.inf >>> np.array((s,), dtype=int)[0] == s Traceback (most recent call last): File "", line 1, in np.array((s,), dtype=int)[0] == s OverflowError: cannot convert float infinity to integer or use the **same** conversion/casting rules also during the interaction with python as are used in assignments and array creation. Josef From chris.barker at noaa.gov Thu Jan 23 14:40:43 2014 From: chris.barker at noaa.gov (Chris Barker) Date: Thu, 23 Jan 2014 11:40:43 -0800 Subject: [Numpy-discussion] cannot decode 'S' In-Reply-To: References: Message-ID: Josef, Nice find -- another reason why 'S' can NOT be used a-is for arbitrary bytes. See the other thread for my proposals about that. > messy workaround (arrays in contrast to scalars are not truncated in > `tostring`) > > >>> [a[i:i+1].tostring().decode('utf-16LE') for i in range(len(a))] > ['?sc', 'zxc'] > > I think the real "work around" is to not try to store arbitrary bytes -- i.e. encoded text, in the 'S' dtype. But is there a convenient way to do it with other existing numpy types? I tried to do it with uint8, and it's really awkward.... -CHB -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov -------------- next part -------------- An HTML attachment was scrubbed... URL: From chris.barker at noaa.gov Thu Jan 23 14:45:40 2014 From: chris.barker at noaa.gov (Chris Barker) Date: Thu, 23 Jan 2014 11:45:40 -0800 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: References: <20140122104646.GA2555@gmail.com> <20140122211328.GA1938@gmail.com> <7210200738621770529@unknownmsgid> <20140123104520.GA2300@gmail.com> <20140123164305.GA5688@gmail.com> Message-ID: On Thu, Jan 23, 2014 at 11:18 AM, wrote: > I think this is just inconsistent casting rules in numpy, > > numpy should either refuse to assign the wrong type, instead of using > the repr as in some of the earlier examples of Oscar > > >>> s = np.inf > >>> np.array((s,), dtype=int)[0] == s > Traceback (most recent call last): > File "", line 1, in > np.array((s,), dtype=int)[0] == s > OverflowError: cannot convert float infinity to integer > > or use the **same** conversion/casting rules also during the > interaction with python as are used in assignments and array creation. > Exactly -- but what should those conversion/casting rules be? We can't decide that unless we decide if 'S' is for text or for arbitrary bytes -- it can't be both. I say text, that's what it's mostly trying to do already. But if it's bytes, fine, then some things still need cleaning up, and we could really use a one-byte-text type. and if it's text, then we may need a bytes dtype. Key here is that we don't have the option of not breaking anything, because there is a lot already broken. -Chris -- Christopher Barker, Ph.D. 
Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov -------------- next part -------------- An HTML attachment was scrubbed... URL: From josef.pktd at gmail.com Thu Jan 23 14:49:17 2014 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 23 Jan 2014 14:49:17 -0500 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: References: <20140122104646.GA2555@gmail.com> <20140122211328.GA1938@gmail.com> <7210200738621770529@unknownmsgid> <20140123104520.GA2300@gmail.com> <20140123164305.GA5688@gmail.com> Message-ID: >> > numpy arrays need a decode and encode method > > >> I'm not sure that they do. Rather there needs to be a text dtype that >> knows what encoding to use in order to have a binary interface as >> exposed by .tostring() and friends and but produce unicode strings >> when indexed from Python code. Having both a text and a binary >> interface to the same data implies having an encoding. > > > I agree with Oscar here -- let's not conflate encode and decoded data -- > the py3 text model is a fine one, we should work with it as much as > practical. > > UNLESS: if we do add a bytes dtype, then it would be a reasonable use case > to use it to store encoded text (just like the py3 bytes types), in which > case it would be good to have encode() and decode() methods or ufuncs -- > probably ufuncs. But that should be for special purpose, at the I/O > interface kind of stuff. > I think we need both things changing the memory and changing the view. The same way we can convert between int and float and complex (trunc, astype, real, ...) we should be able to convert between bytes and any string (text) dtypes, i.e. decode and encode. I'm reading a file in binary and then want to convert it to unicode, only I realize I have only ascii and want to convert to something less memory hungry. views don't care about what the content means, it just has to be memory compatible, I can view anything as an 'S' or a 'uint' (I think). What we currently don't have is a string/text view on S that would interact with python as string. (that's a vote in favor of a minimal one char string dtype that would work for a limited number of encodings.) Josef From josef.pktd at gmail.com Thu Jan 23 15:10:34 2014 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 23 Jan 2014 15:10:34 -0500 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: References: <20140122104646.GA2555@gmail.com> <20140122211328.GA1938@gmail.com> <7210200738621770529@unknownmsgid> <20140123104520.GA2300@gmail.com> <20140123164305.GA5688@gmail.com> Message-ID: On Thu, Jan 23, 2014 at 2:45 PM, Chris Barker wrote: > On Thu, Jan 23, 2014 at 11:18 AM, wrote: > >> >> I think this is just inconsistent casting rules in numpy, >> >> numpy should either refuse to assign the wrong type, instead of using >> the repr as in some of the earlier examples of Oscar >> >> >>> s = np.inf >> >>> np.array((s,), dtype=int)[0] == s >> Traceback (most recent call last): >> File "", line 1, in >> np.array((s,), dtype=int)[0] == s >> OverflowError: cannot convert float infinity to integer >> >> or use the **same** conversion/casting rules also during the >> interaction with python as are used in assignments and array creation. > > > Exactly -- but what should those conversion/casting rules be? 
We can't > decide that unless we decide if 'S' is for text or for arbitrary bytes -- it > can't be both. I say text, that's what it's mostly trying to do already. But > if it's bytes, fine, then some things still need cleaning up, and we could > really use a one-byte-text type. and if it's text, then we may need a bytes > dtype. (remember I'm just a balcony muppet) As far as I understand all codecs have the same ascii part. So I would cast on ascii and raise on anything else. or follow whatever the convention of numpy is: >>> s = -256 >>> np.array((s,), dtype=np.uint8)[0] == s False >>> s = -1 >>> np.array((s,), dtype=np.uint8)[0] == s False Josef > > Key here is that we don't have the option of not breaking anything, because > there is a lot already broken. > > -Chris > > > -- > > Christopher Barker, Ph.D. > Oceanographer > > Emergency Response Division > NOAA/NOS/OR&R (206) 526-6959 voice > 7600 Sand Point Way NE (206) 526-6329 fax > Seattle, WA 98115 (206) 526-6317 main reception > > Chris.Barker at noaa.gov > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From josef.pktd at gmail.com Thu Jan 23 15:18:18 2014 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 23 Jan 2014 15:18:18 -0500 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: References: <20140122104646.GA2555@gmail.com> <20140122211328.GA1938@gmail.com> <7210200738621770529@unknownmsgid> <20140123104520.GA2300@gmail.com> <20140123164305.GA5688@gmail.com> Message-ID: On Thu, Jan 23, 2014 at 1:36 PM, Oscar Benjamin wrote: > On 23 January 2014 17:42, wrote: >> On Thu, Jan 23, 2014 at 12:13 PM, wrote: >>> On Thu, Jan 23, 2014 at 11:58 AM, wrote: >>>> >>>> No, a view doesn't change the memory, it just changes the >>>> interpretation and there shouldn't be any conversion involved. >>>> astype does type conversion, but it goes through ascii encoding which fails. >>>> >>>>>>> b = np.array(['?sc', 'zxc'], dtype='>>>>>> b.tostring() >>>> b'\xd5\x00\x00\x00s\x00\x00\x00c\x00\x00\x00z\x00\x00\x00x\x00\x00\x00c\x00\x00\x00' >>>>>>> b.view('S12') >>>> array([b'\xd5\x00\x00\x00s\x00\x00\x00c', b'z\x00\x00\x00x\x00\x00\x00c'], >>>> dtype='|S12') >>>> >>>> The conversion happens somewhere in the array creation, but I have no >>>> idea about the memory encoding for uc2 and the low level layouts. >> >>>>> b = np.array(['?sc', 'zxc'], dtype='>>>> b[0].tostring() >> b'\xd5\x00\x00\x00s\x00\x00\x00c\x00\x00\x00' >>>>> '?sc'.encode('utf-32LE') >> b'\xd5\x00\x00\x00s\x00\x00\x00c\x00\x00\x00' >> >> Is that the encoding for 'U' ? > > On a little-endian system, yes. I realise what' happening now. 'U' > represents unicode characters as a 32-bit unsigned integer giving the > code point of the character. The first 256 code points are exactly the > 256 characters representable with latin-1 in the same order. > > So '?' has the code point 0xd5 and is encoded as the byte 0xd5 in > latin-1. As a 32 bit integer the code point is 0x000000d5 but in > little-endian format that becomes the 4 bytes 0xd5,0x00,0x00,0x00. So > when you reinterpret that as 'S4' it strips the remaining nulls to get > the byte string b'\xd5'. Which is the latin-1 encoding for the > character. The same will happen for any string of latin-1 characters. > However if you do have a code point of 256 or greater then you'll get > a byte strings of length 2 or more. 
> > On a big-endian system I think you'd get b'\x00\x00\x00\xd5'. I curious consequence of this, if we have only 1 character elements: >>> a = np.array([si.encode('utf-16LE') for si in ['?', 'z']], dtype='S') >>> a32 = np.array([si.encode('utf-32LE') for si in ['?', 'z']], dtype='S') >>> a[0], a32[0] (b'\xd5', b'\xd5') >>> a[0] == a32[0] True >>> a32 = np.array([si.encode('utf-32BE') for si in ['?', 'z']], dtype='S') >>> a = np.array([si.encode('utf-16BE') for si in ['?', 'z']], dtype='S') >>> a[0], a32[0] (b'\x00\xd5', b'\x00\x00\x00\xd5') >>> a[0] == a32[0] False Josef > >> another sideeffect of null truncation: cannot decode truncated data >> >>>>> b.view('S4').tostring() >> b'\xd5\x00\x00\x00s\x00\x00\x00c\x00\x00\x00z\x00\x00\x00x\x00\x00\x00c\x00\x00\x00' >>>>> b.view('S4')[0] >> b'\xd5' >>>>> b.view('S4')[0].tostring() >> b'\xd5' >>>>> b.view('S4')[:1].tostring() >> b'\xd5\x00\x00\x00' >> >>>>> b.view('S4')[0].decode('utf-32LE') >> Traceback (most recent call last): >> File "", line 1, in >> b.view('S4')[0].decode('utf-32LE') >> File "C:\Programs\Python33\lib\encodings\utf_32_le.py", line 11, in decode >> return codecs.utf_32_le_decode(input, errors, True) >> UnicodeDecodeError: 'utf32' codec can't decode byte 0xd5 in position >> 0: truncated data >> >>>>> b.view('S4')[:1].tostring().decode('utf-32LE') >> '?' >> >> numpy arrays need a decode and encode method > > I'm not sure that they do. Rather there needs to be a text dtype that > knows what encoding to use in order to have a binary interface as > exposed by .tostring() and friends and but produce unicode strings > when indexed from Python code. Having both a text and a binary > interface to the same data implies having an encoding. > > > Oscar > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From oscar.j.benjamin at gmail.com Thu Jan 23 15:34:53 2014 From: oscar.j.benjamin at gmail.com (Oscar Benjamin) Date: Thu, 23 Jan 2014 20:34:53 +0000 Subject: [Numpy-discussion] Text array dtype for numpy Message-ID: There have been a few threads discussing the problems of how to do text with numpy arrays in Python 3. To make a slightly more concrete proposal, I've implemented a pure Python ndarray subclass that I believe can consistently handle text/bytes in Python 3. It is intended to be an illustration since I think that the real solution is a new dtype rather than an array subclass (so that it can be used in e.g. record arrays). The idea is that the array has an encoding. It stores strings as bytes. The bytes are encoded/decoded on insertion/access. Methods accessing the binary content of the array will see the encoded bytes. Methods accessing the elements of the array will see unicode strings. I believe it would not be as hard to implement as the proposals for variable length string arrays. The one caveat is that it will strip null characters from the end of any string. I'm not 100% that the byte stripping encoding function will always work but it will for all the encodings I know and it seems to work with all the encodings that Python has. The code is inline below and attached (in case there are encoding problems with this message!): Oscar #!/usr/bin/env python3 from numpy import ndarray, array class textarray(ndarray): '''ndarray for holding encoded text. This is for demonstration purposes only. The real proposal is to specify the encoding as a dtype rather than a subclass. Only works as a 1-d array. 
>>> a = textarray(['qwert', 'zxcvb'], encoding='ascii') >>> a textarray(['qwert', 'zxcvb'], dtype='|S5:ascii') >>> a[0] 'qwert' >>> a.tostring() b'qwertzxcvb' >>> a[0] = 'qwe' # shorter string >>> a[0] 'qwe' >>> a.tostring() b'qwe\\x00\\x00zxcvb' >>> a[0] = 'qwertyuiop' # longer string Traceback (most recent call last): ... ValueError: Encoded bytes don't fit >>> b = textarray(['?scar', 'qwe'], encoding='utf-8') >>> b textarray(['?scar', 'qwe'], dtype='|S6:utf-8') >>> b[0] '?scar' >>> b[0].encode('utf-8') b'\\xc3\\x95scar' >>> b.tostring() b'\\xc3\\x95scarqwe\\x00\\x00\\x00' >>> c = textarray(['qwe'], encoding='utf-32-le') >>> c textarray(['qwe'], dtype='|S12:utf-32-le') ''' def __new__(cls, strings, encoding='utf-8'): bytestrings = [s.encode(encoding) for s in strings] a = array(bytestrings, dtype='S').view(textarray) a.encoding = encoding return a def __repr__(self): slist = ', '.join(repr(self[n]) for n in range(len(self))) return "textarray([%s], \n dtype='|S%d:%s')"\ % (slist, self.itemsize, self.encoding) def __getitem__(self, index): bstring = ndarray.__getitem__(self, index) return self._decode(bstring) def __setitem__(self, index, string): bstring = string.encode(self.encoding) if len(bstring) > self.itemsize: raise ValueError("Encoded bytes don't fit") ndarray.__setitem__(self, index, bstring) def _decode(self, b): b = b + b'\0' * (4 - len(b) % 4) s = b.decode(self.encoding) for n, c in enumerate(reversed(s)): if c != '\0': return s[:len(s)-n] return s if __name__ == "__main__": import doctest doctest.testmod() -------------- next part -------------- A non-text attachment was scrubbed... Name: textarray.py Type: text/x-python Size: 2215 bytes Desc: not available URL: From chris.barker at noaa.gov Thu Jan 23 16:51:14 2014 From: chris.barker at noaa.gov (Chris Barker) Date: Thu, 23 Jan 2014 13:51:14 -0800 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: References: <20140122104646.GA2555@gmail.com> <20140122211328.GA1938@gmail.com> <7210200738621770529@unknownmsgid> <20140123104520.GA2300@gmail.com> <20140123164305.GA5688@gmail.com> Message-ID: On Thu, Jan 23, 2014 at 12:10 PM, wrote: > > Exactly -- but what should those conversion/casting rules be? We can't > > decide that unless we decide if 'S' is for text or for arbitrary bytes > -- it > > can't be both. I say text, that's what it's mostly trying to do already. > But > > if it's bytes, fine, then some things still need cleaning up, and we > could > > really use a one-byte-text type. and if it's text, then we may need a > bytes > > dtype. > > (remember I'm just a balcony muppet) > me too ;-) > As far as I understand all codecs have the same ascii part. nope -- certainly not multi-byte codecs. And one of the key points of utf-8 is that the ascii part is compatible -- none of teh other full-unicode encoding are. many of the one-byte-per-char ones do share the ascii part, but not all, or not completely. So I would > cast on ascii and raise on anything else. > still a fine option -- clearly defined and quite useful for scientific text. However, I would prefer latin-1 -- that way you might get garbage for the non-ascii parts, but it wouldn't raise an exception and it round-trips through encoding/decoding. And you would have a somewhat more useful subset -- including the latin-language character and symbols like the degree symbol, etc. 
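To make the trade-off concrete, here is a small plain-Python sketch (nothing numpy-specific is assumed, and the example bytes are made up): strict ascii raises on the first non-ascii byte, while latin-1 always succeeds and round-trips byte-for-byte, at the cost of possibly showing the wrong characters for data that was really in some other encoding.

    raw = b'temperature: 25\xb0C'   # 0xb0 is the degree sign in latin-1

    try:
        raw.decode('ascii')          # strict: refuses the non-ascii byte
    except UnicodeDecodeError as err:
        print('ascii failed:', err)

    text = raw.decode('latin-1')     # never raises: every byte maps to a code point
    print(text)                      # temperature: 25 followed by the degree sign and C
    print(text.encode('latin-1') == raw)   # True -- the round trip is lossless

If the bytes were really utf-8, the latin-1 text would look like mojibake, but encoding it back to latin-1 would still reproduce the original bytes exactly.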
> or follow whatever the convention of numpy is: > > >>> s = -256 > >>> np.array((s,), dtype=np.uint8)[0] == s > False > >>> s = -1 > >>> np.array((s,), dtype=np.uint8)[0] == s > False > I think text is distinct enough from numbers that we don't need to do that same thing -- and this is result of well-defined casting rules built into the compiler (and hardware?) for the numeric types. I dont hink we have either the standard or compiler support for text conversions like that. -CHB PS: this is interesting, on py2: In [176]: a = np.array((2222,), dtype='S') In [177]: a Out[177]: array(['2'], dtype='|S1') It converts it to a string, but only grabs the first character? (is it determining the size before converting to a string? and this: In [182]: a = np.array(2222, dtype='S') In [183]: a Out[183]: array('2222', dtype='|S24') 24 ? where did that come from? > > Josef > > > > > Key here is that we don't have the option of not breaking anything, > because > > there is a lot already broken. > > > > -Chris > > > > > > -- > > > > Christopher Barker, Ph.D. > > Oceanographer > > > > Emergency Response Division > > NOAA/NOS/OR&R (206) 526-6959 voice > > 7600 Sand Point Way NE (206) 526-6329 fax > > Seattle, WA 98115 (206) 526-6317 main reception > > > > Chris.Barker at noaa.gov > > > > _______________________________________________ > > NumPy-Discussion mailing list > > NumPy-Discussion at scipy.org > > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov -------------- next part -------------- An HTML attachment was scrubbed... URL: From jenny.stone125 at gmail.com Thu Jan 23 17:23:50 2014 From: jenny.stone125 at gmail.com (jennifer stone) Date: Fri, 24 Jan 2014 03:53:50 +0530 Subject: [Numpy-discussion] (no subject) Message-ID: Both scipy and numpy require GSOC > candidates to have a pull request accepted as part of the application > process. I'd suggest implementing a function not currently in scipy that > you think would be useful. That would also help in finding a mentor for the > summer. I'd also suggest getting familiar with cython. > > Chuck > Thanks a lot for the heads-up. I am yet to be familiarized with Cython and it indeed is playing a crucial role especially in the 'special' module > > I don't see you on github yet, are you there? If not, you should set up an > account to work in. See the developer guide > for some pointers. > > Chuck > I am present on github but the profile at present is just a mark of humble mistakes of a beginner to open-sourcing, The id is https://github.com/jennystone. I hope to build upon my profile. Jennifer -------------- next part -------------- An HTML attachment was scrubbed... URL: From jenny.stone125 at gmail.com Thu Jan 23 17:58:37 2014 From: jenny.stone125 at gmail.com (jennifer stone) Date: Fri, 24 Jan 2014 04:28:37 +0530 Subject: [Numpy-discussion] (no subject) Message-ID: Scipy doesn't have a function for the Laplace transform, it has only a > Laplace distribution in scipy.stats and a Laplace filter in scipy.ndimage. 
> An inverse Laplace transform would be very welcome I'd think - it has real > world applications, and there's no good implementation in any open source > library as far as I can tell. It's probably doable, but not the easiest > topic for a GSoC I think. From what I can find, the paper "Numerical > Transform Inversion Using Gaussian Quadrature" from den Iseger contains > what's considered the current state of the art algorithm. Browsing that > gives a reasonable idea of the difficulty of implementing `ilaplace`. A brief scanning through the paper "Numerical Transform Inversion Using Gaussian Quadrature" from den Iseger does indicate the complexity of the algorithm. But GSoC project or not, can't we work on it, step by step? As I would love to see a contender for Matlab's ilaplace on open source front!! > > You can have a look at https://github.com/scipy/scipy/pull/2908/files for > ideas. Most of the things that need improving or we really think we should > have in Scipy are listed there. Possible topics are not restricted to that > list though - it's more important that you pick something you're interested > in and have the required background and coding skills for. > Thanks a lot for the roadmap. Of the options provided, I found the 'Cython'ization of Cluster great. Would it be possible to do it as the Summer project if I spend the month learning Cython? Regards Janani > Cheers, > Ralf > > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From vincent at vincentdavis.net Thu Jan 23 18:26:21 2014 From: vincent at vincentdavis.net (Vincent Davis) Date: Thu, 23 Jan 2014 17:26:21 -0600 Subject: [Numpy-discussion] De Bruijn sequence Message-ID: I happen to be working with De Bruijn sequences. Is there any interest in this being part of numpy/scipy? https://gist.github.com/vincentdavis/8588879 Vincent Davis -------------- next part -------------- An HTML attachment was scrubbed... URL: From josef.pktd at gmail.com Thu Jan 23 18:56:36 2014 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 23 Jan 2014 18:56:36 -0500 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: References: <20140122104646.GA2555@gmail.com> <20140122211328.GA1938@gmail.com> <7210200738621770529@unknownmsgid> <20140123104520.GA2300@gmail.com> <20140123164305.GA5688@gmail.com> Message-ID: On Thu, Jan 23, 2014 at 4:51 PM, Chris Barker wrote: > On Thu, Jan 23, 2014 at 12:10 PM, wrote: >> >> > Exactly -- but what should those conversion/casting rules be? We can't >> > decide that unless we decide if 'S' is for text or for arbitrary bytes >> > -- it >> > can't be both. I say text, that's what it's mostly trying to do already. >> > But >> > if it's bytes, fine, then some things still need cleaning up, and we >> > could >> > really use a one-byte-text type. and if it's text, then we may need a >> > bytes >> > dtype. >> >> (remember I'm just a balcony muppet) > > > me too ;-) > > >> >> As far as I understand all codecs have the same ascii part. > > > nope -- certainly not multi-byte codecs. And one of the key points of utf-8 > is that the ascii part is compatible -- none of teh other full-unicode > encoding are. > > many of the one-byte-per-char ones do share the ascii part, but not all, or > not completely. > >> So I would >> cast on ascii and raise on anything else. > > > still a fine option -- clearly defined and quite useful for scientific text. 
> However, I would prefer latin-1 -- that way you might get garbage for the > non-ascii parts, but it wouldn't raise an exception and it round-trips > through encoding/decoding. And you would have a somewhat more useful subset > -- including the latin-language character and symbols like the degree > symbol, etc. I'm not sure anymore, after all these threads I think bytes should be bytes and strings should be strings >>> x = np.array(['hugo'], 'S') Traceback (most recent call last): File "", line 1, in x = np.array(['hugo'], float) ValueError: could not convert string to bytes: 'hugo' >>> x = np.array([b'hugo'], 'S') >>> but with support for textarrays as Oscars showed, to make it easy to convert between the 'S' and 'S:encoding' or use either view on the memory. I like the idea of an `encoding_view` on some 'S' bytes, and once we have a view like that there is no reason to pretend 'S' bytes are text. > >> >> or follow whatever the convention of numpy is: >> >> >>> s = -256 >> >>> np.array((s,), dtype=np.uint8)[0] == s >> False >> >>> s = -1 >> >>> np.array((s,), dtype=np.uint8)[0] == s >> False > > > I think text is distinct enough from numbers that we don't need to do that > same thing -- and this is result of well-defined casting rules built into > the compiler (and hardware?) for the numeric types. I dont hink we have > either the standard or compiler support for text conversions like that. > > -CHB > > PS: this is interesting, on py2: > > > In [176]: a = np.array((2222,), dtype='S') > > In [177]: a > Out[177]: > array(['2'], > dtype='|S1') > > It converts it to a string, but only grabs the first character? (is it > determining the size before converting to a string? I recently fixed a bug in statsmodels based on this. I don't know why the code worked before, I assume it used string integers instead of integers at some point when it was written > > and this: > > In [182]: a = np.array(2222, dtype='S') > > In [183]: a > Out[183]: > array('2222', > dtype='|S24') > > 24 ? where did that come from? No idea. Unless I missed something when I didn't pay attention, there never before was any discussion on the mailing list about bytes versus strings in python 3 in numpy (I don't follow numpy's "issues"). And I neither remember (m)any public complaints about the behavior of the 'S' type in strange cases. maybe I didn't pay attention because I didn't care, until we ran into the python 3 problems. maybe nobody else did either. Josef > > > > > > > > > > > >> >> >> Josef >> >> > >> > Key here is that we don't have the option of not breaking anything, >> > because >> > there is a lot already broken. >> > >> > -Chris >> > >> > >> > -- >> > >> > Christopher Barker, Ph.D. >> > Oceanographer >> > >> > Emergency Response Division >> > NOAA/NOS/OR&R (206) 526-6959 voice >> > 7600 Sand Point Way NE (206) 526-6329 fax >> > Seattle, WA 98115 (206) 526-6317 main reception >> > >> > Chris.Barker at noaa.gov >> > >> > _______________________________________________ >> > NumPy-Discussion mailing list >> > NumPy-Discussion at scipy.org >> > http://mail.scipy.org/mailman/listinfo/numpy-discussion >> > >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion > > > > > -- > > Christopher Barker, Ph.D. 
> Oceanographer > > Emergency Response Division > NOAA/NOS/OR&R (206) 526-6959 voice > 7600 Sand Point Way NE (206) 526-6329 fax > Seattle, WA 98115 (206) 526-6317 main reception > > Chris.Barker at noaa.gov > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From oscar.j.benjamin at gmail.com Thu Jan 23 19:02:26 2014 From: oscar.j.benjamin at gmail.com (Oscar Benjamin) Date: Fri, 24 Jan 2014 00:02:26 +0000 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: References: <20140122104646.GA2555@gmail.com> <20140122211328.GA1938@gmail.com> <7210200738621770529@unknownmsgid> <20140123104520.GA2300@gmail.com> <20140123164305.GA5688@gmail.com> Message-ID: On 23 January 2014 21:51, Chris Barker wrote: > > However, I would prefer latin-1 -- that way you might get garbage for the > non-ascii parts, but it wouldn't raise an exception and it round-trips > through encoding/decoding. And you would have a somewhat more useful subset > -- including the latin-language character and symbols like the degree > symbol, etc. Exceptions and error messages are a good thing! Garbage is not!!! :) Oscar From chris.barker at noaa.gov Thu Jan 23 20:09:28 2014 From: chris.barker at noaa.gov (Chris Barker) Date: Thu, 23 Jan 2014 17:09:28 -0800 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: References: <20140122104646.GA2555@gmail.com> <20140122211328.GA1938@gmail.com> <7210200738621770529@unknownmsgid> <20140123104520.GA2300@gmail.com> <20140123164305.GA5688@gmail.com> Message-ID: On Thu, Jan 23, 2014 at 4:02 PM, Oscar Benjamin wrote: > On 23 January 2014 21:51, Chris Barker wrote: > > > > However, I would prefer latin-1 -- that way you might get garbage for > the > > non-ascii parts, but it wouldn't raise an exception and it round-trips > > through encoding/decoding. And you would have a somewhat more useful > subset > > -- including the latin-language character and symbols like the degree > > symbol, etc. > > Exceptions and error messages are a good thing! Garbage is not!!! :) > in principle, I agree with you, but sometimes practicality beats purity. in py2 there is a lot of implicit encoding/decoding going on, using the system encoding. That is ascii on a lot of systems. The result is that there is a lot of code out there that folks have ported to use unicode, but missed a few corners. If that code is only tested with ascii, it all seems to be working, but then out in the wild someone puts another character in there and presto -- a crash. Also, there are places where the inability to encode results in silent message loss -- for instance, if an Exception is raised with a unicode message, it will get silently dropped when it comes time to display it on the terminal. I spent quite a while banging my head against that one recently when I tried to update some code to read unicode files. I would have been MUCH happier with a bit of garbage in the message than having it dropped (or raise an encoding error in the middle of the error...) I think this is a bad thing. The advantage of latin-1 is that while you might get something that doesn't print right, it won't crash, and it won't contaminate the data, so comparisons, etc., will still work. Kind of like using utf-8 in an old-style C char array -- you can still pass it around and compare it, even if the bytes don't mean what you think they do. -CHB -- Christopher Barker, Ph.D.
Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov -------------- next part -------------- An HTML attachment was scrubbed... URL: From chris.barker at noaa.gov Thu Jan 23 20:12:35 2014 From: chris.barker at noaa.gov (Chris Barker) Date: Thu, 23 Jan 2014 17:12:35 -0800 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: References: <20140122104646.GA2555@gmail.com> <20140122211328.GA1938@gmail.com> <7210200738621770529@unknownmsgid> <20140123104520.GA2300@gmail.com> <20140123164305.GA5688@gmail.com> Message-ID: On Thu, Jan 23, 2014 at 3:56 PM, wrote: > > I'm not sure anymore, after all these threads I think bytes should be > bytes and strings should be strings > exactly -- that's the py3 model, and I think we really soudl try to conform to it, it's really the only way to have a robust solution. > I like the idea of an `encoding_view` on some 'S' bytes, and once we > have a view like that there is no reason to pretend 'S' bytes are > text. right, then they are bytes, not text. period. I'm not sure if we should conflate encoded text and arbitrary bytes, but it does make sense to build encoded text on a bytes object. maybe I didn't pay attention because I didn't care, until we ran into > the python 3 problems. maybe nobody else did either. > yup -- I think this didn't get a whole lot of review or testing.... -Chris -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov -------------- next part -------------- An HTML attachment was scrubbed... URL: From oscar.j.benjamin at gmail.com Thu Jan 23 20:41:24 2014 From: oscar.j.benjamin at gmail.com (Oscar Benjamin) Date: Fri, 24 Jan 2014 01:41:24 +0000 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: References: <20140122104646.GA2555@gmail.com> <20140122211328.GA1938@gmail.com> <7210200738621770529@unknownmsgid> <20140123104520.GA2300@gmail.com> <20140123164305.GA5688@gmail.com> Message-ID: On 24 January 2014 01:09, Chris Barker wrote: > On Thu, Jan 23, 2014 at 4:02 PM, Oscar Benjamin > wrote: >> >> On 23 January 2014 21:51, Chris Barker wrote: >> > >> > However, I would prefer latin-1 -- that way you might get garbage for >> > the >> > non-ascii parts, but it wouldn't raise an exception and it round-trips >> > through encoding/decoding. And you would have a somewhat more useful >> > subset >> > -- including the latin-language character and symbols like the degree >> > symbol, etc. >> >> Exceptions and error messages are a good thing! Garbage is not!!! :) > > in principle, I agree with you, but sometime practicality beets purity. > > in py2 there is a lot of implicit encoding/decoding going on, using the > system encoding. That is ascii on a lot of systems. The result is that there > is a lot of code out there that folks have ported to use unicode, but missed > a few corners. If that code is only testes with ascii, it all seems o be > working but then out in the wild someone puts another character in there and > presto -- a crash. Precisely. The Py3 text model uses TypeErrors to warn early against this kind of thing. No longer do you have code that seems to work until the wrong character goes in. 
You get the error straight away when you try to mix bytes and text. You still have the option to silence those errors: it just needs to be done explicitly: >>> s = '?scar' >>> s.encode('ascii', errors='replace') b'?scar' > Also, there are places where the inability to encode makes silent message -- > for instance if an Exception is raised with a unicode message, it will get > silently dropped when it comes time to display on the terminal. I spent > quite a wile banging my head against that one recently when I tried to > update some code to read unicode files. I would have been MUCH happier with > a bit of garbage in the mesae than having it drop (or raise an encoding > error in the middle of the error...) Yeah, that's just a bug in CPython. I think it's fixed now but either way you're right: for the particular case of displaying error messages the interpreter should do whatever it takes to get some kind of error message out even if it's a bit garbled. I disagree that this should be the basis for ordinary data processing with numpy though. > I think this is a bad thing. > > The advantage of latin-1 is that while you might get something that doesn't > print right, it won't crash, and it won't contaminate the data, so > comparisons, etc, will still work. kind of like using utf-8 in an old-style > c char array -- you can still passi t around and copare it, even if the > bytes dont mean what you think they do. It round trips okay as long as you don't try to do anything else with the string. So does the textarray class I proposed in a new thread: If you just use fromfile and tofile it works fine for any input (except for trailing nulls) but if you try to decode invalid bytes it will throw errors. It wouldn't be hard to add configurable error-handling there either. Oscar From dineshbvadhia at hotmail.com Fri Jan 24 09:13:02 2014 From: dineshbvadhia at hotmail.com (Dinesh Vadhia) Date: Fri, 24 Jan 2014 06:13:02 -0800 Subject: [Numpy-discussion] vstack and hstack performance penalty Message-ID: When using vstack or hstack for large arrays, are there any performance penalties eg. takes longer time-wise or makes a copy of an array during operation ? -------------- next part -------------- An HTML attachment was scrubbed... URL: From sebastian at sipsolutions.net Fri Jan 24 09:58:26 2014 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Fri, 24 Jan 2014 15:58:26 +0100 Subject: [Numpy-discussion] vstack and hstack performance penalty In-Reply-To: References: Message-ID: <1390575506.7837.7.camel@sebastian-laptop> On Fri, 2014-01-24 at 06:13 -0800, Dinesh Vadhia wrote: > When using vstack or hstack for large arrays, are there any > performance penalties eg. takes longer time-wise or makes a copy of an > array during operation ? No, they all use concatenate. There are only constant overheads on top of the necessary data copying. Though performance may vary because of memory order, etc. - Sebastian > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From dineshbvadhia at hotmail.com Fri Jan 24 10:19:09 2014 From: dineshbvadhia at hotmail.com (Dinesh Vadhia) Date: Fri, 24 Jan 2014 07:19:09 -0800 Subject: [Numpy-discussion] Catching out-of-memory error before it happens Message-ID: I want to write a general exception handler to warn if too much data is being loaded for the ram size in a machine for a successful numpy array operation to take place. 
For example, the program multiplies two floating point arrays A and B which are populated with loadtext. While the data is being loaded, want to continuously check that the data volume doesn't pass a threshold that will cause on out-of-memory error during the A*B operation. The known variables are the amount of memory available, data type (floats in this case) and the numpy array operation to be performed. It seems this requires knowledge of the internal memory requirements of each numpy operation. For sake of simplicity, can ignore other memory needs of program. Is this possible? -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Fri Jan 24 10:30:34 2014 From: njs at pobox.com (Nathaniel Smith) Date: Fri, 24 Jan 2014 15:30:34 +0000 Subject: [Numpy-discussion] Catching out-of-memory error before it happens In-Reply-To: References: Message-ID: There is no reliable way to predict how much memory an arbitrary numpy operation will need, no. However, in most cases the main memory cost will be simply the need to store the input and output arrays; for large arrays, all other allocations should be negligible. The most effective way to avoid running out of memory, therefore, is to avoid creating temporary arrays, by using only in-place operations. E.g., if a and b each require N bytes of ram, then memory requirements (roughly). c = a + b: 3N c = a + 2*b: 4N a += b: 2N np.add(a, b, out=a): 2N b *= 2; a += b: 2N Note that simply loading a and b requires 2N memory, so the latter code samples are near-optimal. Of course some calculations do require the use of temporary storage space... -n On 24 Jan 2014 15:19, "Dinesh Vadhia" wrote: > I want to write a general exception handler to warn if too much data is > being loaded for the ram size in a machine for a successful numpy array > operation to take place. For example, the program multiplies two floating > point arrays A and B which are populated with loadtext. While the data is > being loaded, want to continuously check that the data volume doesn't pass > a threshold that will cause on out-of-memory error during the A*B > operation. The known variables are the amount of memory available, data > type (floats in this case) and the numpy array operation to be performed. > It seems this requires knowledge of the internal memory requirements of > each numpy operation. For sake of simplicity, can ignore other memory > needs of program. Is this possible? > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From francesc at continuum.io Fri Jan 24 10:33:45 2014 From: francesc at continuum.io (Francesc Alted) Date: Fri, 24 Jan 2014 16:33:45 +0100 Subject: [Numpy-discussion] Catching out-of-memory error before it happens In-Reply-To: References: Message-ID: <52E287D9.4030507@continuum.io> Yeah, numexpr is pretty cool for avoiding temporaries in an easy way: https://github.com/pydata/numexpr Francesc El 24/01/14 16:30, Nathaniel Smith ha escrit: > > There is no reliable way to predict how much memory an arbitrary numpy > operation will need, no. However, in most cases the main memory cost > will be simply the need to store the input and output arrays; for > large arrays, all other allocations should be negligible. 
> > The most effective way to avoid running out of memory, therefore, is > to avoid creating temporary arrays, by using only in-place operations. > > E.g., if a and b each require N bytes of ram, then memory requirements > (roughly). > > c = a + b: 3N > c = a + 2*b: 4N > a += b: 2N > np.add(a, b, out=a): 2N > b *= 2; a += b: 2N > > Note that simply loading a and b requires 2N memory, so the latter > code samples are near-optimal. > > Of course some calculations do require the use of temporary storage > space... > > -n > > On 24 Jan 2014 15:19, "Dinesh Vadhia" > wrote: > > I want to write a general exception handler to warn if too much > data is being loaded for the ram size in a machine for a > successful numpy array operation to take place. For example, the > program multiplies two floating point arrays A and B which are > populated with loadtext. While the data is being loaded, want to > continuously check that the data volume doesn't pass a threshold > that will cause on out-of-memory error during the A*B operation. > The known variables are the amount of memory available, data type > (floats in this case) and the numpy array operation to be > performed. It seems this requires knowledge of the internal memory > requirements of each numpy operation. For sake of simplicity, can > ignore other memory needs of program. Is this possible? > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion -- Francesc Alted -------------- next part -------------- An HTML attachment was scrubbed... URL: From chris.barker at noaa.gov Fri Jan 24 10:57:14 2014 From: chris.barker at noaa.gov (Chris Barker - NOAA Federal) Date: Fri, 24 Jan 2014 07:57:14 -0800 Subject: [Numpy-discussion] Catching out-of-memory error before it happens In-Reply-To: References: Message-ID: <-1989644387440925520@unknownmsgid> c = a + b: 3N c = a + 2*b: 4N Does python garbage collect mid-expression? I.e. : C = (a + 2*b) + b 4 or 5 N? Also note that when memory gets tight, fragmentation can be a problem. I.e. if two size-n arrays where just freed, you still may not be able to allocate a size-2n array. This seems to be worse on windows, not sure why. a += b: 2N np.add(a, b, out=a): 2N b *= 2; a += b: 2N Note that simply loading a and b requires 2N memory, so the latter code samples are near-optimal. And will run quite a bit faster for large arrays--pushing that memory around takes time. -Chris -------------- next part -------------- An HTML attachment was scrubbed... URL: From dineshbvadhia at hotmail.com Fri Jan 24 11:01:48 2014 From: dineshbvadhia at hotmail.com (Dinesh Vadhia) Date: Fri, 24 Jan 2014 08:01:48 -0800 Subject: [Numpy-discussion] vstack and hstack performance penalty In-Reply-To: <1390575506.7837.7.camel@sebastian-laptop> References: <1390575506.7837.7.camel@sebastian-laptop> Message-ID: If A is very large and B is very small then np.concatenate(A, B) will copy B's data over to A which would take less time than the other way around - is that so? Does 'memory order' mean that it depends on sufficient contiguous memory being available for B otherwise it will be fragmented or something else? 
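A quick way to check the copying question empirically -- a minimal sketch (array sizes are arbitrary), using np.concatenate directly since that is what vstack/hstack call underneath:

    import numpy as np

    A = np.zeros(1000000)
    B = np.ones(3)

    C = np.concatenate([A, B])   # a new array is allocated; A and B are both copied into it

    print(C.shape)           # (1000003,)
    print(C.base is None)    # True: C owns its own memory, it is not a view of A or B
    print(A.shape, B.shape)  # (1000000,) (3,) -- neither input is modified or resized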
From robert.kern at gmail.com Fri Jan 24 11:21:07 2014 From: robert.kern at gmail.com (Robert Kern) Date: Fri, 24 Jan 2014 16:21:07 +0000 Subject: [Numpy-discussion] vstack and hstack performance penalty In-Reply-To: References: <1390575506.7837.7.camel@sebastian-laptop> Message-ID: On Fri, Jan 24, 2014 at 4:01 PM, Dinesh Vadhia wrote: > > If A is very large and B is very small then np.concatenate(A, B) will copy > B's data over to A which would take less time than the other way around - is > that so? No, neither array is modified in-place. A new array is created and both A and B are copied into it. The order is largely unimportant. > Does 'memory order' mean that it depends on sufficient contiguous > memory being available for B otherwise it will be fragmented or something > else? No, the output is never fragmented. numpy arrays may be strided, but never fragmented arbitrarily to fit into a fragmented address space. http://docs.scipy.org/doc/numpy/reference/arrays.ndarray.html#internal-memory-layout-of-an-ndarray The issue is what axis the concatenation happens on. If it's the first axis (and both inputs are contiguous), then it only takes two memcpy() calls to copy the data, one for each input, because the regions where they go into the output are juxtaposed. If you concatenate on one of the other axes, though, then the memory regions for A and B will be interleaved and you have to do 2*N memory copies (N being some number depending on the shape). -- Robert Kern -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Fri Jan 24 11:25:37 2014 From: njs at pobox.com (Nathaniel Smith) Date: Fri, 24 Jan 2014 16:25:37 +0000 Subject: [Numpy-discussion] Catching out-of-memory error before it happens In-Reply-To: <-1989644387440925520@unknownmsgid> References: <-1989644387440925520@unknownmsgid> Message-ID: On 24 Jan 2014 15:57, "Chris Barker - NOAA Federal" wrote: > > >> c = a + b: 3N >> c = a + 2*b: 4N > > Does python garbage collect mid-expression? I.e. : > > C = (a + 2*b) + b > > 4 or 5 N? It should be collected as soon as the reference gets dropped, so 4N. (This is the advantage of a greedy refcounting collector.) > Also note that when memory gets tight, fragmentation can be a problem. I.e. if two size-n arrays where just freed, you still may not be able to allocate a size-2n array. This seems to be worse on windows, not sure why. If your arrays are big enough that you're worried that making a stray copy will ENOMEM, then you *shouldn't* have to worry about fragmentation - malloc will give each array its own virtual mapping, which can be backed by discontinuous physical memory. (I guess it's possible windows has a somehow shoddy VM system and this isn't true, but that seems unlikely these days?) Memory fragmentation is more a problem if you're allocating lots of small objects of varying sizes. On 32 bit, virtual address fragmentation could also be a problem, but if you're working with giant data sets then you need 64 bits anyway :-). -n -------------- next part -------------- An HTML attachment was scrubbed... URL: From emanuele at relativita.com Fri Jan 24 11:30:33 2014 From: emanuele at relativita.com (Emanuele Olivetti) Date: Fri, 24 Jan 2014 17:30:33 +0100 Subject: [Numpy-discussion] np.array creation: unexpected behaviour Message-ID: <52E29529.9080705@relativita.com> Hi, I just came across this unexpected behaviour when creating a np.array() from two other np.arrays of different shape. 
Have a look at this example: ---- import numpy as np a = np.zeros(3) b = np.zeros((2,3)) c = np.zeros((3,2)) ab = np.array([a, b]) print ab.shape, ab.dtype ac = np.array([a, c], dtype=np.object) print ac.shape, ac.dtype ac_no_dtype = np.array([a, c]) print ac_no_dtype.shape, ac_no_dtype.dtype ---- The output, with NumPy v1.6.1 (Ubuntu 12.04) is: ---- (2,) object (2, 3) object Traceback (most recent call last): File "/tmp/numpy_bug.py", line 9, in ac_no_dtype = np.array([a, c]) ValueError: setting an array element with a sequence. ---- The result for 'ab' is what I expect. The one for 'ac' is a bit surprising. The one for ac_no_dtype even is more surprising. Is this an expected behaviour? Best, Emanuele From josef.pktd at gmail.com Fri Jan 24 11:46:39 2014 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Fri, 24 Jan 2014 11:46:39 -0500 Subject: [Numpy-discussion] np.array creation: unexpected behaviour In-Reply-To: <52E29529.9080705@relativita.com> References: <52E29529.9080705@relativita.com> Message-ID: On Fri, Jan 24, 2014 at 11:30 AM, Emanuele Olivetti wrote: > Hi, > > I just came across this unexpected behaviour when creating > a np.array() from two other np.arrays of different shape. > Have a look at this example: > ---- > import numpy as np > a = np.zeros(3) > b = np.zeros((2,3)) > c = np.zeros((3,2)) > ab = np.array([a, b]) > print ab.shape, ab.dtype > ac = np.array([a, c], dtype=np.object) > print ac.shape, ac.dtype > ac_no_dtype = np.array([a, c]) > print ac_no_dtype.shape, ac_no_dtype.dtype > ---- > The output, with NumPy v1.6.1 (Ubuntu 12.04) is: > ---- > (2,) object > (2, 3) object > Traceback (most recent call last): > File "/tmp/numpy_bug.py", line 9, in > ac_no_dtype = np.array([a, c]) > ValueError: setting an array element with a sequence. > ---- > > The result for 'ab' is what I expect. The one for 'ac' is > a bit surprising. The one for ac_no_dtype even > is more surprising. > > Is this an expected behaviour? the exception in ac_no_dtype is what I always expected, since it's not a rectangular array. It usually happened when I make a mistake. **Unfortunately** in newer numpy version it will also create an object array. AFAIR Josef > > Best, > > Emanuele > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From dineshbvadhia at hotmail.com Fri Jan 24 12:19:15 2014 From: dineshbvadhia at hotmail.com (Dinesh Vadhia) Date: Fri, 24 Jan 2014 09:19:15 -0800 Subject: [Numpy-discussion] Catching out-of-memory error before it happens In-Reply-To: References: Message-ID: So, with the example case, the approximate memory cost for an in-place operation would be: A *= B : 2N But, if the original A or B is to remain unchanged then it will be: C = A * B : 3N ? -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Fri Jan 24 12:23:02 2014 From: njs at pobox.com (Nathaniel Smith) Date: Fri, 24 Jan 2014 17:23:02 +0000 Subject: [Numpy-discussion] Catching out-of-memory error before it happens In-Reply-To: References: Message-ID: Yes. On 24 Jan 2014 17:19, "Dinesh Vadhia" wrote: > So, with the example case, the approximate memory cost for an in-place > operation would be: > > A *= B : 2N > > But, if the original A or B is to remain unchanged then it will be: > > C = A * B : 3N ? 
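In code, the two cases above look roughly like this (a sketch of the accounting just quoted; the array size is made up, and the real peak usage also depends on when Python frees temporaries):

    import numpy as np

    N = 10 * 1000 * 1000        # roughly 80 MB per float64 array
    A = np.random.rand(N)
    B = np.random.rand(N)

    # out-of-place: a third N-sized array is allocated for the result (~3N total)
    C = A * B
    del C

    # in-place: the product is written into A's existing buffer (~2N total)
    A *= B

    # the same in-place operation spelled with an explicit output argument
    np.multiply(A, B, out=A)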
> > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From dineshbvadhia at hotmail.com Fri Jan 24 17:09:21 2014 From: dineshbvadhia at hotmail.com (Dinesh Vadhia) Date: Fri, 24 Jan 2014 14:09:21 -0800 Subject: [Numpy-discussion] Catching out-of-memory error before it happens In-Reply-To: <52E287D9.4030507@continuum.io> References: <52E287D9.4030507@continuum.io> Message-ID: Francesc: Thanks. I looked at numexpr a few years back but it didn't support array slicing/indexing. Has that changed? -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Fri Jan 24 17:25:10 2014 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Fri, 24 Jan 2014 23:25:10 +0100 Subject: [Numpy-discussion] (no subject) In-Reply-To: References: Message-ID: On Thu, Jan 23, 2014 at 11:58 PM, jennifer stone wrote: > > > > > Scipy doesn't have a function for the Laplace transform, it has only a >> Laplace distribution in scipy.stats and a Laplace filter in scipy.ndimage. >> An inverse Laplace transform would be very welcome I'd think - it has real >> world applications, and there's no good implementation in any open source >> library as far as I can tell. It's probably doable, but not the easiest >> topic for a GSoC I think. From what I can find, the paper "Numerical >> Transform Inversion Using Gaussian Quadrature" from den Iseger contains >> what's considered the current state of the art algorithm. Browsing that >> gives a reasonable idea of the difficulty of implementing `ilaplace`. > > > A brief scanning through the paper "Numerical Transform Inversion Using > Gaussian Quadrature" from den Iseger does indicate the complexity of the > algorithm. But GSoC project or not, can't we work on it, step by step? As I > would love to see a contender for Matlab's ilaplace on open source front!! > Yes, it would be quite nice to have. So if you're interested, by all means give it a go. An issue for a GSoC will be how to maximize the chance of success - typically merging smaller PRs frequently helps a lot in that respect, but we can't merge an ilaplace implementation step by step. > You can have a look at https://github.com/scipy/scipy/pull/2908/files for >> ideas. Most of the things that need improving or we really think we should >> have in Scipy are listed there. Possible topics are not restricted to that >> list though - it's more important that you pick something you're >> interested >> in and have the required background and coding skills for. >> > > Thanks a lot for the roadmap. Of the options provided, I found the > 'Cython'ization of Cluster great. Would it be possible to do it as the > Summer project if I spend the month learning Cython? > There are a couple of things to consider. Your proposal should be neither too easy nor too ambitious for one summer. Cythonizing cluster is probably not enough for a full summer of work, especially if you can re-use some Cython code that David WF or other people already have. So some new functionality can be added to your proposal. The other important point is that you need to find a mentor. Cluster is one of the smaller modules that doesn't see a lot of development and most of the core devs may not know so well. A good proposal may help find an interested mentor. 
I suggest you start early with a draft proposal, and iterate a few times based on feedback on this list. You may want to have a look at your email client settings by the way, your replies seem to start new threads. Cheers, Ralf > Regards > Janani > > > >> Cheers, >> Ralf >> >> >> > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From chris.barker at noaa.gov Fri Jan 24 17:29:19 2014 From: chris.barker at noaa.gov (Chris Barker) Date: Fri, 24 Jan 2014 14:29:19 -0800 Subject: [Numpy-discussion] Catching out-of-memory error before it happens In-Reply-To: References: <-1989644387440925520@unknownmsgid> Message-ID: On Fri, Jan 24, 2014 at 8:25 AM, Nathaniel Smith wrote: > If your arrays are big enough that you're worried that making a stray copy > will ENOMEM, then you *shouldn't* have to worry about fragmentation - > malloc will give each array its own virtual mapping, which can be backed by > discontinuous physical memory. (I guess it's possible windows has a somehow > shoddy VM system and this isn't true, but that seems unlikely these days?) > All I know is that when I push the limits with memory on a 32 bit Windows system, it often crashed out when I've never seen more than about 1GB of memory use by the application -- I would have thought that would be plenty of overhead. I also know that I've reached limits onWindows32 well before OS_X 32, but that may be because IIUC, Windows32 only allows 2GB per process, whereas OS-X32 allows 4GB per process. Memory fragmentation is more a problem if you're allocating lots of small > objects of varying sizes. > It could be that's what I've been doing.... On 32 bit, virtual address fragmentation could also be a problem, but if > you're working with giant data sets then you need 64 bits anyway :-). > well, "giant" is defined relative to the system capabilities... but yes, if you're pushing the limits of a 32 bit system , the easiest thing to do is go to 64bits and some more memory! -CHB -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov -------------- next part -------------- An HTML attachment was scrubbed... URL: From chris.barker at noaa.gov Fri Jan 24 17:43:46 2014 From: chris.barker at noaa.gov (Chris Barker) Date: Fri, 24 Jan 2014 14:43:46 -0800 Subject: [Numpy-discussion] Text array dtype for numpy In-Reply-To: References: Message-ID: Oscar, Cool stuff, thanks! I'm wondering though what the use-case really is. The P3 text model (actually the py2 one, too), is quite clear that you want users to think of, and work with, text as text -- and not care how things are encoding in the underlying implementation. You only want the user to think about encodings on I/O -- transferring stuff between systems where you can't avoid it. And you might choose different encodings based on different needs. So why have a different, the-user-needs-to-think-about-encodings numpy dtype? We already have 'U' for full-on unicode support for text. There is a good argument for a more compact internal representation for text compatible with one-byte-per-char encoding, thus the suggestion for such a dtype. But I don't see the need for quite this. Maybe I'm not being a creative enough thinker. 
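For scale, a small sketch of the overhead in question, with a made-up word list (the exact dtype strings depend on byte order, but the itemsizes are the point): numpy's 'U' dtype stores four bytes per character, while 'S' stores one.

    import numpy as np

    words = ['numpy', 'text', 'dtype']

    u = np.array(words)                                  # unicode dtype: 4 bytes per character
    s = np.array([w.encode('ascii') for w in words])     # bytes dtype: 1 byte per character

    print(u.dtype, u.itemsize, u.nbytes)   # <U5 20 60
    print(s.dtype, s.itemsize, s.nbytes)   # |S5 5 15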
Also, we may want numpy to interact at a low level with other libs that might have binary encoded text (HDF, etc) -- in which case we need a bytes dtype that can store that data, and perhaps encoding and decoding ufuncs. If we want a more efficient and compact unicode implementation then the py3 one is a good place to start -it's pretty slick! Though maybe harder to due in numpy as text in numpy probably wouldn't be immutable. To make a slightly more concrete proposal, I've implemented a pure > Python ndarray subclass that I believe can consistently handle > text/bytes in Python 3. this scares me right there -- is it text or bytes??? We really don't want something that is both. > The idea is that the array has an encoding. It stores strings as > bytes. The bytes are encoded/decoded on insertion/access. Methods > accessing the binary content of the array will see the encoded bytes. > Methods accessing the elements of the array will see unicode strings. > > I believe it would not be as hard to implement as the proposals for > variable length string arrays. except that with some encodings, the number of bytes required is a function of what the content of teh text is -- so it either has to be variable length, or a fixed number of bytes, which is not a fixed number of characters which require both careful truncation (a pain), and surprising results for users "why can't I fit 10 characters is a length-10 text object? And I can if they are different characters?) > The one caveat is that it will strip > null characters from the end of any string. which is fatal, but you do want a new dtype after all, which presumably wouldn't do that. -Chris -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Fri Jan 24 18:09:01 2014 From: njs at pobox.com (Nathaniel Smith) Date: Fri, 24 Jan 2014 23:09:01 +0000 Subject: [Numpy-discussion] Catching out-of-memory error before it happens In-Reply-To: References: <-1989644387440925520@unknownmsgid> Message-ID: On Fri, Jan 24, 2014 at 10:29 PM, Chris Barker wrote: > On Fri, Jan 24, 2014 at 8:25 AM, Nathaniel Smith wrote: >> >> If your arrays are big enough that you're worried that making a stray copy >> will ENOMEM, then you *shouldn't* have to worry about fragmentation - malloc >> will give each array its own virtual mapping, which can be backed by >> discontinuous physical memory. (I guess it's possible windows has a somehow >> shoddy VM system and this isn't true, but that seems unlikely these days?) > > All I know is that when I push the limits with memory on a 32 bit Windows > system, it often crashed out when I've never seen more than about 1GB of > memory use by the application -- I would have thought that would be plenty > of overhead. > > I also know that I've reached limits onWindows32 well before OS_X 32, but > that may be because IIUC, Windows32 only allows 2GB per process, whereas > OS-X32 allows 4GB per process. > >> Memory fragmentation is more a problem if you're allocating lots of small >> objects of varying sizes. > > It could be that's what I've been doing.... > >> On 32 bit, virtual address fragmentation could also be a problem, but if >> you're working with giant data sets then you need 64 bits anyway :-). > > well, "giant" is defined relative to the system capabilities... 
but yes, if > you're pushing the limits of a 32 bit system , the easiest thing to do is > go to 64bits and some more memory! Oh, yeah, common confusion. Allowing 2 GiB of address space per process doesn't mean you can actually practically use 2 GiB of *memory* per process, esp. if you're allocating/deallocating a mix of large and small objects, because address space fragmentation will kill you way before that. The memory is there, there isn't anywhere to slot it into the process's address space. So you don't need to add more memory, just switch to a 64-bit OS. On 64-bit you have oodles of address space, so the memory manager can easily slot in large objects far away from small objects, and it's only fragmentation within each small-object arena that hurts. A good malloc will keep this overhead down pretty low though -- certainly less than the factor of two you're thinking about. -n From sebastian at sipsolutions.net Fri Jan 24 19:05:15 2014 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Sat, 25 Jan 2014 01:05:15 +0100 Subject: [Numpy-discussion] Comparison changes Message-ID: <1390608315.7837.22.camel@sebastian-laptop> Hi all, in https://github.com/numpy/numpy/pull/3514 I proposed some changes to the comparison operators. This includes: 1. Comparison with None will broadcast in the future, so that `arr == None` will actually compare all elements to None. (A FutureWarning for now) 2. I added that == and != will give FutureWarning when an error was raised. In the future they should not silence these errors anymore. (For example shape mismatches) 3. We used to use PyObject_RichCompareBool for equality which includes an identity check. I propose to not do that identity check since we have elementwise equality (returning an object array for objects would be nice in some ways, but I think that is only an option for a dedicated function). The reason is that for example >>> a = np.array([np.array([1, 2, 3]), 1]) >>> b = np.array([np.array([1, 2, 3]), 1]) >>> a == b will happen to work if it happens to be that `a[0] is b[0]`. This currently has no deprecation, since the logic is in the inner loop and I am not sure if it is easy to add well there. Are there objections/comments to these changes? Regards, Sebastian From njs at pobox.com Fri Jan 24 19:18:12 2014 From: njs at pobox.com (Nathaniel Smith) Date: Sat, 25 Jan 2014 00:18:12 +0000 Subject: [Numpy-discussion] Comparison changes In-Reply-To: <1390608315.7837.22.camel@sebastian-laptop> References: <1390608315.7837.22.camel@sebastian-laptop> Message-ID: On 25 Jan 2014 00:05, "Sebastian Berg" wrote: > > Hi all, > > in https://github.com/numpy/numpy/pull/3514 I proposed some changes to > the comparison operators. This includes: > > 1. Comparison with None will broadcast in the future, so that `arr == > None` will actually compare all elements to None. (A FutureWarning for > now) > > 2. I added that == and != will give FutureWarning when an error was > raised. In the future they should not silence these errors anymore. (For > example shape mismatches) This can just be a DeprecationWarning, because the only change is to raise new more errors. > 3. We used to use PyObject_RichCompareBool for equality which includes > an identity check. I propose to not do that identity check since we have > elementwise equality (returning an object array for objects would be > nice in some ways, but I think that is only an option for a dedicated > function). 
The reason is that for example > > >>> a = np.array([np.array([1, 2, 3]), 1]) > >>> b = np.array([np.array([1, 2, 3]), 1]) > >>> a == b > > will happen to work if it happens to be that `a[0] is b[0]`. This > currently has no deprecation, since the logic is in the inner loop and I > am not sure if it is easy to add well there. Surely any environment where we can call PyObject_RichCompareBool is an environment where we can issue a warning...? -n -------------- next part -------------- An HTML attachment was scrubbed... URL: From stefan at sun.ac.za Fri Jan 24 19:34:12 2014 From: stefan at sun.ac.za (=?iso-8859-1?Q?St=E9fan?= van der Walt) Date: Sat, 25 Jan 2014 01:34:12 +0100 Subject: [Numpy-discussion] np.array creation: unexpected behaviour In-Reply-To: <52E29529.9080705@relativita.com> References: <52E29529.9080705@relativita.com> Message-ID: <20140125003412.GL23850@gmail.com> On Fri, 24 Jan 2014 17:30:33 +0100, Emanuele Olivetti wrote: > I just came across this unexpected behaviour when creating > a np.array() from two other np.arrays of different shape. The tuple parsing for the construction of new numpy arrays is pretty tricky/hairy, and doesn't always do exactly what you'd expect. The easiest workaround is probably to pre-allocate the array: In [24]: data = [a, c] In [25]: x = np.empty(len(data), dtype=object) In [26]: x[:] = data In [27]: x.shape Out[27]: (2,) Regards St?fan From josef.pktd at gmail.com Fri Jan 24 23:19:44 2014 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Fri, 24 Jan 2014 23:19:44 -0500 Subject: [Numpy-discussion] Text array dtype for numpy In-Reply-To: References: Message-ID: On Fri, Jan 24, 2014 at 5:43 PM, Chris Barker wrote: > Oscar, > > Cool stuff, thanks! > > I'm wondering though what the use-case really is. The P3 text model > (actually the py2 one, too), is quite clear that you want users to think of, > and work with, text as text -- and not care how things are encoding in the > underlying implementation. You only want the user to think about encodings > on I/O -- transferring stuff between systems where you can't avoid it. And > you might choose different encodings based on different needs. > > So why have a different, the-user-needs-to-think-about-encodings numpy > dtype? We already have 'U' for full-on unicode support for text. There is a > good argument for a more compact internal representation for text compatible > with one-byte-per-char encoding, thus the suggestion for such a dtype. But I > don't see the need for quite this. Maybe I'm not being a creative enough > thinker. In my opinion something like Oscar's class would be very useful (with some adjustments, especially making it easy to create an S view or put a encoding view on top of an S array). (Disclaimer: My only experience is in converting some examples in statsmodels to bytes in py 3 and to play with some examples.) My guess is that 'S'/bytes is very convenient for library code, because it doesn't care about encodings (assuming we have enough control that all bytes are in the same encoding), and we don't have any overhead to convert to strings when comparing or working with "byte strings". 'S' is also very flexible because it doesn't tie us down to a minimum size for the encoding nor any specific encoding. The problem of 'S'/bytes is in input output and interactive work, as in the examples of Tom Aldcroft. The textarray dtype would allow us to view any 'S' array so we can have text/string interaction with python and get the correct encoding on input and output. 
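Roughly the effect I mean, spelled out with the existing vectorized np.char functions (only a sketch of the round-trip, with latin-1 chosen arbitrarily):

>>> import numpy as np
>>> raw = np.array([b'caf\xe9', b'abc'], dtype='S4')  # latin-1 encoded bytes in memory / on disk
>>> text = np.char.decode(raw, 'latin-1')             # unicode 'U' array for interactive work
>>> back = np.char.encode(text, 'latin-1')            # back to the compact 'S' bytes for output

A textarray dtype with an attached encoding would just make that decode/encode step implicit.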
Whether you live in an ascii, latin1, cp1252, iso8859_5 or in any other world, you could get your favorite minimal memory S/bytes/strings. I think this is useful as a complement to the current 'S' type, and to make that more useful on python 3, independent of what other small memory unicode dtype with predefined encoding numpy could get. > > Also, we may want numpy to interact at a low level with other libs that > might have binary encoded text (HDF, etc) -- in which case we need a bytes > dtype that can store that data, and perhaps encoding and decoding ufuncs. > > If we want a more efficient and compact unicode implementation then the py3 > one is a good place to start -it's pretty slick! Though maybe harder to due > in numpy as text in numpy probably wouldn't be immutable. > >> To make a slightly more concrete proposal, I've implemented a pure >> Python ndarray subclass that I believe can consistently handle >> text/bytes in Python 3. > > > this scares me right there -- is it text or bytes??? We really don't want > something that is both. Most users won't care about the internal representation of anything. But when we want or find it useful we can view the memory with any compatible dtype. That is, with numpy we always have also raw "bytes". And there are lot's of ways to shoot yourself why would you want to to that? : >>> a = np.arange(5) >>> b = a.view('S4') >>> b[1] = 'h' >>> a array([ 0, 104, 2, 3, 4]) >>> a[1] = 'h' Traceback (most recent call last): File "", line 1, in a[1] = 'h' ValueError: invalid literal for int() with base 10: 'h' > >> >> The idea is that the array has an encoding. It stores strings as >> bytes. The bytes are encoded/decoded on insertion/access. Methods >> accessing the binary content of the array will see the encoded bytes. >> Methods accessing the elements of the array will see unicode strings. >> >> I believe it would not be as hard to implement as the proposals for >> variable length string arrays. > > > except that with some encodings, the number of bytes required is a function > of what the content of teh text is -- so it either has to be variable > length, or a fixed number of bytes, which is not a fixed number of > characters which require both careful truncation (a pain), and surprising > results for users "why can't I fit 10 characters is a length-10 text > object? And I can if they are different characters?) not really different to other places where you have to pay attention to the underlying dtype, and a question of providing the underlying information. (like itemsize) 1 - 1e-20 I had code like that when I wasn't thinking properly or wasn't paying enough attention to what I was typing. > >> >> The one caveat is that it will strip >> null characters from the end of any string. > > > which is fatal, but you do want a new dtype after all, which presumably > wouldn't do that. The only place so far that I found where this really hurts is in the decode examples (with utf32LE for example). That's why I think numpy needs to have decode/encode functions, so it can access the bytes before they are null truncated, besides being vectorized. BTW: I wanted to start a new thread "in defence of (null truncated) 'S' string bytes", but I ran into too many other issues to work out the examples. Josef > > -Chris > > > -- > > Christopher Barker, Ph.D. 
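The same kind of silent surprise already exists today whenever you ignore the itemsize -- illustrative only:

>>> import numpy as np
>>> a = np.zeros(3, dtype='S4')
>>> a[0] = b'abcdefg'   # silently truncated to the 4-byte itemsize
>>> a[0]
b'abcd'
>>> 1 - 1e-20 == 1.0    # and the float analogue mentioned above
True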
> Oceanographer > > Emergency Response Division > NOAA/NOS/OR&R (206) 526-6959 voice > 7600 Sand Point Way NE (206) 526-6329 fax > Seattle, WA 98115 (206) 526-6317 main reception > > Chris.Barker at noaa.gov > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From sebastian at sipsolutions.net Sat Jan 25 05:25:50 2014 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Sat, 25 Jan 2014 11:25:50 +0100 Subject: [Numpy-discussion] Comparison changes In-Reply-To: References: <1390608315.7837.22.camel@sebastian-laptop> Message-ID: <1390645550.7837.27.camel@sebastian-laptop> On Sat, 2014-01-25 at 00:18 +0000, Nathaniel Smith wrote: > On 25 Jan 2014 00:05, "Sebastian Berg" > wrote: > > > > Hi all, > > > > in https://github.com/numpy/numpy/pull/3514 I proposed some changes > to > > the comparison operators. This includes: > > > > 1. Comparison with None will broadcast in the future, so that `arr > == > > None` will actually compare all elements to None. (A FutureWarning > for > > now) > > > > 2. I added that == and != will give FutureWarning when an error was > > raised. In the future they should not silence these errors anymore. > (For > > example shape mismatches) > > This can just be a DeprecationWarning, because the only change is to > raise new more errors. > Right, already is the case. > > 3. We used to use PyObject_RichCompareBool for equality which > includes > > an identity check. I propose to not do that identity check since we > have > > elementwise equality (returning an object array for objects would be > > nice in some ways, but I think that is only an option for a > dedicated > > function). The reason is that for example > > > > >>> a = np.array([np.array([1, 2, 3]), 1]) > > >>> b = np.array([np.array([1, 2, 3]), 1]) > > >>> a == b > > > > will happen to work if it happens to be that `a[0] is b[0]`. This > > currently has no deprecation, since the logic is in the inner loop > and I > > am not sure if it is easy to add well there. > > Surely any environment where we can call PyObject_RichCompareBool is > an environment where we can issue a warning...? > Right, I suppose an extra identity check and comparing it with the other result is indeed no problem. So I think I will add that. - Sebastian > -n > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From davidmenhur at gmail.com Sat Jan 25 10:48:05 2014 From: davidmenhur at gmail.com (=?UTF-8?B?RGHPgGlk?=) Date: Sat, 25 Jan 2014 16:48:05 +0100 Subject: [Numpy-discussion] Catching out-of-memory error before it happens In-Reply-To: References: <52E287D9.4030507@continuum.io> Message-ID: On 24 January 2014 23:09, Dinesh Vadhia wrote: > Francesc: Thanks. I looked at numexpr a few years back but it didn't > support array slicing/indexing. Has that changed? > No, but you can do it yourself. big_array = np.empty(20000) piece = big_array[30:-50] ne.evaluate('sqrt(piece)') Here, creating "piece" does not increase memory use, as slicing shares the original data (well, actually, it adds a mere 80 bytes, the overhead of an array). -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From charlesr.harris at gmail.com Sat Jan 25 11:33:40 2014 From: charlesr.harris at gmail.com (Charles R Harris) Date: Sat, 25 Jan 2014 09:33:40 -0700 Subject: [Numpy-discussion] using loadtxt to load a text file in to a numpy array In-Reply-To: References: <20140122104646.GA2555@gmail.com> <20140122211328.GA1938@gmail.com> <7210200738621770529@unknownmsgid> <20140123104520.GA2300@gmail.com> <20140123164305.GA5688@gmail.com> Message-ID: On Thu, Jan 23, 2014 at 11:49 AM, Chris Barker wrote: > Thanks for poking into this all. I've lost track a bit, but I think: > > The 'S' type is clearly broken on py3 (at least). I think that gives us > room to change it, and backward compatibly is less of an issue because it's > broken already -- do we need to preserve bug-for-bug compatibility? Maybe, > but I suspect in this case, not -- the code the "works fine" on py3 with > the 'S' type is probably only lucky that it hasn't encountered the issues > yet. > > And no matter how you slice it, code being ported to py3 needs to deal > with text handling issues. > > But here is where we stand: > > The 'S' dtype: > > - was designed for one-byte-per-char text data. > - was mapped to the py2 string type. > - used the classic C null-terminated approach. > - can be used for arbitrary bytes (as the py2 string type can), but not > quite, as it truncates null bytes -- so it really a bad idea to use it that > way. > > Under py3: > The 'S' type maps to the py3 bytes type, because that's the closest to > the py2 string type. But it also does some inconsistent things with > encoding, and does treat a lot of other things as text. But the py3 bytes > type does not have the same text handling as the py2 string type, so things > like: > > s = 'a string' > np.array((s,), dtype='S')[0] == s > > Gives you False, rather than True on py2. This is because a py3 string is > translated to the 'S' type (presumable with the default encoding, another > maybe not a good idea, but returns a bytes object, which does not compare > true to a py3 string. YOu can work aroudn this with varios calls to > encode() and decode, and/or using b'a string', but that is ugly, kludgy, > and doesn't work well with the py3 text model. > > > The py2 => py3 transition separated bytes and strings: strings are > unicode, and bytes are not to be used for text (directly). While there is > some text-related functionality still in bytes, the core devs are quite > clear that that is for special cases only, and not for general text > processing. > > I don't think numpy should fight this, but rather embrace the py3 text > model. The most natural way to do that is to use the existing 'U' dtype for > text. Really the best solution for most cases. (Like the above case) > > However, there is a use case for a more efficient way to deal with text. > There are a couple ways to go about that that have been brought up here: > > 1: have a more efficient unicode dtype: variable length, > multiple encoding options, etc.... > - This is a fine idea that would support better text handling in > numpy, and _maybe_ better interaction with external libraries (HDF, etc...) > > 2: Have a one-byte-per-char text dtype: > - This would be much easier to implement fit into the current numpy > model, and satisfy a lot of common use cases for scientific data sets. > > We could certainly do both, but I'd like to see (2) get done sooner than > later.... > This is pretty much my sense of things at the moment. 
I think 1) is needed in the long term but that 2) is a quick fix that solves most problems in the short term. > > A related issue is whether numpy needs a dtype analogous to py3 bytes -- > I'm still not sure of the use-case there, so can't comment -- would it need > to be fixed length (fitting into the numpy data model better) or variable > length, or ??? Some folks are (apparently) using the current 'S' type in > this way, but I think that's ripe for errors, due to the null bytes issue. > Though maybe there is a null-bytes-are-special binary format that isn't > text -- I have no idea. > > So what do we do with 'S'? It really is pretty broken, so we have a > couple choices: > > (1) depricate it, so that it stays around for backward compatibility > but encourage people to either use 'U' for text, or one of the new dtypes > that are yet to be implemented (maybe 's' for a one-byte-per-char dtype), > and use either uint8 or the new bytes dtype that is yet to be implemented. > > (2) fix it -- in this case, I think we need to be clear what it is: > -- A one-byte-char-text type? If so, it should map to a py3 string, > and have a defined encoding (ascii or latin-1, probably), or even better a > settable encoding (but only for one-byte-per-char encodings -- I don't > think utf-8 is a good idea here, as a utf-8 encoded string is of unknown > length. (there is some room for debate here, as the 'S' type is fixed > length and truncates anyway, maybe it's fine for it to truncate utf-8 -- as > long as it doesn't partially truncate in teh middle of a charactor) > I think we should make it a one character encoded type compatible with str in python 2, and maybe latin-1 in python 3. I'm thinking latin-1 because of pep 393 where it is effectively a UCS-1, but ascii might be a bit more flexible because it is a subset of utf-8 and might serve better in python 2. > -- a bytes type? in which case, we should clean out all teh > automatic conversion to-from text that iare in it now. > > I'm not sure what to do about a bytes type. > I vote for it being our one-byte text type -- it almost is already, and it > would make the easiest transition for folks from py2 to py3. But backward > compatibility is backward compatibility. > > Not sure what to do here. It would be nice if S was a string type of given encoding. Might be worth an experiment to see how much breaks. > > numpy arrays need a decode and encode method > > > I'm not sure that they do. Rather there needs to be a text dtype that >> knows what encoding to use in order to have a binary interface as >> exposed by .tostring() and friends and but produce unicode strings >> when indexed from Python code. Having both a text and a binary >> interface to the same data implies having an encoding. > > > I agree with Oscar here -- let's not conflate encode and decoded data -- > the py3 text model is a fine one, we should work with it as much > as practical. > > UNLESS: if we do add a bytes dtype, then it would be a reasonable use case > to use it to store encoded text (just like the py3 bytes types), in which > case it would be good to have encode() and decode() methods or ufuncs -- > probably ufuncs. But that should be for special purpose, at the I/O > interface kind of stuff. > > Chuck -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From sturla.molden at gmail.com Sat Jan 25 12:34:32 2014 From: sturla.molden at gmail.com (Sturla Molden) Date: Sat, 25 Jan 2014 17:34:32 +0000 (UTC) Subject: [Numpy-discussion] Numpy arrays vs typed memoryviews References: <68ad9af0-d113-4b20-ba44-11215a193a37@googlegroups.com> Message-ID: <1245884549412361147.264258sturla.molden-gmail.com@news.gmane.org> I think I have said this before, but its worth a repeat: Pickle (including cPickle) is a slow hog! That might not be the overhead you see, you just haven't noticed it yet. I saw this some years ago when I worked on shared memory arrays for Numpy (cf. my account on Github). Shared memory really did not help to speed up the IPC, because the entire overhead was dominated by pickle. (Shared memory is a fine way of saving RAM, though.) multiprocessing.Queue will use pickle for serialization, and is therefore not the right tool for numerical parallel computing with Cython or NumPy. In order to use multiprocessing efficiently with NumPy, we need a new Queue type that knows about NumPy arrays (and/or Cython memoryviews), and treat them as special cases. Getting rid of pickle altogether is the important part, not facilitating its use even further. It is easy to make a Queue type for Cython or NumPy arrays using a duplex pipe and couple of mutexes. Or you can use shared memory as ringbuffer and atomic compare-and-swap on the first bytes as spinlocks. It is not difficult to get the overhead of queuing arrays down to little more than a memcpy. I've been wanting to do this for a while, so maybe it is time to start a new toy project :) Sturla Neal Hughes wrote: > I like Cython a lot. My only complaint is that I have to keep switching > between the numpy array support and typed memory views. Both have there > advantages but neither can do every thing I need. > > Memoryviews have the clean syntax and seem to work better in cdef classes > and in inline functions. > > But Memoryviews can't be pickled and so can't be passed between > processes. Also there seems to be a high overhead on converting between > memory views and python numpy arrays. Where this overhead is a problem, > or where i need to use pythons multiprocessing module I tend to switch to numpy arrays. > > If memory views could be converted into python fast, and pickled I would > have no need for the old numpy array support. > > Wondering if these problems will ever be addressed, or if I am missing > something completely. > > -- > > --- > You received this message because you are subscribed to the Google Groups > "cython-users" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to cython-users+unsubscribe at googlegroups.com. > For more options, visit href="https://groups.google.com/groups/opt_out.">https://groups.google.com/groups/opt_out. > > ------=_Part_1342_18667054.1390644115997 > Content-Type: text/html; charset=UTF-8 > Content-Transfer-Encoding: quoted-printable > >
> > ------=_Part_1342_18667054.1390644115997-- From stefan at sun.ac.za Sat Jan 25 17:06:56 2014 From: stefan at sun.ac.za (=?iso-8859-1?Q?St=E9fan?= van der Walt) Date: Sat, 25 Jan 2014 23:06:56 +0100 Subject: [Numpy-discussion] Comparison changes In-Reply-To: <1390608315.7837.22.camel@sebastian-laptop> References: <1390608315.7837.22.camel@sebastian-laptop> Message-ID: <20140125220656.GH26658@gmail.com> On Sat, 25 Jan 2014 01:05:15 +0100, Sebastian Berg wrote: > 1. Comparison with None will broadcast in the future, so that `arr == > None` will actually compare all elements to None. (A FutureWarning for > now) This is a very useful change in behavior--thanks! St?fan From oscar.j.benjamin at gmail.com Sat Jan 25 17:45:13 2014 From: oscar.j.benjamin at gmail.com (Oscar Benjamin) Date: Sat, 25 Jan 2014 22:45:13 +0000 Subject: [Numpy-discussion] Text array dtype for numpy In-Reply-To: References: Message-ID: On 24 January 2014 22:43, Chris Barker wrote: > Oscar, > > Cool stuff, thanks! > > I'm wondering though what the use-case really is. The use-case is precisely the use-case for dtype='S' on Py2 except that it also works on Py3. > The P3 text model > (actually the py2 one, too), is quite clear that you want users to think of, > and work with, text as text -- and not care how things are encoding in the > underlying implementation. You only want the user to think about encodings > on I/O -- transferring stuff between systems where you can't avoid it. And > you might choose different encodings based on different needs. Exactly. But what you're missing is that storing text in a numpy array is putting the text into bytes and the encoding needs to be specified. My proposal involves explicitly specifying the encoding. This is the key point about the Python 3 text model: it is not that encoding isn't automatic (e.g. when you print() or call file.write with a text file); the point is that there must never be ambiguity about the encoding that is used when encode/decode occurs. > So why have a different, the-user-needs-to-think-about-encodings numpy > dtype? We already have 'U' for full-on unicode support for text. There is a > good argument for a more compact internal representation for text compatible > with one-byte-per-char encoding, thus the suggestion for such a dtype. But I > don't see the need for quite this. Maybe I'm not being a creative enough > thinker. Because users want to store text in a numpy array and use less than 4 bytes per character. You expressed a desire for this. The only difference between this and your latin-1 suggestion is that this one has an explicit encoding that is visible to the user and that you can choose that encoding to be anything that your Python installation supports. > Also, we may want numpy to interact at a low level with other libs that > might have binary encoded text (HDF, etc) -- in which case we need a bytes > dtype that can store that data, and perhaps encoding and decoding ufuncs. Perhaps there is a need for a bytes dtype as well. But not that you can use textarray with encoding='ascii' to satisfy many of these use cases. So h5py and pytables can expose an interface that stores text as bytes but has a clearly labelled (and enforced) encoding. > If we want a more efficient and compact unicode implementation then the py3 > one is a good place to start -it's pretty slick! Though maybe harder to due > in numpy as text in numpy probably wouldn't be immutable. It's not a good fit for numpy because numpy arrays expose their memory buffer. 
More on this below but if there was to be something as drastic as the FSR then it would be better to think about how to make an ndarray type that is completely different, has an opaque memory buffer and can handle arbitrary length text strings. >> To make a slightly more concrete proposal, I've implemented a pure >> Python ndarray subclass that I believe can consistently handle >> text/bytes in Python 3. > > this scares me right there -- is it text or bytes??? We really don't want > something that is both. I believe that there is a conceptual misunderstanding about what a numpy array is here. A numpy array is a clever view onto a memory buffer. A numpy array always has two interfaces, one that describes a memory buffer and one that delivers Python objects representing the abstract quantities described by each portion of the memory buffer. The dtype specifies three things: 1) How many bytes of the buffer are used. 2) What kind of abstract object this part of the buffer represents. 3) The mapping from the bytes in this segment of the buffer to the abstract object. As an example: >>> import numpy as np >>> a = np.array([1, 2, 3], dtype='>> a array([1, 2, 3], dtype=uint32) >>> a.tostring() b'\x01\x00\x00\x00\x02\x00\x00\x00\x03\x00\x00\x00' So what is this array? Is it bytes or is it integers? It is both. The array is a view onto a memory buffer and the dtype is the encoding that describes the meaning of the bytes in different segments. In this case the dtype is '>> a = np.array(['qwe'], dtype='U') >>> a array(['qwe'], dtype='>> a[0] # text 'qwe' >>> a.tostring() # bytes b'q\x00\x00\x00w\x00\x00\x00e\x00\x00\x00' In my proposal you'd get the same by using 'utf-32-le' as the encoding for your text array. >> The idea is that the array has an encoding. It stores strings as >> bytes. The bytes are encoded/decoded on insertion/access. Methods >> accessing the binary content of the array will see the encoded bytes. >> Methods accessing the elements of the array will see unicode strings. >> >> I believe it would not be as hard to implement as the proposals for >> variable length string arrays. > > except that with some encodings, the number of bytes required is a function > of what the content of teh text is -- so it either has to be variable > length, or a fixed number of bytes, which is not a fixed number of > characters which require both careful truncation (a pain), and surprising > results for users "why can't I fit 10 characters is a length-10 text > object? And I can if they are different characters?) It should be a fixed number of bytes. It does mean that 10 characters might not fit into a 10-byte text portion but there's no way around that if it is a fixed length and the encoding is variable-width. I don't really think that this is much of a problem though. Most use cases are probably going to use 'ascii' anyway. The improvement those use-cases get is error detection for non-ascii characters and explicitly labelled encodings, rather than mojibake. >> The one caveat is that it will strip >> null characters from the end of any string. > > which is fatal, but you do want a new dtype after all, which presumably > wouldn't do that. Why is that fatal for text (not arbitrary byte strings)? There are many other reasons (relating to other programming languages and software) why you can't usually put null characters into text anyway. I don't really see how to get around this if the bytes must go into fixed-width portions without an out-of-band way to specify the length of the string. 
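For reference, the plain 'S' dtype already behaves exactly this way: trailing nulls are part of the buffer but invisible at the Python level (quick sketch):

>>> import numpy as np
>>> a = np.array([b'ab'], dtype='S4')
>>> a[0]          # trailing nulls stripped on access
b'ab'
>>> a.tostring()  # but still present in the underlying buffer
b'ab\x00\x00'

So in this respect a fixed-width text array is no worse than the status quo.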
Oscar From francesc at continuum.io Sun Jan 26 02:39:00 2014 From: francesc at continuum.io (Francesc Alted) Date: Sun, 26 Jan 2014 08:39:00 +0100 Subject: [Numpy-discussion] ANN: numexpr 2.3 (final) released Message-ID: <52E4BB94.80300@continuum.io> ========================== Announcing Numexpr 2.3 ========================== Numexpr is a fast numerical expression evaluator for NumPy. With it, expressions that operate on arrays (like "3*a+4*b") are accelerated and use less memory than doing the same calculation in Python. It wears multi-threaded capabilities, as well as support for Intel's MKL (Math Kernel Library), which allows an extremely fast evaluation of transcendental functions (sin, cos, tan, exp, log...) while squeezing the last drop of performance out of your multi-core processors. Look here for a some benchmarks of numexpr using MKL: https://github.com/pydata/numexpr/wiki/NumexprMKL Its only dependency is NumPy (MKL is optional), so it works well as an easy-to-deploy, easy-to-use, computational engine for projects that don't want to adopt other solutions requiring more heavy dependencies. Numexpr is already being used in a series of packages (PyTables, pandas, BLZ...) for helping doing computations faster. What's new ========== The repository has been migrated to https://github.com/pydata/numexpr. All new tickets and PR should be directed there. Also, a `conj()` function for computing the conjugate of complex arrays has been added. Thanks to David Men?ndez. See PR #125. Finallly, we fixed a DeprecationWarning derived of using ``oa_ndim == 0`` and ``op_axes == NULL`` with `NpyIter_AdvancedNew()` and NumPy 1.8. Thanks to Mark Wiebe for advise on how to fix this properly. Many thanks to Christoph Gohlke and Ilan Schnell for his help during the testing of this release in all kinds of possible combinations of platforms and MKL. In case you want to know more in detail what has changed in this version, see: https://github.com/pydata/numexpr/wiki/Release-Notes or have a look at RELEASE_NOTES.txt in the tarball. Where I can find Numexpr? ========================= The project is hosted at GitHub in: https://github.com/pydata/numexpr You can get the packages from PyPI as well (but not for RC releases): http://pypi.python.org/pypi/numexpr Share your experience ===================== Let us know of any bugs, suggestions, gripes, kudos, etc. you may have. Enjoy data! -- Francesc Alted From francesc at continuum.io Sun Jan 26 02:44:25 2014 From: francesc at continuum.io (Francesc Alted) Date: Sun, 26 Jan 2014 08:44:25 +0100 Subject: [Numpy-discussion] ANN: python-blosc 1.2.0 released Message-ID: <52E4BCD9.90807@continuum.io> ============================= Announcing python-blosc 1.2.0 ============================= What is new? ============ This release adds support for the multiple compressors added in Blosc 1.3 series. The new compressors are: * lz4 (http://code.google.com/p/lz4/): A very fast compressor/decompressor. Could be thought as a replacement of the original BloscLZ, but it can behave better is some scenarios. * lz4hc (http://code.google.com/p/lz4/): This is a variation of LZ4 that achieves much better compression ratio at the cost of being much slower for compressing. Decompression speed is unaffected (and sometimes better than when using LZ4 itself!), so this is very good for read-only datasets. * snappy (http://code.google.com/p/snappy/): A very fast compressor/decompressor. Could be thought as a replacement of the original BloscLZ, but it can behave better is some scenarios. 
* zlib (http://www.zlib.net/): This is a classic. It achieves very good compression ratios, at the cost of speed. However, decompression speed is still pretty good, so it is a good candidate for read-only datasets. Selecting the compressor is just a matter of specifying the new `cname` parameter in compression functions. For example:: in = numpy.arange(N, dtype=numpy.int64) out = blosc.pack_array(in, cname="lz4") Just to have an overview of the differences between the different compressors in new Blosc, here it is the output of the included compress_ptr.py benchmark: https://github.com/ContinuumIO/python-blosc/blob/master/bench/compress_ptr.py that compresses/decompresses NumPy arrays with different data distributions:: Creating different NumPy arrays with 10**7 int64/float64 elements: *** np.copy() **** Time for memcpy(): 0.030 s *** the arange linear distribution *** *** blosclz *** Time for comp/decomp: 0.013/0.022 s. Compr ratio: 136.83 *** lz4 *** Time for comp/decomp: 0.009/0.031 s. Compr ratio: 137.19 *** lz4hc *** Time for comp/decomp: 0.103/0.021 s. Compr ratio: 165.12 *** snappy *** Time for comp/decomp: 0.012/0.045 s. Compr ratio: 20.38 *** zlib *** Time for comp/decomp: 0.243/0.056 s. Compr ratio: 407.60 *** the linspace linear distribution *** *** blosclz *** Time for comp/decomp: 0.031/0.036 s. Compr ratio: 14.27 *** lz4 *** Time for comp/decomp: 0.016/0.033 s. Compr ratio: 19.68 *** lz4hc *** Time for comp/decomp: 0.188/0.020 s. Compr ratio: 78.21 *** snappy *** Time for comp/decomp: 0.020/0.032 s. Compr ratio: 11.72 *** zlib *** Time for comp/decomp: 0.290/0.048 s. Compr ratio: 90.90 *** the random distribution *** *** blosclz *** Time for comp/decomp: 0.083/0.025 s. Compr ratio: 4.35 *** lz4 *** Time for comp/decomp: 0.022/0.034 s. Compr ratio: 4.65 *** lz4hc *** Time for comp/decomp: 1.803/0.039 s. Compr ratio: 5.61 *** snappy *** Time for comp/decomp: 0.028/0.023 s. Compr ratio: 4.48 *** zlib *** Time for comp/decomp: 3.146/0.073 s. Compr ratio: 6.17 That means that Blosc in combination with LZ4 can compress at speeds that can be up to 3x faster than a pure memcpy operation. Decompression is a bit slower (but still in the same order than memcpy()) probably because writing to memory is slower than reading. This was using an Intel Core i5-3380M CPU @ 2.90GHz, runnng Python 3.3 and Linux 3.7.10, but YMMV (and will vary!). For more info, you can have a look at the release notes in: https://github.com/ContinuumIO/python-blosc/wiki/Release-notes More docs and examples are available in the documentation site: http://blosc.pydata.org What is it? =========== python-blosc (http://blosc.pydata.org/) is a Python wrapper for the Blosc compression library. Blosc (http://blosc.org) is a high performance compressor optimized for binary data. It has been designed to transmit data to the processor cache faster than the traditional, non-compressed, direct memory fetch approach via a memcpy() OS call. Whether this is achieved or not depends of the data compressibility, the number of cores in the system, and other factors. See a series of benchmarks conducted for many different systems: http://blosc.org/trac/wiki/SyntheticBenchmarks. Blosc works well for compressing numerical arrays that contains data with relatively low entropy, like sparse data, time series, grids with regular-spaced values, etc. There is also a handy command line for Blosc called Bloscpack (https://github.com/esc/bloscpack) that allows you to compress large binary datafiles on-disk. 
Although the format for Bloscpack has not stabilized yet, it allows you to effectively use Blosc from your favorite shell. Installing ========== python-blosc is in PyPI repository, so installing it is easy: $ pip install -U blosc # yes, you should omit the python- prefix Download sources ================ The sources are managed through github services at: http://github.com/ContinuumIO/python-blosc Documentation ============= There is Sphinx-based documentation site at: http://blosc.pydata.org/ Mailing list ============ There is an official mailing list for Blosc at: blosc at googlegroups.com http://groups.google.es/group/blosc Licenses ======== Both Blosc and its Python wrapper are distributed using the MIT license. See: https://github.com/ContinuumIO/python-blosc/blob/master/LICENSES for more details. -- Francesc Alted Continuum Analytics, Inc. -- Francesc Alted From francesc at continuum.io Sun Jan 26 02:52:19 2014 From: francesc at continuum.io (Francesc Alted) Date: Sun, 26 Jan 2014 08:52:19 +0100 Subject: [Numpy-discussion] ANN: BLZ 0.6.1 has been released Message-ID: <52E4BEB3.5070003@continuum.io> Announcing BLZ 0.6 series ========================= What it is ---------- BLZ is a chunked container for numerical data. Chunking allows for efficient enlarging/shrinking of data container. In addition, it can also be compressed for reducing memory/disk needs. The compression process is carried out internally by Blosc, a high-performance compressor that is optimized for binary data. The main objects in BLZ are `barray` and `btable`. `barray` is meant for storing multidimensional homogeneous datasets efficiently. `barray` objects provide the foundations for building `btable` objects, where each column is made of a single `barray`. Facilities are provided for iterating, filtering and querying `btables` in an efficient way. You can find more info about `barray` and `btable` in the tutorial: http://blz.pydata.org/blz-manual/tutorial.html BLZ can use numexpr internally so as to accelerate many vector and query operations (although it can use pure NumPy for doing so too) either from memory or from disk. In the future, it is planned to use Numba as the computational kernel and to provide better Blaze (http://blaze.pydata.org) integration. What's new ---------- BLZ has been branched off from the Blaze project (http://blaze.pydata.org). BLZ was meant as a persistent format and library for I/O in Blaze. BLZ in Blaze is based on previous carray 0.5 and this is why this new version is labeled 0.6. BLZ supports completely transparent storage on-disk in addition to memory. That means that *everything* that can be done with the in-memory container can be done using the disk as well. The advantages of a disk-based container is that the addressable space is much larger than just your available memory. Also, as BLZ is based on a chunked and compressed data layout based on the super-fast Blosc compression library, the data access speed is very good. The format chosen for the persistence layer is based on the 'bloscpack' library and described in the "Persistent format for BLZ" chapter of the user manual ('docs/source/persistence-format.rst'). 
More about Bloscpack here: https://github.com/esc/bloscpack You may want to know more about BLZ in this blog entry: http://continuum.io/blog/blz-format In this version, support for Blosc 1.3 has been added, that meaning that a new `cname` parameter has been added to the `bparams` class, so that you can select you preferred compressor from 'blosclz', 'lz4', 'lz4hc', 'snappy' and 'zlib'. Also, many bugs have been fixed, providing a much smoother experience. CAVEAT: The BLZ/bloscpack format is still evolving, so don't trust on forward compatibility of the format, at least until 1.0, where the internal format will be declared frozen. Resources --------- Visit the main BLZ site repository at: http://github.com/ContinuumIO/blz Read the online docs at: http://blz.pydata.org/blz-manual/index.html Home of Blosc compressor: http://www.blosc.org User's mail list: blaze-dev at continuum.io ---- Enjoy! Francesc Alted Continuum Analytics, Inc. From chris.laumann at gmail.com Sun Jan 26 03:04:37 2014 From: chris.laumann at gmail.com (Chris Laumann) Date: Sun, 26 Jan 2014 00:04:37 -0800 Subject: [Numpy-discussion] Memory leak in numpy? Message-ID: Hi all- I think I just found a memory leak in numpy, or maybe I just don?t understand generators. Anyway, the following snippet will quickly eat a ton of RAM: P = randint(0,2, (20,13)) for i in range(50): ? ? for ai in ndindex((2,)*13): ? ? ? ? j = P.dot(ai) If you replace the last line with something like j = ai, the memory leak goes away. I?m not exactly sure what?s going on but the .dot seems to be causing the memory taken by the tuple ai to be held. This devours RAM in python 2.7.5 (OS X Mavericks default I believe), numpy version 1.8.0.dev-3084618. I?m upgrading to the latest Superpack (numpy 1.9) right now but I somehow doubt this behavior will change. Any thoughts? Best, Chris --? Chris Laumann Sent with Airmail -------------- next part -------------- An HTML attachment was scrubbed... URL: From hoogendoorn.eelco at gmail.com Sun Jan 26 06:20:04 2014 From: hoogendoorn.eelco at gmail.com (Eelco Hoogendoorn) Date: Sun, 26 Jan 2014 12:20:04 +0100 Subject: [Numpy-discussion] Numpy Enhancement Proposal: group_by functionality Message-ID: Hi all, Please critique my draft exploring the possibilities of adding group_by support to numpy: http://pastebin.com/c5WLWPbp In nearly ever project I work on, I require group_by functionality of some sort. There are other libraries that provide this kind of functionality, such as pandas for instance, but I will try to make the case here that numpy ought to have a solid core of group_by functionality. Primarily, one may argue that the concept of grouping values by a key is far more general than a pandas dataframe. In particular, one often needs a simple one-line transient association between some keys and values, and trying to wrangle your problem into the more permanent and specialized datastructure that a dataframe is, is simply not called for. As a simple compact example: key1 = list('abaabb') key2 = np.random.randint(0,2,(6,2)) values = np.random.rand(6,3) print group_by((key1, key2)).median(values) Points of note; we can group by arbitrary combinations of keys, and subarrays can also act as keys. group_by has a rich set of reduction functionality, which performs efficient per-group reductions, as well as various ways to split your values per group. Also, the code here has a lot of overlap with np.unique and related arraysetops. 
functions like np.unique are easily reimplemented using the groundwork laid out here, and also may be extended to benefit from the generalizations made, allowing for a wider variety of objects to have their unique values taken; note the axis keyword here, meaning that what is unique here are the images found along the first axis; not the elements of shuffled. #create a stack of images images = np.random.rand(4,64,64) #shuffle the images; this is a giant mess now; how to find all the original ones? shuffled = images[np.random.randint(0,4,200)] #there you go print unique(shuffled, axis=0) Some more examples and unit tests can be found at the end of the module. Id love to hear your feedback on this. Specifically: - Do you agree numpy would benefit from group_by functionality? - Do you have suggestions for further generalizations/extensions? - Any commentary on design decisions / implementation? Regards, Eelco Hoogendoorn -------------- next part -------------- An HTML attachment was scrubbed... URL: From dineshbvadhia at hotmail.com Sun Jan 26 07:44:31 2014 From: dineshbvadhia at hotmail.com (Dinesh Vadhia) Date: Sun, 26 Jan 2014 04:44:31 -0800 Subject: [Numpy-discussion] MKL and OpenBLAS Message-ID: This conversation gets discussed often with Numpy developers but since the requirement for optimized Blas is pretty common these days, how about distributing Numpy with OpenBlas by default? People who don't want optimized BLAS or OpenBLAS can then edit the site.cfg file to add/remove. I can never remember if Numpy comes with Atlas by default but either way, if using MKL is not feasible because of its licensing issues then Numpy has to be re-compiled with OpenBLAS (for example). Why not make it easier for developers to use Numpy with an in-built optimized Blas. Btw, just in case some folks from Intel are listening: how about releasing MKL binaries for all platforms for developers to do with it what they want ie. free. You know it makes sense! -------------- next part -------------- An HTML attachment was scrubbed... URL: From dineshbvadhia at hotmail.com Sun Jan 26 07:54:14 2014 From: dineshbvadhia at hotmail.com (Dinesh Vadhia) Date: Sun, 26 Jan 2014 04:54:14 -0800 Subject: [Numpy-discussion] ANN: numexpr 2.3 (final) released In-Reply-To: <52E4BB94.80300@continuum.io> References: <52E4BB94.80300@continuum.io> Message-ID: Francesc Congratulations and will definitely be benchmarking Numexpr soon. Will similar performance improvements been seen with OpenBLAS as with MKL? From dineshbvadhia at hotmail.com Sun Jan 26 08:14:37 2014 From: dineshbvadhia at hotmail.com (Dinesh Vadhia) Date: Sun, 26 Jan 2014 05:14:37 -0800 Subject: [Numpy-discussion] ANN: BLZ 0.6.1 has been released In-Reply-To: <52E4BEB3.5070003@continuum.io> References: <52E4BEB3.5070003@continuum.io> Message-ID: For me, "binary data" wrt arrays means that data values are [0|1]. Is this what is meant in "The compression process is carried out internally by Blosc, a high-performance compressor that is optimized for binary data." ? From pav at iki.fi Sun Jan 26 09:40:44 2014 From: pav at iki.fi (Pauli Virtanen) Date: Sun, 26 Jan 2014 16:40:44 +0200 Subject: [Numpy-discussion] MKL and OpenBLAS In-Reply-To: References: Message-ID: 26.01.2014 14:44, Dinesh Vadhia kirjoitti: > This conversation gets discussed often with Numpy developers but > since the requirement for optimized Blas is pretty common these > days, how about distributing Numpy with OpenBlas by default? 
People > who don't want optimized BLAS or OpenBLAS can then edit the > site.cfg file to add/remove. I can never remember if Numpy comes > with Atlas by default but either way, if using MKL is not feasible > because of its licensing issues then Numpy has to be re-compiled > with OpenBLAS (for example). Why not make it easier for developers > to use Numpy with an in-built optimized Blas. The Numpy Windows binaries distributed in the numpy project at sourceforge.net are compiled with ATLAS, which should count as an optimized BLAS. I don't recall what's the situation with OSX binaries, but I'd believe they're with Atlas too. If you are suggesting bundling OpenBLAS with Numpy source releases --- arguments against: OpenBLAS is big, and still rapidly moving. Moreover, bundling it with Numpy does not really make it any easier to build. -- Pauli Virtanen From valentin at haenel.co Sun Jan 26 10:44:41 2014 From: valentin at haenel.co (Valentin Haenel) Date: Sun, 26 Jan 2014 16:44:41 +0100 Subject: [Numpy-discussion] ANN: BLZ 0.6.1 has been released In-Reply-To: References: <52E4BEB3.5070003@continuum.io> Message-ID: <20140126154441.GA23374@kudu.in-berlin.de> Hi Dinesh Vadhia, * Dinesh Vadhia [2014-01-26]: > For me, "binary data" wrt arrays means that data values are [0|1]. Is this > what is meant in "The compression process is carried out internally by > Blosc, a high-performance compressor that is optimized for binary data." ? I believe, the term 'binary data' in this context refers to numerical data -- e.g. floats and ints -- in the sense that it is not ascii or other text. Blosc is especially well suited for this kind of data due to its optional shuffle filter. This filter will re-organize the bytes in the data that is to be compressed in order of significance. For this filter to work, each data value must be composed of multiple bytes, e.g. int64. For data values that are composed of a single byte, e.g. int8 or char, the filter does not work so well. Hope that helps, V- From stefan at sun.ac.za Sun Jan 26 12:02:40 2014 From: stefan at sun.ac.za (=?iso-8859-1?Q?St=E9fan?= van der Walt) Date: Sun, 26 Jan 2014 18:02:40 +0100 Subject: [Numpy-discussion] Numpy Enhancement Proposal: group_by functionality In-Reply-To: References: Message-ID: <20140126170240.GA4256@shinobi> Hi Eelco On Sun, 26 Jan 2014 12:20:04 +0100, Eelco Hoogendoorn wrote: > key1 = list('abaabb') > key2 = np.random.randint(0,2,(6,2)) > values = np.random.rand(6,3) > print group_by((key1, key2)).median(values) I agree that group_by functionality could be handy in numpy. In the above example, what would the output of ``group_by((key1, key2))`` be? St?fan From stefan at sun.ac.za Sun Jan 26 12:06:40 2014 From: stefan at sun.ac.za (=?iso-8859-1?Q?St=E9fan?= van der Walt) Date: Sun, 26 Jan 2014 18:06:40 +0100 Subject: [Numpy-discussion] MKL and OpenBLAS In-Reply-To: References: Message-ID: <20140126170640.GB4256@shinobi> On Sun, 26 Jan 2014 16:40:44 +0200, Pauli Virtanen wrote: > The Numpy Windows binaries distributed in the numpy project at > sourceforge.net are compiled with ATLAS, which should count as an > optimized BLAS. I don't recall what's the situation with OSX binaries, > but I'd believe they're with Atlas too. Was a switch made away from Accelerate after this? 
http://mail.scipy.org/pipermail/numpy-discussion/2012-August/063589.html St?fan From hoogendoorn.eelco at gmail.com Sun Jan 26 12:36:21 2014 From: hoogendoorn.eelco at gmail.com (Eelco Hoogendoorn) Date: Sun, 26 Jan 2014 18:36:21 +0100 Subject: [Numpy-discussion] Numpy Enhancement Proposal: group_by functionality In-Reply-To: <20140126170240.GA4256@shinobi> References: <20140126170240.GA4256@shinobi> Message-ID: An object of type GroupBy. So a call to group_by does not return any consumable output directly. If you want for instance the unique keys, or groups if you will, you can call GroupBy.unique. In this case, for a tuple of input keys, youd get a tuple of unique keys back. If you want to compute several reductions over the same set of keys, you can hang on to the GroupBy object, and the precomputations it encapsulates. To expand on that example: reduction operations also return the unique keys which the reduced elements belong to: (unique1, unique2), median = group_by((key1, key2)).median(values) print unique1 print unique2 print median yields something like ['a' 'a' 'b' 'b' 'a'] [[0 0] [0 1] [0 1] [1 0] [1 1]] [[ 0.34041782 0.78579254 0.91494441] [ 0.59422888 0.67915262 0.04327812] [ 0.45045529 0.45049761 0.49633574] [ 0.71623235 0.95760152 0.85137696] [ 0.96299801 0.27639574 0.70519413]] Note that the elements of unique1 and unique2 are not themselves unique, but rather their elements zipped together are unique. On Sun, Jan 26, 2014 at 6:02 PM, St?fan van der Walt wrote: > Hi Eelco > > On Sun, 26 Jan 2014 12:20:04 +0100, Eelco Hoogendoorn wrote: > > key1 = list('abaabb') > > key2 = np.random.randint(0,2,(6,2)) > > values = np.random.rand(6,3) > > print group_by((key1, key2)).median(values) > > I agree that group_by functionality could be handy in numpy. > In the above example, what would the output of > > ``group_by((key1, key2))`` > > be? > > St?fan > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From hoogendoorn.eelco at gmail.com Sun Jan 26 12:50:04 2014 From: hoogendoorn.eelco at gmail.com (Eelco Hoogendoorn) Date: Sun, 26 Jan 2014 18:50:04 +0100 Subject: [Numpy-discussion] Numpy Enhancement Proposal: group_by functionality In-Reply-To: <20140126170240.GA4256@shinobi> References: <20140126170240.GA4256@shinobi> Message-ID: To follow up with an example as to why it is useful that a temporary object is created, consider the following (taken from the radial reduction example): g = group_by(np.round(radius, 5).flatten()) pp.errorbar( g.unique, g.mean(sample.flatten())[1], g.std(sample.flatten())[1] / np.sqrt(g.count)) Creating the GroupBy object encapsulates the expense of 'indexing' the keys, which is the most expensive part of these operations. We would have to redo that four times here, if we didn't have access to the GroupBy object. >From looking at the numpy source, I get the impression that it is considered good practice not to overuse OOP. And I agree, but I think it is called for here. On Sun, Jan 26, 2014 at 6:02 PM, St?fan van der Walt wrote: > Hi Eelco > > On Sun, 26 Jan 2014 12:20:04 +0100, Eelco Hoogendoorn wrote: > > key1 = list('abaabb') > > key2 = np.random.randint(0,2,(6,2)) > > values = np.random.rand(6,3) > > print group_by((key1, key2)).median(values) > > I agree that group_by functionality could be handy in numpy. 
> In the above example, what would the output of > > ``group_by((key1, key2))`` > > be? > > St?fan > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From alan.isaac at gmail.com Sun Jan 26 12:57:11 2014 From: alan.isaac at gmail.com (Alan G Isaac) Date: Sun, 26 Jan 2014 12:57:11 -0500 Subject: [Numpy-discussion] Numpy Enhancement Proposal: group_by functionality In-Reply-To: <20140126170240.GA4256@shinobi> References: <20140126170240.GA4256@shinobi> Message-ID: <52E54C77.80601@gmail.com> On 1/26/2014 12:02 PM, St?fan van der Walt wrote: > what would the output of > > ``group_by((key1, key2))`` I'd expect something named "groupby" to behave as below. Alan def groupby(seq, key): from collections import defaultdict groups = defaultdict(list) for item in seq: groups[key(item)].append(item) return groups print groupby(range(20), lambda x: x%2) From jtaylor.debian at googlemail.com Sun Jan 26 13:13:57 2014 From: jtaylor.debian at googlemail.com (Julian Taylor) Date: Sun, 26 Jan 2014 19:13:57 +0100 Subject: [Numpy-discussion] MKL and OpenBLAS In-Reply-To: <20140126170640.GB4256@shinobi> References: <20140126170640.GB4256@shinobi> Message-ID: <52E55065.5020002@googlemail.com> On 26.01.2014 18:06, St?fan van der Walt wrote: > On Sun, 26 Jan 2014 16:40:44 +0200, Pauli Virtanen wrote: >> The Numpy Windows binaries distributed in the numpy project at >> sourceforge.net are compiled with ATLAS, which should count as an >> optimized BLAS. I don't recall what's the situation with OSX binaries, >> but I'd believe they're with Atlas too. > > Was a switch made away from Accelerate after this? > > http://mail.scipy.org/pipermail/numpy-discussion/2012-August/063589.html > if this issue disqualifies accelerate, it also disqualifies openblas as a default. openblas has the same issue, we stuck a big fat warning into the docs (site.cfg) for this now as people keep running into it. openblas is also a little dodgy concerning stability, in the past it crashed constantly on pretty standard problems, like dgemm on data > 64 mb etc. While the stability has improved with latest releases (>= 0.2.9) I think its still too early to consider openblas for a default. multithreaded ATLAS on the other hand seems works fine, at least I have not seen any similar issues with ATLAS in a very long time. Building optimized ATLAS is also a breeze on Debian based systems (see the README.Debian file) but I admit it is hard on any other platform. From hoogendoorn.eelco at gmail.com Sun Jan 26 14:01:01 2014 From: hoogendoorn.eelco at gmail.com (Eelco Hoogendoorn) Date: Sun, 26 Jan 2014 20:01:01 +0100 Subject: [Numpy-discussion] Numpy Enhancement Proposal: group_by functionality In-Reply-To: <52E54C77.80601@gmail.com> References: <20140126170240.GA4256@shinobi> <52E54C77.80601@gmail.com> Message-ID: Alan: The equivalent of that in my current draft would be group_by(keys, values), which is shorthand for group_by(keys).group(values); a optional values argument to the constructor of GroupBy is directly bound to return an iterable over the grouped values; but we often want to bind different value objects, with different operations, for the same set of keys, so it is convenient to be able to delay the binding of the values argument. Also, the third argument to group_by is an optional reduction function. 
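For a flavor of the machinery underneath (not the draft's actual code, just a minimal sketch built from existing numpy primitives): the expensive part is the indexing of the keys, essentially np.unique with return_inverse, after which per-group reductions are cheap:

import numpy as np

keys = np.array(list('abaabb'))
values = np.random.rand(6)

unique_keys, inverse = np.unique(keys, return_inverse=True)  # the expensive 'indexing' step
group_sums = np.bincount(inverse, weights=values)            # per-group reduction
group_means = group_sums / np.bincount(inverse)

The GroupBy object simply computes unique_keys/inverse once and reuses them for any number of such reductions.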
On Sun, Jan 26, 2014 at 6:57 PM, Alan G Isaac wrote: > On 1/26/2014 12:02 PM, St?fan van der Walt wrote: > > what would the output of > > > > ``group_by((key1, key2))`` > > > I'd expect something named "groupby" to behave as below. > Alan > > def groupby(seq, key): > from collections import defaultdict > groups = defaultdict(list) > for item in seq: > groups[key(item)].append(item) > return groups > > print groupby(range(20), lambda x: x%2) > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From alan.isaac at gmail.com Sun Jan 26 14:44:44 2014 From: alan.isaac at gmail.com (Alan G Isaac) Date: Sun, 26 Jan 2014 14:44:44 -0500 Subject: [Numpy-discussion] Numpy Enhancement Proposal: group_by functionality In-Reply-To: References: <20140126170240.GA4256@shinobi> <52E54C77.80601@gmail.com> Message-ID: <52E565AC.1040503@gmail.com> My comment is just on the name. I'd expect something named `groupby` to behave essentially like Mathematica's `GatherBy` command. http://reference.wolfram.com/mathematica/ref/GatherBy.html I think you are after something more like Matlab's grpstats: http://www.mathworks.com/help/stats/grpstats.html Perhaps the implicit reference to SQL justifies the name... Sorry if this seems off topic, Alan Isaac From hoogendoorn.eelco at gmail.com Sun Jan 26 15:16:47 2014 From: hoogendoorn.eelco at gmail.com (Eelco Hoogendoorn) Date: Sun, 26 Jan 2014 21:16:47 +0100 Subject: [Numpy-discussion] Numpy Enhancement Proposal: group_by functionality In-Reply-To: <52E565AC.1040503@gmail.com> References: <20140126170240.GA4256@shinobi> <52E54C77.80601@gmail.com> <52E565AC.1040503@gmail.com> Message-ID: not off topic at all; there are several matters of naming that I am not at all settled on yet, and I don't think it is unimportant. indeed, those are closely related functions, and I wasn't aware of them yet, so that's some welcome additional perspective. The mathematica function differs in that the keys are always function of the values; as per your example as well. My proposed interface does not have that constraint, but that behavior is of course easily obtained by something like group_by(mapping(values), values). indeed grpstats also has a lot of overlap, though it does not have the same generality as my proposal. its interesting to wonder where one gets ones ideas as to how to call what. ive never worked with SQL much; I suppose I picked up this naming by working with LINQ. I rather like group_by; it is more suitable to the generality of the operations supported by the group_by object than something like grpstats. The majority of my applications for grouping have nothing whatsoever to do with statistics. On Sun, Jan 26, 2014 at 8:44 PM, Alan G Isaac wrote: > My comment is just on the name. > I'd expect something named `groupby` > to behave essentially like Mathematica's `GatherBy` command. > http://reference.wolfram.com/mathematica/ref/GatherBy.html > > I think you are after something more like Matlab's grpstats: > http://www.mathworks.com/help/stats/grpstats.html > > Perhaps the implicit reference to SQL justifies the name... 
> > Sorry if this seems off topic, > Alan Isaac > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From sturla.molden at gmail.com Sun Jan 26 16:33:04 2014 From: sturla.molden at gmail.com (Sturla Molden) Date: Sun, 26 Jan 2014 21:33:04 +0000 (UTC) Subject: [Numpy-discussion] MKL and OpenBLAS References: <20140126170640.GB4256@shinobi> <52E55065.5020002@googlemail.com> Message-ID: <566884211412464364.536717sturla.molden-gmail.com@news.gmane.org> Julian Taylor wrote: > if this issue disqualifies accelerate, it also disqualifies openblas as > a default. openblas has the same issue, we stuck a big fat warning into > the docs (site.cfg) for this now as people keep running into it. What? Last time I checked, OpenBLAS (and GotoBLAS2) used OpenMP, not the GCD on Mac. Since OpenMP compiles to pthreads, it should not do this (pure POSIX). Accelerate uses the GCD yes, but it's hardly any better than ATLAS. If OpenBLAS now uses the GCD on Mac someone in China should be flogged. It is sad to hear about stability issues with OpenBLAS, it's predecessor GotoBLAS2 was rock solid. Sturla From jtaylor.debian at googlemail.com Sun Jan 26 17:13:17 2014 From: jtaylor.debian at googlemail.com (Julian Taylor) Date: Sun, 26 Jan 2014 23:13:17 +0100 Subject: [Numpy-discussion] MKL and OpenBLAS In-Reply-To: <566884211412464364.536717sturla.molden-gmail.com@news.gmane.org> References: <20140126170640.GB4256@shinobi> <52E55065.5020002@googlemail.com> <566884211412464364.536717sturla.molden-gmail.com@news.gmane.org> Message-ID: <52E5887D.3070805@googlemail.com> On 26.01.2014 22:33, Sturla Molden wrote: > Julian Taylor wrote: > >> if this issue disqualifies accelerate, it also disqualifies openblas as >> a default. openblas has the same issue, we stuck a big fat warning into >> the docs (site.cfg) for this now as people keep running into it. > > What? Last time I checked, OpenBLAS (and GotoBLAS2) used OpenMP, not the > GCD on Mac. Since OpenMP compiles to pthreads, it should not do this (pure > POSIX). Accelerate uses the GCD yes, but it's hardly any better than ATLAS. > If OpenBLAS now uses the GCD on Mac someone in China should be flogged. the use of gnu openmp is probably be the problem, forking and gomp is only possible in very limited circumstances. see e.g. https://github.com/xianyi/OpenBLAS/issues/294 maybe it will work with clangs intel based openmp which should be coming soon. the current workaround is single threaded openblas, python3.4 forkserver or use atlas. From sturla.molden at gmail.com Sun Jan 26 18:01:52 2014 From: sturla.molden at gmail.com (Sturla Molden) Date: Sun, 26 Jan 2014 23:01:52 +0000 (UTC) Subject: [Numpy-discussion] MKL and OpenBLAS References: <20140126170640.GB4256@shinobi> <52E55065.5020002@googlemail.com> <566884211412464364.536717sturla.molden-gmail.com@news.gmane.org> <52E5887D.3070805@googlemail.com> Message-ID: <27092663412469230.633377sturla.molden-gmail.com@news.gmane.org> Julian Taylor wrote: > the use of gnu openmp is probably be the problem, forking and gomp is > only possible in very limited circumstances. > see e.g. https://github.com/xianyi/OpenBLAS/issues/294 > > maybe it will work with clangs intel based openmp which should be coming > soon. > the current workaround is single threaded openblas, python3.4 forkserver > or use atlas. 
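A sketch of what those two Python-level workarounds look like in practice; the OPENBLAS_NUM_THREADS variable applies to OpenBLAS builds, and the forkserver start method requires Python 3.4 on a Unix platform (both are assumptions about the reader's setup rather than NumPy requirements):

import os
# Workaround 1: force single-threaded OpenBLAS before NumPy is imported.
os.environ["OPENBLAS_NUM_THREADS"] = "1"

import multiprocessing as mp
import numpy as np

def work(seed):
    rng = np.random.RandomState(seed)
    a = rng.rand(200, 200)
    return np.dot(a, a.T).trace()

if __name__ == "__main__":
    # Workaround 2 (Python 3.4+, Unix only): the forkserver start method
    # avoids forking workers from a process that already runs OpenMP threads.
    if hasattr(mp, "set_start_method"):
        mp.set_start_method("forkserver")
    pool = mp.Pool(2)
    print(pool.map(work, range(4)))
    pool.close()
    pool.join()
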
Yes, it seems to be a GNU problem: http://bisqwit.iki.fi/story/howto/openmp/#OpenmpAndFork This Howto also claims Intel compilers is not affected. :) Sturla From dineshbvadhia at hotmail.com Mon Jan 27 04:18:20 2014 From: dineshbvadhia at hotmail.com (Dinesh Vadhia) Date: Mon, 27 Jan 2014 01:18:20 -0800 Subject: [Numpy-discussion] ANN: numexpr 2.3 (final) released In-Reply-To: <52E4BB94.80300@continuum.io> References: <52E4BB94.80300@continuum.io> Message-ID: Francesc: Does numexpr support scipy sparse matrices? From cmkleffner at gmail.com Mon Jan 27 06:01:46 2014 From: cmkleffner at gmail.com (Carl Kleffner) Date: Mon, 27 Jan 2014 12:01:46 +0100 Subject: [Numpy-discussion] MKL and OpenBLAS In-Reply-To: References: Message-ID: Did you consider to check the experimental binaries on https://code.google.com/p/mingw-w64-static/ for Python-2.7? These binaries has been build with with a customized mingw-w64 toolchain. These builds are fully statically build and are link against the MSVC90 runtime libraries (gcc runtime is linked statically) and OpenBLAS. Carl 2014-01-26 Pauli Virtanen > 26.01.2014 14:44, Dinesh Vadhia kirjoitti: > > This conversation gets discussed often with Numpy developers but > > since the requirement for optimized Blas is pretty common these > > days, how about distributing Numpy with OpenBlas by default? People > > who don't want optimized BLAS or OpenBLAS can then edit the > > site.cfg file to add/remove. I can never remember if Numpy comes > > with Atlas by default but either way, if using MKL is not feasible > > because of its licensing issues then Numpy has to be re-compiled > > with OpenBLAS (for example). Why not make it easier for developers > > to use Numpy with an in-built optimized Blas. > > The Numpy Windows binaries distributed in the numpy project at > sourceforge.net are compiled with ATLAS, which should count as an > optimized BLAS. I don't recall what's the situation with OSX binaries, > but I'd believe they're with Atlas too. > > If you are suggesting bundling OpenBLAS with Numpy source releases --- > arguments against: > > OpenBLAS is big, and still rapidly moving. Moreover, bundling it with > Numpy does not really make it any easier to build. > > -- > Pauli Virtanen > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From francesc at continuum.io Mon Jan 27 08:06:14 2014 From: francesc at continuum.io (Francesc Alted) Date: Mon, 27 Jan 2014 14:06:14 +0100 Subject: [Numpy-discussion] ANN: numexpr 2.3 (final) released In-Reply-To: References: <52E4BB94.80300@continuum.io> Message-ID: <52E659C6.3050706@continuum.io> Not really. numexpr is mostly about element-wise operations in dense matrices. You should look to another package for that. Francesc On 1/27/14, 10:18 AM, Dinesh Vadhia wrote: > Francesc: Does numexpr support scipy sparse matrices? 
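In practice, numexpr's element-wise model can still be pressed into service for sparse data by operating on the matrix's .data array; this is a sketch (assuming numexpr and scipy are installed), not an official numexpr feature, and it is only equivalent to the dense operation for functions that map zero to zero:

import numpy as np
import numexpr as ne
import scipy.sparse as sp

# numexpr's territory: fast element-wise expressions on dense arrays.
a = np.random.rand(10000)
b = np.random.rand(10000)
dense = ne.evaluate("2*a + 3*b")

# Sparse workaround: apply the expression to the stored nonzeros only.
# log1p maps 0 to 0, so this matches the dense result; 2*x + 3 would not.
m = sp.rand(1000, 1000, density=0.01, format="csr")
m.data = ne.evaluate("log1p(d)", local_dict={"d": m.data})
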
> > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion -- Francesc Alted From ndbecker2 at gmail.com Mon Jan 27 08:28:23 2014 From: ndbecker2 at gmail.com (Neal Becker) Date: Mon, 27 Jan 2014 08:28:23 -0500 Subject: [Numpy-discussion] another interesting high performance vector lib (yeppp) Message-ID: http://www.yeppp.info/ From cmkleffner at gmail.com Mon Jan 27 08:57:44 2014 From: cmkleffner at gmail.com (Carl Kleffner) Date: Mon, 27 Jan 2014 14:57:44 +0100 Subject: [Numpy-discussion] another interesting high performance vector lib (yeppp) In-Reply-To: References: Message-ID: a similar SIMD based library for transcendental function ist SLEEF http://shibatch.sourceforge.net/ . An inclompete wrapper can be found here: https://github.com/nikolaynag/avxmath I suppose that Intels VML has a better coverage over YEPPP or SLEEF. Carl 2014-01-27 Neal Becker > http://www.yeppp.info/ > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Mon Jan 27 09:10:45 2014 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Mon, 27 Jan 2014 15:10:45 +0100 Subject: [Numpy-discussion] MKL and OpenBLAS In-Reply-To: <20140126170640.GB4256@shinobi> References: <20140126170640.GB4256@shinobi> Message-ID: On Sun, Jan 26, 2014 at 6:06 PM, St?fan van der Walt wrote: > On Sun, 26 Jan 2014 16:40:44 +0200, Pauli Virtanen wrote: > > The Numpy Windows binaries distributed in the numpy project at > > sourceforge.net are compiled with ATLAS, which should count as an > > optimized BLAS. I don't recall what's the situation with OSX binaries, > > but I'd believe they're with Atlas too. > > Was a switch made away from Accelerate after this? > http://mail.scipy.org/pipermail/numpy-discussion/2012-August/063589.html > No, nothing changed. Still using Accelerate for all official binaries. Ralf > > St?fan > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From jtaylor.debian at googlemail.com Mon Jan 27 15:04:30 2014 From: jtaylor.debian at googlemail.com (Julian Taylor) Date: Mon, 27 Jan 2014 21:04:30 +0100 Subject: [Numpy-discussion] windows and C99 math Message-ID: <52E6BBCE.8070304@googlemail.com> hi, numpys no-C99 fallback keeps turning up issues in corner cases, e.g. hypot https://github.com/numpy/numpy/issues/2385 log1p https://github.com/numpy/numpy/issues/4225 these only seem to happen on windows, on linux and mac it seems to use the C99 math library just fine. Are our binary builds for windows not correct or does windows just not support C99 math? Hopefully it is the former. Any insight is appreciated (and patches to fix the build even more!) Cheers, Julian From nouiz at nouiz.org Mon Jan 27 15:23:46 2014 From: nouiz at nouiz.org (=?ISO-8859-1?Q?Fr=E9d=E9ric_Bastien?=) Date: Mon, 27 Jan 2014 15:23:46 -0500 Subject: [Numpy-discussion] windows and C99 math In-Reply-To: <52E6BBCE.8070304@googlemail.com> References: <52E6BBCE.8070304@googlemail.com> Message-ID: Just a guess as I don't make those binaries, but I think they are done with Visual Studio and it only support C89... 
We need to back port some of our c code for windows for GPU as nvcc use VS and it don't support C99. Fred On Mon, Jan 27, 2014 at 3:04 PM, Julian Taylor wrote: > hi, > numpys no-C99 fallback keeps turning up issues in corner cases, e.g. > hypot https://github.com/numpy/numpy/issues/2385 > log1p https://github.com/numpy/numpy/issues/4225 > > these only seem to happen on windows, on linux and mac it seems to use > the C99 math library just fine. > > Are our binary builds for windows not correct or does windows just not > support C99 math? > > Hopefully it is the former. Any insight is appreciated (and patches to > fix the build even more!) > > Cheers, > Julian > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From charles at crunch.io Mon Jan 27 15:43:30 2014 From: charles at crunch.io (Charles G. Waldman) Date: Mon, 27 Jan 2014 14:43:30 -0600 Subject: [Numpy-discussion] bug in comparing object arrays to None (?) Message-ID: Hi Numpy folks. I just noticed that comparing an array of type 'object' to None does not behave as I expected. Is this a feature or a bug? (I can take a stab at fixing it if it's a bug, as I believe it is). >>> np.version.full_version '1.8.0' >>> a = np.array(['Frank', None, 'Nancy']) >>> a array(['Frank', None, 'Nancy'], dtype=object) >>> a == 'Frank' array([ True, False, False], dtype=bool) # Return value is an array >>> a == None False # Return value is scalar (BUG?) From warren.weckesser at gmail.com Mon Jan 27 15:51:44 2014 From: warren.weckesser at gmail.com (Warren Weckesser) Date: Mon, 27 Jan 2014 15:51:44 -0500 Subject: [Numpy-discussion] bug in comparing object arrays to None (?) In-Reply-To: References: Message-ID: On Mon, Jan 27, 2014 at 3:43 PM, Charles G. Waldman wrote: > Hi Numpy folks. > > I just noticed that comparing an array of type 'object' to None does > not behave as I expected. Is this a feature or a bug? (I can take a > stab at fixing it if it's a bug, as I believe it is). > > >>> np.version.full_version > '1.8.0' > > >>> a = np.array(['Frank', None, 'Nancy']) > > >>> a > array(['Frank', None, 'Nancy'], dtype=object) > > >>> a == 'Frank' > array([ True, False, False], dtype=bool) > # Return value is an array > > >>> a == None > False > # Return value is scalar (BUG?) > Looks like a fix is in progress: https://github.com/numpy/numpy/pull/3514 Warren _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From sturla.molden at gmail.com Mon Jan 27 18:04:13 2014 From: sturla.molden at gmail.com (Sturla Molden) Date: Mon, 27 Jan 2014 23:04:13 +0000 (UTC) Subject: [Numpy-discussion] windows and C99 math References: <52E6BBCE.8070304@googlemail.com> Message-ID: <2030236615412555553.547415sturla.molden-gmail.com@news.gmane.org> Julian Taylor wrote: > Are our binary builds for windows not correct or does windows just not > support C99 math? Microsoft's C compiler does not support C99. It is not an OS issue. Use gcc, clang or Intel icc instead, and C99 is supported. 
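For readers wondering why the C99 functions (and NumPy's fallbacks for them) matter in the first place, a quick numerical illustration; this is standard double-precision behaviour, not specific to any compiler:

import numpy as np

x = 1e-12
print(np.log1p(x))       # ~1e-12, accurate near zero
print(np.log(1.0 + x))   # loses most of the significant digits

a = b = np.float64(1e200)
print(np.hypot(a, b))          # ~1.414e+200, no intermediate overflow
print(np.sqrt(a**2 + b**2))    # a**2 overflows, result is inf
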
Sturla From cournape at gmail.com Mon Jan 27 18:12:55 2014 From: cournape at gmail.com (David Cournapeau) Date: Mon, 27 Jan 2014 23:12:55 +0000 Subject: [Numpy-discussion] windows and C99 math In-Reply-To: <52E6BBCE.8070304@googlemail.com> References: <52E6BBCE.8070304@googlemail.com> Message-ID: On Mon, Jan 27, 2014 at 8:04 PM, Julian Taylor < jtaylor.debian at googlemail.com> wrote: > hi, > numpys no-C99 fallback keeps turning up issues in corner cases, e.g. > hypot https://github.com/numpy/numpy/issues/2385 > log1p https://github.com/numpy/numpy/issues/4225 > > these only seem to happen on windows, on linux and mac it seems to use > the C99 math library just fine. > > Are our binary builds for windows not correct or does windows just not > support C99 math? > Ms compilers have not supported much of C > C89. Up to recently, they even said publicly something close to "we don't care about C, C is legacy and you should use C++": http://herbsutter.com/2012/05/03/reader-qa-what-about-vc-and-c99/ But it looks like they are finally changing their stance: http://blogs.msdn.com/b/vcblog/archive/2013/07/19/c99-library-support-in-visual-studio-2013.aspx Of course, it will be a while before we can rely on this, but hey, it only took them 14 years ! David > > Hopefully it is the former. Any insight is appreciated (and patches to > fix the build even more!) > > Cheers, > Julian > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From JMcGlinchy at esri.com Tue Jan 28 13:44:51 2014 From: JMcGlinchy at esri.com (Joseph McGlinchy) Date: Tue, 28 Jan 2014 18:44:51 +0000 Subject: [Numpy-discussion] scipy image processing memory leak in python 2.7? Message-ID: <6D6DE7712442DD4CAEFE7FB9D86210D222604449@RED-INF-EXMB-P1.esri.com> Hi numpy list! I am trying to do some image processing on a number of images, 72 to be specific. I am seeing the python memory usage continually increase. I will say this is being done also using arcpy so that COULD be an issue, but want to rule out scipy first since I am only using arcpy to grab a set of polygons and convert them to a binary image, and to save images to disk. I pass the raster to my numpy/scipy processing. The code is organized as: 1) Main loop iterating over features 2) Function which does image processing using scipy 3) Class which implements some of the image processing features I placed 2 sleep calls in the code.. one at the end of the image processing function and one at the end of each iteration in the main loop. I am seeing a 20MB memory increase each iteration while looking at my processes in Task Manager: Iteration 1: 204,312K and 209,908K, respectively Iteration 2: 233,728K and 230,192K, respectively Iteration 3: 255,676K and 250,600K, respectively Any ideas? Thanks so much! 
Joe 1) Main loop #for each feature class, convert to raster by "Value" and reclass to 1 and 0 temp_raster = os.path.join(scratchWorkspace,"temp_1.tif") temp_raster_elev = os.path.join(scratchWorkspace,"temp_elev.tif") temp_reclass = os.path.join(scratchWorkspace,"temp_reclass.tif") remap = arcpy.sa.RemapValue([[1,1], ["NoData",0]]) for i,fc in enumerate(potWaterFeatNames): #try to delete the temp rasters in case they exist try: #delete temp rasters arcpy.Delete_management(temp_raster) arcpy.Delete_management(temp_raster_elev) arcpy.Delete_management(temp_reclass) except e: arcpy.AddMessage(e.message()) #clear in memory workspace" arcpy.Delete_management("in_memory/temp_table") #print "on feature {} of {}".format(i+1, len(potWaterFeatNames)) arcpy.AddMessage("on feature {} of {}".format(i+1, len(potWaterFeatNames))) arcpy.AddMessage("fc = {}".format(fc)) arcpy.PolygonToRaster_conversion(fc, "Value", temp_raster, "", "", 1) outReclass = arcpy.sa.Reclassify(temp_raster,"Value",remap) outReclass.save(temp_reclass) del outReclass #call function to process connected components try: proc_im = processBinaryImage(temp_reclass, scratchWorkspace, sm_pixels) except Exception as e: print "FAILED! scipy image processing" arcpy.AddMessage(e.message()) #convert processed raster to polygons try: out_fc = os.path.join(outWorkspace,fc + "_cleaned") arcpy.RasterToPolygon_conversion(proc_im, out_fc) arcpy.Delete_management(proc_im) except Exception as e: print "FAILED! converting cleaned binary raster to polygon" arcpy.AddMessage(e.message()) #delete zero value polyons, gridcode = 0 try: uc = arcpy.UpdateCursor(out_fc, "gridcode=0",arcpy.Describe(out_fc).spatialReference) #loop through 0 rows and delete for row in uc: uc.deleteRow(row) del uc except Exception as e: print "FAILED! deleting rows from table" arcpy.AddMessage(e.message()) #check that number of polygons with gridcode = 1 is greater than 0 count = arcpy.GetCount_management(out_fc) #add elevation field back in if (count > 0): arcpy.PolygonToRaster_conversion(fc, "z_val", temp_raster_elev, "", "", 1) arcpy.sa.ZonalStatisticsAsTable(out_fc, "Id", temp_raster_elev, "in_memory/temp_table","#","MEAN") arcpy.JoinField_management(out_fc, "Id", "in_memory/temp_table", "Id", ["MEAN"]) else: arcpy.Delete_management(out_fc) #delete temp rasters arcpy.Delete_management(temp_raster) arcpy.Delete_management(temp_raster_elev) arcpy.Delete_management(temp_reclass) #python garbage collection #collected = gc.collect() #print "Garbage collector: collected %d objects." 
% (collected) print "sleeping for 10 seconds" time.sleep(10) 2) Image processing function def processBinaryImage(imageName, save_dir, sm_pixels): fname = os.path.basename(imageName) imRaster = arcpy.Raster(imageName) #Grab AGS info from image #use describe module to grab info descData = arcpy.Describe(imRaster) cellSize = descData.meanCellHeight extent = descData.Extent spatialReference = descData.spatialReference pnt = arcpy.Point(extent.XMin, extent.YMin) del imRaster converter = ConvertRasterNumpy(); imArray = converter.rasterToNumpyArray(imageName) imMin = np.min(imArray) imMax = np.max(imArray) print imMin, imMax arcpy.AddMessage("image min: " + str(imMin)) arcpy.AddMessage("image max: " + str(imMax)); #other flags show_image = False #verbose save_flag = False #under verbose gray = False #threshold but keep gray levels make_binary = False #flag if grayscale is true, to binarize final image save_AGS_rasters = True #create filter object filter = thresholdMedianFilter() lowValue = 0 tImage = filter.thresholdImage(imArray, lowValue, gray) #median filter image filtImage1 = filter.medianFiltImage(tImage,1) #create structuring element for morphological operations sElem = np.array([[0., 1., 0.], [1., 1., 1.], [0., 1., 0.]]) #open filtered image gray = False mImage = filter.mOpenImage(filtImage1, sElem, gray) #set list to store info change num_its = 100 the_change = np.zeros(num_its) for it in range(100): prev = mImage filtImage = filter.medianFiltImage(mImage,1) mImage = filter.mOpenImage(filtImage, sElem, gray) #calculate difference (m_info, z_info) = filter.calcInfoChange(prev, mImage) the_change[it] = z_info del filtImage del prev #if the change is less than 5% of the initial change, exit cur_perc = the_change[it]/the_change[0]*100 if cur_perc < 5.0: print "exiting filter on iteration " + str(it) print "change is less than 5% (this iteration: " + str(cur_perc) + ")" break ############################################################################ # now we have a binary mask. Let's find and label the connected components # ############################################################################ #clear some space del tImage del filtImage1 del m_info label_im_init, nb_labels = ndimage.label(mImage) #Compute size, mean_value, etc. of each region: label_im = label_im_init.copy() del label_im_init sizes = ndimage.sum(mImage, label_im, range(nb_labels + 1)) mean_vals = ndimage.sum(imArray, label_im, range(1, nb_labels + 1)) #clean up small components mask_size = sizes < sm_pixels remove_pixel = mask_size[label_im] label_im[remove_pixel] = 0; labels = np.unique(label_im) label_im = np.searchsorted(labels, label_im) #make label image to a binary image, and convert to arcgis raster label_im[label_im > 0] = 1 label_im = np.array(label_im, dtype = 'float32') print label_im.dtype saveit = False if ~saveit: outRaster = save_dir + "\\" + fname[:-4] + "_CC_" + str(sm_pixels) + ".tif" temp = arcpy.NumPyArrayToRaster(label_im,pnt,cellSize,cellSize) arcpy.DefineProjection_management(temp, spatialReference) arcpy.CopyRaster_management(temp, outRaster, "DEFAULTS","0","","","","8_BIT_UNSIGNED") #clear more space del mImage del nb_labels del sizes del mean_vals del mask_size del remove_pixel del label_im del labels del temp del the_change del sElem del filter del imArray print 'sleeping' time.sleep(20) return outRaster 3) Image processing class class thresholdMedianFilter(): def thresholdImage(self, imArray, thresh, binary = True): """ threshold the image. 
values equal or below thresh are 0, above are 1""" tImage = imArray tImage[tImage <= thresh] = 0 if binary: tImage[tImage > thresh] = 1 return tImage def medianFiltImage(self,imArray,n=1): """median filter the image n amount of times. a single time is the default""" for n in range(n): prev = imArray imArray = signal.medfilt2d(prev) del prev return imArray def mOpenImage(self,imArray, sElem, gray = False): """ morphological opening """ #Mnp = np.array(morph.binary_erosion(imArray, sElem)) #Mnp = np.array(morph.binary_dilation(Mnp, sElem)) if gray: imArray1 = np.array(morph.grey_dilation(imArray, structure = sElem)) imArray2 = np.array(morph.grey_erosion(imArray1, structure = sElem), dtype = 'float32') del imArray1 return imArray2 else: Mnp1 = np.array(morph.binary_dilation(imArray, sElem)) Mnp2 = np.array(morph.binary_erosion(Mnp1, sElem), dtype = 'float32') del Mnp1 return Mnp2 def calcInfoChange(self,imArray1, imArray2): """calculate entropy of an image""" diff = imArray1 - imArray2 m_norm = sum(abs(diff)) #Manhattan Norm z_norm = norm(diff.ravel(), 0) #Zero norm del diff return (m_norm, z_norm) Joe McGlinchy | Imagery Scientist Database Services - 3D and Imagery Team ESRI || 380 New York St. || Redlands, CA 92373 || USA T 909-793-2853, ext. 4783 -------------- next part -------------- An HTML attachment was scrubbed... URL: From jtaylor.debian at googlemail.com Tue Jan 28 13:57:53 2014 From: jtaylor.debian at googlemail.com (Julian Taylor) Date: Tue, 28 Jan 2014 19:57:53 +0100 Subject: [Numpy-discussion] scipy image processing memory leak in python 2.7? In-Reply-To: <6D6DE7712442DD4CAEFE7FB9D86210D222604449@RED-INF-EXMB-P1.esri.com> References: <6D6DE7712442DD4CAEFE7FB9D86210D222604449@RED-INF-EXMB-P1.esri.com> Message-ID: <52E7FDB1.8010109@googlemail.com> On 28.01.2014 19:44, Joseph McGlinchy wrote: > Hi numpy list! > > > > I am trying to do some image processing on a number of images, 72 to be > specific. I am seeing the python memory usage continually increase. which version of scipy are you using? there is a memory leak in ndimage.label in version 0.13 which will be fixed in 0.13.3 due soon. see https://github.com/scipy/scipy/issues/3148 From JMcGlinchy at esri.com Tue Jan 28 14:08:17 2014 From: JMcGlinchy at esri.com (Joseph McGlinchy) Date: Tue, 28 Jan 2014 19:08:17 +0000 Subject: [Numpy-discussion] scipy image processing memory leak in python 2.7? In-Reply-To: <52E7FDB1.8010109@googlemail.com> References: <6D6DE7712442DD4CAEFE7FB9D86210D222604449@RED-INF-EXMB-P1.esri.com> <52E7FDB1.8010109@googlemail.com> Message-ID: <6D6DE7712442DD4CAEFE7FB9D86210D2226046E7@RED-INF-EXMB-P1.esri.com> >>> scipy.version.version '0.11.0b1' >>> numpy.version.version '1.6.1' Is that, or any other, memory leaks present in these versions? -----Original Message----- From: numpy-discussion-bounces at scipy.org [mailto:numpy-discussion-bounces at scipy.org] On Behalf Of Julian Taylor Sent: Tuesday, January 28, 2014 10:58 AM To: numpy-discussion at scipy.org Subject: Re: [Numpy-discussion] scipy image processing memory leak in python 2.7? On 28.01.2014 19:44, Joseph McGlinchy wrote: > Hi numpy list! > > > > I am trying to do some image processing on a number of images, 72 to > be specific. I am seeing the python memory usage continually increase. which version of scipy are you using? there is a memory leak in ndimage.label in version 0.13 which will be fixed in 0.13.3 due soon. 
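A minimal check of whether ndimage.label by itself leaks on a given SciPy version (the issue link follows below): memory use should stay flat over the loop. psutil is assumed here only for convenience; watching the process in Task Manager, as above, works equally well.

import numpy as np
from scipy import ndimage
import psutil  # optional, only used to print the process memory

proc = psutil.Process()
img = np.random.rand(512, 512) > 0.5

for i in range(200):
    labels, n = ndimage.label(img)
    del labels
    if i % 50 == 0:
        print("iteration %d: RSS = %.1f MB"
              % (i, proc.memory_info().rss / 1e6))
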
see https://github.com/scipy/scipy/issues/3148 _______________________________________________ NumPy-Discussion mailing list NumPy-Discussion at scipy.org http://mail.scipy.org/mailman/listinfo/numpy-discussion From ralf.gommers at gmail.com Wed Jan 29 09:16:01 2014 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Wed, 29 Jan 2014 15:16:01 +0100 Subject: [Numpy-discussion] scipy image processing memory leak in python 2.7? In-Reply-To: <6D6DE7712442DD4CAEFE7FB9D86210D2226046E7@RED-INF-EXMB-P1.esri.com> References: <6D6DE7712442DD4CAEFE7FB9D86210D222604449@RED-INF-EXMB-P1.esri.com> <52E7FDB1.8010109@googlemail.com> <6D6DE7712442DD4CAEFE7FB9D86210D2226046E7@RED-INF-EXMB-P1.esri.com> Message-ID: On Tue, Jan 28, 2014 at 8:08 PM, Joseph McGlinchy wrote: > >>> scipy.version.version > '0.11.0b1' > > >>> numpy.version.version > '1.6.1' > > Is that, or any other, memory leaks present in these versions? > That memory leak isn't, it's only present in 0.13.0-0.13.2. I'm not aware of any other leaks that were fixed since 0.11.0 either, but still it would be worth checking if you see the same issue with current scipy master. Ralf > > > -----Original Message----- > From: numpy-discussion-bounces at scipy.org [mailto: > numpy-discussion-bounces at scipy.org] On Behalf Of Julian Taylor > Sent: Tuesday, January 28, 2014 10:58 AM > To: numpy-discussion at scipy.org > Subject: Re: [Numpy-discussion] scipy image processing memory leak in > python 2.7? > > On 28.01.2014 19:44, Joseph McGlinchy wrote: > > Hi numpy list! > > > > > > > > I am trying to do some image processing on a number of images, 72 to > > be specific. I am seeing the python memory usage continually increase. > > which version of scipy are you using? > there is a memory leak in ndimage.label in version 0.13 which will be > fixed in 0.13.3 due soon. > see https://github.com/scipy/scipy/issues/3148 > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From JMcGlinchy at esri.com Wed Jan 29 13:19:16 2014 From: JMcGlinchy at esri.com (Joseph McGlinchy) Date: Wed, 29 Jan 2014 18:19:16 +0000 Subject: [Numpy-discussion] scipy image processing memory leak in python 2.7? In-Reply-To: References: <6D6DE7712442DD4CAEFE7FB9D86210D222604449@RED-INF-EXMB-P1.esri.com> <52E7FDB1.8010109@googlemail.com> <6D6DE7712442DD4CAEFE7FB9D86210D2226046E7@RED-INF-EXMB-P1.esri.com> Message-ID: <6D6DE7712442DD4CAEFE7FB9D86210D222604A2F@RED-INF-EXMB-P1.esri.com> I have updated to scipy 0.13.2 (specifically scipy-0.13.2-win32-superpack-python2.7.exe (59.1 MB)) at http://sourceforge.net/projects/scipy/files/ I am still seeing the memory leak, and it appears to be a little larger (25MB) each iteration. 
Here are the numbers I am seeing on my python process: Iteration 1, sleep1: 207,716k Iteration 1, sleep2: 212,112k Iteration 2, sleep1: 236,488k Iteration 2, sleep2: 237,160k Iteration 3, sleep1: 261,168k Iteration 3, sleep2: 261,264k Iteration 4, sleep1: 285,044k Iteration 4, sleep2: 285,724k -Joe From: numpy-discussion-bounces at scipy.org [mailto:numpy-discussion-bounces at scipy.org] On Behalf Of Ralf Gommers Sent: Wednesday, January 29, 2014 6:16 AM To: Discussion of Numerical Python Subject: Re: [Numpy-discussion] scipy image processing memory leak in python 2.7? On Tue, Jan 28, 2014 at 8:08 PM, Joseph McGlinchy > wrote: >>> scipy.version.version '0.11.0b1' >>> numpy.version.version '1.6.1' Is that, or any other, memory leaks present in these versions? That memory leak isn't, it's only present in 0.13.0-0.13.2. I'm not aware of any other leaks that were fixed since 0.11.0 either, but still it would be worth checking if you see the same issue with current scipy master. Ralf -----Original Message----- From: numpy-discussion-bounces at scipy.org [mailto:numpy-discussion-bounces at scipy.org] On Behalf Of Julian Taylor Sent: Tuesday, January 28, 2014 10:58 AM To: numpy-discussion at scipy.org Subject: Re: [Numpy-discussion] scipy image processing memory leak in python 2.7? On 28.01.2014 19:44, Joseph McGlinchy wrote: > Hi numpy list! > > > > I am trying to do some image processing on a number of images, 72 to > be specific. I am seeing the python memory usage continually increase. which version of scipy are you using? there is a memory leak in ndimage.label in version 0.13 which will be fixed in 0.13.3 due soon. see https://github.com/scipy/scipy/issues/3148 _______________________________________________ NumPy-Discussion mailing list NumPy-Discussion at scipy.org http://mail.scipy.org/mailman/listinfo/numpy-discussion _______________________________________________ NumPy-Discussion mailing list NumPy-Discussion at scipy.org http://mail.scipy.org/mailman/listinfo/numpy-discussion -------------- next part -------------- An HTML attachment was scrubbed... URL: From ben.root at ou.edu Wed Jan 29 14:10:02 2014 From: ben.root at ou.edu (Benjamin Root) Date: Wed, 29 Jan 2014 14:10:02 -0500 Subject: [Numpy-discussion] Memory leak in numpy? In-Reply-To: References: Message-ID: Hmmm, I see no reason why that would eat up memory. I just tried it out on my own system (numpy 1.6.1, CentOS 6, python 2.7.1), and had no issues, Memory usage stayed flat for the 10 seconds it took to go through the loop. Note, I am not using ATLAS or BLAS, so maybe the issue lies there? (i don't know if numpy defers the dot-product over to ATLAS or BLAS if they are available) -------------- next part -------------- An HTML attachment was scrubbed... URL: From JMcGlinchy at esri.com Wed Jan 29 14:16:44 2014 From: JMcGlinchy at esri.com (Joseph McGlinchy) Date: Wed, 29 Jan 2014 19:16:44 +0000 Subject: [Numpy-discussion] Memory leak in numpy? In-Reply-To: References: Message-ID: <6D6DE7712442DD4CAEFE7FB9D86210D222604A6F@RED-INF-EXMB-P1.esri.com> Perhaps it is an ESRI/Arcpy issue then. I don't see anything that could be doing that, though, as it is very minimal. From: numpy-discussion-bounces at scipy.org [mailto:numpy-discussion-bounces at scipy.org] On Behalf Of Benjamin Root Sent: Wednesday, January 29, 2014 11:10 AM To: Discussion of Numerical Python Subject: Re: [Numpy-discussion] Memory leak in numpy? Hmmm, I see no reason why that would eat up memory. 
I just tried it out on my own system (numpy 1.6.1, CentOS 6, python 2.7.1), and had no issues, Memory usage stayed flat for the 10 seconds it took to go through the loop. Note, I am not using ATLAS or BLAS, so maybe the issue lies there? (i don't know if numpy defers the dot-product over to ATLAS or BLAS if they are available) -------------- next part -------------- An HTML attachment was scrubbed... URL: From JMcGlinchy at esri.com Wed Jan 29 14:39:44 2014 From: JMcGlinchy at esri.com (Joseph McGlinchy) Date: Wed, 29 Jan 2014 19:39:44 +0000 Subject: [Numpy-discussion] Memory leak in numpy? In-Reply-To: <6D6DE7712442DD4CAEFE7FB9D86210D222604A6F@RED-INF-EXMB-P1.esri.com> References: <6D6DE7712442DD4CAEFE7FB9D86210D222604A6F@RED-INF-EXMB-P1.esri.com> Message-ID: <6D6DE7712442DD4CAEFE7FB9D86210D222604A8B@RED-INF-EXMB-P1.esri.com> Upon further investigation, I do believe it is within the scipy code where there is a leak. I commented out my call to processBinaryImage(), which is all scipy code calls, and my memory usage remains flat with approximately a 1MB variation. Any ideas? Right now I am getting around it by checking to see how far I got through my dataset, but I have to restart the program after each memory crash. From: numpy-discussion-bounces at scipy.org [mailto:numpy-discussion-bounces at scipy.org] On Behalf Of Joseph McGlinchy Sent: Wednesday, January 29, 2014 11:17 AM To: Discussion of Numerical Python Subject: Re: [Numpy-discussion] Memory leak in numpy? Perhaps it is an ESRI/Arcpy issue then. I don't see anything that could be doing that, though, as it is very minimal. From: numpy-discussion-bounces at scipy.org [mailto:numpy-discussion-bounces at scipy.org] On Behalf Of Benjamin Root Sent: Wednesday, January 29, 2014 11:10 AM To: Discussion of Numerical Python Subject: Re: [Numpy-discussion] Memory leak in numpy? Hmmm, I see no reason why that would eat up memory. I just tried it out on my own system (numpy 1.6.1, CentOS 6, python 2.7.1), and had no issues, Memory usage stayed flat for the 10 seconds it took to go through the loop. Note, I am not using ATLAS or BLAS, so maybe the issue lies there? (i don't know if numpy defers the dot-product over to ATLAS or BLAS if they are available) -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Wed Jan 29 14:44:19 2014 From: njs at pobox.com (Nathaniel Smith) Date: Wed, 29 Jan 2014 19:44:19 +0000 Subject: [Numpy-discussion] Memory leak in numpy? In-Reply-To: <6D6DE7712442DD4CAEFE7FB9D86210D222604A8B@RED-INF-EXMB-P1.esri.com> References: <6D6DE7712442DD4CAEFE7FB9D86210D222604A6F@RED-INF-EXMB-P1.esri.com> <6D6DE7712442DD4CAEFE7FB9D86210D222604A8B@RED-INF-EXMB-P1.esri.com> Message-ID: On Wed, Jan 29, 2014 at 7:39 PM, Joseph McGlinchy wrote: > Upon further investigation, I do believe it is within the scipy code where > there is a leak. I commented out my call to processBinaryImage(), which is > all scipy code calls, and my memory usage remains flat with approximately a > 1MB variation. Any ideas? I'd suggest continuing along this line, and keep chopping things out until you have a minimal program that still shows the problem -- that'll probably make it much clearer where the problem is actually coming from... -n From jtaylor.debian at googlemail.com Wed Jan 29 14:52:32 2014 From: jtaylor.debian at googlemail.com (Julian Taylor) Date: Wed, 29 Jan 2014 20:52:32 +0100 Subject: [Numpy-discussion] Memory leak in numpy? 
In-Reply-To: References: <6D6DE7712442DD4CAEFE7FB9D86210D222604A6F@RED-INF-EXMB-P1.esri.com> <6D6DE7712442DD4CAEFE7FB9D86210D222604A8B@RED-INF-EXMB-P1.esri.com> Message-ID: <52E95C00.2070406@googlemail.com> On 29.01.2014 20:44, Nathaniel Smith wrote: > On Wed, Jan 29, 2014 at 7:39 PM, Joseph McGlinchy wrote: >> Upon further investigation, I do believe it is within the scipy code where >> there is a leak. I commented out my call to processBinaryImage(), which is >> all scipy code calls, and my memory usage remains flat with approximately a >> 1MB variation. Any ideas? > > I'd suggest continuing along this line, and keep chopping things out > until you have a minimal program that still shows the problem -- > that'll probably make it much clearer where the problem is actually > coming from... > > -n depending on how long the program runs you can try running it under massif the valgrind memory usage proftool, that should give you a good clue where the source is. From JMcGlinchy at esri.com Wed Jan 29 17:13:01 2014 From: JMcGlinchy at esri.com (Joseph McGlinchy) Date: Wed, 29 Jan 2014 22:13:01 +0000 Subject: [Numpy-discussion] Memory leak in numpy? In-Reply-To: <52E95C00.2070406@googlemail.com> References: <6D6DE7712442DD4CAEFE7FB9D86210D222604A6F@RED-INF-EXMB-P1.esri.com> <6D6DE7712442DD4CAEFE7FB9D86210D222604A8B@RED-INF-EXMB-P1.esri.com> <52E95C00.2070406@googlemail.com> Message-ID: <6D6DE7712442DD4CAEFE7FB9D86210D222604AEF@RED-INF-EXMB-P1.esri.com> Unfortunately I do not have Linux or much time to invest in researching and learning an alternative to Valgrind :/ My current workaround, which works very well, is to move my scipy part of the script to its own script and then use os.system() to call it with the appropriate arguments. Thanks everyone for the replies! Is there a proper way to close the thread? -Joe -----Original Message----- From: numpy-discussion-bounces at scipy.org [mailto:numpy-discussion-bounces at scipy.org] On Behalf Of Julian Taylor Sent: Wednesday, January 29, 2014 11:53 AM To: Discussion of Numerical Python Subject: Re: [Numpy-discussion] Memory leak in numpy? On 29.01.2014 20:44, Nathaniel Smith wrote: > On Wed, Jan 29, 2014 at 7:39 PM, Joseph McGlinchy wrote: >> Upon further investigation, I do believe it is within the scipy code >> where there is a leak. I commented out my call to >> processBinaryImage(), which is all scipy code calls, and my memory >> usage remains flat with approximately a 1MB variation. Any ideas? > > I'd suggest continuing along this line, and keep chopping things out > until you have a minimal program that still shows the problem -- > that'll probably make it much clearer where the problem is actually > coming from... > > -n depending on how long the program runs you can try running it under massif the valgrind memory usage proftool, that should give you a good clue where the source is. _______________________________________________ NumPy-Discussion mailing list NumPy-Discussion at scipy.org http://mail.scipy.org/mailman/listinfo/numpy-discussion From jaime.frio at gmail.com Thu Jan 30 02:38:42 2014 From: jaime.frio at gmail.com (=?ISO-8859-1?Q?Jaime_Fern=E1ndez_del_R=EDo?=) Date: Wed, 29 Jan 2014 23:38:42 -0800 Subject: [Numpy-discussion] ENH: Type specific binary search functions for `searchsorted` Message-ID: Hi, I have just added a new PR: https://github.com/numpy/numpy/pull/4244 >From the commit message: This PR replaces the generic binary search functions used by `searchsorted` with type specific ones for numeric types. 
This results in a speed-up of calls to `searchsorted` which is highly dependent on the size of the 'haystack' and the 'needle', with typical values for large enough needles in the 1.5x - 3.0x for direct searches (i.e. without a sorter argument) and 1.2x - 2.0x for indirect searches. A summary benchmark on float and int arrays can be found here. Furthermore, the type specific binary search functions can take strided inputs for all their arguments, which is a step in the right direction to eventually add an axis argument to`searchsorted`. Any comments and reviews are more than welcome! Jaime -- (\__/) ( O.o) ( > <) Este es Conejo. Copia a Conejo en tu firma y ay?dale en sus planes de dominaci?n mundial. -------------- next part -------------- An HTML attachment was scrubbed... URL: From sturla.molden at gmail.com Thu Jan 30 03:11:44 2014 From: sturla.molden at gmail.com (Sturla Molden) Date: Thu, 30 Jan 2014 09:11:44 +0100 Subject: [Numpy-discussion] MKL and OpenBLAS In-Reply-To: References: Message-ID: On 27/01/14 12:01, Carl Kleffner wrote: > Did you consider to check the experimental binaries on > https://code.google.com/p/mingw-w64-static/ for Python-2.7? These > binaries has been build with with a customized mingw-w64 toolchain. > These builds are fully statically build and are link against the MSVC90 > runtime libraries (gcc runtime is linked statically) and OpenBLAS. > > Carl Building OpenBLAS and LAPACK is very easy. I used TDM-GCC for Win64. It's just two makefile (not even a configure script). OpenBLAS and LAPACK are probably the easiest libraries to build there is. The main problem for using OpenBLAS with NumPy and SciPy on Windows is that Python 2.7 from www.python.org does not ship with libpython27.a for 64-bit Python, so we need to maintain our own. Also, GNU compilers are required to build OpenBLAS. This means we have to build our own libgfortran as well. The binary is incompatible with the MSVC runtime we use. I.e. not impossible, but painful. http://mail.scipy.org/pipermail/numpy-discussion/2012-August/063740.html Sturla From cmkleffner at gmail.com Thu Jan 30 06:01:11 2014 From: cmkleffner at gmail.com (Carl Kleffner) Date: Thu, 30 Jan 2014 12:01:11 +0100 Subject: [Numpy-discussion] MKL and OpenBLAS In-Reply-To: References: Message-ID: I agree, building OpenBLAS with mingw-w64 is a snap. The problem is choosing and adapting a mingw based gcc-toolchain and patching the numpy sources according to this toolchain. For the last years I was a happy user of the mingw.org based toolchain. After searching for a 64-bit alternative I stumbled upon mingw-w64 and its derivatives. I tried out several mingw-w64 based toolchains, i.e. TDM, equation.com and more. All mingw-w64 derivatives have there pros and cons. You may know, that you have to choose not only for bitness (32 vs 64 bit) and gcc version, but also for exception handling (sjlj, dwarf, seh) and the way threading is supported (win32 vs. posix threads). Not all of these derivatives describe what they use in a clearly manner. And the TDM toolchain i.e. has introduced some API incompatibilities to standard gcc-toolchains due to its own patches. A serious problem is gcc linking to runtimes other than msvcrt.dll. Mingw-w64 HAS import libraries fort msvcr80, msvcr90, msvcr100, msvcr110. However correct linkage to say to msvcr90 is more than just adding -lmsvcr90 to the linker command. You have to create a spec file for gcc and adapt it to your need. 
It is also very important (especially for msvcr90) to link manifest files to the binaries you create. This has to do with the way Microsoft searches for DLLs. "Akruis" (Anselm Kruis, science + computing AG) did the job to iron out these problems concerning mingw-w64 and python. Unfortunately his blog disappears for some time now. The maintainers of the mingw-w64 toolchains DO NOT focus on the problem with alternative runtime linking. A related problem is that symbols are used by OpenMP and winpthreads you can resolve in msvcrt.dll, but not in msvcr90.dll, so "_ftime"has to be exchanged with "ftime64" if you want to use OpenMP or winpthreads. In the end my solution was to build my own toolchain. This is time consuming but simple with the help of the set of scripts you can find here: https://github.com/niXman/mingw-builds/tree/develop With this set of scripts and msys2 http://sourceforge.net/projects/msys2/and my own "_ftime" patch I build a 'statically' mingw-w64 toolchain. Let me say a word about statically build: GCC can be build statically. This means, that all of the C, C++, Gfortran runtime is statically linked to every binary. There is not much bloat as you might expect when the binaries are stripped. And yes, it is necessary to build an import lib for python. This import lib is specific to the toolchain you are going to use. My idea is to create and add all import libs (py2.6 up to py3.4) to the toolchain and do not use any of the importlibs that might exist in the python/libs/ folder. My conclusion is: mixing different compiler architectures for building Python extensions on Windows is possible but makes it necessary to build a 'vendor' gcc toolchain. I did not find the time to put my latest binaries on the web or make numpy pull requests the github way due to my workload. Hopefully I find some time next weekend. with best regards Carl 2014-01-30 Sturla Molden : > On 27/01/14 12:01, Carl Kleffner wrote: > > Did you consider to check the experimental binaries on > > https://code.google.com/p/mingw-w64-static/ for Python-2.7? These > > binaries has been build with with a customized mingw-w64 toolchain. > > These builds are fully statically build and are link against the MSVC90 > > runtime libraries (gcc runtime is linked statically) and OpenBLAS. > > > > Carl > > Building OpenBLAS and LAPACK is very easy. I used TDM-GCC for Win64. > It's just two makefile (not even a configure script). OpenBLAS and > LAPACK are probably the easiest libraries to build there is. > > The main problem for using OpenBLAS with NumPy and SciPy on Windows is > that Python 2.7 from www.python.org does not ship with libpython27.a for > 64-bit Python, so we need to maintain our own. Also, GNU compilers are > required to build OpenBLAS. This means we have to build our own > libgfortran as well. The binary is incompatible with the MSVC runtime we > use. I.e. not impossible, but painful. > > http://mail.scipy.org/pipermail/numpy-discussion/2012-August/063740.html > > > Sturla > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From sebastian at sipsolutions.net Thu Jan 30 06:02:10 2014 From: sebastian at sipsolutions.net (sebastian) Date: Thu, 30 Jan 2014 12:02:10 +0100 Subject: [Numpy-discussion] Deprecation of boolean substract and negative (the - operator) Message-ID: <7d4e20513cb7b8cc7493cc1d411176f3@sipsolutions.net> Hey all, recently we had a small discussion about deprecating some of the operators for boolean arrays. This discussion seemed to have ended by large in the consense that while most boolean operators are well defined and should be kept, the `-` one is not very well defined on boolean arrays and has the problem of the inconsistency: - np.array(False) == True False - np.array(False) == False # leading to: False - (-np.arry(False)) != False + np.array(False) So that it is preferable to use one of the binary operators for this operation. For now this would only be a deprecation, but both operators are probably used out there. So if you have any serious doubt about starting this deprecation please note it here. The Pull request to implement such a deprecation is: https://github.com/numpy/numpy/pull/4105 Regards, Sebastian From sturla.molden at gmail.com Thu Jan 30 06:38:21 2014 From: sturla.molden at gmail.com (Sturla Molden) Date: Thu, 30 Jan 2014 12:38:21 +0100 Subject: [Numpy-discussion] MKL and OpenBLAS In-Reply-To: References: Message-ID: On 30/01/14 12:01, Carl Kleffner wrote: > My conclusion is: mixing different compiler architectures for building > Python extensions on Windows is possible but makes it necessary to build > a 'vendor' gcc toolchain. Right. This makes a nice twist on the infamous XML and Regex story: - There once was a man who had a problem building NumPy. Then he thought, "I'll just use a custom compiler toolchain." Now he had two problems. Setting up a custom GNU toolchain for NumPy on Windows would not be robust enough. And when there be bugs, we have two places to look for them instead of one. By using a tested and verified compiler toolchain, there is one place less things can go wrong. I would rather consider distributing NumPy binaries linked with MKL, if Intel's license allows it. Sturla From cmkleffner at gmail.com Thu Jan 30 07:29:45 2014 From: cmkleffner at gmail.com (Carl Kleffner) Date: Thu, 30 Jan 2014 13:29:45 +0100 Subject: [Numpy-discussion] MKL and OpenBLAS In-Reply-To: References: Message-ID: I fully agree with you. But you have to consider the following: - the officially mingw-w64 toolchains are build almost the same way. The only difference is, that they have non-static builds (that would be preferable for C++ development BTW) - you won't get the necessary addons like spec-files, manifest resource files for msvcr90,100 from there. - there is a urgent need for a free and portable C,C++, Fortran compiler for Windows with full blas, lapack support. You won't get that with numpy-MKL, but with a GNU toolchain and OpenBLAS. Not everyone can buy the Intel Fortran compiler or is allowed to install it. - you can build 3rd party extensions which use blas,lapack directly or with cython with such a toolchain regardless if you use numpy/scipy-MKL or mingw-based numpy/scipy - The licence question of numpy-MKL is unclear. I know that MKL is linked in statically. But can I redistribite it myself or use it in commercial context without buying a Intel licence? 
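Whichever BLAS the binaries end up shipping with, it is useful to be able to check what an installed NumPy build is actually linked against; a quick check using only public NumPy functions:

import numpy as np
import time

# Show the BLAS/LAPACK configuration NumPy was built with
# (ATLAS, OpenBLAS, MKL, Accelerate, or the unoptimized fallback).
np.show_config()

# Rough sanity check: an optimized BLAS does a 2000x2000 matrix
# product in well under a second on current hardware.
a = np.random.rand(2000, 2000)
t0 = time.time()
np.dot(a, a)
print("dgemm time: %.2f s" % (time.time() - t0))
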
Carl 2014-01-30 Sturla Molden : > On 30/01/14 12:01, Carl Kleffner wrote: > > > My conclusion is: mixing different compiler architectures for building > > Python extensions on Windows is possible but makes it necessary to build > > a 'vendor' gcc toolchain. > > Right. > > This makes a nice twist on the infamous XML and Regex story: > > - There once was a man who had a problem building NumPy. Then he > thought, "I'll just use a custom compiler toolchain." Now he had two > problems. > > Setting up a custom GNU toolchain for NumPy on Windows would not be > robust enough. And when there be bugs, we have two places to look for > them instead of one. > > By using a tested and verified compiler toolchain, there is one place > less things can go wrong. I would rather consider distributing NumPy > binaries linked with MKL, if Intel's license allows it. > > Sturla > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From matthew.brett at gmail.com Thu Jan 30 13:43:40 2014 From: matthew.brett at gmail.com (Matthew Brett) Date: Thu, 30 Jan 2014 10:43:40 -0800 Subject: [Numpy-discussion] MKL and OpenBLAS In-Reply-To: References: Message-ID: Hi, On Thu, Jan 30, 2014 at 4:29 AM, Carl Kleffner wrote: > I fully agree with you. But you have to consider the following: > > - the officially mingw-w64 toolchains are build almost the same way. The > only difference is, that they have non-static builds (that would be > preferable for C++ development BTW) > - you won't get the necessary addons like spec-files, manifest resource > files for msvcr90,100 from there. > - there is a urgent need for a free and portable C,C++, Fortran compiler for > Windows with full blas, lapack support. You won't get that with numpy-MKL, > but with a GNU toolchain and OpenBLAS. Not everyone can buy the Intel > Fortran compiler or is allowed to install it. Thanks for doing this - I'd love to see the toolchain. If there's anything I can do to help, please let me know. The only obvious thing I can think of is using our buildbots or just the spare machines we have: http://nipy.bic.berkeley.edu/ but if you can think of anything else, please let me know. Cheers, Matthew From jenny.stone125 at gmail.com Thu Jan 30 18:01:01 2014 From: jenny.stone125 at gmail.com (jennifer stone) Date: Fri, 31 Jan 2014 04:31:01 +0530 Subject: [Numpy-discussion] Suggestions for GSoC Projects Message-ID: With GSoC 2014 being round the corner, I hereby put up few projects for discussion that I would love to pursue as a student. Guidance, suggestions are cordially welcome:- 1. If I am not mistaken, contour integration is not supported by SciPy; in fact even line integrals of real functions is yet to be implemented in SciPy, which is surprising. Though we at present have SymPy for line Integrals, I doubt if there is any open-source python package supporting the calculation of Contour Integrals. With integrate module of SciPy already having been properly developed for definite integration, implementation of line as well as contour integrals, I presume; would not require work from scratch and shall be a challenging but fruitful project. 2. I really have no idea if the purpose of NumPy or SciPy would encompass this but we are yet to have indefinite integration. 
An implementation of that, though highly challenging, may open doors for innumerable other functions like the ones to calculate the Laplace transform, Hankel transform and many more. 3. As stated earlier, we have spherical harmonic functions (with much scope for dev) we are yet to have elliptical and cylindrical harmonic function, which may be developed. 4. Lastly, we are yet to have Inverse Laplace transforms which as Ralf has rightly pointed out it may be too challenging to implement. 5. Further reading the road-map given by Mr.Ralf, I would like to develop the Bluestein's FFT algorithm. Thanks for reading along till the end. I shall append to this mail as when I am struck with ideas. Please do give your valuable guidance -------------- next part -------------- An HTML attachment was scrubbed... URL: From sturla.molden at gmail.com Fri Jan 31 00:44:01 2014 From: sturla.molden at gmail.com (Sturla Molden) Date: Fri, 31 Jan 2014 06:44:01 +0100 Subject: [Numpy-discussion] MKL and OpenBLAS In-Reply-To: References: Message-ID: By the way, it seems OpenBLAS builds with clang on MacOSX, so presumably it works on Windows as well. Unlike GNU toolchains, there is a cl-clang frontend which is supposed to be MSVC compatible. BTW, clang is a fantastic compiler, but little known among Windows users where MSVC and MinGW dominate. Sturla On 30/01/14 13:29, Carl Kleffner wrote: > I fully agree with you. But you have to consider the following: > > - the officially mingw-w64 toolchains are build almost the same way. The > only difference is, that they have non-static builds (that would be > preferable for C++ development BTW) > - you won't get the necessary addons like spec-files, manifest resource > files for msvcr90,100 from there. > - there is a urgent need for a free and portable C,C++, Fortran compiler > for Windows with full blas, lapack support. You won't get that with > numpy-MKL, but with a GNU toolchain and OpenBLAS. Not everyone can buy > the Intel Fortran compiler or is allowed to install it. > - you can build 3rd party extensions which use blas,lapack directly or > with cython with such a toolchain regardless if you use numpy/scipy-MKL > or mingw-based numpy/scipy > - The licence question of numpy-MKL is unclear. I know that MKL is > linked in statically. But can I redistribite it myself or use it in > commercial context without buying a Intel licence? > > Carl > > > 2014-01-30 Sturla Molden >: > > On 30/01/14 12:01, Carl Kleffner wrote: > > > My conclusion is: mixing different compiler architectures for > building > > Python extensions on Windows is possible but makes it necessary > to build > > a 'vendor' gcc toolchain. > > Right. > > This makes a nice twist on the infamous XML and Regex story: > > - There once was a man who had a problem building NumPy. Then he > thought, "I'll just use a custom compiler toolchain." Now he had two > problems. > > Setting up a custom GNU toolchain for NumPy on Windows would not be > robust enough. And when there be bugs, we have two places to look for > them instead of one. > > By using a tested and verified compiler toolchain, there is one place > less things can go wrong. I would rather consider distributing NumPy > binaries linked with MKL, if Intel's license allows it. 
> > Sturla > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From chris.laumann at gmail.com Fri Jan 31 01:20:52 2014 From: chris.laumann at gmail.com (Chris Laumann) Date: Thu, 30 Jan 2014 23:20:52 -0700 Subject: [Numpy-discussion] Memory leak? In-Reply-To: References: Message-ID: Hi all- The following snippet appears to leak memory badly (about 10 MB per execution): P = randint(0,2,(30,13)) for i in range(50): print "\r", i, "/", 50 for ai in ndindex((2,)*13): j = np.sum(P.dot(ai)) If instead you execute (no np.sum call): P = randint(0,2,(30,13)) for i in range(50): print "\r", i, "/", 50 for ai in ndindex((2,)*13): j = P.dot(ai) There is no leak. Any thoughts? I'm stumped. Best, Chris -- Chris Laumann Sent with Airmail -------------- next part -------------- An HTML attachment was scrubbed... URL: From sturla.molden at gmail.com Fri Jan 31 02:20:22 2014 From: sturla.molden at gmail.com (Sturla Molden) Date: Fri, 31 Jan 2014 08:20:22 +0100 Subject: [Numpy-discussion] MKL and OpenBLAS In-Reply-To: References: Message-ID: On 26/01/14 13:44, Dinesh Vadhia wrote:> This conversation gets discussed often with Numpy developers but since > the requirement for optimized Blas is pretty common these days, how > about distributing Numpy with OpenBlas by default? People who don't > want optimized BLAS or OpenBLAS can then edit the site.cfg file to > add/remove. I can never remember if Numpy comes with Atlas by default > but either way, if using MKL is not feasible because of its licensing > issues then Numpy has to be re-compiled with OpenBLAS (for example). > Why not make it easier for developers to use Numpy with an in-built > optimized Blas. > Btw, just in case some folks from Intel are listening: how about > releasing MKL binaries for all platforms for developers to do with it > what they want ie. free. You know it makes sense! There is an active discussion on this here: https://github.com/xianyi/OpenBLAS/issues/294 Sturla From jtaylor.debian at googlemail.com Fri Jan 31 04:29:34 2014 From: jtaylor.debian at googlemail.com (Julian Taylor) Date: Fri, 31 Jan 2014 10:29:34 +0100 Subject: [Numpy-discussion] Memory leak? In-Reply-To: References: Message-ID: which version of numpy are you using? there seems to be a leak in the scalar return due to the PyObject_Malloc usage in git master, but it doesn't affect 1.8.0 On Fri, Jan 31, 2014 at 7:20 AM, Chris Laumann wrote: > Hi all- > > The following snippet appears to leak memory badly (about 10 MB per > execution): > > P = randint(0,2,(30,13)) > > for i in range(50): > print "\r", i, "/", 50 > for ai in ndindex((2,)*13): > j = np.sum(P.dot(ai)) > > If instead you execute (no np.sum call): > > P = randint(0,2,(30,13)) > > for i in range(50): > print "\r", i, "/", 50 > for ai in ndindex((2,)*13): > j = P.dot(ai) > > There is no leak. > > Any thoughts? I'm stumped. > > Best, Chris > > -- > Chris Laumann > Sent with Airmail > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed...
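For anyone trying to confirm whether their own numpy build shows this leak, one rough way to see it without guessing is to watch the process's peak memory around the loop. The harness below is an illustrative sketch, not something from the thread: it assumes a Unix-like platform (the resource module) and that ru_maxrss is reported in kilobytes as on Linux (OS X reports bytes, so drop the division there):

import resource
import numpy as np

def peak_rss_mb():
    # Peak resident set size so far, in MB (ru_maxrss is KB on Linux).
    return resource.getrusage(resource.RUSAGE_SELF).ru_maxrss / 1024.0

P = np.random.randint(0, 2, (30, 13))
start = peak_rss_mb()
for i in range(5):
    for ai in np.ndindex((2,) * 13):
        j = np.sum(P.dot(ai))          # the variant reported to leak
    print("pass %d: peak RSS up %.1f MB" % (i, peak_rss_mb() - start))

On an affected build the reported growth should climb by roughly the 10 MB per pass described above; on numpy 1.8.0 it should stay essentially flat.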
URL: From stefan at sun.ac.za Fri Jan 31 05:34:59 2014 From: stefan at sun.ac.za (=?iso-8859-1?Q?St=E9fan?= van der Walt) Date: Fri, 31 Jan 2014 11:34:59 +0100 Subject: [Numpy-discussion] Suggestions for GSoC Projects In-Reply-To: References: Message-ID: <20140131103459.GA24791@gmail.com> On Fri, 31 Jan 2014 04:31:01 +0530, jennifer stone wrote: > 3. As stated earlier, we have spherical harmonic functions (with much scope > for dev) we are yet to have elliptical and cylindrical harmonic function, > which may be developed. As stated before, I am personally interested in seeing the spherical harmonics in SciPy improve. > 5. Further reading the road-map given by Mr.Ralf, I would like to develop > the Bluestein's FFT algorithm. https://gist.github.com/endolith/2783807 Regards St?fan From vanforeest at gmail.com Fri Jan 31 05:36:41 2014 From: vanforeest at gmail.com (nicky van foreest) Date: Fri, 31 Jan 2014 11:36:41 +0100 Subject: [Numpy-discussion] Suggestions for GSoC Projects In-Reply-To: References: Message-ID: Hi Jennifer, On 31 January 2014 00:01, jennifer stone wrote: > With GSoC 2014 being round the corner, I hereby put up few projects for > discussion that I would love to pursue as a student. > Guidance, suggestions are cordially welcome:- > > 1. If I am not mistaken, contour integration is not supported by SciPy; in > fact even line integrals of real functions is yet to be implemented in > SciPy, which is surprising. Though we at present have SymPy for line > Integrals, I doubt if there is any open-source python package supporting > the calculation of Contour Integrals. With integrate module of SciPy > already having been properly developed for definite integration, > implementation of line as well as contour integrals, I presume; would not > require work from scratch and shall be a challenging but fruitful project. > > 2. I really have no idea if the purpose of NumPy or SciPy would encompass > this but we are yet to have indefinite integration. An implementation of > that, though highly challenging, may open doors for innumerable other > functions like the ones to calculate the Laplace transform, Hankel > transform and many more. > > 3. As stated earlier, we have spherical harmonic functions (with much > scope for dev) we are yet to have elliptical and cylindrical harmonic > function, which may be developed. > > 4. Lastly, we are yet to have Inverse Laplace transforms which as Ralf has > rightly pointed out it may be too challenging to implement. > I once ported a method of Abate and Whitt to python. My aim was not to produce the nicest python implementation, but to stick closely to the code of Abate and Whitt in their paper. However, it might a useful starting point. http://nicky.vanforeest.com/queueing/euler/euler.html?highlight=laplace Nicky > > 5. Further reading the road-map given by Mr.Ralf, I would like to develop > the Bluestein's FFT algorithm. > > Thanks for reading along till the end. I shall append to this mail as when > I am struck with ideas. Please do give your valuable guidance > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From chris.laumann at gmail.com Fri Jan 31 10:14:14 2014 From: chris.laumann at gmail.com (Chris Laumann) Date: Fri, 31 Jan 2014 08:14:14 -0700 Subject: [Numpy-discussion] Memory leak? 
In-Reply-To: Message-ID: An HTML attachment was scrubbed... URL: From njs at pobox.com Fri Jan 31 10:37:46 2014 From: njs at pobox.com (Nathaniel Smith) Date: Fri, 31 Jan 2014 15:37:46 +0000 Subject: [Numpy-discussion] Memory leak? In-Reply-To: References: Message-ID: On Fri, Jan 31, 2014 at 3:14 PM, Chris Laumann wrote: > > Current scipy superpack for osx so probably pretty close to master. What does numpy.__version__ say? -n From ben.root at ou.edu Fri Jan 31 11:29:10 2014 From: ben.root at ou.edu (Benjamin Root) Date: Fri, 31 Jan 2014 11:29:10 -0500 Subject: [Numpy-discussion] Memory leak? In-Reply-To: References: Message-ID: Just to chime in here about the SciPy Superpack... this distribution tracks the master branch of many projects, and then puts out releases, on the assumption that master contains pristine code, I guess. I have gone down strange rabbit holes thinking that a particular bug was fixed already and the user telling me a version number that would confirm that, only to discover that the superpack actually packaged matplotlib about a month prior to releasing a version. I will not comment on how good or bad of an idea it is for the Superpack to do that, but I just wanted to make other developers aware of this to keep them from falling down the same rabbit hole. Cheers! Ben Root On Fri, Jan 31, 2014 at 10:37 AM, Nathaniel Smith wrote: > On Fri, Jan 31, 2014 at 3:14 PM, Chris Laumann > wrote: > > > > Current scipy superpack for osx so probably pretty close to master. > > What does numpy.__version__ say? > > -n > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Fri Jan 31 12:12:21 2014 From: njs at pobox.com (Nathaniel Smith) Date: Fri, 31 Jan 2014 17:12:21 +0000 Subject: [Numpy-discussion] Memory leak? In-Reply-To: References: Message-ID: On Fri, Jan 31, 2014 at 4:29 PM, Benjamin Root wrote: > Just to chime in here about the SciPy Superpack... this distribution tracks > the master branch of many projects, and then puts out releases, on the > assumption that master contains pristine code, I guess. I have gone down > strange rabbit holes thinking that a particular bug was fixed already and > the user telling me a version number that would confirm that, only to > discover that the superpack actually packaged matplotlib about a month prior > to releasing a version. > > I will not comment on how good or bad of an idea it is for the Superpack to > do that, but I just wanted to make other developers aware of this to keep > them from falling down the same rabbit hole. Wow, that is good to know. Esp. since the web page: http://fonnesbeck.github.io/ScipySuperpack/ simply advertises that it gives you things like numpy 1.9 and scipy 0.14, which don't exist. (With some note about dev versions buried in prose a few sentences later.) Empirically, development versions of numpy have always contained bugs, regressions, and compatibility breaks that were fixed in the released version; and we make absolutely no guarantees about compatibility between dev versions and any release versions. And it sort of has to be that way for us to be able to make progress. But if too many people start using dev versions for daily use, then we and downstream dependencies will have to start adding compatibility hacks and stuff to support those dev versions. 
Which would be a nightmare for developers and users both. Recommending this build for daily use by non-developers strikes me as dangerous for both users and the wider ecosystem. -n From jtaylor.debian at googlemail.com Fri Jan 31 12:31:28 2014 From: jtaylor.debian at googlemail.com (Julian Taylor) Date: Fri, 31 Jan 2014 18:31:28 +0100 Subject: [Numpy-discussion] Memory leak? In-Reply-To: References: Message-ID: <52EBDDF0.2040006@googlemail.com> On 31.01.2014 18:12, Nathaniel Smith wrote: > On Fri, Jan 31, 2014 at 4:29 PM, Benjamin Root wrote: >> Just to chime in here about the SciPy Superpack... this distribution tracks >> the master branch of many projects, and then puts out releases, on the >> assumption that master contains pristine code, I guess. I have gone down >> strange rabbit holes thinking that a particular bug was fixed already and >> the user telling me a version number that would confirm that, only to >> discover that the superpack actually packaged matplotlib about a month prior >> to releasing a version. >> >> I will not comment on how good or bad of an idea it is for the Superpack to >> do that, but I just wanted to make other developers aware of this to keep >> them from falling down the same rabbit hole. > > Wow, that is good to know. Esp. since the web page: > http://fonnesbeck.github.io/ScipySuperpack/ > simply advertises that it gives you things like numpy 1.9 and scipy > 0.14, which don't exist. (With some note about dev versions buried in > prose a few sentences later.) > > Empirically, development versions of numpy have always contained bugs, > regressions, and compatibility breaks that were fixed in the released > version; and we make absolutely no guarantees about compatibility > between dev versions and any release versions. And it sort of has to > be that way for us to be able to make progress. But if too many people > start using dev versions for daily use, then we and downstream > dependencies will have to start adding compatibility hacks and stuff > to support those dev versions. Which would be a nightmare for > developers and users both. > > Recommending this build for daily use by non-developers strikes me as > dangerous for both users and the wider ecosystem. > while probably not good for the user I think its very good for us. This is the second bug I introduced found by superpack users. This one might have gone unnoticed into the next release as it is pretty much impossible to find via tests. Even in valgrind reports its hard to find as its lumped in with all of pythons hundreds of memory arena still-reachable leaks. Concerning the fix, it seems if python sees tp_free == PYObject_Del/Free it replaces it with the tp_free of the base type which is int_free in this case. int_free uses a special allocator for even lower overhead so we start leaking. We either need to find the right flag to set for our scalars so it stops doing that, add an indirection so the function pointers don't match or stop using the object allocator as we are apparently digging to deep into pythons internal implementation details by doing so. From charlesr.harris at gmail.com Fri Jan 31 12:40:32 2014 From: charlesr.harris at gmail.com (Charles R Harris) Date: Fri, 31 Jan 2014 10:40:32 -0700 Subject: [Numpy-discussion] Suggestions for GSoC Projects In-Reply-To: References: Message-ID: On Thu, Jan 30, 2014 at 4:01 PM, jennifer stone wrote: > With GSoC 2014 being round the corner, I hereby put up few projects for > discussion that I would love to pursue as a student. 
> Guidance, suggestions are cordially welcome:- > > 1. If I am not mistaken, contour integration is not supported by SciPy; in > fact even line integrals of real functions is yet to be implemented in > SciPy, which is surprising. Though we at present have SymPy for line > Integrals, I doubt if there is any open-source python package supporting > the calculation of Contour Integrals. With integrate module of SciPy > already having been properly developed for definite integration, > implementation of line as well as contour integrals, I presume; would not > require work from scratch and shall be a challenging but fruitful project. > > No comment, as I don't use this functionality. I don't know how many folks would want this. > 2. I really have no idea if the purpose of NumPy or SciPy would encompass > this but we are yet to have indefinite integration. An implementation of > that, though highly challenging, may open doors for innumerable other > functions like the ones to calculate the Laplace transform, Hankel > transform and many more. > > 3. As stated earlier, we have spherical harmonic functions (with much > scope for dev) we are yet to have elliptical and cylindrical harmonic > function, which may be developed. > This sounds very doable. How much work do you think would be involved? > > 4. Lastly, we are yet to have Inverse Laplace transforms which as Ralf has > rightly pointed out it may be too challenging to implement. > > This is more ambitious, I'm not in a position to comment on whether it is doable in the summer time frame. > 5. Further reading the road-map given by Mr.Ralf, I would like to develop > the Bluestein's FFT algorithm. > > This one could be quite involved, but useful. The problem is not so much *a* Bluestein FFT, but combining it with the current FFTPACK so that factors other than 2,3,4, or 5 are handled with the Bluestein algorithm. FFTPACK is in Fortran and not very well documented. I wouldn't recommend this project unless you are pretty familiar with FFTs and Fortran. It is unfortunate that the latest versions of FFTPACK are GPL. A BSD licensed package that already implements the Bluestein algorithm for FFTs is Parallel Colt, which is in Java but could maybe be translated. A similar but smaller project, not involving integration with the general FFT, would be a stand alone chirpz transform, might be too easy though ;) > Thanks for reading along till the end. I shall append to this mail as when > I am struck with ideas. Please do give your valuable guidance > > Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL:
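To make the stand-alone chirp-z suggestion concrete, here is a minimal sketch of Bluestein's identity for a length-N DFT, using only power-of-two FFTs internally. It is a toy illustration of the algorithm, not FFTPACK integration; the function name czt_dft is invented for the example, and a production version would at least need the modular form of the chirp exponent to keep precision for large N, plus the general chirp-z parameters (arbitrary starting point and ratio on the z-plane):

import numpy as np

def czt_dft(x):
    """DFT of arbitrary length N via Bluestein's chirp-z trick.

    Only power-of-two FFTs of length M >= 2N-1 are used internally,
    so the cost stays O(N log N) even when N is prime.
    """
    x = np.asarray(x, dtype=complex)
    N = len(x)
    n = np.arange(N)
    w = np.exp(-1j * np.pi * n**2 / float(N))      # chirp exp(-i*pi*n^2/N)
    M = 1 << int(np.ceil(np.log2(2 * N - 1)))      # next power of two >= 2N-1
    a = np.zeros(M, dtype=complex)
    a[:N] = x * w
    b = np.zeros(M, dtype=complex)
    b[:N] = np.conj(w)
    if N > 1:
        b[-(N - 1):] = np.conj(w[1:])[::-1]        # negative lags wrap around
    conv = np.fft.ifft(np.fft.fft(a) * np.fft.fft(b))
    return w * conv[:N]

# Sanity check against numpy's own FFT for a prime (awkward) length:
x = np.random.rand(17) + 1j * np.random.rand(17)
assert np.allclose(czt_dft(x), np.fft.fft(x))

The real effort in the project described above is not this kernel but wiring it into the existing FFTPACK machinery so that lengths with large prime factors are routed through it automatically.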