From ralf.gommers at gmail.com Thu Mar 1 02:21:01 2018
From: ralf.gommers at gmail.com (Ralf Gommers)
Date: Wed, 28 Feb 2018 23:21:01 -0800
Subject: [SciPy-Dev] GSoC 2018: Blendenpik (A least-square solver)
In-Reply-To:
References:
Message-ID:

Hi Jordi,

On Wed, Feb 28, 2018 at 7:51 AM, Jordi Montes wrote:

> Hello,
>
> I am Jordi Montes, an enthusiastic student of discrete mathematics wanting to participate in this GSoC.
>
> My idea is to take advantage of the current method for dimensionality reduction in scipy (clarkson_woodruff_transformation) and build a least-squares solver on top of it. The principal motivation is that it outperforms the other solvers in many real-world applications, even those in LAPACK.
>
> I already discussed this on the mailing list before the GSoC admission term started, but I have also written a formal proposal (attached to this email).

That looks like a good start! Note that the proposal you submit to Google needs to have some sections that are still missing, like a timeline (broken down by week, with the work for that week) and links to previous PRs. Here is the PSF template info to use: http://python-gsoc.org/studenttemplate.html

It would be useful to reference Blendenpik and add a few more details about it. Think also about answering a few obvious questions about it, like what the main challenges are when implementing it, and how you would benchmark its performance.

It's great to see that you've found a co-mentor who is intimately familiar with the domain and algorithms.

Cheers,
Ralf

> As always, feel free to ask about any doubt that comes to your mind or point to any improvement that you think I could make.
>
> Thanks,
>
> Jordi.
>
> _______________________________________________
> SciPy-Dev mailing list
> SciPy-Dev at python.org
> https://mail.python.org/mailman/listinfo/scipy-dev

-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From pierre.debuyl at kuleuven.be Thu Mar 1 04:18:21 2018
From: pierre.debuyl at kuleuven.be (Pierre de Buyl)
Date: Thu, 1 Mar 2018 10:18:21 +0100
Subject: [SciPy-Dev] GSoC 2018 : Rotation formalism
In-Reply-To:
References: <372936706.2127.1519741937865@wamui-jasmine.atl.sa.earthlink.net> <56F42377-BF68-4F8D-AAB0-7603C72A1D37@inria.fr>
Message-ID: <20180301091821.GI11663@pi-x230>

On Tue, Feb 27, 2018 at 10:06:56PM +0000, Matthew Brett wrote:
> Also : http://matthew-brett.github.io/transforms3d/
>
> This is mostly Christoph Gohlke's code, but I've been the maintainer for a while:
>
> * rotation matrices
> * quaternions

In the case of quaternions, there are two typical storages: [w, x, y, z] (scalar first) or [x, y, z, w] (scalar last). Is there a "standard" here?

Regards,

Pierre

From lagru at mailbox.org Thu Mar 1 06:20:10 2018
From: lagru at mailbox.org (Lars G.)
Date: Thu, 1 Mar 2018 12:20:10 +0100
Subject: [SciPy-Dev] GSoC 2018: Cythonizing
In-Reply-To:
References: <3648ed21-1b43-975c-4ca0-05926d683736@mailbox.org>
Message-ID:

On 28.02.2018 16:04, Eric Larson wrote:
> For GSoC we need to ensure (at least) that the project fits 1) the needs of SciPy, 2) the GSoC program scope / timeline, 3) possible mentors, and 4) your goals. My sense is that a proposal based on code Cythonizing (with proper benchmark testing and regression protection) would be good for SciPy maintainability and could be crafted to have a reasonable scope. In terms of mentors, I feel comfortable mentoring changes to the `signal` module but not `ndimage`, so we'd need to find a qualified primary volunteer mentor if that ends up being the primary proposal direction.

Actually, considering that my background lies in electrical engineering I'd be more than happy to focus on the `signal` module. And from the other response it seems like cythonizing `ndimage` wouldn't be a good idea.

> Another thing to keep in mind is that the list of GSoC ideas is not meant to be exhaustive.
> So if you have some other ideas for SciPy functionality, feel free to throw those out for discussion as well. In my experience, genuine intrinsic enthusiasm for a project -- finding something you'd enjoy working on in your free time even if you weren't getting paid to do so -- can help make for successful GSoC applications and experiences.

So there would be enough candidates for Cythonization in `scipy.signal` to fit the scope of GSoC? I myself can only guess where this would be wanted and useful.

It doesn't have to be Cythonizing either. I'd be happy to add missing functionality to the `signal` module or rework stuff that needs it. The content in https://docs.scipy.org/doc/scipy-1.0.0/reference/roadmap.html#signal doesn't seem to be a good fit for a GSoC project. The only thing I can think of right now is to extend the API for, and add more, adaptive filters: https://en.wikipedia.org/wiki/Adaptive_filter
Again, I'm not sure this is wanted or whether I'm judging the need correctly.

If you guys have any ideas or wishes in that direction I'd be happy to hear them.

Best regards,
Lars

From nikolay.mayorov at zoho.com Thu Mar 1 08:13:08 2018
From: nikolay.mayorov at zoho.com (Nikolay Mayorov)
Date: Thu, 01 Mar 2018 18:13:08 +0500
Subject: [SciPy-Dev] GSoC 2018: Cythonizing
In-Reply-To:
References: <3648ed21-1b43-975c-4ca0-05926d683736@mailbox.org>
Message-ID: <161e1b1e91a.f3b04efd27571.76008067328158801@zoho.com>

Hey, Lars!

I don't want to rob other potential ideas or mentors, but you mentioned that you think the rotation formalism idea is already taken by someone else. That is absolutely not the case, nothing is settled, and if you are interested in this subject --- I'm interested to see your ideas or a proposal (and Eric Larson likely as well).

Best,
Nikolay

---- On Thu, 01 Mar 2018 16:20:10 +0500 Lars G.
<lagru at mailbox.org> wrote ----

On 28.02.2018 16:04, Eric Larson wrote:
> For GSoC we need to ensure (at least) that the project fits 1) the needs of SciPy, 2) the GSoC program scope / timeline, 3) possible mentors, and 4) your goals. My sense is that a proposal based on code Cythonizing (with proper benchmark testing and regression protection) would be good for SciPy maintainability and could be crafted to have a reasonable scope. In terms of mentors, I feel comfortable mentoring changes to the `signal` module but not `ndimage`, so we'd need to find a qualified primary volunteer mentor if that ends up being the primary proposal direction.

Actually, considering that my background lies in electrical engineering I'd be more than happy to focus on the `signal` module. And from the other response it seems like cythonizing `ndimage` wouldn't be a good idea.

> Another thing to keep in mind is that the list of GSoC ideas is not meant to be exhaustive. So if you have some other ideas for SciPy functionality, feel free to throw those out for discussion as well. In my experience, genuine intrinsic enthusiasm for a project -- finding something you'd enjoy working on in your free time even if you weren't getting paid to do so -- can help make for successful GSoC applications and experiences.

So there would be enough candidates for Cythonization in `scipy.signal` to fit the scope of GSoC? I myself can only guess where this would be wanted and useful.

It doesn't have to be Cythonizing either. I'd be happy to add missing functionality to the `signal` module or rework stuff that needs it. The content in https://docs.scipy.org/doc/scipy-1.0.0/reference/roadmap.html#signal doesn't seem to be a good fit for a GSoC project. The only thing I can think of right now is to extend the API for and add more adaptive filters: https://en.wikipedia.org/wiki/Adaptive_filter
Again, I'm not sure this is wanted or if I'm judging the need correctly.

If you guys have any ideas or wishes in that direction I'd be happy to hear them.

Best regards,
Lars

_______________________________________________
SciPy-Dev mailing list
SciPy-Dev at python.org
https://mail.python.org/mailman/listinfo/scipy-dev

-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From nikolay.mayorov at zoho.com Thu Mar 1 08:21:51 2018
From: nikolay.mayorov at zoho.com (Nikolay Mayorov)
Date: Thu, 01 Mar 2018 18:21:51 +0500
Subject: [SciPy-Dev] GSoC 2018 : Rotation formalism
In-Reply-To: <20180301091821.GI11663@pi-x230>
References: <372936706.2127.1519741937865@wamui-jasmine.atl.sa.earthlink.net> <56F42377-BF68-4F8D-AAB0-7603C72A1D37@inria.fr> <20180301091821.GI11663@pi-x230>
Message-ID: <161e1b9e4d8.fe99cfce27647.3667928799815692430@zoho.com>

I believe there is no standard for this. And there are several other things similar to this, like "passive" or "active" view on rotations, multiplication order for composition, etc. In my opinion this is sort of a "soft challenge" for this idea --- document everything precisely.

Best regards,
Nikolay

---- On Thu, 01 Mar 2018 14:18:21 +0500 Pierre de Buyl <pierre.debuyl at kuleuven.be> wrote ----

On Tue, Feb 27, 2018 at 10:06:56PM +0000, Matthew Brett wrote:
> Also : http://matthew-brett.github.io/transforms3d/
>
> This is mostly Christoph Gohlke's code, but I've been the maintainer for a while:
>
> * rotation matrices
> * quaternions

In the case of quaternions, there are two typical storages: [w, x, y, z] (scalar first) or [x, y, z, w] (scalar last). Is there a "standard" here?

Regards,

Pierre

_______________________________________________
SciPy-Dev mailing list
SciPy-Dev at python.org
https://mail.python.org/mailman/listinfo/scipy-dev

-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From lagru at mailbox.org Thu Mar 1 09:32:33 2018
From: lagru at mailbox.org (Lars G.)
Date: Thu, 1 Mar 2018 15:32:33 +0100
Subject: [SciPy-Dev] GSoC 2018: Cythonizing
In-Reply-To: <161e1b1e91a.f3b04efd27571.76008067328158801@zoho.com>
References: <3648ed21-1b43-975c-4ca0-05926d683736@mailbox.org> <161e1b1e91a.f3b04efd27571.76008067328158801@zoho.com>
Message-ID:

On 01.03.2018 14:13, Nikolay Mayorov wrote:
> Hey, Lars!
>
> I don't want to rob other potential ideas or mentors, but you mentioned that you think that rotation formalism idea is already for someone else. It is absolutely not the case, nothing is settled, and if you are interested in this subject --- I'm interested to see your ideas or a proposal (and Eric Larson likely as well).
>
> Best,
> Nikolay

The topic does indeed sound interesting, and from what it looks like you already have a pretty clear description of the scope, structure and goals. However I have never done any relevant programming in that area, so I currently don't feel very confident that I'll be able to come up with a sensible API for that or make informed decisions. First, I'll see what comes of my first suggestions. In the meantime I will look through the linked references and see if I feel more confident afterwards.

Best regards,
Lars

From toddrjen at gmail.com Thu Mar 1 10:40:16 2018
From: toddrjen at gmail.com (Todd)
Date: Thu, 1 Mar 2018 10:40:16 -0500
Subject: [SciPy-Dev] GSoC 2018: Cythonizing
In-Reply-To:
References: <3648ed21-1b43-975c-4ca0-05926d683736@mailbox.org>
Message-ID:

On Mar 1, 2018 06:20, "Lars G." wrote:

> On 28.02.2018 16:04, Eric Larson wrote:
> > For GSoC we need to ensure (at least) that the project fits 1) the needs of SciPy, 2) the GSoC program scope / timeline, 3) possible mentors, and 4) your goals. My sense is that a proposal based on code Cythonizing (with proper benchmark testing and regression protection) would be good for SciPy maintainability and could be crafted to have a reasonable scope.
> > In terms of mentors, I feel comfortable mentoring changes to the `signal` module but not `ndimage`, so we'd need to find a qualified primary volunteer mentor if that ends up being the primary proposal direction.
>
> Actually, considering that my background lies in electrical engineering I'd be more than happy to focus on the `signal` module. And from the other response it seems like cythonizing `ndimage` wouldn't be a good idea.
>
> > Another thing to keep in mind is that the list of GSoC ideas is not meant to be exhaustive. So if you have some other ideas for SciPy functionality, feel free to throw those out for discussion as well. In my experience, genuine intrinsic enthusiasm for a project -- finding something you'd enjoy working on in your free time even if you weren't getting paid to do so -- can help make for successful GSoC applications and experiences.
>
> So there would be enough candidates for Cythonization in `scipy.signal` to fit the scope of GSoC? I myself can only guess where this would be wanted and useful.
>
> It doesn't have to be Cythonizing either. I'd be happy to add missing functionality to the `signal` module or rework stuff that needs it. The content in https://docs.scipy.org/doc/scipy-1.0.0/reference/roadmap.html#signal doesn't seem to be a good fit for a GSoC project. The only thing I can think of right now is to extend the API for and add more adaptive filters: https://en.wikipedia.org/wiki/Adaptive_filter
> Again, I'm not sure this is wanted or if I'm judging the need correctly.
>
> If you guys have any ideas or wishes in that direction I'd be happy to hear them.
>
> Best regards,
> Lars

The first issue listed in the roadmap, convolution, is a much more complicated issue than that description makes out. There are a few issues, some with some overlap behind-the-scenes:

1. As discussed, there are a bunch of different implementations that use different algorithms, each working better in different scenarios. Ideally there would be one "master" function that would pick the best algorithm for a given set of parameters. This will depend on the number of dimensions to be convolved over, the size of the first signal to be convolved, and the size of the second signal to be convolved. Changing any one of these can change which implementation is optimal, or even useful. So with vectors, it is better to use a different algorithm if one vector is short, if both vectors are long but one is much longer, and if both vectors are long and of similar length.

2. We don't have the best algorithms implemented for all of these scenarios. For example, the "both vectors are long but one is much longer" scenario is best served by the overlap-add algorithm, which scipy doesn't have. Similarly, there is an FFT-based version of correlation equivalent to fftconvolve that isn't implemented, 2D and n-d versions of FFT convolution and correlation that aren't implemented, etc.

3. The implementations only work over the number of dimensions they apply to. So the 1D implementations can only take vectors, the 2D implementations can only take 2D arrays, etc. There is no way to, say, apply a filter along the second dimension of a 3D signal. In order to implement the "master" function, at least one implementation (and ideally all implementations) should be able to be applied across additional dimensions.

And there is overlap between these. For example, I mention the overlap-add method in point 2, but that would most likely be implemented in part by applying across dimensions as mentioned in point 3.

A lot of these issues apply elsewhere in scipy.signal. For example, the stft/spectrogram uses a slow, naive implementation, and a lot of the functions don't support applying across multidimensional arrays (for example to create a filter bank).
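[Editor's note: the overlap-add scheme mentioned in point 2 can be sketched in a few lines of NumPy. This is only an illustration of the algorithm, not scipy code; the function name and default block size are made up. It splits the long signal into blocks, convolves each block with the short filter via an FFT that is long enough to avoid circular wrap-around, and sums the overlapping tails:]

```python
import numpy as np

def overlap_add_convolve(x, h, block=1024):
    """Full linear convolution of a long 1-D signal x with a short
    filter h using the overlap-add method (illustrative sketch)."""
    x = np.asarray(x, dtype=float)
    h = np.asarray(h, dtype=float)
    n = len(h)
    nfft = block + n - 1            # room for linear convolution of one block
    H = np.fft.rfft(h, nfft)        # filter spectrum, computed once
    y = np.zeros(len(x) + n - 1)
    for start in range(0, len(x), block):
        seg = x[start:start + block]
        # nfft >= len(seg) + n - 1, so the circular FFT convolution
        # equals the linear convolution of this block with h
        conv = np.fft.irfft(np.fft.rfft(seg, nfft) * H, nfft)
        y[start:start + len(seg) + n - 1] += conv[:len(seg) + n - 1]
    return y
```

[The result should match np.convolve(x, h) to floating-point precision, while each block only costs an FFT of size block + len(h) - 1, which is what makes this attractive when one signal is much longer than the other.]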
-------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Thu Mar 1 11:15:53 2018 From: charlesr.harris at gmail.com (Charles R Harris) Date: Thu, 1 Mar 2018 09:15:53 -0700 Subject: [SciPy-Dev] Spline interpolation in ndimage In-Reply-To: References: Message-ID: On Tue, Feb 27, 2018 at 4:59 AM, Jaime Fern?ndez del R?o < jaime.frio at gmail.com> wrote: > Hi all, > > We have been discussing in issue #8465 > about the extension modes > for the interpolation module of ndimage. As some other issues linked in > that one show, this is an old problem, that has been recurringly discussed, > and there mostly is agreement on what the correct behavior should be. But > as all things ndimage, implementing any change is hard, in this case not > only because the code is complicated, but also because of the mathematical > subtleties of the interpolation method used. > > In trying to come up with a way forward for this I think I can handle the > code complexity, but am having trouble being sure that I come up with a > sound math approach. I think I have more or less figured it out, but I > don't have a good academic background on signal processing, so was hoping > that someone who does (I'm thinking of you, Chuck!) can read my longish > description of things below and validate or correct it. > > My main source of knowledge has been: > > M. Unser, "Splines: a perfect fit for signal and image processing," in > IEEE Signal Processing Magazine, vol. 16, no. 6, pp. 22-38, Nov 1999. > > Recommendations for bigger, better readings on the subject are also > welcome! > > ----- > > In what follows I'll assume the input image is 2-D and NxN for simplicity, > but I think everything generalizes easily > > If I'm understanding it right, all of our interpolation functions use > B-splines for interpolation. 
So instead of having NxN discrete values on a > grid, we have NxN B-splines centered at the grid points, and we keep the > NxN coefficients multiplying each spline to reproduce the original image at > the grid points exactly. If the spline order is < 2, the spline > coefficients are the same as the image values. But if order >= 2 (and we > use 3 by default), these have to be calculated in a non-trivial fashion. > This can be done efficiently by applying a separable filter, which is > implemented in ndimage.spline_filter. > > Because B-splines have compact support, when using splines of order n we > only need to consider the B-splines on an (n + 1)x(n + 1) grid neighborhood > of the point being interpolated. This is more or less straightforward, > until you move close to the edges and all of a sudden some of the points in > your grid neighborhood fall outside the original image grid. We have our > extend mode, which controls how this points outside should be mapped to > points inside. But here is where things start getting tricky... > > When the spline coefficients are computed (i.e. when ndimage.spline_filter > is called), assumptions have to be made about boundary conditions. But > ndimage.spline_filter does not take a mode parameter to control this! So > what I think ndimage does is compute the spline coefficients assuming > "mirror symmetric" boundary conditions, i.e.: > > a b c d c b | a b c d | c b a b c d > > So if our interpolated point is within the image boundaries, but some of > the grid points in its (n + 1)x(n + 1) neighborhood fall outside the > boundary, the code uses a mirror symmetric extension mode to fill in those > values. This approach has a funny smell to it, but even if it's formally > incorrect I think it would only be marginally so, as the values are > probably going to be very close to the correct ones. > > The problem comes when the point being interpolated is outside the > boundaries of the image. 
We cannot use mirror-symmetric spline coefficients > to extend if e.g. we have been asked to extend using wrap mode. So what > ndimage does is first map the value outside the boundaries to a value > within the boundaries, using the given extension mode, then interpolate it > as before, using mirror-symmetric coefficients if needed because its (n + > 1)x(n + 1) neighborhood extends outside. Again, this smells funny, but it > is either correct or very close to correct. > The problem with the factorization is that it assumes infinite data points in all dimensions, i.e, no explicit boundary conditions. With finite data there are edge effects when the data is deconvolved to get the spline coefficients. The way that ndimage deals with that is to extend the data using reflection and start far enough away that the edge effects have died away by the time the "real" data has been reached. How far away that is, is heuristic. I think it should not be too difficult to extend the data in the other ways, but note that since uniform splines are being used, the relevant coefficients lie outside the boundary and need to be picked up from the correct spots in the interior using the symmetries. IIRC, the b-splines are always centered at zero so that they are symmetrical. For odd order splines that will be at the center of a pixel (pixel points), for even order splines at an edge, pixel centers at half integer points. The data can always be considered to be at pixel centers, but the splines are displaced. I don't remember if ndimage treats that correctly. > This is mostly all good and well, except for the "wrap" and "reflect" > extension modes: in these cases the area within one pixel of the image > boundaries is different from anything inside the image, so we cannot use > that approach. So what ndimage does is basically make shit up and use > something similar, but not quite right. 
"reflect" is mostly correct, except > for within that pixel of the boundary, but "wrap" is a surprising and > confusing mess. > > So how do we fix this? I see two ways forward: > > 1. What seems the more correct approach would be to compute the spline > coefficients taking into account the extension mode to be used, then use > the same extension mode to fill in the neighborhood values when > interpolating for a point outside the boundaries. > 1. First question is whether this is doable? I need to work out the > math, but for "wrap" it looks like it should be, not 100% sure if also is > for "reflect". > 2. Assuming it is it has the main advantage of doing things in a > more general and understandable way once you have enough background > knowledge. > 3. It does go a little bit against our API design: you can control > whether the input is spline-filtered automatically with a parameter, the > idea being that you may want to do the filtering yourself if you are going > to apply several different transformations to the same image. If the mode > of the filtering has to be synced with the mode of the transformation, > letting the user do it themselves is a recipe for disaster, because it's > going to lead to very hard to track bugs. > 4. As elsewhere in ndimage, the current implementation does a lot > of caching, which works because it always interpolates for a point within > the image boundaries. If we started interpolating for points outside the > boundaries without first mapping to within there may be a performance hit > which has to be evaluated. > 2. The other approach is, for "wrap" and "reflect" modes, pad the > input image with an extra pixel in each direction, then compute our > current "mirror symmetric" spline coefficients, and leave things as they > are right now, aside from some changes to the mapping of values to take the > extra pixels into account. > 1. 
This looks like a nightmare of special cases everywhere and > potential off-by-one errors while putting it together, but it would just go > along with the ugliness of the existing code. > 2. It's unclear what we would do if we are given an input with > prefilter=False, so this also breaks the current API, probably even more so. > > Any thoughts or recommendations are very welcome! > Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Fri Mar 2 15:24:33 2018 From: charlesr.harris at gmail.com (Charles R Harris) Date: Fri, 2 Mar 2018 13:24:33 -0700 Subject: [SciPy-Dev] Spline interpolation in ndimage In-Reply-To: References: Message-ID: On Thu, Mar 1, 2018 at 9:15 AM, Charles R Harris wrote: > > > On Tue, Feb 27, 2018 at 4:59 AM, Jaime Fern?ndez del R?o < > jaime.frio at gmail.com> wrote: > >> Hi all, >> >> We have been discussing in issue #8465 >> about the extension modes >> for the interpolation module of ndimage. As some other issues linked in >> that one show, this is an old problem, that has been recurringly discussed, >> and there mostly is agreement on what the correct behavior should be. But >> as all things ndimage, implementing any change is hard, in this case not >> only because the code is complicated, but also because of the mathematical >> subtleties of the interpolation method used. >> >> In trying to come up with a way forward for this I think I can handle the >> code complexity, but am having trouble being sure that I come up with a >> sound math approach. I think I have more or less figured it out, but I >> don't have a good academic background on signal processing, so was hoping >> that someone who does (I'm thinking of you, Chuck!) can read my longish >> description of things below and validate or correct it. >> >> My main source of knowledge has been: >> >> M. Unser, "Splines: a perfect fit for signal and image processing," in >> IEEE Signal Processing Magazine, vol. 16, no. 
6, pp. 22-38, Nov 1999. >> >> Recommendations for bigger, better readings on the subject are also >> welcome! >> >> ----- >> >> In what follows I'll assume the input image is 2-D and NxN for >> simplicity, but I think everything generalizes easily >> >> If I'm understanding it right, all of our interpolation functions use >> B-splines for interpolation. So instead of having NxN discrete values on a >> grid, we have NxN B-splines centered at the grid points, and we keep the >> NxN coefficients multiplying each spline to reproduce the original image at >> the grid points exactly. If the spline order is < 2, the spline >> coefficients are the same as the image values. But if order >= 2 (and we >> use 3 by default), these have to be calculated in a non-trivial fashion. >> This can be done efficiently by applying a separable filter, which is >> implemented in ndimage.spline_filter. >> >> Because B-splines have compact support, when using splines of order n we >> only need to consider the B-splines on an (n + 1)x(n + 1) grid neighborhood >> of the point being interpolated. This is more or less straightforward, >> until you move close to the edges and all of a sudden some of the points in >> your grid neighborhood fall outside the original image grid. We have our >> extend mode, which controls how this points outside should be mapped to >> points inside. But here is where things start getting tricky... >> >> When the spline coefficients are computed (i.e. when >> ndimage.spline_filter is called), assumptions have to be made about >> boundary conditions. But ndimage.spline_filter does not take a mode >> parameter to control this! 
So what I think ndimage does is compute the >> spline coefficients assuming "mirror symmetric" boundary conditions, i.e.: >> >> a b c d c b | a b c d | c b a b c d >> >> So if our interpolated point is within the image boundaries, but some of >> the grid points in its (n + 1)x(n + 1) neighborhood fall outside the >> boundary, the code uses a mirror symmetric extension mode to fill in those >> values. This approach has a funny smell to it, but even if it's formally >> incorrect I think it would only be marginally so, as the values are >> probably going to be very close to the correct ones. >> >> The problem comes when the point being interpolated is outside the >> boundaries of the image. We cannot use mirror-symmetric spline coefficients >> to extend if e.g. we have been asked to extend using wrap mode. So what >> ndimage does is first map the value outside the boundaries to a value >> within the boundaries, using the given extension mode, then interpolate it >> as before, using mirror-symmetric coefficients if needed because its (n + >> 1)x(n + 1) neighborhood extends outside. Again, this smells funny, but it >> is either correct or very close to correct. >> > > The problem with the factorization is that it assumes infinite data points > in all dimensions, i.e, no explicit boundary conditions. With finite data > there are edge effects when the data is deconvolved to get the spline > coefficients. The way that ndimage deals with that is to extend the data > using reflection and start far enough away that the edge effects have died > away by the time the "real" data has been reached. How far away that is, is > heuristic. I think it should not be too difficult to extend the data in the > other ways, but note that since uniform splines are being used, the > relevant coefficients lie outside the boundary and need to be picked up > from the correct spots in the interior using the symmetries. 
> > IIRC, the b-splines are always centered at zero so that they are > symmetrical. For odd order splines that will be at the center of a pixel > (pixel points), for even order splines at an edge, pixel centers at half > integer points. The data can always be considered to be at pixel centers, > but the splines are displaced. I don't remember if ndimage treats that > correctly. > > > >> This is mostly all good and well, except for the "wrap" and "reflect" >> extension modes: in these cases the area within one pixel of the image >> boundaries is different from anything inside the image, so we cannot use >> that approach. So what ndimage does is basically make shit up and use >> something similar, but not quite right. "reflect" is mostly correct, except >> for within that pixel of the boundary, but "wrap" is a surprising and >> confusing mess. >> >> So how do we fix this? I see two ways forward: >> >> 1. What seems the more correct approach would be to compute the >> spline coefficients taking into account the extension mode to be used, then >> use the same extension mode to fill in the neighborhood values when >> interpolating for a point outside the boundaries. >> 1. First question is whether this is doable? I need to work out the >> math, but for "wrap" it looks like it should be, not 100% sure if also is >> for "reflect". >> 2. Assuming it is it has the main advantage of doing things in a >> more general and understandable way once you have enough background >> knowledge. >> 3. It does go a little bit against our API design: you can control >> whether the input is spline-filtered automatically with a parameter, the >> idea being that you may want to do the filtering yourself if you are going >> to apply several different transformations to the same image. If the mode >> of the filtering has to be synced with the mode of the transformation, >> letting the user do it themselves is a recipe for disaster, because it's >> going to lead to very hard to track bugs. 
>> 4. As elsewhere in ndimage, the current implementation does a lot >> of caching, which works because it always interpolates for a point within >> the image boundaries. If we started interpolating for points outside the >> boundaries without first mapping to within there may be a performance hit >> which has to be evaluated. >> 2. The other approach is, for "wrap" and "reflect" modes, pad the >> input image with an extra pixel in each direction, then compute our >> current "mirror symmetric" spline coefficients, and leave things as they >> are right now, aside from some changes to the mapping of values to take the >> extra pixels into account. >> 1. This looks like a nightmare of special cases everywhere and >> potential off-by-one errors while putting it together, but it would just go >> along with the ugliness of the existing code. >> 2. It's unclear what we would do if we are given an input with >> prefilter=False, so this also breaks the current API, probably even more so. >> >> Any thoughts or recommendations are very welcome! >> > > To explicate a bit more as to why the b-splines are centered, consider quadratic and cubic splines when the data points are considered to occur at the pixel centers. *Quadratic (even order)* 1. If the data points are taken to be halfway between knot points, we need to deconvolve the sequence `array([1, 6, 1])/8`. The Fourier transform of that has no zeros, indeed, is rather smooth, and two extra coefficients, one at each end, are required to interpolate out to the pixel edges. It is all nicely symmetrical. 2. If the data points are taken to correspond to the knot points, we need to deconvolve the sequence `array([1, 1])/2`. The Fourier transform of that sequence has a zero at the Nyquist, not good, and one extra coefficient is needed at some arbitrary end in order to get interpolation out to the pixel edges. *Cubic (odd order)* 1. 
Taking the data points to correspond to the knot points, we need to deconvolve the sequence `array([1, 4, 1])/6`, whose Fourier transform is nicely behaved. However *four* extra coefficients, two at each end, are required to interpolate out to the pixel edges. The "extra" coefficients are needed in order to cover the outer half of the edge pixels, otherwise we are extrapolating. I haven't looked, but I wonder if ndimage gets that right? In both cases, the corner pixels may be a bit of a problem. I haven't thought through that bit. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Fri Mar 2 16:57:18 2018 From: charlesr.harris at gmail.com (Charles R Harris) Date: Fri, 2 Mar 2018 14:57:18 -0700 Subject: [SciPy-Dev] GSoC 2018: Cythonizing In-Reply-To: References: <3648ed21-1b43-975c-4ca0-05926d683736@mailbox.org> Message-ID: On Wed, Feb 28, 2018 at 9:11 AM, Jaime Fernández del Río < jaime.frio at gmail.com> wrote: > On Wed, Feb 28, 2018 at 3:40 PM Lars G. wrote: > >> Dear SciPy devs, >> >> I'm currently thinking about an application for this year's GSoC as >> well. As there already seems to be a large interest in the rotation >> formalism I'm trying to find another area that matches my interest and >> skill. >> >> I've dug up this proposal in scikit-image from GSoC 2015 >> https://github.com/scikit-image/scikit-image/wiki/GSoC-2015#rewriting-scipyndimage-in-cython >> and judging by the state of scipy/ndimage/src/ nobody has worked on this >> proposal yet (feel free to correct me). >> > > I mentored the not very successful, to put it mildly, GSoC 2015 project > about cythonizing ndimage. From that experience, and further work > afterwards, I no longer think Cython is the answer to ndimage's problems. > The underlying C has a lot of very complicated code making lots of clever > (often too clever for everyone's good) uses of pointer magic, which I > honestly think are better kept in C.
Or it would at least need someone with a > very deep understanding of both C and Cython, it would much exceed the scope > of a GSoC project, and I don't think I can commit to properly mentoring such a > project this summer. > > What would be nice is to replace the current nd_image.c file that > implements the Python interface to the underlying C with a Cython > implementation. That is not enough for a GSoC project, and it's not the > most exciting thing to work on either. But if you want to put a full > project together out of smaller, Cython-related subprojects, this could > certainly be a part of it, and I wouldn't mind mentoring that subproject. > > I think the spline bits could be vectorized and rewritten in Python without too much loss of speed. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From lagru at mailbox.org Sat Mar 3 02:38:11 2018 From: lagru at mailbox.org (Lars G.) Date: Sat, 3 Mar 2018 08:38:11 +0100 Subject: [SciPy-Dev] GSoC 2018: Cythonizing In-Reply-To: References: <3648ed21-1b43-975c-4ca0-05926d683736@mailbox.org> Message-ID: <7800f9b4-51b9-39cf-9ac8-9eae88e2eab9@mailbox.org> On 01.03.2018 16:40, Todd wrote: > The first issue listed in the roadmap, convolution, is a much more > complicated issue than that description makes out. There are a few > issues, some with some overlap behind-the-scenes: > > 1. As discussed, there are a bunch of different implementations > that use different algorithms that work better in different scenarios. > Ideally there would be one "master" function that would pick the best > algorithm for a given set of parameters. This will depend on the number > of dimensions to be convolved over, the size of the first signal to > be convolved, and the size of the second signal to be convolved. > Changing any one of these can change which implementation is optimal, or > even useful.
So for vectors, it is better to use a different > algorithm if one vector is short, if both vectors are long but one > is much longer, and if both vectors are long and of similar length. > 2. We don't have the best algorithms implemented for all of these > scenarios. For example the "both vectors are long but one is much > longer" scenario is best with the overlap-add algorithm, which scipy > doesn't have. > Similarly, there is an fft-based version of correlation > equivalent to fftconvolve that isn't implemented, 2D and n-d versions of > fft convolution and correlation that aren't implemented, etc. > 3. The implementations only work over the number of dimensions they > apply to. So the 1D implementations can only take vectors, the 2D > implementations can only take 2D arrays, etc. There is no way to, say, > apply a filter along the second dimension of a 3D signal. In order to > implement the "master" function, at least one implementation (and > ideally all implementations) should be able to be applied across > additional dimensions. > > And there is overlap between these. For example I mention the > overlap-add method in point 2, but that would most likely be implemented > in part by applying across dimensions as mentioned in point 3. > > A lot of these issues apply elsewhere in scipy.signal. For example the > stft/spectrogram uses a slow, naive implementation. A lot of the > functions don't support applying across multidimensional arrays (for > example to create a filter bank). So you're saying this could be a possible GSoC project? Because this does sound the most interesting to me so far. To make sure I understand this correctly: - I would work with the two modules `signal` and `ndimage` as well as NumPy (`numpy.convolve`)? - I would unify, redesign and extend the parts / API that deal with convolution with the goal of covering the most common use cases and minimizing overlap. - Is somebody willing to mentor this?
- Required knowledge would involve understanding different algorithms to implement convolution as well as optimization, Python, Cython, C, ...? - How would you judge the size and difficulty of this task? Thank you all for the feedback so far. :) Best regards, Lars From anubhavp28 at gmail.com Sat Mar 3 06:26:28 2018 From: anubhavp28 at gmail.com (Anubhav Patel) Date: Sat, 3 Mar 2018 16:56:28 +0530 Subject: [SciPy-Dev] Contributing to SciPy through GSoC In-Reply-To: References: Message-ID: Hi, I wanted feedback regarding whether a combination of a rotation class and implementations of the quaternion SLERP algorithm and Davenport's Q-method for solving Wahba's problem will be enough for GSoC? Should I include more rotation-related algorithms for implementation? Any suggestions what more I could do? On Mon, Feb 26, 2018 at 9:30 PM, Ralf Gommers wrote: > Hi Anubhev, > > On Mon, Feb 26, 2018 at 2:12 AM, Anubhav Patel > wrote: > >> Hi everyone, >> I want to work on SciPy as part of GSoC and I have few queries. >> >> 1. On the Ideas Page, there was a mention of scipy.spatial.transform >> module. I want to know what will be the exact purpose of this module? >> > > Did you read the whole idea? There's a lot of detail. It says for example > "The aim of this project is to create a module which will allow to > conveniently describe, apply and compose rotations. ". That answer your > question I think. > > >> >> 2. Whether the idea for a module for numerical differentiation was >> dropped completely? >> > > Yes, for now that's off the table - at least not feasible for a GSoC we've > concluded after several attempts. > > >> >> 3. Apart from those ideas listed on ideas page, are there any other area >> where you guys would like to see contribution on? >> > > Ideas for new features on http://scipy.github.io/devdocs/roadmap.html are > of interest, or ones you may have yourself. But given that they're not on > the ideas page, it's not guaranteed we can find mentors for those.
> > Cheers, > Ralf > > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Sat Mar 3 10:05:14 2018 From: charlesr.harris at gmail.com (Charles R Harris) Date: Sat, 3 Mar 2018 08:05:14 -0700 Subject: [SciPy-Dev] GSoC 2018: Cythonizing In-Reply-To: <7800f9b4-51b9-39cf-9ac8-9eae88e2eab9@mailbox.org> References: <3648ed21-1b43-975c-4ca0-05926d683736@mailbox.org> <7800f9b4-51b9-39cf-9ac8-9eae88e2eab9@mailbox.org> Message-ID: On Sat, Mar 3, 2018 at 12:38 AM, Lars G. wrote: > On 01.03.2018 16:40, Todd wrote: > > The first issue listed in the roadmap, convolution, is a much more > > complicated issue than that description makes out. There are a few > > issues, some with some overlap behind-the-scenes: > > > > 1. As discussed, there are a bunch of different implementations > > that use different algorithms that work better in different scenarios. > > Ideally there would be one "master" function that would pick the best > > algorithm for a given set of parameters. This will depend on the number > > of dimensions to be convolved over, the size of the first signal to > > be convolved, and the size of the second signal to be convolved. > > Changing any one of these can change which implementation is optimal, or > > even useful. So for vectors, it is better to use a different > > algorithm if one vector is short, if both vectors are long but one > > is much longer, and if both vectors are long and of similar length. > > 2. We don't have the best algorithms implemented for all of these > > scenarios. For example the "both vectors are long but one is much > > longer" scenario is best with the overlap-add algorithm, which scipy > > doesn't have.
Similarly, there is an fft-based version of correlation > > equivalent to fftconvolve that isn't implemented, 2D and n-d versions of > > fft convolution and correlation that aren't implemented, etc. > > 3. The implementations only work over the number of dimensions they > > apply to. So the 1D implementations can only take vectors, the 2D > > implementations can only take 2D arrays, etc. There is no way to, say, > > apply a filter along the second dimension of a 3D signal. In order to > > implement the "master" function, at least one implementation (and > > ideally all implementations) should be able to be applied across > > additional dimensions. > > > > And there is overlap between these. For example I mention the > > overlap-add method in point 2, but that would most likely be implemented > > in part by applying across dimensions as mentioned in point 3. > > > > A lot of these issues apply elsewhere in scipy.signal. For example the > > stft/spectrogram uses a slow, naive implementation. A lot of the > > functions don't support applying across multidimensional arrays (for > > example to create a filter bank). > > So you're saying this could be a possible GSoC project? Because this > does sound the most interesting to me so far. > > To make sure I understand this correctly: > > - I would work with the two modules `signal` and `ndimage` as well as > NumPy (`numpy.convolve`)? > - I would unify, redesign and extend the parts / API that deal with > convolution with the goal to cover the most common use cases and > minimize overlap. > - Is somebody willing to mentor this? > - Required knowledge would involve understanding different algorithms to > implement convolution as well as optimization, Python, Cython, C, ...? > - How would you judge the size and difficulty of this task? 
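As background for the algorithm discussion above, the overlap-add method (which the thread notes SciPy lacked at the time) can be sketched in a few lines of NumPy. The function name, default block size, and chunking scheme below are illustrative choices of this sketch, not an existing SciPy or NumPy API:

```python
import numpy as np

def overlap_add_convolve(x, h, block=1024):
    """Linear convolution of a long signal x with a short filter h,
    computed block-by-block with FFTs (the overlap-add scheme)."""
    x = np.asarray(x, dtype=float)
    h = np.asarray(h, dtype=float)
    m = len(h)
    nfft = block + m - 1               # long enough to avoid circular wrap-around
    H = np.fft.rfft(h, nfft)           # filter spectrum, computed once
    y = np.zeros(len(x) + m - 1)
    for start in range(0, len(x), block):
        seg = x[start:start + block]
        # FFT-multiply-IFFT gives the linear convolution of this block...
        conv = np.fft.irfft(np.fft.rfft(seg, nfft) * H, nfft)
        # ...whose tail overlaps the next block; accumulate the pieces.
        y[start:start + len(seg) + m - 1] += conv[:len(seg) + m - 1]
    return y
```

Choosing the block size, and deciding when overlap-add beats direct or plain FFT convolution, is exactly the kind of dispatch logic the proposed "master" function would have to encapsulate.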
> It would be a difficult project for GSoC, as a lot would depend on identifying and designing the underlying common algorithms, including APIs. That makes it a step beyond just implementing something known in advance; it requires strong background knowledge and familiarity with the relevant SciPy modules. I would be hesitant to propose it unless it could be trimmed down to just one or two functions that are well defined before the project starts. Just as an example of how the complexity grows, NumPy convolution assumes finite sequences extended to +/- inf with zeros, whereas convolution used for interpolation and filtering will have a number of choices for edge conditions, some of which would be best handled with one of the discrete cosine transforms. I don't think anyone has sat down and figured out how to organize all that, much less proposed a roadmap to implement it. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From 3ukip0s02 at sneakemail.com Sat Mar 3 12:47:57 2018 From: 3ukip0s02 at sneakemail.com (3ukip0s02 at sneakemail.com) Date: Sat, 3 Mar 2018 12:47:57 -0500 Subject: [SciPy-Dev] SciPy-Dev Digest, Vol 173, Issue 3 In-Reply-To: References: Message-ID: <26548-1520099298-414934@sneakemail.com> On Thu Mar 1 10:40:16 EST 2018, Todd wrote: > Similarly, there is an fft-based version of correlation equivalent to > fftconvolve that isn't implemented, > > 2D and n-d versions of fft convolution > and correlation that aren't implemented, etc. > correlate and convolve are both N-D, and implemented using fftpack when possible, which is compiled Fortran, so I'm not sure those would benefit much. convolve2d and correlate2d could benefit from FFT. The only reason they don't is the boundary conditions, but I think they could be adapted to FFT if the inputs were extended in the relevant ways first? > A lot of these issues apply elsewhere in scipy.signal. For example the > stft/spectrogram uses a slow, naive implementation.
All of the _spectral_helper functions like stft() and spectrogram() are fftpack-based, as well. Is there a lot of room for improvement here? -------------- next part -------------- An HTML attachment was scrubbed... URL: From mikegraham at gmail.com Sat Mar 3 18:29:50 2018 From: mikegraham at gmail.com (Mike Graham) Date: Sat, 3 Mar 2018 18:29:50 -0500 Subject: [SciPy-Dev] SciPy IRC channel lacks moderation In-Reply-To: References: Message-ID: On Wed, Feb 28, 2018 at 11:49 PM, Ralf Gommers wrote: > > > On Mon, Feb 26, 2018 at 9:53 AM, Mike Graham wrote: > >> On Fri, Feb 23, 2018 at 1:25 AM, Ralf Gommers >> wrote: >> >>> >>> I'm fine with either option, discontinuing or moderation. Either way we >>> should make it clear that scipy devs don't hang out there. >>> >> >> For what it's worth, people still get help with numpy, scipy, and other >> scientific programming issues in the channel every day. Nathan remarked >> that closing the channel was "a really dumb idea". ;) >> >> If you want to appoint Nathan (ngoldbaum on Freenode) and/or me (papna on >> freenode) as group contact to have moderator tools, the freenode people >> will probably do that if we can point them to this mailing list thread. >> They will just want to see that it is what the project wants. >> > > Okay, that seems fine - if Nathan and you want to do that, and it's > helpful for a part of the community, then that seems like a good idea. > Thanks for stepping up. Could you please request access to those moderator > tools for Nathan and yourself, and point them to this thread? > Many thanks! Best, Mike -------------- next part -------------- An HTML attachment was scrubbed... URL: From hritiknarayan at gmail.com Sun Mar 4 03:38:02 2018 From: hritiknarayan at gmail.com (Hritik Narayan) Date: Sun, 4 Mar 2018 14:08:02 +0530 Subject: [SciPy-Dev] GSoC Message-ID: Hey everyone, I want to contribute to SciPy via GSoC and want to get in touch with possible mentors for clarification.
Mainly, are the final proposals to be mailed directly to the mentor, or are they to be posted on a mailing list? -- Hritik, UTC +5:30 -------------- next part -------------- An HTML attachment was scrubbed... URL: From tyler.je.reddy at gmail.com Sun Mar 4 13:39:49 2018 From: tyler.je.reddy at gmail.com (Tyler Reddy) Date: Sun, 4 Mar 2018 11:39:49 -0700 Subject: [SciPy-Dev] Contributing to SciPy through GSoC In-Reply-To: References: Message-ID: It is perhaps worth noting that I have written a low-level (Cython) Slerp function in https://github.com/scipy/scipy/pull/8069. There was no real intention to make that a standalone user-facing function, but if things do move forward with rotation-related development, worth keeping in mind that we have some pretty well-working source code for that particular routine. Even if the referenced PR doesn't get merged some day, could always cannibalize _slerp from there as a starting point. On 3 March 2018 at 04:26, Anubhav Patel wrote: > Hi, > I wanted feedback regarding whether a combination of rotation class and > implementation of quaternion SLERP algorithm and Davenport's Q-method > solving Wahba's Problem, will be enough for GSoC? Should I include more > rotation related algorithm for implementation? Any suggestions what more I > could do? > > On Mon, Feb 26, 2018 at 9:30 PM, Ralf Gommers > wrote: > >> Hi Anubhev, >> >> On Mon, Feb 26, 2018 at 2:12 AM, Anubhav Patel >> wrote: >> >>> Hi everyone, >>> I want to work on SciPy as part of GSoC and I have few queries. >>> >>> 1. On the Ideas Page, there was a mention of scipy.spatial.transform >>> module. I want to know what will be the exact purpose of this module? >>> >> >> Did you read the whole idea? There's a lot of detail. It says for example >> "The aim of this project is to create a module which will allow to >> conveniently describe, apply and compose rotations. ". That answer your >> question I think. >> >> >>> >>> 2. 
Whether the idea for a module for numerical differentiation was >>> dropped completely? >>> >> >> Yes, for now that's off the table - at least not feasible for a GSoC >> we've concluded after several attempts. >> >> >>> >>> 3. Apart from those ideas listed on ideas page, are there any other area >>> where you guys would like to see contribution on? >>> >> >> Ideas for new features on http://scipy.github.io/devdocs/roadmap.html >> are of interest, or ones you may have yourself. But given that they're not >> on the ideas page, it's not guaranteed we can find mentors for those. >> >> Cheers, >> Ralf >> >> >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at python.org >> https://mail.python.org/mailman/listinfo/scipy-dev >> >> > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Sun Mar 4 13:47:59 2018 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Sun, 4 Mar 2018 10:47:59 -0800 Subject: [SciPy-Dev] GSoC In-Reply-To: References: Message-ID: On Sun, Mar 4, 2018 at 12:38 AM, Hritik Narayan wrote: > Hey everyone, I want to contribute to SciPy via GSoC, want to get into > touch with possible mentors for clarification. Mainly, are the final > proposals to be mailed directly to the mentor, or are they to be posted on > a mailing list? > Hi Hritik, the proposals are to be sent to this mailing list. It's best to post a draft early, so you still have time to incorporate feedback from us. All mentors read this list. Cheers, Ralf -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From nikolay.mayorov at zoho.com Sun Mar 4 16:41:58 2018 From: nikolay.mayorov at zoho.com (Nikolay Mayorov) Date: Mon, 05 Mar 2018 02:41:58 +0500 Subject: [SciPy-Dev] Contributing to SciPy through GSoC Message-ID: <161f2f6d6b6.fb3ea05a10736.6245209566181213756@zoho.com> Hi, Anubhav! I think getting Rotation right is a top priority and so far unfortunately nobody has dug into the technical details of it. I would like to see that from students. As for the algorithms. I believe that Wahba's problem can be generalized by adding a translation vector, but the interpretation will be different (search for "absolute orientation problem"). As for methods to solve Wahba's problem --- probably the SVD-based one is the easiest to understand and implement, but if we decide to use quaternions as the base representation, then we can go with the "Q-method". "Cubic spline" for orientation is also a very cool algorithm which wasn't promoted anywhere, but this is the best idea I found on the subject (i.e. interpolation with continuous angular rates and acceleration). Other algorithms are quite small; they can be added quickly. I would say it is more of a question of what we should include. If you have some ideas outside the (mostly) "aerospace" field --- they are welcome. For all parts I would like to see more concrete and technical details. For example, if we want SLERP interpolation --- what will it be (class or function), what will it accept, and what will be the most difficult part of implementing it correctly. The same for all other things. Best, Nikolay ---- On Sat, 03 Mar 2018 16:26:28 +0500 anubhavp28 at gmail.com wrote ---- Hi, I wanted feedback regarding whether a combination of rotation class and implementation of quaternion SLERP algorithm and Davenport's Q-method solving Wahba's Problem, will be enough for GSoC? Should I include more rotation related algorithm for implementation? Any suggestions what more I could do?
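As a concrete starting point for the design questions raised above, here is a minimal function-style SLERP sketch. It assumes scalar-first `[w, x, y, z]` quaternion storage, which, as noted earlier in the thread, is itself an open convention choice, and whether SLERP should instead be a method on a Rotation class is exactly what the proposal needs to decide:

```python
import numpy as np

def slerp(q0, q1, t):
    """Spherical linear interpolation between unit quaternions q0 and q1.
    t may be a scalar or an array of interpolation fractions in [0, 1];
    returns an array of shape (len(t), 4)."""
    q0 = np.asarray(q0, dtype=float)
    q1 = np.asarray(q1, dtype=float)
    dot = np.dot(q0, q1)
    if dot < 0.0:              # q and -q represent the same rotation;
        q1, dot = -q1, -dot    # flip one endpoint to take the shorter arc
    dot = min(dot, 1.0)        # guard arccos against rounding
    theta = np.arccos(dot)     # angle between the two quaternions
    t = np.atleast_1d(t)
    if theta < 1e-10:          # nearly identical: fall back to lerp
        out = q0 + np.outer(t, q1 - q0)
    else:
        out = (np.outer(np.sin((1.0 - t) * theta), q0)
               + np.outer(np.sin(t * theta), q1)) / np.sin(theta)
    return out / np.linalg.norm(out, axis=1, keepdims=True)
```

Deciding whether the vectorization over `t` (and over stacks of quaternions) belongs in the function signature or in a class holding precomputed quantities is the kind of technical detail asked for above.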
On Mon, Feb 26, 2018 at 9:30 PM, Ralf Gommers <ralf.gommers at gmail.com> wrote: Hi Anubhev, On Mon, Feb 26, 2018 at 2:12 AM, Anubhav Patel <anubhavp28 at gmail.com> wrote: Hi everyone, I want to work on SciPy as part of GSoC and I have few queries. 1. On the Ideas Page, there was a mention of scipy.spatial.transform module. I want to know what will be the exact purpose of this module? Did you read the whole idea? There's a lot of detail. It says for example "The aim of this project is to create a module which will allow to conveniently describe, apply and compose rotations. ". That answer your question I think. 2. Whether the idea for a module for numerical differentiation was dropped completely? Yes, for now that's off the table - at least not feasible for a GSoC we've concluded after several attempts. 3. Apart from those ideas listed on ideas page, are there any other area where you guys would like to see contribution on? Ideas for new features on http://scipy.github.io/devdocs/roadmap.html are of interest, or ones you may have yourself. But given that they're not on the ideas page, it's not guaranteed we can find mentors for those. Cheers, Ralf _______________________________________________ SciPy-Dev mailing list SciPy-Dev at python.org https://mail.python.org/mailman/listinfo/scipy-dev _______________________________________________ SciPy-Dev mailing list SciPy-Dev at python.org https://mail.python.org/mailman/listinfo/scipy-dev -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From andyfaff at gmail.com Sun Mar 4 22:05:36 2018 From: andyfaff at gmail.com (Andrew Nelson) Date: Mon, 5 Mar 2018 14:05:36 +1100 Subject: [SciPy-Dev] WIP: Class based Optimizers In-Reply-To: References: <1518642682.8666.18.camel@iki.fi> <5a84a711.05cfca0a.4a242.4608@mx.google.com> Message-ID: Scott Sievert and I have put a lot of work into preparing a draft of the PEP for class-based scalar minimizers: https://github.com/andyfaff/scipy/blob/a52bb4f9029389da3ab072c92c609d71ed6943c6/PEP/1-Optimizer.rst where we've tried to address comments already made in this thread, and from the WIP github PR. Scott and I look forward to hearing any comments/concerns/feedback about the proposal. We can field any questions and address them in an updated PEP, as well as here. Andrew. p.s. Ralf/Pauli, could we add the PEP to a scipy/PEP, or scipy/scipep, repo? How should we discuss this, and any further PEPs? Should we have a scipep process, or shall we keep things simple? -------------- next part -------------- An HTML attachment was scrubbed... URL: From phillip.m.feldman at gmail.com Mon Mar 5 02:03:34 2018 From: phillip.m.feldman at gmail.com (Phillip Feldman) Date: Sun, 4 Mar 2018 23:03:34 -0800 Subject: [SciPy-Dev] WIP: Class based Optimizers In-Reply-To: References: <1518642682.8666.18.camel@iki.fi> <5a84a711.05cfca0a.4a242.4608@mx.google.com> Message-ID: From the (beautifully written) draft PEP: "Different optimization algorithms can inherit from Optimizer, with each of the subclasses overriding the __next__ method ..." I'm unclear re. whether this approach would allow something like a parallel implementation of Nelder-Mead.
Phillip On Sun, Mar 4, 2018 at 7:05 PM, Andrew Nelson wrote: > Scott Sievert and I have put a lot of work into preparing a draft of the > PEP for class based scalar minimizers: > > https://github.com/andyfaff/scipy/blob/a52bb4f9029389da3ab072c92c609d > 71ed6943c6/PEP/1-Optimizer.rst > > where we've tried to address comments already made in this thread, and > from the WIP github PR. Scott and I look forward to hearing any > comments/concerns/feedback about the proposal. We can field any questions > and address them in an updated PEP, as well as on here. > > Andrew. > > > p.s. Ralf/Pauli, could we add the PEP to a scipy/PEP, or scipy/scipep, > repo? How should we discuss such this, and any further PEP? Should we have > a scipep process, or shall we keep things simple? > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From andyfaff at gmail.com Mon Mar 5 02:13:28 2018 From: andyfaff at gmail.com (Andrew Nelson) Date: Mon, 5 Mar 2018 18:13:28 +1100 Subject: [SciPy-Dev] WIP: Class based Optimizers In-Reply-To: References: <1518642682.8666.18.camel@iki.fi> <5a84a711.05cfca0a.4a242.4608@mx.google.com> Message-ID: > would allow something like a parallel implementation of Nelder-Mead. At the moment the __next__ method would consist of the logic of the existing loop inside _minimize_neldermead, which is done serially. I was not aware of a parallel version of NM, but a quick search reveals there is something along those lines. That's not in scope here, but could be added later. On 5 March 2018 at 18:03, Phillip Feldman wrote: > From the (beautifully written) draft PEP: > > "Different optimization algorithms can inherit from Optimizer, with each > of the subclasses overriding the __next__ method ..." > > I'm unclear re. 
whether this approach would allow something like a > parallel implementation of Nelder-Mead. > > Phillip > > On Sun, Mar 4, 2018 at 7:05 PM, Andrew Nelson wrote: > >> Scott Sievert and I have put a lot of work into preparing a draft of the >> PEP for class based scalar minimizers: >> >> https://github.com/andyfaff/scipy/blob/a52bb4f9029389da3ab07 >> 2c92c609d71ed6943c6/PEP/1-Optimizer.rst >> >> where we've tried to address comments already made in this thread, and >> from the WIP github PR. Scott and I look forward to hearing any >> comments/concerns/feedback about the proposal. We can field any questions >> and address them in an updated PEP, as well as on here. >> >> Andrew. >> >> >> p.s. Ralf/Pauli, could we add the PEP to a scipy/PEP, or scipy/scipep, >> repo? How should we discuss such this, and any further PEP? Should we have >> a scipep process, or shall we keep things simple? >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at python.org >> https://mail.python.org/mailman/listinfo/scipy-dev >> >> > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -- _____________________________________ Dr. Andrew Nelson _____________________________________ -------------- next part -------------- An HTML attachment was scrubbed... URL: From andyfaff at gmail.com Mon Mar 5 02:31:43 2018 From: andyfaff at gmail.com (Andrew Nelson) Date: Mon, 5 Mar 2018 18:31:43 +1100 Subject: [SciPy-Dev] WIP: Class based Optimizers In-Reply-To: References: <1518642682.8666.18.camel@iki.fi> <5a84a711.05cfca0a.4a242.4608@mx.google.com> Message-ID: For future reference parallel Nelder Mead is described at 10.1007/s10614-007-9094-2, and some performance at https://scwu.io/f/Parallelization_Nelder_Mead_Simplex_Algorithm_Abstract.pdf . 
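The iterator-style interface described in the draft PEP, in which each call to `__next__` performs one iteration and a driver loop decides when to stop, can be illustrated with a toy subclass. The class layout, the `solve` driver, and the gradient-descent example below are an illustrative sketch of the pattern, not the PEP's actual API:

```python
import numpy as np

class Optimizer:
    """Iterator-style minimizer: each call to __next__ is one step."""
    def __init__(self, func, x0):
        self.func = func
        self.x = np.asarray(x0, dtype=float)
        self.nit = 0

    def __iter__(self):
        return self

    def __next__(self):
        # Subclasses override this with one iteration of their algorithm.
        raise NotImplementedError

    def solve(self, maxiter=200, tol=1e-8):
        # A simple driver loop; callers could equally step manually.
        for _ in range(maxiter):
            x_prev = self.x.copy()
            next(self)
            if np.linalg.norm(self.x - x_prev) < tol:
                break
        return self.x

class GradientDescent(Optimizer):
    """Toy subclass: fixed-step gradient descent."""
    def __init__(self, func, grad, x0, step=0.1):
        super().__init__(func, x0)
        self.grad = grad
        self.step = step

    def __next__(self):
        self.x = self.x - self.step * self.grad(self.x)
        self.nit += 1
        return self.x
```

Because the caller drives the loop, callbacks, custom stopping rules, or (as asked above) farming out the per-step function evaluations in parallel become the subclass's concern rather than a keyword argument on a monolithic function.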
On 5 March 2018 at 18:13, Andrew Nelson wrote: > > would allow something like a parallel implementation of Nelder-Mead. > > At the moment the __next__ method would consist of the logic of the > existing loop inside _minimize_neldermead, which is done serially. I was > not aware of a parallel version of NM, but a quick search reveals there is > something along those lines. That's not in scope here, but could be added > later. > > On 5 March 2018 at 18:03, Phillip Feldman > wrote: > >> From the (beautifully written) draft PEP: >> >> "Different optimization algorithms can inherit from Optimizer, with each >> of the subclasses overriding the __next__ method ..." >> >> I'm unclear re. whether this approach would allow something like a >> parallel implementation of Nelder-Mead. >> >> Phillip >> >> On Sun, Mar 4, 2018 at 7:05 PM, Andrew Nelson wrote: >> >>> Scott Sievert and I have put a lot of work into preparing a draft of the >>> PEP for class based scalar minimizers: >>> >>> https://github.com/andyfaff/scipy/blob/a52bb4f9029389da3ab07 >>> 2c92c609d71ed6943c6/PEP/1-Optimizer.rst >>> >>> where we've tried to address comments already made in this thread, and >>> from the WIP github PR. Scott and I look forward to hearing any >>> comments/concerns/feedback about the proposal. We can field any questions >>> and address them in an updated PEP, as well as on here. >>> >>> Andrew. >>> >>> >>> p.s. Ralf/Pauli, could we add the PEP to a scipy/PEP, or scipy/scipep, >>> repo? How should we discuss such this, and any further PEP? Should we have >>> a scipep process, or shall we keep things simple? 
>>> >>> _______________________________________________ >>> SciPy-Dev mailing list >>> SciPy-Dev at python.org >>> https://mail.python.org/mailman/listinfo/scipy-dev >>> >>> >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at python.org >> https://mail.python.org/mailman/listinfo/scipy-dev >> >> > > > -- > _____________________________________ > Dr. Andrew Nelson > > > _____________________________________ > -- _____________________________________ Dr. Andrew Nelson _____________________________________ -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Mon Mar 5 02:41:36 2018 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Sun, 4 Mar 2018 23:41:36 -0800 Subject: [SciPy-Dev] WIP: Class based Optimizers In-Reply-To: References: <1518642682.8666.18.camel@iki.fi> <5a84a711.05cfca0a.4a242.4608@mx.google.com> Message-ID: On Sun, Mar 4, 2018 at 7:05 PM, Andrew Nelson wrote: > Scott Sievert and I have put a lot of work into preparing a draft of the > PEP for class based scalar minimizers: > > https://github.com/andyfaff/scipy/blob/a52bb4f9029389da3ab072c92c609d > 71ed6943c6/PEP/1-Optimizer.rst > > where we've tried to address comments already made in this thread, and > from the WIP github PR. Scott and I look forward to hearing any > comments/concerns/feedback about the proposal. We can field any questions > and address them in an updated PEP, as well as on here. > > Andrew. > > > p.s. Ralf/Pauli, could we add the PEP to a scipy/PEP, or scipy/scipep, > repo? How should we discuss such this, and any further PEP? Should we have > a scipep process, or shall we keep things simple? > I'd suggest a separate repo, and no custom process but rather just follow what is done for Python Enhancement Proposals - discussion of major things on this list, more detailed things on a PR on that new repo. Unless there's other opinions, I can create the repo. 
There's also https://github.com/numpy/neps for which build infrastructure is in progress (I expect/hope), so we should be able to steal that soon. So for now just open a PR on the new empty repo, I'd say. Your proposal looks quite comprehensive and well written. I would suggest following the structure of PEPs a little more closely ( https://www.python.org/dev/peps/pep-0001/#what-belongs-in-a-successful-pep), e.g. add the metadata and copyright bits, and use "motivation" and "rationale" as section headers for some of the content that you have. Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From lagru at mailbox.org Mon Mar 5 05:26:29 2018 From: lagru at mailbox.org (Lars G.) Date: Mon, 5 Mar 2018 11:26:29 +0100 Subject: [SciPy-Dev] GSoC 2018: Cythonizing In-Reply-To: References: <3648ed21-1b43-975c-4ca0-05926d683736@mailbox.org> <7800f9b4-51b9-39cf-9ac8-9eae88e2eab9@mailbox.org> Message-ID: On 03.03.2018 16:05, Charles R Harris wrote: > It would be a difficult project for GSoC, as a lot would depend on > identifying and designing the underlying common algorithms, including > APIs. That makes it a step beyond just implementing something known in > advance; it requires strong background knowledge and familiarity with > the relevant SciPy modules. I would be hesitant to propose it unless it > could be trimmed down to just one or two functions that are well defined > before the project starts. Just as an example of how the complexity > grows, NumPy convolution assumes finite sequences extended to +/- inf > with zeros, whereas convolution used for interpolation and filtering > will have a number of choices for edge conditions, some of which would > be best handled with one of the discrete cosine transforms. I don't think > anyone has sat down and figured out how to organize all that, much less > proposed a roadmap to implement it. > > Chuck Okay, thanks for the warning. That doesn't sound promising.
I think I'll try my luck with the other options. This also makes me think that the cythonizing idea would suffer from similar problems. Best regards, Lars From lagru at mailbox.org Mon Mar 5 06:02:24 2018 From: lagru at mailbox.org (Lars G.) Date: Mon, 5 Mar 2018 12:02:24 +0100 Subject: [SciPy-Dev] GSoC 2018: Cythonizing In-Reply-To: <3648ed21-1b43-975c-4ca0-05926d683736@mailbox.org> References: <3648ed21-1b43-975c-4ca0-05926d683736@mailbox.org> Message-ID: On 28.02.2018 15:32, Lars G. wrote: > Dear SciPy devs, > > I'm currently thinking about an application for this year's GSoC as > well. As there already seems to be a large interest in the rotation > formalism I'm trying to find another area that matches my interest and > skill. > > I've dug up this proposal in scikit-image from GSoC 2015 > https://github.com/scikit-image/scikit-image/wiki/GSoC-2015#rewriting-scipyndimage-in-cython > and judging by the state of scipy/ndimage/src/ nobody has worked on this > proposal yet (feel free to correct me). > Alternatively I could imagine something similar for other sub-packages, > e.g. scipy/signal which features many source files in C as well. > > So basically if there is an interest I could try to port C / Python code > to Cython. What I would like to know: > > - Is there an interest? ;) > - Is the original proposal in scikit-image still unfinished and are the > potential mentors still interested in mentoring? > - If there is a general interest to cythonize C or Python code during a > GSoC project, which parts / sub-packages of SciPy would you prioritize? > > As for my current involvement with SciPy: > > - I've already added a small function written in Cython > https://github.com/scipy/scipy/pull/8350 > - as part of a larger PR extending the signal module > https://github.com/scipy/scipy/pull/8264 > which will possibly be merged this week. > - I already cythonized slow parts of the above PR and plan > to add these with new PRs after #8264 is merged.
> > If this receives positive feedback I'd be happy to draft a more complete > proposal / application based on the discussion around this. > > Best regards, > Lars Actually, considering that GSoC should be treated as a full-time job during the coding period, I must sadly pass on this. However, I want to thank you all for the feedback already given. I hope it's still useful for other potential applicants. Best regards, Lars From hritiknarayan at gmail.com Mon Mar 5 11:08:55 2018 From: hritiknarayan at gmail.com (Hritik Narayan) Date: Mon, 5 Mar 2018 21:38:55 +0530 Subject: [SciPy-Dev] GSoC Message-ID: I'm having trouble identifying issues/problems that I could contribute to. Could someone help out? (Apart from the ideas listed on the Github page) I definitely want to pick SciPy as my GSoC organisation because frankly, I'd love contributing to something that is so powerful. -- Hritik -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Mon Mar 5 11:11:28 2018 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Mon, 5 Mar 2018 08:11:28 -0800 Subject: [SciPy-Dev] GSoC In-Reply-To: References: Message-ID: Hi Hritik, On Mon, Mar 5, 2018 at 8:08 AM, Hritik Narayan wrote: > I'm having trouble identifying issues/problems that I could contribute to. > Could someone help out? (Apart from the ideas listed on the Github page) I > definitely want to pick SciPy as my GSoC organisation because frankly, I'd > love contributing to something that is so powerful. > Please have a look at the issues labelled "good first issue": https://github.com/scipy/scipy/issues?q=is%3Aopen+is%3Aissue+label%3A%22good+first+issue%22 If none of those are of interest, then it would help if you told us which part of SciPy you're interested in exactly.
Cheers, Ralf > -- > Hritik > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From stefanv at berkeley.edu Mon Mar 5 16:34:38 2018 From: stefanv at berkeley.edu (Stefan van der Walt) Date: Mon, 5 Mar 2018 13:34:38 -0800 Subject: [SciPy-Dev] WIP: Class based Optimizers In-Reply-To: References: <1518642682.8666.18.camel@iki.fi> <5a84a711.05cfca0a.4a242.4608@mx.google.com> Message-ID: <20180305213438.sct3ilj4t7kzb7ci@carbo> On Sun, 04 Mar 2018 23:41:36 -0800, Ralf Gommers wrote: > Your proposal looks quite comprehensive and well written. I would suggest > following the structure of PEPs a little more closely ( > https://www.python.org/dev/peps/pep-0001/#what-belongs-in-a-successful-pep), > e.g. add the metadata and copyright bits, and use "motivation" and > "rationale" as section headers for some of the content that you have. We've tuned the Python PEP specification a bit for NumPy. See https://github.com/numpy/numpy/blob/master/doc/neps/nep-0000.rst We're now finalizing the build machinery (using CircleCI, very much the same as what is being used by SciPy). Best regards Stéfan From charlesr.harris at gmail.com Mon Mar 5 21:30:57 2018 From: charlesr.harris at gmail.com (Charles R Harris) Date: Mon, 5 Mar 2018 19:30:57 -0700 Subject: [SciPy-Dev] Spline interpolation in ndimage In-Reply-To: References: Message-ID: On Tue, Feb 27, 2018 at 4:59 AM, Jaime Fernández del Río < jaime.frio at gmail.com> wrote: > Hi all, > > We have been discussing in issue #8465 > about the extension modes > for the interpolation module of ndimage. As some other issues linked in > that one show, this is an old problem that has been recurrently discussed, > and there mostly is agreement on what the correct behavior should be.
But > as all things ndimage, implementing any change is hard, in this case not > only because the code is complicated, but also because of the mathematical > subtleties of the interpolation method used. > > In trying to come up with a way forward for this I think I can handle the > code complexity, but am having trouble being sure that I come up with a > sound math approach. I think I have more or less figured it out, but I > don't have a good academic background on signal processing, so was hoping > that someone who does (I'm thinking of you, Chuck!) can read my longish > description of things below and validate or correct it. > > My main source of knowledge has been: > > M. Unser, "Splines: a perfect fit for signal and image processing," in > IEEE Signal Processing Magazine, vol. 16, no. 6, pp. 22-38, Nov 1999. > > Recommendations for bigger, better readings on the subject are also > welcome! > > ----- > > In what follows I'll assume the input image is 2-D and NxN for simplicity, > but I think everything generalizes easily > > If I'm understanding it right, all of our interpolation functions use > B-splines for interpolation. So instead of having NxN discrete values on a > grid, we have NxN B-splines centered at the grid points, and we keep the > NxN coefficients multiplying each spline to reproduce the original image at > the grid points exactly. If the spline order is < 2, the spline > coefficients are the same as the image values. But if order >= 2 (and we > use 3 by default), these have to be calculated in a non-trivial fashion. > This can be done efficiently by applying a separable filter, which is > implemented in ndimage.spline_filter. > > Because B-splines have compact support, when using splines of order n we > only need to consider the B-splines on an (n + 1)x(n + 1) grid neighborhood > of the point being interpolated. 
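To make the compact-support idea concrete for readers following the thread: a minimal numpy-only sketch of the 1-D cubic case, where the interpolated value at a point t is a weighted sum of only the n + 1 = 4 spline coefficients nearest t. This uses the textbook cubic B-spline kernel and is purely illustrative; it is not ndimage's actual code path.

```python
import numpy as np

def bspline3(x):
    """Cubic B-spline kernel; nonzero only for |x| < 2 (compact support)."""
    ax = np.abs(np.asarray(x, dtype=float))
    return np.where(ax < 1, 2.0 / 3.0 - ax**2 + ax**3 / 2.0,
                    np.where(ax < 2, (2.0 - ax)**3 / 6.0, 0.0))

def interpolate(coeffs, t):
    """Evaluate sum_k coeffs[k] * B3(t - k) at an interior point t.

    Only the 4 splines centered nearest to t can be nonzero there, so
    the sum runs over just those indices.  Boundary handling is omitted:
    t should satisfy 1 <= t <= len(coeffs) - 2, where any index dropped
    by the mask below has exactly zero weight.
    """
    k0 = int(np.floor(t))
    k = np.arange(k0 - 1, k0 + 3)
    inside = (k >= 0) & (k < len(coeffs))
    return float(np.sum(coeffs[k[inside]] * bspline3(t - k[inside])))
```

One nontrivial property this exposes: the shifted cubic B-splines form a partition of unity, so constant coefficients reproduce a constant function everywhere in the interior.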
This is more or less straightforward, > until you move close to the edges and all of a sudden some of the points in > your grid neighborhood fall outside the original image grid. We have our > extend mode, which controls how these points outside should be mapped to > points inside. But here is where things start getting tricky... > > When the spline coefficients are computed (i.e. when ndimage.spline_filter > is called), assumptions have to be made about boundary conditions. But > ndimage.spline_filter does not take a mode parameter to control this! So > what I think ndimage does is compute the spline coefficients assuming > "mirror symmetric" boundary conditions, i.e.: > > a b c d c b | a b c d | c b a b c d > For "a, b, c", I would write that as "..., a, b, c, b, a, b, c, ..." extended to infinity in both directions because we are applying IIR filters. Fortunately, the IIR filters fall off rapidly, so one need not take too many extended points to start them up. Counterintuitively, I think the interpolation "mode" only applies to the spline coefficients, not to the original image, resulting in unexpected behavior at the edges. If we want to have "mode" apply to the image rather than the filtered results, we can do that, but the result of the filtering would best be a named tuple containing the mode, the spline order, and, in the case of a constant extension, extra edge coefficients. The coefficients for the "wrap" and "constant" modes can then be obtained by appropriately extending the data for starting up the filters. For the "wrap" and "reflect" modes, the coefficients have the same symmetries, so that can be used to get the needed coefficients outside the boundaries. The spline_filter using IIR is just a clever way to solve A*coef = data.
For the cubic case and three data points, A depends on the mode as follows:

|4 2 0|
|1 4 1| x 1/6   (reflect about center of edge pixels -- we do this)
|0 2 4|

|5 1 0|
|1 4 1| x 1/6   (reflect about edge -- we don't do this)
|0 1 5|

|4 1 1|
|1 4 1| x 1/6   (wrap -- we don't do this)
|1 1 4|

Explicitly inverting such matrices also provides a good test to check that things are done right if we go this way. As you have pointed out, the proper coefficients will vary depending on the mode. The spline coefficients have the same symmetry as the image pixels. Unfortunately, `map_coordinates` reflects about the edges, which doesn't match any of the coefficients that we compute. > So if our interpolated point is within the image boundaries, but some of > the grid points in its (n + 1)x(n + 1) neighborhood fall outside the > boundary, the code uses a mirror symmetric extension mode to fill in those > values. This approach has a funny smell to it, but even if it's formally > incorrect I think it would only be marginally so, as the values are > probably going to be very close to the correct ones. > > The problem comes when the point being interpolated is outside the > boundaries of the image. We cannot use mirror-symmetric spline coefficients > to extend if e.g. we have been asked to extend using wrap mode. So what > ndimage does is first map the value outside the boundaries to a value > within the boundaries, using the given extension mode, then interpolate it > as before, using mirror-symmetric coefficients if needed because its (n + > 1)x(n + 1) neighborhood extends outside. Again, this smells funny, but it > is either correct or very close to correct. > > This is mostly all good and well, except for the "wrap" and "reflect" > extension modes: in these cases the area within one pixel of the image > boundaries is different from anything inside the image, so we cannot use > that approach.
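As a numerical sanity check on the first ("we do this") system Chuck wrote out above, the N x N mirror-boundary matrix can be built directly and solved with dense linear algebra. This is purely illustrative: ndimage obtains the same coefficients with a fast IIR prefilter, not a dense solve.

```python
import numpy as np

def mirror_system(n):
    """Matrix A with data = A @ coeffs for cubic B-spline interpolation
    under mirror-symmetric boundaries (c[-1] = c[1], c[n] = c[n - 2]).
    Assumes n >= 2."""
    A = np.zeros((n, n))
    for i in range(n):
        A[i, i] = 4.0 / 6.0
        for j in (i - 1, i + 1):
            # Fold out-of-range neighbours back inside (mirror symmetry).
            jj = -j if j < 0 else 2 * (n - 1) - j if j >= n else j
            A[i, jj] += 1.0 / 6.0
    return A

# Hypothetical sample data, just to show the round trip.
data = np.array([1.0, 4.0, 2.0, 5.0, 3.0])
coeffs = np.linalg.solve(mirror_system(len(data)), data)
```

Every row of A sums to 1, so a constant image yields constant coefficients equal to the image value -- a quick invariant to test any reimplementation against.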
So what ndimage does is basically make shit up and use > something similar, but not quite right. "reflect" is mostly correct, except > for within that pixel of the boundary, but "wrap" is a surprising and > confusing mess. > > So how do we fix this? I see two ways forward: > > 1. What seems the more correct approach would be to compute the spline > coefficients taking into account the extension mode to be used, then use > the same extension mode to fill in the neighborhood values when > interpolating for a point outside the boundaries. > 1. First question is whether this is doable? I need to work out the > math, but for "wrap" it looks like it should be; not 100% sure if it also is > for "reflect". > > Yes. The constant mode is actually the tricky one. I'd be tempted to make it orthogonal to "reflect", "wrap", and "nearest", that way we don't need to compute extra coefficients. Note that if the image isn't prefiltered, we are running a smoothing filter over it. > > 1. Assuming it is, it has the main advantage of doing things in a more > general and understandable way once you have enough background knowledge. > 2. It does go a little bit against our API design: you can control > whether the input is spline-filtered automatically with a parameter, the > idea being that you may want to do the filtering yourself if you are going > to apply several different transformations to the same image. If the mode > of the filtering has to be synced with the mode of the transformation, > letting the user do it themselves is a recipe for disaster, because it's > going to lead to very hard-to-track bugs. > 3. As elsewhere in ndimage, the current implementation does a lot > of caching, which works because it always interpolates for a point within > the image boundaries. If we started interpolating for points outside the > boundaries without first mapping to within there may be a performance hit > which has to be evaluated. > 1.
The other approach is, for "wrap" and "reflect" modes, pad the > input image with an extra pixel in each direction, then compute our > current "mirror symmetric" spline coefficients, and leave things as they > are right now, aside from some changes to the mapping of values to take the > extra pixels into account. > > Are "mirror" and "reflect" actually different? The function documentation only mentions "reflect". > > 1. This looks like a nightmare of special cases everywhere and > potential off-by-one errors while putting it together, but it would just go > along with the ugliness of the existing code. > 2. It's unclear what we would do if we are given an input with > prefilter=False, so this also breaks the current API, probably even more so. > > Any thoughts or recommendations are very welcome! > The docstrings need fixing, they are almost useless. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Mon Mar 5 23:06:11 2018 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Mon, 5 Mar 2018 20:06:11 -0800 Subject: [SciPy-Dev] Numba as a dependency for SciPy? Message-ID: Hi all, Goal of this email: start a discussion to decide whether we'd be okay with relying on Numba as a dependency, now or in 1-2 years' time. Context: in https://github.com/pydata/sparse/issues/126 a discussion is ongoing about whether to adopt Cython or Numba, with Numba being preferred by the majority. That `sparse` package is meant to provide sparse *arrays* that down the line should either be replacing our current sparse *matrices* or at least be integrated in scipy.sparse in addition to them. See https://github.com/scipy/scipy/issues/8162 and https://github.com/hameerabbasi/sparse-ndarray-protocols for more details on that. 
Also related is the question from Serge Guelton some weeks ago about whether we'd want to rely on Pythran: https://mail.python.org/pipermail/scipy-dev/2018-January/022325.html On that Pythran thread I commented that we'd want to take these aspects into account: - portability - performance - maturity - maintenance status (active devs, how quick do bugs get fixed after a release with an issue) - ease of use (@jit vs. Pythran comments vs. translate to .pyx syntax) - size of generated binaries - templating support for multiple dtypes - debugging and optimization experience/tool Debugging is one of the ones where I'd say Numba is still worse than Cython, however that's being resolved as we speak: https://github.com/numba/numba/issues/2788 One thing I missed in the above list is dependencies: while our use of Cython only adds a build-time dependency, Numba would add a run-time dependency. Given that binary wheels and conda packages for all major platforms are available that's not a showstopper, but it matters. Overall I'd say that: - Numba is better than Cython at: performance, ease of use, size of generated binaries, and templating support for multiple dtypes. Possibly also maintenance status right now. - Numba and Cython are about equally good at portability (I think, not much data about exotic platforms for Numba). - Cython is better than Numba at: maturity, debugging (but not for long anymore probably), dependencies. I'm usually pretty conservative in these things, but considering the above I'm leaning towards saying use of Numba should be allowed in the future. The added run-time dependency is the one major downside that's going to stay, however compared to our Fortran headaches that's a relatively small issue. Thoughts? Cheers, Ralf -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From ralf.gommers at gmail.com Mon Mar 5 23:35:05 2018 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Mon, 5 Mar 2018 20:35:05 -0800 Subject: [SciPy-Dev] WIP: Class based Optimizers In-Reply-To: <20180305213438.sct3ilj4t7kzb7ci@carbo> References: <1518642682.8666.18.camel@iki.fi> <5a84a711.05cfca0a.4a242.4608@mx.google.com> <20180305213438.sct3ilj4t7kzb7ci@carbo> Message-ID: On Mon, Mar 5, 2018 at 1:34 PM, Stefan van der Walt wrote: > On Sun, 04 Mar 2018 23:41:36 -0800, Ralf Gommers wrote: > > Your proposal looks quite comprehensive and well written. I would suggest > > following the structure of PEPs a little more closely ( > > https://www.python.org/dev/peps/pep-0001/#what-belongs- > in-a-successful-pep), > > e.g. add the metadata and copyright bits, and use "motivation" and > > "rationale" as section headers for some of the content that you have. > > We've tuned the Python PEP specification a bit for NumPy. See > > https://github.com/numpy/numpy/blob/master/doc/neps/nep-0000.rst > Thanks, that seems fine to copy. The link to the template in nep-0000 is broken, here it is: https://github.com/numpy/numpy/blob/master/doc/neps/nep-template.rst Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From alan.isaac at gmail.com Mon Mar 5 23:58:17 2018 From: alan.isaac at gmail.com (Alan Isaac) Date: Mon, 5 Mar 2018 23:58:17 -0500 Subject: [SciPy-Dev] WIP: Class based Optimizers In-Reply-To: References: <1518642682.8666.18.camel@iki.fi> <5a84a711.05cfca0a.4a242.4608@mx.google.com> Message-ID: Didn't OpenOpt take some steps in this direction? http://courses.csail.mit.edu/6.867/wiki/images/6/6e/Qp-openopt.pdf Just making sure anything useful doesn't get overlooked, since I did not see this mentioned. Alan Isaac On 3/5/2018 2:03 AM, Phillip Feldman wrote: > From the (beautifully written) draft PEP: > > "Different optimization algorithms can inherit from Optimizer, with each of the subclasses overriding the __next__ method ..." 
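To make the quoted sentence concrete for archive readers: a minimal sketch of what an iterator-style minimizer API could look like. Only the `Optimizer`/`__next__` idea comes from the quote; the `GradientDescent` subclass, its parameter names, and the stopping rule are invented for illustration and are not taken from the draft PEP.

```python
class Optimizer:
    """Iterator-style base class: subclasses override __next__,
    which performs exactly one iteration of the algorithm."""

    def __init__(self, func, x0):
        self.func = func
        self.x = float(x0)
        self.nit = 0

    def __iter__(self):
        return self

    def __next__(self):
        raise NotImplementedError


class GradientDescent(Optimizer):
    """Hypothetical subclass, purely for illustration."""

    def __init__(self, func, grad, x0, step_size=0.1, tol=1e-10):
        super().__init__(func, x0)
        self.grad = grad
        self.step_size = step_size
        self.tol = tol

    def __next__(self):
        step = self.step_size * self.grad(self.x)
        self.x -= step
        self.nit += 1
        if abs(step) < self.tol:
            raise StopIteration  # converged
        return self.x


# The iterator protocol lets callers drive the solve loop themselves:
opt = GradientDescent(func=lambda x: (x - 3.0) ** 2,
                      grad=lambda x: 2.0 * (x - 3.0), x0=0.0)
for x in opt:
    pass  # a caller could log, checkpoint, or stop early here
```

The appeal of this design is that per-iteration control (logging, early stopping, or running steps in parallel, as Phillip asks about below) lives in the caller's loop rather than behind a callback argument.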
> > I'm unclear re. whether this approach would allow something like a parallel implementation of Nelder-Mead. > > Phillip > On Sun, Mar 4, 2018 at 7:05 PM, Andrew Nelson > wrote: > > Scott Sievert and I have put a lot of work into preparing a draft of the PEP for class based scalar minimizers: > > https://github.com/andyfaff/scipy/blob/a52bb4f9029389da3ab072c92c609d71ed6943c6/PEP/1-Optimizer.rst > > > where we've tried to address comments already made in this thread, and from the WIP github PR. Scott and I look forward to hearing any comments/concerns/feedback about the > proposal. We can field any questions and address them in an updated PEP, as well as on here. > > Andrew. > > > p.s. Ralf/Pauli, could we add the PEP to a scipy/PEP, or scipy/scipep, repo? How should we discuss such this, and any further PEP? Should we have a scipep process, or shall we > keep things simple? From ralf.gommers at gmail.com Tue Mar 6 00:35:48 2018 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Mon, 5 Mar 2018 21:35:48 -0800 Subject: [SciPy-Dev] WIP: Class based Optimizers In-Reply-To: References: <1518642682.8666.18.camel@iki.fi> <5a84a711.05cfca0a.4a242.4608@mx.google.com> Message-ID: On Mon, Mar 5, 2018 at 8:58 PM, Alan Isaac wrote: > Didn't OpenOpt take some steps in this direction? > http://courses.csail.mit.edu/6.867/wiki/images/6/6e/Qp-openopt.pdf > Just making sure anything useful doesn't get overlooked, since > I did not see this mentioned. > Hard to say, since OpenOpt has disappeared AFAIK. There's no repo to be found and openopt.org has been offline for a while. Ralf > > Alan Isaac > > On 3/5/2018 2:03 AM, Phillip Feldman wrote: > >> From the (beautifully written) draft PEP: >> >> "Different optimization algorithms can inherit from Optimizer, with each >> of the subclasses overriding the __next__ method ..." >> >> I'm unclear re. whether this approach would allow something like a >> parallel implementation of Nelder-Mead. 
>> >> Phillip >> > > On Sun, Mar 4, 2018 at 7:05 PM, Andrew Nelson > andyfaff at gmail.com>> wrote: >> >> Scott Sievert and I have put a lot of work into preparing a draft of >> the PEP for class based scalar minimizers: >> >> https://github.com/andyfaff/scipy/blob/a52bb4f9029389da3ab07 >> 2c92c609d71ed6943c6/PEP/1-Optimizer.rst >> > 72c92c609d71ed6943c6/PEP/1-Optimizer.rst> >> >> where we've tried to address comments already made in this thread, >> and from the WIP github PR. Scott and I look forward to hearing any >> comments/concerns/feedback about the >> proposal. We can field any questions and address them in an updated >> PEP, as well as on here. >> >> Andrew. >> >> >> p.s. Ralf/Pauli, could we add the PEP to a scipy/PEP, or >> scipy/scipep, repo? How should we discuss such this, and any further PEP? >> Should we have a scipep process, or shall we >> keep things simple? >> > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > -------------- next part -------------- An HTML attachment was scrubbed... URL: From cimrman3 at ntc.zcu.cz Tue Mar 6 06:06:20 2018 From: cimrman3 at ntc.zcu.cz (Robert Cimrman) Date: Tue, 6 Mar 2018 12:06:20 +0100 Subject: [SciPy-Dev] ANN: SfePy 2018.1 Message-ID: <07f7c4f6-abae-a7f4-5e87-e29dbab2e296@ntc.zcu.cz> I am pleased to announce release 2018.1 of SfePy. Description ----------- SfePy (simple finite elements in Python) is a software for solving systems of coupled partial differential equations by the finite element method or by the isogeometric analysis (limited support). It is distributed under the new BSD license. 
Home page: http://sfepy.org Mailing list: https://mail.python.org/mm3/mailman3/lists/sfepy.python.org/ Git (source) repository, issue tracker: https://github.com/sfepy/sfepy Highlights of this release -------------------------- - major update of time-stepping solvers and solver handling - Newmark and Bathe elastodynamics solvers - interface to MUMPS linear solver - new examples: - iron plate impact problem (elastodynamics) - incompressible Mooney-Rivlin material model (hyperelasticity) as a script For full release notes see http://docs.sfepy.org/doc/release_notes.html#id1 (rather long and technical). Cheers, Robert Cimrman --- Contributors to this release in alphabetical order: Robert Cimrman Jan Heczko Jan Kopacka Vladimir Lukes From tyler.je.reddy at gmail.com Tue Mar 6 15:02:35 2018 From: tyler.je.reddy at gmail.com (Tyler Reddy) Date: Tue, 6 Mar 2018 13:02:35 -0700 Subject: [SciPy-Dev] Numba as a dependency for SciPy? In-Reply-To: References: Message-ID: Interesting discussion. Would our plan be to support both side-by-side for a while & just see what happens with the evolution of the ecosystem? If there's no clear winner in the short-term would we discourage PRs that simply migrate from Cython to numba for say 1.5 x performance increase? What about an algorithm that mixes the two approaches -- some numba and some Cython components for whatever reason -- is that discouraged? It looks like numba plays ok with airspeed velocity -- presumably mixing Cython / numba in our suite will be ok? On 5 March 2018 at 21:06, Ralf Gommers wrote: > Hi all, > > Goal of this email: start a discussion to decide whether we'd be okay with > relying on Numba as a dependency, now or in 1-2 years' time. > > Context: in https://github.com/pydata/sparse/issues/126 a discussion is > ongoing about whether to adopt Cython or Numba, with Numba being preferred > by the majority. 
That `sparse` package is meant to provide sparse *arrays* > that down the line should either be replacing our current sparse *matrices* > or at least be integrated in scipy.sparse in addition to them. See > https://github.com/scipy/scipy/issues/8162 and https://github.com/ > hameerabbasi/sparse-ndarray-protocols for more details on that. > > Also related is the question from Serge Guelton some weeks ago about > whether we'd want to rely on Pythran: https://mail.python.org/ > pipermail/scipy-dev/2018-January/022325.html > > On that Pythran thread I commented that we'd want to take these aspects > into account: > - portability > - performance > - maturity > - maintenance status (active devs, how quick do bugs get fixed after a > release with an issue) > - ease of use (@jit vs. Pythran comments vs. translate to .pyx syntax) > - size of generated binaries > - templating support for multiple dtypes > - debugging and optimization experience/tool > > Debugging is one of the ones where I'd say Numba is still worse than > Cython, however that's being resolved as we speak: > https://github.com/numba/numba/issues/2788 > > One thing I missed in the above list is dependencies: while our use of > Cython only adds a build-time dependency, Numba would add a run-time > dependency. Given that binary wheels and conda packages for all major > platforms are available that's not a showstopper, but it matters. > > Overall I'd say that: > - Numba is better than Cython at: performance, ease of use, size of > generated binaries, and templating support for multiple dtypes. Possibly > also maintenance status right now. > - Numba and Cython are about equally good at portability (I think, not > much data about exotic platforms for Numba). > - Cython is better than Numba at: maturity, debugging (but not for long > anymore probably), dependencies. 
> > I'm usually pretty conservative in these things, but considering the above > I'm leaning towards saying use of Numba should be allowed in the future. > The added run-time dependency is the one major downside that's going to > stay, however compared to our Fortran headaches that's a relatively small > issue. > > Thoughts? > > Cheers, > Ralf > > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From serge.guelton at telecom-bretagne.eu Tue Mar 6 16:13:33 2018 From: serge.guelton at telecom-bretagne.eu (Serge Guelton) Date: Tue, 6 Mar 2018 22:13:33 +0100 Subject: [SciPy-Dev] Numba as a dependency for SciPy? In-Reply-To: References: Message-ID: <20180306211333.GA29117@lakota> > Overall I'd say that: > - Numba is better than Cython at: performance, ease of use, size of generated > binaries, and templating support for multiple dtypes. Possibly also maintenance > status right now. Hi all, To assert that statement, I've quickly setup a bunch of notebooks with cython kernels extracted from the scipy codebase here: https://github.com/serge-sans-paille/scipy-kernels For the sake of the comparison, I've contributed Pythran implementation of these kernels, and I hope someone in the ML will issue a PR with numba version. Please note that the goal is *not* to claim that a given compiler is better than another. The benchmarks provided only target a subset of the functions input space, I have not tuned the backend compiler for a given architecture nor tried to enable parallelism. It's just there to give a rough idea of the ease-of-use / performance tradeoffs. Best, Serge From perimosocordiae at gmail.com Tue Mar 6 16:21:06 2018 From: perimosocordiae at gmail.com (CJ Carey) Date: Tue, 06 Mar 2018 21:21:06 +0000 Subject: [SciPy-Dev] Numba as a dependency for SciPy? 
In-Reply-To: References: Message-ID: I think adding a required runtime dependency may be overly restrictive, given scipy's position near(-ish) the base of the scientific computing pyramid. Would it be possible to run numba-optimized code on systems with numba installed without impacting "vanilla" users? On Tue, Mar 6, 2018 at 3:03 PM Tyler Reddy wrote: > Interesting discussion. Would our plan be to support both side-by-side for > a while & just see what happens with the evolution of the ecosystem? If > there's no clear winner in the short-term would we discourage PRs that > simply migrate from Cython to numba for say 1.5 x performance increase? > What about an algorithm that mixes the two approaches -- some numba and > some Cython components for whatever reason -- is that discouraged? > > It looks like numba plays ok with airspeed velocity -- presumably mixing > Cython / numba in our suite will be ok? > > > > On 5 March 2018 at 21:06, Ralf Gommers wrote: > >> Hi all, >> >> Goal of this email: start a discussion to decide whether we'd be okay >> with relying on Numba as a dependency, now or in 1-2 years' time. >> >> Context: in https://github.com/pydata/sparse/issues/126 a discussion is >> ongoing about whether to adopt Cython or Numba, with Numba being preferred >> by the majority. That `sparse` package is meant to provide sparse *arrays* >> that down the line should either be replacing our current sparse *matrices* >> or at least be integrated in scipy.sparse in addition to them. See >> https://github.com/scipy/scipy/issues/8162 and >> https://github.com/hameerabbasi/sparse-ndarray-protocols for more >> details on that. 
>> >> Also related is the question from Serge Guelton some weeks ago about >> whether we'd want to rely on Pythran: >> https://mail.python.org/pipermail/scipy-dev/2018-January/022325.html >> >> On that Pythran thread I commented that we'd want to take these aspects >> into account: >> - portability >> - performance >> - maturity >> - maintenance status (active devs, how quick do bugs get fixed after a >> release with an issue) >> - ease of use (@jit vs. Pythran comments vs. translate to .pyx syntax) >> - size of generated binaries >> - templating support for multiple dtypes >> - debugging and optimization experience/tool >> >> Debugging is one of the ones where I'd say Numba is still worse than >> Cython, however that's being resolved as we speak: >> https://github.com/numba/numba/issues/2788 >> >> One thing I missed in the above list is dependencies: while our use of >> Cython only adds a build-time dependency, Numba would add a run-time >> dependency. Given that binary wheels and conda packages for all major >> platforms are available that's not a showstopper, but it matters. >> >> Overall I'd say that: >> - Numba is better than Cython at: performance, ease of use, size of >> generated binaries, and templating support for multiple dtypes. Possibly >> also maintenance status right now. >> - Numba and Cython are about equally good at portability (I think, not >> much data about exotic platforms for Numba). >> - Cython is better than Numba at: maturity, debugging (but not for long >> anymore probably), dependencies. >> >> I'm usually pretty conservative in these things, but considering the >> above I'm leaning towards saying use of Numba should be allowed in the >> future. The added run-time dependency is the one major downside that's >> going to stay, however compared to our Fortran headaches that's a >> relatively small issue. >> >> Thoughts? 
>> >> Cheers, >> Ralf >> >> >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at python.org >> https://mail.python.org/mailman/listinfo/scipy-dev >> >> > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > -------------- next part -------------- An HTML attachment was scrubbed... URL: From shoyer at gmail.com Tue Mar 6 16:55:41 2018 From: shoyer at gmail.com (Stephan Hoyer) Date: Tue, 06 Mar 2018 21:55:41 +0000 Subject: [SciPy-Dev] Numba as a dependency for SciPy? In-Reply-To: References: Message-ID: Numba does have a mechanism for exporting pre-compiled code: http://numba.pydata.org/numba-doc/dev/user/pycc.html It would be interesting to see if those limitations are flexible enough for SciPy. On Tue, Mar 6, 2018 at 1:21 PM CJ Carey wrote: > I think adding a required runtime dependency may be overly restrictive, > given scipy's position near(-ish) the base of the scientific computing > pyramid. > > Would it be possible to run numba-optimized code on systems with numba > installed without impacting "vanilla" users? > > > On Tue, Mar 6, 2018 at 3:03 PM Tyler Reddy > wrote: > >> Interesting discussion. Would our plan be to support both side-by-side >> for a while & just see what happens with the evolution of the ecosystem? If >> there's no clear winner in the short-term would we discourage PRs that >> simply migrate from Cython to numba for say 1.5 x performance increase? >> What about an algorithm that mixes the two approaches -- some numba and >> some Cython components for whatever reason -- is that discouraged? >> >> It looks like numba plays ok with airspeed velocity -- presumably mixing >> Cython / numba in our suite will be ok? 
>> >> >> >> On 5 March 2018 at 21:06, Ralf Gommers wrote: >> >>> Hi all, >>> >>> Goal of this email: start a discussion to decide whether we'd be okay >>> with relying on Numba as a dependency, now or in 1-2 years' time. >>> >>> Context: in https://github.com/pydata/sparse/issues/126 a discussion is >>> ongoing about whether to adopt Cython or Numba, with Numba being preferred >>> by the majority. That `sparse` package is meant to provide sparse *arrays* >>> that down the line should either be replacing our current sparse *matrices* >>> or at least be integrated in scipy.sparse in addition to them. See >>> https://github.com/scipy/scipy/issues/8162 and >>> https://github.com/hameerabbasi/sparse-ndarray-protocols for more >>> details on that. >>> >>> Also related is the question from Serge Guelton some weeks ago about >>> whether we'd want to rely on Pythran: >>> https://mail.python.org/pipermail/scipy-dev/2018-January/022325.html >>> >>> On that Pythran thread I commented that we'd want to take these aspects >>> into account: >>> - portability >>> - performance >>> - maturity >>> - maintenance status (active devs, how quick do bugs get fixed after a >>> release with an issue) >>> - ease of use (@jit vs. Pythran comments vs. translate to .pyx syntax) >>> - size of generated binaries >>> - templating support for multiple dtypes >>> - debugging and optimization experience/tool >>> >>> Debugging is one of the ones where I'd say Numba is still worse than >>> Cython, however that's being resolved as we speak: >>> https://github.com/numba/numba/issues/2788 >>> >>> One thing I missed in the above list is dependencies: while our use of >>> Cython only adds a build-time dependency, Numba would add a run-time >>> dependency. Given that binary wheels and conda packages for all major >>> platforms are available that's not a showstopper, but it matters. 
>>> >>> Overall I'd say that: >>> - Numba is better than Cython at: performance, ease of use, size of >>> generated binaries, and templating support for multiple dtypes. Possibly >>> also maintenance status right now. >>> - Numba and Cython are about equally good at portability (I think, not >>> much data about exotic platforms for Numba). >>> - Cython is better than Numba at: maturity, debugging (but not for long >>> anymore probably), dependencies. >>> >>> I'm usually pretty conservative in these things, but considering the >>> above I'm leaning towards saying use of Numba should be allowed in the >>> future. The added run-time dependency is the one major downside that's >>> going to stay, however compared to our Fortran headaches that's a >>> relatively small issue. >>> >>> Thoughts? >>> >>> Cheers, >>> Ralf >>> >>> >>> >>> _______________________________________________ >>> SciPy-Dev mailing list >>> SciPy-Dev at python.org >>> https://mail.python.org/mailman/listinfo/scipy-dev >>> >>> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at python.org >> https://mail.python.org/mailman/listinfo/scipy-dev >> > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > -------------- next part -------------- An HTML attachment was scrubbed... URL: From matthew.brett at gmail.com Tue Mar 6 17:54:15 2018 From: matthew.brett at gmail.com (Matthew Brett) Date: Tue, 6 Mar 2018 22:54:15 +0000 Subject: [SciPy-Dev] Numba as a dependency for SciPy? In-Reply-To: References: Message-ID: On Tue, Mar 6, 2018 at 9:21 PM, CJ Carey wrote: > I think adding a required runtime dependency may be overly restrictive, > given scipy's position near(-ish) the base of the scientific computing > pyramid. Yes, that's a worry I have too. 
You may remember Samuel Maybury on the mailing list recently sighing somewhat when he found he had to get numba installed on the Raspberry Pi. My guess is numba will be fine on the standard platforms and a significant problem on non-standard ones. Cheers, Matthew From stefanv at berkeley.edu Tue Mar 6 18:22:53 2018 From: stefanv at berkeley.edu (Stefan van der Walt) Date: Tue, 6 Mar 2018 15:22:53 -0800 Subject: [SciPy-Dev] SciPy Central Rescue Message-ID: <20180306232253.fz6rixtqedai27m6@carbo> Hi, everyone Jiayue Li, a student working with me here at BIDS, recently converted the database of the now-defunct SciPy Central into Sphinx-rendered pages: https://machine-shop.github.io/scipy-central-rescue/ The source materials are at: https://github.com/machine-shop/scipy-central-rescue We are happy to move this over to the SciPy repo, and to integrate these materials wherever is most applicable. Let us know! Thanks also to Surya Kasturi who helped us get access to the original database. Best regards, Stéfan From sseibert at anaconda.com Tue Mar 6 18:32:47 2018 From: sseibert at anaconda.com (Stanley Seibert) Date: Tue, 6 Mar 2018 17:32:47 -0600 Subject: [SciPy-Dev] Numba as a dependency for SciPy? In-Reply-To: References: Message-ID: (Hi, as someone from the Numba project, my apologies for wading into this discussion late.) Last time we checked, Numba on ARM was pretty close to working already. We have an open PR for ARMv7 that we just ran out of time to finish QA'ing (https://github.com/numba/numba/pull/1968) a while ago, and never have gotten back to because there was not much demand at the time. It sounds like this is tripping up more people now, so we can take a look again. One thing that really is unpleasant on ARM is compiling LLVM, so we would probably want to make sure we had conda packages and wheels available for llvmlite on ARM. Is there any precedent for posting Linux ARMv7 wheels to PyPI?
(For conda, we would just make sure they appear in Jonathan's berryconda channel.) A more difficult platform is POWER8, where we tried to do a port and got stuck last year. Recently, someone has figured out what the issues were, and it sounds like several of them stemmed from LLVM bugs in the POWER8 backend that may or may not be fixed in the next release. IBM is interested in improving the situation, so hopefully that will be sorted out soon. On Tue, Mar 6, 2018 at 4:54 PM, Matthew Brett wrote: > On Tue, Mar 6, 2018 at 9:21 PM, CJ Carey > wrote: > > I think adding a required runtime dependency may be overly restrictive, > > given scipy's position near(-ish) the base of the scientific computing > > pyramid. > > Yes, that's a worry I have too. You may remember Samuel Maybury on > the mailing list recently sighing somewhat when he found he had to get > numba installed on the Raspberry Pi. My guess is numba will be fine > on the standard platforms and a significant problem on non-standard > ones. > > Cheers, > > Matthew > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Tue Mar 6 20:00:25 2018 From: charlesr.harris at gmail.com (Charles R Harris) Date: Tue, 6 Mar 2018 18:00:25 -0700 Subject: [SciPy-Dev] Numba as a dependency for SciPy? In-Reply-To: References: Message-ID: On Mon, Mar 5, 2018 at 9:06 PM, Ralf Gommers wrote: > Hi all, > > Goal of this email: start a discussion to decide whether we'd be okay with > relying on Numba as a dependency, now or in 1-2 years' time. > > Context: in https://github.com/pydata/sparse/issues/126 a discussion is > ongoing about whether to adopt Cython or Numba, with Numba being preferred > by the majority. 
That `sparse` package is meant to provide sparse *arrays* > that down the line should either be replacing our current sparse *matrices* > or at least be integrated in scipy.sparse in addition to them. See > https://github.com/scipy/scipy/issues/8162 and https://github.com/ > hameerabbasi/sparse-ndarray-protocols for more details on that. > > Also related is the question from Serge Guelton some weeks ago about > whether we'd want to rely on Pythran: https://mail.python.org/ > pipermail/scipy-dev/2018-January/022325.html > > On that Pythran thread I commented that we'd want to take these aspects > into account: > - portability > - performance > - maturity > - maintenance status (active devs, how quick do bugs get fixed after a > release with an issue) > - ease of use (@jit vs. Pythran comments vs. translate to .pyx syntax) > - size of generated binaries > - templating support for multiple dtypes > - debugging and optimization experience/tool > > Debugging is one of the ones where I'd say Numba is still worse than > Cython, however that's being resolved as we speak: > https://github.com/numba/numba/issues/2788 > > One thing I missed in the above list is dependencies: while our use of > Cython only adds a build-time dependency, Numba would add a run-time > dependency. Given that binary wheels and conda packages for all major > platforms are available that's not a showstopper, but it matters. > > Overall I'd say that: > - Numba is better than Cython at: performance, ease of use, size of > generated binaries, and templating support for multiple dtypes. Possibly > also maintenance status right now. > - Numba and Cython are about equally good at portability (I think, not > much data about exotic platforms for Numba). > - Cython is better than Numba at: maturity, debugging (but not for long > anymore probably), dependencies. 
> > I'm usually pretty conservative in these things, but considering the above > I'm leaning towards saying use of Numba should be allowed in the future. > The added run-time dependency is the one major downside that's going to > stay, however compared to our Fortran headaches that's a relatively small > issue. > I like the idea of using Numba, but remain a bit skeptical about the dependencies and long term maintenance. I suppose the same could have been said about NumPy and SciPy ten years ago, the continued maintenance and availability of both was not a foregone conclusion. It is probably best to wait a bit to see how things shake out, but I'm not opposed to the use of either Pythran or Numba on technical grounds. There have been other such attempts, weave and that other tensor code -- I forget the name -- were both present in early releases and have since disappeared. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Tue Mar 6 20:02:14 2018 From: njs at pobox.com (Nathaniel Smith) Date: Tue, 6 Mar 2018 17:02:14 -0800 Subject: [SciPy-Dev] Numba as a dependency for SciPy? In-Reply-To: References: Message-ID: On Tue, Mar 6, 2018 at 3:32 PM, Stanley Seibert wrote: > One thing that really is unpleasant on ARM is compiling LLVM, so we would > probably want to make sure we had conda packages and wheels available for > llvmlite on ARM. Is there any precedent for posting Linux ARMv7 wheels to > PyPI? (For conda, we would just make sure they appear in Jonathan's > berryconda channel.) No, there currently isn't any way to post Linux ARM wheels on PyPI. There could be -- the main issue is defining what "Linux ARMv7" means in enough detail for pip to figure out which wheels it should be trying to download on a given system. -n -- Nathaniel J. 
Smith -- https://vorpus.org From ralf.gommers at gmail.com Wed Mar 7 00:24:07 2018 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Tue, 6 Mar 2018 21:24:07 -0800 Subject: [SciPy-Dev] SciPy Central Rescue In-Reply-To: <20180306232253.fz6rixtqedai27m6@carbo> References: <20180306232253.fz6rixtqedai27m6@carbo> Message-ID: On Tue, Mar 6, 2018 at 3:22 PM, Stefan van der Walt wrote: > Hi, everyone > > Jiayue Li, a student working with me here at BIDS, recently converted > the database of the now-defunc SciPy Central into Sphinx-rendered pages: > > https://machine-shop.github.io/scipy-central-rescue/ Thanks Jiayue and Stefan for doing this! > The source materials are at: > > https://github.com/machine-shop/scipy-central-rescue > > We are happy to move this over to the SciPy repo, and to integrate these > materials wherever is most applicable. Let us know! > I would suggest to move the whole thing under the scipy Github org as a historical snapshot, and select the most useful content and put it into https://github.com/scipy/scipy-cookbook Cheers, Ralf > > Thanks also to Surya Kasturi who helped us get access to the original > database. > > Best regards, > St?fan > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Wed Mar 7 02:47:18 2018 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Tue, 6 Mar 2018 23:47:18 -0800 Subject: [SciPy-Dev] Numba as a dependency for SciPy? In-Reply-To: References: Message-ID: On Tue, Mar 6, 2018 at 12:02 PM, Tyler Reddy wrote: > Interesting discussion. Would our plan be to support both side-by-side for > a while & just see what happens with the evolution of the ecosystem? > Indeed. 
I expect we would get lots of easy wins - stuff that's slow now and we can speed up by simply adding @jit and that no one wants to port to Cython because that is a lot of work. > If there's no clear winner in the short-term would we discourage PRs that > simply migrate from Cython to numba for say 1.5 x performance increase? > Yes that's not the best idea. I'd say start carefully, just adding some jit calls. Then it's also easy to reverse if something goes wrong. > What about an algorithm that mixes the two approaches -- some numba and > some Cython components for whatever reason -- is that discouraged? > That seems like a recipe for disaster. > > It looks like numba plays ok with airspeed velocity -- presumably mixing > Cython / numba in our suite will be ok? > They're both okay, asv will be completely agnostic to implementation language. Ralf > > > > On 5 March 2018 at 21:06, Ralf Gommers wrote: > >> Hi all, >> >> Goal of this email: start a discussion to decide whether we'd be okay >> with relying on Numba as a dependency, now or in 1-2 years' time. >> >> Context: in https://github.com/pydata/sparse/issues/126 a discussion is >> ongoing about whether to adopt Cython or Numba, with Numba being preferred >> by the majority. That `sparse` package is meant to provide sparse *arrays* >> that down the line should either be replacing our current sparse *matrices* >> or at least be integrated in scipy.sparse in addition to them. See >> https://github.com/scipy/scipy/issues/8162 and >> https://github.com/hameerabbasi/sparse-ndarray-protocols for more >> details on that. 
>> >> Also related is the question from Serge Guelton some weeks ago about >> whether we'd want to rely on Pythran: https://mail.python.org/piperm >> ail/scipy-dev/2018-January/022325.html >> >> On that Pythran thread I commented that we'd want to take these aspects >> into account: >> - portability >> - performance >> - maturity >> - maintenance status (active devs, how quick do bugs get fixed after a >> release with an issue) >> - ease of use (@jit vs. Pythran comments vs. translate to .pyx syntax) >> - size of generated binaries >> - templating support for multiple dtypes >> - debugging and optimization experience/tool >> >> Debugging is one of the ones where I'd say Numba is still worse than >> Cython, however that's being resolved as we speak: >> https://github.com/numba/numba/issues/2788 >> >> One thing I missed in the above list is dependencies: while our use of >> Cython only adds a build-time dependency, Numba would add a run-time >> dependency. Given that binary wheels and conda packages for all major >> platforms are available that's not a showstopper, but it matters. >> >> Overall I'd say that: >> - Numba is better than Cython at: performance, ease of use, size of >> generated binaries, and templating support for multiple dtypes. Possibly >> also maintenance status right now. >> - Numba and Cython are about equally good at portability (I think, not >> much data about exotic platforms for Numba). >> - Cython is better than Numba at: maturity, debugging (but not for long >> anymore probably), dependencies. >> >> I'm usually pretty conservative in these things, but considering the >> above I'm leaning towards saying use of Numba should be allowed in the >> future. The added run-time dependency is the one major downside that's >> going to stay, however compared to our Fortran headaches that's a >> relatively small issue. >> >> Thoughts? 
>> >> Cheers, >> Ralf >> >> >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at python.org >> https://mail.python.org/mailman/listinfo/scipy-dev >> >> > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Wed Mar 7 03:00:57 2018 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Wed, 7 Mar 2018 00:00:57 -0800 Subject: [SciPy-Dev] Numba as a dependency for SciPy? In-Reply-To: References: Message-ID: On Tue, Mar 6, 2018 at 3:32 PM, Stanley Seibert wrote: > (Hi, as someone from the Numba project, my apologies for wading into this > discussion late.) > No apologies needed, the inside perspective is very welcome! > > Last time we checked, Numba on ARM was pretty close to working already. > We have an open PR for ARMv7 that we just ran out of time to finish QA'ing ( > https://github.com/numba/numba/pull/1968) a while ago, and never have > gotten back to because there was not much demand at the time. It sounds > like this is tripping up more people now, so we can take a look again. > > One thing that really is unpleasant on ARM is compiling LLVM, so we would > probably want to make sure we had conda packages and wheels available for > llvmlite on ARM. Is there any precedent for posting Linux ARMv7 wheels to > PyPI? (For conda, we would just make sure they appear in Jonathan's > berryconda channel.) > > A more difficult platform is POWER8, where we tried to do a port and got > stuck last year. Recently, someone has figured out what the issues were, > and it sounds like several of them stemmed from LLVM bugs in the POWER8 > backend that may or may not be fixed in the next release. IBM is > interested in improving the situation, so hopefully that will be sorted out > soon. 
> Hmm, that does sound like portability is still a significant issue today. We do sometimes break things for users on POWER8, ARM and similar platforms because of our lack of CI there, but we do try to keep things working and accept patches for those platforms. Ralf > > On Tue, Mar 6, 2018 at 4:54 PM, Matthew Brett > wrote: > >> On Tue, Mar 6, 2018 at 9:21 PM, CJ Carey >> wrote: >> > I think adding a required runtime dependency may be overly restrictive, >> > given scipy's position near(-ish) the base of the scientific computing >> > pyramid. >> >> Yes, that's a worry I have too. You may remember Samuel Maybury on >> the mailing list recently sighing somewhat when he found he had to get >> numba installed on the Raspberry Pi. My guess is numba will be fine >> on the standard platforms and a significant problem on non-standard >> ones. >> >> Cheers, >> >> Matthew >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at python.org >> https://mail.python.org/mailman/listinfo/scipy-dev >> > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Wed Mar 7 02:56:49 2018 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Tue, 6 Mar 2018 23:56:49 -0800 Subject: [SciPy-Dev] Numba as a dependency for SciPy? In-Reply-To: References: Message-ID: On Tue, Mar 6, 2018 at 1:55 PM, Stephan Hoyer wrote: > Numba does have a mechanism for exporting pre-compiled code: > http://numba.pydata.org/numba-doc/dev/user/pycc.html > > It would be interesting to see if those limitations are flexible enough > for SciPy. > I suspect that (a) we're then going to run into more Numba bugs because pre-compilation is not well-tested, and (b) we throw away some of the advantages of Numba, e.g. 
we then get back the binary size explosion for multiple dtype templating. > On Tue, Mar 6, 2018 at 1:21 PM CJ Carey wrote: > >> I think adding a required runtime dependency may be overly restrictive, >> given scipy's position near(-ish) the base of the scientific computing >> pyramid. >> >> Would it be possible to run numba-optimized code on systems with numba >> installed without impacting "vanilla" users? >> > It's worth thinking about. We could put a jit decorator in scipy._lib that becomes numba @jit if numba is installed and is do-nothing otherwise. Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Wed Mar 7 03:02:45 2018 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Wed, 7 Mar 2018 00:02:45 -0800 Subject: [SciPy-Dev] GSoC 2018: Cythonizing In-Reply-To: References: <3648ed21-1b43-975c-4ca0-05926d683736@mailbox.org> Message-ID: On Mon, Mar 5, 2018 at 3:02 AM, Lars G. wrote: > On 28.02.2018 15:32, Lars G. wrote: > > Dear SciPy devs, > > > > I'm currently thinking about an application for this year's GSoC as > > well. As there already seems to be a large interest in the rotation > > formalism I'm trying to find another area that matches my interest and > > skill. > > > > I've dug up this proposal in scikit-image from GSoC 2015 > > https://github.com/scikit-image/scikit-image/wiki/GSoC- > 2015#rewriting-scipyndimage-in-cython > > and judging by the state of scipy/ndimage/src/ nobody has worked on this > > proposal yet (feel free to correct me). > > Alternatively I could imagine something similar for other sub-packages, > > e.g. scipy/signal which features many source files in C as well. > > > > So basically if there is an interest I could try to port C / Python code > > to Cython. What I would like to know: > > > > - Is there an interest? ;) > > - Is the original proposal in scikit-image still unfinished and are the > > potential mentors still interested in mentoring? 
> > - If there is a general interest to cythonize C or Python code during a > > GSoC project, which parts / sub-packages of SciPy would you priorize? > > > > As for my current involvement with SciPy: > > > > - I've already added a small function written in Cython > > https://github.com/scipy/scipy/pull/8350 > > - as part of a larger PR extending the signal module > > https://github.com/scipy/scipy/pull/8264 > > which will possibly merged this week. > > - I already cythonized slow parts of the above PR and plan > > to add these with new PRs after #8264 is merged. > > > > If this receives positive feedback I'd be happy to draft a more complete > > proposal / application based on the discussion around this. > > > > Best regards, > > Lars > > Actually, considering that GSoC should be treated as a full-time job > during time of coding I must sadly pass on this. However I want to thank > you all for the feedback already given. I hope its still useful for > other potential applicants. > Thanks Lars. I hope you do stick around part-time! Cheers, Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From jaime.frio at gmail.com Wed Mar 7 03:40:19 2018 From: jaime.frio at gmail.com (=?UTF-8?Q?Jaime_Fern=C3=A1ndez_del_R=C3=ADo?=) Date: Wed, 07 Mar 2018 08:40:19 +0000 Subject: [SciPy-Dev] Numba as a dependency for SciPy? In-Reply-To: References: Message-ID: On Wed, Mar 7, 2018 at 9:02 AM Ralf Gommers wrote: > > > On Tue, Mar 6, 2018 at 1:55 PM, Stephan Hoyer wrote: > >> Numba does have a mechanism for exporting pre-compiled code: >> http://numba.pydata.org/numba-doc/dev/user/pycc.html >> >> It would be interesting to see if those limitations are flexible enough >> for SciPy. >> > > I suspect that (a) we're then going to run into more Numba bugs because > pre-compilation is not well-tested, and (b) we throw away some of the > advantages of Numba, e.g. we then get back the binary size explosion for > multiple dtype templating. 
> > >> On Tue, Mar 6, 2018 at 1:21 PM CJ Carey >>> wrote: >>> >>>> I think adding a required runtime dependency may be overly restrictive, >>>> given scipy's position near(-ish) the base of the scientific computing >>>> pyramid. >>>> >>>> Would it be possible to run numba-optimized code on systems with numba >>>> installed without impacting "vanilla" users? >>>> >>> > It's worth thinking about. We could put a jit decorator in scipy._lib that > becomes numba @jit if numba is installed and is do-nothing otherwise. > I'll admit I have a "fear of the unknown" mistrust of numba, but reading through this thread I was thinking of something like this as being something even I would have no problem with. Juan Luis Cano, who probably reads this, but who I have just in case CCed in this e-mail, is the author of https://github.com/poliastro/poliastro, a numerical library that gave up on Fortran/Cython/C by design and embraced Numba from the start, would be nice to hear his take on making it a dependency. Jaime > > Ralf > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > -- (\__/) ( O.o) ( > <) Este es Conejo. Copia a Conejo en tu firma y ayúdale en sus planes de dominación mundial. -------------- next part -------------- An HTML attachment was scrubbed... URL: From juanlu001 at gmail.com Wed Mar 7 05:30:47 2018 From: juanlu001 at gmail.com (Juan Luis Cano) Date: Wed, 7 Mar 2018 11:30:47 +0100 Subject: [SciPy-Dev] Numba as a dependency for SciPy? In-Reply-To: References: Message-ID: Hi all, Thanks Jaime for pinging me, I don't follow scipy-dev regularly. My frustration with the first release of poliastro, a Python library for astrodynamics that I created some years ago, was that it mixed MATLAB (Octave), FORTRAN (all caps) and Python code and that it was a mess to distribute (this was 2013).
I had no experience with C so Cython looked intimidating to me (I even tried to do some debugging recently and simply couldn't cope with cython-gdb, see https://groups.google.com/d/msg/cython-users/SCk2IDG9M5g/muhUhmw9AwAJ). When I saw that I could achieve decent performance with numba, I threw away all the FORTRAN and now the library is pure Python. For some releases, my approach was exactly what Ralf said: have a `@jit` decorator that selected what to do depending on whether numba was installed. This was nice because there was no easy way to install numba with pip when I started, and in doing so I guaranteed performance with `conda install poliastro` and a working system with `pip install poliastro`. https://github.com/poliastro/poliastro/blob/0.6.x/src/poliastro/jit.py However, now there are wheels on PyPI for both numba and llvmlite, so I removed the conditional `@jit` and included numba as a required dependency (except on PyPy): https://github.com/poliastro/poliastro/blob/master/setup.py#L40 As a last note, numba works beautifully with C code through CFFI: https://www.anaconda.com/blog/developer-blog/calling-c-libraries-numba-using-cffi/ http://old.pybonacci.org/2016/02/07/como-crear-extensiones-en-c-para-python-usando-cffi-y-numba/ [my take, in Spanish] I understand the concerns about numba because it used to work only with conda and "it comes from a company" (which some people consider a bad thing). numba is no doubt a complex project, but the devs are committed to it and the installation and packaging issues are now few (especially compared to the situation we had in 2013). Also, the Julia folks will drive the LLVM ecosystem to more platforms I would say. Hope this helps!
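The conditional decorator Juan describes (and that Ralf floated for a scipy._lib helper) can be sketched in a few lines. This is a hedged illustration, not poliastro's or SciPy's actual code: the helper name `maybe_jit` and the example function are invented here.

```python
# Optional-dependency pattern: use numba's @jit when numba is importable,
# otherwise leave the function as ordinary interpreted Python.
try:
    from numba import jit as _numba_jit
except ImportError:
    _numba_jit = None

def maybe_jit(*jit_args, **jit_kwargs):
    """Return numba.jit(*jit_args, **jit_kwargs) if numba is installed,
    otherwise a decorator that returns the function unchanged."""
    if _numba_jit is not None:
        return _numba_jit(*jit_args, **jit_kwargs)
    def decorator(func):
        return func
    return decorator

@maybe_jit(nopython=True)
def squared_sum(n):
    # Plain-Python loop: interpreted without numba, compiled when it is present.
    total = 0.0
    for i in range(n):
        total += i * i
    return total
```

Either way the decorated function behaves identically; only the speed differs, which is the trade-off the thread debates.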
On Wed, Mar 7, 2018 at 9:40 AM, Jaime Fernández del Río < jaime.frio at gmail.com> wrote: > On Wed, Mar 7, 2018 at 9:02 AM Ralf Gommers > wrote: > >> >> >> On Tue, Mar 6, 2018 at 1:55 PM, Stephan Hoyer wrote: >> >>> Numba does have a mechanism for exporting pre-compiled code: >>> http://numba.pydata.org/numba-doc/dev/user/pycc.html >>> >>> It would be interesting to see if those limitations are flexible enough >>> for SciPy. >>> >> >> I suspect that (a) we're then going to run into more Numba bugs because >> pre-compilation is not well-tested, and (b) we throw away some of the >> advantages of Numba, e.g. we then get back the binary size explosion for >> multiple dtype templating. >> >> >>> On Tue, Mar 6, 2018 at 1:21 PM CJ Carey >>> wrote: >>> >>>> I think adding a required runtime dependency may be overly restrictive, >>>> given scipy's position near(-ish) the base of the scientific computing >>>> pyramid. >>>> >>>> Would it be possible to run numba-optimized code on systems with numba >>>> installed without impacting "vanilla" users? >>>> >>> >> It's worth thinking about. We could put a jit decorator in scipy._lib >> that becomes numba @jit if numba is installed and is do-nothing otherwise. >> > > I'll admit I have a "fear of the unknown" mistrust of numba, but reading > through this thread I was thinking of something like this as being > something even I would have no problem with. > > Juan Luis Cano, who probably reads this, but who I have just in case CCed > in this e-mail, is the author of https://github.com/poliastro/poliastro, > a numerical library that gave up on Fortran/Cython/C by design and embraced > Numba from the start, would be nice to hear his take on making it a > dependency. > > Jaime > > >> >> Ralf >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at python.org >> https://mail.python.org/mailman/listinfo/scipy-dev >> > > > -- > (\__/) > ( O.o) > ( > <) Este es Conejo.
Copia a Conejo en tu firma y ayúdale en sus planes > de dominación mundial. > -- Juan Luis Cano -------------- next part -------------- An HTML attachment was scrubbed... URL: From pav at iki.fi Wed Mar 7 06:09:48 2018 From: pav at iki.fi (Pauli Virtanen) Date: Wed, 7 Mar 2018 12:09:48 +0100 Subject: [SciPy-Dev] Numba as a dependency for SciPy? In-Reply-To: References: Message-ID: <67c2004c-2234-c3fc-9b2a-3704dfd5a450@iki.fi> Hi, Ralf Gommers kirjoitti 06.03.2018 klo 05:06: > Goal of this email: start a discussion to decide whether we'd be okay with > relying on Numba as a dependency, now or in 1-2 years' time. I think the main concerns indeed are portability and maturity. The advantages of Numba on the other hand are clear, as it mostly avoids the multiple-language mess. We might also want to consider using cffi at some point. Currently, we basically just have Cython, f2py, and hand-written C code for ffi. I guess LLVM supports most of the architectures we would be interested in, but e.g. the fact that ARM does not work yet is not so nice. Moreover, libLLVM is 50+ megabyte blob, but maybe today when people run text editors on web browsers instead of vice versa that's not a big deal. The idea that we could use `scipy._lib.jit` that's either a noop or Numba jit does not sound good in practice: for code where the JIT is wanted, the performance without it is likely unacceptable. (For PyPy, the no-op decorator in principle could be acceptable, but only with numpypy which IIUC is not production ready currently, cpyext+numpy likely won't be faster than CPython.) Moreover, we presumably would like to use features such as `numba.cfunc`. So as I see it, either Numba is a hard dependency, or we don't use it. On maturity: I don't know what is the API stability status for Numba, presumably the basic API is stable.
Numba debugging also in my experience has several paper cuts, e.g., the compilation and runtime errors are cryptic --- they assume you know how numba works, and don't include such niceties as line numbers or useful tracebacks, etc. Maybe this will improve in coming years. Pauli From sseibert at anaconda.com Wed Mar 7 10:12:57 2018 From: sseibert at anaconda.com (Stanley Seibert) Date: Wed, 7 Mar 2018 09:12:57 -0600 Subject: [SciPy-Dev] Numba as a dependency for SciPy? In-Reply-To: <67c2004c-2234-c3fc-9b2a-3704dfd5a450@iki.fi> References: <67c2004c-2234-c3fc-9b2a-3704dfd5a450@iki.fi> Message-ID: A couple notes of clarification: The way llvmlite (Numba's Python wrapper around LLVM) is designed, we statically link to the LLVM library for several reasons: - LLVM changes its C++ API and its IR syntax frequently enough that it is never safe to assume Linux distributions ship the version Numba/llvmlite requires, or that it was compiled with the options we need. - This allows llvmlite to contain only the subset of LLVM that is needed for the JIT. Your 50MB estimate is about right for the install size of llvmlite (54 MB on Linux x86_64, 30 MB on macOS 64-bit), but a full install of LLVM has more like 165MB of library code. I just wanted to call all this out in case someone is assuming that Numba requires an installation of LLVM. It does not, and in fact deliberately ignores any system installation of LLVM on purpose. Regarding the API stability: Numba's API is stable in practice because the most common interface (the compiler decorators, like @jit) tend to be very simple. We are planning to release 1.0 this year, at which point we will clearly identify in the documentation which APIs and options are stable and which we consider experimental. That will also identify a clear point where we think that Numba is safe for core projects to take as a dependency. Debugging support continues to be an area we work on, so your criticism is fair. 
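To make the "decorators like @jit tend to be very simple" point concrete: the signature-list form of `numba.jit` shown below is real numba API, while the function `hypot2` and the import fallback are invented for this sketch so it also runs where numba is not installed.

```python
# Eagerly compile one function for two dtypes with a single decorator line.
try:
    from numba import jit
except ImportError:
    def jit(*args, **kwargs):        # no-op stand-in when numba is absent
        def decorator(func):
            return func
        return decorator

# A list of signatures covers the "templating support for multiple dtypes"
# criterion from Ralf's original list, without .pyx fused-type machinery.
@jit(["float64(float64, float64)", "float32(float32, float32)"],
     nopython=True)
def hypot2(a, b):
    return a * a + b * b
```

Whether or not the decorator compiled anything, the call site is unchanged: hypot2(3.0, 4.0) gives 25.0 in either mode.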
On Wed, Mar 7, 2018 at 5:09 AM, Pauli Virtanen wrote: > Hi, > > Ralf Gommers kirjoitti 06.03.2018 klo 05:06: > >> Goal of this email: start a discussion to decide whether we'd be okay with >> relying on Numba as a dependency, now or in 1-2 years' time. >> > > I think the main concerns indeed are portability and maturity. > > The advantages of Numba on the other hand are clear, as it mostly avoids > the multiple-language mess. > > We might also want to consider using cffi at some point. Currently, we > basically just have Cython, f2py, and hand-written C code for ffi. > > I guess LLVM supports most of the architectures we would be interested in, > but e.g. the fact that ARM does not work yet is not so nice. Moreover, > libLLVM is 50+ megabyte blob, but maybe today when people run text editors > on web browsers instead of vice versa that's not a big deal. > > The idea that we could use `scipy._lib.jit` that's either a noop or Numba > jit does not sound good in practice: for code where the JIT is wanted, the > performance without it is likely unacceptable. (For PyPy, the no-op > decorator in principle could be acceptable, but only with numpypy which > IIUC is not production ready currently, cpyext+numpy likely won't be faster > than CPython.) Moreover, we presumably would like to use features such as > `numba.cfunc`. So as I see it, either Numba is a hard dependency, or we > don't use it. > > On maturity: I don't know what is the API stability status for Numba, > presumably the basic API is stable. > > Numba debugging also in my experience has several paper cuts, e.g., the > compilation and runtime errors are cryptic --- they assume you know how > numba works, and don't include such niceties as line numbers or useful > tracebacks, etc. Maybe this will improve in coming years. 
> > Pauli > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > -------------- next part -------------- An HTML attachment was scrubbed... URL: From sseibert at anaconda.com Wed Mar 7 10:34:12 2018 From: sseibert at anaconda.com (Stanley Seibert) Date: Wed, 7 Mar 2018 09:34:12 -0600 Subject: [SciPy-Dev] Numba as a dependency for SciPy? In-Reply-To: References: Message-ID: On Wed, Mar 7, 2018 at 4:30 AM, Juan Luis Cano wrote: > I understand the concerns about numba because it used to work only with > conda and "it comes from a company" (which some people consider a bad > thing). numba is no doubt a complex project, but the devs are committed to > it and the installation and packaging are now few (specially compared to > the situation we had in 2013). Also, the Julia folks will drive the LLVM > ecosystem to more platforms I would say. > I don't want to strawman the "it comes from a company" argument (which I, not surprisingly, don't agree with), but I think buried in there is a concern I do agree with: It is a risk for a core project to be sponsored by a single stakeholder. Companies change strategy, professors lose grant funding, postdocs move on, and hobbyists burn out and have life-changing events. The longevity of the project depends on being able to handle the disappearance of any core developer (and their sponsor). So on this front, it is fair to be concerned about Numba, although the situation is improving. Right now, the core Numba team consists of 3 Anaconda employees and 2 Intel employees who were recently added for their automatic multithreading contributions in 2017. We've been taking notes on the challenges of onboarding other developers to the code base, and definitely see the barriers to entry in the code base. There are things we will work to improve this year, which will hopefully continue to make it easier for new developers to get involved. 
-------------- next part -------------- An HTML attachment was scrubbed... URL: From andyfaff at gmail.com Wed Mar 7 16:22:34 2018 From: andyfaff at gmail.com (Andrew Nelson) Date: Thu, 8 Mar 2018 08:22:34 +1100 Subject: [SciPy-Dev] Numba as a dependency for SciPy? In-Reply-To: References: Message-ID: I have a few points to chuck in from the sidelines: 1) are there any licence considerations in having scipy depend on numba? For example, I'm thinking along the lines where someone would have to bundle (or link) numba with scipy/numpy, and be forced to use a specific licence. 2) The size could be an issue for those distributing apps. In a conda/virtualenv/python install numba is only required to be installed once. However, I have a few standalone python apps frozen using PyInstaller. Every time I make one of those it has to freeze numpy, scipy, etc, into the app structure. PyInstaller strips out a lot of stuff that isn't necessary, but the size of scipy is still large. Having numba in there would increase the size of an app by another 50 MB if the entirety of the package was required. If one could statically link in the required bits, then I suppose it's not so bad. -------------- next part -------------- An HTML attachment was scrubbed... URL: From sseibert at anaconda.com Wed Mar 7 16:57:42 2018 From: sseibert at anaconda.com (Stanley Seibert) Date: Wed, 7 Mar 2018 15:57:42 -0600 Subject: [SciPy-Dev] Numba as a dependency for SciPy? In-Reply-To: References: Message-ID: On Wed, Mar 7, 2018 at 3:22 PM, Andrew Nelson wrote: > 1) are there any licence considerations in having scipy depend on numba? For > example, I'm thinking along the lines where someone would have to bundle > (or link) numba with scipy/numpy, and be forced to use a specific licence. > Numba and llvmlite are BSD licensed, and LLVM uses the University of Illinois/NCSA Open Source license: https://opensource.org/licenses/UoI-NCSA.php ...which basically looks like the 3-clause BSD license.
This is very similar to SciPy's own license. 2) The size could be an issue for those distributing apps. In a > conda/virtualenv/python install numba is only required to be installed > once. However, I have a few standalone python apps frozen using > PyInstaller. Every time I make one of those it has to freeze numpy, scipy, > etc, into the app structure. PyInstaller strips out a lot of stuff that > isn't necessary, but the size of scipy is still large. Having numba in > there would increase the size of an app by another 50 MB if the entirety of > the package was required. If one could statically link in the required > bits, then I suppose it's not so bad. > llvmlite is already statically linked to the required bits of LLVM, so there isn't much chance of reducing the size further. In the situations where ahead-of-time compilation is possible, Numba produces shared libraries which do not require Numba or llvmlite, but the use cases where that can be done are limited (at the moment). -------------- next part -------------- An HTML attachment was scrubbed... URL: From jheemskerk at urthecast.com Wed Mar 7 17:24:15 2018 From: jheemskerk at urthecast.com (Jordan Heemskerk) Date: Wed, 7 Mar 2018 22:24:15 +0000 Subject: [SciPy-Dev] Exposing additional window functions in scipy.signal In-Reply-To: <5D91CFE5-1DEB-49CD-A66E-752EED7F5F5D@urthecast.com> References: <5D91CFE5-1DEB-49CD-A66E-752EED7F5F5D@urthecast.com> Message-ID: <5B769115-F750-41FC-89DF-01DE35544EC1@urthecast.com> Greetings, I'm hoping to make a small contribution in the scipy.signal.windows module which exposes an existing private function _cos_win as general_cosine and implements a new general_hamming window. Lately I've been coming across these windows in my line of work and having an idiomatic way to produce them through SciPy would be beneficial to me.
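To give an idea of what these windows compute, here is a rough NumPy sketch (illustrative only -- the actual implementation in the PR may differ in details such as symmetry/endpoint handling):

```python
import numpy as np

def general_cosine(M, a):
    # Generalized cosine window: a weighted sum of cosine terms, with the
    # coefficients of each term given in `a`.
    fac = np.linspace(-np.pi, np.pi, M)
    w = np.zeros(M)
    for k, coeff in enumerate(a):
        w += coeff * np.cos(k * fac)
    return w

def general_hamming(M, alpha):
    # Generalized Hamming: the two-term special case with a tunable
    # coefficient; alpha = 0.54 gives the familiar Hamming shape.
    return general_cosine(M, [alpha, 1.0 - alpha])
```

For example, `general_hamming(11, 0.54)` peaks at 1.0 in the middle and tapers to `2*alpha - 1` at the edges.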
Although they are not the most common windows, they are used (see PR) and hopefully other members of the community would be interested in having a standard Python implementation. I?ve drafted the potential changes in PR #8534 (https://github.com/scipy/scipy/pull/8534) and am hoping to recruit some reviewers. Thanks, Jordan Heemskerk Urthecast Inc. -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Thu Mar 8 02:04:37 2018 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Wed, 7 Mar 2018 23:04:37 -0800 Subject: [SciPy-Dev] Numba as a dependency for SciPy? In-Reply-To: <67c2004c-2234-c3fc-9b2a-3704dfd5a450@iki.fi> References: <67c2004c-2234-c3fc-9b2a-3704dfd5a450@iki.fi> Message-ID: On Wed, Mar 7, 2018 at 3:09 AM, Pauli Virtanen wrote: > Hi, > > Ralf Gommers kirjoitti 06.03.2018 klo 05:06: > >> Goal of this email: start a discussion to decide whether we'd be okay with >> relying on Numba as a dependency, now or in 1-2 years' time. >> > > I think the main concerns indeed are portability and maturity. > > The advantages of Numba on the other hand are clear, as it mostly avoids > the multiple-language mess. > > We might also want to consider using cffi at some point. Currently, we > basically just have Cython, f2py, and hand-written C code for ffi. > > I guess LLVM supports most of the architectures we would be interested in, > but e.g. the fact that ARM does not work yet is not so nice. Moreover, > libLLVM is 50+ megabyte blob, but maybe today when people run text editors > on web browsers instead of vice versa that's not a big deal. > > The idea that we could use `scipy._lib.jit` that's either a noop or Numba > jit does not sound good in practice: for code where the JIT is wanted, the > performance without it is likely unacceptable. I don't think it's performance here that matters. For >99.x% of our users Numba will install just fine, the exceptions being the exotic architectures like POWER8. 
On those, having SciPy import and work as expected is enough; those functions that are implemented with Numba then run slower - at the start, that's <1% of the functions we offer, for <1% of our userbase. Also, I don't think performance will necessarily be unacceptable. There are a bunch of places in the existing code base where we can throw in @jit and get speedups basically for free. Performance in the noop case will then be what it is today - not great, but apparently also not enough of a problem that someone has attempted to go to Cython. Ralf > (For PyPy, the no-op decorator in principle could be acceptable, but only > with numpypy which IIUC is not production ready currently, cpyext+numpy > likely won't be faster than CPython.) Moreover, we presumably would like to > use features such as `numba.cfunc`. So as I see it, either Numba is a hard > dependency, or we don't use it. > > On maturity: I don't know what is the API stability status for Numba, > presumably the basic API is stable. > > Numba debugging also in my experience has several paper cuts, e.g., the > compilation and runtime errors are cryptic --- they assume you know how > numba works, and don't include such niceties as line numbers or useful > tracebacks, etc. Maybe this will improve in coming years. > > Pauli > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > -------------- next part -------------- An HTML attachment was scrubbed... URL: From pav at iki.fi Thu Mar 8 04:38:51 2018 From: pav at iki.fi (Pauli Virtanen) Date: Thu, 8 Mar 2018 10:38:51 +0100 Subject: [SciPy-Dev] Numba as a dependency for SciPy? In-Reply-To: References: <67c2004c-2234-c3fc-9b2a-3704dfd5a450@iki.fi> Message-ID: <4cc5f89c-201e-91b1-a849-5990bb05af18@iki.fi> Ralf Gommers kirjoitti 08.03.2018 klo 08:04: [clip] > Also, I don't think performance will necessarily be unacceptable.
There are > a bunch of places in the existing code base where we can throw in @jit and > get speedups basically for free. Performance in the noop case will then be > what it is today - not great, but apparently also not enough of a problem > that someone has attempted to go to Cython. I guess you agree that Numba would regardless be declared a dependency in setup.py? People on unsupported arches can edit it away manually. For computational tight loops operating on arrays when Numba is used as an alternative to Cython/C/Fortran, there probably will be a performance hit in the ballpark of 100x. If we are planning to use numba features more fully, e.g. numba.cfunc e.g. to write callback functions, that would also require Numba as a hard dependency. Pauli From matthew.brett at gmail.com Thu Mar 8 05:00:29 2018 From: matthew.brett at gmail.com (Matthew Brett) Date: Thu, 8 Mar 2018 10:00:29 +0000 Subject: [SciPy-Dev] Numba as a dependency for SciPy? In-Reply-To: <4cc5f89c-201e-91b1-a849-5990bb05af18@iki.fi> References: <67c2004c-2234-c3fc-9b2a-3704dfd5a450@iki.fi> <4cc5f89c-201e-91b1-a849-5990bb05af18@iki.fi> Message-ID: Hi, On Thu, Mar 8, 2018 at 9:38 AM, Pauli Virtanen wrote: > Ralf Gommers kirjoitti 08.03.2018 klo 08:04: > [clip] >> >> Also, I don't think performance will necessarily be unacceptable. There >> are >> a bunch of places in the existing code base where we can throw in @jit and >> get speedups basically for free. Performance in the noop case will then be >> what it is today - not great, but apparently also not enough of a problem >> that someone has attempted to go to Cython. > > > I guess you agree that Numba would regardless be declared a dependency in > setup.py? People on unsupported arches can edit it away manually. > > For computational tight loops operating on arrays when Numba is used as an > alternative to Cython/C/Fortran, there probably will be a performance hit in > the ballpark of 100x. 
> > If we are planning to use numba features more fully, e.g. numba.cfunc e.g. > to write callback functions, that would also require Numba as a hard > dependency. If we were at the top of the stack, like pystatsmodels, then this would be reasonable, but, if we make numba a dependency, that makes numba a dependency for almost anyone doing scientific computing. I think we do have to care about people not running on Intel. If we make numba an optional dependency, it gives us an additional maintenance burden, because we'd have to check for each numba segment, whether it is going to be disabling for a user without numba. Is there anything we have at the moment where Cython won't get us into the ballpark? If not, my preference would be to wait for a year or so, to see how things turn out. Cheers, Matthew From gael.varoquaux at normalesup.org Thu Mar 8 08:39:02 2018 From: gael.varoquaux at normalesup.org (Gael Varoquaux) Date: Thu, 8 Mar 2018 14:39:02 +0100 Subject: [SciPy-Dev] Numba as a dependency for SciPy? In-Reply-To: References: Message-ID: <20180308133902.GE1855417@phare.normalesup.org> Hi everybody, My take on this issue (I choose to reply to the first mail of the thread, because I've read the thread, but wasn't sure where to insert a reply). First, I was excited when I read the beginning of the thread. Numba is a big promise: it can make our code faster while staying high level. But following the thread, it seems that it is not prime time yet. I am not worried at all that it is a "company-driven project". The code license is right, and I think that there is a real will to grow a community. What I am worried about is that it has not reached a sufficient level of portability. It is important that the base of the pyramid works easily on less mainstream platforms like ARM. Embedded systems are important for science, whether it be academic, industrial, or citizen science. 
From what I read on this thread, there is still work to be done there, including at the level of the Python packaging system. Things have improved hugely in the last few years, so I am extremely hopeful. I worry about the solution that @jit can fall back to a no-op when numba is not available, because it means that those in such situations will have silent slowdowns. I remember a few years ago, it was common for cluster administrators to install numpy without linking it to an optimized BLAS (using the embedded lapack-lite), and on many clusters numpy was unusable. Computing clusters can also be quite adverse situations for installation, as some don't have access to the Internet, and the libraries must be installed in an existing Python distribution, to play well with domain-specific libraries. Debugging is also a problem, but it seems slightly less of a showstopper to me. I would say that we should postpone this decision. Ideally, as a community, we should help numba and the packaging ecosystem get to a point where portability is no longer a problem. Cheers, Gaël On Mon, Mar 05, 2018 at 08:06:11PM -0800, Ralf Gommers wrote: > Hi all, > Goal of this email: start a discussion to decide whether we'd be okay with > relying on Numba as a dependency, now or in 1-2 years' time. > Context: in https://github.com/pydata/sparse/issues/126 a discussion is ongoing > about whether to adopt Cython or Numba, with Numba being preferred by the > majority. That `sparse` package is meant to provide sparse *arrays* that down > the line should either be replacing our current sparse *matrices* or at least > be integrated in scipy.sparse in addition to them. See https://github.com/scipy/scipy/issues/8162 and https://github.com/hameerabbasi/sparse-ndarray-protocols > for more details on that.
> Also related is the question from Serge Guelton some weeks ago about whether > we'd want to rely on Pythran: https://mail.python.org/pipermail/scipy-dev/ > 2018-January/022325.html > On that Pythran thread I commented that we'd want to take these aspects into > account: > - portability > - performance > - maturity > - maintenance status (active devs, how quick do bugs get fixed after a > release with an issue) > - ease of use (@jit vs. Pythran comments vs. translate to .pyx syntax) > - size of generated binaries > - templating support for multiple dtypes > - debugging and optimization experience/tool > Debugging is one of the ones where I'd say Numba is still worse than Cython, > however that's being resolved as we speak: https://github.com/numba/numba/ > issues/2788 > One thing I missed in the above list is dependencies: while our use of Cython > only adds a build-time dependency, Numba would add a run-time dependency. Given > that binary wheels and conda packages for all major platforms are available > that's not a showstopper, but it matters. > Overall I'd say that: > - Numba is better than Cython at: performance, ease of use, size of generated > binaries, and templating support for multiple dtypes. Possibly also maintenance > status right now. > - Numba and Cython are about equally good at portability (I think, not much > data about exotic platforms for Numba). > - Cython is better than Numba at: maturity, debugging (but not for long anymore > probably), dependencies. > I'm usually pretty conservative in these things, but considering the above I'm > leaning towards saying use of Numba should be allowed in the future. The added > run-time dependency is the one major downside that's going to stay, however > compared to our Fortran headaches that's a relatively small issue. > Thoughts? 
> Cheers, > Ralf > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev -- Gael Varoquaux Senior Researcher, INRIA Parietal NeuroSpin/CEA Saclay , Bat 145, 91191 Gif-sur-Yvette France Phone: ++ 33-1-69-08-79-68 http://gael-varoquaux.info http://twitter.com/GaelVaroquaux From sseibert at anaconda.com Thu Mar 8 09:44:04 2018 From: sseibert at anaconda.com (Stanley Seibert) Date: Thu, 8 Mar 2018 08:44:04 -0600 Subject: [SciPy-Dev] Numba as a dependency for SciPy? In-Reply-To: References: <67c2004c-2234-c3fc-9b2a-3704dfd5a450@iki.fi> <4cc5f89c-201e-91b1-a849-5990bb05af18@iki.fi> Message-ID: TBH, I agree with this general sentiment. This thread has been very valuable to clarify the Numba team's understanding of what the SciPy community needs from a compilation solution. Our trajectory is good, but we're not quite there yet for a project that needs to be as conservative about dependencies as SciPy. We will keep working to get there, though. However, if someone is interested in trying to implement some SciPy functions with Numba implementations, there's nothing blocking that work in a separate repository as an experiment. (One of the Numba developers has already named this hypothetical project "Scumba," which I quite like.) If anyone does decide to try this, please make sure to ping the Numba developers on Gitter. We will learn a great deal from the effort, I think. On Thu, Mar 8, 2018 at 4:00 AM, Matthew Brett wrote: > Hi, > > On Thu, Mar 8, 2018 at 9:38 AM, Pauli Virtanen wrote: > > Ralf Gommers kirjoitti 08.03.2018 klo 08:04: > > [clip] > >> > >> Also, I don't think performance will necessarily be unacceptable. There > >> are > >> a bunch of places in the existing code base where we can throw in @jit > and > >> get speedups basically for free. 
Performance in the noop case will then > be > >> what it is today - not great, but apparently also not enough of a > problem > >> that someone has attempted to go to Cython. > > > > > > I guess you agree that Numba would regardless be declared a dependency in > > setup.py? People on unsupported arches can edit it away manually. > > > > For computational tight loops operating on arrays when Numba is used as > an > > alternative to Cython/C/Fortran, there probably will be a performance > hit in > > the ballpark of 100x. > > > > If we are planning to use numba features more fully, e.g. numba.cfunc > e.g. > > to write callback functions, that would also require Numba as a hard > > dependency. > > If we were at the top of the stack, like pystatsmodels, then this > would be reasonable, but, if we make numba a dependency, that makes > numba a dependency for almost anyone doing scientific computing. I > think we do have to care about people not running on Intel. If we > make numba an optional dependency, it gives us an additional > maintenance burden, because we'd have to check for each numba segment, > whether it is going to be disabling for a user without numba. > > Is there anything we have at the moment where Cython won't get us into > the ballpark? If not, my preference would be to wait for a year or > so, to see how things turn out. > > Cheers, > > Matthew > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From roberto.bucher at supsi.ch Thu Mar 8 09:26:49 2018 From: roberto.bucher at supsi.ch (Roberto Bucher) Date: Thu, 8 Mar 2018 15:26:49 +0100 Subject: [SciPy-Dev] Strange problem with Kanes Method and Lagrange Method Message-ID: <51413fa9-e66b-d269-9167-70b88eba2e98@supsi.ch> I have two scripts which implement methods to model a wheeled inverted pendulum, the first one implementing Kane's method, the second one Lagrange's. Using Kane, I obtain the right result in the "fr+frstar" variable, but after linearization the matrices A and B are completely wrong. The same behavior seems to happen with the Lagrange script, where after the Lagrange Method the "form_lagranges_equations()" output is correct but the generated matrices A and B are not... In my opinion the right matrices should be: A = np.matrix([[0, 0, 1, 0], [0, 0, 0, 1], [((L_w*M_w + L_p*M_p)*g)/(L_w**2*M_w - J_w + J_p), 0, 0, 0], [-((L_w*M_w + L_p*M_p)*g)/(L_w**2*M_w - J_w + J_p), 0, 0, 0]]) B = np.matrix([[0], [0], [-K_t/(L_w**2*M_w - J_w + J_p)], [(K_t*L_w**2*M_w + J_p*K_t)/(J_w*L_w**2*M_w - J_w**2 + J_p*J_w)]]) Both scripts are attached. Thanks in advance Best regards Roberto -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: pendolo_kane.py Type: text/x-python Size: 2731 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: pendolo_lagrange.py Type: text/x-python Size: 2604 bytes Desc: not available URL: From josh.craig.wilson at gmail.com Thu Mar 8 11:16:20 2018 From: josh.craig.wilson at gmail.com (Joshua Wilson) Date: Thu, 8 Mar 2018 10:16:20 -0600 Subject: [SciPy-Dev] Numba as a dependency for SciPy?
In-Reply-To: References: <67c2004c-2234-c3fc-9b2a-3704dfd5a450@iki.fi> <4cc5f89c-201e-91b1-a849-5990bb05af18@iki.fi> Message-ID: > if someone is interested in trying to implement some SciPy functions with Numba implementations When this thread started I actually did just that for a small part of scipy.special. Code is here: https://github.com/person142/special It currently implements `special.loggamma` and some private functions in special (sinpi, cospi, polynomial evaluation) that are needed to support it. I'm currently seeing a factor of 2 slowdown and trying to figure out why, but I'm very interested in figuring out how to close the speed gap/porting more functions. - Josh On Thu, Mar 8, 2018 at 8:44 AM, Stanley Seibert wrote: > TBH, I agree with this general sentiment. This thread has been very > valuable to clarify the Numba team's understanding of what the SciPy > community needs from a compilation solution. Our trajectory is good, but > we're not quite there yet for a project that needs to be as conservative > about dependencies as SciPy. We will keep working to get there, though. > > However, if someone is interested in trying to implement some SciPy > functions with Numba implementations, there's nothing blocking that work in > a separate repository as an experiment. (One of the Numba developers has > already named this hypothetical project "Scumba," which I quite like.) If > anyone does decide to try this, please make sure to ping the Numba > developers on Gitter. We will learn a great deal from the effort, I think. > > On Thu, Mar 8, 2018 at 4:00 AM, Matthew Brett > wrote: >> >> Hi, >> >> On Thu, Mar 8, 2018 at 9:38 AM, Pauli Virtanen wrote: >> > Ralf Gommers kirjoitti 08.03.2018 klo 08:04: >> > [clip] >> >> >> >> Also, I don't think performance will necessarily be unacceptable. There >> >> are >> >> a bunch of places in the existing code base where we can throw in @jit >> >> and >> >> get speedups basically for free. 
Performance in the noop case will then >> >> be >> >> what it is today - not great, but apparently also not enough of a >> >> problem >> >> that someone has attempted to go to Cython. >> > >> > >> > I guess you agree that Numba would regardless be declared a dependency >> > in >> > setup.py? People on unsupported arches can edit it away manually. >> > >> > For computational tight loops operating on arrays when Numba is used as >> > an >> > alternative to Cython/C/Fortran, there probably will be a performance >> > hit in >> > the ballpark of 100x. >> > >> > If we are planning to use numba features more fully, e.g. numba.cfunc >> > e.g. >> > to write callback functions, that would also require Numba as a hard >> > dependency. >> >> If we were at the top of the stack, like pystatsmodels, then this >> would be reasonable, but, if we make numba a dependency, that makes >> numba a dependency for almost anyone doing scientific computing. I >> think we do have to care about people not running on Intel. If we >> make numba an optional dependency, it gives us an additional >> maintenance burden, because we'd have to check for each numba segment, >> whether it is going to be disabling for a user without numba. >> >> Is there anything we have at the moment where Cython won't get us into >> the ballpark? If not, my preference would be to wait for a year or >> so, to see how things turn out. >> >> Cheers, >> >> Matthew >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at python.org >> https://mail.python.org/mailman/listinfo/scipy-dev > > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > From juanlu001 at gmail.com Thu Mar 8 12:25:48 2018 From: juanlu001 at gmail.com (Juan Luis Cano) Date: Thu, 8 Mar 2018 18:25:48 +0100 Subject: [SciPy-Dev] Numba as a dependency for SciPy? 
In-Reply-To: References: <67c2004c-2234-c3fc-9b2a-3704dfd5a450@iki.fi> <4cc5f89c-201e-91b1-a849-5990bb05af18@iki.fi> Message-ID: On Thu, Mar 8, 2018 at 5:16 PM, Joshua Wilson wrote: > > if someone is interested in trying to implement some SciPy functions > with Numba implementations > > When this thread started I actually did just that for a small part of > scipy.special. Code is here: > Now that you mention scipy.special, here is another data point with some promising benchmarks, wrapping CEPHES with CFFI: https://github.com/poliastro/pycephes#performance > > https://github.com/person142/special > > It currently implements `special.loggamma` and some private functions > in special (sinpi, cospi, polynomial evaluation) that are needed to > support it. I'm currently seeing a factor of 2 slowdown and trying to > figure out why, but I'm very interested in figuring out how to close > the speed gap/porting more functions. > > - Josh > > On Thu, Mar 8, 2018 at 8:44 AM, Stanley Seibert > wrote: > > TBH, I agree with this general sentiment. This thread has been very > > valuable to clarify the Numba team's understanding of what the SciPy > > community needs from a compilation solution. Our trajectory is good, but > > we're not quite there yet for a project that needs to be as conservative > > about dependencies as SciPy. We will keep working to get there, though. > > > > However, if someone is interested in trying to implement some SciPy > > functions with Numba implementations, there's nothing blocking that work > in > > a separate repository as an experiment. (One of the Numba developers has > > already named this hypothetical project "Scumba," which I quite like.) > If > > anyone does decide to try this, please make sure to ping the Numba > > developers on Gitter. We will learn a great deal from the effort, I > think. 
> > > > On Thu, Mar 8, 2018 at 4:00 AM, Matthew Brett > > wrote: > >> > >> Hi, > >> > >> On Thu, Mar 8, 2018 at 9:38 AM, Pauli Virtanen wrote: > >> > Ralf Gommers kirjoitti 08.03.2018 klo 08:04: > >> > [clip] > >> >> > >> >> Also, I don't think performance will necessarily be unacceptable. > There > >> >> are > >> >> a bunch of places in the existing code base where we can throw in > @jit > >> >> and > >> >> get speedups basically for free. Performance in the noop case will > then > >> >> be > >> >> what it is today - not great, but apparently also not enough of a > >> >> problem > >> >> that someone has attempted to go to Cython. > >> > > >> > > >> > I guess you agree that Numba would regardless be declared a dependency > >> > in > >> > setup.py? People on unsupported arches can edit it away manually. > >> > > >> > For computational tight loops operating on arrays when Numba is used > as > >> > an > >> > alternative to Cython/C/Fortran, there probably will be a performance > >> > hit in > >> > the ballpark of 100x. > >> > > >> > If we are planning to use numba features more fully, e.g. numba.cfunc > >> > e.g. > >> > to write callback functions, that would also require Numba as a hard > >> > dependency. > >> > >> If we were at the top of the stack, like pystatsmodels, then this > >> would be reasonable, but, if we make numba a dependency, that makes > >> numba a dependency for almost anyone doing scientific computing. I > >> think we do have to care about people not running on Intel. If we > >> make numba an optional dependency, it gives us an additional > >> maintenance burden, because we'd have to check for each numba segment, > >> whether it is going to be disabling for a user without numba. > >> > >> Is there anything we have at the moment where Cython won't get us into > >> the ballpark? If not, my preference would be to wait for a year or > >> so, to see how things turn out. 
> >> > >> Cheers, > >> > >> Matthew > >> _______________________________________________ > >> SciPy-Dev mailing list > >> SciPy-Dev at python.org > >> https://mail.python.org/mailman/listinfo/scipy-dev > > > > > > _______________________________________________ > > SciPy-Dev mailing list > > SciPy-Dev at python.org > > https://mail.python.org/mailman/listinfo/scipy-dev > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > -- Juan Luis Cano -------------- next part -------------- An HTML attachment was scrubbed... URL: From bwana.marko at yahoo.com Thu Mar 8 12:43:07 2018 From: bwana.marko at yahoo.com (Mark Mikofski) Date: Thu, 8 Mar 2018 17:43:07 +0000 (UTC) Subject: [SciPy-Dev] Numba as a dependency for SciPy? In-Reply-To: References: <67c2004c-2234-c3fc-9b2a-3704dfd5a450@iki.fi> <4cc5f89c-201e-91b1-a849-5990bb05af18@iki.fi> Message-ID: <346596039.13511618.1520530987803@mail.yahoo.com> Here are some benchmarks by Will Holmgren comparing brentq with numba jit variations: https://github.com/wholmgren/ivnumba and here is a start at a cython API for scipy.optimize.zeros: https://github.com/scipy/scipy/pull/8431 although it's not apples to apples since I've only done bisection; I'll try to rush to do brentq for comparison, but this thread raises a lot of questions and I wonder if I should continue working on cythonizing scipy.optimize.zeros? Also, Sent from Yahoo Mail on Android On Thu, Mar 8, 2018 at 9:26 AM, Juan Luis Cano wrote: _______________________________________________ SciPy-Dev mailing list SciPy-Dev at python.org https://mail.python.org/mailman/listinfo/scipy-dev -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- An embedded and charset-unspecified text was scrubbed...
Name: Untitled URL: From adibhar97 at gmail.com Thu Mar 8 14:00:10 2018 From: adibhar97 at gmail.com (Aditya Bharti) Date: Fri, 9 Mar 2018 00:30:10 +0530 Subject: [SciPy-Dev] GSOC 2018 [Starting advice, not proposal] Message-ID: Hi all, Goal of this email: To ask for general advice on starting contributions to scipy for GSOC 2018 To summarise: I am interested in a project which aligns well with my research and open source experience, but I'm having a little trouble finding issues to start working on. I am an undergraduate researcher in Computer Vision, interested in contributing to scipy. I was specifically looking forward to the second project on the github project ideas page: Rotation Formalism in 3 dimensions. I think I would be a good fit for the project because I have experience with contributing to open source in general, and specifically to C++ and python codebases [1]. Due to my academic background, I am already comfortable with 3D rotations being represented as matrices, Euler angles and vectors as these are used extensively in Computer Vision. Being a researcher, I will have no problem understanding academic literature on the subject to extend that knowledge. With the initial setup completed, in order to get started with contributions to scipy, I have asked around on multiple issues on the main scipy repo with the 'good first issue' tags; however, after discussion it was realised that those bugs either require a very deep understanding of the codebase [2], or need to be fixed by the external library author and require no work on scipy's side [3]. I would greatly appreciate it if you could give me advice on getting started with scipy or just generally making a strong application to scipy.
Thank you, Aditya Bharti (Undergraduate Researcher, International Institute of Information Technology Hyderabad, India) [1]: https://mzl.la/2oV3PtL [2]: https://github.com/scipy/scipy/issues/8385#issuecomment-366196873 [3]: https://github.com/scipy/scipy/issues/4819#issuecomment-36681474 github: https://github.com/adbugger bugzilla: https://mzl.la/2FyzaMp P.S. I apologise in advance if the mailing list is not the correct place to ask for such advice. I'd be happy to ask on more appropriate channels. -------------- next part -------------- An HTML attachment was scrubbed... URL: From bwana.marko at yahoo.com Thu Mar 8 13:43:14 2018 From: bwana.marko at yahoo.com (Mark Mikofski) Date: Thu, 8 Mar 2018 18:43:14 +0000 (UTC) Subject: [SciPy-Dev] Numba as a dependency for SciPy? In-Reply-To: References: <67c2004c-2234-c3fc-9b2a-3704dfd5a450@iki.fi> <4cc5f89c-201e-91b1-a849-5990bb05af18@iki.fi> Message-ID: <857552922.13562871.1520534594681@mail.yahoo.com> Hi all, there's another benchmark by Will Holmgren of jitting brentq here: https://github.com/wholmgren/ivnumba and cythonizing scipy.optimize.zeros here: https://github.com/scipy/scipy/pull/8431 but I wonder if I should keep working on this? I believe for looping a lot of speed up can quickly come from numpy. Also, I think cython and numba are both optimized to work with numpy ufuncs. Sent from Yahoo Mail on Android On Thu, Mar 8, 2018 at 9:26 AM, Juan Luis Cano wrote: _______________________________________________ SciPy-Dev mailing list SciPy-Dev at python.org https://mail.python.org/mailman/listinfo/scipy-dev -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- An embedded and charset-unspecified text was scrubbed... Name: Untitled URL: From pav at iki.fi Thu Mar 8 12:38:01 2018 From: pav at iki.fi (Pauli Virtanen) Date: Thu, 8 Mar 2018 18:38:01 +0100 Subject: [SciPy-Dev] Numba as a dependency for SciPy?
In-Reply-To: References: <67c2004c-2234-c3fc-9b2a-3704dfd5a450@iki.fi> <4cc5f89c-201e-91b1-a849-5990bb05af18@iki.fi> Message-ID: Juan Luis Cano kirjoitti 08.03.2018 klo 18:25: > Now that you mention scipy.special, here is another data point with some > promising benchmarks, wrapping CEPHES with CFFI: Note that this is not apples to apples --- it is comparing scalar functions to ufuncs. Those scipy.special functions are ufuncs, and the only overhead is numpy ufunc machinery. From ralf.gommers at gmail.com Fri Mar 9 02:02:44 2018 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Thu, 8 Mar 2018 23:02:44 -0800 Subject: [SciPy-Dev] scipep repo In-Reply-To: References: Message-ID: On Thu, Mar 8, 2018 at 11:02 PM, Ralf Gommers wrote: > > > On Fri, Feb 16, 2018 at 9:12 AM, Scott Sievert > wrote: > >> What should it be called? "seps" is a bit ambiguous, maybe just go for a >> long name "scipy-enhancement-proposals"? >> >> I've been calling it 'scipeps' for 'Scientific Python Enhancement >> Proposals' because I think it plays well with the acronym 'SciPy'. >> > > Sounds good to me, more concise than mine. Unless there's last minute > objections, let's go with the `scipeps` repo name and SciPEP acronym. > > But NumPy and Matplotlib have NEPs and MEPs for {NumPy, Matplotlib} >> Enhancement Proposals respectively. On GitHub, they're folders (not repos) >> of names 'neps' [1] >> > Things are in flux right now: https://github.com/numpy/neps. I'd like to > just copy that setup, but it looks like it will take some days to > stabilize. @Stefan/Jarrod/Nathaniel: are we now keeping neps sources in [1] > and only have autogenerated commits in https://github.com/numpy/neps ? > > Ralf > > and 'MEP' [2].
>> Scott >> >> [1]: https://github.com/numpy/numpy/tree/master/doc/neps >> [2]: https://github.com/matplotlib/matplotlib/tree/master/doc/devel/MEP >> >> On February 16, 2018 at 9:00:22 AM, Ralf Gommers (ralf.gommers at gmail.com) >> wrote: >> >> >> >> On Fri, Feb 16, 2018 at 2:18 AM, Andrew Nelson >> wrote: >> >>> Scott Sievert and I are writing an enhancement proposal, think of it as >>> a scipep, to make an Optimizer class. Is it worth keeping such proposals in >>> its own repo, e.g. scipy/scipep? >>> >> >> That seems like a good idea. It follows Python, NumPy, etc. - no need to >> reinvent this wheel. >> >> >>> I'm not sure I can create that myself. >>> >> >> No, only owners can create new repos. What should it be called? "seps" is >> a bit ambiguous, maybe just go for a long name >> "scipy-enhancement-proposals"? >> >> Ralf >> >> >>> >>> _______________________________________________ >>> SciPy-Dev mailing list >>> SciPy-Dev at python.org >>> https://mail.python.org/mailman/listinfo/scipy-dev >>> >>> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at python.org >> https://mail.python.org/mailman/listinfo/scipy-dev >> >> > -------------- next part -------------- An HTML attachment was scrubbed... URL: From stefanv at berkeley.edu Fri Mar 9 03:24:27 2018 From: stefanv at berkeley.edu (Stefan van der Walt) Date: Fri, 9 Mar 2018 00:24:27 -0800 Subject: [SciPy-Dev] scipep repo In-Reply-To: References: Message-ID: <20180309082427.ulq6xamaypnjzdrh@carbo> On Thu, 08 Mar 2018 23:02:44 -0800, Ralf Gommers wrote: > > Things are in flux right now: https://github.com/numpy/neps. I'd like to > > just copy that setup, but it looks like it will take some days to > > stabilize. @Stefan/Jarrod/Nathaniel: are we now keeping neps sources in [1] > > and only have autogenerated commits in https://github.com/numpy/neps > > ?
The NEP sources are stored in the NumPy source repo: https://github.com/numpy/numpy/tree/master/doc/neps These Sphinx docs are rendered to https://github.com/numpy/neps, which is online at: http://www.numpy.org/neps/ The CircleCI setup for building & deploying the docs is here: https://github.com/numpy/numpy/pull/10702 (It's very similar to the one used for SciPy except that, because we have to push to multiple repos, there's some footwork around handling SSH keys, and a script for pushing and replacing the built NEP webpage.) Best regards Stéfan From pav at iki.fi Fri Mar 9 04:13:25 2018 From: pav at iki.fi (Pauli Virtanen) Date: Fri, 9 Mar 2018 10:13:25 +0100 Subject: [SciPy-Dev] scipep repo In-Reply-To: <20180309082427.ulq6xamaypnjzdrh@carbo> References: <20180309082427.ulq6xamaypnjzdrh@carbo> Message-ID: Stefan van der Walt kirjoitti 09.03.2018 klo 09:24: [clip] > (It's very similar to the one used for SciPy except that, because we > have to push to multiple repos, there's some footwork around handling > SSH keys, and a script for pushing and replacing the built NEP webpage.) Readthedocs could be an alternative for a less involved setup --- no circleci, no ssh keys, etc. Pauli From ralf.gommers at gmail.com Sat Mar 10 00:31:58 2018 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Fri, 9 Mar 2018 21:31:58 -0800 Subject: [SciPy-Dev] Numba as a dependency for SciPy? In-Reply-To: <4cc5f89c-201e-91b1-a849-5990bb05af18@iki.fi> References: <67c2004c-2234-c3fc-9b2a-3704dfd5a450@iki.fi> <4cc5f89c-201e-91b1-a849-5990bb05af18@iki.fi> Message-ID: On Thu, Mar 8, 2018 at 1:38 AM, Pauli Virtanen wrote: > Ralf Gommers kirjoitti 08.03.2018 klo 08:04: > [clip] > >> Also, I don't think performance will necessarily be unacceptable. There >> are >> a bunch of places in the existing code base where we can throw in @jit and >> get speedups basically for free.
Performance in the noop case will then be >> what it is today - not great, but apparently also not enough of a problem >> that someone has attempted to go to Cython. >> > > I guess you agree that Numba would regardless be declared a dependency in > setup.py? People on unsupported arches can edit it away manually. > Yes, that's what I was thinking. > > For computational tight loops operating on arrays when Numba is used as an > alternative to Cython/C/Fortran, there probably will be a performance hit > in the ballpark of 100x. > > If we are planning to use numba features more fully, e.g. numba.cfunc e.g. > to write callback functions, that would also require Numba as a hard > dependency. Yeah, looks like it's too early for that due to portability concerns. Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Sat Mar 10 00:34:27 2018 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Fri, 9 Mar 2018 21:34:27 -0800 Subject: [SciPy-Dev] Numba as a dependency for SciPy? In-Reply-To: References: <67c2004c-2234-c3fc-9b2a-3704dfd5a450@iki.fi> <4cc5f89c-201e-91b1-a849-5990bb05af18@iki.fi> Message-ID: On Thu, Mar 8, 2018 at 2:00 AM, Matthew Brett wrote: > Hi, > > On Thu, Mar 8, 2018 at 9:38 AM, Pauli Virtanen wrote: > > Ralf Gommers kirjoitti 08.03.2018 klo 08:04: > > [clip] > >> > >> Also, I don't think performance will necessarily be unacceptable. There > >> are > >> a bunch of places in the existing code base where we can throw in @jit > and > >> get speedups basically for free. Performance in the noop case will then > be > >> what it is today - not great, but apparently also not enough of a > problem > >> that someone has attempted to go to Cython. > > > > > > I guess you agree that Numba would regardless be declared a dependency in > > setup.py? People on unsupported arches can edit it away manually. 
> > > > For computational tight loops operating on arrays when Numba is used as > an > > alternative to Cython/C/Fortran, there probably will be a performance > hit in > > the ballpark of 100x. > > > > If we are planning to use numba features more fully, e.g. numba.cfunc > e.g. > > to write callback functions, that would also require Numba as a hard > > dependency. > > If we were at the top of the stack, like pystatsmodels, then this > would be reasonable, but, if we make numba a dependency, that makes > numba a dependency for almost anyone doing scientific computing. I > think we do have to care about people not running on Intel. If we > make numba an optional dependency, it gives us an additional > maintenance burden, because we'd have to check for each numba segment, > whether it is going to be disabling for a user without numba. > > Is there anything we have at the moment where Cython won't get us into > the ballpark? That's pretty much an irrelevant question - Cython is typically 2x to 4x slower so that's in the ballpark, however the problem is ease of use with Cython is so much worse that a lot of code simply won't get ported and just stays pure Python. > If not, my preference would be to wait for a year or > so, to see how things turn out. > Yes, agreed. The consensus seems to be "no" for now - we can revisit once ARM & co are better supported. Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Sat Mar 10 00:41:18 2018 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Fri, 9 Mar 2018 21:41:18 -0800 Subject: [SciPy-Dev] GSOC 2018 [Starting advice, not proposal] In-Reply-To: References: Message-ID: On Thu, Mar 8, 2018 at 11:00 AM, Aditya Bharti wrote: > Hi all, > > Goal of this email: To ask for general advice on starting contributions to > scipy for GSOC 2018 > Hi Aditya, welcome! 
> > To summarise: I am interested in a project which aligns well with my > research and open source experience, but I'm having a little trouble > finding issues to start working on. > > > I am an undergraduate researcher in Computer Vision, interested in > contributing to scipy. I was specifically looking forward to the second > project on the github project ideas page > : > Rotation Formalism in 3 dimensions. > > I think I would be a good fit for the project because I have experience > with contributing to open source in general, and specifically to C++ and > python codebases [1] . Due to my academic > background, I am already comfortable with 3D rotations being represented as > matrices, Euler angles and vectors as these are used extensively in > Computer Vision. Being a researcher, I will have no problem understanding > academic literature on the subject to extend that knowledge. > > The initial setup completed, in order to get started with contributions to > scipy, I have asked around on multiple issues on the main scipy repo with > the 'good first issue' tags, however, after discussion it was realised that > those bugs either require a very deep understanding of the codebase [2 > ] > , or > need to be fixed by the external library author and require no work on > scipy's side [3] > . > > I would greatly appreciate it if you could give me advice getting started > with scipy or just generally making a strong application to scipy. > This email is a good start, clear introduction of yourself and questions. To get started, I'm thinking scipy.ndimage may be a good module to look at issues for. Both because it should be relatively familiar given your background, and because at the moment it's one of the more actively developed modules. A good first issue could be https://github.com/scipy/scipy/issues/7453, adding new keywords to make the API more consistent. 
https://github.com/scipy/scipy/issues/7334 is a quite straightforward one, maybe even better to start with that. Cheers, Ralf > > Thank you, > Aditya Bharti > (Undergraduate Researcher, International Institute of Information > Technology Hyderabad, India) > > [1]: https://mzl.la/2oV3PtL > [2]: https://github.com/scipy/scipy/issues/8385#issuecomment-366196873 > [3]: https://github.com/scipy/scipy/issues/4819#issuecomment-36681474 > > > github: https://github.com/adbugger > bugzilla: https://mzl.la/2FyzaMp > > P.S. I apologise in advance if the mailing list is not the correct place > to ask for such advice. I'd be happy to ask on more appropriate channels. > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From bwana.marko at yahoo.com Sat Mar 10 03:12:56 2018 From: bwana.marko at yahoo.com (Mark Mikofski) Date: Sat, 10 Mar 2018 08:12:56 +0000 (UTC) Subject: [SciPy-Dev] discussion of options for vectorized scipy.optimize.zeros.newton References: <673991956.14648464.1520669576952.ref@mail.yahoo.com> Message-ID: <673991956.14648464.1520669576952@mail.yahoo.com> PR #8357 proposes vectorizing `scipy.optimize.zeros.newton`. Several options have been discussed in the PR, and I would like to get more feedback from the mailing list. (1) The vectorized version causes a speed reduction for the scalar case. Benchmarks using tests from `test_zeros.py` show from 3X to 9X slowdown. For example `test_newton` with a scalar runs in 46[us] but as a numpy 1-item array takes 408[us]. A suggested option was to move the vectorized approach to a private function, to check the inputs in `newton()`, and call the private vectorized version if the input is not scalar. In this case should we just check the initial guess or all of the extra args?
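For concreteness, the dispatch option described above might look roughly like the following sketch (all names and details are illustrative only, not the actual code in PR #8357):

```python
import numpy as np

def _newton_scalar(func, x0, fprime, tol=1.48e-8, maxiter=50):
    # Plain scalar Newton iteration: the unchanged fast path.
    p = x0
    for _ in range(maxiter):
        step = func(p) / fprime(p)
        p -= step
        if abs(step) < tol:
            break
    return p

def _newton_vectorized(func, x0, fprime, tol=1.48e-8, maxiter=50):
    # Array variant with "delayed termination": items whose step has
    # converged, or whose derivative hit zero, are frozen while the
    # remaining items keep iterating.
    p = np.array(x0, dtype=float)
    active = np.ones(p.shape, dtype=bool)
    for _ in range(maxiter):
        fder = fprime(p)
        nonzero = fder != 0
        step = np.zeros_like(p)
        step[nonzero] = func(p)[nonzero] / fder[nonzero]
        update = active & nonzero
        p[update] -= step[update]
        active = update & (np.abs(step) >= tol)
        if not active.any():
            break
    return p

def newton(func, x0, fprime, **kwargs):
    # Only the initial guess is inspected when choosing a code path.
    if np.isscalar(x0):
        return _newton_scalar(func, x0, fprime, **kwargs)
    return _newton_vectorized(func, x0, fprime, **kwargs)

print(newton(lambda x: x * x - 2, 1.0, lambda x: 2 * x))  # ~ sqrt(2)
```

A real implementation would also need the RuntimeWarning bookkeeping for zero-derivative indices that is discussed in item (2); that is omitted here.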
IMO we can't know that all of the args are actually used for calculating the objective, although they probably are, so the only value we know will be used is the initial guess. So if implementing this option, I propose only checking the initial guess. (2) Halting conditions are also a concern. The current implementation in PR #8357 will raise a RuntimeWarning and return the current guess, if any derivative becomes zero, even though the other items in the array may not have converged. A suggested option is to continue to iterate until either all of the derivatives are zero, or the max newton step of the items with a non-zero derivative is less than the specified tolerance. A RuntimeWarning would still be raised if any of the items had a zero derivative, and the message would provide the indices of those items. An additional option I considered was to add a flag to allow the user to specify either early or delayed termination in the situation where zero derivatives exist. Early termination corresponds to the current implementation in PR #8357 (as of 2f2b3df) and delayed corresponds to the option suggested above. (3) There may be other issues and options I haven't considered. Thanks for your feedback. "As I breathe in, I calm my body, as I breathe out, I smile" - Thich Nhat Hanh -------------- next part -------------- An HTML attachment was scrubbed... URL: From matthew.brett at gmail.com Sat Mar 10 18:06:10 2018 From: matthew.brett at gmail.com (Matthew Brett) Date: Sat, 10 Mar 2018 23:06:10 +0000 Subject: [SciPy-Dev] Numba as a dependency for SciPy?
In-Reply-To: References: <67c2004c-2234-c3fc-9b2a-3704dfd5a450@iki.fi> <4cc5f89c-201e-91b1-a849-5990bb05af18@iki.fi> Message-ID: On Sat, Mar 10, 2018 at 5:34 AM, Ralf Gommers wrote: > > > On Thu, Mar 8, 2018 at 2:00 AM, Matthew Brett > wrote: >> >> Hi, >> >> On Thu, Mar 8, 2018 at 9:38 AM, Pauli Virtanen wrote: >> > Ralf Gommers kirjoitti 08.03.2018 klo 08:04: >> > [clip] >> >> >> >> Also, I don't think performance will necessarily be unacceptable. There >> >> are >> >> a bunch of places in the existing code base where we can throw in @jit >> >> and >> >> get speedups basically for free. Performance in the noop case will then >> >> be >> >> what it is today - not great, but apparently also not enough of a >> >> problem >> >> that someone has attempted to go to Cython. >> > >> > >> > I guess you agree that Numba would regardless be declared a dependency >> > in >> > setup.py? People on unsupported arches can edit it away manually. >> > >> > For computational tight loops operating on arrays when Numba is used as >> > an >> > alternative to Cython/C/Fortran, there probably will be a performance >> > hit in >> > the ballpark of 100x. >> > >> > If we are planning to use numba features more fully, e.g. numba.cfunc >> > e.g. >> > to write callback functions, that would also require Numba as a hard >> > dependency. >> >> If we were at the top of the stack, like pystatsmodels, then this >> would be reasonable, but, if we make numba a dependency, that makes >> numba a dependency for almost anyone doing scientific computing. I >> think we do have to care about people not running on Intel. If we >> make numba an optional dependency, it gives us an additional >> maintenance burden, because we'd have to check for each numba segment, >> whether it is going to be disabling for a user without numba. >> >> Is there anything we have at the moment where Cython won't get us into >> the ballpark? 
> > > That's pretty much an irrelevant question - Cython is typically 2x to 4x > slower so that's in the ballpark, however the problem is ease of use with > Cython is so much worse that a lot of code simply won't get ported and just > stays pure Python. I must say, I haven't had that experience myself. Yes, the barrier to entry is relatively high, but once you're over that, I have found Cython fairly agreeable to work with. Cheers, Matthew From garyfallidis at gmail.com Sat Mar 10 20:50:50 2018 From: garyfallidis at gmail.com (Eleftherios Garyfallidis) Date: Sun, 11 Mar 2018 01:50:50 +0000 Subject: [SciPy-Dev] Numba as a dependency for SciPy? In-Reply-To: References: <67c2004c-2234-c3fc-9b2a-3704dfd5a450@iki.fi> <4cc5f89c-201e-91b1-a849-5990bb05af18@iki.fi> Message-ID: I looked recently quite deeply into Numba as a possible dependency for DIPY. My opinion is that Numba is not ready to be a dependency for DIPY, nor for numpy/scipy. Actually, you can often easily beat Numba's speed with a bit of Cython code. That is because often Numba is not smart enough in porting some code from Python to C. I think the benchmarks out there are a bit misleading by comparing only very specific things or very simple functions. I understand the excitement but we should be realistic too. The way you write fast code in C can be very different from how you write in Python. Definitely Numba has incredible potential, but we need to give it a bit more time to mature; we also need more generic benchmarks. I would love to hear what the creators of Numba think too. Best, Eleftherios On Sat, Mar 10, 2018 at 6:07 PM Matthew Brett wrote: > On Sat, Mar 10, 2018 at 5:34 AM, Ralf Gommers > wrote: > > > > > > On Thu, Mar 8, 2018 at 2:00 AM, Matthew Brett > > wrote: > >> > >> Hi, > >> > >> On Thu, Mar 8, 2018 at 9:38 AM, Pauli Virtanen wrote: > >> > Ralf Gommers kirjoitti 08.03.2018 klo 08:04: > >> > [clip] > >> >> > >> >> Also, I don't think performance will necessarily be unacceptable.
> There > >> >> are > >> >> a bunch of places in the existing code base where we can throw in > @jit > >> >> and > >> >> get speedups basically for free. Performance in the noop case will > then > >> >> be > >> >> what it is today - not great, but apparently also not enough of a > >> >> problem > >> >> that someone has attempted to go to Cython. > >> > > >> > > >> > I guess you agree that Numba would regardless be declared a dependency > >> > in > >> > setup.py? People on unsupported arches can edit it away manually. > >> > > >> > For computational tight loops operating on arrays when Numba is used > as > >> > an > >> > alternative to Cython/C/Fortran, there probably will be a performance > >> > hit in > >> > the ballpark of 100x. > >> > > >> > If we are planning to use numba features more fully, e.g. numba.cfunc > >> > e.g. > >> > to write callback functions, that would also require Numba as a hard > >> > dependency. > >> > >> If we were at the top of the stack, like pystatsmodels, then this > >> would be reasonable, but, if we make numba a dependency, that makes > >> numba a dependency for almost anyone doing scientific computing. I > >> think we do have to care about people not running on Intel. If we > >> make numba an optional dependency, it gives us an additional > >> maintenance burden, because we'd have to check for each numba segment, > >> whether it is going to be disabling for a user without numba. > >> > >> Is there anything we have at the moment where Cython won't get us into > >> the ballpark? > > > > > > That's pretty much an irrelevant question - Cython is typically 2x to 4x > > slower so that's in the ballpark, however the problem is ease of use with > > Cython is so much worse that a lot of code simply won't get ported and > just > > stays pure Python. > > I must say, I haven't had that experience myself. Yes, the barrier to > entry is relatively high, but once you're over that, I have found > Cython fairly agreeable to work with. 
> > Cheers, > > Matthew > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > -------------- next part -------------- An HTML attachment was scrubbed... URL: From matthew.brett at gmail.com Sun Mar 11 07:50:57 2018 From: matthew.brett at gmail.com (Matthew Brett) Date: Sun, 11 Mar 2018 11:50:57 +0000 Subject: [SciPy-Dev] Numba as a dependency for SciPy? In-Reply-To: References: <67c2004c-2234-c3fc-9b2a-3704dfd5a450@iki.fi> <4cc5f89c-201e-91b1-a849-5990bb05af18@iki.fi> Message-ID: Hi Eleftherios, On Sun, Mar 11, 2018 at 1:50 AM, Eleftherios Garyfallidis wrote: > I looked recently quite deeply into Numba as a possible dependency for DIPY. > My opinion is that Numba is not ready to be a dependency for Dipy and > neither numpy/scipy too. Actually, you can often easily beat Numba's speed > with a bit of Cython code. That is because often Numba is not smart enough > in porting some code from Python to C. I think the benchmarks out there are > a bit misleading by comparing only very specific things or very simple > functions. I understand the excitement but we should be realistic too. The > way you write fast code in C can be very different from how you write in > Python. Definitely Numba has incredible potential but we need to give it a > bit more time to mature also we need more generic benchmarks. I would love > to hear what the creators of Numba think too. Did you see Matthew Rocklin's blog post at https://matthewrocklin.com/blog/work/2018/01/30/the-case-for-numba ? Have a look at the section near the end "Update from the original blogpost authors" - your impression reminded me of those comments. 
Cheers, Matthew From Former at physicist.net Sun Mar 11 23:57:57 2018 From: Former at physicist.net (Adam) Date: Sun, 11 Mar 2018 20:57:57 -0700 Subject: [SciPy-Dev] New pull request, fixing ellipj Message-ID: <1520827077.13359.21.camel@physicist.net> Hello everybody, I've submitted a pull request (8548) to fix issue 8480. The problem here was that whenever m was near 1.0 and u>K, ellipj was producing wildly inaccurate values. (Note that K=ellipk(m), the quarter-period). This was caused by the underlying formula used by the implementation (16.15.4 from Abramowitz and Stegun) that was only valid when phi From mikofski at berkeley.edu Mon Mar 12 12:40:37 2018 From: mikofski at berkeley.edu (Mark Alexander Mikofski) Date: Mon, 12 Mar 2018 09:40:37 -0700 Subject: [SciPy-Dev] New email address to avoid SPAM filters Message-ID: Hi All, It's come to my attention that my mailing list emails *may* have been going into your SPAM folders. Perhaps that's by design, but just in case, I've changed my email address to mikofski at berkeley.edu (from bwana.marko at yahoo.com). If you want or need to find my emails, you can search the February and March, 2018, archives by Author for "Mark Mikofski". Other than my introductions, my other threads were about a proposal to vectorize some of the scipy.optimize.zeros methods using either NumPy or Cython. * intro: https://mail.python.org/pipermail/scipy-dev/2018-February/022448.html * vectorize newton (gh8357): https://mail.python.org/pipermail/scipy-dev/2018-March/022621.html * gh8357: https://github.com/scipy/scipy/pull/8357 * Cythonize optimize.zeros (gh8431): https://github.com/scipy/scipy/pull/8431 I apologize if *this* is the SPAM message. That would be ironic. -- Mark Mikofski, PhD (2005) (510) 862-6891 -------------- next part -------------- An HTML attachment was scrubbed...
URL: From ilhanpolat at gmail.com Mon Mar 12 12:55:37 2018 From: ilhanpolat at gmail.com (Ilhan Polat) Date: Mon, 12 Mar 2018 17:55:37 +0100 Subject: [SciPy-Dev] New email address to avoid SPAM filters In-Reply-To: References: Message-ID: Indeed I've found them in my spam folder with the notice "This message has a from address in yahoo.com but has failed yahoo.com's required tests for authentication." On Mon, Mar 12, 2018 at 5:40 PM, Mark Alexander Mikofski < mikofski at berkeley.edu> wrote: > Hi All, > > It's come to my attention that my mailing list emails *may* have been > going into your SPAM folders. Perhaps that's by design, but just in case, > I've changed my email address to mikofski at berkeley.edu (from > bwana.marko at yahoo.com). > > If you want or need to find my emails, you can search the February and > March, 2018, archives by Author for "Mark Mikofski". Other than my > introductions, my other threads were about a proposal to vectorize some of > the scipy.optimize.zeros methods using either NumPy or Cython. > > * intro: https://mail.python.org/pipermail/scipy-dev/2018- > February/022448.html > * vectorize newton (gh8357): https://mail.python.org/ > pipermail/scipy-dev/2018-March/022621.html > * gh8357: https://github.com/scipy/scipy/pull/8357 > * Cythonize optimize.zeros (gh8431): https://github.com/ > scipy/scipy/pull/8431 > > I apologize if *this* is the SPAM message. That would be ironic. > > -- > Mark Mikofski, PhD (2005) > (510) 862-6891 > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Mon Mar 12 14:25:42 2018 From: charlesr.harris at gmail.com (Charles R Harris) Date: Mon, 12 Mar 2018 12:25:42 -0600 Subject: [SciPy-Dev] NumPy 1.14.2 released Message-ID: Hi All, I am pleased to announce the release of NumPy 1.14.2. 
This is a bugfix release for some bugs reported following the 1.14.1 release. The major problems dealt with are as follows. - Residual bugs in the new array printing functionality. - Regression resulting in a relocation problem with shared library. - Improved PyPy compatibility. This release supports Python 2.7 and 3.4 - 3.6. Wheels for the release are available on PyPI. Source tarballs, zipfiles, release notes, and the changelog are available on GitHub. The Python 3.6 wheels available from PIP are built with Python 3.6.2 and should be compatible with all previous versions of Python 3.6. The source releases were cythonized with Cython 0.26.1, which is known to *not* support the upcoming Python 3.7 release. People who wish to run Python 3.7 should check out the NumPy repo and try building with the, as yet, unreleased master branch of Cython. Contributors ============ A total of 4 people contributed to this release. People with a "+" by their names contributed a patch for the first time. * Allan Haldane * Charles Harris * Eric Wieser * Pauli Virtanen Pull requests merged ==================== A total of 5 pull requests were merged for this release. * #10674: BUG: Further back-compat fix for subclassed array repr * #10725: BUG: dragon4 fractional output mode adds too many trailing zeros * #10726: BUG: Fix f2py generated code to work on PyPy * #10727: BUG: Fix missing NPY_VISIBILITY_HIDDEN on npy_longdouble_to_PyLong * #10729: DOC: Create 1.14.2 notes and changelog. Cheers, Charles Harris -------------- next part -------------- An HTML attachment was scrubbed... URL: From nino.krvavica at gmail.com Wed Mar 14 16:53:43 2018 From: nino.krvavica at gmail.com (Nino Krvavica) Date: Wed, 14 Mar 2018 21:53:43 +0100 Subject: [SciPy-Dev] Pull request: new function cdf2rdf Message-ID: Hello everybody, I've submitted a new pull request #8550 (https://github.com/scipy/scipy/pull/8550) in which I propose adding a cdf2rdf function to scipy.linalg.
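For intuition about what such a conversion computes, here is the identity for a single conjugate pair written out in plain NumPy (a sketch of the underlying math, not the code in PR #8550; the matrix is made up for illustration):

```python
import numpy as np

# A real matrix with a complex-conjugate eigenvalue pair a +/- b*i.
A = np.array([[0.0, -2.0],
              [1.0,  0.0]])
w, v = np.linalg.eig(A)            # complex eigenvalues/eigenvectors
a, b = w[0].real, w[0].imag        # one member of the conjugate pair
# Real form: the eigenvector columns become [Re(v), Im(v)], and the pair
# of complex eigenvalues becomes the 2x2 real block [[a, b], [-b, a]].
Vr = np.column_stack([v[:, 0].real, v[:, 0].imag])
Dr = np.array([[ a, b],
               [-b, a]])
# The defining identity of the real block form:
print(np.allclose(A @ Vr, Vr @ Dr))  # -> True
```

MATLAB's function of the same name returns exactly this kind of block-diagonal real form, one 2x2 block per conjugate pair.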
A function of the same name exists in matlab (https://www.mathworks.com/help/matlab/ref/cdf2rdf.html), and it is used to convert a complex eigenstructure to a real diagonal block form of eigenvalues and associated real eigenvectors. The function was tested for both single arrays and 1e6 stacked arrays, and it works accurately. Any further comments and/or suggestions are welcome. Thank you! Nino -------------- next part -------------- An HTML attachment was scrubbed... URL: From ndbecker2 at gmail.com Thu Mar 15 08:31:37 2018 From: ndbecker2 at gmail.com (Neal Becker) Date: Thu, 15 Mar 2018 12:31:37 +0000 Subject: [SciPy-Dev] Page trying to load scripts from unathenticated sources Message-ID: When I visit e.g., https://docs.scipy.org/doc/scipy-0.19.0/reference/generated/scipy.sparse.csr_matrix.html with latest chrome-beta, I see the above warning. -------------- next part -------------- An HTML attachment was scrubbed... URL: From jkkulick at amazon.de Thu Mar 15 08:40:02 2018 From: jkkulick at amazon.de (Kulick, Johannes) Date: Thu, 15 Mar 2018 12:40:02 +0000 Subject: [SciPy-Dev] ENH: softmax Message-ID: <9d63c4ba33e94ee3b3f421125160d08f@EX13D21EUA003.ant.amazon.com> I just made a PR to include the softmax function into scipy.special. Feel free to review. https://github.com/scipy/scipy/pull/8556 Best, Johannes Amazon Development Center Germany GmbH Berlin - Dresden - Aachen main office: Krausenstr. 38, 10117 Berlin Geschaeftsfuehrer: Dr.
Ralf Herbrich, Christian Schlaeger
Ust-ID: DE289237879
Eingetragen am Amtsgericht Charlottenburg HRB 149173 B

From ralf.gommers at gmail.com Thu Mar 15 23:13:29 2018
From: ralf.gommers at gmail.com (Ralf Gommers)
Date: Thu, 15 Mar 2018 20:13:29 -0700
Subject: [SciPy-Dev] Page trying to load scripts from unauthenticated sources
In-Reply-To: 
References: 
Message-ID: 

On Thu, Mar 15, 2018 at 5:31 AM, Neal Becker wrote:

> When I visit e.g.,
> https://docs.scipy.org/doc/scipy-0.19.0/reference/generated/scipy.sparse.csr_matrix.html
>
> with latest chrome-beta, I see the above warning.
>

Hmm, the latest Chrome release doesn't show that. Possibly a bug in Chrome?

Ralf
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From jaime.frio at gmail.com Fri Mar 16 03:06:43 2018
From: jaime.frio at gmail.com (Jaime Fernández del Río)
Date: Fri, 16 Mar 2018 07:06:43 +0000
Subject: [SciPy-Dev] Page trying to load scripts from unauthenticated sources
In-Reply-To: 
References: 
Message-ID: 

I'm seeing it with:

Version 65.0.3325.162 (Official Build) (64-bit)

It's just a tiny icon, so it's easy to overlook.

On Fri, Mar 16, 2018 at 4:13 AM Ralf Gommers wrote:

>> On Thu, Mar 15, 2018 at 5:31 AM, Neal Becker wrote:
>>
>>> When I visit e.g.,
>>> https://docs.scipy.org/doc/scipy-0.19.0/reference/generated/scipy.sparse.csr_matrix.html
>>>
>>> with latest chrome-beta, I see the above warning.
>>>
>>
>> Hmm, latest Chrome release doesn't show that. Possibly a bug in Chrome?
>>
>> Ralf
>>
>> _______________________________________________
>> SciPy-Dev mailing list
>> SciPy-Dev at python.org
>> https://mail.python.org/mailman/listinfo/scipy-dev
>>

--
(\__/)
( O.o)
( > <) This is Bunny. Copy Bunny into your signature and help him with his plans for world domination.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From jenshnielsen at gmail.com Fri Mar 16 04:29:15 2018
From: jenshnielsen at gmail.com (Jens Nielsen)
Date: Fri, 16 Mar 2018 08:29:15 +0000
Subject: [SciPy-Dev] Page trying to load scripts from unauthenticated sources
In-Reply-To: 
References: 
Message-ID: 

Looking at the developer console, it looks like the issue is likely a missing HTTPS connection for fonts or CSS loaded from Google:

"
scipy.sparse.csr_matrix.html:1 Mixed Content: The page at 'https://docs.scipy.org/doc/scipy-0.19.0/reference/generated/scipy.sparse.csr_matrix.html' was loaded over HTTPS, but requested an insecure stylesheet 'http://fonts.googleapis.com/css?family=Open+Sans'. This request has been blocked; the content must be served over HTTPS.

MathJax.js?config=TeX-AMS-MML_HTMLorMML:32 WARNING: cdn.mathjax.org has been retired. Check https://www.mathjax.org/cdn-shutting-down/ for migration tips.
replaceScript @ MathJax.js?config=TeX-AMS-MML_HTMLorMML:32
"

There is also a MathJax CDN URL that looks like it needs to be updated.

best
Jens

On Fri, 16 Mar 2018 at 08:13 Jaime Fernández del Río wrote:

> I'm seeing it with:
>
> Version 65.0.3325.162 (Official Build) (64-bit)
>
> It's just a tiny icon, so it's easy to overlook.
>
> On Fri, Mar 16, 2018 at 4:13 AM Ralf Gommers wrote:
>
>> On Thu, Mar 15, 2018 at 5:31 AM, Neal Becker wrote:
>>
>>> When I visit e.g.,
>>> https://docs.scipy.org/doc/scipy-0.19.0/reference/generated/scipy.sparse.csr_matrix.html
>>>
>>> with latest chrome-beta, I see the above warning.
>>>
>>
>> Hmm, latest Chrome release doesn't show that. Possibly a bug in Chrome?
>>
>> Ralf
>>
>> _______________________________________________
>> SciPy-Dev mailing list
>> SciPy-Dev at python.org
>> https://mail.python.org/mailman/listinfo/scipy-dev
>>
>
> --
> (\__/)
> ( O.o)
> ( > <) This is Bunny. Copy Bunny into your signature and help him with his plans for world domination.
> _______________________________________________
> SciPy-Dev mailing list
> SciPy-Dev at python.org
> https://mail.python.org/mailman/listinfo/scipy-dev
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From pav at iki.fi Fri Mar 16 05:59:36 2018
From: pav at iki.fi (Pauli Virtanen)
Date: Fri, 16 Mar 2018 10:59:36 +0100
Subject: [SciPy-Dev] Page trying to load scripts from unauthenticated sources
In-Reply-To: 
References: 
Message-ID: <1521194376.13406.16.camel@iki.fi>

On Fri, 2018-03-16 at 08:29 +0000, Jens Nielsen wrote:
> Looking at the developer console it looks like the issue is likely a
> missing https connection for fonts or CSS loaded from Google
>
> "
> scipy.sparse.csr_matrix.html:1 Mixed Content: The page at
> 'https://docs.scipy.org/doc/scipy-0.19.0/reference/generated/scipy.sparse.csr_matrix.html'
> was loaded over HTTPS, but requested an insecure stylesheet
> 'http://fonts.googleapis.com/css?family=Open+Sans'. This request has
> been blocked; the content must be served over HTTPS.
>
> MathJax.js?config=TeX-AMS-MML_HTMLorMML:32 WARNING: cdn.mathjax.org
> has been retired. Check https://www.mathjax.org/cdn-shutting-down/ for
> migration tips.
> replaceScript @ MathJax.js?config=TeX-AMS-MML_HTMLorMML:32
> "
>
> There is also a mathjax cdn url that looks like it needs to be
> updated

These issues have already been fixed. They are only present in the documentation of old releases (e.g. the 0.19.0 docs you were looking at). We will not rebuild the documentation for old releases.
--
Pauli Virtanen

From lagru at mailbox.org Fri Mar 16 06:33:34 2018
From: lagru at mailbox.org (Lars G.)
Date: Fri, 16 Mar 2018 11:33:34 +0100
Subject: [SciPy-Dev] Documentation / tutorial for peak finding in scipy.signal
Message-ID: <48693513-3b3f-d6b7-9116-85b0eaf834f2@mailbox.org>

Hello,

recently the 3 user-facing functions `find_peaks`, `peak_prominences` and `peak_widths` were added to extend SciPy's peak finding capabilities.
https://scipy.github.io/devdocs/signal.html#peak-finding

During that process the idea came up to add a section on peak finding to the "Signal Processing" tutorial.
https://scipy.github.io/devdocs/tutorial/signal.html

I have tried a few times to come up with a first draft but each time came away with new questions. Therefore I think it might be reasonable to have a small discussion beforehand.
Some issues / questions I'd like addressed are:

- My first impression of the existing tutorial(s) is that only the basic concepts and math are covered with maybe one example or visualizing plot. I think that's a good scope for a tutorial. However, I feel that this concept doesn't translate well to the topic at hand. The concepts and math behind the 3 new functions are very simple and are already covered in the relevant docstrings. So I'm not really sure what additional non-duplicate content would be useful here.

- The new functions are more like building blocks that would be part of a larger processing chain (filtering, peak finding, peak measurement & classification). My impression is that examples would be the most useful here to demonstrate the workflow, but they would be too code- and plot-heavy to fit a tutorial.

- To extend on this: one thing that is very well done in MATLAB is that it provides examples for the usage of each parameter. A good example is MATLAB's version of `find_peaks` itself: https://de.mathworks.com/help/signal/ref/findpeaks.html?requestedDomain=true I think this would be really useful for some of the options of the new functions. E.g. the interplay between `width` and `rel_height` comes to mind. That would require multiple examples, which is too much for a docstring. At least that is my impression.

- To demonstrate peak finding I would need a signal to analyze. I'm not sure how to generate complex demo-signals in only a few lines of code that don't shift the focus of the reader. The best I could come up with can be seen in the `find_peaks` docstring. How is this usually handled? Would it be okay to import this stuff from `misc`?

- Should this tutorial cover peak finding with wavelet transformation as well? Reading the roadmap it seems like it's not decided whether wavelet transformation is within the scope of SciPy. So I'd think this topic should be left out until this is decided.
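To make the interplay between `width` and `rel_height` mentioned above concrete, here is a minimal sketch of the kind of example under discussion; the signal and all parameter values are purely illustrative:

```python
import numpy as np
from scipy.signal import find_peaks

# A synthetic stand-in for a demo signal: two sinusoids plus a
# little noise, generated in a few lines as discussed above.
rng = np.random.RandomState(42)
x = np.linspace(0, 6 * np.pi, 1000)
sig = np.sin(x) + 0.6 * np.sin(2.6 * x) + 0.05 * rng.randn(x.size)

# `width=5` keeps only peaks at least 5 samples wide; `rel_height`
# sets the height (relative to each peak's prominence) at which that
# width is measured, so the two options interact.
peaks, props = find_peaks(sig, height=0, width=5, rel_height=0.5)
```

The returned `props` dict then contains the measured `widths` (among other properties), which is exactly the per-parameter behavior a tutorial example could plot.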
Sorry for the wordy message, and I'm in no way an expert in this topic, so please take this with a grain of salt. ;)

Best regards,
Lars

From larson.eric.d at gmail.com Fri Mar 16 15:11:08 2018
From: larson.eric.d at gmail.com (Eric Larson)
Date: Fri, 16 Mar 2018 15:11:08 -0400
Subject: [SciPy-Dev] Documentation / tutorial for peak finding in scipy.signal
In-Reply-To: <48693513-3b3f-d6b7-9116-85b0eaf834f2@mailbox.org>
References: <48693513-3b3f-d6b7-9116-85b0eaf834f2@mailbox.org>
Message-ID: 

> - The new functions are more like building blocks that would be part of
> a larger processing chain (filtering, peak finding, peak measurement &
> classification). My impression is that examples would be the most useful
> here to demonstrate the workflow but would be too code and plot heavy to
> fit a tutorial.

How about an entry in the scipy cookbook "Signal Processing" or "Other examples" sections?

http://scipy-cookbook.readthedocs.io/

In the SciPy tutorials area you could do some basic things if you want, even if they overlap with what is in the docstrings a bit. Then you could put a link there to the cookbook for a more in-depth example.

> - To extend on this: one thing that is very well done in MATLAB is that
> it provides examples for the usage of each parameter. A good example is
> MATLAB's version of `find_peaks` itself:
> https://de.mathworks.com/help/signal/ref/findpeaks.html?requestedDomain=true
> I think this would be really useful for some of the options of the new
> functions. E.g. the interplay between `width` and `rel_height` comes to
> mind. That would require multiple examples which is too much for a
> docstring. At least that is my impression.

I see MATLAB's documentation as being roughly equivalent to our docstrings. So if it seems like the MATLAB docs are well organized and not too long, the same could go for our docstrings. I don't see too much of an issue if the `find_peaks` docstring becomes a bit longer than usual.
Most of the "need it now" information is concentrated at the top (params). The examples and notes come later, and can be browsed (or not) as necessary by users.

> - To demonstrate peak finding I would need a signal to analyze. I'm not
> sure how to generate complex demo-signals in only a few lines of code
> that don't shift the focus of the reader. The best I could come up with
> can be seen in `find_peaks` docstring. How is this usually handled?
> Would it be okay to import this stuff from `misc`?

We have `face` and `ascent` in `misc` already for 2D signals. I think it would be nice to have something like this for 1D signals, assuming we can find one that won't take up a ton of space (< 500kB maybe, since that's what ascent takes, and `face` takes up 1.5MB?). We don't want to bloat the repo, but a reasonable sound choice could be used in many examples.

The most popular "freesound.org" clip is a creative-commons licensed thunderstorm recording; maybe this could be made mono, truncated, and/or resampled to be sufficiently small? Other ideas welcome.

> - Should this tutorial cover peak finding with wavelet transformation as
> well? Reading the roadmap it seems like it's not decided whether wavelet
> transformation is within the scope of SciPy. So I'd think this topic
> should be left out until this is decided.

We can always add this later, so I'd say only include it if you're motivated to do it.

Eric
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From mikofski at berkeley.edu Sat Mar 17 07:06:58 2018
From: mikofski at berkeley.edu (Mark Alexander Mikofski)
Date: Sat, 17 Mar 2018 04:06:58 -0700
Subject: [SciPy-Dev] vectorized newton proposal
Message-ID: 

Hi all,

I've been working on a branch of scipy.optimize.zeros in PR #8357 that uses NumPy to speed up the Newton method if the initial guess is not scalar. In benchmarks with arrays of 100,000 elements, the NumPy vectorized code is about 40x faster than looping over the existing code.
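For context, a rough sketch of the idea behind such an array-vectorized Newton iteration follows; the names (`newton_vec`, the returned `failed` mask) and the convergence handling here are illustrative assumptions, not the actual PR #8357 code:

```python
import numpy as np

def newton_vec(func, x0, fprime, tol=1.48e-8, maxiter=50):
    # Illustrative vectorized Newton iteration: all elements of the
    # initial-guess array are updated in lockstep with array ops,
    # instead of looping over the scalar solver per element.
    p = np.asarray(x0, dtype=float).copy()
    not_converged = np.ones(p.shape, dtype=bool)
    for _ in range(maxiter):
        fder = fprime(p)
        nz = fder != 0                     # guard against an infinite Newton step
        dp = np.zeros_like(p)
        dp[nz] = func(p)[nz] / fder[nz]
        p -= dp
        not_converged = np.abs(dp) >= tol
        if not not_converged.any():
            break
    return p, not_converged

# Solve x**2 - a = 0 for a whole array of `a` values in one call.
a = np.linspace(1.0, 4.0, 5)
roots, failed = newton_vec(lambda x: x**2 - a, np.ones_like(a),
                           lambda x: 2.0 * x)
```

The speedup comes from replacing a Python-level loop over 100,000 scalar solves with a handful of NumPy array operations per iteration.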
If you have time, please consider reviewing the PR. Here's some background:

* If the initial guess is scalar, then the old scalar newton code is used.
* If the initial guess is *not* scalar, then it calls a private method called "_array_newton".
* There is a new flag called "failure_idx_flag" that in "_array_newton" appends the indices of failures and indices of zero-derivative items to the return as a tuple of 3 arrays (results, failures, zero-derivatives); otherwise just the results are returned.
* It only raises a "RuntimeError" if all items fail; otherwise it will always return an array of results, but it issues a "RuntimeWarning" if there are any possibly inaccurate guesses that didn't converge because the derivative was zero (infinite newton step).

Thanks,
Mark

--
Mark Mikofski, PhD (2005)
*Fiat Lux*
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From lagru at mailbox.org Sat Mar 17 09:33:09 2018
From: lagru at mailbox.org (Lars G.)
Date: Sat, 17 Mar 2018 14:33:09 +0100
Subject: [SciPy-Dev] Documentation / tutorial for peak finding in scipy.signal
In-Reply-To: 
References: <48693513-3b3f-d6b7-9116-85b0eaf834f2@mailbox.org>
Message-ID: <13093fec-766d-ba01-26c5-cf57fd5aa2ce@mailbox.org>

On 16.03.2018 20:11, Eric Larson wrote:
> How about an entry in the scipy cookbook "Signal Processing" or "Other
> examples" sections?
>
> http://scipy-cookbook.readthedocs.io/
>
> In the SciPy tutorials area you could do some basic things if you want,
> even if they overlap with what is in the docstrings a bit. Then you
> could put a link there to the cookbook for a more in-depth example.

Sounds reasonable. I'll create a notebook covering a peak finding example and see how it goes. Maybe something based on beat detection in an ECG signal.

> We have `face` and `ascent` in `misc` already for 2D signals.
I think it
> would be nice to have something like this for 1D signals, assuming we
> can find one that won't take up a ton of space (< 500kB maybe, since
> that's what ascent takes, and `face` takes up 1.5MB?). We don't want to
> bloat the repo, but a reasonable sound choice could be used in many
> examples.
>
> The most popular "freesound.org" clip is a creative-commons licensed
> thunderstorm recording; maybe this could be made mono, truncated,
> and/or resampled to be sufficiently small? Other ideas welcome.

Another source might be the Physionet database, which has an extensive collection of medical signals and is licensed with the "ODC Public Domain Dedication and License", which should be compatible with SciPy's license.

http://www.physionet.org/physiobank/database/
https://opendatacommons.org/licenses/pddl/1.0/

A biosignal like an ECG could be used to demonstrate
- frequency based filtering (artifact elimination),
- spectral analysis and
- beat detection & classification (peak finding & measurement).
E.g. a simple task could be to extract the heart rate or systolic / diastolic blood pressure values depending on the signal. This could provide nice and intuitive "real world" examples.

If you like this idea I'd be happy to create a PR for this.

Best regards,
Lars

From larson.eric.d at gmail.com Sat Mar 17 14:26:20 2018
From: larson.eric.d at gmail.com (Eric Larson)
Date: Sat, 17 Mar 2018 14:26:20 -0400
Subject: [SciPy-Dev] Documentation / tutorial for peak finding in scipy.signal
In-Reply-To: <13093fec-766d-ba01-26c5-cf57fd5aa2ce@mailbox.org>
References: <48693513-3b3f-d6b7-9116-85b0eaf834f2@mailbox.org> <13093fec-766d-ba01-26c5-cf57fd5aa2ce@mailbox.org>
Message-ID: 

I agree an ECG signal is a good choice, and better than my suggestion. It'll be a lower sample rate than audio, too, so you can get a longer signal within my proposed 500k limit.

Let's wait a few days to see if others have thoughts on the signal or its size limit, and if there are no complaints, go for it!
Eric On Sat, Mar 17, 2018 at 9:33 AM, Lars G. wrote: > On 16.03.2018 20:11, Eric Larson wrote:> How about an entry in the scipy > cookbook "Signal Processing" or "Other > > examples" sections? > > > > http://scipy-cookbook.readthedocs.io/ > > > > In the SciPy tutorials area you could do some basic things if you want, > > even if they overlap with what is in the docstrings a bit. Then you > > could put a link there to the cookbook for a more in-depth example. > > Sounds reasonable. I'll create a notebook covering a peak finding > example and see how it goes. Maybe something based on beat detection in > an ECG signal. > > We have `face` and `ascent` in `misc` already for 2D signals. I think it > > would be nice to have something like this for 1D signals, assuming we > > can find one that won't take up a ton of space (< 500kB maybe, since > > that's what ascent takes, and `face` takes up 1.5MB?). We don't want to > > bloat the repo, but a reasonable sound choice could be used in many > > examples. > > > > The most popular "freesound.org " clip is a > > creative-commons licensed thunderstorm recording > > , maybe this could > > be made mono, truncated, and/or resampled to be sufficiently small? > > Other ideas welcome. > > Another source might be the Physionet database has an extensive > collection of medical signals and is licensed with the "OPC Public > Domain Dedication and License" which should be compatible with SciPy's > license. > > http://www.physionet.org/physiobank/database/ > https://opendatacommons.org/licenses/pddl/1.0/ > > A biosignal like an ECG could be used to demonstrate > - frequency based filtering (artifact elimination), > - spectral analysis and > - beat detection & classification (peak finding & measurement). > E.g. a simple task could be to extract the heart rate or systolic / > diastolic blood pressure values depending on the signal. This could > provide nice and intuitive "real world" examples. 
> > If you like this idea I'd be happy to create a PR for this. > > Best regards, > Lars > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > -------------- next part -------------- An HTML attachment was scrubbed... URL: From bennet at umich.edu Sat Mar 17 14:39:12 2018 From: bennet at umich.edu (Bennet Fauber) Date: Sat, 17 Mar 2018 14:39:12 -0400 Subject: [SciPy-Dev] Documentation / tutorial for peak finding in scipy.signal In-Reply-To: References: <48693513-3b3f-d6b7-9116-85b0eaf834f2@mailbox.org> <13093fec-766d-ba01-26c5-cf57fd5aa2ce@mailbox.org> Message-ID: This sounds like it would be a very good example. On Sat, Mar 17, 2018 at 2:26 PM, Eric Larson wrote: > I agree an ECG signal is a good choice, and better than my suggestion. It'll > be a lower sample rate than audio, too, so you can get a longer signal in my > proposed 500k limit. > > Let's wait a few days to see if others have thoughts on the signal or its > size limit, and if there are no complaints, go for it! > > Eric > > > On Sat, Mar 17, 2018 at 9:33 AM, Lars G. wrote: >> >> On 16.03.2018 20:11, Eric Larson wrote:> How about an entry in the scipy >> cookbook "Signal Processing" or "Other >> > examples" sections? >> > >> > http://scipy-cookbook.readthedocs.io/ >> > >> > In the SciPy tutorials area you could do some basic things if you want, >> > even if they overlap with what is in the docstrings a bit. Then you >> > could put a link there to the cookbook for a more in-depth example. >> >> Sounds reasonable. I'll create a notebook covering a peak finding >> example and see how it goes. Maybe something based on beat detection in >> an ECG signal. >> > We have `face` and `ascent` in `misc` already for 2D signals. 
I think it >> > would be nice to have something like this for 1D signals, assuming we >> > can find one that won't take up a ton of space (< 500kB maybe, since >> > that's what ascent takes, and `face` takes up 1.5MB?). We don't want to >> > bloat the repo, but a reasonable sound choice could be used in many >> > examples. >> > >> > The most popular "freesound.org " clip is a >> > creative-commons licensed thunderstorm recording >> > , maybe this could >> > be made mono, truncated, and/or resampled to be sufficiently small? >> > Other ideas welcome. >> >> Another source might be the Physionet database has an extensive >> collection of medical signals and is licensed with the "OPC Public >> Domain Dedication and License" which should be compatible with SciPy's >> license. >> >> http://www.physionet.org/physiobank/database/ >> https://opendatacommons.org/licenses/pddl/1.0/ >> >> A biosignal like an ECG could be used to demonstrate >> - frequency based filtering (artifact elimination), >> - spectral analysis and >> - beat detection & classification (peak finding & measurement). >> E.g. a simple task could be to extract the heart rate or systolic / >> diastolic blood pressure values depending on the signal. This could >> provide nice and intuitive "real world" examples. >> >> If you like this idea I'd be happy to create a PR for this. 
>> >> Best regards, >> Lars >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at python.org >> https://mail.python.org/mailman/listinfo/scipy-dev > > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > From ralf.gommers at gmail.com Sat Mar 17 15:59:44 2018 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Sat, 17 Mar 2018 12:59:44 -0700 Subject: [SciPy-Dev] Documentation / tutorial for peak finding in scipy.signal In-Reply-To: References: <48693513-3b3f-d6b7-9116-85b0eaf834f2@mailbox.org> <13093fec-766d-ba01-26c5-cf57fd5aa2ce@mailbox.org> Message-ID: On Sat, Mar 17, 2018 at 11:39 AM, Bennet Fauber wrote: > This sounds like it would be a very good example. > agreed > > > On Sat, Mar 17, 2018 at 2:26 PM, Eric Larson > wrote: > > I agree an ECG signal is a good choice, and better than my suggestion. > It'll > > be a lower sample rate than audio, too, so you can get a longer signal > in my > > proposed 500k limit. > > > > Let's wait a few days to see if others have thoughts on the signal or its > > size limit, and if there are no complaints, go for it! > If you can get away with a smaller size that would be useful, but I'm okay with up to 500 kb. Ralf > > > > Eric > > > > > > On Sat, Mar 17, 2018 at 9:33 AM, Lars G. wrote: > >> > >> On 16.03.2018 20:11, Eric Larson wrote:> How about an entry in the scipy > >> cookbook "Signal Processing" or "Other > >> > examples" sections? > >> > > >> > http://scipy-cookbook.readthedocs.io/ > >> > > >> > In the SciPy tutorials area you could do some basic things if you > want, > >> > even if they overlap with what is in the docstrings a bit. Then you > >> > could put a link there to the cookbook for a more in-depth example. > >> > >> Sounds reasonable. I'll create a notebook covering a peak finding > >> example and see how it goes. 
Maybe something based on beat detection in > >> an ECG signal. > >> > We have `face` and `ascent` in `misc` already for 2D signals. I think > it > >> > would be nice to have something like this for 1D signals, assuming we > >> > can find one that won't take up a ton of space (< 500kB maybe, since > >> > that's what ascent takes, and `face` takes up 1.5MB?). We don't want > to > >> > bloat the repo, but a reasonable sound choice could be used in many > >> > examples. > >> > > >> > The most popular "freesound.org " clip is a > >> > creative-commons licensed thunderstorm recording > >> > , maybe this > could > >> > be made mono, truncated, and/or resampled to be sufficiently small? > >> > Other ideas welcome. > >> > >> Another source might be the Physionet database has an extensive > >> collection of medical signals and is licensed with the "OPC Public > >> Domain Dedication and License" which should be compatible with SciPy's > >> license. > >> > >> http://www.physionet.org/physiobank/database/ > >> https://opendatacommons.org/licenses/pddl/1.0/ > >> > >> A biosignal like an ECG could be used to demonstrate > >> - frequency based filtering (artifact elimination), > >> - spectral analysis and > >> - beat detection & classification (peak finding & measurement). > >> E.g. a simple task could be to extract the heart rate or systolic / > >> diastolic blood pressure values depending on the signal. This could > >> provide nice and intuitive "real world" examples. > >> > >> If you like this idea I'd be happy to create a PR for this. 
> >>
> >> Best regards,
> >> Lars
> >> _______________________________________________
> >> SciPy-Dev mailing list
> >> SciPy-Dev at python.org
> >> https://mail.python.org/mailman/listinfo/scipy-dev
> >
> > _______________________________________________
> > SciPy-Dev mailing list
> > SciPy-Dev at python.org
> > https://mail.python.org/mailman/listinfo/scipy-dev
>
> _______________________________________________
> SciPy-Dev mailing list
> SciPy-Dev at python.org
> https://mail.python.org/mailman/listinfo/scipy-dev
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From ralf.gommers at gmail.com Sat Mar 17 17:13:07 2018
From: ralf.gommers at gmail.com (Ralf Gommers)
Date: Sat, 17 Mar 2018 14:13:07 -0700
Subject: [SciPy-Dev] releasing SciPy 1.0.1
Message-ID: 

Hi all,

There are currently 11 PRs tagged for 1.0.1, and especially the recent f2py issue that causes 1.0.0 to not build with NumPy master [1] needs a bugfix release very soon. There are currently no open issues tagged with 1.0.1 - if you know of anything critical, please speak up.

I propose to prepare things this weekend, and make the release somewhere in the coming week.

Cheers,
Ralf

[1] https://github.com/scipy/scipy/pull/8530
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From dieter at werthmuller.org Sat Mar 17 17:39:25 2018
From: dieter at werthmuller.org (Dieter Werthmüller)
Date: Sat, 17 Mar 2018 15:39:25 -0600
Subject: [SciPy-Dev] previous/next for interp1d
In-Reply-To: 
References: <48693513-3b3f-d6b7-9116-85b0eaf834f2@mailbox.org> <13093fec-766d-ba01-26c5-cf57fd5aa2ce@mailbox.org>
Message-ID: <563953c2-bac4-b32a-2b98-529585a3e90e@werthmuller.org>

Dear Devs,

I wanted to draw your attention to https://github.com/scipy/scipy/pull/8572, which adds a `previous`/`next` feature to `interp1d`.
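For the curious, a quick sketch of the proposed behavior, assuming the interface from the PR (`kind='previous'`/`'next'`, which later shipped in SciPy's `interp1d`): the interpolant holds either the last observed value or the next upcoming one, giving step-function interpolation.

```python
import numpy as np
from scipy.interpolate import interp1d

x = np.array([0.0, 1.0, 2.0, 3.0])
y = np.array([0.0, 10.0, 20.0, 30.0])

# 'previous' holds the most recent sample; 'next' jumps ahead to the
# upcoming one -- hold-left vs. hold-right step interpolation.
f_prev = interp1d(x, y, kind='previous')
f_next = interp1d(x, y, kind='next')

f_prev(1.5)   # 10.0 (value at x=1.0)
f_next(1.5)   # 20.0 (value at x=2.0)
```

This matches the `nearest` interpolation machinery the PR reuses, except that the breakpoint sits at the sample itself rather than halfway between samples.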
Only by making the pull request did I realize that the same feature had already received a pull request some 1.5 years ago, https://github.com/scipy/scipy/pull/6718. However, it did not proceed any further.

The two pull requests are slightly different, but it is beyond me to say which one is better or more adequate. However, I hope that you will include one or the other, as there are now already two pull requests for the same feature.

I included a few basic tests, copied and adjusted from the `nearest` interpolation. Let me know if more are required.

Regards,
Dieter

From lagru at mailbox.org Sun Mar 18 07:23:11 2018
From: lagru at mailbox.org (Lars G.)
Date: Sun, 18 Mar 2018 12:23:11 +0100
Subject: [SciPy-Dev] Documentation / tutorial for peak finding in scipy.signal
In-Reply-To: 
References: <48693513-3b3f-d6b7-9116-85b0eaf834f2@mailbox.org> <13093fec-766d-ba01-26c5-cf57fd5aa2ce@mailbox.org>
Message-ID: <14d2db01-5bf7-88f7-ab2c-4bcc29072c41@mailbox.org>

On 17.03.2018 20:59, Ralf Gommers wrote:
> If you can get away with a smaller size that would be useful, but I'm
> okay with up to 500 kb.

That should be possible. I just played around with ECG signals from the MIT Arrhythmia Database
https://www.physionet.org/physiobank/database/mitdb/

The signals are sampled at 360 Hz with an ADC resolution of 11 bits. If `numpy.savez_compressed` is used to store the array, we can get away with `np.uint16`, which translates to ~200 KiB for a 10 min window. If we select the right window we could cover areas with different signal properties (noise level, artifacts, baseline, amplitude, spectrum) which would make it useful for more varied examples.

Lars

From sievert.scott at gmail.com Sun Mar 18 16:07:07 2018
From: sievert.scott at gmail.com (Scott Sievert)
Date: Sun, 18 Mar 2018 16:07:07 -0400
Subject: [SciPy-Dev] SciPy 1.0 paper writing proposal
In-Reply-To: 
References: <20180126074429.3csbkpsg7hbme22g@fastmail.com> <20180130195725.GC20306@lakota>
Message-ID: 

What updates are there on this?
Ralf said mid-April was a submission target, but I haven't seen anything yet. I'm more than happy to write some on my contribution.

Scott

On February 4, 2018 at 3:43:17 AM, Ralf Gommers (ralf.gommers at gmail.com) wrote:

On Tue, Jan 30, 2018 at 8:10 PM, Ilhan Polat wrote:

> Same as Eric also if you need some TeX stuff, I have a very broad and
> useless TeX experience.
>

Thanks Eric & Ilhan. I just sent the follow-up email to the coordinating volunteers.

Cheers,
Ralf
_______________________________________________
SciPy-Dev mailing list
SciPy-Dev at python.org
https://mail.python.org/mailman/listinfo/scipy-dev
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From adibhar97 at gmail.com Mon Mar 19 16:05:12 2018
From: adibhar97 at gmail.com (Aditya Bharti)
Date: Tue, 20 Mar 2018 01:35:12 +0530
Subject: [SciPy-Dev] GSOC 2018 [Proposal Feedback]
Message-ID: 

Hi all,

Goal of this email: To ask for feedback on the draft proposal for GSOC 2018

I am an undergraduate researcher in Computer Vision interested in contributing to scipy's Rotation Formalism in 3 Dimensions project idea as part of GSOC 2018.

I've made some contributions to scipy and have drafted a proposal here [1]. It includes my contact information, code samples, project timeline, and an introduction about myself. I went through the proposal guidelines on the website and think I covered all my bases.

I would greatly appreciate your feedback on this as it will help me in making a strong application.

Thank you,
Aditya Bharti

[1]: https://docs.google.com/document/d/1UriyLABwgjUcYfBofSr4hqAnLzblJ71pkzm7i26O8ws/edit?usp=sharing
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From smkjain8 at gmail.com Tue Mar 20 10:11:27 2018
From: smkjain8 at gmail.com (Samyak Jain)
Date: Tue, 20 Mar 2018 19:41:27 +0530
Subject: [SciPy-Dev] Code Sample In proposal
Message-ID: 

This is regarding the code sample one must submit in the proposal.
What all needs to be there in the code sample? I am working on the Rotation Formalism project.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From ralf.gommers at gmail.com Tue Mar 20 11:24:21 2018
From: ralf.gommers at gmail.com (Ralf Gommers)
Date: Tue, 20 Mar 2018 08:24:21 -0700
Subject: [SciPy-Dev] SciPy 1.0 paper writing proposal
In-Reply-To: 
References: <20180126074429.3csbkpsg7hbme22g@fastmail.com> <20180130195725.GC20306@lakota>
Message-ID: 

On Sun, Mar 18, 2018 at 1:07 PM, Scott Sievert wrote:

> What updates are there on this? Ralf said mid-April was a submission
> target, but I haven't seen anything yet. I'm more than happy to write some
> on my contribution.
>

We've decided on the document outline (https://github.com/scipy/scipy-articles/pull/14) and the first section drafts have started to appear (thanks Tyler and Matt!):

https://github.com/scipy/scipy-articles/issues/13
https://github.com/scipy/scipy-articles/pull/15

However, you're right that we're a bit slow. Finishing a first draft by mid-April could still be feasible if people start writing asap, but submission by then we won't make.

https://github.com/scipy/scipy-articles/issues/9 contains the sections and people who volunteered to author them. There are still some sections that need a volunteer; especially the technical sections could be written by people other than the main developer/maintainer - if you're confident you could (co-)write a section, feel free to comment there and jump in.

Cheers,
Ralf

> Scott
> On February 4, 2018 at 3:43:17 AM, Ralf Gommers (ralf.gommers at gmail.com) wrote:
>
> On Tue, Jan 30, 2018 at 8:10 PM, Ilhan Polat wrote:
>> Same as Eric also if you need some TeX stuff, I have a very broad and
>> useless TeX experience.
>>
> Thanks Eric & Ilhan. I just sent the follow-up email to the coordinating
> volunteers.
> > Cheers, > Ralf > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Tue Mar 20 11:46:47 2018 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Tue, 20 Mar 2018 08:46:47 -0700 Subject: [SciPy-Dev] Code Sample In proposal In-Reply-To: References: Message-ID: On Tue, Mar 20, 2018 at 7:11 AM, Samyak Jain wrote: > This is regarding the code sample one must submit in the proposal. What > all needs to be there in the code sample? I am working on the Rotation > Formalism project. > It means to put in your proposal a link to the pull requests you have made. At a minimum you need one contribution, and preferably it needs to be more significant than a simple one-liner fix. Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Tue Mar 20 11:56:11 2018 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Tue, 20 Mar 2018 08:56:11 -0700 Subject: [SciPy-Dev] GSOC 2018 [Proposal Feedback] In-Reply-To: References: Message-ID: On Mon, Mar 19, 2018 at 1:05 PM, Aditya Bharti wrote: > Hi all, > > Goal of this email: To ask for feedback on the draft proposal for GSOC 2018 > > I am an undergraduate researcher in Computer Vision interested in > contributing to scipy's Rotation Formalism in 3 Dimensions project idea as > part of GSOC 2018. > > I've made some contributions to scipy and have drafted a proposal here > . > [1] > > It includes my contact information, code samples, project timeline, and an > introduction about myself. I went through the proposal guidelines on the > website and think I covered all my bases. > > I would greatly appreciate your feedback on this as it will help me in > making a strong application. > Hi Aditya, that looks like a solid start. 
I added a few comments; will leave it to the proposed mentors to comment on some of the algorithmic ideas. One other thing to consider is existing implementations to possibly build upon - a couple of people suggested other packages on this list not too long ago. Cheers, Ralf > > Thank you, > Aditya Bharti > > [1]: https://docs.google.com/document/d/1UriyLABwgjUcYfBofSr4hqAnLzblJ > 71pkzm7i26O8ws/edit?usp=sharing > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From larson.eric.d at gmail.com Tue Mar 20 15:31:53 2018 From: larson.eric.d at gmail.com (Eric Larson) Date: Tue, 20 Mar 2018 15:31:53 -0400 Subject: [SciPy-Dev] GSOC 2018 [Proposal Feedback] In-Reply-To: References: Message-ID: I second Ralf's comments, and have added a few, too. Eric On Tue, Mar 20, 2018 at 11:56 AM, Ralf Gommers wrote: > > > On Mon, Mar 19, 2018 at 1:05 PM, Aditya Bharti > wrote: > >> Hi all, >> >> Goal of this email: To ask for feedback on the draft proposal for GSOC >> 2018 >> >> I am an undergraduate researcher in Computer Vision interested in >> contributing to scipy's Rotation Formalism in 3 Dimensions project idea as >> part of GSOC 2018. >> >> I've made some contributions to scipy and have drafted a proposal here >> . >> [1] >> >> It includes my contact information, code samples, project timeline, and >> an introduction about myself. I went through the proposal guidelines on the >> website and think I covered all my bases. >> >> I would greatly appreciate your feedback on this as it will help me in >> making a strong application. >> > > Hi Aditya, that looks like a solid start. I added a few comments; will > leave it to the proposed mentors to comment on some of the algorithmic > ideas. 
One other thing to consider is existing implementations to possibly > build upon - a couple of people suggested other packages on this list not > too long ago. > > Cheers, > Ralf > > > > >> >> Thank you, >> Aditya Bharti >> >> [1]: https://docs.google.com/document/d/1UriyLABwgjUcYfBofSr4hqAn >> LzblJ71pkzm7i26O8ws/edit?usp=sharing >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at python.org >> https://mail.python.org/mailman/listinfo/scipy-dev >> >> > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From warren.weckesser at gmail.com Tue Mar 20 15:50:56 2018 From: warren.weckesser at gmail.com (Warren Weckesser) Date: Tue, 20 Mar 2018 15:50:56 -0400 Subject: [SciPy-Dev] SciPy 1.0 paper writing proposal In-Reply-To: References: <20180126074429.3csbkpsg7hbme22g@fastmail.com> <20180130195725.GC20306@lakota> Message-ID: On Tue, Mar 20, 2018 at 11:24 AM, Ralf Gommers wrote: > > > On Sun, Mar 18, 2018 at 1:07 PM, Scott Sievert > wrote: > >> What updates are there on this? Ralf said mid-April was a submission >> target, but I haven't seen anything yet. I'm more than happy to write some >> on my contribution. >> > > We've decided on the document outline (https://github.com/scipy/ > scipy-articles/pull/14) and the first section drafts have started to > appear (thanks Tyler and Matt!): > https://github.com/scipy/scipy-articles/issues/13 > https://github.com/scipy/scipy-articles/pull/15 > > However you're right that we're a bit slow; finishing a first draft by > mid-April could still be feasible if people start writing asap, but we won't > make submission by then. > > https://github.com/scipy/scipy-articles/issues/9 contains the sections > and people who volunteered to author them.
There's still some sections that > need a volunteer; especially the technical sections could be written by > people other than the main developer/maintainer - if you're confident you > could (co-)write a section, feel free to comment there and jump in. > > Cheers, > Ralf > > I'll be able to help with writing on `stats` and `signal` (and possibly other areas, if needed). Warren Scott > > On February 4, 2018 at 3:43:17 AM, Ralf Gommers (ralf.gommers at gmail.com) > wrote: > > > > On Tue, Jan 30, 2018 at 8:10 PM, Ilhan Polat wrote: > >> Same as Eric also if you need some TeX stuff, I have a very broad and >> useless TeX experience. >> > > Thanks Eric & Ilhan. I just sent the follow-up email to the coordinating > volunteers. > > Cheers, > Ralf > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From vishstar88 at gmail.com Wed Mar 21 02:46:53 2018 From: vishstar88 at gmail.com (Vishal Gupta) Date: Wed, 21 Mar 2018 12:16:53 +0530 Subject: [SciPy-Dev] GSoC Rotation Formalism Proposal Message-ID: Hey, I am a 3rd-year CSE undergrad at SSNCE, Chennai, India. I'd like to work on Rotation Formalism in 3 Dimensions as a part of this year's GSoC. https://docs.google.com/document/d/1y5OalGAvYkk8UvLtFjQ2-xhRj_NTwq0fV3GvlCBUp0I/edit?usp=sharing Thanks, Vishal Gupta Sent with Mailtrack -------------- next part -------------- An HTML attachment was scrubbed... URL: From balazovic.peter at gmail.com Wed Mar 21 03:04:24 2018 From: balazovic.peter at gmail.com (Peter Balazovic) Date: Wed, 21 Mar 2018 08:04:24 +0100 Subject: [SciPy-Dev] SciPy dependencies Message-ID: Dear all, I am trying to get SciPy installed under Yocto.
I encounter a compile error: ERROR: scipy-1.0.0-r0 do_compile: Function failed: do_compile (log file is located at ../tmp/work/cortexa9hf-neon-poky-linux-gnueabi/scipy/1.0.0-r0/temp/log.do_compile.5023) Log data follows:| DEBUG: Executing shell function do_compile | ERROR: python setup.py build execution failed.| ../tmp/sysroots/x86_64-linux/usr/lib/python2.7/distutils/dist.py:267: UserWarning: Unknown distribution option: 'python_requires'| warnings.warn(msg)| lapack_opt_info:| openblas_lapack_info:| libraries openblas not found in ['../tmp/sysroots/x86_64-linux/usr/lib']| NOT AVAILABLE The SciPy setup.py looks for openBLAS in a different directory, *x86_64-linux*, instead of the *target-arm* directory. I have already installed openBLAS and it is located in the *target-arm* sysroot. ./tmp/sysroots/target-arm/usr/lib/libopenblas.so ./tmp/sysroots/target-arm/usr/lib/libopenblas.a ./tmp/sysroots/target-arm/usr/include/openblas_config.h When building Yocto there are three "build" directories: *target-arm* (target sysroot), *target-arm-tcbootstrap* (intermediate directory for the compiler bootstrap), and *x86_64-linux* (host tools and libraries). The openBLAS is installed within the *target-arm* sysroot. My question is: how do I tell the SciPy installer to look for openBLAS (and other dependencies) in the *target-arm* directory ( *./tmp/sysroots/target-arm/*)? Thank you. Sent from Mailspring, the best free email app for work -------------- next part -------------- An HTML attachment was scrubbed... URL: From adibhar97 at gmail.com Wed Mar 21 04:40:56 2018 From: adibhar97 at gmail.com (Aditya Bharti) Date: Wed, 21 Mar 2018 14:10:56 +0530 Subject: [SciPy-Dev] GSOC 2018 [Proposal Feedback] In-Reply-To: References: Message-ID: Hi all, Thank you for your helpful comments.
I've updated my proposal with the following major changes: - Fleshed out the API in more detail - Included a github link to a sample implementation of the initializer functions - Proposed a quaternion class for faster compositions of rotations (use of this class will be completely optional and not required for Rotation() functionality) Thank you for all your help so far. I look forward to hearing from you. Regards, Aditya https://docs.google.com/document/d/1UriyLABwgjUcYfBofSr4hqAnLzblJ71pkzm7i26O8ws/edit?usp=sharing On 21 March 2018 at 01:01, Eric Larson wrote: > I second Ralf's comments, and have added a few, too. > > Eric > > > On Tue, Mar 20, 2018 at 11:56 AM, Ralf Gommers > wrote: > >> >> >> On Mon, Mar 19, 2018 at 1:05 PM, Aditya Bharti >> wrote: >> >>> Hi all, >>> >>> Goal of this email: To ask for feedback on the draft proposal for GSOC >>> 2018 >>> >>> I am an undergraduate researcher in Computer Vision interested in >>> contributing to scipy's Rotation Formalism in 3 Dimensions project idea as >>> part of GSOC 2018. >>> >>> I've made some contributions to scipy and have drafted a proposal here >>> . >>> [1] >>> >>> It includes my contact information, code samples, project timeline, and >>> an introduction about myself. I went through the proposal guidelines on the >>> website and think I covered all my bases. >>> >>> I would greatly appreciate your feedback on this as it will help me in >>> making a strong application. >>> >> >> Hi Aditya, that looks like a solid start. I added a few comments; will >> leave it to the proposed mentors to comment on some of the algorithmic >> ideas. One other thing to consider is existing implementations to possibly >> build upon - a couple of people suggested other packages on this list not >> too long ago. 
>> >> Cheers, >> Ralf >> >> >> >> >>> >>> Thank you, >>> Aditya Bharti >>> >>> [1]: https://docs.google.com/document/d/1UriyLABwgjUcYfBofSr4hqAn >>> LzblJ71pkzm7i26O8ws/edit?usp=sharing >>> >>> _______________________________________________ >>> SciPy-Dev mailing list >>> SciPy-Dev at python.org >>> https://mail.python.org/mailman/listinfo/scipy-dev >>> >>> >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at python.org >> https://mail.python.org/mailman/listinfo/scipy-dev >> >> > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From einstein.edison at gmail.com Wed Mar 21 06:22:08 2018 From: einstein.edison at gmail.com (Hameer Abbasi) Date: Wed, 21 Mar 2018 11:22:08 +0100 Subject: [SciPy-Dev] GSOC 2018 [Proposal Feedback] In-Reply-To: References: Message-ID: Hi Aditya, I've left a few comments regarding composing and np.matrix. Using __mul__ for anything other than real multiplication is ill-advised and caused problems before with np.matrix. I suggest you use __matmul__ instead. Hameer On Wed, Mar 21, 2018 at 9:40 AM, Aditya Bharti wrote: > Hi all, > Thank you for your helpful comments. I've updated my proposal with the > following major changes: > > - Fleshed out the API in more detail > - Included a github link to a sample implementation of the initializer > functions > - Proposed a quaternion class for faster compositions of rotations > (use of this class will be completely optional and not required for > Rotation() functionality) > > Thank you for all your help so far. I look forward to hearing from you. 
> > Regards, > Aditya > > https://docs.google.com/document/d/1UriyLABwgjUcYfBofSr4hqAnLzblJ > 71pkzm7i26O8ws/edit?usp=sharing > > On 21 March 2018 at 01:01, Eric Larson wrote: > >> I second Ralf's comments, and have added a few, too. >> >> Eric >> >> >> On Tue, Mar 20, 2018 at 11:56 AM, Ralf Gommers >> wrote: >> >>> >>> >>> On Mon, Mar 19, 2018 at 1:05 PM, Aditya Bharti >>> wrote: >>> >>>> Hi all, >>>> >>>> Goal of this email: To ask for feedback on the draft proposal for GSOC >>>> 2018 >>>> >>>> I am an undergraduate researcher in Computer Vision interested in >>>> contributing to scipy's Rotation Formalism in 3 Dimensions project idea as >>>> part of GSOC 2018. >>>> >>>> I've made some contributions to scipy and have drafted a proposal here >>>> . >>>> [1] >>>> >>>> It includes my contact information, code samples, project timeline, and >>>> an introduction about myself. I went through the proposal guidelines on the >>>> website and think I covered all my bases. >>>> >>>> I would greatly appreciate your feedback on this as it will help me in >>>> making a strong application. >>>> >>> >>> Hi Aditya, that looks like a solid start. I added a few comments; will >>> leave it to the proposed mentors to comment on some of the algorithmic >>> ideas. One other thing to consider is existing implementations to possibly >>> build upon - a couple of people suggested other packages on this list not >>> too long ago. 
>>> >>> Cheers, >>> Ralf >>> >>> >>> >>> >>>> >>>> Thank you, >>>> Aditya Bharti >>>> >>>> [1]: https://docs.google.com/document/d/1UriyLABwgjUcYfBofSr4hqAn >>>> LzblJ71pkzm7i26O8ws/edit?usp=sharing >>>> >>>> _______________________________________________ >>>> SciPy-Dev mailing list >>>> SciPy-Dev at python.org >>>> https://mail.python.org/mailman/listinfo/scipy-dev >>>> >>>> >>> >>> _______________________________________________ >>> SciPy-Dev mailing list >>> SciPy-Dev at python.org >>> https://mail.python.org/mailman/listinfo/scipy-dev >>> >>> >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at python.org >> https://mail.python.org/mailman/listinfo/scipy-dev >> >> > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From adibhar97 at gmail.com Wed Mar 21 06:35:40 2018 From: adibhar97 at gmail.com (Aditya Bharti) Date: Wed, 21 Mar 2018 16:05:40 +0530 Subject: [SciPy-Dev] GSOC 2018 [Proposal Feedback] In-Reply-To: References: Message-ID: Hi Hameer, Thank you for your comments. They were quite informative. Just a couple of questions. The __matmul__ function translates to the @ operator correct? Would that not be somewhat awkward for new users? Also, I couldn't find the suggested __rmatmul__ anywhere on the operator list: https://docs.python.org/3/library/operator.html. Am I looking in the wrong place? In addition, are there any other specific things I need to stay away from to avoid np.matrix like problems? I just know that the matrix class is not used much, I don't really know the reasons for its deprecation. Thanks, Aditya On 21 March 2018 at 15:52, Hameer Abbasi wrote: > Hi Aditya, > > I've left a few comments regarding composing and np.matrix. 
Using __mul__ > for anything other than real multiplication is ill-advised and caused > problems before with np.matrix. I suggest you use __matmul__ instead. > > Hameer > > > On Wed, Mar 21, 2018 at 9:40 AM, Aditya Bharti > wrote: > >> Hi all, >> Thank you for your helpful comments. I've updated my proposal with the >> following major changes: >> >> - Fleshed out the API in more detail >> - Included a github link to a sample implementation of the >> initializer functions >> - Proposed a quaternion class for faster compositions of rotations >> (use of this class will be completely optional and not required for >> Rotation() functionality) >> >> Thank you for all your help so far. I look forward to hearing from you. >> >> Regards, >> Aditya >> >> https://docs.google.com/document/d/1UriyLABwgjUcYfBofSr4hqAn >> LzblJ71pkzm7i26O8ws/edit?usp=sharing >> >> On 21 March 2018 at 01:01, Eric Larson wrote: >> >>> I second Ralf's comments, and have added a few, too. >>> >>> Eric >>> >>> >>> On Tue, Mar 20, 2018 at 11:56 AM, Ralf Gommers >>> wrote: >>> >>>> >>>> >>>> On Mon, Mar 19, 2018 at 1:05 PM, Aditya Bharti >>>> wrote: >>>> >>>>> Hi all, >>>>> >>>>> Goal of this email: To ask for feedback on the draft proposal for GSOC >>>>> 2018 >>>>> >>>>> I am an undergraduate researcher in Computer Vision interested in >>>>> contributing to scipy's Rotation Formalism in 3 Dimensions project idea as >>>>> part of GSOC 2018. >>>>> >>>>> I've made some contributions to scipy and have drafted a proposal here >>>>> . >>>>> [1] >>>>> >>>>> It includes my contact information, code samples, project timeline, >>>>> and an introduction about myself. I went through the proposal guidelines on >>>>> the website and think I covered all my bases. >>>>> >>>>> I would greatly appreciate your feedback on this as it will help me in >>>>> making a strong application. >>>>> >>>> >>>> Hi Aditya, that looks like a solid start. 
I added a few comments; will >>>> leave it to the proposed mentors to comment on some of the algorithmic >>>> ideas. One other thing to consider is existing implementations to possibly >>>> build upon - a couple of people suggested other packages on this list not >>>> too long ago. >>>> >>>> Cheers, >>>> Ralf >>>> >>>> >>>> >>>> >>>>> >>>>> Thank you, >>>>> Aditya Bharti >>>>> >>>>> [1]: https://docs.google.com/document/d/1UriyLABwgjUcYfBofSr4hqAn >>>>> LzblJ71pkzm7i26O8ws/edit?usp=sharing >>>>> >>>>> _______________________________________________ >>>>> SciPy-Dev mailing list >>>>> SciPy-Dev at python.org >>>>> https://mail.python.org/mailman/listinfo/scipy-dev >>>>> >>>>> >>>> >>>> _______________________________________________ >>>> SciPy-Dev mailing list >>>> SciPy-Dev at python.org >>>> https://mail.python.org/mailman/listinfo/scipy-dev >>>> >>>> >>> >>> _______________________________________________ >>> SciPy-Dev mailing list >>> SciPy-Dev at python.org >>> https://mail.python.org/mailman/listinfo/scipy-dev >>> >>> >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at python.org >> https://mail.python.org/mailman/listinfo/scipy-dev >> >> > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From einstein.edison at gmail.com Wed Mar 21 06:48:51 2018 From: einstein.edison at gmail.com (Hameer Abbasi) Date: Wed, 21 Mar 2018 11:48:51 +0100 Subject: [SciPy-Dev] GSOC 2018 [Proposal Feedback] In-Reply-To: References: Message-ID: Overloaded operators can be found here: https://docs.python.org/3/reference/datamodel.html#emulating-numeric-types It was deprecated because things taking it as input expected * to work like in np.ndarray and there were other weird incompatibilities when using it in a lot of other functions (e.g. 
always 2-D) Yes, __matmul__ translates to the @ operator. New users can find it complicated but most, if not all, Numpy/Scipy users know by now to use @ or np.dot (unfortunately, there's no way to override np.dot yet AFAIK). __rmatmul__ is for when you do (for example) np.ndarray @ yourclass. In this case, np.ndarray.__matmul__ returns NotImplemented, and then the operator falls back to your __rmatmul__ method. On Wed, Mar 21, 2018 at 11:35 AM, Aditya Bharti wrote: > Hi Hameer, > Thank you for your comments. They were quite informative. Just a couple of > questions. > > The __matmul__ function translates to the @ operator correct? Would that > not be somewhat awkward for new users? > Also, I couldn't find the suggested __rmatmul__ anywhere on the operator > list: https://docs.python.org/3/library/operator.html. Am I looking in > the wrong place? > > In addition, are there any other specific things I need to stay away from > to avoid np.matrix like problems? I just know that the matrix class is not > used much, I don't really know the reasons for its deprecation. > > Thanks, > Aditya > > On 21 March 2018 at 15:52, Hameer Abbasi > wrote: > >> Hi Aditya, >> >> I've left a few comments regarding composing and np.matrix. Using __mul__ >> for anything other than real multiplication is ill-advised and caused >> problems before with np.matrix. I suggest you use __matmul__ instead. >> >> Hameer >> >> >> On Wed, Mar 21, 2018 at 9:40 AM, Aditya Bharti >> wrote: >> >>> Hi all, >>> Thank you for your helpful comments. I've updated my proposal with the >>> following major changes: >>> >>> - Fleshed out the API in more detail >>> - Included a github link to a sample implementation of the >>> initializer functions >>> - Proposed a quaternion class for faster compositions of rotations >>> (use of this class will be completely optional and not required for >>> Rotation() functionality) >>> >>> Thank you for all your help so far. I look forward to hearing from you. 
>>> >>> Regards, >>> Aditya >>> >>> https://docs.google.com/document/d/1UriyLABwgjUcYfBofSr4hqAn >>> LzblJ71pkzm7i26O8ws/edit?usp=sharing >>> >>> On 21 March 2018 at 01:01, Eric Larson wrote: >>> >>>> I second Ralf's comments, and have added a few, too. >>>> >>>> Eric >>>> >>>> >>>> On Tue, Mar 20, 2018 at 11:56 AM, Ralf Gommers >>>> wrote: >>>> >>>>> >>>>> >>>>> On Mon, Mar 19, 2018 at 1:05 PM, Aditya Bharti >>>>> wrote: >>>>> >>>>>> Hi all, >>>>>> >>>>>> Goal of this email: To ask for feedback on the draft proposal for >>>>>> GSOC 2018 >>>>>> >>>>>> I am an undergraduate researcher in Computer Vision interested in >>>>>> contributing to scipy's Rotation Formalism in 3 Dimensions project idea as >>>>>> part of GSOC 2018. >>>>>> >>>>>> I've made some contributions to scipy and have drafted a proposal >>>>>> here >>>>>> . >>>>>> [1] >>>>>> >>>>>> It includes my contact information, code samples, project timeline, >>>>>> and an introduction about myself. I went through the proposal guidelines on >>>>>> the website and think I covered all my bases. >>>>>> >>>>>> I would greatly appreciate your feedback on this as it will help me >>>>>> in making a strong application. >>>>>> >>>>> >>>>> Hi Aditya, that looks like a solid start. I added a few comments; will >>>>> leave it to the proposed mentors to comment on some of the algorithmic >>>>> ideas. One other thing to consider is existing implementations to possibly >>>>> build upon - a couple of people suggested other packages on this list not >>>>> too long ago. 
>>>>> >>>>> Cheers, >>>>> Ralf >>>>> >>>>> >>>>> >>>>> >>>>>> >>>>>> Thank you, >>>>>> Aditya Bharti >>>>>> >>>>>> [1]: https://docs.google.com/document/d/1UriyLABwgjUcYfBofSr4hqAn >>>>>> LzblJ71pkzm7i26O8ws/edit?usp=sharing >>>>>> >>>>>> _______________________________________________ >>>>>> SciPy-Dev mailing list >>>>>> SciPy-Dev at python.org >>>>>> https://mail.python.org/mailman/listinfo/scipy-dev >>>>>> >>>>>> >>>>> >>>>> _______________________________________________ >>>>> SciPy-Dev mailing list >>>>> SciPy-Dev at python.org >>>>> https://mail.python.org/mailman/listinfo/scipy-dev >>>>> >>>>> >>>> >>>> _______________________________________________ >>>> SciPy-Dev mailing list >>>> SciPy-Dev at python.org >>>> https://mail.python.org/mailman/listinfo/scipy-dev >>>> >>>> >>> >>> _______________________________________________ >>> SciPy-Dev mailing list >>> SciPy-Dev at python.org >>> https://mail.python.org/mailman/listinfo/scipy-dev >>> >>> >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at python.org >> https://mail.python.org/mailman/listinfo/scipy-dev >> >> > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From einstein.edison at gmail.com Wed Mar 21 07:58:42 2018 From: einstein.edison at gmail.com (Hameer Abbasi) Date: Wed, 21 Mar 2018 12:58:42 +0100 Subject: [SciPy-Dev] Job prospects and a paying SciPy Dev in Germany? Message-ID: Hello everyone, Sorry to put a personal question into this list (if you feel bothered, please feel free to ignore it or reprimand me), but I couldn't find a better place to ask. I was looking to do development for Numpy/Scipy full-time in Germany, preferably as a PhD candidate but a position at a company works as well. Is anyone aware of any such opportunities in Germany (or the EU at large)? 
If so, let me know. Hameer Abbasi -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Wed Mar 21 11:32:42 2018 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Wed, 21 Mar 2018 08:32:42 -0700 Subject: [SciPy-Dev] SciPy dependencies In-Reply-To: References: Message-ID: On Wed, Mar 21, 2018 at 12:04 AM, Peter Balazovic wrote: > Dears, > > I am trying to get installed SciPy under Yocto. I encounter compile error: > > ERROR: scipy-1.0.0-r0 do_compile: Function failed: do_compile (log file is located at ../tmp/work/cortexa9hf-neon-poky-linux-gnueabi/scipy/1.0.0-r0/temp/log.do_compile.5023) > Log data follows:| DEBUG: Executing shell function do_compile > | ERROR: python setup.py build execution failed.| ../tmp/sysroots/x86_64-linux/usr/lib/python2.7/distutils/dist.py:267: UserWarning: Unknown distribution option: 'python_requires'| warnings.warn(msg)| lapack_opt_info:| openblas_lapack_info:| libraries openblas not found in ['../tmp/sysroots/x86_64-linux/usr/lib']| NOT AVAILABLE > > The SciPy install setup.py looks for openBLAS in different directory > *x86_64-linux* instead of *target-arm* directory. > I have already installed openBLAS and it is located at *target-arm* > sysroot. > > ./tmp/sysroots/target-arm/usr/lib/libopenblas.so > ./tmp/sysroots/target-arm/usr/lib/libopenblas.a > ./tmp/sysroots/target-arm/usr/include/openblas_config.h > > With building Yocto there are three "build" directories *target-arm* (target > sysroot), *target-arm-tcbootstrap* (intermediate directory for the > compiler bootstrap), and *x86_64-linux* (host tools and libraries). The > openBLAS is installed within *target-arm* sysroot. > > My question is how to tell and redirect SciPy installer to look for > openBLAS (and other dependencies) at *target-arm* directory ( > *./tmp/sysroots/target-arm/*)? > Look at the example in site.cfg.example in the root dir of the repo, there you can specify paths. 
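For example, an openblas section along the lines of the sketch below (untested; the paths are copied from your sysroot layout above, and you will most likely need them as absolute rather than relative paths — copy site.cfg.example to site.cfg next to setup.py and edit it there):

```ini
[openblas]
libraries = openblas
library_dirs = ./tmp/sysroots/target-arm/usr/lib
include_dirs = ./tmp/sysroots/target-arm/usr/include
runtime_library_dirs = ./tmp/sysroots/target-arm/usr/lib
```

Whether cross-compiling like this works end to end under Yocto is another question, but this is how you point the build at a non-default BLAS location.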
Cheers, Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From smkjain8 at gmail.com Wed Mar 21 14:30:18 2018 From: smkjain8 at gmail.com (Samyak Jain) Date: Thu, 22 Mar 2018 00:00:18 +0530 Subject: [SciPy-Dev] Feedback for the proposal Message-ID: Hi all, I am a 2nd-year student contributing towards SciPy's Rotation Formalism in 3 dimensions project as a part of GSOC'18. I have drafted a proposal here. I would appreciate any feedback on this so that I can improve my proposal. Thanks, Samyak Jain Link: https://docs.google.com/document/d/12cP0LTFa0H3aQqJfb1WAmr5A_KU6bN3qZ17LbKd5YzM/edit?usp=sharing -------------- next part -------------- An HTML attachment was scrubbed... URL: From mikofski at berkeley.edu Wed Mar 21 15:31:07 2018 From: mikofski at berkeley.edu (Mark Alexander Mikofski) Date: Wed, 21 Mar 2018 12:31:07 -0700 Subject: [SciPy-Dev] Benchmark for scalar Newton In-Reply-To: References: Message-ID: Hi all, I've added to the airspeed velocity benchmarks. There were only benchmarks for the pure C methods, so I added one for the scalar Newton method in scipy.optimize.zeros. If you want to compare it with the other zeros, use "f1", a parabola, which is equivalent to "f2" for the other zeros. If it's convenient for you, please consider reviewing scipy PR #8587. I look forward to your comments. Best regards, Mark Mikofski -------------- next part -------------- An HTML attachment was scrubbed...
URL: From anubhavp28 at gmail.com Thu Mar 22 10:43:13 2018 From: anubhavp28 at gmail.com (Anubhav Patel) Date: Thu, 22 Mar 2018 20:13:13 +0530 Subject: [SciPy-Dev] Contributing to SciPy through GSoC In-Reply-To: <161f2f6d6b6.fb3ea05a10736.6245209566181213756@zoho.com> References: <161f2f6d6b6.fb3ea05a10736.6245209566181213756@zoho.com> Message-ID: Hi, I am seeking feedback for my GSoC Proposal for Rotation Formalism in 3 Dimensions - https://docs.google.com/document/d/1ylzugkvVYI7m3IXsWVLD4EkE6zQd8B9YDPKtTno9f74/edit?usp=sharing On Mon, Mar 5, 2018 at 3:11 AM, Nikolay Mayorov wrote: > Hi, Anubhav! > > I think getting Rotation right is a top priority and so far unfortunately > nobody has dug into the technical details of it. I would like to see that from > students. > > As for the algorithms. I believe that Wahba's problem can be generalized > by adding a translation vector, but the interpretation will be different > (search for "absolute orientation problem"). As for methods to solve > Wahba's problem --- probably the SVD-based one is the easiest to understand and > implement, but if we decide to use quaternions as the base representation, > then we can go with the "Q-method". > > "Cubic spline" for orientation is also a very cool algorithm which wasn't > promoted anywhere, but this is the best idea I found on the subject (i.e. > interpolation with continuous angular rates and acceleration). > > Other algorithms are quite small, they can be added quickly. I would say > it is more of a question of what we should include. If you have some ideas > outside the (mostly) "aerospace" field --- they are welcome. > > For all parts I would like to see more concrete and technical details. For > example, if we want SLERP interpolation --- what will it be (class or > function), what it will accept, what will be the most difficult part to > implement it correctly. The same for all other things.
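To make the SVD-based route concrete, here is a rough numpy-only sketch of the standard SVD solution to Wahba's problem (illustrative only; the function name and signature are placeholders, not a proposed scipy API):

```python
import numpy as np

def wahba_svd(body_vecs, ref_vecs, weights=None):
    """Estimate the rotation R minimizing sum_i w_i * ||b_i - R r_i||^2.

    body_vecs, ref_vecs: (N, 3) arrays of paired direction vectors.
    """
    body_vecs = np.asarray(body_vecs, dtype=float)
    ref_vecs = np.asarray(ref_vecs, dtype=float)
    if weights is None:
        weights = np.ones(len(body_vecs))
    # Attitude profile matrix B = sum_i w_i * b_i r_i^T
    B = (weights[:, None] * body_vecs).T @ ref_vecs
    U, _, Vt = np.linalg.svd(B)
    # Force det(R) == +1 so the result is a proper rotation, not a reflection
    d = np.sign(np.linalg.det(U) * np.linalg.det(Vt))
    return U @ np.diag([1.0, 1.0, d]) @ Vt
```

With noise-free inputs this recovers the exact rotation; with noisy measurements it gives the weighted least-squares optimum, which is the kind of concrete detail a proposal could spell out per algorithm.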
> > Best, > Nikolay > > > ---- On Sat, 03 Mar 2018 16:26:28 +0500* anubhavp28 at gmail.com > * wrote ---- > > Hi, > I wanted feedback regarding whether a combination of a rotation class and > implementations of the quaternion SLERP algorithm and Davenport's Q-method > for solving Wahba's Problem will be enough for GSoC? Should I include more > rotation-related algorithms for implementation? Any suggestions what more I > could do? > > On Mon, Feb 26, 2018 at 9:30 PM, Ralf Gommers > wrote: > > Hi Anubhav, > > On Mon, Feb 26, 2018 at 2:12 AM, Anubhav Patel > wrote: > > Hi everyone, > I want to work on SciPy as part of GSoC and I have a few queries. > > 1. On the Ideas Page, there was a mention of a scipy.spatial.transform > module. I want to know what will be the exact purpose of this module? > > > Did you read the whole idea? There's a lot of detail. It says for example > "The aim of this project is to create a module which will allow to > conveniently describe, apply and compose rotations. ". That answers your > question, I think. > > > > 2. Whether the idea for a module for numerical differentiation was dropped > completely? > > > Yes, for now that's off the table - at least not feasible for a GSoC, we've > concluded after several attempts. > > > > 3. Apart from those ideas listed on the ideas page, are there any other areas > where you guys would like to see contributions? > > > Ideas for new features on http://scipy.github.io/devdocs/roadmap.html are > of interest, or ones you may have yourself. But given that they're not on > the ideas page, it's not guaranteed we can find mentors for those.
> Cheers, > Ralf > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From chaman.ag at gmail.com Thu Mar 22 14:51:59 2018 From: chaman.ag at gmail.com (Chaman Agrawal) Date: Fri, 23 Mar 2018 00:21:59 +0530 Subject: [SciPy-Dev] =?utf-8?q?Fast_Walsh=E2=80=93Hadamard_transform_for_?= =?utf-8?q?SciPy_=3F?= Message-ID: Issue #8590 https://github.com/scipy/scipy/issues/8590 Currently there is no implementation of the fast Walsh–Hadamard transform in SciPy. Although the FWHT is not as general as the FFT, it is pretty ubiquitous; it is available in other math and science software such as MATLAB. I would like to contribute towards it. Following are the details about it. Fast Walsh–Hadamard transform The Hadamard transform is an example of a generalized class of Fourier transforms. It performs an orthogonal, symmetric, involutive, linear operation on 2^(m) numbers. It is equivalent to a multidimensional DFT of size 2 x 2 x ... x 2. Time complexity: O(n log n) with the fast Walsh–Hadamard transform (FWHT). Comparison with FFT: The FWHT is very useful for reducing bandwidth storage requirements and for spread-spectrum analysis. Compared to the FFT, the FWHT requires less storage space and is faster to calculate because it uses only real additions and subtractions, while the FFT requires complex values. The FWHT is able to represent signals with sharp discontinuities more accurately using fewer coefficients than the FFT. 
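For concreteness, the O(n log n) butterfly structure described above fits in a few lines of Python. This is only a plain reference sketch (natural/Hadamard ordering, power-of-two length), not the compiled implementation a scipy version would need:

```python
import numpy as np

def fwht(x):
    """Fast Walsh-Hadamard transform, natural (Hadamard) ordering.

    Uses only real additions and subtractions; applying it twice
    returns the input scaled by len(x).
    """
    x = np.array(x, dtype=float)  # work on a copy
    n = x.size
    if n == 0 or n & (n - 1):
        raise ValueError("length must be a power of two")
    h = 1
    while h < n:
        for i in range(0, n, 2 * h):
            for j in range(i, i + h):
                # 2-point butterfly: (a, b) -> (a + b, a - b)
                x[j], x[j + h] = x[j] + x[j + h], x[j] - x[j + h]
        h *= 2
    return x
```

For example, fwht([1, 0, 1, 0]) gives [2, 2, 0, 0], matching the 4x4 Hadamard matrix applied to the input.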
Some Usage examples: The Hadamard transform is used in data encryption, as well as many signal processing and data compression algorithms, such as JPEG XR and MPEG-4 AVC. It is also a crucial part of Grover's algorithm and Shor's algorithm in quantum computing. The Hadamard transform is also applied in scientific methods such as NMR, mass spectroscopy and crystallography. -------------- next part -------------- An HTML attachment was scrubbed... URL: From ndbecker2 at gmail.com Thu Mar 22 14:57:05 2018 From: ndbecker2 at gmail.com (Neal Becker) Date: Thu, 22 Mar 2018 18:57:05 +0000 Subject: [SciPy-Dev] =?utf-8?q?Fast_Walsh=E2=80=93Hadamard_transform_for_?= =?utf-8?q?SciPy_=3F?= In-Reply-To: References: Message-ID: I have my own code for FHT if you are interested On Thu, Mar 22, 2018 at 2:52 PM Chaman Agrawal wrote: > Issue #8590 https://github.com/scipy/scipy/issues/8590 > > Currently there is no implementation of Fast Walsh?Hadamard transform in > SciPy. Although it seems that FWHT is not as general as FFT but it is > pretty ubiquitous ,it is there is other maths and science softwares like > MATLAB etc. .I would like to contribute towards it. Following are the > details about it. > Fast Walsh?Hadamard transform > > Hadamard transform is an example of a generalized class of Fourier > transforms. It performs an orthogonal, symmetric, involutive, linear > operation on 2^(m) numbers. It is equivalent to a multidimensional DFT. > Time Complexity: > > O(nlogn) with Fast Walsh-Hadamard transform (FWHT) > Comparison with FFT: > > FWHT is very useful for reducing bandwidth storage requirements and > spread-spectrum analysis. Compared to the FFT, the FWHT requires less > storage space and is faster to calculate because it uses only real > additions and subtractions, while the FFT requires complex values. The FWHT > is able to represent signals with sharp discontinuities more accurately > using fewer coefficients than the FFT. 
> Some Usage examples: > > The Hadamard transform is used in data encryption, as well as many signal > processing and data compression algorithms, such as JPEG XR and MPEG-4 AVC. > It is also a crucial part of Grover's algorithm and Shor's algorithm in > quantum computing. The Hadamard transform is also applied in scientific > methods such as NMR, mass spectroscopy and crystallography. > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > -------------- next part -------------- An HTML attachment was scrubbed... URL: From chaman.ag at gmail.com Thu Mar 22 15:39:13 2018 From: chaman.ag at gmail.com (Chaman Agrawal) Date: Fri, 23 Mar 2018 01:09:13 +0530 Subject: [SciPy-Dev] =?utf-8?q?Fast_Walsh=E2=80=93Hadamard_transform_for_?= =?utf-8?q?SciPy_=3F?= Message-ID: Hello, Thanks, please send the link; it would be good to have multiple references. However, the important question is where to implement this (in fftpack or in signal), as Ralf Gommers mentioned on the issue page. Cheers, Chaman I have my own code for FHT if you are interested On Thu, Mar 22, 2018 at 2:52 PM Chaman Agrawal > wrote: >* Issue #8590 https://github.com/scipy/scipy/issues/8590 *>>* Currently there is no implementation of Fast Walsh–Hadamard transform in *>* SciPy. Although it seems that FWHT is not as general as FFT but it is *>* pretty ubiquitous ,it is there is other maths and science softwares like *>* MATLAB etc. .I would like to contribute towards it. Following are the *>* details about it. *>* Fast Walsh–Hadamard transform *>>* Hadamard transform is an example of a generalized class of Fourier *>* transforms. It performs an orthogonal, symmetric, involutive, linear *>* operation on 2^(m) numbers. It is equivalent to a multidimensional DFT. 
*>* Time Complexity: *>>* O(nlogn) with Fast Walsh-Hadamard transform (FWHT) *>* Comparison with FFT: *>>* FWHT is very useful for reducing bandwidth storage requirements and *>* spread-spectrum analysis. Compared to the FFT, the FWHT requires less *>* storage space and is faster to calculate because it uses only real *>* additions and subtractions, while the FFT requires complex values. The FWHT *>* is able to represent signals with sharp discontinuities more accurately *>* using fewer coefficients than the FFT. *>* Some Usage examples: *>>* The Hadamard transform is used in data encryption, as well as many signal *>* processing and data compression algorithms, such as JPEG XR and MPEG-4 AVC. *>* It is also a crucial part of Grover's algorithm and Shor's algorithm in *>* quantum computing. The Hadamard transform is also applied in scientific *>* methods such as NMR, mass spectroscopy and crystallography. *> -------------- next part -------------- An HTML attachment was scrubbed... URL: From sudheerachary115 at gmail.com Thu Mar 22 23:44:11 2018 From: sudheerachary115 at gmail.com (sudheer achary) Date: Fri, 23 Mar 2018 03:44:11 +0000 Subject: [SciPy-Dev] regarding issue #8589 Message-ID: I would like to refactor the code as mentioned in https://github.com/scipy/scipy/issues/8589. Are there any issues with the proposed changes? -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Fri Mar 23 00:30:49 2018 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Thu, 22 Mar 2018 21:30:49 -0700 Subject: [SciPy-Dev] Feedback for the proposal In-Reply-To: References: Message-ID: Hi Samyak, On Wed, Mar 21, 2018 at 11:30 AM, Samyak Jain wrote: > Hi all, > > I am a 2nd-year student contributing towards SciPy's Rotation Formalism in > 3 dimensions project as a part of GSOC'18. > > I have drafted a proposal here. > > > I would appreciate any feedback on this so that I can improve my proposal. 
> That's a decent start, although I feel it's still a little light on context. One of the things I'm missing in your proposal (as well as in others' proposals) is references to and discussion of existing implementations: what can be reused, and why they're insufficient. See https://mail.python.org/pipermail/scipy-dev/2018-February/022534.html for a recent discussion on this. Cheers, Ralf > Thanks, > Samyak Jain > > Link : https://docs.google.com/document/d/12cP0LTFa0H3aQqJfb1WAmr5A_ > KU6bN3qZ17LbKd5YzM/edit?usp=sharing > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Fri Mar 23 00:39:36 2018 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Thu, 22 Mar 2018 21:39:36 -0700 Subject: [SciPy-Dev] GSoC Rotation Formalism Proposal In-Reply-To: References: Message-ID: On Tue, Mar 20, 2018 at 11:46 PM, Vishal Gupta wrote: > Hey, > > I am a 3rd-year CSE undergrad at SSNCE, Chennai, India. I'd like to work on > Rotation Formalism in 3 Dimensions as a part of this year's GSoC. > > https://docs.google.com/document/d/1y5OalGAvYkk8UvLtFjQ2-xhRj_ > NTwq0fV3GvlCBUp0I/edit?usp=sharing > > Thanks for sharing Vishal. I put in a few initial comments that should help fulfill the formal submission requirements. Ralf > Thanks, > Vishal Gupta > > Sent with Mailtrack > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From ralf.gommers at gmail.com Fri Mar 23 00:42:18 2018 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Thu, 22 Mar 2018 21:42:18 -0700 Subject: [SciPy-Dev] =?utf-8?q?Fast_Walsh=E2=80=93Hadamard_transform_for_?= =?utf-8?q?SciPy_=3F?= In-Reply-To: References: Message-ID: On Thu, Mar 22, 2018 at 12:39 PM, Chaman Agrawal wrote: > Hello, > Hi Chaman, your email client seems to be misconfigured, you're starting new threads. Can you please have a look at changing that? Cheers, Ralf > Thanks, please send the link ,it would be good to have multiple references. > > However the important part is where to implement this ,in fftpack or > signal as Ralf Gommers mentioned in the > issue page. > > Cheers, > Chaman > I have my own code for FHT if you are interested > > On Thu, Mar 22, 2018 at 2:52 PM Chaman Agrawal > wrote: > > >* Issue #8590 https://github.com/scipy/scipy/issues/8590 > *>>* Currently there is no implementation of Fast Walsh?Hadamard transform in > *>* SciPy. Although it seems that FWHT is not as general as FFT but it is > *>* pretty ubiquitous ,it is there is other maths and science softwares like > *>* MATLAB etc. .I would like to contribute towards it. Following are the > *>* details about it. > *>* Fast Walsh?Hadamard transform > *>>* Hadamard transform is an example of a generalized class of Fourier > *>* transforms. It performs an orthogonal, symmetric, involutive, linear > *>* operation on 2^(m) numbers. It is equivalent to a multidimensional DFT. > *>* Time Complexity: > *>>* O(nlogn) with Fast Walsh-Hadamard transform (FWHT) > *>* Comparison with FFT: > *>>* FWHT is very useful for reducing bandwidth storage requirements and > *>* spread-spectrum analysis. Compared to the FFT, the FWHT requires less > *>* storage space and is faster to calculate because it uses only real > *>* additions and subtractions, while the FFT requires complex values. 
The FWHT > *>* is able to represent signals with sharp discontinuities more accurately > *>* using fewer coefficients than the FFT. > *>* Some Usage examples: > *>>* The Hadamard transform is used in data encryption, as well as many signal > *>* processing and data compression algorithms, such as JPEG XR and MPEG-4 AVC. > *>* It is also a crucial part of Grover's algorithm and Shor's algorithm in > *>* quantum computing. The Hadamard transform is also applied in scientific > *>* methods such as NMR, mass spectroscopy and crystallography. > *> > > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From chaman.ag at gmail.com Fri Mar 23 08:41:13 2018 From: chaman.ag at gmail.com (Chaman Agrawal) Date: Fri, 23 Mar 2018 18:11:13 +0530 Subject: [SciPy-Dev] =?utf-8?q?Fast_Walsh=E2=80=93Hadamard_transform_for_?= =?utf-8?q?SciPy_=3F?= In-Reply-To: References: Message-ID: Hello everyone, Reference : Issue #8590 I wanted to get started with implementing FWHT, so I would like to ask a few things. 1) Would it be better to implement it in fftpack, or should it go in signal? 2) Can I implement it in C or Cython? I am not familiar with Fortran, so I cannot implement it the way the FFT is implemented. 3) If C or Cython is acceptable, please give me an overview of the structure of the implementation to help me get started. If there is any other suggestion/guidance that I might have forgotten to ask for, please provide it. It would be of great help. Thank you, Chaman On Fri, Mar 23, 2018 at 10:12 AM, Ralf Gommers wrote: > > > On Thu, Mar 22, 2018 at 12:39 PM, Chaman Agrawal > wrote: > >> Hello, >> > > Hi Chaman, your email client seems to be misconfigured, you're starting > new threads. Can you please have a look at changing that? 
> > Cheers, > Ralf > > >> Thanks, please send the link ,it would be good to have multiple >> references. >> >> However the important part is where to implement this ,in fftpack or >> signal as Ralf Gommers mentioned in the >> issue page. >> >> Cheers, >> Chaman >> I have my own code for FHT if you are interested >> >> On Thu, Mar 22, 2018 at 2:52 PM Chaman Agrawal > wrote: >> >> >* Issue #8590 https://github.com/scipy/scipy/issues/8590 >> *>>* Currently there is no implementation of Fast Walsh?Hadamard transform in >> *>* SciPy. Although it seems that FWHT is not as general as FFT but it is >> *>* pretty ubiquitous ,it is there is other maths and science softwares like >> *>* MATLAB etc. .I would like to contribute towards it. Following are the >> *>* details about it. >> *>* Fast Walsh?Hadamard transform >> *>>* Hadamard transform is an example of a generalized class of Fourier >> *>* transforms. It performs an orthogonal, symmetric, involutive, linear >> *>* operation on 2^(m) numbers. It is equivalent to a multidimensional DFT. >> *>* Time Complexity: >> *>>* O(nlogn) with Fast Walsh-Hadamard transform (FWHT) >> *>* Comparison with FFT: >> *>>* FWHT is very useful for reducing bandwidth storage requirements and >> *>* spread-spectrum analysis. Compared to the FFT, the FWHT requires less >> *>* storage space and is faster to calculate because it uses only real >> *>* additions and subtractions, while the FFT requires complex values. The FWHT >> *>* is able to represent signals with sharp discontinuities more accurately >> *>* using fewer coefficients than the FFT. >> *>* Some Usage examples: >> *>>* The Hadamard transform is used in data encryption, as well as many signal >> *>* processing and data compression algorithms, such as JPEG XR and MPEG-4 AVC. >> *>* It is also a crucial part of Grover's algorithm and Shor's algorithm in >> *>* quantum computing. 
The Hadamard transform is also applied in scientific >> *>* methods such as NMR, mass spectroscopy and crystallography. >> *> >> >> >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at python.org >> https://mail.python.org/mailman/listinfo/scipy-dev >> >> > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ndbecker2 at gmail.com Fri Mar 23 08:59:27 2018 From: ndbecker2 at gmail.com (Neal Becker) Date: Fri, 23 Mar 2018 12:59:27 +0000 Subject: [SciPy-Dev] =?utf-8?q?Fast_Walsh=E2=80=93Hadamard_transform_for_?= =?utf-8?q?SciPy_=3F?= In-Reply-To: References: Message-ID: I've placed my implementation here: https://gist.github.com/nbecker/aa4d2297f36db842a704ea44dadbf8fe This is written to use the ndarray C++ python interface library. I don't expect you to reuse the code directly (since that introduces dependencies that aren't in numpy/scipy), but you can use it as a reference. On Fri, Mar 23, 2018 at 8:41 AM Chaman Agrawal wrote: > Hello everyone, > > Reference : Issue #8590 > > I wanted to get started with implementing FWHT so I wanted to ask few > things please help. > > 1) Will it be good to implement it in FFT pack or it should be in the > signal pack. > > 2) Can I implement it in C or Cython as I am not familiar with Fortran so > I can not implement it like FFT is implemented. > > 3) If I can implement it in C or Cython please give me an overview of the > structure for the implementation to help me getting started. > > If there is any other suggestion/guidance that I might have forgot to ask > for,please provide it. It would be of great help. 
> > Thankyou, > Chaman > > On Fri, Mar 23, 2018 at 10:12 AM, Ralf Gommers > wrote: > >> >> >> On Thu, Mar 22, 2018 at 12:39 PM, Chaman Agrawal >> wrote: >> >>> Hello, >>> >> >> Hi Chaman, your email client seems to be misconfigured, you're starting >> new threads. Can you please have a look at changing that? >> >> Cheers, >> Ralf >> >> >>> Thanks, please send the link ,it would be good to have multiple >>> references. >>> >>> However the important part is where to implement this ,in fftpack or >>> signal as Ralf Gommers mentioned in the >>> issue page. >>> >>> Cheers, >>> Chaman >>> I have my own code for FHT if you are interested >>> >>> On Thu, Mar 22, 2018 at 2:52 PM Chaman Agrawal > wrote: >>> >>> >* Issue #8590 https://github.com/scipy/scipy/issues/8590 >>> *>>* Currently there is no implementation of Fast Walsh?Hadamard transform in >>> *>* SciPy. Although it seems that FWHT is not as general as FFT but it is >>> *>* pretty ubiquitous ,it is there is other maths and science softwares like >>> *>* MATLAB etc. .I would like to contribute towards it. Following are the >>> *>* details about it. >>> *>* Fast Walsh?Hadamard transform >>> *>>* Hadamard transform is an example of a generalized class of Fourier >>> *>* transforms. It performs an orthogonal, symmetric, involutive, linear >>> *>* operation on 2^(m) numbers. It is equivalent to a multidimensional DFT. >>> *>* Time Complexity: >>> *>>* O(nlogn) with Fast Walsh-Hadamard transform (FWHT) >>> *>* Comparison with FFT: >>> *>>* FWHT is very useful for reducing bandwidth storage requirements and >>> *>* spread-spectrum analysis. Compared to the FFT, the FWHT requires less >>> *>* storage space and is faster to calculate because it uses only real >>> *>* additions and subtractions, while the FFT requires complex values. The FWHT >>> *>* is able to represent signals with sharp discontinuities more accurately >>> *>* using fewer coefficients than the FFT. 
>>> *>* Some Usage examples: >>> *>>* The Hadamard transform is used in data encryption, as well as many signal >>> *>* processing and data compression algorithms, such as JPEG XR and MPEG-4 AVC. >>> *>* It is also a crucial part of Grover's algorithm and Shor's algorithm in >>> *>* quantum computing. The Hadamard transform is also applied in scientific >>> *>* methods such as NMR, mass spectroscopy and crystallography. >>> *> >>> >>> >>> _______________________________________________ >>> SciPy-Dev mailing list >>> SciPy-Dev at python.org >>> https://mail.python.org/mailman/listinfo/scipy-dev >>> >>> >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at python.org >> https://mail.python.org/mailman/listinfo/scipy-dev >> >> > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > -------------- next part -------------- An HTML attachment was scrubbed... URL: From emailforsayantan at gmail.com Fri Mar 23 11:24:28 2018 From: emailforsayantan at gmail.com (Sayantan Majumdar) Date: Fri, 23 Mar 2018 20:54:28 +0530 Subject: [SciPy-Dev] Thoughts on creating a toolbox for analysis of nonlinear dynamical system analysis Message-ID: I work with nonlinear dynamical systems, and there are no good toolboxes that provide fast and stable tools for the analysis of such systems. There were a lot of good toolboxes in the '80s (so I have heard), and there are some packages where those algorithms have been re-implemented and are sometimes used, like the PyDSTool package. But it is too slow, not everything in it is stable, and some important tools are missing. Could this be a good thing to start in this GSoC? -S -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From ralf.gommers at gmail.com Fri Mar 23 11:28:33 2018 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Fri, 23 Mar 2018 08:28:33 -0700 Subject: [SciPy-Dev] Thoughts on creating a toolbox for analysis of nonlinear dynamical system analysis In-Reply-To: References: Message-ID: On Fri, Mar 23, 2018 at 8:24 AM, Sayantan Majumdar < emailforsayantan at gmail.com> wrote: > I work with Nonlinear Dynamical Systems and there are no good toolboxes > that provide fast and stable tools for the analysis of such systems. > > there were a lot of good toolboxes in the 80's( I heard ) and there are > some packages where those algorithms have been re-implemented and sometimes > used, like the PyDSTool package.But its too slow and everything is not > stable in it plus some important tools are not there. > > This can be a good thing to start in this gsoc right? > No, that's not a good topic for GSoC, because it's too large in scope and because there's no mentoring project for it if you try to start something from scratch. Ralf > > -S > > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From joseph.c.slater at gmail.com Fri Mar 23 11:39:53 2018 From: joseph.c.slater at gmail.com (Joseph Slater) Date: Fri, 23 Mar 2018 11:39:53 -0400 Subject: [SciPy-Dev] Thoughts on creating a toolbox for analysis of nonlinear dynamical system analysis In-Reply-To: References: Message-ID: > On Mar 23, 2018, at 11:28 AM, Ralf Gommers wrote: > > > > On Fri, Mar 23, 2018 at 8:24 AM, Sayantan Majumdar wrote: > I work with Nonlinear Dynamical Systems and there are no good toolboxes that provide fast and stable tools for the analysis of such systems. 
> > there were a lot of good toolboxes in the 80's( I heard ) and there are some packages where those algorithms have been re-implemented and sometimes used, like the PyDSTool package.But its too slow and everything is not stable in it plus some important tools are not there. > > This can be a good thing to start in this gsoc right? > > No, that's not a good topic for GSoC, because it's too large in scope and because there's no mentoring project for it if you try to start something from scratch. > > Ralf > I concur with Ralf, while also agreeing with the need. I have taken a small step with my package mousai which at this time is a rather limited harmonic balance solver that certainly needs improvements for greater robustness (on github, and pypi- https://josephcslater.github.io/mousai/). I have spent more time adding issues than anything else lately. I would certainly welcome anyone who wants to join forces. At some time, merging into scipy would be a goal, but I don't believe that it's worthy right now. Ralf: There was a recent discussion on the frustrations of lack of SciPy contributors- mousai is an example, along with my other projects, some more relevant to potential future inclusion in SciPy, some less. I'm concerned with PyDSTool: I don't know that it is being maintained, and my observation was that the overhead for being able to use it is very high. My interest would be in connecting aerodynamic and structural codes (likely commercial) together. These could include millions of states. PyDSTool seemed to me to expect hand coded equations a bit much. Maybe it's just me, but it looks great but I thought the overhead to get into using it was off-putting for most engineers/students. Joe From lagru at mailbox.org Fri Mar 23 12:39:30 2018 From: lagru at mailbox.org (Lars G.) 
Date: Fri, 23 Mar 2018 17:39:30 +0100 Subject: [SciPy-Dev] Moving slow parts of find_peaks to Cython Message-ID: <138da1f9-e4df-411a-9e05-aa1ecaabb2f6@mailbox.org> Hello, I'm currently working on 3 PRs that move slow parts of `find_peaks`, `peak_prominences` and `peak_widths` to Cython. * ENH: Speed-up peak distance filtering with Cython https://github.com/scipy/scipy/pull/8523 * ENH: Cythonize peak_prominences https://github.com/scipy/scipy/pull/8541 * ENH: Cythonize peak_widths https://github.com/scipy/scipy/pull/8594 I'm quite happy with the first two PRs and think they have left the "work in progress" state and are ready for some final polishing. I might still tweak the third one here or there, but nothing too drastic. Because these 3 PRs modify the same files, I'd propose merging them in the given order (if desired) so that I can rebase and resolve any merge conflicts that arise. If any of you with more experience in Cython have tips to improve the performance even further, I'd be especially happy to learn. :D As always, thank you for your time and feedback. Best regards, Lars From ilhanpolat at gmail.com Fri Mar 23 15:09:38 2018 From: ilhanpolat at gmail.com (Ilhan Polat) Date: Fri, 23 Mar 2018 19:09:38 +0000 Subject: [SciPy-Dev] Thoughts on creating a toolbox for analysis of nonlinear dynamical system analysis In-Reply-To: References: Message-ID: My problem with nonlinear tools is that they are exceedingly generic; even the name itself says "anything not linear". This applies particularly to dynamics and control. Many attempts quickly get overwhelmed by that generality, though I wholeheartedly agree with the sentiment. A non-GUI, Simulink-like tool would be great indeed. If the scope gets narrower and it does one or two things sufficiently well and quickly, then it might(!) find a place in, say, sp.integrate or sp.signal. But otherwise it would indeed be out of scope. 
PyDSTool or BMSpy require really dedicated contributors because of the specialization needed to improve the code. Hence my knee-jerk response would be to first get their reaction about the roadmap they have in mind. On Fri, Mar 23, 2018, 16:40 Joseph Slater wrote: > > > > On Mar 23, 2018, at 11:28 AM, Ralf Gommers > wrote: > > > > > > > > On Fri, Mar 23, 2018 at 8:24 AM, Sayantan Majumdar < > emailforsayantan at gmail.com> wrote: > > I work with Nonlinear Dynamical Systems and there are no good toolboxes > that provide fast and stable tools for the analysis of such systems. > > > > there were a lot of good toolboxes in the 80's( I heard ) and there are > some packages where those algorithms have been re-implemented and sometimes > used, like the PyDSTool package.But its too slow and everything is not > stable in it plus some important tools are not there. > > > > This can be a good thing to start in this gsoc right? > > > > No, that's not a good topic for GSoC, because it's too large in scope > and because there's no mentoring project for it if you try to start > something from scratch. > > > > Ralf > > > > I concur with Ralf, while also agreeing with the need. I have taken a > small step with my package mousai which at this time is a rather limited > harmonic balance solver that certainly needs improvements for greater > robustness (on github, and pypi- https://josephcslater.github.io/mousai/). > I have spent more time adding issues than anything else lately. I would > certainly welcome anyone who wants to join forces. At some time, merging > into scipy would be a goal, but I don't believe that it's worthy right now. > > Ralf: There was a recent discussion on the frustrations of lack of SciPy > contributors- mousai is an example, along with my other projects, some more > relevant to potential future inclusion in SciPy, some less. 
> > I'm concerned with PyDSTool: I don't know that it is being maintained, and > my observation was that the overhead for being able to use it is very high. > My interest would be in connecting aerodynamic and structural codes (likely > commercial) together. These could include millions of states. PyDSTool > seemed to me to expect hand coded equations a bit much. Maybe it's just me, > but it looks great but I thought the overhead to get into using it was > off-putting for most engineers/students. > > Joe > > > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > -------------- next part -------------- An HTML attachment was scrubbed... URL: From sudheerachary115 at gmail.com Fri Mar 23 19:20:41 2018 From: sudheerachary115 at gmail.com (sudheer achary) Date: Fri, 23 Mar 2018 23:20:41 +0000 Subject: [SciPy-Dev] Review GSoC-2018 Proposal Message-ID: I am Sudheer Achary, a 3rd-year B.Tech computer science student from the International Institute of Information Technology, Hyderabad. I am interested in contributing to SciPy through GSoC 2018. I kindly request a review of my proposal and suggestions for any changes that need to be made. I have a little more to add; here I share my Google doc. Thanks. Scipy-Proposal GSoC 2018 Sudheer Achary -------------- next part -------------- An HTML attachment was scrubbed... URL: From phillip.m.feldman at gmail.com Fri Mar 23 20:49:10 2018 From: phillip.m.feldman at gmail.com (Phillip Feldman) Date: Fri, 23 Mar 2018 17:49:10 -0700 Subject: [SciPy-Dev] Review GSoC-2018 Proposal In-Reply-To: References: Message-ID: Dear Sudheer, This is an extraordinarily well thought-out and well-written proposal. I believe that this would be a very useful addition to SciPy. I have a suggested change of wording. 
Current: "Implement the mechanism of switching the rotational forms (eg: Euler angles to directional cosine matrices (DCM) etc)." Suggested: "Implement conversions among the various rotation representations, e.g., from Euler angles to directional cosine matrices (DCM) etc." Yours, Phillip On Fri, Mar 23, 2018 at 4:20 PM, sudheer achary wrote: > I am Sudheer Achary, 3rd year B.Tech computer science student from > International Institute of Information technology, Hyderabad. Iam > interested in contributing to SciPy through GSoC-2k18. i kindly request to > review my proposal and suggest me appropriate changes need to me made, I > have little more to add and here i share my google doc. > > thanks. > [image: Google Document] > Scipy-Proposal GSoC 2018 Sudheer Achary > > > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Sat Mar 24 02:03:36 2018 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Fri, 23 Mar 2018 23:03:36 -0700 Subject: [SciPy-Dev] Thoughts on creating a toolbox for analysis of nonlinear dynamical system analysis In-Reply-To: References: Message-ID: On Fri, Mar 23, 2018 at 8:39 AM, Joseph Slater wrote: > > > Ralf: There was a recent discussion on the frustrations of lack of SciPy > contributors- mousai is an example, along with my other projects, some more > relevant to potential future inclusion in SciPy, some less. > I wouldn't call them frustrations - compared to some years ago we're in pretty good shape. It's also not lack of contributors (200 open PRs currently and ~100 contributors per release), it's lack of maintainers. Mousai does look like a good start. I'm not a domain expert, but I'm well aware of the limited options in this area. 
Adding Mousai to https://scipy.org/topical-software.html may be useful to make it more discoverable - I'd be happy to merge a PR at https://github.com/scipy/scipy.org Cheers, Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Sat Mar 24 02:06:27 2018 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Fri, 23 Mar 2018 23:06:27 -0700 Subject: [SciPy-Dev] =?utf-8?q?Fast_Walsh=E2=80=93Hadamard_transform_for_?= =?utf-8?q?SciPy_=3F?= In-Reply-To: References: Message-ID: On Thu, Mar 22, 2018 at 11:51 AM, Chaman Agrawal wrote: > Issue #8590 https://github.com/scipy/scipy/issues/8590 > > Currently there is no implementation of Fast Walsh–Hadamard transform in > SciPy. Although it seems that FWHT is not as general as FFT but it is > pretty ubiquitous ,it is there is other maths and science softwares like > MATLAB etc. .I would like to contribute towards it. Following are the > details about it. > There seems to be enough interest and wide enough applicability, I'd be in favour of merging a good implementation in scipy.fftpack. Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From chaman.ag at gmail.com Sat Mar 24 03:54:53 2018 From: chaman.ag at gmail.com (Chaman Agrawal) Date: Sat, 24 Mar 2018 13:24:53 +0530 Subject: [SciPy-Dev] GSOC'18 : Enhance the Randomized Numerical Linear Algebra functionality Message-ID: Hello everyone, Goal of this mail: to better understand the current status of the project, and to gather suggestions/resources to speed up my research for it. I am a second-year undergraduate at the Indian Institute of Technology Kanpur, India, highly interested in linear algebra, probability theory, statistics, and algorithms in general. Before sending this I went through the email archives to avoid asking any redundant questions, but I was not able to find a discussion about this project. 
Is this project not open, or are there already some strong candidates whose discussions I was not able to find? I wanted to ask for a more detailed description of the project compared to what is on the Ideas page. I had a few doubts: since both the Clarkson–Woodruff transformation and the Johnson–Lindenstrauss lemma give the benefit of dimension reduction, will this project be more focused on dimension reduction techniques, or on other randomized techniques like the subsampled Hadamard transform matrix etc.? Also, if there is a list of desired algorithms to be implemented, it would be a great speed booster. Thank you, Chaman Agrawal Indian Institute of Technology Kanpur, India -------------- next part -------------- An HTML attachment was scrubbed... URL: From rlucente at pipeline.com Sat Mar 24 07:58:22 2018 From: rlucente at pipeline.com (Robert Lucente) Date: Sat, 24 Mar 2018 07:58:22 -0400 (GMT-04:00) Subject: [SciPy-Dev] Thoughts on creating a toolbox for analysis of nonlinear dynamical system analysis Message-ID: <212966663.611.1521892702900@wamui-moana.atl.sa.earthlink.net> An HTML attachment was scrubbed... URL: From anubhavp28 at gmail.com Sat Mar 24 08:30:28 2018 From: anubhavp28 at gmail.com (Anubhav Patel) Date: Sat, 24 Mar 2018 18:00:28 +0530 Subject: [SciPy-Dev] SciPy : GSoC Proposal [Seeking Feedback] on Rotation Formalism in 3 Dimension Message-ID: Hi everyone, I want to contribute to SciPy through GSoC 2018. I am currently seeking feedback on my proposal ( https://docs.google.com/document/d/1ylzugkvVYI7m3IXsWVLD4EkE6zQd8B9YDPKtTno9f74/edit?usp=sharing ). I would appreciate any advice regarding the proposal. Considering Ralf's advice to look for existing implementations, I am currently looking into a few implementations of conversions between rotation representations, for reuse and inspiration. One of the issues I felt with almost all of them is that they are designed to work on a single rotation at a time. The API they use sort of suggests it.
Consider the example of transforms3d ( http://matthew-brett.github.io/transforms3d/ ): because its API accepts a single rotation at a time, not an array of rotations, it tends not to utilise numpy vector operations and misses out on the speed improvement that comes with them. I decided to reach out to you regarding how useful support for converting multiple rotations between formalisms at once would be. Is it even worth considering? How frequently does such a scenario occur, where you have to convert a sequence of rotations from one formalism to another? I created a test (code at https://gist.github.com/anubhavp28/e79645a544c6e7b16b408a172e522134 ) to see the possible improvement with vector operations. I implemented rotation matrix to quaternion conversion using vector operations and timed it against what a user would have to do to achieve the same result with transforms3d. My code took about 0.009185953000269365 while the one with transforms3d took about 0.022295135000604205, for 100 iterations with the sample matrices provided in the code. I would appreciate any suggestions. Anubhav Patel -------------- next part -------------- An HTML attachment was scrubbed... URL: From nikolay.mayorov at zoho.com Sat Mar 24 09:41:30 2018 From: nikolay.mayorov at zoho.com (Nikolay Mayorov) Date: Sat, 24 Mar 2018 18:41:30 +0500 Subject: [SciPy-Dev] SciPy : GSoC Proposal [Seeking Feedback] on Rotation Formalism in 3 Dimension In-Reply-To: References: Message-ID: <162583e26b0.10fa46a5996033.2062527831634623735@zoho.com> Hi, Anubhav! I will give feedback on your proposal today. I already looked through it and it is decent. As for your question regarding conversions: it is absolutely necessary to support bulk from/to operations, because Rotation (as perhaps agreed) can represent multiple rotations. It can be implemented as a single conversion applied to each rotation or using vectorized numpy code.
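The bulk conversion discussed in this thread can be sketched in a few lines of numpy. This is a hedged illustration only: the function name `mats_to_quats`, the scalar-first [w, x, y, z] convention, and the restriction to rotations away from 180 degrees are assumptions of this sketch, not details taken from the gist or from any proposal.

```python
import numpy as np

def mats_to_quats(R):
    """Convert a batch of rotation matrices, shape (n, 3, 3), to
    quaternions [w, x, y, z], shape (n, 4), in one vectorized pass.

    Sketch only: assumes well-conditioned rotations (trace > -1, i.e.
    nothing within roundoff of a 180-degree rotation), so w never
    vanishes and the simple trace formula is safe.
    """
    R = np.asarray(R, dtype=float)
    trace = np.einsum('nii->n', R)          # per-matrix trace, vectorized
    w = 0.5 * np.sqrt(1.0 + trace)          # scalar part, one sqrt per matrix
    q = np.empty((R.shape[0], 4))
    q[:, 0] = w
    # vector part from the antisymmetric part of R
    q[:, 1] = (R[:, 2, 1] - R[:, 1, 2]) / (4.0 * w)
    q[:, 2] = (R[:, 0, 2] - R[:, 2, 0]) / (4.0 * w)
    q[:, 3] = (R[:, 1, 0] - R[:, 0, 1]) / (4.0 * w)
    return q
```

A production version would additionally need a numerically robust branch for near-180-degree rotations (e.g. picking the largest of the four squared components, as in Shepperd's method), which trades some of the simple vectorization above for accuracy.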
In fact I believe both implementations will be necessary depending on the number of rotations contained. I hope you got my idea. And yes, bulk conversions arise frequently in my opinion. Best, Nikolay Sent using Zoho Mail ---- On Sat, 24 Mar 2018 17:30:28 +0500 anubhavp28 at gmail.com wrote ---- Hi everyone, I am want to contribute to SciPy through GSoC 2018. I am currently seeking feedback on my proposal ( https://docs.google.com/document/d/1ylzugkvVYI7m3IXsWVLD4EkE6zQd8B9YDPKtTno9f74/edit?usp=sharing ). I would appreciate any advice regarding the proposal. Considering Ralf advice to look for existing implementation, I am currently looking into few implementation for conversions between rotation representations, for reuse and inspiration. One of the issues I felt with almost all of them is that they are designed to work on single rotation at a time. The API they use sort of suggest towards it. Consider the example of transforms3d ( http://matthew-brett.github.io/transforms3d/ ), due to it's API accepting a single rotation at a time, not a array of rotations, they tend not to utilise numpy vector operations and miss out on the speed improvement that comes with it. I decided to reach out to you regarding how useful would the support for multiple conversions among rotation formalism at once be. It is even worth considering? How frequent does such a scenario occurs where you have to convert a sequence of rotation from one formalism to another. I create a test (code at https://gist.github.com/anubhavp28/e79645a544c6e7b16b408a172e522134 ) to see possible improvement with vector operations. I implemented rotation matrix to quaternion conversion using vector operations and timed it against what a user would have to do to achieve the same result with transforms3d. My code took about 0.009185953000269365 while the one with transform3d took about 0.022295135000604205, for 100 iterations with sample matrices provided in the code. I would appreciate any suggestions.
Anubhav Patel _______________________________________________ SciPy-Dev mailing list SciPy-Dev at python.org https://mail.python.org/mailman/listinfo/scipy-dev -------------- next part -------------- An HTML attachment was scrubbed... URL: From ndbecker2 at gmail.com Sat Mar 24 10:19:13 2018 From: ndbecker2 at gmail.com (Neal Becker) Date: Sat, 24 Mar 2018 14:19:13 +0000 Subject: [SciPy-Dev] =?utf-8?q?Fast_Walsh=E2=80=93Hadamard_transform_for_?= =?utf-8?q?SciPy_=3F?= In-Reply-To: References: Message-ID: I see I have made use of the complex Walsh–Hadamard in the past; here's another code snippet that shows construction of the real and complex matrices:

import numpy as np  # needed by both functions below

def walsh(n, dtype=int):
    # walsh(k) is the k-fold Kronecker power of the 2x2 Hadamard kernel
    w2 = np.array(((1, 1), (1, -1)), dtype=dtype)
    if n == 0:
        return np.array(((1,),))
    elif n == 1:
        return w2
    else:
        m2 = walsh(n - 1)
        return np.kron(w2, m2)

def cwalsh(n):
    # complex variant: one complex 2x2 factor on top of the real matrix
    if n == 0:
        return np.array(((1,),))
    c2 = np.array(((1, 1j), (1j, 1)))
    if n == 1:
        return c2
    else:
        m2 = walsh(n - 1)
        return np.ascontiguousarray(np.kron(c2, m2))

The code I posted earlier has an implementation of the fast complex transform (as well as the fast real transform). On Sat, Mar 24, 2018 at 2:06 AM Ralf Gommers wrote: > On Thu, Mar 22, 2018 at 11:51 AM, Chaman Agrawal > wrote: > >> Issue #8590 https://github.com/scipy/scipy/issues/8590 >> >> Currently there is no implementation of the fast Walsh–Hadamard transform in >> SciPy. Although FWHT is not as general as the FFT, it is pretty ubiquitous; >> it is present in other maths and science software such as >> MATLAB. I would like to contribute towards it. Following are the >> details about it. >> > There seems to be enough interest and wide enough applicability, I'd be in > favour of merging a good implementation in scipy.fftpack. > > Ralf > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > -------------- next part -------------- An HTML attachment was scrubbed...
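The "fast" transform Neal mentions is the O(n log n) butterfly counterpart of the dense matrix product above. The sketch below is a hedged illustration, not the code from his earlier post: the function name and the unnormalized convention are choices made here, but for a length-2**k input it should agree with walsh(k) @ x.

```python
import numpy as np

def fwht(a):
    """Fast Walsh-Hadamard transform in natural/Hadamard ordering,
    O(n log n) versus the O(n^2) dense matrix product.

    The input length must be a power of two.  The result is
    unnormalized, so fwht(fwht(x)) == len(x) * x.
    """
    a = np.array(a, dtype=float)  # work on a copy
    n = a.size
    if n == 0 or n & (n - 1):
        raise ValueError("length must be a power of two")
    h = 1
    while h < n:
        # each block of size 2*h: butterfly (x, y) -> (x + y, x - y)
        for i in range(0, n, 2 * h):
            x = a[i:i + h].copy()
            y = a[i + h:i + 2 * h].copy()
            a[i:i + h] = x + y
            a[i + h:i + 2 * h] = x - y
        h *= 2
    return a
```

For example, fwht([1, 0, 0, 0]) gives [1, 1, 1, 1], the first column of the Hadamard matrix, and applying the transform twice returns n times the input; dividing by n (or by sqrt(n) on each pass) recovers the usual normalized conventions.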
URL: From joseph.c.slater at gmail.com Sat Mar 24 12:36:17 2018 From: joseph.c.slater at gmail.com (Joseph Slater) Date: Sat, 24 Mar 2018 12:36:17 -0400 Subject: [SciPy-Dev] Thoughts on creating a toolbox for analysis of nonlinear dynamical system analysis In-Reply-To: References: Message-ID: > On Mar 24, 2018, at 2:03 AM, Ralf Gommers wrote: > > > > On Fri, Mar 23, 2018 at 8:39 AM, Joseph Slater wrote: > > > Ralf: There was a recent discussion on the frustrations of lack of SciPy contributors- mousai is an example, along with my other projects, some more relevant to potential future inclusion in SciPy, some less. > > I wouldn't call them frustrations - compared to some years ago we're in pretty good shape. It's also not lack of contributors (200 open PRs currently and ~100 contributors per release), it's lack of maintainers. > > Mousai does look like a good start. I'm not a domain expert, but I'm well aware of the limited options in this area. Adding Mousai to https://scipy.org/topical-software.html may be useful to make it more discoverable - I'd be happy to merge a PR at https://github.com/scipy/scipy.org I apologize for my laziness in description. I was being overly expedient. I would love to have it more discoverable. I would certainly love users, contributors, etc. Do I need to do anything? I have to admit being a bit cautious or timid- my personal sense of awe at the quality of the code in SciPy, NumPy, etc., highlights what is a 20+ year loss in the Matlab wilderness- with commensurate brain damage. I'm not sure when something is worth inclusion in scipy.org, but perhaps I should have been asking earlier. I do have 3 other packages, one just a little, but useful, hack: vibrationtesting: parallel to a manuscript I'm working on. The intention is to maybe roll this into SciPy, at least in part. However, the signal processing needs are definitely a bit different, so perhaps it's best as a standalone. Pieces could be brought over eventually.
vibration_toolbox: educational demonstration code (demonstration of the physics, not of Python) with modest problem-solving usefulness. array_to_latex (a much faster way to make a properly formatted LaTeX array from a numpy 2D array). The last is tiny, a bit of a pragmatic hack, but oh so useful when trying to get an array into a document. I would love to figure out an appropriate way to make it a numpy array method - alas, my skills in object-oriented programming fall short. If any of this is appropriate, I appreciate the attention and guidance. Best Regards- Joe From joseph.c.slater at gmail.com Sat Mar 24 13:33:40 2018 From: joseph.c.slater at gmail.com (Joseph Slater) Date: Sat, 24 Mar 2018 13:33:40 -0400 Subject: [SciPy-Dev] Thoughts on creating a toolbox for analysis of nonlinear dynamical system analysis In-Reply-To: References: Message-ID: <609D7EEF-1659-46B3-8CA1-86760B28BFB9@gmail.com> > On Mar 24, 2018, at 2:03 AM, Ralf Gommers wrote: > > > > On Fri, Mar 23, 2018 at 8:39 AM, Joseph Slater wrote: > > > Ralf: There was a recent discussion on the frustrations of lack of SciPy contributors- mousai is an example, along with my other projects, some more relevant to potential future inclusion in SciPy, some less. > > I wouldn't call them frustrations - compared to some years ago we're in pretty good shape. It's also not lack of contributors (200 open PRs currently and ~100 contributors per release), it's lack of maintainers. > > Mousai does look like a good start. I'm not a domain expert, but I'm well aware of the limited options in this area. Adding Mousai to https://scipy.org/topical-software.html may be useful to make it more discoverable - I'd be happy to merge a PR at https://github.com/scipy/scipy.org I've submitted a pull request. There is no category for Mechanical/Civil/Aerospace Engineering. It may not yet be time, but it might be worthwhile to have a separate category "Engineering: other", or maybe a separate page for engineering-focused tools in general.
I noticed xarray and pandas in what I thought were odd locations. I would have put them in a category closer to data frames or database management or such (pandas is listed in economics, xarray in other). Categorizing is almost a futile effort, though, given my past experience maintaining the MacTeX wiki. Best Regards- Joe From phillip.m.feldman at gmail.com Sat Mar 24 17:02:50 2018 From: phillip.m.feldman at gmail.com (Phillip Feldman) Date: Sat, 24 Mar 2018 14:02:50 -0700 Subject: [SciPy-Dev] how to multiply an instance of ndarray and an object? Message-ID: I have a class called `Signal` that handles multiplication by an array on the right by overloading `__mul__`. I tried to implement multiplication by an array on the left by overloading `__rmul__`, but wasn't able to make this work because NumPy thinks that it knows how to handle this, so my `__rmul__` method (below) is never called. The result is an array having dtype `object`. Is there a practical way to modify NumPy to solve this problem? Phillip

def __rmul__(self, c):
    S = self.copy()
    S.phasors *= c
    return S

-------------- next part -------------- An HTML attachment was scrubbed... URL: From nikolay.mayorov at zoho.com Sat Mar 24 18:36:13 2018 From: nikolay.mayorov at zoho.com (Nikolay Mayorov) Date: Sun, 25 Mar 2018 03:36:13 +0500 Subject: [SciPy-Dev] Feedback to students applying to Rotation Formalism in 3D GSoC project Message-ID: <1625a27b273.1009a8b2c32460.2257944450853551064@zoho.com> I decided to give feedback to all the students in one letter. My feedback is proportional to the level of thoroughness of a proposal, and I focus on the weakest points. There are strong points as well and we value them; however, I don't mention them here (sorry). To Vishal Gupta https://docs.google.com/document/d/1y5OalGAvYkk8UvLtFjQ2-xhRj_NTwq0fV3GvlCBUp0I/edit 1. The Quaternion section description is weird.
What's this about "Method would accept a 3x1 Axis unit vector and theta which will be used to generate a 4x1 Quaternion (real, i, j, k). Also add a method to return a 3x3 Rotation Matrix."? 2. I think it's better to dedicate a separate API section rather than putting it into milestones. 3. When you add the API section, try to make it more detailed. Discuss fine points and conventions. 4. Add references. To Aditya Bharti https://docs.google.com/document/d/1UriyLABwgjUcYfBofSr4hqAnLzblJ71pkzm7i26O8ws/edit 1. I think that returning Euler angles corresponding to the fixed sequence is not satisfactory, so researching how to implement that is valuable. I personally don't know much about working with extrinsic rotations; from my experience they are not used that often. Implementing conversion to arbitrary intrinsic rotations (A-B-A or A-B-C) is certainly doable, but challenging. 2. What other important operations, similar (in some sense) to composition, can you think of? 3. The apply method is not discussed enough. I leave the details for you to figure out. 4. slerp, random_sample and Wahba are all not thought through. Give real thought to these algorithms and their APIs. 5. The Quaternion class is also not thought through. Give it real thought, or explain why you believe it is necessary. For example, if you think that the quaternion is a superior representation, why not use it in Rotation? 6. I think that you can at least mention other possible algorithms (as "if I have time" items). I still believe that "quaternion spline" is a great algorithm, which would be good to introduce. To Samyak Jain https://docs.google.com/document/d/12cP0LTFa0H3aQqJfb1WAmr5A_KU6bN3qZ17LbKd5YzM/edit 1. It's good that you honestly mentioned your planned vacation; however, it certainly doesn't win you any points. I don't know what to advise you here. 2. It's good that you added theoretical descriptions, but they are not quite right at all points. Probably not the most important thing at the moment. 3.
The API design seems inconsistent and strange. You first mention a Rotation class, but then introduce what looks like a standalone function which returns 4-tuple "quaternions". It doesn't look very well thought out. 4. References to MATLAB are not evil in themselves, but you should explain what you want from them. It is not an available implementation you can use as a code source. To Anubhav Patel https://docs.google.com/document/d/1ylzugkvVYI7m3IXsWVLD4EkE6zQd8B9YDPKtTno9f74/edit 1. Fixing the sequence of Euler angles is TOO restrictive, we can't do that. This applies to both "from" and "to" conversions. 2. I don't necessarily see the benefits of having two possible ways to initialize the class: from __init__ and from "factory methods". Can we do it better? It's not a critical point for sure. 3. On setfrommatrix --- there is a reference listed in your proposal where a robust algorithm, insensitive to non-orthogonality, is proposed; can you spot it and suggest it for this? 4. See hint 2 for Aditya Bharti. 5. I feel that the rotate discussion is not detailed enough. What exactly does Rotation.rotate(v) do? Is there only one way to apply a "rotation"? 6. The APIs for slerp, qspline and davenportq are not great. Think about how to improve them. 7. In your Timeline: you go too much into the internals of implementing qspline. It is good in one sense; however, you should use understandable descriptions, not "_rates", "_slew", etc. You are not rewriting the code, you are implementing the algorithm from scratch. To Sudheer Achary https://docs.google.com/document/d/13U4feVYTJJfGFgSHA37p_JxpexBKZJ2-VSjs4z7owcQ/edit 1. Why does __init__ accept Euler angles? 2. What is the reasoning behind introducing rotate_... and other similar methods? These are static methods. Wouldn't it be more logical for Rotation itself to rotate vectors? 3. I don't see a way to compose rotations. 4. I don't know how to put it politely, but overall I don't get your API design.
It makes some sense, but it just seems really inconvenient, verbose and hard to grasp. What didn't you like about Rotation being an abstraction that can rotate vectors, be composed with other Rotations and be converted from/to common representations? 5. Maybe it would be better if you wrote the method signatures out properly and not just (). -------------------------- OK, that's it. All proposals and my comments are open to everybody, so you can learn from them freely. Apologies if I wasn't very attentive to some details; there are many proposals. If any of the students want to ask or say something, I suggest using this mail thread. Best regards, Nikolay Mayorov -------------- next part -------------- An HTML attachment was scrubbed... URL: From ilhanpolat at gmail.com Sat Mar 24 18:59:44 2018 From: ilhanpolat at gmail.com (Ilhan Polat) Date: Sat, 24 Mar 2018 23:59:44 +0100 Subject: [SciPy-Dev] how to multiply an instance of ndarray and an object? In-Reply-To: References: Message-ID: Yes, you can write

__array_priority__ = 1000  # some number (bigger wins, but against what you can never be sure)

or

__array_ufunc__ = None  # NumPy 1.13+ only

somewhere in your class, and NumPy will leave you alone because this overrides its eagerness to handle the vectorization and end up with object data types. On Sat, Mar 24, 2018 at 10:02 PM, Phillip Feldman < phillip.m.feldman at gmail.com> wrote: > I have a class called `Signal` that handles multiplication by an array on > the right by overloading `__mul__`. I tried to implement multiplication by > an array on the left by overloading `__rmul__`, but wasn't able to make > this work because NumPy thinks that it knows how to handle this, so my > `__rmul__` method (below) is never called. The result is an array having > dtype `object`.
Is there a practical way to modify NumPy to solve this > problem > > Phillip > > def __rmul__(self, c): > S= self.copy() > S.phasors*= c > return S > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Sat Mar 24 23:33:21 2018 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Sat, 24 Mar 2018 20:33:21 -0700 Subject: [SciPy-Dev] ANN: SciPy 1.0.1 released Message-ID: On behalf of the SciPy development team I am pleased to announce the availability of SciPy 1.0.1. This is a maintenance release, no new features with respect to 1.0.0. See the release notes below for details. Wheels and sources can be found on PyPI (https://pypi.python.org/pypi/scipy) and on GitHub (https://github.com/scipy/scipy/releases/tag/v1.0.1). The conda-forge channel will be up to date within a couple of hours. Thanks to everyone who contributed to this release! Cheers, Ralf SciPy 1.0.1 Release Notes ========================= SciPy 1.0.1 is a bug-fix release with no new features compared to 1.0.0. Probably the most important change is a fix for an incompatibility between SciPy 1.0.0 and ``numpy.f2py`` in the NumPy master branch. Authors ======= * Saurabh Agarwal + * Alessandro Pietro Bardelli * Philip DeBoer * Ralf Gommers * Matt Haberland * Eric Larson * Denis Laxalde * Mihai Capotă + * Andrew Nelson * Oleksandr Pavlyk * Ilhan Polat * Anant Prakash + * Pauli Virtanen * Warren Weckesser * @xoviat * Ted Ying + A total of 16 people contributed to this release. People with a "+" by their names contributed a patch for the first time. This list of names is automatically generated, and may not be fully complete.
Issues closed for 1.0.1 ----------------------- - `#7493 `__: `ndimage.morphology` functions are broken with numpy 1.13.0 - `#8118 `__: minimize_cobyla broken if `disp=True` passed - `#8142 `__: scipy-v1.0.0 pdist with metric=`minkowski` raises `ValueError:... - `#8173 `__: `scipy.stats.ortho_group` produces all negative determinants... - `#8207 `__: gaussian_filter seg faults on float16 numpy arrays - `#8234 `__: `scipy.optimize.linprog` `interior-point` presolve bug with trivial... - `#8243 `__: Make csgraph importable again via `from scipy.sparse import*` - `#8320 `__: scipy.root segfaults with optimizer 'lm' Pull requests for 1.0.1 ----------------------- - `#8068 `__: BUG: fix numpy deprecation test failures - `#8082 `__: BUG: fix solve_lyapunov import - `#8144 `__: MRG: Fix for cobyla - `#8150 `__: MAINT: resolve UPDATEIFCOPY deprecation errors - `#8156 `__: BUG: missing check on minkowski w kwarg - `#8187 `__: BUG: Sign of elements in random orthogonal 2D matrices in "ortho_group_gen"... - `#8197 `__: CI: uninstall oclint - `#8215 `__: Fixes Numpy datatype compatibility issues - `#8237 `__: BUG: optimize: fix bug when variables fixed by bounds are inconsistent... - `#8248 `__: BUG: declare "gfk" variable before call of terminate() in newton-cg - `#8280 `__: REV: reintroduce csgraph import in scipy.sparse - `#8322 `__: MAINT: prevent scipy.optimize.root segfault closes #8320 - `#8334 `__: TST: stats: don't use exact equality check for hdmedian test - `#8477 `__: BUG: signal/signaltools: fix wrong refcounting in PyArray_OrderFilterND - `#8530 `__: BUG: linalg: Fixed typo in flapack.pyf.src. - `#8566 `__: CI: Temporarily pin Cython version to 0.27.3 - `#8573 `__: Backports for 1.0.1 - `#8581 `__: Fix Cython 0.28 build break of qhull.pyx -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From ralf.gommers at gmail.com Sun Mar 25 03:11:58 2018 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Sun, 25 Mar 2018 00:11:58 -0700 Subject: [SciPy-Dev] GSOC'18 : Enhance the Randomized Numerical Linear Algebra functionality In-Reply-To: References: Message-ID: On Sat, Mar 24, 2018 at 12:54 AM, Chaman Agrawal wrote: > Hello everyone, > > Goal of the mail : To understand better about the current status about the > project and suggestions/resources to speed up my research for it. > > I am a second year undergraduate at Indian Institute of Technology > Kanpur,India highly interested in Linear Algebra ,Probability Theory and > Statistics and algorithms in general. > Hi Chaman. Yes there are a couple of strong candidates already: https://github.com/scipy/scipy/issues/8498#issuecomment-370731126. And you're quite late to get started on this project - I would suggest your time is better spent looking for other opportunities. Ralf > > Before sending this I went through the emails archive to avoid asking any > redundant question but I was not able to get discussion about this project. > Is this project not open or there are already some strong candidates whose > discussions I was not able to find? I wanted to ask for a more detailed > description about the project compared to what is there in the Ideas page. > > I had a few doubts like by both Clarkson Woodruff transformation and > Johnson?Lindenstrauss lemma we get benefit of dimension reduction so will > this project be more focused on dimension reduction techniques or other > randomized techniques like subsampled Hadamard Transform Matrix > etc. > > Also if there is a list of desired algorithms to be implemented it would > be a great speed booster. 
> > Thank you, > Chaman Agrawal > Indian Institute of Technology Kanpur,India > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From chaman.ag at gmail.com Sun Mar 25 08:31:57 2018 From: chaman.ag at gmail.com (Chaman Agrawal) Date: Sun, 25 Mar 2018 18:01:57 +0530 Subject: [SciPy-Dev] GSOC'18 : Enhance the Randomized Numerical Linear Algebra functionality In-Reply-To: References: Message-ID: Hello Ralf, It's unfortunate to hear this, but I was expecting it as I am really late. Then will SciPy have only two projects for GSoC this year? I don't think I should ask this, but still: do you have any ideas other than those mentioned in the Ideas page for GSoC? Thank you, Chaman Agrawal On Sun, Mar 25, 2018 at 12:41 PM, Ralf Gommers wrote: > > > On Sat, Mar 24, 2018 at 12:54 AM, Chaman Agrawal > wrote: > >> Hello everyone, >> >> Goal of the mail : To understand better about the current status about >> the project and suggestions/resources to speed up my research for it. >> >> I am a second year undergraduate at Indian Institute of Technology >> Kanpur,India highly interested in Linear Algebra ,Probability Theory and >> Statistics and algorithms in general. >> > > Hi Chaman. Yes there are a couple of strong candidates already: > https://github.com/scipy/scipy/issues/8498#issuecomment-370731126. And > you're quite late to get started on this project - I would suggest your > time is better spent looking for other opportunities. > > Ralf > > > >> >> Before sending this I went through the emails archive to avoid asking any >> redundant question but I was not able to get discussion about this project. >> Is this project not open or there are already some strong candidates whose >> discussions I was not able to find?
I wanted to ask for a more detailed >> description about the project compared to what is there in the Ideas page. >> >> I had a few doubts like by both Clarkson Woodruff transformation and >> Johnson?Lindenstrauss lemma we get benefit of dimension reduction so will >> this project be more focused on dimension reduction techniques or other >> randomized techniques like subsampled Hadamard Transform Matrix >> etc. >> >> Also if there is a list of desired algorithms to be implemented it would >> be a great speed booster. >> >> Thank you, >> Chaman Agrawal >> Indian Institute of Technology Kanpur,India >> >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at python.org >> https://mail.python.org/mailman/listinfo/scipy-dev >> >> > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Sun Mar 25 15:56:11 2018 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Sun, 25 Mar 2018 12:56:11 -0700 Subject: [SciPy-Dev] GSOC'18 : Enhance the Randomized Numerical Linear Algebra functionality In-Reply-To: References: Message-ID: On Sun, Mar 25, 2018 at 5:31 AM, Chaman Agrawal wrote: > Hello Ralf, > > It's unfortunate to hear this but I was expecting this as I am really late > .Then will Scipy will have only two projects for GSoC this year ? I don't > think I should ask this but still, do you have any idea other than those > mentioned in the ideas page for GSoC? > At the moment there are no other ideas for which we have mentors already available, sorry. 
Ralf > > Thankyou > Chaman Agrawal > > On Sun, Mar 25, 2018 at 12:41 PM, Ralf Gommers > wrote: > >> >> >> On Sat, Mar 24, 2018 at 12:54 AM, Chaman Agrawal >> wrote: >> >>> Hello everyone, >>> >>> Goal of the mail : To understand better about the current status about >>> the project and suggestions/resources to speed up my research for it. >>> >>> I am a second year undergraduate at Indian Institute of Technology >>> Kanpur,India highly interested in Linear Algebra ,Probability Theory and >>> Statistics and algorithms in general. >>> >> >> Hi Chaman. Yes there are a couple of strong candidates already: >> https://github.com/scipy/scipy/issues/8498#issuecomment-370731126. And >> you're quite late to get started on this project - I would suggest your >> time is better spent looking for other opportunities. >> >> Ralf >> >> >> >>> >>> Before sending this I went through the emails archive to avoid asking >>> any redundant question but I was not able to get discussion about this >>> project. Is this project not open or there are already some strong >>> candidates whose discussions I was not able to find? I wanted to ask for a >>> more detailed description about the project compared to what is there in >>> the Ideas page. >>> >>> I had a few doubts like by both Clarkson Woodruff transformation and >>> Johnson?Lindenstrauss lemma we get benefit of dimension reduction so will >>> this project be more focused on dimension reduction techniques or other >>> randomized techniques like subsampled Hadamard Transform Matrix >>> etc. >>> >>> Also if there is a list of desired algorithms to be implemented it would >>> be a great speed booster. 
>>> >>> Thank you, >>> Chaman Agrawal >>> Indian Institute of Technology Kanpur,India >>> >>> >>> _______________________________________________ >>> SciPy-Dev mailing list >>> SciPy-Dev at python.org >>> https://mail.python.org/mailman/listinfo/scipy-dev >>> >>> >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at python.org >> https://mail.python.org/mailman/listinfo/scipy-dev >> >> > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From phillip.m.feldman at gmail.com Sun Mar 25 19:26:24 2018 From: phillip.m.feldman at gmail.com (Phillip Feldman) Date: Sun, 25 Mar 2018 16:26:24 -0700 Subject: [SciPy-Dev] how to multiply an instance of ndarray and an object? In-Reply-To: References: Message-ID: Yes! It works. Thanks so much! On Sat, Mar 24, 2018 at 3:59 PM, Ilhan Polat wrote: > Yes, you can write > > __array_priority__ = 1000 # some number (bigger wins. But against what > you can never be sure) > > or > > __array_ufunc__ = None # NumPy 1.13+ only > > somewhere in your class and NumPy will leave you alone because this > overrides its eagerness to handle the vectorization and ending up on object > data types. > > > > On Sat, Mar 24, 2018 at 10:02 PM, Phillip Feldman < > phillip.m.feldman at gmail.com> wrote: > >> I have a class called `Signal` that handles multiplication by an array on >> the right by overloading `__mul__`. I tried to implement multiplication by >> an array on the left by overloading `__rmul__`, but wasn't able to make >> this work because NumPy thinks that it knows how to handle this, so my >> `__rmul__` method (below) is never called. The result is an array having >> dtype `object`. 
Is there a practical way to modify NumPy to solve this >> problem? >> >> Phillip >> >> def __rmul__(self, c): >> S = self.copy() >> S.phasors *= c >> return S >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at python.org >> https://mail.python.org/mailman/listinfo/scipy-dev >> >> > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Sun Mar 25 20:45:40 2018 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Sun, 25 Mar 2018 17:45:40 -0700 Subject: [SciPy-Dev] SciPy 1.1.0 release schedule Message-ID: Hi all, It's 5 months after the 1.0 release, so it's time to start planning for 1.1. There are still quite a few open issues and PRs marked for 1.1.0 (see https://github.com/scipy/scipy/milestone/34), but not many that seem blocking or really difficult to resolve. So I'd like to propose the following schedule: April 11: branch 1.0.x April 13: rc1 April 27: rc2 (if needed) May 4: final release It would be useful if everyone could add PRs/issues that they think are critical to the 1.1 milestone. Adding yourself as an "assignee" on PRs/issues you plan to tackle would also be helpful. Thoughts? Cheers, Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From andyfaff at gmail.com Sun Mar 25 20:49:43 2018 From: andyfaff at gmail.com (Andrew Nelson) Date: Mon, 26 Mar 2018 00:49:43 +0000 Subject: [SciPy-Dev] SciPy 1.1.0 release schedule In-Reply-To: References: Message-ID: There are several PRs that I started reviewing, but just don't have time at the moment to finish the process. Such as the simulated dual annealing, ratio of uniforms... -------------- next part -------------- An HTML attachment was scrubbed...
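As a footnote to the ndarray-multiplication thread above, here is a self-contained sketch of the pattern Ilhan suggested. The class below is an illustrative stand-in, not Phillip's actual `Signal` implementation:

```python
import numpy as np

class Signal:
    # Opting out of NumPy's ufunc machinery (NumPy 1.13+) makes
    # ndarray.__mul__ return NotImplemented for Signal operands,
    # so Python falls back to Signal.__rmul__ instead of producing
    # an object-dtype array.
    __array_ufunc__ = None

    def __init__(self, phasors):
        self.phasors = np.asarray(phasors, dtype=complex)

    def copy(self):
        return Signal(self.phasors.copy())

    def __mul__(self, c):
        S = self.copy()
        S.phasors *= c
        return S

    # Multiplication by a scalar or array commutes here.
    __rmul__ = __mul__

s = Signal([1 + 1j, 2 - 1j])
left = np.array([2.0, 3.0]) * s   # now calls Signal.__rmul__
print(type(left).__name__)        # Signal
```

The older `__array_priority__` route works too, but as noted it is a relative tie-breaker, while `__array_ufunc__ = None` is an unconditional opt-out.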
URL: From ralf.gommers at gmail.com Sun Mar 25 20:59:26 2018 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Sun, 25 Mar 2018 17:59:26 -0700 Subject: [SciPy-Dev] Thoughts on creating a toolbox for analysis of nonlinear dynamical system analysis In-Reply-To: <212966663.611.1521892702900@wamui-moana.atl.sa.earthlink.net> References: <212966663.611.1521892702900@wamui-moana.atl.sa.earthlink.net> Message-ID: On Sat, Mar 24, 2018 at 4:58 AM, Robert Lucente wrote: > >lack of maintainers > What are the pre-reqs for someone to become a maintainer? > We don't have a fixed checklist or anything, but I'd say: - interest in becoming one - sustained contributions over some period (of good quality). no fixed period or number of PRs, but typical is on the order of 6 months or 10 PRs - communication skills, on Github and/or email (code review, design decisions, etc.) - already doing or interest in doing PR reviews It could well be that you read this and think "I'm ticking all of those boxes". Or conversely thinking "sounds cool, but am I good enough?". In either case, I'm happy to chat about it. Cheers, Ralf > > > -----Original Message----- > From: Ralf Gommers > Sent: Mar 24, 2018 2:03 AM > To: SciPy Developers List > Subject: Re: [SciPy-Dev] Thoughts on creating a toolbox for analysis of > nonlinear dynamical system analysis > > > > On Fri, Mar 23, 2018 at 8:39 AM, Joseph Slater > wrote: > >> >> >> Ralf: There was a recent discussion on the frustrations of lack of SciPy >> contributors- mousai is an example, along with my other projects, some more >> relevant to potential future inclusion in SciPy, some less. >> > > I wouldn't call them frustrations - compared to some years ago we're in > pretty good shape. It's also not lack of contributors (200 open PRs > currently and ~100 contributors per release), it's lack of maintainers. > > Mousai does look like a good start. I'm not a domain expert, but I'm well > aware of the limited options in this area. 
Adding Mousai to > https://scipy.org/topical-software.html may be useful to make it more > discoverable - I'd be happy to merge a PR at https://github.com/scipy/ > scipy.org > > Cheers, > Ralf > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Sun Mar 25 21:02:12 2018 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Sun, 25 Mar 2018 18:02:12 -0700 Subject: [SciPy-Dev] Thoughts on creating a toolbox for analysis of nonlinear dynamical system analysis In-Reply-To: <609D7EEF-1659-46B3-8CA1-86760B28BFB9@gmail.com> References: <609D7EEF-1659-46B3-8CA1-86760B28BFB9@gmail.com> Message-ID: On Sat, Mar 24, 2018 at 10:33 AM, Joseph Slater wrote: > > > > On Mar 24, 2018, at 2:03 AM, Ralf Gommers > wrote: > > > > > > > > On Fri, Mar 23, 2018 at 8:39 AM, Joseph Slater < > joseph.c.slater at gmail.com> wrote: > > > > > > Ralf: There was a recent discussion on the frustrations of lack of SciPy > contributors- mousai is an example, along with my other projects, some more > relevant to potential future inclusion in SciPy, some less. > > > > I wouldn't call them frustrations - compared to some years ago we're in > pretty good shape. It's also not lack of contributors (200 open PRs > currently and ~100 contributors per release), it's lack of maintainers. > > > > Mousai does look like a good start. I'm not a domain expert, but I'm > well aware of the limited options in this area. Adding Mousai to > https://scipy.org/topical-software.html may be useful to make it more > discoverable - I'd be happy to merge a PR at https://github.com/scipy/ > scipy.org > > > I've submitted a pull request. > Thanks! > > There is no category for Mechanical/Civil, Aerospace Engineering. 
May not > yet be time, but it might be worthwhile to have a separate category > "Engineering: other", or maybe a separate page for engineering-focused > tools in general. > > I noticed xarray and pandas in what I thought were odd locations. I would > have put them in a category closer to data frames or database management or > such (pandas is listed in economics, xarray in other). Categorizing is > almost a futile effort, though, given my past experience maintaining the > mactex wiki. > Yeah, it's pretty hard. If something is really weird we can move it, but indeed no point in spending lots of effort on it. Cheers, Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Sun Mar 25 21:31:01 2018 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Sun, 25 Mar 2018 18:31:01 -0700 Subject: [SciPy-Dev] SciPy 1.1.0 release schedule In-Reply-To: References: Message-ID: On Sun, Mar 25, 2018 at 5:49 PM, Andrew Nelson wrote: > There are several PRs that I started reviewing, but just don't have time > at the moment to finish the process. Such as the simulated dual annealing > https://github.com/scipy/scipy/pull/8203 is close to ready I'd say, Antonio already finished reviewing and Jacob did a quick review as well. Would be good to get that PR in. Maybe submit any comments you already had? > , ratio of uniforms... > https://github.com/scipy/scipy/pull/8293 seems to have a bit more review work left, but code looks close to finished. Would be nice to get it in, but less critical I'd say - let's just see how far we get on that one. Ralf -------------- next part -------------- An HTML attachment was scrubbed...
URL: From rlucente at pipeline.com Mon Mar 26 11:41:34 2018 From: rlucente at pipeline.com (Robert Lucente) Date: Mon, 26 Mar 2018 11:41:34 -0400 (GMT-04:00) Subject: [SciPy-Dev] Thoughts on creating a toolbox for analysis of nonlinear dynamical system analysis Message-ID: <237320890.4090.1522078894360@wamui-kristoff.atl.sa.earthlink.net> An HTML attachment was scrubbed... URL: From lagru at mailbox.org Mon Mar 26 14:11:08 2018 From: lagru at mailbox.org (Lars G.) Date: Mon, 26 Mar 2018 20:11:08 +0200 Subject: [SciPy-Dev] Measuring test coverage of a Cython module Message-ID: <28dc611e-7598-e88b-2534-e2a5252b9ca0@mailbox.org> Hi, I'm trying to measure the test coverage of a Cython module. However the module in question (`scipy/signal/peak_finding_utils.pyx`) doesn't show up in the final report. What I have tried so far: 1. Added `plugins = Cython.coverage` to the `.coveragerc` file. 2. Added # cython: linetrace=True # distutils: define_macros=CYTHON_TRACE_NOGIL=1 to the header of `peak_finding_utils.pyx` 3. Build binaries inplace with $ python setup.py build_ext --inplace 4. Measured test coverage a) with `runtests.py` $ python runtests.py -t scipy/signal/tests/test_peak_finding --no-build --coverage b) and using the `coverage` package $ coverage run -m pytest scipy/signal/tests/test_peak_finding.py $ coverage html This approach is based on this blog post: http://blog.behnel.de/posts/coverage-analysis-for-cython-modules.html In both cases a) and b) the PYX file sadly isn't included in the report and I don't know what to try next. Has anyone figured out a way to do this? If I've overlooked a related issue or conversation on the mailing list I apologize and would be grateful if you could point me to it. 
Best regards, Lars From ralf.gommers at gmail.com Tue Mar 27 23:00:10 2018 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Tue, 27 Mar 2018 20:00:10 -0700 Subject: [SciPy-Dev] welcome Matt and Paul to the core team Message-ID: Hi all, On behalf of the SciPy developers I'd like to welcome Matt Haberland and Paul van Mulbregt as members of the core dev team. Matt co-mentored during last year's GSoC, and is the author of the interior-point method that improved optimize.linprog - https://github.com/scipy/scipy/pulls/mdhaber. Paul has been contributing to (mainly) scipy.special for over 6 months - https://github.com/scipy/scipy/pulls/pvanmulbregt. I'm looking forward to their continued contributions! Cheers, Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From andyfaff at gmail.com Tue Mar 27 23:03:48 2018 From: andyfaff at gmail.com (Andrew Nelson) Date: Wed, 28 Mar 2018 03:03:48 +0000 Subject: [SciPy-Dev] welcome Matt and Paul to the core team In-Reply-To: References: Message-ID: Welcome Matt and Paul. -------------- next part -------------- An HTML attachment was scrubbed... URL: From warren.weckesser at gmail.com Tue Mar 27 23:21:12 2018 From: warren.weckesser at gmail.com (Warren Weckesser) Date: Tue, 27 Mar 2018 23:21:12 -0400 Subject: [SciPy-Dev] welcome Matt and Paul to the core team In-Reply-To: References: Message-ID: On Tue, Mar 27, 2018 at 11:00 PM, Ralf Gommers wrote: > Hi all, > > On behalf of the SciPy developers I'd like to welcome Matt Haberland and > Paul van Mulbregt as members of the core dev team. > > Matt co-mentored during last year's GSoC, and is the author of the > interior-point method that improved optimize.linprog - > https://github.com/scipy/scipy/pulls/mdhaber. > > Paul has been contributing to (mainly) scipy.special for over 6 months - > https://github.com/scipy/scipy/pulls/pvanmulbregt. > > Welcome Matt and Paul! Keep up the great work.
Warren > I'm looking forward to their continued contributions! > > Cheers, > Ralf > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Wed Mar 28 00:06:15 2018 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Tue, 27 Mar 2018 21:06:15 -0700 Subject: [SciPy-Dev] Measuring test coverage of a Cython module In-Reply-To: <28dc611e-7598-e88b-2534-e2a5252b9ca0@mailbox.org> References: <28dc611e-7598-e88b-2534-e2a5252b9ca0@mailbox.org> Message-ID: On Mon, Mar 26, 2018 at 11:11 AM, Lars G. wrote: > Hi, > I'm trying to measure the test coverage of a Cython module. However the > module in question (`scipy/signal/peak_finding_utils.pyx`) doesn't show > up in the final report. > > What I have tried so far: > > 1. Added `plugins = Cython.coverage` to the `.coveragerc` file. > > 2. Added > # cython: linetrace=True > # distutils: define_macros=CYTHON_TRACE_NOGIL=1 > to the header of `peak_finding_utils.pyx` > > 3. Build binaries inplace with > $ python setup.py build_ext --inplace > > 4. Measured test coverage > a) with `runtests.py` > $ python runtests.py -t scipy/signal/tests/test_peak_finding --no-build > --coverage > > b) and using the `coverage` package > $ coverage run -m pytest scipy/signal/tests/test_peak_finding.py > $ coverage html > The one step missing here seems to be editing .coveragerc: http://cython.readthedocs.io/en/latest/src/tutorial/profiling_tutorial.html?highlight=coverage#enabling-coverage-analysis > > This approach is based on this blog post: > http://blog.behnel.de/posts/coverage-analysis-for-cython-modules.html > > In both cases a) and b) the PYX file sadly isn't included in the report > and I don't know what to try next. Has anyone figured out a way to do > this? 
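For later readers of this thread: the configuration the Cython tutorial linked above asks for looks roughly like the following. Note the capitalized plugin name `Cython.Coverage` — my reading of the docs is that the lowercase `Cython.coverage` spelling quoted earlier would not be picked up, though I have not confirmed that this is the cause here.

```ini
# .coveragerc -- sketch based on the Cython coverage tutorial
[run]
plugins = Cython.Coverage
```

The two-line header in the `.pyx` file and the rebuild with `build_ext --inplace` would stay exactly as described above.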
If I've overlooked a related issue or conversation on the mailing > list I apologize and would be grateful if you could point me to it. > The only discussion I remember is one on the scikit-image list started by Matthew Brett to get this working; not sure if that came to anything. In general it looks like it should work, but it isn't always reliable (e.g. https://github.com/cython/cython/issues/1985) Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From stefanv at berkeley.edu Wed Mar 28 01:53:35 2018 From: stefanv at berkeley.edu (Stefan van der Walt) Date: Tue, 27 Mar 2018 22:53:35 -0700 Subject: [SciPy-Dev] welcome Matt and Paul to the core team In-Reply-To: References: Message-ID: <1626b2b2f18.27ae.acf34a9c767d7bb498a799333be0433e@fastmail.com> Congratulations Matt & Paul, and a great big welcome to the team! On March 27, 2018 20:00:47 Ralf Gommers wrote: Hi all, On behalf of the SciPy developers I'd like to welcome Matt Haberland and Paul van Mulbregt as members of the core dev team. -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Wed Mar 28 08:57:44 2018 From: charlesr.harris at gmail.com (Charles R Harris) Date: Wed, 28 Mar 2018 06:57:44 -0600 Subject: [SciPy-Dev] welcome Matt and Paul to the core team In-Reply-To: References: Message-ID: On Tue, Mar 27, 2018 at 9:00 PM, Ralf Gommers wrote: > Hi all, > > On behalf of the SciPy developers I'd like to welcome Matt Haberland and > Paul van Mulbregt as members of the core dev team. > > Matt co-mentored during last year's GSoC, and is the author of the > interior-point method that improved optimize.linprog - > https://github.com/scipy/scipy/pulls/mdhaber. > > Paul has been contributing to (mainly) scipy.special for over 6 months - > https://github.com/scipy/scipy/pulls/pvanmulbregt. > > I'm looking forward to their continued contributions! > > Cheers, > Ralf > > Welcome to both.
Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From mikofski at berkeley.edu Wed Mar 28 10:37:54 2018 From: mikofski at berkeley.edu (Mark Alexander Mikofski) Date: Wed, 28 Mar 2018 14:37:54 +0000 Subject: [SciPy-Dev] welcome Matt and Paul to the core team In-Reply-To: References: Message-ID: Welcome Matt and Paul. Thanks for your service! On Wed, Mar 28, 2018, 5:58 AM Charles R Harris wrote: > > > On Tue, Mar 27, 2018 at 9:00 PM, Ralf Gommers > wrote: > >> Hi all, >> >> On behalf of the SciPy developers I'd like to welcome Matt Haberland and >> Paul van Mulbregt as members of the core dev team. >> >> Matt co-mentored during last years' GSoC, and is the author of the >> interior-point method that improved optimize.linprog - >> https://github.com/scipy/scipy/pulls/mdhaber. >> >> Paul has been contributing to (mainly) scipy.special for over 6 months - >> https://github.com/scipy/scipy/pulls/pvanmulbregt. >> >> I'm looking forward to their continued contributions! >> >> Cheers, >> Ralf >> >> > Welcome to both. > > Chuck > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > -------------- next part -------------- An HTML attachment was scrubbed... URL: From tyler.je.reddy at gmail.com Wed Mar 28 15:31:58 2018 From: tyler.je.reddy at gmail.com (Tyler Reddy) Date: Wed, 28 Mar 2018 13:31:58 -0600 Subject: [SciPy-Dev] welcome Matt and Paul to the core team In-Reply-To: References: Message-ID: Congrats Matt & Paul On 28 March 2018 at 08:37, Mark Alexander Mikofski wrote: > Welcome Matt and Paul. Thanks for your service! > > On Wed, Mar 28, 2018, 5:58 AM Charles R Harris > wrote: > >> >> >> On Tue, Mar 27, 2018 at 9:00 PM, Ralf Gommers >> wrote: >> >>> Hi all, >>> >>> On behalf of the SciPy developers I'd like to welcome Matt Haberland >>> and Paul van Mulbregt as members of the core dev team. 
>>> >>> Matt co-mentored during last year's GSoC, and is the author of the >>> interior-point method that improved optimize.linprog - >>> https://github.com/scipy/scipy/pulls/mdhaber. >>> >>> Paul has been contributing to (mainly) scipy.special for over 6 months - >>> https://github.com/scipy/scipy/pulls/pvanmulbregt. >>> >>> I'm looking forward to their continued contributions! >>> >>> Cheers, >>> Ralf >>> >>> >> Welcome to both. >> >> Chuck >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at python.org >> https://mail.python.org/mailman/listinfo/scipy-dev >> > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From lagru at mailbox.org Thu Mar 29 06:02:08 2018 From: lagru at mailbox.org (Lars G.) Date: Thu, 29 Mar 2018 12:02:08 +0200 Subject: [SciPy-Dev] Documentation / tutorial for peak finding in scipy.signal In-Reply-To: <14d2db01-5bf7-88f7-ab2c-4bcc29072c41@mailbox.org> References: <48693513-3b3f-d6b7-9116-85b0eaf834f2@mailbox.org> <13093fec-766d-ba01-26c5-cf57fd5aa2ce@mailbox.org> <14d2db01-5bf7-88f7-ab2c-4bcc29072c41@mailbox.org> Message-ID: <9766d1fb-8992-548b-ebb7-120667bad3b6@mailbox.org> On 18.03.2018 12:23, Lars G. wrote: > On 17.03.2018 20:59, Ralf Gommers wrote: >> If you can get away with a smaller size that would be useful, but I'm >> okay with up to 500 kB. > > That should be possible. I just played around with ECG signals from the > MIT Arrhythmia Database > https://www.physionet.org/physiobank/database/mitdb/ > > The signals are sampled at 360 Hz with an ADC resolution of 11 bits. > If `numpy.savez_compressed` is used to store the array we can get away > with `np.uint16` which translates to ~200 KiB for a 10 min window.
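A quick sanity check of that size estimate — a sketch using a synthetic periodic stand-in signal rather than actual MIT-BIH data, so the compression ratio on a real ECG record will differ:

```python
import io
import numpy as np

fs = 360                       # sampling rate in Hz
n = 10 * 60 * fs               # 216000 samples in a 10-minute window
t = np.arange(n) / fs

# Smooth stand-in for an 11-bit ADC trace, offset into the uint16 range.
raw = (1024 + 900 * np.sin(2 * np.pi * 1.2 * t)).astype(np.uint16)
print(raw.nbytes)              # 432000 bytes (~422 KiB) uncompressed

buf = io.BytesIO()
np.savez_compressed(buf, ecg=raw)
compressed_size = buf.getbuffer().nbytes
print(compressed_size < raw.nbytes)   # True -- deflate helps on smooth data
```

So the raw `uint16` window is ~422 KiB, and the deflate compression inside `savez_compressed` is what brings the figure down toward the ~200 KiB Lars reports for real data.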
> > If we select the right window we could cover areas with different signal > properties (noise level, artifacts, baseline, amplitude, spectrum) which > would make it useful for more varied examples. > > Lars For anyone interested, I proposed an example signal here https://github.com/scipy/scipy/pull/8627 Best regards, Lars From ilhanpolat at gmail.com Thu Mar 29 06:03:09 2018 From: ilhanpolat at gmail.com (Ilhan Polat) Date: Thu, 29 Mar 2018 12:03:09 +0200 Subject: [SciPy-Dev] welcome Matt and Paul to the core team In-Reply-To: References: Message-ID: Welcome to the team and thank you both for your contributions so far! On Wed, Mar 28, 2018 at 2:57 PM, Charles R Harris wrote: > > > On Tue, Mar 27, 2018 at 9:00 PM, Ralf Gommers > wrote: > >> Hi all, >> >> On behalf of the SciPy developers I'd like to welcome Matt Haberland and >> Paul van Mulbregt as members of the core dev team. >> >> Matt co-mentored during last years' GSoC, and is the author of the >> interior-point method that improved optimize.linprog - >> https://github.com/scipy/scipy/pulls/mdhaber. >> >> Paul has been contributing to (mainly) scipy.special for over 6 months - >> https://github.com/scipy/scipy/pulls/pvanmulbregt. >> >> I'm looking forward to their continued contributions! >> >> Cheers, >> Ralf >> >> > Welcome to both. > > Chuck > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From joseph.c.slater at gmail.com Thu Mar 29 14:20:33 2018 From: joseph.c.slater at gmail.com (Joseph Slater) Date: Thu, 29 Mar 2018 14:20:33 -0400 Subject: [SciPy-Dev] Mousai 0.3- general purpose Harmonic Balance solver. Message-ID: <2FEBA1D2-7390-490E-88C8-A3451B1DCE8D@gmail.com> Dear colleagues, I've just pushed 0.3 of Mousai, a general purpose numerical Harmonic Balance solver to pypi. 
There are now two different solution methodologies, one minimizing frequency-domain errors, the other time-domain errors. Test problems and contributors welcome! Best Regards- Joe https://josephcslater.github.io/mousai/ From warren.weckesser at gmail.com Thu Mar 29 15:43:50 2018 From: warren.weckesser at gmail.com (Warren Weckesser) Date: Thu, 29 Mar 2018 15:43:50 -0400 Subject: [SciPy-Dev] New subpackage: scipy.data Message-ID: According to the SciPy roadmap ( https://github.com/scipy/scipy/blob/master/doc/ROADMAP.rst.txt#misc), `scipy.misc` will eventually be removed. Currently, the combinatorial functions and the image-related operations are all deprecated. The only non-deprecated functions in `misc` are `central_diff_weights()`, `derivative()` and the two functions that return image data: `ascent()` and `face()`. As a step towards the deprecation of `misc`, I propose that we create a new package, `scipy.data`, for holding data sets. `ascent()` and `face()` would move there, and the new ECG data set proposed in a current pull request (https://github.com/scipy/scipy/pull/8627) would be put there. An early version of the roadmap suggested moving the images to `scipy.ndimage`, but that is no longer in the text. I think a separate subpackage for data sets makes sense. What do you think? P.S. If there is already a similar proposal in the mailing list or on github, or any other older mailing list discussions related to this, let me know. -------------- next part -------------- An HTML attachment was scrubbed... URL: From larson.eric.d at gmail.com Thu Mar 29 15:48:01 2018 From: larson.eric.d at gmail.com (Eric Larson) Date: Thu, 29 Mar 2018 15:48:01 -0400 Subject: [SciPy-Dev] New subpackage: scipy.data In-Reply-To: References: Message-ID: Sounds like a good plan to me. If others agree, I propose that we make existing (and new) data available in scipy.data for 1.1.0.
Maybe a 2-release deprecation warning cycle before removing them from `scipy.misc` (possibly for removing everything if we can move the other remaining things, too)? Eric On Thu, Mar 29, 2018 at 3:43 PM, Warren Weckesser < warren.weckesser at gmail.com> wrote: > According to the SciPy roadmap (https://github.com/scipy/scip > y/blob/master/doc/ROADMAP.rst.txt#misc), > `scipy.misc` will eventually be removed. Currently, the combinatorial > functions and the image-related operations are all deprecated. The only > non-deprecated functions in `misc` are `central_diff_weights()`, > `derivative()` and the two functions that return image data: `ascent()` and > `face()`. > > As a steps towards the deprecation of `misc`, I propose that we create a > new package, `scipy.data`, for holding data sets. `ascent()` and `face()` > would move there, and the new ECG data set proposed in a current pull > request (https://github.com/scipy/scipy/pull/8627) would be put there. > > An early version of the roadmap suggested moving the images to > `scipy.ndimage`, but that is no longer in the text. I think a separate > subpackage for data sets makes sense. > > What do you think? > > P.S. If there is already a similar proposal in the mailing list or on > github, or any other older mailing list discussions related to this, let me > know. > > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From josef.pktd at gmail.com Thu Mar 29 16:06:02 2018 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 29 Mar 2018 16:06:02 -0400 Subject: [SciPy-Dev] New subpackage: scipy.data In-Reply-To: References: Message-ID: I don't like the name scipy.data and would prefer something more explicit like scipy.datasets. 
My first reaction to the proposal was that we get now a subpackage for functions to work with data. Like ndimage are functions for images signal are functions for signals spatial are functions for neighborhood and data are functions for data (e.g. like pandas or similar) Josef On Thu, Mar 29, 2018 at 3:48 PM, Eric Larson wrote: > Sounds like a good plan to me. > > If others agree, I propose that we make existing (and new) data available in > scipy.data for 1.1.0. Maybe a 2-release deprecation warning cycle before > removing them from `scipy.misc` (possibly for removing everything if we can > move the other remaining things, too)? > > Eric > > > On Thu, Mar 29, 2018 at 3:43 PM, Warren Weckesser > wrote: >> >> According to the SciPy roadmap >> (https://github.com/scipy/scipy/blob/master/doc/ROADMAP.rst.txt#misc), >> `scipy.misc` will eventually be removed. Currently, the combinatorial >> functions and the image-related operations are all deprecated. The only >> non-deprecated functions in `misc` are `central_diff_weights()`, >> `derivative()` and the two functions that return image data: `ascent()` and >> `face()`. >> >> As a steps towards the deprecation of `misc`, I propose that we create a >> new package, `scipy.data`, for holding data sets. `ascent()` and `face()` >> would move there, and the new ECG data set proposed in a current pull >> request (https://github.com/scipy/scipy/pull/8627) would be put there. >> >> An early version of the roadmap suggested moving the images to >> `scipy.ndimage`, but that is no longer in the text. I think a separate >> subpackage for data sets makes sense. >> >> What do you think? >> >> P.S. If there is already a similar proposal in the mailing list or on >> github, or any other older mailing list discussions related to this, let me >> know. 
>> >> >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at python.org >> https://mail.python.org/mailman/listinfo/scipy-dev >> > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > From gael.varoquaux at normalesup.org Thu Mar 29 16:38:48 2018 From: gael.varoquaux at normalesup.org (Gael Varoquaux) Date: Thu, 29 Mar 2018 22:38:48 +0200 Subject: [SciPy-Dev] New subpackage: scipy.data In-Reply-To: References: Message-ID: <20180329203848.GB1453853@phare.normalesup.org> +1 On Thu, Mar 29, 2018 at 04:06:02PM -0400, josef.pktd at gmail.com wrote: > I don't like the name scipy.data and would prefer something more > explicit like scipy.datasets. > My first reaction to the proposal was that we get now a subpackage for > functions to work with data. > Like > ndimage are functions for images > signal are functions for signals > spatial are functions for neighborhood > and > data are functions for data (e.g. like pandas or similar) > Josef > On Thu, Mar 29, 2018 at 3:48 PM, Eric Larson wrote: > > Sounds like a good plan to me. > > If others agree, I propose that we make existing (and new) data available in > > scipy.data for 1.1.0. Maybe a 2-release deprecation warning cycle before > > removing them from `scipy.misc` (possibly for removing everything if we can > > move the other remaining things, too)? > > Eric > > On Thu, Mar 29, 2018 at 3:43 PM, Warren Weckesser > > wrote: > >> According to the SciPy roadmap > >> (https://github.com/scipy/scipy/blob/master/doc/ROADMAP.rst.txt#misc), > >> `scipy.misc` will eventually be removed. Currently, the combinatorial > >> functions and the image-related operations are all deprecated. The only > >> non-deprecated functions in `misc` are `central_diff_weights()`, > >> `derivative()` and the two functions that return image data: `ascent()` and > >> `face()`. 
> >> As a steps towards the deprecation of `misc`, I propose that we create a > >> new package, `scipy.data`, for holding data sets. `ascent()` and `face()` > >> would move there, and the new ECG data set proposed in a current pull > >> request (https://github.com/scipy/scipy/pull/8627) would be put there. > >> An early version of the roadmap suggested moving the images to > >> `scipy.ndimage`, but that is no longer in the text. I think a separate > >> subpackage for data sets makes sense. > >> What do you think? > >> P.S. If there is already a similar proposal in the mailing list or on > >> github, or any other older mailing list discussions related to this, let me > >> know. > >> _______________________________________________ > >> SciPy-Dev mailing list > >> SciPy-Dev at python.org > >> https://mail.python.org/mailman/listinfo/scipy-dev > > _______________________________________________ > > SciPy-Dev mailing list > > SciPy-Dev at python.org > > https://mail.python.org/mailman/listinfo/scipy-dev > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev -- Gael Varoquaux Senior Researcher, INRIA Parietal NeuroSpin/CEA Saclay , Bat 145, 91191 Gif-sur-Yvette France Phone: ++ 33-1-69-08-79-68 http://gael-varoquaux.info http://twitter.com/GaelVaroquaux From bennet at umich.edu Thu Mar 29 17:08:46 2018 From: bennet at umich.edu (Bennet Fauber) Date: Thu, 29 Mar 2018 17:08:46 -0400 Subject: [SciPy-Dev] New subpackage: scipy.data In-Reply-To: <20180329203848.GB1453853@phare.normalesup.org> References: <20180329203848.GB1453853@phare.normalesup.org> Message-ID: I also agree -- if this is where datasets are to be kept and made available, then 'datasets' is probably a better name. That also agrees with the R package called 'datasets', and so might be considered a more 'customary' name. 
On Thu, Mar 29, 2018 at 4:38 PM, Gael Varoquaux wrote: > +1 > > On Thu, Mar 29, 2018 at 04:06:02PM -0400, josef.pktd at gmail.com wrote: >> I don't like the name scipy.data and would prefer something more >> explicit like scipy.datasets. > >> My first reaction to the proposal was that we get now a subpackage for >> functions to work with data. > >> Like >> ndimage are functions for images >> signal are functions for signals >> spatial are functions for neighborhood > >> and >> data are functions for data (e.g. like pandas or similar) > >> Josef > >> On Thu, Mar 29, 2018 at 3:48 PM, Eric Larson wrote: >> > Sounds like a good plan to me. > >> > If others agree, I propose that we make existing (and new) data available in >> > scipy.data for 1.1.0. Maybe a 2-release deprecation warning cycle before >> > removing them from `scipy.misc` (possibly for removing everything if we can >> > move the other remaining things, too)? > >> > Eric > > >> > On Thu, Mar 29, 2018 at 3:43 PM, Warren Weckesser >> > wrote: > >> >> According to the SciPy roadmap >> >> (https://github.com/scipy/scipy/blob/master/doc/ROADMAP.rst.txt#misc), >> >> `scipy.misc` will eventually be removed. Currently, the combinatorial >> >> functions and the image-related operations are all deprecated. The only >> >> non-deprecated functions in `misc` are `central_diff_weights()`, >> >> `derivative()` and the two functions that return image data: `ascent()` and >> >> `face()`. > >> >> As a steps towards the deprecation of `misc`, I propose that we create a >> >> new package, `scipy.data`, for holding data sets. `ascent()` and `face()` >> >> would move there, and the new ECG data set proposed in a current pull >> >> request (https://github.com/scipy/scipy/pull/8627) would be put there. > >> >> An early version of the roadmap suggested moving the images to >> >> `scipy.ndimage`, but that is no longer in the text. I think a separate >> >> subpackage for data sets makes sense. > >> >> What do you think? > >> >> P.S. 
If there is already a similar proposal in the mailing list or on >> >> github, or any other older mailing list discussions related to this, let me >> >> know. > > > >> >> _______________________________________________ >> >> SciPy-Dev mailing list >> >> SciPy-Dev at python.org >> >> https://mail.python.org/mailman/listinfo/scipy-dev > > > >> > _______________________________________________ >> > SciPy-Dev mailing list >> > SciPy-Dev at python.org >> > https://mail.python.org/mailman/listinfo/scipy-dev > >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at python.org >> https://mail.python.org/mailman/listinfo/scipy-dev > > -- > Gael Varoquaux > Senior Researcher, INRIA Parietal > NeuroSpin/CEA Saclay , Bat 145, 91191 Gif-sur-Yvette France > Phone: ++ 33-1-69-08-79-68 > http://gael-varoquaux.info http://twitter.com/GaelVaroquaux > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev From lagru at mailbox.org Thu Mar 29 17:27:47 2018 From: lagru at mailbox.org (Lars G.) Date: Thu, 29 Mar 2018 23:27:47 +0200 Subject: [SciPy-Dev] New subpackage: scipy.data In-Reply-To: References: <20180329203848.GB1453853@phare.normalesup.org> Message-ID: <869adbc5-b25f-c5c2-1e66-f59983b88bc9@mailbox.org> Is there a reason not to include those function where they'll most likely be used? Meaning the images `ascent` and `face` could move to `scipy.ndimage` and the signal `electrocardiogram` to `scipy.signal`? Lars On 29.03.2018 23:08, Bennet Fauber wrote: > I also agree -- if this is where datasets are to be kept and made > available, then 'datasets' is probably a better name. That also > agrees with the R package called 'datasets', and so might be > considered a more 'customary' name. 
> > > On Thu, Mar 29, 2018 at 4:38 PM, Gael Varoquaux > wrote: >> +1 >> >> On Thu, Mar 29, 2018 at 04:06:02PM -0400, josef.pktd at gmail.com wrote: >>> I don't like the name scipy.data and would prefer something more >>> explicit like scipy.datasets. >> >>> My first reaction to the proposal was that we get now a subpackage for >>> functions to work with data. >> >>> Like >>> ndimage are functions for images >>> signal are functions for signals >>> spatial are functions for neighborhood >> >>> and >>> data are functions for data (e.g. like pandas or similar) >> >>> Josef From robert.kern at gmail.com Thu Mar 29 17:34:14 2018 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 29 Mar 2018 14:34:14 -0700 Subject: [SciPy-Dev] New subpackage: scipy.data In-Reply-To: <869adbc5-b25f-c5c2-1e66-f59983b88bc9@mailbox.org> References: <20180329203848.GB1453853@phare.normalesup.org> <869adbc5-b25f-c5c2-1e66-f59983b88bc9@mailbox.org> Message-ID: On Thu, Mar 29, 2018 at 2:27 PM, Lars G. wrote: > > Is there a reason not to include those function where they'll most > likely be used? Meaning the images `ascent` and `face` could move to > `scipy.ndimage` and the signal `electrocardiogram` to `scipy.signal`? It would make them harder to discover, at least for me. On the developer side, if everything is in one subpackage, it would be easier to keep track how many bytes are being consumed by data. The scipy.datasets namespace would be a good place to put any common data-loading code (for instance, if we start adding large datasets that will be downloaded upon first use rather than being distributed in the scipy wheel). -- Robert Kern -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From bennet at umich.edu Thu Mar 29 17:47:13 2018 From: bennet at umich.edu (Bennet Fauber) Date: Thu, 29 Mar 2018 17:47:13 -0400 Subject: [SciPy-Dev] New subpackage: scipy.data In-Reply-To: References: <20180329203848.GB1453853@phare.normalesup.org> <869adbc5-b25f-c5c2-1e66-f59983b88bc9@mailbox.org> Message-ID: Would having a single datasets library increase visibility and potentially encourage the use of one dataset for multiple purposes? If they are roughly indexed, as the ones at the R datasets package site are, that could also be helpful for people who are finding their way to analytic capability via the catalog of examples. Someone looking for electrocardiogram might get led to signal that way, if that matters. On Thu, Mar 29, 2018 at 5:34 PM, Robert Kern wrote: > On Thu, Mar 29, 2018 at 2:27 PM, Lars G. wrote: >> >> Is there a reason not to include those function where they'll most >> likely be used? Meaning the images `ascent` and `face` could move to >> `scipy.ndimage` and the signal `electrocardiogram` to `scipy.signal`? > > It would make them harder to discover, at least for me. > > On the developer side, if everything is in one subpackage, it would be > easier to keep track how many bytes are being consumed by data. The > scipy.datasets namespace would be a good place to put any common > data-loading code (for instance, if we start adding large datasets that will > be downloaded upon first use rather than being distributed in the scipy > wheel).
> > -- > Robert Kern > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > From ilhanpolat at gmail.com Thu Mar 29 18:08:02 2018 From: ilhanpolat at gmail.com (Ilhan Polat) Date: Fri, 30 Mar 2018 00:08:02 +0200 Subject: [SciPy-Dev] New subpackage: scipy.data In-Reply-To: References: <20180329203848.GB1453853@phare.normalesup.org> <869adbc5-b25f-c5c2-1e66-f59983b88bc9@mailbox.org> Message-ID: I agree with Eric: duplicating the existing ones in sp.datasets right away and placing the appropriate deprecation warnings seems like a good way to get rid of it. On Thu, Mar 29, 2018 at 11:47 PM, Bennet Fauber wrote: > Would having a single datasets library increase visibility and > potentially encourage the use of one dataset for multiple purposes? > If they are roughly indexed, as the ones at the R datasets package > site are, that could also be helpful for people who are finding their > way to analytic capability via the catalog of examples. Someone > looking for electorcardiogram might get led to signal that way, if > that matters. > > > > > On Thu, Mar 29, 2018 at 5:34 PM, Robert Kern > wrote: > > On Thu, Mar 29, 2018 at 2:27 PM, Lars G. wrote: > >> > >> Is there a reason not to include those function where they'll most > >> likely be used? Meaning the images `ascent` and `face` could move to > >> `scipy.ndimage` and the signal `electrocardiogram` to `scipy.signal`? > > > > It would make them harder to discover, at least for me. > > > > On the developer side, if everything is in one subpackage, it would be > > easier to keep track how many bytes are being consumed by data. The > > scipy.datasets namespace would be a good place to put any common > > data-loading code (for instance, if we start adding large datasets that > will > > be downloaded upon first use rather than being distributed in the scipy > > wheel). 
> > > > -- > > Robert Kern > > > > _______________________________________________ > > SciPy-Dev mailing list > > SciPy-Dev at python.org > > https://mail.python.org/mailman/listinfo/scipy-dev > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > -------------- next part -------------- An HTML attachment was scrubbed... URL: From warren.weckesser at gmail.com Thu Mar 29 18:17:06 2018 From: warren.weckesser at gmail.com (Warren Weckesser) Date: Thu, 29 Mar 2018 18:17:06 -0400 Subject: [SciPy-Dev] New subpackage: scipy.data In-Reply-To: References: Message-ID: On Thu, Mar 29, 2018 at 4:06 PM, wrote: > I don't like the name scipy.data and would prefer something more > explicit like scipy.datasets. > > `datasets` is fine with me, and based on other responses so far, other folks have either implicitly or explicitly endorsed it, so lets go with it. Warren > My first reaction to the proposal was that we get now a subpackage for > functions to work with data. > > Like > ndimage are functions for images > signal are functions for signals > spatial are functions for neighborhood > > and > data are functions for data (e.g. like pandas or similar) > > Josef > > On Thu, Mar 29, 2018 at 3:48 PM, Eric Larson > wrote: > > Sounds like a good plan to me. > > > > If others agree, I propose that we make existing (and new) data > available in > > scipy.data for 1.1.0. Maybe a 2-release deprecation warning cycle before > > removing them from `scipy.misc` (possibly for removing everything if we > can > > move the other remaining things, too)? > > > > Eric > > > > > > On Thu, Mar 29, 2018 at 3:43 PM, Warren Weckesser > > wrote: > >> > >> According to the SciPy roadmap > >> (https://github.com/scipy/scipy/blob/master/doc/ROADMAP.rst.txt#misc), > >> `scipy.misc` will eventually be removed. Currently, the combinatorial > >> functions and the image-related operations are all deprecated. 
The only > >> non-deprecated functions in `misc` are `central_diff_weights()`, > >> `derivative()` and the two functions that return image data: `ascent()` > and > >> `face()`. > >> > >> As a steps towards the deprecation of `misc`, I propose that we create a > >> new package, `scipy.data`, for holding data sets. `ascent()` and > `face()` > >> would move there, and the new ECG data set proposed in a current pull > >> request (https://github.com/scipy/scipy/pull/8627) would be put there. > >> > >> An early version of the roadmap suggested moving the images to > >> `scipy.ndimage`, but that is no longer in the text. I think a separate > >> subpackage for data sets makes sense. > >> > >> What do you think? > >> > >> P.S. If there is already a similar proposal in the mailing list or on > >> github, or any other older mailing list discussions related to this, > let me > >> know. > >> > >> > >> > >> _______________________________________________ > >> SciPy-Dev mailing list > >> SciPy-Dev at python.org > >> https://mail.python.org/mailman/listinfo/scipy-dev > >> > > > > > > _______________________________________________ > > SciPy-Dev mailing list > > SciPy-Dev at python.org > > https://mail.python.org/mailman/listinfo/scipy-dev > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > -------------- next part -------------- An HTML attachment was scrubbed... URL: From stefanv at berkeley.edu Thu Mar 29 18:45:35 2018 From: stefanv at berkeley.edu (Stefan van der Walt) Date: Thu, 29 Mar 2018 15:45:35 -0700 Subject: [SciPy-Dev] New subpackage: scipy.data In-Reply-To: References: Message-ID: <20180329224535.nyn575wvqfcpg7qu@carbo> On Thu, 29 Mar 2018 15:43:50 -0400, Warren Weckesser wrote: > As a steps towards the deprecation of `misc`, I propose that we create a > new package, `scipy.data`, for holding data sets. 
`ascent()` and `face()` > would move there, and the new ECG data set proposed in a current pull > request (https://github.com/scipy/scipy/pull/8627) would be put there. We've been doing this in scikit-image for a long time, and now regret having any binary data in the repository; we are working on a way of hosting it outside instead. Can we standardize on downloader tools? There are examples in scikit-learn, dipy, and many other packages. We were thinking of a very lightweight spec + tools for solving this problem a while ago, but never got very far: https://github.com/data-pack/data-pack/pull/1/files Best regards St?fan From einstein.edison at gmail.com Thu Mar 29 18:50:31 2018 From: einstein.edison at gmail.com (Hameer Abbasi) Date: Thu, 29 Mar 2018 15:50:31 -0700 Subject: [SciPy-Dev] New subpackage: scipy.data In-Reply-To: <20180329224535.nyn575wvqfcpg7qu@carbo> References: <20180329224535.nyn575wvqfcpg7qu@carbo> Message-ID: I concur with Stefan. Not having datasets in a package seems like the best way to go. There should be a separate go-to place for datasets (other than minimal ones for test cases). I would recommend branching off all datasets... Otherwise we add to Scipy's already significant size. On 30/03/2018 at 00:45, Stefan wrote: On Thu, 29 Mar 2018 15:43:50 -0400, Warren Weckesser wrote: As a steps towards the deprecation of `misc`, I propose that we create a new package, `scipy.data`, for holding data sets. `ascent()` and `face()` would move there, and the new ECG data set proposed in a current pull request (https://github.com/scipy/scipy/pull/8627) would be put there. We've been doing this in scikit-image for a long time, and now regret having any binary data in the repository; we are working on a way of hosting it outside instead. Can we standardize on downloader tools? There are examples in scikit-learn, dipy, and many other packages. 
We were thinking of a very lightweight spec + tools for solving this problem a while ago, but never got very far: https://github.com/data-pack/data-pack/pull/1/files Best regards St?fan _______________________________________________ SciPy-Dev mailing list SciPy-Dev at python.org https://mail.python.org/mailman/listinfo/scipy-dev -------------- next part -------------- An HTML attachment was scrubbed... URL: From warren.weckesser at gmail.com Thu Mar 29 18:54:52 2018 From: warren.weckesser at gmail.com (Warren Weckesser) Date: Thu, 29 Mar 2018 18:54:52 -0400 Subject: [SciPy-Dev] New subpackage: scipy.data In-Reply-To: <20180329224535.nyn575wvqfcpg7qu@carbo> References: <20180329224535.nyn575wvqfcpg7qu@carbo> Message-ID: On Thu, Mar 29, 2018 at 6:45 PM, Stefan van der Walt wrote: > On Thu, 29 Mar 2018 15:43:50 -0400, Warren Weckesser wrote: > > As a steps towards the deprecation of `misc`, I propose that we create a > > new package, `scipy.data`, for holding data sets. `ascent()` and > `face()` > > would move there, and the new ECG data set proposed in a current pull > > request (https://github.com/scipy/scipy/pull/8627) would be put there. > > We've been doing this in scikit-image for a long time, and now regret > having any binary data in the repository; Can you summarize the problems that make you regret including the data? Warren > we are working on a way of > hosting it outside instead. > > Can we standardize on downloader tools? There are examples in > scikit-learn, dipy, and many other packages. We were thinking of a very > lightweight spec + tools for solving this problem a while ago, but never > got very far: > > https://github.com/data-pack/data-pack/pull/1/files > > Best regards > St?fan > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > -------------- next part -------------- An HTML attachment was scrubbed... 
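[Editor's note: the "standardize on downloader tools" idea Stefan raises above reduces, at its core, to fetching a file into a per-user cache once and verifying a checksum before handing back the path. A minimal sketch follows; the cache location, URL scheme, and helper name are illustrative assumptions, not an existing SciPy or data-pack API.]

```python
import hashlib
import os
import urllib.request

# Hypothetical per-user cache directory (not an actual SciPy convention).
CACHE_DIR = os.path.join(os.path.expanduser("~"), ".cache", "scipy-data")


def fetch(url, sha256, cache_dir=CACHE_DIR):
    """Download ``url`` into ``cache_dir`` once and verify its SHA-256.

    Returns the local path on success; removes the cached copy and
    raises OSError if the digest does not match.
    """
    os.makedirs(cache_dir, exist_ok=True)
    # Naive cache key: the last component of the URL.
    path = os.path.join(cache_dir, os.path.basename(url))
    if not os.path.exists(path):
        urllib.request.urlretrieve(url, path)
    with open(path, "rb") as f:
        digest = hashlib.sha256(f.read()).hexdigest()
    if digest != sha256:
        os.remove(path)
        raise OSError("checksum mismatch for %s" % url)
    return path
```

Pinning a hash per dataset is what makes externally hosted data reproducible: a silently changed upstream file fails loudly instead of corrupting examples or benchmarks.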
URL: From stefanv at berkeley.edu Thu Mar 29 19:16:17 2018 From: stefanv at berkeley.edu (Stefan van der Walt) Date: Thu, 29 Mar 2018 16:16:17 -0700 Subject: [SciPy-Dev] New subpackage: scipy.data In-Reply-To: References: <20180329224535.nyn575wvqfcpg7qu@carbo> Message-ID: <20180329231617.cdyxvf6ysip4fth3@carbo> On Thu, 29 Mar 2018 18:54:52 -0400, Warren Weckesser wrote: > Can you summarize the problems that make you regret including the > data? - The size of the repository (extra time on each clone, and that for data that isn't necessary in most use cases) - Artificial limit on data sizes: we now have a default place to store data, but we still need an additional mechanism for larger datasets. How do you choose the threshold for what goes in, what is too big? - Because these tiny embedded datasets are easily available, they become the default for demos. If data is stored externally, realistic examples become more feasible and likely. Best regards St?fan From joseph.c.slater at gmail.com Thu Mar 29 19:17:56 2018 From: joseph.c.slater at gmail.com (Joseph Slater) Date: Thu, 29 Mar 2018 19:17:56 -0400 Subject: [SciPy-Dev] Thoughts on creating a toolbox for analysis of nonlinear dynamical system analysis In-Reply-To: <237320890.4090.1522078894360@wamui-kristoff.atl.sa.earthlink.net> References: <237320890.4090.1522078894360@wamui-kristoff.atl.sa.earthlink.net> Message-ID: > On Mar 26, 2018, at 11:41 AM, Robert Lucente wrote: > > >In either case, I'm happy to chat about it > Thanks for the offer but I would be wasting your time. > > I am a newbie to Python and Git. > There are plenty of ways to help. Just testing documents, working to fix them, for my stuff: I need examples run in jupyter notebooks for the manuals. Being new brings valuable insight as to how new users can get lost. 
Best Regards- Joe From ilhanpolat at gmail.com Thu Mar 29 19:54:10 2018 From: ilhanpolat at gmail.com (Ilhan Polat) Date: Fri, 30 Mar 2018 01:54:10 +0200 Subject: [SciPy-Dev] New subpackage: scipy.data In-Reply-To: <20180329231617.cdyxvf6ysip4fth3@carbo> References: <20180329224535.nyn575wvqfcpg7qu@carbo> <20180329231617.cdyxvf6ysip4fth3@carbo> Message-ID: Would a separate repo scipy-datasets help ? Then something like try: importing except: warn('I'm off to interwebz') download from the repo might be feasible. The download part can either be that particular dataset or the whole scipy-datasets clone. On Fri, Mar 30, 2018 at 1:16 AM, Stefan van der Walt wrote: > On Thu, 29 Mar 2018 18:54:52 -0400, Warren Weckesser wrote: > > Can you summarize the problems that make you regret including the > > data? > > - The size of the repository (extra time on each clone, and that for > data that isn't necessary in most use cases) > > - Artificial limit on data sizes: we now have a default place to store > data, but we still need an additional mechanism for larger datasets. > How do you choose the threshold for what goes in, what is too big? > > - Because these tiny embedded datasets are easily available, they become > the default for demos. If data is stored externally, realistic > examples become more feasible and likely. > > Best regards > St?fan > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > -------------- next part -------------- An HTML attachment was scrubbed... 
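[Editor's note: made concrete, the try/except flow Ilhan sketches above might look roughly like this. The `scipy-datasets` repository, its URL, and the helper name are hypothetical; this only illustrates the "use the cached copy, else warn and download" control flow for a single dataset file.]

```python
import os
import urllib.request
import warnings

# Hypothetical locations -- a scipy-datasets repository at this URL is
# an assumption of this sketch, not an existing project.
DATA_DIR = os.path.join(os.path.expanduser("~"), ".scipy-datasets")
BASE_URL = "https://raw.githubusercontent.com/scipy/scipy-datasets/master/"


def load(name, data_dir=DATA_DIR, base_url=BASE_URL):
    """Return the local path of dataset ``name``, downloading it if absent."""
    path = os.path.join(data_dir, name)
    try:
        # "try: importing" -- succeed immediately if the file is cached.
        open(path, "rb").close()
    except OSError:
        # "except: warn(...) and download from the repo"
        warnings.warn("dataset %r not cached; downloading" % name)
        os.makedirs(data_dir, exist_ok=True)
        urllib.request.urlretrieve(base_url + name, path)
    return path
```

Downloading per file, as here, avoids cloning the whole hypothetical scipy-datasets repository when a user only wants one example signal.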
URL: From josef.pktd at gmail.com Thu Mar 29 20:03:20 2018 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 29 Mar 2018 20:03:20 -0400 Subject: [SciPy-Dev] New subpackage: scipy.data In-Reply-To: References: <20180329224535.nyn575wvqfcpg7qu@carbo> <20180329231617.cdyxvf6ysip4fth3@carbo> Message-ID: On Thu, Mar 29, 2018 at 7:54 PM, Ilhan Polat wrote: > Would a separate repo scipy-datasets help ? Then something like > > try: > importing > except: > warn('I'm off to interwebz') > download from the repo > > might be feasible. The download part can either be that particular dataset > or the whole scipy-datasets clone. > IMO: It depends on the scale where this should go. I don't think it's worth it (maintaining and installing another package or repo) for scipy given that scipy is mostly a basic numerical library and not driven by specific applications. For most areas there should be already some online repos or packages and it would be enough to have the accessing functions in scipy.datasets. The only area that I can think of where there might not be some readily available online source for datasets is signal. Josef > > > > On Fri, Mar 30, 2018 at 1:16 AM, Stefan van der Walt > wrote: >> >> On Thu, 29 Mar 2018 18:54:52 -0400, Warren Weckesser wrote: >> > Can you summarize the problems that make you regret including the >> > data? >> >> - The size of the repository (extra time on each clone, and that for >> data that isn't necessary in most use cases) >> >> - Artificial limit on data sizes: we now have a default place to store >> data, but we still need an additional mechanism for larger datasets. >> How do you choose the threshold for what goes in, what is too big? >> >> - Because these tiny embedded datasets are easily available, they become >> the default for demos. If data is stored externally, realistic >> examples become more feasible and likely. 
>> >> Best regards >> St?fan >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at python.org >> https://mail.python.org/mailman/listinfo/scipy-dev > > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > From josef.pktd at gmail.com Thu Mar 29 19:44:18 2018 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 29 Mar 2018 19:44:18 -0400 Subject: [SciPy-Dev] New subpackage: scipy.data In-Reply-To: <20180329231617.cdyxvf6ysip4fth3@carbo> References: <20180329224535.nyn575wvqfcpg7qu@carbo> <20180329231617.cdyxvf6ysip4fth3@carbo> Message-ID: On Thu, Mar 29, 2018 at 7:16 PM, Stefan van der Walt wrote: > On Thu, 29 Mar 2018 18:54:52 -0400, Warren Weckesser wrote: >> Can you summarize the problems that make you regret including the >> data? > > - The size of the repository (extra time on each clone, and that for > data that isn't necessary in most use cases) > > - Artificial limit on data sizes: we now have a default place to store > data, but we still need an additional mechanism for larger datasets. > How do you choose the threshold for what goes in, what is too big? > > - Because these tiny embedded datasets are easily available, they become > the default for demos. If data is stored externally, realistic > examples become more feasible and likely. In statsmodels we included datasets from the beginning both for unit tests and for examples. By today's standard these are almost all tiny datasets. The advantage is that many of them are old textbook dataset that often illustrate a problem that we can run into, while clean random generated data is often boring. Unit test don't have access to the internet on Debian, so there is still the restriction of either using internal data or random data. 
For notebook we rely now often on downloading from `rdatasets`, or even having the user download a zip file if the license situation is not clear, e.g. downloading from the supplementary material to books. About tools for downloading datasets: We have a helper function to download from rdatasets and a helper function to download Stata files from the internet. Essentially all other datasets are handled by pandas. It's a simpler case for statsmodels because all datasets essentially correspond to a csv file that might be stored in another format. Josef > > Best regards > St?fan > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev From ilhanpolat at gmail.com Thu Mar 29 20:10:10 2018 From: ilhanpolat at gmail.com (Ilhan Polat) Date: Fri, 30 Mar 2018 02:10:10 +0200 Subject: [SciPy-Dev] New subpackage: scipy.data In-Reply-To: References: <20180329224535.nyn575wvqfcpg7qu@carbo> <20180329231617.cdyxvf6ysip4fth3@carbo> Message-ID: Yes, that's true but GitHub seems like a robust place to live. Otherwise we can just point to any hardcoded URL. But if the size gets bigger in terms of wheels and cloning I think within SciPy doesn't seem to be a viable option. These all depend on what the future of datasets would be. On Fri, Mar 30, 2018 at 2:03 AM, wrote: > On Thu, Mar 29, 2018 at 7:54 PM, Ilhan Polat wrote: > > Would a separate repo scipy-datasets help ? Then something like > > > > try: > > importing > > except: > > warn('I'm off to interwebz') > > download from the repo > > > > might be feasible. The download part can either be that particular > dataset > > or the whole scipy-datasets clone. > > > > IMO: > > It depends on the scale where this should go. > I don't think it's worth it (maintaining and installing another > package or repo) for scipy > given that scipy is mostly a basic numerical library and not driven by > specific > applications. 
> > For most areas there should be already some online repos or packages and > it would be enough to have the accessing functions in scipy.datasets. > The only area that I can think of where there might not be some readily > available online source for datasets is signal. > > Josef > > > > > > > > > > On Fri, Mar 30, 2018 at 1:16 AM, Stefan van der Walt < > stefanv at berkeley.edu> > > wrote: > >> > >> On Thu, 29 Mar 2018 18:54:52 -0400, Warren Weckesser wrote: > >> > Can you summarize the problems that make you regret including the > >> > data? > >> > >> - The size of the repository (extra time on each clone, and that for > >> data that isn't necessary in most use cases) > >> > >> - Artificial limit on data sizes: we now have a default place to store > >> data, but we still need an additional mechanism for larger datasets. > >> How do you choose the threshold for what goes in, what is too big? > >> > >> - Because these tiny embedded datasets are easily available, they become > >> the default for demos. If data is stored externally, realistic > >> examples become more feasible and likely. > >> > >> Best regards > >> St?fan > >> _______________________________________________ > >> SciPy-Dev mailing list > >> SciPy-Dev at python.org > >> https://mail.python.org/mailman/listinfo/scipy-dev > > > > > > > > _______________________________________________ > > SciPy-Dev mailing list > > SciPy-Dev at python.org > > https://mail.python.org/mailman/listinfo/scipy-dev > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From josef.pktd at gmail.com Thu Mar 29 20:52:34 2018 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 29 Mar 2018 20:52:34 -0400 Subject: [SciPy-Dev] New subpackage: scipy.data In-Reply-To: References: <20180329224535.nyn575wvqfcpg7qu@carbo> <20180329231617.cdyxvf6ysip4fth3@carbo> Message-ID: I also think that at most small datasets should be included in scipy directly. But I think that for online storage scipy would be better off following some other packages. Stefan mentions some attempts to get to a common format. AFAIK without being up to date, both scikit-learn and scikit-image use access to larger datasets. For example, a dataset package also runs into the problem how much to include. I wouldn't install a dataset package with a few gigabyte of data if I'm only interested in a tiny fraction for the examples that are relevant to me. (I'm not into analyzing images, movies or BIG DATA.) Josef On Thu, Mar 29, 2018 at 8:10 PM, Ilhan Polat wrote: > Yes, that's true but GitHub seems like a robust place to live. Otherwise we > can just point to any hardcoded URL. But if the size gets bigger in terms of > wheels and cloning I think within SciPy doesn't seem to be a viable option. > These all depend on what the future of datasets would be. > > On Fri, Mar 30, 2018 at 2:03 AM, wrote: >> >> On Thu, Mar 29, 2018 at 7:54 PM, Ilhan Polat wrote: >> > Would a separate repo scipy-datasets help ? Then something like >> > >> > try: >> > importing >> > except: >> > warn('I'm off to interwebz') >> > download from the repo >> > >> > might be feasible. The download part can either be that particular >> > dataset >> > or the whole scipy-datasets clone. >> > >> >> IMO: >> >> It depends on the scale where this should go. >> I don't think it's worth it (maintaining and installing another >> package or repo) for scipy >> given that scipy is mostly a basic numerical library and not driven by >> specific >> applications. 
>> >> For most areas there should be already some online repos or packages and >> it would be enough to have the accessing functions in scipy.datasets. >> The only area that I can think of where there might not be some readily >> available online source for datasets is signal. >> >> Josef >> >> >> > >> > >> > >> > On Fri, Mar 30, 2018 at 1:16 AM, Stefan van der Walt >> > >> > wrote: >> >> >> >> On Thu, 29 Mar 2018 18:54:52 -0400, Warren Weckesser wrote: >> >> > Can you summarize the problems that make you regret including the >> >> > data? >> >> >> >> - The size of the repository (extra time on each clone, and that for >> >> data that isn't necessary in most use cases) >> >> >> >> - Artificial limit on data sizes: we now have a default place to store >> >> data, but we still need an additional mechanism for larger datasets. >> >> How do you choose the threshold for what goes in, what is too big? >> >> >> >> - Because these tiny embedded datasets are easily available, they >> >> become >> >> the default for demos. If data is stored externally, realistic >> >> examples become more feasible and likely. 
>> >> Best regards >> Stéfan >> _______________________________________________ >> >> SciPy-Dev mailing list >> >> SciPy-Dev at python.org >> >> https://mail.python.org/mailman/listinfo/scipy-dev >> > >> > >> > >> > _______________________________________________ >> > SciPy-Dev mailing list >> > SciPy-Dev at python.org >> > https://mail.python.org/mailman/listinfo/scipy-dev >> > >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at python.org >> https://mail.python.org/mailman/listinfo/scipy-dev > > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > From sievert.scott at gmail.com Thu Mar 29 21:00:54 2018 From: sievert.scott at gmail.com (Scott Sievert) Date: Thu, 29 Mar 2018 18:00:54 -0700 Subject: [SciPy-Dev] New subpackage: scipy.data In-Reply-To: References: <20180329224535.nyn575wvqfcpg7qu@carbo> <20180329231617.cdyxvf6ysip4fth3@carbo> Message-ID: Including some datasets would also help the scipy benchmarks be more realistic. Right now the benchmarks use synthetic data (at least the signal benchmarks do). Scott On March 29, 2018 at 7:17:28 PM, Ilhan Polat (ilhanpolat at gmail.com) wrote: Yes, that's true but GitHub seems like a robust place to live. Otherwise we can just point to any hardcoded URL. But if the size gets bigger in terms of wheels and cloning I think within SciPy doesn't seem to be a viable option. These all depend on what the future of datasets would be. On Fri, Mar 30, 2018 at 2:03 AM, wrote: > On Thu, Mar 29, 2018 at 7:54 PM, Ilhan Polat wrote: > > Would a separate repo scipy-datasets help ? Then something like > > > > try: > > importing > > except: > > warn('I'm off to interwebz') > > download from the repo > > > > might be feasible. The download part can either be that particular > dataset > > or the whole scipy-datasets clone.
> > > > IMO: > > It depends on the scale where this should go. > I don't think it's worth it (maintaining and installing another > package or repo) for scipy > given that scipy is mostly a basic numerical library and not driven by > specific > applications. > > For most areas there should be already some online repos or packages and > it would be enough to have the accessing functions in scipy.datasets. > The only area that I can think of where there might not be some readily > available online source for datasets is signal. > > Josef > > > > > > > > > > On Fri, Mar 30, 2018 at 1:16 AM, Stefan van der Walt < > stefanv at berkeley.edu> > > wrote: > >> > >> On Thu, 29 Mar 2018 18:54:52 -0400, Warren Weckesser wrote: > >> > Can you summarize the problems that make you regret including the > >> > data? > >> > >> - The size of the repository (extra time on each clone, and that for > >> data that isn't necessary in most use cases) > >> > >> - Artificial limit on data sizes: we now have a default place to store > >> data, but we still need an additional mechanism for larger datasets. > >> How do you choose the threshold for what goes in, what is too big? > >> > >> - Because these tiny embedded datasets are easily available, they become > >> the default for demos. If data is stored externally, realistic > >> examples become more feasible and likely. 
> >> > >> Best regards > >> Stéfan > >> _______________________________________________ > >> SciPy-Dev mailing list > >> SciPy-Dev at python.org > >> https://mail.python.org/mailman/listinfo/scipy-dev From larson.eric.d at gmail.com Fri Mar 30 09:54:29 2018 From: larson.eric.d at gmail.com (Eric Larson) Date: Fri, 30 Mar 2018 09:54:29 -0400 Subject: [SciPy-Dev] New subpackage: scipy.data In-Reply-To: References: <20180329224535.nyn575wvqfcpg7qu@carbo> <20180329231617.cdyxvf6ysip4fth3@carbo> Message-ID: > It depends on the scale where this should go. In this particular case ("scipy.signal currently has no useful realistic signals"), if we add the proposed ~100 kB data file, I suspect that we can greatly enhance a large number of our scipy.signal examples. An ECG signal won't be perfect for all of them, but in many cases it will be a lot better and more instructive for users than what we can currently synthesize ourselves (while keeping synthesis sufficiently simple at least). Compared to a general dataset-fetching utility, the in-repo approach has clear disadvantages in terms of being incomplete and adding to repo size. Its advantages are in terms of simplifying doc building, access, maintenance, and uniformity of functionality (benchmarks, Debian unit tests, doc building, etc.). On the balance, this makes it worth having IMO. > For example, a dataset package also runs into the problem how much to > include.
A proposed rule of thumb: SciPy can have (up to) a couple of small-sized files per module shipped with the repo in cases where such files greatly improve our ability to showcase/test/document functionality (benchmarks/unit tests/docstrings). This forces us to make subjective judgments about what will be sufficiently useful, sufficiently small, and sufficiently impactful for the module, but I think this will be a rare enough phenomenon that it's okay. In other words, I propose that scipy.datasets not provide an *exhaustive* or even *extensive *resource of data for users, but rather a *minimal* one for showcasing functionality. This seems consistent with what we already do with ascent/face, in that they improve the image-processing examples. We've been doing this in scikit-image for a long time, and now regret > having any binary data in the repository I have had a similar problem while maintaining MNE-Python, which has some files in the repo and others in a GitHub repo (downloaded separately for testing). I have a similar feeling about the files that live in the repo today. However, for SciPy the problem seems a bit different in scope and scale -- a handful of small files can go a long way for SciPy, which isn't the case for MNE (and I would assume also many functions in scikit-image). both scikit-learn and scikit-image use access to larger datasets. There are other projects that also do this (MNE has huge ones hosted on osf.io, VisPy hosts data on GitHub). It would be awesome if someone unified all this stuff for cases where you want to deal with getting large datasets, or many different datasets. My 2c, Eric -------------- next part -------------- An HTML attachment was scrubbed... 
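[Editor's note: the kind of in-repo loader Eric describes can stay tiny. The sketch below shows the shape such a function might take — the file name `ecg.npz`, the `ecg` array key, and the `data_dir` parameter are invented for illustration; in an actual scipy submodule the directory would default to where the module itself lives.]

```python
import os

import numpy as np

def electrocardiogram(data_dir):
    """Return a small packaged ECG trace as a float array.

    ``data_dir`` stands in for the subpackage's own directory; the
    file name and array key are placeholders for this sketch.
    """
    with np.load(os.path.join(data_dir, "ecg.npz")) as data:
        return data["ecg"].astype(float)
```

A compressed ``.npz`` keeps the shipped file small while loading into a plain ndarray, which is all the docstring examples need.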
URL: From josef.pktd at gmail.com Fri Mar 30 10:29:48 2018 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Fri, 30 Mar 2018 10:29:48 -0400 Subject: [SciPy-Dev] New subpackage: scipy.data In-Reply-To: References: <20180329224535.nyn575wvqfcpg7qu@carbo> <20180329231617.cdyxvf6ysip4fth3@carbo> Message-ID: On Fri, Mar 30, 2018 at 9:54 AM, Eric Larson wrote: >>> It depends on the scale where this should go. > > In this particular case ("scipy.signal currently has no useful realistic > signals"), if we add the proposed ~100 kB data file, I suspect that we can > greatly enhance a large number of our scipy.signal examples. An ECG signal > won't be perfect for all of them, but in many cases it will be a lot better > and more instructive for users than what we can currently synthesize > ourselves (while keeping synthesis sufficiently simple at least). > > Compared to a general dataset-fetching utility, the in-repo approach has > clear disadvantages in terms of being incomplete and adding to repo size. > Its advantages are in terms of simplifying doc building, access, > maintenance, uniformity of functionality (benchmarks, Debian unit tests, doc > building, etc.). On the balance, this makes it worth having IMO. > >> For example, a dataset package also runs into the problem how much to >> include. > > > A proposed rule of thumb: SciPy can have (up to) a couple of small-sized > files per module shipped with the repo in cases where such files greatly > improve our ability to showcase/test/document functionality (benchmarks/unit > tests/docstrings). This forces us to make subjective judgments about what > will be sufficiently useful, sufficiently small, and sufficiently impactful > for the module, but I think this will be a rare enough phenomenon that it's > okay. > > In other words, I propose that scipy.datasets not provide an exhaustive or > even extensive resource of data for users, but rather a minimal one for > showcasing functionality. 
This seems consistent with what we already do with > ascent/face, in that they improve the image-processing examples. > >> We've been doing this in scikit-image for a long time, and now regret >> having any binary data in the repository > > > I have had a similar problem while maintaining MNE-Python, which has some > files in the repo and others in a GitHub repo (downloaded separately for > testing). I have a similar feeling about the files that live in the repo > today. However, for SciPy the problem seems a bit different in scope and > scale -- a handful of small files can go a long way for SciPy, which isn't > the case for MNE (and I would assume also many functions in scikit-image). > >> both scikit-learn and scikit-image use access to larger datasets. > > > There are other projects that also do this (MNE has huge ones hosted on > osf.io, VisPy hosts data on GitHub). It would be awesome if someone unified > all this stuff for cases where you want to deal with getting large datasets, > or many different datasets. just to say: I agree with all of this,and think it is a very good summary of the issues Josef > > My 2c, > Eric > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > From lagru at mailbox.org Fri Mar 30 11:50:15 2018 From: lagru at mailbox.org (Lars G.) Date: Fri, 30 Mar 2018 17:50:15 +0200 Subject: [SciPy-Dev] Measuring test coverage of a Cython module In-Reply-To: References: <28dc611e-7598-e88b-2534-e2a5252b9ca0@mailbox.org> Message-ID: On 28.03.2018 06:06, Ralf Gommers wrote: > On Mon, Mar 26, 2018 at 11:11 AM, Lars G. > wrote: > > Hi, > I'm trying to measure the test coverage of a Cython module. However the > module in question (`scipy/signal/peak_finding_utils.pyx`) doesn't show > up in the final report. > > What I have tried so far: > > 1. Added `plugins = Cython.coverage` to the `.coveragerc` file. > > 2. 
Added > # cython: linetrace=True > # distutils: define_macros=CYTHON_TRACE_NOGIL=1 > to the header of `peak_finding_utils.pyx` > > 3. Build binaries inplace with > $ python setup.py build_ext --inplace > > 4. Measured test coverage > a) with `runtests.py` > $ python runtests.py -t scipy/signal/tests/test_peak_finding --no-build > --coverage > > b) and using the `coverage` package > $ coverage run -m pytest scipy/signal/tests/test_peak_finding.py > $ coverage html > > > The one step missing here seems to be editing .coveragerc: > http://cython.readthedocs.io/en/latest/src/tutorial/profiling_tutorial.html?highlight=coverage#enabling-coverage-analysis Actually I did that with step 1. > The only discussion I remember is one on the scikit-image list started > by Matthew Brett to get this working; not sure if that came to anything. > In general it looks like it should work, but it isn't always reliable > (e.g. https://github.com/cython/cython/issues/1985) Thanks for the link and your time. I'll look into it and report back if I figure it out. Best regards, Lars From pav at iki.fi Fri Mar 30 12:06:18 2018 From: pav at iki.fi (Pauli Virtanen) Date: Fri, 30 Mar 2018 18:06:18 +0200 Subject: [SciPy-Dev] New subpackage: scipy.data In-Reply-To: References: Message-ID: <1522425978.9176.106.camel@iki.fi> Hi, to, 2018-03-29 kello 15:43 -0400, Warren Weckesser kirjoitti: > According to the SciPy roadmap ( > https://github.com/scipy/scipy/blob/master/doc/ROADMAP.rst.txt#misc), > `scipy.misc` will eventually be removed. Currently, the combinatorial > functions and the image-related operations are all deprecated. The > only > non-deprecated functions in `misc` are `central_diff_weights()`, > `derivative()` and the two functions that return image data: > `ascent()` and > `face()`. > > As a steps towards the deprecation of `misc`, I propose that we > create a > new package, `scipy.data`, for holding data sets. 
`ascent()` and > `face()` would move there, and the new ECG data set proposed in a current pull > request (https://github.com/scipy/scipy/pull/8627) would be put > there. At first sight I think that these two functions (and the third one with ECG signal sample) alone would sound more suitable to be placed in `scipy.ndimage` and `scipy.signal`. Top-level module for them alone sounds overkill, and I'm not sure if discoverability alone is enough. From the rest of the thread, it appears that there is not a very clear picture of what else we would like to put in there. For the downloadable datasets idea, I would not recommend doing anything that requires maintenance. Sorting out hosting issues is boring, and when done on a volunteer basis it always tends to fall on the same chumps. It's not really in the core mission of the project. Note also the (probably mostly forgotten) numpy.DataSource https://docs.scipy.org/doc/numpy/reference/generated/numpy.DataSource.html Pauli From larson.eric.d at gmail.com Fri Mar 30 15:03:43 2018 From: larson.eric.d at gmail.com (Eric Larson) Date: Fri, 30 Mar 2018 15:03:43 -0400 Subject: [SciPy-Dev] New subpackage: scipy.data In-Reply-To: <1522425978.9176.106.camel@iki.fi> References: <1522425978.9176.106.camel@iki.fi> Message-ID: > > Top-level module for them alone sounds overkill, and I'm not sure if > discoverability alone is enough. > Fine by me. And if we follow the idea that these should be added sparingly, we can maintain discoverability without it growing out of hand by populating the See Also sections of each function. Eric From lagru at mailbox.org Fri Mar 30 15:04:18 2018 From: lagru at mailbox.org (Lars G.)
Date: Fri, 30 Mar 2018 21:04:18 +0200 Subject: [SciPy-Dev] New subpackage: scipy.data In-Reply-To: <1522425978.9176.106.camel@iki.fi> References: <1522425978.9176.106.camel@iki.fi> Message-ID: <1c850aed-3c22-cb6f-9496-4ea26d8169ab@mailbox.org> On 30.03.2018 18:06, Pauli Virtanen wrote: > At first sight I think that these two functions (and the third one with > ECG signal sample) alone would sound more suitable to be placed in > `scipy.ndimage` and `scipy.signal`. Top-level module for them alone > sounds overkill, and I'm not sure if discoverability alone is enough. At the risk of stating the obvious, wouldn't the discoverability of those functions be pretty high considering these datasets are or could be used in many documentation examples and tutorials? Lars From phillip.m.feldman at gmail.com Fri Mar 30 15:20:16 2018 From: phillip.m.feldman at gmail.com (Phillip Feldman) Date: Fri, 30 Mar 2018 12:20:16 -0700 Subject: [SciPy-Dev] New subpackage: scipy.data In-Reply-To: References: Message-ID: Is there a substitute for the deprecated combinatorics functions? FYI, a few years ago, I wrote a Python combinatorics package to fill in some of the gaps in `itertools`. My package can be found here: https://pypi.python.org/pypi/Combinatorics/1.4.5 Phillip On Thu, Mar 29, 2018 at 12:43 PM, Warren Weckesser < warren.weckesser at gmail.com> wrote: > According to the SciPy roadmap (https://github.com/scipy/ > scipy/blob/master/doc/ROADMAP.rst.txt#misc), > `scipy.misc` will eventually be removed. Currently, the combinatorial > functions and the image-related operations are all deprecated. The only > non-deprecated functions in `misc` are `central_diff_weights()`, > `derivative()` and the two functions that return image data: `ascent()` and > `face()`. > > As a steps towards the deprecation of `misc`, I propose that we create a > new package, `scipy.data`, for holding data sets. 
`ascent()` and `face()` > would move there, and the new ECG data set proposed in a current pull > request (https://github.com/scipy/scipy/pull/8627) would be put there. > > An early version of the roadmap suggested moving the images to > `scipy.ndimage`, but that is no longer in the text. I think a separate > subpackage for data sets makes sense. > > What do you think? > > P.S. If there is already a similar proposal in the mailing list or on > github, or any other older mailing list discussions related to this, let me > know. > > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From pav at iki.fi Fri Mar 30 15:23:14 2018 From: pav at iki.fi (Pauli Virtanen) Date: Fri, 30 Mar 2018 21:23:14 +0200 Subject: [SciPy-Dev] New subpackage: scipy.data In-Reply-To: References: Message-ID: <1522437794.2700.1.camel@iki.fi> pe, 2018-03-30 kello 12:20 -0700, Phillip Feldman kirjoitti: > Is there a substitute for the deprecated combinatorics functions? I think they were moved to scipy.special. The documentation iirc says where to find it now. Pauli > FYI, a few years ago, I wrote a Python combinatorics package to fill > in > some of the gaps in `itertools`. My package can be found here: > > https://pypi.python.org/pypi/Combinatorics/1.4.5 > > Phillip > > On Thu, Mar 29, 2018 at 12:43 PM, Warren Weckesser < > warren.weckesser at gmail.com> wrote: > > > According to the SciPy roadmap (https://github.com/scipy/ > > scipy/blob/master/doc/ROADMAP.rst.txt#misc), > > `scipy.misc` will eventually be removed. Currently, the > > combinatorial > > functions and the image-related operations are all deprecated. The > > only > > non-deprecated functions in `misc` are `central_diff_weights()`, > > `derivative()` and the two functions that return image data: > > `ascent()` and > > `face()`. 
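[Editor's note: Pauli is right — the combinatorial helpers deprecated in `scipy.misc` live in `scipy.special` (which also gained `factorial`). For reference:]

```python
from scipy.special import comb, perm

# comb/perm replace the deprecated scipy.misc combinatorial functions;
# exact=True returns exact integers rather than floating-point values.
n_subsets = comb(5, 2, exact=True)       # "5 choose 2" -> 10
n_arrangements = perm(5, 2, exact=True)  # ordered pairs from 5 -> 20
```

With the default ``exact=False`` both functions return floats computed via the gamma function, which is faster for large arguments but inexact.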
> > > > As a steps towards the deprecation of `misc`, I propose that we > > create a > > new package, `scipy.data`, for holding data sets. `ascent()` and > > `face()` > > would move there, and the new ECG data set proposed in a current > > pull > > request (https://github.com/scipy/scipy/pull/8627) would be put > > there. > > > > An early version of the roadmap suggested moving the images to > > `scipy.ndimage`, but that is no longer in the text. I think a > > separate > > subpackage for data sets makes sense. > > > > What do you think? > > > > P.S. If there is already a similar proposal in the mailing list or > > on > > github, or any other older mailing list discussions related to > > this, let me > > know. > > > > > > > > _______________________________________________ > > SciPy-Dev mailing list > > SciPy-Dev at python.org > > https://mail.python.org/mailman/listinfo/scipy-dev > > > > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev From mikofski at berkeley.edu Fri Mar 30 15:23:22 2018 From: mikofski at berkeley.edu (Mark Alexander Mikofski) Date: Fri, 30 Mar 2018 19:23:22 +0000 Subject: [SciPy-Dev] New subpackage: scipy.data In-Reply-To: References: <20180329224535.nyn575wvqfcpg7qu@carbo> <20180329231617.cdyxvf6ysip4fth3@carbo> Message-ID: I agree with what's above. Basically (1) move small datasets to centralized scipy.datasets for testing, demos, docs, and short examples, and (2) move large, realistic datasets to shared repo or common site like rdatasets and explain in docs how to retrieve them. These longer tutorials could be in Jupyter notebooks, for example. On Mar 30, 2018 7:30 AM, wrote: On Fri, Mar 30, 2018 at 9:54 AM, Eric Larson wrote: >>> It depends on the scale where this should go. 
> > In this particular case ("scipy.signal currently has no useful realistic > signals"), if we add the proposed ~100 kB data file, I suspect that we can > greatly enhance a large number of our scipy.signal examples. An ECG signal > won't be perfect for all of them, but in many cases it will be a lot better > and more instructive for users than what we can currently synthesize > ourselves (while keeping synthesis sufficiently simple at least). > > Compared to a general dataset-fetching utility, the in-repo approach has > clear disadvantages in terms of being incomplete and adding to repo size. > Its advantages are in terms of simplifying doc building, access, > maintenance, uniformity of functionality (benchmarks, Debian unit tests, doc > building, etc.). On the balance, this makes it worth having IMO. > >> For example, a dataset package also runs into the problem how much to >> include. > > > A proposed rule of thumb: SciPy can have (up to) a couple of small-sized > files per module shipped with the repo in cases where such files greatly > improve our ability to showcase/test/document functionality (benchmarks/unit > tests/docstrings). This forces us to make subjective judgments about what > will be sufficiently useful, sufficiently small, and sufficiently impactful > for the module, but I think this will be a rare enough phenomenon that it's > okay. > > In other words, I propose that scipy.datasets not provide an exhaustive or > even extensive resource of data for users, but rather a minimal one for > showcasing functionality. This seems consistent with what we already do with > ascent/face, in that they improve the image-processing examples. > >> We've been doing this in scikit-image for a long time, and now regret >> having any binary data in the repository > > > I have had a similar problem while maintaining MNE-Python, which has some > files in the repo and others in a GitHub repo (downloaded separately for > testing). 
I have a similar feeling about the files that live in the repo > today. However, for SciPy the problem seems a bit different in scope and > scale -- a handful of small files can go a long way for SciPy, which isn't > the case for MNE (and I would assume also many functions in scikit-image). > >> both scikit-learn and scikit-image use access to larger datasets. > > There are other projects that also do this (MNE has huge ones hosted on > osf.io, VisPy hosts data on GitHub). It would be awesome if someone unified > all this stuff for cases where you want to deal with getting large datasets, > or many different datasets. just to say: I agree with all of this, and think it is a very good summary of the issues Josef > My 2c, > Eric > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev From ralf.gommers at gmail.com Fri Mar 30 20:17:13 2018 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Fri, 30 Mar 2018 17:17:13 -0700 Subject: [SciPy-Dev] New subpackage: scipy.data In-Reply-To: References: <1522425978.9176.106.camel@iki.fi> Message-ID: On Fri, Mar 30, 2018 at 12:03 PM, Eric Larson wrote: > Top-level module for them alone sounds overkill, and I'm not sure if > discoverability alone is enough. > Fine by me. And if we follow the idea that these should be added > sparingly, we can maintain discoverability without it growing out of > hand by populating the See Also sections of each function. I agree with this; the 2 images and 1 ECG signal (to be added) that we have don't justify a top-level module. We don't want to grow more than the absolute minimum of datasets.
The package is already very large, which is problematic in certain cases. E.g. numpy + scipy still fits in the AWS Lambda limit of 50 MB, but there's not much margin. Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Fri Mar 30 20:48:19 2018 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Fri, 30 Mar 2018 17:48:19 -0700 Subject: [SciPy-Dev] Attention GSoC student applicants Message-ID: Hi, First of all a big thank you to all of you who submitted an application for Google Summer of Code 2018 to SciPy! The coming week we will asses all proposals, and we will interview all students who submitted a complete and good quality proposal. One thing I'd like to point out now: the Python Software Foundation requires you to have submitted a pull request to SciPy. It does not have to be merged, but if you have not submitted at least one pull request yet, then I'd suggest to work on one as soon as possible! Cheers, Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From pav at iki.fi Sat Mar 31 11:13:19 2018 From: pav at iki.fi (Pauli Virtanen) Date: Sat, 31 Mar 2018 17:13:19 +0200 Subject: [SciPy-Dev] Measuring test coverage of a Cython module In-Reply-To: References: <28dc611e-7598-e88b-2534-e2a5252b9ca0@mailbox.org> Message-ID: <1522509199.2449.1.camel@iki.fi> Hi, pe, 2018-03-30 kello 17:50 +0200, Lars G. kirjoitti: > The only discussion I remember is one on the scikit-image list > > started > > by Matthew Brett to get this working; not sure if that came to > > anything. > > In general it looks like it should work, but it isn't always > > reliable > > (e.g. https://github.com/cython/cython/issues/1985) > > Thanks for the link and your time. I'll look into it and report back > if I figure it out. 
I tried to do this for https://github.com/scipy/scipy/pull/8379 but without success (although I don't remember any more what the issue was), so if you manage to figure it out that would be useful. Pauli
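[Editor's note: for anyone landing on this thread later, a sketch of the pieces that need to line up for Cython coverage. Per the Cython documentation the plugin module is spelled `Cython.Coverage` with capital letters — the lowercase `Cython.coverage` in step 1 above may be why the module never showed up in the report:]

```ini
# .coveragerc -- note the capitalisation of the plugin module name
[run]
plugins = Cython.Coverage
```

The `# cython: linetrace=True` and `# distutils: define_macros=CYTHON_TRACE_NOGIL=1` directives from step 2 change the generated C code, so the extension must be rebuilt in place after adding them, before running `coverage`. Even with all of this in place the combination is not always reliable, as the linked cython issue shows.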