From andyfaff at gmail.com Mon Jan 1 03:31:44 2018 From: andyfaff at gmail.com (Andrew Nelson) Date: Mon, 1 Jan 2018 19:31:44 +1100 Subject: [SciPy-Dev] TravisCI issues Message-ID: Happy New Year everyone. Whilst submitting a PR today I noticed that some of the travis builds are failing with the error: "oci runtime error: exec failed: container_linux.go:265: starting container process caused "could not create session key: disk quota exceeded" I'm hoping it's a temporary issue. A. -- _____________________________________ Dr. Andrew Nelson _____________________________________ -------------- next part -------------- An HTML attachment was scrubbed... URL: From matthew.brett at gmail.com Mon Jan 1 07:05:34 2018 From: matthew.brett at gmail.com (Matthew Brett) Date: Mon, 1 Jan 2018 12:05:34 +0000 Subject: [SciPy-Dev] TravisCI issues In-Reply-To: References: Message-ID: Hi, On Mon, Jan 1, 2018 at 8:31 AM, Andrew Nelson wrote: > Happy New Year everyone. > > Whilst submitting a PR today I noticed that some of the travis builds are > failing with the error: > > "oci runtime error: exec failed: container_linux.go:265: starting container > process caused "could not create session key: disk quota exceeded" > > I'm hoping it's a temporary issue. I'm getting the same error on another repo: https://travis-ci.org/matthew-brett/nb2plots/jobs/323694309 I've sent an email to their support address, Cheers, Matthew From ilhanpolat at gmail.com Mon Jan 1 11:23:54 2018 From: ilhanpolat at gmail.com (Ilhan Polat) Date: Mon, 1 Jan 2018 17:23:54 +0100 Subject: [SciPy-Dev] TravisCI issues In-Reply-To: References: Message-ID: I have manually started one-by-one so far all passed. So it is probably a TravisCI issue that is being resolved. On Mon, Jan 1, 2018 at 1:05 PM, Matthew Brett wrote: > Hi, > > On Mon, Jan 1, 2018 at 8:31 AM, Andrew Nelson wrote: > > Happy New Year everyone. > > > > Whilst submitting a PR today I noticed that some of the travis builds are > > failing with the error: > > > > "oci runtime error: exec failed: container_linux.go:265: starting > container > > process caused "could not create session key: disk quota exceeded" > > > > I'm hoping it's a temporary issue. > > I'm getting the same error on another repo: > > https://travis-ci.org/matthew-brett/nb2plots/jobs/323694309 > > I've sent an email to their support address, > > Cheers, > > Matthew > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > -------------- next part -------------- An HTML attachment was scrubbed... URL: From stefanv at berkeley.edu Thu Jan 4 16:14:13 2018 From: stefanv at berkeley.edu (Stefan van der Walt) Date: Thu, 04 Jan 2018 13:14:13 -0800 Subject: [SciPy-Dev] Position at BIDS (UC Berkeley) to work on NumPy Message-ID: <1515100453.2960132.1224583776.2F500AEF@webmail.messagingengine.com> Hi everyone, The Berkeley Institute for Data Science (BIDS) is hiring scientific Python Developers to contribute to NumPy. You can read more about the new positions here: https://bids.berkeley.edu/news/bids-receives-sloan-foundation-grant-contribute-numpy-development If you enjoy collaborative work as well as the technical challenges posed by numerical computing, this is an excellent opportunity to play a fundamental role in the development of one of the most impactful libraries in the entire Python ecosystem. Best regards St?fan Job link: https://jobsprod.is.berkeley.edu/psc/jobsprod/EMPLOYEE/HRMS/c/HRS_HRAM.HRS_CE.GBL?Page=HRS_CE_JOB_DTL&Action=A&JobOpeningId=24142&SiteId=1&PostingSeq=1 From charlesr.harris at gmail.com Thu Jan 4 18:52:30 2018 From: charlesr.harris at gmail.com (Charles R Harris) Date: Thu, 4 Jan 2018 16:52:30 -0700 Subject: [SciPy-Dev] Position at BIDS (UC Berkeley) to work on NumPy In-Reply-To: <1515100453.2960132.1224583776.2F500AEF@webmail.messagingengine.com> References: <1515100453.2960132.1224583776.2F500AEF@webmail.messagingengine.com> Message-ID: Should send this to the NumPy list also. On Thu, Jan 4, 2018 at 2:14 PM, Stefan van der Walt wrote: > Hi everyone, > > The Berkeley Institute for Data Science (BIDS) is hiring scientific Python > Developers to contribute to NumPy. You can read more about the new > positions here: > > https://bids.berkeley.edu/news/bids-receives-sloan- > foundation-grant-contribute-numpy-development > > If you enjoy collaborative work as well as the technical challenges posed > by numerical computing, this is an excellent opportunity to play a > fundamental role in the development of one of the most impactful libraries > in the entire Python ecosystem. > > Best regards > St?fan > > Job link: https://jobsprod.is.berkeley.edu/psc/jobsprod/EMPLOYEE/ > HRMS/c/HRS_HRAM.HRS_CE.GBL?Page=HRS_CE_JOB_DTL&Action=A& > JobOpeningId=24142&SiteId=1&PostingSeq=1 > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Sat Jan 6 20:00:20 2018 From: charlesr.harris at gmail.com (Charles R Harris) Date: Sat, 6 Jan 2018 18:00:20 -0700 Subject: [SciPy-Dev] NumPy 1.14.0 release Message-ID: Hi All, On behalf of the NumPy team, I am pleased to announce NumPy 1.14.0. Numpy 1.14.0 is the result of seven months of work and contains a large number of bug fixes and new features, along with several changes with potential compatibility issues. The major change that users will notice are the stylistic changes in the way numpy arrays and scalars are printed, a change that will affect doctests. See the release notes for details on how to preserve the old style printing when needed. A major decision affecting future development concerns the schedule for dropping Python 2.7 support in the runup to 2020. The decision has been made to support 2.7 for all releases made in 2018, with the last release being designated a long term release with support for bug fixes extending through the end of 2019. Starting from January, 2019 support for 2.7 will be dropped in all new releases. More details can be found in the relevant NEP . This release supports Python 2.7 and 3.4 - 3.6. Wheels for the release are available on PyPI. Source tarballs, zipfiles, release notes, and the changelog are available on github . *Highlights* - The ``np.einsum`` function uses BLAS when possible - ``genfromtxt``, ``loadtxt``, ``fromregex`` and ``savetxt`` can now handle files with arbitrary Python supported encoding. - Major improvements to printing of NumPy arrays and scalars. *New functions* - ``parametrize``: decorator added to numpy.testing - ``chebinterpolate``: Interpolate function at Chebyshev points. - ``format_float_positional`` and ``format_float_scientific`` : format floating-point scalars unambiguously with control of rounding and padding. - ``PyArray_ResolveWritebackIfCopy`` and ``PyArray_SetWritebackIfCopyBase``, new C-API functions useful in achieving PyPy compatibity. *Contributors* A total of 100 people contributed to this release. People with a "+" by their names contributed a patch for the first time. * Alexey Brodkin + * Allan Haldane * Andras Deak + * Andrew Lawson + * Anna Chiara + * Antoine Pitrou * Bernhard M. Wiedemann + * Bob Eldering + * Brandon Carter * CJ Carey * Charles Harris * Chris Lamb * Christoph Boeddeker + * Christoph Gohlke * Daniel Hrisca + * Daniel Smith * Danny Hermes * David Freese * David Hagen * David Linke + * David Schaefer + * Dillon Niederhut + * Egor Panfilov + * Emilien Kofman * Eric Wieser * Erik Bray + * Erik Quaeghebeur + * Garry Polley + * Gunjan + * Han Shen + * Henke Adolfsson + * Hidehiro NAGAOKA + * Hemil Desai + * Hong Xu + * Iryna Shcherbina + * Jaime Fernandez * James Bourbeau + * Jamie Townsend + * Jarrod Millman * Jean Helie + * Jeroen Demeyer + * John Goetz + * John Kirkham * John Zwinck * Jonathan Helmus * Joseph Fox-Rabinovitz * Joseph Paul Cohen + * Joshua Leahy + * Julian Taylor * J?rg D?pfert + * Keno Goertz + * Kevin Sheppard + * Kexuan Sun + * Konrad Kapp + * Kristofor Maynard + * Licht Takeuchi + * Lo?c Est?ve * Lukas Mericle + * Marten van Kerkwijk * Matheus Portela + * Matthew Brett * Matti Picus * Michael Lamparski + * Michael Odintsov + * Michael Schnaitter + * Michael Seifert * Mike Nolta * Nathaniel J. Smith * Nelle Varoquaux + * Nicholas Del Grosso + * Nico Schl?mer + * Oleg Zabluda + * Oleksandr Pavlyk * Pauli Virtanen * Pim de Haan + * Ralf Gommers * Robert T. McGibbon + * Roland Kaufmann * Sebastian Berg * Serhiy Storchaka + * Shitian Ni + * Spencer Hill + * Srinivas Reddy Thatiparthy + * Stefan Winkler + * Stephan Hoyer * Steven Maude + * SuperBo + * Thomas K?ppe + * Toon Verstraelen * Vedant Misra + * Warren Weckesser * Wirawan Purwanto + * Yang Li + * Ziyan Zhou + * chaoyu3 + * orbit-stabilizer + * solarjoe * wufangjie + * xoviat + * ?lie Gouzien + Cheers, Charles Harris -------------- next part -------------- An HTML attachment was scrubbed... URL: From andyfaff at gmail.com Mon Jan 8 19:35:22 2018 From: andyfaff at gmail.com (Andrew Nelson) Date: Tue, 9 Jan 2018 11:35:22 +1100 Subject: [SciPy-Dev] General discussion on parallelisation Message-ID: As part of https://github.com/scipy/scipy/pull/8259 I'm proposing that a `workers` keyword is added to optimize.differential_evolution to parallelise some computation. The proposal is that: 1. the workers keyword accepts either an integer or an object with a map-like method. 2. If an integer is supplied then the parallelisation is taken care of by scipy (more on that later), with -1 signifying that all processors are to be used. 3. If an object with a map-like method is supplied, e.g. `multiprocessing.Pool.map`, `mpi4py.futures.MPIPoolExecutor.map`, etc, then the parallelisation is taken care of by that object. This allows the user to specify the parallelisation configuration for their problem. 4. If workers=1, then computation will be done by the builtin `map` function. Now we come to the under the hood part. I've written something called PoolWrapper ( https://github.com/andyfaff/scipy/blob/b14bb513c0ffb9807a67663d39b9ab399375d37d/scipy/_lib/_util.py#L343) which wraps `multiprocessing.Pool` to achieve the behaviour outlined above. It can be used as a context manager, or the user of the object can decide when to close the resources opened by PoolWrapper. I've looked at using joblib instead of PoolWrapper and it seems useful but it doesn't have a couple of bits of functionality that are needed for this specific problem: viz: 5. joblib.Parallel doesn't have a map method (desirable to allow 3) so a small wrapper would have to be created anyway. 6. joblib.Parallel creates/destroys a multiprocessing.Pool each time the Parallel object is `__call__`ed. This leads to significant overhead. One can use the Parallel object with a context manager, which allows reuse of the Pool, but I don't think that's do-able in the context of using the DifferentialEvolutionSolver (DES) object as an iterator: >>> solver = DifferentialEvolutionSolver(func, bounds) >>> # use DES object as an iterator >>> for it in solver: ... res = next(solver) >>> print(res) >>> # use the DES.solve method >>> res = solver.solve() Whilst the DES object is not currently public (it's called by the differential_evolution function) it would be nice to expose it in the future, and people will want to use both approaches. Unfortunately with the first approach if we used joblib.Parallel we'd have to use Parallel.__call__ in DES.next() which has the overhead of creating/destroying Pools. For efficient use of resources the Pool should persist for the lifetime of the DES object. I also looked into `concurrent.futures.ProcessPoolExecutor`, but it's not available for Python 2.7. The purpose of this email is to elicit feedback for developing parallelisation strategy for scipy - what does the public interface look like, what does scipy do under the hood? Under the hood I think a mixture of PoolWrapper and joblib.Parallel could be used (with scipy vendoring joblib). A. -- _____________________________________ Dr. Andrew Nelson _____________________________________ -------------- next part -------------- An HTML attachment was scrubbed... URL: From larson.eric.d at gmail.com Mon Jan 8 21:48:42 2018 From: larson.eric.d at gmail.com (Eric Larson) Date: Mon, 8 Jan 2018 21:48:42 -0500 Subject: [SciPy-Dev] General discussion on parallelisation In-Reply-To: References: Message-ID: > > 6. joblib.Parallel creates/destroys a multiprocessing.Pool each time the > Parallel object is `__call__`ed. This leads to significant overhead. One > can use the Parallel object with a context manager, which allows reuse of > the Pool, but I don't think that's do-able in the context of using the > DifferentialEvolutionSolver (DES) object as an iterator: > If you can use `__iter__` instead of `__next__` in your DifferentialEvolutionSolver class I think you can avoid this problem with something along the lines of: class DifferentialEvolutionSolver(object): ... def __iter__(self): with Parallel(...) ...: for it in ...: yield it ? As for the map problem I don't know, but it's probably worth asking the `joblib` people if there is a suitable solution or workaround. Eric -------------- next part -------------- An HTML attachment was scrubbed... URL: From andyfaff at gmail.com Mon Jan 8 23:22:17 2018 From: andyfaff at gmail.com (Andrew Nelson) Date: Tue, 9 Jan 2018 15:22:17 +1100 Subject: [SciPy-Dev] General discussion on parallelisation In-Reply-To: References: Message-ID: > > On 9 January 2018 at 13:48, Eric Larson wrote: > > If you can use `__iter__` instead of `__next__` in your > DifferentialEvolutionSolver class I think you can avoid this problem with > something along the lines of: > > > > class DifferentialEvolutionSolver(object): > > ... > > def __iter__(self): > > with Parallel(...) ...: > > for it in ...: > > yield it > That seems Pythonic, I'll look into that in more detail. One hurdle might be when the two approaches are mixed, starting off with a generator, uses the solver method, etc. -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Tue Jan 9 03:24:17 2018 From: njs at pobox.com (Nathaniel Smith) Date: Tue, 9 Jan 2018 00:24:17 -0800 Subject: [SciPy-Dev] RFC: comments to BLAS committee from numpy/scipy devs Message-ID: Hi all, As mentioned earlier [1][2], there's work underway to revise and update the BLAS standard -- e.g. we might get support for strided arrays and lose xerbla! There's a draft at [3]. They're interested in feedback from users, so I've written up a first draft of comments about what we would like as NumPy/SciPy developers. This is very much a first attempt -- I know we have lots of people who are more expert on BLAS than me on these lists :-). Please let me know what you think. -n [1] https://mail.python.org/pipermail/numpy-discussion/2017-November/077420.html [2] https://mail.python.org/pipermail/scipy-dev/2017-November/022267.html [3] https://docs.google.com/document/d/1DY4ImZT1coqri2382GusXgBTTTVdBDvtD5I14QHp9OE/edit ----- # Comments from NumPy / SciPy developers on "A Proposal for a Next-Generation BLAS" These are comments on [A Proposal for a Next-Generation BLAS](https://docs.google.com/document/d/1DY4ImZT1coqri2382GusXgBTTTVdBDvtD5I14QHp9OE/edit#) (version as of 2017-12-13), from the perspective of the developers of the NumPy and SciPy libraries. We hope this feedback is useful, and welcome further discussion. ## Who are we? NumPy and SciPy are the two foundational libraries of the Python numerical ecosystem, and one of their duties is to wrap BLAS and expose it for the use of other Python libraries. (NumPy primarily provides a GEMM wrapper, while SciPy exposes more specialized operations.) It's unclear how many users we have exactly, but we certainly ship multiple million copies of BLAS every month, and provide one of the most popular numerical toolkits for both novice and expert users. Looking at the original BLAS and LAPACK interfaces, it often seems that their imagined user is something like a classic supercomputer consumer, who writes code directly in Fortran or C against the BLAS API, and where the person writing the code and running the code are the same. NumPy/SciPy are coming from a very different perspective: our users generally know nothing about the details of the underlying BLAS; they just want to describe their problem in some high-level way, and the library is responsible for making it happen as efficiently as possible, and is often integrated into some larger system (e.g. a real-time analytics platform embedded in a web server). When it comes to our BLAS usage, we mostly use only a small subset of the routines. However, as "consumer software" used by a wide variety of users with differing degress of technical expertise, we're expected to Just Work on a wide variety of systems, and with as many different vendor BLAS libraries as possible. On the other hand, the fact that we're working with Python means we don't tend to worry about small inefficiencies that will be lost in the noise in any case, and are willing to sacrifice some performance to get more reliable operation across our diverse userbase. ## Comments on specific aspects of the proposal ### Data Layout We are **strongly in favor** of the proposal to support arbitrary strided data layouts. Ideally, this would support strides *specified in bytes* (allowing for unaligned data layouts), and allow for truly arbitrary strides, including *zero or negative* values. However, we think it's fine if some of the weirder cases suffer a performance penalty. Rationale: NumPy ? and thus, most of the scientific Python ecosystem ? only has one way of representing an array: the `numpy.ndarray` type, which is an arbitrary dimensional tensor with arbitrary strides. It is common to encounter matrices with non-trivial strides. For example:: # Make a 3-dimensional tensor, 10 x 9 x 8 t = np.zeros((10, 9, 8)) # Considering this as a stack of eight 10x9 matrices, extract the first: mat = t[:, :, 0] Now `mat` has non-trivial strides on both axes. (If running this in a Python interpreter, you can see this by looking at the value of `mat.strides`.) Another case where interesting strides arise is when performing ["broadcasting"](https://docs.scipy.org/doc/numpy-1.13.0/user/basics.broadcasting.html), which is the name for NumPy's rules for stretching arrays to make their shapes match. For example, in an expression like:: np.array([1, 2, 3]) + 1 the scalar `1` is "broadcast" to create a vector `[1, 1, 1]`. This is accomplished without allocating memory, by creating a vector with settings length = 3, strides = 0 ? so all the elements share a single location in memory. Similarly, by using negative strides we can reverse an array without allocating memory:: a = np.array([1, 2, 3]) a_flipped = a[::-1] Now `a_flipped` has the value `[3, 2, 1]`, while sharing storage with the array `a = [1, 2, 3]`. Misaligned data is also possible (e.g. an array of 8-byte doubles with a 9-byte stride), though it arises more rarely. (An example of when it might occurs is in an on-disk data format that alternates between storing a double value and then a single byte value, which is then memory-mapped.) While this array representation is very flexible and useful, it makes interfacing with BLAS a challenge: how do you perform a GEMM operation on two arrays with arbitrary strides? Currently, NumPy attempts to detect a number of special cases: if the strides in both arrays imply a column-major layout, then call BLAS directly; if one of them has strides corresponding to a row-major layout, then set the corresponding `transA`/`transB` argument, etc. ? and if all else fails, either copy the data into a contiguous buffer, or else fall back on a naive triple-nested-loop GEMM implementation. (There's also a check where if we can determine through examining the arrays' data pointers and strides that they're actually transposes of each other, then we instead dispatch to SYRK.) So for us, native stride support in the BLAS has two major advantages: - It allows a wider variety of operations to be transparently handled using high-speed kernels. - It reduces the complexity of the NumPy/BLAS interface code. Any kind of stride support produces both of these advantages to some extent. The ideal would be if BLAS supports strides specified in bytes (not elements), and allowing truly arbitrary values, because in this case, we can simply pass through our arrays directly to the library without having to do any fiddly checking at all. Note that for these purposes, it's OK if "weird" arrays pay a speed penalty: if the BLAS didn't support them, we'd just have to fall back on some even worse strategy for handling them ourselves. And this approach would mean that if some BLAS vendors do at some point decide to optimize these cases, then our users will immediately see the advantages. ### Error-reporting We are **strongly in favor** of the draft's proposal to replace XERBLA with proper error codes. XERBLA is fine for one-off programs in Fortran, but awful for wrapper libraries like NumPy/SciPy. ### Handling of exceptional values A recurring issue in low-level linear-algebra libraries is that they may crash, freeze, or produce unexpected results when receiving non-finite inputs. Of course as a wrapper library, NumPy/SciPy can't control what kind of strange data our users might pass in, and we have to produce sensible results regardless, so we sometimes accumulate hacks like having to validate all incoming data for exceptional values We support the proposal's recommendation of "NaN interpretation (4)". We also note that contrary to the note under "Interpretation (1)"; in the R statistical environment NaN does *not* represent missing data. R maintains a clear distinction between "NA" (missing) values and "NaN" (invalid) values. This is important for various statistical procedures such as imputation, where you might want to estimate the true value of an unmeasured variable, but you don't want to estimate the true value of 0/0. For efficiency, they do perform some calculations on NA values by storing them as NaNs with a specific payload; in the R context, therefore, it's particularly valuable if operations **correctly propagate NaN payloads**. NumPy does not yet provide first-class missing value support, but we plan to add it within the next 1-2 years, and when we do it seems likely that we'll have similar needs to R. ### BLAS G2 NumPy has a rich type system, including 16-, 32-, and 64-bit floats (plus extended precision on some platforms), a similar set of complex values, and is further extensible with new types. We have a similarly rich dispatch system for operations that attempts to find the best matching optimized-loop for an arbitrary set of input types. We're thus quite interested in the proposed mixed-type operations, and if implemented we'll probably start transparently taking advantage of them (i.e., in cases where we're currently upcasting to perform a requested operation, we may switch to using the native precision operation without requiring any changes in user code). Two comments, though: 1. The various tricks to make names less redundant (e.g., specifying just one type if they're all the same) are convenient for those calling these routines by hand, but rather inconvenient for those of us who will be generating a table mapping different type combinations to different functions. When designing a compression scheme for names, please remember that the scheme will have to be unambiguously implemented in code by multiple projects. Another option to consider: ask vendors to implement the full "uncompressed" names, and then provide a shim library mapping short "convenience" names to their full form. 2. Regarding the suggestion that the naming scheme will allow for a wide variety of differently typed routines, but that only some subset will be implemented by any particular vendor: we are worried about this subset business. For a library like NumPy that wants to wrap "all" the provided routines, while supporting "all" the different BLAS libraries ... how will this work? **How do we know which routines any particular library provides?** What if new routines are added to the list later? ### Reproducible BLAS This is interesting, but we don't have any particularly useful comments. ### Batch BLAS NumPy/SciPy actually provide batched operations in many cases, currently implemented using a `for` loop around individual calls to BLAS/LAPACK. However, our interface is relatively restricted compared to some of the proposals considered here: generally all we need is the ability to perform an operation on two "stacks" of size-homogenous matrices, with the offset between matrices determined by an arbitrary (potentially zero) stride. ### Fixed-point BLAS This is interesting, but we don't have any particularly useful comments. -- Nathaniel J. Smith -- https://vorpus.org From ilhanpolat at gmail.com Tue Jan 9 06:40:02 2018 From: ilhanpolat at gmail.com (Ilhan Polat) Date: Tue, 9 Jan 2018 12:40:02 +0100 Subject: [SciPy-Dev] RFC: comments to BLAS committee from numpy/scipy devs In-Reply-To: References: Message-ID: I couldn't find an item to place this but I think ilaenv and also calling the function twice (one with lwork=-1 and reading the optimal block size and the call the function again properly with lwork=) in LAPACK needs to be gotten rid of. That's a major annoyance during the wrapping of LAPACK routines for SciPy. I don't know if this is realistic but the values ilaenv needed can be computed once (or again if hardware is changed) at the install and can be read off by the routines. On Jan 9, 2018 09:25, "Nathaniel Smith" wrote: > Hi all, > > As mentioned earlier [1][2], there's work underway to revise and > update the BLAS standard -- e.g. we might get support for strided > arrays and lose xerbla! There's a draft at [3]. They're interested in > feedback from users, so I've written up a first draft of comments > about what we would like as NumPy/SciPy developers. This is very much > a first attempt -- I know we have lots of people who are more expert > on BLAS than me on these lists :-). Please let me know what you think. > > -n > > [1] https://mail.python.org/pipermail/numpy-discussion/ > 2017-November/077420.html > [2] https://mail.python.org/pipermail/scipy-dev/2017-November/022267.html > [3] https://docs.google.com/document/d/1DY4ImZT1coqri2382GusXgBTTTVdB > DvtD5I14QHp9OE/edit > > ----- > > # Comments from NumPy / SciPy developers on "A Proposal for a > Next-Generation BLAS" > > These are comments on [A Proposal for a Next-Generation > BLAS](https://docs.google.com/document/d/1DY4ImZT1coqri2382GusXgBTTTVdB > DvtD5I14QHp9OE/edit#) > (version as of 2017-12-13), from the perspective of the developers of > the NumPy and SciPy libraries. We hope this feedback is useful, and > welcome further discussion. > > ## Who are we? > > NumPy and SciPy are the two foundational libraries of the Python > numerical ecosystem, and one of their duties is to wrap BLAS and > expose it for the use of other Python libraries. (NumPy primarily > provides a GEMM wrapper, while SciPy exposes more specialized > operations.) It's unclear how many users we have exactly, but we > certainly ship multiple million copies of BLAS every month, and > provide one of the most popular numerical toolkits for both novice and > expert users. > > Looking at the original BLAS and LAPACK interfaces, it often seems > that their imagined user is something like a classic supercomputer > consumer, who writes code directly in Fortran or C against the BLAS > API, and where the person writing the code and running the code are > the same. NumPy/SciPy are coming from a very different perspective: > our users generally know nothing about the details of the underlying > BLAS; they just want to describe their problem in some high-level way, > and the library is responsible for making it happen as efficiently as > possible, and is often integrated into some larger system (e.g. a > real-time analytics platform embedded in a web server). > > When it comes to our BLAS usage, we mostly use only a small subset of > the routines. However, as "consumer software" used by a wide variety > of users with differing degress of technical expertise, we're expected > to Just Work on a wide variety of systems, and with as many different > vendor BLAS libraries as possible. On the other hand, the fact that > we're working with Python means we don't tend to worry about small > inefficiencies that will be lost in the noise in any case, and are > willing to sacrifice some performance to get more reliable operation > across our diverse userbase. > > ## Comments on specific aspects of the proposal > > ### Data Layout > > We are **strongly in favor** of the proposal to support arbitrary > strided data layouts. Ideally, this would support strides *specified > in bytes* (allowing for unaligned data layouts), and allow for truly > arbitrary strides, including *zero or negative* values. However, we > think it's fine if some of the weirder cases suffer a performance > penalty. > > Rationale: NumPy ? and thus, most of the scientific Python ecosystem ? > only has one way of representing an array: the `numpy.ndarray` type, > which is an arbitrary dimensional tensor with arbitrary strides. It is > common to encounter matrices with non-trivial strides. For example:: > > # Make a 3-dimensional tensor, 10 x 9 x 8 > t = np.zeros((10, 9, 8)) > # Considering this as a stack of eight 10x9 matrices, extract the > first: > mat = t[:, :, 0] > > Now `mat` has non-trivial strides on both axes. (If running this in a > Python interpreter, you can see this by looking at the value of > `mat.strides`.) Another case where interesting strides arise is when > performing ["broadcasting"](https://docs.scipy.org/doc/numpy-1.13.0/ > user/basics.broadcasting.html), > which is the name for NumPy's rules for stretching arrays to make > their shapes match. For example, in an expression like:: > > np.array([1, 2, 3]) + 1 > > the scalar `1` is "broadcast" to create a vector `[1, 1, 1]`. This is > accomplished without allocating memory, by creating a vector with > settings length = 3, strides = 0 ? so all the elements share a single > location in memory. Similarly, by using negative strides we can > reverse an array without allocating memory:: > > a = np.array([1, 2, 3]) > a_flipped = a[::-1] > > Now `a_flipped` has the value `[3, 2, 1]`, while sharing storage with > the array `a = [1, 2, 3]`. Misaligned data is also possible (e.g. an > array of 8-byte doubles with a 9-byte stride), though it arises more > rarely. (An example of when it might occurs is in an on-disk data > format that alternates between storing a double value and then a > single byte value, which is then memory-mapped.) > > While this array representation is very flexible and useful, it makes > interfacing with BLAS a challenge: how do you perform a GEMM operation > on two arrays with arbitrary strides? Currently, NumPy attempts to > detect a number of special cases: if the strides in both arrays imply > a column-major layout, then call BLAS directly; if one of them has > strides corresponding to a row-major layout, then set the > corresponding `transA`/`transB` argument, etc. ? and if all else > fails, either copy the data into a contiguous buffer, or else fall > back on a naive triple-nested-loop GEMM implementation. (There's also > a check where if we can determine through examining the arrays' data > pointers and strides that they're actually transposes of each other, > then we instead dispatch to SYRK.) > > So for us, native stride support in the BLAS has two major advantages: > > - It allows a wider variety of operations to be transparently handled > using high-speed kernels. > > - It reduces the complexity of the NumPy/BLAS interface code. > > Any kind of stride support produces both of these advantages to some > extent. The ideal would be if BLAS supports strides specified in bytes > (not elements), and allowing truly arbitrary values, because in this > case, we can simply pass through our arrays directly to the library > without having to do any fiddly checking at all. Note that for these > purposes, it's OK if "weird" arrays pay a speed penalty: if the BLAS > didn't support them, we'd just have to fall back on some even worse > strategy for handling them ourselves. And this approach would mean > that if some BLAS vendors do at some point decide to optimize these > cases, then our users will immediately see the advantages. > > ### Error-reporting > > We are **strongly in favor** of the draft's proposal to replace XERBLA > with proper error codes. XERBLA is fine for one-off programs in > Fortran, but awful for wrapper libraries like NumPy/SciPy. > > ### Handling of exceptional values > > A recurring issue in low-level linear-algebra libraries is that they > may crash, freeze, or produce unexpected results when receiving > non-finite inputs. Of course as a wrapper library, NumPy/SciPy can't > control what kind of strange data our users might pass in, and we have > to produce sensible results regardless, so we sometimes accumulate > hacks like having to validate all incoming data for exceptional values > > We support the proposal's recommendation of "NaN interpretation (4)". > > We also note that contrary to the note under "Interpretation (1)"; in > the R statistical environment NaN does *not* represent missing data. R > maintains a clear distinction between "NA" (missing) values and "NaN" > (invalid) values. This is important for various statistical procedures > such as imputation, where you might want to estimate the true value of > an unmeasured variable, but you don't want to estimate the true value > of 0/0. For efficiency, they do perform some calculations on NA values > by storing them as NaNs with a specific payload; in the R context, > therefore, it's particularly valuable if operations **correctly > propagate NaN payloads**. NumPy does not yet provide first-class > missing value support, but we plan to add it within the next 1-2 > years, and when we do it seems likely that we'll have similar needs to > R. > > ### BLAS G2 > > NumPy has a rich type system, including 16-, 32-, and 64-bit floats > (plus extended precision on some platforms), a similar set of complex > values, and is further extensible with new types. We have a similarly > rich dispatch system for operations that attempts to find the best > matching optimized-loop for an arbitrary set of input types. We're > thus quite interested in the proposed mixed-type operations, and if > implemented we'll probably start transparently taking advantage of > them (i.e., in cases where we're currently upcasting to perform a > requested operation, we may switch to using the native precision > operation without requiring any changes in user code). > > Two comments, though: > > 1. The various tricks to make names less redundant (e.g., specifying > just one type if they're all the same) are convenient for those > calling these routines by hand, but rather inconvenient for those of > us who will be generating a table mapping different type combinations > to different functions. When designing a compression scheme for names, > please remember that the scheme will have to be unambiguously > implemented in code by multiple projects. Another option to consider: > ask vendors to implement the full "uncompressed" names, and then > provide a shim library mapping short "convenience" names to their full > form. > > 2. Regarding the suggestion that the naming scheme will allow for a > wide variety of differently typed routines, but that only some subset > will be implemented by any particular vendor: we are worried about > this subset business. For a library like NumPy that wants to wrap > "all" the provided routines, while supporting "all" the different BLAS > libraries ... how will this work? **How do we know which routines any > particular library provides?** What if new routines are added to the > list later? > > ### Reproducible BLAS > > This is interesting, but we don't have any particularly useful comments. > > ### Batch BLAS > > NumPy/SciPy actually provide batched operations in many cases, > currently implemented using a `for` loop around individual calls to > BLAS/LAPACK. However, our interface is relatively restricted compared > to some of the proposals considered here: generally all we need is the > ability to perform an operation on two "stacks" of size-homogenous > matrices, with the offset between matrices determined by an arbitrary > (potentially zero) stride. > > ### Fixed-point BLAS > > This is interesting, but we don't have any particularly useful comments. > > > -- > Nathaniel J. Smith -- https://vorpus.org > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > -------------- next part -------------- An HTML attachment was scrubbed... URL: From gael.varoquaux at normalesup.org Tue Jan 9 08:34:48 2018 From: gael.varoquaux at normalesup.org (Gael Varoquaux) Date: Tue, 9 Jan 2018 14:34:48 +0100 Subject: [SciPy-Dev] General discussion on parallelisation In-Reply-To: References: Message-ID: <20180109133448.GA832752@phare.normalesup.org> > 5. joblib.Parallel doesn't have a map method (desirable to allow 3) so a small joblib has a custom backend framework that can be used for such purpose (if I understnad you well): https://pythonhosted.org/joblib/parallel.html#custom-backend-api-experimental There are currently a Yarn and a dask.distributed backend that are getting better and better. > 6. joblib.Parallel creates/destroys a multiprocessing.Pool each time the > Parallel object is `__call__`ed. This leads to significant overhead. One can > use the Parallel object with a context manager, which allows reuse of the Pool, > but I don't think that's do-able in the context of using the > DifferentialEvolutionSolver (DES) object as an iterator: This is evolving. However, the reason behind this is that Pool get corrupted and lead to deadlock. Olivier Grisel and Thomas Moreau are working on fixing this in the Python standard library (first PR merged recently)! One of the vision of joblib is to provide very light mid-layer that can connect to multiprocessing and threading (though we are considering switching to concurrent.futures) as well as other backends. Hopefully this common language makes it easier to do things like embedding dask in numerical algorithms without a hard dependencies (yes we are working with the dask team on this). Ga?l From tyler.je.reddy at gmail.com Tue Jan 9 15:53:10 2018 From: tyler.je.reddy at gmail.com (Tyler Reddy) Date: Tue, 9 Jan 2018 13:53:10 -0700 Subject: [SciPy-Dev] RFC: comments to BLAS committee from numpy/scipy devs In-Reply-To: References: Message-ID: One common issue in computational geometry is the need to operate rapidly on arrays with "heterogeneous shapes." So, an array that has rows with different numbers of columns -- shape (1,3) for the first polygon and shape (1, 12) for the second polygon and so on. This seems like a particularly nasty scenario when the loss of "homogeneity" in shape precludes traditional vectorization -- I think numpy effectively converts these to dtype=object, etc. I don't think is necessarily a BLAS issue since wrapping comp. geo. libraries does happen in a subset of cases to handle this, but if there's overlap in utility you could pass it along I suppose. On 9 January 2018 at 04:40, Ilhan Polat wrote: > I couldn't find an item to place this but I think ilaenv and also calling > the function twice (one with lwork=-1 and reading the optimal block size > and the call the function again properly with lwork=) in LAPACK > needs to be gotten rid of. > > That's a major annoyance during the wrapping of LAPACK routines for SciPy. > > I don't know if this is realistic but the values ilaenv needed can be > computed once (or again if hardware is changed) at the install and can be > read off by the routines. > > > > On Jan 9, 2018 09:25, "Nathaniel Smith" wrote: > >> Hi all, >> >> As mentioned earlier [1][2], there's work underway to revise and >> update the BLAS standard -- e.g. we might get support for strided >> arrays and lose xerbla! There's a draft at [3]. They're interested in >> feedback from users, so I've written up a first draft of comments >> about what we would like as NumPy/SciPy developers. This is very much >> a first attempt -- I know we have lots of people who are more expert >> on BLAS than me on these lists :-). Please let me know what you think. >> >> -n >> >> [1] https://mail.python.org/pipermail/numpy-discussion/2017- >> November/077420.html >> [2] https://mail.python.org/pipermail/scipy-dev/2017-November/022267.html >> [3] https://docs.google.com/document/d/1DY4ImZT1coqri2382GusXgBT >> TTVdBDvtD5I14QHp9OE/edit >> >> ----- >> >> # Comments from NumPy / SciPy developers on "A Proposal for a >> Next-Generation BLAS" >> >> These are comments on [A Proposal for a Next-Generation >> BLAS](https://docs.google.com/document/d/1DY4ImZT1coqri2382G >> usXgBTTTVdBDvtD5I14QHp9OE/edit#) >> (version as of 2017-12-13), from the perspective of the developers of >> the NumPy and SciPy libraries. We hope this feedback is useful, and >> welcome further discussion. >> >> ## Who are we? >> >> NumPy and SciPy are the two foundational libraries of the Python >> numerical ecosystem, and one of their duties is to wrap BLAS and >> expose it for the use of other Python libraries. (NumPy primarily >> provides a GEMM wrapper, while SciPy exposes more specialized >> operations.) It's unclear how many users we have exactly, but we >> certainly ship multiple million copies of BLAS every month, and >> provide one of the most popular numerical toolkits for both novice and >> expert users. >> >> Looking at the original BLAS and LAPACK interfaces, it often seems >> that their imagined user is something like a classic supercomputer >> consumer, who writes code directly in Fortran or C against the BLAS >> API, and where the person writing the code and running the code are >> the same. NumPy/SciPy are coming from a very different perspective: >> our users generally know nothing about the details of the underlying >> BLAS; they just want to describe their problem in some high-level way, >> and the library is responsible for making it happen as efficiently as >> possible, and is often integrated into some larger system (e.g. a >> real-time analytics platform embedded in a web server). >> >> When it comes to our BLAS usage, we mostly use only a small subset of >> the routines. However, as "consumer software" used by a wide variety >> of users with differing degress of technical expertise, we're expected >> to Just Work on a wide variety of systems, and with as many different >> vendor BLAS libraries as possible. On the other hand, the fact that >> we're working with Python means we don't tend to worry about small >> inefficiencies that will be lost in the noise in any case, and are >> willing to sacrifice some performance to get more reliable operation >> across our diverse userbase. >> >> ## Comments on specific aspects of the proposal >> >> ### Data Layout >> >> We are **strongly in favor** of the proposal to support arbitrary >> strided data layouts. Ideally, this would support strides *specified >> in bytes* (allowing for unaligned data layouts), and allow for truly >> arbitrary strides, including *zero or negative* values. However, we >> think it's fine if some of the weirder cases suffer a performance >> penalty. >> >> Rationale: NumPy ? and thus, most of the scientific Python ecosystem ? >> only has one way of representing an array: the `numpy.ndarray` type, >> which is an arbitrary dimensional tensor with arbitrary strides. It is >> common to encounter matrices with non-trivial strides. For example:: >> >> # Make a 3-dimensional tensor, 10 x 9 x 8 >> t = np.zeros((10, 9, 8)) >> # Considering this as a stack of eight 10x9 matrices, extract the >> first: >> mat = t[:, :, 0] >> >> Now `mat` has non-trivial strides on both axes. (If running this in a >> Python interpreter, you can see this by looking at the value of >> `mat.strides`.) Another case where interesting strides arise is when >> performing ["broadcasting"](https://docs.scipy.org/doc/numpy-1.13.0/use >> r/basics.broadcasting.html), >> which is the name for NumPy's rules for stretching arrays to make >> their shapes match. For example, in an expression like:: >> >> np.array([1, 2, 3]) + 1 >> >> the scalar `1` is "broadcast" to create a vector `[1, 1, 1]`. This is >> accomplished without allocating memory, by creating a vector with >> settings length = 3, strides = 0 ? so all the elements share a single >> location in memory. Similarly, by using negative strides we can >> reverse an array without allocating memory:: >> >> a = np.array([1, 2, 3]) >> a_flipped = a[::-1] >> >> Now `a_flipped` has the value `[3, 2, 1]`, while sharing storage with >> the array `a = [1, 2, 3]`. Misaligned data is also possible (e.g. an >> array of 8-byte doubles with a 9-byte stride), though it arises more >> rarely. (An example of when it might occurs is in an on-disk data >> format that alternates between storing a double value and then a >> single byte value, which is then memory-mapped.) >> >> While this array representation is very flexible and useful, it makes >> interfacing with BLAS a challenge: how do you perform a GEMM operation >> on two arrays with arbitrary strides? Currently, NumPy attempts to >> detect a number of special cases: if the strides in both arrays imply >> a column-major layout, then call BLAS directly; if one of them has >> strides corresponding to a row-major layout, then set the >> corresponding `transA`/`transB` argument, etc. ? and if all else >> fails, either copy the data into a contiguous buffer, or else fall >> back on a naive triple-nested-loop GEMM implementation. (There's also >> a check where if we can determine through examining the arrays' data >> pointers and strides that they're actually transposes of each other, >> then we instead dispatch to SYRK.) >> >> So for us, native stride support in the BLAS has two major advantages: >> >> - It allows a wider variety of operations to be transparently handled >> using high-speed kernels. >> >> - It reduces the complexity of the NumPy/BLAS interface code. >> >> Any kind of stride support produces both of these advantages to some >> extent. The ideal would be if BLAS supports strides specified in bytes >> (not elements), and allowing truly arbitrary values, because in this >> case, we can simply pass through our arrays directly to the library >> without having to do any fiddly checking at all. Note that for these >> purposes, it's OK if "weird" arrays pay a speed penalty: if the BLAS >> didn't support them, we'd just have to fall back on some even worse >> strategy for handling them ourselves. And this approach would mean >> that if some BLAS vendors do at some point decide to optimize these >> cases, then our users will immediately see the advantages. >> >> ### Error-reporting >> >> We are **strongly in favor** of the draft's proposal to replace XERBLA >> with proper error codes. XERBLA is fine for one-off programs in >> Fortran, but awful for wrapper libraries like NumPy/SciPy. >> >> ### Handling of exceptional values >> >> A recurring issue in low-level linear-algebra libraries is that they >> may crash, freeze, or produce unexpected results when receiving >> non-finite inputs. Of course as a wrapper library, NumPy/SciPy can't >> control what kind of strange data our users might pass in, and we have >> to produce sensible results regardless, so we sometimes accumulate >> hacks like having to validate all incoming data for exceptional values >> >> We support the proposal's recommendation of "NaN interpretation (4)". >> >> We also note that contrary to the note under "Interpretation (1)"; in >> the R statistical environment NaN does *not* represent missing data. R >> maintains a clear distinction between "NA" (missing) values and "NaN" >> (invalid) values. This is important for various statistical procedures >> such as imputation, where you might want to estimate the true value of >> an unmeasured variable, but you don't want to estimate the true value >> of 0/0. For efficiency, they do perform some calculations on NA values >> by storing them as NaNs with a specific payload; in the R context, >> therefore, it's particularly valuable if operations **correctly >> propagate NaN payloads**. NumPy does not yet provide first-class >> missing value support, but we plan to add it within the next 1-2 >> years, and when we do it seems likely that we'll have similar needs to >> R. >> >> ### BLAS G2 >> >> NumPy has a rich type system, including 16-, 32-, and 64-bit floats >> (plus extended precision on some platforms), a similar set of complex >> values, and is further extensible with new types. We have a similarly >> rich dispatch system for operations that attempts to find the best >> matching optimized-loop for an arbitrary set of input types. We're >> thus quite interested in the proposed mixed-type operations, and if >> implemented we'll probably start transparently taking advantage of >> them (i.e., in cases where we're currently upcasting to perform a >> requested operation, we may switch to using the native precision >> operation without requiring any changes in user code). >> >> Two comments, though: >> >> 1. The various tricks to make names less redundant (e.g., specifying >> just one type if they're all the same) are convenient for those >> calling these routines by hand, but rather inconvenient for those of >> us who will be generating a table mapping different type combinations >> to different functions. When designing a compression scheme for names, >> please remember that the scheme will have to be unambiguously >> implemented in code by multiple projects. Another option to consider: >> ask vendors to implement the full "uncompressed" names, and then >> provide a shim library mapping short "convenience" names to their full >> form. >> >> 2. Regarding the suggestion that the naming scheme will allow for a >> wide variety of differently typed routines, but that only some subset >> will be implemented by any particular vendor: we are worried about >> this subset business. For a library like NumPy that wants to wrap >> "all" the provided routines, while supporting "all" the different BLAS >> libraries ... how will this work? **How do we know which routines any >> particular library provides?** What if new routines are added to the >> list later? >> >> ### Reproducible BLAS >> >> This is interesting, but we don't have any particularly useful comments. >> >> ### Batch BLAS >> >> NumPy/SciPy actually provide batched operations in many cases, >> currently implemented using a `for` loop around individual calls to >> BLAS/LAPACK. However, our interface is relatively restricted compared >> to some of the proposals considered here: generally all we need is the >> ability to perform an operation on two "stacks" of size-homogenous >> matrices, with the offset between matrices determined by an arbitrary >> (potentially zero) stride. >> >> ### Fixed-point BLAS >> >> This is interesting, but we don't have any particularly useful comments. >> >> >> -- >> Nathaniel J. Smith -- https://vorpus.org >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at python.org >> https://mail.python.org/mailman/listinfo/scipy-dev >> > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From andyfaff at gmail.com Tue Jan 9 19:01:05 2018 From: andyfaff at gmail.com (Andrew Nelson) Date: Wed, 10 Jan 2018 11:01:05 +1100 Subject: [SciPy-Dev] General discussion on parallelisation In-Reply-To: References: Message-ID: On 9 January 2018 at 13:48, Eric Larson wrote: >> >> If you can use `__iter__` instead of `__next__` in your DifferentialEvolutionSolver class I think you can avoid this problem with something along the lines of: > > class DifferentialEvolutionSolver(object): > ... > def __iter__(self): > with Parallel(...) ...: > for it in ...: > yield it > > As for the map problem I don't know, but it's probably worth asking the `joblib` people if there is a suitable solution or workaround. In the scheme of things I'd prefer to retain the use of `__next__`, see https://github.com/scipy/scipy/pull/6923 for the direction of where I think things are headed. -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Tue Jan 9 19:34:04 2018 From: njs at pobox.com (Nathaniel Smith) Date: Tue, 9 Jan 2018 16:34:04 -0800 Subject: [SciPy-Dev] RFC: comments to BLAS committee from numpy/scipy devs In-Reply-To: References: Message-ID: On Tue, Jan 9, 2018 at 3:40 AM, Ilhan Polat wrote: > I couldn't find an item to place this but I think ilaenv and also calling > the function twice (one with lwork=-1 and reading the optimal block size and > the call the function again properly with lwork=) in LAPACK needs to > be gotten rid of. > > That's a major annoyance during the wrapping of LAPACK routines for SciPy. > > I don't know if this is realistic but the values ilaenv needed can be > computed once (or again if hardware is changed) at the install and can be > read off by the routines. Unfortunately I think this effort is just to revise BLAS, not LAPACK. Maybe you should try starting a conversation with the LAPACK developers though ? I don't know much about how they work but maybe they'd be interested in feedback. -n -- Nathaniel J. Smith -- https://vorpus.org From njs at pobox.com Tue Jan 9 19:37:18 2018 From: njs at pobox.com (Nathaniel Smith) Date: Tue, 9 Jan 2018 16:37:18 -0800 Subject: [SciPy-Dev] RFC: comments to BLAS committee from numpy/scipy devs In-Reply-To: References: Message-ID: On Tue, Jan 9, 2018 at 12:53 PM, Tyler Reddy wrote: > One common issue in computational geometry is the need to operate rapidly on > arrays with "heterogeneous shapes." > > So, an array that has rows with different numbers of columns -- shape (1,3) > for the first polygon and shape (1, 12) for the second polygon and so on. > > This seems like a particularly nasty scenario when the loss of "homogeneity" > in shape precludes traditional vectorization -- I think numpy effectively > converts these to dtype=object, etc. I don't > think is necessarily a BLAS issue since wrapping comp. geo. libraries does > happen in a subset of cases to handle this, but if there's overlap in > utility you could pass it along I suppose. You might be interested in this discussion of "Batch BLAS": https://docs.google.com/document/d/1DY4ImZT1coqri2382GusXgBTTTVdBDvtD5I14QHp9OE/edit#heading=h.pvsif1mxvaqq I didn't get into it in the draft response, because it didn't seem like something where NumPy/SciPy have any useful experience to offer, but it sounds like there are people worrying about this case. -n -- Nathaniel J. Smith -- https://vorpus.org From andyfaff at gmail.com Wed Jan 10 00:08:42 2018 From: andyfaff at gmail.com (Andrew Nelson) Date: Wed, 10 Jan 2018 16:08:42 +1100 Subject: [SciPy-Dev] master branch failing on travis + appveyor at the moment Message-ID: https://travis-ci.org/scipy/scipy.svg?branch=master The doctests look to be the culprit. From what I can tell there are offending spaces causing the issue. I don't know why it's reared it's head now, probably some underlying package has received an update. A. -- _____________________________________ Dr. Andrew Nelson _____________________________________ -------------- next part -------------- An HTML attachment was scrubbed... URL: From larson.eric.d at gmail.com Wed Jan 10 01:06:34 2018 From: larson.eric.d at gmail.com (Eric Larson) Date: Wed, 10 Jan 2018 01:06:34 -0500 Subject: [SciPy-Dev] master branch failing on travis + appveyor at the moment In-Reply-To: References: Message-ID: NumPy 1.14 changed array printing behavior: https://github.com/numpy/numpy/blob/master/doc/release/1.14.0-notes.rst#many-changes-to-array-printing-disableable-with-the-new-legacy-printing-mode Eric On Wed, Jan 10, 2018 at 12:08 AM, Andrew Nelson wrote: > https://travis-ci.org/scipy/scipy.svg?branch=master > > The doctests look to be the culprit. From what I can tell there are > offending spaces causing the issue. I don't know why it's reared it's head > now, probably some underlying package has received an update. > > A. > > -- > _____________________________________ > Dr. Andrew Nelson > > > _____________________________________ > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From andyfaff at gmail.com Wed Jan 10 02:42:30 2018 From: andyfaff at gmail.com (Andrew Nelson) Date: Wed, 10 Jan 2018 18:42:30 +1100 Subject: [SciPy-Dev] master branch failing on travis + appveyor at the moment In-Reply-To: References:

Message-ID: I guess the right thing to do going forward is change all the doctests then. I'll have a bash at that tonight. On 10 Jan 2018 5:07 pm, "Eric Larson" wrote: > NumPy 1.14 changed array printing behavior: > > https://github.com/numpy/numpy/blob/master/doc/release/ > 1.14.0-notes.rst#many-changes-to-array-printing-disableable- > with-the-new-legacy-printing-mode > > Eric > > > On Wed, Jan 10, 2018 at 12:08 AM, Andrew Nelson > wrote: > >> https://travis-ci.org/scipy/scipy.svg?branch=master >> >> The doctests look to be the culprit. From what I can tell there are >> offending spaces causing the issue. I don't know why it's reared it's head >> now, probably some underlying package has received an update. >> >> A. >> >> -- >> _____________________________________ >> Dr. Andrew Nelson >> >> >> _____________________________________ >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at python.org >> https://mail.python.org/mailman/listinfo/scipy-dev >> >> > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From evgeny.burovskiy at gmail.com Wed Jan 10 02:58:36 2018 From: evgeny.burovskiy at gmail.com (Evgeni Burovski) Date: Wed, 10 Jan 2018 10:58:36 +0300 Subject: [SciPy-Dev] master branch failing on travis + appveyor at the moment In-Reply-To: References:

Message-ID: On Jan 10, 2018 10:42 AM, "Andrew Nelson" wrote: I guess the right thing to do going forward is change all the doctests then. I'll have a bash at that tonight. Might want to take a look at why refguide-check fails: it is supposed to understand the array syntax and compare arrays numerically. Also this should have been caught a while ago by the CI against the numpy master branch. There might be something to fix in the CI setup if that failed to trigger. On 10 Jan 2018 5:07 pm, "Eric Larson" wrote: > NumPy 1.14 changed array printing behavior: > > https://github.com/numpy/numpy/blob/master/doc/release/1.14. > 0-notes.rst#many-changes-to-array-printing-disableable-wit > h-the-new-legacy-printing-mode > > Eric > > > On Wed, Jan 10, 2018 at 12:08 AM, Andrew Nelson > wrote: > >> https://travis-ci.org/scipy/scipy.svg?branch=master >> >> The doctests look to be the culprit. From what I can tell there are >> offending spaces causing the issue. I don't know why it's reared it's head >> now, probably some underlying package has received an update. >> >> A. >> >> -- >> _____________________________________ >> Dr. Andrew Nelson >> >> >> _____________________________________ >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at python.org >> https://mail.python.org/mailman/listinfo/scipy-dev >> >> > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > _______________________________________________ SciPy-Dev mailing list SciPy-Dev at python.org https://mail.python.org/mailman/listinfo/scipy-dev -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Wed Jan 10 03:25:51 2018 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Wed, 10 Jan 2018 21:25:51 +1300 Subject: [SciPy-Dev] master branch failing on travis + appveyor at the moment In-Reply-To: References:

Message-ID: On Wed, Jan 10, 2018 at 8:58 PM, Evgeni Burovski wrote: > > > On Jan 10, 2018 10:42 AM, "Andrew Nelson" wrote: > > I guess the right thing to do going forward is change all the doctests > then. I'll have a bash at that tonight. > > > Might want to take a look at why refguide-check fails: it is supposed to > understand the array syntax and compare arrays numerically. > Agreed, shouldn't have failed. Changing the doctests as well may still be good for doc purposes (so it matches the latest numpy), assuming it's not too much work. > Also this should have been caught a while ago by the CI against the numpy > master branch. There might be something to fix in the CI setup if that > failed to trigger. > The refguide check is only run once, and that's against the last released numpy. It should only be run once, but we could consider using numpy master for it. Ralf > > On 10 Jan 2018 5:07 pm, "Eric Larson" wrote: > >> NumPy 1.14 changed array printing behavior: >> >> https://github.com/numpy/numpy/blob/master/doc/release/1.14. >> 0-notes.rst#many-changes-to-array-printing-disableable-with- >> the-new-legacy-printing-mode >> >> Eric >> >> >> On Wed, Jan 10, 2018 at 12:08 AM, Andrew Nelson >> wrote: >> >>> https://travis-ci.org/scipy/scipy.svg?branch=master >>> >>> The doctests look to be the culprit. From what I can tell there are >>> offending spaces causing the issue. I don't know why it's reared it's head >>> now, probably some underlying package has received an update. >>> >>> A. >>> >>> -- >>> _____________________________________ >>> Dr. Andrew Nelson >>> >>> >>> _____________________________________ >>> >>> _______________________________________________ >>> SciPy-Dev mailing list >>> SciPy-Dev at python.org >>> https://mail.python.org/mailman/listinfo/scipy-dev >>> >>> >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at python.org >> https://mail.python.org/mailman/listinfo/scipy-dev >> >> > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From andyfaff at gmail.com Wed Jan 10 03:31:23 2018 From: andyfaff at gmail.com (Andrew Nelson) Date: Wed, 10 Jan 2018 19:31:23 +1100 Subject: [SciPy-Dev] master branch failing on travis + appveyor at the moment In-Reply-To: References:

Message-ID: It's easy for me to change the doctests, but I'm not familiar with refguide_check.py. I'm going to leave that to someone else. A. On 10 January 2018 at 19:25, Ralf Gommers wrote: > > > On Wed, Jan 10, 2018 at 8:58 PM, Evgeni Burovski < > evgeny.burovskiy at gmail.com> wrote: > >> >> >> On Jan 10, 2018 10:42 AM, "Andrew Nelson" wrote: >> >> I guess the right thing to do going forward is change all the doctests >> then. I'll have a bash at that tonight. >> >> >> Might want to take a look at why refguide-check fails: it is supposed to >> understand the array syntax and compare arrays numerically. >> > > Agreed, shouldn't have failed. Changing the doctests as well may still be > good for doc purposes (so it matches the latest numpy), assuming it's not > too much work. > > >> Also this should have been caught a while ago by the CI against the numpy >> master branch. There might be something to fix in the CI setup if that >> failed to trigger. >> > > The refguide check is only run once, and that's against the last released > numpy. It should only be run once, but we could consider using numpy master > for it. > > Ralf > > >> >> On 10 Jan 2018 5:07 pm, "Eric Larson" wrote: >> >>> NumPy 1.14 changed array printing behavior: >>> >>> https://github.com/numpy/numpy/blob/master/doc/release/1.14. >>> 0-notes.rst#many-changes-to-array-printing-disableable-with- >>> the-new-legacy-printing-mode >>> >>> Eric >>> >>> >>> On Wed, Jan 10, 2018 at 12:08 AM, Andrew Nelson >>> wrote: >>> >>>> https://travis-ci.org/scipy/scipy.svg?branch=master >>>> >>>> The doctests look to be the culprit. From what I can tell there are >>>> offending spaces causing the issue. I don't know why it's reared it's head >>>> now, probably some underlying package has received an update. >>>> >>>> A. >>>> >>>> -- >>>> _____________________________________ >>>> Dr. Andrew Nelson >>>> >>>> >>>> _____________________________________ >>>> >>>> _______________________________________________ >>>> SciPy-Dev mailing list >>>> SciPy-Dev at python.org >>>> https://mail.python.org/mailman/listinfo/scipy-dev >>>> >>>> >>> >>> _______________________________________________ >>> SciPy-Dev mailing list >>> SciPy-Dev at python.org >>> https://mail.python.org/mailman/listinfo/scipy-dev >>> >>> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at python.org >> https://mail.python.org/mailman/listinfo/scipy-dev >> >> >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at python.org >> https://mail.python.org/mailman/listinfo/scipy-dev >> >> > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -- _____________________________________ Dr. Andrew Nelson _____________________________________ -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Wed Jan 10 04:42:31 2018 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Wed, 10 Jan 2018 22:42:31 +1300 Subject: [SciPy-Dev] GSoC'18 participation? Message-ID: Hi all, The GSoC schedule is a bit earlier than normal this year. The PSF is asking for ideas pages to be up and in decent shape by Jan 19th. So we'll need to come up with some content quick if we want to participate. Who is interested in mentoring this year? I'm happy to do the admin again, but probably won't have time to mentor. Cheers, Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From jomsdev at gmail.com Wed Jan 10 07:02:33 2018 From: jomsdev at gmail.com (Jordi Montes) Date: Wed, 10 Jan 2018 13:02:33 +0100 Subject: [SciPy-Dev] GSoC'18 participation? In-Reply-To: References: Message-ID: Hi all, Last year I started implementing some methods for Randomized Numerical Linear Algebra (RNLA) in scipy. By now it is only the CountMin Sketch (clarkson_woodruff_transformation) for reducing the dimensionality of a vector space to an embedded space. I think that it would be interesting to add to scipy other methods for subspace embedding (like the Johnson-Lindenstrauss) and build some algorithms on top of it for things like least squeres or low rank approximation. Would some other people be interesting in this? PS: I have a project called RandNLA where I implemented some of the methods of RNLA. The idea is to implement only the most important methods of RNLA in scipy and have this other library for experimenting with new methods and APIs. That will let us not overloading scipy with features if people are not interested in them and focus on the ones that really brings value to the community. Thanks, Jordi. On 10 January 2018 at 10:42, Ralf Gommers wrote: > Hi all, > > The GSoC schedule is a bit earlier than normal this year. The PSF is > asking for ideas pages to be up and in decent shape by Jan 19th. So we'll > need to come up with some content quick if we want to participate. > > Who is interested in mentoring this year? > > I'm happy to do the admin again, but probably won't have time to mentor. > > Cheers, > Ralf > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Wed Jan 10 13:24:01 2018 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Thu, 11 Jan 2018 07:24:01 +1300 Subject: [SciPy-Dev] GSoC'18 participation? In-Reply-To: References:

Message-ID: On Thu, Jan 11, 2018 at 1:02 AM, Jordi Montes wrote: > Hi all, > > Last year I started implementing some methods for Randomized Numerical > Linear Algebra (RNLA) in scipy. > By now it is only the CountMin Sketch (clarkson_woodruff_transformation) > for reducing the dimensionality of a vector space to an embedded space. > > I think that it would be interesting to add to scipy other methods for > subspace embedding (like the Johnson-Lindenstrauss) and build some > algorithms on top of it for things like least squeres or low rank > approximation. > > Would some other people be interesting in this? > > > PS: I have a project called RandNLA > where I implemented some of the methods of RNLA. The idea is to implement > only the most important methods of RNLA in scipy and have this other > library for experimenting with new methods and APIs. That will let us not > overloading scipy with features if people are not interested in them and > focus on the ones that really brings value to the community. > That sounds like a good approach. Worth a discussion in a separate thread or as a draft enhancement proposal / GSoC idea to detail out which would be those most important methods. Would you be interested in mentoring then? Ralf > Thanks, > > Jordi. > > On 10 January 2018 at 10:42, Ralf Gommers wrote: > >> Hi all, >> >> The GSoC schedule is a bit earlier than normal this year. The PSF is >> asking for ideas pages to be up and in decent shape by Jan 19th. So we'll >> need to come up with some content quick if we want to participate. >> >> Who is interested in mentoring this year? >> >> I'm happy to do the admin again, but probably won't have time to mentor. >> >> Cheers, >> Ralf >> >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at python.org >> https://mail.python.org/mailman/listinfo/scipy-dev >> >> > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From jomsdev at gmail.com Wed Jan 10 14:16:20 2018 From: jomsdev at gmail.com (Jordi Montes) Date: Wed, 10 Jan 2018 20:16:20 +0100 Subject: [SciPy-Dev] GSoC'18 participation? In-Reply-To: References:

Message-ID: Hello Ralf, I have been working on these methods before. I do not have any problem writing down a draft developing the idea and starting a discussion about the scope after. However, I had in mind to be the student and not the mentor. This is my last year as Master student so it is my last chance for participating in a GSoC. I am sure that we can find good mentors for this initiative who are more expert in the field than myself. Jordi. On 10 January 2018 at 19:24, Ralf Gommers wrote: > > > On Thu, Jan 11, 2018 at 1:02 AM, Jordi Montes wrote: > >> Hi all, >> >> Last year I started implementing some methods for Randomized Numerical >> Linear Algebra (RNLA) in scipy. >> By now it is only the CountMin Sketch (clarkson_woodruff_transformation) >> for reducing the dimensionality of a vector space to an embedded space. >> >> I think that it would be interesting to add to scipy other methods for >> subspace embedding (like the Johnson-Lindenstrauss) and build some >> algorithms on top of it for things like least squeres or low rank >> approximation. >> >> Would some other people be interesting in this? >> >> >> PS: I have a project called RandNLA >> where I implemented some of the methods of RNLA. The idea is to implement >> only the most important methods of RNLA in scipy and have this other >> library for experimenting with new methods and APIs. That will let us not >> overloading scipy with features if people are not interested in them and >> focus on the ones that really brings value to the community. >> > > That sounds like a good approach. Worth a discussion in a separate thread > or as a draft enhancement proposal / GSoC idea to detail out which would be > those most important methods. > > Would you be interested in mentoring then? > > Ralf > > >> Thanks, >> >> Jordi. >> >> On 10 January 2018 at 10:42, Ralf Gommers wrote: >> >>> Hi all, >>> >>> The GSoC schedule is a bit earlier than normal this year. The PSF is >>> asking for ideas pages to be up and in decent shape by Jan 19th. So we'll >>> need to come up with some content quick if we want to participate. >>> >>> Who is interested in mentoring this year? >>> >>> I'm happy to do the admin again, but probably won't have time to mentor. >>> >>> Cheers, >>> Ralf >>> >>> >>> _______________________________________________ >>> SciPy-Dev mailing list >>> SciPy-Dev at python.org >>> https://mail.python.org/mailman/listinfo/scipy-dev >>> >>> >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at python.org >> https://mail.python.org/mailman/listinfo/scipy-dev >> >> > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Wed Jan 10 14:34:11 2018 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Thu, 11 Jan 2018 08:34:11 +1300 Subject: [SciPy-Dev] GSoC'18 participation? In-Reply-To: References:

Message-ID: On Thu, Jan 11, 2018 at 8:16 AM, Jordi Montes wrote: > Hello Ralf, > > I have been working on these methods before. I do not have any problem > writing down a draft developing the idea and starting a discussion about > the scope after. > > However, I had in mind to be the student and not the mentor. > Okay, that's great! This is my last year as Master student so it is my last chance for > participating in a GSoC. I am sure that we can find good mentors for this > initiative who are more expert in the field than myself. > If we can pair one domain expert with one of the core developers, that would be a good mentoring config. Ralf > > Jordi. > > On 10 January 2018 at 19:24, Ralf Gommers wrote: > >> >> >> On Thu, Jan 11, 2018 at 1:02 AM, Jordi Montes wrote: >> >>> Hi all, >>> >>> Last year I started implementing some methods for Randomized Numerical >>> Linear Algebra (RNLA) in scipy. >>> By now it is only the CountMin Sketch (clarkson_woodruff_transformation) >>> for reducing the dimensionality of a vector space to an embedded space. >>> >>> I think that it would be interesting to add to scipy other methods for >>> subspace embedding (like the Johnson-Lindenstrauss) and build some >>> algorithms on top of it for things like least squeres or low rank >>> approximation. >>> >>> Would some other people be interesting in this? >>> >>> >>> PS: I have a project called RandNLA >>> where I implemented some of the methods of RNLA. The idea is to implement >>> only the most important methods of RNLA in scipy and have this other >>> library for experimenting with new methods and APIs. That will let us not >>> overloading scipy with features if people are not interested in them and >>> focus on the ones that really brings value to the community. >>> >> >> That sounds like a good approach. Worth a discussion in a separate thread >> or as a draft enhancement proposal / GSoC idea to detail out which would be >> those most important methods. >> >> Would you be interested in mentoring then? >> >> Ralf >> >> >>> Thanks, >>> >>> Jordi. >>> >>> On 10 January 2018 at 10:42, Ralf Gommers >>> wrote: >>> >>>> Hi all, >>>> >>>> The GSoC schedule is a bit earlier than normal this year. The PSF is >>>> asking for ideas pages to be up and in decent shape by Jan 19th. So we'll >>>> need to come up with some content quick if we want to participate. >>>> >>>> Who is interested in mentoring this year? >>>> >>>> I'm happy to do the admin again, but probably won't have time to mentor. >>>> >>>> Cheers, >>>> Ralf >>>> >>>> >>>> _______________________________________________ >>>> SciPy-Dev mailing list >>>> SciPy-Dev at python.org >>>> https://mail.python.org/mailman/listinfo/scipy-dev >>>> >>>> >>> >>> _______________________________________________ >>> SciPy-Dev mailing list >>> SciPy-Dev at python.org >>> https://mail.python.org/mailman/listinfo/scipy-dev >>> >>> >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at python.org >> https://mail.python.org/mailman/listinfo/scipy-dev >> >> > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Sat Jan 13 03:30:12 2018 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Sat, 13 Jan 2018 21:30:12 +1300 Subject: [SciPy-Dev] Restructuring flapack.src.pyf file In-Reply-To: References: <3sm3kp.p1jh54.rthztg-qmf@smtp.gmail.com> Message-ID: On Wed, Dec 27, 2017 at 2:40 AM, Ilhan Polat wrote: > The situation, as usual, turns out a bit more involved > > In the discussion, https://github.com/scipy/scipy/issues/6502 Tiziano > writes > > "Hi, I am the author of the original wrappers for those LAPACK routines. > If I remember correctly, the reason for hiding the eigenvalues in an > interval option was simply that the original implementation of > scipy.linalg.eigh did not even allow to limit the number of > eigenvalues/eigenvectors returned. I needed only the possiblity to define > the indices of the eigenvalues and not the range, and the result is the > option eigvals to scipy.linalg.eigh. I think that your wrapper is too > generic. You should not allow setting from Python the values of things like > lwork: a wrong value there and the user can generate a segfault from > Python, which is a no-no." > > and I can see more LAPACK routines that are wrapped along this line. > Having 8 more "xxxx_full" and "xxxx_full_lwork" routines doesn't seem to be > elegant though I don't have any alternative yet. Maybe it is OK to have > them for a few versions but I guess it would be nice if we have a parallel > running deprecation plans for this issue. It looks like _np.deprecate works > pretty OK. And we have four deprecations already in lapack.py e.g., > > cgegv = _np.deprecate(cgegv, old_name='cgegv', message=_dep_message) > > This will print a deprecation warning when it is called or docstring is > printed. "DeprecationWarning: `dgegv` is deprecated! The `*gegv` family of > routines has been deprecated in LAPACK 3.6.0 in favor of the `*ggev` family > of routines.The corresponding wrappers will be removed from SciPy in a > future release." So I would like to deprecate the routines abouve with a > message > > "The *XXXXX family of routines will have a different signature in the > future versions of SciPy. The new versions are now available as > *XXXXX_full and will eventually replace the original *XXXXX family" > > Would you dis/agree? > Just wondering about the tradeoff between deprecating + having a new function with arguments in the same order as LAPACK, vs. just appending the keywords to the existing function. The latter case would need a thin Python function in lapack.py, but unless that hurts performance significantly I'm not sure that's much of an issue. Ralf > > > > On Mon, Dec 25, 2017 at 11:58 PM, wrote: > >> Hi, >> >> Ilhan Polat kirjoitti tiistai 26. joulukuuta 2017: >> [clip] >> > I've done a similar thing by freeing out all ins and outs. It seems to >> me >> > that the signature change is inevitable but the question is how to >> handle >> > this? >> >> New function sounds best, maximally unimaginative new name is probably >> most appropriate. >> >> Not sure how well np.deprecate etc python level wrappers work on f2py. >> >> >> > 2) It looks like flapack.src.pyf file is getting exceedingly long and >> > unstructured. Should we break it apart just like LAPACK documentation? >> >> If you think that helps it significantly and is worthwhile use of your >> time then why not. (I suspect it's not particularly worthwhile but ymmv.) >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at python.org >> https://mail.python.org/mailman/listinfo/scipy-dev >> > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ilhanpolat at gmail.com Sat Jan 13 05:16:30 2018 From: ilhanpolat at gmail.com (Ilhan Polat) Date: Sat, 13 Jan 2018 11:16:30 +0100 Subject: [SciPy-Dev] Restructuring flapack.src.pyf file In-Reply-To: References: <3sm3kp.p1jh54.rthztg-qmf@smtp.gmail.com>

Message-ID: Most of them would indeed just gain new keywords xxgvd, xxgv, and so on. These would only get lwork and liwork keywords. No deprecations needed. ?(sy/he)evr on the other hand not only gets these extra keywords but also would gain two more output arguments which is actually the reason for this thread. So fortunately the amount of deprecation is "a few"~ ssyevr/dsyevr/cheevr/zheevr so far. These are used in the regular case of scipy.linalg.eigh and was causing problems due to unoptimized lwork values. Also they were not completely exposed.hence the new ouput args. I would like to start a separate discussion about the deprecation of keywords "eigvals" and "turbo" via a decorator later once this is done. Because now we are able to give "eig_range" as an input if need be as a new functionality. And "eigvals" does not correspond to giving an index range say, "eig_index". But that will have to wait until this surgery on the LAPACK side. On Jan 13, 2018 09:31, "Ralf Gommers" wrote: > > > On Wed, Dec 27, 2017 at 2:40 AM, Ilhan Polat wrote: > >> The situation, as usual, turns out a bit more involved >> >> In the discussion, https://github.com/scipy/scipy/issues/6502 Tiziano >> writes >> >> "Hi, I am the author of the original wrappers for those LAPACK routines. >> If I remember correctly, the reason for hiding the eigenvalues in an >> interval option was simply that the original implementation of >> scipy.linalg.eigh did not even allow to limit the number of >> eigenvalues/eigenvectors returned. I needed only the possiblity to define >> the indices of the eigenvalues and not the range, and the result is the >> option eigvals to scipy.linalg.eigh. I think that your wrapper is too >> generic. You should not allow setting from Python the values of things like >> lwork: a wrong value there and the user can generate a segfault from >> Python, which is a no-no." >> >> and I can see more LAPACK routines that are wrapped along this line. >> Having 8 more "xxxx_full" and "xxxx_full_lwork" routines doesn't seem to be >> elegant though I don't have any alternative yet. Maybe it is OK to have >> them for a few versions but I guess it would be nice if we have a parallel >> running deprecation plans for this issue. It looks like _np.deprecate works >> pretty OK. And we have four deprecations already in lapack.py e.g., >> >> cgegv = _np.deprecate(cgegv, old_name='cgegv', message=_dep_message) >> >> This will print a deprecation warning when it is called or docstring is >> printed. "DeprecationWarning: `dgegv` is deprecated! The `*gegv` family of >> routines has been deprecated in LAPACK 3.6.0 in favor of the `*ggev` family >> of routines.The corresponding wrappers will be removed from SciPy in a >> future release." So I would like to deprecate the routines abouve with a >> message >> >> "The *XXXXX family of routines will have a different signature in the >> future versions of SciPy. The new versions are now available as >> *XXXXX_full and will eventually replace the original *XXXXX family" >> >> Would you dis/agree? >> > > Just wondering about the tradeoff between deprecating + having a new > function with arguments in the same order as LAPACK, vs. just appending the > keywords to the existing function. The latter case would need a thin Python > function in lapack.py, but unless that hurts performance significantly I'm > not sure that's much of an issue. > > Ralf > > > > >> >> >> >> On Mon, Dec 25, 2017 at 11:58 PM, wrote: >> >>> Hi, >>> >>> Ilhan Polat kirjoitti tiistai 26. joulukuuta 2017: >>> [clip] >>> > I've done a similar thing by freeing out all ins and outs. It seems to >>> me >>> > that the signature change is inevitable but the question is how to >>> handle >>> > this? >>> >>> New function sounds best, maximally unimaginative new name is probably >>> most appropriate. >>> >>> Not sure how well np.deprecate etc python level wrappers work on f2py. >>> >>> >>> > 2) It looks like flapack.src.pyf file is getting exceedingly long and >>> > unstructured. Should we break it apart just like LAPACK documentation? >>> >>> If you think that helps it significantly and is worthwhile use of your >>> time then why not. (I suspect it's not particularly worthwhile but ymmv.) >>> _______________________________________________ >>> SciPy-Dev mailing list >>> SciPy-Dev at python.org >>> https://mail.python.org/mailman/listinfo/scipy-dev >>> >> >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at python.org >> https://mail.python.org/mailman/listinfo/scipy-dev >> >> > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Fri Jan 19 04:15:53 2018 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Fri, 19 Jan 2018 22:15:53 +1300 Subject: [SciPy-Dev] GSoC'18 participation? In-Reply-To: References:

Message-ID: Hi all, I have created https://github.com/scipy/scipy/wiki/GSoC-2018-project-ideas and added Jordi's idea to it. If anyone would be interested in co-mentoring this topic, please add your name. Also if you have other ideas and are interested to mentor those if a good student comes along, please add! I'll send the link through to the PSF organisers so we get on the list. Clearly we're having a bit of an issue with bandwidth of the usual suspects for mentoring (and PR reviewing) right now, so while we will keep the door open to participate we're going to have to be quite selective with students. Cheers, Ralf On Thu, Jan 11, 2018 at 1:02 AM, Jordi Montes wrote: > Hi all, > > Last year I started implementing some methods for Randomized Numerical > Linear Algebra (RNLA) in scipy. > By now it is only the CountMin Sketch (clarkson_woodruff_transformation) > for reducing the dimensionality of a vector space to an embedded space. > > I think that it would be interesting to add to scipy other methods for > subspace embedding (like the Johnson-Lindenstrauss) and build some > algorithms on top of it for things like least squeres or low rank > approximation. > > Would some other people be interesting in this? > > > PS: I have a project called RandNLA > where I implemented some of the methods of RNLA. The idea is to implement > only the most important methods of RNLA in scipy and have this other > library for experimenting with new methods and APIs. That will let us not > overloading scipy with features if people are not interested in them and > focus on the ones that really brings value to the community. > > Thanks, > > Jordi. > > On 10 January 2018 at 10:42, Ralf Gommers wrote: > >> Hi all, >> >> The GSoC schedule is a bit earlier than normal this year. The PSF is >> asking for ideas pages to be up and in decent shape by Jan 19th. So we'll >> need to come up with some content quick if we want to participate. >> >> Who is interested in mentoring this year? >> >> I'm happy to do the admin again, but probably won't have time to mentor. >> >> Cheers, >> Ralf >> >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at python.org >> https://mail.python.org/mailman/listinfo/scipy-dev >> >> > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From nikolay.mayorov at zoho.com Fri Jan 19 10:11:31 2018 From: nikolay.mayorov at zoho.com (Nikolay Mayorov) Date: Fri, 19 Jan 2018 20:11:31 +0500 Subject: [SciPy-Dev] GSoC'18 participation? Message-ID: <1610ef99097.100af0851154280.4991198338195760990@zoho.com> Hi! I have this idea, which I'm well familiar with. The module would be called like scipy.spatial.rotation and be devoted to the rotation formalism in 3 dimensions. The main objects are Euler angles (and their variations), direction cosine matrices, quaternions and rotation vectors. We can go with an abstraction class Rotation (using DCMs or Qs internally), but we should be able to create that from any representation and see it in any representation. In spirit of scipy/numpy we use vectorized/bulk approaches (i.e. many rotations in single Rotation class). Rotation should support 2 operations: compose 2 consecutive rotations and rotate/project a 3d vector.??Of course all procedures must be 100% robust and there are some fine points, especially in conversions between representations. Also we can add some algorithms, like: quaternion interpolation (SLERP), least-squares vector matching by a rotation (Whabba's problem), more advanced and less known algotithms for rotation interpolation, and I will try to come up with something more. Overall it seems reasonably straightforward , but with enough challenges in design and implementation As currently described, it might be not enough volume for the GSoC, but we can develop it farther.? Also I'm not sure if its applicability is broad enough to include it into scipy. I believe similar functionality is available in Aerospace toolbox in Matlab. I want to hear some opinions on that. Nikolay ---- On Wed, 10 Jan 2018 17:02:33 +0500 jomsdev at gmail.com wrote ---- Hi all, Last year I started implementing some methods for Randomized Numerical Linear Algebra (RNLA) in scipy. By now it is only the CountMin Sketch (clarkson_woodruff_transformation) for reducing the dimensionality of a vector space to an embedded space. I think that it would be interesting to add to scipy other methods for subspace embedding (like the Johnson-Lindenstrauss) and build some algorithms on top of it for things like least squeres or low rank approximation. Would some other people be interesting in this? PS: I have a project called RandNLA where I implemented some of the methods of RNLA. The idea is to implement only the most important methods of RNLA in scipy and have this other library for experimenting with new methods and APIs. That will let us not overloading scipy with features if people are not interested in them and focus on the ones that really brings value to the community. Thanks, Jordi. On 10 January 2018 at 10:42, Ralf Gommers wrote: Hi all, The GSoC schedule is a bit earlier than normal this year. The PSF is asking for ideas pages to be up and in decent shape by Jan 19th. So we'll need to come up with some content quick if we want to participate. Who is interested in mentoring this year? I'm happy to do the admin again, but probably won't have time to mentor. Cheers, Ralf _______________________________________________ SciPy-Dev mailing list SciPy-Dev at python.org https://mail.python.org/mailman/listinfo/scipy-dev _______________________________________________ SciPy-Dev mailing list SciPy-Dev at python.org https://mail.python.org/mailman/listinfo/scipy-dev -------------- next part -------------- An HTML attachment was scrubbed... URL: From larson.eric.d at gmail.com Fri Jan 19 10:25:17 2018 From: larson.eric.d at gmail.com (Eric Larson) Date: Fri, 19 Jan 2018 10:25:17 -0500 Subject: [SciPy-Dev] GSoC'18 participation? In-Reply-To: <1610ef99097.100af0851154280.4991198338195760990@zoho.com> References: <1610ef99097.100af0851154280.4991198338195760990@zoho.com> Message-ID: I have personally run into the need for such transformations in two separate domains (3D visualization, neuroscience / electrophysiology) and I know it's used in multiple other places, too. So I think it would be sufficiently general. I'd look forward to having it in SciPy! I'd be happy to be a secondary mentor on this if you (or someone else) wants to be primary. Best, Eric On Fri, Jan 19, 2018 at 10:11 AM, Nikolay Mayorov wrote: > Hi! > > I have this idea, which I'm well familiar with. The module would be called > like scipy.spatial.rotation and be devoted to the rotation formalism in 3 > dimensions. > > The main objects are Euler angles (and their variations), direction cosine > matrices, quaternions and rotation vectors. We can go with an abstraction > class Rotation (using DCMs or Qs internally), but we should be able to > create that from any representation and see it in any representation. In > spirit of scipy/numpy we use vectorized/bulk approaches (i.e. many > rotations in single Rotation class). > > Rotation should support 2 operations: compose 2 consecutive rotations and > rotate/project a 3d vector. Of course all procedures must be 100% robust > and there are some fine points, especially in conversions between > representations. > Also we can add some algorithms, like: quaternion interpolation (SLERP), > least-squares vector matching by a rotation (Whabba's problem), more > advanced and less known algotithms for rotation interpolation, and I will > try to come up with something more. > > Overall it seems reasonably straightforward , but with enough challenges > in design and implementation > > As currently described, it might be not enough volume for the GSoC, but we > can develop it farther. > > Also I'm not sure if its applicability is broad enough to include it into > scipy. I believe similar functionality is available in Aerospace toolbox in > Matlab. I want to hear some opinions on that. > > Nikolay > > > ---- On Wed, 10 Jan 2018 17:02:33 +0500 * jomsdev at gmail.com > * wrote ---- > > Hi all, > > Last year I started implementing some methods for Randomized Numerical > Linear Algebra (RNLA) in scipy. > By now it is only the CountMin Sketch (clarkson_woodruff_transformation) > for reducing the dimensionality of a vector space to an embedded space. > > I think that it would be interesting to add to scipy other methods for > subspace embedding (like the Johnson-Lindenstrauss) and build some > algorithms on top of it for things like least squeres or low rank > approximation. > > Would some other people be interesting in this? > > > PS: I have a project called RandNLA > where I implemented some of the methods of RNLA. The idea is to implement > only the most important methods of RNLA in scipy and have this other > library for experimenting with new methods and APIs. That will let us not > overloading scipy with features if people are not interested in them and > focus on the ones that really brings value to the community. > > Thanks, > > Jordi. > > On 10 January 2018 at 10:42, Ralf Gommers wrote: > > Hi all, > > The GSoC schedule is a bit earlier than normal this year. The PSF is > asking for ideas pages to be up and in decent shape by Jan 19th. So we'll > need to come up with some content quick if we want to participate. > > Who is interested in mentoring this year? > > I'm happy to do the admin again, but probably won't have time to mentor. > > Cheers, > Ralf > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From nikolay.mayorov at zoho.com Fri Jan 19 12:00:46 2018 From: nikolay.mayorov at zoho.com (Nikolay Mayorov) Date: Fri, 19 Jan 2018 22:00:46 +0500 Subject: [SciPy-Dev] GSoC'18 participation? In-Reply-To: References: <1610ef99097.100af0851154280.4991198338195760990@zoho.com> Message-ID: <1610f5d92cc.116a5cf7615287.4034716040473351782@zoho.com> Hi, Eric! Thank you for the feedback. You mentioned 3D visualization in which, as far as I know, 4D homogeneous coordinates are commonly used. But we shouldn't go in this direction and consider 3D vectors and rotations only, right? Great that you are interested in mentoring this project as well. I will add the idea in a basic form to the wiki. Generally I think we should leave some space for a student to investigate, instead of specifying absolutely everything. Do you agree? Best, Nikolay ---- On Fri, 19 Jan 2018 20:25:17 +0500 Eric Larson <larson.eric.d at gmail.com> wrote ---- I have personally run into the need for such transformations in two separate domains (3D visualization, neuroscience / electrophysiology) and I know it's used in multiple other places, too. So I think it would be sufficiently general. I'd look forward to having it in SciPy! I'd be happy to be a secondary mentor on this if you (or someone else) wants to be primary. Best, Eric On Fri, Jan 19, 2018 at 10:11 AM, Nikolay Mayorov <nikolay.mayorov at zoho.com> wrote: _______________________________________________ SciPy-Dev mailing list SciPy-Dev at python.org https://mail.python.org/mailman/listinfo/scipy-dev Hi! I have this idea, which I'm well familiar with. The module would be called like scipy.spatial.rotation and be devoted to the rotation formalism in 3 dimensions. The main objects are Euler angles (and their variations), direction cosine matrices, quaternions and rotation vectors. We can go with an abstraction class Rotation (using DCMs or Qs internally), but we should be able to create that from any representation and see it in any representation. In spirit of scipy/numpy we use vectorized/bulk approaches (i.e. many rotations in single Rotation class). Rotation should support 2 operations: compose 2 consecutive rotations and rotate/project a 3d vector. Of course all procedures must be 100% robust and there are some fine points, especially in conversions between representations. Also we can add some algorithms, like: quaternion interpolation (SLERP), least-squares vector matching by a rotation (Whabba's problem), more advanced and less known algotithms for rotation interpolation, and I will try to come up with something more. Overall it seems reasonably straightforward , but with enough challenges in design and implementation As currently described, it might be not enough volume for the GSoC, but we can develop it farther. Also I'm not sure if its applicability is broad enough to include it into scipy. I believe similar functionality is available in Aerospace toolbox in Matlab. I want to hear some opinions on that. Nikolay ---- On Wed, 10 Jan 2018 17:02:33 +0500 jomsdev at gmail.com wrote ---- Hi all, Last year I started implementing some methods for Randomized Numerical Linear Algebra (RNLA) in scipy. By now it is only the CountMin Sketch (clarkson_woodruff_transformation) for reducing the dimensionality of a vector space to an embedded space. I think that it would be interesting to add to scipy other methods for subspace embedding (like the Johnson-Lindenstrauss) and build some algorithms on top of it for things like least squeres or low rank approximation. Would some other people be interesting in this? PS: I have a project called RandNLA where I implemented some of the methods of RNLA. The idea is to implement only the most important methods of RNLA in scipy and have this other library for experimenting with new methods and APIs. That will let us not overloading scipy with features if people are not interested in them and focus on the ones that really brings value to the community. Thanks, Jordi. On 10 January 2018 at 10:42, Ralf Gommers <ralf.gommers at gmail.com> wrote: Hi all, The GSoC schedule is a bit earlier than normal this year. The PSF is asking for ideas pages to be up and in decent shape by Jan 19th. So we'll need to come up with some content quick if we want to participate. Who is interested in mentoring this year? I'm happy to do the admin again, but probably won't have time to mentor. Cheers, Ralf _______________________________________________ SciPy-Dev mailing list SciPy-Dev at python.org https://mail.python.org/mailman/listinfo/scipy-dev _______________________________________________ SciPy-Dev mailing list SciPy-Dev at python.org https://mail.python.org/mailman/listinfo/scipy-dev _______________________________________________ SciPy-Dev mailing list SciPy-Dev at python.org https://mail.python.org/mailman/listinfo/scipy-dev -------------- next part -------------- An HTML attachment was scrubbed... URL: From larson.eric.d at gmail.com Fri Jan 19 12:18:27 2018 From: larson.eric.d at gmail.com (Eric Larson) Date: Fri, 19 Jan 2018 12:18:27 -0500 Subject: [SciPy-Dev] GSoC'18 participation? In-Reply-To: <1610f5d92cc.116a5cf7615287.4034716040473351782@zoho.com> References: <1610ef99097.100af0851154280.4991198338195760990@zoho.com> <1610f5d92cc.116a5cf7615287.4034716040473351782@zoho.com> Message-ID: > > Thank you for the feedback. You mentioned 3D visualization in which, as > far as I know, 4D homogeneous coordinates are commonly used. But we > shouldn't go in this direction and consider 3D vectors and rotations only, > right? > Yeah at least at first (for GSoC or first implementation) this makes sense to me. In viz I ran into needing quaternions specifically. In neuroscience I've needed that and also e.g. computing the angle between two rotation matrices (or quaternions), among other things. Great that you are interested in mentoring this project as well. I will add > the idea in a basic form to the wiki. Generally I think we should leave > some space for a student to investigate, instead of specifying absolutely > everything. Do you agree? > Yes that sounds reasonable to me. If anyone thinks this isn't within the scope of SciPy, speak now or forever hold your peace :) Cheers, Eric -------------- next part -------------- An HTML attachment was scrubbed... URL: From mark.campanelli at gmail.com Fri Jan 19 14:58:03 2018 From: mark.campanelli at gmail.com (Mark Campanelli) Date: Fri, 19 Jan 2018 19:58:03 +0000 Subject: [SciPy-Dev] GSoC'18 participation? In-Reply-To: References: <1610ef99097.100af0851154280.4991198338195760990@zoho.com> <1610f5d92cc.116a5cf7615287.4034716040473351782@zoho.com> Message-ID: Anyone familiar with Geometric Algebra? Might be worth consideration during preliminary discussions. For example, thinks looks potentially relevant: https://dl.acm.org/citation.cfm?id=1610323 On Fri, Jan 19, 2018 at 10:19 Eric Larson wrote: > Thank you for the feedback. You mentioned 3D visualization in which, as >> far as I know, 4D homogeneous coordinates are commonly used. But we >> shouldn't go in this direction and consider 3D vectors and rotations only, >> right? >> > > Yeah at least at first (for GSoC or first implementation) this makes sense > to me. In viz I ran into needing quaternions specifically. In neuroscience > I've needed that and also e.g. computing the angle between two rotation > matrices (or quaternions), among other things. > > Great that you are interested in mentoring this project as well. I will >> add the idea in a basic form to the wiki. Generally I think we should leave >> some space for a student to investigate, instead of specifying absolutely >> everything. Do you agree? >> > > Yes that sounds reasonable to me. > > If anyone thinks this isn't within the scope of SciPy, speak now or > forever hold your peace :) > > Cheers, > Eric > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > -------------- next part -------------- An HTML attachment was scrubbed... URL: From cimrman3 at ntc.zcu.cz Fri Jan 19 16:11:42 2018 From: cimrman3 at ntc.zcu.cz (Robert Cimrman) Date: Fri, 19 Jan 2018 22:11:42 +0100 Subject: [SciPy-Dev] GSoC'18 participation? In-Reply-To: References: <1610ef99097.100af0851154280.4991198338195760990@zoho.com> <1610f5d92cc.116a5cf7615287.4034716040473351782@zoho.com>

Message-ID: On 01/19/2018 08:58 PM, Mark Campanelli wrote: > Anyone familiar with Geometric Algebra? Might be worth consideration during > preliminary discussions. For example, thinks looks potentially relevant: > https://dl.acm.org/citation.cfm?id=1610323 FYI: a freely accessible resource on geometric algebra: http://www.jaapsuter.com/geometric-algebra/ And ,of course, there is a Python package for that: https://clifford.readthedocs.io r. > On Fri, Jan 19, 2018 at 10:19 Eric Larson wrote: > >> Thank you for the feedback. You mentioned 3D visualization in which, as >>> far as I know, 4D homogeneous coordinates are commonly used. But we >>> shouldn't go in this direction and consider 3D vectors and rotations only, >>> right? >>> >> >> Yeah at least at first (for GSoC or first implementation) this makes sense >> to me. In viz I ran into needing quaternions specifically. In neuroscience >> I've needed that and also e.g. computing the angle between two rotation >> matrices (or quaternions), among other things. >> >> Great that you are interested in mentoring this project as well. I will >>> add the idea in a basic form to the wiki. Generally I think we should leave >>> some space for a student to investigate, instead of specifying absolutely >>> everything. Do you agree? >>> >> >> Yes that sounds reasonable to me. >> >> If anyone thinks this isn't within the scope of SciPy, speak now or >> forever hold your peace :) >> >> Cheers, >> Eric >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at python.org >> https://mail.python.org/mailman/listinfo/scipy-dev >> > > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > From oss at multiwave.ch Fri Jan 19 16:28:36 2018 From: oss at multiwave.ch (oss) Date: Fri, 19 Jan 2018 22:28:36 +0100 Subject: [SciPy-Dev] GSoC'18 participation? In-Reply-To: References: <1610ef99097.100af0851154280.4991198338195760990@zoho.com> Message-ID: Hello, Maybe this could come in handy regarding transforms matrices quaternions etc. https://www.lfd.uci.edu/~gohlke/code/transformations.py.html Best Tryfon > On Jan 19, 2018, at 4:25 PM, Eric Larson wrote: > > I have personally run into the need for such transformations in two separate domains (3D visualization, neuroscience / electrophysiology) and I know it's used in multiple other places, too. So I think it would be sufficiently general. I'd look forward to having it in SciPy! > > I'd be happy to be a secondary mentor on this if you (or someone else) wants to be primary. > > Best, > Eric > > > On Fri, Jan 19, 2018 at 10:11 AM, Nikolay Mayorov > wrote: > Hi! > > I have this idea, which I'm well familiar with. The module would be called like scipy.spatial.rotation and be devoted to the rotation formalism in 3 dimensions. > > The main objects are Euler angles (and their variations), direction cosine matrices, quaternions and rotation vectors. We can go with an abstraction class Rotation (using DCMs or Qs internally), but we should be able to create that from any representation and see it in any representation. In spirit of scipy/numpy we use vectorized/bulk approaches (i.e. many rotations in single Rotation class). > > Rotation should support 2 operations: compose 2 consecutive rotations and rotate/project a 3d vector. Of course all procedures must be 100% robust and there are some fine points, especially in conversions between representations. > Also we can add some algorithms, like: quaternion interpolation (SLERP), least-squares vector matching by a rotation (Whabba's problem), more advanced and less known algotithms for rotation interpolation, and I will try to come up with something more. > > Overall it seems reasonably straightforward , but with enough challenges in design and implementation > > As currently described, it might be not enough volume for the GSoC, but we can develop it farther. > > Also I'm not sure if its applicability is broad enough to include it into scipy. I believe similar functionality is available in Aerospace toolbox in Matlab. I want to hear some opinions on that. > > Nikolay > > > ---- On Wed, 10 Jan 2018 17:02:33 +0500 jomsdev at gmail.com wrote ---- > > Hi all, > > Last year I started implementing some methods for Randomized Numerical Linear Algebra (RNLA) in scipy. > By now it is only the CountMin Sketch (clarkson_woodruff_transformation) for reducing the dimensionality of a vector space to an embedded space. > > I think that it would be interesting to add to scipy other methods for subspace embedding (like the Johnson-Lindenstrauss) and build some algorithms on top of it for things like least squeres or low rank approximation. > > Would some other people be interesting in this? > > > PS: I have a project called RandNLA where I implemented some of the methods of RNLA. The idea is to implement only the most important methods of RNLA in scipy and have this other library for experimenting with new methods and APIs. That will let us not overloading scipy with features if people are not interested in them and focus on the ones that really brings value to the community. > > Thanks, > > Jordi. > > On 10 January 2018 at 10:42, Ralf Gommers > wrote: > Hi all, > > The GSoC schedule is a bit earlier than normal this year. The PSF is asking for ideas pages to be up and in decent shape by Jan 19th. So we'll need to come up with some content quick if we want to participate. > > Who is interested in mentoring this year? > > I'm happy to do the admin again, but probably won't have time to mentor. > > Cheers, > Ralf > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev -------------- next part -------------- An HTML attachment was scrubbed... URL: From mark.campanelli at gmail.com Fri Jan 19 18:17:01 2018 From: mark.campanelli at gmail.com (Mark Campanelli) Date: Fri, 19 Jan 2018 23:17:01 +0000 Subject: [SciPy-Dev] GSoC'18 participation? In-Reply-To: References: <1610ef99097.100af0851154280.4991198338195760990@zoho.com> <1610f5d92cc.116a5cf7615287.4034716040473351782@zoho.com>

Message-ID: Thanks Robert! GA seems to have advanced nicely since I last dabbled with it. I would hope any ?core? approach based on GA could remain accessible to those with a more traditional background. I recall this bridge seemed very worth crossing over. On Fri, Jan 19, 2018 at 14:12 Robert Cimrman wrote: > On 01/19/2018 08:58 PM, Mark Campanelli wrote: > > Anyone familiar with Geometric Algebra? Might be worth consideration > during > > preliminary discussions. For example, thinks looks potentially relevant: > > https://dl.acm.org/citation.cfm?id=1610323 > > FYI: a freely accessible resource on geometric algebra: > http://www.jaapsuter.com/geometric-algebra/ > > And ,of course, there is a Python package for that: > https://clifford.readthedocs.io > > r. > > > On Fri, Jan 19, 2018 at 10:19 Eric Larson > wrote: > > > >> Thank you for the feedback. You mentioned 3D visualization in which, as > >>> far as I know, 4D homogeneous coordinates are commonly used. But we > >>> shouldn't go in this direction and consider 3D vectors and rotations > only, > >>> right? > >>> > >> > >> Yeah at least at first (for GSoC or first implementation) this makes > sense > >> to me. In viz I ran into needing quaternions specifically. In > neuroscience > >> I've needed that and also e.g. computing the angle between two rotation > >> matrices (or quaternions), among other things. > >> > >> Great that you are interested in mentoring this project as well. I will > >>> add the idea in a basic form to the wiki. Generally I think we should > leave > >>> some space for a student to investigate, instead of specifying > absolutely > >>> everything. Do you agree? > >>> > >> > >> Yes that sounds reasonable to me. > >> > >> If anyone thinks this isn't within the scope of SciPy, speak now or > >> forever hold your peace :) > >> > >> Cheers, > >> Eric > >> > >> _______________________________________________ > >> SciPy-Dev mailing list > >> SciPy-Dev at python.org > >> https://mail.python.org/mailman/listinfo/scipy-dev > >> > > > > > > > > _______________________________________________ > > SciPy-Dev mailing list > > SciPy-Dev at python.org > > https://mail.python.org/mailman/listinfo/scipy-dev > > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Sat Jan 20 00:09:11 2018 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Sat, 20 Jan 2018 18:09:11 +1300 Subject: [SciPy-Dev] Restructuring flapack.src.pyf file In-Reply-To: References: <3sm3kp.p1jh54.rthztg-qmf@smtp.gmail.com>

Message-ID: On Sat, Jan 13, 2018 at 11:16 PM, Ilhan Polat wrote: > Most of them would indeed just gain new keywords xxgvd, xxgv, and so on. > These would only get lwork and liwork keywords. No deprecations needed. > > ?(sy/he)evr on the other hand not only gets these extra keywords but also > would gain two more output arguments which is actually the reason for this > thread. So fortunately the amount of deprecation is "a few"~ > ssyevr/dsyevr/cheevr/zheevr so far. > Okay, that sounds a lot more sensible / less worrying. Maybe just open a PR and we'll look at the detailed changes there? Cheers, Ralf These are used in the regular case of scipy.linalg.eigh and was causing > problems due to unoptimized lwork values. Also they were not completely > exposed.hence the new ouput args. > > I would like to start a separate discussion about the deprecation of > keywords "eigvals" and "turbo" via a decorator later once this is done. > > Because now we are able to give "eig_range" as an input if need be as a > new functionality. And "eigvals" does not correspond to giving an index > range say, "eig_index". But that will have to wait until this surgery on > the LAPACK side. > > > > On Jan 13, 2018 09:31, "Ralf Gommers" wrote: > >> >> >> On Wed, Dec 27, 2017 at 2:40 AM, Ilhan Polat >> wrote: >> >>> The situation, as usual, turns out a bit more involved >>> >>> In the discussion, https://github.com/scipy/scipy/issues/6502 Tiziano >>> writes >>> >>> "Hi, I am the author of the original wrappers for those LAPACK routines. >>> If I remember correctly, the reason for hiding the eigenvalues in an >>> interval option was simply that the original implementation of >>> scipy.linalg.eigh did not even allow to limit the number of >>> eigenvalues/eigenvectors returned. I needed only the possiblity to define >>> the indices of the eigenvalues and not the range, and the result is the >>> option eigvals to scipy.linalg.eigh. I think that your wrapper is too >>> generic. You should not allow setting from Python the values of things like >>> lwork: a wrong value there and the user can generate a segfault from >>> Python, which is a no-no." >>> >>> and I can see more LAPACK routines that are wrapped along this line. >>> Having 8 more "xxxx_full" and "xxxx_full_lwork" routines doesn't seem to be >>> elegant though I don't have any alternative yet. Maybe it is OK to have >>> them for a few versions but I guess it would be nice if we have a parallel >>> running deprecation plans for this issue. It looks like _np.deprecate works >>> pretty OK. And we have four deprecations already in lapack.py e.g., >>> >>> cgegv = _np.deprecate(cgegv, old_name='cgegv', message=_dep_message) >>> >>> This will print a deprecation warning when it is called or docstring is >>> printed. "DeprecationWarning: `dgegv` is deprecated! The `*gegv` family of >>> routines has been deprecated in LAPACK 3.6.0 in favor of the `*ggev` family >>> of routines.The corresponding wrappers will be removed from SciPy in a >>> future release." So I would like to deprecate the routines abouve with a >>> message >>> >>> "The *XXXXX family of routines will have a different signature in the >>> future versions of SciPy. The new versions are now available as >>> *XXXXX_full and will eventually replace the original *XXXXX family" >>> >>> Would you dis/agree? >>> >> >> Just wondering about the tradeoff between deprecating + having a new >> function with arguments in the same order as LAPACK, vs. just appending the >> keywords to the existing function. The latter case would need a thin Python >> function in lapack.py, but unless that hurts performance significantly I'm >> not sure that's much of an issue. >> >> Ralf >> >> >> >> >>> >>> >>> >>> On Mon, Dec 25, 2017 at 11:58 PM, wrote: >>> >>>> Hi, >>>> >>>> Ilhan Polat kirjoitti tiistai 26. joulukuuta 2017: >>>> [clip] >>>> > I've done a similar thing by freeing out all ins and outs. It seems >>>> to me >>>> > that the signature change is inevitable but the question is how to >>>> handle >>>> > this? >>>> >>>> New function sounds best, maximally unimaginative new name is probably >>>> most appropriate. >>>> >>>> Not sure how well np.deprecate etc python level wrappers work on f2py. >>>> >>>> >>>> > 2) It looks like flapack.src.pyf file is getting exceedingly long and >>>> > unstructured. Should we break it apart just like LAPACK documentation? >>>> >>>> If you think that helps it significantly and is worthwhile use of your >>>> time then why not. (I suspect it's not particularly worthwhile but ymmv.) >>>> _______________________________________________ >>>> SciPy-Dev mailing list >>>> SciPy-Dev at python.org >>>> https://mail.python.org/mailman/listinfo/scipy-dev >>>> >>> >>> >>> _______________________________________________ >>> SciPy-Dev mailing list >>> SciPy-Dev at python.org >>> https://mail.python.org/mailman/listinfo/scipy-dev >>> >>> >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at python.org >> https://mail.python.org/mailman/listinfo/scipy-dev >> >> > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From b.rosa at unistra.fr Sat Jan 20 03:12:12 2018 From: b.rosa at unistra.fr (Benoit Rosa) Date: Sat, 20 Jan 2018 09:12:12 +0100 Subject: [SciPy-Dev] GSoC'18 participation? In-Reply-To: References: <1610ef99097.100af0851154280.4991198338195760990@zoho.com>

Message-ID: Hi, I have used that library quite a few times, and it is rather slow. Adding a transformation (or, for starters, rotation) module to scipy would be, in my opinion, a nice addition. Speaking about adding a few algorithms, one interesting add could be a function to uniformly sample the rotation space. It is a core functionality needed in a number of cases, and not that straightforward to perform properly (again, depending on the chosen formalism for representing the rotation). Best Benoit On 19/01/2018 22:28, oss wrote: > Hello, > > Maybe this could come in handy regarding transforms matrices quaternions > ?etc. > > https://www.lfd.uci.edu/~gohlke/code/transformations.py.html > > > Best > > Tryfon > > >> On Jan 19, 2018, at 4:25 PM, Eric Larson > > wrote: >> >> I have personally run into the need for such transformations in two >> separate domains (3D visualization, neuroscience / electrophysiology) >> and I know it's used in multiple other places, too. So I think it >> would be sufficiently general. I'd look forward to having it in SciPy! >> >> I'd be happy to be a secondary mentor on this if you (or someone >> else) wants to be primary. >> >> Best, >> Eric >> >> >> On Fri, Jan 19, 2018 at 10:11 AM, Nikolay Mayorov >> > wrote: >> >> Hi! >> >> I have this idea, which I'm well familiar with. The module would >> be called like scipy.spatial.rotation and be devoted to the >> rotation formalism in 3 dimensions. >> >> The main objects are Euler angles (and their variations), >> direction cosine matrices, quaternions and rotation vectors. We >> can go with an abstraction class Rotation (using DCMs or Qs >> internally), but we should be able to create that from any >> representation and see it in any representation. In spirit of >> scipy/numpy we use vectorized/bulk approaches (i.e. many >> rotations in single Rotation class). >> >> Rotation should support 2 operations: compose 2 consecutive >> rotations and rotate/project a 3d vector.??Of course all >> procedures must be 100% robust and there are some fine points, >> especially in conversions between representations.Also we can add >> some algorithms, like: quaternion interpolation (SLERP), >> least-squares vector matching by a rotation (Whabba's problem), >> more advanced and less known algotithms for rotation >> interpolation, and I will try to come up with something more. >> >> Overall it seems reasonably straightforward , but with enough >> challenges in design and implementation >> >> As currently described, it might be not enough volume for the >> GSoC, but we can develop it farther. >> >> Also I'm not sure if its applicability is broad enough to include >> it into scipy. I believe similar functionality is available in >> Aerospace toolbox in Matlab. I want to hear some opinions on that. >> >> Nikolay >> >> >> ---- On Wed, 10 Jan 2018 17:02:33 +0500 *jomsdev at gmail.com >> * wrote ---- >> >> Hi all, >> >> Last year I started implementing some methods for Randomized >> Numerical Linear Algebra (RNLA) in scipy. >> By now it is only the CountMin Sketch >> (clarkson_woodruff_transformation) for reducing the >> dimensionality of a vector space to an embedded space. >> >> I think that it would be interesting to add to scipy other >> methods for subspace embedding (like the >> Johnson-Lindenstrauss) and build some algorithms on top of it >> for things like least squeres or low rank approximation. >> >> Would some other people be interesting in this? >> >> >> PS: I have a project called RandNLA >> where I implemented some >> of the methods of RNLA. The idea is to implement only the >> most important methods of RNLA in scipy and have this other >> library for experimenting with new methods and APIs. That >> will let us not overloading scipy with features if people are >> not interested in them and focus on the ones that really >> brings value to the community. >> >> Thanks, >> >> Jordi. >> >> On 10 January 2018 at 10:42, Ralf Gommers >> > wrote: >> >> Hi all, >> >> The GSoC schedule is a bit earlier than normal this year. >> The PSF is asking for ideas pages to be up and in decent >> shape by Jan 19th. So we'll need to come up with some >> content quick if we want to participate. >> >> Who is interested in mentoring this year? >> >> I'm happy to do the admin again, but probably won't have >> time to mentor. >> >> Cheers, >> Ralf >> >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at python.org >> https://mail.python.org/mailman/listinfo/scipy-dev >> >> >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at python.org >> https://mail.python.org/mailman/listinfo/scipy-dev >> >> >> >> >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at python.org >> https://mail.python.org/mailman/listinfo/scipy-dev >> >> >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at python.org >> https://mail.python.org/mailman/listinfo/scipy-dev > > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev -------------- next part -------------- An HTML attachment was scrubbed... URL: From nikolay.mayorov at zoho.com Sat Jan 20 05:09:15 2018 From: nikolay.mayorov at zoho.com (Nikolay Mayorov) Date: Sat, 20 Jan 2018 15:09:15 +0500 Subject: [SciPy-Dev] GSoC'18 participation? In-Reply-To: References: <1610ef99097.100af0851154280.4991198338195760990@zoho.com>

Message-ID: <161130b3084.cf9acf3121661.3858523581452506771@zoho.com> Thanks for the suggestion, Benoit! I think the easiest way to achieve that is sample a quaternion uniformly on a unit 4D sphere. By the way I updated the ideas wiki and I will include your suggestion as well. Also as was suggested we could use a more general name scipy.spatial.transform. Nikolay Hi, I have used that library quite a few times, and it is rather slow. Adding a transformation (or, for starters, rotation) module to scipy would be, in my opinion, a nice addition. Speaking about adding a few algorithms, one interesting add could be a function to uniformly sample the rotation space. It is a core functionality needed in a number of cases, and not that straightforward to perform properly (again, depending on the chosen formalism for representing the rotation). Best Benoit On 19/01/2018 22:28, oss wrote: Hello, Maybe this could come in handy regarding transforms matrices quaternions etc. https://www.lfd.uci.edu/~gohlke/code/transformations.py.html Best Tryfon On Jan 19, 2018, at 4:25 PM, Eric Larson <larson.eric.d at gmail.com> wrote: I have personally run into the need for such transformations in two separate domains (3D visualization, neuroscience / electrophysiology) and I know it's used in multiple other places, too. So I think it would be sufficiently general. I'd look forward to having it in SciPy! I'd be happy to be a secondary mentor on this if you (or someone else) wants to be primary. Best, Eric On Fri, Jan 19, 2018 at 10:11 AM, Nikolay Mayorov <nikolay.mayorov at zoho.com> wrote: Hi! I have this idea, which I'm well familiar with. The module would be called like scipy.spatial.rotation and be devoted to the rotation formalism in 3 dimensions. The main objects are Euler angles (and their variations), direction cosine matrices, quaternions and rotation vectors. We can go with an abstraction class Rotation (using DCMs or Qs internally), but we should be able to create that from any representation and see it in any representation. In spirit of scipy/numpy we use vectorized/bulk approaches (i.e. many rotations in single Rotation class). Rotation should support 2 operations: compose 2 consecutive rotations and rotate/project a 3d vector. Of course all procedures must be 100% robust and there are some fine points, especially in conversions between representations.Also we can add some algorithms, like: quaternion interpolation (SLERP), least-squares vector matching by a rotation (Whabba's problem), more advanced and less known algotithms for rotation interpolation, and I will try to come up with something more. Overall it seems reasonably straightforward , but with enough challenges in design and implementation As currently described, it might be not enough volume for the GSoC, but we can develop it farther. Also I'm not sure if its applicability is broad enough to include it into scipy. I believe similar functionality is available in Aerospace toolbox in Matlab. I want to hear some opinions on that. Nikolay ---- On Wed, 10 Jan 2018 17:02:33 +0500 jomsdev at gmail.com wrote ---- Hi all, Last year I started implementing some methods for Randomized Numerical Linear Algebra (RNLA) in scipy. By now it is only the CountMin Sketch (clarkson_woodruff_transformation) for reducing the dimensionality of a vector space to an embedded space. I think that it would be interesting to add to scipy other methods for subspace embedding (like the Johnson-Lindenstrauss) and build some algorithms on top of it for things like least squeres or low rank approximation. Would some other people be interesting in this? PS: I have a project called RandNLA where I implemented some of the methods of RNLA. The idea is to implement only the most important methods of RNLA in scipy and have this other library for experimenting with new methods and APIs. That will let us not overloading scipy with features if people are not interested in them and focus on the ones that really brings value to the community. Thanks, Jordi. On 10 January 2018 at 10:42, Ralf Gommers <ralf.gommers at gmail.com> wrote: Hi all, The GSoC schedule is a bit earlier than normal this year. The PSF is asking for ideas pages to be up and in decent shape by Jan 19th. So we'll need to come up with some content quick if we want to participate. Who is interested in mentoring this year? I'm happy to do the admin again, but probably won't have time to mentor. Cheers, Ralf _______________________________________________ SciPy-Dev mailing list SciPy-Dev at python.org https://mail.python.org/mailman/listinfo/scipy-dev _______________________________________________ SciPy-Dev mailing list SciPy-Dev at python.org https://mail.python.org/mailman/listinfo/scipy-dev _______________________________________________ SciPy-Dev mailing list SciPy-Dev at python.org https://mail.python.org/mailman/listinfo/scipy-dev _______________________________________________ SciPy-Dev mailing list SciPy-Dev at python.org https://mail.python.org/mailman/listinfo/scipy-dev _______________________________________________ SciPy-Dev mailing list SciPy-Dev at python.org https://mail.python.org/mailman/listinfo/scipy-dev _______________________________________________ SciPy-Dev mailing list SciPy-Dev at python.org https://mail.python.org/mailman/listinfo/scipy-dev -------------- next part -------------- An HTML attachment was scrubbed... URL: From serge.guelton at telecom-bretagne.eu Sat Jan 20 05:10:34 2018 From: serge.guelton at telecom-bretagne.eu (Serge Guelton) Date: Sat, 20 Jan 2018 11:10:34 +0100 Subject: [SciPy-Dev] Using Pythran to compile some of the scipy internals Message-ID: <20180120101034.GA3707@lakota> Hi Scipy-dev, = Context I've started the experiment of compiling some of the scipy kernels with the Pythran[0] compiler. The approach was basically to pick some easy .pyx file, turn them back to high-level Python + Numpy files then processing them with Pythran. This results in easier to maintain, Python-compatible files that run at the same speed as their Cython alternative up to 10 times faster depending on the kernels and whether SIMD instructions are activated or not (I did not play with parallelization in that experiment). This required a few improvement to Pythran, but not that much, see https://github.com/serge-sans-paille/pythran/pull/770 for the associated PR on pythran, which includes the following scipy kernels: hausdorff.py max_len_seq_inner.py solve_toeplitz.py spectral.py = Why bother? According to that first round of experiments, using Pythran can bring extra speed improvements while remaining at a higher level than Cython. Both points looks like an improvement to me. = Cost? This adds an extra dependency on Pythran, which uses C++ as backend. This increases the failure surface. Although alive since 2012 and being tested a lot [1] on Linux (but scarcely on Windows), its is obviously less mature than cython = Alternatives There is an experimental Pythran mode in cython[2] that uses Pythran as a backend for numpy operations. Unfortunately it is still at early stages and cannot translate calls to ``np.roll`` or ``np.sum(a[i,:] - b[j, :])`` while Pythran supports it. Instead of translating Cython files, I could also focus on some pure-python functions. I tested Pythran on the rosenbrock function and I get good speedup (from 1.5x to 4x depending on vectorization being enabled or not) there too. So yeah, that's a rather long introduction to probe the interest here around that idea :-) Best, Serge PS: I started https://github.com/scipy/scipy/pull/8306 meanwhile, but let's discuss here first :-) [0] https://github.com/serge-sans-paille/pythran [1] https://travis-ci.org/serge-sans-paille/pythran/builds/330052310 [2] http://cython.readthedocs.io/en/latest/src/userguide/numpy_pythran.html From cgohlke at uci.edu Sat Jan 20 10:34:35 2018 From: cgohlke at uci.edu (Christoph Gohlke) Date: Sat, 20 Jan 2018 07:34:35 -0800 Subject: [SciPy-Dev] GSoC'18 participation? In-Reply-To: References: <1610ef99097.100af0851154280.4991198338195760990@zoho.com>

Message-ID: On 1/20/2018 12:12 AM, Benoit Rosa wrote: > Hi, > > I have used that library quite a few times, and it is rather slow. That's because using small numpy arrays (4x4 or 1x4) to represent homogeneous transformation matrices, vectors, points, and quaternions is inherently inefficient, even when using the C API. As usual if that overhead matters, vectorize the algorithm or implement it in Cython/C/C++/FORTRAN/etc. There's a speedier C extension module but it still has to go through the numpy C API. The transformations.py module has been mostly superseded by the Transforms3d package . IMO quaternions should be a numpy dtype living in numpy or in a small package, not in scipy. See Christoph > Adding a transformation (or, for starters, rotation) module to scipy > would be, in my opinion, a nice addition. > > Speaking about adding a few algorithms, one interesting add could be a > function to uniformly sample the rotation space. It is a core > functionality needed in a number of cases, and not that straightforward > to perform properly (again, depending on the chosen formalism for > representing the rotation). > > Best > Benoit > > > On 19/01/2018 22:28, oss wrote: >> Hello, >> >> Maybe this could come in handy regarding transforms matrices quaternions >> ?etc. >> >> https://www.lfd.uci.edu/~gohlke/code/transformations.py.html >> >> >> Best >> >> Tryfon >> >> >>> On Jan 19, 2018, at 4:25 PM, Eric Larson >> > wrote: >>> >>> I have personally run into the need for such transformations in two >>> separate domains (3D visualization, neuroscience / electrophysiology) >>> and I know it's used in multiple other places, too. So I think it >>> would be sufficiently general. I'd look forward to having it in SciPy! >>> >>> I'd be happy to be a secondary mentor on this if you (or someone >>> else) wants to be primary. >>> >>> Best, >>> Eric >>> >>> >>> On Fri, Jan 19, 2018 at 10:11 AM, Nikolay Mayorov >>> > wrote: >>> >>> Hi! >>> >>> I have this idea, which I'm well familiar with. The module would >>> be called like scipy.spatial.rotation and be devoted to the >>> rotation formalism in 3 dimensions. >>> >>> The main objects are Euler angles (and their variations), >>> direction cosine matrices, quaternions and rotation vectors. We >>> can go with an abstraction class Rotation (using DCMs or Qs >>> internally), but we should be able to create that from any >>> representation and see it in any representation. In spirit of >>> scipy/numpy we use vectorized/bulk approaches (i.e. many >>> rotations in single Rotation class). >>> >>> Rotation should support 2 operations: compose 2 consecutive >>> rotations and rotate/project a 3d vector.??Of course all >>> procedures must be 100% robust and there are some fine points, >>> especially in conversions between representations.Also we can add >>> some algorithms, like: quaternion interpolation (SLERP), >>> least-squares vector matching by a rotation (Whabba's problem), >>> more advanced and less known algotithms for rotation >>> interpolation, and I will try to come up with something more. >>> >>> Overall it seems reasonably straightforward , but with enough >>> challenges in design and implementation >>> >>> As currently described, it might be not enough volume for the >>> GSoC, but we can develop it farther. >>> >>> Also I'm not sure if its applicability is broad enough to include >>> it into scipy. I believe similar functionality is available in >>> Aerospace toolbox in Matlab. I want to hear some opinions on that. >>> >>> Nikolay >>> >>> >>> ---- On Wed, 10 Jan 2018 17:02:33 +0500 *jomsdev at gmail.com >>> * wrote ---- >>> >>> Hi all, >>> >>> Last year I started implementing some methods for Randomized >>> Numerical Linear Algebra (RNLA) in scipy. >>> By now it is only the CountMin Sketch >>> (clarkson_woodruff_transformation) for reducing the >>> dimensionality of a vector space to an embedded space. >>> >>> I think that it would be interesting to add to scipy other >>> methods for subspace embedding (like the >>> Johnson-Lindenstrauss) and build some algorithms on top of it >>> for things like least squeres or low rank approximation. >>> >>> Would some other people be interesting in this? >>> >>> >>> PS: I have a project called RandNLA >>> where I implemented some >>> of the methods of RNLA. The idea is to implement only the >>> most important methods of RNLA in scipy and have this other >>> library for experimenting with new methods and APIs. That >>> will let us not overloading scipy with features if people are >>> not interested in them and focus on the ones that really >>> brings value to the community. >>> >>> Thanks, >>> >>> Jordi. >>> >>> On 10 January 2018 at 10:42, Ralf Gommers >>> > wrote: >>> >>> Hi all, >>> >>> The GSoC schedule is a bit earlier than normal this year. >>> The PSF is asking for ideas pages to be up and in decent >>> shape by Jan 19th. So we'll need to come up with some >>> content quick if we want to participate. >>> >>> Who is interested in mentoring this year? >>> >>> I'm happy to do the admin again, but probably won't have >>> time to mentor. >>> >>> Cheers, >>> Ralf >>> >>> >>> _______________________________________________ >>> SciPy-Dev mailing list >>> SciPy-Dev at python.org >>> https://mail.python.org/mailman/listinfo/scipy-dev >>> >>> >>> >>> _______________________________________________ >>> SciPy-Dev mailing list >>> SciPy-Dev at python.org >>> https://mail.python.org/mailman/listinfo/scipy-dev >>> >>> >>> >>> >>> >>> _______________________________________________ >>> SciPy-Dev mailing list >>> SciPy-Dev at python.org >>> https://mail.python.org/mailman/listinfo/scipy-dev >>> >>> >>> >>> _______________________________________________ >>> SciPy-Dev mailing list >>> SciPy-Dev at python.org >>> https://mail.python.org/mailman/listinfo/scipy-dev >> >> >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at python.org >> https://mail.python.org/mailman/listinfo/scipy-dev > > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > From ilhanpolat at gmail.com Sat Jan 20 15:42:44 2018 From: ilhanpolat at gmail.com (Ilhan Polat) Date: Sat, 20 Jan 2018 21:42:44 +0100 Subject: [SciPy-Dev] Restructuring flapack.src.pyf file In-Reply-To: References: <3sm3kp.p1jh54.rthztg-qmf@smtp.gmail.com>

Message-ID: I'm secretly working on it :) Hopefully this week I can submit something. On Jan 20, 2018 06:10, "Ralf Gommers" wrote: > > > On Sat, Jan 13, 2018 at 11:16 PM, Ilhan Polat > wrote: > >> Most of them would indeed just gain new keywords xxgvd, xxgv, and so on. >> These would only get lwork and liwork keywords. No deprecations needed. >> >> ?(sy/he)evr on the other hand not only gets these extra keywords but also >> would gain two more output arguments which is actually the reason for this >> thread. So fortunately the amount of deprecation is "a few"~ >> ssyevr/dsyevr/cheevr/zheevr so far. >> > > Okay, that sounds a lot more sensible / less worrying. Maybe just open a > PR and we'll look at the detailed changes there? > > Cheers, > Ralf > > > These are used in the regular case of scipy.linalg.eigh and was causing >> problems due to unoptimized lwork values. Also they were not completely >> exposed.hence the new ouput args. >> >> I would like to start a separate discussion about the deprecation of >> keywords "eigvals" and "turbo" via a decorator later once this is done. >> >> Because now we are able to give "eig_range" as an input if need be as a >> new functionality. And "eigvals" does not correspond to giving an index >> range say, "eig_index". But that will have to wait until this surgery on >> the LAPACK side. >> >> >> >> On Jan 13, 2018 09:31, "Ralf Gommers" wrote: >> >>> >>> >>> On Wed, Dec 27, 2017 at 2:40 AM, Ilhan Polat >>> wrote: >>> >>>> The situation, as usual, turns out a bit more involved >>>> >>>> In the discussion, https://github.com/scipy/scipy/issues/6502 Tiziano >>>> writes >>>> >>>> "Hi, I am the author of the original wrappers for those LAPACK >>>> routines. If I remember correctly, the reason for hiding the eigenvalues in >>>> an interval option was simply that the original implementation of >>>> scipy.linalg.eigh did not even allow to limit the number of >>>> eigenvalues/eigenvectors returned. I needed only the possiblity to define >>>> the indices of the eigenvalues and not the range, and the result is the >>>> option eigvals to scipy.linalg.eigh. I think that your wrapper is too >>>> generic. You should not allow setting from Python the values of things like >>>> lwork: a wrong value there and the user can generate a segfault from >>>> Python, which is a no-no." >>>> >>>> and I can see more LAPACK routines that are wrapped along this line. >>>> Having 8 more "xxxx_full" and "xxxx_full_lwork" routines doesn't seem to be >>>> elegant though I don't have any alternative yet. Maybe it is OK to have >>>> them for a few versions but I guess it would be nice if we have a parallel >>>> running deprecation plans for this issue. It looks like _np.deprecate works >>>> pretty OK. And we have four deprecations already in lapack.py e.g., >>>> >>>> cgegv = _np.deprecate(cgegv, old_name='cgegv', message=_dep_message) >>>> >>>> This will print a deprecation warning when it is called or docstring is >>>> printed. "DeprecationWarning: `dgegv` is deprecated! The `*gegv` family of >>>> routines has been deprecated in LAPACK 3.6.0 in favor of the `*ggev` family >>>> of routines.The corresponding wrappers will be removed from SciPy in a >>>> future release." So I would like to deprecate the routines abouve with a >>>> message >>>> >>>> "The *XXXXX family of routines will have a different signature in the >>>> future versions of SciPy. The new versions are now available as >>>> *XXXXX_full and will eventually replace the original *XXXXX family" >>>> >>>> Would you dis/agree? >>>> >>> >>> Just wondering about the tradeoff between deprecating + having a new >>> function with arguments in the same order as LAPACK, vs. just appending the >>> keywords to the existing function. The latter case would need a thin Python >>> function in lapack.py, but unless that hurts performance significantly I'm >>> not sure that's much of an issue. >>> >>> Ralf >>> >>> >>> >>> >>>> >>>> >>>> >>>> On Mon, Dec 25, 2017 at 11:58 PM, wrote: >>>> >>>>> Hi, >>>>> >>>>> Ilhan Polat kirjoitti tiistai 26. joulukuuta 2017: >>>>> [clip] >>>>> > I've done a similar thing by freeing out all ins and outs. It seems >>>>> to me >>>>> > that the signature change is inevitable but the question is how to >>>>> handle >>>>> > this? >>>>> >>>>> New function sounds best, maximally unimaginative new name is probably >>>>> most appropriate. >>>>> >>>>> Not sure how well np.deprecate etc python level wrappers work on f2py. >>>>> >>>>> >>>>> > 2) It looks like flapack.src.pyf file is getting exceedingly long and >>>>> > unstructured. Should we break it apart just like LAPACK >>>>> documentation? >>>>> >>>>> If you think that helps it significantly and is worthwhile use of your >>>>> time then why not. (I suspect it's not particularly worthwhile but ymmv.) >>>>> _______________________________________________ >>>>> SciPy-Dev mailing list >>>>> SciPy-Dev at python.org >>>>> https://mail.python.org/mailman/listinfo/scipy-dev >>>>> >>>> >>>> >>>> _______________________________________________ >>>> SciPy-Dev mailing list >>>> SciPy-Dev at python.org >>>> https://mail.python.org/mailman/listinfo/scipy-dev >>>> >>>> >>> >>> _______________________________________________ >>> SciPy-Dev mailing list >>> SciPy-Dev at python.org >>> https://mail.python.org/mailman/listinfo/scipy-dev >>> >>> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at python.org >> https://mail.python.org/mailman/listinfo/scipy-dev >> >> > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Sat Jan 20 16:03:25 2018 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Sun, 21 Jan 2018 10:03:25 +1300 Subject: [SciPy-Dev] SciPy 1.0 paper writing proposal Message-ID: Hi all, TL;DR, let's write the long journal paper on SciPy that we've wanted for a while, let's form a small committee to coordinate, and get it out the door in 2-3 months. Motivation --------------- (credits for most of this text: Evgeni) Many scipy contributors' day jobs are in academia. Bibliometry -- papers in refereed journals and citations of papers by other papers -- is one of the main performance indicators in most academic establishments. Since we do not generate papers, scipy contributions are all but invisible for the purposes of a contributor's annual report. Of course, details vary wildly; in many cases a contributor manages to balance their time, or to argue common sense with their superiors, or get an approval for scipy work, or just ignores the issue altogether -- but sooner or later there is a form to be filled or boxes to be checked, and scipy contributions simply do not fit in. A peer-reviewed journal paper on scipy will help contributors get the academic credit they deserve. We can write *the* paper for SciPy 1.0, with overall project structure, goals, etc., and for specific features/modules a focus on say the last 3 years. History ---------- For SciPy 1.0 we had three targets on the publicity/credits front: an interesting release announcement, interesting blogs/stories (NumFOCUS blog, Hacker News, etc.) and a paper. We didn't have the bandwidth for a paper in the end, the rest was successful. [1] is a previous announcement on this list about writing (a) paper(s) on SciPy. We wanted both "short papers" to cover one or two releases (target journal JOSS) and a full paper as the authoritative reference for SciPy. We had an earlier attempt for a "short paper", it's mostly written but has stalled (see [2]). We ran out of steam on that one. To avoid that this time around, it would be good to have a clear public plan, target dates, and a small committee rather than one person to drive things forward. Proposal ------------ Here's a proposal for all aspects of this exercise that I can think about now. Some parts stolen from the AstroPy paper [3] (because their process worked quite well). Form a small coordination committee of 3-5 people that set up the paper structure, move things along when parts stall, propose/take decisions as needed, invite co-authors, and organise paper submission/rework. Paper writing to be done by whoever volunteers for a section, not just the coordination committee. First outline/structure to be done by committee, which then asks for review of structure and volunteers for section writing. Scope: a 6-10 page paper, covering history, package scope and structure, community/organisational aspects, key features and recent enhancements per module, and roadmap. Authorship: anyone who made a substantial contribution in the history of the project. Here "substantial" is interpreted as anything beyond a one-line doc fix. Rationale: better to be too inclusive than exclusive. Sign-up via a web form, we send the link to that form to all email addresses in the commit history till v1.0. Author order (details tbd by committee): 1. The SciPy Developers 2. Maintainers, paper writers, other key contributors - in order of contribution level 3. All other authors - alphabetically ordered Submission target: mid-April, to either PeerJ Computer Science or Journal of Open Research Software (tbd by committee). Comments? Volunteers for committee? References ---------------- [1] https://mail.python.org/pipermail/scipy-dev/2016-August/021474.html [2] https://github.com/scipy/scipy-articles/pull/4 [3] https://github.com/astropy/astropy-v2.0-paper Cheers, Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Sat Jan 20 16:03:06 2018 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Sun, 21 Jan 2018 10:03:06 +1300 Subject: [SciPy-Dev] maintainership Sunday morning thoughts Message-ID: Hi all, I wanted to share this excellent talk, "Rebuilding the cathedral", from Nadia Eghbal about how open source software gets maintained: https://www.youtube.com/watch?v=VS6IpvTWwkQ If you're a maintainer that experiences feeling guilty about not answering questions or reviewing PRs on Github quickly enough (I certainly do sometimes), or are a contributor that wonders why your well crafted PR doesn't get reviewed or merged, watching this talk may explain a few things. In terms of "rewards" for maintainership of SciPy, we could do better. One obvious thing is a paper that contributors can be co-authors on - this is one thing that we planned but didn't manage to do in the rush to get SciPy 1.0 polished and out the door. I'm seeing a bit of free time coming up, and fixing that omission is on my todo list for that time (really this time) - second email to follow shortly. More substantial rewards ($$) is a bit of a chicken-and-egg problem - it requires investing more time than anyone can put in at the moment to apply for funding. It'll be interesting to see how the dynamics of NumPy development change with the two grants for the next two years... Finally, a thank you to the people who've jumped in to fill the post-1.0 code review gap we seem to be experiencing - mainly Ilhan, Andrew and Tyler. Cheers, Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Sat Jan 20 19:29:09 2018 From: charlesr.harris at gmail.com (Charles R Harris) Date: Sat, 20 Jan 2018 17:29:09 -0700 Subject: [SciPy-Dev] maintainership Sunday morning thoughts In-Reply-To: References: Message-ID: On Sat, Jan 20, 2018 at 2:03 PM, Ralf Gommers wrote: > Hi all, > > I wanted to share this excellent talk, "Rebuilding the cathedral", from > Nadia Eghbal about how open source software gets maintained: > https://www.youtube.com/watch?v=VS6IpvTWwkQ > Thanks for the link, good talk. Considering the exponential growth in the number of users, I'm surprised that NumPy/SciPy aren't completely swamped in bug reports. We're falling behind, sure, but not as badly as might be expected. > If you're a maintainer that experiences feeling guilty about not answering > questions or reviewing PRs on Github quickly enough (I certainly do > sometimes), or are a contributor that wonders why your well crafted PR > doesn't get reviewed or merged, watching this talk may explain a few things. > > In terms of "rewards" for maintainership of SciPy, we could do better. One > obvious thing is a paper that contributors can be co-authors on - this is > one thing that we planned but didn't manage to do in the rush to get SciPy > 1.0 polished and out the door. I'm seeing a bit of free time coming up, and > fixing that omission is on my todo list for that time (really this time) - > second email to follow shortly. More substantial rewards ($$) is a bit of a > chicken-and-egg problem - it requires investing more time than anyone can > put in at the moment to apply for funding. It'll be interesting to see how > the dynamics of NumPy development change with the two grants for the next > two years... > > Finally, a thank you to the people who've jumped in to fill the post-1.0 > code review gap we seem to be experiencing - mainly Ilhan, Andrew and Tyler. > > Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From josef.pktd at gmail.com Sat Jan 20 19:51:46 2018 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Sat, 20 Jan 2018 19:51:46 -0500 Subject: [SciPy-Dev] maintainership Sunday morning thoughts In-Reply-To: References:

Message-ID: On Sat, Jan 20, 2018 at 7:29 PM, Charles R Harris wrote: > > > On Sat, Jan 20, 2018 at 2:03 PM, Ralf Gommers > wrote: > >> Hi all, >> >> I wanted to share this excellent talk, "Rebuilding the cathedral", from >> Nadia Eghbal about how open source software gets maintained: >> https://www.youtube.com/watch?v=VS6IpvTWwkQ >> > > Thanks for the link, good talk. > > Considering the exponential growth in the number of users, I'm surprised > that NumPy/SciPy aren't completely swamped in bug reports. We're falling > behind, sure, but not as badly as might be expected. > I think it's because there is no "swamp of bugs". I have the impression in keeping roughly track of scipy issues and PRs that PR review and unit testing works very well so that only occasional bugs escape and bug the users. Josef > > >> If you're a maintainer that experiences feeling guilty about not >> answering questions or reviewing PRs on Github quickly enough (I certainly >> do sometimes), or are a contributor that wonders why your well crafted PR >> doesn't get reviewed or merged, watching this talk may explain a few things. >> >> In terms of "rewards" for maintainership of SciPy, we could do better. >> One obvious thing is a paper that contributors can be co-authors on - this >> is one thing that we planned but didn't manage to do in the rush to get >> SciPy 1.0 polished and out the door. I'm seeing a bit of free time coming >> up, and fixing that omission is on my todo list for that time (really this >> time) - second email to follow shortly. More substantial rewards ($$) is a >> bit of a chicken-and-egg problem - it requires investing more time than >> anyone can put in at the moment to apply for funding. It'll be interesting >> to see how the dynamics of NumPy development change with the two grants for >> the next two years... >> >> Finally, a thank you to the people who've jumped in to fill the post-1.0 >> code review gap we seem to be experiencing - mainly Ilhan, Andrew and Tyler. >> >> > Chuck > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Sat Jan 20 20:02:53 2018 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Sun, 21 Jan 2018 14:02:53 +1300 Subject: [SciPy-Dev] maintainership Sunday morning thoughts In-Reply-To: References:

Message-ID: On Sun, Jan 21, 2018 at 1:51 PM, wrote: > > > On Sat, Jan 20, 2018 at 7:29 PM, Charles R Harris < > charlesr.harris at gmail.com> wrote: > >> >> >> On Sat, Jan 20, 2018 at 2:03 PM, Ralf Gommers >> wrote: >> >>> Hi all, >>> >>> I wanted to share this excellent talk, "Rebuilding the cathedral", from >>> Nadia Eghbal about how open source software gets maintained: >>> https://www.youtube.com/watch?v=VS6IpvTWwkQ >>> >> >> Thanks for the link, good talk. >> >> Considering the exponential growth in the number of users, I'm surprised >> that NumPy/SciPy aren't completely swamped in bug reports. We're falling >> behind, sure, but not as badly as might be expected. >> > > I think it's because there is no "swamp of bugs". > I have the impression in keeping roughly track of scipy issues and PRs > that PR review and unit testing works very well so that only occasional > bugs escape and bug the users. > Agreed, there's no swamp of bugs. Maybe a swamp of rough edges.... What we do struggle with is lack of progress on big-ticket items, e.g. - sparse arrays instead of (or in addition to) sparse matrices - replacing all the spline implementations (in interpolate, signal, ndimage) with a single clean design - making ndimage consistent for the "is it a pixel or is it a data point" question Basically the stuff one can't do piece by piece but requires someone to sit down for a couple of months. That requires luck (a person with the right skills and motivation and time coming along) or funding. Ralf > Josef > > >> >> >>> If you're a maintainer that experiences feeling guilty about not >>> answering questions or reviewing PRs on Github quickly enough (I certainly >>> do sometimes), or are a contributor that wonders why your well crafted PR >>> doesn't get reviewed or merged, watching this talk may explain a few things. >>> >>> In terms of "rewards" for maintainership of SciPy, we could do better. >>> One obvious thing is a paper that contributors can be co-authors on - this >>> is one thing that we planned but didn't manage to do in the rush to get >>> SciPy 1.0 polished and out the door. I'm seeing a bit of free time coming >>> up, and fixing that omission is on my todo list for that time (really this >>> time) - second email to follow shortly. More substantial rewards ($$) is a >>> bit of a chicken-and-egg problem - it requires investing more time than >>> anyone can put in at the moment to apply for funding. It'll be interesting >>> to see how the dynamics of NumPy development change with the two grants for >>> the next two years... >>> >>> Finally, a thank you to the people who've jumped in to fill the post-1.0 >>> code review gap we seem to be experiencing - mainly Ilhan, Andrew and Tyler. >>> >>> >> Chuck >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at python.org >> https://mail.python.org/mailman/listinfo/scipy-dev >> >> > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From josef.pktd at gmail.com Sat Jan 20 20:20:16 2018 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Sat, 20 Jan 2018 20:20:16 -0500 Subject: [SciPy-Dev] maintainership Sunday morning thoughts In-Reply-To: References:

Message-ID: On Sat, Jan 20, 2018 at 8:02 PM, Ralf Gommers wrote: > > > On Sun, Jan 21, 2018 at 1:51 PM, wrote: > >> >> >> On Sat, Jan 20, 2018 at 7:29 PM, Charles R Harris < >> charlesr.harris at gmail.com> wrote: >> >>> >>> >>> On Sat, Jan 20, 2018 at 2:03 PM, Ralf Gommers >>> wrote: >>> >>>> Hi all, >>>> >>>> I wanted to share this excellent talk, "Rebuilding the cathedral", from >>>> Nadia Eghbal about how open source software gets maintained: >>>> https://www.youtube.com/watch?v=VS6IpvTWwkQ >>>> >>> >>> Thanks for the link, good talk. >>> >>> Considering the exponential growth in the number of users, I'm surprised >>> that NumPy/SciPy aren't completely swamped in bug reports. We're falling >>> behind, sure, but not as badly as might be expected. >>> >> >> I think it's because there is no "swamp of bugs". >> I have the impression in keeping roughly track of scipy issues and PRs >> that PR review and unit testing works very well so that only occasional >> bugs escape and bug the users. >> > > Agreed, there's no swamp of bugs. Maybe a swamp of rough edges.... > > What we do struggle with is lack of progress on big-ticket items, e.g. > - sparse arrays instead of (or in addition to) sparse matrices > - replacing all the spline implementations (in interpolate, signal, > ndimage) with a single clean design > - making ndimage consistent for the "is it a pixel or is it a data > point" question > and at least differential equations, AFAIK as bystander > Basically the stuff one can't do piece by piece but requires someone to > sit down for a couple of months. That requires luck (a person with the > right skills and motivation and time coming along) or funding. > I agree with all the requirements. However, it also needs some difficult design work by experts or experienced developers. E.g. How can general interpolating splines, smoothing splines, splines on a one dimensional grid and splines on a nd dimensional grid be combined into a single clean design without loosing ... (sounds between tough and impossible to me) Josef > > > Ralf > > > >> Josef >> >> >>> >>> >>>> If you're a maintainer that experiences feeling guilty about not >>>> answering questions or reviewing PRs on Github quickly enough (I certainly >>>> do sometimes), or are a contributor that wonders why your well crafted PR >>>> doesn't get reviewed or merged, watching this talk may explain a few things. >>>> >>>> In terms of "rewards" for maintainership of SciPy, we could do better. >>>> One obvious thing is a paper that contributors can be co-authors on - this >>>> is one thing that we planned but didn't manage to do in the rush to get >>>> SciPy 1.0 polished and out the door. I'm seeing a bit of free time coming >>>> up, and fixing that omission is on my todo list for that time (really this >>>> time) - second email to follow shortly. More substantial rewards ($$) is a >>>> bit of a chicken-and-egg problem - it requires investing more time than >>>> anyone can put in at the moment to apply for funding. It'll be interesting >>>> to see how the dynamics of NumPy development change with the two grants for >>>> the next two years... >>>> >>>> Finally, a thank you to the people who've jumped in to fill the >>>> post-1.0 code review gap we seem to be experiencing - mainly Ilhan, Andrew >>>> and Tyler. >>>> >>>> >>> Chuck >>> >>> _______________________________________________ >>> SciPy-Dev mailing list >>> SciPy-Dev at python.org >>> https://mail.python.org/mailman/listinfo/scipy-dev >>> >>> >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at python.org >> https://mail.python.org/mailman/listinfo/scipy-dev >> >> > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Sat Jan 20 20:34:59 2018 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Sun, 21 Jan 2018 14:34:59 +1300 Subject: [SciPy-Dev] Using Pythran to compile some of the scipy internals In-Reply-To: <20180120101034.GA3707@lakota> References: <20180120101034.GA3707@lakota> Message-ID: On Sat, Jan 20, 2018 at 11:10 PM, Serge Guelton wrote: > Hi Scipy-dev, > > = Context > > I've started the experiment of compiling some of the scipy kernels with > the Pythran[0] compiler. > > The approach was basically to pick some easy .pyx file, turn them back > to high-level Python + Numpy files then processing them with Pythran. > This results in easier to maintain, Python-compatible files that run at > the same speed as their Cython alternative up to 10 times faster > depending on the kernels and whether SIMD instructions are activated or > not (I did not play with parallelization in that experiment). > > This required a few improvement to Pythran, but not that much, see > > https://github.com/serge-sans-paille/pythran/pull/770 > > for the associated PR on pythran, which includes the following scipy > kernels: > > hausdorff.py > max_len_seq_inner.py > solve_toeplitz.py > spectral.py > > = Why bother? > > According to that first round of experiments, using Pythran can bring > extra speed improvements while remaining at a higher level than Cython. > Both points looks like an improvement to me. > Another potential benefit is to decrease the size of the binary distribution of SciPy. Cython extensions are quite expensive in this respect. Do you have an idea of how Pythran compares? Both in case of only float64 inputs, and with templated inputs? A good example of the latter is scipy.ndimage.label, which is a straightforward function that ends up being a 700kb .so > = Cost? > > This adds an extra dependency on Pythran, which uses C++ as backend. > Only a build-time dependency. That's not my major worry from the maintenance point of view. > This increases the failure surface. Although alive since 2012 and being > tested a lot [1] on Linux (but scarcely on Windows), its is obviously > less mature than cython > This is a worry. We need Windows, Linux, macOS (officially supported, also the 32-bit flavors) as well as less commonly used Unix/BSD-like platforms. I guess Windows needs separate testing, both with gcc and MSVC. But for all other platforms, can you say something about portability based on the kind of C++ Pythran generates? > = Alternatives > > There is an experimental Pythran mode in cython[2] that uses Pythran as > a backend for numpy operations. Unfortunately it is still at early > stages and cannot translate calls to ``np.roll`` or ``np.sum(a[i,:] - b[j, > :])`` while Pythran supports it. > > Instead of translating Cython files, I could also focus on some > pure-python functions. I tested Pythran on the rosenbrock function and I > get good speedup (from 1.5x to 4x depending on vectorization being > enabled or not) there too. > > So yeah, that's a rather long introduction to probe the interest here > around that idea :-) > Interest in general, but there's a long way to go - our requirements are pretty demanding. I have seen some comparisons between Cython, Pythran and Numba in terms of performance and ease of use, but never a comprehensive comparison from the point of view of library authors. I know Travis O. has an interest in seeing Numba being adopted more widely, which will also need such a comparison. It should cover at least: - portability - performance - maturity - maintenance status (active devs, how quick do bugs get fixed after a release with an issue) - ease of use (@jit vs. Pythran comments vs. translate to .pyx syntax) - size of generated binaries - templating support for multiple dtypes - debugging and optimization experience/tools Is anyone aware of such a comparison? Or interested in putting it together? Cheers, Ralf > Best, > Serge > > PS: I started https://github.com/scipy/scipy/pull/8306 meanwhile, but > let's discuss here first :-) > > > > [0] https://github.com/serge-sans-paille/pythran > [1] https://travis-ci.org/serge-sans-paille/pythran/builds/330052310 > [2] http://cython.readthedocs.io/en/latest/src/userguide/numpy_p > ythran.html > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > -------------- next part -------------- An HTML attachment was scrubbed... URL: From tyler.je.reddy at gmail.com Sat Jan 20 22:08:21 2018 From: tyler.je.reddy at gmail.com (Tyler Reddy) Date: Sat, 20 Jan 2018 20:08:21 -0700 Subject: [SciPy-Dev] SciPy 1.0 paper writing proposal In-Reply-To: References: Message-ID: Sounds good -- I'd be up for committee work. Definitely +1 on a paper for time justification, etc. On 20 January 2018 at 14:03, Ralf Gommers wrote: > Hi all, > > TL;DR, let's write the long journal paper on SciPy that we've wanted for a > while, let's form a small committee to coordinate, and get it out the door > in 2-3 months. > > > Motivation > --------------- > (credits for most of this text: Evgeni) > > Many scipy contributors' day jobs are in academia. Bibliometry -- papers in > refereed journals and citations of papers by other papers -- is one of the > main > performance indicators in most academic establishments. Since we do not > generate papers, scipy contributions are all but invisible for the > purposes of a > contributor's annual report. Of course, details vary wildly; in many cases > a > contributor manages to balance their time, or to argue common sense with > their > superiors, or get an approval for scipy work, or just ignores the issue > altogether -- but sooner or later there is a form to be filled or boxes to > be > checked, and scipy contributions simply do not fit in. A peer-reviewed > journal paper on scipy will help contributors get the academic credit they > deserve. > > We can write *the* paper for SciPy 1.0, with overall project structure, > goals, etc., and for specific features/modules a focus on say the last 3 > years. > > > History > ---------- > For SciPy 1.0 we had three targets on the publicity/credits front: an > interesting release announcement, interesting blogs/stories (NumFOCUS blog, > Hacker News, etc.) and a paper. We didn't have the bandwidth for a paper in > the end, the rest was successful. > > [1] is a previous announcement on this list about writing (a) paper(s) on > SciPy. We wanted both "short papers" to cover one or two releases (target > journal JOSS) and a full paper as the authoritative reference for SciPy. > > We had an earlier attempt for a "short paper", it's mostly written but has > stalled (see [2]). We ran out of steam on that one. To avoid that this time > around, it would be good to have a clear public plan, target dates, and a > small committee rather than one person to drive things forward. > > > Proposal > ------------ > Here's a proposal for all aspects of this exercise that I can think about > now. Some parts stolen from the AstroPy paper [3] (because their process > worked quite well). > > Form a small coordination committee of 3-5 people that set up the paper > structure, move things along when parts stall, propose/take decisions as > needed, invite co-authors, and organise paper submission/rework. > > Paper writing to be done by whoever volunteers for a section, not just the > coordination committee. First outline/structure to be done by committee, > which then asks for review of structure and volunteers for section writing. > > Scope: a 6-10 page paper, covering history, package scope and structure, > community/organisational aspects, key features and recent enhancements per > module, and roadmap. > > Authorship: anyone who made a substantial contribution in the history of > the project. Here "substantial" is interpreted as anything beyond a > one-line doc fix. Rationale: better to be too inclusive than exclusive. > Sign-up via a web form, we send the link to that form to all email > addresses in the commit history till v1.0. > > Author order (details tbd by committee): > 1. The SciPy Developers > 2. Maintainers, paper writers, other key contributors - in order of > contribution level > 3. All other authors - alphabetically ordered > > Submission target: mid-April, to either PeerJ Computer Science or Journal > of Open Research Software (tbd by committee). > > Comments? Volunteers for committee? > > > References > ---------------- > [1] https://mail.python.org/pipermail/scipy-dev/2016-August/021474.html > [2] https://github.com/scipy/scipy-articles/pull/4 > [3] https://github.com/astropy/astropy-v2.0-paper > > Cheers, > Ralf > > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From warren.weckesser at gmail.com Sun Jan 21 12:36:16 2018 From: warren.weckesser at gmail.com (Warren Weckesser) Date: Sun, 21 Jan 2018 12:36:16 -0500 Subject: [SciPy-Dev] Adding an argument to 'odeint' to control the order of the first two arguments of the user's function. Message-ID: In 'scipy.integrate', the 'ode' class and the function 'solve_ivp' assume that first two arguments of the functions provided by the user are 't' ("time") followed by 'y' (the current state). The older solver, 'odeint', expects those arguments to be in the opposite order. This is a nuisance for anyone who wants to use the same functions with the different solvers. To fix this, I've created a pull request in which the argument 'tfirst' is added to 'odeint': https://github.com/scipy/scipy/pull/8318. When 'tfirst' is True, 'odeint' uses the same convention as 'solve_ivp' and the 'ode' class. The change is implemented in the C code to avoid the overhead that would result from using a function wrapper with reversed arguments. Comments on the pull request are welcome. Warren -------------- next part -------------- An HTML attachment was scrubbed... URL: From andyfaff at gmail.com Sun Jan 21 15:26:53 2018 From: andyfaff at gmail.com (Andrew Nelson) Date: Mon, 22 Jan 2018 07:26:53 +1100 Subject: [SciPy-Dev] SciPy 1.0 paper writing proposal In-Reply-To: References: Message-ID: I'm obviously up for being on the paper :-), but I'd prefer to contribute by writing small sections/editing of a paper rather than coordinating. On 21 January 2018 at 14:08, Tyler Reddy wrote: > Sounds good -- I'd be up for committee work. Definitely +1 on a paper for > time justification, etc. > > On 20 January 2018 at 14:03, Ralf Gommers wrote: > >> Hi all, >> >> TL;DR, let's write the long journal paper on SciPy that we've wanted for >> a while, let's form a small committee to coordinate, and get it out the >> door in 2-3 months. >> >> >> Motivation >> --------------- >> (credits for most of this text: Evgeni) >> >> Many scipy contributors' day jobs are in academia. Bibliometry -- papers >> in >> refereed journals and citations of papers by other papers -- is one of >> the main >> performance indicators in most academic establishments. Since we do not >> generate papers, scipy contributions are all but invisible for the >> purposes of a >> contributor's annual report. Of course, details vary wildly; in many >> cases a >> contributor manages to balance their time, or to argue common sense with >> their >> superiors, or get an approval for scipy work, or just ignores the issue >> altogether -- but sooner or later there is a form to be filled or boxes >> to be >> checked, and scipy contributions simply do not fit in. A peer-reviewed >> journal paper on scipy will help contributors get the academic credit they >> deserve. >> >> We can write *the* paper for SciPy 1.0, with overall project structure, >> goals, etc., and for specific features/modules a focus on say the last 3 >> years. >> >> >> History >> ---------- >> For SciPy 1.0 we had three targets on the publicity/credits front: an >> interesting release announcement, interesting blogs/stories (NumFOCUS blog, >> Hacker News, etc.) and a paper. We didn't have the bandwidth for a paper in >> the end, the rest was successful. >> >> [1] is a previous announcement on this list about writing (a) paper(s) on >> SciPy. We wanted both "short papers" to cover one or two releases (target >> journal JOSS) and a full paper as the authoritative reference for SciPy. >> >> We had an earlier attempt for a "short paper", it's mostly written but >> has stalled (see [2]). We ran out of steam on that one. To avoid that this >> time around, it would be good to have a clear public plan, target dates, >> and a small committee rather than one person to drive things forward. >> >> >> Proposal >> ------------ >> Here's a proposal for all aspects of this exercise that I can think about >> now. Some parts stolen from the AstroPy paper [3] (because their process >> worked quite well). >> >> Form a small coordination committee of 3-5 people that set up the paper >> structure, move things along when parts stall, propose/take decisions as >> needed, invite co-authors, and organise paper submission/rework. >> >> Paper writing to be done by whoever volunteers for a section, not just >> the coordination committee. First outline/structure to be done by >> committee, which then asks for review of structure and volunteers for >> section writing. >> >> Scope: a 6-10 page paper, covering history, package scope and structure, >> community/organisational aspects, key features and recent enhancements per >> module, and roadmap. >> >> Authorship: anyone who made a substantial contribution in the history of >> the project. Here "substantial" is interpreted as anything beyond a >> one-line doc fix. Rationale: better to be too inclusive than exclusive. >> Sign-up via a web form, we send the link to that form to all email >> addresses in the commit history till v1.0. >> >> Author order (details tbd by committee): >> 1. The SciPy Developers >> 2. Maintainers, paper writers, other key contributors - in order of >> contribution level >> 3. All other authors - alphabetically ordered >> >> Submission target: mid-April, to either PeerJ Computer Science or Journal >> of Open Research Software (tbd by committee). >> >> Comments? Volunteers for committee? >> >> >> References >> ---------------- >> [1] https://mail.python.org/pipermail/scipy-dev/2016-August/021474.html >> [2] https://github.com/scipy/scipy-articles/pull/4 >> [3] https://github.com/astropy/astropy-v2.0-paper >> >> Cheers, >> Ralf >> >> >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at python.org >> https://mail.python.org/mailman/listinfo/scipy-dev >> >> > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -- _____________________________________ Dr. Andrew Nelson _____________________________________ -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Mon Jan 22 01:14:22 2018 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Mon, 22 Jan 2018 19:14:22 +1300 Subject: [SciPy-Dev] SciPy 1.0 paper writing proposal In-Reply-To: References:

Message-ID: On Mon, Jan 22, 2018 at 9:26 AM, Andrew Nelson wrote: > I'm obviously up for being on the paper :-), but I'd prefer to contribute > by writing small sections/editing of a paper rather than coordinating. > No worries, help of any kind is welcome, and no obligations of course. > On 21 January 2018 at 14:08, Tyler Reddy wrote: > >> Sounds good -- I'd be up for committee work. Definitely +1 on a paper for >> time justification, etc. >> > Thanks Tyler! Ralf >> On 20 January 2018 at 14:03, Ralf Gommers wrote: >> >>> Hi all, >>> >>> TL;DR, let's write the long journal paper on SciPy that we've wanted for >>> a while, let's form a small committee to coordinate, and get it out the >>> door in 2-3 months. >>> >>> >>> Motivation >>> --------------- >>> (credits for most of this text: Evgeni) >>> >>> Many scipy contributors' day jobs are in academia. Bibliometry -- papers >>> in >>> refereed journals and citations of papers by other papers -- is one of >>> the main >>> performance indicators in most academic establishments. Since we do not >>> generate papers, scipy contributions are all but invisible for the >>> purposes of a >>> contributor's annual report. Of course, details vary wildly; in many >>> cases a >>> contributor manages to balance their time, or to argue common sense with >>> their >>> superiors, or get an approval for scipy work, or just ignores the issue >>> altogether -- but sooner or later there is a form to be filled or boxes >>> to be >>> checked, and scipy contributions simply do not fit in. A peer-reviewed >>> journal paper on scipy will help contributors get the academic credit they >>> deserve. >>> >>> We can write *the* paper for SciPy 1.0, with overall project structure, >>> goals, etc., and for specific features/modules a focus on say the last 3 >>> years. >>> >>> >>> History >>> ---------- >>> For SciPy 1.0 we had three targets on the publicity/credits front: an >>> interesting release announcement, interesting blogs/stories (NumFOCUS blog, >>> Hacker News, etc.) and a paper. We didn't have the bandwidth for a paper in >>> the end, the rest was successful. >>> >>> [1] is a previous announcement on this list about writing (a) paper(s) >>> on SciPy. We wanted both "short papers" to cover one or two releases >>> (target journal JOSS) and a full paper as the authoritative reference for >>> SciPy. >>> >>> We had an earlier attempt for a "short paper", it's mostly written but >>> has stalled (see [2]). We ran out of steam on that one. To avoid that this >>> time around, it would be good to have a clear public plan, target dates, >>> and a small committee rather than one person to drive things forward. >>> >>> >>> Proposal >>> ------------ >>> Here's a proposal for all aspects of this exercise that I can think >>> about now. Some parts stolen from the AstroPy paper [3] (because their >>> process worked quite well). >>> >>> Form a small coordination committee of 3-5 people that set up the paper >>> structure, move things along when parts stall, propose/take decisions as >>> needed, invite co-authors, and organise paper submission/rework. >>> >>> Paper writing to be done by whoever volunteers for a section, not just >>> the coordination committee. First outline/structure to be done by >>> committee, which then asks for review of structure and volunteers for >>> section writing. >>> >>> Scope: a 6-10 page paper, covering history, package scope and structure, >>> community/organisational aspects, key features and recent enhancements per >>> module, and roadmap. >>> >>> Authorship: anyone who made a substantial contribution in the history of >>> the project. Here "substantial" is interpreted as anything beyond a >>> one-line doc fix. Rationale: better to be too inclusive than exclusive. >>> Sign-up via a web form, we send the link to that form to all email >>> addresses in the commit history till v1.0. >>> >>> Author order (details tbd by committee): >>> 1. The SciPy Developers >>> 2. Maintainers, paper writers, other key contributors - in order of >>> contribution level >>> 3. All other authors - alphabetically ordered >>> >>> Submission target: mid-April, to either PeerJ Computer Science or >>> Journal of Open Research Software (tbd by committee). >>> >>> Comments? Volunteers for committee? >>> >>> >>> References >>> ---------------- >>> [1] https://mail.python.org/pipermail/scipy-dev/2016-August/021474.html >>> [2] https://github.com/scipy/scipy-articles/pull/4 >>> [3] https://github.com/astropy/astropy-v2.0-paper >>> >>> Cheers, >>> Ralf >>> >>> >>> >>> _______________________________________________ >>> SciPy-Dev mailing list >>> SciPy-Dev at python.org >>> https://mail.python.org/mailman/listinfo/scipy-dev >>> >>> >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at python.org >> https://mail.python.org/mailman/listinfo/scipy-dev >> >> > > > -- > _____________________________________ > Dr. Andrew Nelson > > > _____________________________________ > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From antonior92 at gmail.com Mon Jan 22 06:09:30 2018 From: antonior92 at gmail.com (Antonio Ribeiro) Date: Mon, 22 Jan 2018 09:09:30 -0200 Subject: [SciPy-Dev] SciPy-Dev Digest, Vol 171, Issue 19 In-Reply-To: References: Message-ID: Hi all, Just to understand it better. I have some contributions before v1.0: https://github.com/scipy/scipy/pull/6404/commits/caa58db0a0695ec1cc0636ed85903449fef5d57a https://github.com/scipy/scipy/pull/6919/commits/e0e7e55502fb6fa0142b2f173f8de6b73b254594 Should I have received a web form? If it is the case, I should be happy to help writing about "scipy.optimize", if the committee decide to include some section about it. All the best, Ant?nio 2018-01-20 19:03 GMT-02:00 : > > Date: Sun, 21 Jan 2018 10:03:25 +1300 > From: Ralf Gommers > To: SciPy Developers List > Subject: [SciPy-Dev] SciPy 1.0 paper writing proposal > Message-ID: > bzKpos3WQ at mail.gmail.com> > Content-Type: text/plain; charset="utf-8" > > Hi all, > > TL;DR, let's write the long journal paper on SciPy that we've wanted for a > while, let's form a small committee to coordinate, and get it out the door > in 2-3 months. > > > Motivation > --------------- > (credits for most of this text: Evgeni) > > Many scipy contributors' day jobs are in academia. Bibliometry -- papers in > refereed journals and citations of papers by other papers -- is one of the > main > performance indicators in most academic establishments. Since we do not > generate papers, scipy contributions are all but invisible for the purposes > of a > contributor's annual report. Of course, details vary wildly; in many cases > a > contributor manages to balance their time, or to argue common sense with > their > superiors, or get an approval for scipy work, or just ignores the issue > altogether -- but sooner or later there is a form to be filled or boxes to > be > checked, and scipy contributions simply do not fit in. A peer-reviewed > journal paper on scipy will help contributors get the academic credit they > deserve. > > We can write *the* paper for SciPy 1.0, with overall project structure, > goals, etc., and for specific features/modules a focus on say the last 3 > years. > > > History > ---------- > For SciPy 1.0 we had three targets on the publicity/credits front: an > interesting release announcement, interesting blogs/stories (NumFOCUS blog, > Hacker News, etc.) and a paper. We didn't have the bandwidth for a paper in > the end, the rest was successful. > > [1] is a previous announcement on this list about writing (a) paper(s) on > SciPy. We wanted both "short papers" to cover one or two releases (target > journal JOSS) and a full paper as the authoritative reference for SciPy. > > We had an earlier attempt for a "short paper", it's mostly written but has > stalled (see [2]). We ran out of steam on that one. To avoid that this time > around, it would be good to have a clear public plan, target dates, and a > small committee rather than one person to drive things forward. > > > Proposal > ------------ > Here's a proposal for all aspects of this exercise that I can think about > now. Some parts stolen from the AstroPy paper [3] (because their process > worked quite well). > > Form a small coordination committee of 3-5 people that set up the paper > structure, move things along when parts stall, propose/take decisions as > needed, invite co-authors, and organise paper submission/rework. > > Paper writing to be done by whoever volunteers for a section, not just the > coordination committee. First outline/structure to be done by committee, > which then asks for review of structure and volunteers for section writing. > > Scope: a 6-10 page paper, covering history, package scope and structure, > community/organisational aspects, key features and recent enhancements per > module, and roadmap. > > Authorship: anyone who made a substantial contribution in the history of > the project. Here "substantial" is interpreted as anything beyond a > one-line doc fix. Rationale: better to be too inclusive than exclusive. > Sign-up via a web form, we send the link to that form to all email > addresses in the commit history till v1.0. > > Author order (details tbd by committee): > 1. The SciPy Developers > 2. Maintainers, paper writers, other key contributors - in order of > contribution level > 3. All other authors - alphabetically ordered > > Submission target: mid-April, to either PeerJ Computer Science or Journal > of Open Research Software (tbd by committee). > > Comments? Volunteers for committee? > > > References > ---------------- > [1] https://mail.python.org/pipermail/scipy-dev/2016-August/021474.html > [2] https://github.com/scipy/scipy-articles/pull/4 > [3] https://github.com/astropy/astropy-v2.0-paper > > Cheers, > Ralf > -------------- next part -------------- > An HTML attachment was scrubbed... > URL: attachments/20180121/523826fe/attachment.html> > > ------------------------------ > > Subject: Digest Footer > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > > ------------------------------ > > End of SciPy-Dev Digest, Vol 171, Issue 19 > ****************************************** > -------------- next part -------------- An HTML attachment was scrubbed... URL: From s.denaxas at gmail.com Mon Jan 22 07:33:21 2018 From: s.denaxas at gmail.com (Spiros Denaxas) Date: Mon, 22 Jan 2018 12:33:21 +0000 Subject: [SciPy-Dev] New feature - discrete Frechet distance for scipy.spatial.distance Message-ID: Dear SciPy devs, I recently decided to clean up and package a Python implementation [1] for calculating the discrete Frechet distance [2] between two polygonal curves that I had lying around. I was thinking this could potentially be a useful addition to scipy.spatial.distance. More than happy to do a pull request if you think it will be useful. Spiros [1] https://github.com/spiros/discrete_frechet [2] http://www.kr.tuwien.ac.at/staff/eiter/et-archive/cdtr9464.pdf -------------- next part -------------- An HTML attachment was scrubbed... URL: From serge.guelton at telecom-bretagne.eu Mon Jan 22 09:28:40 2018 From: serge.guelton at telecom-bretagne.eu (Serge Guelton) Date: Mon, 22 Jan 2018 15:28:40 +0100 Subject: [SciPy-Dev] Using Pythran to compile some of the scipy internals In-Reply-To: References: <20180120101034.GA3707@lakota> Message-ID: <20180122142840.GA22566@lakota> > Another potential benefit is to decrease the size of the binary distribution of > SciPy. Cython extensions are quite expensive in this respect. Do you have an > idea of how Pythran compares? Both in case of only float64 inputs, and with > templated inputs? A good example of the latter is scipy.ndimage.label, which is > a straightforward function that ends up being a 700kb .so Thanks for pointing out this aspect. There's still room for improvement here, but with current pythran's master, and same compilation flags, the cython implementation of `max_len_seq_inner` takes ~700kb while pythran's version takes ~200kb. I'm going to investigate that a bet more though, and probably post the results in this thread later on. > This adds an extra dependency on Pythran, which uses C++ as backend. > > > Only a build-time dependency. That's not my major worry from the maintenance > point of view. And a dependency to libstc++ too. > This increases the failure surface. Although alive since 2012 and being > tested a lot [1] on Linux (but scarcely on Windows), its is obviously > less mature than cython > > > This is a worry. We need Windows, Linux, macOS (officially supported, also the > 32-bit flavors) as well as less commonly used Unix/BSD-like platforms. > > I guess Windows needs separate testing, both with gcc and MSVC. But for all > other platforms, can you say something about portability based on the kind of > C++ Pythran generates? Portability on OSX never required extra work, it's just less tested than the Linux version. There was some C++11 support issue with MSVC compiler back when I tried, but that was some 2 years ago. I'll try again. > = Alternatives > > There is an experimental Pythran mode in cython[2] that uses Pythran as > a backend for numpy operations. Unfortunately it is still at early > stages and cannot translate calls to ``np.roll`` or ``np.sum(a[i,:] - b[j, > :])`` while Pythran supports it. > > Instead of translating Cython files, I could also focus on some > pure-python functions. I tested Pythran on the rosenbrock function and I > get good speedup (from 1.5x to 4x depending on vectorization being > enabled or not) there too. > > So yeah, that's a rather long introduction to probe the interest here > around that idea :-) > > > Interest in general, but there's a long way to go - our requirements are pretty > demanding. I have seen some comparisons between Cython, Pythran and Numba in > terms of performance and ease of use, but never a comprehensive comparison from > the point of view of library authors. I know Travis O. has an interest in > seeing Numba being adopted more widely, which will also need such a comparison. > It should cover at least: > > - portability > - performance > - maturity > - maintenance status (active devs, how quick do bugs get fixed after a release > with an issue) > - ease of use (@jit vs. Pythran comments vs. translate to .pyx syntax) > - size of generated binaries > - templating support for multiple dtypes > - debugging and optimization experience/tools Independently of the potential inclusion of Pythran in scipy, those are very valuable points. I'm 100% biased but would say that community and debugging are two weak points of Pythran, esp. when compared to Cython. ++ Serge From tyler.je.reddy at gmail.com Mon Jan 22 10:14:48 2018 From: tyler.je.reddy at gmail.com (Tyler Reddy) Date: Mon, 22 Jan 2018 08:14:48 -0700 Subject: [SciPy-Dev] New feature - discrete Frechet distance for scipy.spatial.distance In-Reply-To: References: Message-ID: There are a few Python implementations of discrete Frechet around (i.e., in MDA: https://www.mdanalysis.org/docs/documentation_pages/analysis/psa.html#MDAnalysis.analysis.psa.discrete_frechet ). Most of them use the quadratic time complexity dynamic programming approach, and it seems you have used that as well. I think there's probably interest in incorporating discrete Frechet in scipy since we already have directed_hausdorff, which is thematically similar. One thing that would be really impressive is if we could get a subquadratic implementation of discrete Frechet ( i.e., http://epubs.siam.org/doi/abs/10.1137/130920526 ), but these algorithms seem to be extraordinarily complicated -- indeed the authors of the paper I cited there are not aware of any working implementations in the wild & when I asked one I think they suggested it wasn't worth implementing because for most input data sizes the much simpler algorithm will still win. Perhaps this sort of thing could be summarized in the docstring. There are quite a few different flavors of Frechet distance (continuous, homotopic, weak, etc.). From a design standpoint it might makes sense to have some kind of keyword to allow expansion for those in the future. On 22 January 2018 at 05:33, Spiros Denaxas wrote: > Dear SciPy devs, > > I recently decided to clean up and package a Python implementation [1] for > calculating the discrete Frechet distance [2] between two polygonal curves > that I had lying around. > > I was thinking this could potentially be a useful addition to > scipy.spatial.distance. More than happy to do a pull request if you think > it will be useful. > > Spiros > > [1] https://github.com/spiros/discrete_frechet > [2] http://www.kr.tuwien.ac.at/staff/eiter/et-archive/cdtr9464.pdf > > > > _______________________________________________ > SciPy-Dev mailing list > SciPy-Dev at python.org > https://mail.python.org/mailman/listinfo/scipy-dev > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From davidmenhur at gmail.com Mon Jan 22 10:34:32 2018 From: davidmenhur at gmail.com (=?UTF-8?B?RGHPgGlk?=) Date: Mon, 22 Jan 2018 16:34:32 +0100 Subject: [SciPy-Dev] Using Pythran to compile some of the scipy internals In-Reply-To: <20180120101034.GA3707@lakota> References: <20180120101034.GA3707@lakota> Message-ID: On 20 January 2018 at 11:10, Serge Guelton wrote: > = Cost? > > This adds an extra dependency on Pythran, which uses C++ as backend. > This increases the failure surface. Although alive since 2012 and being > tested a lot [1] on Linux (but scarcely on Windows), its is obviously > less mature than cython > I think this is a blocker, but it seems it is on its way to be fixed: "Pythran supports Python *2.7* and also has a beta Python *3* support." scipy has a lot of Cython code, much of it fairly well tested, which can be very useful for testing Pythran itself. A question, does Pythran eliminate the need for storing intermediate results? Like (a*a).sum(axis=1) for a large a, will it store the full a-squared? (If it is, I'd be very sorry I didn't run into it earlier, and very happy I finally did). /David. -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Mon Jan 22 11:00:34 2018 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Tue, 23 Jan 2018 05:00:34 +1300 Subject: [SciPy-Dev] SciPy-Dev Digest, Vol 171, Issue 19 In-Reply-To: References:

Message-ID: On Tue, Jan 23, 2018 at 12:09 AM, Antonio Ribeiro wrote: > Hi all, > > Just to understand it better. I have some contributions before v1.0: > https://github.com/scipy/scipy/pull/6404/commits/ > caa58db0a0695ec1cc0636ed85903449fef5d57a > https://github.com/scipy/scipy/pull/6919/commits/ > e0e7e55502fb6fa0142b2f173f8de6b73b254594 > > Should I have received a web form? > Not yet because we haven't created it yet. But yes, the plan is that you will receive a link to a form on the email address in the commit history. > If it is the case, I should be happy to help writing about > "scipy.optimize", if the committee decide to include some section about it. > Thanks! Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From serge.guelton at telecom-bretagne.eu Mon Jan 22 17:34:57 2018 From: serge.guelton at telecom-bretagne.eu (Serge Guelton) Date: Mon, 22 Jan 2018 23:34:57 +0100 Subject: [SciPy-Dev] Using Pythran to compile some of the scipy internals In-Reply-To: References: <20180120101034.GA3707@lakota> Message-ID: <20180122223457.GA14580@lakota> On Mon, Jan 22, 2018 at 04:34:32PM +0100, Da?id wrote: > > On 20 January 2018 at 11:10, Serge Guelton > wrote: > > = Cost? > > This adds an extra dependency on Pythran, which uses C++ as backend. > This increases the failure surface. Although alive since 2012 and being > tested a lot [1] on Linux (but scarcely on Windows), its is obviously > less mature than cython > > > I think this is a blocker, but it seems it is on its way to be fixed: > > "Pythran supports Python 2.7 and also has a beta Python 3 support." > > scipy has a lot of Cython code, much of it fairly well tested, which can be > very useful for testing Pythran itself. Yes, I started integrating cython code from scipy into pythran testbed, after converting them back to Python. It did found some bugs in Pythran, so that's great! > A question, does Pythran eliminate the need for storing intermediate results? Yep, it turns code like a = b + 1 c = a * 2 into c = (b + 1) * 2 It of course also handles interaction with control flow to forward-substitute only if it does not triggers recomputation. > Like (a*a).sum(axis=1) for a large a, will it store the full a-squared? (If it > is, I'd be very sorry I didn't run into it earlier, and very happy I finally > did). Yes, it does perfom lazy evaluation. From rlucente at pipeline.com Mon Jan 22 18:25:39 2018 From: rlucente at pipeline.com (Robert Lucente) Date: Mon, 22 Jan 2018 18:25:39 -0500 (GMT-05:00) Subject: [SciPy-Dev] What we do struggle with is lack of progress on big-ticket items Message-ID: <1075549019.8362.1516663539259@wamui-cinderella.atl.sa.earthlink.net> >What we do struggle with is lack of progress on big-ticket items Could we get some of these "big-ticket" items to be graduate student projects as part of an independent study or ...? From andyfaff at gmail.com Mon Jan 22 23:15:21 2018 From: andyfaff at gmail.com (Andrew Nelson) Date: Tue, 23 Jan 2018 15:15:21 +1100 Subject: [SciPy-Dev] What we do struggle with is lack of progress on big-ticket items In-Reply-To: <1075549019.8362.1516663539259@wamui-cinderella.atl.sa.earthlink.net> References: <1075549019.8362.1516663539259@wamui-cinderella.atl.sa.earthlink.net> Message-ID: On 23 January 2018 at 10:25, Robert Lucente wrote: > >What we do struggle with is lack of progress on big-ticket items > > Could we get some of these "big-ticket" items to be graduate student > projects as part of an independent study or ...? > The difficult in progressing big ticket items looms large, the difficulty here cannot be underestimated. Big ticket items require several iterations which tests the patience of the proposer and reviewer, it's a large amount of effort on both ends. It requires the reviewer is familiar with the subject material (not a given), and that the proposer is familiar with the scipy workflow. Giving it to a grad student isn't necessarily going to solve either of those issues. I know on my part that I've been reluctant to review some PR's because I don't feel I have enough knowledge, and worry I would commit something that isn't good enough. But if there's no one else to do it, then what should happen? -------------- next part -------------- An HTML attachment was scrubbed... URL: From s.denaxas at gmail.com Tue Jan 23 04:38:31 2018 From: s.denaxas at gmail.com (Spiros Denaxas) Date: Tue, 23 Jan 2018 09:38:31 +0000 Subject: [SciPy-Dev] New feature - discrete Frechet distance for scipy.spatial.distance In-Reply-To: References: Message-ID: Hi Tyler Thank you for the feedback. I agree that a subquadratic implementation would be really cool but I havent seen one either and just by glancing at the paper realize that this would take a substantial amount of effort. What is the canonical way of specying the flavour ? the two options I can think of are: a) scipy.spatial.distance.continuous_frechet or b) scipy.spatial.distance.frechet(P,Q, type="continuous") best Spiros On Mon, Jan 22, 2018 at 3:14 PM, Tyler Reddy wrote: > There are a few Python implementations of discrete Frechet around (i.e., > in MDA: https://www.mdanalysis.org/docs/documentation_pages/ > analysis/psa.html#MDAnalysis.analysis.psa.discrete_frechet ). Most of > them use the quadratic time complexity dynamic programming approach, and it > seems you have used that as well. I think there's probably interest in > incorporating discrete Frechet in scipy since we already have > directed_hausdorff, which is thematically similar. > > One thing that would be really impressive is if we could get a > subquadratic implementation of discrete Frechet ( i.e., > http://epubs.siam.org/doi/abs/10.1137/130920526 ), but these algorithms > seem to be extraordinarily complicated -- indeed the authors of the paper I > cited there are not aware of any working implementations in the wild & when > I asked one I think they suggested it wasn't worth implementing because for > most input data sizes the much simpler algorithm will still win. Perhaps > this sort of thing could be summarized in the docstring. > > There are quite a few different flavors of Frechet distance (continuous, > homotopic, weak, etc.). From a design standpoint it might makes sense to > have some kind of keyword to allow expansion for those in the future. > > On 22 January 2018 at 05:33, Spiros Denaxas wrote: > >> Dear SciPy devs, >> >> I recently decided to clean up and package a Python implementation [1] >> for calculating the discrete Frechet distance [2] between two polygonal >> curves that I had lying around. >> >> I was thinking this could potentially be a useful addition to >> scipy.spatial.distance. More than happy to do a pull request if you think >> it will be useful. >> >> Spiros >> >> [1] https://github.com/spiros/discrete_frechet >> [2] http://www.kr.tuwien.ac.at/staff/eiter/et-archive/cdtr9464.pdf >> >> >> >> _______________________________________________ >> SciPy-Dev mailing list >> SciPy-Dev at python.org >> https://mail.python.org/mailman/listinfo/scipy-dev >> >> > -------------- next part -------------- An HTML attachment was scrubbed... URL: From rlucente at pipeline.com Tue Jan 23 04:57:01 2018 From: rlucente at pipeline.com (Robert Lucente - Pipeline.Com) Date: Tue, 23 Jan 2018 04:57:01 -0500 Subject: [SciPy-Dev] What we do struggle with is lack of progress on big-ticket items In-Reply-To: References: <1075549019.8362.1516663539259@wamui-cinderella.atl.sa.earthlink.net> Message-ID: <14f101d39430$84b127e0$8e1377a0$@pipeline.com> >good enough I am a newbie and have just been lurking. Perhaps we could put some bounds around unacceptable, good enough and perfection? I realize that it is hard to put words to this. Perhaps an example to kick off a conversation? Perhaps we could release features w/ tags like alpha, beta, GA and part of official release? Perhaps people could volunteer to just help someone w/ the SciPy workflow? Suggesting some ideas to stimulate conversation. From: Andrew Nelson [mailto:andyfaff at gmail.com] Sent: Monday, January 22, 2018 11:15 PM To: Robert Lucente ; SciPy Developers List Subject: Re: [SciPy-Dev] What we do struggle with is lack of progress on big-ticket items On 23 January 2018 at 10:25, Robert Lucente