From dagss at student.matnat.uio.no Sat Aug 1 05:57:15 2009 From: dagss at student.matnat.uio.no (Dag Sverre Seljebotn) Date: Sat, 1 Aug 2009 11:57:15 +0200 Subject: [SciPy-dev] SciPy Foundation In-Reply-To: References: Message-ID: I am going to play the devil's advocate here -- I'm not into this in order to make myself enemies, I just have some sincere questions. Joe Harrington wrote: > I define success as popular adoption in preference to commercial > packages. I believe in vote-with-your-feet: this goal will not be > reached until all aspects of the package and its presentation to the > world exceed those of our commercial competition. Scipy is now a > grass roots effort, but that takes it only so far. Other projects, > such as OpenOffice and Sage, don't follow this model and do produce > quality products that compete with commercial offerings, at least on > open-source platforms. Before we can even hope for that, we have to > do the following: > > - Public communication > - A real marketing plan > - Executing on that plan > - Web site geared toward multiple audiences, run by experts at that > kind of communication > - More webinars, conference booths, training, aimed at all levels > - Demos, testimonials, topical forums, all showcased A thing OpenOffice.org and Sage both have is a very clear sense of direction and a clearly stated goal. SciPy might also have that for all I know, but I must admit I haven't understood what it is in the past year following the SciPy and NumPy lists, and reading the SciPy site. But I have seen email threads asking what the SciPy goal is, without any clear resolution (?). The website says this: "SciPy is open-source software for mathematics, science, and engineering." Which of course says nothing at all. 
Someone asked me what SciPy is the other day, and while I more or less "know" when I'd try to look in SciPy for an algorithm (instead of going to, say, R, or netlib.org, or whatever), I was more or less forced to say that it is a "dumping ground for various algorithms people have found useful, with the link being them being either written in Python or wrapped for Python". That's probably an unfair description -- the point is: If one needs to formulate a two- or three-liner about SciPy, what would it be? Is it a goal to reimplement stuff in SciPy that's (for instance) already thriving in the open source R community, or is that not a goal? And so on. You might feel this is going off-topic, but I somehow feel that a very clear sense of direction is paramount when talking of these issues -- just look at the Sage project. Dag Sverre From josef.pktd at gmail.com Sat Aug 1 08:47:36 2009 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Sat, 1 Aug 2009 08:47:36 -0400 Subject: [SciPy-dev] SciPy Foundation In-Reply-To: References: Message-ID: <1cd32cbb0908010547y17385fb6sa790fddf73a37ba9@mail.gmail.com> On Sat, Aug 1, 2009 at 5:57 AM, Dag Sverre Seljebotn wrote: > I am going to play the devil's advocate here -- I'm not into this in order > to make myself enemies, I just have some sincere questions. > > Joe Harrington wrote: >> I define success as popular adoption in preference to commercial >> packages. I believe in vote-with-your-feet: this goal will not be >> reached until all aspects of the package and its presentation to the >> world exceed those of our commercial competition. Scipy is now a >> grass roots effort, but that takes it only so far. Other projects, >> such as OpenOffice and Sage, don't follow this model and do produce >> quality products that compete with commercial offerings, at least on >> open-source platforms. Before we can even hope for that, we have to >> do the following: >> > > > >> - Public communication >>
- A real marketing plan >> - Executing on that plan >> - Web site geared toward multiple audiences, run by experts at that >> kind of communication >> - More webinars, conference booths, training, aimed at all levels >> - Demos, testimonials, topical forums, all showcased > > A thing OpenOffice.org and Sage both have is a very clear sense of > direction and a clearly stated goal. > > SciPy might also have that for all I know, but I must admit I haven't > understood what it is in the past year following the SciPy and NumPy > lists, and reading the SciPy site. But I have seen email threads asking > what the SciPy goal is, without any clear resolution (?). > > The website says this: "SciPy is open-source software for mathematics, > science, and engineering." > > Which of course says nothing at all. Someone asked me what SciPy is the > other day, and while I more or less "know" when I'd try to look in SciPy > for an algorithm (instead of going to, say, R, or netlib.org, or > whatever), I was more or less forced to say that it is a "dumping ground > for various algorithms people have found useful, with the link being them > being either written in Python or wrapped for Python". I think scipy is pretty much the same as a collection of matlab toolboxes, either with more enhanced basic numerical algorithms (linalg, special, optimize, interpolate, sparse, fft, spatial) or toolboxes with wider applicability (stats including cluster, odr and maxentropy, signal, ndimage+stsci?). This misses weave. Which algorithms are actually included and some of the structure still reflect the "dumping ground for various algorithms people have found useful". And some parts don't look much used. There is still a lot of cleaning and testing to do, but the description by analogy to matlab toolboxes is pretty accurate, if a description by analogy is allowed. E.g. to understand more of scipy.signal, I started to read the help for matlab's signal toolbox.
That's my impression of scipy after working my way through some parts of it in the last year. > > That's probably an unfair description -- the point is: If one needs to > formulate a two- or three-liner about SciPy, what would it be? Is it a > goal to reimplement stuff in SciPy that's (for instance) already thriving > in the open source R community, or is that not a goal? And so on. For stats, I consider matlab and maybe gauss for econometrics as benchmark, not the coverage of a specialized language/package like R, but I'm no statistician and I don't know anyone personally that uses R. Josef > > You might feel this is going off-topic, but I somehow feel that a very > clear sense of direction is paramount when talking of these issues -- just > look at the Sage project. > > Dag Sverre > > _______________________________________________ > Scipy-dev mailing list > Scipy-dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > From josef.pktd at gmail.com Sat Aug 1 11:29:59 2009 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Sat, 1 Aug 2009 11:29:59 -0400 Subject: [SciPy-dev] nbinom.ppf In-Reply-To: <3d375d730907280956m5425d997u5b37d8a355047439@mail.gmail.com> References: <199F6194-4953-41F2-9F47-13CC4A5AD936@gmail.com> <3d375d730907271255i32283ae9j624e2a57b52fdac4@mail.gmail.com> <1cd32cbb0907271329q33bec67dqc120575454dbf3dc@mail.gmail.com> <3d375d730907271358s361cd41cke0804f1b80332c93@mail.gmail.com> <64EF39E2-8AF5-47C0-84D8-ADB4E063A211@gmail.com> <1cd32cbb0907271610w6986b256me9730f2ee3c436c5@mail.gmail.com> <3d375d730907271621m4aabe914l1536fd6a49da0ee5@mail.gmail.com> <1cd32cbb0907271811x411a32dbs4f11bf2f220f6781@mail.gmail.com> <3d375d730907280956m5425d997u5b37d8a355047439@mail.gmail.com> Message-ID: <1cd32cbb0908010829pb812229hb6d0203d6d2cb386@mail.gmail.com> On Tue, Jul 28, 2009 at 12:56 PM, Robert Kern wrote: > On Mon, Jul 27, 2009 at 20:11, wrote: >> That's better. 
It took me a while to understand the logic behind the >> way the ceiling error is corrected. The same pattern is also followed >> by the other discrete distributions that define a _ppf method. It is >> cleaner than the epsilon correction, but takes longer to figure out >> what it does. >> >> To understand the logic more easily and to be DRY, it would be better >> to replace the duplication of the _cdf method directly with a call to >> self._cdf. >> For example, in changeset 4673, Robert, you changed the _cdf method to >> use betainc instead of nbdtr, but not the _ppf method. Without the >> code duplication, partial corrections could be more easily avoided. >> >> Is there a reason not to call self._cdf instead? > > Nope. Go ahead. > > -- > Robert Kern > > "I have come to believe that the whole world is an enigma, a harmless > enigma that is made terrible by our own mad attempt to interpret it as > though it had an underlying truth." > -- Umberto Eco > for the record: dlaplace and boltzmann also fail the roundtrip test boltzmann (1.3999999999999999, 19) [ 1. 2. 3. 3.] [ 0. 1. 2. 3.] False True False [ 0. 3.] dlaplace (0.80000000000000004,) [-5. -4. -3. -2. -1. 1. 1. 2. 3. 4. 5.] [-5. -4. -3. -2. -1. 0. 1. 2. 3. 4. 5.] Josef From jh at physics.ucf.edu Sat Aug 1 12:20:17 2009 From: jh at physics.ucf.edu (Joe Harrington) Date: Sat, 01 Aug 2009 12:20:17 -0400 Subject: [SciPy-dev] SciPy Foundation In-Reply-To: (dagss@student.matnat.uio.no) References: Message-ID: Dag wrote: > I have seen email threads asking > what the SciPy goal is, without any clear resolution (?).
How's this for a goal/mission statement (for SciPy, IDL, and Matlab): (The toolstack) is a professional-quality numerical computation and visualization environment that supports convenient handling of numerical arrays, provides a rich set of basic tools and algorithms for science and engineering, and supports a variety of both general and discipline-specific application software. It is easy for numerically savvy teens to learn, but rich enough to support the most complex of professional applications. It can be run both non-interactively and interactively, with the latter featuring both GUI and rich command-line interfaces. It comes with full documentation, is easy to install and run on all popular platforms, has a strong online user community spanning all disciplines, and has commercial support and consulting. For SciPy, I'd replace the part after the last comma with "is free and open-source, supports cloud computing, and has options for commercial user support and consulting." One could add to the list of general features, such as symbolic manipulation, parallel processing, etc., but it's already getting long. For SciPy, some of this, of course, is not yet true, which is the point of the current thread. Another way of looking at it: For me, SciPy is a replacement for IDL that improves on it in some areas. No more, but no less. That doesn't say what it *is*, since it just begs the question, "what is IDL", but it does identify the space I'd like to see SciPy occupy. It occupies most of the space IDL occupied for me now, except for a few crucial areas. The main one is that enough of my colleagues use it that I can exchange codes with them. A code written in an interpreted language that your colleague does not use is not useful to them. If it's not useful to them, then the interest in your contribution is that much smaller. So, my goal is to make SciPy (the toolstack, not the package) *to them* be what IDL is to them today. 
That is a lot more than what IDL is to me, since I have more of a knack for computers than most of my colleagues. They need a one-touch install, hold-your-hand docs, GUIs, and so forth. They are also less interested in the linguistic improvements of Python over IDL. Or, they are until they really get coding, which is long after they make the decision to give it a spin. This is a good thing in a way, since it means that once they try it, they *really* like it. Most current SciPy users, I think, are savvy enough about computers that we can work around the shortcomings, but the next round of adopters will always be less savvy than the last, on the whole, hence the need for better and lower-level docs, professional packaging on all platforms, etc. --jh-- From cournape at gmail.com Sat Aug 1 12:21:10 2009 From: cournape at gmail.com (David Cournapeau) Date: Sun, 2 Aug 2009 01:21:10 +0900 Subject: [SciPy-dev] [SciPy-User] SciPy Foundation In-Reply-To: References: Message-ID: <5b8d13220908010921m49b218f2ga13b5b75a2aebaff@mail.gmail.com> Hi Joe, On Sat, Aug 1, 2009 at 2:06 AM, Joe Harrington wrote: > > I define success as popular adoption in preference to commercial > packages. I believe in vote-with-your-feet: this goal will not be > reached until all aspects of the package and its presentation to the > world exceed those of our commercial competition. Scipy is now a > grass roots effort, but that takes it only so far. Other projects, > such as OpenOffice and Sage, don't follow this model and do produce > quality products that compete with commercial offerings, at least on > open-source platforms. I am not sure openoffice is a good example, but I share the sentiment that something is missing in the organization of the community. I think it is very important to keep in mind that in any open source project, telling people what to do does not work well. Not everybody will share the same goals, are interested in scipy in the same way, etc...
So any structure should help people doing what they want for scipy's sake, but above all, should not alienate anyone who would have worked on scipy otherwise. It may just be rhetoric, but saying that "it would be nice for scipy to have this goal" instead of "we should do this" matters IMHO. Some of the things I am missing: - no quantifiable feedback from users: if we want to work on a set of features, we cannot prioritize. Likewise, we have very few statistics on usage, platforms, etc... OTOH, this is often hard to obtain for open source projects. - a scipy foundation: several times already, I have been asked privately to add some feature to scipy, generally things which take a few hours max, in exchange for some money. It is too much of a hassle to set up things to get money for a few hours' work, and frankly, for a few hours, I would prefer to ask people to give money to a scipy foundation instead. Something like the R foundation (http://www.r-project.org/foundation/main.html). A foundation with a legal status would make the situation much easier w.r.t. donations I believe. It should not be that hard to set up. - website: I think the root of the problem is lack of a dedicated person for it, a person with design skills ideally, to design a coherent graphic "chart" (not sure about the exact English word), etc... I don't know how to get volunteers for this: it seems like many projects manage to have such volunteers. About the more particular points you raised: > - Packaging > - Personal Package Archive or equivalent for every release of every > OS for the full toolstack (There are tools that do this but we > don't use them. NSF requires Metronome - http://nmi.cs.wisc.edu/ > - for funding most development grants, so right now we're not even > on NSF's radar.) > - Track record of having the whole toolstack installation "just >
work" in a few command lines or clicks for *everyone* > - Regular, scheduled releases of numpy and scipy > - Coordinated releases of numpy, scipy, and stable scikits into PPA system The problem of packaging is that it is hard to do well, but has no technically challenging part in it. And it usually does not fall into the "scratching one's itch" category, because once you know how to build the software, you are done and usually want to start using the damn thing. Worse, it needs to be done every time (every release). So this is fundamentally different than doc: having done a great packaging work for version N is useless after N+1 is out. It does not make sense to pay someone to do it once. Having some infrastructure would help: for example, something which automatically builds packages on a set of supported platforms. It has to be 100% automatic, so that pushing one button gets you the sources, builds the package, installs it, and tests it. This costs money and time to set up. > - Public communication > - A real marketing plan > - Executing on that plan > - Web site geared toward multiple audiences, run by experts at that > kind of communication > - More webinars, conference booths, training, aimed at all levels > - Demos, testimonials, topical forums, all showcased Concerning communication with users, I think that the mailing lists do not work well. It is ok for development, but it kinda sucks for helping average users. Since I have been working on the dark side for numpy/scipy-windows, I have been regularly using stackoverflow to ask for some obscure windows stuff. stackoverflow is a mix between a FAQ and wikipedia. It works extremely well, and the user experience is way above anything I have seen in this vein. Something like this to use for scipy/numpy would be extremely useful I believe. It is vastly superior to ML or wiki for focused problems ("how to do this in matlab", "how to install on this linux distribution", etc...).
As an example of usage, R has recently used the main website so that the most upvoted N R questions would be answered by R core developers (during an R conference I believe). This all feels much better than ML to me (again, as far as average user usage is concerned, not for developer communication). One website to handle all the user community, no need for complicated forum rules and all (everything works with search and tags). Stackoverflow works without any fixed hierarchy for many times more participants than we will ever have, and much broader topics than ours. They will soon have a dedicated solution for custom websites using the same stack - maybe something can be worked on as an open source project. David From jh at physics.ucf.edu Sat Aug 1 13:10:45 2009 From: jh at physics.ucf.edu (Joe Harrington) Date: Sat, 01 Aug 2009 13:10:45 -0400 Subject: [SciPy-dev] [SciPy-User] SciPy Foundation In-Reply-To: <5b8d13220908010921m49b218f2ga13b5b75a2aebaff@mail.gmail.com> (message from David Cournapeau on Sun, 2 Aug 2009 01:21:10 +0900) References: <5b8d13220908010921m49b218f2ga13b5b75a2aebaff@mail.gmail.com> Message-ID: [Replying only on scipy-dev, per the original post.] David wrote: > I think it is very important to keep in mind that in any open source > project, telling people what to do does not work well. Not everybody > will share the same goals, are interested in scipy in the same way, > etc... So any structure should help people doing what they want for > scipy's sake, but above all, should not alienate anyone who would > have worked on scipy otherwise. It may just be rhetoric, but saying > that "it would be nice for scipy to have this goal" instead of "we > should do this" matters IMHO. I think (hope!) that everyone understands that anything posted here is a personal opinion and that none of us feels we are in a position to give orders. Nobody is boss or supervisor to the whole list.
When I write, "We need...," of course I am writing "It is my opinion that we need," etc., but that gets tedious both to write and to read. Visions should be bold. That said, there do need to be goals, standards, etc. Those do translate into telling people what to do. I think the key point is that it must be the community, not any individual, that does the telling. For example, we are engaged in a discussion of a plan I floated. The list I posted is "my plan", but already we've added code to the funding umbrella and no doubt there will be more changes (I fully expected Robert Kern to flip out about my suggestion to remove functions from numpy...maybe he didn't read that far...I expect to lose that one.:-). I think that once it's the community's plan, we can say no to contributions that don't fit, that conflict with others, that are too slow or insufficient, and so on, because we will have the critical mass to replace those contributions with ones the community thinks are better. We see this already with the vigilant rejection of change requests to the numpy API and the review comment system on the doc wiki. We can and have to say no occasionally, to maintain our direction and our standards. We just have to be careful about it and make sure it is based on established community goals and norms, not one person's random opinion. More on some of your other points later... --jh-- From tgrav at mac.com Sat Aug 1 13:11:09 2009 From: tgrav at mac.com (Tommy Grav) Date: Sat, 01 Aug 2009 13:11:09 -0400 Subject: [SciPy-dev] SciPy Foundation In-Reply-To: References: Message-ID: <2FEC9DDC-1748-4C87-B5A4-6A8224E25D14@mac.com> On Aug 1, 2009, at 12:20 PM, Joe Harrington wrote: > Dag wrote: >> I have seen email threads asking >> what the SciPy goal is, without any clear resolution (?). > > For me, SciPy is a replacement for IDL that improves on it in some > areas. No more, but no less. I have been using python, numpy and matplotlib for a few years as part of my astronomy research. 
While I find numpy and matplotlib extremely useful, scipy just doesn't seem to help me much. I think the problem is that it is very unfocused. To me scipy is not a replacement of IDL, it is a python implementation of Numerical Recipes, but because of its lack of focus it has become very chaotic. So far I have only found use for the optimize.leastsq and spatial.KDTree packages from scipy. Packages like pyfits, pyraf, AstLib, etc. take care of the more astronomy related problems. So I would personally like to see scipy become a package that binds the numpy package to the more field specific packages, by providing numerical methods that are broadly applicable in many fields (e.g. least-squares minimization, KDTree implementation, Runge-Kutta and other types of integration schemes, differential equation solvers and so on). Making scipy into a tool for science and engineering is in my opinion too broad a goal. Making it into a set of tools that are usable in many fields and thus supporting development of field specific packages is, again in my opinion, the way to go. It narrows the focus and makes the project more self contained. Cheers Tommy Grav + ----------------------------------------------------------------------------+ Associate Researcher @ Dept. of Physics and Astronomy Johns Hopkins University + ----------------------------------------------------------------------------+ tgrav at pha.jhu.edu (410) 516-7683 http://web.mac.com/tgrav/Astronomy/Welcome.html + ----------------------------------------------------------------------------+ From dagss at student.matnat.uio.no Sat Aug 1 13:49:21 2009 From: dagss at student.matnat.uio.no (Dag Sverre Seljebotn) Date: Sat, 1 Aug 2009 19:49:21 +0200 Subject: [SciPy-dev] SciPy Foundation In-Reply-To: References: Message-ID: <704e22078868036a820177ce53122d48.squirrel@webmail.uio.no> Joe Harrington wrote: > Dag wrote: >> I have seen email threads asking >> what the SciPy goal is, without any clear resolution (?).
> > For me, SciPy is a replacement for IDL that improves on it in some > areas. No more, but no less. That doesn't say what it *is*, since it > just begs the question, "what is IDL", but it does identify the space > I'd like to see SciPy occupy. It occupies most of the space IDL > occupied for me now, except for a few crucial areas. The main one is > that enough of my colleagues use it that I can exchange codes with > them. A code written in an interpreted language that your colleague > does not use is not useful to them. If it's not useful to them, then > the interest in your contribution is that much smaller. So, my goal > is to make SciPy (the toolstack, not the package) *to them* be what > IDL is to them today. That is a lot more than what IDL is to me, > since I have more of a knack for computers than most of my colleagues. > They need a one-touch install, hold-your-hand docs, GUIs, and so > forth. They are also less interested in the linguistic improvements > of Python over IDL. Or, they are until they really get coding, which > is long after they make the decision to give it a spin. This is a > good thing in a way, since it means that once they try it, they > *really* like it. Most current SciPy users, I think, are savvy enough > about computers that we can work around the shortcomings, but the next > round of adopters will always be less savvy than the last, on the > whole, hence the need for better and lower-level docs, professional > packaging on all platforms, etc. I really, really want what you seem to want too. BUT, I'll continue my criticism, in the hope that something may come out of it. What you mention above seem to be A LOT of work (in particular "professional packaging on all platforms"), and as others have mentioned partly in conflict with the way people tend to view SciPy currently, and so on. As you say it is indeed the whole stack that is important. 
Still, part of what you write seems to be an effort to do what many are doing already: - EPD - Sage (currently maths focused, but it does bundle SciPy and integrating it better would ) - SPD (Sage without some of the math libs) - Python(x,y) These all bundle SciPy, but also sets up the whole stack, and can focus on the whole picture. Are you saying that you just want to do it better than these, through a foundation? Wouldn't it be better to direct any funding through one of these existing candidates? This post I've written on the Sage list is very related and is about SciPy vs. Sage: http://groups.google.com/group/sage-devel/msg/78e2a2032042d35b The parent thread is a bit long but lots of related material in there: http://groups.google.com/group/sage-devel/browse_thread/thread/bef2010f45984730/78e2a2032042d35b?#78e2a2032042d35b Dag Sverre From josef.pktd at gmail.com Sat Aug 1 16:10:06 2009 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Sat, 1 Aug 2009 16:10:06 -0400 Subject: [SciPy-dev] nbinom.ppf In-Reply-To: <1cd32cbb0908010829pb812229hb6d0203d6d2cb386@mail.gmail.com> References: <199F6194-4953-41F2-9F47-13CC4A5AD936@gmail.com> <3d375d730907271255i32283ae9j624e2a57b52fdac4@mail.gmail.com> <1cd32cbb0907271329q33bec67dqc120575454dbf3dc@mail.gmail.com> <3d375d730907271358s361cd41cke0804f1b80332c93@mail.gmail.com> <64EF39E2-8AF5-47C0-84D8-ADB4E063A211@gmail.com> <1cd32cbb0907271610w6986b256me9730f2ee3c436c5@mail.gmail.com> <3d375d730907271621m4aabe914l1536fd6a49da0ee5@mail.gmail.com> <1cd32cbb0907271811x411a32dbs4f11bf2f220f6781@mail.gmail.com> <3d375d730907280956m5425d997u5b37d8a355047439@mail.gmail.com> <1cd32cbb0908010829pb812229hb6d0203d6d2cb386@mail.gmail.com> Message-ID: <1cd32cbb0908011310t39fec09dg2c3037db02dbda72@mail.gmail.com> On Sat, Aug 1, 2009 at 11:29 AM, wrote: > On Tue, Jul 28, 2009 at 12:56 PM, Robert Kern wrote: >> On Mon, Jul 27, 2009 at 20:11, wrote: >>> That's better. 
It took me a while to understand the logic behind the >>> way the ceiling error is corrected. The same pattern is also followed >>> by the other discrete distributions that define a _ppf method. It is >>> cleaner than the epsilon correction, but takes longer to figure out >>> what it does. >>> >>> To understand the logic more easily and to be DRY, it would be better >>> to replace the duplication of the _cdf method directly with a call to >>> self._cdf. >>> For example, in changeset 4673, Robert, you changed the _cdf method to >>> use betainc instead of nbdtr, but not the _ppf method. Without the >>> code duplication, partial corrections could be more easily avoided. >>> >>> Is there a reason not to call self._cdf instead? >> >> Nope. Go ahead. >> >> -- >> Robert Kern >> >> "I have come to believe that the whole world is an enigma, a harmless >> enigma that is made terrible by our own mad attempt to interpret it as >> though it had an underlying truth." >> -- Umberto Eco >> > > > for the record: dlaplace and boltzmann also fail the roundtrip test > > boltzmann (1.3999999999999999, 19) > [ 1. 2. 3. 3.] > [ 0. 1. 2. 3.] > False True False [ 0. 3.] > dlaplace (0.80000000000000004,) > [-5. -4. -3. -2. -1. 1. 1. 2. 3. 4. 5.] > [-5. -4. -3. -2. -1. 0. 1. 2. 3. 4. 5.]
> > Josef > and planck >>> stats.planck.ppf(stats.planck.cdf(np.arange(10),.51),.51) array([ 1., 1., 2., 3., 4., 5., 7., 7., 8., 10.]) fixed in 5889 Josef From d_l_goldsmith at yahoo.com Sat Aug 1 17:56:52 2009 From: d_l_goldsmith at yahoo.com (David Goldsmith) Date: Sat, 1 Aug 2009 14:56:52 -0700 (PDT) Subject: [SciPy-dev] SciPy Foundation In-Reply-To: <2FEC9DDC-1748-4C87-B5A4-6A8224E25D14@mac.com> Message-ID: <642650.96553.qm@web52109.mail.re2.yahoo.com> --- On Sat, 8/1/09, Tommy Grav wrote: > Making scipy into a tool for science and engineering is in > my opinion > too broad a > goal. Making it into a set of tools that are usable in many > fields and > thus supporting > development of field specific packages is, again in my > opinion, the way > to go. Please clarify what you see as the difference between these two - to me, on the surface of it, your goal statement is no more "focused" nor "self-contained" than Joe's. Perhaps if you clarify what you see as the differences, we all may discover that your vision and Joe's actually aren't that far apart. DG > It narrows > the focus and makes the project more self contained. > > Cheers > Tommy Grav > + > ----------------------------------------------------------------------------+ > Associate Researcher @ Dept.
of Physics and Astronomy > Johns Hopkins University > + > ----------------------------------------------------------------------------+ > tgrav at pha.jhu.edu > (410) 516-7683 > http://web.mac.com/tgrav/Astronomy/Welcome.html > + > ----------------------------------------------------------------------------+ > > _______________________________________________ > Scipy-dev mailing list > Scipy-dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > From gael.varoquaux at normalesup.org Sat Aug 1 18:52:16 2009 From: gael.varoquaux at normalesup.org (Gael Varoquaux) Date: Sun, 2 Aug 2009 00:52:16 +0200 Subject: [SciPy-dev] SciPy Foundation In-Reply-To: <704e22078868036a820177ce53122d48.squirrel@webmail.uio.no> References: <704e22078868036a820177ce53122d48.squirrel@webmail.uio.no> Message-ID: <20090801225216.GB31380@phare.normalesup.org> On Sat, Aug 01, 2009 at 07:49:21PM +0200, Dag Sverre Seljebotn wrote: > As you say it is indeed the whole stack that is important. Still, part of > what you write seems to be an effort to do what many are doing already: > - EPD > - Sage (currently maths focused, but it does bundle SciPy and integrating > it better would ) > - SPD (Sage without some of the math libs) > - Python(x,y) > These all bundle SciPy, but also sets up the whole stack, and can focus on > the whole picture. > Are you saying that you just want to do it better than these, through a > foundation? Wouldn't it be better to direct any funding through one of > these existing candidates? > This post I've written on the Sage list is very related and is about SciPy > vs. Sage: > http://groups.google.com/group/sage-devel/msg/78e2a2032042d35b I am jumping in this discussion (something that I have been trying to avoid, because such discussions are very hard to drive to a useful point). I'll try to write a clear e-mail, to the point, however, as the previous discussion you are pointing to does not reflect my needs. 
On the various usecases and users =================================== I think that the discussion on the Sage mailing list, and a few points of the last e-mails I have seen on this mailing list, miss a very important point for many users of the scipy stack that I see around me: We want a tool, or a set of tools, to build our own entry points. We want more than an IDE like Matlab or Mathematica. We want to be able to use the tools separately, to do data mining on server logs, to build custom applications for, e.g., medical image analysis, or to control a physics experiment (there are a lot of talks at the scipy conference this year on this). Most of the scipy users are "even more applied than applied math" (golly, this sounds almost dirty ;> ). Building a reusable stack is why we need tools to be broken up, separating features. Scipy as a community and an umbrella project may benefit from an IDE, like matlab, or a web interface like the amazing one Sage has, but we don't want to bundle these features with the core numerical tools of scipy. Now this might actually concern only a fraction of users. Many users (including me) mostly use the scipy tool stack as a matlab/mathematica replacement. However, these users are not the main code contributors. If somebody develops an algorithm he wants to ship or to share, chances are he wants it not to be bound to a heavy platform, but more to a light core (hey, numpy is even shipped by default on macOSX and many linux distributions nowadays). An integrated environment as an entry point ============================================== Besides building a good set of tools and their documentation, we need to address two separate issues to make life easier for users: building an integrated environment (what I call an entry point) and building distributions.
It is tempting to do both at the same time; however, I think that if we collapse the two problems, we are going in the wrong direction: I want to be able to reuse the underlying technology of the integrated environment, for instance to build an astronomy-specific IDE, and I want to be able to contribute modules to it even if those modules are not distributed together. Like many people, my working environment is IPython. It suits my needs, and I get scientific results using it. However, I can see that it is not the best solution to guide a beginner. Inspired by matlab, IDL or mathematica, we have been dreaming of having an IDE for a long while. Last year, Enthought paid me to start work on making IPython GUI-friendly, to supply one of the missing bricks for assembling the tool stack into an IDE. I have been unable to work on this for a year, as it is not a priority for my research, but the effort lives on in the IPython repository, and it would be great to see an IDE built upon it, and improving it. An IDE for easy scientific development with Python would bring together tools such as a shell, easy access to documentation, and an editor (reinventing any one of these components might not be necessary). There is EPDLab, which is being developed in the ETS repository. I love the technology stack that it is built upon (ETS provides good tools for building GUIs, and IPython provides a very handy and powerful command line), and I am thus full of hope for EPDLab. I can see however that people might be afraid of using it, let alone contributing to it, as it bears strong Enthought branding. This is a pity, because in this case we have the chance of having a company's interest lying in the same direction as the community's. For a web environment, the Sage notebook is amazing.
Unfortunately, last time I looked, it was GPL licensed, which makes it unsuitable for my use, as the tools we use at the lab must be BSD, in order to be able to build (eventually) medical imaging products from them. But, from a more pragmatic point of view, the simplest thing to do to make it easier for a beginner to get started would be to improve the documentation on the web. I am not thinking of the documentation of specific packages, but more of describing how things fit together: giving the workflow, and pointing to the various main packages used for different things. We already have a lot of material on the webpages, but this material is not as 'sexy' as it could be, and not as to-the-point as possible. Sure, this is a lot of work too. Building standard distributions ================================= I am a huge fan of distributions. Every large applied lab I know ends up building a distribution mechanism. Without standard distributions, we cannot reuse each other's effort to distribute, and we also have huge friction in reusing each other's tools: installing on your computer may be easy, but if you have to worry whether your non-technical users will succeed in installing a tool, you start wondering whether you want to rely on the tool, or whether you are going to reimplement it. However, the other side of the problem is that distributions could end up developing tools that make use of the tight integration that they provide to solve numerical or usability problems quicker, while locking the users into the distribution. If I want to integrate an algorithm developed by another lab in a medical imaging platform, I cannot afford to drag in Sage, just like I cannot afford R, or Matlab, as they are dependencies that are too big. An IDE that works only on a distribution is not one that I will rely on for teaching.
This is why I believe that every single piece of code in a distribution should be usable outside of this distribution (and I applaud the SPD effort started by Ondrej and the SAGE guys). Concrete suggestions to ease the progress ========================================== Of course providing a consistent environment is a hard problem, but hey, this is a problem many of us face. I believe that we are making progress with many encouraging projects such as Sage, EPD, Python(x,y), or SPD. Establishing scientific environments in Python is an ambitious project; there will not be a one-size-fits-all solution and having many different approaches is healthy, as long as we keep it friendly and learn from all the efforts. I strongly believe that we will be getting more and more satisfactory solutions in the next years. Specifically, I would love to see an official umbrella project for BSD-licensed tools for building scientific projects with Python. As the "scipy" name is well branded (through the website, and the conference), we could call this the 'scipy project'. I would personally like to limit wheel reinvention and have preferred solutions for the various bricks (I am thinking of the unfortunate Chaco versus Matplotlib situation, where I have to depend on both libraries that complement each other). Back to the scipy foundation idea ================================== The idea of the scipy foundation is an idea that has been floating around for a while. If it is manned by a variety of people who express the wills and needs of users and developers of the scipy ecosystem, it could be a great thing. But I see two road blocks: first, as Robert points out, telling somebody what to do will not achieve anything. I am already way too busy scratching my own itches. Second, who will find the time to take care of this? And now, I have to catch up on sleep. 
Gaël From cournape at gmail.com Sat Aug 1 20:39:48 2009 From: cournape at gmail.com (David Cournapeau) Date: Sun, 2 Aug 2009 09:39:48 +0900 Subject: [SciPy-dev] SciPy Foundation In-Reply-To: <20090801225216.GB31380@phare.normalesup.org> References: <704e22078868036a820177ce53122d48.squirrel@webmail.uio.no> <20090801225216.GB31380@phare.normalesup.org> Message-ID: <5b8d13220908011739w7bf2eaf2tf5e652ca5a76bc7@mail.gmail.com> On Sun, Aug 2, 2009 at 7:52 AM, Gael Varoquaux wrote: > > Back to the scipy foundation idea > ================================== > > The idea of the scipy foundation is an idea that has been floating around > for a while. If it is manned by a variety of people who express the wills > and needs of users and developers of the scipy ecosystem, it could be a > great thing. But I see two road blocks: first, as Robert points out, > telling somebody what to do will not achieve anything. Having a foundation does not, by itself, imply telling people what to do. It is just a way to have a single point of entry for people who want to interact with the community, and to have the legal right to collect money. > I am already way > too busy scratching my own itches. Second, who will find the time to take > care of this? There is an inherent amount of bureaucracy involved with those things, but it does not always have to be done by the same people, and rotation works better for this than for code, I think. David From tgrav at mac.com Sun Aug 2 08:32:56 2009 From: tgrav at mac.com (Tommy Grav) Date: Sun, 02 Aug 2009 08:32:56 -0400 Subject: [SciPy-dev] SciPy Foundation In-Reply-To: <642650.96553.qm@web52109.mail.re2.yahoo.com> References: <642650.96553.qm@web52109.mail.re2.yahoo.com> Message-ID: <51FCDB99-5D9C-42D8-9864-658755C01C98@mac.com> On Aug 1, 2009, at 5:56 PM, David Goldsmith wrote: > > --- On Sat, 8/1/09, Tommy Grav wrote: > >> Making scipy into a tool for science and engineering is in my opinion >> too broad a goal.
Making it into a set of tools that are usable in many >> fields, and thus supporting development of field-specific packages, is, >> again in my opinion, the way to go. > > Please clarify what you see as the difference between these two - to me, > on the surface of it, your goal statement is no more "focused" nor > "self-contained" than Joe's. Perhaps if you clarify what you see as the > differences, we all may discover that your vision and Joe's actually > aren't that far apart. I don't think that Joe and I are that far apart either. My point (very badly formulated) was that trying to make scipy be a replacement for IDL or matlab is in my opinion not the right goal. IDL in particular has a lot of field-specific code available in it. I would like to see a structure where scipy provides the underlying code needed by many fields (like the Numerical Recipes codes) but stays away from providing field-specific code. Also, scipy should not venture into GUIs or provide an interactive environment like IDL (there are other packages that provide this). Just my opinion Tommy Grav From d_l_goldsmith at yahoo.com Sun Aug 2 14:58:11 2009 From: d_l_goldsmith at yahoo.com (David Goldsmith) Date: Sun, 2 Aug 2009 11:58:11 -0700 (PDT) Subject: [SciPy-dev] SciPy Foundation Message-ID: <274366.95245.qm@web52106.mail.re2.yahoo.com> OK, that helps. :-) Fine goal (between the two, I choose to remain neutral for now), but one comment: you say avoid a GUI, but the kind of "tool set" you describe would greatly benefit from (dare I say require) some sort of UI that makes it "easy" for the uninitiated (at the very least) to find the specific resources they need; IMO, for example, the UI LAPACK provides for this is a good example of how *not* to do it. DG From nmb at wartburg.edu Sun Aug 2 16:44:49 2009 From: nmb at wartburg.edu (Neil Martinsen-Burrell) Date: Sun, 02 Aug 2009 15:44:49 -0500 Subject: [SciPy-dev] SciPy Foundation In-Reply-To: <274366.95245.qm@web52106.mail.re2.yahoo.com> References: <274366.95245.qm@web52106.mail.re2.yahoo.com> Message-ID: <4A75FAC1.7070600@wartburg.edu> On 08/02/2009 01:58 PM, David Goldsmith wrote: >> I don't think that Joe and I are that far apart either. My point >> (very badly formulated) was that trying to make scipy be a >> replacement for IDL or matlab is in my opinion not the right goal. >> IDL in particular has a lot of field specific code available in it. >> I would like to see a structure where scipy provides the >> underlaying code needed by many fields (like the Numerical Recipes >> codes) but stay away from providing field specific code. Also >> scipy should not venture into GUI or provide an interactive >> environment like IDL (there are other packages that provide this). >> >> Just my opinion Tommy Grav > > OK, that helps.
:-) > > Fine goal (between the two, I choose to remain neutral for now), but > one comment: you say avoid a GUI, but the kind of "tool set" you > describe would greatly benefit from (dare I say require) some sort of > UI that makes it "easy" for the uninitiated (at the very least) to > find the specific resources they need; IMO, for example, the UI > LAPACK provides for this is a good example of how *not* to do it. This may be an instrumentality on the way to the "Goal of Scipy" (whatever that is) but I wanted to mention here the importance of reaching students with SciPy. Software vendors know this: if a student learns about a certain type of computing using your software, then they are likely to continue using your software throughout their career. Matlab has been stupendously good at this sort of marketing in engineering schools, where learning Matlab is seen by some as a *required* part of the curriculum, due to its industry dominance. Apropos of David's point about the relevance of a GUI, I think that in addition to the packaging, documentation and communication aspects of Joe's plan, an easy-to-install environment for interactive computation is important for teaching students with SciPy. When I taught an undergraduate class on Markov chains using numpy and scipy, it was hard for students to install scipy. Once they had it installed, they were able to be moderately productive in IDLE, but they missed some of the features of IPython (command completion, saved inputs and output). An interactive Python environment that allowed access to documentation, an editor and a rich interpreter would have made the uptake much easier for students. In the past, Alan has spoken strongly about the importance of the matrix class for teaching linear algebra and I want to echo his message about the importance of pedagogical usability for the continued adoption of the SciPy stack. 
Students who start using software in their classes will continue using that software throughout their careers, particularly so for something such as SciPy which has some significant advantages over its better-known competitors. I think that there is a tendency for active researchers to underestimate the importance of undergraduate-level learning and I hope that in this discussion, we will keep in mind the singular importance of that young audience. -Neil From vanforeest at gmail.com Sun Aug 2 16:55:46 2009 From: vanforeest at gmail.com (nicky van foreest) Date: Sun, 2 Aug 2009 22:55:46 +0200 Subject: [SciPy-dev] SciPy Foundation In-Reply-To: <4A75FAC1.7070600@wartburg.edu> References: <274366.95245.qm@web52106.mail.re2.yahoo.com> <4A75FAC1.7070600@wartburg.edu> Message-ID: Hi, my 2 cts: I completely agree. I try to "force" Python upon my students, and advise them to use Python(x,y). In my opinion this package is certainly a step in the right direction, as it makes using python/numpy/scipy very easy. Hopefully I am helping to raise zealots (of the right type :-) ) with this. bye Nicky From alan at ajackson.org Sun Aug 2 17:03:44 2009 From: alan at ajackson.org (alan at ajackson.org) Date: Sun, 2 Aug 2009 16:03:44 -0500 Subject: [SciPy-dev] SciPy Foundation In-Reply-To: <4A75FAC1.7070600@wartburg.edu> References: <274366.95245.qm@web52106.mail.re2.yahoo.com> <4A75FAC1.7070600@wartburg.edu> Message-ID: <20090802160344.11aa8999@ajackson.org> ---------------8< snip ------------------- >This may be an instrumentality on the way to the "Goal of Scipy" >(whatever that is) but I wanted to mention here the importance of >reaching students with SciPy. Software vendors know this: if a student >learns about a certain type of computing using your software, then they >are likely to continue using your software throughout their career. >Matlab has been stupendously good at this sort of marketing in >engineering schools, where learning Matlab is seen by some as a >*required* part of the curriculum, due to its industry dominance. > > ---------------8< snip ------------------- >-Neil I'd like to echo these comments. I have been working for several years now to get people in my company to use python, numpy, scipy, etc, and have made progress, but the biggest battle I fight is with the Matlab people.
Pretty nearly every person we hire just out of school, and every summer intern, comes to us as a Matlab user, due to the excellent job Matlab has done with cheap academic licenses. Anything we can do to get professors to start using the python suite, so that their students will learn and use it, will pay great dividends in the future. - Alan -- ----------------------------------------------------------------------- | Alan K. Jackson | To see a World in a Grain of Sand | | alan at ajackson.org | And a Heaven in a Wild Flower, | | www.ajackson.org | Hold Infinity in the palm of your hand | | Houston, Texas | And Eternity in an hour. - Blake | ----------------------------------------------------------------------- From gael.varoquaux at normalesup.org Sun Aug 2 17:09:42 2009 From: gael.varoquaux at normalesup.org (Gael Varoquaux) Date: Sun, 2 Aug 2009 23:09:42 +0200 Subject: [SciPy-dev] SciPy Foundation In-Reply-To: <4A75FAC1.7070600@wartburg.edu> References: <274366.95245.qm@web52106.mail.re2.yahoo.com> <4A75FAC1.7070600@wartburg.edu> Message-ID: <20090802210942.GC25001@phare.normalesup.org> On Sun, Aug 02, 2009 at 03:44:49PM -0500, Neil Martinsen-Burrell wrote: > I think that there is a tendency for active researchers to > underestimate the importance of undergraduate-level learning and I > hope that in this discussion, we will keep in mind the singular > importance of that young audience. That's all good and nice. I agree with you that it is important, and I am very happy to hear people talking about this, because it makes me hope that we will be getting more help to do this. If I work my ass off on an IDE, or more simply a GUI frontend, it won't help me get more work done (which means shooting papers out, to be cynical), and, in a few years, I will most likely not be doing any scientific Python anymore. On the other hand, if I work on something that is useful for my day-to-day work, I get some traction at the lab, and my sleepless face is more easily forgiven.
If I build an IDE that is of no use to our work, nobody cares, and for a good reason. This is not to say that we shouldn't be working on the IDE (I believe that I am one of the people who have actually written code to do this), but there is a lot of work to be done here, and making sure that we have a shell to do this, and interactive plotting, and good documentation is part of this work, and can be reused for direct research interests. Writing docs is also something that can help a lot; it does not require extensive technical knowledge, but it takes a lot of time. Actually, I must point out that I am quite unhappy, because I am very tired: I have spent the weekend fixing bugs on various open source projects (nipy and mayavi) and answering complicated users' questions. I find that being told that we are underestimating the importance of ease of use and ease of learning is unfair. This simply takes a lot of time and some of us are working on it. Gaël From tgrav at mac.com Sun Aug 2 17:22:54 2009 From: tgrav at mac.com (Tommy Grav) Date: Sun, 02 Aug 2009 17:22:54 -0400 Subject: [SciPy-dev] SciPy Foundation In-Reply-To: <4A75FAC1.7070600@wartburg.edu> References: <274366.95245.qm@web52106.mail.re2.yahoo.com> <4A75FAC1.7070600@wartburg.edu> Message-ID: <6F225505-2C39-4B32-AE49-6108FCCB2301@mac.com> On Aug 2, 2009, at 4:44 PM, Neil Martinsen-Burrell wrote: > This may be an instrumentality on the way to the "Goal of Scipy" > (whatever that is) but I wanted to mention here the importance of > reaching students with SciPy. Software vendors know this: if a student > learns about a certain type of computing using your software, then they > are likely to continue using your software throughout their career. > Matlab has been stupendously good at this sort of marketing in > engineering schools, where learning Matlab is seen by some as a > *required* part of the curriculum, due to its industry dominance.
> > Apropos of David's point about the relevance of a GUI, I think that in > addition to the packaging, documentation and communication aspects of > Joe's plan, an easy-to-install environment for interactive computation > is important for teaching students with SciPy. When I taught an > undergraduate class on Markov chains using numpy and scipy, it was hard > for students to install scipy. Once they had it installed, they were > able to be moderately productive in IDLE, but they missed some of the > features of IPython (command completion, saved inputs and output). An > interactive Python environment that allowed access to documentation, an > editor and a rich interpreter would have made the uptake much easier for > students. I agree with what you are saying, but I don't think scipy is the right package for this. The scipy package should in my opinion be like numpy, a self-contained package of methods that are frequently used in science and engineering. In a sense it should provide the applied math. Then one can have separate packages providing interpreters a la matlab and IDL that sit on top of the scipy package and other more field-specific packages. I think that in thinking of scipy as a replacement for IDL and Matlab, the project becomes too broad-reaching and it gets harder to get everyone to pull in approximately the same direction. > In the past, Alan has spoken strongly about the importance of the matrix > class for teaching linear algebra and I want to echo his message about > the importance of pedagogical usability for the continued adoption of > the SciPy stack. Students who start using software in their classes > will continue using that software throughout their careers, particularly > so for something such as SciPy which has some significant advantages > over its better-known competitors.
I think that there is a tendency for > active researchers to underestimate the importance of > undergraduate-level learning and I hope that in this discussion, we will > keep in mind the singular importance of that young audience. I agree again, but I also think that students should learn how to code in Python, not in Sage/Python(x,y)/Scipy. The more of the core language the student learns, the more powerful all the tools become. Tommy From d_l_goldsmith at yahoo.com Sun Aug 2 18:00:44 2009 From: d_l_goldsmith at yahoo.com (David Goldsmith) Date: Sun, 2 Aug 2009 15:00:44 -0700 (PDT) Subject: [SciPy-dev] Difficulty creating an example for illustrating memmap.flush() Message-ID: <928017.59766.qm@web52111.mail.re2.yahoo.com> Hi, folks. So, I'm trying to devise an example to illustrate memmap.flush(). It didn't work the way I was intuitively expecting (i.e., it didn't appear to do anything at all) so I went looking for a "ready-made" example and found: http://www.slideshare.net/enthought/python-for-scientific-computing-webinar-may-22-2009 which has an example on Slide 25. However, here's what I get when I try to duplicate the example:

Python 2.5.2 (r252:60911, Feb 21 2008, 13:11:45) [MSC v.1310 32 bit win32
>>> import numpy as np
>>> np.version.version
'1.3.0rc2'
>>> q = np.memmap('new_file.dat',mode='w+',shape=(2,5))
>>> q
memmap([[0, 0, 0, 0, 0],
       [0, 0, 0, 0, 0]], dtype=uint8)
>>> # Print out underlying file contents
... # Note: not using iPython, so have to use os.system
...
>>> import os
>>> os.system('type new_file.dat')
          0
>>> # Note: already a little different than Webinar Expl.
...
>>> # Next write ascii value for 'A' (65) into q
...
>>> q[:] = ord('A')
>>> q
memmap([[65, 65, 65, 65, 65],
       [65, 65, 65, 65, 65]], dtype=uint8)
>>> # Do I need to call flush before file is written to?
...
>>> os.system('type new_file.dat')
AAAAAAAAAA0
>>> # No! Does flushing change anything?
...
>>> q.flush()
>>> os.system('type new_file.dat')
AAAAAAAAAA0
>>> # No! Is it because I printed q before checking it on disc?
...
>>> # Start afresh, but don't print memmap before checking it
...
>>> r = np.memmap('new_file2.dat',mode='w+',shape=(2,5))
>>> os.system('type new_file2.dat') # "reproducibility check"
          0
>>> r[:] = ord('A')
>>> os.system('type new_file2.dat') # Checking file on disc immediately
AAAAAAAAAA0
>>> # File is updated without calling flush, indeed without any
... # intervening access to the memmap at all!

What gives? Is this a bug? If not, can someone please furnish me with an example that clearly and explicitly illustrates the function (and necessity) of memmap.flush()? Thanks! DG From d_l_goldsmith at yahoo.com Sun Aug 2 18:22:56 2009 From: d_l_goldsmith at yahoo.com (David Goldsmith) Date: Sun, 2 Aug 2009 15:22:56 -0700 (PDT) Subject: [SciPy-dev] SciPy Foundation Message-ID: <89797.28963.qm@web52109.mail.re2.yahoo.com> Not wishing to turn this into a mutual admiration society, but "+10" vis-a-vis everything Neil said! ;-) DG PS: I do feel, however, that a UI as rich as Neil implies, though it should be wholly supported both materially and "in spirit" by whatever entity takes responsibility for SciPy, should be "separable" from the SciPy "core," so that the latter is deliverable both with a "rich" UI and a "serviceable" UI. From 00ai99 at gmail.com Sun Aug 2 19:56:21 2009 From: 00ai99 at gmail.com (David Gowers) Date: Mon, 3 Aug 2009 09:26:21 +0930 Subject: [SciPy-dev] Difficulty creating an example for illustrating memmap.flush() In-Reply-To: <928017.59766.qm@web52111.mail.re2.yahoo.com> References: <928017.59766.qm@web52111.mail.re2.yahoo.com> Message-ID: <23f4e3390908021656q426d4820va1fce9e688ef8930@mail.gmail.com> On Mon, Aug 3, 2009 at 7:30 AM, David Goldsmith wrote: > > Hi, folks. So, I'm trying to devise an example to illustrate > memmap.flush().
It didn't work the way I was intuitively expecting (i.e., > it didn't appear to do anything at all) so I went looking for a "ready-made" > example and found: > > > http://www.slideshare.net/enthought/python-for-scientific-computing-webinar-may-22-2009 > > which has an example on Slide 25. However, here's what I get when I try to > duplicate the example: > > Python 2.5.2 (r252:60911, Feb 21 2008, 13:11:45) [MSC v.1310 32 bit > win32 > >>> import numpy as np > >>> np.version.version > '1.3.0rc2' > >>> q = np.memmap('new_file.dat',mode='w+',shape=(2,5)) > >>> q > memmap([[0, 0, 0, 0, 0], > [0, 0, 0, 0, 0]], dtype=uint8) > >>> # Print out underlying file contents > ... # Note: not using iPython, so have to use os.system > ... > >>> import os > >>> os.system('type new_file.dat') > 0 > >>> # Note: already a little different than Webinar Expl. > ... > >>> # Next write ascii value for 'A' (65) into q > ... > >>> q[:] = ord('A') > >>> q > memmap([[65, 65, 65, 65, 65], > [65, 65, 65, 65, 65]], dtype=uint8) > >>> # Do I need to call flush before file is written to? > ... > >>> os.system('type new_file.dat') > AAAAAAAAAA0 > >>> # No! Does flushing change anything? > ... > >>> q.flush() > >>> os.system('type new_file.dat') > AAAAAAAAAA0 > >>> # No! Is it because I printed q before checking it on disc? > ... > >>> # Start afresh, but don't print memmap before checking it > ... > >>> r = np.memmap('new_file2.dat',mode='w+',shape=(2,5)) > >>> os.system('type new_file2.dat') # "reproducibility check" > 0 > >>> r[:] = ord('A') > >>> os.system('type new_file2.dat') # Checking file on disc immediately > AAAAAAAAAA0 > >>> # File is updated without calling flush, indeed without any > ... # intervening access to the memmap at all! > > What gives? Is this a bug? If not, can someone please furnish me with an > example that clearly an explicitly illustrates the function (and necessity) > of memmap.flush()? > Sorry, I can't. 
I can say that I encountered this when I first discovered np.memmap and began using it for a lot of things. At one point, I didn't seem to need to flush; In other cases I did. I wonder whether it has to do with the underlying i/o buffering (eg. if your array is smaller than the buffer, all changes are written immediately) David -------------- next part -------------- An HTML attachment was scrubbed... URL: From d_l_goldsmith at yahoo.com Sun Aug 2 23:21:25 2009 From: d_l_goldsmith at yahoo.com (David Goldsmith) Date: Sun, 2 Aug 2009 20:21:25 -0700 (PDT) Subject: [SciPy-dev] Difficulty creating an example for illustrating memmap.flush() Message-ID: <723404.56571.qm@web52102.mail.re2.yahoo.com> Hi, David. Thanks for your (validating) reply. :-) Well, there's something more going on than just the size of the mmap or the size of the change to the mmap: I experimented with adding a shape (10,) complex128 array to shape (10, 10), (10, 10, 10), (10, 10, 10, 10), (10, 10, 10, 10, 10) mmaps (and their corresponding "flat" mmaps, shape (100,), etc.), both just in the last dimension using [9, 9, 9, 9, :], e.g., and everywhere using [:,:,:,:,:] - which worked! - and the file on disc was updated every time without ever having to call flush, not once! So, hopefully, someone who knows what's going on will chime in... DG --- On Sun, 8/2/09, David Gowers <00ai99 at gmail.com> wrote: > From: David Gowers <00ai99 at gmail.com> > Subject: Re: [SciPy-dev] Difficulty creating an example for illustrating memmap.flush() > To: "SciPy Developers List" > Date: Sunday, August 2, 2009, 4:56 PM > > > On Mon, Aug 3, 2009 at 7:30 AM, > David Goldsmith > wrote: > > > > Hi, folks. ?So, I'm trying to devise an example to > illustrate memmap.flush(). 
It didn't work the way I was intuitively expecting (i.e., it didn't appear to do anything at all) so I went looking for a "ready-made" example and found: > > > > > http://www.slideshare.net/enthought/python-for-scientific-computing-webinar-may-22-2009 > > > > which has an example on Slide 25. However, here's what I get when I try to duplicate the example: > > > > Python 2.5.2 (r252:60911, Feb 21 2008, 13:11:45) [MSC v.1310 32 bit > > win32 > > >>> import numpy as np > > >>> np.version.version > > '1.3.0rc2' > > >>> q = np.memmap('new_file.dat',mode='w+',shape=(2,5)) > > >>> q > > memmap([[0, 0, 0, 0, 0], > >        [0, 0, 0, 0, 0]], dtype=uint8) > > >>> # Print out underlying file contents > > ... # Note: not using iPython, so have to use os.system > > ... > > >>> import os > > >>> os.system('type new_file.dat') > > 0 > > >>> # Note: already a little different than Webinar Expl. > > ... > > >>> # Next write ascii value for 'A' (65) into q > > ... > > >>> q[:] = ord('A') > > >>> q > > memmap([[65, 65, 65, 65, 65], > >        [65, 65, 65, 65, 65]], dtype=uint8) > > >>> # Do I need to call flush before file is written to? > > ... > > >>> os.system('type new_file.dat') > > AAAAAAAAAA0 > > >>> # No! Does flushing change anything? > > ... > > >>> q.flush() > > >>> os.system('type new_file.dat') > > AAAAAAAAAA0 > > >>> # No! Is it because I printed q before checking it on disc? > > ... > > >>> # Start afresh, but don't print memmap before checking it > > ... > > >>> r = np.memmap('new_file2.dat',mode='w+',shape=(2,5)) > > >>> os.system('type new_file2.dat') # "reproducibility check" > > 0 > > >>> r[:] = ord('A') > > >>> os.system('type new_file2.dat') # Checking file on disc immediately > > AAAAAAAAAA0 > > >>> # File is updated without calling flush, indeed without any > > ... # intervening access to the memmap at all! > > > > What gives? Is this a bug?
?If not, can someone please > furnish me with an example that clearly an explicitly > illustrates the function (and necessity) of memmap.flush()? > Sorry, I can't. I can say that I > encountered this when I first discovered np.memmap and began > using it for a lot of things. At one point, I didn't > seem to need to flush; In other cases I did. > > > I wonder whether it has to do with the underlying i/o > buffering (eg. if your array is smaller than the buffer, all > changes are written immediately) > > David > > > -----Inline Attachment Follows----- > > _______________________________________________ > Scipy-dev mailing list > Scipy-dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > From charlesr.harris at gmail.com Sun Aug 2 23:48:21 2009 From: charlesr.harris at gmail.com (Charles R Harris) Date: Sun, 2 Aug 2009 21:48:21 -0600 Subject: [SciPy-dev] Difficulty creating an example for illustrating memmap.flush() In-Reply-To: <723404.56571.qm@web52102.mail.re2.yahoo.com> References: <723404.56571.qm@web52102.mail.re2.yahoo.com> Message-ID: On Sun, Aug 2, 2009 at 9:21 PM, David Goldsmith wrote: > > Hi, David. Thanks for your (validating) reply. :-) Well, there's > something more going on than just the size of the mmap or the size of the > change to the mmap: I experimented with adding a shape (10,) complex128 > array to shape (10, 10), (10, 10, 10), (10, 10, 10, 10), (10, 10, 10, 10, > 10) mmaps (and their corresponding "flat" mmaps, shape (100,), etc.), both > just in the last dimension using [9, 9, 9, 9, :], e.g., and everywhere using > [:,:,:,:,:] - which worked! - and the file on disc was updated every time > without ever having to call flush, not once! So, hopefully, someone who > knows what's going on will chime in... > The data in memmapped files is swapped in and out of memory by the operating system, so what happens to be in memory at any given time is OS dependent and also changes with demands for memory from other processes and such. 
It's a bunch of behind the scenes machinery and not predictable. Even a flush can be delayed. My guess is that you need to flush the data if you want it to go to disk, which might be the case if you have a program that needs to push out data and you want to minimise losses from power outages and such. Otherwise the data could just sit in memory and never get written out until the file is closed. Chuck From d_l_goldsmith at yahoo.com Mon Aug 3 00:04:11 2009 From: d_l_goldsmith at yahoo.com (David Goldsmith) Date: Sun, 2 Aug 2009 21:04:11 -0700 (PDT) Subject: [SciPy-dev] Difficulty creating an example for illustrating memmap.flush() Message-ID: <68517.77484.qm@web52108.mail.re2.yahoo.com> OK, thanks, Charles, that explains a lot (which I'll be sure to put in the docstring - don't worry, not all of it, just the "high points"), but: if anyone knows a way to actually _prevent_ assignment from writing to the file - and to force flush to do so right away - so that it may be illustrated with an example, that would still be appreciated (of course, I could "fudge" and call flush and then show the file contents, pretending that it hadn't already happened, but that would be dishonest.) DG --- On Sun, 8/2/09, Charles R Harris wrote: > From: Charles R Harris > Subject: Re: [SciPy-dev] Difficulty creating an example for illustrating memmap.flush() > To: "SciPy Developers List" > Date: Sunday, August 2, 2009, 8:48 PM > > > On Sun, Aug 2, 2009 at 9:21 PM, > David Goldsmith > wrote: > > > > Hi, David. Thanks for your (validating) reply.
:-) > Well, there's something more going on than just the > size of the mmap or the size of the change to the mmap: I > experimented with adding a shape (10,) complex128 array to > shape (10, 10), (10, 10, 10), (10, 10, 10, 10), (10, 10, 10, > 10, 10) mmaps (and their corresponding "flat" > mmaps, shape (100,), etc.), both just in the last dimension > using [9, 9, 9, 9, :], e.g., and everywhere using > [:,:,:,:,:] - which worked! - and the file on disc was > updated every time without ever having to call flush, not > once! So, hopefully, someone who knows what's going on > will chime in... > > > > The data in memmapped files is swapped in and out of memory > by the operating system, so what happens to be in memory at > any given time is OS dependent and also changes with demands > for memory from other processes and such. It's a bunch of > behind the scenes machinery and not predictable. Even a > flush can be delayed. My guess is that you need to flush the > data if you want it to go to disk, which might be the case > if you have a program that needs to push out data and you > want to minimise losses from power outages and such. > Otherwise the data could just sit in memory and never get > written out until the file is closed.
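A minimal sketch of the one thing flush() can be relied on for in the discussion above, assuming NumPy is installed: once flush() has been called, the changes made through the memmap have been handed to the file, so reading the file back shows them. It does not reproduce a case where the file lags behind the array before the flush, because that timing is left to the operating system. The helper name write_and_flush and the temporary file location are made up for illustration.

```python
import os
import tempfile

import numpy as np


def write_and_flush(path):
    """Fill a small uint8 memmap with ord('A'), flush it, return the file's bytes."""
    # mode='w+' creates (or overwrites) the file and maps it read/write.
    q = np.memmap(path, dtype=np.uint8, mode="w+", shape=(2, 5))
    q[:] = ord("A")
    # Ask for the in-memory changes to be written to the underlying file.
    q.flush()
    # Drop the reference so the mapping is released before the file is reopened.
    del q
    with open(path, "rb") as f:
        return f.read()


# Hypothetical location: a fresh temporary directory, so no real file is touched.
tmpdir = tempfile.mkdtemp()
demo_path = os.path.join(tmpdir, "new_file.dat")
contents = write_and_flush(demo_path)
print(contents)
```

On the systems tried in this thread the file already showed the new bytes before flush() was called; the sketch only checks the state of the file after the call, which is the part that does not depend on the OS's paging decisions.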
> > > Chuck > > > > > -----Inline Attachment Follows----- > > _______________________________________________ > Scipy-dev mailing list > Scipy-dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > From nmb at wartburg.edu Mon Aug 3 10:29:35 2009 From: nmb at wartburg.edu (Neil Martinsen-Burrell) Date: Mon, 03 Aug 2009 09:29:35 -0500 Subject: [SciPy-dev] SciPy Foundation In-Reply-To: <20090802210942.GC25001@phare.normalesup.org> References: <274366.95245.qm@web52106.mail.re2.yahoo.com> <4A75FAC1.7070600@wartburg.edu> <20090802210942.GC25001@phare.normalesup.org> Message-ID: <4A76F44F.7080201@wartburg.edu> On 08/02/2009 04:09 PM, Gael Varoquaux wrote: > On Sun, Aug 02, 2009 at 03:44:49PM -0500, Neil Martinsen-Burrell wrote: >> I think that there is a tendency for active researchers to >> underestimate the importance of undergraduate-level learning and I >> hope that in this discussion, we will keep in mind the singular >> importance of that young audience. > > That's all good and nice. I agree with you it is important, and I am very > happy to hear people talking about this, because it makes me hope that we > will be getting more help to do this. As I have time to spare apart from the teaching and researching duties that I need to do to keep *my* job, I am glad to volunteer my time for this effort. I have some things in mind for making Scipy accessible as a module within a Numerical Analysis or Scientific Computing course that I hope to work on within this calendar year. > If I work my ass off on an IDE, or more simply a GUI frontend, it won't > help me get more work done, which means shooting papers out, to be > cynical, and, in a few years, I will most likely not be doing any > scientific Python anymore. On the other hand, if I work on something that > is useful for my day to day work, I get some traction at the lab, and my > sleepless face is more easily forgiven. 
If I build an IDE that is of no > use to our work, nobody cares, and for a good reason. > > This is not to say that we shouldn't be working on the IDE, I believe > that I am one of the people that have actually written code to do this, > but there is a lot of work to be done here, and working on making sure > that we have a shell to do this, and interactive plotting, and good > documentation is part of this work, and can be reused for direct research > interests. Writing docs is also something that can help a lot, does not > require extensive technical knowledge and takes a lot of time. Indeed, you have highlighted one of the difficulties in depending on active domain scientists to create software projects: scratching one's itch is not selfish, but necessary for their career. As Joe mentioned about the Doc marathon funded through some of his grants, as his granting situation gets tighter, the funding that he is able to devote to SciPy development is drying up. I think that this is a persuasive argument for the establishment of a SciPy Foundation which can provide the organizational structure to pay willing developers for some of the code which they develop. In doing so, we provide an alternative system of rewards (however small) from the scientific career track. > Actually, I must point out that I am quite unhappy, because I am very > tired, I have spent the week end fixing bugs on various open source > projects (nipy and mayavi) and answering complicated users questions. I > find that to be told that we are underestimating the importance of ease > of use and ease of learning is unfair. This simply takes a lot of time > and some of us are working on it. I certainly appreciate your work fixing bugs on open-source scientific projects. Thank you. It was not my intent to say that anyone is "underestimating the importance of ease of use and ease of learning". My intent was to highlight an audience for Scipy that has significant importance for future uptake. 
-Neil From prabhu at aero.iitb.ac.in Mon Aug 3 13:20:28 2009 From: prabhu at aero.iitb.ac.in (Prabhu Ramachandran) Date: Mon, 03 Aug 2009 22:50:28 +0530 Subject: [SciPy-dev] SciPy Foundation In-Reply-To: References: Message-ID: <4A771C5C.4060702@aero.iitb.ac.in> On 07/31/09 22:36, Joe Harrington wrote: > About sixteen months ago, I launched the SciPy Documentation Project > and its Marathon. Dozens pitched in and now numpy docs are rapidly > approaching a professional level. The "pink wave" ("Needs Review" > status) is at 56% today! There is consensus among doc writers that > much of the rest can be labeled in the "unimportant" category, so > we're close to starting the review push (hold your fire, there is a > web site mod to be done first). > > We're also nearing the end of the summer, and it's time to look ahead. > The path for docs is clear, but the path for SciPy is not. I think > our weakest area right now is organization of the project. There is > no consensus-based plan for improvement of the whole toward a stated > goal, no centralized coordination of work, and no funded work focused > on many of our weaknesses, notwithstanding my doc effort and what > Enthought does for code. Thank you for your efforts! I believe I will be able to help this effort in various ways over the next few years from India as part of a large government grant. I do not have the time to discuss it here at the moment but I will be at SciPy09 and would love to discuss it there in person. I will also be talking briefly about our overall goals there. Specifically see: http://conference.scipy.org/abstract?id=13 regards, prabhu From d_l_goldsmith at yahoo.com Mon Aug 3 13:33:27 2009 From: d_l_goldsmith at yahoo.com (David Goldsmith) Date: Mon, 3 Aug 2009 10:33:27 -0700 (PDT) Subject: [SciPy-dev] SciPy Foundation Message-ID: <191449.13197.qm@web52108.mail.re2.yahoo.com> Thanks, Prabhu, this looks very promising! I look forward to your talk! 
DG --- On Mon, 8/3/09, Prabhu Ramachandran wrote: > From: Prabhu Ramachandran > Subject: Re: [SciPy-dev] SciPy Foundation > To: jh at physics.ucf.edu, scipy-dev at scipy.org > Cc: "Scipy Users" , "Astronomy Python" , numpy-discussion at scipy.org > Date: Monday, August 3, 2009, 10:20 AM > On 07/31/09 22:36, Joe Harrington > wrote: > > About sixteen months ago, I launched the SciPy > Documentation Project > > and its Marathon.? Dozens pitched in and now > numpy docs are rapidly > > approaching a professional level.? The "pink > wave" ("Needs Review" > > status) is at 56% today!? There is consensus > among doc writers that > > much of the rest can be labeled in the "unimportant" > category, so > > we're close to starting the review push (hold your > fire, there is a > > web site mod to be done first). > > > > We're also nearing the end of the summer, and it's > time to look ahead. > > The path for docs is clear, but the path for SciPy is > not.? I think > > our weakest area right now is organization of the > project.? There is > > no consensus-based plan for improvement of the whole > toward a stated > > goal, no centralized coordination of work, and no > funded work focused > > on many of our weaknesses, notwithstanding my doc > effort and what > > Enthought does for code. > > Thank you for your efforts! > > I believe I will be able to help this effort in various > ways over the > next few years from India as part of a large government > grant.? I do not > have the time to discuss it here at the moment but I will > be at SciPy09 > and would love to discuss it there in person.? I will > also be talking > briefly about our overall goals there.? Specifically > see: > > ? 
http://conference.scipy.org/abstract?id=13 > > regards, > prabhu > _______________________________________________ > Scipy-dev mailing list > Scipy-dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > From charlesr.harris at gmail.com Mon Aug 3 13:44:38 2009 From: charlesr.harris at gmail.com (Charles R Harris) Date: Mon, 3 Aug 2009 11:44:38 -0600 Subject: [SciPy-dev] SciPy Foundation In-Reply-To: <4A76F44F.7080201@wartburg.edu> References: <274366.95245.qm@web52106.mail.re2.yahoo.com> <4A75FAC1.7070600@wartburg.edu> <20090802210942.GC25001@phare.normalesup.org> <4A76F44F.7080201@wartburg.edu> Message-ID: On Mon, Aug 3, 2009 at 8:29 AM, Neil Martinsen-Burrell wrote: > On 08/02/2009 04:09 PM, Gael Varoquaux wrote: > > On Sun, Aug 02, 2009 at 03:44:49PM -0500, Neil Martinsen-Burrell wrote: > >> I think that there is a tendency for active researchers to > >> underestimate the importance of undergraduate-level learning and I > >> hope that in this discussion, we will keep in mind the singular > >> importance of that young audience. > > > > That's all good and nice. I agree with you it is important, and I am very > > happy to hear people talking about this, because it makes me hope that we > > will be getting more help to do this. > > As I have time to spare apart from the teaching and researching duties > that I need to do to keep *my* job, I am glad to volunteer my time for > this effort. I have some things in mind for making Scipy accessible as > a module within a Numerical Analysis or Scientific Computing course that > I hope to work on within this calendar year. > > > If I work my ass off on an IDE, or more simply a GUI frontend, it won't > > help me get more work done, which means shooting papers out, to be > > cynical, and, in a few years, I will most likely not be doing any > > scientific Python anymore. 
On the other hand, if I work on something that > > is useful for my day to day work, I get some traction at the lab, and my > > sleepless face is more easily forgiven. If I build an IDE that is of no > > use to our work, nobody cares, and for a good reason. > > > > This is not to say that we shouldn't be working on the IDE, I believe > > that I am one of the people that have actually written code to do this, > > but there is a lot of work to be done here, and working on making sure > > that we have a shell to do this, and interactive plotting, and good > > documentation is part of this work, and can be reused for direct research > > interests. Writing docs is also something that can help a lot, does not > > require extensive technical knowledge and takes a lot of time. > > Indeed, you have highlighted one of the difficulties in depending on > active domain scientists to create software projects: scratching one's > itch is not selfish, but necessary for their career. Linus on selfish apropos Microsoft contributing driver code to linux: I agree that it's driven by selfish reasons, but that's how all open source code gets written! We all "scratch our own itches". It's why I started Linux, it's why I started git, and it's why I am still involved. It's the reason for everybody to end up in open source, to some degree. So complaining about the fact that Microsoft picked a selfish area to work on is just silly. Of course they picked an area that helps them. That's the point of open source - the ability to make the code better for your particular needs, whoever the 'your' in question happens to be. Does anybody complain when hardware companies write drivers for the hardware they produce? No. That would be crazy. Does anybody complain when IBM funds all the POWER development, and works on enterprise features because they sell into the enterprise? No. That would be insane. 
So the people who complain about Microsoft writing drivers for their own virtualization model should take a long look in the mirror and ask themselves why they are being so hypocritical. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From ondrej at certik.cz Mon Aug 3 17:32:31 2009 From: ondrej at certik.cz (Ondrej Certik) Date: Mon, 3 Aug 2009 15:32:31 -0600 Subject: [SciPy-dev] SciPy Foundation In-Reply-To: <20090801225216.GB31380@phare.normalesup.org> References: <704e22078868036a820177ce53122d48.squirrel@webmail.uio.no> <20090801225216.GB31380@phare.normalesup.org> Message-ID: <85b5c3130908031432l6190c654t87dcb6c1ae44bb5e@mail.gmail.com> On Sat, Aug 1, 2009 at 4:52 PM, Gael Varoquaux wrote: [...] > For a web environment, the Sage notebook is amazing. Unfortunately last > time I looked, it was GPL licensed, which renders it improper for my use, > as the tools we use at the lab must be BSD, in order to be able to build > (eventually) medical imaging products from them one day. Actually, in this thread: http://groups.google.com/group/sage-devel/browse_thread/thread/65ca1e0489a0a980/ most (if not all) contributors to the Sage notebook agreed to release their code as BSD. The same about William being positive to license the build system as BSD too. So we can get lots of done by working on these things together with Sage. 
Ondrej From gael.varoquaux at normalesup.org Mon Aug 3 17:36:49 2009 From: gael.varoquaux at normalesup.org (Gael Varoquaux) Date: Mon, 3 Aug 2009 23:36:49 +0200 Subject: [SciPy-dev] SciPy Foundation In-Reply-To: <85b5c3130908031432l6190c654t87dcb6c1ae44bb5e@mail.gmail.com> References: <704e22078868036a820177ce53122d48.squirrel@webmail.uio.no> <20090801225216.GB31380@phare.normalesup.org> <85b5c3130908031432l6190c654t87dcb6c1ae44bb5e@mail.gmail.com> Message-ID: <20090803213649.GI32408@phare.normalesup.org> On Mon, Aug 03, 2009 at 03:32:31PM -0600, Ondrej Certik wrote: > On Sat, Aug 1, 2009 at 4:52 PM, Gael > Varoquaux wrote: > [...] > > For a web environment, the Sage notebook is amazing. Unfortunately last > > time I looked, it was GPL licensed, which renders it improper for my use, > > as the tools we use at the lab must be BSD, in order to be able to build > > (eventually) medical imaging products from them one day. > Actually, in this thread: > http://groups.google.com/group/sage-devel/browse_thread/thread/65ca1e0489a0a980/ > most (if not all) contributors to the Sage notebook agreed to release > their code as BSD. > The same about William being positive to license the build system as > BSD too. So we can get lots of done by working on these things > together with Sage. I can see that a lot of good things are coming out of Sage (the current Cython development frenzy was clearly helped by the needs of Sage). It is really nice to see our community (I am talking in the sense of a scientific Python community, agnostic of tools and distribution) growing. Cheers to these guys, that notebook is really amazing!
Gaël From ondrej at certik.cz Mon Aug 3 17:54:08 2009 From: ondrej at certik.cz (Ondrej Certik) Date: Mon, 3 Aug 2009 15:54:08 -0600 Subject: [SciPy-dev] SciPy Foundation In-Reply-To: <20090803213649.GI32408@phare.normalesup.org> References: <704e22078868036a820177ce53122d48.squirrel@webmail.uio.no> <20090801225216.GB31380@phare.normalesup.org> <85b5c3130908031432l6190c654t87dcb6c1ae44bb5e@mail.gmail.com> <20090803213649.GI32408@phare.normalesup.org> Message-ID: <85b5c3130908031454p5f75e950occc7a89553ace535@mail.gmail.com> On Mon, Aug 3, 2009 at 3:36 PM, Gael Varoquaux wrote: > On Mon, Aug 03, 2009 at 03:32:31PM -0600, Ondrej Certik wrote: >> On Sat, Aug 1, 2009 at 4:52 PM, Gael >> Varoquaux wrote: >> [...] >> > For a web environment, the Sage notebook is amazing. Unfortunately last >> > time I looked, it was GPL licensed, which renders it improper for my use, >> > as the tools we use at the lab must be BSD, in order to be able to build >> > (eventually) medical imaging products from them one day. > >> Actually, in this thread: > >> http://groups.google.com/group/sage-devel/browse_thread/thread/65ca1e0489a0a980/ > >> most (if not all) contributors to the Sage notebook agreed to release >> their code as BSD. > >> The same about William being positive to license the build system as >> BSD too. So we can get lots of done by working on these things >> together with Sage. > > I can see that a lot of good things are coming out of Sage (the current > Cython development frenzy was clearly helped by the needs of Sage). It is > really nice to see our community (I am talking in the sense of a > scientific Python community, agnostic of tools and distribution) growing. > > Cheers to these guys, that notebook is really amazing! Yep. And Cython is under a BSD-like (Apache) license too, so I think that for these basic tools that everyone needs (cython/notebook/build infrastructure) Sage is not against BSD at all.
Ondrej From sebastian.walter at gmail.com Tue Aug 4 04:00:26 2009 From: sebastian.walter at gmail.com (Sebastian Walter) Date: Tue, 4 Aug 2009 10:00:26 +0200 Subject: [SciPy-dev] SciPy Foundation In-Reply-To: <85b5c3130908031454p5f75e950occc7a89553ace535@mail.gmail.com> References: <704e22078868036a820177ce53122d48.squirrel@webmail.uio.no> <20090801225216.GB31380@phare.normalesup.org> <85b5c3130908031432l6190c654t87dcb6c1ae44bb5e@mail.gmail.com> <20090803213649.GI32408@phare.normalesup.org> <85b5c3130908031454p5f75e950occc7a89553ace535@mail.gmail.com> Message-ID: 2 cents from an outsider who thought about contributing to scipy/scikits (but didn't (yet)): I think it is a good idea to make scipy easy to use for beginners. However, after reading this thread, I have the impression that it is not the goal to provide state of the art algorithms but rather making Scipy as popular as possible by putting money and effort into the "marketing" of Scipy. Don't get me wrong, I think there are some good reasons why a project should strive for a large user base. Some of the best projects are popular. Alas, correlation does not imply causality. I, for instance, would rather see more effort put into getting state of the art algorithms implemented in Scipy because that's something that would make a real difference in my research work. Of course, targeting the "clueless Matlab" users is quite pointless if that is what you are after. IMHO the way to go is to convince experts to implement their research prototypes as part of scipy. Then you really get some "killer applications". I could name a few people who are coding some cool state of the art algorithms but waste so much time because they started coding directly in C++. In the meantime, they could have implemented the algorithms in Python _and_ in C++. If scipy had something really good that Matlab etc. do not have: guess what ppl would do....
What would you need to get experts contribute to scipy instead of hacking their prototype in Matlab or C++? I can't speak for everyone, so I'll just say what I think (and feel): I would instantly start "contributing research prototypes" to scipy if scipy offered: 1) an easy, modular and flexible build system (fortran, c, c++, D, swig, boost:python, cython,...) 2) very low entry barrier for possible contributors: a simple checkout, then ./manage.py startapp mycoolmodule and everything is ready to go ( "Start coding in 5 minutes!") 3) a distributed version control system (e.g. git). SVN really scares me off... 4) standardized unit tests 5) automated documentation generation Then I could simply 1) fork the master branch 2) ./manage.py startapp mycoolmodule 3) adjust config files that were written in ./scipy/mycoolmodule/config.py 4) start coding 5) share the experimental code with collaborators or interested users who are not afraid to use experimental code 6) eventually, when the project has matured, hope that it gets included in the master branch hope that made sense, Sebastian On Mon, Aug 3, 2009 at 11:54 PM, Ondrej Certik wrote: > On Mon, Aug 3, 2009 at 3:36 PM, Gael > Varoquaux wrote: >> On Mon, Aug 03, 2009 at 03:32:31PM -0600, Ondrej Certik wrote: >>> On Sat, Aug 1, 2009 at 4:52 PM, Gael >>> Varoquaux wrote: >>> [...] >>> > For a web environment, the Sage notebook is amazing. Unfortunately last >>> > time I looked, it was GPL licensed, which renders it improper for my use, >>> > as the tools we use at the lab must be BSD, in order to be able to build >>> > (eventually) medical imaging products from them one day. >> >>> Actually, in this thread: >> >>> http://groups.google.com/group/sage-devel/browse_thread/thread/65ca1e0489a0a980/ >> >>> most (if not all) contributors to the Sage notebook agreed to release >>> their code as BSD. >> >>> The same about William being positive to license the build system as >>> BSD too. 
So we can get lots of done by working on these things >>> together with Sage. >> >> I can see that a lot of good things are coming out of Sage (the current >> Cython development frenzy was clearly helped by the needs of Sage). It is >> really nice to see our community (I am talking in the sens of a >> scientific Python community, agnostic of tools and distribution) growing. >> >> Cheers to these guys, that notebook is really amazing! > > Yep. And Cython is BSD like (resp Apache) license too, so I think that > for these basic tools that everyone needs (cython/notebook/build > infrustructure) Sage is not against BSD at all. > > Ondrej > _______________________________________________ > Scipy-dev mailing list > Scipy-dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > From gael.varoquaux at normalesup.org Tue Aug 4 04:18:25 2009 From: gael.varoquaux at normalesup.org (Gael Varoquaux) Date: Tue, 4 Aug 2009 10:18:25 +0200 Subject: [SciPy-dev] SciPy Foundation In-Reply-To: References: <704e22078868036a820177ce53122d48.squirrel@webmail.uio.no> <20090801225216.GB31380@phare.normalesup.org> <85b5c3130908031432l6190c654t87dcb6c1ae44bb5e@mail.gmail.com> <20090803213649.GI32408@phare.normalesup.org> <85b5c3130908031454p5f75e950occc7a89553ace535@mail.gmail.com> Message-ID: <20090804081825.GC17519@phare.normalesup.org> On Tue, Aug 04, 2009 at 10:00:26AM +0200, Sebastian Walter wrote: > Me for instance, would rather like to see more efforts to get state of > the art algorithms to be implemented in Scipy because that's something > that would make a real difference in my research work. On this side, we are hiring a talented engineer to work on machine learning in scipy, via the scikit learn. We already have the algorithm, it is a question of QAing them, integrating them in the scikit, writing docs and making releases. 
Gaël From david at ar.media.kyoto-u.ac.jp Tue Aug 4 04:35:02 2009 From: david at ar.media.kyoto-u.ac.jp (David Cournapeau) Date: Tue, 04 Aug 2009 17:35:02 +0900 Subject: [SciPy-dev] SciPy Foundation In-Reply-To: References: <704e22078868036a820177ce53122d48.squirrel@webmail.uio.no> <20090801225216.GB31380@phare.normalesup.org> <85b5c3130908031432l6190c654t87dcb6c1ae44bb5e@mail.gmail.com> <20090803213649.GI32408@phare.normalesup.org> <85b5c3130908031454p5f75e950occc7a89553ace535@mail.gmail.com> Message-ID: <4A77F2B6.8040108@ar.media.kyoto-u.ac.jp> Sebastian Walter wrote: > 2 cents from an outsider who thought about contributing to > scipy/scikits (but didn't (yet)): > > I think it is a good idea to make scipy easy to use for beginners. > However, after reading this thread, I have the impression that it is > not the goal to provide state of the art algorithms but rather making > Scipy as popular as possible by putting money and effort into the > "marketing" of Scipy. > Don't get me wrong, I think there are some good reasons why a project > should strive for a large user base. Some of the best projects are > popular. > Alas, correlation does not imply causality. > > I, for instance, would rather see more effort put into getting state of > the art algorithms implemented in Scipy because that's something > that would make a real difference in my research work. Of course, > targeting the "clueless Matlab" users is quite pointless if that is > what you are after. > One point which has not been mentioned concerning matlab-like environments - maybe it is obvious and everyone implicitly acknowledges it, but Mathworks is a 30-year-old company, with more than 1000 people today. Building something like matlab, with a good GUI and top notch documentation takes a huge amount of resources, of which the 'useful' code is only a fraction.
I of course don't know the details of the matlab
implementation, but I know that for music-oriented software (which needs
a good UI to sell well, and has non-trivial computational requirements,
so the comparison is not totally stupid), the graphical code is 80% of
the code. This ratio is consistent with the big open source audio
software as well (ardour, rosegarden). Worse, being cross-platform
makes the problem much more difficult. In the music software market, mac
os x is rarely ignored (~40-50% of the market I believe), so people
need to support two platforms, and that's really a lot of work. For
scientific software, I think you can go the non-native route for the
graphical toolkit, though.

Also, very little open source software is successful as far as good GUIs
are concerned (I don't want to enter into a debate here, but there are
good documents/studies on this topic). You need financial incentive for
this, so only projects backed by big companies have managed to pull it off.

IOW, I am pretty pessimistic about being a 'matlab' clone. We should
rather shoot for what makes numpy/scipy better (extensibility, cross
platform, actual language, etc...), because really, matlab will always
be a much better matlab than us. Price and licensing are not good enough
to justify migration - if what you want is a free matlab clone, why not
use octave or scilab anyway?

That does NOT mean that we should not aim at making the software more
accessible. I (and I guess other developers) am definitely interested
in a more product-like, integrated stack, to lower the barrier to entry.
I for example am really tired of the installation problems
consistently reported. I feel like we cover mac os x and windows pretty
well now, but the linux situation is still dreadful. I have a few ideas
on how to improve the situation, but they all require quite a bit of
work/infrastructure.
I hope that soon, the scenario "I see this cool
python script on the internet, it requires this numpy/scipy thing, can I
try it in 2 minutes?" will be a reality.

> Then you really get some "killer applications". I could name a few
> people who are coding some cool state of the art algorithms but waste
> so much time because they started coding directly in C++. In the
> meantime, they could have implemented the algorithms in Python _and_
> in C++. If scipy had something really good that Matlab etc. do not
> have: guess what ppl would do....
>

Yes, there are a lot of people who still don't know that there are
languages outside Fortran, C and C++. In my field, I still see some
people who implement parsers in C...

> 1) an easy, modular and flexible build system (fortran, c, c++, D,
> swig, boost:python, cython,...)
>

You mean like numscons :) ? Adding D support to numscons should be easy.
For example, I added initial cython support in a couple of minutes
during the cython talk at SciPy08; adding new languages is relatively
easy thanks to scons.

> 2) very low entry barrier for possible contributors:
> a simple checkout, then ./manage.py startapp mycoolmodule
> and everything is ready to go ( "Start coding in 5 minutes!")
>

There are various pieces to enable this (in-place build, the develop
command of setuptools, virtualenv/pip/easy_install), but yes, the
situation is kind of messy. For scikits, that's not so difficult - you
should be able to implement a trivial scikit by copying the
scikits.example package and starting from there.

One problem is that it is technically impossible to build in place and
test in one go because of a nose limitation ATM (for some reason, nose
fails to import a package if it is in the current directory).

> 3) a distributed version control system (e.g. git). SVN really scares me off...
>

That's a sensitive issue, I think we should avoid starting this one here
:) Needless to say, you can use git-svn - several core developers use it
for numpy/scipy dev, and we distribute an official import:

http://projects.scipy.org/numpy/browse_git

At least I have not touched svn for numpy/scipy development for more
than 6 months now, except to check releases when I tag them.

> 4) standardized unit tests
>

What do you mean exactly here? We use nose for testing; what do you
consider "non standard"?

> 5) automated documentation generation
>

It is almost automated now - but an example for scikits is missing in
the example package :)

cheers,

David

From sebastian.walter at gmail.com Tue Aug 4 05:25:55 2009
From: sebastian.walter at gmail.com (Sebastian Walter)
Date: Tue, 4 Aug 2009 11:25:55 +0200
Subject: [SciPy-dev] SciPy Foundation
In-Reply-To: <4A77F2B6.8040108@ar.media.kyoto-u.ac.jp>
References: <704e22078868036a820177ce53122d48.squirrel@webmail.uio.no> <20090801225216.GB31380@phare.normalesup.org> <85b5c3130908031432l6190c654t87dcb6c1ae44bb5e@mail.gmail.com> <20090803213649.GI32408@phare.normalesup.org> <85b5c3130908031454p5f75e950occc7a89553ace535@mail.gmail.com> <4A77F2B6.8040108@ar.media.kyoto-u.ac.jp>
Message-ID:

On Tue, Aug 4, 2009 at 10:35 AM, David Cournapeau wrote:
> Sebastian Walter wrote:
>> 2 cents from an outsider who thought about contributing to
>> scipy/scikits (but didn't (yet)):
>>
>> I think it is a good idea to make scipy easy to use for beginners.
>> However, after reading this thread, I have the impression that it is
>> not the goal to provide state of the art algorithms but rather making
>> Scipy as popular as possible by putting money and effort into the
>> "marketing" of Scipy.
>> Don't get me wrong, I think there are some good reasons why a project
>> should strive for a large user base. Some of the best projects are
>> popular.
>> Alas, correlation does not imply causality.
>
>> 2) very low entry barrier for possible contributors:
>> a simple checkout, then ./manage.py startapp mycoolmodule
>> and everything is ready to go ( "Start coding in 5 minutes!")
>>
>
> There are various pieces to enable this (in-place build, the develop
> command of setuptools, virtualenv/pip/easy_install), but yes, the
> situation is kind of messy. For scikits, that's not so difficult - you
> should be able to implement a trivial scikit by copying the
> scikits.example package and starting from there.
>
> One problem is that it is technically impossible to build in place and
> test in one go because of a nose limitation ATM (for some reason, nose
> fails to import a package if it is in the current directory).
>
>> 3) a distributed version control system (e.g. git). SVN really scares me off...
>>
>
> That's a sensitive issue, I think we should avoid starting this one here
> :) Needless to say, you can use git-svn - several core developers use it
> for numpy/scipy dev, and we distribute an official import:
>
> http://projects.scipy.org/numpy/browse_git
>
> At least I have not touched svn for numpy/scipy development for more
> than 6 months now, except to check releases when I tag them.
>
>> 4) standardized unit tests
>>
>
> What do you mean exactly here? We use nose for testing; what do you
> consider "non standard"?
>
>> 5) automated documentation generation
>>
>
> It is almost automated now - but an example for scikits is missing in
> the example package :)
>

I was just enumerating what I think would be useful to attract high
quality contributors. I'm aware that scipy already has a lot of these
features (which is nice).
But it would be even nicer to have a really low entry barrier and a
framework that guides you to write good (and documented) code with
extensive unit tests, just like the big web frameworks (Django, RoR, ...).
It has to be a win-win situation for both the community and the developer.
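
To make the "documented code with unit tests" point concrete, here is a minimal sketch of what one small module in such a framework might contain. Everything in it is an assumption for illustration: `moving_average` and the test names are hypothetical and are not part of scipy or scikits.example. The docstring layout follows the NumPy docstring convention, and the tests rely only on the nose convention of collecting plain functions whose names start with `test_`.

```python
def moving_average(values, window):
    """Return the simple moving average of a sequence.

    Parameters
    ----------
    values : sequence of float
        Input samples.
    window : int
        Number of consecutive samples averaged together; must be >= 1.

    Returns
    -------
    list of float
        ``len(values) - window + 1`` averages (empty if the input is
        shorter than the window).

    Examples
    --------
    >>> moving_average([1.0, 2.0, 3.0, 4.0], 2)
    [1.5, 2.5, 3.5]
    """
    if window < 1:
        raise ValueError("window must be >= 1")
    return [sum(values[i:i + window]) / float(window)
            for i in range(len(values) - window + 1)]


# nose-style tests: plain functions whose names start with ``test_``.
def test_moving_average_values():
    assert moving_average([1.0, 2.0, 3.0, 4.0], 2) == [1.5, 2.5, 3.5]


def test_moving_average_short_input():
    # An input shorter than the window yields no averages at all.
    assert moving_average([1.0], 3) == []


def test_moving_average_bad_window():
    # A non-positive window is rejected explicitly.
    try:
        moving_average([1.0, 2.0], 0)
    except ValueError:
        pass
    else:
        raise AssertionError("expected ValueError")
```

Keeping the function, its docstring example and its tests side by side is one possible way to let a newcomer see the expected conventions in a single file; a real scikit would more likely put the tests in a separate `tests` directory.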
> cheers,
>
> David

From d_l_goldsmith at yahoo.com Tue Aug 4 13:17:31 2009
From: d_l_goldsmith at yahoo.com (David Goldsmith)
Date: Tue, 4 Aug 2009 10:17:31 -0700 (PDT)
Subject: [SciPy-dev] Anyone up for a Skypecon?
Message-ID: <907186.73991.qm@web52103.mail.re2.yahoo.com>

Well, for the past two weeks, Summer Marathon Skypecon participation has
been nil. Jack Liddle just Skyped me asking me about this week - which I
presume means he's interested and available, in principle at least - and
I said "regular time, tomorrow (Wed.) 19:00 UTC," but if it would
increase participation to hold it at a different time, just let me know...

DG

From d_l_goldsmith at yahoo.com Tue Aug 4 14:53:20 2009
From: d_l_goldsmith at yahoo.com (David Goldsmith)
Date: Tue, 4 Aug 2009 11:53:20 -0700 (PDT)
Subject: [SciPy-dev] SciPy Foundation
In-Reply-To:
Message-ID: <291714.68720.qm@web52104.mail.re2.yahoo.com>

At this point I think the question becomes: do we let the (clear) fact
that there is not a single set of priorities for where SciPy should be
headed (which I do not see as a bad thing at this stage) get in the way
of the community moving on *some* proposal (e.g., Joe's, with mods) for
*some* "not-for-profit entity" (e.g., a "SciPy Foundation," the original
topic of this thread) that will function as an institutional resource
for furthering whichever priorities for SciPy should bubble to the
surface? In other words, this thread is diverging (into territory
necessary to discuss, yes), but can we at least agree (a semi-rhetorical
question because I think the answer is clearly "yes") that something
along the lines of a "SciPy Foundation" would be useful, certainly for
helping us move SciPy where we want it to go, but perhaps also for
helping us decide where as well?
DG

From robert.kern at gmail.com Tue Aug 4 15:37:01 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Tue, 4 Aug 2009 14:37:01 -0500
Subject: [SciPy-dev] SciPy Foundation
In-Reply-To: <291714.68720.qm@web52104.mail.re2.yahoo.com>
References: <291714.68720.qm@web52104.mail.re2.yahoo.com>
Message-ID: <3d375d730908041237g35c415f8vf1506087f12197ba@mail.gmail.com>

On Tue, Aug 4, 2009 at 13:53, David Goldsmith wrote:
>
> At this point I think the question becomes: do we let the (clear) fact
> that there is not a single set of priorities for where SciPy should be
> headed (which I do not see as a bad thing at this stage) get in the way
> of the community moving on *some* proposal (e.g., Joe's, with mods) for
> *some* "not-for-profit entity" (e.g., a "SciPy Foundation," the original
> topic of this thread) that will function as an institutional resource
> for furthering whichever priorities for SciPy should bubble to the
> surface? In other words, this thread is diverging (into territory
> necessary to discuss, yes), but can we at least agree (a semi-rhetorical
> question because I think the answer is clearly "yes") that something
> along the lines of a "SciPy Foundation" would be useful, certainly for
> helping us move SciPy where we want it to go, but perhaps also for
> helping us decide where as well?

Perhaps a new name would be in order. I think a lot of the
disagreement in vision arises from the fact that a number of the very
good ideas about how to encourage the use of Python in the sciences,
which could be implemented by the people involved in
SciPy-the-project, are being conflated with scipy-the-package.
Things like IDEs and GUIs and applications do not fit into
scipy-the-package as it currently exists, and changing
scipy-the-package such that they do fit in deteriorates what
scipy-the-package is good at now. Personally, I see scipy-the-package
as something very close in spirit to what GSL is to C: a library of
quality numerical algorithms useful to science and engineering.
scipy-the-package is not everything that is required to advance
Python's use in the sciences. It can't be. A single Python package is
the wrong technology for delivering all of that functionality.

I think we need to step back and question the question itself. Perhaps
we should not be asking "where should scipy(-the-package) be heading?"
but "what do we need to do to advance Python's use in the sciences?" I
don't think a Foundation helps the former much, but I do think the
latter would be an excellent mission for one. scipy-the-package is a
component of what the Foundation might work on, but I think it would
make a huge mistake if it fixated on scipy-the-package and assumed
that all of the work it does needs to be jammed into
scipy-the-package.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
  -- Umberto Eco

From gael.varoquaux at normalesup.org Tue Aug 4 15:41:00 2009
From: gael.varoquaux at normalesup.org (Gael Varoquaux)
Date: Tue, 4 Aug 2009 21:41:00 +0200
Subject: [SciPy-dev] SciPy Foundation
In-Reply-To: <3d375d730908041237g35c415f8vf1506087f12197ba@mail.gmail.com>
References: <291714.68720.qm@web52104.mail.re2.yahoo.com> <3d375d730908041237g35c415f8vf1506087f12197ba@mail.gmail.com>
Message-ID: <20090804194100.GD11772@phare.normalesup.org>

I fully agree with your analysis, Robert.

I had this discussion with Eric, and he did mention that it would be
useful if the name was reminiscent of 'SciPy', because it is a highly
visible name.

Should we have a BOF on that at the SciPy conference? Mailing list
discussions tend to go in a circle.

From robert.kern at gmail.com Tue Aug 4 15:45:32 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Tue, 4 Aug 2009 14:45:32 -0500
Subject: [SciPy-dev] SciPy Foundation
In-Reply-To: <20090804194100.GD11772@phare.normalesup.org>
References: <291714.68720.qm@web52104.mail.re2.yahoo.com> <3d375d730908041237g35c415f8vf1506087f12197ba@mail.gmail.com> <20090804194100.GD11772@phare.normalesup.org>
Message-ID: <3d375d730908041245w39ef56f2y5d394098b5e5932e@mail.gmail.com>

On Tue, Aug 4, 2009 at 14:41, Gael Varoquaux wrote:
> I fully agree with your analysis, Robert.
>
> I had this discussion with Eric, and he did mention that it would be
> useful if the name was reminiscent of 'SciPy', because it is a highly
> visible name.
>
> Should we have a BOF on that at the SciPy conference? Mailing list
> discussions tend to go in a circle.

We could get a bikeshed, some paint, and some brushes. Everyone who
wants to contribute an idea must paint it on the bikeshed.

I like it. Anyways, it could probably even be called the SciPy
Foundation as long as the introductory material was very explicit
about its relationship to scipy-the-package and the founding members
use language carefully. Tricky, but doable.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
  -- Umberto Eco

From robert.kern at gmail.com Tue Aug 4 15:48:10 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Tue, 4 Aug 2009 14:48:10 -0500
Subject: [SciPy-dev] SciPy Foundation
In-Reply-To: <3d375d730908041245w39ef56f2y5d394098b5e5932e@mail.gmail.com>
References: <291714.68720.qm@web52104.mail.re2.yahoo.com> <3d375d730908041237g35c415f8vf1506087f12197ba@mail.gmail.com> <20090804194100.GD11772@phare.normalesup.org> <3d375d730908041245w39ef56f2y5d394098b5e5932e@mail.gmail.com>
Message-ID: <3d375d730908041248y1008c6a0y9bf904abe3c9cd8e@mail.gmail.com>

On Tue, Aug 4, 2009 at 14:45, Robert Kern wrote:
> We could get a bikeshed, some paint, and some brushes. Everyone who
> wants to contribute an idea must paint it on the bikeshed.

In their preferred color, of course.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
  -- Umberto Eco

From ondrej at certik.cz Tue Aug 4 17:28:52 2009
From: ondrej at certik.cz (Ondrej Certik)
Date: Tue, 4 Aug 2009 15:28:52 -0600
Subject: [SciPy-dev] SciPy Foundation
In-Reply-To: <3d375d730908041248y1008c6a0y9bf904abe3c9cd8e@mail.gmail.com>
References: <291714.68720.qm@web52104.mail.re2.yahoo.com> <3d375d730908041237g35c415f8vf1506087f12197ba@mail.gmail.com> <20090804194100.GD11772@phare.normalesup.org> <3d375d730908041245w39ef56f2y5d394098b5e5932e@mail.gmail.com> <3d375d730908041248y1008c6a0y9bf904abe3c9cd8e@mail.gmail.com>
Message-ID: <85b5c3130908041428k428cd249y97313b768b32ff94@mail.gmail.com>

On Tue, Aug 4, 2009 at 1:48 PM, Robert Kern wrote:
> On Tue, Aug 4, 2009 at 14:45, Robert Kern wrote:
>> We could get a bikeshed, some paint, and some brushes. Everyone who
>> wants to contribute an idea must paint it on the bikeshed.
>
> In their preferred color, of course.

Maybe everyone should bring a bike too.

Ondrej

From charlesr.harris at gmail.com Tue Aug 4 17:47:21 2009
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Tue, 4 Aug 2009 15:47:21 -0600
Subject: [SciPy-dev] SciPy Foundation
In-Reply-To: <85b5c3130908041428k428cd249y97313b768b32ff94@mail.gmail.com>
References: <291714.68720.qm@web52104.mail.re2.yahoo.com> <3d375d730908041237g35c415f8vf1506087f12197ba@mail.gmail.com> <20090804194100.GD11772@phare.normalesup.org> <3d375d730908041245w39ef56f2y5d394098b5e5932e@mail.gmail.com> <3d375d730908041248y1008c6a0y9bf904abe3c9cd8e@mail.gmail.com> <85b5c3130908041428k428cd249y97313b768b32ff94@mail.gmail.com>
Message-ID:

On Tue, Aug 4, 2009 at 3:28 PM, Ondrej Certik wrote:
> On Tue, Aug 4, 2009 at 1:48 PM, Robert Kern wrote:
> > On Tue, Aug 4, 2009 at 14:45, Robert Kern wrote:
> >> We could get a bikeshed, some paint, and some brushes. Everyone who
> >> wants to contribute an idea must paint it on the bikeshed.
> >
> > In their preferred color, of course.
>
> Maybe everyone should bring a bike too.
>

It would be nice if the hotels would offer rental bikes. As is, the bike
stores are far enough away that getting the bike, and dropping it off on
departure, is too much hassle.

Chuck

From gael.varoquaux at normalesup.org Tue Aug 4 17:49:41 2009
From: gael.varoquaux at normalesup.org (Gael Varoquaux)
Date: Tue, 4 Aug 2009 23:49:41 +0200
Subject: [SciPy-dev] SciPy Foundation
In-Reply-To:
References: <291714.68720.qm@web52104.mail.re2.yahoo.com> <3d375d730908041237g35c415f8vf1506087f12197ba@mail.gmail.com> <20090804194100.GD11772@phare.normalesup.org> <3d375d730908041245w39ef56f2y5d394098b5e5932e@mail.gmail.com> <3d375d730908041248y1008c6a0y9bf904abe3c9cd8e@mail.gmail.com> <85b5c3130908041428k428cd249y97313b768b32ff94@mail.gmail.com>
Message-ID: <20090804214941.GA23662@phare.normalesup.org>

On Tue, Aug 04, 2009 at 03:47:21PM -0600, Charles R Harris wrote:
> On Tue, Aug 4, 2009 at 3:28 PM, Ondrej Certik wrote:
> > On Tue, Aug 4, 2009 at 1:48 PM, Robert Kern wrote:
> > > On Tue, Aug 4, 2009 at 14:45, Robert Kern wrote:
> > >> We could get a bikeshed, some paint, and some brushes. Everyone who
> > >> wants to contribute an idea must paint it on the bikeshed.
> > > In their preferred color, of course.
> > Maybe everyone should bring a bike too.
> It would be nice if the hotels would offer rental bikes. As is, the bike
> stores are far enough away that getting the bike, and dropping it off on
> departure, is too much hassle.

The good news is that you'll find a bikeshed to shelter your bike when
you get there.

Actually, I suggest some people bring bikesheds too. As I am unsure of
my favorite color.

Gaël

From d_l_goldsmith at yahoo.com Wed Aug 5 00:52:24 2009
From: d_l_goldsmith at yahoo.com (David Goldsmith)
Date: Tue, 4 Aug 2009 21:52:24 -0700 (PDT)
Subject: [SciPy-dev] Trying to finish off the docstring for i0
Message-ID: <645082.77656.qm@web52109.mail.re2.yahoo.com>

Hi!
Because i0 (modified Bessel func., first kind, order zero) is not your typical "run-of-the-mill" transcendental function, I was hoping to include a little info in its Notes section about the algorithm we use, convergence, etc. I found the code in function_base.py; long story short, it led me to:

http://kobesearch.cpan.org/htdocs/Math-Cephe/Math/Cephes.html#i_i0_i_Modified_Bessel_function_of_o

which looks like what we're using, correct?

DG

From stefan at sun.ac.za Wed Aug 5 01:11:32 2009
From: stefan at sun.ac.za (Stéfan van der Walt)
Date: Wed, 5 Aug 2009 00:11:32 -0500
Subject: [SciPy-dev] Anyone up for a Skypecon?
In-Reply-To: <907186.73991.qm@web52103.mail.re2.yahoo.com>
References: <907186.73991.qm@web52103.mail.re2.yahoo.com>
Message-ID: <9457e7c80908042211n634c8063w60d45c0eb2c06e51@mail.gmail.com>

Hi David

Apologies from my side, but I am currently traveling in your beautiful country. Looking forward to seeing all of you at SciPy'09!

Happy documenting! (1000 words for a high quality cotton T-shirt (TM) -- bargain!).

Cheers
Stéfan

2009/8/4 David Goldsmith :
>
> Well, past two weeks, Summer Marathon Skypecon participation has been nil. Jack Liddle just Skyped me asking me about this week - which I presume means he's interested and available, in principle at least - and I said "regular time, tomorrow (Wed.) 19:00 UTC," but if it would increase participation to hold it at a different time, just let me know...
>
> DG
>
> _______________________________________________
> Scipy-dev mailing list
> Scipy-dev at scipy.org
> http://mail.scipy.org/mailman/listinfo/scipy-dev
>

From d_l_goldsmith at yahoo.com Wed Aug 5 13:33:48 2009
From: d_l_goldsmith at yahoo.com (David Goldsmith)
Date: Wed, 5 Aug 2009 10:33:48 -0700 (PDT)
Subject: [SciPy-dev] Two Marathon questions
Message-ID: <940468.47590.qm@web52104.mail.re2.yahoo.com>

0) Are there any Category Leaders who do *not* want help finishing their categories?
1) Is there anyone "in-the-know" who feels that reading "numpy-docs/reference/routines.ma.rst" (as it is now) is *insufficient* preparation for assisting w/ the Masked Array docstrings; or, put another way, feels that if one is not at least "well-practiced" using masked arrays, then one should not touch their docstrings?

But for the masked array categories and the uncategorized docstrings, we're pretty close!

DG

From d_l_goldsmith at yahoo.com Wed Aug 5 14:03:43 2009
From: d_l_goldsmith at yahoo.com (David Goldsmith)
Date: Wed, 5 Aug 2009 11:03:43 -0700 (PDT)
Subject: [SciPy-dev] Hour 'til Skypecon.
Message-ID: <560636.82353.qm@web52105.mail.re2.yahoo.com>

From emmanuelle.gouillart at normalesup.org Wed Aug 5 14:43:42 2009
From: emmanuelle.gouillart at normalesup.org (Emmanuelle Gouillart)
Date: Wed, 5 Aug 2009 20:43:42 +0200
Subject: [SciPy-dev] On scipy/numpy documentation, and executing code in docstrings
Message-ID: <20090805184342.GA7664@phare.normalesup.org>

Hello list,

disclaimer: I don't have much hindsight about what I'm talking about in the following, so I apologize if it doesn't make much sense...

My question is: is there some way (apart from copy-and-paste :D) to execute some of the code inside docstrings, in order e.g. to generate pylab plots? This may be a useful feature for the documentation of scipy: indeed, a plot may speak for itself better than long explanations about the output of scipy's algorithms. Some docstrings already include calls to plotting commands (one example: http://docs.scipy.org/scipy/docs/scipy.stats.distributions.poisson/), but of course, the plots are not created while viewing the help.

Maybe it's a stupid idea, but I'm thinking about a %demo magic function in Ipython that would print the docstring of an object and execute the code of the docstring (preferably in a separate namespace) and, in particular, display pylab's windows.

Does such a feature already exist somewhere? If not, do you see any interest in coding it? Matplotlib and Mayavi2 call special demo functions and it would be possible to do the same for scipy, but on the other side, using directly the docstrings for demos might be just as well...

Regards,
Emmanuelle

From d_l_goldsmith at yahoo.com Wed Aug 5 14:48:42 2009
From: d_l_goldsmith at yahoo.com (David Goldsmith)
Date: Wed, 5 Aug 2009 11:48:42 -0700 (PDT)
Subject: [SciPy-dev] numpy.broadcast
Message-ID: <819183.15509.qm@web52106.mail.re2.yahoo.com>

I guess I don't really understand this too well - is the below correct behavior, and if so, why?

>>> x = np.array([1, 2, 3])
>>> y = np.array([[4], [5], [6]])
>>> b = np.broadcast(x, y)
>>> b.nd # returns what I'd expect
2
>>> b = np.broadcast(x, y, x, y)
>>> b.nd # doesn't return what I'd expect
2
>>> del b # maybe problem is that I have to "clear" b first?
>>> # or maybe it's that all args have to be different?
...
>>> b = np.broadcast(x, y, x * y)
>>> b.nd
2
>>> z = x * y # grasping at straws now
>>> z
array([[ 4,  8, 12],
       [ 5, 10, 15],
       [ 6, 12, 18]])
>>> x
array([1, 2, 3])
>>> y
array([[4],
       [5],
       [6]])
>>> del b # trying everything to get intuitive behavior
>>> b = np.broadcast(x, y, z)
>>> b.nd
2

Huhn?

DG

From robert.kern at gmail.com Wed Aug 5 14:51:25 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Wed, 5 Aug 2009 13:51:25 -0500
Subject: [SciPy-dev] numpy.broadcast
In-Reply-To: <819183.15509.qm@web52106.mail.re2.yahoo.com>
References: <819183.15509.qm@web52106.mail.re2.yahoo.com>
Message-ID: <3d375d730908051151g7b8a0d95t23f2a16309e31986@mail.gmail.com>

On Wed, Aug 5, 2009 at 13:48, David Goldsmith wrote:
> I guess I don't really understand this too well - is the below correct behavior, and if so, why?
>
>>>> x = np.array([1, 2, 3])
>>>> y = np.array([[4], [5], [6]])
>>>> b = np.broadcast(x, y)
>>>> b.nd # returns what I'd expect
> 2
>>>> b = np.broadcast(x, y, x, y)
>>>> b.nd # doesn't return what I'd expect
> 2

Why don't you expect this? It's the correct answer. (x*y*x*y).shape == (3,3).
>>>> del b # maybe problem is that I have to "clear" b first?
>>>> # or maybe it's that all args have to be different?
> ...
>>>> b = np.broadcast(x, y, x * y)
>>>> b.nd
> 2
>>>> z = x * y # grasping at straws now
>>>> z
> array([[ 4,  8, 12],
>        [ 5, 10, 15],
>        [ 6, 12, 18]])

Yup. z.shape == (3,3). Broadcasting that with a (3,1) array or a (3,) array still gives a (3,3) array.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
  -- Umberto Eco

From pgmdevlist at gmail.com Wed Aug 5 14:59:49 2009
From: pgmdevlist at gmail.com (Pierre GM)
Date: Wed, 5 Aug 2009 14:59:49 -0400
Subject: [SciPy-dev] Two Marathon questions
In-Reply-To: <940468.47590.qm@web52104.mail.re2.yahoo.com>
References: <940468.47590.qm@web52104.mail.re2.yahoo.com>
Message-ID: <71D66DD2-F618-47E7-8F61-C2E37677EBA2@gmail.com>

On Aug 5, 2009, at 1:33 PM, David Goldsmith wrote:

> 0) Are there any Category Leaders who do *not* want help finishing
> their categories?
>
> 1) Is there anyone "in-the-know" who feels that reading "numpy-docs/
> reference/routines.ma.rst" (as it is now) is *insufficient*
> preparation for assisting w/ the Masked Array docstrings; or, put
> another way, feels that if one is not at least "well-practiced"
> using masked arrays, then one should not touch their docstrings?

Mmh. Can't really tell. Experience with masked arrays is certainly a prerequisite, but one shouldn't need to be an expert. Nevertheless, I'd be obliged if anybody willing to edit the MA docstrings could contact me beforehand w/ questions.
From d_l_goldsmith at yahoo.com Wed Aug 5 15:19:53 2009
From: d_l_goldsmith at yahoo.com (David Goldsmith)
Date: Wed, 5 Aug 2009 12:19:53 -0700 (PDT)
Subject: [SciPy-dev] Skypecon aborted due to host's technical difficulties
Message-ID: <658720.89931.qm@web52112.mail.re2.yahoo.com>

Oooops: main computer stolen last week; thought replacement (my previous computer, borrowed back from my son) had built in mike, but doesn't appear to; only Ralf called in anyway, so we've opted to reschedule (or just cancel, picking-up again next week.) If we have it at a different time this week, can more people "make it"? If so, when's good for people?

DG

From ralf.gommers at googlemail.com Wed Aug 5 15:28:38 2009
From: ralf.gommers at googlemail.com (Ralf Gommers)
Date: Wed, 5 Aug 2009 15:28:38 -0400
Subject: [SciPy-dev] Two Marathon questions
In-Reply-To: <71D66DD2-F618-47E7-8F61-C2E37677EBA2@gmail.com>
References: <940468.47590.qm@web52104.mail.re2.yahoo.com> <71D66DD2-F618-47E7-8F61-C2E37677EBA2@gmail.com>
Message-ID:

On Wed, Aug 5, 2009 at 2:59 PM, Pierre GM wrote:

>
> On Aug 5, 2009, at 1:33 PM, David Goldsmith wrote:
>
> > 0) Are there any Category Leaders who do *not* want help finishing
> > their categories?
> >
> > 1) Is there anyone "in-the-know" who feels that reading "numpy-docs/
> > reference/routines.ma.rst" (as it is now) is *insufficient*
> > preparation for assisting w/ the Masked Array docstrings; or, put
> > another way, feels that if one is not at least "well-practiced"
> > using masked arrays, then one should not touch their docstrings?
>
> Mmh. Can't really tell. Experience with masked arrays is certainly a
> prerequisite, but one shouldn't need to be an expert.
> Nevertheless, I'd be obliged if anybody willing to edit the MA
> docstrings could contact me beforehand w/ questions.
>

Pierre, I did a lot of the MA docstrings over the past few days. I would appreciate it if you could look at some of those (see http://docs.scipy.org/numpy/changes/ for recent changes) and let me know if there's anything that you think should be done differently.

I also moved a lot of the private functions/classes to Unimportant status, but I did fix the markup for a lot of them and added some Parameters sections etc where I thought it was helpful. Again, please let me know if you have any recommendations.

Cheers,
Ralf

From d_l_goldsmith at yahoo.com Wed Aug 5 15:38:19 2009
From: d_l_goldsmith at yahoo.com (David Goldsmith)
Date: Wed, 5 Aug 2009 12:38:19 -0700 (PDT)
Subject: [SciPy-dev] numpy.broadcast
In-Reply-To: <3d375d730908051151g7b8a0d95t23f2a16309e31986@mail.gmail.com>
Message-ID: <120029.98483.qm@web52107.mail.re2.yahoo.com>

--- On Wed, 8/5/09, Robert Kern wrote:

> wrote:
> > I guess I don't really understand this too well - is the below correct behavior, and if so, why?
> >
> >>>> b = np.broadcast(x, y, x, y)
> >>>> b.nd # doesn't return what I'd expect
> > 2

Why isn't that 4?

> Why don't you expect this? It's the correct answer.
> (x*y*x*y).shape == (3,3).
>
> >>>> del b # maybe problem is that I have to "clear" b first?
> >>>> # or maybe it's that all args have to be different?
> > ...
> >>>> b = np.broadcast(x, y, x * y)
> >>>> b.nd
> > 2

Why isn't that 3?

If x0, ..., xN are the arguments to `broadcast` and D = max(x0.nd, ..., xN.nd), is broadcast.nd necessarily <= D? If so, then I think I'm on the road to understanding.
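
The question above can be checked directly. What follows is only a minimal sketch, assuming a current NumPy, where an array's number of dimensions is spelled `ndim` rather than `nd`. It compares `np.broadcast(...).nd` with the largest `ndim` among the inputs for the three calls tried in this thread.

```python
import numpy as np

x = np.array([1, 2, 3])        # shape (3,),   ndim 1
y = np.array([[4], [5], [6]])  # shape (3, 1), ndim 2
z = x * y                      # shape (3, 3), ndim 2

# The three argument lists tried earlier in the thread.
cases = [(x, y), (x, y, x, y), (x, y, z)]

# For each call: (number of inputs, nd of the broadcast, largest input ndim).
results = [
    (len(args), np.broadcast(*args).nd, max(a.ndim for a in args))
    for args in cases
]

for n_inputs, nd, largest in results:
    print(n_inputs, "inputs: nd =", nd, ", max ndim =", largest)
```

In this sketch the number of inputs never enters into `nd`; only the shapes of the inputs do.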
DG

From robert.kern at gmail.com Wed Aug 5 15:41:51 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Wed, 5 Aug 2009 14:41:51 -0500
Subject: [SciPy-dev] numpy.broadcast
In-Reply-To: <120029.98483.qm@web52107.mail.re2.yahoo.com>
References: <3d375d730908051151g7b8a0d95t23f2a16309e31986@mail.gmail.com> <120029.98483.qm@web52107.mail.re2.yahoo.com>
Message-ID: <3d375d730908051241g122ed99bn5c0b86c74690adfb@mail.gmail.com>

On Wed, Aug 5, 2009 at 14:38, David Goldsmith wrote:
> --- On Wed, 8/5/09, Robert Kern wrote:
>
>> wrote:
>> > I guess I don't really understand this too well - is the below correct behavior, and if so, why?
>> >
>> >>>> b = np.broadcast(x, y, x, y)
>> >>>> b.nd # doesn't return what I'd expect
>> > 2
>
> Why isn't that 4?

Why would it be 4?

>> Why don't you expect this? It's the correct answer.
>> (x*y*x*y).shape == (3,3).

This is the example you need to pay attention to.

>> >>>> del b # maybe problem is that I have to "clear" b first?
>> >>>> # or maybe it's that all args have to be different?
>> > ...
>> >>>> b = np.broadcast(x, y, x * y)
>> >>>> b.nd
>> > 2
>
> Why isn't that 3?
>
> If x0, ..., xN are the arguments to `broadcast` and D = max(x0.nd, ..., xN.nd), is broadcast.nd necessarily <= D? If so, then I think I'm on the road to understanding.

It is necessarily == D. Broadcasting is associative. The (x*y*z).shape == (x*(y*z)).shape == ((x*y)*z).shape.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
  -- Umberto Eco

From d_l_goldsmith at yahoo.com Wed Aug 5 15:46:27 2009
From: d_l_goldsmith at yahoo.com (David Goldsmith)
Date: Wed, 5 Aug 2009 12:46:27 -0700 (PDT)
Subject: [SciPy-dev] Two Marathon questions
In-Reply-To: <71D66DD2-F618-47E7-8F61-C2E37677EBA2@gmail.com>
Message-ID: <829204.30757.qm@web52105.mail.re2.yahoo.com>

Thanks for your reply, Pierre.

--- On Wed, 8/5/09, Pierre GM wrote:

> On Aug 5, 2009, at 1:33 PM, David Goldsmith wrote:
>
> > 1) Is there anyone "in-the-know" who feels that reading "numpy-docs/
> > reference/routines.ma.rst" (as it is now) is *insufficient*
> > preparation for assisting w/ the Masked Array docstrings; or, put
> > another way, feels that if one is not at least "well-practiced"
> > using masked arrays, then one should not touch their docstrings?
>
> Mmh. Can't really tell. Experience with masked arrays is certainly a
> prerequisite, but one shouldn't need to be an expert.
> Nevertheless, I'd be obliged if anybody willing to edit the MA
> docstrings could contact me beforehand w/ questions.

Well, for example, I have experience *masking* arrays in IDL, but not in Numpy, and not using an object-oriented model. Nevertheless, I'd be happy to ask you q's...

I'll finish the uncategorized docstrings I feel comfortable with (e.g., *not* distutils), and then help w/ any remaining MA's. Others, of course, do as thou willst, modulo Pierre's request.

Thanks again!

DG

From d_l_goldsmith at yahoo.com Wed Aug 5 16:01:23 2009
From: d_l_goldsmith at yahoo.com (David Goldsmith)
Date: Wed, 5 Aug 2009 13:01:23 -0700 (PDT)
Subject: [SciPy-dev] numpy.broadcast
In-Reply-To: <3d375d730908051241g122ed99bn5c0b86c74690adfb@mail.gmail.com>
Message-ID: <391244.80171.qm@web52111.mail.re2.yahoo.com>

--- On Wed, 8/5/09, Robert Kern wrote:

> It is necessarily == D. Broadcasting is associative.

Ah, that's the key I didn't understand! That helps (me) a lot; I'm going to make a "Note" of it in numpy.broadcast's docstring.

DG

> The (x*y*z).shape == (x*(y*z)).shape == ((x*y)*z).shape.

From pav+sp at iki.fi Wed Aug 5 16:32:13 2009
From: pav+sp at iki.fi (Pauli Virtanen)
Date: Wed, 5 Aug 2009 20:32:13 +0000 (UTC)
Subject: [SciPy-dev] On scipy/numpy documentation, and executing code in docstrings
References: <20090805184342.GA7664@phare.normalesup.org>
Message-ID:

On 2009-08-05, Emmanuelle Gouillart wrote:
> Hello list,
>
> disclaimer: I don't have much hindsight about what I'm talking
> about in the following, so I apologize if it doesn't make much sense...
>
> My question is: is there some way (apart from copy-and-paste :D)
> to execute some of the code inside docstrings, in order e.g. to generate
> pylab plots? This may be a useful feature for the documentation of scipy:
> indeed, a plot may speak for itself better than long explanations about
> the output of scipy's algorithms. Some docstrings already include calls
> to plotting commands (one example:
> http://docs.scipy.org/scipy/docs/scipy.stats.distributions.poisson/),
> but of course, the plots are not created while viewing the help.

The plots do appear in the final documentation, cf.

http://docs.scipy.org/doc/numpy/reference/generated/numpy.random.gamma.html

(The scipy.stats.distributions examples are not actually executable Python code, being more pseudo-codeish. This could and probably should be fixed, though.)

It's not really feasible to have them appear in the doc editor -- there's no reliable & easy way to sandbox Python code, and I'm not comfortable with having a way to run potentially untrusted code on the servers.

One thing I'm not very happy with in the Sphinx output is that copy & paste of the examples is quite difficult, since you get the >>> and ... prompts. This could be avoided with suitable HTML magick.

> Maybe it's a stupid idea, but I'm thinking about a %demo magic
> function in Ipython that would print the docstring of an object and
> execute the code of the docstring (preferably in a separate namespace)
> and, in particular, display pylab's windows.
>
> Does such a feature already exist somewhere? If not, do you see
> any interest in coding it? Matplotlib and Mayavi2 call special demo
> functions and it would be possible to do the same for scipy, but on the
> other side, using directly the docstrings for demos might be just as
> well...

I think such a demo function could be easy to implement: just pick up the doctest lines and run them. I think an IPython extension could easily be written for this: just check what's in the ipy_*.py files under IPython/Extensions and adapt one of them.

There's a ready-made implementation of the doctest pickup in plot_directive.py under numpy/doc/sphinxext.

--
Pauli Virtanen

From d_l_goldsmith at yahoo.com Wed Aug 5 16:40:01 2009
From: d_l_goldsmith at yahoo.com (David Goldsmith)
Date: Wed, 5 Aug 2009 13:40:01 -0700 (PDT)
Subject: [SciPy-dev] numpy.broadcast
In-Reply-To: <391244.80171.qm@web52111.mail.re2.yahoo.com>
Message-ID: <49739.57430.qm@web52105.mail.re2.yahoo.com>

--- On Wed, 8/5/09, David Goldsmith wrote:

> > It is necessarily == D. Broadcasting is associative.
> > The (x*y*z).shape == (x*(y*z)).shape == ((x*y)*z).shape.

Um:

>>> x = np.array((1, 2, 3))
>>> y = np.array([[4], [5], [6]])
>>> z = x * y
>>> x; y; z
array([1, 2, 3])
array([[4],
       [5],
       [6]])
array([[ 4,  8, 12],
       [ 5, 10, 15],
       [ 6, 12, 18]])
>>> B = np.broadcast
>>> X = B(x, y, z)
>>> Y = B(x, B(y, z))
>>> Z = B(B(x, y), z)
>>> X.numiter, Y.numiter, Z.numiter
(3, 2, 2)
>>> X.nd, Y.nd, Z.nd
(2, 1, 2)
>>> X.shape, Y.shape, Z.shape
((3, 3), (3,), (3, 3))

Am I doing something wrong?

DG

From emmanuelle.gouillart at normalesup.org Wed Aug 5 16:49:41 2009
From: emmanuelle.gouillart at normalesup.org (Emmanuelle Gouillart)
Date: Wed, 5 Aug 2009 22:49:41 +0200
Subject: [SciPy-dev] On scipy/numpy documentation, and executing code in docstrings
In-Reply-To:
References: <20090805184342.GA7664@phare.normalesup.org>
Message-ID: <20090805204941.GA25169@phare.normalesup.org>

Hi Pauli,

thanks for your answer!

> The plots do appear in the final documentation, cf.
> http://docs.scipy.org/doc/numpy/reference/generated/numpy.random.gamma.html

True! I hadn't noticed as I usually use only Ipython's help, but the html pages look really nice with the plots.

> It's not really feasible to have them appear in the doc editor --
> there's no reliable & easy way to sandbox Python code, and I'm
> not comfortable with having a way to run potentially untrusted
> code on the servers.

Sure, I wasn't talking about having them appear in the doc editor. We can afford doing some copy & paste while using the doc editor, I think...

> One thing that I'm not very happy with the Sphinx output is that
> copy & paste of the examples is quite difficult, since you get
> the >>> and ... prompts. This could be avoided with suitable HTML
> magick.

Maybe the wonderful %doctest_mode magic command of Ipython should be advertised somewhere... I use it all the time since I've discovered the feature, it's awfully convenient :D.

> I think such a demo function could be easy to implement: just
> pick up the doctest lines and run them. I think a IPython
> extension could easily be written for this: just check what's in
> the ipy_*.py files under IPython/Extensions and adapt one of
> them.

> There's a ready-made implementation of the doctest pickup in
> plot_directive.py under numpy/doc/sphinxext.

That's exactly the kind of hints I was looking for, many thanks! I'll have a look at the files you mention to see how it could be done.

Cheers,
Emmanuelle

From robert.kern at gmail.com Wed Aug 5 17:01:32 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Wed, 5 Aug 2009 16:01:32 -0500
Subject: [SciPy-dev] numpy.broadcast
In-Reply-To: <49739.57430.qm@web52105.mail.re2.yahoo.com>
References: <391244.80171.qm@web52111.mail.re2.yahoo.com> <49739.57430.qm@web52105.mail.re2.yahoo.com>
Message-ID: <3d375d730908051401t28ecc411jabc2ba93edd91887@mail.gmail.com>

On Wed, Aug 5, 2009 at 15:40, David Goldsmith wrote:
> --- On Wed, 8/5/09, David Goldsmith wrote:
>
>> > It is necessarily == D. Broadcasting is associative.
>> > The (x*y*z).shape == (x*(y*z)).shape == ((x*y)*z).shape.
>
> Um:
>
>>>> x = np.array((1, 2, 3))
>>>> y = np.array([[4], [5], [6]])
>>>> z = x * y
>>>> x; y; z
> array([1, 2, 3])
> array([[4],
>        [5],
>        [6]])
> array([[ 4,  8, 12],
>        [ 5, 10, 15],
>        [ 6, 12, 18]])
>>>> B = np.broadcast
>>>> X = B(x, y, z)
>>>> Y = B(x, B(y, z))
>>>> Z = B(B(x, y), z)
>>>> X.numiter, Y.numiter, Z.numiter
> (3, 2, 2)
>>>> X.nd, Y.nd, Z.nd
> (2, 1, 2)
>>>> X.shape, Y.shape, Z.shape
> ((3, 3), (3,), (3, 3))
>
> Am I doing something wrong?

You are passing a broadcast iterator as an argument to broadcast. The broadcast iterator iterates over the inputs in-parallel. This is entirely different from actually operating on the inputs.
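
To make that distinction concrete, here is a small sketch, assuming a current NumPy; `np.broadcast_arrays` is used only for comparison and is not part of the original message. Iterating over the broadcast object yields one tuple of input elements per position of the broadcast shape and computes nothing from them.

```python
import numpy as np

x = np.array([1, 2, 3])
y = np.array([[4], [5], [6]])

b = np.broadcast(x, y)
shape, size = b.shape, b.size

# Iterating yields one (x_element, y_element) tuple per output position.
pairs = [(int(u), int(v)) for u, v in b]

# The same pairs, built from explicitly broadcast views of the inputs.
bx, by = np.broadcast_arrays(x, y)
expected = [(int(u), int(v)) for u, v in zip(bx.flat, by.flat)]

print(shape, size)
print(pairs[:4])
```

The pairs are the raw inputs lined up against each other; any arithmetic (such as the `u + v` or `u * v` of an elementwise operation) is left to whoever consumes the iterator.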

In [1]: x = array([1, 2, 3])

In [2]: y = array([[4], [5], [6]])

In [3]: z = x * y

In [4]: x*y*z
Out[4]:
array([[ 16,  64, 144],
       [ 25, 100, 225],
       [ 36, 144, 324]])

In [5]: (x*y)*z
Out[5]:
array([[ 16,  64, 144],
       [ 25, 100, 225],
       [ 36, 144, 324]])

In [6]: x*(y*z)
Out[6]:
array([[ 16,  64, 144],
       [ 25, 100, 225],
       [ 36, 144, 324]])

In [8]: array(list(broadcast(x,y)))
Out[8]:
array([[1, 4],
       [2, 4],
       [3, 4],
       [1, 5],
       [2, 5],
       [3, 5],
       [1, 6],
       [2, 6],
       [3, 6]])

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
  -- Umberto Eco

From d_l_goldsmith at yahoo.com Wed Aug 5 18:15:43 2009
From: d_l_goldsmith at yahoo.com (David Goldsmith)
Date: Wed, 5 Aug 2009 15:15:43 -0700 (PDT)
Subject: [SciPy-dev] numpy.broadcast
In-Reply-To: <3d375d730908051401t28ecc411jabc2ba93edd91887@mail.gmail.com>
Message-ID: <668591.72009.qm@web52106.mail.re2.yahoo.com>

Sorry to be "dense", but can someone please show me how to exhibit associativity with the broadcast object - it's that (not broadcasting as a process) I'm trying to understand (and illustrate).

DG

From robert.kern at gmail.com Wed Aug 5 18:17:23 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Wed, 5 Aug 2009 17:17:23 -0500
Subject: [SciPy-dev] numpy.broadcast
In-Reply-To: <668591.72009.qm@web52106.mail.re2.yahoo.com>
References: <3d375d730908051401t28ecc411jabc2ba93edd91887@mail.gmail.com> <668591.72009.qm@web52106.mail.re2.yahoo.com>
Message-ID: <3d375d730908051517k3b746ea7ra81a775690fa4528@mail.gmail.com>

On Wed, Aug 5, 2009 at 17:15, David Goldsmith wrote:
> Sorry to be "dense", but can someone please show me how to exhibit associativity with the broadcast object - it's that (not broadcasting as a process) I'm trying to understand (and illustrate).

The broadcast *object* doesn't associate. *Broadcasting* associates.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
  -- Umberto Eco

From d_l_goldsmith at yahoo.com Wed Aug 5 18:23:51 2009
From: d_l_goldsmith at yahoo.com (David Goldsmith)
Date: Wed, 5 Aug 2009 15:23:51 -0700 (PDT)
Subject: [SciPy-dev] numpy.broadcast
In-Reply-To: <3d375d730908051517k3b746ea7ra81a775690fa4528@mail.gmail.com>
Message-ID: <879886.78095.qm@web52110.mail.re2.yahoo.com>

--- On Wed, 8/5/09, Robert Kern wrote:

> The broadcast *object* doesn't associate. *Broadcasting*
> associates.

I understand that; I guess what I don't understand is the relationship between the object and the process.

DG

From fperez.net at gmail.com Wed Aug 5 18:38:05 2009
From: fperez.net at gmail.com (Fernando Perez)
Date: Wed, 5 Aug 2009 15:38:05 -0700
Subject: [SciPy-dev] On scipy/numpy documentation, and executing code in docstrings
In-Reply-To: <20090805204941.GA25169@phare.normalesup.org>
References: <20090805184342.GA7664@phare.normalesup.org> <20090805204941.GA25169@phare.normalesup.org>
Message-ID:

On Wed, Aug 5, 2009 at 1:49 PM, Emmanuelle Gouillart wrote:
>> One thing that I'm not very happy with the Sphinx output is that
>> copy & paste of the examples is quite difficult, since you get
>> the >>> and ... prompts. This could be avoided with suitable HTML
>> magick.
>
> Maybe the wonderful %doctest_mode magic command of Ipython should be
> advertised somewhere... I use it all the time since I've discovered the
> feature, it's awfully convenient :D.

Thanks for answering for me :)

Regarding the advertising, I'll be the first to admit we could do better. But in our defense, it's at least documented:

http://ipython.scipy.org/doc/stable/html/interactive/tutorial.html?highlight=doctest_mode
http://ipython.scipy.org/doc/stable/html/overview.html?highlight=doctest_mode
http://ipython.scipy.org/doc/stable/html/development/overview.html?highlight=doctest_mode
http://ipython.scipy.org/doc/stable/html/api/generated/IPython.Magic.html?highlight=doctest_mode#IPython.Magic.Magic.magic_doctest_mode

Maybe ipython should open a twitter account with a tweet-tip (tweetip?) of the day everyday :)

>> I think such a demo function could be easy to implement: just
>> pick up the doctest lines and run them. I think a IPython
>> extension could easily be written for this: just check what's in
>> the ipy_*.py files under IPython/Extensions and adapt one of
>> them.
>
>> There's a ready-made implementation of the doctest pickup in
>> plot_directive.py under numpy/doc/sphinxext.
>
> That's exactly the kind of hints I was looking for, many thanks! I'll
> have a look at the files you mention to see how it could be done.

And when you finish, send it our way. Operators are standing by to take your patch and your credit card number... :)

Take care,

f

From robert.kern at gmail.com Wed Aug 5 18:40:07 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Wed, 5 Aug 2009 17:40:07 -0500
Subject: [SciPy-dev] numpy.broadcast
In-Reply-To: <879886.78095.qm@web52110.mail.re2.yahoo.com>
References: <3d375d730908051517k3b746ea7ra81a775690fa4528@mail.gmail.com> <879886.78095.qm@web52110.mail.re2.yahoo.com>
Message-ID: <3d375d730908051540t5decee87w6f17eccc23a69ae0@mail.gmail.com>

On Wed, Aug 5, 2009 at 17:23, David Goldsmith wrote:
> --- On Wed, 8/5/09, Robert Kern wrote:
>
>> The broadcast *object* doesn't associate. *Broadcasting*
>> associates.
>
> I understand that; I guess what I don't understand is the relationship between the object and the process.

The broadcast object is an iterator. It has attributes .nd, .shape, and .size which tell you the number of dimensions, the shape, and the number of elements of what the broadcasted forms of each of the input arrays would be. For N input arrays, iterating over the broadcast object would yield you .size N-tuples with the elements from each input array just as if you were to form the broadcasted arrays for each of the inputs and then iterate over zip(broadcasted_x.flat, broadcasted_y.flat, ...).

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
-- Umberto Eco From robert.kern at gmail.com Wed Aug 5 20:24:19 2009 From: robert.kern at gmail.com (Robert Kern) Date: Wed, 5 Aug 2009 19:24:19 -0500 Subject: [SciPy-dev] On scipy/numpy documentation, and executing code in docstrings In-Reply-To: References: <20090805184342.GA7664@phare.normalesup.org> <20090805204941.GA25169@phare.normalesup.org> Message-ID: <3d375d730908051724n34bc785dwb8dbce35ca0d46e2@mail.gmail.com> On Wed, Aug 5, 2009 at 17:38, Fernando Perez wrote: > On Wed, Aug 5, 2009 at 1:49 PM, Emmanuelle > Gouillart wrote: >>> I think such a demo function could be easy to implement: just >>> pick up the doctest lines and run them. I think a IPython >>> extension could easily be written for this: just check what's in >>> the ipy_*.py files under IPython/Extensions and adapt one of >>> them. >> >>> There's a ready-made implementation of the doctest pickup in >>> plot_directive.py under numpy/doc/sphinxext. >> >> That's exactly the kind of hints I was looking for, many thanks! I'll >> have a look at the files you mention to see how it could be done. > > And when you finish, send it our way. ?Operators are standing by to > take your patch and your credit card number... :) Something like this? In [1]: %run_examples np.broadcast Produce an object that mimics broadcasting. Parameters ---------- in1, in2, ... : array_like Input parameters. Returns ------- b : broadcast object Broadcast the input parameters against one another, and return an object that encapsulates the result. Amongst others, it has ``shape`` and ``nd`` properties, and may be used as an iterator. Examples -------- Manually adding two vectors, using broadcasting: >>> x = np.array([[1], [2], [3]]) Press to quit, to execute... >>> y = np.array([4, 5, 6]) Press to quit, to execute... >>> b = np.broadcast(x, y) Press to quit, to execute... >>> out = np.empty(b.shape) Press to quit, to execute... >>> out.flat = [u+v for (u,v) in b] Press to quit, to execute... 
>>> out array([[ 5., 6., 7.], [ 6., 7., 8.], [ 7., 8., 9.]]) Press to quit, to execute... output: array([[ 5., 6., 7.], [ 6., 7., 8.], [ 7., 8., 9.]]) Compare against built-in broadcasting: >>> x + y array([[5, 6, 7], [6, 7, 8], [7, 8, 9]]) Press to quit, to execute... output: array([[5, 6, 7], [6, 7, 8], [7, 8, 9]]) END OF DEMO Use .reset() if you want to rerun it. It uses IPython.demo. It's not especially pretty, but it's serviceable. Code is at the end of this file: http://www.enthought.com/~rkern/cgi-bin/hgwebdir.cgi/kernmagic/file/c18f492e9688/kernmagic/mymagics.py -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From fperez.net at gmail.com Wed Aug 5 20:35:54 2009 From: fperez.net at gmail.com (Fernando Perez) Date: Wed, 5 Aug 2009 17:35:54 -0700 Subject: [SciPy-dev] On scipy/numpy documentation, and executing code in docstrings In-Reply-To: <3d375d730908051724n34bc785dwb8dbce35ca0d46e2@mail.gmail.com> References: <20090805184342.GA7664@phare.normalesup.org> <20090805204941.GA25169@phare.normalesup.org> <3d375d730908051724n34bc785dwb8dbce35ca0d46e2@mail.gmail.com> Message-ID: On Wed, Aug 5, 2009 at 5:24 PM, Robert Kern wrote: >> And when you finish, send it our way. Operators are standing by to >> take your patch and your credit card number... :) > > Something like this? [...] # Robert, as usual, rocks. > It uses IPython.demo. It's not especially pretty, but it's serviceable. Yes, with Tom's recent demo improvements, this is precisely what I had in mind. > Code is at the end of this file: > > http://www.enthought.com/~rkern/cgi-bin/hgwebdir.cgi/kernmagic/file/c18f492e9688/kernmagic/mymagics.py Fantastic, many thanks. We can now properly track it: https://bugs.launchpad.net/ipython/+bug/409633 Hopefully once we land Brian's refactoring, we can start merging features again :) Now, that credit card number...
f From scott.sinclair.za at gmail.com Thu Aug 6 02:19:50 2009 From: scott.sinclair.za at gmail.com (Scott Sinclair) Date: Thu, 6 Aug 2009 08:19:50 +0200 Subject: [SciPy-dev] Two Marathon questions In-Reply-To: <71D66DD2-F618-47E7-8F61-C2E37677EBA2@gmail.com> References: <940468.47590.qm@web52104.mail.re2.yahoo.com> <71D66DD2-F618-47E7-8F61-C2E37677EBA2@gmail.com> Message-ID: <6a17e9ee0908052319s2794095bq2dbcf670670dd1de@mail.gmail.com> > 2009/8/5 Pierre GM : > > On Aug 5, 2009, at 1:33 PM, David Goldsmith wrote: > >> 1) Is there anyone "in-the-know" who feels that reading "numpy-docs/ >> reference/routines.ma.rst" (as it is now) is *insufficient* >> preparation for assisting w/ the Masked Array docstrings; or, put >> another way, feels that if one is not at least "well-practiced" >> using masked arrays, then one should not touch their docstrings? > > Mmh. Can't really tell. Experience with masked arrays is certainly a > prerequisite, but one shouldn't need to be an expert. > Nevertheless, I'd be obliged if anybody willing to edit the MA > docstrings could contact me beforehand w/ questions. The best place to get an overview of masked arrays is Pierre's introduction here http://docs.scipy.org/doc/numpy/reference/maskedarray.html It's a little hard to locate in the doc-wiki, but can be found here http://docs.scipy.org/numpy/docs/numpy-docs/reference/maskedarray.rst Ralf (or others) feel free to work on the docstrings in Milestone "Operations On Masks" if you finish the other stuff and I still haven't gotten much further. You obviously have much more enthusiasm/time :) Something worth paying attention to, is making sure that the MaskedArray method docstrings and the equivalent functions in the ma module refer to each other and spell out any differences in behaviour. 
Compare http://docs.scipy.org/numpy/docs/numpy.ma.core.MaskedArray.set_fill_value/ http://docs.scipy.org/numpy/docs/numpy.ma.core.set_fill_value/ Cheers, Scott From ralf.gommers at googlemail.com Thu Aug 6 11:37:57 2009 From: ralf.gommers at googlemail.com (Ralf Gommers) Date: Thu, 6 Aug 2009 11:37:57 -0400 Subject: [SciPy-dev] Two Marathon questions In-Reply-To: <6a17e9ee0908052319s2794095bq2dbcf670670dd1de@mail.gmail.com> References: <940468.47590.qm@web52104.mail.re2.yahoo.com> <71D66DD2-F618-47E7-8F61-C2E37677EBA2@gmail.com> <6a17e9ee0908052319s2794095bq2dbcf670670dd1de@mail.gmail.com> Message-ID: On Thu, Aug 6, 2009 at 2:19 AM, Scott Sinclair wrote: > > 2009/8/5 Pierre GM : > > > > On Aug 5, 2009, at 1:33 PM, David Goldsmith wrote: > > > >> 1) Is there anyone "in-the-know" who feels that reading "numpy-docs/ > >> reference/routines.ma.rst" (as it is now) is *insufficient* > >> preparation for assisting w/ the Masked Array docstrings; or, put > >> another way, feels that if one is not at least "well-practiced" > >> using masked arrays, then one should not touch their docstrings? > > > > Mmh. Can't really tell. Experience with masked arrays is certainly a > > prerequisite, but one shouldn't need to be an expert. > > Nevertheless, I'd be obliged if anybody willing to edit the MA > > docstrings could contact me beforehand w/ questions. > > The best place to get an overview of masked arrays is Pierre's introduction > here > > http://docs.scipy.org/doc/numpy/reference/maskedarray.html > > It's a little hard to locate in the doc-wiki, but can be found here > > http://docs.scipy.org/numpy/docs/numpy-docs/reference/maskedarray.rst > > Ralf (or others) feel free to work on the docstrings in Milestone > "Operations On Masks" if you finish the other stuff and I still > haven't gotten much further. You obviously have much more > enthusiasm/time :) > more enthusiasm than time. luckily time is something you can make ... 
> > Something worth paying attention to, is making sure that the > MaskedArray method docstrings and the equivalent functions in the ma > module refer to each other and spell out any differences in behaviour. > Compare > > http://docs.scipy.org/numpy/docs/numpy.ma.core.MaskedArray.set_fill_value/ > http://docs.scipy.org/numpy/docs/numpy.ma.core.set_fill_value/ good catch. i'll fix that and check the other ones i did. cheers, ralf > > > Cheers, > Scott > _______________________________________________ > Scipy-dev mailing list > Scipy-dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > -------------- next part -------------- An HTML attachment was scrubbed... URL: From pgmdevlist at gmail.com Thu Aug 6 16:34:03 2009 From: pgmdevlist at gmail.com (Pierre GM) Date: Thu, 6 Aug 2009 16:34:03 -0400 Subject: [SciPy-dev] stats.distributions.poisson loc parameter : is it wise ? Message-ID: All, Consider the poisson distribution in stats.distributions: it requires a mandatory argument, `mu`, as the mean/variance of the distribution. All is fine, but the `loc` parameter is still available, and that's my problem. When `loc` is not 0, the mean becomes `mu+loc`, `.cdf(range(loc))==0`, but the variance stays `mu`. That's a bit confusing. I thought I could use `loc` as a way to control truncation, but that doesn't seem to work either: emulating zero-truncation by using `loc=1` gives a distribution with a mean `mu+1` when is should be `mu/ (1-exp(-mu))` (the exact expression for zero-truncation). In short, I don't really see any advantage in having a location parameter for the Poisson distribution. AAMOF, for any discrete distribution. I suggest we would implement some mechanism to force loc to 0 while outputting a warning. Any comment ? P. From vanforeest at gmail.com Thu Aug 6 17:21:40 2009 From: vanforeest at gmail.com (nicky van foreest) Date: Thu, 6 Aug 2009 23:21:40 +0200 Subject: [SciPy-dev] stats.distributions.poisson loc parameter : is it wise ? 
In-Reply-To: References: Message-ID: Hi, I agree. Anything that makes the behavior of the distribution functions more intuitive is helpful, at least to me. BTW, I find the term loc already by itself very confusing---what does it actually mean? For instance, >>> Help on gamma_gen in module scipy.stats.distributions object ... | cdf(self, x, *args, **kwds) | Cumulative distribution function at x of the given RV. | | Parameters | ---------- | x : array-like | quantiles | arg1, arg2, arg3,... : array-like | The shape parameter(s) for the distribution (see docstring of the | instance object for more information) | loc : array-like, optional | location parameter (default=0) | scale : array-like, optional | scale parameter (default=1) I am inclined to characterize the gamma distbution by means of n (number of stages if one is used to the Erlang distribution) and the rate parameter lambda, say, and I am clueless as to the meaning of scale and location here. Actually, I am not alone in this: see for instance: http://www.johndcook.com/blog/2009/07/20/probability-distributions-scipy/ Of course, this is not to say that I am not happy with the distribution package. It makes me a happier man every day :-) Nicky 2009/8/6 Pierre GM : > All, > Consider the poisson distribution in stats.distributions: it requires > a mandatory argument, `mu`, as the mean/variance of the distribution. > All is fine, but the `loc` parameter is still available, and that's my > problem. When `loc` is not 0, the mean becomes `mu+loc`, > `.cdf(range(loc))==0`, but the variance stays `mu`. That's a bit > confusing. > I thought I could use `loc` as a way to control truncation, but that > doesn't seem to work either: emulating zero-truncation by using > `loc=1` gives a distribution with a mean `mu+1` when is should be `mu/ > (1-exp(-mu))` (the exact expression for zero-truncation). > In short, I don't really see any advantage in having a location > parameter for the Poisson distribution. 
AAMOF, for any discrete > distribution. I suggest we would implement some mechanism to force loc > to 0 while outputting a warning. > Any comment ? > P. > > _______________________________________________ > Scipy-dev mailing list > Scipy-dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > From robert.kern at gmail.com Thu Aug 6 17:34:08 2009 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 6 Aug 2009 16:34:08 -0500 Subject: [SciPy-dev] stats.distributions.poisson loc parameter : is it wise ? In-Reply-To: References: Message-ID: <3d375d730908061434j21d72c8dk36fb7d72cb8a9092@mail.gmail.com> On Thu, Aug 6, 2009 at 16:21, nicky van foreest wrote: > Hi, > > I agree. Anything that makes the behavior of the distribution > functions more intuitive is helpful, at least to me. > > BTW, I find the term loc already by itself very confusing---what does > it actually mean? For instance, >>>> Help on gamma_gen in module scipy.stats.distributions object > ... > > | cdf(self, x, *args, **kwds) > | Cumulative distribution function at x of the given RV. > | > | Parameters > | ---------- > | x : array-like > | quantiles > | arg1, arg2, arg3,... : array-like > | The shape parameter(s) for the distribution (see docstring of the > | instance object for more information) > | loc : array-like, optional > | location parameter (default=0) > | scale : array-like, optional > | scale parameter (default=1) > > I am inclined to characterize the gamma distbution by means of n > (number of stages if one is used to the Erlang distribution) and the > rate parameter lambda, say, and I am clueless as to the meaning of > scale and location here. Every probability distribution can be generalized to accept a location and scale parameter even if their standard treatments do not. pdf(x; loc,scale) -> pdf((x-loc)/scale)/scale The other related functions transform in easily derivable ways.
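The transformation above can be checked numerically. The following is a minimal sketch, assuming a current `scipy.stats`; the shape, loc and scale values are arbitrary illustrations. The last lines relate it to the shape-and-rate view mentioned in the quoted question, on the assumption that the rate lambda corresponds to `scale = 1/lambda`.

```python
import numpy as np
from scipy import stats

a, loc, scale = 2.0, 0.5, 3.0       # arbitrary shape, location, scale
x = np.linspace(1.0, 20.0, 50)

# pdf(x; loc, scale) == pdf((x - loc) / scale) / scale
lhs = stats.gamma.pdf(x, a, loc=loc, scale=scale)
rhs = stats.gamma.pdf((x - loc) / scale, a) / scale
pdf_matches = np.allclose(lhs, rhs)

# The cdf transforms the same way, without the 1/scale factor.
cdf_matches = np.allclose(
    stats.gamma.cdf(x, a, loc=loc, scale=scale),
    stats.gamma.cdf((x - loc) / scale, a),
)

# Shape n with rate lam, written as shape n with scale 1/lam:
# the mean is then n / lam.
n, lam = 3, 2.0
erlang_mean = stats.gamma.mean(n, scale=1.0 / lam)
```

Leaving `loc` at 0 and `scale` at 1 gives back the standard form, so the extra parameters cost nothing when they are not needed.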
This is covered at the top of the document scipy/stats/continuous.lyx in the source distribution. The reason we do this is partly generality and mostly convenience of implementation; all of the distributions can share the shifting and scaling code instead of implementing it separately. I once floated the idea of removing this for the distributions whose standard definitions do not include such parameters, specifically gamma. However, there was an objection from someone who apparently has used a "shifted gamma" distribution to model sunspot radii where loc>0, if I remember correctly, so I dropped my proposal. If you don't need to use them, don't. If you want to prevent confusion, help port the LyX documentation into the main documentation. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From pgmdevlist at gmail.com Thu Aug 6 17:43:12 2009 From: pgmdevlist at gmail.com (Pierre GM) Date: Thu, 6 Aug 2009 17:43:12 -0400 Subject: [SciPy-dev] stats.distributions.poisson loc parameter : is it wise ? In-Reply-To: <3d375d730908061434j21d72c8dk36fb7d72cb8a9092@mail.gmail.com> References: <3d375d730908061434j21d72c8dk36fb7d72cb8a9092@mail.gmail.com> Message-ID: On Aug 6, 2009, at 5:34 PM, Robert Kern wrote: > > Every probability distribution can be generalized to accept a location > and scale parameter even if their standard treatments do not. > > pdf(x; loc,scale) -> pdf((x-loc)/scale)/scale Agreed, as long as we are talking about *continuous* distributions. The behavior is quite different for *discrete* distributions. 
Even if the scale is simply discarded already, using a location will probably NOT give the expected result From robert.kern at gmail.com Thu Aug 6 17:49:22 2009 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 6 Aug 2009 16:49:22 -0500 Subject: [SciPy-dev] stats.distributions.poisson loc parameter : is it wise ? In-Reply-To: References: <3d375d730908061434j21d72c8dk36fb7d72cb8a9092@mail.gmail.com> Message-ID: <3d375d730908061449s6fae1a57p3f416dc41910f833@mail.gmail.com> On Thu, Aug 6, 2009 at 16:43, Pierre GM wrote: > > On Aug 6, 2009, at 5:34 PM, Robert Kern wrote: >> >> Every probability distribution can be generalized to accept a location >> and scale parameter even if their standard treatments do not. >> >> pdf(x; loc,scale) -> pdf((x-loc)/scale)/scale > > Agreed, as long as we are talking about *continuous* distributions. > The behavior is quite different for *discrete* distributions. Even if > the scale is simply discarded already, using a location will probably > NOT give the expected result It depends on what your expectations are. For the discrete distributions, all the loc parameter means is this, as documented: pmf(x; loc) -> pmf(x-loc) That's it. I don't know why you would expect anything else. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From pgmdevlist at gmail.com Thu Aug 6 18:02:57 2009 From: pgmdevlist at gmail.com (Pierre GM) Date: Thu, 6 Aug 2009 18:02:57 -0400 Subject: [SciPy-dev] stats.distributions.poisson loc parameter : is it wise ?
In-Reply-To: <3d375d730908061449s6fae1a57p3f416dc41910f833@mail.gmail.com> References: <3d375d730908061434j21d72c8dk36fb7d72cb8a9092@mail.gmail.com> <3d375d730908061449s6fae1a57p3f416dc41910f833@mail.gmail.com> Message-ID: <9D0FD21F-CE10-499A-971B-BDA3C0F1CEFE@gmail.com> On Aug 6, 2009, at 5:49 PM, Robert Kern wrote: > On Thu, Aug 6, 2009 at 16:43, Pierre GM wrote: >> Even if >> the scale is simply discarded already, using a location will probably >> NOT give the expected result > > It depends on what your expectations are. For the discrete > distributions, all the loc parameter means is this, as documented: > > pmf(x; loc) -> pmf(x-loc) > > That's it. I don't know why you would expect anything else. Because using a location parameter, you change the support domain. Back to the example of a Poisson distribution with loc=1, the support domain is now x>=1, which amounts to truncating the zeroes. The mean of a zero-truncated Poisson with parameter pr should be pr/(1-exp(- pr)), but we end up with pr+1. Not the expected result. I think it's a source of confusion to keep a location parameter for discrete distributions. it'd be worth to implement method to allow truncation, but just a loc parameter doesn't do it. From robert.kern at gmail.com Thu Aug 6 18:11:04 2009 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 6 Aug 2009 17:11:04 -0500 Subject: [SciPy-dev] stats.distributions.poisson loc parameter : is it wise ? 
In-Reply-To: <9D0FD21F-CE10-499A-971B-BDA3C0F1CEFE@gmail.com> References: <3d375d730908061434j21d72c8dk36fb7d72cb8a9092@mail.gmail.com> <3d375d730908061449s6fae1a57p3f416dc41910f833@mail.gmail.com> <9D0FD21F-CE10-499A-971B-BDA3C0F1CEFE@gmail.com> Message-ID: <3d375d730908061511r666d1646p825e07dcffa8043a@mail.gmail.com> On Thu, Aug 6, 2009 at 17:02, Pierre GM wrote: > > On Aug 6, 2009, at 5:49 PM, Robert Kern wrote: > >> On Thu, Aug 6, 2009 at 16:43, Pierre GM wrote: >>> Even if >>> the scale is simply discarded already, using a location will probably >>> NOT give the expected result >> >> It depends on what your expectations are. For the discrete >> distributions, all the loc parameter means is this, as documented: >> >> pmf(x; loc) -> pmf(x-loc) >> >> That's it. I don't know why you would expect anything else. > > Because using a location parameter, you change the support domain. > Back to the example of a Poisson distribution with loc=1, the support > domain is now x>=1, which amounts to truncating the zeroes. I don't understand why you go through all of these contortions. It does not amount to truncation at all. It just shifts the distribution. > The mean > of a zero-truncated Poisson with parameter pr should be pr/(1-exp(- > pr)), but we end up with pr+1. Not the expected result. Because you are expecting that the operation is equivalent to something that it is not. pmf(x; loc) -> pmf(x-loc) Nothing more.
It is definitely *not* the same thing as setting all x References: <3d375d730908061434j21d72c8dk36fb7d72cb8a9092@mail.gmail.com> <3d375d730908061449s6fae1a57p3f416dc41910f833@mail.gmail.com> <9D0FD21F-CE10-499A-971B-BDA3C0F1CEFE@gmail.com> Message-ID: <1cd32cbb0908061516g3cbab79ax2798b322a3c2fdf@mail.gmail.com> On Thu, Aug 6, 2009 at 6:02 PM, Pierre GM wrote: > > On Aug 6, 2009, at 5:49 PM, Robert Kern wrote: > >> On Thu, Aug 6, 2009 at 16:43, Pierre GM wrote: >>> Even if >>> the scale is simply discarded already, using a location will probably >>> NOT give the expected result >> >> It depends on what your expectations are. For the discrete >> distributions, all the loc parameter means is this, as documented: >> >> pmf(x; loc) -> pmf(x-loc) >> >> That's it. I don't know why you would expect anything else. > > Because using a location parameter, you change the support domain. > Back to the example of a Poisson distribution with loc=1, the support > domain is now x>=1, which amounts to truncating the zeroes. The mean > of a zero-truncated Poisson with parameter pr should be pr/(1-exp(- > pr)), but we end up with pr+1. Not the expected result. > I think it's a source of confusion to keep a location parameter for > discrete distributions. it'd be worth to implement method to allow > truncation, but just a loc parameter doesn't do it. > > > _______________________________________________ > Scipy-dev mailing list > Scipy-dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > loc just shifts the distribution on the real/integer line. Except for the fit method (which doesn't exist for discrete distributions), I don't see any real disadvantage to having loc in there as an option, but I guess in many cases it won't be very useful either. I think there are also discrete distributions with unbounded support (+/- inf) for which a loc shift would make sense.
The big advantage of the current setup, as Robert said, is consistency, both in the implementation and in code that goes over all (or a large set of) distribution(s). But for a long time, I have been all in favor of "fixing" the fit method, and possibly introducing a semi-frozen distribution class, but for this I don't see why we should special-case location. Fixing loc is the main use case, but for example estimation with the scale parameter fixed is also a common use case. Josef From robert.kern at gmail.com Thu Aug 6 18:21:49 2009 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 6 Aug 2009 17:21:49 -0500 Subject: [SciPy-dev] stats.distributions.poisson loc parameter : is it wise ? In-Reply-To: <9D0FD21F-CE10-499A-971B-BDA3C0F1CEFE@gmail.com> References: <3d375d730908061434j21d72c8dk36fb7d72cb8a9092@mail.gmail.com> <3d375d730908061449s6fae1a57p3f416dc41910f833@mail.gmail.com> <9D0FD21F-CE10-499A-971B-BDA3C0F1CEFE@gmail.com> Message-ID: <3d375d730908061521s166051f2h6e1e7579b8018bfe@mail.gmail.com> On Thu, Aug 6, 2009 at 17:02, Pierre GM wrote: > > On Aug 6, 2009, at 5:49 PM, Robert Kern wrote: > >> On Thu, Aug 6, 2009 at 16:43, Pierre GM wrote: >>> Even if >>> the scale is simply discarded already, using a location will probably >>> NOT give the expected result >> >> It depends on what your expectations are. For the discrete >> distributions, all the loc parameter means is this, as documented: >> >> pmf(x; loc) -> pmf(x-loc) >> >> That's it. I don't know why you would expect anything else. > > Because using a location parameter, you change the support domain. It should be noted that the location parameter changes the support domain *as a consequence* of the above transformation. Changing the support domain (and holding everything else fixed) is not the defining characteristic of the location parameter.
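The difference between shifting and truncating can be made concrete with a small sketch. It assumes a current `scipy.stats`; the value of `mu` is arbitrary, and the zero-truncated distribution is built by hand here because the thread does not name a ready-made one.

```python
import numpy as np
from scipy import stats

mu = 2.5
k = np.arange(0, 60)   # wide enough that the ignored tail mass is negligible

# loc=1 only shifts: pmf(k; loc=1) == pmf(k - 1)
shifted = stats.poisson.pmf(k, mu, loc=1)
is_pure_shift = np.allclose(shifted, stats.poisson.pmf(k - 1, mu))
mean_shifted = stats.poisson.mean(mu, loc=1)   # mu + 1
var_shifted = stats.poisson.var(mu, loc=1)     # still mu

# Zero-truncation is a different operation: drop k = 0 and renormalise.
truncated = stats.poisson.pmf(k, mu)
truncated[0] = 0.0
truncated /= truncated.sum()
mean_truncated = (k * truncated).sum()         # mu / (1 - exp(-mu))
```

Both distributions have support starting at 1, but only the truncated one has the mean `mu / (1 - exp(-mu))`; the shifted one keeps the Poisson shape and moves it by one.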
-- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From robert.kern at gmail.com Thu Aug 6 18:23:12 2009 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 6 Aug 2009 17:23:12 -0500 Subject: [SciPy-dev] stats.distributions.poisson loc parameter : is it wise ? In-Reply-To: <1cd32cbb0908061516g3cbab79ax2798b322a3c2fdf@mail.gmail.com> References: <3d375d730908061434j21d72c8dk36fb7d72cb8a9092@mail.gmail.com> <3d375d730908061449s6fae1a57p3f416dc41910f833@mail.gmail.com> <9D0FD21F-CE10-499A-971B-BDA3C0F1CEFE@gmail.com> <1cd32cbb0908061516g3cbab79ax2798b322a3c2fdf@mail.gmail.com> Message-ID: <3d375d730908061523m712e9470pf0c577e1efa2ed56@mail.gmail.com> On Thu, Aug 6, 2009 at 17:16, wrote: > But for a long time, I have been all in favor of "fixing" the fit > method, I don't think anyone's *against* fixing the fit method. No one's found the time or motivation to actually do it, though. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From pgmdevlist at gmail.com Thu Aug 6 18:37:07 2009 From: pgmdevlist at gmail.com (Pierre GM) Date: Thu, 6 Aug 2009 18:37:07 -0400 Subject: [SciPy-dev] stats.distributions.poisson loc parameter : is it wise ? In-Reply-To: <3d375d730908061521s166051f2h6e1e7579b8018bfe@mail.gmail.com> References: <3d375d730908061434j21d72c8dk36fb7d72cb8a9092@mail.gmail.com> <3d375d730908061449s6fae1a57p3f416dc41910f833@mail.gmail.com> <9D0FD21F-CE10-499A-971B-BDA3C0F1CEFE@gmail.com> <3d375d730908061521s166051f2h6e1e7579b8018bfe@mail.gmail.com> Message-ID: On Aug 6, 2009, at 6:21 PM, Robert Kern wrote: > > It should be noted that the location parameter changes the support > domain *as a consequence* of the above transformation. 
Changing the > support domain (and holding everything else fixed) is not the defining > characteristic of the location parameter. Got the point. I'll make a mental note to mention that in the docs. I'm switching to "meh" mode: I still think that allowing for the shift can lead to some troubles on the user, and I'd be in favor to modify _fix_loc_scale or something like that to force loc=0 on discrete distributions with support on positive integers, but I'll certainly not lose any sleep other that... In any case, thx a lot to y'all for your comments. From josef.pktd at gmail.com Thu Aug 6 20:17:47 2009 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 6 Aug 2009 20:17:47 -0400 Subject: [SciPy-dev] stats.distributions.poisson loc parameter : is it wise ? In-Reply-To: References: <3d375d730908061434j21d72c8dk36fb7d72cb8a9092@mail.gmail.com> <3d375d730908061449s6fae1a57p3f416dc41910f833@mail.gmail.com> <9D0FD21F-CE10-499A-971B-BDA3C0F1CEFE@gmail.com> <3d375d730908061521s166051f2h6e1e7579b8018bfe@mail.gmail.com> Message-ID: <1cd32cbb0908061717i37deee4x542e21cd146c2691@mail.gmail.com> On Thu, Aug 6, 2009 at 6:37 PM, Pierre GM wrote: > > On Aug 6, 2009, at 6:21 PM, Robert Kern wrote: >> >> It should be noted that the location parameter changes the support >> domain *as a consequence* of the above transformation. Changing the >> support domain (and holding everything else fixed) is not the defining >> characteristic of the location parameter. > > Got the point. I'll make a mental note to mention that in the docs. > > I'm switching to "meh" mode: I still think that allowing for the shift > can lead to some troubles on the user, and I'd be in favor to modify > _fix_loc_scale or something like that to force loc=0 on discrete > distributions with support on positive integers, but I'll certainly > not lose any sleep other that... > In any case, thx a lot to y'all for your comments. 
> I agree that loc for distribution with a finite upper or lower support bound is confusing, at least at the beginning. It took me a while to figure out why I get some strange results with some distributions when I ran a fit over all of them until I realized that the support is shifted when loc is estimated. But I think this is mostly a documentation problem. (I still have an unresolved problem with vonmises which doesn't define its support points, but I don't know anything at all about circular distributions.) Below is a prototype for a semi-frozen class, essentially an adapted version of the frozen class, that fixes only the location loc. (copy and paste errors still possible) However, this doesn't do anything different than the current implementation if you ignore the loc keyword. It also has the same uninformative signature which could be improved. The only real advantage I see is when the fit method is adjusted to take some of the parameters as fixed. Josef

import numpy as np
from scipy import stats

class rv_frozenloc(object):
    # Wraps a distribution and pins `loc`; shape and scale stay free.
    # Any `loc` keyword passed to a method is overridden by the fixed value.

    def __init__(self, dist, loc=0):
        self.loc = loc
        self.dist = dist

    def pdf(self, x, *args, **kwds):
        kwds.update({'loc': self.loc})
        return self.dist.pdf(x, *args, **kwds)

    def cdf(self, x, *args, **kwds):
        kwds.update({'loc': self.loc})
        return self.dist.cdf(x, *args, **kwds)

    def ppf(self, q, *args, **kwds):
        kwds.update({'loc': self.loc})
        return self.dist.ppf(q, *args, **kwds)

    def isf(self, q, *args, **kwds):
        kwds.update({'loc': self.loc})
        return self.dist.isf(q, *args, **kwds)

    def rvs(self, size=None, *args, **kwds):
        kwds.update({'loc': self.loc, 'size': size})
        return self.dist.rvs(*args, **kwds)

    def sf(self, x, *args, **kwds):
        kwds.update({'loc': self.loc})
        return self.dist.sf(x, *args, **kwds)

    def stats(self, moments='mv', *args, **kwds):
        kwds.update({'loc': self.loc, 'moments': moments})
        return self.dist.stats(*args, **kwds)

    def moment(self, n, *args, **kwds):
        kwds.update({'loc': self.loc})
        return self.dist.moment(n, *args, **kwds)

    def entropy(self, *args, **kwds):
        kwds.update({'loc': self.loc})
        return self.dist.entropy(*args, **kwds)

    def pmf(self, k, *args, **kwds):
        kwds.update({'loc': self.loc})
        return self.dist.pmf(k, *args, **kwds)

def freezeloc(dist, loc=0):
    return rv_frozenloc(dist, loc=loc)

poiss = freezeloc(stats.poisson)
print(poiss.pmf(np.arange(10), 5))
print(poiss.cdf(np.arange(10), 5))
print(poiss.cdf(np.arange(10), 5, loc=5))  # this ignores loc but doesn't raise warning (yet)
print(stats.poisson.cdf(np.arange(10), 5, loc=5))
poiss5 = freezeloc(stats.poisson, loc=5)
print(poiss5.cdf(np.arange(10), 5))

norm0 = freezeloc(stats.norm, loc=1)
print(norm0.stats())
norm0.stats(loc=0)  # loc is ignored but doesn't raise warning (yet)
print(norm0.stats(scale=np.sqrt(2)))

From david at ar.media.kyoto-u.ac.jp Fri Aug 7 09:13:41 2009 From: david at ar.media.kyoto-u.ac.jp (David Cournapeau) Date: Fri, 07 Aug 2009 22:13:41 +0900 Subject: [SciPy-dev] Last numpy required for scipy Message-ID: <4A7C2885.1090308@ar.media.kyoto-u.ac.jp> Hi, While fixing the neighborhood iterators, I once again broke the trunk API - so for the latest scipy trunk (r5893), you need the latest numpy (r7297). Hopefully, this will be the last time this is needed, sorry for the inconvenience, cheers, David From d_l_goldsmith at yahoo.com Sat Aug 8 02:42:58 2009 From: d_l_goldsmith at yahoo.com (David Goldsmith) Date: Fri, 7 Aug 2009 23:42:58 -0700 (PDT) Subject: [SciPy-dev] Where is numpy.generic defined? Message-ID: <843554.18671.qm@web52105.mail.re2.yahoo.com> From d_l_goldsmith at yahoo.com Mon Aug 10 15:45:51 2009 From: d_l_goldsmith at yahoo.com (David Goldsmith) Date: Mon, 10 Aug 2009 12:45:51 -0700 (PDT) Subject: [SciPy-dev] summer marathon skypecon this week? Message-ID: <105079.43514.qm@web52108.mail.re2.yahoo.com> Wed. 16:00 UTC has been suggested - if there's even interest; how does this suit people? DG From ralf.gommers at googlemail.com Mon Aug 10 22:38:17 2009 From: ralf.gommers at googlemail.com (Ralf Gommers) Date: Mon, 10 Aug 2009 22:38:17 -0400 Subject: [SciPy-dev] summer marathon skypecon this week?
In-Reply-To: <105079.43514.qm@web52108.mail.re2.yahoo.com> References: <105079.43514.qm@web52108.mail.re2.yahoo.com> Message-ID: That would work for me. Ralf On Mon, Aug 10, 2009 at 3:45 PM, David Goldsmith wrote: > Wed. 16:00 UTC has been suggested - if there's even interest; how does this > suit people? > > DG > > > > _______________________________________________ > Scipy-dev mailing list > Scipy-dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > -------------- next part -------------- An HTML attachment was scrubbed... URL: From d_l_goldsmith at yahoo.com Tue Aug 11 00:23:15 2009 From: d_l_goldsmith at yahoo.com (David Goldsmith) Date: Mon, 10 Aug 2009 21:23:15 -0700 (PDT) Subject: [SciPy-dev] summer marathon skypecon this week? Message-ID: <89338.9696.qm@web52110.mail.re2.yahoo.com> OK, thanks, Ralf. DG --- On Mon, 8/10/09, Ralf Gommers wrote: > From: Ralf Gommers > Subject: Re: [SciPy-dev] summer marathon skypecon this week? > To: "SciPy Developers List" > Date: Monday, August 10, 2009, 7:38 PM > That would work for me. > > Ralf > > > On Mon, Aug 10, 2009 at 3:45 PM, > David Goldsmith > wrote: > > Wed. 16:00 UTC has been suggested - > if there's even interest; how does this suit people? > > > > > DG > > > > > > > > _______________________________________________ > > Scipy-dev mailing list > > Scipy-dev at scipy.org > > http://mail.scipy.org/mailman/listinfo/scipy-dev > > > > > -----Inline Attachment Follows----- > > _______________________________________________ > Scipy-dev mailing list > Scipy-dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > From d_l_goldsmith at yahoo.com Tue Aug 11 00:25:16 2009 From: d_l_goldsmith at yahoo.com (David Goldsmith) Date: Mon, 10 Aug 2009 21:25:16 -0700 (PDT) Subject: [SciPy-dev] summer marathon skypecon this week? Message-ID: <97534.82743.qm@web52106.mail.re2.yahoo.com> Whoops, didn't mean that to go to the whole list - sorry. 
DG --- On Mon, 8/10/09, David Goldsmith wrote: > From: David Goldsmith > Subject: Re: [SciPy-dev] summer marathon skypecon this week? > To: "SciPy Developers List" > Date: Monday, August 10, 2009, 9:23 PM > OK, thanks, Ralf. > > DG > > --- On Mon, 8/10/09, Ralf Gommers > wrote: > > > From: Ralf Gommers > > Subject: Re: [SciPy-dev] summer marathon skypecon this > week? > > To: "SciPy Developers List" > > Date: Monday, August 10, 2009, 7:38 PM > > That would work for me. > > > > Ralf > > > > > > On Mon, Aug 10, 2009 at 3:45 PM, > > David Goldsmith > > wrote: > > > > Wed. 16:00 UTC has been suggested - > > if there's even interest; how does this suit people? > > > > > > > > > > DG > > > > > > > > > > > > > > > > _______________________________________________ > > > > Scipy-dev mailing list > > > > Scipy-dev at scipy.org > > > > http://mail.scipy.org/mailman/listinfo/scipy-dev > > > > > > > > > > -----Inline Attachment Follows----- > > > > _______________________________________________ > > Scipy-dev mailing list > > Scipy-dev at scipy.org > > http://mail.scipy.org/mailman/listinfo/scipy-dev > > > > > ? ? ? > _______________________________________________ > Scipy-dev mailing list > Scipy-dev at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-dev > From d_l_goldsmith at yahoo.com Tue Aug 11 15:05:06 2009 From: d_l_goldsmith at yahoo.com (David Goldsmith) Date: Tue, 11 Aug 2009 12:05:06 -0700 (PDT) Subject: [SciPy-dev] Summer Marathon Skypecon tomorrow, Wed., Aug. 12, 16:00 UTC Message-ID: <315280.34538.qm@web52102.mail.re2.yahoo.com> OK, have two positive RSVP's , so we're on for 16:00 UTC tomorrow. DG From HAWRYLA at novachem.com Tue Aug 11 15:10:39 2009 From: HAWRYLA at novachem.com (Andrew Hawryluk) Date: Tue, 11 Aug 2009 13:10:39 -0600 Subject: [SciPy-dev] can I help with the docs? Message-ID: <48C01AE7354EC240A26F19CEB995E943033AF264@CHMAILMBX01.novachem.com> I'd like to help with the docs project. I have registered a user name of ahawryluk on the Wiki. 
Although the current focus is cleaning up the docstrings on the numpy module, I'm most interested in writing some introductory material for the NumPy User Guide.

Andrew Hawryluk
NOVA Chemicals Research & Technology Centre
Calgary, Canada

From d_l_goldsmith at yahoo.com Tue Aug 11 15:15:39 2009
From: d_l_goldsmith at yahoo.com (David Goldsmith)
Date: Tue, 11 Aug 2009 12:15:39 -0700 (PDT)
Subject: [SciPy-dev] can I help with the docs?
Message-ID: <734509.78574.qm@web52101.mail.re2.yahoo.com>

Hi, Andrew, and thanks for your offer to help. Can you be more specific vis-a-vis what you have in mind?

David Goldsmith, Technical Editor
Olympia, WA

From HAWRYLA at novachem.com Tue Aug 11 16:08:30 2009
From: HAWRYLA at novachem.com (Andrew Hawryluk)
Date: Tue, 11 Aug 2009 14:08:30 -0600
Subject: [SciPy-dev] can I help with the docs?
In-Reply-To: <734509.78574.qm@web52101.mail.re2.yahoo.com>
References: <734509.78574.qm@web52101.mail.re2.yahoo.com>
Message-ID: <48C01AE7354EC240A26F19CEB995E943033AF265@CHMAILMBX01.novachem.com>

Good question. I'll let you know what I was thinking, and then you can tell me if I've misunderstood the documentation objectives.

Each year I train a new set of engineering internship students in our department, and one of the things they learn is Python.
Upon arriving here they have taken a single course on C++ and another on Matlab/Mathematica/Excel etc., but no Python. I usually start them off reading "Dive Into Python", but I'd also like a single place to send them to get a brief tour of proper NumPy use. The main NumPy & SciPy docs are excellent (and improving rapidly), but they start a bit too abruptly for my needs. For example, the NumPy User Guide currently starts with installation instructions, and then the next page of body text is a table of all the available array types.

I would like something that briefly explains what NumPy and SciPy are, and why/when arrays are better than lists/dicts, perhaps followed by a brief tour of some common NumPy tricks, before diving into the more detailed sections.

One possible table of contents would be

Introduction
  What is NumPy?
  Building/Installing
  Short Tour
  How to find documentation
NumPy Basics
  ...

What direction were you thinking the User Guide would take?

Andrew

From vanforeest at gmail.com Tue Aug 11 17:06:52 2009
From: vanforeest at gmail.com (nicky van foreest)
Date: Tue, 11 Aug 2009 23:06:52 +0200
Subject: [SciPy-dev] can I help with the docs?
In-Reply-To: <48C01AE7354EC240A26F19CEB995E943033AF265@CHMAILMBX01.novachem.com>
References: <734509.78574.qm@web52101.mail.re2.yahoo.com> <48C01AE7354EC240A26F19CEB995E943033AF265@CHMAILMBX01.novachem.com>
Message-ID:

Hi Andrew,

I do not want to interfere with the scipy doc project... However, after having read your ideas, might this cover your needs, at least in part?

http://johnstachurski.net/lectures/index.html

bye

Nicky

From d_l_goldsmith at yahoo.com Tue Aug 11 18:17:09 2009
From: d_l_goldsmith at yahoo.com (David Goldsmith)
Date: Tue, 11 Aug 2009 15:17:09 -0700 (PDT)
Subject: [SciPy-dev] can I help with the docs?
Message-ID: <697806.15047.qm@web52105.mail.re2.yahoo.com>

--- On Tue, 8/11/09, Andrew Hawryluk wrote:

> I would like something that briefly explains what NumPy and SciPy are,
> and why/when arrays are better than lists/dicts, perhaps followed by a
> brief tour of some common NumPy tricks, before diving into the more
> detailed sections.
>
> One possible table of contents would be
>
> Introduction
>   What is NumPy?
>   Building/Installing
>   Short Tour
>   How to find documentation
> NumPy Basics
>   ...
>
> What direction were you thinking the User Guide would take?

Good question, especially as I don't think there's as clear a vision on that (certainly not as a matter of consensus) as yours is for what you want/need. Personally, my "advice" to you would be: draft the document/sections you would like to see added - then you'll have what you want regardless of how/when it gets added to the "official" User Guide - and then post it (or a link to it) here to be vetted, if you want.
That said,

http://docs.scipy.org/numpy/docs/numpy-docs/user/index.rst/

is the link to the Wiki (pydocweb) interface for editing the official User Guide, and you're more than welcome to add your content that way; of course, for better or worse, that's a Wiki, so whatever you add there will be subject to review and edit (but that's going to be true for any contribution to the official doc, of course). Note that there you'll find a link to a section titled "NumPy Basics" w/ further links to sub-sections "types," "array creation," "indexing," and so forth - if that content is *not* what you've already read (there's a lot of doc in the Wiki - anything that's not past the proof stage, basically - that's pending inclusion in anything official) be sure to look it over, as it may already contain some (much?) of what you're after. Thanks again,

DG

PS: If your original email was (primarily) intended to have someone grant you edit permissions, I'm afraid I haven't been granted that power yet, so you'll have to wait until someone who does, does.

From d_l_goldsmith at yahoo.com Tue Aug 11 19:02:15 2009
From: d_l_goldsmith at yahoo.com (David Goldsmith)
Date: Tue, 11 Aug 2009 16:02:15 -0700 (PDT)
Subject: [SciPy-dev] Some Q's vis-a-vis Numpy unicode support
Message-ID: <377127.33034.qm@web52105.mail.re2.yahoo.com>

First, a "reality check" question:

0) Is Windows (DOS) Terminal capable of rendering unicode?

Unless the answer is "No," my real question:

1) Does chararray.capitalize() capitalize non-Roman letters that have different lower-case and upper-case forms (e.g., the Greek letters)? If "yes," are there any exceptions (e.g., Russian letters)?

Thanks!

DG

From HAWRYLA at novachem.com Tue Aug 11 19:13:49 2009
From: HAWRYLA at novachem.com (Andrew Hawryluk)
Date: Tue, 11 Aug 2009 17:13:49 -0600
Subject: [SciPy-dev] can I help with the docs?
In-Reply-To: <697806.15047.qm@web52105.mail.re2.yahoo.com>
References: <697806.15047.qm@web52105.mail.re2.yahoo.com>
Message-ID: <48C01AE7354EC240A26F19CEB995E943033AF267@CHMAILMBX01.novachem.com>

That sounds good. I will draft up some ideas, and submit them for comment / review / merciless editing.
Yes, the 'NumPy Basics' section covers most of the important ground, so I will re-read it before I suggest anything.

Thanks,
Andrew

From HAWRYLA at novachem.com Tue Aug 11 19:15:48 2009
From: HAWRYLA at novachem.com (Andrew Hawryluk)
Date: Tue, 11 Aug 2009 17:15:48 -0600
Subject: [SciPy-dev] can I help with the docs?
In-Reply-To:
References: <734509.78574.qm@web52101.mail.re2.yahoo.com><48C01AE7354EC240A26F19CEB995E943033AF265@CHMAILMBX01.novachem.com>
Message-ID: <48C01AE7354EC240A26F19CEB995E943033AF268@CHMAILMBX01.novachem.com>

Thanks for the link! I will certainly include that in my list of helpful training materials.

Andrew

From d_l_goldsmith at yahoo.com Tue Aug 11 19:30:45 2009
From: d_l_goldsmith at yahoo.com (d_l_goldsmith at yahoo.com)
Date: Tue, 11 Aug 2009 16:30:45 -0700 (PDT)
Subject: [SciPy-dev] 'nuther Q re: chararray
Message-ID: <164127.69670.qm@web52103.mail.re2.yahoo.com>

From "Guide to Numpy":

"Perhaps the easiest way to create a chararray is to use self.view(chararray) where self is an ndarray of string or unicode data-type."

OK, but what is "(chararray)"? In particular, my best guesses yielded:

>>> c = np.empty((2,2), dtype=unicode)
>>> c
array([[u'\u0f01\ude68', u'\uf880\udc00'],
       [u'\uf0c0\udc00', u'\u5300']],
      dtype='
>>> c.view(chararray) # can't mean it literally, right?
Traceback (most recent call last):
  File "", line 1, in
NameError: name 'chararray' is not defined
>>> c.view(C) # does the expression assign a value to the argument?
Traceback (most recent call last):
  File "", line 1, in
NameError: name 'C' is not defined
>>> C = np.chararray((2,2)) # this is most logical
>>> c.view(C)
Traceback (most recent call last):
  File "", line 1, in
ValueError: Dtype must be a numpy data-type

So, what does Travis mean? Thanks,

DG

From d_l_goldsmith at yahoo.com Tue Aug 11 19:32:20 2009
From: d_l_goldsmith at yahoo.com (David Goldsmith)
Date: Tue, 11 Aug 2009 16:32:20 -0700 (PDT)
Subject: [SciPy-dev] can I help with the docs?
Message-ID: <140993.83164.qm@web52112.mail.re2.yahoo.com>

You're very welcome! Thank you for helping!

DG

From josef.pktd at gmail.com Tue Aug 11 19:36:10 2009
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Tue, 11 Aug 2009 19:36:10 -0400
Subject: [SciPy-dev] 'nuther Q re: chararray
In-Reply-To: <164127.69670.qm@web52103.mail.re2.yahoo.com>
References: <164127.69670.qm@web52103.mail.re2.yahoo.com>
Message-ID: <1cd32cbb0908111636qd8a54d8l6a873b078971f3ec@mail.gmail.com>

On Tue, Aug 11, 2009 at 7:30 PM, wrote:
> From "Guide to Numpy":
>
> "Perhaps the easiest way to create a chararray is to use self.view(chararray) where self is an ndarray of string or unicode data-type."
>
> OK, but what is "(chararray)"? In particular, my best guesses yielded:
>
>>>> c = np.empty((2,2), dtype=unicode)
>>>> c
> array([[u'\u0f01\ude68', u'\uf880\udc00'],
>        [u'\uf0c0\udc00', u'\u5300']],
>       dtype='
>>>> c.view(chararray) # can't mean it literally, right?
> Traceback (most recent call last):
>   File "", line 1, in
> NameError: name 'chararray' is not defined
>>>> c.view(C) # does the expression assign a value to the argument?
> Traceback (most recent call last):
>   File "", line 1, in
> NameError: name 'C' is not defined
>>>> C = np.chararray((2,2)) # this is most logical
>>>> c.view(C)
> Traceback (most recent call last):
>   File "", line 1, in
> ValueError: Dtype must be a numpy data-type
>
> So, what does Travis mean?
> Thanks,
>
> DG

with this type of error I usually check the namespace

>>> import numpy as np
>>> np.chararray
>>> c = np.empty((2,2), dtype=unicode)
>>> c.view(np.chararray)
chararray([['', u'\x04'],
       [u'\x01', u'\uf3db\ude20']],
      dtype='

But I have no idea what chararrays are

Josef

From d_l_goldsmith at yahoo.com Tue Aug 11 19:49:32 2009
From: d_l_goldsmith at yahoo.com (David Goldsmith)
Date: Tue, 11 Aug 2009 16:49:32 -0700 (PDT)
Subject: [SciPy-dev] Some Q's vis-a-vis Numpy unicode support
Message-ID: <902989.11820.qm@web52110.mail.re2.yahoo.com>

OK, may have answered Q1 myself: unless I'm misunderstanding what I'm seeing, what I'm finding is that capitalize() does nothing at all if the chararray is of dtype unicode - correct? Thanks,

DG

From d_l_goldsmith at yahoo.com Tue Aug 11 19:50:46 2009
From: d_l_goldsmith at yahoo.com (David Goldsmith)
Date: Tue, 11 Aug 2009 16:50:46 -0700 (PDT)
Subject: [SciPy-dev] 'nuther Q re: chararray
Message-ID: <880273.12857.qm@web52110.mail.re2.yahoo.com>

Doh! Jackpot, thanks!
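For readers following the thread, the resolution above is that view() wants the chararray class itself, reached through its namespace, and not an instance or a bare name. A minimal sketch, assuming Python 3 and a recent NumPy where the class is reachable as np.char.chararray; the array contents and variable names are made up for illustration and do not come from the thread:

```python
import numpy as np

# An ordinary ndarray of unicode strings (illustrative data).
words = np.array([["alpha", "beta"], ["gamma", "delta"]])

# Pass the chararray *class* to view(); a bare `chararray` raises NameError
# unless that name has been imported, hence the qualified form.
as_char = words.view(np.char.chararray)

# The view reuses the same buffer and adds vectorised string methods.
capitalized = as_char.capitalize()

print(type(as_char).__name__)
print(capitalized.tolist())
```

Passing an instance, as in the `c.view(C)` attempt earlier in the thread, fails because view() then tries to interpret its argument as a dtype.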
DG

--- On Tue, 8/11/09, josef.pktd at gmail.com wrote:

> with this type of error I usually check the namespace
>
> >>> import numpy as np
> >>> np.chararray
> >>> c = np.empty((2,2), dtype=unicode)
> >>> c.view(np.chararray)
> chararray([['', u'\x04'],
>        [u'\x01', u'\uf3db\ude20']],
>       dtype='
>
> But I have no idea what chararrays are
>
> Josef

From pgmdevlist at gmail.com Tue Aug 11 20:24:17 2009
From: pgmdevlist at gmail.com (Pierre GM)
Date: Tue, 11 Aug 2009 20:24:17 -0400
Subject: [SciPy-dev] 'nuther Q re: chararray
In-Reply-To: <164127.69670.qm@web52103.mail.re2.yahoo.com>
References: <164127.69670.qm@web52103.mail.re2.yahoo.com>
Message-ID: <8FD5A7ED-1F76-4701-A528-F61D1138739A@gmail.com>

On Aug 11, 2009, at 7:30 PM, d_l_goldsmith at yahoo.com wrote:

> From "Guide to Numpy":
>
> "Perhaps the easiest way to create a chararray is to use
> self.view(chararray) where self is an ndarray of string or unicode
> data-type."
>
> OK, but what is "(chararray)"? In particular, my best guesses
> yielded:

numpy.char.chararray

Check defchararray.py in numpy.core

From josef.pktd at gmail.com Tue Aug 11 20:41:33 2009
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Tue, 11 Aug 2009 20:41:33 -0400
Subject: [SciPy-dev] Some Q's vis-a-vis Numpy unicode support
In-Reply-To: <902989.11820.qm@web52110.mail.re2.yahoo.com>
References: <902989.11820.qm@web52110.mail.re2.yahoo.com>
Message-ID: <1cd32cbb0908111741u59d3dd38o2212a77d42a8a8d5@mail.gmail.com>

On Tue, Aug 11, 2009 at 7:49 PM, David Goldsmith wrote:
> OK, may have answered Q1 myself: unless I'm misunderstanding what I'm seeing, what I'm finding is that capitalize() does nothing at all if the chararray is of dtype unicode - correct?
> Thanks,

>>> b
chararray(u'\xe9',
      dtype='
>>> b.capitalize()
chararray(u'\xc9',
      dtype='

see http://stackoverflow.com/questions/1006450/capitalizing-non-ascii-words-in-python

> DG
>
> --- On Tue, 8/11/09, David Goldsmith wrote:
>
>> From: David Goldsmith
>> Subject: Some Q's vis-a-vis Numpy unicode support
>> To: scipy-dev at scipy.org
>> Date: Tuesday, August 11, 2009, 4:02 PM
>> First, a "reality check" question:
>>
>> 0) Is Windows (DOS) Terminal capable of rendering unicode?

not by default (in US English at least), but the code page number can be changed, which I never tried

>help graftabl
Enable Windows to display an extended character set in graphics mode.

GRAFTABL [xxx]
GRAFTABL /STATUS

  xxx      Specifies a code page number.
  /STATUS  Displays the current code page selected for use with GRAFTABL.

from python session in windows command shell (it prints correctly in case mail doesn't render it)

>>> print u'\xe9'
é
>>> print u'\xe9'.capitalize()
É
>>> u'\xe9'.capitalize()
u'\xc9'
>>>

but I cannot print any numpy.chararrays without getting

>>> c= np.array(u'\xe9','
>>> print c
....
UnicodeEncodeError: 'ascii' codec can't encode character u'\xe9' in position 0: ordinal not in range(128)

(this is in Idle, with cp1252 I think)

the usual encode, decode problems with unicode, which take several hours of trial and error and reading docs to figure out.

Josef

>>
>> Unless the answer is "No," my real question:
>>
>> 1) Does chararray.capitalize() capitalize non-Roman letters
>> that have different lower-case and upper-case forms (e.g.,
>> the Greek letters)? If "yes," are there any exceptions
>> (e.g., Russian letters)?
>>
>> Thanks!
>>
>> DG

From josef.pktd at gmail.com Tue Aug 11 21:03:18 2009
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Tue, 11 Aug 2009 21:03:18 -0400
Subject: [SciPy-dev] Some Q's vis-a-vis Numpy unicode support
In-Reply-To: <1cd32cbb0908111741u59d3dd38o2212a77d42a8a8d5@mail.gmail.com>
References: <902989.11820.qm@web52110.mail.re2.yahoo.com> <1cd32cbb0908111741u59d3dd38o2212a77d42a8a8d5@mail.gmail.com>
Message-ID: <1cd32cbb0908111803m33f140adwe4f633bee5f6cf7f@mail.gmail.com>

On Tue, Aug 11, 2009 at 8:41 PM, wrote:
> >>> b
> chararray(u'\xe9',
>       dtype='
> >>> b.capitalize()
> chararray(u'\xc9',
>       dtype='
>
> see http://stackoverflow.com/questions/1006450/capitalizing-non-ascii-words-in-python
>
> but I cannot print any numpy.chararrays without getting
>
> >>> c= np.array(u'\xe9','
> >>> print c
> ....
> UnicodeEncodeError: 'ascii' codec can't encode character u'\xe9' in
> position 0: ordinal not in range(128)
>
> (this is in Idle, with cp1252 I think)
>
> the usual encode, decode problems with unicode, which take several
> hours of trial and error and reading docs to figure out.

actually this works (in Idle)

>>> b = np.array([u'\xe9',u'\xe9'],'
>>> print b.encode('cp1252')[0]
é
>>> print b.capitalize().encode('cp1252')[0]
É
>>> print b[0].encode('cp1252')
é

this looks like a bug? or is it a known limitation that chararrays cannot be 0-d

>>> b0= np.array(u'\xe9','
>>> print b0.encode('cp1252')
Traceback (most recent call last):
  File "", line 1, in
    print b0.encode('cp1252')
  File "C:\Programs\Python25\Lib\site-packages\numpy\core\defchararray.py", line 217, in encode
    return self._generalmethod('encode', broadcast(self, encoding, errors))
  File "C:\Programs\Python25\Lib\site-packages\numpy\core\defchararray.py", line 162, in _generalmethod
    newarr[:] = res
ValueError: cannot slice a 0-d array

From d_l_goldsmith at yahoo.com Tue Aug 11 22:28:32 2009
From: d_l_goldsmith at yahoo.com (David Goldsmith)
Date: Tue, 11 Aug 2009 19:28:32 -0700 (PDT)
Subject: [SciPy-dev] Some Q's vis-a-vis Numpy unicode support
Message-ID: <877871.14415.qm@web52104.mail.re2.yahoo.com>

Thanks, Josef.
This may just be an artifact of working in a DOS Terminal (but your example, though not printing the accented e, did at least print something different for b vs. b.capitalize()), or it may be because I don't know the right encoding to use, but I tried your code w/ what I found on Wikipedia to be the unicode for the Greek letter delta, namely, u'\x03b04', with both 'cp1252' and 'iso8859-7' encoding (the latter being inferred from the same Wikipedia article) and here's what I get:

>>> b = np.array([u'\x03b04',u'\x03b04'],'
>>> print b.encode('cp1252')[0]
?
>>> print b.capitalize().encode('cp1252')[0]
?
>>> print b.encode('iso8859-7')[0]
?
>>> print b.capitalize().encode('iso8859-7')[0]
?

i.e., no difference. If I'm doing something wrong, please let me know; otherwise, for the purpose of documenting chararray.capitalize() - which is my ultimate goal - is there any rhyme or reason behind which unicode characters capitalize() works on and which it doesn't?

Thanks,

DG

--- On Tue, 8/11/09, josef.pktd at gmail.com wrote:

> actually this works (in Idle)
>
> >>> b = np.array([u'\xe9',u'\xe9'],'
> >>> print b.encode('cp1252')[0]
> é
> >>> print b.capitalize().encode('cp1252')[0]
> É
> >>> print b[0].encode('cp1252')
> é

From josef.pktd at gmail.com Tue Aug 11 23:18:14 2009
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Tue, 11 Aug 2009 23:18:14 -0400
Subject: [SciPy-dev] Some Q's vis-a-vis Numpy unicode support
In-Reply-To: <877871.14415.qm@web52104.mail.re2.yahoo.com>
References: <877871.14415.qm@web52104.mail.re2.yahoo.com>
Message-ID: <1cd32cbb0908112018x35546e80u1e42d18548dd659a@mail.gmail.com>

On Tue, Aug 11, 2009 at 10:28 PM, David Goldsmith wrote:
> Thanks, Josef. This may just be an artifact of working in a DOS Terminal (but your example, though not printing the accented e, did at least print something different for b vs. b.capitalize()), or it may be because I don't know the right encoding to use, but I tried your code w/ what I found on Wikipedia to be the unicode for the Greek letter delta, namely, u'\x03b04', with both 'cp1252' and 'iso8859-7' encoding (the latter being inferred from the same Wikipedia article) and here's what I get:
>
> >>> b = np.array([u'\x03b04',u'\x03b04'],'
> >>> print b.encode('cp1252')[0]
> ?
> >>> print b.capitalize().encode('cp1252')[0]
> ?
> >>> print b.encode('iso8859-7')[0]
> ?
> >>> print b.capitalize().encode('iso8859-7')[0]
> ?
>
> i.e., no difference.
> If I'm doing something wrong, please let me know; otherwise, for the purpose of documenting chararray.capitalize() - which is my ultimate goal - is there any rhyme or reason behind which unicode characters capitalize() works on and which it doesn't?
>
> Thanks,
>
> DG
> --- On Tue, 8/11/09, josef.pktd at gmail.com wrote:
>
>> >>> 1) Does chararray.capitalize() capitalize non-Roman letters
>> >>> that have different lower-case and upper-case forms (e.g.,
>> >>> the Greek letters)? If "yes," are there any exceptions
>> >>> (e.g., Russian letters)?

I think yes, exceptions are languages for which no capital letters exist, Cantonese(Chinese) ?
http://www.isthisthingon.org/unicode/index.phtml?page=03&subpage=B&glyph=03B04
??? google search for 03B04,

I have problems finding the correct codes for the characters and usually need a word processor.

To me it looks like your character is not a Greek delta

>>> print u'\x03b04'
b04
>>> print u'\u03b04'
δ4
>>> print u'\u03b4'
δ
I don't know what it is since it doesn't render to anything meaningful

I managed to get the Greek delta through the html code for it δ from page:
http://www.isthisthingon.org/unicode/index.phtml?page=00&subpage=3&hilite=003B4

running this script:

# -*- coding: utf-8 -*-

sd = u'δ'
print sd

b = np.array([u'\u03b4',u'\u0394'],'
print b[0]
print repr(b[0])
print b.capitalize()[0]
print repr(b.capitalize()[0])

***********
prints this in my Idle shell
>>>
δ
δ
u'\u03b4'
Δ
u'\u0394'

delta is correctly capitalized

Josef

From josef.pktd at gmail.com Tue Aug 11 23:40:24 2009
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Tue, 11 Aug 2009 23:40:24 -0400
Subject: [SciPy-dev] Some Q's vis-a-vis Numpy unicode support
In-Reply-To: <1cd32cbb0908112018x35546e80u1e42d18548dd659a@mail.gmail.com>
References: <877871.14415.qm@web52104.mail.re2.yahoo.com> <1cd32cbb0908112018x35546e80u1e42d18548dd659a@mail.gmail.com>
Message-ID: <1cd32cbb0908112040j78c28f04nb07bdf786252305@mail.gmail.com>

On Tue, Aug 11, 2009 at 11:18 PM, wrote:
> delta is correctly capitalized
>
> Josef

trying without copying and pasting non-ASCII characters

the page at
http://www.isthisthingon.org/unicode/index.phtml?page=00&subpage=3&glyph=003B4

also has the utf8 code \xCE\xB4, everything looks ok starting from this.

Josef

>>> '\xCE\xB4'.decode('utf8')
u'\u03b4'
>>> print '\xCE\xB4'.decode('utf8')
δ
>>> print '\xCE\xB4'.decode('utf8').capitalize()
Δ
>>> b = np.array(['\xCE\xB4'.decode('utf8'),'\xCE\xB4'.decode('utf8')],'
>>> b
chararray([u'\u03b4', u'\u03b4'],
      dtype='
>>> print b[0]
δ
>>> print b.capitalize()[0]
Δ

From josef.pktd at gmail.com Tue Aug 11 23:59:59 2009
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Tue, 11 Aug 2009 23:59:59 -0400
Subject: [SciPy-dev] Some Q's vis-a-vis Numpy unicode support
In-Reply-To: <1cd32cbb0908112040j78c28f04nb07bdf786252305@mail.gmail.com>
References: <877871.14415.qm@web52104.mail.re2.yahoo.com> <1cd32cbb0908112018x35546e80u1e42d18548dd659a@mail.gmail.com> <1cd32cbb0908112040j78c28f04nb07bdf786252305@mail.gmail.com>
Message-ID: <1cd32cbb0908112059t1d1c2edfnca30e3ccf63c7f3c@mail.gmail.com>

On Tue, Aug 11, 2009 at 11:40 PM, wrote:
> >>> print b[0]
> δ
> >>> print b.capitalize()[0]
> Δ

and for the fun of it,
a Russian (Cyrillic) character that capitalizes

>>> print '\xD0\xB9'.decode('utf8')
й
>>> print '\xD0\xB9'.decode('utf8').capitalize()
Й
>>> '\xD0\xB9'.decode('utf8')
u'\u0439'
>>> '\xD0\xB9'.decode('utf8').capitalize()
u'\u0419'

and a German letter that doesn't have a capitalized version

>>> print '\xC3\x9F'.decode('utf8').capitalize()
ß
>>> print '\xC3\x9F'.decode('utf8')
ß
>>> '\xC3\x9F'.decode('utf8')
u'\xdf'
>>> '\xC3\x9F'.decode('utf8').capitalize()
u'\xdf'

and here's a nice picture of unicode 03B04
http://www.cns11643.gov.tw/seeker/english/showfont.jsp?ucode=03B04

and here are all unicode characters (although my browser doesn't display most of them)
http://www.isthisthingon.org/unicode/allchars1.php

I hope this helps,

Josef

From josef.pktd at gmail.com Wed Aug 12 00:18:58 2009
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Wed, 12 Aug 2009 00:18:58 -0400
Subject: [SciPy-dev] Some Q's vis-a-vis Numpy unicode support
In-Reply-To: <1cd32cbb0908112059t1d1c2edfnca30e3ccf63c7f3c@mail.gmail.com>
References: <877871.14415.qm@web52104.mail.re2.yahoo.com> <1cd32cbb0908112018x35546e80u1e42d18548dd659a@mail.gmail.com> <1cd32cbb0908112040j78c28f04nb07bdf786252305@mail.gmail.com> <1cd32cbb0908112059t1d1c2edfnca30e3ccf63c7f3c@mail.gmail.com>
Message-ID: <1cd32cbb0908112118i2ea11905i40b8f644cfa8bd4b@mail.gmail.com>

On Tue, Aug 11, 2009 at 11:59 PM, wrote:
> I hope this helps,
>
> Josef

and then there is also

>>> b = np.array([u'\u03b4\u03b4', u'\u03b4\u03b4'],'
>>> print b.capitalize()
[u'\u0394\u03b4' u'\u0394\u03b4']
>>> print b.capitalize()[0]
Δδ
>>> print b.upper()[0]
ΔΔ
>>> print b.upper().lower()[0]
δδ
>>> print b.title()[0]
Δδ

that's enough fun for the night

Josef

From d_l_goldsmith at yahoo.com Wed Aug 12 01:45:17 2009
From: d_l_goldsmith at yahoo.com (David Goldsmith)
Date: Tue, 11 Aug 2009 22:45:17 -0700 (PDT)
Subject: [SciPy-dev] Some Q's vis-a-vis Numpy unicode support
In-Reply-To: <1cd32cbb0908112059t1d1c2edfnca30e3ccf63c7f3c@mail.gmail.com>
Message-ID: <941048.92622.qm@web52101.mail.re2.yahoo.com>

Actually, since you seem so into it ;-) can you write me a little script (just 'cause it seems like you could do it faster) to print all the unicode characters u such that u == u.capitalize()?
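A minimal sketch of the kind of script asked for here, under stated assumptions: it is written for Python 3 (the sessions in this thread use Python 2.5, where `unicode` and `str` are separate types), the helper name `split_by_capitalize` is made up for illustration, and the Greek and Coptic block is just an example range. Python 3 applies fuller Unicode case mappings than Python 2, so a few characters may come out differently than in the sessions above.

```python
import unicodedata


def split_by_capitalize(start, stop):
    """Split code points in [start, stop) by whether capitalize() changes them.

    Returns (unchanged, changed): two lists of integer code points.
    """
    unchanged, changed = [], []
    for cp in range(start, stop):
        ch = chr(cp)
        # Skip surrogates (Cs) and unassigned code points (Cn).
        if unicodedata.category(ch) in ("Cs", "Cn"):
            continue
        if ch == ch.capitalize():
            unchanged.append(cp)
        else:
            changed.append(cp)
    return unchanged, changed


if __name__ == "__main__":
    # Example range only: the Greek and Coptic block, U+0370..U+03FF.
    unchanged, changed = split_by_capitalize(0x0370, 0x0400)
    for cp in changed:
        ch = chr(cp)
        # ascii() keeps the output printable on terminals without Unicode support.
        print("U+%04X %s -> %s" % (cp, ascii(ch), ascii(ch.capitalize())))
    print(len(unchanged), "code points in this range satisfy u == u.capitalize()")
```

Passing `0` and `sys.maxunicode + 1` as the range would cover every code point; printing with `ascii()` sidesteps the terminal encoding problems discussed earlier in the thread.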

DG

From josef.pktd at gmail.com Wed Aug 12 09:54:09 2009
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Wed, 12 Aug 2009 09:54:09 -0400
Subject: [SciPy-dev] Some Q's vis-a-vis Numpy unicode support
In-Reply-To: <941048.92622.qm@web52101.mail.re2.yahoo.com>
References: <1cd32cbb0908112059t1d1c2edfnca30e3ccf63c7f3c@mail.gmail.com> <941048.92622.qm@web52101.mail.re2.yahoo.com>
Message-ID: <1cd32cbb0908120654j29577d1dmb9fa207efbbc2f1a@mail.gmail.com>

On Wed, Aug 12, 2009 at 1:45 AM, David Goldsmith wrote:
> Actually, since you seem so into it ;-)

This was just a refresher, I struggled much more the first time I tried to use non-english filenames and files.

> can you write me a little script (just 'cause it seems like you could do it faster) to print all the unicode characters u such
> that u == u.capitalize()?

u == u.capitalize() that's for most of them. The webpage lists 89,674 unicode characters and I didn't want to try all of them.
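
For anyone reading this later, here is a minimal sketch of the kind of scan being discussed, written for Python 3 rather than the Python 2 used in this thread. It is not the script from this message: the helper name chars_changed_by_capitalize is made up for illustration, and it calls the built-in str.capitalize() on single characters instead of chararray.capitalize(). Results may also differ from Python 2 for a few characters (as far as I know, Python 3 maps ß to a two-letter result instead of leaving it unchanged).

```python
def chars_changed_by_capitalize(start=0, stop=1000):
    """Return the characters with code points in [start, stop)
    for which str.capitalize() gives back a different string.

    Hypothetical helper, for illustration only.
    """
    changed = []
    for code in range(start, stop):
        ch = chr(code)
        if ch.capitalize() != ch:
            changed.append(ch)
    return changed


changed = chars_changed_by_capitalize(0, 1000)
print("%d of the first 1000 code points change under capitalize()" % len(changed))

# ascii() escapes non-ASCII characters, so the listing prints the same way
# on terminals that cannot display Greek or Cyrillic letters.
for ch in changed[:10]:
    print("U+%04X: %s -> %s" % (ord(ch), ascii(ch), ascii(ch.capitalize())))
```

Flipping the test to ch.capitalize() == ch gives the complementary list that David asked for, which is much longer because most code points are not lower-case letters.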
Below are the unicode characters in the first 1000 for which u != u.capitalize()

josef

-----------------------------
print unichr(30)

maxcode = 1000   # I don't want to try 38000
start = 0        # 38000 is boring
umany = np.array([unichr(i) for i in xrange(start,start+maxcode)], '

> DG
>
> --- On Tue, 8/11/09, josef.pktd at gmail.com wrote:
>
>> From: josef.pktd at gmail.com
>> Subject: Re: [SciPy-dev] Some Q's vis-a-vis Numpy unicode support
>> To: "SciPy Developers List"
>> Date: Tuesday, August 11, 2009, 8:59 PM
>> On Tue, Aug 11, 2009 at 11:40 PM,
>>
>> wrote:
>> > On Tue, Aug 11, 2009 at 11:18 PM,
>> wrote:
>> >> On Tue, Aug 11, 2009 at 10:28 PM, David
>> >> Goldsmith
>> wrote:
>> >>> Thanks, Josef.  This may just be an artifact
>> of working in a DOS Terminal (but your example, though not
>> printing the accented e, did at least print something
>> different for b vs. b.capitalize()), or it may be because I
>> don't know the right encoding to use, but I tried your code
>> w/ what I found on Wikipedia to be the unicode for the Greek
>> letter delta, namely, u'\x03b04', with both 'cp1252' and
>> 'iso8859-7' encoding (the latter being inferred from the
>> same Wikipedia article) and here's what I get:
>> >>>
>> >>>>>> b =
>> np.array([u'\x03b04',u'\x03b04'],'> >>>>>> print b.encode('cp1252')[0]
>> >>> ?
>> >>>>>> print
>> b.capitalize().encode('cp1252')[0]
>> >>> ?
>> >>>>>> print b.encode('iso8859-7')[0]
>> >>> ?
>> >>>>>> print
>> b.capitalize().encode('iso8859-7')[0]
>> >>> ?
>> >>>
>> >>> i.e., no difference.  If I'm doing something
>> wrong, please let me know; otherwise, for the purpose of
>> documenting chararray.capitalize() - which is my ultimate
>> goal - is there any rhyme or reason behind which unicode
>> characters capitalize() works on and which it doesn't?
>> >>>
>> >>> Thanks,
>> >>>
>> >>> DG
>> >>> --- On Tue, 8/11/09, josef.pktd at gmail.com
>>
>> wrote:
>> >>>
>> >>>> actually this works (in Idle)
>> >>>>
>> >>>> >>> b =
>> >>>>
>> np.array([u'\xe9',u'\xe9'],'