From josef.pktd at gmail.com Sat Jun 1 11:35:33 2013 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Sat, 1 Jun 2013 11:35:33 -0400 Subject: [SciPy-User] peer review of scientific software In-Reply-To: References: <51A39B6D.4030607@gmail.com> <1369793909.71134.YahooMailNeo@web121502.mail.ne1.yahoo.com> Message-ID: On Tue, May 28, 2013 at 10:34 PM, Matthew Brett wrote: > Hi, > > On Tue, May 28, 2013 at 7:18 PM, Paulo Jabardo wrote: >> I'm an engineer working in research but I spend a good deal of time coding. >> What I've seen with most of my colleagues and friends is that they will only >> code whenever it is extremely necessary for an immediate application in an >> experiment or for their PhD. The problem starts very early, when I was >> beginning my studies, we were taught C (and that is still the case almost 20 >> years later). A small percentage of the students (10%?) enjoy programming >> and they will profit. I really loved pointers and doing neat tricks. For the >> rest it was torture, plain and simple torture. And completely useless. Most >> students couldn't do anything useful with programming. All their suffering >> was for nothing. What happened later was obvious: they would avoid >> programming at all costs and if they had to do something they would use >> MS-Excel. The spreadsheets I've seen... I still have nightmares. The things >> they accomplished humbles me, proves that I'm a lower being. I've seen >> people solve partial differential equations where each cell was an element >> in the solution and it was colored according to the result. Beautiful but >> I'd rather suffer accute physical pain than to do something like that, or >> worse, debug such a "program". By the way, this sort of application was not >> a joke or a neat hack, it was actually the only way those guys knew how to >> solve a problem. >> >> 15 years later... I have a physics undergraduate student working with me. >> Very smart and interested. They still learn C and later on when they need to >> do something, what is it they do? Most professors use Origin. A huge >> improvement over Excel, but still. A couple of months ago, he had to turn in >> a report and since we don't have Origin, he was using Excel. I kind of felt >> sorry for him and I helped him out to do it in Python. He couldn't believe >> it. > > Oh - dear; you probably saw this stuff? > > http://blog.stodden.net/2013/04/19/what-the-reinhart-rogoff-debacle-really-shows-verifying-empirical-results-needs-to-be-routine/ I think that's a good example that peer review works. > >> I did my Masters and PhD in CFD. Most other students had almost no >> background in programming and did most things using Excel! When they had to >> modify some code, it was almost by accident that things worked. You can >> imagine what sort of code comes out of this. The professors didn't know >> programming much better. Just getting them to understand the concept of >> version control took a while. >> >> In my opinion, If schools taught, at the begining, something like >> Python/Octave/R instead of C, students would be able to use this knowledge >> easily and productively throughout their courses and eventually learn C when >> they really needed it. 
> > That's surely one of the big arguments for Python - it is a great > first language, and it is capable across a wider range than Octave or > R - or even Excel :) We can make mistakes in any language. I just read this """ Abstract [Correction Notice: An Erratum for this article was reported in Vol 17(4) of Psychological Methods (see record 2012-33502-001). The R code for arriving at adjusted p values for one of the methods is incorrect. The specific changes that need to be made are provided in the erratum.] """ It's still functioning peer review if a mistake is found after an article has been published, or after a pull request has landed in master. --------- in general: in the research areas that I know, the vast majority of researchers use Windows, and everything that is not a core task is point and click. As long as Matlab, Stata and GAUSS, or whatever else, doesn't have version control built in, VC won't be used by the majority of researchers that I know. We didn't grow up when version control was popular. And we don't have IT guys to manage it for us. (There is the old fashioned version control of starting new directories at crucial stages, or for specific conference talks and paper submissions.) (DVCS are only a few years old, and it will take a few more years for diffusion to "non-programmers" to happen.) Even after using git for some time, I only find it usable because I can do all the regular stuff with git gui (and for unusual stuff I can use the command line and git gui at the same time). --------- (just in case I'm misunderstood: I'm all in favor of best practices and unit and functional tests, but I don't expect that researchers will adopt them (fast) if they go against their usual pattern of using tools. example: If you teach a software carpentry course that uses Linux, then I wouldn't be surprised if some users go back to their office and the first thing they do is use Excel. :) Josef (I used a virtual Debian for one month.) > > Cheers, > > Matthew > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user From matthew.brett at gmail.com Sat Jun 1 17:39:37 2013 From: matthew.brett at gmail.com (Matthew Brett) Date: Sat, 1 Jun 2013 14:39:37 -0700 Subject: [SciPy-User] peer review of scientific software In-Reply-To: References: <51A39B6D.4030607@gmail.com> <1369793909.71134.YahooMailNeo@web121502.mail.ne1.yahoo.com> Message-ID: Hi, On Sat, Jun 1, 2013 at 8:35 AM, wrote: > On Tue, May 28, 2013 at 10:34 PM, Matthew Brett wrote: >> Hi, >> >> On Tue, May 28, 2013 at 7:18 PM, Paulo Jabardo wrote: >>> I'm an engineer working in research but I spend a good deal of time coding. >>> What I've seen with most of my colleagues and friends is that they will only >>> code whenever it is extremely necessary for an immediate application in an >>> experiment or for their PhD. The problem starts very early, when I was >>> beginning my studies, we were taught C (and that is still the case almost 20 >>> years later). A small percentage of the students (10%?) enjoy programming >>> and they will profit. I really loved pointers and doing neat tricks. For the >>> rest it was torture, plain and simple torture. And completely useless. Most >>> students couldn't do anything useful with programming. All their suffering >>> was for nothing. What happened later was obvious: they would avoid >>> programming at all costs and if they had to do something they would use >>> MS-Excel. The spreadsheets I've seen...
I still have nightmares. The things >>> they accomplished humbles me, proves that I'm a lower being. I've seen >>> people solve partial differential equations where each cell was an element >>> in the solution and it was colored according to the result. Beautiful but >>> I'd rather suffer accute physical pain than to do something like that, or >>> worse, debug such a "program". By the way, this sort of application was not >>> a joke or a neat hack, it was actually the only way those guys knew how to >>> solve a problem. >>> >>> 15 years later... I have a physics undergraduate student working with me. >>> Very smart and interested. They still learn C and later on when they need to >>> do something, what is it they do? Most professors use Origin. A huge >>> improvement over Excel, but still. A couple of months ago, he had to turn in >>> a report and since we don't have Origin, he was using Excel. I kind of felt >>> sorry for him and I helped him out to do it in Python. He couldn't believe >>> it. >> >> Oh - dear; you probably saw this stuff? >> >> http://blog.stodden.net/2013/04/19/what-the-reinhart-rogoff-debacle-really-shows-verifying-empirical-results-needs-to-be-routine/ > > I think that's a good example that peer review works. It's a good example of how peer-review should work, but it's very uncommon for the reviewer to have the original spreadsheet, and that was the key to the problem. >>> I did my Masters and PhD in CFD. Most other students had almost no >>> background in programming and did most things using Excel! When they had to >>> modify some code, it was almost by accident that things worked. You can >>> imagine what sort of code comes out of this. The professors didn't know >>> programming much better. Just getting them to understand the concept of >>> version control took a while. >>> >>> In my opinion, If schools taught, at the begining, something like >>> Python/Octave/R instead of C, students would be able to use this knowledge >>> easily and productively throughout their courses and eventually learn C when >>> they really needed it. >> >> That's surely one of the big arguments for Python - it is a great >> first language, and it is capable across a wider range than Octave or >> R - or even Excel :) > > We can mistake in any language > > I just read this > > """ > Abstract > > [Correction Notice: An Erratum for this article was reported in > Vol 17(4) of Psychological Methods (see record 2012-33502-001). The R > code for arriving at adjusted p values for one of the methods is > incorrect. The specific changes that need to be made are provided in > the erratum.] > """ > > It's still functioning peer review if a mistake is found after an > article has been published, or after a pull request has landed in > master. The problem is that the peers don't get to review what has been done, in general, they get to review what the author said had been done. Donoho's point - about computational science - is that this can be very different. The question is then : does this matter? Are - most published research findings false? > --------- > in general: > > in the research areas that I know, the vast majority of researchers > use Windows, and everything that is not core task is point and click. > As long as Matlab, Stata and GAUSS, or whatever else, doesn't have > version control build in, VC won't be used by the majority of > researchers that I know. We didn't grow up when version control was > popular. And we don't have IT guys to manage it for us. 
> (There is the old fashioned version control of starting new > directories at crucial stages, or for specific conference talks and > paper submissions.) > (DVCS are only a few years old, and it will take a few more years for > diffusion to "non-programmers" to happen.) We get taught some complicated things when we are training - calculus, algebra... Does it make sense that we don't teach less complicated things like version control and programming? > Even after using git for some time, I only find it usable because I can do > all the regular stuff with git gui (and for unusual stuff I can use > the command line and git gui at the same time). > > > --------- > (just in case I'm misunderstood: > I'm all in favor of best practices and unit and functional tests, but > I don't expect that researchers will adopt them (fast) if they go > against their usual pattern of using tools. > example: If you teach a software carpentry course that uses Linux, > then I wouldn't be surprised if some users go back to their office and > the first thing they do is use Excel. :) In general as you know I agree completely that it doesn't make sense to persuade people to switch from Windows to Linux at the same time as persuading them to use good software tools. We should teach people stuff that they will and can use, and it's a common theme among software-carpentry types that it would be better to teach Windows people how to best use Windows rather than teaching them on a virtual machine that they are unlikely to use for their work. Cheers, Matthew From josef.pktd at gmail.com Sat Jun 1 23:29:26 2013 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Sat, 1 Jun 2013 23:29:26 -0400 Subject: [SciPy-User] peer review of scientific software In-Reply-To: References: <51A39B6D.4030607@gmail.com> <1369793909.71134.YahooMailNeo@web121502.mail.ne1.yahoo.com> Message-ID: On Sat, Jun 1, 2013 at 5:39 PM, Matthew Brett wrote: > Hi, > > On Sat, Jun 1, 2013 at 8:35 AM, wrote: >> On Tue, May 28, 2013 at 10:34 PM, Matthew Brett wrote: >>> Hi, >>> >>> On Tue, May 28, 2013 at 7:18 PM, Paulo Jabardo wrote: >>>> I'm an engineer working in research but I spend a good deal of time coding. >>>> What I've seen with most of my colleagues and friends is that they will only >>>> code whenever it is extremely necessary for an immediate application in an >>>> experiment or for their PhD. The problem starts very early, when I was >>>> beginning my studies, we were taught C (and that is still the case almost 20 >>>> years later). A small percentage of the students (10%?) enjoy programming >>>> and they will profit. I really loved pointers and doing neat tricks. For the >>>> rest it was torture, plain and simple torture. And completely useless. Most >>>> students couldn't do anything useful with programming. All their suffering >>>> was for nothing. What happened later was obvious: they would avoid >>>> programming at all costs and if they had to do something they would use >>>> MS-Excel. The spreadsheets I've seen... I still have nightmares. The things >>>> they accomplished humbles me, proves that I'm a lower being. I've seen >>>> people solve partial differential equations where each cell was an element >>>> in the solution and it was colored according to the result. Beautiful but >>>> I'd rather suffer accute physical pain than to do something like that, or >>>> worse, debug such a "program". By the way, this sort of application was not >>>> a joke or a neat hack, it was actually the only way those guys knew how to >>>> solve a problem.
>>>> >>>> 15 years later... I have a physics undergraduate student working with me. >>>> Very smart and interested. They still learn C and later on when they need to >>>> do something, what is it they do? Most professors use Origin. A huge >>>> improvement over Excel, but still. A couple of months ago, he had to turn in >>>> a report and since we don't have Origin, he was using Excel. I kind of felt >>>> sorry for him and I helped him out to do it in Python. He couldn't believe >>>> it. >>> >>> Oh - dear; you probably saw this stuff? >>> >>> http://blog.stodden.net/2013/04/19/what-the-reinhart-rogoff-debacle-really-shows-verifying-empirical-results-needs-to-be-routine/ >> >> I think that's a good example that peer review works. > > It's a good example of how peer-review should work, but it's very > uncommon for the reviewer to have the original spreadsheet, and that > was the key to the problem. The spreadsheet mistake was only one point driving the result, the rest were modelling decisions. Even without having access to their original work, the study can be independently redone and show that there is no "big" effect. Even in their results, using robust measures like median doesn't show much of an effect. So it's mainly a few outliers (or coding mistakes). My favorite outside economics: http://www.genomesunzipped.org/2012/03/questioning-the-evidence-for-non-canonical-rna-editing-in-humans.php (one advantage of economics is that there have always been "schools of thought" partially lined up with the political orientation. This has the consequence that if one side finds something "good", the other side tries to disprove it. And the compensating bias might uncover what is a robust finding.) > >>>> I did my Masters and PhD in CFD. Most other students had almost no >>>> background in programming and did most things using Excel! When they had to >>>> modify some code, it was almost by accident that things worked. You can >>>> imagine what sort of code comes out of this. The professors didn't know >>>> programming much better. Just getting them to understand the concept of >>>> version control took a while. >>>> >>>> In my opinion, If schools taught, at the begining, something like >>>> Python/Octave/R instead of C, students would be able to use this knowledge >>>> easily and productively throughout their courses and eventually learn C when >>>> they really needed it. >>> >>> That's surely one of the big arguments for Python - it is a great >>> first language, and it is capable across a wider range than Octave or >>> R - or even Excel :) >> >> We can make mistakes in any language. >> >> I just read this >> >> """ >> Abstract >> >> [Correction Notice: An Erratum for this article was reported in >> Vol 17(4) of Psychological Methods (see record 2012-33502-001). The R >> code for arriving at adjusted p values for one of the methods is >> incorrect. The specific changes that need to be made are provided in >> the erratum.] >> """ >> >> It's still functioning peer review if a mistake is found after an >> article has been published, or after a pull request has landed in >> master. > > The problem is that the peers don't get to review what has been done, > in general, they get to review what the author said had been done. > > Donoho's point - about computational science - is that this can be > very different. > > The question is then : does this matter? Are - most published > research findings false?
following the link from the PLOS editorial statement http://www.plosmedicine.org/article/info:doi/10.1371/journal.pmed.0020124 I think the entire premise "are research findings false" is completely misguided. It just continues the magic 0.05 tradition. (However I think it makes a good polemic to illustrate a point.) Disclaimer: I never read the applied part of any paper outside of economics, and I can only imagine from second-hand readings that some articles really only report a p-value or if their result is statistically significant or not. I have been reading now for several months articles criticizing research tradition and editorial recommendations to improve statistical reporting in various fields, starting with psychological methods and behavioral research. The general recommendation is to report effect sizes and confidence intervals instead of, or in addition to, p-values. So we can actually see what the size of this statistical (non-)significant effect is, and learn from it. Maybe the interval is not completely "false". And there are other problems in some fields with the majority of the research: the studies are underpowered, they ignore multiple testing problems, ... (according to some editorials and reports) Where open access to research methodology comes in is in undermining the reputation of researchers that systematically bias (Ioannidis) their results. In economics this debate happened a few years ago after some famous failures to (independently) replicate results, and now most, I think all, top economics journals require that the data/source is published. > >> --------- >> in general: >> >> in the research areas that I know, the vast majority of researchers >> use Windows, and everything that is not a core task is point and click. >> As long as Matlab, Stata and GAUSS, or whatever else, doesn't have >> version control built in, VC won't be used by the majority of >> researchers that I know. We didn't grow up when version control was >> popular. And we don't have IT guys to manage it for us. >> (There is the old fashioned version control of starting new >> directories at crucial stages, or for specific conference talks and >> paper submissions.) >> (DVCS are only a few years old, and it will take a few more years for >> diffusion to "non-programmers" to happen.) > > We get taught some complicated things when we are training - calculus, > algebra... > > Does it make sense that we don't teach less complicated things like > version control and programming? (I don't know about teaching computer programming in American undergraduate programs, I'm a resident alien.) Programming within economics is not directly part of the curriculum. Students (undergraduates once they are beyond Excel!) learn programming in statistics, in my PhD program it was applied statistics/econometrics and computational economics (simulating macroeconomy) where we learned to program, and got paid for it as research assistants (with no requirement for unit tests nor version control.) My impression is that for "non-programmers", the behavioral pattern for using the tools is acquired in the applied fields that use computer programming. Once unit/functional testing and version control are used there by professors and teaching assistants and required as part of the best practice for doing your work, then it will stick. Otherwise it's like calculus. Some need it most of their life, the other ones forget about it as soon as the exams are over.
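(To make the "unit/functional testing" above concrete, here is a minimal sketch, not from the original thread, of the kind of check a student could be required to write. The Holm adjustment stands in for the adjusted-p-value code in the erratum quoted earlier; the function, the test and the numbers are hypothetical, and numpy is the only dependency.)

    import numpy as np

    def holm_adjust(pvals):
        # Holm step-down adjusted p-values (hypothetical helper)
        p = np.asarray(pvals, dtype=float)
        m = len(p)
        order = np.argsort(p)
        adjusted = np.empty(m)
        running_max = 0.0
        for rank, idx in enumerate(order):
            # scale by the number of hypotheses still in play,
            # then keep the adjusted values monotone
            running_max = max(running_max, (m - rank) * p[idx])
            adjusted[idx] = min(1.0, running_max)
        return adjusted

    def test_holm_adjust():
        # worked by hand: sorted p = 0.01, 0.03, 0.04 with m = 3
        # gives 3*0.01 = 0.03, 2*0.03 = 0.06, 1*0.04 = 0.04 -> 0.03, 0.06, 0.06
        np.testing.assert_allclose(holm_adjust([0.01, 0.04, 0.03]),
                                   [0.03, 0.06, 0.06])

    test_holm_adjust()

(A ten-line test like this is cheap to write next to the analysis script, and it is the kind of check that could have caught the error described in that correction notice before publication.)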
(But you cannot learn calculus and statistics by doing, and there is only a limited amount of time students have. More statistics please.) ----- Two more points: Version control systems are not available for word processing, which rules out version control for large parts of the actual work. network and peer effects: One reason I think that version control will be the standard in a few more years (if usability gets better) is that you just need one or a few "programming types" in a group to spread it like an infection. You need those guys both as advertising, to see how to do things in a better way, and as support when a new user gets lost. I only found git acceptable because I knew that I have the rescue and support team on the mailing lists. (Thanks for that.) Josef > >> Even after using git for some time, I only find it usable because I can do >> all the regular stuff with git gui (and for unusual stuff I can use >> the command line and git gui at the same time). >> >> >> --------- >> (just in case I'm misunderstood: >> I'm all in favor of best practices and unit and functional tests, but >> I don't expect that researchers will adopt them (fast) if they go >> against their usual pattern of using tools. >> example: If you teach a software carpentry course that uses Linux, >> then I wouldn't be surprised if some users go back to their office and >> the first thing they do is use Excel. :) > > In general as you know I agree completely that it doesn't make sense > to persuade people to switch from Windows to Linux at the same time as > persuading them to use good software tools. We should teach people > stuff that they will and can use, and it's a common theme among > software-carpentry types that it would be better to teach Windows > people how to best use Windows rather than teaching them on a virtual > machine that they are unlikely to use for their work. > > Cheers, > > Matthew > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user From matthew.brett at gmail.com Sun Jun 2 01:47:17 2013 From: matthew.brett at gmail.com (Matthew Brett) Date: Sat, 1 Jun 2013 22:47:17 -0700 Subject: [SciPy-User] peer review of scientific software In-Reply-To: References: <51A39B6D.4030607@gmail.com> <1369793909.71134.YahooMailNeo@web121502.mail.ne1.yahoo.com> Message-ID: Hi, On Sat, Jun 1, 2013 at 8:29 PM, wrote: > following the link from the PLOS editorial statement > http://www.plosmedicine.org/article/info:doi/10.1371/journal.pmed.0020124 > > I think the entire premise "are research findings false" is completely > misguided. It just continues the magic 0.05 tradition. I don't think it is as simple as that. For example, one of the studies I cited before was only able to replicate 6 / 53 'landmark' studies in hematological oncology. http://www.nature.com/nature/journal/v483/n7391/full/483531a.html "Clearly there are fundamental problems in both academia and industry in the way such research is conducted and reported. Addressing these systemic issues will require tremendous commitment and a desire to change the prevalent culture. Perhaps the most crucial element for change is to acknowledge that the bar for reproducibility in performing and presenting preclinical studies must be raised." I've been canvassing my colleagues over the last year or so about what replication rate they would guess in brain imaging, and the answers are rather variable, but have a mean around 30 percent.
These estimates are from people running brain imaging centers or very experienced in the field. If these estimates are correct, the waste is enormous, overwhelming. > Otherwise it's like calculus. Some need it most of their life, the > other ones forget about it as soon as the exams are over. > (But you cannot learn calculus and statistics by doing, and there is > only a limited amount of time students have. More statistics please.) The person who is trying to do work in Excel that should be done in a programming language needed that training. They will be doing slower work and making more errors for the lack of a small amount of training. For sure the tech-smart guy or gal in the lab makes a big difference, but not every lab has such a person, and it's common (believe me) for researchers who don't know this stuff to assume it's only for nerds and that it only slows down getting real work done. That's largely a function of lack of training in how easy it is to make mistakes, and therefore the necessity of using tools to reduce mistakes and improve transparency. I've also noticed that when people are not comfortable with their tools, they often fail to notice obvious statistical issues that they would normally expect to spot at once. Here's an obvious example from brain imaging: http://www.edvul.com/voodoocorr.php So, if you teach people statistics and you don't teach them how and when to program, and they have to do anything other than point and click in SPSS, you'll often get bad statistics none the less. Cheers, Matthew From takowl at gmail.com Sun Jun 2 07:00:25 2013 From: takowl at gmail.com (Thomas Kluyver) Date: Sun, 2 Jun 2013 12:00:25 +0100 Subject: [SciPy-User] peer review of scientific software In-Reply-To: References: <51A39B6D.4030607@gmail.com> <1369793909.71134.YahooMailNeo@web121502.mail.ne1.yahoo.com> Message-ID: On 2 June 2013 06:47, Matthew Brett wrote: > The person who is trying to do work in Excel that should be done in a > programming language needed that training. They will be doing slower > work and making more errors for the lack of a small amount of training. > I agree with the argument, but let's not understate the amount of learning involved. Here, all new PhD students are given a seven-day intensive R course, by a lecturer who's good enough at teaching R that he makes money from running the course elsewhere. That covers the basics, but it certainly doesn't mean that they can do anything in R that they would otherwise do in Excel. And it doesn't even touch on version control or writing tests. I found one of my labmates editing the copy of a modelling script that she'd named 'foobar_DONOTEDIT', but I still couldn't persuade her to use version control. I think there's a fascinating question as to why people find Excel so much easier than a 'real' programming language, even if they create really complex spreadsheets. I think it's a combination of: - Familiarity: people are taught spreadsheets, and often Excel specifically, at school, whereas 'programming' is seen as a kind of geek sorcery. - Mingling code and data: I think it's conceptually harder to have your data in one place and your analysis in another, even though that's ultimately good practice - Seeing what you're doing: In Excel, you calculate something by putting a formula in a cell. You press enter, and there's the result. In code, you store it in a variable, and you have to explicitly ask for it to be displayed.
If you're calculating 1000 variables in a loop, then it's not obvious from the display which one corresponds to which input. Can we mix some of that comfort with the robustness we're used to in conventional code? E.g. I can imagine a different kind of spreadsheet tool, where instead of putting formulae in cells, you define new columns and tables, and where you can save the steps you've done to apply to another data file in the same format. Perhaps it could even naturally progress to real code so that it acts as a kind of gateway drug for programming. Thomas -------------- next part -------------- An HTML attachment was scrubbed... URL: From josef.pktd at gmail.com Sun Jun 2 07:49:47 2013 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Sun, 2 Jun 2013 07:49:47 -0400 Subject: [SciPy-User] peer review of scientific software In-Reply-To: References: <51A39B6D.4030607@gmail.com> <1369793909.71134.YahooMailNeo@web121502.mail.ne1.yahoo.com> Message-ID: On Sun, Jun 2, 2013 at 7:00 AM, Thomas Kluyver wrote: > On 2 June 2013 06:47, Matthew Brett wrote: >> >> The person who is trying to do work in Excel, that should be done in a >> programming language, needed that training. They will be doing slower >> work. and make more errors for the lack of a small amount of training. > > > I agree with the argument, but let's not understate the amount of learning > involved. Here, all new PhD students are given a seven day intensive R > course, by a lecturer who's good enough at teaching R that he makes money > from running the course elsewhere. That covers the basics, but it certainly > doesn't mean that they can do anything in R that they would otherwise do in > Excel. And it doesn't even touch on version control or writing tests. I > found one of my labmates editing the copy of a modelling script that she'd > named 'foobar_DONOTEDIT', but I still couldn't persuade her to use version > control. > > I think there's a fascinating question as to why people find Excel so much > easier than a 'real' programming language, even if they create really > complex spreadsheets. I think it's a combination of: > > - Familiarity: people are taught spreadsheets, and often Excel specifically, > at school, whereas 'programming' is seen as a kind of geek sorcery. > - Mingling code and data: I think it's conceptually harder to have your data > in one place and your analysis in another, even though that's ultimately > good practice > - Seeing what you're doing: In Excel, you calculate something by putting a > formula in a cell. You press enter, and there's the result. In code, you > store it in a variable, and you have to explicitly ask for it to be > displayed. If you're calculating 1000 variables in a loop, then it's not > obvious from the display which one corresponds to which input. The last point is where I still use Excel or OpenOffice calc. Visual inspection of a larger amount of heterogeneous data. for another area where excel use is still very heavy http://robertkugel.ventanaresearch.com/2013/01/29/the-spreadsheet-and-the-whale/ via http://blog.enthought.com/?p=113067 (the advantages and perils of using Excel when you bet a few million dollars on the outcome.) > > Can we mix some of that comfort with the robustness we're used to in > conventional code? E.g. I can imagine a different kind of spreadsheet tool, > where instead of putting formulae in cells, you define new columns and > tables, and where you can save the steps you've done to apply to another > data file in the same format. 
Perhaps it could even naturally progress to > real code so that it acts as a kind of gateway drug for programming. Stata has a very good combination, point and click for the commands that do the statistics or data handling, then the commands are printed to the console. The results can be seen in the console or the dataframe viewer. (in matlab the plot wizard works similarly, point and click and save to script) It's easy to build up a collection of reproducible, reusable scripts this way. This was great for me as a beginner or when I use parts that I don't know or remember (or the syntax and options for it). In contrast, a new plot in matplotlib is a few hours of reading documentation and googling for examples. Josef > > Thomas > > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > From otrov at hush.ai Sun Jun 2 08:21:04 2013 From: otrov at hush.ai (zetah) Date: Sun, 02 Jun 2013 14:21:04 +0200 Subject: [SciPy-User] peer review of scientific software In-Reply-To: References: <51A39B6D.4030607@gmail.com> <1369793909.71134.YahooMailNeo@web121502.mail.ne1.yahoo.com> Message-ID: <20130602122104.E09CEA6E40@smtp.hushmail.com> Matthew Brett wrote: >The person who is trying to do work in Excel that should be done in a >programming language needed that training. They will be doing slower >work and making more errors for the lack of a small amount of training. Thomas Kluyver wrote: >I think there's a fascinating question as to why people find Excel so much >easier than a 'real' programming language, even if they create really >complex spreadsheets. I find you use the term "Excel" too vaguely. One way to look at Excel is as a visual interface to your data, that you can slice and dice and apply the most common tools to in the least amount of time, even if you are an average user. But Excel data is also available to you in object oriented VBA programming and then also VSTO (.NET Framework) if VBA is too coarse for your sensitive project. So Excel (and a good part of Office) exposes its interface and data to both VBA (builtin programming interface) and Visual Studio. It's as much real programming as you are up to. As for Python applicability in scientific software, I find it most useful in environments similar to Matlab/R/IDL. I guess that's the SciPy paradigm after all. I feel that Python can offer new and original possibilities, or adapt to new trends like Mathematica and IPython Notebook, but Excel is just a different league. From takowl at gmail.com Sun Jun 2 08:48:37 2013 From: takowl at gmail.com (Thomas Kluyver) Date: Sun, 2 Jun 2013 13:48:37 +0100 Subject: [SciPy-User] peer review of scientific software In-Reply-To: References: <51A39B6D.4030607@gmail.com> <1369793909.71134.YahooMailNeo@web121502.mail.ne1.yahoo.com> Message-ID: On 2 June 2013 12:49, wrote: > This was great for me as a beginner or when I use parts that I don't > know or remember (or the syntax and options for it). > In contrast, a new plot in matplotlib is a few hours of reading > documentation and googling for examples. > Yes, I've thought this about plotting as well. If I want to, say, rotate the labels on the x axis by 45 degrees, I'd much rather just click on them and edit a number in a text box, rather than googling what function and parameter will make it look like I want. In the Python world, Veusz (http://home.gna.org/veusz/ ) has some of this capability, and it can save a Python script to recreate a plot that you've produced interactively.
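(For reference, the matplotlib incantation Thomas is describing is only a couple of lines once you know where to look; a minimal sketch with made-up data and labels, using only long-stable matplotlib calls:)

    import matplotlib.pyplot as plt

    fig, ax = plt.subplots()
    ax.bar(range(4), [3, 7, 2, 5])
    ax.set_xticks(range(4))
    # the part you would otherwise have to google:
    # rotate the x tick labels by 45 degrees
    ax.set_xticklabels(['alpha', 'beta', 'gamma', 'delta'],
                       rotation=45, ha='right')
    fig.tight_layout()
    plt.show()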
But I've not really used Veusz in earnest, because it would be awkward to integrate with my usual tools. zetah: > But Excel data is also available to you in object oriented VBA programming and then also VSTO (.NET Framework) if VBA is too coarse for your sensitive project. So Excel (and a good part of Office) exposes its interface and data to both VBA (builtin programming interface) and Visual Studio. It's as much real programming as you are up to. You're technically correct... the best kind of correct. ;-) I wrote VBA macros once years ago. But the group of users we're discussing don't use those features. It's not a natural extension of making spreadsheets, but a completely different set of skills to learn. And I don't think we want to encourage them down that route - data analysis in VBA or even .NET would be much more off-putting than using Python/R/Matlab. Thomas -------------- next part -------------- An HTML attachment was scrubbed... URL: From otrov at hush.ai Sun Jun 2 09:29:27 2013 From: otrov at hush.ai (zetah) Date: Sun, 02 Jun 2013 15:29:27 +0200 Subject: [SciPy-User] peer review of scientific software In-Reply-To: References: <51A39B6D.4030607@gmail.com> <1369793909.71134.YahooMailNeo@web121502.mail.ne1.yahoo.com> Message-ID: <20130602132927.B6CC4A6E40@smtp.hushmail.com> Thomas Kluyver wrote: >You're technically correct... the best kind of correct. ;-) I wrote VBA >macros once years ago. But the group of users we're discussing >don't use those features. It's not a natural extension of making >spreadsheets, but a completely different set of skills to learn. And I don't think we >want to encourage them down that route - data analysis in VBA or even .NET >would be much more off-putting than using Python/R/Matlab. Oops... did I jump into a semi-private discussion? Apologies, I wasn't aware, I thought you guys were discussing scientific software generally, and I had just read the last couple of emails in my inbox. Cheers From takowl at gmail.com Sun Jun 2 10:33:12 2013 From: takowl at gmail.com (Thomas Kluyver) Date: Sun, 2 Jun 2013 15:33:12 +0100 Subject: [SciPy-User] peer review of scientific software In-Reply-To: <20130602132927.B6CC4A6E40@smtp.hushmail.com> References: <51A39B6D.4030607@gmail.com> <1369793909.71134.YahooMailNeo@web121502.mail.ne1.yahoo.com> <20130602132927.B6CC4A6E40@smtp.hushmail.com> Message-ID: On 2 June 2013 14:29, zetah wrote: > Oops... did I jump into a semi-private discussion? No, sorry if I gave that impression. This is all intended to be public, as far as I'm aware. Best wishes, Thomas -------------- next part -------------- An HTML attachment was scrubbed... URL:
"Excel > spreadsheet" Cheers From takowl at gmail.com Sun Jun 2 11:51:02 2013 From: takowl at gmail.com (Thomas Kluyver) Date: Sun, 2 Jun 2013 16:51:02 +0100 Subject: [SciPy-User] peer review of scientific software In-Reply-To: <20130602145958.DB291A6E38@smtp.hushmail.com> References: <51A39B6D.4030607@gmail.com> <1369793909.71134.YahooMailNeo@web121502.mail.ne1.yahoo.com> <20130602132927.B6CC4A6E40@smtp.hushmail.com> <20130602145958.DB291A6E38@smtp.hushmail.com> Message-ID: On 2 June 2013 15:59, zetah wrote: > You were mentioning a group of users, and encouraging some routes, so I > thought your initial discussion was concerning some group known to you. > 'type of users' might have been a more accurate phrase, but it has an unfortunate negative ring that I wanted to avoid. There are a lot of people doing important data analysis in quite risky and hard-to-maintain ways. Using spreadsheets where some simple code might be more reliable is one symptom of that, and there have been a couple of major examples from economics where spreadsheet errors led to serious mistakes. The discussion is revolving roughly around whether and how we can push those users towards better tools and methods, like coding, version control and testing. Thomas -------------- next part -------------- An HTML attachment was scrubbed... URL: From otrov at hush.ai Sun Jun 2 14:00:54 2013 From: otrov at hush.ai (zetah) Date: Sun, 02 Jun 2013 20:00:54 +0200 Subject: [SciPy-User] peer review of scientific software In-Reply-To: References: <51A39B6D.4030607@gmail.com> <1369793909.71134.YahooMailNeo@web121502.mail.ne1.yahoo.com> <20130602132927.B6CC4A6E40@smtp.hushmail.com> <20130602145958.DB291A6E38@smtp.hushmail.com> Message-ID: <20130602180055.5CC67A6E40@smtp.hushmail.com> Thomas Kluyver wrote: >'type of users' might have been a more accurate phrase, but it has an >unfortunate negative ring that I wanted to avoid. There are a lot of people >doing important data analysis in quite risky and hard-to-maintain ways. >Using spreadsheets where some simple code might be more reliable is one >symptom of that, and there have been a couple of major examples from >economics where spreadsheet errors led to serious mistakes. >The discussion is revolving roughly around whether and how we can push >those users towards better tools and methods, like coding, version control >and testing. Thanks for overview Thomas, I read all emails on the subject and will comment briefly, for the sake of my participation, although topic is huge I don't have experience with critical modeling, but I do and learn data analysis with historical data and generally. If we speak about errors, I think that most of it, like taught in Numerical analysis course, are due to human factor not understanding data types and also variety of data sources representing data differently. Trivial example that sql and netcdf databases represent same data in different format. Similarly for other data sources which in turn can be just plain text dumps. If that is handled correctly and user is familiar with the tool used, there shouldn't be any surprises. If it is of any interest, I thought to generalize my usual workflow, as single user example (hope it's not useless): - collecting data: if not directly available I use Python, and depending on source do validation. I don't change format if it's not necessary. - pre-processing: if I preprocess (usually with Python), I store data to sql server. 
- using data: single set or multiple datasets in PowerPivot (limited just by amount of RAM), where DAX allows calculations on pivoted view values. I haven't yet found any other tool that allows such diverse views in such a short time. - post-processing: when needed I export results to CSV. Usually to just load in numpy array and plot with Matplotlib, or 3D viewing in VisIt or Gephi. - versioning: data in source database(s) stays intact, and all calculations can be saved to a file (with values), and then opened again even if datasource is not available. So I use Excel mainly for data manipulation and Python back and forth. Also I use additional tools for 3D visualization. I never liked to learn about versioning systems, and I'm happy with my current scheme From charlesr.harris at gmail.com Sun Jun 2 14:38:09 2013 From: charlesr.harris at gmail.com (Charles R Harris) Date: Sun, 2 Jun 2013 12:38:09 -0600 Subject: [SciPy-User] peer review of scientific software In-Reply-To: <20130602180055.5CC67A6E40@smtp.hushmail.com> References: <51A39B6D.4030607@gmail.com> <1369793909.71134.YahooMailNeo@web121502.mail.ne1.yahoo.com> <20130602132927.B6CC4A6E40@smtp.hushmail.com> <20130602145958.DB291A6E38@smtp.hushmail.com> <20130602180055.5CC67A6E40@smtp.hushmail.com> Message-ID: On Sun, Jun 2, 2013 at 12:00 PM, zetah wrote: > Thomas Kluyver wrote: > >'type of users' might have been a more accurate phrase, but it has an > >unfortunate negative ring that I wanted to avoid. There are a lot of > people > >doing important data analysis in quite risky and hard-to-maintain ways. > >Using spreadsheets where some simple code might be more reliable is one > >symptom of that, and there have been a couple of major examples from > >economics where spreadsheet errors led to serious mistakes. > >The discussion is revolving roughly around whether and how we can push > >those users towards better tools and methods, like coding, version control > >and testing. > > Thanks for the overview Thomas, I read all emails on the subject and will > comment briefly, for the sake of my participation, although the topic is huge. > > I don't have experience with critical modeling, but I do, and am still > learning, data analysis with historical data and in general. > > If we speak about errors, I think that most of them, as taught in a > Numerical analysis course, are due to the human factor: not understanding data > types, and also the variety of data sources representing data differently. > A trivial example: sql and netcdf databases represent the same data in > different formats. Similarly for other data sources, which in turn can be > just plain text dumps. If that is handled correctly and the user is familiar > with the tool used, there shouldn't be any surprises. > At least when no one checks ;) The errors that the gods of analysis gift to us are often hidden away and are easy to overlook. They also tend to creep in when one is overconfident. It's all part of the divine sense of humor. > > If it is of any interest, I thought to generalize my usual workflow, as a > single user example (hope it's not useless): > - collecting data: if not directly available I use Python, and depending > on source do validation. I don't change format if it's not necessary. > - pre-processing: if I preprocess (usually with Python), I store data to > sql server. > - using data: single set or multiple datasets in PowerPivot (limited just > by amount of RAM), where DAX allows calculations on pivoted view values. I > haven't yet found any other tool that allows such diverse views in such a > short time.
> - post-processing: when needed I export results to CSV. Usually to just > load in numpy array and plot with Matplotlib, or 3D viewing in VisIt or > Gephi. > - versioning: data in source database(s) stays intact, and all > calculations can be saved to a file (with values), and then opened again > even if datasource is not available. > > So I use Excel mainly for data manipulation and Python back and forth. > Also I use additional tools for 3D visualization. > I never liked to learn about versioning systems, and I'm happy with my > current scheme > I confess to my shame that I have never learned to use a spreadsheet for any but the simplest things. It's just so darn complicated ;) Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From matthew.brett at gmail.com Sun Jun 2 15:51:00 2013 From: matthew.brett at gmail.com (Matthew Brett) Date: Sun, 2 Jun 2013 12:51:00 -0700 Subject: [SciPy-User] peer review of scientific software In-Reply-To: References: <51A39B6D.4030607@gmail.com> <1369793909.71134.YahooMailNeo@web121502.mail.ne1.yahoo.com> <20130602132927.B6CC4A6E40@smtp.hushmail.com> <20130602145958.DB291A6E38@smtp.hushmail.com> <20130602180055.5CC67A6E40@smtp.hushmail.com> Message-ID: Hi, On Sun, Jun 2, 2013 at 11:38 AM, Charles R Harris wrote: > > > On Sun, Jun 2, 2013 at 12:00 PM, zetah wrote: >> >> Thomas Kluyver wrote: >> >'type of users' might have been a more accurate phrase, but it has an >> >unfortunate negative ring that I wanted to avoid. There are a lot of >> > people >> >doing important data analysis in quite risky and hard-to-maintain ways. >> >Using spreadsheets where some simple code might be more reliable is one >> >symptom of that, and there have been a couple of major examples from >> >economics where spreadsheet errors led to serious mistakes. >> >The discussion is revolving roughly around whether and how we can push >> >those users towards better tools and methods, like coding, version >> > control >> >and testing. >> >> Thanks for overview Thomas, I read all emails on the subject and will >> comment briefly, for the sake of my participation, although topic is huge >> >> I don't have experience with critical modeling, but I do and learn data >> analysis with historical data and generally. >> >> If we speak about errors, I think that most of it, like taught in >> Numerical analysis course, are due to human factor not understanding data >> types and also variety of data sources representing data differently. >> Trivial example that sql and netcdf databases represent same data in >> different format. Similarly for other data sources which in turn can be just >> plain text dumps. If that is handled correctly and user is familiar with the >> tool used, there shouldn't be any surprises. > > > At least when no one checks ;) The errors that the gods of analysis gift to > us are often hidden away and are easy to overlook. They also tend to creep > in when one is overconfident. It's all part of the devine sense of humor. Yes - when no-one checks! I wish I still shared the feeling that mostly when I do stuff it's correct, or mostly correct, or correct enough. It was only when I started checking that I started to worry. I well remember the happier times I'd write a 100 line analysis script with no tests and be "pretty sure" that it was correct. 
Cheers, Matthew From otrov at hush.ai Sun Jun 2 16:06:01 2013 From: otrov at hush.ai (zetah) Date: Sun, 02 Jun 2013 22:06:01 +0200 Subject: [SciPy-User] peer review of scientific software In-Reply-To: References: <51A39B6D.4030607@gmail.com> <1369793909.71134.YahooMailNeo@web121502.mail.ne1.yahoo.com> <20130602132927.B6CC4A6E40@smtp.hushmail.com> <20130602145958.DB291A6E38@smtp.hushmail.com> <20130602180055.5CC67A6E40@smtp.hushmail.com> Message-ID: <20130602200602.49733A6E42@smtp.hushmail.com> Charles R Harris wrote: >> If we speak about errors, I think that most of them, as taught in a >> Numerical analysis course, are due to the human factor: not understanding data >> types, and also the variety of data sources representing data differently. >> A trivial example: sql and netcdf databases represent the same data in >> different formats. Similarly for other data sources, which in turn can be >> just plain text dumps. If that is handled correctly and the user is familiar >> with the tool used, there shouldn't be any surprises. >> > >At least when no one checks ;) The errors that the gods of analysis gift to >us are often hidden away and are easy to overlook. They also tend to creep >in when one is overconfident. It's all part of the divine sense of humor. Probably true. I know this comes from experience that I have not enough of. >I confess to my shame that I have never learned to use a spreadsheet for >any but the simplest things. It's just so darn complicated ;) That's fine, maybe it's just a legacy habit no one wants to break, or a preference toward a familiar data manipulation environment. For myself, even with all that numpy broadcasting magic, I'd spend much more time slicing data in Python than doing it as I currently prefer, as there are more abstractions I'd have to use for the same outcome. Viewing the values at the same time while calculating feels more natural to me and provides instant "validation", so to say. But if I want real validation I can make a validation scenario. Earlier my only annoyance with pivoted data was that I couldn't do more than trivial calculations on values in a pivoted view, unless using a programmatic approach. Now that's possible (with DAX), and I can't imagine what else could make data manipulation more intuitive to me. There are many aspects on this subject, and please do continue if I stepped in too carelessly :) Cheers From trive at astro.su.se Sun Jun 2 18:31:34 2013 From: trive at astro.su.se (=?ISO-8859-1?Q?Th=F8ger_Emil_Rivera-Thorsen?=) Date: Mon, 03 Jun 2013 00:31:34 +0200 Subject: [SciPy-User] peer review of scientific software In-Reply-To: <20130602200602.49733A6E42@smtp.hushmail.com> References: <51A39B6D.4030607@gmail.com> <20130602132927.B6CC4A6E40@smtp.hushmail.com> <20130602145958.DB291A6E38@smtp.hushmail.com> <20130602180055.5CC67A6E40@smtp.hushmail.com> <20130602200602.49733A6E42@smtp.hushmail.com> Message-ID: <51ABC7C6.3080506@astro.su.se> On 02-06-2013 22:06, zetah wrote: > Charles R Harris wrote: >>> If we speak about errors, I think that most of them, as taught in a >>> Numerical analysis course, are due to the human factor: not understanding data >>> types, and also the variety of data sources representing data differently.
>>> >> At least when no one checks ;) The errors that the gods of analysis gift to >> us are often hidden away and are easy to overlook. They also tend to creep >> in when one is overconfident. It's all part of the divine sense of humor. > Probably true. I know this comes from experience that I have not enough of. > > >> I confess to my shame that I have never learned to use a spreadsheet for >> any but the simplest things. It's just so darn complicated ;) > That's fine, maybe it's just a legacy habit no one wants to break, or a preference toward a familiar data manipulation environment. > > For myself, even with all that numpy broadcasting magic, I'd spend much more time slicing data in Python than doing it as I currently prefer, as there are more abstractions I'd have to use for the same outcome. Viewing the values at the same time while calculating feels more natural to me and provides instant "validation", so to say. But if I want real validation I can make a validation scenario. > > Earlier my only annoyance with pivoted data was that I couldn't do more than trivial calculations on values in a pivoted view, unless using a programmatic approach. Now that's possible (with DAX), and I can't imagine what else could make data manipulation more intuitive to me. > > There are many aspects on this subject, and please do continue if I stepped in too carelessly :) You may of course be perfectly happy with your current work setup, but it seems to me like you could do everything you describe without leaving Python, by using Pandas. Pivot tables, slicing and dicing of heterogeneous data types, indexing by multi-layer labels, arbitrary operations on pivoted, sliced and diced data frames, importing/exporting csv, ascii, html and even LaTeX, quick plotting for data inspection purposes etc. Of course, the interactive element isn't there. On the other hand, it is very powerful, and you don't have to switch between several different environments and tools. The frames are basically enhanced numpy arrays, so the data can be passed directly to numpy or matplotlib. Also, if working in the IPython qtconsole or notebook, simply typing the dataframe's name will show it nicely rendered as an html table. I have definitely enjoyed working with it. Sorry for going slightly off-topic. /Emil > > Cheers > > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user From otrov at hush.ai Sun Jun 2 20:59:26 2013 From: otrov at hush.ai (zetah) Date: Mon, 03 Jun 2013 02:59:26 +0200 Subject: [SciPy-User] peer review of scientific software In-Reply-To: <51ABC7C6.3080506@astro.su.se> References: <51A39B6D.4030607@gmail.com> <20130602132927.B6CC4A6E40@smtp.hushmail.com> <20130602145958.DB291A6E38@smtp.hushmail.com> <20130602180055.5CC67A6E40@smtp.hushmail.com> <20130602200602.49733A6E42@smtp.hushmail.com> <51ABC7C6.3080506@astro.su.se> Message-ID: <20130603005926.A32CEA6E38@smtp.hushmail.com> Thøger Emil Rivera-Thorsen wrote: >You may of course be perfectly happy with your current work setup, but >it seems to me like you could do everything you describe without leaving >Python, by using Pandas. Pivot tables, slicing and dicing of >heterogeneous data types, indexing by multi-layer labels, arbitrary >operations on pivoted, sliced and diced data frames, importing/exporting >csv, ascii, html and even LaTeX, quick plotting for data inspection >purposes etc. I tried it briefly a couple of months ago or so, and it seemed like impressive work in progress.
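(As an illustration of the pivoting Emil mentions: a minimal sketch with made-up data; groupby plus unstack produces a small pivot table in a few lines, and the same steps can be re-run on another file in the same format:)

    import pandas as pd

    df = pd.DataFrame({'region':  ['north', 'north', 'south', 'south'],
                       'quarter': ['Q1', 'Q2', 'Q1', 'Q2'],
                       'sales':   [10.0, 12.0, 9.0, 14.0]})

    # mean sales in a region x quarter table -- a basic pivot
    table = df.groupby(['region', 'quarter'])['sales'].mean().unstack('quarter')
    print(table)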
I remember time series handling was very intuitive, but I didn't study the module for some reason... and I didn't know about pivoting dataframes. >Of course, the interactive element isn't there. On the >other hand, it is very powerful, and you don't have to switch between >several different environments and tools. You are right there too: a reliable interactive interface requires less mental effort, and surely using the same environment has its benefits. UI as an extension is important; not many get impressed by a command line. I hope to see IPython Notebook drive new ideas and attract more developers, more than I expect Excel on Surface to implement a new interface exclusively based on touch and filters. >The frames are basically enhanced numpy arrays, so the data can be >passed directly to numpy or matplotlib. Also, if working in the IPython >qtconsole or notebook, simply typing the dataframe's name will show it >nicely rendered as an html table. >I have definitely enjoyed working with it. Sounds like fun. I'll experiment. Thanks From nadavh at visionsense.com Mon Jun 3 01:26:22 2013 From: nadavh at visionsense.com (Nadav Horesh) Date: Mon, 3 Jun 2013 05:26:22 +0000 Subject: [SciPy-User] SciPy-User Digest, Vol 118, Issue 4 In-Reply-To: References: Message-ID: <520e0eec44af4501b4100c172b6ff08c@BN1PR08MB076.namprd08.prod.outlook.com> For me an important use case is a file transfer over the VPN. Is there any way to test it? Nadav.
Thanks for the overview, Thomas. I read all the emails on the subject and will comment briefly, for the sake of my participation, although the topic is huge.

I don't have experience with critical modeling, but I do data analysis with historical data, and am still learning.

If we speak about errors, I think that most of them, as taught in a Numerical Analysis course, are due to the human factor: not understanding data types, and the variety of data sources representing data differently. A trivial example is that SQL and netCDF databases represent the same data in different formats. Similarly for other data sources, which in turn can be just plain text dumps. If that is handled correctly and the user is familiar with the tool used, there shouldn't be any surprises.

If it is of any interest, I thought to generalize my usual workflow, as a single-user example (hope it's not useless):
- collecting data: if not directly available I use Python, and depending on the source do validation. I don't change the format if it's not necessary.
- pre-processing: if I preprocess (usually with Python), I store data to an SQL server.
- using data: a single set or multiple datasets in PowerPivot (limited just by the amount of RAM), where DAX allows calculations on pivoted view values. I haven't yet found any other tool that allows such diverse views in such a short time.
- post-processing: when needed I export results to CSV, usually to just load in a numpy array and plot with Matplotlib, or for 3D viewing in VisIt or Gephi.
- versioning: data in the source database(s) stays intact, and all calculations can be saved to a file (with values), and then opened again even if the datasource is not available.

So I use Excel mainly for data manipulation, and Python back and forth. Also I use additional tools for 3D visualization. I never liked to learn about versioning systems, and I'm happy with my current scheme

------------------------------

Message: 2
Date: Sun, 2 Jun 2013 12:38:09 -0600
From: Charles R Harris
Subject: Re: [SciPy-User] peer review of scientific software
To: SciPy Users List
Message-ID:
Content-Type: text/plain; charset="iso-8859-1"

On Sun, Jun 2, 2013 at 12:00 PM, zetah wrote:

> Thomas Kluyver wrote:
> >'type of users' might have been a more accurate phrase, but it has an
> >unfortunate negative ring that I wanted to avoid. There are a lot of people
> >doing important data analysis in quite risky and hard-to-maintain ways.
> >Using spreadsheets where some simple code might be more reliable is one
> >symptom of that, and there have been a couple of major examples from
> >economics where spreadsheet errors led to serious mistakes.
> >The discussion is revolving roughly around whether and how we can push
> >those users towards better tools and methods, like coding, version control
> >and testing.
>
> Thanks for the overview, Thomas. I read all the emails on the subject and will comment briefly, for the sake of my participation, although the topic is huge.
>
> I don't have experience with critical modeling, but I do data analysis with historical data, and am still learning.
>
> If we speak about errors, I think that most of them, as taught in a Numerical Analysis course, are due to the human factor: not understanding data types, and the variety of data sources representing data differently. A trivial example is that SQL and netCDF databases represent the same data in different formats. Similarly for other data sources, which in turn can be just plain text dumps. If that is handled correctly and the user is familiar with the tool used, there shouldn't be any surprises.
At least when no one checks ;) The errors that the gods of analysis gift to us are often hidden away and are easy to overlook. They also tend to creep in when one is overconfident. It's all part of the divine sense of humor.

> If it is of any interest, I thought to generalize my usual workflow, as a single-user example (hope it's not useless):
> - collecting data: if not directly available I use Python, and depending on the source do validation. I don't change the format if it's not necessary.
> - pre-processing: if I preprocess (usually with Python), I store data to an SQL server.
> - using data: a single set or multiple datasets in PowerPivot (limited just by the amount of RAM), where DAX allows calculations on pivoted view values. I haven't yet found any other tool that allows such diverse views in such a short time.
> - post-processing: when needed I export results to CSV, usually to just load in a numpy array and plot with Matplotlib, or for 3D viewing in VisIt or Gephi.
> - versioning: data in the source database(s) stays intact, and all calculations can be saved to a file (with values), and then opened again even if the datasource is not available.
>
> So I use Excel mainly for data manipulation, and Python back and forth. Also I use additional tools for 3D visualization. I never liked to learn about versioning systems, and I'm happy with my current scheme

I confess to my shame that I have never learned to use a spreadsheet for any but the simplest things. It's just so darn complicated ;)

Chuck

-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.scipy.org/pipermail/scipy-user/attachments/20130602/48da6418/attachment-0001.html

------------------------------

Message: 3
Date: Sun, 2 Jun 2013 12:51:00 -0700
From: Matthew Brett
Subject: Re: [SciPy-User] peer review of scientific software
To: SciPy Users List
Message-ID:
Content-Type: text/plain; charset=ISO-8859-1

Hi,

On Sun, Jun 2, 2013 at 11:38 AM, Charles R Harris wrote:
>
> On Sun, Jun 2, 2013 at 12:00 PM, zetah wrote:
>>
>> Thomas Kluyver wrote:
>> >'type of users' might have been a more accurate phrase, but it has an
>> >unfortunate negative ring that I wanted to avoid. There are a lot of people
>> >doing important data analysis in quite risky and hard-to-maintain ways.
>> >Using spreadsheets where some simple code might be more reliable is one
>> >symptom of that, and there have been a couple of major examples from
>> >economics where spreadsheet errors led to serious mistakes.
>> >The discussion is revolving roughly around whether and how we can push
>> >those users towards better tools and methods, like coding, version control
>> >and testing.
>>
>> Thanks for the overview, Thomas. I read all the emails on the subject and will comment briefly, for the sake of my participation, although the topic is huge.
>>
>> I don't have experience with critical modeling, but I do data analysis with historical data, and am still learning.
>>
>> If we speak about errors, I think that most of them, as taught in a Numerical Analysis course, are due to the human factor: not understanding data types, and the variety of data sources representing data differently. A trivial example is that SQL and netCDF databases represent the same data in different formats. Similarly for other data sources, which in turn can be just plain text dumps. If that is handled correctly and the user is familiar with the tool used, there shouldn't be any surprises.
> At least when no one checks ;) The errors that the gods of analysis gift to us are often hidden away and are easy to overlook. They also tend to creep in when one is overconfident. It's all part of the divine sense of humor.

Yes - when no-one checks! I wish I still shared the feeling that mostly when I do stuff it's correct, or mostly correct, or correct enough. It was only when I started checking that I started to worry. I well remember the happier times I'd write a 100-line analysis script with no tests and be "pretty sure" that it was correct.

Cheers,

Matthew

------------------------------

Message: 4
Date: Sun, 02 Jun 2013 22:06:01 +0200
From: "zetah"
Subject: Re: [SciPy-User] peer review of scientific software
To: "SciPy Users List"
Message-ID: <20130602200602.49733A6E42 at smtp.hushmail.com>
Content-Type: text/plain; charset="UTF-8"

Charles R Harris wrote:
>> If we speak about errors, I think that most of them, as taught in a Numerical Analysis course, are due to the human factor: not understanding data types, and the variety of data sources representing data differently. A trivial example is that SQL and netCDF databases represent the same data in different formats. Similarly for other data sources, which in turn can be just plain text dumps. If that is handled correctly and the user is familiar with the tool used, there shouldn't be any surprises.
>
>At least when no one checks ;) The errors that the gods of analysis gift to
>us are often hidden away and are easy to overlook. They also tend to creep
>in when one is overconfident. It's all part of the divine sense of humor.

Probably true. I know this comes from experience, which I don't have enough of

>I confess to my shame that I have never learned to use a spreadsheet for
>any but the simplest things. It's just so darn complicated ;)

That's fine, maybe it's just a legacy habit no one wants to break, or a preference for a familiar data manipulation environment.

For myself, even with all that numpy broadcasting magic, I'd spend much more time slicing data in Python than doing it as I currently prefer, since I'd have to use more abstractions for the same outcome. Viewing the values at the same time while calculating feels more natural to me and provides instant "validation", so to say. But if I want real validation I can make a validation scenario.

Earlier my only annoyance with pivoted data was that I couldn't do more than trivial calculations on values in a pivoted view, unless using a programmatic approach. Now that's possible (with DAX), and I can't imagine what else could make data manipulation more intuitive to me.

There are many aspects to this subject, and please do continue if I stepped in too carelessly :)

Cheers

------------------------------

Message: 5
Date: Mon, 03 Jun 2013 00:31:34 +0200
From: Thøger Emil Rivera-Thorsen
Subject: Re: [SciPy-User] peer review of scientific software
To: SciPy Users List
Message-ID: <51ABC7C6.3080506 at astro.su.se>
Content-Type: text/plain; charset=ISO-8859-1; format=flowed

On 02-06-2013 22:06, zetah wrote:
> Charles R Harris wrote:
>>> If we speak about errors, I think that most of them, as taught in a Numerical Analysis course, are due to the human factor: not understanding data types, and the variety of data sources representing data differently. A trivial example is that SQL and netCDF databases represent the same data in different formats. Similarly for other data sources, which in turn can be just plain text dumps.
>>> If that is handled correctly and the user is familiar with the tool used, there shouldn't be any surprises.
>>
>> At least when no one checks ;) The errors that the gods of analysis gift to us are often hidden away and are easy to overlook. They also tend to creep in when one is overconfident. It's all part of the divine sense of humor.
>
> Probably true. I know this comes from experience, which I don't have enough of
>
>> I confess to my shame that I have never learned to use a spreadsheet for any but the simplest things. It's just so darn complicated ;)
>
> That's fine, maybe it's just a legacy habit no one wants to break, or a preference for a familiar data manipulation environment.
>
> For myself, even with all that numpy broadcasting magic, I'd spend much more time slicing data in Python than doing it as I currently prefer, since I'd have to use more abstractions for the same outcome. Viewing the values at the same time while calculating feels more natural to me and provides instant "validation", so to say. But if I want real validation I can make a validation scenario.
>
> Earlier my only annoyance with pivoted data was that I couldn't do more than trivial calculations on values in a pivoted view, unless using a programmatic approach. Now that's possible (with DAX), and I can't imagine what else could make data manipulation more intuitive to me.
>
> There are many aspects to this subject, and please do continue if I stepped in too carelessly :)

You may of course be perfectly happy with your current work setup, but it seems to me like you could do everything you describe without leaving Python, by using Pandas. Pivot tables, slicing and dicing of heterogeneous data types, indexing by multi-layer labels, arbitrary operations on pivoted, sliced and diced data frames, importing/exporting csv, ascii, html and even LaTeX, quick plotting for data inspection purposes etc.

Of course, the interactive element isn't there. On the other hand, it is very powerful, and you don't have to switch between several different environments and tools. The frames are basically enhanced numpy arrays, so the data can be passed directly to numpy or matplotlib. Also, if working in the IPython qtconsole or notebook, simply typing the dataframe's name will show it nicely rendered as an html table.

I have definitely enjoyed working with it.

Sorry for going slightly off-topic.
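To make the suggestion concrete before I sign off, here is a minimal sketch (the data and column names are invented, purely for illustration):

import numpy as np
import pandas as pd

# Hypothetical long-format data
df = pd.DataFrame({'station': ['A', 'A', 'B', 'B'] * 3,
                   'month': [1, 2] * 6,
                   'temp': np.random.randn(12) + 15})

# Pivot: stations as rows, months as columns, mean temperature as values
pivoted = df.groupby(['station', 'month'])['temp'].mean().unstack('month')
print(pivoted)

# The result is still backed by a plain numpy array, so it can go
# straight to numpy or matplotlib
arr = pivoted.values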
/Emil

> Cheers
>
> _______________________________________________
> SciPy-User mailing list
> SciPy-User at scipy.org
> http://mail.scipy.org/mailman/listinfo/scipy-user

------------------------------

_______________________________________________
SciPy-User mailing list
SciPy-User at scipy.org
http://mail.scipy.org/mailman/listinfo/scipy-user

End of SciPy-User Digest, Vol 118, Issue 4
******************************************

From trive at astro.su.se Mon Jun 3 08:35:29 2013
From: trive at astro.su.se (Thøger Emil Rivera-Thorsen)
Date: Mon, 03 Jun 2013 14:35:29 +0200
Subject: [SciPy-User] peer review of scientific software
In-Reply-To: <20130603005926.A32CEA6E38@smtp.hushmail.com>
References: <51A39B6D.4030607@gmail.com> <20130602132927.B6CC4A6E40@smtp.hushmail.com> <20130602145958.DB291A6E38@smtp.hushmail.com> <20130602180055.5CC67A6E40@smtp.hushmail.com> <20130602200602.49733A6E42@smtp.hushmail.com> <51ABC7C6.3080506@astro.su.se> <20130603005926.A32CEA6E38@smtp.hushmail.com>
Message-ID: <51AC8D91.1050207@astro.su.se>

Here's the Pandas docs on reshaping and pivoting:

http://pandas.pydata.org/pandas-docs/stable/reshaping.html

Otherwise I recommend watching some of Wes McKinney's (the creator's) presentations on YouTube; some of them are very in-depth and instructive (although some of them are quite lengthy).

Cheers

Emil

On 03-06-2013 02:59, zetah wrote:
> Thøger Emil Rivera-Thorsen wrote:
>> You may of course be perfectly happy with your current work setup, but it seems to me like you could do everything you describe without leaving Python, by using Pandas. Pivot tables, slicing and dicing of heterogeneous data types, indexing by multi-layer labels, arbitrary operations on pivoted, sliced and diced data frames, importing/exporting csv, ascii, html and even LaTeX, quick plotting for data inspection purposes etc.
>
> I tried it briefly a couple of months ago or so, and it seemed like impressive work in progress. I remember time series handling was very intuitive, but I didn't study the module for some reason... and I didn't know about pivoting dataframes
>
>> Of course, the interactive element isn't there. On the other hand, it is very powerful, and you don't have to switch between several different environments and tools.
>
> You are right there too: a reliable interactive interface requires less mental effort, and surely using the same environment has its benefits.
> UI as an extension is important; not many get impressed by a command line. I hope to see IPython Notebook drive new ideas and attract more developers, more than I expect Excel on Surface to implement a new interface exclusively based on touch and filters.
>
>> The frames are basically enhanced numpy arrays, so the data can be passed directly to numpy or matplotlib. Also, if working in the IPython qtconsole or notebook, simply typing the dataframe's name will show it nicely rendered as an html table.
>> I have definitely enjoyed working with it.
>
> Sounds like fun.
> I'll experiment
>
> Thanks
>
> _______________________________________________
> SciPy-User mailing list
> SciPy-User at scipy.org
> http://mail.scipy.org/mailman/listinfo/scipy-user

From jason-sage at creativetrax.com Mon Jun 3 09:20:51 2013
From: jason-sage at creativetrax.com (Jason Grout)
Date: Mon, 03 Jun 2013 09:20:51 -0400
Subject: [SciPy-User] peer review of scientific software
In-Reply-To: References: <51A39B6D.4030607@gmail.com> <1369793909.71134.YahooMailNeo@web121502.mail.ne1.yahoo.com>
Message-ID: <51AC9833.8040306@creativetrax.com>

On 6/2/13 1:47 AM, Matthew Brett wrote:
> The person who is trying to do work in Excel, that should be done in a
> programming language, needed that training.

I'm not sure where in the discussion I should post this, but I wanted to make a comment about the prevalence and power of tools like Excel that I just realized. I've been watching Bret Victor's videos recently, and I just realized that a spreadsheet, with its initial orientation to concrete data, does a good job of implementing his "ladder of abstraction" [1]. You first work with concrete data, then you parametrize the results (e.g., write formulas for the cells), etc. With programming, we need to basically come up with the abstraction right away, which is more difficult. It seems like there is a good tool in the middle ground there that would basically be a spreadsheet that writes your python program for you, letting you play with the data interactively, but the parametrization of your operations writes the python code.

Anyways, some thoughts. I realize there are some stats packages that basically do this (write a script as you click through a gui).

Thanks,

Jason

[1] http://worrydream.com/#!/LadderOfAbstraction; another very interesting Bret Victor video is: http://worrydream.com/#!/DrawingDynamicVisualizationsTalkAddendum

From evilper at gmail.com Mon Jun 3 09:35:35 2013
From: evilper at gmail.com (Per Nielsen)
Date: Mon, 3 Jun 2013 15:35:35 +0200
Subject: [SciPy-User] Unexpectedly large memory usage in scipy.ode class
In-Reply-To: References: 
Message-ID:

You are right, I checked the size of the working arrays and they corresponded perfectly with my memory usage. I checked some of the Runge-Kutta based solvers made available through the complex_ode class, and they have a lower memory overhead compared to zvode, about half according to my tests.

Thank you for your help :)

Per

On Fri, May 31, 2013 at 4:23 PM, Warren Weckesser <warren.weckesser at gmail.com> wrote:

> On Fri, May 31, 2013 at 5:58 AM, Per Nielsen wrote:
>> Hi all,
>>
>> I am solving large linear ODE systems using the QuTip python package (https://code.google.com/p/qutip/) which uses scipy ODE solvers under the hood. The system is of the form
>>
>> dy/dt = L*y,
>>
>> where L is a large complex sparse matrix, all pretty standard. In this type of problem the matrix L is the biggest memory user, expected to be much larger than the solution vector y itself.
>>
>> Below is the output of @profile from the memory_profiler package on the function setting up the ode object; no actual time-stepping is done (the code can be found here: https://github.com/qutip/qutip/blob/master/qutip/mesolve.py#L561).
>>
>> Line #    Mem usage    Increment   Line Contents
>> ================================================
>>    562                             @profile
>>    563                             def _mesolve_const(H, rho0, tlist, c_op_list, expt_ops, args, opt,
>>    564                                                progress_bar):
>>    565                                 """!
>>    566                                 Evolve the density matrix using an ODE solver, for constant hamiltonian
>>    567                                 and collapse operators.
>>    568                                 """
>>    569    61.961 MB     0.000 MB
>>    570    61.961 MB     0.000 MB       if debug:
>>    571                                     print(inspect.stack()[0][3])
>>    572
>>    573                                 #
>>    574                                 # check initial state
>>    575                                 #
>>    576    61.961 MB     0.000 MB       if isket(rho0):
>>    577                                     # if initial state is a ket and no collapse operator where given,
>>    578                                     # fallback on the unitary schrodinger equation solver
>>    579    61.961 MB     0.000 MB           if len(c_op_list) == 0 and isoper(H):
>>    580                                         return _sesolve_const(H, rho0, tlist, expt_ops, args, opt)
>>    581
>>    582                                     # Got a wave function as initial state: convert to density matrix.
>>    583    61.973 MB     0.012 MB           rho0 = rho0 * rho0.dag()
>>    584
>>    585                                 #
>>    586                                 # construct liouvillian
>>    587                                 #
>>    588    61.973 MB     0.000 MB       if opt.tidy:
>>    589    61.973 MB     0.000 MB           H = H.tidyup(opt.atol)
>>    590
>>    591   327.887 MB   265.914 MB       L = liouvillian_fast(H, c_op_list)
>>    592
>>    593                                 #
>>    594                                 # setup integrator
>>    595                                 #
>>    596   343.168 MB    15.281 MB       initial_vector = mat2vec(rho0.full())
>>    597   343.168 MB     0.000 MB       r = scipy.integrate.ode(cy_ode_rhs)
>>    598   343.168 MB     0.000 MB       r.set_f_params(L.data.data, L.data.indices, L.data.indptr)
>>    599   343.168 MB     0.000 MB       r.set_integrator('zvode', method=opt.method, order=opt.order,
>>    600   343.168 MB     0.000 MB                        atol=opt.atol, rtol=opt.rtol, nsteps=opt.nsteps,
>>    601   343.168 MB     0.000 MB                        first_step=opt.first_step, min_step=opt.min_step,
>>    602   343.172 MB     0.004 MB                        max_step=opt.max_step)
>>    603   572.055 MB   228.883 MB       r.set_initial_value(initial_vector, tlist[0])
>>    604
>>    605                                 #
>>    606                                 # call generic ODE code
>>    607                                 #
>>    608   602.805 MB    30.750 MB       return _generic_ode_solve(r, rho0, tlist, expt_ops, opt, progress_bar)
>>
>> On line 591 the L matrix is generated and eats a large chunk of memory, as expected. However, on line 603 setting the initial condition eats an almost comparable chunk, despite the fact that the initial vector itself only takes up ~15 MB (line 596).
>>
>> I find this strange, as I would expect that setting the initial condition would at most increase the memory usage by approximately the size of the initial vector.
>>
>> I have tried to reproduce the problem using a minimal script (see attachment), but here the memory usage is as expected:
>>
>> Filename: test_ode2.py
>>
>> Line #    Mem usage    Increment   Line Contents
>> ================================================
>>      7                             @profile
>>      8    18.707 MB     0.000 MB   def runode():
>>      9    18.707 MB     0.000 MB       N = 5000
>>     10
>>     11                                 # M = np.random.rand(N, N)
>>     12   111.230 MB    92.523 MB       M = sparse.rand(N, N, density=0.05, format='csr') \
>>     13   198.797 MB    87.566 MB           + 1j * sparse.rand(N, N, density=0.05, format='csr')
>>     14   199.031 MB     0.234 MB       y0 = np.random.rand(N, 1) + 1j * np.random.rand(N, 1)
>>     15
>>     16   199.031 MB     0.000 MB       t0 = 0.0
>>     17
>>     18   199.031 MB     0.000 MB       def f(t, y, M):
>>     19                                     # return np.dot(M, y)
>>     20                                     return M.dot(y)
>>     21
>>     22   199.031 MB     0.000 MB       r = ode(f)
>>     23   199.031 MB     0.000 MB       r.set_integrator('zvode', atol=1e-10)
>>     24   199.035 MB     0.004 MB       r.set_f_params(M)
>>     25   199.035 MB     0.000 MB       r.set_initial_value(y0, t0)
>>
>> Does someone with more insight into the scipy.ode solver have an idea of what's going on? I looked in the file myself but didn't see any indication of large memory consumption.
>
> The `set_initial_value` method calls the integrator's `reset` method.
> The `reset` method of the 'zvode' integrator allocates three "work" arrays, `iwork`, `rwork` and `zwork`, whose sizes depend on the size of `y0`. To verify that these are the cause of the memory growth, you can access these arrays after calling `r.set_initial_value(y0, t0)` as `r._integrator.iwork`, etc.
>
> Warren
>
>> Best,
>> Per
>>
>> _______________________________________________
>> SciPy-User mailing list
>> SciPy-User at scipy.org
>> http://mail.scipy.org/mailman/listinfo/scipy-user
>
> _______________________________________________
> SciPy-User mailing list
> SciPy-User at scipy.org
> http://mail.scipy.org/mailman/listinfo/scipy-user

-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From otrov at hush.ai Mon Jun 3 10:32:08 2013
From: otrov at hush.ai (zetah)
Date: Mon, 03 Jun 2013 16:32:08 +0200
Subject: [SciPy-User] peer review of scientific software
In-Reply-To: <51AC9833.8040306@creativetrax.com>
References: <51A39B6D.4030607@gmail.com> <1369793909.71134.YahooMailNeo@web121502.mail.ne1.yahoo.com> <51AC9833.8040306@creativetrax.com>
Message-ID: <20130603143209.E27C1A6E40@smtp.hushmail.com>

Jason Grout wrote:
>I'm not sure where in the discussion I should post this, but I wanted to
>make a comment about the prevalence and power of tools like Excel that I
>just realized. I've been watching Bret Victor's videos recently, and I
>just realized that a spreadsheet, with its initial orientation to
>concrete data, does a good job of implementing his "ladder of
>abstraction" [1]. You first work with concrete data, then you
>parametrize the results (e.g., write formulas for the cells), etc. With
>programming, we need to basically come up with the abstraction right
>away, which is more difficult. It seems like there is a good tool in
>the middle ground there that would basically be a spreadsheet that
>writes your python program for you, letting you play with the data
>interactively, but the parametrization of your operations writes the
>python code.

Same thought here, glad you wrote that.

Following the stream of discussion: the pandas dataframe as an object is annotated with metadata and allows different ways of manipulating this annotated data. If some higher state of mind (armed with skills and vision) can see this as an interactive helper in the IPython Notebook (called as a magic command), perhaps it can sell itself.

So I'm not talking about a visible spreadsheet of a million-element array (like an array shown in Matlab), but just a metadata scheme and numbered axes. Numbered axes could be a thoroughly thought-out manipulator on a basic numpy array, with some advanced features available to the pandas dataframe, as it offers diverse transformation potentials.

This helper could provide a filter, but not a filter as in Excel, rather a filter in the sense of numpy array ufuncs... I can't see the sneak preview right away, but hopefully that's not a problem.

Oh, let someone see this as a challenge :)

From scipy at whamra.com Mon Jun 3 15:42:24 2013
From: scipy at whamra.com (Waleed Hamra)
Date: Mon, 03 Jun 2013 22:42:24 +0300
Subject: [SciPy-User] how do I get the subtrees of dendrogram made by scipy.cluster.hierarchy?
Message-ID: <2213506.DbI8pMOm7X@waleed-virtual-machine>

I had a confusion regarding this module (scipy.cluster.hierarchy) ... and still have some!

For example we have this dendrogram:

http://img62.imageshack.us/img62/8130/3ieb4.png

My question is how can I extract the coloured subtrees (each one represents a cluster) in a nice format, say SIF format?
Now the code to get the plot above is:

In [1]: import scipy
In [2]: import scipy.cluster.hierarchy as sch
In [3]: import matplotlib.pylab as plt
In [4]: X = scipy.randn(100,2)
In [5]: d = sch.distance.pdist(X)
In [6]: Z = sch.linkage(d,method='complete')
In [7]: P = sch.dendrogram(Z)
In [8]: plt.savefig('plot_dendrogram.png')
In [9]: T = sch.fcluster(Z, 0.5*d.max(), 'distance')
In [10]: T
Out[10]:
array([4, 5, 3, 2, 2, 3, 5, 2, 2, 5, 2, 2, 2, 3, 2, 3, 2, 5, 4, 5, 2, 5, 2,
       3, 3, 3, 1, 3, 4, 2, 2, 4, 2, 4, 3, 3, 2, 5, 5, 5, 3, 2, 2, 2, 5, 4,
       2, 4, 2, 2, 5, 5, 1, 2, 3, 2, 2, 5, 4, 2, 5, 4, 3, 5, 4, 4, 2, 2, 2,
       4, 2, 5, 2, 2, 3, 3, 2, 4, 5, 3, 4, 4, 2, 1, 5, 4, 2, 2, 5, 5, 2, 2,
       5, 5, 5, 4, 3, 3, 2, 4], dtype=int32)
In [11]: sch.leaders(Z,T)
Out[11]: (array([190, 191, 182, 193, 194], dtype=int32), array([2, 3, 1, 4, 5], dtype=int32))

So now, the output of fcluster() gives the clustering of the nodes (by their id's), and leaders() described here is supposed to return two arrays: the first one contains the leader nodes of the clusters generated by Z (here we can see we have 5 clusters, as in the plot), and the second one the id's of these clusters.

So if leaders() returns, respectively, L and M, with L[2]=182 and M[2]=1, then cluster 1 is led by node id 182, which doesn't exist in the observation set X; the documentation says "... then it corresponds to a non-singleton cluster". But I can't get it ...

Also, I converted Z to a tree by sch.to_tree(Z), which will return an easy-to-use tree object that I want to visualize, but which tool should I use as a graphical platform that manipulates these kinds of tree objects as inputs?

thanks in advance :)

From msuzen at gmail.com Tue Jun 4 04:07:17 2013
From: msuzen at gmail.com (Suzen, Mehmet)
Date: Tue, 4 Jun 2013 10:07:17 +0200
Subject: [SciPy-User] peer review of scientific software
In-Reply-To: References: <51A39B6D.4030607@gmail.com>
Message-ID:

On 28 May 2013 20:23, Calvin Morrison wrote:
>
> http://arxiv.org/pdf/1210.0530v3.pdf
>
> Pissed-off Scientific Programmer,
> Calvin Morrison

Those recent papers and discussions all talk about good practices. I was thinking today in the bus about why there is not much literature on scientific software development methodologies. One explicit paper I found was from the 80s, called

A Development Methodology for Scientific Software
Cort, G. et al.
http://dx.doi.org/10.1109/TNS.1985.4333629

It is a pretty classic approach by today's standards. There is also a book about generic style and good practice; it's a pretty good book (might have been mentioned on this list before):

Writing Scientific Software: A Guide to Good Style
Suely Oliveira and David E. Stewart
http://www.cambridge.org/9780521858960

but I don't see any reference to modern development methodologies specifically addressed to scientific software. For example: extensions of test-driven development, which would suit better than the classic specification-design-coding-testing. Test cases would be directly related to what we would like to achieve in the first place, for example a generic density of something, etc. I haven't heard of anyone developing scientific software in this way...yet.
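To sketch what I mean as a toy example (the density function and its expected property are invented here, just to show the test-first order):

import numpy as np
from numpy.testing import assert_allclose

def test_density_normalizes():
    # Written first, before the implementation exists: whatever
    # density we end up coding must integrate to 1.
    x = np.linspace(-10.0, 10.0, 10001)
    assert_allclose(np.trapz(density(x), x), 1.0, rtol=1e-6)

def density(x):
    # The implementation comes second, driven by the test above.
    # (A standard normal, purely as a stand-in.)
    return np.exp(-x**2 / 2.0) / np.sqrt(2.0 * np.pi)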
Best,
-m

From josef.pktd at gmail.com Tue Jun 4 07:27:05 2013
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Tue, 4 Jun 2013 07:27:05 -0400
Subject: [SciPy-User] peer review of scientific software
In-Reply-To: References: <51A39B6D.4030607@gmail.com>
Message-ID:

On Tue, Jun 4, 2013 at 4:07 AM, Suzen, Mehmet wrote:
> On 28 May 2013 20:23, Calvin Morrison wrote:
>>
>> http://arxiv.org/pdf/1210.0530v3.pdf
>>
>> Pissed-off Scientific Programmer,
>> Calvin Morrison
>
> Those recent papers and discussions all talk about good practices. I was thinking today in the bus about why there is not much literature on scientific software development methodologies. One explicit paper I found was from the 80s, called
>
> A Development Methodology for Scientific Software
> Cort, G. et al.
> http://dx.doi.org/10.1109/TNS.1985.4333629
>
> It is a pretty classic approach by today's standards. There is also a book about generic style and good practice; it's a pretty good book (might have been mentioned on this list before):
>
> Writing Scientific Software: A Guide to Good Style
> Suely Oliveira and David E. Stewart
> http://www.cambridge.org/9780521858960
>
> but I don't see any reference to modern development methodologies specifically addressed to scientific software. For example: extensions of test-driven development, which would suit better than the classic specification-design-coding-testing. Test cases would be directly related to what we would like to achieve in the first place, for example a generic density of something, etc. I haven't heard of anyone developing scientific software in this way...yet.

I think functional (not unit) testing is pretty much the standard in the area of developing statistical algorithms, even if nobody calls it that way. And I don't know of any references to software development for it.

When writing a library function for existing algorithms, it is standard to test it against existing results. Many (or most) software packages, or articles that describe the software, show that they reproduce existing results as test cases. (And that's the way we work for statsmodels.)

For new algorithms, it is standard to publish Monte Carlo studies that show that the new algorithm is "better" in at least some cases or directions than the existing algorithms (or statistical estimators and tests), and often they use published case studies or applied results to show how the conclusions would differ or be unchanged.

(Just for illustration: the workflow of some friends of mine who are theoretical econometricians. First write the paper with the heavy theory and proofs, then start to write the Monte Carlo; the first version doesn't deliver the results that can be expected based on the theory, so look for bugs and fix those, rerun the Monte Carlo, iterate; then find different test cases, simulated data generating processes, and show where it works and where it doesn't, and check the theoretical explanation/intuition for why it doesn't work in some cases. Submit only cases that work, and write a footnote for the other cases.)

And after that, there are many published articles that present Monte Carlo studies to show that an algorithm does not work properly if some assumptions are violated, and that something else is better.

(This doesn't mean that they produce a "pretty" piece of software, but it shows that it works as advertised.)

I don't think I ever heard of unit or functional testing for applied research, that is, testing the workflow and not the computational tools.
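For concreteness, a stylized version of such a functional test; the "reference" numbers below are placeholders standing in for verified output from another package, not real results:

import numpy as np
from numpy.testing import assert_allclose

def test_ols_against_reference():
    # Fit a small dataset and compare the estimates against numbers
    # that would normally come from another package or a published table.
    x = np.array([0., 1., 2., 3., 4.])
    y = np.array([1.1, 1.9, 3.2, 3.8, 5.1])
    X = np.column_stack([np.ones_like(x), x])
    params = np.linalg.lstsq(X, y)[0]
    # Placeholder reference values (intercept, slope):
    assert_allclose(params, [1.04, 0.99], atol=1e-8)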
Josef > > Best, > -m > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user From josef.pktd at gmail.com Tue Jun 4 07:51:13 2013 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Tue, 4 Jun 2013 07:51:13 -0400 Subject: [SciPy-User] peer review of scientific software In-Reply-To: References: <51A39B6D.4030607@gmail.com> Message-ID: On Tue, Jun 4, 2013 at 7:27 AM, wrote: > On Tue, Jun 4, 2013 at 4:07 AM, Suzen, Mehmet wrote: >> On 28 May 2013 20:23, Calvin Morrison wrote: >>> >>> http://arxiv.org/pdf/1210.0530v3.pdf >>> >>> Pissed-off Scientific Programmer, >>> Calvin Morrison >> >> Those recent papers and discussions all talk about good practises. I >> was thinking >> today in the bus, why there are not many literature on scientific >> software development >> methodologies. One explicit paper I found was from 80s called >> >> A Development Methodology for Scientific Software >> Cort, G. et. al. >> http://dx.doi.org/10.1109/TNS.1985.4333629 >> >> It is pretty classic approach for today's standard, There is also a book about >> generic style and good practice, its a pretty good book (might be >> mentioned in this list before): >> >> Writing Scientific Software: A Guide to Good Style >> Suely Oliveira and David E. Stewart >> http://www.cambridge.org/9780521858960 >> >> but I don't see any reference to modern development methodologies specifically >> address to scientific software. For example: extensions of test driven >> development, >> which would suit better than classic >> specification-design-coding-testing. Test cases >> would be directly related to what we would like to achieve in the >> first place. For example >> a generic density of something etc. I haven't heard anyone developing >> scientific software >> in this way...yet. > > I think functional (not unit) testing is pretty much the standard in > the area of developing statistical algorithms even if nobody calls it > that way. And I don't know of any references to software development > for it. > > When writing a library function for existing algorithms, then it is > standard to test it against existing results. Many (or most) software > packages, or articles that describe the software, show that they > reproduce existing results as test cases. > (And that's the way we work for statsmodels.) > > For new algorithms, it is standard to publish Monte Carlo studies that > show that the new algorithm is "better" in at least some cases or > directions than the existing algorithms (or statistical estimators and > tests), and often they use published case studies or applied results > to show how the conclusion would differ or be unchanged > > (Just for illustration: the workflow of some friends of mine that are > theoretical econometricians. > First write the paper with the heavy theory and proofs, then start to > write the MonteCarlo, the first version doesn't deliver the results > that can be expected based on the theory, look for bugs and fix those, > rerun MonteCarlo, iterate, then find different test cases, simulated > data generating processes, and show where it works and where it > doesn't, and check the theoretical explanation/intuition why it > doesn't work in some cases. Submit only cases that work, and write a > footnote for the other cases. Sorry I forgot one step After the submission, one referee of the paper doesn't like some parts or wants additional simulations. Iterate until publication or rejection. 
If rejection, then submit to another journal, and iterate. By the time the article is finally published, other researchers have already started to use the algorithm, and possibly the code.

Sounds partially like functional test-driven development to me. )

> And after that, there are many published articles that present Monte Carlo studies to show that an algorithm does not work properly if some assumptions are violated, and that something else is better.
>
> (This doesn't mean that they produce a "pretty" piece of software, but it shows that it works as advertised.)
>
> I don't think I ever heard of unit or functional testing for applied research, that is, testing the workflow and not the computational tools.
>
> Josef
>
>> Best,
>> -m
>> _______________________________________________
>> SciPy-User mailing list
>> SciPy-User at scipy.org
>> http://mail.scipy.org/mailman/listinfo/scipy-user

From matthew.brett at gmail.com Wed Jun 5 05:05:23 2013
From: matthew.brett at gmail.com (Matthew Brett)
Date: Wed, 5 Jun 2013 02:05:23 -0700
Subject: [SciPy-User] peer review of scientific software
In-Reply-To: References: <51A39B6D.4030607@gmail.com>
Message-ID:

Hi,

On Tue, Jun 4, 2013 at 4:27 AM, wrote:
> On Tue, Jun 4, 2013 at 4:07 AM, Suzen, Mehmet wrote:
>> On 28 May 2013 20:23, Calvin Morrison wrote:
>>>
>>> http://arxiv.org/pdf/1210.0530v3.pdf
>>>
>>> Pissed-off Scientific Programmer,
>>> Calvin Morrison
>>
>> Those recent papers and discussions all talk about good practices. I was thinking today in the bus about why there is not much literature on scientific software development methodologies. One explicit paper I found was from the 80s, called
>>
>> A Development Methodology for Scientific Software
>> Cort, G. et al.
>> http://dx.doi.org/10.1109/TNS.1985.4333629
>>
>> It is a pretty classic approach by today's standards. There is also a book about generic style and good practice; it's a pretty good book (might have been mentioned on this list before):
>>
>> Writing Scientific Software: A Guide to Good Style
>> Suely Oliveira and David E. Stewart
>> http://www.cambridge.org/9780521858960
>>
>> but I don't see any reference to modern development methodologies specifically addressed to scientific software. For example: extensions of test-driven development, which would suit better than the classic specification-design-coding-testing. Test cases would be directly related to what we would like to achieve in the first place, for example a generic density of something, etc. I haven't heard of anyone developing scientific software in this way...yet.
>
> I think functional (not unit) testing is pretty much the standard in the area of developing statistical algorithms, even if nobody calls it that way. And I don't know of any references to software development for it.
>
> When writing a library function for existing algorithms, it is standard to test it against existing results. Many (or most) software packages, or articles that describe the software, show that they reproduce existing results as test cases.
> (And that's the way we work for statsmodels.)
> For new algorithms, it is standard to publish Monte Carlo studies that show that the new algorithm is "better" in at least some cases or directions than the existing algorithms (or statistical estimators and tests), and often they use published case studies or applied results to show how the conclusions would differ or be unchanged.
>
> (Just for illustration: the workflow of some friends of mine who are theoretical econometricians. First write the paper with the heavy theory and proofs, then start to write the Monte Carlo; the first version doesn't deliver the results that can be expected based on the theory, so look for bugs and fix those, rerun the Monte Carlo, iterate; then find different test cases, simulated data generating processes, and show where it works and where it doesn't, and check the theoretical explanation/intuition for why it doesn't work in some cases. Submit only cases that work, and write a footnote for the other cases.)

Here is an example of some incorrect theory combined with a simulation showing correct results. It turned out there were two separate errors in theory which balanced each other out in the particular case used for the simulation.

This paper reviews and corrects the previous paper:

http://www.math.mcgill.ca/keith/fmriagain/fmriagain.abstract.html

Quote from section 2.2:

"In general the variance of the parameter estimates is underestimated by equation (3) but the estimator of the variance is overestimated by equation (6), so that the two tend to cancel each other out in the T statistic (5). It can be shown that they do cancel out almost exactly for the random regressors that were chosen for validating the methods, which explains why the biases were not observed. However for other non-random regressors these effects do not cancel and large discrepancies can occur."

I think that points at the need to write tests for all parts not just the whole.

Cheers,

Matthew

_______________________________________________
SciPy-User mailing list
SciPy-User at scipy.org
http://mail.scipy.org/mailman/listinfo/scipy-user

From msuzen at gmail.com Wed Jun 5 07:24:42 2013
From: msuzen at gmail.com (Suzen, Mehmet)
Date: Wed, 5 Jun 2013 13:24:42 +0200
Subject: [SciPy-User] peer review of scientific software
In-Reply-To: References: <51A39B6D.4030607@gmail.com>
Message-ID:

On 4 June 2013 13:27, wrote:
> I think functional (not unit) testing is pretty much the standard in the area of developing statistical algorithms, even if nobody calls it that way. And I don't know of any references to software development for it.

Yes, functional and unit testing of existing implementations appears to be, as you said, pretty much common practice. But what I had in mind was a methodology of first designing and coding with the functional tests (initially failing), c.f. TDD:

http://en.wikipedia.org/wiki/Test-driven_development

Your MC workflow seems to follow this logic, though. Regarding the iteration process you refer to: since we are doing many iterations, at some point we lose track of where we started in the process; but I think TDD could help us to focus on the result in scientific software projects.

> research, that is, testing the workflow and not the computational tools.

This is a very crucial point, I think. Workflow and computational tools are two separate things, if we think of tools as APIs. A workflow may require the use of an API, but it isn't our responsibility to test the API.
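For example, something like this toy sketch (fit_decay and its test are invented): we test our own workflow code on synthetic data, and treat the library call as trusted:

import numpy as np
from scipy import optimize

def fit_decay(t, signal):
    # Our workflow code: a thin wrapper around the (trusted) library API.
    residual = lambda p: p[0] * np.exp(-p[1] * t) - signal
    params, ier = optimize.leastsq(residual, [1.0, 1.0])
    return params

def test_fit_decay_recovers_known_parameters():
    # We test *our* wrapper on synthetic data; leastsq itself is
    # not our responsibility here.
    t = np.linspace(0.0, 5.0, 50)
    true = np.array([2.0, 0.7])
    est = fit_decay(t, true[0] * np.exp(-true[1] * t))
    np.testing.assert_allclose(est, true, rtol=1e-5)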
Best, -m From newville at cars.uchicago.edu Wed Jun 5 17:36:54 2013 From: newville at cars.uchicago.edu (Matt Newville) Date: Wed, 5 Jun 2013 16:36:54 -0500 Subject: [SciPy-User] peer review of scientific software In-Reply-To: References: <51A39B6D.4030607@gmail.com> Message-ID: Hi, On Wed, Jun 5, 2013 at 4:05 AM, Matthew Brett wrote: > Hi, > > On Tue, Jun 4, 2013 at 4:27 AM, wrote: >> On Tue, Jun 4, 2013 at 4:07 AM, Suzen, Mehmet wrote: >>> On 28 May 2013 20:23, Calvin Morrison wrote: >>>> >>>> http://arxiv.org/pdf/1210.0530v3.pdf >>>> >>>> Pissed-off Scientific Programmer, >>>> Calvin Morrison >>> >>> Those recent papers and discussions all talk about good practises. I >>> was thinking >>> today in the bus, why there are not many literature on scientific >>> software development >>> methodologies. One explicit paper I found was from 80s called >>> >>> A Development Methodology for Scientific Software >>> Cort, G. et. al. >>> http://dx.doi.org/10.1109/TNS.1985.4333629 >>> >>> It is pretty classic approach for today's standard, There is also a book about >>> generic style and good practice, its a pretty good book (might be >>> mentioned in this list before): >>> >>> Writing Scientific Software: A Guide to Good Style >>> Suely Oliveira and David E. Stewart >>> http://www.cambridge.org/9780521858960 >>> >>> but I don't see any reference to modern development methodologies specifically >>> address to scientific software. For example: extensions of test driven >>> development, >>> which would suit better than classic >>> specification-design-coding-testing. Test cases >>> would be directly related to what we would like to achieve in the >>> first place. For example >>> a generic density of something etc. I haven't heard anyone developing >>> scientific software >>> in this way...yet. >> >> I think functional (not unit) testing is pretty much the standard in >> the area of developing statistical algorithms even if nobody calls it >> that way. And I don't know of any references to software development >> for it. >> >> When writing a library function for existing algorithms, then it is >> standard to test it against existing results. Many (or most) software >> packages, or articles that describe the software, show that they >> reproduce existing results as test cases. >> (And that's the way we work for statsmodels.) >> >> For new algorithms, it is standard to publish Monte Carlo studies that >> show that the new algorithm is "better" in at least some cases or >> directions than the existing algorithms (or statistical estimators and >> tests), and often they use published case studies or applied results >> to show how the conclusion would differ or be unchanged >> >> (Just for illustration: the workflow of some friends of mine that are >> theoretical econometricians. >> First write the paper with the heavy theory and proofs, then start to >> write the MonteCarlo, the first version doesn't deliver the results >> that can be expected based on the theory, look for bugs and fix those, >> rerun MonteCarlo, iterate, then find different test cases, simulated >> data generating processes, and show where it works and where it >> doesn't, and check the theoretical explanation/intuition why it >> doesn't work in some cases. Submit only cases that work, and write a >> footnote for the other cases.) > > Here is an example of some incorrect theory combined with a simulation > showing correct results. 
It turned out there were two separate errors in theory which balanced each other out in the particular case used for the simulation.
>
> This paper reviews and corrects the previous paper:
>
> http://www.math.mcgill.ca/keith/fmriagain/fmriagain.abstract.html
>
> Quote from section 2.2:
>
> "In general the variance of the parameter estimates is underestimated by equation (3) but the estimator of the variance is overestimated by equation (6), so that the two tend to cancel each other out in the T statistic (5). It can be shown that they do cancel out almost exactly for the random regressors that were chosen for validating the methods, which explains why the biases were not observed. However for other non-random regressors these effects do not cancel and large discrepancies can occur."
>
> I think that points at the need to write tests for all parts not just the whole.
>
> Cheers,
>
> Matthew
> _______________________________________________
> SciPy-User mailing list
> SciPy-User at scipy.org
> http://mail.scipy.org/mailman/listinfo/scipy-user

This has been a fairly wide-ranging conversation, but I want to say that I agree completely with Josef's views here. Functional testing has been the norm for scientific software, and, I would say, justifiably so. It is how scientists are trained to think and work. Yes, one must question and understand the details of the instruments and algorithms you use and keep them well-calibrated (which is approximately the meaning of the currently fashionable "unit testing"). But at some point, you trust that your instruments are calibrated and working, and the algorithms you're using are mostly bug-free, and build on these to make real measurements and do real analyses on new, untested systems.

The paper that Alan Isaac referred to that started this conversation seemed to advocate for unit testing in the sense of "don't trust the codes you're using, always test them". At first reading, this seems like good advice. Since unit testing (or, at least, the phrase) is relatively new for software development, it gives the appearance of being new advice. But the authors damage their case by going on to say not to trust analysis tools built by other scientists based on the reputation and prior use of these tools. Here, they expose the weakness of favoring "unit tests" over "functional tests". They are essentially advocating throwing out decades of proven, tested work (and claiming that the use of this work to date is not justified, as it derives from undue reputation of the authors of prior work) for a fashionable new trend. Science is deliberately conservative, and telling scientists that unit testing is all the rage among the cool programmers and they should jump on that bandwagon is not likely to gain much traction.

To be clear, and to use an example I am familiar with, these authors imply "don't trust scipy.optimize.leastsq() -- test it and be skeptical before using it". The problem is that this is easily read as "if you write your own minimization code and write unit tests, you're doing better than this older, outdated work. That piece of junk was used by others based solely on the reputation of the authors, they didn't have any unit tests at all!". One of the key features of scipy is that it reuses well-tested work (LAPACK, MINPACK-1, FFTPACK, and similar well-tested approaches).
Now, there might be a bug in these (and, there might be a bug in the scipy wrapper), but the likelihood of finding a new bug with any particular use case is vanishingly small. No one is saying MINPACK-1 (or scipy.optimize.leastsq, or the Standard Model) is perfect and complete. But it works well on one heck of a lot of cases. At some point you *must* rely on these to make progress. In fact, doing so (applying existing models to new problems and using the results, that is, functional testing) is the classic way in which flaws in the underlying models are found. Yes, calibrating the wazoo out of instruments and algorithms, and pushing every button independently (unit testing) is very useful. I don't think anyone is advocating against doing this. But doing this to the exclusion of existing methodologies is going to meet resistance among scientists, and for good reason. The main problem with the Reinhart and Rogoff paper (and the success of the re-interpretation by Herndon, Ash, and Pollin) is a good example of how science works, and what to avoid. And that is *not* (as some seemed to have suggested) to avoid Excel or spreadsheets in favor of procedural programming approaches, but rather to not use home-built, poorly-tested and poorly-described algorithms. Yes, if they had used R or scipy they may have been better off. Unit testing would have helped them. Any testing would have helped them, as would better explanation of their methods. I'm sorry to admit that I read only the abstract, but I would not be surprised if Matthew Brett's example also fell into this category. That is, were the nearly-cancelling mistakes discovered because of unit testing or because of tests of the whole? Obviously, if two functions were always (always!) used together, and had canceling errors (say, one function "incorrectly" scaled by a factor of 2 and the other incorrectly scaled by a factor or 1/2), unit testing might show flaws that never, ever changed the end results. Functional testing (applying a set of analysis tools to a wide range of data, as with a Monte Carlo approach), seems completely sensible to me. You would not be saying that every component is independently checked and proven reliable on its own, but you are testing the whole. Sorry this was so long, --Matt From matthew.brett at gmail.com Wed Jun 5 17:47:47 2013 From: matthew.brett at gmail.com (Matthew Brett) Date: Wed, 5 Jun 2013 14:47:47 -0700 Subject: [SciPy-User] peer review of scientific software In-Reply-To: References: <51A39B6D.4030607@gmail.com> Message-ID: Hi, On Wed, Jun 5, 2013 at 2:36 PM, Matt Newville wrote: > I'm sorry to admit that I read only the abstract, but I would not be > surprised if Matthew Brett's example also fell into this category. > That is, were the nearly-cancelling mistakes discovered because of > unit testing or because of tests of the whole? Obviously, if two > functions were always (always!) used together, and had canceling > errors (say, one function "incorrectly" scaled by a factor of 2 and > the other incorrectly scaled by a factor or 1/2), unit testing might > show flaws that never, ever changed the end results. I believe what happened was that the first author of the paper read the previous paper and saw the errors in the math. As with your example, if the previous paper's algorithms had only been run on similar data then we would never have had a problem. If you had two functions both off by a factor of two you will have to hope that no-one is calling only one of those functions. 
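Here's a toy version of that hazard (the functions and the factor-of-two errors are made up):

import numpy as np
from numpy.testing import assert_allclose

def sample_variance(x):
    # First bug: off by a factor of 2.
    return 2 * np.var(x, ddof=1)

def t_statistic(x):
    # Second bug: an extra factor of 2 in the denominator, which
    # exactly cancels the factor of 2 above.
    return np.mean(x) / np.sqrt(sample_variance(x) / (2 * len(x)))

x = np.random.randn(100) + 1.0
correct_t = np.mean(x) / np.sqrt(np.var(x, ddof=1) / len(x))
assert_allclose(t_statistic(x), correct_t)  # test of the whole: passes
# assert_allclose(sample_variance(x), np.var(x, ddof=1))  # test of the part: fails

A test of the whole never sees the problem; a test of each part does.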
If we want to provide a library that our users can trust, we must test the whole public API of our code. Of course even then we've only got 'I don't know of any bugs for the ranges of parameters I've tested'. Cheers, Matthew From njs at pobox.com Wed Jun 5 18:08:10 2013 From: njs at pobox.com (Nathaniel Smith) Date: Wed, 5 Jun 2013 23:08:10 +0100 Subject: [SciPy-User] peer review of scientific software In-Reply-To: References: <51A39B6D.4030607@gmail.com> Message-ID: On Wed, Jun 5, 2013 at 10:36 PM, Matt Newville wrote: > The paper that Alan Isaac referred to that started this conversation > seemed to advocate for unit testing in the sense of "don't trust the > codes you're using, always test them". At first reading, this seems > like good advice. Since unit testing (or, at least, the phrase) is > relatively new for software development, it gives the appearance of > being new advice. But the authors damage their case by continuing on > by saying not to trust analysis tools built by other scientists based > on the reputation and prior use of thse tools. Here, they expose the > weakness of favoring "unit tests" over "functional tests". They are > essentially advocating throwing out decades of proven, tested work > (and claiming that the use of this work to date is not justified, as > it derives from un-due reputation of the authors of prior work) for a > fashionable new trend. Science is deliberately conservative, and > telling scientists that unit testing is all the rage among the cool > programmers and they should jump on that bandwagon is not likely to > gain much traction. But... have you ever sat down and written tests for a piece of widely used academic software? (Not LAPACK, but some random large package that's widely used within a field but doesn't have a comprehensive test suite of its own.) Everyone I've heard of who's done this discovers bugs all over the place. Would you personally trip over them if you didn't test the code? Who knows, maybe not. And probably most of the rest -- off by one errors here and there, maybe an incorrect normalizing constant, etc., -- end up not mattering too much. Or maybe they do. How could you even tell? You should absolutely check scipy.optimize.leastsq before using it! You could rewrite it too if you want, I guess, and if you write a thorough test suite it might even work out. But it's pretty bizarre to me to think that someone is going to think "ah-hah, writing my own code + test suite will be easier than just writing a test suite!" Sure some people are going to find ways to procrastinate on the real problem (*cough*grad students*cough*) and NIH ain't just a funding body. But that's totally orthogonal to whether tests are good. Honestly I'm not even sure what unit-testing "bandwagon" you're talking about. I insist on unit tests for my code because every time I fail to write them I regret it sooner or later, and I'd rather it be sooner. And because they pay themselves back ridiculously quickly because you never have to debug more than 15 lines of code at a time, you always know that everything the current 15 lines of code depends on is working correctly. Plus, white-box unit-testing can be comprehensive in a way that black-box functional testing just can't be. The code paths in a system grow like 2**n; you can reasonably test all of them for a short function with n < 5, but not for a whole system with n >> 100. 
And white-box unit-testing is what lets you move quickly when programming, because you can quickly isolate errors instead of spending all your time tracing through stuff in a debugger. If you want to *know* your code is correct, this kind of thorough testing is just a necessary (not sufficient!) condition. (Building on libraries that have large user bases is also very helpful!) -n From Phillip.M.Feldman at gmail.com Wed Jun 5 18:44:03 2013 From: Phillip.M.Feldman at gmail.com (pfeldman) Date: Wed, 5 Jun 2013 15:44:03 -0700 (PDT) Subject: [SciPy-User] ftol and xtol Message-ID: <1370472243657-18355.post@n7.nabble.com> It would be very helpful if one could specify `ftol` and `xtol` with any of the optimization algorithms. How difficult would it be to implement this? Phillip -- View this message in context: http://scipy-user.10969.n7.nabble.com/ftol-and-xtol-tp18355.html Sent from the Scipy-User mailing list archive at Nabble.com. From guziy.sasha at gmail.com Wed Jun 5 18:47:40 2013 From: guziy.sasha at gmail.com (Oleksandr Huziy) Date: Wed, 5 Jun 2013 18:47:40 -0400 Subject: [SciPy-User] ftol and xtol In-Reply-To: <1370472243657-18355.post@n7.nabble.com> References: <1370472243657-18355.post@n7.nabble.com> Message-ID: It does exist for fmin http://docs.scipy.org/doc/scipy/reference/generated/scipy.optimize.fmin.html#scipy.optimize.fmin Which one do you want to use? -- Oleksandr (Sasha) Huziy 2013/6/5 pfeldman > It would be very helpful if one could specify `ftol` and `xtol` with any of > the optimization algorithms. How difficult would it be to implement this? > > Phillip > > > > -- > View this message in context: > http://scipy-user.10969.n7.nabble.com/ftol-and-xtol-tp18355.html > Sent from the Scipy-User mailing list archive at Nabble.com. > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > -------------- next part -------------- An HTML attachment was scrubbed... URL: From josef.pktd at gmail.com Wed Jun 5 22:46:00 2013 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Wed, 5 Jun 2013 22:46:00 -0400 Subject: [SciPy-User] peer review of scientific software In-Reply-To: References: <51A39B6D.4030607@gmail.com> Message-ID: On Wed, Jun 5, 2013 at 6:08 PM, Nathaniel Smith wrote: > On Wed, Jun 5, 2013 at 10:36 PM, Matt Newville > wrote: >> The paper that Alan Isaac referred to that started this conversation >> seemed to advocate for unit testing in the sense of "don't trust the >> codes you're using, always test them". At first reading, this seems >> like good advice. Since unit testing (or, at least, the phrase) is >> relatively new for software development, it gives the appearance of >> being new advice. But the authors damage their case by continuing on >> by saying not to trust analysis tools built by other scientists based >> on the reputation and prior use of thse tools. Here, they expose the >> weakness of favoring "unit tests" over "functional tests". They are >> essentially advocating throwing out decades of proven, tested work >> (and claiming that the use of this work to date is not justified, as >> it derives from un-due reputation of the authors of prior work) for a >> fashionable new trend. Science is deliberately conservative, and >> telling scientists that unit testing is all the rage among the cool >> programmers and they should jump on that bandwagon is not likely to >> gain much traction. > > But... 
have you ever sat down and written tests for a piece of widely > used academic software? (Not LAPACK, but some random large package > that's widely used within a field but doesn't have a comprehensive > test suite of its own.) Everyone I've heard of who's done this > discovers bugs all over the place. Would you personally trip over them > if you didn't test the code? Who knows, maybe not. And probably most > of the rest -- off by one errors here and there, maybe an incorrect > normalizing constant, etc., -- end up not mattering too much. Or maybe > they do. How could you even tell? > > You should absolutely check scipy.optimize.leastsq before using it! But leastsq has seen its uses and we "know" it works. My main work for scipy.stats has been to make reasonably sure it works as advertised (sometimes adding "don't trust those results"). Optimizers either work or they don't work, and we see whether they work for our problems in the "functional" testing, in statsmodels for example. (notwithstanding that many bugs have been fixed in scipy.optimize where the optimizers did not work correctly and someone went to see why.) The recent discussion on global optimizers was on how successful they are for different problems, not whether each individual piece is unit tested. > You could rewrite it too if you want, I guess, and if you write a > thorough test suite it might even work out. But it's pretty bizarre to > me to think that someone is going to think "ah-hah, writing my own > code + test suite will be easier than just writing a test suite!" Sure > some people are going to find ways to procrastinate on the real > problem (*cough*grad students*cough*) and NIH ain't just a funding > body. But that's totally orthogonal to whether tests are good. > > Honestly I'm not even sure what unit-testing "bandwagon" you're > talking about. I insist on unit tests for my code because every time I > fail to write them I regret it sooner or later, and I'd rather it be > sooner. And because they pay themselves back ridiculously quickly > because you never have to debug more than 15 lines of code at a time, > you always know that everything the current 15 lines of code depends > on is working correctly. > > Plus, white-box unit-testing can be comprehensive in a way that > black-box functional testing just can't be. The code paths in a system > grow like 2**n; you can reasonably test all of them for a short > function with n < 5, but not for a whole system with n >> 100. And > white-box unit-testing is what lets you move quickly when programming, > because you can quickly isolate errors instead of spending all your > time tracing through stuff in a debugger. If you want to *know* your > code is correct, this kind of thorough testing is just a necessary > (not sufficient!) condition. (Building on libraries that have large > user bases is also very helpful!) I think there are two different areas, one is writing library code for general usage, and the other is code that is written for (initially) one-time usage as part of the research. Library code is reasonably tested, either by usage or unit/functional tests. If it passes functional tests and usage, it should be reasonably "safe". However, competition among packages (in statistics/econometrics, for example) creates a large incentive for software developers to make sure the code is correct, and to respond to any reports of imprecision. For example, nonlinear least squares: optimize.leastsq and packages that use it have the NIST test cases.
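(A functional check of that kind is only a few lines. A sketch with synthetic data instead of the real NIST problems -- the parameters and the tolerance here are made up:

    import numpy as np
    from numpy.testing import assert_allclose
    from scipy.optimize import leastsq

    x = np.linspace(0, 5, 50)
    y = 2.5 * np.exp(-1.3 * x)   # data generated from known parameters

    def residuals(p):
        return y - p[0] * np.exp(-p[1] * x)

    p_fit, ier = leastsq(residuals, [1.0, 1.0])
    assert_allclose(p_fit, [2.5, 1.3], rtol=1e-5)

)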
Either we pass them or we don't. Another example, every once in a while a journal article is published on the quality of an estimation in the most popular statistical software packages. If one commercial package gets a bad report, then it is usually quickly fixed, or they show that the default values that the author of the paper used are not correct. (Last example that I remember is GARCH fitting where most packages didn't do well because of numerical derivative problems. The author that criticized this also showed how to do it better, which was then quickly adopted by all major packages.) If users cannot trust a package, then there is a big incentive for users to switch to a different one. But if everyone else is using one package in a field, then an individual researcher cannot be "blamed" for using it also, and has little incentive to unit test. How do you know what the correct result for a code unit is, so that you can write a unit test? Often I don't know, and all I can do is wait for the functional test results. Minor bugs are indistinguishable from minor design decisions, and it's not worth a huge amount of effort to find them all: example: in time series analysis, an indexing mistake at the beginning of the time series changes some decimals and the mistake hides behind other decisions for how initial conditions are treated across methods and packages. An indexing mistake at the end of the time series screws up the forecasting and will be visible the first time someone looks at the forecasts. example: degrees of freedom and small sample corrections: I have no idea what different packages use, until I specifically read the docs for this (if there are any, which is true for Stata and SAS, but false for R), and test and verify against it. If I don't have exactly the same algorithm, then I don't see minor bugs/design choices because it doesn't make much difference in most Monte Carlos and applications. example: Is the low precision of the Jacobian of scipy.optimize.leastsq a feature or a bug? I don't use it. example: Is the limited precision of scipy.special in some parameter ranges a feature or a bug? It's a feature for me, but Pauli and some other contributors consider them as bugs, if they can do something about it, and have removed many of them. example: scipy.stats.distributions numerical problems and low precision for unusual cases. bug or feature. I can work with 6 to 10 decimals of precision in most cases, but sometimes I or some users would like to have a lot more, or want to evaluate the distributions at some "weird" parameters. example: I implemented 11 methods to correct p-values for multiple testing, 9 are verified against R, 2 are not available in R and I have to trust my code and that they are doing well in the Monte Carlo (although slightly worse than I would have expected). The other part: code written for one-time research: Why would you spend a lot of time unit testing, if all you are interested in is the functional test that it works for the given application? And, as above, how would you know what the correct result should be, besides "works for me". bottom line: I think unit testing and functional testing for scientific code is pretty different from many other areas of software development. It's easy to write a unit test that the right record is retrieved from a database. It's a lot more difficult to write a unit test that .... (I coded correctly the asymptotic distribution for a new estimator or test statistic.) (How did I end up on the wrong side of the argument?
I have been advocating TDD, unit tests and verified functional tests for five years on this mailing list.) Josef > > -n > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user
From Phillip.M.Feldman at gmail.com Thu Jun 6 00:08:46 2013 From: Phillip.M.Feldman at gmail.com (pfeldman) Date: Wed, 5 Jun 2013 21:08:46 -0700 (PDT) Subject: [SciPy-User] ftol and xtol In-Reply-To: References: <1370472243657-18355.post@n7.nabble.com> Message-ID: <1370491726005-18358.post@n7.nabble.com> In particular, I'd like to be able to specify `ftol` and `xtol` with optimize.fmin_bfgs, optimize.fmin_l_bfgs_b, and optimize.anneal. optimize.anneal has a parameter called `feps` that seems similar to `ftol`, but there is no parameter comparable to `xtol`. Also, it would be great if the parameter names were the same across the board--to the extent possible--because that would make it much easier to compare alternative optimization algorithms. -- View this message in context: http://scipy-user.10969.n7.nabble.com/ftol-and-xtol-tp18355p18358.html Sent from the Scipy-User mailing list archive at Nabble.com.
From Jerome.Kieffer at esrf.fr Thu Jun 6 01:23:27 2013 From: Jerome.Kieffer at esrf.fr (Jerome Kieffer) Date: Thu, 6 Jun 2013 07:23:27 +0200 Subject: [SciPy-User] peer review of scientific software In-Reply-To: References: <51A39B6D.4030607@gmail.com> Message-ID: <20130606072327.cf0288bce43ffbae6947d7ca@esrf.fr> On Wed, 5 Jun 2013 23:08:10 +0100 Nathaniel Smith wrote: > But... have you ever sat down and written tests for a piece of widely > used academic software? (Not LAPACK, but some random large package > that's widely used within a field but doesn't have a comprehensive > test suite of its own.) Everyone I've heard of who's done this > discovers bugs all over the place. Would you personally trip over them > if you didn't test the code? Who knows, maybe not. And probably most > of the rest -- off by one errors here and there, maybe an incorrect > normalizing constant, etc., -- end up not mattering too much. Or maybe > they do. How could you even tell? I found bugs in scipy.ndimage.shift and in scipy.stats.linregress. The first took me ages to be spotted as I was assuming the error was on my side as scipy was seen as a "large library widely used". Cheers, -- Jérôme Kieffer Data analysis unit - ESRF PS: I blame nobody: I probably write more bugs than most of you.
From msuzen at gmail.com Thu Jun 6 02:00:58 2013 From: msuzen at gmail.com (Suzen, Mehmet) Date: Thu, 6 Jun 2013 08:00:58 +0200 Subject: [SciPy-User] peer review of scientific software In-Reply-To: References: <51A39B6D.4030607@gmail.com> Message-ID: On 6 June 2013 00:08, Nathaniel Smith wrote: > time tracing through stuff in a debugger. If you want to *know* your > code is correct, this kind of thorough testing is just a necessary > (not sufficient!) condition. (Building on libraries that have large > user bases is also very helpful!) Good point. I don't think a standard user of a well established API should do unit testing or something similar on the library; except maybe running 'install test'. After usage, correctness of outputs has to be checked against the overall science behind the code i.e. functional testing. Healthy scepticism is good, more of it would constitute paranoia.
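For scipy that 'install test' is a one-liner, assuming nose is installed:

    python -c "import scipy; scipy.test()"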
-m
From matthew.brett at gmail.com Thu Jun 6 07:21:52 2013 From: matthew.brett at gmail.com (Matthew Brett) Date: Thu, 6 Jun 2013 12:21:52 +0100 Subject: [SciPy-User] peer review of scientific software In-Reply-To: <20130606072327.cf0288bce43ffbae6947d7ca@esrf.fr> References: <51A39B6D.4030607@gmail.com> <20130606072327.cf0288bce43ffbae6947d7ca@esrf.fr> Message-ID: Hi, On Thu, Jun 6, 2013 at 6:23 AM, Jerome Kieffer wrote: > On Wed, 5 Jun 2013 23:08:10 +0100 > Nathaniel Smith wrote: > >> But... have you ever sat down and written tests for a piece of widely >> used academic software? (Not LAPACK, but some random large package >> that's widely used within a field but doesn't have a comprehensive >> test suite of its own.) Everyone I've heard of who's done this >> discovers bugs all over the place. Would you personally trip over them >> if you didn't test the code? Who knows, maybe not. And probably most >> of the rest -- off by one errors here and there, maybe an incorrect >> normalizing constant, etc., -- end up not mattering too much. Or maybe >> they do. How could you even tell? > > I found bugs in scipy.ndimage.shift and in scipy.stats.linregress. > The first took me ages to be spotted as I was assuming the error was on > my side as scipy was seen as a "large library widely used". Well said. See also Blake Griffith's current struggles with scipy.sparse (last message title "parametric tests, known failures and skipped tests"). If it's not tested - assume it's broken. If it's not tested and it's not broken, assume it will break soon. Don't use anything for serious work that isn't tested. At least - that has been my experience. Cheers, Matthew
From josef.pktd at gmail.com Thu Jun 6 08:19:03 2013 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 6 Jun 2013 08:19:03 -0400 Subject: [SciPy-User] peer review of scientific software In-Reply-To: References: <51A39B6D.4030607@gmail.com> <20130606072327.cf0288bce43ffbae6947d7ca@esrf.fr> Message-ID: On Thu, Jun 6, 2013 at 7:21 AM, Matthew Brett wrote: > Hi, > > On Thu, Jun 6, 2013 at 6:23 AM, Jerome Kieffer wrote: >> On Wed, 5 Jun 2013 23:08:10 +0100 >> Nathaniel Smith wrote: >> >>> But... have you ever sat down and written tests for a piece of widely >>> used academic software? (Not LAPACK, but some random large package >>> that's widely used within a field but doesn't have a comprehensive >>> test suite of its own.) Everyone I've heard of who's done this >>> discovers bugs all over the place. Would you personally trip over them >>> if you didn't test the code? Who knows, maybe not. And probably most >>> of the rest -- off by one errors here and there, maybe an incorrect >>> normalizing constant, etc., -- end up not mattering too much. Or maybe >>> they do. How could you even tell? >> >> I found bugs in scipy.ndimage.shift and in scipy.stats.linregress. >> The first took me ages to be spotted as I was assuming the error was on >> my side as scipy was seen as a "large library widely used". > > Well said. See also Blake Griffith's current struggles with > scipy.sparse (last message title "parametric tests, known failures and > skipped tests"). As far as I understand these are not BUGs. These are TDD test failures during development while adding support to additional dtypes. Daniel Smith was adding better indexing support, with many TDD test failures. But that doesn't mean scipy.sparse didn't work correctly for the initial implementation for float matrices. I think scipy is overall in very good shape now.
Most open bugs are enhancement requests or refactorings and cleanup. (Of course it doesn't mean that there are no real bugs.) > > If it's not tested - assume it's broken. > > If it's not tested and it's not broken, assume it will break soon. > > Don't use anything for serious work that isn't tested. > > At least - that has been my experience. I agree. But in the heavily used parts of a library, we get the bug reports from users very fast for cases that are not covered by the unit tests. (It took 1 to 2 years to fix all bugs in the distribution fit with some fixed parameters, for all different combinations of fixed and not fixed parameters.) Josef > > Cheers, > > Matthew > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user
From josef.pktd at gmail.com Thu Jun 6 08:56:35 2013 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 6 Jun 2013 08:56:35 -0400 Subject: [SciPy-User] peer review of scientific software In-Reply-To: References: <51A39B6D.4030607@gmail.com> <20130606072327.cf0288bce43ffbae6947d7ca@esrf.fr> Message-ID: >>> I found bugs in scipy.ndimage.shift and in scipy.stats.linregress. >>> The first took me ages to be spotted as I was assuming the error was on >>> my side as scipy was seen as a "large library widely used". Ok, I found the stats.linregress case https://github.com/scipy/scipy/pull/433 There is no way I write unit tests for all edge cases that I never expect to show up. For sure you find bugs/behavior like this in many packages, and I wouldn't trust any package for extreme cases, no matter what their test suite is. Josef
From matthew.brett at gmail.com Thu Jun 6 08:57:28 2013 From: matthew.brett at gmail.com (Matthew Brett) Date: Thu, 6 Jun 2013 13:57:28 +0100 Subject: [SciPy-User] peer review of scientific software In-Reply-To: References: <51A39B6D.4030607@gmail.com> <20130606072327.cf0288bce43ffbae6947d7ca@esrf.fr> Message-ID: Hi, On Thu, Jun 6, 2013 at 1:19 PM, wrote: > On Thu, Jun 6, 2013 at 7:21 AM, Matthew Brett wrote: >> Hi, >> >> On Thu, Jun 6, 2013 at 6:23 AM, Jerome Kieffer wrote: >>> On Wed, 5 Jun 2013 23:08:10 +0100 >>> Nathaniel Smith wrote: >>> >>>> But... have you ever sat down and written tests for a piece of widely >>>> used academic software? (Not LAPACK, but some random large package >>>> that's widely used within a field but doesn't have a comprehensive >>>> test suite of its own.) Everyone I've heard of who's done this >>>> discovers bugs all over the place. Would you personally trip over them >>>> if you didn't test the code? Who knows, maybe not. And probably most >>>> of the rest -- off by one errors here and there, maybe an incorrect >>>> normalizing constant, etc., -- end up not mattering too much. Or maybe >>>> they do. How could you even tell? >>> >>> I found bugs in scipy.ndimage.shift and in scipy.stats.linregress. >>> The first took me ages to be spotted as I was assuming the error was on >>> my side as scipy was seen as a "large library widely used". >> >> Well said. See also Blake Griffith's current struggles with >> scipy.sparse (last message title "parametric tests, known failures and >> skipped tests"). > > As far as I understand these are not BUGs. > These are TDD test failures during development while adding support to > additional dtypes. See for example : https://github.com/scipy/scipy/issues/2542 In particular that ticket ends with "Existing tests only tested lil with float data."
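A dtype sweep is cheap to write - something like this, just a sketch and not the actual scipy test:

    import numpy as np
    from scipy import sparse

    for dtype in (np.int32, np.float32, np.float64, np.complex128):
        A = sparse.lil_matrix((3, 3), dtype=dtype)
        A[1, 2] = 5
        # a round-trip through lil storage should preserve the value
        assert A.toarray()[1, 2] == 5, dtype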
Not that this is surprising - I learned to test the hell out of everything by finding how often I wrote broken code myself. Cheers, Matthew
From matthew.brett at gmail.com Thu Jun 6 08:59:44 2013 From: matthew.brett at gmail.com (Matthew Brett) Date: Thu, 6 Jun 2013 13:59:44 +0100 Subject: [SciPy-User] peer review of scientific software In-Reply-To: References: <51A39B6D.4030607@gmail.com> <20130606072327.cf0288bce43ffbae6947d7ca@esrf.fr> Message-ID: Hi, On Thu, Jun 6, 2013 at 1:56 PM, wrote: >>>> I found bugs in scipy.ndimage.shift and in scipy.stats.linregress. >>>> The first took me ages to be spotted as I was assuming the error was on >>>> my side as scipy was seen as a "large library widely used". > > Ok, I found the stats.linregress case > https://github.com/scipy/scipy/pull/433 > > There is no way I write unit tests for all edge cases that I never > expect to show up. > For sure you find bugs/behavior like this in many packages, and I > wouldn't trust any package for extreme cases, no matter what their > test suite is. I guess that means the user has to know what you thought an extreme case was? I think the point of test driven development is precisely in order to specify the edges before you've locked yourself down to an implementation. If one writes the implementation first one often does forget the edges. Cheers, Matthew
From Valene.Pellissier at cedrat.com Thu Jun 6 09:05:47 2013 From: Valene.Pellissier at cedrat.com (Valène Pellissier) Date: Thu, 6 Jun 2013 13:05:47 +0000 Subject: [SciPy-User] Read matrix from matrix market format file Message-ID: <7BE88C19F22BFE44979FC114121314D2413EAF2E@MBX2.OPENHOST.FR> Hi, I've got some problems reading a matrix from a matrix market format file. I have Python 3.2 and installed Numpy 1.7.1, Scipy 0.12.0 and matplotlib 1.2.1 on Windows 64. I tried the scipy.io.mmread function but got an error I don't understand. >>> B=scipy.io.mmread("my_matrix.mtx") Traceback (most recent call last): File "<stdin>", line 1, in <module> File "C:\Python32\lib\site-packages\scipy\io\mmio.py", line 70, in mmread return MMFile().read(source) File "C:\Python32\lib\site-packages\scipy\io\mmio.py", line 301, in read self._parse_header(stream) File "C:\Python32\lib\site-packages\scipy\io\mmio.py", line 337, in _parse_header self.__class__.info(stream) File "C:\Python32\lib\site-packages\scipy\io\mmio.py", line 208, in info raise ValueError("Header line not of length 3: " + line) It is a big matrix. Is there someone who has a clue about what I'm doing wrong? Your help will be much appreciated. Thanks Valene Valène PELLISSIER - R&D Engineer CEDRAT S.A. 15 Chemin de Malacher - Inovallée - 38246 MEYLAN cedex - FRANCE Phone: +33 (0)4 76 90 50 45 - Fax: +33 (0)4 56 38 08 30 valene.pellissier at cedrat.com - www.cedrat.com
From robanhk at gmail.com Thu Jun 6 09:46:54 2013 From: robanhk at gmail.com (Roban Kramer) Date: Thu, 6 Jun 2013 09:46:54 -0400 Subject: [SciPy-User] Read matrix from matrix market format file In-Reply-To: <7BE88C19F22BFE44979FC114121314D2413EAF2E@MBX2.OPENHOST.FR> References: <7BE88C19F22BFE44979FC114121314D2413EAF2E@MBX2.OPENHOST.FR> Message-ID: Can you send the first few lines of the my_matrix.mtx file? On Thu, Jun 6, 2013 at 9:05 AM, Valène Pellissier <Valene.Pellissier at cedrat.com> wrote:
> Hi,
>
> I've got some problems reading a matrix from a matrix market format file.
> I have Python 3.2 and installed Numpy 1.7.1, Scipy 0.12.0 and matplotlib
> 1.2.1 on Windows 64.
> I tried the scipy.io.mmread function but got an error I don't understand.
>
> >>> B=scipy.io.mmread("my_matrix.mtx")
> Traceback (most recent call last):
> File "<stdin>", line 1, in <module>
> File "C:\Python32\lib\site-packages\scipy\io\mmio.py", line 70, in mmread
> return MMFile().read(source)
> File "C:\Python32\lib\site-packages\scipy\io\mmio.py", line 301, in read
> self._parse_header(stream)
> File "C:\Python32\lib\site-packages\scipy\io\mmio.py", line 337, in _parse_header
> self.__class__.info(stream)
> File "C:\Python32\lib\site-packages\scipy\io\mmio.py", line 208, in info
> raise ValueError("Header line not of length 3: " + line)
>
> It is a big matrix.
> Is there someone who has a clue about what I'm doing wrong?
>
> Your help will be much appreciated.
> Thanks
> Valene
>
> Valène PELLISSIER - R&D Engineer
> CEDRAT S.A.
> 15 Chemin de Malacher - Inovallée - 38246 MEYLAN cedex - FRANCE
> Phone: +33 (0)4 76 90 50 45 - Fax: +33 (0)4 56 38 08 30
> valene.pellissier at cedrat.com - www.cedrat.com
>
> _______________________________________________
> SciPy-User mailing list
> SciPy-User at scipy.org
> http://mail.scipy.org/mailman/listinfo/scipy-user
From josef.pktd at gmail.com Thu Jun 6 09:49:31 2013 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 6 Jun 2013 09:49:31 -0400 Subject: [SciPy-User] peer review of scientific software In-Reply-To: References: <51A39B6D.4030607@gmail.com> <20130606072327.cf0288bce43ffbae6947d7ca@esrf.fr> Message-ID: On Thu, Jun 6, 2013 at 8:57 AM, Matthew Brett wrote: > Hi, > > On Thu, Jun 6, 2013 at 1:19 PM, wrote: >> On Thu, Jun 6, 2013 at 7:21 AM, Matthew Brett wrote: >>> Hi, >>> >>> On Thu, Jun 6, 2013 at 6:23 AM, Jerome Kieffer wrote: >>>> On Wed, 5 Jun 2013 23:08:10 +0100 >>>> Nathaniel Smith wrote: >>>> >>>>> But... have you ever sat down and written tests for a piece of widely
(Not LAPACK, but some random large package >>>>> that's widely used within a field but doesn't have a comprehensive >>>>> test suite of its own.) Everyone I've heard of who's done this >>>>> discovers bugs all over the place. Would you personally trip over them >>>>> if you didn't test the code? Who knows, maybe not. And probably most >>>>> of the rest -- off by one errors here and there, maybe an incorrect >>>>> normalizing constant, etc., -- end up not mattering too much. Or maybe >>>>> they do. How could you even tell? >>>> >>>> I found bugs in scipy.ndimage.shift and in scipy.stats.linregress. >>>> The first took me ages to be spotted as I was assuming the error was on >>>> my side as scipy was seen as a "large library widely used". >>> >>> Well said. See also Blake Griffith's current struggles with >>> scipy.sparse (last message title "parametric tests, known failures and >>> skipped tests"). >> >> As far as I understand these are not BUGs. >> These are TDD test failures during development while adding support to >> additional dtypes. > > See for example : https://github.com/scipy/scipy/issues/2542 > > In particular that ticket ends with "Existing tests only tested lil > with float data." you cut off the other part of my statement But that doesn't mean scipy.sparse didn't work correctly for the initial implementation for float matrices. *float* Josef > > Not that this is surprising - I learned to test the hell out of > everything by finding how often I wrote broken code myself. > > Cheers, > > Matthew > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user From matthew.brett at gmail.com Thu Jun 6 10:33:17 2013 From: matthew.brett at gmail.com (Matthew Brett) Date: Thu, 6 Jun 2013 07:33:17 -0700 Subject: [SciPy-User] peer review of scientific software In-Reply-To: References: <51A39B6D.4030607@gmail.com> <20130606072327.cf0288bce43ffbae6947d7ca@esrf.fr> Message-ID: Hi, On Thu, Jun 6, 2013 at 6:49 AM, wrote: > On Thu, Jun 6, 2013 at 8:57 AM, Matthew Brett wrote: >> Hi, >> >> On Thu, Jun 6, 2013 at 1:19 PM, wrote: >>> On Thu, Jun 6, 2013 at 7:21 AM, Matthew Brett wrote: >>>> Hi, >>>> >>>> On Thu, Jun 6, 2013 at 6:23 AM, Jerome Kieffer wrote: >>>>> On Wed, 5 Jun 2013 23:08:10 +0100 >>>>> Nathaniel Smith wrote: >>>>> >>>>>> But... have you ever sat down and written tests for a piece of widely >>>>>> used academic software? (Not LAPACK, but some random large package >>>>>> that's widely used within a field but doesn't have a comprehensive >>>>>> test suite of its own.) Everyone I've heard of who's done this >>>>>> discovers bugs all over the place. Would you personally trip over them >>>>>> if you didn't test the code? Who knows, maybe not. And probably most >>>>>> of the rest -- off by one errors here and there, maybe an incorrect >>>>>> normalizing constant, etc., -- end up not mattering too much. Or maybe >>>>>> they do. How could you even tell? >>>>> >>>>> I found bugs in scipy.ndimage.shift and in scipy.stats.linregress. >>>>> The first took me ages to be spotted as I was assuming the error was on >>>>> my side as scipy was seen as a "large library widely used". >>>> >>>> Well said. See also Blake Griffith's current struggles with >>>> scipy.sparse (last message title "parametric tests, known failures and >>>> skipped tests"). >>> >>> As far as I understand these are not BUGs. >>> These are TDD test failures during development while adding support to >>> additional dtypes. 
>> >> See for example : https://github.com/scipy/scipy/issues/2542 >> >> In particular that ticket ends with "Existing tests only tested lil >> with float data." > > you cut off the other part of my statement > > But that doesn't mean scipy.sparse didn't work correctly for the > initial implementation for float matrices. > > *float* Sorry - I think I read your message too quickly. On the other hand that neatly points out the problem that the user would be unlikely to guess that sparse would only work correctly for floats. Cheers, Matthew
From josef.pktd at gmail.com Thu Jun 6 10:44:20 2013 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 6 Jun 2013 10:44:20 -0400 Subject: [SciPy-User] peer review of scientific software In-Reply-To: References: <51A39B6D.4030607@gmail.com> <20130606072327.cf0288bce43ffbae6947d7ca@esrf.fr> Message-ID: On Thu, Jun 6, 2013 at 8:59 AM, Matthew Brett wrote: > Hi, > > On Thu, Jun 6, 2013 at 1:56 PM, wrote: >>>>> I found bugs in scipy.ndimage.shift and in scipy.stats.linregress. >>>>> The first took me ages to be spotted as I was assuming the error was on >>>>> my side as scipy was seen as a "large library widely used". >> >> Ok, I found the stats.linregress case >> https://github.com/scipy/scipy/pull/433 >> >> There is no way I write unit tests for all edge cases that I never >> expect to show up. >> For sure you find bugs/behavior like this in many packages, and I >> wouldn't trust any package for extreme cases, no matter what their >> test suite is. > > I guess that means the user has to know what you thought an extreme case was? Anything that gets close to machine precision in a special case requires special attention. I assume many scipy.special distribution functions were written with statistical tests in mind, with maybe good accuracy in the 0.0001 to 0.5 percentiles. I wouldn't trust any of them for extreme tails 1e-30 until I have verified them. And I know in which cases Pauli and others expanded the range with good precision. https://github.com/scipy/scipy/issues/1489 fixed by https://github.com/scipy/scipy/pull/2494 but never went high on *my* priorities > > I think the point of test driven development is precisely in order to > specify the edges before you've locked yourself down to an > implementation. If one writes the implementation first one often > does forget the edges. "A common mistake that people make when trying to design something completely foolproof is to underestimate the ingenuity of complete fools." DA It's a question of priorities, I don't spend my time coming up with edge cases where something might fail, and then still only cover 50% of things users might run into. Some edge cases are important, some are just a numerical curiosity. example: minimum sample size for time series analysis in statsmodels is not checked http://groups.google.com/group/pystatsmodels/browse_thread/thread/15bba79f7474e1b3 I have an open issue for it, but I have no idea why someone would do time series analysis with 5 observations. It doesn't worry me enough to drop everything and fix the "bug". skew and kurtosis tests in scipy.stats now enforce the correct minimum sample size. example: almost perfect collinearity in estimating a linear regression: the model produces nonsense, but what a statistical package is doing in this case and how close to perfect collinearity it can get without breaking down varies widely.
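(a quick way to see how fragile that is - the third column below is collinear up to noise of 1e-12, and the condition number of the design matrix says it all; the numbers are made up, but this is roughly what "almost perfect collinearity" looks like:

>>> import numpy as np
>>> x = np.random.randn(100)
>>> X = np.column_stack([np.ones(100), x, 2*x + 1e-12*np.random.randn(100)])
>>> np.linalg.cond(X)   # around 1e12 or larger

)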
my priorities are usually: check that something is correct for 99.5% of use cases and worry about the other 0.5% when they actually show up. And sometimes we have to revise our evaluation, when an edge case that we never thought of actually occurs pretty regularly. (if you want an example: problems with perfect prediction in Logit that neither Skipper nor I knew about until someone ran into it.) http://statsmodels.sourceforge.net/stable/pitfalls.html#unidentified-parameters to come back to the original point: I think edge cases are an area where having a large user base, which does implicit functional testing, is an advantage, and where I would trust packages that are popular more than those that have a larger test suite (when that's not the same). Josef > > Cheers, > > Matthew > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user
From matthew.brett at gmail.com Thu Jun 6 11:30:57 2013 From: matthew.brett at gmail.com (Matthew Brett) Date: Thu, 6 Jun 2013 08:30:57 -0700 Subject: [SciPy-User] peer review of scientific software In-Reply-To: References: <51A39B6D.4030607@gmail.com> <20130606072327.cf0288bce43ffbae6947d7ca@esrf.fr> Message-ID: Hi, On Thu, Jun 6, 2013 at 7:44 AM, wrote: > On Thu, Jun 6, 2013 at 8:59 AM, Matthew Brett wrote: >> Hi, >> >> On Thu, Jun 6, 2013 at 1:56 PM, wrote: >>>>>> I found bugs in scipy.ndimage.shift and in scipy.stats.linregress. >>>>>> The first took me ages to be spotted as I was assuming the error was on >>>>>> my side as scipy was seen as a "large library widely used". >>> >>> Ok, I found the stats.linregress case >>> https://github.com/scipy/scipy/pull/433 >>> >>> There is no way I write unit tests for all edge cases that I never >>> expect to show up. >>> For sure you find bugs/behavior like this in many packages, and I >>> wouldn't trust any package for extreme cases, no matter what their >>> test suite is. >> >> I guess that means the user has to know what you thought an extreme case was? > > Anything that gets close to machine precision in a special case > requires special attention. > > I assume many scipy.special distribution functions were written with > statistical tests in mind, with maybe good accuracy in the 0.0001 to > 0.5 percentiles. I wouldn't trust any of them for extreme tails 1e-30 > until I have verified them. And I know in which cases Pauli and others > expanded the range with good precision. > > https://github.com/scipy/scipy/issues/1489 > fixed by https://github.com/scipy/scipy/pull/2494 > but never went high on *my* priorities > >> >> I think the point of test driven development is precisely in order to >> specify the edges before you've locked yourself down to an >> implementation. If one writes the implementation first one often >> does forget the edges. > > "A common mistake that people make when trying to design something > completely foolproof is to underestimate the ingenuity of complete > fools." DA The complete fool I am writing the tests for is me :) > It's a question of priorities, I don't spend my time coming up with > edge cases where something might fail, and then still only cover 50% > of things users might run into. Some edge cases are important, some > are just a numerical curiosity. I personally find that writing the tests first - and considering the edge cases first - is quicker in the end. I find that it's easier to think clearly about the implementation before it's written.
Having said that, about half the time I don't write the tests first, but I almost always regret it in due course. Cheers, Matthew
From josef.pktd at gmail.com Thu Jun 6 12:06:38 2013 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 6 Jun 2013 12:06:38 -0400 Subject: [SciPy-User] peer review of scientific software In-Reply-To: References: <51A39B6D.4030607@gmail.com> <20130606072327.cf0288bce43ffbae6947d7ca@esrf.fr> Message-ID: > > On the other hand that neatly points out the problem that the user > would be unlikely to guess that sparse would only work correctly for > floats. Sorry, I guess the arguments got a bit twisted. I'm not arguing against unit tests. The point Matt and I were making was that functional testing and a large user base are important, and that we can rely on packages that have those as users (for the usual usage). When I use Stata or numpy, I don't check whether the mean of 100 variables is correctly calculated (unless my numbers go from 1e-30 to 1e+30). When I use pandas, I quickly realize that ddof=1 for standard deviation and I cannot use it as a plug-in for numpy. scipy is a library that needs unit tests: When I started with scipy 5 years ago, there were huge gaps in test coverage in many sub-packages, especially in the less popular areas. There was not even minimal test coverage for some modules or functions, and I wouldn't trust any of those results. Bugs have mainly been fixed in response to bug reports. So, the popular functions were pretty safe and bugfree. Everything else was a gamble. I think all the major gaps in test coverage have been closed by now. for example linalg and fftpack were always pretty good (based on underlying libraries and heavy usage) special, signal, and stats got lots of attention (I'm not sure how far signal is, stats still has some problems) ndimage got a partial makeover, and might still have rough edges sparse got partial improvement and is on the schedule for this year optimize got a refactoring, but there are still problems in some algorithms that are mainly found by functional testing. interpolate: a mixed bag, and splines are a bit messy. integrate: I don't remember any problems there weave: dead maxentropy: removed because of lack of users and maintainers I don't know much about the other ones. Now, there are very few bugs that show up in scipy.stats that I consider urgent enough to prepare a pull request myself. The last pull requests of mine that I merged into statsmodels had around 95% test coverage, almost all verified against other statistical packages, but no unit tests for dtypes, pandas dataframes or anything "weird". (I will need to go back and add tests for pandas dataframes.) Josef
From paulo.ortins at gmail.com Thu Jun 6 12:19:13 2013 From: paulo.ortins at gmail.com (Paulo Ortins) Date: Thu, 6 Jun 2013 13:19:13 -0300 Subject: [SciPy-User] indices in scipy.ndimage.morphology.distance_transform_edt Message-ID: Hello guys, When I use distance_transform_edt, it returns an indices matrix. What does this matrix mean? -- Best regards, Paulo Ortins. (71) 8834 - 0628 Check out my blog!
From Valene.Pellissier at cedrat.com Thu Jun 6 12:48:12 2013 From: Valene.Pellissier at cedrat.com (Valène Pellissier) Date: Thu, 6 Jun 2013 16:48:12 +0000 Subject: [SciPy-User] Read matrix from matrix market format file In-Reply-To: References: <7BE88C19F22BFE44979FC114121314D2413EAF2E@MBX2.OPENHOST.FR> Message-ID: <7BE88C19F22BFE44979FC114121314D2413EB1E9@MBX2.OPENHOST.FR> Sorry I found out the problem. Blanks ... Do you know an easy and fast way to delete the useless blanks, and still keep the ones that separate the variables? My file looks like this:

%%MatrixMarket matrix coordinate real symmetric
8176 8176 50823
1 1 1.00000000000000
2 1 3.217746558609426E-002
3 1 -5.773555991094832E-002
15 1 5.773555991029918E-002
16 1 -2.670364769384909E-002
50 1 2.666074407522187E-002
51 1 0.203221218356395
58 1 -0.203221218355300
65 1 5.555410250046564E-004
66 1 0.147340602419819
69 1 -3.939755803017718E-003
122 1 -0.147340602419818
123 1 3.939755803017390E-003
124 1 -5.359343846466105E-004
125 1 -0.175978855055765
3897 1 1.844398988268397E-003
3931 1 -6.804457799602319E-004
4146 1 6.804457843076172E-004
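For now I work around it by collapsing every run of blanks to a single space before calling mmread (probably not the fastest way for a big file):

with open("my_matrix.mtx") as fin, open("my_matrix_clean.mtx", "w") as fout:
    for line in fin:
        # split() eats any amount of whitespace; rejoin with single spaces
        fout.write(" ".join(line.split()) + "\n")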
Thanks a lot for your help!

Valène PELLISSIER - R&D Engineer CEDRAT S.A. 15 Chemin de Malacher - Inovallée - 38246 MEYLAN cedex - FRANCE Phone: +33 (0)4 76 90 50 45 - Fax: +33 (0)4 56 38 08 30 valene.pellissier at cedrat.com - www.cedrat.com

From: scipy-user-bounces at scipy.org [mailto:scipy-user-bounces at scipy.org] On behalf of Roban Kramer Sent: Thursday, June 6, 2013 15:47 To: SciPy Users List Subject: Re: [SciPy-User] Read matrix from matrix market format file Can you send the first few lines of the my_matrix.mtx file? On Thu, Jun 6, 2013 at 9:05 AM, Valène Pellissier wrote: Hi, I've got some problems reading a matrix from a matrix market format file. I have Python 3.2 and installed Numpy 1.7.1, Scipy 0.12.0 and matplotlib 1.2.1 on Windows 64. I tried the scipy.io.mmread function but got an error I don't understand. >>> B=scipy.io.mmread("my_matrix.mtx") Traceback (most recent call last): File "<stdin>", line 1, in <module> File "C:\Python32\lib\site-packages\scipy\io\mmio.py", line 70, in mmread return MMFile().read(source) File "C:\Python32\lib\site-packages\scipy\io\mmio.py", line 301, in read self._parse_header(stream) File "C:\Python32\lib\site-packages\scipy\io\mmio.py", line 337, in _parse_header self.__class__.info(stream) File "C:\Python32\lib\site-packages\scipy\io\mmio.py", line 208, in info raise ValueError("Header line not of length 3: " + line) It is a big matrix. Is there someone who has a clue about what I'm doing wrong? Your help will be much appreciated. Thanks Valene Valène PELLISSIER - R&D Engineer CEDRAT S.A. 15 Chemin de Malacher - Inovallée - 38246 MEYLAN cedex - FRANCE Phone: +33 (0)4 76 90 50 45 - Fax: +33 (0)4 56 38 08 30 valene.pellissier at cedrat.com - www.cedrat.com _______________________________________________ SciPy-User mailing list SciPy-User at scipy.org http://mail.scipy.org/mailman/listinfo/scipy-user
From msuzen at gmail.com Thu Jun 6 13:10:43 2013 From: msuzen at gmail.com (Suzen, Mehmet) Date: Thu, 6 Jun 2013 19:10:43 +0200 Subject: [SciPy-User] peer review of scientific software In-Reply-To: References: <51A39B6D.4030607@gmail.com> <20130606072327.cf0288bce43ffbae6947d7ca@esrf.fr> Message-ID: On 6 June 2013 17:30, Matthew Brett wrote: > I personally find that writing the tests first - and considering the > edge cases first - is quicker in the end. I find that it's easier to > think clearly about the implementation before it's written. Having > said that, about half the time I don't write the tests first, but I > almost always regret it in due course. TDD is quite suited here. There is also BDD (Behaviour-driven development) c.f. http://lettuce.it/intro/overview.html Best, -m
From zachary.pincus at yale.edu Thu Jun 6 16:37:00 2013 From: zachary.pincus at yale.edu (Zachary Pincus) Date: Thu, 6 Jun 2013 16:37:00 -0400 Subject: [SciPy-User] indices in scipy.ndimage.morphology.distance_transform_edt In-Reply-To: References: Message-ID: <80EB5D41-CAB2-4E43-BB5B-B2F4225A1881@yale.edu> > Hello guys, > > When I use distance_transform_edt, it returns an indices matrix. > > What does this matrix mean? According to the docstring: > In addition to the distance transform, the feature transform can > be calculated. In this case the index of the closest background > element is returned along the first axis of the result. The examples in the docstring might help illustrate this a bit. Basically, the distance transform gives the distance to the closest background element. The indices specify *which* background element it was that was closest... Zach
From newville at cars.uchicago.edu Fri Jun 7 08:03:01 2013 From: newville at cars.uchicago.edu (Matt Newville) Date: Fri, 7 Jun 2013 07:03:01 -0500 Subject: [SciPy-User] peer review of scientific software In-Reply-To: References: <51A39B6D.4030607@gmail.com> Message-ID: On Wed, Jun 5, 2013 at 5:08 PM, Nathaniel Smith wrote: > On Wed, Jun 5, 2013 at 10:36 PM, Matt Newville > wrote: >> The paper that Alan Isaac referred to that started this conversation >> seemed to advocate for unit testing in the sense of "don't trust the >> codes you're using, always test them". At first reading, this seems >> like good advice. Since unit testing (or, at least, the phrase) is >> relatively new for software development, it gives the appearance of >> being new advice. But the authors damage their case by continuing on >> by saying not to trust analysis tools built by other scientists based >> on the reputation and prior use of these tools. Here, they expose the >> weakness of favoring "unit tests" over "functional tests". They are >> essentially advocating throwing out decades of proven, tested work >> (and claiming that the use of this work to date is not justified, as >> it derives from undue reputation of the authors of prior work) for a >> fashionable new trend. Science is deliberately conservative, and >> telling scientists that unit testing is all the rage among the cool >> programmers and they should jump on that bandwagon is not likely to >> gain much traction. > > But... have you ever sat down and written tests for a piece of widely > used academic software?
(Not LAPACK, but some random large package > that's widely used within a field but doesn't have a comprehensive > test suite of its own.) Everyone I've heard of who's done this > discovers bugs all over the place. Would you personally trip over them > if you didn't test the code? Who knows, maybe not. And probably most > of the rest -- off by one errors here and there, maybe an incorrect > normalizing constant, etc., -- end up not mattering too much. Or maybe > they do. How could you even tell? Sorry for the delay in responding. For some definition of 'widely used academic software' and some definition of "unit testing", why yes, I have. And I have found many errors. I use unit tests, and I am not saying they are bad. I'm saying that other testing methods are valid too. Advocating for one testing method to the exclusion of others is not a good idea. I'm also defending the conservative scientist who finds the errors that "end up not mattering much" as, well, not mattering so much, until they are important, which would probably be a case where the code was applied to a new category or range of problem, which previous tests may not have covered. The non-software analogy is having a very well calibrated and tested meter for some range, and applying it to a new range. It might fail spectacularly, it might work very well, and it might work partially. Applying tools to new problems is what scientific instruments (and software) have to try to do. They might not work as expected. Unit tests with inputs over the expected range are not entirely useful here. > You should absolutely check scipy.optimize.leastsq before using it! Really? Are you sure that is what you mean to say? By "you" do you mean me, personally, or do you mean everyone using scipy.optimize.leastsq? If you mean me, personally, it turns out I have written tests (functional) against the NIST test suite that Josef mentioned: https://github.com/newville/lmfit-py/blob/master/tests/fit_NIST_leastsq.py The results are actually not so clear. Most tests "pass" to very high precision, some "pass", but at lower precision than the certified values, and some do not do very well at all. But then, the NIST test suite is especially grueling. I also believe that the certified NIST values may have actually come from MINPACK-1. In that case, the test shows roughly that scipy.optimize.leastsq is as good as MINPACK-1, which is saying something, but not very much. But if you mean everyone, then I completely disagree. The point of using scipy is that one can be reasonably sure it has already been tested. Of course there may be bugs. What would the tests that *everyone* writes be testing anyway? If they just repeat other tests, it proves little more than that they can write a test. Should they also absolutely check numpy.sqrt? Does numpy use the underlying implementation from the C standard library for sqrt, or does it have its own? I don't know, but if you're suggesting that everyone should test everything, I'm sure you can tell us where these stray from the correct values by more than machine precision. What should one not absolutely check? I suspect you don't mean that everyone should absolutely test everything, but what you wrote could easily be read that way. > You could rewrite it too if you want, I guess, and if you write a > thorough test suite it might even work out. But it's pretty bizarre to > me to think that someone is going to think "ah-hah, writing my own > code + test suite will be easier than just writing a test suite!"
Sure > some people are going to find ways to procrastinate on the real > problem (*cough*grad students*cough*) and NIH ain't just a funding > body. But that's totally orthogonal to whether tests are good. But if you don't trust the other person's code, why would you even bother testing it? And yes, I think many people would think that writing their own code would be easier and better than writing tests for someone else's buggy code. My reading of the Joppa et al paper is that a principal complaint of theirs is that people use existing software packages based on things like "reputation of the package author(s)", and "how many times it's been used in the literature". They advocate being very skeptical of such software. This ignores any testing that has already gone into the existing package -- indeed they imply that there probably is none. But, the uses in the literature demonstrate that the results of the library or package can work well, at least in some cases. This is "prior work", and ignoring it is not good. Ignoring the existing literature is a very common problem in science, as many people prefer to spend a week in the lab to save themselves an hour in the library. But actually *advocating* for others to not use the existing literature or existing packages is a terrible idea. The balance, the pH meter, and the thermocouple were each, at one point in time, sophisticated devices. Now, not so much. You check the label, check that it is not obviously wrong, and believe its results. Of course, these instruments have intrinsic uncertainties, and can be just wrong in certain cases, but you are not (usually) better off building your own. Similarly, the C compiler, the quick-sort algorithm, and the fast Fourier transform. The Joppa et al paper can easily be read to say that scientists should not trust LAPACK, FFTPACK, MINPACK-1. It sounds very close to you saying "you should absolutely check scipy.optimize.leastsq" while leaving it unclear whether you mean "every scientist who ever uses it". This "trust nothing" approach could easily throw out the baby with the bathwater. It is certainly not how science is actually done, because science attempts to apply previous knowledge and methods to new problems, while maintaining a healthy skepticism that previous knowledge and methods may be flawed. Again, unit testing is akin to checking your instruments are working correctly. Yes, this is important. Functional testing *is* the scientific method. > Honestly I'm not even sure what unit-testing "bandwagon" you're > talking about. Again, I'm not opposed to unit testing, or any other testing method at all, and find unit testing very useful (I was writing unit tests yesterday, in fact, and may write more today). But it appears to me that some people are under the impression that a) if code has unit tests it is bug free, and b) if code does not have unit tests, it is full of bugs. Both are wrong. I take Jerome Kieffer's (always great to see synchrotron people here!) story as a good illustration. He didn't test before using scipy. When he found a problem, he first assumed it was in his code, and only after some work found the problem was in scipy itself. This is how science works. Yes, it would have been better if the problem hadn't existed, but now the problem has been fixed for later users. If Jerome had trusted nothing, he would have had no reason to trust scipy, and the bug in scipy may not have been found.
Finally, the fact that his story of finding a bug in scipy was worth repeating suggests that the number of bugs found per user is very low. --Matt Newville
From matthew.brett at gmail.com Fri Jun 7 08:17:54 2013 From: matthew.brett at gmail.com (Matthew Brett) Date: Fri, 7 Jun 2013 05:17:54 -0700 Subject: [SciPy-User] peer review of scientific software In-Reply-To: References: <51A39B6D.4030607@gmail.com> Message-ID: Hi, On Fri, Jun 7, 2013 at 5:03 AM, Matt Newville wrote: > Again, I'm not opposed to unit testing, or any other testing method at > all, and find unit testing very useful (I was writing unit tests > yesterday, in fact, and may write more today). But it appears to me > that some people are under the impression that a) if code has unit > tests it is bug free, and b) if code does not have unit tests, it is > full of bugs. Both are wrong. I don't suppose anyone thinks that exactly. If they've been writing and using software for a while they probably think that code with unit tests is more likely to be reliable than code without, and that code without unit tests should be treated with great caution. I think it would be very hard to argue that the need for unit tests had been overplayed. I haven't seen anyone test code too much, and I have often seen people (myself included) test code too little. Best, Matthew
From pav at iki.fi Fri Jun 7 09:20:54 2013 From: pav at iki.fi (Pauli Virtanen) Date: Fri, 7 Jun 2013 13:20:54 +0000 (UTC) Subject: [SciPy-User] peer review of scientific software References: <51A39B6D.4030607@gmail.com> Message-ID: Matt Newville cars.uchicago.edu> writes: [clip] > But it appears to me > that some people are under the impression that a) if code has unit > tests it is bug free, and b) if code does not have unit tests, it is > full of bugs. Both are wrong. Every scientist worth their salt of course tests whether their results are robust. When this involves programming, the typical approach I've seen is to run with inputs for which the expected outputs are known in some other way, and compare results, check conservation laws etc. The first issue of course is that this testing is done manually, which is what you are inclined to do if you have not heard about automated testing (which nobody bothers to teach you about during your studies in non-IT fields). For small code bases, this probably scales. For larger projects or a longer time span, the amount of effort in testing increases. The second issue is that typically what has been tested is not recorded anywhere. (I have *never* seen anyone keep a programming notebook, whereas a lab notebook is quite mandatory...) As a consequence, you keep forgetting what you did. This wastes time, and you cannot communicate to other people what you have actually tested, even if they take you at your word. Of course, you can maintain a set of test cases and expected results, but this is essentially unit/functional testing. *** I'd argue that a) and b) become more likely as the size of the code base and development history grows bigger, unless you throw proportional manpower at manual testing. Manpower however does not remove the issue that being unable to communicate to other people in a detailed way what you actually did is unprofessional in science. -- Pauli Virtanen
From msuzen at gmail.com Fri Jun 7 18:14:33 2013 From: msuzen at gmail.com (Suzen, Mehmet) Date: Sat, 8 Jun 2013 00:14:33 +0200 Subject: [SciPy-User] Linux Journal Article Message-ID: This might be interesting.
Article by Joey Bernard: Running scientific code using IPython and SciPy http://dl.acm.org/citation.cfm?id=2492105
From wuzzyview at gmail.com Mon Jun 10 10:56:03 2013 From: wuzzyview at gmail.com (Ahmed Fasih) Date: Mon, 10 Jun 2013 10:56:03 -0400 Subject: [SciPy-User] Question about Scipy tutorial relating QR decomposition and SVD Message-ID: In the Scipy tutorial's discussion of linear algebra, specifically the QR decomposition [1], the claim is made that the QR decomposition can be found via the SVD, i.e., rather than doing >> Q, R = scipy.linalg.qr(A) one may use the SVD to get a QR decomposition: >> U, S, Vh = scipy.linalg.svd(A) >> Q2 = U >> R2 = numpy.dot(numpy.diag(S), Vh) However, having just tried this for a random square matrix `A`, I can verify that `R2` above is not upper-triangular, and (Q2, R2) isn't quite a QR decomposition. Should the tutorial be updated to excise this from its discussion, or am I doing something wrong? Thanks, Ahmed [1] http://docs.scipy.org/doc/scipy/reference/tutorial/linalg.html#qr-decomposition
From robert.kern at gmail.com Mon Jun 10 11:09:47 2013 From: robert.kern at gmail.com (Robert Kern) Date: Mon, 10 Jun 2013 16:09:47 +0100 Subject: [SciPy-User] Question about Scipy tutorial relating QR decomposition and SVD In-Reply-To: References: Message-ID: On Mon, Jun 10, 2013 at 3:56 PM, Ahmed Fasih wrote: > In the Scipy tutorial's discussion of linear algebra, specifically the > QR decomposition [1], the claim is made that the QR decomposition can > be found via the SVD, i.e., rather than doing > >>> Q, R = scipy.linalg.qr(A) > > one may use the SVD to get a QR decomposition: > >>> U, S, Vh = scipy.linalg.svd(A) >>> Q2 = U >>> R2 = numpy.dot(numpy.diag(S), Vh) > > However, having just tried this for a random square matrix `A`, I can > verify that `R2` above is not upper-triangular, and (Q2, R2) isn't > quite a QR decomposition. Should the tutorial be updated to excise > this from its discussion, or am I doing something wrong? The tutorial is wrong. The SVD and the QR decomposition do not have that relationship. -- Robert Kern
From ralf.gommers at gmail.com Mon Jun 10 15:23:48 2013 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Mon, 10 Jun 2013 21:23:48 +0200 Subject: [SciPy-User] Python GSoC Planet Message-ID: This may be interesting to follow, roughly half the students are working on science & engineering topics: http://terri.toybox.ca/python-soc/ Ralf
From dkajah at gmail.com Mon Jun 10 15:55:48 2013 From: dkajah at gmail.com (Daniel Penalva) Date: Mon, 10 Jun 2013 16:55:48 -0300 Subject: [SciPy-User] Python GSoC Planet In-Reply-To: References: Message-ID: Many thanks Ralf, I will spread the word in my network.
On Mon, Jun 10, 2013 at 4:23 PM, Ralf Gommers wrote: > This may be interesting to follow, roughly half the students are working > on science & engineering topics: http://terri.toybox.ca/python-soc/ > > Ralf > > > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From josef.pktd at gmail.com Tue Jun 11 11:51:31 2013 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Tue, 11 Jun 2013 11:51:31 -0400 Subject: [SciPy-User] bugs in scipy.stats Message-ID: Follow-up on recent thread on unit and functional testing. scipy.stats has still some bugs, at least with some options. Problems: - functions that are not heavily used - functions without unit tests - functions where unit tests only cover a special case I'm looking for the first time more seriously at tests for equality of variances: levene `trimmed` returns wrong numbers if data is not sorted (`trimmed` is doing better in Monte Carlo studies than `median`) trim_mean returns wrong numbers if data is 2-d obrientransform raises an exception if not all arrays have the same length any other broken corners ??? stats review has still some way to go. volunteers for checking some less used corners ? Josef From guziy.sasha at gmail.com Tue Jun 11 13:24:32 2013 From: guziy.sasha at gmail.com (Oleksandr Huziy) Date: Tue, 11 Jun 2013 13:24:32 -0400 Subject: [SciPy-User] bugs in scipy.stats In-Reply-To: References: Message-ID: Hi Josef, could you, please, list the functions which need to be tested? ? And the link to the testing approach that you'd prefer me to use unittest, nose, doctest? I am not experienced tester but really want to help. Cheers -- Oleksandr (Sasha) Huziy -------------- next part -------------- An HTML attachment was scrubbed... URL: From josef.pktd at gmail.com Tue Jun 11 13:58:54 2013 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Tue, 11 Jun 2013 13:58:54 -0400 Subject: [SciPy-User] bugs in scipy.stats In-Reply-To: References: Message-ID: On Tue, Jun 11, 2013 at 1:24 PM, Oleksandr Huziy wrote: > Hi Josef, > > could you, please, list the functions which need to be tested? > And the link to the testing approach that you'd prefer me to use unittest, > nose, doctest? I am not experienced tester but really want to help. Hi Sasha, I didn't run the test coverage on scipy.stats in a long time This was my old list (2009) which is very outdated https://github.com/scipy/scipy/issues/1554 All tests are run with nose, scipy doesn't have doctests. The testing guidelines are at https://github.com/numpy/numpy/blob/master/doc/TESTS.rst.txt The pattern for the tests can be seen in the test suite https://github.com/scipy/scipy/tree/master/scipy/stats/tests especially test_stats.py and test_morestats.py, and those for mstats One check that would also be very helpful is to try out different kinds of arguments. For example, I think there might still be problems with 2d arrays in some functions. Some will raise ValueErrors if they cannot handle 2d arrays, but some might just return incorrect numbers. 
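A quick and dirty way to screen for this is to compare the 2d call with the corresponding 1d calls on the columns, for example (a rough sketch, not from the test suite):

import numpy as np
from scipy import stats

np.random.seed(0)
x, y = np.random.randn(10, 2), np.random.randn(15, 2)
for func in [stats.mood, stats.bartlett, stats.levene, stats.fligner]:
    try:
        # compare the 2d call with the 1d call on the first column
        res_2d = func(x, y)
        res_1d = func(x[:, 0], y[:, 0])
        print func.__name__, res_2d, res_1d
    except Exception as err:
        print func.__name__, 'raised:', err

If a function neither raises nor agrees with the 1d column results, it is silently broken for 2d input.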
Example: I never looked closely at `mood` which has unit tests, so a quick try: >>> stats.mood(np.random.randn(10,2), np.random.randn(15,2)) (26.664783935766987, 1.2060935978310698e-156) >>> stats.mood(np.random.randn(10), np.random.randn(15)) (-0.46553454010068451, 0.64154870791874163) the first result looks pretty weird In these cases we should add a `raise ValueError` or try to enhance it to 2d. Thank you, Josef > > Cheers > -- > Oleksandr (Sasha) Huziy > > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > From josef.pktd at gmail.com Tue Jun 11 14:21:18 2013 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Tue, 11 Jun 2013 14:21:18 -0400 Subject: [SciPy-User] bugs in scipy.stats In-Reply-To: References: Message-ID: On Tue, Jun 11, 2013 at 1:58 PM, wrote: > On Tue, Jun 11, 2013 at 1:24 PM, Oleksandr Huziy wrote: >> Hi Josef, >> >> could you, please, list the functions which need to be tested? >> And the link to the testing approach that you'd prefer me to use unittest, >> nose, doctest? I am not experienced tester but really want to help. > > Hi Sasha, > > I didn't run the test coverage on scipy.stats in a long time > This was my old list (2009) which is very outdated > https://github.com/scipy/scipy/issues/1554 > > All tests are run with nose, scipy doesn't have doctests. The testing > guidelines are at > https://github.com/numpy/numpy/blob/master/doc/TESTS.rst.txt > > The pattern for the tests can be seen in the test suite > https://github.com/scipy/scipy/tree/master/scipy/stats/tests > especially test_stats.py and test_morestats.py, and those for mstats > > One check that would also be very helpful is to try out different > kinds of arguments. > For example, I think there might still be problems with 2d arrays in > some functions. Some will raise ValueErrors if they cannot handle 2d > arrays, but some might just return incorrect numbers. > > Example: I never looked closely at `mood` which has unit tests, so a quick try: >>>> stats.mood(np.random.randn(10,2), np.random.randn(15,2)) > (26.664783935766987, 1.2060935978310698e-156) >>>> stats.mood(np.random.randn(10), np.random.randn(15)) > (-0.46553454010068451, 0.64154870791874163) > > the first result looks pretty weird > > In these cases we should add a `raise ValueError` or try to enhance it to 2d. If you check a function and it works as advertised, then this would also be good to know We still have an old milestone for the stats review, where we can also note that everything is fine https://github.com/scipy/scipy/issues?milestone=4&state=open or open a new issue and note the functions that you checked there, so we have a record. Other hypothesis test that I never looked at in detail and tried out only with "nice" numbers are fligner, ansari, bartlett, ... 
>>> x, y = np.random.randn(10,2), np.random.randn(15,2) >>> stats.bartlett(x, y) (4.7695839486013287e-05, 0.99448967952524281) >>> stats.bartlett(x.ravel(), y.ravel()) (0.00010231554134127248, 0.99192944393408666) looks also wrong (ttests are vectorized, ks tests raise an exception somewhere in the code with 2d) Josef > > Thank you, > > Josef > >> >> Cheers >> -- >> Oleksandr (Sasha) Huziy >> >> _______________________________________________ >> SciPy-User mailing list >> SciPy-User at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-user >> From guziy.sasha at gmail.com Wed Jun 12 00:37:33 2013 From: guziy.sasha at gmail.com (Oleksandr Huziy) Date: Wed, 12 Jun 2013 00:37:33 -0400 Subject: [SciPy-User] bugs in scipy.stats In-Reply-To: References: Message-ID: Thank you Josef, for the detailed description. Though I have one more question concerning the workflow. After I've cloned the project locally can I develop and run tests from inside the directory without reinstalling it? (I suppose build is necessary when fortran code changes), but I would like to be able to edit and run the code from the same place.. (this would permit for example to use it as a project in my favourite IDE). But when I try to run test_morestats.py from inside the project I get the following exception: Traceback (most recent call last): File "/home/san/.IntelliJIdea11/config/plugins/python/helpers/pycharm/utrunner.py", line 113, in modules = [loadSource(a[0])] File "/home/san/.IntelliJIdea11/config/plugins/python/helpers/pycharm/utrunner.py", line 44, in loadSource module = imp.load_source(moduleName, fileName) File "/home/san/Python/scipy_project/scipy/stats/tests/test_morestats.py", line 14, in import scipy.stats as stats File "/home/san/Python/scipy_project/scipy/__init__.py", line 120, in raise ImportError(msg) ImportError: Error importing scipy: you cannot import scipy while being in scipy source directory; please exit the scipy source tree first, and relaunch your python intepreter. How do you work on it? I understand that installing it (using virtualenv, which I am currently using for scipy testing) and then running tests in a different place would work, but is it the only way, it does not seem efficient. Cheers -- Sasha 2013/6/11 > On Tue, Jun 11, 2013 at 1:58 PM, wrote: > > On Tue, Jun 11, 2013 at 1:24 PM, Oleksandr Huziy > wrote: > >> Hi Josef, > >> > >> could you, please, list the functions which need to be tested? > >> And the link to the testing approach that you'd prefer me to use > unittest, > >> nose, doctest? I am not experienced tester but really want to help. > > > > Hi Sasha, > > > > I didn't run the test coverage on scipy.stats in a long time > > This was my old list (2009) which is very outdated > > https://github.com/scipy/scipy/issues/1554 > > > > All tests are run with nose, scipy doesn't have doctests. The testing > > guidelines are at > > https://github.com/numpy/numpy/blob/master/doc/TESTS.rst.txt > > > > The pattern for the tests can be seen in the test suite > > https://github.com/scipy/scipy/tree/master/scipy/stats/tests > > especially test_stats.py and test_morestats.py, and those for mstats > > > > One check that would also be very helpful is to try out different > > kinds of arguments. > > For example, I think there might still be problems with 2d arrays in > > some functions. Some will raise ValueErrors if they cannot handle 2d > > arrays, but some might just return incorrect numbers. 
> > > > Example: I never looked closely at `mood` which has unit tests, so a > quick try: > >>>> stats.mood(np.random.randn(10,2), np.random.randn(15,2)) > > (26.664783935766987, 1.2060935978310698e-156) > >>>> stats.mood(np.random.randn(10), np.random.randn(15)) > > (-0.46553454010068451, 0.64154870791874163) > > > > the first result looks pretty weird > > > > In these cases we should add a `raise ValueError` or try to enhance it > to 2d. > > If you check a function and it works as advertised, then this would > also be good to know > We still have an old milestone for the stats review, where we can also > note that everything is fine > https://github.com/scipy/scipy/issues?milestone=4&state=open > > or open a new issue and note the functions that you checked there, so > we have a record. > > Other hypothesis test that I never looked at in detail and tried out > only with "nice" numbers are > fligner, ansari, bartlett, ... > > >>> x, y = np.random.randn(10,2), np.random.randn(15,2) > >>> stats.bartlett(x, y) > (4.7695839486013287e-05, 0.99448967952524281) > >>> stats.bartlett(x.ravel(), y.ravel()) > (0.00010231554134127248, 0.99192944393408666) > > looks also wrong > > (ttests are vectorized, ks tests raise an exception somewhere in the > code with 2d) > > Josef > > > > > > Thank you, > > > > Josef > > > >> > >> Cheers > >> -- > >> Oleksandr (Sasha) Huziy > >> > >> _______________________________________________ > >> SciPy-User mailing list > >> SciPy-User at scipy.org > >> http://mail.scipy.org/mailman/listinfo/scipy-user > >> > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > -------------- next part -------------- An HTML attachment was scrubbed... URL: From josef.pktd at gmail.com Wed Jun 12 01:26:57 2013 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Wed, 12 Jun 2013 01:26:57 -0400 Subject: [SciPy-User] bugs in scipy.stats In-Reply-To: References: Message-ID: On Wed, Jun 12, 2013 at 12:37 AM, Oleksandr Huziy wrote: > Thank you Josef, > > for the detailed description. Though I have one more question concerning the > workflow. > > After I've cloned the project locally can I develop and run tests from > inside the directory without reinstalling it? (I suppose build is necessary > when fortran code changes), but I would like to be able to edit and run the > code from the same place.. (this would permit for example to use it as a > project in my favourite IDE). But when I try to run test_morestats.py from > inside the project I get the following exception: > > Traceback (most recent call last): > File > "/home/san/.IntelliJIdea11/config/plugins/python/helpers/pycharm/utrunner.py", > line 113, in > modules = [loadSource(a[0])] > File > "/home/san/.IntelliJIdea11/config/plugins/python/helpers/pycharm/utrunner.py", > line 44, in loadSource > module = imp.load_source(moduleName, fileName) > File "/home/san/Python/scipy_project/scipy/stats/tests/test_morestats.py", > line 14, in > import scipy.stats as stats > File "/home/san/Python/scipy_project/scipy/__init__.py", line 120, in > > raise ImportError(msg) > ImportError: Error importing scipy: you cannot import scipy while > being in scipy source directory; please exit the scipy source > tree first, and relaunch your python intepreter. > > How do you work on it? 
I understand that installing it (using virtualenv, > which I am currently using for scipy testing) and then running tests in a > different place would work, but is it the only way, it does not seem > efficient. I usually run nosetests directly in a separate shell window, which makes it easy to run just specific test modules or individual tests nosetests path_to_test_module nosetests path_to_test_module:testfunction_name (IIRC) It uses whichever scipy is on the python path. So if you build inplace, then you should be able to edit, run nosetests, commit for pure python code. And the test_module can be, but does not need to be inside scipy, for example run a test module that is in a source tree when the build/installed scipy is somewhere else. The alternative using scipy.stats.tests in a separate python interpreter is much slower, since it's running all the tests for scipy.stats. Full test without skipping the slow tests takes 5 minutes or so. For me working on scipy is more complicated, since I cannot build it on Windows, but that's what I do for statsmodels. I usually use Eclipse for editing the source with inplace build of extensions, spyder to run some examples, and nosetests in a shell for the tests. There are some utility scripts for compiling and testing scipy, but for pure python code it's a detour that's not necessary. Cheers, Josef > > Cheers > -- > Sasha > > > > > 2013/6/11 > >> On Tue, Jun 11, 2013 at 1:58 PM, wrote: >> > On Tue, Jun 11, 2013 at 1:24 PM, Oleksandr Huziy >> > wrote: >> >> Hi Josef, >> >> >> >> could you, please, list the functions which need to be tested? >> >> And the link to the testing approach that you'd prefer me to use >> >> unittest, >> >> nose, doctest? I am not experienced tester but really want to help. >> > >> > Hi Sasha, >> > >> > I didn't run the test coverage on scipy.stats in a long time >> > This was my old list (2009) which is very outdated >> > https://github.com/scipy/scipy/issues/1554 >> > >> > All tests are run with nose, scipy doesn't have doctests. The testing >> > guidelines are at >> > https://github.com/numpy/numpy/blob/master/doc/TESTS.rst.txt >> > >> > The pattern for the tests can be seen in the test suite >> > https://github.com/scipy/scipy/tree/master/scipy/stats/tests >> > especially test_stats.py and test_morestats.py, and those for mstats >> > >> > One check that would also be very helpful is to try out different >> > kinds of arguments. >> > For example, I think there might still be problems with 2d arrays in >> > some functions. Some will raise ValueErrors if they cannot handle 2d >> > arrays, but some might just return incorrect numbers. >> > >> > Example: I never looked closely at `mood` which has unit tests, so a >> > quick try: >> >>>> stats.mood(np.random.randn(10,2), np.random.randn(15,2)) >> > (26.664783935766987, 1.2060935978310698e-156) >> >>>> stats.mood(np.random.randn(10), np.random.randn(15)) >> > (-0.46553454010068451, 0.64154870791874163) >> > >> > the first result looks pretty weird >> > >> > In these cases we should add a `raise ValueError` or try to enhance it >> > to 2d. >> >> If you check a function and it works as advertised, then this would >> also be good to know >> We still have an old milestone for the stats review, where we can also >> note that everything is fine >> https://github.com/scipy/scipy/issues?milestone=4&state=open >> >> or open a new issue and note the functions that you checked there, so >> we have a record. 
>>
>> Other hypothesis tests that I never looked at in detail and tried out
>> only with "nice" numbers are
>> fligner, ansari, bartlett, ...
>>
>> >>> x, y = np.random.randn(10,2), np.random.randn(15,2)
>> >>> stats.bartlett(x, y)
>> (4.7695839486013287e-05, 0.99448967952524281)
>> >>> stats.bartlett(x.ravel(), y.ravel())
>> (0.00010231554134127248, 0.99192944393408666)
>>
>> also looks wrong
>>
>> (ttests are vectorized, ks tests raise an exception somewhere in the
>> code with 2d)
>>
>> Josef
>>
>> >
>> > Thank you,
>> >
>> > Josef
>> >
>> >>
>> >> Cheers
>> >> --
>> >> Oleksandr (Sasha) Huziy
>> >>
>> >> _______________________________________________
>> >> SciPy-User mailing list
>> >> SciPy-User at scipy.org
>> >> http://mail.scipy.org/mailman/listinfo/scipy-user
>> >>
>> _______________________________________________
>> SciPy-User mailing list
>> SciPy-User at scipy.org
>> http://mail.scipy.org/mailman/listinfo/scipy-user

From pav at iki.fi Wed Jun 12 06:15:34 2013
From: pav at iki.fi (Pauli Virtanen)
Date: Wed, 12 Jun 2013 10:15:34 +0000 (UTC)
Subject: [SciPy-User] bugs in scipy.stats
References: Message-ID:

<josef.pktd at gmail.com> writes:
[clip]
> There are some utility scripts for compiling and testing scipy, but
> for pure python code it's a detour that's not necessary.

If, on the other hand, you are able to build scipy, it's also easy to just run:

    python runtests.py -vg -t \
        scipy/stats/tests/test_stats.py:TestRound.test_rounding0

-- Pauli Virtanen

From sudheer.joseph at yahoo.com Wed Jun 12 20:48:26 2013
From: sudheer.joseph at yahoo.com (Sudheer Joseph)
Date: Thu, 13 Jun 2013 08:48:26 +0800 (SGT)
Subject: [SciPy-User] Fw: [SciPy-Dev] t-statistic
In-Reply-To: <1371034285.13509.YahooMailNeo@web193403.mail.sg3.yahoo.com>
References: <1371034285.13509.YahooMailNeo@web193403.mail.sg3.yahoo.com>
Message-ID: <1371084506.22161.YahooMailNeo@web193404.mail.sg3.yahoo.com>

Dear experts,

I am doing a project involving regression of a model variable on an observed variable, and I want to find t-values dynamically as the number of available observations in the comparison changes. Is there a tool in numpy/scipy which gives the appropriate t-value if we give the number of samples?

t = 2.31    # appropriate t value (where n=9, two tailed 95%)

with best regards,
Sudheer

From josef.pktd at gmail.com Wed Jun 12 23:07:00 2013
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Wed, 12 Jun 2013 23:07:00 -0400
Subject: [SciPy-User] Welch's Anova (unequal variances)
Message-ID:

Question out of curiosity

scipy stats has f_oneway, which does the standard one-way ANOVA that assumes equal variances across groups. Similar to Welch's t-test, Welch's ANOVA allows for different variances across groups.

I don't find anything with a Google search for "Welch's Anova in python".

Does everyone who uses ANOVA have data with equal variances across groups, or is there something that google didn't find, or is everyone using resampling methods?

Josef

From jsalvatier at gmail.com Thu Jun 13 04:13:51 2013
From: jsalvatier at gmail.com (John Salvatier)
Date: Thu, 13 Jun 2013 01:13:51 -0700
Subject: [SciPy-User] Gradient of spline interpolation at a point wrt changing the knot points
Message-ID:

I'm using scipy.interpolate.InterpolatedUnivariateSpline in a statistical application.
y = InterpolatedUnivariateSpline(x0,y0)(x)

I need the derivatives of the spline evaluated at a given point with respect to changing the knot values (y0). That is, I need the derivative of y wrt y0. Is there a way I could compute this?

Thank you,
John
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From sgarcia at olfac.univ-lyon1.fr Thu Jun 13 05:40:09 2013
From: sgarcia at olfac.univ-lyon1.fr (Samuel Garcia)
Date: Thu, 13 Jun 2013 11:40:09 +0200
Subject: [SciPy-User] ANN: neo 0.3.0 release
Message-ID: <51B99379.6060001@olfac.univ-lyon1.fr>

We are pleased to announce the 0.3.0 release of neo.

Neo is a package for representing electrophysiology data in Python, together with support for reading a wide range of neurophysiology file formats, including Spike2, NeuroExplorer, AlphaOmega, Axon, Blackrock, Plexon and Tdt, and support for writing to a subset of these formats plus non-proprietary formats including HDF5.

The goal of Neo is to improve interoperability between Python tools for analyzing, visualizing and generating electrophysiology data (such as OpenElectrophy, NeuroTools, G-node, Helmholtz, PyNN) by providing a common, shared object model. In order to be as lightweight a dependency as possible, Neo is deliberately limited to representation of data, with no functions for data analysis or visualization.

Neo implements a hierarchical data model well adapted to intracellular and extracellular electrophysiology and EEG data, with support for multi-electrodes (for example tetrodes). Neo's data objects build on the quantities package, which in turn builds on NumPy by adding support for physical dimensions. Thus neo objects behave just like normal NumPy arrays, but with additional metadata, checks for dimensional consistency and automatic unit conversion.

Release 0.3.0 notes:

* various bug fixes in neo.io
* added ElphyIO
* SpikeTrain performance improved
* An IO class now can return a list of Block (see read_all_blocks in IOs)
* python3 compatibility improved

Home page: http://neuralensemble.org/neo
Mailing list: https://groups.google.com/forum/?fromgroups#!forum/neuralensemble
Documentation: http://packages.python.org/neo/

The neo team

From mailinglists at xgm.de Fri Jun 14 06:50:17 2013
From: mailinglists at xgm.de (Florian Lindner)
Date: Fri, 14 Jun 2013 12:50:17 +0200
Subject: [SciPy-User] Ignore characters while reading text
Message-ID: <5789326.FQK5ToiTHf@horus>

Hello,

I have a text file with data like

1 (2 3 4) (5 6 7) (8 9 10)
2 (4 5 1) (3 6 8) (1 6 45)

How can I read that file into an array?

[
 [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
 [2, 4, 5, 1, 3, ... ]
]

I tried genfromtxt with deletechars="()" but that seems to affect only 'names'.
I also tried delimiter="() " but that didn't work either.

Thanks!
Florian

From newville at cars.uchicago.edu Fri Jun 14 08:32:09 2013
From: newville at cars.uchicago.edu (Matt Newville)
Date: Fri, 14 Jun 2013 07:32:09 -0500
Subject: [SciPy-User] Ignore characters while reading text
In-Reply-To: <5789326.FQK5ToiTHf@horus> References: <5789326.FQK5ToiTHf@horus> Message-ID:

On Fri, Jun 14, 2013 at 5:50 AM, Florian Lindner wrote:
> Hello,
>
> I have a text file with data like
>
> 1 (2 3 4) (5 6 7) (8 9 10)
> 2 (4 5 1) (3 6 8) (1 6 45)
>
> How can I read that file into an array?
>
> [
>  [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
>  [2, 4, 5, 1, 3, ... ]
> ]
>
> I tried genfromtxt with deletechars="()" but that seems to affect only 'names'.
> I also tried delimiter="() " but that didn't work either.

Would this do?

import numpy as np
from cStringIO import StringIO
txt = '1 (2 3 4) (5 6 7) (8 9 10)'
np.loadtxt(StringIO(txt.replace('(', '').replace(')', '')))

--Matt

From davidmenhur at gmail.com Fri Jun 14 08:46:22 2013
From: davidmenhur at gmail.com (Daπid)
Date: Fri, 14 Jun 2013 14:46:22 +0200
Subject: [SciPy-User] Ignore characters while reading text
In-Reply-To: References: <5789326.FQK5ToiTHf@horus> Message-ID:

On 14 June 2013 14:32, Matt Newville wrote:
> Would this do?
>
> import numpy as np
> from cStringIO import StringIO
> txt = '1 (2 3 4) (5 6 7) (8 9 10)'
> np.loadtxt(StringIO(txt.replace('(', '').replace(')', '')))

If I am not mistaken, then you are reading the data twice or thrice. If this is big, and performance is critical, you may be better off doing the loadtxt yourself. The core of np.loadtxt is essentially a "for line in file: data.append(parse(line))", with some wrapping intelligence that probably is not needed in your case.

https://github.com/numpy/numpy/blob/v1.7.0/numpy/lib/npyio.py#L610 ----> loadtxt
https://github.com/numpy/numpy/blob/v1.7.0/numpy/lib/npyio.py#L1573 ---> genfromtxt

David.

From scopatz at gmail.com Sun Jun 2 11:59:17 2013
From: scopatz at gmail.com (Anthony Scopatz)
Date: Sun, 2 Jun 2013 10:59:17 -0500
Subject: [SciPy-User] ANN: PyTables 3.0
Message-ID:

===========================
 Announcing PyTables 3.0.0
===========================

We are happy to announce PyTables 3.0.0.

PyTables 3.0.0 comes about 5 years after the last major release (2.0) and 7 months after the last stable release (2.4.0).

This is a new major release and an important milestone for the PyTables project, since it provides the long-awaited support for Python 3.x, which has been around for 4 years. Almost all of the core numeric/scientific packages for Python already support Python 3, so we are very happy that PyTables can now also provide this important feature.

What's new
==========

A short summary of main new features:

- Since this release, PyTables now provides full support for Python 3.
- The entire code base is now more compliant with the coding style
  guidelines described in PEP8.
- Basic support for HDF5 drivers. It is now possible to open/create an
  HDF5 file using one of the SEC2, DIRECT, LOG, WINDOWS, STDIO or CORE
  drivers.
- Basic support for in-memory image files. An HDF5 file can be set
  from or copied into a memory buffer.
- Implemented methods to get/set the user block size in an HDF5 file.
- All read methods now have an optional *out* argument that allows a
  pre-allocated array to be passed to store the data.
- Added support for floating point data types with extended precision
  (Float96, Float128, Complex192 and Complex256).
- Consistent ``create_xxx()`` signatures.
  Now it is possible to create all data sets Array, CArray, EArray,
  VLArray, and Table from existing Python objects.
- Complete rewrite of the `nodes.filenode` module. Now it is fully
  compliant with the interfaces defined in the standard `io` module.
  Only non-buffered binary I/O is supported currently.

Please refer to the RELEASE_NOTES document for a more detailed list of changes in this release. As always, a large number of bugs have been addressed and squashed as well.

In case you want to know in more detail what has changed in this version, please refer to: http://pytables.github.io/release_notes.html

You can download a source package with generated PDF and HTML docs, as well as binaries for Windows, from:
http://sourceforge.net/projects/pytables/files/pytables/3.0.0

For an online version of the manual, visit:
http://pytables.github.io/usersguide/index.html

What is it?
===========

PyTables is a library for managing hierarchical datasets, designed to efficiently cope with extremely large amounts of data, with support for full 64-bit file addressing. PyTables runs on top of the HDF5 library and the NumPy package to achieve maximum throughput and convenient use. PyTables includes OPSI, a new indexing technology that allows performing data lookups in tables exceeding 10 gigarows (10**10 rows) in less than a tenth of a second.

Resources
=========

About PyTables: http://www.pytables.org
About the HDF5 library: http://hdfgroup.org/HDF5/
About NumPy: http://numpy.scipy.org/

Acknowledgments
===============

Thanks to the many users who provided feature improvements, patches, bug reports, support and suggestions. See the ``THANKS`` file in the distribution package for an (incomplete) list of contributors. Most specially, a lot of kudos go to the HDF5 and NumPy makers. Without them, PyTables simply would not exist.

Share your experience
=====================

Let us know of any bugs, suggestions, gripes, kudos, etc. you may have.

----

**Enjoy data!**

-- The PyTables Developers
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From hamra at whamra.com Mon Jun 3 14:55:53 2013
From: hamra at whamra.com (Waleed Hamra)
Date: Mon, 03 Jun 2013 21:55:53 +0300
Subject: [SciPy-User] how do I get the subtrees of dendrogram made by scipy.cluster.hierarchy?
Message-ID: <1649728.LLlivvqnTC@waleed-virtual-machine>

I had some confusion regarding this module (scipy.cluster.hierarchy) ... and still have some!

For example, we have this dendrogram:
http://img62.imageshack.us/img62/8130/3ieb4.png

My question is: how can I extract the coloured subtrees (each one represents a cluster) in a nice format, say SIF format?
Now the code to get the plot above is:

In [1]: import scipy
In [2]: import scipy.cluster.hierarchy as sch
In [3]: import matplotlib.pylab as plt
In [4]: X = scipy.randn(100,2)
In [5]: d = sch.distance.pdist(X)
In [6]: Z = sch.linkage(d, method='complete')
In [7]: P = sch.dendrogram(Z)
In [8]: plt.savefig('plot_dendrogram.png')
In [9]: T = sch.fcluster(Z, 0.5*d.max(), 'distance')

In [10]: T
Out[10]:
array([4, 5, 3, 2, 2, 3, 5, 2, 2, 5, 2, 2, 2, 3, 2, 3, 2, 5, 4, 5, 2, 5, 2,
       3, 3, 3, 1, 3, 4, 2, 2, 4, 2, 4, 3, 3, 2, 5, 5, 5, 3, 2, 2, 2, 5, 4,
       2, 4, 2, 2, 5, 5, 1, 2, 3, 2, 2, 5, 4, 2, 5, 4, 3, 5, 4, 4, 2, 2, 2,
       4, 2, 5, 2, 2, 3, 3, 2, 4, 5, 3, 4, 4, 2, 1, 5, 4, 2, 2, 5, 5, 2, 2,
       5, 5, 5, 4, 3, 3, 2, 4], dtype=int32)

In [11]: sch.leaders(Z,T)
Out[11]: (array([190, 191, 182, 193, 194], dtype=int32),
          array([2, 3, 1, 4, 5], dtype=int32))

So now, the output of fcluster() gives the clustering of the nodes (by their ids), and leaders(), described here, is supposed to return two arrays: the first one contains the leader nodes of the clusters generated by Z (here we can see we have 5 clusters, as in the plot), and the second one the ids of these clusters.

So if leaders() returns, respectively, L and M: L[2]=182 and M[2]=1, then cluster 1 is led by node id 182, which doesn't exist in the observation set X; the documentation says "... then it corresponds to a non-singleton cluster". But I can't get it ...

Also, I converted Z to a tree by sch.to_tree(Z), which will return an easy-to-use tree object that I want to visualize. But which tool should I use as a graphical platform that manipulates these kinds of tree objects as inputs?

thanks in advance :)

From nils106 at googlemail.com Thu Jun 6 09:15:59 2013
From: nils106 at googlemail.com (Nils Wagner)
Date: Thu, 6 Jun 2013 15:15:59 +0200
Subject: [SciPy-User] Read matrix from matrix market format file
In-Reply-To: <7BE88C19F22BFE44979FC114121314D2413EAF2E@MBX2.OPENHOST.FR>
References: <7BE88C19F22BFE44979FC114121314D2413EAF2E@MBX2.OPENHOST.FR>
Message-ID:

Please, can you provide the first three lines of your *.mtx file?

Nils

On Thu, Jun 6, 2013 at 3:05 PM, Valène Pellissier <
Valene.Pellissier at cedrat.com> wrote:

> Hi,
>
> I've got some problems reading a matrix from a matrix market format file.
> I have Python 3.2 and installed Numpy 1.7.1, Scipy 0.12.0 and matplotlib
> 1.2.1 on Windows 64.
> I tried the scipy.io.mmread function but got an error I don't understand.
>
> >>> B=scipy.io.mmread("my_matrix.mtx")
> Traceback (most recent call last):
>   File "", line 1, in
>   File "C:\Python32\lib\site-packages\scipy\io\mmio.py", line 70, in mmread
>     return MMFile().read(source)
>   File "C:\Python32\lib\site-packages\scipy\io\mmio.py", line 301, in read
>     self._parse_header(stream)
>   File "C:\Python32\lib\site-packages\scipy\io\mmio.py", line 337, in _parse_header
>     self.__class__.info(stream)
>   File "C:\Python32\lib\site-packages\scipy\io\mmio.py", line 208, in info
>     raise ValueError("Header line not of length 3: " + line)
>
> It is a big matrix.
> Does someone have a clue about what I'm doing wrong?
>
> Your help will be much appreciated.
> Thanks
> Valene
>
> Valène PELLISSIER - R&D Engineer
> CEDRAT S.A.
> 15 Chemin de Malacher - Inovallée - 38246 MEYLAN cedex - FRANCE
> Phone: +33 (0)4 76 90 50 45 - Fax: +33 (0)4 56 38 08 30
> valene.pellissier at cedrat.com - www.cedrat.com
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From paul.blelloch at gmail.com Sat Jun 8 23:46:03 2013
From: paul.blelloch at gmail.com (Paul Blelloch)
Date: Sat, 8 Jun 2013 20:46:03 -0700 (PDT)
Subject: [SciPy-User] Numpy 1.7.1 Crashing with MKL and AVX instructions
In-Reply-To: <07647d8b-69b2-422a-aa78-b3af9ad11c31@googlegroups.com>
References: <07647d8b-69b2-422a-aa78-b3af9ad11c31@googlegroups.com>
Message-ID: <32f8e11e-7565-48bf-bb3b-fe4be2bee9cd@googlegroups.com>

Christoph Gohlke figured this out and has updated the MKL 64-bit builds of both numpy 1.7.1 and scipy 0.12.0 on his site.
On Friday, May 24, 2013 8:24:37 AM UTC-7, Paul Blelloch wrote: > > I've tried posting this on the numpy list, but it keeps on getting > bounced, so I'll try here: > > I found that when I went from numpy 1.7.0 to 1.7.1 I get a crash whenever > I try an eigenvalue calculation (or any other linalg calculation) on > matrices bigger than about 200x200. This happens with both the latest > Anaconda and WinPython 64-bit Windows distributions (both of which use > numpy 1.7.1) and occurs on all the HP workstations at my company. The > folks at Continuum helped me debug this and determined that the failure was > most likely in use of AVX instructions in MKL, but we haven't found a > workaround yet short of sticking to numpy 1.7.0. The problem is apparently > specific to some combination of processor and OS. Here's what I have: > > > > OS Name Microsoft Windows 7 Professional > Version 6.1.7601 Service Pack 1 Build 7601 > OS Manufacturer Microsoft Corporation > System Name Z420-6 > System Manufacturer Hewlett-Packard > System Model HP Z420 Workstation > System Type x64-based PC > Processor Intel(R) Xeon(R) CPU E5-1650 0 @ 3.20GHz, 3201 Mhz, 6 Core(s), > 12 Logical Processor(s) > BIOS Version/Date Hewlett-Packard J61 v01.14, 7/17/2012 > SMBIOS Version 2.7 > Windows Directory C:\Windows > System Directory C:\Windows\system32 > Boot Device \Device\HarddiskVolume1 > Locale United States > Hardware Abstraction Layer Version = "6.1.7601.17514" > Installed Physical Memory (RAM) 32.0 GB > Total Physical Memory 31.9 GB > Available Physical Memory 28.4 GB > Total Virtual Memory 95.8 GB > Available Virtual Memory 92.1 GB > Page File Space 63.9 GB > Page File C:\pagefile.sys > > > > I'm surprised that I'm the only person out there running into this. Is > there anyone else who's running into this problem with numpy 1.7.1 with MKL > on 64-bit Windows? > > > > -Paul Blelloch > -------------- next part -------------- An HTML attachment was scrubbed... URL: From zacdup at yahoo.com Thu Jun 13 04:07:42 2013 From: zacdup at yahoo.com (Museful) Date: Thu, 13 Jun 2013 01:07:42 -0700 (PDT) Subject: [SciPy-User] Linear interpolation in 3D In-Reply-To: References: Message-ID: <1371110862392-18395.post@n7.nabble.com> See this article on multilinear interpolation . -- View this message in context: http://scipy-user.10969.n7.nabble.com/Linear-interpolation-in-3D-tp12989p18395.html Sent from the Scipy-User mailing list archive at Nabble.com. From njs at pobox.com Fri Jun 14 10:07:50 2013 From: njs at pobox.com (Nathaniel Smith) Date: Fri, 14 Jun 2013 15:07:50 +0100 Subject: [SciPy-User] sqrtm is too slow for matrices of size 1000 In-Reply-To: <63b35b81-023d-4950-a584-7b2bcbfee247@googlegroups.com> References: <63b35b81-023d-4950-a584-7b2bcbfee247@googlegroups.com> Message-ID: On 14 Jun 2013 14:46, "Paul Blelloch" wrote: > > I think that I found the problem. It was in not recognizing that the inner for loop is actually a dot product. If I replace the following lines of code: > > > s = 0 > > for k in range(i+1,j): > > s = s + R[i,k]*R[k,j] > > > with > > > s = np.dot(R[i,(i+1):j],R[(i+1):j,j]) > > > Run time decreases from 367 to 15.6 seconds. My guess is that you could get considerable further speedup, but I'm pleased with the 15.6 seconds. If you copy the sqrtm function from scipy and make that change I think that you'll see considerable improvement. If you'd like to submit a pull request with this change then I bet the scipy developers will be very interested... 
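For anyone following along, it's easy to check that the two forms agree before swapping them in (a small self-contained sanity check, with made-up sizes and indices):

import numpy as np

np.random.seed(0)
R = np.triu(np.random.rand(300, 300))  # upper-triangular, as in sqrtm
i, j = 20, 250

s_loop = 0.0
for k in range(i + 1, j):
    s_loop += R[i, k] * R[k, j]

s_dot = np.dot(R[i, (i + 1):j], R[(i + 1):j, j])
print np.allclose(s_loop, s_dot)  # should print True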
-n
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From warren.weckesser at gmail.com Fri Jun 14 10:31:13 2013
From: warren.weckesser at gmail.com (Warren Weckesser)
Date: Fri, 14 Jun 2013 10:31:13 -0400
Subject: [SciPy-User] Storing return values of optimize.fmin()
In-Reply-To: References: Message-ID:

On Mon, Apr 22, 2013 at 6:12 AM, Jeroen Meidam wrote:

> Hi,
>
> I am using optimize.fmin to minimize a function over 2 parameters.
> In the documentation it says that the output is:
> (xopt, {fopt, iter, funcalls, warnflag})
>
> I have no problem putting xopt into a variable, because this is simply
> done by writing:
> xopt = fmin(function,x0)
> After which I can use xopt for anything I need it for.
>
> What I want, however, is to store "fopt" into a variable, like I did with
> xopt. In the standard case, fopt is only returned as text in the output
> stream:
> "
> Optimization terminated successfully.
>          Current function value: -0.995801  <--- This is what I'm
> interested in
>          Iterations: 35
>          Function evaluations: 71
> "
>
> How can I store it into a variable? Is it possible?

Your email is dated April 22, so you might already have the answer by now, but in case not: use the argument `full_output=True`. For example:

In [97]: xopt, fopt, iter, funcalls, warnflag = fmin(func, 0, full_output=True)
Optimization terminated successfully.
         Current function value: 3.000000
         Iterations: 31
         Function evaluations: 62

In [98]: xopt
Out[98]: array([ 10.])

In [99]: fopt
Out[99]: 3.0

Warren

> Thanks,
> Jeroen
>
> _______________________________________________
> SciPy-User mailing list
> SciPy-User at scipy.org
> http://mail.scipy.org/mailman/listinfo/scipy-user
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From mutantturkey at gmail.com Fri Jun 14 10:36:34 2013
From: mutantturkey at gmail.com (Calvin Morrison)
Date: Fri, 14 Jun 2013 10:36:34 -0400
Subject: [SciPy-User] Sparse Matricies and NNLS
In-Reply-To: References: Message-ID:

Ariel,

Thank you! I will be sure to take a look at it!

Calvin

On 15 April 2013 12:39, Ariel Rokem wrote:
> Hey Calvin,
>
> On Mon, Apr 1, 2013 at 6:07 AM, Calvin Morrison
> wrote:
>>
>> Unfortunately,
>>
>> Tsnnls might have been fast in 2001, but trying it on a moderately sized
>> dataset is beyond slow
>>
>> Calvin
>>
>> On Apr 1, 2013 8:57 AM, "Jonathan Guyer" wrote:
>>>
>>> On Mar 28, 2013, at 5:33 PM, Calvin Morrison wrote:
>>>
>>> > It seems nobody wants to touch the nnls algorithm because the only
>>> > implementation that is floating around is the one from the original
>>> > publication or automatic conversions of it.
>>>
>>> For whatever it's worth, the second google hit for "nnls sparse" is
>>>
>>> http://www.michaelpiatek.com/papers/tsnnls.pdf
>>>
>>> "tsnnls: A solver for large sparse least squares problems with
>>> non-negative variables
>>>
>>> The solution of large, sparse constrained least-squares problems is a
>>> staple in scientific and engineering applications. However, currently
>>> available codes for such problems are proprietary or based on MATLAB. We
>>> announce a freely available C implementation of the fast block pivoting
>>> algorithm of Portugal, Judice, and Vicente. Our version is several times
>>> faster than Matstoms' MATLAB implementation of the same algorithm. Further,
>>> our code matches the accuracy of MATLAB's built-in lsqnonneg function."
>>>
>>> All links to the code seem to be dead, but it's probably worth contacting
>>> the authors.
>>> _______________________________________________ >>> SciPy-User mailing list >>> SciPy-User at scipy.org >>> http://mail.scipy.org/mailman/listinfo/scipy-user >> >> >> _______________________________________________ >> SciPy-User mailing list >> SciPy-User at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-user > > > I've found stochastic gradient descent to be very useful for this kind of > thing. > > Here's an implementation, adapted from a colleague's Matlab implementation: > > https://gist.github.com/arokem/5389417 > > HTH, > > Ariel > > > > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > From newville at cars.uchicago.edu Fri Jun 14 11:59:07 2013 From: newville at cars.uchicago.edu (Matt Newville) Date: Fri, 14 Jun 2013 10:59:07 -0500 Subject: [SciPy-User] Ignore characters while reading text In-Reply-To: References: <5789326.FQK5ToiTHf@horus> Message-ID: On Fri, Jun 14, 2013 at 7:46 AM, Da?id wrote: > On 14 June 2013 14:32, Matt Newville wrote: >> Would this do? >> >> import numpy as np >> from cStringIO import StringIO >> txt= '1 (2 3 4) (5 6 7) (8 9 10)' >> np.loadtxt(StringIO(txt.replace('(', '').replace(')', ''))) > > If I am not mistaken, then you are reading the data twice or thrice. > If this is big, and performance is critical, you may be better off > doing the loadtxt yourself. The core of np.loadtxt is essencially a > "from line in file: data.append(parse(line))", with some wrapping > intelligence, that probably is not needed in your case. > > https://github.com/numpy/numpy/blob/v1.7.0/numpy/lib/npyio.py#L610 ---->loadtxt > https://github.com/numpy/numpy/blob/v1.7.0/numpy/lib/npyio.py#L1573 > --->genfromtxt > > > David. What do you mean by "reading the data twice or thrice"? I would have said text data in this snippet is stored in a string, but never read from a disk. Once read from disk, the string.replace() method is fast, and StringIO makes a string look like a file-like structure, so I don't see how data is "read" multiple times. You are right that numpy.loadtxt is slightly slower than rolling your own. For 2Mb files with 20000 lines of 12 columns (all integers), the test code below gives: Array size = (20000, 12) Results are equivalent? True True True Time, numpy.loadtxt, parens not allowed: 13.5254 sec Time, numpy.loadtxt, parens allowed: 13.7965 sec Time, python list, parens not allowed: 10.1362 sec Time, python list, parens allowed: 10.7494 sec Allowing for parens with text.replace('(', '') etc is not significant -- certainly less time than pre-processing the files in any way. Using numpy.loadtxt is 30% slower than a direct read to a list, then conversion to an array, which might make a difference in some cases, but involves less code, and is more robust against unexpected input. 
import timeit
import numpy as np
from cStringIO import StringIO

def f2arr_np0(fname):
    txt = open(fname, 'r').read()
    return np.loadtxt(StringIO(txt))

def f2arr_np1(fname):
    txt = open(fname, 'r').read()
    return np.loadtxt(StringIO(txt.replace('(', '').replace(')', '')))

def f2arr_py0(fname):
    fh = open(fname, 'r')
    tmp = []
    for line in fh.readlines():
        tmp.append([int(word) for word in line.split()])
    return np.array(tmp)

def f2arr_py1(fname):
    fh = open(fname, 'r')
    tmp = []
    for line in fh.readlines():
        tmp.append([int(word) for word in
                    line.replace('(', '').replace(')', '').split()])
    return np.array(tmp)

# the file test1.dat has embedded parens, test0.dat does not
p0 = f2arr_py0('test0.dat')
p1 = f2arr_py1('test1.dat')
n0 = f2arr_np0('test0.dat')
n1 = f2arr_np1('test1.dat')

print 'Array size = ', n1.shape
print 'Results are equivalent? ', np.all(p0 == p1), np.all(p0 == n0), np.all(p0 == n1)

rnp0 = timeit.timeit("f2arr_np0('test0.dat')", setup='from __main__ import f2arr_np0', number=25)
rnp1 = timeit.timeit("f2arr_np1('test1.dat')", setup='from __main__ import f2arr_np1', number=25)
rpy0 = timeit.timeit("f2arr_py0('test0.dat')", setup='from __main__ import f2arr_py0', number=25)
rpy1 = timeit.timeit("f2arr_py1('test1.dat')", setup='from __main__ import f2arr_py1', number=25)

print 'Time, numpy.loadtxt, parens not allowed: %.4f sec' % rnp0
print 'Time, numpy.loadtxt, parens allowed: %.4f sec' % rnp1
print 'Time, python list, parens not allowed: %.4f sec' % rpy0
print 'Time, python list, parens allowed: %.4f sec' % rpy1

--Matt

From pav at iki.fi Fri Jun 14 17:45:15 2013
From: pav at iki.fi (Pauli Virtanen)
Date: Fri, 14 Jun 2013 21:45:15 +0000 (UTC)
Subject: [SciPy-User] Gradient of spline interpolation at a point wrt changing the knot points
References: Message-ID:

John Salvatier <jsalvatier at gmail.com> writes:
>
> I'm using scipy.interpolate.InterpolatedUnivariateSpline in a
> statistical application.
> y = InterpolatedUnivariateSpline(x0,y0)(x)
>
> I need the derivatives of the spline evaluated at
> a given point with respect to changing the knot values (y0).
> That is, I need the derivative of y wrt y0. Is there
> a way I could compute this?

You can work it out from the B-spline representation:

https://github.com/pv/scipy-work/blob/spline-unify/scipy/interpolate/_bspline.py#L29

Not that there is ready-made code for it, but it should be possible to write it.

-- Pauli Virtanen

From pav at iki.fi Fri Jun 14 17:47:05 2013
From: pav at iki.fi (Pauli Virtanen)
Date: Fri, 14 Jun 2013 21:47:05 +0000 (UTC)
Subject: [SciPy-User] Gradient of spline interpolation at a point wrt changing the knot points
References: Message-ID:

Pauli Virtanen <pav at iki.fi> writes:
> John Salvatier <jsalvatier at gmail.com> writes:
> >
> > I need the derivatives of the spline evaluated at
> > a given point with respect to changing the knot values (y0).
> > That is, I need the derivative of y wrt y0. Is there
> > a way I could compute this?
>
> You can work it out from the B-spline representation:
>
> https://github.com/pv/scipy-work/blob/spline-unify/scipy/interpolate/_bspline.py#L29
>
> Not that there is ready-made code for it, but it should be
> possible to write it.

Sorry, wrote it too fast. Computing the spline coefficients IIRC is a global process, and it is affected by knot locations, so I doubt there is a simple formula for the derivative.
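(The numbers can still be obtained mechanically, though: for fixed x0 the interpolating spline is linear in the data values y0, so evaluating splines built from unit vectors gives the derivatives exactly. A rough sketch, assuming the knot selection depends only on x0, which holds for a pure interpolating spline:)

import numpy as np
from scipy.interpolate import InterpolatedUnivariateSpline

x0 = np.linspace(0, 1, 8)
y0 = np.sin(x0)
x = 0.37  # evaluation point

# d y(x) / d y0[i]: rebuild the spline with the i-th unit vector as data
grad = np.array([float(InterpolatedUnivariateSpline(x0, row)(x))
                 for row in np.eye(len(x0))])

# consistency check via linearity: y(x) == grad . y0
spl = InterpolatedUnivariateSpline(x0, y0)
print np.allclose(np.dot(grad, y0), spl(x))  # True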
-- Pauli Virtanen

From davidmenhur at gmail.com Fri Jun 14 19:03:09 2013
From: davidmenhur at gmail.com (Daπid)
Date: Sat, 15 Jun 2013 01:03:09 +0200
Subject: [SciPy-User] Gradient of spline interpolation at a point wrt changing the knot points
In-Reply-To: References: Message-ID:

On 14 June 2013 23:47, Pauli Virtanen wrote:
> Sorry, wrote it too fast. Computing the spline coefficients
> IIRC is a global process, and it is affected by knot locations,
> so I doubt there is a simple formula for the derivative.

Actually, it only depends on the surrounding points, and with linear properties. See, for example, its matrix form [1]: a multi-diagonal fixed matrix A multiplying a vector of coefficients b:

    A b = y

So one can solve for the vector of coefficients:

    b = A^-1 y

Now, the definition of the derivative with respect to y_i (the value of y at point i):

    db/dy_i = lim_{h->0} (A^-1 (y + h) - A^-1 y) / h = lim_{h->0} A^-1 h / h

where h = [0, 0, ..., h, ..., 0] with h in the i-th position. And this is the sum of the i-th row of the matrix. In the case of cubic interpolation, that would be 0.507 for any point not in the borders or next to one.

David.

[1] Formula 9: http://www.mechanicaldust.com/UCB/math128a/cubic.pdf

From davidmenhur at gmail.com Fri Jun 14 19:15:28 2013
From: davidmenhur at gmail.com (Daπid)
Date: Sat, 15 Jun 2013 01:15:28 +0200
Subject: [SciPy-User] Ignore characters while reading text
In-Reply-To: References: <5789326.FQK5ToiTHf@horus> Message-ID:

On 14 June 2013 17:59, Matt Newville wrote:
> What do you mean by "reading the data twice or thrice"? I would have
> said text data in this snippet is stored in a string, but never read
> from a disk. Once read from disk, the string.replace() method is
> fast, and StringIO makes a string look like a file-like structure, so
> I don't see how data is "read" multiple times.

Bad choice of words on my part, sorry. You are right that the data is read from disk only once, but you loop over it twice when replacing (although this loop is implemented in C, so it is probably quite fast).

Actually, at this point, it was time to do some measurements. I created a random string of numbers, including some parentheses, of one million elements. One pass of replace takes ~1.2 ms, while the two replaces together take 3 ms. So you were right: this is most probably fast enough, and quite close to the best you can get.

From josef.pktd at gmail.com Sat Jun 15 08:26:59 2013
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Sat, 15 Jun 2013 08:26:59 -0400
Subject: [SciPy-User] Welch's Anova (unequal variances)
In-Reply-To: References: Message-ID:

On Wed, Jun 12, 2013 at 11:07 PM, wrote:
> Question out of curiosity
>
> scipy stats has f_oneway which does the standard one-way ANOVA that
> assumes equal variances across groups.
> Similar to Welch's t-test, Welch's ANOVA allows for different
> variances across groups.
>
> I don't find anything with a Google search for "Welch's Anova in python".
>
> Does everyone who uses ANOVA have data with equal variances across
> groups, or is there something that google didn't find, or is everyone
> using resampling methods?

I misspelled stats.f_oneway and got stats.oneway instead. (I had never seen that one.) It has an 'equal_var' option. However, either all the numbers are wrong, or I don't understand what it's doing.
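In the meantime, the textbook Welch (1951) statistic is simple enough to compute directly. A rough sketch (my reading of the formulas, not yet checked against other packages):

import numpy as np
from scipy import stats

def welch_anova(*samples):
    # one-way ANOVA allowing unequal variances across groups
    k = len(samples)
    n = np.array([len(x) for x in samples], float)
    means = np.array([np.mean(x) for x in samples])
    variances = np.array([np.var(x, ddof=1) for x in samples])
    w = n / variances                       # Welch weights
    w_sum = w.sum()
    grand_mean = np.dot(w, means) / w_sum
    tmp = ((1 - w / w_sum) ** 2 / (n - 1)).sum() / (k ** 2 - 1.)
    f_stat = (np.dot(w, (means - grand_mean) ** 2) / (k - 1.)) \
             / (1. + 2. * (k - 2.) * tmp)
    df_num, df_denom = k - 1., 1. / (3. * tmp)
    return f_stat, stats.f.sf(f_stat, df_num, df_denom)

print welch_anova(np.random.randn(10), 2 * np.random.randn(15),
                  0.5 * np.random.randn(20) + 0.1)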
Josef

>
> Josef

From pav at iki.fi Sat Jun 15 09:23:06 2013
From: pav at iki.fi (Pauli Virtanen)
Date: Sat, 15 Jun 2013 16:23:06 +0300
Subject: [SciPy-User] Gradient of spline interpolation at a point wrt changing the knot points
In-Reply-To: References: Message-ID:

On 15.06.2013 02:03, Daπid wrote:
> On 14 June 2013 23:47, Pauli Virtanen wrote:
>> Sorry, wrote it too fast. Computing the spline coefficients
>> IIRC is a global process, and it is affected by knot locations,
>> so I doubt there is a simple formula for the derivative.
>
> Actually, it only depends on the surrounding points, and with linear
> properties. See, for example, its matrix form [1]: a multi-diagonal
> fixed matrix A multiplying a vector of coefficients b.
>
> A b = y

The matrix A depends here on the knot locations, so taking the derivative with respect to them is not so straightforward?

Ah, but I now read the original mail and it says the derivatives should be taken with respect to knot values. This is not so much of a problem...

-- Pauli Virtanen

From tmp50 at ukr.net Sat Jun 15 11:02:01 2013
From: tmp50 at ukr.net (Dmitrey)
Date: Sat, 15 Jun 2013 18:02:01 +0300
Subject: [SciPy-User] new OpenOpt Suite release 0.50
Message-ID: <91637.1371308521.15870062949008277504@ffe6.ukr.net>

Hi all,

I'm glad to inform you about the new OpenOpt Suite release 0.50 (2013-June-15):

* interalg (solver with specifiable accuracy) now works many times
  (sometimes orders of magnitude) faster on (possibly multidimensional)
  integration problems (IP) and on some optimization problems
* Added modeling of dense (MI)(QC)QP in FuncDesigner (alpha version;
  rendering may still be slow)
* Bugfix for the cplex wrapper
* Some improvements for FuncDesigner interval analysis (and thus interalg)
* Added FuncDesigner interval analysis for tan in range (-pi/2, pi/2)
* Some other bugfixes and improvements
* The (proprietary) FuncDesigner stochastic addon is now available as a
  standalone pyc-file, and became available for Python 3 as well

Regards, Dmitrey.
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From thomas.robitaille at gmail.com Sun Jun 16 03:24:28 2013
From: thomas.robitaille at gmail.com (Thomas Robitaille)
Date: Sun, 16 Jun 2013 09:24:28 +0200
Subject: [SciPy-User] Covariance matrix from curve_fit
Message-ID:

Hi everyone,

I have a question regarding the output from the scipy.optimize.curve_fit function - in the following example:

"""
In [1]: import numpy as np

In [2]: from scipy.optimize import curve_fit

In [3]: f = lambda x, a, b: a * x + b

In [4]: x = np.array([0., 1., 2.])

In [5]: y = np.array([1.2, 4.6, 7.8])

In [6]: e = np.array([1., 1., 1.])

In [7]: curve_fit(f, x, y, sigma=e)
Out[7]:
(array([ 3.3       ,  1.23333333]),
 array([[ 0.00333333, -0.00333333],
        [-0.00333333,  0.00555556]]))

In [8]: curve_fit(f, x, y, sigma=e * 100)
Out[8]:
(array([ 3.3       ,  1.23333333]),
 array([[ 0.00333333, -0.00333333],
        [-0.00333333,  0.00555556]]))
"""

it's clear that the covariance matrix does not take into account the uncertainties on the data points. If I do:

"""
popt, pcov = curve_fit(...)
"""

pcov[0,0]**0.5 is therefore not the uncertainty on the parameter, so I was wondering how this should be scaled to give the actual uncertainty on the parameter?

Thanks!
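(Looking at the curve_fit source, pcov seems to be the leastsq covariance multiplied by the reduced chi-square, so I would naively undo that with something like the following - but I'd be happy to be corrected:)

popt, pcov = curve_fit(f, x, y, sigma=e)
chi2 = (((f(x, *popt) - y) / e) ** 2).sum()
dof = len(x) - len(popt)
pcov_abs = pcov / (chi2 / dof)  # covariance if e really are 1-sigma errors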
Tom

From ralf.gommers at gmail.com Sun Jun 16 05:15:16 2013
From: ralf.gommers at gmail.com (Ralf Gommers)
Date: Sun, 16 Jun 2013 11:15:16 +0200
Subject: [SciPy-User] problem with scipy.io.wavfile (urgent)
In-Reply-To: References: Message-ID:

On Mon, May 13, 2013 at 1:00 PM, rohan wadnerkar wrote:

> Hello all,
> I am trying to read a .wav file using scipy.io.wavfile.read(). It reads
> some files properly. For some files it gives the following error...

Hi, can you open an issue at https://github.com/scipy/scipy/issues and include a link to a (preferably small) wav file that it's failing for, plus the exact code to reproduce your issue? Without those details it's not possible to figure out what's going wrong here.

Thanks,
Ralf
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From aldcroft at head.cfa.harvard.edu Sun Jun 16 06:57:17 2013
From: aldcroft at head.cfa.harvard.edu (Aldcroft, Thomas)
Date: Sun, 16 Jun 2013 06:57:17 -0400
Subject: [SciPy-User] Covariance matrix from curve_fit
In-Reply-To: References: Message-ID:

On Sun, Jun 16, 2013 at 3:24 AM, Thomas Robitaille <
thomas.robitaille at gmail.com> wrote:

> Hi everyone,
>
> I have a question regarding the output from the
> scipy.optimize.curve_fit function - in the following example:
>
> """
> In [1]: import numpy as np
>
> In [2]: from scipy.optimize import curve_fit
>
> In [3]: f = lambda x, a, b: a * x + b
>
> In [4]: x = np.array([0., 1., 2.])
>
> In [5]: y = np.array([1.2, 4.6, 7.8])
>
> In [6]: e = np.array([1., 1., 1.])
>
> In [7]: curve_fit(f, x, y, sigma=e)
> Out[7]:
> (array([ 3.3       ,  1.23333333]),
>  array([[ 0.00333333, -0.00333333],
>         [-0.00333333,  0.00555556]]))
>
> In [8]: curve_fit(f, x, y, sigma=e * 100)
> Out[8]:
> (array([ 3.3       ,  1.23333333]),
>  array([[ 0.00333333, -0.00333333],
>         [-0.00333333,  0.00555556]]))
> """
>
> it's clear that the covariance matrix does not take into account the
> uncertainties on the data points. If I do:
>
> """
> popt, pcov = curve_fit(...)
> """
>
> pcov[0,0]**0.5 is therefore not the uncertainty on the parameter,
> so I was wondering how this should be scaled to give the actual
> uncertainty on the parameter?

There was a long discussion by email and then on github about this:

http://mail.scipy.org/pipermail/scipy-user/2011-August/030412.html
https://github.com/scipy/scipy/pull/448

The open pull request has the code to do the scaling you want.

- Tom

>
> Thanks!
> Tom
> _______________________________________________
> SciPy-User mailing list
> SciPy-User at scipy.org
> http://mail.scipy.org/mailman/listinfo/scipy-user
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From thomas.robitaille at gmail.com Sun Jun 16 15:33:59 2013
From: thomas.robitaille at gmail.com (Thomas Robitaille)
Date: Sun, 16 Jun 2013 21:33:59 +0200
Subject: [SciPy-User] Covariance matrix from curve_fit
In-Reply-To: References: Message-ID:

Hi Tom,

On 16 June 2013 12:57, Aldcroft, Thomas wrote:
[clip]
> There was a long discussion by email and then on github about this:
>
> http://mail.scipy.org/pipermail/scipy-user/2011-August/030412.html
> https://github.com/scipy/scipy/pull/448
>
> The open pull request has the code to do the scaling you want.

Thanks for pointing me to this discussion and pull request - I think this pull request should be finalized and, most importantly, the documentation of curve_fit improved. At the moment, the name ``sigma`` implies that the uncertainties are 1-sigma normal deviations, which to me (and a number of other Python users I know) implies that the covariance matrix takes this into account in the parameter uncertainties. I understand that the new (lack of) scaling will have to be optional for backward-compatibility reasons, but it's unfortunate given the connotations a variable like ``sigma`` has...

Cheers,
Tom

From mresimulator at yahoo.com.ar Sun Jun 16 16:51:04 2013
From: mresimulator at yahoo.com.ar (MRE Simulator)
Date: Sun, 16 Jun 2013 13:51:04 -0700 (PDT)
Subject: [SciPy-User] Build a new continuous probability density function
Message-ID: <1371415864.77934.YahooMailNeo@web161505.mail.bf1.yahoo.com>

Hi experts!

I'm a new user of Python, Sage and SciPy.

Using the scipy.stats module, I'm trying to build a new probability density function, f(x) (not included in the scipy module). I want to call this function in a similar way to the other probability density functions in scipy, i.e.:

from scipy.stats import new_function

and do some math with it:

new_function.mean(loc=....., scale=.....), etc.

What must I do (step by step, including the definition of scale and loc)?

Waiting for your answers. Thanks a lot!
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From josef.pktd at gmail.com Sun Jun 16 17:04:41 2013
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Sun, 16 Jun 2013 17:04:41 -0400
Subject: [SciPy-User] Build a new continuous probability density function
In-Reply-To: <1371415864.77934.YahooMailNeo@web161505.mail.bf1.yahoo.com>
References: <1371415864.77934.YahooMailNeo@web161505.mail.bf1.yahoo.com>
Message-ID:

On Sun, Jun 16, 2013 at 4:51 PM, MRE Simulator wrote:
> Hi experts!
> I'm a new user of Python, Sage and SciPy.
> Using the scipy.stats module, I'm trying to build a new probability
> density function, f(x) (not included in the scipy module). I want to
> call this function in a similar way to the other probability density
> functions in scipy, i.e.:
> from scipy.stats import new_function
> and do some math with it:
> new_function.mean(loc=....., scale=.....), etc.
> What must I do (step by step, including the definition of scale and loc)?
> Waiting for your answers.

a bit of explanation:
https://github.com/scipy/scipy/blob/master/scipy/stats/distributions.py#L944

you can look at any distribution as an example, like
https://github.com/scipy/scipy/blob/master/scipy/stats/distributions.py#L3581

Most things are handled generically; the more ._xxx methods you can specify, the better the performance, because you don't have to rely on the generic, often slow implementation.

loc and scale are handled completely generically and cannot be overwritten by the ._xxx methods.

The only tricky part might be if you have bounded support or need to override the shape parameter restrictions.

If you have specific questions, I can answer those. Best if you describe more details of the distribution, or show the pdf and cdf and the parameter restrictions.

Josef

> Thanks a lot!
>
> _______________________________________________
> SciPy-User mailing list
> SciPy-User at scipy.org
> http://mail.scipy.org/mailman/listinfo/scipy-user
>

From mailinglists at xgm.de Mon Jun 17 16:46:04 2013
From: mailinglists at xgm.de (Florian Lindner)
Date: Mon, 17 Jun 2013 22:46:04 +0200
Subject: [SciPy-User] Ignore characters while reading text
In-Reply-To: References: <5789326.FQK5ToiTHf@horus> Message-ID: <1754187.Hls0pJYs3D@horus>

On Friday, 14 June 2013, 07:32:09, Matt Newville wrote:
> On Fri, Jun 14, 2013 at 5:50 AM, Florian Lindner wrote:
> > Hello,
> >
> > I have a text file with data like
> >
> > 1 (2 3 4) (5 6 7) (8 9 10)
> > 2 (4 5 1) (3 6 8) (1 6 45)
> >
> > How can I read that file into an array?
> >
> > [
> >  [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
> >  [2, 4, 5, 1, 3, ... ]
> > ]
> >
> > I tried genfromtxt with deletechars="()" but that seems to affect only
> > 'names'. I also tried delimiter="() " but that didn't work either.
>
> Would this do?
>
> import numpy as np
> from cStringIO import StringIO
> txt = '1 (2 3 4) (5 6 7) (8 9 10)'
> np.loadtxt(StringIO(txt.replace('(', '').replace(')', '')))

Thanks! Speed is not an issue here, it's just a (8000, 20) array.

From josef.pktd at gmail.com Mon Jun 17 23:25:15 2013
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Mon, 17 Jun 2013 23:25:15 -0400
Subject: [SciPy-User] quasi random, Halton sequence
Message-ID:

I didn't find any quasi-random sequences in python that are BSD compatible. The question shows up every few years. Is there anything now?

A quick translation from C to Python (to be translated to Cython and to C, going in a circle); maybe there is something slightly off (e.g.
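For reference, a minimal pure-Python version of the standard radical-inverse construction (an illustrative sketch, not the C translation itself):

import numpy as np

def halton(dim, nbpts):
    # first nbpts points of the dim-dimensional Halton sequence,
    # using the first dim primes as bases (van der Corput radical inverse)
    primes = [2, 3, 5, 7, 11, 13, 17, 19, 23, 29]
    seq = np.empty((nbpts, dim))
    for d in range(dim):
        base = primes[d]
        for i in range(nbpts):
            n = i + 1  # start at 1 to skip the origin
            value, denom = 0.0, 1.0
            while n:
                n, rem = divmod(n, base)
                denom *= base
                value += rem / denom
            seq[i, d] = value
    return seq

print halton(2, 5)  # first column: 1/2, 1/4, 3/4, 1/8, 5/8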
URL: From thomas.robitaille at gmail.com Sun Jun 16 15:33:59 2013 From: thomas.robitaille at gmail.com (Thomas Robitaille) Date: Sun, 16 Jun 2013 21:33:59 +0200 Subject: [SciPy-User] Covariance matrix from curve_fit In-Reply-To: References: Message-ID: Hi Tom, On 16 June 2013 12:57, Aldcroft, Thomas wrote: > > > > On Sun, Jun 16, 2013 at 3:24 AM, Thomas Robitaille > wrote: >> >> Hi everyone, >> >> I have a question regarding the output from the >> scipy.optimize.curve_fit function - in the following example: >> >> """ >> In [1]: import numpy as np >> >> In [2]: from scipy.optimize import curve_fit >> >> In [3]: f = lambda x, a, b: a * x + b >> >> In [4]: x = np.array([0., 1., 2.]) >> >> In [5]: y = np.array([1.2, 4.6, 7.8]) >> >> In [6]: e = np.array([1., 1., 1.]) >> >> In [7]: curve_fit(f, x, y, sigma=e) >> Out[7]: >> (array([ 3.3 , 1.23333333]), >> array([[ 0.00333333, -0.00333333], >> [-0.00333333, 0.00555556]])) >> >> In [8]: curve_fit(f, x, y, sigma=e * 100) >> Out[8]: >> (array([ 3.3 , 1.23333333]), >> array([[ 0.00333333, -0.00333333], >> [-0.00333333, 0.00555556]])) >> """ >> >> it's clear that the covariance matrix does not take into account the >> uncertainties on the data points. If I do: >> >> """ >> popt, pcov = curve_fit(...) >> """ >> >> Then pcov[0,0]**0.5 is therefore not the uncertainty on the parameter, >> so I was wondering how this should be scaled to give the actual >> uncertainty on the parameter? > > > There was a long discussion by email and then github on this: > > http://mail.scipy.org/pipermail/scipy-user/2011-August/030412.html > https://github.com/scipy/scipy/pull/448 Thanks for pointing me to this discussion and pull request - I think this pull request should be finalized, and most importantly, the documentation of curve_fit improved - at the moment, the name ``sigma`` implies that the uncertainties are 1-sigma normal deviations, which to me (and a number of other Python users I know) implies that the covariance matrix takes this into account in the parameter uncertainties. I understand that the new (lack of) scaling will have to be optional for backward-compatibility reasons, but it's unfortunate given the connotations a variable like ``sigma`` has... Cheers, Tom > > The open pull request has the code to do the scaling you want. > > - Tom > >> >> >> Thanks! >> Tom >> _______________________________________________ >> SciPy-User mailing list >> SciPy-User at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-user > > > > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > From mresimulator at yahoo.com.ar Sun Jun 16 16:51:04 2013 From: mresimulator at yahoo.com.ar (MRE Simulator) Date: Sun, 16 Jun 2013 13:51:04 -0700 (PDT) Subject: [SciPy-User] Build a new continuous probability density function Message-ID: <1371415864.77934.YahooMailNeo@web161505.mail.bf1.yahoo.com> Hi experts! Im a newby user of Python, sage and Scipy. Using scipy.stats module, i'm trying to build a new probability density function, f(x) (not include in scipy module). I wanna call this function in similar way that other probability density function in scipy, i.e.: from scipy.stats import new_function. And do some math with it: new_function.mean(loc=....., scale= -----), etc. ?What must i do (step by step, including definition of scale and loc)? Waiting for your answers. Thanks a lot! -------------- next part -------------- An HTML attachment was scrubbed... 
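The usual pattern for this is to subclass scipy.stats.rv_continuous and
implement at least _pdf; loc and scale then come for free from the generic
machinery, as do mean(), rvs() and the rest. A minimal sketch, with an
illustrative density f(x) = 2x on [0, 1] (the class and variable names are
made up for the example), as the reply below explains in more detail:

import numpy as np
from scipy import stats

class NewDist(stats.rv_continuous):
    """Example density: f(x) = 2*x on the support [0, 1]."""
    def _pdf(self, x):
        return 2.0 * x

# a and b give the support of the standardized distribution
new_function = NewDist(a=0.0, b=1.0, name='new_function')

print(new_function.mean(loc=1.0, scale=2.0))  # generic mean, handles loc/scale
print(new_function.rvs(size=3))               # sampling works too (generic ppf)

Defining more of the private methods (_cdf, _ppf, ...) speeds things up,
since otherwise the slow generic implementations are used.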
URL: From josef.pktd at gmail.com Sun Jun 16 17:04:41 2013 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Sun, 16 Jun 2013 17:04:41 -0400 Subject: [SciPy-User] Build a new continuous probability density function In-Reply-To: <1371415864.77934.YahooMailNeo@web161505.mail.bf1.yahoo.com> References: <1371415864.77934.YahooMailNeo@web161505.mail.bf1.yahoo.com> Message-ID: On Sun, Jun 16, 2013 at 4:51 PM, MRE Simulator wrote: > Hi experts! > Im a newby user of Python, sage and Scipy. > Using scipy.stats module, i'm trying to build a new probability density > function, f(x) (not include in scipy module). I wanna call this function in > similar way that other probability density function in scipy, i.e.: > from scipy.stats import new_function. > And do some math with it: > new_function.mean(loc=....., scale= -----), etc. > ?What must i do (step by step, including definition of scale and loc)? > Waiting for your answers. a bit of explanation https://github.com/scipy/scipy/blob/master/scipy/stats/distributions.py#L944 you can look at any distribution as example, like https://github.com/scipy/scipy/blob/master/scipy/stats/distributions.py#L3581 most things are handled generically, the more ._xxx methods you can specify the better is the performance because you don't have to rely on the generic, often slow implementation. loc and scale are handled completely generically and cannot be overwritten by the ._xxx methods. The only tricky part might be if you have bound support or need to override shape parameter restrictions. If you have specific questions, I can answer those. Best if you describe more details of the distribution or show pdf and cdf and parameter restrictions. Josef > Thanks a lot! > > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > From mailinglists at xgm.de Mon Jun 17 16:46:04 2013 From: mailinglists at xgm.de (Florian Lindner) Date: Mon, 17 Jun 2013 22:46:04 +0200 Subject: [SciPy-User] Ignore characters while reading text In-Reply-To: References: <5789326.FQK5ToiTHf@horus> Message-ID: <1754187.Hls0pJYs3D@horus> Am Freitag, 14. Juni 2013, 07:32:09 schrieb Matt Newville: > On Fri, Jun 14, 2013 at 5:50 AM, Florian Lindner wrote: > > Hello, > > > > I have a text file with data like > > > > 1 (2 3 4) (5 6 7) (8 9 10) > > 2 (4 5 1) (3 6 8) (1 6 45) > > > > How can I read that file into an array? > > > > [ > > [1, 2, 3, 5, 6, 7, 8, 9, 10] > > [2, 4, 1, 3, ... ] > > ] > > > > I tried genfromtxt with deletechars="()" but that seems to affect only > > 'names'. I also tried delimiter="() " but that didn't work either. > > Would this do? > > import numpy as np > from cStringIO import StringIO > txt= '1 (2 3 4) (5 6 7) (8 9 10)' > np.loadtxt(StringIO(txt.replace('(', '').replace(')', ''))) Thanks! Speed is not an issue here, it's a just a (8000, 20) array. From josef.pktd at gmail.com Mon Jun 17 23:25:15 2013 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Mon, 17 Jun 2013 23:25:15 -0400 Subject: [SciPy-User] quasi random, Halton sequence Message-ID: I didn't find any quasi-random sequences in python that is BSD compatible. The question shows up every few years. Is there anything now? a quick translation from c to python, (to be translated to cython and to c (going in a circle)) maybe there is something slightly off (e.g. 
gap in circle)

Josef

-----------------
# -*- coding: utf-8 -*-
"""
Created on Mon Jun 17 22:12:21 2013

Author: Sebastien Paris
Josef Perktold translation from c
http://www.mathworks.com/matlabcentral/fileexchange/17457-quasi-montecarlo-halton-sequence-generator
"""

# original C source:
#void halton(int dim , int nbpts, double *h , double *p )
#{
#    double lognbpts , d , sum;
#    int i , j , n , t , b;
#    static int P[11] = {2 ,3 ,5 , 7 , 11 , 13 , 17 , 19 , 23 , 29 , 31};
#
#    lognbpts = log(nbpts + 1);
#
#    for(i = 0 ; i < dim ; i++)
#    {
#        b = P[i];
#        n = (int) ceil(lognbpts/log(b));
#
#        for(t = 0 ; t < n ; t++)
#        {
#            p[t] = pow(b , -(t + 1) );
#        }
#
#        for (j = 0 ; j < nbpts ; j++)
#        {
#            d = j + 1;
#            sum = fmod(d , b)*p[0];
#
#            for (t = 1 ; t < n ; t++)
#            {
#                d = floor(d/b);
#                sum += fmod(d , b)*p[t];
#            }
#
#            h[j*dim + i] = sum;
#        }
#    }
#}

from math import log, floor, ceil, fmod

import numpy as np


def halton(dim, nbpts):
    # one Halton point per row, one prime base per dimension
    h = np.empty(nbpts * dim)
    h.fill(np.nan)
    p = np.empty(nbpts)
    p.fill(np.nan)
    P = [2, 3, 5, 7, 11, 13, 17, 19, 23, 29, 31]
    lognbpts = log(nbpts + 1)
    for i in range(dim):
        b = P[i]
        n = int(ceil(lognbpts / log(b)))
        for t in range(n):
            p[t] = pow(b, -(t + 1))
        for j in range(nbpts):
            # radical inverse of j+1 in base b
            d = j + 1
            sum_ = fmod(d, b) * p[0]
            for t in range(1, n):
                d = floor(d / b)
                sum_ += fmod(d, b) * p[t]
            h[j*dim + i] = sum_

    return h.reshape(nbpts, dim)


x = halton(2, 5000)
#plot(x(1 , :) , x(2 , :) , '+')
print x[:5]

import matplotlib.pyplot as plt

plt.figure()
plt.plot(x[:500, 0], x[:500, 1], '+')
plt.title('uniform-distribution (500)')

plt.figure()
plt.plot(x[:, 0], x[:, 1], '+')
plt.title('uniform-distribution')

from scipy import stats

plt.figure()
xn = stats.norm._ppf(x)
plt.plot(xn[:, 0], xn[:, 1], '+')
plt.title('normal-distribution')

plt.figure()
plt.plot(stats.t._ppf(x[:, 0], 3), stats.t._ppf(x[:, 1], 3), '+')
plt.title('t-distribution')

plt.figure()
x0 = xn[:100]
x0 /= np.sqrt((x0*x0 + 1e-100).sum(1))[:,None]
plt.plot(x0[:, 0], x0[:, 1], '+')
plt.xlim(-1.1, 1.1)
plt.ylim(-1.1, 1.1)
plt.title('uniform on circle')

plt.show()
-----------------

From elmar at net4werling.de Thu Jun 20 15:23:08 2013
From: elmar at net4werling.de (elmar werling)
Date: Thu, 20 Jun 2013 21:23:08 +0200
Subject: [SciPy-User] pandas: strange result using df.date.tolist
Message-ID: 

Hello,

I get rather strange results using pandas "tolist" with datetime values.
As an example, 2013-01-15 13:56:44 is converted to 1970-01-16 141:56:44.

The following script:
--------------------------------------------------------------
import pandas as pd
from datetime import datetime

file_name = 'test_file.xlsx'
reader = pd.ExcelFile(file_name)
sheets = reader.sheet_names
df = reader.parse(sheets[0], header=0, parse_cols='A,B')

date = []
for i in range(len(df)):
    yr = df.date[i].year
    mo = df.date[i].month
    dy = df.date[i].day
    hr = df.time[i].hour
    mi = df.time[i].minute
    sc = df.time[i].second
    _date = datetime(yr, mo, dy, hr, mi, sc)
    date.append(_date)

df['date2'] = date

print 'date from list'
print date
print
print 'date from pd.DataFrame'
print df['date2']
print
print 'date from df.date.values'
print df.date2.values
print
print 'pandas version: ', pd.__version__
--------------------------------------------------------------
gives
--------------------------------------------------------------
Python 2.7.3 (default, Jan  2 2013, 13:56:14)
[GCC 4.7.2] on linux2
Type "copyright", "credits" or "license()" for more information.
>>> date from list [datetime.datetime(2013, 1, 15, 13, 56, 44), datetime.datetime(2013, 1, 18, 8, 17, 13), datetime.datetime(2013, 1, 18, 9, 17, 13), datetime.datetime(2013, 1, 23, 11, 12, 2), datetime.datetime(2013, 1, 28, 9, 21, 12)] date from pd.DataFrame 0 2013-01-15 13:56:44 1 2013-01-18 08:17:13 2 2013-01-18 09:17:13 3 2013-01-23 11:12:02 4 2013-01-28 09:21:12 Name: date2, dtype: datetime64[ns] date from df.date.values [1970-01-16 141:56:44 1970-01-16 208:17:13 1970-01-16 209:17:13 1970-01-16 75:12:02 1970-01-16 193:21:12] pandas version: 0.11.0 -------------------------------------------------------------- Any help is wellcome Elmar -------------- next part -------------- A non-text attachment was scrubbed... Name: test_file.xlsx Type: application/vnd.openxmlformats-officedocument.spreadsheetml.sheet Size: 9513 bytes Desc: not available URL: From jreback at yahoo.com Thu Jun 20 15:35:11 2013 From: jreback at yahoo.com (Jeff Reback) Date: Thu, 20 Jun 2013 12:35:11 -0700 (PDT) Subject: [SciPy-User] pandas: strange result using df.date.tolist In-Reply-To: References: Message-ID: <1371756911.23463.YahooMailNeo@web142701.mail.bf1.yahoo.com> this is a numpy < 1.7.0 issue, see here (a little down): http://pandas.pydata.org/pandas-docs/dev/whatsnew.html#potential-porting-issues-for-pandas-0-7-3-users values are usable, just a printing issue ? In [2]: x = [datetime.datetime(2013, 1, 15, 13, 56, 44), datetime.datetime(2013, 1,? 18, 8, 17, 13), datetime.datetime(2013, 1, 18, 9, 17, 13),? datetime.datetime(2013, 1, 23, 11, 12, 2), datetime.datetime(2013, 1,? 28, 9, 21, 12)] In [4]: Series(x).values Out[4]:? array(['2013-01-15T08:56:44.000000000-0500', ? ? ? ?'2013-01-18T03:17:13.000000000-0500', ? ? ? ?'2013-01-18T04:17:13.000000000-0500', ? ? ? ?'2013-01-23T06:12:02.000000000-0500', ? ? ? ?'2013-01-28T04:21:12.000000000-0500'], dtype='datetime64[ns]') In [5]: np.__version__ Out[5]: '1.7.1' ------------------------------- In [3]: np.__version__ Out[3]: '1.6.1' In [5]: x = [datetime.datetime(2013, 1, 15, 13, 56, 44), datetime.datetime(2013, 1,? ? ?...: 18, 8, 17, 13), datetime.datetime(2013, 1, 18, 9, 17, 13),? ? ?...: datetime.datetime(2013, 1, 23, 11, 12, 2), datetime.datetime(2013, 1,? ? ?...: 28, 9, 21, 12)] In [7]: pd.Series(x).values Out[7]:? array([1970-01-16 141:56:44, 1970-01-16 208:17:13, 1970-01-16 209:17:13, ? ? ? ?1970-01-16 75:12:02, 1970-01-16 193:21:12], dtype=datetime64[ns]) ________________________________ From: elmar werling To: scipy-user at scipy.org Sent: Thursday, June 20, 2013 3:23 PM Subject: [SciPy-User] pandas: strange result using df.date.tolist Hello, I get rather strage results using pandas "tolist" with date time values. As an examples 2013-01-15 13:56:44 is converted to 1970-01-16 141:56:44. The following script: -------------------------------------------------------------- import pandas as pd from datetime import datetime file_name = 'test_file.xlsx' reader = pd.ExcelFile(file_name) sheets = reader.sheet_names df = reader.parse(sheets[0], header=0,parse_cols='A,B') date = [] for i in range(len(df)): ? ? yr = df.date[i].year ? ? mo = df.date[i].month ? ? dy = df.date[i].day ? ? hr = df.time[i].hour ? ? mi = df.time[i].minute ? ? sc = df.time[i].second ? ? _date = datetime(yr, mo, dy, hr, mi, sc) ? ? 
date.append(_date) df['date2'] = date print 'date from list' print date print print 'date from pd.DataFrame' print df['date2'] print print 'date from df.date.values' print df.date2.values print print 'pandas version: ', pd.__version__ -------------------------------------------------------------- gives -------------------------------------------------------------- Python 2.7.3 (default, Jan? 2 2013, 13:56:14) [GCC 4.7.2] on linux2 Type "copyright", "credits" or "license()" for more information. >>> date from list [datetime.datetime(2013, 1, 15, 13, 56, 44), datetime.datetime(2013, 1, 18, 8, 17, 13), datetime.datetime(2013, 1, 18, 9, 17, 13), datetime.datetime(2013, 1, 23, 11, 12, 2), datetime.datetime(2013, 1, 28, 9, 21, 12)] date from pd.DataFrame 0? 2013-01-15 13:56:44 1? 2013-01-18 08:17:13 2? 2013-01-18 09:17:13 3? 2013-01-23 11:12:02 4? 2013-01-28 09:21:12 Name: date2, dtype: datetime64[ns] date from df.date.values [1970-01-16 141:56:44 1970-01-16 208:17:13 1970-01-16 209:17:13 ? 1970-01-16 75:12:02 1970-01-16 193:21:12] pandas version:? 0.11.0 -------------------------------------------------------------- Any help is wellcome Elmar _______________________________________________ SciPy-User mailing list SciPy-User at scipy.org http://mail.scipy.org/mailman/listinfo/scipy-user -------------- next part -------------- An HTML attachment was scrubbed... URL: From elmar at net4werling.de Thu Jun 20 15:59:11 2013 From: elmar at net4werling.de (elmar werling) Date: Thu, 20 Jun 2013 21:59:11 +0200 Subject: [SciPy-User] pandas: strange result using df.date.tolist In-Reply-To: <1371756911.23463.YahooMailNeo@web142701.mail.bf1.yahoo.com> References: <1371756911.23463.YahooMailNeo@web142701.mail.bf1.yahoo.com> Message-ID: thank you, problem solved with numpy 1,7,1 Elmar Am 20.06.2013 21:35, schrieb Jeff Reback: > this is a numpy < 1.7.0 issue, see here (a little down): > > http://pandas.pydata.org/pandas-docs/dev/whatsnew.html#potential-porting-issues-for-pandas-0-7-3-users > > values are usable, just a printing issue > In [2]: x = [datetime.datetime(2013, 1, 15, 13, 56, 44), > datetime.datetime(2013, 1, > 18, 8, 17, 13), datetime.datetime(2013, 1, 18, 9, 17, 13), > datetime.datetime(2013, 1, 23, 11, 12, 2), datetime.datetime(2013, 1, > 28, 9, 21, 12)] > > In [4]: Series(x).values > Out[4]: > array(['2013-01-15T08:56:44.000000000-0500', > '2013-01-18T03:17:13.000000000-0500', > '2013-01-18T04:17:13.000000000-0500', > '2013-01-23T06:12:02.000000000-0500', > '2013-01-28T04:21:12.000000000-0500'], dtype='datetime64[ns]') > > In [5]: np.__version__ > Out[5]: '1.7.1' > > ------------------------------- > > In [3]: np.__version__ > Out[3]: '1.6.1' > > In [5]: x = [datetime.datetime(2013, 1, 15, 13, 56, 44), > datetime.datetime(2013, 1, > ...: 18, 8, 17, 13), datetime.datetime(2013, 1, 18, 9, 17, 13), > ...: datetime.datetime(2013, 1, 23, 11, 12, 2), datetime.datetime(2013, 1, > ...: 28, 9, 21, 12)] > > In [7]: pd.Series(x).values > Out[7]: > array([1970-01-16 141:56:44, 1970-01-16 208:17:13, 1970-01-16 209:17:13, > 1970-01-16 75:12:02, 1970-01-16 193:21:12], dtype=datetime64[ns]) > > ------------------------------------------------------------------------ > *From:* elmar werling > *To:* scipy-user at scipy.org > *Sent:* Thursday, June 20, 2013 3:23 PM > *Subject:* [SciPy-User] pandas: strange result using df.date.tolist > > Hello, > > I get rather strage results using pandas "tolist" with date time values. > As an examples 2013-01-15 13:56:44 is converted to 1970-01-16 141:56:44. 
> > The following script: > -------------------------------------------------------------- > import pandas as pd > from datetime import datetime > > file_name = 'test_file.xlsx' > reader = pd.ExcelFile(file_name) > sheets = reader.sheet_names > df = reader.parse(sheets[0], header=0,parse_cols='A,B') > > date = [] > for i in range(len(df)): > yr = df.date[i].year > mo = df.date[i].month > dy = df.date[i].day > hr = df.time[i].hour > mi = df.time[i].minute > sc = df.time[i].second > _date = datetime(yr, mo, dy, hr, mi, sc) > date.append(_date) > > df['date2'] = date > > print 'date from list' > print date > print > print 'date from pd.DataFrame' > print df['date2'] > print > print 'date from df.date.values' > print df.date2.values > print > print 'pandas version: ', pd.__version__ > > -------------------------------------------------------------- > gives > > -------------------------------------------------------------- > Python 2.7.3 (default, Jan 2 2013, 13:56:14) > [GCC 4.7.2] on linux2 > Type "copyright", "credits" or "license()" for more information. > > >>> > > date from list > [datetime.datetime(2013, 1, 15, 13, 56, 44), datetime.datetime(2013, 1, > 18, 8, 17, 13), datetime.datetime(2013, 1, 18, 9, 17, 13), > datetime.datetime(2013, 1, 23, 11, 12, 2), datetime.datetime(2013, 1, > 28, 9, 21, 12)] > > date from pd.DataFrame > 0 2013-01-15 13:56:44 > 1 2013-01-18 08:17:13 > 2 2013-01-18 09:17:13 > 3 2013-01-23 11:12:02 > 4 2013-01-28 09:21:12 > Name: date2, dtype: datetime64[ns] > > date from df.date.values > [1970-01-16 141:56:44 1970-01-16 208:17:13 1970-01-16 209:17:13 > 1970-01-16 75:12:02 1970-01-16 193:21:12] > > pandas version: 0.11.0 > > -------------------------------------------------------------- > > Any help is wellcome > Elmar > > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > > > > > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user From franz_lambert_engel at yahoo.de Sat Jun 22 10:14:12 2013 From: franz_lambert_engel at yahoo.de (Franz Engel) Date: Sat, 22 Jun 2013 16:14:12 +0200 Subject: [SciPy-User] Spline Interpolation with non continuous data Message-ID: <001601ce6f52$c5aaa760$50fff620$@de> Hi, I try to interpolated a spline throw a dataset (the record of a robot motion path). Usually it works really good, always when the robot drives without stops. But if the robot stops he moves a little bit backwards. If this happens I can't use my "normal" method with the interpolate.UnivariateSpline-function( http://docs.scipy.org/doc/scipy/reference/generated/scipy.interpolate.Univar iateSpline.html), because the robot motion is not longer continuously. Does somebody has an idea which function could solve my problem? Or is there a good filter to reduce the redundant robot path. (the backwards path is only a little bit displaced relative to path without stops) Regards, Franz -------------- next part -------------- An HTML attachment was scrubbed... 
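One route worth noting alongside the suggestions in this thread: when the
robot doubles back, y is no longer a single-valued function of x, but a
parametric spline fitted against a common parameter (point index or arc
length) has no problem with that. A minimal sketch using
scipy.interpolate.splprep, with made-up path data that backtracks slightly:

import numpy as np
from scipy import interpolate

# illustrative (x, y) path that moves backwards a little around x ~ 1.0
x = np.array([0.0, 0.5, 1.0, 0.95, 1.1, 1.6, 2.2])
y = np.array([0.0, 0.4, 0.9, 0.95, 1.1, 1.3, 1.4])

# fit x(u) and y(u) against a shared parameter u; s controls smoothing
tck, u = interpolate.splprep([x, y], s=0.01)

# evaluate the smoothed path on a fine parameter grid
unew = np.linspace(0, 1, 200)
xs, ys = interpolate.splev(unew, tck)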
URL: From tmp50 at ukr.net Tue Jun 25 06:27:35 2013 From: tmp50 at ukr.net (Dmitrey) Date: Tue, 25 Jun 2013 13:27:35 +0300 Subject: [SciPy-User] [ANN] Using some MATLAB optimization solvers from Python (OpenOpt/FuncDesigner) Message-ID: <99974.1372156055.8305133911023747072@ffe16.ukr.net> Hi all, > > FYI some MATLAB solvers now can be involved with OpenOpt or FuncDesigner : > * LP linprog * QP quadprog * LLSP lsqlin * MILP bintprog > > Sparsity handling is supported. > > You should have * MATLAB (or MATLAB Component Runtime) * mlabwrap > Unfortunately, it will hardly work out-of-the-box, you have to adjust some paths and some environment variables. > > As for nonlinear solvers, e.g. fmincon, probably they could be connected via involving C MEX files, but it is not possible with current state of mlabwrap yet. > > Read MATLAB entry for details. > > Regards, D. > -------------- next part -------------- An HTML attachment was scrubbed... URL: From joonhyoung.ro at gmail.com Tue Jun 25 17:10:12 2013 From: joonhyoung.ro at gmail.com (Joon Ro) Date: Tue, 25 Jun 2013 16:10:12 -0500 Subject: [SciPy-User] Spline Interpolation with non continuous data In-Reply-To: <001601ce6f52$c5aaa760$50fff620$@de> References: <001601ce6f52$c5aaa760$50fff620$@de> Message-ID: On Sat, Jun 22, 2013 at 9:14 AM, Franz Engel wrote: > > > I try to interpolated a spline throw a dataset (the record of a robot > motion path). Usually it works really good, always when the robot drives > without stops. But if the robot stops he moves a little bit backwards. If > this happens I can?t use my ?normal? method with the > interpolate.UnivariateSpline-function( > http://docs.scipy.org/doc/scipy/reference/generated/scipy.interpolate.UnivariateSpline.html), > because the robot motion is not longer continuously. > Hi, I am not familiar how your function looks like, but it sounds like you should look for shape-preserving interpolation like monotone cubic Hermite interpolation. Best, Joon -------------- next part -------------- An HTML attachment was scrubbed... URL: From xabart at gmail.com Tue Jun 25 19:18:10 2013 From: xabart at gmail.com (Xavier Barthelemy) Date: Wed, 26 Jun 2013 09:18:10 +1000 Subject: [SciPy-User] Spline Interpolation with non continuous data In-Reply-To: References: <001601ce6f52$c5aaa760$50fff620$@de> Message-ID: Yes, or use tension spline, with the W parameter. It will stop (well, not stop, but limit) the natural spline to oscillate and have the gibbs phenomena. I found that some time ago on stackoverflow. when I want matlab-like medium tension I use W=sqrt(weights) when I want high tension, I use W=weights. #In short, to match matlab's error calculation, you need to pass "w" to splrep or UnivariateSpline, where w = np.sqrt(trapz_weights(x)) # def trapz_weights(x): dx = np.diff(x) w = np.empty(x.shape) w[1:-1] = (dx[1:] + dx[:-1])/2. w[0] = dx[0] / 2. w[-1] = dx[-1] / 2. return w Xavier 2013/6/26 Joon Ro > On Sat, Jun 22, 2013 at 9:14 AM, Franz Engel > wrote: > >> >> >> I try to interpolated a spline throw a dataset (the record of a robot >> motion path). Usually it works really good, always when the robot drives >> without stops. But if the robot stops he moves a little bit backwards. If >> this happens I can?t use my ?normal? method with the >> interpolate.UnivariateSpline-function( >> http://docs.scipy.org/doc/scipy/reference/generated/scipy.interpolate.UnivariateSpline.html), >> because the robot motion is not longer continuously. 
>> > Hi, > > I am not familiar how your function looks like, but it sounds like you > should look for shape-preserving interpolation like monotone cubic Hermite > interpolation. > > Best, > Joon > > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > > -- ? Quand le gouvernement viole les droits du peuple, l'insurrection est, pour le peuple et pour chaque portion du peuple, le plus sacr? des droits et le plus indispensable des devoirs ? D?claration des droits de l'homme et du citoyen, article 35, 1793 -------------- next part -------------- An HTML attachment was scrubbed... URL: From dkajah at gmail.com Tue Jun 25 23:19:46 2013 From: dkajah at gmail.com (Daniel Penalva) Date: Wed, 26 Jun 2013 00:19:46 -0300 Subject: [SciPy-User] Passing keyword argment to functions in odeint Message-ID: Is it possible to pass keyword arguments to functions like func(x, t, K = False), while using odeint to integrate ? i have tried usual way odeint(func, x, t, (K) ), but it did not work. I know that it will work if i pass all the parameters in the order, but i dont wanna to do that weird thing cause in my function i have plenty of keyword params to use... thank you. -------------- next part -------------- An HTML attachment was scrubbed... URL: From juanlu001 at gmail.com Wed Jun 26 06:47:22 2013 From: juanlu001 at gmail.com (Juan Luis Cano) Date: Wed, 26 Jun 2013 12:47:22 +0200 Subject: [SciPy-User] Passing keyword argment to functions in odeint In-Reply-To: References: Message-ID: <51CAC6BA.9020003@gmail.com> On 06/26/2013 05:19 AM, Daniel Penalva wrote: > Is it possible to pass keyword arguments to functions like > > func(x, t, K = False), > > while using odeint to integrate ? > > i have tried usual way odeint(func, x, t, (K) ), but it did not work. Probably because you have to pass them as a tuple (K,) Juanlu From parrenin.ujf at gmail.com Wed Jun 26 12:26:10 2013 From: parrenin.ujf at gmail.com (=?ISO-8859-1?Q?Fr=E9d=E9ric_Parrenin?=) Date: Wed, 26 Jun 2013 18:26:10 +0200 Subject: [SciPy-User] leastsq Message-ID: Dear all, I am experimenting the optimize module of scipy. My optimization problem is a leastsq problem. However, the leastsq function seems to be not appropriate for two reasons: - there is no possibility to specify a covariance matrix between the leastsq terms. They are supposed to be independent, which is a too strong assumption in my case. - the analyzed covariance matrix (i.e. the inverse of the jacobian of the cost function) cannot be simply outputed. Of course I could use a more generic optimization function, like the minimize one. However this seems sub-optimal because the minimisation of a least squares problem can dealt more efficiently (the jacobian of the cost function can be approximated using the jacobian of the terms to minimize). Can anybody help me? Are there plans to improve the leastsq function? Best regards, Fr?d?ric Parrenin -------------- next part -------------- An HTML attachment was scrubbed... URL: From josef.pktd at gmail.com Wed Jun 26 12:47:07 2013 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Wed, 26 Jun 2013 12:47:07 -0400 Subject: [SciPy-User] leastsq In-Reply-To: References: Message-ID: On Wed, Jun 26, 2013 at 12:26 PM, Fr?d?ric Parrenin wrote: > Dear all, > > I am experimenting the optimize module of scipy. > My optimization problem is a leastsq problem. 
> However, the leastsq function seems to be not appropriate for two reasons: > - there is no possibility to specify a covariance matrix between the leastsq > terms. They are supposed to be independent, which is a too strong assumption > in my case. > - the analyzed covariance matrix (i.e. the inverse of the jacobian of the > cost function) cannot be simply outputed. > > Of course I could use a more generic optimization function, like the > minimize one. > However this seems sub-optimal because the minimisation of a least squares > problem can dealt more efficiently (the jacobian of the cost function can be > approximated using the jacobian of the terms to minimize). > > Can anybody help me? > Are there plans to improve the leastsq function? leastsq is a low level function and I think we should not load it up with any options. for weighted least-squares the more highlevel interface with additional results is optimize.curve_fit. However it doesn't allow for a full covariance matrix for the errors. If you want to use leastsq with a full covariance matrix, then you could transform both sides yourself, similar to what is done in curve_fit, but with the cholesky of the inverse covariance matrix. We use that in statsmodels.GLS, but only for linear models. But, if there a large number of observations, then using the full covariance matrix is inefficient, and in many cases a more direct transformation can be used. nonlinear least squares is still largely missing in statsmodels. I don't know if any of the other packages that are based on leastsq have the option. Josef > > Best regards, > > Fr?d?ric Parrenin > > > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > From dkajah at gmail.com Wed Jun 26 16:31:24 2013 From: dkajah at gmail.com (Daniel Penalva) Date: Wed, 26 Jun 2013 17:31:24 -0300 Subject: [SciPy-User] Passing keyword argment to functions in odeint In-Reply-To: <51CAC6BA.9020003@gmail.com> References: <51CAC6BA.9020003@gmail.com> Message-ID: Yah, you are right, i've done that but it not work. Sorry for my late example, a more apropriated one is: func(x, t, K = False, boundary = 'free', mimese = False); and than : odeint(func, x, t, ( K, mimese)); but it will not work as expected. It means that, in odeint, i cant ignore some of keyword argument as in a usual function evaluation. Should this be tagged as a bug ? On Wed, Jun 26, 2013 at 7:47 AM, Juan Luis Cano wrote: > On 06/26/2013 05:19 AM, Daniel Penalva wrote: > > Is it possible to pass keyword arguments to functions like > > > > func(x, t, K = False), > > > > while using odeint to integrate ? > > > > i have tried usual way odeint(func, x, t, (K) ), but it did not work. > > Probably because you have to pass them as a tuple > > (K,) > > Juanlu > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > -------------- next part -------------- An HTML attachment was scrubbed... URL: From dkajah at gmail.com Wed Jun 26 16:31:44 2013 From: dkajah at gmail.com (Daniel Penalva) Date: Wed, 26 Jun 2013 17:31:44 -0300 Subject: [SciPy-User] Passing keyword argment to functions in odeint In-Reply-To: References: <51CAC6BA.9020003@gmail.com> Message-ID: thank you for your answers :-D On Wed, Jun 26, 2013 at 5:31 PM, Daniel Penalva wrote: > Yah, you are right, i've done that but it not work. 
Sorry for my late > example, a more apropriated one is: > > func(x, t, K = False, boundary = 'free', mimese = False); > > and than : > > odeint(func, x, t, ( K, mimese)); > > but it will not work as expected. It means that, in odeint, i cant ignore > some of keyword argument as in a usual function evaluation. Should this be > tagged as a bug ? > > > > > On Wed, Jun 26, 2013 at 7:47 AM, Juan Luis Cano wrote: > >> On 06/26/2013 05:19 AM, Daniel Penalva wrote: >> > Is it possible to pass keyword arguments to functions like >> > >> > func(x, t, K = False), >> > >> > while using odeint to integrate ? >> > >> > i have tried usual way odeint(func, x, t, (K) ), but it did not work. >> >> Probably because you have to pass them as a tuple >> >> (K,) >> >> Juanlu >> _______________________________________________ >> SciPy-User mailing list >> SciPy-User at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-user >> > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From thomas.haslwanter at gmail.com Tue Jun 18 11:53:58 2013 From: thomas.haslwanter at gmail.com (Thomas Haslwanter) Date: Tue, 18 Jun 2013 08:53:58 -0700 (PDT) Subject: [SciPy-User] Filtering history in ipython Message-ID: <7db3f6dd-2d0c-411c-ada2-f06bd9533a22@googlegroups.com> Since I could not find an "ipython" group, I thought I dare to ask the following question here: I am trying to find out how to filter out ipython commands from previous sessions. *What works:* If I want to see all the commands from the current session that contain the word ?plot?, I type In [xx]: %hist ?g plot If I want to see all the commands from the last session, I type In [xx]: %hist ~1/1-~1/1000 *What does NOT work:* What do I have to type to find all the commands from the last session that contain the word ?plot?? In [xx]: %hist ?g plot ~1/1-~1/1000 Should work ? but it does not! Any help would be appreciated. thomas -------------- next part -------------- An HTML attachment was scrubbed... URL: From ecarlson at eng.ua.edu Mon Jun 24 16:44:34 2013 From: ecarlson at eng.ua.edu (Eric Carlson) Date: Mon, 24 Jun 2013 15:44:34 -0500 Subject: [SciPy-User] Spline Interpolation with non continuous data In-Reply-To: <001601ce6f52$c5aaa760$50fff620$@de> References: <001601ce6f52$c5aaa760$50fff620$@de> Message-ID: <51C8AFB2.5050606@eng.ua.edu> Hello, pchip sometimes does wonders ("wonders"=="oscillation-free") on data with piecewise continuous derivatives. Spline fits of degree 1 should also work. OTOH, sometimes pchip makes little difference from regular cubic splines, and in those cases the filtering may be your only option import scipy.interpolate from numpy import linspace xdata = ... ydata = ... f_approx = scipy.interpolate.pchip(xdata,ydata) ####Now that you have continuous function, use it many ways, ### for example: xeval = linspace(x_low, x_high, 201) #set evaluation points yeval = f_approx(xeval) Cheers, Eric On 6/22/2013 9:14 AM, Franz Engel wrote: > Hi, > > I try to interpolated a spline throw a dataset (the record of a robot > motion path). Usually it works really good, always when the robot drives > without stops. But if the robot stops he moves a little bit backwards. > If this happens I can?t use my ?normal? method with the > interpolate.UnivariateSpline-function(http://docs.scipy.org/doc/scipy/reference/generated/scipy.interpolate.UnivariateSpline.html), > because the robot motion is not longer continuously. Does somebody has > an idea which function could solve my problem? 
Or is there a good filter > to reduce the redundant robot path. (the backwards path is only a little > bit displaced relative to path without stops) > > Regards, > > Franz > > > > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > From Phillip.M.Feldman at gmail.com Mon Jun 24 16:52:14 2013 From: Phillip.M.Feldman at gmail.com (pfeldman) Date: Mon, 24 Jun 2013 13:52:14 -0700 (PDT) Subject: [SciPy-User] ftol and xtol In-Reply-To: <1370491726005-18358.post@n7.nabble.com> References: <1370472243657-18355.post@n7.nabble.com> <1370491726005-18358.post@n7.nabble.com> Message-ID: <1372107134975-18455.post@n7.nabble.com> I believe that making the optimization interfaces more uniform would be a substantial improvement, and I'm somewhat disappointed that there has been no discussion on this topic. -- View this message in context: http://scipy-user.10969.n7.nabble.com/ftol-and-xtol-tp18355p18455.html Sent from the Scipy-User mailing list archive at Nabble.com. From ralf.gommers at gmail.com Wed Jun 26 17:05:02 2013 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Wed, 26 Jun 2013 23:05:02 +0200 Subject: [SciPy-User] ftol and xtol In-Reply-To: <1372107134975-18455.post@n7.nabble.com> References: <1370472243657-18355.post@n7.nabble.com> <1370491726005-18358.post@n7.nabble.com> <1372107134975-18455.post@n7.nabble.com> Message-ID: On Mon, Jun 24, 2013 at 10:52 PM, pfeldman wrote: > I believe that making the optimization interfaces more uniform would be a > substantial improvement, and I'm somewhat disappointed that there has been > no discussion on this topic. > Scipy 0.11 made a major step in unifying the interfaces: http://docs.scipy.org/doc/scipy-dev/reference/release.0.11.0.html#scipy-optimize-improvements That was extensively discussed. More (backwards-compatible) improvements in this direction are of course very welcome. Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From robert.kern at gmail.com Wed Jun 26 17:24:07 2013 From: robert.kern at gmail.com (Robert Kern) Date: Wed, 26 Jun 2013 22:24:07 +0100 Subject: [SciPy-User] Filtering history in ipython In-Reply-To: <7db3f6dd-2d0c-411c-ada2-f06bd9533a22@googlegroups.com> References: <7db3f6dd-2d0c-411c-ada2-f06bd9533a22@googlegroups.com> Message-ID: On Tue, Jun 18, 2013 at 4:53 PM, Thomas Haslwanter < thomas.haslwanter at gmail.com> wrote: > Since I could not find an "ipython" group, http://mail.scipy.org/mailman/listinfo/ipython-user -- Robert Kern -------------- next part -------------- An HTML attachment was scrubbed... URL: From helmrp at yahoo.com Wed Jun 26 19:07:23 2013 From: helmrp at yahoo.com (The Helmbolds) Date: Wed, 26 Jun 2013 16:07:23 -0700 (PDT) Subject: [SciPy-User] SciPy-User Digest, Vol 118, Issue 37 In-Reply-To: References: Message-ID: <1372288043.99323.YahooMailNeo@web142801.mail.bf1.yahoo.com> Subject: Re: [SciPy-User] Passing keyword argument to functions in ??? odeint ? The facts are that in `odeint` the?use of "*" is mandatory; while in `ode` it is forbidden. ? Call it a bug or a gotcha or whatever pejorative term you prefer, it's inconsistent, confusing, bad, wrong, inexcusable, violates all kinds of good coding theory and practice, and should be fixed immediately. There's no excuse for this kind of?nonsense. ? Bob -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From parrenin.ujf at gmail.com Thu Jun 27 09:04:16 2013 From: parrenin.ujf at gmail.com (=?ISO-8859-1?Q?Fr=E9d=E9ric_Parrenin?=) Date: Thu, 27 Jun 2013 15:04:16 +0200 Subject: [SciPy-User] leastsq In-Reply-To: References: Message-ID: Dear Josef, Thank you for your answer. OK to use the curve_fit function with a change of variables to have a diagonal covariance matrix. However, here are two questions/remarks: - Curve_fit takes as input both the parameters to fit and a variable x where the data are 'located'. This approach seems sub-optimal since in many inverse problems, the function is evaluated for all x at a time. Running the function independently N times will significantly decrease the computation time. Maybe in this case the best thing to do is to declare that x is empty, but how to do that in practice? - It is not very clear from the scipy doc http://docs.scipy.org/doc/scipy/reference/generated/scipy.optimize.curve_fit.html what the function f is supposed to return. Is it just a scalar function or can it be a ndarray or even something else? Some more complex examples in the doc would really help to better understand how it works. Best regards, Fr?d?ric Parrenin 2013/6/26 > On Wed, Jun 26, 2013 at 12:26 PM, Fr?d?ric Parrenin > wrote: > > Dear all, > > > > I am experimenting the optimize module of scipy. > > My optimization problem is a leastsq problem. > > However, the leastsq function seems to be not appropriate for two > reasons: > > - there is no possibility to specify a covariance matrix between the > leastsq > > terms. They are supposed to be independent, which is a too strong > assumption > > in my case. > > - the analyzed covariance matrix (i.e. the inverse of the jacobian of the > > cost function) cannot be simply outputed. > > > > Of course I could use a more generic optimization function, like the > > minimize one. > > However this seems sub-optimal because the minimisation of a least > squares > > problem can dealt more efficiently (the jacobian of the cost function > can be > > approximated using the jacobian of the terms to minimize). > > > > Can anybody help me? > > Are there plans to improve the leastsq function? > > leastsq is a low level function and I think we should not load it up > with any options. > > for weighted least-squares the more highlevel interface with > additional results is optimize.curve_fit. > However it doesn't allow for a full covariance matrix for the errors. > > If you want to use leastsq with a full covariance matrix, then you > could transform both sides yourself, similar to what is done in > curve_fit, but with the cholesky of the inverse covariance matrix. > We use that in statsmodels.GLS, but only for linear models. > But, if there a large number of observations, then using the full > covariance matrix is inefficient, and in many cases a more direct > transformation can be used. > > nonlinear least squares is still largely missing in statsmodels. > > I don't know if any of the other packages that are based on leastsq > have the option. > > Josef > > > > > > > Best regards, > > > > Fr?d?ric Parrenin > > > > > > _______________________________________________ > > SciPy-User mailing list > > SciPy-User at scipy.org > > http://mail.scipy.org/mailman/listinfo/scipy-user > > > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From josef.pktd at gmail.com Thu Jun 27 09:21:27 2013 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 27 Jun 2013 09:21:27 -0400 Subject: [SciPy-User] leastsq In-Reply-To: References: Message-ID: On Thu, Jun 27, 2013 at 9:04 AM, Fr?d?ric Parrenin wrote: > Dear Josef, > > Thank you for your answer. > OK to use the curve_fit function with a change of variables to have a > diagonal covariance matrix. > > However, here are two questions/remarks: > - Curve_fit takes as input both the parameters to fit and a variable x where > the data are 'located'. This approach seems sub-optimal since in many > inverse problems, the function is evaluated for all x at a time. Running the > function independently N times will significantly decrease the computation > time. I don't understand this part. We are fitting a curve to N observations. We need all of them to calculate the residual sum of squares. > Maybe in this case the best thing to do is to declare that x is empty, but > how to do that in practice? You don't need to use x, you can just write f as a method in a class and attach whatever attributes you want to reuse in the f method. (I'm not completely remember how this was implemented, and no time to look it up right now.) > - It is not very clear from the scipy doc > http://docs.scipy.org/doc/scipy/reference/generated/scipy.optimize.curve_fit.html > what the function f is supposed to return. Is it just a scalar function or > can it be a ndarray or even something else? the function should return an array of predicted values, one element for each observation > > Some more complex examples in the doc would really help to better understand > how it works. There are several examples on stackoverflow, including the case when f is a method in a class Josef > > Best regards, > > Fr?d?ric Parrenin > > > > > > 2013/6/26 > >> On Wed, Jun 26, 2013 at 12:26 PM, Fr?d?ric Parrenin >> wrote: >> > Dear all, >> > >> > I am experimenting the optimize module of scipy. >> > My optimization problem is a leastsq problem. >> > However, the leastsq function seems to be not appropriate for two >> > reasons: >> > - there is no possibility to specify a covariance matrix between the >> > leastsq >> > terms. They are supposed to be independent, which is a too strong >> > assumption >> > in my case. >> > - the analyzed covariance matrix (i.e. the inverse of the jacobian of >> > the >> > cost function) cannot be simply outputed. >> > >> > Of course I could use a more generic optimization function, like the >> > minimize one. >> > However this seems sub-optimal because the minimisation of a least >> > squares >> > problem can dealt more efficiently (the jacobian of the cost function >> > can be >> > approximated using the jacobian of the terms to minimize). >> > >> > Can anybody help me? >> > Are there plans to improve the leastsq function? >> >> leastsq is a low level function and I think we should not load it up >> with any options. >> >> for weighted least-squares the more highlevel interface with >> additional results is optimize.curve_fit. >> However it doesn't allow for a full covariance matrix for the errors. >> >> If you want to use leastsq with a full covariance matrix, then you >> could transform both sides yourself, similar to what is done in >> curve_fit, but with the cholesky of the inverse covariance matrix. >> We use that in statsmodels.GLS, but only for linear models. 
>> But, if there a large number of observations, then using the full >> covariance matrix is inefficient, and in many cases a more direct >> transformation can be used. >> >> nonlinear least squares is still largely missing in statsmodels. >> >> I don't know if any of the other packages that are based on leastsq >> have the option. >> >> Josef >> >> >> >> > >> > Best regards, >> > >> > Fr?d?ric Parrenin >> > >> > >> > _______________________________________________ >> > SciPy-User mailing list >> > SciPy-User at scipy.org >> > http://mail.scipy.org/mailman/listinfo/scipy-user >> > >> _______________________________________________ >> SciPy-User mailing list >> SciPy-User at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-user > > > > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > From newville at cars.uchicago.edu Thu Jun 27 09:35:27 2013 From: newville at cars.uchicago.edu (Matt Newville) Date: Thu, 27 Jun 2013 08:35:27 -0500 Subject: [SciPy-User] leastsq In-Reply-To: References: Message-ID: Hi, I'm pretty baffled by these questions. optimize.leastsq() does not take a covariance matrix as input, but can give one as output. It can take functions used to compute the Jacobian... Perhaps that would accomplish what you're trying to do? optimize.curve_fit() is a wrapper around leastsq() for the common case of "fitting data" in which one has a set of observations at a set of sampled "data points", and a set of variables used in a model for the data. Like leastsq(), it returns the covariance. If curve_fit() does what you need but seems sup-optimal, than leastsq() is probably what you want to use. Hope that helps, but maybe I'm not understanding what you're trying to do. --Matt Newville From parrenin.ujf at gmail.com Thu Jun 27 10:13:33 2013 From: parrenin.ujf at gmail.com (=?ISO-8859-1?Q?Fr=E9d=E9ric_Parrenin?=) Date: Thu, 27 Jun 2013 16:13:33 +0200 Subject: [SciPy-User] leastsq In-Reply-To: References: Message-ID: Dear Matt, Yes, leastsq is probably what I need. As Josef suggested, I can decompose the observation covariance matrix using Choleski to transform the model into one with independent observations. It is still not very clear how to obtain the analyzed (or posterior) covariance matrix around the solution. At first glance, cov_x is what we are looking for but when looking at the doc, it specifies: Uses the fjac and ipvt optional outputs to construct an estimate of the jacobian around the solution. None if a singular matrix encountered (indicates very flat curvature in some direction). This matrix must be multiplied by the residual variance to get the covariance of the parameter estimates ? see curve_fit. Is not jacobian an error in the documentation? I would have expected 'covariance'. Best regards, Fr?d?ric 2013/6/27 Matt Newville > Hi, > > I'm pretty baffled by these questions. optimize.leastsq() does not > take a covariance matrix as input, but can give one as output. It > can take functions used to compute the Jacobian... Perhaps that would > accomplish what you're trying to do? > > optimize.curve_fit() is a wrapper around leastsq() for the common case > of "fitting data" in which one has a set of observations at a set of > sampled "data points", and a set of variables used in a model for the > data. Like leastsq(), it returns the covariance. If curve_fit() > does what you need but seems sup-optimal, than leastsq() is probably > what you want to use. 
> > Hope that helps, but maybe I'm not understanding what you're trying to do. > > --Matt Newville > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > -------------- next part -------------- An HTML attachment was scrubbed... URL: From josef.pktd at gmail.com Thu Jun 27 10:37:35 2013 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 27 Jun 2013 10:37:35 -0400 Subject: [SciPy-User] leastsq In-Reply-To: References: Message-ID: On Thu, Jun 27, 2013 at 10:13 AM, Fr?d?ric Parrenin wrote: > Dear Matt, > > Yes, leastsq is probably what I need. > As Josef suggested, I can decompose the observation covariance matrix using > Choleski to transform the model into one with independent observations. > > It is still not very clear how to obtain the analyzed (or posterior) > covariance matrix around the solution. > At first glance, cov_x is what we are looking for but when looking at the > doc, it specifies: > > Uses the fjac and ipvt optional outputs to construct an estimate of the > jacobian around the solution. None if a singular matrix encountered > (indicates very flat curvature in some direction). This matrix must be > multiplied by the residual variance to get the covariance of the parameter > estimates ? see curve_fit. > > Is not jacobian an error in the documentation? I would have expected > 'covariance'. I always have problems with this part (I read what I want to hear not what is written) As far as I understand It uses the outerproduct of the jacobian as an estimator for the raw covariance. If the error function is a standard least squares problem, then this is also the Hessian (matrix of second derivatives of the objective function). The raw covariance corresponds to inv(X'X) in a linear regression problem (where the X could be the transformed, whitened observations) The jacobian takes the place of X in the non-linear least squares problem. Unless the equation is already prewhitened correctly with the variance of the error (a different set of long threads on the mailing list and github issue), then we need to multiply the raw covariance matrix by an estimate of the error variance. As far as I remember; Even if you use cholsigmainv as transformation (prewithening), the calculations for the covariance is exactly the same as in curve_fit because everything already uses the whitened terms. So I think you could just copy the parts from curve_fit to get the covariance for the more general correlated error case. Josef > > Best regards, > > Fr?d?ric > > > > > > 2013/6/27 Matt Newville >> >> Hi, >> >> I'm pretty baffled by these questions. optimize.leastsq() does not >> take a covariance matrix as input, but can give one as output. It >> can take functions used to compute the Jacobian... Perhaps that would >> accomplish what you're trying to do? >> >> optimize.curve_fit() is a wrapper around leastsq() for the common case >> of "fitting data" in which one has a set of observations at a set of >> sampled "data points", and a set of variables used in a model for the >> data. Like leastsq(), it returns the covariance. If curve_fit() >> does what you need but seems sup-optimal, than leastsq() is probably >> what you want to use. >> >> Hope that helps, but maybe I'm not understanding what you're trying to do. 
>> >> --Matt Newville >> _______________________________________________ >> SciPy-User mailing list >> SciPy-User at scipy.org >> http://mail.scipy.org/mailman/listinfo/scipy-user > > > > _______________________________________________ > SciPy-User mailing list > SciPy-User at scipy.org > http://mail.scipy.org/mailman/listinfo/scipy-user > From takowl at gmail.com Thu Jun 27 20:22:37 2013 From: takowl at gmail.com (Thomas Kluyver) Date: Fri, 28 Jun 2013 01:22:37 +0100 Subject: [SciPy-User] SciPy ecosystem and Python 3 Message-ID: At a conversation over lunch here at the SciPy conference, a few of us mentioned that we're starting to use Python 3 in earnest for our work. For new users, the choice of two major Python versions is confusing and offputting, and we're not going to completely get rid of that confusion until we can simply point new users to Python 3. Most of our introductions, like the SciPy stack install page, point to Python 2 because of the ecosystem, but more and more packages now support Python 3, and we're reaching the point where we could reasonably recommend Python 3 for new users. The aim of this post is to get an overview of where the ecosystem is with: - What packages don't yet support Python 3, or are still too unstable? - How important are each of those: how widely relevant are they, and are substitutes available? - What other conditions need to be met to recommend Python 3? E.g. Scientific Python distros, Linux distro packaging, documentation, etc. Thanks, Thomas -------------- next part -------------- An HTML attachment was scrubbed... URL: From joonhyoung.ro at gmail.com Fri Jun 28 00:19:00 2013 From: joonhyoung.ro at gmail.com (Joon Ro) Date: Thu, 27 Jun 2013 23:19:00 -0500 Subject: [SciPy-User] noob question: numpy copy vs standard lib copy In-Reply-To: References: Message-ID: On Mon, May 13, 2013 at 3:23 PM, psoriasis wrote: I'm new to python. As I understand it, assignment copies by reference > and to do otherwise requires a function like the standard library's > copy or deepcopy functions. However, from what I see numpy has it's > own copy function and using it on a random object (instance of a test > class I made up not an array etc) doesn't seem to return the expected > copy object. I did try importing the copy module and that worked > but then the numpy copy module was "shadowed" but I don't know if > that's a problem. > > Still, I'm sure numpy users need to copy regular objects so what's the > standard solution to this? > > Hi, If you import those modules like import copy and import numpy as np, then you would use those functions with copy.copy() and np.copy() so you would not have the issue. If you import those modules by from copy import * and from numpy import *, and you would have the problem. The first importing method is recommended one since by looking at your code it is explicit where the function comes from. But, if you use numpy functions a lot (especially if you are interactively exploring), then I would import numpy with from numpy import * and import copy module with import copy (or import copy as cp) and make copy.copy()explicit. Let me know if this is not clear. Best, Joon -------------- next part -------------- An HTML attachment was scrubbed... 
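A compact sketch of the convention described above, keeping the stdlib and
numpy copy functions available side by side without shadowing (the class is
made up for the example):

import copy          # stdlib: for arbitrary Python objects
import numpy as np   # np.copy: for array(-like) data

class Thing(object):
    def __init__(self):
        self.data = [1, 2, 3]

obj = Thing()
obj2 = copy.deepcopy(obj)   # independent object, including .data
obj2.data[0] = 99
print(obj.data[0])          # still 1

a = np.arange(5)
b = np.copy(a)              # new array buffer
b[0] = 99
print(a[0])                 # still 0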
URL: 

From tristan.strange at gmail.com Fri Jun 28 06:08:32 2013
From: tristan.strange at gmail.com (Tristan Strange)
Date: Fri, 28 Jun 2013 11:08:32 +0100
Subject: [SciPy-User] butter() and filtfilt() - differences between MATLAB and scipy
Message-ID: 

Hi all,

I'm porting a script from MATLAB to Python and am getting very different
results from the butter functions in the two languages.
From tristan.strange at gmail.com  Fri Jun 28 07:46:54 2013
From: tristan.strange at gmail.com (Tristan Strange)
Date: Fri, 28 Jun 2013 12:46:54 +0100
Subject: [SciPy-User] butter() and filtfilt() - differences between MATLAB and scipy
In-Reply-To:
References:
Message-ID:

On 28 June 2013 12:23, Roger Fearick wrote:
> You're using Python 2.7: maybe 2/(256/2) = 0.

It's not this, I'm afraid. I import division from __future__.

> Tristan Strange a écrit :
>> In MATLAB [...] b comes out as a matrix containing :
>>
>> 2.8109e-15 2.5298e-14 1.0119e-13 2.3612e-13 3.5418e-13 3.5418e-13
>> 2.3612e-13 1.0119e-13 2.5298e-14 2.8109e-15
>>
>> When done in Python using scipy.signal's butter like so:
>> [...] the following warning is issued:
>>
>> /usr/lib/python2.7/dist-packages/scipy/signal/filter_design.py:288:
>> BadCoefficients: Badly conditioned filter coefficients (numerator):
>> the results may be meaningless "results may be meaningless", BadCoefficients)
> You should maybe worry about getting (in MATLAB) such values for b.
> All are pretty close to 0. This is what the scipy implementation is
> warning you about ("Badly conditioned filter coefficients (numerator):
> the results may be meaningless"). Using this b vector may lead to
> output signals prone to numerical noise...

Ok, thanks. Apparently the MATLAB implementation functions as expected....

Anyone else have any ideas?

Cheers,
Tristan
-------------- next part --------------
An HTML attachment was scrubbed...
URL:
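If the goal is to catch this situation programmatically rather than as a
console warning, scipy's BadCoefficients warning can be promoted to an
error. A minimal sketch (assuming scipy.signal exposes BadCoefficients,
as the versions discussed in this thread do):

import warnings
from scipy.signal import butter, BadCoefficients

with warnings.catch_warnings():
    # Turn the "Badly conditioned filter coefficients" warning into an
    # exception so an ill-conditioned design cannot pass silently.
    warnings.simplefilter("error", BadCoefficients)
    try:
        b, a = butter(9, 2.0 / (256.0 / 2.0), 'low')
    except BadCoefficients as err:
        print("Ill-conditioned filter design: %s" % err)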
From silva at lma.cnrs-mrs.fr  Fri Jun 28 09:00:44 2013
From: silva at lma.cnrs-mrs.fr (Fabrice Silva)
Date: Fri, 28 Jun 2013 15:00:44 +0200
Subject: [SciPy-User] butter() and filtfilt() - differences between MATLAB and scipy
In-Reply-To:
References:
Message-ID: <1372424444.5177.18.camel@laptop-101>

Tristan Strange a écrit :
> > You should maybe worry about getting (in MATLAB) such values for b. All
> > are pretty close to 0. This is what the scipy implementation is warning
> > you about ("Badly conditioned filter coefficients (numerator): the
> > results may be meaningless"). Using this b vector may lead to output
> > signals prone to numerical noise...
>
> Ok, thanks. Apparently the MATLAB implementation functions as expected....

Does it mean that you checked the frequency response of the filter?
(with scipy.signal.freqz and MATLAB's freqz)

From paul.blelloch at ata-e.com  Fri Jun 28 11:45:54 2013
From: paul.blelloch at ata-e.com (Paul Blelloch)
Date: Fri, 28 Jun 2013 08:45:54 -0700
Subject: [SciPy-User] butter() and filtfilt() - differences between MATLAB and scipy
Message-ID:

I ran the same Butterworth filter problem and got the following results:

>>> w=2./(256./2.)
>>> b,a=butter(9,w)
>>> b
array([ 2.52984969e-14, 1.01193988e-13, 2.36119304e-13, 3.54178956e-13,
        3.54178956e-13, 2.36119304e-13, 1.01193988e-13, 2.52984969e-14,
        2.81094410e-15])
>>> a
array([ 1., -8.71731939, 33.77839104, -76.36014168, 110.98476353,
        -107.55300968, 69.49343715, -28.86903222, 6.99664203, -0.75373077])

These numerator values are different from MATLAB's, which are:

b =
  Columns 1 through 6
    2.7931e-15 2.5138e-14 1.0055e-13 2.3462e-13 3.5193e-13 3.5193e-13
  Columns 7 through 10
    2.3462e-13 1.0055e-13 2.5138e-14 2.7931e-15
a =
  Columns 1 through 6
    1.0000e+00 -8.7173e+00 3.3778e+01 -7.6360e+01 1.1098e+02 -1.0755e+02
  Columns 7 through 10
    6.9493e+01 -2.8869e+01 6.9966e+00 -7.5373e-01

I don't know why my numerator values are so different from yours. I'm
using a 64-bit MKL-optimized version of scipy 0.12.0.

The differences between MATLAB and scipy are on very small numbers. If
you use a lower-order filter, where the denominator coefficients are
significant, there are no differences between MATLAB and scipy. The
MATLAB results may well be more accurate based on a difference in order
of operations or something, but when I compared the 'filtfilt' output
from the two sets of coefficients applied to white noise, I didn't see
much difference.

What was more interesting to me was that the filtfilt results from
MATLAB were quite different in the initial transient from the filtfilt
results from scipy using the same coefficients. It does appear to me
that there's a difference in the application of the filtfilt function.

-Paul
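Both checks suggested in this thread fit in a few lines. A sketch, with
the parameter values taken from the thread (padtype/padlen assume a SciPy
recent enough, 0.10 or later, to expose them; the transient explanation
is the usual one, not a verified account of MATLAB internals):

import numpy as np
from scipy.signal import butter, filtfilt, freqz

w = 2.0 / (256.0 / 2.0)
b, a = butter(9, w, 'low')

# Fabrice's check: compute the frequency response and compare its
# magnitude against MATLAB's freqz output. If the curves agree, the
# tiny coefficient differences are below the level that matters.
freqs, resp = freqz(b, a, worN=512)
print(np.abs(resp)[:5])

# Paul's observation: scipy's filtfilt pads the input before the forward
# and backward passes, while MATLAB applies its own initial-condition
# scheme, so some startup-transient mismatch is expected even with
# identical b and a. The padding is controllable:
x = np.random.randn(1024)
y_padded = filtfilt(b, a, x)                  # default odd-reflection padding
y_unpadded = filtfilt(b, a, x, padtype=None)  # no padding at all
print(np.max(np.abs(y_padded - y_unpadded)))  # differences live at the edges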
From ralf.gommers at gmail.com  Sat Jun 29 17:47:52 2013
From: ralf.gommers at gmail.com (Ralf Gommers)
Date: Sat, 29 Jun 2013 23:47:52 +0200
Subject: [SciPy-User] SciPy ecosystem and Python 3
In-Reply-To:
References:
Message-ID:

On Fri, Jun 28, 2013 at 2:22 AM, Thomas Kluyver wrote:

> At a conversation over lunch here at the SciPy conference, a few of us
> mentioned that we're starting to use Python 3 in earnest for our work.
>
> For new users, the choice of two major Python versions is confusing and
> off-putting, and we're not going to completely get rid of that confusion
> until we can simply point new users to Python 3. Most of our introductions,
> like the SciPy stack install page, point to Python 2 because of the
> ecosystem, but more and more packages now support Python 3, and we're
> reaching the point where we could reasonably recommend Python 3 for new
> users.
>
> The aim of this post is to get an overview of where the ecosystem is with:
> - What packages don't yet support Python 3, or are still too unstable?

scikit-learn

> - How important are each of those: how widely relevant are they, and are
> substitutes available?
> - What other conditions need to be met to recommend Python 3? E.g.
> Scientific Python distros, Linux distro packaging, documentation, etc.

Before recommending Python 3.x over 2.x I think it's important to not only
have the very latest release or master branch of projects support 3.x, but
at least 1 or 2 more versions. Reason: a lot of users (I suspect the
majority) will not be able to freely upgrade to the latest version of
projects.

Packaging and documentation are of course also important. No 2to3 is
perhaps desirable.

I think we're still one to two years away from recommending 3.x over 2.x.

Ralf
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From takowl at gmail.com  Sat Jun 29 18:14:54 2013
From: takowl at gmail.com (Thomas Kluyver)
Date: Sat, 29 Jun 2013 23:14:54 +0100
Subject: [SciPy-User] SciPy ecosystem and Python 3
In-Reply-To:
References:
Message-ID:

> scikit-learn

Yes, I saw that the sprint here in Austin was planning to work on Py3
support. Olivier, how did that go?

On 29 June 2013 22:47, Ralf Gommers wrote:

> Before recommending Python 3.x over 2.x I think it's important to not only
> have the very latest release or master branch of projects support 3.x, but
> at least 1 or 2 more versions. Reason: a lot of users (I suspect the
> majority) will not be able to freely upgrade to the latest version of
> projects.

I certainly wouldn't count a project as having Py3 support until there's a
released version. But if they can't upgrade to the latest version, chances
are that they also don't have a choice between Py2 and Py3, so our
recommendation doesn't matter much to them. In that case, the
recommendation is targeting the sysadmin who will be deciding what to
install next year.

> Packaging and documentation are of course also important. No 2to3 is
> perhaps desirable.

Packaging: Debian/Ubuntu already have all the core SciPy stack packages
except sympy for Python 3 (and I'm working on getting sympy done). Do we
know where other distros are?
Docs: I suspect there's still some way to go, but a lot of that will
probably be quite mechanical print -> print(). Some of this will have to
be part of the 'Python 3 D-day', because we probably don't want to make
all the docs assume Python 3 while we're still recommending Python 2.

No 2to3: desirable, but not essential, I think. It's more of a developer
problem than a user problem.

Thanks,
Thomas
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From joonhyoung.ro at gmail.com  Sat Jun 29 19:48:21 2013
From: joonhyoung.ro at gmail.com (Joon Ro)
Date: Sat, 29 Jun 2013 18:48:21 -0500
Subject: Re: [SciPy-User] SciPy ecosystem and Python 3
In-Reply-To:
References:
Message-ID:

On Thu, Jun 27, 2013 at 7:22 PM, Thomas Kluyver wrote:

> - What other conditions need to be met to recommend Python 3? E.g.
> Scientific Python distros, Linux distro packaging, documentation, etc.

For me, having a cross-platform scientific Python distro with Python 3
would make a big difference - because even with Python 2, individually
installing different scientific packages can be challenging.

Also, with a distro it would be much easier to check whether the
libraries one uses are available or not - I don't want to keep checking
this for each library that I use; it would be easier to just check a
distro's available library list. And with a distro I can selectively
start using Python 3 for the projects which do not depend on missing
libraries.

-Joon
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From travis at continuum.io  Sat Jun 29 20:01:23 2013
From: travis at continuum.io (Travis Oliphant)
Date: Sat, 29 Jun 2013 19:01:23 -0500
Subject: Re: [SciPy-User] SciPy ecosystem and Python 3
In-Reply-To:
References:
Message-ID:

Anaconda makes it easy to create an environment with Python 3 packages.
Many are already available.

Try:

conda create -n py33 python=3.3 numpy

Look at the Anaconda packages to see all the available free binaries for
Python 3.3.
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From joonhyoung.ro at gmail.com  Sat Jun 29 20:11:44 2013
From: joonhyoung.ro at gmail.com (Joon Ro)
Date: Sat, 29 Jun 2013 19:11:44 -0500
Subject: Re: [SciPy-User] SciPy ecosystem and Python 3
In-Reply-To:
References:
Message-ID:

On Sat, Jun 29, 2013 at 7:01 PM, Travis Oliphant wrote:

> Anaconda makes it easy to create an environment with Python 3 packages.
> Many are already available.
>
> Try:
>
> conda create -n py33 python=3.3 numpy
>
> Look at the Anaconda packages to see all the available free binaries for
> Python 3.3.

Thanks! I will give it a try.

-Joon
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From takowl at gmail.com  Sat Jun 29 23:48:59 2013
From: takowl at gmail.com (Thomas Kluyver)
Date: Sun, 30 Jun 2013 04:48:59 +0100
Subject: [SciPy-User] SciPy ecosystem and Python 3
In-Reply-To:
References:
Message-ID:

On 30 June 2013 00:48, Joon Ro wrote:

> For me, having a cross-platform scientific Python distro with Python 3
> would make a big difference - because even with Python 2, individually
> installing different scientific packages can be challenging.

Almar Klein also has his Pyzo distro based on Python 3. The list of
packages can be seen here, though I guess Anaconda probably has more:
http://www.pyzo.org/packages.html

Thomas
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From pmhobson at gmail.com  Sun Jun 30 10:08:27 2013
From: pmhobson at gmail.com (Paul Hobson)
Date: Sun, 30 Jun 2013 07:08:27 -0700
Subject: [SciPy-User] SciPy ecosystem and Python 3
In-Reply-To:
References:
Message-ID:

On Sat, Jun 29, 2013 at 5:01 PM, Travis Oliphant wrote:

> Anaconda makes it easy to create an environment with Python 3 packages.
> Many are already available.
>
> Try:
>
> conda create -n py33 python=3.3 numpy
>
> Look at the Anaconda packages to see all the available free binaries for
> Python 3.3.

At the risk of hijacking the thread, I'll ask you, Travis: I just created
a Python 3.3 environment on my Windows work machine. Worked wonderfully,
and it's a testament to how hard y'all are working over at Continuum.
Thanks for all of that. I point all our new staff to your Python 2.7
distro.

However, the 3.3 environment is not in the registry, so if there's a
package (particularly pyodbc in this case) that I *need* for work, I
can't install it from the Gohlke collection. Any plan to be able to
register different Python environments in Windows?

Thanks,
-paul
-------------- next part --------------
An HTML attachment was scrubbed...
URL:
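On the registry question: third-party Windows installers such as the
Gohlke binaries typically locate Python through the
Software\Python\PythonCore\<version>\InstallPath registry key, so an
environment can be registered by writing that key yourself. A sketch of
that widely circulated community recipe (not an official Anaconda
feature; run it with the interpreter you want to register, which fills
in both the version and the path):

import sys
try:
    import winreg               # Python 3
except ImportError:
    import _winreg as winreg    # Python 2

version = "%d.%d" % sys.version_info[:2]   # e.g. "3.3"
install_path = sys.prefix                  # directory of the env to register

# Write the per-user InstallPath key that installer .exe files look up.
key_path = "Software\\Python\\PythonCore\\%s\\InstallPath" % version
key = winreg.CreateKey(winreg.HKEY_CURRENT_USER, key_path)
winreg.SetValue(key, "", winreg.REG_SZ, install_path)
winreg.CloseKey(key)
print("Registered %s under HKCU\\%s" % (install_path, key_path))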
From pmhobson at gmail.com  Sun Jun 30 10:11:41 2013
From: pmhobson at gmail.com (Paul Hobson)
Date: Sun, 30 Jun 2013 07:11:41 -0700
Subject: [SciPy-User] SciPy ecosystem and Python 3
In-Reply-To:
References:
Message-ID:

The main roadblock holding me back, just as a user in the environmental
consulting world, is the shapely/descartes combo. I believe that I only
have two projects relying on them, and I could probably work around that
in some way, but I have a notebook in "production" for one project and no
budget to make that switch. Luckily though, that's on a separate machine
from my main working environment, so I actually plan on making the switch
soon (I've installed everything so far).
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From takowl at gmail.com  Sun Jun 30 11:29:35 2013
From: takowl at gmail.com (Thomas Kluyver)
Date: Sun, 30 Jun 2013 16:29:35 +0100
Subject: [SciPy-User] SciPy ecosystem and Python 3
In-Reply-To:
References:
Message-ID:

On 30 June 2013 15:11, Paul Hobson wrote:

> The main roadblock holding me back, just as a user in the environmental
> consulting world, is the shapely/descartes combo. I believe that I only
> have two projects relying on them, and I could probably work around that
> in some way, but I have a notebook in "production" for one project and no
> budget to make that switch.

Thanks. I've started an Etherpad to track the key points of this
discussion - feel free to add to it:
https://etherpad.mozilla.org/JdAHGQihei

Best wishes,
Thomas
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From ralf.gommers at gmail.com  Sun Jun 30 15:04:25 2013
From: ralf.gommers at gmail.com (Ralf Gommers)
Date: Sun, 30 Jun 2013 21:04:25 +0200
Subject: [SciPy-User] SciPy ecosystem and Python 3
In-Reply-To:
References:
Message-ID:

On Sun, Jun 30, 2013 at 12:14 AM, Thomas Kluyver wrote:

> On 29 June 2013 22:47, Ralf Gommers wrote:
>
>> Before recommending Python 3.x over 2.x I think it's important to not
>> only have the very latest release or master branch of projects support 3.x,
>> but at least 1 or 2 more versions. Reason: a lot of users (I suspect the
>> majority) will not be able to freely upgrade to the latest version of
>> projects.
>
> I certainly wouldn't count a project as having Py3 support until there's a
> released version. But if they can't upgrade to the latest version, chances
> are that they also don't have a choice between Py2 and Py3, so our
> recommendation doesn't matter much to them. In that case, the
> recommendation is targeting the sysadmin who will be deciding what to
> install next year.

That's not quite what I meant. Even on a work PC on which I don't have
admin rights I will be able to install Anaconda or another distribution.
All the basic examples in the Python and numpy/scipy docs will work. But
I don't work in a vacuum, so I'll find out at some later stage that some
code that my co-workers wrote depends on version (current minus 2) of
some package that only supports 3.x in version (current). This should be
the exception and not the norm before recommending 3.x, imho.

Also, if many of the active developers haven't yet moved to 3.x (and yes,
that includes me) then it's most definitely too early to recommend said
move to people who aren't very familiar with Python yet.

Cheers,
Ralf
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From joseluismietta at yahoo.com.ar  Wed Jun 26 19:00:29 2013
From: joseluismietta at yahoo.com.ar (Josè Luis Mietta)
Date: Wed, 26 Jun 2013 16:00:29 -0700 (PDT)
Subject: [SciPy-User] At.: question about refresh numpy array in a for-cycle
Message-ID: <1372287629.89197.YahooMailNeo@web142306.mail.bf1.yahoo.com>

Hi experts!

I'm writing code with a numpy array L, a numpy matrix M, and the
following script:

for x in L:
    for l in srange(N):
        z = l in L
        if z is False and M[x,l] != 0:
            L = np.append(L, l)

Here, at the end of the cycle, new elements are incorporated into the
array 'L'. I want these new elements to be considered as the 'x' index
in the cycle. When I execute the script I see that only the 'original'
elements of L are considered as 'x'. How can I fix it?

Waiting for your answers. Thanks a lot!
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From Phillip.M.Feldman at gmail.com  Thu Jun 27 20:33:00 2013
From: Phillip.M.Feldman at gmail.com (pfeldman)
Date: Thu, 27 Jun 2013 17:33:00 -0700 (PDT)
Subject: [SciPy-User] ftol and xtol
In-Reply-To:
References: <1370472243657-18355.post@n7.nabble.com>
 <1370491726005-18358.post@n7.nabble.com>
 <1372107134975-18455.post@n7.nabble.com>
Message-ID:

I am using the latest released version of SciPy, and I agree that the
interfaces are much improved. The termination conditions for the
optimizers are an area where the interfaces are still far from uniform.
To allow termination conditions to be specified in a uniform fashion
without breaking backwards compatibility, one would probably have to
support two interfaces.

Phillip

On Wed, Jun 26, 2013 at 2:05 PM, Ralf Gommers-3 [via Scipy-User] <
ml-node+s10969n18466h44 at n7.nabble.com> wrote:

> On Mon, Jun 24, 2013 at 10:52 PM, pfeldman <[hidden email]> wrote:
>
>> I believe that making the optimization interfaces more uniform would be a
>> substantial improvement, and I'm somewhat disappointed that there has been
>> no discussion on this topic.
>
> Scipy 0.11 made a major step in unifying the interfaces:
> http://docs.scipy.org/doc/scipy-dev/reference/release.0.11.0.html#scipy-optimize-improvements
>
> That was extensively discussed. More (backwards-compatible) improvements
> in this direction are of course very welcome.
>
> Ralf

--
View this message in context: http://scipy-user.10969.n7.nabble.com/ftol-and-xtol-tp18355p18476.html
Sent from the Scipy-User mailing list archive at Nabble.com.
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From olivier.grisel at ensta.org  Sat Jun 29 18:41:29 2013
From: olivier.grisel at ensta.org (Olivier Grisel)
Date: Sat, 29 Jun 2013 17:41:29 -0500
Subject: Re: [SciPy-User] SciPy ecosystem and Python 3
In-Reply-To:
References:
Message-ID:

2013/6/29 Thomas Kluyver :
>> scikit-learn
>
> Yes, I saw that the sprint here in Austin was planning to work on Py3
> support. Olivier, how did that go?

We made some progress but it's not complete yet. I had to work on other
issues as well. The Python 3 port of scikit-learn will probably get
completed during the Paris sprint in July though.

--
Olivier
http://twitter.com/ogrisel - http://github.com/ogrisel