Data cleaning workouts

rusi rustompmody at gmail.com
Fri Aug 24 02:01:00 EDT 2012


On Aug 23, 12:52 pm, Fg Nu <fgn... at yahoo.com> wrote:
> List folk,
>
> I am a newbie trying to get used to Python. I was wondering if anyone knows of web resources that teach good practices in data cleaning and management for statistics/analytics/machine learning, particularly using Python.
>
> Ideally, these would be exercises of the form: here is some horrible raw data --> here is what it should look like after it has been cleaned. Guidelines about steps that should always be taken, practices that should be avoided; basically, workflow of data analysis in Python with special emphasis on the cleaning part.

Since no one has answered, I suggest you narrow your searching from
'python' to 'scipy' (or 'numpy').
Also perhaps ipython.
And then perhaps try those specific mailing lists/fora.

Since I dont know this area much, not saying more.



More information about the Python-list mailing list