Data cleaning workouts

Mark Lawrence breamoreboy at yahoo.co.uk
Fri Aug 24 04:16:30 EDT 2012


Elevated Python types don't get their hands dirty top posting, but I'm 
certain that they would when talking data or there wouldn't be so many 
debates on which data type to use :)

On 24/08/2012 07:48, Fg Nu wrote:
>
>
> Thanks. I will try the SciPy list. It was a bit of a hail mary anyway. Pretty sure elevated Python types don't actually get their hands dirty with data. ;)
>
>
>
> ----- Original Message -----
> From: rusi <rustompmody at gmail.com>
> To: python-list at python.org
> Cc:
> Sent: Thursday, August 23, 2012 11:01 PM
> Subject: Re: Data cleaning workouts
>
> On Aug 23, 12:52 pm, Fg Nu <fgn... at yahoo.com> wrote:
>> List folk,
>>
>> I am a newbie trying to get used to Python. I was wondering if anyone knows of web resources that teach good practices in data cleaning and management for statistics/analytics/machine learning, particularly using Python.
>>
>> Ideally, these would be exercises of the form: here is some horrible raw data --> here is what it should look like after it has been cleaned. Guidelines about steps that should always be taken, practices that should be avoided; basically, workflow of data analysis in Python with special emphasis on the cleaning part.
>
> Since no one has answered, I suggest you narrow your searching from
> 'python' to 'scipy' (or 'numpy').
> Also perhaps ipython.
> And then perhaps try those specific mailing lists/fora.
>
> Since I dont know this area much, not saying more.
>


-- 
Cheers.

Mark Lawrence.




More information about the Python-list mailing list