[Tutor] Processing CSV files
Sivaram Neelakantan
nsivaram.net at gmail.com
Wed Oct 9 19:34:08 CEST 2013
On Wed, Oct 09 2013, Leena Gupta wrote:
> Hello,
>
> Looking for some inputs on Python's csv processing feature.
>
> I need to process a large csv file every 5-10 minutes. The file could
> contain 3mill to 10 mill rows and size could be 6MB to 10MB(+). As part of
> the processing, I need to sum up a number value by grouping on certain
> attributes and store the output in a datastore. I wanted to know if Python
> is recommended and can it be used for processing data in csv files of this
> size? Any issues that we need to be aware of? I believe Python has a csv
> library as well.
[snipped 6 lines]
I've found pandas to be very useful for this. Its read_csv function
loads a CSV file into a DataFrame, and groupby followed by sum gives
the per-group totals you describe.
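
A minimal sketch of that approach, in case it helps. The column names
(region, product, amount) and the helper name sum_by_group are made up
for illustration, since the actual file layout was not given. It reads
the file in chunks via the chunksize argument of read_csv, so the whole
file does not have to be held in memory at once, and then combines the
per-chunk sums.

```python
import io

import pandas as pd

# Hypothetical sample data; the real column names are not known.
SAMPLE_CSV = """region,product,amount
north,widget,10
north,widget,5
south,widget,7
north,gadget,3
south,widget,1
"""


def sum_by_group(source, group_cols, value_col, chunksize=100_000):
    """Sum value_col for each combination of group_cols, reading in chunks."""
    partials = []
    for chunk in pd.read_csv(source, chunksize=chunksize):
        # Partial sum per group within this chunk only
        partials.append(chunk.groupby(group_cols)[value_col].sum())
    # A group can appear in several chunks, so sum the partial sums again
    combined = pd.concat(partials)
    return combined.groupby(level=group_cols).sum()


# A tiny chunksize is used here only to exercise the chunked path.
totals = sum_by_group(
    io.StringIO(SAMPLE_CSV), ["region", "product"], "amount", chunksize=2
)
print(totals)
```

For a real file, pass its path instead of the StringIO object. The
result is a Series indexed by the grouping columns, which can then be
written to whatever datastore is in use.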
sivaram
--