[Chicago] Data export ?

Robare, Phillip (TEKSystems) proba at allstate.com
Fri Sep 4 16:36:46 CEST 2015


I was going to respond by pointing you to Pandas but it looks like Tanya beat me to it, so I will just second her suggestion.

Here is an article (http://pbpython.com/excel-pandas-comp.html) on doing common excel tasks using Pandas to get you started with manipulating data within Pandas.

If you are familiar with Relational Calculus and the  internals of SQL you can see that all the data manipulations that are theoretically needed for transforming data are present in the Pandas APIs.

Phil Robare

From: Chicago [mailto:chicago-bounces+proba=allstate.com at python.org] On Behalf Of Tanya Schlusser
Sent: Friday, September 04, 2015 6:28 AM
To: chicago at python.org
Subject: Re: [Chicago] Data export ?

I'm looking for a library or some online tool similar to how bootstrap is
transforming html..to do the following with data:  [list of tasks...]

Try Pandas (PANel Data Analysis Suite). Here's the full API<http://pandas.pydata.org/pandas-docs/stable/api.html>. It provides Series, Data Frame, and Panel objects for containing and transforming data, and does all of the things you ask except for queueing and cron jobs, which can be done using `sched` or celery, respectively.
The caveat is it relies on NumPy so if you're on windows you need a Visual C++ (nonfree version) for 100% compatibility or the Gnu C compiler (for like 99% compatibiity) or the Anaconda distribution<http://continuum.io/downloads>. (Anaconda bought us food last month. They rock.)
For the other  platforms binary wheels exist on PyPi and so  `pip install pandas` will just work.

The below just link to the diffrent relevant sections in the API according to your questions
1. SQL connection options<http://pandas.pydata.org/pandas-docs/stable/api.html#sql>
2. Column fields, etc..<http://pandas.pydata.org/pandas-docs/stable/api.html#attributes-and-underlying-data>. columnwise string functions<http://pandas.pydata.org/pandas-docs/stable/api.html#string-handling>
3. Export to csv / xls<http://pandas.pydata.org/pandas-docs/stable/api.html#id12>
And for the scheduling / queueing:
4. Scheduling is in the standard library already<https://docs.python.org/3/library/sched.html>
5. Here's an awesome task queues for python tutorial<http://www.fullstackpython.com/task-queues.html>
Cheers,
Tanya
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.python.org/pipermail/chicago/attachments/20150904/13eb2c1f/attachment.html>


More information about the Chicago mailing list