[AstroPy] anyone using Pandas?

Thøger Rivera-Thorsen trive at astro.su.se
Sat Dec 15 06:44:18 EST 2012


Hi;

It still depends. Your MultiIndex can be made from lists, tuples or 
arrays; I usually use arrays since I'm familiar with them from using 
NumPy.  For a 3-level index, you need to have 3 1D-arrays of equal 
length, that civer alle the atomic nr. - Ion - Level combinations you 
want covered. Toy example; 3 atoms with three levels each, atoms denoted 
by number and levels by letters. Let's say the two first atoms have 3, 
and your third atom five levels. Then your arrays should be:
atoms = sp.array(['1', '1', '1', '2', '2', '2', '3', '3', '3', '3', '3'])
levels  = sp.array(['a', 'b', 'c', 'a', 'b', 'c', 'a', 'b', 'c', 'd', 'e'])

idx = pd.MultiIndex.from_arrays([atoms, levels], names=['atoms', 'levels'])

Your data should then be a 11x2 array with energy and g as the columns. 
You then build the array as:

MyDF = pd.DataFrame(dataarray, index=idx, columns=['Nrg_lvl', 'g.'])


That gives:

In [22]: print MyDF

                Nrg_lvl        g.

atoms levels

1     a       0.052846  0.533835

       b       0.185949  0.064069

       c       0.384630  0.646803

2     a       0.835958  0.392594

       b       0.016399  0.165862

       c       0.300874  0.975590

3     a       0.124640  0.815488

       b       0.590613  0.749555

       c       0.284481  0.299149

       d       0.104408  0.723406

       e       0.733087  0.730055


(Here I have just used random numbers for the data).

 From here, you can do all the Pandas goodness you want:

In [23]: print MyDF.xs('b', level=1)

         Nrg_lvl        g.

atoms

1      0.185949  0.064069

2      0.016399  0.165862

3      0.590613  0.749555


Once you have the basic structure built, it is easy to extend either by 
concatenating or by using the set_value function. However, most of these 
operations are returning a modified copy rather than in-place 
operations, so better build as much of the DataFrame as possible in one 
go, if you're concerned about memory overhead. Åandas doesn't seem to be 
designed to gradually build its data structures.

Cheers;

Thøger


On 12/15/2012 05:52 AM, Wolfgang Kerzendorf wrote:
> Sorry before this didn't go to the list:
>
> Hey guys,
>
> I'm slowly understanding better what pandas is about. The object I'm representing in a pandas data frame is an atomic database.
> Each line in there has atomic_number, ion_number, level_number, energy, g. So I have created a Pandas dataFrame and then set the index to atomic_number, ion_number, level_number. Now I want to make a new DataFrame where atomic_number in (6, 7, 8, 9) - but it is an index. how do I do that?
>
> Cheers
>     W
> On 2012-12-14, at 8:15 PM, Thøger Rivera-Thorsen <trive at astro.su.se> wrote:
>
>> Like Tyler said, can you be a bit more specific about what you want to
>> obtain?
>> What is your starting point, and where do you want to go from there?
>>
>> I've been looking quite a bit into multiindexing lately, and it is very
>> handy but it does have some caveats.
>>
>>
>>
>>
>>
>>
>> On 12/14/2012 09:04 PM, Wolfgang Kerzendorf wrote:
>>> Hey guys,
>>>
>>> I'm trying to play around with pandas. Currently I have a look at join and it always seems to copy the data. I believe I want to use advanced indexing, but am not quite sure how to do that: any Pandas experts here?
>>>
>>> Cheers
>>>     Wolfgang
>>> _______________________________________________
>>> AstroPy mailing list
>>> AstroPy at scipy.org
>>> http://mail.scipy.org/mailman/listinfo/astropy
>> _______________________________________________
>> AstroPy mailing list
>> AstroPy at scipy.org
>> http://mail.scipy.org/mailman/listinfo/astropy
> _______________________________________________
> AstroPy mailing list
> AstroPy at scipy.org
> http://mail.scipy.org/mailman/listinfo/astropy




More information about the AstroPy mailing list