[Tutor] weird lambda expression -- can someone help me understand how this works

Amit Saha amitsaha.in at gmail.com
Sat Dec 14 03:31:47 CET 2013


On Sat, Dec 14, 2013 at 12:29 PM, Amit Saha <amitsaha.in at gmail.com> wrote:
> On Sat, Dec 14, 2013 at 12:14 PM, Michael Crawford <dalupus at gmail.com> wrote:
>> I found this piece of code on github
>>
>> https://gist.github.com/kljensen/5452382
>>
>> def one_hot_dataframe(data, cols, replace=False):
>>     """ Takes a dataframe and a list of columns that need to be encoded.
>>         Returns a 3-tuple comprising the data, the vectorized data,
>>         and the fitted vectorizor.
>>     """
>>     vec = DictVectorizer()
>>     mkdict = lambda row: dict((col, row[col]) for col in cols)
>> #<<<<<<<<<<<<<<<<<<
>>     vecData = pandas.DataFrame(vec.fit_transform(data[cols].apply(mkdict,
>> axis=1)).toarray())
>>     vecData.columns = vec.get_feature_names()
>>     vecData.index = data.index
>>     if replace is True:
>>         data = data.drop(cols, axis=1)
>>         data = data.join(vecData)
>>     return (data, vecData, vec)
>>
>> I don't understand how that lambda expression works.
>> For starters where did row come from?
>> How did it know it was working on data?
>
> Consider this simple example:
>
>>>> l = lambda x: x**2
>>>> apply(l, (3,))
> 9
>
> A lambda is an anonymous function. So, when you use apply(), the
> lambda, l gets the value 3 in x and then returns x**2 which is 9 in
> this case.

Argh, no sorry, that doesn't answer your question. Sorry, my bad. I
should have read your query properly.


-- 
http://echorand.me


More information about the Tutor mailing list