Combinations of n Lists
Remco Gerlich
scarblac at pino.selwerd.nl
Wed Feb 28 18:10:45 EST 2001
Warren Postma <embed at geocities.com> wrote in comp.lang.python:
> I was just wondering if anyone has a more general version of this little
> helper function:
>
> def combinations(list1,list2):
> return [ (i,j) for i in list1 for j in list2 ]
>
> print combinations( [1,2,3], ['a','b','c'] )
>
> [(1, 'a'), (1, 'b'), (1, 'c'), (2, 'a'), (2, 'b'), (2, 'c'), (3, 'a'), (3,
> 'b'), (3, 'c')]
>
> So, what if I want
>
> def combinations(*lists):
> ....
> print combinations( [1,2,3], [4,5,6], [7,8,9], .... )
> [ [1,4,7,....], .... [1,4,8, ... ], ..... ]
>
> Any takers!? ;-)
Let's see, obvious tail recursive solution...
def combinations1(*lists):
if len(lists) == 0:
return []
elif len(lists) == 1:
return [(x,) for x in lists[0]]
else:
return [(x,)+y for x in lists[0] for y in combinations(lists[1:])]
Change to iterative version:
def combinations2(*lists):
if len(lists) == 0:
return []
result = [(x,) for x in lists[0]]
for next in lists[1:]:
result = [x+(y,) for x in result for y in next]
return result
Well, that was obvious. Probably full of errors too, knowing myself (can't
test here). Hard to believe this is going to be the fastest way though,
building all those lists. Let's do something clever based on the fact that
every combination can be thought of as a number between 0 and
len(list0)*len(list1)*...*len(listn).
def combinations3(*lists):
import operator
lengths = map(len, lists)
modulos = [reduce(operator.mul, lengths[:i],1) for i in range(len(lengths))]
tuples = reduce(operator.mul, lengths)
result = [None]*tuples
for i in range(tuples):
result[i] = tuple([lists[(i/modulos[list])%(lengths[list])]
for list in range(len(lists))])
return result
Hmm, this is a bit too sick, I doubt it's going to work at once, but I'm
curious what its speed is compared to the others :). Alas, can't test until
tomorrow.
If you have, say, six lists of lengths 6,5,7,8,9,10, then tuple nr. 5432 is
going to have item (5432/(6*5*7*8))%9 = 3 chosen from the fifth list, and so
on. 'modulos' is therefore misnamed, but it stores the number '6*5*7*8' for
the fifth list, etc.
Whatever, not going to write more docs now, time to sleep :).
I think this version is going to get trouble quickly when the amount of
tuples is more than maxint. But I think lists get trouble then anyway.
Thanks for the fun assignment :)
--
Remco Gerlich
More information about the Python-list
mailing list