Is there a faster way to do this?

Roy H. Han starsareblueandfaraway at gmail.com
Tue Aug 5 22:41:15 EDT 2008


Why not just use sets?

a = set()
a.add(1)
a.add(2)

On Tue, Aug 5, 2008 at 10:14 PM, RPM1 <rpm9deleteme at earthlink.net> wrote:
> ronald.johnson at gmail.com wrote:
>>
>> I have a csv file containing product information that is 700+ MB in
>> size. I'm trying to go through and pull out unique product ID's only
>> as there are a lot of multiples. My problem is that I am appending the
>> ProductID to an array and then searching through that array each time
>> to see if I've seen the product ID before. So each search takes longer
>> and longer. I let the script run for 2 hours before killing it and had
>> only run through less than 1/10 if the file.
>>
>
> I think you need to learn about Python's dictionary data type.
> --
> http://mail.python.org/mailman/listinfo/python-list
>



More information about the Python-list mailing list