sorting 1172026 entries

J. Mwebaze jmwebaze at gmail.com
Sun May 6 12:29:10 EDT 2012


Sorry, see the corrected code:


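import glob
import os
import dateutil.parser as parser

for filename in glob.glob('*.txt'):
    temp = []
    with open(filename) as f:       # closed automatically when the block ends
        for line in f:              # iterate lazily instead of readlines()
            fields = line.split()   # each line is "<date> <value>"
            temp.append((parser.parse(fields[0]), float(fields[1])))
    temp.sort()                     # in place: by date, then by value
    # splitext drops only the '.txt' suffix; str.strip('.txt') would also
    # eat any leading or trailing 't', 'x' and '.' characters of the name
    with open(os.path.splitext(filename)[0] + '.sorted', 'w') as p:
        for i, j in temp:
            p.write('%s %s\n' % (i, j))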
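
A note on building the output file name: str.strip('.txt') treats its
argument as a set of characters to remove from both ends, not as a
suffix. A minimal sketch of the difference, where sorted_name is a
hypothetical helper introduced only for illustration:

```python
import os.path


def sorted_name(filename):
    """Return the output name with the final extension swapped for '.sorted'.

    Hypothetical helper, not part of the posted script.
    """
    return os.path.splitext(filename)[0] + '.sorted'


# strip() removes any leading/trailing '.', 't' or 'x' characters
print('test.txt'.strip('.txt'))   # es
print(sorted_name('test.txt'))    # test.sorted
```

For file names that do not begin or end with those characters the two
approaches give the same result, so this only matters for some names.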


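To check whether the sort itself or the parsing is the slow part, the
two stages can be timed separately. The following is only a sketch
under stated assumptions: it uses synthetic data rather than the real
files, parses dates with datetime.strptime and a fixed format instead
of dateutil, and make_lines and time_stages are hypothetical names.

```python
import random
import time
from datetime import datetime, timedelta


def make_lines(n, seed=0):
    """Build n synthetic '<date> <value>' lines (stand-in for the real files)."""
    rng = random.Random(seed)
    base = datetime(2012, 1, 1)
    lines = []
    for _ in range(n):
        day = base + timedelta(days=rng.getrandbits(10))
        lines.append('%s %d' % (day.strftime('%Y-%m-%d'), rng.getrandbits(32)))
    return lines


def time_stages(lines):
    """Return (sorted_records, parse_seconds, sort_seconds)."""
    t0 = time.perf_counter()
    records = []
    for line in lines:
        date_text, value = line.split()
        # Assumption: a fixed date format; the posted script uses dateutil
        records.append((datetime.strptime(date_text, '%Y-%m-%d'), float(value)))
    t1 = time.perf_counter()
    records.sort()  # tuples compare by date first, then by value
    t2 = time.perf_counter()
    return records, t1 - t0, t2 - t1


if __name__ == '__main__':
    records, parse_s, sort_s = time_stages(make_lines(100000))
    print('parse: %.2fs  sort: %.2fs' % (parse_s, sort_s))
```

If the parse time dominates, the sort is not the bottleneck and the
date parsing is the place to look.
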
On Sun, May 6, 2012 at 6:26 PM, J. Mwebaze <jmwebaze at gmail.com> wrote:

> I have attached one of the files; try sorting it and let me know the results.
> Kindly sort by date. Oops, I am told the file exceeds 25 MB.
>
> below is the code
>
> import glob
> txtfiles =glob.glob('*.txt')
> import dateutil.parser as parser
>
>
> for filename in txtfiles:
>    temp=[]
>    f=open(filename)
>    for line in f.readlines():
>      line = line.strip()
>      line=line.split()
>      temp.append((parser.parse(line[0]), float(line[1])))
>    temp=sorted(temp)
>    with open(filename.strip('.txt')+ '.sorted', 'wb') as p:
>         for i, j in temp:
>            p.write('%s %s\n' %(str(i),j))
>
>
> On Sun, May 6, 2012 at 6:21 PM, Devin Jeanpierre <jeanpierreda at gmail.com> wrote:
>
>> On Sun, May 6, 2012 at 12:11 PM, J. Mwebaze <jmwebaze at gmail.com> wrote:
>> > [ (datetime, int) ] * 1172026
>>
>> I can't duplicate slowness. It finishes fairly quickly here. Maybe you
>> could try posting specific code? It might be something else that is
>> making your program take forever.
>>
>> >>> x = [(datetime.datetime.now() + datetime.timedelta(random.getrandbits(10)),
>> ...       random.getrandbits(32)) for _ in xrange(1172026)]
>> >>> random.shuffle(x)
>> >>> x.sort()
>> >>>
>>
>> -- Devin
>>
>
>
>
> --
> *Mob UG: +256 (0) 70 1735800 | NL +31 (0) 6 852 841 38 | Gtalk: jmwebaze
> |  skype: mwebazej | URL: www.astro.rug.nl/~jmwebaze
>
> /* Life runs on code */*
>
>

