What strategy for random accession of records in massive FASTA file?

Terry Reedy tjreedy at udel.edu
Wed Jan 12 19:19:21 EST 2005


RE: What strategy for random accession of records in massive FASTA file?
"Batista, Facundo" <FBatista at uniFON.com.ar> wrote in message 
news:A128D751272CD411BC9200508BC2194D053C7E95 at escpl.tcp.com.ar...
[If you want to keep the memory usage low, you can parse the file once and 
store in a list the byte position where the record starts and ends. Then 
access the list randomly and read each record with seek() and read().
----------
Or if you want to access by sequence name rather than number, use a dict 
instead.

tjr






More information about the Python-list mailing list