Using python to delta-load files into a central DB
Gabriel Genellina
gagsl-py2 at yahoo.com.ar
Thu Apr 12 21:19:32 EDT 2007
On Thu, 12 Apr 2007 14:05:15 -0300, Chris Nethery <gilcneth at earthlink.net>
wrote:
> At present, users of the separate application can run recalculation
> functions that modify all 700 files at once, causing my code to take the
> whole ball of wax, rather than just the data that has changed.
Are they text files, or something else?
What kind of modifications are made? Are some lines changed, deleted, or
added, or is a column recalculated across the whole file?
> What I would like to do is spawn separate processes and load only the
> delta data. The data must be 100% reliable, so I'm leery of using
> something like difflib. I also want to make sure that my code scales,
> since the number of files is ever-increasing.
Why don't you like difflib? AFAIK it has no known bugs.
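
As a rough illustration of what difflib could give you here, the sketch below compares two versions of a file line by line and collects the lines that were removed and added. It assumes the files are line-oriented text; the function name line_delta and the sample rows are made up for the example and are not from your application.

```python
import difflib

def line_delta(old_lines, new_lines):
    """Return (removed, added) lines between two versions of a file.

    Hypothetical helper: assumes both inputs are lists of text lines.
    """
    # autojunk=False turns off the "popular element" heuristic, so every
    # line takes part in the comparison even in large, repetitive files.
    matcher = difflib.SequenceMatcher(None, old_lines, new_lines, autojunk=False)
    removed, added = [], []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag in ("replace", "delete"):
            removed.extend(old_lines[i1:i2])
        if tag in ("replace", "insert"):
            added.extend(new_lines[j1:j2])
    return removed, added

# Made-up sample data: one row changed, one row appended.
old = ["a,1\n", "b,2\n", "c,3\n"]
new = ["a,1\n", "b,20\n", "c,3\n", "d,4\n"]
removed, added = line_delta(old, new)
print(removed)  # ['b,2\n']
print(added)    # ['b,20\n', 'd,4\n']
```

Only the removed and added lines would then need to reach the database. Whether this is cheap enough for 700 files depends on their size, since SequenceMatcher can be slow on very long inputs.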
--
Gabriel Genellina