Using python to delta-load files into a central DB

Gabriel Genellina gagsl-py2 at yahoo.com.ar
Thu Apr 12 21:19:32 EDT 2007


En Thu, 12 Apr 2007 14:05:15 -0300, Chris Nethery <gilcneth at earthlink.net>  
escribió:

> At present, users of the separate application can run recalculation
> functions that modify all 700 files at once, causing my code to take the
> whole ball of wax, rather than just the data that has changed.

Are they text files, or what?
What kind of modifications? some lines changed/deleted/added? a column  
recalculated along the whole file?

> What I would like to do is spawn separate processes and load only the  
> delta
> data.  The data must be 100% reliable, so I'm leary of using something  
> like
> difflib.  I also want to make sure that my code scales since the number  
> of
> files is ever-increasing.

Why don't you like difflib? AFAIK it has no known bugs.

-- 
Gabriel Genellina




More information about the Python-list mailing list