Python to do CDC on XML files

Bruce Kirk bruce.kirk24 at gmail.com
Wed Mar 23 19:57:12 EDT 2016


I agree, the challenge is the volume of the data to compare is 13. Million records. So it needs to be very fast

Sent from my iPad

> On Mar 23, 2016, at 4:47 PM, Bob Gailer <bgailer at gmail.com> wrote:
> 
> 
> On Mar 23, 2016 4:20 PM, "Bruce Kirk" <bruce.kirk24 at gmail.com> wrote:
> >
> > Does anyone know of any existing projects on how to generate a change data capture on 2 very large xml files.
> >
> > The xml structures are the same, it is the data within the files that may differ.
> >
> It should not be too difficult to write a program that locates the tags delimiting each record, then compare them.



More information about the Python-list mailing list