Fuzzy matching of postal addresses

Tim Churches tchur at optushome.com.au
Sat Feb 19 21:33:17 EST 2005


McBooCzech wrote:

>Sorry for my "Ferbl typo". 
>
No offense was caused, I was just joking.

>For the local anti-smoking campaign 
>
As a public health epidemiologist, that's the sort of application of our 
project I like to see! And judging by the reports by Martin Mcgee and 
colleagues at the European Public Health Observatory, such campaigns are 
sorely needed in most of eastern Europe. I am sure Vaclav Havel is 
convinced of the need for anti-smoking campaigns.

>I am
>trying to link some addresses which contain following "linkable"
>informations (data fields) only:
>
>RECORD_ID, Street + No., City, Post code,
>
>All data are now w/o Unicode characters. Do you think it possible to
>try to link it with Febrl w/o deep code modification?
>  
>
Yes.

>I did try to link our data but the result is just a plenty of warning
>messages but no links. What is your suggestion? Please understand I do
>not want to bother you with my questions. I am just asking you your
>comments or pointers before I will try to dig in to the code. You
>probably know some "tricks" in data organization or something like
>that, which can be much easier then code digging.
>
>I can send our CSVs to you (they are small, just about 3204 records in
>the A data-set and about 1241 records in the B data-set) and a log as
>well.
>
>I have tried to organized oru files as following:
>       FEBRL reqirements : Our data
>       ==============================
>                 'rec_id': RECORD_ID,
>             'given_name': ""
>                'surname': ""
>             'street_num': ""
>         'address_part_1': ""
>         'address_part_2': Street + No.
>                 'suburb': City
>               'postcode': Post code
>                  'state': ""
>          'date_of_birth': ""
>             'soc_sec_id': ""
>
>Thanks for your answer and suggestions
>  
>
We would be pleased to assist - but off-list. Please send the exact text 
of the error messages (the log) directly to Peter Christen and myself in 
the first instance. Further discussion of these applictaon-specific 
issues is not appropriate for the general Python list - but you could 
report back to the Python list on your overall experience after we have 
solved the problems with you.

Regards,

Tim C




More information about the Python-list mailing list