newbie question: parsing street name from address

jay graves jaywgraves at gmail.com
Thu Jun 21 11:41:36 EDT 2007


On Jun 21, 8:47 am, cjl <cjl... at gmail.com> wrote:
> I would like to be able to extract the name of the street from the
> addresses in my data, however they vary significantly. Here are some
> examples:
>
> 25 Main St
> 2500 14th St
> 12 Bennet Pkwy
> Pearl St
> Bennet Rd and Main st
> 19th St
>
> As you can see, sometimes I have the house number, and sometimes I do
> not. Sometimes the street name is a number. Sometimes I simply have
> the names of intersecting streets.
>
> How might I approach this complex parsing problem?

In the past, I used the data cleansing routines from Febrl to do this
with some success.

http://www.google.com/search?q=febrl
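
If a full record-linkage package is more than you need, a rough sketch
of the same kind of cleanup in plain Python might look like the
following. This is not Febrl's API. The street_names function and the
STREET_SUFFIXES list are made up for illustration, and the sketch
assumes a house number is always a leading all-digit token and that
intersections are joined with "and" or "&".

```python
import re

# Hypothetical suffix list for illustration; real data needs a fuller one.
STREET_SUFFIXES = {"st", "rd", "pkwy", "ave", "blvd", "dr", "ln"}


def street_names(address):
    """Return a list of the street name(s) found in a free-form address."""
    names = []
    # Intersections such as "Bennet Rd and Main st" are split on "and" or "&".
    parts = re.split(r"\s+(?:and|&)\s+", address.strip(), flags=re.IGNORECASE)
    for part in parts:
        tokens = part.split()
        # Drop a leading house number, but only when it is purely digits:
        # "2500" is a house number, "14th" is part of the street name.
        if len(tokens) > 1 and tokens[0].isdigit():
            tokens = tokens[1:]
        # Drop a trailing street-type suffix such as "St" or "Pkwy".
        if len(tokens) > 1 and tokens[-1].lower().rstrip(".") in STREET_SUFFIXES:
            tokens = tokens[:-1]
        names.append(" ".join(tokens))
    return names


if __name__ == "__main__":
    for addr in ["25 Main St", "2500 14th St", "12 Bennet Pkwy",
                 "Pearl St", "Bennet Rd and Main st", "19th St"]:
        print(addr, "->", street_names(addr))
```

On the six sample addresses this gives Main, 14th, Bennet, Pearl,
Bennet plus Main, and 19th. Anything messier (unit numbers, directional
prefixes, misspelled suffixes) will need more rules than this.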

HTH.
...
Jay Graves





