Parse US addresses using conditional random fields

Project description

usaddress is a Python library that parses unstructured US address strings into labeled address components using conditional random fields, a probabilistic sequence-labeling method.

From the Python interpreter:

>>> import usaddress
>>> usaddress.parse('123 Main St. Suite 100 Chicago, IL')
[('123', 'AddressNumber'),
 ('Main', 'StreetName'),
 ('St.', 'StreetNamePostType'),
 ('Suite', 'OccupancyType'),
 ('100', 'OccupancyIdentifier'),
 ('Chicago,', 'PlaceName'),
 ('IL', 'StateName')]
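
The (token, label) pairs returned above are often easier to use once grouped by label. The following is a minimal sketch of that post-processing step, not part of usaddress itself: the helper name `group_by_label` is hypothetical, and the parse output is hard-coded from the session above so the snippet runs without the library installed.

```python
from collections import OrderedDict

# Output copied from the interpreter session above (hard-coded so that
# this sketch does not require usaddress to be installed).
parsed = [
    ('123', 'AddressNumber'),
    ('Main', 'StreetName'),
    ('St.', 'StreetNamePostType'),
    ('Suite', 'OccupancyType'),
    ('100', 'OccupancyIdentifier'),
    ('Chicago,', 'PlaceName'),
    ('IL', 'StateName'),
]


def group_by_label(pairs):
    """Hypothetical helper: collect tokens under their label, in order.

    Tokens sharing a label are joined with a space, and commas at either
    end of a token (such as the one on 'Chicago,') are stripped.
    """
    grouped = OrderedDict()
    for token, label in pairs:
        grouped.setdefault(label, []).append(token.strip(','))
    return OrderedDict(
        (label, ' '.join(tokens)) for label, tokens in grouped.items()
    )


components = group_by_label(parsed)
for label, value in components.items():
    print(label, '=', value)
```

Grouping by label assumes each label describes a single component of the address; an input in which the same label appears in two unrelated places would need separate handling.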
