A library for cleaning and parsing addresses.

CanAddress

CanAddress is a Python library for cleaning and parsing addresses. It standardizes addresses by converting them to a consistent format and splitting them into components such as street number, street name, city, province/state, and postal code.

Features

  • Clean and standardize addresses
  • Split addresses into components
  • Easily integrate with pandas DataFrames

Installation

You can install CanAddress using pip:

pip install canaddress

Usage

Here's a basic example of how to use CanAddress:

import pandas as pd
from canaddress import AddressParser

# Sample data: one address string per row
data = {
    'Property_Address': [
        '1801  3077 WESTON ROAD, TORONTO, ONTARIO M9M3A1',
        '711  4673 JANE STREET, TORONTO, ONTARIO M3N2L1',
        '105  55 NEPTUNE DRIVE, TORONTO, ONTARIO M6A1X2',
        '104  5949 YONGE STREET, TORONTO, ONTARIO M2M3V8'
    ]
}
df = pd.DataFrame(data)

# Initialize the AddressParser
parser = AddressParser(df, 'Property_Address')

# Clean and process the data
parser.clean_and_process_data()

# Display the cleaned and processed data
print(parser.display_data())
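
To illustrate what "cleaning and splitting into components" means for addresses shaped like the sample data, here is a minimal standalone sketch that uses only the standard library. It is not CanAddress's implementation and does not describe its output columns. The function name `split_address`, the returned keys, and the reading of two leading numbers as "unit, then street number" are all assumptions made for illustration.

```python
import re

# Canadian postal code at the end of a string: letter-digit-letter,
# an optional space, then digit-letter-digit.
POSTAL_CODE = re.compile(r"([A-Z]\d[A-Z])\s?(\d[A-Z]\d)$")


def split_address(raw: str) -> dict:
    """Split 'UNIT  NUMBER STREET, CITY, PROVINCE POSTAL' into components.

    Hypothetical helper for illustration; not part of the CanAddress API.
    """
    # Clean: collapse repeated whitespace and normalize case.
    cleaned = re.sub(r"\s+", " ", raw).strip().upper()

    parts = [p.strip() for p in cleaned.split(",")]
    if len(parts) != 3:
        raise ValueError(f"expected 'street, city, province postal': {raw!r}")
    street_part, city, province_part = parts

    # Separate the postal code from the province name.
    match = POSTAL_CODE.search(province_part)
    if match:
        postal_code = f"{match.group(1)} {match.group(2)}"
        province = province_part[:match.start()].strip()
    else:
        postal_code = None
        province = province_part

    # Assumption: two leading numbers mean "unit" followed by "street number".
    tokens = street_part.split(" ")
    unit = None
    if len(tokens) > 2 and tokens[0].isdigit() and tokens[1].isdigit():
        unit = tokens.pop(0)
    street_number = tokens.pop(0) if tokens and tokens[0].isdigit() else None

    return {
        "unit": unit,
        "street_number": street_number,
        "street_name": " ".join(tokens),
        "city": city,
        "province": province,
        "postal_code": postal_code,
    }


result = split_address("1801  3077 WESTON ROAD, TORONTO, ONTARIO M9M3A1")
print(result)
```

A sketch like this only handles the exact comma-separated layout shown in the sample data; real-world addresses vary far more, which is the case for using a dedicated library.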

License

This project is licensed under the MIT License - see the LICENSE file for details.

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.
