
Python address detector


Address Detector
==============================

How to detect whether a user query might be an address and should therefore trigger a map answer.




Project Modules
------------
The project implements three classifiers that follow a scikit-learn-style interface.

The first classifier is a simple scoring classifier based on the output of the address parser libpostal (https://github.com/openvenues/libpostal).
Depending on how well the parser handles the query and which fields it extracts, we compute a score and decide whether the query is an address.
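
As a rough illustration of the scoring idea, the sketch below sums a weight for each distinct field label found in the parser output and compares the total to a threshold. The weights, the threshold, and the function names are assumptions made for this example, not the package's actual rules. The input is a list of `(value, label)` pairs, which is the shape returned by `parse_address` in libpostal's Python bindings.

```python
# Hypothetical weights per libpostal label; the package's real
# scoring rules are not documented here.
FIELD_WEIGHTS = {
    "house_number": 0.3,
    "road": 0.4,
    "postcode": 0.2,
    "city": 0.2,
}


def address_score(parsed):
    """Sum the weights of the distinct labels found, capped at 1.0.

    `parsed` is a list of (value, label) pairs.
    """
    labels = {label for _value, label in parsed}
    return min(1.0, sum(FIELD_WEIGHTS.get(label, 0.0) for label in labels))


def is_address(parsed, threshold=0.5):
    """Decide 'address' when the score reaches the (assumed) threshold."""
    return address_score(parsed) >= threshold


# Hand-written parser outputs, so the sketch runs without libpostal.
parsed_address = [("12", "house_number"), ("rue de rivoli", "road"), ("paris", "city")]
parsed_other = [("pizza recipe", "house")]

print(is_address(parsed_address))
print(is_address(parsed_other))
```

With libpostal and its Python bindings installed, the parsed input could instead come from `from postal.parser import parse_address` followed by `is_address(parse_address(query))`.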

The second classifier is based on a FastText classifier trained on address data. FastText learns an embedding of the addresses it sees during training, so when a new query is submitted, the classifier checks whether it lies close, in the embedding space, to what has been learned.
The classifier is pre-trained, and the zipped FastText model is stored within the package.
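
A minimal sketch of how a prediction from such a model might be turned into a yes/no decision is shown below. It assumes a model object whose `predict(text, k=1)` returns a pair `(labels, probabilities)`, as the fastText Python bindings do. The label name `__label__address`, the threshold, and the `StubModel` class are hypothetical and exist only to make the example runnable without a trained model.

```python
def is_address_fasttext(model, query, threshold=0.5,
                        address_label="__label__address"):
    """Return True when the top label is the address label and its
    probability reaches the threshold.

    `address_label` and `threshold` are assumptions for this sketch.
    """
    # Strip whitespace because a single-text prediction should not
    # contain a trailing newline.
    labels, probs = model.predict(query.strip().lower(), k=1)
    return labels[0] == address_label and float(probs[0]) >= threshold


class StubModel:
    """Stand-in for a loaded fastText model (illustration only)."""

    def predict(self, text, k=1):
        # Toy rule: anything containing a digit looks like an address.
        if any(ch.isdigit() for ch in text):
            return ("__label__address",), [0.9]
        return ("__label__other",), [0.8]


model = StubModel()
print(is_address_fasttext(model, "10 Downing Street London"))
print(is_address_fasttext(model, "best pizza recipe"))
```

With the fastText Python bindings installed, `model` could instead be obtained with `fasttext.load_model(path)`, where `path` points at the unzipped model file; the location of that file inside the package is not specified here.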

The third classifier is a voting classifier combining the results of the two previous classifiers.
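
One way to combine two such classifiers is soft voting, where the address probabilities are averaged and compared to a threshold. The sketch below is an assumption about how this could look in a scikit-learn-like shape (`fit`, `predict_proba`, `predict`); the class name, the equal default weights, and the two toy voters are hypothetical, and the package may well use a different voting rule.

```python
class SoftVotingClassifier:
    """Average the address probabilities of several voters.

    Each voter is a callable mapping a query string to a probability
    in [0, 1]. This is an illustrative sketch, not the package's class.
    """

    def __init__(self, voters, weights=None, threshold=0.5):
        self.voters = list(voters)
        self.weights = list(weights) if weights else [1.0] * len(self.voters)
        self.threshold = threshold

    def fit(self, X, y=None):
        # The voters are assumed to be pre-trained, so there is nothing to fit.
        return self

    def predict_proba(self, X):
        total = sum(self.weights)
        return [
            sum(w * voter(query) for w, voter in zip(self.weights, self.voters)) / total
            for query in X
        ]

    def predict(self, X):
        # 1 means "address", 0 means "not an address".
        return [int(p >= self.threshold) for p in self.predict_proba(X)]


# Toy voters standing in for the scorer and the FastText classifier.
def scorer_vote(query):
    return 0.9 if any(ch.isdigit() for ch in query) else 0.1


def fasttext_vote(query):
    return 0.7 if "street" in query.lower() else 0.2


clf = SoftVotingClassifier([scorer_vote, fasttext_vote]).fit([])
print(clf.predict(["10 Downing Street", "pizza recipe"]))
```

Soft voting is used here because a majority vote between only two voters cannot break ties.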


Project Dependencies
------------
#### Installation of Postal
Before you install `Postal`, make sure you have the following prerequisites:
```
sudo apt-get install curl autoconf automake libtool pkg-config
```

Then to install the C library:

```
git clone https://github.com/openvenues/libpostal
cd libpostal
./bootstrap.sh
./configure --datadir=[...some dir with a few GB of space...]
make
sudo make install

# On Linux it's probably a good idea to run
sudo ldconfig
```
#### Installation of FastText
In order to build `fastText`, use the following:

```
$ git clone https://github.com/facebookresearch/fastText.git
$ cd fastText
$ make
```



=======
History
=======

0.1.0 (2017-10-25)
------------------

* First release on PyPI.
