Tag the names of countries and places in text.
Project description
countrytagger
This library finds place names in a string of text and tries to associate them with countries. The goal is to tag a piece (or set) of text with country metadata. The place names are derived from the GeoNames database and include the names of countries, major administrative areas, and large cities. Place names that occur in more than one country are excluded.
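To illustrate the idea of excluding names shared by several countries, here is a minimal sketch. It is not the library's actual implementation: the function name `build_unambiguous_index`, the lowercase normalization, and the toy records are all assumptions made for this example.

```python
from collections import defaultdict


def build_unambiguous_index(records):
    """Map each normalized place name to one country code.

    `records` is an iterable of (name, country_code) pairs. Names that
    appear under more than one country code are dropped entirely.
    """
    countries_by_name = defaultdict(set)
    for name, country_code in records:
        # Hypothetical normalization: trim whitespace and lowercase.
        countries_by_name[name.strip().lower()].add(country_code)
    return {
        name: next(iter(codes))
        for name, codes in countries_by_name.items()
        if len(codes) == 1
    }


# Toy records for illustration only, not real GeoNames data.
sample = [
    ("Berlin", "DE"),
    ("Paris", "FR"),
    ("Santa Cruz", "BO"),
    ("Santa Cruz", "US"),
]
index = build_unambiguous_index(sample)
print(index)
```

In this toy data "Santa Cruz" is listed under two countries, so it does not appear in the resulting index.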
Usage
import countrytagger
# match in a string using sequential matching:
text = 'I am in Berlin'
for (code, score, country) in countrytagger.tag_text_countries(text):
    print(score, country)
# find precise matches:
code, score, country = countrytagger.tag_place('Berlin')
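When tagging a set of texts, it can help to combine the per-text results into one ranking. The sketch below assumes only what the usage above shows, namely that the tagger yields `(code, score, country)` tuples. It further assumes that scores are numeric and can sensibly be summed, which this README does not state. `summarize_countries` and `fake_tagger` are hypothetical names; `fake_tagger` is a stand-in so the example runs without the library installed.

```python
from collections import defaultdict


def summarize_countries(texts, tagger):
    """Sum scores per country over several texts, highest total first.

    `tagger` is any callable that takes a string and yields
    (code, score, country) tuples.
    """
    totals = defaultdict(float)
    for text in texts:
        for code, score, country in tagger(text):
            totals[(code, country)] += score
    return sorted(totals.items(), key=lambda item: item[1], reverse=True)


def fake_tagger(text):
    # Hypothetical stand-in for a real tagger; the codes and scores
    # here are invented for the example.
    if "Berlin" in text:
        yield ("de", 1.0, "Germany")
    if "Paris" in text:
        yield ("fr", 0.5, "France")


ranking = summarize_countries(["I am in Berlin", "Berlin and Paris"], fake_tagger)
for (code, country), total in ranking:
    print(code, country, total)
```

With the library installed, `countrytagger.tag_text_countries` could be passed as `tagger` in place of the stand-in.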
Building the data
You can re-generate the place database like this:
$ make generate
This will download GeoNames and parse it into the format used by this library.
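As a rough picture of what parsing GeoNames involves, the sketch below reads rows in the tab-separated layout of the main GeoNames dump (name in column 1, feature class in column 6, country code in column 8, population in column 14, counting from zero). It is not what `make generate` actually runs. The function name `iter_places`, the population threshold, and the sample rows are assumptions for illustration.

```python
import csv

# Zero-based column positions in the main GeoNames dump.
NAME, FEATURE_CLASS, COUNTRY_CODE, POPULATION = 1, 6, 8, 14


def iter_places(lines, min_population=100_000):
    """Yield (name, country_code) for populated places above a threshold.

    `min_population` is a hypothetical cut-off chosen for this sketch.
    """
    reader = csv.reader(lines, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        if len(row) <= POPULATION or not row[COUNTRY_CODE]:
            continue  # skip short or country-less rows
        population = int(row[POPULATION] or 0)
        # Feature class "P" marks cities, towns and villages.
        if row[FEATURE_CLASS] == "P" and population >= min_population:
            yield row[NAME], row[COUNTRY_CODE]


# Illustrative rows in the 19-column layout; values are not copied from the dump.
berlin = ["2950159", "Berlin", "Berlin", "", "52.52", "13.41", "P", "PPLC",
          "DE", "", "16", "", "", "", "3426354", "", "43", "Europe/Berlin",
          "2020-01-01"]
hamlet = ["1", "Exampleton", "Exampleton", "", "0.0", "0.0", "P", "PPL",
          "XX", "", "", "", "", "", "250", "", "", "Etc/UTC", "2020-01-01"]
sample_lines = ["\t".join(berlin), "\t".join(hamlet)]

places = list(iter_places(sample_lines))
print(places)
```

The same function could be given an open file handle for a downloaded dump instead of the sample list.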
Download files
Source Distribution
countrytagger-0.1.2.tar.gz (693.7 kB)
Built Distribution
Hashes for countrytagger-0.1.2-py2.py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | 64f53d71db4917744b0d76104a444c9a025a30409e7892e8c506539850ce4d24
MD5 | 9b4f06a16b336cc68556019a833a360f
BLAKE2b-256 | 73d9d792e5b2b77d934ed3ed0a15a2742a0e6f57f6c682a1b65b33512e981ff6