Fuzzy lookup of country information
Project description
countryguess looks up country information by country codes or name matching. It tries to be lean (but not mean) and fast: All dependencies are in the Python Standard Library and country data is loaded lazily on demand.
Usage
guess_country() uses the default country data that is already packaged.
>>> from countryguess import guess_country
>>> guess_country("britain")
{
'name_short': 'United Kingdom',
'name_official': 'United Kingdom of Great Britain and Northern Ireland',
'iso2': 'GB',
'iso3': 'GBR',
...
}
>>> guess_country("no such country")
None
>>> guess_country("no such country", default="Oh, well.")
'Oh, well.'
>>> guess_country("PoRtUgAl", attribute="iso2")
'PT'
>>> guess_country("TW", attribute="name_official") # 2-letter code lookup
'Republic of China'
>>> guess_country("tWn", attribute="name_short") # 3-letter code lookup
'Taiwan'
Matching by regular expression can be extended by mapping
ISO2 codes to
re.Pattern objects.
>>> regex_map = {
... "MN": re.compile(r'^mongol\s+uls$', flags=re.IGNORECASE),
... "JP": re.compile(r'^ni(?:pp|h)on', flags=re.IGNORECASE),
... }
>>> guess_country("Mongol Uls", attribute="name_short", regex_map=regex_map)
'Mongolia'
>>> guess_country("Nippon", attribute="name_short", regex_map=regex_map)
'Japan'
You can also create a CountryData instance yourself to provide your own
country data in a JSON file.
>>> from countryguess import CountryData
>>> countries = CountryData("path/to/countries.json")
>>> countries["vIeTnAm"]
{'name_short': 'Vietnam', ...}
>>> countries["vn"]
{'name_short': 'Vietnam', ...}
>>> countries["asdf"]
KeyError: 'asdf'
>>> countries.get("asdf")
None
>>> countries.get("kuwait")
{'name_short': 'Kuwait', ...}
On CountryData instances, every key in the JSON data is accessible as a
method.
>>> countries.name_official("portugal")
'Portuguese Republic'
>>> countries.continent("vanuatu")
'Oceania'
Country Lookup
Countries are identified by name, 2-letter code (ISO 3166-1 alpha-2) or 3-letter code (ISO 3166-1 alpha-3). All identifiers are matched case-insensitively.
Names are matched with regular expressions that are stored in the JSON data. If
that fails, fuzzy matching against name_short and name_official is done
with difflib.
Country Data
Country information is read from a JSON file. One is shipped with the package,
but you can also provide your own to the CountryData class as described
above. The information in the default file was gratefully extracted from
country-converter. (Many thanks!)
The country data file must contain a list of JSON objects. Each object represents a country that must contain at least the following keys:
name_shortname_officialiso2iso3regex
Packaged Classification Schemes
The following classification schemes are available in the included country data.
- ISO2 (ISO 3166-1 alpha-2) - including UK/EL for Britain/Greece (but always convert to GB/GR)
- ISO3 (ISO 3166-1 alpha-3)
- ISO - numeric (ISO 3166-1 numeric)
- UN numeric code (M.49 - follows to a large extend ISO-numeric)
- A standard or short name
- The "official" name
- Continent: 6 continent classification with Africa, Antarctica, Asia, Europe, America, Oceania
- Continent_7 classification - 7 continent classification spliting North/South America
- UN region
- EXIOBASE 1 classification (2 and 3 letters)
- EXIOBASE 2 classification (2 and 3 letters)
- EXIOBASE 3 classification (2 and 3 letters)
- WIOD classification
- Eora
- OECD membership (per year)
- MESSAGE 11-region classification
- IMAGE
- REMIND
- UN membership (per year)
- EU membership (including EU12, EU15, EU25, EU27, EU27_2007, EU28)
- CoE (Council of Europe) membership
- EEA membership
- Schengen region
- Cecilia 2050 classification
- APEC
- BRIC
- BASIC
- CIS (as by 2019, excl. Turkmenistan)
- G7
- G20 (listing all EU member states as individual members)
- FAOcode (numeric)
- GBDcode (numeric - Global Burden of Disease country codes)
- IEA (World Energy Balances 2021)
- DACcode (numeric - OECD Development Assistance Committee)
- ccTLD - country code top-level domains
- GWcode - Gledisch & Ward numerical codes as published in https://www.andybeger.com/states/articles/statelists.html
- CC41 - common classification for MRIOs (list of countries found in all public MRIOs)
- IOC - International Olympic Committee (IOC) country codes
- BACI - BACI: International Trade Database at the Product-Level
- UNIDO - UNIDO INDSTAT database
- EXIOBASE hybrid 3 classification
- EXIOBASE hybrid 3 consequential classification
- GEOnumeric GEO numerical codes (also used in Prodcom)
- FIFA List of FIFA country codes
Command Line Interface
countryguess comes with a simple CLI with the same name. It takes one or two arguments:
$ countryguess oman
{
"name_short": "Oman",
"name_official": "Sultanate of Oman",
...
}
$ countryguess 'puerto ricco' name_official
Puerto Rico
Contributing
All kinds of bug reports, feature requests and suggestions are welcome!
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file countryguess-0.4.8.tar.gz.
File metadata
- Download URL: countryguess-0.4.8.tar.gz
- Upload date:
- Size: 49.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.13.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
385c7940498571263b5d61ae617a3127c9eec620bb0789ebc0334b688e608491
|
|
| MD5 |
d27018a0a492187dc63accb48c5eda8f
|
|
| BLAKE2b-256 |
a6de346e125596173c7bc6b0b32d63c65053d8689a024b08792af9e8af6bbff4
|
File details
Details for the file countryguess-0.4.8-py3-none-any.whl.
File metadata
- Download URL: countryguess-0.4.8-py3-none-any.whl
- Upload date:
- Size: 47.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.13.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
eea01ab0c281559fcc7e47947f2400e6f627cf942759b8572cee50d03fbb766a
|
|
| MD5 |
d701b80b09116bc944027322a0e9be7d
|
|
| BLAKE2b-256 |
676cc84e043858f46d7629891c704b8c611db2c86aa368e7713f716bc02f5721
|