A customizable geoparsing library for unstructured text
Irchel Geoparser
A Python library for extracting place names from text and linking them to geographic locations.
Features
- Project-Based Workflows: Store documents and results in a persistent database for long-term research
- Modular Architecture: Mix and match different recognizers and resolvers, or build your own
- Trainable Models: Fine-tune recognizers and resolvers on your own annotated data
- Custom Gazetteers: Integrate any geographic database through simple YAML configuration
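The split between recognizers and resolvers can be pictured with a small, self-contained sketch. It does not use the geoparser API: every name below (`Toponym`, `DictionaryRecognizer`, `LookupResolver`, `parse`, `TOY_GAZETTEER`) is hypothetical and exists only to illustrate the idea of swappable components, with a toy in-memory gazetteer and approximate coordinates.

```python
from __future__ import annotations

from dataclasses import dataclass
from typing import Optional, Protocol

# NOTE: illustrative sketch only. None of these names come from the
# geoparser library; they are assumptions made for this example.

Coordinates = tuple[float, float]  # (latitude, longitude)


@dataclass
class Toponym:
    text: str
    start: int
    end: int
    location: Optional[Coordinates] = None


class Recognizer(Protocol):
    def recognize(self, text: str) -> list[Toponym]: ...


class Resolver(Protocol):
    def resolve(self, toponym: Toponym) -> Optional[Coordinates]: ...


# Toy gazetteer with approximate, illustrative coordinates.
TOY_GAZETTEER: dict[str, Coordinates] = {
    "Paris": (48.8566, 2.3522),
    "France": (46.2, 2.2),
}


class DictionaryRecognizer:
    """Finds the first occurrence of each known name by substring search."""

    def __init__(self, names: list[str]) -> None:
        self.names = names

    def recognize(self, text: str) -> list[Toponym]:
        found = []
        for name in self.names:
            start = text.find(name)
            if start != -1:
                found.append(Toponym(name, start, start + len(name)))
        # Return matches in reading order.
        return sorted(found, key=lambda t: t.start)


class LookupResolver:
    """Links a toponym to coordinates by exact name lookup."""

    def __init__(self, gazetteer: dict[str, Coordinates]) -> None:
        self.gazetteer = gazetteer

    def resolve(self, toponym: Toponym) -> Optional[Coordinates]:
        return self.gazetteer.get(toponym.text)


def parse(text: str, recognizer: Recognizer, resolver: Resolver) -> list[Toponym]:
    """Run recognition, then resolution; either component can be swapped."""
    toponyms = recognizer.recognize(text)
    for toponym in toponyms:
        toponym.location = resolver.resolve(toponym)
    return toponyms


results = parse(
    "Paris is the capital of France.",
    DictionaryRecognizer(list(TOY_GAZETTEER)),
    LookupResolver(TOY_GAZETTEER),
)
for toponym in results:
    print(f"{toponym.text} [{toponym.start}:{toponym.end}] -> {toponym.location}")
```

Because `parse` depends only on the two small interfaces, a different recognizer (for example, a trained model) or a different resolver could be dropped in without touching the rest of the pipeline. The library's real components and their signatures are described in its documentation.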
Installation
pip install geoparser
Note for macOS users: The library requires SQLite extension support. Please see the macOS setup guide for installation instructions using Homebrew Python.
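To check whether a given Python build supports SQLite extensions before installing, something like the following may help. It is a minimal sketch that uses only the standard-library `sqlite3` module; the helper name `sqlite_supports_extensions` is made up for this example and is not part of geoparser. It also only tests whether extension loading is available at all, not whether the specific extensions the library needs are installed.

```python
import sqlite3


def sqlite_supports_extensions() -> bool:
    """Return True if this Python build allows loading SQLite extensions."""
    conn = sqlite3.connect(":memory:")
    try:
        # Python builds compiled without extension support do not expose
        # this method on connection objects at all.
        if not hasattr(conn, "enable_load_extension"):
            return False
        conn.enable_load_extension(True)
        conn.enable_load_extension(False)
        return True
    except (AttributeError, sqlite3.Error):
        return False
    finally:
        conn.close()


print("SQLite extension loading available:", sqlite_supports_extensions())
```

If this prints `False`, the interpreter was likely built without extension loading, which is the situation the macOS setup guide addresses.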
Quick Start
from geoparser import Geoparser
# Initialize with default settings
gp = Geoparser()
# Parse text
text = "Paris is the capital of France."
docs = gp.parse(text)
# Access results
for toponym in docs[0].toponyms:
    print(f"{toponym.text} -> {toponym.location.data}")
Documentation
Full documentation is available at docs.geoparser.app
Contributing
Contributions are welcome! Please feel free to submit a Pull Request. For major changes, please open an issue first to discuss what you would like to change.
Acknowledgments
The Irchel Geoparser originated as part of my Master's thesis and was further developed with support from the Department of Geography at the University of Zurich and the Public Data Lab of the Digitalization Initiative of the Zurich Higher Education Institutions. I thank Prof. Dr. Ross Purves for the opportunity to continue this work as part of a research project.
License
This project is licensed under the MIT License - see the LICENSE file for details.
Third-party licenses are listed in THIRD_PARTY_LICENSES.