Programmatic curation of Glottography datasets
pyglottography
Installation
Install via pip from PyPI:
pip install pyglottography
[!NOTE] We use GDAL's ogr2ogr command to convert between GeoJSON and GeoPackage formats. Thus, some functionality of
pyglottography requires a working GDAL installation.
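To find out up front whether the GDAL requirement is met, one option is to check that ogr2ogr is on the PATH. The sketch below uses only the Python standard library; the helper name `gdal_available` is made up for illustration and is not part of pyglottography.

```python
import shutil


def gdal_available(command: str = "ogr2ogr") -> bool:
    """Return True if the given executable can be found on the PATH.

    Hypothetical helper, not part of pyglottography. It only checks that
    the command exists, not that the GDAL installation actually works.
    """
    return shutil.which(command) is not None
```

Calling `gdal_available()` before a conversion step lets a script fail early with a clear message instead of a missing-command error.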
Curating Glottography datasets with pyglottography
Bootstrapping a new dataset
pyglottography provides a cldfbench project template,
which can be used with the cldfbench new command:
cldfbench new --template glottography
Providing the raw data
The cldfbench workflow uses data in a project's raw directory - enriched with information from
etc - to create a CLDF dataset in the cldf directory. By default, pyglottography expects input
data as follows:
- Geo-data, i.e. shapes for languoid areas, is expected in a GeoJSON file
raw/dataset.geojson. Each feature in this GeoJSON file should have a unique value for the id property.
- Metadata about the shapes is expected in a CSV file
etc/features.csv. This file must have an id column with values corresponding to the feature ids in the geo-data.
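The two requirements above can be checked before running the workflow. The following is a minimal sketch using only the standard library. It assumes that the id lives in each feature's properties object; the function name `check_ids` is hypothetical and not part of pyglottography.

```python
import csv
import json
from collections import Counter


def check_ids(geojson_path, csv_path):
    """Compare feature ids in the GeoJSON file with the id column of the CSV file."""
    with open(geojson_path, encoding="utf-8") as f:
        features = json.load(f)["features"]
    # Assumption: the id is stored in the properties object of each feature.
    geo_ids = [str(feature["properties"]["id"]) for feature in features]

    with open(csv_path, encoding="utf-8", newline="") as f:
        csv_ids = [row["id"] for row in csv.DictReader(f)]

    return {
        # ids used by more than one feature (they should be unique)
        "duplicates": sorted(i for i, n in Counter(geo_ids).items() if n > 1),
        # features without a metadata row
        "missing_in_csv": sorted(set(geo_ids) - set(csv_ids)),
        # metadata rows without a feature
        "missing_in_geojson": sorted(set(csv_ids) - set(geo_ids)),
    }
```

For example, `check_ids("raw/dataset.geojson", "etc/features.csv")` returns three lists, all of which should be empty for consistent input data.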
While metadata could be read entirely from the properties object of features in the GeoJSON file,
pyglottography looks up the metadata in a different file to allow for more transparent curation.
Since the Glottolog language catalog is released in a new version about twice a year, it is necessary
to be able to recreate a Glottography dataset with updated Glottocodes. With the raw data setup as
implemented in pyglottography, this only requires changes in etc/features.csv, which can easily
be tracked with versioning software such as git.
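As an illustration of such an update, the sketch below rewrites one column of a CSV file according to a mapping from old to new codes. The column name `glottocode`, the function name `remap_glottocodes` and the codes in the usage note are assumptions made for this example; only the id column is described above.

```python
import csv
import io


def remap_glottocodes(csv_text, replacements, column="glottocode"):
    """Return CSV text with values in `column` replaced according to `replacements`.

    Hypothetical helper; the column name is an assumption. Values that are
    not keys of `replacements` are left unchanged.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    out = io.StringIO()
    writer = csv.DictWriter(out, fieldnames=reader.fieldnames, lineterminator="\n")
    writer.writeheader()
    for row in reader:
        row[column] = replacements.get(row[column], row[column])
        writer.writerow(row)
    return out.getvalue()
```

Passing the content of etc/features.csv together with a mapping such as `{"abcd1234": "wxyz9999"}` (placeholder codes) yields the updated text. Written back to the file, the change shows up as a small line-based diff in git.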
Running the CLDF creation
cldfbench makecldf cldfbench_<dsid>.py