Programmatic curation of Glottography datasets

pyglottography

Installation

Install via pip from PyPI:

pip install pyglottography

Note: pyglottography uses GDAL's ogr2ogr command to convert between the GeoJSON and GeoPackage formats, so some of its functionality requires a working GDAL installation.
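Because the GDAL dependency is an external command rather than a Python package, pip will not install it. A minimal sketch for checking whether ogr2ogr can be found on the PATH is shown below; the function name `gdal_available` is hypothetical and not part of pyglottography.

```python
import shutil


def gdal_available(command: str = "ogr2ogr") -> bool:
    """Return True if the given command-line tool can be found on PATH.

    This only checks that an executable with this name exists; it does not
    verify the GDAL version or that the tool runs correctly.
    """
    return shutil.which(command) is not None
```

Calling `gdal_available()` before running a conversion step gives an early, readable failure instead of an error from a missing executable.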

Curating Glottography datasets with pyglottography

Bootstrapping a new dataset

pyglottography provides a cldfbench project template, which can be used with the cldfbench new command:

cldfbench new --template glottography

Providing the raw data

The cldfbench workflow uses data in a project's raw directory - enriched with information from etc - to create a CLDF dataset in the cldf directory. By default, pyglottography expects input data as follows:

  • Geo-data, i.e. shapes for languoid areas, is expected in a GeoJSON file raw/dataset.geojson. Each feature in this GeoJSON file should have a unique value for the id property.
  • Metadata about the shapes is expected in a CSV file etc/features.csv. This file must have an id column with values corresponding to the feature ids in the geo-data.
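The two requirements above (unique feature ids, and matching ids in the CSV) can be checked before running the workflow. The following is a rough sketch using only the standard library, not part of pyglottography. It assumes that the id is stored in each feature's `properties` object; the function names are hypothetical.

```python
import csv
import json
from collections import Counter


def compare_ids(geo_ids, csv_ids):
    """Report duplicate and unmatched ids between geo-data and metadata."""
    # GeoJSON ids may be numbers while CSV values are always strings,
    # so both sides are compared as strings.
    geo_ids = [str(i) for i in geo_ids]
    csv_ids = [str(i) for i in csv_ids]
    counts = Counter(geo_ids)
    return {
        "duplicate_in_geojson": sorted(i for i, n in counts.items() if n > 1),
        "missing_in_csv": sorted(set(geo_ids) - set(csv_ids)),
        "missing_in_geojson": sorted(set(csv_ids) - set(geo_ids)),
    }


def load_geojson_ids(path):
    """Read the id of every feature in a GeoJSON FeatureCollection."""
    with open(path, encoding="utf-8") as f:
        collection = json.load(f)
    # Assumption: the id lives in the feature's "properties" object.
    return [feature["properties"]["id"] for feature in collection["features"]]


def load_csv_ids(path):
    """Read the values of the id column of a CSV file."""
    with open(path, newline="", encoding="utf-8") as f:
        return [row["id"] for row in csv.DictReader(f)]
```

For a dataset laid out as described, `compare_ids(load_geojson_ids("raw/dataset.geojson"), load_csv_ids("etc/features.csv"))` returns three lists, all of which should be empty.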

While metadata could be read entirely from the properties object of features in the GeoJSON file, pyglottography looks up the metadata in a separate file to allow for more transparent curation. Because a new version of the Glottolog language catalog is released about twice a year, it must be possible to recreate a Glottography dataset with updated Glottocodes. With the raw data set up as implemented in pyglottography, this only requires changes in etc/features.csv, which can easily be tracked with version control software such as git.
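To illustrate why keeping the metadata in a CSV file makes such updates easy to track, here is a small sketch that replaces values in one column of a set of CSV rows. The column name `glottocode`, the helper `remap_column`, and the codes in the example are all assumptions made for illustration; the source does not specify the columns of etc/features.csv beyond id.

```python
import csv
import io


def remap_column(rows, column, mapping):
    """Return copies of the rows with values in `column` replaced per `mapping`.

    Values without an entry in `mapping` are kept unchanged.
    """
    return [{**row, column: mapping.get(row[column], row[column])} for row in rows]


# Made-up example data; "glottocode" is a hypothetical column name and the
# codes are placeholders, not real Glottocodes.
text = "id,glottocode\n1,aaaa1234\n2,cccc1234\n"
rows = list(csv.DictReader(io.StringIO(text)))
updated = remap_column(rows, "glottocode", {"aaaa1234": "bbbb1234"})

# Write the result back as CSV text; only the changed row differs,
# which keeps the diff in version control small.
out = io.StringIO()
writer = csv.DictWriter(out, fieldnames=["id", "glottocode"], lineterminator="\n")
writer.writeheader()
writer.writerows(updated)
```

Since only the affected rows change, a `git diff` of the file shows exactly which shapes were reassigned.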

Running the CLDF creation

cldfbench makecldf cldfbench_<dsid>.py
