Skip to main content

Programmatic curation of Glottography datasets

Project description

pyglottography

Programmatic curation of Glottography datasets

Build Status PyPI

Installation

Install via pip from PyPI:

pip install pyglottography

[!NOTE] We use GDAL's ogr2ogr command to convert between GeoJSON and GeoPackage formats. Thus, some functionality of pyglottography requires a working GDAL installation.

Curating Glottography datasets with pyglottography

Bootstrapping a new dataset

pyglottography provides a cldfbench project template, which can be used with the cldfbench new command:

cldfbench new --template glottography

Providing the raw data

The cldfbench workflow uses data in a project's raw directory - enriched with information from etc - to create a CLDF dataset in the cldf directory. By default, pyglottography expects input data as follows:

  • Geo-data, i.e. shapes for languoid areas, is expected in a GeoJSON file raw/dataset.geojson. Each feature in thie GeoJSON file should have a unique value for the id property.
  • Metadata about the shapes is expected in a CSV file etc/features.csv. This file must have an id column with values corresponding to the feature ids in the geo-data.

While metadata could be read entirely from the properties object of features in the GeoJSON file, pyglottography looks up the metadata in a different file to allow for more transparent curation. Since the Glottolog language catalog is released in a new version about twice a year, it is necessary to be able to recreate a Glottography dataset with updated Glottocodes. With the raw data setup as implemented in pyglottography, this only requires changes in etc/features.csv, which can easily be tracked with versioning software such as git.

Running the CLDF creation

cldfbench makecldf cldfbench_<dsid>.py

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pyglottography-0.2.0.tar.gz (17.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pyglottography-0.2.0-py2.py3-none-any.whl (17.3 kB view details)

Uploaded Python 2Python 3

File details

Details for the file pyglottography-0.2.0.tar.gz.

File metadata

  • Download URL: pyglottography-0.2.0.tar.gz
  • Upload date:
  • Size: 17.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.3

File hashes

Hashes for pyglottography-0.2.0.tar.gz
Algorithm Hash digest
SHA256 b241f6c60cb48598afbce64d68b7c2195890ff683a013b88de37de053ed5d6b9
MD5 9e2451ebf84567bdf2d90ea4d43956ea
BLAKE2b-256 b23ed0920d7847488eaab3d26cf71c6e209192e1c47d9182e2da3d7d09a2369f

See more details on using hashes here.

File details

Details for the file pyglottography-0.2.0-py2.py3-none-any.whl.

File metadata

File hashes

Hashes for pyglottography-0.2.0-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 9f6d2cc760d85416b2ff53a9c8d2f9e29f688ff3ff3e709b7af03497bcc61568
MD5 fe28dab6cd78c9ad80d44532cf9b68c7
BLAKE2b-256 1280c721138fb5bee2e60552c0173f47f1da1871eaf353268c6548a269f57351

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page