Programmatic curation of Glottography datasets

pyglottography

Installation

Install via pip from PyPI:

pip install pyglottography

Note: We use GDAL's ogr2ogr command to convert between GeoJSON and GeoPackage formats. Thus, some functionality of pyglottography requires a working GDAL installation.
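Before relying on the GDAL-dependent functionality, it can help to check that ogr2ogr is actually on the PATH. The following is a minimal sketch and not part of pyglottography: the helper names `ogr2ogr_available` and `geojson_to_gpkg_command` are hypothetical, and the command shown is a plain ogr2ogr GeoJSON-to-GeoPackage call, which may differ from what pyglottography runs internally.

```python
import shutil


def ogr2ogr_available() -> bool:
    """Return True if an ogr2ogr executable is found on PATH."""
    return shutil.which("ogr2ogr") is not None


def geojson_to_gpkg_command(src: str, dst: str) -> list:
    """Build an ogr2ogr call converting a GeoJSON file to GeoPackage.

    Hypothetical helper for illustration only.
    """
    # ogr2ogr expects the destination dataset before the source dataset.
    return ["ogr2ogr", "-f", "GPKG", dst, src]


if __name__ == "__main__":
    print("ogr2ogr found:", ogr2ogr_available())
    print(" ".join(geojson_to_gpkg_command("raw/dataset.geojson", "dataset.gpkg")))
```

The command list can be passed to `subprocess.run(..., check=True)` once the availability check succeeds.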

Curating Glottography datasets with pyglottography

Bootstrapping a new dataset

pyglottography provides a cldfbench project template, which can be used with the cldfbench new command:

cldfbench new --template glottography

Providing the raw data

The cldfbench workflow uses data in a project's raw directory - enriched with information from etc - to create a CLDF dataset in the cldf directory. By default, pyglottography expects input data as follows:

  • Geo-data, i.e. shapes for languoid areas, is expected in a GeoJSON file raw/dataset.geojson. Each feature in this GeoJSON file should have a unique value for the id property.
  • Metadata about the shapes is expected in a CSV file etc/features.csv. This file must have an id column with values corresponding to the feature ids in the geo-data.
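The two requirements above (unique feature ids, and matching ids in the CSV) can be checked before running the workflow. This is a sketch using only the standard library, not pyglottography's own validation; the function name `check_ids` is hypothetical, and it assumes every feature carries the id under its properties object.

```python
import csv
import json
from collections import Counter
from pathlib import Path


def check_ids(geojson_path, csv_path):
    """Compare feature ids in the GeoJSON file with the id column of the CSV."""
    data = json.loads(Path(geojson_path).read_text(encoding="utf8"))
    # Assumption: each feature has properties["id"]; ids are compared as strings.
    geo_ids = [str(f["properties"]["id"]) for f in data["features"]]
    with open(csv_path, newline="", encoding="utf8") as fp:
        csv_ids = {row["id"] for row in csv.DictReader(fp)}
    return {
        # ids used by more than one feature
        "duplicates": sorted(i for i, n in Counter(geo_ids).items() if n > 1),
        # features without a metadata row
        "missing_in_csv": sorted(set(geo_ids) - csv_ids),
        # metadata rows without a feature
        "missing_in_geojson": sorted(csv_ids - set(geo_ids)),
    }


if __name__ == "__main__":
    print(check_ids("raw/dataset.geojson", "etc/features.csv"))
```

A clean dataset yields three empty lists.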

While metadata could be read entirely from the properties object of features in the GeoJSON file, pyglottography looks up the metadata in a different file to allow for more transparent curation. Since the Glottolog language catalog is released in a new version about twice a year, it is necessary to be able to recreate a Glottography dataset with updated Glottocodes. With the raw data setup as implemented in pyglottography, this only requires changes in etc/features.csv, which can easily be tracked with versioning software such as git.
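To illustrate why keeping metadata in a separate CSV makes Glottolog updates easy to review, here is a small sketch that lists the features whose Glottocode differs between two versions of etc/features.csv. The column name `glottocode`, the function name `glottocode_changes`, and the sample values are assumptions made for illustration; only the id column is stated above.

```python
import csv
import io


def glottocode_changes(old_csv: str, new_csv: str) -> dict:
    """Map feature id -> (old, new) for rows whose glottocode changed.

    Both arguments are CSV text with `id` and (assumed) `glottocode` columns.
    """
    def read(text):
        return {row["id"]: row["glottocode"] for row in csv.DictReader(io.StringIO(text))}

    old, new = read(old_csv), read(new_csv)
    # Only ids present in both versions are compared.
    return {fid: (old[fid], new[fid]) for fid in old if fid in new and old[fid] != new[fid]}


if __name__ == "__main__":
    # Made-up placeholder codes, not real Glottocodes.
    before = "id,glottocode\n1,aaaa1111\n2,bbbb2222\n"
    after = "id,glottocode\n1,aaaa1111\n2,cccc3333\n"
    print(glottocode_changes(before, after))
```

In practice `git diff etc/features.csv` gives the same information; the sketch only makes the idea explicit.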

Running the CLDF creation

cldfbench makecldf cldfbench_<dsid>.py
