A Python library to read and write CLDF datasets

pycldf

A Python package to read and write CLDF datasets.

Writing CLDF

from pycldf import Wordlist, Source

dataset = Wordlist.in_dir('mydataset')
dataset.add_sources(Source('book', 'Meier2005', author='Hans Meier', year='2005', title='The Book'))
dataset.write(FormTable=[
    {
        'ID': '1', 
        'Form': 'word', 
        'Language_ID': 'abcd1234', 
        'Parameter_ID': '1277', 
        'Source': ['Meier2005[3-7]'],
    }])

results in

$ ls -1 mydataset/
forms.csv
sources.bib
Wordlist-metadata.json

mydataset/forms.csv

ID,Language_ID,Parameter_ID,Form,Segments,Comment,Source
1,abcd1234,1277,word,,,Meier2005[3-7]

mydataset/sources.bib

@book{Meier2005,
    author = {Meier, Hans},
    year = {2005},
    title = {The Book}
}

mydataset/Wordlist-metadata.json

Advanced writing

To add predefined CLDF components to a dataset, use the add_component method:

from pycldf import StructureDataset, term_uri

dataset = StructureDataset.in_dir('mydataset')
dataset.add_component('ParameterTable')
dataset.write(
    ValueTable=[{'ID': '1', 'Language_ID': 'abc', 'Parameter_ID': '1', 'Value': 'x'}],
    ParameterTable=[{'ID': '1', 'Name': 'Grammatical Feature'}])

It is also possible to add generic tables:

dataset.add_table('contributors.csv', term_uri('id'), term_uri('name'))

which can also be linked to other tables:

dataset.add_columns('ParameterTable', 'Contributor_ID')
dataset.add_foreign_key('ParameterTable', 'Contributor_ID', 'contributors.csv', 'ID')
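The foreign key declared above states that every Contributor_ID in the parameter table should match an ID in contributors.csv. The following is a minimal sketch of that constraint using only the standard library. It is not how pycldf enforces it, and the row contents (c1, c9, the names) are invented for illustration.

```python
import csv
import io

# Hypothetical file contents; these rows do not appear in the original text.
PARAMETERS_CSV = "ID,Name,Contributor_ID\n1,Grammatical Feature,c1\n2,Word Order,c9\n"
CONTRIBUTORS_CSV = "ID,Name\nc1,Hans Meier\n"


def dangling_references(child_csv, fk_column, parent_csv, pk_column="ID"):
    """Return (row_number, value) pairs in child_csv that match no parent row."""
    parent_keys = {row[pk_column] for row in csv.DictReader(io.StringIO(parent_csv))}
    missing = []
    for number, row in enumerate(csv.DictReader(io.StringIO(child_csv)), start=1):
        value = row[fk_column]
        # Empty cells are treated as "no reference" rather than as an error.
        if value and value not in parent_keys:
            missing.append((number, value))
    return missing


print(dangling_references(PARAMETERS_CSV, "Contributor_ID", CONTRIBUTORS_CSV))
```

Here the second parameter row points at a contributor that does not exist, so it is the only row reported.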

Addressing tables and columns

Tables in a dataset can be referenced using a Dataset's __getitem__ method, passing one of the following:

a full CLDF Ontology URI for the corresponding component,
the local name of the component in the CLDF Ontology,
the url of the table.

Columns in a dataset can be referenced using a Dataset's __getitem__ method, passing a tuple (<TABLE>, <COLUMN>), where <TABLE> specifies a table as explained above and <COLUMN> is either

a full CLDF Ontology URI used as propertyUrl of the column, or
the name property of the column.
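The lookup rules above can be illustrated with a small stand-alone sketch. It does not use pycldf; TABLES is a hypothetical, simplified stand-in for a dataset's metadata, and find_table and find_column are illustrative names rather than library functions.

```python
# Namespace of the CLDF Ontology terms.
TERMS = "http://cldf.clld.org/v1.0/terms.rdf#"

# Hypothetical, hand-written stand-in for a dataset's table descriptions.
TABLES = [
    {
        "url": "forms.csv",
        "component": "FormTable",
        "columns": [
            {"name": "ID", "propertyUrl": TERMS + "id"},
            {"name": "Language_ID", "propertyUrl": TERMS + "languageReference"},
        ],
    },
]


def find_table(spec):
    """Resolve a table by full ontology URI, local component name, or url."""
    for table in TABLES:
        if spec in (TERMS + table["component"], table["component"], table["url"]):
            return table
    raise KeyError(spec)


def find_column(table_spec, column_spec):
    """Resolve a column by its propertyUrl or by its name property."""
    for column in find_table(table_spec)["columns"]:
        if column_spec in (column["propertyUrl"], column["name"]):
            return column
    raise KeyError((table_spec, column_spec))


# All three ways of addressing the table lead to the same description.
print(find_table("FormTable") is find_table("forms.csv") is find_table(TERMS + "FormTable"))
print(find_column("FormTable", TERMS + "id")["name"])
```

With a real dataset the equivalent lookups would be written as dataset['FormTable'] and dataset['FormTable', 'ID'].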

Reading CLDF

>>> from pycldf.dataset import Wordlist
>>> dataset = Wordlist.from_metadata('mydataset/Wordlist-metadata.json')
>>> print(dataset)
<cldf:v1.0:Wordlist at mydataset>
>>> forms = list(dataset['FormTable'])
>>> forms[0]
OrderedDict([('ID', '1'), ('Language_ID', 'abcd1234'), ('Parameter_ID', '1277'), ('Form', 'word'), ('Segments', []), ('Comment', None), ('Source', ['Meier2005[3-7]'])])
>>> refs = list(dataset.sources.expand_refs(forms[0]['Source']))
>>> refs
[<Reference Meier2005[3-7]>]
>>> print(refs[0].source)
Meier, Hans. 2005. The Book.
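A reference such as Meier2005[3-7] combines a source key with an optional context in square brackets, here a page range. The sketch below shows one way to split such a string with the standard library. It is an illustration of the notation only, not the parser pycldf uses, and split_reference is a hypothetical helper name.

```python
import re

# A key without brackets, optionally followed by "[context]".
REF_PATTERN = re.compile(r"^(?P<key>[^\[\]]+)(?:\[(?P<context>[^\]]*)\])?$")


def split_reference(ref):
    """Split 'Key[context]' into (key, context); context is None if absent."""
    match = REF_PATTERN.match(ref.strip())
    if match is None:
        raise ValueError("not a source reference: {!r}".format(ref))
    return match.group("key"), match.group("context")


print(split_reference("Meier2005[3-7]"))  # ('Meier2005', '3-7')
print(split_reference("Meier2005"))       # ('Meier2005', None)
```

The key is what gets looked up in sources.bib; the context is carried along unchanged.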

Command line usage

Installing the pycldf package will also install a command line interface cldf, which provides some sub-commands to manage CLDF datasets.

Summary statistics

$ cldf stats mydataset/Wordlist-metadata.json 
<cldf:v1.0:Wordlist at mydataset>

Path                   Type          Rows
---------------------  ----------  ------
forms.csv              Form Table       1
mydataset/sources.bib  Sources          1

Validation

By default, data files are read in strict mode, i.e. invalid rows cause an exception to be raised. To validate a data file instead, read it in validating mode, which reports problems as warnings.

For example, the following output is generated

$ cldf validate mydataset/forms.csv
WARNING forms.csv: duplicate primary key: (u'1',)
WARNING forms.csv:4:Source missing source key: Mei2005

when reading the file

ID,Language_ID,Parameter_ID,Form,Segments,Comment,Source
1,abcd1234,1277,word,,,Meier2005[3-7]
1,stan1295,1277,hand,,,Meier2005[3-7]
2,stan1295,1277,hand,,,Mei2005[3-7]
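The two warnings correspond to two simple checks: primary keys must be unique, and every source key must exist in the bibliography. The following sketch reproduces both with the standard library. It is not the cldf validate implementation; the sample keeps only the ID and Source columns of the file above, and the use of ";" to separate multiple references in one cell is an assumption.

```python
import csv
import io

# Reduced copy of the file above: only the two columns the checks look at.
FORMS_CSV = """ID,Source
1,Meier2005[3-7]
1,Meier2005[3-7]
2,Mei2005[3-7]
"""

# Keys assumed to be defined in sources.bib.
KNOWN_SOURCES = {"Meier2005"}


def validate(text, known_sources):
    """Collect warnings for duplicate IDs and unknown source keys."""
    warnings = []
    seen_ids = set()
    # Line 1 is the header, so data rows start at line 2.
    for line_number, row in enumerate(csv.DictReader(io.StringIO(text)), start=2):
        if row["ID"] in seen_ids:
            warnings.append("duplicate primary key: {}".format(row["ID"]))
        seen_ids.add(row["ID"])
        # Assumption: multiple references in one cell are separated by ";".
        for ref in filter(None, row["Source"].split(";")):
            key = ref.split("[", 1)[0]
            if key not in known_sources:
                warnings.append(
                    "{}:Source missing source key: {}".format(line_number, key))
    return warnings


for warning in validate(FORMS_CSV, KNOWN_SOURCES):
    print("WARNING", warning)
```

Collecting warnings instead of raising on the first problem mirrors the difference between validating mode and strict mode described above.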

Release history

1.43.1

Mar 25, 2026

1.43.0

Aug 4, 2025

1.42.0

Apr 7, 2025

1.41.0

Feb 15, 2025

1.40.4

Jan 15, 2025

1.40.3

Jan 3, 2025

1.40.2

Dec 23, 2024

1.40.1

Dec 16, 2024

1.40.0

Dec 13, 2024

1.39.0

Sep 9, 2024

1.38.1

May 6, 2024

1.38.0

Apr 26, 2024

1.37.1

Mar 18, 2024

1.37.0

Jan 22, 2024

1.36.0

Nov 14, 2023

1.35.1

Oct 23, 2023

1.35.0

Jul 10, 2023

1.34.1

Mar 15, 2023

1.34.0

Dec 5, 2022

1.33.0

Nov 24, 2022

1.32.0

Nov 23, 2022

1.31.0

Nov 22, 2022

1.30.0

Nov 22, 2022

1.29.0

Oct 28, 2022

1.28.0

Oct 11, 2022

1.27.0

Jul 7, 2022

1.26.1

May 23, 2022

1.26.0

May 19, 2022

1.25.1

Feb 6, 2022

1.25.0

Feb 5, 2022

1.24.0

Nov 24, 2021

1.23.0

Aug 15, 2021

1.22.0

Jun 4, 2021

1.21.2

May 28, 2021

1.21.1

May 26, 2021

1.21.0

May 10, 2021

1.20.2

May 3, 2021

1.20.1

Apr 30, 2021

1.20.0

Apr 28, 2021

1.19.0

Apr 3, 2021

1.18.1

Mar 9, 2021

1.18.0

Jan 13, 2021

1.17.0

Oct 31, 2020

1.16.0

Oct 13, 2020

1.15.2

Oct 12, 2020

1.15.1

Oct 7, 2020

1.15.0

Aug 19, 2020

1.14.1

Mar 7, 2020

1.14.0

Mar 7, 2020

1.13.0

Mar 4, 2020

1.12.1

Feb 14, 2020

1.12.0

Feb 13, 2020

1.11.0

Feb 12, 2020

1.10.0

Jan 10, 2020

1.9.0

Nov 26, 2019

1.8.2

Oct 24, 2019

1.8.1

Oct 14, 2019

1.8.0

Sep 17, 2019

1.7.0

Aug 16, 2019

1.6.4

Jun 12, 2019

1.6.3

Jun 3, 2019

1.6.2

May 9, 2019

1.6.1

May 6, 2019

1.6.0

May 2, 2019

1.5.3

Apr 1, 2019

1.5.2

Nov 16, 2018

1.5.1

Aug 2, 2018

1.5.0

Jul 31, 2018

This version

1.4.1

May 2, 2018

1.4.0

May 2, 2018

1.3.0

Apr 24, 2018

1.2.0

Apr 18, 2018

1.1.1

Apr 18, 2018

1.1.0

Apr 18, 2018

1.0.10

Jan 13, 2018

1.0.9

Dec 20, 2017

1.0.8

Dec 1, 2017

1.0.7

Nov 29, 2017

1.0.6

Oct 19, 2017

1.0.5

Oct 16, 2017

1.0.4

Oct 12, 2017

1.0.3

Aug 16, 2017

1.0.2

Jul 28, 2017

1.0.1

Jul 27, 2017

1.0r2

Jul 17, 2017

1.0r1

Jul 14, 2017

1.0.0

Jul 27, 2017

1.0rc1 pre-release

Jul 24, 2017

1.0b2 pre-release

Jul 17, 2017

0.6.4

Dec 21, 2016

0.6.3

Dec 15, 2016

0.6.2

Sep 7, 2016

0.6.1

Sep 7, 2016

0.6.0

Jul 6, 2016

0.5.2

Jun 28, 2016

0.5.1

Jun 28, 2016

0.5.0

Jun 28, 2016

0.4.2

Jun 23, 2016

0.4.1

Jun 23, 2016

0.4.0

Jun 22, 2016

0.3.0

Jun 22, 2016

0.2.1

Jun 20, 2016

0.2.0

Jun 20, 2016

0.1.0

Jun 16, 2016

Download files

Source Distribution

pycldf-1.4.1.tar.gz (31.2 kB)

Uploaded May 2, 2018 Source

Built Distribution

pycldf-1.4.1-py2.py3-none-any.whl (41.2 kB)

Uploaded May 2, 2018 Python 2, Python 3

File details

Details for the file pycldf-1.4.1.tar.gz.

File metadata

Download URL: pycldf-1.4.1.tar.gz
Upload date: May 2, 2018
Size: 31.2 kB
Tags: Source
Uploaded using Trusted Publishing? No

File hashes

Hashes for pycldf-1.4.1.tar.gz
Algorithm	Hash digest
SHA256	`421bf57604aced7d24f96b84ea33a99cbe9377d8bb7a489fb6c214b05914ab4d`
MD5	`5c26807b954ef7e2be629b8a58b6b31c`
BLAKE2b-256	`30d5c6cf3124c0549c0f74d18893bc0a28bd8aa1fb018e615212933807a57202`

File details

Details for the file pycldf-1.4.1-py2.py3-none-any.whl.

File metadata

Download URL: pycldf-1.4.1-py2.py3-none-any.whl
Upload date: May 2, 2018
Size: 41.2 kB
Tags: Python 2, Python 3
Uploaded using Trusted Publishing? No

File hashes

Hashes for pycldf-1.4.1-py2.py3-none-any.whl
Algorithm	Hash digest
SHA256	`ff13bff786ee66e9d8cbc2a80692015fe3d5494cb04db467509eae62e6a08674`
MD5	`f0cdcfdcb3785cb951f5c2dfb0a8b99c`
BLAKE2b-256	`3565b64f16e612bb230299fad2306997d24723f78fcf84cb82189074dc40b73d`

