A Python library for DAKODA corpus management and querying.
Project description
Dakoda Core
A Python library for DAKODA corpus management.
DAKODA is an interdisciplinary project with the overarching goal of advancing the data competencies of early-career researchers in German as a Foreign/Second Language (DaF/DaZ) in the field of learner corpus research. The project develops language technology resources for acquisition-related research questions and tests them based on a broad data foundation.
Prerequisites
- Python 3.9+ (3.11+ recommended via pyenv)
- Poetry for dependency management (see installation guide)
Setup
-
Clone the repository
git clone git@github.com:dakoda-project/dakoda-core.git cd dakoda-core
-
Install dependencies and create virtual environment
poetry install -
Verify installation
poetry run python -c "from dakoda import *; print('Dakoda Core installed successfully!')"
Development
Project Structure
dakoda-core/
├── src/
│ └── dakoda/
│ ├── corpus.py
│ ├── countries.py
│ ├── languages.py
│ ├── dakoda_types.py
│ ├── dakoda_metadata_scheme.py
│ ├── util.py
│ └── res/
│ ├── dakoda-logo.png
│ └── dakoda_typesystem.xml
├── tests/
└── data/
Running Tests
# Run all tests
poetry run pytest
# Run with coverage
poetry run pytest --cov=dakoda
Contributing
- Fork the repository
- Create a feature branch
- Make your changes
- Run tests:
poetry run pytest - Submit a pull request
Release (PyPI)
Short release checklist:
-
Update the version in
pyproject.toml. -
Run tests:
poetry run pytest
-
Build distribution artifacts:
poetry build -
Publish:
export POETRY_PYPI_TOKEN_PYPI="pypi-XXXXXXXXXXXXXXXXXXXX" poetry publish
Notes:
- Use API tokens for authentication (
__token__username). - Create a fresh build (
poetry build) for every release.
License
This project is licensed under the Apache License 2.0. See LICENSE.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file dakoda_core-0.1.0.tar.gz.
File metadata
- Download URL: dakoda_core-0.1.0.tar.gz
- Upload date:
- Size: 450.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/2.1.2 CPython/3.13.0 Darwin/25.5.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
b43aa8c78cc5c8f66e1d417d1c84d0a68dbed095e8edce4f4e7e24b0eace980e
|
|
| MD5 |
b0862c4bf680492f17d154da33e2ae08
|
|
| BLAKE2b-256 |
9e926875121743e58a4a9d854a6952ddc2698e82aae58da10857ebbd6ae9dd4b
|
File details
Details for the file dakoda_core-0.1.0-py3-none-any.whl.
File metadata
- Download URL: dakoda_core-0.1.0-py3-none-any.whl
- Upload date:
- Size: 451.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/2.1.2 CPython/3.13.0 Darwin/25.5.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
173c045d0372e9428bd355727d422f607de4899eb13b7fa08842bf23e47ad4df
|
|
| MD5 |
c55066451a04e8e92a4e21d9cd57a61b
|
|
| BLAKE2b-256 |
cbf83d90e2ca57ff6a84e2becce23d4a5b286fc67b538bc464eb20e9973aae5e
|