Skip to main content

A Python library for DAKODA corpus management and querying.

Project description

Dakoda Core

A Python library for DAKODA corpus management.

DAKODA is an interdisciplinary project with the overarching goal of advancing the data competencies of early-career researchers in German as a Foreign/Second Language (DaF/DaZ) in the field of learner corpus research. The project develops language technology resources for acquisition-related research questions and tests them based on a broad data foundation.

Prerequisites

  • Python 3.9+ (3.11+ recommended via pyenv)
  • Poetry for dependency management (see installation guide)

Setup

  1. Clone the repository

    git clone git@github.com:dakoda-project/dakoda-core.git
    cd dakoda-core
    
  2. Install dependencies and create virtual environment

    poetry install
    
  3. Verify installation

    poetry run python -c "from dakoda import *; print('Dakoda Core installed successfully!')"
    

Development

Project Structure

dakoda-core/
├── src/
│   └── dakoda/           
│       ├── corpus.py     
│       ├── countries.py  
│       ├── languages.py  
│       ├── dakoda_types.py          
│       ├── dakoda_metadata_scheme.py 
│       ├── util.py      
│       └── res/          
│           ├── dakoda-logo.png
│           └── dakoda_typesystem.xml
├── tests/                
└── data/                 

Running Tests

# Run all tests
poetry run pytest

# Run with coverage
poetry run pytest --cov=dakoda

Contributing

  1. Fork the repository
  2. Create a feature branch
  3. Make your changes
  4. Run tests: poetry run pytest
  5. Submit a pull request

Release (PyPI)

Short release checklist:

  1. Update the version in pyproject.toml.

  2. Run tests:

    poetry run pytest
    
  3. Build distribution artifacts:

    poetry build
    
  4. Publish:

    export POETRY_PYPI_TOKEN_PYPI="pypi-XXXXXXXXXXXXXXXXXXXX"
    poetry publish
    

Notes:

  • Use API tokens for authentication (__token__ username).
  • Create a fresh build (poetry build) for every release.

License

This project is licensed under the Apache License 2.0. See LICENSE.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dakoda_core-0.1.0.tar.gz (450.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

dakoda_core-0.1.0-py3-none-any.whl (451.1 kB view details)

Uploaded Python 3

File details

Details for the file dakoda_core-0.1.0.tar.gz.

File metadata

  • Download URL: dakoda_core-0.1.0.tar.gz
  • Upload date:
  • Size: 450.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.1.2 CPython/3.13.0 Darwin/25.5.0

File hashes

Hashes for dakoda_core-0.1.0.tar.gz
Algorithm Hash digest
SHA256 b43aa8c78cc5c8f66e1d417d1c84d0a68dbed095e8edce4f4e7e24b0eace980e
MD5 b0862c4bf680492f17d154da33e2ae08
BLAKE2b-256 9e926875121743e58a4a9d854a6952ddc2698e82aae58da10857ebbd6ae9dd4b

See more details on using hashes here.

File details

Details for the file dakoda_core-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: dakoda_core-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 451.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.1.2 CPython/3.13.0 Darwin/25.5.0

File hashes

Hashes for dakoda_core-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 173c045d0372e9428bd355727d422f607de4899eb13b7fa08842bf23e47ad4df
MD5 c55066451a04e8e92a4e21d9cd57a61b
BLAKE2b-256 cbf83d90e2ca57ff6a84e2becce23d4a5b286fc67b538bc464eb20e9973aae5e

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page