Skip to main content

A Command Line application and Python library to conjugate French, English, Spanish, Italian, Portuguese and Romanian verbs using Machine Learning techniques.

Project description

mlconjug3 PyPi Home Page
Package Maintenance Status Package Maintener OpenSSF Best Practices OpenSSF ScoreCard Pypi Python Package Index Status Anaconda Package Index Status Supported platforms Conda Code Coverage Status Code Vulnerability Status DOI Follow me on Mastodon

mlconjug3: The multi-lingual conjugator

A Command Line application and Python library to conjugate verbs in French, English, Spanish, Italian, Portuguese, and Romanian (with more languages soon to come) using Machine Learning techniques.

The mlconjug3 project is now a proud member of the ARS Linguistica organization. ARS Linguistica is a community-driven, open source project that aims to develop free and accessible linguistic tools and resources for all. With a focus on advancing linguistic research, documentation, and education, ARS Linguistica is dedicated to preserving and promoting linguistic diversity through the use of open source and open science.

With mlconjug3, you can:

  • Conjugate any verb in one of the supported languages, even completely new or made-up verbs, with the help of a pre-trained Machine Learning model.

  • Easily modify and retrain the models using any compatible classifiers from scikit-learn.

  • Integrate mlconjug3 in your own projects.


Compatibility Matrix (v4.0.1)

Platform

Supported

Notes

Linux

✔ Python 3.9–3.14

Fully supported

macOS

✔ Python 3.9–3.14

Fully supported

Windows

✔ Python 3.9–3.12

Stable

Windows (3.13+)

~ Experimental

SciPy / sklearn issues

Python 3.13+

~ Experimental

Native binary instability

⚠ IMPORTANT: Windows + Python 3.13/3.14 may crash due to upstream SciPy binary incompatibilities. These builds are marked experimental in CI.


Release Notes (v4.0.1)

This release introduces major internal improvements to the ML pipeline.

Improvements:

  • Reworked training pipeline for stability and reproducibility

  • Improved feature extraction for Italian and Romanian verb morphology

  • Unified sklearn Pipeline architecture for all models

  • Updated classifier to SGDClassifier (elasticnet regularization)

  • Better handling of unseen verb forms via enhanced feature engineering

  • Improved cross-platform consistency (Windows/macOS/Linux)

Behavioral changes:

  • Slight variations in prediction outputs due to improved feature representation

  • Optional sample weighting added to training pipeline

  • Internal API refactoring (public API remains backward compatible)

Migration notes:

  • No breaking changes in public API

  • Minor variation in predictions expected due to improved model generalization


Using mlconjug3 in Academic Research

mlconjug3 is a valuable tool for linguistic researchers, as it provides accurate and up-to-date conjugation information for a wide range of languages. With its ability to handle completely new or made-up verbs, mlconjug3 is perfect for exploring new linguistic concepts and theories. It can also be used to compare and contrast conjugation patterns across different languages, helping researchers to identify and understand linguistic trends.

Integrating mlconjug3 in Applications

In addition to academic research, mlconjug3 can be integrated into a wide range of web and desktop applications. For language learning platforms, mlconjug3 provides an accurate and comprehensive source of conjugation information, helping students to quickly and easily master verb conjugation. For language translation tools, mlconjug3 can help to ensure that translations are grammatically correct, by providing accurate verb conjugation information in real-time.

By using mlconjug3, you are not only getting a powerful and flexible tool for verb conjugation, but you are also supporting the goals and mission of ARS Linguistica. Whether you are a linguistic researcher, language teacher, or simply someone who is passionate about preserving linguistic heritage, your support is crucial to the success of our organization.

Join us in our mission to make linguistic tools and resources accessible to all!


Conjugation for the verb to be.

Supported Languages

  • French

  • English

  • Spanish

  • Italian

  • Portuguese

  • Romanian

Academic publications citing mlconjug3

BibTeX

If you want to cite mlconjug3 in an academic publication use this citation format:

@article{mlconjug3,
  title={mlconjug3},
  author={Sekou Diao},
  journal={GitHub. Note: https://github.com/Ars-Linguistica/mlconjug3 Cited by},
  year={2023}
}

Software projects using mlconjug3

Installation

You can install mlconjug3 using different methods depending on your workflow.

### 1. Install via pip (recommended)

pip install mlconjug3

### 2. Install from source (GitHub)

git clone https://github.com/Ars-Linguistica/mlconjug3.git
cd mlconjug3
pip install .

### 3. Install via conda (conda-forge)

conda install -c conda-forge mlconjug3

Signing of Releases

Starting with version 3.10, all releases of mlconjug3 published on PyPI and GitHub are signed using Sigstore.

What is Sigstore?

Sigstore is an open-source project that provides simple, transparent, and secure software signing. It allows developers to sign releases without managing long-lived cryptographic keys.

Instead, Sigstore uses short-lived certificates issued through identity providers, making the signing process both secure and easy to verify.

This ensures that mlconjug3 releases on PyPI can be cryptographically verified and have not been tampered with.

How to verify a release?

You can verify mlconjug3 package signatures using cosign, which is part of the Sigstore ecosystem.

Install cosign:

# Linux / macOS
curl -O -L https://github.com/sigstore/cosign/releases/latest/download/cosign-linux-amd64
chmod +x cosign-linux-amd64
sudo mv cosign-linux-amd64 /usr/local/bin/cosign

# macOS (Homebrew)
brew install cosign

Verify a release:

cosign verify-blob \
    --certificate-identity https://github.com/Ars-Linguistica/mlconjug3/.github/workflows/upload_wheels_to_pypi.yml \
    --certificate-oidc-issuer https://token.actions.githubusercontent.com \
    mlconjug3-<version>.tar.gz \
    --signature mlconjug3-<version>.tar.gz.sig

This ensures: - The package was built by the official CI pipeline - The release was not modified after publication - The signature matches the GitHub Actions identity

For more details, see: https://docs.sigstore.dev/

Credits

This package was created with the help of Verbiste and scikit-learn.

The logo was designed by Zuur.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mlconjug3-4.0.1.tar.gz (5.4 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

mlconjug3-4.0.1-py3-none-any.whl (5.5 MB view details)

Uploaded Python 3

File details

Details for the file mlconjug3-4.0.1.tar.gz.

File metadata

  • Download URL: mlconjug3-4.0.1.tar.gz
  • Upload date:
  • Size: 5.4 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for mlconjug3-4.0.1.tar.gz
Algorithm Hash digest
SHA256 e589f9db9e02ff61b586025937c8ec6ec8df6df49177bc20392287e03a78be13
MD5 55ddb0057050554b5d543e5eb5b2dbc5
BLAKE2b-256 23baa0329252556c7782f43b205537ced5738dac0fec06c27cf4c74be49061ae

See more details on using hashes here.

Provenance

The following attestation bundles were made for mlconjug3-4.0.1.tar.gz:

Publisher: upload_wheels_to_pypi.yml on Ars-Linguistica/mlconjug3

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file mlconjug3-4.0.1-py3-none-any.whl.

File metadata

  • Download URL: mlconjug3-4.0.1-py3-none-any.whl
  • Upload date:
  • Size: 5.5 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for mlconjug3-4.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 fbac083da0be5ca670bbde736111d0740867c39ae70da2ba345b369a5c9c3136
MD5 9d6d13645b2891186554b2298965468c
BLAKE2b-256 77e5ace2b037e578256aed9bab541e94f2848a820c569d232878e313cde0003c

See more details on using hashes here.

Provenance

The following attestation bundles were made for mlconjug3-4.0.1-py3-none-any.whl:

Publisher: upload_wheels_to_pypi.yml on Ars-Linguistica/mlconjug3

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page