A Command Line application and Python library to conjugate French, English, Spanish, Italian, Portuguese and Romanian verbs using Machine Learning techniques.
Project description
mlconjug3: The multi-lingual conjugator
A Command Line application and Python library to conjugate verbs in French, English, Spanish, Italian, Portuguese, and Romanian (with more languages soon to come) using Machine Learning techniques.
The mlconjug3 project is now a proud member of the ARS Linguistica organization. ARS Linguistica is a community-driven, open source project that aims to develop free and accessible linguistic tools and resources for all. With a focus on advancing linguistic research, documentation, and education, ARS Linguistica is dedicated to preserving and promoting linguistic diversity through the use of open source and open science.
With mlconjug3, you can:
Conjugate any verb in one of the supported languages, even completely new or made-up verbs, with the help of a pre-trained Machine Learning model.
Easily modify and retrain the models using any compatible classifiers from scikit-learn.
Integrate mlconjug3 in your own projects.
Compatibility Matrix (v4.0.1)
Platform |
Supported |
Notes |
|---|---|---|
Linux |
✔ Python 3.9–3.14 |
Fully supported |
macOS |
✔ Python 3.9–3.14 |
Fully supported |
Windows |
✔ Python 3.9–3.12 |
Stable |
Windows (3.13+) |
~ Experimental |
SciPy / sklearn issues |
Python 3.13+ |
~ Experimental |
Native binary instability |
⚠ IMPORTANT: Windows + Python 3.13/3.14 may crash due to upstream SciPy binary incompatibilities. These builds are marked experimental in CI.
Release Notes (v4.0.1)
This release introduces major internal improvements to the ML pipeline.
Improvements:
Reworked training pipeline for stability and reproducibility
Improved feature extraction for Italian and Romanian verb morphology
Unified sklearn Pipeline architecture for all models
Updated classifier to SGDClassifier (elasticnet regularization)
Better handling of unseen verb forms via enhanced feature engineering
Improved cross-platform consistency (Windows/macOS/Linux)
Behavioral changes:
Slight variations in prediction outputs due to improved feature representation
Optional sample weighting added to training pipeline
Internal API refactoring (public API remains backward compatible)
Migration notes:
No breaking changes in public API
Minor variation in predictions expected due to improved model generalization
Using mlconjug3 in Academic Research
mlconjug3 is a valuable tool for linguistic researchers, as it provides accurate and up-to-date conjugation information for a wide range of languages. With its ability to handle completely new or made-up verbs, mlconjug3 is perfect for exploring new linguistic concepts and theories. It can also be used to compare and contrast conjugation patterns across different languages, helping researchers to identify and understand linguistic trends.
Integrating mlconjug3 in Applications
In addition to academic research, mlconjug3 can be integrated into a wide range of web and desktop applications. For language learning platforms, mlconjug3 provides an accurate and comprehensive source of conjugation information, helping students to quickly and easily master verb conjugation. For language translation tools, mlconjug3 can help to ensure that translations are grammatically correct, by providing accurate verb conjugation information in real-time.
By using mlconjug3, you are not only getting a powerful and flexible tool for verb conjugation, but you are also supporting the goals and mission of ARS Linguistica. Whether you are a linguistic researcher, language teacher, or simply someone who is passionate about preserving linguistic heritage, your support is crucial to the success of our organization.
Join us in our mission to make linguistic tools and resources accessible to all!
Free software: MIT license
Documentation: https://mlconjug3.readthedocs.io/en/latest/readme.html
Supported Languages
French
English
Spanish
Italian
Portuguese
Romanian
Academic publications citing mlconjug3
- Gerard Canal, Senka Krivic, Paul Luff, Andrew Coles.Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI-22), 2022.
- Mike Hongfei Wu.A DISSERTATION SUBMITTED TO THE DEPARTMENT OF COMPUTER SCIENCE AND THE COMMITTEE ON GRADUATE STUDIES OF STANFORD UNIVERSITY IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF DOCTOR OF PHILOSOPHY. May 2022.
- Spencer Ng, Lucy Teaford, Andy Yang, and Isaiah Zwick-Schachter.CMSC 25610: Computational Linguistics University of Chicago, 2021.
- Ali Malik and Mike Wu and Vrinda Vasavada and Jinpeng Song and John Mitchell and Noah D. Goodman and Chris Piech.Proceedings of the 34th AAAI conference on Artificial Intelligence, 2019.
BibTeX
If you want to cite mlconjug3 in an academic publication use this citation format:
@article{mlconjug3,
title={mlconjug3},
author={Sekou Diao},
journal={GitHub. Note: https://github.com/Ars-Linguistica/mlconjug3 Cited by},
year={2023}
}
Software projects using mlconjug3
- EDS-NLP provides a set of spaCy components that are used to extract information from clinical notes written in French.
- Translation flask API for the Helsinki NLP models available in the Huggingface Transformers library.
- NLP Suite is a package of tools designed for non-specialists, for scholars with no knowledge or little knowledge of Natural Language Processing.
- Runebook translates various references such as programming languages, frameworks, libraries, and APIs that software engineers refer to in development.
- This project offers tools to visualize the gender bias in pre-trained language models.
- Uses language models to generate adapted text.
- Dockerized microservice for conjugation.
- HTML transformation tool.
- A Tux bot.
- Tweets French words.
- Helps learn verb forms.
- NLP utilities.
- Detects masks.
- Generates excuses.
- NLP repository.
- Voice assistant.
- Random advice generator.
- Quiz generator.
- Rogue-like game.
- Spanish learning app.
- Vocabulary app.
Installation
You can install mlconjug3 using different methods depending on your workflow.
### 1. Install via pip (recommended)
pip install mlconjug3
### 2. Install from source (GitHub)
git clone https://github.com/Ars-Linguistica/mlconjug3.git
cd mlconjug3
pip install .
### 3. Install via conda (conda-forge)
conda install -c conda-forge mlconjug3
Signing of Releases
Starting with version 3.10, all releases of mlconjug3 published on PyPI and GitHub are signed using Sigstore.
What is Sigstore?
Sigstore is an open-source project that provides simple, transparent, and secure software signing. It allows developers to sign releases without managing long-lived cryptographic keys.
Instead, Sigstore uses short-lived certificates issued through identity providers, making the signing process both secure and easy to verify.
This ensures that mlconjug3 releases on PyPI can be cryptographically verified and have not been tampered with.
How to verify a release?
You can verify mlconjug3 package signatures using cosign, which is part of the Sigstore ecosystem.
Install cosign:
# Linux / macOS
curl -O -L https://github.com/sigstore/cosign/releases/latest/download/cosign-linux-amd64
chmod +x cosign-linux-amd64
sudo mv cosign-linux-amd64 /usr/local/bin/cosign
# macOS (Homebrew)
brew install cosign
Verify a release:
cosign verify-blob \
--certificate-identity https://github.com/Ars-Linguistica/mlconjug3/.github/workflows/upload_wheels_to_pypi.yml \
--certificate-oidc-issuer https://token.actions.githubusercontent.com \
mlconjug3-<version>.tar.gz \
--signature mlconjug3-<version>.tar.gz.sig
This ensures: - The package was built by the official CI pipeline - The release was not modified after publication - The signature matches the GitHub Actions identity
For more details, see: https://docs.sigstore.dev/
Credits
This package was created with the help of Verbiste and scikit-learn.
The logo was designed by Zuur.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file mlconjug3-4.0.1.tar.gz.
File metadata
- Download URL: mlconjug3-4.0.1.tar.gz
- Upload date:
- Size: 5.4 MB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
e589f9db9e02ff61b586025937c8ec6ec8df6df49177bc20392287e03a78be13
|
|
| MD5 |
55ddb0057050554b5d543e5eb5b2dbc5
|
|
| BLAKE2b-256 |
23baa0329252556c7782f43b205537ced5738dac0fec06c27cf4c74be49061ae
|
Provenance
The following attestation bundles were made for mlconjug3-4.0.1.tar.gz:
Publisher:
upload_wheels_to_pypi.yml on Ars-Linguistica/mlconjug3
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
mlconjug3-4.0.1.tar.gz -
Subject digest:
e589f9db9e02ff61b586025937c8ec6ec8df6df49177bc20392287e03a78be13 - Sigstore transparency entry: 1340595583
- Sigstore integration time:
-
Permalink:
Ars-Linguistica/mlconjug3@691b199c60c523edcc3f0753390f02fbd8714e0b -
Branch / Tag:
refs/heads/master - Owner: https://github.com/Ars-Linguistica
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
upload_wheels_to_pypi.yml@691b199c60c523edcc3f0753390f02fbd8714e0b -
Trigger Event:
workflow_dispatch
-
Statement type:
File details
Details for the file mlconjug3-4.0.1-py3-none-any.whl.
File metadata
- Download URL: mlconjug3-4.0.1-py3-none-any.whl
- Upload date:
- Size: 5.5 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
fbac083da0be5ca670bbde736111d0740867c39ae70da2ba345b369a5c9c3136
|
|
| MD5 |
9d6d13645b2891186554b2298965468c
|
|
| BLAKE2b-256 |
77e5ace2b037e578256aed9bab541e94f2848a820c569d232878e313cde0003c
|
Provenance
The following attestation bundles were made for mlconjug3-4.0.1-py3-none-any.whl:
Publisher:
upload_wheels_to_pypi.yml on Ars-Linguistica/mlconjug3
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
mlconjug3-4.0.1-py3-none-any.whl -
Subject digest:
fbac083da0be5ca670bbde736111d0740867c39ae70da2ba345b369a5c9c3136 - Sigstore transparency entry: 1340595584
- Sigstore integration time:
-
Permalink:
Ars-Linguistica/mlconjug3@691b199c60c523edcc3f0753390f02fbd8714e0b -
Branch / Tag:
refs/heads/master - Owner: https://github.com/Ars-Linguistica
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
upload_wheels_to_pypi.yml@691b199c60c523edcc3f0753390f02fbd8714e0b -
Trigger Event:
workflow_dispatch
-
Statement type: