Knowledge Translation Transmutation Core - Transforming translations into gold-standard quality
Project description
KTTC - Knowledge Translation Transmutation Core
Transforming translations into gold-standard quality
Autonomous multi-agent platform with 90% cost reduction and 1000x speed improvement
๐ฏ Overview
KTTC (Knowledge Translation Transmutation Core) is an autonomous translation quality assurance platform powered by AI. It uses specialized multi-agent systems to automatically detect, analyze, and validate translation quality issues according to industry-standard MQM (Multidimensional Quality Metrics) framework.
Key Features:
- ๐ค Multi-agent QA - 3 specialized agents (Accuracy, Fluency, Terminology) + Orchestrator
- ๐ MQM Scoring - Industry-standard quality metrics from WMT benchmarks
- โก 90% cost reduction vs manual review
- ๐ 100-1000x faster than human evaluation
- ๐ CI/CD native - GitHub Actions ready
- ๐ฏ 95+ MQM target - Production-grade quality threshold
- ๐ Multi-LLM support - OpenAI, Anthropic, YandexGPT, GigaChat
๐ Quick Start
Installation
# Install from PyPI (coming soon)
pip install kttc
# Or install from source
git clone https://github.com/kttc-ai/kttc.git
cd kttc
pip install -e ".[dev]"
Basic Usage
# Set your API key
export KTTC_OPENAI_API_KEY="sk-..."
# Check translation quality
kttc check \
--source source.txt \
--translation translation.txt \
--source-lang en \
--target-lang es \
--threshold 95
# Output:
# โ
MQM Score: 96.5 (PASS)
# โ ๏ธ 2 minor issues found
Python API
import asyncio
from kttc.agents.orchestrator import AgentOrchestrator
from kttc.llm.openai_provider import OpenAIProvider
from kttc.core.models import TranslationTask
async def check_quality():
# Setup LLM provider
llm = OpenAIProvider(api_key="your-api-key")
# Create orchestrator
orchestrator = AgentOrchestrator(llm)
# Create translation task
task = TranslationTask(
source_text="Hello, world!",
translation="ยกHola, mundo!",
source_lang="en",
target_lang="es",
)
# Evaluate quality
report = await orchestrator.evaluate(task)
print(f"MQM Score: {report.mqm_score}")
print(f"Status: {report.status}")
print(f"Errors found: {len(report.errors)}")
# Run
asyncio.run(check_quality())
Available Commands
kttc check- Check translation quality for a single filekttc translate- Translate text with automatic quality checking (coming soon)kttc batch- Batch process multiple translation fileskttc report- Generate formatted reports (Markdown/HTML)
Run kttc <command> --help for detailed options.
๐๏ธ Architecture
System Architecture
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
โ CLI Layer โ
โ (Typer + Rich UI) โ
โโโโโโโโโโโโโโโโโโโฌโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
โ
โโโโโโโโโโโโโโโโโโโผโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
โ Agent Orchestrator โ
โ (Coordinates QA Workflow) โ
โโโโโโโฌโโโโโโโโโโโฌโโโโโโโโโโโโโฌโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
โ โ โ
โโโโโโโผโโโ โโโโโผโโโโโ โโโโโผโโโโโโโ
โAccuracyโ โFluency โ โTerminologyโ
โ Agent โ โ Agent โ โ Agent โ
โโโโโโฌโโโโ โโโโโโฌโโโโ โโโโโโฌโโโโโโ
โ โ โ
โโโโโโโโโโโโโผโโโโโโโโโโโโ
โ
โโโโโโโโโโผโโโโโโโโโโ
โ Error Parser โ
โโโโโโโโโโฌโโโโโโโโโโ
โ
โโโโโโโโโโผโโโโโโโโโโ
โ MQM Scorer โ
โโโโโโโโโโฌโโโโโโโโโโ
โ
โโโโโโโโโโผโโโโโโโโโโ
โ QA Report โ
โ (JSON/Markdown) โ
โโโโโโโโโโโโโโโโโโโโ
Agent System
Each specialized agent evaluates different quality aspects:
- Accuracy Agent: Semantic correctness, meaning preservation
- Fluency Agent: Grammar, naturalness, readability
- Terminology Agent: Domain-specific term consistency
The orchestrator coordinates agents, aggregates results, and calculates final MQM scores.
MQM Scoring
Quality scoring follows the Multidimensional Quality Metrics framework:
- Score Range: 0-100 (higher is better)
- Pass Threshold: 95+ (configurable)
- Error Weights:
- Neutral: 0 points
- Minor: 1 point
- Major: 5 points
- Critical: 10 points
Formula: MQM Score = 100 - (total_penalty / word_count * 1000)
๐ ๏ธ Development
Setup
# Clone repository
git clone git@github.com:kttc-ai/kttc.git
cd kttc
# Create virtual environment (Python 3.11+ required)
python3.11 -m venv venv
source venv/bin/activate # On Windows: venv\Scripts\activate
# Install dependencies
pip install -e ".[dev]"
# Setup pre-commit hooks
pre-commit install
# Run tests
pytest
# Run quality checks
black src/ tests/
ruff check src/ tests/
mypy src/kttc --strict
Project Structure
kttc/
โโโ .github/
โ โโโ workflows/ # CI/CD workflows
โ โโโ ISSUE_TEMPLATE/ # Issue templates
โ โโโ PULL_REQUEST_TEMPLATE.md
โโโ src/
โ โโโ kttc/ # Main package
โ โโโ cli/ # CLI interface (Typer)
โ โโโ agents/ # QA agents
โ โ โโโ accuracy.py
โ โ โโโ fluency.py
โ โ โโโ terminology.py
โ โ โโโ orchestrator.py
โ โ โโโ parser.py
โ โโโ core/ # Core logic
โ โ โโโ models.py # Pydantic models
โ โ โโโ mqm.py # MQM scoring
โ โโโ llm/ # LLM providers
โ โ โโโ openai_provider.py
โ โ โโโ anthropic_provider.py
โ โ โโโ yandex_provider.py
โ โ โโโ gigachat_provider.py
โ โโโ utils/ # Utilities
โ โโโ config.py # Configuration
โโโ tests/
โ โโโ unit/ # Unit tests
โ โโโ integration/ # Integration tests
โ โโโ e2e/ # End-to-end tests
โโโ docs/
โ โโโ api/ # API documentation
โ โโโ guides/ # User guides
โ โโโ development/ # Developer guides
โโโ examples/ # Example scripts
โ โโโ basic_usage.py
โ โโโ batch_processing.py
โโโ CODE_OF_CONDUCT.md
โโโ CONTRIBUTING.md
โโโ SECURITY.md
โโโ LICENSE
โโโ pyproject.toml
Testing
# Run all tests
pytest
# Run with coverage
pytest --cov=kttc --cov-report=html
# Run specific test categories
pytest tests/unit/
pytest tests/integration/
pytest tests/e2e/
# Run with markers
pytest -m "not slow"
Code Quality
The project maintains strict code quality standards:
- Type Checking: mypy with
--strictmode - Formatting: black (line length: 100)
- Linting: ruff (Python 3.11+)
- Testing: pytest with 100% coverage
- Pre-commit: Automated checks on commit
๐ Documentation
- API Documentation - Python API reference
- User Guide - Comprehensive user guide
- Developer Guide - Contributing guide
- Examples - Code examples and tutorials
๐ค Contributing
We welcome contributions! Please see our Contributing Guide for details.
Quick Start for Contributors:
- Fork the repository
- Create a feature branch (
git checkout -b feature/amazing-feature) - Make your changes and add tests
- Run quality checks:
pre-commit run --all-files && pytest - Commit using Conventional Commits
- Push and open a Pull Request
Please read our Code of Conduct before contributing.
Development Workflow
# Create feature branch
git checkout -b feature/my-feature
# Make changes and test
pytest
# Format and lint
black src/ tests/
ruff check src/ tests/ --fix
mypy src/kttc --strict
# Commit with conventional commit message
git commit -m "feat: add new feature"
# Push and create PR
git push origin feature/my-feature
Reporting Issues
Found a bug or have a feature request?
- Check our issue tracker
- Use issue templates for bugs and features
- Provide detailed reproduction steps
Security
For security vulnerabilities, please see our Security Policy. Do not open public issues for security concerns.
๐ Roadmap
Current Status (Alpha v0.1.0)
- โ Core multi-agent QA system
- โ MQM scoring engine
- โ CLI interface
- โ OpenAI & Anthropic support
- โ Batch processing
- โ CI/CD integration
Coming Soon (v0.2.0)
- ๐ Neural metrics (COMET, BLEURT)
- ๐ GitHub Actions workflow
- ๐ Translation memory integration
- ๐ Custom agent creation API
- ๐ WebUI dashboard
- ๐ PyPI package
Future (v1.0.0)
- ๐ Automatic translation fixing
- ๐ Multi-language support expansion
- ๐ Enterprise features
- ๐ Cloud-hosted service
๐ Benchmarks
Performance comparison with manual review:
| Metric | Manual Review | KTTC | Improvement |
|---|---|---|---|
| Speed | 1x baseline | 100-1000x | ๐ |
| Cost per word | $0.10-0.50 | $0.01-0.05 | 90% reduction |
| Consistency | Subjective | Objective | โ |
| Scalability | Limited | Unlimited | โ |
Benchmarks based on 10,000 word corpus evaluation
๐ License
MIT License - see LICENSE file for details.
๐ Links
- Repository: https://github.com/kttc-ai/kttc
- Issues: https://github.com/kttc-ai/kttc/issues
- Discussions: https://github.com/kttc-ai/kttc/discussions
- Documentation: https://github.com/kttc-ai/docs
๐ก Citation
If you use KTTC in your research, please cite:
@software{kttc2025,
title = {KTTC: Knowledge Translation Transmutation Core},
author = {KTTC Development Team},
year = {2025},
url = {https://github.com/kttc-ai/kttc},
version = {0.1.0}
}
Last Updated: November 10, 2025
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file kttc-0.1.0.tar.gz.
File metadata
- Download URL: kttc-0.1.0.tar.gz
- Upload date:
- Size: 35.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
92a6bfc774d88709e7d1c736155364a3a47216ecb22ac9ea8a3c2f6747c4606b
|
|
| MD5 |
72812bf204b916525231200c8e12324c
|
|
| BLAKE2b-256 |
ef9709189f1e69255cda6bd142cec32343ae74505d18b398d77e2b83d1fe36fb
|
Provenance
The following attestation bundles were made for kttc-0.1.0.tar.gz:
Publisher:
publish.yml on kttc-ai/kttc
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
kttc-0.1.0.tar.gz -
Subject digest:
92a6bfc774d88709e7d1c736155364a3a47216ecb22ac9ea8a3c2f6747c4606b - Sigstore transparency entry: 686120775
- Sigstore integration time:
-
Permalink:
kttc-ai/kttc@96650de85d1ef5bfb03122d1b472befe2dc6cbdf -
Branch / Tag:
refs/tags/v0.1.0 - Owner: https://github.com/kttc-ai
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@96650de85d1ef5bfb03122d1b472befe2dc6cbdf -
Trigger Event:
release
-
Statement type:
File details
Details for the file kttc-0.1.0-py3-none-any.whl.
File metadata
- Download URL: kttc-0.1.0-py3-none-any.whl
- Upload date:
- Size: 41.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
5548041159733ae7510ec264feb1d90dd9514e576cfab59021e712743c5adbe1
|
|
| MD5 |
30d8843a579043dee4eb49b94b1962e2
|
|
| BLAKE2b-256 |
eadb12ec20b49ac1886b8ae5301751dcc801f6666a7e1cb0896f71cafdacf657
|
Provenance
The following attestation bundles were made for kttc-0.1.0-py3-none-any.whl:
Publisher:
publish.yml on kttc-ai/kttc
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
kttc-0.1.0-py3-none-any.whl -
Subject digest:
5548041159733ae7510ec264feb1d90dd9514e576cfab59021e712743c5adbe1 - Sigstore transparency entry: 686120778
- Sigstore integration time:
-
Permalink:
kttc-ai/kttc@96650de85d1ef5bfb03122d1b472befe2dc6cbdf -
Branch / Tag:
refs/tags/v0.1.0 - Owner: https://github.com/kttc-ai
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@96650de85d1ef5bfb03122d1b472befe2dc6cbdf -
Trigger Event:
release
-
Statement type: