Skip to main content

Cross-framework localization audit and translation QA toolkit

Project description

🌍 L10n Audit Toolkit (v1.2.4)

Version Architecture Tests Release

📚 Documentation: 👉 https://wael-daaboul.github.io/L10n-Audit-Toolkit/

pipx install l10n-audit-toolkit

The L10n Audit Toolkit is a professional-grade, project-agnostic localization QA and translation audit engine. Designed for modern engineering teams, it provides automated linguistic validation, semantic risk assessment, and smart auto-fixing for complex, multi-framework applications.


🏗️ Version 1.2.4: Universal Architecture

Starting with v1.2.4, the toolkit has transitioned to a Universal, Data-Driven Architecture. The core engine is now completely decoupled from specific project domains or frameworks.

[!IMPORTANT] All audit logic, terminology rules, and entity protections are now dynamically driven by your local configuration. This means the tool works flawlessly for medical apps, banking platforms, ridesharing services, or games without any code changes.


🚀 Quick Start & Configuration

The toolkit uses a Self-Documenting Configuration system with vertical, bilingual (Arabic/English) annotations to eliminate any ambiguity.

1. Initialize Your Workspace

Run the following command in your project root to generate the necessary directory structure:

l10n-audit init

2. Configure Your Audit

Copy the provided template and customize it to your project's needs:

cp config.json.example config.json

3. Namespace Overview

Your config.json is organized into four logical namespaces:

Namespace Responsibility Primary Settings
project_detection Framework discovery auto_detect, force_profile
audit_rules Linguistic precision role_identifiers, latin_whitelist, apply_safe_fixes
ai_review Semantic intelligence enabled, provider, model, api_key_env
output Results management results_dir, retention_mode

💎 Core Features

🧠 Smart AI Semantic Review

V1.2.4 integrates LiteLLM to provide deep semantic validation of identified issues. This eliminates false positives by understanding the intent and context of your translations.

  • Provider Agnostic: Supports OpenAI, DeepSeek, Anthropic, and local models.
  • Cost Optimization: Use low-cost 'mini' models (e.g., gpt-4o-mini, deepseek-chat) and tune the short_label_threshold to skip trivial labels like "OK" or "Save".
  • Secure Integration: Never hardcode keys; use api_key_env to point to your system's environment variables.

🛠️ The Smart Auto-Fixer (--apply-safe-fixes)

Standardize your terminology automatically. If enabled, the tool will read glossary.json and replace forbidden_terms with their approved equivalents directly in your locale files.

  • Whole-Word Matching: Prevents accidental substring replacements.
  • RTL/LTR Aware: Maintains script integrity during replacement.

📁 Results Archiving & Retention

Maintain full audit traceability across your project's history.

  • overwrite: Default mode. Replaces the last audit's Results directory.
  • archive: Moves previous results to a timestamped _archives/ folder before starting a new run. Perfect for CI/CD audit trails.

⌨️ CLI Command Reference

Execute audits with precision using the standardized CLI interface.

Command Description
l10n-audit --version Verify installation (should show 1.2.4)
l10n-audit run --stage fast Perform terminology and QC checks only
l10n-audit run --stage full Run the complete audit suite (Grammar, AI, Terminology, QC)
l10n-audit run --apply-safe-fixes Audit and automatically apply terminology corrections
l10n-audit doctor Diagnose workspace and framework discovery issues

📝 Technical Notes for Power Users

  • Brand Protection: Use the latin_whitelist in audit_rules to prevent the engine from flagging your brand name or technical terms (e.g., "DeepSeek", "API") as 'mixed-script' errors in Arabic text.
  • Context Preservation: Defining role_identifiers (e.g., ['admin', 'captain']) ensures the AI and heuristic engines understand your app's specific persona contexts.
  • Performance: Batch sizes can be adjusted via ai_review.batch_size (default: 20) to balance between execution speed and API rate limits.

🤝 Contributing & Support

For issues, architectural questions, or feature requests, please refer to the internal documentation or contact the Advanced Agentic Coding team.


Generated by Antigravity AI for L10n-Audit v1.2.4

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

l10n_audit_toolkit-1.2.4.tar.gz (109.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

l10n_audit_toolkit-1.2.4-py3-none-any.whl (131.0 kB view details)

Uploaded Python 3

File details

Details for the file l10n_audit_toolkit-1.2.4.tar.gz.

File metadata

  • Download URL: l10n_audit_toolkit-1.2.4.tar.gz
  • Upload date:
  • Size: 109.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for l10n_audit_toolkit-1.2.4.tar.gz
Algorithm Hash digest
SHA256 f4a53837a40fffcab153e5fe809d6dee14ec2022ed5c2782907f803e8d207b80
MD5 e55d10041a1ab8c918cc29beb0dc12fe
BLAKE2b-256 50aa8a291ce7b35c19d1a414c7cec35da219ee2f7cedb8334a846f9037296cc1

See more details on using hashes here.

File details

Details for the file l10n_audit_toolkit-1.2.4-py3-none-any.whl.

File metadata

File hashes

Hashes for l10n_audit_toolkit-1.2.4-py3-none-any.whl
Algorithm Hash digest
SHA256 755f39d78d0cf7c75fd82af0d848b75b87c9b89e56c314c391adf5c076d6e98d
MD5 a354fb1cb8951cacf254553bd9223703
BLAKE2b-256 055d365a99c03655cf5940e062c2950df0ca29f1a9cd2a734d0015141d759e42

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page