Cross-framework localization audit and translation QA toolkit

🌍 L10n Audit Toolkit (v1.2.5)

📚 Documentation: 👉 https://wael-daaboul.github.io/L10n-Audit-Toolkit/

pipx install l10n-audit-toolkit

The L10n Audit Toolkit is a professional-grade, project-agnostic localization QA and translation audit engine. Designed for modern engineering teams, it provides automated linguistic validation, semantic risk assessment, and smart auto-fixing for complex, multi-framework applications.


🏗️ Version 1.2.5: Universal Architecture

Starting with v1.2.5, the toolkit has transitioned to a Universal, Data-Driven Architecture. The core engine is now completely decoupled from specific project domains or frameworks.

Important: All audit logic, terminology rules, and entity protections are now driven by your local configuration, so the same engine can be applied to medical apps, banking platforms, ridesharing services, or games without any code changes.


🚀 Quick Start & Configuration

The toolkit uses a Self-Documenting Configuration system with vertical, bilingual (Arabic/English) annotations to eliminate any ambiguity.

1. Initialize Your Workspace

Run the following command in your project root to generate the necessary directory structure:

l10n-audit init

2. Configure Your Audit

Copy the provided template and customize it to your project's needs:

cp config.json.example config.json

3. Namespace Overview

Your config.json is organized into four logical namespaces:

Namespace         | Responsibility        | Primary Settings
------------------|-----------------------|---------------------------------------------------
project_detection | Framework discovery   | auto_detect, force_profile
audit_rules       | Linguistic precision  | role_identifiers, latin_whitelist, apply_safe_fixes
ai_review         | Semantic intelligence | enabled, provider, model, api_key_env
output            | Results management    | results_dir, retention_mode
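
To make the four namespaces concrete, here is a minimal Python sketch that builds an example configuration and checks that every setting named in the table is present. The namespace and setting names come from the table above; every value (for example "openai", "OPENAI_API_KEY", "Results", and null for force_profile) is an illustrative assumption, and missing_settings is a hypothetical helper, not part of the toolkit.

```python
import json

# Setting names taken from the namespace table; nothing else is assumed here.
EXPECTED_NAMESPACES = {
    "project_detection": {"auto_detect", "force_profile"},
    "audit_rules": {"role_identifiers", "latin_whitelist", "apply_safe_fixes"},
    "ai_review": {"enabled", "provider", "model", "api_key_env"},
    "output": {"results_dir", "retention_mode"},
}

# All values below are assumptions chosen for illustration only.
example_config = {
    "project_detection": {"auto_detect": True, "force_profile": None},
    "audit_rules": {
        "role_identifiers": ["admin", "captain"],
        "latin_whitelist": ["DeepSeek", "API"],
        "apply_safe_fixes": False,
    },
    "ai_review": {
        "enabled": True,
        "provider": "openai",
        "model": "gpt-4o-mini",
        "api_key_env": "OPENAI_API_KEY",
    },
    "output": {"results_dir": "Results", "retention_mode": "archive"},
}


def missing_settings(config):
    """Return dotted names of settings from the table that are absent."""
    missing = []
    for namespace, keys in EXPECTED_NAMESPACES.items():
        section = config.get(namespace, {})
        for key in sorted(keys):
            if key not in section:
                missing.append(f"{namespace}.{key}")
    return missing


# Print the example in the shape a config.json file might take.
print(json.dumps(example_config, indent=2))
```

Running missing_settings on a hand-edited configuration before an audit is a cheap way to catch a misspelled namespace or setting name.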

💎 Core Features

🧠 Smart AI Semantic Review

v1.2.5 integrates LiteLLM to provide semantic validation of identified issues. By taking the intent and context of each translation into account, it helps reduce false positives.

  • Provider Agnostic: Supports OpenAI, DeepSeek, Anthropic, and local models.
  • Cost Optimization: Use low-cost 'mini' models (e.g., gpt-4o-mini, deepseek-chat) and tune the short_label_threshold to skip trivial labels like "OK" or "Save".
  • Secure Integration: Never hardcode keys; use api_key_env to point to your system's environment variables.
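
The two configuration ideas above can be sketched in Python as follows. This is not the toolkit's implementation: resolve_api_key and needs_ai_review are hypothetical helpers, and treating short_label_threshold as a maximum character count with a value of 4 is an assumption made only for this example.

```python
import os


def resolve_api_key(api_key_env, environ=None):
    """Look up the API key by the environment variable NAME stored in config."""
    environ = os.environ if environ is None else environ
    key = environ.get(api_key_env)
    if not key:
        raise RuntimeError(f"Environment variable {api_key_env!r} is not set")
    return key


def needs_ai_review(text, short_label_threshold=4):
    """Skip trivial labels: assumed here to mean text at or below the threshold length."""
    return len(text.strip()) > short_label_threshold


labels = ["OK", "Save", "Save and continue"]
# Only labels longer than the threshold would be sent to the model.
to_review = [label for label in labels if needs_ai_review(label)]
print(to_review)
```

Because the configuration stores only the variable name, the key itself never appears in config.json or in version control.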

🛠️ The Smart Auto-Fixer (--apply-safe-fixes)

Standardize your terminology automatically. If enabled, the tool will read glossary.json and replace forbidden_terms with their approved equivalents directly in your locale files.

  • Whole-Word Matching: Prevents accidental substring replacements.
  • RTL/LTR Aware: Maintains script integrity during replacement.
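
Whole-word matching can be illustrated with a short Python sketch. The mapping passed in is a simplified, assumed shape for glossary data (forbidden term to approved term); the real layout of glossary.json is not described here, and apply_glossary is a hypothetical name.

```python
import re


def apply_glossary(text, forbidden_to_approved):
    """Replace forbidden terms only when they appear as whole words."""
    for forbidden, approved in forbidden_to_approved.items():
        # Lookarounds reject matches that touch another word character,
        # so "cab" inside "cabin" is left alone. \w is Unicode-aware for
        # Python str patterns, so Arabic letters count as word characters.
        pattern = re.compile(r"(?<!\w)" + re.escape(forbidden) + r"(?!\w)")
        # A function replacement avoids backslash interpretation in `approved`.
        text = pattern.sub(lambda match: approved, text)
    return text


print(apply_glossary("Call a cab from the cabin", {"cab": "taxi"}))
print(apply_glossary("سائق جديد", {"سائق": "كابتن"}))
```

Lookarounds are used instead of \b so that terms beginning or ending with punctuation still match predictably.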

📁 Results Archiving & Retention

Maintain full audit traceability across your project's history.

  • overwrite: Default mode. Replaces the last audit's Results directory.
  • archive: Moves previous results to a timestamped _archives/ folder before starting a new run. Perfect for CI/CD audit trails.
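
The difference between the two retention modes can be sketched in Python as below. This is an illustration under stated assumptions: prepare_results_dir is a hypothetical function, and both the location of _archives/ (next to the results directory) and the timestamp format are guesses rather than documented behavior.

```python
import shutil
from datetime import datetime
from pathlib import Path


def prepare_results_dir(results_dir, retention_mode="overwrite", now=None):
    """Clear or archive a previous run, then return the archive path (or None)."""
    if retention_mode not in ("overwrite", "archive"):
        raise ValueError(f"Unknown retention_mode: {retention_mode!r}")

    results = Path(results_dir)
    archived_to = None
    if results.exists():
        if retention_mode == "archive":
            # Assumed layout: <parent>/_archives/<name>-<timestamp>
            stamp = (now or datetime.now()).strftime("%Y%m%d-%H%M%S")
            archive_root = results.parent / "_archives"
            archive_root.mkdir(parents=True, exist_ok=True)
            archived_to = archive_root / f"{results.name}-{stamp}"
            shutil.move(str(results), str(archived_to))
        else:
            # overwrite: discard the previous run's output entirely
            shutil.rmtree(results)

    # Either way, the new run starts with an empty results directory.
    results.mkdir(parents=True, exist_ok=True)
    return archived_to
```

In a CI job, the archive mode keeps one folder per run, which is what makes a later comparison between audits possible.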

⌨️ CLI Command Reference

Execute audits with precision using the standardized CLI interface.

Command                           | Description
----------------------------------|-------------------------------------------------------------
l10n-audit --version              | Verify installation (should show 1.2.5)
l10n-audit run --stage fast       | Perform terminology and QC checks only
l10n-audit run --stage full       | Run the complete audit suite (Grammar, AI, Terminology, QC)
l10n-audit run --apply-safe-fixes | Audit and automatically apply terminology corrections
l10n-audit doctor                 | Diagnose workspace and framework discovery issues

📝 Technical Notes for Power Users

  • Brand Protection: Use the latin_whitelist in audit_rules to prevent the engine from flagging your brand name or technical terms (e.g., "DeepSeek", "API") as 'mixed-script' errors in Arabic text.
  • Context Preservation: Defining role_identifiers (e.g., ['admin', 'captain']) ensures the AI and heuristic engines understand your app's specific persona contexts.
  • Performance: Batch sizes can be adjusted via ai_review.batch_size (default: 20) to balance between execution speed and API rate limits.
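
As a rough illustration of the first and third notes, the Python sketch below flags Latin-script words inside Arabic text unless they are whitelisted, and splits a list of items into batches of 20. Both functions are hypothetical and simplified: the toolkit's actual mixed-script heuristic and batching logic are not documented here, and the case-insensitive whitelist comparison is an assumption.

```python
import re

LATIN_WORD = re.compile(r"[A-Za-z][A-Za-z0-9]*")
ARABIC_CHAR = re.compile(r"[\u0600-\u06FF]")  # basic Arabic Unicode block


def mixed_script_terms(text, latin_whitelist):
    """Return Latin words found in Arabic text that are not whitelisted."""
    if not ARABIC_CHAR.search(text):
        return []  # no Arabic present, so nothing is "mixed"
    allowed = {term.casefold() for term in latin_whitelist}
    return [w for w in LATIN_WORD.findall(text) if w.casefold() not in allowed]


def batches(items, batch_size=20):
    """Split items into consecutive chunks of at most batch_size."""
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    return [items[i:i + batch_size] for i in range(0, len(items), batch_size)]


print(mixed_script_terms("استخدم DeepSeek عبر API أو Cloud", ["DeepSeek", "API"]))
print([len(chunk) for chunk in batches(list(range(45)))])
```

Smaller batches mean more requests but less work lost when a single request hits a rate limit; larger batches trade the other way.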

🤝 Contributing & Support

For issues, architectural questions, or feature requests, please refer to the internal documentation or contact the Advanced Agentic Coding team.


Generated by Antigravity AI for L10n-Audit v1.2.5
