Skip to main content

Cross-framework localization audit and translation QA toolkit

Project description

🌍 L10n Audit Toolkit (v1.2.2)

Version Architecture Tests

The L10n Audit Toolkit is a professional-grade, project-agnostic localization QA and translation audit engine. Designed for modern engineering teams, it provides automated linguistic validation, semantic risk assessment, and smart auto-fixing for complex, multi-framework applications.


🏗️ Version 1.2.2: Universal Architecture

Starting with v1.2.2, the toolkit has transitioned to a Universal, Data-Driven Architecture. The core engine is now completely decoupled from specific project domains or frameworks.

[!IMPORTANT] All audit logic, terminology rules, and entity protections are now dynamically driven by your local configuration. This means the tool works flawlessly for medical apps, banking platforms, ridesharing services, or games without any code changes.


🚀 Quick Start & Configuration

The toolkit uses a Self-Documenting Configuration system with vertical, bilingual (Arabic/English) annotations to eliminate any ambiguity.

1. Initialize Your Workspace

Run the following command in your project root to generate the necessary directory structure:

l10n-audit init

2. Configure Your Audit

Copy the provided template and customize it to your project's needs:

cp config.json.example config.json

3. Namespace Overview

Your config.json is organized into four logical namespaces:

Namespace Responsibility Primary Settings
project_detection Framework discovery auto_detect, force_profile
audit_rules Linguistic precision role_identifiers, latin_whitelist, apply_safe_fixes
ai_review Semantic intelligence enabled, provider, model, api_key_env
output Results management results_dir, retention_mode

💎 Core Features

🧠 Smart AI Semantic Review

V1.2.2 integrates LiteLLM to provide deep semantic validation of identified issues. This eliminates false positives by understanding the intent and context of your translations.

  • Provider Agnostic: Supports OpenAI, DeepSeek, Anthropic, and local models.
  • Cost Optimization: Use low-cost 'mini' models (e.g., gpt-4o-mini, deepseek-chat) and tune the short_label_threshold to skip trivial labels like "OK" or "Save".
  • Secure Integration: Never hardcode keys; use api_key_env to point to your system's environment variables.

🛠️ The Smart Auto-Fixer (--apply-safe-fixes)

Standardize your terminology automatically. If enabled, the tool will read glossary.json and replace forbidden_terms with their approved equivalents directly in your locale files.

  • Whole-Word Matching: Prevents accidental substring replacements.
  • RTL/LTR Aware: Maintains script integrity during replacement.

📁 Results Archiving & Retention

Maintain full audit traceability across your project's history.

  • overwrite: Default mode. Replaces the last audit's Results directory.
  • archive: Moves previous results to a timestamped _archives/ folder before starting a new run. Perfect for CI/CD audit trails.

⌨️ CLI Command Reference

Execute audits with precision using the standardized CLI interface.

Command Description
l10n-audit --version Verify installation (should show 1.2.2)
l10n-audit run --stage fast Perform terminology and QC checks only
l10n-audit run --stage full Run the complete audit suite (Grammar, AI, Terminology, QC)
l10n-audit run --apply-safe-fixes Audit and automatically apply terminology corrections
l10n-audit doctor Diagnose workspace and framework discovery issues

📝 Technical Notes for Power Users

  • Brand Protection: Use the latin_whitelist in audit_rules to prevent the engine from flagging your brand name or technical terms (e.g., "DeepSeek", "API") as 'mixed-script' errors in Arabic text.
  • Context Preservation: Defining role_identifiers (e.g., ['admin', 'captain']) ensures the AI and heuristic engines understand your app's specific persona contexts.
  • Performance: Batch sizes can be adjusted via ai_review.batch_size (default: 20) to balance between execution speed and API rate limits.

🤝 Contributing & Support

For issues, architectural questions, or feature requests, please refer to the internal documentation or contact the Advanced Agentic Coding team.


Generated by Antigravity AI for L10n-Audit v1.2.2

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

l10n_audit_toolkit-1.2.2.tar.gz (107.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

l10n_audit_toolkit-1.2.2-py3-none-any.whl (129.6 kB view details)

Uploaded Python 3

File details

Details for the file l10n_audit_toolkit-1.2.2.tar.gz.

File metadata

  • Download URL: l10n_audit_toolkit-1.2.2.tar.gz
  • Upload date:
  • Size: 107.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for l10n_audit_toolkit-1.2.2.tar.gz
Algorithm Hash digest
SHA256 ede9dab9cb595425232703aa8ce6f5511b20f559f43102499b079bc6de1bd5a5
MD5 863b07deb632692f29619d563178e6d8
BLAKE2b-256 0152b399cdb174607a4fb70403040e47e710e292a3afef26734e09afcdfa2f8c

See more details on using hashes here.

File details

Details for the file l10n_audit_toolkit-1.2.2-py3-none-any.whl.

File metadata

File hashes

Hashes for l10n_audit_toolkit-1.2.2-py3-none-any.whl
Algorithm Hash digest
SHA256 d4adbb4517071dff4d2dad21e8987b40b262122047776efe6d949a5c7bcad9d1
MD5 e297ce403d71c1bc7c17202fa01ff079
BLAKE2b-256 d06f916ee0af31531cbd980fccd9c9502b28651cb2123e04f530685ece4cc930

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page