Skip to main content

Multilingual phonetic-similarity replacement engine — a proper-noun substitution tool based on phonetic similarity, supporting ASR/LLM post-processing.

Project description

English 繁體中文

Phonofix (Multilingual Phonetic Substitution Engine)

Phonofix is a proper-noun substitution tool built around phonetic similarity. It is useful for ASR/LLM post-processing, proper-noun standardization, and homophone/near-homophone correction.

Project structure and the full API/symbol snapshot: snapshot.md

Changelog: CHANGELOG.md (Traditional Chinese: CHANGELOG.zh-TW.md)


Table of Contents


Supported Languages

Language Phonetic Key Engine Extras
Chinese Pinyin ChineseEngine phonofix[ch]
English IPA EnglishEngine phonofix[en]
Japanese Romaji JapaneseEngine phonofix[ja]

Note: More languages will be added over time. Since the author is not fluent in all languages, some language modules are developed with AI assistance and may contain mistakes. If you find issues in real usage, please report them via GitHub Issues:

Core Concepts

  • You provide a proper-noun dictionary (canonical + aliases/config).
  • The system maps both the dictionary and the input text into a phonetic key space.
  • Matches are applied back to the original string, outputting the canonical spelling.

Note: This is not a full-text spell checker; it focuses on the proper nouns you care about.

Quick Start (Latest API)

Chinese

from phonofix import ChineseEngine

engine = ChineseEngine()
corrector = engine.create_corrector({"台北車站": ["北車", "胎北車站"]})

print(corrector.correct("我在北車等你"))
# Output: 我在台北車站等你

English (requires espeak-ng)

from phonofix import EnglishEngine

engine = EnglishEngine()
corrector = engine.create_corrector({"TensorFlow": ["Ten so floor"], "Python": ["Pyton"]})

print(corrector.correct("I use Pyton to write Ten so floor code"))
# Output: I use Python to write TensorFlow code

Japanese

from phonofix import JapaneseEngine

engine = JapaneseEngine()
corrector = engine.create_corrector({"会議": ["kaigi"], "ロボット": ["robotto"]})

print(corrector.correct("明日のkaigiに参加します"))
# Output: 明日の会議に参加します
print(corrector.correct("新しいrobottoのkaihatsu"))  # Example: can also correct other terms
# Output: 新しいロボットのkaihatsu

Mixed Language (manual chaining)

This project does not perform automatic language detection. For mixed inputs, manually chain correctors:

from phonofix import ChineseEngine, EnglishEngine

ch = ChineseEngine().create_corrector({"台北車站": ["北車"]})
en = EnglishEngine().create_corrector({"Python": ["Pyton"]})

text = "我在北車用Pyton寫code"
text = en.correct(text, full_context=text)
text = ch.correct(text, full_context=text)
print(text)
# Output: 我在台北車站用Python寫code

Observability & Failure Policy

Principle: degrade is allowed, but silent degrade is not.

  • on_event: preferred SDK surface for collecting replacements, errors, and degrade signals
  • silent=True: only disables logger output; events can still be used for observability
  • fail_policy:
    • "degrade" (default): on fuzzy exception, fall back to exact-only and emit events
    • "raise": on fuzzy exception, raise immediately (good for CI/offline evaluation)
  • mode:
    • "production" is equivalent to fail_policy="degrade"
    • "evaluation" is equivalent to fail_policy="raise"
  • trace_id: correlation ID for events produced by a single correct() call (caller-provided)

Substitution Algorithm Flow (Overview)

Reference implementation: PipelineCorrectorBase.correct() (see snapshot.md for the full symbol list).

This project does not do automatic language detection. For mixed-language inputs, manually chain correctors (see the example above).

Input text
    │
    ▼
┌─────────────────────────────────────┐
│ 1. Build a protection mask          │
│    Mark spans covered by            │
│    protected_terms                  │
│    Protected spans are excluded     │
│    from substitution               │
└─────────────────────────────────────┘
    │
    ▼
┌─────────────────────────────────────┐
│ 2. Generate candidate drafts        │
│    2.1 exact: Aho-Corasick matches  │
│    2.2 fuzzy: sliding windows +     │
│         phonetic similarity         │
│         - can degrade to exact-only │
│           via fail_policy           │
└─────────────────────────────────────┘
    │
    ▼
┌─────────────────────────────────────┐
│ 3. Filter by keywords/exclude_when  │
│    - exclude_when matched → skip    │
│    - keywords not satisfied → skip  │
└─────────────────────────────────────┘
    │
    ▼
┌─────────────────────────────────────┐
│ 4. Calculate final score            │
│    Score = error_ratio - weight -   │
│            context_bonus            │
│    (lower is better)                │
└─────────────────────────────────────┘
    │
    ▼
┌─────────────────────────────────────┐
│ 5. Conflict resolution              │
│    Sort by score, keep the best     │
│    non-overlapping candidates       │
└─────────────────────────────────────┘
    │
    ▼
┌─────────────────────────────────────┐
│ 6. Apply replacements               │
│    Rebuild the output string in     │
│    ascending start order to avoid   │
│    index shifting                   │
│    and ensure consistent output     │
└─────────────────────────────────────┘
    │
    ▼
Output

Installation

Requirements

  • Python >=3.10

Using uv (recommended)

uv add phonofix

You can also use extras to make dependency intent explicit (actual versions are defined in pyproject.toml):

uv add "phonofix[ch]"
uv add "phonofix[en]"
uv add "phonofix[ja]"

English Support (espeak-ng)

English phonetic features depend on the system package espeak-ng.

Recommended: use the setup scripts under scripts/ (they help install and configure environment variables):

  • Windows PowerShell: .\scripts\setup_espeak.ps1
  • Windows CMD: scripts\setup_espeak_windows.bat
  • macOS / Linux: ./scripts/setup_espeak.sh

Manual installation:

  • macOS: brew install espeak-ng
  • Linux: apt install espeak-ng

Development & Validation

  • Run tests: pytest -q
  • Run examples:
    • Chinese: python examples/chinese_examples.py
    • English: python examples/english_examples.py
    • Japanese: python examples/japanese_examples.py
  • Generate project snapshot: python tools/snapshot.py (outputs snapshot.md)

License

MIT License

Acknowledgments

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

phonofix-0.3.0.tar.gz (265.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

phonofix-0.3.0-py3-none-any.whl (127.6 kB view details)

Uploaded Python 3

File details

Details for the file phonofix-0.3.0.tar.gz.

File metadata

  • Download URL: phonofix-0.3.0.tar.gz
  • Upload date:
  • Size: 265.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.9.8

File hashes

Hashes for phonofix-0.3.0.tar.gz
Algorithm Hash digest
SHA256 a9c2edacc4ab2617ad367a0b39822dd258bab9c27dc91cbef2cdfdc8d3732ee8
MD5 cfeac924abdaa16c630eb23f55960993
BLAKE2b-256 90cca900ccdfcf4ebe461fff85848b763a83695067b620aeab4a4998de39d22e

See more details on using hashes here.

File details

Details for the file phonofix-0.3.0-py3-none-any.whl.

File metadata

  • Download URL: phonofix-0.3.0-py3-none-any.whl
  • Upload date:
  • Size: 127.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.9.8

File hashes

Hashes for phonofix-0.3.0-py3-none-any.whl
Algorithm Hash digest
SHA256 8a36aa0b48d0e22ed67c851776fc466a5bba7fdc32035cc322190c8710cbefe7
MD5 4415fc4f2916eb0bdb532fd2d9f4b993
BLAKE2b-256 9b7d662cecb9bb20a886460f8aa9fff73ccaf93757eadd8e7f91363ec2513101

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page