Skip to main content

A translator wrapper for open-source translation models with batch optimization. Currently supports ArgosTranslate models and is designed for scalable deployment.

Project description

ABTranslate

A translator wrapper for open-source translation models with batch optimization. Currently supports ArgosTranslate models and is designed for scalable deployment.

Features

  • ✅ Optimized translation for list or pandas.Series of text
  • 🛡️ Supports masking of specific patterns to exclude them from translation
  • ⚙️ Thread-tuned for multi-threaded deployment with ctranslate2
  • 📦 Compatible with ArgosTranslate model packages

Installation

pip install abtranslate

Usage

from abtranslate import load_argostranslate_model

# Load model from local path
model_package = load_argostranslate_model(model_path, package_dir=package_extraction_dir)

# Load translator with optional optimizations
translator = model_package.load_translator(optimized_config=True, lazy_load=False)

# Define translation quality settings (based on CTranslate2)
QUALITY_CONFIG = {
    "beam_size": 4,
    "num_hypotheses": 1,
    "replace_unknowns": True,
}

# Translate a batch of texts (e.g. pandas Series or list)
translated_texts = translator.translate_batch(df["input_text"], translation_config=QUALITY_CONFIG)

🔗 Refer to CTranslate2 Translate Batch Parameters for more configuration options.

Main Functions

  • load_argostranslate_model(model_path: str, package_dir: str) -> ModelPackage: Loads and prepares an ArgosTranslate model for usage.

  • ModelPackage.load_translator(translator_config: dict, optimized_config: bool, lazy_load: bool) -> Translator: Returns a translator instance with optional optimizations.

  • Translator.translate_batch(text_list: List[str] | pd.Series, translation_config: dict, return_type: any) -> List[str]: Performs high-performance batch translation with support for multi-threading and tuning.

License

MIT

Author

Ichsan Takwa GitHub

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

abtranslate-0.1.30.tar.gz (23.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

abtranslate-0.1.30-py3-none-any.whl (26.8 kB view details)

Uploaded Python 3

File details

Details for the file abtranslate-0.1.30.tar.gz.

File metadata

  • Download URL: abtranslate-0.1.30.tar.gz
  • Upload date:
  • Size: 23.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.17

File hashes

Hashes for abtranslate-0.1.30.tar.gz
Algorithm Hash digest
SHA256 11ef9a5911eee95989e3c9b6d7896e6b801cb69b9ee73f2606ff8a2e7191c2fb
MD5 a683e1cad674576415477fdfba691567
BLAKE2b-256 f88d68e10b95adb33ad331fea351acf4265701430fb8cdf514bc7221acc00ebf

See more details on using hashes here.

File details

Details for the file abtranslate-0.1.30-py3-none-any.whl.

File metadata

  • Download URL: abtranslate-0.1.30-py3-none-any.whl
  • Upload date:
  • Size: 26.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.17

File hashes

Hashes for abtranslate-0.1.30-py3-none-any.whl
Algorithm Hash digest
SHA256 6d8977e2f8fdd8d8e860f1366e07b649d4dc5d79f021bc0e21e260fd3d80c4ae
MD5 587022111a50427c63596e191ded3509
BLAKE2b-256 fba8daaae8e357d9c232ea8feed7b87296899534dbc9640b2f88c2a85c6673bd

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page