Utilities for converting rich text markup files to XLIFF format

This package provides a set of utilities for converting rich-text markup files to the OASIS XLIFF translatable format. Currently, only HTML <=> XLIFF 1.2 conversion is supported.

Installation

Python XLIFF Converter can be installed using pip:

pip install py-xliff-converter

Usage

HTML => XLIFF

Command line:

html2xliff <myfile>.html

This command creates a <myfile>.xlf file that can be translated with most common online and offline CAT tools, such as Trados, memoQ, and Transifex.

API:

from xliff_converter.html_parser import convert_html
...
with open(html_filename, 'r', encoding='utf-8') as fo:
    html = fo.read()
xliff = convert_html(html, html_filename)
...

The convert_html(...) function returns a translatable XLIFF document as a UTF-8 encoded bytes string.
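
Because the result is bytes rather than str, it should be written to disk in binary mode. The following is a minimal sketch of that step, not part of the package: save_xliff is a hypothetical helper, and the converter is passed in as an argument so the helper makes no assumptions about convert_html beyond the call shown above (HTML text and file name in, bytes out).

```python
from pathlib import Path
from typing import Callable


def save_xliff(html_path: str, convert: Callable[[str, str], bytes]) -> Path:
    """Convert one HTML file and write the result next to it as <name>.xlf.

    `convert` is expected to behave like convert_html: it takes the HTML
    text and the file name and returns the XLIFF document as bytes.
    """
    source = Path(html_path)
    html = source.read_text(encoding="utf-8")
    xliff = convert(html, html_path)
    # The converter already returns UTF-8 encoded bytes, so the data is
    # written in binary mode without encoding it a second time.
    target = source.with_suffix(".xlf")
    target.write_bytes(xliff)
    return target
```

With the package installed, a call would look like save_xliff("page.html", convert_html), which should leave page.xlf next to the source file.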

XLIFF => HTML

Command line:

xliff2html <myfile>.xlf

This command creates a <myfile>_<lang>.html file containing the translated content of the source HTML file, where <lang> is the language code of the target language.

API:

from xliff_converter.html_rebuilder import rebuild_html
...
with open(xliff_filename, 'r', encoding='utf-8') as fo:
    xliff = fo.read()
filename, html = rebuild_html(xliff)

The rebuild_html(...) function returns a named tuple containing the name of the translated HTML file and its contents as a str.
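
The returned pair can be used directly to write the translated file. The sketch below is a hypothetical helper (save_translated_html is not part of the package). It takes the rebuild function as an argument and assumes that the returned file name is a bare name without directories, so the output is placed next to the XLIFF file.

```python
from pathlib import Path
from typing import Callable, Tuple


def save_translated_html(
    xliff_path: str, rebuild: Callable[[str], Tuple[str, str]]
) -> Path:
    """Rebuild the translated HTML and write it next to the XLIFF file.

    `rebuild` is expected to behave like rebuild_html: it takes the XLIFF
    text and returns a (file name, HTML contents) pair.
    """
    source = Path(xliff_path)
    xliff = source.read_text(encoding="utf-8")
    filename, html = rebuild(xliff)
    # Keep only the final path component of the returned name, so the
    # file always lands in the same directory as the XLIFF document.
    target = source.parent / Path(filename).name
    target.write_text(html, encoding="utf-8")
    return target
```

With the package installed, a call would look like save_translated_html("page.xlf", rebuild_html).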

Notes

  • Currently, Python XLIFF Converter supports only English as the source language.

  • Translatable text is segmented into sentences using the NLTK sentence tokenizer.

  • The HTML converter accepts partial HTML markup, e.g. the contents of a <body> tag, and even plain text.

  • <br> tags are treated as translation segment delimiters.

  • <pre><code>...</code></pre> blocks are ignored.
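
To illustrate the segmentation rules above, here is a simplified sketch of how a fragment could be split into segments. It is not the package's implementation: split_segments is a hypothetical function, and a naive regular expression stands in for the NLTK sentence tokenizer, so it will mis-split abbreviations such as "e.g." that NLTK would normally handle.

```python
import re
from typing import List

# <br>, <br/> and <br /> all act as hard segment delimiters.
BR_TAG = re.compile(r"<br\s*/?>", re.IGNORECASE)
# Naive stand-in for a sentence tokenizer: break at whitespace that
# follows a period, exclamation mark, or question mark.
SENTENCE_END = re.compile(r"(?<=[.!?])\s+")


def split_segments(fragment: str) -> List[str]:
    """Split a text fragment into segments at <br> tags and sentence ends."""
    segments = []
    for chunk in BR_TAG.split(fragment):
        for sentence in SENTENCE_END.split(chunk.strip()):
            if sentence:  # skip empty pieces, e.g. after a trailing <br>
                segments.append(sentence)
    return segments
```

Each returned string corresponds to one translation segment in this simplified model.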

To do

  • More file formats.

  • XLIFF 2.0 support.
