Skip to main content

Converts TMX files to text files

Project description

GitHub Actions status PyPI version PyPI downloads

Introduction

tmx-to-text allows convert TMX files into plain text.

This tool be used for example to:

  • Extract translations memories into text file for spell checking or post-editing
  • Extract corpuses into text files for traning machine learning similar

The following command will extract the Catalan and Italian texts out of the TMX file:

tmx-to-text.py -f ca-it.tmx -s ca -t it

Running the application with -h shows the options avaiable

Converts TMX into two text files.
Use -h for more information.
Usage: tmx-to-text [options]

Options:
  -h, --help            show this help message and exit
  -f TMX_FILE, --tmx-file=TMX_FILE
                        tmx File to convert to Text
  -s SOURCE_LANGUAGE, --source_lang=SOURCE_LANGUAGE
                        Source language to export
  -t TARGET_LANGUAGE, --target_lang=TARGET_LANGUAGE
                        Target language to export
  -p PREFIX, --prefix=PREFIX
                        Filename prefix used in the generated text files
  -d, --debug           Debug memory and execution time

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tmx-to-text-0.1.3.tar.gz (3.9 kB view hashes)

Uploaded Source

Built Distribution

tmx_to_text-0.1.3-py3-none-any.whl (4.8 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page