TEA - Translation Engine Architect
Project description
TEA - Translation Engine Architect
A command line tool to create translation engine.
Install
First install pipx then:
pipx install pangeamt-tea
Usage
Step 1: Create a new project
tea new --customer customer --srcLang es --tgtLang en --flavor automotion --version 0.0.1
This command will create the project directory structure:
├── customer_es_en_automotion_0.0.1
│ ├── config.yml
│ └── data
Then enter in the directory
cd customer_es_en_automotion_0.0.1
Step 2: Configuration
Tokenizer
A tokenizer can be applied to source and target
tea tokenizer --src mecab --tgt moses
To list all available tokenizer:
tea tokenizer --list
Truecaser
tea truecaser --src --tgt
BPE
tea bpe -s -t
data['processors'], data['tokenizer'], data['truecaser'], data['bpe'], data['trainer'])
Step 3:
Copy some multilingual ressources (.tmx, bilingual files, .af ) into the 'data' directory
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
pangeamt-tea-0.2.1.tar.gz
(15.0 kB
view hashes)
Built Distribution
Close
Hashes for pangeamt_tea-0.2.1-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 4e2f9e3c7556fc4f3c9dc11645923ae051a07d11cdeec9c196bccd00a5f26311 |
|
MD5 | e41e5da5d3f57e4de2b3ebe194b8c9f1 |
|
BLAKE2b-256 | b5c0f91d0aa5ed7d99bac4cdd2b4e195dc73b29f5b7ff6b42a36ef38649d6f71 |