TEA - Translation Engine Architect
Project description
TEA - Translation Engine Architect
A command line tool to create translation engine.
Install
First install pipx then:
pipx install pangeamt-tea
Usage
Step 1: Create a new project
tea new --customer customer --srcLang es --tgtLang en --flavor automotion --version 0.0.1
This command will create the project directory structure:
├── customer_es_en_automotion_0.0.1
│ ├── config.yml
│ └── data
Then enter in the directory
cd customer_es_en_automotion_0.0.1
Step 2: Configuration
Tokenizer
A tokenizer can be applied to source and target
tea tokenizer --src mecab --tgt moses
To list all available tokenizer:
tea tokenizer --list
Truecaser
tea truecaser --src --tgt
BPE
tea bpe -s -t
Processors
tea config processors -s "{processors}"
being processors a list of preprocesses and postprocesses.
Step 3:
Copy some multilingual ressources (.tmx, bilingual files, .af ) into the 'data' directory
Step 4: Run
Clean the data passing the normalizers and validators:
tea workflow clean -n {clean_th} -d
being clean_th the number of threads.
Preprocess the data (split data in train, dev or test, tokenization, BPE):
tea workflow prepare -n {prepare_th} -s 3
being prepare_th the number of threads.
Training model
tea workflow train --gpu 0
Evaluate model
tea workflow eval --step {step} --src file.src --ref file.tgt --log file.log --out file.out --gpu 0
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file pangeamt-tea-0.2.22.tar.gz
.
File metadata
- Download URL: pangeamt-tea-0.2.22.tar.gz
- Upload date:
- Size: 18.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/49.6.0.post20200814 requests-toolbelt/0.9.1 tqdm/4.50.0 CPython/3.8.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | ff79658d62a9a6e3e86d678a55862c4b0aae9958bad50ceefe5ed2fb94edb063 |
|
MD5 | 72e8e6fd5c69d8e38fd232d59ea2481b |
|
BLAKE2b-256 | b468be13bf63fb14413a9e2784431e568b375c870809f55a8717c442d76030dd |
File details
Details for the file pangeamt_tea-0.2.22-py3-none-any.whl
.
File metadata
- Download URL: pangeamt_tea-0.2.22-py3-none-any.whl
- Upload date:
- Size: 24.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/49.6.0.post20200814 requests-toolbelt/0.9.1 tqdm/4.50.0 CPython/3.8.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | f91de41288d9681acf26d9f4587c1add82daab9e71d299bd1ef449e8c4f218ab |
|
MD5 | e625fc8394b93088f065c590cee7a612 |
|
BLAKE2b-256 | 36f787a06af27dbe4feb96252d0e1518970023c571a43c4dd3b77a3f3c667202 |