AI based translation from one language to many
Project description
What is multimodal translation?
Simply put, it’s translating content across various types of media.
Why is multimodality important?
Types of multimodal translation:
Text-to-text: This is the simplest form where you can translate text from one language to another language.
Audio-to-text: Here the audio is transcribed and then translated also into several languages.
Audio-to-audio: May be implemented in the future. It’s the same concept as audio to text but the output remains in audio format.
Technology used:
Speech recognition: Important to recognize spoken language for interpretation and translation. Output can then be in text or audio format.
Limitations:
language support: Hard to support all languages, since every language has its own modal that has to be trained and installed into the application.
Maintaining context: The context may change across different media. So it’s a must to ensure the context remains correct.
Improvements:
As mentioned above, audio to audio will be implemented in the future. Other media types can also be implemented like videos and images.
References:
Technical Debt
Change Log
Developer Guide
Quickstart
License
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file multimodal_translation-1.0.0.tar.gz.
File metadata
- Download URL: multimodal_translation-1.0.0.tar.gz
- Upload date:
- Size: 32.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
3cde8a40d99f8ca07f8281676d325ad9f0f21c95edf5ccf91e487d76646c9adf
|
|
| MD5 |
d4704360a647a3e67ccf6f9a41450adc
|
|
| BLAKE2b-256 |
d2b818a996c0cbb490b105d38a26a354ca734efd1554242e73eca7ef8c32dfa7
|
Provenance
The following attestation bundles were made for multimodal_translation-1.0.0.tar.gz:
Publisher:
release_prod.yaml on Issamricin/multimodal-translation
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
multimodal_translation-1.0.0.tar.gz -
Subject digest:
3cde8a40d99f8ca07f8281676d325ad9f0f21c95edf5ccf91e487d76646c9adf - Sigstore transparency entry: 632584940
- Sigstore integration time:
-
Permalink:
Issamricin/multimodal-translation@9ec4f95c88bf8bb6242a65f7702612a1bc9ba965 -
Branch / Tag:
refs/tags/release-1.0.0 - Owner: https://github.com/Issamricin
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release_prod.yaml@9ec4f95c88bf8bb6242a65f7702612a1bc9ba965 -
Trigger Event:
push
-
Statement type:
File details
Details for the file multimodal_translation-1.0.0-py3-none-any.whl.
File metadata
- Download URL: multimodal_translation-1.0.0-py3-none-any.whl
- Upload date:
- Size: 25.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
4a757f0eed6df36fd80d1655bc70f2e774fc0b31a24e7b73521a1e5ec5b93a12
|
|
| MD5 |
554b59bbf68a322bd7e63b3021db25fc
|
|
| BLAKE2b-256 |
9755904f25c06d0f9e84a52b003d118fd0034db6e91200943bcac22228c3824c
|
Provenance
The following attestation bundles were made for multimodal_translation-1.0.0-py3-none-any.whl:
Publisher:
release_prod.yaml on Issamricin/multimodal-translation
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
multimodal_translation-1.0.0-py3-none-any.whl -
Subject digest:
4a757f0eed6df36fd80d1655bc70f2e774fc0b31a24e7b73521a1e5ec5b93a12 - Sigstore transparency entry: 632584946
- Sigstore integration time:
-
Permalink:
Issamricin/multimodal-translation@9ec4f95c88bf8bb6242a65f7702612a1bc9ba965 -
Branch / Tag:
refs/tags/release-1.0.0 - Owner: https://github.com/Issamricin
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release_prod.yaml@9ec4f95c88bf8bb6242a65f7702612a1bc9ba965 -
Trigger Event:
push
-
Statement type: