Python package for Indian language translation using AI4Bharat's IndicTrans2 API
Project description
anuvaad-rev
A Python package for Indian language translation using AI4Bharat's IndicTrans2 API service. This package provides an easy-to-use interface for translating text between English and various Indian languages, with automatic language detection.
Acknowledgment
This package uses the IndicTrans2 translation service provided by AI4Bharat. All translation capabilities are powered by AI4Bharat's API. Visit AI4Bharat's website for more information about their services.
Features
- Support for 23 Indian languages
- Automatic language detection
- Language information utilities
- User agent rotation to avoid rate limits
- Automatic session refresh
- Retry mechanism for failed requests
- Error handling and input validation
- Easy to integrate into existing projects
Installation
You can install the package using pip:
pip install anuvaad-rev
Or install from source:
git clone https://github.com/yourusername/anuvaad-rev
cd anuvaad-rev
pip install -e .
Quick Start
from anuvaad_rev import IndicTranslator
# Create translator instance
translator = IndicTranslator()
# Auto-detect language and translate
text = "नमस्ते, कैसे हैं आप?"
translation = translator.translate(text, target_lang="en")
print(translation) # Output: Hello, how are you?
# Get language information
languages = translator.get_supported_languages()
print(languages)
Language Detection and Information
# Detect language
text = "வணக்கம், எப்படி இருக்கிறீர்கள்?"
detected = translator.detect_language(text)
print(detected) # Output: ta
# Get language name
lang_name = translator.get_language_name("ta")
print(lang_name) # Output: Tamil
# Get confidence scores
confidence = translator.detect_language_confidence(text)
for lang, score in confidence:
print(f"{lang}: {score:.2%}")
# Get language code from name
code = translator.get_language_code("Hindi")
print(code) # Output: hi
# List all supported languages
codes = translator.get_supported_language_codes()
names = translator.get_supported_language_names()
Advanced Translation
# Create translator with custom settings
translator = IndicTranslator(
max_retries=3, # Number of retries for failed requests
session_refresh_interval=3600 # Refresh session every hour
)
# Batch translation with auto-detection
texts = [
"Hello world!",
"नमस्ते दुनिया!",
"வணக்கம் உலகம்!"
]
for text in texts:
# Source language will be auto-detected
translation = translator.translate(text, target_lang="hi")
print(f"Original: {text}")
print(f"Hindi: {translation}\n")
Supported Languages
| Language | Code | Language | Code |
|---|---|---|---|
| Assamese | as | Malayalam | ml |
| Bengali | bn | Manipuri | mni |
| Bodo | brx | Marathi | mr |
| Dogri | doi | Nepali | ne |
| English | en | Odia | or |
| Gujarati | gu | Punjabi | pa |
| Hindi | hi | Sanskrit | sa |
| Kannada | kn | Santali | sat |
| Kashmiri | ks | Sindhi | sd |
| Konkani | kok | Tamil | ta |
| Maithili | mai | Telugu | te |
| Urdu | ur |
API Reference
IndicTranslator
class IndicTranslator:
def translate(self, text, target_lang, source_lang=None):
"""
Translate text with optional auto-detection
Args:
text: Text to translate
target_lang: Target language code
source_lang: Source language code (if None, will be auto-detected)
Returns:
Translated text if successful, None otherwise
"""
def detect_language(self, text):
"""
Detect the language of input text
Returns:
Detected language code
"""
def detect_language_confidence(self, text):
"""
Detect language with confidence scores
Returns:
List of (language_code, confidence_score) tuples
"""
def get_language_name(self, lang_code):
"""Get language name from code"""
def get_language_code(self, language_name):
"""Get language code from name"""
def get_supported_languages(self):
"""Get dictionary of supported languages"""
def get_supported_language_codes(self):
"""Get list of supported language codes"""
def get_supported_language_names(self):
"""Get list of supported language names"""
Examples
See the examples directory for more detailed examples:
basic_translation.py: Basic translation usagebatch_translation.py: Batch translation with rate limit handlinglanguage_detection.py: Language detection and information features
Contributing
Contributions are welcome! Please feel free to submit pull requests.
License
This package is licensed under the MIT License - see the LICENSE file for details.
Disclaimer
This is an unofficial package that uses AI4Bharat's IndicTrans2 API service. Users must ensure their usage complies with AI4Bharat's terms of service and guidelines. This package is not affiliated with or endorsed by AI4Bharat.
Credits
Translation service provided by AI4Bharat.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file anuvaad_rev-0.1.1.tar.gz.
File metadata
- Download URL: anuvaad_rev-0.1.1.tar.gz
- Upload date:
- Size: 10.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.11.4
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
033bb656c6e6ad0472e38c2a31b14beecf4ee7b2bafdd502eadfdca9b6eb66e8
|
|
| MD5 |
1d7d642a684d6edf5a7fb5e8e1446bad
|
|
| BLAKE2b-256 |
73af8ceb58b5eafbb96b0426034ce9d34d970f14e97a3a3e17afdb9d23020a9b
|
File details
Details for the file anuvaad_rev-0.1.1-py3-none-any.whl.
File metadata
- Download URL: anuvaad_rev-0.1.1-py3-none-any.whl
- Upload date:
- Size: 8.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.11.4
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
0af32a7ab884efdee1f42d940d421a36f3b3ceb65b7ef8c43c21bd1452f4b12d
|
|
| MD5 |
ede906671be13ab5fc9c64ed1813a556
|
|
| BLAKE2b-256 |
5aec748e90ebdde4536fd93ebe88c1d2f9bad779a093bd55b868f991d2cb3bf6
|