Skip to main content

A transliteration service for Middle-Eastern languages.

Project description

The MEHDIE Transliteration Service and Python Package

This repository contains the source code for the MEHDIE Transliteration Service and Python package. The service is a RESTful API that can be used to transliterate names between Hebrew, Arabic and Latin characters. The Python package provides a Python interface to the service.

The service was developed as part of the MEHDIE project- https://mehdie.org/.

MEHDIE is funded by the Israel Ministry of Science and Technology MOST.

Installation

You can use the Dockerfile and cloudbuild yaml file to deploy to a cloud run service or you can use the python package to use the service in your own code.

Usage

Python Package - Transliteration

import unittest
from translit_me.transliterator import transliterate as tr
from translit_me.lang_tables import *

class TestTransliterate(unittest.TestCase):
    def test_hebrew_arabic(self):
        names = ['נועַם', "מאנץ'", "בישינה", "דימונה"]
        expected = ['نوعَم', 'مانض', 'بيشينة', 'بيسينة', 'ديمونة', 'ضيمونة']
        res = tr(names, HE_AR)
        print(res)
        self.assertListEqual(res, expected)

More examples can be found in the tests folder.

RESTful API

The service is a RESTful API that can be used to transliterate names between Hebrew, Arabic and Latin characters.

import requests

def transliterate_service(to_transliterate: list,from_lang: str,to_lang: str):
  """
  This method invokes a cloud run service to transliterate a list of strings
  (e.g., ['نوعم', 'مانض', 'پيشينة'])
  from the from_lang (e.g., 'ar') to the to_lang (e.g., 'en').
  Supported languages: ('he','ar','en'). Anything non 'he'/'ar' will be treated
  as 'en'
  """
  url = 'https://hebrew-transliteration-service-snlwejaxvq-ez.a.run.app/'
  args = {'from_lang': from_lang, 'to_lang': to_lang, 'data': to_transliterate}
  x = requests.post(url, json=args)
  res_list = x.json()['transliterations']
  return res_list

names = ["תִפְלִיס","תַרְג'","תַרוּג'ה"]
from_language = 'he'
to_language = 'ar'

transliterate_service(names, from_language, to_language)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

translit_me-1.0.tar.gz (8.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

translit_me-1.0-py3-none-any.whl (9.3 kB view details)

Uploaded Python 3

File details

Details for the file translit_me-1.0.tar.gz.

File metadata

  • Download URL: translit_me-1.0.tar.gz
  • Upload date:
  • Size: 8.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.9.7

File hashes

Hashes for translit_me-1.0.tar.gz
Algorithm Hash digest
SHA256 a00e38d1363e8b466d9e3cbb20ca4464bcb7643c3e4c6afa3902a73c3fbda4b6
MD5 fa29c291b5f933c55d9704df8f7c7166
BLAKE2b-256 e2f3ad53ce96025e7f38123b95bca5a37ce2fbc31fa3363bd134da0c298086ad

See more details on using hashes here.

File details

Details for the file translit_me-1.0-py3-none-any.whl.

File metadata

  • Download URL: translit_me-1.0-py3-none-any.whl
  • Upload date:
  • Size: 9.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.9.7

File hashes

Hashes for translit_me-1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 84f7358ebb15a9255cd2906b9fce21d7ce93f61deb229a18db6108f5b217eec2
MD5 4b70eb23f0e2210fb005338a5134299f
BLAKE2b-256 a8bd0fbb87913c63361a46dfd5309223c10b3d17b04efc038a5964764a176c94

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page