yiddish

A Python library for processing Yiddish text

Isaac L. Bleaman (bleaman@berkeley.edu) and contributors (see commit history)

What is this for?

This library includes functions to carry out common tasks when dealing with Yiddish text. For example, you might wish to replace precombined Unicode characters (such as אַ, U+FB2E) with their decomposed versions (אַ, which is U+05D0 followed by U+05B7). Or you might wish to transliterate YIVO Yiddish text (איבער to iber) or render it in the orthography used more commonly in the Hasidic community (שנײיִק to שנייאיג).
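The relationship between the precombined and decomposed forms mentioned above can be checked with Python's standard library alone. The following is a minimal sketch using `unicodedata`, not the library's own implementation; the variable names are illustrative.

```python
import unicodedata

# U+FB2E is the precombined character HEBREW LETTER ALEF WITH PATAH.
precombined = "\ufb2e"

# Canonical decomposition (NFD) splits it into the base letter plus the vowel point.
decomposed = unicodedata.normalize("NFD", precombined)
codepoints = [f"U+{ord(ch):04X}" for ch in decomposed]
print(codepoints)

# NFC does not rebuild U+FB2E: the Hebrew presentation forms are excluded
# from canonical composition, so the text stays decomposed.
recomposed = unicodedata.normalize("NFC", decomposed)
print(recomposed == decomposed)
```

Because standard normalization only goes in one direction for these characters, converting decomposed text back to precombined forms needs an explicit mapping of some kind.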

See the source file, yiddish.py, for the full list of supported functions.

How to install

pip install yiddish

How to cite

If you'd like to cite yiddish in a publication, you can include a link to the source: https://github.com/ibleaman/yiddish

Example

import yiddish

output = ''

string = 'אונדזער גאַנצע משפּחה װױנט אין די פֿאַראײניקטע שטאַטן.'

output += yiddish.replace_with_precombined(string) + '\n'
output += yiddish.respell_loshn_koydesh(string) + '\n'
output += yiddish.strip_diacritics(string) + '\n'
output += yiddish.transliterate(string) + '\n'
output += yiddish.transliterate(string, loshn_koydesh=True) + '\n'
output += yiddish.hasidify(string)

output += '\n\n'

string_two = 'shloymele hot khasene gehat mit rokhls tokhter leye.'

output += yiddish.detransliterate(string_two) + '\n'
output += yiddish.detransliterate(string_two, loshn_koydesh=True)

print(output)

Output:

אונדזער גאַנצע משפּחה װױנט אין די פֿאַראײניקטע שטאַטן.
אונדזער גאַנצע מישפּאָכע װױנט אין די פֿאַראײניקטע שטאַטן.
אונדזער גאנצע משפחה וווינט אין די פאראייניקטע שטאטן.
undzer gantse mshpkhh voynt in di fareynikte shtatn.
undzer gantse mishpokhe voynt in di fareynikte shtatn.
אונזער גאנצע משפחה וואוינט אין די פאראייניקטע שטאטן.

שלױמעלע האָט כאַסענע געהאַט מיט ראָכלס טאָכטער לײע.
שלמהלע האָט חתונה געהאַט מיט רחלס טאָכטער לאה.
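The fourth and fifth output lines differ only in the word משפּחה: it comes out letter by letter as mshpkhh by default and as mishpokhe with loshn_koydesh=True. One way to picture that difference is a whole-word lookup that runs before the letter-by-letter mapping. The sketch below is a toy illustration under that assumption; `LEXICON`, `LETTERS`, and `toy_transliterate` are hypothetical names and are not taken from yiddish.py, whose actual approach may differ.

```python
# Hypothetical one-entry lexicon of traditionally spelled words.
WORD = "\u05de\u05e9\u05e4\u05bc\u05d7\u05d4"  # mem, shin, pe + dagesh, khes, hey
LEXICON = {WORD: "mishpokhe"}

# Hypothetical letter table covering only the letters in WORD.
LETTERS = {
    "\u05de": "m",
    "\u05e9": "sh",
    "\u05e4\u05bc": "p",  # pe followed by a combining dagesh
    "\u05d7": "kh",
    "\u05d4": "h",
}

def toy_transliterate(word, use_lexicon=False):
    """Romanize one word, optionally consulting the whole-word lexicon first."""
    if use_lexicon and word in LEXICON:
        return LEXICON[word]
    out = []
    i = 0
    while i < len(word):
        pair = word[i:i + 2]
        if len(pair) == 2 and pair in LETTERS:
            # Letter plus combining mark treated as one unit.
            out.append(LETTERS[pair])
            i += 2
        else:
            # Single letter; unknown characters pass through unchanged.
            out.append(LETTERS.get(word[i], word[i]))
            i += 1
    return "".join(out)

print(toy_transliterate(WORD))
print(toy_transliterate(WORD, use_lexicon=True))
```

The toy version handles only this one word; it is meant to show why a letter table alone cannot recover the vowels of words whose spelling does not write them.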
