Python API for Romanian diacritics restoration

Project description

RO Diacritics module

RO Diacritics is a straightforward diacritics restoration module for the Romanian language. To restore a single string:

from ro_diacritics import restore_diacritics
print(restore_diacritics("fara poezie, viata e pustiu"))

It can also restore diacritics across a column of a pandas DataFrame:

from ro_diacritics import restore_diacritics
df['text-diacritice'] = df['text'].apply(restore_diacritics)
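
One way to gauge how well restoration works on your own text is a round trip: strip the diacritics from text known to be correct, restore them, and compare against the original. The sketch below is only an illustration under stated assumptions: `strip_diacritics` and `char_accuracy` are hypothetical helper names, not part of ro_diacritics, and the comparison assumes the restorer returns text of the same length as its input.

```python
import unicodedata
from typing import Callable


def strip_diacritics(text: str) -> str:
    """Drop combining marks, e.g. 'viața' -> 'viata' (hypothetical helper)."""
    decomposed = unicodedata.normalize("NFD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def char_accuracy(reference: str, restore: Callable[[str], str]) -> float:
    """Fraction of characters matching the reference after strip + restore.

    `restore` is any str -> str function; the assumption here is that it
    changes only diacritics and therefore preserves the text length.
    """
    reference = unicodedata.normalize("NFC", reference)
    restored = unicodedata.normalize("NFC", restore(strip_diacritics(reference)))
    if len(restored) != len(reference):
        raise ValueError("restored text length differs from the reference")
    if not reference:
        return 1.0
    matches = sum(a == b for a, b in zip(reference, restored))
    return matches / len(reference)
```

Passing `restore_diacritics` as the `restore` argument would score the module on a sample of your choosing; an identity function such as `lambda s: s` gives the baseline for text left unrestored.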

Installing

$ python -m pip install ro-diacritics

or

$ pip install ro-diacritics
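
To confirm that the install succeeded, the Python standard library can report the installed version. This is a minimal sketch: `installed_version` is a hypothetical helper name, and the distribution name is taken from the pip command above.

```python
from importlib import metadata
from typing import Optional


def installed_version(distribution: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if absent."""
    try:
        return metadata.version(distribution)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    # Prints a version string when the package is installed, otherwise None.
    print(installed_version("ro-diacritics"))
```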

Requirements

  • torch and torchtext
  • numpy
  • nltk and scikit-learn (for training)

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ro-diacritics-0.9.1.tar.gz (11.8 kB)

Uploaded Source

File details

Details for the file ro-diacritics-0.9.1.tar.gz.

File metadata

  • Download URL: ro-diacritics-0.9.1.tar.gz
  • Upload date:
  • Size: 11.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.0 CPython/3.8.6

File hashes

Hashes for ro-diacritics-0.9.1.tar.gz
Algorithm Hash digest
SHA256 461b014fbbbf612751662e1346b5db369dfe006750a348c7744cfba63becb9ef
MD5 63720445ce72fe8d0dec74865fc30676
BLAKE2b-256 ce1dd0f2a25b5501fda50a370b9b517fab457a1ba540ebf4b3cf5fc292ded12b
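
A downloaded archive can be checked against the SHA256 digest listed above before installing it. The following is a sketch using only the standard library; `sha256_of` and `matches_expected` are hypothetical helper names, and the file path passed in is whatever location you saved the archive to.

```python
import hashlib

# SHA256 digest listed above for ro-diacritics-0.9.1.tar.gz.
EXPECTED_SHA256 = "461b014fbbbf612751662e1346b5db369dfe006750a348c7744cfba63becb9ef"


def sha256_of(path: str, chunk_size: int = 65536) -> str:
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path: str, expected: str = EXPECTED_SHA256) -> bool:
    """Return True when the file's digest equals the expected digest."""
    return sha256_of(path) == expected.lower()
```

Calling `matches_expected("ro-diacritics-0.9.1.tar.gz")` on the downloaded file should return `True` if the archive is intact.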
