Skip to main content

Python API for Romanian diacritics restoration

Project description

RO Diacritics module

RO Diacritics is a straightforward diacritics restoration module for Romanian Language

from ro_diacritics import restore_diacritics
print(restore_diacritics("fara poezie, viata e pustiu"))

or correcting a pandas dataframe:

from ro_diacritics import restore_diacritics
df['text-diacritice'] = df['text'].apply(restore_diacritics)

Installing

$ python -m pip install ro-diacritics

or

$ pip install ro-diacritics

Requirements

  • torch and torchtext
  • numpy
  • nltk and scikit-learn (for training)

References

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ro-diacritics-0.9.4.1.tar.gz (12.4 kB view details)

Uploaded Source

File details

Details for the file ro-diacritics-0.9.4.1.tar.gz.

File metadata

  • Download URL: ro-diacritics-0.9.4.1.tar.gz
  • Upload date:
  • Size: 12.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.16

File hashes

Hashes for ro-diacritics-0.9.4.1.tar.gz
Algorithm Hash digest
SHA256 fae682cb7c73b5e699551c239c4a9d263b539d539807fe2b9969e21c8473773a
MD5 781fbe6a688f83ad50d85b13187360c9
BLAKE2b-256 252b3a4e554a837f720af810f8a3aced48ed0dc39a110767d6231afbefdb329d

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page