A Python package to extract Hindi characters.
A command-line tool for pre-processing and cleaning Hindi datasets. Its capabilities include:
- pre-processing a given file so that only Hindi characters remain
- splitting paragraphs into sentences
- removing punctuation from the dataset (if required)
extract -l -p <y/yes (to keep punctuation)>
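The three steps listed above can be illustrated with a short, standalone Python sketch. This is not the package's own code: the function names `split_sentences` and `clean`, the `keep_punctuation` parameter, and the choice of the Devanagari Unicode block (U+0900 to U+097F) as the definition of "Hindi characters" are all assumptions made for illustration.

```python
import re

# Assumption: "Hindi characters" means the Devanagari Unicode block (U+0900-U+097F).
# The danda (U+0964) and double danda (U+0965) sentence marks live inside that block.
SENTENCE_END = re.compile(r"[\u0964\u0965?!]+")
NON_HINDI = re.compile(r"[^\u0900-\u097F\s]")
NON_HINDI_KEEP_PUNCT = re.compile(r"[^\u0900-\u097F\s.,?!]")
DANDAS = re.compile(r"[\u0964\u0965]")


def split_sentences(paragraph):
    """Split a paragraph on danda, double danda, '?' and '!' (hypothetical helper)."""
    parts = SENTENCE_END.split(paragraph)
    return [p.strip() for p in parts if p.strip()]


def clean(text, keep_punctuation=False):
    """Drop everything except Devanagari characters and whitespace.

    With keep_punctuation=True, the danda marks and a small set of ASCII
    punctuation ('.', ',', '?', '!') are kept as well.
    """
    if keep_punctuation:
        text = NON_HINDI_KEEP_PUNCT.sub("", text)
    else:
        text = NON_HINDI.sub("", text)
        text = DANDAS.sub("", text)
    # Collapse the whitespace runs left behind by removed characters.
    return re.sub(r"\s+", " ", text).strip()


if __name__ == "__main__":
    raw = "Hello नमस्ते world! यह 123 है।"
    print(clean(raw))
    print(clean(raw, keep_punctuation=True))
    print(split_sentences("यह पहला वाक्य है। यह दूसरा है।"))
```

In this sketch, sentence splitting should run before punctuation removal, because the splitter relies on the danda to find sentence boundaries.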