A Python package to extract Hindi characters.
Project description
About
A command-line tool for pre-processing and cleaning Hindi datasets. This package can:
- reduce a given file to Hindi characters only
- split paragraphs into sentences
- remove punctuation from the dataset (if required)
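The three steps above can be illustrated with a minimal sketch. This is not the package's own implementation: the function names `split_sentences` and `keep_hindi`, the `keep_punctuation` parameter, and the exact punctuation set are assumptions made for illustration. It treats "Hindi characters" as the Devanagari Unicode block (U+0900 to U+097F) and assumes sentences end with a danda, double danda, `?` or `!` followed by whitespace.

```python
import re

# Assumption: "Hindi characters" means the Devanagari block, excluding the
# danda (U+0964) and double danda (U+0965), which are handled as punctuation.
DEVANAGARI = "\u0900-\u0963\u0966-\u097F"
# Assumption: an illustrative punctuation set, not the package's actual list.
PUNCTUATION = "\u0964\u0965,;:?!\\-"


def split_sentences(paragraph):
    """Split a paragraph after a sentence terminator followed by whitespace."""
    parts = re.split(r"(?<=[\u0964\u0965?!])\s+", paragraph)
    return [p.strip() for p in parts if p.strip()]


def keep_hindi(text, keep_punctuation=False):
    """Replace every character outside the allowed set with a space,
    then collapse runs of whitespace."""
    allowed = DEVANAGARI + (PUNCTUATION if keep_punctuation else "")
    cleaned = re.sub(f"[^{allowed}\\s]", " ", text)
    return " ".join(cleaned.split())


def clean_paragraph(paragraph, keep_punctuation=False):
    """Split first (terminators are still present), then clean each sentence."""
    sentences = split_sentences(paragraph)
    cleaned = [keep_hindi(s, keep_punctuation) for s in sentences]
    return [s for s in cleaned if s]
```

Splitting before cleaning matters here: once punctuation is stripped, the sentence boundaries are no longer recoverable.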
Usage
extract -l -p <y/yes (to keep punctuation)>
Download files
Source Distribution
textcleaner_hi-1.0.0.tar.gz (2.5 kB)
Built Distribution
Hashes for textcleaner_hi-1.0.0-py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | b9cff503e0ce7c43dfa58936c2d04ad9b9c054f3522b53ee20f1582afa0709fd
MD5 | a179ab56f8c29a92875372f7d06497e4
BLAKE2b-256 | f826c5d07ae4acbd936225dcd39fad6da92f3823b914633f23f44b3d2eb511b5
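
A downloaded wheel can be checked against the SHA256 digest listed above. The following is a small sketch using only Python's standard `hashlib` module. The helper names `sha256_of` and `matches_published_digest` are hypothetical, and the file is assumed to be in the current working directory under its published name.

```python
import hashlib

# SHA256 digest published above for textcleaner_hi-1.0.0-py3-none-any.whl
EXPECTED_SHA256 = "b9cff503e0ce7c43dfa58936c2d04ad9b9c054f3522b53ee20f1582afa0709fd"


def sha256_of(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_digest(path, expected=EXPECTED_SHA256):
    """True if the file's SHA256 digest equals the expected hex string."""
    return sha256_of(path) == expected.lower()
```

Calling `matches_published_digest("textcleaner_hi-1.0.0-py3-none-any.whl")` should return `True` for an unmodified download.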