Kadot, unsupervised natural language processing.
Project description
Unsupervised natural language processing library.
Kadot just lets you process a text easily.
>>> hello_world = Text("Kadot just lets you process a text easily.")
>>> hello_world.ngrams(n=2)
[('Kadot', 'just'), ('just', 'lets'), ('lets', 'you'), ('you', 'process'), ('process', 'a'), ('a', 'text'), ('text', 'easily')]
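To make the bigram output above concrete, here is a minimal pure-Python sketch of n-gram extraction. It is not Kadot's implementation: the function name `simple_ngrams` and the regex tokenizer (runs of word characters, punctuation dropped) are assumptions chosen only because they reproduce the result shown for this sentence.

```python
import re


def simple_ngrams(text, n=2):
    """Return the n-grams of `text` as a list of tuples of word tokens."""
    # Hypothetical tokenizer: keep runs of word characters, drop punctuation.
    tokens = re.findall(r"\w+", text)
    # Slide a window of width n over the token list.
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


print(simple_ngrams("Kadot just lets you process a text easily.", n=2))
```

Nothing in the sketch depends on a particular language's grammar; it only needs a way to split text into tokens.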
🔋 What’s included?
Kadot includes tokenizers, text generators, classifiers, word-level and document-level vectorizers, as well as a spell checker, a fuzzy string matching utility and a stopword detector.
The philosophy of Kadot is “never hardcode the language rules”: use unsupervised solutions to support most languages. It will therefore never include Treebank-based algorithms (such as a POS tagger).
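As an illustration of what "unsupervised" can mean in practice, the sketch below guesses stopword candidates purely from how many documents each word appears in, with no language-specific word list. This is one simple heuristic written for illustration, not a description of how Kadot's stopword detector works; the function name `frequent_words` and the `top_k` parameter are hypothetical.

```python
import re
from collections import Counter


def frequent_words(documents, top_k=2):
    """Return the `top_k` words that occur in the most documents."""
    document_frequency = Counter()
    for doc in documents:
        # Count each word at most once per document (document frequency).
        document_frequency.update(set(re.findall(r"\w+", doc.lower())))
    return [word for word, _ in document_frequency.most_common(top_k)]


corpus = [
    "the cat sat on the mat",
    "the dog ate the bone",
    "a bird saw the cat",
]
print(frequent_words(corpus, top_k=2))
```

Because the heuristic relies only on counts, the same code can be pointed at a corpus in another language without modification, which is the trade-off the philosophy above accepts in exchange for giving up grammar-aware tools.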
🔨 Installation
Use the “pip” command that refers to your Python 3.x interpreter. In my case:
$ pip3 install kadot
It currently requires Python’s standard library, NumPy, SciPy and scikit-learn.
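If an import fails after installation, a quick check of the third-party dependencies can help. The sketch below uses only the standard library; the helper name `missing_modules` is hypothetical, and `numpy`, `scipy` and `sklearn` are the usual import names of the three packages listed above.

```python
import importlib.util


def missing_modules(names):
    """Return the top-level module names that cannot be found by the interpreter."""
    return [name for name in names if importlib.util.find_spec(name) is None]


# An empty list means all three dependencies are importable.
print(missing_modules(["numpy", "scipy", "sklearn"]))
```

Run it with the same interpreter that “pip3” installs into, otherwise the result describes a different environment.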
⚖️ License
Kadot is released under the MIT license.
More
Check the GitHub repository: https://github.com/the-new-sky/Kadot
Download files
Source Distribution
File details
Details for the file Kadot-0.1.9.tar.gz.
File metadata
- Download URL: Kadot-0.1.9.tar.gz
- Upload date:
- Size: 7.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
File hashes
Algorithm | Hash digest
---|---
SHA256 | a80a985e21232ec74bb02b5b3f1b9c14dc131c2470ced13b0e0e8ea0be248e9f
MD5 | 3302bc91c84219516ada1b0d89a21f9d
BLAKE2b-256 | 5659841e54023bd2e94837c1ff43989f770b06dc2b843ef1ee644aed26496550
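
The published digests can be used to check a downloaded archive. The following is a minimal sketch using Python's standard `hashlib` module; the helper name `sha256_of_file` is hypothetical, and the commented usage assumes the archive was saved as `Kadot-0.1.9.tar.gz` in the current directory.

```python
import hashlib

# SHA256 digest listed in the table above.
EXPECTED_SHA256 = "a80a985e21232ec74bb02b5b3f1b9c14dc131c2470ced13b0e0e8ea0be248e9f"


def sha256_of_file(path, chunk_size=65536):
    """Return the hexadecimal SHA256 digest of the file at `path`."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read in chunks so large files do not need to fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


# Usage, assuming the archive is in the current directory:
# print(sha256_of_file("Kadot-0.1.9.tar.gz") == EXPECTED_SHA256)
```

A mismatch means the file differs from the one uploaded, for example because of a truncated download.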