Corpus library
For Developers
You can also see the Python, Java, C++, Swift, or C# repository.
Requirements
Python
To check if you have a compatible version of Python installed, use the following command:
python -V
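The same check can be done from inside the interpreter. The sketch below is only illustrative: the minimum version `(3, 6)` is an assumption, since this page does not state which versions are compatible, and `is_compatible` is a hypothetical helper name.

```python
import sys

# ASSUMPTION: the page does not state a minimum version; (3, 6) is a placeholder.
ASSUMED_MINIMUM = (3, 6)


def is_compatible(version_info=sys.version_info, minimum=ASSUMED_MINIMUM) -> bool:
    """Return True when the (major, minor) version is at least `minimum`."""
    return tuple(version_info[:2]) >= tuple(minimum)


if __name__ == "__main__":
    # Print the running interpreter version and the result of the check.
    print("Python", sys.version.split()[0], "compatible:", is_compatible())
```

Adjust `ASSUMED_MINIMUM` once you know the version the package actually requires.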
You can find the latest version of Python on the official Python website (python.org).
Git
Install the latest version of Git.
Pip Install
pip3 install NlpToolkit-Corpus-Cy
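To confirm that the installation succeeded, you can ask the standard library for the installed distribution's version. This is a minimal sketch using `importlib.metadata` (Python 3.8 or later); `installed_version` is a hypothetical helper name, not part of the package.

```python
from importlib import metadata


def installed_version(package_name: str):
    """Return the installed version string, or None if the package is absent."""
    try:
        return metadata.version(package_name)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    # Distribution name taken from the pip command above.
    print(installed_version("NlpToolkit-Corpus-Cy"))
```

A result of `None` means pip did not install the distribution into the environment that is running the script.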
Download Code
To work on the code, create a fork from the GitHub page, then use Git to clone your fork to your local machine. On Ubuntu, run:
git clone <your-fork-git-link>
A directory called Corpus-Cy will be created. Alternatively, to explore the code without forking, clone the original repository:
git clone https://github.com/olcaytaner/Corpus-Cy.git
Open project with PyCharm IDE
Steps for opening the cloned project:
- Start the IDE.
- Select File | Open from the main menu.
- Choose the Corpus-Cy folder.
- Select the open as project option.
- After a couple of seconds, the dependencies will be downloaded.
Detailed Description
Corpus
To store a corpus in memory
a = Corpus("derlem.txt")
If the text in the file is delimited by dots but not yet split into sentences, pass a sentence splitter to the constructor
Corpus(self, fileName=None, splitterOrChecker=None)
The number of sentences in the corpus
sentenceCount(self) -> int
To get the i-th sentence in the corpus
getSentence(self, index: int) -> Sentence
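The two methods above are enough to walk through a whole corpus. The sketch below relies only on `sentenceCount()` and `getSentence()`. It assumes that valid indices run from 0 to `sentenceCount() - 1`, which this page does not state. `iter_sentences` is a hypothetical helper, and `StandInCorpus` is a stand-in written for this example so that it runs without the package; it is not the library's `Corpus` class.

```python
def iter_sentences(corpus):
    """Yield every sentence using only sentenceCount() and getSentence().

    ASSUMPTION: indices run from 0 to sentenceCount() - 1.
    """
    for index in range(corpus.sentenceCount()):
        yield corpus.getSentence(index)


class StandInCorpus:
    """Hypothetical stand-in, NOT the library class; it stores plain strings."""

    def __init__(self, sentences):
        self._sentences = list(sentences)

    def sentenceCount(self) -> int:
        return len(self._sentences)

    def getSentence(self, index: int):
        return self._sentences[index]


demo = StandInCorpus(["Bu bir cümle.", "Bu da ikinci cümle."])

if __name__ == "__main__":
    for sentence in iter_sentences(demo):
        print(sentence)
```

Because the helper is duck-typed, an object created with `Corpus("derlem.txt")` could be passed in place of the stand-in, provided the indexing assumption holds.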
TurkishSplitter
The TurkishSplitter class is used to split text into sentences in accordance with the rules of Turkish.
split(self, line: str) -> list
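To illustrate what sentence splitting involves, here is a deliberately naive sketch. It is not the algorithm used by TurkishSplitter: it simply cuts after `.`, `!` or `?` when whitespace follows, and it returns plain strings, whereas the library's return type is only documented as `list`. `naive_split` is a hypothetical name.

```python
import re


def naive_split(line: str) -> list:
    """Split text after '.', '!' or '?' when followed by whitespace.

    Illustrative only: this ignores language-specific rules entirely.
    """
    parts = re.split(r"(?<=[.!?])\s+", line.strip())
    # Drop the empty string produced when the input is empty.
    return [part for part in parts if part]


if __name__ == "__main__":
    print(naive_split("Ali geldi. Ayşe gitti! Sen?"))
```

A rule this simple breaks on abbreviations: `naive_split("Dr. Ahmet geldi.")` cuts after `Dr.`, which is the kind of case a language-aware splitter has to handle.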
Download files
Source Distribution
Hashes for NlpToolkit-Corpus-Cy-1.0.5.tar.gz
Algorithm | Hash digest
---|---
SHA256 | ba030ed5559105d9eb7f427f79f09eb0aa8edb32bd7520070c079da4e5121344
MD5 | 0865d0a385c0c6c37954aa9eb640204f
BLAKE2b-256 | 454b4c75969306b6302dd7184097c3c872f1576a0a4dea6ed44b499b38daa917