Corpus library
Project description
Corpus
Video Lectures
For Developers
You can also see Python, Java, C++, Swift, Js, or C# repository.
Requirements
Python
To check if you have a compatible version of Python installed, use the following command:
python -V
You can find the latest version of Python here.
Git
Install the latest version of Git.
Pip Install
pip3 install NlpToolkit-Corpus-Cy
Download Code
In order to work on code, create a fork from GitHub page. Use Git for cloning the code to your local or below line for Ubuntu:
git clone <your-fork-git-link>
A directory called Corpus will be created. Or you can use below link for exploring the code:
git clone https://github.com/olcaytaner/Corpus-Cy.git
Open project with Pycharm IDE
Steps for opening the cloned project:
- Start IDE
- Select File | Open from main menu
- Choose
Corpus-Cy
file - Select open as project option
- Couple of seconds, dependencies will be downloaded.
Detailed Description
Corpus
To store a corpus in memory
a = Corpus("derlem.txt")
If this corpus is split with dots but not in sentences
Corpus(self, fileName=None, splitterOrChecker=None)
The number of sentences in the corpus
sentenceCount(self) -> int
To get ith sentence in the corpus
getSentence(self, index: int) -> Sentence
TurkishSplitter
TurkishSplitter class is used to split the text into sentences in accordance with the . rules of Turkish.
split(self, line: str) -> list
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Hashes for NlpToolkit-Corpus-Cy-1.0.21.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | b7df4339532e1b43e2084e7134cd341319eee7b9e22748b520a18d76e27eb6d9 |
|
MD5 | 5312d144d11c78359799c127d1d4534b |
|
BLAKE2b-256 | 32e61d50ec3bda4c2f0b35590ea86fd5c0609f12ac8daa9c64c2198f3fa32331 |