Corpus library
Project description
Corpus
Video Lectures
For Developers
You can also see Python, Java, C++, Swift, Js, or C# repository.
Requirements
Python
To check if you have a compatible version of Python installed, use the following command:
python -V
You can find the latest version of Python here.
Git
Install the latest version of Git.
Pip Install
pip3 install NlpToolkit-Corpus-Cy
Download Code
In order to work on code, create a fork from GitHub page. Use Git for cloning the code to your local or below line for Ubuntu:
git clone <your-fork-git-link>
A directory called Corpus will be created. Or you can use below link for exploring the code:
git clone https://github.com/olcaytaner/Corpus-Cy.git
Open project with Pycharm IDE
Steps for opening the cloned project:
- Start IDE
- Select File | Open from main menu
- Choose
Corpus-Cy
file - Select open as project option
- Couple of seconds, dependencies will be downloaded.
Detailed Description
Corpus
To store a corpus in memory
a = Corpus("derlem.txt")
If this corpus is split with dots but not in sentences
Corpus(self, fileName=None, splitterOrChecker=None)
The number of sentences in the corpus
sentenceCount(self) -> int
To get ith sentence in the corpus
getSentence(self, index: int) -> Sentence
TurkishSplitter
TurkishSplitter class is used to split the text into sentences in accordance with the . rules of Turkish.
split(self, line: str) -> list
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Hashes for NlpToolkit-Corpus-Cy-1.0.16.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | 574458cae23ac7a4358502ed2a460d5cca5f3dc3b8dc8e905188df1e081a5fc4 |
|
MD5 | 8e0cd45892596f80591d81ecc6f4bfa8 |
|
BLAKE2b-256 | f065c8fbd7d2d2c85e479dadbc89dc8dbb03834dea16a4c70c629c0765866af3 |