A keyphrase extractor for Persian
Project description
Perke
Perke is a Python keyphrase extraction package for Persian language. It provides an end-to-end keyphrase extraction pipeline in which each component can be easily modified or extended to develop new models.
Installation
- The easiest way to install is from PyPI:
pip install perke
Alternatively, you can install directly from GitHub:pip install git+https://github.com/alirezatheh/perke.git
- Perke also requires a trained POS tagger model. We use
Hazm's POS tagger model. You can
easily download latest Hazm's POS
tagger using the following command:
python -m perke download
Alternatively, you can use another model with same tag names and structure, and put it in theresources
directory.
Simple Example
Perke provides a standardized API for extracting keyphrases from a text. Start
by typing the 4 lines below to use TextRank
keyphrase extractor.
from perke.unsupervised.graph_based import TextRank
# 1. Create a TextRank extractor.
extractor = TextRank()
# 2. Load the text.
extractor.load_text(input='text or path/to/input_file')
# 3. Build the graph representation of the text and weight the
# words. Keyphrase candidates are composed of the 33 percent
# highest weighted words.
extractor.weight_candidates(top_t_percent=0.33)
# 4. Get the 10 highest weighted candidates as keyphrases.
keyphrases = extractor.get_n_best(n=10)
For more in depth examples see the
examples
directory.
Documentation
Documentation and references are available at Read The Docs.
Implemented Models
Perke currently, implements the following keyphrase extraction models:
- Unsupervised models
Acknowledgements
Perke is inspired by pke.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file perke-0.4.4.tar.gz
.
File metadata
- Download URL: perke-0.4.4.tar.gz
- Upload date:
- Size: 20.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.4
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | a2277223d68d51e4a70ebf1ed0d7b91f6804c05e66c623750e7cc2ecddcc8617 |
|
MD5 | ba7197beff7ae59a0253793e0b368f08 |
|
BLAKE2b-256 | 33a349f2b59bed4f550b0275de5bfc3bbb6c8f143ba648fe187a881cf30bc0f9 |
File details
Details for the file perke-0.4.4-py3-none-any.whl
.
File metadata
- Download URL: perke-0.4.4-py3-none-any.whl
- Upload date:
- Size: 25.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.4
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | dc8f0777079e77e0b09ed4842b2e5632316548c6a52858de9d992cd7936008a2 |
|
MD5 | 4552cd02e9c49d84a3966825d09413bb |
|
BLAKE2b-256 | a25715d359c899837adfd6482b48dd6d7fdda46bd1e537859c5b136b5afb798d |