Python package for calculating well-known measures in computational linguistics
LinguaF
LinguaF gives researchers and developers easy access to methods of quantitative language analysis, such as readability, complexity, diversity, and other descriptive statistics.
Usage
documents = [
"Pain and suffering are always inevitable for a large intelligence and a deep heart. The really great men must, I think, have great sadness on earth.",
"To go wrong in one's own way is better than to go right in someone else's.",
"The darker the night, the brighter the stars, The deeper the grief, the closer is God!"
]
Descriptive Statistics
from linguaf import descriptive_statistics as ds
ds.words_per_sentence(documents)
# Output: 15
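To show what this measure captures, here is a minimal standalone sketch that does not use LinguaF. The function name `naive_words_per_sentence` and the regex-based splitting are assumptions made for illustration. LinguaF's own tokenisation, sentence splitting, and rounding may differ, so the numbers need not match.

```python
import re


def naive_words_per_sentence(documents):
    """Average number of word tokens per sentence across all documents."""
    total_words = 0
    total_sentences = 0
    for text in documents:
        # Naive split: a run of ., ! or ? followed by whitespace or end of text.
        sentences = [s for s in re.split(r"[.!?]+(?:\s+|$)", text) if s.strip()]
        total_sentences += len(sentences)
        for sentence in sentences:
            # A word is a run of word characters, optionally with apostrophes inside.
            total_words += len(re.findall(r"\w+(?:'\w+)*", sentence))
    return total_words / total_sentences if total_sentences else 0.0
```

Calling `naive_words_per_sentence(documents)` on the list defined under Usage returns a float. No rounding is applied in this sketch.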
Lexical Complexity
from linguaf import lexical_complexity as lc
lc.mean_dependency_distance(documents)
# Output: 2.307306255835668
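Mean dependency distance is usually defined as the average absolute distance, in token positions, between each non-root word and its syntactic head. The sketch below illustrates that definition on a pre-parsed sentence. The helper `mdd_from_heads` and its input format are hypothetical and are not part of LinguaF, which presumably runs a dependency parser on the raw text first.

```python
def mdd_from_heads(heads):
    """Mean dependency distance for one parsed sentence.

    heads[i] is the 1-based position of the head of token i + 1;
    a head of 0 marks the root, which has no dependency distance.
    """
    distances = [
        abs(position - head)
        for position, head in enumerate(heads, start=1)
        if head != 0
    ]
    return sum(distances) / len(distances) if distances else 0.0


# "The cat sat": The -> cat (2), cat -> sat (3), sat is the root (0)
print(mdd_from_heads([2, 3, 0]))  # 1.0
```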
Lexical Diversity
from linguaf import lexical_diversity as ld
ld.log_type_token_ratio(documents)
# Output: 94.03574963462502
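The log type-token ratio is commonly defined as log(types) / log(tokens), where types are the distinct tokens. Below is a minimal sketch of that textbook definition. The function `naive_log_ttr` and its lowercase, regex-based tokeniser are assumptions. LinguaF may tokenise, filter, or scale the value differently, and the output shown above suggests a different scale, which this sketch does not try to reproduce.

```python
import math
import re


def naive_log_ttr(documents):
    """log(number of distinct tokens) / log(number of tokens), on a 0..1 scale."""
    tokens = []
    for text in documents:
        # Lowercase so that "The" and "the" count as one type.
        tokens.extend(re.findall(r"\w+(?:'\w+)*", text.lower()))
    if len(tokens) < 2:
        # log(1) == 0 would make the denominator zero.
        return 0.0
    return math.log(len(set(tokens))) / math.log(len(tokens))
```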
Readability
from linguaf import readability as r
r.flesch_kincaid_grade(documents)
# Output: 4.813333333333336
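For reference, the standard English Flesch-Kincaid grade level formula is 0.39 * (words / sentences) + 11.8 * (syllables / words) - 15.59. The sketch below applies it to counts that are already known. The function name `fk_grade_from_counts` is hypothetical, and how LinguaF counts words, sentences, and syllables is not shown here.

```python
def fk_grade_from_counts(words, sentences, syllables):
    """Flesch-Kincaid grade level from raw counts (standard English coefficients)."""
    if words == 0 or sentences == 0:
        return 0.0
    return 0.39 * (words / sentences) + 11.8 * (syllables / words) - 15.59


# 100 words in 5 sentences with 150 syllables
print(round(fk_grade_from_counts(100, 5, 150), 2))  # 9.91
```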
Install
Via pip
pip install linguaf
Latest version from GitHub
git clone https://github.com/Perevalov/LinguaF.git
cd LinguaF
pip install .
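After either install route, one way to confirm that the package is visible to the current Python environment is to query its installed version from the standard library. This is a small sketch; the helper `installed_version` is hypothetical and only wraps `importlib.metadata` (Python 3.8+).

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(package_name):
    """Return the installed version string, or None if the package is missing."""
    try:
        return version(package_name)
    except PackageNotFoundError:
        return None


# Prints the version string if linguaf is installed, otherwise None.
print(installed_version("linguaf"))
```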
Language Support
At the moment, the library supports English and Russian for all methods.
Open API
TBD
Citation
TBD