A text analysis tool that predicts whether a given string appears random or meaningful.
Project description
Randetect
Randetect
is a Python-based text analysis tool that predicts whether a given string of text appears random or has a meaningful structure. The tool is based on a combination of heuristic measures, such as text entropy, and a pre-trained logistic regression model.
Features
- Predicts randomness of a given text string.
- Utilizes both heuristic measures and a machine learning model.
- Provides preprocessing tools to clean and prepare text for analysis.
Installation
You can easily install Randetect
using pip
:
pip install randetect
Usage
To use Randetect
, you need to instantiate the TextAnalyzer
class and then call the random_detect()
method to evaluate a text string. Here's how you can do it:
from randetect import random_detect
analyzer = random_detect.TextAnalyzer()
result, label = analyzer.random_detect('asdfasdf')
print(f"'{result}' is {label}.")
In addition to predicting randomness, Randetect
provides a preprocess_text()
method to clean and prepare text:
from randetect import random_detect
analyzer = random_detect.TextAnalyzer()
processed_text = analyzer.preprocess_text('text')
print(processed_text)
Contribution
Contributions are welcome! If you find a bug, have a feature request, or want to contribute to the code, please feel free to submit an issue or a pull request on the GitHub repository.
License
This project is licensed under the MIT License. See the LICENSE.md file for more details.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for randetect-0.0.1-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | e32c1b3c440981216c933d4c26b52aa80e8a32c99b2bf1c285ccf043cf5c88a9 |
|
MD5 | 73b63f3d12b2b175d49b7feadb5c5031 |
|
BLAKE2b-256 | 54d1954edf5785ce6fef114e8060a94132ba62ca99409d64ccbee337bca57aea |