A package to evaluate bm metrics
Project description
๐ bm-eval-metrics
bm-eval-metrics is a Python package providing easy-to-use evaluation metrics and utilities for Data Mining and Information Retrieval modules. It helps you access and view source code for various DM and IR algorithms efficiently.
โจ Features
- Data Mining algorithms (Hunt's, ID3, Bagging, AdaBoost, Apriori, etc.)
- Information Retrieval metrics (Jaccard, Precision/Recall/F-score, MAP, etc.)
- Near Duplicate Document detection (MinHash & LSH)
- Relevance Feedback (Rocchio & LCA)
- Source code viewer for all modules
- Built on NLTK, pandas, and scikit-learn
๐ฆ Installation
Install from PyPI:
pip install bm-eval-metrics
๐ Quick Start
Basic Usage
from eval_metrics.DM import adaboost, apriori, metrics
from eval_metrics.IR import eval_metrics, ndd, rel
# Print the source code directly
print("=== DM AdaBoost Module ===")
print(adaboost)
print("\n=== IR Evaluation Metrics ===")
print(eval_metrics)
print("\n=== IR Near Duplicate Documents ===")
print(ndd)
๐ ๏ธ Components Overview
| Component | Description |
|---|---|
| Information Retrieval (IR) | |
eval_metrics.IR.all |
Cohesive IR File: MinHash, LSH, Rocchio, Jaccard, VS |
eval_metrics.IR.all_vis |
Cohesive IR File + Matplotlib visualizations & Heatmaps |
eval_metrics.IR.ndd |
Near Duplicate Documents (MinHash & LSH) |
eval_metrics.IR.rel |
Relevance feedback & query expansion (Rocchio & LCA) |
eval_metrics.IR.eval_metrics |
Jaccard, PRF, Compression Ratios, MAP metrics & plots |
| Data Mining (DM) | |
eval_metrics.DM.all |
Cohesive DM File: Hunt's, ID3, Bagging, AdaBoost, Metrics |
eval_metrics.DM.all_vis |
Cohesive DM File + Graphviz & Matplotlib visualizations |
eval_metrics.DM.apriori |
Apriori algorithm |
eval_metrics.DM.adaboost |
Bagging & AdaBoost ensemble classifiers |
eval_metrics.DM.bagging |
Bagging ensemble classifier |
eval_metrics.DM.hash |
Hash-based mining |
eval_metrics.DM.hunts |
Hunt's decision tree algorithm |
eval_metrics.DM.hunts_test |
Hunt's decision tree with dataset visualization |
eval_metrics.DM.id3 |
ID3 decision tree algorithm |
eval_metrics.DM.id3_test |
ID3 decision tree with dataset visualization |
eval_metrics.DM.metrics |
Classification metrics & curves |
eval_metrics.DM.preprocessing |
Data preprocessing utilities |
eval_metrics.DM.lib_doc |
Pandas, NumPy, Sklearn cheat sheet (DM & IR logic) |
eval_metrics.DM.python_doc |
Python Basics cheat sheet (Sets, Dicts, Comprehensions, etc.) |
๐ Requirements
- Python 3.8+
- nltk
- pandas
- scikit-learn (for vectorization)
- matplotlib
- graphviz
Install dependencies automatically with:
pip install bm-eval-metrics
๐ Project Structure
eval_metrics/
โ
โโโ src/
โ โโโ eval_metrics/
โ โโโ __init__.py
โ โโโ DM/
โ โ โโโ __init__.py
โ โ โโโ adaboost.py
โ โ โโโ all.py
โ โ โโโ ...
โ โ โโโ sources/
โ โโโ IR/
โ โโโ __init__.py
โ โโโ all.py
โ โโโ ...
โ โโโ sources/
โโโ pyproject.toml
โโโ README.md
โโโ USAGE.md
โโโ INSTALLATION.md
๐ค Contributing
Contributions are welcome!
- Fork the repository
- Create a new branch
- Commit your changes
- Open a pull request
๐ License
This project is licensed under the MIT License.
๐ฌ Support
If you encounter any issues or have feature requests, please open an issue on GitHub.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file bm_eval_metrics-1.0.0.tar.gz.
File metadata
- Download URL: bm_eval_metrics-1.0.0.tar.gz
- Upload date:
- Size: 30.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.14
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
cc4404697cd894d70174564ee228773e556646c1e18b2a2a334f7ebec6a26a9b
|
|
| MD5 |
a2da4c9aa97c83a495cd941eecf6f7a5
|
|
| BLAKE2b-256 |
28a2acefa5119d017706a01b27bb8a8c0f6493fb6d2052f08d44033892a58427
|
File details
Details for the file bm_eval_metrics-1.0.0-py3-none-any.whl.
File metadata
- Download URL: bm_eval_metrics-1.0.0-py3-none-any.whl
- Upload date:
- Size: 54.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.14
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
14ae3a1733b8732283f192ac5022cc9acb1a7a4375ef253cb6bf547dc0163648
|
|
| MD5 |
6bfca2f99eb0cb1aa132cae2971e242e
|
|
| BLAKE2b-256 |
98977d596209ea75a8fd441f824ef135f1ac647951ccd2a229bf091b120a922a
|