Skip to main content

A package to evaluate bm metrics

Project description

๐Ÿ“„ bm-eval-metrics

bm-eval-metrics is a Python package providing easy-to-use evaluation metrics and utilities for Data Mining and Information Retrieval modules. It helps you access and view source code for various DM and IR algorithms efficiently.


โœจ Features

  • Data Mining algorithms (Hunt's, ID3, Bagging, AdaBoost, Apriori, etc.)
  • Information Retrieval metrics (Jaccard, Precision/Recall/F-score, MAP, etc.)
  • Near Duplicate Document detection (MinHash & LSH)
  • Relevance Feedback (Rocchio & LCA)
  • Source code viewer for all modules
  • Built on NLTK, pandas, and scikit-learn

๐Ÿ“ฆ Installation

Install from PyPI:

pip install bm-eval-metrics

๐Ÿš€ Quick Start

Basic Usage

from eval_metrics.DM import adaboost, apriori, metrics
from eval_metrics.IR import eval_metrics, ndd, rel

# Print the source code directly
print("=== DM AdaBoost Module ===")
print(adaboost)

print("\n=== IR Evaluation Metrics ===")
print(eval_metrics)

print("\n=== IR Near Duplicate Documents ===")
print(ndd)

๐Ÿ› ๏ธ Components Overview

Component Description
Information Retrieval (IR)
eval_metrics.IR.all Cohesive IR File: MinHash, LSH, Rocchio, Jaccard, VS
eval_metrics.IR.all_vis Cohesive IR File + Matplotlib visualizations & Heatmaps
eval_metrics.IR.ndd Near Duplicate Documents (MinHash & LSH)
eval_metrics.IR.rel Relevance feedback & query expansion (Rocchio & LCA)
eval_metrics.IR.eval_metrics Jaccard, PRF, Compression Ratios, MAP metrics & plots
Data Mining (DM)
eval_metrics.DM.all Cohesive DM File: Hunt's, ID3, Bagging, AdaBoost, Metrics
eval_metrics.DM.all_vis Cohesive DM File + Graphviz & Matplotlib visualizations
eval_metrics.DM.apriori Apriori algorithm
eval_metrics.DM.adaboost Bagging & AdaBoost ensemble classifiers
eval_metrics.DM.bagging Bagging ensemble classifier
eval_metrics.DM.hash Hash-based mining
eval_metrics.DM.hunts Hunt's decision tree algorithm
eval_metrics.DM.hunts_test Hunt's decision tree with dataset visualization
eval_metrics.DM.id3 ID3 decision tree algorithm
eval_metrics.DM.id3_test ID3 decision tree with dataset visualization
eval_metrics.DM.metrics Classification metrics & curves
eval_metrics.DM.preprocessing Data preprocessing utilities
eval_metrics.DM.lib_doc Pandas, NumPy, Sklearn cheat sheet (DM & IR logic)
eval_metrics.DM.python_doc Python Basics cheat sheet (Sets, Dicts, Comprehensions, etc.)

๐Ÿ“š Requirements

  • Python 3.8+
  • nltk
  • pandas
  • scikit-learn (for vectorization)
  • matplotlib
  • graphviz

Install dependencies automatically with:

pip install bm-eval-metrics

๐Ÿ“‚ Project Structure

eval_metrics/
โ”‚
โ”œโ”€โ”€ src/
โ”‚   โ””โ”€โ”€ eval_metrics/
โ”‚       โ”œโ”€โ”€ __init__.py
โ”‚       โ”œโ”€โ”€ DM/
โ”‚       โ”‚   โ”œโ”€โ”€ __init__.py
โ”‚       โ”‚   โ”œโ”€โ”€ adaboost.py
โ”‚       โ”‚   โ”œโ”€โ”€ all.py
โ”‚       โ”‚   โ”œโ”€โ”€ ...
โ”‚       โ”‚   โ””โ”€โ”€ sources/
โ”‚       โ””โ”€โ”€ IR/
โ”‚           โ”œโ”€โ”€ __init__.py
โ”‚           โ”œโ”€โ”€ all.py
โ”‚           โ”œโ”€โ”€ ...
โ”‚           โ””โ”€โ”€ sources/
โ”œโ”€โ”€ pyproject.toml
โ”œโ”€โ”€ README.md
โ”œโ”€โ”€ USAGE.md
โ””โ”€โ”€ INSTALLATION.md

๐Ÿค Contributing

Contributions are welcome!

  1. Fork the repository
  2. Create a new branch
  3. Commit your changes
  4. Open a pull request

๐Ÿ“„ License

This project is licensed under the MIT License.


๐Ÿ“ฌ Support

If you encounter any issues or have feature requests, please open an issue on GitHub.


Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

bm_eval_metrics-1.0.0.tar.gz (30.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

bm_eval_metrics-1.0.0-py3-none-any.whl (54.7 kB view details)

Uploaded Python 3

File details

Details for the file bm_eval_metrics-1.0.0.tar.gz.

File metadata

  • Download URL: bm_eval_metrics-1.0.0.tar.gz
  • Upload date:
  • Size: 30.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.14

File hashes

Hashes for bm_eval_metrics-1.0.0.tar.gz
Algorithm Hash digest
SHA256 cc4404697cd894d70174564ee228773e556646c1e18b2a2a334f7ebec6a26a9b
MD5 a2da4c9aa97c83a495cd941eecf6f7a5
BLAKE2b-256 28a2acefa5119d017706a01b27bb8a8c0f6493fb6d2052f08d44033892a58427

See more details on using hashes here.

File details

Details for the file bm_eval_metrics-1.0.0-py3-none-any.whl.

File metadata

File hashes

Hashes for bm_eval_metrics-1.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 14ae3a1733b8732283f192ac5022cc9acb1a7a4375ef253cb6bf547dc0163648
MD5 6bfca2f99eb0cb1aa132cae2971e242e
BLAKE2b-256 98977d596209ea75a8fd441f824ef135f1ac647951ccd2a229bf091b120a922a

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page