Skip to main content

Evaluation engine: RAGAS, DeepEval, LLM-as-Judge, and audit report generation

Project description

rag-forge-evaluator

RAG pipeline evaluation engine for the RAG-Forge toolkit: RAGAS, DeepEval, LLM-as-Judge, and the RAG Maturity Model.

Installation

pip install rag-forge-evaluator

Usage

from rag_forge_evaluator.assess import RMMAssessor

assessor = RMMAssessor()
result = assessor.assess(config={
    "retrieval_strategy": "hybrid",
    "input_guard_configured": True,
    "output_guard_configured": True,
})
print(result.badge)  # e.g., "RMM-3 Better Trust"

Features

  • RMM (RAG Maturity Model) scoring (levels 0-5)
  • RAGAS, DeepEval, and LLM-as-Judge evaluators
  • Golden set management with traffic sampling
  • Cost estimation
  • HTML and PDF report generation

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

rag_forge_evaluator-0.1.0.tar.gz (31.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

rag_forge_evaluator-0.1.0-py3-none-any.whl (36.7 kB view details)

Uploaded Python 3

File details

Details for the file rag_forge_evaluator-0.1.0.tar.gz.

File metadata

  • Download URL: rag_forge_evaluator-0.1.0.tar.gz
  • Upload date:
  • Size: 31.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for rag_forge_evaluator-0.1.0.tar.gz
Algorithm Hash digest
SHA256 ce5e3fc7ff681c7e371f4c4848fcb431be7446a60f4b55ba9b0e0af78d8ca98e
MD5 3e9f58ee67a717c2187ea31a528f3371
BLAKE2b-256 c57b708891424fc2ee4249a68d58679eb60dcb24bd5cb837d191ccbcac891219

See more details on using hashes here.

File details

Details for the file rag_forge_evaluator-0.1.0-py3-none-any.whl.

File metadata

File hashes

Hashes for rag_forge_evaluator-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 c4d8d8be1984f0c37dd96156fe777196f615ca5ec750d122166458b27e39ffd7
MD5 ef43cd959a6a2ae5ac91fc07048b644a
BLAKE2b-256 81b0f830c0446d19c12ebbf127dc4b7a19bf56fb3d925bdd5a8cbade92fac4ce

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page