A Python package for easily calculating information retrieval (IR) accuracy metrics using Elasticsearch and datasets.

Elasticsearch IR Evaluator

Overview

elasticsearch-ir-evaluator is a Python package designed for easily calculating a range of information retrieval (IR) accuracy metrics using Elasticsearch and datasets. This tool is ideal for users who need to assess the effectiveness of search queries in Elasticsearch. It supports the following key IR metrics:

Precision
Recall
Mean Reciprocal Rank (MRR)
Mean Average Precision (MAP)
Cumulative Gain (CG)
Normalized Discounted Cumulative Gain (nDCG)
False Positive Rate (FPR)
Binary Preference (BPref)

Together, these metrics cover set-based accuracy (Precision, Recall, FPR), ranking quality (MRR, MAP, CG, nDCG), and evaluation with incomplete relevance judgments (BPref). You can select only the metrics that match your evaluation needs.
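As a rough illustration of what some of these metrics measure, the sketch below computes precision@k, recall@k, and reciprocal rank for a single query in plain Python. It is not the package's implementation; the function names and the sample document IDs are hypothetical, and precision@k is assumed to divide by k even when fewer than k results are returned.

```python
from typing import List, Set


def precision_at_k(ranked_ids: List[str], relevant: Set[str], k: int) -> float:
    """Fraction of the top-k slots filled by relevant documents."""
    if k <= 0:
        raise ValueError("k must be positive")
    hits = sum(1 for doc_id in ranked_ids[:k] if doc_id in relevant)
    return hits / k


def recall_at_k(ranked_ids: List[str], relevant: Set[str], k: int) -> float:
    """Fraction of all relevant documents found in the top-k results."""
    if not relevant:
        return 0.0
    hits = sum(1 for doc_id in ranked_ids[:k] if doc_id in relevant)
    return hits / len(relevant)


def reciprocal_rank(ranked_ids: List[str], relevant: Set[str]) -> float:
    """1 / rank of the first relevant document, or 0.0 if none is returned."""
    for rank, doc_id in enumerate(ranked_ids, start=1):
        if doc_id in relevant:
            return 1.0 / rank
    return 0.0


# Hypothetical search result and ground truth for one query
ranked = ["doc3", "doc1", "doc7", "doc2"]
relevant = {"doc1", "doc2"}

print(precision_at_k(ranked, relevant, 2))  # one hit in the top 2
print(recall_at_k(ranked, relevant, 4))     # both relevant docs found
print(reciprocal_rank(ranked, relevant))    # first hit at rank 2
```

MRR is then the mean of the reciprocal ranks over all queries in the dataset.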

Installation

To install elasticsearch-ir-evaluator, use pip:

pip install elasticsearch-ir-evaluator

Prerequisites

Elasticsearch version 8.11 or higher running on your system.
Python 3.8 or higher.

Complete Usage Process

The following steps will guide you through using elasticsearch-ir-evaluator to calculate search accuracy metrics. For more detailed and practical examples, please refer to the examples directory in this repository.

Step 1: Set Up Elasticsearch Client

Configure your Elasticsearch client with the appropriate credentials:

from elasticsearch import Elasticsearch

es_client = Elasticsearch(
    hosts="https://your-elasticsearch-host",
    basic_auth=("your-username", "your-password"),
    verify_certs=True,
    ssl_show_warn=True,
)

Step 2: Create and Index the Corpus

Create and index a new corpus. You can customize index settings and text field configurations, including analyzers:

from elasticsearch_ir_evaluator import ElasticsearchIrEvaluator, Document

# Initialize the ElasticsearchIrEvaluator
evaluator = ElasticsearchIrEvaluator(es_client)

# Specify your documents
documents = [
    Document(id="doc1", title="Title 1", text="Text of document 1"),
    Document(id="doc2", title="Title 2", text="Text of document 2"),
    # ... more documents
]

# Set custom index text field configurations
text_field_config = {"analyzer": "standard"}

evaluator.set_text_field_config(text_field_config)

# Create a new index or set an existing one
evaluator.set_index_name("your_index_name")

# Index documents with an optional ingest pipeline
evaluator.index(documents, pipeline="your_optional_pipeline")

Step 3: Set a Custom Search Template

Customize the search query template for Elasticsearch. Use {{question}} as the placeholder for the question text and {{vector}} as the placeholder for the vector value of each QandA item:

search_template = {
    "query": {
        "multi_match": {
            "query": "{{question}}",
            "fields": ["title", "text"],
        }
    },
    "knn": [
        {
            "field": "vector",
            "query_vector": "{{vector}}",
            "k": 5,
            "num_candidates": 100,
        }
    ],
}

evaluator.set_search_template(search_template)
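To make the placeholder idea concrete, here is a minimal sketch of how a template like the one above could be turned into a search body for one question. This is an assumption about the general technique, not the package's actual substitution logic; render_template is a hypothetical helper, and the example vector is made up.

```python
from typing import Any, List


def render_template(template: Any, question: str, vector: List[float]) -> Any:
    """Return a copy of a nested template with placeholders filled in."""
    if isinstance(template, dict):
        return {
            key: render_template(value, question, vector)
            for key, value in template.items()
        }
    if isinstance(template, list):
        return [render_template(item, question, vector) for item in template]
    if isinstance(template, str):
        if template == "{{vector}}":
            # Swap the whole placeholder string for the list of floats
            return vector
        return template.replace("{{question}}", question)
    # Numbers, booleans, and None pass through unchanged
    return template


template = {
    "query": {
        "multi_match": {"query": "{{question}}", "fields": ["title", "text"]}
    },
    "knn": [
        {"field": "vector", "query_vector": "{{vector}}", "k": 5, "num_candidates": 100}
    ],
}

body = render_template(template, "What is Elasticsearch?", [0.1, 0.2, 0.3])
print(body["query"]["multi_match"]["query"])
print(body["knn"][0]["query_vector"])
```

Because the helper builds new dictionaries and lists, the original template is left untouched and can be reused for every question.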

Step 4: Calculate Accuracy Metrics

Use .calculate() to compute all possible metrics based on the structure of the provided dataset:

from elasticsearch_ir_evaluator import QandA

# Load QA pairs for evaluation
qa_pairs = [
    QandA(question="What is Elasticsearch?", answers=["doc1"]),
    # ... more QA pairs
]

# Calculate all metrics
results = evaluator.calculate(qa_pairs)

# Output results as a Markdown table
print(results.to_markdown())

This step evaluates search performance against the provided question-answer pairs. The .calculate() method computes every metric that can be derived from the dataset's structure.
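For readers who want to sanity-check the reported numbers, the sketch below shows the standard textbook definitions of average precision and nDCG@k for one query with binary relevance. It is an independent illustration, not code from the package; the function names and sample IDs are hypothetical, and the package may use graded relevance or different cutoffs.

```python
import math
from typing import List, Set


def average_precision(ranked_ids: List[str], relevant: Set[str]) -> float:
    """Mean of precision values taken at each rank where a relevant doc appears."""
    if not relevant:
        return 0.0
    hits = 0
    precision_sum = 0.0
    for rank, doc_id in enumerate(ranked_ids, start=1):
        if doc_id in relevant:
            hits += 1
            precision_sum += hits / rank
    return precision_sum / len(relevant)


def ndcg_at_k(ranked_ids: List[str], relevant: Set[str], k: int) -> float:
    """DCG of the top-k results divided by the DCG of an ideal ranking."""
    dcg = sum(
        1.0 / math.log2(rank + 1)
        for rank, doc_id in enumerate(ranked_ids[:k], start=1)
        if doc_id in relevant
    )
    # Ideal ranking: all relevant documents placed at the top
    ideal_hits = min(len(relevant), k)
    idcg = sum(1.0 / math.log2(rank + 1) for rank in range(1, ideal_hits + 1))
    return dcg / idcg if idcg > 0 else 0.0


# Hypothetical search result and ground truth for one query
ranked_ids = ["doc3", "doc1", "doc7", "doc2"]
answers = {"doc1", "doc2"}

print(average_precision(ranked_ids, answers))
print(round(ndcg_at_k(ranked_ids, answers, 4), 4))
```

MAP is the mean of average_precision over all questions in the dataset.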

Progress Logging

elasticsearch-ir-evaluator supports progress logging to ensure that long-running indexing tasks can be safely interrupted and resumed. This feature is particularly useful for indexing large datasets or conducting extensive search evaluations, where the process might take an extended period.

Log File

When initiating indexing or evaluation processes, the tool automatically generates a log file named elasticsearch-ir-evaluator-log.json in the current working directory. This log file contains vital information about the progress, including:

last_processed_id: The ID of the last document that was successfully indexed or queried. This ensures that the process can resume from the exact point it was interrupted.
processed_count: The total number of documents that have been processed so far, providing a quick insight into the progress.
index_name: The name of the Elasticsearch index being used, allowing the process to resume with the correct index context.
last_checkpoint_timestamp: A timestamp marking the last update to the log file, offering a reference to when the process was last active.
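The fields above suggest a small JSON object. The sketch below builds one such record to show its likely shape; the field names come from the list above, but the values, the build_checkpoint helper, and the ISO 8601 timestamp format are assumptions rather than the package's documented output.

```python
import json
from datetime import datetime, timezone
from typing import Any, Dict


def build_checkpoint(
    last_processed_id: str, processed_count: int, index_name: str
) -> Dict[str, Any]:
    """Assemble a progress record with the four fields described above."""
    return {
        "last_processed_id": last_processed_id,
        "processed_count": processed_count,
        "index_name": index_name,
        # Timestamp format is an assumption; the real file may differ
        "last_checkpoint_timestamp": datetime.now(timezone.utc).isoformat(),
    }


checkpoint = build_checkpoint("doc2", 2, "your_index_name")
print(json.dumps(checkpoint, indent=2))
```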

Resuming Operations

Upon restart, elasticsearch-ir-evaluator automatically detects the presence of the elasticsearch-ir-evaluator-log.json file and uses the information within to resume operations from where they were left off. This mechanism ensures that no duplicate processing occurs and that every document is accounted for, streamlining the continuation of previously interrupted tasks.
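The general checkpoint-and-resume pattern can be sketched as follows. This is a simplified illustration under stated assumptions, not the package's internal logic: both helper functions are hypothetical, and the sketch assumes documents are always processed in the same order so that everything up to last_processed_id can be skipped.

```python
import json
import os
from typing import Iterable, Iterator, Optional


def load_last_processed_id(log_path: str) -> Optional[str]:
    """Read last_processed_id from the log file, or None if there is no log."""
    if not os.path.exists(log_path):
        return None
    with open(log_path, encoding="utf-8") as log_file:
        return json.load(log_file).get("last_processed_id")


def remaining_ids(
    doc_ids: Iterable[str], last_processed_id: Optional[str]
) -> Iterator[str]:
    """Yield only the IDs that come after the checkpoint.

    If the checkpoint ID never appears in doc_ids, nothing is yielded.
    """
    if last_processed_id is None:
        yield from doc_ids
        return
    seen_checkpoint = False
    for doc_id in doc_ids:
        if seen_checkpoint:
            yield doc_id
        elif doc_id == last_processed_id:
            seen_checkpoint = True


# Hypothetical run: doc1 and doc2 were indexed before the interruption
todo = list(remaining_ids(["doc1", "doc2", "doc3", "doc4"], "doc2"))
print(todo)
```

To start from scratch instead of resuming, a pattern like this would only require deleting the log file before the next run.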

Ensuring Data Integrity

This logging feature is designed with data integrity in mind. By recording the progress and using this data to resume operations, elasticsearch-ir-evaluator minimizes the risk of incomplete evaluations or indexing, ensuring that the accuracy of IR metrics and the completeness of indexed datasets are maintained.

License

elasticsearch-ir-evaluator is available under the MIT License.

Release history

0.4.4 (Apr 3, 2024, this version)
0.4.3 (Apr 2, 2024)
0.4.2 (Apr 1, 2024)
0.4.1 (Mar 29, 2024)
0.4.0 (Mar 29, 2024)
0.3.1 (Mar 18, 2024)
0.3.0 (Feb 25, 2024)
0.2.0 (Jan 10, 2024)
0.1.0 (Jan 1, 2024)
