A collection of common recommendation system metrics

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

These details have not been verified by PyPI

Project description

Recommender metrics

This library is a colletion of common recommender system (RS) evaluation metrics. Moreover, as RS might perform differently for different user groups due to limitations in available data, this library supports the out-of-the-box computations for subsets of users.

Recommender metrics

Metrics Overview

The following metrics are supported (all with the cut-off threshold k):

This library focuses on efficient metric implementations for PyTorch tensors, NumPy arrays and sparse arrays.

Notes:
* Averaging average precision and reciprocal rank of multiple samples leads to mean average precision (MAP) and mean reciprocal rank (MRR), respectively, which are often used in research.

Installation

Install via pip: python -m pip install rmet
Or from source: python -m pip install .

Usage

There are different ways to compute metrics. In the following, we are going to list all of them.

Single computations

To compute individual metrics, simply import and call them with your model's output (the logits), the true (known) interactions and some cut-off value k:

from rmet import ndcg
ndcg(model_output, targets, k=10)

Sample output:

0.033423

Note: Coverage does not require the targets attribute.

Multiple metrics and thresholds

You can also call calculate multiple metrics and thresholds efficiently with a single function call. To do so, check out the calculate function:

from rmet import calculate

calculate(
    metrics=["ndcg", "recall"], 
    logits=model_output, 
    targets=targets, 
    k=[10, 50],
    return_individual=False,
    flatten_results=True,
)

Sample output:

{
 'ndcg@10': 0.479,
 'ndcg@50': 0.5,
 'recall@10': 0.350,
 'recall@50': 0.363
}

If return_individual is set, the metrics are also returned on sample level, e.g., for every user, when possible.

Please check out the functions docstring for the full feature description and its extended functionality.

Computations per user group

If you want to get insights into the performance of different user groups, e.g., to study differences in recommendation performance based on the users' countries of origin, check out the calculate_per_group function:

from rmet import calculate_per_group

# your actual groups as an iterable, e.g., list or pd.Series
group_assignment = ["AT", "DE", "FR", ...] 

calculate_per_group(
    group_name="country",
    group_assignment=group_assignment,
    metrics=["ndcg", "recall"], 
    logits=model_output, 
    targets=targets, 
    k=[10],
    return_individual=False,
    flatten_results=True,
)

Sample output:

{
 'ndcg@10/country_AT': 0.173,
 'ndcg@10/country_DE': 0.199,
 'ndcg@10/country_FR': 0.239,
 'recall@10/country_AT': 0.282,
 'recall@10/country_DE': 0.301,
 'recall@10/country_FR': 0.357,
}

Batch-wise evaluation

For big datasets and real-world applications, gathering all the logits and targets before computing the recommendation metrics may be too resource-intensive. To simplify calculations in such scenarios, we provide BatchEvaluator, a class that evaluates and stores intermediary results.

Overall computation

from rmet import BatchEvaluator

# instantiate the evaluator class
batch_evaluator = BatchEvaluator(
    metrics=["ndcg"],
    top_k=[10],
)

# iterate over the batches
for batch in batches:
    user_indices, logits, targets = batch

    # you need to call 'eval_batch' for each batch
    batch_evaluator.eval_batch(
        user_indices=user_indices
        logits=logits,
        targets=targets,
    )

# use 'get_results' to determine the final results
batch_evaluator.get_results()

Sample output:

{
 'ndcg@10': 0.121,
}

Including user groups

BatchEvaluator.eval_batch() also accepts group assignments as input, which allows the computation of metrics on group and global level.

from rmet import BatchEvaluator

# instantiate the evaluator class
batch_evaluator = BatchEvaluator(
    metrics=["ndcg"],
    top_k=[10],
)

# iterate over the batches
for batch in batches:
    # batch also returns group_assignments, which is 
    # a mapping from group_name to their values, e.g.,
    # {"country": ["AT", "DE", ...], "gender": ["m", "n", ...]} 
    user_indices, logits, targets, group_assignments = batch

    # you need to call 'eval_batch' for each batch
    batch_evaluator.eval_batch(
        user_indices=user_indices
        logits=logits,
        targets=targets,
        group_assignments=group_assignments,
    )

# use 'get_results' to determine the final results
batch_evaluator.get_results()

Sample output:

{
 'ndcg@10': 0.121,
 'ndcg@10/country_AT': 0.115,
 'ndcg@10/country_DE': 0.142,
 'ndcg@10/gender_m': 0.087,
 'ndcg@10/gender_f': 0.156,
}

[Deprecated] Usage metric differences for user features

[NOTE] This feature is deprecated, use calculate_per_group and the BatchEvaluator with group_assignments instead.

One can also instantiate the UserFeature class for some demographic user feature, such that the performance difference of RS on for different users can be evaluated, e.g., for male and female users in the context of gender.

To do so, you first need to specify which feature belongs to which user via the UserGroup class and then simply call calculate_for_group similar to calculate above.

from rmet import UserFeature, calculate_for_feature
ug_gender = UserFeature("gender", ["m", "m", "f", "d", "m"])

calculate_for_feature(
    ug_gender, 
    metrics=["ndcg", "recall"], 
    logits=model_output, 
    targets=targets, 
    k=10,
    return_individual=False,
    flatten_results=True,
)

Sample output:

{
    'gender_f': {'ndcg@10': 0.195, 'recall@10': 0.125},
    'gender_m': {'ndcg@10': 0.779, 'recall@10': 0.733},
    'gender_d': {'ndcg@10': 0.390, 'recall@10': 0.458},
    'gender_f-m': {'ndcg@10': -0.584, 'recall@10': -0.608},
    'gender_f-d': {'ndcg@10': -0.195, 'recall@10': -0.333},
    'gender_m-d': {'ndcg@10': 0.388, 'recall@10': 0.275}
}

License

MIT License - see the LICENSE file for more details.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

cganhoer

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.1.6

Jul 28, 2025

0.1.5

Jun 27, 2025

0.1.4

Jun 26, 2025

0.1.3

Jun 26, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

rmet-0.1.6.tar.gz (23.9 kB view details)

Uploaded Jul 28, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

rmet-0.1.6-py3-none-any.whl (20.3 kB view details)

Uploaded Jul 28, 2025 Python 3

File details

Details for the file rmet-0.1.6.tar.gz.

File metadata

Download URL: rmet-0.1.6.tar.gz
Upload date: Jul 28, 2025
Size: 23.9 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for rmet-0.1.6.tar.gz
Algorithm	Hash digest
SHA256	`f806c58ab07470bd4831b5b207742584a3c4c005e99541f6cff20c0b94fb1a5c`
MD5	`940d977bb807607767df05b775b718d5`
BLAKE2b-256	`5fac47e2bec788b7ae4f9bbdc8806759e00e76dce5a3e95deb55bbc393562020`

See more details on using hashes here.

Provenance

The following attestation bundles were made for rmet-0.1.6.tar.gz:

Publisher: publish-to-pypi.yaml on Tigxy/recommender-metrics

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: rmet-0.1.6.tar.gz
- Subject digest: f806c58ab07470bd4831b5b207742584a3c4c005e99541f6cff20c0b94fb1a5c
- Sigstore transparency entry: 319724754
- Sigstore integration time: Jul 28, 2025
Source repository:
- Permalink: Tigxy/recommender-metrics@02dac994c618fe5dd5f6b6c0c1deaaab2d77a60b
- Branch / Tag: refs/heads/main
- Owner: https://github.com/Tigxy
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish-to-pypi.yaml@02dac994c618fe5dd5f6b6c0c1deaaab2d77a60b
- Trigger Event: push

File details

Details for the file rmet-0.1.6-py3-none-any.whl.

File metadata

Download URL: rmet-0.1.6-py3-none-any.whl
Upload date: Jul 28, 2025
Size: 20.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for rmet-0.1.6-py3-none-any.whl
Algorithm	Hash digest
SHA256	`0ccdc4a4e577717349b80dcf1a48288cf89791ab928fa7bedd2c319cdd7236c1`
MD5	`e3575dabbbc2ce3983869c042b4eaf70`
BLAKE2b-256	`668a0ce135902934f27288156299e3f7b9c167af5d8f26446576f909ab8123ab`

See more details on using hashes here.

Provenance

The following attestation bundles were made for rmet-0.1.6-py3-none-any.whl:

Publisher: publish-to-pypi.yaml on Tigxy/recommender-metrics

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: rmet-0.1.6-py3-none-any.whl
- Subject digest: 0ccdc4a4e577717349b80dcf1a48288cf89791ab928fa7bedd2c319cdd7236c1
- Sigstore transparency entry: 319724800
- Sigstore integration time: Jul 28, 2025
Source repository:
- Permalink: Tigxy/recommender-metrics@02dac994c618fe5dd5f6b6c0c1deaaab2d77a60b
- Branch / Tag: refs/heads/main
- Owner: https://github.com/Tigxy
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish-to-pypi.yaml@02dac994c618fe5dd5f6b6c0c1deaaab2d77a60b
- Trigger Event: push

rmet 0.1.6

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

Recommender metrics

Table of Contents

Metrics Overview

Installation

Usage

Single computations

Multiple metrics and thresholds

Computations per user group

Batch-wise evaluation

Overall computation

Including user groups

[Deprecated] Usage metric differences for user features

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance