RecSys Library

These details have not been verified by PyPI

Project links

Development Status
- 3 - Alpha
Environment
- Console
Intended Audience
- Science/Research
Natural Language
- English
Operating System
- OS Independent
Programming Language
Topic
- Scientific/Engineering :: Artificial Intelligence
Typing
- Typed

Project description

RePlay

RePlay is a library providing tools for all stages of creating a recommendation system, from data preprocessing to model evaluation and comparison.

RePlay uses PySpark to handle big data.

You can

Filter and split data
Train models
Optimize hyper parameters
Evaluate predictions with metrics
Combine predictions from different models
Create a two-level model

Documentation is available here.

Installation
Quickstart
Resources
Contributing to RePlay

Installation

Use Linux machine with Python 3.7-3.9, Java 8+ and C++ compiler.

pip install replay-rec

To get the latest development version or RePlay, install it from the GitHab repository. It is preferable to use a virtual environment for your installation.

If you encounter an error during RePlay installation, check the troubleshooting guide.

Quickstart

from rs_datasets import MovieLens

from replay.data_preparator import DataPreparator, Indexer
from replay.metrics import HitRate, NDCG
from replay.models import ItemKNN
from replay.session_handler import State
from replay.splitters import UserSplitter

spark = State().session

ml_1m = MovieLens("1m")

# data preprocessing
preparator = DataPreparator()
log = preparator.transform(
    columns_mapping={
        'user_id': 'user_id',
        'item_id': 'item_id',
        'relevance': 'rating',
        'timestamp': 'timestamp'
    }, 
    data=ml_1m.ratings
)
indexer = Indexer(user_col='user_id', item_col='item_id')
indexer.fit(users=log.select('user_id'), items=log.select('item_id'))
log_replay = indexer.transform(df=log)

# data splitting
user_splitter = UserSplitter(
    item_test_size=10,
    user_test_size=500,
    drop_cold_items=True,
    drop_cold_users=True,
    shuffle=True,
    seed=42,
)
train, test = user_splitter.split(log_replay)

# model training
model = ItemKNN()
model.fit(train)

# model inference
recs = model.predict(
    log=train,
    k=K,
    users=test.select('user_idx').distinct(),
    filter_seen_items=True,
)

# model evaluation
metrics = Experiment(test,  {NDCG(): K, HitRate(): K})
metrics.add_result("knn", recs)

Resources

Usage examples

01_replay_basics.ipynb - get started with RePlay.
02_models_comparison.ipynb - reproducible models comparison on MovieLens-1M dataset.
03_features_preprocessing_and_lightFM.ipynb - LightFM example with pyspark for feature preprocessing.
04_splitters.ipynb - An example of using RePlay data splitters.
05_feature_generators.ipynb - Feature generation with RePlay.

Videos and papers

Video guides:
- Replay for offline recommendations, AI Journey 2021
Research papers:
- Yan-Martin Tamm, Rinchin Damdinov, Alexey Vasilev Quality Metrics in Recommender Systems: Do We Calculate Metrics Consistently?

Contributing to RePlay

For more details visit development section in docs

Project details

These details have not been verified by PyPI

Project links

Development Status
- 3 - Alpha
Environment
- Console
Intended Audience
- Science/Research
Natural Language
- English
Operating System
- OS Independent
Programming Language
Topic
- Scientific/Engineering :: Artificial Intelligence
Typing
- Typed

Release history Release notifications | RSS feed

0.21.4

Mar 2, 2026

0.21.4rc0 pre-release

Mar 2, 2026

0.21.3

Feb 26, 2026

0.21.3rc0 pre-release

Feb 26, 2026

0.21.2

Feb 11, 2026

0.21.2rc0 pre-release

Feb 11, 2026

0.21.1

Feb 5, 2026

0.21.1rc0 pre-release

Feb 5, 2026

0.21.0

Jan 30, 2026

0.21.0rc0 pre-release

Jan 30, 2026

0.20.3

Dec 15, 2025

0.20.3rc0 pre-release

Dec 15, 2025

0.20.2

Dec 9, 2025

0.20.1

Nov 19, 2025

0.20.1rc0 pre-release

Nov 19, 2025

0.20.0

Oct 3, 2025

0.20.0rc0 pre-release

Oct 17, 2025

0.19.0

May 26, 2025

0.19.0rc0 pre-release

May 26, 2025

0.18.1

Mar 14, 2025

0.18.1rc0 pre-release

Mar 14, 2025

0.18.0

Sep 13, 2024

0.18.0rc0 pre-release

Sep 13, 2024

0.17.1

Aug 22, 2024

0.17.1rc0 pre-release

Aug 22, 2024

0.17.0

Jun 7, 2024

0.17.0rc0 pre-release

Jun 7, 2024

0.16.0

Mar 13, 2024

0.16.0rc0 pre-release

Mar 20, 2024

0.15.0

Nov 30, 2023

0.15.0rc0 pre-release

Nov 30, 2023

0.14.0

Nov 24, 2023

0.14.0rc0 pre-release

Nov 24, 2023

0.13.0

Nov 16, 2023

0.13.0rc0 pre-release

Nov 16, 2023

0.12.0

Oct 9, 2023

0.11.0

Jul 13, 2023

This version

0.10.0

Nov 29, 2022

0.9.0

Apr 13, 2022

0.8.0

Dec 6, 2021

0.7.0

Nov 11, 2021

0.6.1

Oct 21, 2021

0.6.0

Sep 13, 2021

0.5.1

Sep 6, 2021

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

replay_rec-0.10.0.tar.gz (91.6 kB view details)

Uploaded Nov 29, 2022 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

replay_rec-0.10.0-py3-none-any.whl (118.9 kB view details)

Uploaded Nov 29, 2022 Python 3

File details

Details for the file replay_rec-0.10.0.tar.gz.

File metadata

Download URL: replay_rec-0.10.0.tar.gz
Upload date: Nov 29, 2022
Size: 91.6 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: poetry/1.2.2 CPython/3.9.7 Darwin/21.5.0

File hashes

Hashes for replay_rec-0.10.0.tar.gz
Algorithm	Hash digest
SHA256	`671bb3bdbc501fdac1662fe97a69cc5dfae85174a579a6241357e2edf4844822`
MD5	`f9ba1b12c026350e8aa58d976f7e9598`
BLAKE2b-256	`0daa66c0e1bf586effb01788d36f75c54fe7cb721b93d63c7f4e175773322b69`

See more details on using hashes here.

File details

Details for the file replay_rec-0.10.0-py3-none-any.whl.

File metadata

Download URL: replay_rec-0.10.0-py3-none-any.whl
Upload date: Nov 29, 2022
Size: 118.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: poetry/1.2.2 CPython/3.9.7 Darwin/21.5.0

File hashes

Hashes for replay_rec-0.10.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`d78f6929ddc17a9a8df1246be9f98c496d88a3ac7f6bdd0a46f3e870e252c05d`
MD5	`7c0c6fe51158df5688a2e71f714d446e`
BLAKE2b-256	`a85028e8adf115d95319fc4dd3cb625e6f4cde753125830d13be7b3f28222cde`

See more details on using hashes here.

replay-rec 0.10.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

RePlay

Table of Contents

Installation

Quickstart

Resources

Usage examples

Videos and papers

Contributing to RePlay

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes