Skip to main content

Tool for autodownloading recommendation systems datasets

Project description

Welcome to rs_datasets

This tool allows you download, unpack and read recommender systems datasets into pandas.DataFrame as easy as data = Dataset().

Installation

pip install rs_datasets

Documentation

Please see documentation to this project to see available datasets and examples of use.

Example of use

from rs_datasets import MovieLens
ml = MovieLens()
ml.info()
ratings
   user_id  item_id  rating  timestamp
0        1        1     4.0  964982703
1        1        3     4.0  964981247
2        1        6     4.0  964982224
items
   item_id  ...                                       genres
0        1  ...  Adventure|Animation|Children|Comedy|Fantasy
1        2  ...                   Adventure|Children|Fantasy
2        3  ...                               Comedy|Romance
[3 rows x 3 columns]
tags
   user_id  item_id              tag   timestamp
0        2    60756            funny  1445714994
1        2    60756  Highly quotable  1445714996
2        2    60756     will ferrell  1445714992
links
   item_id  imdb_id  tmdb_id
0        1   114709    862.0
1        2   113497   8844.0
2        3   113228  15602.0

Loaded DataFrames are available as class attributes.

Note

This package relies on datatable to read files. There are some known issues with reading some of the datasets, which should be solved with the release of datatable==1.1.0, but they are quite slow on releases. If you experience problems with reading datasets, you may try to downgrade datatable to 0.11 or 0.9. Or you can install a dev build 1.1.0a2102 or newer from s3. Find your python version, copy link for whl and do pip install link. Sorry for the inconvenience.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

rs_datasets-0.5.0.tar.gz (13.1 kB view hashes)

Uploaded source

Built Distribution

rs_datasets-0.5.0-py3-none-any.whl (21.6 kB view hashes)

Uploaded py3

Supported by

AWS AWS Cloud computing Datadog Datadog Monitoring Facebook / Instagram Facebook / Instagram PSF Sponsor Fastly Fastly CDN Google Google Object Storage and Download Analytics Huawei Huawei PSF Sponsor Microsoft Microsoft PSF Sponsor NVIDIA NVIDIA PSF Sponsor Pingdom Pingdom Monitoring Salesforce Salesforce PSF Sponsor Sentry Sentry Error logging StatusPage StatusPage Status page