Tool for autodownloading recommendation systems datasets

Project description

Welcome to rs_datasets

This tool lets you download, unpack, and read recommender systems datasets into a pandas.DataFrame as easily as data = Dataset().

Installation

pip install rs_datasets

Documentation

See the project documentation for the list of available datasets and usage examples.

Example of use

from rs_datasets import MovieLens
ml = MovieLens()
ml.info()
ratings
   user_id  item_id  rating  timestamp
0        1        1     4.0  964982703
1        1        3     4.0  964981247
2        1        6     4.0  964982224
items
   item_id  ...                                       genres
0        1  ...  Adventure|Animation|Children|Comedy|Fantasy
1        2  ...                   Adventure|Children|Fantasy
2        3  ...                               Comedy|Romance
[3 rows x 3 columns]
tags
   user_id  item_id              tag   timestamp
0        2    60756            funny  1445714994
1        2    60756  Highly quotable  1445714996
2        2    60756     will ferrell  1445714992
links
   item_id  imdb_id  tmdb_id
0        1   114709    862.0
1        2   113497   8844.0
2        3   113228  15602.0

Each loaded DataFrame is available as an attribute of the dataset object, under the name printed by info() (for example, ml.ratings).
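
As a minimal sketch of what this means in practice, the helper below lists the table-like attributes of a dataset object. It assumes the tables are stored as plain instance attributes, as stated above. The names list_tables and fake_ml are hypothetical, introduced only for illustration, and are not part of rs_datasets. The stand-in object lets the sketch run without downloading anything.

```python
from types import SimpleNamespace


def list_tables(dataset):
    """Return {attribute name: column names} for every table-like attribute.

    Hypothetical helper, not part of rs_datasets. An attribute counts as
    table-like if it exposes `columns`, as pandas.DataFrame does.
    """
    tables = {}
    for name, value in vars(dataset).items():
        if name.startswith("_"):
            continue  # skip private attributes
        columns = getattr(value, "columns", None)
        if columns is not None:
            tables[name] = list(columns)
    return tables


# Stand-in for a loaded dataset so the sketch runs without a download.
# Column names are copied from the ml.info() output shown earlier.
fake_ml = SimpleNamespace(
    ratings=SimpleNamespace(columns=["user_id", "item_id", "rating", "timestamp"]),
    links=SimpleNamespace(columns=["item_id", "imdb_id", "tmdb_id"]),
    name="stand-in",  # not a table, so it is ignored
)

tables = list_tables(fake_ml)
for name, columns in tables.items():
    print(name, columns)
```

With the real package you would pass the loaded object instead, e.g. list_tables(ml). If the assumption about instance attributes holds, this should give one entry per table printed by info().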

Note

This package relies on datatable to read files. There are known issues with reading some of the datasets. They should be fixed in datatable==1.1.0, but datatable releases are infrequent. If you have problems reading a dataset, try downgrading datatable to 0.11 or 0.9. Alternatively, install a dev build 1.1.0a2102 or newer from S3: find the wheel that matches your Python version, copy its link, and run pip install with that link. Sorry for the inconvenience.

Download files

Source Distribution

rs_datasets-0.5.1.tar.gz (13.1 kB)

Uploaded Source

Built Distribution

rs_datasets-0.5.1-py3-none-any.whl (21.6 kB)

Uploaded Python 3

File details

Details for the file rs_datasets-0.5.1.tar.gz.

File metadata

  • Download URL: rs_datasets-0.5.1.tar.gz
  • Size: 13.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.6.1 requests/2.25.0 setuptools/41.2.0 requests-toolbelt/0.9.1 tqdm/4.58.0 CPython/3.7.7

File hashes

Hashes for rs_datasets-0.5.1.tar.gz
Algorithm    Hash digest
SHA256       3af2a97c80363ec43f2fa9dff69da76aee44b83dbda295b31a0401c025519852
MD5          aaed1753fd44c37a2b12f7ddd34345e4
BLAKE2b-256  501520590e684171c1884d15291a5a9d824d1f57bff3ec932dddbe8f3c5f7fbd

File details

Details for the file rs_datasets-0.5.1-py3-none-any.whl.

File metadata

  • Download URL: rs_datasets-0.5.1-py3-none-any.whl
  • Size: 21.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.6.1 requests/2.25.0 setuptools/41.2.0 requests-toolbelt/0.9.1 tqdm/4.58.0 CPython/3.7.7

File hashes

Hashes for rs_datasets-0.5.1-py3-none-any.whl
Algorithm    Hash digest
SHA256       accd8a845e9f7d2c48950bf1f5cb387d55bcc3d1d7c2fa0b547e553c7f5dd6f2
MD5          b246c25d9e06749bf508752e48f7e1db
BLAKE2b-256  c1f444e31f0d2483d1430ec54dfc488fd01256df891603701c938f6d2ae680b2
