Tool for autodownloading recommendation systems datasets
Project description
Welcome to rs_datasets
This tool allows you download, unpack and read
recommender systems datasets into pandas.DataFrame
as easy as data = Dataset()
.
Installation
pip install rs_datasets
Documentation
Please see documentation to this project to see available datasets and examples of use.
Example of use
from rs_datasets import MovieLens ml = MovieLens() ml.info()
ratings user_id item_id rating timestamp 0 1 1 4.0 964982703 1 1 3 4.0 964981247 2 1 6 4.0 964982224 items item_id ... genres 0 1 ... Adventure|Animation|Children|Comedy|Fantasy 1 2 ... Adventure|Children|Fantasy 2 3 ... Comedy|Romance [3 rows x 3 columns] tags user_id item_id tag timestamp 0 2 60756 funny 1445714994 1 2 60756 Highly quotable 1445714996 2 2 60756 will ferrell 1445714992 links item_id imdb_id tmdb_id 0 1 114709 862.0 1 2 113497 8844.0 2 3 113228 15602.0
Loaded DataFrames are available as class attributes.
Note
This package relies on datatable
to read files.
There are some known issues with reading some of the datasets, which should be solved with the release of datatable==1.1.0
,
but they are quite slow on releases. If you experience problems with reading datasets, you may try to downgrade datatable
to 0.11 or 0.9. Or you can install a dev build 1.1.0a2102
or newer from s3.
Find your python version, copy link for whl and do pip install link
. Sorry for the inconvenience.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for rs_datasets-0.5.0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 290231b812fb3b08f8762b8dd680bd56405947a6d811a0104200bb1e7908000f |
|
MD5 | 0d5b7eb044f2b10dfae1fc20c0f8e3da |
|
BLAKE2-256 | 9d2bf6c478d397654b96d16adb4bdd74b48a9d32eeaad11b6e3f69d8acd34641 |