Tool for autodownloading recommendation systems datasets
Project description
Welcome to rs_datasets
This tool allows you download, unpack and read
recommender systems datasets into pandas.DataFrame
as easy as data = Dataset()
.
Installation
pip install rs_datasets
Documentation
Please see documentation to this project to see available datasets and examples of use.
Example of use
from rs_datasets import MovieLens
ml = MovieLens()
ml.info()
ratings
user_id item_id rating timestamp
0 1 1 4.0 964982703
1 1 3 4.0 964981247
2 1 6 4.0 964982224
items
item_id ... genres
0 1 ... Adventure|Animation|Children|Comedy|Fantasy
1 2 ... Adventure|Children|Fantasy
2 3 ... Comedy|Romance
[3 rows x 3 columns]
tags
user_id item_id tag timestamp
0 2 60756 funny 1445714994
1 2 60756 Highly quotable 1445714996
2 2 60756 will ferrell 1445714992
links
item_id imdb_id tmdb_id
0 1 114709 862.0
1 2 113497 8844.0
2 3 113228 15602.0
Loaded DataFrames are available as class attributes.
Note
This package relies on datatable
to read files.
There are some known issues with reading some of the datasets, which should be solved with the release of datatable==1.1.0
,
but they are quite slow on releases. If you experience problems with reading datasets, you may try to downgrade datatable
to 0.11 or 0.9. Or you can install a dev build 1.1.0a2102
or newer from s3.
Find your python version, copy link for whl and do pip install link
. Sorry for the inconvenience.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file rs_datasets-0.5.1.tar.gz
.
File metadata
- Download URL: rs_datasets-0.5.1.tar.gz
- Upload date:
- Size: 13.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.2.0 pkginfo/1.6.1 requests/2.25.0 setuptools/41.2.0 requests-toolbelt/0.9.1 tqdm/4.58.0 CPython/3.7.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 3af2a97c80363ec43f2fa9dff69da76aee44b83dbda295b31a0401c025519852 |
|
MD5 | aaed1753fd44c37a2b12f7ddd34345e4 |
|
BLAKE2b-256 | 501520590e684171c1884d15291a5a9d824d1f57bff3ec932dddbe8f3c5f7fbd |
File details
Details for the file rs_datasets-0.5.1-py3-none-any.whl
.
File metadata
- Download URL: rs_datasets-0.5.1-py3-none-any.whl
- Upload date:
- Size: 21.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.2.0 pkginfo/1.6.1 requests/2.25.0 setuptools/41.2.0 requests-toolbelt/0.9.1 tqdm/4.58.0 CPython/3.7.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | accd8a845e9f7d2c48950bf1f5cb387d55bcc3d1d7c2fa0b547e553c7f5dd6f2 |
|
MD5 | b246c25d9e06749bf508752e48f7e1db |
|
BLAKE2b-256 | c1f444e31f0d2483d1430ec54dfc488fd01256df891603701c938f6d2ae680b2 |