Skip to main content

easy access to commonly used datasets in pandas dataframe format

Project description

DataSetLib

Introduction

a simple library having access to some often used datasets in pandas dataframe format

Getting Started

There are two main functions in the module:

  • get_datasets() returns a list with all the available datasets
  • get_dataset() returns a specific dataset identified by a name

Usage of datasetlib

>>> import datasetlib as dsl
>>> dsl.get_datasets()
['aapl', 'amazon_reviews', 'avocado', 'babynames', 'bank_clients', 'bmw', 'canada_population', 
 'cancer', 'crypto_prices', 'crypto_returns', 'human_resources', 'project1_sales_data', 
 'project1_stores_data', 'sp500_prices', 'stock_prices', 'stocks', 'summergames', 
 'temperatures', 'titanic']
>>> dsl.get_dataset("titanic")
     survived  pclass     sex   age  sibsp  parch     fare embarked deck
0           0       3    male  22.0      1      0   7.2500        S  NaN
1           1       1  female  38.0      1      0  71.2833        C    C
2           1       3  female  26.0      0      0   7.9250        S  NaN
3           1       1  female  35.0      1      0  53.1000        S    C
4           0       3    male  35.0      0      0   8.0500        S  NaN
..        ...     ...     ...   ...    ...    ...      ...      ...  ...
886         0       2    male  27.0      0      0  13.0000        S  NaN
887         1       1  female  19.0      0      0  30.0000        S    B
888         0       3  female   NaN      1      2  23.4500        S  NaN
889         1       1    male  26.0      0      0  30.0000        C    C
890         0       3    male  32.0      0      0   7.7500        Q  NaN

[891 rows x 9 columns]
>>> 

Project Homepage

https://dev.azure.com/neuraldevelopment/datasetlib

Contribute

If you find a defect or suggest a new function, please send an eMail to neutro2@outlook.de

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

datasetlib-0.2.5.2.tar.gz (17.4 MB view details)

Uploaded Source

Built Distribution

datasetlib-0.2.5.2-py3-none-any.whl (17.7 MB view details)

Uploaded Python 3

File details

Details for the file datasetlib-0.2.5.2.tar.gz.

File metadata

  • Download URL: datasetlib-0.2.5.2.tar.gz
  • Upload date:
  • Size: 17.4 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.10.14

File hashes

Hashes for datasetlib-0.2.5.2.tar.gz
Algorithm Hash digest
SHA256 95745a56c3c65b4d6cefcb1ee2d9bba831ac1c37d2f5c500627d434d946881e4
MD5 1e1939f4a13ff0bb48c2888e629e7698
BLAKE2b-256 3d6971689b72e5beee84f4dcd14c8403fbe9c97803de5af253f2704d9ba67f9e

See more details on using hashes here.

File details

Details for the file datasetlib-0.2.5.2-py3-none-any.whl.

File metadata

  • Download URL: datasetlib-0.2.5.2-py3-none-any.whl
  • Upload date:
  • Size: 17.7 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.10.14

File hashes

Hashes for datasetlib-0.2.5.2-py3-none-any.whl
Algorithm Hash digest
SHA256 6f801f4844e547e1663db718dc4a85ddda5c11c700b029fa112fa19e5f0adfa0
MD5 2d6e0181acb2143ff579adfb8fce9325
BLAKE2b-256 61aa6bda27110a176f5f203bcea8dd6825eed064092ce8d67b3275abeff9dca4

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page