Easy access to commonly used datasets in pandas DataFrame format

Project description

DataSetLib

Introduction

DataSetLib is a simple library that provides access to some frequently used datasets as pandas DataFrames.

Getting Started

The module provides two main functions:

  • get_datasets() returns a list of the names of all available datasets
  • get_dataset() returns the dataset identified by the given name as a pandas DataFrame
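
Because get_dataset() is keyed by name, it can help to check a name against get_datasets() before loading. The following is a minimal sketch, not part of datasetlib: load_dataset is a hypothetical helper, and it assumes only the two functions described above. How datasetlib itself reacts to an unknown name is not documented here, so the helper raises its own KeyError.

```python
from difflib import get_close_matches


def load_dataset(lib, name):
    """Load `name` via lib.get_dataset() after checking lib.get_datasets().

    `lib` is any object exposing get_datasets() and get_dataset(name),
    for example the imported datasetlib module. This helper is
    hypothetical and is not part of datasetlib.
    """
    available = lib.get_datasets()
    if name not in available:
        # Suggest similarly spelled names to help with typos.
        hints = get_close_matches(name, available, n=3)
        message = f"unknown dataset {name!r}"
        if hints:
            message += f"; did you mean {', '.join(hints)}?"
        raise KeyError(message)
    return lib.get_dataset(name)


# Intended usage (requires datasetlib to be installed):
#   import datasetlib as dsl
#   df = load_dataset(dsl, "titanic")
```

Passing the module in as an argument keeps the helper independent of how datasetlib is imported or aliased.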

Usage of datasetlib

>>> import datasetlib as dsl
>>> dsl.get_datasets()
['aapl', 'amazon_reviews', 'avocado', 'babynames', 'bank_clients', 'bmw', 'canada_population', 
 'cancer', 'crypto_prices', 'crypto_returns', 'human_resources', 'project1_sales_data', 
 'project1_stores_data', 'sp500_prices', 'stock_prices', 'stocks', 'summergames', 
 'temperatures', 'titanic']
>>> dsl.get_dataset("titanic")
     survived  pclass     sex   age  sibsp  parch     fare embarked deck
0           0       3    male  22.0      1      0   7.2500        S  NaN
1           1       1  female  38.0      1      0  71.2833        C    C
2           1       3  female  26.0      0      0   7.9250        S  NaN
3           1       1  female  35.0      1      0  53.1000        S    C
4           0       3    male  35.0      0      0   8.0500        S  NaN
..        ...     ...     ...   ...    ...    ...      ...      ...  ...
886         0       2    male  27.0      0      0  13.0000        S  NaN
887         1       1  female  19.0      0      0  30.0000        S    B
888         0       3  female   NaN      1      2  23.4500        S  NaN
889         1       1    male  26.0      0      0  30.0000        C    C
890         0       3    male  32.0      0      0   7.7500        Q  NaN

[891 rows x 9 columns]
>>> 
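
To get an overview of everything the library ships, the two functions can be combined in a loop. This is a rough sketch under the assumption that every dataset is returned as a pandas DataFrame (so that a .shape attribute is available); dataset_shapes is a hypothetical helper, not a datasetlib function.

```python
def dataset_shapes(lib):
    """Map each dataset name to the (rows, columns) shape of its DataFrame.

    `lib` is any object exposing get_datasets() and get_dataset(name),
    for example the imported datasetlib module. Hypothetical helper,
    not part of datasetlib.
    """
    shapes = {}
    for name in lib.get_datasets():
        frame = lib.get_dataset(name)
        shapes[name] = frame.shape  # pandas DataFrames expose .shape
    return shapes


# Intended usage (requires datasetlib to be installed):
#   import datasetlib as dsl
#   for name, shape in dataset_shapes(dsl).items():
#       print(name, shape)
```

Going by the session above, the entry for "titanic" should be (891, 9). Note that this loads every dataset once, which may take a moment.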

Project Homepage

https://dev.azure.com/neuraldevelopment/datasetlib

Contribute

If you find a defect or would like to suggest a new function, please send an email to neutro2@outlook.de.

Download files

Source Distribution

datasetlib-0.2.6.tar.gz (17.4 MB view details)

Uploaded Source

Built Distribution

datasetlib-0.2.6-py3-none-any.whl (17.7 MB view details)

Uploaded Python 3

File details

Details for the file datasetlib-0.2.6.tar.gz.

File metadata

  • Download URL: datasetlib-0.2.6.tar.gz
  • Upload date:
  • Size: 17.4 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.10.14

File hashes

Hashes for datasetlib-0.2.6.tar.gz
Algorithm Hash digest
SHA256 8f5abf756f1d14e019433e12621be87c9633ce6e3ded8cb409bb39b14f79f8bf
MD5 1faf044d33760eafb9e3db987d1ee006
BLAKE2b-256 c7ad9fe03e8dc7224dc8e07d01fb752c114ab69ba19f956e85f60210eb644e15

File details

Details for the file datasetlib-0.2.6-py3-none-any.whl.

File metadata

  • Download URL: datasetlib-0.2.6-py3-none-any.whl
  • Upload date:
  • Size: 17.7 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.10.14

File hashes

Hashes for datasetlib-0.2.6-py3-none-any.whl
Algorithm Hash digest
SHA256 ea6b4871b48f3bafb05520ccaf978236b391b39c8c3aa84cfc8214a5ff70ba9e
MD5 219be84ceb25a387ec4018b746e2d6ac
BLAKE2b-256 8311b94915a474952cd12ef2de891cd9f878609d8779f059a77f3593b129fecc
