Datazets is a Python package for importing well-known example data sets.

Project description

datazets

  • datazets is a Python package

Star this repo if you like it! ⭐️

pip install datazets

Import datazets

# Import library
import datazets as dz
# Import data set
df = dz.get('titanic')
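A first look at the result might be sketched as follows. This assumes `dz.get` returns a pandas-style DataFrame, which the variable name `df` suggests but the text above does not state. The helper `describe` is hypothetical and not part of datazets; it relies only on the `.shape` and `.columns` attributes.

```python
def describe(df):
    """Return a one-line summary of a table-like object.

    Hypothetical helper, not part of datazets. Only `.shape` and
    `.columns` are used, so any pandas-style DataFrame should work.
    """
    n_rows, n_cols = df.shape
    names = ", ".join(str(c) for c in df.columns)
    return f"{n_rows} rows x {n_cols} columns: {names}"
```

With datazets installed, `print(describe(dz.get('titanic')))` would then give a quick overview of the loaded data.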

Data sets:

| Dataset Name | Shape | Type | Description |
|---|---|---|---|
| meta | (1472, 20) | Continuous | time |
| bitcoin | (2522, 2) | Continuous | time |
| energy | (68, 3) | Network | Data on building energy consumption |
| gas_prices | (6556, 2) | Mixed | time |
| iris | (150, 3) | Continuous | Classic flower classification dataset with iris species measurements with coordinates |
| ads | (10000, 10) | Discrete | Data on online ads, covering click-through rates and targeting information |
| bigbang | (9, 3) | Network | Data on The Big Bang Theory episodes and characters |
| malicious_urls | (387588, 2) | Text | URLs classified as malicious or benign, useful in cybersecurity |
| random_discrete | (1000, 5) | Discrete | Synthetic dataset with random discrete variables, useful for probability modeling |
| stormofswords | (352, 3) | Network | Character data from A Storm of Swords, with relationships, traits, and alliance info |
| sprinkler | (1000, 4) | Discrete | Synthetic dataset with binary variables for rain and sprinkler probability illustration |
| auto_mpg | (392, 8) | Mixed | Data on cars with features for predicting miles per gallon |
| breast_cancer | (569, 30) | Mixed | Dataset for breast cancer diagnosis prediction using tumor cell features |
| cancer | (4674, 9) | Mixed | Cancer patient data for classification and prediction of diagnosis outcome with coordinates |
| census_income | (32561, 15) | Mixed | US Census data with various demographic and economic factors for income prediction |
| elections_rus | (94487, 23) | Mixed | Russian election data with demographic and political attributes |
| elections_usa | (24611, 8) | Mixed | US election data with demographic and political attributes |
| fifa | (128, 27) | Mixed | FIFA player stats including attributes like skill, position, country, and performance |
| marketing_retail | (999, 8) | Mixed | Retail customer data for behavior and segmentation analysis |
| predictive_maintenance | (10000, 14) | Mixed | Industrial equipment data for predictive maintenance |
| student | (649, 33) | Mixed | Data on student performance with socio-demographic and academic factors |
| surfspots | (9413, 4) | Mixed | latlon |
| tips | (244, 7) | Mixed | Restaurant tipping data with variables on meal size, day, and tip amount |
| titanic | (891, 12) | Mixed | Titanic passenger data with demographic, class, and survival information |
| waterpump | (59400, 41) | Mixed | Water pump data with features for predicting functionality and maintenance needs |
| cat_and_dog | None | Image | Images of cats and dogs for classification and object recognition |
| digits | (1083, 65) | Image | Handwritten digit images (8x8 pixels) for recognition and classification |
| faces | (400, 4097) | Image | Images of faces used in facial recognition and feature analysis |
| flowers | None | Image | Various flower images for classification and image recognition |
| img_peaks1 | (930, 930, 3) | Image | Synthetic peak images for image processing and analysis |
| img_peaks2 | (125, 496, 3) | Image | Additional synthetic peak images for image processing |
| mnist | (1797, 65) | Image | MNIST handwritten digit images (28x28 pixels) for classification tasks |
| scenes | None | Image | Scene images for scene classification tasks |
| southern_nebula | None | Image | Images of the Southern Nebula, suitable for astronomical analysis |
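Because the listed shapes may differ between releases, it can be useful to compare a loaded data set against the list. The sketch below copies a few shapes from the list above into a dictionary and compares them with `df.shape`. The names `LISTED_SHAPES` and `shape_matches` are hypothetical and not part of datazets, and the sketch assumes the loaded object exposes a `.shape` attribute like a pandas DataFrame.

```python
# A few shapes copied from the data set list; entries whose shape is
# listed as None are left out. Hypothetical names, not part of datazets.
LISTED_SHAPES = {
    "titanic": (891, 12),
    "tips": (244, 7),
    "auto_mpg": (392, 8),
    "student": (649, 33),
}


def shape_matches(name, df, listed=LISTED_SHAPES):
    """Return True if df.shape equals the shape listed for `name`."""
    if name not in listed:
        raise KeyError(f"no listed shape for {name!r}")
    return tuple(df.shape) == listed[name]
```

With datazets installed, `shape_matches('titanic', dz.get('titanic'))` would report whether the loaded data still has the listed shape.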

Example:

# Import library
import datazets as dz

# Import a data set by name
df = dz.get(data='titanic')

# Import a data set from a URL
url = 'https://archive.ics.uci.edu/ml/machine-learning-databases/adult/adult.data'
df = dz.get(url=url, sep=',')
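The `sep=','` argument marks the remote file as comma-delimited. As a rough, standard-library illustration of what that delimiter means, the sketch below splits comma-separated text into fields. It is not how datazets parses the file internally; `parse_rows` is a hypothetical helper and the sample rows are made up.

```python
import csv
import io

# Made-up rows in a comma-plus-space layout (illustrative only).
SAMPLE = "39, Private, 40\n52, Self-employed, 45\n"


def parse_rows(text, sep=","):
    """Split delimited text into rows of string fields."""
    # skipinitialspace drops the blank that follows each delimiter.
    reader = csv.reader(io.StringIO(text), delimiter=sep, skipinitialspace=True)
    return [row for row in reader if row]  # ignore empty lines


rows = parse_rows(SAMPLE)
print(rows[0])
```

A file that uses another delimiter, such as a tab or semicolon, would need a different `sep` value.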

Maintainer

Contribute

  • All kinds of contributions are welcome!
  • If you wish to buy me a coffee for this work, it is much appreciated :)

Licence

See LICENSE for details.

Download files

Source Distribution

datazets-0.2.0.tar.gz (13.7 kB)

Uploaded Source

Built Distribution

datazets-0.2.0-py3-none-any.whl (13.3 kB)

Uploaded Python 3

File details

Details for the file datazets-0.2.0.tar.gz.

File metadata

  • Download URL: datazets-0.2.0.tar.gz
  • Upload date:
  • Size: 13.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.4

File hashes

Hashes for datazets-0.2.0.tar.gz
Algorithm Hash digest
SHA256 3c467c13778e6a0e60b2d7e90a1c0abc13b7c253d2a0bb6cd508e9469052c4dc
MD5 651b571f564f13dbd0b50324f6a02504
BLAKE2b-256 2429ea29416a2f705b415eb9c40fc1b5878d6471ab6797bdc82442bd883973a9
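A downloaded file can be checked against the published SHA256 digest before installing it. The following is a minimal sketch using only the standard library; `sha256_of` and `matches_published` are hypothetical helper names, and the file path is assumed to point at a local copy of the download.

```python
import hashlib


def sha256_of(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published(path, published_hex):
    """Compare a file's SHA256 digest with a published hex digest."""
    return sha256_of(path) == published_hex.strip().lower()
```

For the source distribution, `matches_published('datazets-0.2.0.tar.gz', '3c467c13778e6a0e60b2d7e90a1c0abc13b7c253d2a0bb6cd508e9469052c4dc')` should return `True` for an intact download.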

File details

Details for the file datazets-0.2.0-py3-none-any.whl.

File metadata

  • Download URL: datazets-0.2.0-py3-none-any.whl
  • Upload date:
  • Size: 13.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.4

File hashes

Hashes for datazets-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 a4f67d9c3a370b49c32741a523c9bccec15e0dc63ceb8dde4888ec61784702a4
MD5 6869e2407833275df11389e390dbb5fc
BLAKE2b-256 c339b39c3f00c2d6231971fda3d81b06b02aeda730bd2700a4248c77cf8c03de
