Squirrel public datasets collection

These details have not been verified by PyPI

Development Status
- 5 - Production/Stable
License
- OSI Approved :: Apache Software License
Programming Language
- Python :: 3.8
Typing
- Typed

Project description

Squirrel Datasets Core

What is Squirrel Datasets Core?

squirrel-datasets-core is an extension of the Squirrel library. squirrel-datasets-core is a hub where the user can 1) explore existing datasets registered in the data mesh by other users and 2) preprocess their datasets and share them with other users. As an end user, you will be able to load many publically available datasets with ease and speed with the help of squirrel, or load and preprocess your own datasets with the tools we provide here.

For preprocessing, we currently support Spark as the main tool to carry out the task.

If you have any questions or would like to contribute, join our Slack community!

Installation

Install squirrel-core and squirrel-datasets-core with pip. Note that you can install with different dependencies based on your requirements for squirrel drivers. For using the torchvision driver call:

pip install "squirrel-core[torch]"
pip install "squirrel-datasets-core[torchvision]"

For using the hub driver call:

pip install "squirrel-datasets-core[hub]"

For using the spark preprocessing pipelines call:

pip install "squirrel-datasets-core[preprocessing]"

If you would like to get Squirrel's full functionality, install squirrel-core and squirrel-datasets-core with all their dependencies.

pip install "squirrel-core[all]"
pip install "squirrel-datasets-core[all]"

Documentation

Visit our documentation on Readthedocs.

Contributing

squirrel-datasets-core is open source and community contributions are welcome!

Contributing

squirrel-datasets-core is open source and community contributions are welcome!

Check out the contribution guide to learn how to get involved. Please follow our recommendations for best practices and code style.

The humans behind Squirrel

We are Merantix Momentum, a team of ~30 machine learning engineers, developing machine learning solutions for industry and research. Each project comes with its own challenges, data types and learnings, but one issue we always faced was scalable data loading, transforming and sharing. We were looking for a solution that would allow us to load the data in a fast and cost-efficient way, while keeping the flexibility to work with any possible dataset and integrate with any API. That's why we build Squirrel – and we hope you'll find it as useful as we do! By the way, we are hiring!

Citation

If you use Squirrel Datasets in your research, please cite Squirrel using:

@article{2022squirrelcore,
  title={Squirrel: A Python library that enables ML teams to share, load, and transform data in a collaborative, flexible, and efficient way.},
  author={Squirrel Developer Team},
  journal={GitHub. Note: https://github.com/merantix-momentum/squirrel-core},
  year={2022}
}

Project details

These details have not been verified by PyPI

Development Status
- 5 - Production/Stable
License
- OSI Approved :: Apache Software License
Programming Language
- Python :: 3.8
Typing
- Typed

Release history Release notifications | RSS feed

0.3.1

Mar 27, 2023

0.3.1.dev577398 pre-release

Mar 27, 2023

0.3.1.dev22854 pre-release

May 24, 2023

0.3.0

Mar 15, 2023

0.3.0.dev6891 pre-release

Mar 15, 2023

0.3.0.dev210 pre-release

Mar 15, 2023

0.2.0

Dec 19, 2022

0.2.0.dev875274 pre-release

Dec 19, 2022

0.2.0.dev90727 pre-release

Feb 19, 2023

0.2.0.dev60845 pre-release

Feb 19, 2023

0.2.0.dev29685 pre-release

Feb 20, 2023

0.2.0.dev24580 pre-release

Feb 22, 2023

0.2.0.dev9312 pre-release

Feb 20, 2023

0.2.0.dev121 pre-release

Feb 22, 2023

0.2.0.dev62 pre-release

Feb 27, 2023

0.1.13

Dec 14, 2022

0.1.13.dev77337 pre-release

Dec 13, 2022

0.1.12.dev49528 pre-release

Nov 7, 2022

This version

0.1.12.dev8330 pre-release

Oct 31, 2022

0.1.12.dev6499 pre-release

Nov 4, 2022

0.1.12.dev5456 pre-release

Nov 4, 2022

0.1.12.dev3571 pre-release

Nov 28, 2022

0.1.12.dev942 pre-release

Nov 7, 2022

0.1.12.dev822 pre-release

Dec 1, 2022

0.1.11.dev831291 pre-release

Sep 28, 2022

0.1.11.dev99063 pre-release

Sep 28, 2022

0.1.11.dev6516 pre-release

Sep 21, 2022

0.1.11.dev5825 pre-release

Oct 31, 2022

0.1.11.dev911 pre-release

Sep 28, 2022

0.1.11.dev668 pre-release

Oct 31, 2022

0.1.11.dev182 pre-release

Sep 6, 2022

0.1.10

Sep 1, 2022

0.1.10.dev80070 pre-release

Sep 1, 2022

0.1.9.dev5682747 pre-release

Aug 15, 2022

0.1.9.dev360137 pre-release

Aug 16, 2022

0.1.8

Jul 14, 2022

0.1.8.dev23089 pre-release

Jul 14, 2022

0.1.7

Jun 14, 2022

0.1.7.dev86095 pre-release

Jun 14, 2022

0.1.7.dev45904 pre-release

Jul 7, 2022

0.1.7.dev41186 pre-release

Jun 28, 2022

0.1.7.dev5599 pre-release

Jun 30, 2022

0.1.7.dev53 pre-release

Jun 28, 2022

0.1.7.dev26 pre-release

Jul 5, 2022

0.1.6

Jun 13, 2022

0.1.6.dev791 pre-release

Jun 13, 2022

0.1.5.dev7856 pre-release

Jun 10, 2022

0.1.4.dev5458 pre-release

Jun 9, 2022

0.1.4.dev106 pre-release

Jun 10, 2022

0.1.3.dev18414 pre-release

May 12, 2022

0.1.3.dev7450 pre-release

May 16, 2022

0.1.2

Apr 7, 2022

0.1.2.dev452700 pre-release

May 4, 2022

0.1.2.dev66055 pre-release

May 4, 2022

0.1.2.dev60278 pre-release

Apr 13, 2022

0.1.2.dev297 pre-release

Apr 7, 2022

0.1.2.dev230 pre-release

May 4, 2022

0.1.1

Apr 7, 2022

0.1.1.dev419 pre-release

Apr 7, 2022

0.1.0

Apr 7, 2022

0.1.0.dev76788 pre-release

Apr 7, 2022

0.1.0.dev77 pre-release

Apr 5, 2022

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

squirrel_datasets_core-0.1.12.dev8330.tar.gz (44.6 kB view details)

Uploaded Oct 31, 2022 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

squirrel_datasets_core-0.1.12.dev8330-py3-none-any.whl (65.3 kB view details)

Uploaded Oct 31, 2022 Python 3

File details

Details for the file squirrel_datasets_core-0.1.12.dev8330.tar.gz.

File metadata

Download URL: squirrel_datasets_core-0.1.12.dev8330.tar.gz
Upload date: Oct 31, 2022
Size: 44.6 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.1 CPython/3.9.15

File hashes

Hashes for squirrel_datasets_core-0.1.12.dev8330.tar.gz
Algorithm	Hash digest
SHA256	`75946348b503649d076ed31d6d94316c5b609196f4a13e63d28c7896960b77a8`
MD5	`3226736af719e7c147b444d5ba40fc22`
BLAKE2b-256	`56da0173e02ca4cc6031003c729bbca05fbcada1c5859cbb568b3059f9dd6103`

See more details on using hashes here.

File details

Details for the file squirrel_datasets_core-0.1.12.dev8330-py3-none-any.whl.

File metadata

Download URL: squirrel_datasets_core-0.1.12.dev8330-py3-none-any.whl
Upload date: Oct 31, 2022
Size: 65.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.1 CPython/3.9.15

File hashes

Hashes for squirrel_datasets_core-0.1.12.dev8330-py3-none-any.whl
Algorithm	Hash digest
SHA256	`0655d92e33294331992616f687d39ad876ba0767129ec6e428122227233c89dc`
MD5	`ad3bd29d817b6ec583f5008b73a311f5`
BLAKE2b-256	`85331df7d8e82df8e7ef6d54d45ae8a57274c405821cc5511bd135f13849cc04`

See more details on using hashes here.

squirrel-datasets-core 0.1.12.dev8330

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

Squirrel Datasets Core

What is Squirrel Datasets Core?

Installation

Documentation

Contributing

Contributing

The humans behind Squirrel

Citation

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes