Skip to main content

Unified storage framework for machine learning datasets

Project description

Space: Unified Storage for Machine Learning

Unify data in your entire machine learning lifecycle with Space, a comprehensive storage solution that seamlessly handles data from ingestion to training.

Key Features:

  • Ground Truth Database
    • Store and manage multimodal data in open source file formats, row or columnar, local or in cloud.
    • Ingest from various sources, including ML datasets, files, and labeling tools.
    • Support data manipulation (append, insert, update, delete) and version control.
  • OLAP Database and Lakehouse
  • Distributed Data Processing Pipelines
    • Integrate with processing frameworks like Ray for efficient data transformation.
    • Store processed results as Materialized Views (MVs); incrementally update MVs when the source is changed.
  • Seamless Training Framework Integration
    • Access Space datasets and MVs directly via random access interfaces.
    • Convert to popular ML dataset formats (e.g., TFDS, HuggingFace, Ray).

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

space-datasets-0.0.11.tar.gz (123.9 kB view details)

Uploaded Source

Built Distribution

space_datasets-0.0.11-py3-none-any.whl (180.3 kB view details)

Uploaded Python 3

File details

Details for the file space-datasets-0.0.11.tar.gz.

File metadata

  • Download URL: space-datasets-0.0.11.tar.gz
  • Upload date:
  • Size: 123.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.10.13

File hashes

Hashes for space-datasets-0.0.11.tar.gz
Algorithm Hash digest
SHA256 c2a1fc89291f8e2568e8031f04ec52021fc04bd14295fbd2ac0ac7203ea6ca3b
MD5 eac5f05f6b344c5947608279dec2d7f8
BLAKE2b-256 7fad97087f4729aea00a7e21160552f42ab0ffc95bc2f455d50071a205c4f0bd

See more details on using hashes here.

File details

Details for the file space_datasets-0.0.11-py3-none-any.whl.

File metadata

File hashes

Hashes for space_datasets-0.0.11-py3-none-any.whl
Algorithm Hash digest
SHA256 f78829e6a67247cd5d38d4f4aa50a5872d36381b480e6179fb5a77f26f45b095
MD5 184d938ed48a31ce34fc275e3ff00800
BLAKE2b-256 6f09ea265029725a380f0c21db0d1cfb0fec8c9cfd4976b983d87a92bd955d5a

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page