High-performance, multi-language dataset storage format

Zippy (ZDS) Python Package

High-performance, HuggingFace-compatible dataset storage format.

Installation

pip install zippy-zds

# With optional dependencies (quoting keeps shells such as zsh from expanding the brackets)
pip install "zippy-zds[pandas]"
pip install "zippy-zds[all]"
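
Because pandas is an optional extra, code that relies on the DataFrame helpers may want to check for it first. The following is a minimal sketch using only the standard library; the helper name `has_module` is hypothetical and is not part of zippy.

```python
import importlib.util


def has_module(name: str) -> bool:
    """Return True if a top-level module can be located without importing it."""
    return importlib.util.find_spec(name) is not None


if has_module("pandas"):
    print("pandas is available; the DataFrame helpers can be used")
else:
    print("pandas is missing; install the [pandas] extra first")
```

The check only locates the module, so it stays cheap even when pandas is installed.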

Quick Start

from zippy import ZDSStore, ZDataset, ZIterableDataset

# Create a store
store = ZDSStore.open("./my_dataset", collection="train")

# Add documents
store.put("doc1", {"text": "Hello world", "label": 1})
store.put("doc2", {"text": "Goodbye world", "label": 0})

# Map-style dataset (random access)
dataset = store.to_dataset()
print(dataset[0])  # {"text": "Hello world", "label": 1}
print(len(dataset))  # 2

# Iterable dataset (streaming)
iterable = store.to_iterable_dataset()
for doc in iterable:
    print(doc)

# With shuffle buffer
for doc in iterable.shuffle(buffer_size=1000):
    print(doc)
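
For readers unfamiliar with shuffle buffers, the sketch below shows the general technique in plain Python: hold up to `buffer_size` items in memory, and once the buffer is full, emit a randomly chosen item and put the incoming one in its place. This is an illustration of the common approach, not zippy's actual implementation; the function name `buffered_shuffle` and the `seed` parameter are assumptions made for the example.

```python
import random
from typing import Iterable, Iterator, List, TypeVar

T = TypeVar("T")


def buffered_shuffle(items: Iterable[T], buffer_size: int, seed: int = 0) -> Iterator[T]:
    """Approximately shuffle a stream while holding at most buffer_size items."""
    if buffer_size < 1:
        raise ValueError("buffer_size must be at least 1")
    rng = random.Random(seed)
    buffer: List[T] = []
    for item in items:
        if len(buffer) < buffer_size:
            # Still filling the buffer: nothing is emitted yet.
            buffer.append(item)
            continue
        # Buffer is full: emit a random element and replace it with the new item.
        index = rng.randrange(buffer_size)
        yield buffer[index]
        buffer[index] = item
    # The stream is exhausted: emit whatever remains, in random order.
    rng.shuffle(buffer)
    yield from buffer


shuffled = list(buffered_shuffle(range(10), buffer_size=4))
print(shuffled)
```

A larger buffer gives a result closer to a full shuffle at the cost of memory; a buffer of size 1 leaves the order unchanged.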

DataFrame Integration

from zippy import read_zds, to_zds

# Load as DataFrame (requires pandas)
df = read_zds("./my_dataset", collection="train")

# Export DataFrame to ZDS
to_zds(df, "./output", collection="exported")
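
As a rough mental model, a DataFrame view of a collection treats each document as a row and each field as a column. The sketch below shows that row-to-column reshaping in plain Python. It assumes this mapping for illustration only; the helper `docs_to_columns` is hypothetical and says nothing about how `read_zds` is implemented.

```python
from typing import Any, Dict, List


def docs_to_columns(docs: List[Dict[str, Any]]) -> Dict[str, List[Any]]:
    """Turn a list of row-like documents into a dict of column lists."""
    fields: List[str] = []
    for doc in docs:
        for key in doc:
            if key not in fields:
                fields.append(key)  # keep first-seen field order
    # Missing fields become None so every column has one entry per document.
    return {field: [doc.get(field) for doc in docs] for field in fields}


docs = [
    {"text": "Hello world", "label": 1},
    {"text": "Goodbye world", "label": 0},
]
print(docs_to_columns(docs))
# {'text': ['Hello world', 'Goodbye world'], 'label': [1, 0]}
```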

HuggingFace Compatibility

ZDS datasets are designed to drop into HuggingFace-style training loops, for example through a PyTorch DataLoader:

from torch.utils.data import DataLoader
from zippy import ZIterableDataset

# Open the "train" collection as a streaming dataset
dataset = ZIterableDataset.from_store("./my_dataset", collection="train")

# Works with the PyTorch DataLoader
loader = DataLoader(dataset, batch_size=32)
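
PyTorch's DataLoader accepts two kinds of dataset: map-style objects that implement `__getitem__` and `__len__`, and iterable-style objects that implement `__iter__`. The sketch below mirrors those two protocols with stand-in classes and a simple batching helper, without importing torch or zippy. The names `MapStyle`, `IterableStyle`, and `batches` are hypothetical and exist only for this illustration.

```python
from typing import Any, Dict, Iterable, Iterator, List

Doc = Dict[str, Any]


class MapStyle:
    """Random access: supports len(ds) and ds[i]."""

    def __init__(self, docs: Iterable[Doc]) -> None:
        self._docs = list(docs)

    def __len__(self) -> int:
        return len(self._docs)

    def __getitem__(self, index: int) -> Doc:
        return self._docs[index]


class IterableStyle:
    """Streaming: supports only iteration."""

    def __init__(self, docs: Iterable[Doc]) -> None:
        self._docs = list(docs)

    def __iter__(self) -> Iterator[Doc]:
        return iter(self._docs)


def batches(dataset: Iterable[Doc], batch_size: int) -> Iterator[List[Doc]]:
    """Group consecutive documents into lists of at most batch_size."""
    batch: List[Doc] = []
    for doc in dataset:
        batch.append(doc)
        if len(batch) == batch_size:
            yield batch
            batch = []
    if batch:
        yield batch  # final, possibly smaller, batch


docs = [{"text": f"doc {i}", "label": i % 2} for i in range(5)]
print([len(b) for b in batches(IterableStyle(docs), batch_size=2)])
# [2, 2, 1]
```

The map-style form suits shuffled random access over data that fits an index; the iterable form suits large collections that are read once, front to back.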

Download files

Source Distribution

  • zippy_data-0.1.0.tar.gz (70.9 kB): Source

Built Distributions

  • zippy_data-0.1.0-cp313-cp313-win_amd64.whl (474.3 kB): CPython 3.13, Windows x86-64
  • zippy_data-0.1.0-cp313-cp313-macosx_10_12_x86_64.macosx_11_0_arm64.macosx_10_12_universal2.whl (1.1 MB): CPython 3.13, macOS 10.12+ universal2 (ARM64, x86-64), macOS 10.12+ x86-64, macOS 11.0+ ARM64
  • zippy_data-0.1.0-cp312-cp312-win_amd64.whl (475.2 kB): CPython 3.12, Windows x86-64
  • zippy_data-0.1.0-cp312-cp312-macosx_10_12_x86_64.macosx_11_0_arm64.macosx_10_12_universal2.whl (1.1 MB): CPython 3.12, macOS 10.12+ universal2 (ARM64, x86-64), macOS 10.12+ x86-64, macOS 11.0+ ARM64
  • zippy_data-0.1.0-cp311-cp311-win_amd64.whl (474.5 kB): CPython 3.11, Windows x86-64
  • zippy_data-0.1.0-cp311-cp311-macosx_10_12_x86_64.macosx_11_0_arm64.macosx_10_12_universal2.whl (1.1 MB): CPython 3.11, macOS 10.12+ universal2 (ARM64, x86-64), macOS 10.12+ x86-64, macOS 11.0+ ARM64
  • zippy_data-0.1.0-cp310-cp310-win_amd64.whl (475.0 kB): CPython 3.10, Windows x86-64
  • zippy_data-0.1.0-cp310-cp310-macosx_10_12_x86_64.macosx_11_0_arm64.macosx_10_12_universal2.whl (1.1 MB): CPython 3.10, macOS 10.12+ universal2 (ARM64, x86-64), macOS 10.12+ x86-64, macOS 11.0+ ARM64
  • zippy_data-0.1.0-cp39-cp39-win_amd64.whl (475.9 kB): CPython 3.9, Windows x86-64
  • zippy_data-0.1.0-cp39-cp39-macosx_10_12_x86_64.macosx_11_0_arm64.macosx_10_12_universal2.whl (1.1 MB): CPython 3.9, macOS 10.12+ universal2 (ARM64, x86-64), macOS 10.12+ x86-64, macOS 11.0+ ARM64
  • zippy_data-0.1.0-cp38-cp38-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (648.7 kB): CPython 3.8, manylinux: glibc 2.17+ x86-64

File details

Details for the file zippy_data-0.1.0.tar.gz.

File metadata

  • Download URL: zippy_data-0.1.0.tar.gz
  • Upload date:
  • Size: 70.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for zippy_data-0.1.0.tar.gz
Algorithm Hash digest
SHA256 967d49201a84cc644d6eff2199fcb2b58009532ba0854fe06fe0961f300966c0
MD5 77a526b9ed77145da90445b2aa7d1e15
BLAKE2b-256 660364c98dc5c49bd5bed9d7ec980d7855b848db738f7ae49c144daaf65aebd2
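
A downloaded file can be checked against the SHA256 digest listed above with Python's standard `hashlib` module. The sketch below is one way to do that; the helper names `sha256_of_file` and `matches_expected` are hypothetical, and the local path passed to them is whatever location the file was saved to.

```python
import hashlib

# SHA256 digest listed above for zippy_data-0.1.0.tar.gz
EXPECTED_SHA256 = "967d49201a84cc644d6eff2199fcb2b58009532ba0854fe06fe0961f300966c0"


def sha256_of_file(path: str, chunk_size: int = 1 << 20) -> str:
    """Hash a file in chunks so large files are never fully loaded into memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path: str, expected: str = EXPECTED_SHA256) -> bool:
    """Return True if the file's SHA256 digest equals the expected hex string."""
    return sha256_of_file(path) == expected.lower()
```

For example, `matches_expected("zippy_data-0.1.0.tar.gz")` would return True only if the file in the current directory has the digest shown in the table.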

Provenance

The following attestation bundles were made for zippy_data-0.1.0.tar.gz:

Publisher: release.yml on zippydata/zippy

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.
