High-performance, multi-language dataset storage format

Project description

Zippy (ZDS) Python Package

High-performance, HuggingFace-compatible dataset storage format.

Installation

pip install zippy-data

# With optional dependencies (quoted so the shell does not expand the brackets)
pip install "zippy-data[pandas]"
pip install "zippy-data[all]"

Quick Start

from zippy import ZDSStore, ZDataset, ZIterableDataset

# Create a store
store = ZDSStore.open("./my_dataset", collection="train")

# Add documents
store.put("doc1", {"text": "Hello world", "label": 1})
store.put("doc2", {"text": "Goodbye world", "label": 0})

# Map-style dataset (random access)
dataset = store.to_dataset()
print(dataset[0])  # {"text": "Hello world", "label": 1}
print(len(dataset))  # 2

# Iterable dataset (streaming)
iterable = store.to_iterable_dataset()
for doc in iterable:
    print(doc)

# With shuffle buffer
for doc in iterable.shuffle(buffer_size=1000):
    print(doc)
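
For readers unfamiliar with shuffle buffers, the following is a minimal, generic sketch of the technique in pure Python. It is not taken from the Zippy source and may differ from how `shuffle(buffer_size=...)` is actually implemented; the function name `shuffle_buffer` and its `seed` parameter are hypothetical and exist only for illustration.

```python
import random
from typing import Iterable, Iterator, List, Optional, TypeVar

T = TypeVar("T")


def shuffle_buffer(
    items: Iterable[T], buffer_size: int, seed: Optional[int] = None
) -> Iterator[T]:
    """Yield items in approximately shuffled order using a fixed-size buffer.

    Hypothetical helper for illustration; not part of the zippy API.
    """
    if buffer_size < 1:
        raise ValueError("buffer_size must be >= 1")
    rng = random.Random(seed)
    buffer: List[T] = []
    for item in items:
        if len(buffer) < buffer_size:
            # Fill phase: nothing is emitted until the buffer is full.
            buffer.append(item)
            continue
        # Buffer is full: emit a random element and reuse its slot.
        idx = rng.randrange(buffer_size)
        yield buffer[idx]
        buffer[idx] = item
    # Source exhausted: drain what is left in random order.
    rng.shuffle(buffer)
    yield from buffer
```

A larger buffer gives a better approximation of a full shuffle at the cost of memory; with a buffer of size 1 the original order is preserved.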

DataFrame Integration

from zippy import read_zds, to_zds

# Load as DataFrame (requires pandas)
df = read_zds("./my_dataset", collection="train")

# Export DataFrame to ZDS
to_zds(df, "./output", collection="exported")

HuggingFace Compatibility

A ZIterableDataset can be passed directly to a PyTorch DataLoader, so ZDS datasets fit into HuggingFace-style training loops:

from torch.utils.data import DataLoader
from zippy import ZIterableDataset

dataset = ZIterableDataset.from_store("./my_dataset", collection="train")

# Works with DataLoader
loader = DataLoader(dataset, batch_size=32)

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

zippy_data-0.1.1.tar.gz (71.3 kB)

Uploaded: Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

zippy_data-0.1.1-cp313-cp313-win_amd64.whl (474.3 kB)

Uploaded: CPython 3.13, Windows x86-64

zippy_data-0.1.1-cp313-cp313-macosx_10_12_x86_64.macosx_11_0_arm64.macosx_10_12_universal2.whl (1.1 MB)

Uploaded: CPython 3.13, macOS 10.12+ universal2 (ARM64, x86-64), macOS 10.12+ x86-64, macOS 11.0+ ARM64

zippy_data-0.1.1-cp312-cp312-win_amd64.whl (475.1 kB)

Uploaded: CPython 3.12, Windows x86-64

zippy_data-0.1.1-cp312-cp312-macosx_10_12_x86_64.macosx_11_0_arm64.macosx_10_12_universal2.whl (1.1 MB)

Uploaded: CPython 3.12, macOS 10.12+ universal2 (ARM64, x86-64), macOS 10.12+ x86-64, macOS 11.0+ ARM64

zippy_data-0.1.1-cp311-cp311-win_amd64.whl (474.9 kB)

Uploaded: CPython 3.11, Windows x86-64

zippy_data-0.1.1-cp311-cp311-macosx_10_12_x86_64.macosx_11_0_arm64.macosx_10_12_universal2.whl (1.1 MB)

Uploaded: CPython 3.11, macOS 10.12+ universal2 (ARM64, x86-64), macOS 10.12+ x86-64, macOS 11.0+ ARM64

zippy_data-0.1.1-cp310-cp310-win_amd64.whl (475.0 kB)

Uploaded: CPython 3.10, Windows x86-64

zippy_data-0.1.1-cp310-cp310-macosx_10_12_x86_64.macosx_11_0_arm64.macosx_10_12_universal2.whl (1.1 MB)

Uploaded: CPython 3.10, macOS 10.12+ universal2 (ARM64, x86-64), macOS 10.12+ x86-64, macOS 11.0+ ARM64

zippy_data-0.1.1-cp39-cp39-win_amd64.whl (475.9 kB)

Uploaded: CPython 3.9, Windows x86-64

zippy_data-0.1.1-cp39-cp39-macosx_10_12_x86_64.macosx_11_0_arm64.macosx_10_12_universal2.whl (1.1 MB)

Uploaded: CPython 3.9, macOS 10.12+ universal2 (ARM64, x86-64), macOS 10.12+ x86-64, macOS 11.0+ ARM64

zippy_data-0.1.1-cp38-cp38-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (648.4 kB)

Uploaded: CPython 3.8, manylinux: glibc 2.17+ x86-64
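
The wheel file names above follow the standard layout `{distribution}-{version}(-{build})-{python tag}-{abi tag}-{platform tag}.whl`, where a platform tag may bundle several dot-separated tags. As a rough sketch of how to read them, the snippet below splits a file name into those fields; the `WheelName` type and `parse_wheel_name` function are hypothetical helpers written for this example, not part of any library.

```python
from typing import List, NamedTuple, Optional


class WheelName(NamedTuple):
    distribution: str
    version: str
    build: Optional[str]
    python_tag: str
    abi_tag: str
    platform_tags: List[str]


def parse_wheel_name(filename: str) -> WheelName:
    """Split a wheel file name into its dash-separated components."""
    if not filename.endswith(".whl"):
        raise ValueError(f"not a wheel file name: {filename!r}")
    parts = filename[: -len(".whl")].split("-")
    if len(parts) == 5:
        dist, version, py, abi, plat = parts
        build = None
    elif len(parts) == 6:
        # The optional build tag sits between version and python tag.
        dist, version, build, py, abi, plat = parts
    else:
        raise ValueError(f"unexpected wheel file name: {filename!r}")
    # A compressed platform tag lists alternatives separated by dots.
    return WheelName(dist, version, build, py, abi, plat.split("."))


info = parse_wheel_name(
    "zippy_data-0.1.1-cp38-cp38-manylinux_2_17_x86_64.manylinux2014_x86_64.whl"
)
print(info.python_tag, info.platform_tags)
```

Here `cp38` identifies CPython 3.8 for both the Python and ABI tags, and the two platform tags name the same glibc 2.17+ x86-64 target.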

File details

Details for the file zippy_data-0.1.1.tar.gz.

File metadata

  • Download URL: zippy_data-0.1.1.tar.gz
  • Upload date:
  • Size: 71.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for zippy_data-0.1.1.tar.gz
Algorithm Hash digest
SHA256 8a8a2291ac9743682bef3d72bb9ffae158c6b79c7d7809444b32d5e93604e336
MD5 703312f3c7520f857b336562ece56fd7
BLAKE2b-256 9dc758cfffb1b95ec2ad8dfd4ff4bbd2819ace4171ee9e7589037096e8db2cd9

See more details on using hashes here.
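
One way to use the digests listed on this page is to recompute the SHA256 of a downloaded file and compare it with the published value. The sketch below uses only the standard library; the helper names `sha256_of_file` and `matches_sha256` are hypothetical and chosen for this example.

```python
import hashlib


def sha256_of_file(path: str, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_sha256(path: str, expected_hex: str) -> bool:
    """Compare a file's SHA256 with an expected hex digest (case-insensitive)."""
    return sha256_of_file(path) == expected_hex.strip().lower()
```

For example, after downloading the source distribution into the current directory, `matches_sha256("zippy_data-0.1.1.tar.gz", "8a8a2291ac9743682bef3d72bb9ffae158c6b79c7d7809444b32d5e93604e336")` should return `True` if the file is intact.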

Provenance

The following attestation bundles were made for zippy_data-0.1.1.tar.gz:

Publisher: release.yml on zippydata/zippy

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file zippy_data-0.1.1-cp313-cp313-win_amd64.whl.

File metadata

  • Download URL: zippy_data-0.1.1-cp313-cp313-win_amd64.whl
  • Upload date:
  • Size: 474.3 kB
  • Tags: CPython 3.13, Windows x86-64
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for zippy_data-0.1.1-cp313-cp313-win_amd64.whl
Algorithm Hash digest
SHA256 328a1cbcd39ce7e6f8e2e3076f8b1a88ee2cfd3e20316aaa095788b9afe6ac22
MD5 bc282c33e0b8d4ed2130e302e8a5058c
BLAKE2b-256 585840a346a694c86ec12ad8ff39336589ff439fec64ead68636dd800cd2e380

Provenance

The following attestation bundles were made for zippy_data-0.1.1-cp313-cp313-win_amd64.whl:

Publisher: release.yml on zippydata/zippy

File details

Details for the file zippy_data-0.1.1-cp313-cp313-macosx_10_12_x86_64.macosx_11_0_arm64.macosx_10_12_universal2.whl.

File hashes

Hashes for zippy_data-0.1.1-cp313-cp313-macosx_10_12_x86_64.macosx_11_0_arm64.macosx_10_12_universal2.whl
Algorithm Hash digest
SHA256 5041a5937611fb42a0cd1f2454d85ddca631a55a62197202c0849f556c7bc89b
MD5 742baae9b844653d22f78c0a73befdf2
BLAKE2b-256 f93d0a8a26509c1a847de3f70665cc5ee1df912f85b4b009e2d4d532170ab55f

Provenance

The following attestation bundles were made for zippy_data-0.1.1-cp313-cp313-macosx_10_12_x86_64.macosx_11_0_arm64.macosx_10_12_universal2.whl:

Publisher: release.yml on zippydata/zippy

File details

Details for the file zippy_data-0.1.1-cp312-cp312-win_amd64.whl.

File metadata

  • Download URL: zippy_data-0.1.1-cp312-cp312-win_amd64.whl
  • Upload date:
  • Size: 475.1 kB
  • Tags: CPython 3.12, Windows x86-64
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for zippy_data-0.1.1-cp312-cp312-win_amd64.whl
Algorithm Hash digest
SHA256 67bc9f9cbb73f3b3ed3a4badf37ba44808d8d8a2bbe0e2740886c66f4fdf7570
MD5 0494c4f9f65aeb9b087d8758838a16d6
BLAKE2b-256 265a76ad8b4a5037a897c48a0c29e89132107e7a760c970364593cc0ca080e80

Provenance

The following attestation bundles were made for zippy_data-0.1.1-cp312-cp312-win_amd64.whl:

Publisher: release.yml on zippydata/zippy

File details

Details for the file zippy_data-0.1.1-cp312-cp312-macosx_10_12_x86_64.macosx_11_0_arm64.macosx_10_12_universal2.whl.

File hashes

Hashes for zippy_data-0.1.1-cp312-cp312-macosx_10_12_x86_64.macosx_11_0_arm64.macosx_10_12_universal2.whl
Algorithm Hash digest
SHA256 f24c08b50775ff58f242d2049dd0caccd535cb4c4ad86f9e04c9b1a1b21c98e2
MD5 b9e5b44e42712bc0b137733128c2efe8
BLAKE2b-256 e295f4b4305b83dbd25038e81a4793b3053e539ad194a03da851c0cf87cfd22d

Provenance

The following attestation bundles were made for zippy_data-0.1.1-cp312-cp312-macosx_10_12_x86_64.macosx_11_0_arm64.macosx_10_12_universal2.whl:

Publisher: release.yml on zippydata/zippy

File details

Details for the file zippy_data-0.1.1-cp311-cp311-win_amd64.whl.

File metadata

  • Download URL: zippy_data-0.1.1-cp311-cp311-win_amd64.whl
  • Upload date:
  • Size: 474.9 kB
  • Tags: CPython 3.11, Windows x86-64
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for zippy_data-0.1.1-cp311-cp311-win_amd64.whl
Algorithm Hash digest
SHA256 6e4b50e5bb1696bb5b5b6f6db623aabc84e4bd59dcdfc60b5d98e4bebf057ad7
MD5 e3eac1ba2a17ab1e43c655ff49bfa4a7
BLAKE2b-256 6e9ab249a2e88cb2c7ae68f8711dfeaea93238df1be08fc742fd4e50cc4a75f9

Provenance

The following attestation bundles were made for zippy_data-0.1.1-cp311-cp311-win_amd64.whl:

Publisher: release.yml on zippydata/zippy

File details

Details for the file zippy_data-0.1.1-cp311-cp311-macosx_10_12_x86_64.macosx_11_0_arm64.macosx_10_12_universal2.whl.

File hashes

Hashes for zippy_data-0.1.1-cp311-cp311-macosx_10_12_x86_64.macosx_11_0_arm64.macosx_10_12_universal2.whl
Algorithm Hash digest
SHA256 0df462544e6243cc2697b0585a9d15c4169d6eb45356685925605f610ab09ab9
MD5 6d724993869a2c315e808648d141b68d
BLAKE2b-256 0a67ef7e202aca706f23dfae938f422a63f226021eff9e1c05acfc69f151a55f

Provenance

The following attestation bundles were made for zippy_data-0.1.1-cp311-cp311-macosx_10_12_x86_64.macosx_11_0_arm64.macosx_10_12_universal2.whl:

Publisher: release.yml on zippydata/zippy

File details

Details for the file zippy_data-0.1.1-cp310-cp310-win_amd64.whl.

File metadata

  • Download URL: zippy_data-0.1.1-cp310-cp310-win_amd64.whl
  • Upload date:
  • Size: 475.0 kB
  • Tags: CPython 3.10, Windows x86-64
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for zippy_data-0.1.1-cp310-cp310-win_amd64.whl
Algorithm Hash digest
SHA256 3d387d13355cce6144b03741d3777c64ea7717d258396ee6dbc54d53f9c5a579
MD5 de53036dd93402ad54e038845bcf930b
BLAKE2b-256 45ab68d67c7311516e774242b658bc975681810ed85f92a86e797ce49ea52ad4

Provenance

The following attestation bundles were made for zippy_data-0.1.1-cp310-cp310-win_amd64.whl:

Publisher: release.yml on zippydata/zippy

File details

Details for the file zippy_data-0.1.1-cp310-cp310-macosx_10_12_x86_64.macosx_11_0_arm64.macosx_10_12_universal2.whl.

File hashes

Hashes for zippy_data-0.1.1-cp310-cp310-macosx_10_12_x86_64.macosx_11_0_arm64.macosx_10_12_universal2.whl
Algorithm Hash digest
SHA256 11c46daf33e732714856cb3cfd96fa275c176c7db9936a8c09cb99b6cecf8a63
MD5 c50f32e869659200216819fbff48de6e
BLAKE2b-256 8af949d2a9036209c8d03f2f117f0d625fb29df5f8c2e508b40efb960b618a7e

Provenance

The following attestation bundles were made for zippy_data-0.1.1-cp310-cp310-macosx_10_12_x86_64.macosx_11_0_arm64.macosx_10_12_universal2.whl:

Publisher: release.yml on zippydata/zippy

File details

Details for the file zippy_data-0.1.1-cp39-cp39-win_amd64.whl.

File metadata

  • Download URL: zippy_data-0.1.1-cp39-cp39-win_amd64.whl
  • Upload date:
  • Size: 475.9 kB
  • Tags: CPython 3.9, Windows x86-64
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for zippy_data-0.1.1-cp39-cp39-win_amd64.whl
Algorithm Hash digest
SHA256 428dfa8b1437f92d27d4855293e6d24bb27da24f2f8a85e654d7cb2227962b12
MD5 38c435d8cda6bbeaeec8e54dbd14c5d6
BLAKE2b-256 55f7c55b4e7eff6d23b78bc67dbc2844e2030b6140efa824e34e0b8632273500

Provenance

The following attestation bundles were made for zippy_data-0.1.1-cp39-cp39-win_amd64.whl:

Publisher: release.yml on zippydata/zippy

File details

Details for the file zippy_data-0.1.1-cp39-cp39-macosx_10_12_x86_64.macosx_11_0_arm64.macosx_10_12_universal2.whl.

File hashes

Hashes for zippy_data-0.1.1-cp39-cp39-macosx_10_12_x86_64.macosx_11_0_arm64.macosx_10_12_universal2.whl
Algorithm Hash digest
SHA256 a5dcc2b8cb49eef21feadf8cd41c7ae74b22c93012ba36f296172be5500ddeab
MD5 a8f02e521f67d5da70d856b7473d498c
BLAKE2b-256 b2e02feee25d2eca6c400373b2280f274e3ea97717c8d25c44f54207f1ccef3f

Provenance

The following attestation bundles were made for zippy_data-0.1.1-cp39-cp39-macosx_10_12_x86_64.macosx_11_0_arm64.macosx_10_12_universal2.whl:

Publisher: release.yml on zippydata/zippy

File details

Details for the file zippy_data-0.1.1-cp38-cp38-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File hashes

Hashes for zippy_data-0.1.1-cp38-cp38-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 386412d8e65fdaac3c39ed3391c09911d372aa2acca681d1d8cd545e1e0e8b06
MD5 0211a98eebc353d94342df80393a17b4
BLAKE2b-256 22a35a6bef4b8ceff6213507947e8ed3d61a27ec1c9be09b1dde8e943f58c4fa

Provenance

The following attestation bundles were made for zippy_data-0.1.1-cp38-cp38-manylinux_2_17_x86_64.manylinux2014_x86_64.whl:

Publisher: release.yml on zippydata/zippy
