Skip to main content

DVC's data management subsystem

Project description

PyPI Status Python Version License

Tests Codecov pre-commit Black

Features

  • TODO

Requirements

  • TODO

Installation

You can install DVC data via pip from PyPI:

$ pip install dvc-data

Usage

HashFile

HashFile

Based on dvc-object’s Object, this is an object that has a particular hash that can be used to verify its contents. Similar to git’s ShaFile.

from dvc_data.hashfile import HashFile

obj = HashFile("/path/to/file", fs, HashInfo("md5", "36eba1e1e343279857ea7f69a597324e")

HashFileDB

Based on dvc-object’s ObjectDB, but stores HashFile objects and so is able to verify their contents by their hash_info. Similar to git’s ObjectStore.

from dvc_data.hashfile import HashFileDB

odb = HashFileDB(fs, "/path/to/odb")

Index

Index

A trie-like structure that represents data files and directories.

from dvc_data.index import DataIndex, DataIndexEntry

index = DataIndex()
index[("foo",)] = DataIndexEntry(hash_info=hash_info, meta=meta)

Storage

A mapping that describes where to find data contents for index entries. Can be either ObjectStorage for HashFileDB-based storage or FileStorage for backup-like plain file storage.

index.storage_map[("foo",)] = ObjectStorage(...)

Contributing

Contributions are very welcome. To learn more, see the Contributor Guide.

License

Distributed under the terms of the Apache 2.0 license, DVC data is free and open source software.

Issues

If you encounter any problems, please file an issue along with a detailed description.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dvc_data-3.18.3.tar.gz (84.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

dvc_data-3.18.3-py3-none-any.whl (79.3 kB view details)

Uploaded Python 3

File details

Details for the file dvc_data-3.18.3.tar.gz.

File metadata

  • Download URL: dvc_data-3.18.3.tar.gz
  • Upload date:
  • Size: 84.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for dvc_data-3.18.3.tar.gz
Algorithm Hash digest
SHA256 61c8085a7c5b5e3ecaa86d63c3f22dc7c951a5dc55bd0a1bb5afa15377a25b1a
MD5 16cced9d6c919dc9b74ea177ae815550
BLAKE2b-256 c32fe00e933d10512ca7ec4f3f2f450d915e6bed4c01cf666d50a1a8f9b033ff

See more details on using hashes here.

Provenance

The following attestation bundles were made for dvc_data-3.18.3.tar.gz:

Publisher: release.yml on treeverse/dvc-data

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file dvc_data-3.18.3-py3-none-any.whl.

File metadata

  • Download URL: dvc_data-3.18.3-py3-none-any.whl
  • Upload date:
  • Size: 79.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for dvc_data-3.18.3-py3-none-any.whl
Algorithm Hash digest
SHA256 ab459a033c07ad94ec23ad4e83c14c68b73e7291367629813e751f882b10b4f3
MD5 9d4d289cf06f5fb56f5a888185c9f12f
BLAKE2b-256 65e0695d043a0a2a4e830456e841c4a14be99d31305198fc4feccaeaace0b979

See more details on using hashes here.

Provenance

The following attestation bundles were made for dvc_data-3.18.3-py3-none-any.whl:

Publisher: release.yml on treeverse/dvc-data

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page