Skip to main content

simple library to manage a dataset of shards to train machine learning models

Project description

iden

CI Nightly Tests Nightly Package Tests Codecov
Documentation Documentation
Code style: black Doc style: google Ruff Doc style: google
PYPI version Python BSD-3-Clause
Downloads Monthly downloads

Overview

iden is a simple Python library to manage a dataset of shards when training a machine learning model. iden uses a lazy loading approach to load the shard's data, so it is easy to manage shards without loading their data. iden supports different format to store shards on disk.

Installation

We highly recommend installing a virtual environment. iden can be installed from pip using the following command:

pip install iden

To make the package as slim as possible, only the minimal packages required to use iden are installed. To include all the dependencies, you can use the following command:

pip install iden[all]

Please check the get started page to see how to install only some specific dependencies or other alternatives to install the library. The following is the corresponding iden versions and tested dependencies.

iden coola objectory numpy* pyyaml* safetensors* torch* python
main >=0.8.4,<1.0 >=0.2,<1.0 >=1.24,<2.0 >=6.0,<7.0 >=0.4,<1.0 >=2.0,<3.0 >=3.9,<3.14
0.2.0 >=0.8.4,<1.0 >=0.2,<1.0 >=1.22,<2.0 >=6.0,<7.0 >=0.4,<1.0 >=2.0,<3.0 >=3.9,<3.14
0.1.0 >=0.8.4,<1.0 >=0.2,<1.0 >=1.22,<2.0 >=6.0,<7.0 >=0.4,<1.0 >=2.0,<3.0 >=3.9,<3.14
0.0.4 >=0.3,<1.0 >=0.1,<1.0 >=1.22,<2.0 >=6.0,<7.0 >=0.4,<1.0 >=2.0,<3.0 >=3.9,<3.13
0.0.3 >=0.3,<1.0 >=0.1,<1.0 >=1.22,<2.0 >=6.0,<7.0 >=0.4,<1.0 >=2.0,<3.0 >=3.9,<3.12
0.0.2 >=0.4,<1.0 >=0.1,<1.0 >=1.22,<2.0 >=6.0,<7.0 >=0.4,<1.0 >=2.0,<2.1 >=3.9,<3.12
0.0.1 >=0.4,<1.0 >=0.1,<1.0 >=1.22,<2.0 >=6.0,<7.0 >=0.4,<1.0 >=2.0,<2.1 >=3.9,<3.12
iden cloudpickle* joblib*
main >=3.0,<4.0 >=1.3,<2.0

* indicates an optional dependency

Contributing

Please check the instructions in CONTRIBUTING.md.

Suggestions and Communication

Everyone is welcome to contribute to the community. If you have any questions or suggestions, you can submit Github Issues. We will reply to you as soon as possible. Thank you very much.

API stability

:warning: While iden is in development stage, no API is guaranteed to be stable from one release to the next. In fact, it is very likely that the API will change multiple times before a stable 1.0.0 release. In practice, this means that upgrading iden to a new version will possibly break any code that was using the old version of iden.

License

iden is licensed under BSD 3-Clause "New" or "Revised" license available in LICENSE file.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

iden-0.2.1a0.tar.gz (36.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

iden-0.2.1a0-py3-none-any.whl (83.0 kB view details)

Uploaded Python 3

File details

Details for the file iden-0.2.1a0.tar.gz.

File metadata

  • Download URL: iden-0.2.1a0.tar.gz
  • Upload date:
  • Size: 36.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.9.10 {"installer":{"name":"uv","version":"0.9.10"},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for iden-0.2.1a0.tar.gz
Algorithm Hash digest
SHA256 817ff36812ee2b3c0419d74af35d7c473a37c412d40df42d4db9c75852e1d1a5
MD5 02b2824cb6d4a2f8dac74ede28801960
BLAKE2b-256 a19b27907b5907a1cf2608a4d435bc3b73da45b6253f4017a4ae026dea67c94a

See more details on using hashes here.

File details

Details for the file iden-0.2.1a0-py3-none-any.whl.

File metadata

  • Download URL: iden-0.2.1a0-py3-none-any.whl
  • Upload date:
  • Size: 83.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.9.10 {"installer":{"name":"uv","version":"0.9.10"},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for iden-0.2.1a0-py3-none-any.whl
Algorithm Hash digest
SHA256 59b8836b7c076f03a9272ee58db176f5ee7668b3172cdb37a3e21a5e5df4d8b3
MD5 912310f1814bfded68beb77f05f89dbf
BLAKE2b-256 ebd5ba347a035fffb01365c82cc7dccdd29bcf0470a897f42d902175af128a32

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page