Skip to main content

Grain: A library for loading and transforming data for ML training.

Project description

Grain - Feeding JAX Models

Continuous integration PyPI version

Installation | Quickstart | Reference docs | Change logs

Grain is a Python library for reading and processing data for training and evaluating JAX models. It is flexible, fast and deterministic.

Grain allows to define data processing steps in a simple declarative way:

import grain

dataset = (
    grain.MapDataset.source([0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10])
    .shuffle(seed=42)  # Shuffles elements globally.
    .map(lambda x: x+1)  # Maps each element.
    .batch(batch_size=2)  # Batches consecutive elements.
)

for batch in dataset:
  # Training step.

Grain is designed to work with JAX models but it does not require JAX to run and can be used with other frameworks as well.

Installation

Grain is available on PyPI and can be installed with pip install grain.

Supported platforms

Grain does not directly use GPU or TPU in its transformations, the processing within Grain will be done on the CPU by default.

Linux Mac Windows
x86_64 yes no no
aarch64 yes yes n/a

Quickstart

Citing Grain

To cite this repository:

@software{grain2023github,
  author = {Marvin Ritter and Ihor Indyk and Aayush Singh and Andrew Audibert and Anoosha Seelam and Camelia Hanes and Eric Lau and Jacek Olesiak and Jiyang Kang and Xihui Wu},
  title = {{Grain} - Feeding JAX Models},
  url = {http://github.com/google/grain},
  version = {0.2.12},
  year = {2023},
}

The version number is intended to be that from pyproject.toml, and the year corresponds to the project's open-source release.

Existing users

Grain is used by MaxText, Gemma, kauldron, maxdiffusion and multiple internal Google projects.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

grain_nightly-0.2.13.dev20251013-cp313-cp313-win_amd64.whl (428.7 kB view details)

Uploaded CPython 3.13Windows x86-64

grain_nightly-0.2.13.dev20251013-cp313-cp313-manylinux2014_x86_64.manylinux_2_17_x86_64.whl (510.6 kB view details)

Uploaded CPython 3.13manylinux: glibc 2.17+ x86-64

grain_nightly-0.2.13.dev20251013-cp313-cp313-manylinux2014_aarch64.manylinux_2_17_aarch64.whl (510.1 kB view details)

Uploaded CPython 3.13manylinux: glibc 2.17+ ARM64

grain_nightly-0.2.13.dev20251013-cp313-cp313-macosx_11_0_arm64.whl (437.1 kB view details)

Uploaded CPython 3.13macOS 11.0+ ARM64

grain_nightly-0.2.13.dev20251013-cp312-cp312-win_amd64.whl (428.6 kB view details)

Uploaded CPython 3.12Windows x86-64

grain_nightly-0.2.13.dev20251013-cp312-cp312-manylinux2014_x86_64.manylinux_2_17_x86_64.whl (510.5 kB view details)

Uploaded CPython 3.12manylinux: glibc 2.17+ x86-64

grain_nightly-0.2.13.dev20251013-cp312-cp312-manylinux2014_aarch64.manylinux_2_17_aarch64.whl (510.0 kB view details)

Uploaded CPython 3.12manylinux: glibc 2.17+ ARM64

grain_nightly-0.2.13.dev20251013-cp312-cp312-macosx_11_0_arm64.whl (437.0 kB view details)

Uploaded CPython 3.12macOS 11.0+ ARM64

grain_nightly-0.2.13.dev20251013-cp311-cp311-win_amd64.whl (428.4 kB view details)

Uploaded CPython 3.11Windows x86-64

grain_nightly-0.2.13.dev20251013-cp311-cp311-manylinux2014_x86_64.manylinux_2_17_x86_64.whl (511.6 kB view details)

Uploaded CPython 3.11manylinux: glibc 2.17+ x86-64

grain_nightly-0.2.13.dev20251013-cp311-cp311-manylinux2014_aarch64.manylinux_2_17_aarch64.whl (510.8 kB view details)

Uploaded CPython 3.11manylinux: glibc 2.17+ ARM64

grain_nightly-0.2.13.dev20251013-cp311-cp311-macosx_11_0_arm64.whl (439.7 kB view details)

Uploaded CPython 3.11macOS 11.0+ ARM64

grain_nightly-0.2.13.dev20251013-cp310-cp310-win_amd64.whl (426.7 kB view details)

Uploaded CPython 3.10Windows x86-64

grain_nightly-0.2.13.dev20251013-cp310-cp310-manylinux2014_x86_64.manylinux_2_17_x86_64.whl (510.7 kB view details)

Uploaded CPython 3.10manylinux: glibc 2.17+ x86-64

grain_nightly-0.2.13.dev20251013-cp310-cp310-manylinux2014_aarch64.manylinux_2_17_aarch64.whl (510.1 kB view details)

Uploaded CPython 3.10manylinux: glibc 2.17+ ARM64

grain_nightly-0.2.13.dev20251013-cp310-cp310-macosx_11_0_arm64.whl (438.3 kB view details)

Uploaded CPython 3.10macOS 11.0+ ARM64

File details

Details for the file grain_nightly-0.2.13.dev20251013-cp313-cp313-win_amd64.whl.

File metadata

File hashes

Hashes for grain_nightly-0.2.13.dev20251013-cp313-cp313-win_amd64.whl
Algorithm Hash digest
SHA256 f12ab65bac035e36910c37d15baadb81a8bbc0d034a8bd01fc7d0f894f75da2c
MD5 a55c8a7c225574326ed7f50eecfa3eb2
BLAKE2b-256 a5861d436344504ce4ec60dfcbc7f60f270d7141145c0ab032bc60bfa7c02055

See more details on using hashes here.

File details

Details for the file grain_nightly-0.2.13.dev20251013-cp313-cp313-manylinux2014_x86_64.manylinux_2_17_x86_64.whl.

File metadata

File hashes

Hashes for grain_nightly-0.2.13.dev20251013-cp313-cp313-manylinux2014_x86_64.manylinux_2_17_x86_64.whl
Algorithm Hash digest
SHA256 28f691b1d7a5c67f17874b34f03e3e5b3370f0b52a1d7d3523edbac02a2bf276
MD5 bf7cee53e4600d3a31d0db2fba00c454
BLAKE2b-256 25de5cbcac86314c52994ef0b38befefeac0476936bc31e9560ab87ed7c9b82a

See more details on using hashes here.

File details

Details for the file grain_nightly-0.2.13.dev20251013-cp313-cp313-manylinux2014_aarch64.manylinux_2_17_aarch64.whl.

File metadata

File hashes

Hashes for grain_nightly-0.2.13.dev20251013-cp313-cp313-manylinux2014_aarch64.manylinux_2_17_aarch64.whl
Algorithm Hash digest
SHA256 45c52f17874224b5f77fe1489506131a728f413dee7d8ee6021122f5d50b2677
MD5 c12c2fb3a166822259cd6ddfcb6f0d1b
BLAKE2b-256 2c1e3fe802b7620ef4fd5eec275483aea51c77ceb23fa0317dca682957b2e27e

See more details on using hashes here.

File details

Details for the file grain_nightly-0.2.13.dev20251013-cp313-cp313-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for grain_nightly-0.2.13.dev20251013-cp313-cp313-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 a0bc7fbeed1947c9736b580041f210de27f79475809db3b2b3ab8b3cfd0c08f1
MD5 ce3cb361fce9e80fa23b9a020fd8db20
BLAKE2b-256 80a7131c1d1be5357c032e0993f3e8a7634364cd522fdcb7b8e6eb3efd60f94f

See more details on using hashes here.

File details

Details for the file grain_nightly-0.2.13.dev20251013-cp312-cp312-win_amd64.whl.

File metadata

File hashes

Hashes for grain_nightly-0.2.13.dev20251013-cp312-cp312-win_amd64.whl
Algorithm Hash digest
SHA256 5f065cc95cbd34e699a0bade2c43e0d77cc588ec2f8cf34c2897fc3bea1b7452
MD5 44b82d2d4e188a41fde3601520032c1f
BLAKE2b-256 035e3bacfb68c4cf667519e9f107bed855f712fde8699ec68d2046501945b123

See more details on using hashes here.

File details

Details for the file grain_nightly-0.2.13.dev20251013-cp312-cp312-manylinux2014_x86_64.manylinux_2_17_x86_64.whl.

File metadata

File hashes

Hashes for grain_nightly-0.2.13.dev20251013-cp312-cp312-manylinux2014_x86_64.manylinux_2_17_x86_64.whl
Algorithm Hash digest
SHA256 a88750c4b5fc9392319a9793d579bab6bea2a0376423be35183a2b2ca6bcd769
MD5 293d7766a124731ce35274b353a53543
BLAKE2b-256 5314676d1ba3dfcef16497f39700948404590e8f00206ab99dea71f112c69427

See more details on using hashes here.

File details

Details for the file grain_nightly-0.2.13.dev20251013-cp312-cp312-manylinux2014_aarch64.manylinux_2_17_aarch64.whl.

File metadata

File hashes

Hashes for grain_nightly-0.2.13.dev20251013-cp312-cp312-manylinux2014_aarch64.manylinux_2_17_aarch64.whl
Algorithm Hash digest
SHA256 751a05a20f9888216fd2f34cb6d186721bdec685e43c836dcc78d54074a45222
MD5 4c125f96d1b51de92ed5abf5f49f361c
BLAKE2b-256 3c6dfb4772db8009a9bc00bab7d1cf23eeae1bdfe10ef270e4950c309f6f3965

See more details on using hashes here.

File details

Details for the file grain_nightly-0.2.13.dev20251013-cp312-cp312-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for grain_nightly-0.2.13.dev20251013-cp312-cp312-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 78a7e685ef01ad05466c5d63926f39960a7f0aac7773d61dd02b507e07494f83
MD5 f4998e85639c545eb8b199f06eec27ee
BLAKE2b-256 59e9b93a7fe4f198aa52d1308c92ee2f4e1d69317189c51686dc8f8ec84a52be

See more details on using hashes here.

File details

Details for the file grain_nightly-0.2.13.dev20251013-cp311-cp311-win_amd64.whl.

File metadata

File hashes

Hashes for grain_nightly-0.2.13.dev20251013-cp311-cp311-win_amd64.whl
Algorithm Hash digest
SHA256 47fe42b6a60cef96734506bf67eeeaeae38e9a3b8039cf25d9aa8f7a7c95537b
MD5 9fa914f33078b18f2f1b3059b57a2aa1
BLAKE2b-256 2755a90049324b2959d82afcf48c22a6d1bae5eb09f50043fb225311672652f7

See more details on using hashes here.

File details

Details for the file grain_nightly-0.2.13.dev20251013-cp311-cp311-manylinux2014_x86_64.manylinux_2_17_x86_64.whl.

File metadata

File hashes

Hashes for grain_nightly-0.2.13.dev20251013-cp311-cp311-manylinux2014_x86_64.manylinux_2_17_x86_64.whl
Algorithm Hash digest
SHA256 17e07cc46a92e1d06c85c3fc7c1cbb068e43da0b7f5cfbf662b31dee805d8f54
MD5 41c30ced076f70008a49997b03961cf7
BLAKE2b-256 5a500dd6a8431fbe9e5d5a9c1992016a198bdf4d5469488db4abdee09eed7274

See more details on using hashes here.

File details

Details for the file grain_nightly-0.2.13.dev20251013-cp311-cp311-manylinux2014_aarch64.manylinux_2_17_aarch64.whl.

File metadata

File hashes

Hashes for grain_nightly-0.2.13.dev20251013-cp311-cp311-manylinux2014_aarch64.manylinux_2_17_aarch64.whl
Algorithm Hash digest
SHA256 e204e450b4b1b45e92b58c2270a90a450fc1bc7143840646712f6f4f3875c70a
MD5 d15cd2fb3c16304c7921e084f4e599d2
BLAKE2b-256 af961d076e30b7ddc0330c2a65aacfc173c286f4231c8709966f9b5eff6e7783

See more details on using hashes here.

File details

Details for the file grain_nightly-0.2.13.dev20251013-cp311-cp311-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for grain_nightly-0.2.13.dev20251013-cp311-cp311-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 ce3fe22401a2358fe30aab965666a2b93063c076496dd149adefecbd39ae728b
MD5 5674f320ba9cfac63c71b24a331bfcd5
BLAKE2b-256 032093bc0f71d73d36fa7c360234f8e86ab902de94316dce9346eb98a5ec0c0e

See more details on using hashes here.

File details

Details for the file grain_nightly-0.2.13.dev20251013-cp310-cp310-win_amd64.whl.

File metadata

File hashes

Hashes for grain_nightly-0.2.13.dev20251013-cp310-cp310-win_amd64.whl
Algorithm Hash digest
SHA256 6e62e40a370a6c1af8ae2e34f4e3a4c66a0ede833ac69274940df0cfa67ec7c8
MD5 b29b3144c01d7df72625568e50062949
BLAKE2b-256 e72df48a4ccfee55d3cd09f5c4abaa8bd7da2f41548f17b8c08ce5cd1caec4ce

See more details on using hashes here.

File details

Details for the file grain_nightly-0.2.13.dev20251013-cp310-cp310-manylinux2014_x86_64.manylinux_2_17_x86_64.whl.

File metadata

File hashes

Hashes for grain_nightly-0.2.13.dev20251013-cp310-cp310-manylinux2014_x86_64.manylinux_2_17_x86_64.whl
Algorithm Hash digest
SHA256 d3e46e46cf6bbfcda67a3769cf9aab3518d0e7c6f90e246bcbb04f2e7707c91a
MD5 6ead61485d97a2854de7c3a0509d5328
BLAKE2b-256 dfab891e79a77f4d3831bdeb05f39f1c637d8bb75e9282dd758f29911a96238b

See more details on using hashes here.

File details

Details for the file grain_nightly-0.2.13.dev20251013-cp310-cp310-manylinux2014_aarch64.manylinux_2_17_aarch64.whl.

File metadata

File hashes

Hashes for grain_nightly-0.2.13.dev20251013-cp310-cp310-manylinux2014_aarch64.manylinux_2_17_aarch64.whl
Algorithm Hash digest
SHA256 a049472cf30a4dc6dd0998f2f2553f128a6cb4ce9c06d15e5d7dd3b6807450ec
MD5 fa521accc5c0a9eaeacb7ba2974e59f8
BLAKE2b-256 a5f690365d09b95ae7b8c215584659f79d08a1e844a3c84b8431be38fb008e52

See more details on using hashes here.

File details

Details for the file grain_nightly-0.2.13.dev20251013-cp310-cp310-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for grain_nightly-0.2.13.dev20251013-cp310-cp310-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 2fce9736722b949d6477b02a6471602727a02c15d7e4243615579c13818bc13e
MD5 289096dcfcadb1f70135c9723fd4afc5
BLAKE2b-256 de89fc33f3bff4e3e7b486319dc4dd5d5640de90698ae436f06b9b583298f160

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page