Skip to main content

A high-performance data loading library for JAX applications

Project description

JAX DataLoader

A high-performance data loading library for JAX, designed for efficient data loading and preprocessing in machine learning workflows.

Features

  • Efficient data loading with automatic batching
  • Multi-GPU support with automatic batch distribution
  • Memory management with automatic batch size tuning
  • Support for various data formats (CSV, JSON, Images)
  • Progress tracking and statistics
  • Data caching and prefetching
  • Error handling and recovery

Installation

pip install jax-dataloader

Quick Start

from jax_dataloader import JAXDataLoader, DataLoaderConfig

# Create a DataLoader configuration
config = DataLoaderConfig(
    batch_size=32,
    num_workers=4,
    multi_gpu=True
)

# Load your data
dataloader = JAXDataLoader(
    data_path="path/to/your/data",
    config=config
)

# Iterate over batches
for batch_x, batch_y in dataloader:
    # Process your batch
    ...

Examples

The package includes comprehensive examples demonstrating various features:

# Clone the repository
git clone https://github.com/yourusername/jax-dataloader.git
cd jax-dataloader

# Install example dependencies
pip install -r examples/requirements.txt

# Run the data loading demo
cd examples/data_loading
python demo.py

The examples demonstrate:

  • Loading different data formats (CSV, JSON, Images)
  • Multi-GPU support
  • Memory management
  • Progress tracking
  • Batch size optimization

For more examples and detailed documentation, visit our documentation.

Documentation

For detailed documentation, including API reference and advanced usage examples, visit our documentation.

Contributing

We welcome contributions! Please see our contributing guide for details.

License

This project is licensed under the MIT License - see the LICENSE file for details.


Project Structure:

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

jax-dataloaders-0.1.6.tar.gz (7.8 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

jax_dataloaders-0.1.6-py3-none-any.whl (24.1 kB view details)

Uploaded Python 3

File details

Details for the file jax-dataloaders-0.1.6.tar.gz.

File metadata

  • Download URL: jax-dataloaders-0.1.6.tar.gz
  • Upload date:
  • Size: 7.8 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.12

File hashes

Hashes for jax-dataloaders-0.1.6.tar.gz
Algorithm Hash digest
SHA256 d3ef4b1ac28786ccc9f6b87d28319c24f854f0cf20b92cb84ce2ea04aa998130
MD5 d409baa688192cbf102058e6110bf288
BLAKE2b-256 96c97a18c20bc80d43c090dfd4e3d27077ca46baac58d00ce9123cdc796a7236

See more details on using hashes here.

File details

Details for the file jax_dataloaders-0.1.6-py3-none-any.whl.

File metadata

File hashes

Hashes for jax_dataloaders-0.1.6-py3-none-any.whl
Algorithm Hash digest
SHA256 0103455bbd48a738284781f90c3678d83ad1d11df29b98962fd4c3540ae07331
MD5 f30d8f9e73c8d1e8081beabc0ee2b765
BLAKE2b-256 565b83f320852ce052f768bb57f747b80ce658aee08e7d378f041c47ada82a3e

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page