Skip to main content

A high-performance data loading library for JAX

Project description

JAX DataLoader

A high-performance data loading library for JAX, designed for efficient data loading and preprocessing in machine learning workflows.

Features

  • Efficient data loading with automatic batching
  • Multi-GPU support with automatic batch distribution
  • Memory management with automatic batch size tuning
  • Support for various data formats (CSV, JSON, Images)
  • Progress tracking and statistics
  • Data caching and prefetching
  • Error handling and recovery

Installation

pip install jax-dataloader

Quick Start

from jax_dataloader import JAXDataLoader, DataLoaderConfig

# Create a DataLoader configuration
config = DataLoaderConfig(
    batch_size=32,
    num_workers=4,
    multi_gpu=True
)

# Load your data
dataloader = JAXDataLoader(
    data_path="path/to/your/data",
    config=config
)

# Iterate over batches
for batch_x, batch_y in dataloader:
    # Process your batch
    ...

Examples

The package includes comprehensive examples demonstrating various features:

# Clone the repository
git clone https://github.com/yourusername/jax-dataloader.git
cd jax-dataloader

# Install example dependencies
pip install -r examples/requirements.txt

# Run the data loading demo
cd examples/data_loading
python demo.py

The examples demonstrate:

  • Loading different data formats (CSV, JSON, Images)
  • Multi-GPU support
  • Memory management
  • Progress tracking
  • Batch size optimization

For more examples and detailed documentation, visit our documentation.

Documentation

For detailed documentation, including API reference and advanced usage examples, visit our documentation.

Contributing

We welcome contributions! Please see our contributing guide for details.

License

This project is licensed under the MIT License - see the LICENSE file for details.


Project Structure:

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

jax_dataloaders-0.1.2.tar.gz (186.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

jax_dataloaders-0.1.2-py3-none-any.whl (9.0 kB view details)

Uploaded Python 3

File details

Details for the file jax_dataloaders-0.1.2.tar.gz.

File metadata

  • Download URL: jax_dataloaders-0.1.2.tar.gz
  • Upload date:
  • Size: 186.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.11

File hashes

Hashes for jax_dataloaders-0.1.2.tar.gz
Algorithm Hash digest
SHA256 c6c7adad721ad3d10e5bc31f01e6372a1f538e679a0788931fa4c7951d3b190f
MD5 244f1f2d1ce4b85faf02b06713daae7a
BLAKE2b-256 a866d36bdf04b7e64d4940509455f0c1d738c1975a58bd969cfa50651fb8c174

See more details on using hashes here.

File details

Details for the file jax_dataloaders-0.1.2-py3-none-any.whl.

File metadata

File hashes

Hashes for jax_dataloaders-0.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 f0b103b69f6ac5453b27fe1ec6429d3ba8d97e4eb783cbb03d453dda9476359c
MD5 17b9f820040ead09f94a3c07836d9485
BLAKE2b-256 329a341970c5c04d2eb93aa3da146da54117f32188116168107128cf8d3e9e75

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page