Blended Dataset Loop

This repository contains a simple loop that computes a balanced ordering of dataset indices for training. The resulting ordering keeps the data distribution similar across batches. The loop is implemented in Rust for performance and can be consumed as part of a Python package.
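
To illustrate what a balanced ordering means, here is a minimal Python sketch of one common blending strategy. It is not this package's Rust implementation or its API: the function name `blended_order`, the `weights` parameter, and the greedy "pick the dataset that lags furthest behind its target share" rule are all assumptions made for illustration.

```python
from typing import List, Tuple


def blended_order(weights: List[float], num_samples: int) -> List[Tuple[int, int]]:
    """Return (dataset_id, index_within_dataset) pairs in a balanced order.

    Hypothetical helper for illustration only; not the package's API.
    """
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    normalized = [w / total for w in weights]

    counts = [0] * len(weights)  # samples drawn so far from each dataset
    order: List[Tuple[int, int]] = []
    for step in range(num_samples):
        # Compare each dataset's target count at this step with what it
        # has actually contributed, and pick the one lagging the most.
        target_step = max(step, 1)
        errors = [w * target_step - c for w, c in zip(normalized, counts)]
        chosen = max(range(len(weights)), key=lambda i: errors[i])
        order.append((chosen, counts[chosen]))
        counts[chosen] += 1
    return order


if __name__ == "__main__":
    for dataset_id, sample_index in blended_order([0.5, 0.25, 0.25], 8):
        print(dataset_id, sample_index)
```

With weights 0.5, 0.25, and 0.25 and 8 samples, this sketch draws 4, 2, and 2 samples from the three datasets, interleaved rather than grouped, so any contiguous batch stays close to the target mixture.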

The package uses cffi (rather than, e.g., PyO3) so that one build stays compatible with different Python versions.

Requirements

  • Conda (for Python)
  • Cargo with nightly Rust

Setup

Create a conda environment as follows:

conda create -n blended_dataset_loop python=3.9 -y
conda activate blended_dataset_loop

Set the pinned Rust nightly toolchain for this directory:

rustup override set nightly-2024-02-03

Install the Python dev-dependencies:

pip install 'maturin[patchelf]'
pip install '.[dev]'

Develop

After changing the Rust code, run:

maturin develop

or, for an optimized release build:

maturin develop --release
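
After either command, a quick check like the following can confirm that the build landed in the active environment. This is a sketch that assumes the importable module is named `blended_dataset_loop`, matching the package name; the helper `is_installed` is hypothetical and only looks the module up without calling any of its functions.

```python
import importlib.util


def is_installed(module_name: str = "blended_dataset_loop") -> bool:
    """Return True if the module can be found in the active environment."""
    return importlib.util.find_spec(module_name) is not None


if __name__ == "__main__":
    status = "found" if is_installed() else "missing; run `maturin develop` first"
    print(f"blended_dataset_loop: {status}")
```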

Download files

Source Distribution

  • blended_dataset_loop-0.1.0.tar.gz (8.4 kB)

Built Distributions

  • blended_dataset_loop-0.1.0-py3-none-win_amd64.whl (142.1 kB): Python 3, Windows x86-64
  • blended_dataset_loop-0.1.0-py3-none-win32.whl (137.4 kB): Python 3, Windows x86
  • blended_dataset_loop-0.1.0-py3-none-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (1.1 MB): Python 3, manylinux (glibc 2.17+), x86-64
  • blended_dataset_loop-0.1.0-py3-none-manylinux_2_17_aarch64.manylinux2014_aarch64.whl (1.0 MB): Python 3, manylinux (glibc 2.17+), ARM64
  • blended_dataset_loop-0.1.0-py3-none-macosx_11_0_arm64.whl (239.5 kB): Python 3, macOS 11.0+, ARM64
  • blended_dataset_loop-0.1.0-py3-none-macosx_10_12_x86_64.whl (252.8 kB): Python 3, macOS 10.12+, x86-64
