Rust components for Dolma - Toolkit for pre-processing LLM training data.

Project description

dolma-rust-components

Rust components for Dolma - a toolkit for pre-processing large language model training data.

This package contains the low-level Rust implementations that provide high-performance data processing capabilities for the Dolma toolkit.

Download files

Download the file for your platform.

Source Distribution

dolma_rust_components-1.3.0.dev0.tar.gz (64.8 kB)

Uploaded Source

Built Distribution

dolma_rust_components-1.3.0.dev0-cp312-cp312-macosx_11_0_arm64.whl (8.4 MB)

Uploaded CPython 3.12, macOS 11.0+ ARM64
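
The wheel file name above encodes the interpreter and platform it targets. As a rough sketch of how to read it, the snippet below splits a wheel file name into the fields defined by the standard wheel naming convention (distribution, version, optional build tag, Python tag, ABI tag, platform tag). The helper name parse_wheel_filename is hypothetical and written only for illustration; it is not part of Dolma or of pip.

```python
def parse_wheel_filename(filename):
    """Split a wheel file name into its dash-separated fields.

    Hypothetical helper for illustration; assumes the standard layout
    name-version[-build]-python-abi-platform.whl
    """
    if not filename.endswith(".whl"):
        raise ValueError("not a wheel file name: " + filename)
    parts = filename[: -len(".whl")].split("-")
    if len(parts) == 5:
        name, version, python_tag, abi_tag, platform_tag = parts
        build_tag = None
    elif len(parts) == 6:
        name, version, build_tag, python_tag, abi_tag, platform_tag = parts
    else:
        raise ValueError("unexpected number of fields in: " + filename)
    return {
        "name": name,
        "version": version,
        "build": build_tag,
        "python": python_tag,      # e.g. cp312 = CPython 3.12
        "abi": abi_tag,
        "platform": platform_tag,  # e.g. macosx_11_0_arm64
    }


info = parse_wheel_filename(
    "dolma_rust_components-1.3.0.dev0-cp312-cp312-macosx_11_0_arm64.whl"
)
print(info["python"], info["platform"])
```

For this file the Python tag is cp312 and the platform tag is macosx_11_0_arm64, which matches the upload line above. On other interpreters or platforms, pip would normally fall back to the source distribution.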

File details

Details for the file dolma_rust_components-1.3.0.dev0.tar.gz.

File hashes

Hashes for dolma_rust_components-1.3.0.dev0.tar.gz

Algorithm    Hash digest
SHA256       e671523492ecb9f81eb6920d8b7865d875d7f004a5a224cdab40c60959af7007
MD5          ba62bfc0098c7c332801e9ce4646d3e3
BLAKE2b-256  414c270940af2d571946e0b0604e4c215372729986eccceb7f263fdef358bf1d
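
A downloaded file can be checked against the SHA256 digest listed above. The following is a minimal sketch using only the Python standard library. The function names sha256_of and matches_expected are hypothetical, and the code assumes the archive has already been saved locally under its original file name.

```python
import hashlib

# SHA256 digest listed on this page for the source distribution.
EXPECTED_SHA256 = "e671523492ecb9f81eb6920d8b7865d875d7f004a5a224cdab40c60959af7007"


def sha256_of(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_SHA256):
    """True if the file's SHA256 digest equals the expected hex string."""
    return sha256_of(path) == expected.lower()


if __name__ == "__main__":
    # Assumed local path; adjust to wherever the archive was downloaded.
    path = "dolma_rust_components-1.3.0.dev0.tar.gz"
    try:
        print("OK" if matches_expected(path) else "MISMATCH")
    except FileNotFoundError:
        print("file not found: " + path)
```

The same check applies to the wheel by substituting its file name and its SHA256 digest. pip can also enforce digests at install time when a requirements file pins them with --hash and installation is run with --require-hashes.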

File details

Details for the file dolma_rust_components-1.3.0.dev0-cp312-cp312-macosx_11_0_arm64.whl.

File hashes

Hashes for dolma_rust_components-1.3.0.dev0-cp312-cp312-macosx_11_0_arm64.whl

Algorithm    Hash digest
SHA256       383aa976d130a8f9f66a4c750349a038b1215650bfe1db4a7cc63ae2c11470b0
MD5          f103025d72dd18562cd52b58529cd261
BLAKE2b-256  6657e636b3b24ee0cde368b6fe4e7629c2b8a81a734a5c5282514c98db6811fa
