Skip to main content

Veloxx: High-performance, lightweight Python library for in-memory data processing and analytics. Built on Rust for blazing speed and memory efficiency. Features DataFrames, Series, advanced I/O (CSV, JSON, Parquet), machine learning (linear regression, K-means, logistic regression), time-series analysis, data visualization, and parallel processing. Perfect for data science, analytics, and performance-critical applications where speed and memory usage matter.

Project description

Veloxx Logo

Veloxx: Ultra-High Performance Data Processing & Analytics Library

Crates.io PyPI npm GitHub docs.rs CI License: MIT Documentation


๐Ÿš€ v0.4.0 Released! Major performance overhaul with SIMD acceleration, Pivot, and Outer Join support.

Veloxx is a blazing-fast, ultra-lightweight data processing and analytics library in Rust, with seamless bindings for Python and WebAssembly. Built from the ground up for maximum performance, featuring advanced SIMD acceleration, memory optimization, and parallel processing that often outperforms industry leaders.

๐Ÿ† Performance Highlights

  • SIMD Acceleration: Vectorized aggregation (sum, mean, min, max) now 30-90x faster than scalar implementations.
  • Parallel Processing: Hybrid execution strategy using Rayon for large datasets (>500k rows), achieving near-linear scaling.
  • Optimized I/O: Multi-threaded memory-mapped CSV reading and zero-copy Parquet integration.
  • Lazy Evaluation: Refined Query Optimizer with predicate pushdown for efficient filtering.

โœจ New Features (v0.4.0)

  • Pivot: Reshape DataFrames from long to wide format with aggregation.
  • Outer Join: Full support for Left, Right, Inner, and Outer joins.
  • Deterministic Columns: Refactored internal storage to guarantee consistent column ordering.
  • Python Bindings: Updated PyDataFrame with pivot and outer_join support.

โœจ Project Links

๐Ÿงฉ Core Principles & Design Goals

  • ๐Ÿš€ Performance First: Advanced SIMD, parallel processing, cache-optimized algorithms
  • ๐Ÿชถ Lightweight: Minimal dependencies, optimized memory footprint
  • ๐Ÿฆบ Safety & Reliability: Memory-safe Rust, comprehensive testing
  • ๐Ÿง‘โ€๐Ÿ’ป Developer Experience: Intuitive APIs, excellent documentation
  • ๐Ÿ”ง Production Ready: Zero-warning compilation, extensive benchmarking

๐Ÿšฉ Key Features

Core Data Structures

  • DataFrame and Series for lightning-fast tabular data processing
  • SIMD-optimized operations with AVX2/NEON acceleration
  • Memory-efficient storage with advanced compression

High-Performance Operations

  • ๐Ÿš€ Ultra-fast analytics: filtering, joining, grouping, aggregation, pivoting
  • ๐Ÿ“Š Advanced statistics: correlation, regression, time-series analysis
  • Parallel processing: Multi-threaded execution with work-stealing
  • ๐Ÿงฎ Vectorized math: SIMD-accelerated arithmetic operations

Advanced I/O & Integration

  • ๐Ÿ“‚ Multiple formats: CSV, JSON, Parquet support
  • ๐Ÿ”Œ Database connectivity: SQLite, PostgreSQL, MySQL
  • ๐ŸŒŠ Streaming operations: Memory-efficient large dataset processing
  • โšก Async I/O: Non-blocking file and network operations

Data Quality & ML

  • ๐Ÿงน Data cleaning: Automated outlier detection, validation
  • ๐Ÿค– Machine learning: Linear/logistic regression, clustering, preprocessing
  • ๐Ÿ“ˆ Visualization: Charts, plots, statistical graphics
  • ๐Ÿ” Data profiling: Schema inference, quality metrics

Multi-Language Support

  • ๐Ÿฆ€ Rust: Native, zero-cost abstractions
  • Python: PyO3 bindings with NumPy integration
  • ๐ŸŒ WebAssembly: Browser and Node.js support
  • ๐Ÿ“ฆ Easy installation: Available on crates.io, PyPI, npm

โšก Quick Start

Rust

[dependencies]
veloxx = "0.4.0"
use veloxx::dataframe::DataFrame;
use veloxx::series::Series;

let df = DataFrame::new_from_csv("data.csv")?;
let filtered = df.filter(&your_condition)?;
let grouped = df.group_by(vec!["category"]).agg(vec![("amount", "sum")])?;

Python

import veloxx

df = veloxx.PyDataFrame({"name": veloxx.PySeries("name", ["Alice", "Bob"])})
filtered = df.filter(...)
pivoted = df.pivot(values="score", index=["name"], columns="subject", agg_fn="mean")

JavaScript/Wasm

const veloxx = require("veloxx");
const df = new veloxx.WasmDataFrame({name: ["Alice", "Bob"]});
const filtered = df.filter(...);

๐Ÿ› ๏ธ Feature Flags

Enable only what you need:

  • advanced_io โ€“ Parquet, databases, async
  • data_quality โ€“ Schema checks, anomaly detection
  • window_functions โ€“ Window analytics
  • visualization โ€“ Charting
  • ml โ€“ Machine learning
  • python โ€“ Python bindings
  • wasm โ€“ WebAssembly

๐Ÿ“š Documentation

๐Ÿง‘โ€๐Ÿ’ป Examples

Run ready-made examples:

cargo run --example basic_dataframe_operations
cargo run --example advanced_io --features advanced_io
# ... more in the examples/ folder

๐Ÿค Contributing

See CONTRIBUTING.md for guidelines. Please review our Code of Conduct.

๐Ÿ’ฌ Support

๐Ÿ“ License

MIT License. See LICENSE.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

veloxx-0.4.0.tar.gz (2.1 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

veloxx-0.4.0-cp313-cp313-win_amd64.whl (1.7 MB view details)

Uploaded CPython 3.13Windows x86-64

File details

Details for the file veloxx-0.4.0.tar.gz.

File metadata

  • Download URL: veloxx-0.4.0.tar.gz
  • Upload date:
  • Size: 2.1 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: maturin/1.10.2

File hashes

Hashes for veloxx-0.4.0.tar.gz
Algorithm Hash digest
SHA256 d4d6788717ca878a4e0931e93cb9dd56a79b6792e62e30002c50aeca3f192ed5
MD5 158aa595204e1dff5a23f0cd6aa38aec
BLAKE2b-256 2d40d370028b60ecc6851ac4e0f0301005372e76a86a9a8b0c77fd8ad0577681

See more details on using hashes here.

File details

Details for the file veloxx-0.4.0-cp313-cp313-win_amd64.whl.

File metadata

  • Download URL: veloxx-0.4.0-cp313-cp313-win_amd64.whl
  • Upload date:
  • Size: 1.7 MB
  • Tags: CPython 3.13, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: maturin/1.10.2

File hashes

Hashes for veloxx-0.4.0-cp313-cp313-win_amd64.whl
Algorithm Hash digest
SHA256 6de0ad753b8b6f3b46671785d45f00427f14ef787d1870a9bc2517a8a8cd4358
MD5 69307eda485f1380bf31ae5c87dcbe98
BLAKE2b-256 5c79e76b2907919218072639136f5b32a9fb0de0ada11fe2f9ee095549836d3b

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page