Skip to main content

Veloxx: High-performance, lightweight Python library for in-memory data processing and analytics. Built on Rust for blazing speed and memory efficiency. Features DataFrames, Series, advanced I/O (CSV, JSON, Parquet), machine learning (linear regression, K-means, logistic regression), time-series analysis, data visualization, and parallel processing. Perfect for data science, analytics, and performance-critical applications where speed and memory usage matter.

Project description

Veloxx Logo

Veloxx: Ultra-High Performance Data Processing & Analytics Library

Crates.io PyPI npm GitHub Documentation


🚀 v0.3.1 Released! Major performance breakthroughs with industry-leading SIMD optimizations and comprehensive feature set.

Veloxx is a blazing-fast, ultra-lightweight data processing and analytics library in Rust, with seamless bindings for Python and WebAssembly. Built from the ground up for maximum performance, featuring advanced SIMD acceleration, memory optimization, and parallel processing that often outperforms industry leaders.

🏆 Performance Highlights

Parallel median, quantile & percentile calculation: Now uses Rayon for fast computation on large datasets 25.9x faster group-by operations: 1,466.3M rows/sec 172x faster filtering: 538.3M elements/sec
2-12x faster joins: 400,000M rows/sec Industry-leading I/O: CSV 93,066K rows/sec, JSON 8,722K objects/sec Advanced SIMD: 2,489.4M rows/sec query processing Memory optimized: 422.1MB/s compression, 13.8M allocs/sec


✨ Project Links

🧩 Core Principles & Design Goals

  • 🚀 Performance First: Advanced SIMD, parallel processing, cache-optimized algorithms
  • 🪶 Lightweight: Minimal dependencies, optimized memory footprint
  • 🦺 Safety & Reliability: Memory-safe Rust, comprehensive testing
  • 🧑‍💻 Developer Experience: Intuitive APIs, excellent documentation
  • 🔧 Production Ready: Zero-warning compilation, extensive benchmarking

🚩 Key Features

Core Data Structures

  • DataFrame and Series for lightning-fast tabular data processing
  • SIMD-optimized operations with AVX2/NEON acceleration
  • Memory-efficient storage with advanced compression

High-Performance Operations

  • 🚀 Ultra-fast analytics: filtering, joining, grouping, aggregation
  • 📊 Advanced statistics: correlation, regression, time-series analysis
  • Parallel processing: Multi-threaded execution with work-stealing
  • 🧮 Vectorized math: SIMD-accelerated arithmetic operations

Advanced I/O & Integration

  • 📂 Multiple formats: CSV, JSON, Parquet support
  • 🔌 Database connectivity: SQLite, PostgreSQL, MySQL
  • 🌊 Streaming operations: Memory-efficient large dataset processing
  • Async I/O: Non-blocking file and network operations

Data Quality & ML

  • 🧹 Data cleaning: Automated outlier detection, validation
  • 🤖 Machine learning: Linear/logistic regression, clustering, preprocessing
  • 📈 Visualization: Charts, plots, statistical graphics
  • 🔍 Data profiling: Schema inference, quality metrics

Multi-Language Support

  • 🦀 Rust: Native, zero-cost abstractions
  • Python: PyO3 bindings with NumPy integration
  • 🌐 WebAssembly: Browser and Node.js support
  • 📦 Easy installation: Available on crates.io, PyPI, npm

⚡ Quick Start

Rust

[dependencies]
veloxx = "0.3.1"
use veloxx::dataframe::DataFrame;
use veloxx::series::Series;

let df = DataFrame::new_from_csv("data.csv")?;
let filtered = df.filter(&your_condition)?;
let grouped = df.group_by(vec!["category"]).agg(vec![("amount", "sum")])?;

Python

import veloxx

df = veloxx.PyDataFrame({"name": veloxx.PySeries("name", ["Alice", "Bob"])})
filtered = df.filter([...])

JavaScript/Wasm

const veloxx = require("veloxx");
const df = new veloxx.WasmDataFrame({name: ["Alice", "Bob"]});
const filtered = df.filter(...);

🛠️ Feature Flags

Enable only what you need:

  • advanced_io – Parquet, databases, async
  • data_quality – Schema checks, anomaly detection
  • window_functions – Window analytics
  • visualization – Charting
  • ml – Machine learning
  • python – Python bindings
  • wasm – WebAssembly

📚 Documentation

🧑‍💻 Examples

Run ready-made examples:

cargo run --example basic_dataframe_operations
cargo run --example advanced_io --features advanced_io
# ... more in the examples/ folder

🤝 Contributing

See CONTRIBUTING.md for guidelines.

📝 License

MIT License. See LICENSE.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

veloxx-0.3.2.tar.gz (2.1 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

veloxx-0.3.2-cp313-cp313-win_amd64.whl (1.6 MB view details)

Uploaded CPython 3.13Windows x86-64

File details

Details for the file veloxx-0.3.2.tar.gz.

File metadata

  • Download URL: veloxx-0.3.2.tar.gz
  • Upload date:
  • Size: 2.1 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: maturin/1.9.3

File hashes

Hashes for veloxx-0.3.2.tar.gz
Algorithm Hash digest
SHA256 af4b972df6dce9d63317664e23d1e6dfdfe20472f809951280aac530a829f0c9
MD5 146c73ef2cddb6524dccc3e400068701
BLAKE2b-256 359f086260d138a7dd393170aa35dcd822b2a67975c294b52f10b73d455a0117

See more details on using hashes here.

File details

Details for the file veloxx-0.3.2-cp313-cp313-win_amd64.whl.

File metadata

  • Download URL: veloxx-0.3.2-cp313-cp313-win_amd64.whl
  • Upload date:
  • Size: 1.6 MB
  • Tags: CPython 3.13, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: maturin/1.9.3

File hashes

Hashes for veloxx-0.3.2-cp313-cp313-win_amd64.whl
Algorithm Hash digest
SHA256 df8539dc755ccf19112ffab80421c27ffcfd9a7f3edfcacb170afa03b244921e
MD5 4329deca00ff47bcdb82f11ac57772ca
BLAKE2b-256 6dcdb322867522c0bcd419c2a0b5149b0ce43b69a3bb2e56f4220a9009f6013c

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page