Veloxx: High-performance, lightweight Python library for in-memory data processing and analytics. Built on Rust for blazing speed and memory efficiency. Features DataFrames, Series, advanced I/O (CSV, JSON, Parquet), machine learning (linear regression, K-means, logistic regression), time-series analysis, data visualization, and parallel processing. Perfect for data science, analytics, and performance-critical applications where speed and memory usage matter.
Project description
Veloxx: Ultra-High Performance Data Processing & Analytics Library
🚀 v0.3.1 Released! Major performance breakthroughs with industry-leading SIMD optimizations and comprehensive feature set.
Veloxx is a blazing-fast, ultra-lightweight data processing and analytics library in Rust, with seamless bindings for Python and WebAssembly. Built from the ground up for maximum performance, featuring advanced SIMD acceleration, memory optimization, and parallel processing that often outperforms industry leaders.
🏆 Performance Highlights
Parallel median, quantile & percentile calculation: Now uses Rayon for fast computation on large datasets
25.9x faster group-by operations: 1,466.3M rows/sec
172x faster filtering: 538.3M elements/sec
2-12x faster joins: 400,000M rows/sec
Industry-leading I/O: CSV 93,066K rows/sec, JSON 8,722K objects/sec
Advanced SIMD: 2,489.4M rows/sec query processing
Memory optimized: 422.1MB/s compression, 13.8M allocs/sec
✨ Project Links
- 🦀 Rust crate (crates.io)
- 🐍 Python package (PyPI)
- 📦 JavaScript package (npm)
- 🌐 GitHub
- 📖 Online Documentation
🧩 Core Principles & Design Goals
- 🚀 Performance First: Advanced SIMD, parallel processing, cache-optimized algorithms
- 🪶 Lightweight: Minimal dependencies, optimized memory footprint
- 🦺 Safety & Reliability: Memory-safe Rust, comprehensive testing
- 🧑💻 Developer Experience: Intuitive APIs, excellent documentation
- 🔧 Production Ready: Zero-warning compilation, extensive benchmarking
🚩 Key Features
Core Data Structures
- DataFrame and Series for lightning-fast tabular data processing
- SIMD-optimized operations with AVX2/NEON acceleration
- Memory-efficient storage with advanced compression
High-Performance Operations
- 🚀 Ultra-fast analytics: filtering, joining, grouping, aggregation
- 📊 Advanced statistics: correlation, regression, time-series analysis
- � Parallel processing: Multi-threaded execution with work-stealing
- 🧮 Vectorized math: SIMD-accelerated arithmetic operations
Advanced I/O & Integration
- 📂 Multiple formats: CSV, JSON, Parquet support
- 🔌 Database connectivity: SQLite, PostgreSQL, MySQL
- 🌊 Streaming operations: Memory-efficient large dataset processing
- ⚡ Async I/O: Non-blocking file and network operations
Data Quality & ML
- 🧹 Data cleaning: Automated outlier detection, validation
- 🤖 Machine learning: Linear/logistic regression, clustering, preprocessing
- 📈 Visualization: Charts, plots, statistical graphics
- 🔍 Data profiling: Schema inference, quality metrics
Multi-Language Support
- 🦀 Rust: Native, zero-cost abstractions
- � Python: PyO3 bindings with NumPy integration
- 🌐 WebAssembly: Browser and Node.js support
- 📦 Easy installation: Available on crates.io, PyPI, npm
⚡ Quick Start
Rust
[dependencies]
veloxx = "0.3.1"
use veloxx::dataframe::DataFrame;
use veloxx::series::Series;
let df = DataFrame::new_from_csv("data.csv")?;
let filtered = df.filter(&your_condition)?;
let grouped = df.group_by(vec!["category"]).agg(vec![("amount", "sum")])?;
Python
import veloxx
df = veloxx.PyDataFrame({"name": veloxx.PySeries("name", ["Alice", "Bob"])})
filtered = df.filter([...])
JavaScript/Wasm
const veloxx = require("veloxx");
const df = new veloxx.WasmDataFrame({name: ["Alice", "Bob"]});
const filtered = df.filter(...);
🛠️ Feature Flags
Enable only what you need:
advanced_io– Parquet, databases, asyncdata_quality– Schema checks, anomaly detectionwindow_functions– Window analyticsvisualization– Chartingml– Machine learningpython– Python bindingswasm– WebAssembly
📚 Documentation
🧑💻 Examples
Run ready-made examples:
cargo run --example basic_dataframe_operations
cargo run --example advanced_io --features advanced_io
# ... more in the examples/ folder
🤝 Contributing
See CONTRIBUTING.md for guidelines.
📝 License
MIT License. See LICENSE.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file veloxx-0.3.2.tar.gz.
File metadata
- Download URL: veloxx-0.3.2.tar.gz
- Upload date:
- Size: 2.1 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: maturin/1.9.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
af4b972df6dce9d63317664e23d1e6dfdfe20472f809951280aac530a829f0c9
|
|
| MD5 |
146c73ef2cddb6524dccc3e400068701
|
|
| BLAKE2b-256 |
359f086260d138a7dd393170aa35dcd822b2a67975c294b52f10b73d455a0117
|
File details
Details for the file veloxx-0.3.2-cp313-cp313-win_amd64.whl.
File metadata
- Download URL: veloxx-0.3.2-cp313-cp313-win_amd64.whl
- Upload date:
- Size: 1.6 MB
- Tags: CPython 3.13, Windows x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: maturin/1.9.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
df8539dc755ccf19112ffab80421c27ffcfd9a7f3edfcacb170afa03b244921e
|
|
| MD5 |
4329deca00ff47bcdb82f11ac57772ca
|
|
| BLAKE2b-256 |
6dcdb322867522c0bcd419c2a0b5149b0ce43b69a3bb2e56f4220a9009f6013c
|