KORE Binary Format - Complete 8-language ecosystem for efficient data storage and querying
Project description
🚀 Kore — Killer Optimized Record Exchange
The fastest, most compressed columnar format for big data | v0.1.0
KORE is a high-performance binary file format optimized for analytical workloads. It provides:
- 38% compression ratio (vs 63% for Parquet)
- 131x query speedup with column pruning & predicate pushdown
- Zero data loss verification (400K+ cells tested)
- Native Spark integration — read/write with PySpark
Quick Start
Rust Library
Add this crate as a dependency (when published) or include from path:
use kore_fileformat::*;
// Write data
kore_write_simple("output.kore", schema_json, data_json)?;
// Read data
let data = kore_read_simple("output.kore")?;
// Read specific column
let col = kore_read_col_simple("output.kore", "column_name")?;
// Get file info
let info = kore_info_simple("output.kore")?;
PySpark Integration ⭐ NEW
from pyspark.sql import SparkSession
from kore import KoreDataFrameReader, KoreDataFrameWriter
spark = SparkSession.builder.appName("KoreExample").getOrCreate()
# Read Kore file
df = KoreDataFrameReader(spark).load("data.kore")
# Write to Kore (38% compression!)
KoreDataFrameWriter(df).mode("overwrite").save("output.kore")
# Spark SQL support (3.5+)
spark.read.format("kore").load("file.kore").show()
See python/README.md for full PySpark documentation.
Publishing checklist
- Ensure
Cargo.tomlmetadata is correct (authors, repository, keywords). - Add
LICENSEfile if required (MIT by default here). - Replace any
unimplemented!()stubs with full implementations if you need runtime functionality. - Run
cargo build --releaseandcargo testto verify compilation and tests. - Optionally add CI configuration (GitHub Actions) for
cargo testandcargo clippy.
Notes
This workspace contains copies of the original KORE source files. Some long implementations were stubbed out in this initial export; if you want the full original source code included verbatim, I can replace the stubs with the complete implementations from the upstream project files.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file kore_fileformat-0.1.0.tar.gz.
File metadata
- Download URL: kore_fileformat-0.1.0.tar.gz
- Upload date:
- Size: 4.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
985d7413fc37fa66dd930e39b6db1a312b836f005493b10d299df147c4c54f66
|
|
| MD5 |
5ebcdae54a3a2175a716a52412fdc94f
|
|
| BLAKE2b-256 |
b0a213096f90309c98f6c0c83405e5047a0043d8ec7c4817ee41d815a2f66232
|
Provenance
The following attestation bundles were made for kore_fileformat-0.1.0.tar.gz:
Publisher:
publish-pypi.yml on arunkatherashala/Kore
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
kore_fileformat-0.1.0.tar.gz -
Subject digest:
985d7413fc37fa66dd930e39b6db1a312b836f005493b10d299df147c4c54f66 - Sigstore transparency entry: 1486135926
- Sigstore integration time:
-
Permalink:
arunkatherashala/Kore@76f130c5f362e786ae314e317e20acab2d7a6bfe -
Branch / Tag:
refs/heads/main - Owner: https://github.com/arunkatherashala
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish-pypi.yml@76f130c5f362e786ae314e317e20acab2d7a6bfe -
Trigger Event:
workflow_dispatch
-
Statement type:
File details
Details for the file kore_fileformat-0.1.0-py3-none-any.whl.
File metadata
- Download URL: kore_fileformat-0.1.0-py3-none-any.whl
- Upload date:
- Size: 4.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
b181e8a9860bd14bde30cd85525e70636608a956bb6f8d69c6bc4b943288e083
|
|
| MD5 |
76bef2bbadeaa6aa44e7f39ae19125b6
|
|
| BLAKE2b-256 |
6881daeae1ae7489ba658874c390a9424705f772a29a300e248d2cc728cf04d6
|
Provenance
The following attestation bundles were made for kore_fileformat-0.1.0-py3-none-any.whl:
Publisher:
publish-pypi.yml on arunkatherashala/Kore
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
kore_fileformat-0.1.0-py3-none-any.whl -
Subject digest:
b181e8a9860bd14bde30cd85525e70636608a956bb6f8d69c6bc4b943288e083 - Sigstore transparency entry: 1486135951
- Sigstore integration time:
-
Permalink:
arunkatherashala/Kore@76f130c5f362e786ae314e317e20acab2d7a6bfe -
Branch / Tag:
refs/heads/main - Owner: https://github.com/arunkatherashala
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish-pypi.yml@76f130c5f362e786ae314e317e20acab2d7a6bfe -
Trigger Event:
workflow_dispatch
-
Statement type: