Skip to main content

High-performance bitemporal data processing for Python

Project description

PyTemporal

High-performance bitemporal data processing for Python

PyTemporal is a Rust-powered library for processing bitemporal timeseries data with world-class performance (157,000+ rows/second). Perfect for financial services, audit systems, and applications requiring immutable data trails with both business and system time dimensions.

Quick Start

# Install from source
git clone <your-repo>
cd pytemporal
uv run maturin develop --release
import pandas as pd
from pytemporal import BitemporalTimeseriesProcessor

# Initialize processor
processor = BitemporalTimeseriesProcessor(
    id_columns=['id'],
    value_columns=['price']
)

# Process temporal updates
result = processor.process_updates(
    current_state=current_df,
    updates=updates_df, 
    system_date='2025-01-27'
)

print(f"Updated {len(result.to_insert)} records")

Key Features

  • ๐Ÿš€ World-Class Performance: 157,000+ rows/second throughput
  • ๐Ÿ”„ Bitemporal Processing: Track both business time and system time
  • ๐Ÿ Python-First: High-level DataFrame API with pandas integration
  • โšก Zero-Copy: Apache Arrow columnar format for memory efficiency
  • ๐Ÿ”ง Flexible Schema: Configure ID and value columns dynamically
  • ๐ŸŽฏ Two Update Modes: Delta updates or full state replacement
  • ๐Ÿ”€ Smart Conflation: Optional merging of consecutive records with identical values
  • ๐Ÿ—๏ธ Production Ready: Comprehensive test coverage and clean architecture

Documentation

What is Bitemporal Data?

Bitemporal data tracks two time dimensions:

  • Effective Time: When events occurred in the real world
  • As-Of Time: When information was recorded in the system

This enables powerful queries like "What did we think the price was on Jan 15th, as of Jan 20th?"

Use Cases

  • Financial Services: Price histories, portfolio valuations, risk calculations
  • Audit Systems: Immutable change tracking with full reconstruction capability
  • Regulatory Compliance: Time-accurate reporting for compliance requirements
  • Data Warehousing: Slowly changing dimensions with full history preservation

Performance

Dataset Size Processing Time Throughput Memory
800k ร— 80 cols 5.4 seconds 157k rows/sec ~14GB
100k ร— 20 cols 0.6 seconds 167k rows/sec ~2GB

Benchmarked on modern hardware with optimized settings

Architecture

โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”    โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”    โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
โ”‚ Python DataFrameโ”‚โ”€โ”€โ”€โ–ถโ”‚ PyTemporal (Rust)โ”‚โ”€โ”€โ”€โ–ถโ”‚ Processed Resultsโ”‚
โ”‚ (Pandas)        โ”‚    โ”‚ โ€ข Arrow Columnar โ”‚    โ”‚ (DataFrame)     โ”‚  
โ”‚                 โ”‚    โ”‚ โ€ข Parallel Proc  โ”‚    โ”‚                 โ”‚
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜    โ”‚ โ€ข Timeline Logic โ”‚    โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜
                       โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜

Built With:

  • Rust: Core processing engine for maximum performance
  • Apache Arrow: Columnar data format for zero-copy operations
  • PyO3: Seamless Rust-Python integration
  • Rayon: Data parallelism for multi-core performance

Development

# Run tests
cargo test                                    # Rust tests
uv run python -m pytest tests/ -v           # Python tests

# Performance benchmarks  
cargo bench                                  # Detailed benchmarks
uv run python validate_refactoring.py       # End-to-end validation

# Build release
uv run maturin develop --release

Contributing

  1. Fork the repository
  2. Create a feature branch: git checkout -b feature-name
  3. Make changes and add tests
  4. Run the test suite: cargo test && uv run pytest
  5. Submit a pull request

License

This project is licensed under either of

at your option.

Contribution

Unless you explicitly state otherwise, any contribution intentionally submitted for inclusion in PyTemporal by you, as defined in the Apache-2.0 license, shall be dual licensed as above, without any additional terms or conditions.

Acknowledgments

Built with modern Rust performance engineering and extensive profiling to achieve world-class bitemporal processing speeds while maintaining clean, maintainable code.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

pytemporal-1.4.14-cp312-cp312-manylinux_2_34_x86_64.whl (2.5 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.34+ x86-64

pytemporal-1.4.14-cp311-cp311-manylinux_2_34_x86_64.whl (2.5 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.34+ x86-64

pytemporal-1.4.14-cp310-cp310-manylinux_2_34_x86_64.whl (2.5 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.34+ x86-64

pytemporal-1.4.14-cp39-cp39-manylinux_2_34_x86_64.whl (2.5 MB view details)

Uploaded CPython 3.9manylinux: glibc 2.34+ x86-64

File details

Details for the file pytemporal-1.4.14-cp312-cp312-manylinux_2_34_x86_64.whl.

File metadata

File hashes

Hashes for pytemporal-1.4.14-cp312-cp312-manylinux_2_34_x86_64.whl
Algorithm Hash digest
SHA256 ae83f38975f37a5e1d9f41878535bb6c1697c704bb9b3c1ff7bd11cbcb9a305e
MD5 e1ec02fcdd81115c327aee8262f32358
BLAKE2b-256 832ec4e8b3a6b119940b7fc0a62b636c2b164659dfa65614551fda3a43944be4

See more details on using hashes here.

Provenance

The following attestation bundles were made for pytemporal-1.4.14-cp312-cp312-manylinux_2_34_x86_64.whl:

Publisher: build-wheels.yml on gingermike/pytemporal

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file pytemporal-1.4.14-cp311-cp311-manylinux_2_34_x86_64.whl.

File metadata

File hashes

Hashes for pytemporal-1.4.14-cp311-cp311-manylinux_2_34_x86_64.whl
Algorithm Hash digest
SHA256 35cddb5ae3b70d496aae077aaef829c71027e8ccc962e6c45f1eac71c4a39c8a
MD5 d8c9e00793f539108c2d062f09dc801d
BLAKE2b-256 09ad584f30233a781cbd2196fffb8879315672665322c6ee8279f430bb56e7a6

See more details on using hashes here.

Provenance

The following attestation bundles were made for pytemporal-1.4.14-cp311-cp311-manylinux_2_34_x86_64.whl:

Publisher: build-wheels.yml on gingermike/pytemporal

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file pytemporal-1.4.14-cp310-cp310-manylinux_2_34_x86_64.whl.

File metadata

File hashes

Hashes for pytemporal-1.4.14-cp310-cp310-manylinux_2_34_x86_64.whl
Algorithm Hash digest
SHA256 fcbb52c955f4a9dfc1b1d0aeaf4ea6d30236fa746cc709acb651b9c122e1c24b
MD5 38c23c3fe454f9671242e362d64e0ab9
BLAKE2b-256 c0926d16f794d79db1a4b113dfe446ab47dc3f423a53bba393a90c6a280c5871

See more details on using hashes here.

Provenance

The following attestation bundles were made for pytemporal-1.4.14-cp310-cp310-manylinux_2_34_x86_64.whl:

Publisher: build-wheels.yml on gingermike/pytemporal

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file pytemporal-1.4.14-cp39-cp39-manylinux_2_34_x86_64.whl.

File metadata

File hashes

Hashes for pytemporal-1.4.14-cp39-cp39-manylinux_2_34_x86_64.whl
Algorithm Hash digest
SHA256 5ef2a009f16f214dbc04aa6bab49b987f4c0c70f57b9a883186fe7dae19325d7
MD5 eadb5a6008ec340198260470b22dde75
BLAKE2b-256 7c461f54cb21d34c1756495f1f2d467122be3951d20c6b585e40b1b1a399af6d

See more details on using hashes here.

Provenance

The following attestation bundles were made for pytemporal-1.4.14-cp39-cp39-manylinux_2_34_x86_64.whl:

Publisher: build-wheels.yml on gingermike/pytemporal

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page