Skip to main content

A semantic layer for your data.

Project description

Tesseract OLAP

A semantic layer for your data, enabling OLAP operations over relational databases. Tesseract OLAP provides a Python interface to translate multidimensional queries into optimized SQL, executing them against a backend, and returning the results in a structured format.

Key Capabilities

  • Multidimensional Analysis: Supports standard OLAP operations like drilling down, slicing, dicing, and pivoting.
  • Backend Agnostic: Designed to work with various data backends. Currently supports ClickHouse with specialized drivers.
  • Schema Validation: Uses a robust XML schema definition to validate queries and ensure data integrity.
  • High Performance: leverages polars for efficient data handling and transformation.
  • Caching: Built-in support for caching query results to improve performance (supports Redis).

Installation

Install the package using pip:

pip install tesseract-olap

Optional Dependencies

To use specific backends or features, install the corresponding extras:

  • ClickHouse Backend:
    pip install "tesseract-olap[clickhouse]"
    
  • Redis Caching:
    pip install "tesseract-olap[redis]"
    

Getting Started

In its most basic form, the tesseract-olap package provides a way to translate OLAP-type queries into request statements that a data backend can understand and execute safely.

1. Define your Schema

Tesseract uses a schema to define cubes, dimensions, and measures. Supported formats include XML, JSON, and YAML. See the OLAP Model Concepts and examples in the repository for schema details.

2. Initialize the Server

from tesseract_olap.backend.clickhouse import ClickhouseBackend
from tesseract_olap import OlapServer

# Initialize the backend connection
backend = ClickhouseBackend("clickhouse://user:pass@localhost:9000/database")

# Create the server instance with the schema
server = OlapServer(backend=backend, schema="./path/to/schema.xml")

3. Execute a Query

from tesseract_olap.query import DataRequest, DataQuery

def get_data():
    # Create a request object defining the query
    request = DataRequest.new("cube_name", {
        "drilldowns": ["Time", "Country"],
        "measures": ["Units", "Price"],
    })

    # Validate and compile the request against the schema
    query = DataQuery.from_request(server.schema, request)

    # Execute the query
    with server.session() as session:
        # Fetch results in different formats
        # result = session.fetch(query)           # Returns a Result object
        # result = session.fetch_records(query)   # Returns a list of dicts
        result = session.fetch_dataframe(query)   # Returns a Polars DataFrame

    return result.data

if __name__ == "__main__":
    data = get_data()
    print(data)

The server instance can be integrated into web applications (like FastAPI or LogicLayer) to provide a full-fledged OLAP API.


© 2022-2024 Datawheel, LLC.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tesseract_olap-0.23.1.tar.gz (110.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

tesseract_olap-0.23.1-py3-none-any.whl (137.7 kB view details)

Uploaded Python 3

File details

Details for the file tesseract_olap-0.23.1.tar.gz.

File metadata

  • Download URL: tesseract_olap-0.23.1.tar.gz
  • Upload date:
  • Size: 110.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.4.1 CPython/3.12.3 Linux/6.17.0-1018-azure

File hashes

Hashes for tesseract_olap-0.23.1.tar.gz
Algorithm Hash digest
SHA256 6591b40831d3c1a5c56f92ceff88c098ac492bcbad2ed09ec704eddc3d9bc2e0
MD5 52ed8f7d38a87356c505165938dee2a0
BLAKE2b-256 944b0467cdc76cf4144eff91313e3b81d9ca528aaed3f94a140c4300f883a09a

See more details on using hashes here.

File details

Details for the file tesseract_olap-0.23.1-py3-none-any.whl.

File metadata

  • Download URL: tesseract_olap-0.23.1-py3-none-any.whl
  • Upload date:
  • Size: 137.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.4.1 CPython/3.12.3 Linux/6.17.0-1018-azure

File hashes

Hashes for tesseract_olap-0.23.1-py3-none-any.whl
Algorithm Hash digest
SHA256 a548e8a19c10e4626e3ed4cc6decf4ee371ffb7594eb3b1f458edfbc4f7b45aa
MD5 e574a57f60466061c085c3158192c18b
BLAKE2b-256 21a63bfff66d4ae8a9049faadaf005c563a440efc7f36172a0c70a291ff1255f

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page