A semantic layer for your data.
Project description
Tesseract OLAP
A semantic layer for your data, enabling OLAP operations over relational databases. Tesseract OLAP provides a Python interface to translate multidimensional queries into optimized SQL, executing them against a backend, and returning the results in a structured format.
Key Capabilities
- Multidimensional Analysis: Supports standard OLAP operations like drilling down, slicing, dicing, and pivoting.
- Backend Agnostic: Designed to work with various data backends. Currently supports ClickHouse with specialized drivers.
- Schema Validation: Uses a robust XML schema definition to validate queries and ensure data integrity.
- High Performance: leverages
polarsfor efficient data handling and transformation. - Caching: Built-in support for caching query results to improve performance (supports Redis).
Installation
Install the package using pip:
pip install tesseract-olap
Optional Dependencies
To use specific backends or features, install the corresponding extras:
- ClickHouse Backend:
pip install "tesseract-olap[clickhouse]"
- Redis Caching:
pip install "tesseract-olap[redis]"
Getting Started
In its most basic form, the tesseract-olap package provides a way to translate OLAP-type queries into request statements that a data backend can understand and execute safely.
1. Define your Schema
Tesseract uses a schema to define cubes, dimensions, and measures. Supported formats include XML, JSON, and YAML. See the OLAP Model Concepts and examples in the repository for schema details.
2. Initialize the Server
from tesseract_olap.backend.clickhouse import ClickhouseBackend
from tesseract_olap import OlapServer
# Initialize the backend connection
backend = ClickhouseBackend("clickhouse://user:pass@localhost:9000/database")
# Create the server instance with the schema
server = OlapServer(backend=backend, schema="./path/to/schema.xml")
3. Execute a Query
from tesseract_olap.query import DataRequest, DataQuery
def get_data():
# Create a request object defining the query
request = DataRequest.new("cube_name", {
"drilldowns": ["Time", "Country"],
"measures": ["Units", "Price"],
})
# Validate and compile the request against the schema
query = DataQuery.from_request(server.schema, request)
# Execute the query
with server.session() as session:
# Fetch results in different formats
# result = session.fetch(query) # Returns a Result object
# result = session.fetch_records(query) # Returns a list of dicts
result = session.fetch_dataframe(query) # Returns a Polars DataFrame
return result.data
if __name__ == "__main__":
data = get_data()
print(data)
The server instance can be integrated into web applications (like FastAPI or LogicLayer) to provide a full-fledged OLAP API.
© 2022-2024 Datawheel, LLC.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file tesseract_olap-0.22.7.tar.gz.
File metadata
- Download URL: tesseract_olap-0.22.7.tar.gz
- Upload date:
- Size: 110.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/2.3.2 CPython/3.12.3 Linux/6.14.0-1017-azure
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
c312af4b6723122bcbe7a9175b4b017c0ff4a8196d894a7c60d0cceb9ec0175e
|
|
| MD5 |
7feacf16892334108b860d197e28f655
|
|
| BLAKE2b-256 |
cdade6cf65e702960bd99fe2b49de699497f692a69ccb7e6a502480bce1fed25
|
File details
Details for the file tesseract_olap-0.22.7-py3-none-any.whl.
File metadata
- Download URL: tesseract_olap-0.22.7-py3-none-any.whl
- Upload date:
- Size: 137.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/2.3.2 CPython/3.12.3 Linux/6.14.0-1017-azure
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
ee42a3f87f4544d3305689ff7ff191e51baca7b09c5fbc80d37b92f07e2a95ba
|
|
| MD5 |
fc73538851a75b14bfc84897d4c5bf7f
|
|
| BLAKE2b-256 |
46c94f201ab54997b97f9895e44bac26d97ffdf84fbbd2c4d37af5fdb97af756
|