Skip to main content

AITÆM: All Interesting Things Are Essentially Metrics - A Python library for generating data insights

Project description

AITÆM: All Interesting Things Are Essentially Metrics

CI PyPI version Python versions Docs

This is the aitaem library, pronounced "i-tame".

Why?

TL;DR: Point this Python library toward your OLAP database or a local CSV file and start generating insights, i.e. metrics, slices, segments, and time series, in no time. This library is LLM friendly (more on that later).

Business leaders, PMs, EMs, and even individual contributors constantly require deep understanding of their businesses and products. Another term for this understanding is "data insights". The most common way to obtain these insights is to rely on a data scientist or a business analyst to dive into the data and compute the insights. Why is this a bad idea?

  1. Practically: Dashboards are limited in how much they can hold apriori. There's always a new question which existing dashboards cannot answer. There are always either too many dashboards or too few
  2. Operationally: It is a waste of time if a DS or a BA has to dive into the source tables and (re)write SQL queries to compute customized metrics, slices, or segments repeatedly
  3. Scientifically: The accuracy of ad-hoc analysis depends upon the individual; the same analysis done by different individuals can yield different results
  4. Organisationally: Inter-org trust should be built upon processes/toolings rather than on individuals

What?

This library provides powerful functionality in a compact API. The core consists of two componenents.

  1. Specifications: A simple declarative structure to modularly define metric specs, slice/breakdown specs, and segment specs
  2. Computation: A small collection of Python classes with compact signatures which compute the metrics

Additionally, there are utilities to connect to various data backends (simultaneously) and helpers to visualize/render charts and trends.

Quick Start

Installation

pip install aitaem

Three-Step API

from aitaem import SpecCache, ConnectionManager, MetricCompute

# 1. Load metric, slice, and segment specs from YAML files or directories
cache = SpecCache.from_yaml(
    metric_paths="examples/metrics/",
    slice_paths="examples/slices/",
    segment_paths="examples/segments/",
)

# 2. Configure the data backend (DuckDB, BigQuery, etc.)
conn = ConnectionManager()
conn.add_connection("duckdb", database=":memory:")  # or path to .duckdb file

# 3. Compute metrics
mc = MetricCompute(cache, conn)
df = mc.compute(
    metrics="ctr",
    slices="campaign_type",
    segments="platform",
    time_window=("2024-01-01", "2024-04-01"),
)
print(df)

Standard Output Format

Every compute() call returns a pandas DataFrame with exactly these 9 columns:

Column Description
period_type "all_time" or a named period
period_start_date ISO date string or None
period_end_date ISO date string or None
metric_name Name of the metric (e.g. "ctr")
slice_type Slice name or "none" for the all-data baseline
slice_value Slice value (e.g. "Search") or "all"
segment_name Segment name or "none" for the all-data baseline
segment_value Segment value (e.g. "Google Ads") or "all"
metric_value Computed numeric result

Example: Ad Campaigns Dataset

The examples/ directory contains sample YAML specs and a CSV dataset for an ad campaigns use case. You can run the following end-to-end:

from aitaem import SpecCache, ConnectionManager, MetricCompute
from aitaem.helpers import load_csvs_to_duckdb

# Load the sample CSV into a DuckDB file and get back a connector
connector = load_csvs_to_duckdb("examples/data/ad_campaigns.csv", "examples/data/ad_campaigns.duckdb")

conn = ConnectionManager()
conn.add_connection("duckdb", connector=connector)

cache = SpecCache.from_yaml(
    metric_paths="examples/metrics/",
    slice_paths="examples/slices/",
    segment_paths="examples/segments/",
)

mc = MetricCompute(cache, conn)

# CTR by campaign type for Q1 2024
df = mc.compute(
    metrics="ctr",
    slices="campaign_type",
    time_window=("2024-01-01", "2024-04-01"),
)
print(df[["slice_value", "metric_value"]])

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

aitaem-0.1.3.tar.gz (193.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

aitaem-0.1.3-py3-none-any.whl (64.6 kB view details)

Uploaded Python 3

File details

Details for the file aitaem-0.1.3.tar.gz.

File metadata

  • Download URL: aitaem-0.1.3.tar.gz
  • Upload date:
  • Size: 193.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for aitaem-0.1.3.tar.gz
Algorithm Hash digest
SHA256 b64f5987356ef6f5f78421a87bf8754c071c7659d0439d19d9a66b2e825c1a1a
MD5 9a11313aaec96d49b3314284c07afa17
BLAKE2b-256 29d64e28ff3e7089cf3b97e2e9eeaf03113df235effe8546505aef89aeb868d3

See more details on using hashes here.

Provenance

The following attestation bundles were made for aitaem-0.1.3.tar.gz:

Publisher: publish.yml on chaturv3di/aitaem

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file aitaem-0.1.3-py3-none-any.whl.

File metadata

  • Download URL: aitaem-0.1.3-py3-none-any.whl
  • Upload date:
  • Size: 64.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for aitaem-0.1.3-py3-none-any.whl
Algorithm Hash digest
SHA256 81810ecd40d615eb3b985862fc53f00078d730383f457b9a6f18889f801c7149
MD5 5429a42d8e8708ae3c303f7a0451472b
BLAKE2b-256 1c6dc888fc134610c4209484d7c4fb7d67759efc517d017a8c7384fc8894abd5

See more details on using hashes here.

Provenance

The following attestation bundles were made for aitaem-0.1.3-py3-none-any.whl:

Publisher: publish.yml on chaturv3di/aitaem

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page