Skip to main content

AITÆM: All Interesting Things Are Essentially Metrics - A Python library for generating data insights

Project description

AITÆM: All Interesting Things Are Essentially Metrics

CI PyPI version Python versions Docs

This is the aitaem library, pronounced "i-tame".

Full documentation: https://chaturv3di.github.io/aitaem/

Why?

TL;DR: Point this Python library toward your OLAP database or a local CSV file and start generating insights, i.e. metrics, slices, segments, and time series, in no time. This library is LLM friendly (more on that later).

Business leaders, PMs, EMs, and even individual contributors constantly require deep understanding of their businesses and products. Another term for this understanding is "data insights". The most common way to obtain these insights is to rely on a data scientist or a business analyst to dive into the data and compute the insights. Why is this a bad idea?

  1. Practically: Dashboards are limited in how much they can hold apriori. There's always a new question which existing dashboards cannot answer. There are always either too many dashboards or too few
  2. Operationally: It is a waste of time if a DS or a BA has to dive into the source tables and (re)write SQL queries to compute customized metrics, slices, or segments repeatedly
  3. Scientifically: The accuracy of ad-hoc analysis depends upon the individual; the same analysis done by different individuals can yield different results
  4. Organisationally: Inter-org trust should be built upon processes/toolings rather than on individuals

What?

This library provides powerful functionality in a compact API. The core consists of two componenents.

  1. Specifications: A simple declarative structure to modularly define metric specs, slice/breakdown specs, and segment specs
  2. Computation: A small collection of Python classes with compact signatures which compute the metrics

Additionally, there are utilities to connect to various data backends (simultaneously) and helpers to visualize/render charts and trends.

Quick Start

Installation

pip install aitaem

Three-Step API

from aitaem import SpecCache, ConnectionManager, MetricCompute

# 1. Load metric, slice, and segment specs from YAML files or directories
cache = SpecCache.from_yaml(
    metric_paths="examples/metrics/",
    slice_paths="examples/slices/",
    segment_paths="examples/segments/",
)

# 2. Configure the data backend (DuckDB, BigQuery, etc.)
conn = ConnectionManager()
conn.add_connection("duckdb", database=":memory:")  # or path to .duckdb file

# 3. Compute metrics
mc = MetricCompute(cache, conn)
df = mc.compute(
    metrics="ctr",
    slices="campaign_type",
    segments="platform",
    time_window=("2024-01-01", "2024-04-01"),
)
print(df)

Standard Output Format

Every compute() call returns a pandas DataFrame with exactly these 10 columns:

Column Description
period_type "all_time" or a named period
period_start_date ISO date string or None
period_end_date ISO date string or None
entity_id Entity column value when by_entity is set; None otherwise
metric_name Name of the metric (e.g. "ctr")
slice_type Slice name or "none" for the all-data baseline
slice_value Slice value (e.g. "Search") or "all"
segment_name Segment name or "none" for the all-data baseline
segment_value Segment value (e.g. "Google Ads") or "all"
metric_value Computed numeric result

Example: Ad Campaigns Dataset

The examples/ directory contains sample YAML specs and a CSV dataset for an ad campaigns use case. You can run the following end-to-end:

from aitaem import SpecCache, ConnectionManager, MetricCompute
from aitaem.helpers import load_csvs_to_duckdb

# Load the sample CSV into a DuckDB file and get back a connector
connector = load_csvs_to_duckdb("examples/data/ad_campaigns.csv", "examples/data/ad_campaigns.duckdb")

conn = ConnectionManager()
conn.add_connection("duckdb", connector=connector)

cache = SpecCache.from_yaml(
    metric_paths="examples/metrics/",
    slice_paths="examples/slices/",
    segment_paths="examples/segments/",
)

mc = MetricCompute(cache, conn)

# CTR by campaign type for Q1 2024
df = mc.compute(
    metrics="ctr",
    slices="campaign_type",
    time_window=("2024-01-01", "2024-04-01"),
)
print(df[["slice_value", "metric_value"]])

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

aitaem-0.1.5.tar.gz (226.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

aitaem-0.1.5-py3-none-any.whl (69.0 kB view details)

Uploaded Python 3

File details

Details for the file aitaem-0.1.5.tar.gz.

File metadata

  • Download URL: aitaem-0.1.5.tar.gz
  • Upload date:
  • Size: 226.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for aitaem-0.1.5.tar.gz
Algorithm Hash digest
SHA256 ab14257cab5856de9fe2cffd553f58f9f04c754f6f9690d697c08d55f633904a
MD5 58403088e821e81ebb089f7d40dd09b2
BLAKE2b-256 9c59b28e97ed3681dcc65ef22737c5520a9e122f412d89ae4490ceb31674ac27

See more details on using hashes here.

Provenance

The following attestation bundles were made for aitaem-0.1.5.tar.gz:

Publisher: publish.yml on chaturv3di/aitaem

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file aitaem-0.1.5-py3-none-any.whl.

File metadata

  • Download URL: aitaem-0.1.5-py3-none-any.whl
  • Upload date:
  • Size: 69.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for aitaem-0.1.5-py3-none-any.whl
Algorithm Hash digest
SHA256 75db356d3b9b556fb4fca74d6ce965be17f0be17af724b1279fb2c51d0ca057d
MD5 c9bf98cbc2f685d8edce32db9a43e6b1
BLAKE2b-256 08ce1adb8618c479ebd7e5c00e078ad193eeff9d7b40f90f3d19155fce92ca0b

See more details on using hashes here.

Provenance

The following attestation bundles were made for aitaem-0.1.5-py3-none-any.whl:

Publisher: publish.yml on chaturv3di/aitaem

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page