mitzu

Product analytics over your data warehouse

These details have not been verified by PyPI

Project links

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

Mit - License

Mitzu is an open source product analytics tool that queries your data lake or data warehouse.

Features

Visualization for:

Funnels

Segmentation

Retention

User Journey (coming soon)

Revenue calculations (coming soon)

User Lookup (coming soon)

Cohorts Analysis

Standalone web app for non-tech people

Notebook visual app

Notebook low-code analytics in python

Batch ETL jobs support

Supported Integrations

Mitzu integrates with most modern data lake and warehouse solutions:

AWS Athena

Databricks Spark (SQL)

Files - SQLite (csv, parquet, json, etc.)

MySQL

PostgreSQL

Snowflake

Trino / Starburst

Coming Soon

Clickhouse

BigQuery

Redshift

Quick Start

In this section, we will describe how to start with Mitzu on your local machine. Skip this section if you rather see Mitzu in a prepared notebook or webapp. Otherwise get ready and fire up your own data-science notebook.

Install the Mitzu python library

pip install mitzu

Reading The Sample Dataset

The simplest way to get started with Mitzu is in a data-science notebook. In your notebook read the sample user behavior dataset. Mitzu can discover your tables in a data warehouse or data lake. For the sake of simplicity we provide you an in-memory sqlite based table that contains

import mitzu.samples as smp dp = smp.get_sample_discovered_project() m = dp.create_notebook_class_model()

Segmentation

The following command visualizes the count of unique users over time who did page visit action in the last 30 days.

m.page_visit

In the example above m.page_visit refers to a user event segment for which the notebook representation is a segmentation chart. If this sounds unfamiliar, don't worry! Later we will explain you everything.

Funnels

You can create a funnel chart by placing the >> operator between two user event segments.

m.page_visit >> m.checkout

This will visualize the conversion rate of users that first did page_visit action and then did checkout within a day in the last 30 days.

Filtering

You can apply filters to user event segment the following way:

m.page_visit.user_country_code.is_us >> m.checkout # You can achieve the same filter with: # (m.page_visit.user_country_code == 'us') # # you can also apply >, >=, <, <=, !=, operators.

With this syntax we have narrowed down our page visit user event segment to page visits from the US. Stacking filters is possible with the & (and) and | (or) operators.

m.page_visit.user_country_code.is_us & m.page_visit.acquisition_campaign.is_organic # if using the comparison operators, make sure you put the user event segments in parenthesis. # (m.page_visit.user_country_code == 'us') & (m.page_visit.acquisition_campaign == 'organic')

Apply multi value filtering with the any_of or none_of functions:

m.page_visit.user_country_code.any_of('us', 'cn', 'de') # m.page_visit.user_country_code.none_of('us', 'cn', 'de')

Of course you can apply filters on every user event segment in a funnel.

m.add_to_cart >> (m.checkout.cost_usd <= 1000)

Metrics Configuration

To any funnel or segmentation you can apply the config method. Where you can define the parameters of the metric.

m.page_visit.config( start_dt="2021-08-01", end_dt="2021-09-01", group_by=m.page_visit.domain, time_group='total', )

start_dt should be an iso datetime string, or python datetime, where the metric should start.

end_dt should be an iso datetime string, or python datetime, where the metric should end.

group_by is a property that you can refer to from the notebook class model.

time_group is the time granularity of the query for which the possible values are: hour, day, week, month, year, total

Funnels have an extra configuration parameter conv_window, this has the following format: <VAL> <TIME WINDOW>, where VAL is a positive integer.

(m.page_visit >> m.checkout).config( start_dt="2021-08-01", end_dt="2021-09-01", group_by=m.page_visit.domain, time_group='total', conv_window='1 day', )

SQL Generator

For any metric you can print out the SQL code that Mitzu generates. This you can do by calling the .print_sql() method.

(m.page_visit >> m.checkout).config( start_dt="2021-08-01", end_dt="2021-09-01", group_by=m.page_visit.domain, time_group='total', conv_window='1 day', ).print_sql()

Pandas DataFrames

Similarly you can access the results in the form of a Pandas DataFrame with the method .get_df()

(m.page_visit >> m.checkout).config( start_dt="2021-08-01", end_dt="2021-09-01", group_by=m.page_visit.domain, time_group='total', conv_window='1 day', ).get_df()

Notebook Dashboards

You can also visualize the webapp in a Jupyter Notebook:

import mitzu.samples as smp dp = smp.get_sample_discovered_project() dp.notebook_dashboard()

Usage In Notebooks

Example notebook

Documentation

Webapp

Mitzu can run as a standalone webapp or embedded inside a notebook.

Trying out locally:

docker run -p 8082:8082 imeszaros/mitzu-webapp

Example webapp

Webapp documentation

Connect Your Own Data

Mitzu is be able to connect to your data warehouse or data lake. To get started with your own data integration please read our handy docs

Contribution Guide

Please read our Contribution Guide

Project details

These details have not been verified by PyPI

Project links

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

0.6.11

Jun 25, 2023

0.6.10

Jun 22, 2023

0.6.9

Jun 14, 2023

0.6.8

Jun 14, 2023

0.6.7

Jun 9, 2023

0.6.6

Jun 5, 2023

0.6.5

Jun 5, 2023

0.6.4

Jun 4, 2023

0.6.3

May 31, 2023

0.6.2

May 31, 2023

0.6.1

May 29, 2023

0.6.0

May 16, 2023

0.6.0rc9 pre-release

May 15, 2023

0.6.0rc8 pre-release

May 12, 2023

0.6.0rc7 pre-release

May 11, 2023

0.6.0rc6 pre-release

May 6, 2023

0.6.0rc5 pre-release

May 4, 2023

0.6.0rc4 pre-release

Apr 29, 2023

0.6.0rc3 pre-release

Apr 25, 2023

0.6.0rc2 pre-release

Apr 25, 2023

0.6.0rc1 pre-release

Apr 22, 2023

0.5.15

Apr 12, 2023

0.5.14

Apr 11, 2023

0.5.13

Apr 11, 2023

0.5.12

Apr 8, 2023

0.5.11

Mar 29, 2023

0.5.10

Mar 24, 2023

0.5.9

Mar 24, 2023

0.5.8

Mar 17, 2023

0.5.7

Mar 12, 2023

0.5.6

Mar 9, 2023

0.5.6rc3 pre-release

Mar 8, 2023

0.5.6rc2 pre-release

Mar 8, 2023

0.5.6rc1 pre-release

Mar 8, 2023

This version

0.5.5

Mar 3, 2023

0.5.4

Mar 3, 2023

0.5.2

Mar 2, 2023

0.5.1

Mar 1, 2023

0.5.0rc2 pre-release

Feb 23, 2023

0.5.0rc1 pre-release

Feb 22, 2023

0.4.0rc2 pre-release

Jan 17, 2023

0.3.2

Dec 8, 2022

0.3.1

Dec 8, 2022

0.2.68

Nov 28, 2022

0.2.67

Nov 27, 2022

0.2.66

Nov 22, 2022

0.2.65

Nov 22, 2022

0.2.61

Nov 16, 2022

0.2.60

Nov 14, 2022

0.2.59

Nov 9, 2022

0.2.49

Oct 22, 2022

0.2.46

Oct 21, 2022

0.2.45

Oct 20, 2022

0.2.44

Oct 14, 2022

0.2.42

Oct 13, 2022

0.2.41

Oct 13, 2022

0.2.34

Oct 7, 2022

0.2.32

Oct 4, 2022

0.2.31

Oct 2, 2022

0.2.30

Oct 2, 2022

0.2.29

Oct 1, 2022

0.2.28

Sep 5, 2022

0.2.24

Aug 29, 2022

0.2.23

Aug 29, 2022

0.2.19

Aug 23, 2022

0.2.18

Aug 23, 2022

0.2.17

Aug 21, 2022

0.2.16

Aug 21, 2022

0.2.14

Aug 18, 2022

0.2.13

Aug 18, 2022

0.2.12

Aug 18, 2022

0.2.9

Aug 17, 2022

0.2.7

Aug 16, 2022

0.2.5

Aug 14, 2022

0.2.2

Aug 13, 2022

0.1.49

Aug 7, 2022

0.1.48

Aug 4, 2022

0.1.47

Aug 4, 2022

0.1.46

Aug 4, 2022

0.1.45

Aug 3, 2022

0.1.44

Aug 2, 2022

0.1.43

Aug 1, 2022

0.1.42

Jul 31, 2022

0.1.41

Jul 29, 2022

0.1.40

Jul 29, 2022

0.1.39

Jul 28, 2022

0.1.38

Jul 28, 2022

0.1.37

Jul 28, 2022

0.1.36

Jul 20, 2022

0.1.35

Jul 16, 2022

0.1.34

Jul 7, 2022

0.1.32

Jul 5, 2022

0.1.30

Jul 5, 2022

0.1.28

Jul 4, 2022

0.1.23

Jun 25, 2022

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mitzu-0.5.5.tar.gz (627.0 kB view hashes)

Uploaded Mar 3, 2023 Source

Built Distribution

mitzu-0.5.5-py3-none-any.whl (673.3 kB view hashes)

Uploaded Mar 3, 2023 Python 3

Hashes for mitzu-0.5.5.tar.gz

Hashes for mitzu-0.5.5.tar.gz
Algorithm	Hash digest
SHA256	`c98526ff447a080a5e5bff6901942b74a150dafcec51a040bdacb377b1f2ed07`
MD5	`79e96d30b5fb2b594b2d7fad74f7142d`
BLAKE2b-256	`c067346bb7a39ff73fcdc4766739de446a45b08d140d8527772c20fe0084b41b`

Hashes for mitzu-0.5.5-py3-none-any.whl

Hashes for mitzu-0.5.5-py3-none-any.whl
Algorithm	Hash digest
SHA256	`3dac14ad5f96b4c08f7e0de73c2181373f8cb77893a47b5ea276e27e8cbc1e96`
MD5	`71df601ee0573eb0d5f8655fec3c0ecd`
BLAKE2b-256	`f9e38632c24c5f8b344aaa057730791a0e819e191f7459aa1678e08e392c3efc`