Skip to main content

Python analytics solution accelerator using composable data systems

Project description

Python analytics accelerator

A composable data system in a Python package.

layers

This project uses:

  1. query UI: Ibis (Python dataframe code and/or SQL)
  2. query engine: DuckDB and ClickHouse (local and remote)
  3. storage: Delta Lake tables (local and/or cloud storage)

You can swap out your query engine(s) or storage format(s) as needed, mixing and matching as needed in a data mesh fashion. You can also use composable query UIs in other languages like dplyr in R, though this project is Python-focused.

[!CAUTION] This accelerator is not necessarily production-ready, but could be a good starting point. For now, it is primarily intended as an educational resource.

setup

Get up and running in 4 commands.

Install:

pip install python-analytics-accelerator

[!TIP] This makes the acc CLI available as long as you have a Python environment with the package installed active. Type acc in your terminal to see available commands.

Initialize:

acc init

[!TIP] Edit the config.py file to the GitHub repository and PyPI package you want to analyze.

[!IMPORTANT] You must edit the .env to add your GitHub token or otherwise set GH_TOKEN as an environment variable.

You can use gh auth token to print your GitHub token if you have the GitHub CLI installed.

Ingest from external sources:

acc ingest

Run ETL:

acc run

Open dashboard:

acc dashboard

development

Clone and change into the repository:

gh repo clone lostmygithubaccount/python-analytics-accelerator
cd python-analytics-accelerator

Install:

pip install -r requirements.txt

Format your code:

ruff format .

contributing

Work in progress, contributions welcome. Consider opening an issue for discussion or reaching out in these early days.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

python_analytics_accelerator-0.2.0.tar.gz (446.5 kB view details)

Uploaded Source

Built Distribution

File details

Details for the file python_analytics_accelerator-0.2.0.tar.gz.

File metadata

File hashes

Hashes for python_analytics_accelerator-0.2.0.tar.gz
Algorithm Hash digest
SHA256 256e6ad3734c6718881ac926e3d42ca57a23a9206e6c1e189a859c9b22a2f6f6
MD5 3b1e2a9ffe8ef82115f0b31d1486c06e
BLAKE2b-256 44cf236d985dd5ace12973cd2f6fc1aa86fcd12ef8bf416d4f237bc0e74541f7

See more details on using hashes here.

File details

Details for the file python_analytics_accelerator-0.2.0-py3-none-any.whl.

File metadata

File hashes

Hashes for python_analytics_accelerator-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 9533cea8e10060e4c76660a4a4dd3cd082d612b752a4069387a606214e7d8983
MD5 e19ffb04da2b1394d92de968aa655c48
BLAKE2b-256 bd1d0be0d600a683b1f7e4d122b4641a89c4862647be502fa946891ab25e9fa8

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page