Python analytics solution accelerator using composable data systems
Python analytics accelerator
A composable data system in a Python package.
This project uses:
- query UI: Ibis (Python dataframe code and/or SQL)
- query engine: DuckDB and ClickHouse (local and remote)
- storage: Delta Lake tables (local and/or cloud storage)
You can swap out your query engine(s) or storage format(s), mixing and matching as needed in a data mesh fashion. You can also use composable query UIs in other languages, like dplyr in R, though this project is Python-focused.
[!CAUTION] This accelerator is not necessarily production-ready, but could be a good starting point. For now, it is primarily intended as an educational resource.
setup
Get up and running in 4 commands.
Install:
pip install python-analytics-accelerator
[!TIP] This makes the `acc` CLI available whenever a Python environment with the package installed is active. Type `acc` in your terminal to see available commands.
Initialize:
acc init
[!TIP] Edit the `config.py` file to point to the GitHub repository and PyPI package you want to analyze.
[!IMPORTANT] You must edit the `.env` file to add your GitHub token, or otherwise set `GH_TOKEN` as an environment variable. You can use `gh auth token` to print your GitHub token if you have the GitHub CLI installed.
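If you want to check which token would be picked up before ingesting, here is a minimal sketch of one possible lookup order: the `GH_TOKEN` environment variable, then a `.env` file, then `gh auth token`. The helper names `read_env_file` and `resolve_gh_token` are hypothetical, and the precedence is an assumption; the package may load the token differently.

```python
import os
import subprocess
from pathlib import Path
from typing import Optional


def read_env_file(path: Path) -> dict:
    """Parse simple KEY=VALUE lines, skipping blank lines and comments."""
    values = {}
    if not path.exists():
        return values
    for line in path.read_text().splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop optional surrounding quotes around the value.
        values[key.strip()] = value.strip().strip("'\"")
    return values


def resolve_gh_token(env_path: Path = Path(".env")) -> Optional[str]:
    """Hypothetical lookup: environment, then .env, then the GitHub CLI."""
    token = os.environ.get("GH_TOKEN") or read_env_file(env_path).get("GH_TOKEN")
    if token:
        return token
    try:
        result = subprocess.run(
            ["gh", "auth", "token"], capture_output=True, text=True, check=True
        )
    except (FileNotFoundError, subprocess.CalledProcessError):
        # GitHub CLI missing or not logged in.
        return None
    return result.stdout.strip() or None
```

Calling `resolve_gh_token()` from the directory created by `acc init` returns the token string, or `None` if none of the three sources provides one.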
Ingest from external sources:
acc ingest
Run ETL:
acc run
Open dashboard:
acc dashboard
development
Clone and change into the repository:
gh repo clone lostmygithubaccount/python-analytics-accelerator
cd python-analytics-accelerator
Install:
pip install -r requirements.txt
Format your code:
ruff format .
contributing
Work in progress, contributions welcome. Consider opening an issue for discussion or reaching out in these early days.
File details
Details for the file python_analytics_accelerator-0.2.0.tar.gz.
File metadata
- Download URL: python_analytics_accelerator-0.2.0.tar.gz
- Upload date:
- Size: 446.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.11.8
File hashes
Algorithm | Hash digest
---|---
SHA256 | 256e6ad3734c6718881ac926e3d42ca57a23a9206e6c1e189a859c9b22a2f6f6
MD5 | 3b1e2a9ffe8ef82115f0b31d1486c06e
BLAKE2b-256 | 44cf236d985dd5ace12973cd2f6fc1aa86fcd12ef8bf416d4f237bc0e74541f7
File details
Details for the file python_analytics_accelerator-0.2.0-py3-none-any.whl.
File metadata
- Download URL: python_analytics_accelerator-0.2.0-py3-none-any.whl
- Upload date:
- Size: 14.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.11.8
File hashes
Algorithm | Hash digest
---|---
SHA256 | 9533cea8e10060e4c76660a4a4dd3cd082d612b752a4069387a606214e7d8983
MD5 | e19ffb04da2b1394d92de968aa655c48
BLAKE2b-256 | bd1d0be0d600a683b1f7e4d122b4641a89c4862647be502fa946891ab25e9fa8
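If you download one of these files manually, you can check it against the SHA256 digest listed above. The sketch below uses only the standard library; the function names `sha256_of` and `matches_digest` are illustrative, and the local file path in the usage note is an assumption about where you saved the download.

```python
import hashlib
from pathlib import Path

# SHA256 digest listed above for python_analytics_accelerator-0.2.0.tar.gz.
SDIST_SHA256 = "256e6ad3734c6718881ac926e3d42ca57a23a9206e6c1e189a859c9b22a2f6f6"


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Hash a file in chunks so large downloads are not read into memory at once."""
    digest = hashlib.sha256()
    with path.open("rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_digest(path: Path, expected_hex: str) -> bool:
    """Compare the file's SHA256 with an expected hex digest, ignoring case."""
    return sha256_of(path) == expected_hex.strip().lower()
```

For example, `matches_digest(Path("python_analytics_accelerator-0.2.0.tar.gz"), SDIST_SHA256)` should return `True` for an intact copy of the source distribution.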