Skip to main content

Python analytics solution accelerator using composable data systems

Project description

Python analytics accelerator

A composable data system in a Python package.

layers

This project uses:

  1. User interface: Ibis (Python dataframe code and/or SQL)
  2. Execution engine: DuckDB (local) and ClickHouse (remote)
  3. Data storage: Delta Lake tables (local and/or cloud storage)

You can mix and match these components as needed.

[!CAUTION] This accelerator is not necessarily production-ready, but might be a good starting point. For now, it is primarily intended as an educational resource.

setup

Get up and running in 5 steps.

1. install

Install:

pip install python-analytics-accelerator

[!TIP] This makes the acc CLI available as long as you have a Python environment with the package installed active. Type acc in your terminal to see available commands.

2. initialize

Initialize:

acc init

[!TIP] Edit the config.py file to the GitHub repository and PyPI package you want to analyze.

[!IMPORTANT] You must edit the .env to add your GitHub token or otherwise set GITHUB_TOKEN as an environment variable.

You can use gh auth token to print your GitHub token if you have the GitHub CLI installed.

3. ingest

Ingest from external sources:

acc ingest

4. run data DAG

Run data DAG:

acc run

5. dashboard

Open dashboard:

acc dashboard

development

Clone and change into the repository:

gh repo clone lostmygithubaccount/python-analytics-accelerator
cd python-analytics-accelerator

Install:

pip install -r requirements.txt

Format your code:

ruff format .

contributing

Work in progress, contributions welcome. Please consider asking before substantial changes in these early days.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

python_analytics_accelerator-0.2.2.tar.gz (446.4 kB view details)

Uploaded Source

Built Distribution

File details

Details for the file python_analytics_accelerator-0.2.2.tar.gz.

File metadata

File hashes

Hashes for python_analytics_accelerator-0.2.2.tar.gz
Algorithm Hash digest
SHA256 149a34c7f48b1eeb28595e83714c71bbd510962d813aa914fba1657edb7f408b
MD5 8ea653bc4d445f5e52291e8e96d4a1c4
BLAKE2b-256 ae6e375842305cfb9dcc1a36a5717ed476794103219a45c74b344d39df941c0e

See more details on using hashes here.

File details

Details for the file python_analytics_accelerator-0.2.2-py3-none-any.whl.

File metadata

File hashes

Hashes for python_analytics_accelerator-0.2.2-py3-none-any.whl
Algorithm Hash digest
SHA256 093b52728714b366b8a10cf86fb8d675ab73c93323c9bc998aa0cba36b346996
MD5 c5edca958d07785a9ad848fd1f5c6d3b
BLAKE2b-256 2f945778dba812b6a02ce9afcff75b49d972b313d0a9f8abc03df0ef34fa0a2d

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page