Streamlit-first interactive data exploration with drill-down capabilities for exploring aggregated data

Luxin

A Streamlit-first Python package for interactive data exploration with drill-down capabilities. Click on aggregated rows to instantly see the underlying detail data.

Luxin helps you explore aggregated data interactively through an intuitive, Streamlit-native interface. Perfect for data scientists, analysts, and engineers who need to drill down into summary statistics to understand the underlying data.

✨ Key Features

🔍 Interactive drill-down - Click on aggregated rows to see source data instantly
📊 Streamlit-native UI - Fully integrated with Streamlit's native widgets
🐼 Pandas support - Works seamlessly with pandas DataFrames
🦀 Polars support - Optional support for Polars DataFrames
🎯 Automatic tracking - TrackedDataFrame automatically tracks source rows during aggregations
📓 Jupyter support - Also works in Jupyter notebooks (legacy HTML backend)
🚀 Zero-config - Get started with minimal setup
🎨 Modern UI - Clean, responsive interface with side-by-side detail view
📈 Multi-column grouping - Support for complex multi-level aggregations
🔧 Configurable - Customize UI behavior with InspectorConfig (luxin.config)
🧭 Phase 3 (v0.3.0) - Optional multi-level drill (DrillHierarchySpec), comparison views (luxin.compare), data-quality panel, and aggregation builder. All are feature-flagged, and the defaults stay backward compatible
✅ Well-tested - 85%+ test coverage with comprehensive test suite

📦 Installation

pip install luxin

Optional extras:

pip install "luxin[notebook]"  # Jupyter/HTML drill-down (luxin-nb)
pip install "luxin[polars]"    # Polars → pandas / TrackedDataFrame helpers
pip install "luxin[compare]"   # SciPy for optional Welch t-tests in luxin.compare.inspect_pair

🚀 Quick Start

import streamlit as st
from luxin import Inspector, TrackedDataFrame
import pandas as pd

# Load your data
df = TrackedDataFrame({
    'category': ['A', 'A', 'B', 'B', 'C'],
    'sales': [100, 150, 200, 250, 300],
    'profit': [10, 15, 20, 25, 30]
})

# Aggregate data - tracking is automatic
agg = df.groupby(['category']).agg({'sales': 'sum', 'profit': 'sum'})

# Display with drill-down capability
inspector = Inspector(agg)
inspector.render()  # Must be called within a Streamlit app context

Save this as app.py, then run `streamlit run app.py` to see the interactive dashboard.

📚 Usage Examples

Basic Usage

import streamlit as st
from luxin import Inspector, TrackedDataFrame

# Create a TrackedDataFrame
df = TrackedDataFrame({
    'region': ['North', 'North', 'South', 'South'],
    'sales': [100, 200, 150, 250]
})

# Aggregate and inspect
agg = df.groupby('region').sum()
inspector = Inspector(agg)
inspector.render()

With Regular Pandas DataFrame

import streamlit as st
from luxin import Inspector, TrackedDataFrame
import pandas as pd

# Your existing workflow
df = pd.DataFrame({
    'category': ['A', 'A', 'B', 'B'],
    'sales': [100, 200, 150, 250]
})

# Convert to TrackedDataFrame for aggregation tracking
tracked_df = TrackedDataFrame(df)
agg = tracked_df.groupby('category').sum()

# Use Inspector
inspector = Inspector(agg)
inspector.render()

Multi-Column Grouping

import streamlit as st
from luxin import Inspector, TrackedDataFrame

df = TrackedDataFrame({
    'region': ['North', 'North', 'South', 'South'],
    'product': ['A', 'B', 'A', 'B'],
    'sales': [100, 150, 200, 250]
})

# Group by multiple columns
agg = df.groupby(['region', 'product']).sum()

# Inspect with drill-down
inspector = Inspector(agg)
inspector.render()

🎯 How It Works

When you aggregate data using TrackedDataFrame.groupby().agg(), Luxin automatically tracks which source rows contribute to each aggregated row. When you select a row in the Inspector interface, a side panel shows all the detail rows that were aggregated to create that summary.

1. Create TrackedDataFrame - Wrap your data in TrackedDataFrame
2. Aggregate - Use TrackedDataFrame.groupby() with tracked reductions only: .agg(...), .sum(), .mean(), .count(), .min(), .max(), .std(), .var(), .median(). Other pandas GroupBy APIs (e.g. .apply(), .transform(), .pipe()) are not supported because they cannot preserve drill lineage.
3. Inspect - Use Inspector(agg_df).render() to see the interactive view
4. Drill Down - Select any aggregated row to see the underlying detail data
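
To make the lineage idea concrete, here is a minimal sketch in plain pandas of what "tracking which source rows contribute to each aggregated row" can look like. It is an illustration only, not Luxin's implementation: it relies on pandas' own GroupBy.groups mapping, and detail_for is a hypothetical helper name.

```python
import pandas as pd

# Plain pandas DataFrame; Luxin's TrackedDataFrame is deliberately not used here.
df = pd.DataFrame({
    "category": ["A", "A", "B", "B", "C"],
    "sales": [100, 150, 200, 250, 300],
})

grouped = df.groupby("category")
agg = grouped["sales"].sum()

# GroupBy.groups maps each group key to the index labels of its source rows.
lineage = {key: list(labels) for key, labels in grouped.groups.items()}


def detail_for(key):
    """Hypothetical helper: return the source rows behind one aggregated row."""
    return df.loc[lineage[key]]


print(agg)
print(detail_for("A"))
```

Free-form operations such as .apply() can return rows that no longer correspond to a single group key, which is one way to see why a lineage mapping like this only survives simple reductions.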

Pre-aggregated workflows: use create_drill_table(agg_df, detail_df, groupby_cols) when the aggregate already exists (see User Guide). Manual source mapping is NA-aware for groupby(..., dropna=False) keys ([Unreleased] fixes in CHANGELOG.md).
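
The NA-aware point deserves a small illustration. The sketch below uses only pandas and does not call create_drill_table, whose exact return value is not shown here. It demonstrates why a missing group key needs special handling when mapping an existing aggregate back to its detail rows; detail_rows_for is a hypothetical helper.

```python
import pandas as pd

detail = pd.DataFrame({
    "region": ["North", None, "North", None],
    "sales": [100, 50, 200, 25],
})

# dropna=False keeps the missing-key group in the aggregate.
agg = detail.groupby("region", dropna=False, as_index=False)["sales"].sum()


def detail_rows_for(key_values, detail_df):
    """Hypothetical helper: select the detail rows matching one aggregate key."""
    mask = pd.Series(True, index=detail_df.index)
    for col, value in key_values.items():
        if pd.isna(value):
            # A missing value never compares equal to anything, so use isna().
            mask &= detail_df[col].isna()
        else:
            mask &= detail_df[col] == value
    return detail_df[mask]


for _, row in agg.iterrows():
    rows = detail_rows_for({"region": row["region"]}, detail)
    print(row["region"], row["sales"], len(rows))
```

A plain equality filter would return no rows for the missing-key group, so its aggregated total would have nothing to drill into.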

Note: TrackedDataFrame.show_drill_table() tries Inspector (Streamlit) first; if Streamlit is not importable, it attempts luxin_nb — install luxin[notebook] for Jupyter.
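
The fallback order described in the note can be sketched with the standard library alone. This is an assumption-laden illustration, not the code inside show_drill_table: pick_backend, module_available, and the returned labels are hypothetical names.

```python
from importlib.util import find_spec


def module_available(name: str) -> bool:
    """Return True if a top-level module can be found in this environment."""
    return find_spec(name) is not None


def pick_backend(is_available=module_available) -> str:
    """Hypothetical helper: prefer Streamlit, then fall back to luxin_nb."""
    if is_available("streamlit"):
        return "streamlit"
    if is_available("luxin_nb"):
        return "notebook"
    raise ImportError(
        'No drill-down backend found; install streamlit or "luxin[notebook]".'
    )


if __name__ == "__main__":
    try:
        print(pick_backend())
    except ImportError as exc:
        print(exc)
```

Passing the availability check in as a parameter keeps the ordering logic testable without installing or uninstalling either package.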

Typical use cases:

Exploring sales data by region, then drilling into individual transactions
Analyzing error logs by error type, then viewing specific error instances
Reviewing survey responses by category, then reading individual responses
Investigating performance metrics by service, then examining individual requests
Understanding aggregated statistics by drilling into source data

📖 Examples

Check out the example files:

Basic Usage - Simple examples of Inspector API
Sales Analysis - Real-world sales data exploration
Phase 3 demo - Multi-level drill, quality panel, aggregation builder flags

Run examples with Streamlit:

streamlit run examples/basic_usage.py
streamlit run examples/sales_analysis.py
streamlit run examples/phase3_multi_level.py

📚 Documentation

Comprehensive documentation is available:

Getting Started - Installation and basic usage
Releasing - Monorepo version alignment and release checklist
User Guide - Comprehensive usage documentation
API Reference - Complete API documentation
Examples - Code examples and tutorials
Roadmap - Future features and development plans
Troubleshooting - Common issues and solutions
Migration Guide - Migrating from legacy APIs; covers the optional Phase 3 features through v0.3.0 (v0.4.0 is a coordinated packaging release; see CHANGELOG.md)
Changelog - Release history

Full documentation: https://luxin.readthedocs.io/

🛠️ Development

# Clone and install (monorepo: core + notebook package needed to run tests)
git clone https://github.com/eddiethedean/luxin.git
cd luxin
pip install -e ./luxin_core -e ./luxin_nb -e ".[dev]"

# Run tests. PYTEST_DISABLE_PLUGIN_AUTOLOAD=1 stops globally installed pytest plugins
# (which can emit async-fixture errors on some machines) from loading; pass
# -p pytest_cov explicitly when you need the coverage plugin:
PYTEST_DISABLE_PLUGIN_AUTOLOAD=1 pytest tests/

# Combined coverage for luxin + luxin_core + luxin_nb (matches CI):
PYTEST_DISABLE_PLUGIN_AUTOLOAD=1 pytest tests/ -p pytest_cov \
  --cov=luxin --cov=luxin_core --cov=luxin_nb --cov-report=term --cov-fail-under=79

# HTML report
PYTEST_DISABLE_PLUGIN_AUTOLOAD=1 pytest tests/ -p pytest_cov \
  --cov=luxin --cov=luxin_core --cov=luxin_nb --cov-report=html

Optional markers (see pytest.ini): -m streamlit, -m notebook, -m polars, -m slow.

Test Coverage

CI enforces a minimum combined line coverage across luxin, luxin_core, and luxin_nb (--cov-fail-under=79 in .github/workflows/ci.yml). The suite includes:

Core Inspector functionality
UI components (table view, drill stack, breadcrumbs, detail panel, filters, export, Phase 3 modules)
Jupyter/HTML rendering (luxin_nb)
Data validation and error handling
Polars integration
Configuration management
Integration workflows

🤝 Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

1. Fork the repository
2. Create a feature branch (git checkout -b feature/amazing-feature)
3. Commit your changes (git commit -m 'Add some amazing feature')
4. Push to the branch (git push origin feature/amazing-feature)
5. Open a Pull Request

Releasing a version: follow docs/releasing.md. In short: align each package’s pyproject.toml project.version and __version__ in luxin/__init__.py, luxin_core/luxin_core/__init__.py, and luxin_nb/luxin_nb/__init__.py, update CHANGELOG.md, then create and push an annotated tag vX.Y.Z (for example v0.4.0). The tag must match those versions: the PyPI workflow checks pyproject.toml and __version__ before publishing.

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

📌 Current release

v0.4.0: coordinated release of luxin, luxin-core, and luxin-nb. It adds monorepo CI and release checks, fixes manual drill with NA keys and show_drill_table when Streamlit is missing, broadens the tests (including luxin_nb HTML and Streamlit-mocked UI) behind a combined coverage gate in CI, and refreshes the docs (manual API, Releasing nav). Phase 3 Streamlit behavior is unchanged from v0.3.0. See CHANGELOG.md.

Requirements: Streamlit >= 1.35 for dataframe row selection (on_select / selection_mode).
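
If you want an app to fail early on an older Streamlit, a guard like the one below is one option. It is a sketch using only the standard library; the helper names are hypothetical, and the simple parser ignores pre-release suffixes such as rc1.

```python
import re
from importlib.metadata import PackageNotFoundError, version

MIN_STREAMLIT = (1, 35)  # minimum stated in the requirement above


def parse_release(text: str) -> tuple[int, ...]:
    """Parse the leading numeric components of a version such as '1.35.0'."""
    match = re.match(r"\d+(?:\.\d+)*", text)
    if match is None:
        return ()
    return tuple(int(part) for part in match.group().split("."))


def streamlit_supports_row_selection() -> bool:
    """Return True if an installed Streamlit meets the minimum version."""
    try:
        installed = version("streamlit")
    except PackageNotFoundError:
        return False
    return parse_release(installed) >= MIN_STREAMLIT


if __name__ == "__main__":
    print(streamlit_supports_row_selection())
```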

🗺️ Roadmap

See Roadmap for the full picture. At a glance:

v0.3.0: Phase 3 shipped — optional drill hierarchy, luxin.compare, quality dashboard, aggregation builder
v0.4.0 (current packaging line): aligned PyPI packages, notebook HTML tests, CI/release hygiene (not the Phase 1 feature milestone)
target v0.5.0: Phase 1 core enhancements (charts, richer filters, performance, export)
target v0.6.0: Phase 2 data sources (SQL, cloud, APIs)
target v0.7.0: Phase 4 collaboration & sharing
target v1.0.0: Phase 5 enterprise features

🔗 Links

🐙 GitHub Repository: https://github.com/eddiethedean/luxin
📦 PyPI Package: https://pypi.org/project/luxin/
📚 Documentation: https://luxin.readthedocs.io/
🐛 Issues: https://github.com/eddiethedean/luxin/issues

Made with ❤️ for the data exploration community

Release history

0.4.1 - May 16, 2026
0.4.0 - May 16, 2026 (this version)
0.3.0 - May 15, 2026
0.2.1 - May 15, 2026
0.1.0 - Oct 9, 2025

Download files

Source Distribution

luxin-0.4.0.tar.gz (50.2 kB), uploaded May 16, 2026, Source

Built Distribution

luxin-0.4.0-py3-none-any.whl (30.0 kB), uploaded May 16, 2026, Python 3

File details

Details for the file luxin-0.4.0.tar.gz.

File metadata

Download URL: luxin-0.4.0.tar.gz
Upload date: May 16, 2026
Size: 50.2 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.15

File hashes

Hashes for luxin-0.4.0.tar.gz
Algorithm	Hash digest
SHA256	`15a072d31fb78315aa719d4a3c268832b5285a711bc91a0e8308e620ad449ed2`
MD5	`985b9f78cb2f0dc1509a71b5049975d4`
BLAKE2b-256	`12fd8aa337f5311459bf7bd7673c1817f00e65bc0680be5b71cba529be9e307b`


File details

Details for the file luxin-0.4.0-py3-none-any.whl.

File metadata

Download URL: luxin-0.4.0-py3-none-any.whl
Upload date: May 16, 2026
Size: 30.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.15

File hashes

Hashes for luxin-0.4.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`8a5624c46e730123bd7e0d10d2c412a1068d885603f5c2bacce976cb012b2b3c`
MD5	`974cd9599132078f3569f00a53db22a5`
BLAKE2b-256	`fd3a2540d6becfec9d08fdc27d61904d7c341ceef9aeb2f7cff130e3d080faf5`

