Command line AI for customer data

These details have not been verified by PyPI

Project links

Project description

Chuck Data

Chuck is a text-based user interface (TUI) for managing Databricks resources including Unity Catalog, SQL warehouses, models, and volumes. Chuck Data provides an interactive shell environment for customer data engineering tasks with AI-powered assistance.

Check us out at chuckdata.ai.

Join our community on Discord.

Features

Interactive TUI for managing Databricks resources
AI-powered "agentic" data engineering assistant
Identity resolution powered by Amperity's Stitch
Use LLMs from your Databricks account via Databricks Model Serving
Browse Unity Catalog resources (catalogs, schemas, tables)
Profile database tables with automated PII detection (via LLMs)
Tag tables in Unity Catalog with semantic tags for PII to power compliance and data governance use cases
Command-based interface with both natural language commands and slash commands

Authentication

Authenticates with Databricks using personal access tokens
Authenticates with Amperity using API keys (/login and /logout commands)

LLM Provider Support

Chuck supports multiple LLM providers, allowing you to choose the best option for your use case:

Supported Providers

Databricks (default) - Use LLMs from your Databricks account via Model Serving
AWS Bedrock - Use AWS Bedrock foundation models (Claude, Llama, Nova, etc.)
OpenAI - Direct OpenAI API integration (coming soon)
Anthropic - Direct Anthropic API integration (coming soon)

AWS Bedrock Setup

To use AWS Bedrock as your LLM provider:

Install AWS dependencies:
```
pip install chuck-data[aws]
```

Configure AWS credentials:

Option 1: AWS SSO (Recommended for enterprise)

# Login via SSO
aws sso login --profile your-profile

# Set profile for session
export AWS_PROFILE=your-profile
export AWS_REGION=us-east-1

Option 2: Environment variables

export AWS_REGION=us-east-1
export AWS_ACCESS_KEY_ID=your-access-key
export AWS_SECRET_ACCESS_KEY=your-secret-key

Option 3: AWS CLI configuration (~/.aws/credentials)

[default]
aws_access_key_id = your-access-key
aws_secret_access_key = your-secret-key
region = us-east-1

Option 4: IAM role (for EC2/ECS/Lambda deployments)

Set LLM provider:

Via environment variable:

export CHUCK_LLM_PROVIDER=aws_bedrock
chuck

Or via config file (~/.chuck_config.json):

{
  "llm_provider": "aws_bedrock",
  "active_model": "anthropic.claude-3-5-sonnet-20240620-v1:0",
  "llm_provider_config": {
    "aws_bedrock": {
      "region": "us-east-1"
    }
  }
}

Request model access in AWS Bedrock console:

Some models require explicit approval before use. Visit the AWS Bedrock console and request access to your desired models.

Use /list-models within Chuck to see all available models in your AWS account.

Provider Selection Priority

Chuck resolves the LLM provider in this order:

CHUCK_LLM_PROVIDER environment variable (highest priority)
llm_provider in config file
Default: databricks

Installation

Homebrew (Recommended)

brew tap amperity/chuck-data
brew install chuck-data

pip

pip install chuck-data

Usage

Chuck Data provides an interactive text-based user interface. Run the application using:

chuck

Or run directly with Python:

python -m chuck_data

Available Commands

Chuck Data supports a command-based interface with slash commands that can be used within the interactive TUI. Type /help within the application to see all available commands.

Some general commands to be aware of are:

/status - Show current connection status and application context
/login, /logout - Log in/out of Amperity, this is how Chuck interacts with Amperity to run Stitch
/list-models, /select-model <model_name> - Configure which LLM Chuck should use (Pick one designed for tools, we recommend databricks-claude-3-7-sonnet)
/list-warehouses, /select-warehouse <warehouse_name> - Many Chuck tools run SQL so make sure to select a warehouse

Many of Chuck's tools will use your selected Catalog and Schema so that you don't have to constantly specify them. Use these commands to manage your application context.

Catalog & Schema Management

/catalogs, /select-catalog <catalog_name> - Manage Catalog context
/schemas, /select-schema <schema_name> - Manage Schema context

Known Limitations & Best Practices

Known Limitations

Unstructured data - Stitch will ignore fields in formats that are not supported
GCP Support - Currently only AWS and Azure are formally supported, GCP will be added very soon
Stitching across Catalogs - Technically if you manually create Stitch manifests it can work but Chuck doesn't automatically handle this well

Best Practices

Use models designed for tools, we recommend databricks-claude-3-7-sonnet but have also tested extensively with databricks-llama-3.2-7b-instruct
Denormalized data models will work best with Stitch
Sample data to try out Stitch is available on the Databricks marketplace. (Use the bronze schema PII datasets)

Amperity Stitch

A key tool Chuck can use is Amperity's Stitch algorithm. This is a ML based identity resolution algorithm that has been refined with the world's biggest companies over the last decade.

Stitch outputs two tables in a schema called stitch_outputs. unified_coalesced is a table of standardized PII with Amperity IDs. unified_scores are the "edges" of the graph that have links and confidence scores for each match.
Stitch will create a new notebook in your workspace each time it runs that you can use to understand the results, be sure to check it out!
For a detailed breakdown of how Stitch works, see this great article breaking it down step by step

Support

Chuck is a research preview application that is actively being improved based on your usage and feedback. Always be sure to update to the latest version of Chuck to get the best experience!

Support Options

GitHub Issues
Report bugs or request features on our GitHub repository:
https://github.com/amperity/chuck-data/issues
Discord Community
Join our community to chat with other users and developers:
https://discord.gg/f3UZwyuQqe
Or run /discord in the application
Email Support
Contact our dedicated support team:
chuck-support@amperity.com
In-app Bug Reports
Let Chuck submit a bug report automatically with the /bug command

Development

Requirements

Python 3.10 or higher
uv - Python package installer and resolver (technically this is not required but it sure makes life easier)

Project Structure

chuck_data/             # Main package
├── __init__.py
├── __main__.py         # CLI entry point
├── commands/           # Command implementations
├── ui/                 # User interface components
├── agent/              # AI agent functionality
├── clients/            # External service clients
├── databricks/         # Databricks utilities
└── ...                 # Other modules

Installation

Install the project with development dependencies:

uv pip install -e .[dev]

Testing

Run the test suite:

uv run -m pytest

Run linters and static analysis:

uv run ruff .
uv run black --check --diff chuck_data tests
uv run ruff check
uv run pyright

For test coverage:

uv run -m pytest --cov=chuck_data

CI/CD

This project uses GitHub Actions for continuous integration:

Automated testing on Python 3.10
Code linting with flake8
Format checking with Black

The CI workflow runs on every push to main and on pull requests. You can also trigger it manually from the Actions tab in GitHub.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.5.0

Mar 24, 2026

0.4.9

Mar 17, 2026

0.4.8

Mar 2, 2026

0.4.7

Feb 24, 2026

0.4.6

Feb 24, 2026

0.4.5

Feb 13, 2026

This version

0.4.4

Feb 13, 2026

0.4.3

Feb 6, 2026

0.3.2

Jan 12, 2026

0.3.1

Jan 7, 2026

0.3.0

Dec 9, 2025

0.2.2

Nov 11, 2025

0.2.1

Nov 7, 2025

0.2.0

Nov 6, 2025

0.1.3

Jun 9, 2025

0.1.2

Jun 8, 2025

0.1.0

Jun 5, 2025

0.0.1

May 23, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

chuck_data-0.4.4.tar.gz (556.8 kB view details)

Uploaded Feb 13, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

chuck_data-0.4.4-py3-none-any.whl (380.1 kB view details)

Uploaded Feb 13, 2026 Python 3

File details

Details for the file chuck_data-0.4.4.tar.gz.

File metadata

Download URL: chuck_data-0.4.4.tar.gz
Upload date: Feb 13, 2026
Size: 556.8 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for chuck_data-0.4.4.tar.gz
Algorithm	Hash digest
SHA256	`bbcd0869a174909a0b1cc76a4cf6f7d890abcf57a347abcefd93882cab913aa8`
MD5	`4107505e51de2f662c591a43adabe9a2`
BLAKE2b-256	`691b159d2de150317093c9423a4526080d3c0bb8e4a4683718bbc8349caf4f1c`

See more details on using hashes here.

File details

Details for the file chuck_data-0.4.4-py3-none-any.whl.

File metadata

Download URL: chuck_data-0.4.4-py3-none-any.whl
Upload date: Feb 13, 2026
Size: 380.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for chuck_data-0.4.4-py3-none-any.whl
Algorithm	Hash digest
SHA256	`7d1579cc2ac09d849e797933a557a4cc4f68a5cae0f7ef3ee778525792f995b1`
MD5	`62e04a998b65fe6ecc645c39f4032bae`
BLAKE2b-256	`f97084f9fe0971791c4b97955e2d87a4fd9bc0a4bcc23651a6149b0c95978313`

See more details on using hashes here.

chuck-data 0.4.4

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Chuck Data

Features

Authentication

LLM Provider Support

Supported Providers

AWS Bedrock Setup

Provider Selection Priority

Installation

Homebrew (Recommended)

pip

Usage

Available Commands

Some general commands to be aware of are:

Catalog & Schema Management

Known Limitations & Best Practices

Known Limitations

Best Practices

Amperity Stitch

Support

Support Options

Development

Requirements

Project Structure

Installation

Testing

CI/CD

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes