The Universal RAG Chatbot Factory. Zero-dependency AI deployments.

These details have not been verified by PyPI

Project links

Project description

ChatVat (The ChatBot🤖 Factory🏭)

The Universal RAG Chatbot Factory

🌟 The Vision

ChatVat is not just another chatbot script. It is a Manufacturing Plant for self-contained AI systems.

It solves the "It works on my machine" problem by adhering to a strict "Zero-Dependency" philosophy. ChatVat takes your raw data sources—websites, secured APIs, and documents—and fuses them with a production-grade RAG engine into a sealed Docker container. This "capsule" contains everything needed to run: the code, the database, the browser, and the API server.

You can deploy a ChatVat bot anywhere: from a MacBook Air to an air-gapped server in Antarctica, with nothing but Docker installed.

Core Philosophy

Split Architecture: A lightweight CLI (~15MB) for management, and a heavy-duty Docker container for the AI engine. No more installing 3GB of CUDA drivers on your laptop just to run a build tool.
Universal Connectivity: Acts as a generic MCP (Model Context Protocol) connector. Can ingest data from any API using custom headers and auth keys.

Disclaimer : We advise users to ensure they have authorized access to secured APIs and to follow the provider's guidelines when extracting data from sensitive APIs. Read the full disclaimer below for more details.

Self-Healing: Built-in deduplication (Content Hashing), crash recovery, and "Ghost Entry" prevention.
Production Parity: The bot you test locally is bit-for-bit identical to the bot you deploy, thanks to baked-in browser binaries.

⚡ Quick Start

1. Installation

Install the lightweight ChatVat CLI. (It installs in seconds and won't bloat your system).

pip install chatvat

2. Initialize the Assembly Line

Create a clean directory for your new bot and run the configuration wizard.

mkdir my-crypto-bot
cd my-crypto-bot
chatvat init

The wizard will guide you through:

Naming your bot
Setting up AI Brain (Groq Llama-3 + HuggingFace Embeddings)
Connecting Data Sources (URLs, Secured APIs, or Local Files)
Defining Deployment Ports

3. Build the Capsule

Compile your configuration and the ChatVat engine into a Docker Image.

chatvat build

What happens here? The CLI performs Source Injection: it copies the core engine code into a build context, injects your chatvat.config.json, and triggers a multi-stage Docker build. It optimizes the image by installing specific browser binaries (Chromium only) and purging build tools, keeping the final image lean.

4. Deploy Anywhere

Run your bot using standard Docker commands. Note the use of --ipc=host to prevent browser crashes on memory-heavy sites.

# Example: Running on Port 8000
docker run -d \
  -p 8000:8000 \
  --env-file .env \
  --ipc=host \
  --restart always \
  --name crypto-bot \
  chatvat-bot

🧠 Architecture Deep Dive

ChatVat implements a modular RAG (Retrieval-Augmented Generation) pipeline designed for resilience.

The Components

Component	Role	Description
The Builder	CLI Manager	Runs on host. Lightweight (~15MB). Orchestrates the factory process and Docker builds.
The Cortex	Intelligence	Powered by Groq for ultra-fast inference and HuggingFace for embeddings. Runs inside Docker.
The Memory	Vector Store	A persistent, thread-safe ChromaDB instance. Uses MD5 hashing to silently drop duplicate data during ingestion.
The Eyes	Crawler	A headless Chromium browser (via Crawl4AI/Playwright) managed with `--ipc=host` stability to read dynamic JS websites.
The Connector	Universal MCP	A polymorphic ingestor that can authenticate with secured APIs using environment-variable masking (e.g., `${API_KEY}`).
The API	Interface	A high-performance FastAPI server exposing REST endpoints.

The "Split Strategy" Workflow

Unlike traditional tools that force you to install heavy AI libraries locally:

Local (Host): You only have typer, rich, and requests. Fast and clean.
Container (Engine): The Docker build installs torch, langchain, playwright, and chromadb.
Result: You get the power of a heavy AI stack without polluting your local development environment.

🛠️ Configuration Guide

Your bot is defined by chatvat.config.json. You can edit this file manually after running init.

{
    "bot_name": "ChatVatBot",
    "port": 8000,
    "refresh_interval_minutes": 60,
    "system_prompt": "You are a helpful assistant for the .....",
    "llm_model": "llama-3.1-70b-versatile",
    "embedding_model": "all-MiniLM-L6-v2",
    "retriever_k": 5,
    "max_tokens": 400,
    "sources": [
        {
            "type": "static_url",
            "target": "[https://docs.stripe.com](https://docs.stripe.com)",
            "max_depth": 2,
            "recursion_scope": "restrictive"
        },
        {
            "type": "dynamic_json",
            "target": "[https://api.github.com/repos/my-org/my-repo/issues](https://api.github.com/repos/my-org/my-repo/issues)",
            "headers": {
                "Authorization": "Bearer ${GITHUB_TOKEN}",
                "Accept": "application/vnd.github.v3+json"
            }
        },
        {
            "type": "local_file",
            "target": "./policy_docs.pdf"
        }
    ]
}

Source Configuration

Field	Type	Default	Description
`type`	`string`	required	`"static_url"`, `"dynamic_json"`, or `"local_file"`
`target`	`string`	required	URL or local file path
`headers`	`object`	`{}`	Custom HTTP headers for API authentication
`max_depth`	`int`	`1`	Crawl depth. `1` = single page, `2+` = follow links recursively
`recursion_scope`	`string`	`"restrictive"`	`"restrictive"` = only follow links under the target path. `"domain"` = follow any link on the same domain

Global Configuration

Field	Type	Default	Description
`bot_name`	`string`	required	Name of your chatbot
`port`	`int`	`8000`	Deployment port for the FastAPI server
`refresh_interval_minutes`	`int`	`0`	Auto-update interval in minutes (0 = disabled)
`system_prompt`	`string`	(default assistant prompt)	Custom persona/system prompt for the LLM
`llm_model`	`string`	`"llama-3.3-70b-versatile"`	Groq LLM model name
`embedding_model`	`string`	`"all-MiniLM-L6-v2"`	HuggingFace embedding model
`retriever_k`	`int`	`5`	Number of top relevant chunks to retrieve from vector DB (1-20)
`max_tokens`	`int`	`400`	Count of total tokens to be produced in response depending upon task

Field Details

refresh_interval_minutes: Set to 0 to disable auto-updates.
static_url: Uses Playwright to render JavaScript before scraping.
max_depth: (static_url only) Controls how deep the crawler follows links. 1 = single page (default), 2+ = recursive BFS crawl.
recursion_scope: (static_url only) Controls which links the crawler follows. "restrictive" (default) = only links under the target path, "domain" = any same-domain link.
dynamic_json: Acts as a Universal Connector. Supports custom headers.
headers: Securely inject secrets using ${VAR_NAME} syntax. The engine resolves these from the container's environment variables at runtime.
llm_model: You can select your required Groq LLM model while initialising the ChatBot (e.g., llama-3.3-70b-versatile, mixtral-8x7b-32768).
embedding_model: HuggingFace model for generating text embeddings (e.g., all-MiniLM-L6-v2, all-mpnet-base-v2).
retriever_k: Controls how many relevant document chunks are retrieved from the vector database for each query. Higher values (e.g., 10-20) provide more context but may include less relevant information. Lower values (e.g., 2-3) are more focused but may miss contextual details. Default is 5.
max_tokens: Maximum number of tokens the LLM may generate in a single response. Use this to limit output length and guard against token-exhaustion attacks or runaway outputs. Default is 400.

📚 API Reference

Once the container is running, interact with it via HTTP REST API.

1. Health Check

Used by cloud balancers (AWS/Render) to verify the bot is alive.

GET /health

Response:

{
  "status": "healthy",
  "version": "0.1.10"
}

2. Chat Interface

The main endpoint for sending queries.

POST /chat

Payload:

{
  "message": "What is the return policy for digital items?"
}

Response:

{
  "message": "According to the policy document, digital items are non-refundable once downloaded..."
}

⚠️ Disclaimer & Legal Notice

Author: Madhav Kapila Project: ChatVat - Conversational AI & Web Crawling Engine

This software is provided for educational and research purposes only.

No Liability: The author (Madhav Kapila) is not responsible for any damage caused by the use of this tool. This includes, but is not limited to:
- IP bans or blacklisting of your device/server.
- Legal consequences of crawling restricted or sensitive websites.
- Data loss or corruption on the user's local machine or target infrastructure.
User Responsibility: You, the user, acknowledge that you are solely responsible for compliance with all applicable laws and regulations (such as GDPR, CFAA, or Terms of Service of target websites) when using this software.
"As Is" Warranty: This software is provided "as is", without warranty of any kind, express or implied, including but not limited to the warranties of merchantability, fitness for a particular purpose, and non-infringement.

By downloading, installing, or using this software, you agree to these terms.

Built with ❤️ by Madhav Kapila.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.2.6

Feb 27, 2026

0.2.5

Feb 27, 2026

0.2.4

Feb 19, 2026

0.2.3

Feb 17, 2026

0.2.2

Feb 15, 2026

0.2.1

Feb 14, 2026

0.2.0

Feb 14, 2026

0.1.13

Feb 12, 2026

0.1.12

Jan 29, 2026

0.1.11

Jan 29, 2026

0.1.10

Jan 8, 2026

0.1.9

Jan 2, 2026

0.1.8

Jan 1, 2026

0.1.7

Jan 1, 2026

0.1.6

Dec 30, 2025

0.1.5

Dec 30, 2025

0.1.4

Dec 30, 2025

0.1.3

Dec 30, 2025

0.1.1

Dec 30, 2025

0.1.0

Dec 30, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

chatvat-0.2.6.tar.gz (38.6 kB view details)

Uploaded Feb 27, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

chatvat-0.2.6-py3-none-any.whl (43.1 kB view details)

Uploaded Feb 27, 2026 Python 3

File details

Details for the file chatvat-0.2.6.tar.gz.

File metadata

Download URL: chatvat-0.2.6.tar.gz
Upload date: Feb 27, 2026
Size: 38.6 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: poetry/2.1.2 CPython/3.13.7 Linux/6.17.0-14-generic

File hashes

Hashes for chatvat-0.2.6.tar.gz
Algorithm	Hash digest
SHA256	`479976e87ea4edc5e302512b3057ae7375821fde15bb446f385438c744d22234`
MD5	`0514490f46ad5b034f6e227acc35f9ab`
BLAKE2b-256	`4598017d5c7de8181467eb20d0ba00389e002b58ee8076c1fdccbb78abee6199`

See more details on using hashes here.

File details

Details for the file chatvat-0.2.6-py3-none-any.whl.

File metadata

Download URL: chatvat-0.2.6-py3-none-any.whl
Upload date: Feb 27, 2026
Size: 43.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: poetry/2.1.2 CPython/3.13.7 Linux/6.17.0-14-generic

File hashes

Hashes for chatvat-0.2.6-py3-none-any.whl
Algorithm	Hash digest
SHA256	`f8631b907597ec1e0e90b7dbc63f21a6e1d34b104294417c3a63e1d6bc651e73`
MD5	`ccd8b6d586562785abed8e665c249d02`
BLAKE2b-256	`e7068074c4965abceef4f72c5f82a4eaebed9cc07bb9f5b5c3730d59392b7001`

See more details on using hashes here.

chatvat 0.2.6

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

ChatVat (The ChatBot🤖 Factory🏭)

🌟 The Vision

Core Philosophy

⚡ Quick Start

1. Installation

2. Initialize the Assembly Line

3. Build the Capsule

4. Deploy Anywhere

🧠 Architecture Deep Dive

The Components

The "Split Strategy" Workflow

🛠️ Configuration Guide

Source Configuration

Global Configuration

Field Details

📚 API Reference

1. Health Check

2. Chat Interface

⚠️ Disclaimer & Legal Notice

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes