The Ultimate Agentic RAG Framework: Autonomous Agents, Self-Healing Code Execution, Multimodal Vision, Live Watcher & Dynamic Security.

These details have not been verified by PyPI

Project links

Homepage

Project description

🧠 RostaingChain

RostaingChain is a production-ready framework designed to build autonomous RAG (Retrieval-Augmented Generation) systems. It bridges the gap between local privacy (Ollama, Local Docs) and cloud power (OpenAI, Groq, Datastores), featuring a unique Live Watcher that updates your AI's knowledge in real-time and Agentic RAG with Self-Healing Data Analysis.

🚀 Key Features

Hybrid Intelligence: Switch instantly between Local LLMs (Ollama, Llama.cpp) and Remote giants (OpenAI, Groq, Claude, Gemini, DeepSeek, Grok).
Live Watcher (Auto-Sync): Drop a file in a folder, modify a SQL row, or update a website -> The AI learns it instantly.
Deep Profiling (Anti-Hallucination): Automatically calculates statistics (Max, Min, Mean) for CSV/SQL data so the LLM never hallucinates numbers.
DLP Security: Built-in Redaction system to mask sensitive data: (

EMAIL: Email masked,
PHONE: Phone number masked,
ID_NUM: Personal ID masked,
PASSPORT: Passport number masked,
SSN: Social Security Number masked,
POSTAL: City/Postal Code masked,
BIC: BIC code confidential,
IBAN: IBAN bank details protected,
VAT_ID: VAT number masked,
CREDIT_CARD: Credit card number masked,
MONEY: Financial amount masked,
CRYPTO: Crypto wallet masked,
IP_ADDR: IP address masked,
MAC_ADDR: MAC address masked,
API_KEY: API Key redacted,
DATE: Date masked ) before display. Set to True for ALL filters, False to disable, or a list to select specific fields.

Multi-Modal Native: Understands Text, PDFs (OCR included), Images, Audio (Whisper), and YouTube videos.
Universal Sources: Connects to Local Files, PostgreSQL, MySQL, Oracle, SQLite, MongoDB, Neo4j, and the Web.

📦 Standard installation (Quick):

pip install rostaingchain
# Optional: Install OCR capabilities
pip install rostaing-ocr

📦 “Power User” installation (All-inclusive):

pip install rostaingchain[all]

📦 Specific installation (e.g., only for SQL/NoSQL and using remote LLMs):

pip install rostaingchain[database,llms]

📦 For office documents and advanced OCR:

pip install rostaingchain[docs,llms]

📦 For multimedia (YouTube, audio, video, web):

pip install rostaingchain[media,llms]

🔑 Managing API Keys (Remote LLMs)

To use remote LLMs (like OpenAI, Groq, Claude, Gemini, Grok, Mistral, DeepSeek) without hardcoding your credentials in the code, RostaingChain supports environment variables.

Create a file named .env in your project root.
Add your API keys following this format:

# Standard Providers
OPENAI_API_KEY=sk-proj-...
ANTHROPIC_API_KEY=sk-ant-...
GOOGLE_API_KEY=AIzaSy...

# Fast Inference Providers
GROQ_API_KEY=gsk_...
MISTRAL_API_KEY=...

# OpenAI-Compatible Providers
DEEPSEEK_API_KEY=sk-...
XAI_API_KEY=...

Load the keys at the start of your script using python-dotenv:

pip install python-dotenv

⚡ Quick Start

1. The "Chat with Anything" Mode

Simply point data_source to a file, a folder, a database, or a URL.

from rostaingchain import RostaingBrain

# Initialize the Brain
agent = RostaingBrain(
    llm_model="llama3.2",          # Use local Ollama
    data_source="./my_documents", # Watches this folder
    auto_update=True             # Real-time ingestion
)

# Chat
response = agent.chat("What are the main topics in these documents?")
print(response)

🛠️ Advanced Usage

1. YouTube Video Analysis

Extract transcripts and metadata automatically.

from rostaingchain import RostaingBrain
from dotenv import load_dotenv
# Load environment variables from .env file
load_dotenv()

agent = RostaingBrain(
    llm_model="openai/gpt-oss-120b",
    llm_provider="groq",
    data_source="https://www.youtube.com/watch?v=3mTK0vYYXA4",
    vector_db="faiss"
)

# Streaming response for better UX
generator = agent.chat("Summarize this video in 3 bullet points.", stream=True)

for token in generator:
    print(token, end="", flush=True)

2. Data Security (DLP)

Protect sensitive information from being displayed.

from rostaingchain import RostaingBrain

agent = RostaingBrain(
    llm_model="llama3.2",
    data_source="bank_statements.pdf",
    # Enable Security
    security_filters=["IBAN", "BIC", "PHONE", "EMAIL", "MONEY", "CREDIT_CARD"] # Optional: DLP Security. Set to True for ALL filters, False to disable, or a list to select specific fields.
)

response = agent.chat("Give me the IBAN of the supplier.")
print(response)
# Output: "The IBAN is [Protected IBAN bank details]."

3. Working with DataFrames (Pandas/Polars)

import pandas as pd
from rostaingchain import RostaingBrain
from dotenv import load_dotenv
# Load environment variables from .env file
load_dotenv()

df = pd.read_csv("titanic.csv") # supports: Polars

# Direct Memory Ingestion
agent = RostaingBrain(
    llm_model="gpt-4o",
    data_source=df,
    vector_db="chroma" 
)

print(agent.chat("What is the average age of passengers?"))

4. Audio Analysis with Streaming & Markdown Output

RostaingChain natively handles audio files (like .m4a, .mp3) using OpenAI Whisper locally. This example demonstrates how to process an audio file, enforce security filters, and stream the result in a specific JSON format.

from rostaingchain import RostaingBrain
from dotenv import load_dotenv
import os
# Load environment variables from .env file
load_dotenv()

# Assuming your API key is set
llm_api_key = os.getenv("GROQ_API_KEY")

agent = RostaingBrain(
    llm_model="openai/gpt-oss-120b",
    llm_provider="groq",
    llm_api_key=llm_api_key,
    data_source="C:/Users/Rostaing/Desktop/data/audio.m4a", # Supports: .m4a, .mp3, .wav, .ogg, .flac, .webm
    poll_interval=3600, # Check for file updates every hour
    vector_db="faiss",  # Options: 'faiss' or 'chroma'
    reset_db=True,      # Re-index the file on startup
    memory=True,        # Enable conversation history
    security_filters=["PHONE", "BIC", "IBAN", "DATE"], # Optional: DLP Security. Set to True for ALL filters, False to disable, or a list to select specific fields.
    stream=True,
    output_format="markdown" # Options: "json", "text"
)

# Request a summary in JSON format with streaming enabled
response = agent.chat("Give me a summary.") # output_format supports: "json", "text (default)", "markdown", "toon"

# Real-time display loop
for token in response:
    # Prints every token as soon as it arrives (ChatGPT-like effect)
    print(token, end="", flush=True)

5. Chat with a Website (Web RAG)

from RostaingChain import RostaingBrain
from dotenv import load_dotenv
# Load environment variables from .env file
load_dotenv()

# Direct Memory Ingestion
agent = RostaingBrain(
    llm_model="gpt-4o",
    llm_provider="openai",
    data_source="https://en.wikipedia.org/wiki/Artificial_intelligence",
    vector_db="chroma",  # Options: 'faiss' or 'chroma'
)

response = gent.chat("Give me a summary.")
print(response)

6. Chat with an image (RAG)

from RostaingChain import RostaingBrain

# Direct Memory Ingestion
agent = RostaingBrain(
    llm_model="llama3.2", # Ensure you ran 'ollama pull llama3.2' in your terminal
    llm_provider="ollama", # Runs 100% locally on your machine for privacy
    embedding_model="nomic-embed-text", # Ensure you ran 'ollama pull nomic-embed-text' in your terminal
    data_source="invoice.jpg", # Supports: .png, .jpeg, .bmp, .tiff, .webp
    memory=True, # Enable conversation history
    vector_db="chroma",  # Options: 'faiss' or 'chroma'
)

response = gent.chat("Give me a summary.")
print(response)

7. Video Analysis with Streaming & Markdown Output

from RostaingChain import RostaingBrain
from dotenv import load_dotenv
# Load environment variables from .env file
load_dotenv()

# Direct Memory Ingestion
agent = RostaingBrain(
    llm_model="gpt-4o",
    llm_provider="openai",
    data_source="my_video.mp4", # Supports: .avi, .mov, .mkv
    vector_db="chroma",  # Options: 'faiss' or 'chroma'
    stream=True,
    output_format="markdown" # Options: "json", "text"
)

response = gent.chat("Give me a summary.") # output_format supports: "json", "text (default)", "markdown", "toon"

# Real-time display loop
for token in response:
    # Prints every token as soon as it arrives (ChatGPT-like effect)
    print(token, end="", flush=True)

8. Chat with a file (Streaming RAG)

from RostaingChain import RostaingBrain
from dotenv import load_dotenv
# Load environment variables from .env file
load_dotenv()

# Direct Memory Ingestion
agent = RostaingBrain(
    llm_model="gpt-4o",
    llm_provider="openai",
    data_source="my_file.txt", # Supports: .pdf, .docx, .doc, .xlsx, .xls, .pptx, .ppt, .html, .htm, .xml, .epub, .md, .json, .log, .py, .js, .sql, .yaml, .ini, etc.
    vector_db="chroma",  # Options: 'faiss' or 'chroma'
    stream=True,
    output_format="markdown"
)

response = gent.chat("Give me a summary.")

# Real-time display loop
for token in response:
    # Prints every token as soon as it arrives (ChatGPT-like effect)
    print(token, end="", flush=True)

9. Connecting to Databases (SQL / NoSQL)

RostaingChain uses a Polling Watcher to monitor database changes.

from rostaingchain import RostaingBrain
from dotenv import load_dotenv
# Load environment variables from .env file
load_dotenv()

# PostgreSQL Configuration
db_config = {
    "type": "sql",
    "connection_string": "postgresql+psycopg2://user:pass@localhost:5432/finance_db",
    "query": "SELECT * FROM sales_2024"
}

agent = RostaingBrain(
    llm_model="gpt-4o",
    llm_provider="openai",
    data_source=db_config,
    poll_interval=30, # Check for DB changes every 30 seconds
    reset_db=True,     # Start with a fresh index
    vector_db="faiss"
)

print(agent.chat("What is the total revenue for Q1?"))
# Thanks to Deep Profiling, the AI will know the exact sum/mean/max.

10 🗄️ Database Configuration Examples

To connect RostaingChain to a database, create a dictionary db_config and pass it to the data_source parameter.

1. SQL Databases (via SQLAlchemy)

PostgreSQL

pg_config = {
"type": "sql",
"connection_string": "postgresql+psycopg2://user:pass@localhost:5432/finance_db",
"query": "SELECT * FROM sales_2026"
}

MySQL

mysql_config = {
    "type": "sql",
    "connection_string": "mysql+pymysql://username:password@localhost:3306/my_database",
    "query": "SELECT * FROM orders WHERE status = 'shipped'"
}

Oracle

# Requires Oracle Instant Client installed
oracle_config = {
    "type": "sql",
    "connection_string": "oracle+cx_oracle://username:password@localhost:1521/?service_name=ORCL",
    "query": "SELECT * FROM employees"
}

SQLite

sqlite_config = {
    "type": "sql",
    "connection_string": "sqlite:///C:/path/to/my_data.db",
    "query": "SELECT * FROM invoices"
}

Microsoft SQL Server

mssql_config = {
    "type": "sql",
    "connection_string": "mssql+pymssql://username:password@localhost:1433/my_database",
    "query": "SELECT top 100 * FROM customers"
}

2. NoSQL Databases

MongoDB

mongo_config = {
    "type": "mongodb",
    "uri": "mongodb://localhost:27017/",
    "db": "ecommerce_db",
    "collection": "products",
    "limit": 50 # Optional: Limit the number of documents to ingest
}

Neo4j (Graph)

neo4j_config = {
    "type": "neo4j",
    "uri": "bolt://localhost:7687",
    "user": "neo4j",
    "password": "your_password",
    "query": "MATCH (p:Person)-[:WROTE]->(a:Article) RETURN p.name, a.title LIMIT 20"
}

Usage Example

agent = RostaingBrain(
    llm_model="gpt-4o",
    data_source=mysql_config, # Pass the dictionary here.
    poll_interval=60,         # Watch for changes every minute
    reset_db=True
)

11. Use a custom LLM (e.g., vLLM on another server)

from RostaingChain import RostaingBrain
from dotenv import load_dotenv
# Load environment variables from .env file
load_dotenv()

# Direct Memory Ingestion
agent = RostaingBrain(
    llm_model="my-finetuned-model",
    llm_provider="custom",
    llm_base_url="http://192.168.1.50:8000/v1", # Your vLLM server
    llm_api_key="token-if-needed",
    memory=True,
    vector_db="chroma",  # Options: 'faiss' or 'chroma'
    data_source="my_file.pdf", # Supports: .txt, .docx, .doc, .xlsx, .xls, .pptx, .ppt, .html, .htm, .xml, .epub, .md, .json, .log, .py, .js, .sql, .yaml, .ini, .jpg, .png, .jpeg, .bmp, .tiff, .webp, SQL/NoSQL Databases, Audio/Video/Web(link)
    reset_db=True, # Start with a fresh index
    temperature=0,
    top_k=0.1,
    top_p=1,
    max_tokens=1500,
    stream=True
)

response = gent.chat("Give me a summary.")

# Real-time display loop
for token in response:
    # Prints every token as soon as it arrives (ChatGPT-like effect)
    print(token, end="", flush=True)

12. Universal Intelligence: Switching LLM Providers

A. Use DeepSeek (the cheaper GPT-4 alternative)

agent = RostaingBrain(
    llm_model="deepseek-chat", # Auto-detection
    provider="deepseek",
    # If the key is not in the .env:
    llm_api_key="sk-your-deepseek-key" 
)

B. Use Groq (Lightning speed – 500 tokens/s)

agent = RostaingBrain(
    llm_model="openai/gpt-oss-120b",
    llm_provider="groq" # Force the provider to ensure it
)

C. Use Claude Sonnet (Best for coding)

agent = RostaingBrain(
    llm_model="claude-4.5-sonnet",
    llm_provider="anthropic" # Force the provider to ensure it
)

D. Use Gemini 3 Pro (Google)

agent = RostaingBrain(
    llm_model="gemini-3-pro-preview",
    llm_provider="google" # Force the provider to ensure it
)

E. Use Mistral (via Groq for Speed)

agent = RostaingBrain(
    llm_model="mistral-large-2512",
    llm_provider="mistral" # Force the provider for ultra-fast inference
)

F. Use Grok (xAI)

agent = RostaingBrain(
     llm_model="grok-4.1",
    llm_provider="grok" # Automatically configures the xAI API base_url
)

G. Use OpenAI (GPT-4o)

agent = RostaingBrain(
    llm_model="gpt-4o",
    llm_provider="openai" # Automatically uses OPENAI_API_KEY from your .env file
)

H. Use Local LLMs (Ollama)

agent = RostaingBrain(
    llm_model="llama3.2",  # Ensure you ran 'ollama pull llama3.2' in your terminal
    llm_provider="ollama", # Runs 100% locally on your machine for privacy
    # llm_base_url="http://localhost:11434" # Optional: Default URL
)

📝 Key Parameters Explained

stream=True: This is essential for User Experience (UX). Instead of waiting for the entire response to be generated (which can take time for long summaries), the method returns a Python Generator. You must iterate over it (using a for loop) to display tokens in real-time, exactly like ChatGPT.
output_format: This parameter enforces the structure or style of the LLM's response. It accepts three values:
- "text" (Default): A standard, conversational plain text response.
- "json": Forces the LLM to output a valid JSON object. Extremely useful if you are building an API or need to parse the result programmatically.
vector_db: Defines the local vector storage engine. RostaingChain currently supports two robust, file-based options:
- "chroma": Uses ChromaDB.
- "faiss": Uses Facebook AI Similarity Search (highly efficient for CPU).

⚙️ Configuration Parameters

Parameter	Type	Default	Description
RostaingBrain
`llm_model`	str	`"llama3.2"`	Name of the model (e.g., "gpt-4o", "claude-3-opus", "mistral").
`llm_provider`	str	`"auto"`	"openai", "groq", "ollama", "anthropic", "google", "deepseek".
`llm_api_key`	str	`None`	API Key (optional if environment variable is set).
`llm_base_url`	str	`None`	Custom endpoint URL (for local setups or proxies).
`embedding_model`	str	`"BAAI/bge-small-en-v1.5"`	Model used for vectorizing documents.
`embedding_source`	str	`"fastembed"`	"fastembed", "openai", "ollama", "huggingface".
`vector_db`	str	`"chroma"`	Vector Store backend: "chroma", "faiss", "qdrant".
`data_source`	str/dict/obj	`"./data"`	File path, Folder path, Image path, URL, SQL Config (dict), or DataFrame object.
Automation
`auto_update`	bool	`True`	Activates real-time Watcher (File system) or Polling (DB/Web).
`poll_interval`	int	`60`	Interval in seconds between DB/Web checks.
`reset_db`	bool	`False`	Wipes/Resets vector database storage on startup.
`memory`	bool	`False`	Enables conversational history (Multi-turn chat).
Generation Settings
`temperature`	float	`0.1`	Creativity of the model (0.0 = deterministic, 1.0 = creative).
`max_tokens`	int	`None`	Limit response length.
`top_p`	float	`None`	Nucleus sampling parameter.
`top_k`	int	`None`	Top-K sampling parameter.
`seed`	int	`None`	Seed for reproducible/deterministic outputs.
`stream`	bool	`False`	Enables streaming response (token by token).
`cache`	bool	`True`	Enables In-Memory caching for speed.
`output_format`	str	`"text"`	Enforce format: `"text"`, `"json"`, `"markdown"`.
Agent Identity
`role`	str	`"Helpful AI Assistant"`	Defines the persona/role of the agent.
`goal`	str	`"Assist the user..."`	The primary objective of the agent.
`instructions`	str	`"Answer concisely."`	Specific behavioral instructions or constraints.
`reflection`	bool	`False`	Enables "Step-by-step" thinking and self-correction before answering.
Company Context
`company_name`	str	`None`	Name of the organization for business context.
`company_description`	str	`None`	Description of the company's activity.
`company_url`	str	`None`	Website URL for context.
Security & User
`security_filters`	list/bool	`None`	List of DLP filters (e.g., `["IBAN", "EMAIL"]`) or `True` for all.
`user_profile`	str	`None`	Natural language description of user rights (e.g., "Intern, no access to salaries").
`user_id`	str	`None`	Unique identifier for the user.
`session_id`	str	`None`	Unique identifier for the chat session.
`agent_id`	str	`None`	Unique identifier for the specific agent instance.
`system_prompt`	str	`None`	Full override of the system prompt (Advanced).
Tools & UI
`mcp_tools`	list	`None`	List of Model Context Protocol tools for external integrations.
`canvas`	object	`None`	Canvas UI instance for visual updates (Charts/Graphs).

💡 Pro Tip: VSCode Autocomplete

Don't memorize the parameters! If you are using VSCode, you can view the complete list of available options for RostaingBrain instantly.

Just place your cursor inside the parentheses and press:

Ctrl + Space

This will trigger IntelliSense and display all configuration arguments (like memory, security_filters, temperature, cache, etc.) with their descriptions.

🏗️ Architecture

alt text

Useful Links

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

1.3.0

Feb 20, 2026

1.2.0

Feb 20, 2026

1.1.0

Feb 10, 2026

1.0.5

Feb 10, 2026

1.0.4

Feb 10, 2026

1.0.3

Feb 10, 2026

1.0.2

Feb 8, 2026

1.0.1

Feb 7, 2026

This version

1.0.0

Feb 6, 2026

0.2.3

Jan 24, 2026

0.2.2

Jan 23, 2026

0.2.1

Jan 23, 2026

0.2.0

Jan 23, 2026

0.1.5

Jan 23, 2026

0.1.4

Jan 23, 2026

0.1.3

Jan 23, 2026

0.1.2

Jan 22, 2026

0.1.1

Jan 22, 2026

0.1.0

Jan 22, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

rostaingchain-1.0.0.tar.gz (91.0 kB view details)

Uploaded Feb 6, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

rostaingchain-1.0.0-py3-none-any.whl (89.4 kB view details)

Uploaded Feb 6, 2026 Python 3

File details

Details for the file rostaingchain-1.0.0.tar.gz.

File metadata

Download URL: rostaingchain-1.0.0.tar.gz
Upload date: Feb 6, 2026
Size: 91.0 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for rostaingchain-1.0.0.tar.gz
Algorithm	Hash digest
SHA256	`a997f48e6d4b7787b6cf8990a3e92242b65b45425b3ea09b2014ed8c5061b114`
MD5	`dd5c6351b87047f5cc9bad9dab4a0418`
BLAKE2b-256	`87b749e82152b075cde7196383736afa4cfceacb30d83d496913aa89369b9526`

See more details on using hashes here.

File details

Details for the file rostaingchain-1.0.0-py3-none-any.whl.

File metadata

Download URL: rostaingchain-1.0.0-py3-none-any.whl
Upload date: Feb 6, 2026
Size: 89.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for rostaingchain-1.0.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`084ced035678e25fa8996d34c3e8e9b6aadf5da5039811c870882e0df61ed04e`
MD5	`14650d0d0db715ffc243fecf2adf9782`
BLAKE2b-256	`c0ed93b9d4c637041e2aef47fd0b6d7cd8c6f14b74065dff772c3347fd480b99`

See more details on using hashes here.

rostaingchain 1.0.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

🧠 RostaingChain

🚀 Key Features

📦 Standard installation (Quick):

📦 “Power User” installation (All-inclusive):

📦 Specific installation (e.g., only for SQL/NoSQL and using remote LLMs):

📦 For office documents and advanced OCR:

📦 For multimedia (YouTube, audio, video, web):

🔑 Managing API Keys (Remote LLMs)

⚡ Quick Start

1. The "Chat with Anything" Mode

🛠️ Advanced Usage

1. YouTube Video Analysis

2. Data Security (DLP)

3. Working with DataFrames (Pandas/Polars)

4. Audio Analysis with Streaming & Markdown Output

5. Chat with a Website (Web RAG)

6. Chat with an image (RAG)

7. Video Analysis with Streaming & Markdown Output

8. Chat with a file (Streaming RAG)

9. Connecting to Databases (SQL / NoSQL)

10 🗄️ Database Configuration Examples

1. SQL Databases (via SQLAlchemy)

2. NoSQL Databases

Usage Example

11. Use a custom LLM (e.g., vLLM on another server)

12. Universal Intelligence: Switching LLM Providers

📝 Key Parameters Explained

⚙️ Configuration Parameters

💡 Pro Tip: VSCode Autocomplete

🏗️ Architecture

Useful Links

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes