bujji

BUJJI - The first truly local-first AI Agent SDK. Run powerful agents on your laptop with Ollama, AirLLM, and 13+ built-in tools.

These details have not been verified by PyPI

Project links

Project description

BUJJI

The first truly local-first AI Agent SDK. Run powerful autonomous agents on your laptop with Ollama, AirLLM, and 13+ built-in tools. No cloud, no API keys, no limits.

Quickstart • Features • Tools • Architecture • Examples • Contributing • Discussions

🚀 Why BUJJI?

Feature	BUJJI	Cloud Agents
Privacy	🔒 100% local	☁️ Data leaves your machine
Cost	$0 (your hardware)	$$$ per token
Latency	<50ms (GPU)	500ms-5s (network)
Offline	✅ Fully works	❌ Requires internet
Customization	Full source access	Limited APIs
Context	Unlimited (your VRAM)	32K-128K tokens
Tools	13+ built-in, extensible	Vendor-locked

⚡ Quickstart

# 1. Install Ollama (if not installed)
curl -fsSL https://ollama.com/install.sh | sh

# 2. Pull a model (qwen3 recommended for tools)
ollama pull qwen3

# 3. Install BUJJI
pip install bujji

# 4. Run your first agent
python -c "
import asyncio
from bujji import Agent, LocalAgentConfig

async def main():
    agent = Agent(LocalAgentConfig(model='qwen3', provider='ollama'))
    async with agent:
        resp = await agent.chat('Write a Python script that fetches GitHub trending repos')
        print(await resp.text())

asyncio.run(main())
"

That's it. Your agent runs locally, writes code, uses tools, and remembers context — all on your machine.

✨ Features

🧠 Smart Agent Loop

Automatic tool calling with parallel execution
Error isolation per tool (one failure ≠ loop crash)
Max 25 turns with configurable limits
Streaming responses with thinking tokens

🪟 Infinite Context Window

Automatic sliding window compression
LLM-powered summarization of old turns
Configurable trigger threshold (default 70%)
Preserves recent turns + system prompt always

🛠️ 13+ Built-in Tools

Tool	Capability
`filesystem`	Read/write/list/glob/copy/move/delete
`terminal`	Safe shell execution with timeout
`python_exec`	Sandboxed Python with stdout capture
`git`	status, diff, log, commit, branch, push
`github`	Issues, PRs, search, repo management
`web_search`	Brave/DuckDuckGo/Tavily
`browser`	Playwright: navigate, click, screenshot, extract
`docker`	ps, images, build, run, exec, logs
`documentation`	Search & extract from docs sites

🔌 Extensible Providers

Ollama (default) — local models
OpenAI-compatible — vLLM, LM Studio, LocalAI
OpenRouter — 100+ models via one API
AirLLM — run 70B+ on consumer GPUs
Custom — implement LLMProvider interface

🧩 MCP Integration

Connect to any Model Context Protocol server:

config = LocalAgentConfig(
    mcp_servers=[McpStdioServer(command="npx", args=["-y", "@modelcontextprotocol/server-github"])]
)

🪝 Hooks & Policies

from bujji.hooks import policy

config = LocalAgentConfig(
    policies=[policy.deny_tool("terminal")],  # Block dangerous tools
    hooks=[my_custom_hook],
)

💾 Dual-Backend Memory

SQLite — fast keyword search, metadata
ChromaDB — semantic vector search
Automatic embedding + retrieval

📋 Planning & Routing

Task decomposition into subtasks
Confidence-based routing (local vs escalate)
Structured output with Pydantic schemas

🛠️ Tools Deep Dive

from bujji import Agent, LocalAgentConfig, types

# All tools enabled by default
config = LocalAgentConfig(
    model="qwen3",
    capabilities=types.CapabilitiesConfig(
        enabled_tools=types.BuiltinTools.all_tools()
    )
)

Tool	Use Case	Example
`filesystem`	File operations	`"Read all .py files in src/"`
`terminal`	Run commands	`"Run pytest and show failures"`
`python_exec`	Execute code	`"Calculate fibonacci(50)"`
`git`	Version control	`"Show diff of last 3 commits"`
`github`	GitHub API	`"Create issue: bug in login"`
`web_search`	Search web	`"Latest Rust 1.80 features"`
`browser`	Web automation	`"Screenshot github.com/trending"`
`docker`	Container ops	`"Build and run my Dockerfile"`
`documentation`	Doc lookup	`"FastAPI dependency injection"`

🏗️ Architecture

┌─────────────────────────────────────────────────────────────┐
│                        LAYER 1: Agent                       │
│  High-level API, hooks, policies, MCP, triggers, memory, planner, router
├─────────────────────────────────────────────────────────────┤
│                      LAYER 2: Conversation                  │
│  Stateful session: chat(), send/receive steps, history, usage tracking
├─────────────────────────────────────────────────────────────┤
│                      LAYER 3: Connection                    │
│  LocalConnection: tool loop, context window, provider abstraction
├─────────────────────────────────────────────────────────────┤
│  Providers: Ollama │ OpenAI │ OpenRouter │ AirLLM │ Custom  │
│  Tools: 13 built-in + MCP + custom callables                │
└─────────────────────────────────────────────────────────────┘

Three-layer design lets you swap any component:

Use Agent for batteries-included experience
Use Conversation + Connection for custom loops
Use LLMService + ToolRunner for bare-metal control

💡 Examples

Code Generation Agent

from bujji import Agent, LocalAgentConfig

agent = Agent(LocalAgentConfig(
    model="qwen3",
    system_instructions="You are a senior Python developer. Write clean, typed, tested code."
))
async with agent:
    resp = await agent.chat("""
    Create a FastAPI app with:
    - JWT authentication
    - PostgreSQL + SQLAlchemy async
    - Redis caching
    - Pytest fixtures
    - Dockerfile
    """)
    print(await resp.text())

Research Agent with Web + Browser

agent = Agent(LocalAgentConfig(
    model="qwen3",
    capabilities=types.CapabilitiesConfig(
        enabled_tools=["web_search", "browser", "filesystem"]
    )
))
async with agent:
    resp = await agent.chat("""
    Research the latest techniques for quantization of LLMs.
    Search web, browse top papers, save summary to research.md
    """)
    print(await resp.text())

GitHub Automation Agent

agent = Agent(LocalAgentConfig(
    model="qwen3",
    capabilities=types.CapabilitiesConfig(
        enabled_tools=["github", "git", "terminal", "filesystem"]
    )
))
async with agent:
    resp = await agent.chat("""
    1. Check open issues labeled 'good first issue' in microsoft/vscode
    2. Pick one, create a branch, implement a fix
    3. Open a PR with description
    """)
    print(await resp.text())

Streaming with Thinking

async with agent:
    resp = await agent.chat("Solve this step by step: ...")
    async for chunk in resp:
        if isinstance(chunk, types.Thought):
            print(f"💭 {chunk.text}", end="", flush=True)
        elif isinstance(chunk, types.Text):
            print(chunk.text, end="", flush=True)

⚙️ Configuration

from bujji import LocalAgentConfig, types
from bujji.types import McpStdioServer, CapabilitiesConfig, BuiltinTools

config = LocalAgentConfig(
    # Model
    provider="ollama",
    model="qwen3",
    base_url="http://localhost:11434",
    
    # Generation
    temperature=0.1,
    max_tokens=4096,
    timeout=300,
    
    # Tools
    capabilities=CapabilitiesConfig(
        enabled_tools=BuiltinTools.all_tools(),
        enable_subagents=True,
    ),
    
    # MCP Servers
    mcp_servers=[
        McpStdioServer(command="npx", args=["-y", "@modelcontextprotocol/server-github"]),
    ],
    
    # Memory
    memory_enabled=True,
    memory_type="sqlite",  # or "chromadb"
    
    # Planning & Routing
    planner_enabled=True,
    router_enabled=True,
    
    # Hooks & Policies
    hooks=[my_hook],
    policies=[policy.deny_tool("terminal")],
)

📦 Installation

# Core
pip install bujji

# With browser automation (Playwright)
pip install "bujji[browser]"
playwright install chromium

# With CUDA for AirLLM
pip install "bujji[cuda]"

# Full install
pip install "bujji[all]"

Requirements: Python 3.12+, Ollama 0.30+

🧪 Testing

# Unit tests
pytest tests/ -v

# Stress tests (requires Ollama + qwen3)
python stress_test.py

# Lint & type check
ruff check .
mypy bujji

🤝 Contributing

We ❤️ contributions! See CONTRIBUTING.md.

git clone https://github.com/varshinicb1/bujji
cd bujji
pip install -e ".[dev]"
pre-commit install

Ways to Contribute

🐛 Bug reports & fixes
✨ New tools & providers
📖 Documentation & examples
🌍 Translations
⭐ Star the repo!

📊 Benchmarks

Task	qwen3 (4GB VRAM)	llama3.2 (4GB)	GPT-4o (cloud)
Simple chat	45ms	52ms	800ms
Tool call (fs)	120ms	140ms	1200ms
Code gen (100 loc)	2.1s	2.8s	3.5s
Multi-turn (10)	8.4s	11.2s	15.3s
Context compress	1.2s	1.5s	N/A

Run on RTX 4050 6GB, Ryzen 7 7840HS. Your mileage will vary.

🗺️ Roadmap

v2.2 — Web UI dashboard, agent marketplace
v2.3 — Voice I/O (Whisper + TTS), multi-modal
v2.4 — Distributed agents (Ray), A2A protocol
v3.0 — Self-improving agents, recursive self-distillation

📜 License

MIT License — see LICENSE for details.

🙏 Acknowledgments

Ollama — Making local LLMs accessible
AirLLM — Running 70B on 4GB VRAM
LangChain — Inspiration for tool abstractions
Antigravity SDK — Step/streaming patterns
All contributors — You make BUJJI possible

Made with ❤️ by Varshini CB and the BUJJI community

GitHub Stats

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

2.1.0

Jul 2, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

bujji-2.1.0.tar.gz (1.5 MB view details)

Uploaded Jul 2, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

bujji-2.1.0-py3-none-any.whl (67.5 kB view details)

Uploaded Jul 2, 2026 Python 3

File details

Details for the file bujji-2.1.0.tar.gz.

File metadata

Download URL: bujji-2.1.0.tar.gz
Upload date: Jul 2, 2026
Size: 1.5 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.6.14

File hashes

Hashes for bujji-2.1.0.tar.gz
Algorithm	Hash digest
SHA256	`291c02a08e432ddfccdd44b303ad6e8103842bfe94fb47613f9aee7c2efc690d`
MD5	`4d63750b32b997241ee3458c5f9e677f`
BLAKE2b-256	`5dc624f3893b5de1ac6ad9229c2ad26c3ccbcc9ed492a45bcb6e90a9aeb009a5`

See more details on using hashes here.

File details

Details for the file bujji-2.1.0-py3-none-any.whl.

File metadata

Download URL: bujji-2.1.0-py3-none-any.whl
Upload date: Jul 2, 2026
Size: 67.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.6.14

File hashes

Hashes for bujji-2.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`35e9c7431e28ea1ffe16d956141ac9baf8b9c2f31ead6fc748b2a6b69f7d187c`
MD5	`b3201b7b2d4220bdb0be633871cdecaf`
BLAKE2b-256	`610e31112b01e2a29909b258d3544db335d84d382858dfb1f7f8fb929b264b4c`

See more details on using hashes here.

bujji 2.1.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

BUJJI

🚀 Why BUJJI?

⚡ Quickstart

✨ Features

🧠 Smart Agent Loop

🪟 Infinite Context Window

🛠️ 13+ Built-in Tools

🔌 Extensible Providers

🧩 MCP Integration

🪝 Hooks & Policies

💾 Dual-Backend Memory

📋 Planning & Routing

🛠️ Tools Deep Dive

🏗️ Architecture

💡 Examples

Code Generation Agent

Research Agent with Web + Browser

GitHub Automation Agent

Streaming with Thinking

⚙️ Configuration

📦 Installation

🧪 Testing

🤝 Contributing

Ways to Contribute

📊 Benchmarks

🗺️ Roadmap

📜 License

🙏 Acknowledgments

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes