Fast, embedded, and multi-modal DB based on SQLite for AI-powered applications.

Project description

beaver 🦫

A fast, single-file, multi-modal database for Python, built with the standard sqlite3 library.

beaver is the Backend for Embedded, All-in-one Vector, Entity, and Relationship storage. It's a simple, local, and embedded database designed to manage complex, modern data types without requiring a database server, built on top of SQLite.

Design Philosophy

beaver is built with a minimalistic philosophy for small, local use cases where a full-blown database server would be overkill.

Minimalistic: Uses only Python's standard libraries (sqlite3) and numpy/scipy.
Schemaless: Flexible data storage without rigid schemas across all modalities.
Synchronous, Multi-Process, and Thread-Safe: Designed for simplicity and safety in multi-threaded and multi-process environments.
Built for Local Applications: Perfect for local AI tools, RAG prototypes, chatbots, and desktop utilities that need persistent, structured data without network overhead.
Fast by Default: It's built on SQLite, which is famously fast and reliable for local applications. The vector search is accelerated with an in-memory k-d tree.
Standard Relational Interface: While beaver provides high-level features, you can always use the same SQLite file for normal relational tasks with standard SQL.

Core Features

Sync/Async High-Efficiency Pub/Sub: A powerful, thread and process-safe publish-subscribe system for real-time messaging with a fan-out architecture. Sync by default, but with an as_async wrapper for async applications.
Namespaced Key-Value Dictionaries: A Pythonic, dictionary-like interface for storing any JSON-serializable object within separate namespaces with optional TTL for cache implementations.
Pythonic List Management: A fluent, Redis-like interface for managing persistent, ordered lists.
Persistent Priority Queue: A high-performance, persistent queue that always returns the item with the highest priority, perfect for task management.
Efficient Vector Storage & Search: Store vector embeddings and perform fast approximate nearest neighbor searches using an in-memory k-d tree.
Full-Text Search and Fuzzy: Automatically index and search through document metadata using SQLite's powerful FTS5 engine, enhanced with optional fuzzy saerch.
Graph Traversal: Create relationships between documents and traverse the graph to find neighbors or perform multi-hop walks.
Single-File & Portable: All data is stored in a single SQLite file, making it incredibly easy to move, back up, or embed in your application.

Installation

pip install beaver-db

Quickstart

Get up and running in 30 seconds. This example showcases a dictionary, a list, and full-text search in a single script.

from beaver import BeaverDB, Document

# 1. Initialize the database
db = BeaverDB("data.db")

# 2. Use a namespaced dictionary for app configuration
config = db.dict("app_config")
config["theme"] = "dark"
print(f"Theme set to: {config['theme']}")

# 3. Use a persistent list to manage a task queue
tasks = db.list("daily_tasks")
tasks.push("Write the project report")
tasks.push("Deploy the new feature")
print(f"First task is: {tasks[0]}")

# 4. Use a collection for document storage and search
articles = db.collection("articles")
doc = Document(
    id="sqlite-001",
    content="SQLite is a powerful embedded database ideal for local apps."
)
articles.index(doc)

# Perform a full-text search
results = articles.match(query="database")
top_doc, rank = results[0]
print(f"FTS Result: '{top_doc.content}'")

db.close()

Things You Can Build with Beaver

Here are a few ideas to inspire your next project, showcasing how to combine Beaver's features to build powerful local applications.

1. AI Agent Task Management

Use a persistent priority queue to manage tasks for an AI agent. This ensures the agent always works on the most important task first, even if the application restarts.

tasks = db.queue("agent_tasks")

# Tasks are added with a priority (lower is higher)
tasks.put({"action": "summarize_news"}, priority=10)
tasks.put({"action": "respond_to_user"}, priority=1)
tasks.put({"action": "run_backup"}, priority=20)

# The agent retrieves the highest-priority task
next_task = tasks.get() # -> Returns the "respond_to_user" task
print(f"Agent's next task: {next_task.data['action']}")

2. User Authentication and Profile Store

Use a namespaced dictionary to create a simple and secure user store. The key can be the username, and the value can be a dictionary containing the hashed password and other profile information.

users = db.dict("user_profiles")

# Create a new user
users["alice"] = {
    "hashed_password": "...",
    "email": "alice@example.com",
    "permissions": ["read", "write"]
}

# Retrieve a user's profile
alice_profile = users.get("alice")

3. Chatbot Conversation History

A persistent list is perfect for storing the history of a conversation. Each time the user or the bot sends a message, just push it to the list. This maintains a chronological record of the entire dialogue.

chat_history = db.list("conversation_with_user_123")

chat_history.push({"role": "user", "content": "Hello, Beaver!"})
chat_history.push({"role": "assistant", "content": "Hello! How can I help you today?"})

# Retrieve the full conversation
for message in chat_history:
    print(f"{message['role']}: {message['content']}")

4. Build a RAG (Retrieval-Augmented Generation) System

Combine vector search and full-text search to build a powerful RAG pipeline for your local documents.

# Get context for a user query like "fast python web frameworks"
vector_results = [doc for doc, _ in docs.search(vector=query_vector)]
text_results = [doc for doc, _ in docs.match(query="python web framework")]

# Combine and rerank for the best context
from beaver.collections import rerank
best_context = rerank(vector_results, text_results, weights=[0.6, 0.4])

5. Caching for Expensive API Calls

Leverage a dictionary with a TTL (Time-To-Live) to cache the results of slow network requests. This can dramatically speed up your application and reduce your reliance on external services.

api_cache = db.dict("external_api_cache")

# Check the cache first
response = api_cache.get("weather_new_york")
if response is None:
    # If not in cache, make the real API call
    response = make_slow_weather_api_call("New York")
    # Cache the result for 1 hour
    api_cache.set("weather_new_york", response, ttl_seconds=3600)

6. Real-time Event-Driven Systems

Use the high-efficiency pub/sub system to build applications where different components react to events in real-time. This is perfect for decoupled systems, real-time UIs, or monitoring services.

# In one process or thread (e.g., a monitoring service)
system_events = db.channel("system_events")
system_events.publish({"event": "user_login", "user_id": "alice"})

# In another process or thread (e.g., a UI updater or logger)
with db.channel("system_events").subscribe() as listener:
    for message in listener.listen():
        print(f"Event received: {message}")
        # >> Event received: {'event': 'user_login', 'user_id': 'alice'}

More Examples

For more in-depth examples, check out the scripts in the examples/ directory:

examples/kvstore.py: A comprehensive demo of the namespaced dictionary feature.
examples/list.py: Shows the full capabilities of the persistent list, including slicing and in-place updates.
examples/queue.py: A practical example of using the persistent priority queue for task management.
examples/vector.py: Demonstrates how to index and search vector embeddings, including upserts.
examples/fts.py: A detailed look at full-text search, including targeted searches on specific metadata fields.
examples/graph.py: Shows how to create relationships between documents and perform multi-hop graph traversals.
examples/pubsub.py: A demonstration of the synchronous, thread-safe publish/subscribe system in a single process.
examples/async_pubsub.py: A demonstration of the asynchronous wrapper for the publish/subscribe system.
examples/publisher.py and examples/subscriber.py: A pair of examples demonstrating inter-process message passing with the publish/subscribe system.
examples/cache.py: A practical example of using a dictionary with TTL as a cache for API calls.
examples/rerank.py: Shows how to combine results from vector and text search for more refined results.
examples/fuzzy.py: Demonstrates fuzzy search capabilities for text search.

Roadmap

These are some of the features and improvements planned for future releases:

Faster ANN: Explore integrating more advanced ANN libraries like faiss for improved vector search performance.
Full Async API: Comprehensive async support with on-demand wrappers for all collections.

Check out the roadmap for a detailed list of upcoming features and design ideas.

License

This project is licensed under the MIT License.

Project details

Release history Release notifications | RSS feed

2.0rc4 pre-release

May 15, 2026

2.0rc3 pre-release

Dec 2, 2025

2.0rc2 pre-release

Nov 29, 2025

1.3.0

Nov 27, 2025

1.2.0

Nov 26, 2025

1.1.1

Nov 26, 2025

1.0.0

Nov 11, 2025

0.27.1

Nov 11, 2025

0.26.1

Nov 10, 2025

0.25.2

Nov 9, 2025

0.24.5

Nov 6, 2025

0.24.3

Nov 3, 2025

0.24.2

Nov 3, 2025

0.23.1

Nov 2, 2025

0.22.1

Nov 2, 2025

0.21.1

Nov 2, 2025

0.20.2

Nov 2, 2025

0.20.1

Nov 1, 2025

0.19.3

Oct 31, 2025

0.19.2

Oct 31, 2025

0.19.1

Oct 31, 2025

0.18.6

Oct 29, 2025

0.18.5

Oct 27, 2025

0.18.4

Oct 23, 2025

0.18.3

Oct 23, 2025

0.18.2

Oct 23, 2025

0.18.1

Oct 17, 2025

0.18.0

Oct 17, 2025

0.17.6

Oct 2, 2025

0.17.5

Oct 2, 2025

0.17.4

Oct 2, 2025

0.17.3

Oct 1, 2025

0.17.2

Oct 1, 2025

0.17.1

Oct 1, 2025

0.17.0

Oct 1, 2025

0.16.8

Sep 26, 2025

0.16.7

Sep 26, 2025

0.16.6

Sep 25, 2025

0.16.5

Sep 25, 2025

0.16.4

Sep 25, 2025

0.16.3

Sep 25, 2025

0.16.2

Sep 25, 2025

0.16.1

Sep 24, 2025

0.16.0

Sep 24, 2025

0.15.0

Sep 24, 2025

0.14.0

Sep 24, 2025

0.13.1

Sep 24, 2025

0.13.0

Sep 24, 2025

0.12.2

Sep 23, 2025

0.12.0

Sep 23, 2025

0.11.1

Sep 23, 2025

0.11.0

Sep 22, 2025

This version

0.10.0

Sep 20, 2025

0.9.2

Sep 20, 2025

0.9.1

Sep 20, 2025

0.9.0

Sep 20, 2025

0.8.0

Sep 20, 2025

0.7.1

Sep 20, 2025

0.7.0

Sep 19, 2025

0.6.2

Sep 19, 2025

0.6.0

Sep 18, 2025

0.5.3

Sep 18, 2025

0.5.2

Sep 18, 2025

0.5.1

Sep 17, 2025

0.5.0

Sep 17, 2025

0.4.0

Sep 17, 2025

0.3.0

Sep 17, 2025

0.2.0

Sep 14, 2025

0.1.0

Sep 14, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

beaver_db-0.10.0.tar.gz (22.4 kB view details)

Uploaded Sep 20, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

beaver_db-0.10.0-py3-none-any.whl (21.4 kB view details)

Uploaded Sep 20, 2025 Python 3

File details

Details for the file beaver_db-0.10.0.tar.gz.

File metadata

Download URL: beaver_db-0.10.0.tar.gz
Upload date: Sep 20, 2025
Size: 22.4 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.5.13

File hashes

Hashes for beaver_db-0.10.0.tar.gz
Algorithm	Hash digest
SHA256	`44801d9fcf63eadab43be9a071573afc6b1f15bd726fd6a3ce979af8319cf526`
MD5	`c959a49fa45f01a1f6f128ae013b7a9a`
BLAKE2b-256	`f2112b7990ff3d57e7b66c06b8acb0fb59a467b3a2a04ac5492bf7ce7ce98858`

See more details on using hashes here.

File details

Details for the file beaver_db-0.10.0-py3-none-any.whl.

File metadata

Download URL: beaver_db-0.10.0-py3-none-any.whl
Upload date: Sep 20, 2025
Size: 21.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.5.13

File hashes

Hashes for beaver_db-0.10.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`a5d3e0ec383f39b800560d52d575287811ce9349b75dbc6ac0d09c2fa8a91fdd`
MD5	`0d7eb3ba84759f85d4f27c19a4a76959`
BLAKE2b-256	`b1a4d371f14ede4cddabd4760886992feaa336f5c2ec814c9f4991be584c6414`

See more details on using hashes here.

beaver-db 0.10.0

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

beaver 🦫

Design Philosophy

Core Features

Installation

Quickstart

Things You Can Build with Beaver

1. AI Agent Task Management

2. User Authentication and Profile Store

3. Chatbot Conversation History

4. Build a RAG (Retrieval-Augmented Generation) System

5. Caching for Expensive API Calls

6. Real-time Event-Driven Systems

More Examples

Roadmap

License

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes