Fast, embedded, and multi-modal DB based on SQLite for AI-powered applications.

Project description

beaver 🦫

PyPI - Downloads PyPI License

A fast, single-file, multi-modal database for Python, built with the standard sqlite3 library.

beaver is the Backend for Embedded, All-in-one Vector, Entity, and Relationship storage. It's a simple, local, and embedded database designed to manage complex, modern data types without requiring a database server, built on top of SQLite.

Design Philosophy

beaver is built with a minimalistic philosophy for small, local use cases where a full-blown database server would be overkill.

Minimalistic & Zero-Dependency: Uses only Python's standard libraries (sqlite3) and numpy/scipy.
Synchronous & Thread-Safe: Designed for simplicity and safety in multi-threaded environments.
Built for Local Applications: Perfect for local AI tools, RAG prototypes, chatbots, and desktop utilities that need persistent, structured data without network overhead.
Fast by Default: It's built on SQLite, which is famously fast and reliable for local applications. The vector search is accelerated with an in-memory k-d tree.
Standard Relational Interface: While beaver provides high-level features, you can always use the same SQLite file for normal relational tasks with standard SQL.

Core Features

Synchronous Pub/Sub: A simple, thread-safe, Redis-like publish-subscribe system for real-time messaging.
Namespaced Key-Value Dictionaries: A Pythonic, dictionary-like interface for storing any JSON-serializable object within separate namespaces with optional TTL for cache implementations.
Pythonic List Management: A fluent, Redis-like interface for managing persistent, ordered lists, with all operations in constant time.
Efficient Vector Storage & Search: Store vector embeddings and perform fast approximate nearest neighbor searches using an in-memory k-d tree.
Full-Text Search: Automatically index and search through document metadata using SQLite's powerful FTS5 engine.
Graph Traversal: Create relationships between documents and traverse the graph to find neighbors or perform multi-hop walks.
Single-File & Portable: All data is stored in a single SQLite file, making it incredibly easy to move, back up, or embed in your application.

Installation

pip install beaver-db

Quickstart & API Guide

Initialization

All you need to do is import and instantiate the BeaverDB class with a file path.

from beaver import BeaverDB, Document

db = BeaverDB("my_application.db")

Namespaced Dictionaries

Use db.dict() to get a dictionary-like object for a specific namespace. The value can be any JSON-encodable object.

# Get a handle to the 'app_config' namespace
config = db.dict("app_config")

# Set values using standard dictionary syntax
config["theme"] = "dark"
config["user_id"] = 123

# Get a value
theme = config.get("theme")
print(f"Theme: {theme}") # Output: Theme: dark

List Management

Get a list wrapper with db.list() and use Pythonic methods to manage it.

tasks = db.list("daily_tasks")
tasks.push("Write the project report")
tasks.prepend("Plan the day's agenda")
print(f"The first task is: {tasks[0]}")

Vector & Text Search

Store Document objects containing vector embeddings and metadata. When you index a document, its string fields are automatically made available for full-text search.

# Get a handle to a collection
docs = db.collection("articles")

# Create and index a multi-modal document
doc = Document(
    id="sql-001",
    embedding=[0.8, 0.1, 0.1],
    content="SQLite is a powerful embedded database ideal for local apps.",
    author="John Smith"
)
docs.index(doc)

# 1. Perform a vector search to find semantically similar documents
query_vector = [0.7, 0.2, 0.2]
vector_results = docs.search(vector=query_vector, top_k=3)
top_doc, distance = vector_results[0]
print(f"Vector Search Result: {top_doc.content} (distance: {distance:.2f})")

# 2. Perform a full-text search to find documents with specific words
text_results = docs.match(query="database", top_k=3)
top_doc, rank = text_results[0]
print(f"Full-Text Search Result: {top_doc.content} (rank: {rank:.2f})")

# 3. Combine both vector and text search for refined results
from beaver.collections import rerank
combined_results = rerank([d for d,_ in vector_results], [d for d,_ in text_results], weights=[2,1])

Graph Traversal

Create relationships between documents and traverse them.

from beaver import WalkDirection

# Create documents
alice = Document(id="alice", name="Alice")
bob = Document(id="bob", name="Bob")
charlie = Document(id="charlie", name="Charlie")

# Index them
social_net = db.collection("social")
social_net.index(alice)
social_net.index(bob)
social_net.index(charlie)

# Create edges
social_net.connect(alice, bob, label="FOLLOWS")
social_net.connect(bob, charlie, label="FOLLOWS")

# Find direct neighbors
following = social_net.neighbors(alice, label="FOLLOWS")
print(f"Alice follows: {[p.id for p in following]}")

# Perform a multi-hop walk to find friends of friends
foaf = social_net.walk(
    source=alice,
    labels=["FOLLOWS"],
    depth=2,
    direction=WalkDirection.OUTGOING,
)
print(f"Alice's extended network: {[p.id for p in foaf]}")

Synchronous Pub/Sub

Publish events from one part of your app and listen in another using threads.

import threading

def listener():
    for message in db.subscribe("system_events"):
        print(f"LISTENER: Received -> {message}")
        if message.get("event") == "shutdown":
            break

def publisher():
    db.publish("system_events", {"event": "user_login", "user": "alice"})
    db.publish("system_events", {"event": "shutdown"})

# Run them concurrently
listener_thread = threading.Thread(target=listener)
publisher_thread = threading.Thread(target=publisher)
listener_thread.start()
publisher_thread.start()
listener_thread.join()
publisher_thread.join()

Roadmap

These are some of the features and improvements planned for future releases:

Fuzzy search: Implement fuzzy matching capabilities for text search.
Faster ANN: Explore integrating more advanced ANN libraries like faiss for improved vector search performance.
Priority Queues: Introduce a priority queue data structure for task management.
Improved Pub/Sub: Fan-out implementation with a more Pythonic API.
Async API: Comprehensive async support with on-demand wrappers for all collections.

Check out the roadmap for a detailed list of upcoming features and design ideas.

License

This project is licensed under the MIT License.

Project details

Release history Release notifications | RSS feed

2.0rc4 pre-release

May 15, 2026

2.0rc3 pre-release

Dec 2, 2025

2.0rc2 pre-release

Nov 29, 2025

1.3.0

Nov 27, 2025

1.2.0

Nov 26, 2025

1.1.1

Nov 26, 2025

1.0.0

Nov 11, 2025

0.27.1

Nov 11, 2025

0.26.1

Nov 10, 2025

0.25.2

Nov 9, 2025

0.24.5

Nov 6, 2025

0.24.3

Nov 3, 2025

0.24.2

Nov 3, 2025

0.23.1

Nov 2, 2025

0.22.1

Nov 2, 2025

0.21.1

Nov 2, 2025

0.20.2

Nov 2, 2025

0.20.1

Nov 1, 2025

0.19.3

Oct 31, 2025

0.19.2

Oct 31, 2025

0.19.1

Oct 31, 2025

0.18.6

Oct 29, 2025

0.18.5

Oct 27, 2025

0.18.4

Oct 23, 2025

0.18.3

Oct 23, 2025

0.18.2

Oct 23, 2025

0.18.1

Oct 17, 2025

0.18.0

Oct 17, 2025

0.17.6

Oct 2, 2025

0.17.5

Oct 2, 2025

0.17.4

Oct 2, 2025

0.17.3

Oct 1, 2025

0.17.2

Oct 1, 2025

0.17.1

Oct 1, 2025

0.17.0

Oct 1, 2025

0.16.8

Sep 26, 2025

0.16.7

Sep 26, 2025

0.16.6

Sep 25, 2025

0.16.5

Sep 25, 2025

0.16.4

Sep 25, 2025

0.16.3

Sep 25, 2025

0.16.2

Sep 25, 2025

0.16.1

Sep 24, 2025

0.16.0

Sep 24, 2025

0.15.0

Sep 24, 2025

0.14.0

Sep 24, 2025

0.13.1

Sep 24, 2025

0.13.0

Sep 24, 2025

0.12.2

Sep 23, 2025

0.12.0

Sep 23, 2025

0.11.1

Sep 23, 2025

0.11.0

Sep 22, 2025

0.10.0

Sep 20, 2025

0.9.2

Sep 20, 2025

0.9.1

Sep 20, 2025

0.9.0

Sep 20, 2025

0.8.0

Sep 20, 2025

0.7.1

Sep 20, 2025

This version

0.7.0

Sep 19, 2025

0.6.2

Sep 19, 2025

0.6.0

Sep 18, 2025

0.5.3

Sep 18, 2025

0.5.2

Sep 18, 2025

0.5.1

Sep 17, 2025

0.5.0

Sep 17, 2025

0.4.0

Sep 17, 2025

0.3.0

Sep 17, 2025

0.2.0

Sep 14, 2025

0.1.0

Sep 14, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

beaver_db-0.7.0.tar.gz (15.8 kB view details)

Uploaded Sep 19, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

beaver_db-0.7.0-py3-none-any.whl (15.2 kB view details)

Uploaded Sep 19, 2025 Python 3

File details

Details for the file beaver_db-0.7.0.tar.gz.

File metadata

Download URL: beaver_db-0.7.0.tar.gz
Upload date: Sep 19, 2025
Size: 15.8 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.5.13

File hashes

Hashes for beaver_db-0.7.0.tar.gz
Algorithm	Hash digest
SHA256	`4696a186882f3819ed0ae2bec01668db5cc586f96f225c987131538b50617980`
MD5	`37c1977b6d80e9d1a5e970a28a15fd06`
BLAKE2b-256	`864e4a3a901dcf3c1ef96257b94037418c5b5d0d90306adb11ee543ea01848fa`

See more details on using hashes here.

File details

Details for the file beaver_db-0.7.0-py3-none-any.whl.

File metadata

Download URL: beaver_db-0.7.0-py3-none-any.whl
Upload date: Sep 19, 2025
Size: 15.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.5.13

File hashes

Hashes for beaver_db-0.7.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`b4cc46068d64b6c9b8b14825c6752a23fbd99fb753f24adf159567ce514d1064`
MD5	`158247b52cfecb739236ecd4521e4b1e`
BLAKE2b-256	`918f62df55ea579ce830faca500fd05a3cfc85d0c967044db12a33895ab59b58`

See more details on using hashes here.

beaver-db 0.7.0

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

beaver 🦫

Design Philosophy

Core Features

Installation

Quickstart & API Guide

Initialization

Namespaced Dictionaries

List Management

Vector & Text Search

Graph Traversal

Synchronous Pub/Sub

Roadmap

License

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes