Skip to main content

A lightweight, file-based database implementation in Python

Project description

Bazoola

CI Python Version License: MIT

A lightweight, file-based database implementation in Python designed for educational purposes and simple applications.

Overview

Bazoola is a minimal database system that stores data in text files with fixed-width fields. It supports basic database operations like CRUD (Create, Read, Update, Delete), schema validation, foreign key relationships, and simple querying capabilities.

Features

  • File-based storage: Data is stored in human-readable text files with fixed-width fields
  • Schema definition: Define table schemas with typed fields
  • Primary keys: Automatic ID generation (manual ID assignment validates uniqueness but doesn't support custom sequences)
  • Foreign keys: Support for relationships between tables
  • Indexing: ID-based indexing for fast lookups
  • Joins: Support for joining related tables
  • Querying: Find records by field values, substrings, or custom conditions
  • Free space management: Reuses deleted record space

Installation

Install as a package

# Install directly from GitHub
pip install git+https://github.com/ddoroshev/bazoola.git

Development setup

# Clone the repository
git clone https://github.com/ddoroshev/bazoola.git
cd bazoola

# Install dependencies using Poetry
poetry install

Quick Start

from bazoola import DB, Table, Schema, Field, PK, FK, CHAR, INT, Join

# Define table schemas
class TableAuthors(Table):
    name = "authors"
    schema = Schema([
        Field("id", PK()),
        Field("name", CHAR(20)),
    ])

class TableBooks(Table):
    name = "books"
    schema = Schema([
        Field("id", PK()),
        Field("title", CHAR(20)),
        Field("author_id", FK("authors")),
        Field("year", INT(null=True)),
    ])

# Create database instance
db = DB([TableAuthors, TableBooks])

# Insert data
author = db.insert("authors", {"name": "John Doe"})
book = db.insert("books", {
    "title": "My Book",
    "author_id": author["id"],
    "year": 2024
})

# Query with joins
book_with_author = db.find_by_id(
    "books",
    book["id"],
    joins=[Join("author_id", "author", "authors")]
)
print(book_with_author)
# Output: {'id': 1, 'title': 'My Book', 'author_id': 1, 'year': 2024, 'author': {'id': 1, 'name': 'John Doe'}}

# Close the database
db.close()

Field Types

  • PK(): Primary key field (auto-incrementing integer)
  • INT(null=False): Integer field
  • CHAR(size, null=False): Fixed-size character field
  • FK(table_name, null=False): Foreign key field

API Reference

Database Operations

# Insert a record
row = db.insert("table_name", {"field": "value"})

# Find by ID
row = db.find_by_id("table_name", id)

# Find all records
rows = db.find_all("table_name")

# Find by field value
rows = db.find_by("table_name", "field_name", value)

# Find by substring
rows = db.find_by_substr("table_name", "field_name", "substr")

# Update a record
row = db.update_by_id("table_name", id, {"field": "new_value"})

# Delete a record
db.delete_by_id("table_name", id)

# Delete by substring match
db.delete_by_substr("table_name", "field_name", "substr")

# Truncate table
db.truncate("table_name", cascade=False)

Advanced Queries

from bazoola import GT, LT

# Find with conditions
rows = db.find_by_cond("books", GT(year=2020))
rows = db.find_by_cond("books", LT(year=2000))

# Query with joins
rows = db.find_all("books", joins=[
    Join("author_id", "author", "authors")
])

# Inverse joins (one-to-many)
from bazoola import InverseJoin
author = db.find_by_id("authors", 1, joins=[
    InverseJoin("author_id", "books", "books")
])

File Structure

Bazoola creates the following files for each table:

  • {table_name}.dat - Main data file
  • {table_name}__seqnum.dat - Sequence number for auto-increment
  • {table_name}__id.idx.dat - Primary key index
  • {table_name}__free.dat - Free rownum stack for space reuse

Configuration

By default, database files are stored in a data directory. You can specify a different directory when creating the database:

# Use default 'data' directory
db = DB([TableAuthors, TableBooks])

# Use custom directory
db = DB([TableAuthors, TableBooks], base_dir="/path/to/data/directory")

Testing

# Run tests
poetry run pytest

# Run tests with coverage
poetry run pytest --cov

# Type checking
poetry run mypy .

# Linting
poetry run ruff check .

Limitations

  • No multi-threading support
  • No transactions or rollback support
  • Limited query optimization
  • Fixed-size fields only
  • No SQL interface (by design)

License

This project is licensed under the MIT License - see the LICENSE file for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

bazoola-0.0.4.tar.gz (10.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

bazoola-0.0.4-py3-none-any.whl (9.1 kB view details)

Uploaded Python 3

File details

Details for the file bazoola-0.0.4.tar.gz.

File metadata

  • Download URL: bazoola-0.0.4.tar.gz
  • Upload date:
  • Size: 10.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.1.3 CPython/3.10.18 Linux/6.11.0-1018-azure

File hashes

Hashes for bazoola-0.0.4.tar.gz
Algorithm Hash digest
SHA256 7b10b3781479676c9289718d44a63160fa7d5c42acbe3827bf03062a3f4b3722
MD5 370d5c11989dd1accdf6f17c423ff62f
BLAKE2b-256 43cff6c97a807d006fe8d83b8b236c190e6fa8cb4fc356e67245a2b20b5cf80d

See more details on using hashes here.

File details

Details for the file bazoola-0.0.4-py3-none-any.whl.

File metadata

  • Download URL: bazoola-0.0.4-py3-none-any.whl
  • Upload date:
  • Size: 9.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.1.3 CPython/3.10.18 Linux/6.11.0-1018-azure

File hashes

Hashes for bazoola-0.0.4-py3-none-any.whl
Algorithm Hash digest
SHA256 1c956ebfed927e03b97b5e5e107e4b652b7e627d3fb1ff7f5043204b6c7fdacb
MD5 2f8f5436e179d61cb42a85d9d95448d8
BLAKE2b-256 e8955da1bb0795a46cb21741148d9ae83690baacc2b214db4778fe1939b771e0

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page