Deterministic knowledge compiler for LLM output

These details have not been verified by PyPI

Project links

Homepage

Project description

AI Knowledge Filler

Deterministic knowledge compiler for LLM output

What is AKF

AKF (AI Knowledge Filler) is an AI-Native Cognitive Operating System — a deterministic validation pipeline that turns LLM output into schema-compliant, ontology-governed knowledge files.

LLMs generate text. Text is not knowledge.

What it is not: a note-taking app, a chat assistant, a markdown generator, an Obsidian plugin. What it is: the operating contract for a knowledge base that scales.

The Problem

Without a validation layer, AI-generated content produces:

Error	Example
Domain violation	`domain: Technology` → valid: `domain: system-design`
Enum violation	`level: expert` → valid: `beginner \| intermediate \| advanced`
Type mismatch	`tags: security` → valid: `tags: [security, api, auth]`
Date format	`created: 12-02-2026` → valid: `created: 2026-02-12`

Each error is trivial. Across hundreds of files, they make a vault unsearchable, Dataview queries return nothing, and the knowledge graph becomes noise.

AKF solves this at the generation layer, not the review layer.

What Every Committed File Guarantees

Required fields: title, type, domain, level, status, tags, created, updated
Valid enums: type, level, status from controlled sets
Domain from configured taxonomy (akf.yaml) — not hardcoded
ISO 8601 dates with created ≤ updated
tags as array (≥3), title as string — no type mismatches

Violations produce error codes E001–E007. Retry instructions are derived from those codes, not from free-form prompts.

Retry = Ontology Signal

Retry pressure is not a failure metric. When a domain triggers elevated retries, the taxonomy has a boundary problem — not the model. Telemetry captures this signal. Ontology improves from data, not intuition.

⚡ Quick Start (60 seconds)

Option 1: pip install (Recommended)

pip install ai-knowledge-filler

# Set at least one API key — Groq is free and fastest to start
export GROQ_API_KEY="gsk_..."          # free at console.groq.com (recommended)
# export ANTHROPIC_API_KEY="sk-ant-..." # or any other provider

# Generate
akf generate "Create Docker security checklist"

Output: outputs/Docker_Security_Checklist.md — production-ready, validated.

Option 2: Python API (Stage 2)

from akf import Pipeline

pipeline = Pipeline(output="./vault/")

# Single file
result = pipeline.generate("Create Docker security checklist")
print(result.path, result.attempts)

# Batch
results = pipeline.batch_generate([
    "Docker deployment guide",
    "Kubernetes security hardening",
    "API authentication strategies",
])

Option 3: REST API (Stage 3)

# Start server
akf serve --port 8000

# Generate via HTTP
curl -X POST http://localhost:8000/v1/generate \
  -H "Content-Type: application/json" \
  -d '{"prompt": "Create Docker security checklist"}'

# Swagger UI
open http://localhost:8000/docs

Option 4: Claude Projects (No CLI)

1. Open Claude.ai → Create new Project
2. Project Knowledge → Upload akf/system_prompt.md
3. Custom Instructions → Paste akf/system_prompt.md
4. Prompt: "Create guide on API authentication"
5. Done. Claude generates structured files.

What You Get

Core System

System Prompt — Transforms LLM from chat to file generator
Metadata Standard — YAML structure specification
Domain Taxonomy — 30+ classification domains
Update Protocol — File merge rules
Validation Script — Automated quality gates
CLI — Multi-LLM interface (Claude, Gemini, GPT-4, Ollama)

Quality Assurance

✅ 88% test coverage (425 tests)
✅ Automated YAML validation
✅ CI/CD pipelines (GitHub Actions)
✅ Type hints (100% coverage)
✅ Linting (Pylint 9.55/10)

CLI Commands

Generate Files

# Auto-select first available LLM
akf generate "Create Kubernetes deployment guide"

# Specific model
akf generate "Create API checklist" --model claude
akf generate "Create Docker guide" --model gemini
akf generate "Create REST concept" --model gpt4
akf generate "Create microservices reference" --model ollama

Validate Files

# Single file
akf validate --file outputs/Guide.md

# All files in outputs/
akf validate

Start REST API Server

akf serve
akf serve --port 8001
akf serve --host 0.0.0.0 --port 8000

Endpoints: POST /v1/generate · POST /v1/validate · POST /v1/batch · GET /v1/models · GET /health

Swagger UI: http://localhost:8000/docs

List Available Models

akf models

# Output:
# ✅ groq      Groq — llama-3.3-70b-versatile
# ❌ grok      Grok (xAI) — Set XAI_API_KEY
# ✅ claude    Claude (Anthropic) — claude-sonnet-4-20250514
# ✅ gemini    Gemini (Google) — gemini-3-flash-preview
# ❌ gpt4      GPT-3.5 (OpenAI) — Set OPENAI_API_KEY
# ✅ ollama    Ollama — llama3.2:3b

Example Output

Input:

Create guide on API rate limiting

Output:

---
title: "API Rate Limiting Strategy"
type: guide
domain: api-design
level: intermediate
status: active
version: v1.0
tags: [api, rate-limiting, performance]
related:
  - "[[API Design Principles]]"
  - "[[System Scalability]]"
created: 2026-02-12
updated: 2026-02-12
---

## Purpose
Comprehensive strategy for implementing API rate limits...

## Core Principles
[Structured content with sections, code examples]

## Implementation
[Step-by-step technical guidance]

## Conclusion
[Summary and next steps]

Every file. Same structure. Validated automatically.

Architecture

User Prompt
    ↓
System Prompt (behavior definition)
    ↓
LLM Provider (Claude/Gemini/GPT-4/Ollama)
    ↓
Structured Markdown + YAML
    ↓
Automated Validation
    ↓
Production-Ready File

Key Insight: System prompt is the source of truth. Same prompt works across all LLMs.

Validation Pipeline (Phase 2.1)

LLM Output
    ↓
Validation Engine  ← deterministic
    ↓
Error Normalizer   ← deterministic
    ↓
Retry Controller   ← non-deterministic (LLM, max 3 attempts)
    ↓
Commit Gate        ← deterministic (schema_version + atomic write)
    ↓
Vault File

Determinism boundary: LLM is the only non-deterministic component.

Model Selection

Model	Key	Speed	Cost	Best For
Groq	`GROQ_API_KEY`	⚡ Fastest	Free tier	First installs, CI, high volume
Grok	`XAI_API_KEY`	Fast	$$	General purpose
Claude	`ANTHROPIC_API_KEY`	Medium	$$$	Technical docs, architecture
Gemini	`GOOGLE_API_KEY`	Fast	$	Quick drafts, summaries
GPT-3.5	`OPENAI_API_KEY`	Medium	$$	Versatile content
Ollama	—	Very Fast	Free	Privacy, offline, local

Auto-selection: CLI tries providers in order: Groq → Grok → Claude → Gemini → GPT-4 → Ollama (first available).

Installation

Via pip (Recommended)

pip install ai-knowledge-filler

From Source

git clone https://github.com/petrnzrnk-creator/ai-knowledge-filler.git
cd ai-knowledge-filler
pip install -r requirements.txt

API Keys

# Set at least one (Groq recommended — free tier available)
export GROQ_API_KEY="gsk_..."          # console.groq.com
export XAI_API_KEY="xai-..."           # console.x.ai
export ANTHROPIC_API_KEY="sk-ant-..."  # console.anthropic.com
export GOOGLE_API_KEY="AIza..."        # aistudio.google.com
export OPENAI_API_KEY="sk-..."         # platform.openai.com

# Or add to ~/.bashrc / ~/.zshrc to persist across sessions:
# export GROQ_API_KEY="gsk_..."

Testing

# Run all tests
pytest --cov=. --cov-report=term-missing -v

# Run validation
akf validate

# Run linting
pylint *.py tests/

Coverage: 88% (425 tests) Linting: Pylint 9.55/10 CI/CD: All checks passing (Python 3.10/3.11/3.12)

Use Cases

1. Technical Documentation Generate API docs, architecture decisions, deployment guides.

2. Knowledge Management Structure meeting notes, research findings, learning content.

3. Consulting Deliverables Create frameworks, methodologies, client reports.

4. Batch Processing Generate multiple files programmatically via CLI or API.

File Types

type: concept      # Theoretical entity, definition
type: guide        # Step-by-step process
type: reference    # Specification, standard
type: checklist    # Validation criteria
type: project      # Project description
type: template     # Reusable template

30+ domains: api-design, system-design, devops, security, data-engineering, etc.

Documentation

System Prompt — LLM behavior definition
User Guide — Installation, quick start, troubleshooting
CLI Reference — All commands, flags, env vars, exit codes
Architecture — Module map, data flow, extension points
Contributing — Dev setup, quality gates, adding providers

Advanced Usage

Python API

from akf import Pipeline

# Initialize once
pipeline = Pipeline(
    output="./vault/",
    model="groq",           # optional, auto-selects if omitted
    telemetry_path="./telemetry/",
)

# Single file — returns GenerateResult
result = pipeline.generate("Create API security checklist")
print(result.success)        # True
print(result.path)           # PosixPath('vault/API_Security_Checklist.md')
print(result.attempts)       # 1
print(result.generation_id)  # uuid4 for telemetry join

# Batch — returns list[GenerateResult]
results = pipeline.batch_generate([
    "Docker deployment best practices",
    "Kubernetes security hardening",
    "API authentication strategies",
])

# Validate existing file
v = pipeline.validate("vault/my_file.md")
print(v.valid, v.errors)

REST API

# Start server
akf serve --port 8000

# Generate
curl -X POST http://localhost:8000/v1/generate \
  -H "Content-Type: application/json" \
  -d '{"prompt": "Create Docker security checklist", "model": "groq"}'

# Batch
curl -X POST http://localhost:8000/v1/batch \
  -H "Content-Type: application/json" \
  -d '{"prompts": ["Docker guide", "Kubernetes guide"]}'

Full API docs: http://localhost:8000/docs (Swagger UI, auto-generated)

Batch via CLI

cat > topics.txt << 'EOF'
Docker deployment best practices
Kubernetes security hardening
API authentication strategies
EOF

while read topic; do
    akf generate "Create guide on $topic" --model gemini
done < topics.txt

Validation

Automated checks:

✅ YAML frontmatter present
✅ Required fields (title, type, domain, level, status, created, updated)
✅ Valid enum values (type, level, status)
✅ Domain in taxonomy
✅ ISO 8601 dates (YYYY-MM-DD)
✅ Tags array (3+ items)

Output:

✅ outputs/Guide.md
❌ drafts/incomplete.md
   ERROR: Missing field: domain
   ERROR: Invalid type: document

Roadmap

v0.1.x ✅ (Shipped)

System Prompt (universal LLM compatibility)
YAML Metadata Standard
Domain Taxonomy (30+ domains)
Validation Script
Multi-LLM CLI (Claude, Gemini, GPT-4, Ollama, Groq)
CI/CD Pipelines (GitHub Actions)
PyPI package (pip install ai-knowledge-filler)

v0.2.x ✅ (Shipped)

Validation pipeline (ValidationError, Error Normalizer, Retry Controller, Commit Gate)
Hard enum enforcement — E001–E006
Telemetry layer — append-only JSONL, generation_id, convergence metrics

v0.3.0 ✅ (Shipped)

Config layer — external akf.yaml, taxonomy configurable without code changes
akf init — generates akf.yaml for a new vault
Validator Model D — created ≤ updated (E007)

v0.4.x ✅ (Current)

Marine crew domain pilot — 20-year practitioner taxonomy
Onboarding layer — README v2, Quickstart, CONTRIBUTING, error audit
Pipeline API — from akf import Pipeline (Stage 2)
REST API — akf serve, FastAPI, Swagger UI (Stage 3)
425 tests, 88% coverage, CI green Python 3.10/3.11/3.12

v1.0.0 (Planned)

Telegram Bot connector
Web UI
n8n / Make integration templates

Later

Web UI
n8n / Make integration templates
Graph extraction layer

License

MIT License — Free for commercial and personal use.

Philosophy

This is knowledge engineering, not chat enhancement.

LLMs are deterministic infrastructure, not conversational toys.

Before: "AI helps me write notes" After: "AI compiles my knowledge base"

Created by: Petr — AI Solutions Architect PyPI: https://pypi.org/project/ai-knowledge-filler/ Repository: https://github.com/petrnzrnk-creator/ai-knowledge-filler Version: 0.4.1

Support

Issues: GitHub Issues
Discussions: GitHub Discussions

Quick Links: Quick Start | CLI Commands | Documentation | Examples

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

1.0.10

Mar 19, 2026

1.0.9

Mar 18, 2026

1.0.8

Mar 18, 2026

1.0.7

Mar 18, 2026

1.0.6

Mar 18, 2026

1.0.5

Mar 17, 2026

1.0.4

Mar 17, 2026

1.0.3

Mar 17, 2026

1.0.2

Mar 16, 2026

1.0.1

Mar 13, 2026

1.0.0

Mar 11, 2026

0.6.2

Mar 7, 2026

0.6.1

Mar 2, 2026

0.6.0

Mar 2, 2026

0.5.4

Feb 28, 2026

0.5.3

Feb 27, 2026

0.5.2

Feb 27, 2026

0.5.1

Feb 27, 2026

0.5.0

Feb 27, 2026

This version

0.4.2

Feb 26, 2026

0.4.1

Feb 26, 2026

0.4.0

Feb 26, 2026

0.3.0

Feb 25, 2026

0.2.0

Feb 21, 2026

0.1.4

Feb 20, 2026

0.1.3

Feb 19, 2026

0.1.1

Feb 19, 2026

0.1.0

Feb 19, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ai_knowledge_filler-0.4.2.tar.gz (66.0 kB view details)

Uploaded Feb 26, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

ai_knowledge_filler-0.4.2-py3-none-any.whl (49.3 kB view details)

Uploaded Feb 26, 2026 Python 3

File details

Details for the file ai_knowledge_filler-0.4.2.tar.gz.

File metadata

Download URL: ai_knowledge_filler-0.4.2.tar.gz
Upload date: Feb 26, 2026
Size: 66.0 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for ai_knowledge_filler-0.4.2.tar.gz
Algorithm	Hash digest
SHA256	`6d5273c4e5503519772379ed82701667c50a30b4d673ec879be400030a82859c`
MD5	`c6a533ef8a5600b75f80b4e425d73169`
BLAKE2b-256	`bff6d6619475ed88b785216b4ae84165f05b643f322eb288e6dfbc9097418522`

See more details on using hashes here.

File details

Details for the file ai_knowledge_filler-0.4.2-py3-none-any.whl.

File metadata

Download URL: ai_knowledge_filler-0.4.2-py3-none-any.whl
Upload date: Feb 26, 2026
Size: 49.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for ai_knowledge_filler-0.4.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`80501fbb196f64eb8fdefc6f0176e8d1e35add33f51aebca71de11a91da25eb6`
MD5	`f207571a7d4ea5eaa1bce526f2276082`
BLAKE2b-256	`c0a8908df9e4690bb0e19bf309f7cb5a337dbdaeb41df6680b0903b637d49582`

See more details on using hashes here.

ai-knowledge-filler 0.4.2

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

AI Knowledge Filler

What is AKF

The Problem

What Every Committed File Guarantees

Retry = Ontology Signal

Retry pressure is not a failure metric. When a domain triggers elevated retries, the taxonomy has a boundary problem — not the model. Telemetry captures this signal. Ontology improves from data, not intuition.

⚡ Quick Start (60 seconds)

Option 1: pip install (Recommended)

Option 2: Python API (Stage 2)

Option 3: REST API (Stage 3)

Option 4: Claude Projects (No CLI)

What You Get

Core System

Quality Assurance

CLI Commands

Generate Files

Validate Files

Start REST API Server

List Available Models

Example Output

Architecture

Validation Pipeline (Phase 2.1)

Model Selection

Installation

Via pip (Recommended)

From Source

API Keys

Testing

Use Cases

File Types

Documentation

Advanced Usage

Python API

REST API

Batch via CLI

Validation

Roadmap

v0.1.x ✅ (Shipped)

v0.2.x ✅ (Shipped)

v0.3.0 ✅ (Shipped)

v0.4.x ✅ (Current)

v1.0.0 (Planned)

Later

License

Philosophy

Support

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes