Skip to main content

AI Monorepo — AI-agnostic versioned knowledge store with ingestion, index and MCP access

Project description

AI Monorepo

AI-agnostisches, versioniertes Wissens-Repo als Single Source of Truth für beliebige AIs. Heterogene Eingaben (YouTube, URLs, PDF, Audio, Bild, Slides) werden automatisch zu sauberem Markdown aufbereitet und über einen MCP-Server für jede MCP-fähige AI abrufbar gemacht.

Konzept & Entscheidungen: siehe AI-Monorepo-Konzept.md (im Begleit-Repo MacOS-monitoring).

Zwei Repos — bewusst getrennt

Repo Inhalt
dieses (ai-monorepo) Code: Ingestion, Index, MCP-Server, (später) App
Store (separat, privat) Daten = Source of Truth: Markdown + Frontmatter, vom Tooling beschrieben

Der Store-Pfad wird über die Config (monorepo.example.tomlmonorepo.toml) gesetzt. Der Code committet nie sich selbst in den Store.

Struktur

src/monorepo/
  pipeline/          Ingestion-Orchestrierung
    adapters/        je Quellentyp ein Adapter (youtube, pdf, url, audio, …)
  index/             Vektorindex (LanceDB), aus dem Store gebaut
  mcp/               MCP-Server (search_context …)
  cli.py             Kommandozeilen-Einstieg
docs/
  frontmatter-schema.md   Vertrag für jeden Store-Eintrag

Leitprinzipien

  1. Embeddings/Index = wegwerfbarer Cache, nie Source of Truth (→ rebuild statt migrieren).
  2. Originale werden zu Pointern; das Transkript ist die SoT.
  3. Pipeline-AI veredelt beim Ablegen, Roh-Original bleibt immer erhalten.
  4. MCP filtert zur Abfragezeit nach Lebenszyklus (as_of-Zeitreise möglich).
  5. Lokal-first, „nur ich".

Phasen-Status

  • Phase 0 — Skelett & Konventionen
  • Phase 1 — YouTube-Adapter
  • Phase 2 — AI-Veredelung
  • Phase 3 — Index (LanceDB)
  • Phase 4 — MCP-Server
  • Phase 5 — Lebenszyklus + weitere Adapter
  • Phase 6 — Host + Mac-Fallback
  • Phase 7 — App + CaptureKit (eigener Track)

Entwicklung

python3 -m venv .venv && source .venv/bin/activate
pip install -e ".[dev]"
ruff check .

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ai_monorepo-0.2.0.tar.gz (34.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

ai_monorepo-0.2.0-py3-none-any.whl (37.3 kB view details)

Uploaded Python 3

File details

Details for the file ai_monorepo-0.2.0.tar.gz.

File metadata

  • Download URL: ai_monorepo-0.2.0.tar.gz
  • Upload date:
  • Size: 34.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.5

File hashes

Hashes for ai_monorepo-0.2.0.tar.gz
Algorithm Hash digest
SHA256 487b5efb5ec340b6a6d341a97ba2955c3ffdc0a69a414e0e0db5321e84fbae06
MD5 eb32dcdfa29609baef16271457a3cb38
BLAKE2b-256 13335c2543ee30190460e92504edc93f0d1838cb01d4902a9a38fe600cdbbc3b

See more details on using hashes here.

File details

Details for the file ai_monorepo-0.2.0-py3-none-any.whl.

File metadata

  • Download URL: ai_monorepo-0.2.0-py3-none-any.whl
  • Upload date:
  • Size: 37.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.5

File hashes

Hashes for ai_monorepo-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 e70c123b28980431536aade7e581184a133ab1b9e2465d299df0d69ab871f2b1
MD5 8dfa2936d420e9bfb11dca7f3af2df3e
BLAKE2b-256 4efd3c65b7b6246ac910e0564ba4617d824a937e505720e81fd066e0da5f0e1d

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page