Kogwistar
Graph-native agent harness and substrate for knowledge, memory, workflow, and MCP tooling.
An AI substrate system, from provenance to wisdom, where execution becomes reusable memory.
A graph/hypergraph-native execution, memory, and observability substrate for AI systems: build knowledge graphs, workflow runtimes, conversation memory, and provenance systems as a single substrate for AI agents.
kogwistar is an open-source substrate and harness for AI agent systems, built as a graph-native memory and execution system.
What This Is
- A substrate for storing knowledge, conversations, workflows, and provenance in the same graph-oriented model.
- A runnable harness for executing workflows and replaying history.
- A foundation for building agent systems with inspectable history and traceable answers.
What This Is Not
- Not just another GraphRAG demo.
- Not an agent framework with a fixed opinionated loop.
- Not a finished enterprise product.
See docs/positioning.md for the full architectural framing and docs/LEARNING_PATH.md for the learning order.
Glossary
- substrate: the shared graph-oriented foundation for knowledge, conversation, memory, workflow, execution, and provenance.
- harness: the execution layer that runs workflows on top of the substrate.
- provenance: the trace back to the source data, steps, or spans behind a node or answer.
- replay: rebuilding state or views from stored history.
- projection: turning authoritative history into queryable views or materialized state.
- authoritative evented path: the event-history-driven path that carries the strongest replay and provenance guarantees.
Example
Run the graph-native artifact demo to see the full loop end to end:
python -m kogwistar.demo.graph_native_artifact_demo
What it does:
- takes raw notes as input
- validates and normalizes them into graph artifacts
- links related artifacts
- commits the result into history
- rebuilds usable views from that history
- explains results with provenance
This is the quickest way to see how execution becomes memory in the same graph.
See docs/graph_native_artifact_demo.md for the short design note and docs/positioning.md for the deeper architectural framing.
If you want the closer agent-loop comparison, the older framework_then_agent_demo remains available as a companion example.
Today the repo implements graph memory and query, workflow design/runtime, provenance and replay-oriented surfaces, CDC/event-oriented patterns, and self-hostable development paths.
This repo is a substrate with reusable building blocks, especially around replayability, provenance tracking, and projection along the authoritative evented path.
Mental Model In Pseudocode
If the terminology feels abstract, read this first. It is the simplest way to understand the design:
input = user message, document, or workflow definition
events = capture(input)
validated = validate(events)
graph = normalize_to_nodes_and_edges(validated)
connected = link_related_graph_artifacts(graph)
commit(connected) # writes authoritative history
projection = rebuild_views_from_history()
answer = retrieve_with_provenance(projection, question)
In plain words:
- execution is recorded as durable history
- durable history is projected into graph state
- graph state can be replayed or repaired later
- provenance links answers back to the source steps and source spans
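As a rough illustration of that loop in plain Python (illustrative only; these names are not the repo's actual API): events are appended to an authoritative history, views are rebuilt by projecting over that history, and each derived fact keeps a pointer back to the event that produced it.

```python
from dataclasses import dataclass, field

@dataclass
class Event:
    """One immutable entry in the authoritative history."""
    event_id: int
    kind: str       # e.g. "note_added", "link_created"
    payload: dict

@dataclass
class GraphView:
    """A projection rebuilt from history; never the source of truth."""
    nodes: dict = field(default_factory=dict)        # node_id -> text
    provenance: dict = field(default_factory=dict)   # node_id -> event_id

def project(history: list[Event]) -> GraphView:
    """Rebuild the queryable view by replaying every event in order."""
    view = GraphView()
    for ev in history:
        if ev.kind == "note_added":
            view.nodes[ev.payload["id"]] = ev.payload["text"]
            view.provenance[ev.payload["id"]] = ev.event_id
    return view

# Execution writes history; the view is always derivable from it.
history = [
    Event(1, "note_added", {"id": "n1", "text": "pgvector stores embeddings"}),
    Event(2, "note_added", {"id": "n2", "text": "CDC streams graph changes"}),
]
view = project(history)
print(view.nodes["n1"])       # projected state
print(view.provenance["n1"])  # which event produced it -> 1
```

Because the view is pure projection, it can be deleted and rebuilt (replayed) at any time, and every answer drawn from it can be traced to a concrete event.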
More Details
For the layer model, guarantees, and comparisons, see docs/positioning.md and docs/LEARNING_PATH.md.
Graph-Native Artifact Pipeline
This is the clearest demo if you want to see how execution, memory, conversation, and provenance are unified in one graph.
Run the full suite with:
python -m kogwistar.demo.graph_native_artifact_demo
Use --summary-only if you want the one-screen proof view instead of the full report.
The demo runs three small scenarios:
- execution becomes memory
- conversation and workflow live in the same graph
- provenance answers “why” from stored execution history
These are not separate features; they come from the same underlying graph model.
The example stages raw notes, validates them, normalizes them into graph artifacts, links related notes, and commits the result into the same graph store that also holds workflow runs, step executions, and conversation turns. That makes the substrate point visible: execution is not just logged, it becomes graph memory; conversation and workflow are linked in one graph; and provenance queries can explain a result from stored history.
See docs/graph_native_artifact_demo.md for the short design note.
If you want the closer agent-loop comparison, the older framework_then_agent_demo remains available as a companion example.
Quickstart
- Because this is a substrate, you can start anywhere that interests you, e.g. hypergraph RAG, the runtime, the MCP server, conversation primitives, or workflow design. As your needs grow, you can compose the pieces consistently and coherently.
- Standalone setup and a simple flow in 2 minutes (no frontend integration): QUICKSTART.md
- Detailed comparison with adjacent products/frameworks: docs/llm-generated-comparison.md
- Author notes, build context, and design history: docs/author-notes.md
- Runtime rationale: kogwistar/docs/ARD-custom-runtime-rationale.md
- Local conversation walkthrough: docs/tutorials/conversation-pipeline-basics.md
- Roadmap and research direction: docs/roadmap.md
- Alternatively, let your AI agent read through the repo, set up the credentials/keys and environment variables for you, and run the appropriate commands.
Core Features
- Graph/hypergraph-oriented memory and query surfaces.
- Knowledge can go stale; the graph's state at any point in time can be traced back through history.
- Workflow design stored as graph structure, with runtime, replay, and event-oriented execution seams.
- Conversation execution events stored in the same graph/hypergraph.
- CDC-oriented graph updates and replay workflows (observability is free, not freemium).
- First-class, provenance-heavy primitives with lifecycle-aware and temporal retrieval support.
- Multiple storage backends, including Chroma and PostgreSQL/pgvector paths, with either dual-store eventual consistency or transactional atomicity.
- MCP/REST tooling surface for graph query, extraction, and admin operations.
- Visualization helpers for D3 payloads.
- Security-aware design: servers with RBAC and namespaces, built-in privacy guards on LLM calls, and slice guards in the data model to prevent data leakage.
- Since everything is a node, and each node carries provenance and embeddings, you can go beyond graph algorithms: trace provenance through everything and semantically search trace logs, design nodes, and conversation nodes. Future conversations can semantically search past history for wisdom.
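To make the "everything is a node with provenance and an embedding" point concrete, here is a minimal, self-contained sketch (hypothetical names and toy vectors, not the repo's API): because trace logs, design nodes, and conversation turns all carry embeddings, one similarity search ranks them together, and each hit retains its provenance pointer.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb)

# Hypothetical nodes of different kinds, each with provenance and an embedding.
nodes = [
    {"kind": "trace_log",    "provenance": "run-17/step-3", "vec": [0.9, 0.1]},
    {"kind": "design_node",  "provenance": "doc-4",         "vec": [0.2, 0.8]},
    {"kind": "conversation", "provenance": "turn-52",       "vec": [0.85, 0.2]},
]

def semantic_search(query_vec, k=2):
    """Rank every node type in one pass; each result keeps its provenance."""
    ranked = sorted(nodes, key=lambda n: cosine(query_vec, n["vec"]), reverse=True)
    return ranked[:k]

hits = semantic_search([1.0, 0.0])
# Trace logs and conversation turns surface alongside design nodes,
# and each result can be traced back via its provenance field.
```

The design consequence is that retrieval, debugging, and "wisdom" queries all share one search surface instead of three siloed ones.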
How This Differs
- Unlike typical agent products, this repo centers a unified graph/hypergraph substrate, more than only chat, skills, or tool orchestration.
- Unlike workflow-first frameworks, it treats provenance, replay, and event history (event sourcing) as part of the core data model rather than as secondary runtime features.
- Compared with local/self-hosted agent products, it emphasizes graph-native memory and workflow design seams more than channel breadth or app-registry breadth.
- It spans retrieval, memory, runtime, and provenance concerns together, so it maps less cleanly to a single existing OSS category.
Application Best Fit
- A strong base platform for building audit-heavy systems, ranging from local personal agents to scalable AI backends.
- Best suited for use cases where provenance, replay, lifecycle-aware retrieval, and workflow history matter.
- Designed to run efficiently on normal local machines (via SQLite/Chroma or lightweight Docker with PostgreSQL/pgvector).
- A foundation repo, not yet a finished enterprise product.
- Personal use cases that require robustness and stability.
- Collecting data for a future AI wisdom layer on what to do and how to do it.
Security Support
- Role- and namespace-based access control across API and MCP surfaces.
- OIDC/PKCE support for authenticated deployments, with simpler dev-mode auth for local work.
- Sandboxable runtime paths, including container-based execution with networking disabled by default.
- Security-aware boundaries are part of the design, but this repo should still be treated as a foundation platform rather than a fully hardened security product.
Research Direction
- Move toward privacy-first personal agents with stronger local-first memory and execution.
- Distill conversation and workflow traces into a future wisdom layer.
- Explore agents that can propose and revise their own workflow graphs under human-auditable constraints.
- These are research directions and design seams, not completed product claims.
- Runtime design rationale: kogwistar/docs/ARD-custom-runtime-rationale.md
- Research application note: docs/research-applications.md
Why This Repo Exists
- Raise the engineering bar for graph-native agent systems.
- Lower the barrier to entry for building high-quality agent memory and execution layers on normal local hardware.
- Provide a reusable foundation where retrieval, memory, workflow, and provenance are already structurally integrated.
- Additional motivation, build-cost context, and design history: docs/author-notes.md
Important Tool Sets
- Graph substrate and provenance toolkit for storing nodes, edges, grounding, lifecycle state, and replayable graph changes as first-class primitives.
- Conversation and retrieval orchestration toolkit for building local conversation flows, memory retrieval, KG retrieval, evidence pinning, and workflow-driven v2 turn execution.
- Workflow runtime and replay toolkit for graph-defined execution, checkpoints, joins, suspend/resume flows, and inspectable run traces.
- LLM task abstraction and strategy toolkit for provider-neutral extraction, filtering, summarization, answering, citation repair, verification, and merge/adjudication behavior.
- Server and MCP integration toolkit for exposing conversations, workflow runs, admin operations, and graph tooling through app and tool boundaries.
- Tutorial, CDC, and debugging toolkit for local development ladders, event-stream inspection, CDC viewers, and operational debugging workflows.
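As a generic illustration of the checkpoint/suspend/resume pattern the workflow runtime toolkit is built around (not the repo's actual API), a workflow can be modeled as named steps whose completed results are checkpointed, so a suspended run resumes where it left off instead of re-executing finished work:

```python
def run_workflow(steps, checkpoint=None):
    """Run named steps in order, skipping any already in the checkpoint.

    Returns the updated checkpoint dict, so a suspended run can be
    resumed later by passing the checkpoint back in.
    """
    done = dict(checkpoint or {})
    for name, fn in steps:
        if name in done:
            continue  # replay-safe: completed steps are not re-executed
        done[name] = fn()
    return done

steps = [
    ("extract", lambda: "entities"),
    ("link",    lambda: "edges"),
    ("commit",  lambda: "history-written"),
]

# Simulate a run suspended after "extract" by resuming from a partial checkpoint.
partial = {"extract": "entities"}
final = run_workflow(steps, checkpoint=partial)
```

In the real toolkit the checkpoint would live in the graph store alongside the run's trace, which is what makes runs inspectable and replayable after the fact.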
Run (Development)
- Clone the repo and enter it:
  - SSH: git clone git@github.com:humblemat810/kogwistar.git
  - HTTPS: git clone https://github.com/humblemat810/kogwistar.git
  - Then: cd kogwistar
- Create and activate a Python 3.13 environment.
- Install dependencies for local work.
- Pick a development mode:
  - Server-style MCP app: install with pip install -e ".[server,chroma]" and start the app with knowledge-mcp (defaults to port 28110).
  - CLI-style workflow/runtime loop: use the standalone tutorial/runtime script scripts/claw_runtime_loop.py described in QUICKSTART.md. This mode is useful for local development when you want to iterate on workflow execution, CDC flow, and event-loop behavior without running the full server surface.
Development and Test Install
- Minimal engine core:
pip install -e .
- Lightweight local development:
pip install -e ".[server,chroma,test]"
- Development with LangGraph-related tests:
pip install -e ".[server,chroma,test,langgraph]"
- PostgreSQL parity/integration work:
pip install -e ".[server,pgvector,test]"
- Full local contributor setup:
pip install -e ".[full,test]"
Run Tests
- Run the default test suite:
pytest
- Run a specific test file:
pytest tests/workflow/test_save_load_progress.py -q
- Real Keycloak OIDC end-to-end test:
pytest tests/server/test_oidc_keycloak_e2e.py -q
- Browser-visible manual Keycloak OIDC test:
pytest tests/server/test_oidc_keycloak_manual.py::test_oidc_keycloak_browser_manual --run-manual -q -s
- Some integration tests may require Docker/testcontainers or optional extras such as langgraph.
Install Options
- Base/core only:
pip install -e .
- Server/MCP runtime:
pip install -e ".[server]"
- Chroma backend:
pip install -e ".[chroma]"
- PostgreSQL + pgvector backend:
pip install -e ".[pgvector]"
- LLM provider (OpenAI):
pip install -e ".[openai]"
- LLM provider (Gemini):
pip install -e ".[gemini]"
- Ingestion with OpenAI:
pip install -e ".[ingestion-openai]"
- Ingestion with Gemini:
pip install -e ".[ingestion-gemini]"
- LangGraph converter:
pip install -e ".[langgraph]"
- Everything:
pip install -e ".[full]"
GitHub Install
- Use the package name, not the repo name, when installing with extras from a repository.
- Examples:
  - pip install "kogwistar[chroma] @ git+ssh://git@github.com/humblemat810/kogwistar.git@main"
  - pip install "kogwistar[pgvector,openai] @ git+ssh://git@github.com/humblemat810/kogwistar.git@<commit>"
- For HTTPS-based installs, use the same direct-reference form with a token-authenticated git+https://... URL.
Runtime Configuration
- Backend selection: GKE_BACKEND=chroma|pg
- Shared local persistence root: GKE_PERSIST_DIRECTORY=/path/to/data
- Chroma-specific overrides:
  - GKE_KNOWLEDGE_PERSIST_DIRECTORY
  - GKE_CONVERSATION_PERSIST_DIRECTORY
  - GKE_WORKFLOW_PERSIST_DIRECTORY
  - GKE_WISDOM_PERSIST_DIRECTORY
  - Legacy MCP_CHROMA_DIR* envs still work.
- Postgres-specific settings:
  - GKE_PG_URL
  - GKE_PG_SCHEMA
  - GKE_EMBEDDING_DIM
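A hedged sketch of how code might consume the settings above (the GKE_* variable names come from this README; the fallback values and the `load_backend_config` helper are illustrative assumptions, not documented behavior):

```python
import os

def load_backend_config(env=os.environ):
    """Resolve backend settings from GKE_* variables with simple fallbacks."""
    backend = env.get("GKE_BACKEND", "chroma")  # "chroma" or "pg"
    cfg = {
        "backend": backend,
        # Shared persistence root; "./data" default is an assumption.
        "persist_dir": env.get("GKE_PERSIST_DIRECTORY", "./data"),
    }
    if backend == "chroma":
        # Per-store overrides fall back to the shared persistence root.
        for store in ("KNOWLEDGE", "CONVERSATION", "WORKFLOW", "WISDOM"):
            key = f"GKE_{store}_PERSIST_DIRECTORY"
            cfg[store.lower() + "_dir"] = env.get(key, cfg["persist_dir"])
    elif backend == "pg":
        cfg["pg_url"] = env.get("GKE_PG_URL")
        # "public" schema and 768-dim defaults are illustrative assumptions.
        cfg["pg_schema"] = env.get("GKE_PG_SCHEMA", "public")
        cfg["embedding_dim"] = int(env.get("GKE_EMBEDDING_DIM", "768"))
    return cfg

cfg = load_backend_config({"GKE_BACKEND": "pg",
                           "GKE_PG_URL": "postgresql://localhost/kge"})
```

Passing a plain dict instead of `os.environ` keeps the resolution logic easy to unit-test.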
Local Docker / Compose
- Build and run with embedded Chroma persistence:
docker compose --profile chroma up -d
- Build and run with PostgreSQL + pgvector:
docker compose --profile pg up -d
- Start Keycloak only (OIDC dev realm):
docker compose --profile auth up -d keycloak
- Stop the stack:
docker compose down
- Stop and delete named volumes:
docker compose down -v
- The chroma profile uses an embedded Chroma-backed app container with named volumes.
- The pg profile starts both the app container and a pgvector Postgres container.
Auth Modes
The server supports two auth modes controlled by AUTH_MODE:
- AUTH_MODE=oidc (default)
  - Uses OIDC Authorization Code + PKCE.
  - Compose defaults point to the local Keycloak realm imported from keycloak/realm-kge.json.
  - Default client: kge-local (public client).
  - Default user: dev/dev.
  - Required envs when running locally without compose: OIDC_DISCOVERY_URL, OIDC_CLIENT_ID, OIDC_REDIRECT_URI, UI_URL
- AUTH_MODE=dev
  - Skips OIDC and issues an app JWT directly on GET /api/auth/login.
  - Defaults are configurable with: DEV_AUTH_EMAIL, DEV_AUTH_SUBJECT, DEV_AUTH_NAME, DEV_AUTH_ROLE, DEV_AUTH_NS
Windows note (Keycloak realm volume)
compose.yml uses a relative bind mount for keycloak/realm-kge.json. On Windows with Docker Desktop, this is the most reliable option.
Best practices:
- Keep the realm file inside the repo (relative path).
- Avoid drive-letter paths in compose.yml unless you must.
- Ensure the file exists before running docker compose up; otherwise Keycloak starts without importing it.
Relationship to LangGraph
LangGraph focuses on execution: it provides a strong runtime model for orchestrating stateful workflows with persistence and resume capabilities.
Kogwistar overlaps with that execution layer, but extends the system model by treating workflow structure, conversation structure, knowledge, and provenance as first-class graph entities. Rather than representing state primarily as messages or dictionaries, it aims to unify these domains on a shared node/edge/event substrate with replay and projection semantics.
The difference is not only in execution, but in what is considered part of the system's core data model.
License
MIT. See LICENSE.
Author thoughts and ideas: docs/author-notes.md