Two-pass LLM pipeline that turns Markdown files into a confidence-scored knowledge graph with an induced RDFS/OWL ontology

These details have not been verified by PyPI

Project description

myKG — Knowledge Graph Extractor

myKG automatically generates a confidence-scored knowledge graph from a directory of Markdown files, grounded in an induced RDFS/OWL ontology schema.

It uses a two-pass LLM pipeline: Pass 1 induces a global RDFS/OWL schema from your document corpus; Pass 2 extracts typed entity and relationship instances per file against that schema. The result is exported to multiple formats: JSONL for property-graph consumers such as Neo4j, Turtle RDF for OWL toolchains, and seven NetworkX formats for graph analysis and visualization.

Command line

mykg extract-graph my_notes/

Output

sessions/2026-05-17T18-31-07/
  output/
    nodes.jsonl                    ← typed entities with confidence scores
    edges.jsonl                    ← typed relationships with provenance
    knowledge_graph.ttl            ← RDFS/OWL TBox + RDF ABox (Protégé, SPARQL)
    networkx_output/               ← GML, GraphML, GEXF, Pajek, JSON node-link,
                                      knowledge_graph.html (interactive vis)
  walkthrough.md                   ← per-run report: schema, stats, timing

Features
Quick Start
Using with Claude Code
Configuration
Extract Pipeline
Advanced Options
Development
Roadmap
Design

Features

Ontology-Guided Extraction

Schema-guided knowledge graph generation — the extracted graph is always grounded in a formal RDFS/OWL schema: concept types, property names, domain/range constraints, and the is-a hierarchy are explicit and inspectable before any entity is extracted
Bring your own ontology — supply a --base-schema TTL file to lock in classes and properties from an existing formal ontology; the LLM expands it with domain-specific concepts but cannot rename, remove, or contradict your authoritative vocabulary
SKOS thesaurus support — pass --thesaurus to load a SKOS vocabulary; skos:exactMatch terms are collapsed silently, skos:closeMatch terms trigger a warning — giving the schema merger richer synonym awareness than string matching alone
Verifiable TTL ontology — after Pass 1, the induced schema is exported as a valid RDFS/OWL Turtle file (intermediate/schema.ttl) that can be opened directly in ontology editors such as Protégé. The TTL is validated by rdflib (syntax + semantic checks: domain/range refer to declared classes, no conflicting ranges) before any extraction begins
Human-in-the-loop ontology design — run with --review to pause after schema induction; inspect and edit schema.json (or load schema.ttl in Protégé, modify, and save back) before a single entity is extracted; resume with mykg approve-schema
Incremental updates — run with --append on an existing session to add new or modified Markdown files without re-running Pass 1; the schema is reused and only the new files go through Pass 2
AI coding assistant friendly — designed for smooth use alongside AI coding assistants such as Claude Code; run extractions, inspect outputs, and iterate on your knowledge graph without leaving your coding environment; see Using with Claude Code

Input

Markdown files — any directory of .md files; subdirectory structure is preserved; YAML/TOML frontmatter, headings, lists, and code blocks are all treated as structural signals
Other formats — convert PDFs, Word docs, HTML, and other formats to Markdown first using a document parser such as MinerU, then point myKG at the output directory

Graph & Output

Provider-agnostic — works with Anthropic (Claude), OpenAI (GPT-4o), Ollama (local), OpenRouter, or the claude CLI with no API key
Three output families — JSONL for Neo4j/NetworkX/RAG, Turtle RDF for OWL toolchains, NetworkX multi-format for graph analysis
Interactive HTML graph — node/edge filtering, search, hover popups; opens directly in a browser
Confidence scoring — every extracted attribute, node, and edge carries a 0.0–1.0 confidence score
Name normalization — surface-form variants ("Acme Corp", "ACME", "Acme Corporation") resolved to a single canonical node with aliases
Orphan-connection pass — reconnects isolated nodes via co-occurrence heuristic + LLM confirmation
Cross-session merge — combine two independently-produced graphs into one unified knowledge graph
Resumable pipeline — every stage persists intermediate state; re-enter at any step after a crash or edit
Session isolation — each run is fully self-contained; inputs, intermediate state, outputs, and logs co-located
Query knowledge graph — natural-language and structured queries directly against the extracted graph via AI coding assistants such as Claude Code, SPARQL endpoints, or graph traversal APIs

Quick Start

Requires Python 3.11+ and one of: an Anthropic/OpenAI/OpenRouter API key, Ollama running locally, or the claude CLI.

Install from PyPI

pip install mykg

mykg init            # interactive setup: choose provider, paste API key
                     # writes pipeline_config.yaml and .env in one step

# Run
mykg extract-graph my_notes/
# → open sessions/<timestamp>/output/knowledge_graph.html in your browser

Install from source

# 0. Install uv (if not already installed)
curl -LsSf https://astral.sh/uv/install.sh | sh   # macOS / Linux
# Windows: winget install astral-sh.uv

# 1. Install
git clone https://github.com/SenolIsci/mykg && cd mykg
uv sync

# 2. Configure
mykg init           # interactive: choose provider, paste API key

# 3. Run
uv run mykg extract-graph my_notes/

For Ollama (no API key needed):

ollama pull llama3.3
# set profile: ollama-local in pipeline_config.yaml
mykg extract-graph my_notes/

Using with Claude Code

myKG ships with a claude-cli profile that runs extractions through the locally-installed claude CLI — no API key or billing setup needed beyond your existing Claude Pro/Max plan.

Setup

# 1. Install the claude CLI (if not already installed)
npm install -g @anthropic-ai/claude-code

# 2. Install mykg and run the setup wizard
pip install mykg
mkdir my-kg-project && cd my-kg-project
mykg init
# → select [5] Claude CLI when prompted (no API key needed)

# 3. Run
mykg extract-graph my_notes/

How it works

The claude-cli provider calls claude -p as a subprocess for every LLM step (Pass 1 schema induction, Pass 2 extraction, orphan connection, name normalization). All pipeline features — session isolation, resumability, orphan recovery, cross-session merge — work identically to API-based providers.

Key constraints of the claude-cli profile:

max_workers must be 1 — the claude CLI is serial by design; parallel workers will queue
No API key required — billing goes through your Claude Pro/Max subscription
The effort and model fields in pipeline_config.yaml map directly to --effort and --model flags passed to claude -p

Using myKG from inside Claude Code

You can run myKG extractions as a tool call from within a Claude Code session. This is useful for building knowledge graphs from notes or documentation while you work:

# From any Claude Code session terminal:
mykg extract-graph ./docs/ --session my-docs-kg

# Then reference the output in your session:
# sessions/my-docs-kg/output/nodes.jsonl
# sessions/my-docs-kg/output/knowledge_graph.ttl

Claude Code can then read nodes.jsonl or edges.jsonl directly to answer questions about the extracted graph, or load knowledge_graph.ttl into a SPARQL tool for structured queries.

Recommended `pipeline_config.yaml` settings for Claude Code

profile: claude-cli

profiles:
  claude-cli:
    llm:
      model: sonnet       # or opus for higher quality
      effort: medium      # low | medium | high
    pipeline:
      pass1:
        max_workers: 1    # required — claude CLI is serial
      pass2:
        max_workers: 1

Configuration

All configuration lives in a single pipeline_config.yaml file discovered automatically from the working directory (or any parent). There are no hardcoded defaults in the code — the YAML is the sole source of truth.

mykg init           # interactive setup: choose provider, paste API key,
                    # writes pipeline_config.yaml and .env in one step
mykg init --force   # overwrite an existing config
mykg init --profile openrouter-free --api-key sk-or-...   # non-interactive

API Keys

myKG reads API keys from environment variables. Set them by exporting directly or by creating a .env file in your project directory (loaded automatically on startup).

Option A — export in your shell:

export ANTHROPIC_API_KEY=sk-ant-...

Option B — create a .env file:

# .env
ANTHROPIC_API_KEY=sk-ant-...

Variable	Profile	Notes
`ANTHROPIC_API_KEY`	`anthropic-claude`	Claude API key
`OPENAI_API_KEY`	`openai`	OpenAI API key
`OPENROUTER_API_KEY`	`openrouter-free`	OpenRouter API key
(none required)	`claude-cli`	Billing via Claude Pro/Max subscription
(none required)	`ollama-local`	Local inference, no account needed

For source installs you can also copy sample.env to .env as a starting template.

LLM Providers

Provider	Profile name	API key env var	Notes
Anthropic (Claude)	custom (see Quick Start)	`ANTHROPIC_API_KEY`	Recommended for quality
OpenAI (GPT-4o)	`openai`	`OPENAI_API_KEY`
Ollama	`ollama-local`	—	Local inference, no key needed
OpenRouter	`openrouter-free`	`OPENROUTER_API_KEY`	Access many models via one key
Claude CLI	`claude-cli`	—	Uses `claude -p` subprocess; billing via Claude Pro/Max; serial only

Switch provider by setting profile: at the top of pipeline_config.yaml.

Key Pipeline Parameters

Key	Default	Description
`pipeline.chunking.window_tokens`	`2000`	Chunk size in tokens
`pipeline.chunking.overlap_tokens`	`200`	Overlap between adjacent chunks
`pipeline.pass1.batch_token_target`	`8000`	Max tokens per Pass 1 LLM batch
`pipeline.pass1.max_workers`	`4`	Parallel LLM workers for Pass 1
`pipeline.pass2.max_workers`	`1`	Parallel workers for Pass 2
`pipeline.pass2.stateful_chunks`	`false`	Pass prior-chunk node IDs to subsequent chunks for stable IDs
`pipeline.pass2.prep_mode`	`per_file`	`per_file` \| `concat` \| `batch_chunks`
`pipeline.normalize_names.enabled`	`true`	Run LLM name normalization step
`pipeline.orphan_pass.enabled`	`true`	Run the orphan-connection pass
`pipeline.orphan_pass.schema_max_restarts`	`1`	Max automated Pass 2 restarts from schema-gap recovery
`pipeline.export.networkx_enabled`	`true`	Write NetworkX formats to `output/networkx_output/`
`pipeline.error_gate.enabled`	`true`	Pause all workers on repeated API errors

Run context-calculator --context <N> --max-output <M> to compute correct window_tokens and batch_token_target for a different model's context window.

Extract Pipeline

Reads a directory of .md files and produces a typed knowledge graph in three output formats. The pipeline runs 11 sequential steps; all intermediate state is persisted so any step can be re-entered without repeating upstream work.

Running

mykg extract-graph <input_dir> [OPTIONS]
# source installs: uv run mykg extract-graph <input_dir> [OPTIONS]

<input_dir> is any directory of .md files. Subdirectories are included recursively.

Options

Option	Description
`--session NAME`	Resume an existing session by folder name
`--from-step NAME`	Delete a step's outputs and re-run from that point
`--review`	Pause after Pass 1 for manual schema review
`--append`	Skip Pass 1; re-run only on new/modified files
`--workers N`	Parallel workers for Pass 2
`--confidence-agg mean\|max`	Confidence aggregation when deduplicating
`--base-schema PATH`	Locked TBox TTL file (locked classes/properties cannot be changed by the LLM)
`--thesaurus PATH`	SKOS TTL thesaurus for synonym resolution in schema merge
`--log-file PATH`	Write logs here (relative paths placed inside the session folder)
`--verbose / -v`	Enable DEBUG-level logging

Examples

# New run — auto-creates a timestamped session
mykg extract-graph my_notes/

# Resume a session with 4 parallel Pass 2 workers
mykg extract-graph my_notes/ --session 2026-05-17T18-31-07 --workers 4

# Pause for schema review after Pass 1
mykg extract-graph my_notes/ --review
# → edit sessions/<name>/intermediate/schema.json
mykg approve-schema --session 2026-05-17T18-31-07
mykg extract-graph my_notes/ --session 2026-05-17T18-31-07 --review

# Re-run from assembly onward (reuses existing extractions)
mykg extract-graph my_notes/ --session 2026-05-17T18-31-07 --from-step assemble

# Lock a base ontology so the LLM won't rename its classes
mykg extract-graph my_notes/ --base-schema ontology/core.ttl

Sessions

Every run automatically creates an isolated session folder:

sessions/
  2026-05-17T18-31-07/
    input/           ← archived copy of all input Markdown files
    intermediate/    ← all intermediate pipeline state
    output/          ← final outputs (JSONL, TTL, HTML, NetworkX)
    run.log          ← log file
    walkthrough.md   ← post-run report

Sessions are the primary unit of resumability. Pass --session <name> to resume from the last completed step. Pass --from-step <step> to force-restart from a specific point.

The sessions root is configurable via pipeline.paths.sessions_dir (default: sessions/ in the current directory).

Pipeline Steps

The pipeline runs 11 steps in sequence. All intermediate state is written to disk so any step can be re-entered without repeating upstream work.

#	Step	LLM	Key outputs
1	`ingest`	—	`file_manifest.json`
2	`pass1`	✓ (3 calls)	`schema.json`, `schema.ttl`, `schema_history/`
3	`schema_validate`	—	`schema_validate.done`
4	`human_review`	—	`schema_approved.flag` (only with `--review`)
5	`schema_flatten`	—	`flattened_schema.json`
6	`pass2`	✓	`raw_extractions.json`, `chunk_node_index.json`
7	`normalize_names`	✓	`name_normalization.json`
8	`assemble`	—	`edge_metadata.json`, `nodes.json`, `merge_log.json`
9	`orphan_score`	—	`orphan_candidates.json`
10	`orphan_connect`	✓	`orphan_connections.json`, `orphan_log.json`
11	`validate_graph`	—	`nodes.jsonl`, `edges.jsonl`, `knowledge_graph.ttl`, `knowledge_graph.html`, `networkx_output/`

Pass 1 internally runs four sequential stages: parallel batch induction → algorithmic merge → harmonization LLM call → quality review LLM call.

Outputs

Property Graph (JSONL)

nodes.jsonl — one JSON line per entity:

{
  "id": "person-alice",
  "type": "Person",
  "confidence": 0.94,
  "source_files": ["team.md"],
  "attributes": {
    "name":  {"value": "Alice",          "confidence": 1.0},
    "email": {"value": "alice@acme.com", "confidence": 0.88}
  },
  "aliases": ["Alice Smith", "A. Smith"]
}

edges.jsonl — one JSON line per relationship:

{
  "id": "works_at-abc123",
  "type": "works_at",
  "from": "person-alice",
  "to": "org-acme-corp",
  "confidence": 0.96,
  "method": "llm_extraction",
  "attributes": {
    "role":       {"value": "Engineer", "confidence": 0.91},
    "start_date": {"value": null,       "confidence": 0.0}
  }
}

Missing attributes are never dropped — they are represented as {"value": null, "confidence": 0.0}.

The method field distinguishes edges extracted by Pass 2 (llm_extraction) from edges inferred by the orphan pass (orphan_inferred).

RDF / OWL (Turtle)

knowledge_graph.ttl — pure RDFS/OWL triples, no edge metadata:

@prefix ex: <http://mykg.local/schema/> .
@prefix :   <http://mykg.local/data/> .

ex:Person  a rdfs:Class .
ex:works_at  rdfs:domain ex:Person ;  rdfs:range ex:Organization .

:person-alice  a ex:Person ;  rdfs:label "Alice" .
:person-alice  ex:works_at  :org-acme-corp .

Load in Protégé, query with SPARQL (Fuseki, GraphDB), or reason with HermiT/Pellet.

Interactive HTML

knowledge_graph.html — self-contained D3.js force-directed graph. Open in any browser, no server required. Supports:

Filter nodes and edges by type
Filter by confidence threshold
Search by name
Hover popups with full attribute values
Resizable sidebar

NetworkX Formats (`networkx_output/`)

File	Format	Best for
`knowledge_graph.graphml`	GraphML	yEd, Gephi, Cytoscape
`knowledge_graph.gexf`	GEXF	Gephi native (rich metadata)
`knowledge_graph.json`	JSON node-link	D3.js, Sigma.js, web apps
`knowledge_graph.gml`	GML	Human-readable inspection
`knowledge_graph.net`	Pajek	Network analysis
`edges_nx.txt`	Edge list	Text pipelines
`adjacency.txt`	Adjacency list	Topology consumers

Node/edge attributes are exported as attr_<name>_value / attr_<name>_confidence scalar pairs for GML compatibility.

Re-running from a Specific Step

Use --from-step to delete a step's outputs and all downstream outputs, then re-run from that point.

SESSION=2026-05-17T18-31-07

# Re-run from Pass 2 (reuse the existing schema)
mykg extract-graph my_notes/ --session $SESSION --from-step pass2

# Re-run only assembly + export (reuse raw extractions)
mykg extract-graph my_notes/ --session $SESSION --from-step assemble

# Re-run both orphan stages
mykg extract-graph my_notes/ --session $SESSION --from-step orphan_score

# Orphan LLM pass only — full clean sweep
mykg extract-graph my_notes/ --session $SESSION --from-step orphan_connect_fullsweep

# Orphan LLM pass only — additive (preserves prior confirmed edges)
mykg extract-graph my_notes/ --session $SESSION --from-step orphan_connect_incremental

Four re-entry patterns:

Pattern	When to use	Command
A — Schema changed	Wrong concept types, missing properties	Edit `schema.json` → `approve-schema` → `--from-step pass1`
B — Extraction errors	LLM missed entities or invented edge types	Edit shard in `raw_extractions_shards/` → `--from-step pass2`
C — Assembly errors	Bad dedup decisions in `merge_log.json`	Edit `raw_extractions.json` → `--from-step assemble`
D — Orphan pass	Wrong candidates or confirmations	`--from-step orphan_score` or `orphan_connect_fullsweep`

Orphan-Connection Pass

After assembly, nodes with zero edges are "orphans" — present in the graph but unreachable by traversal. The orphan pass reconnects them in two stages:

Stage 1 — orphan_score (no LLM): Uses chunk_node_index.json to find nodes that co-occur in the same source chunk as each orphan. Candidates are scored by co-occurrence frequency and filtered by schema type compatibility. Written to orphan_candidates.json.

Stage 2 — orphan_connect (LLM): One LLM call per source chunk. The prompt includes the full chunk text, all orphan IDs from that chunk, co-occurring connected nodes, and all schema properties. Confirmed edges carry "method": "orphan_inferred" and are merged directly into edge_metadata.json.

Unconnectable orphans (no resolvable source chunk) are logged as orphan_unconnectable advisory events in orphan_log.json.

Configure via pipeline.orphan_pass.* in pipeline_config.yaml. Disable entirely with pipeline.orphan_pass.enabled: false.

Advanced Options

Human Review Gate (`--review`)

Pause after Pass 1 to inspect and edit the induced schema before Pass 2 runs:

mykg extract-graph my_notes/ --review
# → pipeline halts; edit sessions/<name>/intermediate/schema.json
mykg approve-schema --session <name>
mykg extract-graph my_notes/ --session <name> --review   # resumes from Pass 2

Locked Base Schema (`--base-schema`)

Lock certain classes and properties so the LLM cannot rename, remove, or restructure them:

mykg extract-graph my_notes/ --base-schema ontology/base.ttl

Locked entries can still receive additional attributes proposed by the LLM. Near-duplicate LLM proposals are collapsed into the locked entry with a warning.

SKOS Thesaurus (`--thesaurus`)

Resolve near-duplicate concept names during schema merge using a SKOS vocabulary:

mykg extract-graph my_notes/ --thesaurus ontology/terms.skos.ttl

skos:exactMatch → silent collapse
skos:closeMatch → collapse with warning in merge_log.json
skos:broader / skos:narrower → advisory hints only

Append Mode

Re-run the pipeline on new or modified files without re-running Pass 1:

mykg extract-graph my_notes/ --session <name> --append

Merging Sessions

Combine two independently-produced sessions into a unified knowledge graph:

mykg merge-graphs <session-A> <session-B> [OPTIONS]

# Example
mykg merge-graphs 2026-05-01T10-00-00 2026-05-15T14-30-00

# Resume a merge (last incomplete step auto-detected)
mykg merge-graphs A B --output-session <merged-name>

Options:

Option	Description
`--output-session TEXT`	Name for the merged session (default: auto-timestamped)
`--no-review`	Skip the human review gate after schema merge
`--thesaurus PATH`	SKOS thesaurus for schema synonym matching
`--base-schema PATH`	Locked TBox TTL base schema
`--from-step NAME`	Force re-run from a specific merge step

What happens:

Both schemas are merged via the same three-stage chain as Pass 1 (algorithmic union → LLM harmonization → LLM quality review)
All file-keyed structures are namespaced (session_a/<filename>, session_b/<filename>) before merging
Nodes are deduplicated across sessions: same type + canonical name → single node, regardless of source session
Re-extraction strategy (none / surgical / full) handles properties absent from one session's schema
source_map.json records full file provenance; merge_manifest.json records schema deltas and strategy used
walkthrough.md includes a Merge Provenance section with before/after counts and node/edge breakdowns

Configure the re-extraction strategy:

merge_graphs:
  reextraction_strategy: surgical   # none | surgical | full

Walkthrough Report

A human-readable summary is written to sessions/<name>/walkthrough.md after every run:

# Regenerate the walkthrough for an existing session
mykg walkthrough --session 2026-05-17T18-31-07

Disable with pipeline.report.enabled: false.

Development

Installation

git clone https://github.com/SenolIsci/mykg && cd mykg
uv sync

Testing

# All non-live tests (fast, no API key needed)
uv run pytest -m "not live" -v

# All tests including live API integration tests
# Requires OPENROUTER_API_KEY in environment or .env (see sample.env)
uv run pytest -m live -v

# Single file
uv run pytest tests/test_assembler.py -v

# With coverage (HTML report at htmlcov/index.html)
uv run pytest -m "not live"
open htmlcov/index.html

Linting and Formatting

uv run ruff check src/ tests/          # lint
uv run ruff check --fix src/ tests/    # auto-fix
uv run ruff format src/ tests/         # format

Token Budget Calculator

When switching to a model with a different context window:

context-calculator --context 128000 --max-output 16384

Outputs a ready-to-paste YAML snippet for the pipeline: block.

Profiling

python -m cProfile -o profile.out -m mykg.cli extract input_files/
uv run snakeviz profile.out

Roadmap

Query knowledge graph — natural-language and structured queries directly against the extracted graph; planned support for SPARQL, graph traversal, and LLM-assisted Q&A over nodes and edges

Design

For a thorough description of the architecture, algorithm, data models, and design decisions, see architecture.md.

License

MIT — see LICENSE.

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.2.10

May 30, 2026

0.2.9 yanked

May 29, 2026

0.2.8 yanked

May 29, 2026

0.2.7

May 28, 2026

0.2.6 yanked

May 28, 2026

0.2.5 yanked

May 28, 2026

0.2.4 yanked

May 28, 2026

This version

0.2.3 yanked

May 28, 2026

0.2.2 yanked

May 28, 2026

0.2.1 yanked

May 28, 2026

0.2.0 yanked

May 28, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mykg-0.2.3.tar.gz (485.1 kB view details)

Uploaded May 28, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

mykg-0.2.3-py3-none-any.whl (165.8 kB view details)

Uploaded May 28, 2026 Python 3

File details

Details for the file mykg-0.2.3.tar.gz.

File metadata

Download URL: mykg-0.2.3.tar.gz
Upload date: May 28, 2026
Size: 485.1 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.9.28 {"installer":{"name":"uv","version":"0.9.28","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for mykg-0.2.3.tar.gz
Algorithm	Hash digest
SHA256	`f481c7b2070778cb2d2a6538992ad77c8dbe1d49c4d8a73d1662bf9adfe76d61`
MD5	`db2d4a1ab69a8203d4750f2264ec9419`
BLAKE2b-256	`4041875c51a61d0c43698fc9c80ddac4c3ba3800d2a530efc61070821046763d`

See more details on using hashes here.

File details

Details for the file mykg-0.2.3-py3-none-any.whl.

File metadata

Download URL: mykg-0.2.3-py3-none-any.whl
Upload date: May 28, 2026
Size: 165.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.9.28 {"installer":{"name":"uv","version":"0.9.28","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for mykg-0.2.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`12f76e10d4b730aaf8a67d621901a319266a7f9c1d8587ab94e3867b06c86873`
MD5	`2f66a00168213f5c234509977bfcf4f1`
BLAKE2b-256	`cddfab1ebc35e614e8ee3b10cf26fb244fc1334882f7eb151cc145ca871c7bd3`

See more details on using hashes here.

mykg 0.2.3

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

myKG — Knowledge Graph Extractor

Command line

Output

Contents

Features

Ontology-Guided Extraction

Input

Graph & Output

Quick Start

Install from PyPI

Install from source

Using with Claude Code

Setup

How it works

Using myKG from inside Claude Code

Recommended pipeline_config.yaml settings for Claude Code

Configuration

API Keys

LLM Providers

Key Pipeline Parameters

Extract Pipeline

Running

Options

Examples

Sessions

Pipeline Steps

Outputs

Property Graph (JSONL)

RDF / OWL (Turtle)

Interactive HTML

NetworkX Formats (networkx_output/)

Re-running from a Specific Step

Orphan-Connection Pass

Advanced Options

Human Review Gate (--review)

Locked Base Schema (--base-schema)

SKOS Thesaurus (--thesaurus)

Append Mode

Merging Sessions

Walkthrough Report

Development

Installation

Testing

Linting and Formatting

Token Budget Calculator

Profiling

Roadmap

Design

License

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

Recommended `pipeline_config.yaml` settings for Claude Code

NetworkX Formats (`networkx_output/`)

Human Review Gate (`--review`)

Locked Base Schema (`--base-schema`)

SKOS Thesaurus (`--thesaurus`)