Skip to main content

AI-powered knowledge management with YAKE keyword extraction for Claude Code development

Project description

Claude Knowledge Catalyst (CKC) v0.10.1

Claude Code โ‡„ Obsidian Seamless Integration System

Automatically synchronize insights from Claude Code development processes with Obsidian vaults for structured knowledge management. Automated analysis reduces manual classification overhead.

๐Ÿ“‹ Japanese Version | ๐ŸŒ Documentation

Python 3.11+ PyPI version PyPI downloads License: MIT Ruff Documentation

๐ŸŽฏ Claude Code โ‡„ Obsidian Seamless Integration

๐Ÿ”„ Automatic Synchronization System

  • Real-time Sync: Instantly reflect changes in .claude/ directory to Obsidian vault
  • Bidirectional Integration: Complete integration between Claude Code development and Obsidian knowledge management
  • Structured Organization: Systematize knowledge using Obsidian's powerful features

๐Ÿค– Automated Metadata Enhancement with YAKE Integration

  • Advanced Keyword Extraction: YAKE (Yet Another Keyword Extractor) for unsupervised keyword discovery
  • Multi-Language Support: English, Japanese, Spanish, French, German, Italian, Portuguese
  • Smart Tagging: AI-powered tag suggestions with confidence scoring
  • Evidence-Based Classification: Reliable organization with clear rationale for automated decisions
# Enhanced Metadata Example (Secondary Effect)
type: [prompt, code, concept, resource]           # Content nature
tech: [python, react, fastapi, kubernetes, ...]   # Technology stack
domain: [web-dev, ml, devops, mobile, ...]        # Application domain
team: [backend, frontend, ml-research, devops]    # Team ownership
status: [draft, tested, production, deprecated]   # Lifecycle state
complexity: [beginner, intermediate, advanced]    # Skill level
confidence: [low, medium, high]                   # Content reliability

๐Ÿ›๏ธ Obsidian-Optimized Vault Structure

obsidian-vault/
โ”œโ”€โ”€ _system/          # Templates and configuration
โ”œโ”€โ”€ _attachments/     # Media files
โ”œโ”€โ”€ inbox/            # Unprocessed content
โ”œโ”€โ”€ active/           # Work-in-progress content
โ”œโ”€โ”€ archive/          # Completed/deprecated content
โ””โ”€โ”€ knowledge/        # Mature knowledge (main area)

Prerequisites

  • uv: Modern Python package manager (includes Python 3.11+ automatically)
    • Installation: Follow the official uv installation guide
    • Quick install: curl -LsSf https://astral.sh/uv/install.sh | sh (Unix/macOS) or powershell -ExecutionPolicy ByPass -c "irm https://astral.sh/uv/install.ps1 | iex" (Windows)
  • Python: Not required separately - uv manages Python 3.11+ automatically

๐ŸŽฏ 3-Minute Claude Code โ‡„ Obsidian Integration Experience

๐Ÿš€ v0.10.1 Test Stabilization: Complete test-until-pass implementation with 396/396 passing tests and 48.09% test coverage ensures production stability.

Experience seamless integration:

# Install CKC
uv pip install claude-knowledge-catalyst

# Initialize in Claude Code project
cd your-claude-project
uv run ckc init

# Connect to Obsidian vault
uv run ckc add my-vault /path/to/obsidian/vault

# Sync .claude/ files with Obsidian
uv run ckc sync

What happens:

  • โœ… Seamless Integration: Complete integration between Claude Code development and Obsidian knowledge management
  • โœ… Automatic Structuring: Organize .claude/ content with Obsidian-optimized structure
  • โœ… Enhanced Metadata: Automatic tagging that reduces manual classification
  • โœ… Real-time Sync: Instantly reflect knowledge accumulation during development process

Core Features

๐Ÿ”„ Claude Code โ‡„ Obsidian Complete Integration

  • Seamless Sync: Automatic bidirectional sync between .claude/ directory and Obsidian vault
  • Structured Migration: Optimization and structural enhancement of existing Obsidian vaults
  • Dynamic Query Generation: Automatic generation of Obsidian dataview queries
  • Knowledge Discovery: Cross-project search of Claude Code development insights within Obsidian

๐Ÿš€ YAKE Keyword Extraction (New in v0.10.0)

  • Unsupervised Learning: Extract keywords without training data
  • Multi-Language: Automatic language detection and processing
  • Confidence Scoring: Filter high-quality keyword suggestions
  • Technical Content: Optimized for technical documentation and code

๐Ÿ”’ Secure CLAUDE.md Sync

  • Privacy-First: Section-level filtering for sensitive information
  • Configurable Exclusion: Protect API keys, credentials, personal data
  • Safe by Default: CLAUDE.md sync disabled unless explicitly enabled

๐Ÿ“Š Obsidian Integrated Analytics

  • Knowledge Usage Tracking: Analyze knowledge utilization patterns in Claude Code development
  • Prompt Effectiveness Measurement: Success rates and improvement suggestions within Obsidian
  • Cross-Project Insights: Discover relationships between development insights
  • Team Knowledge Sharing: Collaborative knowledge management through Obsidian

๐ŸŽจ Obsidian-Optimized Templates

  • Claude Code Specialized: Obsidian templates for prompts, code, concepts, and resources
  • Smart Suggestions: Automatic template selection based on development context
  • Evolving Structure: Obsidian vault optimization according to project growth

Quick Start

Installation

# Install from PyPI using uv (recommended)
uv pip install claude-knowledge-catalyst

# Or using pip
pip install claude-knowledge-catalyst

# Or install from source for development
git clone https://github.com/drillan/claude-knowledge-catalyst.git
cd claude-knowledge-catalyst
uv sync --dev

Claude Code Project Integration

# Navigate to Claude Code project
cd your-claude-project

# Initialize CKC (detects .claude/ directory)
uv run ckc init

# Connect to Obsidian vault
uv run ckc add main-vault /path/to/your/obsidian/vault

# Experience automatic analysis of .claude/ content
echo "# Git Useful Commands

## Branch Status Check
\`\`\`bash
git branch -vv
git status --porcelain
\`\`\`" > .claude/git_tips.md

# Verify automated analysis and Obsidian metadata generation
uv run ckc classify .claude/git_tips.md --show-evidence

Existing Obsidian Vault Enhancement

# Enhance existing Obsidian vault for Claude Code integration
uv run ckc migrate --source /existing/obsidian --target /enhanced/vault

# Preview changes
uv run ckc migrate --source /existing/obsidian --target /enhanced/vault --dry-run

Available CLI Commands

๐Ÿš€ Automated Classification

# Automatic content analysis
uv run ckc classify file.md --show-evidence

# Batch classification
uv run ckc batch-classify .claude/

# Missing metadata detection
uv run ckc scan-missing-metadata

๐Ÿ“ Core Operations

# Zero-config initialization
uv run ckc init

# Vault connection
uv run ckc add vault-name /path/to/obsidian

# State-based synchronization
uv run ckc sync
uv run ckc sync --project "My Project"

# Real-time monitoring
uv run ckc watch

# System status
uv run ckc status

๐Ÿ“Š Advanced Analytics

# File analysis with evidence
uv run ckc analyze .claude/my-prompt.md

# Cross-dimensional search
uv run ckc search --tech python --status production
uv run ckc search --team frontend --complexity advanced

# Project insights
uv run ckc project stats my-project

Configuration

CKC uses a YAML configuration file with pure tag-centered settings:

version: "1.0"
project_name: "My AI Project"
auto_sync: true

# Tag-centered architecture
tag_system:
  enabled: true
  multi_dimensional: true
  auto_classification: true
  confidence_threshold: 0.75

# 7-dimensional tag schema
tags:
  type_tags: ["prompt", "code", "concept", "resource"]
  tech_tags: ["python", "javascript", "react", "fastapi"]
  domain_tags: ["web-dev", "machine-learning", "devops"]
  team_tags: ["backend", "frontend", "ml-research"]
  status_tags: ["draft", "tested", "production", "deprecated"]
  complexity_tags: ["beginner", "intermediate", "advanced"]
  confidence_tags: ["low", "medium", "high"]

# Obsidian integration
sync_targets:
  - name: "main-vault"
    type: "obsidian"
    path: "/Users/me/Documents/ObsidianVault"
    enabled: true
    enhance_metadata: true

# Automated features
automation:
  auto_classification: true
  evidence_tracking: true
  natural_language_search: true

# State-based workflow
workflow:
  inbox_pattern: "status:draft"
  active_pattern: "status:tested"
  knowledge_pattern: "status:production"
  archive_pattern: "status:deprecated"

# Security settings
watch:
  include_claude_md: false  # Enable with caution
  claude_md_sections_exclude:
    - "# secrets"
    - "# private"
    - "# api-keys"

Architecture

CKC implements a revolutionary pure tag-centered architecture:

  • Cognitive Load Zero: Eliminates category decision fatigue
  • 7-Dimensional Classification: Multi-layer tag system for precise organization
  • Automated Intelligence: Pattern-matching content understanding
  • State-Based Workflow: Organization by lifecycle, not content type
  • Dynamic Discovery: Cross-dimensional knowledge search
  • Obsidian Enhancement: Transform basic vaults โ†’ intelligent systems

Pure Tag-Centered vs Traditional

โŒ Traditional Category-Based Problems

โ”œโ”€โ”€ prompts/          # "Is this a prompt or template?"
โ”œโ”€โ”€ code/             # "Code snippet or tool?"
โ”œโ”€โ”€ concepts/         # "Concept or best practice?"
โ””โ”€โ”€ misc/             # Catch-all confusion

Issues:

  • Decision fatigue: Which category?
  • Rigid boundaries: Content doesn't fit neatly
  • Poor discoverability: Single-dimension search
  • Maintenance overhead: Moving files between categories

โœ… Pure Tag-Centered Solution

โ”œโ”€โ”€ _system/          # System files and templates
โ”œโ”€โ”€ inbox/            # Unprocessed items (workflow state)
โ”œโ”€โ”€ active/           # Currently working (activity state)
โ”œโ”€โ”€ archive/          # Deprecated/old (lifecycle state)
โ””โ”€โ”€ knowledge/        # Mature content (90% of files)
    โ””โ”€โ”€ Dynamic discovery through enhanced multi-layer tags

Benefits:

  • ๐Ÿง  Cognitive Load Reduction: No "which category?" decisions
  • ๐Ÿ” Multi-Dimensional Discovery: Search across tech, domain, team
  • ๐Ÿ“ˆ Scalable Organization: Tags evolve with your knowledge
  • โšก Flexible Workflow: State-based, not content-based organization
  • ๐Ÿ”— Rich Relationships: Multi-project, multi-domain connections

Documentation

Try the Revolution

Demo the cognitive transformation:

# Experience tag-centered migration
./demo/tag_centered_demo.sh

# Try automated classification
./demo/demo.sh

# Multi-team collaboration
./demo/multi_project_demo.sh

Requirements

  • Python Runtime: 3.11+ (managed automatically by uv)
  • Package Manager: uv (handles Python installation and dependency management)
  • Memory: Minimum 512MB, Recommended 2GB for large vaults
  • Storage: 10MB for CKC, varies based on vault size
  • OS: Windows 10+, macOS 11+, Linux (Ubuntu 20.04+)

Support & Community

License

This project is licensed under the MIT License - see the LICENSE file for details.

Contributing

We welcome contributions! Please see our Contributing Guide for details.


Welcome to the cognitive revolution! No more "which category?" decisions - experience pure, discoverable knowledge management.

Built with โค๏ธ by the Claude community

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

claude_knowledge_catalyst-0.10.1.tar.gz (258.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

claude_knowledge_catalyst-0.10.1-py3-none-any.whl (174.8 kB view details)

Uploaded Python 3

File details

Details for the file claude_knowledge_catalyst-0.10.1.tar.gz.

File metadata

File hashes

Hashes for claude_knowledge_catalyst-0.10.1.tar.gz
Algorithm Hash digest
SHA256 2a7295c90222e03674b0b3073f769ee82735c34c8a1f7d3216a81fc31eab85e9
MD5 8cbe6db7bab8f904fca5cf7d96dbbcfe
BLAKE2b-256 c1eb29ba8efdafd78594210716e1454bad99c0d766458a70d6d70453a6112237

See more details on using hashes here.

Provenance

The following attestation bundles were made for claude_knowledge_catalyst-0.10.1.tar.gz:

Publisher: publish.yml on drillan/claude-knowledge-catalyst

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file claude_knowledge_catalyst-0.10.1-py3-none-any.whl.

File metadata

File hashes

Hashes for claude_knowledge_catalyst-0.10.1-py3-none-any.whl
Algorithm Hash digest
SHA256 19803645441858bec02008d27c10a9b33cb0e3de8faa055cd5be2bb72f5cee7f
MD5 e02ac2579e80dc34522ac4b3619dcb16
BLAKE2b-256 35c20bdc116a0884d3ae9f19bd190cdc06930e6c3624acf9d17bd0ed2ad5b350

See more details on using hashes here.

Provenance

The following attestation bundles were made for claude_knowledge_catalyst-0.10.1-py3-none-any.whl:

Publisher: publish.yml on drillan/claude-knowledge-catalyst

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page