Code2Logic - Source code to logical representation converter for LLM analysis, featuring Tree-sitter parsing, dependency graph analysis, and multi-language support.

Code2Logic

Convert source code to logical representation for LLM analysis.

Code2Logic analyzes codebases and generates compact, LLM-friendly representations with semantic understanding. Perfect for feeding project context to AI assistants, building code documentation, or analyzing code structure.

✨ Features

🌳 Multi-language support - Python, JavaScript, TypeScript, Java, Go, Rust, and more
🎯 Tree-sitter AST parsing - 99% accuracy with graceful fallback
📊 NetworkX dependency graphs - PageRank, hub detection, cycle analysis
🔍 Rapidfuzz similarity - Find duplicate and similar functions
🧠 NLP intent extraction - Human-readable function descriptions
📦 Zero dependencies - Core works without any external libs

🚀 Installation

Basic (no dependencies)

pip install code2logic

Full (all features)

pip install code2logic[full]

Selective features

pip install code2logic[treesitter]  # High-accuracy AST parsing
pip install code2logic[graph]       # Dependency analysis
pip install code2logic[similarity]  # Similar function detection
pip install code2logic[nlp]         # Enhanced intents

📖 Quick Start

# TOON compact (best token efficiency — 5.9x smaller than JSON)
code2logic ./ -f toon --compact --name project -o ./

# TOON with function-logic + structural context
code2logic ./ -f toon --compact --no-repeat-module \
  --function-logic function.toon --function-logic-context minimal --name project -o ./

# TOON-Hybrid (project structure + function details for hub modules)
code2logic ./ -f toon --hybrid --no-repeat-module --name project -o ./

# YAML compact (human-readable, good compromise)
code2logic ./ -f yaml --compact --name project -o ./

Command Line

# Standard Markdown output
code2logic /path/to/project

# If the `code2logic` entrypoint is not available (e.g. running from source without install):
python -m code2logic /path/to/project

# Compact YAML (14% smaller, meta.legend transparency)
code2logic /path/to/project -f yaml --compact -o analysis-compact.yaml

# Ultra-compact TOON (71% smaller, single-letter keys)
code2logic /path/to/project -f toon --ultra-compact -o analysis-ultra.toon


# Generate schema alongside output
code2logic /path/to/project -f yaml --compact --with-schema

# With detailed analysis
code2logic /path/to/project -d detailed

Python API

from code2logic import analyze_project, MarkdownGenerator

# Analyze a project
project = analyze_project("/path/to/project")

# Generate output
generator = MarkdownGenerator()
output = generator.generate(project, detail_level='standard')
print(output)

# Access analysis results
print(f"Files: {project.total_files}")
print(f"Lines: {project.total_lines}")
print(f"Languages: {project.languages}")

# Get hub modules (most important)
hubs = [p for p, n in project.dependency_metrics.items() if n.is_hub]
print(f"Key modules: {hubs}")

Organized Imports

# Core analysis
from code2logic import ProjectInfo, ProjectAnalyzer, analyze_project

# Format generators
from code2logic import (
    YAMLGenerator,
    JSONGenerator,
    TOONGenerator,
    LogicMLGenerator,
    GherkinGenerator,
)

# LLM clients
from code2logic import get_client, BaseLLMClient

# Development tools
from code2logic import run_benchmark, CodeReviewer

📋 Output Formats

Markdown (default)

Human-readable documentation with:

Project structure tree with hub markers (★)
Dependency graphs with PageRank scores
Classes with methods and intents
Functions with signatures and descriptions

Compact

Ultra-compact format optimized for LLM context:

# myproject | 102f 31875L | typescript:79/python:23
ENTRY: index.ts main.py
HUBS: evolution-manager llm-orchestrator

[core/evolution]
  evolution-manager.ts (3719L) C:EvolutionManager | F:createEvolutionManager
  task-queue.ts (139L) C:TaskQueue,Task
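The header line of the compact sample can be read back into structured data. The sketch below is not part of code2logic; the layout it assumes (`# <name> | <files>f <lines>L | <lang>:<count>/...`) is inferred only from the sample above, and `CompactHeader` and `parse_compact_header` are hypothetical names.

```python
import re
from dataclasses import dataclass
from typing import Dict

# Assumed layout, inferred from the sample header only:
#   "# <name> | <files>f <lines>L | <lang>:<count>/<lang>:<count>"
HEADER_RE = re.compile(
    r"^#\s*(?P<name>.+?)\s*\|\s*(?P<files>\d+)f\s+(?P<lines>\d+)L\s*\|\s*(?P<langs>.+)$"
)


@dataclass
class CompactHeader:
    name: str
    files: int
    lines: int
    languages: Dict[str, int]  # language -> number of files


def parse_compact_header(line: str) -> CompactHeader:
    """Parse the first line of a compact report into its fields."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"not a compact header: {line!r}")
    languages = {}
    for part in match["langs"].split("/"):
        lang, _, count = part.partition(":")
        languages[lang.strip()] = int(count)
    return CompactHeader(
        name=match["name"],
        files=int(match["files"]),
        lines=int(match["lines"]),
        languages=languages,
    )


header = parse_compact_header("# myproject | 102f 31875L | typescript:79/python:23")
print(header.name, header.files, header.lines, header.languages)
```

In the sample, the per-language counts (79 + 23) add up to the file count (102), which makes a cheap sanity check when consuming this format.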

JSON

Machine-readable format for:

RAG (Retrieval-Augmented Generation)
Database storage
Further analysis

🔧 Configuration

Library Status

Check which features are available:

code2logic --status

Library Status:
  tree_sitter: ✓
  networkx: ✓
  rapidfuzz: ✓
  nltk: ✗
  spacy: ✗

LLM Configuration

Manage LLM providers, models, API keys, and routing priorities:

code2logic llm status
code2logic llm set-provider auto
code2logic llm set-model openrouter nvidia/nemotron-3-nano-30b-a3b:free
code2logic llm key set openrouter <OPENROUTER_API_KEY>
code2logic llm priority set-provider openrouter 10
code2logic llm priority set-mode provider-first
code2logic llm priority set-llm-model nvidia/nemotron-3-nano-30b-a3b:free 5
code2logic llm priority set-llm-family nvidia/ 5
code2logic llm config list

Notes:

code2logic llm set-provider auto enables automatic fallback selection: providers are tried in priority order.
API keys should be stored in .env (or environment variables), not in litellm_config.yaml.
These commands write configuration files:
- .env in the current working directory
- litellm_config.yaml in the current working directory
- ~/.code2logic/llm_config.json in your home directory
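As an illustration of the note about keeping keys in .env or environment variables, here is a minimal standard-library sketch that resolves a key from either place. It is not code2logic's own loader: `read_dotenv` and `resolve_api_key` are hypothetical helpers, and letting the environment variable win over the .env file is an assumption.

```python
import os
from pathlib import Path
from typing import Dict, Optional


def read_dotenv(path: Path) -> Dict[str, str]:
    """Read simple KEY=VALUE lines; blank lines and # comments are skipped."""
    values: Dict[str, str] = {}
    if not path.is_file():
        return values
    for raw in path.read_text(encoding="utf-8").splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop optional surrounding quotes around the value.
        values[key.strip()] = value.strip().strip("'\"")
    return values


def resolve_api_key(
    name: str = "OPENROUTER_API_KEY",
    dotenv_path: Path = Path(".env"),
) -> Optional[str]:
    """Prefer the process environment, then fall back to the .env file."""
    return os.environ.get(name) or read_dotenv(dotenv_path).get(name)


key = resolve_api_key()
print("key configured" if key else "no key found")
```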

Priority modes

You can choose how automatic fallback ordering is computed:

provider-first - providers are ordered by provider priority (defaults + overrides)
model-first - providers are ordered by priority rules for the provider's configured model (exact/prefix)
mixed - providers are ordered by the best (lowest) priority from either provider priority or model rules

Configure the mode:

code2logic llm priority set-mode provider-first
code2logic llm priority set-mode model-first
code2logic llm priority set-mode mixed

Model priority rules are stored in ~/.code2logic/llm_config.json.
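The three modes can be sketched in a few lines. This is only an illustration of the ordering rules as described above, not code2logic's implementation: `order_providers`, `model_priority`, the `DEFAULT_PRIORITY` fallback, and the `ollama`/`llama3` entry with priority 3 are assumptions made for the example. Lower numbers are tried first.

```python
from typing import Dict, List, Optional

DEFAULT_PRIORITY = 100  # hypothetical fallback when no rule applies
MODES = ("provider-first", "model-first", "mixed")


def model_priority(model: str, rules: Dict[str, int]) -> Optional[int]:
    """Exact match wins; otherwise take the best (lowest) matching prefix rule."""
    if model in rules:
        return rules[model]
    matches = [prio for prefix, prio in rules.items() if model.startswith(prefix)]
    return min(matches) if matches else None


def order_providers(
    providers: Dict[str, str],         # provider name -> configured model
    provider_priority: Dict[str, int],
    model_rules: Dict[str, int],       # exact model names and prefixes
    mode: str,
) -> List[str]:
    if mode not in MODES:
        raise ValueError(f"unknown mode: {mode!r}")

    def key(name: str) -> int:
        by_provider = provider_priority.get(name, DEFAULT_PRIORITY)
        by_model = model_priority(providers[name], model_rules)
        if mode == "provider-first":
            return by_provider
        if mode == "model-first":
            return DEFAULT_PRIORITY if by_model is None else by_model
        # mixed: best (lowest) of the two
        return by_provider if by_model is None else min(by_provider, by_model)

    return sorted(providers, key=key)


providers = {
    "openrouter": "nvidia/nemotron-3-nano-30b-a3b:free",
    "ollama": "llama3",  # hypothetical model name
}
provider_priority = {"openrouter": 10, "ollama": 3}
model_rules = {"nvidia/nemotron-3-nano-30b-a3b:free": 5, "nvidia/": 5}

for mode in MODES:
    print(mode, order_providers(providers, provider_priority, model_rules, mode))
```

With these sample values, provider-first and mixed put ollama first, while model-first puts openrouter first because only its model matches a rule.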

Python API (Library Status)

from code2logic import get_library_status

status = get_library_status()
# {'tree_sitter': True, 'networkx': True, ...}
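A status dictionary of this shape can be produced with the standard library alone. The sketch below is not how `get_library_status` is implemented (the source does not say); it only shows one common way to probe optional dependencies without importing them. `probe_optional_modules` is a hypothetical name, and the module list is taken from the status output shown earlier.

```python
from importlib.util import find_spec
from typing import Dict, Iterable

# Import names taken from the "Library Status" output above.
OPTIONAL_MODULES = ("tree_sitter", "networkx", "rapidfuzz", "nltk", "spacy")


def probe_optional_modules(names: Iterable[str] = OPTIONAL_MODULES) -> Dict[str, bool]:
    """Report which top-level modules are importable, without importing them."""
    return {name: find_spec(name) is not None for name in names}


status = probe_optional_modules()
for name, available in status.items():
    print(f"  {name}: {'✓' if available else '✗'}")
```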

📊 Analysis Features

Dependency Analysis

PageRank - Identifies most important modules
Hub detection - Central modules marked with ★
Cycle detection - Find circular dependencies
Clustering - Group related modules
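To make the PageRank and cycle-detection ideas concrete, here is a dependency-free sketch. code2logic uses NetworkX for this; the pure-Python version below is a stand-in so the example runs without extras, and the import edges in `imports` are invented for illustration (only the module names come from the architecture listing).

```python
from typing import Dict, List, Optional

Graph = Dict[str, List[str]]  # module -> modules it imports


def pagerank(graph: Graph, damping: float = 0.85, iterations: int = 50) -> Dict[str, float]:
    """Power-iteration PageRank; heavily imported modules score highest."""
    nodes = set(graph) | {dep for deps in graph.values() for dep in deps}
    rank = {node: 1.0 / len(nodes) for node in nodes}
    for _ in range(iterations):
        new_rank = {node: (1.0 - damping) / len(nodes) for node in nodes}
        for node in nodes:
            # A module with no imports spreads its rank over all nodes.
            targets = graph.get(node) or list(nodes)
            share = damping * rank[node] / len(targets)
            for target in targets:
                new_rank[target] += share
        rank = new_rank
    return rank


def find_cycle(graph: Graph) -> Optional[List[str]]:
    """Return one circular import chain (first node repeated at the end), or None."""
    visiting, done, stack = set(), set(), []

    def visit(node: str) -> Optional[List[str]]:
        visiting.add(node)
        stack.append(node)
        for dep in graph.get(node, []):
            if dep in visiting:
                return stack[stack.index(dep):] + [dep]
            if dep not in done:
                found = visit(dep)
                if found:
                    return found
        stack.pop()
        visiting.discard(node)
        done.add(node)
        return None

    for node in graph:
        if node not in done:
            found = visit(node)
            if found:
                return found
    return None


# Hypothetical import edges between modules named in the architecture listing.
imports = {
    "cli": ["analyzer"],
    "analyzer": ["parsers", "models"],
    "parsers": ["models"],
    "models": [],
}
ranks = pagerank(imports)
print("top module:", max(ranks, key=ranks.get))
print("cycle:", find_cycle(imports))
```

In this toy graph the most-imported module ranks highest, which is the intuition behind treating high-PageRank modules as hubs.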

Intent Generation

Functions get human-readable descriptions:

methods:
  async findById(id:string) -> Promise<User>  # retrieves user by id
  async createUser(data:UserDTO) -> Promise<User>  # creates user
  validateEmail(email:string) -> boolean  # validates email
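A rough idea of how a description can be derived from a function name alone is sketched below. This is a simple heuristic written for illustration, not the NLP pipeline in code2logic: `split_identifier`, `describe`, and the `VERB_FORMS` table are hypothetical. It also cannot recover context such as the "user" in "retrieves user by id", which is not present in the name `findById`.

```python
import re
from typing import List

# Hypothetical verb table: leading word of the identifier -> description verb.
VERB_FORMS = {
    "find": "retrieves",
    "get": "retrieves",
    "create": "creates",
    "validate": "validates",
    "verify": "verifies",
    "delete": "deletes",
}


def split_identifier(name: str) -> List[str]:
    """Split camelCase, PascalCase, and snake_case names into lowercase words."""
    parts = re.findall(r"[A-Z]+(?![a-z])|[A-Z]?[a-z]+|\d+", name)
    return [part.lower() for part in parts]


def describe(name: str) -> str:
    """Build a short description from the words in a function name."""
    words = split_identifier(name)
    if not words:
        return ""
    verb = VERB_FORMS.get(words[0], words[0])
    return " ".join([verb] + words[1:])


for name in ("findById", "createUser", "validateEmail", "validate_token"):
    print(f"{name}  # {describe(name)}")
```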

Similarity Detection

Find duplicate and similar functions:

Similar Functions:
  core/auth.ts::validateToken:
    - python/auth.py::validate_token (92%)
    - services/jwt.ts::verifyToken (85%)
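The idea of scoring name similarity across languages can be tried with the standard library. code2logic uses Rapidfuzz; the sketch below swaps in `difflib.SequenceMatcher` so it runs without extras, which means its scores will not match the percentages shown above. `normalize`, `similarity`, and `find_similar` are hypothetical helpers, and the 50% threshold is arbitrary.

```python
import re
from difflib import SequenceMatcher
from typing import Iterable, List, Tuple


def normalize(name: str) -> str:
    """Lowercase and drop separators so camelCase and snake_case compare equal."""
    return re.sub(r"[^a-z0-9]", "", name.lower())


def similarity(a: str, b: str) -> float:
    """Similarity of two function names as a percentage (0 to 100)."""
    return 100.0 * SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def find_similar(
    target: str, candidates: Iterable[str], threshold: float = 50.0
) -> List[Tuple[str, float]]:
    """Return candidates at or above the threshold, best match first."""
    scored = [(name, similarity(target, name)) for name in candidates]
    return sorted(
        [item for item in scored if item[1] >= threshold],
        key=lambda item: item[1],
        reverse=True,
    )


matches = find_similar("validateToken", ["validate_token", "verifyToken", "createUser"])
for name, score in matches:
    print(f"  - {name} ({score:.0f}%)")
```

This compares names only; a real duplicate detector would likely also look at signatures or bodies.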

🏗️ Architecture

code2logic/
├── analyzer.py          # Main orchestrator
├── parsers.py           # Tree-sitter + fallback parser
├── dependency.py        # NetworkX dependency analysis
├── similarity.py        # Rapidfuzz similar detection
├── intent.py            # NLP intent generation
├── generators.py        # Output generators (MD/Compact/JSON/YAML/CSV)
├── toon_format.py       # TOON generator (compact, hybrid)
├── logicml.py           # LogicML generator (typed signatures)
├── function_logic.py    # Function-logic TOON with structural context
├── metrics.py           # AST-based quality metrics
├── models.py            # Data structures
├── cli.py               # Command-line interface
├── benchmarks/          # Benchmark runner, results, common utils
└── llm_clients.py       # Unified LLM client (OpenRouter/Ollama/LiteLLM)

🔌 Integration Examples

With Claude/ChatGPT

from code2logic import analyze_project, CompactGenerator

project = analyze_project("./my-project")
context = CompactGenerator().generate(project)

# Use in your LLM prompt
prompt = f"""
Analyze this codebase and suggest improvements:

{context}
"""

With RAG Systems

import json
from code2logic import analyze_project, JSONGenerator

project = analyze_project("./my-project")
data = json.loads(JSONGenerator().generate(project))

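# Index in vector DB
# NOTE: embed_and_store is not defined in this snippet; supply your own
# function that embeds the text and writes it to your vector store.
for module in data['modules']:
    for func in module['functions']:
        embed_and_store(
            text=f"{func['name']}: {func['intent']}",
            metadata={'path': module['path'], 'type': 'function'}
        )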
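The indexing loop can be tried without a vector database by first flattening the JSON into plain records. The sketch below assumes only the keys used in the snippet above (`modules`, `functions`, `name`, `intent`, `path`); the sample data and the helper name `iter_function_documents` are made up for illustration.

```python
from typing import Any, Dict, Iterator


def iter_function_documents(data: Dict[str, Any]) -> Iterator[Dict[str, Any]]:
    """Yield one {text, metadata} record per function in the analysis data."""
    for module in data.get("modules", []):
        for func in module.get("functions", []):
            yield {
                "text": f"{func['name']}: {func.get('intent', '')}",
                "metadata": {"path": module["path"], "type": "function"},
            }


# Hypothetical sample shaped like the structure used in the snippet above.
sample = {
    "modules": [
        {
            "path": "core/auth.ts",
            "functions": [{"name": "validateToken", "intent": "validates token"}],
        },
        {"path": "core/empty.ts", "functions": []},
    ]
}

documents = list(iter_function_documents(sample))
print(documents)
```

Each record can then be passed to whatever embedding and storage function your RAG stack provides.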

🧪 Development

Setup

git clone https://github.com/wronai/code2logic
cd code2logic
poetry install --with dev -E full
poetry run pre-commit install

# Alternatively, you can use Makefile targets (prefer Poetry if available)
make install-full

Tests

make test
make test-cov

# Or directly:
poetry run pytest
poetry run pytest --cov=code2logic --cov-report=html

Type Checking

make typecheck

# Or directly:
poetry run mypy code2logic

Linting

make lint
make format

# Or directly:
poetry run ruff check code2logic
poetry run black code2logic

📈 Performance

Codebase Size	Files	Lines	Time	Output Size
Small	10	1K	<1s	~5KB
Medium	100	30K	~2s	~50KB
Large	500	150K	~10s	~200KB

Compact format is ~10-15x smaller than Markdown.

🔬 Code Reproduction Benchmarks

Benchmark results (20 files, model: arcee-ai/trinity-large-preview, 2026-02-25):

Project Benchmark — Format Comparison

Format	Score	Syntax OK	Runs OK	~Tokens	Efficiency (p/kT)
toon	63.8%	100%	60%	17,875	3.57
json	62.9%	100%	60%	104,914	0.60
markdown	62.5%	100%	55%	36,851	1.70
yaml	62.4%	100%	55%	68,651	0.91
logicml	60.4%	100%	55%	~30,000	~2.01
csv	53.0%	100%	40%	80,779	0.66
function.toon	49.3%	95%	35%	29,271	1.68
gherkin	38.6%	95%	30%	~25,000	~1.54

Behavioral benchmark: 85.7% (6/7 functions passed).

Key Findings

TOON wins on efficiency - best score (63.8%) at 5.9x fewer tokens than JSON
Syntax OK = 100% for the six top-scoring formats; function.toon and gherkin reached 95%
function.toon paradox - scores worse than project.toon despite the larger file, due to missing class/module context (fixed in v1.0.43 with --function-logic-context)
gherkin/csv - poor fit for code description; their structure doesn't map to programming constructs
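The efficiency column can be reproduced from the other two columns. The sketch below assumes that "p/kT" means score points per 1,000 tokens, which is consistent with the published numbers; the rows whose token counts are marked approximate (logicml, gherkin) are left out.

```python
# (score in %, approximate tokens) copied from the benchmark table.
RESULTS = {
    "toon": (63.8, 17875),
    "json": (62.9, 104914),
    "markdown": (62.5, 36851),
    "yaml": (62.4, 68651),
    "csv": (53.0, 80779),
    "function.toon": (49.3, 29271),
}


def efficiency(score_pct: float, tokens: int) -> float:
    """Score points per 1,000 tokens (assumed meaning of p/kT)."""
    return score_pct / (tokens / 1000)


for name, (score, tokens) in RESULTS.items():
    print(f"{name:14s} {efficiency(score, tokens):.2f}")

# Token ratio behind the "5.9x" claim.
json_vs_toon = RESULTS["json"][1] / RESULTS["toon"][1]
print(f"JSON uses {json_vs_toon:.1f}x the tokens of TOON")
```

For example, TOON scores 63.8 points on 17.875 thousand tokens, giving 63.8 / 17.875 ≈ 3.57.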

Run Benchmarks

make benchmark          # Full benchmark suite (requires OPENROUTER_API_KEY)

# Or individually:
python examples/15_unified_benchmark.py --type format --folder tests/samples/ --limit 20
python examples/15_unified_benchmark.py --type project --folder tests/samples/ --limit 20
python examples/15_unified_benchmark.py --type function --file tests/samples/sample_functions.py

🤝 Contributing

Contributions welcome! Please read our Contributing Guide.

📄 License

Apache License 2.0 - see LICENSE for details.

🔄 Companion Packages

logic2test - Generate Tests from Logic

Generate test scaffolds from Code2Logic output:

# Show what can be generated
python -m logic2test out/code2logic/project.c2l.yaml --summary

# Generate unit tests
python -m logic2test out/code2logic/project.c2l.yaml -o out/logic2test/tests/

# Generate all test types (unit, integration, property)
python -m logic2test out/code2logic/project.c2l.yaml -o out/logic2test/tests/ --type all

from logic2test import TestGenerator

generator = TestGenerator('out/code2logic/project.c2l.yaml')
result = generator.generate_unit_tests('out/logic2test/tests/')
print(f"Generated {result.tests_generated} tests")

logic2code - Generate Code from Logic

Generate source code from Code2Logic output:

# Show what can be generated
python -m logic2code out/code2logic/project.c2l.yaml --summary

# Generate Python code
python -m logic2code out/code2logic/project.c2l.yaml -o out/logic2code/generated_code/

# Generate stubs only
python -m logic2code out/code2logic/project.c2l.yaml -o out/logic2code/generated_code/ --stubs-only

from logic2code import CodeGenerator

generator = CodeGenerator('out/code2logic/project.c2l.yaml')
result = generator.generate('out/logic2code/generated_code/')
print(f"Generated {result.files_generated} files")

Full Workflow: Code → Logic → Tests/Code

# 1. Analyze existing codebase
code2logic src/ -f yaml -o out/code2logic/project.c2l.yaml

# 2. Generate tests for the codebase
python -m logic2test out/code2logic/project.c2l.yaml -o out/logic2test/tests/ --type all

# 3. Generate code scaffolds (for refactoring)
python -m logic2code out/code2logic/project.c2l.yaml -o out/logic2code/generated_code/ --stubs-only

📚 Documentation

00 - Docs Index - Documentation home (start here)
01 - Getting Started - Install and first steps
02 - Configuration - API keys, environment setup
03 - CLI Reference - Command-line usage
04 - Python API - Programmatic usage
05 - Output Formats - Format comparison and usage
06 - Format Specifications - Detailed format specs
07 - TOON Format - Token-Oriented Object Notation
08 - LLM Integration - OpenRouter/Ollama/LiteLLM
09 - LLM Comparison - Provider/model comparison
10 - Benchmarking - Benchmark methodology and results
11 - Repeatability - Repeatability testing
12 - Examples - Usage workflows and examples
13 - Architecture - System design and components
14 - Format Analysis - Deeper format evaluation
15 - Logic2Test - Test generation from logic files
16 - Logic2Code - Code generation from logic files
17 - LOLM - LLM provider management
18 - Reproduction Testing - Format validation and code regeneration
19 - Monorepo Workflow - Managing all packages from repo root

🧩 Examples

examples/ - All runnable examples
examples/run_examples.sh - Example runner script (multi-command workflows)
examples/code2logic/ - Minimal project + docker example for code2logic
examples/logic2test/ - Minimal project + docker example for logic2test
examples/logic2code/ - Minimal project + docker example for logic2code

🔗 Links

Author

Created by Tom Sapletta - tom@sapletta.com
