LLM Engineering Navigation System — discovers recurring solution patterns from LLM research papers

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

These details have not been verified by PyPI

Project description

LENS — LLM Engineering Navigation System

Automatically discovers recurring solution patterns, contradiction resolutions, architecture innovations, and agentic design patterns from LLM research papers (arxiv).

Inspired by TRIZ methodology — but with richer knowledge structures, fully automated discovery, and continuous learning from the evolving LLM research landscape.

Core Knowledge Structures

Contradiction Matrix — Maps LLM tradeoffs (e.g., accuracy vs. latency) to resolution techniques (e.g., distillation, speculative decoding). Uses a canonical vocabulary of parameters and principles, extensible via LLM-proposed new concepts.
Architecture Catalog — Organizes LLM architecture components (attention, positional encoding, FFN, etc.) by slot with property-based comparison across variants. Answers "what are my options for component X?"
Agentic Pattern Catalog — Catalogs recurring patterns for building LLM-based agents (ReAct, Reflexion, multi-agent debate, etc.) with emergent categories discovered from data.

Status

Core pipeline implemented. All three knowledge structures are functional: contradiction matrix, architecture catalog (property-based comparison), and agentic pattern catalog (emergent categories). Full monitor pipeline: acquire → enrich → extract → build → ideate.

See docs/architecture.md for the full architecture doc.

Quick Start

# Install dependencies
uv sync

# Install pre-commit hooks (uses prek, a fast Rust-based pre-commit runner)
uv tool install prek
prek install

# Initialize the database and config
uv run lens init

# Acquire seed papers (10 landmark LLM papers)
uv run lens acquire seed

# Initialize canonical vocabulary (12 parameters + 12 principles)
uv run lens vocab init

# Extract tradeoffs, architecture, and agentic patterns from papers
uv run lens extract

# Build taxonomy and contradiction matrix
uv run lens build all

Usage

# Analyze a tradeoff — suggests resolution techniques from the matrix
uv run lens analyze "reduce hallucination without hurting latency"

# Analyze architecture — find matching variants by property
uv run lens analyze --type architecture "efficient attention for long context"

# Analyze agentic — find matching patterns
uv run lens analyze --type agentic "reliable multi-step code generation"

# Explain any LLM concept with adaptive depth
uv run lens explain "grouped-query attention"
uv run lens explain "knowledge distillation" --tradeoffs
uv run lens explain "MoE" --related

# Search papers
uv run lens search "attention mechanisms"          # hybrid keyword + semantic
uv run lens search --author "Vaswani"              # filter by author
uv run lens search "efficiency" --after 2024-01-01 # combine search + filters
uv run lens search --venue "NeurIPS" --limit 5     # filter by venue

# Browse the knowledge base
uv run lens vocab list                      # list vocabulary (parameters + principles)
uv run lens vocab list --kind parameter     # filter by kind
uv run lens vocab show inference-latency    # details for a concept
uv run lens explore matrix
uv run lens explore paper 2401.12345

# Browse architecture catalog
uv run lens explore architecture            # list all slots with variant counts
uv run lens explore architecture Attention  # compare variants with properties
uv run lens explore evolution Attention     # timeline view by paper date

# Browse agentic patterns
uv run lens explore agents                  # list patterns by category
uv run lens explore agents Reasoning        # filter by category

# Acquire more papers from arxiv
uv run lens acquire arxiv --query "LLM" --since 2025-01
uv run lens acquire file paper.pdf          # ingest a local PDF

# Acquire via DeepXiv (requires: uv sync --extra deepxiv)
uv run lens acquire deepxiv "LLM agent architecture" --max-results 10
uv run lens acquire deepxiv --paper 2507.01701  # single paper with rich metadata

# Fetch SPECTER2 embeddings from Semantic Scholar
uv run lens acquire semantic                    # all papers missing embeddings
uv run lens acquire semantic --paper-id 2401.12345  # specific paper

# Knowledge base overview
uv run lens status                          # paper counts, vocab, matrix, issues

# Run a monitoring cycle (acquire → enrich → extract → build → ideate)
uv run lens monitor
uv run lens monitor --skip-enrich           # skip OpenAlex enrichment
uv run lens monitor --skip-build            # skip taxonomy/matrix rebuild
uv run lens monitor --trending              # show ideation gaps

# Browse research opportunities
uv run lens explore ideas
uv run lens explore ideas --type sparse_cell

# Health-check the knowledge base
uv run lens lint                               # report issues across 6 categories
uv run lens lint --fix                         # auto-fix safe issues
uv run lens lint --check orphans,stale         # run specific checks only

# View the event log (audit trail of all mutations)
uv run lens log                                # last 20 events
uv run lens log --kind extract                 # filter by event kind
uv run lens log --since 2026-04-01 --limit 50  # date range + limit

# Configuration
uv run lens config show
uv run lens config set llm.default_model openrouter/anthropic/claude-sonnet-4-6

# Verbose logging (-v=INFO, -vv=DEBUG)
uv run lens -v extract
uv run lens -vv monitor

LLM Backend

LENS needs an LLM for extraction, taxonomy labeling, and analysis. For production deployment, see docs/deployment.md. Two options:

Gateway mode (recommended for production) — Point to any OpenAI-compatible endpoint (litellm gateway, vLLM, Ollama). No litellm dependency needed. Keeps API keys out of application pods.

# ~/.lens/config.yaml
llm:
  api_base: "http://litellm-gateway:4000/v1"
  api_key: "your-gateway-key"
  default_model: "gpt-4"

Direct mode — Install litellm for multi-provider routing (OpenRouter, OpenAI, Anthropic, etc.):

uv add lens[litellm]

Embeddings

Two embedding providers, configurable via ~/.lens/config.yaml:

Local (default) — sentence-transformers (SPECTER2 / MiniLM fallback). Free, works offline, but requires ~400MB model download on first use.

Cloud — Any embedding API via litellm or OpenAI-compatible endpoint. Fast, scalable, no local model needed.

# Switch to cloud embeddings
uv run lens config set embeddings.provider cloud
uv run lens config set embeddings.model text-embedding-3-small

Architecture

Python 3.12+ with uv package manager
SQLite + sqlite-vec — embedded database with vector search (cosine distance)
openai SDK — LLM and embedding client (works with any OpenAI-compatible endpoint)
litellm (optional) — multi-provider routing for direct API access
deepxiv-sdk (optional) — agent-optimized paper search with hybrid retrieval and progressive reading
Guided extraction — canonical vocabulary for all extraction types (tradeoffs, architecture, agentic)
sentence-transformers or cloud embeddings — configurable provider
Typer — CLI framework

Data flows through four layers:

Layer 0: Papers (arxiv, DeepXiv, PDF, OpenAlex enrichment)
    ↓
Layer 1: Raw Extractions (LLM-extracted tradeoffs, architecture, agentic patterns)
    ↓
Layer 2: Taxonomy (unified vocabulary: parameters, principles, arch slots, agentic categories)
    ↓
Layer 3: Knowledge Structures (contradiction matrix, ideation gaps)

Public API is synchronous; async internals are wrapped with asyncio.run().

Testing

uv run pytest                          # run all tests
uv run pytest tests/test_store.py -v   # run specific test file
uv run pytest -m "not integration"     # skip live API tests

Tests use tmp_path fixtures for isolated SQLite instances. No mocking — real embedded database instances are used in all tests.

License

MIT

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

flyerwolf

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.10.1

Apr 24, 2026

0.10.0

Apr 21, 2026

0.9.1

Apr 21, 2026

This version

0.9.0

Apr 12, 2026

0.8.0

Apr 12, 2026

0.7.0

Apr 8, 2026

0.6.0

Apr 5, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

lens_research-0.9.0.tar.gz (295.0 kB view details)

Uploaded Apr 12, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

lens_research-0.9.0-py3-none-any.whl (114.1 kB view details)

Uploaded Apr 12, 2026 Python 3

File details

Details for the file lens_research-0.9.0.tar.gz.

File metadata

Download URL: lens_research-0.9.0.tar.gz
Upload date: Apr 12, 2026
Size: 295.0 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for lens_research-0.9.0.tar.gz
Algorithm	Hash digest
SHA256	`7bf0eec6e8e0a216c8abb388afa44f295aef7deedd4c117cb482095c5ff16312`
MD5	`793a573e5ff9f1301c9e21209abbd443`
BLAKE2b-256	`27ccacd267dcbe78dfc00de734c4d1af0ac842c08bf15bc2372286839d5cdde6`

See more details on using hashes here.

Provenance

The following attestation bundles were made for lens_research-0.9.0.tar.gz:

Publisher: ci-and-publish.yml on flyersworder/lens

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: lens_research-0.9.0.tar.gz
- Subject digest: 7bf0eec6e8e0a216c8abb388afa44f295aef7deedd4c117cb482095c5ff16312
- Sigstore transparency entry: 1280904092
- Sigstore integration time: Apr 12, 2026
Source repository:
- Permalink: flyersworder/lens@6776d1df2ea783ef520f40dd6813d1f3b5fd29aa
- Branch / Tag: refs/tags/v0.9.0
- Owner: https://github.com/flyersworder
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: ci-and-publish.yml@6776d1df2ea783ef520f40dd6813d1f3b5fd29aa
- Trigger Event: release

File details

Details for the file lens_research-0.9.0-py3-none-any.whl.

File metadata

Download URL: lens_research-0.9.0-py3-none-any.whl
Upload date: Apr 12, 2026
Size: 114.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for lens_research-0.9.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`b53e7bdaa8840e210add77518522f0e88c6e41b8ddfd1747133571e83c5ccbac`
MD5	`be114bab4d6d9c12fd48363e49e740b2`
BLAKE2b-256	`02dbf1d32c262f8ef6f37543fd786280fa5cb02d5e63410cd19fa9ebcbc23620`

See more details on using hashes here.

Provenance

The following attestation bundles were made for lens_research-0.9.0-py3-none-any.whl:

Publisher: ci-and-publish.yml on flyersworder/lens

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: lens_research-0.9.0-py3-none-any.whl
- Subject digest: b53e7bdaa8840e210add77518522f0e88c6e41b8ddfd1747133571e83c5ccbac
- Sigstore transparency entry: 1280904105
- Sigstore integration time: Apr 12, 2026
Source repository:
- Permalink: flyersworder/lens@6776d1df2ea783ef520f40dd6813d1f3b5fd29aa
- Branch / Tag: refs/tags/v0.9.0
- Owner: https://github.com/flyersworder
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: ci-and-publish.yml@6776d1df2ea783ef520f40dd6813d1f3b5fd29aa
- Trigger Event: release

lens-research 0.9.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

LENS — LLM Engineering Navigation System

Core Knowledge Structures

Status

Quick Start

Usage

LLM Backend

Embeddings

Architecture

Testing

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance