learn-lock

The app that argues with you. Adversarial Socratic learning with spaced repetition.

██╗     ███████╗ █████╗ ██████╗ ███╗   ██╗██╗      ██████╗  ██████╗██╗  ██╗
██║     ██╔════╝██╔══██╗██╔══██╗████╗  ██║██║     ██╔═══██╗██╔════╝██║ ██╔╝
██║     █████╗  ███████║██████╔╝██╔██╗ ██║██║     ██║   ██║██║     █████╔╝
██║     ██╔══╝  ██╔══██║██╔══██╗██║╚██╗██║██║     ██║   ██║██║     ██╔═██╗
███████╗███████╗██║  ██║██║  ██║██║ ╚████║███████╗╚██████╔╝╚██████╗██║  ██╗
╚══════╝╚══════╝╚═╝  ╚═╝╚═╝  ╚═╝╚═╝  ╚═══╝╚══════╝ ╚═════╝  ╚═════╝╚═╝  ╚═╝

The app that argues with you.

LearnLock is a CLI learning tool that uses adversarial Socratic dialogue to expose gaps in your understanding. It doesn't quiz you — it interrogates you.

Installation
Quick Start
How It Works
Architecture
The Duel Engine
Claim Pipeline
Module Reference
Database Schema
Configuration
CLI Commands
Known Limitations
Development
License

Installation

From PyPI

Install with pip: pip install learn-lock. Requires Python 3.11 or higher.

From Source

Clone the repository and install it in editable mode by running pip install -e . from the repository root.

Optional Dependencies

learnlock[ocr] — EasyOCR for handwritten answer support
learnlock[whisper] — Whisper fallback for YouTube videos without transcripts

Quick Start

1. Set your API keys as environment variables:
   - GROQ_API_KEY (required): free key at console.groq.com
   - GEMINI_API_KEY (recommended): free key at aistudio.google.com
2. Launch the CLI by running learnlock
3. Add content by pasting a YouTube URL, article link, PDF path, or GitHub repo
4. Start studying with /study
5. Press Enter twice to send your answer

How It Works

1. You explain a concept in your own words
2. The engine infers what you believe
3. It compares your belief against ground truth claims
4. It finds contradictions and attacks the weakest point
5. After 3 turns (or success), it reveals your belief trajectory
6. Your score feeds into SM-2 spaced repetition scheduling

Architecture

Tools (youtube, article, pdf, github)
    │
    ▼
LLM ──▶ extract concepts & claims
    │
    ▼
Duel Engine ──▶ belief modeling, contradiction detection, interrogation
    │
    ├──▶ Scheduler (SM-2) ──▶ Storage (SQLite)
    │
    └──▶ HUD ──▶ CLI (claims, belief, attack, reveal)

Data Flow

Source (YouTube/PDF/Article/GitHub)
    │
    ▼
Content Extraction (tools/)
    │
    ▼
Concept Extraction (llm.py) ──▶ 8-12 concepts with claims
    │
    ▼
Storage (storage.py) ──▶ SQLite: sources, concepts, explanations, progress, duel_memory
    │
    ▼
Scheduler (scheduler.py) ──▶ SM-2 spaced repetition
    │
    ▼
Duel Engine (duel.py) ──▶ Adversarial Socratic interrogation
    │
    ▼
HUD (hud.py) ──▶ Live visualization of engine state

The Duel Engine

The cognitive core of LearnLock. Located in duel.py.

Philosophy

Traditional learning apps ask: "Do you know X?"

LearnLock asks: "What do you believe about X, and where is it wrong?"

Pipeline

Belief Modeling — Infers what the user thinks from their response
Contradiction Detection — Compares belief against claims, finds violations
Interrogation — Generates attack question targeting highest-severity error
Snapshot — Records belief state for trajectory tracking

Behaviors

Vague answers trigger mechanism probes
Wrong answers trigger claim-specific attacks
"I don't know" triggers guiding questions (not punishment)
Correct answers pass after verification
Exhausting all 3 turns triggers the reveal with the full trajectory

Graded Harshness

Turn 1: Forgiving — only clear violations flagged
Turn 2: Moderate — violations plus omissions
Turn 3: Strict — all violations surfaced
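
One way to picture graded harshness is as a per-turn filter over detected errors. The sketch below is an assumption about how such a filter could look: DetectedError, its clear flag, and the OMISSIONS set are hypothetical and are not taken from duel.py.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DetectedError:
    kind: str    # e.g. "wrong_mechanism", "missing_mechanism", "superficial"
    clear: bool  # hypothetical flag: is the violation unambiguous?


# Assumption: "omissions" maps to the missing_mechanism error type.
OMISSIONS = {"missing_mechanism"}


def surfaced(errors: list[DetectedError], turn: int) -> list[DetectedError]:
    """Return the errors the engine would raise on a given turn (1-based)."""
    if turn <= 1:
        # Forgiving: only clear violations, omissions are ignored.
        return [e for e in errors if e.clear and e.kind not in OMISSIONS]
    if turn == 2:
        # Moderate: clear violations plus omissions.
        return [e for e in errors if e.clear or e.kind in OMISSIONS]
    # Strict: everything is surfaced.
    return list(errors)


errors = [
    DetectedError("wrong_mechanism", clear=True),
    DetectedError("missing_mechanism", clear=False),
    DetectedError("superficial", clear=False),
]
for turn in (1, 2, 3):
    print(turn, [e.kind for e in surfaced(errors, turn)])
```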

Error Types

wrong_mechanism — Incorrect explanation of how something works
missing_mechanism — Omitted critical mechanism
boundary_error — Wrong about limitations or scope
conflation — Confused two distinct concepts
superficial — Surface-level understanding without depth

Claim Pipeline

Claims are the epistemic foundation. The duel is only as fair as the claims.

Three-Pass Verification

Pass 1: Generation — LLM generates claims with explicit instructions to produce conceptual truths, not transcript parroting. Demands falsifiable statements about WHY and HOW, not just WHAT.

Pass 2: Garbage Filter — Pattern matching rejects stateful claims ("is running", "must remain active"), tautologies ("processes requests", "serves requests"), and vague claims ("is useful", "is important").

Pass 3: Sharpness Filter — Rejects blurry truths that are technically correct but unfalsifiable ("handles security", "manages data", "deals with").

Claim Types

definition — What the concept is
mechanism — How it works internally
requirement — What it needs to function
boundary — What it cannot do or where it fails

Good vs Bad Claims

Bad claims get rejected:

"The server processes requests" (tautology)
"It handles security" (blurry)
"Must be running to work" (stateful)

Good claims survive:

"Validates request payloads against a JSON schema"
"Enforces authentication via JWT token verification"
"Uses Python type hints for automatic request validation"

Module Reference

duel.py — The Engine

Core dataclasses: Claim, BeliefError, BeliefSnapshot, BeliefState

Main class DuelEngine provides:

process(user_input) — Process response, return attack or reveal
get_reveal() — Get final state with claims, errors, trajectory
get_claims() — Get parsed claims
finished — Boolean indicating duel completion

Helper functions:

create_duel() — Factory for DuelEngine
belief_to_score() — Convert final state to 1-5 score
export_duel_data() — Export for research/training
save_duel_data() — Persist to disk

hud.py — Visualization

set_gentle_mode() — Toggle between brutal and gentle UI
render_duel_state() — Render claims panel, belief panel, attack target
render_attack() — Render interrogation panel with question
render_reveal() — Render final verdict with trajectory and claim satisfaction

cli.py — Interface

Entry point main() launches the REPL.

Key commands routed through handle_input():

cmd_study() — Main duel session loop
cmd_add() — Add content from URL
cmd_stats() — Display progress statistics
cmd_list() — List all concepts
cmd_due() — Show due concepts

storage.py — Persistence

SQLite database with tables for sources, concepts, explanations, progress, and duel_memory.

Key functions:

add_source() / get_source() — Source CRUD
add_concept() / get_concept() — Concept CRUD
get_due_concepts() — Query due items
save_duel_memory() / get_duel_memory() — Persist last duel state per concept
update_progress() — Update SM-2 scheduling data

scheduler.py — SM-2 Spaced Repetition

Implements SM-2 algorithm for scheduling reviews.

update_after_review() — Update ease factor and interval after scoring
get_next_due() — Get single next due concept
get_all_due() — Get all due concepts
get_study_summary() — Aggregate statistics
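
For orientation, here is a sketch of a textbook-style SM-2 update using the defaults listed under Configuration. It is not the code in scheduler.py: the Progress class and sm2_update are hypothetical, the 1-5 score is assumed to be used directly as the SM-2 quality grade, and this variant adjusts the ease factor on every review.

```python
from dataclasses import dataclass

INITIAL_EASE = 2.5      # LEARNLOCK_SM2_INITIAL_EASE default
INITIAL_INTERVAL = 1.0  # LEARNLOCK_SM2_INITIAL_INTERVAL default
MIN_EASE = 1.3          # LEARNLOCK_SM2_MIN_EASE default
MAX_INTERVAL = 180.0    # LEARNLOCK_SM2_MAX_INTERVAL default


@dataclass(frozen=True)
class Progress:
    ease_factor: float = INITIAL_EASE
    interval_days: float = INITIAL_INTERVAL
    review_count: int = 0


def sm2_update(p: Progress, score: int) -> Progress:
    """Return the next scheduling state after a review scored 1-5."""
    if not 1 <= score <= 5:
        raise ValueError("score must be between 1 and 5")
    # Standard SM-2 ease adjustment, clamped at the minimum ease.
    miss = 5 - score
    ease = max(MIN_EASE, p.ease_factor + (0.1 - miss * (0.08 + miss * 0.02)))
    if score < 3:
        # Failed review: restart the repetition sequence.
        return Progress(ease, INITIAL_INTERVAL, 0)
    if p.review_count == 0:
        interval = INITIAL_INTERVAL
    elif p.review_count == 1:
        interval = 6.0  # second interval in classic SM-2
    else:
        interval = p.interval_days * ease
    return Progress(ease, min(interval, MAX_INTERVAL), p.review_count + 1)


state = Progress()
for score in (5, 5, 5):
    state = sm2_update(state, score)
    print(state)
```

Perfect scores grow the interval geometrically until the maximum interval caps it; a score below 3 sends the concept back to the initial interval.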

llm.py — LLM Interface

Dual-provider setup: Groq for extraction, Gemini for evaluation.

extract_concepts() — Extract concepts with claims from content
evaluate_explanation() — Score user explanation (legacy, replaced by duel)
generate_title() — Generate topic-based title from content

tools/ — Content Extraction

youtube.py

extract_youtube() — Get transcript with timestamps
find_timestamp_for_text() — Find timestamp for concept
extract_frame_at_timestamp() — Extract and describe frame with Gemini Vision
Whisper fallback for videos without transcripts

article.py

extract_article() — Extract text from web articles using trafilatura

pdf.py

extract_pdf() — Extract text from local or remote PDFs using pymupdf

github.py

extract_github() — Extract README from GitHub repositories

ocr.py — Image Input

extract_text_from_image() — OCR using EasyOCR or Tesseract
check_relevance() — Verify extracted text relates to concept

Database Schema

sources

Stores raw content from URLs. Fields: id, url, title, source_type, raw_content, segments (JSON for YouTube timestamps), created_at

concepts

Stores extracted concepts. Fields: id, source_id, name, source_quote (ground truth), question, skipped, created_at

explanations

Stores user responses and scores. Fields: id, concept_id, text, score, covered, missed, feedback, created_at

progress

SM-2 scheduling data. Fields: id, concept_id, ease_factor, interval_days, due_date, review_count, last_score

duel_memory

Persists last duel state for returning users. Fields: id, concept_id, last_belief, last_errors, last_attack, updated_at
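
The five tables could be created along the following lines. Column names follow the field lists above; the SQL types, foreign keys, uniqueness constraints, and defaults are assumptions made for this sketch and may differ from what storage.py actually creates.

```python
import sqlite3

# Assumed types and constraints; only the column names come from the README.
SCHEMA = """
CREATE TABLE IF NOT EXISTS sources (
    id INTEGER PRIMARY KEY,
    url TEXT NOT NULL,
    title TEXT,
    source_type TEXT,
    raw_content TEXT,
    segments TEXT,  -- JSON string holding YouTube timestamps
    created_at TEXT DEFAULT CURRENT_TIMESTAMP
);
CREATE TABLE IF NOT EXISTS concepts (
    id INTEGER PRIMARY KEY,
    source_id INTEGER REFERENCES sources(id),
    name TEXT NOT NULL,
    source_quote TEXT,
    question TEXT,
    skipped INTEGER DEFAULT 0,
    created_at TEXT DEFAULT CURRENT_TIMESTAMP
);
CREATE TABLE IF NOT EXISTS explanations (
    id INTEGER PRIMARY KEY,
    concept_id INTEGER REFERENCES concepts(id),
    text TEXT,
    score INTEGER,
    covered TEXT,
    missed TEXT,
    feedback TEXT,
    created_at TEXT DEFAULT CURRENT_TIMESTAMP
);
CREATE TABLE IF NOT EXISTS progress (
    id INTEGER PRIMARY KEY,
    concept_id INTEGER UNIQUE REFERENCES concepts(id),
    ease_factor REAL DEFAULT 2.5,
    interval_days REAL DEFAULT 1.0,
    due_date TEXT,
    review_count INTEGER DEFAULT 0,
    last_score INTEGER
);
CREATE TABLE IF NOT EXISTS duel_memory (
    id INTEGER PRIMARY KEY,
    concept_id INTEGER UNIQUE REFERENCES concepts(id),
    last_belief TEXT,
    last_errors TEXT,
    last_attack TEXT,
    updated_at TEXT DEFAULT CURRENT_TIMESTAMP
);
"""


def init_db(path: str = ":memory:") -> sqlite3.Connection:
    """Open a database and create the tables if they do not exist yet."""
    conn = sqlite3.connect(path)
    conn.executescript(SCHEMA)
    return conn


conn = init_db()
rows = conn.execute("SELECT name FROM sqlite_master WHERE type='table'")
print(sorted(row[0] for row in rows))
```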

Configuration

All settings are configurable via environment variables.

Paths

LEARNLOCK_DATA_DIR — Data directory (default: ~/.learnlock)

Models

LEARNLOCK_GROQ_MODEL — Groq model for extraction
LEARNLOCK_GEMINI_MODEL — Gemini model for evaluation and vision

SM-2 Parameters

LEARNLOCK_SM2_INITIAL_EASE — Starting ease factor (default: 2.5)
LEARNLOCK_SM2_INITIAL_INTERVAL — Starting interval in days (default: 1.0)
LEARNLOCK_SM2_MIN_EASE — Minimum ease factor (default: 1.3)
LEARNLOCK_SM2_MAX_INTERVAL — Maximum interval in days (default: 180)

Extraction

LEARNLOCK_MIN_CONCEPTS — Minimum concepts per source (default: 8)
LEARNLOCK_MAX_CONCEPTS — Maximum concepts per source (default: 12)
LEARNLOCK_CONTENT_MAX_CHARS — Max content length for processing (default: 8000)

CLI Commands

Command	Description
`/add <url>`	Add a YouTube video, article, PDF, or GitHub repo
`/study`	Start duel session
`/stats`	View progress statistics
`/list`	List all concepts
`/due`	Show concepts due for review
`/skip <name>`	Skip a concept
`/unskip <name>`	Restore skipped concept
`/config`	Show current configuration
`/help`	Show help
`/quit`	Exit

Flags

--gentle or -g — Gentle UI mode (minimal, supportive feedback)
--version or -v — Show version
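
Both flags fit the standard library's argparse. The sketch below shows one possible wiring. It is an assumption, not the parser used by cli.py, and the version string is a placeholder.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    parser = argparse.ArgumentParser(prog="learnlock")
    parser.add_argument(
        "-g", "--gentle",
        action="store_true",
        help="gentle UI mode (minimal, supportive feedback)",
    )
    parser.add_argument(
        "-v", "--version",
        action="version",            # prints the version and exits
        version="%(prog)s X.Y.Z",    # placeholder, not the real version
    )
    return parser


args = build_parser().parse_args(["--gentle"])
print(args.gentle)
```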

Known Limitations

1. Claim Quality (Epistemic Risk)

Claims are LLM-generated. Despite three-pass filtering, semantic drift can occur. A source saying "enforces authentication" might become "handles security" — technically related but unfalsifiable.

Mitigation: Pattern filters and sharpness checks reduce but don't eliminate this risk.

2. Hallucinated Errors (Moral Risk)

The contradiction detector can invent violations. A correct answer might be flagged as "missing_mechanism" due to LLM drift, causing unfair attacks.

Mitigation: Graded harshness (forgiving on turn 1) and claim-index validation (errors must reference real claims) reduce the risk, but false flags remain possible.
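
Claim-index validation can be as simple as dropping any reported error that does not point at an existing claim. The sketch below assumes errors arrive as dictionaries with a claim_index key; that shape, and the name validate_errors, are hypothetical.

```python
def validate_errors(errors: list[dict], claims: list[str]) -> list[dict]:
    """Keep only errors whose claim_index refers to a real claim."""
    valid = []
    for err in errors:
        idx = err.get("claim_index")
        # bool is a subclass of int in Python, so exclude it explicitly.
        if isinstance(idx, int) and not isinstance(idx, bool) and 0 <= idx < len(claims):
            valid.append(err)
    return valid


claims = ["Validates request payloads against a JSON schema"]
reported = [
    {"type": "missing_mechanism", "claim_index": 0},
    {"type": "wrong_mechanism", "claim_index": 4},   # no such claim
    {"type": "conflation"},                          # no index at all
]
print(validate_errors(reported, claims))
```

A check like this cannot tell whether an error is justified, only that it is anchored to a claim that exists.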

3. UI Density

The HUD displays claims, belief, attack target, and interrogation panel simultaneously. Powerful for power users, overwhelming for beginners.

Mitigation: --gentle flag provides minimal UI with supportive framing.

4. No Confidence Signals

Errors are binary. The engine cannot express "I might be wrong here."

Future: Multi-pass agreement, confidence scores, human-in-the-loop for high-stakes content.

Development

Setup

Clone the repository and install it in editable mode with the [dev] extra: pip install -e ".[dev]".

Testing

Run pytest from the project root.

Linting

Run ruff check on the src directory.

Building

Use python -m build to create distribution packages.

File Structure

src/learnlock/
├── __init__.py
├── cli.py          # CLI interface and command routing
├── config.py       # Environment-based configuration
├── duel.py         # Duel Engine (core logic)
├── hud.py          # Rich-based visualization
├── llm.py          # LLM interface (Groq/Gemini)
├── ocr.py          # Image text extraction
├── scheduler.py    # SM-2 spaced repetition
├── storage.py      # SQLite persistence
└── tools/
    ├── __init__.py
    ├── youtube.py  # YouTube extraction with timestamps
    ├── article.py  # Web article extraction
    ├── pdf.py      # PDF extraction
    └── github.py   # GitHub README extraction

License

MIT

Stop consuming. Start retaining.

LearnLock doesn't teach you.
It finds out what you don't know.

Project details

Maintainers

itsvoid

Release history

0.1.8 (Mar 17, 2026)
0.1.7 (Mar 15, 2026)
0.1.6 (Mar 11, 2026)
0.1.5 (Mar 8, 2026), this version
0.1.0 (Jan 20, 2026)

Download files

Source Distribution

learn_lock-0.1.5.tar.gz (41.4 kB)

Uploaded Mar 8, 2026 Source

Built Distribution

learn_lock-0.1.5-py3-none-any.whl (46.5 kB)

Uploaded Mar 8, 2026 Python 3

File details

Details for the file learn_lock-0.1.5.tar.gz.

File metadata

Download URL: learn_lock-0.1.5.tar.gz
Upload date: Mar 8, 2026
Size: 41.4 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for learn_lock-0.1.5.tar.gz
Algorithm	Hash digest
SHA256	`5698809872d5fc49b63e765bbb7cc1367914a0049edb722610b3bf93ad57dc4d`
MD5	`8b3ac737dfa7fd5ffd66547f50136d00`
BLAKE2b-256	`13e797956a8aa6024705f526ac211ea6caed5b2eda32a9c42727934f1e4ee422`

Provenance

The following attestation bundles were made for learn_lock-0.1.5.tar.gz:

Publisher: publish.yml on MitudruDutta/learnlock

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: learn_lock-0.1.5.tar.gz
- Subject digest: 5698809872d5fc49b63e765bbb7cc1367914a0049edb722610b3bf93ad57dc4d
- Sigstore transparency entry: 1059464449
- Sigstore integration time: Mar 8, 2026
Source repository:
- Permalink: MitudruDutta/learnlock@89b343a68190d2be2af3f88769e63b17889d5445
- Branch / Tag: refs/tags/v0.1.5
- Owner: https://github.com/MitudruDutta
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@89b343a68190d2be2af3f88769e63b17889d5445
- Trigger Event: release

File details

Details for the file learn_lock-0.1.5-py3-none-any.whl.

File metadata

Download URL: learn_lock-0.1.5-py3-none-any.whl
Upload date: Mar 8, 2026
Size: 46.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for learn_lock-0.1.5-py3-none-any.whl
Algorithm	Hash digest
SHA256	`5c2d934f2ad2608763a4d73b7aab63cb0496b29b04cea59032cbe2e82e81815f`
MD5	`129ac1e3f898f7382c4a685a05e3be74`
BLAKE2b-256	`c137d634e55ddcf75bdb404b2486b7827654d8d6180f5066f1b708d7eac92fa5`

Provenance

The following attestation bundles were made for learn_lock-0.1.5-py3-none-any.whl:

Publisher: publish.yml on MitudruDutta/learnlock

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: learn_lock-0.1.5-py3-none-any.whl
- Subject digest: 5c2d934f2ad2608763a4d73b7aab63cb0496b29b04cea59032cbe2e82e81815f
- Sigstore transparency entry: 1059464451
- Sigstore integration time: Mar 8, 2026
Source repository:
- Permalink: MitudruDutta/learnlock@89b343a68190d2be2af3f88769e63b17889d5445
- Branch / Tag: refs/tags/v0.1.5
- Owner: https://github.com/MitudruDutta
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@89b343a68190d2be2af3f88769e63b17889d5445
- Trigger Event: release
