Self-learning AI agent that gets smarter over time. Routes between local and cloud models by confidence; learns from every escalation.

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

buffalotechrider

These details have not been verified by PyPI

Project description

Autodidact

A self-evolving AI agent that learns like a new employee.

Autodidact is an AI agent with a local brain that works like a human, or a new employee. When asked a question or given a task, it thinks first and evaluates whether it can handle the task or answer the question by itself (local brain). If yes, it executes. If not, it escalates — by searching Google or asking a more powerful cloud model — just like how humans work. After the escalation, it learns the new knowledge, skills, or tool usages so next time it won't have to ask similar questions again.

On day one, it asks a lot of questions. By week two, it handles most tasks independently. By month three, it's the expert. Every cloud escalation becomes permanent local knowledge. Every interaction makes it smarter. It never forgets what it learned.

Query → Think  (check memory)
      → Try    (local model answers if confident)
      → Ask    (escalate to cloud when uncertain)
      → Learn  (store the answer for next time)
      ──────────────────────────────────────────
      Next similar query → Answer from memory, $0.00

Three-command quickstart

pip install autodidact             # or: pip install "autodidact[openai,bedrock,pdf]"
autodidact init                    # zero-friction setup: auto-detects Ollama, pulls models, configures cloud
autodidact chat                    # start talking to the agent

That's it. autodidact init walks you through three setup modes:

Local + Cloud (default) — Ollama local model + cloud API for escalation. Best cost savings.
Cloud + Cloud — cheap cloud model (gpt-4o-mini, DeepSeek, etc.) + expensive cloud model. No GPU or Ollama required.
Local only — Ollama only. Free. No escalation learning.

If Ollama isn't installed, the wizard shows the install command for your platform. If your model isn't pulled, it pulls it automatically. You're in chat within three minutes of pip install, regardless of starting point.

How it works — the human analogy

When you encounter a question, you go through this sequence:

Do I know the answer? → Check your memory
Am I confident I can answer it? → Self-assess
If yes → Answer (free, fast)
If no → Ask someone smarter (costs time, but you get the right answer)
Remember what you learned → Store it
Next time, start from step 1 → You're smarter now

A new employee does this every day. The more tasks they do, the more knowledgeable they become, the fewer questions they ask. Eventually, they're the person others come to.

Autodidact makes AI work the same way.

The visible thought process

Every response shows the agent's reasoning, in real time:

You> What's our PTO policy?
[THINKING] Checking memory... no relevant entries yet (0 hits)
[CLOUD] Escalated — gpt-4o-mini took 1.2s
[LEARNED] ✅ Stored: "Company PTO is 20 days per year, accrued monthly."
  💰 $0.003 | Confidence: 0.34 → escalated | ✅ Learned

You> How much vacation do I get?
[THINKING] Checking memory... found 1 similar entry (similarity: 0.91)
[MEMORY] Company PTO is 20 days per year, accrued monthly.
  ↳ Recalled from: "What's our PTO policy?" (learned 0 days ago)
  💰 $0.00 | Confidence: 0.91 | Route: memory

That's the "magic moment" — the user watching the agent answer from learned knowledge, for free, because it remembered a question it was asked moments ago.

Solving the cold start

A brand-new agent has an empty brain. autodidact learn seeds it with existing knowledge:

autodidact learn ~/docs/policies/     # ingest a folder of docs
autodidact learn ./README.md          # ingest a single file
autodidact learn --stats              # show what's been ingested

Supports .md, .txt, .py, .ts, .yaml, .json, .csv, .html, and 15+ other text formats. PDFs via pip install "autodidact[pdf]". Chunks are stored separately from learned Q&A (one is reference material, the other is experience), but both get retrieved and injected into the prompt at query time.

What's in v1.0

Zero-friction setup wizard. Auto-detects Ollama, pulls models, presets for OpenAI / OpenRouter / DeepSeek / Bedrock.
Three setup modes. Local+Cloud (default), Cloud+Cloud (no GPU), Local-only (free).
Confidence-based routing. logprob_uncertainty decides when to escalate (validated AUROC 0.65-0.83 across 3 model families × 2 datasets).
Learning from escalations. Structured knowledge extraction from cloud responses, deduplication on insert, staleness-aware re-verification.
Cold-start fix. autodidact learn <path> ingests docs so the agent has knowledge from day one.
Visible learning UX. [THINKING], [MEMORY], [LOCAL], [CLOUD], [LEARNED] tags show what the agent is doing and why.
Cost tracking. autodidact savings reports cumulative cost avoided vs an all-cloud baseline.
Local-first. All state in one portable SQLite file (~/.autodidact/memory.db). Works offline after setup.
Multi-provider. Ollama local. OpenAI-compatible cloud (OpenRouter, DeepSeek, Together, Anthropic proxies). AWS Bedrock via optional [bedrock] extra.

Commands

autodidact init             Zero-friction setup wizard
autodidact chat             Interactive chat with visible thought process
autodidact query "q"        Single-query mode
autodidact learn <path>     Ingest documents (cold-start fix)
autodidact savings          Cumulative cost savings
autodidact memory stats     Knowledge store size + breakdown
autodidact memory search    Search what the agent has learned

What's NOT in v1.0

No multi-turn conversation sessions across restarts (in-session history only)
No skill extraction or skill store (only fact extraction)
No autonomous tool discovery
No self-verification cycle (stale entries are flagged but not proactively re-checked)
No MCP server (coming in v1.1)
No TypeScript SDK

All of these are in the roadmap. See ROADMAP.md and .kiro/specs/autodidact-full/requirements.md for what ships when.

What we have verified empirically:

logprob_uncertainty is the dominant routing signal (AUROC 0.65-0.83 across 3 model families × 2 datasets).
Zero-shot inference-time signals match supervised routing baselines (RouteLLM) at zero per-model training cost.
Naive multi-signal fusion hurts — the best single signal beats the mean of all 6 signals.
Signal quality correlates with RLHF calibration training across model families (Qwen > Llama).

Full write-up: paper/blog-post.md. Research findings have their own home at zero-shot-llm-confidence.

Roadmap

Phase	What	Status
v1.0	Zero-friction self-learning agent	Shipping now
v1.1	Skill extraction, self-verification, MCP server	Planned
v2.0	Hive — agents teaching each other	Planned
v3.0	Hierarchical agent networks	Vision
Phase 4	LoRA consolidation (episodic → parametric)	Research

See ROADMAP.md for the timeline.

Tech stack

Python 3.10+
SQLite (WAL mode) — all state in one portable file
FAISS — vector retrieval
Pydantic v2 — validation
Typer + Rich — CLI
Ollama / OpenAI-compatible / AWS Bedrock — LLM backends

Contributing

See CONTRIBUTING.md.

Good first issues after launch:

Multi-turn session persistence across restarts
Skill extraction (extract procedures from cloud, not just facts)
MCP server for Claude Desktop / Cursor / Gemini CLI
Self-verification cycle
Additional benchmarks (TriviaQA, LongMemEval)
TypeScript SDK

License

MIT — see LICENSE.

Built by BuffaloTechRider. Repository: BuffaloTechRider/Autodidact.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

buffalotechrider

These details have not been verified by PyPI

Release history Release notifications | RSS feed

1.0.7

May 22, 2026

1.0.6

May 19, 2026

This version

1.0.2

May 15, 2026

1.0.1

May 14, 2026

1.0.0

May 9, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

autodidact-1.0.2.tar.gz (154.7 kB view details)

Uploaded May 15, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

autodidact-1.0.2-py3-none-any.whl (85.5 kB view details)

Uploaded May 15, 2026 Python 3

File details

Details for the file autodidact-1.0.2.tar.gz.

File metadata

Download URL: autodidact-1.0.2.tar.gz
Upload date: May 15, 2026
Size: 154.7 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for autodidact-1.0.2.tar.gz
Algorithm	Hash digest
SHA256	`c9d84871ddbcaa116d0b28b60768502fcb40238bdc8c6147380ebbe97538c3b4`
MD5	`9e522f2b0c14b0d8519c381a332a2fe8`
BLAKE2b-256	`18f71c16466f4402aef7406cf0c31e6e7dbd83b868964b31d29f1f1db67934cb`

See more details on using hashes here.

Provenance

The following attestation bundles were made for autodidact-1.0.2.tar.gz:

Publisher: release.yml on BuffaloTechRider/Autodidact

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: autodidact-1.0.2.tar.gz
- Subject digest: c9d84871ddbcaa116d0b28b60768502fcb40238bdc8c6147380ebbe97538c3b4
- Sigstore transparency entry: 1550573977
- Sigstore integration time: May 15, 2026
Source repository:
- Permalink: BuffaloTechRider/Autodidact@9bb6ef6cace0c3e4609c6b4dfda7ebcc1a2883e8
- Branch / Tag: refs/tags/v1.0.2
- Owner: https://github.com/BuffaloTechRider
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@9bb6ef6cace0c3e4609c6b4dfda7ebcc1a2883e8
- Trigger Event: push

File details

Details for the file autodidact-1.0.2-py3-none-any.whl.

File metadata

Download URL: autodidact-1.0.2-py3-none-any.whl
Upload date: May 15, 2026
Size: 85.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for autodidact-1.0.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`ad15e9ffc6313366d230e6cd38e2d4f77095f49c3754cd96480c7360eeafb1d6`
MD5	`1e491565581bfca2f76843f00af260e4`
BLAKE2b-256	`df9c750a0d8d58f6a0a714431d00e3e09d3226f7a5edc8175a23ea52d01e1389`

See more details on using hashes here.

Provenance

The following attestation bundles were made for autodidact-1.0.2-py3-none-any.whl:

Publisher: release.yml on BuffaloTechRider/Autodidact

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: autodidact-1.0.2-py3-none-any.whl
- Subject digest: ad15e9ffc6313366d230e6cd38e2d4f77095f49c3754cd96480c7360eeafb1d6
- Sigstore transparency entry: 1550574142
- Sigstore integration time: May 15, 2026
Source repository:
- Permalink: BuffaloTechRider/Autodidact@9bb6ef6cace0c3e4609c6b4dfda7ebcc1a2883e8
- Branch / Tag: refs/tags/v1.0.2
- Owner: https://github.com/BuffaloTechRider
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@9bb6ef6cace0c3e4609c6b4dfda7ebcc1a2883e8
- Trigger Event: push

autodidact 1.0.2

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

Autodidact

Three-command quickstart

How it works — the human analogy

The visible thought process

Solving the cold start

What's in v1.0

Commands

What's NOT in v1.0

What we have verified empirically:

Roadmap

Tech stack

Contributing

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance