# ScholarAgent

A multi-agent scientific research system that searches, analyzes, and synthesizes academic literature — then serves what it learns as persistent memory to your coding agent via MCP.
## Quick Start

```bash
git clone https://github.com/byBasiliosP/RLM-Agent.git
cd RLM-Agent

# Set your API keys
export OPENAI_API_KEY="sk-..."
export ANTHROPIC_API_KEY="sk-ant-..."

# Install and register in your coding agent
./install.sh
```

That's it. Restart your coding agent (Claude Code, Cursor, Windsurf, or VS Code) and you'll have 7 new tools available.
## What It Does

ScholarAgent runs a pipeline of 5 specialist AI agents that collaborate on research tasks:

```
Your Query
     |
     v
Dispatcher (orchestrator)
     |
     v
Scout -----> Reader -----> Critic -----> Analyst -----> Synthesizer
(find        (extract      (evaluate    (compare       (write
 papers)      findings)     rigor)       across)        review)
     |
     v
Memory Store (SQLite + embeddings)
     |
     v
MCP Server (7 tools for your coding agent)
```

Papers are found on arXiv and Semantic Scholar. Code examples come from GitHub. Documentation comes from the web. Everything is indexed with embeddings for fast semantic search.
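Semantic search over that index boils down to comparing a query embedding against stored entry embeddings. A minimal sketch (the `lookup` helper and the in-memory index are illustrative assumptions; the real store uses SQLite plus an embedding backend):

```python
import math

def cosine(a, b):
    # Cosine similarity between two equal-length vectors
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb) if na and nb else 0.0

def lookup(query_vec, index, top_k=3):
    # index: list of (entry_id, embedding) pairs; return the top_k closest
    scored = [(entry_id, cosine(query_vec, vec)) for entry_id, vec in index]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]

index = [("paper-1", [1.0, 0.0]), ("paper-2", [0.0, 1.0]), ("paper-3", [0.7, 0.7])]
print(lookup([1.0, 0.1], index, top_k=2))  # paper-1 ranks first
```

Real embeddings have hundreds of dimensions, but the ranking logic is the same.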
## MCP Tools

Once installed, your coding agent gets these tools:

| Tool | What It Does | Speed |
|---|---|---|
| `memory_lookup` | Semantic search — returns compact summaries with IDs | ~100ms |
| `memory_get` | Get full content of a single entry by ID (use after lookup) | instant |
| `memory_research` | Run new research — searches papers, docs, code and indexes results | 5s - 5min |
| `memory_store` | Manually save a finding, snippet, or insight for later | instant |
| `memory_forget` | Remove stale entries by ID or semantic query | instant |
| `memory_status` | Check what's in memory (counts, sources, history) | instant |
| `memory_model_config` | Show active LLM backend configuration (strong/cheap models) | instant |
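Under the hood, MCP tool invocations travel as JSON-RPC 2.0 `tools/call` requests. A sketch of what a `memory_lookup` call might look like on the wire (the `query` argument name is an assumption; consult the server's tool schema for the actual parameters):

```python
import json

# Hypothetical MCP tools/call request for memory_lookup
request = {
    "jsonrpc": "2.0",
    "id": 1,
    "method": "tools/call",
    "params": {
        "name": "memory_lookup",
        "arguments": {"query": "contrastive learning for code search"},
    },
}
print(json.dumps(request, indent=2))
```

Your coding agent builds and sends these requests for you; this is only what the protocol traffic looks like.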
## Research Depth Levels

`memory_research` supports three depth levels:

- `quick` — search only, index raw results (~5-10s)
- `normal` — search + agent analysis (~30-60s)
- `deep` — full 5-agent pipeline with synthesis (~2-5min)
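One way to picture the depth levels is as a mapping from depth to pipeline stages. This is an illustrative sketch, not the actual dispatcher code, and the assignment of stages to `normal` is an assumption:

```python
def stages_for(depth):
    # Map a requested depth to the pipeline stages it runs
    pipeline = ["scout", "reader", "critic", "analyst", "synthesizer"]
    if depth == "quick":
        return ["scout"]                # search only, index raw results
    if depth == "normal":
        return ["scout", "reader"]      # search plus agent analysis (assumed)
    if depth == "deep":
        return pipeline                 # full 5-agent pipeline with synthesis
    raise ValueError(f"unknown depth: {depth}")

print(stages_for("deep"))  # all five agents run
```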
## Focus Modes

Shape what the research emphasizes:

- `implementation` — code examples, API usage, how-to guides
- `theory` — concepts, algorithms, mathematical foundations
- `comparison` — alternatives, benchmarks, pros/cons
## Architecture

### Agents

Each agent runs an RLM loop (Reasoning via Language Models): generate Python code, execute it in a sandboxed REPL, check for a final answer, iterate.
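The loop can be sketched like this (a minimal illustration; the helper names and the `FINAL:` completion marker are hypothetical, and the real REPL is sandboxed):

```python
def rlm_loop(task, generate_code, execute, max_iterations=15):
    # Reasoning via Language Models: generate code, run it, check for a
    # final answer, feed the observation back, and iterate.
    history = [task]
    for _ in range(max_iterations):
        code = generate_code(history)          # LLM proposes Python code
        observation = execute(code)            # run it in a sandboxed REPL
        history.append(observation)
        if observation.startswith("FINAL:"):   # agent signalled completion
            return observation[len("FINAL:"):].strip()
    return None                                # iteration budget exhausted

# Toy drivers: the "LLM" answers on its second turn
script = iter(["print('searching...')", "FINAL: 3 relevant papers"])
answer = rlm_loop(
    "find papers",
    generate_code=lambda history: next(script),
    execute=lambda code: code,  # stand-in for the real REPL
)
print(answer)  # 3 relevant papers
```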
| Agent | Role | Tools |
|---|---|---|
| Scout | Finds relevant papers | arXiv search, Semantic Scholar search, citation/reference graphs. Deduplicates across sources, ranks by relevance then citations. |
| Reader | Extracts key findings | Extracts key claims, methodology, results, limitations. Assigns confidence (high/medium/low) based on available text. |
| Critic | Evaluates methodology | Scores rigor and relevance (0-1) with defined rubrics. Flags biases (selection, confirmation, publication, funding, small sample). Rates reliability. |
| Analyst | Compares across papers | Identifies themes (3+ papers), contradictions, research gaps, consensus areas. Weights findings by critic reliability scores. |
| Synthesizer | Writes the review | Produces structured markdown (Introduction, Methodology Overview, Key Findings, Contradictions & Debates, Research Gaps, Conclusion, References) with [Author et al., Year] citations. |
### Multi-LLM Routing

The Scout agent uses a cheap/fast model (e.g. gpt-4o-mini) since it just searches. All other agents use a strong model (e.g. claude-sonnet-4-6) for analytical work.

Supported backends: OpenAI, Anthropic, and LM Studio (any OpenAI-compatible local server).
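The routing rule is simple enough to sketch in a few lines (the `pick_model` function is hypothetical; the real model router lives in `clients/`):

```python
def pick_model(agent_name, strong, cheap):
    # Scout only searches, so route it to the cheap/fast model;
    # every analytical agent gets the strong model.
    return cheap if agent_name == "scout" else strong

strong = {"backend": "anthropic", "model_name": "claude-sonnet-4-6"}
cheap = {"backend": "openai", "model_name": "gpt-4o-mini"}
print(pick_model("scout", strong, cheap)["model_name"])   # gpt-4o-mini
print(pick_model("critic", strong, cheap)["model_name"])  # claude-sonnet-4-6
```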
### Memory Layer

- SQLite database at `~/.scholaragent/memory.db`
- OpenAI embeddings (`text-embedding-3-small`) for semantic search — swappable via ABC
- Deduplication — won't re-research the same query within 7 days
- Access tracking — entries track how often they're retrieved
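The 7-day deduplication rule can be sketched against a SQLite table like this (the schema and column names are assumptions for illustration, not the real `~/.scholaragent/memory.db` schema):

```python
import sqlite3
import time

SEVEN_DAYS = 7 * 24 * 3600

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE research_log (query TEXT, ts REAL)")

def should_research(query, now=None):
    # Skip re-researching a query seen within the last 7 days
    now = now or time.time()
    row = conn.execute(
        "SELECT MAX(ts) FROM research_log WHERE query = ?", (query,)
    ).fetchone()
    return row[0] is None or now - row[0] > SEVEN_DAYS

conn.execute("INSERT INTO research_log VALUES (?, ?)", ("rlhf", time.time()))
print(should_research("rlhf"))          # False: researched just now
print(should_research("quantization"))  # True: never seen
```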
### Source Types

| Source | Adapter | API |
|---|---|---|
| Papers | `tools/arxiv.py`, `tools/semantic_scholar.py` | arXiv XML, S2 REST |
| Code | `sources/github.py` | GitHub Search (needs `GITHUB_TOKEN`) |
| Docs | `sources/docs.py` | Any URL (HTML-to-text extraction) |
## Python API

Use ScholarAgent directly from Python:

```python
from scholaragent import ScholarAgent

agent = ScholarAgent(
    strong_model={"backend": "anthropic", "model_name": "claude-sonnet-4-6"},
    cheap_model={"backend": "openai", "model_name": "gpt-4o-mini"},
    max_papers=10,
    max_iterations=15,
    verbose=True,
)

result = agent.research("What are the latest advances in RLHF?")
print(result.result)  # Markdown literature review
```
## Using LM Studio (Local Models)

Run entirely on local models with zero API cost:

```python
agent = ScholarAgent(
    strong_model={"backend": "lmstudio", "model_name": "kimi-dev-72b"},
    cheap_model={"backend": "lmstudio", "model_name": "llama-3.2-3b-instruct"},
)
```

LM Studio must be running at http://localhost:1234/v1 (the default). For a custom URL:

```python
strong_model={"backend": "lmstudio", "model_name": "kimi-dev-72b", "base_url": "http://192.168.1.100:1234/v1"}
```

To run both the agent models and embeddings locally through LM Studio, set:

```bash
export SCHOLAR_STRONG_BACKEND=lmstudio
export SCHOLAR_CHEAP_BACKEND=lmstudio
export SCHOLAR_LMSTUDIO_URL=http://localhost:1234/v1
export SCHOLAR_EMBEDDING_BACKEND=lmstudio
export SCHOLAR_EMBEDDING_MODEL=text-embedding-nomic-embed-text-v1.5
```
## Installation

### pip install

```bash
pip install scholaragent
scholaragent-install
```

That's it. The installer auto-detects and registers the MCP server in Claude Code, Cursor, Windsurf, and VS Code.

### Using LM Studio (local models)

```bash
scholaragent-install --backend lmstudio
```

Pick specific models:

```bash
scholaragent-install --backend lmstudio --strong-model kimi-dev-72b --cheap-model llama-3.2-3b-instruct
```

### Uninstall

```bash
scholaragent-install --uninstall
```

### From source

```bash
git clone https://github.com/byBasiliosP/RLM-Agent.git
cd RLM-Agent
./install.sh
```

Or manually:

```bash
pip install -e .
scholaragent-install
```
## Environment Variables

| Variable | Required | Purpose |
|---|---|---|
| `OPENAI_API_KEY` | Yes* | OpenAI embeddings + OpenAI model routing |
| `ANTHROPIC_API_KEY` | Yes* | Strong model for analysis agents |
| `GITHUB_TOKEN` | No | GitHub code search (higher rate limits) |

\*When using LM Studio for all agents and embeddings, no OpenAI or Anthropic API key is required. `ANTHROPIC_API_KEY` is only needed when using the Anthropic backend.
## Project Structure

```
scholaragent/
    agents/          # 5 specialist agents (scout, reader, critic, analyst, synthesizer)
    core/            # Orchestration (dispatcher, registry, handler, REPL, comms)
    clients/         # LLM clients (OpenAI, Anthropic, LM Studio) + model router
    memory/          # Persistent store (SQLite, embeddings, research pipeline)
    sources/         # Source adapters (GitHub code, documentation)
    tools/           # Search tools (arXiv, Semantic Scholar)
    environments/    # Sandboxed Python REPL
    utils/           # Parsing, prompts, budget tracking
    mcp_server.py    # FastMCP server (7 tools)
    installer.py     # CLI installer (scholaragent-install)
tests/               # 341 tests across 23 files
examples/            # Usage examples
install.sh           # Bash installer (alternative)
```
## Development

```bash
# Install with dev dependencies
pip install -e ".[dev]"

# Run tests
python -m pytest tests/ -v

# Run a specific test file
python -m pytest tests/test_store.py -v
```
## Built On

- RLM — REPL-driven LM orchestration patterns
- MCP — Model Context Protocol for agent interoperability
- FastMCP — Python MCP server framework

## License

MIT