# PRISMA Agent — Pydantic AI Systematic Review

A standalone, agent-based PRISMA 2020 systematic literature review tool powered by Pydantic AI. Built with pydantic-ai for structured LLM interactions and typed outputs via OpenRouter.
## Architecture

```text
prisma-agent/
├── models.py    # Pydantic v2 models (Article, Protocol, Evidence, GRADE, etc.)
├── clients.py   # HTTP clients: PubMed (NCBI E-utilities), bioRxiv, SQLite cache
├── agents.py    # 9 pydantic-ai agents with typed outputs + runner functions
├── evidence.py  # LLM-powered evidence extraction (delegates to agent)
├── pipeline.py  # Async orchestrator — runs the full 15-step PRISMA pipeline
├── export.py    # Export: Markdown (PRISMA 2020 format), JSON, BibTeX
├── main.py      # Standalone CLI with argparse + interactive mode
└── README.md
```
## Design Principles

- **Agent-per-task:** Each PRISMA step that requires LLM reasoning has a dedicated pydantic-ai `Agent` with a typed `output_type`. No raw string parsing: the LLM returns validated Pydantic models.
- **No hardcoded heuristics:** Evidence extraction, screening, bias assessment, and synthesis are all handled by specialized LLM agents. No keyword lists or regex scoring.
- **Typed throughout:** Every data structure is a Pydantic `BaseModel` with validation. Structured outputs from agents are parsed and validated automatically by pydantic-ai.
- **Async pipeline:** The orchestrator uses `asyncio` for concurrent LLM calls (the bias, GRADE, and limitations assessments run in parallel).
- **Standalone:** No web framework dependency. Runs as a CLI tool and can be imported as a library.
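As a minimal illustration of the async-pipeline principle, the parallel assessment step could be structured as below. This is a sketch only: the runner names `run_bias_summary`, `run_grade`, and `run_limitations` are hypothetical stand-ins, not the actual pipeline API.

```python
import asyncio

async def assess_in_parallel(run_bias_summary, run_grade, run_limitations):
    """Await three independent LLM assessments concurrently."""
    return await asyncio.gather(
        run_bias_summary(), run_grade(), run_limitations()
    )

# Demo with stub coroutines standing in for real agent calls:
async def stub(value):
    await asyncio.sleep(0)
    return value

results = asyncio.run(assess_in_parallel(
    lambda: stub("bias summary"),
    lambda: stub("GRADE: low"),
    lambda: stub("limitations"),
))
print(results)  # ['bias summary', 'GRADE: low', 'limitations']
```

Because the three assessments share no state, `asyncio.gather` lets their LLM calls overlap instead of running back to back.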
## Installation

### From PyPI (recommended)

```shell
pip install prisma-review-agent
```

### From source

```shell
git clone https://github.com/tekrajchhetri/prisma-review-agent.git
cd prisma-review-agent
pip install -e .
```
## Quick Start

### Set API Key

```shell
export OPENROUTER_API_KEY="sk-or-v1-..."
# Optional: higher PubMed rate limits (10 req/s vs 3 req/s)
export NCBI_API_KEY="your-ncbi-key"
```
### CLI — installed package

After `pip install prisma-review-agent`, the `prisma-review` command is available globally:

```shell
# Simple review
prisma-review \
  --title "CRISPR gene therapy efficacy" \
  --inclusion "Clinical trials, human subjects, English" \
  --exclusion "Animal-only studies, reviews, commentaries"

# Full PICO specification
prisma-review \
  --title "GLP-1 agonists for type 2 diabetes: a systematic review" \
  --objective "Evaluate efficacy of GLP-1 RAs vs placebo for glycemic control" \
  --population "Adults with type 2 diabetes mellitus" \
  --intervention "GLP-1 receptor agonists" \
  --comparison "Placebo or standard care" \
  --outcome "HbA1c reduction, weight change, adverse events" \
  --inclusion "RCTs, English, 2019-2024, peer-reviewed" \
  --exclusion "Case reports, editorials, conference abstracts" \
  --model "anthropic/claude-sonnet-4" \
  --max-results 30 \
  --hops 2 \
  --rob-tool "RoB 2" \
  --extract-data \
  --export md json bib

# Interactive mode
prisma-review --interactive
```
### CLI — from source (without installing)

```shell
python main.py --title "..." --interactive
```
### Python API

```python
import asyncio
from pathlib import Path

from prisma_review_agent import (
    PRISMAReviewPipeline,
    ReviewProtocol,
    RoBTool,
    to_markdown,
    to_json,
)

protocol = ReviewProtocol(
    title="Gut microbiome and depression",
    objective="Examine the relationship between gut microbiota composition and depressive disorders",
    pico_population="Adults with major depressive disorder",
    pico_intervention="Gut microbiome profiling",
    pico_comparison="Healthy controls",
    pico_outcome="Microbiome diversity, specific taxa abundance",
    inclusion_criteria="Human studies, English, 2018-2024",
    exclusion_criteria="Animal studies, reviews, case reports",
    max_hops=1,
    rob_tool=RoBTool.NEWCASTLE_OTTAWA,
)

async def run():
    pipeline = PRISMAReviewPipeline(
        api_key="sk-or-v1-...",
        model_name="anthropic/claude-sonnet-4",
        protocol=protocol,
        max_per_query=25,
        related_depth=1,
    )
    result = await pipeline.run()

    # Export
    Path("review.md").write_text(to_markdown(result))
    Path("review.json").write_text(to_json(result))

    # Access structured data
    print(f"Included: {result.flow.included_synthesis} studies")
    for article in result.included_articles:
        rob = article.risk_of_bias.overall.value if article.risk_of_bias else "?"
        print(f"  [{article.pmid}] {article.authors} ({article.year}) — RoB: {rob}")
    for span in result.evidence_spans[:5]:
        print(f"  Evidence [{span.paper_pmid}]: {span.text[:100]}...")

asyncio.run(run())
```
## Pipeline Steps (15-step PRISMA 2020)

| Step | Agent | Output Type | Description |
|---|---|---|---|
| 1. Search Strategy | `search_strategy_agent` | `SearchStrategy` | Generates PubMed + bioRxiv queries from protocol |
| 2. PubMed Search | — (HTTP) | `list[Article]` | E-utilities esearch + efetch |
| 3. bioRxiv Search | — (HTTP) | `list[Article]` | bioRxiv API keyword matching |
| 4. Related Articles | — (HTTP) | `list[str]` | elink neighbor_score |
| 5. Citation Hops | — (HTTP) | `list[Article]` | Forward (cited-by) + backward navigation |
| 6. Deduplication | — (logic) | `list[Article]` | DOI/PMID dedup |
| 7. Title/Abstract Screening | `screening_agent` | `ScreeningBatchResult` | LLM batch screening (inclusive) |
| 8. Full-text Retrieval | — (HTTP) | `dict[str, str]` | PMC efetch |
| 9. Full-text Screening | `screening_agent` | `ScreeningBatchResult` | LLM batch screening (strict) |
| 10. Evidence Extraction | `evidence_extraction_agent` | `BatchEvidenceExtraction` | LLM identifies claims + evidence spans |
| 11. Data Extraction | `data_extraction_agent` | `StudyDataExtraction` | Per-study structured data |
| 12. Risk of Bias | `rob_agent` | `RiskOfBiasResult` | Per-study RoB 2 / ROBINS-I / NOS |
| 13. Synthesis | `synthesis_agent` | `str` | Grounded narrative with PMID citations |
| 14. Bias + GRADE | `bias_summary_agent` + `grade_agent` | `str` + `GRADEAssessment` | Parallel assessment |
| 15. Limitations | `limitations_agent` | `str` | Review limitations section |
## Agents Reference

### Agent Architecture

Each agent is defined as a module-level `pydantic_ai.Agent` with:

- **Typed output:** Pydantic model that the LLM must conform to
- **System prompt:** Static instructions + dynamic context from `RunContext[AgentDeps]`
- **Deferred model:** `defer_model_check=True`; the model is provided at runtime via `build_model()`
- **Dependencies:** `AgentDeps` dataclass carrying protocol + API credentials

```python
from agents import AgentDeps, build_model, rob_agent
from models import ReviewProtocol

deps = AgentDeps(
    protocol=ReviewProtocol(title="..."),
    api_key="sk-or-v1-...",
    model_name="anthropic/claude-sonnet-4",
)
model = build_model(deps.api_key, deps.model_name)

# Run directly
result = await rob_agent.run(
    "Title: ...\nAbstract: ...",
    deps=deps,
    model=model,
)
rob: RiskOfBiasResult = result.output
print(rob.overall)  # RoBJudgment.LOW
```
## Selecting a Model

Pass any OpenRouter model ID via `--model` on the CLI or the `model_name` argument in Python.

### CLI

```shell
# Claude Sonnet 4 (default)
prisma-review --title "..." --model anthropic/claude-sonnet-4

# Gemini 2.5 Pro
prisma-review --title "..." --model google/gemini-2.5-pro

# GPT-4o
prisma-review --title "..." --model openai/gpt-4o

# DeepSeek (cost-effective)
prisma-review --title "..." --model deepseek/deepseek-chat
```

### Python API

```python
pipeline = PRISMAReviewPipeline(
    api_key="sk-or-v1-...",
    model_name="google/gemini-2.5-pro",  # ← change here
    protocol=protocol,
)
```

Interactive mode prompts you to type a model name at startup:

```shell
prisma-review --interactive
# Enter model ID when prompted, or press Enter for the default
```
## Supported Models (via OpenRouter)

Any model available on OpenRouter works. Tested with:

| Model | ID | Notes |
|---|---|---|
| Claude Sonnet 4 | `anthropic/claude-sonnet-4` | Best balance of quality/speed |
| Claude Haiku 4 | `anthropic/claude-haiku-4` | Faster, good for screening |
| Gemini 2.5 Pro | `google/gemini-2.5-pro` | Good structured output |
| GPT-4o | `openai/gpt-4o` | Strong general performance |
| DeepSeek Chat | `deepseek/deepseek-chat` | Cost-effective |
| Llama 3.1 70B | `meta-llama/llama-3.1-70b-instruct` | Open-source option |
## Data Models

### Core Models

| Model | Purpose |
|---|---|
| `Article` | Research article with metadata, full text, RoB, extracted data |
| `EvidenceSpan` | Single evidence sentence with source, claim label, relevance score |
| `ReviewProtocol` | Full PRISMA protocol: PICO, criteria, databases, registration |
| `PRISMAFlowCounts` | PRISMA flow diagram counts for all stages |
| `PRISMAReviewResult` | Complete review result with all outputs |
### LLM Output Models

| Model | Used By | Description |
|---|---|---|
| `SearchStrategy` | `search_strategy_agent` | PubMed/bioRxiv queries, MeSH terms |
| `ScreeningBatchResult` | `screening_agent` | Batch of include/exclude decisions |
| `RiskOfBiasResult` | `rob_agent` | Per-domain RoB with overall judgment |
| `StudyDataExtraction` | `data_extraction_agent` | Study design, findings, effect measures |
| `GRADEAssessment` | `grade_agent` | GRADE domains + overall certainty |
| `BatchEvidenceExtraction` | `evidence_extraction_agent` | Evidence spans per article |
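To make the output-model pattern concrete, a batch screening result can be sketched as below. The field names here are illustrative assumptions, not the actual `models.py` definitions:

```python
from pydantic import BaseModel, Field

class ScreeningDecision(BaseModel):
    pmid: str
    include: bool
    reason: str = Field(description="One-sentence rationale for the decision")

class ScreeningBatch(BaseModel):
    decisions: list[ScreeningDecision]

# pydantic-ai validates the LLM's JSON against the model automatically;
# the same validation can be exercised directly:
batch = ScreeningBatch.model_validate(
    {"decisions": [{"pmid": "12345678", "include": True, "reason": "RCT in scope"}]}
)
print(batch.decisions[0].include)  # True
```

If the LLM returns JSON that does not match the schema, validation raises instead of silently passing malformed data downstream.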
## Export Formats

### Markdown

Full PRISMA 2020 structured report with:

- Abstract, Introduction (rationale + PICO), Methods (criteria, search strategy, selection, RoB)
- Results (flow table, study characteristics, synthesis, RoB, GRADE)
- Discussion (limitations), Other Information (registration, funding)
- References, Appendix (evidence spans)

### JSON

Complete `PRISMAReviewResult` serialized via `model_dump_json()`.

### BibTeX

Standard `@article{}` entries for all included studies.
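An exported entry might look like the following (all values are illustrative, not taken from a real review):

```bibtex
@article{smith2023example,
  author  = {Smith, Jane and Doe, Alex},
  title   = {An illustrative included study},
  journal = {Journal of Examples},
  year    = {2023},
  volume  = {12},
  pages   = {1--10},
  doi     = {10.1000/example.2023},
}
```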
## Caching

A SQLite cache (`prisma_agent_cache.db`) stores:

- PubMed search results (72h TTL)
- Article metadata
- Full-text content
- Related article links
- bioRxiv search results

Disable it with `--no-cache` (CLI) or `enable_cache=False` (Python API).
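The TTL behavior can be approximated in a few lines of stdlib code. This is a hedged sketch of the idea, not the actual schema or API in `clients.py`:

```python
import json
import sqlite3
import time

TTL_72H = 72 * 3600  # TTL applied to cached PubMed search results

def open_cache(path: str = ":memory:") -> sqlite3.Connection:
    db = sqlite3.connect(path)
    db.execute(
        "CREATE TABLE IF NOT EXISTS cache (key TEXT PRIMARY KEY, value TEXT, ts REAL)"
    )
    return db

def put(db: sqlite3.Connection, key: str, value) -> None:
    db.execute(
        "INSERT OR REPLACE INTO cache VALUES (?, ?, ?)",
        (key, json.dumps(value), time.time()),
    )

def get(db: sqlite3.Connection, key: str, ttl: float = TTL_72H):
    row = db.execute("SELECT value, ts FROM cache WHERE key = ?", (key,)).fetchone()
    if row is None or time.time() - row[1] > ttl:
        return None  # missing or expired entry
    return json.loads(row[0])

db = open_cache()
put(db, "pubmed:esearch:crispr", ["39000001", "39000002"])
print(get(db, "pubmed:esearch:crispr"))            # fresh: returns the list
print(get(db, "pubmed:esearch:crispr", ttl=-1.0))  # forced expiry: None
```

Storing a timestamp per row and checking it on read keeps expiry logic out of the write path, so stale entries cost nothing until they are next requested.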
## CLI Reference

```text
prisma-review [OPTIONS]

Protocol:
  --title, -t        Review title / research question
  --objective        Detailed objective
  --population       PICO: Population
  --intervention     PICO: Intervention
  --comparison       PICO: Comparison
  --outcome          PICO: Outcome
  --inclusion        Inclusion criteria
  --exclusion        Exclusion criteria
  --registration     PROSPERO registration number

Search:
  --model, -m        OpenRouter model (default: anthropic/claude-sonnet-4)
  --databases        Databases to search (default: PubMed bioRxiv)
  --max-results      Max results per query (default: 20)
  --related-depth    Related article depth (default: 1)
  --hops             Citation hop depth 0-4 (default: 1)
  --biorxiv-days     bioRxiv lookback days (default: 180)
  --date-start       Start date YYYY-MM-DD
  --date-end         End date YYYY-MM-DD
  --rob-tool         RoB 2 | ROBINS-I | Newcastle-Ottawa Scale

Pipeline:
  --no-cache         Disable SQLite cache
  --extract-data     Enable per-study data extraction

Output:
  --export, -e       Export formats: md json bib (default: md)
  --interactive, -i  Interactive protocol setup
```
## Extending

### Add a New Agent

1. Define the output model in `models.py`:

```python
class MyOutput(BaseModel):
    field: str
    score: float
```

2. Create the agent in `agents.py`:

```python
my_agent = Agent(
    output_type=MyOutput,
    deps_type=AgentDeps,
    system_prompt="...",
    defer_model_check=True,
    name="my_agent",
)

async def run_my_agent(data: str, deps: AgentDeps) -> MyOutput:
    model = build_model(deps.api_key, deps.model_name)
    result = await my_agent.run(data, deps=deps, model=model)
    return result.output
```

3. Integrate into `pipeline.py`.

### Add a New Data Source

1. Create a client class in `clients.py` following the `PubMedClient` pattern.
2. Add it to `PRISMAReviewPipeline.__init__()`.
3. Add a search step in `pipeline.py`.
## Dependencies

| Package | Version | Purpose |
|---|---|---|
| `pydantic-ai` | >=1.0 | Agent framework with typed outputs |
| `pydantic` | >=2.0 | Data validation and serialization |
| `httpx` | >=0.25 | Async-capable HTTP client |
## License

Apache 2.0 — see LICENSE.
## Download files
### prisma_review_agent-0.2.1.tar.gz (source distribution)

- Size: 53.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: Hatch/1.16.5 cpython/3.13.5 HTTPX/0.28.1

| Algorithm | Hash digest |
|---|---|
| SHA256 | `168ebf6fa8ed074671ddd97b7f7cfa9d0de2703ef5e73b697b63dbfed1676eb6` |
| MD5 | `bf9e8d3750c301c04c6236832c80fc20` |
| BLAKE2b-256 | `0b154e4d1414e4625ff24808ec8165ebbf0f85a2d5b61415530679565a507064` |
### prisma_review_agent-0.2.1-py3-none-any.whl (built distribution)

- Size: 35.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: Hatch/1.16.5 cpython/3.13.5 HTTPX/0.28.1

| Algorithm | Hash digest |
|---|---|
| SHA256 | `d5828cae700cb361ed1c256ec91595bb0aac4b0a488d24cfeee620075934fbdd` |
| MD5 | `d857801395d82ebf7eb79c0bef1324ab` |
| BLAKE2b-256 | `5cda608c8b53c344c6c662777167b21712c24103b71e5924f33bd2caa07c69d6` |