deep-research-client
Wrapper for multiple deep research tools
A simple Python wrapper for multiple deep research tools including OpenAI Deep Research, Edison Scientific (formerly FutureHouse Falcon), Asta scientific corpus retrieval, Perplexity AI, Consensus Academic Search, Cyberian agent-based research, and OpenScientist autonomous research.
Features
- 🔌 Multiple Providers: Support for OpenAI Deep Research, Edison Scientific, Asta, Perplexity AI, Consensus, Cyberian (agent-based), and OpenScientist (autonomous)
- 📄 Rich Output: Returns comprehensive markdown reports with citations
- 💾 Smart Caching: File-based caching to avoid expensive re-queries
- 🔧 Simple Configuration: Auto-detects providers from environment variables
- 📚 Library + CLI: Use as a Python library or command-line tool
- 📝 Advanced Templates: Support for both simple f-string and powerful Jinja2 templates
- 🏗️ Extensible: Easy to add new research providers
Installation
# Install with pip
pip install deep-research-client
# Or use uvx to run directly without installation
uvx deep-research-client research "What is CRISPR?"
# Or add to a uv project
uv add deep-research-client
# For development
git clone <repo-url>
cd deep-research-client
uv sync
Quick Start
Set up API Keys
# For OpenAI Deep Research
export OPENAI_API_KEY="your-openai-key"
# For Edison Scientific (formerly FutureHouse Falcon)
export EDISON_API_KEY="your-edison-key"
# For Asta retrieval
export ASTA_API_KEY="your-asta-key"
# For Perplexity AI
export PERPLEXITY_API_KEY="your-perplexity-key"
# For Consensus AI (requires application approval)
export CONSENSUS_API_KEY="your-consensus-key"
# For OpenScientist (autonomous research agent) - see setup guide below
export OPENSCIENTIST_API_KEY="name:secret"
# Optional: custom instance URL (defaults to https://www.openscientist.io)
export OPENSCIENTIST_URL="https://www.openscientist.io"
# For Cyberian (agent-based research) - requires cyberian installation
pip install deep-research-client[cyberian]
# Cyberian uses your local AI agents (Claude, etc.) - no separate API key needed
Note: the Asta provider is retrieval-only and does not consume prompts verbatim. Markdown-heavy or template-style inputs are pre-processed into plain text before submission, and long inputs are truncated to the configured query_char_limit (500 characters by default).
Command Line Usage
# Basic research query
deep-research-client research "What is CRISPR gene editing?"
# Run without installation using uvx
uvx deep-research-client research "What is CRISPR gene editing?"
# Use specific provider and model
deep-research-client research "Explain quantum computing" --provider perplexity --model sonar-pro
# Save to file (citations included by default)
deep-research-client research "Machine learning trends 2024" --output report.md
# Save citations to separate file
deep-research-client research "AI trends 2024" --output report.md --separate-citations
# Use simple template with variables (f-string style)
deep-research-client research --template gene_research.md --var "gene=TP53" --var "organism=human"
# Load the query text directly from a Markdown/text file
deep-research-client research --input-file prompts/topic.md
# Use advanced Jinja2 template with conditionals
deep-research-client research --template gene_advanced.md.j2 \
--var "gene=BRCA1" \
--var "organism=human" \
--var "detail_level=high"
# List available providers
deep-research-client providers
# List available models (with costs, speeds, and capabilities)
deep-research-client models
deep-research-client models --provider perplexity --detailed
# Cache management
deep-research-client list-cache # Show cached files
deep-research-client clear-cache # Remove all cache
You can provide the research question directly as a positional argument, read it from a file with --input-file, or generate it from a template (--template). These modes are mutually exclusive to keep intent clear.
Python Library Usage
from deep_research_client import DeepResearchClient
# Initialize client (auto-detects providers from env vars)
client = DeepResearchClient()
# Perform research with model selection
result = client.research("What are the latest developments in AI?", provider="perplexity", model="sonar-pro")
print(result.markdown) # Full markdown report
print(f"Provider: {result.provider}")
print(f"Model: {result.model}")
print(f"Duration: {result.duration_seconds:.2f}s")
print(f"Citations: {len(result.citations)}")
print(f"Cached: {result.cached}")
Advanced Configuration
from deep_research_client import DeepResearchClient, ProviderConfig, CacheConfig
# Custom cache settings
cache_config = CacheConfig(
enabled=True,
directory="./my_cache" # Custom cache location
)
# Custom provider settings
provider_configs = {
"openai": ProviderConfig(
name="openai",
api_key="your-key",
timeout=900, # 15 minutes
max_retries=5
)
}
client = DeepResearchClient(
cache_config=cache_config,
provider_configs=provider_configs
)
Provider-Specific Parameters
Each provider supports specific parameters for fine-tuning behavior. The library provides both harmonized parameters (work across all providers) and provider-specific parameters (native API options).
Harmonized Parameters
These parameters work consistently across all providers:
allowed_domains: Filter web search to specific domains (max 20):
client.research(
    "What is CRISPR?",
    provider="openai",
    provider_params={"allowed_domains": ["wikipedia.org", "nih.gov"]}
)
OpenAI-Specific Parameters
from deep_research_client.provider_params import OpenAIParams
params = OpenAIParams(
allowed_domains=["pubmed.ncbi.nlm.nih.gov", "clinicaltrials.gov"],
temperature=0.2,
max_tokens=4000,
top_p=0.95
)
result = client.research("Cancer immunotherapy research", provider="openai", provider_params=params)
Perplexity-Specific Parameters
Perplexity supports both the harmonized allowed_domains and its native search_domain_filter (which supports deny-lists):
from deep_research_client.provider_params import PerplexityParams
# Using harmonized parameter
params = PerplexityParams(
allowed_domains=["wikipedia.org", "github.com"], # Allowlist only
reasoning_effort="high",
search_recency_filter="month"
)
# Or using Perplexity's native parameter for more control
params = PerplexityParams(
search_domain_filter=[
"github.com", # Allow
"stackoverflow.com", # Allow
"-reddit.com", # Deny (prefix with -)
"-quora.com" # Deny
],
reasoning_effort="high"
)
result = client.research("Python best practices", provider="perplexity", provider_params=params)
Falcon-Specific Parameters
from deep_research_client.provider_params import FalconParams
params = FalconParams(
temperature=0.1,
max_tokens=8000
)
result = client.research("Protein folding mechanisms", provider="falcon", provider_params=params)
Asta-Specific Parameters
The Asta provider sanitizes inputs before sending them to the Asta endpoint. In practice, that means Markdown formatting, frontmatter, bullets, and similar prompt scaffolding are stripped out, and the resulting plain-text query is truncated near a word boundary to query_char_limit. This is intentional: Asta works best with compact retrieval queries, not full deep-research prompt templates.
from deep_research_client.provider_params import AstaParams
params = AstaParams(
query_char_limit=500,
paper_limit=50,
snippet_limit=20,
publication_date_range="2020:",
)
result = client.research("CRISPR off-target effects", provider="asta", provider_params=params)
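This sanitization is worth keeping in mind when reusing deep-research templates with Asta. A minimal sketch of the word-boundary truncation described above (an illustration of the documented behavior, not the library's exact code):
def truncate_at_word_boundary(text: str, limit: int = 500) -> str:
    # Queries at or under the limit pass through unchanged
    if len(text) <= limit:
        return text
    cut = text[:limit]
    # Back up to the last space so the query does not end mid-word
    return cut.rsplit(" ", 1)[0] if " " in cut else cut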
Cyberian-Specific Parameters (Agent-Based Research)
Cyberian is unique among the providers: it uses local AI agents (Claude, Aider, etc.) running through the cyberian workflow system to perform iterative, multi-step research.
from deep_research_client.provider_params import CyberianParams
# Install cyberian provider support first
# pip install deep-research-client[cyberian]
params = CyberianParams(
agent_type="claude", # Agent to use: claude, aider, cursor, goose
workflow_file=None, # Custom workflow (defaults to deep-research.yaml)
port=3284, # agentapi server port
skip_permissions=True, # Skip permission checks for agent
sources="academic papers" # Source guidance for research
)
# Note: Cyberian research is slow but thorough
# - Typically takes 10-30 minutes per query
# - Performs iterative research with citation management
# - Creates structured reports with REPORT.md and citations/
result = client.research(
"What are the mechanisms of autophagy in cancer?",
provider="cyberian",
provider_params=params
)
# The result includes:
# - Comprehensive markdown report (from REPORT.md)
# - Citations extracted from citations/ directory
# - Timing information for the full workflow
print(result.markdown)
print(f"Citations found: {len(result.citations)}")
print(f"Research took: {result.duration_seconds / 60:.1f} minutes")
When to use Cyberian:
- 📚 Comprehensive literature reviews
- 🔬 Deep technical research requiring synthesis
- 🔗 Multi-source citation management
- 🔁 Iterative hypothesis exploration
Trade-offs:
- ⏱️ Slow: 10-30+ minutes per query (vs seconds for API providers)
- 💰 Variable cost: Depends on agent and research depth
- 🖥️ Local compute: Requires agentapi and agent setup
- 🎯 Thorough: More comprehensive than API-based providers
OpenScientist-Specific Parameters (Autonomous Research)
OpenScientist is an autonomous AI research agent from Berkeley Lab that runs iterative hypothesis-driven research using PubMed search and code execution. Jobs are submitted to a hosted instance and take 10-60+ minutes to complete.
Setup Guide
- Create an account: Go to openscientist.io and sign up
- Wait for approval: Your account must be approved by an administrator before you can create jobs. Until approved, the API returns 403 Forbidden: "Your account is pending administrator approval"
- Generate an API key: Once approved, create an API key via the web UI or API. The key is shown only once, in name:secret format; copy it immediately
- Set environment variables:
export OPENSCIENTIST_API_KEY="yourname:yoursecret"
# Optional: custom instance URL (defaults to https://www.openscientist.io)
export OPENSCIENTIST_URL="https://www.openscientist.io"
- Verify: deep-research-client providers should list openscientist
Parameters
from deep_research_client.provider_params import OpenScientistParams
params = OpenScientistParams(
max_iterations=5, # Research iterations (1-20, default 5)
use_hypotheses=False, # Enable hypothesis tracking
investigation_mode="autonomous", # "autonomous" or "coinvestigate"
poll_interval=30, # Seconds between status checks
timeout=3600, # Max wait time in seconds (default 1 hour)
)
result = client.research(
"What are the pathophysiology mechanisms of Phelan-McDermid syndrome?",
provider="openscientist",
provider_params=params
)
# OpenScientist returns PMID citations
print(f"Citations: {result.citations}") # ['PMID:12345678', 'PMID:87654321', ...]
print(f"Research took: {result.duration_seconds / 60:.1f} minutes")
When to use OpenScientist:
- 🔬 Disease pathophysiology research with PubMed citations
- 📚 Hypothesis-driven scientific literature reviews
- 🧬 Biomedical mechanism discovery
- 📊 Evidence synthesis with structured PMID references
Trade-offs:
- ⏱️ Very slow: 10-60+ minutes per job (iterative multi-step research)
- 💰 Variable cost: Uses Claude under the hood; cost scales with iterations
- 🔐 Requires approval: Account must be admin-approved on openscientist.io
- 🎯 PubMed-focused: Deep scientific literature coverage with real PMID citations
CLI Usage with Provider Parameters
Currently, provider-specific parameters are primarily accessible via the Python API. For CLI usage, use the default parameters or create a small wrapper script like the sketch below.
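For example, a minimal wrapper (a sketch; the script name and the choice of PerplexityParams are illustrative) that exposes the harmonized allowed_domains option on the command line:
# research_with_domains.py - hypothetical wrapper exposing allowed_domains
# Usage: python research_with_domains.py "Python best practices" github.com stackoverflow.com
import sys
from deep_research_client import DeepResearchClient
from deep_research_client.provider_params import PerplexityParams

def main() -> None:
    query = sys.argv[1]
    domains = sys.argv[2:] or None  # optional domain allowlist
    params = PerplexityParams(allowed_domains=domains)
    client = DeepResearchClient()
    result = client.research(query, provider="perplexity", provider_params=params)
    print(result.markdown)

if __name__ == "__main__":
    main()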
Using Proxies and OpenAI-Compatible Endpoints
The client supports using proxy services and OpenAI-compatible endpoints, enabling you to route requests through institutional proxies like CBORG, or use alternative services like Azure OpenAI, LiteLLM, or OpenRouter.
CBORG (Berkeley Lab's AI Portal)
CBORG provides an OpenAI-compatible proxy for institutional access with cost management and tracking.
# Set up CBORG API key
export CBORG_API_KEY="your-cborg-key"
# Use CBORG proxy with convenient shortcut
deep-research-client research "Quantum computing advances" --use-cborg
# Run without installation using uvx
uvx deep-research-client research "AI ethics" --use-cborg
The --use-cborg flag automatically:
- Sets the base URL to https://api.cborg.lbl.gov
- Uses the CBORG_API_KEY environment variable
- Maintains compatibility with all OpenAI models
Custom OpenAI-Compatible Endpoints
Use any OpenAI-compatible service with --base-url:
# Azure OpenAI
deep-research-client research "AI trends" \
--base-url https://your-resource.openai.azure.com \
--api-key-env AZURE_OPENAI_KEY
# LiteLLM proxy (local or hosted)
deep-research-client research "ML developments" \
--base-url http://localhost:4000 \
--api-key-env LITELLM_API_KEY
# OpenRouter
deep-research-client research "Technology review" \
--base-url https://openrouter.ai/api/v1 \
--api-key-env OPENROUTER_API_KEY
# Custom proxy with default OPENAI_API_KEY
deep-research-client research "Research query" \
--base-url https://api.example.com
Proxy Configuration Options
| Option | Description | Example |
|---|---|---|
| --use-cborg | Use CBORG proxy (shortcut) | --use-cborg |
| --base-url <url> | Custom API endpoint URL | --base-url https://api.example.com |
| --api-key-env <name> | Environment variable for API key | --api-key-env CBORG_API_KEY |
Python API with Proxies
import os
from deep_research_client import DeepResearchClient, ProviderConfig
# CBORG configuration
cborg_config = {
"openai": ProviderConfig(
name="openai",
api_key=os.getenv("CBORG_API_KEY"),
base_url="https://api.cborg.lbl.gov",
enabled=True
)
}
client = DeepResearchClient(provider_configs=cborg_config)
result = client.research("Latest AI developments")
# Custom endpoint (e.g., Azure OpenAI)
azure_config = {
"openai": ProviderConfig(
name="openai",
api_key=os.getenv("AZURE_OPENAI_KEY"),
base_url="https://your-resource.openai.azure.com",
enabled=True
)
}
client = DeepResearchClient(provider_configs=azure_config)
result = client.research("AI ethics research")
Common Proxy Use Cases
Institutional Access (CBORG)
# Keep personal OpenAI key for direct access
export OPENAI_API_KEY="sk-personal-key"
# Use CBORG for institutional billing
export CBORG_API_KEY="cborg-institutional-key"
deep-research-client research "Scientific topic" --use-cborg
LiteLLM Multi-Provider Gateway
# Run LiteLLM proxy locally
litellm --port 4000
# Use it with deep-research-client
export LITELLM_API_KEY="your-key"
deep-research-client research "Research query" \
--base-url http://localhost:4000 \
--api-key-env LITELLM_API_KEY
Azure OpenAI Deployment
export AZURE_OPENAI_KEY="your-azure-key"
deep-research-client research "Enterprise research" \
--base-url https://your-resource.openai.azure.com \
--api-key-env AZURE_OPENAI_KEY
Benefits of Proxy Usage
- Institutional Billing: Route through services like CBORG for cost management
- Compliance: Meet data residency and privacy requirements
- Multi-Provider: Use LiteLLM to access multiple LLM providers through one interface
- Fallback: Implement automatic provider fallback for reliability
- Cost Control: Add budget limits and rate limiting through gateway services
Caching
The client uses intelligent file-based caching to avoid expensive re-queries:
- Location: Caches are stored in the ~/.deep_research_cache/ directory (user's home)
- Persistence: Cache files never expire (permanent until manually cleared)
- Human-Readable Names: Files are named like openai-what-is-crispr-gene-a1b2c3d4.json
- Uniqueness: A SHA256 hash suffix ensures no collisions
- Management:
  - deep-research-client clear-cache removes all cached files
  - deep-research-client list-cache shows all cached files
  - The --no-cache CLI flag bypasses the cache for a single query
Cache File Structure
~/.deep_research_cache/
├── openai-what-is-crispr-gene-a1b2c3d4.json
├── falcon-machine-learning-trends-f6e5d4c3.json
├── openai-quantum-computing-basics-b8f7e2a9.json
└── ...
Each cache file contains:
{
"markdown": "# Research Report\n...",
"citations": ["Citation 1", "Citation 2"],
"provider": "openai",
"query": "original query text"
}
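A minimal sketch of how such a file name could be derived from the provider and query (the slug length and hash truncation here are assumptions, not the library's verbatim scheme):
import hashlib
import re

def cache_filename(provider: str, query: str) -> str:
    # Slugify: lowercase alphanumeric words joined by hyphens, kept short
    slug = "-".join(re.findall(r"[a-z0-9]+", query.lower()))[:24]
    # Short SHA256 suffix keeps names unique even when slugs collide
    digest = hashlib.sha256(f"{provider}:{query}".encode()).hexdigest()[:8]
    return f"{provider}-{slug}-{digest}.json"

# cache_filename("openai", "What is CRISPR gene editing?")
# -> "openai-what-is-crispr-gene-edit-<8 hex chars>.json"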
Templates
Use template files with variable substitution for reusable research queries. The client supports both simple f-string templates and powerful Jinja2 templates with conditionals, loops, and filters.
Template Formats
F-String Templates (Simple)
Create a template file with {variable} placeholders:
# Gene Research Template
Please research the gene {gene} in {organism}, focusing on:
1. Function and molecular mechanisms
2. Disease associations
3. Expression patterns in {tissue} tissue
4. Recent discoveries since {year}
Gene: {gene}
Organism: {organism}
Tissue: {tissue}
Year: {year}
Jinja2 Templates (Advanced)
For more complex templates with conditionals, loops, and logic, use Jinja2 syntax with {{variable}} and save with .j2 extension:
# Gene Research: {{gene}}
Research the {{gene}} gene in {{organism}}.
{% if detail_level == "high" %}
Provide comprehensive information including:
- Detailed molecular mechanisms
- All known disease associations
- Complete expression atlas
- Evolutionary conservation
{% else %}
Provide a concise overview covering:
- Primary function
- Major disease links
- Key expression patterns
{% endif %}
Focus areas:
{% for topic in topics %}
- {{topic|capitalize}}
{% endfor %}
{% if year %}
Emphasize discoveries from {{year}} onwards.
{% endif %}
Format Detection
Templates are automatically detected as Jinja2 based on:
- File extension: .j2, .jinja, .jinja2 (e.g., template.md.j2)
- Frontmatter: YAML frontmatter with format: jinja in .md files
- Default: F-string format for backward compatibility
Example with frontmatter:
---
format: jinja
description: Advanced gene research
---
Research {{gene}}{% if organism %} in {{organism}}{% endif %}.
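A minimal sketch of this detection logic (the function is illustrative; the library's template processor handles this internally):
from pathlib import Path

JINJA_EXTENSIONS = {".j2", ".jinja", ".jinja2"}

def detect_template_format(path: Path) -> str:
    # Extension check: "template.md.j2" has suffix ".j2"
    if path.suffix in JINJA_EXTENSIONS:
        return "jinja"
    # Frontmatter check: a leading YAML block declaring "format: jinja"
    text = path.read_text()
    if text.startswith("---"):
        parts = text.split("---", 2)
        if len(parts) > 2 and "format: jinja" in parts[1]:
            return "jinja"
    return "fstring"  # default, for backward compatibility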
Using Templates
# Simple f-string template
deep-research-client research \
--template gene_research.md \
--var "gene=TP53" \
--var "organism=human" \
--var "tissue=brain" \
--var "year=2020"
# Jinja2 template with conditionals
deep-research-client research \
--template gene_advanced.md.j2 \
--var "gene=BRCA1" \
--var "organism=human" \
--var "detail_level=high"
# With uvx (no installation)
uvx deep-research-client research \
--template gene_research.md \
--var "gene=TP53" \
--var "organism=human"
# Missing variables will show helpful error
deep-research-client research --template gene_research.md
# Error: Template requires variables: gene, organism, tissue, year
# Use --var key=value for each variable
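For f-string templates, the set of required variables can be recovered with the standard library alone; a small sketch of the idea behind that error message:
from string import Formatter

template_text = "Research the gene {gene} in {organism}, since {year}."
required = {name for _, name, _, _ in Formatter().parse(template_text) if name}
print(sorted(required))  # ['gene', 'organism', 'year']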
Template Examples
The repository includes example templates:
- templates/gene_family.md - Simple f-string template for gene families
- templates/gene_jinja.md.j2 - Jinja2 template with conditionals
- templates/gene_frontmatter.md - Markdown with frontmatter-based format detection
Python API with Templates
from deep_research_client import DeepResearchClient
from deep_research_client.processing import TemplateProcessor
from pathlib import Path
# Initialize processor (auto-detects format)
template_processor = TemplateProcessor()
# Simple f-string template
query = template_processor.render_template(
Path("gene_research.md"),
{"gene": "TP53", "organism": "human", "tissue": "brain", "year": "2020"}
)
# Jinja2 template with advanced features
query = template_processor.render_template(
Path("gene_advanced.md.j2"),
{
"gene": "BRCA1",
"organism": "human",
"detail_level": "high",
"topics": ["function", "mutations", "expression"],
"year": "2020"
}
)
# Process template with metadata (includes format info)
rendered_query, metadata = template_processor.process_template(
Path("gene_research.md.j2"),
{"gene": "TP53", "organism": "human"}
)
print(f"Format used: {metadata['template_format']}") # 'jinja' or 'fstring'
# Use with client
client = DeepResearchClient()
result = client.research(query)
Jinja2 Features
Jinja2 templates support powerful features:
- Conditionals: {% if condition %}...{% endif %}
- Loops: {% for item in list %}...{% endfor %}
- Filters: {{gene|upper}}, {{organism|capitalize}}
- Default values: {{year|default('2024')}}
- Comments: {# This is a comment #}
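These are standard Jinja2 features, so they can be tried out directly against plain jinja2, independent of this library:
from jinja2 import Template

t = Template("{{gene|upper}} in {{organism|capitalize}}, since {{year|default('2024')}}")
print(t.render(gene="tp53", organism="human"))
# TP53 in Human, since 2024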
See docs/templates.md for comprehensive template documentation.
Output Format
The client outputs structured markdown with YAML frontmatter containing metadata:
---
provider: perplexity
model: sonar-pro
cached: false
start_time: '2025-10-18T17:43:49.437056'
end_time: '2025-10-18T17:44:08.922200'
duration_seconds: 19.49
template_file: examples/gene_research.md.j2 # If template used
template_format: jinja # Template format: 'jinja' or 'fstring'
template_variables: # If template used
gene: TP53
organism: human
detail_level: high
provider_config:
timeout: 600
max_retries: 3
citation_count: 18
---
## Question
What is machine learning?
## Output
**Machine learning** is a branch of artificial intelligence...
## Citations
1. https://www.ibm.com/topics/machine-learning
2. https://www.coursera.org/articles/what-is-machine-learning
...
Citation Handling
- Default: Citations are included at the end of the markdown output
- Separate file: Use --separate-citations to save citations to a .citations.md file
- Library usage: Access citations via the result.citations list
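The separate-citations behavior has a direct library equivalent; a small sketch that writes the report and its citations to side-by-side files (the file names are illustrative):
from pathlib import Path
from deep_research_client import DeepResearchClient

client = DeepResearchClient()
result = client.research("What is CRISPR?")

Path("report.md").write_text(result.markdown)
lines = "\n".join(f"{i}. {c}" for i, c in enumerate(result.citations, start=1))
Path("report.citations.md").write_text(lines)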
Model Discovery & Introspection
The client includes comprehensive model discovery capabilities to help you find the right model for your needs.
Listing Available Models
# List all available models across all providers
deep-research-client models
# Show models for a specific provider
deep-research-client models --provider openai
deep-research-client models --provider perplexity
# Filter by cost level
deep-research-client models --cost low # Budget-friendly options
deep-research-client models --cost medium # Balanced options
deep-research-client models --cost high # Premium options
deep-research-client models --cost very_high # Most expensive/comprehensive
# Filter by capability
deep-research-client models --capability web_search
deep-research-client models --capability academic_search
deep-research-client models --capability scientific_literature
# Show detailed information including pricing, use cases, and limitations
deep-research-client models --detailed
# Combine filters
deep-research-client models --provider perplexity --cost low --detailed
Model Aliases
Most models support convenient short aliases:
# These are equivalent
deep-research-client research "AI trends" --provider openai --model o3-deep-research-2025-06-26
deep-research-client research "AI trends" --provider openai --model o3
deep-research-client research "AI trends" --provider openai --model o3-deep
# Perplexity aliases
--model sonar-deep-research # Can use: deep, deep-research, sdr
--model sonar-pro # Can use: pro, sp
--model sonar # Can use: basic, fast, s
# Using uvx with aliases
uvx deep-research-client research "AI" --provider openai --model o3
Model Selection
Use --model to override the default:
# Use faster Perplexity model
deep-research-client research "AI trends" --provider perplexity --model sonar-pro
# Use default deep research model (comprehensive but slower)
deep-research-client research "AI trends" --provider perplexity # Uses sonar-deep-research
# Override OpenAI model using alias
deep-research-client research "AI trends" --provider openai --model o4-mini
Available Models by Provider
| Provider | Model | Aliases | Cost | Speed | Context | Key Features |
|---|---|---|---|---|---|---|
| OpenAI | o3-deep-research-2025-06-26 | o3, o3-deep, o3dr | 💰💰💰💰 | 🐢 | 128K | Most comprehensive, web search, code analysis |
| OpenAI | o4-mini-deep-research-2025-06-26 | o4m, o4-mini, mini | 💰💰 | ⏳ | 128K | Balanced speed/quality |
| Perplexity | sonar-deep-research | deep, deep-research, sdr | 💰💰💰 | 🐌 | 200K | Comprehensive with real-time data |
| Perplexity | sonar-pro | pro, sp | 💰💰 | ⏳ | 200K | Fast with good quality |
| Perplexity | sonar | basic, fast, s | 💰 | ⚡ | 100K | Fastest, budget-friendly |
| Edison | Edison Scientific Literature | falcon, edison, eds, science | 💰💰💰 | 🐌 | - | Scientific literature focus |
| Consensus | Consensus Academic Search | consensus, academic, papers, c | 💰 | ⚡ | - | Peer-reviewed papers only |
Cost Legend: 💰 = Low, 💰💰 = Medium, 💰💰💰 = High, 💰💰💰💰 = Very High. Speed Legend: ⚡ = Fast, ⏳ = Medium, 🐌 = Slow, 🐢 = Very Slow.
Model Capabilities
Each model has different capabilities:
| Capability | Models |
|---|---|
| Web Search | OpenAI (o3, o4-mini), Perplexity (all) |
| Academic Search | Edison, Consensus |
| Scientific Literature | Edison |
| Real-time Data | OpenAI (all), Perplexity (all) |
| Citation Tracking | All providers |
| Code Interpretation | OpenAI (o3) |
Python API for Model Discovery
from deep_research_client.model_cards import (
    get_provider_model_cards,
    list_all_models,
    find_models_by_cost,
    find_models_by_capability,
    resolve_model_alias,
    CostLevel,
    ModelCapability,
    TimeEstimate
)
# Get all models for a provider
openai_cards = get_provider_model_cards("openai")
models = openai_cards.list_models()
print(models) # ['o3-deep-research-2025-06-26', 'o4-mini-deep-research-2025-06-26']
# Get detailed model information
card = openai_cards.get_model_card("o3-deep-research-2025-06-26")
print(f"Cost: {card.cost_level}")
print(f"Speed: {card.time_estimate}")
print(f"Capabilities: {card.capabilities}")
print(f"Context Window: {card.context_window}")
print(f"Use Cases: {card.use_cases}")
print(f"Limitations: {card.limitations}")
# Find models across all providers by criteria
cheap_models = find_models_by_cost(CostLevel.LOW)
# Returns: {'perplexity': ['sonar'], 'consensus': ['Consensus Academic Search']}
web_search_models = find_models_by_capability(ModelCapability.WEB_SEARCH)
# Returns: {'openai': ['o3-deep-research-2025-06-26', ...], 'perplexity': [...]}
# Resolve aliases to full model names
full_name = resolve_model_alias("openai", "o3")
print(full_name) # 'o3-deep-research-2025-06-26'
# List all models across all providers
all_models = list_all_models()
# Returns: {'openai': [...], 'perplexity': [...], 'falcon': [...], ...}
# Get models by speed
fast_models = openai_cards.get_models_by_time(TimeEstimate.FAST)
Model Selection Guidelines
For comprehensive research:
- OpenAI o3 (most thorough, expensive, slow)
- Perplexity sonar-deep-research (thorough with real-time data)
For balanced performance:
- OpenAI o4-mini (good quality, reasonable cost)
- Perplexity sonar-pro (fast with good coverage)
For quick lookups:
- Perplexity sonar (fastest, cheapest)
For academic/scientific research:
- Edison (scientific literature focused, powered by PaperQA3)
- Consensus (peer-reviewed papers only)
Budget considerations:
# Most expensive but most comprehensive
deep-research-client research "complex topic" --provider openai --model o3
# Good balance of cost and quality
deep-research-client research "complex topic" --provider perplexity --model pro
# Cheapest option for quick answers
deep-research-client research "simple question" --provider perplexity --model sonar
Supported Providers
| Provider | Environment Variable | Model/Service | Strengths |
|---|---|---|---|
| OpenAI | OPENAI_API_KEY | o3-deep-research-2025-06-26 | Deep research, comprehensive reports |
| Edison | EDISON_API_KEY | Edison Scientific Literature | Scientific literature focus, powered by PaperQA3 |
| Perplexity | PERPLEXITY_API_KEY | sonar-deep-research | Real-time web search, recent sources |
| Consensus | CONSENSUS_API_KEY | Consensus Academic Search | Peer-reviewed academic papers, evidence-based research |
| OpenScientist | OPENSCIENTIST_API_KEY | openscientist-autonomous | Iterative hypothesis-driven research, PubMed PMID citations |
Development
Essential Commands
# Run tests
just test
# Type checking
just mypy
# Format code
just format
# Install dependencies
uv sync
# Run CLI locally
uv run deep-research-client --help
Adding New Providers
- Create a new provider class in src/deep_research_client/providers/
- Inherit from ResearchProvider
- Implement the research() method
- Register in client.py
Example:
from . import ResearchProvider
from ..models import ResearchResult

class NewProvider(ResearchProvider):
    async def research(self, query: str) -> ResearchResult:
        # Your implementation: call the backing service, then map its
        # response onto the result fields (markdown, citations, ...)
        return ResearchResult(...)
Repository Structure
- docs/ - mkdocs documentation
- src/deep_research_client/ - main package
  - client.py - main client class
  - models.py - Pydantic models
  - cache.py - caching implementation
  - providers/ - research providers
- tests/ - test suite
Potential Future Providers
The following providers were researched but not yet implemented:
Research-Focused Providers
- Semantic Scholar API - Free access to 200+ million academic papers with comprehensive metadata
- Elicit AI - AI research assistant focused on answering research questions
- Scite.ai - Citation-based research with smart citations and paper analysis
General AI Providers with Research Capabilities
- Anthropic Claude - Has web search API but lacks dedicated deep research mode
- Google Gemini - Deep Research feature exists but requires allowlist API access
- Exa (Metaphor) - AI-powered search API designed for research workflows
Specialized Tools
- Qatalog - Enterprise knowledge search and research
- Azure AI Search - Microsoft's cognitive search capabilities
- IBM Watson Discovery - Enterprise document analysis and research
These providers could be added based on user demand and API availability. Contributions welcome!
Claude Code Skill
This repository includes a Claude Code skill that makes it easy to use deep-research-client directly from Claude Code.
Installation
To use the skill:
# Copy to your local Claude skills directory
cp -r .claude/skills/run-deep-research ~/.claude/skills/
# Or use it project-wide (already available in this repo)
# Claude Code will auto-detect skills in .claude/skills/
Features
The skill provides:
- Guided research workflow: Automatically asks about speed vs depth preferences
- Provider selection: Helps choose the right provider and model for your needs
- Template support: Easy access to research templates for common patterns
- Smart defaults: Leverages caching and best practices automatically
Usage with Claude Code
Once installed, simply ask Claude to research topics:
"Research the latest developments in CRISPR gene editing"
"Do a quick overview of quantum computing (fast approach)"
"I need a comprehensive analysis of TP53 gene in humans"
Claude will automatically:
- Ask whether you want a fast/light or comprehensive/slow approach
- Select the appropriate provider and model
- Execute the research with proper caching
- Save results with citations and metadata
See .claude/skills/README.md for more details.
License
BSD-3-Clause