Free open-source research agent: daily paper digests, multi-source search, and interactive CLI

ResearchPulse

Your daily pulse on research. A free, open-source agent that emails researchers a simple morning digest of new papers in their fields, plus what's happening across the research community.

Works in two modes:

Newsletter -- People subscribe via email. Every morning a GitHub Actions cron sends each subscriber a personalized digest.
Local agent -- Run on your own machine to search papers, get previews, compare papers, track reading history, and generate insights. No account needed.

Runs on 100% free infrastructure. No servers, no database, no paid services required.

Install from PyPI (recommended)

pip install research-pulse
research-pulse help

Install with pipx (macOS / Linux)

pipx install research-pulse
pipx ensurepath          # adds ~/.local/bin to your shell PATH
exec $SHELL              # reload shell (or open a new terminal)
research-pulse help

If you still see "command not found", the install most likely succeeded but your shell PATH does not include pipx's bin directory:

# Check where pipx put the command
pipx list
ls ~/.local/bin/research-pulse

# Run directly (always works)
~/.local/bin/research-pulse help

# Or without installing
pipx run research-pulse help

# Or via Python module
python3 -m research_agent help

Add to ~/.zshrc (macOS default shell) if needed:

export PATH="$HOME/.local/bin:$PATH"

That's it: no clone, no setup wizard, no API keys. Data and preferences are stored in ~/.research-pulse/ (set RESEARCHPULSE_HOME to use a different directory).
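
As a rough illustration of the override described above, a data-directory lookup could be written as follows. This is a minimal sketch, not ResearchPulse's actual code: the function name data_home is hypothetical, and the only details taken from this README are the RESEARCHPULSE_HOME variable and the ~/.research-pulse default.

```python
import os
from pathlib import Path


def data_home(env=None) -> Path:
    """Return the data directory: RESEARCHPULSE_HOME if set, else ~/.research-pulse.

    `data_home` is a hypothetical helper name used only for this sketch.
    """
    env = os.environ if env is None else env
    override = env.get("RESEARCHPULSE_HOME")
    if override:
        # Expand a leading "~" so values like "~/rp-data" work as expected.
        return Path(override).expanduser()
    return Path.home() / ".research-pulse"


if __name__ == "__main__":
    print(data_home())
```

Passing a plain dict as env makes the lookup easy to check without touching the real environment.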

Optional extras:

pip install "research-pulse[web]"   # browser UI (Flask)

Publish to PyPI (maintainers)

The package is published on PyPI as research-pulse. Built artifacts live in dist/. The commands below are for PowerShell.

# 1. Create token: https://pypi.org/manage/account/token/
$env:TWINE_USERNAME = "__token__"
$env:TWINE_PASSWORD = "pypi-Ag..."

# 2. Build + upload (or use the helper script)
.\scripts\publish.ps1

# Test on TestPyPI first (optional)
.\scripts\publish.ps1 -TestPyPI

See PUBLISHING.md for full details.

Quick start (from source)

git clone https://github.com/research-pulse/research-pulse.git
cd research-pulse
pip install -e .

# Run every morning:
research-pulse

Daily commands

What you want	Command
Today's papers	`research-pulse`
Search	`research-pulse search "transformer efficiency"`
Follow ANY field (plain English)	`research-pulse follow "quantum error correction"`
Change topics	`research-pulse topics`
Set topics directly	`research-pulse topics ai-ml nlp cv`
Set papers per topic	`research-pulse config papers 10`
Help	`research-pulse help`

Works for every research domain

ResearchPulse ships with 31 built-in domains spanning computer science, the life sciences, physics, chemistry, math, engineering, economics, and the social sciences. If your field is not listed, follow it in plain English, with no config editing and no API keys:

research-pulse follow "protein folding"
research-pulse follow "behavioral economics"
research-pulse follow "cultural heritage preservation"

follow creates the topic on the fly, wires it to the open-access sources (OpenAlex, Europe PMC, arXiv, Crossref, and Semantic Scholar if S2_API_KEY is set), saves it to your daily digest, and immediately shows recent papers.

Optional: research-pulse chat opens the full interactive agent (compare, insights, memory).

Zotero integration

If you have Zotero installed, ResearchPulse can auto-detect your research domains from your existing library:

python -m research_agent zotero

This reads your local zotero.sqlite database (read-only, never modifies it), analyzes your collections, tags, and recent titles, and suggests which ResearchPulse topics match your interests. It also gives you a ready-to-paste command:

  Suggested ResearchPulse topics based on your library:

    ai-ml        Artificial Intelligence & Machine Learning  [####################] 100%
    nlp          Natural Language Processing                 [#######             ] 36%
    security     Security & Cryptography                     [#                   ] 8%

  Quick start with your domains:
    python -m research_agent digest --topics ai-ml nlp --open

Zotero detection works on Windows, macOS, and Linux. If Zotero is in a custom location, set ZOTERO_DATA_DIR=/path/to/your/zotero/data.
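
For readers curious how a read-only open of zotero.sqlite can be enforced, here is a minimal sketch using Python's sqlite3 URI mode. It is not taken from zotero.py: the function name open_zotero_readonly and the ~/Zotero fallback are assumptions, and it lists table names rather than relying on any particular Zotero schema.

```python
import os
import sqlite3
from pathlib import Path


def open_zotero_readonly(data_dir=None):
    """Open zotero.sqlite read-only via a SQLite URI (mode=ro).

    Hypothetical helper: honours ZOTERO_DATA_DIR and otherwise assumes the
    data directory is ~/Zotero.
    """
    base = Path(data_dir or os.environ.get("ZOTERO_DATA_DIR") or Path.home() / "Zotero")
    db_path = (base / "zotero.sqlite").resolve()
    # mode=ro makes SQLite reject every write on this connection.
    return sqlite3.connect(f"{db_path.as_uri()}?mode=ro", uri=True)


def list_tables(conn):
    """Return the table names in the database (no Zotero schema assumed)."""
    rows = conn.execute("SELECT name FROM sqlite_master WHERE type = 'table'")
    return sorted(name for (name,) in rows)
```

With mode=ro, an accidental INSERT or UPDATE raises sqlite3.OperationalError instead of changing the library.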

CLI reference

python -m research_agent                  # Interactive agent (REPL)
python -m research_agent setup            # First-time setup wizard
python -m research_agent search "query"   # One-shot paper search
python -m research_agent follow "field"   # Follow any research area (plain English)
python -m research_agent topics           # List available topics
python -m research_agent memory           # View research memory
python -m research_agent zotero           # Detect Zotero domains
python -m research_agent digest           # Run daily digest pipeline
python -m research_agent digest --dry-run # Preview without sending
python -m research_agent digest --topics ai-ml nlp --open  # Local preview
python -m research_agent web              # Start web UI (requires flask)
python -m research_agent help             # Full help with all commands

Interactive agent commands

SEARCH:      search <query> | search --all <query> | topic <id> | topics
PAPERS:      summarize <n> | rate <n> <1-5> | compare <n1> <n2> | features <n>
AI:          ask <question> | ask <n> <question> | explain <concept>
INSIGHTS:    insights | recommend | briefing | critique <hypothesis>
MEMORY:      memory | memory set name/field/role <value> | memory papers

Newsletter setup (run the public instance)

Estimated time: ~15 minutes. Everything below is free.

1. Fork this repository

Click Fork. The daily workflow comes with it.

2. Create a free email sender (Brevo)

Sign up at brevo.com (free plan: 300 emails/day, no card).
Go to SMTP & API > SMTP and note your login and SMTP key.
Verify a sender address under Senders.

3. Set up the subscriber database (Google Sheet + Apps Script)

Create a Google Sheet with this header row: email | topics | confirmed | token | created
Open Extensions > Apps Script, paste apps_script/Code.gs, and set SHEET_ID to your sheet's id (from its URL).
Deploy > New deployment > Web app: execute as Me, access Anyone. Copy the web app URL.
File > Share > Publish to web > CSV for the sheet. Copy that CSV URL.
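
To make the sheet layout concrete, the sketch below parses a published CSV with the header row from this step and keeps only confirmed subscribers. It is an illustration, not the code in subscribers.py: the function name, the set of values treated as "confirmed", and the comma-or-space topic separator are all assumptions.

```python
import csv
import io

# Assumption: values a sheet might use to mark a confirmed row.
TRUTHY = {"true", "yes", "y", "1"}


def confirmed_subscribers(csv_text):
    """Parse CSV text with columns email, topics, confirmed, token, created."""
    subscribers = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        if (row.get("confirmed") or "").strip().lower() not in TRUTHY:
            continue  # skip unconfirmed sign-ups
        # Assumption: topic ids are separated by commas and/or spaces.
        topics = (row.get("topics") or "").replace(",", " ").split()
        subscribers.append({
            "email": (row.get("email") or "").strip(),
            "topics": topics,
            "token": (row.get("token") or "").strip(),
        })
    return subscribers


SAMPLE = (
    "email,topics,confirmed,token,created\n"
    'a@example.org,"ai-ml, nlp",TRUE,tok1,2026-07-01\n'
    "b@example.org,cv,FALSE,tok2,2026-07-01\n"
)

if __name__ == "__main__":
    print(confirmed_subscribers(SAMPLE))
```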

4. Publish the signup page (GitHub Pages)

Edit docs/index.html: set APPS_SCRIPT_URL to your web app URL.
In your repo: Settings > Pages > Source: main / docs.

5. Add repository secrets

Settings > Secrets and variables > Actions > New repository secret:

Secret	Value
`SUBSCRIBERS_CSV_URL`	Published CSV URL from step 3 (the Publish to web link)
`SMTP_HOST`	`smtp-relay.brevo.com`
`SMTP_PORT`	`587`
`SMTP_USER`	Brevo SMTP login
`SMTP_KEY`	Brevo SMTP key
`SENDER_EMAIL`	Your verified sender address
`SENDER_NAME`	e.g. `ResearchPulse`
`SITE_URL`	Your Apps Script web app URL (handles unsubscribe links)
`GROQ_API_KEY` (optional)	Free Groq key for LLM summaries
`GEMINI_API_KEY` (optional)	Free Gemini key (alternative)
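
The SMTP secrets in the table map naturally onto Python's standard smtplib. The following is a sketch under stated assumptions rather than the contents of mailer.py: the function names are hypothetical, and STARTTLS on port 587 is assumed because that is the usual submission setup for that port.

```python
import os
import smtplib
import ssl
from email.message import EmailMessage
from email.utils import formataddr

REQUIRED = ("SMTP_HOST", "SMTP_PORT", "SMTP_USER", "SMTP_KEY", "SENDER_EMAIL")


def load_smtp_settings(env=None):
    """Collect SMTP settings from the environment; fail early if any is missing."""
    env = os.environ if env is None else env
    missing = [name for name in REQUIRED if not env.get(name)]
    if missing:
        raise RuntimeError("missing secrets: " + ", ".join(missing))
    name = env.get("SENDER_NAME") or "ResearchPulse"
    return {
        "host": env["SMTP_HOST"],
        "port": int(env["SMTP_PORT"]),
        "user": env["SMTP_USER"],
        "key": env["SMTP_KEY"],
        "sender": formataddr((name, env["SENDER_EMAIL"])),
    }


def build_message(settings, to_addr, subject, html):
    """Build a multipart message with a plain-text fallback and an HTML body."""
    msg = EmailMessage()
    msg["From"] = settings["sender"]
    msg["To"] = to_addr
    msg["Subject"] = subject
    msg.set_content("This digest is best viewed in an HTML-capable mail client.")
    msg.add_alternative(html, subtype="html")
    return msg


def send(settings, msg):
    """Send one message over SMTP, upgrading the connection with STARTTLS."""
    with smtplib.SMTP(settings["host"], settings["port"], timeout=30) as smtp:
        smtp.starttls(context=ssl.create_default_context())
        smtp.login(settings["user"], settings["key"])
        smtp.send_message(msg)
```

Keeping message construction separate from sending makes a dry run straightforward: build the message, inspect it, and skip the send call.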

6. Done

The workflow runs daily at 11:00 UTC. You can also trigger it manually from the Actions tab, optionally as a dry run.

Features

Free, open-access sources: arXiv, OpenAlex (250M+ works, all fields), Europe PMC (40M+ life-science papers), Crossref, bioRxiv/medRxiv, optional Semantic Scholar, and curated research-news RSS. All are free, and all except Semantic Scholar work without an API key; sources are queried in parallel, and a failure in one never blocks the digest.
Every research domain: 31 built-in topics from AI to agriculture, plus follow "<anything>" to track fields that aren't listed, with no config or keys.
Personalized: each subscriber only gets the topics they chose.
Plain-language summaries: uses the paper abstract by default (zero cost), or an LLM TL;DR if you provide a free API key (Groq / Gemini) or run a local model (Ollama).
Interactive agent: search, compare, summarize, critique papers, and get personalized insights from the command line. Tracks your reading history.
Zotero detection: auto-detects your research domains from your local Zotero library.
Research memory: remembers papers you read, your ratings, and research questions.
BM25 ranking: papers are ranked by relevance using a combination of BM25, keyword, and citation scoring.
Cross-source dedup: the same paper from arXiv and OpenAlex is shown only once.
No duplicates across days: a committed data/seen.json cache prevents repeats.
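
Two of the features above, parallel source queries that tolerate failures and cross-source dedup, can be sketched in a few lines. This is an illustration only: the function names are hypothetical, each source is modelled as a plain callable returning dicts, and matching on DOI or normalized title is an assumption about how duplicates might be detected.

```python
import re
from concurrent.futures import ThreadPoolExecutor, as_completed


def fetch_all(sources, query):
    """Query every source in parallel; collect errors instead of raising.

    `sources` maps a source name to a callable(query) -> list of paper dicts.
    """
    papers, errors = [], {}
    with ThreadPoolExecutor(max_workers=max(1, len(sources))) as pool:
        futures = {pool.submit(fn, query): name for name, fn in sources.items()}
        for future in as_completed(futures):
            name = futures[future]
            try:
                papers.extend(future.result())
            except Exception as exc:  # one failing source must not block the rest
                errors[name] = exc
    return papers, errors


def _keys(paper):
    """Identity keys for a paper: its DOI (if any) and its normalized title."""
    keys = []
    doi = (paper.get("doi") or "").strip().lower()
    if doi:
        keys.append("doi:" + doi)
    title = re.sub(r"[^a-z0-9]+", " ", (paper.get("title") or "").lower()).strip()
    if title:
        keys.append("title:" + title)
    return keys


def dedupe(papers):
    """Keep the first occurrence of each paper, matching on any shared key."""
    seen, unique = set(), []
    for paper in papers:
        keys = _keys(paper)
        if any(key in seen for key in keys):
            continue
        seen.update(keys)
        unique.append(paper)
    return unique
```

Matching on either key handles the common case where a preprint record has no DOI but the indexed record does.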

Customizing

Add a research domain: the easiest way is research-pulse follow "<field>", which appends a ready-to-use entry to config/topics.yaml automatically. For the hosted newsletter, also add the matching { id, label } to docs/index.html so it appears on the signup page.
Tune behavior: config/settings.yaml controls papers per topic, news items, lookback window, abstract length, and branding.
Summaries: provide one of GROQ_API_KEY / GEMINI_API_KEY / OLLAMA_HOST to enable LLM TL;DRs; otherwise abstracts are used.
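
The summary fallback described above can be pictured as a small selection step. This sketch is not the logic in summarize.py: the function names, the priority order among the three variables, and the truncation rule are assumptions; only the variable names and the abstract default come from this README.

```python
import os

# Assumption: the order in which backends are preferred when several are set.
BACKENDS = (
    ("GROQ_API_KEY", "groq"),
    ("GEMINI_API_KEY", "gemini"),
    ("OLLAMA_HOST", "ollama"),
)


def pick_summary_backend(env=None):
    """Return the first configured LLM backend, or "abstract" if none is set."""
    env = os.environ if env is None else env
    for variable, backend in BACKENDS:
        if env.get(variable):
            return backend
    return "abstract"


def summarize(paper, llm=None, max_chars=400):
    """Use the LLM callable if given and working; otherwise trim the abstract."""
    if llm is not None:
        try:
            return llm(paper["abstract"])
        except Exception:
            pass  # fall back to the abstract rather than dropping the paper
    text = " ".join(paper["abstract"].split())
    if len(text) <= max_chars:
        return text
    # Cut at the last word boundary before the limit.
    return text[:max_chars].rsplit(" ", 1)[0] + "..."
```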

Project layout

research_agent/
  sources/        arxiv.py, biorxiv.py, openalex.py, crossref.py,
                  semanticscholar.py, europepmc.py, rss.py, http.py
  models.py       Paper / NewsItem data structures
  config.py       YAML + env/.env loading
  log.py          Centralized logging (replaces print())
  subscribers.py  Read confirmed subscribers (published CSV or sample)
  cache.py        data/seen.json dedup state
  rank.py         Relevance ranking + per-topic selection
  bm25.py         BM25 scoring for search
  summarize.py    Abstract default + Groq/Gemini/Ollama backends
  render.py       Jinja2 HTML rendering per subscriber
  mailer.py       SMTP delivery
  pipeline.py     End-to-end digest orchestration
  agent.py        Interactive CLI agent with search/compare/insights
  search.py       On-demand keyword search across all sources
  memory.py       Persistent researcher memory (data/memory.json)
  chat.py         LLM conversation for ask/explain commands
  compare.py      Side-by-side paper comparison
  critique.py     Hypothesis challenging with evidence
  insights.py     Contradiction/gap/trend detection
  recommend.py    Personalized paper recommendations
  features.py     Feature extraction from papers
  setup.py        First-time setup wizard
  zotero.py       Zotero library detection + domain inference
  web.py          Flask web UI
  __main__.py     CLI router
config/           topics.yaml, settings.yaml, subscribers.sample.csv
templates/        email.html.j2
docs/             index.html (GitHub Pages signup)
apps_script/      Code.gs (subscription backend)
.github/workflows/daily.yml
data/             seen.json (dedup), memory.json (reading history)
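
For orientation on what bm25.py is responsible for, here is a compact sketch of standard Okapi BM25 scoring. It is a generic textbook formulation, not the project's implementation: the tokenizer, the k1 and b defaults, and the function names are assumptions, and the keyword and citation components mentioned under Features are not modelled.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase and split on non-alphanumeric characters."""
    return re.findall(r"[a-z0-9]+", text.lower())


def bm25_scores(query, docs, k1=1.5, b=0.75):
    """Return one Okapi BM25 score per document for the given query."""
    tokenized = [tokenize(doc) for doc in docs]
    n_docs = len(tokenized)
    if n_docs == 0:
        return []
    avg_len = (sum(len(tokens) for tokens in tokenized) / n_docs) or 1.0
    # Document frequency: in how many documents each term appears.
    doc_freq = Counter(term for tokens in tokenized for term in set(tokens))
    query_terms = tokenize(query)
    scores = []
    for tokens in tokenized:
        term_freq = Counter(tokens)
        score = 0.0
        for term in query_terms:
            tf = term_freq.get(term, 0)
            if tf == 0:
                continue
            idf = math.log(1 + (n_docs - doc_freq[term] + 0.5) / (doc_freq[term] + 0.5))
            length_norm = k1 * (1 - b + b * len(tokens) / avg_len)
            score += idf * tf * (k1 + 1) / (tf + length_norm)
        scores.append(score)
    return scores


if __name__ == "__main__":
    abstracts = [
        "efficient transformer inference",
        "protein folding with deep learning",
    ]
    print(bm25_scores("transformer efficiency", abstracts))
```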

Limits & scaling (all still free)

Brevo's free plan allows 300 emails/day, so one instance can serve roughly 300 confirmed subscribers (one digest each per day).
arXiv has no hard rate limit but asks for responsible use; ResearchPulse waits 3 seconds between calls and polls once daily.
OpenAlex, Europe PMC, bioRxiv, and Crossref are free and keyless, with limits generous enough for a single instance's daily polling.
Semantic Scholar is optional: its keyless pool is heavily rate-limited, so the source only activates when you set a free S2_API_KEY. Everything else works without it.
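
The pacing and opt-in behavior described above can be sketched as follows. This is a minimal illustration, not the project's code: the Throttle class and enabled_sources function are hypothetical names, and the source identifiers are assumed from the module names in the project layout. The 3-second interval and the S2_API_KEY switch come from this section.

```python
import time


class Throttle:
    """Enforce a minimum interval between consecutive calls to wait()."""

    def __init__(self, min_interval=3.0, clock=time.monotonic, sleep=time.sleep):
        self.min_interval = min_interval
        self._clock = clock   # injectable for testing
        self._sleep = sleep   # injectable for testing
        self._last = None     # time of the previous call, if any

    def wait(self):
        now = self._clock()
        if self._last is not None:
            remaining = self.min_interval - (now - self._last)
            if remaining > 0:
                self._sleep(remaining)
                now = self._clock()
        self._last = now


def enabled_sources(env):
    """Keyless sources are always on; Semantic Scholar needs S2_API_KEY."""
    sources = ["arxiv", "openalex", "europepmc", "biorxiv", "crossref"]
    if env.get("S2_API_KEY"):
        sources.append("semanticscholar")
    return sources
```

A caller would invoke wait() before each arXiv request, so the first request goes out immediately and later ones are spaced at least 3 seconds apart.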

License

MIT -- see LICENSE. Contributions welcome.

Release history

0.4.6 (Jul 3, 2026)
0.4.4 (Jul 2, 2026, this version)
0.4.3 (Jul 2, 2026)
0.4.2 (Jul 2, 2026)
0.4.1 (Jul 2, 2026)

Download files

No source distribution is available for this release.

Built distribution: research_pulse-0.4.4-py3-none-any.whl (96.3 kB), uploaded Jul 2, 2026, Python 3

File metadata

Download URL: research_pulse-0.4.4-py3-none-any.whl
Upload date: Jul 2, 2026
Size: 96.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.7.9

File hashes

Hashes for research_pulse-0.4.4-py3-none-any.whl
Algorithm	Hash digest
SHA256	`99f67feeacc19039ad51ddaab101398c01ba8807f2ca13c56a30bbffc0dc693b`
MD5	`db04868c481fc16a134dd1b88b9fb5e5`
BLAKE2b-256	`ef722ca3d121f6b7e2bae15857eb9e8e0bf8fccbd4f65acf47f4930d60c8f1b7`
