A configurable daily ArXiv digest agent — filter and summarise papers in any research area

Reason this release was yanked:

uploaded under wrong name, use correct-name instead

Project description

arxiv-paper-digest

A daily agent that monitors ArXiv for XAI and interpretability papers, filters by semantic similarity, summarises with a local LLM, and saves a Markdown digest. No external API keys required.

How it works

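1. Fetch new entries from the ArXiv RSS feeds.
2. filter_unseen: drop papers already recorded in the SQLite store.
3. Semantic filter: keep papers similar enough to the anchor sentences (sentence-transformers).
4. Summarise each remaining paper with Ollama.
5. Write the digest to outputs/digests/YYYY-MM-DD.md.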
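The two filtering stages can be sketched with the standard library alone. This is a minimal illustration under stated assumptions, not the package's actual code: the `id` column, the dictionary layout of a paper, the function signatures, and the rule of scoring a paper by its maximum cosine similarity against the anchors are all hypothetical. Toy 2-D vectors stand in for sentence-transformers embeddings so the example runs without extra dependencies.

```python
import math
import sqlite3
from datetime import date


def filter_unseen(conn, papers):
    """Return papers not yet in seen_papers and record them as seen."""
    # Assumed schema: only title and first_seen appear in the README.
    conn.execute(
        "CREATE TABLE IF NOT EXISTS seen_papers "
        "(id TEXT PRIMARY KEY, title TEXT, first_seen TEXT)"
    )
    unseen = []
    for paper in papers:
        known = conn.execute(
            "SELECT 1 FROM seen_papers WHERE id = ?", (paper["id"],)
        ).fetchone()
        if known is None:
            conn.execute(
                "INSERT INTO seen_papers (id, title, first_seen) VALUES (?, ?, ?)",
                (paper["id"], paper["title"], date.today().isoformat()),
            )
            unseen.append(paper)
    conn.commit()
    return unseen


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def semantic_filter(papers, anchors, threshold=0.35):
    """Keep papers whose best similarity to any anchor meets the threshold."""
    return [
        p for p in papers
        if max(cosine(p["embedding"], a) for a in anchors) >= threshold
    ]


# Hypothetical papers with toy embeddings; real ones would come from the
# embedding model named in AGENT_EMBEDDING_MODEL.
conn = sqlite3.connect(":memory:")
papers = [
    {"id": "paper-a", "title": "Saliency maps revisited", "embedding": [1.0, 0.1]},
    {"id": "paper-b", "title": "Faster matrix multiply", "embedding": [-0.2, 1.0]},
]
anchors = [[1.0, 0.0]]  # stands in for an embedded XAI topic sentence

first_run = filter_unseen(conn, papers)    # both papers are new
second_run = filter_unseen(conn, papers)   # nothing new on a repeat run
relevant = semantic_filter(first_run, anchors)
```

Running the unseen filter before the semantic filter means a paper is embedded at most once, which keeps repeat runs cheap.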

Quickstart

# Requires Ollama running with the model pulled
ollama pull llama3.2:3b

pip install arxiv-xai-digest
arxiv-digest              # full run
arxiv-digest --dry-run    # skip LLM, test the rest of the pipeline

Or from source:

git clone https://github.com/ilonae/research-agent
cd research-agent
pip install -e ".[dev]"

Configuration

Copy .env.example to .env. All variables are optional — defaults shown:

| Variable | Default |
| --- | --- |
| AGENT_OLLAMA_MODEL | llama3.2:3b |
| AGENT_OLLAMA_URL | http://localhost:11434 |
| AGENT_MAX_PER_FEED | 20 |
| AGENT_ARXIV_CATEGORIES | ["cs.LG","cs.AI","cs.CV"] |
| AGENT_SIMILARITY_THRESHOLD | 0.35 |
| AGENT_EMBEDDING_MODEL | all-MiniLM-L6-v2 |
| AGENT_ANCHORS | 8 XAI topic sentences |
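To illustrate how optional `AGENT_*` variables with defaults can be resolved, here is a small standard-library sketch. It is not the package's own loader, which may use a different mechanism; the `Settings` class and `load_settings` function are hypothetical names. Parsing list values as JSON is an assumption based on the `["cs.LG","cs.AI","cs.CV"]` default, and `AGENT_ANCHORS` is left out because its default sentences are not listed here.

```python
import json
import os
from dataclasses import dataclass, field


@dataclass
class Settings:
    # Defaults copied from the configuration table.
    ollama_model: str = "llama3.2:3b"
    ollama_url: str = "http://localhost:11434"
    max_per_feed: int = 20
    arxiv_categories: list = field(
        default_factory=lambda: ["cs.LG", "cs.AI", "cs.CV"]
    )
    similarity_threshold: float = 0.35
    embedding_model: str = "all-MiniLM-L6-v2"


def load_settings(env=None):
    """Build Settings from AGENT_* variables, keeping defaults for unset ones."""
    env = os.environ if env is None else env
    settings = Settings()
    for name, default in list(vars(settings).items()):
        raw = env.get("AGENT_" + name.upper())
        if raw is None:
            continue  # variable not set: keep the default
        if isinstance(default, list):
            value = json.loads(raw)      # assumed: lists are JSON arrays
        else:
            value = type(default)(raw)   # str, int or float
        setattr(settings, name, value)
    return settings


# Passing a dict instead of os.environ makes the behaviour easy to check.
custom = load_settings(
    {"AGENT_MAX_PER_FEED": "5", "AGENT_ARXIV_CATEGORIES": '["cs.CL"]'}
)
```

Calling `load_settings()` with no argument reads the real process environment, for example after the values in `.env` have been exported.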

Docker

docker compose up   # starts Ollama sidecar + agent

Scheduled runs

.github/workflows/daily-digest.yml runs at 07:00 UTC and commits the digest back to the repo. To test it, trigger the workflow manually from Actions > Run workflow.

Querying the memory store

sqlite3 outputs/seen_papers.db \
  "SELECT title, first_seen FROM seen_papers
   WHERE first_seen >= date('now', '-7 days')
   ORDER BY first_seen DESC;"
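The same query can be run from Python with the standard `sqlite3` module. The sketch below uses a hypothetical helper, `recent_papers`, and demonstrates it on an in-memory database with an assumed two-column table; the real `seen_papers` table may have more columns. It also assumes `first_seen` is stored as ISO-formatted text, so that string comparison matches date order.

```python
import sqlite3


def recent_papers(conn, days=7):
    """Return (title, first_seen) rows first seen within the last `days` days."""
    return conn.execute(
        "SELECT title, first_seen FROM seen_papers "
        "WHERE first_seen >= date('now', ?) "
        "ORDER BY first_seen DESC",
        (f"-{days} days",),
    ).fetchall()


# Demo data only. To query the real store, use
# sqlite3.connect("outputs/seen_papers.db") instead.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE seen_papers (title TEXT, first_seen TEXT)")
conn.executemany(
    "INSERT INTO seen_papers VALUES (?, date('now', ?))",
    [("Recent paper", "-1 days"), ("Old paper", "-30 days")],
)

rows = recent_papers(conn)            # last 7 days
all_rows = recent_papers(conn, 60)    # last 60 days, newest first
```

Binding the interval as a parameter keeps the SQL text fixed while letting the caller change the look-back window.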

Development

pip install -e ".[dev]"
pytest && ruff check . && mypy agent/ tools/ config/

License

MIT

Download files

Source Distribution

arxiv_xai_digest-0.1.0.tar.gz (10.1 kB view details)

Uploaded Source

Built Distribution

arxiv_xai_digest-0.1.0-py3-none-any.whl (14.2 kB view details)

Uploaded Python 3

File details

Details for the file arxiv_xai_digest-0.1.0.tar.gz.

File metadata

  • Download URL: arxiv_xai_digest-0.1.0.tar.gz
  • Upload date:
  • Size: 10.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.2

File hashes

Hashes for arxiv_xai_digest-0.1.0.tar.gz
Algorithm Hash digest
SHA256 e5ce74a2b8072d173343d9298c9b1f3206e1e7ea0a4b7579367be0afb0c44ca0
MD5 c7af421af58ae8a4e03988a027931943
BLAKE2b-256 dafd5997c52a4e342209d02395febffe76b4ec7316601a65af825c72bba67705

File details

Details for the file arxiv_xai_digest-0.1.0-py3-none-any.whl.

File hashes

Hashes for arxiv_xai_digest-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 3945d168e8a23a751def221254ef0398f86b4c7fe01bc9d0795b968c95ecefe9
MD5 1f1c62814c7101d1c3a5f06f63273353
BLAKE2b-256 e8871ff066d2ce8e8602e40831bb1718183c76f509b34ba771dc22656d0daae6
