A configurable daily ArXiv digest agent — filter and summarise papers in any research area
Reason this release was yanked:
uploaded under wrong name, use correct-name instead
arxiv-paper-digest
A daily agent that monitors ArXiv for XAI and interpretability papers, filters by semantic similarity, summarises with a local LLM, and saves a Markdown digest. No external API keys required.
How it works
1. ArXiv RSS
2. filter_unseen (SQLite)
3. semantic filter (sentence-transformers)
4. summarise (Ollama)
5. outputs/digests/YYYY-MM-DD.md
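Steps 3 and 5 above can be sketched roughly as follows. This is a minimal illustration, not the package's actual code: `cosine`, `keep_relevant`, and `digest_path` are hypothetical names, and the toy 3-dimensional vectors stand in for real sentence-transformers embeddings. It also assumes a paper is kept when its best cosine similarity to any anchor sentence meets the threshold, which the README does not spell out.

```python
import math
from datetime import date
from pathlib import Path


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def keep_relevant(papers, anchors, threshold=0.35):
    """Keep titles whose best similarity to any anchor meets the threshold.

    `papers` maps a title to its embedding (hypothetical structure).
    """
    return [
        title
        for title, vec in papers.items()
        if max(cosine(vec, anchor) for anchor in anchors) >= threshold
    ]


def digest_path(day, root="outputs/digests"):
    """Build the digest filename in the YYYY-MM-DD.md pattern."""
    return Path(root) / f"{day.isoformat()}.md"


# Toy embeddings standing in for real model output (assumption).
anchors = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]
papers = {
    "Saliency maps revisited": [0.9, 0.1, 0.0],
    "Faster matrix multiplication": [0.0, 0.0, 1.0],
}
relevant = keep_relevant(papers, anchors)
```

Here only the first toy paper passes the filter, because the second is orthogonal to both anchors. Raising the threshold makes the digest stricter; lowering it admits more loosely related papers.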
Quickstart
# Requires Ollama running with the model pulled
ollama pull llama3.2:3b
pip install arxiv-xai-digest
arxiv-digest # full run
arxiv-digest --dry-run # skip LLM, test the rest of the pipeline
Or from source:
git clone https://github.com/ilonae/research-agent
cd research-agent
pip install -e ".[dev]"
Configuration
Copy .env.example to .env. All variables are optional — defaults shown:
| Variable | Default |
|---|---|
| `AGENT_OLLAMA_MODEL` | `llama3.2:3b` |
| `AGENT_OLLAMA_URL` | `http://localhost:11434` |
| `AGENT_MAX_PER_FEED` | `20` |
| `AGENT_ARXIV_CATEGORIES` | `["cs.LG","cs.AI","cs.CV"]` |
| `AGENT_SIMILARITY_THRESHOLD` | `0.35` |
| `AGENT_EMBEDDING_MODEL` | `all-MiniLM-L6-v2` |
| `AGENT_ANCHORS` | 8 XAI topic sentences |
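To make the override behaviour concrete, here is a hedged sketch of how these variables could be read with the defaults listed above. The `Settings` class and `load_settings` function are hypothetical, and the package may well use a different mechanism internally. `AGENT_ANCHORS` is left out because its default sentences are not listed here, and the list-valued variable is assumed to be JSON, as its default suggests.

```python
import json
import os
from dataclasses import dataclass, field


@dataclass
class Settings:
    """Defaults mirror the configuration table (AGENT_ANCHORS omitted)."""
    ollama_model: str = "llama3.2:3b"
    ollama_url: str = "http://localhost:11434"
    max_per_feed: int = 20
    arxiv_categories: list = field(
        default_factory=lambda: ["cs.LG", "cs.AI", "cs.CV"]
    )
    similarity_threshold: float = 0.35
    embedding_model: str = "all-MiniLM-L6-v2"


def load_settings(env=None):
    """Read AGENT_* variables from a mapping, falling back to defaults."""
    env = os.environ if env is None else env
    d = Settings()
    categories = (
        json.loads(env["AGENT_ARXIV_CATEGORIES"])  # assumed JSON list
        if "AGENT_ARXIV_CATEGORIES" in env
        else d.arxiv_categories
    )
    return Settings(
        ollama_model=env.get("AGENT_OLLAMA_MODEL", d.ollama_model),
        ollama_url=env.get("AGENT_OLLAMA_URL", d.ollama_url),
        max_per_feed=int(env.get("AGENT_MAX_PER_FEED", d.max_per_feed)),
        arxiv_categories=categories,
        similarity_threshold=float(
            env.get("AGENT_SIMILARITY_THRESHOLD", d.similarity_threshold)
        ),
        embedding_model=env.get("AGENT_EMBEDDING_MODEL", d.embedding_model),
    )


# Override two values; everything else keeps its default.
settings = load_settings(
    {"AGENT_MAX_PER_FEED": "5", "AGENT_ARXIV_CATEGORIES": '["cs.CL"]'}
)
```

Passing a plain dict instead of `os.environ` keeps the example testable without touching the real environment.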
Docker
docker compose up # starts Ollama sidecar + agent
Scheduled runs
The workflow in .github/workflows/daily-digest.yml runs daily at 07:00 UTC and commits the digest back to the repo. To test it, trigger it manually from Actions → Run workflow.
Querying the memory store
sqlite3 outputs/seen_papers.db \
"SELECT title, first_seen FROM seen_papers
WHERE first_seen >= date('now', '-7 days')
ORDER BY first_seen DESC;"
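The same query can be run from Python with the standard-library `sqlite3` module. The sketch below is illustrative only: `recent_papers` is a hypothetical helper, and the demo builds an in-memory table with just the two columns the query above uses, assuming `first_seen` is stored as an ISO date string. The real schema may contain more columns.

```python
import sqlite3


def recent_papers(conn, days=7):
    """Return (title, first_seen) rows first seen within the last `days` days."""
    query = """
        SELECT title, first_seen FROM seen_papers
        WHERE first_seen >= date('now', ?)
        ORDER BY first_seen DESC
    """
    # The date modifier (e.g. '-7 days') is passed as a bound parameter.
    return conn.execute(query, (f"-{days} days",)).fetchall()


# Demo against an in-memory database with an assumed two-column schema.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE seen_papers (title TEXT, first_seen TEXT)")
conn.executemany(
    "INSERT INTO seen_papers VALUES (?, date('now', ?))",
    [("Recent paper", "-1 days"), ("Old paper", "-30 days")],
)
rows = recent_papers(conn)
```

To query the real store, replace `":memory:"` with `"outputs/seen_papers.db"` and skip the `CREATE TABLE` and `INSERT` statements.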
Development
pip install -e ".[dev]"
pytest && ruff check . && mypy agent/ tools/ config/
License
MIT
File details
Details for the file arxiv_xai_digest-0.1.0.tar.gz.
File metadata
- Download URL: arxiv_xai_digest-0.1.0.tar.gz
- Upload date:
- Size: 10.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.13.2
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | e5ce74a2b8072d173343d9298c9b1f3206e1e7ea0a4b7579367be0afb0c44ca0 |
| MD5 | c7af421af58ae8a4e03988a027931943 |
| BLAKE2b-256 | dafd5997c52a4e342209d02395febffe76b4ec7316601a65af825c72bba67705 |
File details
Details for the file arxiv_xai_digest-0.1.0-py3-none-any.whl.
File metadata
- Download URL: arxiv_xai_digest-0.1.0-py3-none-any.whl
- Upload date:
- Size: 14.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.13.2
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 3945d168e8a23a751def221254ef0398f86b4c7fe01bc9d0795b968c95ecefe9 |
| MD5 | 1f1c62814c7101d1c3a5f06f63273353 |
| BLAKE2b-256 | e8871ff066d2ce8e8602e40831bb1718183c76f509b34ba771dc22656d0daae6 |