AI-news aggregation: the `brf` tool CLI (agent-side) plus the `cron` package (host-side cron + Slack receiver).

Blog Research Feed

A daily AI engineering signal curator. Every morning a Claude agent hosted on Anthropic Managed Agents fetches yesterday's frontier engineering content (RSS, X, YouTube, podcasts, Firecrawl-indexed blogs), drills into the most promising items, picks a Top 10, and posts a report to Slack.

This is not a news digest. The target reader is an engineer working on VLMs, video agents, multimodal models, or coding agents. The agent surfaces the ten items from yesterday that are most worth reading and writes its own takeaway for each.

How it works

GitHub Action (cron, 09:00 UTC)
  └── python -m cron.daily
        ├── Resolve agent + environment by name via Anthropic API
        ├── sessions.create(...) mounts /workspace/.env (pre-uploaded Files API object)
        └── Stream session events until idle

Inside the session container:
  $ brf fetch-all --since YESTERDAY        → /tmp/feed/index.json
  $ jq ... /tmp/feed/index.json            triage candidates
  $ brf fetch-full --id <id>               drill into selected items
  $ brf report slack --message-file ...    post to Slack

The runner only calls the Anthropic API. All third-party API keys (Firecrawl, X, OpenAI, Slack) live in a `.env` file pre-uploaded once to the Files API; the runner mounts it by file id and never sees the contents.
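The `--since YESTERDAY` placeholder in the flow above has to become a concrete date at run time. The following is a minimal sketch of one way to compute it, assuming "yesterday" means the previous calendar day in UTC (matching the 09:00 UTC schedule); the helper name `yesterday_utc` is hypothetical and not part of `brf`.

```python
from datetime import date, datetime, timedelta, timezone
from typing import Optional


def yesterday_utc(now: Optional[datetime] = None) -> date:
    """Return the previous calendar day, evaluated in UTC."""
    now = now or datetime.now(timezone.utc)
    # Normalise to UTC first so a runner in another timezone gets the same day.
    return (now.astimezone(timezone.utc) - timedelta(days=1)).date()


# Example: a run at 09:00 UTC on 2026-05-20.
run_time = datetime(2026, 5, 20, 9, 0, tzinfo=timezone.utc)
since = yesterday_utc(run_time).isoformat()
print(f"brf fetch-all --since {since}")
```

Normalising to UTC before subtracting a day keeps the window stable regardless of where the command is launched.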

Setup

1. API keys

You need:

Key	Purpose
`ANTHROPIC_API_KEY`	Runner calls sessions/files API
`FIRECRAWL_API_KEY`	Scrape + index endpoints (https://firecrawl.dev)
`X_BEARER_TOKEN`	X API v2
`OPENAI_API_KEY`	Whisper transcription
`SLACK_WEBHOOK_URL`	Slack incoming webhook for the target channel

Put them in a local `.env` (copy from `.env.example`).
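Before uploading, it can help to confirm that every key from the table is present. This is a small sketch, not part of the package: `parse_env` and `missing_keys` are hypothetical helpers, and the parser only understands plain `KEY=VALUE` lines (no `export` prefix, no multi-line values).

```python
REQUIRED_KEYS = (
    "ANTHROPIC_API_KEY",
    "FIRECRAWL_API_KEY",
    "X_BEARER_TOKEN",
    "OPENAI_API_KEY",
    "SLACK_WEBHOOK_URL",
)


def parse_env(text: str) -> dict[str, str]:
    """Parse simple KEY=VALUE lines, skipping blanks and '#' comments."""
    values: dict[str, str] = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop optional surrounding quotes around the value.
        values[key.strip()] = value.strip().strip("'\"")
    return values


def missing_keys(text: str) -> list[str]:
    """Return required keys that are absent or empty, in table order."""
    env = parse_env(text)
    return [key for key in REQUIRED_KEYS if not env.get(key)]


# Placeholder values only; a real .env would be read from disk.
sample = """
# local secrets
ANTHROPIC_API_KEY=placeholder
FIRECRAWL_API_KEY="placeholder"
OPENAI_API_KEY=
"""
print(missing_keys(sample))
```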

2. Provision the Managed Agent + Environment

pip install -e .
python scripts/create_agent.py

This creates the environment (a cloud container with `blog-research-feed` installed from PyPI, plus `jq` and `ffmpeg` from apt) and the coordinator agent with its reader and reviewer subagents. To push config changes later, re-run the script with `--update`.

The script identifies resources by the `name:` field in `agent/*.yaml`. Concrete IDs are looked up at runtime, so you never have to track them as secrets.
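The name-based lookup can be pictured as a filter over a resource listing. The sketch below deliberately avoids any real Anthropic API call: it assumes the listing has already been fetched into plain dicts with `id` and `name` fields, and both `resolve_by_name` and the example ids are hypothetical.

```python
def resolve_by_name(resources: list[dict], name: str) -> str:
    """Return the id of the single resource whose name matches exactly."""
    matches = [r for r in resources if r.get("name") == name]
    if not matches:
        raise LookupError(f"no resource named {name!r}")
    if len(matches) > 1:
        # Ambiguity is an error: silently picking one could run a stale config.
        raise LookupError(f"{len(matches)} resources named {name!r}")
    return matches[0]["id"]


# Hypothetical listing; real ids would come from the API at runtime.
environments = [
    {"id": "env_example_old", "name": "blog-research-feed-env-v7"},
    {"id": "env_example_new", "name": "blog-research-feed-env-v8"},
]
print(resolve_by_name(environments, "blog-research-feed-env-v8"))
```

Failing on duplicates rather than taking the first match is a design choice in this sketch; the actual script may behave differently.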

3. Upload the container `.env`

python -m scripts.upload_env --from-file .env

This prints a file id like `file_01ABC...`. Save it as a GitHub repository variable named `ENV_FILE_ID` (Settings → Secrets and variables → Actions → Variables). Re-run the upload whenever you rotate a key.

4. GitHub Secrets

The only secret the runner needs:

`ANTHROPIC_API_KEY`: used to call the sessions and files APIs

Third-party keys do not go on the runner; they live inside the uploaded `.env`.

5. Test

Trigger manually from GitHub Actions → "daily-ai-news" → Run workflow, or dry-run locally:

python -m cron.daily --dry-run

Operations

Releasing a new `brf` version

# bump version in pyproject.toml, then:
rm -rf dist/ build/ *.egg-info
python -m build
TWINE_USERNAME=__token__ TWINE_PASSWORD="$PYPI_API_TOKEN" twine upload dist/blog_research_feed-*

# bump agent/environment.yaml:
#   name: blog-research-feed-env-v8 → v9
#   "blog-research-feed==0.2.4"     → "==0.2.5"

python scripts/create_agent.py --update

The cron run picks up the new environment by name on its next invocation.
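The two manual edits to `agent/environment.yaml` are easy to get half-done. As an illustration only, the sketch below applies both bumps to the file's text and refuses to proceed unless each pattern matches exactly once. The YAML layout in `sample` is an assumption (the real file is not shown here), and `bump_environment` is a hypothetical helper.

```python
import re


def bump_environment(text: str, new_version: str) -> str:
    """Increment the env name suffix and re-pin the package version."""
    def next_suffix(match: re.Match) -> str:
        return f"{match.group(1)}{int(match.group(2)) + 1}"

    text, n_name = re.subn(r"(blog-research-feed-env-v)(\d+)", next_suffix, text)
    text, n_pin = re.subn(
        r"(blog-research-feed==)[\w.]+", rf"\g<1>{new_version}", text
    )
    if n_name != 1 or n_pin != 1:
        raise ValueError(f"expected one name and one pin, got {n_name} and {n_pin}")
    return text


# Assumed layout, for illustration only.
sample = '''name: blog-research-feed-env-v8
packages:
  - "blog-research-feed==0.2.4"
'''
print(bump_environment(sample, "0.2.5"))
```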

Rotating secrets

Update the value in your local `.env`, re-run `python -m scripts.upload_env --from-file .env`, and update the `ENV_FILE_ID` repo variable with the new file id.

Local development

pip install -e ".[dev]"
pytest -q
brf fetch-all --since 2026-05-20      # end-to-end smoke (~$0.2 Firecrawl spend)

CLI reference

Main pipeline (what the agent uses):

brf fetch-all  --since <date> --output-dir /tmp/feed   # parallel fetch, writes index.json
brf fetch-full --id <id>      --output-dir /tmp/feed   # drill into one item
brf report slack --message-file <path>                 # post to Slack webhook
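For orientation, posting to a Slack incoming webhook amounts to a JSON `POST` with a `text` field. The sketch below shows that request using only the standard library; it is not the code behind `brf report slack`, and the function names and the example webhook URL are hypothetical.

```python
import json
import urllib.request


def build_slack_request(webhook_url: str, message: str) -> urllib.request.Request:
    """Build (but do not send) the webhook POST for a plain-text message."""
    payload = json.dumps({"text": message}).encode("utf-8")
    return urllib.request.Request(
        webhook_url,
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def post_to_slack(webhook_url: str, message: str, timeout: float = 10.0) -> int:
    """Send the message and return the HTTP status code."""
    request = build_slack_request(webhook_url, message)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.status


# Building the request is side-effect free, so it can be inspected offline.
req = build_slack_request("https://hooks.slack.com/services/EXAMPLE", "Top 10 for 2026-05-19")
print(req.get_method(), req.data.decode("utf-8"))
```

Keeping request construction separate from sending makes the payload testable without touching the network.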

Auxiliary (rarely needed):

brf fetch x-user --handle <handle> --since <date>
brf fetch youtube-transcript --url <url>
brf fetch podcast-transcript --url <feed-or-episode-url>
brf firecrawl scrape --url <url>
brf firecrawl search --query <q> --limit 10

FeedItem shape (see brf/feed_item.py):

{
  "id": "ab12cd34ef567890",
  "source_type": "rss",
  "source": "Simon Willison's Weblog",
  "title": "...",
  "url": "https://...",
  "published": "2026-05-19T14:23:00+00:00",
  "summary": "≤ 500 chars",
  "has_full": true,
  "needs_firecrawl": false,
  "extra": {}
}
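The shape above maps naturally onto a dataclass, which also makes the triage step easy to express in Python instead of `jq`. This is a sketch under two assumptions: that `index.json` is a JSON array of such objects, and that items with `has_full` set to false are the ones worth drilling into. The actual definitions live in `brf/feed_item.py` and may differ.

```python
import json
from dataclasses import dataclass, field
from datetime import datetime
from typing import Any


@dataclass
class FeedItem:
    id: str
    source_type: str
    source: str
    title: str
    url: str
    published: str  # ISO 8601 timestamp with offset
    summary: str
    has_full: bool
    needs_firecrawl: bool
    extra: dict[str, Any] = field(default_factory=dict)


def load_items(index_json: str) -> list[FeedItem]:
    """Parse a JSON array of feed items (assumed index.json layout)."""
    return [FeedItem(**raw) for raw in json.loads(index_json)]


def newest_first(items: list[FeedItem]) -> list[FeedItem]:
    return sorted(items, key=lambda it: datetime.fromisoformat(it.published), reverse=True)


def needs_drill_down(items: list[FeedItem]) -> list[str]:
    """Ids of items whose full text has not been fetched yet."""
    return [it.id for it in items if not it.has_full]


SAMPLE_INDEX = json.dumps([
    {"id": "ab12cd34ef567890", "source_type": "rss", "source": "Simon Willison's Weblog",
     "title": "...", "url": "https://example.com/a", "published": "2026-05-19T14:23:00+00:00",
     "summary": "...", "has_full": True, "needs_firecrawl": False, "extra": {}},
    {"id": "0011223344556677", "source_type": "youtube", "source": "Example Channel",
     "title": "...", "url": "https://example.com/b", "published": "2026-05-19T20:00:00+00:00",
     "summary": "...", "has_full": False, "needs_firecrawl": False, "extra": {}},
])
items = load_items(SAMPLE_INDEX)
print(needs_drill_down(items))
```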

Repository layout

Path	Purpose
`brf/`	CLI bundle, installed into the session container
`brf/main.py`	Click entry: `fetch-all` / `fetch-full` are the main flow
`brf/aggregator.py`, `brf/fetchers/`	Parallel fetch + dedupe + scheduling
`brf/sources.yaml`	Single source of truth for all feeds / handles / channels
`brf/firecrawl_client.py`, `slack.py`, `rss.py`, `x_client.py`, `youtube.py`, `podcast.py`	Per-service clients
`agent/agent.yaml`, `reader.yaml`, `reviewer.yaml`	Managed Agent definitions (lookup by name)
`agent/*_prompt.md`	System prompts for coordinator / reader / reviewer
`cron/daily.py`	Host-side runner: resolves agent/env by name, opens a session, streams events
`scripts/create_agent.py`	One-shot provisioning / updates for agent + env
`scripts/upload_env.py`	One-shot upload of `.env` to the Files API
`.github/workflows/daily.yml`	09:00 UTC cron
`docs/managed_agents/`, `docs/agent_sdk/`	Local copies of Anthropic docs

For deeper design notes see ARCHITECTURE.md and BRF_FETCHER_DESIGN.md.
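To make the "parallel fetch + dedupe" role of `brf/aggregator.py` concrete, here is a minimal sketch of the pattern using a thread pool. It is not the project's implementation: the fetcher callables, the `item_id` derivation (first 16 hex characters of a SHA-256 of the URL), and the error-collection policy are all assumptions made for illustration.

```python
import hashlib
from concurrent.futures import ThreadPoolExecutor, as_completed
from typing import Callable


def item_id(url: str) -> str:
    """Stable 16-hex-char id derived from the URL (assumed scheme)."""
    return hashlib.sha256(url.encode("utf-8")).hexdigest()[:16]


def fetch_all(
    fetchers: dict[str, Callable[[], list[dict]]], max_workers: int = 8
) -> tuple[list[dict], dict[str, str]]:
    """Run every fetcher concurrently, dedupe by URL, and collect failures."""
    seen: dict[str, dict] = {}
    errors: dict[str, str] = {}
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        futures = {pool.submit(fn): name for name, fn in fetchers.items()}
        for future in as_completed(futures):
            name = futures[future]
            try:
                items = future.result()
            except Exception as exc:  # one bad source must not sink the run
                errors[name] = str(exc)
                continue
            for item in items:
                # The first copy of a URL wins; later duplicates are dropped.
                seen.setdefault(item_id(item["url"]), item)
    return list(seen.values()), errors


def flaky() -> list[dict]:
    raise TimeoutError("rate limited")


demo_fetchers = {
    "rss": lambda: [{"url": "https://example.com/a"}, {"url": "https://example.com/b"}],
    "x": lambda: [{"url": "https://example.com/a"}],
    "youtube": flaky,
}
demo_items, demo_errors = fetch_all(demo_fetchers)
print(len(demo_items), sorted(demo_errors))
```

Because results are consumed in completion order, the order of the returned items is not deterministic; callers that need a stable order should sort afterwards.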

Known limits

Whisper has a 25 MB per-file cap; long podcasts are rejected (status too_large). The system prompt caps daily transcriptions to keep cost predictable.
Some Chinese-language RSS feeds are flaky; firecrawl_index backfills the gap (see SOURCES_HEALTH.md).
The SSE event stream does not auto-reconnect on drop. The 45-minute job timeout normally avoids the issue.
YouTube channel RSS occasionally rate-limits residential IPs. The fetcher swallows the error and continues.
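The Whisper size limit can be checked before any upload is attempted. The sketch below shows such a pre-check; only the `too_large` status comes from the text above, while the `ok` status, the function name, and the reading of "25 MB" as 25 × 1024 × 1024 bytes are assumptions.

```python
WHISPER_MAX_BYTES = 25 * 1024 * 1024  # assumed interpretation of the 25 MB cap


def transcription_status(size_bytes: int) -> str:
    """Return 'too_large' for files over the cap, otherwise 'ok'."""
    if size_bytes > WHISPER_MAX_BYTES:
        return "too_large"
    return "ok"


# A 60 MB podcast episode is rejected up front; a 10 MB clip is accepted.
print(transcription_status(60 * 1024 * 1024))
print(transcription_status(10 * 1024 * 1024))
```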

Project details

Release history

Version	Released
0.2.6	May 24, 2026 (this version)
0.2.5	May 23, 2026
0.2.4	May 21, 2026
0.1.4	May 20, 2026
0.1.3	May 20, 2026
0.1.2	May 20, 2026
0.1.1	May 20, 2026
0.1.0	May 19, 2026

Download files

Source Distribution

blog_research_feed-0.2.5.tar.gz (63.4 kB view details)

Uploaded May 23, 2026 Source

Built Distribution

blog_research_feed-0.2.5-py3-none-any.whl (67.0 kB view details)

Uploaded May 23, 2026 Python 3

File details

Details for the file blog_research_feed-0.2.5.tar.gz.

File metadata

Download URL: blog_research_feed-0.2.5.tar.gz
Upload date: May 23, 2026
Size: 63.4 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.15

File hashes

Hashes for blog_research_feed-0.2.5.tar.gz
Algorithm	Hash digest
SHA256	`51a7b9f56b5d34c172727efb211e29bd6ef3c6ac7b38834e1ff80495203660a2`
MD5	`acb7af0a0a747372c566d2028122e644`
BLAKE2b-256	`677f4e63b9af93adfa418a26b25f967e585a2c7622bc65b16031b0f641301e77`

File details

Details for the file blog_research_feed-0.2.5-py3-none-any.whl.

File metadata

Download URL: blog_research_feed-0.2.5-py3-none-any.whl
Upload date: May 23, 2026
Size: 67.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.15

File hashes

Hashes for blog_research_feed-0.2.5-py3-none-any.whl
Algorithm	Hash digest
SHA256	`b30afac5b818a26986322d7bf7360563658ff2f3b631e58e183f21ff48259ab4`
MD5	`4e8e092b23a76f7cc3056a34d3593d19`
BLAKE2b-256	`cb77df528cf49ea24d81823f125f2bf865005a7915846cc250e68d0ebc34b724`
