Evidence-first social-media research CLI + Claude Code skill

social-research-probe

srp fetches YouTube content, scores it by trust/trend/opportunity, auto-corroborates claims, runs 20+ statistical models, and renders an HTML report — all from a single CLI command.

What it does
Why this exists
Quickstart
Installation
Usage
Configuration
Roadmap — what's planned
Documentation
Architecture
Contributing
Security
License
Changelog

What it does

srp research runs a five-stage evidence pipeline:

Fetch — queries YouTube for recent content on a topic (configurable count and recency window).
Score — ranks items by a composite trust / trend / opportunity score derived from channel credibility, view velocity, and cross-channel repetition.
Enrich — fetches transcripts (yt-dlp captions → Whisper fallback) and generates 100-word LLM summaries for the top results.
Analyse — runs 20+ statistical models (regression, Bayesian linear, bootstrap, k-means, PCA, Kaplan–Meier, …) and renders charts as PNGs.
Synthesise — auto-corroborates the top results via Exa/Brave/Tavily, attaches structured LLM synthesis sections, and emits a Markdown + HTML report.
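
The Score stage above folds three signals into a single ranking. The README does not state srp's actual formula, so the following is only a minimal sketch of one way such a composite could work. It assumes each signal is already normalised to [0, 1], and the `Item` fields and `WEIGHTS` values are hypothetical, chosen purely for illustration:

```python
from dataclasses import dataclass


@dataclass
class Item:
    """Hypothetical scored item; field names are illustrative, not srp's schema."""
    title: str
    trust: float        # assumed normalised to [0, 1]
    trend: float        # assumed normalised to [0, 1]
    opportunity: float  # assumed normalised to [0, 1]


# Assumed weights (not srp's real values); they sum to 1 so the
# composite also stays within [0, 1].
WEIGHTS = {"trust": 0.5, "trend": 0.3, "opportunity": 0.2}


def composite_score(item: Item) -> float:
    """Weighted sum of the three signals."""
    return (
        WEIGHTS["trust"] * item.trust
        + WEIGHTS["trend"] * item.trend
        + WEIGHTS["opportunity"] * item.opportunity
    )


def rank(items: list[Item]) -> list[Item]:
    """Return items sorted from highest to lowest composite score."""
    return sorted(items, key=composite_score, reverse=True)
```

With these assumed weights, a trusted but slow-moving item can outrank a fast-moving item from a low-trust channel, which is the kind of trade-off an evidence-first ranking would be expected to make.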

It also ships as a Claude Code skill (/srp) so you can invoke research, manage topics/purposes, and receive formatted reports directly inside a Claude Code session.

Why this exists

srp is an evidence-first research CLI. It does most of the work on your own computer and only calls an LLM when no cheaper tool can do the job.

Objective — conduct real research with citations, the lowest possible cost per query, and no lock-in to any single model.
Audience — independent researchers, journalists, content strategists, analysts, and indie builders who need repeatable research without burning LLM credits.
Philosophy — scoring, stats, charts, transcripts (via captions or local Whisper), and caching all run on your CPU. The LLM is used only for 100-word per-item summaries and the final synthesis sections, behind a swappable runner interface. If your preferred model disappears, change one config key and keep going.
Status — work in progress (pre-1.0). The pipeline, scoring model, and CLI shape are stable. The research-packet schema and additional platforms/backends are still evolving. Pin a version in production.

See docs/objective.md for the full rationale and docs/cost-optimization.md for every place the design avoids an LLM call.
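
The "swappable runner interface" mentioned under Philosophy can be pictured roughly as follows. This is a sketch under stated assumptions, not srp's code: the `Runner` protocol, the `EchoRunner` stand-in, and the `summarise` helper are all hypothetical names. Only the runner names and the `llm.runner` key come from this README.

```python
from typing import Protocol


class Runner(Protocol):
    """Hypothetical runner contract: take a prompt, return text."""

    def run(self, prompt: str) -> str: ...


class EchoRunner:
    """Stand-in for illustration; a real runner would shell out to an LLM CLI."""

    def __init__(self, name: str) -> None:
        self.name = name

    def run(self, prompt: str) -> str:
        return f"[{self.name}] {prompt}"


# Registry keyed by the value of the `llm.runner` config key.
RUNNERS: dict[str, Runner] = {
    name: EchoRunner(name) for name in ("claude", "gemini", "codex", "local")
}


def summarise(text: str, config: dict[str, str]) -> str:
    """Look up the configured runner and ask it for a short summary."""
    runner = RUNNERS[config["llm.runner"]]
    return runner.run(f"Summarise in 100 words: {text}")
```

Because callers depend only on `run`, switching models means changing the one config value, which matches the "change one config key and keep going" claim above.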

Quickstart

pip install social-research-probe
srp config set-secret youtube_api_key
srp research "AI safety" "latest-news"

Or in natural-language mode from the CLI (requires llm.runner configured):

srp research "what are researchers saying about model collapse?"

Installation

# pip
pip install social-research-probe

# pipx (isolated environment, recommended for CLI tools)
pipx install social-research-probe

# uvx (run without installing)
uvx social-research-probe research "AI safety" "latest-news"

# From source (development)
git clone https://github.com/reshinto/social-research-probe
cd social-research-probe
pip install -e '.[dev]'

After installing, set your YouTube API key:

srp config set-secret youtube_api_key

Pick an LLM runner — Gemini CLI is the free default

srp does not bundle any LLM. It shells out to the runner's CLI — so you must install the matching CLI on your machine yourself (via npm, brew, a package manager, or an Ollama install for the local runner) and authenticate it before pointing srp at it.

srp can use Claude, Gemini, Codex, or a local model. Gemini CLI is recommended as the default for cost-conscious users: it logs in via a browser OAuth flow, needs no API key, and runs on a free tier for typical workloads. Install the Gemini CLI and log in, then:

srp config set llm.runner gemini

The llm_search corroboration backend is runner-agnostic: it dispatches via your configured llm.runner, so Claude's web_search tool and Codex's --search flag work the same way as Gemini's grounded search. See docs/llm-runners.md for the capability matrix and docs/corroboration.md for the full flow.

See docs/installation.md for full setup including optional keys and the Claude Code skill bundle.

Usage

# Research a topic with a specific purpose
srp research "AI agents" "latest-news"
srp research youtube "AI agents" "latest-news,trends"

# Natural-language query (auto-classifies topic + purpose)
srp research "who is winning the LLM benchmarks race?"

# Manage topics and purposes
srp show-topics
srp update-topics --add '"climate tech"'
srp update-purposes --add '"emerging-research"="Track peer-reviewed preprints"'

# Config
srp config set llm.runner claude
srp config set-secret youtube_api_key

Full flag reference: srp --help / srp <subcommand> --help.
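
To drive the commands above from a script, one option is a thin Python wrapper around the CLI. The sketch below builds only the argument shapes shown in this section (optional platform, topic, comma-separated purposes). The helper names `build_research_cmd` and `run_research` are hypothetical, and the format of srp's output is not documented here, so the result is returned unparsed.

```python
import subprocess
from typing import Optional


def build_research_cmd(
    topic: str, purposes: list[str], platform: Optional[str] = None
) -> list[str]:
    """Build the argv for `srp research`, mirroring the invocations above."""
    cmd = ["srp", "research"]
    if platform is not None:
        cmd.append(platform)  # e.g. "youtube"
    cmd.append(topic)
    if purposes:
        # Multiple purposes travel as one comma-separated argument.
        cmd.append(",".join(purposes))
    return cmd


def run_research(
    topic: str, purposes: list[str], platform: Optional[str] = None
) -> subprocess.CompletedProcess:
    """Run srp (must be on PATH); raises CalledProcessError on a non-zero exit."""
    return subprocess.run(
        build_research_cmd(topic, purposes, platform),
        capture_output=True,
        text=True,
        check=True,
    )
```

Passing an empty purpose list produces the natural-language form, for example `build_research_cmd("who is winning the LLM benchmarks race?", [])`.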

Claude Code skill

srp install-skill

/srp research "AI safety" "latest-news"
/srp show-topics
/srp show-purposes
/srp suggest-topics --count 3
/srp show-pending
/srp apply-pending --topics all --purposes all

The Claude Code skill shells out to the same srp CLI, so the flags and quoted edit syntax are identical. For example: /srp update-topics --add '"climate tech"'. When llm.runner = none, the skill can still use the host model for natural-language request mapping and inline narrative sections; when a concrete runner is configured, the skill defers to the CLI-produced LLM output.

See docs/usage.md for a complete guide including Claude Code workflows, corroboration modes, report generation, and the pending-proposal review flow.

Configuration

Key settings in ~/.social-research-probe/config.toml:

Setting	Default	What it controls
`platforms.youtube.max_items`	`20`	How many videos to fetch per search
`platforms.youtube.enrich_top_n`	`5`	How many top items get transcripts + summaries (biggest cost lever)
`llm.runner`	`none`	Which LLM runner to use (`gemini` is the recommended free option)
`corroboration.backend`	`host`	`host` tries every healthy backend; `none` disables corroboration

Change any setting with srp config set <key> <value>. Running srp setup again after an upgrade adds missing keys without overwriting your edits, and secrets.toml is never overwritten (it's chmod 0600).

See docs/configuration.md for the full key list, env overrides, and ready-made recipes (cheapest / deepest / fastest / offline).
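
The "adds missing keys without overwriting your edits" behaviour described above amounts to a recursive merge in which the user's values always win. A minimal sketch follows, assuming the dotted keys map to nested tables. The `DEFAULTS` values are taken from the table above, while the function name `add_missing_keys` and the nested layout are assumptions rather than srp's implementation.

```python
import copy

# Defaults from the settings table, arranged as nested tables (assumed layout).
DEFAULTS = {
    "platforms": {"youtube": {"max_items": 20, "enrich_top_n": 5}},
    "llm": {"runner": "none"},
    "corroboration": {"backend": "host"},
}


def add_missing_keys(user: dict, defaults: dict) -> dict:
    """Return a copy of `user` with keys missing relative to `defaults` filled in."""
    merged = copy.deepcopy(user)
    for key, default in defaults.items():
        if key not in merged:
            # Key is absent: adopt the default.
            merged[key] = copy.deepcopy(default)
        elif isinstance(default, dict) and isinstance(merged[key], dict):
            # Both sides are tables: recurse so nested edits survive.
            merged[key] = add_missing_keys(merged[key], default)
        # Otherwise the user's value wins and is left untouched.
    return merged
```

For example, a config that sets only `llm.runner = "gemini"` and `platforms.youtube.max_items = 50` keeps both edits and gains `enrich_top_n` and `corroboration.backend` from the defaults.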

Roadmap

Today: YouTube, 4 LLM runners (claude, gemini, codex, local), 4 corroboration backends (llm_search, tavily, brave, exa).

Planned platforms — each implemented by adding one adapter to social_research_probe/platforms/:

TikTok, X / Twitter, Facebook, Reddit — social adapters.
RSS, web search, web crawling — long-tail and general-web sources.
Document research — PDF, arXiv, Google Drive, and local files.

None of these require new LLM capabilities. See docs/objective.md for motivation and docs/adding-a-platform.md for the contract.

Documentation

Document	What it covers
docs/README.md	Documentation hub — all docs grouped by audience
docs/objective.md	Project objective, audience, philosophy, roadmap
docs/how-it-works.md	What happens during a `srp research` run, in plain English
docs/cost-optimization.md	Every place the design avoids an LLM call; recipes by budget
docs/installation.md	Install (pip/pipx/uvx), API keys, LLM runner, verification
docs/usage.md	Run research, read output, Claude Code skill workflow
docs/configuration.md	Every config key, env overrides, config/secrets lifecycle
docs/corroboration.md	Claim corroboration: backends, free tiers, configuration
docs/llm-runners.md	Runner comparison, auth, ensemble mode, runner-agnostic agentic search
docs/data-directory.md	Canonical reference for every artefact under `~/.social-research-probe/`
docs/llm-reliability-harness.md	Multi-sample judge-LLM reliability harness with variance gates
docs/runtime-dependencies.md	Every third-party library `srp` imports at runtime — where/why
docs/summary-quality-report.md	Diagnosis + Phase 9 redesign of the transcript summarizer
docs/statistics.md	All 20+ statistical models: what they measure, how to interpret
docs/charts.md	Every chart: what it shows, where it's saved
docs/commands.md	Every `srp` subcommand, flag, exit code, environment variable
docs/synthesis-authoring.md	Override sections 10–11 in the HTML report
docs/architecture.md	System design, data flow, tradeoffs, extension points
docs/module-reference.md	Every root file and folder: what it is and why it exists
docs/adding-a-platform.md	How to add a new platform adapter (TikTok, Reddit, RSS, …)
docs/design-patterns.md	Patterns with code examples and why/why-not rationale
docs/python-language-guide.md	All 13 Python idioms used in the codebase
docs/testing.md	Test tiers, `make test-evidence`, eval scripts, TDD workflow, 100% coverage gate
docs/security.md	Secret storage, trust boundaries, hardening
CONTRIBUTING.md	Development workflow, TDD rules, file-size limits
SECURITY.md	Responsible disclosure policy
CHANGELOG.md	Release history

Architecture

srp is a layered Python CLI. The CLI parses arguments and delegates to the research pipeline, which calls platform adapters, a scoring layer, a stats suite, an LLM ensemble, and corroboration backends, then feeds the result to the synthesis layer.

See docs/architecture.md for the full system design including data flow diagrams, async model, design tradeoffs, and extension points.

Contributing

See CONTRIBUTING.md.

Security

See SECURITY.md.

License

Changelog

See CHANGELOG.md.

Project details

Release history

Version	Released
0.0.10	Apr 30, 2026
0.0.9	Apr 28, 2026
0.0.8	Apr 27, 2026
0.0.7 (this version)	Apr 21, 2026
0.0.6	Apr 21, 2026
0.0.3	Apr 20, 2026
0.0.2	Apr 20, 2026
0.0.1	Apr 20, 2026

Download files

Source Distribution

social_research_probe-0.0.7.tar.gz (818.3 kB)

Uploaded Apr 21, 2026 Source

Built Distribution

social_research_probe-0.0.7-py3-none-any.whl (232.9 kB)

Uploaded Apr 21, 2026 Python 3

File details

Details for the file social_research_probe-0.0.7.tar.gz.

File metadata

Download URL: social_research_probe-0.0.7.tar.gz
Upload date: Apr 21, 2026
Size: 818.3 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for social_research_probe-0.0.7.tar.gz
Algorithm	Hash digest
SHA256	`6ff60db38f965fe3e6eed466048205af925b9647c446a150cb2983e123eb7c36`
MD5	`7889ab9773179b181f785a1c6b1da902`
BLAKE2b-256	`41dc12cadb63df8a3691b9860650bd9f81b2cdd452daa3521289c27c03aa386c`
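
A downloaded archive can be checked against the SHA256 digest listed above with Python's standard `hashlib`. The sketch below is generic. The helper names `sha256_of` and `verify` are illustrative, and the file is assumed to have been downloaded already.

```python
import hashlib
from pathlib import Path

# SHA256 digest published above for social_research_probe-0.0.7.tar.gz.
EXPECTED_SHA256 = "6ff60db38f965fe3e6eed466048205af925b9647c446a150cb2983e123eb7c36"


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Hash a file in chunks so large archives need not fit in memory."""
    digest = hashlib.sha256()
    with path.open("rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path: Path, expected: str = EXPECTED_SHA256) -> bool:
    """Return True when the file's SHA256 matches the expected hex digest."""
    return sha256_of(path) == expected.lower()
```

Usage would look like `verify(Path("social_research_probe-0.0.7.tar.gz"))`, which returns `False` if the file was corrupted or altered in transit.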

Provenance

The following attestation bundles were made for social_research_probe-0.0.7.tar.gz:

Publisher: release.yml on reshinto/social-research-probe

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: social_research_probe-0.0.7.tar.gz
- Subject digest: 6ff60db38f965fe3e6eed466048205af925b9647c446a150cb2983e123eb7c36
- Sigstore transparency entry: 1352143275
- Sigstore integration time: Apr 21, 2026
Source repository:
- Permalink: reshinto/social-research-probe@72114e7a75e1f4b68049239077d31f42776f4d3a
- Branch / Tag: refs/heads/main
- Owner: https://github.com/reshinto
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@72114e7a75e1f4b68049239077d31f42776f4d3a
- Trigger Event: push

File details

Details for the file social_research_probe-0.0.7-py3-none-any.whl.

File metadata

Download URL: social_research_probe-0.0.7-py3-none-any.whl
Upload date: Apr 21, 2026
Size: 232.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for social_research_probe-0.0.7-py3-none-any.whl
Algorithm	Hash digest
SHA256	`ee1e54b9b994713f3ce835f8f7c22ae40826aa1ca8e1f992fa3f84e4a6baddca`
MD5	`c403d2d63d8f1af332173fde5090fb29`
BLAKE2b-256	`6f62e1fb4719c1d00cdb4927b4578d93553a2e681e526bb1737b2c33c560899e`

Provenance

The following attestation bundles were made for social_research_probe-0.0.7-py3-none-any.whl:

Publisher: release.yml on reshinto/social-research-probe

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: social_research_probe-0.0.7-py3-none-any.whl
- Subject digest: ee1e54b9b994713f3ce835f8f7c22ae40826aa1ca8e1f992fa3f84e4a6baddca
- Sigstore transparency entry: 1352143470
- Sigstore integration time: Apr 21, 2026
Source repository:
- Permalink: reshinto/social-research-probe@72114e7a75e1f4b68049239077d31f42776f4d3a
- Branch / Tag: refs/heads/main
- Owner: https://github.com/reshinto
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@72114e7a75e1f4b68049239077d31f42776f4d3a
- Trigger Event: push
