interviewsignal

AI-native interview platform — capture thought process, not puzzle performance

These details have not been verified by PyPI

Project links

Project description

pip install interviewsignal && interview install
# Codex: pip install interviewsignal && interview install --platform codex

What is AI-native broad-interviewing?

Traditional hiring relies on broadcast-rejection — filtering out hundreds of talented developers based on resume keywords or rigid pass/fail LeetCode puzzles because manual screening doesn't scale.

interviewsignal enables AI-native broad-interviewing: a high-volume, high-fidelity asynchronous screening model that opens the funnel wide without draining engineering resources. Share a code. Every candidate works the problem on their own time, in their own IDE, with their own AI tools. The session captures the full thought process — every prompt, every decision, every iteration. Submissions arrive auto-graded and ranked. You spend 15 minutes triaging, not 200 hours interviewing.

When every candidate uses AI, code quality converges. Output is no longer signal. ATS platforms grade the output — did the code pass tests? We grade the thinking — how the candidate decomposes the problem, directs the AI, and iterates on failures. The transcript captures who drove the thinking. That's the signal no one else can see.

Broad-interview, not broadcast-reject. Pure signal.

The Engine in Action

Live demo — candidate session → AI grading → dashboard review

Candidate works in terminal. Dashboard auto-grades and ranks.

See full screenshots

Candidate starts a session in the terminal

Candidate starts interview session — GitHub OAuth, problem statement appears

Candidate works with full-power AI

Candidate doing EDA on Titanic dataset — AI collaboration captured

HM reviews auto-graded submissions in the dashboard

Dashboard showing candidates ranked by score with flags

Full transcript with diffs, grading, and tamper detection

Candidate detail page — transcript with GitHub-style diffs, grade panel, verify chain

AI grades against your rubric — dimension by dimension

Claude's Analysis — per-dimension rubric scores with evidence from transcript

The Unfair Advantage

🔗 Capture the Process

Every prompt, tool call, and iteration is hash-chained and tamper-evident. You see how they solved it, not just what they submitted.

🤖 AI-Native Baseline

Candidates use full-power AI — that's the point. High-leverage use (directs, verifies, iterates) scores well. Low-leverage use (paste and accept) scores poorly.

📊 Triaged in Minutes

Submissions arrive auto-graded and ranked against your rubric. Batch advance or reject. 200 candidates in 15 minutes.

🔒 Fully Private

Your relay, your API key. Nothing leaves your network. No telemetry. No analytics. No tracking. Zero external dependencies.

⚡ Zero Setup Cost

pip install, share a code, done. No platform to sign up for. No vendor contract. No procurement cycle.

🤝 Fair to Candidates

Real problems, real tools, real feedback. Every candidate gets the same shot regardless of timezone, schedule, or interview anxiety.

interviewsignal vs the status quo

	Phone screen	Take-home test	LeetCode	AI screening SaaS	interviewsignal
Scales to 200+ candidates	🚫	⚠️ Manual review	⚠️ Pass/fail only	✅	✅
Captures thought process	⚠️ Interviewer notes	🚫	🚫	⚠️ Sandboxed only	✅ Hash-chained transcript
AI-native	🚫	🚫 "No AI" policies	🚫	✅	✅ Full-power AI, graded on usage
Real problems, real tools	⚠️	✅	🚫 Contrived	⚠️ Sandboxed	✅ Candidate's own IDE
Candidate gets feedback	🚫 Usually ghosted	🚫	🚫	⚠️ Varies	✅ Score + summary
Setup cost	High (scheduling)	Medium	Medium (platform)	High (vendor + procurement)	`pip install`, done
Tamper detection	N/A	🚫 Honor system	⚠️ Proctoring	⚠️ Sandboxed browser	✅ 9 automated flags
Self-hosted / private	N/A	N/A	🚫	🚫 Multi-tenant cloud	✅ Your infra, your data
Cost	Engineer time	Engineer time	$$$$/seat	$100+/seat, 5-20 assessments/mo	Free forever

Quickstart

Hiring manager — create an interview

interview dashboard

First launch opens a setup wizard in your browser — relay URL, API key, create your first interview. Three screens and you're live. The form asks for three things: problem, rubric, time limit. You get back a code like INT-4829-XK. Share it with 5 candidates or 500.

Your rubric dimensions are your weights. If you want thought process to matter more than code quality, make more of your dimensions about process.

Candidate — take the interview

pip install interviewsignal && interview install
# Codex: pip install interviewsignal && interview install --platform codex
/interview INT-4829-XK

The session starts, GitHub OAuth opens (one account = one submission), and the problem appears. Work normally — ask the AI questions, write code, run tests. When done:

/submit

Session sealed. Pushed to relay. Auto-graded. Score + summary shown in terminal.

Hiring manager — review

interview dashboard              # → http://localhost:7832
interview dashboard INT-4829-XK  # → jump to one interview's submissions

Submissions arrive sorted by score. Flags highlight anomalies. Select candidates in bulk → advance or reject. Click into any candidate for the full transcript, dimension scores, and diff.

Batch actions: ↻ Regrade (re-run AI grading after rubric tuning) · ✓ Yes / → Maybe / ✗ No · ↓ Export CSV

How it works

graph TD
    A[Candidate Prompts AI] --> B[Shell Hooks Capture Tool Calls]
    B --> C[Append-Only SHA-256 Event Log]
    C --> D[Automatic Git Micro-Commit after each turn]
    D --> E[Log Sealed on /submit]
    E --> F[Relay Server Auto-Grades via Rubric]
    F --> G[HM Dashboard ranks candidates by thinking score]

interviewsignal installs as a skill into your AI coding assistant. It captures the full conversation — prompts, reasoning, every tool call — and builds an append-only, hash-chained session log. After each turn, it silently commits changed files to the local repo. On /submit, the log is sealed and pushed to the relay.

HM creates interview

interview dashboard
  → setup wizard (first run)
  → problem + rubric + time limit
  → code INT-4829-XK created
  → package pushed to relay

HM reviews

interview dashboard
  → submissions arrive, auto-graded
  → flags highlight anomalies
  → batch advance / reject

Candidate works

/interview INT-4829-XK
  → fetches problem from relay
  → GitHub OAuth (1 account = 1 submission)
  → interview-{code} repo created
  → session recording starts
      → hooks capture every tool call
      → append-only events.jsonl
      → SHA-256 hash chain
      → silent commit after each turn

Candidate submits

/submit
  → session sealed
  → git push → GitHub
  → pushed to relay
  → score + summary shown

Tamper-Evident Architecture

Candidates control their own machine. Security is detection, not prevention. A sparse or gapped session is its own red flag.

Hash chain architecture — Prompt → AI Tool Call → Git Commit, linked by SHA-256

Quality Flags catch sessions completed in under 10 minutes, fewer than 3 tool calls, no iteration pattern, statistically uniform timing, and zero prompts.

Tamper Flags catch large gaps in the event stream (hooks disabled), code changes that don't match Write/Edit tool calls (work outside AI), tool calls with no corresponding prompts (selective suppression), and commits with no matching events (cross-verification).

Overtime Scoring

Submissions after the time limit are accepted — but automatically penalized. The deduction is applied to the AI-graded score post-grading, not injected into the grading prompt, so it can't be argued away.

The math

Overtime is divided into bands: 0–10 min, 10–20 min, 20–30 min, 30–60 min, and 60+ min (capped). Within each band the penalty grows as a quadratic curve — not a flat step — so a candidate barely over the boundary loses almost nothing while one deep in the band feels it accelerate:

position = (overtime - band_start) / (band_end - band_start)   ← 0.0 to 1.0
penalty  = prev_band_max + (this_band_max - prev_band_max) × position²

Band maxima on the 0–10 score scale: −0.5 at 10 min, −1.0 at 20 min, −1.5 at 30 min, −2.5 at 60 min, −4.0 cap beyond that.

Example — 7 minutes over a 60-minute interview, raw AI score 7.5

overtime  = 7 min  →  falls in the 0–10 min band
position  = 7 / 10 = 0.70
penalty   = 0 + (0.5 − 0) × 0.70² = 0.5 × 0.49 = −0.245
final score = 7.5 − 0.245 = 7.26

Compare: 1 min over costs only −0.005; 10 min over costs the full −0.5. The curve means being slightly late is forgiving, but lingering well past the limit compounds quickly.

The dashboard shows the raw AI score, overtime deduction, and adjusted final score separately. A flag is raised alongside — yellow for ≤20 min over, red beyond.

What gets captured

💬 Conversation

Candidate prompts — exact message to the AI
AI reasoning — plan before each action
Timestamps — millisecond precision on every event

🛠️ Tool Calls

File reads — path
File writes — path + content hash
File edits — path + change summary
Bash commands — command + exit code

📂 Git State

Per-prompt commits — timestamp + prompt snippet
Full commit log — hash, message, files changed
GitHub repo — auto-created interview-{code}, pushed on submit

🚩 Session Flags

Quality — too fast, few interactions, no iteration, uniform timing, no prompts
Tamper — hooks gap, diff mismatch, commit mismatch, prompt ratio

The session log is append-only and hash-chained. Any tampering breaks the chain. Raw file contents are never stored — only paths, hashes, and summaries.

Platform support

Platform	Install	Activity capture
Claude Code	`interview install`	✅ Full — prompts, tool calls, reasoning
Codex	`interview install --platform codex`	✅ Full
Gemini CLI	`interview install --platform gemini`	✅ Full
Cursor	`interview install --platform cursor`	⚠️ Limited — skill instructions only
Aider	`interview install --platform aider`	⚠️ Limited — skill instructions only

Relay setup

The relay stores interview packages and candidate sessions so everyone only needs to share a short code.

Option 1 — Self-hosted (~$5/mo, fully private) ← recommended

# After deploying:
# 1. Set RELAY_API_KEY (any random string) in Railway → Variables
# 2. Add a /data volume
# 3. Copy your Railway URL → paste into dashboard setup wizard

# Optional — auto-grading on submission:
GRADING_API_KEY=<anthropic-key>
GRADING_MODEL=claude-haiku-4-5-20251001

Or Docker:

docker build -t interviewsignal-relay .
docker run -e RELAY_API_KEY=secret -v /data:/data -p 8080:8080 interviewsignal-relay

GitHub OAuth (one account = one submission)

Relay operator step — done once at deploy time.

GITHUB_CLIENT_ID=<your_client_id>
GITHUB_CLIENT_SECRET=<your_client_secret>
RELAY_BASE_URL=https://myrelay.up.railway.app

Create the OAuth App at github.com/settings/developers with callback URL: https://myrelay.up.railway.app/auth/github/callback

Option 2 — Email only (free, no server)

interview configure-relay   # choose 2
interview configure-email   # set up SMTP

Reports emailed directly to HM on /submit.

Enterprise configuration

interview configure-llm

Pattern	What to set
Anthropic direct	API key only (default)
Internal proxy (Floodgate, corporate gateway)	Base URL + optional key
OpenAI-compatible endpoint	Base URL + key + `format=openai`

Environment variable overrides: ANTHROPIC_API_KEY, ANTHROPIC_BASE_URL, INTERVIEW_GRADING_MODEL

Privacy

Sessions stored on relay: events.jsonl, manifest.json, flags.json — raw file contents never stored
Grading uses your own API key — interviewsignal never sees it
Self-hosted relay: nothing leaves your network
No telemetry. No analytics. No tracking.

FAQ

How do you prevent candidates from using a second screen to get answers?

Security is detection, not prevention. When someone pastes pre-written code from another screen, they produce large blocks of finished code with no corresponding prompts, no trial-and-error, no iteration. This triggers Ghost Edits and Zero Prompts flags automatically. The absence of signal is itself signal — a sparse session ranks itself at the bottom.

Can we run this completely offline or in a private network?

Yes. The relay server runs inside your own infrastructure — VPC, air-gapped network, whatever you need. Configure your internal LLM proxy for grading. Zero telemetry, zero trackers, zero external dependencies. Python stdlib only.

What coding platforms are supported?

Full hook support (prompts, tool calls, reasoning): Claude Code, Codex, Gemini CLI. Skill instruction support (limited capture): Cursor, Aider. Each new platform adapter is ~30 lines.

Built with

Python stdlib only — zero external dependencies for core and relay. Grading via Anthropic Messages API or any compatible endpoint. Dashboard is a self-contained local HTTP server. Relay is a single-process stdlib server backed by flat files.

Contributing

Prompts — grading instructions are open and community-editable: interview/skills/interview/SKILL.md

Worked examples — run a session, save to worked/{slug}/, write a review.md, open a PR.

Platform adapters — each new platform is ~30 lines in cli.py.

See ARCHITECTURE.md for module map · docs/relay-api.md for the relay API.

Broad-interview, not broadcast-reject. Pure signal.

_{No contrived puzzles. No whiteboard anxiety. No ghosting. Just signal.}

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.9.25

May 30, 2026

0.9.24

May 30, 2026

This version

0.9.23

May 26, 2026

0.9.21

May 26, 2026

0.9.20

May 26, 2026

0.9.17

May 20, 2026

0.9.16

May 20, 2026

0.9.15

May 18, 2026

0.9.14

May 18, 2026

0.9.13

May 18, 2026

0.9.12

May 16, 2026

0.9.11

May 16, 2026

0.9.10

May 16, 2026

0.9.9

May 16, 2026

0.9.8

May 16, 2026

0.9.7

May 16, 2026

0.9.6

May 16, 2026

0.9.5

May 16, 2026

0.9.4

Apr 28, 2026

0.9.2

Apr 27, 2026

0.9.1

Apr 26, 2026

0.9.0

Apr 22, 2026

0.8.5

Apr 22, 2026

0.8.4

Apr 22, 2026

0.8.3

Apr 22, 2026

0.8.2

Apr 21, 2026

0.8.1

Apr 21, 2026

0.8.0

Apr 21, 2026

0.7.9

Apr 21, 2026

0.7.8

Apr 21, 2026

0.7.6

Apr 21, 2026

0.7.5

Apr 21, 2026

0.7.1

Apr 20, 2026

0.6.0

Apr 20, 2026

0.5.8

Apr 20, 2026

0.5.7

Apr 20, 2026

0.5.6

Apr 20, 2026

0.5.5

Apr 20, 2026

0.5.4

Apr 20, 2026

0.5.3

Apr 19, 2026

0.5.2

Apr 19, 2026

0.5.1

Apr 19, 2026

0.5.0

Apr 18, 2026

0.4.0

Apr 18, 2026

0.3.0

Apr 17, 2026

0.2.0

Apr 17, 2026

0.1.5

Apr 17, 2026

0.1.4

Apr 17, 2026

0.1.3

Apr 17, 2026

0.1.2

Apr 16, 2026

0.1.1

Apr 16, 2026

0.1.0

Apr 15, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

interviewsignal-0.9.23.tar.gz (128.3 kB view details)

Uploaded May 26, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

interviewsignal-0.9.23-py3-none-any.whl (120.9 kB view details)

Uploaded May 26, 2026 Python 3

File details

Details for the file interviewsignal-0.9.23.tar.gz.

File metadata

Download URL: interviewsignal-0.9.23.tar.gz
Upload date: May 26, 2026
Size: 128.3 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.7

File hashes

Hashes for interviewsignal-0.9.23.tar.gz
Algorithm	Hash digest
SHA256	`11e6de50b4494e012d9d61b07febd0f92c7f5221726bc23b29ed473086c39652`
MD5	`a88f374ae3763a4f9bb82656d5d665e7`
BLAKE2b-256	`9015196e786240f646f5256c2c104f2a07fb630729ca08557af4a7b9e3c5e82d`

See more details on using hashes here.

File details

Details for the file interviewsignal-0.9.23-py3-none-any.whl.

File metadata

Download URL: interviewsignal-0.9.23-py3-none-any.whl
Upload date: May 26, 2026
Size: 120.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.7

File hashes

Hashes for interviewsignal-0.9.23-py3-none-any.whl
Algorithm	Hash digest
SHA256	`2ee83bdcbe26524fd23b426283f9d6827a79261fb882c64ef55a82adbbad9580`
MD5	`4edc880a78ecaa3f2c7cca5558b8a42f`
BLAKE2b-256	`db68140c316fb64ef2fd2cc49ce1a8ef772e4d1e1a7d5fe7167a6576708b781a`

See more details on using hashes here.

interviewsignal 0.9.23

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

What is AI-native broad-interviewing?

The Engine in Action

Candidate starts a session in the terminal

Candidate works with full-power AI

HM reviews auto-graded submissions in the dashboard

Full transcript with diffs, grading, and tamper detection

AI grades against your rubric — dimension by dimension

The Unfair Advantage

🔗 Capture the Process

🤖 AI-Native Baseline

📊 Triaged in Minutes

🔒 Fully Private

⚡ Zero Setup Cost

🤝 Fair to Candidates

interviewsignal vs the status quo

Quickstart

Hiring manager — create an interview

Candidate — take the interview

Hiring manager — review

How it works

Tamper-Evident Architecture

Overtime Scoring

What gets captured

💬 Conversation

🛠️ Tool Calls

📂 Git State

🚩 Session Flags

Platform support

Relay setup

Option 1 — Self-hosted (~$5/mo, fully private) ← recommended

Option 2 — Email only (free, no server)

FAQ

Built with

Contributing

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes