AI-assisted narrative processing with human-screening — pipeline + web UI for audio/text narratives.

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

xianlbuild

These details have not been verified by PyPI

Project description

narRaters

AI-assisted narrative processing with human-screening.

narRaters supports human cognitive studies and LLM research on long, naturalistic language—narratives as audio or text. It is built around six widely used processing steps for complex stimuli: audioTranscribe, eventSegment, sentenceCorrect, textParsing, textMatching, and causalRating.

You are not locked into one workflow: pick only the steps your study needs, combine them in the order you want, and choose among multiple methods per step (rules, local models, cloud APIs, and more).

The app automates and visualizes those steps; human-screening is facilitated through interface at every step, allowing human raters to review, edit, and sign off on outputs. The same platform supports human vs. LLM comparisons when you want to benchmark summarization, alignment, or causality reasoning on shared materials and prompts.

narRater app icon
macOS: build narRater.app with packaging/macos/build_app_bundle.sh (uses this icon).

Quick start
Getting started
Pipeline overview
Installation
Where to put your data
Using the web interface
Command-line pipeline
Prompt templates
Validation / testing
Performance notes
Library / Python use
Project layout
Further reading
Author
Acknowledgements
License

Quick start

You need Python 3.10+ installed (Windows: check “Add python.exe to PATH” during the installer).

Get the project folder. Either:
- Download ZIP: https://github.com/xianNeuro/narRaters/archive/refs/heads/main.zip, then unzip — or
- Clone: git clone https://github.com/xianNeuro/narRaters.git
Run the installer in that folder:
- Mac: double-click install_narRater.command (first time: Control-click → Open → Open)
- Windows: double-click narRaters_installer.bat
- Linux / Terminal: cd narRaters && bash install.sh
Your browser opens http://127.0.0.1:5000/pipeline-config. Enter a rater name, drag the steps you need, Continue, and run from the dashboard.

That's it. Your data goes in data/ inside the project folder — see Where to put your data. Bundled examples (pieman_edited, the_siren) are already in data/ so you can try the pipeline before adding your own files.

Getting started

Install — get the project folder (clone or download ZIP) and double-click the installer for your OS. Bundled examples live in data/.
Add inputs under data/ — the repo includes bundled examples (pieman_edited, the_siren) you can inspect or run as-is; see that section for paths. Smaller demo/data/ samples are also available.
Start the web UI — the installer starts it for you. To restart later: double-click the installer again, or bash install.sh (Mac/Linux) / narRaters_installer.bat (Windows). PyPI users: narraters serve from any folder.
Configure your workflow — on the first screen, enter a rater name (any label you like), drag in only the steps you need, set each step’s paths and (when you run) its method, then Continue. You need a name and at least one step before Continue enables.
Run and review — use the dashboard grid to run steps per cell; open a subject or story to see tabs for each step, switch versions in the dropdown (automated vs *-edit), edit, save, and export -edit files for analysis.

Pipeline overview

Six steps, your configuration. narRaters does not require all six steps or a fixed order. On the configuration page you select and chain only what your study needs; when you run a step you choose its method (and model or prompt, if applicable). The table below lists each step’s ID (used in the web UI and pipeline_config.json), role, CLI command, and default folders. In typical recall work, audioTranscribe / eventSegment target the story, sentenceCorrect–textMatching each subject recall, and causalRating the story event list—but text-only projects may skip audioTranscribe, and you might run only eventSegment and causalRating, or sentenceCorrect → textParsing → textMatching, and so on. Every included step is available from the GUI or narraters CLI, has a lightweight default method, and supports hand-editing afterward.

#	Step ID	What it does	Terminal command	Default in / out
1	`audioTranscribe`	Audio recordings → text (Whisper/WhisperX); story vs recall via `audioScope` or `--kind`	`narraters transcribe`	`data/4_recall_audio/` (or `data/1_story_audio/` with `--kind story`) → `output/*_audio-transcribed/`
2	`eventSegment`	Story transcript → numbered events	`narraters segment`	`data/2_story_transcript/` → `data/3_story_events/`
3	`sentenceCorrect`	Fix spelling/grammar in recall text (no rewriting)	`narraters correct`	`data/5_recall_texts/` → `output/recall_corrected/`
4	`textParsing`	Corrected recall → clause-level segments	`narraters parse`	`output/recall_corrected/` → `output/recall_parsed/`
5	`textMatching`	Recall segments ↔ story events	`narraters match`	`output/recall_parsed/` + `data/3_story_events/` → `output/recall_rated/`
6	`causalRating`	Causal strength of every story-event pair	`narraters rate`	`data/3_story_events/` → `output/causal_rated/`

For each step, the GUI runs the same backends as the CLI. Available methods, flags, and examples are under Command-line pipeline below.

Installation

Prerequisite: Python 3.10 or newer. On Windows, tick “Add python.exe to PATH” in the installer.
You’re done when your browser shows the pipeline setup page (…/pipeline-config).

Recommended — clone or download the project, then run the installer

This is the easiest way for everyone. You get the full project folder (including the data/ folder for your inputs and output/ for results).

Step 1 — Get the project folder

Pick one:

Method	How
Download ZIP (no git needed)	github.com/xianNeuro/narRaters → main.zip → unzip → rename folder to `narRaters` (optional)
Git clone (recommended; no Gatekeeper warnings)	`git clone https://github.com/xianNeuro/narRaters.git`

Step 2 — Run the installer

Open the narRaters folder in Finder / File Explorer and:

Platform	Double-click	What it does
macOS	`install_narRater.command`	Creates `.venv/`, installs narRaters into it, starts the web UI, opens your browser
Windows	`narRaters_installer.bat`	Same, on Windows
Linux / Terminal	run `bash install.sh`	Same, in Terminal

The installer is safe to run multiple times — re-running just re-launches the app.

macOS — first launch warning: if Finder says “Apple cannot verify…”, Control-click (or right-click) install_narRater.command → Open → Open in the dialog. You only need to do this once. (git clone avoids this; the warning only appears for files extracted from a downloaded ZIP.)

Next time: double-click the same installer file. Or open Terminal/Command Prompt in the folder and run bash install.sh (Mac/Linux) / narRaters_installer.bat (Windows).

Problem	Fix
“Apple cannot verify…” / “malicious software”	Control-click → Open (Mac), or `cd narRaters && bash install.sh` in Terminal.
`Python 3.10+ required`	Install from python.org/downloads, restart Terminal, run again.
Blank browser page	Turn off AirPlay Receiver (macOS: System Settings → General → AirDrop & Handoff) — it claims port 5000.
Port already in use	The installer auto-tries `5001`-`5010`. Use the URL printed in Terminal.
Need sample data	Already in `data/` (`pieman_edited`, `the_siren`).

Alternate — install from PyPI

For users who already have a working Python environment and don't need the bundled example data.

⚠️ Always use python3 -m pip, not bare pip. On macOS, pip often points at an old Python (e.g. system 3.9), which makes PyPI report No matching distribution found for narraters even though the package exists. python3 -m pip runs pip through the same Python you'd use to run the app.

Step 1 — Check Python (must be ≥ 3.10):

python3 --version    # must show 3.10, 3.11, 3.12, or 3.13

If 3.9 or older, install Python from python.org/downloads, then restart Terminal before continuing.

Step 2 — Install into a venv:

python3 -m venv ~/narRaters-venv
source ~/narRaters-venv/bin/activate     # Windows: ~\narRaters-venv\Scripts\activate
python3 -m pip install --upgrade pip
python3 -m pip install narraters
narraters serve

The browser auto-opens http://127.0.0.1:5000/pipeline-config. If you see a blank page on localhost:5000, manually visit http://127.0.0.1:5000/pipeline-config instead — localhost resolves to IPv6 (::1) on some macOS setups while Flask binds to IPv4 (127.0.0.1). Fixed in 0.1.1+ (auto-rewrites to 127.0.0.1).

Exact package name: narraters (all lowercase) — pypi.org/project/narraters. Pin a version: python3 -m pip install narraters==0.1.1.

Next time: source ~/narRaters-venv/bin/activate && narraters serve.

Problem	Fix
`No matching distribution found for narraters` when running `pip install narraters`	Use `python3 -m pip install narraters` instead. Bare `pip` likely points at Python < 3.10. (Verify with `python3 --version`.)
Same error even with `python3 -m pip`	`python3 --version` must be ≥ 3.10. Install Python 3.10+, open a new Terminal, recreate the venv, retry.
`narraters: command not found`	Activate the venv: `source ~/narRaters-venv/bin/activate`.
Browser opens `localhost:5000` but page is blank	Visit `http://127.0.0.1:5000/pipeline-config` instead, or upgrade: `python3 -m pip install --upgrade narraters`.
Port 5000 in use (macOS AirPlay Receiver)	`narraters serve --port 5001`, or System Settings → General → AirDrop & Handoff → turn off AirPlay Receiver.
`Could not find a version` / offline	`python3 -m pip install narraters -i https://pypi.org/simple`
You need bundled `data/` examples	Use the clone or download path above — PyPI installs the app only.

PyPI installs the app only. To get the data/ folder structure with bundled examples, also clone the repo or use the recommended path above.

After install (every path)

Browser opens http://127.0.0.1:5000/pipeline-config (port may differ — read the URL printed in the Terminal window).
Type a rater name (any label).
Drag steps into the pipeline → Continue → run from the dashboard.

Default methods all work offline with no huge downloads:

Step	Default
segment	`clause`
correct	`rules`
parse	`rules`
match	`test`
rate	`linguistic`

Optional extras (Whisper, APIs, big models)

Add only what you need — after the base install. From inside a clone, with the venv activated:

python3 -m pip install -e ".[audio]"     # transcription (Whisper)
python3 -m pip install -e ".[api]"       # cloud LLM steps (Anthropic / OpenAI)
python3 -m pip install -e ".[nlp]"       # finer segmentation (spaCy)
python3 -m pip install -e ".[grammar]"   # grammar checker
python3 -m pip install -e ".[local-llm]" # local Gemma (large)
python3 -m pip install -e ".[match]"     # rmatch
python3 -m pip install -e ".[all]"       # api + match

If you installed from PyPI: python3 -m pip install "narraters[audio]", etc.

Heavy downloads show a RAM/disk warning first — see Heavy local models.

Ollama (local Gemma): install Ollama, then ollama pull gemma4:e4b.

API keys (cloud): copy .env.example to .env and fill in the keys — see SETUP_API.md.

Developers

The install.sh script already does an editable install. To work on the codebase:

git clone https://github.com/xianNeuro/narRaters.git
cd narRaters
bash install.sh         # creates .venv, installs editable, starts server
# or, manual:
python3 -m venv .venv && source .venv/bin/activate && python3 -m pip install -e .

Build a local narRater.app for icon/Dock testing: bash packaging/macos/build_app_bundle.sh.

Advanced — macOS DMG (unsigned, may be blocked)

macOS Gatekeeper blocks unsigned .app and .command files downloaded from the internet (we are not paying for an Apple Developer ID). The recommended path above (install_narRater.command from a git clone) avoids this entirely. The DMG is provided for users who want the .app icon in ~/narRaters/.

If the DMG is blocked with “malicious software”, use the recommended path instead. Workaround if you really want the app bundle:

xattr -dr com.apple.quarantine ~/Downloads/narRaters-macos-installer.dmg
open ~/Downloads/narRaters-macos-installer.dmg
xattr -dr com.apple.quarantine /Volumes/narRaters

Maintainer rebuild: bash packaging/macos/build_installer_dmg.sh.

Where to put your data

After installation, place files so the paths match what you configured on the pipeline page (defaults below are relative to the project root). You can remap any step’s input/output folders there without moving data.

You have…	Put it in…	Format / naming
Story transcript (text)	`data/2_story_transcript/`	`{story}.txt` — plain UTF-8 text, one story per file
Story event list (pre-segmented)	`data/3_story_events/`	`{story}_events.xlsx` — columns `event`, `story_texts`
Subject recall text	`data/5_recall_texts/`	`{subj_id}.txt` — e.g. `the_siren_sub-01.txt`
Story audio (optional, Step 1)	`data/1_story_audio/`	`.wav` / `.mp3` / `.m4a`, named by story
Recall audio (optional, Step 1)	`data/4_recall_audio/`	`.wav` / `.mp3` / `.m4a`, named by subject

Outputs are written under output/ — one subdirectory per step (output/recall_corrected/, output/recall_parsed/, output/recall_rated/, …). A smaller alternate layout lives in demo/data/ (lighthouse story, three recall .txt files).

Bundled examples (`pieman_edited`, `the_siren`)

The repository ships realistic sample inputs and outputs under data/ and output/ so you can see accepted naming and file types before adding your own study. Your private files in those folders stay untracked (see .gitignore); only the examples below are committed.

Stories: pieman_edited (story audio + transcript + events) and the_siren (transcript, events, two recall subjects).

Role	Folder	Example file(s)
Story audio (input)	`data/1_story_audio/`	`pieman_edited.wav`
Story transcript (input)	`data/2_story_transcript/`	`pieman_edited.txt`, `the_siren.txt`
Story events (input)	`data/3_story_events/`	`pieman_edited_events.xlsx`, `the_siren_events.xlsx`
Recall audio (input)	`data/4_recall_audio/`	`the_siren_sub-01.mp4`, `the_siren_sub-02.mp4`
Recall text (input)	`data/5_recall_texts/`	`the_siren_sub-01.txt`, `the_siren_sub-02.txt`
Story transcription (output)	`output/story_audio-transcribed/`	`pieman_edited.txt`
Recall transcription (output)	`output/recall_audio-transcribed/`	`the_siren_sub-01.txt`, `the_siren_sub-02.txt`
Spell/grammar correction (output)	`output/recall_corrected/`	`the_siren_sub-01.txt`, `the_siren_sub-02.txt`
Parsed recall (output)	`output/recall_parsed/`	`the_siren_sub-01_parsed.xlsx`, `the_siren_sub-02_parsed.xlsx`
Recall ↔ events (output)	`output/recall_rated/`	`the_siren_sub-02_rate-recall-test_mode.xlsx` (method slug in filename)
Causal ratings (output)	`output/causal_rated/`	`pieman_edited_causal-linguistic.xlsx`, `the_siren_causal-linguistic.xlsx`

Quick try: after install, point a pipeline at the default folders above and run sentenceCorrect → textParsing → textMatching on the_siren_sub-01 / the_siren_sub-02, or open the bundled output/ files in Excel to inspect column layouts. Story pieman_edited is useful for audioTranscribe (large .wav) and causalRating on pieman_edited_events.xlsx.

File versioning is a core feature. Automated runs write {subj_id}_{method}.ext (or {story}_… for story-level steps); your hand-edited versions are saved as {subj_id}_{ratername}-edit.ext and never overwrite the originals. The web UI lets you switch between versions via a dropdown, and the -edit files are what you export for analysis.

Using the web interface

The app is a local Flask site (default http://127.0.0.1:5000). Start it from the project directory in any of these ways:

How	What to do
Terminal (macOS, Linux, Windows)	`narraters serve` — usually opens your browser automatically.
macOS — script	Double-click `server/START_HERE.command` (can install missing deps on first run).
macOS — app bundle	Build once: `bash packaging/macos/build_app_bundle.sh`, then double-click `narRater.app`. Not committed to Git; see the icon at the top of this README.

On first visit you see pipeline configuration unless a pipeline was already saved, in which case you land on the dashboard.

`narraters serve` options

Flag	Default	Purpose
`--port`	`5000`	Another port if `5000` is busy
`--host`	`127.0.0.1`	Bind address; use `0.0.0.0` only on a trusted network (the UI runs subprocesses on your machine)
`--no-browser`	off	Do not open a browser tab (SSH, headless)
`--debug`	off	Flask debug / auto-reload while hacking on the server

narraters serve --port 8080 --no-browser

Navigating the three main screens

Use this table as a mental map; URLs are for bookmarking or support.

Screen	Route	What you do there
Pipeline setup	`/pipeline-config`	Drag steps from Available Steps into Pipeline Flow, set per-step folders, enter a rater name (or 🎲). Continue unlocks only when there is a name and at least one step; it saves config and opens the dashboard.
Dashboard	`/`	Grid: rows = subjects or stories, columns = steps. Click a cell to run that step for that row (pick method / model / prompt / variant if the step offers them). Batch actions run one step across all rows. Change rater returns to setup.
Detail view	`/subject/…` or `/story/…`	Tabs per pipeline step for one row. Read outputs, use the version dropdown to compare the latest automated file vs your `{id}_{ratername}-edit` saves, edit, save. Use `-edit` files for downstream analysis.

Flow: setup → dashboard (bulk status + runs) → open a row when you need to inspect, hand-correct, or compare versions. You can return to setup anytime to add steps or change paths.

Heavy local models

Before a step that would load Whisper, Gemma via Ollama, rMatch embeddings, or local Transformers, the app runs a RAM / disk / model preflight. If the run is likely unsafe for your machine, a popup explains why and can switch you to a lighter method (for example rules, test, clause). The check does not download or start a model just to decide, so it should not wedge the system. Capable machines with models already installed often see no popup.

Command-line pipeline

Each of the six steps is a separate narraters subcommand with its own --method (and related options). Use the CLI for scripts, clusters, or reproducible runs—with or without the web UI, and with any subset of steps your study uses. General shape:

narraters <step> [--method METHOD] [--model MODEL] [-i INPUT] [-o OUTPUT] [--prompt-version VERSION] ...

Discover what's available at any time:

narraters --help                 # list all subcommands
narraters <step> --help          # step-specific options
narraters segment --list-prompts # list available prompt versions for a step
narraters segment --list-models  # list supported model identifiers

The method choices below are exactly those accepted by the CLI (src/narraters/cli.py).

Step 1 — `transcribe` (audio → text)

narraters transcribe --model large-v3 --timestamps          # recall audio (default)
narraters transcribe --kind story --model small              # story audio instead
narraters transcribe -i path/to/audio -o path/to/out         # custom directories
narraters transcribe --filter sub-01                         # one item only

Option	Choices	Notes
`--model`	`tiny`, `base`, `small`, `medium`, `large-v2`, `large-v3`	Whisper model name
`--timestamps`	flag	Also write Excel files with word-level timestamps
`--kind`	`recall` (default), `story`	Picks the conventional directories: `recall` = `data/4_recall_audio/` → `output/recall_audio-transcribed/`; `story` = `data/1_story_audio/` → `output/story_audio-transcribed/`
`-i, --input`	path	Input audio directory (overrides the `--kind` default)
`-o, --output`	path	Output directory (overrides the `--kind` default)
`--filter`	substring	Only transcribe files whose name matches this item id

Requires pip install "narraters[audio]" (or pip install -e ".[audio]" from a clone). Text-only projects can skip Step 1 entirely.

Step 2 — `segment` (story → events)

narraters segment --method clause
narraters segment --method api --model <anthropic-model-id> --prompt-version event_segment
narraters segment --method fine --input data/2_story_transcript/my_story.txt

Run narraters segment --list-models for the exact --model strings (Anthropic, OpenAI, and Ollama-backed presets).

Option	Choices	Notes
`--method`	`clause`, `fine`, `coarse`, `api`	`clause` needs no model; `fine`/`coarse` use spaCy if installed; `api` calls an LLM
`--model`	see `narraters segment --list-models`	Only used with `--method api` (Anthropic, OpenAI, or Ollama preset keys)
`--prompt-version`	see `--list-prompts`	Selects a template from `scripts/prompt/event_segment*.txt`
`-i, --input`	path	Single transcript file or a directory (else processes all)
`-o, --output`	path	Output directory (default: `data/3_story_events/`)

Step 3 — `correct` (spell / grammar fixes)

narraters correct --method rules
narraters correct --method gemma-ollama --ollama-model gemma4:e4b

Option	Choices	Notes
`--method`	`rules`, `gemma-ollama`	`rules` runs entirely locally with no model; `gemma-ollama` needs a local Ollama server
`--ollama-model`	e.g. `gemma4:e4b`	Local Ollama model tag (with `gemma-ollama`)
`--prompt-file`	path	Override the instructions file (default: `scripts/prompt/spell_gram.txt`)
`-i, --input`	path	Single recall text file
`-o, --output`	path	Output directory

Minimal corrections only — Step 3 fixes spelling/grammar errors and never rewrites or paraphrases.

Step 4 — `parse` (recall text → clause-level segments)

narraters parse --method rules
narraters parse --method ollama --model gemma4:e4b --prompt-version recall_parse_clause
narraters parse --filter-pattern sub-02            # process one subject only

Option	Choices	Notes
`--method`	`rules`, `ollama`	`rules` is the default (regex, no model); `ollama` uses local Gemma
`--model`	e.g. `gemma4:e4b`	Ollama model tag (with `--method ollama`)
`--prompt-version`	see `scripts/prompt/recall_parse_*.txt`	Prompt template name
`-i, --input`	path	Input directory (default: `output/recall_corrected/`)
`-o, --output`	path	Output directory (default: `output/recall_parsed/`)
`--filter-pattern`	substring	Optional filter to process a single subject

Step 5 — `match` (recall segments ↔ story events)

narraters match --test-mode                       # simulated keyword matching, no model/API
narraters match --method api --story-events data/3_story_events
narraters match --method gemma-ollama
narraters match --method rmatch                   # embedding matcher (requires [match])

Option	Choices	Notes
`--method`	`test`, `api`, `gemma-ollama`, `rmatch`	`test` is keyword-based, free, and always available; `rmatch` needs `pip install "narraters[match]"`
`--story-events`	path	Directory of `{story}_events.xlsx` (default: `data/3_story_events`)
`-i, --input`	path	Recall-parsed input directory (default: `output/recall_parsed/`)
`-o, --output`	path	Output directory (default: `output/recall_rated/`)
`--test-mode`	flag	Equivalent to `--method test` — simulated matching, no API calls

Step 6 — `rate` (causal relationships between event pairs)

narraters rate --method linguistic
narraters rate --method api --model <anthropic-or-openai-model-id> --prompt-version causal_rating
narraters rate --method manual                    # write an empty matrix for hand rating

Use narraters rate --help and the Step 6 model dropdown in the web UI for supported --model values when using --method api.

Option	Choices	Notes
`--method`	`linguistic`, `api`, `manual`	`linguistic` is rule-based (no model); `manual` scaffolds an N×N matrix to fill in by hand
`--model`	see web UI / provider docs	Only used with `--method api`
`--prompt-version`	see `scripts/prompt/causal_rating*.txt`	Prompt template name
`-i, --input`	path	Input file/directory
`-o, --output`	path	Output directory

Prompt templates

LLM-backed methods load text from scripts/prompt/ (you can add versions or override paths; see scripts/prompt/README.md). Bundled templates:

File	Step	Used by
`event_segment.txt`	2 — segment	`--method api`
`spell_gram.txt`	3 — correct	`--method gemma-ollama`
`recall_parse_clause.txt`	4 — parse	`--method ollama`
`recall_rating.txt`	5 — match	`--method api`, `--method gemma-ollama`
`causal_rating.txt`	6 — rate	`--method api`

You can:

Browse available versions with narraters <step> --list-prompts
Select a version with --prompt-version <name>
Override the file directly for Step 3 with --prompt-file path/to/prompt.txt
Add your own by dropping a new .txt into scripts/prompt/ — it's picked up automatically

Validation / testing

There is no bundled pytest suite. Use the helper scripts under helpers/ for smoke checks and regression-style runs, for example:

python helpers/test_recall_rater_single_subject.py
python helpers/test_story_event_segment.py
python helpers/test_recall_rater_all_stories.py
python helpers/test_bar_metrics_all_rated.py

Research background (by pipeline step)

The steps below follow the same numbering as the pipeline overview. Citations motivate or validate automated approaches similar to optional narRaters methods; your study still needs design-appropriate evaluation.

audioTranscribe (step 1)
No paper cited here; validation is Whisper/WhisperX accuracy on your audio and manual spot checks. See Installation ([audio] extra) and the helper scripts above.

eventSegment (step 2)
Michelmann, Kumar, Norman, & Toneva, Large language models can segment narrative events similarly to humans: GPT-3 zero-shot boundaries correlate with human segmentations and approximate crowd consensus on continuous text—useful precedent for LLM-based story segmentation in narRaters. arXiv:2301.10297, Behavior Research Methods (2025), companion code.

sentenceCorrect (step 3)
No external benchmark listed; the package enforces minimal, non-paraphrasing edits. Exercise the recall-correction helpers if you change rules or prompts.

textParsing (step 4)
No paper cited here; clause-level structure is checked against the same independent-clause logic as eventSegment (see above and the web UI tooltips).

textMatching (step 5)

Norman lab / Computational Memory (Princeton) — Toneva et al., Memory for long narratives (presentation materials, 2021; includes K. A. Norman): long-form novel recall scored by aligning recalled events to chapter events with GPT-2 representations, toward scalable scoring without fully manual coding. PDF (Princeton Computational Memory Lab).
rMatch — Kressin Palacios & Arekar: embedding-based recall-to-event matching with human-data validation. GabrielKP/rMatch.

causalRating (step 6)
Li et al., Agency personalizes episodic memories (PsyArXiv, 2024): behavioral work with choose-your-own-adventure narratives and controlled choice, examining how agency shapes memory for branching, choice-contingent event sequences—aligned with rich event-wise materials for which pairwise causal ratings are meaningful. DOI:10.31234/osf.io/7evwj.

Performance notes

The dashboard caches each output directory's listing once per page request and reuses it for every cell in the status grid, instead of scanning the disk again for each subject and step separately. On large studies that difference is very noticeable.

Library / Python use

from narraters import __version__, project_root
print(__version__, project_root())

Direct per-step imports are planned for a future release; for now, programmatic use should call the CLI via subprocess or import the modules under scripts/.

Project layout

narRaters/
├── pyproject.toml                # package metadata, deps, console scripts
├── requirements.txt              # minimal runtime deps (extras commented)
├── src/narraters/                # the installed package
│   ├── cli.py                    # argparse entry point (`narraters` command)
│   └── paths.py                  # repo-root resolution
├── scripts/                      # pipeline scripts (delegated to by the CLI)
│   ├── 1_audio-transcribe.py … 6_causal-rater.py
│   └── prompt/                   # LLM prompt templates
├── server/web-interface.py       # Flask web UI (routes, subprocess orchestration)
├── templates/, static/           # HTML / CSS / JS for the web UI
├── helpers/                      # paths, Ollama/disk/RAM preflight, plotting, tests
│   ├── disk_space.py             # free-disk preflight for local models
│   └── resource_preflight.py     # heavy-method (RAM/disk) assessment
├── data/                         # inputs (bundled pieman_edited + the_siren examples; your files stay local)
├── output/                       # pipeline outputs (sample outputs for the same examples)
├── demo/                         # smaller demo dataset (lighthouse)
├── packaging/macos/              # build script for the `.app` bundle
├── SETUP_API.md                  # user-facing API key and provider setup
└── .env.example                  # template for local API keys (copy to `.env`)

Author

Xian Li — xianl.cogneuro@gmail.com

Acknowledgements

Janice Chen for brainstorming the causal-rating step interface and for help testing and improving package functionality.
Gabi Kressin Palacios and Dhruva Arekar for an additional method for the recall-matching step (matching human recall text to story events). See GabrielKP/rMatch for human-data–validated AI-assisted recall rating.
Xiyu Li (Rita) for contributions to the recall_rating prompt development and for validating model performance on human recall data (commercial LLM APIs were close to human raters).

License

In short: free for research, education, and other non-commercial use; commercial or for-profit use needs prior written permission from the copyright holder (contact below).

The Software is licensed under the NarRaters Research and Non-Commercial License (see LICENSE): free use for research, education, and other non-commercial purposes; commercial or for-profit use requires prior written permission from the copyright holder. Contact xianl.cogneuro@gmail.com for commercial licensing.

This model is in the same family as widely used non-commercial / academic first terms (for example the PolyForm Noncommercial pattern for permitted non-commercial purposes, and dual-license or commercial-license-required approaches similar in spirit to the Prosperity Public License model, where commercial rights are negotiated separately with the author).

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

xianlbuild

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.1.1

May 17, 2026

0.1.0

May 17, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

narraters-0.1.1.tar.gz (458.2 kB view details)

Uploaded May 17, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

narraters-0.1.1-py3-none-any.whl (479.5 kB view details)

Uploaded May 17, 2026 Python 3

File details

Details for the file narraters-0.1.1.tar.gz.

File metadata

Download URL: narraters-0.1.1.tar.gz
Upload date: May 17, 2026
Size: 458.2 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for narraters-0.1.1.tar.gz
Algorithm	Hash digest
SHA256	`1a958ef4a6b7b05d9d8c235049457905d90c45ee38918d708cd6a58c1e3216e8`
MD5	`f950d6d2933f57e09b18ce050a72ae84`
BLAKE2b-256	`f7f7037786b2e2896207003875b45f560de3e99c48fcdbdbe2d81ca026078ae5`

See more details on using hashes here.

Provenance

The following attestation bundles were made for narraters-0.1.1.tar.gz:

Publisher: python-publish.yml on xianNeuro/narRaters

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: narraters-0.1.1.tar.gz
- Subject digest: 1a958ef4a6b7b05d9d8c235049457905d90c45ee38918d708cd6a58c1e3216e8
- Sigstore transparency entry: 1563528907
- Sigstore integration time: May 17, 2026
Source repository:
- Permalink: xianNeuro/narRaters@5013a0673611d6b5a550a5df1b6d78939a3e8c80
- Branch / Tag: refs/tags/v0.1.1
- Owner: https://github.com/xianNeuro
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: python-publish.yml@5013a0673611d6b5a550a5df1b6d78939a3e8c80
- Trigger Event: release

File details

Details for the file narraters-0.1.1-py3-none-any.whl.

File metadata

Download URL: narraters-0.1.1-py3-none-any.whl
Upload date: May 17, 2026
Size: 479.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for narraters-0.1.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`27884a94521201fa6eab51b11b1767b46d32398eef5283ecee2dbe6edc46ba64`
MD5	`b0a0ba3db1b56232f70447633eb5bfc2`
BLAKE2b-256	`79ea878a45b7747fffe56940cb9bb8fd7a8c1c19634d30d98c41d27f19f134cd`

See more details on using hashes here.

Provenance

The following attestation bundles were made for narraters-0.1.1-py3-none-any.whl:

Publisher: python-publish.yml on xianNeuro/narRaters

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: narraters-0.1.1-py3-none-any.whl
- Subject digest: 27884a94521201fa6eab51b11b1767b46d32398eef5283ecee2dbe6edc46ba64
- Sigstore transparency entry: 1563528915
- Sigstore integration time: May 17, 2026
Source repository:
- Permalink: xianNeuro/narRaters@5013a0673611d6b5a550a5df1b6d78939a3e8c80
- Branch / Tag: refs/tags/v0.1.1
- Owner: https://github.com/xianNeuro
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: python-publish.yml@5013a0673611d6b5a550a5df1b6d78939a3e8c80
- Trigger Event: release

narRaters 0.1.1

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

narRaters

Table of contents

Quick start

Getting started

Pipeline overview

Installation

Recommended — clone or download the project, then run the installer

Alternate — install from PyPI

After install (every path)

Optional extras (Whisper, APIs, big models)

Developers

Advanced — macOS DMG (unsigned, may be blocked)

Where to put your data

Bundled examples (pieman_edited, the_siren)

Using the web interface

narraters serve options

Navigating the three main screens

Heavy local models

Command-line pipeline

Step 1 — transcribe (audio → text)

Step 2 — segment (story → events)

Step 3 — correct (spell / grammar fixes)

Step 4 — parse (recall text → clause-level segments)

Step 5 — match (recall segments ↔ story events)

Step 6 — rate (causal relationships between event pairs)

Prompt templates

Validation / testing

Research background (by pipeline step)

Performance notes

Library / Python use

Project layout

Further reading

Author

Acknowledgements

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance

Bundled examples (`pieman_edited`, `the_siren`)

`narraters serve` options

Step 1 — `transcribe` (audio → text)

Step 2 — `segment` (story → events)

Step 3 — `correct` (spell / grammar fixes)

Step 4 — `parse` (recall text → clause-level segments)

Step 5 — `match` (recall segments ↔ story events)

Step 6 — `rate` (causal relationships between event pairs)