Turn language-learning audio into Anki-ready study materials.
Project description
audawispr
Split audio files into high-quality sentence-based learning materials.
Features
- Transcription — Local speech-to-text via
faster-whisper; no API keys required. - Segmentation — Splits transcriptions into sentence-level segments using punctuation, pauses, and duration bounds.
- Enrichment — French IPA transcription (optional). Translation scaffolding is in place but not yet implemented.
- Clipping — Extracts audio snippets for each segment using FFmpeg (bundled via
static-ffmpeg). - Export — Outputs Anki-compatible CSV or native
.apkgpackages with embedded audio. - One-shot CLI — Runs the full pipeline with a single command.
Requirements
- Python 3.11+
- FFmpeg is bundled via the
static-ffmpegdependency — no separate installation is needed. Runaudawispr doctorto verify availability. uvis only required for local development (see Setup).
Setup
Install audawispr from PyPI:
pip install audawispr
Or with uv:
uv pip install audawispr
After installing, run audawispr directly. Use uv run audawispr only when working in a cloned repository.
For local development, install runtime and development dependencies:
uv sync --dev
Quickstart
Turn an audio file into an Anki deck with one command:
audawispr lesson.mp3 --output deck.apkg --language fr --ipa
Or use the Python API:
from pathlib import Path
from audawispr import Pipeline
Pipeline(
output=Path("deck.apkg"),
language="fr",
ipa=True,
).run(Path("lesson.mp3"))
The one-shot command runs transcription, segmentation, enrichment, clipping, and export in sequence.
Usage
This section and Quickstart use the bare audawispr command. For development, prefix with uv run.
Show the CLI help:
audawispr --help
Show the installed package version:
audawispr --version
Check local runtime readiness:
audawispr doctor
audawispr doctor reports the package version, Python version, and whether FFmpeg and FFprobe are available from AUDAWISPR_FFMPEG, AUDAWISPR_FFPROBE, PATH, or the static-ffmpeg fallback.
Transcribe audio locally into a transcript manifest:
audawispr transcribe lesson.mp3 --output out/transcript.json --language fr
Validate an existing transcript manifest:
audawispr validate out/transcript.json
Segment a transcript manifest and write an inspection TSV:
audawispr segment out/transcript.json --output out/segments.json
Enrich a segmented French manifest with IPA:
audawispr enrich out/segments.json --ipa --output out/enriched.json
transcribe defaults to French, the small faster-whisper model, automatic device selection, int8 compute, VAD enabled, and required word timestamps. The first real transcription may download model files, but no API key is required. Tests and CI use fakes and do not download models.
segment preserves the transcript manifest schema and rebuilds only the segment list. It splits on sentence punctuation, pauses, and duration bounds. By default, it writes out/segments.tsv next to the JSON output; use --inspection-tsv path/to/review.tsv to choose a different TSV path.
enrich preserves timestamps, words, and source metadata while adding optional study fields. IPA (--ipa) is available for French. Translation is not yet implemented; pass --translate none (the default) to skip it. The pipeline works with any language faster-whisper supports — IPA is the only French-specific feature.
Clip audio snippets from a segmented manifest:
audawispr clip out/enriched.json --output out/clipped.json --output-dir out/media
clip reads a segmented or enriched manifest, extracts each segment's audio from the source file using FFmpeg, and writes the clipped manifest with audio_file paths. By default it reuses existing snippets; use --force to re-clip. Padding (--padding-before-ms, --padding-after-ms), format (--format), and bitrate (--bitrate) are configurable.
Export a clipped manifest for Anki import:
audawispr export out/clipped.json --format anki-csv --output out/anki-csv
export reads a clipped manifest, copies audio snippets, and writes out/anki-csv/cards.csv with columns SourceText, Audio, IPA, Translation, SourceFile, TimestampRange, and SegmentId. Audio references use Anki's [sound:...] syntax.
Manual import in Anki Desktop: File → Import → select cards.csv, set "Fields separated by: Comma", and copy the media/ folder contents into your Anki collection.media folder.
Export as a native Anki package (.apkg) with embedded audio:
audawispr export out/clipped.json --output deck.apkg --deck-name "My French Deck"
When the output path ends in .apkg, the apkg format is inferred automatically. Use --deck-name to set the deck name; the default is audawispr::{language} (e.g. audawispr::fr). The resulting .apkg file can be opened directly in Anki Desktop via File → Import.
Development Checks
uv run pytest
uv run ruff check .
uv run ruff format --check .
uv run ty check src tests
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file audawispr-0.1.1.tar.gz.
File metadata
- Download URL: audawispr-0.1.1.tar.gz
- Upload date:
- Size: 138.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
dfddf54d824f06c2501c5a38f268de11f9beee93bcf78514c83c93021d112242
|
|
| MD5 |
b5a86d819333757c79f970e47da1caa9
|
|
| BLAKE2b-256 |
795a198711643290f6035d13905bd01a0808736055cf9f65dd413b382cc47ef8
|
Provenance
The following attestation bundles were made for audawispr-0.1.1.tar.gz:
Publisher:
release.yml on lanhhoang/audawispr
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
audawispr-0.1.1.tar.gz -
Subject digest:
dfddf54d824f06c2501c5a38f268de11f9beee93bcf78514c83c93021d112242 - Sigstore transparency entry: 1408713219
- Sigstore integration time:
-
Permalink:
lanhhoang/audawispr@d47e8e994c40bd9d1a3d445eab24b73cb9784c3e -
Branch / Tag:
refs/tags/v0.1.1 - Owner: https://github.com/lanhhoang
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yml@d47e8e994c40bd9d1a3d445eab24b73cb9784c3e -
Trigger Event:
push
-
Statement type:
File details
Details for the file audawispr-0.1.1-py3-none-any.whl.
File metadata
- Download URL: audawispr-0.1.1-py3-none-any.whl
- Upload date:
- Size: 30.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
7f959f851b4746b03163b3eed555713534d45460d4699e90a0c32dda9df77b2b
|
|
| MD5 |
8676987adaa9f57effa9404372d5c49c
|
|
| BLAKE2b-256 |
1fb08b57c7895230e8469099ffe728e5e9a553db3f7c296a572c0ab656014c73
|
Provenance
The following attestation bundles were made for audawispr-0.1.1-py3-none-any.whl:
Publisher:
release.yml on lanhhoang/audawispr
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
audawispr-0.1.1-py3-none-any.whl -
Subject digest:
7f959f851b4746b03163b3eed555713534d45460d4699e90a0c32dda9df77b2b - Sigstore transparency entry: 1408713407
- Sigstore integration time:
-
Permalink:
lanhhoang/audawispr@d47e8e994c40bd9d1a3d445eab24b73cb9784c3e -
Branch / Tag:
refs/tags/v0.1.1 - Owner: https://github.com/lanhhoang
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yml@d47e8e994c40bd9d1a3d445eab24b73cb9784c3e -
Trigger Event:
push
-
Statement type: