No project description provided

Project description

Podcast Pile

Data processing infra for Podcast Pile. Manager server

Architecture

The manager server maintains a queue of episodes to process. Worker servers request jobs from the manager, which assigns them tasks. Workers then process their assigned jobs and return results (transcription, diarization, and metadata) to the manager, which stores them in the database.

Jobs automatically expire after 2 hours if not completed. The manager tracks worker IP addresses and worker IDs for each job.

Workers download episodes from their URLs, perform diarization using Nvidia NeMo, transcribe audio using Parakeet, and send the results back to the manager.

Installation

Manager Server

For running the manager server only (no worker):

pip install -U podcastpile

Worker

For running a worker, you need additional ML dependencies:

pip install -U "podcastpile[worker]"

This will install:

librosa (audio processing)
soundfile (audio I/O)
nemo_toolkit[asr] (NeMo ASR models for English)

For Chinese language support, also install:

pip install funasr

The Paraformer model will be automatically downloaded from HuggingFace on first use.

Or using requirements.txt:

pip install -r requirements.txt

Configuration

Copy .env.example to .env and configure as needed:

cp .env.example .env

Configuration options:

DATABASE_URL: Database connection string (default: SQLite)
HOST: Server host (default: 0.0.0.0)
PORT: Server port (default: 8000)
WORKER_AUTH_ENABLED: Enable worker authentication (default: false)
WORKER_PASSWORD: Password for worker authentication (required if worker auth enabled)
ADMIN_AUTH_ENABLED: Enable admin dashboard authentication (default: false)
ADMIN_USERNAME: Username for admin login (default: admin)
ADMIN_PASSWORD: Password for admin login (required if admin auth enabled)
JOB_TIMEOUT_HOURS: Job timeout in hours (default: 2)

Running

Manager Server

Start the manager server:

ppcli manager -p 8000

Options:

-p, --port: Port to run on (default: 8000)
--host: Host to bind to (default: 0.0.0.0)
--reload: Enable auto-reload for development

The admin dashboard will be available at http://localhost:8000

Worker

Start a worker to process jobs:

# Basic usage (processes English jobs only by default)
ppcli am http://localhost:8000

# With worker ID and password
ppcli worker -m http://localhost:8000 -i my-worker -p worker-password

# Process multiple languages
ppcli worker -m http://localhost:8000 -l en,es,fr

# Use custom diarization configuration
ppcli worker -m http://localhost:8000 -c low_latency

# Process one job and exit
ppcli worker -m http://localhost:8000 --once

# Verbose logging
ppcli worker -m http://localhost:8000 -v

# GPU Selection - Use specific GPU
ppcli worker -m http://localhost:8000 --gpu 0

# Multi-GPU - Spawn worker on each available GPU
ppcli worker -m http://localhost:8000 --all-gpus

# Multi-GPU - Use specific GPUs (e.g., 0, 1, and 3)
ppcli worker -m http://localhost:8000 --gpus 0,1,3

# Chinese transcription with custom batch size
ppcli worker -m http://localhost:8000 -l zh --batch-size 8

Worker Options:

-m, --manager: Manager URL (required)
-i, --worker-id: Worker ID (default: hostname-gpu{N})
-p, --password: Worker password (can also use WORKER_PASSWORD env var)
-l, --languages: Comma-separated language codes to process (default: en)
-c, --config: Diarization configuration - very_high_latency, high_latency (default), low_latency, ultra_low_latency
--model: Path to custom .nemo model file
--batch-size: Batch size for FireRedASR transcription (1, 2, 4, 8, 16, etc.) Default: 4
--gpu: GPU device ID to use (e.g., 0, 1, 2)
--all-gpus: Spawn a worker process on each available GPU
--gpus: Comma-separated list of GPU IDs to use (e.g., "0,1,3")
--once: Process one job and exit
--poll-interval: Seconds between polling for jobs (default: 10)
-v, --verbose: Enable verbose logging

The worker will:

Load models once at startup - Models are loaded based on selected languages:
- English/other: Parakeet TDT (NeMo ASR)
- Chinese (zh): Paraformer (FunASR) with VAD and punctuation
- If languages contain zh or cn, Paraformer is loaded
- If languages contain non-Chinese codes, Parakeet is loaded
Request jobs from the manager (filtered by language)
Download the audio file
Perform diarization (always uses NeMo SortFormer)
Perform transcription using the appropriate model based on job language
Compute SHA256 and MD5 hashes of the audio file
Upload results (JSON, transcription, diarization, GPU info, processing time) back to the manager
Repeat continuously (unless --once is used)

Multi-GPU Support:

When using --all-gpus or --gpus, each GPU gets its own worker process
Each worker has a unique ID: hostname-gpu0, hostname-gpu1, etc.
Press Ctrl+C once to gracefully stop all workers
All workers share the same configuration and can process jobs in parallel

CLI Commands

Create a job

# Create a job with an episode URL
ppcli create https://example.com/podcast.mp3

# Create a job with language tag
ppcli create https://example.com/spanish-podcast.mp3 --language es

List jobs

# List all jobs
ppcli list

# Filter by status
ppcli list --status pending

# Limit results
ppcli list --limit 10

View statistics

ppcli stats

API Documentation

Once the server is running, visit http://localhost:8000/docs for interactive API documentation.

Key Endpoints

GET / - Admin dashboard
GET /api/stats - Get job statistics
POST /api/jobs - Create a new job
GET /api/jobs - List jobs
POST /api/jobs/request - Worker requests a job (requires auth if enabled)
POST /api/jobs/{job_id}/start - Mark job as processing (requires auth if enabled)
POST /api/jobs/{job_id}/complete - Submit job results (requires auth if enabled)
POST /api/jobs/{job_id}/fail - Report job failure (requires auth if enabled)

Authentication

Admin Dashboard Authentication

To protect the admin dashboard and charts with a password:

Set ADMIN_AUTH_ENABLED=true in .env
Set ADMIN_USERNAME=admin in .env (or your preferred username)
Set ADMIN_PASSWORD=your-secure-password in .env

When enabled, accessing the dashboard at http://localhost:8000 will prompt for HTTP Basic Authentication credentials.

Worker Authentication

To enable worker authentication:

Set WORKER_AUTH_ENABLED=true in .env
Set WORKER_PASSWORD=your-secure-password in .env
Workers must include X-Worker-Password header in all authenticated requests

Example:

curl -X POST http://localhost:8000/api/jobs/request \
  -H "X-Worker-Password: your-secure-password" \
  -H "Content-Type: application/json" \
  -d '{"worker_id": "worker-1"}'

Note: Admin and worker authentication are independent and can be enabled separately or together.

Development

Run in development mode with auto-reload:

ppcli manager --reload

Project details

Release history Release notifications | RSS feed

This version

0.1.1

Oct 31, 2025

0.1.0

Oct 31, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

podcastpile-0.1.1.tar.gz (28.8 kB view details)

Uploaded Oct 31, 2025 Source

File details

Details for the file podcastpile-0.1.1.tar.gz.

File metadata

Download URL: podcastpile-0.1.1.tar.gz
Upload date: Oct 31, 2025
Size: 28.8 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.0.1 CPython/3.12.10

File hashes

Hashes for podcastpile-0.1.1.tar.gz
Algorithm	Hash digest
SHA256	`3e42725f34d4dd4ff6040c04b20c13954f348cc8d7cb9c516754b9c03013bfff`
MD5	`c7a82b1f531aac6893afcaa3a7dff453`
BLAKE2b-256	`9760bb31680c96b6513b247e6d6098fb0d3e542419a9e6d85648c3ced0167d3f`

See more details on using hashes here.

podcastpile 0.1.1

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

Podcast Pile

Architecture

Installation

Manager Server

Worker

Configuration

Running

Manager Server

Worker

CLI Commands

Create a job

List jobs

View statistics

API Documentation

Key Endpoints

Authentication

Admin Dashboard Authentication

Worker Authentication

Development

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

File details

File metadata

File hashes