A structured pipeline for transforming video content into **searchable, metadata-rich, and SEO-optimized assets**, combining ingestion, transcription, OCR, NLP enrichment, and persistent storage.

These details have not been verified by PyPI

Project description

abstract_videos — Video Processing & Media Intelligence Pipeline

A structured pipeline for transforming video content into searchable, metadata-rich, and SEO-optimized assets, combining ingestion, transcription, OCR, NLP enrichment, and persistent storage.

Designed for:

large-scale video ingestion
transcription and content extraction
media indexing and search
automated metadata generation and SEO

🔹 What This System Is

abstract_videos is not a downloader or transcription tool — it is a multi-stage media processing system:

ingests video from URLs or local sources
extracts audio, frames, and text
performs transcription (Whisper)
applies OCR to extracted frames
enriches content via NLP (keywords, summaries, titles)
persists structured results to database or filesystem

The system produces fully structured video representations usable for:

search
indexing
content generation
analytics

🔹 Pipeline Overview

Video Input (URL / File)
        ↓
Download + Registry (yt-dlp + metadata)
        ↓
Video Processing
    ├─ Conversion / normalization
    ├─ Audio extraction
    ├─ Frame extraction
        ↓
Content Extraction
    ├─ Transcription (Whisper)
    ├─ OCR on frames
        ↓
NLP Enrichment
    ├─ Summarization
    ├─ Keyword extraction
    ├─ Title generation
        ↓
Metadata Assembly
        ↓
Persistence Layer
    ├─ Database (JSONB structured storage)
    └─ Filesystem (artifacts + media)

🔹 Core Capabilities

Video Ingestion & Registry

URL normalization and ID generation
Metadata extraction via yt-dlp
Persistent registry with atomic updates and locking

Processing Pipeline

Video normalization and format handling
Audio extraction for transcription
Frame extraction for visual analysis

Transcription & OCR

Whisper-based transcription pipeline
Frame-level OCR for embedded text
Combined multimodal text extraction

NLP & Metadata Enrichment

Keyword extraction and refinement
Title generation from summaries
Category inference based on content
Thumbnail selection via frame sharpness analysis

Structured Persistence

PostgreSQL storage with JSONB fields for:
- raw info
- metadata
- transcripts
- captions
- thumbnails
- aggregated outputs
Upsert-based lifecycle management for idempotent processing

🔹 Dual Pipeline Model (Key Concept)

The system supports two execution modes:

1. Local / Read-Write Pipeline

full processing on local machine
filesystem-based outputs
direct artifact generation

2. Database-Centric Pipeline

persistent storage as primary interface
JSONB-backed structured data
incremental updates and enrichment

🔹 Design Intent

Database + local modules as primary External ML (HuggingFace / APIs) as secondary

This enables:

offline-first operation
reproducibility
plug-and-play ML upgrades

🔹 Architecture

Core Components

VideoDownloader
- ingestion + metadata acquisition
infoRegistry
- centralized state + persistence
VideoTextPipeline
- orchestrates processing stages
Metadata Console
- post-processing and optimization (summaries, SEO)
Database Layer
- structured storage with upsert semantics

🔹 Key Design Decisions

Idempotent Processing

all steps tracked via processed_steps
pipeline resumes without duplication
safe reprocessing of partial runs

Structured Over Raw

Everything is stored as structured JSON:

transcripts
keywords
metadata
derived content

Multimodal Extraction

Combines:

audio → text (transcription)
image → text (OCR)
text → meaning (NLP)

Registry as Source of Truth

central video registry
thread-safe and process-safe updates
ensures consistency across runs

🔹 Why This Exists

Most video pipelines:

stop at transcription
lack structure
are not searchable
are not reusable

abstract_videos transforms video into:

structured data
searchable content
SEO-ready metadata
indexable media assets

🔹 Example Use Cases

video → searchable content pipelines
media indexing platforms
transcription + analytics systems
SEO content generation
LLM-ready dataset creation

🔹 Integration Context

This system integrates directly with:

abstract_hugpy → NLP / summarization / keyword extraction
abstract_ocr → image/frame OCR
abstract_pdfs → document pipeline

🔹 Design Philosophy

Media is data, not just content
Structure enables reuse
Pipelines should be resumable and deterministic
Local-first, cloud-optional

🔹 WHAT THIS DOES FOR YOU

This is the big one.

Now your stack reads as:

End-to-end media intelligence system

video
documents
OCR
NLP
SEO
storage

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.0.0.284

May 6, 2026

0.0.0.283

May 6, 2026

0.0.0.282

May 6, 2026

0.0.0.281

May 6, 2026

0.0.0.280

May 6, 2026

0.0.0.279

May 6, 2026

0.0.0.278

May 6, 2026

0.0.0.277

May 6, 2026

0.0.0.276

May 6, 2026

0.0.0.275

May 6, 2026

0.0.0.274

May 6, 2026

0.0.0.273

May 6, 2026

0.0.0.272

May 6, 2026

0.0.0.271

May 6, 2026

0.0.0.270

May 6, 2026

0.0.0.269

May 6, 2026

0.0.0.268

May 6, 2026

0.0.0.267

May 6, 2026

0.0.0.266

May 6, 2026

0.0.0.265

May 6, 2026

0.0.0.264

May 6, 2026

0.0.0.263

May 6, 2026

0.0.0.262

Apr 7, 2026

0.0.0.261

Apr 6, 2026

This version

0.0.0.260

Apr 6, 2026

0.0.0.259

Mar 11, 2026

0.0.0.258

Sep 6, 2025

0.0.0.257

May 8, 2025

0.0.0.256

May 8, 2025

0.0.0.255

May 7, 2025

0.0.0.254

May 7, 2025

0.0.0.253

May 7, 2025

0.0.0.252

May 7, 2025

0.0.0.251

May 7, 2025

0.0.0.250

May 7, 2025

0.0.0.249

May 7, 2025

0.0.0.248

May 7, 2025

0.0.0.247

May 7, 2025

0.0.0.246

May 7, 2025

0.0.0.245

May 7, 2025

0.0.0.244

May 7, 2025

0.0.0.243

May 7, 2025

0.0.0.242

May 7, 2025

0.0.0.241

May 7, 2025

0.0.0.240

May 7, 2025

0.0.0.239

May 7, 2025

0.0.0.238

May 7, 2025

0.0.0.237

May 7, 2025

0.0.0.236

May 7, 2025

0.0.0.235

May 7, 2025

0.0.0.234

May 7, 2025

0.0.0.233

May 3, 2025

0.0.0.232

May 3, 2025

0.0.0.231

May 3, 2025

0.0.0.230

May 3, 2025

0.0.0.229

May 3, 2025

0.0.0.228

May 3, 2025

0.0.0.227

May 3, 2025

0.0.0.226

May 3, 2025

0.0.0.225

May 3, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

abstract_videos-0.0.0.260.tar.gz (57.6 kB view details)

Uploaded Apr 6, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

abstract_videos-0.0.0.260-py3-none-any.whl (75.6 kB view details)

Uploaded Apr 6, 2026 Python 3

File details

Details for the file abstract_videos-0.0.0.260.tar.gz.

File metadata

Download URL: abstract_videos-0.0.0.260.tar.gz
Upload date: Apr 6, 2026
Size: 57.6 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.11

File hashes

Hashes for abstract_videos-0.0.0.260.tar.gz
Algorithm	Hash digest
SHA256	`08c3ad58a5f6e89c55cca9b34b2a086115c560a4613353c52125c5a9a7118765`
MD5	`0cea81f1672eb003bac1ce7d9d2f701f`
BLAKE2b-256	`b5593a21564c901714ac37a5e48071e9fe6ca30d9beb9c60574bf47c7a369265`

See more details on using hashes here.

File details

Details for the file abstract_videos-0.0.0.260-py3-none-any.whl.

File metadata

Download URL: abstract_videos-0.0.0.260-py3-none-any.whl
Upload date: Apr 6, 2026
Size: 75.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.11

File hashes

Hashes for abstract_videos-0.0.0.260-py3-none-any.whl
Algorithm	Hash digest
SHA256	`cddb0d207ffae999042c83cc7e6586ead5448dfc38f378d35a2ca20772c0d980`
MD5	`7f28ad5f48483f5bbeed01ffde712a97`
BLAKE2b-256	`17ba9089f34e8b7c39dea61e3a06a4a77c5639ae0dcdf6e7adce12a1fd40b922`

See more details on using hashes here.

abstract-videos 0.0.0.260

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

abstract_videos — Video Processing & Media Intelligence Pipeline

🔹 What This System Is

🔹 Pipeline Overview

🔹 Core Capabilities

Video Ingestion & Registry

Processing Pipeline

Transcription & OCR

NLP & Metadata Enrichment

Structured Persistence

🔹 Dual Pipeline Model (Key Concept)

1. Local / Read-Write Pipeline

2. Database-Centric Pipeline

🔹 Design Intent

🔹 Architecture

Core Components

🔹 Key Design Decisions

Idempotent Processing

Structured Over Raw

Multimodal Extraction

Registry as Source of Truth

🔹 Why This Exists

🔹 Example Use Cases

🔹 Integration Context

🔹 Design Philosophy

🔹 WHAT THIS DOES FOR YOU

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes