The `abstract_hugpy` module is designed to facilitate hugging face modules
Project description
Part of the Abstract Media Intelligence Platform
This module provides NLP and ML enrichment across the media pipeline.
abstract_hugpy focuses on:
- summarization and keyword extraction
- metadata generation (titles, descriptions, SEO)
- multimodal refinement (text, audio, video)
Full system: https://github.com/AbstractEndeavors/abstract_media_platform
abstract_hugpy — NLP & Media Enrichment Engine
A modular NLP and ML layer for transforming extracted media content into structured, enriched, and decision-ready data.
Designed to operate as part of a larger pipeline, abstract_hugpy provides:
- summarization
- keyword extraction
- metadata generation
- transcription
- content refinement
🔹 What This System Does
abstract_hugpy converts raw text and media-derived content into:
- summaries
- keywords and density analysis
- titles and descriptions
- structured metadata
- SEO-ready outputs
It sits after extraction and before storage/publishing in the pipeline.
🔹 Core Capabilities
Summarization
- Long-form text summarization (chunked + consolidated)
- Multiple output modes (brief, medium, full)
- Designed for large documents beyond model context limits
Keyword Extraction (Dual Backend)
-
Transformer-based (KeyBERT) + rule-based (spaCy)
-
Preset-driven modes:
- SEO
- metadata
- social
- long-tail
-
Density scoring and keyword classification
Content Refinement
-
Multi-stage generation:
- prompt generation (BigBird / LED)
- refinement via generator model
-
Produces:
- titles
- descriptions
- abstracts
Transcription (Whisper Integration)
- Audio extraction + transcription pipeline
- Singleton-managed models for reuse and performance
Media Metadata Generation
- Title, keywords, and category derivation from transcripts
- Thumbnail extraction via frame sharpness scoring
- URL generation for media assets
🔹 Architecture
Raw Text / Transcript
↓
Summarization
↓
Keyword Extraction
↓
Content Refinement
↓
Metadata Generation
↓
Structured Output
🔹 Key Design Decisions
Singleton Model Management
- models loaded once and reused
- avoids repeated initialization overhead
Preset-Driven Processing
- consistent outputs via named configurations
- avoids ad-hoc parameter tuning
Multi-Backend Strategy
- combines rule-based + transformer approaches
- ensures fallback and robustness
Structured Outputs
- all results returned as typed objects / JSON
- no raw string-only outputs
🔹 Role in the Platform
abstract_hugpy is the enrichment layer of the system:
| Layer | Module |
|---|---|
| Extraction | abstract_ocr |
| Structuring | abstract_pdfs |
| Video | abstract_videos |
| Enrichment | abstract_hugpy |
🔹 Why This Exists
Most ML pipelines:
- operate in isolation
- lack structure
- produce inconsistent outputs
abstract_hugpy provides:
- consistent enrichment
- reusable pipelines
- integration with upstream extraction systems
🔹 Design Philosophy
- Models are tools, pipelines are systems
- Structure over raw output
- Consistency over novelty
- Enrichment is part of the pipeline, not an afterthought
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file abstract_hugpy-0.1.166.tar.gz.
File metadata
- Download URL: abstract_hugpy-0.1.166.tar.gz
- Upload date:
- Size: 30.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.13.11
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
dee2f05d62f8aa96c811f27eba710c35cf08b551cde06501fc6dbd594d37a85c
|
|
| MD5 |
b700547360d75e06ff2dcbba6ac2e8e6
|
|
| BLAKE2b-256 |
978e102e7ed36a717bbf981143c4794e2a3117b20ee448e311cfb6e73577bf00
|
File details
Details for the file abstract_hugpy-0.1.166-py3-none-any.whl.
File metadata
- Download URL: abstract_hugpy-0.1.166-py3-none-any.whl
- Upload date:
- Size: 34.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.13.11
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
3a255127ebe611f41876263cc1ad523df0db9cb0ce758646480510f909377a2a
|
|
| MD5 |
168475ca87f0a130f6ad87769239b5ef
|
|
| BLAKE2b-256 |
e8d3d9a755ef80c1253592bab3d5543abf03f27c3cb78fccb285a15bd9aed69e
|