Extract topics directly from audio or text and text files

These details have not been verified by PyPI

Project description

Overview

`Audio2Topics` Package 🔊🗂️

Audio2Topics is python packge that automizes topic extraction from voice and text files. The package is originally designed to aid researchers that performs interview research. Interview research typically incorporates a series of steps, starting from planning research questions, performing interviews, transcribing, and thorough manual text analysis to extract the main themes and topics of the transcribed text. This manual analysis phase is usually long and time-consuming. Additionally, the feasibility of manual analysis is limited by the volume of transcribed data. Audio2Topics accelerating this step by automtically converting voice files of interviews into topics simply and effectively.

The application provides an end-to-end pipeline that handles:

Audio transcription using OpenAI's Whisper
Text preprocessing and cleaning
Topic modeling with multiple algorithms
Topic validation and optimization
Advanced visualizations
Model comparison and evaluation

Audio2Topics bridges the gap between audio content and text analysis, making topic discovery accessible to researchers, content creators, analysts, and anyone who needs to identify key themes in audio materials.

Key Features

Automated Audio Transcription: Convert audio files to text using state-of-the-art speech recognition
Advanced Text Processing: Clean and normalize text with customizable preprocessing options
Multiple Topic Modeling Approaches: Choose from BERTopic, LDA, and NMF algorithms
Interactive Topic Exploration: Visualize and explore topics through multiple visualization techniques
Topic Quality Validation: Assess topic coherence, diversity, and coverage
Topic Highlighting: See how topics appear in original text with color highlighting
Model Comparison: Compare different topic modeling approaches side by side
LLM Integration: Enhance topic interpretability with AI-generated descriptions
Comprehensive Reporting: Generate detailed reports with findings and visualizations

Workflow Overview

The Audio2Topics application guides users through a structured workflow:

Audio Transcription → 2. Text Processing → 3. Topic Modeling → 4. Validation & Visualization → 5. Topics Comparison

1. Audio Transcription

The first step involves converting audio files to text using OpenAI's Whisper model:

Upload audio files (MP3, WAV, M4A)
Select Whisper model size (tiny to large)
Transcribe with automatic language detection
Review and save transcriptions

2. Text Processing

The transcribed text is then prepared for topic modeling:

Clean text by removing stopwords, special characters, etc.
Apply stemming or lemmatization
Process multiple documents simultaneously
Analyze text statistics for quality assessment

3. Topic Modeling

The core functionality extracts topics from processed text:

Choose from multiple algorithms (BERTopic, NMF, LDA)
Configure parameters like number of topics and n-gram range
Extract topics with adaptive processing for challenging data
Refine topics with LLM-generated descriptions

4. Validation & Visualization

Assess topic quality and explore results:

Validate topics with coherence and diversity metrics
Find the optimal number of topics for your data
Create visualizations from word clouds to interactive topic maps
Highlight topics in original text
Compare different topic modeling approaches
Generate reports for sharing and documentation

5. Comparison Module

The comparison tool enables side-by-side evaluation of different topic modeling approaches.

Can compare between upto 5 topic modeling runs
Creats supporting visualizations
Create comparsion summary report

Installation and Setup

System Requirements

Python 3.8 or higher
4GB RAM minimum (8GB+ recommended for larger models)
NVIDIA GPU with CUDA support (optional, for faster processing)
Internet connection for model downloads and API access

Dependencies

Audio2Topics relies on several key libraries:

PyQt5 for the user interface
OpenAI Whisper for speech recognition
NLTK and spaCy for text processing
BERTopic, scikit-learn, and UMAP for topic modeling
Matplotlib, seaborn, and wordcloud for visualizations
OpenAI/Anthropic APIs for LLM integration (optional)

Installation Steps

Install the package
```
pip install audio2topics
```

Install required language models

python -m spacy download en_core_web_sm
python -m nltk.downloader punkt stopwords

Configure API Keys (Optional)
- For LLM topic refinement, configure OpenAI or Anthropic API keys in the settings
Launch the application
```
python -m audio2topics
```

Use Cases

Audio2Topics serves a wide range of applications across different domains:

Academic Research

Analyze interview recordings for qualitative research
Process lecture content to identify key themes
Extract topics from academic presentations and symposia

Best Practices

Audio Recording Quality

Use clear recordings with minimal background noise
Ensure adequate volume levels
Use lossless formats when possible

Document Preparation

Split long recordings into shorter segments
Group related content into meaningful documents
Remove irrelevant sections before processing

Topic Modeling Strategy

Start with a small subset of documents to test different approaches
Use the Comparison tab to identify the best method for your data
Validate topic quality before drawing conclusions
Use LLM refinement for clearer topic interpretations
Combine quantitative metrics with human judgment

Reporting and Sharing

Export visualizations in appropriate formats for your audience
Include topic words and example documents in reports
Provide context for topics through LLM-generated descriptions
Use interactive formats (HTML) for detailed exploration

References and Resources

Underlying Technologies

License

Developed by: Mohsen Askar ceaser198511@gmail.com

Citation

If you use Audio2Topics in your research, please cite:

@software{audio2topics2025,
    title = {Audio2Topics: A Python package to automatically extract topics from audio or text files},
    author = {Mohsen Askar},
    e-mail = {ceaser198511@gmail.com},
    year = {2025},
    url = {https://pypi.org/project/audio2topics/}
}

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

1.1.2

Apr 20, 2025

1.1.1

Apr 13, 2025

1.1.0

Apr 13, 2025

This version

1.0.0

Apr 11, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

audio2topics-1.0.0.tar.gz (385.4 kB view details)

Uploaded Apr 11, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

audio2topics-1.0.0-py3-none-any.whl (448.2 kB view details)

Uploaded Apr 11, 2025 Python 3

File details

Details for the file audio2topics-1.0.0.tar.gz.

File metadata

Download URL: audio2topics-1.0.0.tar.gz
Upload date: Apr 11, 2025
Size: 385.4 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.7.0 importlib_metadata/1.7.0 pkginfo/1.9.6 requests/2.27.1 requests-toolbelt/0.10.1 tqdm/4.64.1 CPython/3.6.12

File hashes

Hashes for audio2topics-1.0.0.tar.gz
Algorithm	Hash digest
SHA256	`3570b32cbec987ddde27a021d39ff6933e3b0a65aa2930501ab5f4abd34b8852`
MD5	`190dd643603c5a093f80e83133a66a4d`
BLAKE2b-256	`9f7eb45de1251bcba344275b76f4369a353f319f9d8386ecc4abdef029f2ad9d`

See more details on using hashes here.

File details

Details for the file audio2topics-1.0.0-py3-none-any.whl.

File metadata

Download URL: audio2topics-1.0.0-py3-none-any.whl
Upload date: Apr 11, 2025
Size: 448.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.7.0 importlib_metadata/1.7.0 pkginfo/1.9.6 requests/2.27.1 requests-toolbelt/0.10.1 tqdm/4.64.1 CPython/3.6.12

File hashes

Hashes for audio2topics-1.0.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`cc0664a609c21ac8b7aa7f743320d5868c11e853631dd54507c1365cf81ff43a`
MD5	`be2c6e3e4a2b62363ac081b18ff3b0da`
BLAKE2b-256	`fbeef1689c7a0cceb87b895e5b591be031e1a1f767a770698dfcd3d70ccf15a6`

See more details on using hashes here.

audio2topics 1.0.0

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

Overview

Audio2Topics Package 🔊🗂️

Key Features

Workflow Overview

1. Audio Transcription

2. Text Processing

3. Topic Modeling

4. Validation & Visualization

5. Comparison Module

Installation and Setup

System Requirements

Dependencies

Installation Steps

Use Cases

Academic Research

Best Practices

Audio Recording Quality

Document Preparation

Topic Modeling Strategy

Reporting and Sharing

References and Resources

Underlying Technologies

License

Citation

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

`Audio2Topics` Package 🔊🗂️