A CLI tool to download audio from a YouTube video, transcribe it, and refine the transcription using AI.

These details have not been verified by PyPI

Project links

Homepage

Project description

ytdebunk

Overview

Current Features:

ytdebunk is a command-line tool designed to:

Download audio from YouTube videos.
Transcribe the audio content.
Optionally enhance the transcription using the Gemini API.
Optionally detect logical faults in the transctiption using the Gemini API.

Features in queue:

Classifying assertive claims from the transcription.
Fact-checking and validation of the claims from reliable source using online search and agentic AI.
Re-organizing the factual faults and logical faults.
Preparing a script for a hypothetical debunker charecter using generative AI (or AI Agents).
Synthesizing the script to create an audio and a video using generative AI (or AI Agents).

This tool is particularly useful for analyzing transcriptions to identify logical fallacies and incorrect claims made by YouTubers and prepare a debunk video.

Installation

For avoiding conflicts better create a virtual environment and start working on it:

python3.11 -m venv .venv
source .venv/bin/activate

Now, you can install from PyPI using,

pip install ytdebunk

Alternatively, for latest updated please try installing directly from Github using:

pip install git+https://github.com/hissain/youtuber-debunked.git

Usage (The CLI Tool)

The ytdebunk is a command-line interface (CLI) with several options.

Arguments

video_url (str) – URL of the YouTube video to download audio from.

Options

Option	Description
`-e, --enhance` (bool)	Enhance the transcription using the Gemini API. (Default: False)
`-d, --detect` (bool)	Detect logical faults in the transcription using Gemini API. (Default: False)
`-v, --verbose` (bool)	Increase output verbosity.
`-t, --token` (str)	API token for the Gemini API (Required if `--enhance` or `--detect`is enabled).
`-st, --start_time` (float)	Start time of the audio clip in seconds
`-et, --end_time` (float)	End time of the audio clip in seconds
`-m, --model` (str)	A transcription model name from Huggingface (WhisperFeatureExtractor)

Example Usage

ytdebunk "https://www.youtube.com/watch?v=example" -e -d -v -t YOUR_GEMINI_API_TOKEN

export GEMINI_API_TOKEN="your_api_key"
ytdebunk "https://www.youtube.com/watch?v=example" -e -d -v #when Gemini API key is in environment

See an example notebook Example Notebook file for details usage.

Usage (The Streamlit App)

You can simply run the streamlit app to see the demo.

Install the streamlit using pip

pip install streamlit

Run the app.py using streamlit

streamlit run app.py

Screenshots of the Streamlit App

Query Fields Transcription Result Logical Fults Detected

Environment Variables

If preferred, you can set the Gemini API token as an environment variable instead of passing it as a CLI argument:

export GEMINI_API_TOKEN="your_api_key"

Detailed Process

Download Audio
- Uses the download_audio function from ytdebunk.downloader to download audio from the given YouTube URL.
Transcribe Audio
- Uses the transcribe_audio function from ytdebunk.transcriber to generate a text transcription.
Enhance Transcription (Optional)
- If --enhance is enabled, the script uses enhance_transcription from ytdebunk.refiner to refine the transcription using the Gemini API.
- The API token must be provided via --token or as an environment variable.
Detect Logical Faults (Optional)
- If --detect is enabled, the script uses detect_logical_faults from ytdebunk.philosopher to detect logical fults, fallacies, bias, irony and so on in the transcription using the Gemini API.
- The API token must be provided via --token or as an environment variable.
Save Transcription
- The final transcription and logical faults (raw or enhanced) are saved to the ./download folder.

Error Handling

If --enhance or --detect are enabled but no Gemini API token is provided, the script prints an error message and exits.

License

This project is licensed under the MIT License. See the LICENSE file for details.

Contribution and Contact

You can fork this project and submit pull request in the project. Please contact to the author at hissain.khan@gmail.com

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

1.1.3

Mar 26, 2025

1.1.2

Mar 26, 2025

1.1.1

Mar 26, 2025

1.1.0

Mar 25, 2025

This version

1.0.4

Mar 25, 2025

1.0.3

Mar 23, 2025

1.0.2

Mar 23, 2025

1.0.1

Mar 23, 2025

1.0.0

Mar 23, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ytdebunk-1.0.4.tar.gz (9.6 kB view details)

Uploaded Mar 25, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

ytdebunk-1.0.4-py3-none-any.whl (9.8 kB view details)

Uploaded Mar 25, 2025 Python 3

File details

Details for the file ytdebunk-1.0.4.tar.gz.

File metadata

Download URL: ytdebunk-1.0.4.tar.gz
Upload date: Mar 25, 2025
Size: 9.6 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.11.0

File hashes

Hashes for ytdebunk-1.0.4.tar.gz
Algorithm	Hash digest
SHA256	`4be8fa8cb7890603b37718278d3ff6aa2027512ac4a4e3428f6a9c1698109949`
MD5	`877b2bda15dd287877c2acfd52c4af78`
BLAKE2b-256	`1f678a264f30253fb49dd78ed855f78940f904ac77cb38b793d68159e1e65fda`

See more details on using hashes here.

File details

Details for the file ytdebunk-1.0.4-py3-none-any.whl.

File metadata

Download URL: ytdebunk-1.0.4-py3-none-any.whl
Upload date: Mar 25, 2025
Size: 9.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.11.0

File hashes

Hashes for ytdebunk-1.0.4-py3-none-any.whl
Algorithm	Hash digest
SHA256	`faebfb19359f453381ea9ffeb66570a66aa38452a29397dd661e0baedc31e81e`
MD5	`de230d6c8d975c319c9793fc6df14f1d`
BLAKE2b-256	`ef274726f379a4e86672b267d4f1d6285bed50d273a31d9ed3134313af2735e0`

See more details on using hashes here.

ytdebunk 1.0.4

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

ytdebunk

Overview

Current Features:

Features in queue:

Installation

Usage (The CLI Tool)

Arguments

Options

Example Usage

Usage (The Streamlit App)

Screenshots of the Streamlit App

Environment Variables

Detailed Process

Error Handling

License

Contribution and Contact

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes