Skip to main content

Extract, refine, and analyze YouTube video segments with precision

Project description

SegScript

License: MIT Python 3.11+ PyPI version

A command-line tool for managing, enhancing, and interacting with YouTube transcripts.

Overview

SegScript allows you to download, view, and query YouTube video transcripts directly from your terminal. It provides a clean interface for working with transcripts, including the ability to extract specific time ranges and view enhanced transcript content. I've used the langchain-google-genai package in conjunction with Google's Gemini Flash 2.0 model, which has delivered exceptional results in transcript enhancement.

Features

  • Download transcripts from any YouTube video using its ID
  • List all downloaded transcripts stored in your local collection
  • View full transcripts or segments based on time ranges
  • Interactive mode for browsing and working with your transcript collection
  • Rich text formatting for improved readability in the terminal

Installation

pip install segscript

For testing purposes,

# Clone the repository
git clone https://github.com/keshavsharma25/segscript.git
cd segscript

# Install dependencies
pip install -r pyproject.toml

# Install the package (optional)
pip install -e .

Dependencies

  • youtube-transcript-api: Fetch youtube transcripts with ease
  • click: Command-line interface creation kit
  • rich: Terminal formatting and styling
  • python-dotenv: Load GOOGLE_API_KEY from the command line environment
  • pathlib: Object-oriented filesystem paths
  • langchain-google-genai: For synthesizing transcript into a well structured format

Usage

Basic Commands

# List all downloaded transcripts
segscript list

# Download a transcript for a YouTube video
segscript download VIDEO_ID

# Get a transcript (downloads if not already available)
segscript get VIDEO_ID

# Get a transcript for a specific time range
segscript get VIDEO_ID --time-range "10:00;20:00"

# Start interactive mode
segscript interactive

Interactive Mode

Interactive mode provides a user-friendly interface for:

  1. Browsing your transcript collection
  2. Selecting a transcript to work with
  3. Viewing full transcripts or specific segments
  4. Querying transcripts by time range

File Structure

Transcripts are stored in the ~/.segscript/ directory with the following structure:

~/.segscript/
├── .env                    # Environment variables file
├── VIDEO_ID_1/
│   ├── VIDEO_ID_1.json     # Raw transcript data
│   └── metadata.json       # Video metadata
├── VIDEO_ID_2/
│   ├── VIDEO_ID_2.json
│   └── metadata.json
└── ...

Examples

Download a transcript

segscript download dQw4w9WgXcQ

View a transcript for a specific section of a video

segscript get dQw4w9WgXcQ --time-range "1:30;2:45"

Interactive browsing

segscript interactive

Next TODOs

  • Add transcript summary support

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

  1. Fork the repository
  2. Create your feature branch (git checkout -b feature/amazing-feature)
  3. Commit your changes (git commit -m 'Add some amazing feature')
  4. Push to the branch (git push origin feature/amazing-feature)
  5. Open a Pull Request

License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgments

  • A huge thanks to Youtube Transcript API for making transcript retrieval so easy and accessible.
  • Also kudos to Langchain Google for the langchain-google-genai.
  • Built with Rich for beautiful terminal output.
  • Uses Click for command-line interface.

Note: SegScript is not affiliated with YouTube or Google.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

segscript-0.1.2.tar.gz (15.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

segscript-0.1.2-py3-none-any.whl (14.9 kB view details)

Uploaded Python 3

File details

Details for the file segscript-0.1.2.tar.gz.

File metadata

  • Download URL: segscript-0.1.2.tar.gz
  • Upload date:
  • Size: 15.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.3

File hashes

Hashes for segscript-0.1.2.tar.gz
Algorithm Hash digest
SHA256 2bdea42ae16a5e96db513433f6de6bf4629db1fb98db5c2bb2dca1b2bb46ff26
MD5 3e3f591f35e34568b7aacd829f8d8d01
BLAKE2b-256 f2196c2dbd4591a12d69c14535e10e34252944e9c8148448101cbb25b089a16c

See more details on using hashes here.

File details

Details for the file segscript-0.1.2-py3-none-any.whl.

File metadata

  • Download URL: segscript-0.1.2-py3-none-any.whl
  • Upload date:
  • Size: 14.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.3

File hashes

Hashes for segscript-0.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 37f32c4f9fd7565b4fa55949b70fdd85131cd437959505e4cc004b51233a189f
MD5 e1118169ea847fe2e50ebeb5e5d32dde
BLAKE2b-256 fbe6ae71533e5b9f78439426c2d351cadec45c2631eabbb91e2e7ba0ebaab52c

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page