A tool that automatically generates step-by-step documentation from instructional videos

These details have not been verified by PyPI

Project links

Homepage

License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Project description

VideoInstruct

VideoInstruct is a tool that automatically generates step-by-step documentation from instructional videos. It uses AI to extract transcriptions, interpret video content, and create comprehensive markdown guides.

Pipeline:

VideoInstruct employs a sophisticated pipeline that transforms instructional videos into comprehensive documentation through multiple AI-powered stages. The process begins with the Video Interpreter, which simultaneously handles video transcription extraction and visual content analysis. This dual-stream approach ensures both spoken instructions and visual demonstrations are captured. The extracted information is then fed into the Documentation Generator, which creates structured, step-by-step documentation. Before finalization, the Documentation Evaluator assesses the quality and completeness of the generated content using conversation memory and interactive Q&A between AI agents. If the documentation doesn't meet the defined standards, it's sent back for refinement, ensuring high-quality output that accurately represents the video's instructional content.

Quick Start

Using Docker (Recommended)

The fastest and simplest way to use VideoInstruct is through our Docker image. See DOCKER_USAGE.md for detailed instructions on:

Installation and prerequisites
Downloading the Docker file from Docker Hub.
Configuration options
Troubleshooting common issues

Using Python Package

# Install from PyPI
pip install videoinstruct

# Set up environment variables
export OPENAI_API_KEY=your_openai_key
export GEMINI_API_KEY=your_gemini_key
export DEEPSEEK_API_KEY=your_deepseek_key

# Use in your code
from videoinstruct import VideoInstructor
instructor = VideoInstructor(video_path="path/to/video.mp4")
documentation = instructor.generate_documentation()

Features

Automatic video transcription extraction
AI-powered video interpretation
Step-by-step documentation generation
Automated documentation quality evaluation with conversation memory
Interactive Q&A workflow between AI agents
User feedback integration for documentation refinement
Configurable escalation to human users
Screenshot generation and annotation
PDF export capabilities
Enhanced workflow visibility with real-time status updates
Transparent model information display for each agent

Installation Options

Docker (Recommended): See DOCKER_USAGE.md
PyPI: pip install videoinstruct

Source:

git clone https://github.com/PouriaRouzrokh/VideoInstruct.git
cd VideoInstruct
pip install -r requirements.txt

Project Structure

VideoInstruct/
├── data/                  # Place your video files here
├── docs/                  # Documentation files
│   ├── README.md         # Main documentation
│   ├── DOCKER_USAGE.md   # Docker setup guide
│   └── Figure.png        # Pipeline diagram
├── examples/              # Example usage scripts
│   └── example_usage.py  # Basic usage example
├── output/               # Generated documentation output
├── scripts/              # Utility scripts
├── temp/                 # Temporary files directory
├── videoinstruct/        # Main package
│   ├── agents/          # AI agent modules
│   ├── prompts/         # System prompts for agents
│   ├── tools/           # Utility tools
│   ├── utils/           # Utility functions
│   ├── __init__.py      # Package initialization
│   ├── configs.py       # Configuration classes
│   ├── prompt_loader.py # Prompt loading utilities
│   └── videoinstructor.py # Main orchestration class
├── Dockerfile           # Docker configuration
├── LICENSE              # MIT License
├── MANIFEST.in         # Package manifest
├── pyproject.toml      # Project metadata
├── requirements.txt    # Python dependencies
└── setup.py           # Package setup script

Using as a Python Package

from videoinstruct import VideoInstructor, VideoInstructorConfig
from videoinstruct.agents import DocGeneratorConfig, VideoInterpreterConfig, DocEvaluatorConfig

# Configure the VideoInstructor
config = VideoInstructorConfig(
    doc_generator_config=DocGeneratorConfig(
        api_key=openai_api_key,
        model_provider="openai",
        model="o3-mini",
        temperature=0.7
    ),
    video_interpreter_config=VideoInterpreterConfig(
        api_key=gemini_api_key,
        model="gemini-2.0-flash"
    ),
    doc_evaluator_config=DocEvaluatorConfig(
        api_key=deepseek_api_key,
        model="deepseek-reasoner"
    )
)

# Initialize and run
instructor = VideoInstructor(
    video_path="path/to/video.mp4",
    config=config
)
documentation = instructor.generate_documentation()

Contributing

To contribute to VideoInstruct:

Fork the repository
Create a feature branch: git checkout -b feature-name
Commit your changes: git commit -am 'Add some feature'
Push to the branch: git push origin feature-name
Submit a pull request

Troubleshooting

For Docker-related issues, see DOCKER_USAGE.md
For Python package issues:
- Make sure all dependencies are installed
- Check your Python version (3.8+ required)
- Verify your API keys and internet connection

License

MIT License

Project details

These details have not been verified by PyPI

Project links

Homepage

License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Release history Release notifications | RSS feed

This version

0.1.9

Mar 5, 2025

0.1.7

Mar 5, 2025

0.1.6

Mar 4, 2025

0.1.5

Mar 4, 2025

0.1.4

Mar 4, 2025

0.1.3

Mar 4, 2025

0.1.2

Mar 2, 2025

0.1.1

Mar 2, 2025

0.1.0

Mar 2, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

videoinstruct-0.1.9.tar.gz (766.2 kB view details)

Uploaded Mar 5, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

videoinstruct-0.1.9-py3-none-any.whl (39.1 kB view details)

Uploaded Mar 5, 2025 Python 3

File details

Details for the file videoinstruct-0.1.9.tar.gz.

File metadata

Download URL: videoinstruct-0.1.9.tar.gz
Upload date: Mar 5, 2025
Size: 766.2 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.9.6

File hashes

Hashes for videoinstruct-0.1.9.tar.gz
Algorithm	Hash digest
SHA256	`ff776001a5cac1f4d805ec1c31e19d0b470e13cb99b0d7132eca3683c27e7e18`
MD5	`8e169964a3ec73d11c9937bba17b5bf7`
BLAKE2b-256	`32727b20dbc3636b3c84f7dd569975c1837b840104dfc34f3e8a3fb38ebf9d37`

See more details on using hashes here.

File details

Details for the file videoinstruct-0.1.9-py3-none-any.whl.

File metadata

Download URL: videoinstruct-0.1.9-py3-none-any.whl
Upload date: Mar 5, 2025
Size: 39.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.9.6

File hashes

Hashes for videoinstruct-0.1.9-py3-none-any.whl
Algorithm	Hash digest
SHA256	`a35807dd08fe18cebd066e9a93236c75f1c3c00b510959879279f54c3f2caa32`
MD5	`a5fac577e5d944c15697c94f7ef8572e`
BLAKE2b-256	`2a11077893b9868468e9c0a84d498d3af68ed75f413da23691954ccafc05bc59`

See more details on using hashes here.

videoinstruct 0.1.9

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

VideoInstruct

Pipeline:

Quick Start

Using Docker (Recommended)

Using Python Package

Features

Installation Options

Project Structure

Using as a Python Package

Contributing

Troubleshooting

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes