Videogrep is a command line tool that searches through dialog in video and audio files and makes supercuts based on what it finds. Like grep but for video.

These details have not been verified by PyPI

Project links

Project description

Videogrep

Videogrep is a command line tool that searches through dialog in video or audio files and makes supercuts based on what it finds. It will recognize .srt or .vtt subtitle tracks, or transcriptions that can be generated with vosk, pocketsphinx, and other tools.

Examples

Tutorial

See my blog for a short tutorial on videogrep and yt-dlp, and part 2, on videogrep and natural language processing.

Installation

Videogrep is compatible with Python versions 3.6 to 3.10.

To install:

pip install videogrep

If you want to transcribe video or audio, you also need to install vosk:

pip install vosk

Note: the previous version of videogrep supported pocketsphinx for speech-to-text. Vosk seems much better so I've added support for it and will likely be phasing out support for pocketsphinx.

Usage

The most basic use:

videogrep --input path/to/video.mp4 --search 'search phrase'

It works with audio too:

videogrep --input path/to/audio.mp3 --search 'search phrase'

You can put any regular expression in the search phrase.

NOTE: videogrep requires a matching subtitle track with each video you want to use. The video/audio file and subtitle file need to have the exact same name, up to the extension. For example, my_movie.mp4 and my_movie.srt will work, and my_movie.mp4 and my_movie_subtitle.srt will not work.

Videogrep will search for matching srt and vtt subtitles, as well as json transcript files that can be generated with the --transcribe argument.

Options

`--input [filename(s)] / -i [filename(s)]`

File or files to use as input. Most video or audio formats should work. If you mix audio and video input files, the resulting output will only be audio.

`--output [filename] / -o [filename]`

Name of the file to generate. By default this is supercut.mp4. Any standard video or audio extension will also work. (If you're using audio input or mixed audio and video input and you keep the default supercut.mp4 as the output filename, videogrep will automatically change the output to supercut.mp3)

Videogrep will also recognize the following extensions for saving files:

.mpv.edl: generates an edl file playable by mpv (useful for previews)
.m3u: media playlist
.xml: Final Cut Pro timeline, compatible with Adobe Premiere and Davinci Resolve

videogrep --input path/to/video --search 'search phrase' --output coolvid.mp4

`--search [query] / -s [query]`

Search term, as a regular expression. You can add as many of these as you want. For example:

videogrep --input path/to/video --search 'search phrase' --search 'another search' --search 'a third search' --output coolvid.mp4

`--search-type [type] / -st [type]`

Type of search you want to perform. There are two options:

sentence: (default): Generates clips containing the full sentences of your search query.
fragment: Generates clips containing the exact word or phrase of your search query.

Both options take regular expressions. You may only use the fragment search if your transcript has word-level timestamps, which will be the case for youtube .vtt files, or if you generated a transcript using Videogrep itself.

videogrep --input path/to/video --search 'experience' --search-type fragment

`--max-clips [num] / -m [num]`

Maximum number of clips to use for the supercut.

`--demo / -d`

Show the search results without making the supercut.

`--preview / -pr`

Preview the supercut in mpv (requires mpv to be installed)

`--randomize / -r`

Randomize the order of the clips.

`--padding [seconds] / -p [seconds]`

Padding in seconds to add to the start and end of each clip.

`--resyncsubs [seconds] / -rs [seconds]`

Time in seconds to shift the shift the subtitles forwards or backwards.

`--transcribe / -tr`

Transcribe the video/audio using vosk. This will generate a .json file in the same folder as the video. By default this uses vosk's small english model.

NOTE: Because of some compatibility issues, vosk must be installed separately with pip install vosk.

videogrep -i vid.mp4 --transcribe

`--model [modelpath] / -mo [modelpath]`

In combination with the --transcribe option, allows you to specify the path to a vosk model folder to use. Vosk models are available here in a variety of languages.

videogrep -i vid.mp4 --transcribe --model path/to/model/

`--export-clips / -ec`

Exports clips as individual files rather than as a supercut.

videogrep -i vid.mp4 --search 'whatever' --export-clips

`--export-vtt / -ev`

Exports the transcript of the supercut as a WebVTT file next to the video.

videogrep -i vid.mp4 --search 'whatever' --export-vtt

`--ngrams [num] / -n [num]`

Shows common words and phrases from the video or audio file.

videogrep -i vid.mp4 --ngrams 1

Use it as a module

from videogrep import videogrep

videogrep('path/to/your/files','output_file_name.mp4', 'search_term', 'search_type')

The videogrep module accepts the same parameters as the command line script. To see the usage check out the source.

Example Scripts

Also see the examples folder for:

Credits

Videogrep is maintained by Sam Lavigne, and built using MoviePy and Vosk. A big thanks goes out to all those who have contributed, particuarly to Charlie Macquarie for his efforts in getting the project to work with audio-only media.

Videogrep has received financial support from the Department of Digital Humanities, King’s College London and from the Clinic for Open Source Arts.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

2.3.0

Apr 19, 2024

2.2.1

Nov 2, 2023

2.2.0

Oct 31, 2023

2.1.3

May 11, 2023

2.1.2

Jul 12, 2022

2.1.1

Jun 25, 2022

2.1.0

May 30, 2022

2.0.1

May 22, 2022

2.0.0

May 21, 2022

0.5.8

Jun 2, 2018

0.5.6

Feb 9, 2018

0.5.5

Feb 9, 2018

0.5.3

Feb 8, 2018

0.5.2

Feb 8, 2018

0.5.1

Feb 4, 2018

0.5.0

Sep 30, 2016

0.4.4

Jun 30, 2015

0.4.3

Jun 28, 2015

0.4.2

Jun 28, 2015

0.4.1

Jun 28, 2015

0.4

Jun 28, 2015

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

videogrep-2.3.0.tar.gz (41.1 MB view details)

Uploaded Apr 19, 2024 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

videogrep-2.3.0-py3-none-any.whl (41.2 MB view details)

Uploaded Apr 19, 2024 Python 3

File details

Details for the file videogrep-2.3.0.tar.gz.

File metadata

Download URL: videogrep-2.3.0.tar.gz
Upload date: Apr 19, 2024
Size: 41.1 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: poetry/1.7.1 CPython/3.11.6 Darwin/23.2.0

File hashes

Hashes for videogrep-2.3.0.tar.gz
Algorithm	Hash digest
SHA256	`180a4bd2ea8ba5566f59acf94edbbcb03514f745b9818594ba9bfd75bb4c4086`
MD5	`dfbce4d34cdea4aa42b2bb8d8580b165`
BLAKE2b-256	`6fb2314193adada9800b724bbf6c959f64d3e01d912d4701f75160c899a6a605`

See more details on using hashes here.

File details

Details for the file videogrep-2.3.0-py3-none-any.whl.

File metadata

Download URL: videogrep-2.3.0-py3-none-any.whl
Upload date: Apr 19, 2024
Size: 41.2 MB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: poetry/1.7.1 CPython/3.11.6 Darwin/23.2.0

File hashes

Hashes for videogrep-2.3.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`471edd50cb1d0c1eb6a525e2729d4d4a0751eb729c8cb73b794ab2dbd2f10a59`
MD5	`9f9da866f5191192597acf9b9e085950`
BLAKE2b-256	`d58fa3ca788f9550cef044051a297dce33d5a0d459cd97c0ae9f410037f38140`

See more details on using hashes here.

videogrep 2.3.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Videogrep

Examples

Tutorial

Installation

Usage

Options

--input [filename(s)] / -i [filename(s)]

--output [filename] / -o [filename]

--search [query] / -s [query]

--search-type [type] / -st [type]

--max-clips [num] / -m [num]

--demo / -d

--preview / -pr

--randomize / -r

--padding [seconds] / -p [seconds]

--resyncsubs [seconds] / -rs [seconds]

--transcribe / -tr

--model [modelpath] / -mo [modelpath]

--export-clips / -ec

--export-vtt / -ev

--ngrams [num] / -n [num]

Use it as a module

Example Scripts

Credits

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

`--input [filename(s)] / -i [filename(s)]`

`--output [filename] / -o [filename]`

`--search [query] / -s [query]`

`--search-type [type] / -st [type]`

`--max-clips [num] / -m [num]`

`--demo / -d`

`--preview / -pr`

`--randomize / -r`

`--padding [seconds] / -p [seconds]`

`--resyncsubs [seconds] / -rs [seconds]`

`--transcribe / -tr`

`--model [modelpath] / -mo [modelpath]`

`--export-clips / -ec`

`--export-vtt / -ev`

`--ngrams [num] / -n [num]`