Skip to main content

Get scenecuts from a video file using ffmpeg

Project description

Scenecut Extractor

PyPI version

Python package

Extract scenecuts from video files using ffmpeg.

This tool uses the select filter from ffmpeg to determine the scene cut probability of adjacent frames, and allows users to determine which frames (or at which timestamps) the scene cuts happen.

Note: Previous versions installed a scenecut_extractor executable. To harmonize it with other tools, now the executable is called scenecut-extractor. Please ensure you remove the old executable (e.g. run which scenecut_extractor and remove the file).

Author: Werner Robitza werner.robitza@gmail.com

Contents:

Requirements

  • Python 3.8 or higher
  • FFmpeg:
    • download a static build from their website)
    • put the ffmpeg executable in your $PATH

Installation

pip3 install --user scenecut_extractor

Or clone this repository, then run the tool with python3 -m scenecut_extractor.

Usage

Run:

scenecut-extractor <input-file>

This might take a while depending on the length of your input file, and then output a list of scene cuts in JSON format:

[
  {
    "frame": 114,
    "pts": 114.0,
    "pts_time": 3.8,
    "score": 0.445904
  },
  {
    "frame": 159,
    "pts": 159.0,
    "pts_time": 5.3,
    "score": 0.440126
  }
]

To extract the scene cuts, use the -x flag and optionally specify an output directory with -d:

scenecut-extractor <input-file> -x -d output-directory

This will create a directory called output-directory and put the extracted scenes in there. The filenames will be the same as the input file, but with the scene times appended to them.

Note: Cutting may not be frame-accurate. To be precise, you have to re-encode the video. Use the --no-copy flag to do this. The output will use libx264 encoding with CRF 23 to achieve a good balance between quality and file size. Future versions of this tool will allow you to specify your own encoding options.

Extended Usage

The command supports the following arguments and options, see scenecut-extractor -h:

usage: scenecut-extractor [-h] [-t THRESHOLD] [-o {all,frames,seconds}]
                   [-of {json,csv}] [-x] [-d OUTPUT_DIRECTORY] [--no-copy]
                   [-p] [-v]
                   input

scenecut_extractor v0.5.0

positional arguments:
  input                 input file

options:
  -h, --help            show this help message and exit
  -t THRESHOLD, --threshold THRESHOLD
                        threshold (between 0 and 1) (default: 0.3)
  -o {all,frames,seconds}, --output {all,frames,seconds}
                        output which information (default: all)
  -of {json,csv}, --output-format {json,csv}
                        output in which format (default: json)
  -x, --extract         extract the scene cuts (default: False)
  -d OUTPUT_DIRECTORY, --output-directory OUTPUT_DIRECTORY
                        Set the output directory. Default is the current
                        working directory. (default: None)
  --no-copy             Don't stream-copy, but re-encode the video. (default:
                        False)
  -p, --progress        Show a progress bar on stderr (default: False)
  -v, --verbose         Print verbose info to stderr (default: False)

You can use the -t parameter to set the threshold that ffmpeg internally uses (between 0 and 1) – if you set it to 0, all frames will be printed with their probabilities.

API

This program has a simple API that can be used to integrate it into other Python programs.

For more information see the API documentation.

Alternatives

For extended scene detection features such as automatic splitting or perceptual hashing, you may want to check out PySceneDetect.

License

scenecut_extractor, Copyright (c) 2018–2023 Werner Robitza

Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

scenecut_extractor-0.6.2.tar.gz (11.4 kB view details)

Uploaded Source

Built Distribution

scenecut_extractor-0.6.2-py2.py3-none-any.whl (9.9 kB view details)

Uploaded Python 2 Python 3

File details

Details for the file scenecut_extractor-0.6.2.tar.gz.

File metadata

  • Download URL: scenecut_extractor-0.6.2.tar.gz
  • Upload date:
  • Size: 11.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.0

File hashes

Hashes for scenecut_extractor-0.6.2.tar.gz
Algorithm Hash digest
SHA256 0f03fe974d289f39758468e1373971afcc29efa7dc68f9cc8efb125853b609f2
MD5 28c9daa76162c989702c623615360d06
BLAKE2b-256 b73adad0b08da91c0666c76cdfca70d9d90c2ac2832c7c028379976cf83bb5af

See more details on using hashes here.

File details

Details for the file scenecut_extractor-0.6.2-py2.py3-none-any.whl.

File metadata

File hashes

Hashes for scenecut_extractor-0.6.2-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 d830cb4c353d5d8fe973f101f80667087e97443c03e5790c9d07f143644e907f
MD5 c14fe09c4d2b16d1957a2e0db444f0bc
BLAKE2b-256 ccd66495707a789bf16c843f92f437dfab4243acdc14aab30f36c6515193ed09

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page