Skip to main content

Transcript to subtitles

Project description

Automatic Subtitle Placement

This contains code that allows users to download videos (or just audio) from youtube as well as their respective transcripts for training data. The model itself is able to place time stamps onto a transcript for any given video.

Features

  • Machine learning model for voice activity detection (not recognition)
  • Generates timestamps for transcript

Dependencies

Help

usage: tosync [-h] [--version] [--graph] [-d SECONDS] [-m SECONDS] [-s]
                   [--logfile PATH]
                   MEDIA [MEDIA ...]

positional arguments:
  MEDIA                 media for which to synchronize subtitles

optional arguments:
  -h, --help            show this help message and exit

Special thanks

[1] tympanix/subsync whose code was invaluable

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tosync-1.0.2.tar.gz (9.2 kB view hashes)

Uploaded Source

Built Distribution

tosync-1.0.2-py3-none-any.whl (21.1 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page