A tiny library for parsing, modifying, and composing SRT files.
srt is a tiny but featureful Python library for parsing, modifying, and composing SRT files. Take a look at the quickstart for a basic overview of the library. Detailed API documentation is also available.
Why choose this library?
- Can parse many broken SRT files which other SRT libraries cannot, and fix them
- Extremely lightweight, ~150 lines of code excluding docstrings
- Simple, intuitive API
- High quality test suite using Hypothesis
- 100% test coverage (including branches)
- Well documented API, at both a high and low level
- ~30% faster than pysrt on typical workloads
- Full support for PyPy
- No dependencies outside of the standard library
- Tolerant of many common errors found in real-world SRT files
- Support for Asian-style SRT formats (ie. “fullwidth” SRT format)
- Completely Unicode compliant
- Released under a highly permissive license (MIT)
- Real world tested — used in production to process thousands of SRT files every day
- Portable — runs on Linux, OSX, and Windows
- Tools included — contains lightweight tools to perform generic tasks with the library
$ cat pe.srt 1 00:00:33,843 --> 00:00:38,097 Only 3% of the water on our planet is fresh. 地球上只有3%的水是淡水 2 00:00:40,641 --> 00:00:44,687 Yet, these precious waters are rich with surprise. 可是这些珍贵的淡水中却充满了惊奇 $ srt lines-matching -m hanzidentifier -f hanzidentifier.has_chinese -i pe.srt 1 00:00:33,843 --> 00:00:38,097 地球上只有3%的水是淡水 2 00:00:40,641 --> 00:00:44,687 可是这些珍贵的淡水中却充满了惊奇
These tools are easy to chain together, for example, say you have one subtitle with Chinese and English, and other with French, but you want Chinese and French only. Oh, and the Chinese one is 5 seconds later than it should be. That’s easy enough to sort out:
$ srt lines-matching -m hanzidentifier -f hanzidentifier.has_chinese -i chs+eng.srt | > srt fixed-timeshift --seconds -5 | > srt mux --input - --input fra.srt
See the srt_tools/ directory for more information.
Detailed API documentation is available, but here are the basics.
Here’s how you convert SRT input to Subtitle objects which you can manipulate:
>>> data = '''\ 1 00:00:33,843 --> 00:00:38,097 地球上只有3%的水是淡水 2 00:00:40,641 --> 00:00:44,687 可是这些珍贵的淡水中却充满了惊奇 3 00:00:57,908 --> 00:01:03,414 所有陆地生命归根结底都依赖於淡水 ''' >>> for sub in srt.parse(data): ... print(sub) Subtitle(index=1, start=datetime.timedelta(seconds=33, microseconds=843000), end=datetime.timedelta(seconds=38, microseconds=97000), content='地球上只有3%的水是淡水', proprietary='') Subtitle(index=2, start=datetime.timedelta(seconds=40, microseconds=641000), end=datetime.timedelta(seconds=44, microseconds=687000), content='可是这些珍贵的淡水中却充满了惊奇', proprietary='') Subtitle(index=3, start=datetime.timedelta(seconds=57, microseconds=908000), end=datetime.timedelta(seconds=63, microseconds=414000), content='所有陆地生命归根结底都依赖於淡水', proprietary='')
And here’s how you go back from Subtitle objects to SRT output:
>>> subs = list(srt.parse(data)) >>> subs.content = "Changing subtitle data is easy!" >>> print(srt.compose(subs)) 1 00:00:33,843 --> 00:00:38,097 地球上只有3%的水是淡水 2 00:00:40,641 --> 00:00:44,687 Changing subtitle data is easy! 3 00:00:57,908 --> 00:01:03,414 所有陆地生命归根结底都依赖於淡水
To install the latest stable version from PyPi:
pip install -U srt
To install the latest development version directly from GitHub:
pip install -U git+https://github.com/cdown/srt.git@develop
Release history Release notifications | RSS feed
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.