Skip to main content

Find the intro of episodes of a series

Project description

SeriesIntroRecognizer

Comparing episodes of a series to find the opening/endings of the series.

This library receives a list of episodes, extracts the audio features of each episode and compares them to find the common part of the series.

To reduce the number of comparisons, the library compares 4 sequential episodes. The number of episodes to be compared can be changed in configuration.

Installation

The project uses Cupy. It supports both Cuda and AMD GPUs. Please refer to the Cupy installation guide to install the correct version for your system.

pip install series_intro_recognizer

Options

Input options:

  • Iterator of audio samples (numpy ndarrays), offset and duration
  • Iterator of audio files
  • Iterator of audio files, offset and duration

Offset and duration are used to extract the audio features of the episodes and can be None. If None, the whole audio will be used.

WARN: See the warning #1 in the Usage section.

Output:

  • List of intervals of the same fragment in the episodes

To find an opening, pass the first minutes (e.g. 5 minutes) of the episodes. To find an ending, pass the last minutes (e.g. 5 minutes) of the episodes.

WARN: See the warning #2 in the Usage section.

Configuration:

Every call of methods should pass a configuration object. Feel free to just pass the default configuration.

The configuration object has the following fields:

  • rate: Audio sample rate.
  • min_segment_length_sec: Minimum length of the intro in seconds.
  • max_segment_length_sec: Maximum length of the intro in seconds.
  • precision_secs: Precision of the correlation in seconds.
  • series_window: Number of sequential audio samples to be matched with each other.
  • offset_searcher__sequential_secs: Number of sequential 'non-intro' seconds that signal the end of the intro.
  • adjustment_threshold: Threshold for adjusting the borders of the intro.
  • adjustment_threshold_secs: Threshold for adjusting the borders of the intro to the borders of the audio.
  • save_intermediate_results: Save the correlation results.

Important warnings

WARNING #1: Do not pass the whole episodes, it will take a long time to process and the results will not be accurate (in this case, the library will find the intro or outro, depending on which part is longer).

WARNING #2: If you use recognise_from_audio_files_with_offsets and offsets are provided, the library WILL NOT add it to the output intervals. Please, add it manually.

WARNING #3: There is a well-known memory leak in sklearn. You will see a warning when running the app. The memory leak should not be a problem for a short run, but it is recommended to add OMP_NUM_THREADS=1 to the environment variables.

Usage

You can find other examples in the tests/processors folder.

To find the opening of a series by audio paths:

cfg = Config()
files = [f'assets/audio_files/{i}.wav' for i in paths]
recognised = recognise_from_audio_files(files, cfg)
# Returns a list of intervals
# [(start=0.25, end=30.25), (start=0.20, end=30.20), ...]

To find the opening of a series by audio samples:

def some_audio_loading_function() -> np.ndarray:
    return np.random.rand(1000)


cfg = Config()
samples = map(lambda _: (some_audio_loading_funciton(), None, None), range(10))
recognised = recognise_from_audio_samples(samples, cfg)
# Returns a list of intervals
# [(start=0.25, end=30.25), (start=0.20, end=30.20), ...]

To find the ending of a series by audio paths, analysing intervals from 1 to 35 seconds:

cfg = Config()
files = [(f'assets/audio_files/{i}.wav', 2, 30) for i in paths]
recognised = recognise_from_audio_files(files, cfg)
# Returns a list of intervals
# [(start=0, end=28.25), (start=0, end=28.20), ...]

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

series_intro_recognizer-0.5.0.tar.gz (13.7 kB view details)

Uploaded Source

Built Distribution

series_intro_recognizer-0.5.0-py3-none-any.whl (16.6 kB view details)

Uploaded Python 3

File details

Details for the file series_intro_recognizer-0.5.0.tar.gz.

File metadata

File hashes

Hashes for series_intro_recognizer-0.5.0.tar.gz
Algorithm Hash digest
SHA256 705b4d00680e9d47be93c649a66836320d90e676e1e3e1278bdb96590263c56a
MD5 a7ca0101f5d54ad0dcbfa8b1da383877
BLAKE2b-256 e60fc6068286d815373fd10fdd3f452aaf007c4dbecc8582b4f5fab5309aff3e

See more details on using hashes here.

File details

Details for the file series_intro_recognizer-0.5.0-py3-none-any.whl.

File metadata

File hashes

Hashes for series_intro_recognizer-0.5.0-py3-none-any.whl
Algorithm Hash digest
SHA256 dc7a1c9239ef5828cd1de3d7b3289c5614654530c9b476c8f64c4e407387684b
MD5 11fe6563ba90a51e607f3e0ac66e2471
BLAKE2b-256 000f17722058cce837037c481c4dc6788628f6a6dbd5953705ffa39af9ac1762

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page