A Python library for streaming audio/video files using FFmpeg with automatic resampling and channel mixing

Project description

ffmpeg-audio

A Python library for streaming audio/video files using FFmpeg with automatic resampling and channel mixing.

Features

Streaming audio reading: Stream large audio/video files in chunks without loading everything into memory
Automatic resampling: Automatically resamples audio to 16kHz (configurable)
Channel mixing: Automatically converts to mono channel
Segment reading: Read specific time segments from audio files
Format support: Supports all audio/video formats that FFmpeg supports (MP3, WAV, FLAC, Opus, MP4, etc.)

Installation

pip install ffmpeg-audio

Note: This package requires FFmpeg to be installed on your system. Make sure FFmpeg is available in your PATH.

Quick Start

Streaming Audio

from ffmpeg_audio import AudioStreamer
import numpy as np

# Stream audio file in chunks
for chunk in AudioStreamer.stream("audio.mp3"):
    # chunk is a numpy array (float32, range -1.0 ~ 1.0)
    # Process chunk here
    print(f"Chunk shape: {chunk.shape}, dtype: {chunk.dtype}")

Reading Audio Segments

from ffmpeg_audio import AudioSegmentReader

# Read a specific time segment (from 10s to 15s)
audio_data = AudioSegmentReader.read(
    file_path="audio.mp3",
    start_ms=10000,  # 10 seconds
    duration_ms=5000,  # 5 seconds
)

# audio_data is a numpy array (float32, range -1.0 ~ 1.0, 16kHz mono)
print(f"Audio shape: {audio_data.shape}, sample rate: {AudioSegmentReader.SAMPLE_RATE} Hz")

API Reference

AudioStreamer

Main class for streaming audio/video files.

`AudioStreamer.stream(file_path, chunk_duration_sec=None)`

Stream audio file in chunks.

Parameters:

file_path (str): Path to the audio/video file
chunk_duration_sec (int, optional): Duration of each chunk in seconds. Defaults to 1200 (20 minutes).

Yields:

np.ndarray: Audio chunk as float32 numpy array (range -1.0 ~ 1.0, 16kHz mono)

Raises:

TypeError: If chunk_duration_sec is not an int or None
ValueError: If file_path is empty or invalid
FFmpegNotFoundError: If FFmpeg is not installed or not in PATH
FileNotFoundError: If the audio file does not exist
PermissionError: If permission is denied accessing the file
UnsupportedFormatError: If the file format is not supported or invalid
FFmpegAudioError: If FFmpeg process fails (for other errors)

Constants:

AudioStreamer.SAMPLE_RATE = 16000: Output sample rate (Hz)
AudioStreamer.AUDIO_CHANNELS = 1: Output channel count (mono)
AudioStreamer.STREAM_CHUNK_DURATION_SEC = 1200: Default chunk duration (seconds)

AudioSegmentReader

Class for reading specific time segments from audio files.

`AudioSegmentReader.read(file_path, start_ms, duration_ms)`

Read a specific time segment from an audio file.

Parameters:

file_path (str): Path to the audio/video file
start_ms (int): Start time in milliseconds (must be >= 0)
duration_ms (int): Duration in milliseconds (must be > 0)

Returns:

np.ndarray: Audio data as float32 numpy array (range -1.0 ~ 1.0, 16kHz mono)

Raises:

FileNotFoundError: If file doesn't exist
ValueError: If time parameters are invalid
FFmpegNotFoundError: If FFmpeg is not installed or not in PATH
FFmpegAudioError: If FFmpeg processing fails

Constants:

AudioSegmentReader.SAMPLE_RATE = 16000: Output sample rate (Hz)

Exceptions

`FFmpegNotFoundError`

Raised when FFmpeg is not installed or not available in PATH.

Attributes:

message: Error message

`FFmpegAudioError`

Raised when FFmpeg audio processing fails.

Attributes:

message: Error message
file_path: Path to the file that caused the error (optional)
returncode: FFmpeg process return code (optional)
stderr: FFmpeg standard error output (optional)

`UnsupportedFormatError`

Raised when the audio file format is not supported or contains invalid data.

Attributes:

message: Error message
file_path: Path to the file that caused the error (optional)
returncode: FFmpeg process return code (optional)
stderr: FFmpeg standard error output (optional)

Requirements

Python >= 3.10
FFmpeg (must be installed separately)
numpy >= 1.26.4

License

MIT License

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Project details

Release history Release notifications | RSS feed

1.1.0

Mar 24, 2026

1.0.1

Mar 24, 2026

1.0.0

Jan 11, 2026

0.3.0

Jan 6, 2026

0.2.0

Jan 1, 2026

0.1.4

Jan 1, 2026

0.1.3

Dec 30, 2025

0.1.2

Dec 30, 2025

0.1.1

Dec 30, 2025

This version

0.1.0

Dec 30, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ffmpeg_audio-0.1.0.tar.gz (8.7 kB view details)

Uploaded Dec 30, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

ffmpeg_audio-0.1.0-py3-none-any.whl (3.5 kB view details)

Uploaded Dec 30, 2025 Python 3

File details

Details for the file ffmpeg_audio-0.1.0.tar.gz.

File metadata

Download URL: ffmpeg_audio-0.1.0.tar.gz
Upload date: Dec 30, 2025
Size: 8.7 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.10.19

File hashes

Hashes for ffmpeg_audio-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`5f072bd719ad42ec63f98f9d9748d31a08f4f2563a0da8f3c786bb2eacf10356`
MD5	`1a97bd76b7b8dc278fb33feb80a591e9`
BLAKE2b-256	`567ca80cbebc94c458ee8bc3bd589b864b937a5eee70a248746aae86870c5523`

See more details on using hashes here.

File details

Details for the file ffmpeg_audio-0.1.0-py3-none-any.whl.

File metadata

Download URL: ffmpeg_audio-0.1.0-py3-none-any.whl
Upload date: Dec 30, 2025
Size: 3.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.10.19

File hashes

Hashes for ffmpeg_audio-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`4aed4ba25f659053c6a085222fe3f792c36d8b840bc251ad86af9db6da54de93`
MD5	`d108f71981dc1466eed3bd38d8856fcc`
BLAKE2b-256	`6f6f98d482b0bf96b031ecb74048b5fa6c53e7c5ba3d1537498d17b1938692e9`

See more details on using hashes here.

ffmpeg-audio 0.1.0

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

ffmpeg-audio

Features

Installation

Quick Start

Streaming Audio

Reading Audio Segments

API Reference

AudioStreamer

`AudioStreamer.stream(file_path, chunk_duration_sec=None)`

AudioSegmentReader

`AudioSegmentReader.read(file_path, start_ms, duration_ms)`

Exceptions

`FFmpegNotFoundError`

`FFmpegAudioError`

`UnsupportedFormatError`

Requirements

License

Contributing

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes