Separate audio into stems using AI (Demucs)

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

Su1ph3r

These details have not been verified by PyPI

Project description

Stem Separator

Separates audio into vocals, drums, bass, and other instruments using Demucs by Meta Research.

Features

AI-Powered Separation: Uses state-of-the-art Demucs models for high-quality stem separation
Multiple Models: Support for 4-stem and 6-stem (guitar + piano) models
Multiple Formats: Output as WAV, MP3, FLAC, OGG, or AAC
YouTube Support: Download and process YouTube videos directly, including playlists
Spotify Support: Download and process Spotify tracks (requires spotdl)
Batch Processing: Process entire directories of audio files
Custom Remixing: Mix stems with custom volume levels
Audio Preview: Preview separated stems before saving
GPU Acceleration: Automatic GPU detection for faster processing
Metadata Preservation: Preserve ID3 tags from source files
Configuration File: Save your preferences in ~/.stem-separator.yaml

Installation

Quick Install

Windows:

install.bat

Linux/macOS:

chmod +x install.sh && ./install.sh

Manual Install

Install FFmpeg (required):

# Windows
winget install FFmpeg.FFmpeg

# macOS
brew install ffmpeg

# Linux (Debian/Ubuntu)
sudo apt install ffmpeg

Install Python packages:

pip install demucs yt-dlp soundfile scipy pyyaml rich

Optional dependencies:

# Spotify support
pip install spotdl

# Audio preview
pip install sounddevice

# Enhanced metadata handling
pip install mutagen

Install as Package (Recommended)

pip install -e .

This enables:

Running as stem-separator song.mp3 or stems song.mp3
Running as python -m stem_separator song.mp3

Docker (Recommended for Easy Setup)

Run with Docker for the easiest setup - no dependencies to install:

# CPU mode (works on any machine)
docker compose --profile cpu up -d

# GPU mode (requires NVIDIA GPU + Container Toolkit)
docker compose --profile gpu up -d

# Access the web UI at http://localhost:8080

See DOCKER.md for complete Docker documentation, including:

Building custom images
Volume mounting for input/output
Model caching
CLI usage in Docker
API documentation

Usage

Basic Usage

# Process a local file
python stem_separator.py song.mp3

# YouTube URL
python stem_separator.py "https://www.youtube.com/watch?v=VIDEO_ID"

# Specify output folder
python stem_separator.py song.mp3 -o ./output

# Use 6-stem model (adds guitar + piano separation)
python stem_separator.py song.mp3 --model htdemucs_6s

# Export as MP3
python stem_separator.py song.mp3 --format mp3

# Extract only specific stems
python stem_separator.py song.mp3 --stems vocals,drums

# Create karaoke version (everything except vocals)
python stem_separator.py song.mp3 --stems karaoke --format mp3

# Extract acapella (vocals only)
python stem_separator.py song.mp3 --stems acapella

Batch Processing

# Process all audio files in a directory
python stem_separator.py ./music_folder --batch -o ./stems

# Process recursively
python stem_separator.py ./music_folder --batch --recursive

# Dry run (see what would be processed)
python stem_separator.py ./music_folder --batch --dry-run

YouTube Playlists

# Process an entire playlist
python stem_separator.py "https://youtube.com/playlist?list=..." --playlist

# Resume interrupted playlist download
python stem_separator.py "https://youtube.com/playlist?list=..." --playlist

Spotify Support

# Single track
python stem_separator.py "https://open.spotify.com/track/..."

# Playlist or album
python stem_separator.py "https://open.spotify.com/playlist/..." --playlist

Custom Remixing

# Create a custom mix with volume control
python stem_separator.py song.mp3 --remix "vocals:0.5,drums:1.0,bass:0.8"

# Process then remix
python stem_separator.py song.mp3 --remix "vocals:0,drums:1.5,bass:1.2,other:0.5"

Audio Preview

# Preview stems interactively after processing
python stem_separator.py song.mp3 --preview

Quality Analysis

# Analyze separation quality
python stem_separator.py song.mp3 --quality

Options Reference

Output Options

Option	Description
`-o, --output`	Output directory (default: current)
`--format`	Output format: `wav`, `mp3`, `flac`, `ogg`, `aac` (default: wav)
`--naming`	Output naming template (default: `{name}_{stem}`)

Model Options

Option	Description
`--model`	AI model: `htdemucs`, `htdemucs_ft`, `htdemucs_6s` (default: htdemucs)
`--stems`	Stems to export: comma-separated or preset

Processing Options

Option	Description
`--cpu`	Force CPU mode (skip GPU)
`--normalize`	Normalize audio levels in output stems
`--low-memory`	Low memory mode for very long tracks
`--quality`	Analyze and report stem separation quality

Batch Options

Option	Description
`--batch`	Process directory of files
`-r, --recursive`	Search directories recursively
`-j, --parallel N`	Number of parallel jobs

Source Options

Option	Description
`--playlist`	Process YouTube/Spotify playlist
`--browser`	Use browser cookies (chrome, firefox, edge, safari)

Other Options

Option	Description
`--remix`	Remix stems with volume control
`--preview`	Preview stems interactively
`--no-metadata`	Don't preserve metadata
`-v, --verbose`	Verbose output
`-q, --quiet`	Quiet mode
`--dry-run`	Show what would be done
`--version`	Show version

Models

Model	Stems	Description
`htdemucs`	4	Default model (vocals, drums, bass, other)
`htdemucs_ft`	4	Fine-tuned version (better quality)
`htdemucs_6s`	6	Adds guitar and piano separation

Stem Presets

Preset	Description
`all`	All stems (default)
`karaoke`	Everything except vocals (instrumental)
`acapella`	Vocals only
`instrumental`	Same as karaoke

Configuration File

Create ~/.stem-separator.yaml to set defaults:

# Default settings
model: htdemucs
format: wav
output_dir: .
normalize: false
naming_template: "{name}_{stem}"

# Processing
cpu: false
low_memory: false

# YouTube
browser: chrome

# Output
verbose: false
quiet: false

Generate a sample config:

python stem_separator.py --generate-config > ~/.stem-separator.yaml

Output

Creates a folder named {song}_stems containing the separated stems.

4-stem model output:

vocals - Singing/voice
drums - Drums and percussion
bass - Bass
other - Guitar, piano, synths, etc.

6-stem model output (htdemucs_6s):

All of the above, plus:
guitar - Guitar
piano - Piano

GPU Support

The script automatically tries GPU first and falls back to CPU if needed.

NVIDIA GPUs

# Standard CUDA support
pip install torch torchaudio --index-url https://download.pytorch.org/whl/cu121

# RTX 5090 / Blackwell GPUs (requires nightly)
pip install --pre torch torchaudio --index-url https://download.pytorch.org/whl/nightly/cu128

Apple Silicon (M1/M2/M3)

pip install torch torchaudio
# MPS backend is used automatically

Performance

Setup	Time per Song
GPU (CUDA)	~30 seconds
Apple Silicon (MPS)	~1 minute
CPU	2-4 minutes

Web UI

The Docker container includes a modern web interface at http://localhost:8080 with:

File Upload: Drag & drop audio files (MP3, WAV, FLAC, OGG, AAC, M4A)
URL Processing: Paste YouTube or Spotify URLs directly
Playlist Support: Process entire playlists
All CLI Options: Model selection, output format, stem selection, etc.
Real-time Progress: Live progress updates via WebSocket
Easy Downloads: Download individual stems or all as ZIP

Web UI Screenshot

API Usage

from stem_separator import separate_audio, StemSeparator, remix_stems

# Simple usage
result = separate_audio(
    input_file="song.mp3",
    output_dir="./output",
    model_name="htdemucs",
    output_format="mp3",
)

# With model pre-loading for batch
separator = StemSeparator(model_name="htdemucs_6s")
separator.load_model()

for song in songs:
    result = separator.separate(song, output_dir)

separator.unload_model()

# Remix stems
result = remix_stems(
    stems_dir="./output/song_stems",
    output_path="./remix.mp3",
    mix_components="vocals:0.5,drums:1.0,bass:0.8",
)

Troubleshooting

YouTube 403 Errors

python stem_separator.py URL --browser edge

GPU Not Working

Check CUDA is installed: nvidia-smi
Check PyTorch CUDA: python -c "import torch; print(torch.cuda.is_available())"
For newest GPUs, install nightly PyTorch

Memory Issues

# Use low-memory mode for long tracks
python stem_separator.py long_song.mp3 --low-memory

License

MIT License

Acknowledgments

Demucs by Meta Research
yt-dlp for YouTube downloads
spotdl for Spotify downloads

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

Su1ph3r

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

2.0.0

May 27, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

stem_separator-2.0.0.tar.gz (46.6 kB view details)

Uploaded May 27, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

stem_separator-2.0.0-py3-none-any.whl (44.9 kB view details)

Uploaded May 27, 2026 Python 3

File details

Details for the file stem_separator-2.0.0.tar.gz.

File metadata

Download URL: stem_separator-2.0.0.tar.gz
Upload date: May 27, 2026
Size: 46.6 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for stem_separator-2.0.0.tar.gz
Algorithm	Hash digest
SHA256	`57f9075d15bc5430168e4456dee5f40ec54924ce3e347a63c863455f3e89d411`
MD5	`2f8d33dd6a5326e0cc35a5267b342603`
BLAKE2b-256	`4b75691fafff02ce7f507a1e7306ada3b5532861625b7eede3a74cf07e4681ed`

See more details on using hashes here.

Provenance

The following attestation bundles were made for stem_separator-2.0.0.tar.gz:

Publisher: release.yml on Su1ph3r/stem-separator

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: stem_separator-2.0.0.tar.gz
- Subject digest: 57f9075d15bc5430168e4456dee5f40ec54924ce3e347a63c863455f3e89d411
- Sigstore transparency entry: 1647374692
- Sigstore integration time: May 27, 2026
Source repository:
- Permalink: Su1ph3r/stem-separator@a2ff4145d30579cb06c93efce9d7938b6452cf6b
- Branch / Tag: refs/tags/v2.0.0
- Owner: https://github.com/Su1ph3r
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@a2ff4145d30579cb06c93efce9d7938b6452cf6b
- Trigger Event: push

File details

Details for the file stem_separator-2.0.0-py3-none-any.whl.

File metadata

Download URL: stem_separator-2.0.0-py3-none-any.whl
Upload date: May 27, 2026
Size: 44.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for stem_separator-2.0.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`773e7d39ee5f13e6b5858295a2b462c9ada24b0033f6ef2853c1fc07a9c46166`
MD5	`4506a6a48cc5a19fb5be0a651a5d82ac`
BLAKE2b-256	`26addfd707565b899b9ac21f81b73f8dbbd3bd254709c9f9ca679e693909719a`

See more details on using hashes here.

Provenance

The following attestation bundles were made for stem_separator-2.0.0-py3-none-any.whl:

Publisher: release.yml on Su1ph3r/stem-separator

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: stem_separator-2.0.0-py3-none-any.whl
- Subject digest: 773e7d39ee5f13e6b5858295a2b462c9ada24b0033f6ef2853c1fc07a9c46166
- Sigstore transparency entry: 1647374775
- Sigstore integration time: May 27, 2026
Source repository:
- Permalink: Su1ph3r/stem-separator@a2ff4145d30579cb06c93efce9d7938b6452cf6b
- Branch / Tag: refs/tags/v2.0.0
- Owner: https://github.com/Su1ph3r
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@a2ff4145d30579cb06c93efce9d7938b6452cf6b
- Trigger Event: push

stem-separator 2.0.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

Stem Separator

Features

Installation

Quick Install

Manual Install

Install as Package (Recommended)

Docker (Recommended for Easy Setup)

Usage

Basic Usage

Batch Processing

YouTube Playlists

Spotify Support

Custom Remixing

Audio Preview

Quality Analysis

Options Reference

Output Options

Model Options

Processing Options

Batch Options

Source Options

Other Options

Models

Stem Presets

Configuration File

Output

GPU Support

NVIDIA GPUs

Apple Silicon (M1/M2/M3)

Performance

Web UI

API Usage

Troubleshooting

YouTube 403 Errors

GPU Not Working

Memory Issues

License

Acknowledgments

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance