Official Python SDK for Cleanvoice AI audio processing

These details have not been verified by PyPI

Project links

Project description

Cleanvoice Python SDK

AI-powered audio and video enhancement — remove fillers, clean noise, transcribe, and summarize from Python in one call.

pip install cleanvoice-sdk

Quick Start
Authentication
Common Patterns
Examples
File Upload & Download
API Reference
Configuration Options
Local Media Utilities
Error Handling
Supported Formats
Network Resilience
Development

Quick Start

from cleanvoice import Cleanvoice

client = Cleanvoice.from_env()  # reads CLEANVOICE_API_KEY

result = client.process(
    "https://example.com/podcast.mp3",
    fillers=True,
    normalize=True,
    studio_sound=True,
    summarize=True,
    output_path="podcast_clean.wav",
)

print(result.audio.url)              # remote download URL
print(result.audio.local_path)       # local saved file
print(result.transcript.summary)     # AI-generated summary

Authentication

Get your API key from the Cleanvoice Dashboard.

Option A — environment variable (recommended)

export CLEANVOICE_API_KEY="your-api-key-here"

from cleanvoice import Cleanvoice

client = Cleanvoice.from_env()

check_auth() mirrors the current API response shape:

account = client.check_auth()
print(account["credit"]["total"])
print(account["credit"]["subscription"])
print(account["meta"])

Option B — explicit constructor

from cleanvoice import Cleanvoice

client = Cleanvoice(
    api_key="your-api-key-here",
    base_url="https://api.cleanvoice.ai/v2",  # optional
    timeout=60,                                # optional
)

Common Patterns

Choose the pattern that fits your workflow:

Pattern	When to use
Process + save in one call	Backend jobs, scripts, CLI tools
Process first, download later	When you want to inspect metadata before saving
Async	FastAPI, async workers, high-concurrency services

# 1. Process and save in one call (recommended for most backends)
result = client.process(
    "recording.mp3",
    normalize=True,
    studio_sound=True,
    output_path="cleaned.wav",        # SDK uploads → waits → downloads → saves
)
print(result.audio.local_path)

# 2. Process first, download later
result = client.process("recording.mp3", normalize=True)
saved = result.audio.download("cleaned.wav")

# 3. Async
result = await async_client.process(
    "recording.mp3",
    normalize=True,
    output_path="cleaned.wav",
)

Tip: output_path=... is the lowest-friction option for backend jobs — the SDK handles upload, polling, and download, returning a ready-to-use local path in result.audio.local_path.

Examples

Basic Audio Cleaning

from cleanvoice import Cleanvoice

client = Cleanvoice.from_env()

result = client.process(
    "https://example.com/podcast.mp3",
    fillers=True,
    long_silences=True,
    normalize=True,
    remove_noise=True,
)

print(f"Cleaned audio: {result.audio.url}")
print(f"Removed {result.audio.statistics.FILLER_SOUND} filler sounds")

Transcription & Summary

from cleanvoice import Cleanvoice

client = Cleanvoice.from_env()

result = client.process(
    "https://example.com/interview.wav",
    summarize=True,    # auto-enables transcription
    normalize=True,
)

print("Title:   ", result.transcript.title)
print("Summary: ", result.transcript.summary)
print("Chapters:", result.transcript.chapters)

summarize=True automatically enables transcription. social_content=True automatically enables summarize.

Video Processing

from cleanvoice import Cleanvoice

client = Cleanvoice.from_env()

result = client.process(
    "https://example.com/video.mp4",
    studio_sound=True,
    remove_noise=True,
    transcription=True,
    output_path="processed_video.mp4",
)

print("Media type:", "video" if result.is_video else "audio")
print("Remote URL:", result.media.url)
print("Saved to:  ", result.media.local_path)

When the SDK detects a video extension (.mp4, .mov, etc.) it automatically sets video=True and emits a warning so you know the returned asset will be a video file.

In-Memory Audio (NumPy / librosa)

Pass a (audio_array, sample_rate) tuple directly — the SDK writes a temporary WAV, uploads it, and continues normally.

import librosa
from cleanvoice import Cleanvoice

client = Cleanvoice.from_env()

audio, sr = librosa.load("recording.wav", sr=None, mono=True)

result = client.process(
    (audio, sr),
    studio_sound=True,
    remove_noise=True,
    output_path="processed.mp3",
)

# Download back as a NumPy array
audio_out, sr_out = result.download_audio(as_numpy=True)

Async Usage

import asyncio
from cleanvoice import AsyncCleanvoice

async def main():
    async with AsyncCleanvoice.from_env() as client:
        result = await client.process(
            "https://example.com/audio.mp3",
            normalize=True,
            studio_sound=True,
            output_path="output.wav",
        )
        print(result.audio.local_path)

asyncio.run(main())

Batch Processing

Submit all jobs first, then poll — avoids waiting serially between files.

import time
from cleanvoice import Cleanvoice

client = Cleanvoice.from_env()

files = [
    "https://example.com/episode1.mp3",
    "https://example.com/episode2.mp3",
    "https://example.com/episode3.mp3",
]

# Submit all jobs
edit_ids = [
    client.create_edit(f, fillers=True, normalize=True)
    for f in files
]

# Poll for completion
results = []
for edit_id in edit_ids:
    while True:
        edit = client.get_edit(edit_id)
        if edit.status == "SUCCESS":
            results.append(edit)
            break
        elif edit.status == "FAILURE":
            print(f"Failed: {edit_id}")
            break
        time.sleep(5)

print(f"Completed: {len(results)}/{len(files)} files")

File Upload & Download

Upload

# Upload a file and get its remote URL
url = client.upload_file("recording.mp3")

# Upload with a custom remote filename
url = client.upload_file("recording.mp3", "my_episode.mp3")

# Upload a librosa array
import librosa
audio, sr = librosa.load("recording.wav", sr=None, mono=True)
url = client.upload_file((audio, sr), "from_array.wav")

# Local files are uploaded automatically when passed to process()
result = client.process("local_audio.mp3", fillers=True)

Local uploads currently support the same formats as the API upload endpoint: .mp3, .wav, .flac, .m4a, plus the supported video formats.

Download

# Download from a result object
path = result.audio.download("enhanced.mp3")

# Download back as a NumPy array
audio, sr = result.download_audio(as_numpy=True)

# Async download as NumPy array
audio, sr = await result.download_audio_async(as_numpy=True)

# One-liner: process and download together
result, path = client.process_and_download(
    "audio.mp3",
    "output.mp3",
    fillers=True,
    normalize=True,
)

output_path saves the exact bytes returned by the API — the SDK does not transcode locally after download.

API Reference

`process()`

client.process(
    file_input,           # str (URL or path) or (array, sample_rate)
    config=None,          # ProcessingConfig or dict
    progress_callback=None,
    *,
    output_path=None,     # save finished audio as part of the task
    download=False,       # download even without output_path
    template_id=None,     # apply a saved Cleanvoice template
    upload_type=None,     # backend-specific upload type hint
    **options,            # normalize=True, fillers=True, etc.
)
# Returns: ProcessResult

Remote media inputs for process() and create_edit() must use http:// or https:// URLs.

With a progress callback:

def on_progress(data):
    pct = data.get("result", {}).get("done", 0)
    print(f"Status: {data['status']}  {pct}%")

result = client.process(
    "audio.mp3",
    fillers=True, stutters=True, long_silences=True,
    mouth_sounds=True, breath=True, remove_noise=True,
    normalize=True, studio_sound=True,
    mute_lufs=-80, target_lufs=-16,
    export_format="wav",
    summarize=True, social_content=True,
    progress_callback=on_progress,
    output_path="enhanced.wav",
)

`create_edit()`

Submit a job without waiting for completion. Returns an edit_id.

edit_id = client.create_edit(
    "https://example.com/audio.mp3",
    fillers=True,
    normalize=True,
    studio_sound=True,
    upload_type="podcast",
)

`get_edit(edit_id)`

Check the status and results of a submitted job.

edit = client.get_edit(edit_id)

if edit.status == "SUCCESS":
    print(edit.result.download_url)
else:
    print(edit.status)  # PENDING | STARTED | RETRY | FAILURE

`check_auth()`

Verify credentials and inspect account details.

account = client.check_auth()
print(account["credit"]["total"])
print(account["credit"]["payg"])
print(account["meta"])

Returns the current /v1/account payload, including credit.total, credit.payg, credit.subscription, and meta.

Configuration Options

Audio Processing

Option	Type	Default	Description
`fillers`	bool	`False`	Remove filler sounds (um, uh, etc.)
`stutters`	bool	`False`	Remove stutters
`long_silences`	bool	`False`	Remove long silences
`mouth_sounds`	bool	`False`	Remove mouth sounds
`hesitations`	bool	`False`	Remove hesitations
`breath`	bool or str	`False`	Reduce breath sounds
`remove_noise`	bool	`True`	Remove background noise
`keep_music`	bool	`False`	Preserve music sections
`normalize`	bool	`False`	Normalize audio levels
`studio_sound`	bool or str	`False`	AI-powered enhancement

Output

Option	Type	Default	Description
`export_format`	str	`'auto'`	`auto`, `mp3`, `wav`, `flac`, `m4a`
`mute_lufs`	float	`-80`	Mute threshold in LUFS
`target_lufs`	float	`-16`	Target loudness in LUFS
`export_timestamps`	bool	`False`	Export edit timestamps

AI Features

Option	Type	Default	Description
`transcription`	bool	`False`	Generate speech-to-text
`summarize`	bool	`False`	Generate AI summary (auto-enables transcription)
`social_content`	bool	`False`	Social media optimization (auto-enables summarize)

Other

Option	Type	Default	Description
`video`	bool	auto	Process as video file
`merge`	bool	`False`	Merge multi-track audio
`send_email`	bool	`False`	Email results to account

Local Media Utilities

Pure-Python helpers — FFmpeg is not required.

Audio Info

from cleanvoice import get_audio_info

info = get_audio_info("recording.mp3")
print(f"Duration:    {info.duration}s")
print(f"Sample rate: {info.sample_rate}Hz")
print(f"Channels:    {info.channels}")

Video Info

from cleanvoice import get_video_info

info = get_video_info("video.mp4")
print(f"Duration:   {info.duration}s")
print(f"Resolution: {info.width}x{info.height}")
print(f"FPS:        {info.fps}")
print(f"Has audio:  {info.has_audio}")

Extract Audio from Video

from cleanvoice import extract_audio_from_video

audio_path = extract_audio_from_video("video.mp4", "extracted.wav")

Error Handling

from cleanvoice import Cleanvoice, ApiError, FileValidationError

client = Cleanvoice.from_env()

try:
    result = client.process("audio.mp3", fillers=True, normalize=True)
    print("Success:", result.audio.url)
except FileValidationError as e:
    print(f"File error: {e}")
except ApiError as e:
    print(f"API error {e.status_code} [{e.error_code}]: {e.message}")
except Exception as e:
    print(f"Unexpected error: {e}")

Async uploads/downloads raise the same SDK exceptions as sync flows: ApiError for API/upload failures and FileValidationError for local file validation/download issues.

Network Resilience

The client automatically retries brief transient failures including connection resets, connect/read timeouts on safe requests, and temporary HTTP errors (429, 502, 503, 504). This is designed to absorb short backend restart windows without immediately failing common flows.

Retries cover: check_auth(), create_edit(), get_edit(), and process() during polling. Edit creation retries are intentionally conservative to avoid duplicating work.

Supported Formats

Audio: .wav .mp3 .ogg .flac .m4a .aiff .aac

Video: .mp4 .mov .webm .avi .mkv

Development

git clone https://github.com/cleanvoice/cleanvoice-python-sdk
cd cleanvoice-python-sdk
pip install -e .

pytest                   # run tests
black src/ && isort src/ # format
mypy src/                # type check

End-to-end local test — uploads, waits, downloads, and writes JSON summaries into results_test/:

CLEANVOICE_API_KEY=your-key python examples/manual_test_showcase.py

# target specific scenarios:
# --scenario audio_studio_sound_only
# --scenario audio_all_inclusive
# --scenario video_defaults
# --scenario video_all_inclusive

Contributing

Fork the repository
Create a feature branch
Make your changes with tests
Submit a pull request

Requirements

Python 3.8+
FFmpeg not required

Support


📖 Documentation	docs.cleanvoice.ai
📧 Email	support@cleanvoice.ai
🐛 Issues	GitHub Issues

License

MIT — see LICENSE for details.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

2.0.5

Apr 1, 2026

2.0.4

Mar 31, 2026

2.0.3

Mar 27, 2026

2.0.2

Mar 18, 2026

2.0.1

Mar 12, 2026

2.0.0

Mar 12, 2026

1.0.1

Jun 26, 2025

1.0.0

Jun 25, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cleanvoice_sdk-2.0.5.tar.gz (53.2 kB view details)

Uploaded Apr 1, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

cleanvoice_sdk-2.0.5-py3-none-any.whl (24.1 kB view details)

Uploaded Apr 1, 2026 Python 3

File details

Details for the file cleanvoice_sdk-2.0.5.tar.gz.

File metadata

Download URL: cleanvoice_sdk-2.0.5.tar.gz
Upload date: Apr 1, 2026
Size: 53.2 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.8

File hashes

Hashes for cleanvoice_sdk-2.0.5.tar.gz
Algorithm	Hash digest
SHA256	`a36e4151c051ab5e6b676a159fa995425ce13086e71a0cac43dd6be773bc2339`
MD5	`8636b6e0767dac91d7b0192e8f213bae`
BLAKE2b-256	`0a4a63131c6f36f38d2da21f76f1f91c56b675c125f3ecc604d2f68af64636b9`

See more details on using hashes here.

File details

Details for the file cleanvoice_sdk-2.0.5-py3-none-any.whl.

File metadata

Download URL: cleanvoice_sdk-2.0.5-py3-none-any.whl
Upload date: Apr 1, 2026
Size: 24.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.8

File hashes

Hashes for cleanvoice_sdk-2.0.5-py3-none-any.whl
Algorithm	Hash digest
SHA256	`916df02830073335138361d0d94d8d8ef5f8f5609247e2c3af70daa2464dff65`
MD5	`1fb29906b43de6b8846bedbb5d969aa7`
BLAKE2b-256	`f3f44267aa89ed51e2407a00b7f26fa891eb6412ad991872e954b8660f299044`

See more details on using hashes here.

cleanvoice-sdk 2.0.5

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Cleanvoice Python SDK

Table of Contents

Quick Start

Authentication

Common Patterns

Examples

Basic Audio Cleaning

Transcription & Summary

Video Processing

In-Memory Audio (NumPy / librosa)

Async Usage

Batch Processing

File Upload & Download

Upload

Download

API Reference

process()

create_edit()

get_edit(edit_id)

check_auth()

Configuration Options

Audio Processing

Output

AI Features

Other

Local Media Utilities

Audio Info

Video Info

Extract Audio from Video

Error Handling

Network Resilience

Supported Formats

Development

Contributing

Requirements

Support

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

`process()`

`create_edit()`

`get_edit(edit_id)`

`check_auth()`