Official Python SDK for Cleanvoice AI audio processing

These details have not been verified by PyPI

Project links

Project description

Cleanvoice Python SDK

Official Python SDK for Cleanvoice AI - AI-powered audio processing and enhancement.

Features

🎵 Audio Processing: Remove fillers, background noise, long silences, and more
📹 Video Support: Process audio tracks from video files without ffmpeg
📝 Transcription: Convert speech to text with high accuracy
📊 Summarization: Generate summaries, chapters, and key learnings
🔧 Type Safe: Full type hints with Pydantic models
⚡ Developer Friendly: Simple, intuitive API design
🔄 Async Support: Modern async/await patterns
🎛️ Extensible: Comprehensive configuration options
📦 No FFmpeg Required: Built-in audio/video handling with librosa, soundfile, and PyAV

Installation

pip install cleanvoice-sdk

Optional Dependencies

For development:

pip install cleanvoice-sdk[dev]

Quick Start

from cleanvoice import Cleanvoice

cv = Cleanvoice({'api_key': 'your-api-key-here'})

# Process audio with AI
result = cv.process(
    "https://example.com/podcast.mp3",
    {
        'fillers': True,
        'normalize': True,
        'transcription': True,
        'summarize': True
    }
)

print(f"Processed audio: {result.audio.url}")
print(f"Summary: {result.transcript.summary}")

Authentication

Get your API key from the Cleanvoice Dashboard.

from cleanvoice import Cleanvoice

cv = Cleanvoice({
    'api_key': 'your-api-key-here',
    # Optional: custom base URL
    'base_url': 'https://api.cleanvoice.ai/v2',
    # Optional: request timeout in seconds
    'timeout': 60
})

API Reference

`process(file_input, config, progress_callback=None)`

Process an audio or video file with AI enhancement.

Parameters:

file_input (str): URL to audio/video file
config (ProcessingConfig or dict): Processing options
progress_callback (callable, optional): Callback function for progress updates

Returns: ProcessResult

from cleanvoice import Cleanvoice

cv = Cleanvoice({'api_key': 'your-api-key'})

def progress_callback(data):
    print(f"Status: {data['status']}, Progress: {data.get('result', {}).get('done', 0)}%")

result = cv.process(
    "https://example.com/audio.mp3",
    {
        # Audio Enhancement
        'fillers': True,           # Remove filler sounds (um, uh, etc.)
        'stutters': True,          # Remove stutters
        'long_silences': True,     # Remove long silences
        'mouth_sounds': True,      # Remove mouth sounds
        'breath': True,            # Reduce breath sounds
        'remove_noise': True,      # Remove background noise
        'normalize': True,         # Normalize audio levels
        
        # Advanced Options
        'mute_lufs': -80,         # Mute threshold (negative number)
        'target_lufs': -16,       # Target loudness level
        'export_format': 'mp3',   # Output format: auto, mp3, wav, flac, m4a
        
        # AI Features
        'transcription': True,     # Generate transcript
        'summarize': True,         # Generate summary (requires transcription)
        'social_content': True,    # Optimize for social media
        
        # Video
        'video': False,           # Set to True for video files (auto-detected)
        
        # Multi-track
        'merge': False,           # Merge multi-track audio
    },
    progress_callback=progress_callback
)

# Access results
print(result.audio.url)           # Download URL
print(result.audio.statistics)    # Processing stats
print(result.transcript.text)     # Full transcript
print(result.transcript.summary)  # AI summary

`create_edit(file_input, config)`

Create an edit job without waiting for completion.

from cleanvoice import Cleanvoice

cv = Cleanvoice({'api_key': 'your-api-key'})

edit_id = cv.create_edit(
    "https://example.com/audio.mp3",
    {'fillers': True, 'normalize': True}
)

print(f'Edit ID: {edit_id}')

`get_edit(edit_id)`

Get the status and results of an edit job.

from cleanvoice import Cleanvoice

cv = Cleanvoice({'api_key': 'your-api-key'})

edit = cv.get_edit(edit_id)

if edit.status == 'SUCCESS':
    print(f'Download URL: {edit.result.download_url}')
else:
    print(f'Status: {edit.status}')  # PENDING, STARTED, RETRY, FAILURE

`check_auth()`

Verify API authentication and get account information.

from cleanvoice import Cleanvoice

cv = Cleanvoice({'api_key': 'your-api-key'})

account = cv.check_auth()
print('Account info:', account)

File Handling Without FFmpeg

The SDK includes built-in support for audio and video files using PyAV without requiring FFmpeg:

Audio File Information

from cleanvoice import get_audio_info

info = get_audio_info('path/to/audio.mp3')
print(f"Duration: {info.duration}s")
print(f"Sample Rate: {info.sample_rate}Hz")
print(f"Channels: {info.channels}")

Video File Information

from cleanvoice import get_video_info

info = get_video_info('path/to/video.mp4')
print(f"Duration: {info.duration}s")
print(f"Resolution: {info.width}x{info.height}")
print(f"FPS: {info.fps}")
print(f"Has Audio: {info.has_audio}")

Extract Audio from Video

from cleanvoice import extract_audio_from_video

audio_path = extract_audio_from_video(
    'path/to/video.mp4',
    'extracted_audio.wav'  # Optional output path
)
print(f"Extracted audio: {audio_path}")

Configuration Options

Audio Processing

Option	Type	Default	Description
`fillers`	bool	False	Remove filler sounds (um, uh, etc.)
`stutters`	bool	False	Remove stutters
`long_silences`	bool	False	Remove long silences
`mouth_sounds`	bool	False	Remove mouth sounds
`hesitations`	bool	False	Remove hesitations
`breath`	bool	False	Reduce breath sounds
`remove_noise`	bool	True	Remove background noise
`keep_music`	bool	False	Preserve music sections
`normalize`	bool	False	Normalize audio levels
`sound_studio`	bool	False	AI-powered enhancement

Output Options

Option	Type	Default	Description
`export_format`	str	'auto'	Output format: auto, mp3, wav, flac, m4a
`mute_lufs`	float	-80	Mute threshold in LUFS (negative)
`target_lufs`	float	-16	Target loudness in LUFS (negative)
`export_timestamps`	bool	False	Export edit timestamps

AI Features

Option	Type	Default	Description
`transcription`	bool	False	Generate speech-to-text
`summarize`	bool	False	Generate AI summary (requires transcription)
`social_content`	bool	False	Optimize for social media (requires summarize)

Other Options

Option	Type	Default	Description
`video`	bool	auto-detected	Process video file
`merge`	bool	False	Merge multi-track audio
`send_email`	bool	False	Email results to account

Examples

Basic Audio Cleaning

from cleanvoice import Cleanvoice

cv = Cleanvoice({'api_key': 'your-api-key'})

result = cv.process(
    "https://example.com/podcast.mp3",
    {
        'fillers': True,
        'long_silences': True,
        'normalize': True,
        'remove_noise': True
    }
)

print(f"Cleaned audio: {result.audio.url}")
print(f"Removed {result.audio.statistics.FILLER_SOUND} filler sounds")

Transcription and Summary

from cleanvoice import Cleanvoice

cv = Cleanvoice({'api_key': 'your-api-key'})

result = cv.process(
    "https://example.com/interview.wav",
    {
        'transcription': True,
        'summarize': True,
        'normalize': True
    }
)

print('Title:', result.transcript.title)
print('Summary:', result.transcript.summary)
print('Chapters:', result.transcript.chapters)

Video Processing

from cleanvoice import Cleanvoice

cv = Cleanvoice({'api_key': 'your-api-key'})

result = cv.process(
    "https://example.com/video.mp4",
    {
        'video': True,  # Optional: auto-detected
        'fillers': True,
        'transcription': True,
        'export_format': 'mp3'
    }
)

print('Processed audio:', result.audio.url)

Batch Processing

from cleanvoice import Cleanvoice
import time

cv = Cleanvoice({'api_key': 'your-api-key'})

files = [
    "https://example.com/episode1.mp3",
    "https://example.com/episode2.mp3",
    "https://example.com/episode3.mp3"
]

edit_ids = []
for file in files:
    edit_id = cv.create_edit(file, {'fillers': True, 'normalize': True})
    edit_ids.append(edit_id)

# Poll for completion
results = []
for edit_id in edit_ids:
    while True:
        edit = cv.get_edit(edit_id)
        if edit.status == 'SUCCESS':
            results.append(edit)
            break
        elif edit.status == 'FAILURE':
            print(f"Failed: {edit_id}")
            break
        else:
            time.sleep(5)  # Wait 5 seconds before polling again

print(f'All processing completed: {len(results)} files')

Error Handling

from cleanvoice import Cleanvoice, ApiError, FileValidationError

cv = Cleanvoice({'api_key': 'your-api-key'})

try:
    result = cv.process(
        "https://example.com/audio.mp3",
        {'fillers': True, 'normalize': True}
    )
    print('Success:', result.audio.url)
except ApiError as e:
    print(f'API Error: {e.message}')
    if e.status_code:
        print(f'HTTP Status: {e.status_code}')
        print(f'Error Code: {e.error_code}')
except FileValidationError as e:
    print(f'File Error: {e}')
except Exception as e:
    print(f'Unexpected Error: {e}')

Supported File Formats

Audio Formats

WAV (.wav)
MP3 (.mp3)
OGG (.ogg)
FLAC (.flac)
M4A (.m4a)
AIFF (.aiff)
AAC (.aac)

Video Formats

MP4 (.mp4)
MOV (.mov)
WebM (.webm)
AVI (.avi)
MKV (.mkv)

Requirements

Python 3.8+
No FFmpeg required for basic audio/video processing

Development

Installing for Development

git clone https://github.com/cleanvoice/cleanvoice-python-sdk
cd cleanvoice-python-sdk
pip install -e .[dev]

Running Tests

pytest

Code Formatting

black src/
isort src/

Type Checking

mypy src/

Contributing

Fork the repository
Create a feature branch
Make your changes
Add tests
Submit a pull request

License

MIT License - see LICENSE file for details.

Support

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

2.0.5

Apr 1, 2026

2.0.4

Mar 31, 2026

2.0.3

Mar 27, 2026

2.0.2

Mar 18, 2026

2.0.1

Mar 12, 2026

2.0.0

Mar 12, 2026

1.0.1

Jun 26, 2025

This version

1.0.0

Jun 25, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cleanvoice_sdk-1.0.0.tar.gz (29.9 kB view details)

Uploaded Jun 25, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

cleanvoice_sdk-1.0.0-py3-none-any.whl (14.9 kB view details)

Uploaded Jun 25, 2025 Python 3

File details

Details for the file cleanvoice_sdk-1.0.0.tar.gz.

File metadata

Download URL: cleanvoice_sdk-1.0.0.tar.gz
Upload date: Jun 25, 2025
Size: 29.9 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.11.8

File hashes

Hashes for cleanvoice_sdk-1.0.0.tar.gz
Algorithm	Hash digest
SHA256	`0fc828dcab06507832bf8a522a61d6ced74b8f9a16c8754743a5cca6be74315a`
MD5	`855c9150f68b51329abd08ae33d9625b`
BLAKE2b-256	`630f68f094d332039fffe7b8784d1439783e7e8296a24cd5bb5b65f31a20ce4f`

See more details on using hashes here.

File details

Details for the file cleanvoice_sdk-1.0.0-py3-none-any.whl.

File metadata

Download URL: cleanvoice_sdk-1.0.0-py3-none-any.whl
Upload date: Jun 25, 2025
Size: 14.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.11.8

File hashes

Hashes for cleanvoice_sdk-1.0.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`6778360a24f131097d9939fc26854cad67b1f02aa13bd509e79f67ca610610be`
MD5	`9d606c36958a30d310a4f3fc4d50b8a4`
BLAKE2b-256	`68b24f7882ac218c63ecf6b75f666a483cb2060c66173de138191d5cfc556740`

See more details on using hashes here.

cleanvoice-sdk 1.0.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Cleanvoice Python SDK

Features

Installation

Optional Dependencies

Quick Start

Authentication

API Reference

process(file_input, config, progress_callback=None)

create_edit(file_input, config)

get_edit(edit_id)

check_auth()

File Handling Without FFmpeg

Audio File Information

Video File Information

Extract Audio from Video

Configuration Options

Audio Processing

Output Options

AI Features

Other Options

Examples

Basic Audio Cleaning

Transcription and Summary

Video Processing

Batch Processing

Error Handling

Supported File Formats

Audio Formats

Video Formats

Requirements

Development

Installing for Development

Running Tests

Code Formatting

Type Checking

Contributing

License

Support

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

`process(file_input, config, progress_callback=None)`

`create_edit(file_input, config)`

`get_edit(edit_id)`

`check_auth()`