A single interface around speech-to-speech foundation models.
spoken!
pip install spoken
Currently supports batch (offline) runs for evaluation and benchmarking, but can easily be extended to propagate audio chunks forward.
import spoken
# Top-level await: run this in a notebook, with `python -m asyncio`, or inside an async function.
model = spoken("gpt-4o-realtime-preview-2024-12-17", "examples/input.wav")
input_asr, output_asr, output_audio = await model.run()
output_asr # "That's quite the story..."
len(output_audio) # 8549ms
model.output_audio_tokens # 254
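For benchmarking, the single-file call above usually has to be repeated over a folder of recordings. The following is a minimal sketch of one way to do that with plain asyncio, not part of the package itself. The names `run_batch`, `fake_run`, and `max_concurrent` are hypothetical, and `fake_run` is a stand-in that returns dummy data so the sketch runs without API keys.

```python
import asyncio
from typing import Awaitable, Callable

# A per-file runner takes a wav path and returns
# (input_asr, output_asr, output_audio), mirroring the tuple from model.run().
Runner = Callable[[str], Awaitable[tuple]]

async def run_batch(paths: list[str], run_one: Runner, max_concurrent: int = 4) -> dict[str, tuple]:
    """Run `run_one` over every path, at most `max_concurrent` at a time."""
    gate = asyncio.Semaphore(max_concurrent)

    async def guarded(path: str) -> tuple:
        async with gate:  # limits the number of in-flight requests
            return await run_one(path)

    # gather preserves input order, so results line up with paths
    results = await asyncio.gather(*(guarded(p) for p in paths))
    return dict(zip(paths, results))

async def fake_run(path: str) -> tuple:
    # Hypothetical stand-in for a real model call; returns dummy data.
    await asyncio.sleep(0)
    return (f"in:{path}", f"out:{path}", b"")

if __name__ == "__main__":
    out = asyncio.run(run_batch(["a.wav", "b.wav"], fake_run, max_concurrent=2))
    print(out["a.wav"][1])
```

To use a real model, `fake_run` would be replaced by a coroutine that builds the model for the given path and awaits its `run()` method, assuming the call shape shown in the example above. The semaphore is there because provider rate limits tend to make unbounded concurrency fail.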
Supports
- OpenAI Realtime
  - gpt-4o-realtime-preview-2024-12-17
  - gpt-4o-mini-audio-preview-2024-12-17
- Gemini Multimodal Live
  - gemini-2.5-flash-preview-native-audio-dialog
  - gemini-2.5-flash-exp-native-audio-thinking-dialog
- Amazon Nova Sonic
  - amazon.nova-sonic-v1:0
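When a benchmark script loops over several model IDs, it can help to validate each ID up front. The sketch below copies the list above into a lookup table. `SUPPORTED_MODELS` and `provider_for` are hypothetical names for illustration and are not claimed to exist in the package.

```python
# Model IDs copied from the supported list above, grouped by provider.
SUPPORTED_MODELS = {
    "OpenAI Realtime": [
        "gpt-4o-realtime-preview-2024-12-17",
        "gpt-4o-mini-audio-preview-2024-12-17",
    ],
    "Gemini Multimodal Live": [
        "gemini-2.5-flash-preview-native-audio-dialog",
        "gemini-2.5-flash-exp-native-audio-thinking-dialog",
    ],
    "Amazon Nova Sonic": [
        "amazon.nova-sonic-v1:0",
    ],
}

def provider_for(model_id: str) -> str:
    """Return the provider name for a model ID, or raise if it is not listed."""
    for provider, model_ids in SUPPORTED_MODELS.items():
        if model_id in model_ids:
            return provider
    raise ValueError(f"unsupported model: {model_id!r}")

if __name__ == "__main__":
    print(provider_for("amazon.nova-sonic-v1:0"))
```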
Installation
- Amazon Nova Sonic support needs portaudio.h (on macOS: brew install portaudio)
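A quick way to see whether PortAudio is present before installing is to ask Python to locate the shared library. This is only a rough proxy: `ctypes.util.find_library` looks for the compiled library, not the `portaudio.h` header, so a positive result does not guarantee that the header is available for building. `portaudio_available` is a hypothetical helper name.

```python
from ctypes.util import find_library

def portaudio_available() -> bool:
    """Return True if a PortAudio shared library can be located on this system."""
    return find_library("portaudio") is not None

if __name__ == "__main__":
    if portaudio_available():
        print("PortAudio library found")
    else:
        print("PortAudio library not found; on macOS try: brew install portaudio")
```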