dora-gradio
Project description
Dora Gradio UI Interface
A versatile UI interface for Dora-rs that provides text, audio, and video input capabilities using Gradio.
Features
- Text Input: Direct text input through a chat-like interface
- Audio Input: Real-time audio streaming in 16kHz format
- Video Input: WebRTC camera streaming at 640x480
- Multiple Output Channels:
text: For direct text messagesaudio: For raw audio streamimage: For camera feed
- Clean Interface: Simple and intuitive UI with tabbed sections
- Auto Port Management: Automatically handles port conflicts
Installation
Using pip:
python -m venv .venv
source .venv/bin/activate
pip install -e .
Usage
1. Web Interface
The interface will be available at: http://localhost:7860
2. As a Dora Node
Create a YAML configuration:
nodes:
- id: ui
build: pip install -e .
path: dora-gradio
outputs:
- text # Text from chat interface
- audio # Raw audio stream
- image # Camera feed
env:
VIRTUAL_ENV: path to your venv # comment this if not using venv
Run with Dora:
dora run demo.yml
3. Integration Examples
Video Processing Pipeline
nodes:
- id: ui
build: pip install -e .
path: dora-gradio
outputs:
- image # Camera feed
- id: video_processor
build: pip install -e path/to/processor
path: video-processor
inputs:
video: ui/image
Audio Processing Pipeline
nodes:
- id: ui
build: pip install -e .
path: dora-gradio
outputs:
- audio # Raw audio stream
- id: audio_processor
build: pip install -e path/to/processor
path: audio-processor
inputs:
audio: ui/audio
4. Demo Example
Here's a complete demo pipeline using the UI with visualization and audio processing:
nodes:
- id: ui
build: pip install -e .
path: dora-gradio
outputs:
- text # Text messages
- audio # Raw audio stream
- image # Camera feed
- id: plot
build: pip install dora-rerun
path: dora-rerun
inputs:
text_input: ui/text
audio: ui/audio
image: ui/image
- id: dora-vad
build: pip install -e path/to/dora-vad
path: dora-vad
inputs:
audio: ui/audio
- id: dora-distil-whisper
build: pip install -e path/to/dora-distil-whisper
path: dora-distil-whisper
inputs:
audio: ui/audio
This demo showcases:
- Real-time visualization with dora-rerun
- Voice Activity Detection with dora-vad
- Speech processing with dora-distil-whisper
Interface Features
Camera Tab
- Real-time WebRTC video streaming
- Fixed 640x480 resolution
- BGR8 color format
- Automatic timestamp synchronization
Audio and Text Input Tab
- Chat-like interface for text input
- Real-time audio streaming (16kHz, mono)
- Status indicators for streaming state
- Immediate output through respective channels
Controls
- Send Text button for chat messages
- Stop Server button for graceful shutdown
System Requirements
- Python ≥ 3.10
- Required ports:
- 7860 (Gradio interface)
Known Limitations
- Fixed video resolution (640x480)
- Fixed audio sample rate (16kHz)
- Requires port 7860 to be available
License
dora-gradio's code are released under the MIT License
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file dora_gradio-0.3.12.tar.gz.
File metadata
- Download URL: dora_gradio-0.3.12.tar.gz
- Upload date:
- Size: 5.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.7.17
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d377fba2b6525e1e845b41d3aa8bd16ebbe20041792e971996f39639e8749e2f
|
|
| MD5 |
8f4157c661f4697b9d8852a5c7785908
|
|
| BLAKE2b-256 |
cc09809a47ecb3f1a6220e38787da916a44d4837e3fd930672ff64a678eb9376
|
File details
Details for the file dora_gradio-0.3.12-py3-none-any.whl.
File metadata
- Download URL: dora_gradio-0.3.12-py3-none-any.whl
- Upload date:
- Size: 5.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.7.17
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
74b1f0d32a888c2c7c2a5a9ee9a8ec4757f84dc41b73466e6f60967c1c022432
|
|
| MD5 |
b1d0ec0d8acae11b0bf3c6b39ff8c42c
|
|
| BLAKE2b-256 |
ca976146eefe636202c7c9c0e3735f8df55fe8bbc01843a8382295059f251288
|