On-device voice processing pipeline for fast, private voice interaction

These details have not been verified by PyPI

Project links

Homepage

Project description

Llama Voice

Llama Voice is a [brief description, e.g., real-time voice processing and transcription component] within the LlamaSearch AI ecosystem. It provides capabilities for [list key capabilities, e.g., voice activity detection (VAD), automatic speech recognition (ASR), speaker diarization].

Features

Real-time Processing: Designed for low-latency voice stream handling.
Accurate Transcription: Leverages [mention model or technique, e.g., Whisper-based models] for high-quality ASR.
Speaker Identification: [Describe capability, e.g., Differentiates between multiple speakers in an audio stream].
Voice Activity Detection: Efficiently detects speech segments to reduce processing load.
[Add other relevant features]

Installation

# Ensure you are in the root of the llamasearchai-git repository
pip install -e ./batch2/llama-voice

Or, if installing dependencies listed in its pyproject.toml is preferred:

cd batch2/llama-voice
pip install .
cd ../..

Dependencies

Python 3.8+
[List key dependencies, e.g., PyTorch, Transformers, LibROSA, PyAudio]
Refer to pyproject.toml for a complete list.

Usage

Provide a basic example of how to use the core functionality.

# Example: Basic ASR usage
# NOTE: This is a hypothetical example, adjust based on actual implementation

from llama_voice.asr_processor import ASRProcessor # Assuming this structure
# from llama_voice.vad import VoiceActivityDetector # Example

# Initialize components (adjust parameters as needed)
# vad = VoiceActivityDetector() 
processor = ASRProcessor(model_size="base")

async def process_audio_stream(stream):
    async for audio_chunk in stream:
        # Optional VAD
        # if vad.is_speech(audio_chunk):
        
        transcription = await processor.transcribe(audio_chunk)
        if transcription:
            print(f"Transcription: {transcription}")

# Example of setting up and running the stream processing
# setup_and_run(process_audio_stream)

Configuration

Explain any necessary configuration, e.g., model selection, language settings, device selection (CPU/GPU/MPS). Mention if environment variables are used.

Architecture

Briefly describe the main components and their interaction (e.g., VAD module, ASR model loader, processing pipeline).

Contributing

Please refer to the main CONTRIBUTING.md file in the root of the LlamaSearchAI repository for contribution guidelines. Specific notes for Llama Voice development can be added here if necessary.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

This version

0.1.0

Apr 4, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

llama_voice_llamasearch-0.1.0.tar.gz (38.0 kB view details)

Uploaded Apr 4, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

llama_voice_llamasearch-0.1.0-py3-none-any.whl (43.8 kB view details)

Uploaded Apr 4, 2025 Python 3

File details

Details for the file llama_voice_llamasearch-0.1.0.tar.gz.

File metadata

Download URL: llama_voice_llamasearch-0.1.0.tar.gz
Upload date: Apr 4, 2025
Size: 38.0 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.3

File hashes

Hashes for llama_voice_llamasearch-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`78f33635089c9bb658d498668da89c46356e801f9a95ff7862d9942b58fd4a86`
MD5	`d9e155a13866393a9db05267eee9cd07`
BLAKE2b-256	`8678eb463f4d3a1bf673c09e2d53b95317fc2596741c27174e2f80aa05cb9ae8`

See more details on using hashes here.

File details

Details for the file llama_voice_llamasearch-0.1.0-py3-none-any.whl.

File metadata

Download URL: llama_voice_llamasearch-0.1.0-py3-none-any.whl
Upload date: Apr 4, 2025
Size: 43.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.3

File hashes

Hashes for llama_voice_llamasearch-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`6935f9b3524124895361593394f75708adb371199c1b8b95696412b9e1a7343d`
MD5	`0c719a43fe27973b75e31936fe9f4eda`
BLAKE2b-256	`f551905c3331654b54141f7c6cb6b28b238057e9dbf27cab065dc877c9c306e3`

See more details on using hashes here.

llama-voice-llamasearch 0.1.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Llama Voice

Features

Installation

Dependencies

Usage

Configuration

Architecture

Contributing

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes