LiveKit Agents plugin for Gnani Vachana speech AI — STT & TTS for Indian languages

Owner: LiveKit (verified by PyPI)

livekit-plugins-gnani

LiveKit Agents plugin for Gnani Vachana — high-accuracy Speech-to-Text and low-latency Text-to-Speech for Indian languages.

Vachana is a production-ready speech AI platform by Gnani.ai supporting 22+ Indian languages with 250+ voices, real-time streaming, multilingual transcription, and code-switching capabilities.

Installation

pip install livekit-plugins-gnani

This will also install the gnani-vachana core SDK as a dependency.

Prerequisites

You need a Gnani API key. Email speechstack@gnani.ai to get started — all new accounts receive free credits, no credit card required.

Set your credentials as environment variables:

export GNANI_API_KEY="your-api-key"

# For REST STT only (optional):
export GNANI_ORGANIZATION_ID="your-org-id"
export GNANI_USER_ID="your-user-id"
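
Before starting an agent, it can help to confirm that the variables above are actually set. The following is a minimal sketch using only the standard library; the helper name `missing_gnani_vars` is hypothetical and not part of the plugin, and the variable names are simply the ones from the export lines above.

```python
import os

# Variable names are taken from the export lines above.
REQUIRED_VARS = ("GNANI_API_KEY",)
# Only needed when using REST STT, per the comment above.
REST_STT_VARS = ("GNANI_ORGANIZATION_ID", "GNANI_USER_ID")


def missing_gnani_vars(env=None, rest_stt=False):
    """Return the names of expected variables that are unset or empty.

    Hypothetical helper for illustration; it is not provided by the plugin.
    """
    env = os.environ if env is None else env
    names = REQUIRED_VARS + (REST_STT_VARS if rest_stt else ())
    return [name for name in names if not env.get(name)]
```

Calling `missing_gnani_vars(rest_stt=True)` at startup and failing fast when the returned list is non-empty gives a clearer error than a failed request later on.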

Quick Start

Speech-to-Text

from livekit.plugins.gnani import STT

stt = STT(language="hi-IN")

# Use with a LiveKit voice agent pipeline

Text-to-Speech

from livekit.plugins.gnani import TTS

tts = TTS(voice="Karan")

# Use with a LiveKit voice agent pipeline
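
The two snippets above can be combined into one small factory. This is a sketch only: `build_speech_components` is a hypothetical helper, and it uses just the constructor arguments shown in the Quick Start (`language` for `STT`, `voice` for `TTS`). The import is deferred so the module can be loaded even where the plugin is not installed.

```python
def build_speech_components(stt_language="hi-IN", tts_voice="Karan"):
    """Create the STT and TTS objects shown in the Quick Start.

    Hypothetical helper; only the constructor arguments documented
    above are used.
    """
    # Imported lazily so this module loads without the plugin installed.
    from livekit.plugins.gnani import STT, TTS

    return {
        "stt": STT(language=stt_language),
        "tts": TTS(voice=tts_voice),
    }
```

Assuming LiveKit Agents 1.x, the returned objects would typically be passed as the `stt` and `tts` arguments of an `AgentSession`, alongside an LLM of your choice.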

Features

STT

Batch recognition — REST API (POST /stt/v3) for file-based transcription
Real-time streaming — WebSocket API for live audio transcription with VAD
23 Indian languages — Assamese, Bengali, Bodo, Dogri, English (India), Gujarati, Hindi, Kannada, Kashmiri, Konkani, Maithili, Malayalam, Manipuri, Marathi, Nepali, Odia, Punjabi, Sanskrit, Santhali, Sindhi, Tamil, Telugu, Urdu
Code-switching — Hinglish (en-hi-IN-latn) and Hindi-English mixed (en-hi-in-cm) for streaming
Sample rates — 8 kHz and 16 kHz

TTS

Chunked synthesis — REST API for single-request audio generation
Real-time streaming — WebSocket API for low-latency streaming synthesis
250+ voices — language-specific voices across 22 Indian languages (e.g. Karan, Simran, Riya, Aarav, Priya, and many more)
Model — vachana-voice-v3 with voice cloning support
Configurable output — sample rate (8000–44100), encoding (linear_pcm, oggopus), container (raw, mp3, wav, mulaw, ogg)
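
The output options listed above can be checked locally before a request is made. The sketch below is an assumption-laden illustration: `TTSOutputConfig` is a hypothetical class (not part of the plugin or SDK), its defaults are arbitrary, and it only checks each value against the lists above. It does not know which encoding and container combinations the service accepts, or whether every sample rate inside the range is valid.

```python
from dataclasses import dataclass

# Allowed values as listed in the TTS feature list above.
ENCODINGS = ("linear_pcm", "oggopus")
CONTAINERS = ("raw", "mp3", "wav", "mulaw", "ogg")
MIN_SAMPLE_RATE, MAX_SAMPLE_RATE = 8000, 44100


@dataclass(frozen=True)
class TTSOutputConfig:
    """Hypothetical container for output settings; defaults are arbitrary."""

    sample_rate: int = 16000
    encoding: str = "linear_pcm"
    container: str = "wav"

    def problems(self):
        """Return a list of human-readable validation problems."""
        found = []
        if not MIN_SAMPLE_RATE <= self.sample_rate <= MAX_SAMPLE_RATE:
            found.append(f"sample_rate {self.sample_rate} outside 8000-44100")
        if self.encoding not in ENCODINGS:
            found.append(f"unknown encoding {self.encoding!r}")
        if self.container not in CONTAINERS:
            found.append(f"unknown container {self.container!r}")
        return found
```

An empty list from `problems()` means the values match the documented options; it is not a guarantee that the service will accept the combination.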

Supported Languages

STT Languages (Speech-to-Text) — 10 languages

STT uses BCP-47 locale codes (e.g. hi-IN):

Language	Code
English (India)	`en-IN`
Hindi	`hi-IN`
Gujarati	`gu-IN`
Tamil	`ta-IN`
Kannada	`kn-IN`
Telugu	`te-IN`
Marathi	`mr-IN`
Bengali	`bn-IN`
Malayalam	`ml-IN`
Punjabi	`pa-IN`

Two experimental codes are available for streaming only: en-hi-IN-latn (Hinglish) and en-hi-in-cm (Hindi-English code-mixed).
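
A small lookup can make the batch versus streaming distinction explicit in application code. The following sketch copies the codes from the table above; `stt_modes` is a hypothetical helper, and it assumes the ten table entries are usable in both batch and streaming modes, which the feature list suggests but does not state outright.

```python
# Codes copied from the STT table above.
STT_LOCALES = {
    "en-IN": "English (India)",
    "hi-IN": "Hindi",
    "gu-IN": "Gujarati",
    "ta-IN": "Tamil",
    "kn-IN": "Kannada",
    "te-IN": "Telugu",
    "mr-IN": "Marathi",
    "bn-IN": "Bengali",
    "ml-IN": "Malayalam",
    "pa-IN": "Punjabi",
}

# Listed above as experimental and streaming-only.
STREAMING_ONLY = {
    "en-hi-IN-latn": "Hinglish",
    "en-hi-in-cm": "Hindi-English code-mixed",
}


def stt_modes(code):
    """Return the transcription modes assumed to be available for a code."""
    if code in STT_LOCALES:
        return ("batch", "streaming")
    if code in STREAMING_ONLY:
        return ("streaming",)
    raise ValueError(f"{code!r} is not in the documented STT language list")
```

The lookup is case-sensitive on purpose, because the two code-switching codes are written with different casing in the list above.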

TTS Languages (Text-to-Speech) — 22 languages

TTS uses ISO 639 language codes (e.g. hi, bn). Pass these via the language parameter.

Language	Code	Sample Voices
Assamese	`as`	Priya, Arjun
Bengali	`bn`	Ananya, Abhik
Bodo	`brx`	Anamika, Anil
Dogri	`doi`	Asha, Ajay
Gujarati	`gu`	Avani, Akshay
Hindi	`hi`	Aarav, Deepak
Kannada	`kn`	Anitha, Aditya
Kashmiri	`ks`	Aafreen, Altaf
Konkani	`kok`	Alka, Agnelo
Maithili	`mai`	Archana, Amaresh
Malayalam	`ml`	Ambika, Abhilash
Manipuri	`mni`	Achouba, Dinamani
Marathi	`mr`	Aparna, Amol
Nepali	`ne`	Anita, Amar
Odia	`or`	Anuradha, Asutosh
Punjabi	`pa`	Amandeep, Amarjit
Sanskrit	`sa`	Akshara, Achyut
Santhali	`sat`	Arjun_S, Birsa
Sindhi	`sd`	Ameena, Bhagwanti
Tamil	`ta`	Abinaya, Anbarasan
Telugu	`te`	Alekhya, Adithya
Urdu	`ur`	Aiza, Asad
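
Because STT takes BCP-47 locales and TTS takes ISO 639 codes, an agent that uses both may need to convert one into the other. The sketch below takes the primary language subtag of the locale and checks it against the TTS table above; `tts_language_for` is a hypothetical helper, not part of the plugin.

```python
# ISO 639 codes copied from the TTS table above.
TTS_LANGUAGE_CODES = frozenset({
    "as", "bn", "brx", "doi", "gu", "hi", "kn", "ks", "kok", "mai", "ml",
    "mni", "mr", "ne", "or", "pa", "sa", "sat", "sd", "ta", "te", "ur",
})


def tts_language_for(stt_locale):
    """Map a BCP-47 locale such as 'hi-IN' to a TTS code such as 'hi'."""
    # The primary language subtag is everything before the first hyphen.
    primary = stt_locale.split("-", 1)[0].lower()
    if primary not in TTS_LANGUAGE_CODES:
        raise ValueError(f"no TTS language listed for {stt_locale!r}")
    return primary
```

Note that `en-IN` and the two code-switching locales start with `en`, which does not appear in the TTS table, so this helper raises `ValueError` for them rather than guessing a fallback.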

Available Voices (Sample)

The TTS API supports 250+ voices across all supported languages. The table below lists a sample of primary and language-specific voices:

Voice	ID	Language or category
Karan	`Karan`	Primary
Simran	`Simran`	Primary
Nara	`Nara`	Primary
Riya	`Riya`	Primary
Viraj	`Viraj`	Primary
Raju	`Raju`	Primary
Aarav	`Aarav`	Hindi
Priya	`Priya`	Assamese
Ananya	`Ananya`	Bengali
Avani	`Avani`	Gujarati
Anitha	`Anitha`	Kannada
Ambika	`Ambika`	Malayalam
Aparna	`Aparna`	Marathi
Abinaya	`Abinaya`	Tamil
Alekhya	`Alekhya`	Telugu
Aiza	`Aiza`	Urdu

Each language has 8–16 dedicated voices (male and female). Use SUPPORTED_VOICES from the package for the full list.

Legacy v2 Voices

The following lowercase voices from vachana-voice-v2 are still supported for backward compatibility:

Voice	ID	Model
Sia	`sia`	vachana-voice-v2
Raju	`raju`	vachana-voice-v2
Kanika	`kanika`	vachana-voice-v2
Nikita	`nikita`	vachana-voice-v2
Ravan	`ravan`	vachana-voice-v2
Simran	`simran`	vachana-voice-v2
Karan	`karan`	vachana-voice-v2
Neha	`neha`	vachana-voice-v2

Note: Casing matters! "karan" (lowercase) uses the old v2 model, while "Karan" (capitalized) uses the new v3 model.
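
The casing rule is easy to get wrong when voice names come from user input or configuration files. The sketch below infers the model from the voice ID using only the legacy table above; `model_for_voice` is a hypothetical helper and does not reflect how the SDK resolves voices internally.

```python
# Lowercase IDs copied from the legacy v2 table above.
LEGACY_V2_VOICES = frozenset({
    "sia", "raju", "kanika", "nikita", "ravan", "simran", "karan", "neha",
})


def model_for_voice(voice_id):
    """Guess which model a voice ID selects, based on the tables above.

    Any ID that is not an exact, lowercase legacy name is treated as v3.
    The function does not check that the ID is a real v3 voice.
    """
    if voice_id in LEGACY_V2_VOICES:
        return "vachana-voice-v2"
    return "vachana-voice-v3"
```

A check like this is mainly useful for logging a warning when a configuration value such as `karan` silently selects the older model.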

Architecture

gnani-vachana (>=0.3.0)   ← Core SDK (REST, WebSocket, SSE clients, voice/language constants)
    ↑
livekit-plugins-gnani     ← This package (LiveKit Agents adapter)

This plugin is a thin adapter that wraps the gnani-vachana SDK into LiveKit's stt.STT and tts.TTS base classes. Voice lists, language constants, and model definitions are shared with the core SDK.
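
Since the plugin depends on gnani-vachana 0.3.0 or newer, a quick check of the installed SDK version can help when debugging environment problems. This is a minimal sketch using the standard library's `importlib.metadata`; the helper names are hypothetical, and the version parsing is deliberately simple (it compares only the first three numeric components, so a pre-release such as `0.3.0rc1` is treated like `0.3.0`).

```python
from importlib import metadata

# Minimum SDK version from the architecture diagram above.
MINIMUM_SDK = (0, 3, 0)


def parse_release(version):
    """Turn '0.3.1' into (0, 3, 1), reading leading digits of each part."""
    numbers = []
    for piece in version.split(".")[:3]:
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        numbers.append(int(digits or 0))
    while len(numbers) < 3:
        numbers.append(0)
    return tuple(numbers)


def installed_sdk_version(dist_name="gnani-vachana"):
    """Return the installed version string, or None if not installed."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


def sdk_is_recent_enough(version):
    """True when a version string meets the documented minimum."""
    return version is not None and parse_release(version) >= MINIMUM_SDK
```

In practice pip enforces this constraint at install time, so the check is mostly relevant for environments assembled by hand or with pinned, possibly stale, lock files.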

Documentation

License

Apache 2.0 — see LICENSE.

Release history

Version	Released
0.4.3	May 22, 2026
0.4.2	May 22, 2026
0.4.1	May 22, 2026
0.4.0	May 22, 2026
0.3.5	May 22, 2026
0.3.2	May 22, 2026
0.3.1	May 21, 2026
0.3.0 (this version)	May 21, 2026
0.2.0	May 13, 2026
0.1.0	May 12, 2026

Download files

Source Distribution

livekit_plugins_gnani-0.3.0.tar.gz (12.2 kB)

Uploaded May 21, 2026 Source

Built Distribution

livekit_plugins_gnani-0.3.0-py3-none-any.whl (15.2 kB)

Uploaded May 21, 2026 Python 3

File details

Details for the file livekit_plugins_gnani-0.3.0.tar.gz.

File metadata

Download URL: livekit_plugins_gnani-0.3.0.tar.gz
Upload date: May 21, 2026
Size: 12.2 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for livekit_plugins_gnani-0.3.0.tar.gz
Algorithm	Hash digest
SHA256	`b136d3c4e1909365c0adde83a3d9dd848e8370bb46b5ece865d5e1dd5b873a86`
MD5	`7fe573857382ec3072554b5068c37219`
BLAKE2b-256	`52df978572972cc923ab99c74c82269b9d7e6cda2b74f1391d7ea883102e02c3`
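
A downloaded file can be compared against the published SHA256 digest with the standard library. The sketch below copies the SHA256 values from the "File hashes" tables on this page; the helper names `sha256_of` and `matches_published_hash` are hypothetical.

```python
import hashlib

# SHA256 digests copied from the "File hashes" tables on this page.
EXPECTED_SHA256 = {
    "livekit_plugins_gnani-0.3.0.tar.gz":
        "b136d3c4e1909365c0adde83a3d9dd848e8370bb46b5ece865d5e1dd5b873a86",
    "livekit_plugins_gnani-0.3.0-py3-none-any.whl":
        "fedd40518af8da194e0fff5a72b4858c9db6197031d117e6b2e7d115e8d2772d",
}


def sha256_of(path, chunk_size=65536):
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, filename):
    """Compare a local file against the digest published for `filename`."""
    return sha256_of(path) == EXPECTED_SHA256[filename]
```

For example, `matches_published_hash("dist/livekit_plugins_gnani-0.3.0.tar.gz", "livekit_plugins_gnani-0.3.0.tar.gz")` should return `True` for an unmodified download, where the `dist/` path is only a placeholder for wherever the file was saved.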

Provenance

The following attestation bundles were made for livekit_plugins_gnani-0.3.0.tar.gz:

Publisher: workflow.yml on Gnani-AI-Mintlify/livekit-plugins-gnani

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: livekit_plugins_gnani-0.3.0.tar.gz
- Subject digest: b136d3c4e1909365c0adde83a3d9dd848e8370bb46b5ece865d5e1dd5b873a86
- Sigstore transparency entry: 1592052744
- Sigstore integration time: May 21, 2026
Source repository:
- Permalink: Gnani-AI-Mintlify/livekit-plugins-gnani@40ba14084fd05de3c2b0dfbf03b6cce2938392bd
- Branch / Tag: refs/tags/v0.3.0
- Owner: https://github.com/Gnani-AI-Mintlify
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: workflow.yml@40ba14084fd05de3c2b0dfbf03b6cce2938392bd
- Trigger Event: release

File details

Details for the file livekit_plugins_gnani-0.3.0-py3-none-any.whl.

File metadata

Download URL: livekit_plugins_gnani-0.3.0-py3-none-any.whl
Upload date: May 21, 2026
Size: 15.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for livekit_plugins_gnani-0.3.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`fedd40518af8da194e0fff5a72b4858c9db6197031d117e6b2e7d115e8d2772d`
MD5	`69e04ab39491e1abfd7685ac0f656208`
BLAKE2b-256	`c1970a1f461c37bd43b2c322a25bc42dc1da8dee9a55461bfc54769bce401f83`

Provenance

The following attestation bundles were made for livekit_plugins_gnani-0.3.0-py3-none-any.whl:

Publisher: workflow.yml on Gnani-AI-Mintlify/livekit-plugins-gnani

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: livekit_plugins_gnani-0.3.0-py3-none-any.whl
- Subject digest: fedd40518af8da194e0fff5a72b4858c9db6197031d117e6b2e7d115e8d2772d
- Sigstore transparency entry: 1592052757
- Sigstore integration time: May 21, 2026
Source repository:
- Permalink: Gnani-AI-Mintlify/livekit-plugins-gnani@40ba14084fd05de3c2b0dfbf03b6cce2938392bd
- Branch / Tag: refs/tags/v0.3.0
- Owner: https://github.com/Gnani-AI-Mintlify
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: workflow.yml@40ba14084fd05de3c2b0dfbf03b6cce2938392bd
- Trigger Event: release
