AI Agent SDK on Huddle01 dRTC Network

These details have not been verified by PyPI

Project description

The dRTC Infra for AI.

Overview 🚀

This repository aims to bridge the gap between Artificial Intelligence (AI) and Real-Time Communication (RTC) technologies. It provides a set of examples and tutorials to help developers integrate AI into their WebRTC applications.

Features 🎯

Agents: AI-powered agents that can be integrated into WebRTC applications, such as chatbots, voicebots, and video bots.
LLM Models: API for integrating Large Language Models (LLM) into WebRTC applications, such as Realtime API, Text-to-Speech, and Speech-to-Text.

Quick Start 🚀

To install the core Agents library:

pip install ai01

Internally the Library uses huddle01-ai package to interact with the dRTC Network and build WebRTC Connections.

Basic Usage 📝

import asyncio
from ai01.agent import Agent, AgentOptions
from ai01.providers.openai import AudioTrack
from ai01.rtc import RTCOptions, Role, HuddleClientOptions

# Configure RTC options
rtc_options = RTCOptions(
    api_key="your_huddle_api_key",
    project_id="your_project_id",
    room_id="your_room_id",
    role=Role.HOST,
    metadata={"displayName": "AI Agent"}
)

# Initialize Agent
agent = Agent(
    options=AgentOptions(
        rtc_options=rtc_options,
        audio_track=AudioTrack()
    )
)

Module Documentation 📖

The core module can be broken down into the following submodules:

ai01.agent: Core module for creating AI agents that can be integrated into WebRTC applications, each agent can be considered as a separate entity that can be connected to a dRTC room, and can interact with other agents and users.
ai01.providers: Module for integrating AI providers into the core module, such as OpenAI, Google Cloud, and Microsoft Azure, and also exposes API to integrate custom AI providers.
ai01.rtc: Module for integrating Real-Time Communication (RTC) technologies into the core module, such as Huddle, Twilio, and Agora, and also exposes API to integrate custom RTC providers, Each Agent in itself is connected to an RTC room.

Anything can be built with mixing and matching different agents with different models in any pattern which is suitable for the application.

Agent Module

The Agent module provides the core functionality for creating AI agents that can participate in real-time communication rooms.

An Agent is a high-level entity that can:

Connect to dRTC network
Interact with AI models
Process media streams
Handle room events

Agent Methods

join()

Join Method is used to join the dRTC Network and establish a websocket connection, upon successful connection the agent is assigned a room using which agent can setup necessary event listeners before calling connect method.

from ai01.agent import Agent, AgentOptions
from ai01.rtc import RoomEvents

room = await agent.join()

@room.on(RoomEvent.RoomJoined)
def on_room_joined():
    print("Room Joined")

Note: The join method is an async method and should be awaited before calling any other method.

connect()

Connect Method is used to establish a WebRTC Connection with the room, upon successful connection the agent can start sending and receiving media streams.

await agent.connect()

Events

The Agent module exposes a set of events that can be used to handle room events

from ai01.rtc import RoomEvents, RoomEventsData

@room.on(RoomEvents.NewPeerJoined)
def on_room_joined(data: RoomEventsData.NewPeerJoined):
    print("New Peer Joined the Room",data['remote_peer_id'])

The following events are supported:

RoomJoined: Triggered when the agent successfully joins the room, which means the agent is successfully connected to the dRTC network.
RoomJoinFailed: Triggered when the agent fails to join the room, which means the agent is not connected to the dRTC network.
RoomClosed: Triggered when the room is closed, which means the room is no longer available.
RoomConnecting: Triggered when the agent successfully connects to the room, which means the agent is connected to the room.
NewPeerJoined: Triggered when a new peer joins the room, which means a new peer is connected to the room.
PeerLeft: Triggered when a peer leaves the room, which means a peer is disconnected from the room.
RemoteProducerAdded: Triggered when a remote producer is added to the room, which means a remote producer is added to the room.
RemoteProducerRemoved: Triggered when a remote producer is removed from the room, which means a remote producer is removed from the room.
NewConsumerAdded: Triggered when a new consumer is added to the room, which means a new consumer is added to the room.
ConsumerClosed: Triggered when a consumer is closed, which means a consumer is closed.
ConsumerPaused: Triggered when a consumer is paused, which means a consumer is paused.
ConsumerResumed: Triggered when a consumer is resumed, which means a consumer is resumed.

Providers Module

The Providers module provides a set of APIs for integrating AI providers into the core module.

Currently, the following providers are supported:

OpenAI:
1. Realtime API -> ALPHA
2. Speech-to-Text -> ALPHA
3. Text-to-Speech -> TODO
Gemini: TODO
Anthropic: TODO
grok: TODO

OpenAI Provider

The OpenAI provider provides an API for integrating OpenAI models into the core module.

Note: Currently, the OpenAI Realtime API is only available as provider. Issues are open for adding more providers and models, feel free to open a PR or Issue for adding more providers and models.

Realtime API

Realtime API is a most advanced model by Open_AI which provides direct voice-to-voice communication with the model, which makes it very powerful and the response time is the fastest among all the models.

from ai01.providers.openai.realtime import RealTimeModel, RealTimeModelOptions

openai_api_key = os.getenv("OPENAI_API_KEY")


llm = RealTimeModel(
    agent=agent,
    options=RealTimeModelOptions(
        oai_api_key=openai_api_key,
        instructions=bot_prompt,
    ),
)

await llm.connect()

This model is still in Beta, but we actively working on it to make it more stable and reliable, and also adding more features to it.

Methods

connect(): Connect method is used to establish a connection with the OpenAI Realtime API, upon successful connection the agent can start sending and receiving media streams.

await llm.connect()

Upon successful connection, the agent can start sending and receiving media streams, to and from the OpenAI Realtime API.

Events

Right now the Model pushes all the streams to the audio_track of the agent, and the agent can process the audio stream as per the requirement.

RTC Module

The Core RTC Module is built on-top of the huddle01 python package, for detailed documentation refer to the documentaion of the huddle01 package on pypi

Contributing 🤝

This Repository is under active development, and we are actively looking for contributors to help us build this project.

If you are interested in contributing to this project, please refer to the Contributing Guidelines.

Keep checking for Latest Issues and PRs, and feel free to open an Issue or PR for any feature or bug.

You can also join the Huddle01 Discord Community for any queries or discussions.

Discord: Huddle01 Discord

Setting Up the Project Locally 🛠️

To set up the project locally, follow the steps mentioned in the Setup Guide

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.2.18

May 13, 2025

0.2.17

Feb 26, 2025

0.2.16

Jan 21, 2025

0.2.15

Jan 20, 2025

0.2.14

Jan 17, 2025

0.2.13

Jan 17, 2025

0.2.12

Jan 16, 2025

0.2.11

Jan 16, 2025

0.2.10

Jan 16, 2025

0.2.9

Jan 16, 2025

0.2.8

Jan 14, 2025

0.2.7

Jan 14, 2025

0.2.6

Dec 24, 2024

0.2.5

Dec 6, 2024

0.2.4

Dec 6, 2024

0.2.3

Dec 6, 2024

0.2.2

Dec 3, 2024

0.2.1

Dec 3, 2024

0.2.0

Dec 2, 2024

0.1.1a0 pre-release

Dec 2, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ai01-0.2.18.tar.gz (23.3 kB view details)

Uploaded May 13, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

ai01-0.2.18-py3-none-any.whl (27.7 kB view details)

Uploaded May 13, 2025 Python 3

File details

Details for the file ai01-0.2.18.tar.gz.

File metadata

Download URL: ai01-0.2.18.tar.gz
Upload date: May 13, 2025
Size: 23.3 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: poetry/1.8.3 CPython/3.12.6 Darwin/24.4.0

File hashes

Hashes for ai01-0.2.18.tar.gz
Algorithm	Hash digest
SHA256	`4a558eeb6f1706481f6204b4ba5ec92af483e14a17ca271fbb22c4a638424eda`
MD5	`17457adc98855f3d6f7ebeb2be3a896f`
BLAKE2b-256	`773f41fac92c4c9182ce0e40e12a6275e569e65c050d4d854f060f45ac0742f7`

See more details on using hashes here.

File details

Details for the file ai01-0.2.18-py3-none-any.whl.

File metadata

Download URL: ai01-0.2.18-py3-none-any.whl
Upload date: May 13, 2025
Size: 27.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: poetry/1.8.3 CPython/3.12.6 Darwin/24.4.0

File hashes

Hashes for ai01-0.2.18-py3-none-any.whl
Algorithm	Hash digest
SHA256	`1bbd800e38e8678a27c9f6532a74152c3b76b9e7ad30a942e379345a79204c1d`
MD5	`bd8aed642f41469e7b16d1dc3bed4032`
BLAKE2b-256	`9f6d26b67e4892711ce9a68b2ffc38802fad214f1e04124847dddeb07ce9717c`

See more details on using hashes here.

ai01 0.2.18

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

Overview 🚀

Features 🎯

Quick Start 🚀

Basic Usage 📝

Module Documentation 📖

Agent Module

Agent Methods

Events

Providers Module

OpenAI Provider

Realtime API

Methods

Events

RTC Module

Contributing 🤝

Setting Up the Project Locally 🛠️

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes