Skip to main content

Humanlike avatar plugin for Pipecat — real-time talking-head video with expression control

Project description

pipecat-ai-humanlike

Humanlike avatar plugin for Pipecat — real-time talking-head video with expression control.

Streams TTS audio to a Humanlike GPU orchestrator and receives lip-synced video frames with facial expressions guided by a natural-language prompt.

Installation

pip install pipecat-ai-humanlike

For faster audio resampling:

pip install pipecat-ai-humanlike[fast]

Quick Start

from pipecat.services.humanlike import HumanlikeVideoService

humanlike = HumanlikeVideoService(
    ws_url="ws://your-gpu-server:8000/ws/stream",
    image="./face.png",
    avatar_model="humanlike-homo",
    prompt="warm, friendly, subtly smiling, occasional nods",
)

pipeline = Pipeline([
    transport.input(),
    stt,
    context_aggregator.user(),
    llm,
    tts,
    humanlike,            # after TTS, before transport output
    transport.output(),
    context_aggregator.assistant(),
])

Parameters

Parameter Type Default Description
ws_url str ws://127.0.0.1:8000/ws/stream WebSocket URL of the Humanlike GPU orchestrator
image str | bytes ./face.png Path to a face image, or raw PNG/JPEG bytes
avatar_model str humanlike-homo Model identifier
prompt str warm, friendly, subtly smiling Expression prompt guiding facial behaviour
seed int 42 Random seed for reproducible generation
video_width int 512 Output video width
video_height int 512 Output video height

Live Expression Updates

Update the expression prompt during a live session:

await humanlike.update_prompt("excited, wide eyes, big smile")

How It Works

  1. On pipeline start, connects to the orchestrator via WebSocket and sends the face image + config
  2. Intercepts TTSAudioRawFrame from the TTS service, resamples to 16 kHz mono, and streams PCM chunks to the orchestrator
  3. Receives JPEG video frames back, decodes them, and pushes OutputImageRawFrame downstream
  4. All frames (including audio) pass through so the user still hears the TTS output
  5. Shows the reference image as an idle frame until the first GPU-generated frame arrives

License

Apache-2.0

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pipecat_ai_humanlike-0.1.1.tar.gz (7.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pipecat_ai_humanlike-0.1.1-py3-none-any.whl (6.6 kB view details)

Uploaded Python 3

File details

Details for the file pipecat_ai_humanlike-0.1.1.tar.gz.

File metadata

  • Download URL: pipecat_ai_humanlike-0.1.1.tar.gz
  • Upload date:
  • Size: 7.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.3

File hashes

Hashes for pipecat_ai_humanlike-0.1.1.tar.gz
Algorithm Hash digest
SHA256 aa9e09816d1154464ac2a04aa6bcae93e227cacc888ba3297a81e8b1bb8eb20d
MD5 d0cab32485686fbffaf81cd69155dace
BLAKE2b-256 234644e5ca8c2c25d391780a24807ce245603b825a6bb89d133875a72d0ea8b8

See more details on using hashes here.

File details

Details for the file pipecat_ai_humanlike-0.1.1-py3-none-any.whl.

File metadata

File hashes

Hashes for pipecat_ai_humanlike-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 876f979cac46f937e96abac79792976aa157a62c38653c092036fc0e0a77ae99
MD5 94f866da78ef90e6b8a220175bcb45c2
BLAKE2b-256 e460ba2c36c46a5dbcbfc33e7eef5b68306ffbfbc6b6d9fb156ce7f7dcd78345

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page