pipecat-ai-humanlike

Humanlike avatar plugin for Pipecat — real-time talking-head video with expression control.

Streams TTS audio to a Humanlike GPU orchestrator and receives lip-synced video frames with facial expressions guided by a natural-language prompt.

Installation

pip install pipecat-ai-humanlike

For faster audio resampling, install the optional "fast" extra (quoted so shells such as zsh do not expand the brackets):

pip install "pipecat-ai-humanlike[fast]"

Quick Start

from pipecat.pipeline.pipeline import Pipeline
from pipecat.services.humanlike import HumanlikeVideoService

avatar = HumanlikeVideoService(
    ws_url="ws://your-gpu-server:8000/ws/stream",
    image="./face.png",
    avatar_model="humanlike-homo",
    prompt="warm, friendly, subtly smiling, occasional nods",
)

# transport, stt, llm, tts, and context_aggregator are assumed to be
# configured elsewhere in your Pipecat application.
pipeline = Pipeline([
    transport.input(),
    stt,
    context_aggregator.user(),
    llm,
    tts,
    avatar,               # after TTS, before transport output
    transport.output(),
    context_aggregator.assistant(),
])

Parameters

| Parameter | Type | Default | Description |
|---|---|---|---|
| ws_url | str | ws://127.0.0.1:8000/ws/stream | WebSocket URL of the Humanlike GPU orchestrator |
| image | str \| bytes | ./face.png | Path to a face image, or raw PNG/JPEG bytes |
| avatar_model | str | humanlike-homo | Model identifier |
| prompt | str | warm, friendly, subtly smiling | Expression prompt guiding facial behaviour |
| seed | int | 42 | Random seed for reproducible generation |
| video_width | int | 512 | Output video width in pixels |
| video_height | int | 512 | Output video height in pixels |
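
Because image accepts either a path or raw PNG/JPEG bytes, it can help to load and sanity-check the image before constructing the service. The sketch below is one way to do that with the standard library only. The helper name load_face_image is hypothetical and not part of the plugin, and the check looks only at the standard PNG and JPEG file signatures.

```python
from pathlib import Path
from typing import Union

# Standard file signatures (magic numbers) for the two supported formats.
PNG_MAGIC = b"\x89PNG\r\n\x1a\n"
JPEG_MAGIC = b"\xff\xd8\xff"


def load_face_image(image: Union[str, bytes]) -> bytes:
    """Hypothetical helper: return raw image bytes from a path or from bytes.

    Raises ValueError if the data does not start with a PNG or JPEG signature.
    """
    data = Path(image).read_bytes() if isinstance(image, str) else bytes(image)
    if not (data.startswith(PNG_MAGIC) or data.startswith(JPEG_MAGIC)):
        raise ValueError("expected PNG or JPEG data")
    return data
```

The returned bytes could then be passed as the image argument in place of a file path.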

Live Expression Updates

Update the expression prompt during a live session:

await avatar.update_prompt("excited, wide eyes, big smile")
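
To change expressions on a schedule, for example as a conversation moves between phases, the call above can be wrapped in a small asyncio loop. This is a minimal sketch: FakeAvatar is a stand-in that only records prompts so the example runs without a GPU server, and run_expression_schedule is a hypothetical helper, not part of the plugin. The only plugin behaviour assumed is that update_prompt is awaitable and takes the prompt string, as shown above.

```python
import asyncio


class FakeAvatar:
    """Stand-in for HumanlikeVideoService; it only records prompts."""

    def __init__(self) -> None:
        self.prompts: list[str] = []

    async def update_prompt(self, prompt: str) -> None:
        self.prompts.append(prompt)


async def run_expression_schedule(avatar, schedule) -> None:
    """Apply (delay_seconds, prompt) pairs in order, sleeping before each update."""
    for delay, prompt in schedule:
        await asyncio.sleep(delay)
        await avatar.update_prompt(prompt)


avatar = FakeAvatar()
asyncio.run(run_expression_schedule(avatar, [
    (0.0, "calm, attentive, slight smile"),
    (0.01, "excited, wide eyes, big smile"),
]))
```

In a real session the schedule would more likely be driven by conversation events than by fixed delays, and the coroutine would run as a task alongside the pipeline.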

How It Works

  1. On pipeline start, the service connects to the orchestrator over WebSocket and sends the face image and configuration.
  2. It intercepts each TTSAudioRawFrame from the TTS service, resamples the audio to 16 kHz mono, and streams PCM chunks to the orchestrator.
  3. It receives JPEG video frames back, decodes them, and pushes each one downstream as an OutputImageRawFrame.
  4. All frames, including audio, pass through so the user still hears the TTS output.
  5. Until the first GPU-generated frame arrives, it shows the reference image as an idle frame.
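
The audio preparation in step 2 can be illustrated with a standard-library sketch that downmixes interleaved 16-bit PCM to mono, resamples it to 16 kHz, and splits it into fixed-size chunks. This is not the plugin's own resampler. It swaps in plain linear interpolation with no anti-aliasing filter, and the function names, the little-endian sample format, and the 20 ms chunk size are all assumptions made for illustration.

```python
import array
import sys


def to_mono_16k(pcm: bytes, sample_rate: int, channels: int,
                target_rate: int = 16000) -> bytes:
    """Downmix interleaved 16-bit little-endian PCM to mono, then resample
    to target_rate by linear interpolation (illustrative only)."""
    samples = array.array("h")
    samples.frombytes(pcm)
    if sys.byteorder == "big":
        samples.byteswap()  # input is assumed to be little-endian
    frames = len(samples) // channels
    if frames == 0:
        return b""
    # Average the channels of each frame to get one mono sample per frame.
    mono = [sum(samples[i * channels:(i + 1) * channels]) / channels
            for i in range(frames)]
    out = array.array("h")
    for n in range(frames * target_rate // sample_rate):
        pos = n * sample_rate / target_rate      # position in the source signal
        i = int(pos)
        frac = pos - i
        nxt = mono[min(i + 1, frames - 1)]       # clamp at the final sample
        out.append(int(round(mono[i] * (1 - frac) + nxt * frac)))
    if sys.byteorder == "big":
        out.byteswap()  # emit little-endian bytes
    return out.tobytes()


def chunk_pcm(pcm: bytes, chunk_ms: int = 20, rate: int = 16000) -> list[bytes]:
    """Split mono 16-bit PCM into chunks of chunk_ms milliseconds each."""
    step = rate * chunk_ms // 1000 * 2           # 2 bytes per sample
    return [pcm[i:i + step] for i in range(0, len(pcm), step)]
```

For example, 100 ms of 48 kHz stereo audio (4800 frames) becomes 1600 mono samples at 16 kHz, which chunk_pcm splits into five 640-byte chunks of 20 ms each.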

License

Apache-2.0
