Skip to main content

Humanlike avatar plugin for Pipecat — real-time talking-head video with expression control

Project description

pipecat-ai-humanlike

Humanlike avatar plugin for Pipecat — real-time talking-head video with expression control.

Streams TTS audio to a Humanlike GPU orchestrator and receives lip-synced video frames with facial expressions guided by a natural-language prompt.

Installation

pip install pipecat-ai-humanlike

For faster audio resampling:

pip install pipecat-ai-humanlike[fast]

Quick Start

from pipecat.services.humanlike import HumanlikeVideoService

humanlike = HumanlikeVideoService(
    ws_url="ws://your-gpu-server:8000/ws/stream",
    image="./face.png",
    avatar_model="humanlike-homo",
    prompt="warm, friendly, subtly smiling, occasional nods",
)

pipeline = Pipeline([
    transport.input(),
    stt,
    context_aggregator.user(),
    llm,
    tts,
    humanlike,            # after TTS, before transport output
    transport.output(),
    context_aggregator.assistant(),
])

Parameters

Parameter Type Default Description
ws_url str ws://127.0.0.1:8000/ws/stream WebSocket URL of the Humanlike GPU orchestrator
image str | bytes ./face.png Path to a face image, or raw PNG/JPEG bytes
avatar_model str humanlike-homo Model identifier
prompt str warm, friendly, subtly smiling Expression prompt guiding facial behaviour
seed int 42 Random seed for reproducible generation
video_width int 512 Output video width
video_height int 512 Output video height

Live Expression Updates

Update the expression prompt during a live session:

await humanlike.update_prompt("excited, wide eyes, big smile")

How It Works

  1. On pipeline start, connects to the orchestrator via WebSocket and sends the face image + config
  2. Intercepts TTSAudioRawFrame from the TTS service, resamples to 16 kHz mono, and streams PCM chunks to the orchestrator
  3. Receives JPEG video frames back, decodes them, and pushes OutputImageRawFrame downstream
  4. All frames (including audio) pass through so the user still hears the TTS output
  5. Shows the reference image as an idle frame until the first GPU-generated frame arrives

License

Apache-2.0

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pipecat_ai_humanlike-0.1.2.tar.gz (7.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pipecat_ai_humanlike-0.1.2-py3-none-any.whl (6.7 kB view details)

Uploaded Python 3

File details

Details for the file pipecat_ai_humanlike-0.1.2.tar.gz.

File metadata

  • Download URL: pipecat_ai_humanlike-0.1.2.tar.gz
  • Upload date:
  • Size: 7.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.3

File hashes

Hashes for pipecat_ai_humanlike-0.1.2.tar.gz
Algorithm Hash digest
SHA256 068b8e6691112153f18fdefa91061d16d535ef20394d41ca82abc5ac319ead97
MD5 ff7c5a72afc7c5c986c6c0d554b3c501
BLAKE2b-256 1e86a49b634fceb1e3b9f871f88c55c409a4e735f01001683a6a6a02a53447b3

See more details on using hashes here.

File details

Details for the file pipecat_ai_humanlike-0.1.2-py3-none-any.whl.

File metadata

File hashes

Hashes for pipecat_ai_humanlike-0.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 e039bbbeded63fb73da5620f22a94871e48c6066b34c9378831d082b54c9d91f
MD5 8134d3386a216740188eee7b463cfc22
BLAKE2b-256 9d4655f346cca30ce84bf3801f58b3795ca1affa719794c3b5656fd179240d77

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page