Overview

Agent Human provides native integrations for two widely used real-time voice AI frameworks: Pipecat and LiveKit Agents. Both integrations handle session management, audio streaming, and video delivery, so you can add a talking avatar to your pipeline with a few lines of code.

Choose Your Integration

Pipecat

AgentHumanVideoService slots into any Pipecat pipeline right after your TTS service. It sends TTS audio to the avatar and injects the avatar’s video frames back into your pipeline.

LiveKit Agents

AvatarSession drops into your LiveKit Agents entrypoint and renders your agent’s voice output as a talking avatar video in the LiveKit room.

How It Works

Both integrations follow the same underlying flow:
  1. Create a session — the integration calls the Agent Human REST API to provision an avatar server
  2. Stream audio — your pipeline’s TTS audio is forwarded to the avatar server in real time
  3. Receive video — the avatar server generates synchronized talking-head video and publishes it to your video room
You never need to manage sessions, tokens, or audio encoding manually — the integration handles all of it.
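
To make the three steps concrete, here is a minimal, self-contained sketch of the flow in Python. It does not use the real Agent Human, Pipecat, or LiveKit APIs. Every name in it (`FakeAvatarServer`, `create_session`, `tts_audio`, `run_flow`) is hypothetical, the avatar server is simulated in memory, and the audio format (16 kHz, 16-bit mono PCM in 20 ms chunks) is an assumption chosen only for illustration.

```python
import asyncio
from dataclasses import dataclass, field


@dataclass
class FakeAvatarServer:
    """Hypothetical in-memory stand-in for a provisioned avatar server."""
    session_id: str
    video_room: list = field(default_factory=list)

    async def push_audio(self, chunk: bytes) -> None:
        # Step 3 (simulated): publish one placeholder "video frame"
        # to the room for every audio chunk received.
        self.video_room.append(f"frame<{len(chunk)} bytes of audio>")


async def create_session(api_key: str) -> FakeAvatarServer:
    # Step 1 (simulated): a real integration would call the Agent Human
    # REST API here. The session ID format is made up for this sketch.
    return FakeAvatarServer(session_id=f"session-{len(api_key)}")


async def tts_audio():
    # Placeholder for a TTS service: yields three 20 ms chunks of silence.
    # Assumed format: 16 kHz, 16-bit mono PCM -> 640 bytes per chunk.
    for _ in range(3):
        yield bytes(640)


async def run_flow(api_key: str) -> FakeAvatarServer:
    server = await create_session(api_key)   # 1. create a session
    async for chunk in tts_audio():          # 2. stream audio in real time
        await server.push_audio(chunk)
    return server                            # 3. video frames are in the room


server = asyncio.run(run_flow("demo-key"))
print(server.session_id, len(server.video_room))
```

In the actual integrations these steps happen inside AgentHumanVideoService or AvatarSession, so application code would not contain a loop like this one. The sketch only shows the order of operations.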

Prerequisites

Next Steps

Pipecat Quick Start

Add a talking avatar to a Pipecat bot in minutes

LiveKit Quick Start

Add a talking avatar to a LiveKit agent in minutes

API Reference

REST API for sessions, avatars, and usage