Introduction - Documentation

Overview

Agent Human provides native integrations for the two most popular real-time voice AI frameworks — Pipecat and LiveKit Agents. Both integrations handle all the complexity of session management, audio streaming, and video delivery so you can add a talking avatar to your pipeline with just a few lines of code.

Choose Your Integration

Pipecat

AgentHumanVideoService — slots into any Pipecat pipeline right after your TTS service. Sends TTS audio to the avatar and injects the avatar’s video frames back into your pipeline.

LiveKit Agents

AvatarSession — drops into your LiveKit Agents entrypoint and renders your agent’s voice output as a talking avatar video in the LiveKit room.

How It Works

Both integrations follow the same underlying flow:

Create a session — the integration calls the Agent Human REST API to provision an avatar server
Stream audio — your pipeline’s TTS audio is forwarded to the avatar server in real time
Receive video — the avatar server generates synchronized talking-head video and publishes it to your video room

You never need to manage sessions, tokens, or audio encoding manually — the integration handles all of it.

Prerequisites

An Agent Human account (sign up here)
An API key from Settings → API Keys
An avatar image (URL or base64) from your dashboard

Next Steps

Pipecat Quick Start

Add a talking avatar to a Pipecat bot in minutes

LiveKit Quick Start

Add a talking avatar to a LiveKit agent in minutes

API Reference

REST API for sessions, avatars, and usage

Pipecat Integration

​Overview

​Choose Your Integration