ianchiu111/Pipecat

Welcome

This project implements a real-time, multi-user conversational AI voice agent. The system is split into a Next.js web frontend and a Python AI backend, which exchange audio and data over WebRTC.

System: Frontend (Client & Token API)

  • Tech Stack: Next.js, React, TypeScript.
  • RTC Interface: Utilizes @livekit/components-react to build a responsive, real-time meeting room UI.
  • Authentication: A Next.js API route uses the livekit-server-sdk to securely generate dynamic room access tokens based on user inputs.
  • Real-time UI: Listens to WebRTC Data Channels to instantly render AI-generated transcripts.
  • Notes: Summarizes the conversation as a mind map, rendered with react-markdown.
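
To make the Authentication step more concrete: a LiveKit room access token is a JWT signed with the API secret, carrying the participant identity and a `video` grant that names the room. In this project the token is generated by livekit-server-sdk inside a Next.js API route; the sketch below only illustrates the shape of such a token using the Python standard library. The function names, key, secret, and TTL are placeholders chosen for illustration, not taken from the repository.

```python
import base64
import hashlib
import hmac
import json
import time


def _b64url(data: bytes) -> str:
    """Base64url-encode without padding, as JWT segments require."""
    return base64.urlsafe_b64encode(data).rstrip(b"=").decode("ascii")


def make_room_token(api_key, api_secret, room, identity, ttl_seconds=3600, now=None):
    """Build an HS256-signed JWT granting `identity` permission to join `room`.

    Illustrative only: a real deployment should use the official server SDK.
    """
    now = int(time.time()) if now is None else now
    header = {"alg": "HS256", "typ": "JWT"}
    claims = {
        "iss": api_key,            # API key identifies the issuer
        "sub": identity,           # participant identity
        "nbf": now,                # not valid before
        "exp": now + ttl_seconds,  # expiry
        "video": {"room": room, "roomJoin": True},
    }
    signing_input = ".".join(
        _b64url(json.dumps(part, separators=(",", ":")).encode("utf-8"))
        for part in (header, claims)
    )
    signature = hmac.new(
        api_secret.encode("utf-8"), signing_input.encode("ascii"), hashlib.sha256
    ).digest()
    return signing_input + "." + _b64url(signature)


def decode_claims(token: str) -> dict:
    """Decode the claims segment WITHOUT verifying the signature (debug aid)."""
    payload = token.split(".")[1]
    padded = payload + "=" * (-len(payload) % 4)
    return json.loads(base64.urlsafe_b64decode(padded))
```

Because the secret signs the token, this logic belongs on the server side only; the browser receives the finished token and never sees the secret.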

System: Backend (AI Voice Agent)

  • Tech Stack: Python, Pipecat AI Framework.
  • AI Pipeline: Orchestrates Voice Activity Detection (VAD), Speech-to-Text (STT), LLM processing, and Text-to-Speech (TTS) into a continuous, low-latency stream.
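
The stage ordering in the AI Pipeline can be sketched with plain Python generators. This is a conceptual stand-in, not Pipecat's API: the stage functions, the frame dictionaries, and the `transcribe`, `reply`, and `synthesize` callables are all hypothetical placeholders for the real VAD, STT, LLM, and TTS services.

```python
from typing import Callable, Iterable, Iterator, List


def vad(frames: Iterable[dict]) -> Iterator[List[bytes]]:
    """Group consecutive speech frames into utterances; silence closes one."""
    utterance: List[bytes] = []
    for frame in frames:
        if frame["speech"]:
            utterance.append(frame["audio"])
        elif utterance:
            yield utterance
            utterance = []
    if utterance:  # flush speech still buffered when the input ends
        yield utterance


def stt(utterances: Iterable[List[bytes]], transcribe: Callable[[bytes], str]) -> Iterator[str]:
    """Turn each utterance's audio into text."""
    for chunks in utterances:
        yield transcribe(b"".join(chunks))


def llm(texts: Iterable[str], reply: Callable[[str], str]) -> Iterator[str]:
    """Produce a response for each transcribed utterance."""
    for text in texts:
        yield reply(text)


def tts(replies: Iterable[str], synthesize: Callable[[str], bytes]) -> Iterator[bytes]:
    """Convert each response back into audio."""
    for text in replies:
        yield synthesize(text)


def build_pipeline(frames, transcribe, reply, synthesize) -> Iterator[bytes]:
    """Chain the stages in order: VAD -> STT -> LLM -> TTS."""
    return tts(llm(stt(vad(frames), transcribe), reply), synthesize)
```

Generators are lazy, so each utterance passes through every stage before the next frame is read, which mirrors the streaming behaviour described above. A real agent would run these stages asynchronously and handle interruptions, which this sketch leaves out.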

Present

image

Reference

  1. LiveKit
  2. Pipecat

Code Examples

  1. transports-livekit
