- San Francisco, Helsinki
-
11:48
(UTC -07:00) - varunsingh.net
- @vr000m
Highlights
Lists (1)
Sort Name ascending (A-Z)
Starred repositories
An open-source AI agent that brings the power of Gemini directly into your terminal.
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
A TTS model capable of generating ultra-realistic dialogue in one pass.
c/ua is the Docker Container for Computer-Use AI Agents.
Ready-to-use SRT / WebRTC / RTSP / RTMP / LL-HLS media server and media proxy that allows to read, publish, proxy, record and playback video and audio streams.
A Conversational Speech Generation Model
Real Time (WebRTC & WebTransport) Proxy for LLM WebSocket APIs
This is a simple demonstration of more advanced, agentic patterns built on top of the Realtime API.
Instructions on how to use the Realtime API on Microcontrollers and Embedded Platforms
Voice Agent Framework for Conversational AI
CLI scripts for processing Daily raw-tracks recordings.
An example of the cloud infrastructure to store Daily's recordings to an s3 bucket
Work In Progress: ComfyUI for Pipecat pipelines
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Build Phone Calling Voice Agent fully powered by open source models.
Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…
Daily Bots Web Demo showcasing how to build real-time voice AI agents