Members-Only
Recent Talks & Demos are for members only
You must be an AI Tinkerers active member to view these talks and demos.
Building NousyBooks - Orchestrating Low-Latency Multimodal Voice Agents with Gemini Live
Learn to build low-latency multimodal voice agents using Gemini Live. This talk covers bidirectional audio streaming, managing 12 function-calling tools, and handling client-side state for live voice interactions.
I built NousyBooks, an AI-powered storytelling platform where children become the heroes of their own books. I built this project as part of Gemini Live Agent Hackathon Challenge. The core of the experience is “Nousy,” a floating multimodal voice assistant that uses the Gemini Live API to brainstorm story themes, collect character details, and select art styles through natural, bidirectional conversation.