AI agents are revolutionizing automation and real-time communication, but how fast can you build one? In this hands-on session, we’ll build a real-time voice-to-voice translator with open-source tools, walking through every step from speech recognition to translation and voice synthesis.
Designed for AI practitioners and engineers, this interactive workshop will cover:
- Speech-to-text processing with open-source models
- Language translation using LLM-powered tools
- Speech synthesis for real-time responses
- Optimizing latency for seamless interaction
- Deploying with open-source frameworks and APIs
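As a rough illustration of how the stages listed above might chain together, here is a minimal sketch. It is not the workshop's actual code. The names `run_pipeline`, `fake_transcribe`, `fake_translate`, and `fake_synthesize` are hypothetical placeholders. In a real build, each stub would wrap an open-source speech-recognition, translation, or text-to-speech model. The per-stage timers show one simple way to see where latency accumulates.

```python
import time
from dataclasses import dataclass, field
from typing import Callable, Dict


@dataclass
class TranslationResult:
    source_text: str
    translated_text: str
    audio: bytes
    timings_s: Dict[str, float] = field(default_factory=dict)


def run_pipeline(
    audio_in: bytes,
    transcribe: Callable[[bytes], str],
    translate: Callable[[str], str],
    synthesize: Callable[[str], bytes],
) -> TranslationResult:
    """Run speech-to-text, translation, and speech synthesis in sequence,
    recording how long each stage takes."""
    timings: Dict[str, float] = {}

    start = time.perf_counter()
    source_text = transcribe(audio_in)
    timings["speech_to_text"] = time.perf_counter() - start

    start = time.perf_counter()
    translated_text = translate(source_text)
    timings["translation"] = time.perf_counter() - start

    start = time.perf_counter()
    audio_out = synthesize(translated_text)
    timings["speech_synthesis"] = time.perf_counter() - start

    return TranslationResult(source_text, translated_text, audio_out, timings)


# Hypothetical stand-ins: they only mimic the shape of each stage.
def fake_transcribe(audio: bytes) -> str:
    # Assumption: pretend the "audio" bytes are already UTF-8 text.
    return audio.decode("utf-8")


def fake_translate(text: str) -> str:
    # Assumption: a one-word dictionary instead of an LLM call.
    lookup = {"hello": "hola"}
    return lookup.get(text.lower(), text)


def fake_synthesize(text: str) -> bytes:
    # Assumption: return encoded text instead of real audio samples.
    return text.encode("utf-8")


if __name__ == "__main__":
    result = run_pipeline(b"hello", fake_transcribe, fake_translate, fake_synthesize)
    print(result.translated_text)
    for stage, seconds in result.timings_s.items():
        print(f"{stage}: {seconds * 1000:.3f} ms")
```

Passing the stages in as plain callables keeps the pipeline independent of any particular model, so each one can be swapped out and timed separately.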
We’ll demonstrate a live build, with attendees coding alongside us. A GitHub repo will be provided for reproducible learning. Whether you're new to AI automation or refining your prototyping skills, you’ll leave with a functional AI agent and the tools to iterate further.
Bring your laptop, and let’s build!
-------
Celebrate 10 Years of AI Innovation at ODSC East 2025!
Join us on May 13-15, 2025, for three days of immersive learning and networking with AI experts:
https://lu.ma/lqgq5hi2
Use code CommunityEast2025 for an extra discount.