Aug 29
25m 40s

Episode 253 - The Future of Voice? Explo...

Mark and Allen
About this episode

In this episode of Two Voice Devs, Mark and Allen dive into the new experimental Text-to-Speech (TTS) model in Google's Gemini 2.5. They explore its capabilities, from single-speaker to multi-speaker audio generation, and discuss how it's a significant leap from the old days of SSML. They also touch on how this new technology can be integrated with LangChainJS to create more dynamic and natural-sounding voice applications. Is this the return of voice as the primary interface for AI?


[00:00:00] Introduction

[00:00:45] Google's new experimental TTS model for Gemini

[00:01:55] Demo of single-speaker TTS in Google's AI Studio

[00:03:05] Code walkthrough for single-speaker TTS

[00:04:30] Lack of fine-grained control compared to SSML

[00:05:15] Using text cues to shape the TTS output

[00:06:20] Demo of multi-speaker TTS with a script

[00:09:50] Code walkthrough for multi-speaker TTS

[00:11:30] The model is tuned for TTS, not general conversation

[00:12:10] Using a separate LLM to generate a script for the TTS model

[00:13:30] Code walkthrough of the two-function approach with LangChainJS

[00:16:15] LangChainJS integration details

[00:19:00] Is Speech Markdown still relevant?

[00:21:20] Latency issues with the current TTS model

[00:22:00] Caching strategies for TTS

[00:23:30] Voice as the natural UI for AI

[00:25:30] Outro


#Gemini #TTS #VoiceAI #VoiceFirst #AI #Google #LangChainJS #LLM #Developer #Podcast
