logo
episode-header-image
Sep 2022
38m 9s

Episode 111 - Beyond the Assistants: Sen...

Mark and Allen
About this episode

Not every assistant needs to be part of Amazon Alexa or the Google Assistant. What if you're developing your own voice assistant? How do you take care of some tasks like getting output to your users? In this episode, Allen and Mark give an overview of some of the technologies available to you to send audio exactly the way you want it to sound and some of the tools that are available to use.

Resources mentioned:

  • Speech Synthesis Markup Language (SSML) specification - https://www.w3.org/TR/speech-synthesis11/
  • Amazon Lex - https://aws.amazon.com/polly/
  • Google Cloud Text to Speech - https://cloud.google.com/text-to-speech
  • SSML Guru - ssml.guru
  • Speech Markdown - SpeechMarkdown.org
  • Jovo Marketplace TTS - https://www.jovo.tech/marketplace#tts
Up next
Jun 25
Set the scene with Gemini TTS
Roll tape and prompt! In this episode of Two Voice Devs, Allen and Mark explore how Google’s new advanced prompting guidelines turn developers into voice directors for Gemini Text-to-Speech. Instead of coding rigid SSML tags, you can now establish a scene, write stage directions, ... Show More
14m 47s
Jun 11
Project Solara: Welcome to Agent-First Hardware
After months of conferences and busy schedules, Mark Tucker and Allen Firstenberg return to discuss Microsoft’s surprising Build conference announcement: Project Solara. Moving from the legacy voice-first consumer world of Amazon Alexa and Google Assistant, Microsoft is pioneerin ... Show More
15m 30s
Jun 4
New Horizons for Android: XR, MCP, and Agents
Allen and Mike record live from Google I/O in the Builders podcast space. They discuss their impressions of this year's conference, the evolution of I/O over the years, and the big announcements from the keynote. Key topics include Gemini's "any output from any input" vision, how ... Show More
19m 29s
Recommended Episodes
Jun 2024
Generative AI and Hardware Upgrades, Google History and the Dark GPU Theory, Amazon’s Logistics Long Play
A question about generative AI and the hardware powering voice assistants, projecting the GPU future with the dot com bubble as context, and thoughts on Amazon’s capital expenditures and the state of AWS and Amazon Supply Chain. At the end: An Elon hater reviews the Cybertruck. 
1h 10m
Jul 2018
I'm sorry, what did you say?
We explore the history and evolution of speech recognition, one of the foundational technologies behind voice assistants like Siri, Alexa and Google Assistant. Learn more about your ad-choices at https://www.iheartpodcastnetwork.comSee omnystudio.com/listener for privacy informat ... Show More
28m 39s
Apr 2025
Agentic AI for IT Pros with Tim Warner
<p>What can agentic AI do for you? Richard talks to Tim Warner about his work utilizing next generation agentic AI technologies to help with sysadmin tasks. Tim talks about the early lead that Cursor AI took with AI agents capable of writing and executing scripts on your behalf - ... Show More
34m 44s
Dec 2024
AI Voice Technology Just Got INSANE (ElevenLabs GenFM Demo + More)
Episode 38: How revolutionary is the latest in AI voice technology? Matt Wolfe (https://x.com/mreflow) and Nathan Lands (https://x.com/NathanLands) dive deep into this topic with Ammaar Reshi (https://x.com/ammaar), head of design at ElevenLabs and AI enthusiast who has made wave ... Show More
38m 33s
Sep 2024
AI is more than GenAI
GenAI is often what people think of when someone mentions AI. However, AI is much more. In this episode, Daniel breaks down a history of developments in data science, machine learning, AI, and GenAI in this episode to give listeners a better mental model. Don’t miss this one if y ... Show More
40m 3s
Sep 2024
Study Reveals Vulnerabilities in Alexa, Siri, and Google Assistant to Malicious Commands
In this episode, we explore a recent study that uncovers how popular voice assistants like Alexa, Siri, and Google Assistant are susceptible to malicious commands. We discuss the potential risks and what users can do to protect their devices. Get on the AI Box Waitlist: ⁠⁠⁠https: ... Show More
6m 17s
Apr 2024
Episode 294 Alexa's New AI Voice
<p>It looks like a version of the new AI Voice, for Alexa is here. Back in the September Amazon Event, we were told and even got to hear a small segment of what the new Alexa Voice would sound like, join us as we dive into the Alexa Beta and test out the new voice. Its hard to te ... Show More
19m 1s
Oct 2024
A Big Week in Tech: NotebookLM, OpenAI’s Speech API, & Custom Audio
<p>Last week was another big week in technology. </p><p>Google’s NotebookLM introduced its Audio Overview feature, enabling users to create customizable podcasts in over 35 languages. OpenAI followed with their real-time speech-to-speech API, making voice integration easier for d ... Show More
32m 1s
Nov 2024
Build An App with a Backend Using Ai in 20 min (Cursor Ai, Replit, Firebase, Wispr Flow)
Episode 32: How can you build an app with a backend using AI in just 20 minutes? Matt Wolfe (https://x.com/mreflow) and Nathan Lands (https://x.com/NathanLands) sit down with AI enthusiast Riley Brown (https://x.com/rileybrown_ai) to explore this exciting and challenging process. ... Show More
38m 34s