logo
episode-header-image
Sep 2022
38m 9s

Episode 111 - Beyond the Assistants: Sen...

Mark and Allen
About this episode

Not every assistant needs to be part of Amazon Alexa or the Google Assistant. What if you're developing your own voice assistant? How do you take care of some tasks like getting output to your users? In this episode, Allen and Mark give an overview of some of the technologies available to you to send audio exactly the way you want it to sound and some of the tools that are available to use.

Resources mentioned:

  • Speech Synthesis Markup Language (SSML) specification - https://www.w3.org/TR/speech-synthesis11/
  • Amazon Lex - https://aws.amazon.com/polly/
  • Google Cloud Text to Speech - https://cloud.google.com/text-to-speech
  • SSML Guru - ssml.guru
  • Speech Markdown - SpeechMarkdown.org
  • Jovo Marketplace TTS - https://www.jovo.tech/marketplace#tts
Up next
Mar 5
Episode 270 - Beyond the Big Three: Open Models, Agents, & the Future of Devs
In part two of this insightful conversation, Allen and Sam Witteveen dive deep into the rapidly expanding world of AI models beyond the "big three." They explore the impact of open-weight and Chinese models like DeepSeek, Mistral, and Qwen, discussing their impressive efficiency ... Show More
49m 18s
Mar 3
Episode 269 - The "Big Three" AI Models and Training Evolution
In Part 1 of a two-part series, guest host Sam Witteveen joins Allen to catch up and dive deep into the rapidly evolving world of AI models. Sam shares his fascinating journey from being a successful pop songwriter to becoming a Machine Learning Google Developer Expert (GDE) and ... Show More
37m 34s
Feb 19
Episode 268 - The New @langchain/google Package
Allen has been busy! This week, he unveils the new `@langchain/google` package for LangChain JS. This major update consolidates five previous libraries into a single, standardized, and powerful tool for developers working with Gemini and Vertex AI. Allen walks Mark through the mo ... Show More
18m 7s
Recommended Episodes
Jun 2024
Generative AI and Hardware Upgrades, Google History and the Dark GPU Theory, Amazon’s Logistics Long Play
A question about generative AI and the hardware powering voice assistants, projecting the GPU future with the dot com bubble as context, and thoughts on Amazon’s capital expenditures and the state of AWS and Amazon Supply Chain. At the end: An Elon hater reviews the Cybertruck. 
1h 10m
Jul 2018
I'm sorry, what did you say?
We explore the history and evolution of speech recognition, one of the foundational technologies behind voice assistants like Siri, Alexa and Google Assistant. Learn more about your ad-choices at https://www.iheartpodcastnetwork.comSee omnystudio.com/listener for privacy informat ... Show More
28m 39s
Apr 2025
Agentic AI for IT Pros with Tim Warner
<p>What can agentic AI do for you? Richard talks to Tim Warner about his work utilizing next generation agentic AI technologies to help with sysadmin tasks. Tim talks about the early lead that Cursor AI took with AI agents capable of writing and executing scripts on your behalf - ... Show More
34m 44s
Dec 2024
AI Voice Technology Just Got INSANE (ElevenLabs GenFM Demo + More)
Episode 38: How revolutionary is the latest in AI voice technology? Matt Wolfe (https://x.com/mreflow) and Nathan Lands (https://x.com/NathanLands) dive deep into this topic with Ammaar Reshi (https://x.com/ammaar), head of design at ElevenLabs and AI enthusiast who has made wave ... Show More
38m 33s
Sep 2024
AI is more than GenAI
GenAI is often what people think of when someone mentions AI. However, AI is much more. In this episode, Daniel breaks down a history of developments in data science, machine learning, AI, and GenAI in this episode to give listeners a better mental model. Don’t miss this one if y ... Show More
40m 3s
Sep 2024
Study Reveals Vulnerabilities in Alexa, Siri, and Google Assistant to Malicious Commands
<p>In this episode, we explore a recent study that uncovers how popular voice assistants like Alexa, Siri, and Google Assistant are susceptible to malicious commands. We discuss the potential risks and what users can do to protect their devices.</p> <p><br></p> <p><br></p> <p>Get ... Show More
6m 17s
Apr 2024
Episode 294 Alexa's New AI Voice
<p>It looks like a version of the new AI Voice, for Alexa is here. Back in the September Amazon Event, we were told and even got to hear a small segment of what the new Alexa Voice would sound like, join us as we dive into the Alexa Beta and test out the new voice. Its hard to te ... Show More
19m 1s
Oct 2024
A Big Week in Tech: NotebookLM, OpenAI’s Speech API, & Custom Audio
<p>Last week was another big week in technology. </p><p>Google’s NotebookLM introduced its Audio Overview feature, enabling users to create customizable podcasts in over 35 languages. OpenAI followed with their real-time speech-to-speech API, making voice integration easier for d ... Show More
32m 1s
Nov 2024
Build An App with a Backend Using Ai in 20 min (Cursor Ai, Replit, Firebase, Wispr Flow)
Episode 32: How can you build an app with a backend using AI in just 20 minutes? Matt Wolfe (https://x.com/mreflow) and Nathan Lands (https://x.com/NathanLands) sit down with AI enthusiast Riley Brown (https://x.com/rileybrown_ai) to explore this exciting and challenging process. ... Show More
38m 34s