logo
episode-header-image
Mar 2020
49m 15s

Speech recognition to say it just right

Practical AI LLC
About this episode

Catherine Breslin of Cobalt joins Daniel and Chris to do a deep dive on speech recognition. She also discusses how the technology is integrated into virtual assistants (like Alexa) and is used in other non-assistant contexts (like transcription and captioning). Along the way, she teaches us how to assemble a lexicon, acoustic model, and language model to bring speech recognition to life.

Join the discussion

Changelog++ members support our work, get closer to the metal, and make the ads disappear. Join today!

Sponsors:

  • LinodeOur cloud of choice and the home of Changelog.com. Deploy a fast, efficient, native SSD cloud server for only $5/month. Get 4 months free using the code changelog2019 OR changelog2020. To learn more and get started head to linode.com/changelog.
  • AI Classroom – An immersive, 3 day virtual training in AI with Practical AI co-host Daniel Whitenack
  • FastlyOur bandwidth partner. Fastly powers fast, secure, and scalable digital experiences. Move beyond your content delivery network to their powerful edge cloud platform. Learn more at fastly.com.
  • RollbarWe move fast and fix things because of Rollbar. Resolve errors in minutes. Deploy with confidence. Learn more at rollbar.com/changelog.

Featuring:

Show Notes:

Something missing or broken? PRs welcome!

★ Support this podcast ★
Up next
Jul 7
AI in the shadows: From hallucinations to blackmail
In the first episode of an "AI in the shadows" theme, Chris and Daniel explore the increasing concerning world of agentic misalignment. Starting out with a reminder about hallucinations and reasoning models, they break down how today’s models only mimic reasoning, which can lead ... Show More
44m 50s
Jul 2
Finding Nemotron
In this episode, we sit down with Joey Conway to explore NVIDIA's open source AI, from the reasoning-focused Nemotron models built on top of Llama, to the blazing-fast Parakeet speech model. We chat about what makes open foundation models so valuable, how enterprises can think ab ... Show More
46m 23s
Jun 27
AI hot takes and debates: Autonomy
Can AI-driven autonomy reduce harm, or does it risk dehumanizing decision-making? In this “AI Hot Takes & Debates” series episode, Daniel and Chris dive deep into the ethical crossroads of AI, autonomy, and military applications. They trade perspectives on ethics, precision, resp ... Show More
45m 36s
Recommended Episodes
Mar 2020
Speech recognition to say it just right (Practical AI #82)
Catherine Breslin of Cobalt joins Daniel and Chris to do a deep dive on speech recognition. She also discusses how the technology is integrated into virtual assistants (like Alexa) and is used in other non-assistant contexts (like transcription and captioning). Along the way, she ... Show More
49m 14s
Feb 2022
🌍 AI in Africa - Voice & language tools (Practical AI #167)
In the third of the “AI in Africa” spotlight episodes, we welcome Kathleen Siminyu, who is building Kiswahili voice tools at Mozilla. We had a great discussion with Kathleen about creating more diverse voice and language datasets, involving local language communities in NLP work, ... Show More
43m 37s
Jul 2018
Welcome to Away from Keyboard (Away from Keyboard #0)
Away from Keyboard is a new show from Changelog that talks to creative professionals about how they do what they do, where they started, and how they deal with the things that make us all humans. As exciting as our work can sometimes be, we all face burnout, a lack of motivation, ... Show More
2m 32s
Mar 2022
Creating a culture of innovation (Practical AI #170)
Daniel and Chris talk with Lukas Egger, Head of Innovation Office and Strategic Projects at SAP Business Process Intelligence. Lukas describes what it takes to bring a culture of innovation into an organization, and how to infuse product development with that innovation culture. ... Show More
52m 4s
Oct 2023
AI's impact on developers (Practical AI #241)
Chris & Daniel are out this week, so we’re bringing you a panel discussion from All Things Open 2023 moderated by Jerod Santo (Practical AI producer and co-host of The Changelog) and featuring keynoters Emily Freeman and James Q Quick. Leave us a comment Changelog++ members save ... Show More
48m 24s
Mar 2020
What exactly is "data science" these days? (Practical AI #80)
Matt Brems from General Assembly joins us to explain what “data science” actually means these days and how that has changed over time. He also gives us some insight into how people are going about data science education, how AI fits into the data science workflow, and how to diff ... Show More
48m 40s
Feb 2019
With great power comes great responsibility (Changelog Interviews #334)
Adam and Jerod are joined by JS Party panelist Nick Nisi and #causeascene advocate Kim Crayton for a deep discussion on ethics in the technology industry at-large and our roles as software developers. If you’ve never heard Kim describe what life is like online for underrepresente ... Show More
1h 27m
Mar 2021
Green AI 🌲 (Practical AI #124)
Empirical analysis from Roy Schwartz (Hebrew University of Jerusalem) and Jesse Dodge (AI2) suggests the AI research community has paid relatively little attention to computational efficiency. A focus on accuracy rather than efficiency increases the carbon footprint of AI researc ... Show More
1 h
Sep 2023
Alexa Gets an AI Makeover
Alexa was due for an upgrade, and now it has gotten one. This week, Amazon held its annual media event where it debuted a slate of new hardware, software, and services. The company reserved the spot at center stage for Alexa, the voice assistant powering all of Amazon’s smart hom ... Show More
30m 12s