logo
episode-header-image
Mar 2020
49m 15s

Speech recognition to say it just right

Practical AI LLC
About this episode

Catherine Breslin of Cobalt joins Daniel and Chris to do a deep dive on speech recognition. She also discusses how the technology is integrated into virtual assistants (like Alexa) and is used in other non-assistant contexts (like transcription and captioning). Along the way, she teaches us how to assemble a lexicon, acoustic model, and language model to bring speech recognition to life.

Join the discussion

Changelog++ members support our work, get closer to the metal, and make the ads disappear. Join today!

Sponsors:

  • LinodeOur cloud of choice and the home of Changelog.com. Deploy a fast, efficient, native SSD cloud server for only $5/month. Get 4 months free using the code changelog2019 OR changelog2020. To learn more and get started head to linode.com/changelog.
  • AI Classroom – An immersive, 3 day virtual training in AI with Practical AI co-host Daniel Whitenack
  • FastlyOur bandwidth partner. Fastly powers fast, secure, and scalable digital experiences. Move beyond your content delivery network to their powerful edge cloud platform. Learn more at fastly.com.
  • RollbarWe move fast and fix things because of Rollbar. Resolve errors in minutes. Deploy with confidence. Learn more at rollbar.com/changelog.

Featuring:

Show Notes:

Something missing or broken? PRs welcome!

Up next
Nov 19
Beyond note-taking with Fireflies
<p>Fireflies CEO, Krish Ramineni shares how the company is transforming AI-powered note-taking into a deeper layer of knowledge automation. He breaks down the technology behind real-time functionality like Live Assist, the user behavior patterns driving product evolution, and how ... Show More
48m 59s
Nov 13
Autonomous Vehicle Research at Waymo
Waymo’s VP of Research, Drago Anguelov, joins Practical AI to explore how advances in autonomy, vision models, and large-scale testing are shaping the future of driverless technology. The conversation dives into the dual challenges of building an onboard driver and testing that d ... Show More
52m 8s
Nov 10
Are we in an AI bubble?
Dan and Chris unpack whether today’s surge in AI deployment across enterprise workflows, manufacturing, healthcare, and scientific research signals a lasting transformation or an overhyped bubble. Drawing parallels to the dot-com era, they explore how technology integration is re ... Show More
49m 41s
Recommended Episodes
Mar 2020
Speech recognition to say it just right (Practical AI #82)
Catherine Breslin of Cobalt joins Daniel and Chris to do a deep dive on speech recognition. She also discusses how the technology is integrated into virtual assistants (like Alexa) and is used in other non-assistant contexts (like transcription and captioning). Along the way, she ... Show More
49m 14s
Feb 2022
🌍 AI in Africa - Voice & language tools (Practical AI #167)
In the third of the “AI in Africa” spotlight episodes, we welcome Kathleen Siminyu, who is building Kiswahili voice tools at Mozilla. We had a great discussion with Kathleen about creating more diverse voice and language datasets, involving local language communities in NLP work, ... Show More
43m 37s
Jul 2018
Welcome to Away from Keyboard (Away from Keyboard #0)
Away from Keyboard is a new show from Changelog that talks to creative professionals about how they do what they do, where they started, and how they deal with the things that make us all humans. As exciting as our work can sometimes be, we all face burnout, a lack of motivation, ... Show More
2m 32s
Mar 2022
Creating a culture of innovation (Practical AI #170)
Daniel and Chris talk with Lukas Egger, Head of Innovation Office and Strategic Projects at SAP Business Process Intelligence. Lukas describes what it takes to bring a culture of innovation into an organization, and how to infuse product development with that innovation culture. ... Show More
52m 4s
Oct 2023
AI's impact on developers (Practical AI #241)
Chris & Daniel are out this week, so we're bringing you a panel discussion from All Things Open 2023 moderated by Jerod Santo (Practical AI producer and co-host of The Changelog) and featuring keynoters Emily Freeman and James Q Quick. 
48m 24s
Mar 2020
What exactly is "data science" these days? (Practical AI #80)
Matt Brems from General Assembly joins us to explain what "data science" actually means these days and how that has changed over time. He also gives us some insight into how people are going about data science education, how AI fits into the data science workflow, and how to diff ... Show More
48m 40s
Feb 2019
With great power comes great responsibility (Changelog Interviews #334)
Adam and Jerod are joined by JS Party panelist Nick Nisi and #causeascene advocate Kim Crayton for a deep discussion on ethics in the technology industry at-large and our roles as software developers. If you've never heard Kim describe what life is like online for underrepresente ... Show More
1h 27m
Mar 2021
Green AI 🌲 (Practical AI #124)
Empirical analysis from Roy Schwartz (Hebrew University of Jerusalem) and Jesse Dodge (AI2) suggests the AI research community has paid relatively little attention to computational efficiency. A focus on accuracy rather than efficiency increases the carbon footprint of AI researc ... Show More
1 h
Sep 2023
Alexa Gets an AI Makeover
<p>Alexa was due for an upgrade, and now it has gotten one. This week, Amazon held its annual media event where it debuted a slate of new hardware, software, and services. The company reserved the spot at center stage for Alexa, the voice assistant powering all of Amazon’s smart ... Show More
30m 12s