logo
episode-header-image
Sep 2024
29m 23s

Looking under the hood of multimodal AI

The Stack Overflow Podcast
About this episode

Multimodal AI combines different modalities—audio, video, text, etc.—to enable more humanlike engagement and higher-quality responses from the AI model. 

WebRTC is a free, open-source project that allows developers to add real-time communication capabilities that work on top of an open standard to their applications. It supports video, voice, and generic data.

LiveKit is an open-source project that provides scalable, multi-user conferencing based on WebRTC. It’s designed to provide everything developers need to build real-time voice and video applications. Check them out on GitHub.

Connect with Russ on LinkedIn or X and explore his posts on the LiveKit blog.

Stack Overflow user Kristi Jorgji threw inquiring minds a lifejacket (badge) by answering their own question: Error trying to import dump from mysql 5.7 into 8.0.23.

Up next
Today
Svelte was built on “slinging code for the sheer love of it”
Rich Harris, creator of Svelte and software engineer at Vercel, joins Ryan on the show to dive into the evolution and future of web frameworks. They discuss the birth and growth of Svelte during the rise of mobile, the challenges of building robust and efficient web applications, ... Show More
35m 9s
Aug 22
Robots in the skies (and they use Transformer models)
Ryan welcomes Nathan Michael, CTO at Shield AI, to discuss what AI looks like in defense technologies, both technically and ethically. They cover how the Hivemind technology works in coordinating the autonomous decisions of drones in the field while keeping humans in the loop, wh ... Show More
26m 50s
Aug 22
Learning in the flow: Unlocking employee potential through continuous learning
In this episode of Leaders of Code, Stack Overflow CEO Prashanth Chandrasekar and Christina Dacauaziliqua, Senior Learning Specialist at Morgan Stanley, talk about the importance of experiential learning in fast-paced environments. They emphasize the value of creating intentional ... Show More
33m 1s
Recommended Episodes
Sep 2024
Pausing to think about scikit-learn & OpenAI o1
Recently the company stewarding the open source library scikit-learn announced their seed funding. Also, OpenAI released “o1” with new behavior in which it pauses to “think” about complex tasks. Chris and Daniel take some time to do their own thinking about o1 and the contrast to ... Show More
50m 10s
Mar 2024
Open sourcing AI app development with Harrison Chase from LangChain
Companies are employing AI agents and co-pilots to help their teams increase efficiency and accuracy, but developing apps that are trained properly can require a skill set many enterprise teams don’t have. This week on No Priors, Sarah and Elad are joined by Harrison Chase, the C ... Show More
27m 32s
Nov 2024
Build An App with a Backend Using Ai in 20 min (Cursor Ai, Replit, Firebase, Wispr Flow)
Episode 32: How can you build an app with a backend using AI in just 20 minutes? Matt Wolfe (https://x.com/mreflow) and Nathan Lands (https://x.com/NathanLands) sit down with AI enthusiast Riley Brown (https://x.com/rileybrown_ai) to explore this exciting and challenging process. ... Show More
39m 34s
Jun 2024
#218 Designing AI Applications with Robb Wilson, Co-Founder & CEO at Onereach.ai
All the hype around generative AI means that every software maker seems to be stuffing chat interfaces into their products whenever they can. For the most part, the jury is still out on whether this is a good idea or not. However, design goes deeper than just the user interface, ... Show More
46m 36s
Aug 2023
AI Superpowers for Frontend Developers, with Vercel Founder/CEO Guillermo Rauch
Everything digital is increasingly intermediated through web user experiences, and now AI development can be frontend-first, too. Just ask Guillermo Rauch, the founder and CEO of Vercel, the company behind Next.js. In this episode of No Priors, hosts Sarah Guo and Elad Gil speak ... Show More
38m 13s
Nov 2024
Building an AI creator community w/ Civitai founders Justin Maier and Maxfield Hulker
Ever since generative AI tools like Midjourney became available to the public in 2022, curious users and AI fanatics alike have been experimenting with the technology. But for tech aficionados and AI enthusiasts like Justin Maier and Maxfield Hulker, Midjourney’s closed-source mo ... Show More
49m 45s
Nov 2024
Making Sense of Agentic AI | ThoughtWorks Birgitta Boeckeler
There’s AI agents. There’s AI tooling. Do either drive business impact or are they just more things your dev team is supposed to stay on top of? Birgitta Boeckeler, Global Lead for AI Assisted Software Delivery at ThoughtWorks, joins the show to discuss the practical applications ... Show More
47m 40s
Feb 2025
LangChain and Agentic AI Engineering with Erick Friis
LangChain is a popular open-source framework to build applications that integrate LLMs with external data sources like APIs, databases, or custom knowledge bases. It’s commonly used for chatbots, question-answering systems, and workflow automation. Its flexibility and extensibili ... Show More
41m 50s
Nov 2024
scikit-learn & data science you own
We are at GenAI saturation, so let’s talk about scikit-learn, a long time favorite for data scientists building classifiers, time series analyzers, dimensionality reducers, and more! Scikit-learn is deployed across industry and driving a significant portion of the “AI” that is ac ... Show More
52m 2s
Dec 2024
AI Voice Technology Just Got INSANE (ElevenLabs GenFM Demo + More)
Episode 38: How revolutionary is the latest in AI voice technology? Matt Wolfe (https://x.com/mreflow) and Nathan Lands (https://x.com/NathanLands) dive deep into this topic with Ammaar Reshi (https://x.com/ammaar), head of design at ElevenLabs and AI enthusiast who has made wave ... Show More
39m 33s