Apr 16
How Capital One Delivers Multi-Agent Systems with Rashmi Shetty - #765
In this episode, Rashmi Shetty, senior director of enterprise generative AI platform at Capital One, joins us to explore how the company is designing, deploying, and scaling multi-agent systems in a highly regulated environment. Rashmi walks us through Chat Concierge, a multi-age ... Show More
54m 18s
Mar 26
The Race to Production-Grade Diffusion LLMs with Stefano Ermon - #764
Today, we're joined by Stefano Ermon, associate professor at Stanford University and CEO of Inception Labs to discuss diffusion language models. We dig into how diffusion approaches—traditionally used for images—are being adapted for text and code generation, the technical challe ... Show More
1h 3m
Mar 10
Agent Swarms and Knowledge Graphs for Autonomous Software Development with Siddhant Pardeshi - #763
In this episode, Sid Pardeshi, co-founder and CTO of Blitzy, joins us to discuss building autonomous development systems able to deliver production-ready software at enterprise scale. Sid contrasts AI-assisted coding with end-to-end autonomy, arguing that “code is a commodity” an ... Show More
1h 16m
Oct 2017
Data science tools and other announcements from Ignite
<p>In this episode, Microsoft's Corporate Vice President for Cloud Artificial Intelligence, Joseph Sirosh, joins host Kyle Polich to share some of the Microsoft's latest and most exciting innovations in AI development platforms. Last month, Microsoft launched a set of three power ... Show More
31m 40s
Oct 2024
#692: A Discussion About Serverless and How to Make the Most of It
Simon is joined by Stephen Liedig to discuss the evolution of serverless technology and its impact on application development, exploring benefits like scalability, cost optimization, and faster dev cycles. They delve into key services and concepts in serverless design, including ... Show More
35m 28s
Dec 2024
AI Semiconductor Landscape feat. Dylan Patel | BG2 w/ Bill Gurley & Brad Gerstner
<p>Open Source bi-weekly convo w/ Bill Gurley and Brad Gerstner on all things tech, markets, investing & capitalism. This week they are joined by Dylan Patel, Founder & Chief Analyst at SemiAnalysis, to discuss origins of SemiAnalysis, Google's AI workload, NVIDIA' ... Show More
1h 29m
Today, we're joined by Ron Diamant, chief architect for Trainium at Amazon Web Services, to discuss hardware acceleration for generative AI and the design and role of the recently released Trainium2 chip. We explore the architectural differences between Trainium and GPUs, highlighting its systolic array-based compute design, and how it balances performance a ... Show More
<p dir="ltr">This episode is sponsored by Netsuite by Oracle, the number one cloud financial system, streamlining accounting, financial management, inventory, HR, and more.</p> <p><strong> </strong></p> <p dir="ltr">NetSuite is offering a one-of-a-kind flexible financing program. ... Show More
<p>In episode 66 of The Gradient Podcast, <a target="_blank" href="https://twitter.com/spaniel_bashir">Daniel Bashir</a> speaks to <a target="_blank" href="https://twitter.com/soumithchintala?s=20">Soumith Chintala</a>.</p><p>Soumith is a Research Engineer at Meta AI Research in ... Show More