Today, we're joined by Ron Diamant, chief architect for Trainium at Amazon Web Services, to discuss hardware acceleration for generative AI and the design and role of the recently released Trainium2 chip. We explore the architectural differences between Trainium and GPUs, highlighting its systolic array-based compute design, and how it balances performance a ... Show More
Jun 16
Why AI Agents Break the GenAI Security Model with Devvret Rishi - #770
In this episode, Sam talks with Dev Rishi, GM of AI at Rubrik, about what happens when agents move beyond answering questions and start taking action across tools, systems, and business processes. We explore why the enterprise playbook of static guardrails plus human approval sta ... Show More
56m 18s
Jun 9
Is RAG Dead? Lessons from Building AI for Tax Law with Alex Bowcut - #769
As context windows grow into the millions of tokens, many AI practitioners are questioning whether retrieval-augmented generation (RAG) is still necessary. If modern models can ingest entire libraries of documents, why bother with retrieval at all? In this episode, Alex Bowcut, H ... Show More
51m 32s
May 21
Relational Foundation Models for Enterprise Data with Jure Leskovec - #768
In this episode, Jure Leskovec, co-founder and chief scientist at Kumo and professor of computer science at Stanford, joins us to explore two fronts of his work: AI for science and relational deep learning. We begin with AI Virtual Cell, a multiscale effort to learn data-driven r ... Show More
1h 6m
Oct 2017
Data science tools and other announcements from Ignite
<p>In this episode, Microsoft's Corporate Vice President for Cloud Artificial Intelligence, Joseph Sirosh, joins host Kyle Polich to share some of the Microsoft's latest and most exciting innovations in AI development platforms. Last month, Microsoft launched a set of three power ... Show More
31m 40s
Oct 2024
#692: A Discussion About Serverless and How to Make the Most of It
Simon is joined by Stephen Liedig to discuss the evolution of serverless technology and its impact on application development, exploring benefits like scalability, cost optimization, and faster dev cycles. They delve into key services and concepts in serverless design, including ... Show More
35m 28s
Dec 2024
AI Semiconductor Landscape feat. Dylan Patel | BG2 w/ Bill Gurley & Brad Gerstner
<p>Open Source bi-weekly convo w/ Bill Gurley and Brad Gerstner on all things tech, markets, investing & capitalism. This week they are joined by Dylan Patel, Founder & Chief Analyst at SemiAnalysis, to discuss origins of SemiAnalysis, Google's AI workload, NVIDIA' ... Show More
1h 29m
<p dir="ltr">This episode is sponsored by Netsuite by Oracle, the number one cloud financial system, streamlining accounting, financial management, inventory, HR, and more.</p> <p><strong> </strong></p> <p dir="ltr">NetSuite is offering a one-of-a-kind flexible financing program. ... Show More
<p>In episode 66 of The Gradient Podcast, <a target="_blank" href="https://twitter.com/spaniel_bashir">Daniel Bashir</a> speaks to <a target="_blank" href="https://twitter.com/soumithchintala?s=20">Soumith Chintala</a>.</p><p>Soumith is a Research Engineer at Meta AI Research in ... Show More