903: LLM Benchmarks Are Lying to You (And What to Do Instead), with Sinan Ozdemir | Listen on Anghami

Jul 31

1014: OpenAI Agent Breaches Hugging Face: All You Must Know incl. How to Protect Yourself

In Episode #1014, Jon Krohn breaks down a security incident that reads like science fiction: during an internal evaluation, an autonomous OpenAI agent broke out of its sandbox, exploited a zero-day, and hacked its way into Hugging Face to steal the answers to the very benchmark i ...

20m 57s

Jul 28

1013: Weapons of Math Destruction, Ten Years On, with Dr. Cathy O’Neil

In Episode #1013, Dr. Cathy O'Neil (Harvard math PhD, former Wall Street quant and author of the mega-bestseller Weapons of Math Destruction) joins Jon Krohn to explain what actually makes an algorithm terrifying: not the complexity of the math, but the secrecy, the unaccountabil ...

1h 24m

Jul 24

1012: The Open-Weight 2.8-Trillion Parameter Competing at the Frontier

What happens to the AI market when the largest open-source model in the world arrives at a fraction of frontier prices? In this week’s episode, host Jon Krohn digs into Kimi K3, the 2.8-trillion-parameter release from Beijing-based Moonshot AI that, in the space of a single week, ...

12m 13s

Nov 2024

The Future of AI: Predictions and Realities

In this episode, Jaeden Schafer discusses the current challenges and developments in the AI industry, particularly focusing on the limitations faced by major players like OpenAI and Anthropic. The conversation explores the anticipated improvements in AI models, the predictions fo ...

19m 52s

Jan 2024

Why AI Should Be Taught to Know Its Limits

One of AI’s biggest unsolved problems is what the advanced algorithms should do when they confront a situation they don’t have an answer for. For programs like ChatGPT, that could mean providing a confidently wrong answer, what’s often called a “hallucination”; for others, as w ...

14m 58s

Jul 2022

Why Artificial Intelligence Projects Fail?

Podcast with Gautam Siwach and Jin Vanstee. Speaker: Elpida Tzortzatos is an IBM Fellow and CTO AI on IBM zSystems. In this podcast we listen to Elpida's thoughts about driving artificial intelligence strategies, associated risks, and values. We will learn how to drive industry- ...

10m 50s

Mar 2023

#312 — The Trouble with AI

Sam Harris speaks with Stuart Russell and Gary Marcus about recent developments in artificial intelligence and the long-term risks of producing artificial general intelligence (AGI). They discuss the limitations of Deep Learning, the surprising power of narrow AI, Ch ...

1h 26m

Oct 2024

What Big Tech Isn’t Telling You About AI (Ep. 267)

Are AI giants really building trustworthy systems? A groundbreaking transparency report by Stanford, MIT, and Princeton says no. In this episode, we expose the shocking lack of transparency in AI development and how it impacts bias, safety, and trust in the technology. We’ll b ...

19m 15s

Jan 2024

Careers, Skills, and the Evolution of AI (Ep. 248)

!!WARNING!! Due to some technical issues, the volume is not always constant during the show. I sincerely apologise for any inconvenience. (Francesco) In this episode, I speak with Richie Cotton, Data Evangelist at DataCamp, as he delves into the d ...

32m 27s

Feb 2025

Industry Roundup #3: The Rise of Reasoning LLMs, OpenAI Operator, Project Stargate, and Gemini’s Struggle for Recognition

Welcome to DataFramed Industry Roundups! In this series of episodes, Adel & Richie sit down to discuss the latest and greatest in data & AI. In this episode, we discuss the rise of reasoning LLMs like DeepSeek R1 and the competition shaping the AI space, OpenAI’s Operator and the ...

29m 46s
