logo
episode-header-image
Yesterday
50m 16s

#281 Leon Song: The Research Driving Nex...

Craig S. Smith
About this episode

AGNTCY - Unlock agents at scale with an open Internet of Agents. Visit https://agntcy.org/ and add your support.


In this episode of Eye on AI, we sit down with Leon Song, VP of Research at Together AI, to explore how open-source models and cutting-edge infrastructure are reshaping the AI landscape. 

 

From speculative decoding to FlashAttention and RedPajama, Leon shares how Together AI is building one of the fastest, most cost-efficient AI clouds—helping enterprises fine-tune, deploy, and scale open-source models at the level of GPT-4 and beyond.

 

We dive into Leon’s journey from leading DeepSpeed and AI for Science at Microsoft to driving system-level innovation at Together AI.

Topics include:

  • The future of open-source vs. closed-source AI models

  • Breakthroughs in speculative decoding for faster inference

  • How Together AI’s cloud platform empowers enterprises with data sovereignty and model ownership

  • Why open-source models like DeepSeek R1 and Llama 4 are now rivaling proprietary systems

  • The role of GPUs vs. ASIC accelerators in scaling AI infrastructure

 

Whether you’re an AI researcher, enterprise leader, or curious about where generative AI is heading, this conversation reveals the technology and strategy behind one of the most important players in the open-source AI movement.

Stay Updated:

Craig Smith on X:https://x.com/craigss

Eye on A.I. on X: https://x.com/EyeOn_AI

Up next
Aug 17
#280 Aytekin Tank: How to Fully Automate Your Customer Service with AI Agents
This episode is brought to you by Extreme Networks, the company radically improving customer experiences with AI-powered automation for networking.Extreme is driving the convergence of AI, networking, and security to transform the way businesses connect and protect their networks ... Show More
45m 27s
Aug 14
#279 Matthew Carroll: Immuta’s Approach to Secure, Scalable Data Access in the Age of AI
Try OCI for free at http://oracle.com/eyeonai This episode is sponsored by Oracle. OCI is the next-generation cloud designed for every workload – where you can run any application, including any AI projects, faster and more securely for less. On average, OCI costs 50% less for co ... Show More
53m 48s
Aug 10
#278 Julia Peyre: How Schneider Electric is Pioneering Enterprise AI at Scale
AGNTCY - Unlock agents at scale with an open Internet of Agents. Visit https://agntcy.org/ and add your support. How does a 150,000-employee global leader make AI work at scale? In this episode of Eye on AI, host Craig Smith sits down with Julia Peyre, Head of AI Strategy & Innov ... Show More
55m 35s
Recommended Episodes
Aug 2024
809: Agentic AI, with Shingai Manjengwa
Agentic AI is revolutionizing the tech landscape, and Shingai Manjengwa from ChainML is here to tell us why. Discover how AI agents are becoming an integral part of our lives, automating tasks like travel bookings and daily inspiration. Shingai explains the power of multi-agent s ... Show More
1h 10m
Jul 8
How I'm Building a Zero-Employee Business with AI
Want to Automate your work with AI? Get the playbook here: https://clickhubspot.com/wgk Episode 66: Can you really build a zero-employee business with AI? Nathan Lands (https://x.com/NathanLands) sits down with John Rush (https://x.com/johnrushx), founder and self-proclaimed buil ... Show More
46 m
Aug 3
Where AI Is Right Now: 15 Charts in 15 Minutes
In today’s episode, we take a rapid-fire tour through 15(ish) charts that capture the current state of artificial intelligence across consumer use, enterprise adoption, agents, and infrastructure. From skyrocketing usage metrics and token consumption to the rise of agentic workfl ... Show More
22m 24s
Apr 2024
Episode 192 - Google Cloud Next 2024 Recap
Join Allen Firstenberg and guest host Stefania Pecore on Two Voice Devs as they delve into the exciting announcements and highlights from Google Cloud Next 2024! This episode focuses on the latest advancements in AI and their impact on the healthcare industry, providing valuable ... Show More
40m 35s
Apr 2024
Measuring The Speed of AI Through Benchmarks
David Kanter, Executive Director at MLCommons, discusses the work they’re doing with MLPerf Benchmarks, creating the world’s first industry standard approach to measuring AI speed and safety. He also shares ways they’re testing AI and LLMs for harm, to measure—and, over time, red ... Show More
31m 45s
Aug 22
Is Pixel 10 the AI Phone iPhone Never Was?
Google's Pixel 10 delivers the AI phone features Apple promised but never shipped. While Apple continues to struggle with delayed and underwhelming AI rollouts, Google has just launched its most AI-integrated smartphone yet, featuring Magic Q (an agentic assistant that searches t ... Show More
25m 48s
Aug 5
Everyone’s Using AI Wrong — Here’s the Real Opportunity
Want Nicholas' 3-step AI framework for businesses? get it here: https://clickhubspot.com/pge Episode 70: Is AI just a productivity booster, or are we missing the real transformation right in front of us? Matt Wolfe (https://x.com/mreflow) is joined by Nicholas Holland (https://x. ... Show More
47m 12s
Dec 2024
How Diamond Cooling Could Power the Future of AI, with Akash Systems
In this episode of No Priors, Sarah sits down with Felix Ejeckam and Ty Mitchell, founders of Akash Systems, a company pioneering diamond-based cooling technology for semiconductors used in space applications and large-scale AI data centers. Felix and Ty discuss how their backgro ... Show More
42m 21s
Nov 2024
Making Sense of Agentic AI | ThoughtWorks Birgitta Boeckeler
There’s AI agents. There’s AI tooling. Do either drive business impact or are they just more things your dev team is supposed to stay on top of? Birgitta Boeckeler, Global Lead for AI Assisted Software Delivery at ThoughtWorks, joins the show to discuss the practical applications ... Show More
47m 40s
Jun 25
The Role of AI in Electric Vehicle Charging
Diego Pareschi, Global Product Line Manager for EV chargers at ABB, discusses the rapidly evolving field of EV charging technology, focusing on the intersection of user experience, data analytics, and AI. He highlights the role of AI and machine learning in enhancing charging sol ... Show More
30m 4s