logo
episode-header-image
Aug 2024
39m 18s

Fireworks Founder Lin Qiao on How Fast I...

Sequoia Capital
About this episode

In the first wave of the generative AI revolution, startups and enterprises built on top of the best closed-source models available, mostly from OpenAI. The AI customer journey moves from training to inference, and as these first products find PMF, many are hitting a wall on latency and cost.


Fireworks Founder and CEO Lin Qiao led the PyTorch team at Meta that rebuilt the whole stack to meet the complex needs of the world’s largest B2C company. Meta moved PyTorch to its own non-profit foundation in 2022 and Lin started Fireworks with the mission to compress the timeframe of training and inference and democratize access to GenAI beyond the hyperscalers to let a diversity of AI applications thrive.


Lin predicts when open and closed source models will converge and reveals her goal to build simple API access to the totality of knowledge.


Hosted by: Sonya Huang and Pat Grady, Sequoia Capital 


Mentioned in this episode:

  • Pytorch: the leading framework for building deep learning models, originated at Meta and now part of the Linux Foundation umbrella
  • Caffe2 and ONNX: ML frameworks Meta used that PyTorch eventually replaced
  • Conservation of complexity: the idea that that every computer application has inherent complexity that cannot be reduced but merely moved between the backend and frontend, originated by Xerox PARC researcher Larry Tesler 
  • Mixture of Experts: a class of transformer models that route requests between different subsets of a model based on use case
  • Fathom: a product the Fireworks team uses for video conference summarization 
  • LMSYS Chatbot Arena: crowdsourced open platform for LLM evals hosted on Hugging Face


 00:00 - Introduction

02:01 - What is Fireworks?

02:48 - Leading Pytorch

05:01 - What do researchers like about PyTorch?

07:50 - How Fireworks compares to open source

10:38 - Simplicity scales

12:51 - From training to inference

17:46 - Will open and closed source converge?

22:18 - Can you match OpenAI on the Fireworks stack?

26:53 - What is your vision for the Fireworks platform?

31:17 - Competition for Nvidia?

32:47 - Are returns to scale starting to slow down?

34:28 - Competition

36:32 - Lightning round

Up next
Jul 8
Mapping the Mind of a Neural Net: Goodfire’s Eric Ho on the Future of Interpretability
Eric Ho is building Goodfire to solve one of AI’s most critical challenges: understanding what’s actually happening inside neural networks. His team is developing techniques to understand, audit and edit neural networks at the feature level. Eric discusses breakthrough results in ... Show More
47m 7s
Jul 1
ElevenLabs’ Mati Staniszewski: Why Voice Will Be the Fundamental Interface for Tech
Mati Staniszewski, co-founder and CEO of ElevenLabs, explains how staying laser-focused on audio innovation has allowed his company to thrive despite the push into multimodality from foundation models. From a high school friendship in Poland to building one of the fastest-growing ... Show More
59m 53s
Jun 24
From DevOps ‘Heart Attacks’ to AI-Powered Diagnostics With Traversal’s AI Agents
Anish Agarwal and Raj Agrawal, co-founders of Traversal, are transforming how enterprises handle critical system failures. Their AI agents can perform root cause analysis in 2-4 minutes instead of the hours typically spent by teams of engineers scrambling in Slack channels. Drawi ... Show More
40m 32s
Recommended Episodes
Oct 2024
Building the Open Source AI Revolution (with Hugging Face CEO, Clem Delangue)
We sit down with Hugging Face CEO Clem Delangue to understand the current state of the open source AI ecosystem. Hugging Face is the leading platform to host and collaborate on AI models, datasets, and applications. They also have a compute offering for AI builders to train their ... Show More
1h 8m
Jul 2024
#229 Inside Meta's Biggest and Best Open-Source AI Model Yet with Thomas Scialom, Co-Creator of Llama3
Meta has been at the absolute edge of the open-source AI ecosystem, and with the recent release of Llama 3.1, they have officially created the largest open-source model to date. So, what's the secret behind the performance gains of Llama 3.1? What will the future of open-source A ... Show More
39m 23s
May 1
The rise of Cursor: The $300M ARR AI tool that engineers can’t stop using | Michael Truell (co-founder and CEO)
Michael Truell is the co-founder and CEO of Anysphere, the company behind Cursor—the fastest-growing AI code editor in the world, reaching $300 million in annual recurring revenue just two years after its launch. In this conversation, Michael shares his vision for the future, les ... Show More
1h 11m
Nov 2024
Building an AI creator community w/ Civitai founders Justin Maier and Maxfield Hulker
Ever since generative AI tools like Midjourney became available to the public in 2022, curious users and AI fanatics alike have been experimenting with the technology. But for tech aficionados and AI enthusiasts like Justin Maier and Maxfield Hulker, Midjourney’s closed-source mo ... Show More
49m 45s
Oct 2024
Why climate tech startups get this one thing wrong
This might be our wonkiest topic yet: Techno-economic analysis, or TEA. Before a startup proves its technology is commercially viable, it models how a technology would work. These TEAs include things like assumptions about inputs, prices, and market landscape. They help investors ... Show More
49m 38s
Feb 2025
Scaling AI: Building the Right AI Team
You’re smart. You know your business. But do you know how to build the right AI team? It’s harder than it looks, and the old playbook won’t cut it. In this episode, host Courtney Baker is joined by CEO David DeWolf, Chief Product & Technology Officer Mohan Rao, and NordLight CEO ... Show More
33m 21s
Jan 2025
What you need to know about DeepSeek and the AI race
Today, we’re diving into a listener’s question about the new artificial intelligence chatbot on the scene. Chinese start-up DeepSeek’s AI model is said to be more cost-effective, less complex, and in some ways, just plain better than OpenAI’s ChatGPT. We’ll explain why the stock ... Show More
12 m
Feb 2025
Vercel’s Developer Frameworks with Ary Khandelwal and Max Leiter
The availability of high-quality AI model APIs has drastically lowered the barriers developing AI applications. These tools abstract away complex tasks such as model deployment, scaling, data retrieval, natural language processing, and text generation. Vercel has developed a comp ... Show More
52m 25s
May 2023
TinyML: Bringing machine learning to the edge
When we think about machine learning today we often think in terms of immense scale — large language models that require huge amounts of computational power, for example. But one of the most interesting innovations in machine learning right now is actually happening on a really s ... Show More
45m 45s
Jan 2025
Inside Gong: How teams work with design partners, their pod structure, autonomy, trust, and more | Eilon Reshef (co-founder and CPO)
Eilon Reshef is the co-founder and chief product officer at Gong, one of the most ubiquitous B2B products in the world. In our conversation, we discuss:• Gong’s unique approach to working with design partners• Their unique pod model• Why Eilon makes big decisions quickly• Lessons ... Show More
56m 42s