Mar 2022
44m 1s

Full-Stack AI Systems Development with M...

Sam Charrington
About this episode

Today we’re joined by Murali Akula, a Sr. Director of Software Engineering at Qualcomm. In our conversation with Murali, we explore his role at Qualcomm, where he leads the corporate research team focused on the development and deployment of AI onto Snapdragon chips, their unique definition of “full stack,” and how that philosophy permeates every step of the software development process. We explore the complexities that are unique to doing machine learning on resource-constrained devices, some of the techniques that are being applied to get complex models working on mobile devices, and the process for taking these models from research into real-world applications. We also discuss a few more tools and recent developments, including DONNA for neural architecture search; X-Distill, a method of improving the self-supervised training of monocular depth estimation; and the AI Model Efficiency Toolkit, a library that provides advanced quantization and compression techniques for trained neural network models.

The complete show notes for this episode can be found at twimlai.com/go/563

Up next
Oct 7
Recurrence and Attention for Long-Context Transformers with Jacob Buckman - #750
Today, we're joined by Jacob Buckman, co-founder and CEO of Manifest AI to discuss achieving long context in transformers. We discuss the bottlenecks of scaling context length and recent techniques to overcome them, including windowed attention, grouped query attention, and laten ...
57m 23s
Sep 30
The Decentralized Future of Private AI with Illia Polosukhin - #749
In this episode, Illia Polosukhin, a co-author of the seminal "Attention Is All You Need" paper and co-founder of Near AI, joins us to discuss his vision for building private, decentralized, and user-owned AI. Illia shares his unique journey from developing the Transformer archit ...
1h 5m
Sep 23
Inside Nano Banana 🍌 and the Future of Vision-Language Models with Oliver Wang - #748
Today, we’re joined by Oliver Wang, principal scientist at Google DeepMind and tech lead for Gemini 2.5 Flash Image—better known by its code name, “Nano Banana.” We dive into the development and capabilities of this newly released frontier vision-language model, beginning with th ...
1h 3m
Recommended Episodes
Aug 2020
Artificial Intelligence 2: Between Revolution and Resistance
This is the second part of our series on artificial intelligence. If you haven't listened to the first part, I recommend starting there. In this episode we talk about the uses and applications of machine learning and artificial intelligence, from self-driving cars, audio and video fabrication systems, and the Saher system for detecting traffic violations, to systems for predicting and anticipating ...
1h 19m
Mar 2023
AI’s Impact on Software Engineering: Killing Old Principles? (Ep. 220)
In this episode, we dive into the ways in which AI and machine learning are disrupting traditional software engineering principles. With the advent of automation and intelligent systems, developers are increasingly relying on algorithms to create efficient and effective code. How ...
13m 26s
Jul 2023
MosaicML's Naveen Rao on Making Custom LLMs More Accessible - Ep. 199
Startup MosaicML is on a mission to help the AI community enhance prediction accuracy, decrease costs, and save time by providing tools for easy training and deployment of large AI models. In this episode of NVIDIA's AI Podcast, host Noah Kravitz speaks with MosaicML CEO and co-f ...
31m 26s
May 2024
2882: From Chess Grandmaster to ML Innovator: Tal Shaked’s Journey
Are machines really capable of thinking like humans, or are we merely programming them to mimic our own patterns? Today on Tech Talks Daily, we delve into this intriguing question with Tal Shaked, an American chess grandmaster and Chief Machine Learning Fellow at Moloco, a leadin ...
29m 26s
Dec 2022
Hittin’ the Sim: NVIDIA’s Matt Cragun on Conditioning Autonomous Vehicles in Simulation - Ep. 185
Training, testing and validating autonomous vehicles requires a continuous pipeline — or data factory — to introduce new scenarios and refine deep neural networks. A key component of this process is simulation. AV developers can test a virtually limitless number of scenarios, rep ...
29m 13s
Apr 2023
The Power of Graph Neural Networks: Understanding the Future of AI - Part 1/2 (Ep.223)
In this episode, I explore the cutting-edge technology of graph neural networks (GNNs) and how they are revolutionizing the field of artificial intelligence. I break down the complex concepts behind GNNs and explain how they work by modeling the relationships between data points ...
27m 40s
Nov 2023
NVIDIA’s Annamalai Chockalingam on the Rise of LLMs - Ep. 206
Generative AI and large language models (LLMs) are stirring change across industries — but according to NVIDIA Senior Product Manager of Developer Marketing Annamalai Chockalingam, “we’re still in the early innings.” In the latest episode of NVIDIA’s AI Podcast, host Noah Kravitz ...
38m 32s
Aug 2023
AI Accelerated Engineering - Matthias Bauer | Podcast #103
🌍 Official Navasto Website: https://www.navasto.de/ 💌 My weekly science newsletter - https://jousef.substack.com/ Navasto has experience gained through many years of work in the industry and research landscape, in particular in the automobile and aerospace sectors. This enables ...
35m 51s