Nov 2024
27m 44s

Episode 215 - Unlock Cross-Platform Mach...

Mark and Allen
About this episode

Tired of wrestling with platform-specific machine learning model formats? Join Allen Firstenberg and Mark Tucker on Two Voice Devs as they explore ONNX (Open Neural Network Exchange), a game-changing open format built to streamline your ML model deployment workflow. Discover how ONNX empowers you to train models in your preferred framework (PyTorch, TensorFlow, scikit-learn, etc.) and seamlessly execute them across diverse platforms (Windows, Mac, Linux, iOS, Android, Web) using the efficient ONNX Runtime.


In this episode, we delve into:


[00:00:00] Introduction: A warm welcome and a quick overview of the show's agenda.

[00:01:18] What is ONNX?: Unraveling the mysteries of ONNX and its purpose in the ML ecosystem.

[00:02:38] Model Preparation: Understanding how to prepare models for ONNX conversion and the concept of inference.

[00:04:05] Hugging Face Example: A practical demonstration of a BERT model in ONNX format on Hugging Face.

[00:06:00] The Developer's Perspective: Why ONNX matters for developers building applications that leverage ML models.

[00:07:24] ONNX Optimization: How ONNX optimizes models for inference and the trade-offs involved.

[00:08:56] The Cross-Platform Advantage: Breaking free from framework limitations and enabling deployment flexibility.

[00:11:19] ONNX Runtime Introduction: Exploring the ONNX Runtime and its support for various languages and platforms.

[00:14:04] ONNX Runtime Deep Dive: A closer look at the ONNX Runtime website and its features.

[00:15:45] ONNX for Mobile and Web: Extending ONNX's reach to mobile devices and web browsers.

[00:16:56] Conversion Process: Learn how to convert models from different formats to ONNX.

[00:18:08] Performance Considerations: Addressing concerns about performance and speed in ONNX.

[00:19:58] Code Examples: Practical code snippets demonstrating ONNX Runtime usage in JavaScript, Python, and C#.

[00:23:23] ONNX and MLOps: Integrating ONNX into your MLOps pipeline for seamless deployment.

[00:23:42] Netron Tool Introduction: Visualizing ONNX models using the Netron tool.


Whether you're a seasoned data scientist or a developer just beginning your ML journey, this episode provides valuable insights into leveraging ONNX for efficient and cross-platform model deployment. Share your experiences and questions in the comments below!


Thumbnail by Imagen 3 with prompt:

Cartoon ink and paint, with a touch of tech.

Scene: Two podcast hosts, sitting in front of microphones, smiling and engaging in conversation. Both hosts are male, caucasian, software developers in their early 50s, wearing glasses, and are clean shaven. The host on the left is wearing an olive t-shirt and a brown flat cap. The host on the right is wearing a light blue polo shirt. Warm, inviting lighting.

Background: A polished, dark, onyx gemstone, reflecting light and giving it depth. The gemstone facets should subtly reflect stylized icons of different operating systems (Windows logo, Apple logo, Android logo, a cloud icon), hinting at cross-platform compatibility. Dark, sleek, and mysterious, with the onyx stone as the centerpiece. The reflected platform icons should be subtle and not overly distracting. The overall impression should be one of sophisticated power and hidden potential, alluding to the capabilities of ONNX.

Negative prompt: beards


#ONNX #MachineLearning #ML #AI #ArtificialIntelligence #DeepLearning #ModelDeployment #CrossPlatform #PyTorch #TensorFlow #ScikitLearn #MLOps #SoftwareDevelopment #WebDevelopment #MobileDevelopment #JavaScript #Python #CSharp #HuggingFace #Netron
