logo
episode-header-image
May 2023
26m 33s

Creating instruction tuned models (Pract...

CHANGELOG MEDIA
About this episode

At the recent ODSC East conference, Daniel got a chance to sit down with Erin Mikail Staples to discuss the process of gathering human feedback and creating an instruction tuned Large Language Models (LLM). They also chatted about the importance of open data and practical tooling for data annotation and fine-tuning. Do you want to create your own custom generative AI models? This is the episode for you!

Leave us a comment

Changelog++ members save 1 minute on this episode because they made the ads disappear. Join today!

Sponsors:

  • Fastly – Our bandwidth partner. Fastly powers fast, secure, and scalable digital experiences. Move beyond your content delivery network to their powerful edge cloud platform. Learn more at fastly.com
  • Fly.io – The home of Changelog.com — Deploy your apps and databases close to your users. In minutes you can run your Ruby, Go, Node, Deno, Python, or Elixir app (and databases!) all over the world. No ops required. Learn more at fly.io/changelog and check out the speedrun in their docs.
  • Typesense – Lightning fast, globally distributed Search-as-a-Service that runs in memory. You iterlly can’t get any faster!

Featuring:

Show Notes:

Something missing or broken? PRs welcome!

Timestamps:

(00:00) - Welcome to Practical AI
(00:43) - Erin Mikail Staples
(02:09) - Open source attendees
(03:54) - The key to RLHF
(05:35) - Tooling for RLHF
(07:33) - Humanities in data science
(11:22) - Label Studio's workflow
(15:41) - The open data ecosystem
(21:04) - Do data labeling
(22:33) - Exciting changes coming
(24:15) - DevRel(ish) and other resources
(25:13) - Goodbyes
(25:45) - Outro

Up next
Oct 8
Vite documentary companion pod (Changelog Interviews #661)
Our friends at Cult.Repo launch their epic Vite documentary on October 9th, 2025! To celebrate, Jerod sat down with Evan You to discuss Vite's adoption story, why he raised money to start VoidZero, how developer documentaries get made, open source sustainability, and more. 
1h 12m
Oct 6
The best coders should exit the feed (Changelog News #164)
Abner Coimbre makes a compelling case why our biggest technical talent should abandon for-profit social platforms, Noah Brier creates a Claude Code and Obsidian starter kit, Bharath Natarajan documents the Vercel vs Cloudflare fight, Toolbrew is a well-designed website brimming w ... Show More
7m 42s
Oct 3
npm under siege (what to do about it) (Changelog & Friends #111)
Over the past two months, we’ve seen some of the most serious supply chain attacks in npm history: phishing campaigns, maintainer account takeovers, and malware published to packages with billions of weekly downloads. What is going on?! What can we do about it? Our old friend, Fe ... Show More
1h 35m
Recommended Episodes
Jan 2023
Machine learning at small organizations
Why is ML is so poorly adopted in small organizations (hint: it’s not because they don’t have enough data)? In this episode, Kirsten Lum from Storytellers shares the patterns she has seen in small orgs that lead to a successful ML practice. We discuss how the job of a ML Engineer ... Show More
49m 50s
Feb 2023
Is htmx the way to Go?
A quick look at the history of building web apps, followed by a discussion of htmx and how it compares to both modern and traditional ways of building. Discuss on Changelog News Changelog++ members save 4 minutes on this episode because they made the ads disappear. Join today! Sp ... Show More
1h 16m
Apr 2024
Mamba & Jamba
First there was Mamba… now there is Jamba from AI21. This is a model that combines the best non-transformer goodness of Mamba with good ‘ol attention layers. This results in a highly performant and efficient model that AI21 has open sourced! We hear all about it (along with a var ... Show More
41m 13s
May 2024
Microsoft is all-in on AI: Part 1 (Interview)
Scott Guthrie joins the show this week from Microsoft Build 2024 to discuss Microsoft being all-in on AI. From Copilot, to Azure AI and Prompty, to their developer first focus, leading GitHub, VS Code being the long bet that paid off, to the future of a doctor’s bedside manner as ... Show More
1h 4m
Mar 2024
AI vs software devs
Daniel and Chris are out this week, so we’re bringing you conversations all about AI’s complicated relationship to software developers from other Changelog pods: JS Party, Go Time & The Changelog.Join the discussionChangelog++ members save 2 minutes on this episode because they m ... Show More
57 m
Jun 2023
Digging through Nick Nisi’s tool box
KBall interviews Nick Nisi about the Pandora’s box that is his tooling/developer setup. Starting at the lowest layer of the terminal emulator he uses, they move upwards into command line tools, into Tmux (terminals within terminals!), his epic NeoVim configuration, and finally in ... Show More
1 h
Jun 2024
Rise of the AI PC & local LLMs
We’ve seen a rise in interest recently and a number of major announcements related to local LLMs and AI PCs. NVIDIA, Apple, and Intel are getting into this along with models like the Phi family from Microsoft. In this episode, we dig into local AI tooling, frameworks, and optimiz ... Show More
35m 35s
May 2023
Chanuki Seresinhe - Head of Data Science at Zoopla - Generative AI & AI for happiness #33
Our guest today is Chanuki Seresinhe, head of Data Science at Zoopla,  a company which provides millions of users with access to properties for sale and for rent. In our conversation, we first talk about Chanuki’s PhD where she used machine learning to identify relationships betw ... Show More
57m 13s
Mar 2023
AI and Coding with ChatGPT
In this episode of Syntax, Wes and Scott talk about the current landscape of AI, how AI is trained, is AI going to take your job, who’s going to train AI, and adding AI to your applications. Sentry - Sponsor If you want to know what’s happening with your code, track errors and mo ... Show More
1h 6m
May 2024
Is it too late to opt out of AI? (Friends)
Tech lawyer Luis Villa returns to answer our most pressing questions: what’s up with all these new content deals? How did Google think it was a good idea to ship AI Summaries in its current state? Is it too late to opt out of AI? We also discuss AI in Hollywood (spoilers!), posit ... Show More
1h 39m