logo
episode-header-image
May 2023
26m 33s

Creating instruction tuned models (Pract...

CHANGELOG MEDIA
About this episode

At the recent ODSC East conference, Daniel got a chance to sit down with Erin Mikail Staples to discuss the process of gathering human feedback and creating an instruction tuned Large Language Models (LLM). They also chatted about the importance of open data and practical tooling for data annotation and fine-tuning. Do you want to create your own custom generative AI models? This is the episode for you!

Leave us a comment

Changelog++ members save 1 minute on this episode because they made the ads disappear. Join today!

Sponsors:

  • Fastly – Our bandwidth partner. Fastly powers fast, secure, and scalable digital experiences. Move beyond your content delivery network to their powerful edge cloud platform. Learn more at fastly.com
  • Fly.io – The home of Changelog.com — Deploy your apps and databases close to your users. In minutes you can run your Ruby, Go, Node, Deno, Python, or Elixir app (and databases!) all over the world. No ops required. Learn more at fly.io/changelog and check out the speedrun in their docs.
  • Typesense – Lightning fast, globally distributed Search-as-a-Service that runs in memory. You iterlly can’t get any faster!

Featuring:

Show Notes:

Something missing or broken? PRs welcome!

Timestamps:

(00:00) - Welcome to Practical AI
(00:43) - Erin Mikail Staples
(02:09) - Open source attendees
(03:54) - The key to RLHF
(05:35) - Tooling for RLHF
(07:33) - Humanities in data science
(11:22) - Label Studio's workflow
(15:41) - The open data ecosystem
(21:04) - Do data labeling
(22:33) - Exciting changes coming
(24:15) - DevRel(ish) and other resources
(25:13) - Goodbyes
(25:45) - Outro

Up next
Aug 22
Git with your friends (remastered) (Changelog & Friends #107)
Our Changelog & Friends proof-of-concept with Mat Ryer has been remastered! Now with full-length video on YouTube. Originally recorded: 2023-02-08 Mat joins us for some good conversation about some Git tooling that's been on our radar. We speculate, we discuss, we laugh, and Mat ... Show More
1h 47m
Aug 21
From Chef to System Initiative (remastered) (Changelog Interviews #655)
The epic show with Adam Jacob has been remastered! Now with full-length video on YouTube. Adam goes solo with Adam Jacob for an epic pod into his journey to get to System Initiative. From SysAdmin at 8 years old, to discovering Linux and working for Mom-and-pop ISPs, to open sour ... Show More
2h 22m
Aug 18
Cursor’s problem isn't just Cursor's problem (Changelog News #157)
Cursor has a big problem, Alireza Bashiri thinks plaintext beats todo apps, Manish built an offline AI workspace, OverType is a WYSIWYG markdown editor that's just a textarea, and sshrc lets you bring your config with you to remote machines. 
7m 37s
Recommended Episodes
Jan 2023
Machine learning at small organizations
Why is ML is so poorly adopted in small organizations (hint: it’s not because they don’t have enough data)? In this episode, Kirsten Lum from Storytellers shares the patterns she has seen in small orgs that lead to a successful ML practice. We discuss how the job of a ML Engineer ... Show More
49m 50s
Feb 2023
Is htmx the way to Go?
A quick look at the history of building web apps, followed by a discussion of htmx and how it compares to both modern and traditional ways of building. Discuss on Changelog News Changelog++ members save 4 minutes on this episode because they made the ads disappear. Join today! Sp ... Show More
1h 16m
Apr 2024
Mamba & Jamba
First there was Mamba… now there is Jamba from AI21. This is a model that combines the best non-transformer goodness of Mamba with good ‘ol attention layers. This results in a highly performant and efficient model that AI21 has open sourced! We hear all about it (along with a var ... Show More
41m 13s
May 2024
Microsoft is all-in on AI: Part 1 (Interview)
Scott Guthrie joins the show this week from Microsoft Build 2024 to discuss Microsoft being all-in on AI. From Copilot, to Azure AI and Prompty, to their developer first focus, leading GitHub, VS Code being the long bet that paid off, to the future of a doctor’s bedside manner as ... Show More
1h 4m
Mar 2024
AI vs software devs
Daniel and Chris are out this week, so we’re bringing you conversations all about AI’s complicated relationship to software developers from other Changelog pods: JS Party, Go Time & The Changelog.Join the discussionChangelog++ members save 2 minutes on this episode because they m ... Show More
57 m
Jun 2023
Digging through Nick Nisi’s tool box
KBall interviews Nick Nisi about the Pandora’s box that is his tooling/developer setup. Starting at the lowest layer of the terminal emulator he uses, they move upwards into command line tools, into Tmux (terminals within terminals!), his epic NeoVim configuration, and finally in ... Show More
1 h
Jun 2024
Rise of the AI PC & local LLMs
We’ve seen a rise in interest recently and a number of major announcements related to local LLMs and AI PCs. NVIDIA, Apple, and Intel are getting into this along with models like the Phi family from Microsoft. In this episode, we dig into local AI tooling, frameworks, and optimiz ... Show More
35m 35s
May 2023
Chanuki Seresinhe - Head of Data Science at Zoopla - Generative AI & AI for happiness #33
Our guest today is Chanuki Seresinhe, head of Data Science at Zoopla,  a company which provides millions of users with access to properties for sale and for rent. In our conversation, we first talk about Chanuki’s PhD where she used machine learning to identify relationships betw ... Show More
57m 13s
Mar 2023
AI and Coding with ChatGPT
In this episode of Syntax, Wes and Scott talk about the current landscape of AI, how AI is trained, is AI going to take your job, who’s going to train AI, and adding AI to your applications. Sentry - Sponsor If you want to know what’s happening with your code, track errors and mo ... Show More
1h 6m
May 2024
Is it too late to opt out of AI? (Friends)
Tech lawyer Luis Villa returns to answer our most pressing questions: what’s up with all these new content deals? How did Google think it was a good idea to ship AI Summaries in its current state? Is it too late to opt out of AI? We also discuss AI in Hollywood (spoilers!), posit ... Show More
1h 39m