logo
episode-header-image
May 2023
26m 33s

Creating instruction tuned models (Pract...

CHANGELOG MEDIA
About this episode

At the recent ODSC East conference, Daniel got a chance to sit down with Erin Mikail Staples to discuss the process of gathering human feedback and creating an instruction tuned Large Language Models (LLM). They also chatted about the importance of open data and practical tooling for data annotation and fine-tuning. Do you want to create your own custom generative AI models? This is the episode for you!

Leave us a comment

Changelog++ members save 1 minute on this episode because they made the ads disappear. Join today!

Sponsors:

  • Fastly – Our bandwidth partner. Fastly powers fast, secure, and scalable digital experiences. Move beyond your content delivery network to their powerful edge cloud platform. Learn more at fastly.com
  • Fly.io – The home of Changelog.com — Deploy your apps and databases close to your users. In minutes you can run your Ruby, Go, Node, Deno, Python, or Elixir app (and databases!) all over the world. No ops required. Learn more at fly.io/changelog and check out the speedrun in their docs.
  • Typesense – Lightning fast, globally distributed Search-as-a-Service that runs in memory. You iterlly can’t get any faster!

Featuring:

Show Notes:

Something missing or broken? PRs welcome!

Timestamps:

(00:00) - Welcome to Practical AI
(00:43) - Erin Mikail Staples
(02:09) - Open source attendees
(03:54) - The key to RLHF
(05:35) - Tooling for RLHF
(07:33) - Humanities in data science
(11:22) - Label Studio's workflow
(15:41) - The open data ecosystem
(21:04) - Do data labeling
(22:33) - Exciting changes coming
(24:15) - DevRel(ish) and other resources
(25:13) - Goodbyes
(25:45) - Outro

Up next
Jul 7
Full-breadth developers for the win (Changelog News #151)
Justin Searls describes the "full-breadth developer" and why they'll win because AI, Cloudflare comes up with a way publishers can charge crawlers for access, Hugo Bowne-Anderson explains why building AI agents fails so often, the Job Worth Calculator tells you if your job is wor ... Show More
8m 54s
Jul 4
Selling mountain bikes all over the planet (Changelog & Friends #100)
Jeff Cayley joins Adam to talk about selling mountain bikes all over the planet and making some of the best outdoor and mountain bike gear, parts, and accessories you can buy. They have a killer YouTube channel as well. 
2h 8m
Jul 2
Agent, take the wheel (Changelog Interviews #648)
Thorsten Ball returned to Sourcegraph to work on Amp because he believes being able to talk to an alien intelligence that edits your code changes everything. On this episode, Thorsten joins us to discuss exactly how coding agents work, recent advancements in AI tooling, Amp's uni ... Show More
1h 53m
Recommended Episodes
Jan 2023
Machine learning at small organizations
Why is ML is so poorly adopted in small organizations (hint: it’s not because they don’t have enough data)? In this episode, Kirsten Lum from Storytellers shares the patterns she has seen in small orgs that lead to a successful ML practice. We discuss how the job of a ML Engineer ... Show More
49m 50s
Feb 2023
Is htmx the way to Go?
A quick look at the history of building web apps, followed by a discussion of htmx and how it compares to both modern and traditional ways of building. Discuss on Changelog News Changelog++ members save 4 minutes on this episode because they made the ads disappear. Join today! Sp ... Show More
1h 16m
Apr 2024
Mamba & Jamba
First there was Mamba… now there is Jamba from AI21. This is a model that combines the best non-transformer goodness of Mamba with good ‘ol attention layers. This results in a highly performant and efficient model that AI21 has open sourced! We hear all about it (along with a var ... Show More
41m 13s
May 2024
Microsoft is all-in on AI: Part 1 (Interview)
Scott Guthrie joins the show this week from Microsoft Build 2024 to discuss Microsoft being all-in on AI. From Copilot, to Azure AI and Prompty, to their developer first focus, leading GitHub, VS Code being the long bet that paid off, to the future of a doctor’s bedside manner as ... Show More
1h 4m
Mar 2024
AI vs software devs
Daniel and Chris are out this week, so we’re bringing you conversations all about AI’s complicated relationship to software developers from other Changelog pods: JS Party, Go Time & The Changelog.Join the discussionChangelog++ members save 2 minutes on this episode because they m ... Show More
57 m
Jun 2023
Digging through Nick Nisi’s tool box
KBall interviews Nick Nisi about the Pandora’s box that is his tooling/developer setup. Starting at the lowest layer of the terminal emulator he uses, they move upwards into command line tools, into Tmux (terminals within terminals!), his epic NeoVim configuration, and finally in ... Show More
1 h
Jun 2024
Rise of the AI PC & local LLMs
We’ve seen a rise in interest recently and a number of major announcements related to local LLMs and AI PCs. NVIDIA, Apple, and Intel are getting into this along with models like the Phi family from Microsoft. In this episode, we dig into local AI tooling, frameworks, and optimiz ... Show More
35m 35s
May 2023
Chanuki Seresinhe - Head of Data Science at Zoopla - Generative AI & AI for happiness #33
Our guest today is Chanuki Seresinhe, head of Data Science at Zoopla,  a company which provides millions of users with access to properties for sale and for rent. In our conversation, we first talk about Chanuki’s PhD where she used machine learning to identify relationships betw ... Show More
57m 13s
Mar 2023
AI and Coding with ChatGPT
In this episode of Syntax, Wes and Scott talk about the current landscape of AI, how AI is trained, is AI going to take your job, who’s going to train AI, and adding AI to your applications. Sentry - Sponsor If you want to know what’s happening with your code, track errors and mo ... Show More
1h 6m
May 2024
Is it too late to opt out of AI? (Friends)
Tech lawyer Luis Villa returns to answer our most pressing questions: what’s up with all these new content deals? How did Google think it was a good idea to ship AI Summaries in its current state? Is it too late to opt out of AI? We also discuss AI in Hollywood (spoilers!), posit ... Show More
1h 39m