Mar 2024
1h 26m


Patrick Wheeler and Jason Gauci
About this episode

172: Transformers and Large Language Models


Intro topic: Is WFH actually WFC?

News/Links:


Book of the Show


Patreon Plug https://www.patreon.com/programmingthrowdown?ty=h


Tool of the Show


Topic: Transformers and Large Language Models

  • How neural networks store information
    • Latent variables
  • Transformers
    • Encoders & Decoders
  • Attention Layers
    • History
      • RNN
        • Vanishing Gradient Problem
      • LSTM
        • Short term (gradient explodes), Long term (gradient vanishes)
    • Differentiable algebra
    • Key-Query-Value
    • Self Attention
  • Self-Supervised Learning & Forward Models
  • Human Feedback
    • Reinforcement Learning from Human Feedback
    • Direct Preference Optimization (Pairwise Ranking)
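
For listeners who want to see the Key-Query-Value and Self Attention items from the outline in code, here is a minimal sketch of single-head scaled dot-product self-attention in plain Python. It is an illustration under stated assumptions, not the hosts' code or any library's API: the function names (`softmax`, `matmul`, `self_attention`), the toy input, and the identity projection matrices are all made up for this example, and real models use learned weights, multiple heads, and tensor libraries.

```python
import math


def softmax(xs):
    """Turn a list of scores into weights that sum to 1."""
    m = max(xs)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]


def self_attention(x, w_q, w_k, w_v):
    """Single-head self-attention over a sequence of token vectors.

    x is a list of token vectors; w_q, w_k, w_v are hypothetical
    projection matrices (learned in a real model).
    Returns (outputs, attention_weights).
    """
    q = matmul(x, w_q)  # queries: what each token is looking for
    k = matmul(x, w_k)  # keys: what each token offers
    v = matmul(x, w_v)  # values: the content that gets mixed together
    d_k = len(k[0])
    # Score every query against every key, scaled by sqrt(d_k).
    scores = [[sum(qi * ki for qi, ki in zip(q_row, k_row)) / math.sqrt(d_k)
               for k_row in k]
              for q_row in q]
    # Each row of weights says how much one token attends to the others.
    weights = [softmax(row) for row in scores]
    # Each output is a weighted average of the value vectors.
    return matmul(weights, v), weights


if __name__ == "__main__":
    # Toy sequence of three 2-dimensional tokens (assumed values).
    tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
    identity = [[1.0, 0.0], [0.0, 1.0]]  # stand-in for learned projections
    outputs, weights = self_attention(tokens, identity, identity, identity)
    for row in weights:
        print([round(w, 3) for w in row])
```

Every step here is built from differentiable operations (matrix products, exponentials, division), which is what lets the attention weights be trained with gradient descent like any other layer.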



★ Support this podcast on Patreon ★