logo
episode-header-image
Mar 2024
1h 26m

172: Transformers and Large Language Mod...

Patrick Wheeler and Jason Gauci
About this episode

172: Transformers and Large Language Models


Intro topic: Is WFH actually WFC?

News/Links:


Book of the Show


Patreon Plug https://www.patreon.com/programmingthrowdown?ty=h


Tool of the Show


Topic: Transformers and Large Language Models

  • How neural networks store information
    • Latent variables
  • Transformers
    • Encoders & Decoders
  • Attention Layers
    • History
      • RNN
        • Vanishing Gradient Problem
      • LSTM
        • Short term (gradient explodes), Long term (gradient vanishes)
    • Differentiable algebra
    • Key-Query-Value
    • Self Attention
  • Self-Supervised Learning & Forward Models
  • Human Feedback
    • Reinforcement Learning from Human Feedback
    • Direct Policy Optimization (Pairwise Ranking)



★ Support this podcast on Patreon ★
Up next
Nov 4
185: Workflow Orchestrators
Intro topic: Asymmetric ReturnsNews/Links:NanoChat by Andrej Karpathyhttps://github.com/karpathy/nanochatPydantic AIhttps://www.marktechpost.com/2025/03/25/pydanticai-advancing-generative-ai-agent-development-through-intelligent-framework-design/1000th Starlink this yearhttps://s ... Show More
1h 32m
Sep 23
184: Asynchronous Programming
184: Asynchronous ProgrammingIntro topic: AI ScamsNews/Links:Coding Adventure: Ray-Tracing Glass and Caustics (Sebastian Lague)https://www.youtube.com/watch?v=wA1KVZ1eOuABoson AI announces Higgs Audio V2https://www.boson.ai/technologies/voice The Misconception that Almost Stopped ... Show More
1h 30m
Jul 2025
183: Landing a Software Job in 2025
00:00:00 Intro00:01:58 Introducing Mark Cunningham00:07:01 How Do You Find A Job?00:15:43 How to Get the Best Interview00:33:06 Tips on How To Pass An Interview00:38:38 How to Have a Good Interview00:48:12 What is the Reverse Interview?00:54:24 What Is The Hiring Manager's Role?0 ... Show More
1h 46m
Recommended Episodes
Oct 2023
Episode 39: The Art of Architectures
Episode 39: In this episode of Critical Thinking - Bug Bounty Podcast, We're catching up on news, including new override updates from Chrome, GPT-4, SAML presentations, and even a shoutout from Live Overflow! Then we get busy laying the groundwork on a discussion of web architect ... Show More
1h 21m
Apr 2024
Building Secure Software: Unveiling the Hidden Dependencies with Niels Tanis
<h3>Avalonia XPF</h3> <p>This episode of The Modern .NET Show is supported, in part, by <a href= "http://avaloniaui.net/themoderndotnetshow?utm_source=Podcasts&utm_campaign=The+.Modern+NET+Show+s6e16" target="_blank" rel="noopener">Avalonia XPF</a>, a binary-compatible cross-plat ... Show More
1h 15m
Jan 2023
Episode 2: Exploit Writing & Automation / Do you need to know how to program to hack?
Episode 2: In this episode of Critical Thinking - Bug Bounty Podcast we talk about exploit writing/automation, some new tools released in the industry (Of-CORS), the age old question of "Do you have to know how to program to hack?", a walk-through of some very impactful bug bount ... Show More
1h 14m
Mar 2023
#210 Will Robots Take Over? (AI Wrote this Episode)
<p>As time goes on, we&apos;re seeing more and more impressive advancements in AI (Artificial Intelligence) technology. Recently, a chat bot called Chat GPT has become popular and I asked this program to design my episode today. Let&apos;s see how it did!<br/><br/>In this episode ... Show More
35m 5s
Apr 2024
The GitLab way: Kindness, transparency, and short toes | David DeSanto (CPO)
<p><strong>David DeSanto</strong> is the chief product officer of GitLab, which is the largest remote-only company in the world. They share many of their team meetings on YouTube, and they’ve grown from being an open-source code management product competing with GitHub to a multi ... Show More
1h 21m
Jun 2023
#381 – Chris Lattner: Future of Programming and AI
Chris Lattner is a legendary software and hardware engineer, leading projects at Apple, Tesla, Google, SiFive, and Modular AI, including the development of Swift, LLVM, Clang, MLIR, CIRCT, TPUs, and Mojo. Please support this podcast by checking out our sponsors: - iHerb: https:/ ... Show More
3h 38m