Jan 2018
47m 4s

MLG 027 Hyperparameters 1

OCDevel
About this episode

Full notes and resources at ocdevel.com/mlg/27

Hyperparameters are crucial elements in the configuration of machine learning models. Unlike parameters, which are learned by the model during training, hyperparameters are set by humans before the learning process begins. They are the knobs and dials that humans can control to influence the training and performance of machine learning models.

Definition and Importance

Hyperparameters differ from parameters such as theta in linear and logistic regression, which are weights learned during training. Hyperparameters are choices made by humans, such as the type of model, the number of neurons in a layer, or the overall architecture. Because these choices can significantly affect the model's performance, they are worth tuning deliberately rather than leaving to arbitrary values.
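
To make the distinction concrete, here is a minimal sketch in plain Python. The toy data, the function name train, and the specific values are assumptions for illustration and are not from the episode. The weight theta is learned by gradient descent, while learning_rate and epochs are fixed by a human beforehand.

```python
# Toy data generated from y = 2x, so the ideal learned weight is 2.0.
xs = [1.0, 2.0, 3.0, 4.0]
ys = [2.0, 4.0, 6.0, 8.0]


def train(learning_rate, epochs):
    """Fit y = theta * x by gradient descent on mean squared error.

    learning_rate and epochs are hyperparameters (chosen by a human);
    theta is a parameter (learned from the data).
    """
    theta = 0.0
    n = len(xs)
    for _ in range(epochs):
        # Gradient of mean((theta * x - y)^2) with respect to theta.
        grad = sum(2 * (theta * x - y) * x for x, y in zip(xs, ys)) / n
        theta -= learning_rate * grad
    return theta


theta = train(learning_rate=0.05, epochs=200)
print(theta)
```

On this toy data a learning rate of 0.05 converges towards 2.0, while 0.2 overshoots on every step and diverges, which is the sense in which a hyperparameter choice can make or break training.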

Types of Hyperparameters

Model Selection:

Choosing what model to use is itself a hyperparameter. For example, deciding between linear regression, logistic regression, naive Bayes, or neural networks.

Architecture of Neural Networks:

  • Number of Layers and Neurons: Deciding the width (number of neurons) and depth (number of layers).
  • Types of Layers: Whether to use LSTMs, convolutional layers, or dense layers.

Activation Functions:

Activation functions transform a layer's linear outputs into non-linear ones. Popular choices include ReLU, tanh, and sigmoid, with ReLU the usual default for hidden layers.
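
The three activations named above can be sketched with the standard library alone. This is an illustrative sketch of the scalar definitions, not how a deep learning framework implements them.

```python
import math


def relu(z):
    """Pass positive inputs through unchanged; clamp negatives to zero."""
    return max(0.0, z)


def sigmoid(z):
    """Squash any real number into the open interval (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def tanh(z):
    """Squash any real number into (-1, 1), centered on zero."""
    return math.tanh(z)


for z in (-2.0, 0.0, 2.0):
    print(z, relu(z), sigmoid(z), tanh(z))
```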

Regularization and Optimization:

These influence the learning process. Whether to use L1/L2 regularization or dropout, and which optimizer to use (e.g., Adam, Adagrad), are all hyperparameters.
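
As a rough sketch of what these regularization choices do, the snippet below adds an L1 or L2 penalty to a loss and applies (inverted) dropout to a list of activations. The function names, the penalty strength lam, the placeholder data_loss, and the dropout rate are all hypothetical values chosen for illustration.

```python
import random


def l1_penalty(weights, lam):
    """L1 regularization: lam times the sum of absolute weights."""
    return lam * sum(abs(w) for w in weights)


def l2_penalty(weights, lam):
    """L2 regularization: lam times the sum of squared weights."""
    return lam * sum(w * w for w in weights)


def dropout(activations, rate, rng):
    """Inverted dropout: zero each activation with probability `rate`
    and scale the survivors by 1 / (1 - rate)."""
    keep = 1.0 - rate
    return [a / keep if rng.random() < keep else 0.0 for a in activations]


weights = [1.0, -2.0]
data_loss = 0.25  # hypothetical loss on the training data
total_loss = data_loss + l2_penalty(weights, lam=0.1)

dropped = dropout([1.0, 2.0, 3.0, 4.0], rate=0.5, rng=random.Random(0))
print(total_loss, dropped)
```

Here lam and rate are the hyperparameters: they are picked before training and shape how the weights are learned, but are not learned themselves.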

Optimization Techniques

Techniques like grid search, random search, and Bayesian optimization systematically explore combinations of hyperparameters to find a good configuration for a given task. These methods can be computationally expensive, since each combination typically means training and evaluating a model, but they replace guesswork with a repeatable search.
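
A minimal sketch of grid search and random search follows. The validation_loss function is a hypothetical stand-in for "train a model with these hyperparameters and return its validation loss", and the search ranges are assumptions; in practice each call would be a full training run. Bayesian optimization is not sketched here because it needs a surrogate model.

```python
import itertools
import math
import random


def validation_loss(learning_rate, hidden_units):
    """Hypothetical objective, lowest at learning_rate=0.01, hidden_units=64."""
    return (math.log10(learning_rate) + 2) ** 2 + ((hidden_units - 64) / 64) ** 2


def grid_search(learning_rates, unit_options):
    """Evaluate every combination and return the best (learning_rate, units)."""
    candidates = itertools.product(learning_rates, unit_options)
    return min(candidates, key=lambda c: validation_loss(*c))


def random_search(n_trials, rng):
    """Sample n_trials random combinations and return the best one.

    The learning rate is sampled log-uniformly between 1e-4 and 1.
    """
    candidates = [
        (10 ** rng.uniform(-4, 0), rng.randint(8, 256))
        for _ in range(n_trials)
    ]
    return min(candidates, key=lambda c: validation_loss(*c))


best_grid = grid_search([1e-3, 1e-2, 1e-1], [16, 64, 256])
best_random = random_search(n_trials=20, rng=random.Random(0))
print(best_grid, best_random)
```

Grid search costs one evaluation per combination (nine here), so it grows quickly with each added hyperparameter; random search keeps the budget fixed at n_trials regardless of how many hyperparameters are searched.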

Challenges and Future Directions

The field strives towards simplifying the choice of hyperparameters, ideally automating them to become parameters of the model itself. Efforts like Google's AutoML aim to handle hyperparameter tuning automatically.

Understanding and optimizing hyperparameters is a cornerstone in machine learning, directly impacting the effectiveness and efficiency of a model. Progress continues to integrate these choices into model training, reducing the dependency on human intervention and trial-and-error experimentation.

Decision Tree

  • Model selection
    • Unsupervised? K-means Clustering => DL
    • Linear? Linear regression, logistic regression
    • Simple? Naive Bayes, Decision Tree (Random Forest, Gradient Boosting)
    • Little data? Boosting
    • Lots of data, complex situation? Deep learning
  • Network
    • Layer arch
      • Vision? CNN
      • Time? LSTM
      • Other? MLP
      • Trading LSTM => CNN decision
    • Layer size design (funnel, etc)
      • Face pics
      • From BTC episode
      • Don't know? Layers=1, Neurons=mean(inputs, output)
  • Activations / nonlinearity
    • Output
      • Sigmoid = predicts a probability, usually at the output layer
      • Softmax = multi-class
      • Nothing = regression
    • ReLU family (Leaky ReLU, ELU, SELU, ...) = mitigates the vanishing gradient (gradient is constant for positive inputs), good performance, usually the better choice
    • Tanh = classification between two classes; useful when a mean of 0 (zero-centered output) is important
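
Parts of the decision tree above can be written down as a small lookup sketch. The dictionary keys ("vision", "binary", and so on) and the function names are hypothetical labels chosen for this example; the mappings themselves follow the notes, including the fallback of one hidden layer sized at the mean of the input and output widths (rounded down here).

```python
import math

# Layer architecture by problem type, as listed in the notes.
LAYER_TYPE = {"vision": "CNN", "time": "LSTM", "other": "MLP"}

# Output activation by task; None means no activation (plain regression).
OUTPUT_ACTIVATION = {"binary": "sigmoid", "multiclass": "softmax", "regression": None}


def suggest_hidden_sizes(n_inputs, n_outputs):
    """Fallback from the notes: a single hidden layer whose width is the
    mean of the input and output sizes."""
    return [(n_inputs + n_outputs) // 2]


def softmax(logits):
    """Turn raw scores into multi-class probabilities that sum to 1."""
    shift = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(z - shift) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


print(LAYER_TYPE["vision"], suggest_hidden_sizes(784, 10))
print(OUTPUT_ACTIVATION["multiclass"], softmax([1.0, 2.0, 3.0]))
```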