logo
episode-header-image
Nov 2020
41m 52s

MLG 032 Cartesian Similarity Metrics

OCDevel
About this episode

Try a walking desk to stay healthy while you study or work!

Show notes at ocdevel.com/mlg/32.

L1/L2 norm, Manhattan, Euclidean, cosine distances, dot product


Normed distances link

  • A norm is a function that assigns a strictly positive length to each vector in a vector space. link
  • Minkowski is generalized. p_root(sum(xi-yi)^p). "p" = ? (1, 2, ..) for below.
  • L1: Manhattan/city-block/taxicab. abs(x2-x1)+abs(y2-y1). Grid-like distance (triangle legs). Preferred for high-dim space.
  • L2: Euclidean. sqrt((x2-x1)^2+(y2-y1)^2sqrt(dot-product). Straight-line distance; min distance (Pythagorean triangle edge)
  • Others: Mahalanobis, Chebyshev (p=inf), etc

Dot product

  • A type of inner product.
    Outer-product: lies outside the involved planes. Inner-product: dot product lies inside the planes/axes involved link. Dot product: inner product on a finite dimensional Euclidean space link

Cosine (normalized dot)

Up next
Nov 2020
MLA 012 Docker for Machine Learning Workflows
<div> <p>Docker enables efficient, consistent machine learning environment setup across local development and cloud deployment, avoiding many pitfalls of virtual machines and manual dependency management. It streamlines system reproduction, resource allocation, and GPU access, su ... Show More
31m 41s
Jan 2021
MLA 013 Tech Stack for Customer-Facing Machine Learning Products
<div> <p>Primary technology recommendations for building a customer-facing machine learning product include React and React Native for the front end, serverless platforms like AWS Amplify or GCP Firebase for authentication and basic server/database needs, and Postgres as the rela ... Show More
47m 37s
Jan 2021
MLA 014 Machine Learning Hosting and Serverless Deployment
<div> <p>Machine learning model deployment on the cloud is typically handled with solutions like AWS SageMaker for end-to-end training and inference as a REST endpoint, AWS Batch for cost-effective on-demand batch jobs using Docker containers, and AWS Lambda for low-usage, server ... Show More
52m 33s
Recommended Episodes
Aug 2024
813: Solving Business Problems Optimally with Data, with Jerry Yurchisin
Jerry Yurchisin from Gurobi joins Jon Krohn to break down mathematical optimization, showing why it often outshines machine learning for real-world challenges. Find out how innovations like NVIDIA’s latest CPUs are speeding up solutions to problems like the Traveling Salesman in ... Show More
1h 43m
Dec 2024
From probabilistic bisimulation to representation learning via metrics
Strachey Lecture: From probabilistic bisimulation to representation learning via metrics - Professor Prakash Panangaden Bisimulation is a fundamental equivalence relation in process theory invented by Robin Milner and with an elegant fixed-point definition due to David Park. In t ... Show More
55m 3s
Jan 2025
Linear’s secret to building beloved B2B products | Nan Yu (Head of Product)
<p><strong>Nan Yu</strong> is the head of product at Linear, one of the most beloved and fastest-growing B2B SaaS products out there today, and the gold standard for high-performing tech teams. In our conversation, we discuss:</p><p>• Why speed and quality aren’t actually at odds ... Show More
1h 21m
Mar 2024
Better Satellite World: Intelligence from Space
<p>In this Better Satellite World podcast, SSPI's Lou Zacharilla speaks with <strong>John Murtagh</strong>, Head of Strategy at Airbus Defence and Space - Intelligence. They discuss John's work with Airbus' Intelligence program unit and the key role of Earth observation data anal ... Show More
36m 44s
Aug 2024
Metrics Driven Development
<p>How do you systematically measure, optimize, and improve the performance of LLM applications (like those powered by RAG or tool use)? Ragas is an open source effort that has been trying to answer this question comprehensively, and they are promoting a “Metrics Driven Developme ... Show More
42m 12s
Oct 5
The Data Model That Captures Your Business: Metric Trees Explained
SummaryIn this episode of the Data Engineering Podcast Vijay Subramanian, founder and CEO of Trace, talks about metric trees - a new approach to data modeling that directly captures a company's business model. Vijay shares insights from his decade-long experience building data pr ... Show More
1h 1m
Apr 2020
Manometers - Short #91
<p>In this short podcast episode, Sal Hamidi of ProductsByPros joins Bryan to discuss what manometers are and how they measure pressure.</p> <p>Manometers measure pressure at a much higher scale than a micron gauge but lower than that of a pressure gauge; they measure pressure di ... Show More
17m 48s
Nov 2024
AI and the Future of Math, with DeepMind’s AlphaProof Team
In this week’s episode of No Priors, Sarah and Elad sit down with the Google DeepMind team behind AlphaProof, Laurent Sartran, Rishi Mehta, and Thomas Hubert. AlphaProof is a new reinforcement learning-based system for formal math reasoning that recently reached a silver-medal st ... Show More
39m 21s
Jan 2025
D2DO262: The Science and Magic of Network Mapping and Measurement
Measuring network speed seems straightforward: send some traffic between two endpoints and see how long it took. But what path did the traffic take? Were wired or wireless networks involved? What intermediary devices stood between your two endpoints? Was there interference? Conge ... Show More
38m 40s