logo
episode-header-image
Nov 2020
41m 52s

MLG 032 Cartesian Similarity Metrics

OCDevel
About this episode

Try a walking desk while studying ML or working on your projects!

Show notes at ocdevel.com/mlg/32.

L1/L2 norm, Manhattan, Euclidean, cosine distances, dot product

Normed distances link

  • A norm is a function that assigns a strictly positive length to each vector in a vector space. link
  • Minkowski is generalized. p_root(sum(xi-yi)^p). "p" = ? (1, 2, ..) for below.
  • L1: Manhattan/city-block/taxicab. abs(x2-x1)+abs(y2-y1). Grid-like distance (triangle legs). Preferred for high-dim space.
  • L2: Euclidean. sqrt((x2-x1)^2+(y2-y1)^2. sqrt(dot-product). Straight-line distance; min distance (Pythagorean triangle edge)
  • Others: Mahalanobis, Chebyshev (p=inf), etc

Dot product

  • A type of inner product. Outer-product: lies outside the involved planes. Inner-product: dot product lies inside the planes/axes involved link. Dot product: inner product on a finite dimensional Euclidean space link

Cosine (normalized dot)

Up next
Jul 14
MLA 027 AI Video End-to-End Workflow
How to maintain character consistency, style consistency, etc in an AI video. Prosumers can use Google Veo 3’s "High-Quality Chaining" for fast social media content. Indie filmmakers can achieve narrative consistency by combining Midjourney V7 for style, Kling for lip-synced dial ... Show More
1h 11m
Jul 12
MLA 026 AI Video Generation: Veo 3 vs Sora, Kling, Runway, Stable Video Diffusion
Google Veo leads the generative video market with superior 4K photorealism and integrated audio, an advantage derived from its YouTube training data. OpenAI Sora is the top tool for narrative storytelling, while Kuaishou Kling excels at animating static images with realistic, hig ... Show More
40m 39s
Jul 9
MLA 025 AI Image Generation: Midjourney vs Stable Diffusion, GPT-4o, Imagen & Firefly
The AI image market has split: Midjourney creates the highest quality artistic images but fails at text and precision. For business use, OpenAI's GPT-4o offers the best conversational control, while Adobe Firefly provides the strongest commercial safety from its exclusively licen ... Show More
58m 51s
Recommended Episodes
Aug 2024
813: Solving Business Problems Optimally with Data, with Jerry Yurchisin
Jerry Yurchisin from Gurobi joins Jon Krohn to break down mathematical optimization, showing why it often outshines machine learning for real-world challenges. Find out how innovations like NVIDIA’s latest CPUs are speeding up solutions to problems like the Traveling Salesman in ... Show More
1h 43m
Dec 2024
From probabilistic bisimulation to representation learning via metrics
Strachey Lecture: From probabilistic bisimulation to representation learning via metrics - Professor Prakash Panangaden Bisimulation is a fundamental equivalence relation in process theory invented by Robin Milner and with an elegant fixed-point definition due to David Park. In t ... Show More
55m 3s
Jan 2025
Linear’s secret to building beloved B2B products | Nan Yu (Head of Product)
Nan Yu is the head of product at Linear, one of the most beloved and fastest-growing B2B SaaS products out there today, and the gold standard for high-performing tech teams. In our conversation, we discuss:• Why speed and quality aren’t actually at odds• Linear’s unique approach ... Show More
1h 21m
Aug 2024
Metrics Driven Development
How do you systematically measure, optimize, and improve the performance of LLM applications (like those powered by RAG or tool use)? Ragas is an open source effort that has been trying to answer this question comprehensively, and they are promoting a “Metrics Driven Development” ... Show More
42m 12s
Nov 2024
AI and the Future of Math, with DeepMind’s AlphaProof Team
In this week’s episode of No Priors, Sarah and Elad sit down with the Google DeepMind team behind AlphaProof, a new reinforcement learning-based system for formal math reasoning that recently reached a silver-medal standard in solving International Mathematical Olympiad problems. ... Show More
39m 21s
Jan 2025
D2DO262: The Science and Magic of Network Mapping and Measurement
Measuring network speed seems straightforward: send some traffic between two endpoints and see how long it took. But what path did the traffic take? Were wired or wireless networks involved? What intermediary devices stood between your two endpoints? Was there interference? Conge ... Show More
38m 40s
Jul 2024
7 Dimensions of ABA
ABA is a data-driven and evidence-based field. The 7 dimensions of ABA are the core principles that ensure effectiveness in each treatment plan. As BCBAs, we had to memorize these 7 dimensions for our exam, but it may have been a while since you last revisited them. Here, we reca ... Show More
14m 23s
Feb 2025
#287 Self-Service Generative AI Product Development at Credit Karma with Madelaine Daianu, Head of Data & AI at Credit Karma
As businesses collect more data than ever, the question arises: is bigger always better? Companies are beginning to question whether massive datasets and complex infrastructures are truly delivering results or just adding unnecessary costs. How can you align your data strategy wi ... Show More
48m 17s
Jun 2023
AI trends: a Latent Space crossover
Daniel had the chance to sit down with @swyx and Alessio from the Latent Space pod in SF to talk about current AI trends and to highlight some key learnings from past episodes. The discussion covers open access LLMs, smol models, model controls, prompt engineering, and LLMOps. Th ... Show More
59m 39s