Jan 2025
40m 26s

Auditing LLMs and Twitter

Kyle Polich
About this episode

Our guests, Erwan Le Merrer and Gilles Tredan, are long-time collaborators in graph theory and distributed systems. They share their expertise on applying graph-based approaches to understanding both large language model (LLM) hallucinations and shadow banning on social media platforms.

In this episode, listeners will learn how graph structures and metrics can reveal patterns in algorithmic behavior and platform moderation practices.

Key insights include the use of graph theory to evaluate LLM outputs, uncovering patterns in hallucinated graphs that might hint at the underlying structure and training data of the models, and applying epidemic models to analyze the uneven spread of shadow banning on Twitter.

-------------------------------

Want to listen ad-free? Try our Graphs Course? Join Data Skeptic+ for $5 / month or $50 / year

https://plus.dataskeptic.com
