logo
episode-header-image
Jan 2025
40m 26s

Auditing LLMs and Twitter

Kyle Polich
About this episode

Our guests, Erwan Le Merrer and Gilles Tredan, are long-time collaborators in graph theory and distributed systems. They share their expertise on applying graph-based approaches to understanding both large language model (LLM) hallucinations and shadow banning on social media platforms.

In this episode, listeners will learn how graph structures and metrics can reveal patterns in algorithmic behavior and platform moderation practices.

Key insights include the use of graph theory to evaluate LLM outputs, uncovering patterns in hallucinated graphs that might hint at the underlying structure and training data of the models, and applying epidemic models to analyze the uneven spread of shadow banning on Twitter.

-------------------------------

Want to listen ad-free?  Try our Graphs Course?  Join Data Skeptic+ for $5 / month of $50 / year

https://plus.dataskeptic.com

Up next
Aug 17
Networks and Recommender Systems
Kyle reveals the next season's topic will be "Recommender Systems". Asaf shares insights on how network science contributes to the recommender system field. 
17m 45s
Jul 21
Network of Past Guests Collaborations
Kyle and Asaf discuss a project in which we link former guests of the podcast based on their co-authorship of academic papers. 
34m 10s
Jul 6
The Network Diversion Problem
In this episode, Professor Pål Grønås Drange from the University of Bergen, introduces the field of Parameterized Complexity - a powerful framework for tackling hard computational problems by focusing on specific structural aspects of the input. This framework allows researchers ... Show More
46m 14s
Recommended Episodes
Nov 2018
ML/DL for Non-Stationary Time Series Analysis in Financial Markets and Beyond with Stuart Reid - TWiML Talk #203
Today, we’re joined by Stuart Reid, Chief Scientist at NMRQL Research. NMRQL is an investment management firm that uses ML algorithms to make adaptive, unbiased, scalable, and testable trading decisions for its funds. In our conversation, Stuart and I dig into the way NMRQL uses ... Show More
58m 29s
Jul 2024
Building Real-World LLM Products with Fine-Tuning and More with Hamel Husain - #694
Today, we're joined by Hamel Husain, founder of Parlance Labs, to discuss the ins and outs of building real-world products using large language models (LLMs). We kick things off discussing novel applications of LLMs and how to think about modern AI user experiences. We then dig i ... Show More
1h 20m
Sep 2024
Graphiques boursiers : Que dit la science ?
Peut-on vraiment prédire le marché avec l'analyse technique ? Beaucoup d'investisseurs se fient aux graphiques pour leurs décisions d'achat et de vente, mais que se passe-t-il réellement dans notre cerveau quand on regarde ces graphiques de prix ? Une nouvelle étude fascinante ut ... Show More
8m 26s
Dec 2019
Automated Machine Learning with Erez Barak - #323
Today we’re joined by Erez Barak, Partner Group Manager of Azure ML at Microsoft. In our conversation, Erez gives us a full breakdown of his AutoML philosophy, and his take on the AutoML space, its role, and its importance. We also discuss the application of AutoML as a contribut ... Show More
42m 45s
Apr 2025
Teaching LLMs to Self-Reflect with Reinforcement Learning with Maohao Shen - #726
Today, we're joined by Maohao Shen, PhD student at MIT to discuss his paper, “Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search.” We dig into how Satori leverages reinforcement learning to improve language model reasoning ... Show More
51m 45s
Sep 2021
Hashing It Out - Pranav Maheshwari - TheGraph
Today Corey talks with Pranav Maheshwari from The Graph, an indexing protocol for querying networks like Ethereum and IPFS. We'll dive deep into how The Graph works, what it aims to solve, what complications it has faced as it has scaled out to its current state, and where it exp ... Show More
1h 5m
Aug 2024
The Building Blocks of Agentic Systems with Harrison Chase - #698
Today, we're joined by Harrison Chase, co-founder and CEO of LangChain to discuss LLM frameworks, agentic systems, RAG, evaluation, and more. We dig into the elements of a modern LLM framework, including the most productive developer experiences and appropriate levels of abstract ... Show More
59m 17s
Mar 2023
#21 Unlocking the power of real-world data – Patrick Ryan
The vast amounts of real-world data collected during routine clinical care are a treasure trove of safety information – but there are challenges to overcome before this rich source of evidence can be applied to pharmacovigilance. Patrick Ryan from Johnson & Johnson discusses how ... Show More
33m 42s
Sep 2024
Data for Dummies: A Crash Course for Non-Technical PMs (with Mo Hallaba)
In today's data-driven landscape, organizations often find themselves drowning in a sea of data, yet struggling to glean actionable insights from it. Many companies are eager to label themselves as data-centric, but the reality is that not everyone is equally adept at interp ... Show More
23m 23s
Dec 2024
À CONTRE/TEMPS - Episode #4 - Tendances et innovation : les contraintes comme levier
Dans cet épisode du podcast Ipsos "À CONTRE/TEMPS", Youmna Ovazza, Partner Ipsos Strategy3 et Mathieu Doiret de l’Ipsos Knowledge Centre reçoivent Dominique Desjeux, anthropologue, sociologue et Professeur des Universités émérite à l’Université de Paris, pour une discussion passi ... Show More
54m 52s