Nov 2024
49m 33s

Humans vs. Bots: Are You Talking to a Ma...

FRANCESCO GADALETA
About this episode

In this episode of Data Science at Home, host Francesco Gadaleta dives deep into the evolving world of AI-generated content detection with experts Souradip Chakraborty, a Ph.D. student at the University of Maryland, and Amrit Singh Bedi, a CS faculty member at the University of Central Florida.

Together, they explore the growing importance of distinguishing human-written from AI-generated text, discussing real-world examples from social media to news. How reliable are current detection tools like DetectGPT? What are the ethical and technical challenges ahead as AI continues to advance? And is the balance between innovation and regulation tipping in the right direction?

Tune in for insights on the future of AI text detection and the broader implications for media, academia, and policy.

Chapters

00:00 - Intro

00:23 - Guests: Souradip Chakraborty and Amrit Singh Bedi 

01:25 - Distinguishing AI-Generated Text

04:33 - Research on Safety and Alignment of Generative Models

06:01 - Tools to Detect AI-Generated Text

11:28 - Watermarking

18:27 - Challenges in Detecting Large Documents Generated by AI 

23:34 - Number of Tokens 

26:22 - Adversarial Attacks

29:01 - True Positives and False Positives of Detectors

31:01 - Limits of the Technology

41:01 - Future of AI Detection Techniques 

46:04 - Closing Thoughts

Subscribe to our new YouTube channel: https://www.youtube.com/@DataScienceatHome

 
