logo
episode-header-image
Sep 2023
22m 14s

Attacking LLMs for fun and profit (Ep. 2...

FRANCESCO GADALETA
About this episode

As a continuation of Episode 238, I explain some effective and fun attacks to conduct against LLMs. Such attacks are even more effective on models served locally, that are hardly controlled by human feedback.

Have great fun and learn them responsibly.

 

References

https://www.jailbreakchat.com/

https://www.reddit.com/r/ChatGPT/comments/10tevu1/new_jailbreak_proudly_unveiling_the_tried_and/

https://arxiv.org/abs/2305.13860

 

Up next
Jul 7
Tech's Dumbest Mistake: Why Firing Programmers for AI Will Destroy Everything (Ep. 286) [RB]
From the viral article "Tech's Dumbest Mistake: Why Firing Programmers for AI Will Destroy Everything" on my newsletter at https://defragzone.substack.com/p/techs-dumbest-mistake-why-firing here are my thoughts about AI replacing programmers... 🎙️ Sponsors AGNTCY — The open sour ... Show More
18m 44s
Jun 18
Brains in the Machine: The Rise of Neuromorphic Computing (Ep. 285)
In this episode of Data Science at Home, we explore the fascinating world of neuromorphic computing — a brain-inspired approach to computation that could reshape the future of AI and robotics. The episode breaks down how neuromorphic systems differ from conventional AI architectu ... Show More
24m 18s
Jun 3
DSH/Warcoded - AI in the Invisible Battlespace (Ep. 284)
This episode explores the invisible battlespace of cyber and electronic warfare, where AI takes center stage. From autonomous hacking bots to smart jamming and adversarial attacks on machine learning models, we uncover how modern conflicts are increasingly fought with code, not b ... Show More
21m 22s
Recommended Episodes
Aug 2023
التقنية مو سهالات مع فيصل السيف
يستضيف برنامج سهالات في الحلقة الأولى المؤثر في مجال التقنية أ.فيصل السيف في حديث عفوي وشيّق حول التقنيات، الشبكات، القديم والحديث منها، مستقبل الألعاب والواقع الافتراضي، يُناقش حقيقة المخاوف والشكوك حول التطور التقني والمستوى المطلوب من الأمان والحذر في التعاطي معها، يناقش الذكا ... Show More
1h 17m
Aug 2022
OSPod Episode 50: Conservation of Ninjutsu, 2 Million Special, and Supervillains on Skates
We hit 50 episode AND 2 million subscribers! What a rush! Thank you all for listening and watching along with us. And never fear, there is plenty to keep listening to this episode. How many ninjas does it take to defeat Ultron? Is Blue the protagonist or a lancer in Breath of the ... Show More
1h 11m
Dec 2020
May you have a very unsafe 2021 [E190]
Playing it safe won't keep you safe for very long.  Episode Links: https://www.france24.com/en/europe/20201222-covid-19-french-bill-could-ban-unvaccinated-from-public-transport  Until next time… Be a purpose seeker, truth lover, and own your future.To take more steps to live a fo ... Show More
21m 58s
Sep 2023
Episode 35: King of Collaboration: Douglas Day
Episode 35: In this episode of Critical Thinking - Bug Bounty Podcast, we're thrilled to welcome Douglas Day, a bug bounty hunter known for his unique methodologies and collaborative spirit. We talk about his approach to finding new endpoints in applications, his ingenious techni ... Show More
1h 25m
Jan 2021
Best of In the Bubble: What No One Knows About COVID-19 (with Larry Brilliant)
If you didn’t catch this Best of In the Bubble episode the first time around, you are in for a treat! And even if you've heard it already, go ahead and give it another listen. If there is one expert Andy could talk with about coronavirus and how we are really doing, it is epidemi ... Show More
50m 13s
Nov 2023
Episode 44: URL Parsing & Auth Bypass Magic
Episode 44: In this episode of Critical Thinking - Bug Bounty Podcast, the topic is URL structure, and Justin and Joel break down the elements that make up a URL and some common tips and tricks surrounding them which allow for all sorts of bypasses. We also round out the episode ... Show More
1h 11m
Oct 2023
OSPod Episode 78: Byzantines, Fearless Lads, and Delicious Delicious Power Gaming!
The OSPod crew is back from a busy couple weeks! Epic-length Byzantine videos, boys without fear, talks and conventions oh my! And at the end of it all, perhaps the return of a beloved thought experiment...Our podcast, like our videos, sometimes touches on the violence, assaults, ... Show More
59m 16s
Feb 2019
Hybrid war and tactical influence operations. Separ lives off the land. NoRelationship attacks get past email filters. Responsible disclosure. Man-in-the-room bug. Ship hacking. Password managers.
In today’s podcast we hear about a test of influencing soldiers through their social media: Instagram works best, Twitter not so much. Separ credential-stealing malware successfully lives off the land. NoRelationship attacks get past some email filters. Spamming users to get your ... Show More
21m 33s
Aug 2023
OSPod Episode 74: Malta, Empty Worlds, and A Bold New Yiga Strategy!
The crew is reunited at last, and they've got some spicy topics to cover! From Blue's ASMR recs to Red's superhero hot takes, we've got it all and Malta in this week's episode of the Overly Sarcastic Podcast!Our podcast, like our videos, sometimes touches on the violence, assault ... Show More
1h 3m