After all the back-patting around making data science datasets and code more openly available, we figured it was time to also dump a bucket of cold water on everyone's heads and talk about the things that can go wrong when data and code is a little too open.
In this episode, we'll talk about two interesting recent examples: a de-identified medical dataset ... Show More
Yesterday
It's RAG time: Retrieval-Augmented Generation
Today we are going to talk about the feature with the worst acronym in generative AI: RAG, or Retrieval Augmented Generation. If you've ever used something like "Chat with My Docs," if you have an internal AI chatbot that has access to your company's documents, or you've created ... Show More
17m 14s
Mar 2023
A History of Data from the Age of Reason to the Age of Algorithms
At Columbia University, data scientist Chris Wiggins and historian Matthew Jones teach a course called Data: Past, Present and Future. Out of this collaboration has come a book, How Data Happened: A History from the Age of Reason to the Age of Algorithms, to be published on Tuesd ... Show More
46m 27s
Sep 2021
Les données médicales d’1,5 millions de patients volées ?
<p>La semaine dernière, l'Assistance publique - Hôpitaux de Paris expliquait qu’elle avait subi une attaque informatique d’une ampleur inédite, ayant entraîné la perte de plus d’un million et demi de données médicales. Une attaque portée directement contre un service sécurisé de ... Show More
2m 1s
Jan 2022
Academics and Data Science Innovation with Dr. David Bader, Distinguished Professor and Director, Institute for Data Science, New Jersey Institute of Technology
<p>The data science field is expanding because so many businesses and other institutions require skilled workers who can manage data as well as provide insights. Companies and students are clamoring for more academic programs. There is great need, but academic institutions are st ... Show More
39m 32s