logo
episode-header-image
Dec 2021
50m 30s

531: Data Science at the Command Line

Jon Krohn
About this episode

Jeroen Janssens joins on the podcast to discuss his book on utilizing the command line for data science and the importance of polyglot data science work.


In this episode you will learn:

  • The genesis of Jeroen’s book [3:24]
  • Data Science at the Command Line [8:55]
  • Creating your own command line tools [22:07]
  • Polyglot data scientist [24:29]
  • Data Science Workshops [27:01]
  • Jeroen’s PhD research [30:38]


Additional materials: www.superdatascience.com/531

Up next
Jul 8
903: LLM Benchmarks Are Lying to You (And What to Do Instead), with Sinan Ozdemir
Has AI benchmarking reached its limit, and what do we have to fill this gap? Sinan Ozdemir speaks to Jon Krohn about the lack of transparency in training data and the necessity of human-led quality assurance to detect AI hallucinations, when and why to be skeptical of AI benchmar ... Show More
1h 28m
Jul 4
902: In Case You Missed It in June 2025
In this episode of “In Case You Missed It”, Jon recaps his June interviews on The SuperDataScience Podcast. Hear from Diane Hare, Avery Smith, Kirill Eremenko, and Shaun Johnson as they talk about the best portfolios for AI practitioners, how to stand out in a saturated candidate ... Show More
29m 29s
Jul 1
901: Automating Legal Work with Data-Centric ML (feat. Lilith Bat-Leah)
Senior Director of AI Labs for Epiq Lilith Bat-Leah speaks to Jon Krohn about the ways AI have disrupted the legal industry using LLMs and retrieval-augmented generation (RAG), as well as how the data-centric machine learning research movement (DMLR) is systematically improving d ... Show More
1h 6m
Recommended Episodes
Oct 2021
AI Today Podcast: Data science in the Enterprise: Interview with Sanyam Bhutani, host of Chai Time Data Science podcast
On the AI Today podcast we regularly interview thought leaders who are implementing AI and cognitive technology at various companies and agencies. However in this episode hosts Kathleen Walch and Ron Schmelzer interview Sanyam Bhutani, host of Chai Time Data Science podcast. As h ... Show More
23m 38s
Nov 2024
SE Radio 641: Catherine Nelson on Machine Learning in Data Science
Catherine Nelson, author of the new O’Reilly book, Software Engineering for Data Scientists, discusses the collaboration between data scientists and software engineers -- an increasingly common pairing on machine learning and AI projects. Host Philip Winston speaks with Nelson ab ... Show More
48m 19s
Jul 2023
AI Today Podcast: AI Glossary Series – Data Science Notebooks, Jupyter, Colab
In this episode of the AI Today podcast hosts Kathleen Walch and Ron Schmelzer define the terms Data Science Notebooks, Jupyter, Colab, explain how these terms relate to AI and why it’s important to know about them. Show Notes: FREE Intro to CPMAI mini course CPMAI Training and C ... Show More
11 m
Apr 2021
AI Today Podcast: Leading Data Scientists The Right Way, Interview with Ylan Kazi, UnitedHealth Group
As organizations continue to hire more data scientists it’s important to make sure they are being utilized to emphasize their skill sets. In this episode of the AI Today podcast hosts Kathleen Walch and Ron Schmelzer interview Ylan Kazi, Vice President, Data Science and Machine L ... Show More
25m 39s
Oct 2017
Data science tools and other announcements from Ignite
In this episode, Microsoft's Corporate Vice President for Cloud Artificial Intelligence, Joseph Sirosh, joins host Kyle Polich to share some of the Microsoft's latest and most exciting innovations in AI development platforms. Last month, Microsoft launched a set of three powerful ... Show More
31m 40s
Feb 2022
Nick Singh - Ace the Data Science Interview #8
Our guest today is Nick Singh, ex-Facebook, Google, Microsoft and Author of "Ace the Data Science Interview", an Amazon best seller book which helps you land your dream Data Science job. In our conversation, we first talk about Nick's career in industry. We explore how he ma ... Show More
59m 12s
Dec 2024
Adam Brown – How Future Civilizations Could Change The Laws of Physics
Adam Brown is a founder and lead of BlueShift with is cracking maths and reasoning at Google DeepMind and a theoretical physicist at Stanford.We discuss: destroying the light cone with vacuum decay, holographic principle, mining black holes, & what it would take to train LLMs tha ... Show More
2h 43m
Jun 2024
#467: Data Science Panel at PyCon 2024
I have a special episode for you this time around. We're coming to you live from PyCon 2024. I had the chance to sit down with some amazing people from the data science side of things: Jodie Burchell, Maria Jose Molina-Contreras, and Jessica Greene. We cover a whole set of recent ... Show More
34m 40s
Jun 2024
113: Operations Research, Prescriptive Analytics, & Decision Science w/ Adam De Jans & Steven Stark
Get insights into career transitions, the importance of networking, and the tools used in data positions in this episode! Avery talks with data experts Steven Stark and Adam Dijans as they explore the fascinating field of operations research. 🤝 Connect with Adam De Jans 🤝 Conne ... Show More
41m 35s
Jul 2024
#225 The Full Stack Data Scientist with Savin Goyal, Co-Founder & CTO at Outerbounds
The role of the data scientist is changing. Some organizations are splitting the role into more narrowly focused jobs, while others are broadening it. The latter approach, known as the Full Stack Data Scientist, is derived from the concept of a full stack software engineer, with ... Show More
48m 44s