In this episode I speak about data transformation frameworks available to data scientists who write Python code. The usual suspect is clearly Pandas, the most widely used library and the de facto standard. However, when data volumes increase and distributed algorithms come into play (following the map-reduce paradigm of computation), Pandas no longer perfo ...
Dec 22
When Data Stops Being Code and Starts Being Conversation (Ep. 297)
Mark Brocato built Mockaroo, the tool that taught millions of developers how to fake data. Now, as Head of Engineering at Tonic.ai, he's building the AI agent that's making his own creation obsolete. In this episode, we explore why static test data can't survive the AI era, what i ...
33m 37s