Aug 2019
48m 7s

Building Tools And Platforms For Data An...

Tobias Macey
About this episode

Summary

Data engineers are responsible for building tools and platforms to power the workflows of other members of the business. Each group of users has its own set of requirements for how it accesses and interacts with those platforms, depending on the insights it is trying to gather. Benn Stancil is the chief analyst at Mode Analytics, and in this episode he explains the considerations and requirements that data analysts have for their tools. He also explains useful patterns for collaboration between data engineers and data analysts, and what they can learn from each other.

Announcements

  • Hello and welcome to the Data Engineering Podcast, the show about modern data management
  • When you’re ready to build your next pipeline, or want to test out the projects you hear about on the show, you’ll need somewhere to deploy it, so check out our friends at Linode. With 200Gbit private networking, scalable shared block storage, and a 40Gbit public network, you’ve got everything you need to run a fast, reliable, and bullet-proof data platform. If you need global distribution, they’ve got that covered too with world-wide datacenters including new ones in Toronto and Mumbai. And for your machine learning workloads, they just announced dedicated CPU instances. Go to dataengineeringpodcast.com/linode today to get a $20 credit and launch a new server in under a minute. And don’t forget to thank them for their continued support of this show!
  • You listen to this show to learn and stay up to date with what’s happening in databases, streaming platforms, big data, and everything else you need to know about modern data management. For even more opportunities to meet, listen, and learn from your peers you don’t want to miss out on this year’s conference season. We have partnered with organizations such as O’Reilly Media, Dataversity, Corinium Global Intelligence, and Data Council. Upcoming events include the O’Reilly AI conference, the Strata Data conference, the combined events of the Data Architecture Summit and Graphorum, and Data Council in Barcelona. Go to dataengineeringpodcast.com/conferences to learn more about these and other events, and take advantage of our partner discounts to save money when you register today.
  • Your host is Tobias Macey and today I’m interviewing Benn Stancil, chief analyst at Mode Analytics, about what data engineers need to know when building tools for analysts

Interview

  • Introduction
  • How did you get involved in the area of data management?
  • Can you start by describing some of the main features that you are looking for in the tools that you use?
  • What are some of the common shortcomings that you have found in out-of-the-box tools that organizations use to build their data stack?
  • What should data engineers be considering as they design and implement the foundational data platforms that higher order systems are built on, which are ultimately used by analysts and data scientists?
    • In terms of mindset, what are the ways that data engineers and analysts can align and where are the points of conflict?
  • In terms of team and organizational structure, what have you found to be useful patterns for reducing friction in the product lifecycle for data tools (internal or external)?
  • What are some anti-patterns that data engineers can guard against as they are designing their pipelines?
  • In your experience as an analyst, what have been the characteristics of the most seamless projects that you have been involved with?
  • How much understanding of analytics is necessary for data engineers to be successful in their projects and careers?
    • Conversely, how much understanding of data management should analysts have?
  • What are the industry trends that you are most excited by as an analyst?

Contact Info

Parting Question

  • From your perspective, what is the biggest gap in the tooling or technology for data management today?

Closing Announcements

  • Thank you for listening! Don’t forget to check out our other show, Podcast.__init__, to learn about the Python language, its community, and the innovative ways it is being used.
  • Visit the site to subscribe to the show, sign up for the mailing list, and read the show notes.
  • If you’ve learned something or tried out a project from the show then tell us about it! Email hosts@dataengineeringpodcast.com with your story.
  • To help other people find the show please leave a review on iTunes and tell your friends and co-workers
  • Join the community in the new Zulip chat workspace at dataengineeringpodcast.com/chat

Links

The intro and outro music is from The Hug by The Freak Fandango Orchestra / CC BY-SA
