Mar 2024
58m 25s
#454: Data Pipelines with Dagster
MICHAEL KENNEDY
About this episode
See the full show notes for this episode on the website at talkpython.fm/454
Up next
May 5
#504: Developer Trends in 2025
What trends and technologies should you be paying attention to today? Are there hot new database servers you should check out? Or will that just be a flash in the pan? I love these forward-looking episodes and this one is super fun. I've put together an amazing panel: Gina Häußge ...
1h 9m
Apr 28
#503: The PyArrow Revolution
Pandas is at the core of virtually all data science done in Python, that is, virtually all data science. Since its beginning, Pandas has been based upon NumPy. But changes are afoot to update those internals and you can now optionally use PyArrow. PyArrow comes with a ton of be ...
1h 8m
Apr 21
#502: Django Ledger: Accounting with Python
Do you or your company need accounting software? Well, there are plenty of SaaS products out there that you can give your data to. But maybe you also really like Django and would rather have a foundation to build your own accounting system exactly as you need for your company or ...
1h 3m
Recommended Episodes
Sep 2021
Massively Parallel Data Processing In Python Without The Effort Using Bodo
Summary Python has become the de facto language for working with data. That has brought with it a number of challenges having to do with the speed and scalability of working with large volumes of information. There have been many projects and strategies for overcoming these challen ...
1h 4m
Feb 2023
Shorten the distance between production data and insight
Modern networked applications generate a lot of data, and every business wants to make the most of that data. Most of the time, that means moving production data through some transformation process to get it ready for the analytics process. But what if you could have in-app analy ...
20m 27s
Mar 2024
Ship Smarter Not Harder With Declarative And Collaborative Data Orchestration On Dagster+
Summary A core differentiator of Dagster in the ecosystem of data orchestration is their focus on software defined assets as a means of building declarative workflows. With their launch of Dagster+ as the redesigned commercial companion to the open source project they are investi ...
55m 40s
Apr 2024
Establish A Single Source Of Truth For Your Data Consumers With A Semantic Layer
Summary Maintaining a single source of truth for your data is the biggest challenge in data engineering. Different roles and tasks in the business need their own ways to access and analyze the data in the organization. In order to enable this use case, while maintaining a single ...
56m 23s
Oct 2023
#628: Data on EKS
Organizations use their data to make better decisions and build innovative experiences for their customers. With the exponential growth in data, and the rapid pace of innovation in machine learning (ML), there is a growing need to build modern data applications that are agile and ...
20m 56s
Mar 2024
Version Your Data Lakehouse Like Your Software With Nessie
Summary Data lakehouse architectures are gaining popularity due to the flexibility and cost effectiveness that they offer. The link that bridges the gap between data lake and warehouse capabilities is the catalog. The primary purpose of the catalog is to inform the query engine o ...
40m 55s