Feb 2023
20m 27s

Shorten the distance between production ...

The Stack Overflow Podcast
About this episode

Modern networked applications generate a lot of data, and every business wants to make the most of that data. Most of the time, that means moving production data through a transformation process to get it ready for analytics. But what if you could have in-app analytics? What if you could generate insights directly from production data?

On this sponsored episode of the podcast, we talk with Stanimira Vlaeva, Developer Advocate at MongoDB, and Fredric Favelin, Technical Director, Partner Presales at MongoDB, about how a serverless database can minimize the distance between producing data and understanding it.

Episode notes:

Stanimira talked a lot about using BigQuery with MongoDB Atlas on Google Cloud Run. If you need to skill up on these three tools, check out this tutorial.

Once you’ve got the hang of it, get your data connected with Confluent Connectors.

With Atlas, you can transform your data in JavaScript.

Connect with Stanimira on LinkedIn and Twitter

Connect with Fredric on LinkedIn

Congrats to Stellar Question winner SubniC for Get name of current script in Python.
