logo
episode-header-image
Feb 2023
46m 3s

Better Science Volume 2: Maps, Metadata,...

Pure Storage
About this episode
Jump in on a second episode of the Better Science series with guest host and Technical Evangelist Justin Emerson interviewing FlashArray engineer Feng Wang about how Pure maps data at scale with a single, scalable data structure. Managing storage in modern times requires a strategy for storing quadrillions of bits: petabytes of data, all stored efficiently o ... Show More
Up next
Aug 19
15 Architectural Decisions Series: Snapshots, Realistic Efficiency Metrics, and Adaptive RAID
Episode 4 of our series around Pure's fundamental design principles dives into aspects of Pure's technology that drive realistic efficiency for users. This episode tackles capabilities most Storage professionals are familiar with - snapshots, efficiency metrics and adaptive flexi ... Show More
34m 32s
Aug 12
15 Architectural Decisions Series: Controller Architecture and Limited Error Paths
Our series around Pure's fundamental design principles continues. Episode 3 discusses three key architectural decisions: stateless controllers, the second controller, and limited error paths. Hear from co-hosts JD Wallace and Andrew Miller about how stateless controllers mean no ... Show More
33m 25s
Aug 5
15 Architectural Decisions Series: Flash and Data Reduction
Our series around Pure's fundamental design principles continues. Episode two of the series centers in on the choice to use exclusively Flash memory and how best to leverage it. Next, the team dives into innovations in a wide range of data reduction technologies and advantages fo ... Show More
37m 54s
Recommended Episodes
Jan 2024
Pushing The Limits Of Scalability And User Experience For Data Processing WIth Jignesh Patel
Summary Data processing technologies have dramatically improved in their sophistication and raw throughput. Unfortunately, the volumes of data that are being generated continue to double, requiring further advancements in the platform capabilities to keep up. As the sophisticatio ... Show More
50m 26s
Jan 2022
Making Agile work for data science
Data scientists and engineers don’t always play well together. Data scientists will plan out a solution, carefully build models, test them in notebooks, then throw that solution over the wall to engineering. Implementing that solution can take months.Historically, the data scienc ... Show More
20m 52s
Jul 2021
Exploring The Design And Benefits Of The Modern Data Stack
Summary We have been building platforms and workflows to store, process, and analyze data since the earliest days of computing. Over that time there have been countless architectures, patterns, and "best practices" to make that task manageable. With the growing popularity of clou ... Show More
49m 2s
Mar 2020
What exactly is "data science" these days? (Practical AI #80)
Matt Brems from General Assembly joins us to explain what “data science” actually means these days and how that has changed over time. He also gives us some insight into how people are going about data science education, how AI fits into the data science workflow, and how to diff ... Show More
48m 40s
Nov 2022
Analyze Massive Data At Interactive Speeds With The Power Of Bitmaps Using FeatureBase
Summary The most expensive part of working with massive data sets is the work of retrieving and processing the files that contain the raw information. FeatureBase (formerly Pilosa) avoids that overhead by converting the data into bitmaps. In this episode Matt Jaffee explains how ... Show More
59m 25s
Feb 2024
Tackling Real Time Streaming Data With SQL Using RisingWave
Summary Stream processing systems have long been built with a code-first design, adding SQL as a layer on top of the existing framework. RisingWave is a database engine that was created specifically for stream processing, with S3 as the storage layer. In this episode Yingjun Wu e ... Show More
56m 55s
Nov 2015
Data Science for Making the World a Better Place
There's a good chance that great data science is going on close to you, and that it's going toward making your city, state, country, and planet a better place. Not all the data science questions being tackled out there are about finding the sleekest new algorithm or billion-dolla ... Show More
9m 31s
Nov 2021
Exploring Processing Patterns For Streaming Data Integration In Your Data Lake
Summary One of the perennial challenges posed by data lakes is how to keep them up to date as new data is collected. With the improvements in streaming engines it is now possible to perform all of your data integration in near real time, but it can be challenging to understand th ... Show More
52m 53s
Aug 2021
Building the Better, More Scalable Algorithms with SigOpt’s Scott Clark
An A.I. the model is similar to a boat in that it needs constant maintenance to perform. The reality is  A.I. models need adjusted boundaries and guidelines to remain efficient.  And when you live in a world where everyone is trying to get bigger and faster and have a certain edg ... Show More
35m 36s