Sep 2023
48m 50s

Unplugged Volume 14: Metadata - the Secr...

Pure Storage
About this episode
The intrepid Unplugged crew returns for an exciting Volume 14 where Andrew Miller, JD Wallace and I take on another of Pure's architectural decisions. This time we dig deep into the use of metadata in the software stack. The extensive use of metadata enables FlashArray to deliver efficiency across data reduction, compacting, and snapshot capabilities. We ...
Up next
Yesterday
Accelerating Enterprise AI Inference with Pure KVA
In this episode, we sit down with Solution Architect Robert Alvarez to discuss the technology behind Pure Key-Value Accelerator (KVA) and its role in accelerating AI inference. Pure KVA is a protocol-agnostic, key-value caching solution that, when combined with FlashBlade data st ...
29m 38s
Nov 18
Tackling Myths Around AI Data and FlashBlade//EXA
In this episode, we welcome Lead Principal Technologist Hari Kannan to cut through the noise and tackle some of the biggest myths surrounding AI data management and the revolutionary FlashBlade//EXA platform. With GPU shipments now outstripping CPUs, the foundation of modern AI i ...
39m 21s
Nov 11
FlashStack: A Decade of Converged Infrastructure Innovation
This episode of the Pure Report features a conversation with Eugene McGrath, a nine-year veteran of Pure Storage and a Field Solution Architect. Our discussion delves into Gene’s diverse background, starting from his early days in IT racking and stacking servers at companies like ...
44m 35s
Recommended Episodes
Aug 2022
Collecting And Retaining Contextual Metadata For Powerful And Effective Data Discovery
Summary: Data is useless if it isn’t being used, and you can’t use it if you don’t know where it is. Data catalogs were the first solution to this problem, but they are only helpful if you know what you are look ...
53m 24s
Sep 2021
From notebooks to Netflix scale with Metaflow (Practical AI #150)
As you start developing an AI/ML based solution, you quickly figure out that you need to run workflows. Not only that, you might need to run those workflows across various kinds of infrastructure (including GPUs) at scale. Ville Tuulos developed Metaflow while working at Netflix ...
47m 34s
Nov 2021
Exploring Processing Patterns For Streaming Data Integration In Your Data Lake
Summary: One of the perennial challenges posed by data lakes is how to keep them up to date as new data is collected. With the improvements in streaming engines it is now possible to perform all of your data integration in near r ...
52m 53s
Jul 2021
Exploring The Design And Benefits Of The Modern Data Stack
Summary: We have been building platforms and workflows to store, process, and analyze data since the earliest days of computing. Over that time there have been countless architectures, patterns, and "best practices" to ...
49m 2s
May 2022
Unlocking The Value Of Data Across The Organization Through User Friendly Data Tools With Prophecy
Summary: The interfaces and design cues that a tool offers can have a massive impact on who is able to use it and the tasks that they are able to perform. With an eye to making data workflows more accessible to everyone in an org ...
1h 10m
May 2023
What Happens When The Abstractions Leak On Your Data
Summary: All of the advancements in our technology are based around the principles of abstraction. These are valuable until they break down, which is an inevitable occurrence. In this episode the host Tobias Macey shares his reflections on recent experiences where th ...
26m 42s
Sep 2021
Declarative Machine Learning Without The Operational Overhead Using Continual
Summary: Building, scaling, and maintaining the operational components of a machine learning workflow are all hard problems. Add the work of creating the model itself, and it’s not surprising that a majority of companies th ...
1h 11m
Nov 2022
Supporting And Expanding The Arrow Ecosystem For Fast And Efficient Data Processing At Voltron Data
Summary: The data ecosystem has been growing rapidly, with new communities joining and bringing their preferred programming languages to the mix. This has led to inefficiencies in how data is stored, accessed, and shared across p ...
50m 25s
Aug 2021
Prepare Your Unstructured Data For Machine Learning And Computer Vision Without The Toil Using Activeloop
Summary: The vast majority of data tools and platforms that you hear about are designed for working with structured, text-based data. What do you do when you need to manage unstructured information, or build a computer vision mod ...
48m 39s
Jan 2024
Pushing The Limits Of Scalability And User Experience For Data Processing With Jignesh Patel
Summary: Data processing technologies have dramatically improved in their sophistication and raw throughput. Unfortunately, the volumes of data that are being generated continue to double, requiring further advancements in the platform capabilities to keep up. As th ...
50m 26s