The Data Stack Show

By Rudderstack

Category: Technology

Subscribers: 21
Reviews: 0
Episodes: 343

Description

Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

Episode Date
187: Startup Lessons and Torch Passing with Kostas Pardalis
May 01, 2024
The PRQL: Why Is Kostas a Guest on His Own Podcast?
Apr 29, 2024
186: Data Fusion and The Future Of Specialized Databases with Andrew Lamb of InfluxData
Apr 24, 2024
The PRQL: Open Source and the Evolution of Data Systems with Andrew Lamb of InfluxData
Apr 22, 2024
Data Council Week: A Decade of Supporting the Data Community with Pete Soderling
Apr 18, 2024
Data Council Week: AI Isn’t Just Hype - How To Successfully Apply LLMs Today with Tristan Zajonc of Continual
Apr 17, 2024
Data Council Week: How To Do Self-Service Data Analytics and Business Intelligence Right with Ryan Dolley of GoodData
Apr 15, 2024
185: The Evolution of Data Processing, Data Formats, and Data Sharing with Ryan Blue of Tabular
Apr 10, 2024
The PRQL: The Two Parallel Tracks of Development In Data Processing with Ryan Blue of Tabular
Apr 08, 2024
184: Kafka Streams and Operationalizing Event Driven Applications with Apurva Mehta of Responsive
Apr 03, 2024
The PRQL: Event-Driven Applications: Where Low Latency Meets High Impact with Apurva Mehta of Responsive
Apr 01, 2024
183: Why Modern Data Quality Must Move Beyond Traditional Data Management Practices with Chad Sanderson of Gable.ai
Mar 27, 2024
The PRQL: The Data Supply Chain with Chad Sanderson of Gable.ai
Mar 25, 2024
182: Building a Dynamic Data Infrastructure at Enterprise Scale Featuring Kevin Liu of Stripe
Mar 20, 2024
The PRQL: Exploring the Intersection of Software Engineering and Data Management with Kevin Liu of Stripe
Mar 18, 2024
181: OLAP Engines and the Next Generation of Business Intelligence with Mike Driscoll of Rill Data
Mar 13, 2024
The PRQL: Making the Data Stack Serverless in the Cloud with Mike Driscoll of Rill Data
Mar 11, 2024
180: Data Observability and AI for Data Operations Featuring Kunal Agarwal of Unravel Data
Mar 06, 2024
The PRQL: What’s Driving The Evolution of Data Operations? Featuring Kunal Agarwal of Unravel Data
Mar 04, 2024
179: Time Series Data Management and Data Modeling with Tony Wang of Stanford University
Feb 28, 2024
The PRQL: How is Academic Research Shaping the Future of Data Processing Systems? Featuring Tony Wang of Stanford University
Feb 26, 2024
178: How to Build a Data Stack to Win PLG, Featuring Peter Chapman
Feb 21, 2024
The PRQL: Building a Future-Proof Data Stack from Day Zero? Featuring Peter Chapman
Feb 19, 2024
177: AI-Based Data Cleaning, Data Labelling, and Data Enrichment with LLMs Featuring Rishabh Bhargava of refuel
Feb 14, 2024
The PRQL: Exploring the Evolution of AI and ML with Rishabh Bhargava of refuel
Feb 12, 2024
176: The Fundamentals of Event-Driven Orchestration and How Generative AI Is Shaping Its Future with Viren Baraiya of orkes.io
Feb 07, 2024
The PRQL: The Evolution of Application Orchestration Featuring Viren Baraiya of orkes.io
Feb 05, 2024
175: The Parts, Pieces, and Future of Composable Data Systems, Featuring Wes McKinney, Pedro Pedreira, Chris Riccomini, and Ryan Blue
Jan 31, 2024
The PRQL: Exploring the Evolution, Challenges, and Benefits of Composable Data Stacks Featuring Wes McKinney, Pedro Pedreira, Chris Riccomini, and Ryan Blue
Jan 29, 2024
174: Does Your Data Stack Need a Semantic Layer? Featuring Artyom Keydunov of Cube Dev
Jan 24, 2024
The PRQL: Why is a Semantic Layer Important in the Modern Data Stack? Featuring Artyom Keydunov of Cube Dev
Jan 22, 2024
173: Data Analytics Is a Team Sport, Featuring Jay Henderson of Alteryx
Jan 17, 2024
The PRQL: Bridging the Gap Between Messy Data and Sophisticated Analytics with Jay Henderson of Alteryx
Jan 15, 2024
172: How WebAssembly is Enabling the Third Wave of Cloud Compute with Matt Butcher of Fermyon Technologies
Jan 10, 2024
The PRQL: WebAssembly: The Future of Cloud Workloads Made Simple with Matt Butcher of Fermyon Technologies
Jan 08, 2024
171: Machine Learning Pipelines Are Still Data Pipelines with Sandy Ryza of Dagster
Jan 03, 2024
The PRQL: Does Machine Learning Need Its Own Orchestrator? Featuring Sandy Ryza of Dagster
Jan 02, 2024
170: Discussing Data Roles and Solving Data Problems with Katie Bauer of GlossGenius
Dec 27, 2023
The PRQL: What is a Data Scientist? Featuring Katie Bauer of GlossGenius
Dec 26, 2023
169: Data Models: From Warehouse to Business Impact with Tasso Argyros of ActionIQ
Dec 20, 2023
The PRQL: From Databases to Customer Data Platforms with Tasso Argyros of ActionIQ
Dec 18, 2023
168: Decoding Data Mesh: Principles, Practices, and Real-World Applications Featuring Paolo Platter, Zhamak Dehghani, and Melissa Logan
Dec 13, 2023
The PRQL: A Data Mesh Deep Dive with Paolo Platter, Zhamak Dehghani, and Melissa Logan
Dec 11, 2023
167: Data-Driven Investing and Company Building with Ben Miller of Fundrise
Dec 06, 2023
The PRQL: Fundrise's Data-Driven Approach to Investment in Real Estate and Tech with Ben Miller
Dec 04, 2023
166: Data Processing Fundamentals and Building a Unified Execution Engine Featuring Pedro Pedreira of Meta
Nov 29, 2023
The PRQL: How Does Composability in Data Infrastructure Differ at Different Levels of Abstraction? Featuring Pedro Pedreira of Meta
Nov 27, 2023
165: SQL Queries, Data Modeling, and Data Visualization with Colin Zima of Omni
Nov 22, 2023
The PRQL: Building a Data Product for Data People: Looker's Vision and Omni's Future with Colin Zima
Nov 20, 2023
164: How The GTM and Data Teams at Snowflake Work Together with Travis Henry and Hillary Carpio
Nov 15, 2023
The PRQL: Navigating the World of Data Overload with Travis Henry and Hillary Carpio of Snowflake
Nov 13, 2023
163: Simplifying Real-Time Streaming with David Yaffe and Johnny Graettinger of Estuary
Nov 08, 2023
The PRQL: The Shortcomings of Apache Kafka with David Yaffe and Johnny Graettinger of Estuary
Nov 06, 2023
162: Accelerating Enterprise AI Transformation With Open Source LLMs Featuring Mark Huang of Gradient
Nov 01, 2023
The PRQL: How LLMs are Transforming Enterprise Workflows with Mark Huang of Gradient
Oct 30, 2023
161: The Intersection of Generative AI and Data Infrastructure with Chang She of LanceDB
Oct 25, 2023
The PRQL: How Did Pandas Become a Data Science Powerhouse? Featuring Chang She of Eto Labs
Oct 23, 2023
160: Closing the Gap Between Dev Teams and Data Teams with Santona Tuli of Upsolver
Oct 18, 2023
The PRQL: The Intersection of Physics, Data Science, and Product Development with Santona Tuli of Upsolver
Oct 16, 2023
159: What Is a Vector Database? Featuring Bob van Luijt of Weaviate
Oct 11, 2023
The PRQL: Enhancing Search and Recommendation Systems with Vector Databases with Bob van Luijt of Weaviate
Oct 09, 2023
158: The Orchestration Layer as the Data Platform Control Plane With Nick Schrock of Dagster Labs
Oct 04, 2023
The PRQL: The Power of Data Orchestration: A Game-Changer for Data Infrastructure, Featuring Nick Schrock of Dagster Labs
Oct 02, 2023
157: From Search Engine to Answer Engine Using Grounded Generative AI, Featuring Amr Awadallah of Vectara
Sep 27, 2023
The PRQL: How Can Large Language Models Revolutionize Decision-Making? Featuring Amr Awadallah of Vectara
Sep 25, 2023
156: Simple, Performant, Cost-effective Data Streaming with Alex Gallego of Redpanda Data
Sep 20, 2023
The PRQL: Redpanda: Revolutionizing Streaming Systems and Challenging the Kafka Status Quo with Alex Gallego
Sep 18, 2023
155: Bringing Innovation to Enterprise Resource Planning with Emilie Schario of Turbine
Sep 13, 2023
The PRQL: Making ERP Systems More User-Friendly with Emilie Schario of Turbine
Sep 11, 2023
154: Making Cross-Company Data Exchange Easy with Pardis Noorzad of General Folders
Sep 06, 2023
The PRQL: Simplifying Data Collaboration with Pardis Noorzad of General Folders
Sep 04, 2023
153: The Future of Data Science Notebooks with Jakub Jurových of Deepnote
Aug 30, 2023
The PRQL: Exploring the Evolution of Notebooks with Jakub Jurových of Deepnote
Aug 28, 2023
152: Three Steps To Enhance Product Analytics with Ken Fine of Heap
Aug 23, 2023
The PRQL: Auto Tracking in Product Analytics with Ken Fine of Heap
Aug 21, 2023
151: How To Unlock the Data Warehouse for Marketing with Chris Sell of GrowthLoop
Aug 16, 2023
The PRQL: How Can Reverse ETL Revolutionize Marketing Data Management? Featuring Chris Sell of GrowthLoop
Aug 14, 2023
150: How Salespeople Use Data, Salesforce vs. Snowflake, and How LLMs Are Transforming Sales with Brendan Short of Groundswell
Aug 09, 2023
The PRQL: Generative AI Transforming the Sales Process Featuring Brendan Short of Groundswell
Aug 07, 2023
149: Turning Tables Into APIs for Real-time Data Apps, Featuring Matteo Pelati and Vivek Gudapuri of Dozer
Aug 02, 2023
The PRQL: Turning Data Into an API with Matteo Pelati and Vivek Gudapuri of Dozer
Jul 31, 2023
148: Exploring the Intersection of DAGs, ML Code, and Complex Code Bases: An Elegant Solution Unveiled with Stefan Krawczyk of DAGWorks
Jul 26, 2023
The PRQL: A Methodology for Better DAGs with Stefan Krawczyk of DAGWorks
Jul 24, 2023
Shop Talk: Snowflake Summit Recap
Jul 21, 2023
147: Where Data and Infrastructure Converge Featuring Lars Kamp of Resoto
Jul 19, 2023
The PRQL: Cloud Resource Management Is a Data Problem Featuring Lars Kamp of Resoto
Jul 17, 2023
146: What Is a Customer Data Platform? Featuring Soumyadeb Mitra of Rudderstack
Jul 12, 2023
The PRQL: Building Data Products for Multiple Personas with Soumyadeb Mitra of Rudderstack
Jul 10, 2023
145: What is Synthetic Data? Featuring Omar Maher of Parallel Domain
Jul 05, 2023
The PRQL: Synthetic Data and Self Driving Cars with Omar Maher of Parallel Domain
Jul 03, 2023
144: Explaining Features, Embeddings, and the Difference Between ML and AI with Simba Khadder of Featureform
Jun 28, 2023
The PRQL: Feature Stores and ML Ops with Simba Khadder of Featureform
Jun 26, 2023
Shop Talk: Accountability and Opportunity for AI
Jun 23, 2023
143: Collaborative Data Analytics on the Data Warehouse, featuring Rob Woollen & Stipo Josipovic of Sigma
Jun 21, 2023
The PRQL: Modern Analytics Using Common Paradigms, Featuring Rob Woollen & Stipo Josipovic of Sigma
Jun 19, 2023
Shop Talk: Why AI Is Not Another Crypto
Jun 16, 2023
142: Martech’s Separation and Return to Data Infrastructure with Scott Brinker of HubSpot
Jun 14, 2023
The PRQL: Marketing, Martech, and Data with Scott Brinker of HubSpot
Jun 12, 2023
141: A Journey From Backend Engineer to Data Engineer with Ioannis Foukarakis of Mattermost
Jun 07, 2023
The PRQL: The Portability of Engineering Fundamentals with Ioannis Foukarakis of Mattermost
Jun 05, 2023
140: Stream Processing for Machine Learning with Davor Bonaci of DataStax
May 31, 2023
The PRQL: Kaskada Serving as a Recommendation Engine with Davor Bonaci of DataStax
May 29, 2023
139: Decoupling the Execution Engine From Python’s Pandas with Aditya Parameswaran of Ponder
May 24, 2023
The PRQL: Removing the Execution Engine Language Barrier with Aditya Parameswaran of Ponder
May 22, 2023
138: Paradigm Shift: Batch to Data Streaming with A.J. Hunyady of InfinyOn
May 17, 2023
The PRQL: Data Infrastructure Systems and the Rust / WebAssembly Combo with A.J. Hunyady of InfinyOn
May 15, 2023
137: Data Collection Secrets & The Search Data Problem with Josh Wills
May 10, 2023
The PRQL: Data Engineers in the Front End with Josh Wills
May 08, 2023
136: System Evolution from Hadoop to RocksDB with Dhruba Borthakur of Rockset
May 03, 2023
The PRQL: Hardware Innovation Begets Software Innovation with Dhruba Borthakur Co-Founder and CTO, Rockset
May 01, 2023
Data Council Week (Ep 7) - What’s Next for Data Council? With Pete Soderling of Data Council
Apr 28, 2023
Data Council Week (Ep 6) - All About Debezium and Change Data Capture With Gunnar Morling of Decodable
Apr 27, 2023
Data Council Week (Ep 5) - The Difference Between Data Platforms and ML Platforms with Michael Del Balso of Tecton
Apr 26, 2023
Data Council Week (Ep 4) - Using Data Anonymization for Identity Protection With Will Thompson of Privacy Dynamics
Apr 26, 2023
Data Council Week (Ep 3) - GTM 101 for Engineers With Chase Roberts of Vertex Ventures
Apr 25, 2023
Data Council Week (Ep 2) - The Convergence of MLops and DataOps With Team Featureform
Apr 24, 2023
Data Council Week (Ep 1) - The Evolution of Stream Processing With Eric Sammer of Decodable
Apr 23, 2023
135: Database Knob Tuning and AI with Andy Pavlo and Dana Van Aken of OtterTune
Apr 19, 2023
The PRQL: Database Tuning and Optimization with Andy Pavlo and Dana Van Aken of OtterTune
Apr 17, 2023
134: Unpacking the AI Revolution and the Technology Behind A Feature-First Future with H.O. Maycotte of FeatureBase
Apr 12, 2023
The PRQL: AI and the Super Evolution with H.O. Maycotte, CEO at FeatureBase
Apr 10, 2023
133: Building the Data Warehouse for Everything Else with Sammy Sidhu of Eventual
Apr 05, 2023
The PRQL: Self-Driving Technology and Data Infrastructure with Sammy Sidhu, Co-Founder and CEO of Eventual
Apr 03, 2023
132: Data Quality and Data Contracts with Chad Sanderson of Data Quality Camp
Mar 29, 2023
The PRQL: The Value of Data Contracts with Chad Sanderson, Head of Data, Data Contracts Advocate, Data Quality Camp
Mar 27, 2023
131: How Data Teams Interact With Marketing Tools with Jason Davis of Simon Data
Mar 22, 2023
The PRQL: Unleashing the Potential of CDPs with Jason Davis, Co-Founder and CEO of Simon Data
Mar 20, 2023
130: From Business Intelligence to Product Analytics and Beyond with Vijay Ganesan of NetSpring.io
Mar 15, 2023
The PRQL: Business Intelligence and Product Analytics With Vijay Ganesan, Co-Founder and CEO at NetSpring.io
Mar 13, 2023
129: Databases, Data Warehouses, and Timeseries Data with David Kohn of Timescale
Mar 08, 2023
The PRQL: Time-Series Data 101
Mar 06, 2023
128: The Possibilities Are Endless for Synthetic Data with Alex Watson of Gretel.ai
Mar 01, 2023
The PRQL: Boundaries Between Synthetic Data and Prediction Models
Feb 27, 2023
127: The Anatomy of a Data Lakehouse with Alex Merced of Dremio
Feb 22, 2023
The PRQL: What Does It Mean to be a Developer Advocate?
Feb 20, 2023
126: Crossing the Product Analytics Chasm with Spenser Skates of Amplitude Analytics
Feb 15, 2023
The PRQL: Amplitude - From Startup to IPO
Feb 13, 2023
125: Authorization Is A Data Problem with Jeff Chao of Abbey Labs
Feb 08, 2023
The PRQL: Solving Identity in Marketing vs. Security
Feb 06, 2023
124: Pragmatism About Data Stacks with Pedram Navid of West Marin Data
Feb 01, 2023
The PRQL: What Does the Modern Data Stack Mean to Normal Companies?
Jan 30, 2023
123: What Is a Universal Database? Featuring Stavros Papadopoulos of TileDB, Inc.
Jan 25, 2023
The PRQL: Bringing Innovation to the Lowest Level of Databases
Jan 23, 2023
122: Why Accounting Needs Its Own Database with Joran Greef of Tiger Beetle
Jan 18, 2023
The PRQL: Do We Need A Dedicated Database for Every Use Case?
Jan 16, 2023
121: Materialize Origins: Breaking Down Data Flow Layers with Arjun Narayan and Frank McSherry
Jan 11, 2023
120: Materialize Origins: A Timely Dataflow Story with Arjun Narayan and Frank McSherry
Jan 04, 2023
The PRQL: What’s our first ever two-part episode about?
Jan 02, 2023
119: The Data Stack Show Wrapped: 2022
Dec 28, 2022
118: Bringing Powerful Business Intelligence to Mobile with Zack Hendlin of Zing Data
Dec 21, 2022
The PRQL: Do You Need Business Intelligence on Your Phone?
Dec 19, 2022
117: DX for Data Tooling with Taylor Murphy of Meltano
Dec 14, 2022
The PRQL: Is Business Logic Moving Further Down the Stack?
Dec 12, 2022
116: Data Democratization & Self Service with Aron Clymer of Data Clymer
Dec 07, 2022
The PRQL: Data Analytics: Same As It Ever Was
Dec 05, 2022
115: What Is Production Grade Data? Featuring Ashwin Kamath of Spectre
Nov 30, 2022
The PRQL: Managing Complexities of Financial Data
Nov 28, 2022
114: Solving Data Infrastructure Problems at Startups and Enterprises with Max Werner of Obsessive Analytics Consulting
Nov 23, 2022
The PRQL: The Data Switchboard
Nov 21, 2022
Shop Talk: Kostas Settles the Real-Time vs. Streaming Debate
Nov 18, 2022
113: What Is Streaming Graph? Featuring Ryan Wright of thatDot
Nov 16, 2022
The PRQL: Graph as a Utility
Nov 14, 2022
Shop Talk: What Coalesced at Coalesce?
Nov 11, 2022
112: Python Native Stream Processing with Zander Matheson of bytewax
Nov 09, 2022
The PRQL: Who Needs a Stream Processing Engine?
Nov 07, 2022
Shop Talk: The Business of Data Infrastructure is Uniquely Challenging
Nov 04, 2022
111: What if Your Code Just Ran in the Cloud for You? Featuring Erik Bernhardsson of Modal Labs
Nov 02, 2022
The PRQL: Erik Bernhardsson Just Can’t Stop Solving Problems
Oct 28, 2022
110: How Can Data Discovery Help You Understand Your Data? Featuring Shinji Kim of Select Star
Oct 26, 2022
Shop Talk: Why Are There So Many Flavors of Databases?
Oct 24, 2022
The PRQL: What is Data Discovery?
Oct 21, 2022
109: How Does Headless Business Intelligence Work? Featuring Artyom Keydunov and Pavel Tiunov of Cube Dev
Oct 19, 2022
Shop Talk: Will the Future of the Customer Data Platform Include a Shared Logic Layer?
Oct 17, 2022
The PRQL: What Comes to Mind When You Think of ‘Headless’?
Oct 14, 2022
108: You Can’t Separate Data Reliability From Workflow with Gleb Mezhanskiy of Datafold
Oct 12, 2022
Shop Talk: Is It Possible for Excel To Die?
Oct 10, 2022
The PRQL: Are Marketers the Worst Data Quality Offenders?
Oct 07, 2022
107: Building Modern Data Teams with dbt Labs, REI, and Robinhood
Oct 05, 2022
Shop Talk With Eric and Kostas: Transitioning From Consumer to Builder
Oct 03, 2022
The PRQL: What Can We Learn From the Patterns of Successful Data Teams?
Sep 30, 2022
106: Optimizing Query Workloads (and Your Snowflake Bill) with Vinoo Ganesh of Bluesky Data
Sep 28, 2022
Shop Talk With Eric and Kostas: Data Politicians
Sep 26, 2022
The PRQL: Comparing Snowflake to a Car
Sep 23, 2022
105: The Modern Data Stack Is Just Getting Started with Astasia Myers of Quiet Capital
Sep 21, 2022
The PRQL: Kostas Becomes a Prophet
Sep 16, 2022
104: A Decade of Change in the Data Space with Benn Stancil of Mode
Sep 14, 2022
The PRQL: What Does 10 Years in the Data Space Give You?
Sep 09, 2022
103: Everyone Is Invited to the Data Lakehouse with Kyle Weller of Onehouse.ai
Sep 07, 2022
The PRQL: Who Really Needs To Know How a DBMS Works?
Sep 02, 2022
102: Building Pinot for Real-Time, Interactive User Analytics with Kishore Gopalakrishna of StarTree
Aug 31, 2022
The PRQL: Data Warehouses on Steroids
Aug 26, 2022
101: The Future of Machine Learning with Willem Pienaar of Tecton and Tristan Zajonc of Continual
Aug 24, 2022
The PRQL: Can Machine Learning Be Commoditized?
Aug 19, 2022
100: Data Quality Is Relative to Purpose with James Campbell of Superconductive
Aug 17, 2022
The PRQL: What’s the Hardest Part About Data Quality?
Aug 12, 2022
99: State of the Data Lakehouse with Vinoth Chandar of Apache Hudi
Aug 10, 2022
The PRQL: Does Lakehouse Architecture Really Mean the End of the Data Warehouse and Data Lake As We Know It?
Aug 05, 2022
98: Category Theory and the Mathematical Foundation of the Technologies We Use with Eric Daimler of Conexus
Aug 03, 2022
The PRQL: Farm to Table Abstract Mathematics
Jul 29, 2022
97: How To Build an Organization-Empowering Data Team with Emilie Schario of Amplify Partners
Jul 27, 2022
The PRQL: If You Were Building a Data Team What Would Your First Hire Be?
Jul 22, 2022
96: How To Collect and Leverage Data From the Physical World with Prateek Joshi of Plutoshift
Jul 20, 2022
The PRQL: Collecting Data in the Physical World
Jul 15, 2022
95: How the Metrics Layer Bridges the Gap Between Data & Business with Nick Handel of Transform
Jul 13, 2022
The PRQL: Data Marts Aren’t Just for the Enterprise
Jul 08, 2022
94: Notebooks Aren’t Just for Data Scientists With Barry McCardel of Hex Technologies
Jul 06, 2022
The PRQL: Have You Ever Been a Part of a Company That Has Done Analytics Really Well?
Jul 01, 2022
93: There Is No Data Observability Without Lineage with Kevin Hu of Metaplane
Jun 29, 2022
The PRQL: What Are the Similarities Between VCs and Tilapia?
Jun 24, 2022
92: Building a Decentralized Storage System for Media File Collaboration with Tejas Chopra
Jun 22, 2022
The PRQL: What is Netflix Cloud?
Jun 17, 2022
91: The Future of Streaming Data with Stripe, Deephaven, Materialize, and Benthos
Jun 15, 2022
The PRQL: Can Streaming Simplify Your Data Flows?
Jun 10, 2022
90: The Modern Data Stack Has a Join Problem with Ahmed Elsamadisi of Narrator AI
Jun 08, 2022
The PRQL: Can One Table Rule Them All?
Jun 03, 2022
89: Solving Microservice Orchestration Issues at Netflix with Viren Baraiya of Orkes
Jun 01, 2022
The PRQL: What are the Different Flavors of Orchestration?
May 27, 2022
88: What Is Data Observability? With Tristan Spaulding of Acceldata
May 25, 2022
The PRQL: Does Data Exist if We Do Not Observe It?
May 20, 2022
87: Why Is Now the Golden Age of Data Analytics? With Cindi Howson of ThoughtSpot
May 18, 2022
The PRQL: Can You Trust AI Enabled Analytics?
May 13, 2022
86: Solving the Data Quality Problem with Bigeye, Great Expectations, Metaplane, and Lightup.ai
May 11, 2022
85: You Can Stop Doing Data Fire Drills with Barr Moses of Monte Carlo
May 04, 2022
The PRQL: Be Careful, Young Padawan, When Comparing Software Observability and Data Observability
May 02, 2022
Data Council Week (Ep 5): A Primer on Spatial Data With Gabriel Hidalgo of Carto
Apr 29, 2022
Data Council Week (Ep 4): The Data Council Origin Story With Pete Soderling
Apr 28, 2022
Data Council Week (Ep 3): Product Analytics the Right Way With James Greenhill of PostHog
Apr 27, 2022
Data Council Week (Ep 2): Testing and Observability Are Two Sides of the Same Coin With Ben Castleton of Great Expectations
Apr 26, 2022
Data Council Week (Ep 1): Discussing Firebolt’s Engine With Benjamin Hopp
Apr 25, 2022
The PRQL: A Data Council Austin Quintuple
Apr 22, 2022
The Data Stack Show Live: Solving the Data Quality Problem
Apr 21, 2022
84: Why Are Analytics Still So Hard? With Kaycee Lai of Promethium
Apr 20, 2022
The PRQL: Does Putting All Your Data in One Place Create More Problems Than it Solves?
Apr 15, 2022
83: Closing the Gap Between Business Analytics and Operational Analytics With Max Beauchemin of Preset
Apr 13, 2022
The PRQL: BI, Real-Time, and Data Tooling
Apr 08, 2022
82: Databases: The Fun Never Stops with Robert Hodges of Altinity
Apr 06, 2022
The PRQL: What Inspires Continued Innovation in Databases?
Apr 01, 2022
81: Digging into Data Ops with Prukalpa Sankar of Atlan
Mar 30, 2022
The PRQL: Data Team Diversity & Maturing Data Ops
Mar 25, 2022
80: Is Reverse-ETL Just Another Data Pipeline? With Census, Hightouch, & Workato
Mar 23, 2022
The PRQL: Is Reverse ETL New or Old?
Mar 18, 2022
79: All About Experimentation with Che Sharma of Eppo
Mar 16, 2022
The PRQL: Is A/B Testing Only Relevant for B2C?
Mar 11, 2022
78: The Etymology of Reverse ETL & Why It’s a Key Piece Of The Modern Data Stack with Boris Jabes of Census
Mar 09, 2022
The PRQL: Reverse ETL and the Distinction Between Operation vs Analysis on Data
Mar 04, 2022
77: Standardizing Unstructured Data with Verl Allen of Claravine
Mar 02, 2022
The PRQL: If Everything Is Data, How Can We Make Sense of It All?
Feb 25, 2022
76: Why a Data Team Should Limit Its Own Superpowers with Sean Halliburton of CNN
Feb 23, 2022
The PRQL: How Important Is the Human Factor When Working With Data?
Feb 18, 2022
75: How To Become a Data Engineer with Parham Parvizi of the Data Stack Academy
Feb 16, 2022
The PRQL: Can We Define the Role of the Data Engineer (Yet)?
Feb 11, 2022
74: Kostas Respawns at Starburst, is Interviewed by Eric, and Reminisces About Winamp
Feb 09, 2022
The PRQL: What Prompts a Conversation About Winamp & Quake Arena on The Data Stack Show?
Feb 04, 2022
73: What a High Performing Data Team (and Stack) Looks Like with Paige Berry of Netlify
Feb 02, 2022
The PRQL: How High Performing Data Teams Put Tooling in the Background
Jan 28, 2022
72: Building Data Ops Into the Data Lifecycle with Douwe Maan of Meltano
Jan 26, 2022
The PRQL: Is It Viable to Manage Integrations Open Source?
Jan 21, 2022
71: ETL at the Edges with Jimmy Chan of Dropbase
Jan 19, 2022
The PRQL: Is Kostas an Excel Power User Yes/No?
Jan 14, 2022
70: The Difference Between Data Lakes and Data Warehouses with Vinoth Chandar of Apache Hudi
Jan 12, 2022
The PRQL: What Old Tech Concepts Were Borrowed to Build the Data Lake House?
Jan 07, 2022
69: What is the Modern Data Stack?
Jan 05, 2022
The PRQL: Should Data Trust Drive the Evolution of Your Data Stack?
Dec 31, 2021
68: Season Three Recap: Holiday Edition with Eric Dodds and Kostas Pardalis
Dec 29, 2021
67: Now is the Time to Think About Data Quality with Manu Bansal of Lightup Data
Dec 22, 2021
The PRQL: Will Data Quality Always Require a Human in the Loop?
Dec 21, 2021
66: How Data Infrastructure Has Evolved and Managing High Performing Data Teams with Srivatsan Sridharan
Dec 15, 2021
The PRQL: How Would You Define a Data Pipeline? Featuring the RudderStack Eng. Team
Dec 10, 2021
65: Operationalizing Data from the Warehouse With Aayush Jain of Cliff.ai
Dec 08, 2021
The PRQL: Why is the Data Engineer's Role Expanding?
Dec 03, 2021
64: Data Stack Composability and Commoditization with Michel Tricot of Airbyte
Dec 01, 2021
The PRQL: The Beauty of Commoditization
Nov 26, 2021
63: The ETL - ELT Flip With Ciaran Dynes of Matillion
Nov 24, 2021
The PRQL: What Part of the Data Stack Will Be Commoditized Next?
Nov 19, 2021
62: The Internet of Everything with Rob Rastovich of ThingLogix
Nov 17, 2021
The PRQL: Are you afraid of IOT?
Nov 12, 2021
61: What is Data Design? With Kevin Gervais of Touchless
Nov 10, 2021
The PRQL: Will we ever get rid of the CSV?
Nov 08, 2021
Data Debrief: The Highs and Lows of Open Source Projects
Nov 05, 2021
60: Architecting a Boring Stream Processing Tool With Ashley Jeffs of Benthos
Nov 03, 2021
Data Debrief: What Open Source Data Projects Have Come Out of Facebook, Whoops, *Meta?
Oct 29, 2021
59: Making ETL Optional with Justin Borgman of Starburst Data
Oct 27, 2021
Data Debrief: Will Enterprise Build The Future of Data Tooling?
Oct 22, 2021
58: Data Federation is No Longer The "F" Word with Scott Gnau of InterSystems
Oct 20, 2021
Data Debrief: Can Tools Help Solve Data Quality Organizational Challenges?
Oct 15, 2021
57: Improving Data Quality Using Data Product SLAs with Egor Gryaznov of Bigeye
Oct 13, 2021
56: Stream Processing and Observability with Jeff Chao of Stripe
Oct 06, 2021
55: Tables vs. Streams and Defining Real-Time with Pete Goddard of Deephaven Data Labs
Sep 29, 2021
54: The Center of the Modern Data Stack with Neil Rahilly of Mixpanel
Sep 22, 2021
53: What Religion, a Cult, and a Tech Product Have in Common, with Bart Farrell of DoKC
Sep 15, 2021
52: Discussing Data Warehouses, Lakes, and Meshes with James Serra of EY
Sep 08, 2021
51: Democratizing AI and ML with Tristan Zajonc of Continual
Sep 01, 2021
50: From Data Infrastructure to Data Management with Ananth Packkildurai
Aug 25, 2021
49: MLops - The Finalization of the Data Stack with Ben Rogojan of Facebook
Aug 18, 2021
48: Season Two Recap with Eric Dodds and Kostas Pardalis
Aug 11, 2021
47: Taming the Four Dragons of Data with Sven Balnojan of Mercateo Gruppe
Aug 04, 2021
46: A New Paradigm in Stream Processing with Arjun Narayan of Materialize
Jul 28, 2021
45: Open Source and Attribution with Ophir Prusak of Codesmith
Jul 21, 2021
44: Leveraging Data in a Post-Covid World with Ruben Ugarte of Practico Analytics
Jul 14, 2021
43: Modern Authentication and User Management with Sokratis Vidros of Clerk.dev
Jul 07, 2021
42: Scaling Data Science with Ryan Boyer of Shipt
Jun 30, 2021
41: Doing MLOps on Top of Apache Pulsar and Trino with Joshua Odmark of Pandio
Jun 23, 2021
40: Graph Processing on Snowflake for Customer Behavioral Analytics
Jun 16, 2021
39: Diving deeper into CDC with Ali Hamidi and Taron Foxworth of Meroxa
Jun 11, 2021
38: Graph Databases & Data Governance with David Allen of Neo4j
Jun 02, 2021
37: The Components of Data Governance with Dave Melillo of FanDuel
May 26, 2021
36: Crypto and Compliance with Nick Fogle, Co-Founder of Churnkey and Wavve
May 19, 2021
35: The Future of Development is Distributed with Jim Walker of Cockroach Labs
May 12, 2021
34: The Intersection of Data Engineering and Marketing with John Marbach of Grafana Labs
Apr 28, 2021
33: ML is a Data Quality Problem with Peter Gao from Aquarium Learning
Apr 14, 2021
32: Cooking with Data Ops with Chris Bergh from DataKitchen
Apr 07, 2021
31: How a 160 Year-Old Publisher is Using Data with Jenna Lemonias From the Atlantic
Mar 31, 2021
30: The DataStack Journey with Rachel Bradley-Haas and Alex Dovenmuehle of Big Time Data
Mar 24, 2021
29: The Present and Future of Data Engineering with Joe Reis and Matthew Housley from Ternary Data
Mar 17, 2021
28: Next Gen Data Governance with Stefania from Avo
Mar 10, 2021
27: Building B2B Marketplaces with Mike Luby from LeafLink
Mar 03, 2021
26: Democratizing the Insurance Market with Daniel Gremmell from Policygenius Inc.
Feb 24, 2021
25: MLOps and Feature Stores with Willem Pienaar from Tecton
Feb 17, 2021
24: Demystifying AI with Duc Haba
Feb 10, 2021
23: Migrating from On-Premises to the Cloud with Alex Lancaster from Intuit
Feb 03, 2021
22: Season One Recap with Eric Dodds and Kostas Pardalis
Jan 29, 2021
21: Data Integrity and Governance with Patrick Thompson and Ondrej Hrebicek from Iteratively
Jan 20, 2021
20: Transforming the Real Estate Market with Predictive Analytics with Arian Osman from Homesnap
Jan 13, 2021
19: Defining Data Governance with Stephen Bailey from Immuta
Jan 06, 2021
18: Data Science in Health Insurance with Jason Haupt of Bind
Dec 31, 2020
17: Working with Data at Netflix with Ioannis Papapanagiotou
Dec 09, 2020
16: Applying the Event Sourcing Pattern at Scale with Andrew Elster from Earnnest
Dec 03, 2020
15: Early Stage Analytics and Learning from the Y Combinator Experience with Axel Delafosse from Pool
Nov 19, 2020
14: Breaking Down Electronic Money Transfers and Modernizing Real Estate Transactions with Dan Jeffords of Earnnest
Nov 11, 2020
13: Building Open Source Products at Scale with Reza Shafii from Kong Inc.
Nov 06, 2020
12: Building a CDP on your Data Warehouse with Nicholas Ziech-Lopez of MessageGears
Oct 28, 2020
11: Why Modern Cyber Security is a Data Problem with Jack Naglieri of Panther Labs
Oct 21, 2020
10: The Evolution of the BI Market with Huy Nguyen of Holistics
Oct 14, 2020
09: Building the Operating System for Work with Ivan Kanevski of Slapdash
Oct 07, 2020
08: When data alone is not enough - Reinventing book shopping at Bookshop.org with Mason Stewart
Sep 30, 2020
07: Discussing Data Engineering Best Practices with IFTTT’s Peter Darche
Sep 23, 2020
06: The Technical Challenges and Opportunities of Building a Startup Inside a Large Bank with Sam Bledsoe of Ruby
Sep 16, 2020
05: The Convergence of Data Engineering and Marketing with Nic Discepoli of Ruby
Sep 09, 2020
04: Relational to Real-Time with Change Data Capture with DeVaris Brown of Meroxa
Sep 02, 2020
03: Turning All Data at Grofers into Live Event Streams
Aug 27, 2020
02: The Importance of Data During a Global Pandemic with Utkarsh Gupta of 1mg
Aug 19, 2020
01: Discussing Mattermost Data Infrastructure with Alex Dovenmuehle
Aug 12, 2020
00: Welcome to the Data Stack Show
Jul 31, 2020