TitleDate
Data Labeling That You Can Feel Good About - Episode 89Jul 15, 2019 Listen
Scale Your Analytics On The Clickhouse Data Warehouse - Episode 88Jul 08, 2019 Listen
Stress Testing Kafka And Cassandra For Real-Time Anomaly Detection - Episode 87Jul 02, 2019 Listen
The Workflow Engine For Data Engineers And Data Scientists - Episode 86Jun 25, 2019 Listen
Maintaining Your Data Lake At Scale With Spark - Episode 85Jun 17, 2019 Listen
Managing The Machine Learning Lifecycle - Episode 84Jun 10, 2019 Listen
Evolving An ETL Pipeline For Better Productivity - Episode 83Jun 04, 2019 Listen
Data Lineage For Your Pipelines - Episode 82May 27, 2019 Listen
Build Your Data Analytics Like An Engineer - Episode 81May 20, 2019 Listen
Using FoundationDB As The Bedrock For Your Distributed Systems - Episode 80May 07, 2019 Listen
Running Your Database On Kubernetes With KubeDB - Episode 79Apr 29, 2019 Listen
Unpacking Fauna: A Global Scale Cloud Native Database - Episode 78Apr 22, 2019 Listen
Index Your Big Data With Pilosa For Faster Analytics - Episode 77Apr 15, 2019 Listen
Serverless Data Pipelines On DataCoral - Episode 76Apr 08, 2019 Listen
Why Analytics Projects Fail And What To Do About It - Episode 75Apr 01, 2019 Listen
Building An Enterprise Data Fabric At CluedIn - Episode 74Mar 25, 2019 Listen
A DataOps vs DevOps Cookoff In The Data Kitchen - Episode 73Mar 18, 2019 Listen
Customer Analytics At Scale With Segment - Episode 72Mar 04, 2019 Listen
Deep Learning For Data Engineers - Episode 71Feb 25, 2019 Listen
The Alluxio Distributed Storage System - Episode 70Feb 19, 2019 Listen
Building Machine Learning Projects In The Enterprise - Episode 69Feb 11, 2019 Listen
Cleaning And Curating Open Data For Archaeology - Episode 68Feb 04, 2019 Listen
Managing Database Access Control For Teams With strongDM - Episode 67Jan 29, 2019 Listen
Building Enterprise Big Data Systems At LEGO - Episode 66Jan 21, 2019 Listen
TimescaleDB: The Timeseries Database Built For SQL And Scale - Episode 65Jan 14, 2019 Listen
Performing Fast Data Analytics Using Apache Kudu - Episode 64Jan 07, 2019 Listen
Simplifying Continuous Data Processing Using Stream Native Storage In Pravega with Tom Kaitchuck - Episode 63Dec 31, 2018 Listen
Continuously Query Your Time-Series Data Using PipelineDB with Derek Nelson and Usman Masood - Episode 62Dec 24, 2018 Listen
Advice On Scaling Your Data Pipeline Alongside Your Business with Christian Heinzmann - Episode 61Dec 17, 2018 Listen
Putting Apache Spark Into Action with Jean Georges Perrin - Episode 60Dec 10, 2018 Listen
Apache Zookeeper As A Building Block For Distributed Systems with Patrick Hunt - Episode 59Dec 03, 2018 Listen
Set Up Your Own Data-as-a-Service Platform On Dremio with Tomer Shiran - Episode 58Nov 26, 2018 Listen
Stateful, Distributed Stream Processing on Flink with Fabian Hueske - Episode 57Nov 19, 2018 Listen
How Upsolver Is Building A Data Lake Platform In The Cloud with Yoni Iny - Episode 56Nov 11, 2018 Listen
Self Service Business Intelligence And Data Sharing Using Looker with Daniel Mintz - Episode 55Nov 05, 2018 Listen
Using Notebooks As The Unifying Layer For Data Roles At Netflix with Matthew Seal - Episode 54Oct 29, 2018 Listen
Of Checklists, Ethics, and Data with Emily Miller and Peter Bull (Cross Post from Podcast.__init__) - Episode 53Oct 22, 2018 Listen
Improving The Performance Of Cloud-Native Big Data At Netflix Using The Iceberg Table Format with Ryan Blue - Episode 52Oct 15, 2018 Listen
Combining Transactional And Analytical Workloads On MemSQL with Nikita Shamgunov - Episode 51Oct 09, 2018 Listen
Building A Knowledge Graph From Public Data At Enigma With Chris Groskopf - Episode 50Oct 01, 2018 Listen
A Primer On Enterprise Data Curation with Todd Walter - Episode 49Sep 24, 2018 Listen
Take Control Of Your Web Analytics Using Snowplow With Alexander Dean - Episode 48Sep 17, 2018 Listen
Keep Your Data And Query It Too Using Chaos Search with Thomas Hazel and Pete Cheslock - Episode 47Sep 10, 2018 Listen
An Agile Approach To Master Data Management with Mark Marinelli - Episode 46Sep 03, 2018 Listen
Protecting Your Data In Use At Enveil with Ellison Anne Williams - Episode 45Aug 27, 2018 Listen
Graph Databases In Production At Scale Using DGraph with Manish Jain - Episode 44Aug 20, 2018 Listen
Putting Airflow Into Production With James Meickle - Episode 43Aug 13, 2018 Listen
Taking A Tour Of PostgreSQL with Jonathan Katz - Episode 42Aug 06, 2018 Listen
Mobile Data Collection And Analysis Using Ona And Canopy With Peter Lubell-Doughtie - Episode 41Jul 30, 2018 Listen
Ceph: A Reliable And Scalable Distributed Filesystem with Sage Weil - Episode 40Jul 16, 2018 Listen
Building Data Flows In Apache NiFi With Kevin Doran and Andy LoPresto - Episode 39Jul 08, 2018 Listen
Leveraging Human Intelligence For Better AI At Alegion With Cheryl Martin - Episode 38Jul 02, 2018 Listen
Package Management And Distribution For Your Data Using Quilt with Kevin Moore - Episode 37Jun 25, 2018 Listen
User Analytics In Depth At Heap with Dan Robinson - Episode 36Jun 17, 2018 Listen
CockroachDB In Depth with Peter Mattis - Episode 35Jun 11, 2018 Listen
ArangoDB: Fast, Scalable, and Multi-Model Data Storage with Jan Steeman and Jan Stücke - Episode 34Jun 04, 2018 Listen
The Alooma Data Pipeline With CTO Yair Weinberger - Episode 33May 28, 2018 Listen
PrestoDB and Starburst Data with Kamil Bajda-Pawlikowski - Episode 32May 21, 2018 Listen
Brief Conversations From The Open Data Science Conference: Part 2 - Episode 31May 14, 2018 Listen
Brief Conversations From The Open Data Science Conference: Part 1 - Episode 30May 07, 2018 Listen
Metabase Self Service Business Intelligence with Sameer Al-Sakran - Episode 29Apr 30, 2018 Listen
Octopai: Metadata Management for Better Business Intelligence with Amnon Drori - Episode 28Apr 23, 2018 Listen
Data Engineering Weekly with Joe Crobak - Episode 27Apr 15, 2018 Listen
Defining DataOps with Chris Bergh - Episode 26Apr 08, 2018 Listen
ThreatStack: Data Driven Cloud Security with Pete Cheslock and Patrick Cable - Episode 25Apr 01, 2018 Listen
MarketStore: Managing Timeseries Financial Data with Hitoshi Harada and Christopher Ryan - Episode 24Mar 25, 2018 Listen
Stretching The Elastic Stack with Philipp Krenn - Episode 23Mar 19, 2018 Listen
Database Refactoring Patterns with Pramod Sadalage - Episode 22Mar 12, 2018 Listen
The Future Data Economy with Roger Chen - Episode 21Mar 05, 2018 Listen
Honeycomb Data Infrastructure with Sam Stokes - Episode 20Feb 26, 2018 Listen
Data Teams with Will McGinnis - Episode 19Feb 19, 2018 Listen
TimescaleDB: Fast And Scalable Timeseries with Ajay Kulkarni and Mike Freedman - Episode 18Feb 11, 2018 Listen
Pulsar: Fast And Scalable Messaging with Rajan Dhabalia and Matteo Merli - Episode 17Feb 04, 2018 Listen
Dat: Distributed Versioned Data Sharing with Danielle Robinson and Joe Hand - Episode 16Jan 29, 2018 Listen
Snorkel: Extracting Value From Dark Data with Alex Ratner - Episode 15Jan 22, 2018 Listen
CRDTs and Distributed Consensus with Christopher Meiklejohn - Episode 14Jan 15, 2018 Listen
Citus Data: Distributed PostGreSQL for Big Data with Ozgun Erdogan and Craig Kerstiens - Episode 13Jan 08, 2018 Listen
Wallaroo with Sean T. Allen - Episode 12Dec 25, 2017 Listen
SiriDB: Scalable Open Source Timeseries Database with Jeroen van der Heijden - Episode 11Dec 18, 2017 Listen
Confluent Schema Registry with Ewen Cheslack-Postava - Episode 10Dec 10, 2017 Listen
data.world with Bryon Jacob - Episode 9Dec 03, 2017 Listen
Data Serialization Formats with Doug Cutting and Julien Le Dem - Episode 8Nov 22, 2017 Listen
Buzzfeed Data Infrastructure with Walter Menendez - Episode 7Nov 14, 2017 Listen
Astronomer with Ry Walker - Episode 6Aug 06, 2017 Listen
Rebuilding Yelp's Data Pipeline with Justin Cunningham - Episode 5Jun 18, 2017 Listen
ScyllaDB with Eyal Gutkind - Episode 4Mar 18, 2017 Listen
Defining Data Engineering with Maxime Beauchemin - Episode 3Mar 05, 2017 Listen
Dask with Matthew Rocklin - Episode 2Jan 22, 2017 Listen
Pachyderm with Daniel Whitenack - Episode 1Jan 14, 2017 Listen
Introducing The Show - Episode 0Jan 08, 2017 Listen