Nike’s Principal Data Engineer Ashok Singamaneni joins Benjamin and Eldad to discuss his open-source data quality framework, Spark Expectations. Ashok explains how the tool, which was inspired by Databricks DLT Expectations, shifts data quality checks to before the data is written to a final table. This proactive approach uses row-level, aggregation-level, and query data quality checks to fail jobs, drop bad records, or alert teams - ultimately saving huge costs on recompute and engineering effort in mission-critical data pipelines.

Podden och tillhörande omslagsbild på den här sidan tillhör The Firebolt Data Bros. Innehållet i podden är skapat av The Firebolt Data Bros och inte av, eller tillsammans med, Poddtoppen.