June 05, 2017

Drill to Detail Ep.29 'New-World BI Development using BigQuery, Looker, Kakfa and Streamsets' With Special Guest Stewart Bryson

June 05, 2017/ Mark Rittman

Stewart Bryson returns to the show to join Mark Rittman to discuss new-world BI and data warehousing development using Google BigQuery and Amazon Athena, Apache Kafka and StreamSets, and talks about his experiences with Looker, the cloud-native BI tool that brings semantic modeling and modern development practices to the world of business intelligence.

May 22, 2017

Drill to Detail Ep.27 'Apache Kafka, Streaming Data Integration and Schema Registry' with Special Guest Gwen Shapira

May 22, 2017/ Mark Rittman

Mark Rittman is joined by Gwen Shapira from Confluent to talk about Apache Kafka, streaming data integration and how it differs from batch-based, GUI-developed ETL development, the problem with architects, exactly-once processing and how data governance is coming to Kafka development with Confluent's new schema registry server.

May 15, 2017

Drill to Detail Ep.26 'Airflow, Superset & The Rise of the Data Engineer' with Special Guest Maxime Beauchemin

May 15, 2017/ Mark Rittman

Mark Rittman is joined by Maxime Beauchemin to talk about analytics and data integration at Airbnb, the Apache Airflow and Superset open-source projects he helped launch and now works with day-to-day at Airbnb , and his recent Medium article on "The Rise of the Data Engineer".

"The Rise of the Data Engineer" blog by Maxime Beauchemin
Apache Airflow
Airbnb Superset
"Engineers Shouldn’t Write ETL: A Guide to Building a High Functioning Data Science Department" blog by Jeff Magnusson

March 20, 2017

Drill to Detail Ep.22 'SnapLogic's Enterprise Integration Cloud' With Special Guest Craig Stewart

March 20, 2017/ Mark Rittman

Mark Rittman is joined by Craig Stewart to talk about application and data integration, ODI and Sunopsis, SnapLogic's approach to hybrid on-premise/cloud integration and the rise of data preparation and dataflow-based cloud integration tools.

January 24, 2017

Drill to Detail Ep.16 'Qubit, Visitor Cloud & Google BigQuery' With Special Guest Alex Olivier

January 24, 2017/ Mark Rittman

Mark Rittman is joined by Alex Olivier from Qubit to talk about their platform journey from on-premise Hadoop to petabytes of data running in Google Cloud Platform, using Google Cloud Dataflow (aka Apache Beam), Google PubSub and Google BigQuery along with machine learning and analytics to deliver personalisation at-scale for digital retailers around the world.

November 15, 2016

Drill to Detail Ep.9 'Streamsets, Data-in-Motion and Data Drift' with Special Guest Pat Patterson

November 15, 2016/ Mark Rittman

Mark Rittman is joined by StreamSets' Pat Patterson, talking about data in motion and doing it at scale, the story behind StreamSets and the problem of data drift, and the challenges involved in managing dataflows at scale as a continuous operation.

Drill to Detail Podcast