Apache Hudi: Large Scale Data Systems with Vinoth Chandar

Apache Hudi is an open-source data management framework used to simplify incremental data processing and data pipeline development. This framework more efficiently manages business requirements like data lifecycle and improves data quality. Some common use cases for Hudi is record-level insert, update, and delete, simplified file management and near real-time data access, and simplified CDC data pipeline development (AWS.amazon.com).

In this episode we speak to Vinoth Chandar, VP of Apache Hudi. Vinoth is the creator of the Hudi project at Uber. He continues to lead its evolution at the Apache Software Foundation. Previously he was a Principal Engineer at Confluent, and a Sr Staff Engineer/Manager at Uber before that. We discuss building large scale distributed and data systems.

Sponsorship inquiries: sponsor@softwareengineeringdaily.com

Transcript

Transcript provided by We Edit Podcasts. Software Engineering Daily listeners can go to weeditpodcasts.com to get 15% off the first three months of audio editing and transcription services with code: SED. Thanks to We Edit Podcasts for partnering with SE Daily. Please click here to view this show’s transcript.

Sponsors

Triplebyte is a network of 200,000+ Top Engineers. Triplebyte works with more than 400 tech companies including Coinbase, Zoox, Dropbox, and Facebook.  Triplebyte is focused on matching high-quality engineers with great jobs. Let the right roles come to you. Want to know your strengths? Take the Triplebyte quiz and receive your personalized feedback report. Tracks offered: Generalist, Front End Mobile, Machine Learning, DevOps, DataScience, and Entry Level. Visit triplebyte.com/sedaily.

Today’s podcast is brought to you by Google Cloud and DORA research team. The team recently launched a survey to collect insights for the 2021 State of DevOps report and would love your input! The State of DevOps report is the largest and longest running research


This article is purposely trimmed, please visit the source to read the full article.

The post Apache Hudi: Large Scale Data Systems with Vinoth Chandar appeared first on Software Engineering Daily.

This post was originally published on this site