Live Demo: Build Scalable Event-Driven Microservices with Confluent | Register Now

Presentation

From 🐛 to 🦋: Data Pipelines Evolution from Batch to Streaming

« Current 2023

Despite data streaming being in most companies’ agenda, transitioning out of consolidated batch systems is not as simple as flipping a switch: new technologies, processes and coding frameworks need to be assessed and then adopted which can make the evolution a long and painful process. But, what if we could keep the same framework? This session explores how Apache Flink can narrow the gap between batch and streaming by keeping the same data pipelines definition while the underlying technology evolves.

We’ll start the journey with a typical batch system, based on a relational database, and then showcase how to evolve it to streaming using Apache Flink and Apache Kafka with minimal changes on the data pipeline definition. We’ll cover query based connectors, mimicking the batch behavior, and then move to more advanced change data capture solutions with Debezium. Finally we’ll touch on critical topics like data validation and late arrival of events and expose strategies on how to minimize related risks.

If you’re thinking about migrating from batch to streaming, but are afraid of any disruption the process may cause in your organization, this session is for you!

Presenter

Francesco Tisiot

Aiven

Francesco comes from Verona, Italy and works as a Staff Developer Advocate at Aiven. With his many years of experience as a data engineer, he has stories to tell and advice for data-wranglers everywhere. Francesco loves sharing knowledge with others as a speaker and writer, and is on a mission to defend the world from bad Italian food!

From 🐛 to 🦋: Data Pipelines Evolution from Batch to Streaming

Presenter

Francesco Tisiot

Related Links

How Confluent Completes Apache Kafka eBook

Leverage a cloud-native service 10x better than Apache Kafka

Confluent Developer Center

Spend less on Kafka with Confluent, come see how