OSS Kafka couldn’t save them. See how data streaming came to the rescue! | Watch now

Presentation

Restoring Restoration's Reputation in Kafka Streams

« Kafka Summit London 2023

Restoring local state in Kafka Streams applications is indispensable for recovering after a failure or for moving stream processors between Kafka Streams clients. However, restoration has a reputation for being operationally problematic, because a Streams client occupied with restoration of some stream processors blocks other stream processors that are ready from processing new records. When the state is large this can have a considerable impact on the overall throughput of the Streams application. Additionally, when failures interrupt restoration, restoration restarts from the beginning, thus negatively impacting throughput further.

In this talk, we will explain how Kafka Streams currently restores local state and processes records. We will show how we decouple processing from restoring by moving restoration to a dedicated thread and how throughput profits from this decoupling. We will present how we avoid restarting restoration from the beginning after a failure. Finally, we will talk about the concurrency and performance problems that we had to overcome and we will present benchmarks that show the effects of our improvements.

Presenter

Bruno Cadonna

Confluent

Bruno Cadonna is an Apache Kafka committer and a software developer at Confluent working on ksqlDB and Kafka Streams. Prior to Confluent, he was a software developer at SAP, where he worked on a distributed in-memory computing engine for big data. Bruno holds a Ph.D. in computer science from Free University of Bozen-Bolzano in Italy and held a postdoc position at Humboldt-Universität zu Berlin. His academic research focused on data stream and event processing.

Presenter

Lucas Brutschy

Confluent

Lucas is an engineer working on Apache Kafka Streams at Confluent. He is a born Berliner, and after acquiring a PhD from ETH Zurich, where he worked on program analysis for data stores with weak consistency guarantees, he moved back to Berlin. Here, he joined HERE technologies (formerly Nokia Maps), where he worked on large-scale data processing for location-based data, before joining Confluent in 2022.

Restoring Restoration's Reputation in Kafka Streams

Presenter

Bruno Cadonna

Presenter

Lucas Brutschy

Related Links

How Confluent Completes Apache Kafka eBook

Leverage a cloud-native service 10x better than Apache Kafka

Confluent Developer Center

Spend less on Kafka with Confluent, come see how