[Webinar] How to Implement Data Contracts: A Shift Left to First-Class Data Products | Register Now

Presentation

Interactive Query in Kafka Streams: The Next Generation

« Kafka Summit London 2022

Kafka Streams offers Interactive Queries (IQ) that allow one to interact with the internal stream processing state from outside of the application. This functionality has proven invaluable to users over the years for everything from debugging to serving low latency queries straight from the Streams runtime.

However, the actual interfaces for IQ were designed in the very early days of Kafka Streams and have proven cumbersome to use. Adding new custom queries such as reverse scan requires changes to more than 6000 loc spanning 108 files. There is no way currently to customize how a query executes - if, for instance, it should use the caching layer or not. Moreover, there is no way to extend the query result to include extra information such as which store layers or segments participated in the query, execution time, cache hit/miss, etc. Finally, IQ allows users to tradeoff consistency for availability by querying standby stores during rebalances. Eventual consistency, although useful as a concept, makes it difficult to provide applications with a good user experience.

In this presentation, we unveil the next generation of Interactive Query (IQv2) that addresses all these shortcomings. We demonstrate the key benefits of the new query API:

Customizability: You can use IQv2 to easily plug in your own store implementations and queries. You can use it, for example, to implement complex queries to push down filters and indices into the storage layer itself.
Control: It gives you far more control over how each query is executed and exposes all the execution details you need to power high performance, production-grade queries.
Consistency: It provides a tunable consistency model, allowing you to choose your tradeoff between consistency, availability, and latency.

Attend this talk if you want to learn about how to write high performance, semantically strong applications in a modern, data-in-motion environment.

Presenter

Vasiliki Papavasileiou

Confluent

Vicky Papavasileiou is part of the ksqlDB team where she focuses her efforts on improving performance and operability of pull and push queries. Prior to joining Confluent, Vicky completed her doctorate studies at the University of California, San Diego, with an emphasis on big data management, large-scale graph processing, and query optimization.

Presenter

John Roesler

Confluent

John Roesler is a software engineer at Confluent, focusing mainly on Kafka Streams. Prior to that, he helped build and operate the high-volume event streaming platform at Bazaarvoice.

Interactive Query in Kafka Streams: The Next Generation

Presenter

Vasiliki Papavasileiou

Presenter

John Roesler

Related Links

How Confluent Completes Apache Kafka eBook

Leverage a cloud-native service 10x better than Apache Kafka

Confluent Developer Center

Spend less on Kafka with Confluent, come see how