[Webinar] How to Implement Data Contracts: A Shift Left to First-Class Data Products | Register Now

Sep 11, 2018読み取り時間: 3 min

Streams and Tables: Two Sides of the Same Coin

作成者 :

Matthias J. SaxSoftware Engineer
Guozhang WangSoftware Engineer

Sep 11, 2018読み取り時間: 3 min

We are happy to announce that our paper Streams and Tables: Two Sides of the Same Coin is published and available for free download. The paper was presented at the Twelfth International Workshop on Real-Time Business Intelligence and Analytics (BIRTE) held in conjunction with the 44th International Conference on Very Large Data Bases (VLDB) in Rio de Janeiro, Brazil, in August of this year.

The BIRTE workshop attracted many participants and hosted a keynote, research, industry and demo session as well as a panel discussion about data stream processing.

Paper summary

The paper is a joint work between Confluent and Humboldt-Universität zu Berlin that describes the Dual Streaming Model, which is the foundation of Kafka Streams’ and KSQL’s stream processing semantics:

In this paper, we introduce the Dual Streaming Model to reason about physical and logical order in data stream processing. This model presents the result of an operator as a stream of successive updates, which induces a duality of results and streams. As such, it provides a natural way to cope with inconsistencies between the physical and logical order of streaming data in a continuous manner, without explicit buffering and reordering. We further discuss the trade-offs and challenges faced when implementing this model in terms of correctness, latency, and processing cost. A case study based on Apache Kafka illustrates the effectiveness of our model in the light of real-world requirements.
Original Source

The Dual Streaming Model builds on the so-called stream-table duality, which allows you to unify data streams and relational tables into a holistic data processing model. Thus, data streams and continuously updating tables are the two core abstractions in the model. Additionally, the Dual Streaming Model decouples the handling of data that arrives later (i.e., out-of-order) from latency concerns and opens up a design space between processing cost, accepted latency and result completeness for the user that no other model offers.

Figure 1. Design Space

Figure 1. Design space

The wide adoption and growth of Kafka Streams and KSQL among enterprises shows that the Dual Streaming Model solves real-world problems across all types of industries. As a result, we are elated to share our paper for free so you can become the stream processing expert in your company and take the business to the next level.

Happy reading! 🙂

Next steps

Download the paper Streams and Tables: Two Sides of the Same Coin
View the slides from BIRTE 2018
Download Confluent Platform to try out Kafka Streams and KSQL
Read the Kafka Streams and KSQL documentation
Check out this recommended read by Eno Thereska: Watermarks, Tables, Event Time, and the Dataflow Model

Matthias is an Apache Kafka committer and PMC member, and works as a software engineer at Confluent. His focus is data stream processing in general, and thus he contributes to ksqlDB and Kafka Streams. Before joining Confluent, Matthias conducted research on distributed data stream processing systems at Humboldt-University of Berlin, were he received his Ph.D. Matthias is also a committer at Apache Flink and Apache Storm.
Guozhang Wang is a PMC member of Apache Kafka, and also a tech lead at Confluent leading the Kafka Streams team. He received his Ph.D. from Cornell University where he worked on scaling data-driven applications. Prior to Confluent, Guozhang was a senior software engineer at LinkedIn, developing and maintaining its backbone streaming infrastructure on Apache Kafka and Apache Samza.

このブログ記事は気に入りましたか？今すぐ共有

Confluent Deepens India Commitment With Major Expansion on Jio Cloud

Jul 10, 2025

Confluent Cloud is expanding on Jio Cloud in India. New features include Public and Private Link networking, the new Jio India Central region for multi-region resilience, and streamlined procurement via Azure Marketplace. These features empower Indian businesses with high-performance data streaming.

Drew Taylor

From Pawns to Pipelines: Stream Processing Fundamentals Through Chess

Jul 2, 2025

We will use Chess to explain some of the core ideas behind Confluent Cloud for Apache Flink. We’ve used the chessboard as an analogy to explain the Stream/Table duality before, but will expand on a few other concepts. Both systems involve sequences, state, timing, and pattern recognition and...

Streams and Tables: Two Sides of the Same Coin

Get started free with Confluent

Watch demo: Kafka streaming in 10 minutes

作成者 :

Paper summary

Next steps

Get started free with Confluent

Watch demo: Kafka streaming in 10 minutes

このブログ記事は気に入りましたか？今すぐ共有

Confluent Deepens India Commitment With Major Expansion on Jio Cloud

From Pawns to Pipelines: Stream Processing Fundamentals Through Chess

Paper summary

Next steps

Get started free with Confluent

Watch demo: Kafka streaming in 10 minutes

このブログ記事は気に入りましたか？今すぐ共有

Confluent ブログの登録

Confluent Deepens India Commitment With Major Expansion on Jio Cloud

From Pawns to Pipelines: Stream Processing Fundamentals Through Chess