Should You Read Kafka as a Stream or in Batch? Should You Even Care?

« Kafka Summit APAC 2021

Should you consume Kafka in a stream OR batch? When should you choose each one? What is more efficient, and cost effective?

In this talk we’ll give you the tools and metrics to decide which solution you should apply when, and show you a real life example with cost & time comparisons.

To highlight the differences, we’ll dive into a project we’ve done, transitioning from reading Kafka in a stream to reading it in batch.

By turning conventional thinking on its head and reading our multi-petabyte Kafka stream in batch using Spark and Airflow, we’ve achieved a huge cost reduction of 65% while at the same time getting a more scalable and resilient solution.

We’ll explore the tradeoffs and give you the metrics and intuition you’ll need to make such decisions yourself.

We’ll cover:

Costs of processing in stream compared to batch
Scaling up for bursts and reprocessing
Making the tradeoff between wait times and costs
Recovering from outages
And much more...

Chinese Japanese Korean

Presenter

Ido Nadler

Nielsen

Presenter

Opher Dubrovsky

Nielsen

Should You Read Kafka as a Stream or in Batch? Should You Even Care?

Presenter

Ido Nadler

Presenter

Opher Dubrovsky

Related Links

How Confluent Completes Apache Kafka eBook

Leverage a cloud-native service 10x better than Apache Kafka

Confluent Developer Center

Spend less on Kafka with Confluent, come see how