Choosing the right deployment model is critical to successfully run a scalable streaming data platform in production. Selecting the right hardware or cloud deployment architecture for each use case is important to ensure that the system reliably provides high-throughput and low-latency data streams.
This white paper describes the reference architecture of Confluent Enterprise, which is the most complete platform to build enterprise-scale streaming pipelines using Apache Kafka and to simplify the development of stream processing applications.
This paper is intended for data architects and system administrators planning to deploy Apache Kafka in production. Readers will learn about the components of Confluent Enterprise, key considerations for production deployments, guidelines for hardware selection and the selection of instances for cloud providers.
Gwen Shapira, Product Manager, Confluent
Gwen is a product manager at Confluent managing Confluent Platform, a stream data platform powered by Apache Kafka. She has 15 years of experience working with code and customers to build scalable data architectures, integrating relational and big data technologies. She currently specializes in building real-time reliable data processing pipelines using Apache Kafka. Gwen is an Oracle Ace Director, an author of books including "Kafka, the Definitive Guide", and a frequent presenter at data related conferences. Gwen is also a committer on the Apache Kafka and Apache Sqoop projects.