Kafka Introduction1

Kafka is a distributed publish-subscribe messaging system that operates on the Java Virtual Machine and relies on Zookeeper for coordination. It organizes messages into topics, with producers publishing messages and consumers subscribing to them, allowing for high-throughput and scalable data flow. Key concepts include the use of logs for message storage, partitioning for scaling, and consumer groups for load balancing and ordering guarantees.

Uploaded by

navyasrinarayanadas


Messaging Architectures: Messaging Models

1. Point to Point
2. Publish and Subscribe

Kafka is an example of the publish-and-subscribe messaging model.
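The difference between the two models can be sketched in a few lines of plain Python. This is an illustration only, not Kafka code: the class names `PointToPointQueue` and `PubSubTopic` and the subscriber names are assumptions made up for the example.

```python
from collections import deque


class PointToPointQueue:
    """Point to point: each message is delivered to exactly one receiver."""

    def __init__(self):
        self._messages = deque()

    def send(self, message):
        self._messages.append(message)

    def receive(self):
        # Taking the message removes it, so no other receiver will see it.
        return self._messages.popleft() if self._messages else None


class PubSubTopic:
    """Publish and subscribe: each message is delivered to every subscriber."""

    def __init__(self):
        self._inboxes = {}

    def subscribe(self, name):
        self._inboxes[name] = []

    def publish(self, message):
        # Every subscriber gets its own copy of the message.
        for inbox in self._inboxes.values():
            inbox.append(message)

    def inbox(self, name):
        return list(self._inboxes[name])


queue = PointToPointQueue()
queue.send("order-1")
first = queue.receive()   # one receiver takes the message
second = queue.receive()  # nothing is left for a second receiver

topic = PubSubTopic()
topic.subscribe("billing")
topic.subscribe("shipping")
topic.publish("order-1")  # both subscribers receive "order-1"
```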

Kafka Overview

• Kafka is a distributed publish-subscribe messaging system written in Scala and Java. It runs on the Java Virtual
Machine (JVM), and client libraries are available for many languages.

• Kafka has traditionally relied on a separate service, Zookeeper (a distributed coordination system), to function; newer Kafka releases can run without it in KRaft mode.

• Kafka offers high throughput and is built to scale out across multiple servers.

• Kafka persists messages on disk and can be used for batched consumption as well as real-time applications.

Key Terminology

• Kafka maintains feeds of messages in categories called topics.

• Processes that publish messages to a Kafka topic are called producers.

• Processes that subscribe to topics and process the feed of published messages are called consumers.

• Kafka runs as a cluster of one or more servers, each of which is called a broker.

• All components communicate through a simple, high-performance binary protocol over TCP.
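To make the vocabulary concrete, the sketch below maps each term onto a small in-memory Python object. It is a toy model under stated assumptions: `Broker`, `producer`, `consumer`, and the topic names are hypothetical, there is no network or disk involved, and none of this is the real Kafka client API.

```python
class Broker:
    """Stands in for one server in the cluster; topics live in memory here."""

    def __init__(self):
        self.topics = {}

    def publish(self, topic, message):
        # A topic is a named feed; it is created on first use in this toy model.
        self.topics.setdefault(topic, []).append(message)

    def fetch(self, topic, start=0):
        # Return every message in the topic from position `start` onward.
        return self.topics.get(topic, [])[start:]


def producer(broker, topic, messages):
    """A producer publishes messages to a topic."""
    for message in messages:
        broker.publish(topic, message)


def consumer(broker, topic, handler):
    """A consumer subscribes to a topic and processes its feed."""
    for message in broker.fetch(topic):
        handler(message)


broker = Broker()
producer(broker, "page-views", ["/home", "/pricing"])
producer(broker, "signups", ["alice"])

seen = []
consumer(broker, "page-views", seen.append)  # seen now holds the page-view feed
```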

Kafka Architecture

[Figure: Kafka architecture. Producers publish to the brokers of a Kafka cluster, consumers read from those brokers, and Zookeeper sits alongside the cluster.]

Understanding Kafka
• Kafka is built on a simple storage abstraction called a log: an append-only, totally ordered sequence of records,
ordered by time.

• Records are appended to the end of the log, and reads proceed from left to right, that is, from the oldest record
to the newest, in the log (or topic).

• Each entry is assigned a unique sequential log-entry number (an offset).

• The offset plays the role of a “timestamp” for each entry, but it is decoupled from any clock, which suits the
distributed nature of Kafka.
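A minimal sketch of such a log, assuming a plain in-memory list stands in for the on-disk storage. The class name `Log` and its methods are illustrative and are not part of any Kafka API.

```python
class Log:
    """An append-only sequence of records addressed by offset."""

    def __init__(self):
        self._records = []

    def append(self, record):
        # The offset is simply the position the record lands at, so offsets
        # are unique, sequential, and independent of any clock.
        offset = len(self._records)
        self._records.append(record)
        return offset

    def read(self, offset, max_records=10):
        # Reads move forward through the log, starting at the given offset.
        return self._records[offset:offset + max_records]


log = Log()
first_offset = log.append("user signed up")
second_offset = log.append("user logged in")
third_offset = log.append("user purchased")

from_second = log.read(second_offset)  # everything from the second entry on
```

Existing records are never modified or reordered; a reader only needs to remember the offset it has reached.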

Kafka Key Design Concepts

• A log is analogous to a file or table in which records are appended and ordered by time.

• Conceptually, the log is a natural data structure for handling data flow between systems.

• Kafka is designed to centralize an organization’s data into an enterprise log (message bus) that other systems
and applications can subscribe to in real time.

Kafka Conceptual Design

• Each logical data source can be modeled as a log corresponding to a topic or data feed in Kafka.
• Each subscribing application should read as quickly as it can from each topic, persist each record it reads into
its own data store, and advance its offset to the next message to be read.
• Subscribers can be any type of data system or middleware, such as a cache, Hadoop, a streaming system like
Spark or Storm, a search system, a web services provisioning system, a data warehouse, etc.
• In Kafka, partitioning is applied to the log/topic in order to allow horizontal scaling.
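The read, persist, advance cycle described above can be sketched as follows. This is a simplified model, assuming the topic is a Python list and each subscriber keeps its own store and offset; the function `consume` and the subscriber names are hypothetical.

```python
def consume(log, store, offset, batch_size=2):
    """Read one batch starting at `offset`, persist it, return the next offset."""
    batch = log[offset:offset + batch_size]
    for record in batch:
        store.append(record)     # persist into the subscriber's own data store
    return offset + len(batch)   # advance to the next unread entry


topic_log = ["signup", "login", "purchase", "logout", "login"]

# Two independent subscribers, each with its own store and its own offset.
cache_store, cache_offset = [], 0
warehouse_store, warehouse_offset = [], 0

# The cache reads until it has caught up with the end of the log.
while cache_offset < len(topic_log):
    cache_offset = consume(topic_log, cache_store, cache_offset)

# The warehouse has only read one batch so far; it can resume later
# from warehouse_offset without affecting the cache.
warehouse_offset = consume(topic_log, warehouse_store, warehouse_offset, batch_size=3)
```

Because each subscriber tracks its own offset, a slow subscriber does not hold back a fast one.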

Kafka Logical Design

• Each partition is a totally ordered log within a topic, and there is no global ordering between partitions.

• The publisher controls which partition each message goes to: it can choose the partition from a message key,
or let messages be spread across partitions at random.
• Partitioning allows throughput to scale roughly linearly with the size of the Kafka cluster.
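A sketch of publisher-side partition selection follows. It assumes a CRC32 hash of the key, chosen here only because it is stable across runs; Kafka's own partitioner uses a different hash, and the function name `choose_partition` is hypothetical.

```python
import random
import zlib


def choose_partition(key, num_partitions, rng=random):
    """Pick a partition: by key hash when a key is given, else at random."""
    if key is None:
        return rng.randrange(num_partitions)
    # A stable hash sends every message with the same key to the same partition.
    return zlib.crc32(key.encode("utf-8")) % num_partitions


num_partitions = 4
partitions = [[] for _ in range(num_partitions)]

events = [
    ("user-1", "login"),
    ("user-2", "login"),
    ("user-1", "purchase"),
    ("user-1", "logout"),
]
for key, value in events:
    partitions[choose_partition(key, num_partitions)].append((key, value))

# All "user-1" events sit in one partition, in the order they were published.
user1_partition = choose_partition("user-1", num_partitions)
user1_events = [v for k, v in partitions[user1_partition] if k == "user-1"]
```

Keying by user keeps each user's events in order within one partition, while events for different users may land in different partitions and have no defined order relative to each other.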

Kafka Topics

• A Kafka topic should have a small number of consumer groups, each one representing a “logical
subscriber”.

• Consumption of a topic can be scaled by adding consumer instances to the same group; Kafka automatically
load-balances message consumption across them.

• Partitioning within a topic is what makes parallel consumption possible.

• Partitions in a topic are assigned to the consumers within a consumer group.

• A consumer group cannot usefully have more consumer instances than the topic has partitions; any extra
instances are left without a partition and sit idle.

• If the total order in which messages are published matters to the consumer, use a single partition for the
topic, which also means only one active consumer process in the consumer group.
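The assignment of partitions to the consumers of one group can be sketched as below. The round-robin rule is an assumption chosen for illustration; Kafka's actual assignment strategies are configurable and differ in detail, and `assign_partitions` is a hypothetical name.

```python
def assign_partitions(num_partitions, consumers):
    """Give each partition to exactly one consumer in the group (round-robin)."""
    assignment = {consumer: [] for consumer in consumers}
    for partition in range(num_partitions):
        owner = consumers[partition % len(consumers)]
        assignment[owner].append(partition)
    return assignment


# Two consumers share four partitions evenly.
balanced = assign_partitions(4, ["c1", "c2"])
# → {"c1": [0, 2], "c2": [1, 3]}

# Three consumers but only two partitions: the third consumer gets nothing.
oversized = assign_partitions(2, ["c1", "c2", "c3"])
# → {"c1": [0], "c2": [1], "c3": []}

# One partition, one consumer: the only layout that preserves total order.
ordered = assign_partitions(1, ["c1"])
```

Since each partition has exactly one owner in the group, messages within a partition are processed in order, and adding consumers beyond the partition count adds no parallelism.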

Kafka Topic Partitions

• A topic consists of partitions.

• Partition: an ordered, immutable sequence of messages that is continually appended to.

• The number of partitions in a topic is configurable.
• The number of partitions determines the maximum parallelism of a consumer group.
– Cf. the parallelism of Storm’s KafkaSpout, set via builder.setSpout(,,N)

– Consumer group A, with 2 consumers, reads from a 4-partition topic
– Consumer group B, with 4 consumers, reads from the same topic
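The two-group scenario above can be worked through in a small simulation. It assumes round-robin assignment and made-up message names; `read_as_group` and the member names are hypothetical and serve only to show that each group receives the whole topic independently.

```python
# A 4-partition topic; message names encode partition and position.
topic = {
    0: ["p0-m0", "p0-m1"],
    1: ["p1-m0"],
    2: ["p2-m0", "p2-m1", "p2-m2"],
    3: ["p3-m0"],
}


def read_as_group(topic, members):
    """Return {member: [messages]} for one consumer group reading the topic."""
    received = {member: [] for member in members}
    for index, partition_id in enumerate(sorted(topic)):
        # Within a group, each partition is read by exactly one member.
        owner = members[index % len(members)]
        received[owner].extend(topic[partition_id])
    return received


# Group A: 2 consumers, so each one reads 2 of the 4 partitions.
group_a = read_as_group(topic, ["a1", "a2"])

# Group B: 4 consumers, so each one reads exactly 1 partition.
group_b = read_as_group(topic, ["b1", "b2", "b3", "b4"])

all_messages = sorted(m for msgs in topic.values() for m in msgs)
```

Both groups see every message in the topic; only the split among members differs. Group B reaches the maximum parallelism for a 4-partition topic.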

Kafka Consumer Groups

• Kafka assigns the partitions in a topic to the consumer instances in a consumer group to provide ordering
guarantees and load balancing over a pool of consumer processes. Note that consumer instances beyond the
total partition count receive no partition and stay idle.

