Simple tutorial in Python for Kafka

Using Java/Scala to work with Kafka is hard (especially for someone like me). Ta-da!

What is Kafka?

Kafka is a distributed messaging system. It is at the core of many production systems in places such as Uber and LinkedIn (who created Kafka).

The architecture is a publish-subscribe model, where consumers read messages from topics that they have subscribed, where the messages are sent by producers.

Common use-cases:

messaging between applications, where you can have applications "talk" to each using messages
data processing pipelines from source systems to target destinations, thereby processing information on a streaming basis, rather than in batches as with your traditional ETL jobs

From Confluent.io

There is a lot more complexity under the hood, and I suggest you read the official docs for more information.

Step 1: Download the distribution

Download the 2.0.0 release and un-tar it.

tar -xzf kafka_2.11-2.0.0.tgz

Also, install pip requirements by running pip install -r requirements.txt

Step 2: Start the server

Kafka uses ZooKeeper so you need to first start a ZooKeeper server.

cd kafka_2.11-2.0.0

bin/zookeeper-server-start.sh config/zookeeper.properties

Now start the Kafka server:

bin/kafka-server-start.sh config/server.properties

Step 3: Create a topic

Create a topic named "test":

bin/kafka-topics.sh --create --zookeeper localhost:2181 --replication-factor 1 --partitions 1 --topic test

See list of topics using the following command:

bin/kafka-topics.sh --list --zookeeper localhost:2181 test

Step 4: Run consumer and producer

In separate command shells, run the following:

python consumer.py

This is a consumer of the messages sent through Kafka. Simple writing to CSV of the streams is implemented.

Note: Press CTRL + C to send KeyboardInterrupt to exit the process. Alternatively, close the shell session.

python producer.py

Producer of messages. Key in any valid string to send. Type "quit" to exit.

You should now see the shell running consumer.py displaying the messages from Kafka!

Step 5: Tear things down

bin/kafka-server-stop.sh

bin/zookeeper-server-stop.sh

This would terminate both server processes.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
.gitignore		.gitignore
README.md		README.md
consumer.py		consumer.py
producer.py		producer.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Simple tutorial in Python for Kafka

What is Kafka?

Step 1: Download the distribution

Step 2: Start the server

Step 3: Create a topic

Step 4: Run consumer and producer

Step 5: Tear things down

About

Releases

Packages

Languages

jaabberwocky/kafka-python-example

Folders and files

Latest commit

History

Repository files navigation

Simple tutorial in Python for Kafka

What is Kafka?

Step 1: Download the distribution

Step 2: Start the server

Step 3: Create a topic

Step 4: Run consumer and producer

Step 5: Tear things down

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages