Anbu Anand Gurusamy
2 min readFeb 14, 2022

--

PART — 2— Dockerized Ubuntu 20.04 — SPARK 3.2.1 with KAFKA 2.13–2.8.1 integration

Firstly due credits to the BOS the BOSS!

I took the docker image from his github

But it did not work for 3.0, got the usually java errors, so as per my part 1, upgraded to 3.2.1 and added the missing jars

Here is the customized docker image, which is working now.

Docker customization done

Docker file

So guys please follow Wesley Bos’s link above, docker file is form my side.

Other good references that I used, credits to Shuyi Yang — https://towardsdatascience.com/kafka-docker-python-408baf0e1088

Happy Sparking and Kafkaying in an Docker !!

--

--