Data Engineering using Kafka and Spark Structured Streaming
A comprehensive Data Engineering course on building streaming pipelines using Kafka and Spark Structured Streaming
4.37 (259 reviews)
4,662 students
9.5 hours of content
Last update: Dec 2024
Regular price: $69.99
What you will learn
Setting up a self-support lab with Hadoop (HDFS and YARN), Hive, Spark, and Kafka
Overview of Kafka to build streaming pipelines
Data ingestion into Kafka topics using Kafka Connect with the File Source connector
Data ingestion into HDFS using Kafka Connect with the HDFS 3 Connector plugin
Overview of Spark Structured Streaming to process data as part of Streaming Pipelines
Incremental data processing using Spark Structured Streaming with a file source and a file target
Integration of Kafka and Spark Structured Streaming - Reading Data from Kafka Topics
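To give a rough idea of what the last item involves, here is a minimal sketch of reading a Kafka topic with Spark Structured Streaming in Python. It is not taken from the course material. The helper names `kafka_source_options` and `read_kafka_topic` are hypothetical, as are the broker address and topic name in the usage note. The option keys are those of Spark's built-in `kafka` source.

```python
def kafka_source_options(bootstrap_servers, topic, starting_offsets="latest"):
    """Build the option map for Spark's built-in "kafka" streaming source.

    Hypothetical helper, shown for illustration only.
    """
    return {
        # Comma-separated host:port list of Kafka brokers.
        "kafka.bootstrap.servers": bootstrap_servers,
        # Topic (or comma-separated topics) to subscribe to.
        "subscribe": topic,
        # Where a brand-new query starts reading: "earliest" or "latest".
        "startingOffsets": starting_offsets,
    }


def read_kafka_topic(spark, bootstrap_servers, topic, starting_offsets="latest"):
    """Return a streaming DataFrame over one Kafka topic.

    `spark` is assumed to be an existing SparkSession started with the
    Spark-Kafka integration package available on its classpath.
    """
    options = kafka_source_options(bootstrap_servers, topic, starting_offsets)
    return spark.readStream.format("kafka").options(**options).load()
```

With a real session, a call such as `read_kafka_topic(spark, "localhost:9092", "retail_logs")` (both values are placeholders) returns a streaming DataFrame whose `key` and `value` columns are binary, so they are usually cast first, for example with `selectExpr("CAST(value AS STRING)")`. Keeping the options in a separate function makes them easy to inspect without a running cluster.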
Udemy ID: 4239988
Course created date: 8/13/2021
Course indexed date: 10/23/2021
Course submitted by: Bot