Data Engineering using Kafka and Spark Structured Streaming

A comprehensive Data Engineering course on building streaming pipelines using Kafka and Spark Structured Streaming
4.37 (259 reviews)
Udemy
platform
English
language
Databases
category
Data Engineering using Kafka and Spark Structured Streaming
4,662
students
9.5 hours
content
Dec 2024
last update
$69.99
regular price

What you will learn

Setting up self support lab with Hadoop (HDFS and YARN), Hive, Spark, and Kafka

Overview of Kafka to build streaming pipelines

Data Ingestion to Kafka topics using Kafka Connect using File Source

Data Ingestion to HDFS using Kafka Connect using HDFS 3 Connector Plugin

Overview of Spark Structured Streaming to process data as part of Streaming Pipelines

Incremental Data Processing using Spark Structured Streaming using File Source and File Target

Integration of Kafka and Spark Structured Streaming - Reading Data from Kafka Topics

4239988
udemy ID
8/13/2021
course created date
10/23/2021
course indexed date
Bot
course submited by