Best Hands-on Big Data Practices with PySpark & Spark Tuning
Semi-Structured (JSON), Structured and Unstructured Data Analysis with Spark and Python & Spark Performance Tuning
4.60 (1198 reviews)

10,130
students
13 hours
content
Jan 2025
last update
$84.99
regular price
What you will learn
Understand Apache Spark’s framework, execution and programming model for the development of Big Data Systems
Learn step-by-step hands-on PySpark practices on structured, unstructured and semi-structured data using RDD, DataFrame and SQL
Learn how to work with a free Cloud-based and a Desktop computer for Spark setup and configuration
Build simple to advanced Big Data applications for different types of data (volume, variety, veracity) through real case studies
Investigate and apply optimization and performance tuning methods to manage data Skewness and prevent Spill
Investigate and apply Adaptive Query Execution (AQE) to optimize Spark SQL query execution at runtime
Investigate and be able to explain the lazy evaluations (Narrow vs Wide transformation) and internal working of Spark
Build and learn Spark SQL applications using JDBC (Java Database Connectivity)
Screenshots




Related Topics
4496750
udemy ID
1/15/2022
course created date
4/17/2022
course indexed date
Bot
course submited by