Best Hands-on Big Data Practices with PySpark & Spark Tuning

Semi-Structured (JSON), Structured and Unstructured Data Analysis with Spark and Python & Spark Performance Tuning
Rating: 4.60 (1198 reviews)
Platform: Udemy
Language: English
Category: Other
Students: 10,130
Content: 13 hours
Last update: Jan 2025
Regular price: $84.99

What you will learn

Understand Apache Spark’s framework, execution and programming model for the development of Big Data Systems

Learn step-by-step hands-on PySpark practices on structured, unstructured and semi-structured data using RDD, DataFrame and SQL

Learn how to set up and configure Spark both in a free cloud-based environment and on a desktop computer

Build simple to advanced Big Data applications for different types of data (volume, variety, veracity) through real case studies

Investigate and apply optimization and performance tuning methods to manage data skew and prevent spill

Investigate and apply Adaptive Query Execution (AQE) to optimize Spark SQL query execution at runtime

Investigate and be able to explain lazy evaluation, narrow vs. wide transformations, and the internal workings of Spark

Learn to build Spark SQL applications using JDBC (Java Database Connectivity)
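
As a rough illustration of the data skew item in the list above, the sketch below simulates hash partitioning in plain Python and shows how "salting" a hot key can spread its records across partitions. It is not taken from the course material and does not use the Spark API. The partition count, salt count, sample keys, and the helper names `partition_of`, `partition_sizes`, and `salt` are all assumptions made up for this example, and salting is only one of several common ways to handle skew.

```python
import random
import zlib
from collections import Counter

NUM_PARTITIONS = 8  # hypothetical partition count, chosen for illustration
NUM_SALTS = 8       # hypothetical number of salt values


def partition_of(key: str, num_partitions: int = NUM_PARTITIONS) -> int:
    """Deterministic stand-in for a hash partitioner (not Spark's own)."""
    return zlib.crc32(key.encode("utf-8")) % num_partitions


def partition_sizes(keys) -> Counter:
    """Count how many records land in each simulated partition."""
    return Counter(partition_of(k) for k in keys)


def salt(key: str, rng: random.Random, num_salts: int = NUM_SALTS) -> str:
    """Append a random suffix so one hot key becomes several distinct keys."""
    return f"{key}#{rng.randrange(num_salts)}"


# Made-up skewed data set: a single hot key accounts for 90% of the records.
rng = random.Random(0)
keys = ["hot"] * 9000 + [f"user{i}" for i in range(1000)]

plain_sizes = partition_sizes(keys)
salted_sizes = partition_sizes(salt(k, rng) for k in keys)

print("largest partition without salting:", max(plain_sizes.values()))
print("largest partition with salting:   ", max(salted_sizes.values()))
```

Without salting, every record for the hot key hashes to the same partition, so that partition holds at least 9,000 of the 10,000 records. With salting, the hot key is split into several distinct keys and the largest partition shrinks. The trade-off is that any aggregation per original key then needs a second step that strips the salt and combines the partial results.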

Udemy ID: 4496750
Course created date: 1/15/2022
Course indexed date: 4/17/2022
Course submitted by: Bot