Master Apache Spark using Spark SQL and PySpark 3
Master Apache Spark using Spark SQL as well as PySpark with Python3 with complementary lab access
4.54 (2426 reviews)

18,080
students
32 hours
content
May 2024
last update
$74.99
regular price
What you will learn
Setup the Single Node Hadoop and Spark using Docker locally or on AWS Cloud9
Review ITVersity Labs (exclusively for ITVersity Lab Customers)
All the HDFS Commands that are relevant to validate files and folders in HDFS.
Quick recap of Python which is relevant to learn Spark
Ability to use Spark SQL to solve the problems using SQL style syntax.
Pyspark Dataframe APIs to solve the problems using Dataframe style APIs.
Relevance of Spark Metastore to convert Dataframs into Temporary Views so that one can process data in Dataframes using Spark SQL.
Apache Spark Application Development Life Cycle
Apache Spark Application Execution Life Cycle and Spark UI
Setup SSH Proxy to access Spark Application logs
Deployment Modes of Spark Applications (Cluster and Client)
Passing Application Properties Files and External Dependencies while running Spark Applications
Screenshots




1398116
udemy ID
10/17/2017
course created date
11/20/2019
course indexed date
Bot
course submited by