Apache Spark for Data Engineering — Hands-On with PySpark https://WebToolTip.com Published 2/2026
Created by Big Data Expertise
MP4 | Video: h264, 1920x1080 | Audio: AAC, 44.1 KHz, 2 Ch
Level: All Levels | Genre: eLearning | Language: English | Duration: 39 Lectures ( 3h 54m ) | Size: 3 GB
Go from beginner to building real Spark ETL pipelines using DataFrames and Spark SQL
What you'll learn
✓ Set up and work with an Apache Spark environment using PySpark to process real-world datasets.
✓ Read data from common formats such as CSV and Parquet
✓ Clean, transform and aggregate data using the Spark DataFrame API & Spark SQL
✓ Build a complete end-to-end Spark ETL pipeline
✓ Understand how Apache Spark works under the hood
Requirements
● Basic programming knowledge : You should be comfortable with basic programming concepts such as variables, functions, and loops (Python or any similar language).
● Basic Python or Scala familiarity (recommended, not mandatory) : Knowing Python or Scala basics will help you follow the examples, but Spark concepts apply to both languages.
● Basic SQL knowledge Understanding simple SQL queries (SELECT, WHERE, GROUP BY) is helpful but not required.
● A computer with internet access A standard laptop or desktop computer is enough. No special hardware is required.