Pyspark For Data Scientists
https://FreeCourseWeb.com
Published 10/2024
MP4 | Video: h264, 1920x1080 | Audio: AAC, 44.1 KHz
Language: English | Size: 2.23 GB | Duration: 4h 43m
PySpark for Data Scientists
What you'll learn
Foundations of PySpark: Gain a solid understanding of fundamental PySpark concepts and principles.
Data Manipulation Techniques: Explore key data manipulation techniques such as dataframes, RDDs, and SQL queries in PySpark.
Distributed Data Processing: Learn techniques for distributed data processing and optimisation.
Data Preparation: Understand and implement strategies for data cleaning and transformation.
Requirements
Basic Understanding of Python Programming: This includes familiarity with libraries such as NumPy and Pandas.
Knowledge of Data Science Fundamentals: Understanding of data manipulation, exploratory data analysis, and basic machine learning concepts.
Familiarity with Big Data Concepts: Basic knowledge of big data concepts and distributed computing is beneficial but not required.