“This course provides a comprehensive introduction to Big Data concepts and
practical implementation using Apache Hadoop and Apache Spark. Participants
will explore the sources, characteristics, and importance of Big Data, along
with the technologies used for its storage and processing. The course delves
into Apache Spark’s ecosystem, emphasizing Resilient Distributed Datasets
(RDDs) and their operations. A crash course in Python programming is included,
focusing on essential packages such as NumPy, Matplotlib, and pandas. Through
practical labs and real-world projects using PySpark, students will gain hands-on
experience in Big Data analytics tasks such as churn prediction and word count
analysis.”
Upcoming Events
40 Hours, Class ID: 25462, Course, English, In-Class
June 21, 2026
Big Data Analytics
Mohandessin Premises

