Big Data Analytics
DATE & TIME:
May 25, 2025
All Day
LOCATION:
This course provides a comprehensive introduction to Big Data concepts and their practical implementation using Apache Hadoop and Apache Spark. Participants will explore the sources, characteristics, and importance of Big Data, along with the technologies used to store and process it.
The course also covers the Apache Spark ecosystem, with an emphasis on Resilient Distributed Datasets (RDDs) and their operations. It includes a crash course in Python programming that focuses on essential packages such as NumPy, Matplotlib, and Pandas. Through practical labs and real-world projects using PySpark, participants will gain hands-on experience with Big Data analytics tasks such as churn prediction and word count analysis.