This course is for people looking for an introduction to the Hadoop and Spark ecosystem. We will cover the essentials required to start learning both technologies with the help of ITVersity Labs.
We will cover the following topics:
- Python Pre-Requisites: Collections and Map-Reduce
- Using ITVersity Labs
- Cluster Overview Using Ambari
- Distributed File System
- Basics of HDFS
- Spark Example
- Introduction to Spark
- Spark RDD and Actions
- Spark RDD Transformations
- Practice Exercises
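
As a preview of the first topic, here is a minimal sketch of the map-reduce style using plain Python collections. The `orders` data and the `completed_revenue` function are hypothetical and made up for illustration; they are not taken from the course material. The sketch only shows the general pattern of filtering, mapping, and reducing a collection, which is the same pattern the Spark RDD topics build on.

```python
from functools import reduce

# Hypothetical sample data: (order_id, status, amount)
orders = [
    (1, "COMPLETE", 100.0),
    (2, "PENDING", 50.0),
    (3, "COMPLETE", 25.5),
]


def completed_revenue(records):
    """Sum the amounts of all records whose status is COMPLETE."""
    # filter: keep only completed orders
    completed = filter(lambda order: order[1] == "COMPLETE", records)
    # map: extract the amount from each remaining order
    amounts = map(lambda order: order[2], completed)
    # reduce: add the amounts together, starting from 0.0
    return reduce(lambda total, amount: total + amount, amounts, 0.0)


total = completed_revenue(orders)
print(total)
```

Passing `0.0` as the initial value to `reduce` means the function also works on an empty collection instead of raising an error.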