Quite often we have to build Big Data clusters using plain vanilla distributions rather than vendor distributions such as Cloudera or Hortonworks. Setting up such a cluster manually is not practical; instead, we need server automation tools like Puppet, Chef, or Ansible. We are going to set up a 7-node Hadoop (HDFS + YARN) cluster using Ansible.
Here are the skills covered in this course:
- Virtualization and Vagrant
- AWS Basics
- Installation of Ansible and running individual commands
- Developing Ansible playbooks
- Setting up Hadoop (HDFS and YARN with MapReduce 2)
- Setting up Spark