Big Data on Cloud (Hadoop and Spark on AWS)

Current Status
Not Enrolled
Price
94.95
Get Started
This course is currently closed

Are you familiar with Big Data Technologies such as Hadoop and Spark and planning to understand how to build Big Data pipelines leveraging pay as you go model of cloud such as AWS?

This course is answer for that.

Pre-requisites

  • Basic programming using Python or Scala or both
  • Good knowledge about distributed file systems such as HDFS
  • Experience or Knowledge with distributed resource management frameworks such as YARN or Mesos
  • Good knowledge about distributed computing frameworks such as Map Reduce and Spark
  • Basic knowledge about Data Warehousing, ETL, Data Integration frameworks

Curriculum

Here is the curriculum for the course. If you are already familiar with Big Data technologies you can quickly add this important skill by going through this course.

  • Overview of AWS barebones (EC2, S3, EBS, Networking, Security, CLI etc)
  • Overview of AWS analytical services and comparison between on-premise cluster vs. cloud services. This session includes creating EMR cluster using quick options.
  • Step Execution and other advanced options of EMR
  • Quick revision of programming language – Scala 2.11
  • Quick revision of programming language – Python 3.x (including Dataframes)
  • Development life cycle of Spark 2 applications using Scala (using IntelliJ)
  • Development life cycle of Spark 2 applications using Python (using Pycharm)
  • Running Scala and Python applications on EMR Cluster

We might have another course in near future where we will be covering DynamoDB, Kinesis etc to deep dive into other services under analytics services category of AWS.

There is no lab associated with the course. You might have to pay money to AWS to get your hands dirty as demonstrated in the course.

Notable Replies

  1. Here are the differences:

    • Live training
    • Covers both Scala and Python
    • Advanced concepts of EMR - Step Execution and other advanced features of EMR
    • Discount for future courses on AWS (e.g.: Building Streaming pipelines using Kinesis and DynamoDB)

    If you have already signed up with Udemy, you do not have to sign up for course or I can give discount with price difference.

  2. Hello,

    I have already brough Udemy course for AWS EMR and Spark using Scala , But I didnt start the course yet . Please let me know if this is different content from that available in Udemy.

  3. Is life long access to videos and course material for the course Big Data on Cloud (Hadoop and Spark on AWS) possible if we purchase the course or can this course be made available on Udemy. Thank you.

  4. I want to purchaage this course

Continue the discussion discuss.itversity.com

Participants