Durga Gadiraju

Configuring Database Engines – Hive and Impala

As part of this section, we will see how to set up the Hive in the Cloudera distribution. We will also understand important concepts related to Hive. Setup Hive and Impala Validating Hive and Impala Components and Properties of Hive Troubleshooting Hive issues Hive Commands and Queries – Overview Different Query Engines Components and Properties …

Configuring Database Engines – Hive and Impala Read More »

Review Concepts

Let us review some of the important concepts in Google Cloud Platform. Instance Templates Instance Groups VM Instances Boot Disks Additional Storage or Disks Internal IP and External IP SSH Options – gcloud and regular SSH Instance states – Start, Stop, Reset, Delete Firewall Rules

Setup Spark 2.3.x  

Once parcels is setup we can setup Spark 2.3.x on the existing cluster. Download Oracle JDK 1.8 on all the servers – ansible all -i hosts -a ” wget –no-check-certificate -c –header ‘Cookie: oraclelicense=accept-securebackup-cookie’ http://download.oracle.com/otn-pub/java/jdk/8u191-b12/2787e4a523244c269598db4e85c51e0c/jdk-8u191-linux-x64.rpm ” –private-key=~/.ssh/google_compute_engine Install Oracle JDK 1.8 on all the servers – ansible all -i hosts -a ” rpm -ivh jdk-8u191-linux-x64.rpm ” –become –private-key=~/.ssh/google_compute_engine …

Setup Spark 2.3.x   Read More »