Durga Gadiraju

CCA 175 Spark and Hadoop Developer using Scala – Introduction

This is the reference material for CCA 175 Spark and Hadoop Developer using Scala. Agenda Introduction Curriculum Required Skills Setup Environment HDFS and YARN Data Sets Windows Environment (labs) Introduction CCA Spark and Hadoop Developer is well recognized certification in the industry Conducted by Cloudera – a major Big Data vendor– ITVersity played key role …

CCA 175 Spark and Hadoop Developer using Scala – Introduction Read More »

Resolve performance problems/errors in cluster operation

Let us discuss some of the common performance problems or errors in cluster operation. We might see performance problems/errors in almost all the services. But most common ones are related to applications. We typically run applications using one of these – Map Reduce, Spark, Impala, HBase etc. Map Reduce and Spark are typically run using …

Resolve performance problems/errors in cluster operation Read More »