Durga Gadiraju

configuring files and important properties – Running Jobs

As we have changed the properties with respect to node manager capacity, let us run randomtextwriter again and see how long it take. We can override individual properties at runtime using -D and multiple properties using -conf and xml file similar to yarn-site.xml or mapred-site.xml. https://gist.github.com/dgadiraju/f2852840916b1e79f4fb6830d93c8b22 Now let us run word count program from hadoop examples …

configuring files and important properties – Running Jobs Read More »

Configuration files and Important Properties – Overview

Unlike plain vanilla distribution and other vendor distributions, Cloudera manages configuration files a bit different. Typically configuration files will be in /etc/hadoop/conf. But when it comes to Cloudera, /etc/hadoop/conf will only have templates. Actual properties files are managed under /var/run/cloudera-scm-agent/process on each node. hadoop-env.sh – for memory settings of Resource Manager, Node Manager etc. core-site.xml …

Configuration files and Important Properties – Overview Read More »