Day: February 21, 2023

Section 6:49. Overview of Hive Properties -Set and .hiverc

Hive properties are configuration settings that control the behavior of the Hive runtime. They can be set at various levels, including the system level, the Hive server level, and the session level. Here are two common ways to set Hive properties: SET command: You can set Hive properties at the session level using the SET …

Section 6:49. Overview of Hive Properties -Set and .hiverc Read More »

Section 6:48. Launching and using Hive CLI

Hive CLI (Command Line Interface) is a tool that provides a command-line interface to interact with the Hive database. It allows users to run HiveQL queries and perform administrative tasks such as creating databases and tables, managing partitions, and executing scripts. Hive CLI is typically used by developers, analysts, and administrators who prefer a command-line …

Section 6:48. Launching and using Hive CLI Read More »

Section 5:45.Getting metadata of Files using “hdfs fsck”

The  hdfs fsck  command is a tool used in HDFS (Hadoop Distributed File System) to check the health of the file system and diagnose any issues.  hdfs fsck  can be used to check the consistency of file system metadata, such as block placement and replication, and to detect and correct any inconsistencies. Here’s an example …

Section 5:45.Getting metadata of Files using “hdfs fsck” Read More »

Section 5:43. Understanding Size of the File System and Data Sets using df and du

The  hadoop fs -df -h  command in HDFS (Hadoop Distributed File System) is used to display the disk space used and disk space available in a human-readable format. The -df    option is used to show disk usage statistics and the  -h  option is used to display the output in a human-readable format (i.e., using units …

Section 5:43. Understanding Size of the File System and Data Sets using df and du Read More »