Uncategorized

Section 6:48. Launching and using Hive CLI

Hive CLI (Command Line Interface) is a tool that provides a command-line interface to interact with the Hive database. It allows users to run HiveQL queries and perform administrative tasks such as creating databases and tables, managing partitions, and executing scripts. Hive CLI is typically used by developers, analysts, and administrators who prefer a command-line …

Section 6:48. Launching and using Hive CLI Read More »

Section 5:45.Getting metadata of Files using “hdfs fsck”

The  hdfs fsck  command is a tool used in HDFS (Hadoop Distributed File System) to check the health of the file system and diagnose any issues.  hdfs fsck  can be used to check the consistency of file system metadata, such as block placement and replication, and to detect and correct any inconsistencies. Here’s an example …

Section 5:45.Getting metadata of Files using “hdfs fsck” Read More »

Section 5:43. Understanding Size of the File System and Data Sets using df and du

The  hadoop fs -df -h  command in HDFS (Hadoop Distributed File System) is used to display the disk space used and disk space available in a human-readable format. The -df    option is used to show disk usage statistics and the  -h  option is used to display the output in a human-readable format (i.e., using units …

Section 5:43. Understanding Size of the File System and Data Sets using df and du Read More »

Section 5:42. Copying or Moving Files from one HDFS location to other HDFS location

We can also copy files within HDFS using commands like cp and mv. hadoop fs -cp  to copy files from one HDFS location to another HDFS location hadoop fs -mv  to move files from one HDFS location to another HDFS location mv is faster than cp as mv deals with only metadata whereas cp has …

Section 5:42. Copying or Moving Files from one HDFS location to other HDFS location Read More »

Section 5:41. Previewing Text Files in HDFS

The “cat” and “tail” commands in HDFS are used to view the contents of a file stored in the Hadoop Distributed File System. The “cat” command allows you to display the entire contents of a file in the terminal. For example: hadoop fs -cat /user/data/file.txt This command will display the contents of the file “/user/data/file.txt” …

Section 5:41. Previewing Text Files in HDFS Read More »