March 12, 2023

Section 10:119.Sorting Data within Groups Using DISTRIBUTE BY and SORT BY

Leave a Comment / Uncategorized / By Durga Gadiraju

Sorting data within groups using DISTRIBUTE BY and SORT BY in Hive is a way to sort the data within each group based on one or more columns. This is useful when you want to group your data and sort it within each group to get more meaningful insights. Use the SELECT statement with DISTRIBUTE …

Section 10:119.Sorting Data within Groups Using DISTRIBUTE BY and SORT BY Read More »

Section 10:118.Global Sorting Using ORDER BY

Leave a Comment / Uncategorized / By Durga Gadiraju

Use the ORDER BY clause in a SELECT statement to sort the data based on a specific column. You can sort the data by multiple columns by including them in the ORDER BY clause. By default, the sorting is done in ascending order. You can specify descending order by adding the DESC keyword after the …

Section 10:118.Global Sorting Using ORDER BY Read More »

Section 10:117.Performing Basic Aggregations Sum, Min,Max Using Group by

Leave a Comment / Uncategorized / By Durga Gadiraju

Performing basic aggregations using HAVING in Hive is a way to filter the results of a query based on a condition applied to an aggregate function result. HAVING clause is used with the GROUP BY clause and it filters the groups based on a condition specified after the HAVING keyword. To perform basic aggregations using …

Section 10:117.Performing Basic Aggregations Sum, Min,Max Using Group by Read More »

Section 10:116.Performing Basic Aggregations Sum, Min,Max Using Group by

Leave a Comment / Uncategorized / By Durga Gadiraju

Performing basic aggregations using GROUP BY in Hive is a common task in data analysis and data processing. Hive is a data warehousing tool that provides a SQL-like interface to query and analyze large datasets stored in Hadoop Distributed File System (HDFS). To perform basic aggregations using GROUP BY in Hive, you can follow these …

Section 10:116.Performing Basic Aggregations Sum, Min,Max Using Group by Read More »

Section 10:107. Basic Aggregations Using Aggregate

Leave a Comment / Uncategorized / By Durga Gadiraju

In Apache Hive, the AGGREGATE function is used to perform basic aggregations on data. The AGGREGATE function takes a column or expression as an argument and returns the aggregate result for that column or expression. Here are some basic aggregations that can be performed using AGGREGATE in Hive: COUNT: The COUNT aggregation returns the number …

Section 10:107. Basic Aggregations Using Aggregate Read More »

Day: March 12, 2023

Section 10:119.Sorting Data within Groups Using DISTRIBUTE BY and SORT BY

Section 10:118.Global Sorting Using ORDER BY

Section 10:117.Performing Basic Aggregations Sum, Min,Max Using Group by

Section 10:116.Performing Basic Aggregations Sum, Min,Max Using Group by

Section 10:107. Basic Aggregations Using Aggregate

Join Our Community

Follow Us

Links

Contact Info

Address

Phone

Email