Data Frame Operations – Getting Started

Different Dataframe Operations-

In the upcoming videos we will be working on problem statement-Get top n products by revenue, consider only complete and closed orders.Our input data sets are -Orders,order_items,products.The different operations we will be performing is-

  • Selection or projection
  • Filtering datasets
  • Aggregations like sum,avg,min,max
  • Sorting and ranking etc.

First create a Scala class GetTopProductsPerDay.