Problem Statement

  • Problem statement: get the top N products per day by revenue
  • Datasets: orders, order_items, products, categories, departments
  • Output: order_date, product_name and revenue
  • Consider only COMPLETE and CLOSED orders
  • Create a Scala object: GetTopNProductsPerDay

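import org.apache.spark.{SparkConf, SparkContext}

object GetTopNProductsPerDay {

  def main(args: Array[String]): Unit = {
    // args(0) is the Spark master URL (for example, local)
    val conf = new SparkConf().setMaster(args(0)).setAppName("Get top n Products per day by revenue")
    val sc = new SparkContext(conf)
    // Log errors only, to keep the console output readable
    sc.setLogLevel("ERROR")
  }
}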
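
Before wiring the datasets into Spark, it can help to see the requirement from the problem statement on plain Scala collections. The following is only a minimal sketch under stated assumptions: the case classes Order, OrderItem and Product, their field names, and the object name TopNSketch are hypothetical simplifications and not the real dataset schemas, and the Spark version would express the same steps (filter, join, aggregate, rank per day) with RDD operations instead.

```scala
// Hypothetical, simplified records; the real datasets carry more columns.
case class Order(orderId: Int, orderDate: String, status: String)
case class OrderItem(orderId: Int, productId: Int, subtotal: Double)
case class Product(productId: Int, name: String)

object TopNSketch {
  // Returns (order_date, product_name, revenue), at most n rows per date.
  def topNPerDay(
      orders: Seq[Order],
      items: Seq[OrderItem],
      products: Seq[Product],
      n: Int
  ): Seq[(String, String, Double)] = {
    // Keep only COMPLETE and CLOSED orders, keyed by order id.
    val dateByOrder = orders
      .filter(o => o.status == "COMPLETE" || o.status == "CLOSED")
      .map(o => o.orderId -> o.orderDate)
      .toMap
    val nameByProduct = products.map(p => p.productId -> p.name).toMap

    // Inner join: items whose order or product is missing are dropped.
    val joined = for {
      item <- items
      date <- dateByOrder.get(item.orderId)
      name <- nameByProduct.get(item.productId)
    } yield ((date, name), item.subtotal)

    // Sum revenue per (date, product).
    val revenue = joined
      .groupBy(_._1)
      .map { case ((date, name), rows) => (date, name, rows.map(_._2).sum) }
      .toSeq

    // Within each date, keep the n highest-revenue products.
    revenue
      .groupBy(_._1)
      .toSeq
      .sortBy(_._1)
      .flatMap { case (_, rows) => rows.sortBy(-_._3).take(n) }
  }
}
```

Calling TopNSketch.topNPerDay(orders, items, products, 3) returns the rows ordered by date, with revenue descending inside each date. Products with equal revenue on the same day come back in no guaranteed order, so a real implementation may want an explicit tie-break.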