- Problem Statement-Get top n Products per day by revenue
- Datasets– orders, order_items, products, categories, departments
- Output-order_date,product_name and revenue
- Consider only COMPLETE and CLOSED orders
- Create scala class – GetTopNProductsPerDay
object GetTopNProductsPerDay{
def main(args:Array[String]):Unit = {
val conf = new SparkConf().setMaster(args[0]).setAppName("Get top n Products per day by revenue")
val sc = new SparkContext(conf)
sc.setlogLevel("ERROR")
}