Using spark-shell, we can run ad hoc code to confirm that it works. Doing so also confirms whether the installation was successful.
spark-shell
val orderItems = sc.textFile("C:\\data\\retail_db\\order_items")
val revenuePerOrder = orderItems.
  map(oi => (oi.split(",")(1).toInt, oi.split(",")(4).toFloat)).
  reduceByKey(_ + _).
  map(oi => oi._1 + "," + oi._2)
revenuePerOrder.take(10).foreach(println)
On Windows, spark-shell might throw an exception after showing the output.
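To see what the spark-shell snippet computes without needing the data set, here is a minimal sketch of the same logic on plain Scala collections. It swaps reduceByKey for groupBy plus sum, so it is not the Spark API itself. The three sample rows are made up for illustration; the only thing taken from the code above is that field 1 is used as the key and field 4 as the amount to add up.

```scala
// Hypothetical sample rows (not real data): field 1 is the key, field 4 the amount
val sampleLines = Seq(
  "1,1,957,1,100.0,100.0",
  "2,2,1073,1,50.0,50.0",
  "3,2,502,1,25.5,25.5"
)

// Same parsing as the spark-shell code; groupBy + sum stands in for reduceByKey
val revenuePerOrder = sampleLines.
  map(line => (line.split(",")(1).toInt, line.split(",")(4).toFloat)).
  groupBy(_._1).
  map { case (orderId, pairs) => (orderId, pairs.map(_._2).sum) }

// Print one "key,total" line per key, sorted by key
revenuePerOrder.toSeq.sortBy(_._1).
  map(r => s"${r._1},${r._2}").
  foreach(println)
// prints:
// 1,100.0
// 2,75.5
```

The trailing dots follow the style of the original snippet, which lets the statements be pasted into the shell line by line.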
We can validate the jar file by using spark-submit:
spark-submit is the main command to submit the job
--class retail_db.GetRevenuePerOrder, to pass the class name
--master
spark-submit --class retail_db.GetRevenuePerOrder <PATH_TO_JAR> local <INPUT_PATH> <OUTPUT_PATH>
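The command above passes three values after the jar path: local, an input path, and an output path. As a rough sketch only, a class that accepts them could look like the following. The argument order (master, input, output), the application name, and the use of saveAsTextFile are assumptions based on that command and on the spark-shell code, not the confirmed contents of the jar.

```scala
package retail_db

import org.apache.spark.{SparkConf, SparkContext}

object GetRevenuePerOrder {
  def main(args: Array[String]): Unit = {
    // Assumed argument order: args(0) = master, args(1) = input path, args(2) = output path
    val conf = new SparkConf().
      setMaster(args(0)).
      setAppName("Get revenue per order") // hypothetical application name
    val sc = new SparkContext(conf)

    // Same transformation as in spark-shell: sum field 4 for each value of field 1
    val orderItems = sc.textFile(args(1))
    val revenuePerOrder = orderItems.
      map(oi => (oi.split(",")(1).toInt, oi.split(",")(4).toFloat)).
      reduceByKey(_ + _).
      map(oi => oi._1 + "," + oi._2)

    // Write the result instead of printing it; the output directory must not already exist
    revenuePerOrder.saveAsTextFile(args(2))
    sc.stop()
  }
}
```

With this layout, passing local as the first argument runs the job in local mode on the machine where spark-submit is invoked.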