Preparing Data Sets for Understanding Map-Reduce Libraries
Preparing Data Sets¶ We will be primarily using orders and order_items data set to understand about manipulating collections. orders is available at path /data/retail_db/orders/part-00000 order_items is available at path /data/retail_db/order_items/part-00000 orders – columns order_id – it is of type integer and unique order_date – it can be considered as string order_customer_id – it is of …
Preparing Data Sets for Understanding Map-Reduce Libraries Read More »