- Details – Duration 20 minutes
- Data is available in HDFS
/public/randomtextwriter
- Get word count for the input data using space as delimiter
- Number of executors should be 10
- Executor memory should be 3 GB
- Executor cores should be 20 in total (2 per executor)
- Number of output files should be 8
- Target Directory:
/user/<YOUR_USER_ID>/solutions/solution05/wordcount
- Target File Format: Avro
- Compression: N/A
- Data is available in HDFS
- Validation