Exercise 05 – Develop word count program

  • Details – Duration 20 minutes
    • Data is available in HDFS /public/randomtextwriter
    • Get word count for the input data using space as delimiter
    • Number of executors should be 10
    • Executor memory should be 3 GB
    • Executor cores should be 20 in total (2 per executor)
    • Number of output files should be 8
    • Target Directory: /user/<YOUR_USER_ID>/solutions/solution05/wordcount
    • Target File Format: Avro
    • Compression: N/A
  • Validation

Share this post