What is AWS EMR?
Amazon EMR is a web service that enables businesses, researchers, data analysts, and developers to easily and cost-effectively process vast amounts of data. It utilizes a hosted Hadoop framework running on the web-scale infrastructure of Amazon Elastic Compute Cloud (Amazon EC2) and Amazon Simple Storage Service (Amazon S3).
In this Lesson, we will see
EMR is Elastic MapReduce, MapReduce is primarily to process data at a scale and elastic is to mitigate the costs leveraging cloud pay as you go model.
In an Enterprise, data has to be processed once or twice in a day, but not continuously. For those scenarios, having a cluster is not a good idea. EMR adds value here, by leveraging the cloud to mitigate the costs. EMR is a service in analytics category.