Getting started with Hadoop on Amazon Web Services
Last night, I gave a presentation at the Boulder/Denver Hadoop Meetup on getting started with Amazon’s Elastic MapReduce (EMR). EMR combines Apache Hadoop with Amazon’s cloud resources and web services to provide a scalable, on-demand MapReduce solution for analyzing very large data sets.
The presentation contains step-by-step instructions for getting your first simple project running on EMR, along with a short overview of the MapReduce way of thinking about data problems. The slides also contain many links to tools, papers, and examples.
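To give a flavor of the MapReduce way of thinking mentioned above, here is a minimal sketch of the classic word-count problem in plain Python. It is not taken from the slides and does not use Hadoop or EMR at all; the function names (`map_phase`, `shuffle`, `reduce_phase`, `word_count`) are made up for illustration, and the in-memory `shuffle` only stands in for the grouping step that a real framework performs across machines.

```python
from collections import defaultdict


def map_phase(line):
    """Map step: emit a (word, 1) pair for every word in one input line."""
    for word in line.lower().split():
        yield word, 1


def shuffle(pairs):
    """Group values by key. A real framework does this between map and reduce."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(word, counts):
    """Reduce step: combine all values seen for one key into a single result."""
    return word, sum(counts)


def word_count(lines):
    """Run map, shuffle, and reduce in sequence over an iterable of lines."""
    pairs = (pair for line in lines for pair in map_phase(line))
    return dict(reduce_phase(word, counts) for word, counts in shuffle(pairs).items())


if __name__ == "__main__":
    sample = ["the quick brown fox", "the lazy dog", "the fox"]
    print(word_count(sample))
```

The point of the split is that each call to `map_phase` looks at one line in isolation and each call to `reduce_phase` looks at one key in isolation, which is what lets a framework such as Hadoop spread the work over many machines.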