Comparative Performance Analysis of AWS EC2 Instance Types Commonly Used for Hadoop Clusters

Comparative Performance Analysis of AWS EC2 Instance Types Commonly Used for Hadoop Clusters

Thursday, June 21
10:20 AM - 11:00 AM
Meeting Room 230B

Many organizations today have already migrated Hadoop workloads to cloud infrastructure or they are actively planning to do such a migration. A common question in this scenario is "Which instance types should I use for my Hadoop cluster?" There are nuances to cloud infrastructure that require careful consideration when deciding which instances types to use. This session will show the results of performance comparison of Amazon Web Services (AWS) Elastic Compute Cloud (EC2) instance types commonly used in Hadoop clusters. More importantly, we will discuss the relative cost comparison of these instance types to demonstrate the which AWS instances offer the best price to performance ratio using standard benchmarks. Attendees of this session with leave with a better understanding of the performance of AWS EC2 instance types when used for Hadoop workloads and be able to make more informed decisions about which instance types makes the most sense for their needs.

Presentation Video

SPEAKERS

Michael Young
Senior Solutions Engineer
Hortonworks
Michael is a Senior Solutions Engineer at Hortonworks on the Public Sector team. He has worked in the public sector space for 19 years in a broad variety of IT roles, with more than 13 years’ experience as a Solutions Architect. Michael has been focused on the Hadoop space for the last 4 years. His other “big data” passion is information retrieval using Solr and Elasticsearch.
Marcus Waineo
Principal Solutions Engineer
Hortonworks
Marcus has been helping Federal, State, and Local governments adopt transformative big data technologies at Hortonworks for the last 4 years. Prior to joining Hortonworks, Marcus was a Computer Scientist at the US Department of Justice where he drove the adoption of Open Source Software and big data technologies such as Apache Hadoop, Apache Accumulo, Apache Nifi and related technology to solve analytic problems for agents and analysts in support of the U.S. National Security mission.