Apache: Big Data North America 2017 will be held at the Intercontinental Miami in Miami, Florida. 

Register now for the event taking place May 16-18, 2017. 
Thursday, May 18 • 3:40pm - 4:30pm
Performance Benchmarking in Open-Source at Amazon EMR - Stephen Tak Lon Wu, Amazon AWS EMR

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Feedback form is now closed.
Amazon EMR is a cloud-based provider that allows companies, research centers and academic divisions to leverage managed clusters at massive scale. In order to maintain and achieve performance in the open-source world of big data processing, Amazon EMR built an automatic performance benchmarking pipeline to aid in validating a new release prior to release. Why do we need this performance benchmarking pipeline? Open source communities move fast; innovations and implementations often need multiple iterations in order to effectively work at massive scale. Amazon EMR aims to provide a stable service; historical performance metrics help us to preview and capture the issues of each product before releasing to the market, meanwhile Amazon EMR is following closely to the open source releases.



Amazon EMR
Tak Lon (Stephen) Wu is a software development engineer of Amazon EMR. Before joining the company, he was working toward his PhD at Indiana University and got his candidate in late 2015. His research interests are Big data application analysis, MapReduce, data mining and performance... Read More →

Thursday May 18, 2017 3:40pm - 4:30pm EDT
  Big Data