Apache: Big Data North America 2017 will be held at the Intercontinental Miami in Miami, Florida. 

Register now for the event taking place May 16-18, 2017. 
Back To Schedule
Tuesday, May 16 • 11:05am - 11:55am
Starting with Apache Spark, Best Practices and Learning from the Field - Felix Cheung, Microsoft

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Feedback form is now closed.
Apache Spark is one of the most popular Big Data platform. In this talk we will have a quick introduction of some of the high-level concepts in Spark and its various modules: SQL, Streaming, ML, Graph and Structured Streaming.

Then we will go through some of the current Best Practices to operationalize Spark for better performance in production, and tips to detect and avoid some of the most common issues.

And lastly we will explore how some enterprises are building solutions with Spark.

avatar for Felix Cheung

Felix Cheung

Engineering Manager, Uber
Felix started in the big data space about 5 years ago with the then state-of-the-art MapReduce. Since then, he (re-)built Hadoop cluster from metal more times than he would like, created a Hadoop “distro” from two dozens or so projects into .rpm/.deb, and kicked off clusters in... Read More →

Tuesday May 16, 2017 11:05am - 11:55am EDT