Loading…
Apache: Big Data North America 2017 will be held at the Intercontinental Miami in Miami, Florida. 

Register now for the event taking place May 16-18, 2017. 
Spark [clear filter]
Tuesday, May 16
 

11:05am EDT

Starting with Apache Spark, Best Practices and Learning from the Field - Felix Cheung, Microsoft
Apache Spark is one of the most popular Big Data platform. In this talk we will have a quick introduction of some of the high-level concepts in Spark and its various modules: SQL, Streaming, ML, Graph and Structured Streaming.

Then we will go through some of the current Best Practices to operationalize Spark for better performance in production, and tips to detect and avoid some of the most common issues.

And lastly we will explore how some enterprises are building solutions with Spark.

Speakers
avatar for Felix Cheung

Felix Cheung

Engineering Manager, Uber
Felix started in the big data space about 5 years ago with the then state-of-the-art MapReduce. Since then, he (re-)built Hadoop cluster from metal more times than he would like, created a Hadoop “distro” from two dozens or so projects into .rpm/.deb, and kicked off clusters in... Read More →


Tuesday May 16, 2017 11:05am - 11:55am EDT
Trianon
  Spark

12:05pm EDT

Profiling Spark Applications - Jayesh Thakrar, Conversant
Are you interested in harnessing and analyzing the data that drives the Spark Web UI? Are you keen to use that data to tune your applications or understand fluctuations in runtime of your production applications? Do you want to understand the efficiency of your Spark executors and system resources?

This presentation will help you do that and more, by walking through the wealth of data in Spark application events. This data can be used as a foundation for a Spark profiler and advisor that analyzes application events in batch or real-time.

Speakers
avatar for Jayesh Thakrar

Jayesh Thakrar

Sr. Software Engineer, Conversant
Jayesh Thakrar is a Sr. Data Engineer at Conversant (http://www.conversantmedia.com/). He is a data geek who gets to build and play with large data systems consisting of Hadoop, Spark, HBase, Cassandra, Flume and Kafka. To rest after a good day's work, he uses OpenTSDB with 500+ million... Read More →



Tuesday May 16, 2017 12:05pm - 12:55pm EDT
Trianon
  Spark

2:30pm EDT

Writing Apache Spark Applications Using Apache Bahir - Luciano Resende & Leucir Marin, IBM
Big Data is all about being to access and process data in various formats, and from various sources. Apache Bahir provides extensions to distributed analytic platforms providing them access to different data sources. In this talk, we will introduce you to Apache Bahir and its various connectors that are available for Apache Spark and Apache Flink. We will also go over the details of how to build, test and deploy a Spark Application using the MQTT data source for the new Apache Spark Structure Streaming functionality.

Speakers
LM

LEUCIR MARIN

Sr. Software Engineer, IBM
avatar for Luciano Resende

Luciano Resende

Architect, Spark Technology Center, IBM
Luciano Resende is an Architect in IBM Analytics. He has been contributing to open source at The ASF for over 10 years, he is a member of ASF and is currently contributing to various big data related Apache projects including Spark, Zeppelin, Bahir. Luciano is the project chair for... Read More →


Tuesday May 16, 2017 2:30pm - 3:20pm EDT
Trianon
  Spark

3:30pm EDT

Creating a Recommender System with ElasticSearch & Apache Spark - Alvaro Santos Andres, Ericsson
Recommender Systems have changed the way companies and people interact with each other. Does your organisation need a 360° view of their customer? Today it is possible to recommend the right products to customers or potential customers. For example, a film based on their previous interests or a new accessory that fits their model of smartphone.

The technology behind recommender systems has evolved significantly over the past 20 years and with the explosion of Big Data technologies, there are tools that can create very powerful recommender systems. This introduction will explain how Recommender Systems work, describing their main functionalities, and providing some basic algorithms frequently used in such systems. We will look at how to create a Recommender System using technologies like Apache Spark and ElasticSearch.

Speakers
avatar for Alvaro Santos Andres

Alvaro Santos Andres

Big Data Solution Architect, Ericsson
Big Data Software Architect with more than 10 years of experience. Since 3 years ago, I am focused 100% of the time on Big Data projects in which I have developed several Personalization services used by millions of users given them a better experience and Company Data transformations.Born... Read More →


Tuesday May 16, 2017 3:30pm - 4:20pm EDT
Trianon
  Spark
 
Filter sessions
Apply filters to sessions.