This event has ended. View the official site or create your own event → Check it out
This event has ended. Create your own
Apache: Big Data North America 2017 will be held at the Intercontinental Miami in Miami, Florida. 

Register now for the event taking place May 16-18, 2017. 
View analytic
Wednesday, May 17 • 2:30pm - 3:20pm
Nexmark, a Unified Framework to Evaluate Big Data Processing Systems with Apache Beam - Ismael Mejia & Etienne Chauchot, Talend

Sign up or log in to save this to your schedule and see who's attending!

Feedback form is now closed.
Big Data processing in real-time is on the rise at Apache with projects like Apache Spark, Apache Flink or Apache Apex. However at this moment we don’t have a unified framework to evaluate the correctness and the performance of these systems. Apache Beam implements a unified model to write both Batch and Streaming jobs with a single API and execute them independently in any of the supported platforms (runners), this makes Beam an ideal candidate to support an evaluation framework.

In this talk we will present Nexmark, a benchmark framework to evaluate queries over data streams. An implementation of Nexmark was donated by Google as part of the Apache Beam incubation process. Nexmark bridges the gap for evaluating data processing frameworks, but also serves as a rich integration test to evaluate the correct implementation of both the Beam runners and the new features of the Beam SDK.

avatar for Etienne Chauchot

Etienne Chauchot

Etienne has been working in software engineering and architecture for more than 13 years in domains such as retail or financial groups. He has been focusing on Big Data for a few years on technologies such as Apache Cassandra, ElasticSearch or Apache Spark. He is an Open Source f... Read More →
avatar for Ismael Mejia

Ismael Mejia

Open Source Software Engineer, Talend
Ismaël Mejía is an Apache Beam committer and a software engineer at Talend. He loves to tackle complex problems and build simple and elegant solutions. His main area of focus is distributed systems (Big Data and Cloud). He has been working on web services and large scale system... Read More →

Wednesday May 17, 2017 2:30pm - 3:20pm
  • Experience Level Any

Attendees (23)