Apache: Big Data North America 2017 will be held at the Intercontinental Miami in Miami, Florida. 

Register now for the event taking place May 16-18, 2017. 
Tuesday, May 16 • 11:05am - 11:55am
Apache Mahout: An Extendable Machine Learning Framework for Spark and Flink - Trevor Grant, IBM

A serious obstacle to developing distributed machine learning algorithms is the shortage of people who understand the mathematics, understand distributed data, AND have free time. Further, most distributed engines have APIs that were not designed to be mathematically expressive, so implementations are hard to follow and must be reviewed by another qualified person. The Mahout project has spent two years building modular system bindings for distributed engines such as Apache Spark and Apache Flink, native solvers to enable CPU/GPU acceleration, an abstracted R-like Scala DSL for tensor algebra on distributed matrices, and a consistent API for implementing distributed algorithms. The result is an extendable, new-contributor-friendly framework for machine learning. We’ll also discuss the project vision for creating a CRAN-like repository of user-contributed algorithms and how we are evangelizing this vision.


Speakers
Trevor Grant

IBM
Trevor Grant is a PMC member of the Apache Mahout project and a contributor to the Apache Streams (incubating), Apache Zeppelin, and Apache Flink projects. By day he is an Open Source Technical Evangelist at IBM. In former roles he called himself a data scientist, but the term is so ov...


Tuesday May 16, 2017 11:05am - 11:55am
Balmoral
