Apache: Big Data North America 2017 will be held at the Intercontinental Miami in Miami, Florida. 

Register now for the event taking place May 16-18, 2017. 
Back To Schedule
Thursday, May 18 • 12:20pm - 1:10pm
MOHA: Many-Task Computing Framework on Hadoop - Soonwook Hwang, Korea Institute of Science and Technology Information

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Feedback form is now closed.
In this talk, we present design and implementation of MOHA (MTC on Hadoop) framework which can effectively combine Many-Task Computing (MTC) technologies with Big Data platform Hadoop to enable more rich data analytics workflows in the ecosystem. MTC is a new computing paradigm that can consist of, e.g., millions of small tasks where each task communicates through files resulting in another type of data-intensive workload. MOHA is developed as one of YARN applications so that it can transparently cohost existing MTC applications with other Big Data processing frameworks in a single Hadoop cluster. MOHA can substantially reduce the overall execution time of many-task processing with minimal amount of resources compared to an existing Hadoop YARN application by effectively exploiting open-source distributed message queues (Apache ActiveMQ, Kafka) and streamlined task dispatching mechanism.

avatar for Soonwook Hwang

Soonwook Hwang

Principal Researcher, KISTI
Dr. Soonwook Hwang is a principal researcher at Korea Institute of Science and Technology Information (KISTI), where he is responsible for the research and development of enabling technologies for the realization of cyber infrastructure for Korea. KISTI is running the biggest national... Read More →

Thursday May 18, 2017 12:20pm - 1:10pm EDT

Attendees (6)