Apache: Big Data North America 2017 will be held at the Intercontinental Miami in Miami, Florida. 

Register now for the event taking place May 16-18, 2017. 
Back To Schedule
Tuesday, May 16 • 2:30pm - 3:20pm
Online and Offline Analytics on Cassandra in eBay - DongQian Liu, eBay

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Feedback form is now closed.
ebay is one of largest e-commerce company in the world, providing C2C and B2C sales services via the Internet. We use Cassandra to store large tables for online query. To reduce the Cassandra load, we do offline Analytics of Cassandra table, we dump sstables to HDFS and transform to Hadoop file formats. In this session, we introduce how we build high-performance, cross datacenter Cassandra cluster for online query, and for offline Analytics, we introduce how we implement splittable input format for sstables and transform to Hadoop file formats. We also introduce how we use bulk loader tool to load data from Hadoop to Cassandra quickly.


Tuesday May 16, 2017 2:30pm - 3:20pm EDT