Apache: Big Data North America 2017 will be held at the Intercontinental Miami in Miami, Florida. 

Register now for the event taking place May 16-18, 2017. 
Thursday, May 18 • 4:40pm - 5:30pm
Multi-Model Big Data Platform for Complex Real Estate Analytics - Karthik Karuppaiya, Ten-X

Building an online real-estate marketplace is an extremely complex, high-touch business. The data the business deals with ranges from scanned PDFs and complex Excel spreadsheets to transactional RDBMSes and clickstream data. Data engineering at Ten-X has spent the last couple of years building a highly effective multi-model data platform that brings all of this data together and analyses it to help the business make better decisions and move faster. In this talk we will discuss how our data platform evolved, including the technology choices we made and why we made them. Our data lake is built as a multi-model platform on top of technologies including Hadoop, JanusGraph, Spark, Hive, Cassandra and HBase. We will also introduce you to some of the complex pattern matching algorithms and Natural Language Processing techniques we have implemented on our platform.

Speakers
Karthik Karuppaiya

Sr. Engineering Manager, Data and Analytics, Ten-X
Leading the Data Engineering team at Ten-X. Have been working on Hadoop and NoSQL technologies since 2010. Currently helping to build the next generation Data Platform for Ten-X using Hadoop, Kafka, JanusGraph, Spark and Cassandra. Prior to Ten-X, I led the Big Data Engineering team...


Thursday May 18, 2017 4:40pm - 5:30pm EDT
Windsor
  SQL
  • Experience Level Any