Waiting for Hive queries to finish teaches your analysts patience and respect for technology. Unfortunately, that is not what they expect and not what you get paid for. Interactive SQL on Hadoop has been the Holy Grail for the Hadoop community and for our analysts at Allegro, the biggest e-commerce platform in Central and Eastern Europe. We have read several benchmark papers on alternatives to Hive and have run benchmarks of our own, but they did not answer two questions: which one to choose, and whether it is worth adding a Hive alternative to the existing stack. Some technologies performed better with Parquet, others with ORC. None of the benchmarks considered user experience, adoption of a new technology within an existing stack, or the productivity of query development. In this talk we present how we ended up with Presto, along with our tips and tricks for hacking it.
Senior Data Platform Engineer, Grupa Allegro Sp. z o.o.
Mainly interested in:
- big data platform architecture
- data governance
Enthusiast of scalable distributed solutions, large-scale data processing, and continuous improvement.
Senior Data Platform Engineer, Allegro Group Sp. z o.o.
For 6 years he has been part of the Infrastructure and Services Maintenance Team, where he provides technical support for the scrum teams and maintains multiple services in the Allegro Group's portfolio. He is now developing big data solutions. Passionate about web technologies and...
Thursday May 18, 2017 11:20am - 12:10pm EDT
Windsor