Hive LLAP: A High Performance, Cost-effective Alternative to Traditional MPP Databases

Thursday, May 23
2:00 PM - 2:40 PM
Marquis Salon 12

At Walmart Labs, we serve close to 200 million customers every week across our 11,000+ stores and online, all over the world. As part of our data lake initiatives, we started a full-fledged migration to Hadoop-based solutions for all our data needs, at lower cost than traditional RDBMS/MPP solutions. While we have seen significant success in migrating from traditional RDBMS-based data warehouses to Hadoop-based data lake solutions, one challenge we have faced is migrating end users to Hadoop, because of query latency issues. To solve this problem and to reduce the cost of the solution, Walmart Labs started using Hive LLAP.

In this session, we will introduce you to Hive LLAP: its architecture, best practices for deploying it to achieve sub-second query performance, and a cost comparison with traditional RDBMS systems for the same use case.

Naveen Peddamail
Sr. Manager - Software Engineering
Naveen is an Engineering Manager with 7+ years of data engineering, data science, and analytics experience across the retail, finance, and marketing industries. In his current role at WalmartLabs, Naveen leads Walmart's Customer Experience and Store Marketing engineering team, and one of his key initiatives in the last few months has been to bring the various data assets at Walmart under a single data lake platform. He has worked with a range of database technologies throughout his career and has worked extensively on the entire Hadoop stack at WalmartLabs. Over the past few years, he has led several teams building end-to-end data and visualization platforms, and has evaluated and implemented multiple query acceleration and SQL-on-Hadoop layers, such as Druid, LLAP, Spark, Kinetica, and SAP HANA, to power Walmart's BI platforms. Speaking and presentation experience: 'Data Cafe: Enabling Real Time Insights Through Visualization' at the NWA IISE conference, and 'Data Cafe: Ask Me Anything - Bot Framework using NLP' at the Bentonville Data Science Meetup. Naveen's work at WalmartLabs was featured in Forbes as one of "The Most Practical Big Data Use Cases Of 2016".
Abhishek Gupta
Software Engineer - 3 Tech
I am Abhishek Gupta, with around 4 years of professional experience in the IT industry, currently working at Walmart Labs as a Software Engineer 3 - Tech. At Walmart, I work on the Data Lake Initiative, practicing the principles of different pillars of data solutions such as Data Architecture, Data Engineering, Metadata Management, and Data Governance. From a tools and technologies standpoint, I am an active user of Hadoop, Hive, Spark, Spring Boot, and others. Prior to this, I worked for more than 2 years in data warehousing and business intelligence at AIG (American International Group). I pursued my Master's in Management Information Systems, with a specialization in Data Analytics, at the University of Arizona. I enjoy public speaking, and I am starting to step into knowledge sharing through talks and sessions. Recently, I had the chance to be a host at the "Open Data Science Conference West 2018". Always have a vision!