Geospatial data platform at Uber

Thursday, June 21
11:30 AM - 12:10 PM
Grand Ballroom 220B

From determining the most convenient rider pickup points to predicting the fastest routes, Uber aims to use data-driven analytics to create seamless trip experiences. Within engineering, analytics inform decision-making processes across the board.

One of Uber's distinct challenges is analyzing geospatial big data. City locations, trips, and event information, for instance, provide insights that can improve business decisions and better serve users. Geospatial analysis is particularly challenging at big data scale: typical requests include computing how many rides start at a transit location or how many drivers are crossing state lines. For these analytical requests, we must achieve efficiency, usability, and scalability in order to meet user needs and business requirements.

To accomplish this, we use Hadoop, Hive, and Presto in our production environment to process the big data powering our interactive SQL engine. In this talk, we discuss our engineering effort to optimize geospatial queries across the whole Hadoop stack.

Speakers

Zhenxiao Luo
Engineering Manager
Uber
Zhenxiao leads the Interactive Analytics team at Uber. Previously, he led the development and operations of Presto at Netflix and worked on big data and Hadoop-related projects at Facebook, Cloudera, and Vertica. Zhenxiao holds a master's degree from the University of Wisconsin-Madison and a bachelor's degree from Fudan University.
Lu Niu
Sr Software Engineer
Uber
Lu is a member of the Hadoop Infrastructure team at Uber, working mainly on Presto and geospatial data analysis. Before Uber, Lu worked on the Yahoo Ads and Yahoo Finance teams, building Yahoo's user profile system and financial data serving system, respectively. Lu holds a master's degree in computer science from the University of Southern California and a bachelor's degree from Sun Yat-Sen University.