Effect of Weather on Uber Ridership

Similar documents
Analyzing the effect of Weather on Uber Ridership

Forecasting demand for pickups per hour in 6 New York City boroughs for Uber

ABSTRACT INTRODUCTION SUMMARY OF ANALYSIS. Paper

Predicting MTA Bus Arrival Times in New York City

Tornado Inflicted Damages Pattern

Understanding Travel Time to Airports in New York City Sierra Gentry Dominik Schunack

Issued by: National Weather Service San Joaquin Valley - Hanford CA. Issue time: 409 AM PST Wed Jan

Member Level Forecasting Using SAS Enterprise Guide and SAS Forecast Studio

Sri Harsha Ramineni Koteswarrao Sriram

Data Analysis of NYC Cab Services

Project proposal. Abstract:

Parking Study MAIN ST

Integrated Electricity Demand and Price Forecasting

3. If a forecast is too high when compared to an actual outcome, will that forecast error be positive or negative?

Fundamentals of Transmission Operations

SMART GRID FORECASTING

The Job Forecast: Partly Cloudy; Or Is It Partly Sunny? Some Bright Spots Are Emerging From A Gloomy IT Job Market. (On Topic: IT Careers).

California OES Weather Threat Briefing

Short-Term Electrical Load Forecasting for Iraqi Power System based on Multiple Linear Regression Method

Analyzing the NYC Subway Dataset

Urban Link Travel Time Estimation Using Large-scale Taxi Data with Partial Information

2017 rainfall by zip code

Ex-Ante Forecast Model Performance with Rolling Simulations

Holiday Demand Forecasting

Defining Normal Weather for Energy and Peak Normalization

Understanding Michigan snowfall. Jim Keysor - NWS Gaylord

SHORT TERM LOAD FORECASTING

Responsive Traffic Management Through Short-Term Weather and Collision Prediction

Normalization of Peak Demand for an Electric Utility using PROC MODEL

Capacity Market Load Forecast

June 17, 2015 Lab Exercise

What Does It Take to Get Out of Drought?

Advance signals that can be used to foresee demand response days PJM SODRTF - March 9, 2018

If looking for the ebook by David Southgate The job forecast: partly cloudy; or is it partly sunny? Some bright spots are emerging from a gloomy IT

New York City Taxis: Demand and Revenue in an Uber World

Complete Weather Intelligence for Public Safety from DTN

Put the Weather to Work for Your Company

Ridership Patterns in an Urban Bike Share System

Developing a Mathematical Model Based on Weather Parameters to Predict the Daily Demand for Electricity

INDIAN OCEAN STATE February 14, 2018 MJO INDEX

Shutdown: NYCT s Extreme Weather Strategy for Safety and Recovery

A Multistage Modeling Strategy for Demand Forecasting

NatGasWeather.com Daily Report

DAILY QUESTIONS 28 TH JUNE 18 REASONING - CALENDAR

DRIVING ROI. The Business Case for Advanced Weather Solutions for the Energy Market

Holiday Demand Forecasting in the Electric Utility Industry

NYS Mesonet Data Access Policy

Weather and Travel Time Decision Support

NatGasWeather.com Daily Report

Use of Normals in Load Forecasting at National Grid

Skupos Data Helps Convenience Stores Predict Inventory Needs Before Hurricanes

Preliminary Material Data Sheet

Use of Weather Inputs in Traffic Volume Forecasting

CropCast Corn and Soybean Report Kenny Miller Tuesday, March 14, 2017

US History, Ms. Brown Need Help? or Call

Appendix BAL Baltimore, Maryland 2003 Annual Report on Freeway Mobility and Reliability

2013 Weather Normalization Survey. Itron, Inc El Camino Real San Diego, CA

Eric Snodgrass Co-Founder and Senior Atmospheric Scientist of Agrible, Inc Director of Undergraduate Studies Department of Atmospheric Sciences

CustomWeather Statistical Forecasting (MOS)

Visualizing Big Data on Maps: Emerging Tools and Techniques. Ilir Bejleri, Sanjay Ranka

CoCoRaHS Condition Monitoring. Webinar with Carolinas Observers January 25, 2017

Lab 21. Forecasting Extreme Weather: When and Under What Atmospheric Conditions Are Tornadoes Likely to Develop in the Oklahoma City Area?

At the start of the talk will be a trivia question. Be prepared to write your answer.

Paper SA-08. Are Sales Figures in Line With Expectations? Using PROC ARIMA in SAS to Forecast Company Revenue

CropCast Corn and Soybean Report Kenny Miller Monday, March 13, 2017

HAZUS th Annual Conference

CropCast Corn and Soybean Report Kenny Miller Friday, March 17, 2017

CropCast Corn and Soybean Report Kyle Tapley Monday, March 20, 2017

Snowfall totals by zip code in ny

When One Size No Longer Fits All Electric Load Forecasting with a Geographic Hierarchy WHITE PAPER

The Impact of Weather on Bus Ridership in Pierce County, Washington

Earth Science The Weather

2129 NORTH MAIN STREET HOTE PROJECT ULI SHARED PARKING STUDY City of Santa Ana, California

Impact on Agriculture

Copyright 2010 Pearson Education, Inc. Publishing as Prentice Hall.

Operations Report. Tag B. Short, Director South Region Operations. Entergy Regional State Committee (ERSC) February 14, 2018

The UK Flood Forecasting Centre

Thank you for choosing AIMS!

Champaign-Urbana 2001 Annual Weather Summary

An Approach of Correlation Inter-variable Modeling with Limited Data for Inter-Bus Transformer Weather Sensitive Loading Prediction

Seasonal Hazard Outlook

Confidence Intervals for Comparing Means

Messages of the Day November 2012

EVALUATION OF ALGORITHM PERFORMANCE 2012/13 GAS YEAR SCALING FACTOR AND WEATHER CORRECTION FACTOR

WEATHER NORMALIZATION METHODS AND ISSUES. Stuart McMenamin Mark Quan David Simons

How Accurate is My Forecast?

Bus Routes, New York NY, August 2017

Potentially Wet Wed-Fri

An overview of Wet Season Forecasting in the Northern Territory

Weather Company Data for Advanced Analytics

Application and verification of ECMWF products 2012

TopGolf Inc: Demand Forecasting

Earth Networks ENcast 6- Day Hourly Lat- Lon Forecast Feed

A Prepared Marylander Creates a Resilient Maryland

IWT Scenario 1 Integrated Warning Team Workshop National Weather Service Albany, NY October 31, 2014

Guided Notes Weather. Part 2: Meteorology Air Masses Fronts Weather Maps Storms Storm Preparation

ECML PKDD Discovery Challenges 2017

Tropical Update. 5 AM EDT Tuesday, September 12, 2017 Post-Tropical Cyclone Irma, Hurricane Jose, & Tropical Wave (10%)

ONE REGION: PLANNING FOR A STRONG AND JUST METROPOLIS. NYLON # 14: Regional Planning. October 31, 2018

WeatherManager Weekly

Transcription:

SAS and all other SAS Institute Inc. product or service names are registered trademarks or trademarks of SAS Institute Inc. in the USA and other countries. indicates USA registration. Other brand and product names are trademarks of their respective companies.

ABSTRACT RESULTS (CLICK TO EDIT) Uber has changed the face of taxi ridership, making it more convenient and comfortable for riders. But, there are times when customers are left unsatisfied because of shortage of vehicles which ultimately led to Uber adopting surge pricing. It s a very difficult task to forecast number of riders at different locations in a city at different points in time. This gets more complicated with changes in weather. In this paper we attempt to estimate the number of trips per borough on a daily basis in New York City. We add an exogenous factor, weather to this analysis to see how it impacts the changes in number of trips. We fetched six month worth data (approximately 1 million records) of Uber rides in New York City ranging from January 2015 to June 2015 from github. We also gathered weather data (from Weather Underground) of New York City Borough wise for the same period of six months from Jan 2015 to June 2015. In this poster, we attempted to analyze Uber data and weather data together to estimate the change in the number of trips per borough due to changing weather conditions. We used SAS Forecast Studio and built a model to predict the number of trips per day for the one week ahead forecast for each borough of the New York City. Data Preparation and Analysis METHODS We merged three datasets namely Uber_PickUp_data, Taxi_LookUp_Zone and Weather data to obtain the final dataset as shown in Fig. 1. In the first step, we combined Uber_PickUp_Data and Taxi_Lookup_Zone datasets to bring uber rides to borough level (Merged1 in Fig. 1). We considered Uber rides information for five boroughs of New York City: Manhattan, Brooklyn, Bronx, Queens and Staten Island for our analysis. Final dataset Uber_Final is obtained by merging Merged1 dataset with Weather dataset. Information about the various weather metrics taken for analysis are listed in the Table 1. Uber Location Merged1 Merged1 Weather Uber_Final Dataset Uber_PickUp_data Taxi_Lookup_Zone Weather Variables Fig. 1 (Data Preparation) Table 1 Dispatching_Base_Num, Date_of_Pickup,Time, Affiliated_Base_Num, LocationID Location_ID, Borough, Zone Date,Temperature, Humidity, Sea_level_pressure, Precipitation (Max, Mean and Min ), Cloud_Cover, Wind, Event Fig. 2: Number of Rides by Borough Fig. 3: Average Number of Rides by Event The Initial exploratory analysis revealed following insights: Manhattan has the largest user base being the most populous county and financial capital of world followed by Brooklyn and Queens. On an average a rainy day has reported more number of rides followed by snow than a normal day. Fig. 4: Average Number of Rides by Borough and Event Fog seems to impact the number of rides in Bronx more than any other borough. Fog-Rain and Rain- Thunderstorms are influencing the number of rides in Queens the most. Rain and Snow is increasing the number of rides in Manhattan more than any other event. Considering all five boroughs, Wednesdays are the busiest day representing mid week followed by Sunday (weekend). Saturdays have the least turnout. SAS and all other SAS Institute Inc. product or service names are registered trademarks or trademarks of SAS Institute Inc. in the USA and other countries. indicates USA registration. Other brand and product names are trademarks of their respective companies. Fig. 5: Mean Number of Rides by Day of Week

Exploratory Analysis(Contd.) Exploratory Analysis(Contd.) Fig. 6: Mean Number of Rides by Borough by Day of Week Fig. 7: Number of Rides by Month Wednesdays are the busiest day for Manhattan followed by Sunday. Whereas Sundays turn out to be the busiest day in the other boroughs. The month of May shows the peak number of rides when compared with the other months. April has the least number of rides. Fig. 10: Mean Change in Number of Rides by Day Transition Data Cleaning and Transformation Mean change in number of rides is highest when compared between Saturday vs. Sunday and Tuesday vs. Wednesday. There is not much change between Wednesday and Thursday Uber_Pickup_Data has ridership information missing for certain days. We used average of immediate previous interval non missing value and immediate next interval non missing values to fill in the missing values. We used SAS Enterprise Guide s create time series data node to produce the final time series data on a daily level. We used SAS Enterprise Miner s Transform Variables node to transform number of rides and weather metrics to bring the data to normality. Time Series Forecasting Fig. 8: Mean Change in Number of Rides by Borough by Event Fi.g 9: Mean Change in Number of Rides by Borough by Weather Transition Mean change in number of rides is highest on a Rainy day with Snow in every borough except Bronx. In Bronx the change in number of rides is highest on a foggy day. On an average change in number of rides is most prevalent when there is a transition of weather from normal to rain-snow or fog-snow. We used SAS Forecast Studio for Desktop 14.1 version to perform time series forecasting. We considered one month of rides (June) as the holdout sample and borough as a by group variable to generate forecasts at borough level. Set the cutoff to 2% for the detection of outliers. ARIMA Model(Top_1) has been chosen as the best model based on holdout MAPE. Mean Temperature with a lag(1), Mean Visibility Miles, Wind direction, cloud cover and sea pressure turned out to be significant in predicting the number of rides.

Time Series Forecasting Configuration Diagnostic Plots There appears to be no significant correlation in the residuals. Fig. 14: Diagnostic Plots Fig. 11: Seasonality Configuration Fig. 12: Diagnostic Configuration Forecast Results Fig 13: Hold out sample selection ARIMA Models selected at different borough levels Fig 14: MAPE values for models at Borough levels Fig. 15: Forecast Output Bronx Forecasted Rides Brooklyn Forecasted Rides Queens Forecasted Rides

Snigdha Gutha, Anusha Mamillapalli Forecast Results References http://toddwschneider.com/posts/a-tale-of-twenty-two-million-citi-bikes-analyzing-the-nyc-bike-share-system/%20- %20citibike-weather https://newsroom.uber.com/uberdata-uber-for-style-and-comfort/ Manhattan forecasted Rides Staten Island forecasted Rides Conclusion Acknwoledgement We thank Dr. Goutam Chakraborty, Professor, Department of Marketing, Director of Business Analytics program- Oklahoma State University for his continuous support and guidance. The above results show that weather does influence Uber ridership, especially when it is little distracted from normal but not too extreme again. Manhattan being the most populous city is bringing a huge variation in the number of rides especially on a rainy day. The demand is highest during Wednesdays in Manhattan and Sundays in other boroughs. Staten Island has the least Uber user base, Hence the Mean Absolute Percent Error turned out to be highest as there are not many rides to predict. Future Work The scope of the project can be extended to hourly analysis. Analyzing at least a year worth of data will bring further insights. Demand can be more accurately predicted if the actual number of rides requested information is available along with the live number of rides.

SAS and all other SAS Institute Inc. product or service names are registered trademarks or trademarks of SAS Institute Inc. in the USA and other countries. indicates USA registration. Other brand and product names are trademarks of their respective companies.