My caption 😄

Statistical Models vs Machine Learning: A Reflection with Two Case Studies

Abstract

Previous applications in the urban planning field heavily lean toward the artificial intelligence approach for developing decision support system (for example, @tanic_urban_1986 and @silva_dna_2004). In this paper, I first define and introduce the two approaches to data analysis (what), and explain why/where both approaches have their appeal. I then demonstrate when and how each approach has its strength and weakness through two case studies with real data. In the first case study, I show that imputing missing data for the National Household Travel Survey with machine learning techniques enables more robust regression analysis while capturing the uncertainties introduced in the imputation process. In the second case study, again with the NHTS data, I show how machine learning algorithms facilitate variable selection and consideration of non-linear and interaction effects, which can help inform estimations of regression models, and when machine learning techniques can replace or complement statistical models. Through a review of the development of machine learning methods and their applications in social science and two case studies with actual data commonly used in transportation research, I hope to show the value of machine learning to planning researchers.

Date
Oct 13, 2017 10:15 AM
Location
Denver, CO
Associate Professor of Urban Studies and Planning

Dr. Wang is an Associate Professor of Urban Studies and Planning and affiliated professor of Civil Engineering at Portland State University.