Hierarchical Additive Modeling of Nonlinear Association with Spatial Correlations

Size: px
Start display at page:

Download "Hierarchical Additive Modeling of Nonlinear Association with Spatial Correlations"

Transcription

1 1 ENAR09 LSUHSC p. 1/18 Hierarchical Additive Modeling of Nonlinear Association with Spatial Correlations Qingzhao Yu, Bin Li, Richard Scribner Louisiana State University, School of Public Health Supported by: NIAAA-5 R01 AA (PI Richard Scribner)

2 1 ENAR09 LSUHSC p. 2/18 Objective Data and Challenges Multivariate Additive Regression Trees Conditional Autoregressive Model Hierarchical Additive Model Model Convergence Results

3 1 ENAR09 LSUHSC p. 3/18 The aim is to determine whether a change in the alcohol environment results in a change of assaultive violence rates in the neighborhood. Hypothesis: Neighborhood alcohol outlet density is positively associated with alcohol related crimes, such as the assaultive crime.

4 1 ENAR09 LSUHSC p. 4/18 In 1992, a civil unrest occurred over a large area of south central Los Angeles. During the civil unrest, a number of alcohol outlets were damaged, so that the alcohol density decreased in that area. We restrict our analysis to the areas affected by civil unrest thereby controlling for a possible global effect of the unrest on outcomes. We control for the other factors that are associated with both alcohol density and the related crime rates.

5 1 ENAR09 LSUHSC p. 5/18 There are a total of 290 census tracts. All data are collected at the census tract level. We collected 10 years data: 1990 to 1999.

6 1 ENAR09 LSUHSC p. 5/18 There are a total of 290 census tracts. All data are collected at the census tract level. We collected 10 years data: 1990 to Alcohol Environment: data are obtained from the PRC and California ABC.

7 1 ENAR09 LSUHSC p. 5/18 There are a total of 290 census tracts. All data are collected at the census tract level. We collected 10 years data: 1990 to Alcohol Environment: data are obtained from the PRC and California ABC. outlet license surrender data following the 1992 Civil Unrest: revealing 279 outlets suspended operation following the 1992 unrest, 252 of those outlets were off-sale type outlets. annual count of offsale and onsale alcohol outlets

8 1 ENAR09 LSUHSC p. 6/18 Sociodemographic Data. These data are collected through US Census and LA county department of health services for the years 1990 to The variables we used include: % Black, % White, % Hispanic, % Asian, % male, % adults and % households in poverty

9 1 ENAR09 LSUHSC p. 6/18 Sociodemographic Data. These data are collected through US Census and LA county department of health services for the years 1990 to The variables we used include: % Black, % White, % Hispanic, % Asian, % male, % adults and % households in poverty Outcomes. The crime rates are collected from the LAPD annual police reports for the years 1990 to The data collected include: Assaultive Violence Rate: homicide, rape, robbery and assault. Complicated interactions might exist among covariates.

10 1 ENAR09 LSUHSC p. 7/18 The association between the assault violence rates and other covariate maybe nonlinear.

11 1 ENAR09 LSUHSC p. 7/18 The association between the assault violence rates and other covariate maybe nonlinear. Complicated interactions might exist among covariates.

12 1 ENAR09 LSUHSC p. 7/18 The association between the assault violence rates and other covariate maybe nonlinear. Complicated interactions might exist among covariates. More than 7% of the observations are missing one or more values for some covariates.

13 1 ENAR09 LSUHSC p. 7/18 The association between the assault violence rates and other covariate maybe nonlinear. Complicated interactions might exist among covariates. More than 7% of the observations are missing one or more values for some covariates. we should take into account the spatial correlations among observations in adjacent tracts.

14 MART is a special case of the generic gradient boosting approach. 1 ENAR09 LSUHSC p. 8/18

15 1 ENAR09 LSUHSC p. 8/18 MART is a special case of the generic gradient boosting approach. MART approximates the target function F(x) by an additive expansion of trees ˆf(x) = M m=1 νb H (x;γ m ).

16 1 ENAR09 LSUHSC p. 8/18 MART is a special case of the generic gradient boosting approach. MART approximates the target function F(x) by an additive expansion of trees ˆf(x) = M m=1 νb H (x;γ m ). Friedman, 2001, Friedman and Meulman, 2003.

17 1 ENAR09 LSUHSC p. 9/18 MART is able to capitalize on the nonlinear relationships between the dependent and independent variables with no need for specifying the basis functions.

18 1 ENAR09 LSUHSC p. 9/18 MART is able to capitalize on the nonlinear relationships between the dependent and independent variables with no need for specifying the basis functions. MART is able to capture complex and/or high order interaction effects.

19 1 ENAR09 LSUHSC p. 9/18 MART is able to capitalize on the nonlinear relationships between the dependent and independent variables with no need for specifying the basis functions. MART is able to capture complex and/or high order interaction effects. MART can handle mixed-type predictors and missing values in covariates.

20 1 ENAR09 LSUHSC p. 10/18 We use the vector {φ Ti,C i } to capture spatial autocorrelations among areas C i at time T i. Assume that an area C i is correlated with only the areas that are adjacent to it. y with a hierarchical structure on its mean function: y i N(µ i, σ 2 ) and µ i = f(x i ) + φ Ti,C i ; (1) The hierarchical CAR structure for {φ Ti,C i } has the form φ Ti,C i =j φ Ti,C i j N 1 1 φ Ti,k,. n j n j τ Ti k j

21 1 ENAR09 LSUHSC p. 11/18 1. Let φ 0 T i,c i = 0 where C i {1,..., C}, T i {1,..., T }; q = 0, δ = 1000, µ 1i = 0 and i = 1,..., n. 2. If δ <, go to step 3, otherwise q=q+1 and (a) Let z i = y i φ [q 1] T i,c i. Fit MART f [q] (x) on z and the covariates x. (b) Let e i = y i f [q] (x i ), calculate the Moran s I of e i within time slots 1 to T. Let S be the collection of time slots in which the spatial correlation test show a p-value smaller than 0.01(this indicates a strong spatial correlation in e i ).

22 1 ENAR09 LSUHSC p. 12/18 (c) If S is empty, let φ [q] T i,c i = 0 and go to step 3; otherwise for the observations i {i : T i S}, let the f(x i ) in Equation (1) be f [q] (x i ) and calculate the generalized MLEs of ˆφ Ti,C i. Let φ [q] T i S,C i = ˆφ Ti,C i and φ [q] T i / S,C i = 0. (d) Let µ 0i = µ 1i, µ 1i = f [q] (x i ) + φ [q] T i,c i and let δ = go back to Output the results from step q. n i=1 (µ 1i µ 0i ) 2 n i=1 µ2 1i,

23 1 ENAR09 LSUHSC p. 13/18 Little is known of the convergence properties of the above procedure.

24 1 ENAR09 LSUHSC p. 13/18 Little is known of the convergence properties of the above procedure. ICM (Iterated Conditional Modes), Besag (1986).

25 1 ENAR09 LSUHSC p. 13/18 Little is known of the convergence properties of the above procedure. ICM (Iterated Conditional Modes), Besag (1986). Under certain conditions, iterative conditional maximization converges to local maximizers. Meng and Rubin (1993).

26 1 ENAR09 LSUHSC p. 13/18 Little is known of the convergence properties of the above procedure. ICM (Iterated Conditional Modes), Besag (1986). Under certain conditions, iterative conditional maximization converges to local maximizers. Meng and Rubin (1993). Backfitting process, Hastie and Tibshirani, 2000; Buja et al

27 1 ENAR09 LSUHSC p. 13/18 Little is known of the convergence properties of the above procedure. ICM (Iterated Conditional Modes), Besag (1986). Under certain conditions, iterative conditional maximization converges to local maximizers. Meng and Rubin (1993). Backfitting process, Hastie and Tibshirani, 2000; Buja et al Let = 10 7 in our analysis.

28 1 ENAR09 LSUHSC p. 14/18 pctoffsurryn2 pctonsurryn2 hispanic male_15_30 damage asian totaldensity year black white poverty onsurryn offsurryn hispanic offsale onsale male_15_30 damage asian year black white poverty Relative Variable Importance Relative Variable Importance Figure 1: Relative variable importance in MART-fitted models.

29 Partial dependence Partial dependence poverty black Partial dependence Partial dependence white year Partial dependence Partial dependence totaldensity male_15_30 Figure 2: Partial dependence plots. 1 ENAR09 LSUHSC p. 15/18

30 1 ENAR09 LSUHSC p. 16/18 Year Origin P-value Res1 P-value Res2 P-value < e < e < e < e < e < e < e < e < e < e

31 1 ENAR09 LSUHSC p. 17/18 values for loghatrate98 (4) < 0.0 values for logmartrate98 (2) < 0.0 N (20) N (117) (8) (113) (128) (165) (21) >= 3.0 (2) >= 3.0 values for spatialerror km (6) < values for randomerror km (7) < N (51) N (165) (67) (135) (65) (76) (3) >= 0.75 (5) >= 0.75

32 1 ENAR09 LSUHSC p. 18/18

Multilevel spatio-temporal dual changepoint models for relating alcohol outlet destruction and changes in neighbourhood rates of assaultive violence

Multilevel spatio-temporal dual changepoint models for relating alcohol outlet destruction and changes in neighbourhood rates of assaultive violence Geospatial Health 2(2), 2008, pp. 161-172 Multilevel spatio-temporal dual changepoint models for relating alcohol outlet destruction and changes in neighbourhood rates of assaultive violence Qingzhao Yu

More information

MODULE 12: Spatial Statistics in Epidemiology and Public Health Lecture 7: Slippery Slopes: Spatially Varying Associations

MODULE 12: Spatial Statistics in Epidemiology and Public Health Lecture 7: Slippery Slopes: Spatially Varying Associations MODULE 12: Spatial Statistics in Epidemiology and Public Health Lecture 7: Slippery Slopes: Spatially Varying Associations Jon Wakefield and Lance Waller 1 / 53 What are we doing? Alcohol Illegal drugs

More information

Community Health Needs Assessment through Spatial Regression Modeling

Community Health Needs Assessment through Spatial Regression Modeling Community Health Needs Assessment through Spatial Regression Modeling Glen D. Johnson, PhD CUNY School of Public Health glen.johnson@lehman.cuny.edu Objectives: Assess community needs with respect to particular

More information

Environmental Criminology

Environmental Criminology ENVIRONMENTAL CRIMINOLOGY AND THE IMPACT OF PROBLEM BUSINESSES ON CRIME IN PORTLAND NEIGHBORHOODS Presented by Ryan Arnold Hector Osuna Allen Byrd Environmental Criminology Annually crime plays a large

More information

arxiv: v1 [stat.ap] 23 Jan 2019

arxiv: v1 [stat.ap] 23 Jan 2019 arxiv: arxiv:0000.0000 SPATIAL MODELING OF TRENDS IN CRIME OVER TIME IN PHILADELPHIA By Cecilia Balocchi and Shane T. Jensen University of Pennsylvania arxiv:1901.08117v1 [stat.ap] 23 Jan 2019 Understanding

More information

A Micro-Spatial Analysis of Violent Crime

A Micro-Spatial Analysis of Violent Crime A Micro-Spatial Analysis of Violent Crime The Experience of a Small Southern City Isaac T. Van Patten Radford University Levels of Measurement Macro-spatial analysis Crime at the state, regional, or national

More information

Making Our Cities Safer: A Study In Neighbhorhood Crime Patterns

Making Our Cities Safer: A Study In Neighbhorhood Crime Patterns Making Our Cities Safer: A Study In Neighbhorhood Crime Patterns Aly Kane alykane@stanford.edu Ariel Sagalovsky asagalov@stanford.edu Abstract Equipped with an understanding of the factors that influence

More information

UC POLICE DEPARTMENT REPORTS DASHBOARD

UC POLICE DEPARTMENT REPORTS DASHBOARD UC POLICE DEPARTMENT REPORTS DASHBOARD UC MERCED Annual 1. UC Merced FBI Part I Crime Offenses 2 2. UC Merced FBI Part II Crime Offenses 3 3. UC Merced Arrests - FBI Crime Offenses 4 4. UC Merced Value

More information

1Department of Demography and Organization Studies, University of Texas at San Antonio, One UTSA Circle, San Antonio, TX

1Department of Demography and Organization Studies, University of Texas at San Antonio, One UTSA Circle, San Antonio, TX Well, it depends on where you're born: A practical application of geographically weighted regression to the study of infant mortality in the U.S. P. Johnelle Sparks and Corey S. Sparks 1 Introduction Infant

More information

Belton Police Department

Belton Police Department Belton Police Department Uniform Crime Report for June 2013 RETURN-A - MONTHLY RETURN OF OFFENSES KNOWN TO THE POLICE 1 Classification of offenses 1 Criminal Homicide A Murder/nonnegligent homicide 2 Offenses

More information

13.1 Categorical Data and the Multinomial Experiment

13.1 Categorical Data and the Multinomial Experiment Chapter 13 Categorical Data Analysis 13.1 Categorical Data and the Multinomial Experiment Recall Variable: (numerical) variable (i.e. # of students, temperature, height,). (non-numerical, categorical)

More information

Evaluating racial clusters as a unit of aggregation for neighborhood effects. Extended Abstract. Draft. Do not cite.

Evaluating racial clusters as a unit of aggregation for neighborhood effects. Extended Abstract. Draft. Do not cite. Evaluating racial clusters as a unit of aggregation for neighborhood effects. Extended Abstract Draft. Do not cite. Jonathan Tannen Ph.D. Candidate, Princeton University March 24, 2014 Introduction The

More information

Package mma. March 8, 2018

Package mma. March 8, 2018 Type Package Title Multiple Mediation Analysis Version 6.1-0 Date 2018-03-08 Author Qingzhao Yu and Bin Li Maintainer Qingzhao Yu Package mma March 8, 2018 Depends R (>= 2.14.1), gbm,

More information

Multivariate statistical methods and data mining in particle physics

Multivariate statistical methods and data mining in particle physics Multivariate statistical methods and data mining in particle physics RHUL Physics www.pp.rhul.ac.uk/~cowan Academic Training Lectures CERN 16 19 June, 2008 1 Outline Statement of the problem Some general

More information

Gradient descent. Barnabas Poczos & Ryan Tibshirani Convex Optimization /36-725

Gradient descent. Barnabas Poczos & Ryan Tibshirani Convex Optimization /36-725 Gradient descent Barnabas Poczos & Ryan Tibshirani Convex Optimization 10-725/36-725 1 Gradient descent First consider unconstrained minimization of f : R n R, convex and differentiable. We want to solve

More information

ARIC Manuscript Proposal # PC Reviewed: _9/_25_/06 Status: A Priority: _2 SC Reviewed: _9/_25_/06 Status: A Priority: _2

ARIC Manuscript Proposal # PC Reviewed: _9/_25_/06 Status: A Priority: _2 SC Reviewed: _9/_25_/06 Status: A Priority: _2 ARIC Manuscript Proposal # 1186 PC Reviewed: _9/_25_/06 Status: A Priority: _2 SC Reviewed: _9/_25_/06 Status: A Priority: _2 1.a. Full Title: Comparing Methods of Incorporating Spatial Correlation in

More information

Modeling spatial variation in the relationships between violence and alcohol and drug

Modeling spatial variation in the relationships between violence and alcohol and drug Modeling spatial variation in the relationships between violence and alcohol and drug use: A comparison of geographically weighted regression and Bayesian spatially varying coefficient process models David

More information

DIFFERENT INFLUENCES OF SOCIOECONOMIC FACTORS ON THE HUNTING AND FISHING LICENSE SALES IN COOK COUNTY, IL

DIFFERENT INFLUENCES OF SOCIOECONOMIC FACTORS ON THE HUNTING AND FISHING LICENSE SALES IN COOK COUNTY, IL DIFFERENT INFLUENCES OF SOCIOECONOMIC FACTORS ON THE HUNTING AND FISHING LICENSE SALES IN COOK COUNTY, IL Xiaohan Zhang and Craig Miller Illinois Natural History Survey University of Illinois at Urbana

More information

Cluster Analysis Techniques for Neighborhood Change

Cluster Analysis Techniques for Neighborhood Change Cluster Analysis Techniques for Neighborhood Change www.csupomona.edu/~mreibel/aag06.pdf Michael Reibel, Cal Poly Pomona Moira Regelson, Yahoo! Search Marketing The Problem Define distinct types of neighborhood

More information

Green Spaces in Healthy Places: Objective data demonstrates an association between greenness and momentary measures of physical activity in children

Green Spaces in Healthy Places: Objective data demonstrates an association between greenness and momentary measures of physical activity in children Green Spaces in Healthy Places: Objective data demonstrates an association between greenness and momentary measures of physical activity in children Estela Almanza, Michael Jerrett, Genevieve Dunton, Edmund

More information

Driving Forces of Houston s Burglary Hotspots During Hurricane Rita

Driving Forces of Houston s Burglary Hotspots During Hurricane Rita Driving Forces of Houston s Burglary Hotspots During Hurricane Rita Marco Helbich Department of Geography University of Heidelberg Heidelberg, Germany & Michael Leitner Department of Geography and Anthropology

More information

Modeling the Ecology of Urban Inequality in Space and Time

Modeling the Ecology of Urban Inequality in Space and Time Modeling the Ecology of Urban Inequality in Space and Time Corina Graif PhD Candidate, Department Of Sociology Harvard University Presentation for the Workshop on Spatial and Temporal Modeling, Center

More information

DEVELOPING DECISION SUPPORT TOOLS FOR THE IMPLEMENTATION OF BICYCLE AND PEDESTRIAN SAFETY STRATEGIES

DEVELOPING DECISION SUPPORT TOOLS FOR THE IMPLEMENTATION OF BICYCLE AND PEDESTRIAN SAFETY STRATEGIES DEVELOPING DECISION SUPPORT TOOLS FOR THE IMPLEMENTATION OF BICYCLE AND PEDESTRIAN SAFETY STRATEGIES Deo Chimba, PhD., P.E., PTOE Associate Professor Civil Engineering Department Tennessee State University

More information

U.C. Davis FBI Part I & Part II Crime Offenses 2008 to 2010

U.C. Davis FBI Part I & Part II Crime Offenses 2008 to 2010 U.C. Davis FBI Part I & Part II Crime Offenses 2008 to 2010 2010 PART I OFFENSES 2008 Number of Actual Offenses 2009 Number of Actual Offenses 2010 Number of Actual Offenses 2009 to 2010 Percent Change

More information

Simultaneous Coefficient Penalization and Model Selection in Geographically Weighted Regression: The Geographically Weighted Lasso

Simultaneous Coefficient Penalization and Model Selection in Geographically Weighted Regression: The Geographically Weighted Lasso Simultaneous Coefficient Penalization and Model Selection in Geographically Weighted Regression: The Geographically Weighted Lasso by David C. Wheeler Technical Report 07-08 October 2007 Department of

More information

Neighborhood social characteristics and chronic disease outcomes: does the geographic scale of neighborhood matter? Malia Jones

Neighborhood social characteristics and chronic disease outcomes: does the geographic scale of neighborhood matter? Malia Jones Neighborhood social characteristics and chronic disease outcomes: does the geographic scale of neighborhood matter? Malia Jones Prepared for consideration for PAA 2013 Short Abstract Empirical research

More information

FBI Part I & Part II Crime Offenses Arrests Miscellaneous Activity Value of Stolen Property Crime Pie Charts Crime Line Charts Crime Rate Charts

FBI Part I & Part II Crime Offenses Arrests Miscellaneous Activity Value of Stolen Property Crime Pie Charts Crime Line Charts Crime Rate Charts U.C. Davis Medical Center Crime Statistics (Medical Center) PDF Version FBI Part I & Part II Crime Offenses Arrests Miscellaneous Activity Value of Stolen Property Crime Pie Charts Crime Line Charts Crime

More information

Exploring the Relationship Between Vacant and Distressed Properties and Community Health and Safety

Exploring the Relationship Between Vacant and Distressed Properties and Community Health and Safety Exploring the Relationship Between Vacant and Distressed Properties and Community Health and Safety Prepared for: Western Reserve Land Conservancy Prepared by: Center on Urban Poverty and Community Development

More information

Exploratory quantile regression with many covariates: An application to adverse birth outcomes

Exploratory quantile regression with many covariates: An application to adverse birth outcomes Exploratory quantile regression with many covariates: An application to adverse birth outcomes June 3, 2011 eappendix 30 Percent of Total 20 10 0 0 1000 2000 3000 4000 5000 Birth weights efigure 1: Histogram

More information

Analysis of Violent Crime in Los Angeles County

Analysis of Violent Crime in Los Angeles County Analysis of Violent Crime in Los Angeles County Xiaohong Huang UID: 004693375 March 20, 2017 Abstract Violent crime can have a negative impact to the victims and the neighborhoods. It can affect people

More information

Causal Modeling in Environmental Epidemiology. Joel Schwartz Harvard University

Causal Modeling in Environmental Epidemiology. Joel Schwartz Harvard University Causal Modeling in Environmental Epidemiology Joel Schwartz Harvard University When I was Young What do I mean by Causal Modeling? What would have happened if the population had been exposed to a instead

More information

Spatio-Temporal Modelling of Credit Default Data

Spatio-Temporal Modelling of Credit Default Data 1/20 Spatio-Temporal Modelling of Credit Default Data Sathyanarayan Anand Advisor: Prof. Robert Stine The Wharton School, University of Pennsylvania April 29, 2011 2/20 Outline 1 Background 2 Conditional

More information

Gradient Descent. Ryan Tibshirani Convex Optimization /36-725

Gradient Descent. Ryan Tibshirani Convex Optimization /36-725 Gradient Descent Ryan Tibshirani Convex Optimization 10-725/36-725 Last time: canonical convex programs Linear program (LP): takes the form min x subject to c T x Gx h Ax = b Quadratic program (QP): like

More information

Social Epidemiology and Spatial Epidemiology: An Empirical Comparison of Perspectives

Social Epidemiology and Spatial Epidemiology: An Empirical Comparison of Perspectives Social Epidemiology and Spatial Epidemiology: An Empirical Comparison of Perspectives A DISSERTATION SUBMITTED TO THE FACULTY OF THE GRADUATE SCHOOL OF THE UNIVERSITY OF MINNESOTA BY Kelsey Nathel McDonald

More information

Chapters 9 and 10. Review for Exam. Chapter 9. Correlation and Regression. Overview. Paired Data

Chapters 9 and 10. Review for Exam. Chapter 9. Correlation and Regression. Overview. Paired Data Chapters 9 and 10 Review for Exam 1 Chapter 9 Correlation and Regression 2 Overview Paired Data is there a relationship if so, what is the equation use the equation for prediction 3 Definition Correlation

More information

Luc Anselin Spatial Analysis Laboratory Dept. Agricultural and Consumer Economics University of Illinois, Urbana-Champaign

Luc Anselin Spatial Analysis Laboratory Dept. Agricultural and Consumer Economics University of Illinois, Urbana-Champaign GIS and Spatial Analysis Luc Anselin Spatial Analysis Laboratory Dept. Agricultural and Consumer Economics University of Illinois, Urbana-Champaign http://sal.agecon.uiuc.edu Outline GIS and Spatial Analysis

More information

Environmental Analysis, Chapter 4 Consequences, and Mitigation

Environmental Analysis, Chapter 4 Consequences, and Mitigation Environmental Analysis, Chapter 4 4.17 Environmental Justice This section summarizes the potential impacts described in Chapter 3, Transportation Impacts and Mitigation, and other sections of Chapter 4,

More information

Testing Restrictions and Comparing Models

Testing Restrictions and Comparing Models Econ. 513, Time Series Econometrics Fall 00 Chris Sims Testing Restrictions and Comparing Models 1. THE PROBLEM We consider here the problem of comparing two parametric models for the data X, defined by

More information

GIS Spatial Statistics for Public Opinion Survey Response Rates

GIS Spatial Statistics for Public Opinion Survey Response Rates GIS Spatial Statistics for Public Opinion Survey Response Rates July 22, 2015 Timothy Michalowski Senior Statistical GIS Analyst Abt SRBI - New York, NY t.michalowski@srbi.com www.srbi.com Introduction

More information

A Spatial Dependency and Causality Analysis of Crime in Savannah, Georgia, 2000

A Spatial Dependency and Causality Analysis of Crime in Savannah, Georgia, 2000 A Spatial Dependency and Causality Analysis of Crime in Savannah, Georgia, 2000 Naresh Kumar Department of Geography University of Iowa Iowa City, IA 52242 Email: naresh-kumar@uiowa.edu 1 Abstract: This

More information

Gaussian Graphical Models and Graphical Lasso

Gaussian Graphical Models and Graphical Lasso ELE 538B: Sparsity, Structure and Inference Gaussian Graphical Models and Graphical Lasso Yuxin Chen Princeton University, Spring 2017 Multivariate Gaussians Consider a random vector x N (0, Σ) with pdf

More information

Spatial Disparities in the Distribution of Parks and Green Spaces in the United States

Spatial Disparities in the Distribution of Parks and Green Spaces in the United States March 11 th, 2012 Active Living Research Conference Spatial Disparities in the Distribution of Parks and Green Spaces in the United States Ming Wen, Ph.D., University of Utah Xingyou Zhang, Ph.D., CDC

More information

Can Public Transport Infrastructure Relieve Spatial Mismatch?

Can Public Transport Infrastructure Relieve Spatial Mismatch? Can Public Transport Infrastructure Relieve Spatial Mismatch? Evidence from Recent Light Rail Extensions Kilian Heilmann University of California San Diego April 20, 2015 Motivation Paradox: Even though

More information

Biostat 2065 Analysis of Incomplete Data

Biostat 2065 Analysis of Incomplete Data Biostat 2065 Analysis of Incomplete Data Gong Tang Dept of Biostatistics University of Pittsburgh October 20, 2005 1. Large-sample inference based on ML Let θ is the MLE, then the large-sample theory implies

More information

EICCARS WORKBOOK FOR INITIATING COMMUNITY PROFILES WITH STATISTICAL AND OTHER SECONDARY DATA

EICCARS WORKBOOK FOR INITIATING COMMUNITY PROFILES WITH STATISTICAL AND OTHER SECONDARY DATA The Cultural Ecology of Health and Change (CEHC) Working Papers Ethnographically Informed Community and Cultural Assessment Research Systems (EICCARS) Workbooks EICCARS WORKBOOK FOR INITIATING COMMUNITY

More information

Probabilistic Models of Past Climate Change

Probabilistic Models of Past Climate Change Probabilistic Models of Past Climate Change Julien Emile-Geay USC College, Department of Earth Sciences SIAM International Conference on Data Mining Anaheim, California April 27, 2012 Probabilistic Models

More information

Partial Generalized Additive Models

Partial Generalized Additive Models An information theoretical approach to avoid the concurvity Hong Gu 1 Mu Zhu 2 1 Department of Mathematics and Statistics Dalhousie University 2 Department of Statistics and Actuarial Science University

More information

Lecture 4: Multivariate Regression, Part 2

Lecture 4: Multivariate Regression, Part 2 Lecture 4: Multivariate Regression, Part 2 Gauss-Markov Assumptions 1) Linear in Parameters: Y X X X i 0 1 1 2 2 k k 2) Random Sampling: we have a random sample from the population that follows the above

More information

Matt Ruther Population Studies Center University of Pennsylvania. September 1, 2011

Matt Ruther Population Studies Center University of Pennsylvania. September 1, 2011 Spatial Analysis of Metropolitan Area Homicide Rates and their Relationship to Immigrant Growth and Concentration, 1970-2000 Matt Ruther Population Studies Center University of Pennsylvania September 1,

More information

LogitBoost with Trees Applied to the WCCI 2006 Performance Prediction Challenge Datasets

LogitBoost with Trees Applied to the WCCI 2006 Performance Prediction Challenge Datasets Applied to the WCCI 2006 Performance Prediction Challenge Datasets Roman Lutz Seminar für Statistik, ETH Zürich, Switzerland 18. July 2006 Overview Motivation 1 Motivation 2 The logistic framework The

More information

Using GIS to Explore the Relationship between Socioeconomic Status and Demographic Variables and Crime in Pittsburgh, Pennsylvania

Using GIS to Explore the Relationship between Socioeconomic Status and Demographic Variables and Crime in Pittsburgh, Pennsylvania Using GIS to Explore the Relationship between Socioeconomic Status and Demographic Variables and Crime in Pittsburgh, Pennsylvania Stephen E. Mitchell Department of Resource Analysis, Saint Mary s University

More information

Multi-level Models: Idea

Multi-level Models: Idea Review of 140.656 Review Introduction to multi-level models The two-stage normal-normal model Two-stage linear models with random effects Three-stage linear models Two-stage logistic regression with random

More information

Spatial Regression Modeling

Spatial Regression Modeling Spatial Regression Modeling Paul Voss & Katherine Curtis The Center for Spatially Integrated Social Science Santa Barbara, CA July 12-17, 2009 Day 4 Plan for today Focus on spatial heterogeneity A bit

More information

Space and the City: Modern Chicago (and Beyond)

Space and the City: Modern Chicago (and Beyond) Space and the City: Modern Chicago (and Beyond) Robert J. Sampson Department of Sociology Harvard University Concentration of Collective Action Events in Chicago Spatial Interdependence of Concentrated

More information

Inverse Covariance Estimation with Missing Data using the Concave-Convex Procedure

Inverse Covariance Estimation with Missing Data using the Concave-Convex Procedure Inverse Covariance Estimation with Missing Data using the Concave-Convex Procedure Jérôme Thai 1 Timothy Hunter 1 Anayo Akametalu 1 Claire Tomlin 1 Alex Bayen 1,2 1 Department of Electrical Engineering

More information

Outline. 15. Descriptive Summary, Design, and Inference. Descriptive summaries. Data mining. The centroid

Outline. 15. Descriptive Summary, Design, and Inference. Descriptive summaries. Data mining. The centroid Outline 15. Descriptive Summary, Design, and Inference Geographic Information Systems and Science SECOND EDITION Paul A. Longley, Michael F. Goodchild, David J. Maguire, David W. Rhind 2005 John Wiley

More information

Eco 391, J. Sandford, spring 2013 April 5, Midterm 3 4/5/2013

Eco 391, J. Sandford, spring 2013 April 5, Midterm 3 4/5/2013 Midterm 3 4/5/2013 Instructions: You may use a calculator, and one sheet of notes. You will never be penalized for showing work, but if what is asked for can be computed directly, points awarded will depend

More information

Lecture 5 Multivariate Linear Regression

Lecture 5 Multivariate Linear Regression Lecture 5 Multivariate Linear Regression Dan Sheldon September 23, 2014 Topics Multivariate linear regression Model Cost function Normal equations Gradient descent Features Book Data 10 8 Weight (lbs.)

More information

Boosting Methods: Why They Can Be Useful for High-Dimensional Data

Boosting Methods: Why They Can Be Useful for High-Dimensional Data New URL: http://www.r-project.org/conferences/dsc-2003/ Proceedings of the 3rd International Workshop on Distributed Statistical Computing (DSC 2003) March 20 22, Vienna, Austria ISSN 1609-395X Kurt Hornik,

More information

A Gentle Introduction to Gradient Boosting. Cheng Li College of Computer and Information Science Northeastern University

A Gentle Introduction to Gradient Boosting. Cheng Li College of Computer and Information Science Northeastern University A Gentle Introduction to Gradient Boosting Cheng Li chengli@ccs.neu.edu College of Computer and Information Science Northeastern University Gradient Boosting a powerful machine learning algorithm it can

More information

Maximum Likelihood Estimation. only training data is available to design a classifier

Maximum Likelihood Estimation. only training data is available to design a classifier Introduction to Pattern Recognition [ Part 5 ] Mahdi Vasighi Introduction Bayesian Decision Theory shows that we could design an optimal classifier if we knew: P( i ) : priors p(x i ) : class-conditional

More information

Generalized Linear Models 1

Generalized Linear Models 1 Generalized Linear Models 1 STA 2101/442: Fall 2012 1 See last slide for copyright information. 1 / 24 Suggested Reading: Davison s Statistical models Exponential families of distributions Sec. 5.2 Chapter

More information

GIS and the Built Environment

GIS and the Built Environment GIS and the Built Environment GIS Workshop Active Living Research Conference February 9, 2010 Anne Vernez Moudon, Dr. es Sc. University of Washington Chanam Lee, Ph.D., Texas A&M University OBJECTIVES

More information

Sparse Functional Models: Predicting Crop Yields

Sparse Functional Models: Predicting Crop Yields Sparse Functional Models: Predicting Crop Yields Dustin Lennon Lead Statistician dustin@inferentialist.com Executive Summary Here, we develop a general methodology for extracting optimal functionals mapping

More information

Logistic regression: Miscellaneous topics

Logistic regression: Miscellaneous topics Logistic regression: Miscellaneous topics April 11 Introduction We have covered two approaches to inference for GLMs: the Wald approach and the likelihood ratio approach I claimed that the likelihood ratio

More information

Introduction to Machine Learning. Maximum Likelihood and Bayesian Inference. Lecturers: Eran Halperin, Yishay Mansour, Lior Wolf

Introduction to Machine Learning. Maximum Likelihood and Bayesian Inference. Lecturers: Eran Halperin, Yishay Mansour, Lior Wolf 1 Introduction to Machine Learning Maximum Likelihood and Bayesian Inference Lecturers: Eran Halperin, Yishay Mansour, Lior Wolf 2013-14 We know that X ~ B(n,p), but we do not know p. We get a random sample

More information

PS 3: Sections 110 & 111

PS 3: Sections 110 & 111 PS 3: Sections 110 & 111 GSI: L. Jason Anastasopoulos janastas@berkeley.edu University of California, Berkeley April 3, 2013 For today 1. Standard Deviation 2. Correlation 3. Regression Standard Deviation

More information

Using Spatial Statistics Social Service Applications Public Safety and Public Health

Using Spatial Statistics Social Service Applications Public Safety and Public Health Using Spatial Statistics Social Service Applications Public Safety and Public Health Lauren Rosenshein 1 Regression analysis Regression analysis allows you to model, examine, and explore spatial relationships,

More information

Tracey Farrigan Research Geographer USDA-Economic Research Service

Tracey Farrigan Research Geographer USDA-Economic Research Service Rural Poverty Symposium Federal Reserve Bank of Atlanta December 2-3, 2013 Tracey Farrigan Research Geographer USDA-Economic Research Service Justification Increasing demand for sub-county analysis Policy

More information

Social and Economic Impacts of Brownfield Redevelopment in Florida New Analytical Tools to Assess the State and Tribal Response Program

Social and Economic Impacts of Brownfield Redevelopment in Florida New Analytical Tools to Assess the State and Tribal Response Program Social and Economic Impacts of Brownfield Redevelopment in Florida New Analytical Tools to Assess the State and Tribal Response Program Christian Wells Our mission is to leverage interdisciplinary scientific

More information

Spatial Variation in Local Road Pedestrian and Bicycle Crashes

Spatial Variation in Local Road Pedestrian and Bicycle Crashes 2015 Esri International User Conference July 20 24, 2015 San Diego, California Spatial Variation in Local Road Pedestrian and Bicycle Crashes Musinguzi, Abram, Graduate Research Assistant Chimba,Deo, PhD.,

More information

How do criminals locate? Crime and spatial dependence in Minas Gerais

How do criminals locate? Crime and spatial dependence in Minas Gerais How do criminals locate? Crime and spatial dependence in Minas Gerais Frédéric Puech December 2004 CERDI-CNRS, Université d Auvergne Abstract The aim of this paper is to investigate the existence of a

More information

Summary and discussion of: Exact Post-selection Inference for Forward Stepwise and Least Angle Regression Statistics Journal Club

Summary and discussion of: Exact Post-selection Inference for Forward Stepwise and Least Angle Regression Statistics Journal Club Summary and discussion of: Exact Post-selection Inference for Forward Stepwise and Least Angle Regression Statistics Journal Club 36-825 1 Introduction Jisu Kim and Veeranjaneyulu Sadhanala In this report

More information

UNIVERSITY OF MASSACHUSETTS Department of Mathematics and Statistics Basic Exam - Applied Statistics Thursday, August 30, 2018

UNIVERSITY OF MASSACHUSETTS Department of Mathematics and Statistics Basic Exam - Applied Statistics Thursday, August 30, 2018 UNIVERSITY OF MASSACHUSETTS Department of Mathematics and Statistics Basic Exam - Applied Statistics Thursday, August 30, 2018 Work all problems. 60 points are needed to pass at the Masters Level and 75

More information

Spatial segregation and socioeconomic inequalities in health in major Brazilian cities. An ESRC pathfinder project

Spatial segregation and socioeconomic inequalities in health in major Brazilian cities. An ESRC pathfinder project Spatial segregation and socioeconomic inequalities in health in major Brazilian cities An ESRC pathfinder project Income per head and life-expectancy: rich & poor countries Source: Wilkinson & Pickett,

More information

Medical GIS: New Uses of Mapping Technology in Public Health. Peter Hayward, PhD Department of Geography SUNY College at Oneonta

Medical GIS: New Uses of Mapping Technology in Public Health. Peter Hayward, PhD Department of Geography SUNY College at Oneonta Medical GIS: New Uses of Mapping Technology in Public Health Peter Hayward, PhD Department of Geography SUNY College at Oneonta Invited research seminar presentation at Bassett Healthcare. Cooperstown,

More information

Statistical Estimation

Statistical Estimation Statistical Estimation Use data and a model. The plug-in estimators are based on the simple principle of applying the defining functional to the ECDF. Other methods of estimation: minimize residuals from

More information

Spatial Temporal Models for Retail Credit

Spatial Temporal Models for Retail Credit Spatial Temporal Models for Retail Credit Bob Stine The School, University of Pennsylvania stat.wharton.upenn.edu/~stine Introduction Exploratory analysis Trends and maps Outline Measuring spatial association

More information

Generalized Boosted Models: A guide to the gbm package

Generalized Boosted Models: A guide to the gbm package Generalized Boosted Models: A guide to the gbm package Greg Ridgeway April 15, 2006 Boosting takes on various forms with different programs using different loss functions, different base models, and different

More information

Migration Clusters in Brazil: an Analysis of Areas of Origin and Destination Ernesto Friedrich Amaral

Migration Clusters in Brazil: an Analysis of Areas of Origin and Destination Ernesto Friedrich Amaral 1 Migration Clusters in Brazil: an Analysis of Areas of Origin and Destination Ernesto Friedrich Amaral Research question and data The main goal of this research is to analyze whether the pattern of concentration

More information

Ensemble Methods. Charles Sutton Data Mining and Exploration Spring Friday, 27 January 12

Ensemble Methods. Charles Sutton Data Mining and Exploration Spring Friday, 27 January 12 Ensemble Methods Charles Sutton Data Mining and Exploration Spring 2012 Bias and Variance Consider a regression problem Y = f(x)+ N(0, 2 ) With an estimate regression function ˆf, e.g., ˆf(x) =w > x Suppose

More information

CENTERING IN MULTILEVEL MODELS. Consider the situation in which we have m groups of individuals, where

CENTERING IN MULTILEVEL MODELS. Consider the situation in which we have m groups of individuals, where CENTERING IN MULTILEVEL MODELS JAN DE LEEUW ABSTRACT. This is an entry for The Encyclopedia of Statistics in Behavioral Science, to be published by Wiley in 2005. Consider the situation in which we have

More information

Clusters. Unsupervised Learning. Luc Anselin. Copyright 2017 by Luc Anselin, All Rights Reserved

Clusters. Unsupervised Learning. Luc Anselin.   Copyright 2017 by Luc Anselin, All Rights Reserved Clusters Unsupervised Learning Luc Anselin http://spatial.uchicago.edu 1 curse of dimensionality principal components multidimensional scaling classical clustering methods 2 Curse of Dimensionality 3 Curse

More information

F-tests for Incomplete Data in Multiple Regression Setup

F-tests for Incomplete Data in Multiple Regression Setup F-tests for Incomplete Data in Multiple Regression Setup ASHOK CHAURASIA Advisor: Dr. Ofer Harel University of Connecticut / 1 of 19 OUTLINE INTRODUCTION F-tests in Multiple Linear Regression Incomplete

More information

Bridging the two cultures: Latent variable statistical modeling with boosted regression trees

Bridging the two cultures: Latent variable statistical modeling with boosted regression trees Bridging the two cultures: Latent variable statistical modeling with boosted regression trees Thomas G. Dietterich and Rebecca Hutchinson Oregon State University Corvallis, Oregon, USA 1 A Species Distribution

More information

Obesity, Fast Food, and Grocery Stores: Evidence from Geo-referenced Micro Data

Obesity, Fast Food, and Grocery Stores: Evidence from Geo-referenced Micro Data Obesity, Fast Food, and Grocery Stores: Evidence from Geo-referenced Micro Data Susan E. Chen 1, Raymond J.G.M. Florax 1,, and Samantha D. Snyder 1 1 Purdue University, West Lafayette, USA VU University

More information

Compare Predicted Counts between Groups of Zero Truncated Poisson Regression Model based on Recycled Predictions Method

Compare Predicted Counts between Groups of Zero Truncated Poisson Regression Model based on Recycled Predictions Method Compare Predicted Counts between Groups of Zero Truncated Poisson Regression Model based on Recycled Predictions Method Yan Wang 1, Michael Ong 2, Honghu Liu 1,2,3 1 Department of Biostatistics, UCLA School

More information

GIS = Geographic Information Systems;

GIS = Geographic Information Systems; What is GIS GIS = Geographic Information Systems; What Information are we talking about? Information about anything that has a place (e.g. locations of features, address of people) on Earth s surface,

More information

A Survey of L 1. Regression. Céline Cunen, 20/10/2014. Vidaurre, Bielza and Larranaga (2013)

A Survey of L 1. Regression. Céline Cunen, 20/10/2014. Vidaurre, Bielza and Larranaga (2013) A Survey of L 1 Regression Vidaurre, Bielza and Larranaga (2013) Céline Cunen, 20/10/2014 Outline of article 1.Introduction 2.The Lasso for Linear Regression a) Notation and Main Concepts b) Statistical

More information

Inversion Base Height. Daggot Pressure Gradient Visibility (miles)

Inversion Base Height. Daggot Pressure Gradient Visibility (miles) Stanford University June 2, 1998 Bayesian Backtting: 1 Bayesian Backtting Trevor Hastie Stanford University Rob Tibshirani University of Toronto Email: trevor@stat.stanford.edu Ftp: stat.stanford.edu:

More information

Spatio-temporal dynamics of Marbled Murrelet hotspots during nesting in nearshore waters along the Washington to California coast

Spatio-temporal dynamics of Marbled Murrelet hotspots during nesting in nearshore waters along the Washington to California coast Western Washington University Western CEDAR Salish Sea Ecosystem Conference 2014 Salish Sea Ecosystem Conference (Seattle, Wash.) May 1st, 10:30 AM - 12:00 PM Spatio-temporal dynamics of Marbled Murrelet

More information

Bayesian Hierarchical Models

Bayesian Hierarchical Models Bayesian Hierarchical Models Gavin Shaddick, Millie Green, Matthew Thomas University of Bath 6 th - 9 th December 2016 1/ 34 APPLICATIONS OF BAYESIAN HIERARCHICAL MODELS 2/ 34 OUTLINE Spatial epidemiology

More information

Review of the General Linear Model

Review of the General Linear Model Review of the General Linear Model EPSY 905: Multivariate Analysis Online Lecture #2 Learning Objectives Types of distributions: Ø Conditional distributions The General Linear Model Ø Regression Ø Analysis

More information

Spatial Pattern Analysis: Mapping Trends and Clusters. Lauren M. Scott, PhD Lauren Rosenshein Bennett, MS

Spatial Pattern Analysis: Mapping Trends and Clusters. Lauren M. Scott, PhD Lauren Rosenshein Bennett, MS Spatial Pattern Analysis: Mapping Trends and Clusters Lauren M. Scott, PhD Lauren Rosenshein Bennett, MS Presentation Outline Spatial statistics overview Describing spatial patterns Quantifying spatial

More information

OBESITY AND LOCATION IN MARION COUNTY, INDIANA MIDWEST STUDENT SUMMIT, APRIL Samantha Snyder, Purdue University

OBESITY AND LOCATION IN MARION COUNTY, INDIANA MIDWEST STUDENT SUMMIT, APRIL Samantha Snyder, Purdue University OBESITY AND LOCATION IN MARION COUNTY, INDIANA MIDWEST STUDENT SUMMIT, APRIL 2008 Samantha Snyder, Purdue University Organization Introduction Literature and Motivation Data Geographic Distributions ib

More information

Convex Optimization / Homework 1, due September 19

Convex Optimization / Homework 1, due September 19 Convex Optimization 1-725/36-725 Homework 1, due September 19 Instructions: You must complete Problems 1 3 and either Problem 4 or Problem 5 (your choice between the two). When you submit the homework,

More information

WlLPEN L. GORR KRISTEN S. KURLAND. Universitats- und Landesbibliothek. Bibliothek Architektur und Stadtebau ESRI

WlLPEN L. GORR KRISTEN S. KURLAND. Universitats- und Landesbibliothek. Bibliothek Architektur und Stadtebau ESRI hj WlLPEN L. GORR KRISTEN S. KURLAND Universitats- und Landesbibliothek Darmstadt Bibliothek Architektur und Stadtebau ESRI Contents Acknowledgments Preface xvii XV GIS Tutorial 1 Introduction Launch ArcMap

More information

STAT 5500/6500 Conditional Logistic Regression for Matched Pairs

STAT 5500/6500 Conditional Logistic Regression for Matched Pairs STAT 5500/6500 Conditional Logistic Regression for Matched Pairs Motivating Example: The data we will be using comes from a subset of data taken from the Los Angeles Study of the Endometrial Cancer Data

More information

Multilevel Statistical Models: 3 rd edition, 2003 Contents

Multilevel Statistical Models: 3 rd edition, 2003 Contents Multilevel Statistical Models: 3 rd edition, 2003 Contents Preface Acknowledgements Notation Two and three level models. A general classification notation and diagram Glossary Chapter 1 An introduction

More information

Last lecture 1/35. General optimization problems Newton Raphson Fisher scoring Quasi Newton

Last lecture 1/35. General optimization problems Newton Raphson Fisher scoring Quasi Newton EM Algorithm Last lecture 1/35 General optimization problems Newton Raphson Fisher scoring Quasi Newton Nonlinear regression models Gauss-Newton Generalized linear models Iteratively reweighted least squares

More information