Prospect. February 8, Geographically Weighted Analysis - Review and. Prospect. Chris Brunsdon. The Basics GWPCA. Conclusion

Size: px
Start display at page:

Download "Prospect. February 8, Geographically Weighted Analysis - Review and. Prospect. Chris Brunsdon. The Basics GWPCA. Conclusion"

Transcription

1 bruary 8, 0

2 Regression (GWR) In a nutshell: A local statistical technique to analyse spatial variations in relationships Global averages of spatial data are not always helpful: climate data health data This problem can also occur with global statistics that measure relationships in spatial data. regression correlation

3 Spatial Non-Stationarity Spatial Non-Stationarity occurs when a relationship (or pattern) that applies in one region does not apply in another Global models are statements about processes or patterns which are assumed to be stationary and as such are location independent are assumed to apply in all locations Local models are spatial disaggregations of global models, the results of which are location-specific The template of the model is the same - but the specifics may alter. i.e. The model may always be a linear regression model with certain variables, but the coefficients alter geographically

4 Spatial Non-Stationarity Spatial Non-Stationarity occurs when a relationship (or pattern) that applies in one region does not apply in another Global models are statements about processes or patterns which are assumed to be stationary and as such are location independent are assumed to apply in all locations Local models are spatial disaggregations of global models, the results of which are location-specific The template of the model is the same - but the specifics may alter. i.e. The model may always be a linear regression model with certain variables, but the coefficients alter geographically The above is essentially a description of GWR

5 An Example of Spatial Non-Stationarity - % With Degrees % Foreign Born < > 5.0 < >.0 Georgia State (Source: US Census 990)

6 An Example of Spatial Non-Stationarity - Northing % With Degrees % Foreign Born

7 The GWR Model Standard Global Regression y i = α + β x i + β x i ε i where ε i N(0, σ ) Regression y i = α(u i, v i ) + β (u i, v i )x i + β (u i, v i )x i ε i where ε i N(0, σ ) and (u i, v i ) is the location of observation i Note: the coefficients in GWR are now functions, not variables.

8 A Calibration gorithm - For a given point (u, v): Consider a window of radius h Calibrate regression just using data falling in that window. Scanning the window across the study area gives a surface of regression parameters...

9 A Calibration gorithm - To avoid sudden jumps when scanning the window: We use a weighted regression calibration. Hence Regression.

10 A Calibration gorithm - Weighting Details: A possible Scheme { } d if d < h h w i (u, v) = 0 otherwise where d = (u u i ) + (v v i ). h is called the bandwidth. Other weighting functions could be used - i.e. Gaussian Results generally more sensitive to h than choice of weighting function.

11 A Calibration gorithm - 4 Calibration Formula { ˆβ(u, v) = X T W(u, v)x} X T W(u, v)y where W = Diagonal(w (u, v), w (u, v),..., w n (u, v)) X is the matrix of independent variables. y is the vector of the dependent variable. cf Global Regression Formula { ˆβ = X X} T X T y

12 A Calibration gorithm - 5 An extension of the method Use a different bandwidth in different places - i.e. h(u, v) Typically, bandwidth at (u, v) is distance to kth nearest neighbour. Useful if density of observations is variable - e.g. urban/rural.

13 Over- and Under- Fitting

14 Cross-Validation and h RMS Prediction Error Cross Validation Example Bandwidth h (km) cross-validation - fit the model to a holdback sample using the remaining data for a range of h-values, then find the h-value that is the best predictor.

15 Results of GWR - Slope Slope Coefficient < > 4.

16 Results of GWR - Intercept Intercept Coefficient < > 8.

17 Results of GWR - Slope Slope Coefficient - Using Grid Sampling < > 4.

18 Further Issues Local standard error - reliability of estimates Significance testing - Monte Carlo approaches H 0 : No spatial variation in coefficients H : GWR assumption is true Tests whether GWR assumption is valid Could also be used to justify global models on occasions Multivariate GWR

19 Results of GWR - Multivariate % Foreign Born % Elderly ( 65) < >.9 < > 0. Note - adding extra variables can alter interpretation due to correlation between predictors. Just like in other kinds of regression...

20 PCA Multivariate relationships: Issues with collinearity Treating variables symmetrically Multivariate outliers Principal Components Identifies collinearity: Based on Σ-matrix of several variables Can identify multivariate outliers

21 PCA as Model - y Comparison: OLS Regression 4 0 4

22 PCA as Model - y Comparison: PCA 4 0 4

23 Interpretation PCA is a kind of line fitting algorithm Based on perpendicular distances. Error to be minimised is based on fitting both x and y, not just y. Residuals are the perpendicular distances mentioned above The equation of the best fit line gives the loadings on each variable for PC The projection of the points on the line correspond to the scores for PC

24 The Multivariate Situation Same idea still applies BUT For first k components in m dimensions: Find k-dimensional subspace minimising perpendicular distances in m-space - the equations of the subspace gives the loadings in terms of input variables. Residuals are the perpendicular distances mentioned above Coordinates projected onto subspace ordination plot found in above plot Type multidimensional outliers They fit the model subspace model, but are unusual in the subspace Big residuals Type multidimensional outliers Don t even fit the subspace model!

25 Geographical Weighting PCA Might want to find outliers locally A local outlier: Is not an unusual observation in the data set as a whole But is unlike its geographical neighbours Can use locally weighted PCA to investigate local multivariate outliers. How to do it: Apply geographical weighting windows to the perpendicular distance minimising algorithm Thus PCA loadings are viewed as functions of (u, v) - like regression coefficients in GWR.

26 An Example Baltic Soil Survey (Reimann et al, 000). Agricultural soils were collected from 0 European countries over a large region surrounding the Baltic Sea 768 sites Here we concentrate on topsoil samples - Trace compounds: SiO, O, O, O, MnO, MgO, CaO, Na O, O and P O 5. % by weight calculated. Data has 768 rows and 0 columns. so, the x and y coordinates of each site are recorded. Data standardised to z-scores. ey task: identify local patterns and outliers...

27 Survey Locations

28 Choosing h for Bandwidth Selection Much like the procedure in GWR Measure perpendicular distances in a holdback sample CV Score Choose h to minimise this Bandwidth

29 PCA Results - Highest Loadings

30 PCA Results - Sternutation Plot of Loading

31 PCA Results - Sternutation Plot of Loading

32 Unique sign patterns in geographically weighted loadings SiO O O O MnO MgO CaO Na O O P O Relatively small number of patterns exhibited - only out of a possible 0 = 04 NB. First sign always positive by convention

33 PCA Results - Sign Patterns

34 Hunting of Type - High Perpendicular Distances

35 Hunting of Type - Parallel Coordinates Site 0 Site 59 SiO O O O MnO MgO CaO NaO O PO5 Site 50 SiO O O O MnO MgO CaO NaO O PO5 SiO O O O MnO MgO CaO NaO O PO5 Site 65 SiO O O O MnO MgO CaO NaO O PO5

36 s GWR/ as data miner Certainly a useful rôle for But PCA can also be seen as a model Possibly data mining / data modelling not such a clear cut distinction? Further extensions...

37 s The End with thanks to Martin Charlton for his helpful comments and discussion.

Statistics: A review. Why statistics?

Statistics: A review. Why statistics? Statistics: A review Why statistics? What statistical concepts should we know? Why statistics? To summarize, to explore, to look for relations, to predict What kinds of data exist? Nominal, Ordinal, Interval

More information

ESRI 2008 Health GIS Conference

ESRI 2008 Health GIS Conference ESRI 2008 Health GIS Conference An Exploration of Geographically Weighted Regression on Spatial Non- Stationarity and Principal Component Extraction of Determinative Information from Robust Datasets A

More information

Multiple Dependent Hypothesis Tests in Geographically Weighted Regression

Multiple Dependent Hypothesis Tests in Geographically Weighted Regression Multiple Dependent Hypothesis Tests in Geographically Weighted Regression Graeme Byrne 1, Martin Charlton 2, and Stewart Fotheringham 3 1 La Trobe University, Bendigo, Victoria Austrlaia Telephone: +61

More information

Using Spatial Statistics Social Service Applications Public Safety and Public Health

Using Spatial Statistics Social Service Applications Public Safety and Public Health Using Spatial Statistics Social Service Applications Public Safety and Public Health Lauren Rosenshein 1 Regression analysis Regression analysis allows you to model, examine, and explore spatial relationships,

More information

GeoDa-GWR Results: GeoDa-GWR Output (portion only): Program began at 4/8/2016 4:40:38 PM

GeoDa-GWR Results: GeoDa-GWR Output (portion only): Program began at 4/8/2016 4:40:38 PM New Mexico Health Insurance Coverage, 2009-2013 Exploratory, Ordinary Least Squares, and Geographically Weighted Regression Using GeoDa-GWR, R, and QGIS Larry Spear 4/13/2016 (Draft) A dataset consisting

More information

GIS Analysis: Spatial Statistics for Public Health: Lauren M. Scott, PhD; Mark V. Janikas, PhD

GIS Analysis: Spatial Statistics for Public Health: Lauren M. Scott, PhD; Mark V. Janikas, PhD Some Slides to Go Along with the Demo Hot spot analysis of average age of death Section B DEMO: Mortality Data Analysis 2 Some Slides to Go Along with the Demo Do Economic Factors Alone Explain Early Death?

More information

Exploratory Spatial Data Analysis (ESDA)

Exploratory Spatial Data Analysis (ESDA) Exploratory Spatial Data Analysis (ESDA) VANGHR s method of ESDA follows a typical geospatial framework of selecting variables, exploring spatial patterns, and regression analysis. The primary software

More information

Modeling Spatial Relationships using Regression Analysis

Modeling Spatial Relationships using Regression Analysis Esri International User Conference San Diego, CA Technical Workshops July 2011 Modeling Spatial Relationships using Regression Analysis Lauren M. Scott, PhD Lauren Rosenshein, MS Mark V. Janikas, PhD Answering

More information

Spatial Variation in Infant Mortality with Geographically Weighted Poisson Regression (GWPR) Approach

Spatial Variation in Infant Mortality with Geographically Weighted Poisson Regression (GWPR) Approach Spatial Variation in Infant Mortality with Geographically Weighted Poisson Regression (GWPR) Approach Kristina Pestaria Sinaga, Manuntun Hutahaean 2, Petrus Gea 3 1, 2, 3 University of Sumatera Utara,

More information

The GWmodel R package: Further Topics for Exploring Spatial Heterogeneity using Geographically Weighted Models

The GWmodel R package: Further Topics for Exploring Spatial Heterogeneity using Geographically Weighted Models The GWmodel R package: Further Topics for Exploring Spatial Heterogeneity using Geographically Weighted Models Binbin Lu a*, Paul Harris a, Martin Charlton a, Chris Brunsdon b a. National Centre for Geocomputation,

More information

Section 2.2 RAINFALL DATABASE S.D. Lynch and R.E. Schulze

Section 2.2 RAINFALL DATABASE S.D. Lynch and R.E. Schulze Section 2.2 RAINFALL DATABASE S.D. Lynch and R.E. Schulze Background to the Rainfall Database The rainfall database described in this Section derives from a WRC project the final report of which was titled

More information

Correlation and Regression

Correlation and Regression Correlation and Regression October 25, 2017 STAT 151 Class 9 Slide 1 Outline of Topics 1 Associations 2 Scatter plot 3 Correlation 4 Regression 5 Testing and estimation 6 Goodness-of-fit STAT 151 Class

More information

Modeling Spatial Relationships Using Regression Analysis. Lauren M. Scott, PhD Lauren Rosenshein Bennett, MS

Modeling Spatial Relationships Using Regression Analysis. Lauren M. Scott, PhD Lauren Rosenshein Bennett, MS Modeling Spatial Relationships Using Regression Analysis Lauren M. Scott, PhD Lauren Rosenshein Bennett, MS Workshop Overview Answering why? questions Introduce regression analysis - What it is and why

More information

Modeling Spatial Relationships Using Regression Analysis

Modeling Spatial Relationships Using Regression Analysis Esri International User Conference San Diego, California Technical Workshops July 24, 2012 Modeling Spatial Relationships Using Regression Analysis Lauren M. Scott, PhD Lauren Rosenshein Bennett, MS Answering

More information

MATH 829: Introduction to Data Mining and Analysis Principal component analysis

MATH 829: Introduction to Data Mining and Analysis Principal component analysis 1/11 MATH 829: Introduction to Data Mining and Analysis Principal component analysis Dominique Guillot Departments of Mathematical Sciences University of Delaware April 4, 2016 Motivation 2/11 High-dimensional

More information

Geographical General Regression Neural Network (GGRNN) Tool For Geographically Weighted Regression Analysis

Geographical General Regression Neural Network (GGRNN) Tool For Geographically Weighted Regression Analysis Geographical General Regression Neural Network (GGRNN) Tool For Geographically Weighted Regression Analysis Muhammad Irfan, Aleksandra Koj, Hywel R. Thomas, Majid Sedighi Geoenvironmental Research Centre,

More information

Models for Count and Binary Data. Poisson and Logistic GWR Models. 24/07/2008 GWR Workshop 1

Models for Count and Binary Data. Poisson and Logistic GWR Models. 24/07/2008 GWR Workshop 1 Models for Count and Binary Data Poisson and Logistic GWR Models 24/07/2008 GWR Workshop 1 Outline I: Modelling counts Poisson regression II: Modelling binary events Logistic Regression III: Poisson Regression

More information

AMS 7 Correlation and Regression Lecture 8

AMS 7 Correlation and Regression Lecture 8 AMS 7 Correlation and Regression Lecture 8 Department of Applied Mathematics and Statistics, University of California, Santa Cruz Suumer 2014 1 / 18 Correlation pairs of continuous observations. Correlation

More information

Geographically Weighted Regression LECTURE 2 : Introduction to GWR II

Geographically Weighted Regression LECTURE 2 : Introduction to GWR II Geographically Weighted Regression LECTURE 2 : Introduction to GWR II Stewart.Fotheringham@nuim.ie http://ncg.nuim.ie/gwr A Simulation Experiment Y i = α i + β 1i X 1i + β 2i X 2i Data on X 1 and X 2 drawn

More information

Geographically weighted regression approach for origin-destination flows

Geographically weighted regression approach for origin-destination flows Geographically weighted regression approach for origin-destination flows Kazuki Tamesue 1 and Morito Tsutsumi 2 1 Graduate School of Information and Engineering, University of Tsukuba 1-1-1 Tennodai, Tsukuba,

More information

Bootstrapping, Randomization, 2B-PLS

Bootstrapping, Randomization, 2B-PLS Bootstrapping, Randomization, 2B-PLS Statistics, Tests, and Bootstrapping Statistic a measure that summarizes some feature of a set of data (e.g., mean, standard deviation, skew, coefficient of variation,

More information

A GEOSTATISTICAL APPROACH TO PREDICTING A PHYSICAL VARIABLE THROUGH A CONTINUOUS SURFACE

A GEOSTATISTICAL APPROACH TO PREDICTING A PHYSICAL VARIABLE THROUGH A CONTINUOUS SURFACE Katherine E. Williams University of Denver GEOG3010 Geogrpahic Information Analysis April 28, 2011 A GEOSTATISTICAL APPROACH TO PREDICTING A PHYSICAL VARIABLE THROUGH A CONTINUOUS SURFACE Overview Data

More information

Categorical Predictor Variables

Categorical Predictor Variables Categorical Predictor Variables We often wish to use categorical (or qualitative) variables as covariates in a regression model. For binary variables (taking on only 2 values, e.g. sex), it is relatively

More information

A Short Note on the Proportional Effect and Direct Sequential Simulation

A Short Note on the Proportional Effect and Direct Sequential Simulation A Short Note on the Proportional Effect and Direct Sequential Simulation Abstract B. Oz (boz@ualberta.ca) and C. V. Deutsch (cdeutsch@ualberta.ca) University of Alberta, Edmonton, Alberta, CANADA Direct

More information

ANALYSIS OF LARGE SCALE SOIL SPECTRAL LIBRARIES

ANALYSIS OF LARGE SCALE SOIL SPECTRAL LIBRARIES Antoine Stevens (1), Marco Nocita (1,2), & Bas van Wesemael (1) ANALYSIS OF LARGE SCALE SOIL SPECTRAL LIBRARIES 1 Georges Lemaître Centre for Earth and Climate Research, Earth and Life Institute, UCLouvain,

More information

Community Health Needs Assessment through Spatial Regression Modeling

Community Health Needs Assessment through Spatial Regression Modeling Community Health Needs Assessment through Spatial Regression Modeling Glen D. Johnson, PhD CUNY School of Public Health glen.johnson@lehman.cuny.edu Objectives: Assess community needs with respect to particular

More information

Statistical downscaling daily rainfall statistics from seasonal forecasts using canonical correlation analysis or a hidden Markov model?

Statistical downscaling daily rainfall statistics from seasonal forecasts using canonical correlation analysis or a hidden Markov model? Statistical downscaling daily rainfall statistics from seasonal forecasts using canonical correlation analysis or a hidden Markov model? Andrew W. Robertson International Research Institute for Climate

More information

Statistical View of Least Squares

Statistical View of Least Squares May 23, 2006 Purpose of Regression Some Examples Least Squares Purpose of Regression Purpose of Regression Some Examples Least Squares Suppose we have two variables x and y Purpose of Regression Some Examples

More information

Geographically Weighted Regression (GWR)

Geographically Weighted Regression (GWR) Geographically Weighted Regression (GWR) rahmaanisa@apps.ipb.ac.id Global Vs Local Statistics Global similarities across space single-valued statistics non-mappable search for regularities aspatial Local

More information

STATISTICAL LEARNING SYSTEMS

STATISTICAL LEARNING SYSTEMS STATISTICAL LEARNING SYSTEMS LECTURE 8: UNSUPERVISED LEARNING: FINDING STRUCTURE IN DATA Institute of Computer Science, Polish Academy of Sciences Ph. D. Program 2013/2014 Principal Component Analysis

More information

Data Mining Based Anomaly Detection In PMU Measurements And Event Detection

Data Mining Based Anomaly Detection In PMU Measurements And Event Detection Data Mining Based Anomaly Detection In PMU Measurements And Event Detection P. Banerjee, S. Pandey, M. Zhou, A. Srivastava, Y. Wu Smart Grid Demonstration and Research Investigation Lab (SGDRIL) Energy

More information

Experimental Design and Data Analysis for Biologists

Experimental Design and Data Analysis for Biologists Experimental Design and Data Analysis for Biologists Gerry P. Quinn Monash University Michael J. Keough University of Melbourne CAMBRIDGE UNIVERSITY PRESS Contents Preface page xv I I Introduction 1 1.1

More information

Regression Review. Statistics 149. Spring Copyright c 2006 by Mark E. Irwin

Regression Review. Statistics 149. Spring Copyright c 2006 by Mark E. Irwin Regression Review Statistics 149 Spring 2006 Copyright c 2006 by Mark E. Irwin Matrix Approach to Regression Linear Model: Y i = β 0 + β 1 X i1 +... + β p X ip + ɛ i ; ɛ i iid N(0, σ 2 ), i = 1,..., n

More information

Multimodel Ensemble forecasts

Multimodel Ensemble forecasts Multimodel Ensemble forecasts Calibrated methods Michael K. Tippett International Research Institute for Climate and Society The Earth Institute, Columbia University ERFS Climate Predictability Tool Training

More information

Combining Regressive and Auto-Regressive Models for Spatial-Temporal Prediction

Combining Regressive and Auto-Regressive Models for Spatial-Temporal Prediction Combining Regressive and Auto-Regressive Models for Spatial-Temporal Prediction Dragoljub Pokrajac DPOKRAJA@EECS.WSU.EDU Zoran Obradovic ZORAN@EECS.WSU.EDU School of Electrical Engineering and Computer

More information

Geographically Weighted Regression as a Statistical Model

Geographically Weighted Regression as a Statistical Model Geographically Weighted Regression as a Statistical Model Chris Brunsdon Stewart Fotheringham Martin Charlton October 6, 2000 Spatial Analysis Research Group Department of Geography University of Newcastle-upon-Tyne

More information

Simple Linear Regression

Simple Linear Regression Simple Linear Regression ST 370 Regression models are used to study the relationship of a response variable and one or more predictors. The response is also called the dependent variable, and the predictors

More information

Appendix A : rational of the spatial Principal Component Analysis

Appendix A : rational of the spatial Principal Component Analysis Appendix A : rational of the spatial Principal Component Analysis In this appendix, the following notations are used : X is the n-by-p table of centred allelic frequencies, where rows are observations

More information

Geospatial dynamics of Northwest. fisheries in the 1990s and 2000s: environmental and trophic impacts

Geospatial dynamics of Northwest. fisheries in the 1990s and 2000s: environmental and trophic impacts Geospatial dynamics of Northwest Atlantic ti cod and crustacean fisheries in the 1990s and 2000s: environmental and trophic impacts Matthew J.S. WINDLE 1, George A. ROSE 2, Rodolphe DEVILLERS 3, and Marie-Josée

More information

Data Mining. 3.6 Regression Analysis. Fall Instructor: Dr. Masoud Yaghini. Numeric Prediction

Data Mining. 3.6 Regression Analysis. Fall Instructor: Dr. Masoud Yaghini. Numeric Prediction Data Mining 3.6 Regression Analysis Fall 2008 Instructor: Dr. Masoud Yaghini Outline Introduction Straight-Line Linear Regression Multiple Linear Regression Other Regression Models References Introduction

More information

Linear Model Selection and Regularization

Linear Model Selection and Regularization Linear Model Selection and Regularization Recall the linear model Y = β 0 + β 1 X 1 + + β p X p + ɛ. In the lectures that follow, we consider some approaches for extending the linear model framework. In

More information

Functional time series

Functional time series Rob J Hyndman Functional time series with applications in demography 4. Connections, extensions and applications Outline 1 Yield curves 2 Electricity prices 3 Dynamic updating with partially observed functions

More information

Measuring the fit of the model - SSR

Measuring the fit of the model - SSR Measuring the fit of the model - SSR Once we ve determined our estimated regression line, we d like to know how well the model fits. How far/close are the observations to the fitted line? One way to do

More information

STA 2101/442 Assignment Four 1

STA 2101/442 Assignment Four 1 STA 2101/442 Assignment Four 1 One version of the general linear model with fixed effects is y = Xβ + ɛ, where X is an n p matrix of known constants with n > p and the columns of X linearly independent.

More information

Single and multiple linear regression analysis

Single and multiple linear regression analysis Single and multiple linear regression analysis Marike Cockeran 2017 Introduction Outline of the session Simple linear regression analysis SPSS example of simple linear regression analysis Additional topics

More information

EXTENDING PARTIAL LEAST SQUARES REGRESSION

EXTENDING PARTIAL LEAST SQUARES REGRESSION EXTENDING PARTIAL LEAST SQUARES REGRESSION ATHANASSIOS KONDYLIS UNIVERSITY OF NEUCHÂTEL 1 Outline Multivariate Calibration in Chemometrics PLS regression (PLSR) and the PLS1 algorithm PLS1 from a statistical

More information

-Principal components analysis is by far the oldest multivariate technique, dating back to the early 1900's; ecologists have used PCA since the

-Principal components analysis is by far the oldest multivariate technique, dating back to the early 1900's; ecologists have used PCA since the 1 2 3 -Principal components analysis is by far the oldest multivariate technique, dating back to the early 1900's; ecologists have used PCA since the 1950's. -PCA is based on covariance or correlation

More information

The Degree of Standardisation in the An Sơn Ceramic Assemblage

The Degree of Standardisation in the An Sơn Ceramic Assemblage 7 The Degree of Standardisation in the An Sơn Ceramic Assemblage Introduction: Methodology for the study of standardisation The level of standardisation within an assemblage of pottery is used as an indirect

More information

Multivariate Data Analysis a survey of data reduction and data association techniques: Principal Components Analysis

Multivariate Data Analysis a survey of data reduction and data association techniques: Principal Components Analysis Multivariate Data Analysis a survey of data reduction and data association techniques: Principal Components Analysis For example Data reduction approaches Cluster analysis Principal components analysis

More information

Using AMOEBA to Create a Spatial Weights Matrix and Identify Spatial Clusters, and a Comparison to Other Clustering Algorithms

Using AMOEBA to Create a Spatial Weights Matrix and Identify Spatial Clusters, and a Comparison to Other Clustering Algorithms Using AMOEBA to Create a Spatial Weights Matrix and Identify Spatial Clusters, and a Comparison to Other Clustering Algorithms Arthur Getis* and Jared Aldstadt** *San Diego State University **SDSU/UCSB

More information

Chapter 7. Linear Regression (Pt. 1) 7.1 Introduction. 7.2 The Least-Squares Regression Line

Chapter 7. Linear Regression (Pt. 1) 7.1 Introduction. 7.2 The Least-Squares Regression Line Chapter 7 Linear Regression (Pt. 1) 7.1 Introduction Recall that r, the correlation coefficient, measures the linear association between two quantitative variables. Linear regression is the method of fitting

More information

Regime switching models

Regime switching models Regime switching models Structural change and nonlinearities Matthieu Stigler Matthieu.Stigler at gmail.com April 30, 2009 Version 1.1 This document is released under the Creative Commons Attribution-Noncommercial

More information

Processing Big Data Matrix Sketching

Processing Big Data Matrix Sketching Processing Big Data Matrix Sketching Dimensionality reduction Linear Principal Component Analysis: SVD-based Compressed sensing Matrix sketching Non-linear Kernel PCA Isometric mapping Matrix sketching

More information

The GWmodel R package: further topics for exploring spatial heterogeneity using geographically weighted models

The GWmodel R package: further topics for exploring spatial heterogeneity using geographically weighted models Geo-spatial Information Science ISSN: 1009-5020 (Print) 1993-5153 (Online) Journal homepage: https://www.tandfonline.com/loi/tgsi20 The GWmodel R package: further topics for exploring spatial heterogeneity

More information

Bulyanhulu: Anomalous gold mineralisation in the Archaean of Tanzania. Claire Chamberlain, Jamie Wilkinson, Richard Herrington, Ettienne du Plessis

Bulyanhulu: Anomalous gold mineralisation in the Archaean of Tanzania. Claire Chamberlain, Jamie Wilkinson, Richard Herrington, Ettienne du Plessis Bulyanhulu: Anomalous gold mineralisation in the Archaean of Tanzania Claire Chamberlain, Jamie Wilkinson, Richard Herrington, Ettienne du Plessis Atypical Archaean gold deposits Groves et al., 2003 Regional

More information

Treatment of Data. Methods of determining analytical error -Counting statistics -Reproducibility of reference materials -Homogeneity of sample

Treatment of Data. Methods of determining analytical error -Counting statistics -Reproducibility of reference materials -Homogeneity of sample Treatment of Data Methods of determining analytical error -Counting statistics -Reproducibility of reference materials -Homogeneity of sample Detection Limits Assessment of analytical quality -Analytical

More information

PRODUCING PROBABILITY MAPS TO ASSESS RISK OF EXCEEDING CRITICAL THRESHOLD VALUE OF SOIL EC USING GEOSTATISTICAL APPROACH

PRODUCING PROBABILITY MAPS TO ASSESS RISK OF EXCEEDING CRITICAL THRESHOLD VALUE OF SOIL EC USING GEOSTATISTICAL APPROACH PRODUCING PROBABILITY MAPS TO ASSESS RISK OF EXCEEDING CRITICAL THRESHOLD VALUE OF SOIL EC USING GEOSTATISTICAL APPROACH SURESH TRIPATHI Geostatistical Society of India Assumptions and Geostatistical Variogram

More information

2.5 Forecasting and Impulse Response Functions

2.5 Forecasting and Impulse Response Functions 2.5 Forecasting and Impulse Response Functions Principles of forecasting Forecast based on conditional expectations Suppose we are interested in forecasting the value of y t+1 based on a set of variables

More information

Small Sample Corrections for LTS and MCD

Small Sample Corrections for LTS and MCD myjournal manuscript No. (will be inserted by the editor) Small Sample Corrections for LTS and MCD G. Pison, S. Van Aelst, and G. Willems Department of Mathematics and Computer Science, Universitaire Instelling

More information

CLUe Training An Introduction to Machine Learning in R with an example from handwritten digit recognition

CLUe Training An Introduction to Machine Learning in R with an example from handwritten digit recognition CLUe Training An Introduction to Machine Learning in R with an example from handwritten digit recognition Ad Feelders Universiteit Utrecht Department of Information and Computing Sciences Algorithmic Data

More information

Dimensionality Reduction Techniques (DRT)

Dimensionality Reduction Techniques (DRT) Dimensionality Reduction Techniques (DRT) Introduction: Sometimes we have lot of variables in the data for analysis which create multidimensional matrix. To simplify calculation and to get appropriate,

More information

Simultaneous Coefficient Penalization and Model Selection in Geographically Weighted Regression: The Geographically Weighted Lasso

Simultaneous Coefficient Penalization and Model Selection in Geographically Weighted Regression: The Geographically Weighted Lasso Simultaneous Coefficient Penalization and Model Selection in Geographically Weighted Regression: The Geographically Weighted Lasso by David C. Wheeler Technical Report 07-08 October 2007 Department of

More information

Regression diagnostics

Regression diagnostics Regression diagnostics Kerby Shedden Department of Statistics, University of Michigan November 5, 018 1 / 6 Motivation When working with a linear model with design matrix X, the conventional linear model

More information

Robot Image Credit: Viktoriya Sukhanova 123RF.com. Dimensionality Reduction

Robot Image Credit: Viktoriya Sukhanova 123RF.com. Dimensionality Reduction Robot Image Credit: Viktoriya Sukhanova 13RF.com Dimensionality Reduction Feature Selection vs. Dimensionality Reduction Feature Selection (last time) Select a subset of features. When classifying novel

More information

Ecological indicators: Software development

Ecological indicators: Software development Ecological indicators: Software development Sergei N. Rodionov Joint Institute for the Study of the Atmosphere and Ocean, University of Washington, Seattle, WA 98185, U.S.A. E-mail: sergei.rodionov@noaa.gov

More information

Focus was on solving matrix inversion problems Now we look at other properties of matrices Useful when A represents a transformations.

Focus was on solving matrix inversion problems Now we look at other properties of matrices Useful when A represents a transformations. Previously Focus was on solving matrix inversion problems Now we look at other properties of matrices Useful when A represents a transformations y = Ax Or A simply represents data Notion of eigenvectors,

More information

Simple and Multiple Linear Regression

Simple and Multiple Linear Regression Sta. 113 Chapter 12 and 13 of Devore March 12, 2010 Table of contents 1 Simple Linear Regression 2 Model Simple Linear Regression A simple linear regression model is given by Y = β 0 + β 1 x + ɛ where

More information

Lecture 4: Regression Analysis

Lecture 4: Regression Analysis Lecture 4: Regression Analysis 1 Regression Regression is a multivariate analysis, i.e., we are interested in relationship between several variables. For corporate audience, it is sufficient to show correlation.

More information

Hunting for Anomalies in PMU Data

Hunting for Anomalies in PMU Data Hunting for Anomalies in PMU Data BRETT AMIDAN JAMES FOLLUM JEFFERY DAGLE Pacific Northwest National Laboratory NASPI Presentation (October 23, 2014) November 3, 2014 b.amidan@pnnl.gov 1 Big Picture Objective

More information

Dimension Reduction Techniques. Presented by Jie (Jerry) Yu

Dimension Reduction Techniques. Presented by Jie (Jerry) Yu Dimension Reduction Techniques Presented by Jie (Jerry) Yu Outline Problem Modeling Review of PCA and MDS Isomap Local Linear Embedding (LLE) Charting Background Advances in data collection and storage

More information

Multiple Linear Regression II. Lecture 8. Overview. Readings

Multiple Linear Regression II. Lecture 8. Overview. Readings Multiple Linear Regression II Lecture 8 Image source:https://commons.wikimedia.org/wiki/file:autobunnskr%c3%a4iz-ro-a201.jpg Survey Research & Design in Psychology James Neill, 2016 Creative Commons Attribution

More information

Multiple Linear Regression II. Lecture 8. Overview. Readings. Summary of MLR I. Summary of MLR I. Summary of MLR I

Multiple Linear Regression II. Lecture 8. Overview. Readings. Summary of MLR I. Summary of MLR I. Summary of MLR I Multiple Linear Regression II Lecture 8 Image source:https://commons.wikimedia.org/wiki/file:autobunnskr%c3%a4iz-ro-a201.jpg Survey Research & Design in Psychology James Neill, 2016 Creative Commons Attribution

More information

Time: the late arrival at the Geocomputation party and the need for considered approaches to spatio- temporal analyses

Time: the late arrival at the Geocomputation party and the need for considered approaches to spatio- temporal analyses Time: the late arrival at the Geocomputation party and the need for considered approaches to spatio- temporal analyses Alexis Comber 1, Paul Harris* 2, Narumasa Tsutsumida 3 1 School of Geography, University

More information

SIMULATION AND APPLICATION OF THE SPATIAL AUTOREGRESSIVE GEOGRAPHICALLY WEIGHTED REGRESSION MODEL (SAR-GWR)

SIMULATION AND APPLICATION OF THE SPATIAL AUTOREGRESSIVE GEOGRAPHICALLY WEIGHTED REGRESSION MODEL (SAR-GWR) SIMULATION AND APPLICATION OF THE SPATIAL AUTOREGRESSIVE GEOGRAPHICALLY WEIGHTED REGRESSION MODEL (SAR-GWR) I. Gede Nyoman Mindra Jaya 1, Budi Nurani Ruchjana 2, Bertho Tantular 1, Zulhanif 1 and Yudhie

More information

Explorative Spatial Analysis of Coastal Community Incomes in Setiu Wetlands: Geographically Weighted Regression

Explorative Spatial Analysis of Coastal Community Incomes in Setiu Wetlands: Geographically Weighted Regression Explorative Spatial Analysis of Coastal Community Incomes in Setiu Wetlands: Geographically Weighted Regression Z. Syerrina 1, A.R. Naeim, L. Muhamad Safiih 3 and Z. Nuredayu 4 1,,3,4 School of Informatics

More information

Classification 2: Linear discriminant analysis (continued); logistic regression

Classification 2: Linear discriminant analysis (continued); logistic regression Classification 2: Linear discriminant analysis (continued); logistic regression Ryan Tibshirani Data Mining: 36-462/36-662 April 4 2013 Optional reading: ISL 4.4, ESL 4.3; ISL 4.3, ESL 4.4 1 Reminder:

More information

Econometrics 2, Class 1

Econometrics 2, Class 1 Econometrics 2, Class Problem Set #2 September 9, 25 Remember! Send an email to let me know that you are following these classes: paul.sharp@econ.ku.dk That way I can contact you e.g. if I need to cancel

More information

Inter Item Correlation Matrix (R )

Inter Item Correlation Matrix (R ) 7 1. I have the ability to influence my child s well-being. 2. Whether my child avoids injury is just a matter of luck. 3. Luck plays a big part in determining how healthy my child is. 4. I can do a lot

More information

Spatial Regression. 6. Specification Spatial Heterogeneity. Luc Anselin.

Spatial Regression. 6. Specification Spatial Heterogeneity. Luc Anselin. Spatial Regression 6. Specification Spatial Heterogeneity Luc Anselin http://spatial.uchicago.edu 1 homogeneity and heterogeneity spatial regimes spatially varying coefficients spatial random effects 2

More information

Multivariate Statistics

Multivariate Statistics Multivariate Statistics Chapter 4: Factor analysis Pedro Galeano Departamento de Estadística Universidad Carlos III de Madrid pedro.galeano@uc3m.es Course 2017/2018 Master in Mathematical Engineering Pedro

More information

Alternatives to Difference Scores: Polynomial Regression and Response Surface Methodology. Jeffrey R. Edwards University of North Carolina

Alternatives to Difference Scores: Polynomial Regression and Response Surface Methodology. Jeffrey R. Edwards University of North Carolina Alternatives to Difference Scores: Polynomial Regression and Response Surface Methodology Jeffrey R. Edwards University of North Carolina 1 Outline I. Types of Difference Scores II. Questions Difference

More information

Problems. Suppose both models are fitted to the same data. Show that SS Res, A SS Res, B

Problems. Suppose both models are fitted to the same data. Show that SS Res, A SS Res, B Simple Linear Regression 35 Problems 1 Consider a set of data (x i, y i ), i =1, 2,,n, and the following two regression models: y i = β 0 + β 1 x i + ε, (i =1, 2,,n), Model A y i = γ 0 + γ 1 x i + γ 2

More information

Modelling Non-linear and Non-stationary Time Series

Modelling Non-linear and Non-stationary Time Series Modelling Non-linear and Non-stationary Time Series Chapter 2: Non-parametric methods Henrik Madsen Advanced Time Series Analysis September 206 Henrik Madsen (02427 Adv. TS Analysis) Lecture Notes September

More information

Multivariate and Multivariable Regression. Stella Babalola Johns Hopkins University

Multivariate and Multivariable Regression. Stella Babalola Johns Hopkins University Multivariate and Multivariable Regression Stella Babalola Johns Hopkins University Session Objectives At the end of the session, participants will be able to: Explain the difference between multivariable

More information

Multiple Linear Regression II. Lecture 8. Overview. Readings

Multiple Linear Regression II. Lecture 8. Overview. Readings Multiple Linear Regression II Lecture 8 Image source:http://commons.wikimedia.org/wiki/file:vidrarias_de_laboratorio.jpg Survey Research & Design in Psychology James Neill, 2015 Creative Commons Attribution

More information

Multiple Linear Regression II. Lecture 8. Overview. Readings. Summary of MLR I. Summary of MLR I. Summary of MLR I

Multiple Linear Regression II. Lecture 8. Overview. Readings. Summary of MLR I. Summary of MLR I. Summary of MLR I Multiple Linear Regression II Lecture 8 Image source:http://commons.wikimedia.org/wiki/file:vidrarias_de_laboratorio.jpg Survey Research & Design in Psychology James Neill, 2015 Creative Commons Attribution

More information

Introduction to Machine Learning

Introduction to Machine Learning 1, DATA11002 Introduction to Machine Learning Lecturer: Teemu Roos TAs: Ville Hyvönen and Janne Leppä-aho Department of Computer Science University of Helsinki (based in part on material by Patrik Hoyer

More information

ST430 Exam 1 with Answers

ST430 Exam 1 with Answers ST430 Exam 1 with Answers Date: October 5, 2015 Name: Guideline: You may use one-page (front and back of a standard A4 paper) of notes. No laptop or textook are permitted but you may use a calculator.

More information

ReducedPCR/PLSRmodelsbysubspaceprojections

ReducedPCR/PLSRmodelsbysubspaceprojections ReducedPCR/PLSRmodelsbysubspaceprojections Rolf Ergon Telemark University College P.O.Box 2, N-9 Porsgrunn, Norway e-mail: rolf.ergon@hit.no Published in Chemometrics and Intelligent Laboratory Systems

More information

MULTI-VARIATION ANALYSIS AND OPTIMISATION OF ELECTRICAL CONDUCTIVITY OF MnO-SiO 2 -CaO SLAGS

MULTI-VARIATION ANALYSIS AND OPTIMISATION OF ELECTRICAL CONDUCTIVITY OF MnO-SiO 2 -CaO SLAGS MULTI-VARIATION ANALYSIS AND OPTIMISATION OF ELECTRICAL CONDUCTIVITY OF MnO-SiO 2 -CaO SLAGS M. M. Gasik 1, and M. I. Gasik 2 1 Aalto University School of Science and Technology, 00076 AALTO, Espoo, Finland;

More information

Regression Retrieval Overview. Larry McMillin

Regression Retrieval Overview. Larry McMillin Regression Retrieval Overview Larry McMillin Climate Research and Applications Division National Environmental Satellite, Data, and Information Service Washington, D.C. Larry.McMillin@noaa.gov Pick one

More information

Principal Component Analysis vs. Independent Component Analysis for Damage Detection

Principal Component Analysis vs. Independent Component Analysis for Damage Detection 6th European Workshop on Structural Health Monitoring - Fr..D.4 Principal Component Analysis vs. Independent Component Analysis for Damage Detection D. A. TIBADUIZA, L. E. MUJICA, M. ANAYA, J. RODELLAR

More information

Principal Component Analysis-I Geog 210C Introduction to Spatial Data Analysis. Chris Funk. Lecture 17

Principal Component Analysis-I Geog 210C Introduction to Spatial Data Analysis. Chris Funk. Lecture 17 Principal Component Analysis-I Geog 210C Introduction to Spatial Data Analysis Chris Funk Lecture 17 Outline Filters and Rotations Generating co-varying random fields Translating co-varying fields into

More information

Chemometrics. Matti Hotokka Physical chemistry Åbo Akademi University

Chemometrics. Matti Hotokka Physical chemistry Åbo Akademi University Chemometrics Matti Hotokka Physical chemistry Åbo Akademi University Linear regression Experiment Consider spectrophotometry as an example Beer-Lamberts law: A = cå Experiment Make three known references

More information

An Introduction to Multilevel Models. PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 25: December 7, 2012

An Introduction to Multilevel Models. PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 25: December 7, 2012 An Introduction to Multilevel Models PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 25: December 7, 2012 Today s Class Concepts in Longitudinal Modeling Between-Person vs. +Within-Person

More information

Satellite and gauge rainfall merging using geographically weighted regression

Satellite and gauge rainfall merging using geographically weighted regression 132 Remote Sensing and GIS for Hydrology and Water Resources (IAHS Publ. 368, 2015) (Proceedings RSHS14 and ICGRHWE14, Guangzhou, China, August 2014). Satellite and gauge rainfall merging using geographically

More information

Exam Applied Statistical Regression. Good Luck!

Exam Applied Statistical Regression. Good Luck! Dr. M. Dettling Summer 2011 Exam Applied Statistical Regression Approved: Tables: Note: Any written material, calculator (without communication facility). Attached. All tests have to be done at the 5%-level.

More information

Principal component analysis for compositional data with outliers

Principal component analysis for compositional data with outliers ENVIRONMETRICS Environmetrics 2009; 20: 621 632 Published online 11 February 2009 in Wiley InterScience (www.interscience.wiley.com).966 Principal component analysis for compositional data with outliers

More information

Chapter 3: Examining Relationships

Chapter 3: Examining Relationships Chapter 3: Examining Relationships Most statistical studies involve more than one variable. Often in the AP Statistics exam, you will be asked to compare two data sets by using side by side boxplots or

More information

Lecture 14 Simple Linear Regression

Lecture 14 Simple Linear Regression Lecture 4 Simple Linear Regression Ordinary Least Squares (OLS) Consider the following simple linear regression model where, for each unit i, Y i is the dependent variable (response). X i is the independent

More information