Spatial Analysis and Modeling (GIST 4302/5302) Guofeng Cao Department of Geosciences Texas Tech University
|
|
- Suzanna Crawford
- 5 years ago
- Views:
Transcription
1 Spatial Aalysis ad Modelig (GIST 4302/5302) Guofeg Cao Departmet of Geoscieces Texas Tech Uiversity
2 Outlie of This Week Last week, we leared: spatial poit patter aalysis (PPA) focus o locatio distributio of evets Measure the cluster (spatial autocorrelatio)i poit patter This week, we will lear: How to measure ad detect clusters/spatial autocorrelatio i areal data (regioal data)
3 Spatial Autocorrelatio Spatial autocorrelatioship is everywhere Spatial poit patter K, F, G fuctios Kerel fuctios Areal/lattice (this topic) Geostatistical data (ext topic) 3
4 Spatial Autocorrelatio of Areal Data 4
5 Spatial Autocorrelatio Tobler s first law of geography Spatial auto/cross correlatio If like values ted to cluster together, the the field exhibits high positive spatial If there is o apparet relatioship betwee attribute value ad locatio the there is zero spatial autocorrelatio If like values ted to be located away from each other, the there is egative spatial autocorrelatio 5
6 Positive spatial autocorrelatio - high values surrouded by earby high values - itermediate values surrouded by earby itermediate values - low values surrouded by earby low values 2002 populatio desity Source: Ro Briggs of UT Dallas 6
7 Negative spatial autocorrelatio - high values surrouded by earby low values - itermediate values surrouded by earby itermediate values - low values surrouded by earby high values competitio for space Grocery store desity Source: Ro Briggs of UT Dallas 7
8 Measurig Spatial Autocorrelatio: the problem of measurig earess To measure spatial autocorrelatio, we must kow the earess of our observatios as we did for poit patter case Which poits or polygos are ear or ext to other poits or polygos? Which states are ear Texas? How to measure this? Seems simple ad obvious, but it is ot! 8
9 Spatial Weight Matrix Core cocept i statistical aalysis of areal data Two steps ivolved: defie which relatioships betwee observatios are to be give a ozero weight, i.e., defie spatial eighbors assig weights to the eighbors 9
10 Spatial Neighbors Cotiguity-based eighbors Zoe i ad j are eighbors if zoe i is cotiguity or adjacet to zoe j But what costitutes cotiguity? Distace-based eighbors Zoe i ad j are eighbors if the distace betwee them are less tha the threshold distace But what distace do we use? 10
11 Cotiguity-based Spatial Neighbors Sharig a border or boudary Rook: sharig a border Quee: sharig a border or a poit rook quee Hexagos Irregular Which use? 11
12 Problem Situatios for Irregular Polygos Close but o commo border Legth of border Is Arizoa as close to Califoria as to Utah? Base closeess o proportio of shared border, ot just oe (1) or zero (0) w ij = border legth ij /border legth j ) X 12
13 Higher-Order Cotiguity 1 st order Nearest eighbor rook hexago quee 2 d order Next earest eighbor 13
14 Distace-based Neighbors How to measure distace betwee polygos? Distace metrics 2D Cartesia distace (projected data) 3D spherical distace/great-circle distace (lat/log data) Haversie formula 14
15 Distace-based Neighbors k-earest eighbors Source: Bivad ad Pebesma ad Gomez-Rubio 15
16 Distace-based Neighbors thresh-hold distace (buffer) Source: Bivad ad Pebesma ad Gomez-Rubio 16
17 Neighbor/Coectivity Histogram Source: Bivad ad Pebesma ad Gomez-Rubio 17
18 Spatial Weight Matrix Spatial weights ca be see as a list of weights idexed by a list of eighbors If zoe j is ot a eighbor of zoe i, weights Wij will set to zero The weight matrix ca be illustrated as a image Sparse matrix 18
19 A Simple Example for Rook case Matrix cotais a: 1 if share a border 0 if do ot share a border 4 areal uits 4x4 matrix A C B D Commo border W = A B C D A B C D
20 20
21 Sparse Cotiguity Matrix for US States -- obtaied from Aseli's web site (see powerpoit for lik) Name Fips Ncout N1 N2 N3 N4 N5 N6 N7 N8 Alabama Arizoa Arkasas Califoria Colorado Coecticut Delaware District of Columbia Florida Georgia Idaho Illiois Idiaa Iowa Kasas Ketucky Louisiaa Maie Marylad Massachusetts Michiga Miesota Mississippi Missouri Motaa Nebraska Nevada New Hampshire New Jersey New Mexico New York North Carolia North Dakota Ohio Oklahoma Orego Pesylvaia Rhode Islad South Carolia South Dakota Teessee Texas Utah Vermot Virgiia Washigto West Virgiia Wiscosi Wyomig
22 Style of Spatial Weight Matrix Row a weight of uity for each eighbor relatioship Row stadardizatio Symmetry ot guarateed ca be iterpreted as allowig the calculatio of average values across eighbors Geeral spatial weights based o distaces 22
23 Row vs. Row stadardizatio A B C D E F Divide each umber by the row sum Total umber of eighbors --some have more tha others A B C D E F Row Sum A B C D E F Row stadardized --usually use this A B C D E F Row Sum A B C D E F
24 Geeral Spatial Weights Based o Distace Decay fuctios of distace Most commo choice is the iverse (reciprocal) of the distace betwee locatios i ad j (w ij = 1/d ij ) Other fuctios also used iverse of squared distace (w ij =1/d ij2 ), or egative expoetial (w ij = e -d or w ij = e -d2 ) 24
25 Measure of Spatial Autocorrelatio 25
26 Global Measures ad Local Measures Global Measures A sigle value which applies to the etire data set The same patter or process occurs over the etire geographic area A average for the etire area Local Measures A value calculated for each observatio uit Differet patters or processes may occur i differet parts of the regio A uique umber for each locatio Global measures usually ca be decomposed ito a combiatio of local measures 26
27 Global Measures ad Local Measures Global Measures Joi Cout Mora s I (ad Getis-Ord s G) Local Measures Local Mora s I (ad Getis-Ord s G) 27
28 Joi (or Joit or Jois) Cout Statistic 60 for Rook Case 110 for Quee Case 28
29 Joi Cout: Test Statistic Test Statistic give by: Z= Observed - Expected Expected give by: SD of Expected Expected = radom patter geerated by tossig a coi i each cell. Stadard Deviatio of Expected (stadard error) give by: Where: k is the total umber of jois (eighbors) p B is the expected proportio Black, if radom p W is the expected proportio White m is calculated from k accordig to: 29
30 Gore/Bush Presidetial Electio 2000 Actual Jbb 60 Jgg 21 Jbg 28 Total
31 Joi Cout Statistic for Gore/Bush 2000 by State cadidates probability Bush Gore Actual Expected Sta Dev Z-score Jbb Jgg Jbg Total The expected umber of jois is calculated based o the proportio of votes each received i the electio (for Bush = 109*.499*.499=27.125) There are far more Bush/Bush jois (actual = 60) tha would be expected (27) Positive autocorrelatio There are far fewer Bush/Gore jois (actual = 28) tha would be expected (54) Positive autocorrelatio No strog clusterig evidece for Gore (actual = 21 slightly less tha ) 31
32 Mora s I The most commo measure of Spatial Autocorrelatio Use for poits or polygos Joi Cout statistic oly for polygos Use for a cotiuous variable (ay value) Joi Cout statistic oly for biary variable (1,0) Patrick Alfred Pierce Mora ( ) 32
33 Formula for Mora s I I N åå ij i i= 1 j= 1 = ( åå i= 1 j= 1 w w ij (x ) å i= 1 - x)(x (x i - j - x) x) 2 Where: N is the umber of observatios (poits or polygos) x is the mea of the variable X i is the variable value at a particular locatio X j is the variable value at aother locatio is a weight idexig locatio of i relative to j W ij 33
34 Mora s I ad Correlatio Coefficiet Correlatio Coefficiet [-1, 1] Relatioship betwee two differet variables Mora s I [-1, 1] Spatial autocorrelatio ad ofte ivolves oe (spatially idexed) variable oly Correlatio betwee observatios of a spatial variable at locatio X ad spatial lag of X formed by averagig all the observatio at eighbors of X
35 å i= 1 å i= 1 (y 1(y i - i - y) 2 y)(x i å i= 1 - x)/ (x i - x) 2 Correlatio Coefficiet Note the similarity of the umerator (top) to the measures of spatial associatio discussed earlier if we view Yi as beig the Xi for the eighborig polygo (see ext slide) N åå i= 1 j= 1 ( åå i= 1 j= 1 w w ij ij (x ) i å i= 1 - x)(x (x Spatial auto-correlatio i j - x) - x) 2 = åå w i= 1 (x (x i - x)(x x) 2 i= 1 x)/ ij i j i= 1 j= 1 i= 1 j= 1 å Source: Ro Briggs of UT Dallas - - å (x i åå - x) 2 w 35 ij
36 å i= 1 å i= 1 (y 1(y i - i - y) Yi is the Xi for the eighborig polygo N åå i= 1 j= 1 ( åå i= 1 j= 1 w w ij ij (x ) i å i= 1 - x)(x (x i j 2 - x) y)(x - x) i å i= 1 Mora s I 2 = - (x x)/ i - åå x) 2 w i= 1 (x (x i x) x)(x 2 i= 1 x)/ ij i j i= 1 j= 1 i= 1 j= 1 å Source: Ro Briggs of UT Dallas Correlatio Coefficiet å Spatial weights (x i åå - x) 2 w 36 ij
37 Mora Scatter Plots We ca draw a scatter diagram betwee these two variables (i stadardized form): X ad lag-x (or W_X) The slope of this regressio lie is Mora s I 37
38 Mora Scatter Plots Low/High egative SA High/High positive SA Low/Low positive SA High/Low egative SA 38
39 Mora Scatterplot: Example 39
40 Statistical Sigificace Tests for Mora s I Based o the ormal frequecy distributio with Z I - E( I) = Serror( I ) Where: I is the calculated value for Mora s I from the sample E(I) is the expected value if radom S is the stadard error Statistical sigificace test Mote Carlo test, as we did for spatial patter aalysis Permutatio test No-parametric Data-drive, o assumptio of the data Implemeted i GeoDa 40
41 Test Statistic for Normal Frequecy Distributio *techically 1/(-1) 2.5% Reject ull /(-1) % 1% 2.54 Null Hypothesis: o spatial autocorrelatio *Mora s I = 0 Alterative Hypothesis: spatial autocorrelatio exists *Mora s I > 0 Reject Null Hypothesis if Z test statistic > 1.96 (or < -1.96) ---less tha a 5% chace that, i the populatio, there is o spatial autocorrelatio ---95% cofidet that spatial auto correlatio exits Reject ull at 5% Reject ull at 1% 41
42 Null Hypothesis: o spatial autocorrelatio *Mora s I = 0 Alterative Hypothesis: spatial autocorrelatio exists *Mora s I > 0 Reject Null Hypothesis if Z test statistic > 1.96 (or < -1.96) ---less tha a 5% chace that, i the populatio, there is o spatial autocorrelatio ---95% cofidet that spatial auto correlatio exits 42
43 Local Measures of Spatial Autocorrelatio 43
44 Local Idicators of Spatial Associatio (LISA) Local versios of Mora s I, ad the Getis-Ord G statistic Mora s I is most commoly used, ad the local versio is ofte called Aseli s LISA, or just LISA See: Luc Aseli 1995 Local Idicators of Spatial Associatio-LISA Geographical Aalysis 27:
45 Local Idicators of Spatial Associatio (LISA) The statistic is calculated for each areal uit i the data For each polygo, the idex is calculated based o eighborig polygos with which it shares a border A measure is available for each polygo, these ca be mapped to idicate how spatial autocorrelatio varies over the study regio Each idex has a associated test statistic, we ca also map which of the polygos has a statistically sigificat relatioship with its eighbors, ad show type of relatioship 45
46 Example: 46
47 Bivariate LISA Mora Scatter Plot for GDI vs AL Mora s I is the correlatio betwee X ad Lag-X--the same variable but i earby areas Uivariate Mora s I Bivariate Mora s I is a correlatio betwee X ad a differet variable i earby areas. Mora Sigificace Map for GDI vs. AL 47
48 Bivariate LISA ad the Correlatio Coefficiet Correlatio Coefficiet is the relatioship betwee two differet variables i the same area Bivariate LISA is a correlatio betwee two differet variables i a area ad i earby areas. 48
49 Bivariate Mora Scatter Plot Low/High egative SA High/High positive SA Low/Low positive SA High/Low egative SA 49
50 Summary Spatial autocorrelatio of areal data Spatial weight matrix Measures of spatial autocorrelatio Global Measure Mora s I Local LISA: Mora s I Bivariate LISA Sigificace test 50
51 Cosequeces of Igorig Spatial Autocorrelatio correlatio coefficiets ad coefficiets of determiatio appear bigger tha they really are You thik the relatioship is stroger tha it really is the variables i earby areas affect each other Stadard errors appear smaller tha they really are exaggerated precisio You thik your predictios are better tha they really are sice stadard errors measure predictive accuracy More likely to coclude relatioship is statistically sigificat. 51
52 Diagostic of Spatial Depedece For correlatio calculate Mora s I for each variable ad test its statistical sigificace If Mora s I is sigificat, you may have a problem! For regressio calculate the residuals map the residuals: do you see ay spatial patters? Calculate Mora s I for the residuals: is it statistically sigificat? 52
53 53
54 Whe (spatial) correlatio happes Try to thik of omitted variables ad iclude them i a multiple regressio. Missig (omitted) variables may cause spatial autocorrelatio Regressio assumes all relevat variables ifluecig the depedet variable are icluded If relevat variables are missig, model is misspecified 54
55 Spatial Regressio Methods Spatial Ecoometrics Approaches (available i GeoDa) Lag model Error model Spatial Statistics Approaches Simultaeous Autoregressive Models (SAR) A more geeral case of Spatial Ecoometrics Coditioal Autoregressive Models (CAR) Other methods: Bayesia spatiotemporal methods 55
56 Ed of this topic 56
Spatial Analysis and Modeling (GIST 4302/5302) Guofeng Cao Department of Geosciences Texas Tech University
Spatial Aalysis ad Modelig (GIST 4302/5302) Guofeg Cao Departmet of Geoscieces Texas Tech Uiversity Outlie of This Week Last week, we leared: spatial poit patter aalysis (PPA) focus o locatio distributio
More informationSpatial Analysis and Modeling (GIST 4302/5302) Guofeng Cao Department of Geosciences Texas Tech University
Spatial Aalysis ad Modelig (GIST 4302/5302) Guofeg Cao Departmet of Geoscieces Texas Tech Uiversity Outlie of This Week Last week, we leared: spatial poit patter aalysis (PPA) focus o locatio distributio
More informationSpatial Analysis and Modeling (GIST 4302/5302) Guofeng Cao Department of Geosciences Texas Tech University
Spatial Aalysis ad Modelig (GIST 4302/5302) Guofeg Cao Departmet of Geoscieces Texas Tech Uiversity Outlie of This Week Last week, we leared: spatial poit patter aalysis (PPA) focus o locatio distributio
More informationSpatial Analysis and Modeling (GIST 4302/5302) Guofeng Cao Department of Geosciences Texas Tech University
Spatial Aalysis ad Modelig (GIST 4302/5302) Guofeg Cao Departmet of Geoscieces Texas Tech Uiversity Outlie of This Week Last week, we leared: spatial poit patter aalysis (PPA) focus o locatio distributio
More information1 Inferential Methods for Correlation and Regression Analysis
1 Iferetial Methods for Correlatio ad Regressio Aalysis I the chapter o Correlatio ad Regressio Aalysis tools for describig bivariate cotiuous data were itroduced. The sample Pearso Correlatio Coefficiet
More informationCEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering
CEE 5 Autum 005 Ucertaity Cocepts for Geotechical Egieerig Basic Termiology Set A set is a collectio of (mutually exclusive) objects or evets. The sample space is the (collectively exhaustive) collectio
More informationMOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND.
XI-1 (1074) MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND. R. E. D. WOOLSEY AND H. S. SWANSON XI-2 (1075) STATISTICAL DECISION MAKING Advaced
More informationCorrelation and Regression
Correlatio ad Regressio Lecturer, Departmet of Agroomy Sher-e-Bagla Agricultural Uiversity Correlatio Whe there is a relatioship betwee quatitative measures betwee two sets of pheomea, the appropriate
More informationOverview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions
Chapter 9 Slide Ifereces from Two Samples 9- Overview 9- Ifereces about Two Proportios 9- Ifereces about Two Meas: Idepedet Samples 9-4 Ifereces about Matched Pairs 9-5 Comparig Variatio i Two Samples
More information11 Correlation and Regression
11 Correlatio Regressio 11.1 Multivariate Data Ofte we look at data where several variables are recorded for the same idividuals or samplig uits. For example, at a coastal weather statio, we might record
More informationDESCRIPTIVE STATISTICS
DESCRIPTIVE STATISTICS REVIEW OF KEY CONCEPTS SECTION. Measures of Locatio.. Arithmetic Mea xi x i x+ x + + x Cosider the data i Table.. They represet serum-cholesterol levels from a group of hospital
More informationA statistical method to determine sample size to estimate characteristic value of soil parameters
A statistical method to determie sample size to estimate characteristic value of soil parameters Y. Hojo, B. Setiawa 2 ad M. Suzuki 3 Abstract Sample size is a importat factor to be cosidered i determiig
More informationTMA4245 Statistics. Corrected 30 May and 4 June Norwegian University of Science and Technology Department of Mathematical Sciences.
Norwegia Uiversity of Sciece ad Techology Departmet of Mathematical Scieces Corrected 3 May ad 4 Jue Solutios TMA445 Statistics Saturday 6 May 9: 3: Problem Sow desity a The probability is.9.5 6x x dx
More informationStatistics Lecture 27. Final review. Administrative Notes. Outline. Experiments. Sampling and Surveys. Administrative Notes
Admiistrative Notes s - Lecture 7 Fial review Fial Exam is Tuesday, May 0th (3-5pm Covers Chapters -8 ad 0 i textbook Brig ID cards to fial! Allowed: Calculators, double-sided 8.5 x cheat sheet Exam Rooms:
More informationCorrelation. Two variables: Which test? Relationship Between Two Numerical Variables. Two variables: Which test? Contingency table Grouped bar graph
Correlatio Y Two variables: Which test? X Explaatory variable Respose variable Categorical Numerical Categorical Cotigecy table Cotigecy Logistic Grouped bar graph aalysis regressio Mosaic plot Numerical
More information10. Comparative Tests among Spatial Regression Models. Here we revisit the example in Section 8.1 of estimating the mean of a normal random
Part III. Areal Data Aalysis 0. Comparative Tests amog Spatial Regressio Models While the otio of relative likelihood values for differet models is somewhat difficult to iterpret directly (as metioed above),
More informationChapters 5 and 13: REGRESSION AND CORRELATION. Univariate data: x, Bivariate data (x,y).
Chapters 5 ad 13: REGREION AND CORRELATION (ectios 5.5 ad 13.5 are omitted) Uivariate data: x, Bivariate data (x,y). Example: x: umber of years studets studied paish y: score o a proficiecy test For each
More informationResponse Variable denoted by y it is the variable that is to be predicted measure of the outcome of an experiment also called the dependent variable
Statistics Chapter 4 Correlatio ad Regressio If we have two (or more) variables we are usually iterested i the relatioship betwee the variables. Associatio betwee Variables Two variables are associated
More information3/3/2014. CDS M Phil Econometrics. Types of Relationships. Types of Relationships. Types of Relationships. Vijayamohanan Pillai N.
3/3/04 CDS M Phil Old Least Squares (OLS) Vijayamohaa Pillai N CDS M Phil Vijayamoha CDS M Phil Vijayamoha Types of Relatioships Oly oe idepedet variable, Relatioship betwee ad is Liear relatioships Curviliear
More informationUNIVERSITY OF TORONTO Faculty of Arts and Science APRIL/MAY 2009 EXAMINATIONS ECO220Y1Y PART 1 OF 2 SOLUTIONS
PART of UNIVERSITY OF TORONTO Faculty of Arts ad Sciece APRIL/MAY 009 EAMINATIONS ECO0YY PART OF () The sample media is greater tha the sample mea whe there is. (B) () A radom variable is ormally distributed
More informationSTA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to:
STA 2023 Module 10 Comparig Two Proportios Learig Objectives Upo completig this module, you should be able to: 1. Perform large-sample ifereces (hypothesis test ad cofidece itervals) to compare two populatio
More informationChapter 12 Correlation
Chapter Correlatio Correlatio is very similar to regressio with oe very importat differece. Regressio is used to explore the relatioship betwee a idepedet variable ad a depedet variable, whereas correlatio
More informationTABLES AND FORMULAS FOR MOORE Basic Practice of Statistics
TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics Explorig Data: Distributios Look for overall patter (shape, ceter, spread) ad deviatios (outliers). Mea (use a calculator): x = x 1 + x 2 + +
More informationST 305: Exam 3 ( ) = P(A)P(B A) ( ) = P(A) + P(B) ( ) = 1 P( A) ( ) = P(A) P(B) σ X 2 = σ a+bx. σ ˆp. σ X +Y. σ X Y. σ X. σ Y. σ n.
ST 305: Exam 3 By hadig i this completed exam, I state that I have either give or received assistace from aother perso durig the exam period. I have used o resources other tha the exam itself ad the basic
More informationChapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc.
Chapter 22 Comparig Two Proportios Copyright 2010, 2007, 2004 Pearso Educatio, Ic. Comparig Two Proportios Read the first two paragraphs of pg 504. Comparisos betwee two percetages are much more commo
More informationWorksheet 23 ( ) Introduction to Simple Linear Regression (continued)
Worksheet 3 ( 11.5-11.8) Itroductio to Simple Liear Regressio (cotiued) This worksheet is a cotiuatio of Discussio Sheet 3; please complete that discussio sheet first if you have ot already doe so. This
More informationStat 139 Homework 7 Solutions, Fall 2015
Stat 139 Homework 7 Solutios, Fall 2015 Problem 1. I class we leared that the classical simple liear regressio model assumes the followig distributio of resposes: Y i = β 0 + β 1 X i + ɛ i, i = 1,...,,
More informationSample Size Estimation in the Proportional Hazards Model for K-sample or Regression Settings Scott S. Emerson, M.D., Ph.D.
ample ie Estimatio i the Proportioal Haards Model for K-sample or Regressio ettigs cott. Emerso, M.D., Ph.D. ample ie Formula for a Normally Distributed tatistic uppose a statistic is kow to be ormally
More information- E < p. ˆ p q ˆ E = q ˆ = 1 - p ˆ = sample proportion of x failures in a sample size of n. where. x n sample proportion. population proportion
1 Chapter 7 ad 8 Review for Exam Chapter 7 Estimates ad Sample Sizes 2 Defiitio Cofidece Iterval (or Iterval Estimate) a rage (or a iterval) of values used to estimate the true value of the populatio parameter
More informationIf, for instance, we were required to test whether the population mean μ could be equal to a certain value μ
STATISTICAL INFERENCE INTRODUCTION Statistical iferece is that brach of Statistics i which oe typically makes a statemet about a populatio based upo the results of a sample. I oesample testig, we essetially
More informationHYPOTHESIS TESTS FOR ONE POPULATION MEAN WORKSHEET MTH 1210, FALL 2018
HYPOTHESIS TESTS FOR ONE POPULATION MEAN WORKSHEET MTH 1210, FALL 2018 We are resposible for 2 types of hypothesis tests that produce ifereces about the ukow populatio mea, µ, each of which has 3 possible
More informationSection 9.2. Tests About a Population Proportion 12/17/2014. Carrying Out a Significance Test H A N T. Parameters & Hypothesis
Sectio 9.2 Tests About a Populatio Proportio P H A N T O M S Parameters Hypothesis Assess Coditios Name the Test Test Statistic (Calculate) Obtai P value Make a decisio State coclusio Sectio 9.2 Tests
More informationProperties and Hypothesis Testing
Chapter 3 Properties ad Hypothesis Testig 3.1 Types of data The regressio techiques developed i previous chapters ca be applied to three differet kids of data. 1. Cross-sectioal data. 2. Time series data.
More informationMathematical Notation Math Introduction to Applied Statistics
Mathematical Notatio Math 113 - Itroductio to Applied Statistics Name : Use Word or WordPerfect to recreate the followig documets. Each article is worth 10 poits ad ca be prited ad give to the istructor
More informationFACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures
FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING Lectures MODULE 5 STATISTICS II. Mea ad stadard error of sample data. Biomial distributio. Normal distributio 4. Samplig 5. Cofidece itervals
More informationCircle the single best answer for each multiple choice question. Your choice should be made clearly.
TEST #1 STA 4853 March 6, 2017 Name: Please read the followig directios. DO NOT TURN THE PAGE UNTIL INSTRUCTED TO DO SO Directios This exam is closed book ad closed otes. There are 32 multiple choice questios.
More information[ ] ( ) ( ) [ ] ( ) 1 [ ] [ ] Sums of Random Variables Y = a 1 X 1 + a 2 X 2 + +a n X n The expected value of Y is:
PROBABILITY FUNCTIONS A radom variable X has a probabilit associated with each of its possible values. The probabilit is termed a discrete probabilit if X ca assume ol discrete values, or X = x, x, x 3,,
More informationCorrelation Regression
Correlatio Regressio While correlatio methods measure the stregth of a liear relatioship betwee two variables, we might wish to go a little further: How much does oe variable chage for a give chage i aother
More informationExample: Find the SD of the set {x j } = {2, 4, 5, 8, 5, 11, 7}.
1 (*) If a lot of the data is far from the mea, the may of the (x j x) 2 terms will be quite large, so the mea of these terms will be large ad the SD of the data will be large. (*) I particular, outliers
More informationChapter 23: Inferences About Means
Chapter 23: Ifereces About Meas Eough Proportios! We ve spet the last two uits workig with proportios (or qualitative variables, at least) ow it s time to tur our attetios to quatitative variables. For
More informationAlgebra of Least Squares
October 19, 2018 Algebra of Least Squares Geometry of Least Squares Recall that out data is like a table [Y X] where Y collects observatios o the depedet variable Y ad X collects observatios o the k-dimesioal
More informationLinear Regression Models
Liear Regressio Models Dr. Joh Mellor-Crummey Departmet of Computer Sciece Rice Uiversity johmc@cs.rice.edu COMP 528 Lecture 9 15 February 2005 Goals for Today Uderstad how to Use scatter diagrams to ispect
More informationHypothesis Testing. Evaluation of Performance of Learned h. Issues. Trade-off Between Bias and Variance
Hypothesis Testig Empirically evaluatig accuracy of hypotheses: importat activity i ML. Three questios: Give observed accuracy over a sample set, how well does this estimate apply over additioal samples?
More informationLecture 1 Probability and Statistics
Wikipedia: Lecture 1 Probability ad Statistics Bejami Disraeli, British statesma ad literary figure (1804 1881): There are three kids of lies: lies, damed lies, ad statistics. popularized i US by Mark
More informationNCSS Statistical Software. Tolerance Intervals
Chapter 585 Itroductio This procedure calculates oe-, ad two-, sided tolerace itervals based o either a distributio-free (oparametric) method or a method based o a ormality assumptio (parametric). A two-sided
More informationResampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.
Jauary 1, 2019 Resamplig Methods Motivatio We have so may estimators with the property θ θ d N 0, σ 2 We ca also write θ a N θ, σ 2 /, where a meas approximately distributed as Oce we have a cosistet estimator
More informationFrequentist Inference
Frequetist Iferece The topics of the ext three sectios are useful applicatios of the Cetral Limit Theorem. Without kowig aythig about the uderlyig distributio of a sequece of radom variables {X i }, for
More informationChapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc.
Chapter 22 Comparig Two Proportios Copyright 2010 Pearso Educatio, Ic. Comparig Two Proportios Comparisos betwee two percetages are much more commo tha questios about isolated percetages. Ad they are more
More informationNANYANG TECHNOLOGICAL UNIVERSITY SYLLABUS FOR ENTRANCE EXAMINATION FOR INTERNATIONAL STUDENTS AO-LEVEL MATHEMATICS
NANYANG TECHNOLOGICAL UNIVERSITY SYLLABUS FOR ENTRANCE EXAMINATION FOR INTERNATIONAL STUDENTS AO-LEVEL MATHEMATICS STRUCTURE OF EXAMINATION PAPER. There will be oe 2-hour paper cosistig of 4 questios.
More informationChapter 13, Part A Analysis of Variance and Experimental Design
Slides Prepared by JOHN S. LOUCKS St. Edward s Uiversity Slide 1 Chapter 13, Part A Aalysis of Variace ad Eperimetal Desig Itroductio to Aalysis of Variace Aalysis of Variace: Testig for the Equality of
More information1 Constructing and Interpreting a Confidence Interval
Itroductory Applied Ecoometrics EEP/IAS 118 Sprig 2014 WARM UP: Match the terms i the table with the correct formula: Adrew Crae-Droesch Sectio #6 5 March 2014 ˆ Let X be a radom variable with mea µ ad
More informationLecture 2: Monte Carlo Simulation
STAT/Q SCI 43: Itroductio to Resamplig ethods Sprig 27 Istructor: Ye-Chi Che Lecture 2: ote Carlo Simulatio 2 ote Carlo Itegratio Assume we wat to evaluate the followig itegratio: e x3 dx What ca we do?
More informationLinear Regression Demystified
Liear Regressio Demystified Liear regressio is a importat subject i statistics. I elemetary statistics courses, formulae related to liear regressio are ofte stated without derivatio. This ote iteds to
More informationMath 140 Introductory Statistics
8.2 Testig a Proportio Math 1 Itroductory Statistics Professor B. Abrego Lecture 15 Sectios 8.2 People ofte make decisios with data by comparig the results from a sample to some predetermied stadard. These
More informationGoodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen)
Goodess-of-Fit Tests ad Categorical Data Aalysis (Devore Chapter Fourtee) MATH-252-01: Probability ad Statistics II Sprig 2019 Cotets 1 Chi-Squared Tests with Kow Probabilities 1 1.1 Chi-Squared Testig................
More informationSTATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS. Comments:
Recall: STATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS Commets:. So far we have estimates of the parameters! 0 ad!, but have o idea how good these estimates are. Assumptio: E(Y x)! 0 +! x (liear coditioal
More informationDS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10
DS 00: Priciples ad Techiques of Data Sciece Date: April 3, 208 Name: Hypothesis Testig Discussio #0. Defie these terms below as they relate to hypothesis testig. a) Data Geeratio Model: Solutio: A set
More informationLINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity
LINEAR REGRESSION ANALYSIS MODULE IX Lecture - 9 Multicolliearity Dr Shalabh Departmet of Mathematics ad Statistics Idia Istitute of Techology Kapur Multicolliearity diagostics A importat questio that
More informationA proposed discrete distribution for the statistical modeling of
It. Statistical Ist.: Proc. 58th World Statistical Cogress, 0, Dubli (Sessio CPS047) p.5059 A proposed discrete distributio for the statistical modelig of Likert data Kidd, Marti Cetre for Statistical
More informationPH 425 Quantum Measurement and Spin Winter SPINS Lab 1
PH 425 Quatum Measuremet ad Spi Witer 23 SPIS Lab Measure the spi projectio S z alog the z-axis This is the experimet that is ready to go whe you start the program, as show below Each atom is measured
More informationStatistics 511 Additional Materials
Cofidece Itervals o mu Statistics 511 Additioal Materials This topic officially moves us from probability to statistics. We begi to discuss makig ifereces about the populatio. Oe way to differetiate probability
More informationt distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference
EXST30 Backgroud material Page From the textbook The Statistical Sleuth Mea [0]: I your text the word mea deotes a populatio mea (µ) while the work average deotes a sample average ( ). Variace [0]: The
More informationMEASURES OF DISPERSION (VARIABILITY)
POLI 300 Hadout #7 N. R. Miller MEASURES OF DISPERSION (VARIABILITY) While measures of cetral tedecy idicate what value of a variable is (i oe sese or other, e.g., mode, media, mea), average or cetral
More informationRegression, Inference, and Model Building
Regressio, Iferece, ad Model Buildig Scatter Plots ad Correlatio Correlatio coefficiet, r -1 r 1 If r is positive, the the scatter plot has a positive slope ad variables are said to have a positive relatioship
More informationDr. Maddah ENMG 617 EM Statistics 11/26/12. Multiple Regression (2) (Chapter 15, Hines)
Dr Maddah NMG 617 M Statistics 11/6/1 Multiple egressio () (Chapter 15, Hies) Test for sigificace of regressio This is a test to determie whether there is a liear relatioship betwee the depedet variable
More informationEconomics 241B Relation to Method of Moments and Maximum Likelihood OLSE as a Maximum Likelihood Estimator
Ecoomics 24B Relatio to Method of Momets ad Maximum Likelihood OLSE as a Maximum Likelihood Estimator Uder Assumptio 5 we have speci ed the distributio of the error, so we ca estimate the model parameters
More informationLecture 5: Parametric Hypothesis Testing: Comparing Means. GENOME 560, Spring 2016 Doug Fowler, GS
Lecture 5: Parametric Hypothesis Testig: Comparig Meas GENOME 560, Sprig 2016 Doug Fowler, GS (dfowler@uw.edu) 1 Review from last week What is a cofidece iterval? 2 Review from last week What is a cofidece
More informationCommutativity in Permutation Groups
Commutativity i Permutatio Groups Richard Wito, PhD Abstract I the group Sym(S) of permutatios o a oempty set S, fixed poits ad trasiet poits are defied Prelimiary results o fixed ad trasiet poits are
More informationLecture 22: Review for Exam 2. 1 Basic Model Assumptions (without Gaussian Noise)
Lecture 22: Review for Exam 2 Basic Model Assumptios (without Gaussia Noise) We model oe cotiuous respose variable Y, as a liear fuctio of p umerical predictors, plus oise: Y = β 0 + β X +... β p X p +
More informationFinal Examination Solutions 17/6/2010
The Islamic Uiversity of Gaza Faculty of Commerce epartmet of Ecoomics ad Political Scieces A Itroductio to Statistics Course (ECOE 30) Sprig Semester 009-00 Fial Eamiatio Solutios 7/6/00 Name: I: Istructor:
More informationMASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 19 11/17/2008 LAWS OF LARGE NUMBERS II THE STRONG LAW OF LARGE NUMBERS
MASSACHUSTTS INSTITUT OF TCHNOLOGY 6.436J/5.085J Fall 2008 Lecture 9 /7/2008 LAWS OF LARG NUMBRS II Cotets. The strog law of large umbers 2. The Cheroff boud TH STRONG LAW OF LARG NUMBRS While the weak
More informationRandom Variables, Sampling and Estimation
Chapter 1 Radom Variables, Samplig ad Estimatio 1.1 Itroductio This chapter will cover the most importat basic statistical theory you eed i order to uderstad the ecoometric material that will be comig
More informationMedian and IQR The median is the value which divides the ordered data values in half.
STA 666 Fall 2007 Web-based Course Notes 4: Describig Distributios Numerically Numerical summaries for quatitative variables media ad iterquartile rage (IQR) 5-umber summary mea ad stadard deviatio Media
More informationPSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 9
Hypothesis testig PSYCHOLOGICAL RESEARCH (PYC 34-C Lecture 9 Statistical iferece is that brach of Statistics i which oe typically makes a statemet about a populatio based upo the results of a sample. I
More informationLecture 1 Probability and Statistics
Wikipedia: Lecture 1 Probability ad Statistics Bejami Disraeli, British statesma ad literary figure (1804 1881): There are three kids of lies: lies, damed lies, ad statistics. popularized i US by Mark
More information4 Multidimensional quantitative data
Chapter 4 Multidimesioal quatitative data 4 Multidimesioal statistics Basic statistics are ow part of the curriculum of most ecologists However, statistical techiques based o such simple distributios as
More informationSets are collection of objects that can be displayed in different forms. Two of these forms are called Roster Method and Builder Set Notation.
Sectio 2.1 Set ad Set Operators Defiitio of a set set is a collectio of objects thigs or umbers. Sets are collectio of objects that ca be displayed i differet forms. Two of these forms are called Roster
More informationStatisticians use the word population to refer the total number of (potential) observations under consideration
6 Samplig Distributios Statisticias use the word populatio to refer the total umber of (potetial) observatios uder cosideratio The populatio is just the set of all possible outcomes i our sample space
More informationENGI 4421 Probability and Statistics Faculty of Engineering and Applied Science Problem Set 1 Solutions Descriptive Statistics. None at all!
ENGI 44 Probability ad Statistics Faculty of Egieerig ad Applied Sciece Problem Set Solutios Descriptive Statistics. If, i the set of values {,, 3, 4, 5, 6, 7 } a error causes the value 5 to be replaced
More informationContinuous Data that can take on any real number (time/length) based on sample data. Categorical data can only be named or categorised
Questio 1. (Topics 1-3) A populatio cosists of all the members of a group about which you wat to draw a coclusio (Greek letters (μ, σ, Ν) are used) A sample is the portio of the populatio selected for
More informationSuccessful HE applicants. Information sheet A Number of applicants. Gender Applicants Accepts Applicants Accepts. Age. Domicile
Successful HE applicats Sigificace tests use data from samples to test hypotheses. You will use data o successful applicatios for courses i higher educatio to aswer questios about proportios, for example,
More informationIntroductory statistics
CM9S: Machie Learig for Bioiformatics Lecture - 03/3/06 Itroductory statistics Lecturer: Sriram Sakararama Scribe: Sriram Sakararama We will provide a overview of statistical iferece focussig o the key
More informationSTP 226 EXAMPLE EXAM #1
STP 226 EXAMPLE EXAM #1 Istructor: Hoor Statemet: I have either give or received iformatio regardig this exam, ad I will ot do so util all exams have bee graded ad retured. PRINTED NAME: Siged Date: DIRECTIONS:
More informationSample Size Determination (Two or More Samples)
Sample Sie Determiatio (Two or More Samples) STATGRAPHICS Rev. 963 Summary... Data Iput... Aalysis Summary... 5 Power Curve... 5 Calculatios... 6 Summary This procedure determies a suitable sample sie
More informationA quick activity - Central Limit Theorem and Proportions. Lecture 21: Testing Proportions. Results from the GSS. Statistics and the General Population
A quick activity - Cetral Limit Theorem ad Proportios Lecture 21: Testig Proportios Statistics 10 Coli Rudel Flip a coi 30 times this is goig to get loud! Record the umber of heads you obtaied ad calculate
More informationConfidence Intervals for the Population Proportion p
Cofidece Itervals for the Populatio Proportio p The cocept of cofidece itervals for the populatio proportio p is the same as the oe for, the samplig distributio of the mea, x. The structure is idetical:
More informationII. Descriptive Statistics D. Linear Correlation and Regression. 1. Linear Correlation
II. Descriptive Statistics D. Liear Correlatio ad Regressio I this sectio Liear Correlatio Cause ad Effect Liear Regressio 1. Liear Correlatio Quatifyig Liear Correlatio The Pearso product-momet correlatio
More informationGUIDE FOR THE USE OF THE DECISION SUPPORT SYSTEM (DSS)*
GUIDE FOR THE USE OF THE DECISION SUPPORT SYSTEM (DSS)* *Note: I Frech SAD (Système d Aide à la Décisio) 1. Itroductio to the DSS Eightee statistical distributios are available i HYFRAN-PLUS software to
More informationBIOS 4110: Introduction to Biostatistics. Breheny. Lab #9
BIOS 4110: Itroductio to Biostatistics Brehey Lab #9 The Cetral Limit Theorem is very importat i the realm of statistics, ad today's lab will explore the applicatio of it i both categorical ad cotiuous
More informationMap as Outcomes of Processes. Outline. Map as Outcomes of Processes
Map as Outcomes of Processes Outlie Defiitios: processes & patters A Startig poit: Complete Spatial Radomess More defiitio: Statioary Map as Outcomes of Processes The basic assumptio of Spatial Aalysis:
More informationChapter 6 Sampling Distributions
Chapter 6 Samplig Distributios 1 I most experimets, we have more tha oe measuremet for ay give variable, each measuremet beig associated with oe radomly selected a member of a populatio. Hece we eed to
More informationBecause it tests for differences between multiple pairs of means in one test, it is called an omnibus test.
Math 308 Sprig 018 Classes 19 ad 0: Aalysis of Variace (ANOVA) Page 1 of 6 Itroductio ANOVA is a statistical procedure for determiig whether three or more sample meas were draw from populatios with equal
More informationJoint Probability Distributions and Random Samples. Jointly Distributed Random Variables. Chapter { }
UCLA STAT A Applied Probability & Statistics for Egieers Istructor: Ivo Diov, Asst. Prof. I Statistics ad Neurology Teachig Assistat: Neda Farziia, UCLA Statistics Uiversity of Califoria, Los Ageles, Sprig
More informationDiscrete Mathematics for CS Spring 2008 David Wagner Note 22
CS 70 Discrete Mathematics for CS Sprig 2008 David Wager Note 22 I.I.D. Radom Variables Estimatig the bias of a coi Questio: We wat to estimate the proportio p of Democrats i the US populatio, by takig
More informationS Y Y = ΣY 2 n. Using the above expressions, the correlation coefficient is. r = SXX S Y Y
1 Sociology 405/805 Revised February 4, 004 Summary of Formulae for Bivariate Regressio ad Correlatio Let X be a idepedet variable ad Y a depedet variable, with observatios for each of the values of these
More informationExpectation and Variance of a random variable
Chapter 11 Expectatio ad Variace of a radom variable The aim of this lecture is to defie ad itroduce mathematical Expectatio ad variace of a fuctio of discrete & cotiuous radom variables ad the distributio
More informationENGI 4421 Confidence Intervals (Two Samples) Page 12-01
ENGI 44 Cofidece Itervals (Two Samples) Page -0 Two Sample Cofidece Iterval for a Differece i Populatio Meas [Navidi sectios 5.4-5.7; Devore chapter 9] From the cetral limit theorem, we kow that, for sufficietly
More informationTopic 18: Composite Hypotheses
Toc 18: November, 211 Simple hypotheses limit us to a decisio betwee oe of two possible states of ature. This limitatio does ot allow us, uder the procedures of hypothesis testig to address the basic questio:
More informationECON 3150/4150, Spring term Lecture 3
Itroductio Fidig the best fit by regressio Residuals ad R-sq Regressio ad causality Summary ad ext step ECON 3150/4150, Sprig term 2014. Lecture 3 Ragar Nymoe Uiversity of Oslo 21 Jauary 2014 1 / 30 Itroductio
More informationMidtermII Review. Sta Fall Office Hours Wednesday 12:30-2:30pm Watch linear regression videos before lab on Thursday
Aoucemets MidtermII Review Sta 101 - Fall 2016 Duke Uiversity, Departmet of Statistical Sciece Office Hours Wedesday 12:30-2:30pm Watch liear regressio videos before lab o Thursday Dr. Abrahamse Slides
More information