Spatial Analysis and Modeling (GIST 4302/5302) Guofeng Cao Department of Geosciences Texas Tech University
|
|
- Bruno Parker
- 5 years ago
- Views:
Transcription
1 Spatial Aalysis ad Modelig (GIST 4302/5302) Guofeg Cao Departmet of Geoscieces Texas Tech Uiversity
2 Outlie of This Week Last week, we leared: spatial poit patter aalysis (PPA) focus o locatio distributio of evets Measure the cluster (spatial autocorrelatio)i poit patter This week, we will lear: How to measure ad detect clusters/spatial autocorrelatio i areal data (regioal data)
3 Spatial Autocorrelatio Spatial autocorrelatioship is everywhere Spatial poit patter K, G fuctios Kerel fuctios Areal/lattice (this topic) Geostatistical data (ext topic) 3
4 Spatial Autocorrelatio of Areal Data 4
5 Spatial Autocorrelatio Tobler s first law of geography Spatial auto/cross correlatio If like values ted to cluster together, the the field exhibits high positive spatial If there is o apparet relatioship betwee attribute value ad locatio the there is zero spatial autocorrelatio If like values ted to be located away from each other, the there is egative spatial autocorrelatio 5
6 Positive spatial autocorrelatio - high values surrouded by earby high values - itermediate values surrouded by earby itermediate values - low values surrouded by earby low values 2002 populatio desity Source: Ro Briggs of UT Dallas 6
7 Negative spatial autocorrelatio - high values surrouded by earby low values - itermediate values surrouded by earby itermediate values - low values surrouded by earby high values competitio for space Grocery store desity Source: Ro Briggs of UT Dallas 7
8 Measurig Spatial Autocorrelatio: the problem of measurig earess To measure spatial autocorrelatio, we must kow the earess of our observatios as we did for poit patter case Which poits or polygos are ear or ext to other poits or polygos? Which states are ear Texas? How to measure this? Seems simple ad obvious, but it is ot! 8
9 Spatial Weight Matrix Core cocept i statistical aalysis of areal data Two steps ivolved: defie which relatioships betwee observatios are to be give a ozero weight, i.e., defie spatial eighbors assig weights to the eighbors 9
10 Spatial Neighbors Cotiguity-based eighbors Zoe i ad j are eighbors if zoe i is cotiguity or adjacet to zoe j But what costitutes cotiguity? Distace-based eighbors Zoe i ad j are eighbors if the distace betwee them are less tha the threshold distace But what distace do we use? 10
11 Cotiguity-based Spatial Neighbors Sharig a border or boudary Rook: sharig a border Quee: sharig a border or a poit rook quee Hexagos Irregular Which use? 11
12 Higher-Order Cotiguity 1 st order Nearest eighbor rook hexago quee 2 d order Next earest eighbor 12
13 Distace-based Neighbors How to measure distace betwee polygos? Distace metrics 2D Cartesia distace (projected data) 3D spherical distace/great-circle distace (lat/ log data) Haversie formula 13
14 Distace-based Neighbors k-earest eighbors Source: Bivad ad Pebesma ad Gomez-Rubio 14
15 Distace-based Neighbors thresh-hold distace (buffer) Source: Bivad ad Pebesma ad Gomez-Rubio 15
16 Neighbor/Coectivity Histogram Source: Bivad ad Pebesma ad Gomez-Rubio 16
17 Spatial Weight Matrix Spatial weights ca be see as a list of weights idexed by a list of eighbors If zoe j is ot a eighbor of zoe i, weights Wij will set to zero The weight matrix ca be illustrated as a image Sparse matrix 17
18 A Simple Example for Rook case Matrix cotais a: 1 if share a border 0 if do ot share a border 4 areal uits 4x4 matrix A C B D Commo border W = A B C D A B C D
19 19
20 Sparse Cotiguity Matrix for US States -- obtaied from Aseli's web site (see powerpoit for lik) Name Fips Ncout N1 N2 N3 N4 N5 N6 N7 N8 Alabama Arizoa Arkasas Califoria Colorado Coecticut Delaware District of Columbia Florida Georgia Idaho Illiois Idiaa Iowa Kasas Ketucky Louisiaa Maie Marylad Massachusetts Michiga Miesota Mississippi Missouri Motaa Nebraska Nevada New Hampshire New Jersey New Mexico New York North Carolia North Dakota Ohio Oklahoma Orego Pesylvaia Rhode Islad South Carolia South Dakota Teessee Texas Utah Vermot Virgiia Washigto West Virgiia Wiscosi Wyomig
21 Style of Spatial Weight Matrix Row a weight of uity for each eighbor relatioship Row stadardizatio Symmetry ot guarateed ca be iterpreted as allowig the calculatio of average values across eighbors Geeral spatial weights based o distaces 21
22 Row vs. Row stadardizatio A B C D E F Divide each umber by the row sum Total umber of eighbors --some have more tha others A B C D E F Row Sum A B C D E F Row stadardized --usually use this A B C D E F Row Sum A B C D E F
23 Geeral Spatial Weights Based o Distace Decay fuctios of distace Most commo choice is the iverse (reciprocal) of the distace betwee locatios i ad j (w ij = 1/d ij ) Other fuctios also used iverse of squared distace (w ij =1/d ij2 ), or egative expoetial (w ij = e -d or w ij = e -d2 ) 23
24 Measure of Spatial Autocorrelatio 24
25 Global Measures ad Local Measures Global Measures A sigle value which applies to the etire data set The same patter or process occurs over the etire geographic area A average for the etire area Local Measures A value calculated for each observatio uit Differet patters or processes may occur i differet parts of the regio A uique umber for each locatio Global measures usually ca be decomposed ito a combiatio of local measures 25
26 Global Measures ad Local Measures Global Measures Joi Cout Mora s I Local Measures Local Mora s I 26
27 Joi (or Joit or Jois) Cout Statistic 60 for Rook Case 110 for Quee Case 27
28 Joi Cout: Test Statistic Test Statistic give by: Z= Observed - Expected SD of Expected Expected = radom patter geerated by tossig a coi i each cell. Expected give by: Stadard Deviatio of Expected (stadard error) give by: Where: k is the total umber of jois (eighbors) p B is the expected proportio Black, if radom p W is the expected proportio White m is calculated from k accordig to: 28
29 Gore/Bush Presidetial Electio 2000 Actual Jbb 60 Jgg 21 Jbg 28 Total
30 Joi Cout Statistic for Gore/Bush 2000 by State cadidates probability Bush Gore Actual Expected Sta Dev Z-score Jbb Jgg Jbg Total The expected umber of jois is calculated based o the proportio of votes each received i the electio (for Bush = 109*.499*.499=27.125) There are far more Bush/Bush jois (actual = 60) tha would be expected (27) Positive autocorrelatio There are far fewer Bush/Gore jois (actual = 28) tha would be expected (54) Positive autocorrelatio No strog clusterig evidece for Gore (actual = 21 slightly less tha ) 30
31 Mora s I The most commo measure of Spatial Autocorrelatio Use for poits or polygos Joi Cout statistic oly for polygos Use for a cotiuous variable (ay value) Joi Cout statistic oly for biary variable (1,0) Patrick Alfred Pierce Mora ( ) 31
32 Formula for Mora s I I N ij i i= 1 j= 1 = ( i= 1 j= 1 w w ij (x ) i= 1 x)(x (x i j x) x) 2 Where: N is the umber of observatios (poits or polygos) x is the mea of the variable X i is the variable value at a particular locatio X j is the variable value at aother locatio is a weight idexig locatio of i relative to j W ij 32
33 Mora s I ad Correlatio Coefficiet Correlatio Coefficiet [-1, 1] Relatioship betwee two differet variables Mora s I [-1, 1] Spatial autocorrelatio ad ofte ivolves oe (spatially idexed) variable oly Correlatio betwee observatios of a spatial variable at locatio X ad spatial lag of X formed by averagig all the observatio at eighbors of X
34 i= 1 i= 1 (y 1(y i i y) 2 y)(x i i= 1 x)/ (x i x) 2 Correlatio Coefficiet Note the similarity of the umerator (top) to the measures of spatial associatio discussed earlier if we view Yi as beig the Xi for the eighborig polygo (see ext slide) N i= 1 j= 1 ( i= 1 j= 1 w w ij ij (x ) i i= 1 x)(x (x Spatial auto-correlatio i j x) x) 2 = w i= 1 (x (x i x)(x x) 2 i= 1 x)/ ij i j i= 1 j= 1 i= 1 j= 1 Source: Ro Briggs of UT Dallas (x i x) 2 w 34 ij
35 i= 1 i= 1 (y 1(y i i y) Yi is the Xi for the eighborig polygo N i= 1 j= 1 ( i= 1 j= 1 w w ij ij (x ) i i= 1 x)(x (x i j 2 x) y)(x x) i i= 1 Mora s I 2 = (x x)/ i x) 2 w i= 1 (x (x i x) x)(x 2 i= 1 x)/ ij i j i= 1 j= 1 i= 1 j= 1 Source: Ro Briggs of UT Dallas Correlatio Coefficiet Spatial weights (x i x) 2 w 35 ij
36 Mora Scatter Plots We ca draw a scatter diagram betwee these two variables (i stadardized form): X ad lag-x (or W_X) The slope of this regressio lie is Mora s I 36
37 Mora Scatter Plots Low/High egative SA High/High positive SA Low/Low positive SA High/Low egative SA 37
38 Mora Scatterplot: Example 38
39 Statistical Sigificace Tests for Mora s I Based o the ormal frequecy distributio with Z I E( I) = Serror( I ) Where: I is the calculated value for Mora s I from the sample E(I) is the expected value if radom S is the stadard error Statistical sigificace test Mote Carlo test, as we did for spatial patter aalysis Permutatio test No-parametric Data-drive, o assumptio of the data Implemeted i GeoDa 39
40 Test Statistic for Normal Frequecy Distributio *techically 1/(-1) 2.5% Reject ull /(-1) % 1% 2.54 Null Hypothesis: o spatial autocorrelatio *Mora s I = 0 Alterative Hypothesis: spatial autocorrelatio exists *Mora s I > 0 Reject Null Hypothesis if Z test statistic > 1.96 (or < -1.96) ---less tha a 5% chace that, i the populatio, there is o spatial autocorrelatio ---95% cofidet that spatial auto correlatio exits Reject ull at 5% Reject ull at 1% 40
41 Null Hypothesis: o spatial autocorrelatio *Mora s I = 0 Alterative Hypothesis: spatial autocorrelatio exists *Mora s I > 0 Reject Null Hypothesis if Z test statistic > 1.96 (or < -1.96) ---less tha a 5% chace that, i the populatio, there is o spatial autocorrelatio ---95% cofidet that spatial auto correlatio exits 41
42 Bivariate Mora Scatter Plot Low/High egative SA High/High positive SA Low/Low positive SA High/Low egative SA 42
43 Spatial Autocorrelatio vs Correlatio Spatial Autocorrelatio: shows the associatio or relatioship betwee the same variable i earby areas. Stadard Correlatio shows the associatio or relatioship betwee two differet variables 43
44 Cosequeces of Igorig Spatial Autocorrelatio correlatio coefficiets ad coefficiets of determiatio appear bigger tha they really are You thik the relatioship is stroger tha it really is the variables i earby areas affect each other Stadard errors appear smaller tha they really are exaggerated precisio You thik your predictios are better tha they really are sice stadard errors measure predictive accuracy More likely to coclude relatioship is statistically sigificat. 44
45 Diagostic of Spatial Depedece For correlatio calculate Mora s I for each variable ad test its statistical sigificace If Mora s I is sigificat, you may have a problem! For regressio calculate the residuals map the residuals: do you see ay spatial patters? Calculate Mora s I for the residuals: is it statistically sigificat? 45
46 Summary Spatial autocorrelatio of areal data Spatial weight matrix Measures of spatial autocorrelatio Global Measure Mora s I Cosequeces of igorig spatial autocorrelatio Sigificace test 46
47 Ed of this topic 47
Spatial Analysis and Modeling (GIST 4302/5302) Guofeng Cao Department of Geosciences Texas Tech University
Spatial Aalysis ad Modelig (GIST 4302/5302) Guofeg Cao Departmet of Geoscieces Texas Tech Uiversity Outlie of This Week Last week, we leared: spatial poit patter aalysis (PPA) focus o locatio distributio
More informationSpatial Analysis and Modeling (GIST 4302/5302) Guofeng Cao Department of Geosciences Texas Tech University
Spatial Aalysis ad Modelig (GIST 4302/5302) Guofeg Cao Departmet of Geoscieces Texas Tech Uiversity Outlie of This Week Last week, we leared: spatial poit patter aalysis (PPA) focus o locatio distributio
More informationSpatial Analysis and Modeling (GIST 4302/5302) Guofeng Cao Department of Geosciences Texas Tech University
Spatial Aalysis ad Modelig (GIST 4302/5302) Guofeg Cao Departmet of Geoscieces Texas Tech Uiversity Outlie of This Week Last week, we leared: spatial poit patter aalysis (PPA) focus o locatio distributio
More informationSpatial Analysis and Modeling (GIST 4302/5302) Guofeng Cao Department of Geosciences Texas Tech University
Spatial Aalysis ad Modelig (GIST 4302/5302) Guofeg Cao Departmet of Geoscieces Texas Tech Uiversity Outlie of This Week Last week, we leared: spatial poit patter aalysis (PPA) focus o locatio distributio
More information1 Inferential Methods for Correlation and Regression Analysis
1 Iferetial Methods for Correlatio ad Regressio Aalysis I the chapter o Correlatio ad Regressio Aalysis tools for describig bivariate cotiuous data were itroduced. The sample Pearso Correlatio Coefficiet
More informationMOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND.
XI-1 (1074) MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND. R. E. D. WOOLSEY AND H. S. SWANSON XI-2 (1075) STATISTICAL DECISION MAKING Advaced
More informationDESCRIPTIVE STATISTICS
DESCRIPTIVE STATISTICS REVIEW OF KEY CONCEPTS SECTION. Measures of Locatio.. Arithmetic Mea xi x i x+ x + + x Cosider the data i Table.. They represet serum-cholesterol levels from a group of hospital
More informationA statistical method to determine sample size to estimate characteristic value of soil parameters
A statistical method to determie sample size to estimate characteristic value of soil parameters Y. Hojo, B. Setiawa 2 ad M. Suzuki 3 Abstract Sample size is a importat factor to be cosidered i determiig
More informationCorrelation. Two variables: Which test? Relationship Between Two Numerical Variables. Two variables: Which test? Contingency table Grouped bar graph
Correlatio Y Two variables: Which test? X Explaatory variable Respose variable Categorical Numerical Categorical Cotigecy table Cotigecy Logistic Grouped bar graph aalysis regressio Mosaic plot Numerical
More information11 Correlation and Regression
11 Correlatio Regressio 11.1 Multivariate Data Ofte we look at data where several variables are recorded for the same idividuals or samplig uits. For example, at a coastal weather statio, we might record
More informationStatistics Lecture 27. Final review. Administrative Notes. Outline. Experiments. Sampling and Surveys. Administrative Notes
Admiistrative Notes s - Lecture 7 Fial review Fial Exam is Tuesday, May 0th (3-5pm Covers Chapters -8 ad 0 i textbook Brig ID cards to fial! Allowed: Calculators, double-sided 8.5 x cheat sheet Exam Rooms:
More informationStat 139 Homework 7 Solutions, Fall 2015
Stat 139 Homework 7 Solutios, Fall 2015 Problem 1. I class we leared that the classical simple liear regressio model assumes the followig distributio of resposes: Y i = β 0 + β 1 X i + ɛ i, i = 1,...,,
More informationWorksheet 23 ( ) Introduction to Simple Linear Regression (continued)
Worksheet 3 ( 11.5-11.8) Itroductio to Simple Liear Regressio (cotiued) This worksheet is a cotiuatio of Discussio Sheet 3; please complete that discussio sheet first if you have ot already doe so. This
More informationCEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering
CEE 5 Autum 005 Ucertaity Cocepts for Geotechical Egieerig Basic Termiology Set A set is a collectio of (mutually exclusive) objects or evets. The sample space is the (collectively exhaustive) collectio
More informationUNIVERSITY OF TORONTO Faculty of Arts and Science APRIL/MAY 2009 EXAMINATIONS ECO220Y1Y PART 1 OF 2 SOLUTIONS
PART of UNIVERSITY OF TORONTO Faculty of Arts ad Sciece APRIL/MAY 009 EAMINATIONS ECO0YY PART OF () The sample media is greater tha the sample mea whe there is. (B) () A radom variable is ormally distributed
More informationCorrelation and Regression
Correlatio ad Regressio Lecturer, Departmet of Agroomy Sher-e-Bagla Agricultural Uiversity Correlatio Whe there is a relatioship betwee quatitative measures betwee two sets of pheomea, the appropriate
More informationTMA4245 Statistics. Corrected 30 May and 4 June Norwegian University of Science and Technology Department of Mathematical Sciences.
Norwegia Uiversity of Sciece ad Techology Departmet of Mathematical Scieces Corrected 3 May ad 4 Jue Solutios TMA445 Statistics Saturday 6 May 9: 3: Problem Sow desity a The probability is.9.5 6x x dx
More informationOverview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions
Chapter 9 Slide Ifereces from Two Samples 9- Overview 9- Ifereces about Two Proportios 9- Ifereces about Two Meas: Idepedet Samples 9-4 Ifereces about Matched Pairs 9-5 Comparig Variatio i Two Samples
More information10. Comparative Tests among Spatial Regression Models. Here we revisit the example in Section 8.1 of estimating the mean of a normal random
Part III. Areal Data Aalysis 0. Comparative Tests amog Spatial Regressio Models While the otio of relative likelihood values for differet models is somewhat difficult to iterpret directly (as metioed above),
More informationTABLES AND FORMULAS FOR MOORE Basic Practice of Statistics
TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics Explorig Data: Distributios Look for overall patter (shape, ceter, spread) ad deviatios (outliers). Mea (use a calculator): x = x 1 + x 2 + +
More informationSample Size Estimation in the Proportional Hazards Model for K-sample or Regression Settings Scott S. Emerson, M.D., Ph.D.
ample ie Estimatio i the Proportioal Haards Model for K-sample or Regressio ettigs cott. Emerso, M.D., Ph.D. ample ie Formula for a Normally Distributed tatistic uppose a statistic is kow to be ormally
More informationChapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc.
Chapter 22 Comparig Two Proportios Copyright 2010, 2007, 2004 Pearso Educatio, Ic. Comparig Two Proportios Read the first two paragraphs of pg 504. Comparisos betwee two percetages are much more commo
More informationResponse Variable denoted by y it is the variable that is to be predicted measure of the outcome of an experiment also called the dependent variable
Statistics Chapter 4 Correlatio ad Regressio If we have two (or more) variables we are usually iterested i the relatioship betwee the variables. Associatio betwee Variables Two variables are associated
More informationChapters 5 and 13: REGRESSION AND CORRELATION. Univariate data: x, Bivariate data (x,y).
Chapters 5 ad 13: REGREION AND CORRELATION (ectios 5.5 ad 13.5 are omitted) Uivariate data: x, Bivariate data (x,y). Example: x: umber of years studets studied paish y: score o a proficiecy test For each
More informationHypothesis Testing. Evaluation of Performance of Learned h. Issues. Trade-off Between Bias and Variance
Hypothesis Testig Empirically evaluatig accuracy of hypotheses: importat activity i ML. Three questios: Give observed accuracy over a sample set, how well does this estimate apply over additioal samples?
More informationCorrelation Regression
Correlatio Regressio While correlatio methods measure the stregth of a liear relatioship betwee two variables, we might wish to go a little further: How much does oe variable chage for a give chage i aother
More informationSTA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to:
STA 2023 Module 10 Comparig Two Proportios Learig Objectives Upo completig this module, you should be able to: 1. Perform large-sample ifereces (hypothesis test ad cofidece itervals) to compare two populatio
More informationST 305: Exam 3 ( ) = P(A)P(B A) ( ) = P(A) + P(B) ( ) = 1 P( A) ( ) = P(A) P(B) σ X 2 = σ a+bx. σ ˆp. σ X +Y. σ X Y. σ X. σ Y. σ n.
ST 305: Exam 3 By hadig i this completed exam, I state that I have either give or received assistace from aother perso durig the exam period. I have used o resources other tha the exam itself ad the basic
More informationSection 9.2. Tests About a Population Proportion 12/17/2014. Carrying Out a Significance Test H A N T. Parameters & Hypothesis
Sectio 9.2 Tests About a Populatio Proportio P H A N T O M S Parameters Hypothesis Assess Coditios Name the Test Test Statistic (Calculate) Obtai P value Make a decisio State coclusio Sectio 9.2 Tests
More informationMathematical Notation Math Introduction to Applied Statistics
Mathematical Notatio Math 113 - Itroductio to Applied Statistics Name : Use Word or WordPerfect to recreate the followig documets. Each article is worth 10 poits ad ca be prited ad give to the istructor
More information3/3/2014. CDS M Phil Econometrics. Types of Relationships. Types of Relationships. Types of Relationships. Vijayamohanan Pillai N.
3/3/04 CDS M Phil Old Least Squares (OLS) Vijayamohaa Pillai N CDS M Phil Vijayamoha CDS M Phil Vijayamoha Types of Relatioships Oly oe idepedet variable, Relatioship betwee ad is Liear relatioships Curviliear
More informationSets are collection of objects that can be displayed in different forms. Two of these forms are called Roster Method and Builder Set Notation.
Sectio 2.1 Set ad Set Operators Defiitio of a set set is a collectio of objects thigs or umbers. Sets are collectio of objects that ca be displayed i differet forms. Two of these forms are called Roster
More informationStatistics 511 Additional Materials
Cofidece Itervals o mu Statistics 511 Additioal Materials This topic officially moves us from probability to statistics. We begi to discuss makig ifereces about the populatio. Oe way to differetiate probability
More information[ ] ( ) ( ) [ ] ( ) 1 [ ] [ ] Sums of Random Variables Y = a 1 X 1 + a 2 X 2 + +a n X n The expected value of Y is:
PROBABILITY FUNCTIONS A radom variable X has a probabilit associated with each of its possible values. The probabilit is termed a discrete probabilit if X ca assume ol discrete values, or X = x, x, x 3,,
More informationChapter 13, Part A Analysis of Variance and Experimental Design
Slides Prepared by JOHN S. LOUCKS St. Edward s Uiversity Slide 1 Chapter 13, Part A Aalysis of Variace ad Eperimetal Desig Itroductio to Aalysis of Variace Aalysis of Variace: Testig for the Equality of
More informationA proposed discrete distribution for the statistical modeling of
It. Statistical Ist.: Proc. 58th World Statistical Cogress, 0, Dubli (Sessio CPS047) p.5059 A proposed discrete distributio for the statistical modelig of Likert data Kidd, Marti Cetre for Statistical
More informationHYPOTHESIS TESTS FOR ONE POPULATION MEAN WORKSHEET MTH 1210, FALL 2018
HYPOTHESIS TESTS FOR ONE POPULATION MEAN WORKSHEET MTH 1210, FALL 2018 We are resposible for 2 types of hypothesis tests that produce ifereces about the ukow populatio mea, µ, each of which has 3 possible
More information- E < p. ˆ p q ˆ E = q ˆ = 1 - p ˆ = sample proportion of x failures in a sample size of n. where. x n sample proportion. population proportion
1 Chapter 7 ad 8 Review for Exam Chapter 7 Estimates ad Sample Sizes 2 Defiitio Cofidece Iterval (or Iterval Estimate) a rage (or a iterval) of values used to estimate the true value of the populatio parameter
More informationGoodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen)
Goodess-of-Fit Tests ad Categorical Data Aalysis (Devore Chapter Fourtee) MATH-252-01: Probability ad Statistics II Sprig 2019 Cotets 1 Chi-Squared Tests with Kow Probabilities 1 1.1 Chi-Squared Testig................
More informationFrequentist Inference
Frequetist Iferece The topics of the ext three sectios are useful applicatios of the Cetral Limit Theorem. Without kowig aythig about the uderlyig distributio of a sequece of radom variables {X i }, for
More informationChapter 12 Correlation
Chapter Correlatio Correlatio is very similar to regressio with oe very importat differece. Regressio is used to explore the relatioship betwee a idepedet variable ad a depedet variable, whereas correlatio
More informationExample: Find the SD of the set {x j } = {2, 4, 5, 8, 5, 11, 7}.
1 (*) If a lot of the data is far from the mea, the may of the (x j x) 2 terms will be quite large, so the mea of these terms will be large ad the SD of the data will be large. (*) I particular, outliers
More informationLinear Regression Demystified
Liear Regressio Demystified Liear regressio is a importat subject i statistics. I elemetary statistics courses, formulae related to liear regressio are ofte stated without derivatio. This ote iteds to
More informationResampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.
Jauary 1, 2019 Resamplig Methods Motivatio We have so may estimators with the property θ θ d N 0, σ 2 We ca also write θ a N θ, σ 2 /, where a meas approximately distributed as Oce we have a cosistet estimator
More informationFinal Examination Solutions 17/6/2010
The Islamic Uiversity of Gaza Faculty of Commerce epartmet of Ecoomics ad Political Scieces A Itroductio to Statistics Course (ECOE 30) Sprig Semester 009-00 Fial Eamiatio Solutios 7/6/00 Name: I: Istructor:
More informationLinear Regression Models
Liear Regressio Models Dr. Joh Mellor-Crummey Departmet of Computer Sciece Rice Uiversity johmc@cs.rice.edu COMP 528 Lecture 9 15 February 2005 Goals for Today Uderstad how to Use scatter diagrams to ispect
More informationCircle the single best answer for each multiple choice question. Your choice should be made clearly.
TEST #1 STA 4853 March 6, 2017 Name: Please read the followig directios. DO NOT TURN THE PAGE UNTIL INSTRUCTED TO DO SO Directios This exam is closed book ad closed otes. There are 32 multiple choice questios.
More informationBIOS 4110: Introduction to Biostatistics. Breheny. Lab #9
BIOS 4110: Itroductio to Biostatistics Brehey Lab #9 The Cetral Limit Theorem is very importat i the realm of statistics, ad today's lab will explore the applicatio of it i both categorical ad cotiuous
More informationLecture 1 Probability and Statistics
Wikipedia: Lecture 1 Probability ad Statistics Bejami Disraeli, British statesma ad literary figure (1804 1881): There are three kids of lies: lies, damed lies, ad statistics. popularized i US by Mark
More informationDS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10
DS 00: Priciples ad Techiques of Data Sciece Date: April 3, 208 Name: Hypothesis Testig Discussio #0. Defie these terms below as they relate to hypothesis testig. a) Data Geeratio Model: Solutio: A set
More informationExpectation and Variance of a random variable
Chapter 11 Expectatio ad Variace of a radom variable The aim of this lecture is to defie ad itroduce mathematical Expectatio ad variace of a fuctio of discrete & cotiuous radom variables ad the distributio
More informationContinuous Data that can take on any real number (time/length) based on sample data. Categorical data can only be named or categorised
Questio 1. (Topics 1-3) A populatio cosists of all the members of a group about which you wat to draw a coclusio (Greek letters (μ, σ, Ν) are used) A sample is the portio of the populatio selected for
More informationChapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc.
Chapter 22 Comparig Two Proportios Copyright 2010 Pearso Educatio, Ic. Comparig Two Proportios Comparisos betwee two percetages are much more commo tha questios about isolated percetages. Ad they are more
More informationRegression, Inference, and Model Building
Regressio, Iferece, ad Model Buildig Scatter Plots ad Correlatio Correlatio coefficiet, r -1 r 1 If r is positive, the the scatter plot has a positive slope ad variables are said to have a positive relatioship
More informationFinal Review. Fall 2013 Prof. Yao Xie, H. Milton Stewart School of Industrial Systems & Engineering Georgia Tech
Fial Review Fall 2013 Prof. Yao Xie, yao.xie@isye.gatech.edu H. Milto Stewart School of Idustrial Systems & Egieerig Georgia Tech 1 Radom samplig model radom samples populatio radom samples: x 1,..., x
More informationDr. Maddah ENMG 617 EM Statistics 11/26/12. Multiple Regression (2) (Chapter 15, Hines)
Dr Maddah NMG 617 M Statistics 11/6/1 Multiple egressio () (Chapter 15, Hies) Test for sigificace of regressio This is a test to determie whether there is a liear relatioship betwee the depedet variable
More information10-701/ Machine Learning Mid-term Exam Solution
0-70/5-78 Machie Learig Mid-term Exam Solutio Your Name: Your Adrew ID: True or False (Give oe setece explaatio) (20%). (F) For a cotiuous radom variable x ad its probability distributio fuctio p(x), it
More informationProperties and Hypothesis Testing
Chapter 3 Properties ad Hypothesis Testig 3.1 Types of data The regressio techiques developed i previous chapters ca be applied to three differet kids of data. 1. Cross-sectioal data. 2. Time series data.
More informationLecture 1 Probability and Statistics
Wikipedia: Lecture 1 Probability ad Statistics Bejami Disraeli, British statesma ad literary figure (1804 1881): There are three kids of lies: lies, damed lies, ad statistics. popularized i US by Mark
More informationSuccessful HE applicants. Information sheet A Number of applicants. Gender Applicants Accepts Applicants Accepts. Age. Domicile
Successful HE applicats Sigificace tests use data from samples to test hypotheses. You will use data o successful applicatios for courses i higher educatio to aswer questios about proportios, for example,
More informationENGI 4421 Probability and Statistics Faculty of Engineering and Applied Science Problem Set 1 Solutions Descriptive Statistics. None at all!
ENGI 44 Probability ad Statistics Faculty of Egieerig ad Applied Sciece Problem Set Solutios Descriptive Statistics. If, i the set of values {,, 3, 4, 5, 6, 7 } a error causes the value 5 to be replaced
More informationMEASURES OF DISPERSION (VARIABILITY)
POLI 300 Hadout #7 N. R. Miller MEASURES OF DISPERSION (VARIABILITY) While measures of cetral tedecy idicate what value of a variable is (i oe sese or other, e.g., mode, media, mea), average or cetral
More informationPH 425 Quantum Measurement and Spin Winter SPINS Lab 1
PH 425 Quatum Measuremet ad Spi Witer 23 SPIS Lab Measure the spi projectio S z alog the z-axis This is the experimet that is ready to go whe you start the program, as show below Each atom is measured
More informationLecture 2: Monte Carlo Simulation
STAT/Q SCI 43: Itroductio to Resamplig ethods Sprig 27 Istructor: Ye-Chi Che Lecture 2: ote Carlo Simulatio 2 ote Carlo Itegratio Assume we wat to evaluate the followig itegratio: e x3 dx What ca we do?
More informationMap as Outcomes of Processes. Outline. Map as Outcomes of Processes
Map as Outcomes of Processes Outlie Defiitios: processes & patters A Startig poit: Complete Spatial Radomess More defiitio: Statioary Map as Outcomes of Processes The basic assumptio of Spatial Aalysis:
More information4 Multidimensional quantitative data
Chapter 4 Multidimesioal quatitative data 4 Multidimesioal statistics Basic statistics are ow part of the curriculum of most ecologists However, statistical techiques based o such simple distributios as
More information1 Constructing and Interpreting a Confidence Interval
Itroductory Applied Ecoometrics EEP/IAS 118 Sprig 2014 WARM UP: Match the terms i the table with the correct formula: Adrew Crae-Droesch Sectio #6 5 March 2014 ˆ Let X be a radom variable with mea µ ad
More informationENGI 4421 Confidence Intervals (Two Samples) Page 12-01
ENGI 44 Cofidece Itervals (Two Samples) Page -0 Two Sample Cofidece Iterval for a Differece i Populatio Meas [Navidi sectios 5.4-5.7; Devore chapter 9] From the cetral limit theorem, we kow that, for sufficietly
More informationCommon Large/Small Sample Tests 1/55
Commo Large/Small Sample Tests 1/55 Test of Hypothesis for the Mea (σ Kow) Covert sample result ( x) to a z value Hypothesis Tests for µ Cosider the test H :μ = μ H 1 :μ > μ σ Kow (Assume the populatio
More informationSample Size Determination (Two or More Samples)
Sample Sie Determiatio (Two or More Samples) STATGRAPHICS Rev. 963 Summary... Data Iput... Aalysis Summary... 5 Power Curve... 5 Calculatios... 6 Summary This procedure determies a suitable sample sie
More informationChapter 23: Inferences About Means
Chapter 23: Ifereces About Meas Eough Proportios! We ve spet the last two uits workig with proportios (or qualitative variables, at least) ow it s time to tur our attetios to quatitative variables. For
More informationMedian and IQR The median is the value which divides the ordered data values in half.
STA 666 Fall 2007 Web-based Course Notes 4: Describig Distributios Numerically Numerical summaries for quatitative variables media ad iterquartile rage (IQR) 5-umber summary mea ad stadard deviatio Media
More informationDiscrete Mathematics for CS Spring 2008 David Wagner Note 22
CS 70 Discrete Mathematics for CS Sprig 2008 David Wager Note 22 I.I.D. Radom Variables Estimatig the bias of a coi Questio: We wat to estimate the proportio p of Democrats i the US populatio, by takig
More informationt distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference
EXST30 Backgroud material Page From the textbook The Statistical Sleuth Mea [0]: I your text the word mea deotes a populatio mea (µ) while the work average deotes a sample average ( ). Variace [0]: The
More informationLecture 5: Parametric Hypothesis Testing: Comparing Means. GENOME 560, Spring 2016 Doug Fowler, GS
Lecture 5: Parametric Hypothesis Testig: Comparig Meas GENOME 560, Sprig 2016 Doug Fowler, GS (dfowler@uw.edu) 1 Review from last week What is a cofidece iterval? 2 Review from last week What is a cofidece
More informationNCSS Statistical Software. Tolerance Intervals
Chapter 585 Itroductio This procedure calculates oe-, ad two-, sided tolerace itervals based o either a distributio-free (oparametric) method or a method based o a ormality assumptio (parametric). A two-sided
More informationDotting The Dot Map, Revisited. A. Jon Kimerling Dept. of Geosciences Oregon State University
Dottig The Dot Map, Revisited A. Jo Kimerlig Dept. of Geoscieces Orego State Uiversity Dot maps show the geographic distributio of features i a area by placig dots represetig a certai quatity of features
More informationLecture 7: Properties of Random Samples
Lecture 7: Properties of Radom Samples 1 Cotiued From Last Class Theorem 1.1. Let X 1, X,...X be a radom sample from a populatio with mea µ ad variace σ
More informationID = x d. LAB EXERCISE #1 Scaling Techniques. Objectives. Background Information. Instructor: K. McGarigal
LAB EXERCISE #1 Scalig Techiques Istructor: K. McGarigal Overview: I this exercise, you will gai familiarity with a few commo procedures for elucidatig the scale of patter i poit, cotiuous ad categorical
More informationStat 200 -Testing Summary Page 1
Stat 00 -Testig Summary Page 1 Mathematicias are like Frechme; whatever you say to them, they traslate it ito their ow laguage ad forthwith it is somethig etirely differet Goethe 1 Large Sample Cofidece
More informationIf, for instance, we were required to test whether the population mean μ could be equal to a certain value μ
STATISTICAL INFERENCE INTRODUCTION Statistical iferece is that brach of Statistics i which oe typically makes a statemet about a populatio based upo the results of a sample. I oesample testig, we essetially
More informationStatistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.
Statistical Iferece (Chapter 10) Statistical iferece = lear about a populatio based o the iformatio provided by a sample. Populatio: The set of all values of a radom variable X of iterest. Characterized
More informationMath 140 Introductory Statistics
8.2 Testig a Proportio Math 1 Itroductory Statistics Professor B. Abrego Lecture 15 Sectios 8.2 People ofte make decisios with data by comparig the results from a sample to some predetermied stadard. These
More informationMASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 19 11/17/2008 LAWS OF LARGE NUMBERS II THE STRONG LAW OF LARGE NUMBERS
MASSACHUSTTS INSTITUT OF TCHNOLOGY 6.436J/5.085J Fall 2008 Lecture 9 /7/2008 LAWS OF LARG NUMBRS II Cotets. The strog law of large umbers 2. The Cheroff boud TH STRONG LAW OF LARG NUMBRS While the weak
More informationStatistics Revision Solutions
Statistics Revisio Solutios (i) H ~N (00, ) ad W ~N (7, 9 ) P ( 7. 0) 0. 978 P (iii) H + W ~N (7, ) P ( H + W > A) > 0.9 P( H + W < A) < 0.0 A< ivnorm(0.0,
More informationStatisticians use the word population to refer the total number of (potential) observations under consideration
6 Samplig Distributios Statisticias use the word populatio to refer the total umber of (potetial) observatios uder cosideratio The populatio is just the set of all possible outcomes i our sample space
More informationNANYANG TECHNOLOGICAL UNIVERSITY SYLLABUS FOR ENTRANCE EXAMINATION FOR INTERNATIONAL STUDENTS AO-LEVEL MATHEMATICS
NANYANG TECHNOLOGICAL UNIVERSITY SYLLABUS FOR ENTRANCE EXAMINATION FOR INTERNATIONAL STUDENTS AO-LEVEL MATHEMATICS STRUCTURE OF EXAMINATION PAPER. There will be oe 2-hour paper cosistig of 4 questios.
More informationMidtermII Review. Sta Fall Office Hours Wednesday 12:30-2:30pm Watch linear regression videos before lab on Thursday
Aoucemets MidtermII Review Sta 101 - Fall 2016 Duke Uiversity, Departmet of Statistical Sciece Office Hours Wedesday 12:30-2:30pm Watch liear regressio videos before lab o Thursday Dr. Abrahamse Slides
More informationMA238 Assignment 4 Solutions (part a)
(i) Sigle sample tests. Questio. MA38 Assigmet 4 Solutios (part a) (a) (b) (c) H 0 : = 50 sq. ft H A : < 50 sq. ft H 0 : = 3 mpg H A : > 3 mpg H 0 : = 5 mm H A : 5mm Questio. (i) What are the ull ad alterative
More informationA quick activity - Central Limit Theorem and Proportions. Lecture 21: Testing Proportions. Results from the GSS. Statistics and the General Population
A quick activity - Cetral Limit Theorem ad Proportios Lecture 21: Testig Proportios Statistics 10 Coli Rudel Flip a coi 30 times this is goig to get loud! Record the umber of heads you obtaied ad calculate
More informationTopic 9: Sampling Distributions of Estimators
Topic 9: Samplig Distributios of Estimators Course 003, 2016 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be
More informationUnderstanding Samples
1 Will Moroe CS 109 Samplig ad Bootstrappig Lecture Notes #17 August 2, 2017 Based o a hadout by Chris Piech I this chapter we are goig to talk about statistics calculated o samples from a populatio. We
More informationComparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading
Topic 15 - Two Sample Iferece I STAT 511 Professor Bruce Craig Comparig Two Populatios Research ofte ivolves the compariso of two or more samples from differet populatios Graphical summaries provide visual
More informationRandom Variables, Sampling and Estimation
Chapter 1 Radom Variables, Samplig ad Estimatio 1.1 Itroductio This chapter will cover the most importat basic statistical theory you eed i order to uderstad the ecoometric material that will be comig
More informationGUIDE FOR THE USE OF THE DECISION SUPPORT SYSTEM (DSS)*
GUIDE FOR THE USE OF THE DECISION SUPPORT SYSTEM (DSS)* *Note: I Frech SAD (Système d Aide à la Décisio) 1. Itroductio to the DSS Eightee statistical distributios are available i HYFRAN-PLUS software to
More informationGG313 GEOLOGICAL DATA ANALYSIS
GG313 GEOLOGICAL DATA ANALYSIS 1 Testig Hypothesis GG313 GEOLOGICAL DATA ANALYSIS LECTURE NOTES PAUL WESSEL SECTION TESTING OF HYPOTHESES Much of statistics is cocered with testig hypothesis agaist data
More informationFACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures
FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING Lectures MODULE 5 STATISTICS II. Mea ad stadard error of sample data. Biomial distributio. Normal distributio 4. Samplig 5. Cofidece itervals
More informationEconomics 241B Relation to Method of Moments and Maximum Likelihood OLSE as a Maximum Likelihood Estimator
Ecoomics 24B Relatio to Method of Momets ad Maximum Likelihood OLSE as a Maximum Likelihood Estimator Uder Assumptio 5 we have speci ed the distributio of the error, so we ca estimate the model parameters
More informationCategorical Data Analysis
Categorical Data Aalysis Refereces : Ala Agresti, Categorical Data Aalysis, Wiley Itersciece, New Jersey, 2002 Bhattacharya, G.K., Johso, R.A., Statistical Cocepts ad Methods, Wiley,1977 Outlie Categorical
More informationTests of Hypotheses Based on a Single Sample (Devore Chapter Eight)
Tests of Hypotheses Based o a Sigle Sample Devore Chapter Eight MATH-252-01: Probability ad Statistics II Sprig 2018 Cotets 1 Hypothesis Tests illustrated with z-tests 1 1.1 Overview of Hypothesis Testig..........
More information