Spatial Analysis and Modeling (GIST 4302/5302) Guofeng Cao Department of Geosciences Texas Tech University
1 Spatial Analysis and Modeling (GIST 4302/5302), Guofeng Cao, Department of Geosciences, Texas Tech University
2 Outline of This Week
- Last week, we learned:
  - spatial point pattern analysis (PPA)
  - focus on location distribution of events
- This week, we will learn:
  - spatial autocorrelation
  - global measures of spatial autocorrelation
  - local measures of spatial autocorrelation
3 Spatial Autocorrelation
- Tobler's first law of geography
- Spatial auto/cross correlation
- If like values tend to cluster together, then the field exhibits high positive spatial autocorrelation
- If there is no apparent relationship between attribute value and location, then there is zero spatial autocorrelation
- If like values tend to be located away from each other, then there is negative spatial autocorrelation
4 Spatial Autocorrelation
- Spatial autocorrelation is everywhere
- Spatial point pattern
  - K, F, G functions
  - Kernel functions
- Areal/lattice (this topic)
- Geostatistical data (next topic)
5 Spatial Autocorrelation of Areal Data
6 Positive spatial autocorrelation
- high values surrounded by nearby high values
- intermediate values surrounded by nearby intermediate values
- low values surrounded by nearby low values
Figure: 2002 population density. Source: Ron Briggs of UT Dallas
7 Negative spatial autocorrelation
- high values surrounded by nearby low values
- intermediate values surrounded by nearby intermediate values
- low values surrounded by nearby high values
- competition for space
Figure: Grocery store density. Source: Ron Briggs of UT Dallas
8 Spatial Weight Matrix
- Core concept in statistical analysis of areal data
- Two steps involved:
  - define which relationships between observations are to be given a nonzero weight, i.e., define spatial neighbors
  - assign weights to the neighbors
- Defining the neighbors and weights is not as easy as it seems
  - Which states are near Texas?
9 Spatial Neighbors
- Contiguity-based neighbors
  - Zones i and j are neighbors if zone i is contiguous or adjacent to zone j
  - But what constitutes contiguity?
- Distance-based neighbors
  - Zones i and j are neighbors if the distance between them is less than the threshold distance
  - But what distance do we use?
10 Contiguity-based Spatial Neighbors
- Sharing a border or boundary
  - Rook: sharing a border
  - Queen: sharing a border or a point
- Which to use?
Figure panels: rook, queen, hexagons, irregular
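The rook and queen rules can be sketched on a regular grid. This is a minimal illustration, not the slide's own code: the function name `grid_neighbors` and the row-by-row cell numbering are assumptions made for the example.

```python
def grid_neighbors(n_rows, n_cols, rule="rook"):
    """Return {cell_id: sorted neighbor ids} for a regular grid.

    Cells are numbered row by row starting at 0 (an assumed convention).
    """
    if rule == "rook":
        # Rook: cells that share an edge.
        offsets = [(-1, 0), (1, 0), (0, -1), (0, 1)]
    elif rule == "queen":
        # Queen: cells that share an edge or a corner point.
        offsets = [(dr, dc) for dr in (-1, 0, 1) for dc in (-1, 0, 1)
                   if (dr, dc) != (0, 0)]
    else:
        raise ValueError("rule must be 'rook' or 'queen'")

    neighbors = {}
    for r in range(n_rows):
        for c in range(n_cols):
            ids = []
            for dr, dc in offsets:
                rr, cc = r + dr, c + dc
                if 0 <= rr < n_rows and 0 <= cc < n_cols:
                    ids.append(rr * n_cols + cc)
            neighbors[r * n_cols + c] = sorted(ids)
    return neighbors


rook = grid_neighbors(3, 3, "rook")
queen = grid_neighbors(3, 3, "queen")
print(rook[4])   # centre cell of the 3x3 grid under the rook rule
print(queen[4])  # centre cell of the 3x3 grid under the queen rule
```

On a regular grid the queen rule always yields at least as many neighbors as the rook rule; for irregular polygons the difference depends on how often zones meet at a single point.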
11 Example
Source: Bivand, Pebesma, and Gomez-Rubio
12 Higher-Order Contiguity
- 1st order: nearest neighbor
- 2nd order: next nearest neighbor
Figure panels: rook, hexagon, queen
13 Distance-based Neighbors
- How to measure distance between polygons?
- Distance metrics
  - 2D Cartesian distance (projected data)
  - 3D spherical distance/great-circle distance (lat/long data)
    - Haversine formula
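For lat/long data, the great-circle distance can be computed with the haversine formula. The sketch below assumes a spherical Earth with a mean radius of 6371 km; the function name and the radius default are choices made for this example.

```python
import math


def haversine_km(lat1, lon1, lat2, lon2, radius_km=6371.0):
    """Great-circle distance between two (lat, lon) points given in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    d_phi = phi2 - phi1
    d_lambda = math.radians(lon2 - lon1)
    a = (math.sin(d_phi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(d_lambda / 2) ** 2)
    # min() guards against rounding pushing sqrt(a) slightly above 1.
    return 2 * radius_km * math.asin(min(1.0, math.sqrt(a)))


# Half-way around the equator: half the circumference of the assumed sphere.
half_equator = haversine_km(0.0, 0.0, 0.0, 180.0)
print(round(half_equator, 1))
```

For projected data the ordinary Euclidean distance (for example `math.dist`) is the usual choice instead.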
14 Distance-based Neighbors
- k-nearest neighbors
Source: Bivand, Pebesma, and Gomez-Rubio
15 Distance-based Neighbors
- threshold distance (buffer)
Source: Bivand, Pebesma, and Gomez-Rubio
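The two distance-based rules, k-nearest neighbors and a threshold distance, can be sketched as follows. The centroid coordinates are made up for illustration, and measuring polygon distance between centroids is an assumption (one common answer to the earlier question of how to measure distance between polygons).

```python
import math


def knn_neighbors(points, k):
    """For each point, the indices of its k nearest other points."""
    result = {}
    for i, p in enumerate(points):
        dists = sorted((math.dist(p, q), j)
                       for j, q in enumerate(points) if j != i)
        result[i] = [j for _, j in dists[:k]]
    return result


def threshold_neighbors(points, max_dist):
    """For each point, the indices of other points closer than max_dist."""
    return {i: [j for j, q in enumerate(points)
                if j != i and math.dist(p, q) < max_dist]
            for i, p in enumerate(points)}


# Hypothetical polygon centroids lying along a line.
centroids = [(0, 0), (1, 0), (3, 0), (7, 0)]
knn = knn_neighbors(centroids, k=1)
band = threshold_neighbors(centroids, max_dist=2.5)
print(knn)
print(band)
```

Two properties are worth noticing in the output: k-nearest neighbors need not be symmetric (unit 2's nearest neighbor is unit 1, but unit 1's nearest neighbor is unit 0), and a threshold can leave isolated units with no neighbors at all (unit 3 here).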
16 Neighbor/Connectivity Histogram
Source: Bivand, Pebesma, and Gomez-Rubio
17 Side Note: Box-plot
- Helps indicate the degree of dispersion and skewness and identify outliers
- Non-parametric
- 25%, 50%, 75% percentiles
- The end of the hinge can mean different things depending on the implementation
- Points outside the range are usually taken as outliers
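The dependence on implementation can be seen with Python's standard library, which offers two quartile conventions. The data values are made up, and the 1.5 × IQR fence used to flag outliers is Tukey's common convention, assumed here because the slide does not state which rule it uses.

```python
import statistics

data = [1, 2, 3, 4, 5, 6, 7, 8, 9, 100]


def box_summary(values, method):
    """Quartiles and flagged outliers under one quartile convention."""
    q1, median, q3 = statistics.quantiles(values, n=4, method=method)
    iqr = q3 - q1
    # Assumed rule: points beyond 1.5 * IQR from the quartiles are outliers.
    low_fence, high_fence = q1 - 1.5 * iqr, q3 + 1.5 * iqr
    outliers = [v for v in values if v < low_fence or v > high_fence]
    return q1, median, q3, outliers


inclusive = box_summary(data, "inclusive")
exclusive = box_summary(data, "exclusive")
print(inclusive)
print(exclusive)
```

The two conventions agree on the median and on the outlier here, but place the quartiles (and therefore the box and fences) at slightly different values.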
18 Spatial Weight Matrix
- Spatial weights can be seen as a list of weights indexed by a list of neighbors
- If zone j is not a neighbor of zone i, the weight $w_{ij}$ is set to zero
- The weight matrix can be illustrated as an image
- Sparse matrix
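19 A Simple Example for Rook Case
- Matrix contains a:
  - 1 if share a border
  - 0 if do not share a border
- 4 areal units: 4x4 matrix
Figure: areal units A, B, C, D with their common borders, and the 4x4 matrix W with rows and columns labelled A to D (matrix entries not preserved)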
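A binary rook matrix for four units can be built from a neighbor list as sketched below. The 2x2 arrangement of A, B, C, D is an assumption, since the figure's layout and the matrix entries did not survive the transcription.

```python
units = ["A", "B", "C", "D"]

# Assumed layout (hypothetical, not recoverable from the slide):
#   A B
#   C D
rook_neighbors = {
    "A": ["B", "C"],
    "B": ["A", "D"],
    "C": ["A", "D"],
    "D": ["B", "C"],
}

index = {u: k for k, u in enumerate(units)}
W = [[0] * len(units) for _ in units]
for u, nbrs in rook_neighbors.items():
    for v in nbrs:
        W[index[u]][index[v]] = 1  # 1 if the two units share a border

for u, row in zip(units, W):
    print(u, row)
```

Under this layout A and D (and B and C) touch only at a corner point, so they are not rook neighbors; a queen matrix would set those entries to 1 as well.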
21 Sparse Contiguity Matrix for US States (obtained from Anselin's web site; see powerpoint for link)
Table columns: Name, Fips, Ncount, N1, N2, N3, N4, N5, N6, N7, N8 (cell values not preserved)
Table rows: Alabama, Arizona, Arkansas, California, Colorado, Connecticut, Delaware, District of Columbia, Florida, Georgia, Idaho, Illinois, Indiana, Iowa, Kansas, Kentucky, Louisiana, Maine, Maryland, Massachusetts, Michigan, Minnesota, Mississippi, Missouri, Montana, Nebraska, Nevada, New Hampshire, New Jersey, New Mexico, New York, North Carolina, North Dakota, Ohio, Oklahoma, Oregon, Pennsylvania, Rhode Island, South Carolina, South Dakota, Tennessee, Texas, Utah, Vermont, Virginia, Washington, West Virginia, Wisconsin, Wyoming
22 Style of Spatial Weight Matrix
- Raw (unstandardized)
  - a weight of unity for each neighbor relationship
- Row standardization
  - Symmetry not guaranteed
  - can be interpreted as allowing the calculation of average values across neighbors
- General spatial weights based on distances
23 Raw vs. Row Standardization
- Raw matrix (units A to F, with a Row Sum column): total number of neighbors; some units have more than others
- Divide each number by the row sum
- Row-standardized matrix (units A to F, with a Row Sum column): usually use this
(matrix entries not preserved)
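Row standardization divides each entry by its row sum, so every row with at least one neighbor sums to 1. The sketch below uses a made-up three-unit chain rather than the slide's six-unit example, whose entries were lost; leaving rows without neighbors as zeros is a choice made for this example.

```python
def row_standardize(W):
    """Divide each row by its sum; rows with no neighbors stay all zero."""
    out = []
    for row in W:
        total = sum(row)
        out.append([w / total if total else 0.0 for w in row])
    return out


# Hypothetical chain of three units: 0 - 1 - 2.
W_raw = [[0, 1, 0],
         [1, 0, 1],
         [0, 1, 0]]
W_std = row_standardize(W_raw)
for row in W_std:
    print(row)
```

The raw matrix is symmetric, but the standardized one is not: unit 1 receives weight 1 from unit 0's point of view, while unit 0 receives only 0.5 from unit 1's. Multiplying the standardized matrix by a vector of values gives each unit's average over its neighbors.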
24 General Spatial Weights Based on Distance
- Decay functions of distance
  - Most common choice is the inverse (reciprocal) of the distance between locations i and j ($w_{ij} = 1/d_{ij}$)
  - Other functions also used:
    - inverse of squared distance ($w_{ij} = 1/d_{ij}^2$), or
    - negative exponential ($w_{ij} = e^{-d_{ij}}$ or $w_{ij} = e^{-d_{ij}^2}$)
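The four decay functions listed above can be sketched in one helper. The function name, the `kind` labels, and the sample coordinates are assumptions for the example; distances are plain Euclidean distances between points.

```python
import math


def distance_weights(points, kind="inverse"):
    """Dense weight matrix from pairwise distances; diagonal stays zero.

    Assumes no two points coincide (the inverse forms would divide by zero).
    """
    n = len(points)
    W = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i == j:
                continue  # a unit is not its own neighbor
            d = math.dist(points[i], points[j])
            if kind == "inverse":
                W[i][j] = 1.0 / d
            elif kind == "inverse_squared":
                W[i][j] = 1.0 / d ** 2
            elif kind == "neg_exp":
                W[i][j] = math.exp(-d)
            elif kind == "neg_exp_squared":
                W[i][j] = math.exp(-d ** 2)
            else:
                raise ValueError(f"unknown kind: {kind}")
    return W


points = [(0, 0), (2, 0), (0, 4)]  # hypothetical locations
W_inv = distance_weights(points, "inverse")
W_inv2 = distance_weights(points, "inverse_squared")
W_exp = distance_weights(points, "neg_exp")
print(W_inv[0])
```

Unlike contiguity weights, these matrices are generally dense; in practice a distance cutoff is often combined with the decay function to keep them sparse.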
25 Example
Compare three different weight matrices as images
26 Measures of Spatial Autocorrelation
27 Global Measures and Local Measures
- Global measures
  - A single value which applies to the entire data set
  - The same pattern or process occurs over the entire geographic area
  - An average for the entire area
- Local measures
  - A value calculated for each observation unit
  - Different patterns or processes may occur in different parts of the region
  - A unique number for each location
- Global measures usually can be decomposed into a combination of local measures
28 Global Measures and Local Measures
- Global measures
  - Join Count
  - Moran's I, Getis-Ord's G
- Local measures
  - Local Moran's I, Getis-Ord's G
29 Join (or Joint or Joins) Count Statistic
- 60 for Rook Case
- 110 for Queen Case
30 Join Count: Test Statistic
- Test statistic given by: $Z = \dfrac{\text{Observed} - \text{Expected}}{\text{SD of Expected}}$
- Expected = random pattern generated by tossing a coin in each cell
- Expected given by: (formula not preserved)
- Standard deviation of Expected (standard error) given by: (formula not preserved)
- Where:
  - k is the total number of joins (neighbors)
  - $p_B$ is the expected proportion Black, if random
  - $p_W$ is the expected proportion White
  - m is calculated from k according to: (formula not preserved)
31 Gore/Bush Presidential Election 2000
Actual join counts: Jbb = 60, Jgg = 21, Jbg = 28, Total = 109
32 Join Count Statistic for Gore/Bush 2000 by State
Table columns: candidates and probability (Bush, Gore); Actual, Expected, Stan Dev, and Z-score for Jbb, Jgg, Jbg, and Total (cell values not preserved)
- The expected number of joins is calculated based on the proportion of votes each received in the election (for Bush: 109 × 0.499 × 0.499 ≈ 27.1)
- There are far more Bush/Bush joins (actual = 60) than would be expected (27): positive autocorrelation
- There are far fewer Bush/Gore joins (actual = 28) than would be expected (54): positive autocorrelation
- No strong clustering evidence for Gore (actual = 21, slightly less than expected)
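Counting joins and comparing them with their expected values can be sketched on a small grid. The 6x6 grid with a black left half is made up for illustration. The expected counts use $k p_B^2$, $k p_W^2$, and $2 k p_B p_W$, which matches the arithmetic quoted above for the Bush/Bush joins but is otherwise an assumption, because the slide's own formulas were lost; the standard deviation and Z-score are not computed here for the same reason.

```python
def rook_neighbors(n_rows, n_cols):
    """Rook neighbors on a regular grid, cells numbered row by row."""
    nbrs = {}
    for r in range(n_rows):
        for c in range(n_cols):
            candidates = ((r - 1, c), (r + 1, c), (r, c - 1), (r, c + 1))
            nbrs[r * n_cols + c] = [rr * n_cols + cc for rr, cc in candidates
                                    if 0 <= rr < n_rows and 0 <= cc < n_cols]
    return nbrs


def count_joins(labels, nbrs):
    """Count BB, WW and BW joins, visiting each join once."""
    counts = {"BB": 0, "WW": 0, "BW": 0}
    for i, js in nbrs.items():
        for j in js:
            if i < j:  # each join appears twice in the neighbor lists
                pair = labels[i] + labels[j]
                counts["BW" if pair in ("BW", "WB") else pair] += 1
    return counts


def expected_joins(k, p_b):
    """Expected join counts if colors were assigned independently at random."""
    p_w = 1.0 - p_b
    return {"BB": k * p_b ** 2, "WW": k * p_w ** 2, "BW": 2 * k * p_b * p_w}


nbrs = rook_neighbors(6, 6)
# Hypothetical pattern: left three columns black, right three columns white.
labels = ["B" if c < 3 else "W" for r in range(6) for c in range(6)]
observed = count_joins(labels, nbrs)
k = sum(observed.values())
expected = expected_joins(k, p_b=0.5)
print(k, observed, expected)
```

The clustered pattern has far fewer black/white joins than expected under randomness, which is the signature of positive spatial autocorrelation described in the bullets above.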
33 Moran's I
- The most common measure of spatial autocorrelation
- Use for points or polygons
  - Join Count statistic only for polygons
- Use for a continuous variable (any value)
  - Join Count statistic only for binary variable (1,0)
- Patrick Alfred Pierce Moran
34 Formula for Moran's I
$$I = \frac{N \sum_{i=1}^{N} \sum_{j=1}^{N} w_{ij} (x_i - \bar{x})(x_j - \bar{x})}{\left( \sum_{i=1}^{N} \sum_{j=1}^{N} w_{ij} \right) \sum_{i=1}^{N} (x_i - \bar{x})^2}$$
Where:
- $N$ is the number of observations (points or polygons)
- $\bar{x}$ is the mean of the variable
- $x_i$ is the variable value at a particular location
- $x_j$ is the variable value at another location
- $w_{ij}$ is a weight indexing location of i relative to j
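The formula translates directly into a few lines of code. The four-unit chain and the two sets of values below are made up to show one positively and one negatively autocorrelated pattern.

```python
def morans_i(values, W):
    """Global Moran's I for a list of values and a dense weight matrix W."""
    n = len(values)
    mean = sum(values) / n
    dev = [v - mean for v in values]
    s0 = sum(sum(row) for row in W)  # sum of all weights
    cross = sum(W[i][j] * dev[i] * dev[j]
                for i in range(n) for j in range(n))
    return (n / s0) * cross / sum(d * d for d in dev)


# Hypothetical chain of four units with binary contiguity weights.
W = [[0, 1, 0, 0],
     [1, 0, 1, 0],
     [0, 1, 0, 1],
     [0, 0, 1, 0]]

i_trend = morans_i([1, 2, 3, 4], W)        # similar values are adjacent
i_alternating = morans_i([1, 4, 1, 4], W)  # unlike values are adjacent
print(i_trend, i_alternating)
```

The smooth trend gives a positive value and the alternating pattern a negative one, in line with the definitions of positive and negative spatial autocorrelation earlier in the deck.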
35 Moran's I and Correlation Coefficient
- Correlation coefficient [-1, 1]
  - Relationship between two different variables
- Moran's I (usually within [-1, 1])
  - Spatial autocorrelation, often involving one (spatially indexed) variable only
  - Correlation between observations of a spatial variable at location X and the spatial lag of X, formed by averaging all the observations at neighbors of X
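36 Correlation coefficient:
$$r = \frac{\sum_{i=1}^{n} (y_i - \bar{y})(x_i - \bar{x}) / n}{\sqrt{\sum_{i=1}^{n} (y_i - \bar{y})^2 / n} \, \sqrt{\sum_{i=1}^{n} (x_i - \bar{x})^2 / n}}$$
Spatial autocorrelation:
$$I = \frac{N \sum_{i=1}^{N} \sum_{j=1}^{N} w_{ij} (x_i - \bar{x})(x_j - \bar{x})}{\left( \sum_{i=1}^{N} \sum_{j=1}^{N} w_{ij} \right) \sum_{i=1}^{N} (x_i - \bar{x})^2} = \frac{\sum_{i=1}^{N} \sum_{j=1}^{N} w_{ij} (x_i - \bar{x})(x_j - \bar{x}) \Big/ \sum_{i=1}^{N} \sum_{j=1}^{N} w_{ij}}{\sum_{i=1}^{N} (x_i - \bar{x})^2 / N}$$
- Note the similarity of the numerator (top) to the measures of spatial association discussed earlier if we view $y_i$ as being the $x_i$ for the neighboring polygon (see next slide)
Source: Ron Briggs of UT Dallas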
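37 Correlation coefficient:
$$r = \frac{\sum_{i=1}^{n} (y_i - \bar{y})(x_i - \bar{x}) / n}{\sqrt{\sum_{i=1}^{n} (y_i - \bar{y})^2 / n} \, \sqrt{\sum_{i=1}^{n} (x_i - \bar{x})^2 / n}}$$
Moran's I:
$$I = \frac{N \sum_{i=1}^{N} \sum_{j=1}^{N} w_{ij} (x_i - \bar{x})(x_j - \bar{x})}{\left( \sum_{i=1}^{N} \sum_{j=1}^{N} w_{ij} \right) \sum_{i=1}^{N} (x_i - \bar{x})^2} = \frac{\sum_{i=1}^{N} \sum_{j=1}^{N} w_{ij} (x_i - \bar{x})(x_j - \bar{x}) \Big/ \sum_{i=1}^{N} \sum_{j=1}^{N} w_{ij}}{\sum_{i=1}^{N} (x_i - \bar{x})^2 / N}$$
- $y_i$ is the $x_i$ for the neighboring polygon
- $w_{ij}$ are the spatial weights
Source: Ron Briggs of UT Dallas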
38 Statistical Significance Tests for Moran's I
- Based on the normal frequency distribution with $Z = \dfrac{I - E(I)}{S_{error(I)}}$
- Where:
  - $I$ is the calculated value for Moran's I from the sample
  - $E(I)$ is the expected value if random
  - $S$ is the standard error
- Statistical significance test
  - Monte Carlo test, as we did for spatial pattern analysis
  - Permutation test
    - Non-parametric
    - Data-driven, no assumption of the data
    - Implemented in GeoDa
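A permutation test can be sketched by shuffling the observed values over the locations and recomputing Moran's I each time. This is a minimal illustration and not GeoDa's implementation: the one-sided (upper-tail) pseudo p-value, the 999 permutations, the fixed seed, and the twelve-unit chain are all assumptions made for the example.

```python
import random


def morans_i(values, W):
    """Global Moran's I for a list of values and a dense weight matrix W."""
    n = len(values)
    mean = sum(values) / n
    dev = [v - mean for v in values]
    s0 = sum(sum(row) for row in W)
    cross = sum(W[i][j] * dev[i] * dev[j]
                for i in range(n) for j in range(n))
    return (n / s0) * cross / sum(d * d for d in dev)


def permutation_p_value(values, W, n_perm=999, seed=42):
    """Observed I and a one-sided pseudo p-value from random relabelling."""
    rng = random.Random(seed)
    observed = morans_i(values, W)
    shuffled = list(values)
    as_extreme = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)  # same values, random locations
        if morans_i(shuffled, W) >= observed:
            as_extreme += 1
    # The observed arrangement counts as one of the permutations.
    return observed, (as_extreme + 1) / (n_perm + 1)


n = 12
# Hypothetical chain of twelve units with binary contiguity weights.
W = [[1 if abs(i - j) == 1 else 0 for j in range(n)] for i in range(n)]
values = list(range(1, n + 1))  # a smooth trend along the chain
observed_i, p_value = permutation_p_value(values, W)
print(observed_i, p_value)
```

Because the reference distribution is built from the data themselves, no normality assumption is needed; the smallest attainable pseudo p-value is 1 / (n_perm + 1).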
39 Moran Scatter Plots
- We can draw a scatter diagram between these two variables (in standardized form): X and lag-X (or W_X)
- The slope of this regression line is Moran's I
40 Moran Scatter Plots
Quadrants (SA = spatial autocorrelation):
- Low/High: negative SA
- High/High: positive SA
- Low/Low: positive SA
- High/Low: negative SA
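The scatter plot coordinates, the regression slope, and the quadrant labels can be computed without plotting. The sketch assumes row-standardized weights (so that the slope equals Moran's I) and a made-up four-unit chain; treating a value of exactly zero as "High" is an arbitrary choice for the example.

```python
import math


def moran_scatter(values, W):
    """Standardized values, their spatial lags, the OLS slope and quadrants."""
    n = len(values)
    mean = sum(values) / n
    sd = math.sqrt(sum((v - mean) ** 2 for v in values) / n)
    z = [(v - mean) / sd for v in values]
    lag = [sum(W[i][j] * z[j] for j in range(n)) for i in range(n)]
    # z has mean zero, so the OLS slope of lag on z reduces to this ratio.
    slope = sum(a * b for a, b in zip(z, lag)) / sum(a * a for a in z)
    quadrants = [("High" if a >= 0 else "Low") + "/" + ("High" if b >= 0 else "Low")
                 for a, b in zip(z, lag)]
    return z, lag, slope, quadrants


# Hypothetical chain of four units with row-standardized weights.
W = [[0.0, 1.0, 0.0, 0.0],
     [0.5, 0.0, 0.5, 0.0],
     [0.0, 0.5, 0.0, 0.5],
     [0.0, 0.0, 1.0, 0.0]]
z, lag, slope, quadrants = moran_scatter([1, 2, 3, 4], W)
print(slope, quadrants)
```

Each label reads "own value/neighbors' value", so Low/Low and High/High points contribute to positive spatial autocorrelation and the mixed quadrants to negative spatial autocorrelation.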
41 Moran Scatterplot: Example
42 Moran's I for rate-based data
- Moran's I is often calculated for rates, such as crime rates (e.g., number of crimes per 1,000 population) or infant mortality rates (e.g., number of deaths per 1,000 births)
- An adjustment should be made, especially if the denominator in the rate (population or number of births) varies greatly (as it usually does)
- Adjustment is known as the EB adjustment:
  - see Assuncao-Reis, Empirical Bayes Standardization, Statistics in Medicine, 1999
- GeoDa software includes an option for this adjustment
43 Hot Spots and Cold Spots
- What is a hot spot?
  - A place where high values cluster together
- What is a cold spot?
  - A place where low values cluster together
- Moran's I and Geary's C cannot distinguish them
  - They only indicate clustering
  - Cannot tell if these are hot spots, cold spots, or both
44 Getis-Ord General/Global G-Statistic
- The G statistic distinguishes between hot spots and cold spots. It identifies spatial concentrations.
  - G is relatively large if high values cluster together
  - G is relatively low if low values cluster together
- The General G statistic is interpreted relative to its expected value
  - The expected value is the value for which there is no spatial association
  - G > (larger than) expected value: potential hot spots
  - G < (smaller than) expected value: potential cold spots
- Comments:
  - General G will not show negative spatial autocorrelation
  - Should only be calculated for ratio scale data, i.e., data with a natural zero such as crime rates, birth rates
  - Although it was defined using a contiguity (0,1) weights matrix, any type of spatial weights matrix can be used
    - ArcGIS gives multiple options
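The slide does not give the formula for the General G, so the sketch below assumes the commonly cited form $G = \sum_i \sum_{j \ne i} w_{ij} x_i x_j / \sum_i \sum_{j \ne i} x_i x_j$ with expected value $\sum_i \sum_{j \ne i} w_{ij} / (n(n-1))$. The six-unit chain and its values are made up for illustration.

```python
def general_g(values, W):
    """General G (assumed form): weighted share of all cross-products x_i * x_j."""
    n = len(values)
    pairs = [(i, j) for i in range(n) for j in range(n) if i != j]
    num = sum(W[i][j] * values[i] * values[j] for i, j in pairs)
    den = sum(values[i] * values[j] for i, j in pairs)
    return num / den


def expected_g(W):
    """Expected General G under no spatial association (assumed form)."""
    n = len(W)
    s0 = sum(W[i][j] for i in range(n) for j in range(n) if i != j)
    return s0 / (n * (n - 1))


n = 6
# Hypothetical chain of six units with binary contiguity weights.
W = [[1 if abs(i - j) == 1 else 0 for j in range(n)] for i in range(n)]
values = [10, 10, 10, 1, 1, 1]  # the high values sit next to each other

g = general_g(values, W)
e_g = expected_g(W)
print(g, e_g)
```

Here G exceeds its expected value, pointing to a concentration of high values. Because the statistic multiplies raw values rather than deviations from the mean, it only makes sense for non-negative, ratio-scale data, as noted in the comments above.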
45 Local Measures of Spatial Autocorrelation
46 Local Indicators of Spatial Association (LISA)
- Local versions of Moran's I, and the Getis-Ord G statistic
- Moran's I is most commonly used, and the local version is often called Anselin's LISA, or just LISA
- See: Luc Anselin, 1995, Local Indicators of Spatial Association-LISA, Geographical Analysis 27
47 Local Indicators of Spatial Association (LISA)
- The statistic is calculated for each areal unit in the data
- For each polygon, the index is calculated based on neighboring polygons with which it shares a border
- Because a measure is available for each polygon, these can be mapped to indicate how spatial autocorrelation varies over the study region
- Because each index has an associated test statistic, we can also map which of the polygons have a statistically significant relationship with their neighbors, and show the type of relationship
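A local Moran's I can be sketched for each unit as the product of its own standardized deviation and the weighted deviations of its neighbors. The form used below, $I_i = (x_i - \bar{x}) \sum_j w_{ij} (x_j - \bar{x}) / m_2$ with $m_2 = \sum_i (x_i - \bar{x})^2 / n$, follows Anselin's definition as commonly stated and is an assumption here, since the slide gives no formula. The five-unit chain, its values, and the row-standardized weights are made up.

```python
def local_morans_i(values, W):
    """Local Moran's I for every unit (assumed form, see lead-in)."""
    n = len(values)
    mean = sum(values) / n
    dev = [v - mean for v in values]
    m2 = sum(d * d for d in dev) / n  # population variance
    return [dev[i] * sum(W[i][j] * dev[j] for j in range(n)) / m2
            for i in range(n)]


# Hypothetical chain of five units with row-standardized weights.
W = [[0.0, 1.0, 0.0, 0.0, 0.0],
     [0.5, 0.0, 0.5, 0.0, 0.0],
     [0.0, 0.5, 0.0, 0.5, 0.0],
     [0.0, 0.0, 0.5, 0.0, 0.5],
     [0.0, 0.0, 0.0, 1.0, 0.0]]
values = [10, 9, 1, 2, 1]

local_i = local_morans_i(values, W)
global_i = sum(local_i) / len(local_i)
print([round(v, 3) for v in local_i], round(global_i, 3))
```

With row-standardized weights the average of the local values equals the global Moran's I, which illustrates the earlier remark that global measures can usually be decomposed into local ones. Unit 2 gets a negative value because it is a low value whose neighbors average above the mean. Significance testing per unit (for example by conditional permutation) is not shown.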
48 Example
49 Local Getis-Ord G and G* Statistics
- Local Getis-Ord G
  - It is the proportion of all x values in the study area accounted for by the neighbors of location i:
$$G_i(d) = \frac{\sum_{j} w_{ij}(d) \, x_j}{\sum_{j} x_j}, \quad j \ne i$$
  - G* will include the self value
- G will be high where high values cluster
- G will be low where low values cluster
- Interpreted relative to the expected value if randomly distributed:
$$E(G_i(d)) = \frac{\sum_{j} w_{ij}(d)}{n - 1}$$
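The local statistic and its expected value can be sketched as follows. The five-unit chain and its values are made up, binary contiguity stands in for the distance band d, and giving the unit itself a weight of 1 in G* is an assumption about how the self value is included.

```python
def local_g(values, W, i, include_self=False):
    """Local Getis-Ord G_i, or G_i* when include_self is True."""
    n = len(values)
    neighbor_sum = sum(W[i][j] * values[j] for j in range(n) if j != i)
    if include_self:
        # G_i*: unit i counts as its own neighbor (assumed weight 1).
        return (neighbor_sum + values[i]) / sum(values)
    return neighbor_sum / (sum(values) - values[i])


def expected_local_g(W, i):
    """Expected G_i under randomness: sum of unit i's weights over (n - 1)."""
    n = len(W)
    return sum(W[i][j] for j in range(n) if j != i) / (n - 1)


n = 5
# Hypothetical chain of five units with binary contiguity weights.
W = [[1 if abs(i - j) == 1 else 0 for j in range(n)] for i in range(n)]
values = [10, 9, 1, 2, 1]

g_first = local_g(values, W, 0)
g_first_star = local_g(values, W, 0, include_self=True)
g_last = local_g(values, W, 4)
print(g_first, expected_local_g(W, 0), g_last, expected_local_g(W, 4))
```

Unit 0 sits next to a high value and its G exceeds the expected 0.25, while unit 4 sits among low values and falls below it.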
50 Bivariate LISA
- Univariate Moran's I: Moran's I is the correlation between X and Lag-X, the same variable but in nearby areas
- Bivariate Moran's I is a correlation between X and a different variable in nearby areas
Figures: Moran scatter plot for GDI vs. AL; Moran significance map for GDI vs. AL
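A bivariate Moran's I can be sketched as the average product of standardized X at each location and the spatial lag of standardized Y. This form, with row-standardized weights, is an assumption chosen to match the description above; the four-unit chain and the two variables are made up.

```python
import math


def z_scores(values):
    """Standardize to mean 0 and (population) standard deviation 1."""
    n = len(values)
    mean = sum(values) / n
    sd = math.sqrt(sum((v - mean) ** 2 for v in values) / n)
    return [(v - mean) / sd for v in values]


def bivariate_moran(x, y, W):
    """Mean of z_x at each unit times the spatial lag of z_y (assumed form)."""
    n = len(x)
    zx, zy = z_scores(x), z_scores(y)
    lag_y = [sum(W[i][j] * zy[j] for j in range(n)) for i in range(n)]
    return sum(zx[i] * lag_y[i] for i in range(n)) / n


# Hypothetical chain of four units with row-standardized weights.
W = [[0.0, 1.0, 0.0, 0.0],
     [0.5, 0.0, 0.5, 0.0],
     [0.0, 0.5, 0.0, 0.5],
     [0.0, 0.0, 1.0, 0.0]]
x = [1, 2, 3, 4]

same_direction = bivariate_moran(x, [2, 4, 6, 8], W)
opposite_direction = bivariate_moran(x, [8, 6, 4, 2], W)
print(same_direction, opposite_direction)
```

When Y is a rescaled copy of X the result equals the univariate Moran's I of X, and reversing Y flips the sign.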
51 Bivariate LISA and the Correlation Coefficient
- Correlation coefficient is the relationship between two different variables in the same area
- Bivariate LISA is a correlation between two different variables in an area and in nearby areas
52 Bivariate Moran Scatter Plot
Quadrants (SA = spatial autocorrelation):
- Low/High: negative SA
- High/High: positive SA
- Low/Low: positive SA
- High/Low: negative SA
53 Summary
- Spatial autocorrelation of areal data
- Spatial weight matrix
- Measures of spatial autocorrelation
  - Global measures: Moran's I/General G and G*
  - Local measures
    - LISA: Moran's I/General G and G*
    - Bivariate LISA
  - Significance test
54 End of this topic
More informationSTAT 515 fa 2016 Lec Sampling distribution of the mean, part 2 (central limit theorem)
STAT 515 fa 2016 Lec 15-16 Samplig distributio of the mea, part 2 cetral limit theorem Karl B. Gregory Moday, Sep 26th Cotets 1 The cetral limit theorem 1 1.1 The most importat theorem i statistics.............
More informationExpectation and Variance of a random variable
Chapter 11 Expectatio ad Variace of a radom variable The aim of this lecture is to defie ad itroduce mathematical Expectatio ad variace of a fuctio of discrete & cotiuous radom variables ad the distributio
More informationCorrelation. Two variables: Which test? Relationship Between Two Numerical Variables. Two variables: Which test? Contingency table Grouped bar graph
Correlatio Y Two variables: Which test? X Explaatory variable Respose variable Categorical Numerical Categorical Cotigecy table Cotigecy Logistic Grouped bar graph aalysis regressio Mosaic plot Numerical
More informationChapter 12 Correlation
Chapter Correlatio Correlatio is very similar to regressio with oe very importat differece. Regressio is used to explore the relatioship betwee a idepedet variable ad a depedet variable, whereas correlatio
More informationGoodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen)
Goodess-of-Fit Tests ad Categorical Data Aalysis (Devore Chapter Fourtee) MATH-252-01: Probability ad Statistics II Sprig 2019 Cotets 1 Chi-Squared Tests with Kow Probabilities 1 1.1 Chi-Squared Testig................
More informationLecture 1 Probability and Statistics
Wikipedia: Lecture 1 Probability ad Statistics Bejami Disraeli, British statesma ad literary figure (1804 1881): There are three kids of lies: lies, damed lies, ad statistics. popularized i US by Mark
More informationStat 139 Homework 7 Solutions, Fall 2015
Stat 139 Homework 7 Solutios, Fall 2015 Problem 1. I class we leared that the classical simple liear regressio model assumes the followig distributio of resposes: Y i = β 0 + β 1 X i + ɛ i, i = 1,...,,
More informationChapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc.
Chapter 22 Comparig Two Proportios Copyright 2010, 2007, 2004 Pearso Educatio, Ic. Comparig Two Proportios Read the first two paragraphs of pg 504. Comparisos betwee two percetages are much more commo
More informationChapter 6 Principles of Data Reduction
Chapter 6 for BST 695: Special Topics i Statistical Theory. Kui Zhag, 0 Chapter 6 Priciples of Data Reductio Sectio 6. Itroductio Goal: To summarize or reduce the data X, X,, X to get iformatio about a
More informationQuick Review of Probability
Quick Review of Probability Berli Che Departmet of Computer Sciece & Iformatio Egieerig Natioal Taiwa Normal Uiversity Refereces: 1. W. Navidi. Statistics for Egieerig ad Scietists. Chapter 2 & Teachig
More informationChapter 23: Inferences About Means
Chapter 23: Ifereces About Meas Eough Proportios! We ve spet the last two uits workig with proportios (or qualitative variables, at least) ow it s time to tur our attetios to quatitative variables. For
More informationExample 3.3: Rainfall reported at a group of five stations (see Fig. 3.7) is as follows. Kundla. Sabli
3.4.4 Spatial Cosistecy Check Raifall data exhibit some spatial cosistecy ad this forms the basis of ivestigatig the observed raifall values. A estimate of the iterpolated raifall value at a statio is
More informationFACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures
FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING Lectures MODULE 5 STATISTICS II. Mea ad stadard error of sample data. Biomial distributio. Normal distributio 4. Samplig 5. Cofidece itervals
More informationQuick Review of Probability
Quick Review of Probability Berli Che Departmet of Computer Sciece & Iformatio Egieerig Natioal Taiwa Normal Uiversity Refereces: 1. W. Navidi. Statistics for Egieerig ad Scietists. Chapter & Teachig Material.
More informationLesson 11: Simple Linear Regression
Lesso 11: Simple Liear Regressio Ka-fu WONG December 2, 2004 I previous lessos, we have covered maily about the estimatio of populatio mea (or expected value) ad its iferece. Sometimes we are iterested
More informationHypothesis Testing. Evaluation of Performance of Learned h. Issues. Trade-off Between Bias and Variance
Hypothesis Testig Empirically evaluatig accuracy of hypotheses: importat activity i ML. Three questios: Give observed accuracy over a sample set, how well does this estimate apply over additioal samples?
More informationCircle the single best answer for each multiple choice question. Your choice should be made clearly.
TEST #1 STA 4853 March 6, 2017 Name: Please read the followig directios. DO NOT TURN THE PAGE UNTIL INSTRUCTED TO DO SO Directios This exam is closed book ad closed otes. There are 32 multiple choice questios.
More informationConfidence Intervals for the Population Proportion p
Cofidece Itervals for the Populatio Proportio p The cocept of cofidece itervals for the populatio proportio p is the same as the oe for, the samplig distributio of the mea, x. The structure is idetical:
More informationTopic 9: Sampling Distributions of Estimators
Topic 9: Samplig Distributios of Estimators Course 003, 2016 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be
More informationProbability and statistics: basic terms
Probability ad statistics: basic terms M. Veeraraghava August 203 A radom variable is a rule that assigs a umerical value to each possible outcome of a experimet. Outcomes of a experimet form the sample
More informationSection 9.2. Tests About a Population Proportion 12/17/2014. Carrying Out a Significance Test H A N T. Parameters & Hypothesis
Sectio 9.2 Tests About a Populatio Proportio P H A N T O M S Parameters Hypothesis Assess Coditios Name the Test Test Statistic (Calculate) Obtai P value Make a decisio State coclusio Sectio 9.2 Tests
More informationStatistical Fundamentals and Control Charts
Statistical Fudametals ad Cotrol Charts 1. Statistical Process Cotrol Basics Chace causes of variatio uavoidable causes of variatios Assigable causes of variatio large variatios related to machies, materials,
More informationUnderstanding Samples
1 Will Moroe CS 109 Samplig ad Bootstrappig Lecture Notes #17 August 2, 2017 Based o a hadout by Chris Piech I this chapter we are goig to talk about statistics calculated o samples from a populatio. We
More informationMASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 19 11/17/2008 LAWS OF LARGE NUMBERS II THE STRONG LAW OF LARGE NUMBERS
MASSACHUSTTS INSTITUT OF TCHNOLOGY 6.436J/5.085J Fall 2008 Lecture 9 /7/2008 LAWS OF LARG NUMBRS II Cotets. The strog law of large umbers 2. The Cheroff boud TH STRONG LAW OF LARG NUMBRS While the weak
More informationLecture 1 Probability and Statistics
Wikipedia: Lecture 1 Probability ad Statistics Bejami Disraeli, British statesma ad literary figure (1804 1881): There are three kids of lies: lies, damed lies, ad statistics. popularized i US by Mark
More informationAppendix D Some Portfolio Theory Math for Water Supply
DESALINATION, WITH A GRAIN OF SALT A CALIFORNIA PERSPECTIVE 9 Appedix D Some Portfolio Theory Math for Water Supply Costat-Reliability-Beefit Uit Costs The reliability ad cost of differet water-supply
More informationM1 for method for S xy. M1 for method for at least one of S xx or S yy. A1 for at least one of S xy, S xx, S yy correct. M1 for structure of r
Questio 1 (i) EITHER: 1 S xy = xy x y = 198.56 1 19.8 140.4 =.44 x x = 1411.66 1 19.8 = 15.657 1 S xx = y y = 1417.88 1 140.4 = 9.869 14 Sxy -.44 r = = SxxSyy 15.6579.869 = 0.76 1 S yy = 14 14 M1 for method
More informationCorrelation and Covariance
Correlatio ad Covariace Tom Ilveto FREC 9 What is Next? Correlatio ad Regressio Regressio We specify a depedet variable as a liear fuctio of oe or more idepedet variables, based o co-variace Regressio
More informationExam II Covers. STA 291 Lecture 19. Exam II Next Tuesday 5-7pm Memorial Hall (Same place as exam I) Makeup Exam 7:15pm 9:15pm Location CB 234
STA 291 Lecture 19 Exam II Next Tuesday 5-7pm Memorial Hall (Same place as exam I) Makeup Exam 7:15pm 9:15pm Locatio CB 234 STA 291 - Lecture 19 1 Exam II Covers Chapter 9 10.1; 10.2; 10.3; 10.4; 10.6
More information