StatiStical MethodS for GeoGraphy

Size: px
Start display at page:

Download "StatiStical MethodS for GeoGraphy"

Transcription

1 Peter A. Rogerso StatStcal MethodS for GeoGraphy A Studet S Gude F o u r t h e d t o 00_Rogerso_4e_BAB1405B0092_Prelms.dd 3 10/13/2014 5:30:29 PM

2 2 DESCRIPTIVE STATISTICS Learg Objectves Types of Data 26 Vsual Descrptve Methods 27 Measures of Cetral Tedecy 31 Measures of Varablty 33 Other Numercal Measures for Descrbg Data 35 Descrptve Spatal Statstcs 38 Descrptve Statstcs SPSS 21 for Wdows 49 I Chapter 1, a fudametal dstcto was draw betwee descrptve ad feretal statstcs. We saw that descrbg data costtutes a mportat early phase of the scetfc method. I ths chapter, we wll focus upo vsual ad umercal descrptve summares of data. We wll beg by descrbg dfferet types of data ad by coverg some of the vsual approaches that are commoly used to explore ad descrbe data. Followg ths, umercal measures of descrpto are revewed. Fally, descrpto s dscussed for the specal cotext of spatal data. 2.1 TYPES OF DATA Data may be classfed as omal, ordal, terval, or rato. Nomal data are observatos that have bee placed to a set of mutually exclusve ad collectvely exhaustve categores. Examples of omal data clude sol type ad vegetato type. Ordal data cosst of observatos that are raked. Thus, t s possble to say that oe observato s greater tha (or less tha) aother, but wth ths much formato t s ot possble to say by how much a observato s greater or less tha aother. It s ot ucommo to fd ordal data almaacs ad statstcal abstracts; a example s data o the sze of ctes, by rak. 02_Rogerso_4e_BAB1405B0092_Ch-02.dd 26 10/3/ :40:35 AM

3 Descrptve Statstcs 27 Whe t s possble to say by how much oe observato s greater or less tha aother, data are ether terval or rato. Wth terval data, dffereces values are detfable. For example, o the Fahrehet temperature scale, 44 degrees s 12 degrees warmer tha 32 degrees. However, the zero s ot meagful o the terval scale, ad cosequetly rato terpretatos are ot possble. Thus, 44 degrees s ot twce as warm as 22 degrees. Rato data, o the other had, does have a meagful zero. Thus, 100 degrees Kelv s twce as warm as 50 degrees Kelv. Most umercal data are rato data deed, t s dffcult to thk of examples for terval data other tha the Fahrehet ad Celsus scales. Data may cosst of values that are ether dscrete or cotuous. Dscrete varables take o oly a fte set of values examples clude the umber of suy days a year, the aual umber of vsts by a famly to a local publc faclty, ad the mothly umber of collsos betwee automobles ad deer a rego. Cotuous varables take o a fte umber of values; examples clude temperature ad elevato. 2.2 VISUAL DESCRIPTIVE METHODS Suppose that we wsh to lear somethg about the commutg behavor of resdets a commuty. Perhaps we are o a commttee that s vestgatg the potetal mplemetato of a publc trast alteratve, ad we eed to kow how may mutes, o average, t takes people to get to work by car. We do ot have the resources to ask everyoe, ad so we decde to take a sample of automoble commuters. Let s say we survey = 30 resdets, askg them to record ther average tme t takes to get to work. We receve the resposes show pael (a) of Table 2.1. We may summarze our data vsually by costructg a hstogram, whch s a vertcal bar graph. To costruct a hstogram, the data are frst grouped to categores. The hstogram cotas oe vertcal bar for each category. The heght of the bar represets the umber of observatos the category (.e., the absolute frequecy), ad t s commo to ote the mdpot of the category o the horzotal axs. Fgure 2.1 s a hstogram for the hypothetcal commutg data Table 2.1, produced by SPSS for Wdows. A alteratve to the hstogram s the frequecy polygo; t may be draw by coectg the pots formed at the mddle of the top of each vertcal bar. Data may also be summarzed va box plots. Fgure 2.2 depcts a box plot for the commutg data. The horzotal le rug through the rectagle deotes the meda (21), ad the lower ad upper eds of the rectagle (sometmes called the hges ) represet the 25th ad 75th percetles, respectvely. Vellema ad Hoagl (1981) ote that there are two commo ways to draw the whskers, whch exted upward ad dowward from the hges. Oe way s to sed the whskers out to the mmum ad maxmum values. I ths case, the box plot represets a graphcal summary of what s sometmes called a fve-umber summary of the dstrbuto (the mmum, maxmum, 25th ad 75th percetles, ad the meda). 02_Rogerso_4e_BAB1405B0092_Ch-02.dd 27 10/3/ :40:35 AM

4 28 Statstcal Methods for Geography Table 2.1 Commutg data (a) Data o dvduals Idvdual o. Commutg tme (m.) Idvdual o. Commutg tme (m.) (b) raked commutg tmes 5, 5, 6, 9, 10, 11, 11, 12, 12, 14, 16, 17, 19, 21, 21, 21, 21, 21, 22, 23, 24, 24, 26, 26, 31, 31, 36, 42, 44, Frequecy Std. Dev = Mea = 21.9 N = Fgure 2.1 Hstogram for commutg data 02_Rogerso_4e_BAB1405B0092_Ch-02.dd 28 10/3/ :40:36 AM

5 Descrptve Statstcs 29 Commutg tme Fgure 2.2 Boxplot for commutg data There are ofte extreme outlers the data that are far from the mea, ad ths case t s ot preferable to sed whskers out to these extreme values. Istead, whskers are set out to the outermost observatos that are stll wth a dstace from the hge that s equal to 1.5 tmes the terquartle rage (where the terquartle rage s determed as the dfferece betwee the 75th ad 25th percetles, equvalet to the vertcal legth of the rectagle). All other observatos beyod ths are cosdered outlers, ad are show dvdually. I the commutg data, 1.5 tmes the terquartle rage s equal to 1.5(14.25) = The whsker extedg dowward from the lower hge exteds to the mmum value of 5, sce ths s greater tha the lower hge (11.75) mus The whsker extedg upward from the upper hge stops at 44, whch s the hghest observato less tha (whch tur s equal to the upper hge (26) plus ). Note that there s a sgle outler observato 9 ad t has a value of 77 mutes. A stem-ad-leaf plot s a alteratve way to dsplay the frequeces of observatos. It s smlar to a hstogram tlted oto ts sde, wth the actual dgts of each observato s value used place of bars. The leadg dgts costtute the stem, ad the tralg dgts make up the leaf. Each stem has oe or more leaves, wth each leaf correspodg to a observato. The vsual depcto of the frequecy of leaves coveys to the reader a mpresso of the frequecy of observatos that fall wth gve rages. Joh Tukey, the desger of the stem-ad-leaf plot, has sad, If we are gog to make a mark, t may as well be a meagful oe. The smplest ad most useful meagful mark s a dgt (Tukey 1972, p. 269). For the commutg data, whch have at most two-dgt values, the frst dgt s the stem, ad the secod s the leaf (see Fgure 2.3). Note that for each tem the stem, the fal dgts are arraged umercal order, from lowest to hghest. 02_Rogerso_4e_BAB1405B0092_Ch-02.dd 29 10/3/ :40:36 AM

6 30 Statstcal Methods for Geography Frequecy Stem & Leaf Extremes > =77) Stem wdth: Each leaf: 1 case(s) Fgure 2.3 Stem-ad-leaf plot for commutg data To gve aother example that makes use of vsual descrpto, cosder school dstrct admstrators, who ofte take cesuses of the umber of school-age chldre ther dstrct, so that they may form hopefully accurate estmates of future erollmet. Table 2.2 gves the hypothetcal resposes of 750 households whe asked how may school-age chldre are co-resdets. Table 2.2 Frequecy of chldre households Number of chldre Absolute frequecy Total 750 The absolute frequeces may be traslated to relatve frequeces by dvdg by the total umber of observatos ( ths case, 750). Table 2.3 reveals, for example, that 26.7% of all households surveyed had oe chld. Note that the sum of the relatve frequeces s equal to oe. Note also that we ca easly costruct a hstogram usg the relatve frequeces stead of the absolute frequeces (see Fgure 2.4); the hstogram pael (b) has precsely 02_Rogerso_4e_BAB1405B0092_Ch-02.dd 30 10/3/ :40:36 AM

7 Descrptve Statstcs 31 Table 2.3 Absolute ad relatve frequeces Number of chldre Absolute frequecy Relatve frequecy /750 = /750 = /750 = /750 = /750 =.067 Total (a) (b) Fgure 2.4 Number of chldre households: (a) absolute frequecy; (b) relatve frequecy the same shape as that pael (a); the vertcal scale has just bee chaged by a factor equal to the sample sze of MEASURES OF CENTRAL TENDENCY We may cotue our descrptve aalyss of the data Table 2.1 by summarzg the formato umercally. The sample mea commutg tme s smply the average of our observatos; t s foud by addg all of the dvdual resposes ad dvdg by the umber of observatos. The sample mea s tradtoally deoted by x ; our example, we have x = 658/30 = mutes. I practce, ths could sesbly be rouded to 22 mutes. We ca use otato to state more formally that the mea s the sum of the observatos, dvded by the umber of observatos: x x = =Σ 1 (2.1) 02_Rogerso_4e_BAB1405B0092_Ch-02.dd 31 10/3/ :40:37 AM

8 32 Statstcal Methods for Geography where x deotes the value of observato, ad where there are observatos. (A revew of mathematcal covetos ad mathematcal otato may lkely be order for may readers; see Appedx B.) The meda s defed as the observato that splts the raked lst of observatos (arraged from lowest to hghest, or hghest to lowest) half. Whe the umber of observatos s odd, the meda s smply equal to the mddle value o a raked lst of the observatos. Whe the umber of observatos s eve, we take the meda to be the average of the two values the mddle of the raked lst. Half of all respodets our example have commutes that are loger tha the meda, ad half have commutes that are shorter. Whe the resposes are raked as pael (b) of Table 2.1, the two the mddle are 21 ad 21. The meda ths case s therefore equal to 21 mutes. The mode s defed as the most frequetly occurrg value; here the mode s also 21 mutes, sce that value occurs more frequetly tha ay other outcome. May varables have dstrbutos where a small umber of hgh values cause the mea to be much larger tha the meda; ths s typcally true for come dstrbutos ad dstace dstrbutos. For example, Rogerso et al. (1993) used the US Natoal Survey of Famles ad Households to study the dstace that adult chldre lved from ther parets. For adult chldre wth both parets alve ad lvg together, the mea dstace to parets s over 200 mles, ad yet the meda dstace s oly 25 mles! Because the mea s ot represetatve of the data crcumstaces such as these, t s commo to use the meda as a measure of cetral tedecy. Whe data are avalable oly for categores, grouped meas may be calculated. Ths s acheved by assumg that all of the data wth a partcular category take o the mdpot value of the category. For example, Table 2.4 portrays some hypothetcal data o come (uts have bee delberately omtted to keep the example locato-free!). Table 2.4 Absolute frequeces assocated wth observatos o come Icome Frequecy (Number of dvduals) < 15, ,000 34, ,000 54, ,000 99, The grouped mea s foud by assumg that the te dvduals the frst category have a come of 7500 (the mdpot of the category), the 20 dvduals the secod category each have a come of 25,000, the 30 dvduals the ext category each have a come of 45,000, ad those the fal category each have a come of 77,500. All 02_Rogerso_4e_BAB1405B0092_Ch-02.dd 32 10/3/ :40:37 AM

9 Descrptve Statstcs 33 of these dvdual values are added, ad the result s dvded by the umber of dvduals. Thus, the grouped mea for ths example s: More formally, 10(7,500) + 20(25,000) + 30(45,000) + 15(77,500) x G Σ fx md g = =, 1 G Σ f = 1 = 41, 167 (2.2) (2.3) where x g deotes the grouped mea, G s the umber of groups, f s the umber of observatos group, ad x,md deotes the value of the mdpot of the group. The grouped mea may be see as a weghted average, where the mdpot of each group s weghted by the frequecy of observatos that group. After all of these weghted quattes have bee calculated ad summed, the last step the calculato of a weghted average s to dvde by the sum of the weghts. A smlar way to look at the cocept of the grouped mea s to mage that we eed to create a specfc value for each observato, eve though we do t have ths formato. I the example above, we oly kow that there are te observatos that are less tha 15,000 we do t kow ther dvdual values. Image startg a lst we wll guess that the frst observato s 7,500, the secod observato s 7,500, ad so o. The begg of our lst thus has te values of 7,500. The we move o to the secod category there are 20 observatos the ext category ad so we add 20 staces of 25,000 to the lst, sce ths would be our best guess for each of these observatos. Whe we get doe wth the costructo of ths lst, we smply add the umbers (whch gves us the umerator Equatos 2.2 ad 2.3), ad we dvde by the umber of tems o the lst (whch s the deomator Equatos 2.2 ad 2.3). It s ot ucommo to fd that the last category s ope-eded; stead of the 55,000 99,999 category, t mght be more commo for data to be reported a category labeled 55,000 ad above. I ths case, a educated estmate of the average salary for those ths group should be made. It would also be useful to make a umber of such estmates ad repeat the calculato of the grouped mea, to see how sestve the result s to dfferet choces for the estmate. 2.4 MEASURES OF VARIABILITY We may also summarze hstograms ad datasets by characterzg ther varablty. The commutg data Table 2.1 rage from a low of 5 mutes to a hgh of 77 mutes. The rage s the dfferece betwee the two values: here t s equal to 77 5 = 72 mutes. 02_Rogerso_4e_BAB1405B0092_Ch-02.dd 33 10/3/ :40:38 AM

10 34 Statstcal Methods for Geography The terquartle rage s the dfferece betwee the 25th ad 75th percetles. Wth observatos, the 25th percetle s represeted by observato ( + 1)/4, whe the data have bee raked from lowest to hghest. The 75th percetle s represeted by observato 3( + 1)/4. These wll ofte ot be tegers, ad that case terpolato s used, just as t s for the meda whe there s a eve umber of observatos. For the commutg data, the 25th percetle s represeted by observato (30 + 1)/4 = Iterpolato betwee the 7th ad 8th lowest observatos requres that we go 3/4 of the way from the 7th lowest observato (whch s 11) to the 8th lowest observato (whch s 12). Ths mples that the 25th percetle s (sce s 3/4 of the way from 11 to 12; more formally, ths ca be foud by (a) multplyg 3/4 by the dfferece the two observatos (3/4 (12 11) = 3/4, ad the (b) addg the result to the smaller of the observatos (11 + 3/4 = 11.75). Smlarly, the 75th percetle s represeted by observato 3(30 + 1)/4 = Sce both the 23rd ad 24th observatos are equal to 26, the 75th percetle s equal to 26. The terquartle rage s the dfferece betwee these two values, or = The sample varace of the data (deoted s 2 ) may be thought of as the average squared devato of the observatos from the mea. To esure that the sample varace gves a ubased estmate of the true, ukow varace of the populato from whch the sample was draw (deoted σ 2 ), s 2 s computed by takg the sum of the squared devatos, ad the dvdg the result by 1, stead of by. Here the term ubased mples that f we were to repeat ths samplg may tmes, we would fd that the average or mea of our may sample varaces would be equal to the true varace. Thus, the sample varace s foud by takg the sum of squared devatos from the mea, ad the dvdg by 1: s 2 = 1 2 Σ( x x) = 1 (2.4) A approxmate terpretato of the varace s that t represets the average squared devato of a observato from the mea (t s a approxmate terpretato because 1, stead of, s used the deomator). I our example, s 2 = mutes 2. The sample stadard devato s equal to the square root of the sample varace; here we have s = = mutes. Note that the uts for the stadard devato are the same as those for the varable tself here, for example, the stadard devato s gve mutes. Sce the sample varace characterzes the average squared devato from the mea, by takg the square root to calculate the stadard devato, we are puttg the measure of varablty back o a scale closer to that used for the mea ad the orgal data. Although the stadard devato s ot equal to the average absolute devato of a observato from the mea, t s usually close. Varaces for grouped data are foud by assumg that all observatos are at the mdpot of ther category, ad the calculato s based o the sum of squared devatos of these mdpot values from the grouped mea: 02_Rogerso_4e_BAB1405B0092_Ch-02.dd 34 10/3/ :40:38 AM

11 Descrptve Statstcs 35 s G 2 = 1 g = 2, md g Σ f ( x x ) G ( Σ f ) 1 = 1 (2.5) For the data Table 2.4, the grouped varace s (10(7,500 41, 167) 2 +20(25,000 41,167) (45,000 41,167) + 15(77,500 41,167) ) = ( ) 1 8 (2.6) The square root of ths, 22,301, s the grouped stadard devato. 2.5 OTHER NUMERICAL MEASURES FOR DESCRIBING DATA Coeffcet of Varato Cosder the sellg prce of homes two commutes. I commuty A, the mea prce s 150,000 (uts are aga delberately omtted, so that the llustrato may apply to more tha oe ut of currecy!). The stadard devato s 75,000. I commuty B, the mea sellg prce s 80,000, ad the stadard devato s 60,000. The stadard devato s a absolute measure of varablty; ths example, such varablty s clearly lower commuty B. However, t s also useful to thk terms of relatve varablty. Relatve to ts mea, the varablty commuty B s greater tha that commuty A. More specfcally, the coeffcet of varato s defed as the rato of the stadard devato to the mea. Here, the coeffcet of varato commuty A s 75,000/150,000 = 0.5; commuty B, t s 60,000/80,000 = Skewess Skewess measures the degree of asymmetry exhbted by the data ad hstogram. Fgure 2.5 s clearly asymmetrc, ad t reveals that there are more observatos below the mea tha above t ths s kow as postve skewess. Postve skewess ca also be detected by comparg the mea ad meda. Whe the mea s greater tha the meda as t s here, the dstrbuto s postvely skewed. I cotrast, whe there are a small umber of low observatos ad a large umber of hgh oes, the mea s less tha the meda, ad the data exhbt egatve skewess (see Fgure 2.6). The sample skewess s computed by frst addg together the cubed devatos from the mea ad the dvdg by the product of the cubed stadard devato ad the umber of observatos: 02_Rogerso_4e_BAB1405B0092_Ch-02.dd 35 10/3/ :40:39 AM

12 36 Statstcal Methods for Geography Relatve frequecy x Fgure 2.5 A postvely skewed dstrbuto Relatve frequecy x Fgure 2.6 A egatvely skewed dstrbuto skewess = Σ ( x x) = 1 3 s 3 (2.7) The 30 commutg tmes Table 2.1 have a postve skewess of If skewess equals zero, the hstogram s symmetrc about the mea Kurtoss Kurtoss measures how peaked the hstogram s. Its defto s smlar to that for skewess, wth the excepto that the fourth power s used stead of the thrd: kurtoss = Σ( x x) = 1 s 4 4. (2.8) 02_Rogerso_4e_BAB1405B0092_Ch-02.dd 36 10/3/ :40:40 AM

13 Descrptve Statstcs 37 (a) Relatve frequecy Fgure 2.7(a) Leptokurtc dstrbuto (b) Relatve frequecy Fgure 2.7(b) Platykurtc dstrbuto Data wth a hgh degree of peakedess are sad to be leptokurtc, ad have values of kurtoss over 3.0 (see Fgure 2.7a). Flat hstograms are platykurtc, ad have kurtoss values less tha 3.0 (Fgure 2.7b). The kurtoss of the commutg tmes s equal to 7.68, ad hece the dstrbuto of commutg tmes s relatvely peaked Stadard Scores Sce data come from dstrbutos wth dfferet meas ad dfferet degrees of varablty, t s commo to stadardze observatos. Oe way to do ths s to trasform each observato to a z-score by frst subtractg the mea of all observatos, ad the dvdg the result by the stadard devato: 02_Rogerso_4e_BAB1405B0092_Ch-02.dd 37 10/3/ :40:40 AM

14 38 Statstcal Methods for Geography x x z =. (2.9) s z-scores may be terpreted as the umber of stadard devatos a observato s away from the mea. Data below the mea have egatve z-scores, whle data above the mea have postve z-scores. For the commutg data Table 2.1, the z-score for dvdual 1 s ( )/14.3 = Ths dvdual has a commutg tme that s 1.18 stadard devatos below the mea. 2.6 DESCRIPTIVE SPATIAL STATISTICS To ths pot, our dscusso of descrptve statstcs has bee geeral, the sese that the cocepts ad methods covered apply to a wde rage of data types. I ths secto, we revew a umber of descrptve statstcs that are useful provdg umercal summares of spatal data. Descrptve measures of spatal data are mportat uderstadg ad evaluatg such fudametal geographc cocepts as accessblty ad dsperso. For example, t s mportat to locate publc facltes so that they are accessble to defed populatos. Spatal measures of cetralty appled to the locato of dvduals the populato wll result geographc locatos that are some sese optmal wth respect to accessblty to the faclty. Smlarly, t s mportat to characterze the dsperso of evets aroud a pot. It s useful to summarze the spatal dsperso of dvduals aroud a hazardous waste ste. Are dvduals wth a partcular dsease less dspersed aroud the ste tha are people wthout the dsease? If so, ths could dcate that there s creased rsk of dsease at locatos ear the ste The Measuremet of Dstace The measuremet of dstace plays a key role the descrpto of spatal data. The shortest dstace betwee two pots o a global scale s called the great crcle dstace. Ths s represeted by a arc that, f exteded, would form a crcle whose plae would pass through the ceter of the earth. Ar travel over log dstaces s plaed usg these arcs. The great crcle dstace betwee ay two pots o a sphere may be calculated as follows. Let (a 1, b 1 ) ad (a 2, b 2 ) be the (lattude, logtude) pars for pots 1 ad 2, respectvely. The great crcle dstace betwee pots 1 ad 2 s d = r{arccos[sa s a + cosa cosa cos( b b )]} (2.10) where the sphere has radus r ad where the otato arccos[] sgfes the agle whose cose s equal to the term wth the square brackets. (A equvalet alteratve otato for arccos[x] s cos -1 [x].) 02_Rogerso_4e_BAB1405B0092_Ch-02.dd 38 10/3/ :40:41 AM

15 Descrptve Statstcs 39 Example Let s fd the great crcle dstace betwee Lodo (lattude a 1 = 51.51N, b 1 = logtude 0.13W) ad New York Cty (lattude a 2 = 40.67N, logtude b 2 = 73.94W). The earth has a radus of approxmately 3,963 mles. We must assume that t s a sphere (although all grade school chldre kow that t s actually a oblate spherod). Usg Equato 2.10, ths results d = 3,963 {arcos[(0.7827)(0.6517)+(0.6224)(0.7585)(0.2788)]} = 3,963{arccos[0.6417]} = 3,963 (.8741) = 3,464 mles The agle whose cose s s degrees. Oe fal pot about Equato 2.10 t requres the use of agles radas, ad ot degrees. To covert a agle from degrees to radas, multply by 3.14/180. Here 50.08(3.14)/180 = radas. The fal step s to multply ths result by r: (0.8741)(3,963) = 3,464 mles. For the calculato of dstaces o a smaller scale (say, wth a cty) we ca make the much more drastc assumpto that the world s flat. The we ca supermpose a grd o a map of the area, ad use the grd coordates for our calculatos. The two most commo measures of dstace o such a flat plae are Eucldea ad Mahatta dstaces. Let (x 1, y 1 ) ad (x 2, y 2 ) represet the x- ad y-coordates of pots 1 ad 2, respectvely. The Eucldea dstace s the shortest dstace betwee two pots ad s calculated as d = ( x x ) + ( y y ) (2.11) You may recogze ths s as the Pythagorea formula for the legth of the hypoteuse (the sold le Fgure 2.8) of a rght tragle. The Mahatta dstace s gve by d = ( x2 x1) + ( y2 y1 ) (2.12) where the vertcal les Equato 2.12 dcate that the absolute value s to be take. Ths s the dstace betwee two pots whe travel s restrcted to occur oly horzotal ad vertcal drectos. It s equal to the sum of the legth of the two dotted les Fgure 2.8. For example, suppose (x 1, y 1 ) = (8,4) ad (x 2, y 2 ) = (7,10). The the Eucldea dstace s 2 2 d = ( 7 8) + ( 10 4) = 37 = 6.08 ad the Mahatta dstace s d = ( 7 8) + ( 10 4 ) = 1+6 = 7. 02_Rogerso_4e_BAB1405B0092_Ch-02.dd 39 10/3/ :40:42 AM

16 40 Statstcal Methods for Geography Y X 2, Y 2 X 1, Y 1 X Fgure 2.8 Mahatta ad Eucldea dstaces Mea Ceter The most commoly used spatal measure of cetral tedecy s the mea ceter. For pot data, the x- ad y-coordates of the mea ceter are foud by smply fdg the mea of the x-coordates ad the mea of the y-coordates, respectvely. For areal data, the mea ceter ca be foud by assumg that we kow the x- ad y-coordates assocated wth the cetrods of each subrego. Geographc cetrods are pot locatos represetg the balace pot of the rego. Thus f you were holdg up a physcal map of the rego (say made out of metal, plastc, wood, or some other materal), the geographc cetrod s that locato where you could balace the map o oe fger. A populato cetrod s the same as the populato-weghted ceter of populato the very quatty we are dscussg ths secto! Gve the x- ad y-coordates of the cetrods for each surego, t s ofte useful to attach weghts to them. To fd the mea ceter of populato for stace, the weghts are take as the umber of people lvg each subrego. The weghted mea of the x-coordates ad y-coordates the provdes the locato of the mea ceter of populato. More specfcally, whe there are subregos, Σ wx Σ wy = 1 = 1 x = ; y = (2.13) Σ w Σ w where the w are the weghts (e.g., populato rego ) ad x ad y are the coordates of the cetrod rego. Coceptually, ths s detcal to assumg that all dvduals 02_Rogerso_4e_BAB1405B0092_Ch-02.dd 40 10/3/ :40:43 AM

17 Descrptve Statstcs 41 lvg a partcular subrego lve at a prespecfed pot ( ths case, the cetrod) of that subrego. The mea ceter s detcal to the locato that would be foud f the x- ad y-coordates for each dvdual were frst wrtte out, ad the the mea of all the x s ad all the y s were foud. Equato 2.13, through ts use of a weghted mea, merely provdes a qucker way to arrve at the soluto. The mea ceter of populato the USA has mgrated west ad south over tme (see Fgure 2.9). The weghted mea ceter has the property that t mmzes the sum of squared dstaces that dvduals must travel (assumg that each perso travels to the cetralzed faclty located at the mea ceter). Although t s easy to calculate, ths terpretato s a lttle usatsfyg t would be cer to be able to fd a cetral locato that mmzes the sum of dstaces, rather tha the sum of squared dstaces Meda Ceter The locato that mmzes the sum of dstaces traveled s kow as the meda ceter. Although ts terpretato s more straghtforward tha that of the mea ceter, ts calculato s more complex. Calculato of the meda ceter s teratve, ad oe begs by guessg a tal locato (a coveet tal locato s the weghted mea ceter). The the ew x- ad y-coordates (deoted by x ad y, respectvely) are updated usg the followg: wx Σ = 1 d x = ; y = w Σ = 1 d wy Σ d w Σ = 1 d = 1 (2.14) where d s the dstace from pot to the specfed tal locato of the meda ceter. Ths process s the carred out aga ew x- ad y-coordates are aga foud usg these same equatos, wth the oly dfferece beg that d s redefed as the dstace from pot to the most recetly calculated locato for the meda ceter. Ths teratve process s termated whe the ewly computed locato of the meda ceter does ot dffer sgfcatly from the prevously computed locato. I the applcato of socal physcs to spatal teracto, populato dvded by dstace s cosdered a measure of populato potetal or accessblty. If the w s are defed as populatos, the each terato fds a updated locato based upo weghtg each pot or areal cetrod by ts accessblty to the curret meda ceter. The meda ceter s the fxed pot that s mapped to tself whe weghted by accessblty. Alteratvely stated, the meda ceter s a accessblty-weghted mea ceter, where accessblty s defed terms of the dstaces from each pot or areal cetrod to the meda ceter. 02_Rogerso_4e_BAB1405B0092_Ch-02.dd 41 10/3/ :40:44 AM

18 Fgure 2.9 Mea Ceter of Populato for the Uted States: 1790 to 2010 Source: US Bureau of the Cesus (2011) 02_Rogerso_4e_BAB1405B0092_Ch-02.dd 42 10/3/ :40:44 AM

19 Descrptve Statstcs Stadard Dstace Aspatal measures of varablty, such as the varace ad stadard devato, characterze the amout of dsperso of data pots aroud the mea. Smlarly, the spatal varablty of locatos aroud a fxed cetral locato may be summarzed. The stadard dstace (Bach 1963) s defed as the square root of the average squared dstace of pots to the mea ceter: s d Σ d = 1 = 2 c (2.15) where d c s the dstace from pot to the mea ceter ad s the umber of locatos aroud the cetral locato. Although Bach s measure of stadard dstace s coceptually appealg as a spatal verso of the stadard devato, t s ot really ecessary to mata the strct aalogy wth the stadard devato by takg the square root of the average squared dstace. Wth the aspatal verso (.e., the stadard devato), loosely speakg, the square root udoes the squarg ad thus the stadard devato may be roughly terpreted as a quatty that s o the same approxmate scale as the average absolute devato of observatos from the mea. Squarg ad takg square roots s carred out part because devatos from the mea may be ether postve or egatve. But, the spatal verso, dstaces are always postve, ad so a more terpretable ad atural defto of stadard dstace would be to smply use the average dstace of observatos from the mea ceter: d s Σ dc = =1 (2.16) I practce, the results foud usg Equatos 2.15 ad 2.16 wll be qute smlar. Equatos 2.15 ad 2.16 represet deftos of stadard dstace whe each pot or locato s gve equal weght the calculato. More commoly, each locato wll have a assocated weght, w. I ths case, the weghted stadard dstaces are calculated usg 2 Σ wd c = 1 sd = Σ w = 1 (2.17) s d Σ wd c = 1 = Σ w = 1 02_Rogerso_4e_BAB1405B0092_Ch-02.dd 43 10/3/ :40:45 AM

20 44 Statstcal Methods for Geography The uts for all of these deftos of stadard dstace are the uts of the dstace varable (typcally mles, klometers, etc.) Relatve Dstace Oe drawback to the stadard dstace measure descrbed above s that t s a measure of absolute dsperso; t retas the uts whch dstace s measured. Furthermore, t s affected by the sze of the study area. The two paels of Fgure 2.10 show stuatos where the stadard dstace s detcal, but clearly the amout of dsperso about the cetral locato, relatve to the study area, s lower pael (b). Relatve dstace s a measure of relatve dsperso ad t may be derved by dvdg the stadard dstace by the radus of a crcle wth area equal to the sze of the study area (McGrew ad Moroe 2000). Ths makes the measure of dsperso utless ad stadardzes for the sze of the study area, thereby facltatg comparso of dsperso study areas of dfferet szes. For a crcular study area, the relatve dstace s s d,rel = s d /r ad for a square study area, 2 sdrel, = sd π / s (where r ad s represet the radus of the crcle ad the sde of the square, respectvely). Note that the maxmum relatve dstace for a crcle s 1; ths occurs whe all pots are located o the crcumferece of the crcle. For a square, the maxmum relatve dstace s π / 2 = , ad ths occurs whe all pots are located at corers of the square Illustrato of Spatal Measures of Cetral Tedecy ad Dsperso The descrptve spatal statstcs outled above are ow llustrated usg the data Table 2.5. Ths s a smple set of te locatos; a smple ad small dataset has bee chose X X (a) (b) Fgure 2.10 Illustrato of stadard dstace. Note that dsperso relatve to the study area s lower pael (b) 02_Rogerso_4e_BAB1405B0092_Ch-02.dd 44 10/3/ :40:46 AM

21 Descrptve Statstcs 45 delberately, to facltate the dervato of the quattes by had, f desred. The study area s assumed to be a square wth the legth of each sde equal to oe. I ths example, we assume mplctly that there are equal weghts at each locato (or, equvaletly, that oe dvdual s at each locato). The mea ceter s at (0.4513, ), ad t s foud smply by takg the mea of each colum. The locato of the meda ceter s at (0.4611, ). Accuracy to three dgts s acheved after 33 teratos. The frst few teratos of Equato 2.14 are show Table 2.6. Table 2.5 x y coordate pars x y Table 2.6 Covergece of teratos toward the meda ceter Iterato x-coordate y-coordate _Rogerso_4e_BAB1405B0092_Ch-02.dd 45 10/3/ :40:46 AM

22 46 Statstcal Methods for Geography It s terestg to ote that the approach to the y-coordate of the meda ceter s mootoc, whle the approach to the x-coordate s a damped harmoc. The sum of squared dstaces to the mea ceter s ; ote that ths s lower tha the sum of squared dstaces to the meda ceter (0.9328). Smlarly, the sum of dstaces to the meda ceter s 2.655, ad ths s lower tha the sum of dstaces to the mea ceter (2.712). The stadard dstace s (whch s the square root of /10); ote that ths s smlar to the average dstace of a pot from the mea ceter (2.712/10 = ) Agular Data Agular data arse a umber of geographcal applcatos; the aalyss of wd drecto, ad the study of the algmet of crystals bedrock provde two examples. The latter example has bee partcularly mportat the study of cotetal drft, ad establshg the tmg of reversals earth s magetc feld. Specal cosderatos arse the vsual ad umercal descrpto of agular data. Cosder the 146 observatos o wd drecto gve Table 2.7. A hstogram Table 2.7 Hypothetcal data o wd drecto Drecto Agular drecto Frequecy North 0 10 Northeast 45 8 East 90 5 Southeast South Southwest West Northwest Total 146 could be costructed, but t s ot clear how the horzotal axs should be labeled. A hstogram could arbtrarly start wth North o the left, as Fgure 2.11a; aother possblty s to arrage for the mode to be ear the mddle of the hstogram, as Fgure 2.11b. Nether of the optos depcted Fgure 2.11 for costructg a hstogram usg agular data s deal, sce the observatos at the far left of the horzotal axs are smlar 02_Rogerso_4e_BAB1405B0092_Ch-02.dd 46 10/3/ :40:46 AM

23 Descrptve Statstcs 47 (a) Frequecy N NE E SE S SW W NW Drecto (b) Frequecy SE S SW W NW Drecto N NE E Fgure 2.11 Absolute frequeces for drectoal data drecto to the observatos o the far rght of the horzotal axs. I partcular, there s o provso for wrappg the hstogram aroud o tself. A alteratve s the crcular hstogram (Fgure 2.12a). Here bars exted outward all drectos, reflectg the ature of the data. As s the case wth more typcal hstograms, the legths of the bars are proportoal to the frequecy. A slght varato of ths s the more commo rose dagram (Fgure 2.12b); here the rectagular bars have bee replaced wth pe- or wedge-shapes. The cocetrc rgs show Fgure 2.12b are ofte ot dsplayed; they are show here to emphasze how the relatve frequeces are used to costruct the dagram. The rose dagram s effectve portrayg vsually the ature of agular data. There are also specal cosderatos that are ecessary whe cosderg umercal summares of agular data. Cosder the very smple case where we have two observatos oe observato s 1 ad the other s 359. If 0 s take to be orth, both of these 02_Rogerso_4e_BAB1405B0092_Ch-02.dd 47 10/3/ :40:46 AM

24 48 Statstcal Methods for Geography (a) N (b) N W E W E S S Fgure 2.12 Absolute frequeces for drectoal data: (a) crcular hstogram; (b) rose dagram observatos are very close to orth. However, f we take the smple average or mea of 1 ad 359, we get ( )/2 = 180 due south! Clearly, some other approach s eeded, sce the average of two observatos that are very close to orth should ot be south. Here we descrbe how to fd the mea ad varace for agular data. Mea: 1. Fd the se ad cose of each agular observato. 2. Fd the mea of the ses ( S ) ad the mea of the coses ( C ) Fd R = S + C 4. The mea agle (say α ) s the agle whose cose s equal to C / R ad whose se s equal to S / R. Thus, α = arccos (C / R) ad α = arcs ( S / R ). Varace: A measure of varace for agular data (termed the crcular varace) provdes a dcato of how much varablty there s the data. For example, f all observatos cossted of the same agle, the varablty, ad hece the crcular varace, should be zero. The crcular varace, desgated by S 0, s smply equal to 1 R. It vares from zero to oe. A hgh value ear oe dcates that the agular data are dspersed, ad come from may dfferet drectos. A value ear zero mples that the observatos are clustered aroud partcular drectos. Readers terested more detal regardg agular data may fd more extesve coverage Marda ad Jupp (1999). 02_Rogerso_4e_BAB1405B0092_Ch-02.dd 48 10/3/ :40:48 AM

25 Descrptve Statstcs 49 Example 2.1 Three observatos of wd drecto yeld measuremets of 43, 88, ad 279. Fd the agular mea ad the crcular varace. Soluto. We start by costructg the followg table: Observato Cose Se ( ) = The mea of the coses s equal to C = / The mea of the ses s equal to S = ( )/3 = The 2 2 R = = The mea agle, α, s the agle whose cose s equal to C / R = / = , ad whose se s equal to S / R = / = Usg ether a calculator or table, we fd that the mea agle s arccos(0.7994) = arcs(0.6008) = 37. The crcular varace s equal to 1 R = = Ths value s closer to oe tha to zero, dcatg a tedecy for hgh varablty that s, the agles are relatvely dspersed ad are comg from dfferet drectos. 2.7 DESCRIPTIVE STATISTICS IN SPSS 21 FOR WINDOWS Data Iput After startg SPSS, data are put for the varable or varables of terest. Each colum represets a varable. For the commutg example set out Table 2.1, the 30 observatos were etered to the frst colum of the spreadsheet. Alteratvely, respodet ID could have bee etered to the frst colum (.e., the sequece of tegers, from 1 to 30), ad the commutg tmes would the have bee etered the secod colum. The order that the data are etered to a colum s umportat Descrptve Aalyss Smple descrptve statstcs Oce the data are etered, clck o Aalyze (or Statstcs, older versos of SPSS for Wdows). The clck o Descrptve Statstcs. The clck o Explore. A splt 02_Rogerso_4e_BAB1405B0092_Ch-02.dd 49 10/3/ :40:50 AM

26 50 Statstcal Methods for Geography box wll appear o the scree; move the varable or varables of terest from the left box to the box o the rght that s headed Depedet Lst by hghlghtg the varable(s) ad clckg o the arrow. The clck o OK. Table 2.8 Descrptve summary of commutg data usg SPSS Case Processg Summary Cases Vald Mssg Total N Percet N Percet N Percet VAR % 0.0% % Descrptves Statstc Std. Error VAR00001 Mea % Cofdece Lower Boud Iterval for Mea Upper Boud % Trmmed Mea Meda Varace Std. Devato Mmum 5.00 Maxmum Rage Iterquartle Rage Skewess Kurtoss Other optos Optos for producg other related statstcs ad graphs are avalable. To produce a hstogram, for stace, before clckg OK above, clck o Plots, ad you ca the check a box to produce a hstogram. The clck o Cotue ad OK Results Table 2.8 dsplays results of the output. I addto to ths table, box plots (Fgure 2.2), stem ad leaf dsplays (Fgure 2.3) ad, optoally, hstograms (Fgure 2.1) are also produced. 02_Rogerso_4e_BAB1405B0092_Ch-02.dd 50 10/3/ :40:50 AM

27 Descrptve Statstcs 51 SOLVED EXERCISES 1. A ew park s plaed for a commuty, ad plaers wsh to cosder the mea ceter of four resdetal areas as a possble locato. Usg the coordates ad resdetal populatos lsted below, fd the weghted mea ceter. x-coordate y-coordate resdetal populato Soluto. We wll frst fd the x-coordate of the weghted mea ceter. Coceptually, ths s the average x-coordate amog all resdets. There are 200 people wth a x-coordate of 2, 100 wth a x-coordate of 4, ad so o. If we totaled the x-coordates amog all resdets, the result would be 200(2) + 100(4) + 50(4) (8) = = There are = 550 resdets, ad so the average x-coordate amog all resdets s 2600/550 = Smlarly, the y-coordate of the mea ceter s 200(1) + 100(4) + 50(8) + 200(2) 550 Thus, the weghted mea ceter s at (4.73, 2.55) = = Usg the followg data, fd the mea, meda, ad rage: Data values: 29, 35, 17, 30, 231, 6, 27, 35, 23, 29, 13 Soluto. The mea (or average) s equal to the sum of the observatos ( = 475), dvded by the umber of observatos ( = 11); thus the mea s 475/11 = To fd the meda, frst order the observatos: 6, 13, 17, 23, 27, 29, 29, 30, 35, 35, 231. The meda s observato umber ( + 1)/2 o ths ordered lst. Sce = 11, ( + 1)/2 = 6; the 6th observato o the lst (29) s the meda. Fally, the rage s equal to the largest observato (231), mus the smallest (6), ad s therefore equal to = Fd the skewess ad kurtoss for the followg data: 4, 7, 8, 13. (Cotued) 02_Rogerso_4e_BAB1405B0092_Ch-02.dd 51 10/3/ :40:50 AM

28 52 Statstcal Methods for Geography (Cotued) Soluto. For the skewess, we use Equato 2.7. The equato mples that we wll eed the mea ad the stadard devato. The mea s equal to ( )/4 = 32/4 = 8. The stadard devato s the square root of the varace. The varace s foud by fdg the sum of the squared devatos from the mea, ad the dvdg the result by 1, where s the umber of observatos. The squared devatos from the mea are (4 8) 2, (7 8) 2, (8 8) 2, ad (13 8) 2 ; summg these yelds ( 4) 2 + ( 1) = = 42. The varace s thus equal to 42/(4 1) = 14, ad the stadard devato s s = 14 = The umerator of the skewess s equal to the sum of the cubed devatos from the mea. The cubed devatos from the mea are (4 8) 3, (7 8) 3, (8 8) 3, ad (13 8) 3, ad the sum of these quattes s ( 4) 3 + ( 1) = ( 64) + ( 1) = 60. The deomator s equal to the umber of observatos, multpled by the cube of the stadard devato: 4( ) = The skewess s equal to 60/ = The data exhbt a small amout of postve skewess. We use Equato 2.8 to fd the kurtoss. The devatos from the mea are frst rased to the fourth power: (4 8) 4, (7 8) 4, (8 8) 4, ad (13 8) 4 ; these are summed to fd the umerator of the kurtoss, ad ths result s ( 4) 4 + ( 1) = = 882. The deomator of the kurtoss s equal to the umber of observatos, multpled by the stadard devato rased to the fourth power (ad ote that ths latter quatty s equal to the square of the varace). The deomator s therefore 4( ) = 4(14 2 ) = 784. The kurtoss s equal to 882/784 = Note that sce ths s less tha 3, the dstrbuto ca be descrbed as flat, or platykurtc. 4. Fd the grouped mea for the followg data: Category Frequecy Soluto. To fd the grouped mea, we assume that all observatos are at the mdpot of the category they are. There are fve observatos that are less tha 20. All we kow s that they are betwee 0 ad 20; we do ot kow ther exact values. We assume that they are all mdway betwee 0 ad 20 ad therefore assg them values of 10. Smlarly, the 15 observatos the ext category are all assged a value of 30, whch s the mdpot of the category they are ( ). The mea of all of these observatos, oce they are assged the mdpot of ther categores, s foud by addg all of ther assged values, ad the dvdg the result by the total umber of observatos. 02_Rogerso_4e_BAB1405B0092_Ch-02.dd 52 10/3/ :40:50 AM

29 Descrptve Statstcs 53 Thus, we have 5 10 = 50 (whch s the total of the fve observatos the frst category), plus = 450 (whch s the total of the 15 observatos the secod category), plus 10 50, plus The soluto s therefore equal to ths total of = 1840, dvded by the total umber of observatos ( = 42); ths s equal to 1840/42 = EXERCISES 1. The 236 values that appear below are the 1990 meda household comes ( dollars) for the 236 cesus tracts of Buffalo, New York. (a) For the frst 19 tracts, fd the mea, meda, rage, terquartle rage, stadard devato, varace, skewess, ad kurtoss, usg oly a calculator (though you may wat to check your results usg a statstcal software package). I addto, costruct a stem-ad-leaf plot, a box plot, ad a hstogram for these 19 observatos. (b) Use a statstcal software package to repeat part (a), ths tme usg all 236 observatos. (c) Commet o your results. I partcular, what does t mea to fd the mea of a set of medas? How do the observatos that have a value of 0 affect the results? Should they be cluded? How mght the results dffer f a dfferet geographc scale was chose? 22342, 19919, 8187, 15875, 17994, 30765, 31347, 27282, 29310, 23720, 22033, 11706, 15625, 6173, 15694, 7924, 10433, 13274, 17803, 20583, 21897, 14531, 19048, 19850, 19734, 18205, 13984, 8738, 10299, 10678, 8685, 13455, 14821, 23722, 8740, 12325, 10717, 21447, 11250, 16016, 11509, 11395, 19721, 23231, 21293, 24375, 19510, 14926, 22490, 21383, 25060, 22664, 8671, 31566, 26931, 0, 24965, 34656, 24493, 21764, 25843, 32708, 22188, 19909, 33675, 15608, 15857, 18649, 21880, 17250, 16569, 14991, 0, 8643, 22801, 39708, 17096, 20647, 30712, 19304, 24116, 17500, 19106, 17517, 12525, 13936, 7495, 10232, 6891, 16888, 42274, 43033, 43500, 22257, 22931, 31918, 29072, 31948, 36229, 33860, 32586, 32606, 31453, 32939, 30072, 32185, 35664, 27578, 23861, 18374, 26563, 30726, 33614, 30373, 28347, 37786, 48987, 56318, 49641, 85742, 43229, 53116, 44335, 30184, 36744, 39698, 0, 21987, 66358, 46587, 26934, 27292, 31558, 36944, 43750, 49408, 37354, 31010, 35709, 32913, 25594, 25612, 28980, 28800, 28634, 18958, 26515, 24779, 21667, 24660, 29375, 29063, 30996, 45645, 39312, 34287, 35533, 27647, 24342, 22402, 28967, 39083, 28649, 23881, 31071, (Cotued) 02_Rogerso_4e_BAB1405B0092_Ch-02.dd 53 10/3/ :40:50 AM

30 54 Statstcal Methods for Geography (Cotued) 27412, 27943, 34500, 19792, 41447, 35833, 41957, 14333, 12778, 20000, 19656, 22302, 33475, 26580, 0, 24588, 31496, 30179, 33694, 36193, 41921, 35819, 39304, 38844, 37443, 47873, 41410, 34186, 36798, 38508, 38382, 37029, 48472, 38837, 40548, 35165, 39404, 34281, 24615, 34904, 21964, 42617, 58682, 41875, 40370, 24511, 31008, 16250, 29600, 38205, 35536, 35386, 36250, 31341, 33790, 31987, 42113, 37500, 33841, 37877, 35650, 28556, 27048, 27736, 30269, 32699, 28988, 22083, 27446, 76306, Te mgrato dstaces correspodg to the dstaces moved by recet mgrats are observed ( mles): 43, 6, 7, 11, 122, 41, 21, 17, 1, 3. Fd the mea ad stadard devato, ad the covert all observatos to z-scores. 3. By had, fd the mea, meda, ad stadard devato of the umber of bedrooms, for the frst te observatos of the Mlwaukee dataset. 4. Usg SPSS or Excel, aswer the followg questos usg the full Mlwaukee house sale dataset: (a) Make hstograms of () house sale values for those observatos that have more tha two bedrooms, ad () those observatos that have fewer tha three bedrooms. Commet o the dffereces. (b) Fd the mea, meda, ad stadard devato for sales prce, umber of bedrooms, age of house, ad lot sze. 5. Usg SPSS or Excel, aswer the followg questos usg the full Hypothetcal UK Housg Prces dataset: (a) Provde descrptve formato o house prces, umber of bedrooms, umber of bathrooms, floor area, ad date bult. For each, provde the mea, meda, stadard devato, ad skewess. Also for each, provde a box plot. (b) What percetage of homes have garages? (c) What percetage of homes were bult durg each of the followg tme perods: pre-war, ter-war, ad post-war? 6. For the frst te observatos of the Mlwaukee dataset: (a) fd the mea ceter; (b) fd Bach s stadard dstace. Exercses 7 11 are questos pertag to otato; see Appedx B for a revew. 7. Gve a = 3, b = 4, ad 02_Rogerso_4e_BAB1405B0092_Ch-02.dd 54 10/3/ :40:51 AM

31 Descrptve Statstcs 55 Observato x y Fd the followg: (a) Σ y (b) Σ y 2 (c) Σax (d) Πx (e) Σ 3 =2 y + by (f) 3x 2 + y 4 (g) 32!/( 30!) xkyk (h) Σ k 8. Let a = 5, x 1 = 6, x 2 = 7, x 3 = 8, x 4 = 10, x 5 = 11, y 1 = 3, y 2 = 5, y 3 = 6, y 4 = 14, ad y 5 = 12. Fd the followg: (a) Σx (b) Σx y (c) Σ x + ay 3 (d) Σ y 2 =1 (e) ( ) = = 1 Σ a (f) Σ k 2( y k 3) = 5 (g) Σ x x = 1 ( ) 9. Fd 8!/3! (Cotued) 02_Rogerso_4e_BAB1405B0092_Ch-02.dd 55 10/3/ :40:55 AM

32 56 Statstcal Methods for Geography (Cotued) 10. Fd Use the followg table of commutg flows to determe the total umber of commuters leavg each zoe ad the total umber eterg each zoe. Also fd the total umber of commuters. For each aswer, also gve the correct otato, assumg y j deotes the umber of commuters who leave org to go to destato zoe j. Destato zoe Org zoe The followg data represet stream lk legths a rver etwork: 100, 426, 322, 466, 112, 155, 388, 1155, 234, 324, 556, 221, 18, 133, 177, 441. Fd the mea ad stadard devato of the lk legths. 13. For the followg aual rafall data, fd the grouped mea ad the grouped varace. Rafall Number of years observed < Notce that assumptos must be made about the mdpots of the ope-eded age groups. I ths example, use 15 ad 65 as the mdpots of the frst ad last rafall groups, respectvely. 14. A square grd s placed over a cty map. What s the Eucldea dstace betwee two places located at (1,3) ad (3,6)? 15. I the example above, what s the Mahatta dstace? 02_Rogerso_4e_BAB1405B0092_Ch-02.dd 56 10/3/ :40:55 AM

33 Descrptve Statstcs Use a rose dagram to portray the followg agular data: Drecto Frequecy Drecto Frequecy N 43 S 60 NE 12 SW 70 E 23 W 75 SE 45 NW Draw frequecy dstrbutos that have (a) postve skewess, (b) egatve skewess, (c) low kurtoss ad o skewess, ad (d) hgh kurtoss ad o skewess. 18. What s the coeffcet of varato amog the followg commutg tmes: 23, 43, 42, 7, 23, 11, 23, 55? 19. A square grd s placed over a cty map. There are resdetal areas at the coordates (0,1), (2,3) ad (5,6). The respectve populatos of the three areas are 2500, 2000, ad A cetralzed faclty s beg cosdered for ether the pot (4,4) or the pot (4,5). Whch of the two pots s the better locato for a cetralzed faclty, gve that we wsh to mmze the total Mahatta dstace traveled by the populato to the faclty? Justfy your aswer by gvg the total Mahatta dstace traveled by the populato to each of the two possble locatos. 20. Is the followg data o comes postvely or egatvely skewed? You do ot eed to calculate skewess, but you should justfy your aswer. Data thousads: 45, 43, 32, 23, 45, 43, 47, 39, 21, 90, (a) Fd the weghted mea ceter of populato, where ctes populatos ad coordates are gve as follows: Cty x y Populato A ,000 B ,500 C ,000 D ,000 E ,500 (b) Fd the uweghted mea ceter, ad commet o the dffereces betwee your two aswers. (c) Fd the Eucldea dstaces of each cty to the weghted mea ceter of populato. (Cotued) 02_Rogerso_4e_BAB1405B0092_Ch-02.dd 57 10/3/ :40:55 AM

is the score of the 1 st student, x

is the score of the 1 st student, x 8 Chapter Collectg, Dsplayg, ad Aalyzg your Data. Descrptve Statstcs Sectos explaed how to choose a sample, how to collect ad orgaze data from the sample, ad how to dsplay your data. I ths secto, you wll

More information

MEASURES OF DISPERSION

MEASURES OF DISPERSION MEASURES OF DISPERSION Measure of Cetral Tedecy: Measures of Cetral Tedecy ad Dsperso ) Mathematcal Average: a) Arthmetc mea (A.M.) b) Geometrc mea (G.M.) c) Harmoc mea (H.M.) ) Averages of Posto: a) Meda

More information

Descriptive Statistics

Descriptive Statistics Page Techcal Math II Descrptve Statstcs Descrptve Statstcs Descrptve statstcs s the body of methods used to represet ad summarze sets of data. A descrpto of how a set of measuremets (for eample, people

More information

2.28 The Wall Street Journal is probably referring to the average number of cubes used per glass measured for some population that they have chosen.

2.28 The Wall Street Journal is probably referring to the average number of cubes used per glass measured for some population that they have chosen. .5 x 54.5 a. x 7. 786 7 b. The raked observatos are: 7.4, 7.5, 7.7, 7.8, 7.9, 8.0, 8.. Sce the sample sze 7 s odd, the meda s the (+)/ 4 th raked observato, or meda 7.8 c. The cosumer would more lkely

More information

f f... f 1 n n (ii) Median : It is the value of the middle-most observation(s).

f f... f 1 n n (ii) Median : It is the value of the middle-most observation(s). CHAPTER STATISTICS Pots to Remember :. Facts or fgures, collected wth a defte pupose, are called Data.. Statstcs s the area of study dealg wth the collecto, presetato, aalyss ad terpretato of data.. The

More information

hp calculators HP 30S Statistics Averages and Standard Deviations Average and Standard Deviation Practice Finding Averages and Standard Deviations

hp calculators HP 30S Statistics Averages and Standard Deviations Average and Standard Deviation Practice Finding Averages and Standard Deviations HP 30S Statstcs Averages ad Stadard Devatos Average ad Stadard Devato Practce Fdg Averages ad Stadard Devatos HP 30S Statstcs Averages ad Stadard Devatos Average ad stadard devato The HP 30S provdes several

More information

Mean is only appropriate for interval or ratio scales, not ordinal or nominal.

Mean is only appropriate for interval or ratio scales, not ordinal or nominal. Mea Same as ordary average Sum all the data values ad dvde by the sample sze. x = ( x + x +... + x Usg summato otato, we wrte ths as x = x = x = = ) x Mea s oly approprate for terval or rato scales, ot

More information

Econometric Methods. Review of Estimation

Econometric Methods. Review of Estimation Ecoometrc Methods Revew of Estmato Estmatg the populato mea Radom samplg Pot ad terval estmators Lear estmators Ubased estmators Lear Ubased Estmators (LUEs) Effcecy (mmum varace) ad Best Lear Ubased Estmators

More information

Outline. Point Pattern Analysis Part I. Revisit IRP/CSR

Outline. Point Pattern Analysis Part I. Revisit IRP/CSR Pot Patter Aalyss Part I Outle Revst IRP/CSR, frst- ad secod order effects What s pot patter aalyss (PPA)? Desty-based pot patter measures Dstace-based pot patter measures Revst IRP/CSR Equal probablty:

More information

Lecture Notes Types of economic variables

Lecture Notes Types of economic variables Lecture Notes 3 1. Types of ecoomc varables () Cotuous varable takes o a cotuum the sample space, such as all pots o a le or all real umbers Example: GDP, Polluto cocetrato, etc. () Dscrete varables fte

More information

Summary of the lecture in Biostatistics

Summary of the lecture in Biostatistics Summary of the lecture Bostatstcs Probablty Desty Fucto For a cotuos radom varable, a probablty desty fucto s a fucto such that: 0 dx a b) b a dx A probablty desty fucto provdes a smple descrpto of the

More information

= 1. UCLA STAT 13 Introduction to Statistical Methods for the Life and Health Sciences. Parameters and Statistics. Measures of Centrality

= 1. UCLA STAT 13 Introduction to Statistical Methods for the Life and Health Sciences. Parameters and Statistics. Measures of Centrality UCLA STAT Itroducto to Statstcal Methods for the Lfe ad Health Sceces Istructor: Ivo Dov, Asst. Prof. of Statstcs ad Neurology Teachg Assstats: Fred Phoa, Krste Johso, Mg Zheg & Matlda Hseh Uversty of

More information

Simple Linear Regression

Simple Linear Regression Statstcal Methods I (EST 75) Page 139 Smple Lear Regresso Smple regresso applcatos are used to ft a model descrbg a lear relatoshp betwee two varables. The aspects of least squares regresso ad correlato

More information

Section l h l Stem=Tens. 8l Leaf=Ones. 8h l 03. 9h 58

Section l h l Stem=Tens. 8l Leaf=Ones. 8h l 03. 9h 58 Secto.. 6l 34 6h 667899 7l 44 7h Stem=Tes 8l 344 Leaf=Oes 8h 5557899 9l 3 9h 58 Ths dsplay brgs out the gap the data: There are o scores the hgh 7's. 6. a. beams cylders 9 5 8 88533 6 6 98877643 7 488

More information

Measures of Dispersion

Measures of Dispersion Chapter 8 Measures of Dsperso Defto of Measures of Dsperso (page 31) A measure of dsperso s a descrptve summary measure that helps us characterze the data set terms of how vared the observatos are from

More information

Chapter 8. Inferences about More Than Two Population Central Values

Chapter 8. Inferences about More Than Two Population Central Values Chapter 8. Ifereces about More Tha Two Populato Cetral Values Case tudy: Effect of Tmg of the Treatmet of Port-We tas wth Lasers ) To vestgate whether treatmet at a youg age would yeld better results tha

More information

Lecture 1 Review of Fundamental Statistical Concepts

Lecture 1 Review of Fundamental Statistical Concepts Lecture Revew of Fudametal Statstcal Cocepts Measures of Cetral Tedecy ad Dsperso A word about otato for ths class: Idvduals a populato are desgated, where the dex rages from to N, ad N s the total umber

More information

Chapter 5 Properties of a Random Sample

Chapter 5 Properties of a Random Sample Lecture 6 o BST 63: Statstcal Theory I Ku Zhag, /0/008 Revew for the prevous lecture Cocepts: t-dstrbuto, F-dstrbuto Theorems: Dstrbutos of sample mea ad sample varace, relatoshp betwee sample mea ad sample

More information

CHAPTER VI Statistical Analysis of Experimental Data

CHAPTER VI Statistical Analysis of Experimental Data Chapter VI Statstcal Aalyss of Expermetal Data CHAPTER VI Statstcal Aalyss of Expermetal Data Measuremets do ot lead to a uque value. Ths s a result of the multtude of errors (maly radom errors) that ca

More information

Third handout: On the Gini Index

Third handout: On the Gini Index Thrd hadout: O the dex Corrado, a tala statstca, proposed (, 9, 96) to measure absolute equalt va the mea dfferece whch s defed as ( / ) where refers to the total umber of dvduals socet. Assume that. The

More information

Lesson 3. Group and individual indexes. Design and Data Analysis in Psychology I English group (A) School of Psychology Dpt. Experimental Psychology

Lesson 3. Group and individual indexes. Design and Data Analysis in Psychology I English group (A) School of Psychology Dpt. Experimental Psychology 17/03/015 School of Psychology Dpt. Expermetal Psychology Desg ad Data Aalyss Psychology I Eglsh group (A) Salvador Chacó Moscoso Susaa Saduvete Chaves Mlagrosa Sáchez Martí Lesso 3 Group ad dvdual dexes

More information

Lecture 7. Confidence Intervals and Hypothesis Tests in the Simple CLR Model

Lecture 7. Confidence Intervals and Hypothesis Tests in the Simple CLR Model Lecture 7. Cofdece Itervals ad Hypothess Tests the Smple CLR Model I lecture 6 we troduced the Classcal Lear Regresso (CLR) model that s the radom expermet of whch the data Y,,, K, are the outcomes. The

More information

Continuous Distributions

Continuous Distributions 7//3 Cotuous Dstrbutos Radom Varables of the Cotuous Type Desty Curve Percet Desty fucto, f (x) A smooth curve that ft the dstrbuto 3 4 5 6 7 8 9 Test scores Desty Curve Percet Probablty Desty Fucto, f

More information

Summary tables and charts

Summary tables and charts Data Aalyss Summary tables ad charts. Orgazg umercal data: Hstograms ad frequecy tables I ths lecture, we wll study descrptve statstcs. By descrptve statstcs, we refer to methods volvg the collecto, presetato,

More information

ENGI 3423 Simple Linear Regression Page 12-01

ENGI 3423 Simple Linear Regression Page 12-01 ENGI 343 mple Lear Regresso Page - mple Lear Regresso ometmes a expermet s set up where the expermeter has cotrol over the values of oe or more varables X ad measures the resultg values of aother varable

More information

Simple Linear Regression

Simple Linear Regression Correlato ad Smple Lear Regresso Berl Che Departmet of Computer Scece & Iformato Egeerg Natoal Tawa Normal Uversty Referece:. W. Navd. Statstcs for Egeerg ad Scetsts. Chapter 7 (7.-7.3) & Teachg Materal

More information

Lecture 3. Sampling, sampling distributions, and parameter estimation

Lecture 3. Sampling, sampling distributions, and parameter estimation Lecture 3 Samplg, samplg dstrbutos, ad parameter estmato Samplg Defto Populato s defed as the collecto of all the possble observatos of terest. The collecto of observatos we take from the populato s called

More information

Point Estimation: definition of estimators

Point Estimation: definition of estimators Pot Estmato: defto of estmators Pot estmator: ay fucto W (X,..., X ) of a data sample. The exercse of pot estmato s to use partcular fuctos of the data order to estmate certa ukow populato parameters.

More information

CHAPTER 4 RADICAL EXPRESSIONS

CHAPTER 4 RADICAL EXPRESSIONS 6 CHAPTER RADICAL EXPRESSIONS. The th Root of a Real Number A real umber a s called the th root of a real umber b f Thus, for example: s a square root of sce. s also a square root of sce ( ). s a cube

More information

Statistics MINITAB - Lab 5

Statistics MINITAB - Lab 5 Statstcs 10010 MINITAB - Lab 5 PART I: The Correlato Coeffcet Qute ofte statstcs we are preseted wth data that suggests that a lear relatoshp exsts betwee two varables. For example the plot below s of

More information

Statistics Descriptive and Inferential Statistics. Instructor: Daisuke Nagakura

Statistics Descriptive and Inferential Statistics. Instructor: Daisuke Nagakura Statstcs Descrptve ad Iferetal Statstcs Istructor: Dasuke Nagakura (agakura@z7.keo.jp) 1 Today s topc Today, I talk about two categores of statstcal aalyses, descrptve statstcs ad feretal statstcs, ad

More information

Arithmetic Mean Suppose there is only a finite number N of items in the system of interest. Then the population arithmetic mean is

Arithmetic Mean Suppose there is only a finite number N of items in the system of interest. Then the population arithmetic mean is Topc : Probablty Theory Module : Descrptve Statstcs Measures of Locato Descrptve statstcs are measures of locato ad shape that perta to probablty dstrbutos The prmary measures of locato are the arthmetc

More information

Class 13,14 June 17, 19, 2015

Class 13,14 June 17, 19, 2015 Class 3,4 Jue 7, 9, 05 Pla for Class3,4:. Samplg dstrbuto of sample mea. The Cetral Lmt Theorem (CLT). Cofdece terval for ukow mea.. Samplg Dstrbuto for Sample mea. Methods used are based o CLT ( Cetral

More information

Chapter 3 Sampling For Proportions and Percentages

Chapter 3 Sampling For Proportions and Percentages Chapter 3 Samplg For Proportos ad Percetages I may stuatos, the characterstc uder study o whch the observatos are collected are qualtatve ature For example, the resposes of customers may marketg surveys

More information

n -dimensional vectors follow naturally from the one

n -dimensional vectors follow naturally from the one B. Vectors ad sets B. Vectors Ecoomsts study ecoomc pheomea by buldg hghly stylzed models. Uderstadg ad makg use of almost all such models requres a hgh comfort level wth some key mathematcal sklls. I

More information

Module 7: Probability and Statistics

Module 7: Probability and Statistics Lecture 4: Goodess of ft tests. Itroducto Module 7: Probablty ad Statstcs I the prevous two lectures, the cocepts, steps ad applcatos of Hypotheses testg were dscussed. Hypotheses testg may be used to

More information

STA 105-M BASIC STATISTICS (This is a multiple choice paper.)

STA 105-M BASIC STATISTICS (This is a multiple choice paper.) DCDM BUSINESS SCHOOL September Mock Eamatos STA 0-M BASIC STATISTICS (Ths s a multple choce paper.) Tme: hours 0 mutes INSTRUCTIONS TO CANDIDATES Do ot ope ths questo paper utl you have bee told to do

More information

Mu Sequences/Series Solutions National Convention 2014

Mu Sequences/Series Solutions National Convention 2014 Mu Sequeces/Seres Solutos Natoal Coveto 04 C 6 E A 6C A 6 B B 7 A D 7 D C 7 A B 8 A B 8 A C 8 E 4 B 9 B 4 E 9 B 4 C 9 E C 0 A A 0 D B 0 C C Usg basc propertes of arthmetc sequeces, we fd a ad bm m We eed

More information

Multiple Regression. More than 2 variables! Grade on Final. Multiple Regression 11/21/2012. Exam 2 Grades. Exam 2 Re-grades

Multiple Regression. More than 2 variables! Grade on Final. Multiple Regression 11/21/2012. Exam 2 Grades. Exam 2 Re-grades STAT 101 Dr. Kar Lock Morga 11/20/12 Exam 2 Grades Multple Regresso SECTIONS 9.2, 10.1, 10.2 Multple explaatory varables (10.1) Parttog varablty R 2, ANOVA (9.2) Codtos resdual plot (10.2) Trasformatos

More information

Functions of Random Variables

Functions of Random Variables Fuctos of Radom Varables Chapter Fve Fuctos of Radom Varables 5. Itroducto A geeral egeerg aalyss model s show Fg. 5.. The model output (respose) cotas the performaces of a system or product, such as weght,

More information

best estimate (mean) for X uncertainty or error in the measurement (systematic, random or statistical) best

best estimate (mean) for X uncertainty or error in the measurement (systematic, random or statistical) best Error Aalyss Preamble Wheever a measuremet s made, the result followg from that measuremet s always subject to ucertaty The ucertaty ca be reduced by makg several measuremets of the same quatty or by mprovg

More information

{ }{ ( )} (, ) = ( ) ( ) ( ) Chapter 14 Exercises in Sampling Theory. Exercise 1 (Simple random sampling): Solution:

{ }{ ( )} (, ) = ( ) ( ) ( ) Chapter 14 Exercises in Sampling Theory. Exercise 1 (Simple random sampling): Solution: Chapter 4 Exercses Samplg Theory Exercse (Smple radom samplg: Let there be two correlated radom varables X ad A sample of sze s draw from a populato by smple radom samplg wthout replacemet The observed

More information

Ordinary Least Squares Regression. Simple Regression. Algebra and Assumptions.

Ordinary Least Squares Regression. Simple Regression. Algebra and Assumptions. Ordary Least Squares egresso. Smple egresso. Algebra ad Assumptos. I ths part of the course we are gog to study a techque for aalysg the lear relatoshp betwee two varables Y ad X. We have pars of observatos

More information

Lecture 07: Poles and Zeros

Lecture 07: Poles and Zeros Lecture 07: Poles ad Zeros Defto of poles ad zeros The trasfer fucto provdes a bass for determg mportat system respose characterstcs wthout solvg the complete dfferetal equato. As defed, the trasfer fucto

More information

STATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS. x, where. = y - ˆ " 1

STATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS. x, where. = y - ˆ  1 STATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS Recall Assumpto E(Y x) η 0 + η x (lear codtoal mea fucto) Data (x, y ), (x 2, y 2 ),, (x, y ) Least squares estmator ˆ E (Y x) ˆ " 0 + ˆ " x, where ˆ

More information

SPECIAL CONSIDERATIONS FOR VOLUMETRIC Z-TEST FOR PROPORTIONS

SPECIAL CONSIDERATIONS FOR VOLUMETRIC Z-TEST FOR PROPORTIONS SPECIAL CONSIDERAIONS FOR VOLUMERIC Z-ES FOR PROPORIONS Oe s stctve reacto to the questo of whether two percetages are sgfcatly dfferet from each other s to treat them as f they were proportos whch the

More information

UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS

UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS Postpoed exam: ECON430 Statstcs Date of exam: Jauary 0, 0 Tme for exam: 09:00 a.m. :00 oo The problem set covers 5 pages Resources allowed: All wrtte ad prted

More information

C-1: Aerodynamics of Airfoils 1 C-2: Aerodynamics of Airfoils 2 C-3: Panel Methods C-4: Thin Airfoil Theory

C-1: Aerodynamics of Airfoils 1 C-2: Aerodynamics of Airfoils 2 C-3: Panel Methods C-4: Thin Airfoil Theory ROAD MAP... AE301 Aerodyamcs I UNIT C: 2-D Arfols C-1: Aerodyamcs of Arfols 1 C-2: Aerodyamcs of Arfols 2 C-3: Pael Methods C-4: Th Arfol Theory AE301 Aerodyamcs I Ut C-3: Lst of Subects Problem Solutos?

More information

Part 4b Asymptotic Results for MRR2 using PRESS. Recall that the PRESS statistic is a special type of cross validation procedure (see Allen (1971))

Part 4b Asymptotic Results for MRR2 using PRESS. Recall that the PRESS statistic is a special type of cross validation procedure (see Allen (1971)) art 4b Asymptotc Results for MRR usg RESS Recall that the RESS statstc s a specal type of cross valdato procedure (see Alle (97)) partcular to the regresso problem ad volves fdg Y $,, the estmate at the

More information

ENGI 4421 Propagation of Error Page 8-01

ENGI 4421 Propagation of Error Page 8-01 ENGI 441 Propagato of Error Page 8-01 Propagato of Error [Navd Chapter 3; ot Devore] Ay realstc measuremet procedure cotas error. Ay calculatos based o that measuremet wll therefore also cota a error.

More information

Handout #1. Title: Foundations of Econometrics. POPULATION vs. SAMPLE

Handout #1. Title: Foundations of Econometrics. POPULATION vs. SAMPLE Hadout #1 Ttle: Foudatos of Ecoometrcs Course: Eco 367 Fall/015 Istructor: Dr. I-Mg Chu POPULATION vs. SAMPLE From the Bureau of Labor web ste (http://www.bls.gov), we ca fd the uemploymet rate for each

More information

b. There appears to be a positive relationship between X and Y; that is, as X increases, so does Y.

b. There appears to be a positive relationship between X and Y; that is, as X increases, so does Y. .46. a. The frst varable (X) s the frst umber the par ad s plotted o the horzotal axs, whle the secod varable (Y) s the secod umber the par ad s plotted o the vertcal axs. The scatterplot s show the fgure

More information

The equation is sometimes presented in form Y = a + b x. This is reasonable, but it s not the notation we use.

The equation is sometimes presented in form Y = a + b x. This is reasonable, but it s not the notation we use. INTRODUCTORY NOTE ON LINEAR REGREION We have data of the form (x y ) (x y ) (x y ) These wll most ofte be preseted to us as two colum of a spreadsheet As the topc develops we wll see both upper case ad

More information

STA302/1001-Fall 2008 Midterm Test October 21, 2008

STA302/1001-Fall 2008 Midterm Test October 21, 2008 STA3/-Fall 8 Mdterm Test October, 8 Last Name: Frst Name: Studet Number: Erolled (Crcle oe) STA3 STA INSTRUCTIONS Tme allowed: hour 45 mutes Ads allowed: A o-programmable calculator A table of values from

More information

ESS Line Fitting

ESS Line Fitting ESS 5 014 17. Le Fttg A very commo problem data aalyss s lookg for relatoshpetwee dfferet parameters ad fttg les or surfaces to data. The smplest example s fttg a straght le ad we wll dscuss that here

More information

Objectives of Multiple Regression

Objectives of Multiple Regression Obectves of Multple Regresso Establsh the lear equato that best predcts values of a depedet varable Y usg more tha oe eplaator varable from a large set of potetal predctors {,,... k }. Fd that subset of

More information

Correlation and Simple Linear Regression

Correlation and Simple Linear Regression Correlato ad Smple Lear Regresso Berl Che Departmet of Computer Scece & Iformato Egeerg Natoal Tawa Normal Uverst Referece:. W. Navd. Statstcs for Egeerg ad Scetsts. Chapter 7 (7.-7.3) & Teachg Materal

More information

Laboratory I.10 It All Adds Up

Laboratory I.10 It All Adds Up Laboratory I. It All Adds Up Goals The studet wll work wth Rema sums ad evaluate them usg Derve. The studet wll see applcatos of tegrals as accumulatos of chages. The studet wll revew curve fttg sklls.

More information

The number of observed cases The number of parameters. ith case of the dichotomous dependent variable. the ith case of the jth parameter

The number of observed cases The number of parameters. ith case of the dichotomous dependent variable. the ith case of the jth parameter LOGISTIC REGRESSION Notato Model Logstc regresso regresses a dchotomous depedet varable o a set of depedet varables. Several methods are mplemeted for selectg the depedet varables. The followg otato s

More information

Measures of Central Tendency

Measures of Central Tendency Chapter 6 Measures of Cetral Tedecy Defto of a Summary Measure (page 185) A summary measure s a sgle value that we compute from a collecto of measuremets order to descrbe oe of the collecto s partcular

More information

CLASS NOTES. for. PBAF 528: Quantitative Methods II SPRING Instructor: Jean Swanson. Daniel J. Evans School of Public Affairs

CLASS NOTES. for. PBAF 528: Quantitative Methods II SPRING Instructor: Jean Swanson. Daniel J. Evans School of Public Affairs CLASS NOTES for PBAF 58: Quattatve Methods II SPRING 005 Istructor: Jea Swaso Dael J. Evas School of Publc Affars Uversty of Washgto Ackowledgemet: The structor wshes to thak Rachel Klet, Assstat Professor,

More information

A Primer on Summation Notation George H Olson, Ph. D. Doctoral Program in Educational Leadership Appalachian State University Spring 2010

A Primer on Summation Notation George H Olson, Ph. D. Doctoral Program in Educational Leadership Appalachian State University Spring 2010 Summato Operator A Prmer o Summato otato George H Olso Ph D Doctoral Program Educatoal Leadershp Appalacha State Uversty Sprg 00 The summato operator ( ) {Greek letter captal sgma} s a structo to sum over

More information

THE ROYAL STATISTICAL SOCIETY GRADUATE DIPLOMA

THE ROYAL STATISTICAL SOCIETY GRADUATE DIPLOMA THE ROYAL STATISTICAL SOCIETY EXAMINATIONS SOLUTIONS GRADUATE DIPLOMA PAPER II STATISTICAL THEORY & METHODS The Socety provdes these solutos to assst caddates preparg for the examatos future years ad for

More information

For combinatorial problems we might need to generate all permutations, combinations, or subsets of a set.

For combinatorial problems we might need to generate all permutations, combinations, or subsets of a set. Addtoal Decrease ad Coquer Algorthms For combatoral problems we mght eed to geerate all permutatos, combatos, or subsets of a set. Geeratg Permutatos If we have a set f elemets: { a 1, a 2, a 3, a } the

More information

Example: Multiple linear regression. Least squares regression. Repetition: Simple linear regression. Tron Anders Moger

Example: Multiple linear regression. Least squares regression. Repetition: Simple linear regression. Tron Anders Moger Example: Multple lear regresso 5000,00 4000,00 Tro Aders Moger 0.0.007 brthweght 3000,00 000,00 000,00 0,00 50,00 00,00 50,00 00,00 50,00 weght pouds Repetto: Smple lear regresso We defe a model Y = β0

More information

Centroids & Moments of Inertia of Beam Sections

Centroids & Moments of Inertia of Beam Sections RCH 614 Note Set 8 S017ab Cetrods & Momets of erta of Beam Sectos Notato: b C d d d Fz h c Jo L O Q Q = ame for area = ame for a (base) wdth = desgato for chael secto = ame for cetrod = calculus smbol

More information

UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS

UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS Exam: ECON430 Statstcs Date of exam: Frday, December 8, 07 Grades are gve: Jauary 4, 08 Tme for exam: 0900 am 00 oo The problem set covers 5 pages Resources allowed:

More information

THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE

THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE THE ROYAL STATISTICAL SOCIETY 00 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE PAPER I STATISTICAL THEORY The Socety provdes these solutos to assst caddates preparg for the examatos future years ad for the

More information

Chapter -2 Simple Random Sampling

Chapter -2 Simple Random Sampling Chapter - Smple Radom Samplg Smple radom samplg (SRS) s a method of selecto of a sample comprsg of umber of samplg uts out of the populato havg umber of samplg uts such that every samplg ut has a equal

More information

Simulation Output Analysis

Simulation Output Analysis Smulato Output Aalyss Summary Examples Parameter Estmato Sample Mea ad Varace Pot ad Iterval Estmato ermatg ad o-ermatg Smulato Mea Square Errors Example: Sgle Server Queueg System x(t) S 4 S 4 S 3 S 5

More information

L5 Polynomial / Spline Curves

L5 Polynomial / Spline Curves L5 Polyomal / Sple Curves Cotets Coc sectos Polyomal Curves Hermte Curves Bezer Curves B-Sples No-Uform Ratoal B-Sples (NURBS) Mapulato ad Represetato of Curves Types of Curve Equatos Implct: Descrbe a

More information

( ) 2 2. Multi-Layer Refraction Problem Rafael Espericueta, Bakersfield College, November, 2006

( ) 2 2. Multi-Layer Refraction Problem Rafael Espericueta, Bakersfield College, November, 2006 Mult-Layer Refracto Problem Rafael Espercueta, Bakersfeld College, November, 006 Lght travels at dfferet speeds through dfferet meda, but refracts at layer boudares order to traverse the least-tme path.

More information

1 Onto functions and bijections Applications to Counting

1 Onto functions and bijections Applications to Counting 1 Oto fuctos ad bectos Applcatos to Coutg Now we move o to a ew topc. Defto 1.1 (Surecto. A fucto f : A B s sad to be surectve or oto f for each b B there s some a A so that f(a B. What are examples of

More information

Multiple Choice Test. Chapter Adequacy of Models for Regression

Multiple Choice Test. Chapter Adequacy of Models for Regression Multple Choce Test Chapter 06.0 Adequac of Models for Regresso. For a lear regresso model to be cosdered adequate, the percetage of scaled resduals that eed to be the rage [-,] s greater tha or equal to

More information

STA 108 Applied Linear Models: Regression Analysis Spring Solution for Homework #1

STA 108 Applied Linear Models: Regression Analysis Spring Solution for Homework #1 STA 08 Appled Lear Models: Regresso Aalyss Sprg 0 Soluto for Homework #. Let Y the dollar cost per year, X the umber of vsts per year. The the mathematcal relato betwee X ad Y s: Y 300 + X. Ths s a fuctoal

More information

Chapter Statistics Background of Regression Analysis

Chapter Statistics Background of Regression Analysis Chapter 06.0 Statstcs Backgroud of Regresso Aalyss After readg ths chapter, you should be able to:. revew the statstcs backgroud eeded for learg regresso, ad. kow a bref hstory of regresso. Revew of Statstcal

More information

PROPERTIES OF GOOD ESTIMATORS

PROPERTIES OF GOOD ESTIMATORS ESTIMATION INTRODUCTION Estmato s the statstcal process of fdg a appromate value for a populato parameter. A populato parameter s a characterstc of the dstrbuto of a populato such as the populato mea,

More information

12.2 Estimating Model parameters Assumptions: ox and y are related according to the simple linear regression model

12.2 Estimating Model parameters Assumptions: ox and y are related according to the simple linear regression model 1. Estmatg Model parameters Assumptos: ox ad y are related accordg to the smple lear regresso model (The lear regresso model s the model that says that x ad y are related a lear fasho, but the observed

More information

Chapter 13 Student Lecture Notes 13-1

Chapter 13 Student Lecture Notes 13-1 Chapter 3 Studet Lecture Notes 3- Basc Busess Statstcs (9 th Edto) Chapter 3 Smple Lear Regresso 4 Pretce-Hall, Ic. Chap 3- Chapter Topcs Types of Regresso Models Determg the Smple Lear Regresso Equato

More information

Multiple Linear Regression Analysis

Multiple Linear Regression Analysis LINEA EGESSION ANALYSIS MODULE III Lecture - 4 Multple Lear egresso Aalyss Dr. Shalabh Departmet of Mathematcs ad Statstcs Ida Isttute of Techology Kapur Cofdece terval estmato The cofdece tervals multple

More information

Lecture 3 Probability review (cont d)

Lecture 3 Probability review (cont d) STATS 00: Itroducto to Statstcal Iferece Autum 06 Lecture 3 Probablty revew (cot d) 3. Jot dstrbutos If radom varables X,..., X k are depedet, the ther dstrbuto may be specfed by specfyg the dvdual dstrbuto

More information

Special Instructions / Useful Data

Special Instructions / Useful Data JAM 6 Set of all real umbers P A..d. B, p Posso Specal Istructos / Useful Data x,, :,,, x x Probablty of a evet A Idepedetly ad detcally dstrbuted Bomal dstrbuto wth parameters ad p Posso dstrbuto wth

More information

Parameter, Statistic and Random Samples

Parameter, Statistic and Random Samples Parameter, Statstc ad Radom Samples A parameter s a umber that descrbes the populato. It s a fxed umber, but practce we do ot kow ts value. A statstc s a fucto of the sample data,.e., t s a quatty whose

More information

Can we take the Mysticism Out of the Pearson Coefficient of Linear Correlation?

Can we take the Mysticism Out of the Pearson Coefficient of Linear Correlation? Ca we tae the Mstcsm Out of the Pearso Coeffcet of Lear Correlato? Itroducto As the ttle of ths tutoral dcates, our purpose s to egeder a clear uderstadg of the Pearso coeffcet of lear correlato studets

More information

Discrete Mathematics and Probability Theory Fall 2016 Seshia and Walrand DIS 10b

Discrete Mathematics and Probability Theory Fall 2016 Seshia and Walrand DIS 10b CS 70 Dscrete Mathematcs ad Probablty Theory Fall 206 Sesha ad Walrad DIS 0b. Wll I Get My Package? Seaky delvery guy of some compay s out delverg packages to customers. Not oly does he had a radom package

More information

Chapter -2 Simple Random Sampling

Chapter -2 Simple Random Sampling Chapter - Smple Radom Samplg Smple radom samplg (SRS) s a method of selecto of a sample comprsg of umber of samplg uts out of the populato havg umber of samplg uts such that every samplg ut has a equal

More information

UNIT 2 SOLUTION OF ALGEBRAIC AND TRANSCENDENTAL EQUATIONS

UNIT 2 SOLUTION OF ALGEBRAIC AND TRANSCENDENTAL EQUATIONS Numercal Computg -I UNIT SOLUTION OF ALGEBRAIC AND TRANSCENDENTAL EQUATIONS Structure Page Nos..0 Itroducto 6. Objectves 7. Ital Approxmato to a Root 7. Bsecto Method 8.. Error Aalyss 9.4 Regula Fals Method

More information

Chapter 11 The Analysis of Variance

Chapter 11 The Analysis of Variance Chapter The Aalyss of Varace. Oe Factor Aalyss of Varace. Radomzed Bloc Desgs (ot for ths course) NIPRL . Oe Factor Aalyss of Varace.. Oe Factor Layouts (/4) Suppose that a expermeter s terested populatos

More information

C. Statistics. X = n geometric the n th root of the product of numerical data ln X GM = or ln GM = X 2. X n X 1

C. Statistics. X = n geometric the n th root of the product of numerical data ln X GM = or ln GM = X 2. X n X 1 C. Statstcs a. Descrbe the stages the desg of a clcal tral, takg to accout the: research questos ad hypothess, lterature revew, statstcal advce, choce of study protocol, ethcal ssues, data collecto ad

More information

ECONOMETRIC THEORY. MODULE VIII Lecture - 26 Heteroskedasticity

ECONOMETRIC THEORY. MODULE VIII Lecture - 26 Heteroskedasticity ECONOMETRIC THEORY MODULE VIII Lecture - 6 Heteroskedastcty Dr. Shalabh Departmet of Mathematcs ad Statstcs Ida Isttute of Techology Kapur . Breusch Paga test Ths test ca be appled whe the replcated data

More information

PTAS for Bin-Packing

PTAS for Bin-Packing CS 663: Patter Matchg Algorthms Scrbe: Che Jag /9/00. Itroducto PTAS for B-Packg The B-Packg problem s NP-hard. If we use approxmato algorthms, the B-Packg problem could be solved polyomal tme. For example,

More information

Midterm Exam 1, section 1 (Solution) Thursday, February hour, 15 minutes

Midterm Exam 1, section 1 (Solution) Thursday, February hour, 15 minutes coometrcs, CON Sa Fracsco State Uversty Mchael Bar Sprg 5 Mdterm am, secto Soluto Thursday, February 6 hour, 5 mutes Name: Istructos. Ths s closed book, closed otes eam.. No calculators of ay kd are allowed..

More information

CHAPTER 2. = y ˆ β x (.1022) So we can write

CHAPTER 2. = y ˆ β x (.1022) So we can write CHAPTER SOLUTIONS TO PROBLEMS. () Let y = GPA, x = ACT, ad = 8. The x = 5.875, y = 3.5, (x x )(y y ) = 5.85, ad (x x ) = 56.875. From equato (.9), we obta the slope as ˆβ = = 5.85/56.875., rouded to four

More information

Assignment 5/MATH 247/Winter Due: Friday, February 19 in class (!) (answers will be posted right after class)

Assignment 5/MATH 247/Winter Due: Friday, February 19 in class (!) (answers will be posted right after class) Assgmet 5/MATH 7/Wter 00 Due: Frday, February 9 class (!) (aswers wll be posted rght after class) As usual, there are peces of text, before the questos [], [], themselves. Recall: For the quadratc form

More information

Median as a Weighted Arithmetic Mean of All Sample Observations

Median as a Weighted Arithmetic Mean of All Sample Observations Meda as a Weghted Arthmetc Mea of All Sample Observatos SK Mshra Dept. of Ecoomcs NEHU, Shllog (Ida). Itroducto: Iumerably may textbooks Statstcs explctly meto that oe of the weakesses (or propertes) of

More information

Chapter 2. 1.what are the types of Data Sets

Chapter 2. 1.what are the types of Data Sets Record Relatoal records Data matr, :umercal matr Documet data: tet documets: term-frequecy vector Trasacto data Graph ad etwork World Wde Web Socal or formato etworks Molecular Structures Chapter.what

More information

Random Variables and Probability Distributions

Random Variables and Probability Distributions Radom Varables ad Probablty Dstrbutos * If X : S R s a dscrete radom varable wth rage {x, x, x 3,. } the r = P (X = xr ) = * Let X : S R be a dscrete radom varable wth rage {x, x, x 3,.}.If x r P(X = x

More information

Lecture 2: Linear Least Squares Regression

Lecture 2: Linear Least Squares Regression Lecture : Lear Least Squares Regresso Dave Armstrog UW Mlwaukee February 8, 016 Is the Relatoshp Lear? lbrary(car) data(davs) d 150) Davs$weght[d]

More information

Chapter Two. An Introduction to Regression ( )

Chapter Two. An Introduction to Regression ( ) ubject: A Itroducto to Regresso Frst tage Chapter Two A Itroducto to Regresso (018-019) 1 pg. ubject: A Itroducto to Regresso Frst tage A Itroducto to Regresso Regresso aalss s a statstcal tool for the

More information

Ideal multigrades with trigonometric coefficients

Ideal multigrades with trigonometric coefficients Ideal multgrades wth trgoometrc coeffcets Zarathustra Brady December 13, 010 1 The problem A (, k) multgrade s defed as a par of dstct sets of tegers such that (a 1,..., a ; b 1,..., b ) a j = =1 for all

More information