5.3 Rank correlations. 194 Multidimensional semiquantitative data

Size: px
Start display at page:

Download "5.3 Rank correlations. 194 Multidimensional semiquantitative data"

Transcription

1 94 Multidimesioal semiquatitative data the Excerpt tests from the Chapter hypothesis 5 of: (H 0 ) that the umbers of pairs with each sig are equal; a equivalet formulatio is that the proportio of pairs with either sig is equal to 0.5. This Legedre, test uses P. & iformatio L. Legedre. about 998. the Numerical directio of ecology, the differeces d Eglish betwee editio. pairs. Elsevier Whe the Sciece relative BV, Amsterdam. magitude of xv the differeces pages. betwee pairs is also kow, it becomes possible to use the more powerful Wilcoxo matched-pairs siged-raks test. Differeces betwee pairs are first raked accordig to their magitude (absolute values), after which the sig of the differece is affixed to each rak. The ull hypothesis of the test (H 0 ) is that the sum of the raks havig a (+) sig is equal to that of the raks with a () sig. The McNemar test provides a meas of comparig paired samples of biary data. For example, usig biary observatios (e.g. presece or absece) made at the same sites, before ad after some evet, oe could test (H 0 ) that o overall chage has occurred. Whe there are several related samples (k ) ad the data are quatitative, the parametric approach for testig (H 0 ) that the meas of the k groups are equal is twoway aalysis of variace, with or without replicatio. Oe classificatio criterio of the two-way ANOVA accouts for the variability amog the k groups (as i oe-way ANOVA above, for k idepedet samples) ad the other for that amog the related samples. Cosider, for a example, 6 sites (i.e. k groups) that have bee sampled at 5 depths i the water colum (or at 5 differet times, or usig 5 differet methods, etc.). The oparametric equivalet, for semiquatitative data, is Friedma s two-way aalysis of variace by raks without replicatio, which is based o a two-way table similar to Table 5.7. I the two-way table, the k groups (e.g. 6 sites) are i rows ad the correspodig samples (e.g. 5 depths) are i colums. Values withi each colum are raked separately, ad the Friedma statistic (eq. 5.0) is used to test (H 0 ) that the rak totals of the various rows (i.e. groups) are equal. For biary data, the Cochra Q test is a extesio to k groups of the McNemar test, described above for k. Fially, whe there are several samples (k ), related across several classificatio criteria (e.g. 6 sites all sampled at 8 differet times, usig each time 5 differet methods), multiway ANOVA is the stadard parametric method for testig the ull hypothesis (H 0 ) that the meas of the k groups are equal (F-test). I that case, there are o obvious equivalet approaches for semiquatitative or qualitative data. 5.3 Rak correlatios Textbooks of oparametric statistics propose a few methods oly for the aalysis of bi- or multivariate semiquatitative data. Sectio 5. has show that there actually exist may umerical approaches for aalysig multidimesioal data, correspodig to all levels of precisio (Table 5.). These methods, which iclude most of those described i this book, belog to oparametric statistics i a geeral sese, because they do ot focus o the parameters of the data distributios. Withi the specific realm of rakig tests, however, the oly statistical techiques available for

2 Rak correlatios 95 Table 5.3 Numerical example. Perfect rak correlatio betwee descriptors y ad y. Objects Raks of objects o the two descriptors (observatio uits) y y x 5 5 x x x 4 x multidimesioal semiquatitative data are two rak correlatio coefficiets (Spearma r ad Kedall ), which both quatify the relatioship betwee two descriptors, ad the coefficiet of cocordace (Kedall W), which assesses the relatioship amog several descriptors. These are described i some detail i the preset sectio. Spearma corr. coeff. The Spearma r statistic, also called (rho), is based o the idea that two descriptors y ad y carry the same iformatio if the largest object o y also has the highest rak o y, ad so o for all other objects. Two descriptors are said to be i perfect correlatio whe the raks of all object are the same o both descriptors, as i the umerical example of Table 5.3. If, however, object x which has rak 5 o y had rak o y, it would be atural to use the differece betwee these raks d (y y ) (5 ) 3 as a measure of the differece betwee the two descriptors, for this object. For the whole set of objects, differeces d i are squared before summig them, i order to prevet differeces with opposite sigs from cacellig each other out. The expressio for the Spearma r may be derived from the geeral formula of correlatio coefficiets (Kedall, 948): y i j y j y ik y k i r jk y i j y j y i k y k (5.)

3 96 Multidimesioal semiquatitative data For raked data, the average raks y j ad y k are equal, so that y ij y j y i k y k (y ij y ik ). Oe ca write the differece betwee the raks of object i o the two descriptors as d i (y ij y ik ) y ij y j y i k y k, which leads to: d i y i j y j + y i k y k y i j y j y ik y k Isolatig the right-had sum gives: y ij y j y i k y k - y i j y j + y i k y k d i Usig this result, eq. 5. is rewritte as: r jk - y i j y j + y i k y k d i y i j y j y i k y k (5.) The sum of raks for each descriptor, which is the sum of the first itegers, is equal to ( + )/ ad the sum of their squares is y i j Sice the sum of deviatios from the mea rak is y ij y j y ij - y ij oe ca write: y ij y j It follows that, whe usig raks, the umerator of eq. 5. becomes: -- y ij y j + y ik y k d i d i while its deomiator reduces to: y ij y j y ik y k

4 Rak correlatios 97 Table 5.4 Numerical example. Raks of four objects o two descriptors, y ad y. Objects Raks of objects o the two descriptors (observatio uits) y y x 3 3 x 4 x 3 4 x 4 The fial formula is obtaied by replacig the above two expressios i eq. 5.. This developmet shows that, whe usig raks, eq. 5. simplifies to the followig formula for Spearma s r: r j k d i 6 d i i i (5.3) Alteratively, the Spearma rak correlatio coefficiet may be obtaied i two steps: () replace all observatios by raks (columwise) ad () compute the Pearso correlatio coefficiet (eq. 4.7) betwee the raked variables. The result is the same as obtaied from eq The Spearma r coefficiet varies betwee + ad, just like the Pearso r. Descriptors that are perfectly matched, i terms of raks, exhibit values r + (direct relatioship) or r (iverse relatioship), whereas r 0 idicates the absece of a mootoic relatioship betwee the two descriptors. (Relatioships that are ot mootoic, e.g. Fig. 4.4d, ca be quatified usig polyomial or oliear regressio, or else cotigecy coefficiets; see Sectios 6. ad 0.3.) Numerical example. A small example (raked data, Table 5.4) illustrates the equivalece betwee eq. 5. computed o raks ad eq Usig eq. 5. gives: The same result is obtaied from eq. 5.3: r r

5 98 Multidimesioal semiquatitative data Two or more objects may have the same rak o a give descriptor. This is ofte the case with descriptors used i ecology, which may have a small umber of states or ordered classes. Such observatios are said to be tied. Each of them is assiged the average of the raks which would have bee assiged had o ties occurred. If the proportio of tied observatios is large, correctio factors must be itroduced ito the sums of squared deviatios of eq. 5., which become: y ij y j t r j q r 3 t rj ad y ik y k t rk s r 3 t rk where t rj ad t rk are the umbers of observatios i descriptors y j ad y k which are tied at raks r, these values beig summed over the q sets of tied observatios i descriptor j ad the s sets i descriptor k. Sigificace of the Spearma r is usually tested agaist the ull hypothesis H 0 : r 0. Whe 0, the test statistic is the same as for Pearso s r (eq. 4.3): t r ki r k i (5.4) H 0 is tested by comparig statistic t to the value foud i a table of critical values of t, with degrees of freedom. H 0 is rejected whe the probability correspodig to t is smaller tha a predetermied level of sigificace ( for a two-tailed test). The rules for oe-tailed ad two-tailed tests are the same as for the Pearso r (Sectio 4.). Whe < 0, which is ot ofte the case i ecology, oe must refer to a special table of critical values of the Spearma rak correlatio coefficiet, foud i textbooks of oparametric statistics. Kedall corr. coeff. Kedall s (tau) is aother rak correlatio coefficiet, which ca be used for the same types of descriptors as Spearma s r. Oe major advatage of over Spearma s r is that the former ca be geeralized to a partial correlatio coefficiet (below), which is ot the case for the latter. While Spearma s r was based o the differeces betwee the raks of objects o the two descriptors beig compared, Kedall s refers to a somewhat differet cocept, which is best explaied usig a example. Numerical example. Kedall s is calculated o the example of Table 5.4, already used for computig Spearma s r. I Table 5.5, the order of the objects was rearraged so as to obtai icreasig raks o oe of the two descriptors (here y ). The Table is used to determie the degree of depedece betwee the two descriptors. Sice the raks are ow i icreasig order

6 Rak correlatios 99 Table 5.5 Numerical example. The order of the four objects from Table 5.4 has bee rearraged i such a way that the raks o y are ow i icreasig order Objects Raks of objects o the two descriptors (observatio uits) y y x 4 x 3 4 x 3 3 x 4 o y, it is sufficiet to determie how may pairs of raks are also i icreasig order o y to obtai a measure of the associatio betwee the two descriptors. Cosiderig the object i first rak (i.e. x 4 ), at the top of the right-had colum, the first pair of raks ( ad 4, belogig to objects x 4 ad x 3 ) is i icreasig order; a score of + is assiged to it. The same goes for the secod pair ( ad 3, belogig to objects x 4 ad x ). The third pair of raks ( ad, belogig to objects x 4 ad x ) is i decreasig order, however, so that it ears a egative score. The same operatio is repeated for every object i successive raks alog y, i.e. for the object i secod rak (x 3 ): first pair of raks (4 ad 3, belogig to objects x 3 ad x ), etc. The sum S of scores assiged to each of the ( )/ differet pairs of raks is the computed. Kedall's rak correlatio coefficiet is defied as follows: a S S (5.5) where S stads for sum of scores. Kedall's a is thus the sum of scores for pairs i icreasig ad decreasig order, divided by the total umber of pairs (( )/). For the example of Tables 5.4 ad 5.5, a is: a 4 3 Clearly, i the case of perfect agreemet betwee two descriptors, all pairs receive a positive score, so that S ( )/ ad thus a +. Whe there is complete disagreemet, S ( )/ ad thus a. Whe the descriptors are totally urelated, the positive ad egative scores cacel out, so that S as well as a are ear 0. Equatio 5.5 caot be used for computig whe there are tied observatios. This is ofte the case with ecological semiquatitative descriptors, which may have a small umber of states. The Kedall rak correlatio is the computed o a cotigecy table (see Chapter 6) crossig two semiquatitative descriptors.

7 00 Multidimesioal semiquatitative data Table 5.6 Numerical example. Cotigecy table givig the distributio of 80 objects amog the states of two semiquatitative descriptors, a ad b. Numbers i the table are frequecies (f). b b b 3 b 4 t j a a a a t k Table 5.6 is a cotigecy table crossig two ordered descriptors. For example, descriptor a could represet the relative abudaces of arthropods i soil eumerated o a semiquatitative scale (e.g. abset, preset, abudat ad very abudat), while descriptor b could be the cocetratio of orgaic matter i the soil, divided ito 4 classes. For simplicity, descriptors are called a ad b here, as i Chapter 6. The states of a vary from to r (umber of rows) while the states of b go from to c (umber of colums). To compute with tied observatios, S is calculated as the differece betwee the umbers of positive (P) ad egative (Q) scores, S P Q. P is the sum of all frequecies f i the cotigecy table, each oe multiplied by the sum of all frequecies located lower ad o its right: r c P f j k f lm r c j k l j + m k + Likewise, Q is the sum of all frequecies f i the table, each oe multiplied by the sum of all frequecies lower ad o its left: r c k Q f j k f lm r j k l j + m Numerical example. For Table 5.6: P (0 40) + (0 30) + (0 0) + (0 0) + (0 0) 600 Q (0 0) + (0 0) 00 S P Q

8 Rak correlatios 0 Usig this value S, there are two approaches for calculatig, depedig o the umbers of states i the two descriptors. Whe a ad b have the same umbers of states (r c), b is computed usig a formula that icludes the total umber of pairs ( )/, as i the case of a (eq. 5.5). The differece with eq. 5.5 is that b icludes correctios for the umber of pairs L tied i a ad the umber of pairs L tied i b, where r L -- t j t j j c L - t k t k k i which t j is the margial total for row j i which t k is the margial total for colum k. The formula for b is: b S L -- L (5.6) Whe there are o tied observatios, L L 0 ad eq. 5.6 becomes idetical to eq Numerical example. For Table 5.6: L L b Without correctio for ties, the calculated value (eq. 5.5) would have bee a ( 400) / (80 79) 0.44 The secod approach for calculatig with tied observatios should be used whe a ad b do ot have the same umber of states (r c). The formula for c uses the miimum umber of states i either a or b, mi(r, c): c S -- mi mi (5.7)

9 0 Multidimesioal semiquatitative data The sigificace of Kedall s is tested by referece to the ull hypothesis H 0 : r 0 (i.e. idepedece of the two descriptors). A test statistic is obtaied by trasformig ito z (or t ) usig the followig formula (Kedall, 948): z (5.8) Whe 30, the secod term of eq. 5.8 becomes egligible (at 30, the value of this term is oly 0.078). For 0, the samplig distributio of is almost the same as the ormal distributio, so that H 0 is tested usig a table of z. Sice z tables are oetailed, the z statistic of eq. 5.8 may be used directly for oe-tailed tests by comparig it to the value z read i the table. For two-tailed tests, the statistic is compared to the value z / from the z table. Whe < 0, which is seldom the case i ecology, oe should refer to Table B, at the ed of this book. Table B gives the critical values of for 4 50 (oe-tailed ad two-tailed tests). Power Spearma s r provides a better approximatio of Pearso s r whe the data are almost quatitative ad there are but a few tied observatios, whereas Kedall s does better whe there are may ties. Computig both Spearma s r ad Kedall s a o the same umerical example, above, produced differet umerical values (i.e. r 0.40 versus a 0.33). This is because the two coefficiets have differet uderlyig scales, so that their umerical values caot be directly compared. However, give their differet samplig distributios, they both reject H 0 at the same level of sigificace. If applied to quatitative data that are meetig all the requiremets of Pearso s r, both Spearma s r ad Kedall s have power early as high (about 9%; Hotellig & Pabst, 936) as their parametric equivalet. I all other cases, they are more powerful tha Pearso s r. This refers to the otio of power of statistical tests: a test is more powerful tha aother if it is more likely to detect small deviatios from H 0 (i.e. smaller type II error), for costat type I error. The chief advatage of Kedall s over Spearma s r, as already metioed, is that it ca be geeralized to a partial correlatio coefficiet, which caot be doe with Spearma s (Siegel, 956: 4). The formula for a partial is: (5.9) This formula is algebraically the same as that of first-order partial Pearso r (eq. 4.36) although, accordig to Kedall (948: 03), this would be merely coicidetal because the two formulae are derived usig etirely differet approaches. The three coefficiets o the right-had side of eq. 5.9 may themselves be partial s, thus allowig oe to cotrol for more tha oe descriptor (i.e. high order partial correlatio coefficiets). Siegel & Castella (988) give tables for testig the sigificace of the Kedall partial correlatio coefficiet.

10 Rak correlatios 03 Kedall coeff. of cocordace Rak correlatio coefficiets should ot be used i the Q mode, i.e. for comparig objects istead of descriptors. This is also the case for the Pearso r (Sectio 7.5). The reasos for this are the followig: While physical dimesios disappear whe computig correlatio coefficiets betwee variables expressed i differet uits, the same coefficiets computed betwee objects have complex ad o-iterpretable physical dimesios. Physical descriptors are usually expressed i somewhat arbitrary uits (e.g. mm, cm, m, or km are all equally correct, i priciple). Ay arbitrary chage i uits could dramatically chage the values of correlatios computed betwee objects. Descriptors may be stadardized first to alleviate these problems but stadardizatio of quatitative descriptors, before rak-orderig the data withi objects, chages the values alog object vectors i a omootoic way. The correlatio betwee two objects is a fuctio of the values of all the other objects i the data set. Cosider species abudace data. At most samplig sites, several species are represeted by a small umber of idividuals, this umber beig subject to stochastic variability. It follows that their raks, i a give observatio uit, may ot strictly correspod to their quatitative importace i the ecosystem. A rak correlatio coefficiet computed betwee observatio uits would thus have high variace sice it would be computed o may ucertai raks, givig a prepoderat importace to the may poorly sampled species. While the cetral limit theorem isures that meas, variaces, covariaces, ad correlatios coverge towards their populatio values whe the umber of objects icreases, computig these same parameters i the Q mode is likely to have the opposite effect sice the additio of ew variables ito the calculatios is likely to chage the values of these parameters i a o-trivial way. Correlatio coefficiets ca be tested by the method of permutatios, as described i Subsectio..3. I the R mode, permutig the values of a variable withi a colum makes physical sese: uder H 0, that value could be foud at ay oe site. I the Q mode, however, permutig values withi a row of the data matrix does ot make sese because, i the real world, these values could ot belog to differet variables. As a illustratio, it would ot make sese to move a saliity of 35 psu to the ph colum. The rak correlatio coefficiets described above measure the correlatio for pairs of descriptors, based o objects. I cotrast, Kedall s coefficiet of cocordace W measures the relatioship amog several rak-ordered variables for objects. I Table 5., Kedall s W is listed as equivalet to the coefficiet of multiple liear correlatio R, but the approach is actually quite differet. The aalysis is coducted o a table which cotais, i each colum, the raks of the objects o oe of the p descriptors, e.g. Table 5.7. Friedma (937) has show

11 04 Multidimesioal semiquatitative data that, whe the umber of rows ad/or colums is large eough, the followig statistic is approximately distributed as with degrees of freedom: X p + R i 3 p + (5.0) where R i is the sum of the raks for row i. This is Friedma s statistic for two-way aalysis of variace by raks. Kedall s coefficiet of cocordace (Kedall, 948) ca be obtaied by trasformig the Friedma s X statistic as follows: W X p (5.) It ca be show that the followig expressio is equivalet to eq. 5.: R i R W p 3 (5.) Kedall s W statistic is simply the variace of the row sums of raks R i divided by the maximum possible value that this variace ca take; this occurs whe all variables are i total agreemet. Hece 0 W. Two properties are used to demostrate the equivalece of eqs. 5. ad 5.. The first oe is that R i R R i - R i ad the secod is that the sum of the all R i values i the table is p( + )/. I the presece of tied values, the formula is modified as follows: *** develop from the Kedall paper*** Whe there are o ties, W ca be computed from the mea, r, of the Spearma correlatios amog all variables (Siegel & Castella 988): p r + W p (5.3)

12 Rak correlatios 05 Table 5.7 Numerical example. Raks of six objects o three descriptors, y, y, ad y 3. Objects Raks of objects o the three descriptors Row sums (observatio uits) y y y 3 R i x 6 8 x x x x x Coefficiet W varies betwee 0 (o cocordace) ad (maximum cocordace). Its sigificace is tested either usig eq. 5. directly, or after trasformig W ito the associated X statistic: X p( )W The ull hypothesis (H 0 ) subjected to testig is that the row sums R i are equal or, i other words, that the p sets of raks (or the p semiquatitative descriptors) are idepedet of oe aother. The X statistic is compared to a value read i a table of critical values of, for ( ). Whe X is smaller tha the critical value (i.e. probability larger tha ), the ull hypothesis that the row sums R i are equal caot be rejected; this leads to the coclusio that the p descriptors are idepedet ad differ i the way they rak the objects. O the cotrary, X (i.e. probability smaller tha or equal to ) idicates good agreemet amog the descriptors i the way they rak the objects. Textbooks of oparametric statistics provide modified formulae for X, for data sets with tied observatios. Numerical example. Calculatio of Kedall s coefficiet of cocordace is illustrated usig the umerical example of Table 5.7. Data could be semiquatitative rak scores, or quatitative descriptors coded ito raks. It is importat to ote that the 6 objects are raked o each descriptor (colum) separately. The last colum gives, for each object i, the sum R i of its raks o the p 3 descriptors. The sum of squared deviatios from the mea, R i R, is equal to 5.5 for this example. The Friedma statistic is calculated with eq. 5.0: X Usig eq. 5., the X statistic is trasformed ito Kedall s W:

13 W Alteratively, W could have bee computed usig eq. 5.: W A table of critical values of idicates that X.43, for 6 5, correspods to a probability of ca. 0.80; the probability associated with this X statistic is actually The hypothesis (H 0 ) that the row sums R i are equal caot be rejected. Oe cocludes that the three descriptors differ i the way they rak the 6 objects. Permutatio test The Kedall coefficiet of cocordace ca also be tested by permutatio. The Kedall cocordace method with permutatio testig has bee used for the search for species associatio (Legedre 005), which is oe of the classical problems of commuity ecology. It is implemeted i the fuctios kedall.global (global test of the cocordace amog all members of a associatio) ad kedall.post (a posteriori test of the cotributio of idividual species to the overall cocordace of their group ) of the R-laguage library vega (Oksae et al. 009). The cocordace amog distace matrices (CADM) ca be tested usig a test of sigificace proposed by Legedre & Lapoite (004, 005). The distace matrices uder compariso are strug out like the descriptors i Table 5.7. The coefficiet of cocordace is computed ad tested usig the same permutatio procedure as i the Matel test (Subsectio 0.5.). This test is actually a geeralizatio of the Matel test of matrix correspodece to ay umber of distace matrices. This method is available i the fuctios CADM.global ad CADM.post of the R-laguage library ape (Paradis et al. 009). Additioal refereces Legedre, P Species associatios: the Kedall coefficiet of cocordace revisited. Joural of Agricultural, Biological, ad Evirometal Statistics 0: Oksae, J., R. Kidt, P. Legedre, B. O'Hara, G. L. Simpso, P. Solymos, M. H. H. Steves, ad H. Wager vega: Commuity Ecology Package. R package versio Paradis, E., B. Bolker, J. Claude, H. S. Cuog, R. Desper, B. Durad, J. Dutheil, O. Gascuel, G. Jobb, C. Heibl, D. Lawso, V. Lefort, P. Legedre, J. Lemo, Y. Noel, J. Nylader, R. Opge- Rhei, K. Strimmer, ad D. de Viee ape: Aalyses of Phylogeetics ad Evolutio. R package versio.3.

Properties and Hypothesis Testing

Properties and Hypothesis Testing Chapter 3 Properties ad Hypothesis Testig 3.1 Types of data The regressio techiques developed i previous chapters ca be applied to three differet kids of data. 1. Cross-sectioal data. 2. Time series data.

More information

4 Multidimensional quantitative data

4 Multidimensional quantitative data Chapter 4 Multidimesioal quatitative data 4 Multidimesioal statistics Basic statistics are ow part of the curriculum of most ecologists However, statistical techiques based o such simple distributios as

More information

GG313 GEOLOGICAL DATA ANALYSIS

GG313 GEOLOGICAL DATA ANALYSIS GG313 GEOLOGICAL DATA ANALYSIS 1 Testig Hypothesis GG313 GEOLOGICAL DATA ANALYSIS LECTURE NOTES PAUL WESSEL SECTION TESTING OF HYPOTHESES Much of statistics is cocered with testig hypothesis agaist data

More information

Chapter 12 Correlation

Chapter 12 Correlation Chapter Correlatio Correlatio is very similar to regressio with oe very importat differece. Regressio is used to explore the relatioship betwee a idepedet variable ad a depedet variable, whereas correlatio

More information

Correlation Regression

Correlation Regression Correlatio Regressio While correlatio methods measure the stregth of a liear relatioship betwee two variables, we might wish to go a little further: How much does oe variable chage for a give chage i aother

More information

[412] A TEST FOR HOMOGENEITY OF THE MARGINAL DISTRIBUTIONS IN A TWO-WAY CLASSIFICATION

[412] A TEST FOR HOMOGENEITY OF THE MARGINAL DISTRIBUTIONS IN A TWO-WAY CLASSIFICATION [412] A TEST FOR HOMOGENEITY OF THE MARGINAL DISTRIBUTIONS IN A TWO-WAY CLASSIFICATION BY ALAN STUART Divisio of Research Techiques, Lodo School of Ecoomics 1. INTRODUCTION There are several circumstaces

More information

Power and Type II Error

Power and Type II Error Statistical Methods I (EXST 7005) Page 57 Power ad Type II Error Sice we do't actually kow the value of the true mea (or we would't be hypothesizig somethig else), we caot kow i practice the type II error

More information

Overview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions

Overview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions Chapter 9 Slide Ifereces from Two Samples 9- Overview 9- Ifereces about Two Proportios 9- Ifereces about Two Meas: Idepedet Samples 9-4 Ifereces about Matched Pairs 9-5 Comparig Variatio i Two Samples

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2016 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND.

MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND. XI-1 (1074) MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND. R. E. D. WOOLSEY AND H. S. SWANSON XI-2 (1075) STATISTICAL DECISION MAKING Advaced

More information

11 Correlation and Regression

11 Correlation and Regression 11 Correlatio Regressio 11.1 Multivariate Data Ofte we look at data where several variables are recorded for the same idividuals or samplig uits. For example, at a coastal weather statio, we might record

More information

Chapter 13, Part A Analysis of Variance and Experimental Design

Chapter 13, Part A Analysis of Variance and Experimental Design Slides Prepared by JOHN S. LOUCKS St. Edward s Uiversity Slide 1 Chapter 13, Part A Aalysis of Variace ad Eperimetal Desig Itroductio to Aalysis of Variace Aalysis of Variace: Testig for the Equality of

More information

Describing the Relation between Two Variables

Describing the Relation between Two Variables Copyright 010 Pearso Educatio, Ic. Tables ad Formulas for Sulliva, Statistics: Iformed Decisios Usig Data 010 Pearso Educatio, Ic Chapter Orgaizig ad Summarizig Data Relative frequecy = frequecy sum of

More information

Continuous Data that can take on any real number (time/length) based on sample data. Categorical data can only be named or categorised

Continuous Data that can take on any real number (time/length) based on sample data. Categorical data can only be named or categorised Questio 1. (Topics 1-3) A populatio cosists of all the members of a group about which you wat to draw a coclusio (Greek letters (μ, σ, Ν) are used) A sample is the portio of the populatio selected for

More information

If, for instance, we were required to test whether the population mean μ could be equal to a certain value μ

If, for instance, we were required to test whether the population mean μ could be equal to a certain value μ STATISTICAL INFERENCE INTRODUCTION Statistical iferece is that brach of Statistics i which oe typically makes a statemet about a populatio based upo the results of a sample. I oesample testig, we essetially

More information

Expectation and Variance of a random variable

Expectation and Variance of a random variable Chapter 11 Expectatio ad Variace of a radom variable The aim of this lecture is to defie ad itroduce mathematical Expectatio ad variace of a fuctio of discrete & cotiuous radom variables ad the distributio

More information

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering CEE 5 Autum 005 Ucertaity Cocepts for Geotechical Egieerig Basic Termiology Set A set is a collectio of (mutually exclusive) objects or evets. The sample space is the (collectively exhaustive) collectio

More information

Chapter 6 Sampling Distributions

Chapter 6 Sampling Distributions Chapter 6 Samplig Distributios 1 I most experimets, we have more tha oe measuremet for ay give variable, each measuremet beig associated with oe radomly selected a member of a populatio. Hece we eed to

More information

Because it tests for differences between multiple pairs of means in one test, it is called an omnibus test.

Because it tests for differences between multiple pairs of means in one test, it is called an omnibus test. Math 308 Sprig 018 Classes 19 ad 0: Aalysis of Variace (ANOVA) Page 1 of 6 Itroductio ANOVA is a statistical procedure for determiig whether three or more sample meas were draw from populatios with equal

More information

Lecture 8: Non-parametric Comparison of Location. GENOME 560, Spring 2016 Doug Fowler, GS

Lecture 8: Non-parametric Comparison of Location. GENOME 560, Spring 2016 Doug Fowler, GS Lecture 8: No-parametric Compariso of Locatio GENOME 560, Sprig 2016 Doug Fowler, GS (dfowler@uw.edu) 1 Review What do we mea by oparametric? What is a desirable locatio statistic for ordial data? What

More information

PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 9

PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 9 Hypothesis testig PSYCHOLOGICAL RESEARCH (PYC 34-C Lecture 9 Statistical iferece is that brach of Statistics i which oe typically makes a statemet about a populatio based upo the results of a sample. I

More information

Random Variables, Sampling and Estimation

Random Variables, Sampling and Estimation Chapter 1 Radom Variables, Samplig ad Estimatio 1.1 Itroductio This chapter will cover the most importat basic statistical theory you eed i order to uderstad the ecoometric material that will be comig

More information

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING Lectures MODULE 5 STATISTICS II. Mea ad stadard error of sample data. Biomial distributio. Normal distributio 4. Samplig 5. Cofidece itervals

More information

Lecture 5: Parametric Hypothesis Testing: Comparing Means. GENOME 560, Spring 2016 Doug Fowler, GS

Lecture 5: Parametric Hypothesis Testing: Comparing Means. GENOME 560, Spring 2016 Doug Fowler, GS Lecture 5: Parametric Hypothesis Testig: Comparig Meas GENOME 560, Sprig 2016 Doug Fowler, GS (dfowler@uw.edu) 1 Review from last week What is a cofidece iterval? 2 Review from last week What is a cofidece

More information

6 Sample Size Calculations

6 Sample Size Calculations 6 Sample Size Calculatios Oe of the major resposibilities of a cliical trial statisticia is to aid the ivestigators i determiig the sample size required to coduct a study The most commo procedure for determiig

More information

1 Inferential Methods for Correlation and Regression Analysis

1 Inferential Methods for Correlation and Regression Analysis 1 Iferetial Methods for Correlatio ad Regressio Aalysis I the chapter o Correlatio ad Regressio Aalysis tools for describig bivariate cotiuous data were itroduced. The sample Pearso Correlatio Coefficiet

More information

Comparing your lab results with the others by one-way ANOVA

Comparing your lab results with the others by one-way ANOVA Comparig your lab results with the others by oe-way ANOVA You may have developed a ew test method ad i your method validatio process you would like to check the method s ruggedess by coductig a simple

More information

UCLA STAT 110B Applied Statistics for Engineering and the Sciences

UCLA STAT 110B Applied Statistics for Engineering and the Sciences UCLA SA 0B Applied Statistics for Egieerig ad the Scieces Istructor: Ivo Diov, Asst. Prof. I Statistics ad Neurology eachig Assistats: Bria Ng, UCLA Statistics Uiversity of Califoria, Los Ageles, Sprig

More information

Investigating the Significance of a Correlation Coefficient using Jackknife Estimates

Investigating the Significance of a Correlation Coefficient using Jackknife Estimates Iteratioal Joural of Scieces: Basic ad Applied Research (IJSBAR) ISSN 2307-4531 (Prit & Olie) http://gssrr.org/idex.php?joural=jouralofbasicadapplied ---------------------------------------------------------------------------------------------------------------------------

More information

10. Comparative Tests among Spatial Regression Models. Here we revisit the example in Section 8.1 of estimating the mean of a normal random

10. Comparative Tests among Spatial Regression Models. Here we revisit the example in Section 8.1 of estimating the mean of a normal random Part III. Areal Data Aalysis 0. Comparative Tests amog Spatial Regressio Models While the otio of relative likelihood values for differet models is somewhat difficult to iterpret directly (as metioed above),

More information

Session 5. (1) Principal component analysis and Karhunen-Loève transformation

Session 5. (1) Principal component analysis and Karhunen-Loève transformation 200 Autum semester Patter Iformatio Processig Topic 2 Image compressio by orthogoal trasformatio Sessio 5 () Pricipal compoet aalysis ad Karhue-Loève trasformatio Topic 2 of this course explais the image

More information

t distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference

t distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference EXST30 Backgroud material Page From the textbook The Statistical Sleuth Mea [0]: I your text the word mea deotes a populatio mea (µ) while the work average deotes a sample average ( ). Variace [0]: The

More information

Sample Size Determination (Two or More Samples)

Sample Size Determination (Two or More Samples) Sample Sie Determiatio (Two or More Samples) STATGRAPHICS Rev. 963 Summary... Data Iput... Aalysis Summary... 5 Power Curve... 5 Calculatios... 6 Summary This procedure determies a suitable sample sie

More information

Comparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading

Comparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading Topic 15 - Two Sample Iferece I STAT 511 Professor Bruce Craig Comparig Two Populatios Research ofte ivolves the compariso of two or more samples from differet populatios Graphical summaries provide visual

More information

Discrete-Time Systems, LTI Systems, and Discrete-Time Convolution

Discrete-Time Systems, LTI Systems, and Discrete-Time Convolution EEL5: Discrete-Time Sigals ad Systems. Itroductio I this set of otes, we begi our mathematical treatmet of discrete-time s. As show i Figure, a discrete-time operates or trasforms some iput sequece x [

More information

General IxJ Contingency Tables

General IxJ Contingency Tables page1 Geeral x Cotigecy Tables We ow geeralize our previous results from the prospective, retrospective ad cross-sectioal studies ad the Poisso samplig case to x cotigecy tables. For such tables, the test

More information

Final Examination Solutions 17/6/2010

Final Examination Solutions 17/6/2010 The Islamic Uiversity of Gaza Faculty of Commerce epartmet of Ecoomics ad Political Scieces A Itroductio to Statistics Course (ECOE 30) Sprig Semester 009-00 Fial Eamiatio Solutios 7/6/00 Name: I: Istructor:

More information

Additional Notes and Computational Formulas CHAPTER 3

Additional Notes and Computational Formulas CHAPTER 3 Additioal Notes ad Computatioal Formulas APPENDIX CHAPTER 3 1 The Greek capital sigma is the mathematical sig for summatio If we have a sample of observatios say y 1 y 2 y 3 y their sum is y 1 + y 2 +

More information

4. Hypothesis testing (Hotelling s T 2 -statistic)

4. Hypothesis testing (Hotelling s T 2 -statistic) 4. Hypothesis testig (Hotellig s T -statistic) Cosider the test of hypothesis H 0 : = 0 H A = 6= 0 4. The Uio-Itersectio Priciple W accept the hypothesis H 0 as valid if ad oly if H 0 (a) : a T = a T 0

More information

Stat 139 Homework 7 Solutions, Fall 2015

Stat 139 Homework 7 Solutions, Fall 2015 Stat 139 Homework 7 Solutios, Fall 2015 Problem 1. I class we leared that the classical simple liear regressio model assumes the followig distributio of resposes: Y i = β 0 + β 1 X i + ɛ i, i = 1,...,,

More information

Goodness-Of-Fit For The Generalized Exponential Distribution. Abstract

Goodness-Of-Fit For The Generalized Exponential Distribution. Abstract Goodess-Of-Fit For The Geeralized Expoetial Distributio By Amal S. Hassa stitute of Statistical Studies & Research Cairo Uiversity Abstract Recetly a ew distributio called geeralized expoetial or expoetiated

More information

Agenda: Recap. Lecture. Chapter 12. Homework. Chapt 12 #1, 2, 3 SAS Problems 3 & 4 by hand. Marquette University MATH 4740/MSCS 5740

Agenda: Recap. Lecture. Chapter 12. Homework. Chapt 12 #1, 2, 3 SAS Problems 3 & 4 by hand. Marquette University MATH 4740/MSCS 5740 Ageda: Recap. Lecture. Chapter Homework. Chapt #,, 3 SAS Problems 3 & 4 by had. Copyright 06 by D.B. Rowe Recap. 6: Statistical Iferece: Procedures for μ -μ 6. Statistical Iferece Cocerig μ -μ Recall yes

More information

Principle Of Superposition

Principle Of Superposition ecture 5: PREIMINRY CONCEP O RUCUR NYI Priciple Of uperpositio Mathematically, the priciple of superpositio is stated as ( a ) G( a ) G( ) G a a or for a liear structural system, the respose at a give

More information

The Random Walk For Dummies

The Random Walk For Dummies The Radom Walk For Dummies Richard A Mote Abstract We look at the priciples goverig the oe-dimesioal discrete radom walk First we review five basic cocepts of probability theory The we cosider the Beroulli

More information

6.3 Testing Series With Positive Terms

6.3 Testing Series With Positive Terms 6.3. TESTING SERIES WITH POSITIVE TERMS 307 6.3 Testig Series With Positive Terms 6.3. Review of what is kow up to ow I theory, testig a series a i for covergece amouts to fidig the i= sequece of partial

More information

Common Large/Small Sample Tests 1/55

Common Large/Small Sample Tests 1/55 Commo Large/Small Sample Tests 1/55 Test of Hypothesis for the Mea (σ Kow) Covert sample result ( x) to a z value Hypothesis Tests for µ Cosider the test H :μ = μ H 1 :μ > μ σ Kow (Assume the populatio

More information

A proposed discrete distribution for the statistical modeling of

A proposed discrete distribution for the statistical modeling of It. Statistical Ist.: Proc. 58th World Statistical Cogress, 0, Dubli (Sessio CPS047) p.5059 A proposed discrete distributio for the statistical modelig of Likert data Kidd, Marti Cetre for Statistical

More information

Regression, Inference, and Model Building

Regression, Inference, and Model Building Regressio, Iferece, ad Model Buildig Scatter Plots ad Correlatio Correlatio coefficiet, r -1 r 1 If r is positive, the the scatter plot has a positive slope ad variables are said to have a positive relatioship

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

Statistical Hypothesis Testing. STAT 536: Genetic Statistics. Statistical Hypothesis Testing - Terminology. Hardy-Weinberg Disequilibrium

Statistical Hypothesis Testing. STAT 536: Genetic Statistics. Statistical Hypothesis Testing - Terminology. Hardy-Weinberg Disequilibrium Statistical Hypothesis Testig STAT 536: Geetic Statistics Kari S. Dorma Departmet of Statistics Iowa State Uiversity September 7, 006 Idetify a hypothesis, a idea you wat to test for its applicability

More information

Testing Statistical Hypotheses for Compare. Means with Vague Data

Testing Statistical Hypotheses for Compare. Means with Vague Data Iteratioal Mathematical Forum 5 o. 3 65-6 Testig Statistical Hypotheses for Compare Meas with Vague Data E. Baloui Jamkhaeh ad A. adi Ghara Departmet of Statistics Islamic Azad iversity Ghaemshahr Brach

More information

Linear Regression Demystified

Linear Regression Demystified Liear Regressio Demystified Liear regressio is a importat subject i statistics. I elemetary statistics courses, formulae related to liear regressio are ofte stated without derivatio. This ote iteds to

More information

Sample Size Estimation in the Proportional Hazards Model for K-sample or Regression Settings Scott S. Emerson, M.D., Ph.D.

Sample Size Estimation in the Proportional Hazards Model for K-sample or Regression Settings Scott S. Emerson, M.D., Ph.D. ample ie Estimatio i the Proportioal Haards Model for K-sample or Regressio ettigs cott. Emerso, M.D., Ph.D. ample ie Formula for a Normally Distributed tatistic uppose a statistic is kow to be ormally

More information

Open book and notes. 120 minutes. Cover page and six pages of exam. No calculators.

Open book and notes. 120 minutes. Cover page and six pages of exam. No calculators. IE 330 Seat # Ope book ad otes 120 miutes Cover page ad six pages of exam No calculators Score Fial Exam (example) Schmeiser Ope book ad otes No calculator 120 miutes 1 True or false (for each, 2 poits

More information

Problem Set 4 Due Oct, 12

Problem Set 4 Due Oct, 12 EE226: Radom Processes i Systems Lecturer: Jea C. Walrad Problem Set 4 Due Oct, 12 Fall 06 GSI: Assae Gueye This problem set essetially reviews detectio theory ad hypothesis testig ad some basic otios

More information

GUIDELINES ON REPRESENTATIVE SAMPLING

GUIDELINES ON REPRESENTATIVE SAMPLING DRUGS WORKING GROUP VALIDATION OF THE GUIDELINES ON REPRESENTATIVE SAMPLING DOCUMENT TYPE : REF. CODE: ISSUE NO: ISSUE DATE: VALIDATION REPORT DWG-SGL-001 002 08 DECEMBER 2012 Ref code: DWG-SGL-001 Issue

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

Statistics 511 Additional Materials

Statistics 511 Additional Materials Cofidece Itervals o mu Statistics 511 Additioal Materials This topic officially moves us from probability to statistics. We begi to discuss makig ifereces about the populatio. Oe way to differetiate probability

More information

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4 MATH 30: Probability ad Statistics 9. Estimatio ad Testig of Parameters Estimatio ad Testig of Parameters We have bee dealig situatios i which we have full kowledge of the distributio of a radom variable.

More information

The standard deviation of the mean

The standard deviation of the mean Physics 6C Fall 20 The stadard deviatio of the mea These otes provide some clarificatio o the distictio betwee the stadard deviatio ad the stadard deviatio of the mea.. The sample mea ad variace Cosider

More information

Access to the published version may require journal subscription. Published with permission from: Elsevier.

Access to the published version may require journal subscription. Published with permission from: Elsevier. This is a author produced versio of a paper published i Statistics ad Probability Letters. This paper has bee peer-reviewed, it does ot iclude the joural pagiatio. Citatio for the published paper: Forkma,

More information

Algebra of Least Squares

Algebra of Least Squares October 19, 2018 Algebra of Least Squares Geometry of Least Squares Recall that out data is like a table [Y X] where Y collects observatios o the depedet variable Y ad X collects observatios o the k-dimesioal

More information

Math 155 (Lecture 3)

Math 155 (Lecture 3) Math 55 (Lecture 3) September 8, I this lecture, we ll cosider the aswer to oe of the most basic coutig problems i combiatorics Questio How may ways are there to choose a -elemet subset of the set {,,,

More information

Bivariate Sample Statistics Geog 210C Introduction to Spatial Data Analysis. Chris Funk. Lecture 7

Bivariate Sample Statistics Geog 210C Introduction to Spatial Data Analysis. Chris Funk. Lecture 7 Bivariate Sample Statistics Geog 210C Itroductio to Spatial Data Aalysis Chris Fuk Lecture 7 Overview Real statistical applicatio: Remote moitorig of east Africa log rais Lead up to Lab 5-6 Review of bivariate/multivariate

More information

(all terms are scalars).the minimization is clearer in sum notation:

(all terms are scalars).the minimization is clearer in sum notation: 7 Multiple liear regressio: with predictors) Depedet data set: y i i = 1, oe predictad, predictors x i,k i = 1,, k = 1, ' The forecast equatio is ŷ i = b + Use matrix otatio: k =1 b k x ik Y = y 1 y 1

More information

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics Explorig Data: Distributios Look for overall patter (shape, ceter, spread) ad deviatios (outliers). Mea (use a calculator): x = x 1 + x 2 + +

More information

The z-transform. 7.1 Introduction. 7.2 The z-transform Derivation of the z-transform: x[n] = z n LTI system, h[n] z = re j

The z-transform. 7.1 Introduction. 7.2 The z-transform Derivation of the z-transform: x[n] = z n LTI system, h[n] z = re j The -Trasform 7. Itroductio Geeralie the complex siusoidal represetatio offered by DTFT to a represetatio of complex expoetial sigals. Obtai more geeral characteristics for discrete-time LTI systems. 7.

More information

Section 9.2. Tests About a Population Proportion 12/17/2014. Carrying Out a Significance Test H A N T. Parameters & Hypothesis

Section 9.2. Tests About a Population Proportion 12/17/2014. Carrying Out a Significance Test H A N T. Parameters & Hypothesis Sectio 9.2 Tests About a Populatio Proportio P H A N T O M S Parameters Hypothesis Assess Coditios Name the Test Test Statistic (Calculate) Obtai P value Make a decisio State coclusio Sectio 9.2 Tests

More information

Efficient GMM LECTURE 12 GMM II

Efficient GMM LECTURE 12 GMM II DECEMBER 1 010 LECTURE 1 II Efficiet The estimator depeds o the choice of the weight matrix A. The efficiet estimator is the oe that has the smallest asymptotic variace amog all estimators defied by differet

More information

ECE 8527: Introduction to Machine Learning and Pattern Recognition Midterm # 1. Vaishali Amin Fall, 2015

ECE 8527: Introduction to Machine Learning and Pattern Recognition Midterm # 1. Vaishali Amin Fall, 2015 ECE 8527: Itroductio to Machie Learig ad Patter Recogitio Midterm # 1 Vaishali Ami Fall, 2015 tue39624@temple.edu Problem No. 1: Cosider a two-class discrete distributio problem: ω 1 :{[0,0], [2,0], [2,2],

More information

April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE

April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE TERRY SOO Abstract These otes are adapted from whe I taught Math 526 ad meat to give a quick itroductio to cofidece

More information

Dr. Maddah ENMG 617 EM Statistics 11/26/12. Multiple Regression (2) (Chapter 15, Hines)

Dr. Maddah ENMG 617 EM Statistics 11/26/12. Multiple Regression (2) (Chapter 15, Hines) Dr Maddah NMG 617 M Statistics 11/6/1 Multiple egressio () (Chapter 15, Hies) Test for sigificace of regressio This is a test to determie whether there is a liear relatioship betwee the depedet variable

More information

First, note that the LS residuals are orthogonal to the regressors. X Xb X y = 0 ( normal equations ; (k 1) ) So,

First, note that the LS residuals are orthogonal to the regressors. X Xb X y = 0 ( normal equations ; (k 1) ) So, 0 2. OLS Part II The OLS residuals are orthogoal to the regressors. If the model icludes a itercept, the orthogoality of the residuals ad regressors gives rise to three results, which have limited practical

More information

M1 for method for S xy. M1 for method for at least one of S xx or S yy. A1 for at least one of S xy, S xx, S yy correct. M1 for structure of r

M1 for method for S xy. M1 for method for at least one of S xx or S yy. A1 for at least one of S xy, S xx, S yy correct. M1 for structure of r Questio 1 (i) EITHER: 1 S xy = xy x y = 198.56 1 19.8 140.4 =.44 x x = 1411.66 1 19.8 = 15.657 1 S xx = y y = 1417.88 1 140.4 = 9.869 14 Sxy -.44 r = = SxxSyy 15.6579.869 = 0.76 1 S yy = 14 14 M1 for method

More information

4. Partial Sums and the Central Limit Theorem

4. Partial Sums and the Central Limit Theorem 1 of 10 7/16/2009 6:05 AM Virtual Laboratories > 6. Radom Samples > 1 2 3 4 5 6 7 4. Partial Sums ad the Cetral Limit Theorem The cetral limit theorem ad the law of large umbers are the two fudametal theorems

More information

A statistical method to determine sample size to estimate characteristic value of soil parameters

A statistical method to determine sample size to estimate characteristic value of soil parameters A statistical method to determie sample size to estimate characteristic value of soil parameters Y. Hojo, B. Setiawa 2 ad M. Suzuki 3 Abstract Sample size is a importat factor to be cosidered i determiig

More information

Important Formulas. Expectation: E (X) = Σ [X P(X)] = n p q σ = n p q. P(X) = n! X1! X 2! X 3! X k! p X. Chapter 6 The Normal Distribution.

Important Formulas. Expectation: E (X) = Σ [X P(X)] = n p q σ = n p q. P(X) = n! X1! X 2! X 3! X k! p X. Chapter 6 The Normal Distribution. Importat Formulas Chapter 3 Data Descriptio Mea for idividual data: X = _ ΣX Mea for grouped data: X= _ Σf X m Stadard deviatio for a sample: _ s = Σ(X _ X ) or s = 1 (Σ X ) (Σ X ) ( 1) Stadard deviatio

More information

Categorical Data Analysis

Categorical Data Analysis Categorical Data Aalysis Refereces : Ala Agresti, Categorical Data Aalysis, Wiley Itersciece, New Jersey, 2002 Bhattacharya, G.K., Johso, R.A., Statistical Cocepts ad Methods, Wiley,1977 Outlie Categorical

More information

Output Analysis (2, Chapters 10 &11 Law)

Output Analysis (2, Chapters 10 &11 Law) B. Maddah ENMG 6 Simulatio Output Aalysis (, Chapters 10 &11 Law) Comparig alterative system cofiguratio Sice the output of a simulatio is radom, the comparig differet systems via simulatio should be doe

More information

Inverse Matrix. A meaning that matrix B is an inverse of matrix A.

Inverse Matrix. A meaning that matrix B is an inverse of matrix A. Iverse Matrix Two square matrices A ad B of dimesios are called iverses to oe aother if the followig holds, AB BA I (11) The otio is dual but we ofte write 1 B A meaig that matrix B is a iverse of matrix

More information

Stat 200 -Testing Summary Page 1

Stat 200 -Testing Summary Page 1 Stat 00 -Testig Summary Page 1 Mathematicias are like Frechme; whatever you say to them, they traslate it ito their ow laguage ad forthwith it is somethig etirely differet Goethe 1 Large Sample Cofidece

More information

Infinite Sequences and Series

Infinite Sequences and Series Chapter 6 Ifiite Sequeces ad Series 6.1 Ifiite Sequeces 6.1.1 Elemetary Cocepts Simply speakig, a sequece is a ordered list of umbers writte: {a 1, a 2, a 3,...a, a +1,...} where the elemets a i represet

More information

Goodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen)

Goodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen) Goodess-of-Fit Tests ad Categorical Data Aalysis (Devore Chapter Fourtee) MATH-252-01: Probability ad Statistics II Sprig 2019 Cotets 1 Chi-Squared Tests with Kow Probabilities 1 1.1 Chi-Squared Testig................

More information

3. Z Transform. Recall that the Fourier transform (FT) of a DT signal xn [ ] is ( ) [ ] = In order for the FT to exist in the finite magnitude sense,

3. Z Transform. Recall that the Fourier transform (FT) of a DT signal xn [ ] is ( ) [ ] = In order for the FT to exist in the finite magnitude sense, 3. Z Trasform Referece: Etire Chapter 3 of text. Recall that the Fourier trasform (FT) of a DT sigal x [ ] is ω ( ) [ ] X e = j jω k = xe I order for the FT to exist i the fiite magitude sese, S = x [

More information

SNAP Centre Workshop. Basic Algebraic Manipulation

SNAP Centre Workshop. Basic Algebraic Manipulation SNAP Cetre Workshop Basic Algebraic Maipulatio 8 Simplifyig Algebraic Expressios Whe a expressio is writte i the most compact maer possible, it is cosidered to be simplified. Not Simplified: x(x + 4x)

More information

[ ] ( ) ( ) [ ] ( ) 1 [ ] [ ] Sums of Random Variables Y = a 1 X 1 + a 2 X 2 + +a n X n The expected value of Y is:

[ ] ( ) ( ) [ ] ( ) 1 [ ] [ ] Sums of Random Variables Y = a 1 X 1 + a 2 X 2 + +a n X n The expected value of Y is: PROBABILITY FUNCTIONS A radom variable X has a probabilit associated with each of its possible values. The probabilit is termed a discrete probabilit if X ca assume ol discrete values, or X = x, x, x 3,,

More information

Information-based Feature Selection

Information-based Feature Selection Iformatio-based Feature Selectio Farza Faria, Abbas Kazeroui, Afshi Babveyh Email: {faria,abbask,afshib}@staford.edu 1 Itroductio Feature selectio is a topic of great iterest i applicatios dealig with

More information

GUIDE FOR THE USE OF THE DECISION SUPPORT SYSTEM (DSS)*

GUIDE FOR THE USE OF THE DECISION SUPPORT SYSTEM (DSS)* GUIDE FOR THE USE OF THE DECISION SUPPORT SYSTEM (DSS)* *Note: I Frech SAD (Système d Aide à la Décisio) 1. Itroductio to the DSS Eightee statistical distributios are available i HYFRAN-PLUS software to

More information

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1.

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1. Eco 325/327 Notes o Sample Mea, Sample Proportio, Cetral Limit Theorem, Chi-square Distributio, Studet s t distributio 1 Sample Mea By Hiro Kasahara We cosider a radom sample from a populatio. Defiitio

More information

DISTRIBUTION LAW Okunev I.V.

DISTRIBUTION LAW Okunev I.V. 1 DISTRIBUTION LAW Okuev I.V. Distributio law belogs to a umber of the most complicated theoretical laws of mathematics. But it is also a very importat practical law. Nothig ca help uderstad complicated

More information

Table 12.1: Contingency table. Feature b. 1 N 11 N 12 N 1b 2 N 21 N 22 N 2b. ... a N a1 N a2 N ab

Table 12.1: Contingency table. Feature b. 1 N 11 N 12 N 1b 2 N 21 N 22 N 2b. ... a N a1 N a2 N ab Sectio 12 Tests of idepedece ad homogeeity I this lecture we will cosider a situatio whe our observatios are classified by two differet features ad we would like to test if these features are idepedet

More information

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara Poit Estimator Eco 325 Notes o Poit Estimator ad Cofidece Iterval 1 By Hiro Kasahara Parameter, Estimator, ad Estimate The ormal probability desity fuctio is fully characterized by two costats: populatio

More information

Response Variable denoted by y it is the variable that is to be predicted measure of the outcome of an experiment also called the dependent variable

Response Variable denoted by y it is the variable that is to be predicted measure of the outcome of an experiment also called the dependent variable Statistics Chapter 4 Correlatio ad Regressio If we have two (or more) variables we are usually iterested i the relatioship betwee the variables. Associatio betwee Variables Two variables are associated

More information

1 Models for Matched Pairs

1 Models for Matched Pairs 1 Models for Matched Pairs Matched pairs occur whe we aalyse samples such that for each measuremet i oe of the samples there is a measuremet i the other sample that directly relates to the measuremet i

More information

Statisticians use the word population to refer the total number of (potential) observations under consideration

Statisticians use the word population to refer the total number of (potential) observations under consideration 6 Samplig Distributios Statisticias use the word populatio to refer the total umber of (potetial) observatios uder cosideratio The populatio is just the set of all possible outcomes i our sample space

More information

Chimica Inorganica 3

Chimica Inorganica 3 himica Iorgaica Irreducible Represetatios ad haracter Tables Rather tha usig geometrical operatios, it is ofte much more coveiet to employ a ew set of group elemets which are matrices ad to make the rule

More information

THE KALMAN FILTER RAUL ROJAS

THE KALMAN FILTER RAUL ROJAS THE KALMAN FILTER RAUL ROJAS Abstract. This paper provides a getle itroductio to the Kalma filter, a umerical method that ca be used for sesor fusio or for calculatio of trajectories. First, we cosider

More information

This is an introductory course in Analysis of Variance and Design of Experiments.

This is an introductory course in Analysis of Variance and Design of Experiments. 1 Notes for M 384E, Wedesday, Jauary 21, 2009 (Please ote: I will ot pass out hard-copy class otes i future classes. If there are writte class otes, they will be posted o the web by the ight before class

More information

Statistics Lecture 27. Final review. Administrative Notes. Outline. Experiments. Sampling and Surveys. Administrative Notes

Statistics Lecture 27. Final review. Administrative Notes. Outline. Experiments. Sampling and Surveys. Administrative Notes Admiistrative Notes s - Lecture 7 Fial review Fial Exam is Tuesday, May 0th (3-5pm Covers Chapters -8 ad 0 i textbook Brig ID cards to fial! Allowed: Calculators, double-sided 8.5 x cheat sheet Exam Rooms:

More information

(A sequence also can be thought of as the list of function values attained for a function f :ℵ X, where f (n) = x n for n 1.) x 1 x N +k x N +4 x 3

(A sequence also can be thought of as the list of function values attained for a function f :ℵ X, where f (n) = x n for n 1.) x 1 x N +k x N +4 x 3 MATH 337 Sequeces Dr. Neal, WKU Let X be a metric space with distace fuctio d. We shall defie the geeral cocept of sequece ad limit i a metric space, the apply the results i particular to some special

More information