3 The Scalar Algebra of Variances, Covariances, and Correlations

Size: px
Start display at page:

Download "3 The Scalar Algebra of Variances, Covariances, and Correlations"

Transcription

1 3 The Scalar Algebra of Variaces, Covariaces, ad Correlatios I this chapter, we review the defiitios of some key statistical cocepts: meas, covariaces, ad correlatios. We show how the meas, variaces, covariaces, ad correlatios of variables are related whe the variables themselves are coected by oe or more liear equatios by developig the liear combiatio ad trasformatio rules. These rules, which hold both for lists of umbers ad radom variables, form the algebraic foudatios for regressio aalysis, factor aalysis, ad SEM. 3.1 Meas, Variaces, Covariaces, ad Correlatios I this sectio, we quickly review the basic defiitios of the statistical quatities at the heart of structural equatio modelig. Our approach is to keep the mathematical statistical cotet to a miimum while still coveyig the importat details Meas The mea of a statistical populatio is simply the arithmetic average of all the umbers i the populatio. Usually, we model statistical populatios as radom variables with a particular statistical distributio. Sice this is ot a course i probability theory, we will ot review the formal defiitio of a radom variable as a fuctio assigig umbers to outcomes, but rather use a very iformal otio of such a variable as a process that geerates umbers accordig to a specifiable system. The mea of the populatio ca be represeted as the expected value of the

2 42 structural equatio modelig radom variable. Defiitio (Expected Value of a Radom Variable) The expected value of a radom variable X, deoted E(X), is the log ru average of the umbers geerated by the radom variable X. This defiitio will usually prove adequate for our purposes i this text, although the vetera of a course or two i statistics udoubtedly kows the formal defiitios i terms of sums ad itegrals, ad recogizes that the formal defiitios are differet for discrete ad cotiuous radom variables. The arithmetic average of a sample of umbers take from some statistical populatio is referred to as the sample mea. This was defied already i sectio Deviatio Scores The deviatio score plays a importat role i statistical formulas. The defiitios are quite similar, for the sample ad the populatio. I each case, the deviatio score is simply the differece betwee the score ad the mea. Deviatio scores i a statistical populatio are obtaied by subtractig the populatio mea from every score i the populatio. If the populatio is modeled with a radom variable, the radom variable is coverted to deviatio score form by subtractig the populatio mea from each observatio, as described i the followig defiitio. Defiitio (Deviatio Score Radom Variable) The deviatio score form of a radom variable X is defied as [dx] = X E(X). Sample deviatio scores were defied aalogously i sectio

3 the scalar algebra of variaces, covariaces, ad correlatios Variaces ad Stadard Deviatios The variace of a statistical sample or populatio is a measure of how far the umbers are spread out aroud the mea. Sice the deviatio score reflects distace from the mea, perhaps the two most obvious measures of spread are the average absolute deviatio ad the average squared deviatio. For reasos of mathematical tractability, the latter is much preferred by iferetial statisticias. Defiitio (Populatio Variace) The variace of a statistical populatio, deoted Var(X) or σx 2, is the average squared deviatio score i that populatio. Hece, σx 2 = E (X E(X))2. A importat relatioship, which we give here without proof, is Defiitio (Populatio Variace Computatioal Formula) σx (X 2 = E 2) (E(X)) 2 (3.1) ( = E X 2) µ 2 (3.2) The populatio stadard deviatio relates to the variace aalogously to the correspodig sample quatities. Defiitio (The Populatio Stadard Deviatio) The populatio stadard deviatio is the square root of the populatio variace, i.e., σ X = σx 2 The atural trasfer of the otio of a populatio variace to a sample statistic would be to simply compute, for a sample of umbers, the average squared (sample)

4 44 structural equatio modelig deviatio score. Ufortuately, such a estimate teds to be, o average, lower tha the populatio variace, i.e., egatively biased. To correct this bias, we divide the sum of squared deviatios by 1, istead of. Defiitio (The Sample Variace) The sample variace for a set of scores o the variable X is defied as S 2 X = 1/( 1) (X i X ) 2 As a elemetary exercise i summatio algebra, it ca be show that the sample variace may also be computed via the followig equivalet, but more coveiet formula: ( S 2 X = 1/( 1) Xi 2 ( X i) 2 ) Defiitio (The Sample Stadard Deviatio) The sample stadard deviatio is the square root of the sample variace, i.e., S X = S 2 X 3.2 The Algebra of Meas The liear trasformatio is a importat cocept i statistics, because may elemetary statistical formulas ivolve liear trasformatios. Defiitio (Liear Trasformatio) Suppose we have two variables, X ad Y, represetig either radom variables or two lists of umbers. Suppose furthermore that the Y scores ca be expressed as liear fuctios of the X scores, that is, Y i = ax i + b for some costats a ad b. The we say that Y

5 the scalar algebra of variaces, covariaces, ad correlatios 45 is a liear trasformatio (or liear trasform ) of X. If the multiplicative costat a is positive, the the liear trasformatio is order-preservig The Liear Trasform Rule Both sample ad populatio meas behave i a predictable ad coveiet way if oe variable is a liear trasformatio of the other. Theorem (The Sample Mea of a Liear Trasform) Let Y ad X represet two lists of umbers. If, for each Y i, we have Y i = ax i + b, the Y = ax + b. Proof. For lists of umbers, we simply substitute defiitios ad employ elemetary summatio algebra. That is, Y = (1/) = (1/) = (1/) = a(1/) Y i = ax + b (ax i + b) ax i + (1/) X i + (1/)b b The secod lie above follows from the distributive rule of summatio algebra, give o page 23. The third lie uses the secod costat rule of summatio algebra give o page 22, for the left term, ad the first costat rule (page 20), for the right. The theorem actually icludes more fudametal rules about the sample mea as special cases. First, by lettig b = 0 ad rememberig that a ca represet either multiplicatio or divisio by a costat, we see that multiplyig or dividig a variable (or every umber i a sample) by a costat multiplies or divides its mea by the same costat. Secod, by lettig a = 1, ad recallig that b ca represet either additio or subtractio, we see that addig or subtractig a costat from a variable adds or subtracts that costat from the mea of that variable.

6 46 structural equatio modelig After quickly metioig two additioal results, we shall costruct a M-algebra or algebra of meas. Subsequetly, we ll discover that this simple set of rules allows us to derive may importat results. Theorem (Mea of a Costat) Suppose that, over a list of umbers X, X i ever varies. For example, suppose X i = b for all i. The X = b. The proof of this result is left to a exercise. Theorem (Mea of a Sum) Suppose two lists X ad Y of equal legth are added together to produce a ew list W. The W = X + Y. Proof W = 1 = 1 W i (X i + Y i ) = 1 ( X i + = 1 X i + 1 = X + Y Y i Y i ) (byequatio Costructig a M-Algebra I the precedig sectio, we derived 3 key results: (a) The mea of a liear trasformatio is the same trasformatio rule applied to the origial mea, i.e., if Y = ax + b, the Y = ax + b. (b) The mea of a variable that stays costat at b is equal to b.

7 the scalar algebra of variaces, covariaces, ad correlatios 47 (c) The mea of a sum of two variables is the sum of their meas. Now we ll simply restate these rules i a slightly differet form. I what follows, defie capital letters as variable ames ad lower-case letters as costats. Defie the mea operator M to represet the sample mea of whatever quatity it is applied to. For example, M(X) represets the sample mea of a variable X represetig a list of umbers X i. The the followig rules represet the algebra of sample meas. (a) M(a) = a Note that the third rule also implies that M(X Y) = M(X) M(Y) (b) M(aX + b) = am(x) + b (c) M(X + Y) = M(X) + M(Y) We ca use this algebra to prove may thigs about sample meas more compactly tha we ca with summatio algebra. The mai hurdle is simply gettig used to the otatio. Example (Mea of a Set of Deviatio Scores) A set of scores i the variable X ca be coverted to deviatio scores via the formula [dx] = X M(X). Now it is absolutely trivial to prove that the mea of a set of deviatio scores is zero. Here we go: The third lie combies the first rule with the fact that M(X) is itself a costat. M([dX]) = M(X M(X)) = M(X) M(M(X)) (Third Rule of M-algebra) = M(X) M(X) (First Rule of M-algebra) = The Algebra of Expected Values The algebra of sample meas we developed i the precedig sectio has a couterpart for radom variables. Recall that the mea of a radom variable X is called its expected value ad is deoted E(X). For radom variables X ad Y, ad costats a ad b, the three primary laws of expected value algebra are:

8 48 structural equatio modelig (a) E(a) = a (b) E(aX + b) = ae(x) + b (c) E(X + Y) = E(X) + E(Y) These rules for both discrete ad cotiuous radom variables are a straightforward cosequece of the formulas for expected value i the discrete ad cotiuous cases, ad we omit them here. However, the astute reader should ask the followig very serious questio: would ot ay reasoable measure of cetral tedecy have to follow these rules? 3.4 Liear Trasform Rules for Variaces ad Stadard Deviatios I this sectio, we examie the effect of a liear trasformatio o the variace ad stadard deviatio. Sice both the variace ad stadard deviatio are measures o deviatio scores, we begi by ivestigatig the effect of a liear trasformatio of raw scores o the correspodig deviatio scores. We fid that the multiplicative costat comes straight through i the deviatio scores, while a additive or subtractive costat has o effect o deviatio scores. Theorem (Liear Trasforms ad Deviatio Scores) Suppose a variable X is is trasformed liearly ito a variable Y, i.e., Y = ax + b. The the Y deviatio scores must relate to the X deviatio scores via the equatio [dy] = a[dx]. Proof First, cosider the case of sets of scores. If Y i = ax i + b, the [dy] i = Y i Y = (ax i + b) Y = (ax i + b) (ax + b) [Theorem 3.2.1] = ax i + b ax b = a(x i X ) = a[dx] i

9 the scalar algebra of variaces, covariaces, ad correlatios 49 Next, cosider the case of radom variables. We have [dy] = Y E(Y) = ax + b ae(x) + b [Theorem 3.2.1] = a(x E(X)) Oe may also prove the result for samples of size usig the M-algebra. The proof is essetially the same as the proof for radom variables, with M substituted for E throughouth. This completes the proof. = a[dx] A simple umerical example will help cocretize these results. Cosider the origial set of scores show i Table 3.1. Their deviatio score equivalets are 1, 0, ad +1, as show. Suppose we geerate a ew set of Y scores usig the equatio Y = 2X + 5. The ew deviatio scores are twice the old set of deviatio scores. The multiplicative costat 2 has a effect o the deviatio scores, but the additive costat 5 does ot. From the precedig results, it is easy to deduce the effect of a liear trasformatio o the variace ad stadard deviatio. Sice additive costats have o effect o deviatio scores, they caot affect either the variace or the stadard deviatio. O the other had, multiplicative costats costats come straight through i the deviatio scores, ad this is reflected i the stadard deviatio ad the variace. [dx] X Y = 2X + 5 [dy] Table 3.1: Effect of a Liear Trasform o Deviatio Scores Theorem (Effect of a Liear Trasform o the Variace ad SD) Suppose a variable X is trasformed ito Y via the liear trasform Y = ax + b. The, for radom variables, the followig results hold. σ Y = a σ X σ 2 Y = a2 σ 2 X Similar results hold for samples of umbers, i.e., S Y = a S X S 2 Y = a2 S 2 X

10 50 structural equatio modelig Proof Cosider the case of the variace of sample scores. By defiitio, S 2 Y = 1/( 1) = 1/( 1) = 1/( 1) N = a 2 (1/( 1) = a 2 S 2 X For radom variables, we have σ 2 Y = E(Y E(Y)) 2 [dy] 2 i (a[dy] i ) 2 (from Theorem 3.4.1) a 2 [dx] 2 i [dx] 2 i = E(aX + b E(aX + b)) 2 = E(aX + b (ae(x) + b)) 2 (from Theorem 3.2.1) = E(aX + b ae(x) b) 2 = E(aX ae(x)) 2 = E(a 2 (X E(X)) 2 ) = a 2 E(X E(X)) 2 (from 2d rule, Sectio 3.3) = a 2 σ 2 X ) 3.5 Stadard Scores Havig deduced the effect of a liear trasformatio o the mea ad variace of a radom variable or a set of sample scores, we are ow better equipped to uderstad the otio of stadard scores. Stadard scores are scores that have a particular desired mea ad stadard deviatio. So log as a set of scores are ot all equal, it

11 the scalar algebra of variaces, covariaces, ad correlatios 51 is always possible to trasform them, by a liear trasformatio, so that they have a particular desired mea ad stadard deviatio. Theorem (Liear Stadardizatio Rules) Suppose we restrict ourselves to the class of order-preservig liear trasformatios, of the form Y = ax + b, with a > 0. If a set of scores curretly has mea X ad stadard deviatio S X, ad we wish the scores to have a mea Y ad stadard deviatio S Y, we ca achieve the desired result with a liear trasformatio with a = S Y S X ad b = Y ax Similar rules hold for radom variables, i.e., a radom variable X with mea µ X ad stadard deviatio σ X may be liearly trasformed ito a radom variable Y with mea µ Y ad stadard deviatio σ Y by usig ad a = σ Y σ X b = µ Y aµ X Example (Rescalig Grades) Result is used frequetly to rescale grades i uiversity courses. Suppose, for example, the origial grades have a mea of 70 ad a stadard deviatio of 8, but the typical scale for grades is a mea of 68 ad a stadard deviatio of 12. I that case, we have a = S Y S X = 12 8 = 1.5 b = Y ax = 68 (1.5)(70) = 37

12 52 structural equatio modelig Suppose that there is a particular desired metric (i.e., desired mea ad stadard deviatio) that scores are to be reported i, ad that you were seekig a formula for trasformig ay set of data ito this desired metric. For example, you seek a formula that will trasform ay set of X i ito Y i with a mea of 500 ad a stadard deviatio of 100. Result might make it seem at first glace that such a formula does ot exist, i.e., that the formula would chage with each ew data set. This is, of course, true i oe sese. However, i aother sese, we ca write a sigle formula that will do the job o ay data set. We begi by defiig Z-scores. Defiitio (Z-Scores) Z-scores are scores that have a mea of 0 ad a stadard deviatio of 1. Ay set of scores x i that are ot all equal ca be coverted ito Z-scores by the followig trasformatio rule z i = x i x S X We say that a radom variable is i Z-score form if it has a mea of 0 ad a stadard deviatio of 1. Ay radom variable X havig expected value µ ad stadard deviatio σ may be coverted to Z-score form via the trasformatio Z = X µ σ That the Z-score trasformatio rule accomplishes its desired purpose is derived easily from the liear trasformatio rules. Specifically, whe we subtract the curret mea from a radom variable (or a list of umbers), we do ot affect the stadard deviatio, while we reduce the mea to zero. At that poit, we have a list of umbers with a mea of 0 ad the origial, uchaged stadard deviatio. Whe we ext divide by the stadard deviatio, the mea remais at zero, while the stadard deviatio chages to 1. Oce the scores are i Z-score form, the liear trasformatio rules reveal that it is very easy to trasform them ito ay other desired metric. Suppose, for example, we wished to trasform the scores to have a mea of 500 ad a stadard deviatio of 100. Sice the mea is 0 ad the stadard deviatio 1, multiplyig by 100 will chage the stadard deviatio to 100 while leavig the mea at zero. If we the

13 the scalar algebra of variaces, covariaces, ad correlatios 53 add 500 to every value, the resultig scores will have a mea of 500 ad a stadard deviatio of 100. Cosequetly, the followig trasformatio rule will chage ay set of X scores ito Y scores with a mea of 500 ad a stadard deviatio of 100, so log as the origial scores are ot all equal. ( ) xi x y i = (3.3) S X The part of Equatio 3.3 i paretheses trasforms the scores ito Z-score form. Multiplyig by 100 moves the stadard deviatio to 100 while leavig the mea at 0. Addig 500 the moves the mea to 500 while leavig the stadard deviatio at 100. Z-scores also have a importat theoretical property, i.e., their ivariace uder liear trasformatio of the raw scores. Theorem (Ivariace Property of Z-Scores) Suppose you have two variables X ad Y, ad Y is a order-preservig liear trasformatio of X. Cosider X ad Y trasformed to Z-score variables [zx] ad [zy]. The [zx] = [zy]. The result holds for sets of sample scores, or for radom variables. Proof Suppose Y = ax + b, ad assume a > 0. The, by Theorems?? ad??, E(Y) = ae(x) + b, ad σ Y = aσ X. We the have [zy] = Y E(Y) σ Y ax + b (ae(x) + b) = aσ X = a(x E(X)) + (b b) aσ X = X E(X) σx = [zx] To costruct a simple example that dramatizes this result, we will employ a well-kow fudametal result i statistics.

14 54 structural equatio modelig Example (Properties of 3 Evely Spaced Numbers) Cosider a set of 3 evely spaced umbers, with middle value M ad spacig D, the differece betwee adjacet umbers. For example, the umbers 3,6,9 have M = 6 ad D = 3. Prove that, for ay such set of umbers, the sample mea X is equal to the middle value M, ad the sample stadard deviatio S X is equal to D, the spacig. Solutio The proof is straightforward if you set up the otatio properly. We may represet the ordered list of 3 umbers as X 1 = M D, X 2 = M, ad X 3 = M + D. We the have X = X i = 1 3 (X 1 + X 2 + X 3 ) = 1 (M D + M + M + D) 3 = 1 3 (3M) = M We the have [dx] 1 = D, [dx] 2 = 0, ad [dx] 3 = D, whece S 2 X = [dx] 2 i = 1 2 ([dx]2 1 + [dx]2 2 + [dx]2 3 ) = 1 2 (D D 2 ) = 1 2 (2D2 ) = D 2

15 the scalar algebra of variaces, covariaces, ad correlatios 55 [zx] X Y = 2X 5 [zy] Table 3.2: Ivariace of Z-Scores Uder Liear Trasformatio ad so S X = D. This result makes it easy to costruct simple data sets with a kow mea ad stadard deviatio. It also makes it easy to compute the mea ad variace of 3 evely spaced umbers by ispectio. Example (Ivariace of Z-Scores Uder Liear Trasformatio) Cosider the data i Table 3.2. The X raw scores are evely spaced, so it is easy to see they have a mea of 100 ad a stadard deviatio of 15. The [zx] scores are +1, 0, ad 1 respectively. Now, suppose we liearly trasform the X raw scores ito Y scores with the equatio Y = 2X 5. The ew scores, as ca be predicted from the liear trasformatio rules, will have a mea give by Y = 2x 5 = 2(100) 5 = 195, ad a stadard deviatio of S Y = 2S X = 2(15) = 30 Sice the Y raw scores were obtaied from the X scores by liear trasformatio, we ca tell, without computig them, that the [zy] scores must also be +1, 0, ad 1. This is easily verified by ispectio. A importat implicatio of Result is that ay statistic that is solely a fuctio of Z-scores must be ivariat uder liear trasformatios of the observed variables.

16 56 structural equatio modelig 3.6 Covariaces ad Correlatios Multivariate aalysis deals with the study of sets of variables, observed o oe or more populatios, ad the way these variables covary, or behave joitly. A atural measure of how variables vary together is the covariace, based o the followig simple idea. Suppose there is a depedecy betwee two variables X ad Y, reflected i the fact that high values of X ted to be associated with high values of Y, ad low values of X ted to be associated with low values of Y. We might refer to this as a direct (positive) relatioship betwee X ad Y. If such a relatioship exists, ad we covert X ad Y to deviatio score form, the cross-product of the deviatio scores should ted to be positive, because whe a X score is above its mea, a Y score will ted to be above its mea (i which case both deviatio scores will be positive), ad whe X is below its mea, Y will ted to be below its mea (i which case both deviatio scores will be egative, ad their product positive). Covariace is simply the average cross-product of deviatio scores. Defiitio (Populatio Covariace) The covariace of two radom variables, X ad Y, deoted σ XY, is the average cross-product of deviatio scores, computed as σ XY = E [(X E(X))(Y E(Y))] (3.4) = E(XY) E(X)E(Y) (3.5) Provig the equality of Equatios 3.4 ad 3.5 is a elemetary exercise i expected value algebra that is give i may mathematical statistics texts. Equatio 3.5 is geerally more coveiet i simple proofs. Whe defiig a sample covariace, we average by 1 istead of, to maitai cosistecy with the formulas for the sample variace ad stadard deviatio.

17 the scalar algebra of variaces, covariaces, ad correlatios 57 Defiitio (Sample Covariace) The sample covariace of X ad Y is defied as S XY = 1/( 1) = 1/( 1) = 1/( 1) [dx] i [dy] i (3.6) ( Y i Y i X i Y i (X i X )(Y i Y ) (3.7) ) (3.8) Note that the variace of a variable may be thought of as its covariace with itself. Hece, i some mathematical discussios, the otatio σ XX is used (i.e., the covariace of X with itself) is used to sigify the variace of X. The covariace, beig a measure o deviatio scores, is ivariat uder additio or subtractio of a costat. O the other had, multiplicatio or divisio of a costat affects the covariace, as described i the followig result. Theorem (Effect of a Liear Trasform o Covariace) Suppose the variables X ad Y are trasformed ito the variables W ad M via the liear trasformatios W = ax + b ad M = cy + d. The the covariace betwee the ew variables W ad M relates to the origial covariace i the followig way. For radom variables, we have For samples of scores, we have σ WM = acσ XY S WM = acs XY

18 58 structural equatio modelig Proof By Theorem 3.4.1, we kow that [dw] = a[dx] ad [dm] = c[dy]. So, for sample data, we have S WM = For radom variables, we have = = ac 1 = acs XY [dw] i [dm] i a[dx] i c[dy] i σ WM = E[dW][dM] [dx] i [dy] i = E(a[dX]c[dY]) = ace([dx][dy]) = acσ XY Covariaces are importat i statistical theory. (If they were ot, why would we be studyig the aalysis of covariace structures?) However, the covariace betwee two variables does ot have a stable iterpretatio if the variables chage scale. For example, the covariace betwee height ad weight chages if you measure height i iches as opposed to cetimeters. The value will be 2.54 times as large if you measure height i cetimeters. So, for example, if someoe says The covariace betwee height ad weight is 65, it is impossible to tell what this implies without kowig the measuremet scales for height ad weight The Populatio Correlatio As we oted i the precedig sectio, the covariace is ot a scale-free measure of covariatio. To stabilize the covariace across chages i scale, oe eed oly divide it by the product of the stadard deviatios of the two variables. The resultig statistic, ρ xy, the Pearso Product Momet Correlatio Coefficiet, is defied as

19 the scalar algebra of variaces, covariaces, ad correlatios 59 the average cross-product of Z scores. From Result 3.5.3, we ca deduce without effort oe importat characteristic of the Pearso correlatio, i.e., it is ivariat uder chages of scale of the variables. So, for example, the correlatio betwee height ad weight will be the same whether you measure height i iches or cetimeters, weight i pouds or kilograms. Defiitio (The Populatio Correlatio ρ xy ) For two radom variables X ad Y the Pearso Product Momet Correlatio Coefficiet, is defied as Alteratively, oe may defie the correlatio as ρ XY = E([zX][zY]) (3.9) ρ XY = σ XY σ X σ Y (3.10) The Sample Correlatio The sample correlatio coefficiet is defied aalogously to the populatio quatity. We have Defiitio (The Sample Correlatio r xy ) For pairs of scores X i ad Y i the Pearso Product Momet Correlatio Coefficiet, is defied as r XY = 1/( 1) [zx] i [zy] i (3.11) There are a umber of alterative computatioal formulas, such as r XY = S XY S X S Y, (3.12)

20 60 structural equatio modelig or its fully explicit expasio r XY = X i Y i X i Y i ( X 2 i ( X i ) 2) ( Y 2 i ( Y i ) 2) (3.13) Sice all summatios i Equatio 3.13 ru from 1 to, I have simplified the otatio by elimiatig them. With moder statistical software, actual had computatio of the correlatio coefficiet is a rare evet. 3.7 Liear Combiatio Rules The liear combiatio (or LC) is a key idea i statistics. While studyig factor aalysis, structural equatio modelig, ad related methods, we will ecouter liear combiatios repeatedly. Uderstadig how they behave is importat to the study of SEM. I this sectio, we defie liear combiatios, ad describe their statistical behavior Defiitio of a Liear Combiatio Defiitio (Liear Combiatio) A liear combiatio (or liear composite) L of J radom variables X j is ay weighted sum of those variables, i.e.., L = J c j X j (3.14) j=1 For sample data, i a data matrix X with rows ad J colums, represetig scores o each of J variables, a liear combiatio score for the ith set of observatios is ay expressio of the form L i = J c j X ij (3.15) j=1 The c j i Equatios 3.14 ad 3.15 are called liear weights.

21 the scalar algebra of variaces, covariaces, ad correlatios 61 It is importat to be able to examie a statistical formula, decide whether or ot it cotais or represets a LC, ad idetify the liear weights, with their correct sig. A simple example is give below. Example (Course Grades as a Liear Combiatio) Suppose a istructor aouces at the begiig of a course that the course grade will be produced by takig the mid-term exam grade, addig twice the fial exam grade, ad dividig by 3. If the fial grade is G, the mid-term is X, ad the fial exam Y, the the grades are computed usig the formula G = X + 2Y 3 = 1 3 X Y (3.16) A example of such a situatio is show i Table 3.3. I this case, the course grade is a liear combiatio, ad the liear weights are +1/3 ad +2/3. Note: Table 3.3 gives the meas ad stadard deviatios for the mid-term exam (X), the fial exam (Y), ad the course grade (G). I subsequet sectios of this chapter, we show how the mea ad stadard deviatio for G ca be calculated from a kowledge of the mea ad stadard deviatio of X ad Y, the liear weights that produced G, ad the covariace betwee X ad Y Mea Of A Liear Combiatio Whe several variables are liearly combied, the resultig LC has a mea that ca be expressed as a liear fuctio of the meas of origial variables. This result, give here without proof, is the foudatio of a umber of importat statistical methods. Studet Mid-term Fial Grade X Y G Judi Fred Albert Mea SD Table 3.3: Course Grades as a Liear Combiatio Theorem (Mea of a Liear Combiatio) Give J radom variables X j havig meas µ j, the mea of a liear combiatio L = J c j X j j=1

22 62 structural equatio modelig is µ L = J c j µ j j=1 For a set of N sample scores o J variables, with liear combiatio scores computed as J l i = c j X ij, j=1 the mea of the liear combiatio scores is give by l = J c j X j j=1 This simple, but very useful result allows oe to compute the mea of a LC without actually calculatig LC scores. Example (Computig the Mea of a Liear Combiatio) Cosider agai the data i Table 3.3. Oe may calculate the mea of the fial grades by calculatig the idividual grades, summig them, ad dividig by. A alterative way is to compute the mea from the meas of X ad Y. So, for example, the mea of the fial grades must be G = 1 3 X Y ( ) 1 = = = ( 2 3 ) Variace ad Covariace Of Liear Combiatios The variace of a LC also plays a very importat role i statistical theory.

23 the scalar algebra of variaces, covariaces, ad correlatios 63 Theorem (Variace of a Liear Combiatio) Give J radom variables X j havig variaces σj 2 = σ jj ad covariaces σ jk (betwee variables X j ad X k ), the variace of a liear combiatio is L = J c j X j j=1 σ 2 L = = J J j=1 k=1 J j=1 c j c k σ jk (3.17) c 2 j σ2 j + 2 J J 1 j=2 k=1 c j c k σ jk (3.18) For a set of sample scores o J variables, with liear combiatio scores computed as J L i = c j x ij, j=1 the variace of the liear combiatio scores is give by S 2 L = = J J j=1 k=1 J j=1 c j c k S jk (3.19) c 2 j S2 j + 2 J J 1 j=2 k=1 c j c k S jk (3.20) It is, of course, possible to defie more tha oe LC o the same set of variables. For example, cosider a persoality questioaire that obtais resposes from idividuals o a large umber of items. Several differet persoality scales might be calculated as liear combiatios of the items. (Items ot icluded i a scale have a liear weight of zero.) I that case, oe may be cocered with the covariace or correlatio betwee the two LCs. The followig theorem describes how the

24 64 structural equatio modelig covariace of two LCs may be calculated. Theorem (Covariace of Two Liear Combiatios) Cosider two liear combiatios L ad M of a set of J radom variables X j, havig variaces ad covariaces σ ij. (If i = j, a variace of variable i is beig referred to, otherwise a covariace betwee variables i ad j.) The two LCs are, ad L = M = J c j X j, j=1 J d j X j, j=1 The covariace σ LM betwee the two variables L ad M is give by σ LM = J J j=1 k=1 c j d k σ jk (3.21) The theorem geeralizes to liear combiatios of sample scores i the obvious way. For a set of sample scores o J variables, with liear combiatio scores computed as ad L i = M i = J c j X ij j=1 J d j X ij j=1 the covariace of the liear combiatio scores is give by S LM = J J j=1 k=1 c j d k S jk (3.22)

25 the scalar algebra of variaces, covariaces, ad correlatios 65 Some commets are i order. First, ote that Theorem icludes the result of Theorem as a special case (simply let L = M, recallig that the variace of a variable is its covariace with itself). Secod, the results of both theorems ca be expressed as a heuristic rule that is easy to apply i simple cases A Geeral Heuristic Rule for Liear Trasoforms ad Liear Combiatios We ca combie our previous rules for liear trasformatios ad combiatios ito a sigle heuristic. To compute a variace of a liear combiatio or trasformatio, for example L = X + Y, perform the followig steps: (a) Express the liear combiatio or trasformatio i algebraic form; (b) Square the expressio; X + Y (X + Y) 2 = X 2 + Y 2 + 2XY (c) Trasform the result with the followig coversio rules: (a) Wherever a squared variable appears, replace it with the variace of the variable. (b) Wherever the product of two variables appears, replace it with the covariace of the two variables. (c) Ay expressio that does ot cotai either the square of a variable or the product of two variables is elimiated. X 2 + Y 2 + 2XY σ 2 X + σ2 Y + 2σ XY To calculate the covariace of two liear combiatios, replace the first two steps i the heuristic rules as follows: (a) Write the two liear combiatios side-by-side, for example; (X + Y) (X 2Y)

26 66 structural equatio modelig (b) Compute their algebraic product; X 2 XY 2Y 2 The coversio rule described i step 3 above is the applied, yieldig, with the preset example σ 2 X 2σ XY 2σ 2 Y The above result applies idetically to radom variables or sets of sample scores. It provides a coveiet method for derivig a umber of classic results i statistics. A well kow example is the differece betwee two variables, or, i the sample case, the differece betwee two colums of umbers. Example (Variace of Differece Scores) Suppose you gather pairs of scores X i ad Y i for people, ad compute, for each perso, a differece score as D i = X i Y i Express the sample variace S 2 D of the differece scores as a fuctio of S2 X, S 2 Y, ad S XY. Solutio. First apply the heuristic rule. Squarig X Y, we obtai X 2 + Y 2 2XY. Apply the coversio rules, ad the result is S 2 D = S2 X + S2 Y 2S XY Example (Variace of Course Grades) The data i Table 3.3 give the variaces of the X ad Y tests. Sice the deviatio scores are easily calculated, we ca calculate the covariace S XY = 1/( 1) [dx][dy] betwee the two exams as 20. Sice the fial grades are a liear combiatio of the exam grades, we ca geerate a theoretical formula for the variace of the fial

27 the scalar algebra of variaces, covariaces, ad correlatios 67 grades. Sice G = 1/3X + 2/3Y, applicatio of the heuristic rule shows that S 2 G = 1/9S2 X + 4/9S2 Y + 4/9S XY = 1/9(100) + 4/9(16) + 4/9( 20) = 84/9 = 9.33 The stadard deviatio of the fial grades is thus 9.33 = Oe aspect of the data i Table 3.3 is uusual, i.e., the scores o the two exams are egatively correlated. I practice, of course, this is ulikely to happe uless the class size is very small ad the exams uusual. Normally, oe observes a substatially positive correlatio betwee grades o several exams i the same course. I the followig example, we use LC theory to explai a pheomeo, which we might call variace shrikage, that has caught may a teacher by surprise whe computig stadardized course grades. Example (Variace Shrikage i Course Grades) Suppose a istructor is teachig a course i which the grades are computed simply by averagig the marks from two exams. The istructor is told by the uiversity admiistratio that, i this course, fial grades should be stadardized to have a mea of 70 ad a stadard deviatio of 10. The istructor attempts to comply with this request. She stadardizes the grades o each exam to have a mea of 70 ad a stadard deviatio of 10. She the computes the fial grades by averagig the stadardized marks o the two exams, which, icidetally, had a correlatio of She seds the grades o to the admiistratio, but receives a memo several days later to the effect that, although her fial grades had a mea of 70, their stadard deviatio was icorrect. It was supposed to be 10, but it was oly What wet wrog? Solutio We ca explai this pheomeo usig liear combiatio theory. Sice the fial grades are obtaied averagig the grades o the two mid-terms, the grades are computed as a liear combiatio, i.e., G = (1/2)X + (1/2)Y

28 68 structural equatio modelig Usig the heuristic rule, we establish that the variace of the grades must follow the formula S 2 G = (1/4)S2 X + (1/4)S2 Y + (1/2)S XY However, sice S 2 X = S2 Y, ad S XY = S X S Y r XY = S 2 X r xy, the formula reduces to S 2 G = 1 + r xy S2 X 2 If r xy =.5, the S 2 G = (3/4)S2 X. We have show that, if the two tests correlate.5, ad each test is stadardized to yield a mea of 70 ad a stadard deviatio of 10, it is a ievitable cosequece of the laws of LC theory that the fial grades, calculated as a average of the two tests, will have a variace that is 75, ad a stadard deviatio equal to 75 = More geerally, we have demostrated that i similar situatios, if the idividual tests are stadardized to a particular mea ad variace, the average of the two tests will have a variace that shriks by a multiplicative factor of (1 + 2r xy )/2. O the other had, it is easy to verify that the mea of the fial grades will be the desired value of 70. Example (The Rich Get Richer Pheomeo) Ultimately, the hypothetical istructor i Example will have to do somethig to offset the variace shrikage. What should she do? Solutio The aswer should be apparet from our study of liear trasformatio theory. There are several ways to approach the problem, some more formal tha others. The problem is that the variace of the scores is too low, so they eed to be stretched out alog the umber lie. Ituitively, we should realize that if we subtract 70 from each studet s average grade, we will have a set of scores with a mea of 0 ad a stadard deviatio of If we the multiply each score by 10/8.66, we will have scores with a mea of 0 ad a stadard deviatio of 10. Addig back 70 to the umbers will the restore the proper mea of 70, while leavig the stadard deviatio of 10. This trasformatio

29 the scalar algebra of variaces, covariaces, ad correlatios 69 ca be writte literally as M i = (G i 70)(10/8.66) + 70 (3.23) There are, of course, alterative approaches. Oe is to trasform each studet s mark to a Z-score, the multiply by 10 ad add 70. This trasformatio ca be writte M i = 10 G i With a small amout of maipulatio, you ca see that these two trasformatio rules are equivalet. Study Equatio 3.23 briefly, ad see how it operates. Notice that it expads the idividual s deviatio score (his/her differece from the mea of 70), makig it larger tha it was before. I this case, the multiplyig factor is 10/8.66, so, for example, a grade of would be trasformed ito a grade of 80. It is easy to see that, if the idividual exam grades ad the fial mark are scaled to have the same stadard deviatio, ay studet with a exam average above the mea will receive a fial grade higher tha their exam average. O the other had, ay studet with a exam grade below average will receive a mark lower tha their exam average. For example, a perso with a grade of 61.34, ad a deviatio score of 8.66, would have the deviatio score trasformed to 10, ad the grade chaged to 60. I my udergraduate courses, I refer to this pheomeo as The Rich Get Richer, The Poor Get Poorer. This pheomeo ca be the source of cosiderable costeratio for a studet whose grades have bee below the mea, as the studet s fial grade may be lower tha ay of his/her grades o the exams. Ideed, the lower the studet s mark, the more it is reduced! O the other had, studets with cosistetly above-average performace may get a pleasat surprise if they are uaware of this effect. Liear combiatio theory ca be used to prove a umber of surprisig results i statistics. The followig example shows how to take two colums of umbers ad compute two liear combiatios o them that are guarateed to have a zero correlatio.

30 70 structural equatio modelig Example (Costructig Sets of Ucorrelated Scores) Statistics istructors sometimes costruct sample data sets with certai characteristics. Here is a simple way to costruct two colums of umbers that have a exactly zero correlatio. I will describe the method first, the we will use liear combiatio theory to explai why the scores must have a zero correlatio. The method is as follows: First, simply write dow a list of N differet umbers i a colum. Call these umbers X. Next to the X colum, write the exact same umbers, but i a differet order. Call these Y. Now, create two additioal colums of umbers, W ad M. W is the sum of X ad Y, M the differece betwee X ad Y. W ad M will have a zero correlatio. A example of the method is show i Table Usig ay geeral purpose statistical program, you ca verify that the umbers ideed have a precisely zero correlatio. Ca you explai why? X Y W = X + Y M = X Y Table 3.4: Costructig Numbers with Zero Correlatio Solutio We ca develop a geeral expressio for the correlatio betwee two colums of umbers, usig our heuristic rules for liear combiatios. Let s begi by computig the covariace betwee W ad M. First, we take the algebraic product of X + Y ad X Y. We obtai (X + Y)(X Y) = X 2 Y 2 Applyig the coversio rule, we fid that S X+Y,X Y = S 2 X S2 Y Note that the covariace betwee W ad M is urelated to the covariace betwee the origial colums of umbers, X ad Y! Moreover, if X ad Y have the same variace, the W ad M will have a covariace of exactly zero. Sice the correlatio is the covariace divided by the product of the stadard deviatios, it immediately follows that W ad M will have a correlatio of zero if ad oly if X ad Y have the same variace. If X ad Y cotai the same umbers (i permuted order), the they have the same variace, ad W ad M will have zero correlatio.

Number of fatalities X Sunday 4 Monday 6 Tuesday 2 Wednesday 0 Thursday 3 Friday 5 Saturday 8 Total 28. Day

Number of fatalities X Sunday 4 Monday 6 Tuesday 2 Wednesday 0 Thursday 3 Friday 5 Saturday 8 Total 28. Day LECTURE # 8 Mea Deviatio, Stadard Deviatio ad Variace & Coefficiet of variatio Mea Deviatio Stadard Deviatio ad Variace Coefficiet of variatio First, we will discuss it for the case of raw data, ad the

More information

SNAP Centre Workshop. Basic Algebraic Manipulation

SNAP Centre Workshop. Basic Algebraic Manipulation SNAP Cetre Workshop Basic Algebraic Maipulatio 8 Simplifyig Algebraic Expressios Whe a expressio is writte i the most compact maer possible, it is cosidered to be simplified. Not Simplified: x(x + 4x)

More information

Discrete Mathematics for CS Spring 2007 Luca Trevisan Lecture 22

Discrete Mathematics for CS Spring 2007 Luca Trevisan Lecture 22 CS 70 Discrete Mathematics for CS Sprig 2007 Luca Trevisa Lecture 22 Aother Importat Distributio The Geometric Distributio Questio: A biased coi with Heads probability p is tossed repeatedly util the first

More information

Linear Regression Demystified

Linear Regression Demystified Liear Regressio Demystified Liear regressio is a importat subject i statistics. I elemetary statistics courses, formulae related to liear regressio are ofte stated without derivatio. This ote iteds to

More information

n outcome is (+1,+1, 1,..., 1). Let the r.v. X denote our position (relative to our starting point 0) after n moves. Thus X = X 1 + X 2 + +X n,

n outcome is (+1,+1, 1,..., 1). Let the r.v. X denote our position (relative to our starting point 0) after n moves. Thus X = X 1 + X 2 + +X n, CS 70 Discrete Mathematics for CS Sprig 2008 David Wager Note 9 Variace Questio: At each time step, I flip a fair coi. If it comes up Heads, I walk oe step to the right; if it comes up Tails, I walk oe

More information

Matrices and vectors

Matrices and vectors Oe Matrices ad vectors This book takes for grated that readers have some previous kowledge of the calculus of real fuctios of oe real variable It would be helpful to also have some kowledge of liear algebra

More information

Statistics 511 Additional Materials

Statistics 511 Additional Materials Cofidece Itervals o mu Statistics 511 Additioal Materials This topic officially moves us from probability to statistics. We begi to discuss makig ifereces about the populatio. Oe way to differetiate probability

More information

1 Inferential Methods for Correlation and Regression Analysis

1 Inferential Methods for Correlation and Regression Analysis 1 Iferetial Methods for Correlatio ad Regressio Aalysis I the chapter o Correlatio ad Regressio Aalysis tools for describig bivariate cotiuous data were itroduced. The sample Pearso Correlatio Coefficiet

More information

Summary: CORRELATION & LINEAR REGRESSION. GC. Students are advised to refer to lecture notes for the GC operations to obtain scatter diagram.

Summary: CORRELATION & LINEAR REGRESSION. GC. Students are advised to refer to lecture notes for the GC operations to obtain scatter diagram. Key Cocepts: 1) Sketchig of scatter diagram The scatter diagram of bivariate (i.e. cotaiig two variables) data ca be easily obtaied usig GC. Studets are advised to refer to lecture otes for the GC operatios

More information

(3) If you replace row i of A by its sum with a multiple of another row, then the determinant is unchanged! Expand across the i th row:

(3) If you replace row i of A by its sum with a multiple of another row, then the determinant is unchanged! Expand across the i th row: Math 50-004 Tue Feb 4 Cotiue with sectio 36 Determiats The effective way to compute determiats for larger-sized matrices without lots of zeroes is to ot use the defiitio, but rather to use the followig

More information

6.3 Testing Series With Positive Terms

6.3 Testing Series With Positive Terms 6.3. TESTING SERIES WITH POSITIVE TERMS 307 6.3 Testig Series With Positive Terms 6.3. Review of what is kow up to ow I theory, testig a series a i for covergece amouts to fidig the i= sequece of partial

More information

3. Z Transform. Recall that the Fourier transform (FT) of a DT signal xn [ ] is ( ) [ ] = In order for the FT to exist in the finite magnitude sense,

3. Z Transform. Recall that the Fourier transform (FT) of a DT signal xn [ ] is ( ) [ ] = In order for the FT to exist in the finite magnitude sense, 3. Z Trasform Referece: Etire Chapter 3 of text. Recall that the Fourier trasform (FT) of a DT sigal x [ ] is ω ( ) [ ] X e = j jω k = xe I order for the FT to exist i the fiite magitude sese, S = x [

More information

ACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 1 MATH00030 SEMESTER / Statistics

ACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 1 MATH00030 SEMESTER / Statistics ACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 1 MATH00030 SEMESTER 1 018/019 DR. ANTHONY BROWN 8. Statistics 8.1. Measures of Cetre: Mea, Media ad Mode. If we have a series of umbers the

More information

CHAPTER I: Vector Spaces

CHAPTER I: Vector Spaces CHAPTER I: Vector Spaces Sectio 1: Itroductio ad Examples This first chapter is largely a review of topics you probably saw i your liear algebra course. So why cover it? (1) Not everyoe remembers everythig

More information

Random Variables, Sampling and Estimation

Random Variables, Sampling and Estimation Chapter 1 Radom Variables, Samplig ad Estimatio 1.1 Itroductio This chapter will cover the most importat basic statistical theory you eed i order to uderstad the ecoometric material that will be comig

More information

(3) If you replace row i of A by its sum with a multiple of another row, then the determinant is unchanged! Expand across the i th row:

(3) If you replace row i of A by its sum with a multiple of another row, then the determinant is unchanged! Expand across the i th row: Math 5-4 Tue Feb 4 Cotiue with sectio 36 Determiats The effective way to compute determiats for larger-sized matrices without lots of zeroes is to ot use the defiitio, but rather to use the followig facts,

More information

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering CEE 5 Autum 005 Ucertaity Cocepts for Geotechical Egieerig Basic Termiology Set A set is a collectio of (mutually exclusive) objects or evets. The sample space is the (collectively exhaustive) collectio

More information

Infinite Sequences and Series

Infinite Sequences and Series Chapter 6 Ifiite Sequeces ad Series 6.1 Ifiite Sequeces 6.1.1 Elemetary Cocepts Simply speakig, a sequece is a ordered list of umbers writte: {a 1, a 2, a 3,...a, a +1,...} where the elemets a i represet

More information

Measures of Spread: Standard Deviation

Measures of Spread: Standard Deviation Measures of Spread: Stadard Deviatio So far i our study of umerical measures used to describe data sets, we have focused o the mea ad the media. These measures of ceter tell us the most typical value of

More information

6 Integers Modulo n. integer k can be written as k = qn + r, with q,r, 0 r b. So any integer.

6 Integers Modulo n. integer k can be written as k = qn + r, with q,r, 0 r b. So any integer. 6 Itegers Modulo I Example 2.3(e), we have defied the cogruece of two itegers a,b with respect to a modulus. Let us recall that a b (mod ) meas a b. We have proved that cogruece is a equivalece relatio

More information

7.1 Convergence of sequences of random variables

7.1 Convergence of sequences of random variables Chapter 7 Limit Theorems Throughout this sectio we will assume a probability space (, F, P), i which is defied a ifiite sequece of radom variables (X ) ad a radom variable X. The fact that for every ifiite

More information

ECON 3150/4150, Spring term Lecture 3

ECON 3150/4150, Spring term Lecture 3 Itroductio Fidig the best fit by regressio Residuals ad R-sq Regressio ad causality Summary ad ext step ECON 3150/4150, Sprig term 2014. Lecture 3 Ragar Nymoe Uiversity of Oslo 21 Jauary 2014 1 / 30 Itroductio

More information

Activity 3: Length Measurements with the Four-Sided Meter Stick

Activity 3: Length Measurements with the Four-Sided Meter Stick Activity 3: Legth Measuremets with the Four-Sided Meter Stick OBJECTIVE: The purpose of this experimet is to study errors ad the propagatio of errors whe experimetal data derived usig a four-sided meter

More information

Discrete-Time Systems, LTI Systems, and Discrete-Time Convolution

Discrete-Time Systems, LTI Systems, and Discrete-Time Convolution EEL5: Discrete-Time Sigals ad Systems. Itroductio I this set of otes, we begi our mathematical treatmet of discrete-time s. As show i Figure, a discrete-time operates or trasforms some iput sequece x [

More information

Math 2784 (or 2794W) University of Connecticut

Math 2784 (or 2794W) University of Connecticut ORDERS OF GROWTH PAT SMITH Math 2784 (or 2794W) Uiversity of Coecticut Date: Mar. 2, 22. ORDERS OF GROWTH. Itroductio Gaiig a ituitive feel for the relative growth of fuctios is importat if you really

More information

2 Definition of Variance and the obvious guess

2 Definition of Variance and the obvious guess 1 Estimatig Variace Statistics - Math 410, 11/7/011 Oe of the mai themes of this course is to estimate the mea µ of some variable X of a populatio. We typically do this by collectig a sample of idividuals

More information

Zeros of Polynomials

Zeros of Polynomials Math 160 www.timetodare.com 4.5 4.6 Zeros of Polyomials I these sectios we will study polyomials algebraically. Most of our work will be cocered with fidig the solutios of polyomial equatios of ay degree

More information

Theorem: Let A n n. In this case that A does reduce to I, we search for A 1 as the solution matrix X to the matrix equation A X = I i.e.

Theorem: Let A n n. In this case that A does reduce to I, we search for A 1 as the solution matrix X to the matrix equation A X = I i.e. Theorem: Let A be a square matrix The A has a iverse matrix if ad oly if its reduced row echelo form is the idetity I this case the algorithm illustrated o the previous page will always yield the iverse

More information

multiplies all measures of center and the standard deviation and range by k, while the variance is multiplied by k 2.

multiplies all measures of center and the standard deviation and range by k, while the variance is multiplied by k 2. Lesso 3- Lesso 3- Scale Chages of Data Vocabulary scale chage of a data set scale factor scale image BIG IDEA Multiplyig every umber i a data set by k multiplies all measures of ceter ad the stadard deviatio

More information

MEASURES OF DISPERSION (VARIABILITY)

MEASURES OF DISPERSION (VARIABILITY) POLI 300 Hadout #7 N. R. Miller MEASURES OF DISPERSION (VARIABILITY) While measures of cetral tedecy idicate what value of a variable is (i oe sese or other, e.g., mode, media, mea), average or cetral

More information

Discrete Mathematics for CS Spring 2008 David Wagner Note 22

Discrete Mathematics for CS Spring 2008 David Wagner Note 22 CS 70 Discrete Mathematics for CS Sprig 2008 David Wager Note 22 I.I.D. Radom Variables Estimatig the bias of a coi Questio: We wat to estimate the proportio p of Democrats i the US populatio, by takig

More information

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1.

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1. Eco 325/327 Notes o Sample Mea, Sample Proportio, Cetral Limit Theorem, Chi-square Distributio, Studet s t distributio 1 Sample Mea By Hiro Kasahara We cosider a radom sample from a populatio. Defiitio

More information

REVISION SHEET FP1 (MEI) ALGEBRA. Identities In mathematics, an identity is a statement which is true for all values of the variables it contains.

REVISION SHEET FP1 (MEI) ALGEBRA. Identities In mathematics, an identity is a statement which is true for all values of the variables it contains. The mai ideas are: Idetities REVISION SHEET FP (MEI) ALGEBRA Before the exam you should kow: If a expressio is a idetity the it is true for all values of the variable it cotais The relatioships betwee

More information

Principle Of Superposition

Principle Of Superposition ecture 5: PREIMINRY CONCEP O RUCUR NYI Priciple Of uperpositio Mathematically, the priciple of superpositio is stated as ( a ) G( a ) G( ) G a a or for a liear structural system, the respose at a give

More information

CURRICULUM INSPIRATIONS: INNOVATIVE CURRICULUM ONLINE EXPERIENCES: TANTON TIDBITS:

CURRICULUM INSPIRATIONS:  INNOVATIVE CURRICULUM ONLINE EXPERIENCES:  TANTON TIDBITS: CURRICULUM INSPIRATIONS: wwwmaaorg/ci MATH FOR AMERICA_DC: wwwmathforamericaorg/dc INNOVATIVE CURRICULUM ONLINE EXPERIENCES: wwwgdaymathcom TANTON TIDBITS: wwwjamestatocom TANTON S TAKE ON MEAN ad VARIATION

More information

The standard deviation of the mean

The standard deviation of the mean Physics 6C Fall 20 The stadard deviatio of the mea These otes provide some clarificatio o the distictio betwee the stadard deviatio ad the stadard deviatio of the mea.. The sample mea ad variace Cosider

More information

REVISION SHEET FP1 (MEI) ALGEBRA. Identities In mathematics, an identity is a statement which is true for all values of the variables it contains.

REVISION SHEET FP1 (MEI) ALGEBRA. Identities In mathematics, an identity is a statement which is true for all values of the variables it contains. the Further Mathematics etwork wwwfmetworkorguk V 07 The mai ideas are: Idetities REVISION SHEET FP (MEI) ALGEBRA Before the exam you should kow: If a expressio is a idetity the it is true for all values

More information

BIOSTATS 640 Intermediate Biostatistics Frequently Asked Questions Topic 1 FAQ 1 Review of BIOSTATS 540 Introductory Biostatistics

BIOSTATS 640 Intermediate Biostatistics Frequently Asked Questions Topic 1 FAQ 1 Review of BIOSTATS 540 Introductory Biostatistics BIOTAT 640 Itermediate Biostatistics Frequetly Asked Questios Topic FAQ Review of BIOTAT 540 Itroductory Biostatistics. I m cofused about the jargo ad otatio, especially populatio versus sample. Could

More information

Chapter Vectors

Chapter Vectors Chapter 4. Vectors fter readig this chapter you should be able to:. defie a vector. add ad subtract vectors. fid liear combiatios of vectors ad their relatioship to a set of equatios 4. explai what it

More information

Session 5. (1) Principal component analysis and Karhunen-Loève transformation

Session 5. (1) Principal component analysis and Karhunen-Loève transformation 200 Autum semester Patter Iformatio Processig Topic 2 Image compressio by orthogoal trasformatio Sessio 5 () Pricipal compoet aalysis ad Karhue-Loève trasformatio Topic 2 of this course explais the image

More information

Problems from 9th edition of Probability and Statistical Inference by Hogg, Tanis and Zimmerman:

Problems from 9th edition of Probability and Statistical Inference by Hogg, Tanis and Zimmerman: Math 224 Fall 2017 Homework 4 Drew Armstrog Problems from 9th editio of Probability ad Statistical Iferece by Hogg, Tais ad Zimmerma: Sectio 2.3, Exercises 16(a,d),18. Sectio 2.4, Exercises 13, 14. Sectio

More information

Exponents. Learning Objectives. Pre-Activity

Exponents. Learning Objectives. Pre-Activity Sectio. Pre-Activity Preparatio Epoets A Chai Letter Chai letters are geerated every day. If you sed a chai letter to three frieds ad they each sed it o to three frieds, who each sed it o to three frieds,

More information

Distribution of Random Samples & Limit theorems

Distribution of Random Samples & Limit theorems STAT/MATH 395 A - PROBABILITY II UW Witer Quarter 2017 Néhémy Lim Distributio of Radom Samples & Limit theorems 1 Distributio of i.i.d. Samples Motivatig example. Assume that the goal of a study is to

More information

SEQUENCES AND SERIES

SEQUENCES AND SERIES 9 SEQUENCES AND SERIES INTRODUCTION Sequeces have may importat applicatios i several spheres of huma activities Whe a collectio of objects is arraged i a defiite order such that it has a idetified first

More information

Expectation and Variance of a random variable

Expectation and Variance of a random variable Chapter 11 Expectatio ad Variace of a radom variable The aim of this lecture is to defie ad itroduce mathematical Expectatio ad variace of a fuctio of discrete & cotiuous radom variables ad the distributio

More information

ANALYSIS OF EXPERIMENTAL ERRORS

ANALYSIS OF EXPERIMENTAL ERRORS ANALYSIS OF EXPERIMENTAL ERRORS All physical measuremets ecoutered i the verificatio of physics theories ad cocepts are subject to ucertaities that deped o the measurig istrumets used ad the coditios uder

More information

4.3 Growth Rates of Solutions to Recurrences

4.3 Growth Rates of Solutions to Recurrences 4.3. GROWTH RATES OF SOLUTIONS TO RECURRENCES 81 4.3 Growth Rates of Solutios to Recurreces 4.3.1 Divide ad Coquer Algorithms Oe of the most basic ad powerful algorithmic techiques is divide ad coquer.

More information

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING Lectures MODULE 5 STATISTICS II. Mea ad stadard error of sample data. Biomial distributio. Normal distributio 4. Samplig 5. Cofidece itervals

More information

BHW #13 1/ Cooper. ENGR 323 Probabilistic Analysis Beautiful Homework # 13

BHW #13 1/ Cooper. ENGR 323 Probabilistic Analysis Beautiful Homework # 13 BHW # /5 ENGR Probabilistic Aalysis Beautiful Homework # Three differet roads feed ito a particular freeway etrace. Suppose that durig a fixed time period, the umber of cars comig from each road oto the

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2016 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

3.2 Properties of Division 3.3 Zeros of Polynomials 3.4 Complex and Rational Zeros of Polynomials

3.2 Properties of Division 3.3 Zeros of Polynomials 3.4 Complex and Rational Zeros of Polynomials Math 60 www.timetodare.com 3. Properties of Divisio 3.3 Zeros of Polyomials 3.4 Complex ad Ratioal Zeros of Polyomials I these sectios we will study polyomials algebraically. Most of our work will be cocered

More information

The Binomial Theorem

The Binomial Theorem The Biomial Theorem Robert Marti Itroductio The Biomial Theorem is used to expad biomials, that is, brackets cosistig of two distict terms The formula for the Biomial Theorem is as follows: (a + b ( k

More information

Section 1.1. Calculus: Areas And Tangents. Difference Equations to Differential Equations

Section 1.1. Calculus: Areas And Tangents. Difference Equations to Differential Equations Differece Equatios to Differetial Equatios Sectio. Calculus: Areas Ad Tagets The study of calculus begis with questios about chage. What happes to the velocity of a swigig pedulum as its positio chages?

More information

First, note that the LS residuals are orthogonal to the regressors. X Xb X y = 0 ( normal equations ; (k 1) ) So,

First, note that the LS residuals are orthogonal to the regressors. X Xb X y = 0 ( normal equations ; (k 1) ) So, 0 2. OLS Part II The OLS residuals are orthogoal to the regressors. If the model icludes a itercept, the orthogoality of the residuals ad regressors gives rise to three results, which have limited practical

More information

This is an introductory course in Analysis of Variance and Design of Experiments.

This is an introductory course in Analysis of Variance and Design of Experiments. 1 Notes for M 384E, Wedesday, Jauary 21, 2009 (Please ote: I will ot pass out hard-copy class otes i future classes. If there are writte class otes, they will be posted o the web by the ight before class

More information

Analysis of Experimental Measurements

Analysis of Experimental Measurements Aalysis of Experimetal Measuremets Thik carefully about the process of makig a measuremet. A measuremet is a compariso betwee some ukow physical quatity ad a stadard of that physical quatity. As a example,

More information

Apply change-of-basis formula to rewrite x as a linear combination of eigenvectors v j.

Apply change-of-basis formula to rewrite x as a linear combination of eigenvectors v j. Eigevalue-Eigevector Istructor: Nam Su Wag eigemcd Ay vector i real Euclidea space of dimesio ca be uiquely epressed as a liear combiatio of liearly idepedet vectors (ie, basis) g j, j,,, α g α g α g α

More information

The Sample Variance Formula: A Detailed Study of an Old Controversy

The Sample Variance Formula: A Detailed Study of an Old Controversy The Sample Variace Formula: A Detailed Study of a Old Cotroversy Ky M. Vu PhD. AuLac Techologies Ic. c 00 Email: kymvu@aulactechologies.com Abstract The two biased ad ubiased formulae for the sample variace

More information

An Introduction to Randomized Algorithms

An Introduction to Randomized Algorithms A Itroductio to Radomized Algorithms The focus of this lecture is to study a radomized algorithm for quick sort, aalyze it usig probabilistic recurrece relatios, ad also provide more geeral tools for aalysis

More information

Lesson 10: Limits and Continuity

Lesson 10: Limits and Continuity www.scimsacademy.com Lesso 10: Limits ad Cotiuity SCIMS Academy 1 Limit of a fuctio The cocept of limit of a fuctio is cetral to all other cocepts i calculus (like cotiuity, derivative, defiite itegrals

More information

MAT 271 Project: Partial Fractions for certain rational functions

MAT 271 Project: Partial Fractions for certain rational functions MAT 7 Project: Partial Fractios for certai ratioal fuctios Prerequisite kowledge: partial fractios from MAT 7, a very good commad of factorig ad complex umbers from Precalculus. To complete this project,

More information

September 2012 C1 Note. C1 Notes (Edexcel) Copyright - For AS, A2 notes and IGCSE / GCSE worksheets 1

September 2012 C1 Note. C1 Notes (Edexcel) Copyright   - For AS, A2 notes and IGCSE / GCSE worksheets 1 September 0 s (Edecel) Copyright www.pgmaths.co.uk - For AS, A otes ad IGCSE / GCSE worksheets September 0 Copyright www.pgmaths.co.uk - For AS, A otes ad IGCSE / GCSE worksheets September 0 Copyright

More information

NUMERICAL METHODS COURSEWORK INFORMAL NOTES ON NUMERICAL INTEGRATION COURSEWORK

NUMERICAL METHODS COURSEWORK INFORMAL NOTES ON NUMERICAL INTEGRATION COURSEWORK NUMERICAL METHODS COURSEWORK INFORMAL NOTES ON NUMERICAL INTEGRATION COURSEWORK For this piece of coursework studets must use the methods for umerical itegratio they meet i the Numerical Methods module

More information

Properties and Hypothesis Testing

Properties and Hypothesis Testing Chapter 3 Properties ad Hypothesis Testig 3.1 Types of data The regressio techiques developed i previous chapters ca be applied to three differet kids of data. 1. Cross-sectioal data. 2. Time series data.

More information

7 Sequences of real numbers

7 Sequences of real numbers 40 7 Sequeces of real umbers 7. Defiitios ad examples Defiitio 7... A sequece of real umbers is a real fuctio whose domai is the set N of atural umbers. Let s : N R be a sequece. The the values of s are

More information

Mathematical Notation Math Introduction to Applied Statistics

Mathematical Notation Math Introduction to Applied Statistics Mathematical Notatio Math 113 - Itroductio to Applied Statistics Name : Use Word or WordPerfect to recreate the followig documets. Each article is worth 10 poits ad ca be prited ad give to the istructor

More information

(b) What is the probability that a particle reaches the upper boundary n before the lower boundary m?

(b) What is the probability that a particle reaches the upper boundary n before the lower boundary m? MATH 529 The Boudary Problem The drukard s walk (or boudary problem) is oe of the most famous problems i the theory of radom walks. Oe versio of the problem is described as follows: Suppose a particle

More information

Discrete Mathematics for CS Spring 2005 Clancy/Wagner Notes 21. Some Important Distributions

Discrete Mathematics for CS Spring 2005 Clancy/Wagner Notes 21. Some Important Distributions CS 70 Discrete Mathematics for CS Sprig 2005 Clacy/Wager Notes 21 Some Importat Distributios Questio: A biased coi with Heads probability p is tossed repeatedly util the first Head appears. What is the

More information

11 Correlation and Regression

11 Correlation and Regression 11 Correlatio Regressio 11.1 Multivariate Data Ofte we look at data where several variables are recorded for the same idividuals or samplig uits. For example, at a coastal weather statio, we might record

More information

P1 Chapter 8 :: Binomial Expansion

P1 Chapter 8 :: Binomial Expansion P Chapter 8 :: Biomial Expasio jfrost@tiffi.kigsto.sch.uk www.drfrostmaths.com @DrFrostMaths Last modified: 6 th August 7 Use of DrFrostMaths for practice Register for free at: www.drfrostmaths.com/homework

More information

II. Descriptive Statistics D. Linear Correlation and Regression. 1. Linear Correlation

II. Descriptive Statistics D. Linear Correlation and Regression. 1. Linear Correlation II. Descriptive Statistics D. Liear Correlatio ad Regressio I this sectio Liear Correlatio Cause ad Effect Liear Regressio 1. Liear Correlatio Quatifyig Liear Correlatio The Pearso product-momet correlatio

More information

Essential Question How can you recognize an arithmetic sequence from its graph?

Essential Question How can you recognize an arithmetic sequence from its graph? . Aalyzig Arithmetic Sequeces ad Series COMMON CORE Learig Stadards HSF-IF.A.3 HSF-BF.A. HSF-LE.A. Essetial Questio How ca you recogize a arithmetic sequece from its graph? I a arithmetic sequece, the

More information

MA131 - Analysis 1. Workbook 2 Sequences I

MA131 - Analysis 1. Workbook 2 Sequences I MA3 - Aalysis Workbook 2 Sequeces I Autum 203 Cotets 2 Sequeces I 2. Itroductio.............................. 2.2 Icreasig ad Decreasig Sequeces................ 2 2.3 Bouded Sequeces..........................

More information

7.1 Convergence of sequences of random variables

7.1 Convergence of sequences of random variables Chapter 7 Limit theorems Throughout this sectio we will assume a probability space (Ω, F, P), i which is defied a ifiite sequece of radom variables (X ) ad a radom variable X. The fact that for every ifiite

More information

4.1 Sigma Notation and Riemann Sums

4.1 Sigma Notation and Riemann Sums 0 the itegral. Sigma Notatio ad Riema Sums Oe strategy for calculatig the area of a regio is to cut the regio ito simple shapes, calculate the area of each simple shape, ad the add these smaller areas

More information

Anna Janicka Mathematical Statistics 2018/2019 Lecture 1, Parts 1 & 2

Anna Janicka Mathematical Statistics 2018/2019 Lecture 1, Parts 1 & 2 Aa Jaicka Mathematical Statistics 18/19 Lecture 1, Parts 1 & 1. Descriptive Statistics By the term descriptive statistics we will mea the tools used for quatitative descriptio of the properties of a sample

More information

Polynomial Functions and Their Graphs

Polynomial Functions and Their Graphs Polyomial Fuctios ad Their Graphs I this sectio we begi the study of fuctios defied by polyomial expressios. Polyomial ad ratioal fuctios are the most commo fuctios used to model data, ad are used extesively

More information

Lecture 22: Review for Exam 2. 1 Basic Model Assumptions (without Gaussian Noise)

Lecture 22: Review for Exam 2. 1 Basic Model Assumptions (without Gaussian Noise) Lecture 22: Review for Exam 2 Basic Model Assumptios (without Gaussia Noise) We model oe cotiuous respose variable Y, as a liear fuctio of p umerical predictors, plus oise: Y = β 0 + β X +... β p X p +

More information

1 of 7 7/16/2009 6:06 AM Virtual Laboratories > 6. Radom Samples > 1 2 3 4 5 6 7 6. Order Statistics Defiitios Suppose agai that we have a basic radom experimet, ad that X is a real-valued radom variable

More information

Variance of Discrete Random Variables Class 5, Jeremy Orloff and Jonathan Bloom

Variance of Discrete Random Variables Class 5, Jeremy Orloff and Jonathan Bloom Variace of Discrete Radom Variables Class 5, 18.05 Jeremy Orloff ad Joatha Bloom 1 Learig Goals 1. Be able to compute the variace ad stadard deviatio of a radom variable.. Uderstad that stadard deviatio

More information

t distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference

t distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference EXST30 Backgroud material Page From the textbook The Statistical Sleuth Mea [0]: I your text the word mea deotes a populatio mea (µ) while the work average deotes a sample average ( ). Variace [0]: The

More information

SEQUENCES AND SERIES

SEQUENCES AND SERIES Sequeces ad 6 Sequeces Ad SEQUENCES AND SERIES Successio of umbers of which oe umber is desigated as the first, other as the secod, aother as the third ad so o gives rise to what is called a sequece. Sequeces

More information

Kinetics of Complex Reactions

Kinetics of Complex Reactions Kietics of Complex Reactios by Flick Colema Departmet of Chemistry Wellesley College Wellesley MA 28 wcolema@wellesley.edu Copyright Flick Colema 996. All rights reserved. You are welcome to use this documet

More information

The Growth of Functions. Theoretical Supplement

The Growth of Functions. Theoretical Supplement The Growth of Fuctios Theoretical Supplemet The Triagle Iequality The triagle iequality is a algebraic tool that is ofte useful i maipulatig absolute values of fuctios. The triagle iequality says that

More information

PH 425 Quantum Measurement and Spin Winter SPINS Lab 1

PH 425 Quantum Measurement and Spin Winter SPINS Lab 1 PH 425 Quatum Measuremet ad Spi Witer 23 SPIS Lab Measure the spi projectio S z alog the z-axis This is the experimet that is ready to go whe you start the program, as show below Each atom is measured

More information

Chapter 10: Power Series

Chapter 10: Power Series Chapter : Power Series 57 Chapter Overview: Power Series The reaso series are part of a Calculus course is that there are fuctios which caot be itegrated. All power series, though, ca be itegrated because

More information

A sequence of numbers is a function whose domain is the positive integers. We can see that the sequence

A sequence of numbers is a function whose domain is the positive integers. We can see that the sequence Sequeces A sequece of umbers is a fuctio whose domai is the positive itegers. We ca see that the sequece,, 2, 2, 3, 3,... is a fuctio from the positive itegers whe we write the first sequece elemet as

More information

1 Introduction to reducing variance in Monte Carlo simulations

1 Introduction to reducing variance in Monte Carlo simulations Copyright c 010 by Karl Sigma 1 Itroductio to reducig variace i Mote Carlo simulatios 11 Review of cofidece itervals for estimatig a mea I statistics, we estimate a ukow mea µ = E(X) of a distributio by

More information

Quick Review of Probability

Quick Review of Probability Quick Review of Probability Berli Che Departmet of Computer Sciece & Iformatio Egieerig Natioal Taiwa Normal Uiversity Refereces: 1. W. Navidi. Statistics for Egieerig ad Scietists. Chapter 2 & Teachig

More information

Lecture 3: August 31

Lecture 3: August 31 36-705: Itermediate Statistics Fall 018 Lecturer: Siva Balakrisha Lecture 3: August 31 This lecture will be mostly a summary of other useful expoetial tail bouds We will ot prove ay of these i lecture,

More information

, then cv V. Differential Equations Elements of Lineaer Algebra Name: Consider the differential equation. and y2 cos( kx)

, then cv V. Differential Equations Elements of Lineaer Algebra Name: Consider the differential equation. and y2 cos( kx) Cosider the differetial equatio y '' k y 0 has particular solutios y1 si( kx) ad y cos( kx) I geeral, ay liear combiatio of y1 ad y, cy 1 1 cy where c1, c is also a solutio to the equatio above The reaso

More information

The picture in figure 1.1 helps us to see that the area represents the distance traveled. Figure 1: Area represents distance travelled

The picture in figure 1.1 helps us to see that the area represents the distance traveled. Figure 1: Area represents distance travelled 1 Lecture : Area Area ad distace traveled Approximatig area by rectagles Summatio The area uder a parabola 1.1 Area ad distace Suppose we have the followig iformatio about the velocity of a particle, how

More information

[ 11 ] z of degree 2 as both degree 2 each. The degree of a polynomial in n variables is the maximum of the degrees of its terms.

[ 11 ] z of degree 2 as both degree 2 each. The degree of a polynomial in n variables is the maximum of the degrees of its terms. [ 11 ] 1 1.1 Polyomial Fuctios 1 Algebra Ay fuctio f ( x) ax a1x... a1x a0 is a polyomial fuctio if ai ( i 0,1,,,..., ) is a costat which belogs to the set of real umbers ad the idices,, 1,...,1 are atural

More information

2 Geometric interpretation of complex numbers

2 Geometric interpretation of complex numbers 2 Geometric iterpretatio of complex umbers 2.1 Defiitio I will start fially with a precise defiitio, assumig that such mathematical object as vector space R 2 is well familiar to the studets. Recall that

More information

Inverse Matrix. A meaning that matrix B is an inverse of matrix A.

Inverse Matrix. A meaning that matrix B is an inverse of matrix A. Iverse Matrix Two square matrices A ad B of dimesios are called iverses to oe aother if the followig holds, AB BA I (11) The otio is dual but we ofte write 1 B A meaig that matrix B is a iverse of matrix

More information

THE KALMAN FILTER RAUL ROJAS

THE KALMAN FILTER RAUL ROJAS THE KALMAN FILTER RAUL ROJAS Abstract. This paper provides a getle itroductio to the Kalma filter, a umerical method that ca be used for sesor fusio or for calculatio of trajectories. First, we cosider

More information

LESSON 2: SIMPLIFYING RADICALS

LESSON 2: SIMPLIFYING RADICALS High School: Workig with Epressios LESSON : SIMPLIFYING RADICALS N.RN.. C N.RN.. B 5 5 C t t t t t E a b a a b N.RN.. 4 6 N.RN. 4. N.RN. 5. N.RN. 6. 7 8 N.RN. 7. A 7 N.RN. 8. 6 80 448 4 5 6 48 00 6 6 6

More information

Chapter 3. Strong convergence. 3.1 Definition of almost sure convergence

Chapter 3. Strong convergence. 3.1 Definition of almost sure convergence Chapter 3 Strog covergece As poited out i the Chapter 2, there are multiple ways to defie the otio of covergece of a sequece of radom variables. That chapter defied covergece i probability, covergece i

More information

Topic 8: Expected Values

Topic 8: Expected Values Topic 8: Jue 6, 20 The simplest summary of quatitative data is the sample mea. Give a radom variable, the correspodig cocept is called the distributioal mea, the epectatio or the epected value. We begi

More information

Math 155 (Lecture 3)

Math 155 (Lecture 3) Math 55 (Lecture 3) September 8, I this lecture, we ll cosider the aswer to oe of the most basic coutig problems i combiatorics Questio How may ways are there to choose a -elemet subset of the set {,,,

More information