Measures of Variation

Size: px
Start display at page:

Download "Measures of Variation"

Transcription

1 Chapter : Measures of Variatio from Statistical Aalysis i the Behavioral Scieces by James Raymodo Secod Editio Copyright Property of Kedall Hut Publishig CHAPTER Measures of Variatio Key Cocepts Deviatio from Mea Rage Iterquartile Rage Semi-Iterquartile Rage Variace Stadard Deviatio Explaiig Variace Itroductio Measures of variatio refer to a group of statistics that is iteded to provide us with iformatio o how a set of scores are distributed. A examiatio of measures of variatio is a logical extesio of ay descriptio of a data set usig the measures of cetral tedecy that we examied i the previous chapter. Cosider a case where there are two sectios of a course i statistics, ad you are told that each sectio is taught by the same professor, each sectio has a erollmet of 1 studets, ad that the mea, ad media score o a recet examiatio is 80 i both sectios of the course. Without ay additioal iformatio you would be tempted to coclude that the performace of the studets i the two sectios of the course is reasoably similar. As a matter of fact, all of the iformatio up to this poit would suggest that the performace of the studets i the two sectios is idetical. Now suppose that you are show the actual performace of each studet o the examiatio i both sectios of the course (see Figure.1). Clearly the performace of the studets i the two sectios is radically differet. The score of 80 is ot oly the mea of sectio 1, but also a score that seems to be more represetative of the performace of the etire class. While ot all of the studets scored 80, more were at that score 87

2 Chapter : Measures of Variatio from Statistical Aalysis i the Behavioral Scieces by James Raymodo Secod Editio Copyright Property of Kedall Hut Publishig 88 Chapter : Measures of Variatio Sectio 1 Sectio f f Score o examiatio 1; mea 80; media Score o examiatio 1; mea 80; media Figure.1 Graph showig distributios for the two sectios tha ay other, ad the umber of studets scorig higher or lower tha 80 falls off the further the scores deviate from 80. The performace of the studets i sectio is far differet eve though the distributio has the same mea ad media score as the first distributio. I sectio the mea of 80 is ot at all represetative of the typical performace. I fact, oly oe studet eared a score at the mea. Seve studets eared perfect scores of 100, while the remaiig seve studets eared a very low score of oly 60. Just as the mea is a sigle umber that is desiged to tell us where the cetral poit of a distributio of scores is located, measures of variatio are sigle umbers that are desiged to tell us how the idividual scores are distributed. By examiig both a measure of cetral tedecy such as the mea, ad a appropriate measure of variatio, we will be able to kow ot oly where the cetral poit of a distributio is located, but also if it teds to look more like the distributio of scores i sectio 1 or if it looks more like the distributio of scores i sectio. The measures of variatio examied i this chapter ca be divided ito two groups. The first group of statistics measures variatio i a distributio i terms of the distace from the smaller scores to the higher scores. Icluded i this group of measures of variatio is the rage, which is a simple measure of the variatio i a distributio computed by examiig the distace from the smallest score to the largest score. Also icluded i this group of statistics are the iterquartile rage (IQR), ad the semi-iterquartile rage (SIQR). These latter two measures of variatio are ofte used i educatioal research. The secod group of statistics measures variatio i terms of a summary measure of each score s deviatio from the mea. The two statistics of this type that we will examie are the variace, ad the stadard deviatio. The measures of variatio based o deviatio from the mea ted to be more useful, ad are fudametal cocepts i behavioral sciece research. The Rage, Iterquartile Rage, ad Semi-iterquartile Rage Rage The simplest measure of variatio i a distributio of data is the rage. The rage is defied as the distace from the smallest observed score to the largest observed score i a set of data. For raw data, the rage may be computed by subtractig the lower limit of the smallest observed score from the upper limit of the largest observed score. Cosider the set of raw data below: : The rage is (1. 1.) 0.

3 Chapter : Measures of Variatio from Statistical Aalysis i the Behavioral Scieces by James Raymodo Secod Editio Copyright Property of Kedall Hut Publishig Chapter : Measures of Variatio 89 A simple alterative is to subtract the smallest observed score from the largest observed score, ad the add 1 additioal uit of measuremet to compesate for the upper ad lower limits. I this case, the rage is (1 ) I either case, the set of data rages over 0 uits, just as if you bega readig a book o page ad cotiued through page 1 you would have read a total of 0 pages. Keep i mid whe usig the alterative method of subtractig the smallest observed score from the largest observed score that we add 1 uit of measuremet, ot just 1. For example, if we had the followig data o icome: Icome: 1,000 14,000 18,000 3,000 46,000 8,000 The rage would be computed as: (8,000 1,000) + 1,000 47,000 It would ot be correct to compute the rage as: (8,000 1,000) ,001 The computatio is similar for data orgaized i a simple or full frequecy distributio as illustrated below. f Cf Σ f 0 I a simple frequecy distributio where the iterval size i 1, we do ot lose ay precisio i measuremet. I this case the rage would be computed as (1..) 7 or, alteratively: (1 6) I a frequecy distributio where the iterval size i > 1, we follow the same geeral procedure, but use the lower limit of the smallest iterval, ad the upper limit of the largest iterval as our parameters for the computatio of the rage. For the data below: f Cf Σf 7

4 Chapter : Measures of Variatio from Statistical Aalysis i the Behavioral Scieces by James Raymodo Secod Editio Copyright Property of Kedall Hut Publishig 90 Chapter : Measures of Variatio the rage would be computed as: [9. ( 0.)] 30 or, alteratively (9 0) Iterquartile rage The iterquartile rage (IQR) is defied as the distace from the 7th percetile to the th percetile i a set of data. I the previous chapter o cetral tedecy we saw that a few extreme scores at oe ed of the distributio ca bias a measure such as the mea. The same situatio is true for a measure of variatio such as the rage. A few extreme scores at oe or the other ed of a distributio will affect the size of the rage. The IQR is a alterative measure of variatio that elimiates the effect of the extreme scores i a distributio by reportig the rage betwee the 7th ad th percetiles. I effect, the IQR represets the rage of the middle 0% of the distributio, ad igores the top % ad bottom % of the data that may be subject to extreme scores. Computig the IQR is as simple as subtractig the th percetile from the 7th percetile: IQR P7 P The formula for fidig a particular percetile from Chapter 3 is provided below, alog with the frequecy distributio previously used to illustrate the procedure. To compute the IQR we will eed to fid both the th percetile ad the 7th percetile. f Cf Σ f 1 The geeral formula for fidig a give percetile is give by the equatio below. P x LL+ FP CF f it below i where P x F P the desired percetile the umber of frequecies below the desired percetile CF below the value from the cumulative frequecy colum for the iterval just below F P LL the lower limit of the iterval cotaiig F P f it the umber of frequecies i the F P iterval i the iterval size. For the th percetile: F P (.) (1) 31. For the 7th percetile: F P (.7) (1) 93.7

5 Chapter : Measures of Variatio from Statistical Aalysis i the Behavioral Scieces by James Raymodo Secod Editio Copyright Property of Kedall Hut Publishig Chapter : Measures of Variatio 91 The itervals cotaiig the th ad 7th percetiles are idicated below: f Cf <--(F P 93.7 is i this iterval) <--(CF below 83) <--(F P 31. is i this iterval) <--(CF below 10) Σ f 1 To fid the th percetile we simply substitute appropriate values ito the formula as follows: P 4. + P P 4. + (.8 ) P To fid the 7th percetile we simply substitute appropriate values ito the formula as follows: P P P (.49 ) P Havig computed both the th ad the 7th percetiles, we may ow compute the IQR: IQR The resultig value of 13. for the IQR idicates that there is a rage of 13. poits for the middle 0% of the distributio. The advatage of the IQR over the simple rage is that ay bias that might result from a few extremely high scores or a few extremely low scores (or both) has bee elimiated. Semi-iterquartile rage The fial rage based measure of variatio preseted i this chapter is the semi-iterquartile rage. The cocept iterquartile rage suggests a measure of variatio based o a quartile, or % of the distributio; however, the IQR actually represets the rage of the middle 0% of the distributio. The SIQR is

6 Chapter : Measures of Variatio from Statistical Aalysis i the Behavioral Scieces by James Raymodo Secod Editio Copyright Property of Kedall Hut Publishig 9 Chapter : Measures of Variatio a alterative to the IQR that comes closer to represetig a quartile or % size rage i the distributio. The SIQR is simply the IQR divided by. SIQR IQR I the case of our previous example, the SIQR is SIQR 13. The IQR ad the SIQR are widely used i educatio research where there always seems to be oe or two studets at each extreme of the distributio. 6.6 The Variace ad the Stadard Deviatio Variace The variace ad the stadard deviatio are two measures of variatio that are based o the cocept of deviatio from the mea. For ay distributio of scores measured o a cotiuous scale we ca compute a mea, ad the measure the distace of each score from the mea. For example, the set of 6 scores preseted below have a mea equal to We may the defie deviatio from the mea (di) as the distace of each score from the mea, or d i Usig our set of 6 scores, we may the calculate the deviatio from the mea for each score. di 13 (13 8) 11 (11 8) 3 9 (9 8) 1 7 (7 8) 1 ( 8) 3 3 (3 8) Oe way we might costruct a summary measure of variatio i a distributio of scores is to compute the average deviatio of each score from the score s mea. To do this, we would simple sum the idividual deviatios from the mea which we have just calculated, ad the divide by the umber of observatios we have.

7 Chapter : Measures of Variatio from Statistical Aalysis i the Behavioral Scieces by James Raymodo Secod Editio Copyright Property of Kedall Hut Publishig Chapter : Measures of Variatio 93 Σ di ( 1) + ( 3) + ( ) 6 Σ di It may seem quite logical to costruct a measure of variace by calculatig the average deviatio from the mea for a set of scores, but there is oe small problem. The sum of the deviatios from the mea for all distributios is always the same thig 0. Σ di 0 Oe solutio to this problem is to base our measure of variace o the squared deviatio from the mea. By squarig the result of ( ), we will elimiate the egative umbers, ad prevet the egative deviatios ad positive deviatios from cacelig each other out. Applyig this strategy to our origial distributio will give us the followig result: ( ) 13 (13 8) () 11 (11 8) 3 (3) 9 9 (9 8) 1 (1) 1 7 (7 8) 1 ( 1) 1 ( 8) 3 ( 3) 9 3 (3 8) ( ) Now if we wat to costruct a measure of variatio that gives us a sigle umber represetig the average variatio of each score i a distributio, we ca use the mea (or average) of the squared deviatios of each score from the distributio mea. We eed oly sum the squared deviatios from the mea, ad the divide by the umber of observatios. Σ( ) Σ( ) The resultig value idicates that the mea squared deviatio of each score from the distributio mea is 11.67; or stated differetly, o average, the distace squared of each score from the mea is uits. The statistic we call the variace represets the mea squared deviatio from the mea for a set of data. The logical formula for the variace is simply: where each value of i the distributio the mea of the distributio the sample size. ( ) Variace Σ

8 Chapter : Measures of Variatio from Statistical Aalysis i the Behavioral Scieces by James Raymodo Secod Editio Copyright Property of Kedall Hut Publishig 94 Chapter : Measures of Variatio What does the variace really tell us? Recall the situatio from earlier i the chapter where we had two distributios represetig the performace of two sectios of a class o a exam with the same umber of observatios, the same mea, ad the same media. Yet, we could see by simple ispectio that the two distributios were very differet. The variace of the distributio will tell us how represetative the mea is of each of the scores i the distributio. The closer each idividual score is to the mea the smaller the variace will be. If each score is at the mea i a distributio the variace will equal zero, idicatig that there is o variatio from the mea across the etire distributio. The farther each of the idividual scores is from the mea the greater the variace will be, idicatig that the mea is ot as typical of the idividual scores i the distributio. Examie the three simple distributios below. A B C Each distributio cotais observatios, ad each distributio has a mea equal to 6. Yet the observatios differ with respect to how much the idividual scores vary from the mea. Sice the variace represets the average squared deviatio from the mea, which distributio would you expect to have the greatest variace? Which distributio should have the smallest variace? The mea value of 6 seems to be most typical of the scores i distributio C, so it should have the smallest variace. The scores i distributio B are much farther from the mea value of 6, so it should have the largest variace. The scores i distributio A appear somewhat i betwee, ad should have a variace betwee that of distributio B ad distributio C. Let s calculate the variace for each distributio below: A ( ) ( ) B ( ) ( ) C ( ) ( ) Σ( ) 40 Σ( ) 64 Σ( ) 8 Variace: As suspected, the variace i distributio C is the smallest. For distributio C the average squared deviatio from the mea is 1.6 uits. Distributio B has the largest variace with a average squared deviatio of 1.8 uits. Distributio A is i betwee the two with a average squared deviatio from the mea of 8 uits. It is importat to realize that the size of the variace does ot have ay special uderlyig stadard iterpretatio. The value of the variace does ot have a special meaig like your blood pressure, where

9 Chapter : Measures of Variatio from Statistical Aalysis i the Behavioral Scieces by James Raymodo Secod Editio Copyright Property of Kedall Hut Publishig Chapter : Measures of Variatio 9 you kow that you are i reasoably good coditio with a systolic blood pressure of 10 ad a diastolic pressure of 80. There is o ormal or abormal rage for the variace. What the variace is tellig you is simply what the squared distace is from the typical or average score to the mea. Variace, alog with the mea, ca the allow you to have a idea of what a particular distributio might look like, ad will allow you to judge how well the mea serves as a measure of cetral tedecy. Cosider the case of the three simple distributios we just examied. Typically, iformatio for such distributios would ot provide idividual scores, but would be preseted i summary form as follows: Distributio Summary Statistics A B C Sample Size: Mea: Variace: Eve though we do ot have the idividual scores available, we ca reach some fairly accurate coclusios about what each of these distributios would look like. For example, we ca see that all three distributios are of the same size ( ), ad that all three have the same mea ( 6). The fact that the variace for distributio C is oly 1.6 idicates that most of the idividual scores i the distributio should be very close to the value of 6. After all, a distributio where every score is equal to the mea will have a variace of zero (remember that variace ca ever be egative). Similarly, we would assume that the idividual scores i distributio B must be much more diverse or spread out aroud the mea sice the variace is so much larger. A computatioal formula for variace Up to this poit we have utilized a logical formula for the variace that is useful for demostratig how the variace is computed, but requires us to go through some uecessary steps. A computatioal formula may be used, which simplifies the calculatios, especially whe a larger data set is ivolved. The computatioal formula preseted below may look a little more difficult at first, but with a little experiece usig it you will likely fid it to be much easier. Σ Variace ( ) Σ where Σ the sum of the s squared (Σ) the quatity, sum of s squared the sample size. Notice that the computatioal formula for variace cotais both the sum of s squared term, ad the quatity sum of s squared term. ( ) The term i the umerator: Σ Σ represets the sum of the squared deviatios from the mea that was formally writte as Σ( ) This term is also sometimes referred to as simply the sum of squares, ad will play a role i several statistical procedures that will be examied i later chapters.

10 Chapter : Measures of Variatio from Statistical Aalysis i the Behavioral Scieces by James Raymodo Secod Editio Copyright Property of Kedall Hut Publishig 96 Chapter : Measures of Variatio Let s demostrate that the computatioal formula for the variace will provide the same results that we previously obtaied for our three distributios. A B C Σ 30 Σ 0 Σ 30 Σ 44 Σ 30 Σ 188 (Σ ) 900 (Σ ) 900 (Σ ) 900 Variace Computatio Distributio A Distributio B Distributio C I each case we obtai the same result for the variace with the computatioal formula that we previously obtaied with the logical formula. Remember that the variace will ever be egative. If your calculatio of the variace results i a egative umber, you ca be sure that you have made a error somewhere. Cofusig the Σ with the (Σ ) i the computatioal formula is a commo mistake that will result i a egative umber, as is eglectig to divide (Σ ) by before subtractig the result from Σ. Some importat termiology ad symbols for variace At this poit we eed to itroduce some termiology, ad appropriate symbols for the variace. There are three situatios where we might wat to calculate the variace. The logical formula ad its computatioal alterative that we have bee usig are appropriate for two of the three situatios. Recall from Chapter 1 the distictio betwee a populatio represetig the etire collectio of uits of iterest i a research project, ad a sample that is the smaller subgroup that we select for actual observatios. The variace for a populatio is represeted by the lower case Greek letter sigma squared, or σ. The populatio variace σ may be calculated usig the formula that we have worked with up to this poit. The variace for a sample of data, whe we are oly iterested i describig the sample, is represeted by the upper case letter S, ad it too may be calculated with the formula that we have worked with up to this poit.

11 Chapter : Measures of Variatio from Statistical Aalysis i the Behavioral Scieces by James Raymodo Secod Editio Copyright Property of Kedall Hut Publishig Chapter : Measures of Variatio 97 However, if you also recall from Chapter 1, we made a distictio betwee descriptive statistics that are used to describe a set of data ad iferetial statistics that are used to ifer somethig about a populatio parameter by the observatio of sample statistics. We ofte have a iterest i doig just that i behavioral sciece research, or at the very least we are iterested i beig able to geeralize our results to a larger populatio. It turs out that if you were to actually kow the value of a populatio s variace, ad the take a series of samples from the populatio ad compare the sample variaces computed for each sample to the actual populatio variace, you would fid that the sample variaces ted to uderestimate the true size of the populatio variace. The sample variace is sometimes referred to as a biased estimator of the populatio variace, ad the directio of the bias is to uderestimate the true size of the populatio variace. We ca reduce the bias of the estimate of the populatio variace whe usig data from a sample by makig a slight adjustmet i the formula for the sample variace whe we iteded it to serve as a estimate of the populatio variace. Sice the directio of the bias is to uderestimate the true populatio variace, we ca icrease the size of the estimated variace by usig the value 1 i the deomiator of the variace formula i place of the usual deomiator. We will the use the lower case letter s to represet a sample variace that is beig used as a estimate of the populatio variace. It might be useful at this poit to review the computatioal formulas for the three situatios for computig variace. Symbol Situatio Computatioal Formula Σ σ Populatio Variace σ Σ S Sample Variace S Σ s Sample Variace used to estimate the populatio variace σ s ( ) Σ ( ) Σ ( ) Σ 1 1 as degrees of freedom The use of 1 i the deomiator of the formula for variace is the result of the cocept of degrees of freedom. Oe of the thigs that we observed whe lookig at deviatio from the mea was that the sum of the deviatios from the mea always equaled zero. However, whe dealig with sample data from a populatio there is o guaratee that the sum of the deviatios of the idividual sample scores from the populatio mea will actually equal zero. (Do t be cofused by the fact that the sum of the deviatios of the sample scores from the sample mea will equal zero. What we are cocered with here is whether or ot the sum of the deviatios of the sample scores from the populatio mea will equal zero.) Cosider this example of a populatio of N 10 with a mea μ 30. (You may wat to verify that the sum of the deviatios of the populatio scores from the populatio mea is actually zero.) Suppose I select a sample of scores from the populatio as idicated below. We ca calculate a sample mea for the five scores that were selected from the populatio, ad if we calculate the sum of the deviatios from the sample mea for the five scores i the sample we will i fact fid the sum equal to. But is the sum of the deviatios of the sample scores from the populatio mea equal to zero? No! The sum of the deviatios from the sample scores to the populatio mea is actually 30. How

12 Chapter : Measures of Variatio from Statistical Aalysis i the Behavioral Scieces by James Raymodo Secod Editio Copyright Property of Kedall Hut Publishig 98 Chapter : Measures of Variatio Describig a populatio or a sample variace Estimatig the populatio variace by observatio of a sample Σ ( Σ) σ Σ ( Σ ) S Populatio Sample s Σ (Σ) 1 Figure. Diagram of populatio ad sample with the three formulas Populatio N 10 µ 30 Sample Figure.3 Illustratio of populatio data ad sample draw from it may of the sample scores would I have to chage (or cotrol) if I wated to force the sum of the deviatios of the sample scores from the populatio mea to equal zero? The aswer is that oly oe of the sample scores must be cotrolled. I ca always esure that the sum of the deviatios of the sample scores from the populatio mea will equal zero if I ca cotrol oe of the sample scores. It does ot matter which score I choose to cotrol. I simply must be sure that the value of the score I choose will result i a deviatio from the populatio mea that whe added to the other deviatios from the populatio mea will give me a sum of zero. Sice I must cotrol oly oe score, it meas that the other scores are free to take o ay value. I other words, I have 1 degrees of freedom. The cocept of degrees of freedom will be examied agai whe we begi our ivestigatio of hypothesis testig i later chapters. It is easy to be cofused by the cocept of degrees of freedom, or the differece betwee describig a sample variace through the use of oe formula, ad estimatig the populatio variace from sample data usig a differet formula. Like may of the cocepts i statistics, it takes time for reflectio ad experiece before everythig falls ito place. At this poit, the most importat thig to keep i mid is what variace tells us about a distributio of scores. Remember, the larger the variace the less likely that the idividual scores of a distributio are close to the mea, ad the less likely that the mea is a good idicator of what the typical score i a distributio was. The smaller the variace the more likely that the idividual scores of the distributio are close to the mea, ad the more likely that the mea is a good idicator of the typical score i a distributio. We have examied the logical ad computatioal formulas for calculatig the variace whe usig raw data or a set of idividual scores. We will follow a patter similar to what we did i Chapter 4 o measures of cetral tedecy ad also briefly examie the method for computig variace whe data are preseted i a simple frequecy distributio of size i 1, ad a frequecy distributio whe i > 1. It is useful to examie these two situatios sice we do ot always cotrol the way data are preseted to us. But before movig to these other approaches I wat to itroduce the cocept of stadard deviatio, which is closely related to variace.

13 Stadard deviatio Chapter : Measures of Variatio from Statistical Aalysis i the Behavioral Scieces by James Raymodo Secod Editio Copyright Property of Kedall Hut Publishig Chapter : Measures of Variatio 99 May more people have heard the term stadard deviatio tha the umber who actually kows what it meas. But what is a stadard deviatio? The stadard deviatio of a set of data is simply the square root of the variace. Just as the variace ca be defied as the mea of the squared deviatios from the mea for a set of data, the stadard deviatio ca be defied as the square root of the mea of the squared deviatios from the mea for a set of data. You have every reaso to be woderig why do we care about the stadard deviatio whe we already kow the variace? There are really two reasos. First, recall that we were ot able to base a measure of variatio i a set of data o the simple deviatio from the mea due to the fact that the sum of the simple deviatios from the mea was always the same thig zero. We elimiated that problem by squarig the deviatios from the mea, ad usig the sum of the squared deviatios as the basis for our measuremet of variace. But i solvig oe problem we artificially iflated our measure of variace whe we squared all of the deviatios from the mea. I oe sese you ca thik of the stadard deviatio as beig a measure of variatio that is more i lie with what we were itedig to measure i the first place sice by takig the square root of the variace we are usquarig the deviatios from the mea. The iformatio coveyed by the variace ad the stadard deviatio is essetially the same, but the stadard deviatio is usually a much smaller value. There is a exceptio of course whe the variace is less tha 1.00, sice the square root of a umber greater tha zero ad less tha 1.00 is larger tha the origial umber. For example, the square root of 0.36 is the larger value 0.60, but i most cases the variace is a much larger umber, ad we will fid it much easier to work with the smaller value of a stadard deviatio. However, it is the secod reaso for workig with the stadard deviatio that is much more importat. It turs out that by kowig the mea ad the stadard deviatio for certai types of distributios we ca also kow a great deal about how the idividual observatios i that distributio are orgaized. The mea ad the stadard deviatio will tell us a great deal about ay distributio that is ormal i form. We discussed the idea of a distributio beig ormal i form or havig a bell-shaped curve i Chapter 3, ad we will be examiig the cocept of a ormal distributio i great detail i the ext chapter. Furthermore, the stadard deviatio, ad a related cocept of the stadard error that is based o the stadard deviatio, will be key terms whe we begi our ivestigatio of hypothesis testig i later chapters. Just as we represeted the populatio variace by the symbol of lower case sigma squared (σ ), we represet the symbol for a populatio stadard deviatio by the lower case sigma (σ). Similarly, a sample stadard deviatio is represeted by the upper case S, ad a stadard deviatio that is beig used to ifer a populatio stadard deviatio is symbolized by a lower case s. We will still have the same problem with stadard deviatio that we had with variace whe we attempt to use sample data to estimate the populatio stadard deviatio. We will use the same techique of alterig the deomiator of the formula by usig 1 i place of the usual deomiator whe we wish to estimate the populatio stadard deviatio. With this i mid, we ca preset both the logical formula ad the computatioal formula for the stadard deviatio. Logical Formula for Stadard Deviatio: Σ( ) σ or S Computatioal formula for Stadard Deviatio: σ or S Σ ( ) Σ

14 Chapter : Measures of Variatio from Statistical Aalysis i the Behavioral Scieces by James Raymodo Secod Editio Copyright Property of Kedall Hut Publishig 100 Chapter : Measures of Variatio Whe usig data from a sample to estimate a populatio stadard deviatio we would simply substitute 1 i the deomiator of the formula. For example, the computatioal formula would become: Computatioal formula for Sample Stadard Deviatio used to estimate the Populatio Stadard Deviatio σ: s Σ ( ) Σ 1 Techically, the correctio factor of 1 is sufficiet to adjust for the bias whe usig the sample variace s as a estimate of the populatio variace σ. However, the sample based stadard deviatio s will still be a biased estimator of the populatio stadard deviatio σ eve whe usig the correctio factor of 1. This is especially true whe workig with a very small sample size. Fortuately, oce the sample size moves above a 0 or so, the bias becomes very slight, ad because may research applicatios i the behavioral scieces ivolve a large sample size we eed ot worry i most cases. Those of you with statistical fuctio calculators might take a momet to examie your fuctio keys. May statistical calculators will automatically compute the mea ad stadard deviatio for a set of data, ad some will give you a choice o how you wat the stadard deviatio computed. You may see two keys marked as: σ, ad σ 1, providig you with a choice of a descriptive or a iferetial computatio of the stadard deviatio. Computig the stadard deviatio is relatively simple. Just fid the variace, ad the take the square root. For example, we previously computed the variace for three simple distributios, ad obtaied the results of A B C Σ 30 Σ 0 Σ 30 Σ 44 Σ 30 Σ 188 (Σ ) 900 (Σ ) 900 (Σ ) 900 S 8; S 1.8; ad S 1.6. Stadard Deviatio Computatio Distributio A Distributio B Distributio C SA 8.83 SB SC

15 Chapter : Measures of Variatio from Statistical Aalysis i the Behavioral Scieces by James Raymodo Secod Editio Copyright Property of Kedall Hut Publishig Chapter : Measures of Variatio 101 To compute the stadard deviatio we simply took the square root of each of the observed variaces. Computig Variace ad Stadard Deviatio for Frequecy Distributios As we have see earlier, we do ot always cotrol the way data are preseted to us. O occasio we may be preseted with data already i the form of a frequecy distributio with o access to the origial raw scores, ad yet we may still wat to compute a mea, or a variace ad stadard deviatio. Variace ad stadard deviatio may be computed for frequecy distributios by makig a simple adjustmet i the formula followig the same patter we used whe computig the mea for frequecy distributios i Chapter 4. To compute the variace we will eed to alter our origial computatioal formula for raw data: as follows: S S Σ Σf ( ) Σ ( f ) Σ where f each value of times its frequecy f each value of squared times its frequecy We must simply use the umber of frequecies f i each iterval to weight the value of Σ ad (Σ ) i each iterval. Cosider the simple frequecy distributio below, which we used i Chapter 4 to compute a mea. We ca add the appropriate colums to the frequecy distributio to obtai the ecessary sums. f f f Σf 16 Σf 11 Σf 884 Note that the f colum represets the product of f, ad as such you may obtai it two ways. Cosider the top two etries i the f colum of the frequecy distributio above. You may first square each value, ad the multiply the resultig sum by the appropriate value of f as follows: f 1 f f f 81 40

16 Chapter : Measures of Variatio from Statistical Aalysis i the Behavioral Scieces by James Raymodo Secod Editio Copyright Property of Kedall Hut Publishig 10 Chapter : Measures of Variatio ad so o, or you might see that you already have the value of f, ad may simply multiply that colum by the value of. Sice multiplicatio is commutative it does ot matter what order you perform the computatio. That is: f f f f So we ca also obtai the f colum as follows: 1 f 1 f 11 f f f f 4 f f Use which ever method is easier for you. To complete the computatio for the variace we eed oly plug the umbers ito the formula. Keep i mid that is equal to the total umber of observatios i our sample, ot the umber of itervals i which they happe to be categorized. I this case, 16, ot. S S S Σf ( f) Σ S So our variace is equal to 6.. To compute the stadard deviatio we would take the square root of the variace. S 6.. Notice that we are assumig that we are iterested i describig the observed sample, ad are ot attemptig to estimate a populatio variace or stadard deviatio. This is evidet from our use of 16 i the formula istead of 1 1. We follow the same geeral procedure whe workig with a frequecy distributio where the iterval size is greater tha 1. The formula for variace must be adjusted just as it was whe workig with the simpler frequecy distributio. We may write the formula as before as: S ( f ) Σ f Σ where the iterval midpoit. We will agai use oe of the frequecy distributios from Chapter 4. Our first step is to fid the midpoit of each iterval. From that poit o we are simply repeatig the procedure that we followed

17 Chapter : Measures of Variatio from Statistical Aalysis i the Behavioral Scieces by James Raymodo Secod Editio Copyright Property of Kedall Hut Publishig Chapter : Measures of Variatio 103 for the frequecy distributio where i 1. We will eed a f colum, ad a f colum to compute the variace. f f f , , , , , ,40 Σf 0 Σf 1,73 Σf 63,01 S S S 6301 (173) 0 0 3, 010, S We have computed the variace, ad may ow compute the stadard deviatio by simply takig the square root of 6.1. S Variace as Predictio Error (or Cabo Sa Lucas Here I Come!) We have examied the idea of variatio i data i several differet ways such as the rage, the IQR, the SIQR, variace, ad stadard deviatio. Toward that ed, we have spet a great deal of time doig a variety of computatios. While it is importat to be able to take a statistical formula, apply it to a set of data, ad geerate the correct result, it is much more importat to kow why we do it. I other words, what does the resultig statistic mea? What does it tell us about a set of data that we did ot kow before? Up util ow I have stressed the idea that variace is importat because it tells us somethig about the way that idividual scores are distributed aroud their mea. By kowig the size of the variace we kow how represetative the mea is of the idividual scores i the distributio. The variace viewed i those terms is a importat piece of iformatio, but i the behavioral scieces we ca, ad ofte do, look at variace i aother way. We look at variace as a type of predictio error, ad try to fid ways to reduce the size of predictio error. Or you might thik of the process as simply tryig to do a better job of predictig the value of some variable that we cosider importat i our research. Suppose we have a set of data represetig aual icome i thousads of dollars for a group of 10 idividuals.

18 Chapter : Measures of Variatio from Statistical Aalysis i the Behavioral Scieces by James Raymodo Secod Editio Copyright Property of Kedall Hut Publishig 104 Chapter : Measures of Variatio Icome ( 1,000) Σf 9 We could calculate the mea icome ad would fid that it is 9., or $9,00 per year. By examiig the 10 scores i the distributio you ca see that there is variatio preset. That is, ot everyoe has a icome of 9.; the icomes vary, some are higher ad some are lower. Now suppose I told you that I had the icome of each perso writte o a piece of paper, ad that I was goig to draw the pieces of paper at radom ad let you guess what the perso s icome was. The oly restrictio is that you have to make the same guess each time. You are free to choose ay of the icomes represeted, or ay other value for that matter as your guess. All 10 icomes will evetually be selected, ad I am goig to measure how well you guess by comparig your guess to the icome that is chose. Sice some of your guesses will be too high ad others too low, I am goig to square the differece betwee the actual icome ad your guess to elimiate ay egative umbers. After all 10 icomes have bee selected, I will calculate a mea of your squared differece for each guess as a idicatio of how well you have doe. What icome value would you choose as your guess? You wat to select a value to guess each time that will give you the smallest amout of squared error possible. (Choose wisely because there might be a prize i this for you if you wi. I m thikig maybe some ice luggage ad a trip to Cabo Sa Lucas, but I have t made up my mid yet.) If you examie the distributio you will otice that there are two scores at 1, ad you might be tempted to select the value 1 as your guess sice your mea squared differece for those two cases would be zero. However, your mea squared differece across all 10 icomes would be 7.9 if you select the value 1 as your guess. (You might wat to verify this by subtractig 1 from each of the observed icomes, squarig the differece, ad the averagig the 10 results). Is that a wiig performace? I doubt it, it seems high. Selectig the mode of 1 did ot seem to be a wise strategy. What if you selected the media icome as your guess? If you rearrage the umbers you will fid the media to be equal to 1. If 1 becomes your guess, you will ot be exactly right o ay of the 10 icomes, but your mea squared differece across all 10 icomes will be 437.9, which is much improved over the strategy of selectig 1, but is it the best you could do? Oe fial strategy might be to select the mea value of 9. as your guess. Agai, you will ot be exactly right o ay of the 10 icomes, but how would you do across all 10? By selectig the mea of 9. as your guess, your mea squared differece would be That is the best we have see yet, ad i fact, there is o other guess that would be ay better. Now if you thik about how we have measured

19 Chapter : Measures of Variatio from Statistical Aalysis i the Behavioral Scieces by James Raymodo Secod Editio Copyright Property of Kedall Hut Publishig Chapter : Measures of Variatio 10 the accuracy of each guess, you might recogize that the value of 36.6 represets somethig else we have examied i this chapter. We took each score, subtracted your guess of the mea, squared the differece, ad the calculated the mea of the 10 squared deviatios. I other words, we computed the variace. Icome ( 1,000) ( - ) ( - ) Σ( ) S Sice there is o other sigle value that would serve as a better guess of the idividual scores tha the mea, we ca thik of the variace as the maximum amout of predictio error that we would have to accept whe tryig to predict a idividual score. Or to thik of it a differet way, your best guess of a score i a distributio is the mea, assumig that o additioal iformatio is available to you. (For those of you who origially guessed the mea, the prize committee iforms me that either the trip to Cabo or the luggage is available. You do wi our home game allowig you hours of fu guessig ayoe s icome you please.) Now let s chage the rules a little bit. Suppose before I have you guess the icome value, I am willig to give you oe piece of additioal iformatio. It would be i your best iterest to ask for somethig that might help you better predict icome. What sorts of thigs (variables) are related to icome? You ca probably thik of may thigs, but certaily educatio is a key variable that helps determie oe s icome. Suppose I am willig to tell you if the icome I have selected is that of a perso who has a college degree or ot. Let s also suppose that I am willig to let you provide two differet icomes as your guess; oe for the college graduates, ad oe for the ocollege graduates. (I hope you uderstad that by chagig the rules of the game the trip to Cabo Sa Lucas is defiitely out of the questio for this roud. No, you re ot goig to get the luggage either!) As you might suspect, your best strategy for guessig has chaged. I the light of this ew iformatio, your best strategy is to guess the mea of the college graduates as the selected icome whe you kow I have selected a college graduate, ad to guess the mea of the ocollege graduates whe you kow I have selected a ocollege graduate. Let s look at the icome distributio agai with the ew iformatio added.

20 Chapter : Measures of Variatio from Statistical Aalysis i the Behavioral Scieces by James Raymodo Secod Editio Copyright Property of Kedall Hut Publishig 106 Chapter : Measures of Variatio College Graduate Icome ( 1,000) YES 30 NO 0 YES 0 NO 1 NO 1 College Graduate s Mea 43.4 YES 7 Nocollege Graduate s Mea 1.6 YES 40 NO 1 YES NO 16 The mea icome of the college graduates is 43.4 or $43,400, ad the mea icome of the ocollege graduates is 1.6, or $1,600. How well ca you guess icome ow if you guess the college graduate mea of 43.4 whe you kow the idividual has a college degree, ad guess the ocollege graduate mea of 1.6 whe you kow the idividual does ot have a college degree? We will substitute the appropriate mea ito the calculatio of ( ), ad ( ), ad the compute what we ca thik of as a modified variace (symbolized by S ). College Graduate Icome ( 1,000) ( ) ( - ) YES 30 ( ) NO 0 (0 1.6) YES 0 (0 43.4) NO 1 (1 1.6) NO 1 (1 1.6) YES 7 (7 43.4) YES 40 ( ) NO 1 (1 1.6) YES ( 43.4) NO 16 (16 1.6) Σ( ) S' Whe we use the mea icome of the college graduates to guess a college graduate s icome, ad the mea icome of the ocollege graduates to guess the mea icome of the ocollege graduates, ad the fid the mea squared deviatio, we arrive at a modified measure of variace; oe that uses the specific group mea i place of the overall group mea. I this case usig educatioal level to help predict icome results i a modified variace of 17.44, which we ca compare with our previous variace of By usig the appropriate mea icome for each group we have bee able to reduce the amout

21 Chapter : Measures of Variatio from Statistical Aalysis i the Behavioral Scieces by James Raymodo Secod Editio Copyright Property of Kedall Hut Publishig Chapter : Measures of Variatio 107 of variace by poits ( ), or we ca express that differece as a percetage of the origial variace ad say that we have reduced the variace by.8%. Usig educatioal level to help predict icome has explaied over half (.8%) of the variace i icome. Not everyoe i our small sample of 10 idividuals has the same icome, variatio is preset. By kowig what the idividual s educatioal level is we are able to explai or accout for over half of the variatio i icome. This idea of beig able to explai or reduce variace i oe variable by kowig the value of a secod variable is oe of the more importat cocepts i statistical aalysis i the behavioral scieces. We will deal with this cocept agai whe lookig at the iterpretatio of correlatio betwee two variables i Chapter 9, ad i assessig the quality of a liear regressio aalysis i Chapter 10. Computer Applicatios 1. Select several variables from the GSS data set, ad geerate descriptive statistics.. Be sure to click o the Optios butto ad request the variace ad rage i additio to the default statistics of mea, stadard deviatio, miimum, ad maximum. 3. Eter data from oe of the short examples from the text ad compare the results to SPSS. Is SPSS usig the variace formula with as the deomiator, or 1? How to do it Ope the GSS data set, ad the click o Aalyze, Descriptive Statistics, ad the Descriptives. Highlight the desired variables ad select them by clickig o the directio arrow. Click o Optios to request additioal statistics such as the variace ad rage. Click o OK to ru the procedure. Clear the GSS data set by clickig o File, New, ad the Data. Use the ew empty Data Editor Scree to iput data from oe of the simple examples from the text. Ru the descriptive statistics procedure with variace requested, ad determie the formula used. Summary of Key Poits Measures of variatio are a group of statistics that idicate how a set of scores is distributed. Some measures of variatio idicate the variatio from the bottom or lower ed of the distributio to the top or the upper ed of the distributio, while other measures of variatio idicate the variatio of each score from a cetral poit such as the mea. The former techiques typically measure the rage of the distributio, while the latter techiques typically measure deviatio from the mea. Rage The rage is the distace from the smallest score i a distributio to the largest score. It is oe of the simplest measures of variatio. Iterquartile Rage The iterquartile rage is the distace from the 7th percetile to the th percetile i a distributio. Semi-Iterquartile Rage The semi-iterquartile rage is the iterquartile rage divided by. Deviatio from the Mea Deviatio from the mea is a measure of each score s distace from the mea. The sum of the deviatios from the mea is always zero. Variace The variace is the average of the squared deviatios from the mea for a set of scores. We square the deviatios to keep the positive ad egative deviatios from cacellig each other out. The smaller the variace the more closely the scores of a distributio are to their mea. Stadard Deviatio The stadard deviatio is the square root of the variace. Degrees of Freedom The umber of values i a sample that are free to take o ay value ad still represet a ubiased estimate of a populatio parameter. Normal i Form A distributio of scores whose shape approximates that of a ormal or bellshaped curve.

22 Chapter : Measures of Variatio from Statistical Aalysis i the Behavioral Scieces by James Raymodo Secod Editio Copyright Property of Kedall Hut Publishig 108 Chapter : Measures of Variatio Reduce Variace The ability to reduce the amout of error whe predictig a variable by makig use of iformatio obtaied from a secod variable. Reducig or explaiig variace is a importat cocept that is cetral to several statistical procedures. Questios ad Problems for Review 1. Compute the rage for the followig sets of data: A B Uder what circumstaces would it be wise to compute the IQR or the SIQR? 3. Examie the frequecy distributio below. Do you thik it would be better to report the rage for these data, or the IQR? Why? f Cf Σf Compute the rage, IQR, ad SIQR for the data i Problem 3 above.. Examie the summary iformatio preseted below. What ca you coclude about the icome distributio for each occupatioal category? For which groups does the mea seem to be a better measure of cetral tedecy? For which groups is the mea less idicative of overall group icome? Icome Occupatio Mea ($) Stadard Deviatio ($) Accoutat 3,00 8,00 Attorey 7,00 3,100 Egieer 7,800 1,100 Evagelist 66,600 7,800 Physicia 108,000 8,00 Psychic 3,700 4,0 6. Compute the variace ad stadard deviatio for the two sectios of the statistics class illustrated i Figure.1.

23 Chapter : Measures of Variatio from Statistical Aalysis i the Behavioral Scieces by James Raymodo Secod Editio Copyright Property of Kedall Hut Publishig Chapter : Measures of Variatio Compute the variace ad stadard deviatio for the simple frequecy distributio below f Σf 0 8. What does it mea to be able to explai variace? 9. Scores for the verbal sectio of the SAT are preseted below for a group of 10 studets. A. Compute the variace for the etire group. B. Compute the mea score of the females, ad the mea score of the males. Geder Verbal SAT Female 00 Female 60 Female 48 Female 70 Male 40 Male 39 Male 700 Male 630 Male 40 Male How much of the variace i the SAT scores i problem 9 ca be explaied by geder? (Hit: you will eed to use the mea score of the females whe predictig a female score, ad the mea score of the males whe predictig a male score to compute a modified variace).

Statistics 511 Additional Materials

Statistics 511 Additional Materials Cofidece Itervals o mu Statistics 511 Additioal Materials This topic officially moves us from probability to statistics. We begi to discuss makig ifereces about the populatio. Oe way to differetiate probability

More information

6.3 Testing Series With Positive Terms

6.3 Testing Series With Positive Terms 6.3. TESTING SERIES WITH POSITIVE TERMS 307 6.3 Testig Series With Positive Terms 6.3. Review of what is kow up to ow I theory, testig a series a i for covergece amouts to fidig the i= sequece of partial

More information

1 Inferential Methods for Correlation and Regression Analysis

1 Inferential Methods for Correlation and Regression Analysis 1 Iferetial Methods for Correlatio ad Regressio Aalysis I the chapter o Correlatio ad Regressio Aalysis tools for describig bivariate cotiuous data were itroduced. The sample Pearso Correlatio Coefficiet

More information

II. Descriptive Statistics D. Linear Correlation and Regression. 1. Linear Correlation

II. Descriptive Statistics D. Linear Correlation and Regression. 1. Linear Correlation II. Descriptive Statistics D. Liear Correlatio ad Regressio I this sectio Liear Correlatio Cause ad Effect Liear Regressio 1. Liear Correlatio Quatifyig Liear Correlatio The Pearso product-momet correlatio

More information

ACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 1 MATH00030 SEMESTER / Statistics

ACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 1 MATH00030 SEMESTER / Statistics ACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 1 MATH00030 SEMESTER 1 018/019 DR. ANTHONY BROWN 8. Statistics 8.1. Measures of Cetre: Mea, Media ad Mode. If we have a series of umbers the

More information

Anna Janicka Mathematical Statistics 2018/2019 Lecture 1, Parts 1 & 2

Anna Janicka Mathematical Statistics 2018/2019 Lecture 1, Parts 1 & 2 Aa Jaicka Mathematical Statistics 18/19 Lecture 1, Parts 1 & 1. Descriptive Statistics By the term descriptive statistics we will mea the tools used for quatitative descriptio of the properties of a sample

More information

Understanding Samples

Understanding Samples 1 Will Moroe CS 109 Samplig ad Bootstrappig Lecture Notes #17 August 2, 2017 Based o a hadout by Chris Piech I this chapter we are goig to talk about statistics calculated o samples from a populatio. We

More information

Chapter If n is odd, the median is the exact middle number If n is even, the median is the average of the two middle numbers

Chapter If n is odd, the median is the exact middle number If n is even, the median is the average of the two middle numbers Chapter 4 4-1 orth Seattle Commuity College BUS10 Busiess Statistics Chapter 4 Descriptive Statistics Summary Defiitios Cetral tedecy: The extet to which the data values group aroud a cetral value. Variatio:

More information

Measures of Spread: Standard Deviation

Measures of Spread: Standard Deviation Measures of Spread: Stadard Deviatio So far i our study of umerical measures used to describe data sets, we have focused o the mea ad the media. These measures of ceter tell us the most typical value of

More information

Number of fatalities X Sunday 4 Monday 6 Tuesday 2 Wednesday 0 Thursday 3 Friday 5 Saturday 8 Total 28. Day

Number of fatalities X Sunday 4 Monday 6 Tuesday 2 Wednesday 0 Thursday 3 Friday 5 Saturday 8 Total 28. Day LECTURE # 8 Mea Deviatio, Stadard Deviatio ad Variace & Coefficiet of variatio Mea Deviatio Stadard Deviatio ad Variace Coefficiet of variatio First, we will discuss it for the case of raw data, ad the

More information

1 Lesson 6: Measure of Variation

1 Lesson 6: Measure of Variation 1 Lesso 6: Measure of Variatio 1.1 The rage As we have see, there are several viable coteders for the best measure of the cetral tedecy of data. The mea, the mode ad the media each have certai advatages

More information

MEASURES OF DISPERSION (VARIABILITY)

MEASURES OF DISPERSION (VARIABILITY) POLI 300 Hadout #7 N. R. Miller MEASURES OF DISPERSION (VARIABILITY) While measures of cetral tedecy idicate what value of a variable is (i oe sese or other, e.g., mode, media, mea), average or cetral

More information

Confidence Intervals for the Population Proportion p

Confidence Intervals for the Population Proportion p Cofidece Itervals for the Populatio Proportio p The cocept of cofidece itervals for the populatio proportio p is the same as the oe for, the samplig distributio of the mea, x. The structure is idetical:

More information

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample. Statistical Iferece (Chapter 10) Statistical iferece = lear about a populatio based o the iformatio provided by a sample. Populatio: The set of all values of a radom variable X of iterest. Characterized

More information

Infinite Sequences and Series

Infinite Sequences and Series Chapter 6 Ifiite Sequeces ad Series 6.1 Ifiite Sequeces 6.1.1 Elemetary Cocepts Simply speakig, a sequece is a ordered list of umbers writte: {a 1, a 2, a 3,...a, a +1,...} where the elemets a i represet

More information

Estimation of a population proportion March 23,

Estimation of a population proportion March 23, 1 Social Studies 201 Notes for March 23, 2005 Estimatio of a populatio proportio Sectio 8.5, p. 521. For the most part, we have dealt with meas ad stadard deviatios this semester. This sectio of the otes

More information

Lecture 1. Statistics: A science of information. Population: The population is the collection of all subjects we re interested in studying.

Lecture 1. Statistics: A science of information. Population: The population is the collection of all subjects we re interested in studying. Lecture Mai Topics: Defiitios: Statistics, Populatio, Sample, Radom Sample, Statistical Iferece Type of Data Scales of Measuremet Describig Data with Numbers Describig Data Graphically. Defiitios. Example

More information

Chapter 6 Sampling Distributions

Chapter 6 Sampling Distributions Chapter 6 Samplig Distributios 1 I most experimets, we have more tha oe measuremet for ay give variable, each measuremet beig associated with oe radomly selected a member of a populatio. Hece we eed to

More information

ANALYSIS OF EXPERIMENTAL ERRORS

ANALYSIS OF EXPERIMENTAL ERRORS ANALYSIS OF EXPERIMENTAL ERRORS All physical measuremets ecoutered i the verificatio of physics theories ad cocepts are subject to ucertaities that deped o the measurig istrumets used ad the coditios uder

More information

4.3 Growth Rates of Solutions to Recurrences

4.3 Growth Rates of Solutions to Recurrences 4.3. GROWTH RATES OF SOLUTIONS TO RECURRENCES 81 4.3 Growth Rates of Solutios to Recurreces 4.3.1 Divide ad Coquer Algorithms Oe of the most basic ad powerful algorithmic techiques is divide ad coquer.

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 Data Aalysis ad Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasii/teachig.html Suhasii Subba Rao Review of testig: Example The admistrator of a ursig home wats to do a time ad motio

More information

Elementary Statistics

Elementary Statistics Elemetary Statistics M. Ghamsary, Ph.D. Sprig 004 Chap 0 Descriptive Statistics Raw Data: Whe data are collected i origial form, they are called raw data. The followig are the scores o the first test of

More information

Median and IQR The median is the value which divides the ordered data values in half.

Median and IQR The median is the value which divides the ordered data values in half. STA 666 Fall 2007 Web-based Course Notes 4: Describig Distributios Numerically Numerical summaries for quatitative variables media ad iterquartile rage (IQR) 5-umber summary mea ad stadard deviatio Media

More information

Final Examination Solutions 17/6/2010

Final Examination Solutions 17/6/2010 The Islamic Uiversity of Gaza Faculty of Commerce epartmet of Ecoomics ad Political Scieces A Itroductio to Statistics Course (ECOE 30) Sprig Semester 009-00 Fial Eamiatio Solutios 7/6/00 Name: I: Istructor:

More information

ENGI 4421 Probability and Statistics Faculty of Engineering and Applied Science Problem Set 1 Solutions Descriptive Statistics. None at all!

ENGI 4421 Probability and Statistics Faculty of Engineering and Applied Science Problem Set 1 Solutions Descriptive Statistics. None at all! ENGI 44 Probability ad Statistics Faculty of Egieerig ad Applied Sciece Problem Set Solutios Descriptive Statistics. If, i the set of values {,, 3, 4, 5, 6, 7 } a error causes the value 5 to be replaced

More information

Frequentist Inference

Frequentist Inference Frequetist Iferece The topics of the ext three sectios are useful applicatios of the Cetral Limit Theorem. Without kowig aythig about the uderlyig distributio of a sequece of radom variables {X i }, for

More information

SECTION 1.5 : SUMMATION NOTATION + WORK WITH SEQUENCES

SECTION 1.5 : SUMMATION NOTATION + WORK WITH SEQUENCES SECTION 1.5 : SUMMATION NOTATION + WORK WITH SEQUENCES Read Sectio 1.5 (pages 5 9) Overview I Sectio 1.5 we lear to work with summatio otatio ad formulas. We will also itroduce a brief overview of sequeces,

More information

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4 MATH 30: Probability ad Statistics 9. Estimatio ad Testig of Parameters Estimatio ad Testig of Parameters We have bee dealig situatios i which we have full kowledge of the distributio of a radom variable.

More information

Chapter 23: Inferences About Means

Chapter 23: Inferences About Means Chapter 23: Ifereces About Meas Eough Proportios! We ve spet the last two uits workig with proportios (or qualitative variables, at least) ow it s time to tur our attetios to quatitative variables. For

More information

Discrete Mathematics for CS Spring 2008 David Wagner Note 22

Discrete Mathematics for CS Spring 2008 David Wagner Note 22 CS 70 Discrete Mathematics for CS Sprig 2008 David Wager Note 22 I.I.D. Radom Variables Estimatig the bias of a coi Questio: We wat to estimate the proportio p of Democrats i the US populatio, by takig

More information

DS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10

DS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10 DS 00: Priciples ad Techiques of Data Sciece Date: April 3, 208 Name: Hypothesis Testig Discussio #0. Defie these terms below as they relate to hypothesis testig. a) Data Geeratio Model: Solutio: A set

More information

Chapter 8: Estimating with Confidence

Chapter 8: Estimating with Confidence Chapter 8: Estimatig with Cofidece Sectio 8.2 The Practice of Statistics, 4 th editio For AP* STARNES, YATES, MOORE Chapter 8 Estimatig with Cofidece 8.1 Cofidece Itervals: The Basics 8.2 8.3 Estimatig

More information

Response Variable denoted by y it is the variable that is to be predicted measure of the outcome of an experiment also called the dependent variable

Response Variable denoted by y it is the variable that is to be predicted measure of the outcome of an experiment also called the dependent variable Statistics Chapter 4 Correlatio ad Regressio If we have two (or more) variables we are usually iterested i the relatioship betwee the variables. Associatio betwee Variables Two variables are associated

More information

Lecture 24 Floods and flood frequency

Lecture 24 Floods and flood frequency Lecture 4 Floods ad flood frequecy Oe of the thigs we wat to kow most about rivers is what s the probability that a flood of size will happe this year? I 100 years? There are two ways to do this empirically,

More information

Random Variables, Sampling and Estimation

Random Variables, Sampling and Estimation Chapter 1 Radom Variables, Samplig ad Estimatio 1.1 Itroductio This chapter will cover the most importat basic statistical theory you eed i order to uderstad the ecoometric material that will be comig

More information

Chapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc.

Chapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc. Chapter 22 Comparig Two Proportios Copyright 2010, 2007, 2004 Pearso Educatio, Ic. Comparig Two Proportios Read the first two paragraphs of pg 504. Comparisos betwee two percetages are much more commo

More information

Continuous Data that can take on any real number (time/length) based on sample data. Categorical data can only be named or categorised

Continuous Data that can take on any real number (time/length) based on sample data. Categorical data can only be named or categorised Questio 1. (Topics 1-3) A populatio cosists of all the members of a group about which you wat to draw a coclusio (Greek letters (μ, σ, Ν) are used) A sample is the portio of the populatio selected for

More information

NUMERICAL METHODS COURSEWORK INFORMAL NOTES ON NUMERICAL INTEGRATION COURSEWORK

NUMERICAL METHODS COURSEWORK INFORMAL NOTES ON NUMERICAL INTEGRATION COURSEWORK NUMERICAL METHODS COURSEWORK INFORMAL NOTES ON NUMERICAL INTEGRATION COURSEWORK For this piece of coursework studets must use the methods for umerical itegratio they meet i the Numerical Methods module

More information

µ and π p i.e. Point Estimation x And, more generally, the population proportion is approximately equal to a sample proportion

µ and π p i.e. Point Estimation x And, more generally, the population proportion is approximately equal to a sample proportion Poit Estimatio Poit estimatio is the rather simplistic (ad obvious) process of usig the kow value of a sample statistic as a approximatio to the ukow value of a populatio parameter. So we could for example

More information

t distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference

t distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference EXST30 Backgroud material Page From the textbook The Statistical Sleuth Mea [0]: I your text the word mea deotes a populatio mea (µ) while the work average deotes a sample average ( ). Variace [0]: The

More information

Properties and Hypothesis Testing

Properties and Hypothesis Testing Chapter 3 Properties ad Hypothesis Testig 3.1 Types of data The regressio techiques developed i previous chapters ca be applied to three differet kids of data. 1. Cross-sectioal data. 2. Time series data.

More information

Chapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc.

Chapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc. Chapter 22 Comparig Two Proportios Copyright 2010 Pearso Educatio, Ic. Comparig Two Proportios Comparisos betwee two percetages are much more commo tha questios about isolated percetages. Ad they are more

More information

Chapter 6. Sampling and Estimation

Chapter 6. Sampling and Estimation Samplig ad Estimatio - 34 Chapter 6. Samplig ad Estimatio 6.. Itroductio Frequetly the egieer is uable to completely characterize the etire populatio. She/he must be satisfied with examiig some subset

More information

BIOS 4110: Introduction to Biostatistics. Breheny. Lab #9

BIOS 4110: Introduction to Biostatistics. Breheny. Lab #9 BIOS 4110: Itroductio to Biostatistics Brehey Lab #9 The Cetral Limit Theorem is very importat i the realm of statistics, ad today's lab will explore the applicatio of it i both categorical ad cotiuous

More information

Data Description. Measure of Central Tendency. Data Description. Chapter x i

Data Description. Measure of Central Tendency. Data Description. Chapter x i Data Descriptio Describe Distributio with Numbers Example: Birth weights (i lb) of 5 babies bor from two groups of wome uder differet care programs. Group : 7, 6, 8, 7, 7 Group : 3, 4, 8, 9, Chapter 3

More information

Goodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen)

Goodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen) Goodess-of-Fit Tests ad Categorical Data Aalysis (Devore Chapter Fourtee) MATH-252-01: Probability ad Statistics II Sprig 2019 Cotets 1 Chi-Squared Tests with Kow Probabilities 1 1.1 Chi-Squared Testig................

More information

The Binomial Theorem

The Binomial Theorem The Biomial Theorem Robert Marti Itroductio The Biomial Theorem is used to expad biomials, that is, brackets cosistig of two distict terms The formula for the Biomial Theorem is as follows: (a + b ( k

More information

Analysis of Experimental Data

Analysis of Experimental Data Aalysis of Experimetal Data 6544597.0479 ± 0.000005 g Quatitative Ucertaity Accuracy vs. Precisio Whe we make a measuremet i the laboratory, we eed to kow how good it is. We wat our measuremets to be both

More information

Read through these prior to coming to the test and follow them when you take your test.

Read through these prior to coming to the test and follow them when you take your test. Math 143 Sprig 2012 Test 2 Iformatio 1 Test 2 will be give i class o Thursday April 5. Material Covered The test is cummulative, but will emphasize the recet material (Chapters 6 8, 10 11, ad Sectios 12.1

More information

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING Lectures MODULE 5 STATISTICS II. Mea ad stadard error of sample data. Biomial distributio. Normal distributio 4. Samplig 5. Cofidece itervals

More information

Activity 3: Length Measurements with the Four-Sided Meter Stick

Activity 3: Length Measurements with the Four-Sided Meter Stick Activity 3: Legth Measuremets with the Four-Sided Meter Stick OBJECTIVE: The purpose of this experimet is to study errors ad the propagatio of errors whe experimetal data derived usig a four-sided meter

More information

PH 425 Quantum Measurement and Spin Winter SPINS Lab 1

PH 425 Quantum Measurement and Spin Winter SPINS Lab 1 PH 425 Quatum Measuremet ad Spi Witer 23 SPIS Lab Measure the spi projectio S z alog the z-axis This is the experimet that is ready to go whe you start the program, as show below Each atom is measured

More information

Recall the study where we estimated the difference between mean systolic blood pressure levels of users of oral contraceptives and non-users, x - y.

Recall the study where we estimated the difference between mean systolic blood pressure levels of users of oral contraceptives and non-users, x - y. Testig Statistical Hypotheses Recall the study where we estimated the differece betwee mea systolic blood pressure levels of users of oral cotraceptives ad o-users, x - y. Such studies are sometimes viewed

More information

STP 226 EXAMPLE EXAM #1

STP 226 EXAMPLE EXAM #1 STP 226 EXAMPLE EXAM #1 Istructor: Hoor Statemet: I have either give or received iformatio regardig this exam, ad I will ot do so util all exams have bee graded ad retured. PRINTED NAME: Siged Date: DIRECTIONS:

More information

CHAPTER I: Vector Spaces

CHAPTER I: Vector Spaces CHAPTER I: Vector Spaces Sectio 1: Itroductio ad Examples This first chapter is largely a review of topics you probably saw i your liear algebra course. So why cover it? (1) Not everyoe remembers everythig

More information

Chapter 12 Correlation

Chapter 12 Correlation Chapter Correlatio Correlatio is very similar to regressio with oe very importat differece. Regressio is used to explore the relatioship betwee a idepedet variable ad a depedet variable, whereas correlatio

More information

Power and Type II Error

Power and Type II Error Statistical Methods I (EXST 7005) Page 57 Power ad Type II Error Sice we do't actually kow the value of the true mea (or we would't be hypothesizig somethig else), we caot kow i practice the type II error

More information

7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals

7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals 7-1 Chapter 4 Part I. Samplig Distributios ad Cofidece Itervals 1 7- Sectio 1. Samplig Distributio 7-3 Usig Statistics Statistical Iferece: Predict ad forecast values of populatio parameters... Test hypotheses

More information

BIOSTATS 640 Intermediate Biostatistics Frequently Asked Questions Topic 1 FAQ 1 Review of BIOSTATS 540 Introductory Biostatistics

BIOSTATS 640 Intermediate Biostatistics Frequently Asked Questions Topic 1 FAQ 1 Review of BIOSTATS 540 Introductory Biostatistics BIOTAT 640 Itermediate Biostatistics Frequetly Asked Questios Topic FAQ Review of BIOTAT 540 Itroductory Biostatistics. I m cofused about the jargo ad otatio, especially populatio versus sample. Could

More information

This is an introductory course in Analysis of Variance and Design of Experiments.

This is an introductory course in Analysis of Variance and Design of Experiments. 1 Notes for M 384E, Wedesday, Jauary 21, 2009 (Please ote: I will ot pass out hard-copy class otes i future classes. If there are writte class otes, they will be posted o the web by the ight before class

More information

Math 113 Exam 3 Practice

Math 113 Exam 3 Practice Math Exam Practice Exam will cover.-.9. This sheet has three sectios. The first sectio will remid you about techiques ad formulas that you should kow. The secod gives a umber of practice questios for you

More information

Expectation and Variance of a random variable

Expectation and Variance of a random variable Chapter 11 Expectatio ad Variace of a radom variable The aim of this lecture is to defie ad itroduce mathematical Expectatio ad variace of a fuctio of discrete & cotiuous radom variables ad the distributio

More information

Chapter 2 Descriptive Statistics

Chapter 2 Descriptive Statistics Chapter 2 Descriptive Statistics Statistics Most commoly, statistics refers to umerical data. Statistics may also refer to the process of collectig, orgaizig, presetig, aalyzig ad iterpretig umerical data

More information

Estimation for Complete Data

Estimation for Complete Data Estimatio for Complete Data complete data: there is o loss of iformatio durig study. complete idividual complete data= grouped data A complete idividual data is the oe i which the complete iformatio of

More information

Example: Find the SD of the set {x j } = {2, 4, 5, 8, 5, 11, 7}.

Example: Find the SD of the set {x j } = {2, 4, 5, 8, 5, 11, 7}. 1 (*) If a lot of the data is far from the mea, the may of the (x j x) 2 terms will be quite large, so the mea of these terms will be large ad the SD of the data will be large. (*) I particular, outliers

More information

Measures of Spread: Variance and Standard Deviation

Measures of Spread: Variance and Standard Deviation Lesso 1-6 Measures of Spread: Variace ad Stadard Deviatio BIG IDEA Variace ad stadard deviatio deped o the mea of a set of umbers. Calculatig these measures of spread depeds o whether the set is a sample

More information

SNAP Centre Workshop. Basic Algebraic Manipulation

SNAP Centre Workshop. Basic Algebraic Manipulation SNAP Cetre Workshop Basic Algebraic Maipulatio 8 Simplifyig Algebraic Expressios Whe a expressio is writte i the most compact maer possible, it is cosidered to be simplified. Not Simplified: x(x + 4x)

More information

Tennessee Department of Education

Tennessee Department of Education Teessee Departmet of Educatio Task: Comparig Shapes Geometry O a piece of graph paper with a coordiate plae, draw three o-colliear poits ad label them A, B, C. (Do ot use the origi as oe of your poits.)

More information

Sample Size Determination (Two or More Samples)

Sample Size Determination (Two or More Samples) Sample Sie Determiatio (Two or More Samples) STATGRAPHICS Rev. 963 Summary... Data Iput... Aalysis Summary... 5 Power Curve... 5 Calculatios... 6 Summary This procedure determies a suitable sample sie

More information

Analysis of Experimental Measurements

Analysis of Experimental Measurements Aalysis of Experimetal Measuremets Thik carefully about the process of makig a measuremet. A measuremet is a compariso betwee some ukow physical quatity ad a stadard of that physical quatity. As a example,

More information

Inferential Statistics. Inference Process. Inferential Statistics and Probability a Holistic Approach. Inference Process.

Inferential Statistics. Inference Process. Inferential Statistics and Probability a Holistic Approach. Inference Process. Iferetial Statistics ad Probability a Holistic Approach Iferece Process Chapter 8 Poit Estimatio ad Cofidece Itervals This Course Material by Maurice Geraghty is licesed uder a Creative Commos Attributio-ShareAlike

More information

Chapter 6 Part 5. Confidence Intervals t distribution chi square distribution. October 23, 2008

Chapter 6 Part 5. Confidence Intervals t distribution chi square distribution. October 23, 2008 Chapter 6 Part 5 Cofidece Itervals t distributio chi square distributio October 23, 2008 The will be o help sessio o Moday, October 27. Goal: To clearly uderstad the lik betwee probability ad cofidece

More information

Chapter 8: STATISTICAL INTERVALS FOR A SINGLE SAMPLE. Part 3: Summary of CI for µ Confidence Interval for a Population Proportion p

Chapter 8: STATISTICAL INTERVALS FOR A SINGLE SAMPLE. Part 3: Summary of CI for µ Confidence Interval for a Population Proportion p Chapter 8: STATISTICAL INTERVALS FOR A SINGLE SAMPLE Part 3: Summary of CI for µ Cofidece Iterval for a Populatio Proportio p Sectio 8-4 Summary for creatig a 100(1-α)% CI for µ: Whe σ 2 is kow ad paret

More information

Stat 139 Homework 7 Solutions, Fall 2015

Stat 139 Homework 7 Solutions, Fall 2015 Stat 139 Homework 7 Solutios, Fall 2015 Problem 1. I class we leared that the classical simple liear regressio model assumes the followig distributio of resposes: Y i = β 0 + β 1 X i + ɛ i, i = 1,...,,

More information

Parameter, Statistic and Random Samples

Parameter, Statistic and Random Samples Parameter, Statistic ad Radom Samples A parameter is a umber that describes the populatio. It is a fixed umber, but i practice we do ot kow its value. A statistic is a fuctio of the sample data, i.e.,

More information

MA131 - Analysis 1. Workbook 2 Sequences I

MA131 - Analysis 1. Workbook 2 Sequences I MA3 - Aalysis Workbook 2 Sequeces I Autum 203 Cotets 2 Sequeces I 2. Itroductio.............................. 2.2 Icreasig ad Decreasig Sequeces................ 2 2.3 Bouded Sequeces..........................

More information

Overview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions

Overview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions Chapter 9 Slide Ifereces from Two Samples 9- Overview 9- Ifereces about Two Proportios 9- Ifereces about Two Meas: Idepedet Samples 9-4 Ifereces about Matched Pairs 9-5 Comparig Variatio i Two Samples

More information

Mathematical Notation Math Introduction to Applied Statistics

Mathematical Notation Math Introduction to Applied Statistics Mathematical Notatio Math 113 - Itroductio to Applied Statistics Name : Use Word or WordPerfect to recreate the followig documets. Each article is worth 10 poits ad ca be prited ad give to the istructor

More information

Some examples of vector spaces

Some examples of vector spaces Roberto s Notes o Liear Algebra Chapter 11: Vector spaces Sectio 2 Some examples of vector spaces What you eed to kow already: The te axioms eeded to idetify a vector space. What you ca lear here: Some

More information

Correlation Regression

Correlation Regression Correlatio Regressio While correlatio methods measure the stregth of a liear relatioship betwee two variables, we might wish to go a little further: How much does oe variable chage for a give chage i aother

More information

STA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to:

STA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to: STA 2023 Module 10 Comparig Two Proportios Learig Objectives Upo completig this module, you should be able to: 1. Perform large-sample ifereces (hypothesis test ad cofidece itervals) to compare two populatio

More information

CHAPTER 8 FUNDAMENTAL SAMPLING DISTRIBUTIONS AND DATA DESCRIPTIONS. 8.1 Random Sampling. 8.2 Some Important Statistics

CHAPTER 8 FUNDAMENTAL SAMPLING DISTRIBUTIONS AND DATA DESCRIPTIONS. 8.1 Random Sampling. 8.2 Some Important Statistics CHAPTER 8 FUNDAMENTAL SAMPLING DISTRIBUTIONS AND DATA DESCRIPTIONS 8.1 Radom Samplig The basic idea of the statistical iferece is that we are allowed to draw ifereces or coclusios about a populatio based

More information

Lesson 10: Limits and Continuity

Lesson 10: Limits and Continuity www.scimsacademy.com Lesso 10: Limits ad Cotiuity SCIMS Academy 1 Limit of a fuctio The cocept of limit of a fuctio is cetral to all other cocepts i calculus (like cotiuity, derivative, defiite itegrals

More information

Riemann Sums y = f (x)

Riemann Sums y = f (x) Riema Sums Recall that we have previously discussed the area problem I its simplest form we ca state it this way: The Area Problem Let f be a cotiuous, o-egative fuctio o the closed iterval [a, b] Fid

More information

Topic 10: Introduction to Estimation

Topic 10: Introduction to Estimation Topic 0: Itroductio to Estimatio Jue, 0 Itroductio I the simplest possible terms, the goal of estimatio theory is to aswer the questio: What is that umber? What is the legth, the reactio rate, the fractio

More information

The Sample Variance Formula: A Detailed Study of an Old Controversy

The Sample Variance Formula: A Detailed Study of an Old Controversy The Sample Variace Formula: A Detailed Study of a Old Cotroversy Ky M. Vu PhD. AuLac Techologies Ic. c 00 Email: kymvu@aulactechologies.com Abstract The two biased ad ubiased formulae for the sample variace

More information

Computing Confidence Intervals for Sample Data

Computing Confidence Intervals for Sample Data Computig Cofidece Itervals for Sample Data Topics Use of Statistics Sources of errors Accuracy, precisio, resolutio A mathematical model of errors Cofidece itervals For meas For variaces For proportios

More information

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n. Jauary 1, 2019 Resamplig Methods Motivatio We have so may estimators with the property θ θ d N 0, σ 2 We ca also write θ a N θ, σ 2 /, where a meas approximately distributed as Oce we have a cosistet estimator

More information

Summary: CORRELATION & LINEAR REGRESSION. GC. Students are advised to refer to lecture notes for the GC operations to obtain scatter diagram.

Summary: CORRELATION & LINEAR REGRESSION. GC. Students are advised to refer to lecture notes for the GC operations to obtain scatter diagram. Key Cocepts: 1) Sketchig of scatter diagram The scatter diagram of bivariate (i.e. cotaiig two variables) data ca be easily obtaied usig GC. Studets are advised to refer to lecture otes for the GC operatios

More information

ENGI 4421 Confidence Intervals (Two Samples) Page 12-01

ENGI 4421 Confidence Intervals (Two Samples) Page 12-01 ENGI 44 Cofidece Itervals (Two Samples) Page -0 Two Sample Cofidece Iterval for a Differece i Populatio Meas [Navidi sectios 5.4-5.7; Devore chapter 9] From the cetral limit theorem, we kow that, for sufficietly

More information

STAT 350 Handout 19 Sampling Distribution, Central Limit Theorem (6.6)

STAT 350 Handout 19 Sampling Distribution, Central Limit Theorem (6.6) STAT 350 Hadout 9 Samplig Distributio, Cetral Limit Theorem (6.6) A radom sample is a sequece of radom variables X, X 2,, X that are idepedet ad idetically distributed. o This property is ofte abbreviated

More information

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering CEE 5 Autum 005 Ucertaity Cocepts for Geotechical Egieerig Basic Termiology Set A set is a collectio of (mutually exclusive) objects or evets. The sample space is the (collectively exhaustive) collectio

More information

multiplies all measures of center and the standard deviation and range by k, while the variance is multiplied by k 2.

multiplies all measures of center and the standard deviation and range by k, while the variance is multiplied by k 2. Lesso 3- Lesso 3- Scale Chages of Data Vocabulary scale chage of a data set scale factor scale image BIG IDEA Multiplyig every umber i a data set by k multiplies all measures of ceter ad the stadard deviatio

More information

A statistical method to determine sample size to estimate characteristic value of soil parameters

A statistical method to determine sample size to estimate characteristic value of soil parameters A statistical method to determie sample size to estimate characteristic value of soil parameters Y. Hojo, B. Setiawa 2 ad M. Suzuki 3 Abstract Sample size is a importat factor to be cosidered i determiig

More information

Module 1 Fundamentals in statistics

Module 1 Fundamentals in statistics Normal Distributio Repeated observatios that differ because of experimetal error ofte vary about some cetral value i a roughly symmetrical distributio i which small deviatios occur much more frequetly

More information

A sequence of numbers is a function whose domain is the positive integers. We can see that the sequence

A sequence of numbers is a function whose domain is the positive integers. We can see that the sequence Sequeces A sequece of umbers is a fuctio whose domai is the positive itegers. We ca see that the sequece,, 2, 2, 3, 3,... is a fuctio from the positive itegers whe we write the first sequece elemet as

More information

Polynomial Functions and Their Graphs

Polynomial Functions and Their Graphs Polyomial Fuctios ad Their Graphs I this sectio we begi the study of fuctios defied by polyomial expressios. Polyomial ad ratioal fuctios are the most commo fuctios used to model data, ad are used extesively

More information

AP Statistics Review Ch. 8

AP Statistics Review Ch. 8 AP Statistics Review Ch. 8 Name 1. Each figure below displays the samplig distributio of a statistic used to estimate a parameter. The true value of the populatio parameter is marked o each samplig distributio.

More information

Exponents. Learning Objectives. Pre-Activity

Exponents. Learning Objectives. Pre-Activity Sectio. Pre-Activity Preparatio Epoets A Chai Letter Chai letters are geerated every day. If you sed a chai letter to three frieds ad they each sed it o to three frieds, who each sed it o to three frieds,

More information

WHAT IS THE PROBABILITY FUNCTION FOR LARGE TSUNAMI WAVES? ABSTRACT

WHAT IS THE PROBABILITY FUNCTION FOR LARGE TSUNAMI WAVES? ABSTRACT WHAT IS THE PROBABILITY FUNCTION FOR LARGE TSUNAMI WAVES? Harold G. Loomis Hoolulu, HI ABSTRACT Most coastal locatios have few if ay records of tsuami wave heights obtaied over various time periods. Still

More information