106 Stat 1434 / 1435 H. Chapter 1: Organizing and Displaying Data

Size: px
Start display at page:

Download "106 Stat 1434 / 1435 H. Chapter 1: Organizing and Displaying Data"

Transcription

1 106 Stat Refereces -Biostatistics : A foudatio i Aalysis i the Health Sciece -By : Waye W. Daiel -Elemetary Biostatistics with Applicatios from Saudi Arabia By : Nacy Hasabelaby 1434 / 1435 H Chapter 1: Orgaizig ad Displayig Data 1.1: Itroductio (مصطلحات) Here we will cosider some basic defiitios ad termiologies Statistics: Is the area of study that is iterested i how to orgaize ad summarize iformatio ad aswer research questios. Biostatistics: Is a brach of statistics that iterested i iformatio obtaied from biological ad medical scieces. Populatio: Is the largest group of people or thigs i which we are iterested i a particular time ad about which we wat to make some statemet or coclusios. Sample: A part of the populatio o which we collect data. The umber of the elemet i the sample is called the sample size ad deoted by. Variable: the characteristic to be measured o the elemets of populatio or sample. 1

2 Types of variables Qualitative: If the values of the variables are word idicatig to which category a elemet of the populatio belogs. Quatitative; if the value of the variable are umbers idicatig how much or how may of somethig Nomial: the value of the variables are ames oly Examples: *Geder: Female or male. * Eye colour: Black, brow, gree, etc Ordial: variables ca be ordered. Examples: Educatioal level: elemetary,itermediate, high school. Blood pressure: Low, medium, high Discrete: Ca have coutable umbers of values ( there are gaps betwee the values) Examples: *Number of patiets admitted to a hospital i oe day (x=1,, ) * Number of pai killer tablets (x= 0.5,1,1.5,,.5, ) Note: Discrete values ca take either iteger values or decimal values with gaps betwee the values. Cotiuous: Ca have ay value withi a certai iterval of values. it is usually measured o some scale i terms of some measuremet uits like kilograms, meters etc Examples: *Level of chemical i drikig water *height (140<x<190) *blood sugar level of a perso. 3 Example 1 Suppose we measure the amout of milk that a child driks i a day (i ml) for a sample of 5 two-years childre i Saudi Arabia. The populatio: all two years childre i Saudi Arabia The variable: the amout of milk that a child drik i a day (i ml) The variable is quatitative, cotiuous. The sample size is 5. Example Suppose we measure weather or ot a child has a hearig loss for a sample of 0 youg childre with a history of repeated ear ifectios. The populatio: all youg childre with a history of repeated ear ifectio. The variable: whether or ot a child has a hearig loss The variable is qualitative, omial. Sice the values are either yes or o. The sample size is 0 4

3 Example 3 Suppose we measure the temperature for a sample of 5 aimals havig a certai disease. The populatio The variable The type of the variable The sample size Orgaizig the Data Suppose we collect a sample of size from a populatio of iterest. A first step i orgaizig is to order the data from smallest to largest (if it is ot omial). A further step is to cout how may umbers are the same (if ay). The last step is to orgaize it ito a table called frequecy table (or frequecy distributio). The frequecy distributio has two kids 1) Simple (ugrouped) frequecy distributio: for ) Grouped frequecy distributio: for Qualitative variables Discrete quatitative with small umber of differet variables Cotiuous quatitative variables Discrete quatitative with large umber of differet variables. 6 3

4 Frequecy of wome Example 1..1: (simple frequecy distributio) Suppose we are iterested i the umber of childre that a Saudi woma has ad we take a sample of 16 wome ad obtai the followig data o the umber of childre 3, 5,, 4, 0, 1, 3, 5,, 3,, 3, 3,, 4, 1 Q1: What is the variable? The populatio? ad the sample size?. What are the differet values of the variable? -the differet values are: 0,1,,3,4,5 Q: Obtai a simple frequecy distributio (table)? If we order the data we obtaied 0, 1,1,,,,, 3, 3, 3, 3, 3, 4, 4, 5, 5 To obtai a simple frequecy distributio (table) we have to kow the followig cocepts The frequecy: is obtaied by coutig how ofte each umber i the data set. The sample size (): is the sum of the frequecies. Relative frequecy= frequecy/ Percetage frequecy= Relative frequecy*100= (frequecy/)* Simple frequecy table for the umber of childre. Number of childre (variable) frequecy of wome (frequecy) Relative frequecy Percetage frequecy Total = The simple frequecy distributio has the frequecy bar chart as graphical represetatio Frequecy bar chart of the umber of childre Number of childre Exercise: for more exercises ad details about graphs m/chapter/graphig_q ualitative.html 8 4

5 Example 1.. :grouped frequecy distributio The followig table gives the hemoglobi level (i g/dl) of a sample of 45 apparetly healthy me aged 0-4. Fid the grouped frequecy distributio for the (ظاهريا) data What is the variable? The sample size? The max= The mi=13.5 -The rage=max-mi= =4.8 Notes 1. I example 1.. to group the data we use a set of itervals, called class itervals.. The width (w) is the distace from the lower or upper limit of oe class iterval to the same limit of the ext class iterval. 3. Let we deote the lower limit ad upper limit of the class iterval by L ad U, that is the first class is L1-U1, the secod class is L-U 4. To fid the class itervals we use the followig relatioship +w L1 U1 +w L U +w L3 ad so o U3 +w 9 6. Cumulative frequecy: is the umber of values obtaied i the class iterval or before, which fid by addig successfully the frequecies. 7. Cumulative relative frequecy: is the proportio of values obtaied i the class iterval or before, which fid by addig successfully the relative frequecies. 8. The Grouped frequecy distributio for Example 1.. is Class Iterval Frequecy Relative frequecy Cumulative frequecy Cumulative relative frequecy Total = True classes ad displayig grouped frequecy distributios ( To Fid the true class itervals we have two ways: 1) Subtract from the lower limit ad add to the upper limit oe- half of the smallest uit. ) Decrease the last decimal place of the lower limit by 1 ad put 5 after it, ad for the upper limit we simply put 5 after the limit. 10 5

6 True class True class To illustrate this let us fid the true classes of example 1.. s.u=0.1 Class Iterval True class iterval Mid poits Frequecy < < < < < < Total =45 Notes: - Each upper limit of the true class iterval eds with the same lower limit of the previous true class itervals - The lower ad upper limit of the true class iterval must always ed i 5, ad they must always have oe more decimal place tha class limit. - The mid poit =(upper limit + lower limit)/. - To fid the midpoit of the iterval we simply add the width to the previous midpoit Displayig grouped frequecy distributios Grouped frequecy distributios ca be displayed by Histogram Polygo curves For frequecy or relative frequecy distributios Histogram

7 Frequecy Polygo Hemoglobi Level Exercises: 1.R.1 (a-c-d-e), 1.R. (a-c-d-e), 1.R.5 pg 5 13 Exammple 1.4: I the study, the blood glucose level (i mg/100 ml) was measured for a sample from all apparetly healthy adult males. a) Idetify variable ad the populatio i the study. b) From the table, fid Class iterval (glucose level ) Frequecy Relative frequecy Cumulative frequecy Total ) w= ) = 3) The umber of healthy males with glucose level mg/100 ml 4) The percetage of healthy males with glucose level less tha mg/100 ml 5) The umber of healthy males with glucose level less tha 99 mg/100 ml 6) The umber of healthy males with glucose level greater tha

8 Chapter : Basic Summary Statistics.1: Itroductio This chapter cocers maily about describig the middle of the observatios ad how spread out they are. Measures of cetral tedecy Measures of dispersio Measures which are i some sese idicate where the middle or cetre of the data is. (e.g.mea, media ad mode) Measures which idicate how spread out the observatio from each other. (e.g. Rage, variace, stadard deviatio ad coefficiet of variatio) 15 Populatio The populatio values of the variable of iterest: X1, X,, XN (usually they are ukow). N=The populatio size Sample the sample values of the variable : x1, x,, x = the sample size. Ay measure obtaied from the populatio values of the variable of iterest is called a parameter Ay measure obtaied from the sample values of the variable of iterest is called a statistics 16 8

9 .: Measures of cetral tedecy We use the term cetral tedecy to refer to the atural fact that the values of the variable ofte ted to be more cocetrated about the cetre of the data. We will cosider three such measures: the mea, the media ad the mode. Mea: (or average) ukow Populatio mea: let X1,X,, XN be the populatio values of the variable (usually ukow), the the populatio mea is X X X X 1... N i Parameter Sample mea :let x1, x,, x be the sample values of the variable, the the sample mea is x x 1 x... x i Kow from the x estimator of a sample populatio The sample mea is a estimator of a populatio mea. mea Statistic Questio: which oe is a parameter ad which oe is a statistic? N N Example.1: Cosider a populatio cosistig of the 5 urses who work i a particular cliic, ad we are iterested i the age of these urses i years X1=30, X=, X3=35, X4=7, X5=41 The average urse populatio is years Media (or med) The media is the middle value of the ordered observatio To fid the media of a sample of observatio, we first order the data, the 1) If is odd, the middle observatio is the order (+1)/. ) If is eve, the middle two observatios are the / ad the ext observatio, the media is the average of them. Example..1: Fid the media of the followig samples a) 9, 30, 3, 31, 8, 9, 30, 4, 40, 40, 40. First we order the data 8, 9, 9, 30, 30, 31, 3, 40, 40, 40, 4 = 11, odd, the order of the media is (+1)/=(11+1)/=6 th 8, 9, 9, 30, 30, 31, 3, 40, 40, 40, 4 med=31 (uit) 6 th b) 1.5, 3.0, 18.5, 4.0, 1.0, 4.5, 6.0, 9.5, 10.5, 15.0, 11.0, 11.5 =1, eve, /=6 th, hece we take the average of the 6 th ad the 7 th value The ordered sample is 1.5, 3.0, 4.5, 6.0, 9.5, 10.5, 11, 11.5, 1.0, 15.0, 18.5, 4.0 med=( )/=10.75 (uit) 6 th 7 th 18 9

10 Mode (or modal) The mode of set of values is that value which occurs with highest frequecy. Ay data must has oe of the three cases No mode: example: Data(1): 1, 15,,19, 14, 18 Data(): 3, 3, 5,5, 4, 4, 6, 6 Oe mode, example :Data (1): 3, 15, 3, 17,, 3, 19, 0,,. The mode= (uit) Data(): 13.5, 1, 13.5, 15, 15, 14.6, 17, 1, 15 The mode=15 (uit) More tha oe mode: example 18, 0, 19, 19, 1, 17, 0 modes: 19, 0 (uit) Notes: Mea ad media ca oly be foud for quatitative variables, the mode ca be foud for quatitative ad qualitative variables. There is oly oe mea ad oe media for ay data set. The mea ca be distorted by extreme values so much. measures that ot affected so much by extreme values are the media ad the mode. Aimated example o the web: 19 Example.. The followig table shows the computer results of the coutry of maufacturig of 50 coditioer devices Valid Total America Europea Japaese Total Frequecy Percet Valid Percet Cumulative Percet From this table: A)What is the variable, what is the type of the variable? B) The umber of Japaese-made devices is (a) (b) 16 (c) 6 (d) 50 (e)11 C) The percetage of America-made devices is (a) % (b) 31% (c) 6% (d) 50% (e)100% D) The mode of coutry-made devices is (a) Japaese (b) Europea (c) America(d) America ad Japaese (e)no mode E) If we wat to represet this type of data, we will use (a) Histogram (b) Lie chart (c) Pie chart (d) Bar chart (e) we ca't 10

11 Example..3 A sample of 80 families have bee asked about the umber of times to travel abroad. The computer results of the SPSS are give below Frequecy Percet Valid Percet Cumulative Percet Valid Total Total 80 From above table: A) The variable is (a) Number of families (b) Number of times to Travel abroad (c) Noe of these B) The type of the variable is (a) Quatitative Discrete (b) Qualitative (c) Quatitative Cotiuous (d) Normal (e) Biomial (f) Noe of these C) Number of families that travelled abroad 7 times is (a) 7.5 (b) 0 (c) 6 (d) 1 (e) 53.8 (f)10 D) The percetage of families that travelled abroad less tha or equal to 10 times is (a) 73.8% (b) 0 (c)100% (d) 88.8% (e) 5% (f) 95% E) The mode of travel times is (a) 13.8 (b) 0 (c) 6 (d) 11 (e) 80 (f)13 11

12 Frequacy Example..4 By usig the computer results of SPSS the plot of the umber of courses i Eglish that studet takes i a year is obtaied: ) The type of the graph is: a) Bar chart (b) polygo (c) histogram (d) lie (e) curve )The Variable is: a) Number of studets (b) umber of courses (c) Eglish (d) Arabic 3) The total umber of studets who study i Eglish is: a) 0 (b) 5 (c) 1 (d) 6 (e) 3 4) The umber of studets who study two courses i Eglish is: a) 0 (b) (c) 7 (d) 8 (e) 5 5)The umber of studets who study at least two courses i Eglish is: a)7 (b) 8 (c) 15 (d) 18 (e) 5 6)The percet of studets who study at most oe course i Eglish is: a)7% (b) 18% (c) 8% (d) 60% (e) 7% 7) The sample mea is a).55 (b) 55 (c) 3 (d) (e) 40 8)The sample mode is a)0 (b) 3 (c) 10 (d) (e) o mode 1

13 .3: Measure of dispersio The variatio or dispersio i a set of observatios refers to how spread out the observatios are from each other. -Whe the variatio is small, this meas that the observatios are close to each other (but ot the same). - Ca you metio a case whe there is o variatio? Larger variatio Same mea Smaller variatio Smaller variatio Larger variatio We will cosider four measures of dispersio: the rage, the variace, the stadard deviatio ad the coefficiet of variatio. 5 Rage (R): Is the differece betwee the largest ad smallest values i the set of values Example.3 (q.6- pg 35): Below are the birth weights (i kg) for a sample of babies bor i Saudi Arabia: 1.69, 1.79, 3.3, 3.6,.71,.4,.59, 1.05, 3.19, 3.40, 3.3, 3.37, 3.6, Fid the mea, mod ad media. - R= =.58. Note: The rage is easy to calculate but it is ot useful as a measure of variatio sice it oly takes ito accout two of the values. Variace: Is a measure which uses the mea as poit of referece. Populatio variace: let X1,X,, XN be the populatio values of the variable (usually ukow), the the populatio variace is populatio mea. N ( X i i ) 1 N where µ is the Sample Variace :let x1, x,, x be the sample values of the variable, the the sample variace is ( x x) i1 i s where x is the sample mea

14 Notes: The variace is less whe all the values are close to the mea, while it is more whe all the values are spread out of the mea. xi x x x x1 ( x x) ( x i x) ( x x ) ( x x 1 ) 0 The variace is always a oegative value (, s 0). Populatio variace is usually ukow (parameter), hece it is estimated by the sample variace s (statistic). ( ) 1 A simpler formula to use for calculatig sample variace is x x i i s 1 The variace is expressed i squared uit. 7 Stadard deviatio (std. dev.): The stadard deviatio is defied to be the root of the variace. Populatio stadard deviatio Sample stadard deviatio N ( ) X i ( i ) 1 1 x x i i s s 1 N 8 14

15 Coefficiet of variatio (CV): - The variace ad stadard deviatio are useful as measures of variatio of the values of sigle variable for a sigle populatio. - If we wat to compare the variatio i two data set the variace ad stadard deviatios may give misleadig results because: - The two variable may have differet uits as kilogram ad cetimeters which caot be compared. - Although the same uits are used, the mea of the two may be quit differet i size. - The coefficiet of variatio (CV) is used to compare the relative variatio i two data set ad it dose ot deped o either the uit or how large the values are, the formula of CV is give by s CV 100(%) x - Suppose we have two data set as the followig ad we wat to compare the variatio mea Std.dev. CV Set 1 x 1 s 1 Set x s s CV x CV x s 100(%) 100(%) 9 The we say that the variability i the first data set is larger tha the variability i the secod data set if CV 1 > CV (ad vice versa). Example.5 Suppose two sets of samples of huma males of differet ages give the followig results weight set1: o males aged 9: x 1 =66kg s 1 =4.5kg CV 1 =(4.5/66) 100%=6.8% set: o males aged 10: x =36kg s 1 =4.5kg CV =(4.5/36) 100%=1.5% Sice CV > CV 1, the variability i the weight of the d set (10-years old) is greater tha the variability i the 1 st data set (9-years old). Examples: pg A site that explais the cocepts i Arabic A site that explais how to use SPSS for descriptive statistics /spss/descript1.htm 30 15

16 frequecy Example.4.1 For a sample of patiets, we obtai the followig graph for approximated hours spet without pai after a certai surgery Hours 1) The type of the graph is: a) Bar chart (b) polygo (c) histogram (d) lie (e) curve ) The umber of patiets stayed the logest time without pai is: a) 10 (b) 15 (c) 6 (d) 5 (e) 80 3)The percet of patiets spet 3.5 hours or more without pai is: a)37.5% (b) 68.75% (c) 18.75% (d) 50% (e) 5% 4)The lowest umber of hours spet without pai is: a)10 (b) 1 (c) 0.5 (d) 5 (e) 5 (f) 6.5 5)What the approximate value of the sample mea a).55 (b) 55 (c) 3 (d) (e) 40 (f) we ca't fid it 6)The sample mode equals a)80 (b) 3 (c) 15 (d),4 (e) 6 (f) we ca't fid it 16

17 The SPSS computer results of the age of patiets i oe of the Riyadh hospitals are give below Fid : a) Variable ame a) The type of the variable b) The mode c) The mea age of the patiets d) The media age of the patiets e) The variace f) Sample size g) The coefficiet of variatio 33.4: Calculatig measures from a ugrouped frequecy tables: Suppose we have the followig frequecy table, where m i is the i th value i the Value (or midpoit) m 1 m m k frequecy f1 f f k f i = ugrouped frequecy table or the midpoit i the grouped frequecy table, ad f i is the i th frequecy. The formulas for sample mea ad variace will be modified as follows: = f i (the sample size= the sum of frequecies) k=umber of distict values (or umber of itervals) k x i i m i i f 1 1 i, k i1 x i i1 mi fi x i 1 x i k m i i 1 x f i x i 1 i ( x) s 1 k i 1 i ( ) m fi x s 1 For usig calculator to fid the mea, variace ad stadard deviatio, you ca visit the site

18 Notes: Whe data are grouped we caot determie from the frequecy distributio what the actual data values are but oly how may of them are i the class iterval. We ca t fid the actual values for the sample mea ad sample variace but we ca fid approximatio of them. For grouped data we assume that all values i particular class iterval are located at the midpoit of the iterval (m i ) because the mid poit is best represetative for whole iterval 35 Example.6: Suppose that i a study o drug cosumptio by pregat Saudi wome, the umber of differet drugs takig durig pregacy was determied for a sample of Saudi wome who took at least oe medicatio obtaiig: Value m i Fid the measure of cetral tedecy ad dispersio. Solutio: =30 x - =83/30= drugs - To fid the media: sice =30 is eve, the order of the two middle values is /=15 th ad 16 th, from the cumulative frequecy the 16 th ad 15 th ordered observatio is, ad hece - Med=(+)/= drugs Frequecy f Cumulative frequecy m i f m i f Total =

19 - The mode is sice it has the highest frequecy. The variace - The rage : R=7-1=6 ( ) 95 (30)(.7666) mi fi x s The stadard deviatio s=.5 =1.5 - The coefficiet of variatio =(1.5/.8) 100=53.6 % ============================================================= Example.7: The followig are the ages of a sample of 100 wome havig childre who were admitted to a particular hospital i Madiah i particular moth. Class Iterval Mid poits Frequecy Total =100 Fid the mea, the variace, ad the coefficiet of variatio. Note: we did t put ay uit here sice the variable is discrete, the word (drug) is just a idicator of what we are coutig 37 Chapter 3: Some Basic Probability Cocepts 3.1 Geeral view of probability Probability: The probability of some evet is the likelihood (chace) that this evet will occur. A experimet: Is a descriptio of some procedure that we do. The uiversal set (Ω): Is the set of all possible outcomes, A evet: Is a set of outcomes i Ω which all have some specified characteristic. Notes: 1. Ω (the uiversal set) is called sure evet. (the empty set) is called impossible evet 38 19

20 Example (3.1) Cosider a set of 6 balls umbered 1,, 3, 4, 5, ad 6. If we put the sex balls ito a bag ad without lookig at the balls, we choose oe ball from the bag, the this is a experimet which is has 6 outcomes. Ω ={1,, 3, 4, 5,6 } Cosider the followig evets E 1 =the evet that a eve umber occurs={, 4, 6}. E =the evet of gettig umber greater tha ={3,4, 5, 6}. E 3 =the evet that a odd umber occurs={1, 3, 5}. E 4 =the evet that a egative umber occurs={}=. 39 Equally likely outcomes: The outcomes of a experimet are equally likely if they have the same chace of occurrece. Probability of equally likely evets cosider a experimet which has N equally likely outcomes, ad let the umbers of outcomes i a evet E give by (E), the the probability of E is give by ( E) P( E) ( ) ( E) N Notes 1. For ay evet A, 0 P(A) 1 (why?) That is, probability is always betwee 0 ad 1.. P(Ω)=1, ad P( )=0 (why?) 1 meas the evet is a certaity, 0 meas the evet is impossible 40 0

21 Example (3.) I the ball experimet we have (Ω)=6, (E 1 )=3, (E )=4, (E )=3 P(E 1 )=3/6=0.5 P(E )=4/6=0.667 P(E 3 )=3/6=0.5 P(E 4 )=0 Repaper that E 1 =the evet that a eve umber occurs={, 4, 6}. E =the evet of gettig umber greater tha ={3,4, 5, 6}. E 3 =the evet that a odd umber occurs={1, 3, 5}. E 4 =the evet that a egative umber occurs={}=. 41 Relatioships betwee evets Uio : A B, cosists of all those outcomes i A or i B or i both A ad B B A A B Itersectio : A B, cosists of all those outcomes i both A ad B B A A B Complemet : A c (or A`) Cosists of all outcomes that are i Ω but ot i A A A c 4 1

22 Notes: 1- (A B)= (A)+(B)-(A B) ad hece P(A B)= P(A)+P(B)-P(A B) Ω B A. (A c )=(Ω)- (A) So that P(A c )=1- P(A) Ω A A c Sets (evets) ca be represeted by Ve Diagram Ω B A A c B AB AB c A c B c 43 Ω 44

23 Disjoit evets Two evets A ad B are said to be disjoit (mutually exclusive) if A B=. - I the case of disjoit evets P(A B)=0 P(A B)= P(A)+P(B) Ω B A 45 Example 3.3 From a populatio of 80 babies i a certai hospital i the last moth, let the eve B= is a boy, ad O= is over weight we have the followig icomplete Ve diagram. - It is a boy P(B) =(3+39)/80= It is a boy ad overweight P(B O)= 3/80= It is a boy or it is overweight P(B U O)= (39+3+7)/80=0.615 B O 46 3

24 Coditioal probability: the coditioal probability of A give B is equal to the probability of A B divided by the probability of B, providig the probability of B is ot zero. That is P(A B)=P(A B )/ P(B), P(B) 0 Notes: 1. P(A B) is the probability of the evet A if we kow that the evet B has occurred. P(B A)=P(A B )/ P(A), P(A) Example Referrig to example 3.3 what is the probability that - He is a boy kowig that he is over weight? P(B O)= P(B O )/ P(O)= (3/80) / (10/ 80) =3/10=0.3 - If we kow that she is a girl, what is the probability that she is ot overweight? P(O c B c )= P(B c O c ) / P(B c ) = (31/80) / [(7+31)/80] = 31/38= Idepedet evets -Two evets A ad B are said to be idepedet if the occurrece of oe of them has o effect o the occurrece of the other. Multiplicatio rule for idepedet evets -If A ad B are idepedet the 1-P(A B)=P(A) P(B) -P(A B)= P(A ) (Why?) 3- P(B A)= P(B ) (Why?) 48 4

25 Example 3.4 I a populatio of people with a certai disease, let M= Me ad S= suffer from swolle leg We have the followig icomplete Ve diagram If we radomly choose oe perso M Complete the Ve diagram Fid the probability that this perso Is a ma ad suffer from swolle leg? P(M S)= Is a wome? P(M c )= 3- Is a wome that does ot suffer from swolle leg? P(M c S c )= Does ot sufferig from swolle leg? P(S c )= = = 0.41 (or P(M c )=1-P(M)= 1-( )=0.41 ) S 49 Margial prbability: Defiitio: Give some variable that ca be broke dow ito m categories desigated by A 1, A,,A m ad aother joitly accurace variable that is broke dow ito categories desigated by B 1, B,,,B, the margial probability of A i, called P(A i ), is equal to the sum of the joit probabilities of A i with all categories of B. That is P(A i )= P(A i B j ), for all values of j. This will be clear i the followig example Example 3.5: The followig table shows 1000 ursig school applicats classified accordig to scores made o a college etrace examiatio ad the quality of the high school form which they graduated, as rated by the group of educators. 50 5

26 Score Score Poor (p) Quality of high school Average (A) Superior (S) Low (L) Medium (M) High (H) total total - Q1-How may margial probabilities ca be calculated from these data? State each probability otatio ad do calculatios. - 6 margial probabilities, P(L), P(M), P(H), P(p), P(A), P(S). - Q-Calculate the probability that a applicat picked at radom from this group: 1-Made a low score o the examiatio P(L)= 0/1000=0. - Graduated from superior high school. P(S)= 500/1000= Made a low score o the examiatio give that he or she graduated from Superior high school P(L S)= P(L S) / P(S) = (55/1000) / (500 /1000)= 55/500 = Made a high score or graduated from a superior high school. P( H S) = P(H) +P(S) P(H S)= ( ) / 1000 = Calculate the followig probabilities 1. P(A) = 300/1000=0.3. P(S) = 500/ 1000= P(M) = 390/1000= P(M P) = 70/ 1000= P(A L)= ( )/1000 = P(P S) = 0 7.P(L H) =(0+ 390)/ 1000 = P(H/S) = 300/ 500= 0.6 Quality of high school P A S total L M H total

27 Chapter 4: Probability Distributio 4.1 Probability Distributio of Discrete Radom Variables - Radom variable: is a variable that measured o populatio where each elemet must have a equal chace of beig selected. - let X be a discrete radom variable, ad suppose we are able to cout the umber of populatio where X=x, the the value of x together with the probability P(X=x) are called probability distributio of the discrete radom variable X. Example 4.1 Suppose we measure the umber of complete days that a patiet speds i the hospital after a particular type of operatio i Dammam hospital i oe year, obtaiig the followig results. 53 Number of days, x The probability of the evet { X=x } is the relative frequecy P(X=x)= ( X x) ( X x) ( S) That is: P(X=1)=5/50=0.1 P(X=)=/50=0.44 P(X=3)=15/50=0.3 P(X=4)=8/50= What is the value of P(X=x)? Frequecy N 50 N 54 7

28 The probability distributio must satisfy the coditios ` 1- - Number of days, x P(X=x) Sum 1 0 P( X x) 1 P( X x) 1 The first coditio must be satisfied sice P(X=x) is a probability, ad the secod coditio must be satisfied sice the evets {X=x} are mutually exclusive ad there uio is the sample space. 55 -Populatio mea for a discrete radom variable: If we kow the distributio fuctio P(X=x) for each possible value x of a discrete radom variable, the the populatio mea (or the expected value of the radom variable X ) is x P( X x) Example: The expected umber of complete days that a patiet speds i the hospital after a particular type of operatio i Dammam hospital i oe year (example 3.1) is x P( X x) =1(0.1)+(0.44)+3(0.3)+4(0.16)=.5 -Cumulative distributios : the cumulative distributio or the cumulative probability distributio of a radom variable is P( X x) It is obtaied i a way similar to fidig the cumulative relative frequecy distributio for samples. -referrig to example 3.1 P(X 1)=0.1 P(X )=P(X=1)+P(X=)= =

29 P(X 3)=P(X=1)+P(X=)+P(X=3)= =0.84 P(X 4)=P(X=1)+P(X=)+P(X=3)+ P(X=4)= =1 The cumulative probability distributio ca be displayed i the followig table Number of days x P(X=x) -From the table fid: 1-P(X<3)=P(X )=0.54 -P( X 4)=P(X=4)+P(X=3)+P(X=)=0.9 Or P( X 4)=P(X 4)-P(X<)=1-0.1=0.9 3-P(X>)= P(X=3)+P(X=4)=0.46 Or P(X>)= 1-P(X )=1-0.54=0.46 P(X x) Sum 1 57 I geeral we ca use the followig rules for iteger umber a ad b 1- P(X a) is a cumulative distributio probability -P(X < a)=p(x a-1) 3-P(X b)=1-p(x< b)=1-p(x b-1) 4-P(X>b)= 1-P(X b) 5- P(a X b)=p(x b) P(X<a)= P(X b) P(X a-1) 6- P(a<X b) = P(X b) - P(X a) 7-P(a X<b) = P(X b-1) - P(X a-1) 8- P(a<X<b)=P(X b-1)-p(x a) 58 9

30 4. Biomial Distributio The biomial distributio is a discrete distributio that is used to model the followig experimet 1-The experimet has a fiite umber of trials. - Each sigle trial has oly two possible (mutually exclusive )outcomes of iterest such as recovers or does t recover; lives or dies; eeds a operatio or does't eed a operatio. We will call havig certai characteristic success ad ot havig this characteristic failure. 3- The probability of a success is a costat π for each trial. The probability of a failure is 1- π. 4- The trials are idepedet; that is the outcome of oe trial has o effect o the outcome of ay other trial. The the discrete radom variable X=the umber of successes i trials has a Biomial(,π) distributio for which the probability distributio fuctio is give by P(X=x)= x ( 1 ) x 0 x x=0,1,,, otherwise 59 Where! Note x x! ( x)! If the discrete radom variable X has a biomial distributio, we write X ~ Bi(,π) The mea ad variace for the biomial distributio: - The mea for a Biomial(, π) radom variable is μ=σx P(X=x)= π The variace σ = π (1- π) Example 4. Suppose that the probability that Saudi ma has a high blood pressure is If we radomly select 6 Saudi me. a- Fid the probability distributio fuctio for the umber of me out of 6 with high blood pressure. b- Fid the probability that there are 4 me with high blood pressure? c-fid the probability that all the 6 me have high blood pressure? d-fid the probability that oe of the 6 me have high blood pressure? e- what is the probability that more tha two me will have high blood pressure? f-fid the expected umber of high blood pressure

31 Solutio: Let X= the umber of me out of 6 with high blood pressure. The X has a biomial distributio ( why?). Success= The ma has a high blood pressure Failure= The ma does t have a high blood pressure Probability of success= π=0.15 ad hece Probability of failure= Number of trials= =6 =6, π=0.15, 1- π=0.85 The X has a Biomial distributio, X~ Bi (6,0.15) a - the probability distributio fuctio is 6 P( X x) 0.15 x x 0,1,..., b- the probability that 4 me will have high blood pressure 6 4 P(X=4)= 0.15 (0.85) =(15)(0.15) 4 (0.85) = C- the probability that all the 6 me have high blood pressure (0.85) P(X=6)= π=0.85 x (0.85) 6x 61 d-the probability that oe of 6 me have high blood pressure is P(X=0)= 0.15 (0.85) e- the probability that more tha two me will have high blood pressure is P(X>)=1-P(X )=1-[P(X=0)+P(X=1)+P(X=)] =1-[ (0.85) (0.85) ] 1 =1-[ ] F- the expected umber of high blood pressure is 6(0.15) 0. 9 ad the variace is (1 ) 6(0.15)(0.85)

32 4.3 The Poisso Distributio The Poisso distributio is a discrete distributio that is used to model the radom variable X that represets the umber of occurreces of some radom evet i the iterval of time or space. The probability that X will occur ( the probability distributio fuctio ) is give by: e P( X x) x! 0 x, x 01,,,... otherwise λ is the average umber of occurreces of the radom variable i the iterval. The mea μ=λ The variace σ = λ If X has a Poisso distributio we write X~ Poisso (λ) Examples of Poisso distributio: - The umber of patiets i a waitig room i a hour. - The umber of serious ijuries الخطيرة) (االصابات i a particular factory i a year. - The umber of times a three year old child has a ear ifectio األذن) (عدوى i a year. 63 Example 4.3: Suppose we are iterested i the umber of sake bite ( األفعى (لدغة cases see i a particular Riyadh hospital i a year. Assume that the average umber of sake bite cases at the hospital i a year is What is the probability that i a radomly chose year, the umber of sake bites cases will be 7? - What is the probability that the umber of cases will be less tha i 6 moths? 3-What is the probability that the umber of cases will be 13 i year? 4- What is Expected umber of sake bites i a year? What is the variace of sake bites i a year? Solutio: X= umber of sake bite cases see at this hospital i a year. Ad the mea is 6 The X~ Poisso (6) First ote the followig The average umber of sake bite cases at the hospital i a year =λ =6 X~ Poisso (6) The average umber of sake bite cases at the hospital i 6 moths = = the average umber of sake bite cases at the hospital i (1/) year =(1/)λ =3 Y~ Poisso (3) The average umber of sake bite cases at the hospital i years = λ =1 V~ Poisso (1) 64 3

33 1- The probability that the umber of sake bites will be 7 i a year 6 x e 6 λ=6 P(X x), x 01,,,... x! 6 e 6 P(X 7 ) 7! The probability that the umber of cases will be less tha i 6 moths 3 y e 3 P( Y y), y 0,1,... λ*=3 y! P( Y ) P( Y 0) P( Y 1) e 3 e ! 1! 3- The probability that the umber of cases will be 13 i years 1 v e 1 P(V v) v! 1 13 λ ** =1 e 1 P(V 13) ! Remember If X~ Poisso (λ) x e P( X x) x! x 0,1,, the expected umber of sake bites i a year: 6 the variace of sake bites i a year: 6 λ= Probability Distributio of Cotiuous Radom Variable If X is a cotiuous radom variable, the there exist a fuctio f(x) called probability desity fuctio that has the followig properties: 1- The area uder the probability curve f(x) =1 f(x) area= f ( x) dx 1 x 66 33

34 - Probability of iterval evets are give by areas uder the probability curve f(x) f(x) f(x) a b x a x a x b P(a X b)= a f ( x) dx P(X a)= f ( x) dx 3- P(X=a)=0 (why?) 4-P(X a)=p(x>a) ad P(X a)=p(x<a) 7- P(X a)= P(X<a) is the cumulative probability 5- P(X a)= 1- P(X a) 6-P(a<X<b)=P(X<b)-P(X<a) a P(X<b) P(X<a) P(X a)= f ( x) dx a f(x) a b x The Normal Distributio: The ormal distributio is oe of the most importat cotiuous distributio i statistics. It has the followig characteristics 1- X takes values from - to. - The populatio mea is μ ad the populatio variace is σ, ad we write X~ N(μ, σ ). 3- The graph of the desity of a ormal distributio has a bell shaped curve, that is symmetric about μ f(x) - μ x 68 34

35 4- μ= mea=mode=media of the ormal distributio. 5-The locatio of the distributio depeds o μ (locatio parameter). The shape of the distributio depeds o σ (shape parameter). σ σ 1 - μ 1 μ - μ μ 1 < μ σ1> σ 69 Stadard ormal distributio: The stadard ormal distributio is a ormal distributio with mea µ=0 ad variace σ =1. σ =1-0 Result If X~ N(μ, σ ) the Z= X ~ N(0, 1). Notes - The probability A= P(Z z ) is the area to the left of z uder the stadard ormal curve. -There is a Table gives values of P(Z z ) for differet values of z

36 Calculatig probabilities from Normal (0,1) P(Z z ) From the table ( the area uder the curve to the left of z ) P(Z z ) σ =1-0 z z P(Z z ) =1- P(Z z ) From the table ( the area uder the curve to the right of z ) z - 0 σ =1 P(Z z ) P( z 1 Z z ) = P(Z z ) - P(Z z 1 ) From the table ( the area uder the curve betwee z 1 ad z ) P(z 1 Z z ) z 1-0 σ =1 z 71 Notes: P(Z 0 ) = P(Z 0 ) =0.5 (why?) P(Z =z )=0 for ay z. P(Z z )= P(Z < z ) ad P(Z z )= P(Z > z ) If z the P(Z z ) =0, ad if z 3.49 the P(Z z )=1. Example 4.1 : - P(Z 1.5 ) = P( Z.4)= P(Z.4 ) -P(Z < 1.33) = = = P(Z 0.98 )=1- P(Z 0.98 )= = Z : : 7 36

37 Example 4. : Suppose that the hemoglobi level for healthy adult males are approximately ormally distributed with mea 16 ad variace of Fid the probability that a radomly chose healthy adult male has hemoglobi level a) Less tha 14. b) Greater tha 15. C) Betwee 13 ad 15 Solutio Let X= the hemoglobi level for healthy adult male, the X~ N(μ=16, σ =0.81). a) Sice μ=16, σ =0.81, we have σ= P(X<14)= P(Z< 14 )= P(Z< 1416 )= P(Z< -. )= b) P(X>15)= P(Z > 15 )= P(Z> 1516 )= P(Z> )= 1- P(Z -1.11)= = = c) P(13<X<15)= P( 13 <Z< 15 )= P(Z< 1516 )- P(Z< 1316 ) = P(Z -1.11) P(Z -3.33) = = d) P(X=13)=0 73 Result(1) Let X 1, X,,X be a radom sample of size from N ( µ, σ ), the 1) ) x i i X 1 ~ N ( µ, σ /) x Z ~ N ( 0, 1). Cetral Limit Theorem Let X 1, X,,X be a radom sample of size from ay distributio with mea µ ad variace σ, ad if is large ( 30), the x Z / N ( 0,1). ( that is, Z has approximately stadard ormal distributio) 74 37

38 Result () If σ is ukow i the cetral limit theorem, the s ( the sample stadard deviatio ) ca be used istead of σ, that is x Z s / N ( 0,1). Where s i1 i x ( x) 1 75 Chapter 5: Statistical Iferece 5.1 Itroductio: There are two mai purposes i statistics -Orgaizig ad summarizig data (descriptive statistics). -Aswer research questios about populatio parameter (statistical iferece). There are two geeral areas of statistical iferece: Hypothesis testig: aswerig questios about populatio parameters. Estimatio: approximatig the actual values of populatio parameters. there are two kids of estimatio: opoit estimatio. oiterval estimatio ( cofidece iterval)

39 Here we will cosider two types of populatio parameters Populatio mea: µ ( for quatitative variable) µ=the average ( mea ) value for some qualitative variable. Populatio proportio o.of elemet i thepopulatiowith some charachtaristic Totalo.of elemet i thepopulatio Examples: -The mea life spa for some bacteria - The icome mea for some bacteria - The icome mea of govermet employee i Saudi Arabia. Examples: -The proportio of Saudi people who have some disease - The proportio of smokers i Riyadh. - The proportio of Childre i Saudi Arabia : Estimatio of Populatio Mea: μ 1) Poit Estimatio: A poit estimate is a sigle umber used to estimate the correspodig populatio parameter. x is a poit estimate of μ That is, the sample mea is a poit estimate of the populatio mea ) Iterval Estimatio (Cofidece Iterval:C.I) of μ Defiitio: (1-α)100% Cofidece Iterval: (1-α)100% Cofidece Iterval is a iterval of umbers (L,U), defied by lower L ad upper U limits that cotais the populatio parameter with probability (1-α)

40 1-α: the cofidece coefficiet. L: Lower limit of the cofidece iterval. U : upper limit of the cofidece iterval. A (1-α)100% CI for μ is If the distributio is ormal If the distributio is ot ormal If σ is kow If σ is ukow is large (>30) x z 1 is large (>30) If σ is kow If σ is ukow x z 1 s x z 1 x z 1 s 79 Note: The C.I x z 1 meas (L, U)= ( x z, z ) 1 x 1 s s - Similarly for s, (L, U)= ( x z, x z.) x z Iterpretatio of the CI: We are (1-α)100% cofidet that the (mea) of (variable) for the (populatio) is betwee L ad U. μ Example 5.1: Let Z~N(0, 1) Z??? 1 Here we have the probability ( the area) ad we wat to fid the exact value of z. hece we ca use the table of stadard ormal but i the opposite directio. a) α=0.05 α/= α/=

41 From the stadard ormal table b) α=0.1 α/= α/=0.95 Z Z Z : : : (الحماض الكيتوني السكري) Example 5.: O 13 patiet of diabetic ketoacidosis patiet i Saudi Arabia, the mea blood glucose level was 6. with a stadard deviatio of 3.3 mm0l/l. Fid the 90% cofidece iterval for the mea blood glucose level of such diabetic ketoacidosis patiet. Solutio: Variable: blood glucose level (i mmol/l) Populatio: Diabetic ketoacodosis patiet i Saudi Arabia. Parameter: μ (the average blood glucose level) =13, x 6. s=3.3 - σ ukow, =13>30 (large) the 90% CI for μ is give by x z 1 s 81 90% = (1- α)100% 1- α=0.9 α=0.1 α/= α/=0.95 Z s The 90% CI for μ is x z 1 Which is ca be writte as ( x z s, x z ( 6. (1645), 6. (1645) 13 ( 5.71, 6.69) s ) 3.3 ) 13 Iterpretatio: We are 90 % cofidet that the mea blood glucose level of diabetic ketoacidosis patiet i Saudi Arabia is betwee 5.71 ad

42 Exercises Q1: Suppose that we are iterested i makig some statistical ifereces about the mea µ of ormal populatio with stadard deviatio 0.. Suppose that a radom sample of size = 49 from this populatio gave the sample mea4.5 The distributio of is (a) N(0,1) (b) t(48) (c)n(µ,(0.0857) ) (d)n(µ,.0) A good poit estimate for µ is (a) 4.5 (b) (c).5 (d) 7 (e) 1.15 Assumptios is (a) Normal, σ kow (b) Normal, σ ukow (c)ot Normal, σ kow (d) ot Normal, σ ukow (4)A 95% cofidet iterval for µ is (a) (3.44,5.56) (b) (3.34,5.66) (c) (4.444, 4.556) (d) (3.94,5.05) (e) (3.04,5.96) 83 Q:A electroics compay wated to estimate i mothly operatig expeses riyals (µ). Assume that the populatio variace equals Suppose that a radom sample of size 49 is take ad foud that the sample mea equals Fid Poit estimate for µ The distributio of the sample mea is The assumptios? A 90% cofidet iterval for µ. Q3:The radom variable X, represetig the lifespa of a certai light bulb is distributed ormally with mea of 400 hours,ad stadard deviatio of 10 hours. -What is the probability that a particular light bulb will last for more tha 380 hours? -What is the probability that a particular light bulb will last for exactly 399 hours? -What is the probability that a particular light bulb will last for betwee 380 ad 40 hours? The mea is.. The variace is.. The stadard deviatio 84 4

43 Q4: The tesile of a certai type of thread is approximately ormally distributed with stadard deviatio of 6.8 Kg. A sample of 0 pieces of the thread has a average stregth of 7.8 Kg. The A poit estimate of the populatio mea of tesile stregth µ is (a)7.8 (b) 0 (c) 6.8 (d) 46.4 (e) oe of these A 98% Cofidet iterval for mea of tesile stregth µ,the lower boud equal to : (a)68.45 (b) 69.6 (c) (d) (e) oe of these A 98% Cofidet iterval for mea of tesile stregth µ,the upper boud equal to : (a)74.16 (b) (c) 75.9 (d) (e) oe of these : Estimatio of Populatio Proportio π Recall that, the populatio proportio π= o.of elemet i thepopulatiowith some charachtaristic Totalo.of elemet i thepopulatio N To estimate the populatio proportio we take a sample of size from the populatio ad fid the sample proportio p o.of elemet i thesample with somecharachtristic p Totalo.of elemet i thesample Result: whe both π>5 ad (1- π)>5 the p p N (, (1 ) / ). Z N ( 0,1). ad hece (1 ) / 86 43

44 Estimatio for π 1) Poit Estimatio: A poit estimator of π ( populatio proportio) is p (sample proportio) 1) Iterval Estimatio: If p>5 ad (1-p)>5, The (1-α)100% Cofidece Iterval for π is give by p z 1 p(1 p) p(1 p) Note:1) p z ca be writte as 1 p(1 p) p(1 p) p z, p z 1 1 ) p= the umber i the sample with the characteristic (1-p)= the umber i the sample wich did ot have the characteristic. 87 Example 5. I the study o the fear (خوف) of detal care i Riyadh, % of 347 adults said they would hesitate (تردد) to take a detal appoitmet due to fear. Fid the poit estimate ad the 95% cofidece iterval for proportio of adults i Riyadh who hesitate to take detal appoitmets..solutio: Variable: whether or ot the perso would hesitate to take a detal appoitmet out of fear. Populatio: adults i Riyadh. Parameter: π, the proportio who would hesitate to take a appoitmet. = 347, p= %=0., p=(347)(0.)=76.34 >5 ad (1-p)=(437)(0.78)=70.66>5 1- poit estimatio of π is p=0. p z p(1 p) - 95% CI for π is 1 1-α=0.95 α=0.05 α/= α/=0.975 Z1 / Z

45 The 95 % CI for π is p z 1 0. (1.96) p(1 p), p z 1 0.(0.78),0. (1.96) 347 p(1 p) 0.(0.78) (1.96)(0.0379),0. (1.96)(0.0379) 0.176,0.64 Iterpretatio: we are 95% cofidet that the true proportio of adult i Riyadh who hesitate to take a detal appoitmet is betwee ad Exercises Q1: A radom sample of 00 studets from a certai school showed that 15 studets smoke. let π be the proportio of smokers i the school. Fid a poit estimate for π Fid 95% cofidece iterval for π Q. A researcher was iterested i makig some statistical ifereces about the proportio of females (π) amog the studets of a certai uiversity. A radom sample of 500 studets showed that 150 studets are female. 1. A good poit estimate forπis (A) 0.31 (B) 0.30 (C) 0.9 (D) 0.5 (E) The lower limit of a 90% cofidece iterval for πis (A) (B) (C) (D) (E) The upper limit of a 90% cofidece iterval for πis (A) (B) (C) (D) (E)

46 Q3. I a radom sample of 500 homes i a certai city, it is foud that 114 are heated by oil. Let π be the proportio of homes i this city that are heated by oil. 1. Fid a poit estimate for π.. Costruct a 98% cofidece iterval for π. Q4. I a study ivolved 100 car drivers, it was foud that 50 car drivers do ot use seat belt. A poit estimate for the proportio of car drivers who do ot use seat belt is: (A) 50 (B) (C) (D) 1150 (E) Noe of these The lower limit of a 95% cofidece iterval of the proportio of car drivers ot usig seat belt is (A) 0.03 (B) (C) (D) (E) Noe of these The upper limit of a 95% cofidece iterval of the proportio of car drivers ot usig seat belt is (A) (B) (C) (D) (E) Noe of these Q5. A study was coducted to make some ifereces about the proportio of female employees (π) i a certai hospital. A radom sample gave the followig data: Calculate a poit estimate (p) for the proportio of female employees (π). Costruct a 90% cofidece iterval for p. Sample size 50 Number of females

Continuous Data that can take on any real number (time/length) based on sample data. Categorical data can only be named or categorised

Continuous Data that can take on any real number (time/length) based on sample data. Categorical data can only be named or categorised Questio 1. (Topics 1-3) A populatio cosists of all the members of a group about which you wat to draw a coclusio (Greek letters (μ, σ, Ν) are used) A sample is the portio of the populatio selected for

More information

Data Description. Measure of Central Tendency. Data Description. Chapter x i

Data Description. Measure of Central Tendency. Data Description. Chapter x i Data Descriptio Describe Distributio with Numbers Example: Birth weights (i lb) of 5 babies bor from two groups of wome uder differet care programs. Group : 7, 6, 8, 7, 7 Group : 3, 4, 8, 9, Chapter 3

More information

BIOS 4110: Introduction to Biostatistics. Breheny. Lab #9

BIOS 4110: Introduction to Biostatistics. Breheny. Lab #9 BIOS 4110: Itroductio to Biostatistics Brehey Lab #9 The Cetral Limit Theorem is very importat i the realm of statistics, ad today's lab will explore the applicatio of it i both categorical ad cotiuous

More information

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering CEE 5 Autum 005 Ucertaity Cocepts for Geotechical Egieerig Basic Termiology Set A set is a collectio of (mutually exclusive) objects or evets. The sample space is the (collectively exhaustive) collectio

More information

Random Variables, Sampling and Estimation

Random Variables, Sampling and Estimation Chapter 1 Radom Variables, Samplig ad Estimatio 1.1 Itroductio This chapter will cover the most importat basic statistical theory you eed i order to uderstad the ecoometric material that will be comig

More information

Expectation and Variance of a random variable

Expectation and Variance of a random variable Chapter 11 Expectatio ad Variace of a radom variable The aim of this lecture is to defie ad itroduce mathematical Expectatio ad variace of a fuctio of discrete & cotiuous radom variables ad the distributio

More information

Lecture 1. Statistics: A science of information. Population: The population is the collection of all subjects we re interested in studying.

Lecture 1. Statistics: A science of information. Population: The population is the collection of all subjects we re interested in studying. Lecture Mai Topics: Defiitios: Statistics, Populatio, Sample, Radom Sample, Statistical Iferece Type of Data Scales of Measuremet Describig Data with Numbers Describig Data Graphically. Defiitios. Example

More information

What is Probability?

What is Probability? Quatificatio of ucertaity. What is Probability? Mathematical model for thigs that occur radomly. Radom ot haphazard, do t kow what will happe o ay oe experimet, but has a log ru order. The cocept of probability

More information

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4 MATH 30: Probability ad Statistics 9. Estimatio ad Testig of Parameters Estimatio ad Testig of Parameters We have bee dealig situatios i which we have full kowledge of the distributio of a radom variable.

More information

Chapter 6 Sampling Distributions

Chapter 6 Sampling Distributions Chapter 6 Samplig Distributios 1 I most experimets, we have more tha oe measuremet for ay give variable, each measuremet beig associated with oe radomly selected a member of a populatio. Hece we eed to

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 Data Aalysis ad Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasii/teachig.html Suhasii Subba Rao Review of testig: Example The admistrator of a ursig home wats to do a time ad motio

More information

7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals

7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals 7-1 Chapter 4 Part I. Samplig Distributios ad Cofidece Itervals 1 7- Sectio 1. Samplig Distributio 7-3 Usig Statistics Statistical Iferece: Predict ad forecast values of populatio parameters... Test hypotheses

More information

CHAPTER 8 FUNDAMENTAL SAMPLING DISTRIBUTIONS AND DATA DESCRIPTIONS. 8.1 Random Sampling. 8.2 Some Important Statistics

CHAPTER 8 FUNDAMENTAL SAMPLING DISTRIBUTIONS AND DATA DESCRIPTIONS. 8.1 Random Sampling. 8.2 Some Important Statistics CHAPTER 8 FUNDAMENTAL SAMPLING DISTRIBUTIONS AND DATA DESCRIPTIONS 8.1 Radom Samplig The basic idea of the statistical iferece is that we are allowed to draw ifereces or coclusios about a populatio based

More information

Elementary Statistics

Elementary Statistics Elemetary Statistics M. Ghamsary, Ph.D. Sprig 004 Chap 0 Descriptive Statistics Raw Data: Whe data are collected i origial form, they are called raw data. The followig are the scores o the first test of

More information

Chapter 1 (Definitions)

Chapter 1 (Definitions) FINAL EXAM REVIEW Chapter 1 (Defiitios) Qualitative: Nomial: Ordial: Quatitative: Ordial: Iterval: Ratio: Observatioal Study: Desiged Experimet: Samplig: Cluster: Stratified: Systematic: Coveiece: Simple

More information

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING Lectures MODULE 5 STATISTICS II. Mea ad stadard error of sample data. Biomial distributio. Normal distributio 4. Samplig 5. Cofidece itervals

More information

Quick Review of Probability

Quick Review of Probability Quick Review of Probability Berli Che Departmet of Computer Sciece & Iformatio Egieerig Natioal Taiwa Normal Uiversity Refereces: 1. W. Navidi. Statistics for Egieerig ad Scietists. Chapter & Teachig Material.

More information

Confidence Intervals for the Population Proportion p

Confidence Intervals for the Population Proportion p Cofidece Itervals for the Populatio Proportio p The cocept of cofidece itervals for the populatio proportio p is the same as the oe for, the samplig distributio of the mea, x. The structure is idetical:

More information

A quick activity - Central Limit Theorem and Proportions. Lecture 21: Testing Proportions. Results from the GSS. Statistics and the General Population

A quick activity - Central Limit Theorem and Proportions. Lecture 21: Testing Proportions. Results from the GSS. Statistics and the General Population A quick activity - Cetral Limit Theorem ad Proportios Lecture 21: Testig Proportios Statistics 10 Coli Rudel Flip a coi 30 times this is goig to get loud! Record the umber of heads you obtaied ad calculate

More information

Quick Review of Probability

Quick Review of Probability Quick Review of Probability Berli Che Departmet of Computer Sciece & Iformatio Egieerig Natioal Taiwa Normal Uiversity Refereces: 1. W. Navidi. Statistics for Egieerig ad Scietists. Chapter 2 & Teachig

More information

April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE

April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE TERRY SOO Abstract These otes are adapted from whe I taught Math 526 ad meat to give a quick itroductio to cofidece

More information

Sample questions. 8. Let X denote a continuous random variable with probability density function f(x) = 4x 3 /15 for

Sample questions. 8. Let X denote a continuous random variable with probability density function f(x) = 4x 3 /15 for Sample questios Suppose that humas ca have oe of three bloodtypes: A, B, O Assume that 40% of the populatio has Type A, 50% has type B, ad 0% has Type O If a perso has type A, the probability that they

More information

Agreement of CI and HT. Lecture 13 - Tests of Proportions. Example - Waiting Times

Agreement of CI and HT. Lecture 13 - Tests of Proportions. Example - Waiting Times Sigificace level vs. cofidece level Agreemet of CI ad HT Lecture 13 - Tests of Proportios Sta102 / BME102 Coli Rudel October 15, 2014 Cofidece itervals ad hypothesis tests (almost) always agree, as log

More information

Chapter 8: STATISTICAL INTERVALS FOR A SINGLE SAMPLE. Part 3: Summary of CI for µ Confidence Interval for a Population Proportion p

Chapter 8: STATISTICAL INTERVALS FOR A SINGLE SAMPLE. Part 3: Summary of CI for µ Confidence Interval for a Population Proportion p Chapter 8: STATISTICAL INTERVALS FOR A SINGLE SAMPLE Part 3: Summary of CI for µ Cofidece Iterval for a Populatio Proportio p Sectio 8-4 Summary for creatig a 100(1-α)% CI for µ: Whe σ 2 is kow ad paret

More information

It is always the case that unions, intersections, complements, and set differences are preserved by the inverse image of a function.

It is always the case that unions, intersections, complements, and set differences are preserved by the inverse image of a function. MATH 532 Measurable Fuctios Dr. Neal, WKU Throughout, let ( X, F, µ) be a measure space ad let (!, F, P ) deote the special case of a probability space. We shall ow begi to study real-valued fuctios defied

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2016 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

1 Inferential Methods for Correlation and Regression Analysis

1 Inferential Methods for Correlation and Regression Analysis 1 Iferetial Methods for Correlatio ad Regressio Aalysis I the chapter o Correlatio ad Regressio Aalysis tools for describig bivariate cotiuous data were itroduced. The sample Pearso Correlatio Coefficiet

More information

UNIT 2 DIFFERENT APPROACHES TO PROBABILITY THEORY

UNIT 2 DIFFERENT APPROACHES TO PROBABILITY THEORY UNIT 2 DIFFERENT APPROACHES TO PROBABILITY THEORY Structure 2.1 Itroductio Objectives 2.2 Relative Frequecy Approach ad Statistical Probability 2. Problems Based o Relative Frequecy 2.4 Subjective Approach

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

Chapter 2 Descriptive Statistics

Chapter 2 Descriptive Statistics Chapter 2 Descriptive Statistics Statistics Most commoly, statistics refers to umerical data. Statistics may also refer to the process of collectig, orgaizig, presetig, aalyzig ad iterpretig umerical data

More information

Anna Janicka Mathematical Statistics 2018/2019 Lecture 1, Parts 1 & 2

Anna Janicka Mathematical Statistics 2018/2019 Lecture 1, Parts 1 & 2 Aa Jaicka Mathematical Statistics 18/19 Lecture 1, Parts 1 & 1. Descriptive Statistics By the term descriptive statistics we will mea the tools used for quatitative descriptio of the properties of a sample

More information

Frequentist Inference

Frequentist Inference Frequetist Iferece The topics of the ext three sectios are useful applicatios of the Cetral Limit Theorem. Without kowig aythig about the uderlyig distributio of a sequece of radom variables {X i }, for

More information

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample. Statistical Iferece (Chapter 10) Statistical iferece = lear about a populatio based o the iformatio provided by a sample. Populatio: The set of all values of a radom variable X of iterest. Characterized

More information

(6) Fundamental Sampling Distribution and Data Discription

(6) Fundamental Sampling Distribution and Data Discription 34 Stat Lecture Notes (6) Fudametal Samplig Distributio ad Data Discriptio ( Book*: Chapter 8,pg5) Probability& Statistics for Egieers & Scietists By Walpole, Myers, Myers, Ye 8.1 Radom Samplig: Populatio:

More information

Lecture 1 Probability and Statistics

Lecture 1 Probability and Statistics Wikipedia: Lecture 1 Probability ad Statistics Bejami Disraeli, British statesma ad literary figure (1804 1881): There are three kids of lies: lies, damed lies, ad statistics. popularized i US by Mark

More information

Introduction to Probability and Statistics Twelfth Edition

Introduction to Probability and Statistics Twelfth Edition Itroductio to Probability ad Statistics Twelfth Editio Robert J. Beaver Barbara M. Beaver William Medehall Presetatio desiged ad writte by: Barbara M. Beaver Itroductio to Probability ad Statistics Twelfth

More information

Class 27. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

Class 27. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700 Class 7 Daiel B. Rowe, Ph.D. Departmet of Mathematics, Statistics, ad Computer Sciece Copyright 013 by D.B. Rowe 1 Ageda: Skip Recap Chapter 10.5 ad 10.6 Lecture Chapter 11.1-11. Review Chapters 9 ad 10

More information

Discrete probability distributions

Discrete probability distributions Discrete probability distributios I the chapter o probability we used the classical method to calculate the probability of various values of a radom variable. I some cases, however, we may be able to develop

More information

College of Science Department of Statistics & OR

College of Science Department of Statistics & OR Biostatistics - STAT 45 Departmet of Statistics Summer Semester 43/43 Kig Saud Uiversity College of Sciece Departmet of Statistics & OR STAT 45 BIOSTATISTICS Summer Semester 43/43 Lectures' Notes Prof.

More information

Read through these prior to coming to the test and follow them when you take your test.

Read through these prior to coming to the test and follow them when you take your test. Math 143 Sprig 2012 Test 2 Iformatio 1 Test 2 will be give i class o Thursday April 5. Material Covered The test is cummulative, but will emphasize the recet material (Chapters 6 8, 10 11, ad Sectios 12.1

More information

Stat 319 Theory of Statistics (2) Exercises

Stat 319 Theory of Statistics (2) Exercises Kig Saud Uiversity College of Sciece Statistics ad Operatios Research Departmet Stat 39 Theory of Statistics () Exercises Refereces:. Itroductio to Mathematical Statistics, Sixth Editio, by R. Hogg, J.

More information

(7 One- and Two-Sample Estimation Problem )

(7 One- and Two-Sample Estimation Problem ) 34 Stat Lecture Notes (7 Oe- ad Two-Sample Estimatio Problem ) ( Book*: Chapter 8,pg65) Probability& Statistics for Egieers & Scietists By Walpole, Myers, Myers, Ye Estimatio 1 ) ( ˆ S P i i Poit estimate:

More information

Binomial Distribution

Binomial Distribution 0.0 0.5 1.0 1.5 2.0 2.5 3.0 0 1 2 3 4 5 6 7 0.0 0.5 1.0 1.5 2.0 2.5 3.0 Overview Example: coi tossed three times Defiitio Formula Recall that a r.v. is discrete if there are either a fiite umber of possible

More information

October 25, 2018 BIM 105 Probability and Statistics for Biomedical Engineers 1

October 25, 2018 BIM 105 Probability and Statistics for Biomedical Engineers 1 October 25, 2018 BIM 105 Probability ad Statistics for Biomedical Egieers 1 Populatio parameters ad Sample Statistics October 25, 2018 BIM 105 Probability ad Statistics for Biomedical Egieers 2 Ifereces

More information

Inferential Statistics. Inference Process. Inferential Statistics and Probability a Holistic Approach. Inference Process.

Inferential Statistics. Inference Process. Inferential Statistics and Probability a Holistic Approach. Inference Process. Iferetial Statistics ad Probability a Holistic Approach Iferece Process Chapter 8 Poit Estimatio ad Cofidece Itervals This Course Material by Maurice Geraghty is licesed uder a Creative Commos Attributio-ShareAlike

More information

Parameter, Statistic and Random Samples

Parameter, Statistic and Random Samples Parameter, Statistic ad Radom Samples A parameter is a umber that describes the populatio. It is a fixed umber, but i practice we do ot kow its value. A statistic is a fuctio of the sample data, i.e.,

More information

AMS570 Lecture Notes #2

AMS570 Lecture Notes #2 AMS570 Lecture Notes # Review of Probability (cotiued) Probability distributios. () Biomial distributio Biomial Experimet: ) It cosists of trials ) Each trial results i of possible outcomes, S or F 3)

More information

Mathacle. PSet Stats, Concepts In Statistics Level Number Name: Date: Confidence Interval Guesswork with Confidence

Mathacle. PSet Stats, Concepts In Statistics Level Number Name: Date: Confidence Interval Guesswork with Confidence PSet ----- Stats, Cocepts I Statistics Cofidece Iterval Guesswork with Cofidece VII. CONFIDENCE INTERVAL 7.1. Sigificace Level ad Cofidece Iterval (CI) The Sigificace Level The sigificace level, ofte deoted

More information

Confidence Intervals QMET103

Confidence Intervals QMET103 Cofidece Itervals QMET103 Library, Teachig ad Learig CONFIDENCE INTERVALS provide a iterval estimate of the ukow populatio parameter. What is a cofidece iterval? Statisticias have a habit of hedgig their

More information

Goodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen)

Goodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen) Goodess-of-Fit Tests ad Categorical Data Aalysis (Devore Chapter Fourtee) MATH-252-01: Probability ad Statistics II Sprig 2019 Cotets 1 Chi-Squared Tests with Kow Probabilities 1 1.1 Chi-Squared Testig................

More information

Statistics 511 Additional Materials

Statistics 511 Additional Materials Cofidece Itervals o mu Statistics 511 Additioal Materials This topic officially moves us from probability to statistics. We begi to discuss makig ifereces about the populatio. Oe way to differetiate probability

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

Infinite Sequences and Series

Infinite Sequences and Series Chapter 6 Ifiite Sequeces ad Series 6.1 Ifiite Sequeces 6.1.1 Elemetary Cocepts Simply speakig, a sequece is a ordered list of umbers writte: {a 1, a 2, a 3,...a, a +1,...} where the elemets a i represet

More information

As stated by Laplace, Probability is common sense reduced to calculation.

As stated by Laplace, Probability is common sense reduced to calculation. Note: Hadouts DO NOT replace the book. I most cases, they oly provide a guidelie o topics ad a ituitive feel. The math details will be covered i class, so it is importat to atted class ad also you MUST

More information

Chapter 23: Inferences About Means

Chapter 23: Inferences About Means Chapter 23: Ifereces About Meas Eough Proportios! We ve spet the last two uits workig with proportios (or qualitative variables, at least) ow it s time to tur our attetios to quatitative variables. For

More information

5. A formulae page and two tables are provided at the end of Part A of the examination PART A

5. A formulae page and two tables are provided at the end of Part A of the examination PART A Istructios: 1. You have bee provided with: (a) this questio paper (Part A ad Part B) (b) a multiple choice aswer sheet (for Part A) (c) Log Aswer Sheet(s) (for Part B) (d) a booklet of tables. (a) I PART

More information

KLMED8004 Medical statistics. Part I, autumn Estimation. We have previously learned: Population and sample. New questions

KLMED8004 Medical statistics. Part I, autumn Estimation. We have previously learned: Population and sample. New questions We have previously leared: KLMED8004 Medical statistics Part I, autum 00 How kow probability distributios (e.g. biomial distributio, ormal distributio) with kow populatio parameters (mea, variace) ca give

More information

MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND.

MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND. XI-1 (1074) MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND. R. E. D. WOOLSEY AND H. S. SWANSON XI-2 (1075) STATISTICAL DECISION MAKING Advaced

More information

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY GRADUATE DIPLOMA, 016 MODULE : Statistical Iferece Time allowed: Three hours Cadidates should aswer FIVE questios. All questios carry equal marks. The umber

More information

Module 1 Fundamentals in statistics

Module 1 Fundamentals in statistics Normal Distributio Repeated observatios that differ because of experimetal error ofte vary about some cetral value i a roughly symmetrical distributio i which small deviatios occur much more frequetly

More information

6.3 Testing Series With Positive Terms

6.3 Testing Series With Positive Terms 6.3. TESTING SERIES WITH POSITIVE TERMS 307 6.3 Testig Series With Positive Terms 6.3. Review of what is kow up to ow I theory, testig a series a i for covergece amouts to fidig the i= sequece of partial

More information

Response Variable denoted by y it is the variable that is to be predicted measure of the outcome of an experiment also called the dependent variable

Response Variable denoted by y it is the variable that is to be predicted measure of the outcome of an experiment also called the dependent variable Statistics Chapter 4 Correlatio ad Regressio If we have two (or more) variables we are usually iterested i the relatioship betwee the variables. Associatio betwee Variables Two variables are associated

More information

MATH/STAT 352: Lecture 15

MATH/STAT 352: Lecture 15 MATH/STAT 352: Lecture 15 Sectios 5.2 ad 5.3. Large sample CI for a proportio ad small sample CI for a mea. 1 5.2: Cofidece Iterval for a Proportio Estimatig proportio of successes i a biomial experimet

More information

Chapter 8: Estimating with Confidence

Chapter 8: Estimating with Confidence Chapter 8: Estimatig with Cofidece Sectio 8.2 The Practice of Statistics, 4 th editio For AP* STARNES, YATES, MOORE Chapter 8 Estimatig with Cofidece 8.1 Cofidece Itervals: The Basics 8.2 8.3 Estimatig

More information

Economics 250 Assignment 1 Suggested Answers. 1. We have the following data set on the lengths (in minutes) of a sample of long-distance phone calls

Economics 250 Assignment 1 Suggested Answers. 1. We have the following data set on the lengths (in minutes) of a sample of long-distance phone calls Ecoomics 250 Assigmet 1 Suggested Aswers 1. We have the followig data set o the legths (i miutes) of a sample of log-distace phoe calls 1 20 10 20 13 23 3 7 18 7 4 5 15 7 29 10 18 10 10 23 4 12 8 6 (1)

More information

CHAPTER SUMMARIES MAT102 Dr J Lubowsky Page 1 of 13 Chapter 1: Introduction to Statistics

CHAPTER SUMMARIES MAT102 Dr J Lubowsky Page 1 of 13 Chapter 1: Introduction to Statistics CHAPTER SUMMARIES MAT102 Dr J Lubowsky Page 1 of 13 Chapter 1: Itroductio to Statistics Misleadig Iformatio: Surveys ad advertisig claims ca be biased by urepresetative samples, biased questios, iappropriate

More information

Simulation. Two Rule For Inverting A Distribution Function

Simulation. Two Rule For Inverting A Distribution Function Simulatio Two Rule For Ivertig A Distributio Fuctio Rule 1. If F(x) = u is costat o a iterval [x 1, x 2 ), the the uiform value u is mapped oto x 2 through the iversio process. Rule 2. If there is a jump

More information

Estimation for Complete Data

Estimation for Complete Data Estimatio for Complete Data complete data: there is o loss of iformatio durig study. complete idividual complete data= grouped data A complete idividual data is the oe i which the complete iformatio of

More information

Chapter 6 Principles of Data Reduction

Chapter 6 Principles of Data Reduction Chapter 6 for BST 695: Special Topics i Statistical Theory. Kui Zhag, 0 Chapter 6 Priciples of Data Reductio Sectio 6. Itroductio Goal: To summarize or reduce the data X, X,, X to get iformatio about a

More information

Analysis of Experimental Data

Analysis of Experimental Data Aalysis of Experimetal Data 6544597.0479 ± 0.000005 g Quatitative Ucertaity Accuracy vs. Precisio Whe we make a measuremet i the laboratory, we eed to kow how good it is. We wat our measuremets to be both

More information

Eco411 Lab: Central Limit Theorem, Normal Distribution, and Journey to Girl State

Eco411 Lab: Central Limit Theorem, Normal Distribution, and Journey to Girl State Eco411 Lab: Cetral Limit Theorem, Normal Distributio, ad Jourey to Girl State 1. Some studets may woder why the magic umber 1.96 or 2 (called critical values) is so importat i statistics. Where do they

More information

ST 305: Exam 3 ( ) = P(A)P(B A) ( ) = P(A) + P(B) ( ) = 1 P( A) ( ) = P(A) P(B) σ X 2 = σ a+bx. σ ˆp. σ X +Y. σ X Y. σ X. σ Y. σ n.

ST 305: Exam 3 ( ) = P(A)P(B A) ( ) = P(A) + P(B) ( ) = 1 P( A) ( ) = P(A) P(B) σ X 2 = σ a+bx. σ ˆp. σ X +Y. σ X Y. σ X. σ Y. σ n. ST 305: Exam 3 By hadig i this completed exam, I state that I have either give or received assistace from aother perso durig the exam period. I have used o resources other tha the exam itself ad the basic

More information

AP Statistics Review Ch. 8

AP Statistics Review Ch. 8 AP Statistics Review Ch. 8 Name 1. Each figure below displays the samplig distributio of a statistic used to estimate a parameter. The true value of the populatio parameter is marked o each samplig distributio.

More information

Lecture 1 Probability and Statistics

Lecture 1 Probability and Statistics Wikipedia: Lecture 1 Probability ad Statistics Bejami Disraeli, British statesma ad literary figure (1804 1881): There are three kids of lies: lies, damed lies, ad statistics. popularized i US by Mark

More information

Comparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading

Comparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading Topic 15 - Two Sample Iferece I STAT 511 Professor Bruce Craig Comparig Two Populatios Research ofte ivolves the compariso of two or more samples from differet populatios Graphical summaries provide visual

More information

Properties and Hypothesis Testing

Properties and Hypothesis Testing Chapter 3 Properties ad Hypothesis Testig 3.1 Types of data The regressio techiques developed i previous chapters ca be applied to three differet kids of data. 1. Cross-sectioal data. 2. Time series data.

More information

Median and IQR The median is the value which divides the ordered data values in half.

Median and IQR The median is the value which divides the ordered data values in half. STA 666 Fall 2007 Web-based Course Notes 4: Describig Distributios Numerically Numerical summaries for quatitative variables media ad iterquartile rage (IQR) 5-umber summary mea ad stadard deviatio Media

More information

Central Limit Theorem the Meaning and the Usage

Central Limit Theorem the Meaning and the Usage Cetral Limit Theorem the Meaig ad the Usage Covetio about otatio. N, We are usig otatio X is variable with mea ad stadard deviatio. i lieu of sayig that X is a ormal radom Assume a sample of measuremets

More information

Statisticians use the word population to refer the total number of (potential) observations under consideration

Statisticians use the word population to refer the total number of (potential) observations under consideration 6 Samplig Distributios Statisticias use the word populatio to refer the total umber of (potetial) observatios uder cosideratio The populatio is just the set of all possible outcomes i our sample space

More information

Exam 2 Instructions not multiple versions

Exam 2 Instructions not multiple versions Exam 2 Istructios Remove this sheet of istructios from your exam. You may use the back of this sheet for scratch work. This is a closed book, closed otes exam. You are ot allowed to use ay materials other

More information

Stat 225 Lecture Notes Week 7, Chapter 8 and 11

Stat 225 Lecture Notes Week 7, Chapter 8 and 11 Normal Distributio Stat 5 Lecture Notes Week 7, Chapter 8 ad Please also prit out the ormal radom variable table from the Stat 5 homepage. The ormal distributio is by far the most importat distributio

More information

This is an introductory course in Analysis of Variance and Design of Experiments.

This is an introductory course in Analysis of Variance and Design of Experiments. 1 Notes for M 384E, Wedesday, Jauary 21, 2009 (Please ote: I will ot pass out hard-copy class otes i future classes. If there are writte class otes, they will be posted o the web by the ight before class

More information

Big Picture. 5. Data, Estimates, and Models: quantifying the accuracy of estimates.

Big Picture. 5. Data, Estimates, and Models: quantifying the accuracy of estimates. 5. Data, Estimates, ad Models: quatifyig the accuracy of estimates. 5. Estimatig a Normal Mea 5.2 The Distributio of the Normal Sample Mea 5.3 Normal data, cofidece iterval for, kow 5.4 Normal data, cofidece

More information

Confidence intervals summary Conservative and approximate confidence intervals for a binomial p Examples. MATH1005 Statistics. Lecture 24. M.

Confidence intervals summary Conservative and approximate confidence intervals for a binomial p Examples. MATH1005 Statistics. Lecture 24. M. MATH1005 Statistics Lecture 24 M. Stewart School of Mathematics ad Statistics Uiversity of Sydey Outlie Cofidece itervals summary Coservative ad approximate cofidece itervals for a biomial p The aïve iterval

More information

Chapter If n is odd, the median is the exact middle number If n is even, the median is the average of the two middle numbers

Chapter If n is odd, the median is the exact middle number If n is even, the median is the average of the two middle numbers Chapter 4 4-1 orth Seattle Commuity College BUS10 Busiess Statistics Chapter 4 Descriptive Statistics Summary Defiitios Cetral tedecy: The extet to which the data values group aroud a cetral value. Variatio:

More information

GG313 GEOLOGICAL DATA ANALYSIS

GG313 GEOLOGICAL DATA ANALYSIS GG313 GEOLOGICAL DATA ANALYSIS 1 Testig Hypothesis GG313 GEOLOGICAL DATA ANALYSIS LECTURE NOTES PAUL WESSEL SECTION TESTING OF HYPOTHESES Much of statistics is cocered with testig hypothesis agaist data

More information

Probability and statistics: basic terms

Probability and statistics: basic terms Probability ad statistics: basic terms M. Veeraraghava August 203 A radom variable is a rule that assigs a umerical value to each possible outcome of a experimet. Outcomes of a experimet form the sample

More information

1 of 7 7/16/2009 6:06 AM Virtual Laboratories > 6. Radom Samples > 1 2 3 4 5 6 7 6. Order Statistics Defiitios Suppose agai that we have a basic radom experimet, ad that X is a real-valued radom variable

More information

If, for instance, we were required to test whether the population mean μ could be equal to a certain value μ

If, for instance, we were required to test whether the population mean μ could be equal to a certain value μ STATISTICAL INFERENCE INTRODUCTION Statistical iferece is that brach of Statistics i which oe typically makes a statemet about a populatio based upo the results of a sample. I oesample testig, we essetially

More information

Computing Confidence Intervals for Sample Data

Computing Confidence Intervals for Sample Data Computig Cofidece Itervals for Sample Data Topics Use of Statistics Sources of errors Accuracy, precisio, resolutio A mathematical model of errors Cofidece itervals For meas For variaces For proportios

More information

This exam contains 19 pages (including this cover page) and 10 questions. A Formulae sheet is provided with the exam.

This exam contains 19 pages (including this cover page) and 10 questions. A Formulae sheet is provided with the exam. Probability ad Statistics FS 07 Secod Sessio Exam 09.0.08 Time Limit: 80 Miutes Name: Studet ID: This exam cotais 9 pages (icludig this cover page) ad 0 questios. A Formulae sheet is provided with the

More information

Discrete Mathematics for CS Spring 2008 David Wagner Note 22

Discrete Mathematics for CS Spring 2008 David Wagner Note 22 CS 70 Discrete Mathematics for CS Sprig 2008 David Wager Note 22 I.I.D. Radom Variables Estimatig the bias of a coi Questio: We wat to estimate the proportio p of Democrats i the US populatio, by takig

More information

Topic 10: Introduction to Estimation

Topic 10: Introduction to Estimation Topic 0: Itroductio to Estimatio Jue, 0 Itroductio I the simplest possible terms, the goal of estimatio theory is to aswer the questio: What is that umber? What is the legth, the reactio rate, the fractio

More information

Topic 6 Sampling, hypothesis testing, and the central limit theorem

Topic 6 Sampling, hypothesis testing, and the central limit theorem CSE 103: Probability ad statistics Fall 2010 Topic 6 Samplig, hypothesis testig, ad the cetral limit theorem 61 The biomial distributio Let X be the umberofheadswhe acoiofbiaspistossedtimes The distributio

More information

Department of Statistics and Operations Research College of Science, King Saud University. Summer Semester

Department of Statistics and Operations Research College of Science, King Saud University. Summer Semester Departmet of Statistics ad Operatios Research College of Sciece, Kig Saud Uiversity Summer Semester 46-47 STAT 34 Probability ad Statistics for Egieers EXERCISES A Collectio of Questios Selected from Midterm

More information

Chapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc.

Chapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc. Chapter 22 Comparig Two Proportios Copyright 2010, 2007, 2004 Pearso Educatio, Ic. Comparig Two Proportios Read the first two paragraphs of pg 504. Comparisos betwee two percetages are much more commo

More information

STAT 350 Handout 19 Sampling Distribution, Central Limit Theorem (6.6)

STAT 350 Handout 19 Sampling Distribution, Central Limit Theorem (6.6) STAT 350 Hadout 9 Samplig Distributio, Cetral Limit Theorem (6.6) A radom sample is a sequece of radom variables X, X 2,, X that are idepedet ad idetically distributed. o This property is ofte abbreviated

More information

Understanding Samples

Understanding Samples 1 Will Moroe CS 109 Samplig ad Bootstrappig Lecture Notes #17 August 2, 2017 Based o a hadout by Chris Piech I this chapter we are goig to talk about statistics calculated o samples from a populatio. We

More information

Instructor: Judith Canner Spring 2010 CONFIDENCE INTERVALS How do we make inferences about the population parameters?

Instructor: Judith Canner Spring 2010 CONFIDENCE INTERVALS How do we make inferences about the population parameters? CONFIDENCE INTERVALS How do we make ifereces about the populatio parameters? The samplig distributio allows us to quatify the variability i sample statistics icludig how they differ from the parameter

More information

Chapter two: Hypothesis testing

Chapter two: Hypothesis testing : Hypothesis testig - Some basic cocepts: - Data: The raw material of statistics is data. For our purposes we may defie data as umbers. The two kids of umbers that we use i statistics are umbers that result

More information