Binomial Distribution
|
|
- Iris Ruth Woods
- 1 years ago
- Views:
Transcription
1 Overview Example: coi tossed three times Defiitio Formula Recall that a r.v. is discrete if there are either a fiite umber of possible values e.g. y 1,..., y or at most there is oe for every iteger e.g. y 1, y 2,.... Also recall that the probability distributio of a discrete r.v. is described by the probability of each possible value of the r.v. i.e. py = PY = y for all possible y. Some particular probability distributios occur ofte because they are useful descriptios of certai chace pheomeo uder study. For example, biomial distributio for discrete r.v. is very useful i practice. Toss a coi idepedetly three times ad record the umber of times that the coi laded o heads. Let Y = # of heads. The the sample space is: 1st toss 2d toss 3rd toss Y H H H 3 H H T 2 H T H 2 H T T 1 T H H 2 T H T 1 T T H 1 T T T 0 Suppose p = probability of heads ad q = probability of tails. The q = 1 p ad PY = 3 = p p p = p 3 PY = 2 = p p q + p q p + q p p = 3p 2 q PY = 1 = 3pq 2 PY = 0 = q 3 A biomial distributio must satisfy three 3 coditios: 1. There are trials, each of which ca result i oe of the two outcomes, either success or failure. i.e. the trials are dichotomous or Beroulli. 2. The probability of a success is costat p for all trials. 3. The trials are idepedet. Defie r.v. Y = # of successes i trials. The Y is said to have a biomial distributio with parameters ad p ad is writte as: Y B, p Let Y B, p. The formula for the biomial distributio is: where! PY = r = r! r! pr q r r = 0, 1, 2,..., are the possible umbers of successes. q = 1 p is the probability of failure. p r = p p p r times. q r = q q q r times. r =! r! r! is read as choose r. A quick review of factorial operatio k! = k k 1 1 4! = = 24 5! = = 120 0! = Examples Oe more example Properties Proportios Suppose Y B3, 1 2. PY = 1 = 3! = 3 1!2! Suppose Y B7, PY = 3 = 7! 3!4! = 3, p = 0.5 = = ! = 7, p = 0.6 Toss a fair die three times. What is the probability that there is at least oce a 6? Let Y B3, 1 6. The Pat least oe 6 = 1 Po 6 = 1 PY = 0. Sice PY = 0 = 3! = !3! 6 6 we have Pat least oe 6 = = = 3, p = Y B, p is a discrete r.v. The expectatio of Y is The variace of Y is µ Y = EY = p σ 2 Y = V ary = pq For example, suppose Y B10, 1 4, EY = = 10 4 = 2.5, V ary = = Aother example, suppose Y B100, 1 4, EY = = = 25, V ary = = I some situatios, oe is iterested i the proportio of successes amog trials. Defie where Y B, p. The W = Y EW = p, V arw = pq usig properties of biomial distributio i.e. expectatio ad variace of Y
2 z PZ>=1.5 PZ<=1.5 P0<=Z<=1.5 Overview Defiitio Defiitio ad properties Biomial distributio is a very importat probability model ad is ofte very useful. For example, cure rate of a ew drug, proportio of lakes that are suitable habitats for fish, survival rate of seedligs, etc. However, biomial distributio is ot suitable for all situatios. Are the trials dichotomous? Is the success probability costat? Are the trials idepedet? Cosider the fair die game, # of rats cured withi the same litter, etc. Always thik about the three assumptios before usig a biomial distributio. Recall that a r.v. is discrete if there are either a fiite umber of possible values e.g. y 1,..., y or at most there is oe for every iteger e.g. y 1, y 2,.... I cotrast a r.v. is cotiuous if the possible values are over a etire iterval of umbers. The probability distributio of a cotiuous r.v. is described by a probability desity curve such that area uder the curve correspods to probability. The most frequetly used type of cotiuous distributio is ormal distributio, which is also kow as Gaussia distributio. Normal distributio is a very importat distributio for modelig cotiuous data, because data that are iflueced by may small ad urelated radom effects are approximately ormally distributed e.g. a studet s height. A ormal distributio is defied by a bell-shaped curve with a probability desity fuctio fy = 1 e 1 2σ 2y µ2 2πσ fy y A r.v. Y is said to have a ormal distributio with parameters µ ad σ 2 ad is writte as: Y Nµ, σ 2 Total area uder the distributio curve is 1. The distributio curve is symmetric about the ceter µ. Values aroud µ are more probable. The area i the rage of µ ±σ is about 2/3 of the total area The distributio curve has a well-defied bell shape. The parameters µ ad σ 2 provide complete iformatio about the distributio curve. I fact, EY = µ, V ary = σ Stadard Stadard Stadard Stadard Defiitio ad properties More properties Fidig probabilities Examples A r.v. Z has a stadardized ormal distributio if Z Nµ, σ 2 with expectatio µ = 0 ad variace σ 2 = 1 ad is writte as: Z N0, 1. The distributio curve is symmetric about the ceter µ = 0. The area betwee 1 ad 1 i.e. µ ± σ is about 2/3. The area betwee 2 ad 2 i.e. µ ± 2σ is Suppose Z is a stadard ormal r.v. i.e. Z N0, 1. PZ 0 = 0.5. P < Z < = 1. PZ = 1 = 0 ad i geeral PZ = c = 0 for ay c. Hece PZ < 0 = PZ 0 = 0.5. Remark: Suppose Y B3, 0.5 the PY 0 PY > 0! Table A o page 408 of the course otes bluebook gives PZ z for ay give value z 0 also kow as the upper tail probability. The z values are o the outside of Table A ad the probabilities are i the iside of Table A. Read the z value off x.x from the row idex ad 0.0x from the colum idex. PZ 1.5 = PZ 1.5 = 1 PZ 1.5 = = P0 Z 1.5 = PZ 0 PZ 1.5 = = How to fid PZ z for ay give value z? Drawig pictures helps! fz
3 PZ>= 0.6 P 1.7<=Z<=1.7 mu = 20, sigma = 3 P20<=Y<= P 1.7<=Z<=1.7 P X < = 0.9 PZ<=?=0.90 z = 1.28 mu = 20, sigma = 3 PY<= Stadard Stadard Stadard Stadard Fidig quatiles PZ 0.6 = 1 PZ 0.6 = 1 PZ 0.6 = = P 1.7 Z 1.7 = 2 P0 Z 1.7 = = P 1.7 Z 1.7 = 1 PZ 1.7 or Z 1.7 = 1 2 PZ 1.7 = = = What value z would give PZ z = 0.90? This z value would also give PZ z = = Look up Table A for the z value that correspods to Whe z = 1.28, PZ z = Whe z = 1.29, PZ z = Thus z value sought after is a umber betwee 1.28 ad More directly the table o page 92 of the bluebook gives z = Iterpretatio: The z value sought after is the cut-off value where 90% of the populatio lies below i.e. the 0.9 th populatio quatile or the 90 th populatio percetile Geeral Geeral Geeral Geeral Defiitio ad stadardizatio Example A r.v. Y is said to have a ormal distributio with parameters µ ad σ 2 ad is writte as: Y Nµ, σ 2 For example, Y N20, 3 2 stads for ormal distributio with µ = 20, σ 2 = 9. FACT: Suppose Y Nµ, σ 2, the a very useful trasformatio is Z = Y µ N0, 1 σ For Y N20, 3 2 i.e. µ = 20, σ 2 = 9, we have P20 Y 23 = P20 µ Y µ 23 µ 20 µ = P Y µ 23 µ σ σ σ = P Z 3 3 = P0 Z 1 = PZ 0 PZ 1 = = Give a value y, the term z = y µ σ is called the z-score correspodig to y. It represets the umber of stadard deviatios y is from µ. For Y N20, 3 2, 23 is 1σ away from µ = is 0σ away from µ = 20 i.e. at µ = is 4 3σ away from µ = is 1σ away from µ = 20. Suppose Y N20, 3 2, what is PY 15.5? Y µ PY 15.5 = P 15.5 µ σ σ = P Z 3 = PZ 1.5 =
4 P X < = 0.9 mu = 60, sigma = 10 PY<=?=0.90 z = Geeral Biomial ad s Fidig quatiles Key R commads Overview Defiitios Suppose Y N60, 10 2, what value y would give PY y = 0.90? From previous discussio, we kow that Thus PZ = 0.90 Y µ P σ = 0.90 PY µ 1.282σ = 0.90 PY µ σ = 0.90 Thus y = µ σ = = y is the stadard deviatio above the mea. > # Y~B3,0.5, compute PY=1 > dbiomx=1,size=3,prob=0.5 [1] > # Y~B7,0.6, compute PY=3 > dbiomx=3,size=7,prob=0.6 [1] > # Y~B3,1/6, compute PY>0 > pbiomq=0,size=3,prob=1/6,lower.tail=f [1] > # of by complemet > 1-dbiomx=0,size=3,prob=1/6 [1] > # Z~N0,1, compute PZ>=1.5, PZ<=1.5, P0<=Z<=1.5, > porm1.5, lower.tail=f [1] > porm1.5, lower.tail=t [1] > porm0, lower.tail=f-porm1.5, lower.tail=f [1] > # Z~N0,1, compute PZ>=-0.6 > porm-0.6, lower.tail=f [1] > # Z~N0,1, compute P-1.7<=Z<=1.7 > 1-2*porm1.7, lower.tail=f [1] > # Z~N0,1, compute PZ<=?=0.90 > qorm0.9, lower.tail=t [1] > # Y~N20,9, compute P20<=Y<=23, PY<=15.5 > pormq=20,mea=20,sd=3,lower.tail=f-porm23,mea=20,sd=3,lower.tail=f [1] > pormq=15.5,mea=20,sd=3,lower.tail=t [1] > # Y~N60,100, compute PY<=?=0.90 > qormp=0.9,mea=60,sd=10,lower.tail=t [1] Recall that a radom variable r.v. is a variable that depeds o the outcome of a chace situatio. The probability distributio of a r.v. is described by the probability of all possible outcomes of the r.v. Now we put data ad probability models together by viewig a observatio obs as a outcome of a r.v. We cosider obs as outcomes of r.v. s. The r.v. s Y 1,Y 2,...,Y formig the sample data are called a radom sample of size from a give distributio if 1 the Y i s are idepedet r.v. s; 2 the Y i s are idetically distributed i.e. every Y i has the same probability distributio. Coditios 1 ad 2 are sometimes abbreviated to the Y i s are iid. Further assume: 3 For each i, EY i = µ, V ary i = σ 2. A statistic is ay quatity whose value ca be calculated from sample data. It is a r.v. For example, sample mea is a statistic: Ȳ = Y i = Y 1 + Y Y The we have uder 1 3: EȲ = µ, σ2 V arȳ = As icreases, V arȳ decreases Sample mea FACT 1 FACT 2 FACT 3 By 2 ad 3, 1 EȲ = E Y 1 + Y Y = 1 EY 1 + Y Y = 1 EY 1 + EY EY = 1 µ + µ + + µ = µ By 1, 2, ad 3, 1 V arȳ = V ar Y 1 + Y Y = 1 2V ary 1 + Y Y = 1 V ary1 + V ary2 + + V ary 2 = 1 2σ2 + σ σ 2 = 1 2 σ2 = σ2 Suppose Y 1, Y 2,..., Y is a radom sample from Nµ,σ 2 distributio. The Ȳ Nµ, σ2 For example, if Y 1, Y 2,..., Y 40 are iid N3, 4, the Ȳ N3, 4 = N3, We may write Ȳ Nµ,σ2, where the variace of Ȳ is Ȳ σ 2 Ȳ = σ2 ad the correspodig stadard deviatio of Ȳ is σ Ȳ = σ also kow as the stadard error of the mea. Suppose Y 1,Y 2,...,Y is a radom sample from Nµ,σ 2 distributio. The Y i Nµ,σ 2 For example, if Y 1,Y 2,...,Y 40 are iid N3, 4, the 40 Thik of Y i = Ȳ : E Y i = µ, Y i N120, 160 V ar Y i = σ 2. Suppose Y 1, Y 2,...,Y is a radom sample from a distributio with mea µ ad variace σ 2. If is large, the the distributio of Ȳ is closely approximated by Ȳ Nµ, σ2 The larger is, the better the approximatio. This fact is also kow as the cetral limit theorem CLT for the sample mea
5 = 60, p = Example: fair die game FACT 4 A quick summary o CLT Recall the fair die game example: Y = wiig $ with probability distributio p0 = 1/2, p9 = 1/6,p15 = 1/3 ad mea µ = 6.5, σ 2 = If we repeat the game 1000 times idepedetly, the for Y 1, Y 2,..., Y 1000, we have EȲ = 6.5, V arȳ = 1000 By CLT, Ȳ is approximately N 6.5, For example, PȲ 7 P Z = PZ 2.32 = Suppose Y 1, Y 2,..., Y is a radom sample from a distributio with mea µ ad variace σ 2. If is large, the the distributio of Y i is closely approximated by Y i Nµ,σ 2 The larger is, the better the approximatio. This fact is also kow as the cetral limit theorem CLT for the sample sum. Example: fair die game Note that = 1000,E Y i = = 6500, V ar Y i = = By FACT 4, Y i is approximately N6500, The, P Y i 7000 P Z = PZ 2.32 = Radom sample, statistic, sample mea. If Y i iid Dµ, σ 2 σ2, EȲ = µ, V arȳ =. If Y i iid Nµ, σ 2 σ2, Ȳ Nµ,. If Y i iid Dµ, σ 2 σ2, approximately Ȳ Nµ, CLT. Similarly for Y i with mea µ ad variace σ 2. The CLT is a remarkable result. It says that regardless of the shape of the origial distributio, the takig of averages or sums results i ormal distributio. Hece we oly eed to kow the populatio mea ad stadard deviatio to have a approximate distributio of Ȳ or Y i. Why is ormal distributio so useful? May atural pheomea give rise to ormal variables; may variables ca be trasformed to ormal distributio; ad CLT. Simulatios. How big must be? Usually whe 30, the CLT works well. But if Y i has a highly skewed distributio, the we eed larger, i which case we may cosider trasformatio to a symmetric distributio first. A applicatio of the CLT is to approximate a biomial distributio by a ormal distributio Normal approximatio of biomial Normal approximatio of biomial Normal approximatio of biomial Recall that if Y B, p, the µ = p, σ 2 = pq where q = 1 p. Let Y B60, 0.4. Thus = 60, p = 0.4, ad µ Y = 24, σ 2 Y = Compute PY 20 = PY = 0 + PY = PY = 20? Now, thik of Y as the sum of = 60 iid r.v. s, each of which is a 0/1 variable. Let Y i = 1 for a success with probability p ad Y i = 0 for a failure with probability q. The Y = Y i is the umber of successes i = 60 trials. By the CLT o sum FACT 4, we ca approximate the distributio of Y by ormal distributio with mea ad variace µ Y = p = 24 σ 2 Y = pq = 14.4 = Hece if Y B60, 0.4, the Y NA N24, Now, we approximate PY 20 by PY 20 PY NA 20 YNA = P = PZ = The exact probability is PY 20 = > pbiomq=20, size=60, prob=0.4, lower.tail=t > [1] The discrepacy is largely due to the fact that we are approximatig a discrete r.v. by a cotiuous r.v. We ca make the ormal approximatio better by usig cotiuity correctio cc. PY 20 PY NA 20.5 YNA = P = PZ = Normal approximatio is OK if p 5 ad q 5. Always check this before usig ormal distributio to approximate biomial distributio. I the example, Y B60, 0.4, p = 24,q = 36, thus it is OK to use ormal approximatio. Cosider proportio of success deoted as W before For Y B60, 0.4, ˆp = Y Eˆp = p = 0.4, V arˆp = pq = = Hece by CLT o average FACT 3, ˆp NA N0.4, Thus for the example Y B60, 0.4, Pˆp 0.48 Pˆp NA 0.48 ˆpNA = P = PZ = =
6 Sample variace Sample variace Example Suppose Y i are iid Nµ, σ 2. We kow that the sample mea Ȳ Nµ, σ2. What about the sample variace also a statistic FACT S 2 = 1 1 Y i Ȳ 2? If Y 1, Y 2,..., Y form a radom sample from Nµ, σ 2, the V 2 1S2 = χ 2 σ 2 1 where χ 2 1 is a chi-squared distributio with 1 degrees of freedom d.f.. Shape of the χ 2 1 distributio curve depeds o the d.f. V 2 0 The distributio curve is skewed i.e. asymmetric. The mea ad variace of V 2 χ 2 1 are EV 2 = 1, V arv 2 = 2 1. Thik of V 2 as a scaled versio of S 2. Table B is i terms of V 2. Hece cautio: always scale to V 2 before usig Table B. Sice we have ES 2 = σ 2, σ2 S 2 = 1 V 2 V ars 2 = 2σ4 1. Oe reaso to use 1 i S 2 is to esure ES 2 = σ 2. As icreases, S 2 estimates σ 2 with more precisio. Let Y 1, Y 2,...,Y 7 be a radom sample from N6, 3 2 i.e. = 7. What is PS 2 > 21? 1S PS > 21 = P > σ 2 σ 2 = PV 2 > = PV 2 > 14 where V 2 χ 2 6. From Table B, we have So ad I R, PV 2 > = 0.05, PV 2 > = > pchisqq=14, df=6, lower.tail=f > [1] < PV 2 > 14 < < PS 2 > 21 < 0.05 I the example, if Y i N200, 3 2, the PS 2 > 21 is the same, because it does ot deped o µ. Agai cautio: rescale S 2 to V 2 before usig Table B. More o the d.f. for sample variace S 2 = 1 Y i 1 Ȳ 2 Cosider = 3 ad all the deviatios. Oce we kow that it follows that Y 1 Ȳ = 3, Y2 Ȳ = 1, Y 3 Ȳ = 2. Hece the complete iformatio about deviatios is cotaied i 1 of them. I geeral, d.f. represets the amout or pieces of iformatio available about the spread. We use Ȳ,S2 istead of media ad IQR, because they are easier to deal with
Parameter, Statistic and Random Samples
Parameter, Statistic ad Radom Samples A parameter is a umber that describes the populatio. It is a fixed umber, but i practice we do ot kow its value. A statistic is a fuctio of the sample data, i.e.,
Chapter 6 Sampling Distributions
Chapter 6 Samplig Distributios 1 I most experimets, we have more tha oe measuremet for ay give variable, each measuremet beig associated with oe radomly selected a member of a populatio. Hece we eed to
Sampling Distributions, Z-Tests, Power
Samplig Distributios, Z-Tests, Power We draw ifereces about populatio parameters from sample statistics Sample proportio approximates populatio proportio Sample mea approximates populatio mea Sample variace
Probability and statistics: basic terms
Probability ad statistics: basic terms M. Veeraraghava August 203 A radom variable is a rule that assigs a umerical value to each possible outcome of a experimet. Outcomes of a experimet form the sample
Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1.
Eco 325/327 Notes o Sample Mea, Sample Proportio, Cetral Limit Theorem, Chi-square Distributio, Studet s t distributio 1 Sample Mea By Hiro Kasahara We cosider a radom sample from a populatio. Defiitio
Statisticians use the word population to refer the total number of (potential) observations under consideration
6 Samplig Distributios Statisticias use the word populatio to refer the total umber of (potetial) observatios uder cosideratio The populatio is just the set of all possible outcomes i our sample space
The variance of a sum of independent variables is the sum of their variances, since covariances are zero. Therefore. V (xi )= n n 2 σ2 = σ2.
SAMPLE STATISTICS A radom sample x 1,x,,x from a distributio f(x) is a set of idepedetly ad idetically variables with x i f(x) for all i Their joit pdf is f(x 1,x,,x )=f(x 1 )f(x ) f(x )= f(x i ) The sample
7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals
7-1 Chapter 4 Part I. Samplig Distributios ad Cofidece Itervals 1 7- Sectio 1. Samplig Distributio 7-3 Usig Statistics Statistical Iferece: Predict ad forecast values of populatio parameters... Test hypotheses
Lecture 4. Random variable and distribution of probability
Itroductio to theory of probability ad statistics Lecture. Radom variable ad distributio of probability dr hab.iż. Katarzya Zarzewsa, prof.agh Katedra Eletroii, AGH e-mail: za@agh.edu.pl http://home.agh.edu.pl/~za
Confidence Intervals. Confidence Intervals
A overview Mot probability ditributio are idexed by oe me parameter. F example, N(µ,σ 2 ) B(, p). I igificace tet, we have ued poit etimat f parameter. F example, f iid Y 1,Y 2,...,Y N(µ,σ 2 ), Ȳ i a poit
Lecture 01: the Central Limit Theorem. 1 Central Limit Theorem for i.i.d. random variables
CSCI-B609: A Theorist s Toolkit, Fall 06 Aug 3 Lecture 0: the Cetral Limit Theorem Lecturer: Yua Zhou Scribe: Yua Xie & Yua Zhou Cetral Limit Theorem for iid radom variables Let us say that we wat to aalyze
Introduction to Probability and Statistics Twelfth Edition
Itroductio to Probability ad Statistics Twelfth Editio Robert J. Beaver Barbara M. Beaver William Medehall Presetatio desiged ad writte by: Barbara M. Beaver Itroductio to Probability ad Statistics Twelfth
Chapter 6 Principles of Data Reduction
Chapter 6 for BST 695: Special Topics i Statistical Theory. Kui Zhag, 0 Chapter 6 Priciples of Data Reductio Sectio 6. Itroductio Goal: To summarize or reduce the data X, X,, X to get iformatio about a
The standard deviation of the mean
Physics 6C Fall 20 The stadard deviatio of the mea These otes provide some clarificatio o the distictio betwee the stadard deviatio ad the stadard deviatio of the mea.. The sample mea ad variace Cosider
Central Limit Theorem the Meaning and the Usage
Cetral Limit Theorem the Meaig ad the Usage Covetio about otatio. N, We are usig otatio X is variable with mea ad stadard deviatio. i lieu of sayig that X is a ormal radom Assume a sample of measuremets
DS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10
DS 00: Priciples ad Techiques of Data Sciece Date: April 3, 208 Name: Hypothesis Testig Discussio #0. Defie these terms below as they relate to hypothesis testig. a) Data Geeratio Model: Solutio: A set
KLMED8004 Medical statistics. Part I, autumn Estimation. We have previously learned: Population and sample. New questions
We have previously leared: KLMED8004 Medical statistics Part I, autum 00 How kow probability distributios (e.g. biomial distributio, ormal distributio) with kow populatio parameters (mea, variace) ca give
Discrete probability distributions
Discrete probability distributios I the chapter o probability we used the classical method to calculate the probability of various values of a radom variable. I some cases, however, we may be able to develop
Problems from 9th edition of Probability and Statistical Inference by Hogg, Tanis and Zimmerman:
Math 224 Fall 2017 Homework 4 Drew Armstrog Problems from 9th editio of Probability ad Statistical Iferece by Hogg, Tais ad Zimmerma: Sectio 2.3, Exercises 16(a,d),18. Sectio 2.4, Exercises 13, 14. Sectio
0, otherwise. EX = E(X 1 + X n ) = EX j = np and. Var(X j ) = np(1 p). Var(X) = Var(X X n ) =
PROBABILITY MODELS 35 10. Discrete probability distributios I this sectio, we discuss several well-ow discrete probability distributios ad study some of their properties. Some of these distributios, lie
Confidence Level We want to estimate the true mean of a random variable X economically and with confidence.
Cofidece Iterval 700 Samples Sample Mea 03 Cofidece Level 095 Margi of Error 0037 We wat to estimate the true mea of a radom variable X ecoomically ad with cofidece True Mea μ from the Etire Populatio
Discrete Mathematics and Probability Theory Spring 2016 Rao and Walrand Note 19
CS 70 Discrete Mathematics ad Probability Theory Sprig 2016 Rao ad Walrad Note 19 Some Importat Distributios Recall our basic probabilistic experimet of tossig a biased coi times. This is a very simple
Instructor: Judith Canner Spring 2010 CONFIDENCE INTERVALS How do we make inferences about the population parameters?
CONFIDENCE INTERVALS How do we make ifereces about the populatio parameters? The samplig distributio allows us to quatify the variability i sample statistics icludig how they differ from the parameter
f(x)dx = 1 and f(x) 0 for all x.
OCR Statistics 2 Module Revisio Sheet The S2 exam is 1 hour 30 miutes log. You are allowed a graphics calculator. Before you go ito the exam make sureyou are fully aware of the cotets of theformula booklet
Modeling and Performance Analysis with Discrete-Event Simulation
Simulatio Modelig ad Performace Aalysis with Discrete-Evet Simulatio Chapter 5 Statistical Models i Simulatio Cotets Basic Probability Theory Cocepts Useful Statistical Models Discrete Distributios Cotiuous
Statistical Intervals for a Single Sample
3/5/06 Applied Statistics ad Probability for Egieers Sixth Editio Douglas C. Motgomery George C. Ruger Chapter 8 Statistical Itervals for a Sigle Sample 8 CHAPTER OUTLINE 8- Cofidece Iterval o the Mea
Statistical inference: example 1. Inferential Statistics
Statistical iferece: example 1 Iferetial Statistics POPULATION SAMPLE A clothig store chai regularly buys from a supplier large quatities of a certai piece of clothig. Each item ca be classified either
Joint Probability Distributions and Random Samples. Jointly Distributed Random Variables. Chapter { }
UCLA STAT A Applied Probability & Statistics for Egieers Istructor: Ivo Diov, Asst. Prof. I Statistics ad Neurology Teachig Assistat: Neda Farziia, UCLA Statistics Uiversity of Califoria, Los Ageles, Sprig
Lecture 1 Probability and Statistics
Wikipedia: Lecture 1 Probability ad Statistics Bejami Disraeli, British statesma ad literary figure (1804 1881): There are three kids of lies: lies, damed lies, ad statistics. popularized i US by Mark
5. Likelihood Ratio Tests
1 of 5 7/29/2009 3:16 PM Virtual Laboratories > 9. Hy pothesis Testig > 1 2 3 4 5 6 7 5. Likelihood Ratio Tests Prelimiaries As usual, our startig poit is a radom experimet with a uderlyig sample space,
Median and IQR The median is the value which divides the ordered data values in half.
STA 666 Fall 2007 Web-based Course Notes 4: Describig Distributios Numerically Numerical summaries for quatitative variables media ad iterquartile rage (IQR) 5-umber summary mea ad stadard deviatio Media
NCSS Statistical Software. Tolerance Intervals
Chapter 585 Itroductio This procedure calculates oe-, ad two-, sided tolerace itervals based o either a distributio-free (oparametric) method or a method based o a ormality assumptio (parametric). A two-sided
Recall the study where we estimated the difference between mean systolic blood pressure levels of users of oral contraceptives and non-users, x - y.
Testig Statistical Hypotheses Recall the study where we estimated the differece betwee mea systolic blood pressure levels of users of oral cotraceptives ad o-users, x - y. Such studies are sometimes viewed
1 Inferential Methods for Correlation and Regression Analysis
1 Iferetial Methods for Correlatio ad Regressio Aalysis I the chapter o Correlatio ad Regressio Aalysis tools for describig bivariate cotiuous data were itroduced. The sample Pearso Correlatio Coefficiet
Output Analysis and Run-Length Control
IEOR E4703: Mote Carlo Simulatio Columbia Uiversity c 2017 by Marti Haugh Output Aalysis ad Ru-Legth Cotrol I these otes we describe how the Cetral Limit Theorem ca be used to costruct approximate (1 α%
Infinite Sequences and Series
Chapter 6 Ifiite Sequeces ad Series 6.1 Ifiite Sequeces 6.1.1 Elemetary Cocepts Simply speakig, a sequece is a ordered list of umbers writte: {a 1, a 2, a 3,...a, a +1,...} where the elemets a i represet
UCLA STAT 13 Introduction to Statistical Methods for the Life and Health Sciences
UCLA STAT 13 Itroductio to Statistical Methods for the Life ad Health Scieces Istructor: Ivo Diov, Asst. Prof. of Statistics ad Neurolog Sample Size Calculatios & Cofidece Itervals for Proportios Teachig
Some Basic Probability Concepts. 2.1 Experiments, Outcomes and Random Variables
Some Basic Probability Cocepts 2. Experimets, Outcomes ad Radom Variables A radom variable is a variable whose value is ukow util it is observed. The value of a radom variable results from a experimet;
Basis for simulation techniques
Basis for simulatio techiques M. Veeraraghava, March 7, 004 Estimatio is based o a collectio of experimetal outcomes, x, x,, x, where each experimetal outcome is a value of a radom variable. x i. Defiitios
Topic 6 Sampling, hypothesis testing, and the central limit theorem
CSE 103: Probability ad statistics Fall 2010 Topic 6 Samplig, hypothesis testig, ad the cetral limit theorem 61 The biomial distributio Let X be the umberofheadswhe acoiofbiaspistossedtimes The distributio
Sampling Error. Chapter 6 Student Lecture Notes 6-1. Business Statistics: A Decision-Making Approach, 6e. Chapter Goals
Chapter 6 Studet Lecture Notes 6-1 Busiess Statistics: A Decisio-Makig Approach 6 th Editio Chapter 6 Itroductio to Samplig Distributios Chap 6-1 Chapter Goals After completig this chapter, you should
Stat 200 -Testing Summary Page 1
Stat 00 -Testig Summary Page 1 Mathematicias are like Frechme; whatever you say to them, they traslate it ito their ow laguage ad forthwith it is somethig etirely differet Goethe 1 Large Sample Cofidece
Table 12.1: Contingency table. Feature b. 1 N 11 N 12 N 1b 2 N 21 N 22 N 2b. ... a N a1 N a2 N ab
Sectio 12 Tests of idepedece ad homogeeity I this lecture we will cosider a situatio whe our observatios are classified by two differet features ad we would like to test if these features are idepedet
Topic 18: Composite Hypotheses
Toc 18: November, 211 Simple hypotheses limit us to a decisio betwee oe of two possible states of ature. This limitatio does ot allow us, uder the procedures of hypothesis testig to address the basic questio:
Variance of Discrete Random Variables Class 5, Jeremy Orloff and Jonathan Bloom
Variace of Discrete Radom Variables Class 5, 18.05 Jeremy Orloff ad Joatha Bloom 1 Learig Goals 1. Be able to compute the variace ad stadard deviatio of a radom variable.. Uderstad that stadard deviatio
Exam II Review. CEE 3710 November 15, /16/2017. EXAM II Friday, November 17, in class. Open book and open notes.
Exam II Review CEE 3710 November 15, 017 EXAM II Friday, November 17, i class. Ope book ad ope otes. Focus o material covered i Homeworks #5 #8, Note Packets #10 19 1 Exam II Topics **Will emphasize material
Unbiased Estimation. February 7-12, 2008
Ubiased Estimatio February 7-2, 2008 We begi with a sample X = (X,..., X ) of radom variables chose accordig to oe of a family of probabilities P θ where θ is elemet from the parameter space Θ. For radom
The Sample Variance Formula: A Detailed Study of an Old Controversy
The Sample Variace Formula: A Detailed Study of a Old Cotroversy Ky M. Vu PhD. AuLac Techologies Ic. c 00 Email: kymvu@aulactechologies.com Abstract The two biased ad ubiased formulae for the sample variace
MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 6 9/23/2013. Brownian motion. Introduction
MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/5.070J Fall 203 Lecture 6 9/23/203 Browia motio. Itroductio Cotet.. A heuristic costructio of a Browia motio from a radom walk. 2. Defiitio ad basic properties
Element sampling: Part 2
Chapter 4 Elemet samplig: Part 2 4.1 Itroductio We ow cosider uequal probability samplig desigs which is very popular i practice. I the uequal probability samplig, we ca improve the efficiecy of the resultig
This is an introductory course in Analysis of Variance and Design of Experiments.
1 Notes for M 384E, Wedesday, Jauary 21, 2009 (Please ote: I will ot pass out hard-copy class otes i future classes. If there are writte class otes, they will be posted o the web by the ight before class
A sequence of numbers is a function whose domain is the positive integers. We can see that the sequence
Sequeces A sequece of umbers is a fuctio whose domai is the positive itegers. We ca see that the sequece,, 2, 2, 3, 3,... is a fuctio from the positive itegers whe we write the first sequece elemet as
Law of the sum of Bernoulli random variables
Law of the sum of Beroulli radom variables Nicolas Chevallier Uiversité de Haute Alsace, 4, rue des frères Lumière 68093 Mulhouse icolas.chevallier@uha.fr December 006 Abstract Let be the set of all possible
Lecture 9: September 19
36-700: Probability ad Mathematical Statistics I Fall 206 Lecturer: Siva Balakrisha Lecture 9: September 9 9. Review ad Outlie Last class we discussed: Statistical estimatio broadly Pot estimatio Bias-Variace
PH 425 Quantum Measurement and Spin Winter SPINS Lab 1
PH 425 Quatum Measuremet ad Spi Witer 23 SPIS Lab Measure the spi projectio S z alog the z-axis This is the experimet that is ready to go whe you start the program, as show below Each atom is measured
Confidence Intervals for the Population Proportion p
Cofidece Itervals for the Populatio Proportio p The cocept of cofidece itervals for the populatio proportio p is the same as the oe for, the samplig distributio of the mea, x. The structure is idetical:
Chapter 3. Strong convergence. 3.1 Definition of almost sure convergence
Chapter 3 Strog covergece As poited out i the Chapter 2, there are multiple ways to defie the otio of covergece of a sequece of radom variables. That chapter defied covergece i probability, covergece i
(b) What is the probability that a particle reaches the upper boundary n before the lower boundary m?
MATH 529 The Boudary Problem The drukard s walk (or boudary problem) is oe of the most famous problems i the theory of radom walks. Oe versio of the problem is described as follows: Suppose a particle
STAT 203 Chapter 18 Sampling Distribution Models
STAT 203 Chapter 18 Samplig Distributio Models Populatio vs. sample, parameter vs. statistic Recall that a populatio cotais the etire collectio of idividuals that oe wats to study, ad a sample is a subset
STA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to:
STA 2023 Module 10 Comparig Two Proportios Learig Objectives Upo completig this module, you should be able to: 1. Perform large-sample ifereces (hypothesis test ad cofidece itervals) to compare two populatio
A LARGER SAMPLE SIZE IS NOT ALWAYS BETTER!!!
A LARGER SAMLE SIZE IS NOT ALWAYS BETTER!!! Nagaraj K. Neerchal Departmet of Mathematics ad Statistics Uiversity of Marylad Baltimore Couty, Baltimore, MD 2250 Herbert Lacayo ad Barry D. Nussbaum Uited
Probability, Expectation Value and Uncertainty
Chapter 1 Probability, Expectatio Value ad Ucertaity We have see that the physically observable properties of a quatum system are represeted by Hermitea operators (also referred to as observables ) such
MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 19 11/17/2008 LAWS OF LARGE NUMBERS II THE STRONG LAW OF LARGE NUMBERS
MASSACHUSTTS INSTITUT OF TCHNOLOGY 6.436J/5.085J Fall 2008 Lecture 9 /7/2008 LAWS OF LARG NUMBRS II Cotets. The strog law of large umbers 2. The Cheroff boud TH STRONG LAW OF LARG NUMBRS While the weak
Statistical Fundamentals and Control Charts
Statistical Fudametals ad Cotrol Charts 1. Statistical Process Cotrol Basics Chace causes of variatio uavoidable causes of variatios Assigable causes of variatio large variatios related to machies, materials,
62. Power series Definition 16. (Power series) Given a sequence {c n }, the series. c n x n = c 0 + c 1 x + c 2 x 2 + c 3 x 3 +
62. Power series Defiitio 16. (Power series) Give a sequece {c }, the series c x = c 0 + c 1 x + c 2 x 2 + c 3 x 3 + is called a power series i the variable x. The umbers c are called the coefficiets of
The Random Walk For Dummies
The Radom Walk For Dummies Richard A Mote Abstract We look at the priciples goverig the oe-dimesioal discrete radom walk First we review five basic cocepts of probability theory The we cosider the Beroulli
2.2. Central limit theorem.
36.. Cetral limit theorem. The most ideal case of the CLT is that the radom variables are iid with fiite variace. Although it is a special case of the more geeral Lideberg-Feller CLT, it is most stadard
Closed book and notes. No calculators. 60 minutes, but essentially unlimited time.
IE 230 Seat # Closed book ad otes. No calculators. 60 miutes, but essetially ulimited time. Cover page, four pages of exam, ad Pages 8 ad 12 of the Cocise Notes. This test covers through Sectio 4.7 of
Properties and Hypothesis Testing
Chapter 3 Properties ad Hypothesis Testig 3.1 Types of data The regressio techiques developed i previous chapters ca be applied to three differet kids of data. 1. Cross-sectioal data. 2. Time series data.
MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND.
XI-1 (1074) MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND. R. E. D. WOOLSEY AND H. S. SWANSON XI-2 (1075) STATISTICAL DECISION MAKING Advaced
STATISTICAL INFERENCE
STATISTICAL INFERENCE POPULATION AND SAMPLE Populatio = all elemets of iterest Characterized by a distributio F with some parameter θ Sample = the data X 1,..., X, selected subset of the populatio = sample
1 Constructing and Interpreting a Confidence Interval
Itroductory Applied Ecoometrics EEP/IAS 118 Sprig 2014 WARM UP: Match the terms i the table with the correct formula: Adrew Crae-Droesch Sectio #6 5 March 2014 ˆ Let X be a radom variable with mea µ ad
Introducing Sample Proportions
Itroducig Sample Proportios Probability ad statistics Aswers & Notes TI-Nspire Ivestigatio Studet 60 mi 7 8 9 0 Itroductio A 00 survey of attitudes to climate chage, coducted i Australia by the CSIRO,
ECE 901 Lecture 12: Complexity Regularization and the Squared Loss
ECE 90 Lecture : Complexity Regularizatio ad the Squared Loss R. Nowak 5/7/009 I the previous lectures we made use of the Cheroff/Hoeffdig bouds for our aalysis of classifier errors. Hoeffdig s iequality
Discrete Mathematics and Probability Theory Spring 2014 Anant Sahai Lecture 16
EECS 70 Discrete Mathematics ad Probability Theory Sprig 2014 Aat Sahai Lecture 16 Variace Questio: Let us retur oce agai to the questio of how may heads i a typical sequece of coi flips. Recall that we
Math 140 Introductory Statistics
8.2 Testig a Proportio Math 1 Itroductory Statistics Professor B. Abrego Lecture 15 Sectios 8.2 People ofte make decisios with data by comparig the results from a sample to some predetermied stadard. These
Y i n. i=1. = 1 [number of successes] number of successes = n
Eco 371 Problem Set # Aswer Sheet 3. I this questio, you are asked to cosider a Beroulli radom variable Y, with a success probability P ry 1 p. You are told that you have draws from this distributio ad
MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 6 9/24/2008 DISCRETE RANDOM VARIABLES AND THEIR EXPECTATIONS
MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 6 9/24/2008 DISCRETE RANDOM VARIABLES AND THEIR EXPECTATIONS Cotets 1. A few useful discrete radom variables 2. Joit, margial, ad
Exam 2 Instructions not multiple versions
Exam 2 Istructios Remove this sheet of istructios from your exam. You may use the back of this sheet for scratch work. This is a closed book, closed otes exam. You are ot allowed to use ay materials other
Analysis of Experimental Data
Aalysis of Experimetal Data 6544597.0479 ± 0.000005 g Quatitative Ucertaity Accuracy vs. Precisio Whe we make a measuremet i the laboratory, we eed to kow how good it is. We wat our measuremets to be both
Singular Continuous Measures by Michael Pejic 5/14/10
Sigular Cotiuous Measures by Michael Peic 5/4/0 Prelimiaries Give a set X, a σ-algebra o X is a collectio of subsets of X that cotais X ad ad is closed uder complemetatio ad coutable uios hece, coutable
The Poisson Distribution
MATH 382 The Poisso Distributio Dr. Neal, WKU Oe of the importat distributios i probabilistic modelig is the Poisso Process X t that couts the umber of occurreces over a period of t uits of time. This
Number of fatalities X Sunday 4 Monday 6 Tuesday 2 Wednesday 0 Thursday 3 Friday 5 Saturday 8 Total 28. Day
LECTURE # 8 Mea Deviatio, Stadard Deviatio ad Variace & Coefficiet of variatio Mea Deviatio Stadard Deviatio ad Variace Coefficiet of variatio First, we will discuss it for the case of raw data, ad the
N <- <- # <- # <- # 1 <- # 6 <- ; <- N-1
Exercise: Get the mothly S&P 500 idex (^GSPC) ad plot the periodogram of the returs. Mae sure that the umber of returs is divisible by. Y
f X (12) = Pr(X = 12) = Pr({(6, 6)}) = 1/36
Probability Distributios A Example With Dice If X is a radom variable o sample space S, the the probability that X takes o the value c is Similarly, Pr(X = c) = Pr({s S X(s) = c}) Pr(X c) = Pr({s S X(s)
On an Application of Bayesian Estimation
O a Applicatio of ayesia Estimatio KIYOHARU TANAKA School of Sciece ad Egieerig, Kiki Uiversity, Kowakae, Higashi-Osaka, JAPAN Email: ktaaka@ifokidaiacjp EVGENIY GRECHNIKOV Departmet of Mathematics, auma
IE 230 Probability & Statistics in Engineering I. Closed book and notes. No calculators. 120 minutes.
Closed book ad otes. No calculators. 120 miutes. Cover page, five pages of exam, ad tables for discrete ad cotiuous distributios. Score X i =1 X i / S X 2 i =1 (X i X ) 2 / ( 1) = [i =1 X i 2 X 2 ] / (
Confidence Intervals QMET103
Cofidece Itervals QMET103 Library, Teachig ad Learig CONFIDENCE INTERVALS provide a iterval estimate of the ukow populatio parameter. What is a cofidece iterval? Statisticias have a habit of hedgig their
Statistics 20: Final Exam Solutions Summer Session 2007
1. 20 poits Testig for Diabetes. Statistics 20: Fial Exam Solutios Summer Sessio 2007 (a) 3 poits Give estimates for the sesitivity of Test I ad of Test II. Solutio: 156 patiets out of total 223 patiets
Lecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting
Lecture 6 Chi Square Distributio (χ ) ad Least Squares Fittig Chi Square Distributio (χ ) Suppose: We have a set of measuremets {x 1, x, x }. We kow the true value of each x i (x t1, x t, x t ). We would
The coalescent coalescence theory
The coalescet coalescece theory Peter Beerli September 1, 009 Historical ote Up to 198 most developmet i populatio geetics was prospective ad developed expectatios based o situatios of today. Most work
o <Xln <X2n <... <X n < o (1.1)
Metrika, Volume 28, 1981, page 257-262. 9 Viea. Estimatio Problems for Rectagular Distributios (Or the Taxi Problem Revisited) By J.S. Rao, Sata Barbara I ) Abstract: The problem of estimatig the ukow
Lesson 10: Limits and Continuity
www.scimsacademy.com Lesso 10: Limits ad Cotiuity SCIMS Academy 1 Limit of a fuctio The cocept of limit of a fuctio is cetral to all other cocepts i calculus (like cotiuity, derivative, defiite itegrals
Ma 530 Infinite Series I
Ma 50 Ifiite Series I Please ote that i additio to the material below this lecture icorporated material from the Visual Calculus web site. The material o sequeces is at Visual Sequeces. (To use this li
CH19 Confidence Intervals for Proportions. Confidence intervals Construct confidence intervals for population proportions
CH19 Cofidece Itervals for Proportios Cofidece itervals Costruct cofidece itervals for populatio proportios Motivatio Motivatio We are iterested i the populatio proportio who support Mr. Obama. This sample
STAT 516 Answers Homework 6 April 2, 2008 Solutions by Mark Daniel Ward PROBLEMS
STAT 56 Aswers Homework 6 April 2, 28 Solutios by Mark Daiel Ward PROBLEMS Chapter 6 Problems 2a. The mass p(, correspods to either o the irst two balls beig white, so p(, 8 7 4/39. The mass p(, correspods
Matrix Representation of Data in Experiment
Matrix Represetatio of Data i Experimet Cosider a very simple model for resposes y ij : y ij i ij, i 1,; j 1,,..., (ote that for simplicity we are assumig the two () groups are of equal sample size ) Y
4.1 Sigma Notation and Riemann Sums
0 the itegral. Sigma Notatio ad Riema Sums Oe strategy for calculatig the area of a regio is to cut the regio ito simple shapes, calculate the area of each simple shape, ad the add these smaller areas
10 Simulation-Assisted Estimation
CB495-10DRV CB495/Trai KEY BOARDED August 20, 2002 13:43 Char Cout= 0 10 Simulatio-Assisted Estimatio 10.1 Motivatio So far we have examied how to simulate choice probabilities but have ot ivestigated
PROBABILITY LOGIC: Part 2
James L Bec 2 July 2005 PROBABILITY LOGIC: Part 2 Axioms for Probability Logic Based o geeral cosideratios, we derived axioms for: Pb ( a ) = measure of the plausibility of propositio b coditioal o the