Overview of Estimation
|
|
- Herbert Austin
- 5 years ago
- Views:
Transcription
1 Topic Iferece is the problem of turig data ito kowledge, where kowledge ofte is expressed i terms of etities that are ot preset i the data per se but are preset i models that oe uses to iterpret the data. Statistical rigor is ecessary to justify the iferetial leap from data to kowledge, ad may difficulties arise i attemptig to brig statistical priciples to bear o massive data. Overlookig this foudatio may yield results that are, at best, ot useful, or harmful at worst. I ay discussio of massive data ad iferece, it is essetial to be aware that it is quite possible to tur data ito somethig resemblig kowledge whe actually it is ot. Moreover, it ca be quite difficult to kow that this has happeed. - page, Frotiers i Massive Data Aalysis by the Natioal Research Coucil, 03. The balace of tis book is devoted to developig formal procedures of statistical iferece. I this itroductio to iferece, we will be basig our aalysis o the premise that the data has bee collected accordig to well-desiged procedures. We will focus our presetatio o parametric estimatio ad hypothesis testig based o a give family of probably models chose to be cosistet with the sciece uder ivestigatio ad with the data collectio procedures.. Itroductio I the simplest possible terms, the goal of estimatio theory is to aswer the questio: What is that umber? What is the legth, the reactio rate, the fractio displayig a particular behavior, the temperature, the kietic eergy, the Michaelis costat, the speed of light, mutatio rate, the meltig poit, the probability that the domiat allele is expressed, the elasticity, the force, the mass, the free eergy, the mea umber of offsprig, the focal legth, mea lifetime, the slope ad itercept of a lie? The ext step is to perform a experimet that is well desiged to estimate oe (or more) umbers. However, before we ca embark o such a desig, we must lear some priciples of estimatio to have some uderstadig of the properties of a good estimator ad to preset our ucertaily about the estimatio procedure. Statistics has provided two distict approaches this questio - typically called classical or frequetist ad Bayesia. We shall give a overview of both approaches. However, the otes will emphasize the classical approach. We begi with a defiitio: Defiitio.. A statistic is a fuctio of the data that does ot deped o ay ukow parameter. We have to this poit, see a variety of statistics. Example.. sample mea, x 85
2 sample variace, s sample stadard deviatio, s sample media, sample quartiles Q,Q 3, percetiles ad other quatiles stadardized scores (x i x)/s order statistics x (),x (),...x (), icludig sample maximum ad miimum sample momets x m = x m k, m =,, 3,... k= Here, we will look at a particular type of parameter estimatio, i which we cosider X =(X,...,X ), idepedet radom variables chose accordig to oe of a family of probabilities P where is elemet from the parameter space. Based o our aalysis, we choose a estimator ˆ (X). If the data x takes o the values x,x,...,x, the ˆ (x,x,...,x ) is called the estimate of. Thus we have three closely related objects,. - the parameter, a elemet of the parameter space. This is a umber or a vector.. ˆ (x,x,...,x ) - the estimate. This agai is a umber or a vector obtaied by evaluatig the estimator o the data x =(x,x,...,x ). 3. ˆ (X,...,X ) - the estimator. This is a radom variable. We will aalyze the distributio of this radom variable to decide how well it performs i estimatig. The first of these three objects is a umber. The secod is a statistic. The third ca be aalyzed ad its properties described usig the theory of probability. Keepig the relatioship amog these three objects i mid is essetial i uderstadig the fudametal issues i statistical estimatio. Example.3. For Beroulli trials X =(X,...,X ), we have. p, a sigle parameter, the probability of success, with parameter space [0, ].. ˆp(x,...,x ) is the sample proportio of successes i the data set. 3. ˆp(X,...,X ), the sample mea of the radom variables ˆp(X,...,X )= (X + + X )= S is a estimator of p. We ca give the distributio of this estimator because S is a biomial radom variable. Example.4. Give pairs of observatios (x, y) =((x,y ), (x,y ),...,(x,y )) that display a geeral liear patter, we use ordiary least squares regress for. parameters - the slope ad itercept of the regressio lie. So, the parameter space is R, pairs of real umbers.. They are estimated usig the statistics ˆ ad ˆ i the equatios ˆ(x, y) = cov(x, y) var(x), ȳ =ˆ(x, y)+ ˆ(x, y) x. 3. Later, whe we cosider statistical iferece for liear regressio, we will aalyze the distributio of the estimators. Exercise.5. Let X =(X,...,X ) be idepedet uiform radom variables o the iterval [0, ] with ukow. Give some estimators of from the statistics above. 86
3 . Classical Statistics I classical statistics, the state of ature is assumed to be fixed, but ukow to us. Thus, oe goal of estimatio is to determie which of the P is the source of the data. The estimate is a statistic ˆ : data. Itroductio to estimatio i the classical approach to statistics is based o two fudametal questios: How do we determie estimators? How do we evaluate estimators? We ca ask if this estimator i ay way systematically uder or over estimate the parameter, if it has large or small variace, ad how does it compare to a otio of best possible estimator. How easy is it to determie ad to compute ad how does the procedure improve with icreased sample size? The raw material for our aalysis of ay estimator is the distributio of the radom variables that uderlie the data uder ay possible value of the parameter. To simplify laguage, we shall use the term desity fuctio to refer to both cotiuous ad discrete radom variables. Thus, to each parameter value, there exists a desity fuctio which we deote f X (x ). We focus o experimetal desigs based o a simple radom sample. To be more precise, the observatios are based o a experimetal desig that yields a sequece of radom variables X,...,X, draw from a family of distributios havig commo desity f X (x ) where the parameter value is ukow ad must be estimated. Because the radom variables are idepedet, the joit desity is the product of the margial desities. Y f X (x ) = f X (x k ) =f X (x )f X (x ) f X (x ). k=i I this circumstace, the data x are kow ad the parameter is ukow. Thus, we write the desity fuctio as L( x) =f X (x ) ad call L the likelihood fuctio. Because the algebra ad calculus of f X (x ) are a bit ufamiliar, we will look at several examples. Example.6 (Parametric families of desities).. For Beroulli trials with a kow umber of trials but ukow success probability parameter p has joit desity f X (x p) =p x ( p) x p x ( p) x p x ( p) x = p P k= x k ( p) P k= ( x k) = p P k= x k ( p) P k= x k = p x ( x) ( p). Normal radom variables with kow variace 0 but ukow mea µ has joit desity (x µ) (x µ) (x µ) f X (x µ) = p exp 0 0 p exp 0 0 p exp 0 0 = p ( exp 0 ) 0 (x k µ) k= 87
4 3. Normal radom variables with ukow mea µ ad variace has desity f X (x µ, )= ( p ) exp (x k µ). 4. Beta radom variables with parameters ad has joit desity k= f X (x, )= ( + ) (x x x ) (( x ) ( x ) ( x )) () ( ). Exercise.7. Give the likelihood fuctio for observatios of idepedet (, ) radom variables. The choice of a poit estimator ˆ is ofte the first step. The ext two topics will be devoted to cosider two approaches for determiig estimators - method of momets ad maximum likelihood. We ext move to aalyze the quality of the estimator. With this i view, we will give methods for approximatig the bias ad the variace of the estimators. Typically, this iformatio is, i part, summarized though what is kow as a iterval estimator. This is a procedure that determies a subset of the parameter space with high probability that it cotais the real state of ature. We see this most frequetly i the use of cofidece itervals..3 Bayesia Statistics For a few tosses of a coi always that always tur up tails, the estimate ˆp =0for the probability of heads did ot seem reasoable to Thomas Bayes. He wated a way to place our ucertaily of the value for p ito the procedure for estimatio. Today, the Bayesia approach to statistics takes ito accout ot oly the desity f X (x ) for the data collected for ay give experimet but also exteral iformatio to determie a prior desity o the parameter space. Thus, i this approach, both the parameter ad the data are modeled as radom. Estimatio is based o Bayes formula. Let be a radom variable havig the give prior desity. I the case i which both ad the data take o oly a fiite set of values, is a discrete radom variable ad is a mass fuctio { } = P { = }. Let C = { = } be the evet that takes o the value ad A = {X = x} be the values take o by the data. The {C, } from a partitio of the probability space. Bayes formula is f X ( x) =P { = X = x} = P (C A) = P (A C )P (C ) P P (A C )P (C ) or P {X=x = }P { P = } P {X=x = }P { = } = f X (x ) { } P fx (x ) { }. Give data x, the fuctio of, f X ( x) =P { = X = x} is called the posterior desity. For a cotiuous distributio o the parameter space, is ow a desity for a cotiuous radom variable ad the sum i Bayes formula becomes a itegral. f X ( x) = f X (x ) ( ) R fx (x ) ( ) d (.) Sometimes we shall write (.) as f X ( x) =c(x)f X (x ) ( ) 88
5 where c(x), the reciprocal of the itegral i the deomiator i (.), is the value ecessary so that the itegral of the posterior desity f X ( x) with respect to equals. We might also write f X ( x) / f X (x ) ( ) (.) where c(x) is the costat of proportioality. Estimatio, e.g., poit ad iterval estimates, i the Bayesia approach is based o the data ad a aalysis usig the posterior desity. For example, oe way to estimate is to use the mea of the posterior distributio, or more briefly, the posterior mea, Z ˆ (x) =E[ x] = f X ( x) d. Example.8. As suggested i the origial questio of Thomas Bayes, we will make idepedet flips of a biased coi ad use a Bayesia approach to make some iferece for the probability of heads. We first eed to set a prior distributio for P. The beta family Beta(, ) of distributios takes values i the iterval [0, ] ad provides a coveiet prior desity. Thus, (p) =c, p ( ) ( p) ( ), 0 <p<. Ay desity o the iterval [0, ] that ca be writte a a power of p times a power of that is a member of the beta family. This distributio has mea + ad = Z 0 variace (p) dp p times a costat chose so ( + ) ( + + ). (.3) Thus, the mea is the ratio of ad +. If the two parameters are each multiplied by a factor of k, the the mea does ot chage. However, the variace is reduced by a factor close to k. The prior gives a sese of our prior kowledge of the mea through the ratio of to + ad our ucertaily through the size of ad If we perform Beroulli trials, x =(x,...,x ), the the joit desity f X (x p) =p P k= x k ( p) P k= x k. Thus the posterior distributio of the parameter P give the data x, usig (.), we have. f P X (p x) / f X P (x p) (p) =p P k= x k ( p) P k= xk c, p ( ) ( p) ( ). = c, p +P k= x k ( p) + P k= x k. Cosequetly, the posterior distributio is also from the beta family with parameters + x k ad + x k = + ( x k ). k= Notice that the posterior mea ca be writte as k= k= + # successes ad + # failures. + P k= x k + + = P k= x k + + x k = k= + = x + +. This expressio allow us to see that the posterior mea ca be expresses as a weighted average /( + prior mea ad x, the sample mea from the data. The relative weights are 89 + ) from the
6 + from the prior ad, the umber of observatios. Thus, if the umber of observatios is small compared to + mea /( + ). As the umber of observatios icrease, the + +, the most of the weight is placed o the prior icreases towards. The weight result i a shift the posterior mea away from the prior mea ad towards the sample mea x. This brigs forward two cetral issues i the use of the Bayesia approach to estimatio. If the umber of observatios is small, the the estimate relies heavily o the quality of the choice of the prior distributio. Thus, a ureliable choice for leads to a ureliable estimate. As the umber of observatios icreases, the estimate relies less ad less o the prior distributio. I this circumstace, the prior may simply be playig the roll of a catalyst that allows the machiery of the Bayesia methodology to proceed. Exercise.9. Show that this aswer is equivalet to havig heads ad cois. tails i the data set before actually flippig Example.0. If we flip a coi = 4 times with 8 heads, the the classical estimate of the success probability p is 8/4=4/7. For a Bayesia aalysis with a beta prior distributio, usig (.3) we have a beta posterior distributio with the followig parameters. prior data posterior mea variace heads tails mea variace 6 6 / /5= /(+4)=7/3 68/854= /4 3/08= /(7+9) =7/6 53/85= /4 3/08= /(5+)=/6 65/854= x Figure.: Example of prior (black) ad posterior (red) desities based o 4 coi flips, 8 heads ad 6 tails. Left pael: Prior is Beta(6, 6), Right pael: Prior is Beta(9, 3). Note how the peak is arrowed. This shows that the posterior variace is smaller tha the prior variace. I additio, the peal moves from the prior towards ˆp =4/7, the sample proportio of the umber of heads. x 90
7 I his origial example, Bayes chose was the uiform distributio ( = posterior mea is + + X x k. For the example above k= =) for his prior. I this case the prior data posterior mea variace heads tails mea variace / /= /(9+7)=9/6 63/435=0.044 Example.. Suppose that the prior desity is a ormal radom variable with mea 0 ad variace /. This way of givig the variace may seem uusual, but we will see that is a measure of iformatio. Thus, low variace meas high iformatio. Our data x are a realizatio of idepedet ormal radom variables with ukow mea. We shall choose the variace to be to set a scale for the size of the variatio i the measuremets that yield the data x. We will preset this example omittig some of the algebraic steps to focus o the cetral ideas. The prior desity is r ( ) = exp ( 0). We rewrite the desity for the data to empathize the differece betwee the parameter for the mea ad the x, the sample mea. f X (x ) = ( ) exp (x / i ) i= = (x i x). exp ( ) / ( x) The posterior desity is proportioal to the product f X (x ) ( ), Becsuse the posterior is a fuctio of, we eed oly keep track of the terms which ivolve. Cosequetly, we write the posterior desity as f X ( x) =c(x)exp (( x) + ( 0 ) ) where = c(x)exp( + i= ( (x)) ). (x) = 0 + x. (.4) + + Notice that the posterior distributio is ormal with mea (x) that results from the weighted average with relative weights from the iformatio from the prior ad from the data. The variace i iversely proportioal to the total iformatio +. Thus, if is small compared to, the (x) is ear 0. If is large compared to, (x) is ear x. Exercise.. Fill i the steps i the derivatio of the posterior desity i the example above. For these two examples, we see that the prior distributio ad the posterior distributio are members of the same parameterized family of distributios, amely the beta family ad the ormal family. I these two cases, we say that the prior desity ad the desity of the data form a cojugate pair. I the case of coi tosses, we fid that the beta ad the Beroulli families form a cojugate pair. I Example., we lear that the ormal desity is cojugate to itself. 9
8 x Figure.: Example of prior (black) ad posterior (red) desities for a ormal prior distributio ad ormally distributed data. I this figure the prior desity is N(, /). Thus, 0 =ad =. Here the data cosist of 3 observatios havig sample mea x =/3. Thus, the posterior mea from equatio (.4) is (x) =4/5ad the variace is /(+3) = /5. Typically, the computatio of the posterior desity is much more computatioally itesive that what was show i the two examples above. The choice of cojugate pairs is eticig because the posterior desity is a determied from a simple algebraic computatio. Bayesia statistics is seeig icreasig use i the scieces, icludig the life scieces, as we see the explosive icrease i the amout of data. For example, usig a classical approach, mutatio rates estimated from geetic sequece data are, due to the paucity of mutatio evets, ofte ot very precise. However, we ow have may data sets that ca be sythesized to create a prior distributio for mutatio rates ad will lead to estimates for this ad other parameters of iterest that will have much smaller variace tha uder the classical approach. Exercise.3. Show that the gamma family of distributios is a cojugate prior for the Poisso family of distributios. Give the posterior mea based o observatios..4 Aswers to Selected Exercises.5. Double the average, X. Take the maximum value of the data, max appleiapple x i. Double the differece of the maximum ad the miimum, (max appleiapple x i mi appleiapple x i )..7. The desity of a gamma radom variable Thus, for observatios f(x, )= () x L( x) =f(x, )f(x, ) f(x, ) = = () x e x () (x x x ) () x e e 9 e x. x (x +x + +x ) () x e x
9 .9. I this case the total umber of observatios is + + ad the total umber of successes is + P i= x i. Their ratio is the posterior mea... To iclude some of the details i the computatio, we first add ad subtract x i the sum for the joit desity, f X (x ) = ( ) exp (x / i ) = ( ) exp ((x / i x)+( x )) i= i= The we expad the square i the sum to obtai ((x i x)+( x )) = (x i x) + (x i x) ( x i= = i= i= (x i x) +0+( x ) i= )+ ( x ) i= This gives the joit desity f X (x ) = exp ( ) / ( x) (x i x). i= The posterior desity is f X ( x) =c(x)f X (x ) f ( ) = c(x) exp ( ) / r = c(x) ( ) / = c (x)exp ( x) exp i= r (x i x) exp i= (( x) + ( 0 ) ) ( 0) (x i x) exp (( x) + ( 0 ) ) Here c (x) is the fuctio of x i parethesis. We ow expad the expressios i the expoet,. ( x) + ( 0 ) =( x + x )+( 0 + 0) =( + ) ( x + 0 ) +( x + 0) =( + ) x + 0 +( x + + 0) =( + ) (x) + (x) ( + ) (x) +( x + 0) =( + )( (x)) ( + ) (x) +( x + 0) usig the defiitio of (x) i (.4) ad completig the square. f X ( x) =c (x)exp (( x + 0) ( + ) (x) +( + )( (x)) ) = c (x)exp (( x + 0) ( + ) (x) + ) exp( ( (x)) ) + = c (x)exp( ( (x)) ) 93
10 where c (x) is the fuctio of x i parethesis. This give a posterior desity that is ormal, mea (x) ad variace For observatios x,x,...,x of idepedet Poisso radom variables havig parameter, the joit desity is the product of the margial desities. f X (x )= x x e x x e x x e = x x x The prior desity o has a (, ) desity x +x + +x e = x x x x e. ( )= () e. Thus, the posterior desity f X ( x) =c(x) e x e = c(x) + x e ( +) is the desity of a ( + x, + ) radom variable. Its mea ca be writte as the weighted average + x + = + + x + of the prior mea / ad the sample mea x. The weights are, respectively, proportioal to ad the umber of observatios. The figure below demostrate the case with a (, ) prior desity o ad a sum x +x +x 3 +x 4 +x 5 =6for 5 values for idepedet observatios of a Poisso radom radom variable. Thus the posterior has a ( + 6, + 5) = (8, 6) distributio. desity x 94
Topic 10: Introduction to Estimation
Topic 0: Itroductio to Estimatio Jue, 0 Itroductio I the simplest possible terms, the goal of estimatio theory is to aswer the questio: What is that umber? What is the legth, the reactio rate, the fractio
More informationMATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4
MATH 30: Probability ad Statistics 9. Estimatio ad Testig of Parameters Estimatio ad Testig of Parameters We have bee dealig situatios i which we have full kowledge of the distributio of a radom variable.
More informationUnbiased Estimation. February 7-12, 2008
Ubiased Estimatio February 7-2, 2008 We begi with a sample X = (X,..., X ) of radom variables chose accordig to oe of a family of probabilities P θ where θ is elemet from the parameter space Θ. For radom
More informationThe standard deviation of the mean
Physics 6C Fall 20 The stadard deviatio of the mea These otes provide some clarificatio o the distictio betwee the stadard deviatio ad the stadard deviatio of the mea.. The sample mea ad variace Cosider
More informationTopic 8: Expected Values
Topic 8: Jue 6, 20 The simplest summary of quatitative data is the sample mea. Give a radom variable, the correspodig cocept is called the distributioal mea, the epectatio or the epected value. We begi
More informationStatistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.
Statistical Iferece (Chapter 10) Statistical iferece = lear about a populatio based o the iformatio provided by a sample. Populatio: The set of all values of a radom variable X of iterest. Characterized
More informationLecture 2: Monte Carlo Simulation
STAT/Q SCI 43: Itroductio to Resamplig ethods Sprig 27 Istructor: Ye-Chi Che Lecture 2: ote Carlo Simulatio 2 ote Carlo Itegratio Assume we wat to evaluate the followig itegratio: e x3 dx What ca we do?
More informationChapter 6 Principles of Data Reduction
Chapter 6 for BST 695: Special Topics i Statistical Theory. Kui Zhag, 0 Chapter 6 Priciples of Data Reductio Sectio 6. Itroductio Goal: To summarize or reduce the data X, X,, X to get iformatio about a
More informationParameter, Statistic and Random Samples
Parameter, Statistic ad Radom Samples A parameter is a umber that describes the populatio. It is a fixed umber, but i practice we do ot kow its value. A statistic is a fuctio of the sample data, i.e.,
More informationTopic 9: Sampling Distributions of Estimators
Topic 9: Samplig Distributios of Estimators Course 003, 2016 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be
More informationFACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures
FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING Lectures MODULE 5 STATISTICS II. Mea ad stadard error of sample data. Biomial distributio. Normal distributio 4. Samplig 5. Cofidece itervals
More informationTopic 18: Composite Hypotheses
Toc 18: November, 211 Simple hypotheses limit us to a decisio betwee oe of two possible states of ature. This limitatio does ot allow us, uder the procedures of hypothesis testig to address the basic questio:
More informationRandom Variables, Sampling and Estimation
Chapter 1 Radom Variables, Samplig ad Estimatio 1.1 Itroductio This chapter will cover the most importat basic statistical theory you eed i order to uderstad the ecoometric material that will be comig
More information1 of 7 7/16/2009 6:06 AM Virtual Laboratories > 6. Radom Samples > 1 2 3 4 5 6 7 6. Order Statistics Defiitios Suppose agai that we have a basic radom experimet, ad that X is a real-valued radom variable
More information1 Inferential Methods for Correlation and Regression Analysis
1 Iferetial Methods for Correlatio ad Regressio Aalysis I the chapter o Correlatio ad Regressio Aalysis tools for describig bivariate cotiuous data were itroduced. The sample Pearso Correlatio Coefficiet
More informationEXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY
EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY GRADUATE DIPLOMA, 016 MODULE : Statistical Iferece Time allowed: Three hours Cadidates should aswer FIVE questios. All questios carry equal marks. The umber
More informationCEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering
CEE 5 Autum 005 Ucertaity Cocepts for Geotechical Egieerig Basic Termiology Set A set is a collectio of (mutually exclusive) objects or evets. The sample space is the (collectively exhaustive) collectio
More informationExponential Families and Bayesian Inference
Computer Visio Expoetial Families ad Bayesia Iferece Lecture Expoetial Families A expoetial family of distributios is a d-parameter family f(x; havig the followig form: f(x; = h(xe g(t T (x B(, (. where
More informationDiscrete Mathematics for CS Spring 2008 David Wagner Note 22
CS 70 Discrete Mathematics for CS Sprig 2008 David Wager Note 22 I.I.D. Radom Variables Estimatig the bias of a coi Questio: We wat to estimate the proportio p of Democrats i the US populatio, by takig
More informationChapter 6 Sampling Distributions
Chapter 6 Samplig Distributios 1 I most experimets, we have more tha oe measuremet for ay give variable, each measuremet beig associated with oe radomly selected a member of a populatio. Hece we eed to
More informationTopic 9: Sampling Distributions of Estimators
Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be
More informationResampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.
Jauary 1, 2019 Resamplig Methods Motivatio We have so may estimators with the property θ θ d N 0, σ 2 We ca also write θ a N θ, σ 2 /, where a meas approximately distributed as Oce we have a cosistet estimator
More informationMath 155 (Lecture 3)
Math 55 (Lecture 3) September 8, I this lecture, we ll cosider the aswer to oe of the most basic coutig problems i combiatorics Questio How may ways are there to choose a -elemet subset of the set {,,,
More information4. Partial Sums and the Central Limit Theorem
1 of 10 7/16/2009 6:05 AM Virtual Laboratories > 6. Radom Samples > 1 2 3 4 5 6 7 4. Partial Sums ad the Cetral Limit Theorem The cetral limit theorem ad the law of large umbers are the two fudametal theorems
More informationIt is always the case that unions, intersections, complements, and set differences are preserved by the inverse image of a function.
MATH 532 Measurable Fuctios Dr. Neal, WKU Throughout, let ( X, F, µ) be a measure space ad let (!, F, P ) deote the special case of a probability space. We shall ow begi to study real-valued fuctios defied
More informationFinal Examination Solutions 17/6/2010
The Islamic Uiversity of Gaza Faculty of Commerce epartmet of Ecoomics ad Political Scieces A Itroductio to Statistics Course (ECOE 30) Sprig Semester 009-00 Fial Eamiatio Solutios 7/6/00 Name: I: Istructor:
More informationStatistics 511 Additional Materials
Cofidece Itervals o mu Statistics 511 Additioal Materials This topic officially moves us from probability to statistics. We begi to discuss makig ifereces about the populatio. Oe way to differetiate probability
More informationLecture 1 Probability and Statistics
Wikipedia: Lecture 1 Probability ad Statistics Bejami Disraeli, British statesma ad literary figure (1804 1881): There are three kids of lies: lies, damed lies, ad statistics. popularized i US by Mark
More informationStat 421-SP2012 Interval Estimation Section
Stat 41-SP01 Iterval Estimatio Sectio 11.1-11. We ow uderstad (Chapter 10) how to fid poit estimators of a ukow parameter. o However, a poit estimate does ot provide ay iformatio about the ucertaity (possible
More informationExpectation and Variance of a random variable
Chapter 11 Expectatio ad Variace of a radom variable The aim of this lecture is to defie ad itroduce mathematical Expectatio ad variace of a fuctio of discrete & cotiuous radom variables ad the distributio
More informationTopic 9: Sampling Distributions of Estimators
Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be
More informationCS434a/541a: Pattern Recognition Prof. Olga Veksler. Lecture 5
CS434a/54a: Patter Recogitio Prof. Olga Veksler Lecture 5 Today Itroductio to parameter estimatio Two methods for parameter estimatio Maimum Likelihood Estimatio Bayesia Estimatio Itroducto Bayesia Decisio
More informationParameter, Statistic and Random Samples
Parameter, Statistic ad Radom Samples A parameter is a umber that describes the populatio. It is a fixed umber, but i practice we do ot kow its value. A statistic is a fuctio of the sample data, i.e.,
More informationSimulation. Two Rule For Inverting A Distribution Function
Simulatio Two Rule For Ivertig A Distributio Fuctio Rule 1. If F(x) = u is costat o a iterval [x 1, x 2 ), the the uiform value u is mapped oto x 2 through the iversio process. Rule 2. If there is a jump
More informationFinal Review. Fall 2013 Prof. Yao Xie, H. Milton Stewart School of Industrial Systems & Engineering Georgia Tech
Fial Review Fall 2013 Prof. Yao Xie, yao.xie@isye.gatech.edu H. Milto Stewart School of Idustrial Systems & Egieerig Georgia Tech 1 Radom samplig model radom samples populatio radom samples: x 1,..., x
More informationMOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND.
XI-1 (1074) MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND. R. E. D. WOOLSEY AND H. S. SWANSON XI-2 (1075) STATISTICAL DECISION MAKING Advaced
More informationKLMED8004 Medical statistics. Part I, autumn Estimation. We have previously learned: Population and sample. New questions
We have previously leared: KLMED8004 Medical statistics Part I, autum 00 How kow probability distributios (e.g. biomial distributio, ormal distributio) with kow populatio parameters (mea, variace) ca give
More informationMath 152. Rumbos Fall Solutions to Review Problems for Exam #2. Number of Heads Frequency
Math 152. Rumbos Fall 2009 1 Solutios to Review Problems for Exam #2 1. I the book Experimetatio ad Measuremet, by W. J. Youde ad published by the by the Natioal Sciece Teachers Associatio i 1962, the
More information1.010 Uncertainty in Engineering Fall 2008
MIT OpeCourseWare http://ocw.mit.edu.00 Ucertaity i Egieerig Fall 2008 For iformatio about citig these materials or our Terms of Use, visit: http://ocw.mit.edu.terms. .00 - Brief Notes # 9 Poit ad Iterval
More informationProbability and MLE.
10-701 Probability ad MLE http://www.cs.cmu.edu/~pradeepr/701 (brief) itro to probability Basic otatios Radom variable - referrig to a elemet / evet whose status is ukow: A = it will rai tomorrow Domai
More informationn n i=1 Often we also need to estimate the variance. Below are three estimators each of which is optimal in some sense: n 1 i=1 k=1 i=1 k=1 i=1 k=1
MATH88T Maria Camero Cotets Basic cocepts of statistics Estimators, estimates ad samplig distributios 2 Ordiary least squares estimate 3 3 Maximum lielihood estimator 3 4 Bayesia estimatio Refereces 9
More information6. Sufficient, Complete, and Ancillary Statistics
Sufficiet, Complete ad Acillary Statistics http://www.math.uah.edu/stat/poit/sufficiet.xhtml 1 of 7 7/16/2009 6:13 AM Virtual Laboratories > 7. Poit Estimatio > 1 2 3 4 5 6 6. Sufficiet, Complete, ad Acillary
More informationLecture 7: Properties of Random Samples
Lecture 7: Properties of Radom Samples 1 Cotiued From Last Class Theorem 1.1. Let X 1, X,...X be a radom sample from a populatio with mea µ ad variace σ
More informationt distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference
EXST30 Backgroud material Page From the textbook The Statistical Sleuth Mea [0]: I your text the word mea deotes a populatio mea (µ) while the work average deotes a sample average ( ). Variace [0]: The
More informationTests of Hypotheses Based on a Single Sample (Devore Chapter Eight)
Tests of Hypotheses Based o a Sigle Sample Devore Chapter Eight MATH-252-01: Probability ad Statistics II Sprig 2018 Cotets 1 Hypothesis Tests illustrated with z-tests 1 1.1 Overview of Hypothesis Testig..........
More informationEcon 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara
Poit Estimator Eco 325 Notes o Poit Estimator ad Cofidece Iterval 1 By Hiro Kasahara Parameter, Estimator, ad Estimate The ormal probability desity fuctio is fully characterized by two costats: populatio
More informationStat 200 -Testing Summary Page 1
Stat 00 -Testig Summary Page 1 Mathematicias are like Frechme; whatever you say to them, they traslate it ito their ow laguage ad forthwith it is somethig etirely differet Goethe 1 Large Sample Cofidece
More informationMathematical Statistics - MS
Paper Specific Istructios. The examiatio is of hours duratio. There are a total of 60 questios carryig 00 marks. The etire paper is divided ito three sectios, A, B ad C. All sectios are compulsory. Questios
More informationEECS564 Estimation, Filtering, and Detection Hwk 2 Solns. Winter p θ (z) = (2θz + 1 θ), 0 z 1
EECS564 Estimatio, Filterig, ad Detectio Hwk 2 Sols. Witer 25 4. Let Z be a sigle observatio havig desity fuctio where. p (z) = (2z + ), z (a) Assumig that is a oradom parameter, fid ad plot the maximum
More informationECE 8527: Introduction to Machine Learning and Pattern Recognition Midterm # 1. Vaishali Amin Fall, 2015
ECE 8527: Itroductio to Machie Learig ad Patter Recogitio Midterm # 1 Vaishali Ami Fall, 2015 tue39624@temple.edu Problem No. 1: Cosider a two-class discrete distributio problem: ω 1 :{[0,0], [2,0], [2,2],
More informationThis exam contains 19 pages (including this cover page) and 10 questions. A Formulae sheet is provided with the exam.
Probability ad Statistics FS 07 Secod Sessio Exam 09.0.08 Time Limit: 80 Miutes Name: Studet ID: This exam cotais 9 pages (icludig this cover page) ad 0 questios. A Formulae sheet is provided with the
More information5. Likelihood Ratio Tests
1 of 5 7/29/2009 3:16 PM Virtual Laboratories > 9. Hy pothesis Testig > 1 2 3 4 5 6 7 5. Likelihood Ratio Tests Prelimiaries As usual, our startig poit is a radom experimet with a uderlyig sample space,
More informationProblem Set 4 Due Oct, 12
EE226: Radom Processes i Systems Lecturer: Jea C. Walrad Problem Set 4 Due Oct, 12 Fall 06 GSI: Assae Gueye This problem set essetially reviews detectio theory ad hypothesis testig ad some basic otios
More informationLecture 1 Probability and Statistics
Wikipedia: Lecture 1 Probability ad Statistics Bejami Disraeli, British statesma ad literary figure (1804 1881): There are three kids of lies: lies, damed lies, ad statistics. popularized i US by Mark
More informationRead through these prior to coming to the test and follow them when you take your test.
Math 143 Sprig 2012 Test 2 Iformatio 1 Test 2 will be give i class o Thursday April 5. Material Covered The test is cummulative, but will emphasize the recet material (Chapters 6 8, 10 11, ad Sectios 12.1
More informationThis is an introductory course in Analysis of Variance and Design of Experiments.
1 Notes for M 384E, Wedesday, Jauary 21, 2009 (Please ote: I will ot pass out hard-copy class otes i future classes. If there are writte class otes, they will be posted o the web by the ight before class
More informationTopic 10: The Law of Large Numbers
Topic : October 6, 2 If we choose adult Europea males idepedetly ad measure their heights, keepig a ruig average, the at the begiig we might see some larger fluctuatios but as we cotiue to make measuremets,
More informationAn Introduction to Randomized Algorithms
A Itroductio to Radomized Algorithms The focus of this lecture is to study a radomized algorithm for quick sort, aalyze it usig probabilistic recurrece relatios, ad also provide more geeral tools for aalysis
More informationDiscrete Mathematics for CS Spring 2007 Luca Trevisan Lecture 22
CS 70 Discrete Mathematics for CS Sprig 2007 Luca Trevisa Lecture 22 Aother Importat Distributio The Geometric Distributio Questio: A biased coi with Heads probability p is tossed repeatedly util the first
More informationOn an Application of Bayesian Estimation
O a Applicatio of ayesia Estimatio KIYOHARU TANAKA School of Sciece ad Egieerig, Kiki Uiversity, Kowakae, Higashi-Osaka, JAPAN Email: ktaaka@ifokidaiacjp EVGENIY GRECHNIKOV Departmet of Mathematics, auma
More informationStatisticians use the word population to refer the total number of (potential) observations under consideration
6 Samplig Distributios Statisticias use the word populatio to refer the total umber of (potetial) observatios uder cosideratio The populatio is just the set of all possible outcomes i our sample space
More informationA quick activity - Central Limit Theorem and Proportions. Lecture 21: Testing Proportions. Results from the GSS. Statistics and the General Population
A quick activity - Cetral Limit Theorem ad Proportios Lecture 21: Testig Proportios Statistics 10 Coli Rudel Flip a coi 30 times this is goig to get loud! Record the umber of heads you obtaied ad calculate
More informationFirst Year Quantitative Comp Exam Spring, Part I - 203A. f X (x) = 0 otherwise
First Year Quatitative Comp Exam Sprig, 2012 Istructio: There are three parts. Aswer every questio i every part. Questio I-1 Part I - 203A A radom variable X is distributed with the margial desity: >
More informationProperties and Hypothesis Testing
Chapter 3 Properties ad Hypothesis Testig 3.1 Types of data The regressio techiques developed i previous chapters ca be applied to three differet kids of data. 1. Cross-sectioal data. 2. Time series data.
More informationProbability and statistics: basic terms
Probability ad statistics: basic terms M. Veeraraghava August 203 A radom variable is a rule that assigs a umerical value to each possible outcome of a experimet. Outcomes of a experimet form the sample
More information[ ] ( ) ( ) [ ] ( ) 1 [ ] [ ] Sums of Random Variables Y = a 1 X 1 + a 2 X 2 + +a n X n The expected value of Y is:
PROBABILITY FUNCTIONS A radom variable X has a probabilit associated with each of its possible values. The probabilit is termed a discrete probabilit if X ca assume ol discrete values, or X = x, x, x 3,,
More informationSample Size Estimation in the Proportional Hazards Model for K-sample or Regression Settings Scott S. Emerson, M.D., Ph.D.
ample ie Estimatio i the Proportioal Haards Model for K-sample or Regressio ettigs cott. Emerso, M.D., Ph.D. ample ie Formula for a Normally Distributed tatistic uppose a statistic is kow to be ormally
More informationBayesian Methods: Introduction to Multi-parameter Models
Bayesia Methods: Itroductio to Multi-parameter Models Parameter: θ = ( θ, θ) Give Likelihood p(y θ) ad prior p(θ ), the posterior p proportioal to p(y θ) x p(θ ) Margial posterior ( θ, θ y) is Iterested
More informationDiscrete Mathematics and Probability Theory Spring 2016 Rao and Walrand Note 19
CS 70 Discrete Mathematics ad Probability Theory Sprig 2016 Rao ad Walrad Note 19 Some Importat Distributios Recall our basic probabilistic experimet of tossig a biased coi times. This is a very simple
More informationAnalysis of Experimental Data
Aalysis of Experimetal Data 6544597.0479 ± 0.000005 g Quatitative Ucertaity Accuracy vs. Precisio Whe we make a measuremet i the laboratory, we eed to kow how good it is. We wat our measuremets to be both
More informationEstimation of a population proportion March 23,
1 Social Studies 201 Notes for March 23, 2005 Estimatio of a populatio proportio Sectio 8.5, p. 521. For the most part, we have dealt with meas ad stadard deviatios this semester. This sectio of the otes
More informationCHAPTER 8 FUNDAMENTAL SAMPLING DISTRIBUTIONS AND DATA DESCRIPTIONS. 8.1 Random Sampling. 8.2 Some Important Statistics
CHAPTER 8 FUNDAMENTAL SAMPLING DISTRIBUTIONS AND DATA DESCRIPTIONS 8.1 Radom Samplig The basic idea of the statistical iferece is that we are allowed to draw ifereces or coclusios about a populatio based
More informationDistribution of Random Samples & Limit theorems
STAT/MATH 395 A - PROBABILITY II UW Witer Quarter 2017 Néhémy Lim Distributio of Radom Samples & Limit theorems 1 Distributio of i.i.d. Samples Motivatig example. Assume that the goal of a study is to
More informationn outcome is (+1,+1, 1,..., 1). Let the r.v. X denote our position (relative to our starting point 0) after n moves. Thus X = X 1 + X 2 + +X n,
CS 70 Discrete Mathematics for CS Sprig 2008 David Wager Note 9 Variace Questio: At each time step, I flip a fair coi. If it comes up Heads, I walk oe step to the right; if it comes up Tails, I walk oe
More informationElements of Statistical Methods Lots of Data or Large Samples (Ch 8)
Elemets of Statistical Methods Lots of Data or Large Samples (Ch 8) Fritz Scholz Sprig Quarter 2010 February 26, 2010 x ad X We itroduced the sample mea x as the average of the observed sample values x
More informationLogit regression Logit regression
Logit regressio Logit regressio models the probability of Y= as the cumulative stadard logistic distributio fuctio, evaluated at z = β 0 + β X: Pr(Y = X) = F(β 0 + β X) F is the cumulative logistic distributio
More information7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals
7-1 Chapter 4 Part I. Samplig Distributios ad Cofidece Itervals 1 7- Sectio 1. Samplig Distributio 7-3 Usig Statistics Statistical Iferece: Predict ad forecast values of populatio parameters... Test hypotheses
More informationAdvanced Stochastic Processes.
Advaced Stochastic Processes. David Gamarik LECTURE 2 Radom variables ad measurable fuctios. Strog Law of Large Numbers (SLLN). Scary stuff cotiued... Outlie of Lecture Radom variables ad measurable fuctios.
More informationSTAC51: Categorical data Analysis
STAC51: Categorical data Aalysis Mahida Samarakoo Jauary 28, 2016 Mahida Samarakoo STAC51: Categorical data Aalysis 1 / 35 Table of cotets Iferece for Proportios 1 Iferece for Proportios Mahida Samarakoo
More informationStatistical Inference Based on Extremum Estimators
T. Rotheberg Fall, 2007 Statistical Iferece Based o Extremum Estimators Itroductio Suppose 0, the true value of a p-dimesioal parameter, is kow to lie i some subset S R p : Ofte we choose to estimate 0
More informationHypothesis Testing. Evaluation of Performance of Learned h. Issues. Trade-off Between Bias and Variance
Hypothesis Testig Empirically evaluatig accuracy of hypotheses: importat activity i ML. Three questios: Give observed accuracy over a sample set, how well does this estimate apply over additioal samples?
More informationLinear Regression Models
Liear Regressio Models Dr. Joh Mellor-Crummey Departmet of Computer Sciece Rice Uiversity johmc@cs.rice.edu COMP 528 Lecture 9 15 February 2005 Goals for Today Uderstad how to Use scatter diagrams to ispect
More informationContinuous Data that can take on any real number (time/length) based on sample data. Categorical data can only be named or categorised
Questio 1. (Topics 1-3) A populatio cosists of all the members of a group about which you wat to draw a coclusio (Greek letters (μ, σ, Ν) are used) A sample is the portio of the populatio selected for
More informationCHAPTER 10 INFINITE SEQUENCES AND SERIES
CHAPTER 10 INFINITE SEQUENCES AND SERIES 10.1 Sequeces 10.2 Ifiite Series 10.3 The Itegral Tests 10.4 Compariso Tests 10.5 The Ratio ad Root Tests 10.6 Alteratig Series: Absolute ad Coditioal Covergece
More informationBig Picture. 5. Data, Estimates, and Models: quantifying the accuracy of estimates.
5. Data, Estimates, ad Models: quatifyig the accuracy of estimates. 5. Estimatig a Normal Mea 5.2 The Distributio of the Normal Sample Mea 5.3 Normal data, cofidece iterval for, kow 5.4 Normal data, cofidece
More informationLecture 12: September 27
36-705: Itermediate Statistics Fall 207 Lecturer: Siva Balakrisha Lecture 2: September 27 Today we will discuss sufficiecy i more detail ad the begi to discuss some geeral strategies for costructig estimators.
More informationof the matrix is =-85, so it is not positive definite. Thus, the first
BOSTON COLLEGE Departmet of Ecoomics EC771: Ecoometrics Sprig 4 Prof. Baum, Ms. Uysal Solutio Key for Problem Set 1 1. Are the followig quadratic forms positive for all values of x? (a) y = x 1 8x 1 x
More informationChapter 8: Estimating with Confidence
Chapter 8: Estimatig with Cofidece Sectio 8.2 The Practice of Statistics, 4 th editio For AP* STARNES, YATES, MOORE Chapter 8 Estimatig with Cofidece 8.1 Cofidece Itervals: The Basics 8.2 8.3 Estimatig
More informationMATH/STAT 352: Lecture 15
MATH/STAT 352: Lecture 15 Sectios 5.2 ad 5.3. Large sample CI for a proportio ad small sample CI for a mea. 1 5.2: Cofidece Iterval for a Proportio Estimatig proportio of successes i a biomial experimet
More informationf X (12) = Pr(X = 12) = Pr({(6, 6)}) = 1/36
Probability Distributios A Example With Dice If X is a radom variable o sample space S, the the probablity that X takes o the value c is Similarly, Pr(X = c) = Pr({s S X(s) = c} Pr(X c) = Pr({s S X(s)
More informationStat410 Probability and Statistics II (F16)
Some Basic Cocepts of Statistical Iferece (Sec 5.) Suppose we have a rv X that has a pdf/pmf deoted by f(x; θ) or p(x; θ), where θ is called the parameter. I previous lectures, we focus o probability problems
More informationConfidence Intervals
Cofidece Itervals Berli Che Deartmet of Comuter Sciece & Iformatio Egieerig Natioal Taiwa Normal Uiversity Referece: 1. W. Navidi. Statistics for Egieerig ad Scietists. Chater 5 & Teachig Material Itroductio
More informationGoodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen)
Goodess-of-Fit Tests ad Categorical Data Aalysis (Devore Chapter Fourtee) MATH-252-01: Probability ad Statistics II Sprig 2019 Cotets 1 Chi-Squared Tests with Kow Probabilities 1 1.1 Chi-Squared Testig................
More informationCorrelation Regression
Correlatio Regressio While correlatio methods measure the stregth of a liear relatioship betwee two variables, we might wish to go a little further: How much does oe variable chage for a give chage i aother
More informationChapter 3. Strong convergence. 3.1 Definition of almost sure convergence
Chapter 3 Strog covergece As poited out i the Chapter 2, there are multiple ways to defie the otio of covergece of a sequece of radom variables. That chapter defied covergece i probability, covergece i
More informationAAEC/ECON 5126 FINAL EXAM: SOLUTIONS
AAEC/ECON 5126 FINAL EXAM: SOLUTIONS SPRING 2015 / INSTRUCTOR: KLAUS MOELTNER This exam is ope-book, ope-otes, but please work strictly o your ow. Please make sure your ame is o every sheet you re hadig
More informationJanuary 25, 2017 INTRODUCTION TO MATHEMATICAL STATISTICS
Jauary 25, 207 INTRODUCTION TO MATHEMATICAL STATISTICS Abstract. A basic itroductio to statistics assumig kowledge of probability theory.. Probability I a typical udergraduate problem i probability, we
More informationThe variance of a sum of independent variables is the sum of their variances, since covariances are zero. Therefore. V (xi )= n n 2 σ2 = σ2.
SAMPLE STATISTICS A radom sample x 1,x,,x from a distributio f(x) is a set of idepedetly ad idetically variables with x i f(x) for all i Their joit pdf is f(x 1,x,,x )=f(x 1 )f(x ) f(x )= f(x i ) The sample
More informationLecture 6 Simple alternatives and the Neyman-Pearson lemma
STATS 00: Itroductio to Statistical Iferece Autum 06 Lecture 6 Simple alteratives ad the Neyma-Pearso lemma Last lecture, we discussed a umber of ways to costruct test statistics for testig a simple ull
More information7.1 Convergence of sequences of random variables
Chapter 7 Limit theorems Throughout this sectio we will assume a probability space (Ω, F, P), i which is defied a ifiite sequece of radom variables (X ) ad a radom variable X. The fact that for every ifiite
More information