4.5 Multiple Imputation
|
|
- Elmer Norman
- 5 years ago
- Views:
Transcription
1 45 ultiple Imputatio Itroductio Assume a parametric model: y fy x; θ We are iterested i makig iferece about θ I Bayesia approach, we wat to make iferece about θ from fθ x, y = πθfy x, θ πθfy x, θdθ where πθ is a prior distributio, which is assumed to be kow for simplicity here The poit estimator is ˆθ = E{θ x, y} ad its variace estimator is ˆV = V {θ x, y} 2 We may express ˆθ = ˆθ x, y ad ˆV = ˆV x, y Now, cosider the case whe x is always observed ad y is subject to missigess Let y = y obs, y mis be the observed, missig part of the sample We have two approaches of makig iferece about θ usig the observed data x, y obs : Direct Bayesia approach: Cosider fθ x, y obs = πθfy obs x, θ πθfyobs x, θdθ
2 ad use ˆθ r = E{θ x, y obs } 3 ad its variace estimator is ˆV r = V {θ x, y obs } 4 2 ultiple imputatio approach: Compariso a For each k, geerate y k mis from fy mis x, y obs b Apply the k-th imputed values to ˆθ i to obtai ˆθ k Also, apply the k-th imputed values to ˆV i 2 to obtai ˆV x, y obs, y k mis c Combie the poit estimators to get as a poit estimator of θ ˆθ I = d The variace estimator of ˆθ I is where ˆV I = W + W = B = ˆθ k + B ˆV k ˆθ k θ I 2 Bayesia Frequetist odel Posterior distributio Predictio model flatet, θ data flatet data, θ Computatio Data augmetatio E algorithm Predictio I-step E-step Parameter update P-step -step Parameter est Posterior mode L estimatio Imputatio ultiple imputatio Fractioal imputatio Variace estimatio Rubi s formula Liearizatio or Bootstrap = ˆθ x, y obs, y k ˆV k = mis 2
3 2 ai Result Bayesia Properties Rubi, 987: For sufficietly large, we have ˆθ I = ˆθ k = Eθ x, y obs, y k mis = E{Eθ x, y obs, Y mis x, y obs } = Eθ x, y obs, which is equal to ˆθ r i 3 Also, for sufficietly large, ˆV I = W + B = ˆV k + ˆθ k θ 2 I = E{V θ x, y obs, Y mis x, y obs } + V {Eθ x, y obs, Y mis x, y obs }, which is equal to ˆV r i 4 2 Frequetist Properties Wag ad Robis, 998 Assume that, uder complete data, ˆθ is the LE of θ ad ˆV = {Iˆθ } is asymptotically ubiased for V ˆθ Uder the existece of missig data, ˆθ I is asymptotically equivalet to the LE of θ ad ˆV I is approximately ubiased for V ˆθ I That is, ˆθ I = ˆθLE 5 ad E{ ˆV I } = V ˆθ I 6 for sufficietly large ad See Appedix A for a sketched proof 3
4 3 Computatio Gibbs samplig Gema ad Gema 984: the Gibbs sampler for Bayesia image recostructio Taer ad Wog 987: data augmetatio for Bayesia iferece i geeric missig-data problems Gelfad ad Smith 990: simulatio of margial distributios by repeated draws from coditioals Idea for Gibbs samplig: Sample from coditioal distributios Give Z t = Z t, Z t 2,, Z t J, draw Z t+ by samplig from the full coditioals of f, Z t+ P Z t+ 2 P Z t+ J P Z Z t 2, Z t 3,, Z t J Z 2 Z t, Z t 3,, Z t J Z J Z t, Z t 2,, Z t J Uder mild regularity coditios, P Z t f as t Data augmetatio: Applicatio of the Gibbs samplig to missig data problem y = observed data z = missig data θ = model parameters Predictive distributio: P z y = Posterior distributio: P θ y = P z y, θ dp θ y P y y, z dp z y 4
5 Algorithm: Iterative method of data augmetatio I-step: Draw P-step: Draw z t+ P z y, θ t θ t+ P θ y, z t+ Two uses of data augmetatio Parameter simulatio: collect ad summarize a sequece of depedet draws of θ, θ t+, θ t+2,, θ t+n, where t is large eough to esure statioarity ultiple imputatio: collect idepedet draws of z, z t, z 2t,, z mt Parameter simulatio Bayesia approach θ = compoet of fuctio of θ of iterest Collect iterates of θ from data augmetatio θ t+, θ t+2,, θ t+n, where t is large eough to esure statioarity ad N is the ote Carlo sample size θ = N N N N θt+k estimates the posterior mea E θ y 2 θ t+k θ estimates the posterior variace V θ y The 25th ad 975th percetiles of θ t+, θ t+2,, θ t+n estimate the edpoits of a 95% equal-tailed Bayesia iterval for θ 5
6 4 Examples 4 Example Uivariate Normal distributio Let y,, y be IID observatios from Nµ, σ 2 ad oly the first r elemets are observed ad the remaiig r elemets are missig Assume that the respose mechaism is igorable Bayesia imputatio: the j-th posterior values of µ, σ 2 are geerated from σ j2 y r rˆσ r/χ 2 2 r 7 ad µ j y r, σ j2 N ȳ r, r σ j2 8 where y r = y,, y r, ȳ r = r r i= y i, ad ˆσ 2 r = r r i= y i ȳ r 2 Give the posterior sample µ j, σ j2, the imputed values are geerated from idepedetly for i = r +,, y j i y r, µ j, σ j2 N µ j, σ j2 9 Let θ = EY be the parameter of iterest ad the I estimator of θ ca be expressed as where The, ˆθ I = ȳ r + r ˆθ j I j= ˆθ I = { r = y i + i= j= µ j ȳ r + ˆθ j I i=r+ y j i } i=r+ j= y j i µ j 0 Asymptotically, the first term has mea µ ad variace r σ 2, the secod term has mea zero ad variace r/ 2 σ 2 /mr, the third term has mea zero ad variace σ 2 r/ 2 m, ad the three terms are mutually idepedet Thus, the variace of ˆθ I is V ˆθI = r σ2 + 2 r r σ2 + r σ2 6
7 For variace estimatio, ote that V y j i = V ȳ r + V µ j ȳ r + V y j i µ j = r σ2 + r + r + r σ2 + σ 2 r r = σ 2 Writig ˆV j I ˆθ = { i= ỹ j i ỹ j k } 2 i= 2 = ỹ j i µ where ỹi = δ i y i + δ i y j i, we have E { } j ˆV I ˆθ = { i= E = [σ 2 = σ 2 2 ỹ j i µ V { r σ2 + r ỹ j k µ ỹ j k 2 } 2 r σ2 + r σ2 }] which shows that EW = V ˆθ Also, EB m = V = V = = ˆθ I { r 2 r r ˆθ Cov I, 2 ˆθ I µ ȳ r + σ 2 σ 2 r + r Thus, Rubi s variace estimator satisfies { } E ˆVI ˆθ I = r σ2 + r which is cosistet with the geeral result i 6 i=r+ y i µ } 2 r σ2 + r σ2 = V ˆθI, 7
8 42 Example 2 Cesored regressio model, or Tobit model odel z i = x iβ + ɛ i ɛ i N 0, σ 2 y i = { zi if z i c i c i if z i < c i Data augmetatio I-step: Give θ t = β t, σ t, geerate the imputed value for δ i = 0 from with ɛ t i If δ i =, the z t+ i = z i 2 P-step: Give z t+ = z t+ i = x iβ t + ɛ t i φ s Φ [c i x i βt /σ t ] z t+, z t+ 2,, z t+ θ t+ P θ z t+, geerate θ t+ from That is, geerate σ 2t+ z t+ p ˆσ 2t+ /χ 2 p ad where ad β t+ z t+, σ 2t+ N ˆβt+, x i x i σ 2t+, ˆβ t+ = i= x i x i i= i= x i z t+ i ˆσ 2t+ = p z t+ I P x z t+ 8
9 43 Example 3 Bayesia bootstrap Noparametric approach to Bayesia imputatio First proposed by Rubi 98 Assume that a elemet of the populatio takes oe of the values d,, d K with probability p,, p K, respectively That is, we assume P Y = d k = p k, K p k = 2 Let y,, y be a IID sample from 2 ad let k be the umber of y i equal to d k The parameter is a vector of probabilities p = p,, p K, such that K i= p i = I this case, the populatio mea θ = EY ca be expressed as θ = K i= p id i ad we oly eed to estimate p If the improper Dirichlet prior with desity proportioal to K p k is placed o the vector p, the the posterior distributio of p is proportioal to K p k k which is a Dirichlet distributio with parameter,, K This posterior distributio ca be simulated usig idepedet uiform radom umbers Let u,, u be IID U0,, ad let g i = u i u i, i =, 2,, where u k is the k-th order statistic of u,, u with u 0 = 0 ad u = Partitio the g,, g ito K collectios, with the k-th oe havig k elemets, ad let p k be the sum of the g i i the k-th collectio The, the realized value of p,, p k follows a K -variate Dirichlet distributio with parameter,, K I particular, if K =, the g,, g is the vector of probabilities to attach to the data values y,, y i that Bayesia bootstrap replicatio To implemet Rubi s Bayesia bootstrap to multiple imputatio, assume that the first r elemets are observed ad the remaiig r elemets are missig The imputed values ca be geerated with the followig steps: 9
10 [Step ] From y r = y,, y r, geerate p r = p,, p r from the posterior distributio usig the Bayesia bootstrap as follows Geerate u,, u r idepedetly from U0, ad sort them to get 0 = u 0 < u < < u r < u r = 2 Compute p i = u i u i, i =, 2,, r ad p r = r i= p i [Step 2] Select the imputed value of y i by y with probability p yi = y r with probability p r idepedetly for each i = r +,, Rubi ad Scheker 986 proposed a approximatio of this Bayesia bootstrap method, called the approximate Bayesia boostrap ABB method, which provides a alterative approach of geeratig imputed values from the empirical distributio The ABB method ca be described as follows: [Step ] From y r = y,, y r, geerate a door set y r = y,, y r by bootstrappig That is, we select y with probability /r yi = y r with probability /r idepedetly for each i =,, r [Step 2] From the door set yr = y,, yr, select a imputed value of y i by y i = idepedetly for each i = r +,, y with probability /r yr with probability /r 0
11 Referece Gelfad, AE ad Smith, AF 990 Samplig-based approaches to calculatig margial desities, Joural of the America Statistical Associatio, 85, Gema, S ad Gema, D 984 Stochastic relaxatio, Gibbs distributios, ad the Bayesia restoratio of images IEEE Trasactios o Patter Aalysis ad achie Itelligece, 62, Rubi, D B 98 The Bayesia bootstrap The Aals of Statistics, 9, Rubi, D B 987 ultiple Imputatio for Norespose i Surveys, Joh Wiley & Sos, New York Rubi, DB ad Scheker, N 986 ultiple imputatio for iterval estimatio from simple radom samples with igorable orespose Joural of the America Statistical Associatio 8, Taer, A ad Wog, WH 987 The calculatio of posterior distributio by data augmetatio Joural of the America Statistical Associatio 82, Wag, N ad Robis, J 998 Large-sample theory for parametric multiple imputatio procedure, Biometrika, 85,
12 Appedix A Proof of 5 ad 6 Let S com θ = Sθ; x, y be the score fuctio of θ uder complete respose The LE uder complete respose, deoted by ˆθ, is asymptotically equivalet to where I com = E{ S com θ/ θ } Thus, ˆθ I = ˆθ k where y k mis = θ + I com = θ + I + I com com ˆθ = θ + I coms com θ, Sθ; x, y obs, y k mis E{Sθ; x, y obs, Y mis x, y obs ; θ k } [ ] Sθ; x, y obs, y k mis E{Sθ; x, y obs, Y mis x, y obs ; θ k }, fy mis x, y obs ; θ k ad θ k pθ x, y obs Uder some regularity coditios, the posterior distributio coverges to a ormal distributio with mea ˆθ LE ad variace I obs = V ˆθ LE This is ofte called Berstei vo ises theorem Thus, we ca apply Taylor liearizatio o Sθ; x, y obs, y k mis with respect to θ k aroud the true θ to get E{Sθ; x, y obs, Y mis x, y obs ; θ k } = E{Sθ; x, y obs, Y mis x, y obs ; θ} + I mis θ k θ = S obs θ + I mis ˆθ LE θ + I mis θ k ˆθ LE, 3 where I mis is the iformatio matrix associated with fy mis x, y obs ; θ Sice ˆθ LE is the solutio to S obs θ = 0, we have ad 3 further simplifies to ˆθ LE = θ + I obs S obsθ E{Sθ; x, y obs, Y mis x, y obs ; θ k } = S obs θ + I mis I obs S obsθ + I mis θ k ˆθ LE = I com I obs S obsθ + I mis θ k ˆθ LE 4 2
13 Thus, combiig all terms together, we have ˆθ k = ˆθ LE + I comi mis θ k ˆθ LE + I com { S k ES x, y obs ; θ k } 5 where S k = Sθ; x, y obs, y k mis Therefore, ˆθ I = ˆθ LE + IcomI mis θ k ˆθ LE + I com = ˆθ LE + I comi mis { +I com [ ] Sθ; x, y obs, y k mis E{Sθ; x, y obs, Y mis x, y obs ; θ k } θ k ˆθ LE } { S k θ ES x, y obs ; θ k } Note that the secod term reflects the variability due to geeratig θ ad the third therm reflects the variability due to geeratig y k mis three terms are idepedet ad so we obtai from fy mis x, y obs ; θ k The V {ˆθ I } = V ˆθ LE + I comi mis I obs I misicom + I comi mis Icom 6 The last two terms are egligible for large To prove 6, ote first that EV = V ˆθ = I com ad so we have EW = I com Now, isertig 5 ito we have B = EB = V ˆθ ˆθ k ˆθ I 2, Covˆθ 2, ˆθ = I comi mis I obs I misi com + I comi mis I com = I obs I com, where the last equality follows from A + BCB = A A BCB A + A BCB A + BCB BCB A with A = I com, B = I, ad C = I mis Therefore, E ˆV I = I obs = V ˆθ LE = V ˆθ I 3
5. Fractional Hot deck Imputation
5. Fractioal Hot deck Imputatio Itroductio Suppose that we are iterested i estimatig θ EY or eve θ 2 P ry < c where y fy x where x is always observed ad y is subject to missigess. Assume MAR i the sese
More informationIt should be unbiased, or approximately unbiased. Variance of the variance estimator should be small. That is, the variance estimator is stable.
Chapter 10 Variace Estimatio 10.1 Itroductio Variace estimatio is a importat practical problem i survey samplig. Variace estimates are used i two purposes. Oe is the aalytic purpose such as costructig
More informationResampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.
Jauary 1, 2019 Resamplig Methods Motivatio We have so may estimators with the property θ θ d N 0, σ 2 We ca also write θ a N θ, σ 2 /, where a meas approximately distributed as Oce we have a cosistet estimator
More informationTopic 9: Sampling Distributions of Estimators
Topic 9: Samplig Distributios of Estimators Course 003, 2016 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be
More informationElement sampling: Part 2
Chapter 4 Elemet samplig: Part 2 4.1 Itroductio We ow cosider uequal probability samplig desigs which is very popular i practice. I the uequal probability samplig, we ca improve the efficiecy of the resultig
More informationTopic 9: Sampling Distributions of Estimators
Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be
More informationSimulation. Two Rule For Inverting A Distribution Function
Simulatio Two Rule For Ivertig A Distributio Fuctio Rule 1. If F(x) = u is costat o a iterval [x 1, x 2 ), the the uiform value u is mapped oto x 2 through the iversio process. Rule 2. If there is a jump
More informationLecture 33: Bootstrap
Lecture 33: ootstrap Motivatio To evaluate ad compare differet estimators, we eed cosistet estimators of variaces or asymptotic variaces of estimators. This is also importat for hypothesis testig ad cofidece
More informationBayesian Methods: Introduction to Multi-parameter Models
Bayesia Methods: Itroductio to Multi-parameter Models Parameter: θ = ( θ, θ) Give Likelihood p(y θ) ad prior p(θ ), the posterior p proportioal to p(y θ) x p(θ ) Margial posterior ( θ, θ y) is Iterested
More informationTopic 9: Sampling Distributions of Estimators
Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be
More informationMATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4
MATH 30: Probability ad Statistics 9. Estimatio ad Testig of Parameters Estimatio ad Testig of Parameters We have bee dealig situatios i which we have full kowledge of the distributio of a radom variable.
More informationECONOMETRIC THEORY. MODULE XIII Lecture - 34 Asymptotic Theory and Stochastic Regressors
ECONOMETRIC THEORY MODULE XIII Lecture - 34 Asymptotic Theory ad Stochastic Regressors Dr. Shalabh Departmet of Mathematics ad Statistics Idia Istitute of Techology Kapur Asymptotic theory The asymptotic
More informationParameter, Statistic and Random Samples
Parameter, Statistic ad Radom Samples A parameter is a umber that describes the populatio. It is a fixed umber, but i practice we do ot kow its value. A statistic is a fuctio of the sample data, i.e.,
More informationLecture 3. Properties of Summary Statistics: Sampling Distribution
Lecture 3 Properties of Summary Statistics: Samplig Distributio Mai Theme How ca we use math to justify that our umerical summaries from the sample are good summaries of the populatio? Lecture Summary
More informationLecture 2: Monte Carlo Simulation
STAT/Q SCI 43: Itroductio to Resamplig ethods Sprig 27 Istructor: Ye-Chi Che Lecture 2: ote Carlo Simulatio 2 ote Carlo Itegratio Assume we wat to evaluate the followig itegratio: e x3 dx What ca we do?
More informationStatistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.
Statistical Iferece (Chapter 10) Statistical iferece = lear about a populatio based o the iformatio provided by a sample. Populatio: The set of all values of a radom variable X of iterest. Characterized
More informationEstimation for Complete Data
Estimatio for Complete Data complete data: there is o loss of iformatio durig study. complete idividual complete data= grouped data A complete idividual data is the oe i which the complete iformatio of
More informationIntroductory statistics
CM9S: Machie Learig for Bioiformatics Lecture - 03/3/06 Itroductory statistics Lecturer: Sriram Sakararama Scribe: Sriram Sakararama We will provide a overview of statistical iferece focussig o the key
More informationStatistical Inference Based on Extremum Estimators
T. Rotheberg Fall, 2007 Statistical Iferece Based o Extremum Estimators Itroductio Suppose 0, the true value of a p-dimesioal parameter, is kow to lie i some subset S R p : Ofte we choose to estimate 0
More informationThe variance of a sum of independent variables is the sum of their variances, since covariances are zero. Therefore. V (xi )= n n 2 σ2 = σ2.
SAMPLE STATISTICS A radom sample x 1,x,,x from a distributio f(x) is a set of idepedetly ad idetically variables with x i f(x) for all i Their joit pdf is f(x 1,x,,x )=f(x 1 )f(x ) f(x )= f(x i ) The sample
More information1.010 Uncertainty in Engineering Fall 2008
MIT OpeCourseWare http://ocw.mit.edu.00 Ucertaity i Egieerig Fall 2008 For iformatio about citig these materials or our Terms of Use, visit: http://ocw.mit.edu.terms. .00 - Brief Notes # 9 Poit ad Iterval
More informationECE 901 Lecture 12: Complexity Regularization and the Squared Loss
ECE 90 Lecture : Complexity Regularizatio ad the Squared Loss R. Nowak 5/7/009 I the previous lectures we made use of the Cheroff/Hoeffdig bouds for our aalysis of classifier errors. Hoeffdig s iequality
More informationDS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10
DS 00: Priciples ad Techiques of Data Sciece Date: April 3, 208 Name: Hypothesis Testig Discussio #0. Defie these terms below as they relate to hypothesis testig. a) Data Geeratio Model: Solutio: A set
More informationLecture 19: Convergence
Lecture 19: Covergece Asymptotic approach I statistical aalysis or iferece, a key to the success of fidig a good procedure is beig able to fid some momets ad/or distributios of various statistics. I may
More informationFirst Year Quantitative Comp Exam Spring, Part I - 203A. f X (x) = 0 otherwise
First Year Quatitative Comp Exam Sprig, 2012 Istructio: There are three parts. Aswer every questio i every part. Questio I-1 Part I - 203A A radom variable X is distributed with the margial desity: >
More informationof the matrix is =-85, so it is not positive definite. Thus, the first
BOSTON COLLEGE Departmet of Ecoomics EC771: Ecoometrics Sprig 4 Prof. Baum, Ms. Uysal Solutio Key for Problem Set 1 1. Are the followig quadratic forms positive for all values of x? (a) y = x 1 8x 1 x
More informationR. van Zyl 1, A.J. van der Merwe 2. Quintiles International, University of the Free State
Bayesia Cotrol Charts for the Two-parameter Expoetial Distributio if the Locatio Parameter Ca Take o Ay Value Betwee Mius Iity ad Plus Iity R. va Zyl, A.J. va der Merwe 2 Quitiles Iteratioal, ruaavz@gmail.com
More information1 Introduction to reducing variance in Monte Carlo simulations
Copyright c 010 by Karl Sigma 1 Itroductio to reducig variace i Mote Carlo simulatios 11 Review of cofidece itervals for estimatig a mea I statistics, we estimate a ukow mea µ = E(X) of a distributio by
More informationMachine Learning Brett Bernstein
Machie Learig Brett Berstei Week 2 Lecture: Cocept Check Exercises Starred problems are optioal. Excess Risk Decompositio 1. Let X = Y = {1, 2,..., 10}, A = {1,..., 10, 11} ad suppose the data distributio
More informationMachine Learning Brett Bernstein
Machie Learig Brett Berstei Week Lecture: Cocept Check Exercises Starred problems are optioal. Statistical Learig Theory. Suppose A = Y = R ad X is some other set. Furthermore, assume P X Y is a discrete
More informationRandom Variables, Sampling and Estimation
Chapter 1 Radom Variables, Samplig ad Estimatio 1.1 Itroductio This chapter will cover the most importat basic statistical theory you eed i order to uderstad the ecoometric material that will be comig
More informationEcon 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1.
Eco 325/327 Notes o Sample Mea, Sample Proportio, Cetral Limit Theorem, Chi-square Distributio, Studet s t distributio 1 Sample Mea By Hiro Kasahara We cosider a radom sample from a populatio. Defiitio
More informationA Note on Effi cient Conditional Simulation of Gaussian Distributions. April 2010
A Note o Effi ciet Coditioal Simulatio of Gaussia Distributios A D D C S S, U B C, V, BC, C April 2010 A Cosider a multivariate Gaussia radom vector which ca be partitioed ito observed ad uobserved compoetswe
More informationGoodness-Of-Fit For The Generalized Exponential Distribution. Abstract
Goodess-Of-Fit For The Geeralized Expoetial Distributio By Amal S. Hassa stitute of Statistical Studies & Research Cairo Uiversity Abstract Recetly a ew distributio called geeralized expoetial or expoetiated
More informationDepartment of Mathematics
Departmet of Mathematics Ma 3/103 KC Border Itroductio to Probability ad Statistics Witer 2017 Lecture 19: Estimatio II Relevat textbook passages: Larse Marx [1]: Sectios 5.2 5.7 19.1 The method of momets
More informationA NEW METHOD FOR CONSTRUCTING APPROXIMATE CONFIDENCE INTERVALS FOR M-ESTU1ATES. Dennis D. Boos
.- A NEW METHOD FOR CONSTRUCTING APPROXIMATE CONFIDENCE INTERVALS FOR M-ESTU1ATES by Deis D. Boos Departmet of Statistics North Carolia State Uiversity Istitute of Statistics Mimeo Series #1198 September,
More informationLet us give one more example of MLE. Example 3. The uniform distribution U[0, θ] on the interval [0, θ] has p.d.f.
Lecture 5 Let us give oe more example of MLE. Example 3. The uiform distributio U[0, ] o the iterval [0, ] has p.d.f. { 1 f(x =, 0 x, 0, otherwise The likelihood fuctio ϕ( = f(x i = 1 I(X 1,..., X [0,
More informationProperties and Hypothesis Testing
Chapter 3 Properties ad Hypothesis Testig 3.1 Types of data The regressio techiques developed i previous chapters ca be applied to three differet kids of data. 1. Cross-sectioal data. 2. Time series data.
More informationLecture 11 and 12: Basic estimation theory
Lecture ad 2: Basic estimatio theory Sprig 202 - EE 94 Networked estimatio ad cotrol Prof. Kha March 2 202 I. MAXIMUM-LIKELIHOOD ESTIMATORS The maximum likelihood priciple is deceptively simple. Louis
More informationStat 421-SP2012 Interval Estimation Section
Stat 41-SP01 Iterval Estimatio Sectio 11.1-11. We ow uderstad (Chapter 10) how to fid poit estimators of a ukow parameter. o However, a poit estimate does ot provide ay iformatio about the ucertaity (possible
More informationMonte Carlo Integration
Mote Carlo Itegratio I these otes we first review basic umerical itegratio methods (usig Riema approximatio ad the trapezoidal rule) ad their limitatios for evaluatig multidimesioal itegrals. Next we itroduce
More informationQuestions and Answers on Maximum Likelihood
Questios ad Aswers o Maximum Likelihood L. Magee Fall, 2008 1. Give: a observatio-specific log likelihood fuctio l i (θ) = l f(y i x i, θ) the log likelihood fuctio l(θ y, X) = l i(θ) a data set (x i,
More informationBull. Korean Math. Soc. 36 (1999), No. 3, pp. 451{457 THE STRONG CONSISTENCY OF NONLINEAR REGRESSION QUANTILES ESTIMATORS Seung Hoe Choi and Hae Kyung
Bull. Korea Math. Soc. 36 (999), No. 3, pp. 45{457 THE STRONG CONSISTENCY OF NONLINEAR REGRESSION QUANTILES ESTIMATORS Abstract. This paper provides suciet coditios which esure the strog cosistecy of regressio
More informationAsymptotics. Hypothesis Testing UMP. Asymptotic Tests and p-values
of the secod half Biostatistics 6 - Statistical Iferece Lecture 6 Fial Exam & Practice Problems for the Fial Hyu Mi Kag Apil 3rd, 3 Hyu Mi Kag Biostatistics 6 - Lecture 6 Apil 3rd, 3 / 3 Rao-Blackwell
More informationParameter, Statistic and Random Samples
Parameter, Statistic ad Radom Samples A parameter is a umber that describes the populatio. It is a fixed umber, but i practice we do ot kow its value. A statistic is a fuctio of the sample data, i.e.,
More informationInvestigating the Significance of a Correlation Coefficient using Jackknife Estimates
Iteratioal Joural of Scieces: Basic ad Applied Research (IJSBAR) ISSN 2307-4531 (Prit & Olie) http://gssrr.org/idex.php?joural=jouralofbasicadapplied ---------------------------------------------------------------------------------------------------------------------------
More informationBasis for simulation techniques
Basis for simulatio techiques M. Veeraraghava, March 7, 004 Estimatio is based o a collectio of experimetal outcomes, x, x,, x, where each experimetal outcome is a value of a radom variable. x i. Defiitios
More informationA statistical method to determine sample size to estimate characteristic value of soil parameters
A statistical method to determie sample size to estimate characteristic value of soil parameters Y. Hojo, B. Setiawa 2 ad M. Suzuki 3 Abstract Sample size is a importat factor to be cosidered i determiig
More informationBIOSTATISTICS. Lecture 5 Interval Estimations for Mean and Proportion. dr. Petr Nazarov
Microarray Ceter BIOSTATISTICS Lecture 5 Iterval Estimatios for Mea ad Proportio dr. Petr Nazarov 15-03-013 petr.azarov@crp-sate.lu Lecture 5. Iterval estimatio for mea ad proportio OUTLINE Iterval estimatios
More informationPrecise Rates in Complete Moment Convergence for Negatively Associated Sequences
Commuicatios of the Korea Statistical Society 29, Vol. 16, No. 5, 841 849 Precise Rates i Complete Momet Covergece for Negatively Associated Sequeces Dae-Hee Ryu 1,a a Departmet of Computer Sciece, ChugWoo
More informationJoint Probability Distributions and Random Samples. Jointly Distributed Random Variables. Chapter { }
UCLA STAT A Applied Probability & Statistics for Egieers Istructor: Ivo Diov, Asst. Prof. I Statistics ad Neurology Teachig Assistat: Neda Farziia, UCLA Statistics Uiversity of Califoria, Los Ageles, Sprig
More informationStatistical inference: example 1. Inferential Statistics
Statistical iferece: example 1 Iferetial Statistics POPULATION SAMPLE A clothig store chai regularly buys from a supplier large quatities of a certai piece of clothig. Each item ca be classified either
More informationBinomial Distribution
0.0 0.5 1.0 1.5 2.0 2.5 3.0 0 1 2 3 4 5 6 7 0.0 0.5 1.0 1.5 2.0 2.5 3.0 Overview Example: coi tossed three times Defiitio Formula Recall that a r.v. is discrete if there are either a fiite umber of possible
More informationConfidence Interval for Standard Deviation of Normal Distribution with Known Coefficients of Variation
Cofidece Iterval for tadard Deviatio of Normal Distributio with Kow Coefficiets of Variatio uparat Niwitpog Departmet of Applied tatistics, Faculty of Applied ciece Kig Mogkut s Uiversity of Techology
More informationMOMENT-METHOD ESTIMATION BASED ON CENSORED SAMPLE
Vol. 8 o. Joural of Systems Sciece ad Complexity Apr., 5 MOMET-METHOD ESTIMATIO BASED O CESORED SAMPLE I Zhogxi Departmet of Mathematics, East Chia Uiversity of Sciece ad Techology, Shaghai 37, Chia. Email:
More informationPOWER COMPARISON OF EMPIRICAL LIKELIHOOD RATIO TESTS: SMALL SAMPLE PROPERTIES THROUGH MONTE CARLO STUDIES*
Kobe Uiversity Ecoomic Review 50(2004) 3 POWER COMPARISON OF EMPIRICAL LIKELIHOOD RATIO TESTS: SMALL SAMPLE PROPERTIES THROUGH MONTE CARLO STUDIES* By HISASHI TANIZAKI There are various kids of oparametric
More informationThis is an introductory course in Analysis of Variance and Design of Experiments.
1 Notes for M 384E, Wedesday, Jauary 21, 2009 (Please ote: I will ot pass out hard-copy class otes i future classes. If there are writte class otes, they will be posted o the web by the ight before class
More informationEXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY
EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY GRADUATE DIPLOMA, 016 MODULE : Statistical Iferece Time allowed: Three hours Cadidates should aswer FIVE questios. All questios carry equal marks. The umber
More informationDirection: This test is worth 150 points. You are required to complete this test within 55 minutes.
Term Test 3 (Part A) November 1, 004 Name Math 6 Studet Number Directio: This test is worth 10 poits. You are required to complete this test withi miutes. I order to receive full credit, aswer each problem
More informationEconomics 241B Relation to Method of Moments and Maximum Likelihood OLSE as a Maximum Likelihood Estimator
Ecoomics 24B Relatio to Method of Momets ad Maximum Likelihood OLSE as a Maximum Likelihood Estimator Uder Assumptio 5 we have speci ed the distributio of the error, so we ca estimate the model parameters
More informationEcon 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara
Poit Estimator Eco 325 Notes o Poit Estimator ad Cofidece Iterval 1 By Hiro Kasahara Parameter, Estimator, ad Estimate The ormal probability desity fuctio is fully characterized by two costats: populatio
More information4. Partial Sums and the Central Limit Theorem
1 of 10 7/16/2009 6:05 AM Virtual Laboratories > 6. Radom Samples > 1 2 3 4 5 6 7 4. Partial Sums ad the Cetral Limit Theorem The cetral limit theorem ad the law of large umbers are the two fudametal theorems
More informationGUIDELINES ON REPRESENTATIVE SAMPLING
DRUGS WORKING GROUP VALIDATION OF THE GUIDELINES ON REPRESENTATIVE SAMPLING DOCUMENT TYPE : REF. CODE: ISSUE NO: ISSUE DATE: VALIDATION REPORT DWG-SGL-001 002 08 DECEMBER 2012 Ref code: DWG-SGL-001 Issue
More information7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals
7-1 Chapter 4 Part I. Samplig Distributios ad Cofidece Itervals 1 7- Sectio 1. Samplig Distributio 7-3 Usig Statistics Statistical Iferece: Predict ad forecast values of populatio parameters... Test hypotheses
More informationExponential Families and Bayesian Inference
Computer Visio Expoetial Families ad Bayesia Iferece Lecture Expoetial Families A expoetial family of distributios is a d-parameter family f(x; havig the followig form: f(x; = h(xe g(t T (x B(, (. where
More informationLecture 9: September 19
36-700: Probability ad Mathematical Statistics I Fall 206 Lecturer: Siva Balakrisha Lecture 9: September 9 9. Review ad Outlie Last class we discussed: Statistical estimatio broadly Pot estimatio Bias-Variace
More informationAccess to the published version may require journal subscription. Published with permission from: Elsevier.
This is a author produced versio of a paper published i Statistics ad Probability Letters. This paper has bee peer-reviewed, it does ot iclude the joural pagiatio. Citatio for the published paper: Forkma,
More information1 of 7 7/16/2009 6:06 AM Virtual Laboratories > 6. Radom Samples > 1 2 3 4 5 6 7 6. Order Statistics Defiitios Suppose agai that we have a basic radom experimet, ad that X is a real-valued radom variable
More informationTMA4245 Statistics. Corrected 30 May and 4 June Norwegian University of Science and Technology Department of Mathematical Sciences.
Norwegia Uiversity of Sciece ad Techology Departmet of Mathematical Scieces Corrected 3 May ad 4 Jue Solutios TMA445 Statistics Saturday 6 May 9: 3: Problem Sow desity a The probability is.9.5 6x x dx
More information3 Resampling Methods: The Jackknife
3 Resamplig Methods: The Jackkife 3.1 Itroductio I this sectio, much of the cotet is a summary of material from Efro ad Tibshirai (1993) ad Maly (2007). Here are several useful referece texts o resamplig
More informationA goodness-of-fit test based on the empirical characteristic function and a comparison of tests for normality
A goodess-of-fit test based o the empirical characteristic fuctio ad a compariso of tests for ormality J. Marti va Zyl Departmet of Mathematical Statistics ad Actuarial Sciece, Uiversity of the Free State,
More informationKLMED8004 Medical statistics. Part I, autumn Estimation. We have previously learned: Population and sample. New questions
We have previously leared: KLMED8004 Medical statistics Part I, autum 00 How kow probability distributios (e.g. biomial distributio, ormal distributio) with kow populatio parameters (mea, variace) ca give
More informationDiscrete Mathematics for CS Spring 2008 David Wagner Note 22
CS 70 Discrete Mathematics for CS Sprig 2008 David Wager Note 22 I.I.D. Radom Variables Estimatig the bias of a coi Questio: We wat to estimate the proportio p of Democrats i the US populatio, by takig
More informationChapter 6 Principles of Data Reduction
Chapter 6 for BST 695: Special Topics i Statistical Theory. Kui Zhag, 0 Chapter 6 Priciples of Data Reductio Sectio 6. Itroductio Goal: To summarize or reduce the data X, X,, X to get iformatio about a
More informationMA Advanced Econometrics: Properties of Least Squares Estimators
MA Advaced Ecoometrics: Properties of Least Squares Estimators Karl Whela School of Ecoomics, UCD February 5, 20 Karl Whela UCD Least Squares Estimators February 5, 20 / 5 Part I Least Squares: Some Fiite-Sample
More informationEstimating Confidence Interval of Mean Using. Classical, Bayesian, and Bootstrap Approaches
Iteratioal Joural of Mathematical Aalysis Vol. 8, 2014, o. 48, 2375-2383 HIKARI Ltd, www.m-hikari.com http://dx.doi.org/10.12988/ijma.2014.49287 Estimatig Cofidece Iterval of Mea Usig Classical, Bayesia,
More informationEconomics 326 Methods of Empirical Research in Economics. Lecture 18: The asymptotic variance of OLS and heteroskedasticity
Ecoomics 326 Methods of Empirical Research i Ecoomics Lecture 8: The asymptotic variace of OLS ad heteroskedasticity Hiro Kasahara Uiversity of British Columbia December 24, 204 Asymptotic ormality I I
More informationKurskod: TAMS11 Provkod: TENB 21 March 2015, 14:00-18:00. English Version (no Swedish Version)
Kurskod: TAMS Provkod: TENB 2 March 205, 4:00-8:00 Examier: Xiagfeg Yag (Tel: 070 2234765). Please aswer i ENGLISH if you ca. a. You are allowed to use: a calculator; formel -och tabellsamlig i matematisk
More informationSimple Linear Regression
Simple Liear Regressio 1. Model ad Parameter Estimatio (a) Suppose our data cosist of a collectio of pairs (x i, y i ), where x i is a observed value of variable X ad y i is the correspodig observatio
More informationAn Introduction to Randomized Algorithms
A Itroductio to Radomized Algorithms The focus of this lecture is to study a radomized algorithm for quick sort, aalyze it usig probabilistic recurrece relatios, ad also provide more geeral tools for aalysis
More information1 Inferential Methods for Correlation and Regression Analysis
1 Iferetial Methods for Correlatio ad Regressio Aalysis I the chapter o Correlatio ad Regressio Aalysis tools for describig bivariate cotiuous data were itroduced. The sample Pearso Correlatio Coefficiet
More informationOutput Analysis and Run-Length Control
IEOR E4703: Mote Carlo Simulatio Columbia Uiversity c 2017 by Marti Haugh Output Aalysis ad Ru-Legth Cotrol I these otes we describe how the Cetral Limit Theorem ca be used to costruct approximate (1 α%
More informationA Note on Box-Cox Quantile Regression Estimation of the Parameters of the Generalized Pareto Distribution
A Note o Box-Cox Quatile Regressio Estimatio of the Parameters of the Geeralized Pareto Distributio JM va Zyl Abstract: Makig use of the quatile equatio, Box-Cox regressio ad Laplace distributed disturbaces,
More informationLecture 18: Sampling distributions
Lecture 18: Samplig distributios I may applicatios, the populatio is oe or several ormal distributios (or approximately). We ow study properties of some importat statistics based o a radom sample from
More informationEstimation of the Mean and the ACVF
Chapter 5 Estimatio of the Mea ad the ACVF A statioary process {X t } is characterized by its mea ad its autocovariace fuctio γ ), ad so by the autocorrelatio fuctio ρ ) I this chapter we preset the estimators
More informationMOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND.
XI-1 (1074) MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND. R. E. D. WOOLSEY AND H. S. SWANSON XI-2 (1075) STATISTICAL DECISION MAKING Advaced
More informationChapter 2 The Monte Carlo Method
Chapter 2 The Mote Carlo Method The Mote Carlo Method stads for a broad class of computatioal algorithms that rely o radom sampligs. It is ofte used i physical ad mathematical problems ad is most useful
More informationBayesian and E- Bayesian Method of Estimation of Parameter of Rayleigh Distribution- A Bayesian Approach under Linex Loss Function
Iteratioal Joural of Statistics ad Systems ISSN 973-2675 Volume 12, Number 4 (217), pp. 791-796 Research Idia Publicatios http://www.ripublicatio.com Bayesia ad E- Bayesia Method of Estimatio of Parameter
More informationSTAC51: Categorical data Analysis
STAC51: Categorical data Aalysis Mahida Samarakoo Jauary 28, 2016 Mahida Samarakoo STAC51: Categorical data Aalysis 1 / 35 Table of cotets Iferece for Proportios 1 Iferece for Proportios Mahida Samarakoo
More informationProblem Set 4 Due Oct, 12
EE226: Radom Processes i Systems Lecturer: Jea C. Walrad Problem Set 4 Due Oct, 12 Fall 06 GSI: Assae Gueye This problem set essetially reviews detectio theory ad hypothesis testig ad some basic otios
More informationGoodness-Of-Fit For The Generalized Exponential Distribution. Abstract
Goodess-Of-Fit For The Geeralized Expoetial Distributio By Amal S. Hassa stitute of Statistical Studies & Research Cairo Uiversity Abstract Recetly a ew distributio called geeralized expoetial or expoetiated
More information5 : Exponential Family and Generalized Linear Models
0-708: Probabilistic Graphical Models 0-708, Sprig 206 5 : Expoetial Family ad Geeralized Liear Models Lecturer: Matthew Gormley Scribes: Yua Li, Yichog Xu, Silu Wag Expoetial Family Probability desity
More informationSTA6938-Logistic Regression Model
Dr. Yig Zhag STA6938-Logistic Regressio Model Topic -Simple (Uivariate) Logistic Regressio Model Outlies:. Itroductio. A Example-Does the liear regressio model always work? 3. Maximum Likelihood Curve
More informationSTATISTICAL INFERENCE
STATISTICAL INFERENCE POPULATION AND SAMPLE Populatio = all elemets of iterest Characterized by a distributio F with some parameter θ Sample = the data X 1,..., X, selected subset of the populatio = sample
More informationReview Questions, Chapters 8, 9. f(y) = 0, elsewhere. F (y) = f Y(1) = n ( e y/θ) n 1 1 θ e y/θ = n θ e yn
Stat 366 Lab 2 Solutios (September 2, 2006) page TA: Yury Petracheko, CAB 484, yuryp@ualberta.ca, http://www.ualberta.ca/ yuryp/ Review Questios, Chapters 8, 9 8.5 Suppose that Y, Y 2,..., Y deote a radom
More informationChi-Squared Tests Math 6070, Spring 2006
Chi-Squared Tests Math 6070, Sprig 2006 Davar Khoshevisa Uiversity of Utah February XXX, 2006 Cotets MLE for Goodess-of Fit 2 2 The Multiomial Distributio 3 3 Applicatio to Goodess-of-Fit 6 3 Testig for
More informationLecture 7: Properties of Random Samples
Lecture 7: Properties of Radom Samples 1 Cotiued From Last Class Theorem 1.1. Let X 1, X,...X be a radom sample from a populatio with mea µ ad variace σ
More informationLast Lecture. Wald Test
Last Lecture Biostatistics 602 - Statistical Iferece Lecture 22 Hyu Mi Kag April 9th, 2013 Is the exact distributio of LRT statistic typically easy to obtai? How about its asymptotic distributio? For testig
More informationLet A and B be two events such that P (B) > 0, then P (A B) = P (B A) P (A)/P (B).
1 Coditioal Probability Let A ad B be two evets such that P (B) > 0, the P (A B) P (A B)/P (B). Bayes Theorem Let A ad B be two evets such that P (B) > 0, the P (A B) P (B A) P (A)/P (B). Theorem of total
More informationx iu i E(x u) 0. In order to obtain a consistent estimator of β, we find the instrumental variable z which satisfies E(z u) = 0. z iu i E(z u) = 0.
27 However, β MM is icosistet whe E(x u) 0, i.e., β MM = (X X) X y = β + (X X) X u = β + ( X X ) ( X u ) \ β. Note as follows: X u = x iu i E(x u) 0. I order to obtai a cosistet estimator of β, we fid
More information