Multivariate Assays With Values Below the Lower Limit of Quantitation: Parametric Estimation By Imputation and Maximum Likelihood

Size: px
Start display at page:

Download "Multivariate Assays With Values Below the Lower Limit of Quantitation: Parametric Estimation By Imputation and Maximum Likelihood"

Transcription

1 Multivariate Assays With Values Below the Lower Limit of Quantitation: Parametric Estimation By Imputation and Maximum Likelihood Robert E. Johnson and Heather J. Hoffman 2* Department of Biostatistics, Virginia Commonwealth University 2 Department of Epidemiology and Biostatistics, George Washington University * This presentation is based, in part, on Dr. Hoffman s s dissertation directed by Dr. Johnson.

2 Outline Introduction Limit of Quantitation (LOQ) Ignoring values below LOQ problems Solutions Univariate Solutions Multivariate Summary Johnson & Hoffman, BASS XIII, November 7,

3 Purpose To estimate all parameters (means, variances, correlations) associated with a multivariate assay Multiple assays on the same subject Different analytes or repeated measures of one analyte. Do this in the presence of missing data Completely missing due to flawed assay, etc. Some values fall below LOQ Johnson & Hoffman, BASS XIII, November 7,

4 An Example Tobacco Smoke Complex mixture of chemicals released as tobacco products burn Biomarkers include nicotine and its 5 metabolites measured in 24-hour urine samples. Total nicotine sum of Total cotinine sum of Total hydroxycotinine sum of nicotine nicotine-n-glucuronide cotinine cotinine-n-glucuronide trans-3.-hydroxycotinine trans-3.-hydroxycotinine-o-glucuronide Johnson & Hoffman, BASS XIII, November 7,

5 Analysis of Nicotine Nicotine equivalents are calculated according to the following equation: Nicotine Equivalents = 62.23mg mol mg Total Nicotine mg mol mg + Total Cotinine mg mol mg + Total hydroxycotinine mg mol Johnson & Hoffman, BASS XIII, November 7,

6 Analysis of Nicotine Minimize influence of possible incomplete 24-h h urine collections Adjust measured concentration of each subject s s 24-h h urine biomarker by corresponding 24-h h urine creatinine Nicotine Equivalents Creatinine Level Johnson & Hoffman, BASS XIII, November 7,

7 Data Format X X2 X3 X4 X5 X6 X < <0.50 <0.50 < <0.50 < <0.0 <0.50 <0.50 <.20 < <0.0 < <.20. <.00. <0.50 <0.50 <.20 < Total nicotine sum of Total cotinine sum of Total hydroxycotinine sum of Creatinine nicotine nicotine-n-glucuronide cotinine cotinine-nglucuronide trans-3.-hydroxycotinine trans-3.-hydroxycotinine-o-glucuronide Johnson & Hoffman, BASS XIII, November 7, X X2 X3 X4 X5 X6 X7

8 Problems of Interest The general problem Estimate all parameters, mean and standard error of any function of multivariate values A specific problem Estimate mean and standard error of distribution of: T = 6 ai i= bx X 7 i Johnson & Hoffman, BASS XIII, November 7,

9 Primary Questions How does one estimate the parameters of a given set of data containing missing data or values below LOQ? How does one estimate functions of these data? Estimation of means Estimation of standard errors Johnson & Hoffman, BASS XIII, November 7,

10 Methods of Estimation Univariate (single assay) Imputation (Replacement) Cohen s s Method (adjusted mean, std) Maximum Likelihood Estimation (MLE) with left-censored values Extensions of MLE Johnson & Hoffman, BASS XIII, November 7,

11 Methods of Estimation Multivariate (multiple assays) Imputation (Replacement) MLE with left-censoring and correlation structure Pseudo-Likelihood Johnson & Hoffman, BASS XIII, November 7, 2006

12 Parametric Distributions Normal Distribution: Mean= Variance= Coeff Var= σ μ μ σ 2 2 N( μ, σ ) Johnson & Hoffman, BASS XIII, November 7,

13 Parametric Distributions Log-Normal Distribution: 2 exp μ+ σ 2 Mean= Variance= Coeff Var= ( 2 σ ) exp 2 LN( μ, σ ) ( ) exp( 2 2) 2 exp( 2) μ+ σ σ μ = 0 Johnson & Hoffman, BASS XIII, November 7,

14 Parametric Distributions Multivariate Normal Distribution: μ μ ( ) k, μ is the location parameter k of the k th MN ( μ, Σ) variate covariance i Σ ( σ ) ij, σij = variance i = j j Johnson & Hoffman, BASS XIII, November 7,

15 Parametric Distributions Multivariate Log-Normal Distribution: MLN ( μ, Σ) If X MN μσ,, ( ) then Y ( X k ) ( μσ, ) X = e e MLN Johnson & Hoffman, BASS XIII, November 7,

16 Method Validation Validation Criteria Accuracy Precision Specificity Limit of Quantitation/ Limit of Detection Linearity Range Ruggedness/ Repeatability Equivalence USP. General Chapter <225> Validation of Compendial Methods. Pharmacopeial Forum 3(2): , 2005 Johnson & Hoffman, BASS XIII, November 7,

17 Limit of Quantitation (LOQ) The quantitation limit of an individual analytical procedure is the lowest amount of analyte in a sample that can be quantitatively determined with suitable precision and accuracy. Food and Drug Administration, (Draft) Guidance for Industry: Analytical Procedures and Methods Validation: Chemistry, Manufacturing, and Controls Documentation (2000). Accessed online at Johnson & Hoffman, BASS XIII, November 7,

18 Reasons for Values Falling Below LOQ Absence or trace amounts of analyte High degree of noise relative to analyte levels Matrix effects A loss of absolute sensitivity may render some or all of the analytes of interest undetectable Ardrey, R. E., Liquid Chromatography-Mass Spectrometry: An Introduction. John Wiley & Sons, Johnson & Hoffman, BASS XIII, November 7,

19 Determination of LOQ Generally the LOQ is about 0 times the standard deviation of a low standard (blank solutions). The ICH has recognized the ten-to to-one one signal-to to-noise ratio as typical. Johnson & Hoffman, BASS XIII, November 7,

20 Recording Values below LOQ Simple: <LOQ not numeric Numeric values Negative numbers Indicator variable Interval endpoints Johnson & Hoffman, BASS XIII, November 7,

21 Analysis Ignoring values below LOQ Problems The mean will be over-estimated estimated The standard deviation will be under- estimated Type I or Type II error probabilities may be altered in the comparison of means between two groups. Johnson & Hoffman, BASS XIII, November 7,

22 Full curve mean=4 std= Left truncated at LOQ=2 mean=4.055 std=0.945 Full curve mean=4 std= Left truncated at LOQ=3 mean=4.288 std= LOQs Comparison of means: Effect Size Full curve 0 left truncated at LOQs 0.9 Johnson & Hoffman, BASS XIII, November 7,

23 Full curve mean=3 std= left-truncated at LOQ=2 mean=3.288 std= Full curve mean=4 std= left-truncated at LOQ=2 mean=4.055 std=0.945 LOQ Comparison of means: Effect Size Full curve 0.7 left-truncated at LOQ= Johnson & Hoffman, BASS XIII, November 7,

24 Solutions - Univariate Imputation (replacement) Replace values < LOQ with Zero, LOQ/2,, LOQ/ 2, LOQ Treat values < LOQ as left-censored Kaplan-Meier Survival Method Maximum likelihood Cohen s s Method Single LOQ only MLE May have multiple LOQs Robust MLE recommended by Helsel and others Johnson & Hoffman, BASS XIII, November 7,

25 Recommended Book Helsel,, D.R., Nondetects and Data Analysis: Statistics for Censored Environmental Data. John Wiley & Sons, Johnson & Hoffman, BASS XIII, November 7,

26 Replacement: LOQ/2 [Replacement] remains the most commonly used method for computing summary statistics of censored data it is clear that [replacement] must be abandoned it is not an unequivocal method of analysis. Helsel,, 2005 When is replacement good enough? Johnson & Hoffman, BASS XIII, November 7,

27 EPA Guidelines for handling LOD values % values less than LOD < 5% Impute LOD/2 for each nondetect between 5% and 50% Use Cohen's adjustment [ML method] to the sample mean and variance or employ a non-parametric procedure by using the ranks of the observations and by treating all nondetects as tied values greater than 50% use a rank-based procedure. Source: US EPA, Statistical Analysis Of Ground-water Monitoring Data At RCRA Facilities: Addendum To Interim Final Guidance, Office Of Solid Waste, Permits And State Programs Division, U.S. Environmental Protection Agency, July 992. Johnson & Hoffman, BASS XIII, November 7,

28 About the sample mean based on replacement x = ( h) x + hm m [ ] ( ) Ex = Fd ( ) μ + Fdm ( ) m ( Fd ( )) Fd ( ) Var [ x ] m F d n L ( ) 2 ( ) ( ) 2 m = μl + σ L m is the known replacement value h is the observed proportion of values below LOQ d is the LOQ assume a single LOQ was used F ( )) is the underlying CDF L identifies parameters of the left-truncated truncated distribution (truncated at d) Johnson & Hoffman, BASS XIII, November 7,

29 Potential Bias using Imputed Values bias [ ] = E x μ m = Fd ( )( m μ ) R upper bound when bias d F( d) 2 m = d 2 μ is the mean of the underlying distribution R identifies parameters of the right-truncated truncated distribution (truncated at d) Note: μ = Fd ( ) μ + Fd ( ) μ ( ) L R Johnson & Hoffman, BASS XIII, November 7,

30 Potential Bias using Replacement: Log-Normal Distribution bias μ ( 2 zτ τ ) exp 2 = Φ( z) Φ( z τ ) 2 Ratio of bias to mean is a function of Φ(z): Standard normal probability p of values < LOQ τ : Standard deviation of log transformed measure Instead, express τ as a function of the CV : ([ CV ] 2 ) τ = ln + Source: Johnson, R.E., Hoffman, H.J., Estimation in the Presence of Non-Detectables: Imputation and Maximum-Likelihood Approach with Left-Censoring. ASA Proceedings of the Joint Statistical Meetings, , Johnson & Hoffman, BASS XIII, November 7,

31 Bias as Percent of Mean: Log-Normal 5.0% 0.0% % Bias -5.0% -0.0% -5.0% -20.0% % Bias 0.0% -.0% -2.0% -3.0% -4.0% -5.0% -6.0% -7.0% CV -25.0% CV Johnson & Hoffman, BASS XIII, November 7,

32 Full curve mean=4 std= Replace values below LOQ with LOQ/2= Prob LOQ = mean=3.986 std=.0363 Full curve mean=4 std= Replace values below LOQ with LOQ/2=.5 Prob LOQ = 0.59 mean=3.845 std=.2508 LOQs Comparison of means: Effect Size Full curve 0 with replaced values Johnson & Hoffman, BASS XIII, November 7,

33 Full curve mean=3 std= Replace values below LOQ with LOQ/2= Prob LOQ = 0.59 mean=2.925 std=.083 Full curve mean=4 std= Replace values below LOQ with LOQ/2= Prob LOQ = 0.59 mean=3.986 std=.0363 LOQ Comparison of means: Effect Size Full curve 0.7 with replaced values 0.70 Johnson & Hoffman, BASS XIII, November 7,

34 Cohen s s Adjustment Data are assumed to be normally distributed (Transform the data if log-normally distributed) Adjustment of the sample mean and variance ˆλ ˆ ˆ μ = x λ( x d) ˆ ˆ σ = s + λ( x d) is found in Table 2 of Cohen. Can be computed. Cohen, A.C., Jr Simplified estimators for the normal distribution when samples are singly censored or truncated. Technometrics, vol., pp Johnson & Hoffman, BASS XIII, November 7,

35 Parametric estimation in presence of left-censored values: MLE Maximum likelihood estimation (normal distribution) equivalent to Cohen s s adjustment when d i is invariant to i. n δi δi ( θ ) = ( i, θ) ( i, θ) i= L f x F d where δ i = when x i < d i f(x i ) F(d i ) d i Johnson & Hoffman, BASS XIII, November 7,

36 MLE Computations in SAS proc lifereg data=inputdata inputdata; model (Start,End( Start,End)= /dist=lnormal lnormal; run; Regressors may be added to model the mean Johnson & Hoffman, BASS XIII, November 7,

37 Solutions - Multivariate Imputation (replacement) Replace values < LOQ with Zero, LOQ/2,, LOQ/ 2, LOQ Treat values < LOQ as left-censored censored Maximum likelihood EM Algorithm Full Likelihood function Pseudo-likelihood Johnson & Hoffman, BASS XIII, November 7,

38 Extend MLE Method to Multivariate Data Account for correlations among variables Develop MLE tool Construct log-likelihood likelihood Account for left-censored values Maximize using Newton-Raphson Starting value issues Efficiency issues Johnson & Hoffman, BASS XIII, November 7,

39 Likelihood Function Define: Assign: r i s i p = # observed = # censored i X X = i r i + s,..., i,..., X r+ i pi = # non - missing r i X = observed = censored Likelihood for i th subject: L( θ) = f( x,..., x ) F( d,..., d x,..., x ) i r X X r i i r+ i pi i i Johnson & Hoffman, BASS XIII, November 7,

40 Likelihood and Log-Likelihood Likelihood Functions Likelihood n L( θ) = f( x,..., x ) (,...,,..., ) i r F d i X d r i X x x + pi i ri i= Log-Likelihood Likelihood n i= { f x x } r F dx dx x x r ( θ) = ln (,..., ) + ln (,...,,..., ) i i r+ i pi i i Johnson & Hoffman, BASS XIII, November 7,

41 The Crux of the Problem Maximization of likelihood function Must evaluate multivariate normal CDF F( a) Transform the multiple integral Quadrature: : Evaluate the integral = ( 2π ) s Σ a... a s exp 2 ( x μ) Σ ( x μ) dx Genz, A., Numerical Computation of Multivariate Normal Probabilities. Journal of Computational and Graphical Statistics :4-50, 992. Johnson & Hoffman, BASS XIII, November 7,

42 Genz Transformations Compute multivariate normal CDF F( a) = ( 2π ) s Σ a... a s exp 2 ( x μ) Σ ( x μ) dx Transform to integral over unit hyper- cube F( b) = e e2... es dw Johnson & Hoffman, BASS XIII, November 7,

43 Genz Transformations Compute multivariate normal CDF F( a) Standardize data = Define: θ = ( 2π ) s Σ a... ( x μ) Σ ( x μ) Let R be the correlation matrix Standardize limits a,, denoted as b a s exp i i ( θ,..., θ ), where θ =, i s 2 x μ s i () σ = ii dx Johnson & Hoffman, BASS XIII, November 7,

44 Genz Transformations Compute multivariate normal CDF F( b) = ( 2π ) s R b... b s exp θ R 2 θ dθ Johnson & Hoffman, BASS XIII, November 7,

45 Three Transformations Transform to integral over unit hyper- cube () Cholesky decomposition transformation θ = Cy, where CC = Cholesky decomposition of R (2) Transformation of y ( ) y i = Φ z i (3) Transformation converting integral to constant z = ew limit form i i i Johnson & Hoffman, BASS XIII, November 7,

46 () Cholesky Decomposition Transformation Let: Then: θ = Cy, where CC = Cholesky decomposition of R ( 2π ) b 2 b 2 ( y ) 2 b s ( y,..., ys ) 2 y y 2 ys F( b) = exp exp... exp d s 2 2 y 2 where b ( y i,..., yi ) bi = i j= c ii c ij y j Johnson & Hoffman, BASS XIII, November 7,

47 (2) Transformation of y Let: where ( ) y i = Φ z i y Φ y = 2 ( ) exp 2π θ dθ 2 Then: where e e ( z,..., z 2 s s = F( b)... dz e ( z,..., z i 0 i ( z ) 0 e bi ) = Φ 0 i j= c c ij ii Φ ) ( z ) Johnson & Hoffman, BASS XIII, November 7, j

48 (3) Convert Integral to Constant Limit Form Let: Then: z = ew i i i F( b) = e e2... es dw where e i i bi cijφ ( ejwj) j= =Φ cii Johnson & Hoffman, BASS XIII, November 7,

49 Numerically Evaluate Integral We chose to use Legendre-Gauss Quadrature of order 8. Abscissas for quadrature of order n are given by roots of Legendre polynomials Johnson & Hoffman, BASS XIII, November 7,

50 Quadrature Definition: A functional Q n that is defined on C[a,b] ] and has the form Goal: n Qn ( f ) = wv f ( av) for a a... an b, and wv 0 for v = ( )n v= is called a quadrature rule of order n on the fundamental interval [a,b].[ Choose appropriate weights and abscissas Independent of f b For large class of functions f a f ( x) dx Q ( f ) n Johnson & Hoffman, BASS XIII, November 7,

51 Limitations The algorithm works well if only a few observations have 7 or fewer variables with values below their respective LODs. Otherwise the algorithm will consume much time and resources. Johnson & Hoffman, BASS XIII, November 7,

52 Alternate Method: Pseudo-Likelihood Method Instead of applying the ML approach to all variables simultaneously, apply the ML algorithm to pairs of variables. This methodology was recently used by Fieuws to fit mixed models for multivariate longitudinal profiles In the next slides we describe a pseudo- likelihood approach using Fieuws notation. Fieuws, S., Verbeke, G., Pairwise Fitting of Mixed Models for the Joint Modeling of Multivariate Longitudinal Profiles. Biometrics 62: , 2006 Johnson & Hoffman, BASS XIII, November 7,

53 Pseudo-Likelihood Approach Consider the log-likelihood likelihood involving the pair of variables X, X ( ) r n s l = l Θ X, X ( ) rs rsi rs r s i= We have p variables. We may represent rs = k where k =,,P,P and P=p ( p)/2, the number of pairs. Thus l k = l rs Johnson & Hoffman, BASS XIII, November 7,

54 Pseudo-Likelihood Approach We now maximize the pseudo- likelihood function l Θ = P ( ) ( ) k = l p Θ k where Θ= ( Θ ) is the, Θ2,, Θ P stacked parameter vector. Johnson & Hoffman, BASS XIII, November 7,

55 Distribution of Parameter Estimates The estimated stacked parameter vector is asymptotically normal N Θ Θ MN 0, Γ ( ) ( ) Johnson & Hoffman, BASS XIII, November 7,

56 Estimates and Var-Cov of Parameter Estimates The vector of unique parameter estimates is given by and its variance-covariance covariance matrix is given by * Θ = where A is the matrix that calculates the desired averages. ˆ AΘˆ ( Θ) ˆ * ˆ Θ = AΓ A Johnson & Hoffman, BASS XIII, November 7,

57 Back to the Example Estimate mean and standard error of distribution of: T = 6 i= a i bx X 7 i Johnson & Hoffman, BASS XIII, November 7,

58 Back to the Example Replace values below LOQ? Impute missing values? Then compute values of T for each subject and compute the sample mean and standard deviation. Or estimate all parameters with the MLE methods and Johnson & Hoffman, BASS XIII, November 7,

59 Johnson & Hoffman, BASS XIII, November 7, 2006 Johnson & Hoffman, BASS XIII, November 7, Back to the Example Back to the Example Use the delta method Use the delta method Estimate mean Estimate mean Estimate standard error Estimate standard error ( ) + = = = = = ˆ ˆ ˆ ˆ ˆ ˆ 2 ˆ ˆ ) var( μ μ σ σ μ μ σ μ i i i i i i i i i j i ji i j a a a a a b T = ˆ ˆ ˆ ˆ ˆ ˆ ˆ ) ( i i i i a i b T E μ μ σ μ σ μ μ

60 Comparison: based on simulations Multivariate MLE method is generally optimal with respect to bias and MSE assuming either a normal or log-normal distribution. Pseudo-MLE method closely matches the multivariate method with respect to bias and MLE. Pseudo-MLE is much (very much) more efficient with respect to computation resources and time. Johnson & Hoffman, BASS XIII, November 7,

61 Summary Replacement of LOQs is never needed. MLE methods provide for parametric estimation taking censoring and missing values into account. Pseudo-MLE performs better than replacement and essentially as well as the full MLE. When is replacement good enough? Johnson & Hoffman, BASS XIII, November 7,

62 Copy of Presentation Slides An updated and corrected copy of these presentation slides may be found at Johnson & Hoffman, BASS XIII, November 7,

Group Sequential Tests for Delayed Responses. Christopher Jennison. Lisa Hampson. Workshop on Special Topics on Sequential Methodology

Group Sequential Tests for Delayed Responses. Christopher Jennison. Lisa Hampson. Workshop on Special Topics on Sequential Methodology Group Sequential Tests for Delayed Responses Christopher Jennison Department of Mathematical Sciences, University of Bath, UK http://people.bath.ac.uk/mascj Lisa Hampson Department of Mathematics and Statistics,

More information

Analysis of Gamma and Weibull Lifetime Data under a General Censoring Scheme and in the presence of Covariates

Analysis of Gamma and Weibull Lifetime Data under a General Censoring Scheme and in the presence of Covariates Communications in Statistics - Theory and Methods ISSN: 0361-0926 (Print) 1532-415X (Online) Journal homepage: http://www.tandfonline.com/loi/lsta20 Analysis of Gamma and Weibull Lifetime Data under a

More information

Analytical Performance & Method. Validation

Analytical Performance & Method. Validation Analytical Performance & Method Ahmad Aqel Ifseisi Assistant Professor of Analytical Chemistry College of Science, Department of Chemistry King Saud University P.O. Box 2455 Riyadh 11451 Saudi Arabia Building:

More information

Estimation of Conditional Kendall s Tau for Bivariate Interval Censored Data

Estimation of Conditional Kendall s Tau for Bivariate Interval Censored Data Communications for Statistical Applications and Methods 2015, Vol. 22, No. 6, 599 604 DOI: http://dx.doi.org/10.5351/csam.2015.22.6.599 Print ISSN 2287-7843 / Online ISSN 2383-4757 Estimation of Conditional

More information

Fractional Hot Deck Imputation for Robust Inference Under Item Nonresponse in Survey Sampling

Fractional Hot Deck Imputation for Robust Inference Under Item Nonresponse in Survey Sampling Fractional Hot Deck Imputation for Robust Inference Under Item Nonresponse in Survey Sampling Jae-Kwang Kim 1 Iowa State University June 26, 2013 1 Joint work with Shu Yang Introduction 1 Introduction

More information

Two-stage Adaptive Randomization for Delayed Response in Clinical Trials

Two-stage Adaptive Randomization for Delayed Response in Clinical Trials Two-stage Adaptive Randomization for Delayed Response in Clinical Trials Guosheng Yin Department of Statistics and Actuarial Science The University of Hong Kong Joint work with J. Xu PSI and RSS Journal

More information

Exam C Solutions Spring 2005

Exam C Solutions Spring 2005 Exam C Solutions Spring 005 Question # The CDF is F( x) = 4 ( + x) Observation (x) F(x) compare to: Maximum difference 0. 0.58 0, 0. 0.58 0.7 0.880 0., 0.4 0.680 0.9 0.93 0.4, 0.6 0.53. 0.949 0.6, 0.8

More information

Statistical Methods for Handling Incomplete Data Chapter 2: Likelihood-based approach

Statistical Methods for Handling Incomplete Data Chapter 2: Likelihood-based approach Statistical Methods for Handling Incomplete Data Chapter 2: Likelihood-based approach Jae-Kwang Kim Department of Statistics, Iowa State University Outline 1 Introduction 2 Observed likelihood 3 Mean Score

More information

Fractional Imputation in Survey Sampling: A Comparative Review

Fractional Imputation in Survey Sampling: A Comparative Review Fractional Imputation in Survey Sampling: A Comparative Review Shu Yang Jae-Kwang Kim Iowa State University Joint Statistical Meetings, August 2015 Outline Introduction Fractional imputation Features Numerical

More information

Robust Parameter Estimation in the Weibull and the Birnbaum-Saunders Distribution

Robust Parameter Estimation in the Weibull and the Birnbaum-Saunders Distribution Clemson University TigerPrints All Theses Theses 8-2012 Robust Parameter Estimation in the Weibull and the Birnbaum-Saunders Distribution Jing Zhao Clemson University, jzhao2@clemson.edu Follow this and

More information

Semiparametric Mixed Effects Models with Flexible Random Effects Distribution

Semiparametric Mixed Effects Models with Flexible Random Effects Distribution Semiparametric Mixed Effects Models with Flexible Random Effects Distribution Marie Davidian North Carolina State University davidian@stat.ncsu.edu www.stat.ncsu.edu/ davidian Joint work with A. Tsiatis,

More information

Web-based Supplementary Material for A Two-Part Joint. Model for the Analysis of Survival and Longitudinal Binary. Data with excess Zeros

Web-based Supplementary Material for A Two-Part Joint. Model for the Analysis of Survival and Longitudinal Binary. Data with excess Zeros Web-based Supplementary Material for A Two-Part Joint Model for the Analysis of Survival and Longitudinal Binary Data with excess Zeros Dimitris Rizopoulos, 1 Geert Verbeke, 1 Emmanuel Lesaffre 1 and Yves

More information

f(x θ)dx with respect to θ. Assuming certain smoothness conditions concern differentiating under the integral the integral sign, we first obtain

f(x θ)dx with respect to θ. Assuming certain smoothness conditions concern differentiating under the integral the integral sign, we first obtain 0.1. INTRODUCTION 1 0.1 Introduction R. A. Fisher, a pioneer in the development of mathematical statistics, introduced a measure of the amount of information contained in an observaton from f(x θ). Fisher

More information

Practice Problems Section Problems

Practice Problems Section Problems Practice Problems Section 4-4-3 4-4 4-5 4-6 4-7 4-8 4-10 Supplemental Problems 4-1 to 4-9 4-13, 14, 15, 17, 19, 0 4-3, 34, 36, 38 4-47, 49, 5, 54, 55 4-59, 60, 63 4-66, 68, 69, 70, 74 4-79, 81, 84 4-85,

More information

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A. 1. Let P be a probability measure on a collection of sets A. (a) For each n N, let H n be a set in A such that H n H n+1. Show that P (H n ) monotonically converges to P ( k=1 H k) as n. (b) For each n

More information

Exponential Family and Maximum Likelihood, Gaussian Mixture Models and the EM Algorithm. by Korbinian Schwinger

Exponential Family and Maximum Likelihood, Gaussian Mixture Models and the EM Algorithm. by Korbinian Schwinger Exponential Family and Maximum Likelihood, Gaussian Mixture Models and the EM Algorithm by Korbinian Schwinger Overview Exponential Family Maximum Likelihood The EM Algorithm Gaussian Mixture Models Exponential

More information

Stat260: Bayesian Modeling and Inference Lecture Date: February 10th, Jeffreys priors. exp 1 ) p 2

Stat260: Bayesian Modeling and Inference Lecture Date: February 10th, Jeffreys priors. exp 1 ) p 2 Stat260: Bayesian Modeling and Inference Lecture Date: February 10th, 2010 Jeffreys priors Lecturer: Michael I. Jordan Scribe: Timothy Hunter 1 Priors for the multivariate Gaussian Consider a multivariate

More information

Measurement Error and Linear Regression of Astronomical Data. Brandon Kelly Penn State Summer School in Astrostatistics, June 2007

Measurement Error and Linear Regression of Astronomical Data. Brandon Kelly Penn State Summer School in Astrostatistics, June 2007 Measurement Error and Linear Regression of Astronomical Data Brandon Kelly Penn State Summer School in Astrostatistics, June 2007 Classical Regression Model Collect n data points, denote i th pair as (η

More information

Multistate Modeling and Applications

Multistate Modeling and Applications Multistate Modeling and Applications Yang Yang Department of Statistics University of Michigan, Ann Arbor IBM Research Graduate Student Workshop: Statistics for a Smarter Planet Yang Yang (UM, Ann Arbor)

More information

A Guide to Modern Econometric:

A Guide to Modern Econometric: A Guide to Modern Econometric: 4th edition Marno Verbeek Rotterdam School of Management, Erasmus University, Rotterdam B 379887 )WILEY A John Wiley & Sons, Ltd., Publication Contents Preface xiii 1 Introduction

More information

Measurement error as missing data: the case of epidemiologic assays. Roderick J. Little

Measurement error as missing data: the case of epidemiologic assays. Roderick J. Little Measurement error as missing data: the case of epidemiologic assays Roderick J. Little Outline Discuss two related calibration topics where classical methods are deficient (A) Limit of quantification methods

More information

Variations. ECE 6540, Lecture 10 Maximum Likelihood Estimation

Variations. ECE 6540, Lecture 10 Maximum Likelihood Estimation Variations ECE 6540, Lecture 10 Last Time BLUE (Best Linear Unbiased Estimator) Formulation Advantages Disadvantages 2 The BLUE A simplification Assume the estimator is a linear system For a single parameter

More information

Two hours. To be supplied by the Examinations Office: Mathematical Formula Tables THE UNIVERSITY OF MANCHESTER. 21 June :45 11:45

Two hours. To be supplied by the Examinations Office: Mathematical Formula Tables THE UNIVERSITY OF MANCHESTER. 21 June :45 11:45 Two hours MATH20802 To be supplied by the Examinations Office: Mathematical Formula Tables THE UNIVERSITY OF MANCHESTER STATISTICAL METHODS 21 June 2010 9:45 11:45 Answer any FOUR of the questions. University-approved

More information

Introduction to Maximum Likelihood Estimation

Introduction to Maximum Likelihood Estimation Introduction to Maximum Likelihood Estimation Eric Zivot July 26, 2012 The Likelihood Function Let 1 be an iid sample with pdf ( ; ) where is a ( 1) vector of parameters that characterize ( ; ) Example:

More information

Spring 2012 Math 541A Exam 1. X i, S 2 = 1 n. n 1. X i I(X i < c), T n =

Spring 2012 Math 541A Exam 1. X i, S 2 = 1 n. n 1. X i I(X i < c), T n = Spring 2012 Math 541A Exam 1 1. (a) Let Z i be independent N(0, 1), i = 1, 2,, n. Are Z = 1 n n Z i and S 2 Z = 1 n 1 n (Z i Z) 2 independent? Prove your claim. (b) Let X 1, X 2,, X n be independent identically

More information

BAYESIAN ESTIMATION OF THE EXPONENTI- ATED GAMMA PARAMETER AND RELIABILITY FUNCTION UNDER ASYMMETRIC LOSS FUNC- TION

BAYESIAN ESTIMATION OF THE EXPONENTI- ATED GAMMA PARAMETER AND RELIABILITY FUNCTION UNDER ASYMMETRIC LOSS FUNC- TION REVSTAT Statistical Journal Volume 9, Number 3, November 211, 247 26 BAYESIAN ESTIMATION OF THE EXPONENTI- ATED GAMMA PARAMETER AND RELIABILITY FUNCTION UNDER ASYMMETRIC LOSS FUNC- TION Authors: Sanjay

More information

Comparison between conditional and marginal maximum likelihood for a class of item response models

Comparison between conditional and marginal maximum likelihood for a class of item response models (1/24) Comparison between conditional and marginal maximum likelihood for a class of item response models Francesco Bartolucci, University of Perugia (IT) Silvia Bacci, University of Perugia (IT) Claudia

More information

Practice Exam 1. (A) (B) (C) (D) (E) You are given the following data on loss sizes:

Practice Exam 1. (A) (B) (C) (D) (E) You are given the following data on loss sizes: Practice Exam 1 1. Losses for an insurance coverage have the following cumulative distribution function: F(0) = 0 F(1,000) = 0.2 F(5,000) = 0.4 F(10,000) = 0.9 F(100,000) = 1 with linear interpolation

More information

Ph.D. Qualifying Exam Friday Saturday, January 6 7, 2017

Ph.D. Qualifying Exam Friday Saturday, January 6 7, 2017 Ph.D. Qualifying Exam Friday Saturday, January 6 7, 2017 Put your solution to each problem on a separate sheet of paper. Problem 1. (5106) Let X 1, X 2,, X n be a sequence of i.i.d. observations from a

More information

A comparison of inverse transform and composition methods of data simulation from the Lindley distribution

A comparison of inverse transform and composition methods of data simulation from the Lindley distribution Communications for Statistical Applications and Methods 2016, Vol. 23, No. 6, 517 529 http://dx.doi.org/10.5351/csam.2016.23.6.517 Print ISSN 2287-7843 / Online ISSN 2383-4757 A comparison of inverse transform

More information

Introduction (Alex Dmitrienko, Lilly) Web-based training program

Introduction (Alex Dmitrienko, Lilly) Web-based training program Web-based training Introduction (Alex Dmitrienko, Lilly) Web-based training program http://www.amstat.org/sections/sbiop/webinarseries.html Four-part web-based training series Geert Verbeke (Katholieke

More information

Method of Conditional Moments Based on Incomplete Data

Method of Conditional Moments Based on Incomplete Data , ISSN 0974-570X (Online, ISSN 0974-5718 (Print, Vol. 20; Issue No. 3; Year 2013, Copyright 2013 by CESER Publications Method of Conditional Moments Based on Incomplete Data Yan Lu 1 and Naisheng Wang

More information

Implementation of Pairwise Fitting Technique for Analyzing Multivariate Longitudinal Data in SAS

Implementation of Pairwise Fitting Technique for Analyzing Multivariate Longitudinal Data in SAS PharmaSUG2011 - Paper SP09 Implementation of Pairwise Fitting Technique for Analyzing Multivariate Longitudinal Data in SAS Madan Gopal Kundu, Indiana University Purdue University at Indianapolis, Indianapolis,

More information

A Very Brief Summary of Statistical Inference, and Examples

A Very Brief Summary of Statistical Inference, and Examples A Very Brief Summary of Statistical Inference, and Examples Trinity Term 2008 Prof. Gesine Reinert 1 Data x = x 1, x 2,..., x n, realisations of random variables X 1, X 2,..., X n with distribution (model)

More information

Notes on the Multivariate Normal and Related Topics

Notes on the Multivariate Normal and Related Topics Version: July 10, 2013 Notes on the Multivariate Normal and Related Topics Let me refresh your memory about the distinctions between population and sample; parameters and statistics; population distributions

More information

A Derivation of the EM Updates for Finding the Maximum Likelihood Parameter Estimates of the Student s t Distribution

A Derivation of the EM Updates for Finding the Maximum Likelihood Parameter Estimates of the Student s t Distribution A Derivation of the EM Updates for Finding the Maximum Likelihood Parameter Estimates of the Student s t Distribution Carl Scheffler First draft: September 008 Contents The Student s t Distribution The

More information

Measuring relationships among multiple responses

Measuring relationships among multiple responses Measuring relationships among multiple responses Linear association (correlation, relatedness, shared information) between pair-wise responses is an important property used in almost all multivariate analyses.

More information

Prerequisite: STATS 7 or STATS 8 or AP90 or (STATS 120A and STATS 120B and STATS 120C). AP90 with a minimum score of 3

Prerequisite: STATS 7 or STATS 8 or AP90 or (STATS 120A and STATS 120B and STATS 120C). AP90 with a minimum score of 3 University of California, Irvine 2017-2018 1 Statistics (STATS) Courses STATS 5. Seminar in Data Science. 1 Unit. An introduction to the field of Data Science; intended for entering freshman and transfers.

More information

Monitoring clinical trial outcomes with delayed response: incorporating pipeline data in group sequential designs. Christopher Jennison

Monitoring clinical trial outcomes with delayed response: incorporating pipeline data in group sequential designs. Christopher Jennison Monitoring clinical trial outcomes with delayed response: incorporating pipeline data in group sequential designs Christopher Jennison Department of Mathematical Sciences, University of Bath http://people.bath.ac.uk/mascj

More information

HANDBOOK OF APPLICABLE MATHEMATICS

HANDBOOK OF APPLICABLE MATHEMATICS HANDBOOK OF APPLICABLE MATHEMATICS Chief Editor: Walter Ledermann Volume VI: Statistics PART A Edited by Emlyn Lloyd University of Lancaster A Wiley-Interscience Publication JOHN WILEY & SONS Chichester

More information

Multivariate Distributions

Multivariate Distributions IEOR E4602: Quantitative Risk Management Spring 2016 c 2016 by Martin Haugh Multivariate Distributions We will study multivariate distributions in these notes, focusing 1 in particular on multivariate

More information

Nonparametric rank based estimation of bivariate densities given censored data conditional on marginal probabilities

Nonparametric rank based estimation of bivariate densities given censored data conditional on marginal probabilities Hutson Journal of Statistical Distributions and Applications (26 3:9 DOI.86/s4488-6-47-y RESEARCH Open Access Nonparametric rank based estimation of bivariate densities given censored data conditional

More information

Part III. Hypothesis Testing. III.1. Log-rank Test for Right-censored Failure Time Data

Part III. Hypothesis Testing. III.1. Log-rank Test for Right-censored Failure Time Data 1 Part III. Hypothesis Testing III.1. Log-rank Test for Right-censored Failure Time Data Consider a survival study consisting of n independent subjects from p different populations with survival functions

More information

STATISTICAL INFERENCE IN ACCELERATED LIFE TESTING WITH GEOMETRIC PROCESS MODEL. A Thesis. Presented to the. Faculty of. San Diego State University

STATISTICAL INFERENCE IN ACCELERATED LIFE TESTING WITH GEOMETRIC PROCESS MODEL. A Thesis. Presented to the. Faculty of. San Diego State University STATISTICAL INFERENCE IN ACCELERATED LIFE TESTING WITH GEOMETRIC PROCESS MODEL A Thesis Presented to the Faculty of San Diego State University In Partial Fulfillment of the Requirements for the Degree

More information

Computationally Efficient Estimation of Multilevel High-Dimensional Latent Variable Models

Computationally Efficient Estimation of Multilevel High-Dimensional Latent Variable Models Computationally Efficient Estimation of Multilevel High-Dimensional Latent Variable Models Tihomir Asparouhov 1, Bengt Muthen 2 Muthen & Muthen 1 UCLA 2 Abstract Multilevel analysis often leads to modeling

More information

Chapter 17. Failure-Time Regression Analysis. William Q. Meeker and Luis A. Escobar Iowa State University and Louisiana State University

Chapter 17. Failure-Time Regression Analysis. William Q. Meeker and Luis A. Escobar Iowa State University and Louisiana State University Chapter 17 Failure-Time Regression Analysis William Q. Meeker and Luis A. Escobar Iowa State University and Louisiana State University Copyright 1998-2008 W. Q. Meeker and L. A. Escobar. Based on the authors

More information

Introduction to Probability and Statistics (Continued)

Introduction to Probability and Statistics (Continued) Introduction to Probability and Statistics (Continued) Prof. icholas Zabaras Center for Informatics and Computational Science https://cics.nd.edu/ University of otre Dame otre Dame, Indiana, USA Email:

More information

Gauge Plots. Gauge Plots JAPANESE BEETLE DATA MAXIMUM LIKELIHOOD FOR SPATIALLY CORRELATED DISCRETE DATA JAPANESE BEETLE DATA

Gauge Plots. Gauge Plots JAPANESE BEETLE DATA MAXIMUM LIKELIHOOD FOR SPATIALLY CORRELATED DISCRETE DATA JAPANESE BEETLE DATA JAPANESE BEETLE DATA 6 MAXIMUM LIKELIHOOD FOR SPATIALLY CORRELATED DISCRETE DATA Gauge Plots TuscaroraLisa Central Madsen Fairways, 996 January 9, 7 Grubs Adult Activity Grub Counts 6 8 Organic Matter

More information

Karhunen-Loeve Expansion and Optimal Low-Rank Model for Spatial Processes

Karhunen-Loeve Expansion and Optimal Low-Rank Model for Spatial Processes TTU, October 26, 2012 p. 1/3 Karhunen-Loeve Expansion and Optimal Low-Rank Model for Spatial Processes Hao Zhang Department of Statistics Department of Forestry and Natural Resources Purdue University

More information

A Quasi Gamma Distribution

A Quasi Gamma Distribution 08; 3(4): 08-7 ISSN: 456-45 Maths 08; 3(4): 08-7 08 Stats & Maths www.mathsjournal.com Received: 05-03-08 Accepted: 06-04-08 Rama Shanker Department of Statistics, College of Science, Eritrea Institute

More information

Some New Methods for Latent Variable Models and Survival Analysis. Latent-Model Robustness in Structural Measurement Error Models.

Some New Methods for Latent Variable Models and Survival Analysis. Latent-Model Robustness in Structural Measurement Error Models. Some New Methods for Latent Variable Models and Survival Analysis Marie Davidian Department of Statistics North Carolina State University 1. Introduction Outline 3. Empirically checking latent-model robustness

More information

Biostat 2065 Analysis of Incomplete Data

Biostat 2065 Analysis of Incomplete Data Biostat 2065 Analysis of Incomplete Data Gong Tang Dept of Biostatistics University of Pittsburgh October 20, 2005 1. Large-sample inference based on ML Let θ is the MLE, then the large-sample theory implies

More information

Lecture 4: Types of errors. Bayesian regression models. Logistic regression

Lecture 4: Types of errors. Bayesian regression models. Logistic regression Lecture 4: Types of errors. Bayesian regression models. Logistic regression A Bayesian interpretation of regularization Bayesian vs maximum likelihood fitting more generally COMP-652 and ECSE-68, Lecture

More information

Gaussian processes for inference in stochastic differential equations

Gaussian processes for inference in stochastic differential equations Gaussian processes for inference in stochastic differential equations Manfred Opper, AI group, TU Berlin November 6, 2017 Manfred Opper, AI group, TU Berlin (TU Berlin) inference in SDE November 6, 2017

More information

STAT331. Cox s Proportional Hazards Model

STAT331. Cox s Proportional Hazards Model STAT331 Cox s Proportional Hazards Model In this unit we introduce Cox s proportional hazards (Cox s PH) model, give a heuristic development of the partial likelihood function, and discuss adaptations

More information

Statistics - Lecture One. Outline. Charlotte Wickham 1. Basic ideas about estimation

Statistics - Lecture One. Outline. Charlotte Wickham  1. Basic ideas about estimation Statistics - Lecture One Charlotte Wickham wickham@stat.berkeley.edu http://www.stat.berkeley.edu/~wickham/ Outline 1. Basic ideas about estimation 2. Method of Moments 3. Maximum Likelihood 4. Confidence

More information

Estimation: Problems & Solutions

Estimation: Problems & Solutions Estimation: Problems & Solutions Edps/Psych/Stat 587 Carolyn J. Anderson Department of Educational Psychology c Board of Trustees, University of Illinois Fall 2017 Outline 1. Introduction: Estimation of

More information

Surrogate marker evaluation when data are small, large, or very large Geert Molenberghs

Surrogate marker evaluation when data are small, large, or very large Geert Molenberghs Surrogate marker evaluation when data are small, large, or very large Geert Molenberghs Interuniversity Institute for Biostatistics and statistical Bioinformatics (I-BioStat) Universiteit Hasselt & KU

More information

The Relationship Between Confidence Intervals for Failure Probabilities and Life Time Quantiles

The Relationship Between Confidence Intervals for Failure Probabilities and Life Time Quantiles Statistics Preprints Statistics 2008 The Relationship Between Confidence Intervals for Failure Probabilities and Life Time Quantiles Yili Hong Iowa State University, yili_hong@hotmail.com William Q. Meeker

More information

Correction for Ascertainment

Correction for Ascertainment Correction for Ascertainment Michael C. Neale International Workshop on Methodology for Genetic Studies of Twins and Families Boulder CO 2004 Virginia Institute for Psychiatric and Behavioral Genetics

More information

Marginal Specifications and a Gaussian Copula Estimation

Marginal Specifications and a Gaussian Copula Estimation Marginal Specifications and a Gaussian Copula Estimation Kazim Azam Abstract Multivariate analysis involving random variables of different type like count, continuous or mixture of both is frequently required

More information

Introducing the Normal Distribution

Introducing the Normal Distribution Department of Mathematics Ma 3/13 KC Border Introduction to Probability and Statistics Winter 219 Lecture 1: Introducing the Normal Distribution Relevant textbook passages: Pitman [5]: Sections 1.2, 2.2,

More information

MS&E 226: Small Data. Lecture 11: Maximum likelihood (v2) Ramesh Johari

MS&E 226: Small Data. Lecture 11: Maximum likelihood (v2) Ramesh Johari MS&E 226: Small Data Lecture 11: Maximum likelihood (v2) Ramesh Johari ramesh.johari@stanford.edu 1 / 18 The likelihood function 2 / 18 Estimating the parameter This lecture develops the methodology behind

More information

Physics 403. Segev BenZvi. Propagation of Uncertainties. Department of Physics and Astronomy University of Rochester

Physics 403. Segev BenZvi. Propagation of Uncertainties. Department of Physics and Astronomy University of Rochester Physics 403 Propagation of Uncertainties Segev BenZvi Department of Physics and Astronomy University of Rochester Table of Contents 1 Maximum Likelihood and Minimum Least Squares Uncertainty Intervals

More information

Survival Analysis: Weeks 2-3. Lu Tian and Richard Olshen Stanford University

Survival Analysis: Weeks 2-3. Lu Tian and Richard Olshen Stanford University Survival Analysis: Weeks 2-3 Lu Tian and Richard Olshen Stanford University 2 Kaplan-Meier(KM) Estimator Nonparametric estimation of the survival function S(t) = pr(t > t) The nonparametric estimation

More information

FULL LIKELIHOOD INFERENCES IN THE COX MODEL

FULL LIKELIHOOD INFERENCES IN THE COX MODEL October 20, 2007 FULL LIKELIHOOD INFERENCES IN THE COX MODEL BY JIAN-JIAN REN 1 AND MAI ZHOU 2 University of Central Florida and University of Kentucky Abstract We use the empirical likelihood approach

More information

Gaussian Processes. Le Song. Machine Learning II: Advanced Topics CSE 8803ML, Spring 2012

Gaussian Processes. Le Song. Machine Learning II: Advanced Topics CSE 8803ML, Spring 2012 Gaussian Processes Le Song Machine Learning II: Advanced Topics CSE 8803ML, Spring 01 Pictorial view of embedding distribution Transform the entire distribution to expected features Feature space Feature

More information

Generalizing the MCPMod methodology beyond normal, independent data

Generalizing the MCPMod methodology beyond normal, independent data Generalizing the MCPMod methodology beyond normal, independent data José Pinheiro Joint work with Frank Bretz and Björn Bornkamp Novartis AG ASA NJ Chapter 35 th Annual Spring Symposium June 06, 2014 Outline

More information

Generalized, Linear, and Mixed Models

Generalized, Linear, and Mixed Models Generalized, Linear, and Mixed Models CHARLES E. McCULLOCH SHAYLER.SEARLE Departments of Statistical Science and Biometrics Cornell University A WILEY-INTERSCIENCE PUBLICATION JOHN WILEY & SONS, INC. New

More information

Lecture 3: Linear Models. Bruce Walsh lecture notes Uppsala EQG course version 28 Jan 2012

Lecture 3: Linear Models. Bruce Walsh lecture notes Uppsala EQG course version 28 Jan 2012 Lecture 3: Linear Models Bruce Walsh lecture notes Uppsala EQG course version 28 Jan 2012 1 Quick Review of the Major Points The general linear model can be written as y = X! + e y = vector of observed

More information

On Estimating the Relationship between Longitudinal Measurements and Time-to-Event Data Using a Simple Two-Stage Procedure

On Estimating the Relationship between Longitudinal Measurements and Time-to-Event Data Using a Simple Two-Stage Procedure Biometrics DOI: 10.1111/j.1541-0420.2009.01324.x On Estimating the Relationship between Longitudinal Measurements and Time-to-Event Data Using a Simple Two-Stage Procedure Paul S. Albert 1, and Joanna

More information

Introducing the Normal Distribution

Introducing the Normal Distribution Department of Mathematics Ma 3/103 KC Border Introduction to Probability and Statistics Winter 2017 Lecture 10: Introducing the Normal Distribution Relevant textbook passages: Pitman [5]: Sections 1.2,

More information

Generalized Linear Models for Non-Normal Data

Generalized Linear Models for Non-Normal Data Generalized Linear Models for Non-Normal Data Today s Class: 3 parts of a generalized model Models for binary outcomes Complications for generalized multivariate or multilevel models SPLH 861: Lecture

More information

Introduction to Probabilistic Graphical Models

Introduction to Probabilistic Graphical Models Introduction to Probabilistic Graphical Models Sargur Srihari srihari@cedar.buffalo.edu 1 Topics 1. What are probabilistic graphical models (PGMs) 2. Use of PGMs Engineering and AI 3. Directionality in

More information

ECON Introductory Econometrics. Lecture 11: Binary dependent variables

ECON Introductory Econometrics. Lecture 11: Binary dependent variables ECON4150 - Introductory Econometrics Lecture 11: Binary dependent variables Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 11 Lecture Outline 2 The linear probability model Nonlinear probability

More information

Small area estimation with missing data using a multivariate linear random effects model

Small area estimation with missing data using a multivariate linear random effects model Department of Mathematics Small area estimation with missing data using a multivariate linear random effects model Innocent Ngaruye, Dietrich von Rosen and Martin Singull LiTH-MAT-R--2017/07--SE Department

More information

Hacettepe Journal of Mathematics and Statistics Volume 45 (5) (2016), Abstract

Hacettepe Journal of Mathematics and Statistics Volume 45 (5) (2016), Abstract Hacettepe Journal of Mathematics and Statistics Volume 45 (5) (2016), 1605 1620 Comparing of some estimation methods for parameters of the Marshall-Olkin generalized exponential distribution under progressive

More information

Efficient multivariate normal distribution calculations in Stata

Efficient multivariate normal distribution calculations in Stata Supervisor: Adrian Mander MRC Biostatistics Unit 2015 UK Stata Users Group Meeting 10/09/15 1/21 Why and what? Why do we need to be able to work with the Multivariate Normal Distribution? The normal distribution

More information

11 Survival Analysis and Empirical Likelihood

11 Survival Analysis and Empirical Likelihood 11 Survival Analysis and Empirical Likelihood The first paper of empirical likelihood is actually about confidence intervals with the Kaplan-Meier estimator (Thomas and Grunkmeier 1979), i.e. deals with

More information

Simulating Realistic Ecological Count Data

Simulating Realistic Ecological Count Data 1 / 76 Simulating Realistic Ecological Count Data Lisa Madsen Dave Birkes Oregon State University Statistics Department Seminar May 2, 2011 2 / 76 Outline 1 Motivation Example: Weed Counts 2 Pearson Correlation

More information

Canonical Correlation Analysis of Longitudinal Data

Canonical Correlation Analysis of Longitudinal Data Biometrics Section JSM 2008 Canonical Correlation Analysis of Longitudinal Data Jayesh Srivastava Dayanand N Naik Abstract Studying the relationship between two sets of variables is an important multivariate

More information

A Very Brief Summary of Statistical Inference, and Examples

A Very Brief Summary of Statistical Inference, and Examples A Very Brief Summary of Statistical Inference, and Examples Trinity Term 2009 Prof. Gesine Reinert Our standard situation is that we have data x = x 1, x 2,..., x n, which we view as realisations of random

More information

Wiley. Methods and Applications of Linear Models. Regression and the Analysis. of Variance. Third Edition. Ishpeming, Michigan RONALD R.

Wiley. Methods and Applications of Linear Models. Regression and the Analysis. of Variance. Third Edition. Ishpeming, Michigan RONALD R. Methods and Applications of Linear Models Regression and the Analysis of Variance Third Edition RONALD R. HOCKING PenHock Statistical Consultants Ishpeming, Michigan Wiley Contents Preface to the Third

More information

Inferring from data. Theory of estimators

Inferring from data. Theory of estimators Inferring from data Theory of estimators 1 Estimators Estimator is any function of the data e(x) used to provide an estimate ( a measurement ) of an unknown parameter. Because estimators are functions

More information

An Introduction to Spectral Learning

An Introduction to Spectral Learning An Introduction to Spectral Learning Hanxiao Liu November 8, 2013 Outline 1 Method of Moments 2 Learning topic models using spectral properties 3 Anchor words Preliminaries X 1,, X n p (x; θ), θ = (θ 1,

More information

Density Estimation. Seungjin Choi

Density Estimation. Seungjin Choi Density Estimation Seungjin Choi Department of Computer Science and Engineering Pohang University of Science and Technology 77 Cheongam-ro, Nam-gu, Pohang 37673, Korea seungjin@postech.ac.kr http://mlg.postech.ac.kr/

More information

Review (Probability & Linear Algebra)

Review (Probability & Linear Algebra) Review (Probability & Linear Algebra) CE-725 : Statistical Pattern Recognition Sharif University of Technology Spring 2013 M. Soleymani Outline Axioms of probability theory Conditional probability, Joint

More information

MAS3301 / MAS8311 Biostatistics Part II: Survival

MAS3301 / MAS8311 Biostatistics Part II: Survival MAS330 / MAS83 Biostatistics Part II: Survival M. Farrow School of Mathematics and Statistics Newcastle University Semester 2, 2009-0 8 Parametric models 8. Introduction In the last few sections (the KM

More information

Covariance function estimation in Gaussian process regression

Covariance function estimation in Gaussian process regression Covariance function estimation in Gaussian process regression François Bachoc Department of Statistics and Operations Research, University of Vienna WU Research Seminar - May 2015 François Bachoc Gaussian

More information

Analysis of incomplete data in presence of competing risks

Analysis of incomplete data in presence of competing risks Journal of Statistical Planning and Inference 87 (2000) 221 239 www.elsevier.com/locate/jspi Analysis of incomplete data in presence of competing risks Debasis Kundu a;, Sankarshan Basu b a Department

More information

Maximum Likelihood Estimation. only training data is available to design a classifier

Maximum Likelihood Estimation. only training data is available to design a classifier Introduction to Pattern Recognition [ Part 5 ] Mahdi Vasighi Introduction Bayesian Decision Theory shows that we could design an optimal classifier if we knew: P( i ) : priors p(x i ) : class-conditional

More information

Latent Variable Models for Binary Data. Suppose that for a given vector of explanatory variables x, the latent

Latent Variable Models for Binary Data. Suppose that for a given vector of explanatory variables x, the latent Latent Variable Models for Binary Data Suppose that for a given vector of explanatory variables x, the latent variable, U, has a continuous cumulative distribution function F (u; x) and that the binary

More information

Statistics for analyzing and modeling precipitation isotope ratios in IsoMAP

Statistics for analyzing and modeling precipitation isotope ratios in IsoMAP Statistics for analyzing and modeling precipitation isotope ratios in IsoMAP The IsoMAP uses the multiple linear regression and geostatistical methods to analyze isotope data Suppose the response variable

More information

Analysis of Incomplete Non-Normal Longitudinal Lipid Data

Analysis of Incomplete Non-Normal Longitudinal Lipid Data Analysis of Incomplete Non-Normal Longitudinal Lipid Data Jiajun Liu*, Devan V. Mehrotra, Xiaoming Li, and Kaifeng Lu 2 Merck Research Laboratories, PA/NJ 2 Forrest Laboratories, NY *jiajun_liu@merck.com

More information

Exact Inference for the Two-Parameter Exponential Distribution Under Type-II Hybrid Censoring

Exact Inference for the Two-Parameter Exponential Distribution Under Type-II Hybrid Censoring Exact Inference for the Two-Parameter Exponential Distribution Under Type-II Hybrid Censoring A. Ganguly, S. Mitra, D. Samanta, D. Kundu,2 Abstract Epstein [9] introduced the Type-I hybrid censoring scheme

More information

Statistical Methods for Alzheimer s Disease Studies

Statistical Methods for Alzheimer s Disease Studies Statistical Methods for Alzheimer s Disease Studies Rebecca A. Betensky, Ph.D. Department of Biostatistics, Harvard T.H. Chan School of Public Health July 19, 2016 1/37 OUTLINE 1 Statistical collaborations

More information

Survival Analysis for Interval Censored Data - Nonparametric Estimation

Survival Analysis for Interval Censored Data - Nonparametric Estimation Survival Analysis for Interval Censored Data - Nonparametric Estimation Seminar of Statistics, ETHZ Group 8, 02. May 2011 Martina Albers, Nanina Anderegg, Urs Müller Overview Examples Nonparametric MLE

More information

Lecture 2: Linear Models. Bruce Walsh lecture notes Seattle SISG -Mixed Model Course version 23 June 2011

Lecture 2: Linear Models. Bruce Walsh lecture notes Seattle SISG -Mixed Model Course version 23 June 2011 Lecture 2: Linear Models Bruce Walsh lecture notes Seattle SISG -Mixed Model Course version 23 June 2011 1 Quick Review of the Major Points The general linear model can be written as y = X! + e y = vector

More information

ABHELSINKI UNIVERSITY OF TECHNOLOGY

ABHELSINKI UNIVERSITY OF TECHNOLOGY Cross-Validation, Information Criteria, Expected Utilities and the Effective Number of Parameters Aki Vehtari and Jouko Lampinen Laboratory of Computational Engineering Introduction Expected utility -

More information

Robust Testing and Variable Selection for High-Dimensional Time Series

Robust Testing and Variable Selection for High-Dimensional Time Series Robust Testing and Variable Selection for High-Dimensional Time Series Ruey S. Tsay Booth School of Business, University of Chicago May, 2017 Ruey S. Tsay HTS 1 / 36 Outline 1 Focus on high-dimensional

More information