Statistical Comparison of ROC Curves from Multiple Readers

Size: px
Start display at page:

Download "Statistical Comparison of ROC Curves from Multiple Readers"

Transcription

1 Statistical Comparison of ROC Curves from Multiple Readers MARIJKE SWAVING, MSc, HANS VAN HOUWELINGEN, PhD, FENNO P. OTTES, PhD, TON STEERNEMAN, PhD Receiver operating characteristic (ROC) analysis is the commonly accepted method for comparing diagnostic imaging systems. In general, ROC studies are designed in such a way that multiple readers read the same images and each image is presented by means of two different imaging systems. Statistical methods for the comparison of the ROC curves from one reader have been developed, but extension of these methods to multiple readers is not straightforward. A new method of analysis is presented for the comparison of ROC curves from multiple readers. This method includes a nonparametric estimation of the variances and covariances between the various areas under the curves. The method described is more appropriate than the paired t test, because it also takes the case-sample variation into account. Key words: ROC curves; ROC analysis; nonparametric estimation; area under the curve; comparison of ROC curves; EM algorithm. (Med Decis Making 1996;16: ) Receiver operating characteristic (ROC) studies are frequently used to compare the diagnostic qualities of imaging systems.l ROC studies can also be used to evaluate the performance of systems to support the diagnosis. In a conventional ROC study,&dquo; several observers read all the selected images by means of each of the imaging systems that are to be compared. The diagnosis the observer makes depends on his or her confidence that the particular image shows an abnormality or a normal state and upon the confidence threshold he adopts. The diagnosis the observer makes can be correct or incorrect (table 1). For a given confidence threshold, the fraction of abnormal images that are correctly identified as abnormal is called the true-positive fraction (TPF = sensitivity) and the fraction of the normal images that are correctly identified is called the true-negative fraction (TNF specificity). = In the same way, the false-positive fraction (FPF) and the false-negative fraction (FNF) are defined. For the actually normal and the actually abnormal images, probability distributions can be derived for the various states of Received June 22, 1994 from BAZIS, Experimental Developments, Leiden, The Netherlands (MS, FPO); the Department of Medical Statistics, Leiden University, Leiden, The Netherlands (JCVH); and the Department of Econometrics, University of Groningen, Groningen, The Netherlands (AGMS). Revision accepted for publication September 19, Dr. Steerneman was supported by the Stichting Verzekeringswetenschap and the C. R. Rao Foundation. Address correspondence and reprint requests to Dr. Ottes: HISCOM Research and Development, Schipholweg 97, 2316 XA Leiden, The Netherlands. Table 1. The Types of Correct and Incorrect Diagnoses truth. Two such distributions are shown in figure 1. The horizontal axis represents the confidence the observer has that a particular image was taken from a patient who actually has an abnormal condition. The confidence threshold separates the abnormal diagnoses from the normal diagnoses. The inherent discriminatory capacity of a system depends on the extent to which the probability distributions of perceived evidence from various states of truth are separated or overlap. The sensitivity and the specificity of a diagnostic imaging system depend upon the particular confidence threshold that the observer uses. According to this model, an image is read as positive if the observer s confidence in a positive diagnosis exceeds his confidence threshold. For a range of confidence thresholds, the ROC curve represents the link between the fraction of abnormal images that are correctly diagnosed as abnormal (TPF) and the fraction of normal images that are diagnosed as abnormal (FPF). Therefore, in an ROC study, an observer is asked to adopt several confidence thresholds at the same time, as shown in figure 2, and classify each image in one of these categories denoting his or her confidence. The diagnostic quality of an imaging system can be represented by an ROC curve. The ROC curve can be estimated as a curve through the operating 143

2 144 FIGURE 1. Probability densities of an observer s confidence in a positive diagnosis for a particular diagnostic task. points, which are obtained by the classifications of the images by a reader. Usually it is assumed that the functional form of an ROC curve is given by two underlying normal distributions. The ROC curve denotes the discriminatory capacity of the imaging system between the normal and the abnormal images. A higher ROC curve indicates greater discriminatory capacity. To compare imaging systems, the area under the ROC curve is usually taken as an index of diagnostic quality. Two imaging systems yield the same diagnostic quality when the areas under their ROC curves are equal. If the same images are read and/or the same reader reads the images presented by both imaging systems, the ROC curves (and the areas beneath them) will be correlated. Metz et al.3 and Hanley and McNeil4 have developed several methods for the comparison of two correlated ROC curves. These methods are described below. They cannot be generalized to multiple readers. A nonparametric approach to the comparison of the areas under two or more correlated ROC curves has been reported by DeLong et al. Until now, conclusions about the diagnostic qualities of imaging systems have been drawn by comparing ROC curves per reader, by comparing pooled ROC curves or the averaged areas under the curves and subsequently performing a test for two correlated ROC curves, or by using a paired t test to analyze the areas under the curves for all readers. Poolings is not a good alternative for handling correlated ROC curves, and a test for two correlated ROC curves is not designed for such pooled curves. The paired t test takes both between-reader variation and within-reader variation into account, but it does not take case-sample variation into account. Therefore, the need was felt for a more suitable method than the paired t test. We present a method that takes into account casesample variation. We provide an example of eight correlated ROC curves derived from four readers interpretations of mammograms, then discuss our proposed method. The confidence-rating model, sketched for a five-cat- FIGURE 2. egory scale.

3 145 Comparison of Correlated ROC Curves Under the assumption of a bivariate normal ROC curve, the curve can be plotted as a straight line on double-normal-deviate papery This straight line can be represented by means of two parameters, the intercept a and the slope ~3. The area under the ROC curve is equal to where 4$ denotes the cumulative standard normal distribution function. For the estimation of the area under the curve, AZ, we can substitute the estimates a and b for a and P, respectively. The statistic proposed by Metz for testing the null hypothesis Ho:Az1 This statistic has variance which is estimated by This test is executed by the software package COR- ROC2.$ The statistic W approximately follows a normal distribution, with zero mean and variance S w under Ho. This test can be used for the comparison of two ROC curves only. Therefore, this test is not applicable to an ROC study, when multiple observers read the same images and when each image is presented by means of two imaging systems. Hanley and McNeil4 9 developed a method that tries to estimate the correlation between the two areas to be compared. The variance of the difference in the areas under the curves is given by where SD denotes the standard deviation and p is the correlation coefficient. If the areas are highly positively correlated, the gain of using a paired test is big, because of the reduction in the variance. The calculation of the correlation coefficient p depends on two intermediate correlation coefficients p, and p,,, the correlation coefficients for the ratings of the positive patients and the negative patients imaged by the two modalities, respectively. Each of these can be estimated using Kendall s tau. The statistic W (equation 2) can be used again. This test is not applicable to the comparison of more than two ROC curves. DeLong and colleagues present a nonparametric approach to compare two or more correlated ROC curves. They assume that the readings of the images of two different patients are independent, but dependence of the readings of the images of one patient is allowed. Within this framework, they have to deal with correlated ROC curves. If there are, e.g., two imaging systems and four readers, then we have eight readings for each patient. However, in this situation the assumption of independence is no longer realistic, because the readings by the same reader will be correlated, even when he or she interprets the images of two different patients. In the next section we deal with correlated ROC curves in this more general case, where correlation between readings of the images of different patients occurs. We apply a nonparametric approach. Swets and Pickett, 7 and Dorfman, Berbaum, and Metz&dquo; have discussed methods for statistical comparison of ROC-curve estimates from multiple readers. We also have to deal with a similar situation: a factorial experimental design in which the readers interpret the same images in all modalities, but they do this only once. Application of the method of Swets and Pickett is not possible in this situation, because they require re-reading of images. Besides, it is difficult to obtain the appropriate variances and covariances. Swets and Pickett implicitly postulated an ANOVA type of model with a fixed effect for the modality; the other effects are random. Dorfman et al. also apply such a model, but it is of a more general type, in the sense that two additional second-order interaction terms of random type are included. An important difference is that Dorfman et al. do not postulate their model for the estimated areas under the ROC curves but for the pseudovalues derived for the jackknife method. The jackknife analysis can be suitable in the ANOVA context if the usual assumptions of normality of the error terms and the random effects are weakened. Both articles assume additivity of the various effects in the model. In our method we do not assume this. We follow a nonparametric approach and we do not require an ANOVA kind of specification for the various effects.

4 146 FIGURE 3. An ROC curve obtained without the assumption of bivariate normality. Comparison of ROC Curves from Multiple Readers with Case-Sample Variation The method for comparing ROC curves from multiple readers (the number of readers is denoted by n) is based upon the fact that the same images are read under two conditions. If the areas under the ROC curves are equal for the two imaging systems, then the imaging systems yield the same diagnostic quality. We make only minimal assumptions about the form of the ROC curve. Therefore, we do not assume a bivariate normal ROC curve. The ROC curve is obtained by connecting the operating points by straight lines. Figure 3 represents such an ROC curve, in which the variable X describes the confidence an observer has in a diagnosis of abnormality when the image is actually normal and the variable Y describes the confidence in a diagnosis of abnormality when the image is abnormal. Here h denotes an operating point, h = 0,..., c; where c denotes the number of confidence categories the observer can choose from. If X and Y are independent random variables, the area under this curve is given by the elements wy are given by For the ROC curve, the area under the curve can be estimated. For each normal image and for each abnormal image the reader can choose from c categories to classify the image. This classification process is represented by two multinomial distributions with estimated probability vectors px and py. The variance of d is estimated by where tr stands for the trace of the matrix. For the estimation of the variance-covariance matrices cov(p,) and cov(py), we make use of the underlying multinomial distribution: In the following text we work with We can estimate A from the data by d = pkwpy, where px and py are vectors denoting the estimated probabilities that a normal image and an abnormal image, respectively, are classified in category h; h = 1,..., c. W is a square matrix of dimension k, and where diag(px) and diag(p,.) denote diagonal matrices with the elements of the vector px and the vector py, respectively, at the main diagonal. Since Az is esti- = mated by 1/2 + 1/2. d, the variance of the area under the curve is estimated by

5 147 In our ROC study, multiple readers interpreted the same images under two conditions. Given the selection of readers i = 1,..., n and the imaging systems k = 1, 2, we have to deal with the areas AZ,k. Indeed, conditionally given the sample of readers, the areas A~ can differ between readers for a fixed imaging system k. The areas are estimated by AZ,k = 1/2 + V2. d,k- Since the d,k are obtained from the same case sample, we also have to deal with the covariances between the d,k, for i = 1,..., n and k = 1, 2. More formally, we have to estimate cov(d,k, d 1); i, j = 1,..., n, k, 1 = 1, 2. The covariances between two areas under the curves can be obtained by looking at the similarities and dissimilarities between the classifications of the same images. This is done for the classifications of the same readers and for the classifications of different readers as well. We also need the cross-classifications by the readers. Therefore, we look at the jointly multinomial distribution of the probability vectors. We obtain the following estimators where n, and nz, respectively, denote the number of normal images and the number of abnormal images. The squared matrix PXlk,}1 of dimension c is cal- to the culated as follows. Its (ci, c2) element is equal fraction of normal images in the sample that are classified in category C1 (and cz, respectively) by reader i (and j, respectively) under imaging system k (and 1). So, the matrix PXlk,j1 gives the relative frequency table of the normal images in the sample cross-classified according to the readings by reader i (and j) of images from imaging system k (and 1). For, e.g., the first reader of the normal mammograms, from the example that is presented below, the joint multinomial distribution is estimated by means of the following probabilities: Notice that pxii,iz is a square matrix of dimension to the fraction of the c, and element (1, 2) is equal 50 images that are placed in category 1 by reader 1 under system 1 and in category 2 by reader 1 under system 2. Similar probabilities can also be estimated for different readers. These covariances can be used for the estimation of the covariance between two areas, i.e., By means of the equations 8 and 13, the variances of the areas under the curves and the covariances between various areas under the curves can be esti- We now define a difference vector D, with D, = A.,, - Az,z 1/2(d,l - d,z). The dimension of the difference vector D is equal to n, the number of ob- = servers in the study. The assumption is made that D approximately follows a multivariate normal distribution with expectation )JLL, where L denotes the vector of dimension n filled with ones, and variancecovariance matrix ~. This assumption allows that the areas./1~ are different between readers for given k = 1, 2. It is, however, assumed that the differences AZ,1 - A,,2 are equal. In the previous analysis, it was implicitly assumed that a reader is completely certain about his classifications. In practice, it is likely that the reader is uncertain about the category into which a given image should be classified. So, if the same samples of images are presented to the readers after some time, then another value D, will be obtained. It is realistic to assume that var(d, - D,) 5~ 0. In the foregoing we have implied that var(d, - D,) = 0. The variance of the measurement or, equivalently, the within-reader variance is denoted by a2 ~ 0 for all readers. If the hypothesis of no measurement error would hold, then the analysis should obtain a negligible estimate for Q2. Testing Ho: equal diagnostic quality can now be translated into testing Ho:..L = 0, i.e., no difference in the diagnostic qualities of the two imaging systems per reader. The difference with respect to the paired t test is the calculation of I. For the paired t test it is assumed that each element of the vector D approximately follows a normal distribution with expectation }jl and a variance of Q2 and that the elements of this vector are independent. The variance Q2 is estimated by the sampling variance of the vector. However, we assume that the vector D approximately follows a multivariate normal distribution. The variance-covariance matrix I of D consists of two parts. One part of the variation in D can be explained by the fact that the same images are read multiple times, the case-sample variation. The other part of the variation is caused by a withinreaders effect. We denote the case-sample variance by the matrix C, and the within-readers variation by

6 148 Q2. Thus, the model can be written as where I is the unit matrix of dimension n. The variance-covariance matrix C is given by QZ. Subsequently, each iteration of the algorithm consists of two steps: Expectation (E) step: Compute Maximization (M) step: The harder part is the E step. algebra it can be shown that With some linear We can substitute the variances (equation 8) and covariances (equation 13) in equation 15, the formula for C, to obtain an estimate of the case-sample variance. For the estimation of the within-readers variation QZ we make use of the restricted maximum likelihood (REML) methods approach and the expectation-maximization (EM) algorithm.&dquo; The REML approach projects D on a subspace orthogonal to ii, by where Repeated application of the E and M steps ultimately leads to an estimate of QZ that maximizes the corresponding likelihood function. For the estimation of >, the maximum-likelihood 3 method is applied. We know D - Nn(fl.L, (T 21 + C), with or and C &dquo;known by estimation.&dquo; The maximum-likelihood estimator for )JL is Then D has zero expectation and variance-covariance matrix The estimated variance of the estimator of )JL is The EM algorithm is based upon the representation with independent D1 and DZ, both having expectation zero and with variance-covariance matrices Finally, the comparison of ROC curves with multiple readers can be performed. The null hypothesis stating that the diagnostic qualities of the two modalities are equal is translated into Ho : )JL = 0. Thus, the statistic If D1 were observable, an estimator of QZ could be found by The EM approach starts with an initial estimate of is defined. Under Ho, the distribution of T depends on the magnitude of the within-readers variation QZ. For small Q2, T approximately follows a standard normal distribution. For large cr2, T approximately follows a Student s-t distribution with n - 1 degrees of freedom, and the test coincides with the paired t test.

7 149 The steps to be taken in comparing ROC curves from multiple readers may be summarized as follows : Table 2. Areas under the Curves (with Standard Deviations) for the Bivariate Normal ROC Curves of Four Readers. For each reader-modality combination, estimate the area under the curve in a nonparametric way.. Compute the difference vector D. Estimate the variance-covariance matrix C by estimating the variances of d 1 and d,2, i = 1,..., n, the covariances between d,, and djlj between d 1 and d,2, between d,z and d*, and between day, and djz) i) j = 1)..., n. Estimate with the EM algorithm the within-readers variation Q 2.. Estimate > by means of the maximum-likelihood method. Estimate the variance of p,.. Compute the value of the test statistic T and compare this value with the values of the appropriate distribution. Example We illustrate the method using an ROC study of mammogram interpretations.14 For this study, 50 films with small calcifications and 50 films without small calcifications were selected. All other abnormalities in the films were ignored. The diagnostic truth of the images was determined by two radiologists who specialized in interpreting mammograms. The images were digitized by a video scanner. In this study, four readers were involved. These observers read both the conventional film and the digitized images. They were asked to use a five-point rating scale to represent their confidence in identifying the abnormality sought after. The categories of the certainty scale were: 1 Definitely no small calcifications present 2 Probably no small calcifications present 3 Unsure whether small calcifications are present 4 Probably small calcifications present 5 Definitely small calcifications present The images per imaging system were all read in one session, and after some unspecified time, the images were read again, in a completely different order, by means of the other imaging system. In table 2, the areas under the curves for the different readers are listed, assuming a bivariate normal form for the ROC curves. for all readers the areas under the Although curves are less for the readings of the digitized images than for the conventional films, it is hard to prove a difference in diagnostic quality by statistics. A paired t test produces a one-sided p value of Thus, no conclusion can be made from a test for Table 3. P-values of Testing the Equality of the Diagnostic Qualities of Two Modalities (Results of CORROC2) for Four Readers the hypothesis Ho: equal diagnostic quality, against H,: the diagnostic quality of the digitized images is less than the diagnostic quality of the conventional film, because of the level of significance used (5%) and because of the fact that the Student s-t statistic only asymptotically follows a standard normal distribution. If we test per reader the equality of the to the areas under the curves (table 3) according method of Metz,3 the null hypothesis will be rejected for the readers 1, 3, and 4, but for reader 2 the null hypothesis will be accepted. Therefore, according to the method of Metz, no overall statement about the can be made. diagnostic qualities of the two systems In such situations, the paired t test can help, but even with the paired t test it was not possible to draw an overall conclusion. In figures 4, 5, 6, and 7, the operating points and the estimated ROC curves are given, together with their 95% confidence bands obtained from ROCFIT (a program devised by C. E. Metz), but the Working-Hotelling-type bands of Ma and Hall&dquo; could also be drawn. The behaviors of the bounds Al-4 in figures 4 and 5 are rather strange. It is not clear how ROCFIT produced such bounds. One would expect confidence intervals that are narrow at the lower left and upper right ends of the ROC curve. Table 4 shows the nonparametric estimates of the areas under the curves, together with the standard deviations. The difference vector D = Az,i - AZ,Z is estimated by

8 150 FIGURE 4. The operating points from reader 1 with the estimated curve and a 95% confidence interval for the mammography study. FIGURE 5. The operating points from reader 2 with the estimated curve and a 95% confidence interval for the mammography study. FIGURE 6. The operating points from reader 3 with the estimated curve and a 95% confidence interval for the mammography study.

9 151 FIGURE 7. The operating points from reader 4 with the estimated curve and a 95% confidence interval for the mammography study. A paired t test for this difference vector gives a p value of 0.08; thus, the null hypothesis is accepted according to the paired t test. The variance-covariance matrix C that represents the case-sample variation is given by The estimated variance of the estimate of )JL is Testing the equality of the two imaging systems can be translated into testing Ho: J.L = 0. Therefore, we use the test statistic T, Approximately, D - N4(f.LL, QZI + C). We estimate Q2 by means of REML and the EM algorithm. Table 5 shows the estimates of DiDI and or obtained by the EM algorithm. The EM estimator of the between-readers variation is QZ = Substituting this value in the assumed model for D, we get D - N4(f.LL, 2), with 2 = Q 21 + C estimated by Table 4 o Areas under the Curves (with Standard Deviation) Estimated in a Nonparametrical Way for Four Readers Obviously, the estimate for QZ is not negligible in comparison with the elements of C. This means that we indeed have to take account of within-reader variation, something like measurement error, or classification uncertainty of the readers. The next step is to obtain the maximum-likelihood estimator of ~,: Table 5. Estimates of D;D,, and Q2 by the EM Algorithm

10 152 The magnitude of Q is large relative to the elements of C; thus, we compare the value of T with a Student s-t distribution with three degrees of freedom. The corresponding p value is less than 0.5; thus, the two imaging systems differ in diagnostic quality. Discussion The method described is designed to test the differences between ROC curves obtained from multiple readers in the presence of within-reader variation in a nonparametric context. The analysis does not make use of the assumption of bivariate normal ROC curves. The areas under the ROC curves are estimated in a nonparametric way. Asymptotically the vector D, consisting of the differences in the areas under the curves per reader, follows a normal distribution with expectation fjl and variance-covariance matrix E. This variance-covariance matrix consists of a within-readers effect and of a casesample effect. The case-sample effect can be estimated directly from the variances of and the covariances between the various areas under the curves. For the estimation of the within-readers effect, we make use of the EM algorithm. The expectation of the components of the vector D can be estimated by means of the maximum-likelihood method. Testing whether the two imaging systems yield equal diagnostic quality can be translated into testing whether this expectation is equal to zero. For this test, the assumption is made that the vector D has a constant expectation for all readers, e.g., the gain of using another imaging system is the same for all readers. Therefore, this test will be the most powerful for a homogeneous group of readers. The advantage of the method is that it is more suitable than the paired t test, because it takes into account the case-sample variance. But for large cr 2, T approximately follows a Student s-t distribution with n - 1 degrees of freedom (n denoting the number of readers in the study). Hence the critical values (and the accuracy) of this test will heavily depend on the number of readers in the study. The statistical method used is currently being implemented in software. The authors thank the two reviewers for their constructive remarks. References 1. Metz CE. ROC Methodology in radiologic imaging. Invest Radiol. 1986;21: Metz CE. Basic principles of ROC analysis. Semin Nucl Med. 1978;8: Metz CE, Wang PL, Kronman HB. A New Approach for Testing the Significance of Differences between ROC Curves Measured from Correlated Data. Proceedings of the VIIIth Conference on Information Processing in Medical Imaging, Hanley JA, McNeil BJ. A method of comparing the areas under receiver operating characteristics (ROC) curves derived from the same cases. Radiology. 1983;148: DeLong ER, DeLong DM, Clarke-Pearson DL. Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics. 1988;44: Metz CE. Some practical issues of experimental design and data analysis in radiologic ROC studies. Invest Radiol. 1989; 24: Swets JA, Pickett RM. Evaluation of Diagnostic Systems, Methods from Signal Detection Theory. New York: Academic Press, Metz CE, Kronman HB. "CORROC2" Program (IBM-PC Version), Hanley JA, McNeil BJ. The meaning and the use of the area under a receiver operating characteristic (ROC) curve. Radiology. 1982;143: Dorfman DD, Berbaum KS, Metz CE. Receiver operating characteristic rating analysis: generalization to the population of readers and patients with the jackknife method. Invest Radiol. 1992;27: Stewart J. Econometrics. Philip Allan, Dempster AP, Laird NM, Rubin DB. Maximum likelihood from incomplete data via the EM algorithm. J R Statist Soc. 1977;B39: Mood AM, Graybill FA, Boes DC. Introduction to the Theory of Statistics. New York: McGraw-Hill, Klessens PM, Barneveld Binkhuysen FH, Ottes FP, Winter LHL, Willemse APP, de Valk JPJ. Diagnostic evaluation of a PACS subsystem using mammographs. An ROC analysis. J Med Informatics. 1988;13: Ma G, Hall WJ. Confidence bands for receiver operating characteristic curves. Med Decis Making. 1993;13:191-7.

Estimating Optimum Linear Combination of Multiple Correlated Diagnostic Tests at a Fixed Specificity with Receiver Operating Characteristic Curves

Estimating Optimum Linear Combination of Multiple Correlated Diagnostic Tests at a Fixed Specificity with Receiver Operating Characteristic Curves Journal of Data Science 6(2008), 1-13 Estimating Optimum Linear Combination of Multiple Correlated Diagnostic Tests at a Fixed Specificity with Receiver Operating Characteristic Curves Feng Gao 1, Chengjie

More information

Feature selection and classifier performance in computer-aided diagnosis: The effect of finite sample size

Feature selection and classifier performance in computer-aided diagnosis: The effect of finite sample size Feature selection and classifier performance in computer-aided diagnosis: The effect of finite sample size Berkman Sahiner, a) Heang-Ping Chan, Nicholas Petrick, Robert F. Wagner, b) and Lubomir Hadjiiski

More information

An Approximate Test for Homogeneity of Correlated Correlation Coefficients

An Approximate Test for Homogeneity of Correlated Correlation Coefficients Quality & Quantity 37: 99 110, 2003. 2003 Kluwer Academic Publishers. Printed in the Netherlands. 99 Research Note An Approximate Test for Homogeneity of Correlated Correlation Coefficients TRIVELLORE

More information

Performance Evaluation and Comparison

Performance Evaluation and Comparison Outline Hong Chang Institute of Computing Technology, Chinese Academy of Sciences Machine Learning Methods (Fall 2012) Outline Outline I 1 Introduction 2 Cross Validation and Resampling 3 Interval Estimation

More information

A process capability index for discrete processes

A process capability index for discrete processes Journal of Statistical Computation and Simulation Vol. 75, No. 3, March 2005, 175 187 A process capability index for discrete processes MICHAEL PERAKIS and EVDOKIA XEKALAKI* Department of Statistics, Athens

More information

" M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2

 M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2 Notation and Equations for Final Exam Symbol Definition X The variable we measure in a scientific study n The size of the sample N The size of the population M The mean of the sample µ The mean of the

More information

Principal component analysis

Principal component analysis Principal component analysis Motivation i for PCA came from major-axis regression. Strong assumption: single homogeneous sample. Free of assumptions when used for exploration. Classical tests of significance

More information

Sampling Distributions: Central Limit Theorem

Sampling Distributions: Central Limit Theorem Review for Exam 2 Sampling Distributions: Central Limit Theorem Conceptually, we can break up the theorem into three parts: 1. The mean (µ M ) of a population of sample means (M) is equal to the mean (µ)

More information

Non parametric ROC summary statistics

Non parametric ROC summary statistics Non parametric ROC summary statistics M.C. Pardo and A.M. Franco-Pereira Department of Statistics and O.R. (I), Complutense University of Madrid. 28040-Madrid, Spain May 18, 2017 Abstract: Receiver operating

More information

Multivariate Statistical Analysis

Multivariate Statistical Analysis Multivariate Statistical Analysis Fall 2011 C. L. Williams, Ph.D. Lecture 9 for Applied Multivariate Analysis Outline Addressing ourliers 1 Addressing ourliers 2 Outliers in Multivariate samples (1) For

More information

PIRLS 2016 Achievement Scaling Methodology 1

PIRLS 2016 Achievement Scaling Methodology 1 CHAPTER 11 PIRLS 2016 Achievement Scaling Methodology 1 The PIRLS approach to scaling the achievement data, based on item response theory (IRT) scaling with marginal estimation, was developed originally

More information

Experimental Design and Data Analysis for Biologists

Experimental Design and Data Analysis for Biologists Experimental Design and Data Analysis for Biologists Gerry P. Quinn Monash University Michael J. Keough University of Melbourne CAMBRIDGE UNIVERSITY PRESS Contents Preface page xv I I Introduction 1 1.1

More information

Methods for Identifying Out-of-Trend Data in Analysis of Stability Measurements Part II: By-Time-Point and Multivariate Control Chart

Methods for Identifying Out-of-Trend Data in Analysis of Stability Measurements Part II: By-Time-Point and Multivariate Control Chart Peer-Reviewed Methods for Identifying Out-of-Trend Data in Analysis of Stability Measurements Part II: By-Time-Point and Multivariate Control Chart Máté Mihalovits and Sándor Kemény T his article is a

More information

STATISTICS SYLLABUS UNIT I

STATISTICS SYLLABUS UNIT I STATISTICS SYLLABUS UNIT I (Probability Theory) Definition Classical and axiomatic approaches.laws of total and compound probability, conditional probability, Bayes Theorem. Random variable and its distribution

More information

401 Review. 6. Power analysis for one/two-sample hypothesis tests and for correlation analysis.

401 Review. 6. Power analysis for one/two-sample hypothesis tests and for correlation analysis. 401 Review Major topics of the course 1. Univariate analysis 2. Bivariate analysis 3. Simple linear regression 4. Linear algebra 5. Multiple regression analysis Major analysis methods 1. Graphical analysis

More information

The purpose of this section is to derive the asymptotic distribution of the Pearson chi-square statistic. k (n j np j ) 2. np j.

The purpose of this section is to derive the asymptotic distribution of the Pearson chi-square statistic. k (n j np j ) 2. np j. Chapter 9 Pearson s chi-square test 9. Null hypothesis asymptotics Let X, X 2, be independent from a multinomial(, p) distribution, where p is a k-vector with nonnegative entries that sum to one. That

More information

Declarative Statistics

Declarative Statistics Declarative Statistics Roberto Rossi, 1 Özgür Akgün, 2 Steven D. Prestwich, 3 S. Armagan Tarim 3 1 The University of Edinburgh Business School, The University of Edinburgh, UK 2 Department of Computer

More information

Statistical Analysis of Engineering Data The Bare Bones Edition. Precision, Bias, Accuracy, Measures of Precision, Propagation of Error

Statistical Analysis of Engineering Data The Bare Bones Edition. Precision, Bias, Accuracy, Measures of Precision, Propagation of Error Statistical Analysis of Engineering Data The Bare Bones Edition (I) Precision, Bias, Accuracy, Measures of Precision, Propagation of Error PRIOR TO DATA ACQUISITION ONE SHOULD CONSIDER: 1. The accuracy

More information

How do we compare the relative performance among competing models?

How do we compare the relative performance among competing models? How do we compare the relative performance among competing models? 1 Comparing Data Mining Methods Frequent problem: we want to know which of the two learning techniques is better How to reliably say Model

More information

Binary choice 3.3 Maximum likelihood estimation

Binary choice 3.3 Maximum likelihood estimation Binary choice 3.3 Maximum likelihood estimation Michel Bierlaire Output of the estimation We explain here the various outputs from the maximum likelihood estimation procedure. Solution of the maximum likelihood

More information

Chapter Fifteen. Frequency Distribution, Cross-Tabulation, and Hypothesis Testing

Chapter Fifteen. Frequency Distribution, Cross-Tabulation, and Hypothesis Testing Chapter Fifteen Frequency Distribution, Cross-Tabulation, and Hypothesis Testing Copyright 2010 Pearson Education, Inc. publishing as Prentice Hall 15-1 Internet Usage Data Table 15.1 Respondent Sex Familiarity

More information

Lectures on Simple Linear Regression Stat 431, Summer 2012

Lectures on Simple Linear Regression Stat 431, Summer 2012 Lectures on Simple Linear Regression Stat 43, Summer 0 Hyunseung Kang July 6-8, 0 Last Updated: July 8, 0 :59PM Introduction Previously, we have been investigating various properties of the population

More information

Wiley. Methods and Applications of Linear Models. Regression and the Analysis. of Variance. Third Edition. Ishpeming, Michigan RONALD R.

Wiley. Methods and Applications of Linear Models. Regression and the Analysis. of Variance. Third Edition. Ishpeming, Michigan RONALD R. Methods and Applications of Linear Models Regression and the Analysis of Variance Third Edition RONALD R. HOCKING PenHock Statistical Consultants Ishpeming, Michigan Wiley Contents Preface to the Third

More information

MANOVA is an extension of the univariate ANOVA as it involves more than one Dependent Variable (DV). The following are assumptions for using MANOVA:

MANOVA is an extension of the univariate ANOVA as it involves more than one Dependent Variable (DV). The following are assumptions for using MANOVA: MULTIVARIATE ANALYSIS OF VARIANCE MANOVA is an extension of the univariate ANOVA as it involves more than one Dependent Variable (DV). The following are assumptions for using MANOVA: 1. Cell sizes : o

More information

Data Privacy in Biomedicine. Lecture 11b: Performance Measures for System Evaluation

Data Privacy in Biomedicine. Lecture 11b: Performance Measures for System Evaluation Data Privacy in Biomedicine Lecture 11b: Performance Measures for System Evaluation Bradley Malin, PhD (b.malin@vanderbilt.edu) Professor of Biomedical Informatics, Biostatistics, & Computer Science Vanderbilt

More information

Introduction to Statistical Inference

Introduction to Statistical Inference Structural Health Monitoring Using Statistical Pattern Recognition Introduction to Statistical Inference Presented by Charles R. Farrar, Ph.D., P.E. Outline Introduce statistical decision making for Structural

More information

This paper is not to be removed from the Examination Halls

This paper is not to be removed from the Examination Halls ~~ST104B ZA d0 This paper is not to be removed from the Examination Halls UNIVERSITY OF LONDON ST104B ZB BSc degrees and Diplomas for Graduates in Economics, Management, Finance and the Social Sciences,

More information

Overview. Overview. Overview. Specific Examples. General Examples. Bivariate Regression & Correlation

Overview. Overview. Overview. Specific Examples. General Examples. Bivariate Regression & Correlation Bivariate Regression & Correlation Overview The Scatter Diagram Two Examples: Education & Prestige Correlation Coefficient Bivariate Linear Regression Line SPSS Output Interpretation Covariance ou already

More information

Introduction to Statistical Inference Lecture 10: ANOVA, Kruskal-Wallis Test

Introduction to Statistical Inference Lecture 10: ANOVA, Kruskal-Wallis Test Introduction to Statistical Inference Lecture 10: ANOVA, Kruskal-Wallis Test la Contents The two sample t-test generalizes into Analysis of Variance. In analysis of variance ANOVA the population consists

More information

CHAPTER 3 FEATURE EXTRACTION USING GENETIC ALGORITHM BASED PRINCIPAL COMPONENT ANALYSIS

CHAPTER 3 FEATURE EXTRACTION USING GENETIC ALGORITHM BASED PRINCIPAL COMPONENT ANALYSIS 46 CHAPTER 3 FEATURE EXTRACTION USING GENETIC ALGORITHM BASED PRINCIPAL COMPONENT ANALYSIS 3.1 INTRODUCTION Cardiac beat classification is a key process in the detection of myocardial ischemic episodes

More information

Basic Statistics. 1. Gross error analyst makes a gross mistake (misread balance or entered wrong value into calculation).

Basic Statistics. 1. Gross error analyst makes a gross mistake (misread balance or entered wrong value into calculation). Basic Statistics There are three types of error: 1. Gross error analyst makes a gross mistake (misread balance or entered wrong value into calculation). 2. Systematic error - always too high or too low

More information

EEL 851: Biometrics. An Overview of Statistical Pattern Recognition EEL 851 1

EEL 851: Biometrics. An Overview of Statistical Pattern Recognition EEL 851 1 EEL 851: Biometrics An Overview of Statistical Pattern Recognition EEL 851 1 Outline Introduction Pattern Feature Noise Example Problem Analysis Segmentation Feature Extraction Classification Design Cycle

More information

Estimating the parameters of hidden binomial trials by the EM algorithm

Estimating the parameters of hidden binomial trials by the EM algorithm Hacettepe Journal of Mathematics and Statistics Volume 43 (5) (2014), 885 890 Estimating the parameters of hidden binomial trials by the EM algorithm Degang Zhu Received 02 : 09 : 2013 : Accepted 02 :

More information

Regression. Estimation of the linear function (straight line) describing the linear component of the joint relationship between two variables X and Y.

Regression. Estimation of the linear function (straight line) describing the linear component of the joint relationship between two variables X and Y. Regression Bivariate i linear regression: Estimation of the linear function (straight line) describing the linear component of the joint relationship between two variables and. Generally describe as a

More information

The Finite Sample Properties of the Least Squares Estimator / Basic Hypothesis Testing

The Finite Sample Properties of the Least Squares Estimator / Basic Hypothesis Testing 1 The Finite Sample Properties of the Least Squares Estimator / Basic Hypothesis Testing Greene Ch 4, Kennedy Ch. R script mod1s3 To assess the quality and appropriateness of econometric estimators, we

More information

Brandon D. Gallas Stephen L. Hillis

Brandon D. Gallas Stephen L. Hillis Generalized Roe and Metz receiver operating characteristic model: analytic link between simulated decision scores and empirical AUC variances and covariances Brandon D. Gallas Stephen L. Hillis Journal

More information

STAT 501 Assignment 2 NAME Spring Chapter 5, and Sections in Johnson & Wichern.

STAT 501 Assignment 2 NAME Spring Chapter 5, and Sections in Johnson & Wichern. STAT 01 Assignment NAME Spring 00 Reading Assignment: Written Assignment: Chapter, and Sections 6.1-6.3 in Johnson & Wichern. Due Monday, February 1, in class. You should be able to do the first four problems

More information

Latent Variable Models for Binary Data. Suppose that for a given vector of explanatory variables x, the latent

Latent Variable Models for Binary Data. Suppose that for a given vector of explanatory variables x, the latent Latent Variable Models for Binary Data Suppose that for a given vector of explanatory variables x, the latent variable, U, has a continuous cumulative distribution function F (u; x) and that the binary

More information

Assumptions of classical multiple regression model

Assumptions of classical multiple regression model ESD: Recitation #7 Assumptions of classical multiple regression model Linearity Full rank Exogeneity of independent variables Homoscedasticity and non autocorrellation Exogenously generated data Normal

More information

Design of Engineering Experiments Part 2 Basic Statistical Concepts Simple comparative experiments

Design of Engineering Experiments Part 2 Basic Statistical Concepts Simple comparative experiments Design of Engineering Experiments Part 2 Basic Statistical Concepts Simple comparative experiments The hypothesis testing framework The two-sample t-test Checking assumptions, validity Comparing more that

More information

Lehmann Family of ROC Curves

Lehmann Family of ROC Curves Memorial Sloan-Kettering Cancer Center From the SelectedWorks of Mithat Gönen May, 2007 Lehmann Family of ROC Curves Mithat Gonen, Memorial Sloan-Kettering Cancer Center Glenn Heller, Memorial Sloan-Kettering

More information

Subject CS1 Actuarial Statistics 1 Core Principles

Subject CS1 Actuarial Statistics 1 Core Principles Institute of Actuaries of India Subject CS1 Actuarial Statistics 1 Core Principles For 2019 Examinations Aim The aim of the Actuarial Statistics 1 subject is to provide a grounding in mathematical and

More information

ECE521 week 3: 23/26 January 2017

ECE521 week 3: 23/26 January 2017 ECE521 week 3: 23/26 January 2017 Outline Probabilistic interpretation of linear regression - Maximum likelihood estimation (MLE) - Maximum a posteriori (MAP) estimation Bias-variance trade-off Linear

More information

Machine Learning Linear Classification. Prof. Matteo Matteucci

Machine Learning Linear Classification. Prof. Matteo Matteucci Machine Learning Linear Classification Prof. Matteo Matteucci Recall from the first lecture 2 X R p Regression Y R Continuous Output X R p Y {Ω 0, Ω 1,, Ω K } Classification Discrete Output X R p Y (X)

More information

Unsupervised Learning with Permuted Data

Unsupervised Learning with Permuted Data Unsupervised Learning with Permuted Data Sergey Kirshner skirshne@ics.uci.edu Sridevi Parise sparise@ics.uci.edu Padhraic Smyth smyth@ics.uci.edu School of Information and Computer Science, University

More information

Measuring relationships among multiple responses

Measuring relationships among multiple responses Measuring relationships among multiple responses Linear association (correlation, relatedness, shared information) between pair-wise responses is an important property used in almost all multivariate analyses.

More information

Econometrics. 4) Statistical inference

Econometrics. 4) Statistical inference 30C00200 Econometrics 4) Statistical inference Timo Kuosmanen Professor, Ph.D. http://nomepre.net/index.php/timokuosmanen Today s topics Confidence intervals of parameter estimates Student s t-distribution

More information

STATS 200: Introduction to Statistical Inference. Lecture 29: Course review

STATS 200: Introduction to Statistical Inference. Lecture 29: Course review STATS 200: Introduction to Statistical Inference Lecture 29: Course review Course review We started in Lecture 1 with a fundamental assumption: Data is a realization of a random process. The goal throughout

More information

STAT 501 EXAM I NAME Spring 1999

STAT 501 EXAM I NAME Spring 1999 STAT 501 EXAM I NAME Spring 1999 Instructions: You may use only your calculator and the attached tables and formula sheet. You can detach the tables and formula sheet from the rest of this exam. Show your

More information

9 Correlation and Regression

9 Correlation and Regression 9 Correlation and Regression SW, Chapter 12. Suppose we select n = 10 persons from the population of college seniors who plan to take the MCAT exam. Each takes the test, is coached, and then retakes the

More information

Introduction to Statistical Analysis

Introduction to Statistical Analysis Introduction to Statistical Analysis Changyu Shen Richard A. and Susan F. Smith Center for Outcomes Research in Cardiology Beth Israel Deaconess Medical Center Harvard Medical School Objectives Descriptive

More information

Machine Learning and Pattern Recognition, Tutorial Sheet Number 3 Answers

Machine Learning and Pattern Recognition, Tutorial Sheet Number 3 Answers Machine Learning and Pattern Recognition, Tutorial Sheet Number 3 Answers School of Informatics, University of Edinburgh, Instructor: Chris Williams 1. Given a dataset {(x n, y n ), n 1,..., N}, where

More information

* Tuesday 17 January :30-16:30 (2 hours) Recored on ESSE3 General introduction to the course.

* Tuesday 17 January :30-16:30 (2 hours) Recored on ESSE3 General introduction to the course. Name of the course Statistical methods and data analysis Audience The course is intended for students of the first or second year of the Graduate School in Materials Engineering. The aim of the course

More information

Logistic Regression: Regression with a Binary Dependent Variable

Logistic Regression: Regression with a Binary Dependent Variable Logistic Regression: Regression with a Binary Dependent Variable LEARNING OBJECTIVES Upon completing this chapter, you should be able to do the following: State the circumstances under which logistic regression

More information

SIMULTANEOUS CONFIDENCE INTERVALS AMONG k MEAN VECTORS IN REPEATED MEASURES WITH MISSING DATA

SIMULTANEOUS CONFIDENCE INTERVALS AMONG k MEAN VECTORS IN REPEATED MEASURES WITH MISSING DATA SIMULTANEOUS CONFIDENCE INTERVALS AMONG k MEAN VECTORS IN REPEATED MEASURES WITH MISSING DATA Kazuyuki Koizumi Department of Mathematics, Graduate School of Science Tokyo University of Science 1-3, Kagurazaka,

More information

HYPOTHESIS TESTING II TESTS ON MEANS. Sorana D. Bolboacă

HYPOTHESIS TESTING II TESTS ON MEANS. Sorana D. Bolboacă HYPOTHESIS TESTING II TESTS ON MEANS Sorana D. Bolboacă OBJECTIVES Significance value vs p value Parametric vs non parametric tests Tests on means: 1 Dec 14 2 SIGNIFICANCE LEVEL VS. p VALUE Materials and

More information

An Introduction to Path Analysis

An Introduction to Path Analysis An Introduction to Path Analysis PRE 905: Multivariate Analysis Lecture 10: April 15, 2014 PRE 905: Lecture 10 Path Analysis Today s Lecture Path analysis starting with multivariate regression then arriving

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 Data Analysis and Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasini/teaching.html Suhasini Subba Rao Motivations for the ANOVA We defined the F-distribution, this is mainly used in

More information

Performance of Deming regression analysis in case of misspecified analytical error ratio in method comparison studies

Performance of Deming regression analysis in case of misspecified analytical error ratio in method comparison studies Clinical Chemistry 44:5 1024 1031 (1998) Laboratory Management Performance of Deming regression analysis in case of misspecified analytical error ratio in method comparison studies Kristian Linnet Application

More information

Mathematical studies Standard level Paper 2

Mathematical studies Standard level Paper 2 Mathematical studies Standard level Paper 2 Friday 5 May 2017 (morning) 1 hour 30 minutes Instructions to candidates ydo not open this examination paper until instructed to do so. ya graphic display calculator

More information

Testing Equality of Two Intercepts for the Parallel Regression Model with Non-sample Prior Information

Testing Equality of Two Intercepts for the Parallel Regression Model with Non-sample Prior Information Testing Equality of Two Intercepts for the Parallel Regression Model with Non-sample Prior Information Budi Pratikno 1 and Shahjahan Khan 2 1 Department of Mathematics and Natural Science Jenderal Soedirman

More information

ORIGINAL PAPER STATISTICAL METHOD IN A COMPARATIVE STUDY IN WHICH THE STANDARD TREATMENT IS SUPERIOR TO OTHERS

ORIGINAL PAPER STATISTICAL METHOD IN A COMPARATIVE STUDY IN WHICH THE STANDARD TREATMENT IS SUPERIOR TO OTHERS agoya J. Med. Sci. 65. 127 ~ 132, 2002 ORIGIAL PAPER STATISTICAL METHOD I A COMPARATIVE STUDY I WHICH THE STADARD TREATMET IS SUPERIOR TO OTHERS MITSURU IKEDA 1, KAZUHIRO SHIMAMOTO 2, TAKEO ISHIGAKI 3

More information

Applied Multivariate and Longitudinal Data Analysis

Applied Multivariate and Longitudinal Data Analysis Applied Multivariate and Longitudinal Data Analysis Chapter 2: Inference about the mean vector(s) Ana-Maria Staicu SAS Hall 5220; 919-515-0644; astaicu@ncsu.edu 1 In this chapter we will discuss inference

More information

The use of receiver operating characteristic curves in biomedical informatics

The use of receiver operating characteristic curves in biomedical informatics Journal of Biomedical Informatics 38 (2005) 404 415 Methodological Review The use of receiver operating characteristic curves in biomedical informatics Thomas A. Lasko a,b, Jui G. Bhagwat c, Kelly H. Zou

More information

Time: 1 hour 30 minutes

Time: 1 hour 30 minutes Paper Reference(s) 6684/01 Edexcel GCE Statistics S2 Gold Level G3 Time: 1 hour 30 minutes Materials required for examination papers Mathematical Formulae (Green) Items included with question Nil Candidates

More information

Multivariate analysis of variance and covariance

Multivariate analysis of variance and covariance Introduction Multivariate analysis of variance and covariance Univariate ANOVA: have observations from several groups, numerical dependent variable. Ask whether dependent variable has same mean for each

More information

A CUSUM approach for online change-point detection on curve sequences

A CUSUM approach for online change-point detection on curve sequences ESANN 22 proceedings, European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning. Bruges Belgium, 25-27 April 22, i6doc.com publ., ISBN 978-2-8749-49-. Available

More information

PRE-TEST ESTIMATION OF THE REGRESSION SCALE PARAMETER WITH MULTIVARIATE STUDENT-t ERRORS AND INDEPENDENT SUB-SAMPLES

PRE-TEST ESTIMATION OF THE REGRESSION SCALE PARAMETER WITH MULTIVARIATE STUDENT-t ERRORS AND INDEPENDENT SUB-SAMPLES Sankhyā : The Indian Journal of Statistics 1994, Volume, Series B, Pt.3, pp. 334 343 PRE-TEST ESTIMATION OF THE REGRESSION SCALE PARAMETER WITH MULTIVARIATE STUDENT-t ERRORS AND INDEPENDENT SUB-SAMPLES

More information

Test Yourself! Methodological and Statistical Requirements for M.Sc. Early Childhood Research

Test Yourself! Methodological and Statistical Requirements for M.Sc. Early Childhood Research Test Yourself! Methodological and Statistical Requirements for M.Sc. Early Childhood Research HOW IT WORKS For the M.Sc. Early Childhood Research, sufficient knowledge in methods and statistics is one

More information

Lecture 6: April 19, 2002

Lecture 6: April 19, 2002 EE596 Pat. Recog. II: Introduction to Graphical Models Spring 2002 Lecturer: Jeff Bilmes Lecture 6: April 19, 2002 University of Washington Dept. of Electrical Engineering Scribe: Huaning Niu,Özgür Çetin

More information

NAG Library Chapter Introduction. G08 Nonparametric Statistics

NAG Library Chapter Introduction. G08 Nonparametric Statistics NAG Library Chapter Introduction G08 Nonparametric Statistics Contents 1 Scope of the Chapter.... 2 2 Background to the Problems... 2 2.1 Parametric and Nonparametric Hypothesis Testing... 2 2.2 Types

More information

Rejection regions for the bivariate case

Rejection regions for the bivariate case Rejection regions for the bivariate case The rejection region for the T 2 test (and similarly for Z 2 when Σ is known) is the region outside of an ellipse, for which there is a (1-α)% chance that the test

More information

Smart Home Health Analytics Information Systems University of Maryland Baltimore County

Smart Home Health Analytics Information Systems University of Maryland Baltimore County Smart Home Health Analytics Information Systems University of Maryland Baltimore County 1 IEEE Expert, October 1996 2 Given sample S from all possible examples D Learner L learns hypothesis h based on

More information

Psychology 282 Lecture #4 Outline Inferences in SLR

Psychology 282 Lecture #4 Outline Inferences in SLR Psychology 282 Lecture #4 Outline Inferences in SLR Assumptions To this point we have not had to make any distributional assumptions. Principle of least squares requires no assumptions. Can use correlations

More information

Probability and Information Theory. Sargur N. Srihari

Probability and Information Theory. Sargur N. Srihari Probability and Information Theory Sargur N. srihari@cedar.buffalo.edu 1 Topics in Probability and Information Theory Overview 1. Why Probability? 2. Random Variables 3. Probability Distributions 4. Marginal

More information

Accelerating the EM Algorithm for Mixture Density Estimation

Accelerating the EM Algorithm for Mixture Density Estimation Accelerating the EM Algorithm ICERM Workshop September 4, 2015 Slide 1/18 Accelerating the EM Algorithm for Mixture Density Estimation Homer Walker Mathematical Sciences Department Worcester Polytechnic

More information

CIVL /8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8

CIVL /8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8 CIVL - 7904/8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8 Chi-square Test How to determine the interval from a continuous distribution I = Range 1 + 3.322(logN) I-> Range of the class interval

More information

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018 Econometrics I KS Module 2: Multivariate Linear Regression Alexander Ahammer Department of Economics Johannes Kepler University of Linz This version: April 16, 2018 Alexander Ahammer (JKU) Module 2: Multivariate

More information

Bayesian Reasoning. Adapted from slides by Tim Finin and Marie desjardins.

Bayesian Reasoning. Adapted from slides by Tim Finin and Marie desjardins. Bayesian Reasoning Adapted from slides by Tim Finin and Marie desjardins. 1 Outline Probability theory Bayesian inference From the joint distribution Using independence/factoring From sources of evidence

More information

Problem Set 2. MAS 622J/1.126J: Pattern Recognition and Analysis. Due: 5:00 p.m. on September 30

Problem Set 2. MAS 622J/1.126J: Pattern Recognition and Analysis. Due: 5:00 p.m. on September 30 Problem Set 2 MAS 622J/1.126J: Pattern Recognition and Analysis Due: 5:00 p.m. on September 30 [Note: All instructions to plot data or write a program should be carried out using Matlab. In order to maintain

More information

Anomaly Detection. Jing Gao. SUNY Buffalo

Anomaly Detection. Jing Gao. SUNY Buffalo Anomaly Detection Jing Gao SUNY Buffalo 1 Anomaly Detection Anomalies the set of objects are considerably dissimilar from the remainder of the data occur relatively infrequently when they do occur, their

More information

Applied Multivariate Statistical Modeling Prof. J. Maiti Department of Industrial Engineering and Management Indian Institute of Technology, Kharagpur

Applied Multivariate Statistical Modeling Prof. J. Maiti Department of Industrial Engineering and Management Indian Institute of Technology, Kharagpur Applied Multivariate Statistical Modeling Prof. J. Maiti Department of Industrial Engineering and Management Indian Institute of Technology, Kharagpur Lecture - 29 Multivariate Linear Regression- Model

More information

Part III: Unstructured Data

Part III: Unstructured Data Inf1-DA 2010 2011 III: 51 / 89 Part III Unstructured Data Data Retrieval: III.1 Unstructured data and data retrieval Statistical Analysis of Data: III.2 Data scales and summary statistics III.3 Hypothesis

More information

Learning From Crowds. Presented by: Bei Peng 03/24/15

Learning From Crowds. Presented by: Bei Peng 03/24/15 Learning From Crowds Presented by: Bei Peng 03/24/15 1 Supervised Learning Given labeled training data, learn to generalize well on unseen data Binary classification ( ) Multi-class classification ( y

More information

Fundamentals to Biostatistics. Prof. Chandan Chakraborty Associate Professor School of Medical Science & Technology IIT Kharagpur

Fundamentals to Biostatistics. Prof. Chandan Chakraborty Associate Professor School of Medical Science & Technology IIT Kharagpur Fundamentals to Biostatistics Prof. Chandan Chakraborty Associate Professor School of Medical Science & Technology IIT Kharagpur Statistics collection, analysis, interpretation of data development of new

More information

Human-Oriented Robotics. Probability Refresher. Kai Arras Social Robotics Lab, University of Freiburg Winter term 2014/2015

Human-Oriented Robotics. Probability Refresher. Kai Arras Social Robotics Lab, University of Freiburg Winter term 2014/2015 Probability Refresher Kai Arras, University of Freiburg Winter term 2014/2015 Probability Refresher Introduction to Probability Random variables Joint distribution Marginalization Conditional probability

More information

[y i α βx i ] 2 (2) Q = i=1

[y i α βx i ] 2 (2) Q = i=1 Least squares fits This section has no probability in it. There are no random variables. We are given n points (x i, y i ) and want to find the equation of the line that best fits them. We take the equation

More information

Karl-Rudolf Koch Introduction to Bayesian Statistics Second Edition

Karl-Rudolf Koch Introduction to Bayesian Statistics Second Edition Karl-Rudolf Koch Introduction to Bayesian Statistics Second Edition Karl-Rudolf Koch Introduction to Bayesian Statistics Second, updated and enlarged Edition With 17 Figures Professor Dr.-Ing., Dr.-Ing.

More information

Inferences on a Normal Covariance Matrix and Generalized Variance with Monotone Missing Data

Inferences on a Normal Covariance Matrix and Generalized Variance with Monotone Missing Data Journal of Multivariate Analysis 78, 6282 (2001) doi:10.1006jmva.2000.1939, available online at http:www.idealibrary.com on Inferences on a Normal Covariance Matrix and Generalized Variance with Monotone

More information

Detection theory. H 0 : x[n] = w[n]

Detection theory. H 0 : x[n] = w[n] Detection Theory Detection theory A the last topic of the course, we will briefly consider detection theory. The methods are based on estimation theory and attempt to answer questions such as Is a signal

More information

Hypothesis testing: Steps

Hypothesis testing: Steps Review for Exam 2 Hypothesis testing: Steps Exam 2 Review 1. Determine appropriate test and hypotheses 2. Use distribution table to find critical statistic value(s) representing rejection region 3. Compute

More information

PSY 307 Statistics for the Behavioral Sciences. Chapter 20 Tests for Ranked Data, Choosing Statistical Tests

PSY 307 Statistics for the Behavioral Sciences. Chapter 20 Tests for Ranked Data, Choosing Statistical Tests PSY 307 Statistics for the Behavioral Sciences Chapter 20 Tests for Ranked Data, Choosing Statistical Tests What To Do with Non-normal Distributions Tranformations (pg 382): The shape of the distribution

More information

Uncertainty quantification and visualization for functional random variables

Uncertainty quantification and visualization for functional random variables Uncertainty quantification and visualization for functional random variables MascotNum Workshop 2014 S. Nanty 1,3 C. Helbert 2 A. Marrel 1 N. Pérot 1 C. Prieur 3 1 CEA, DEN/DER/SESI/LSMR, F-13108, Saint-Paul-lez-Durance,

More information

University of Cambridge Engineering Part IIB Module 3F3: Signal and Pattern Processing Handout 2:. The Multivariate Gaussian & Decision Boundaries

University of Cambridge Engineering Part IIB Module 3F3: Signal and Pattern Processing Handout 2:. The Multivariate Gaussian & Decision Boundaries University of Cambridge Engineering Part IIB Module 3F3: Signal and Pattern Processing Handout :. The Multivariate Gaussian & Decision Boundaries..15.1.5 1 8 6 6 8 1 Mark Gales mjfg@eng.cam.ac.uk Lent

More information

Parameter estimation and forecasting. Cristiano Porciani AIfA, Uni-Bonn

Parameter estimation and forecasting. Cristiano Porciani AIfA, Uni-Bonn Parameter estimation and forecasting Cristiano Porciani AIfA, Uni-Bonn Questions? C. Porciani Estimation & forecasting 2 Temperature fluctuations Variance at multipole l (angle ~180o/l) C. Porciani Estimation

More information

Model-based Characterization of Mammographic Masses

Model-based Characterization of Mammographic Masses Model-based Characterization of Mammographic Masses Sven-René von der Heidt 1, Matthias Elter 2, Thomas Wittenberg 2, Dietrich Paulus 1 1 Institut für Computervisualistik, Universität Koblenz-Landau 2

More information

2. What are the tradeoffs among different measures of error (e.g. probability of false alarm, probability of miss, etc.)?

2. What are the tradeoffs among different measures of error (e.g. probability of false alarm, probability of miss, etc.)? ECE 830 / CS 76 Spring 06 Instructors: R. Willett & R. Nowak Lecture 3: Likelihood ratio tests, Neyman-Pearson detectors, ROC curves, and sufficient statistics Executive summary In the last lecture we

More information

Linear Models 1. Isfahan University of Technology Fall Semester, 2014

Linear Models 1. Isfahan University of Technology Fall Semester, 2014 Linear Models 1 Isfahan University of Technology Fall Semester, 2014 References: [1] G. A. F., Seber and A. J. Lee (2003). Linear Regression Analysis (2nd ed.). Hoboken, NJ: Wiley. [2] A. C. Rencher and

More information

LECTURE 5. Introduction to Econometrics. Hypothesis testing

LECTURE 5. Introduction to Econometrics. Hypothesis testing LECTURE 5 Introduction to Econometrics Hypothesis testing October 18, 2016 1 / 26 ON TODAY S LECTURE We are going to discuss how hypotheses about coefficients can be tested in regression models We will

More information

Bootstrapping, Randomization, 2B-PLS

Bootstrapping, Randomization, 2B-PLS Bootstrapping, Randomization, 2B-PLS Statistics, Tests, and Bootstrapping Statistic a measure that summarizes some feature of a set of data (e.g., mean, standard deviation, skew, coefficient of variation,

More information