Statistical Comparison of ROC Curves from Multiple Readers
|
|
- Sophie Booker
- 5 years ago
- Views:
Transcription
1 Statistical Comparison of ROC Curves from Multiple Readers MARIJKE SWAVING, MSc, HANS VAN HOUWELINGEN, PhD, FENNO P. OTTES, PhD, TON STEERNEMAN, PhD Receiver operating characteristic (ROC) analysis is the commonly accepted method for comparing diagnostic imaging systems. In general, ROC studies are designed in such a way that multiple readers read the same images and each image is presented by means of two different imaging systems. Statistical methods for the comparison of the ROC curves from one reader have been developed, but extension of these methods to multiple readers is not straightforward. A new method of analysis is presented for the comparison of ROC curves from multiple readers. This method includes a nonparametric estimation of the variances and covariances between the various areas under the curves. The method described is more appropriate than the paired t test, because it also takes the case-sample variation into account. Key words: ROC curves; ROC analysis; nonparametric estimation; area under the curve; comparison of ROC curves; EM algorithm. (Med Decis Making 1996;16: ) Receiver operating characteristic (ROC) studies are frequently used to compare the diagnostic qualities of imaging systems.l ROC studies can also be used to evaluate the performance of systems to support the diagnosis. In a conventional ROC study,&dquo; several observers read all the selected images by means of each of the imaging systems that are to be compared. The diagnosis the observer makes depends on his or her confidence that the particular image shows an abnormality or a normal state and upon the confidence threshold he adopts. The diagnosis the observer makes can be correct or incorrect (table 1). For a given confidence threshold, the fraction of abnormal images that are correctly identified as abnormal is called the true-positive fraction (TPF = sensitivity) and the fraction of the normal images that are correctly identified is called the true-negative fraction (TNF specificity). = In the same way, the false-positive fraction (FPF) and the false-negative fraction (FNF) are defined. For the actually normal and the actually abnormal images, probability distributions can be derived for the various states of Received June 22, 1994 from BAZIS, Experimental Developments, Leiden, The Netherlands (MS, FPO); the Department of Medical Statistics, Leiden University, Leiden, The Netherlands (JCVH); and the Department of Econometrics, University of Groningen, Groningen, The Netherlands (AGMS). Revision accepted for publication September 19, Dr. Steerneman was supported by the Stichting Verzekeringswetenschap and the C. R. Rao Foundation. Address correspondence and reprint requests to Dr. Ottes: HISCOM Research and Development, Schipholweg 97, 2316 XA Leiden, The Netherlands. Table 1. The Types of Correct and Incorrect Diagnoses truth. Two such distributions are shown in figure 1. The horizontal axis represents the confidence the observer has that a particular image was taken from a patient who actually has an abnormal condition. The confidence threshold separates the abnormal diagnoses from the normal diagnoses. The inherent discriminatory capacity of a system depends on the extent to which the probability distributions of perceived evidence from various states of truth are separated or overlap. The sensitivity and the specificity of a diagnostic imaging system depend upon the particular confidence threshold that the observer uses. According to this model, an image is read as positive if the observer s confidence in a positive diagnosis exceeds his confidence threshold. For a range of confidence thresholds, the ROC curve represents the link between the fraction of abnormal images that are correctly diagnosed as abnormal (TPF) and the fraction of normal images that are diagnosed as abnormal (FPF). Therefore, in an ROC study, an observer is asked to adopt several confidence thresholds at the same time, as shown in figure 2, and classify each image in one of these categories denoting his or her confidence. The diagnostic quality of an imaging system can be represented by an ROC curve. The ROC curve can be estimated as a curve through the operating 143
2 144 FIGURE 1. Probability densities of an observer s confidence in a positive diagnosis for a particular diagnostic task. points, which are obtained by the classifications of the images by a reader. Usually it is assumed that the functional form of an ROC curve is given by two underlying normal distributions. The ROC curve denotes the discriminatory capacity of the imaging system between the normal and the abnormal images. A higher ROC curve indicates greater discriminatory capacity. To compare imaging systems, the area under the ROC curve is usually taken as an index of diagnostic quality. Two imaging systems yield the same diagnostic quality when the areas under their ROC curves are equal. If the same images are read and/or the same reader reads the images presented by both imaging systems, the ROC curves (and the areas beneath them) will be correlated. Metz et al.3 and Hanley and McNeil4 have developed several methods for the comparison of two correlated ROC curves. These methods are described below. They cannot be generalized to multiple readers. A nonparametric approach to the comparison of the areas under two or more correlated ROC curves has been reported by DeLong et al. Until now, conclusions about the diagnostic qualities of imaging systems have been drawn by comparing ROC curves per reader, by comparing pooled ROC curves or the averaged areas under the curves and subsequently performing a test for two correlated ROC curves, or by using a paired t test to analyze the areas under the curves for all readers. Poolings is not a good alternative for handling correlated ROC curves, and a test for two correlated ROC curves is not designed for such pooled curves. The paired t test takes both between-reader variation and within-reader variation into account, but it does not take case-sample variation into account. Therefore, the need was felt for a more suitable method than the paired t test. We present a method that takes into account casesample variation. We provide an example of eight correlated ROC curves derived from four readers interpretations of mammograms, then discuss our proposed method. The confidence-rating model, sketched for a five-cat- FIGURE 2. egory scale.
3 145 Comparison of Correlated ROC Curves Under the assumption of a bivariate normal ROC curve, the curve can be plotted as a straight line on double-normal-deviate papery This straight line can be represented by means of two parameters, the intercept a and the slope ~3. The area under the ROC curve is equal to where 4$ denotes the cumulative standard normal distribution function. For the estimation of the area under the curve, AZ, we can substitute the estimates a and b for a and P, respectively. The statistic proposed by Metz for testing the null hypothesis Ho:Az1 This statistic has variance which is estimated by This test is executed by the software package COR- ROC2.$ The statistic W approximately follows a normal distribution, with zero mean and variance S w under Ho. This test can be used for the comparison of two ROC curves only. Therefore, this test is not applicable to an ROC study, when multiple observers read the same images and when each image is presented by means of two imaging systems. Hanley and McNeil4 9 developed a method that tries to estimate the correlation between the two areas to be compared. The variance of the difference in the areas under the curves is given by where SD denotes the standard deviation and p is the correlation coefficient. If the areas are highly positively correlated, the gain of using a paired test is big, because of the reduction in the variance. The calculation of the correlation coefficient p depends on two intermediate correlation coefficients p, and p,,, the correlation coefficients for the ratings of the positive patients and the negative patients imaged by the two modalities, respectively. Each of these can be estimated using Kendall s tau. The statistic W (equation 2) can be used again. This test is not applicable to the comparison of more than two ROC curves. DeLong and colleagues present a nonparametric approach to compare two or more correlated ROC curves. They assume that the readings of the images of two different patients are independent, but dependence of the readings of the images of one patient is allowed. Within this framework, they have to deal with correlated ROC curves. If there are, e.g., two imaging systems and four readers, then we have eight readings for each patient. However, in this situation the assumption of independence is no longer realistic, because the readings by the same reader will be correlated, even when he or she interprets the images of two different patients. In the next section we deal with correlated ROC curves in this more general case, where correlation between readings of the images of different patients occurs. We apply a nonparametric approach. Swets and Pickett, 7 and Dorfman, Berbaum, and Metz&dquo; have discussed methods for statistical comparison of ROC-curve estimates from multiple readers. We also have to deal with a similar situation: a factorial experimental design in which the readers interpret the same images in all modalities, but they do this only once. Application of the method of Swets and Pickett is not possible in this situation, because they require re-reading of images. Besides, it is difficult to obtain the appropriate variances and covariances. Swets and Pickett implicitly postulated an ANOVA type of model with a fixed effect for the modality; the other effects are random. Dorfman et al. also apply such a model, but it is of a more general type, in the sense that two additional second-order interaction terms of random type are included. An important difference is that Dorfman et al. do not postulate their model for the estimated areas under the ROC curves but for the pseudovalues derived for the jackknife method. The jackknife analysis can be suitable in the ANOVA context if the usual assumptions of normality of the error terms and the random effects are weakened. Both articles assume additivity of the various effects in the model. In our method we do not assume this. We follow a nonparametric approach and we do not require an ANOVA kind of specification for the various effects.
4 146 FIGURE 3. An ROC curve obtained without the assumption of bivariate normality. Comparison of ROC Curves from Multiple Readers with Case-Sample Variation The method for comparing ROC curves from multiple readers (the number of readers is denoted by n) is based upon the fact that the same images are read under two conditions. If the areas under the ROC curves are equal for the two imaging systems, then the imaging systems yield the same diagnostic quality. We make only minimal assumptions about the form of the ROC curve. Therefore, we do not assume a bivariate normal ROC curve. The ROC curve is obtained by connecting the operating points by straight lines. Figure 3 represents such an ROC curve, in which the variable X describes the confidence an observer has in a diagnosis of abnormality when the image is actually normal and the variable Y describes the confidence in a diagnosis of abnormality when the image is abnormal. Here h denotes an operating point, h = 0,..., c; where c denotes the number of confidence categories the observer can choose from. If X and Y are independent random variables, the area under this curve is given by the elements wy are given by For the ROC curve, the area under the curve can be estimated. For each normal image and for each abnormal image the reader can choose from c categories to classify the image. This classification process is represented by two multinomial distributions with estimated probability vectors px and py. The variance of d is estimated by where tr stands for the trace of the matrix. For the estimation of the variance-covariance matrices cov(p,) and cov(py), we make use of the underlying multinomial distribution: In the following text we work with We can estimate A from the data by d = pkwpy, where px and py are vectors denoting the estimated probabilities that a normal image and an abnormal image, respectively, are classified in category h; h = 1,..., c. W is a square matrix of dimension k, and where diag(px) and diag(p,.) denote diagonal matrices with the elements of the vector px and the vector py, respectively, at the main diagonal. Since Az is esti- = mated by 1/2 + 1/2. d, the variance of the area under the curve is estimated by
5 147 In our ROC study, multiple readers interpreted the same images under two conditions. Given the selection of readers i = 1,..., n and the imaging systems k = 1, 2, we have to deal with the areas AZ,k. Indeed, conditionally given the sample of readers, the areas A~ can differ between readers for a fixed imaging system k. The areas are estimated by AZ,k = 1/2 + V2. d,k- Since the d,k are obtained from the same case sample, we also have to deal with the covariances between the d,k, for i = 1,..., n and k = 1, 2. More formally, we have to estimate cov(d,k, d 1); i, j = 1,..., n, k, 1 = 1, 2. The covariances between two areas under the curves can be obtained by looking at the similarities and dissimilarities between the classifications of the same images. This is done for the classifications of the same readers and for the classifications of different readers as well. We also need the cross-classifications by the readers. Therefore, we look at the jointly multinomial distribution of the probability vectors. We obtain the following estimators where n, and nz, respectively, denote the number of normal images and the number of abnormal images. The squared matrix PXlk,}1 of dimension c is cal- to the culated as follows. Its (ci, c2) element is equal fraction of normal images in the sample that are classified in category C1 (and cz, respectively) by reader i (and j, respectively) under imaging system k (and 1). So, the matrix PXlk,j1 gives the relative frequency table of the normal images in the sample cross-classified according to the readings by reader i (and j) of images from imaging system k (and 1). For, e.g., the first reader of the normal mammograms, from the example that is presented below, the joint multinomial distribution is estimated by means of the following probabilities: Notice that pxii,iz is a square matrix of dimension to the fraction of the c, and element (1, 2) is equal 50 images that are placed in category 1 by reader 1 under system 1 and in category 2 by reader 1 under system 2. Similar probabilities can also be estimated for different readers. These covariances can be used for the estimation of the covariance between two areas, i.e., By means of the equations 8 and 13, the variances of the areas under the curves and the covariances between various areas under the curves can be esti- We now define a difference vector D, with D, = A.,, - Az,z 1/2(d,l - d,z). The dimension of the difference vector D is equal to n, the number of ob- = servers in the study. The assumption is made that D approximately follows a multivariate normal distribution with expectation )JLL, where L denotes the vector of dimension n filled with ones, and variancecovariance matrix ~. This assumption allows that the areas./1~ are different between readers for given k = 1, 2. It is, however, assumed that the differences AZ,1 - A,,2 are equal. In the previous analysis, it was implicitly assumed that a reader is completely certain about his classifications. In practice, it is likely that the reader is uncertain about the category into which a given image should be classified. So, if the same samples of images are presented to the readers after some time, then another value D, will be obtained. It is realistic to assume that var(d, - D,) 5~ 0. In the foregoing we have implied that var(d, - D,) = 0. The variance of the measurement or, equivalently, the within-reader variance is denoted by a2 ~ 0 for all readers. If the hypothesis of no measurement error would hold, then the analysis should obtain a negligible estimate for Q2. Testing Ho: equal diagnostic quality can now be translated into testing Ho:..L = 0, i.e., no difference in the diagnostic qualities of the two imaging systems per reader. The difference with respect to the paired t test is the calculation of I. For the paired t test it is assumed that each element of the vector D approximately follows a normal distribution with expectation }jl and a variance of Q2 and that the elements of this vector are independent. The variance Q2 is estimated by the sampling variance of the vector. However, we assume that the vector D approximately follows a multivariate normal distribution. The variance-covariance matrix I of D consists of two parts. One part of the variation in D can be explained by the fact that the same images are read multiple times, the case-sample variation. The other part of the variation is caused by a withinreaders effect. We denote the case-sample variance by the matrix C, and the within-readers variation by
6 148 Q2. Thus, the model can be written as where I is the unit matrix of dimension n. The variance-covariance matrix C is given by QZ. Subsequently, each iteration of the algorithm consists of two steps: Expectation (E) step: Compute Maximization (M) step: The harder part is the E step. algebra it can be shown that With some linear We can substitute the variances (equation 8) and covariances (equation 13) in equation 15, the formula for C, to obtain an estimate of the case-sample variance. For the estimation of the within-readers variation QZ we make use of the restricted maximum likelihood (REML) methods approach and the expectation-maximization (EM) algorithm.&dquo; The REML approach projects D on a subspace orthogonal to ii, by where Repeated application of the E and M steps ultimately leads to an estimate of QZ that maximizes the corresponding likelihood function. For the estimation of >, the maximum-likelihood 3 method is applied. We know D - Nn(fl.L, (T 21 + C), with or and C &dquo;known by estimation.&dquo; The maximum-likelihood estimator for )JL is Then D has zero expectation and variance-covariance matrix The estimated variance of the estimator of )JL is The EM algorithm is based upon the representation with independent D1 and DZ, both having expectation zero and with variance-covariance matrices Finally, the comparison of ROC curves with multiple readers can be performed. The null hypothesis stating that the diagnostic qualities of the two modalities are equal is translated into Ho : )JL = 0. Thus, the statistic If D1 were observable, an estimator of QZ could be found by The EM approach starts with an initial estimate of is defined. Under Ho, the distribution of T depends on the magnitude of the within-readers variation QZ. For small Q2, T approximately follows a standard normal distribution. For large cr2, T approximately follows a Student s-t distribution with n - 1 degrees of freedom, and the test coincides with the paired t test.
7 149 The steps to be taken in comparing ROC curves from multiple readers may be summarized as follows : Table 2. Areas under the Curves (with Standard Deviations) for the Bivariate Normal ROC Curves of Four Readers. For each reader-modality combination, estimate the area under the curve in a nonparametric way.. Compute the difference vector D. Estimate the variance-covariance matrix C by estimating the variances of d 1 and d,2, i = 1,..., n, the covariances between d,, and djlj between d 1 and d,2, between d,z and d*, and between day, and djz) i) j = 1)..., n. Estimate with the EM algorithm the within-readers variation Q 2.. Estimate > by means of the maximum-likelihood method. Estimate the variance of p,.. Compute the value of the test statistic T and compare this value with the values of the appropriate distribution. Example We illustrate the method using an ROC study of mammogram interpretations.14 For this study, 50 films with small calcifications and 50 films without small calcifications were selected. All other abnormalities in the films were ignored. The diagnostic truth of the images was determined by two radiologists who specialized in interpreting mammograms. The images were digitized by a video scanner. In this study, four readers were involved. These observers read both the conventional film and the digitized images. They were asked to use a five-point rating scale to represent their confidence in identifying the abnormality sought after. The categories of the certainty scale were: 1 Definitely no small calcifications present 2 Probably no small calcifications present 3 Unsure whether small calcifications are present 4 Probably small calcifications present 5 Definitely small calcifications present The images per imaging system were all read in one session, and after some unspecified time, the images were read again, in a completely different order, by means of the other imaging system. In table 2, the areas under the curves for the different readers are listed, assuming a bivariate normal form for the ROC curves. for all readers the areas under the Although curves are less for the readings of the digitized images than for the conventional films, it is hard to prove a difference in diagnostic quality by statistics. A paired t test produces a one-sided p value of Thus, no conclusion can be made from a test for Table 3. P-values of Testing the Equality of the Diagnostic Qualities of Two Modalities (Results of CORROC2) for Four Readers the hypothesis Ho: equal diagnostic quality, against H,: the diagnostic quality of the digitized images is less than the diagnostic quality of the conventional film, because of the level of significance used (5%) and because of the fact that the Student s-t statistic only asymptotically follows a standard normal distribution. If we test per reader the equality of the to the areas under the curves (table 3) according method of Metz,3 the null hypothesis will be rejected for the readers 1, 3, and 4, but for reader 2 the null hypothesis will be accepted. Therefore, according to the method of Metz, no overall statement about the can be made. diagnostic qualities of the two systems In such situations, the paired t test can help, but even with the paired t test it was not possible to draw an overall conclusion. In figures 4, 5, 6, and 7, the operating points and the estimated ROC curves are given, together with their 95% confidence bands obtained from ROCFIT (a program devised by C. E. Metz), but the Working-Hotelling-type bands of Ma and Hall&dquo; could also be drawn. The behaviors of the bounds Al-4 in figures 4 and 5 are rather strange. It is not clear how ROCFIT produced such bounds. One would expect confidence intervals that are narrow at the lower left and upper right ends of the ROC curve. Table 4 shows the nonparametric estimates of the areas under the curves, together with the standard deviations. The difference vector D = Az,i - AZ,Z is estimated by
8 150 FIGURE 4. The operating points from reader 1 with the estimated curve and a 95% confidence interval for the mammography study. FIGURE 5. The operating points from reader 2 with the estimated curve and a 95% confidence interval for the mammography study. FIGURE 6. The operating points from reader 3 with the estimated curve and a 95% confidence interval for the mammography study.
9 151 FIGURE 7. The operating points from reader 4 with the estimated curve and a 95% confidence interval for the mammography study. A paired t test for this difference vector gives a p value of 0.08; thus, the null hypothesis is accepted according to the paired t test. The variance-covariance matrix C that represents the case-sample variation is given by The estimated variance of the estimate of )JL is Testing the equality of the two imaging systems can be translated into testing Ho: J.L = 0. Therefore, we use the test statistic T, Approximately, D - N4(f.LL, QZI + C). We estimate Q2 by means of REML and the EM algorithm. Table 5 shows the estimates of DiDI and or obtained by the EM algorithm. The EM estimator of the between-readers variation is QZ = Substituting this value in the assumed model for D, we get D - N4(f.LL, 2), with 2 = Q 21 + C estimated by Table 4 o Areas under the Curves (with Standard Deviation) Estimated in a Nonparametrical Way for Four Readers Obviously, the estimate for QZ is not negligible in comparison with the elements of C. This means that we indeed have to take account of within-reader variation, something like measurement error, or classification uncertainty of the readers. The next step is to obtain the maximum-likelihood estimator of ~,: Table 5. Estimates of D;D,, and Q2 by the EM Algorithm
10 152 The magnitude of Q is large relative to the elements of C; thus, we compare the value of T with a Student s-t distribution with three degrees of freedom. The corresponding p value is less than 0.5; thus, the two imaging systems differ in diagnostic quality. Discussion The method described is designed to test the differences between ROC curves obtained from multiple readers in the presence of within-reader variation in a nonparametric context. The analysis does not make use of the assumption of bivariate normal ROC curves. The areas under the ROC curves are estimated in a nonparametric way. Asymptotically the vector D, consisting of the differences in the areas under the curves per reader, follows a normal distribution with expectation fjl and variance-covariance matrix E. This variance-covariance matrix consists of a within-readers effect and of a casesample effect. The case-sample effect can be estimated directly from the variances of and the covariances between the various areas under the curves. For the estimation of the within-readers effect, we make use of the EM algorithm. The expectation of the components of the vector D can be estimated by means of the maximum-likelihood method. Testing whether the two imaging systems yield equal diagnostic quality can be translated into testing whether this expectation is equal to zero. For this test, the assumption is made that the vector D has a constant expectation for all readers, e.g., the gain of using another imaging system is the same for all readers. Therefore, this test will be the most powerful for a homogeneous group of readers. The advantage of the method is that it is more suitable than the paired t test, because it takes into account the case-sample variance. But for large cr 2, T approximately follows a Student s-t distribution with n - 1 degrees of freedom (n denoting the number of readers in the study). Hence the critical values (and the accuracy) of this test will heavily depend on the number of readers in the study. The statistical method used is currently being implemented in software. The authors thank the two reviewers for their constructive remarks. References 1. Metz CE. ROC Methodology in radiologic imaging. Invest Radiol. 1986;21: Metz CE. Basic principles of ROC analysis. Semin Nucl Med. 1978;8: Metz CE, Wang PL, Kronman HB. A New Approach for Testing the Significance of Differences between ROC Curves Measured from Correlated Data. Proceedings of the VIIIth Conference on Information Processing in Medical Imaging, Hanley JA, McNeil BJ. A method of comparing the areas under receiver operating characteristics (ROC) curves derived from the same cases. Radiology. 1983;148: DeLong ER, DeLong DM, Clarke-Pearson DL. Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics. 1988;44: Metz CE. Some practical issues of experimental design and data analysis in radiologic ROC studies. Invest Radiol. 1989; 24: Swets JA, Pickett RM. Evaluation of Diagnostic Systems, Methods from Signal Detection Theory. New York: Academic Press, Metz CE, Kronman HB. "CORROC2" Program (IBM-PC Version), Hanley JA, McNeil BJ. The meaning and the use of the area under a receiver operating characteristic (ROC) curve. Radiology. 1982;143: Dorfman DD, Berbaum KS, Metz CE. Receiver operating characteristic rating analysis: generalization to the population of readers and patients with the jackknife method. Invest Radiol. 1992;27: Stewart J. Econometrics. Philip Allan, Dempster AP, Laird NM, Rubin DB. Maximum likelihood from incomplete data via the EM algorithm. J R Statist Soc. 1977;B39: Mood AM, Graybill FA, Boes DC. Introduction to the Theory of Statistics. New York: McGraw-Hill, Klessens PM, Barneveld Binkhuysen FH, Ottes FP, Winter LHL, Willemse APP, de Valk JPJ. Diagnostic evaluation of a PACS subsystem using mammographs. An ROC analysis. J Med Informatics. 1988;13: Ma G, Hall WJ. Confidence bands for receiver operating characteristic curves. Med Decis Making. 1993;13:191-7.
Estimating Optimum Linear Combination of Multiple Correlated Diagnostic Tests at a Fixed Specificity with Receiver Operating Characteristic Curves
Journal of Data Science 6(2008), 1-13 Estimating Optimum Linear Combination of Multiple Correlated Diagnostic Tests at a Fixed Specificity with Receiver Operating Characteristic Curves Feng Gao 1, Chengjie
More informationFeature selection and classifier performance in computer-aided diagnosis: The effect of finite sample size
Feature selection and classifier performance in computer-aided diagnosis: The effect of finite sample size Berkman Sahiner, a) Heang-Ping Chan, Nicholas Petrick, Robert F. Wagner, b) and Lubomir Hadjiiski
More informationAn Approximate Test for Homogeneity of Correlated Correlation Coefficients
Quality & Quantity 37: 99 110, 2003. 2003 Kluwer Academic Publishers. Printed in the Netherlands. 99 Research Note An Approximate Test for Homogeneity of Correlated Correlation Coefficients TRIVELLORE
More informationPerformance Evaluation and Comparison
Outline Hong Chang Institute of Computing Technology, Chinese Academy of Sciences Machine Learning Methods (Fall 2012) Outline Outline I 1 Introduction 2 Cross Validation and Resampling 3 Interval Estimation
More informationA process capability index for discrete processes
Journal of Statistical Computation and Simulation Vol. 75, No. 3, March 2005, 175 187 A process capability index for discrete processes MICHAEL PERAKIS and EVDOKIA XEKALAKI* Department of Statistics, Athens
More information" M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2
Notation and Equations for Final Exam Symbol Definition X The variable we measure in a scientific study n The size of the sample N The size of the population M The mean of the sample µ The mean of the
More informationPrincipal component analysis
Principal component analysis Motivation i for PCA came from major-axis regression. Strong assumption: single homogeneous sample. Free of assumptions when used for exploration. Classical tests of significance
More informationSampling Distributions: Central Limit Theorem
Review for Exam 2 Sampling Distributions: Central Limit Theorem Conceptually, we can break up the theorem into three parts: 1. The mean (µ M ) of a population of sample means (M) is equal to the mean (µ)
More informationNon parametric ROC summary statistics
Non parametric ROC summary statistics M.C. Pardo and A.M. Franco-Pereira Department of Statistics and O.R. (I), Complutense University of Madrid. 28040-Madrid, Spain May 18, 2017 Abstract: Receiver operating
More informationMultivariate Statistical Analysis
Multivariate Statistical Analysis Fall 2011 C. L. Williams, Ph.D. Lecture 9 for Applied Multivariate Analysis Outline Addressing ourliers 1 Addressing ourliers 2 Outliers in Multivariate samples (1) For
More informationPIRLS 2016 Achievement Scaling Methodology 1
CHAPTER 11 PIRLS 2016 Achievement Scaling Methodology 1 The PIRLS approach to scaling the achievement data, based on item response theory (IRT) scaling with marginal estimation, was developed originally
More informationExperimental Design and Data Analysis for Biologists
Experimental Design and Data Analysis for Biologists Gerry P. Quinn Monash University Michael J. Keough University of Melbourne CAMBRIDGE UNIVERSITY PRESS Contents Preface page xv I I Introduction 1 1.1
More informationMethods for Identifying Out-of-Trend Data in Analysis of Stability Measurements Part II: By-Time-Point and Multivariate Control Chart
Peer-Reviewed Methods for Identifying Out-of-Trend Data in Analysis of Stability Measurements Part II: By-Time-Point and Multivariate Control Chart Máté Mihalovits and Sándor Kemény T his article is a
More informationSTATISTICS SYLLABUS UNIT I
STATISTICS SYLLABUS UNIT I (Probability Theory) Definition Classical and axiomatic approaches.laws of total and compound probability, conditional probability, Bayes Theorem. Random variable and its distribution
More information401 Review. 6. Power analysis for one/two-sample hypothesis tests and for correlation analysis.
401 Review Major topics of the course 1. Univariate analysis 2. Bivariate analysis 3. Simple linear regression 4. Linear algebra 5. Multiple regression analysis Major analysis methods 1. Graphical analysis
More informationThe purpose of this section is to derive the asymptotic distribution of the Pearson chi-square statistic. k (n j np j ) 2. np j.
Chapter 9 Pearson s chi-square test 9. Null hypothesis asymptotics Let X, X 2, be independent from a multinomial(, p) distribution, where p is a k-vector with nonnegative entries that sum to one. That
More informationDeclarative Statistics
Declarative Statistics Roberto Rossi, 1 Özgür Akgün, 2 Steven D. Prestwich, 3 S. Armagan Tarim 3 1 The University of Edinburgh Business School, The University of Edinburgh, UK 2 Department of Computer
More informationStatistical Analysis of Engineering Data The Bare Bones Edition. Precision, Bias, Accuracy, Measures of Precision, Propagation of Error
Statistical Analysis of Engineering Data The Bare Bones Edition (I) Precision, Bias, Accuracy, Measures of Precision, Propagation of Error PRIOR TO DATA ACQUISITION ONE SHOULD CONSIDER: 1. The accuracy
More informationHow do we compare the relative performance among competing models?
How do we compare the relative performance among competing models? 1 Comparing Data Mining Methods Frequent problem: we want to know which of the two learning techniques is better How to reliably say Model
More informationBinary choice 3.3 Maximum likelihood estimation
Binary choice 3.3 Maximum likelihood estimation Michel Bierlaire Output of the estimation We explain here the various outputs from the maximum likelihood estimation procedure. Solution of the maximum likelihood
More informationChapter Fifteen. Frequency Distribution, Cross-Tabulation, and Hypothesis Testing
Chapter Fifteen Frequency Distribution, Cross-Tabulation, and Hypothesis Testing Copyright 2010 Pearson Education, Inc. publishing as Prentice Hall 15-1 Internet Usage Data Table 15.1 Respondent Sex Familiarity
More informationLectures on Simple Linear Regression Stat 431, Summer 2012
Lectures on Simple Linear Regression Stat 43, Summer 0 Hyunseung Kang July 6-8, 0 Last Updated: July 8, 0 :59PM Introduction Previously, we have been investigating various properties of the population
More informationWiley. Methods and Applications of Linear Models. Regression and the Analysis. of Variance. Third Edition. Ishpeming, Michigan RONALD R.
Methods and Applications of Linear Models Regression and the Analysis of Variance Third Edition RONALD R. HOCKING PenHock Statistical Consultants Ishpeming, Michigan Wiley Contents Preface to the Third
More informationMANOVA is an extension of the univariate ANOVA as it involves more than one Dependent Variable (DV). The following are assumptions for using MANOVA:
MULTIVARIATE ANALYSIS OF VARIANCE MANOVA is an extension of the univariate ANOVA as it involves more than one Dependent Variable (DV). The following are assumptions for using MANOVA: 1. Cell sizes : o
More informationData Privacy in Biomedicine. Lecture 11b: Performance Measures for System Evaluation
Data Privacy in Biomedicine Lecture 11b: Performance Measures for System Evaluation Bradley Malin, PhD (b.malin@vanderbilt.edu) Professor of Biomedical Informatics, Biostatistics, & Computer Science Vanderbilt
More informationIntroduction to Statistical Inference
Structural Health Monitoring Using Statistical Pattern Recognition Introduction to Statistical Inference Presented by Charles R. Farrar, Ph.D., P.E. Outline Introduce statistical decision making for Structural
More informationThis paper is not to be removed from the Examination Halls
~~ST104B ZA d0 This paper is not to be removed from the Examination Halls UNIVERSITY OF LONDON ST104B ZB BSc degrees and Diplomas for Graduates in Economics, Management, Finance and the Social Sciences,
More informationOverview. Overview. Overview. Specific Examples. General Examples. Bivariate Regression & Correlation
Bivariate Regression & Correlation Overview The Scatter Diagram Two Examples: Education & Prestige Correlation Coefficient Bivariate Linear Regression Line SPSS Output Interpretation Covariance ou already
More informationIntroduction to Statistical Inference Lecture 10: ANOVA, Kruskal-Wallis Test
Introduction to Statistical Inference Lecture 10: ANOVA, Kruskal-Wallis Test la Contents The two sample t-test generalizes into Analysis of Variance. In analysis of variance ANOVA the population consists
More informationCHAPTER 3 FEATURE EXTRACTION USING GENETIC ALGORITHM BASED PRINCIPAL COMPONENT ANALYSIS
46 CHAPTER 3 FEATURE EXTRACTION USING GENETIC ALGORITHM BASED PRINCIPAL COMPONENT ANALYSIS 3.1 INTRODUCTION Cardiac beat classification is a key process in the detection of myocardial ischemic episodes
More informationBasic Statistics. 1. Gross error analyst makes a gross mistake (misread balance or entered wrong value into calculation).
Basic Statistics There are three types of error: 1. Gross error analyst makes a gross mistake (misread balance or entered wrong value into calculation). 2. Systematic error - always too high or too low
More informationEEL 851: Biometrics. An Overview of Statistical Pattern Recognition EEL 851 1
EEL 851: Biometrics An Overview of Statistical Pattern Recognition EEL 851 1 Outline Introduction Pattern Feature Noise Example Problem Analysis Segmentation Feature Extraction Classification Design Cycle
More informationEstimating the parameters of hidden binomial trials by the EM algorithm
Hacettepe Journal of Mathematics and Statistics Volume 43 (5) (2014), 885 890 Estimating the parameters of hidden binomial trials by the EM algorithm Degang Zhu Received 02 : 09 : 2013 : Accepted 02 :
More informationRegression. Estimation of the linear function (straight line) describing the linear component of the joint relationship between two variables X and Y.
Regression Bivariate i linear regression: Estimation of the linear function (straight line) describing the linear component of the joint relationship between two variables and. Generally describe as a
More informationThe Finite Sample Properties of the Least Squares Estimator / Basic Hypothesis Testing
1 The Finite Sample Properties of the Least Squares Estimator / Basic Hypothesis Testing Greene Ch 4, Kennedy Ch. R script mod1s3 To assess the quality and appropriateness of econometric estimators, we
More informationBrandon D. Gallas Stephen L. Hillis
Generalized Roe and Metz receiver operating characteristic model: analytic link between simulated decision scores and empirical AUC variances and covariances Brandon D. Gallas Stephen L. Hillis Journal
More informationSTAT 501 Assignment 2 NAME Spring Chapter 5, and Sections in Johnson & Wichern.
STAT 01 Assignment NAME Spring 00 Reading Assignment: Written Assignment: Chapter, and Sections 6.1-6.3 in Johnson & Wichern. Due Monday, February 1, in class. You should be able to do the first four problems
More informationLatent Variable Models for Binary Data. Suppose that for a given vector of explanatory variables x, the latent
Latent Variable Models for Binary Data Suppose that for a given vector of explanatory variables x, the latent variable, U, has a continuous cumulative distribution function F (u; x) and that the binary
More informationAssumptions of classical multiple regression model
ESD: Recitation #7 Assumptions of classical multiple regression model Linearity Full rank Exogeneity of independent variables Homoscedasticity and non autocorrellation Exogenously generated data Normal
More informationDesign of Engineering Experiments Part 2 Basic Statistical Concepts Simple comparative experiments
Design of Engineering Experiments Part 2 Basic Statistical Concepts Simple comparative experiments The hypothesis testing framework The two-sample t-test Checking assumptions, validity Comparing more that
More informationLehmann Family of ROC Curves
Memorial Sloan-Kettering Cancer Center From the SelectedWorks of Mithat Gönen May, 2007 Lehmann Family of ROC Curves Mithat Gonen, Memorial Sloan-Kettering Cancer Center Glenn Heller, Memorial Sloan-Kettering
More informationSubject CS1 Actuarial Statistics 1 Core Principles
Institute of Actuaries of India Subject CS1 Actuarial Statistics 1 Core Principles For 2019 Examinations Aim The aim of the Actuarial Statistics 1 subject is to provide a grounding in mathematical and
More informationECE521 week 3: 23/26 January 2017
ECE521 week 3: 23/26 January 2017 Outline Probabilistic interpretation of linear regression - Maximum likelihood estimation (MLE) - Maximum a posteriori (MAP) estimation Bias-variance trade-off Linear
More informationMachine Learning Linear Classification. Prof. Matteo Matteucci
Machine Learning Linear Classification Prof. Matteo Matteucci Recall from the first lecture 2 X R p Regression Y R Continuous Output X R p Y {Ω 0, Ω 1,, Ω K } Classification Discrete Output X R p Y (X)
More informationUnsupervised Learning with Permuted Data
Unsupervised Learning with Permuted Data Sergey Kirshner skirshne@ics.uci.edu Sridevi Parise sparise@ics.uci.edu Padhraic Smyth smyth@ics.uci.edu School of Information and Computer Science, University
More informationMeasuring relationships among multiple responses
Measuring relationships among multiple responses Linear association (correlation, relatedness, shared information) between pair-wise responses is an important property used in almost all multivariate analyses.
More informationEconometrics. 4) Statistical inference
30C00200 Econometrics 4) Statistical inference Timo Kuosmanen Professor, Ph.D. http://nomepre.net/index.php/timokuosmanen Today s topics Confidence intervals of parameter estimates Student s t-distribution
More informationSTATS 200: Introduction to Statistical Inference. Lecture 29: Course review
STATS 200: Introduction to Statistical Inference Lecture 29: Course review Course review We started in Lecture 1 with a fundamental assumption: Data is a realization of a random process. The goal throughout
More informationSTAT 501 EXAM I NAME Spring 1999
STAT 501 EXAM I NAME Spring 1999 Instructions: You may use only your calculator and the attached tables and formula sheet. You can detach the tables and formula sheet from the rest of this exam. Show your
More information9 Correlation and Regression
9 Correlation and Regression SW, Chapter 12. Suppose we select n = 10 persons from the population of college seniors who plan to take the MCAT exam. Each takes the test, is coached, and then retakes the
More informationIntroduction to Statistical Analysis
Introduction to Statistical Analysis Changyu Shen Richard A. and Susan F. Smith Center for Outcomes Research in Cardiology Beth Israel Deaconess Medical Center Harvard Medical School Objectives Descriptive
More informationMachine Learning and Pattern Recognition, Tutorial Sheet Number 3 Answers
Machine Learning and Pattern Recognition, Tutorial Sheet Number 3 Answers School of Informatics, University of Edinburgh, Instructor: Chris Williams 1. Given a dataset {(x n, y n ), n 1,..., N}, where
More information* Tuesday 17 January :30-16:30 (2 hours) Recored on ESSE3 General introduction to the course.
Name of the course Statistical methods and data analysis Audience The course is intended for students of the first or second year of the Graduate School in Materials Engineering. The aim of the course
More informationLogistic Regression: Regression with a Binary Dependent Variable
Logistic Regression: Regression with a Binary Dependent Variable LEARNING OBJECTIVES Upon completing this chapter, you should be able to do the following: State the circumstances under which logistic regression
More informationSIMULTANEOUS CONFIDENCE INTERVALS AMONG k MEAN VECTORS IN REPEATED MEASURES WITH MISSING DATA
SIMULTANEOUS CONFIDENCE INTERVALS AMONG k MEAN VECTORS IN REPEATED MEASURES WITH MISSING DATA Kazuyuki Koizumi Department of Mathematics, Graduate School of Science Tokyo University of Science 1-3, Kagurazaka,
More informationHYPOTHESIS TESTING II TESTS ON MEANS. Sorana D. Bolboacă
HYPOTHESIS TESTING II TESTS ON MEANS Sorana D. Bolboacă OBJECTIVES Significance value vs p value Parametric vs non parametric tests Tests on means: 1 Dec 14 2 SIGNIFICANCE LEVEL VS. p VALUE Materials and
More informationAn Introduction to Path Analysis
An Introduction to Path Analysis PRE 905: Multivariate Analysis Lecture 10: April 15, 2014 PRE 905: Lecture 10 Path Analysis Today s Lecture Path analysis starting with multivariate regression then arriving
More informationData Analysis and Statistical Methods Statistics 651
Data Analysis and Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasini/teaching.html Suhasini Subba Rao Motivations for the ANOVA We defined the F-distribution, this is mainly used in
More informationPerformance of Deming regression analysis in case of misspecified analytical error ratio in method comparison studies
Clinical Chemistry 44:5 1024 1031 (1998) Laboratory Management Performance of Deming regression analysis in case of misspecified analytical error ratio in method comparison studies Kristian Linnet Application
More informationMathematical studies Standard level Paper 2
Mathematical studies Standard level Paper 2 Friday 5 May 2017 (morning) 1 hour 30 minutes Instructions to candidates ydo not open this examination paper until instructed to do so. ya graphic display calculator
More informationTesting Equality of Two Intercepts for the Parallel Regression Model with Non-sample Prior Information
Testing Equality of Two Intercepts for the Parallel Regression Model with Non-sample Prior Information Budi Pratikno 1 and Shahjahan Khan 2 1 Department of Mathematics and Natural Science Jenderal Soedirman
More informationORIGINAL PAPER STATISTICAL METHOD IN A COMPARATIVE STUDY IN WHICH THE STANDARD TREATMENT IS SUPERIOR TO OTHERS
agoya J. Med. Sci. 65. 127 ~ 132, 2002 ORIGIAL PAPER STATISTICAL METHOD I A COMPARATIVE STUDY I WHICH THE STADARD TREATMET IS SUPERIOR TO OTHERS MITSURU IKEDA 1, KAZUHIRO SHIMAMOTO 2, TAKEO ISHIGAKI 3
More informationApplied Multivariate and Longitudinal Data Analysis
Applied Multivariate and Longitudinal Data Analysis Chapter 2: Inference about the mean vector(s) Ana-Maria Staicu SAS Hall 5220; 919-515-0644; astaicu@ncsu.edu 1 In this chapter we will discuss inference
More informationThe use of receiver operating characteristic curves in biomedical informatics
Journal of Biomedical Informatics 38 (2005) 404 415 Methodological Review The use of receiver operating characteristic curves in biomedical informatics Thomas A. Lasko a,b, Jui G. Bhagwat c, Kelly H. Zou
More informationTime: 1 hour 30 minutes
Paper Reference(s) 6684/01 Edexcel GCE Statistics S2 Gold Level G3 Time: 1 hour 30 minutes Materials required for examination papers Mathematical Formulae (Green) Items included with question Nil Candidates
More informationMultivariate analysis of variance and covariance
Introduction Multivariate analysis of variance and covariance Univariate ANOVA: have observations from several groups, numerical dependent variable. Ask whether dependent variable has same mean for each
More informationA CUSUM approach for online change-point detection on curve sequences
ESANN 22 proceedings, European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning. Bruges Belgium, 25-27 April 22, i6doc.com publ., ISBN 978-2-8749-49-. Available
More informationPRE-TEST ESTIMATION OF THE REGRESSION SCALE PARAMETER WITH MULTIVARIATE STUDENT-t ERRORS AND INDEPENDENT SUB-SAMPLES
Sankhyā : The Indian Journal of Statistics 1994, Volume, Series B, Pt.3, pp. 334 343 PRE-TEST ESTIMATION OF THE REGRESSION SCALE PARAMETER WITH MULTIVARIATE STUDENT-t ERRORS AND INDEPENDENT SUB-SAMPLES
More informationTest Yourself! Methodological and Statistical Requirements for M.Sc. Early Childhood Research
Test Yourself! Methodological and Statistical Requirements for M.Sc. Early Childhood Research HOW IT WORKS For the M.Sc. Early Childhood Research, sufficient knowledge in methods and statistics is one
More informationLecture 6: April 19, 2002
EE596 Pat. Recog. II: Introduction to Graphical Models Spring 2002 Lecturer: Jeff Bilmes Lecture 6: April 19, 2002 University of Washington Dept. of Electrical Engineering Scribe: Huaning Niu,Özgür Çetin
More informationNAG Library Chapter Introduction. G08 Nonparametric Statistics
NAG Library Chapter Introduction G08 Nonparametric Statistics Contents 1 Scope of the Chapter.... 2 2 Background to the Problems... 2 2.1 Parametric and Nonparametric Hypothesis Testing... 2 2.2 Types
More informationRejection regions for the bivariate case
Rejection regions for the bivariate case The rejection region for the T 2 test (and similarly for Z 2 when Σ is known) is the region outside of an ellipse, for which there is a (1-α)% chance that the test
More informationSmart Home Health Analytics Information Systems University of Maryland Baltimore County
Smart Home Health Analytics Information Systems University of Maryland Baltimore County 1 IEEE Expert, October 1996 2 Given sample S from all possible examples D Learner L learns hypothesis h based on
More informationPsychology 282 Lecture #4 Outline Inferences in SLR
Psychology 282 Lecture #4 Outline Inferences in SLR Assumptions To this point we have not had to make any distributional assumptions. Principle of least squares requires no assumptions. Can use correlations
More informationProbability and Information Theory. Sargur N. Srihari
Probability and Information Theory Sargur N. srihari@cedar.buffalo.edu 1 Topics in Probability and Information Theory Overview 1. Why Probability? 2. Random Variables 3. Probability Distributions 4. Marginal
More informationAccelerating the EM Algorithm for Mixture Density Estimation
Accelerating the EM Algorithm ICERM Workshop September 4, 2015 Slide 1/18 Accelerating the EM Algorithm for Mixture Density Estimation Homer Walker Mathematical Sciences Department Worcester Polytechnic
More informationCIVL /8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8
CIVL - 7904/8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8 Chi-square Test How to determine the interval from a continuous distribution I = Range 1 + 3.322(logN) I-> Range of the class interval
More informationEconometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018
Econometrics I KS Module 2: Multivariate Linear Regression Alexander Ahammer Department of Economics Johannes Kepler University of Linz This version: April 16, 2018 Alexander Ahammer (JKU) Module 2: Multivariate
More informationBayesian Reasoning. Adapted from slides by Tim Finin and Marie desjardins.
Bayesian Reasoning Adapted from slides by Tim Finin and Marie desjardins. 1 Outline Probability theory Bayesian inference From the joint distribution Using independence/factoring From sources of evidence
More informationProblem Set 2. MAS 622J/1.126J: Pattern Recognition and Analysis. Due: 5:00 p.m. on September 30
Problem Set 2 MAS 622J/1.126J: Pattern Recognition and Analysis Due: 5:00 p.m. on September 30 [Note: All instructions to plot data or write a program should be carried out using Matlab. In order to maintain
More informationAnomaly Detection. Jing Gao. SUNY Buffalo
Anomaly Detection Jing Gao SUNY Buffalo 1 Anomaly Detection Anomalies the set of objects are considerably dissimilar from the remainder of the data occur relatively infrequently when they do occur, their
More informationApplied Multivariate Statistical Modeling Prof. J. Maiti Department of Industrial Engineering and Management Indian Institute of Technology, Kharagpur
Applied Multivariate Statistical Modeling Prof. J. Maiti Department of Industrial Engineering and Management Indian Institute of Technology, Kharagpur Lecture - 29 Multivariate Linear Regression- Model
More informationPart III: Unstructured Data
Inf1-DA 2010 2011 III: 51 / 89 Part III Unstructured Data Data Retrieval: III.1 Unstructured data and data retrieval Statistical Analysis of Data: III.2 Data scales and summary statistics III.3 Hypothesis
More informationLearning From Crowds. Presented by: Bei Peng 03/24/15
Learning From Crowds Presented by: Bei Peng 03/24/15 1 Supervised Learning Given labeled training data, learn to generalize well on unseen data Binary classification ( ) Multi-class classification ( y
More informationFundamentals to Biostatistics. Prof. Chandan Chakraborty Associate Professor School of Medical Science & Technology IIT Kharagpur
Fundamentals to Biostatistics Prof. Chandan Chakraborty Associate Professor School of Medical Science & Technology IIT Kharagpur Statistics collection, analysis, interpretation of data development of new
More informationHuman-Oriented Robotics. Probability Refresher. Kai Arras Social Robotics Lab, University of Freiburg Winter term 2014/2015
Probability Refresher Kai Arras, University of Freiburg Winter term 2014/2015 Probability Refresher Introduction to Probability Random variables Joint distribution Marginalization Conditional probability
More information[y i α βx i ] 2 (2) Q = i=1
Least squares fits This section has no probability in it. There are no random variables. We are given n points (x i, y i ) and want to find the equation of the line that best fits them. We take the equation
More informationKarl-Rudolf Koch Introduction to Bayesian Statistics Second Edition
Karl-Rudolf Koch Introduction to Bayesian Statistics Second Edition Karl-Rudolf Koch Introduction to Bayesian Statistics Second, updated and enlarged Edition With 17 Figures Professor Dr.-Ing., Dr.-Ing.
More informationInferences on a Normal Covariance Matrix and Generalized Variance with Monotone Missing Data
Journal of Multivariate Analysis 78, 6282 (2001) doi:10.1006jmva.2000.1939, available online at http:www.idealibrary.com on Inferences on a Normal Covariance Matrix and Generalized Variance with Monotone
More informationDetection theory. H 0 : x[n] = w[n]
Detection Theory Detection theory A the last topic of the course, we will briefly consider detection theory. The methods are based on estimation theory and attempt to answer questions such as Is a signal
More informationHypothesis testing: Steps
Review for Exam 2 Hypothesis testing: Steps Exam 2 Review 1. Determine appropriate test and hypotheses 2. Use distribution table to find critical statistic value(s) representing rejection region 3. Compute
More informationPSY 307 Statistics for the Behavioral Sciences. Chapter 20 Tests for Ranked Data, Choosing Statistical Tests
PSY 307 Statistics for the Behavioral Sciences Chapter 20 Tests for Ranked Data, Choosing Statistical Tests What To Do with Non-normal Distributions Tranformations (pg 382): The shape of the distribution
More informationUncertainty quantification and visualization for functional random variables
Uncertainty quantification and visualization for functional random variables MascotNum Workshop 2014 S. Nanty 1,3 C. Helbert 2 A. Marrel 1 N. Pérot 1 C. Prieur 3 1 CEA, DEN/DER/SESI/LSMR, F-13108, Saint-Paul-lez-Durance,
More informationUniversity of Cambridge Engineering Part IIB Module 3F3: Signal and Pattern Processing Handout 2:. The Multivariate Gaussian & Decision Boundaries
University of Cambridge Engineering Part IIB Module 3F3: Signal and Pattern Processing Handout :. The Multivariate Gaussian & Decision Boundaries..15.1.5 1 8 6 6 8 1 Mark Gales mjfg@eng.cam.ac.uk Lent
More informationParameter estimation and forecasting. Cristiano Porciani AIfA, Uni-Bonn
Parameter estimation and forecasting Cristiano Porciani AIfA, Uni-Bonn Questions? C. Porciani Estimation & forecasting 2 Temperature fluctuations Variance at multipole l (angle ~180o/l) C. Porciani Estimation
More informationModel-based Characterization of Mammographic Masses
Model-based Characterization of Mammographic Masses Sven-René von der Heidt 1, Matthias Elter 2, Thomas Wittenberg 2, Dietrich Paulus 1 1 Institut für Computervisualistik, Universität Koblenz-Landau 2
More information2. What are the tradeoffs among different measures of error (e.g. probability of false alarm, probability of miss, etc.)?
ECE 830 / CS 76 Spring 06 Instructors: R. Willett & R. Nowak Lecture 3: Likelihood ratio tests, Neyman-Pearson detectors, ROC curves, and sufficient statistics Executive summary In the last lecture we
More informationLinear Models 1. Isfahan University of Technology Fall Semester, 2014
Linear Models 1 Isfahan University of Technology Fall Semester, 2014 References: [1] G. A. F., Seber and A. J. Lee (2003). Linear Regression Analysis (2nd ed.). Hoboken, NJ: Wiley. [2] A. C. Rencher and
More informationLECTURE 5. Introduction to Econometrics. Hypothesis testing
LECTURE 5 Introduction to Econometrics Hypothesis testing October 18, 2016 1 / 26 ON TODAY S LECTURE We are going to discuss how hypotheses about coefficients can be tested in regression models We will
More informationBootstrapping, Randomization, 2B-PLS
Bootstrapping, Randomization, 2B-PLS Statistics, Tests, and Bootstrapping Statistic a measure that summarizes some feature of a set of data (e.g., mean, standard deviation, skew, coefficient of variation,
More information