Directional Control Schemes for Multivariate Categorical Processes

Size: px
Start display at page:

Download "Directional Control Schemes for Multivariate Categorical Processes"

Transcription

1 Directional Control Schemes for Multivariate Categorical Processes Nankai University Homepage: math.nankai.edu.cn/ chlzou (Joint work with Mr. Jian Li and Prof. Fugee Tsung)

2 Outline 1 Multivariate Categorical Modeling 2 Directional Control Schemes 3 Performance Assessment

3 Multivariate Categorical Modeling Multivariate Categorical Processes The aluminium electrolytic capacitor (AEC). Capacity (CAP) Dissipation Factor (DF) Leakage Current (LC) (CAP DF LC), 1 for conforming and 2 for nonconforming, 2 3 = 8 level combinations or cross-classifications: (111), (112), (121), (122), (211), (212), (221), (222). (121) means an AEC with conforming CAP and LC, and nonconforming DF 3-way contingency table of size and with 8 cells; Each cell corresponds to one level combination and stores the count under this combination.

4 Multivariate Categorical Modeling Existing works Multivariate binomial chart χ 2 -chart (Patel s 1973) : G MB,k = 1 N ( nmb,k Np (0) ) T ( MB Σ 1 MB nmb,k Np (0) MB). only applicable in multivariate binomial processes The multivariate multinomial chart, a multi-chart comprising p individual charts (Marcucci 1985) G MM,(i)k = 1 N ( nmm,(i)k Np (0) ) T ( MM,(i) Σ 1 MM,(i) nmm,(i)k Np (0) MM,(i)). cumbersome

5 Multivariate Categorical Modeling Some review on multivariate continuous monitoring Hotelling T 2 -based statistic (Lowry et al. 1992): n X Σ 1 X Regression-adjusted statistic (Hawkins 1991): max i=1,...,p v i, v j = n(d jσ 1 X)/(d jσ 1 d j ) 1 2, for j = 1,..., p. MSPC using LASSO or other variable selection techniques

6 Multivariate Categorical Modeling Contingency Table p categorical variables or factors {C 1, C 2,..., C p }. Each classification factor C i takes h i of possible levels. A p-way h 1 h 2... h p cross-classified contingency table with h = p i=1 h i cells. The AEC example: p = 3 and h 1 = h 2 = h 3 = 2. Log-Linear Models For an h 1 h 2 h 3 table and a fixed sample size N, the cell counts follow the multinomial distribution MN(N; p ijk ) with and their expectations m i,j,k = Np i,j,k, and i,j,k p i,j,k = 1. The log-linear model can characterize the relationship between cell probabilities and factor levels, ln p i,j,k = u (0) +u (1) i +u (2) j +u (3) k +u (1,2) i,j +u (1,3) i,k +u (2,3) j,k +u (1,2,3) i,j,k.

7 Multivariate Categorical Modeling Identifiability requires constraints such as u (1) i = u (1,2) i,j = u (1,3) i,k = i i i i u (1,2,3) i,j,k = 0 for the first factor along its index i. Similar equations describe the second and third factors along with their indexes j and k, respectively. Re-parameterizations For example, a 2 3 contingency table u (0) = β 0, u (1) 1 = β 1, u (1) 2 = β 1, u (2) 1 = β 2, u (2) 2 = β 3, u (2) 3 = β 2 β 3, u (1,2) 1,1 = β 4, u (1,2) 1,2 = β 5, u (1,2) 1,3 = β 4 β 5, u (1,2) 2,1 = β 4, u (1,2) 2,2 = β 5, u (1,2) 2,3 = β 4 + β 5.

8 Multivariate Categorical Modeling Log-Linear Model at the effect level Imposed by some constraints, the log-linear model for a general p-way contingency table can be written as 2 p 1 ln p = 1β 0 + X i β i. The subvector β i stands for the ith main or interaction effect. i=1 Log-Linear Model at the coefficient level h 1 ln p = 1β 0 + x i β i i=1 The number β i stands for the ith coefficient, either part or totality of an effect.

9 Directional Control Schemes One-to-one correspondence between coefficient subvectors and factor effects Shifts in the marginal dist. of one factor represent deviations of the coefficient subvector corresponding to its main effect; Shifts in the dependence among multiple factors represent deviations of the coefficient subvector reflecting their interaction effect.

10 Outline 1 Multivariate Categorical Modeling 2 Directional Control Schemes 3 Performance Assessment

11 Directional Control Schemes Problem Formulation F (X; β): ln p = 1β 0 + Xβ and p T 1 = 1 X = [X 1,..., X 2 p 1] = [x 1, x 2,..., x h 1 ] The jth on-line multivariate sampling observation vector, n (j) of size h 1, collected over time from the change-point model { n (j) i.i.d. F (X; β (0) ), for j = 1,..., τ, F (X; β (1) ), for j = τ + 1,...

12 Directional Control Schemes Shift directions Shifts with all possibilities H 0 : β = β (0) H 1 : β β (0). Only the coefficient β i (1 i h 1) adds by an unknown constant δ i H 0 : β = β (0) H 1 : β = β (0) + d i δ i. Only one coefficient may deviate with unknown location H 0 : β = β (0) H 1 : β = β (0) + d 1 δ 1 or... or β = β (0) + d h 1 δ h 1.

13 Directional Control Schemes Shift directions (cont d) Most shifts are inclined to occur in lower-order effects. Most real applications care about only means and variances, namely moments of the first two orders. The monitoring task is to only answer yes or no about whether process is IC, rather than telling the locations of shifted effects. Focus on merely effects of the first two orders (i.e., the main effects and the two-factor interaction effects). H 0 : β = β (0) ) H 1 : (β = β (0) + d i δ i. i I 2

14 Directional Control Schemes LLD chart Given the IC probability vector p (0), Phase II sample size N, observation vectors n j (j = 1, 2,...), we construct an EWMA control chart based on likelihood ratio test with its charting statistic ( 1 ( R k = max z (k) Np (0)) T ( xi x T i I 2 N i Σ (0) ) 1x ( T x i i n Np (0) )). z (k) = (1 λ)z (k 1) + λn (k) EWMA observation vector Σ (0) = diag ( p (0)) p (0) (p (0) ) T x i ith column of X Log-Linear Directional (LLD) control chart.

15 Directional Control Schemes Diagnostic Procedures The real one-coefficient-shift direction index ζ can be identified as ( 1 ( ˆζ = arg max zη Np (0)) T ( ) xj x T 1x ( T j I 3 N j Σxj j zη Np (0))). Σ = diag (ˆp ) ˆpˆp T with ˆp = z η /N. z η is the EWMA observation vector at the signal time point η. We confine the candidate subset of diagnostic shift directions to effects in the first three orders, which is still safe in case a shift appears in a three-factor interaction effect.

16 Outline 1 Multivariate Categorical Modeling 2 Directional Control Schemes 3 Performance Assessment

17 Performance Assessment LLD chart VS MBE chart Multivariate binomial EWMA (MBE) chart, the EWMA version of Patel s (1973) χ 2 -chart. Multivariate Binomial Processes 5 characteristics each with 2 levels, a 5-way contingency table with 2 5 = 32 cells. Table: OC ARLs of LLD and MBE, λ = 0.1, N = 1, 000, ARL 0 =370 δ LLD MBE LLD MBE LLD MBE β (3) β (5) β (1,4) β (2,3) β (2,5) β (3,4)

18 Performance Assessment LLD chart VS MME chart The multivariate multinomial EWMA (MME) chart, a multi-chart comprising p individual charts. Each is the EWMA version of the generalized p-chart in Marcucci (1985). Multivariate Multinomial Processes. 4 characteristics of 2, 2, 3, 3 levels, a 4-way table. Table: OC ARLs of LLD and MME, λ = 0.1, N = 1, 000, ARL 0 =370 δ LLD MME LLD MME LLD MME β (32 ) β (41 ) β (1,2) β (1,32 ) β (2,31 ) β (31,4 1 )

19 Performance Assessment Diagnostic Performance The same setting in the 5-way table as the comparison with MBE. Matching probability P( ˆζ = ζ) Table: Observed matching probability for diagnosing shift direction δ β (2) β (4) β (1,3) β (4,5) β (1,4,5) β (2,3,5) NOTE: λ = 0.1. N = 1, 000.

20 Conclusion The superiority of the LLD chart lies in one-coefficient-shifts or shifts in high-order interaction effects. The LLD chart can work well within the unified framework of multivariate binomial and multivariate multinomial processes. The diagnostic schemes show good performance in estimating fault directions.

21 Thank you for your attention!

Statistical Process Control for Multivariate Categorical Processes

Statistical Process Control for Multivariate Categorical Processes Statistical Process Control for Multivariate Categorical Processes Fugee Tsung The Hong Kong University of Science and Technology Fugee Tsung 1/27 Introduction Typical Control Charts Univariate continuous

More information

Multivariate Binomial/Multinomial Control Chart

Multivariate Binomial/Multinomial Control Chart Multivariate Binomial/Multinomial Control Chart Jian Li 1, Fugee Tsung 1, and Changliang Zou 2 1 Department of Industrial Engineering and Logistics Management, Hong Kong University of Science and Technology,

More information

Describing Contingency tables

Describing Contingency tables Today s topics: Describing Contingency tables 1. Probability structure for contingency tables (distributions, sensitivity/specificity, sampling schemes). 2. Comparing two proportions (relative risk, odds

More information

Module B1: Multivariate Process Control

Module B1: Multivariate Process Control Module B1: Multivariate Process Control Prof. Fugee Tsung Hong Kong University of Science and Technology Quality Lab: http://qlab.ielm.ust.hk I. Multivariate Shewhart chart WHY MULTIVARIATE PROCESS CONTROL

More information

STAT Chapter 13: Categorical Data. Recall we have studied binomial data, in which each trial falls into one of 2 categories (success/failure).

STAT Chapter 13: Categorical Data. Recall we have studied binomial data, in which each trial falls into one of 2 categories (success/failure). STAT 515 -- Chapter 13: Categorical Data Recall we have studied binomial data, in which each trial falls into one of 2 categories (success/failure). Many studies allow for more than 2 categories. Example

More information

The purpose of this section is to derive the asymptotic distribution of the Pearson chi-square statistic. k (n j np j ) 2. np j.

The purpose of this section is to derive the asymptotic distribution of the Pearson chi-square statistic. k (n j np j ) 2. np j. Chapter 9 Pearson s chi-square test 9. Null hypothesis asymptotics Let X, X 2, be independent from a multinomial(, p) distribution, where p is a k-vector with nonnegative entries that sum to one. That

More information

Discrete Multivariate Statistics

Discrete Multivariate Statistics Discrete Multivariate Statistics Univariate Discrete Random variables Let X be a discrete random variable which, in this module, will be assumed to take a finite number of t different values which are

More information

Rejoinder. 1 Phase I and Phase II Profile Monitoring. Peihua Qiu 1, Changliang Zou 2 and Zhaojun Wang 2

Rejoinder. 1 Phase I and Phase II Profile Monitoring. Peihua Qiu 1, Changliang Zou 2 and Zhaojun Wang 2 Rejoinder Peihua Qiu 1, Changliang Zou 2 and Zhaojun Wang 2 1 School of Statistics, University of Minnesota 2 LPMC and Department of Statistics, Nankai University, China We thank the editor Professor David

More information

Monitoring General Linear Profiles Using Multivariate EWMA schemes

Monitoring General Linear Profiles Using Multivariate EWMA schemes Monitoring General Linear Profiles Using Multivariate EWMA schemes Changliang Zou Department of Statistics School of Mathematical Sciences Nankai University Tianjian, PR China Fugee Tsung Department of

More information

A Practitioner s Guide to Generalized Linear Models

A Practitioner s Guide to Generalized Linear Models A Practitioners Guide to Generalized Linear Models Background The classical linear models and most of the minimum bias procedures are special cases of generalized linear models (GLMs). GLMs are more technically

More information

Maximum likelihood in log-linear models

Maximum likelihood in log-linear models Graphical Models, Lecture 4, Michaelmas Term 2010 October 22, 2010 Generating class Dependence graph of log-linear model Conformal graphical models Factor graphs Let A denote an arbitrary set of subsets

More information

Machine Learning, Fall 2012 Homework 2

Machine Learning, Fall 2012 Homework 2 0-60 Machine Learning, Fall 202 Homework 2 Instructors: Tom Mitchell, Ziv Bar-Joseph TA in charge: Selen Uguroglu email: sugurogl@cs.cmu.edu SOLUTIONS Naive Bayes, 20 points Problem. Basic concepts, 0

More information

ij i j m ij n ij m ij n i j Suppose we denote the row variable by X and the column variable by Y ; We can then re-write the above expression as

ij i j m ij n ij m ij n i j Suppose we denote the row variable by X and the column variable by Y ; We can then re-write the above expression as page1 Loglinear Models Loglinear models are a way to describe association and interaction patterns among categorical variables. They are commonly used to model cell counts in contingency tables. These

More information

MIT Spring 2015

MIT Spring 2015 Regression Analysis MIT 18.472 Dr. Kempthorne Spring 2015 1 Outline Regression Analysis 1 Regression Analysis 2 Multiple Linear Regression: Setup Data Set n cases i = 1, 2,..., n 1 Response (dependent)

More information

Naïve Bayes classification

Naïve Bayes classification Naïve Bayes classification 1 Probability theory Random variable: a variable whose possible values are numerical outcomes of a random phenomenon. Examples: A person s height, the outcome of a coin toss

More information

Summary of Chapters 7-9

Summary of Chapters 7-9 Summary of Chapters 7-9 Chapter 7. Interval Estimation 7.2. Confidence Intervals for Difference of Two Means Let X 1,, X n and Y 1, Y 2,, Y m be two independent random samples of sizes n and m from two

More information

Notes on Machine Learning for and

Notes on Machine Learning for and Notes on Machine Learning for 16.410 and 16.413 (Notes adapted from Tom Mitchell and Andrew Moore.) Choosing Hypotheses Generally want the most probable hypothesis given the training data Maximum a posteriori

More information

Naïve Bayes classification. p ij 11/15/16. Probability theory. Probability theory. Probability theory. X P (X = x i )=1 i. Marginal Probability

Naïve Bayes classification. p ij 11/15/16. Probability theory. Probability theory. Probability theory. X P (X = x i )=1 i. Marginal Probability Probability theory Naïve Bayes classification Random variable: a variable whose possible values are numerical outcomes of a random phenomenon. s: A person s height, the outcome of a coin toss Distinguish

More information

Central Limit Theorem ( 5.3)

Central Limit Theorem ( 5.3) Central Limit Theorem ( 5.3) Let X 1, X 2,... be a sequence of independent random variables, each having n mean µ and variance σ 2. Then the distribution of the partial sum S n = X i i=1 becomes approximately

More information

Likelihood Ratio-Based Distribution-Free EWMA Control Charts

Likelihood Ratio-Based Distribution-Free EWMA Control Charts Likelihood Ratio-Based Distribution-Free EWMA Control Charts CHANGLIANG ZOU Nankai University, Tianjin, China FUGEE TSUNG Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong

More information

A Probability Review

A Probability Review A Probability Review Outline: A probability review Shorthand notation: RV stands for random variable EE 527, Detection and Estimation Theory, # 0b 1 A Probability Review Reading: Go over handouts 2 5 in

More information

Unit 9: Inferences for Proportions and Count Data

Unit 9: Inferences for Proportions and Count Data Unit 9: Inferences for Proportions and Count Data Statistics 571: Statistical Methods Ramón V. León 12/15/2008 Unit 9 - Stat 571 - Ramón V. León 1 Large Sample Confidence Interval for Proportion ( pˆ p)

More information

Self-Starting Control Chart for Simultaneously Monitoring Process Mean and Variance

Self-Starting Control Chart for Simultaneously Monitoring Process Mean and Variance International Journal of Production Research Vol. 00, No. 00, 15 March 2008, 1 14 Self-Starting Control Chart for Simultaneously Monitoring Process Mean and Variance Zhonghua Li a, Jiujun Zhang a,b and

More information

A Few Special Distributions and Their Properties

A Few Special Distributions and Their Properties A Few Special Distributions and Their Properties Econ 690 Purdue University Justin L. Tobias (Purdue) Distributional Catalog 1 / 20 Special Distributions and Their Associated Properties 1 Uniform Distribution

More information

Confidence Intervals, Testing and ANOVA Summary

Confidence Intervals, Testing and ANOVA Summary Confidence Intervals, Testing and ANOVA Summary 1 One Sample Tests 1.1 One Sample z test: Mean (σ known) Let X 1,, X n a r.s. from N(µ, σ) or n > 30. Let The test statistic is H 0 : µ = µ 0. z = x µ 0

More information

Ph.D. Qualifying Exam Friday Saturday, January 6 7, 2017

Ph.D. Qualifying Exam Friday Saturday, January 6 7, 2017 Ph.D. Qualifying Exam Friday Saturday, January 6 7, 2017 Put your solution to each problem on a separate sheet of paper. Problem 1. (5106) Let X 1, X 2,, X n be a sequence of i.i.d. observations from a

More information

Multivariate Statistical Analysis

Multivariate Statistical Analysis Multivariate Statistical Analysis Fall 2011 C. L. Williams, Ph.D. Lecture 3 for Applied Multivariate Analysis Outline 1 Reprise-Vectors, vector lengths and the angle between them 2 3 Partial correlation

More information

Unit 9: Inferences for Proportions and Count Data

Unit 9: Inferences for Proportions and Count Data Unit 9: Inferences for Proportions and Count Data Statistics 571: Statistical Methods Ramón V. León 1/15/008 Unit 9 - Stat 571 - Ramón V. León 1 Large Sample Confidence Interval for Proportion ( pˆ p)

More information

The Sparsity and Bias of The LASSO Selection In High-Dimensional Linear Regression

The Sparsity and Bias of The LASSO Selection In High-Dimensional Linear Regression The Sparsity and Bias of The LASSO Selection In High-Dimensional Linear Regression Cun-hui Zhang and Jian Huang Presenter: Quefeng Li Feb. 26, 2010 un-hui Zhang and Jian Huang Presenter: Quefeng The Sparsity

More information

Multivariate distance Fall

Multivariate distance Fall Multivariate distance 2017 Fall Contents Euclidean Distance Definitions Standardization Population Distance Population mean and variance Definitions proportions presence-absence data Multivariate distance

More information

PubH 5450 Biostatistics I Prof. Carlin. Lecture 13

PubH 5450 Biostatistics I Prof. Carlin. Lecture 13 PubH 5450 Biostatistics I Prof. Carlin Lecture 13 Outline Outline Sample Size Counts, Rates and Proportions Part I Sample Size Type I Error and Power Type I error rate: probability of rejecting the null

More information

Yu Xie, Institute for Social Research, 426 Thompson Street, University of Michigan, Ann

Yu Xie, Institute for Social Research, 426 Thompson Street, University of Michigan, Ann Association Model, Page 1 Yu Xie, Institute for Social Research, 426 Thompson Street, University of Michigan, Ann Arbor, MI 48106. Email: yuxie@umich.edu. Tel: (734)936-0039. Fax: (734)998-7415. Association

More information

Lecture 16 : Bayesian analysis of contingency tables. Bayesian linear regression. Jonathan Marchini (University of Oxford) BS2a MT / 15

Lecture 16 : Bayesian analysis of contingency tables. Bayesian linear regression. Jonathan Marchini (University of Oxford) BS2a MT / 15 Lecture 16 : Bayesian analysis of contingency tables. Bayesian linear regression. Jonathan Marchini (University of Oxford) BS2a MT 2013 1 / 15 Contingency table analysis North Carolina State University

More information

Ph.D. Qualifying Exam Friday Saturday, January 3 4, 2014

Ph.D. Qualifying Exam Friday Saturday, January 3 4, 2014 Ph.D. Qualifying Exam Friday Saturday, January 3 4, 2014 Put your solution to each problem on a separate sheet of paper. Problem 1. (5166) Assume that two random samples {x i } and {y i } are independently

More information

Practice Problems Section Problems

Practice Problems Section Problems Practice Problems Section 4-4-3 4-4 4-5 4-6 4-7 4-8 4-10 Supplemental Problems 4-1 to 4-9 4-13, 14, 15, 17, 19, 0 4-3, 34, 36, 38 4-47, 49, 5, 54, 55 4-59, 60, 63 4-66, 68, 69, 70, 74 4-79, 81, 84 4-85,

More information

Multi-Variate-Attribute Quality Control (MVAQC)

Multi-Variate-Attribute Quality Control (MVAQC) by Submitted in total fulfillment of the requirements of the degree of Doctor of Philosophy July 2014 Department of Mechanical Engineering The University of Melboune Abstract When the number of quality

More information

Spatial Statistics with Image Analysis. Lecture L11. Home assignment 3. Lecture 11. Johan Lindström. December 5, 2016.

Spatial Statistics with Image Analysis. Lecture L11. Home assignment 3. Lecture 11. Johan Lindström. December 5, 2016. HA3 MRF:s Simulation Estimation Spatial Statistics with Image Analysis Lecture 11 Johan Lindström December 5, 2016 Lecture L11 Johan Lindström - johanl@maths.lth.se FMSN20/MASM25 L11 1/22 HA3 MRF:s Simulation

More information

Sample Size and Power Considerations for Longitudinal Studies

Sample Size and Power Considerations for Longitudinal Studies Sample Size and Power Considerations for Longitudinal Studies Outline Quantities required to determine the sample size in longitudinal studies Review of type I error, type II error, and power For continuous

More information

Optimal exact tests for complex alternative hypotheses on cross tabulated data

Optimal exact tests for complex alternative hypotheses on cross tabulated data Optimal exact tests for complex alternative hypotheses on cross tabulated data Daniel Yekutieli Statistics and OR Tel Aviv University CDA course 29 July 2017 Yekutieli (TAU) Optimal exact tests for complex

More information

13.1 Categorical Data and the Multinomial Experiment

13.1 Categorical Data and the Multinomial Experiment Chapter 13 Categorical Data Analysis 13.1 Categorical Data and the Multinomial Experiment Recall Variable: (numerical) variable (i.e. # of students, temperature, height,). (non-numerical, categorical)

More information

The Multinomial Model

The Multinomial Model The Multinomial Model STA 312: Fall 2012 Contents 1 Multinomial Coefficients 1 2 Multinomial Distribution 2 3 Estimation 4 4 Hypothesis tests 8 5 Power 17 1 Multinomial Coefficients Multinomial coefficient

More information

Introduction to Statistical Data Analysis Lecture 7: The Chi-Square Distribution

Introduction to Statistical Data Analysis Lecture 7: The Chi-Square Distribution Introduction to Statistical Data Analysis Lecture 7: The Chi-Square Distribution James V. Lambers Department of Mathematics The University of Southern Mississippi James V. Lambers Statistical Data Analysis

More information

Goodness of Fit Tests: Homogeneity

Goodness of Fit Tests: Homogeneity Goodness of Fit Tests: Homogeneity Mathematics 47: Lecture 35 Dan Sloughter Furman University May 11, 2006 Dan Sloughter (Furman University) Goodness of Fit Tests: Homogeneity May 11, 2006 1 / 13 Testing

More information

Modeling Real Estate Data using Quantile Regression

Modeling Real Estate Data using Quantile Regression Modeling Real Estate Data using Semiparametric Quantile Regression Department of Statistics University of Innsbruck September 9th, 2011 Overview 1 Application: 2 3 4 Hedonic regression data for house prices

More information

Log-linear Models for Contingency Tables

Log-linear Models for Contingency Tables Log-linear Models for Contingency Tables Statistics 149 Spring 2006 Copyright 2006 by Mark E. Irwin Log-linear Models for Two-way Contingency Tables Example: Business Administration Majors and Gender A

More information

Applied Statistics Comprehensive Exam

Applied Statistics Comprehensive Exam Applied Statistics Comprehensive Exam August 2013 Ph.D Theory Exam This comprehensive exam consists of 10 questions pertaining to theoretical statistical topics. 1 This Ph.D level exam will run from 8:30

More information

2.830J / 6.780J / ESD.63J Control of Manufacturing Processes (SMA 6303) Spring 2008

2.830J / 6.780J / ESD.63J Control of Manufacturing Processes (SMA 6303) Spring 2008 MIT OpenCourseWare http://ocw.mit.edu 2.830J / 6.780J / ESD.63J Control of Processes (SMA 6303) Spring 2008 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/term

More information

10708 Graphical Models: Homework 2

10708 Graphical Models: Homework 2 10708 Graphical Models: Homework 2 Due Monday, March 18, beginning of class Feburary 27, 2013 Instructions: There are five questions (one for extra credit) on this assignment. There is a problem involves

More information

Machine Learning - MT Classification: Generative Models

Machine Learning - MT Classification: Generative Models Machine Learning - MT 2016 7. Classification: Generative Models Varun Kanade University of Oxford October 31, 2016 Announcements Practical 1 Submission Try to get signed off during session itself Otherwise,

More information

Preface Introduction to Statistics and Data Analysis Overview: Statistical Inference, Samples, Populations, and Experimental Design The Role of

Preface Introduction to Statistics and Data Analysis Overview: Statistical Inference, Samples, Populations, and Experimental Design The Role of Preface Introduction to Statistics and Data Analysis Overview: Statistical Inference, Samples, Populations, and Experimental Design The Role of Probability Sampling Procedures Collection of Data Measures

More information

Sociology 6Z03 Review II

Sociology 6Z03 Review II Sociology 6Z03 Review II John Fox McMaster University Fall 2016 John Fox (McMaster University) Sociology 6Z03 Review II Fall 2016 1 / 35 Outline: Review II Probability Part I Sampling Distributions Probability

More information

Total No. of Questions : 10] [Total No. of Pages : 02. M.Sc. DEGREE EXAMINATION, DEC Second Year STATISTICS. Statistical Quality Control

Total No. of Questions : 10] [Total No. of Pages : 02. M.Sc. DEGREE EXAMINATION, DEC Second Year STATISTICS. Statistical Quality Control (DMSTT21) Total No. of Questions : 10] [Total No. of Pages : 02 M.Sc. DEGREE EXAMINATION, DEC. 2016 Second Year STATISTICS Statistical Quality Control Time : 3 Hours Maximum Marks : 70 Answer any five

More information

2 Describing Contingency Tables

2 Describing Contingency Tables 2 Describing Contingency Tables I. Probability structure of a 2-way contingency table I.1 Contingency Tables X, Y : cat. var. Y usually random (except in a case-control study), response; X can be random

More information

Learning Bayesian network : Given structure and completely observed data

Learning Bayesian network : Given structure and completely observed data Learning Bayesian network : Given structure and completely observed data Probabilistic Graphical Models Sharif University of Technology Spring 2017 Soleymani Learning problem Target: true distribution

More information

A Power Analysis of Variable Deletion Within the MEWMA Control Chart Statistic

A Power Analysis of Variable Deletion Within the MEWMA Control Chart Statistic A Power Analysis of Variable Deletion Within the MEWMA Control Chart Statistic Jay R. Schaffer & Shawn VandenHul University of Northern Colorado McKee Greeley, CO 869 jay.schaffer@unco.edu gathen9@hotmail.com

More information

Topic 21 Goodness of Fit

Topic 21 Goodness of Fit Topic 21 Goodness of Fit Contingency Tables 1 / 11 Introduction Two-way Table Smoking Habits The Hypothesis The Test Statistic Degrees of Freedom Outline 2 / 11 Introduction Contingency tables, also known

More information

PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 2: PROBABILITY DISTRIBUTIONS

PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 2: PROBABILITY DISTRIBUTIONS PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 2: PROBABILITY DISTRIBUTIONS Parametric Distributions Basic building blocks: Need to determine given Representation: or? Recall Curve Fitting Binary Variables

More information

Econ 2148, fall 2017 Gaussian process priors, reproducing kernel Hilbert spaces, and Splines

Econ 2148, fall 2017 Gaussian process priors, reproducing kernel Hilbert spaces, and Splines Econ 2148, fall 2017 Gaussian process priors, reproducing kernel Hilbert spaces, and Splines Maximilian Kasy Department of Economics, Harvard University 1 / 37 Agenda 6 equivalent representations of the

More information

Generalized Linear Model under the Extended Negative Multinomial Model and Cancer Incidence

Generalized Linear Model under the Extended Negative Multinomial Model and Cancer Incidence Generalized Linear Model under the Extended Negative Multinomial Model and Cancer Incidence Sunil Kumar Dhar Center for Applied Mathematics and Statistics, Department of Mathematical Sciences, New Jersey

More information

Generalized logit models for nominal multinomial responses. Local odds ratios

Generalized logit models for nominal multinomial responses. Local odds ratios Generalized logit models for nominal multinomial responses Categorical Data Analysis, Summer 2015 1/17 Local odds ratios Y 1 2 3 4 1 π 11 π 12 π 13 π 14 π 1+ X 2 π 21 π 22 π 23 π 24 π 2+ 3 π 31 π 32 π

More information

A Blockwise Descent Algorithm for Group-penalized Multiresponse and Multinomial Regression

A Blockwise Descent Algorithm for Group-penalized Multiresponse and Multinomial Regression A Blockwise Descent Algorithm for Group-penalized Multiresponse and Multinomial Regression Noah Simon Jerome Friedman Trevor Hastie November 5, 013 Abstract In this paper we purpose a blockwise descent

More information

Chi-square (χ 2 ) Tests

Chi-square (χ 2 ) Tests Math 442 - Mathematical Statistics II April 30, 2018 Chi-square (χ 2 ) Tests Common Uses of the χ 2 test. 1. Testing Goodness-of-fit. 2. Testing Equality of Several Proportions. 3. Homogeneity Test. 4.

More information

NATIONAL UNIVERSITY OF SINGAPORE EXAMINATION (SOLUTIONS) ST3241 Categorical Data Analysis. (Semester II: )

NATIONAL UNIVERSITY OF SINGAPORE EXAMINATION (SOLUTIONS) ST3241 Categorical Data Analysis. (Semester II: ) NATIONAL UNIVERSITY OF SINGAPORE EXAMINATION (SOLUTIONS) Categorical Data Analysis (Semester II: 2010 2011) April/May, 2011 Time Allowed : 2 Hours Matriculation No: Seat No: Grade Table Question 1 2 3

More information

Advanced topics from statistics

Advanced topics from statistics Advanced topics from statistics Anders Ringgaard Kristensen Advanced Herd Management Slide 1 Outline Covariance and correlation Random vectors and multivariate distributions The multinomial distribution

More information

COMS 4721: Machine Learning for Data Science Lecture 10, 2/21/2017

COMS 4721: Machine Learning for Data Science Lecture 10, 2/21/2017 COMS 4721: Machine Learning for Data Science Lecture 10, 2/21/2017 Prof. John Paisley Department of Electrical Engineering & Data Science Institute Columbia University FEATURE EXPANSIONS FEATURE EXPANSIONS

More information

Ad Placement Strategies

Ad Placement Strategies Case Study 1: Estimating Click Probabilities Tackling an Unknown Number of Features with Sketching Machine Learning for Big Data CSE547/STAT548, University of Washington Emily Fox 2014 Emily Fox January

More information

Gibbs Sampling in Linear Models #2

Gibbs Sampling in Linear Models #2 Gibbs Sampling in Linear Models #2 Econ 690 Purdue University Outline 1 Linear Regression Model with a Changepoint Example with Temperature Data 2 The Seemingly Unrelated Regressions Model 3 Gibbs sampling

More information

Nonparametric Monitoring of Multiple Count Data

Nonparametric Monitoring of Multiple Count Data Nonparametric Monitoring of Multiple Count Data Peihua Qiu 1, Zhen He 2 and Zhiqiong Wang 3 1 Department of Biostatistics University of Florida, Gainesville, United States 2 College of Management and Economics

More information

Hierarchical Linear Models. Hierarchical Linear Models. Much of this material already seen in Chapters 5 and 14. Hyperprior on K parameters α:

Hierarchical Linear Models. Hierarchical Linear Models. Much of this material already seen in Chapters 5 and 14. Hyperprior on K parameters α: Hierarchical Linear Models Hierarchical Linear Models Much of this material already seen in Chapters 5 and 14 Hierarchical linear models combine regression framework with hierarchical framework Unified

More information

Chi-square (χ 2 ) Tests

Chi-square (χ 2 ) Tests Math 145 - Elementary Statistics April 17, 2007 Common Uses of the χ 2 test. 1. Testing Goodness-of-fit. Chi-square (χ 2 ) Tests 2. Testing Equality of Several Proportions. 3. Homogeneity Test. 4. Testing

More information

assumption identically change method. 1. Introduction1 .iust.ac.ir/ ABSTRACT identifying KEYWORDS estimation, Correlation,

assumption identically change method. 1. Introduction1 .iust.ac.ir/   ABSTRACT identifying KEYWORDS estimation, Correlation, International Journal Industrial Engineering & Production Research (207) December 207, Volume 28, Number 4 pp. 367-376 DOI: 0.22068/ijiepr.28.4.367 http://ijiepr..iust.ac.ir/ Change-Point Estimation High-Yiel

More information

Modern Statistical Process Control Charts and Their Applications in Analyzing Big Data

Modern Statistical Process Control Charts and Their Applications in Analyzing Big Data Modern Statistical Process Control Charts and Their Applications in Analyzing Big Data (Part III: Profile Monitoring and Dynamic Screening) Peihua Qiu pqiu@ufl.edu Department of Biostatistics University

More information

Asymptotic Statistics-III. Changliang Zou

Asymptotic Statistics-III. Changliang Zou Asymptotic Statistics-III Changliang Zou The multivariate central limit theorem Theorem (Multivariate CLT for iid case) Let X i be iid random p-vectors with mean µ and and covariance matrix Σ. Then n (

More information

On an Additive Semigraphoid Model for Statistical Networks With Application to Nov Pathway 25, 2016 Analysis -1 Bing / 38Li,

On an Additive Semigraphoid Model for Statistical Networks With Application to Nov Pathway 25, 2016 Analysis -1 Bing / 38Li, On an Additive Semigraphoid Model for Statistical Networks With Application to Pathway Analysis - Bing Li, Hyunho Chun & Hongyu Zhao Kim Youngrae SNU Stat. Multivariate Lab Nov 25, 2016 On an Additive

More information

High-Dimensional Process Monitoring and Fault Isolation via Variable Selection

High-Dimensional Process Monitoring and Fault Isolation via Variable Selection High-Dimensional Process Monitoring and Fault Isolation via Variable Selection KAIBO WANG Tsinghua University, Beijing 100084, P. R. China WEI JIANG The Hong Kong University of Science & Technology, Kowloon,

More information

Lecture 16 Solving GLMs via IRWLS

Lecture 16 Solving GLMs via IRWLS Lecture 16 Solving GLMs via IRWLS 09 November 2015 Taylor B. Arnold Yale Statistics STAT 312/612 Notes problem set 5 posted; due next class problem set 6, November 18th Goals for today fixed PCA example

More information

STAC51: Categorical data Analysis

STAC51: Categorical data Analysis STAC51: Categorical data Analysis Mahinda Samarakoon January 26, 2016 Mahinda Samarakoon STAC51: Categorical data Analysis 1 / 32 Table of contents Contingency Tables 1 Contingency Tables Mahinda Samarakoon

More information

Markov Chains and Hidden Markov Models

Markov Chains and Hidden Markov Models Chapter 1 Markov Chains and Hidden Markov Models In this chapter, we will introduce the concept of Markov chains, and show how Markov chains can be used to model signals using structures such as hidden

More information

Monitoring Expense Report Errors: Control Charts Under Independence and Dependence. Darren Williams. (Under the direction of Dr.

Monitoring Expense Report Errors: Control Charts Under Independence and Dependence. Darren Williams. (Under the direction of Dr. Monitoring Expense Report Errors: Control Charts Under Independence and Dependence by Darren Williams (Under the direction of Dr. Lynne Seymour) Abstract Control charts were devised to evaluate offices

More information

Statistics 3858 : Maximum Likelihood Estimators

Statistics 3858 : Maximum Likelihood Estimators Statistics 3858 : Maximum Likelihood Estimators 1 Method of Maximum Likelihood In this method we construct the so called likelihood function, that is L(θ) = L(θ; X 1, X 2,..., X n ) = f n (X 1, X 2,...,

More information

Q3) a) Explain the construction of np chart. b) Write a note on natural tolerance limits and specification limits.

Q3) a) Explain the construction of np chart. b) Write a note on natural tolerance limits and specification limits. (DMSTT 21) Total No. of Questions : 10] [Total No. of Pages : 02 M.Sc. DEGREE EXAMINATION, MAY 2017 Second Year STATISTICS Statistical Quality Control Time : 3 Hours Maximum Marks: 70 Answer any Five questions.

More information

Negative Multinomial Model and Cancer. Incidence

Negative Multinomial Model and Cancer. Incidence Generalized Linear Model under the Extended Negative Multinomial Model and Cancer Incidence S. Lahiri & Sunil K. Dhar Department of Mathematical Sciences, CAMS New Jersey Institute of Technology, Newar,

More information

Chapter 1. Modeling Basics

Chapter 1. Modeling Basics Chapter 1. Modeling Basics What is a model? Model equation and probability distribution Types of model effects Writing models in matrix form Summary 1 What is a statistical model? A model is a mathematical

More information

1 Uniform Distribution. 2 Gamma Distribution. 3 Inverse Gamma Distribution. 4 Multivariate Normal Distribution. 5 Multivariate Student-t Distribution

1 Uniform Distribution. 2 Gamma Distribution. 3 Inverse Gamma Distribution. 4 Multivariate Normal Distribution. 5 Multivariate Student-t Distribution A Few Special Distributions Their Properties Econ 675 Iowa State University November 1 006 Justin L Tobias (ISU Distributional Catalog November 1 006 1 / 0 Special Distributions Their Associated Properties

More information

TABLE OF CONTENTS CHAPTER 1 COMBINATORIAL PROBABILITY 1

TABLE OF CONTENTS CHAPTER 1 COMBINATORIAL PROBABILITY 1 TABLE OF CONTENTS CHAPTER 1 COMBINATORIAL PROBABILITY 1 1.1 The Probability Model...1 1.2 Finite Discrete Models with Equally Likely Outcomes...5 1.2.1 Tree Diagrams...6 1.2.2 The Multiplication Principle...8

More information

Chapter 10. Chapter 10. Multinomial Experiments and. Multinomial Experiments and Contingency Tables. Contingency Tables.

Chapter 10. Chapter 10. Multinomial Experiments and. Multinomial Experiments and Contingency Tables. Contingency Tables. Chapter 10 Multinomial Experiments and Contingency Tables 1 Chapter 10 Multinomial Experiments and Contingency Tables 10-1 1 Overview 10-2 2 Multinomial Experiments: of-fitfit 10-3 3 Contingency Tables:

More information

CSC 411: Lecture 09: Naive Bayes

CSC 411: Lecture 09: Naive Bayes CSC 411: Lecture 09: Naive Bayes Class based on Raquel Urtasun & Rich Zemel s lectures Sanja Fidler University of Toronto Feb 8, 2015 Urtasun, Zemel, Fidler (UofT) CSC 411: 09-Naive Bayes Feb 8, 2015 1

More information

Modeling the scale parameter ϕ A note on modeling correlation of binary responses Using marginal odds ratios to model association for binary responses

Modeling the scale parameter ϕ A note on modeling correlation of binary responses Using marginal odds ratios to model association for binary responses Outline Marginal model Examples of marginal model GEE1 Augmented GEE GEE1.5 GEE2 Modeling the scale parameter ϕ A note on modeling correlation of binary responses Using marginal odds ratios to model association

More information

System Monitoring with Real-Time Contrasts

System Monitoring with Real-Time Contrasts System Monitoring with Real- Contrasts HOUTAO DENG Intuit, Mountain View, CA 94043, USA GEORGE RUNGER Arizona State University, Tempe, AZ 85287, USA EUGENE TUV Intel Corporation, Chandler, AZ 85226, USA

More information

Contents. Preface to Second Edition Preface to First Edition Abbreviations PART I PRINCIPLES OF STATISTICAL THINKING AND ANALYSIS 1

Contents. Preface to Second Edition Preface to First Edition Abbreviations PART I PRINCIPLES OF STATISTICAL THINKING AND ANALYSIS 1 Contents Preface to Second Edition Preface to First Edition Abbreviations xv xvii xix PART I PRINCIPLES OF STATISTICAL THINKING AND ANALYSIS 1 1 The Role of Statistical Methods in Modern Industry and Services

More information

Bayesian Modeling of Conditional Distributions

Bayesian Modeling of Conditional Distributions Bayesian Modeling of Conditional Distributions John Geweke University of Iowa Indiana University Department of Economics February 27, 2007 Outline Motivation Model description Methods of inference Earnings

More information

GAUSSIAN PROCESS REGRESSION

GAUSSIAN PROCESS REGRESSION GAUSSIAN PROCESS REGRESSION CSE 515T Spring 2015 1. BACKGROUND The kernel trick again... The Kernel Trick Consider again the linear regression model: y(x) = φ(x) w + ε, with prior p(w) = N (w; 0, Σ). The

More information

Math 423/533: The Main Theoretical Topics

Math 423/533: The Main Theoretical Topics Math 423/533: The Main Theoretical Topics Notation sample size n, data index i number of predictors, p (p = 2 for simple linear regression) y i : response for individual i x i = (x i1,..., x ip ) (1 p)

More information

Institute of Actuaries of India

Institute of Actuaries of India Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics For 2018 Examinations Subject CT3 Probability and Mathematical Statistics Core Technical Syllabus 1 June 2017 Aim The

More information

Statistical Process Control

Statistical Process Control Statistical Process Control What is a process? Inputs PROCESS Outputs A process can be described as a transformation of set of inputs into desired outputs. Types of Measures Measures where the metric is

More information

Monitoring and diagnosing a two-stage production process with attribute characteristics

Monitoring and diagnosing a two-stage production process with attribute characteristics Iranian Journal of Operations Research Vol., No.,, pp. -6 Monitoring and diagnosing a two-stage production process with attribute characteristics Downloaded from iors.ir at :6 +33 on Wednesday October

More information

Multiple Linear Regression

Multiple Linear Regression Multiple Linear Regression University of California, San Diego Instructor: Ery Arias-Castro http://math.ucsd.edu/~eariasca/teaching.html 1 / 42 Passenger car mileage Consider the carmpg dataset taken from

More information

1. Density and properties Brief outline 2. Sampling from multivariate normal and MLE 3. Sampling distribution and large sample behavior of X and S 4.

1. Density and properties Brief outline 2. Sampling from multivariate normal and MLE 3. Sampling distribution and large sample behavior of X and S 4. Multivariate normal distribution Reading: AMSA: pages 149-200 Multivariate Analysis, Spring 2016 Institute of Statistics, National Chiao Tung University March 1, 2016 1. Density and properties Brief outline

More information

Linear Methods for Prediction

Linear Methods for Prediction Chapter 5 Linear Methods for Prediction 5.1 Introduction We now revisit the classification problem and focus on linear methods. Since our prediction Ĝ(x) will always take values in the discrete set G we

More information

INFORMATION THEORY AND STATISTICS

INFORMATION THEORY AND STATISTICS INFORMATION THEORY AND STATISTICS Solomon Kullback DOVER PUBLICATIONS, INC. Mineola, New York Contents 1 DEFINITION OF INFORMATION 1 Introduction 1 2 Definition 3 3 Divergence 6 4 Examples 7 5 Problems...''.

More information