Directional Control Schemes for Multivariate Categorical Processes
|
|
- Brenda Glenn
- 5 years ago
- Views:
Transcription
1 Directional Control Schemes for Multivariate Categorical Processes Nankai University Homepage: math.nankai.edu.cn/ chlzou (Joint work with Mr. Jian Li and Prof. Fugee Tsung)
2 Outline 1 Multivariate Categorical Modeling 2 Directional Control Schemes 3 Performance Assessment
3 Multivariate Categorical Modeling Multivariate Categorical Processes The aluminium electrolytic capacitor (AEC). Capacity (CAP) Dissipation Factor (DF) Leakage Current (LC) (CAP DF LC), 1 for conforming and 2 for nonconforming, 2 3 = 8 level combinations or cross-classifications: (111), (112), (121), (122), (211), (212), (221), (222). (121) means an AEC with conforming CAP and LC, and nonconforming DF 3-way contingency table of size and with 8 cells; Each cell corresponds to one level combination and stores the count under this combination.
4 Multivariate Categorical Modeling Existing works Multivariate binomial chart χ 2 -chart (Patel s 1973) : G MB,k = 1 N ( nmb,k Np (0) ) T ( MB Σ 1 MB nmb,k Np (0) MB). only applicable in multivariate binomial processes The multivariate multinomial chart, a multi-chart comprising p individual charts (Marcucci 1985) G MM,(i)k = 1 N ( nmm,(i)k Np (0) ) T ( MM,(i) Σ 1 MM,(i) nmm,(i)k Np (0) MM,(i)). cumbersome
5 Multivariate Categorical Modeling Some review on multivariate continuous monitoring Hotelling T 2 -based statistic (Lowry et al. 1992): n X Σ 1 X Regression-adjusted statistic (Hawkins 1991): max i=1,...,p v i, v j = n(d jσ 1 X)/(d jσ 1 d j ) 1 2, for j = 1,..., p. MSPC using LASSO or other variable selection techniques
6 Multivariate Categorical Modeling Contingency Table p categorical variables or factors {C 1, C 2,..., C p }. Each classification factor C i takes h i of possible levels. A p-way h 1 h 2... h p cross-classified contingency table with h = p i=1 h i cells. The AEC example: p = 3 and h 1 = h 2 = h 3 = 2. Log-Linear Models For an h 1 h 2 h 3 table and a fixed sample size N, the cell counts follow the multinomial distribution MN(N; p ijk ) with and their expectations m i,j,k = Np i,j,k, and i,j,k p i,j,k = 1. The log-linear model can characterize the relationship between cell probabilities and factor levels, ln p i,j,k = u (0) +u (1) i +u (2) j +u (3) k +u (1,2) i,j +u (1,3) i,k +u (2,3) j,k +u (1,2,3) i,j,k.
7 Multivariate Categorical Modeling Identifiability requires constraints such as u (1) i = u (1,2) i,j = u (1,3) i,k = i i i i u (1,2,3) i,j,k = 0 for the first factor along its index i. Similar equations describe the second and third factors along with their indexes j and k, respectively. Re-parameterizations For example, a 2 3 contingency table u (0) = β 0, u (1) 1 = β 1, u (1) 2 = β 1, u (2) 1 = β 2, u (2) 2 = β 3, u (2) 3 = β 2 β 3, u (1,2) 1,1 = β 4, u (1,2) 1,2 = β 5, u (1,2) 1,3 = β 4 β 5, u (1,2) 2,1 = β 4, u (1,2) 2,2 = β 5, u (1,2) 2,3 = β 4 + β 5.
8 Multivariate Categorical Modeling Log-Linear Model at the effect level Imposed by some constraints, the log-linear model for a general p-way contingency table can be written as 2 p 1 ln p = 1β 0 + X i β i. The subvector β i stands for the ith main or interaction effect. i=1 Log-Linear Model at the coefficient level h 1 ln p = 1β 0 + x i β i i=1 The number β i stands for the ith coefficient, either part or totality of an effect.
9 Directional Control Schemes One-to-one correspondence between coefficient subvectors and factor effects Shifts in the marginal dist. of one factor represent deviations of the coefficient subvector corresponding to its main effect; Shifts in the dependence among multiple factors represent deviations of the coefficient subvector reflecting their interaction effect.
10 Outline 1 Multivariate Categorical Modeling 2 Directional Control Schemes 3 Performance Assessment
11 Directional Control Schemes Problem Formulation F (X; β): ln p = 1β 0 + Xβ and p T 1 = 1 X = [X 1,..., X 2 p 1] = [x 1, x 2,..., x h 1 ] The jth on-line multivariate sampling observation vector, n (j) of size h 1, collected over time from the change-point model { n (j) i.i.d. F (X; β (0) ), for j = 1,..., τ, F (X; β (1) ), for j = τ + 1,...
12 Directional Control Schemes Shift directions Shifts with all possibilities H 0 : β = β (0) H 1 : β β (0). Only the coefficient β i (1 i h 1) adds by an unknown constant δ i H 0 : β = β (0) H 1 : β = β (0) + d i δ i. Only one coefficient may deviate with unknown location H 0 : β = β (0) H 1 : β = β (0) + d 1 δ 1 or... or β = β (0) + d h 1 δ h 1.
13 Directional Control Schemes Shift directions (cont d) Most shifts are inclined to occur in lower-order effects. Most real applications care about only means and variances, namely moments of the first two orders. The monitoring task is to only answer yes or no about whether process is IC, rather than telling the locations of shifted effects. Focus on merely effects of the first two orders (i.e., the main effects and the two-factor interaction effects). H 0 : β = β (0) ) H 1 : (β = β (0) + d i δ i. i I 2
14 Directional Control Schemes LLD chart Given the IC probability vector p (0), Phase II sample size N, observation vectors n j (j = 1, 2,...), we construct an EWMA control chart based on likelihood ratio test with its charting statistic ( 1 ( R k = max z (k) Np (0)) T ( xi x T i I 2 N i Σ (0) ) 1x ( T x i i n Np (0) )). z (k) = (1 λ)z (k 1) + λn (k) EWMA observation vector Σ (0) = diag ( p (0)) p (0) (p (0) ) T x i ith column of X Log-Linear Directional (LLD) control chart.
15 Directional Control Schemes Diagnostic Procedures The real one-coefficient-shift direction index ζ can be identified as ( 1 ( ˆζ = arg max zη Np (0)) T ( ) xj x T 1x ( T j I 3 N j Σxj j zη Np (0))). Σ = diag (ˆp ) ˆpˆp T with ˆp = z η /N. z η is the EWMA observation vector at the signal time point η. We confine the candidate subset of diagnostic shift directions to effects in the first three orders, which is still safe in case a shift appears in a three-factor interaction effect.
16 Outline 1 Multivariate Categorical Modeling 2 Directional Control Schemes 3 Performance Assessment
17 Performance Assessment LLD chart VS MBE chart Multivariate binomial EWMA (MBE) chart, the EWMA version of Patel s (1973) χ 2 -chart. Multivariate Binomial Processes 5 characteristics each with 2 levels, a 5-way contingency table with 2 5 = 32 cells. Table: OC ARLs of LLD and MBE, λ = 0.1, N = 1, 000, ARL 0 =370 δ LLD MBE LLD MBE LLD MBE β (3) β (5) β (1,4) β (2,3) β (2,5) β (3,4)
18 Performance Assessment LLD chart VS MME chart The multivariate multinomial EWMA (MME) chart, a multi-chart comprising p individual charts. Each is the EWMA version of the generalized p-chart in Marcucci (1985). Multivariate Multinomial Processes. 4 characteristics of 2, 2, 3, 3 levels, a 4-way table. Table: OC ARLs of LLD and MME, λ = 0.1, N = 1, 000, ARL 0 =370 δ LLD MME LLD MME LLD MME β (32 ) β (41 ) β (1,2) β (1,32 ) β (2,31 ) β (31,4 1 )
19 Performance Assessment Diagnostic Performance The same setting in the 5-way table as the comparison with MBE. Matching probability P( ˆζ = ζ) Table: Observed matching probability for diagnosing shift direction δ β (2) β (4) β (1,3) β (4,5) β (1,4,5) β (2,3,5) NOTE: λ = 0.1. N = 1, 000.
20 Conclusion The superiority of the LLD chart lies in one-coefficient-shifts or shifts in high-order interaction effects. The LLD chart can work well within the unified framework of multivariate binomial and multivariate multinomial processes. The diagnostic schemes show good performance in estimating fault directions.
21 Thank you for your attention!
Statistical Process Control for Multivariate Categorical Processes
Statistical Process Control for Multivariate Categorical Processes Fugee Tsung The Hong Kong University of Science and Technology Fugee Tsung 1/27 Introduction Typical Control Charts Univariate continuous
More informationMultivariate Binomial/Multinomial Control Chart
Multivariate Binomial/Multinomial Control Chart Jian Li 1, Fugee Tsung 1, and Changliang Zou 2 1 Department of Industrial Engineering and Logistics Management, Hong Kong University of Science and Technology,
More informationDescribing Contingency tables
Today s topics: Describing Contingency tables 1. Probability structure for contingency tables (distributions, sensitivity/specificity, sampling schemes). 2. Comparing two proportions (relative risk, odds
More informationModule B1: Multivariate Process Control
Module B1: Multivariate Process Control Prof. Fugee Tsung Hong Kong University of Science and Technology Quality Lab: http://qlab.ielm.ust.hk I. Multivariate Shewhart chart WHY MULTIVARIATE PROCESS CONTROL
More informationSTAT Chapter 13: Categorical Data. Recall we have studied binomial data, in which each trial falls into one of 2 categories (success/failure).
STAT 515 -- Chapter 13: Categorical Data Recall we have studied binomial data, in which each trial falls into one of 2 categories (success/failure). Many studies allow for more than 2 categories. Example
More informationThe purpose of this section is to derive the asymptotic distribution of the Pearson chi-square statistic. k (n j np j ) 2. np j.
Chapter 9 Pearson s chi-square test 9. Null hypothesis asymptotics Let X, X 2, be independent from a multinomial(, p) distribution, where p is a k-vector with nonnegative entries that sum to one. That
More informationDiscrete Multivariate Statistics
Discrete Multivariate Statistics Univariate Discrete Random variables Let X be a discrete random variable which, in this module, will be assumed to take a finite number of t different values which are
More informationRejoinder. 1 Phase I and Phase II Profile Monitoring. Peihua Qiu 1, Changliang Zou 2 and Zhaojun Wang 2
Rejoinder Peihua Qiu 1, Changliang Zou 2 and Zhaojun Wang 2 1 School of Statistics, University of Minnesota 2 LPMC and Department of Statistics, Nankai University, China We thank the editor Professor David
More informationMonitoring General Linear Profiles Using Multivariate EWMA schemes
Monitoring General Linear Profiles Using Multivariate EWMA schemes Changliang Zou Department of Statistics School of Mathematical Sciences Nankai University Tianjian, PR China Fugee Tsung Department of
More informationA Practitioner s Guide to Generalized Linear Models
A Practitioners Guide to Generalized Linear Models Background The classical linear models and most of the minimum bias procedures are special cases of generalized linear models (GLMs). GLMs are more technically
More informationMaximum likelihood in log-linear models
Graphical Models, Lecture 4, Michaelmas Term 2010 October 22, 2010 Generating class Dependence graph of log-linear model Conformal graphical models Factor graphs Let A denote an arbitrary set of subsets
More informationMachine Learning, Fall 2012 Homework 2
0-60 Machine Learning, Fall 202 Homework 2 Instructors: Tom Mitchell, Ziv Bar-Joseph TA in charge: Selen Uguroglu email: sugurogl@cs.cmu.edu SOLUTIONS Naive Bayes, 20 points Problem. Basic concepts, 0
More informationij i j m ij n ij m ij n i j Suppose we denote the row variable by X and the column variable by Y ; We can then re-write the above expression as
page1 Loglinear Models Loglinear models are a way to describe association and interaction patterns among categorical variables. They are commonly used to model cell counts in contingency tables. These
More informationMIT Spring 2015
Regression Analysis MIT 18.472 Dr. Kempthorne Spring 2015 1 Outline Regression Analysis 1 Regression Analysis 2 Multiple Linear Regression: Setup Data Set n cases i = 1, 2,..., n 1 Response (dependent)
More informationNaïve Bayes classification
Naïve Bayes classification 1 Probability theory Random variable: a variable whose possible values are numerical outcomes of a random phenomenon. Examples: A person s height, the outcome of a coin toss
More informationSummary of Chapters 7-9
Summary of Chapters 7-9 Chapter 7. Interval Estimation 7.2. Confidence Intervals for Difference of Two Means Let X 1,, X n and Y 1, Y 2,, Y m be two independent random samples of sizes n and m from two
More informationNotes on Machine Learning for and
Notes on Machine Learning for 16.410 and 16.413 (Notes adapted from Tom Mitchell and Andrew Moore.) Choosing Hypotheses Generally want the most probable hypothesis given the training data Maximum a posteriori
More informationNaïve Bayes classification. p ij 11/15/16. Probability theory. Probability theory. Probability theory. X P (X = x i )=1 i. Marginal Probability
Probability theory Naïve Bayes classification Random variable: a variable whose possible values are numerical outcomes of a random phenomenon. s: A person s height, the outcome of a coin toss Distinguish
More informationCentral Limit Theorem ( 5.3)
Central Limit Theorem ( 5.3) Let X 1, X 2,... be a sequence of independent random variables, each having n mean µ and variance σ 2. Then the distribution of the partial sum S n = X i i=1 becomes approximately
More informationLikelihood Ratio-Based Distribution-Free EWMA Control Charts
Likelihood Ratio-Based Distribution-Free EWMA Control Charts CHANGLIANG ZOU Nankai University, Tianjin, China FUGEE TSUNG Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong
More informationA Probability Review
A Probability Review Outline: A probability review Shorthand notation: RV stands for random variable EE 527, Detection and Estimation Theory, # 0b 1 A Probability Review Reading: Go over handouts 2 5 in
More informationUnit 9: Inferences for Proportions and Count Data
Unit 9: Inferences for Proportions and Count Data Statistics 571: Statistical Methods Ramón V. León 12/15/2008 Unit 9 - Stat 571 - Ramón V. León 1 Large Sample Confidence Interval for Proportion ( pˆ p)
More informationSelf-Starting Control Chart for Simultaneously Monitoring Process Mean and Variance
International Journal of Production Research Vol. 00, No. 00, 15 March 2008, 1 14 Self-Starting Control Chart for Simultaneously Monitoring Process Mean and Variance Zhonghua Li a, Jiujun Zhang a,b and
More informationA Few Special Distributions and Their Properties
A Few Special Distributions and Their Properties Econ 690 Purdue University Justin L. Tobias (Purdue) Distributional Catalog 1 / 20 Special Distributions and Their Associated Properties 1 Uniform Distribution
More informationConfidence Intervals, Testing and ANOVA Summary
Confidence Intervals, Testing and ANOVA Summary 1 One Sample Tests 1.1 One Sample z test: Mean (σ known) Let X 1,, X n a r.s. from N(µ, σ) or n > 30. Let The test statistic is H 0 : µ = µ 0. z = x µ 0
More informationPh.D. Qualifying Exam Friday Saturday, January 6 7, 2017
Ph.D. Qualifying Exam Friday Saturday, January 6 7, 2017 Put your solution to each problem on a separate sheet of paper. Problem 1. (5106) Let X 1, X 2,, X n be a sequence of i.i.d. observations from a
More informationMultivariate Statistical Analysis
Multivariate Statistical Analysis Fall 2011 C. L. Williams, Ph.D. Lecture 3 for Applied Multivariate Analysis Outline 1 Reprise-Vectors, vector lengths and the angle between them 2 3 Partial correlation
More informationUnit 9: Inferences for Proportions and Count Data
Unit 9: Inferences for Proportions and Count Data Statistics 571: Statistical Methods Ramón V. León 1/15/008 Unit 9 - Stat 571 - Ramón V. León 1 Large Sample Confidence Interval for Proportion ( pˆ p)
More informationThe Sparsity and Bias of The LASSO Selection In High-Dimensional Linear Regression
The Sparsity and Bias of The LASSO Selection In High-Dimensional Linear Regression Cun-hui Zhang and Jian Huang Presenter: Quefeng Li Feb. 26, 2010 un-hui Zhang and Jian Huang Presenter: Quefeng The Sparsity
More informationMultivariate distance Fall
Multivariate distance 2017 Fall Contents Euclidean Distance Definitions Standardization Population Distance Population mean and variance Definitions proportions presence-absence data Multivariate distance
More informationPubH 5450 Biostatistics I Prof. Carlin. Lecture 13
PubH 5450 Biostatistics I Prof. Carlin Lecture 13 Outline Outline Sample Size Counts, Rates and Proportions Part I Sample Size Type I Error and Power Type I error rate: probability of rejecting the null
More informationYu Xie, Institute for Social Research, 426 Thompson Street, University of Michigan, Ann
Association Model, Page 1 Yu Xie, Institute for Social Research, 426 Thompson Street, University of Michigan, Ann Arbor, MI 48106. Email: yuxie@umich.edu. Tel: (734)936-0039. Fax: (734)998-7415. Association
More informationLecture 16 : Bayesian analysis of contingency tables. Bayesian linear regression. Jonathan Marchini (University of Oxford) BS2a MT / 15
Lecture 16 : Bayesian analysis of contingency tables. Bayesian linear regression. Jonathan Marchini (University of Oxford) BS2a MT 2013 1 / 15 Contingency table analysis North Carolina State University
More informationPh.D. Qualifying Exam Friday Saturday, January 3 4, 2014
Ph.D. Qualifying Exam Friday Saturday, January 3 4, 2014 Put your solution to each problem on a separate sheet of paper. Problem 1. (5166) Assume that two random samples {x i } and {y i } are independently
More informationPractice Problems Section Problems
Practice Problems Section 4-4-3 4-4 4-5 4-6 4-7 4-8 4-10 Supplemental Problems 4-1 to 4-9 4-13, 14, 15, 17, 19, 0 4-3, 34, 36, 38 4-47, 49, 5, 54, 55 4-59, 60, 63 4-66, 68, 69, 70, 74 4-79, 81, 84 4-85,
More informationMulti-Variate-Attribute Quality Control (MVAQC)
by Submitted in total fulfillment of the requirements of the degree of Doctor of Philosophy July 2014 Department of Mechanical Engineering The University of Melboune Abstract When the number of quality
More informationSpatial Statistics with Image Analysis. Lecture L11. Home assignment 3. Lecture 11. Johan Lindström. December 5, 2016.
HA3 MRF:s Simulation Estimation Spatial Statistics with Image Analysis Lecture 11 Johan Lindström December 5, 2016 Lecture L11 Johan Lindström - johanl@maths.lth.se FMSN20/MASM25 L11 1/22 HA3 MRF:s Simulation
More informationSample Size and Power Considerations for Longitudinal Studies
Sample Size and Power Considerations for Longitudinal Studies Outline Quantities required to determine the sample size in longitudinal studies Review of type I error, type II error, and power For continuous
More informationOptimal exact tests for complex alternative hypotheses on cross tabulated data
Optimal exact tests for complex alternative hypotheses on cross tabulated data Daniel Yekutieli Statistics and OR Tel Aviv University CDA course 29 July 2017 Yekutieli (TAU) Optimal exact tests for complex
More information13.1 Categorical Data and the Multinomial Experiment
Chapter 13 Categorical Data Analysis 13.1 Categorical Data and the Multinomial Experiment Recall Variable: (numerical) variable (i.e. # of students, temperature, height,). (non-numerical, categorical)
More informationThe Multinomial Model
The Multinomial Model STA 312: Fall 2012 Contents 1 Multinomial Coefficients 1 2 Multinomial Distribution 2 3 Estimation 4 4 Hypothesis tests 8 5 Power 17 1 Multinomial Coefficients Multinomial coefficient
More informationIntroduction to Statistical Data Analysis Lecture 7: The Chi-Square Distribution
Introduction to Statistical Data Analysis Lecture 7: The Chi-Square Distribution James V. Lambers Department of Mathematics The University of Southern Mississippi James V. Lambers Statistical Data Analysis
More informationGoodness of Fit Tests: Homogeneity
Goodness of Fit Tests: Homogeneity Mathematics 47: Lecture 35 Dan Sloughter Furman University May 11, 2006 Dan Sloughter (Furman University) Goodness of Fit Tests: Homogeneity May 11, 2006 1 / 13 Testing
More informationModeling Real Estate Data using Quantile Regression
Modeling Real Estate Data using Semiparametric Quantile Regression Department of Statistics University of Innsbruck September 9th, 2011 Overview 1 Application: 2 3 4 Hedonic regression data for house prices
More informationLog-linear Models for Contingency Tables
Log-linear Models for Contingency Tables Statistics 149 Spring 2006 Copyright 2006 by Mark E. Irwin Log-linear Models for Two-way Contingency Tables Example: Business Administration Majors and Gender A
More informationApplied Statistics Comprehensive Exam
Applied Statistics Comprehensive Exam August 2013 Ph.D Theory Exam This comprehensive exam consists of 10 questions pertaining to theoretical statistical topics. 1 This Ph.D level exam will run from 8:30
More information2.830J / 6.780J / ESD.63J Control of Manufacturing Processes (SMA 6303) Spring 2008
MIT OpenCourseWare http://ocw.mit.edu 2.830J / 6.780J / ESD.63J Control of Processes (SMA 6303) Spring 2008 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/term
More information10708 Graphical Models: Homework 2
10708 Graphical Models: Homework 2 Due Monday, March 18, beginning of class Feburary 27, 2013 Instructions: There are five questions (one for extra credit) on this assignment. There is a problem involves
More informationMachine Learning - MT Classification: Generative Models
Machine Learning - MT 2016 7. Classification: Generative Models Varun Kanade University of Oxford October 31, 2016 Announcements Practical 1 Submission Try to get signed off during session itself Otherwise,
More informationPreface Introduction to Statistics and Data Analysis Overview: Statistical Inference, Samples, Populations, and Experimental Design The Role of
Preface Introduction to Statistics and Data Analysis Overview: Statistical Inference, Samples, Populations, and Experimental Design The Role of Probability Sampling Procedures Collection of Data Measures
More informationSociology 6Z03 Review II
Sociology 6Z03 Review II John Fox McMaster University Fall 2016 John Fox (McMaster University) Sociology 6Z03 Review II Fall 2016 1 / 35 Outline: Review II Probability Part I Sampling Distributions Probability
More informationTotal No. of Questions : 10] [Total No. of Pages : 02. M.Sc. DEGREE EXAMINATION, DEC Second Year STATISTICS. Statistical Quality Control
(DMSTT21) Total No. of Questions : 10] [Total No. of Pages : 02 M.Sc. DEGREE EXAMINATION, DEC. 2016 Second Year STATISTICS Statistical Quality Control Time : 3 Hours Maximum Marks : 70 Answer any five
More information2 Describing Contingency Tables
2 Describing Contingency Tables I. Probability structure of a 2-way contingency table I.1 Contingency Tables X, Y : cat. var. Y usually random (except in a case-control study), response; X can be random
More informationLearning Bayesian network : Given structure and completely observed data
Learning Bayesian network : Given structure and completely observed data Probabilistic Graphical Models Sharif University of Technology Spring 2017 Soleymani Learning problem Target: true distribution
More informationA Power Analysis of Variable Deletion Within the MEWMA Control Chart Statistic
A Power Analysis of Variable Deletion Within the MEWMA Control Chart Statistic Jay R. Schaffer & Shawn VandenHul University of Northern Colorado McKee Greeley, CO 869 jay.schaffer@unco.edu gathen9@hotmail.com
More informationTopic 21 Goodness of Fit
Topic 21 Goodness of Fit Contingency Tables 1 / 11 Introduction Two-way Table Smoking Habits The Hypothesis The Test Statistic Degrees of Freedom Outline 2 / 11 Introduction Contingency tables, also known
More informationPATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 2: PROBABILITY DISTRIBUTIONS
PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 2: PROBABILITY DISTRIBUTIONS Parametric Distributions Basic building blocks: Need to determine given Representation: or? Recall Curve Fitting Binary Variables
More informationEcon 2148, fall 2017 Gaussian process priors, reproducing kernel Hilbert spaces, and Splines
Econ 2148, fall 2017 Gaussian process priors, reproducing kernel Hilbert spaces, and Splines Maximilian Kasy Department of Economics, Harvard University 1 / 37 Agenda 6 equivalent representations of the
More informationGeneralized Linear Model under the Extended Negative Multinomial Model and Cancer Incidence
Generalized Linear Model under the Extended Negative Multinomial Model and Cancer Incidence Sunil Kumar Dhar Center for Applied Mathematics and Statistics, Department of Mathematical Sciences, New Jersey
More informationGeneralized logit models for nominal multinomial responses. Local odds ratios
Generalized logit models for nominal multinomial responses Categorical Data Analysis, Summer 2015 1/17 Local odds ratios Y 1 2 3 4 1 π 11 π 12 π 13 π 14 π 1+ X 2 π 21 π 22 π 23 π 24 π 2+ 3 π 31 π 32 π
More informationA Blockwise Descent Algorithm for Group-penalized Multiresponse and Multinomial Regression
A Blockwise Descent Algorithm for Group-penalized Multiresponse and Multinomial Regression Noah Simon Jerome Friedman Trevor Hastie November 5, 013 Abstract In this paper we purpose a blockwise descent
More informationChi-square (χ 2 ) Tests
Math 442 - Mathematical Statistics II April 30, 2018 Chi-square (χ 2 ) Tests Common Uses of the χ 2 test. 1. Testing Goodness-of-fit. 2. Testing Equality of Several Proportions. 3. Homogeneity Test. 4.
More informationNATIONAL UNIVERSITY OF SINGAPORE EXAMINATION (SOLUTIONS) ST3241 Categorical Data Analysis. (Semester II: )
NATIONAL UNIVERSITY OF SINGAPORE EXAMINATION (SOLUTIONS) Categorical Data Analysis (Semester II: 2010 2011) April/May, 2011 Time Allowed : 2 Hours Matriculation No: Seat No: Grade Table Question 1 2 3
More informationAdvanced topics from statistics
Advanced topics from statistics Anders Ringgaard Kristensen Advanced Herd Management Slide 1 Outline Covariance and correlation Random vectors and multivariate distributions The multinomial distribution
More informationCOMS 4721: Machine Learning for Data Science Lecture 10, 2/21/2017
COMS 4721: Machine Learning for Data Science Lecture 10, 2/21/2017 Prof. John Paisley Department of Electrical Engineering & Data Science Institute Columbia University FEATURE EXPANSIONS FEATURE EXPANSIONS
More informationAd Placement Strategies
Case Study 1: Estimating Click Probabilities Tackling an Unknown Number of Features with Sketching Machine Learning for Big Data CSE547/STAT548, University of Washington Emily Fox 2014 Emily Fox January
More informationGibbs Sampling in Linear Models #2
Gibbs Sampling in Linear Models #2 Econ 690 Purdue University Outline 1 Linear Regression Model with a Changepoint Example with Temperature Data 2 The Seemingly Unrelated Regressions Model 3 Gibbs sampling
More informationNonparametric Monitoring of Multiple Count Data
Nonparametric Monitoring of Multiple Count Data Peihua Qiu 1, Zhen He 2 and Zhiqiong Wang 3 1 Department of Biostatistics University of Florida, Gainesville, United States 2 College of Management and Economics
More informationHierarchical Linear Models. Hierarchical Linear Models. Much of this material already seen in Chapters 5 and 14. Hyperprior on K parameters α:
Hierarchical Linear Models Hierarchical Linear Models Much of this material already seen in Chapters 5 and 14 Hierarchical linear models combine regression framework with hierarchical framework Unified
More informationChi-square (χ 2 ) Tests
Math 145 - Elementary Statistics April 17, 2007 Common Uses of the χ 2 test. 1. Testing Goodness-of-fit. Chi-square (χ 2 ) Tests 2. Testing Equality of Several Proportions. 3. Homogeneity Test. 4. Testing
More informationassumption identically change method. 1. Introduction1 .iust.ac.ir/ ABSTRACT identifying KEYWORDS estimation, Correlation,
International Journal Industrial Engineering & Production Research (207) December 207, Volume 28, Number 4 pp. 367-376 DOI: 0.22068/ijiepr.28.4.367 http://ijiepr..iust.ac.ir/ Change-Point Estimation High-Yiel
More informationModern Statistical Process Control Charts and Their Applications in Analyzing Big Data
Modern Statistical Process Control Charts and Their Applications in Analyzing Big Data (Part III: Profile Monitoring and Dynamic Screening) Peihua Qiu pqiu@ufl.edu Department of Biostatistics University
More informationAsymptotic Statistics-III. Changliang Zou
Asymptotic Statistics-III Changliang Zou The multivariate central limit theorem Theorem (Multivariate CLT for iid case) Let X i be iid random p-vectors with mean µ and and covariance matrix Σ. Then n (
More informationOn an Additive Semigraphoid Model for Statistical Networks With Application to Nov Pathway 25, 2016 Analysis -1 Bing / 38Li,
On an Additive Semigraphoid Model for Statistical Networks With Application to Pathway Analysis - Bing Li, Hyunho Chun & Hongyu Zhao Kim Youngrae SNU Stat. Multivariate Lab Nov 25, 2016 On an Additive
More informationHigh-Dimensional Process Monitoring and Fault Isolation via Variable Selection
High-Dimensional Process Monitoring and Fault Isolation via Variable Selection KAIBO WANG Tsinghua University, Beijing 100084, P. R. China WEI JIANG The Hong Kong University of Science & Technology, Kowloon,
More informationLecture 16 Solving GLMs via IRWLS
Lecture 16 Solving GLMs via IRWLS 09 November 2015 Taylor B. Arnold Yale Statistics STAT 312/612 Notes problem set 5 posted; due next class problem set 6, November 18th Goals for today fixed PCA example
More informationSTAC51: Categorical data Analysis
STAC51: Categorical data Analysis Mahinda Samarakoon January 26, 2016 Mahinda Samarakoon STAC51: Categorical data Analysis 1 / 32 Table of contents Contingency Tables 1 Contingency Tables Mahinda Samarakoon
More informationMarkov Chains and Hidden Markov Models
Chapter 1 Markov Chains and Hidden Markov Models In this chapter, we will introduce the concept of Markov chains, and show how Markov chains can be used to model signals using structures such as hidden
More informationMonitoring Expense Report Errors: Control Charts Under Independence and Dependence. Darren Williams. (Under the direction of Dr.
Monitoring Expense Report Errors: Control Charts Under Independence and Dependence by Darren Williams (Under the direction of Dr. Lynne Seymour) Abstract Control charts were devised to evaluate offices
More informationStatistics 3858 : Maximum Likelihood Estimators
Statistics 3858 : Maximum Likelihood Estimators 1 Method of Maximum Likelihood In this method we construct the so called likelihood function, that is L(θ) = L(θ; X 1, X 2,..., X n ) = f n (X 1, X 2,...,
More informationQ3) a) Explain the construction of np chart. b) Write a note on natural tolerance limits and specification limits.
(DMSTT 21) Total No. of Questions : 10] [Total No. of Pages : 02 M.Sc. DEGREE EXAMINATION, MAY 2017 Second Year STATISTICS Statistical Quality Control Time : 3 Hours Maximum Marks: 70 Answer any Five questions.
More informationNegative Multinomial Model and Cancer. Incidence
Generalized Linear Model under the Extended Negative Multinomial Model and Cancer Incidence S. Lahiri & Sunil K. Dhar Department of Mathematical Sciences, CAMS New Jersey Institute of Technology, Newar,
More informationChapter 1. Modeling Basics
Chapter 1. Modeling Basics What is a model? Model equation and probability distribution Types of model effects Writing models in matrix form Summary 1 What is a statistical model? A model is a mathematical
More information1 Uniform Distribution. 2 Gamma Distribution. 3 Inverse Gamma Distribution. 4 Multivariate Normal Distribution. 5 Multivariate Student-t Distribution
A Few Special Distributions Their Properties Econ 675 Iowa State University November 1 006 Justin L Tobias (ISU Distributional Catalog November 1 006 1 / 0 Special Distributions Their Associated Properties
More informationTABLE OF CONTENTS CHAPTER 1 COMBINATORIAL PROBABILITY 1
TABLE OF CONTENTS CHAPTER 1 COMBINATORIAL PROBABILITY 1 1.1 The Probability Model...1 1.2 Finite Discrete Models with Equally Likely Outcomes...5 1.2.1 Tree Diagrams...6 1.2.2 The Multiplication Principle...8
More informationChapter 10. Chapter 10. Multinomial Experiments and. Multinomial Experiments and Contingency Tables. Contingency Tables.
Chapter 10 Multinomial Experiments and Contingency Tables 1 Chapter 10 Multinomial Experiments and Contingency Tables 10-1 1 Overview 10-2 2 Multinomial Experiments: of-fitfit 10-3 3 Contingency Tables:
More informationCSC 411: Lecture 09: Naive Bayes
CSC 411: Lecture 09: Naive Bayes Class based on Raquel Urtasun & Rich Zemel s lectures Sanja Fidler University of Toronto Feb 8, 2015 Urtasun, Zemel, Fidler (UofT) CSC 411: 09-Naive Bayes Feb 8, 2015 1
More informationModeling the scale parameter ϕ A note on modeling correlation of binary responses Using marginal odds ratios to model association for binary responses
Outline Marginal model Examples of marginal model GEE1 Augmented GEE GEE1.5 GEE2 Modeling the scale parameter ϕ A note on modeling correlation of binary responses Using marginal odds ratios to model association
More informationSystem Monitoring with Real-Time Contrasts
System Monitoring with Real- Contrasts HOUTAO DENG Intuit, Mountain View, CA 94043, USA GEORGE RUNGER Arizona State University, Tempe, AZ 85287, USA EUGENE TUV Intel Corporation, Chandler, AZ 85226, USA
More informationContents. Preface to Second Edition Preface to First Edition Abbreviations PART I PRINCIPLES OF STATISTICAL THINKING AND ANALYSIS 1
Contents Preface to Second Edition Preface to First Edition Abbreviations xv xvii xix PART I PRINCIPLES OF STATISTICAL THINKING AND ANALYSIS 1 1 The Role of Statistical Methods in Modern Industry and Services
More informationBayesian Modeling of Conditional Distributions
Bayesian Modeling of Conditional Distributions John Geweke University of Iowa Indiana University Department of Economics February 27, 2007 Outline Motivation Model description Methods of inference Earnings
More informationGAUSSIAN PROCESS REGRESSION
GAUSSIAN PROCESS REGRESSION CSE 515T Spring 2015 1. BACKGROUND The kernel trick again... The Kernel Trick Consider again the linear regression model: y(x) = φ(x) w + ε, with prior p(w) = N (w; 0, Σ). The
More informationMath 423/533: The Main Theoretical Topics
Math 423/533: The Main Theoretical Topics Notation sample size n, data index i number of predictors, p (p = 2 for simple linear regression) y i : response for individual i x i = (x i1,..., x ip ) (1 p)
More informationInstitute of Actuaries of India
Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics For 2018 Examinations Subject CT3 Probability and Mathematical Statistics Core Technical Syllabus 1 June 2017 Aim The
More informationStatistical Process Control
Statistical Process Control What is a process? Inputs PROCESS Outputs A process can be described as a transformation of set of inputs into desired outputs. Types of Measures Measures where the metric is
More informationMonitoring and diagnosing a two-stage production process with attribute characteristics
Iranian Journal of Operations Research Vol., No.,, pp. -6 Monitoring and diagnosing a two-stage production process with attribute characteristics Downloaded from iors.ir at :6 +33 on Wednesday October
More informationMultiple Linear Regression
Multiple Linear Regression University of California, San Diego Instructor: Ery Arias-Castro http://math.ucsd.edu/~eariasca/teaching.html 1 / 42 Passenger car mileage Consider the carmpg dataset taken from
More information1. Density and properties Brief outline 2. Sampling from multivariate normal and MLE 3. Sampling distribution and large sample behavior of X and S 4.
Multivariate normal distribution Reading: AMSA: pages 149-200 Multivariate Analysis, Spring 2016 Institute of Statistics, National Chiao Tung University March 1, 2016 1. Density and properties Brief outline
More informationLinear Methods for Prediction
Chapter 5 Linear Methods for Prediction 5.1 Introduction We now revisit the classification problem and focus on linear methods. Since our prediction Ĝ(x) will always take values in the discrete set G we
More informationINFORMATION THEORY AND STATISTICS
INFORMATION THEORY AND STATISTICS Solomon Kullback DOVER PUBLICATIONS, INC. Mineola, New York Contents 1 DEFINITION OF INFORMATION 1 Introduction 1 2 Definition 3 3 Divergence 6 4 Examples 7 5 Problems...''.
More information