Research Projects. Hanxiang Peng. March 4, Department of Mathematical Sciences Indiana University-Purdue University at Indianapolis

Similar documents
Now consider the case where E(Y) = µ = Xβ and V (Y) = σ 2 G, where G is diagonal, but unknown.

The Distribution of Partially Exchangeable Random Variables

Copula Regression RAHUL A. PARSA DRAKE UNIVERSITY & STUART A. KLUGMAN SOCIETY OF ACTUARIES CASUALTY ACTUARIAL SOCIETY MAY 18,2011

The Distribution of Partially Exchangeable Random Variables

A Unified Approach for Analyzing Exchangeable Binary Data with Applications to Clinical and Developmental Toxicity Studies

Stat 5101 Lecture Notes

Gauge Plots. Gauge Plots JAPANESE BEETLE DATA MAXIMUM LIKELIHOOD FOR SPATIALLY CORRELATED DISCRETE DATA JAPANESE BEETLE DATA

Subject CS1 Actuarial Statistics 1 Core Principles

Figure 36: Respiratory infection versus time for the first 49 children.

,..., θ(2),..., θ(n)

Semiparametric Generalized Linear Models

Copulas. MOU Lili. December, 2014

Hierarchical Hurdle Models for Zero-In(De)flated Count Data of Complex Designs

Non-maximum likelihood estimation and statistical inference for linear and nonlinear mixed models

Modelling geoadditive survival data

Multilevel Statistical Models: 3 rd edition, 2003 Contents

Stat 542: Item Response Theory Modeling Using The Extended Rank Likelihood

Contents 1. Contents

TABLE OF CONTENTS CHAPTER 1 COMBINATORIAL PROBABILITY 1

Linear model A linear model assumes Y X N(µ(X),σ 2 I), And IE(Y X) = µ(x) = X β, 2/52

Using Estimating Equations for Spatially Correlated A

Motivational Example

STAT 302 Introduction to Probability Learning Outcomes. Textbook: A First Course in Probability by Sheldon Ross, 8 th ed.

Generating Spatial Correlated Binary Data Through a Copulas Method

Goodness-of-Fit Testing with Empirical Copulas

The impact of covariance misspecification in multivariate Gaussian mixtures on estimation and inference

Modeling Longitudinal Count Data with Excess Zeros and Time-Dependent Covariates: Application to Drug Use

Review. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 770: Categorical Data Analysis

Generalized Linear. Mixed Models. Methods and Applications. Modern Concepts, Walter W. Stroup. Texts in Statistical Science.

Confidence Distribution

FORECAST VERIFICATION OF EXTREMES: USE OF EXTREME VALUE THEORY

Latent Variable Models for Binary Data. Suppose that for a given vector of explanatory variables x, the latent

Dependence. Practitioner Course: Portfolio Optimization. John Dodson. September 10, Dependence. John Dodson. Outline.

Financial Econometrics and Volatility Models Copulas

Statistical Inference and Methods

The combined model: A tool for simulating correlated counts with overdispersion. George KALEMA Samuel IDDI Geert MOLENBERGHS

Marginal Specifications and a Gaussian Copula Estimation

Semiparametric Gaussian Copula Models: Progress and Problems

Parametric Modelling of Over-dispersed Count Data. Part III / MMath (Applied Statistics) 1

Multivariate Non-Normally Distributed Random Variables

Bayesian Inference in GLMs. Frequentists typically base inferences on MLEs, asymptotic confidence

Lecture 5 Models and methods for recurrent event data

A Conditional Approach to Modeling Multivariate Extremes

Overview of Extreme Value Theory. Dr. Sawsan Hilal space

SUPPLEMENTARY SIMULATIONS & FIGURES

The Mixture Approach for Simulating New Families of Bivariate Distributions with Specified Correlations

Institute of Actuaries of India

Bayesian non-parametric model to longitudinally predict churn

Model Selection for Semiparametric Bayesian Models with Application to Overdispersion

Analysing geoadditive regression data: a mixed model approach

Lattice Data. Tonglin Zhang. Spatial Statistics for Point and Lattice Data (Part III)

Probability and Estimation. Alan Moses

STA216: Generalized Linear Models. Lecture 1. Review and Introduction

Semiparametric Gaussian Copula Models: Progress and Problems

Index. Regression Models for Time Series Analysis. Benjamin Kedem, Konstantinos Fokianos Copyright John Wiley & Sons, Inc. ISBN.

University of Southampton Research Repository eprints Soton

Lecture 6: Gaussian Mixture Models (GMM)

Contents. Preface to Second Edition Preface to First Edition Abbreviations PART I PRINCIPLES OF STATISTICAL THINKING AND ANALYSIS 1

Probability Distribution And Density For Functional Random Variables

Introduction to Machine Learning

Fundamentals. CS 281A: Statistical Learning Theory. Yangqing Jia. August, Based on tutorial slides by Lester Mackey and Ariel Kleiner

Exam C Solutions Spring 2005

Model and Working Correlation Structure Selection in GEE Analyses of Longitudinal Data

Repeated ordinal measurements: a generalised estimating equation approach

Tail negative dependence and its applications for aggregate loss modeling

Conditional Least Squares and Copulae in Claims Reserving for a Single Line of Business

Math Review Sheet, Fall 2008

Robustness of a semiparametric estimator of a copula

Biost 518 Applied Biostatistics II. Purpose of Statistics. First Stage of Scientific Investigation. Further Stages of Scientific Investigation

Estimation of Copula Models with Discrete Margins (via Bayesian Data Augmentation) Michael S. Smith

Extreme Value Analysis and Spatial Extremes

ECE521 Tutorial 11. Topic Review. ECE521 Winter Credits to Alireza Makhzani, Alex Schwing, Rich Zemel and TAs for slides. ECE521 Tutorial 11 / 4

STAT J535: Introduction

Pattern Recognition and Machine Learning. Bishop Chapter 2: Probability Distributions

Simulating Realistic Ecological Count Data

Contents. Part I: Fundamentals of Bayesian Inference 1

Math 494: Mathematical Statistics

Chapter 6 Expectation and Conditional Expectation. Lectures Definition 6.1. Two random variables defined on a probability space are said to be

Modeling the scale parameter ϕ A note on modeling correlation of binary responses Using marginal odds ratios to model association for binary responses

Generalized Linear Models. Kurt Hornik

Charles E. McCulloch Biometrics Unit and Statistics Center Cornell University

A Fully Nonparametric Modeling Approach to. BNP Binary Regression

Bayesian estimation of the discrepancy with misspecified parametric models

Test of Association between Two Ordinal Variables while Adjusting for Covariates

Generalizing the MCPMod methodology beyond normal, independent data

PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 2: PROBABILITY DISTRIBUTIONS

Semiparametric Mixed Effects Models with Flexible Random Effects Distribution

Dependence. MFM Practitioner Module: Risk & Asset Allocation. John Dodson. September 11, Dependence. John Dodson. Outline.

On a simple construction of bivariate probability functions with fixed marginals 1

Statistical Methods for Handling Incomplete Data Chapter 2: Likelihood-based approach

An Overview of Methods in the Analysis of Dependent Ordered Categorical Data: Assumptions and Implications

Foundations of Nonparametric Bayesian Methods

On the Behavior of Marginal and Conditional Akaike Information Criteria in Linear Mixed Models

Logistic Regression and Generalized Linear Models

MA/ST 810 Mathematical-Statistical Modeling and Analysis of Complex Systems

Generalized Linear Models. Last time: Background & motivation for moving beyond linear

Step-Stress Models and Associated Inference

Some conditional extremes of a Markov chain

Recovering Copulae from Conditional Quantiles

Testing Goodness Of Fit Of The Geometric Distribution: An Application To Human Fecundability Data

Transcription:

Hanxiang Department of Mathematical Sciences Indiana University-Purdue University at Indianapolis March 4, 2009

Outline Project I: Free Knot Spline Cox Model

Project I: Free Knot Spline Cox Model Consider a parametric Cox model: h(t) = h 0 (t)exp(g θ (t, Z(t))), (1) where g θ is a smooth function. Our candidate of g θ is a free knot spline. i A quadratic free-knot polynomial spline with knots in time: g θ (t, z) = (β 0 + β 1 t + β 2 t 2 + β 2 (t γ) 2 +)z, where θ = [β 1, β 2, β 3, γ] is the parameter of interest. The knot γ can be a threshold value such as a changepoint.

ii A quadratic free-knot polynomial spline with knots in covariates: g θ (z) = β 1 z + β 2 z 2 + β 3 (z κ) 2 +, where θ = [β 1, β 2, β 3, κ] is the parameter of interest. The knot κ can be a threshold value such as a nadir (of BMI), a changepoint, etc.

ii A quadratic free-knot polynomial spline with knots in covariates: g θ (z) = β 1 z + β 2 z 2 + β 3 (z κ) 2 +, where θ = [β 1, β 2, β 3, κ] is the parameter of interest. The knot κ can be a threshold value such as a nadir (of BMI), a changepoint, etc. iii Applications? B-splines? Natural Splines?

iv g θ has first continuous derivative, g θ (z) = [z, z 2, (z κ) 2 1 {z>κ}, 2β 3 (z κ)1 {z>κ} ], but the 2nd derivative does not exist at knot κ.

iv g θ has first continuous derivative, g θ (z) = [z, z 2, (z κ) 2 1 {z>κ}, 2β 3 (z κ)1 {z>κ} ], but the 2nd derivative does not exist at knot κ. v Consistency? Asymptotic Normality? Under continuous first derivative, we have obtained.

iv g θ has first continuous derivative, g θ (z) = [z, z 2, (z κ) 2 1 {z>κ}, 2β 3 (z κ)1 {z>κ} ], but the 2nd derivative does not exist at knot κ. v Consistency? Asymptotic Normality? Under continuous first derivative, we have obtained. vi Consistency? Asymptotic Normality? Knots in covariates: g θ (z) = β 1 z + β 2 (z κ) +, g θ (z) = β(1 z κ 1 z<κ ). Knots in time: g θ (t, z) = (β 0 +β 1 t+β 2 (t γ) + )z, g θ (t, z) = β(1 t γ 1 t<γ )z,

Motivations Data consisting of groups of dependent binary random variables arise commonly in many fields including developmental toxicity studies, longitudinal studies, studies of familial diseases, cluster sample surveys and others.

Motivations Data consisting of groups of dependent binary random variables arise commonly in many fields including developmental toxicity studies, longitudinal studies, studies of familial diseases, cluster sample surveys and others. Independence assumption is not appropriate in many areas of science. E.g., in developmental toxicity study, fetuses from the same litter are correlated and may respond more similarly to a stimulus than fetuses from different litters. This intra-litter correlation causes over-dispersion (extra-binomial variation).

Motivations Data consisting of groups of dependent binary random variables arise commonly in many fields including developmental toxicity studies, longitudinal studies, studies of familial diseases, cluster sample surveys and others. Independence assumption is not appropriate in many areas of science. E.g., in developmental toxicity study, fetuses from the same litter are correlated and may respond more similarly to a stimulus than fetuses from different litters. This intra-litter correlation causes over-dispersion (extra-binomial variation). The perception of exchangeability, intensely studied over the past century, is meant to capture the notion of symmetry in a collection of random variables and is often used as an alternative to independence.

Disadvantages of Some Commonly Used Methods The statistical inference based on the conditional analysis does not use any information about the distribution of the latent effects since it is lost in the conditioning.

Disadvantages of Some Commonly Used Methods The statistical inference based on the conditional analysis does not use any information about the distribution of the latent effects since it is lost in the conditioning. The EM algorithm method is conceptually simple, but the computations may be formidable because each E-step in the computation may require a numerical integration.

Disadvantages of Some Commonly Used Methods The statistical inference based on the conditional analysis does not use any information about the distribution of the latent effects since it is lost in the conditioning. The EM algorithm method is conceptually simple, but the computations may be formidable because each E-step in the computation may require a numerical integration. The empirical Bayes approach may also be computationally difficult.

Disadvantages of Some Commonly Used Methods The statistical inference based on the conditional analysis does not use any information about the distribution of the latent effects since it is lost in the conditioning. The EM algorithm method is conceptually simple, but the computations may be formidable because each E-step in the computation may require a numerical integration. The empirical Bayes approach may also be computationally difficult. The quasi-likelihood and GEE approach (e.g. Liang, Qaqish and Zeger, 1992) only uses the first two moments, while higher order of correlation is approximated by a working matrix.

The Proposed Approach By relaxing independence to exchangeability, we introduce a rich family of parsimonious distributions resulted from completely monotonic functions, extending GLMs.

The Proposed Approach By relaxing independence to exchangeability, we introduce a rich family of parsimonious distributions resulted from completely monotonic functions, extending GLMs. We present a general framework that unifies the existing procedures such the beta-binomial, Williams procedures, and give many new distributions.

The Proposed Approach By relaxing independence to exchangeability, we introduce a rich family of parsimonious distributions resulted from completely monotonic functions, extending GLMs. We present a general framework that unifies the existing procedures such the beta-binomial, Williams procedures, and give many new distributions. Our approach uses all the distributional information including, of course, all order moments, so that it is a full likelihood approach.

The Proposed Approach By relaxing independence to exchangeability, we introduce a rich family of parsimonious distributions resulted from completely monotonic functions, extending GLMs. We present a general framework that unifies the existing procedures such the beta-binomial, Williams procedures, and give many new distributions. Our approach uses all the distributional information including, of course, all order moments, so that it is a full likelihood approach. The pmf s are simple both mathematically and computationally.

Exchangeable Binomial B 1,...,B m are exchangeable if for every {0, 1}-valued variables b 1,...,b m, P(B 1 = b 1,...,B m = b m ) = P(B π1 = b 1,...,B πm = b m ), (2) for every permutation π 1,..., π m of 1,...,m.

Exchangeable Binomial B 1,...,B m are exchangeable if for every {0, 1}-valued variables b 1,...,b m, P(B 1 = b 1,...,B m = b m ) = P(B π1 = b 1,...,B πm = b m ), (2) for every permutation π 1,..., π m of 1,...,m. Let Y = B 1 +... + B m. Then P(Y = y) = ( ) m y m ( ) m y ( 1) k λ y+k, y = 0, 1,...,m, y k k=0 where λ 0 = 1, λ k = P(B 1 = 1,...,B k = 1), 1 k m are the marginal probabilities.

Complete Monotonicity The marginal probabilities λ = {λ i : i = 0, 1,...,m} (λ 0 = 1) are completely monotone (CM): ( 1) k k λ i 0, i = 0, 1,...,m, k + i m, (3) where is the difference operator: λ i = λ i+1 λ i.

Complete Monotonicity The marginal probabilities λ = {λ i : i = 0, 1,...,m} (λ 0 = 1) are completely monotone (CM): ( 1) k k λ i 0, i = 0, 1,...,m, k + i m, (3) where is the difference operator: λ i = λ i+1 λ i. Write θ = (θ, ϑ), where θ is a parameter of interest, while ϑ is treated as a nuisance parameter. We model λ j = h j (β X; ϑ), j = 0, 1, 2,...,M.

Complete Monotonicity The marginal probabilities λ = {λ i : i = 0, 1,...,m} (λ 0 = 1) are completely monotone (CM): ( 1) k k λ i 0, i = 0, 1,...,m, k + i m, (3) where is the difference operator: λ i = λ i+1 λ i. Write θ = (θ, ϑ), where θ is a parameter of interest, while ϑ is treated as a nuisance parameter. We model λ j = h j (β X; ϑ), j = 0, 1, 2,...,M. Generalizing the GLMs: (1) CM links extend usual links; (2) exchangeable binomials extend binomials.

Table: The Complete Monotone Links. Name Link(θ = (θ 1, θ 2 )) Parameters Ind-Bin θ t θ (0, 1) MM-Bin θ/(θ + t) θ (0, ) Beta-Bin B(θ 1 + t, θ 2 )/B(θ 1, θ 2 ) θ (0, ) 2 Gamma-Bin (1 + θ 2 t) θ 1 θ (0, ) 2 Poisson-Bin exp(θ(e t 1)) θ (0, ) Normal-Bin 2 exp((σt) 2 /2)(1 Φ(σt)) σ 2 (0, )

Efficient Estimation in a Semiparametric Copula Model Sklar Theorem. Let H be a joint CDF with margins F and G. Then there exists a copula C such that: H(x, y) = C(F(x), G(x)), x, y R. If F and G are continuous, then C is unique. Parametric copulas: Archimedean copulas, Pareto copulas, Placket copulas, etc.

Efficient Estimation in a Semiparametric Copula Model Sklar Theorem. Let H be a joint CDF with margins F and G. Then there exists a copula C such that: H(x, y) = C(F(x), G(x)), x, y R. If F and G are continuous, then C is unique. Parametric copulas: Archimedean copulas, Pareto copulas, Placket copulas, etc. Suppose F = Fα and G = G β, while C is completely unspecified. Suppose we have bivariate data (X i, Y i ) s. How can we efficiently estimate C? Two issues: (1) How good can we estimate? (2) How to construct efficient estimate?