Calibration Estimation for Semiparametric Copula Models under Missing Data

Size: px
Start display at page:

Download "Calibration Estimation for Semiparametric Copula Models under Missing Data"

Transcription

1 Calibration Estimation for Semiparametric Copula Models under Missing Data Shigeyuki Hamori 1 Kaiji Motegi 1 Zheng Zhang 2 1 Kobe University 2 Renmin University of China Economics and Economic Growth Centre Seminar Series Nanyang Technological University January 24, 2018 Hamori, Motegi & Zhang (Kobe & RUC) Copula Models under Missing Data January 24, / 49

2 Introduction Copula Model -- Flexible modelling of multivariate distributions -- Popular tool for business and finance Missing Data We bridge them for the first time in the literature. Treatment Effect -- Common problem in all fields -- Deep literature in statistics -- Binary phenomenon (to observe or not to observe) -- Estimate unobserved outcome -- Central topic in econometrics Hamori, Motegi & Zhang (Kobe & RUC) Copula Models under Missing Data January 24, / 49

3 Introduction How to fit copula models when there are missing data? A naïve approach is listwise deletion (LD): 1 Keep individuals with all d components observed and discard all other individuals. 2 Treat the individuals with complete data in an equal way. i= 1 i= 2 i= 3 i= 4 i= N Y 11 Y 12 Y 13 Y 14 Y 1N Y 21 Y 22 Y 23 Y 24 Y 2N Hamori, Motegi & Zhang (Kobe & RUC) Copula Models under Missing Data January 24, / 49

4 Introduction LD leads to a consistent estimator for the copula parameter of interest if the missing mechanism is missing completely at random (MCAR). LD leads to an inconsistent estimator if the missing mechanism is missing at random (MAR). Under MAR, target variables Y i = [Y 1i,..., Y di ] T and their missing status are independent of each other given observed covariates X i = [X 1i,..., X mi ] T. LD treats individuals with complete data all equally, and it does not use the information of X i. That causes a large bias in the estimator under MAR. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models under Missing Data January 24, / 49

5 Introduction How to obtain a consistent estimator for the copula parameter when the missing mechanism is MAR? As is well known in the literature of missing data and average treatment effects, a key step is the estimation of propensity score function (i.e. conditional probability of observing data given covariates). Direct estimation of propensity score is notoriously challenging, whether it is done parametrically or nonparametrically. Parametric approaches are haunted by misspecification problems, while nonparametric approaches are haunted by the curse of dimensionality. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models under Missing Data January 24, / 49

6 Introduction Chan, Yam, and Zhang (2016) propose an alternative approach called calibration estimation in the literature of average treatment effects. The calibration estimator is derived by balancing covariates among treatment, control, and whole groups. It does not require a direct estimation of propensity score. We apply the calibration estimation to a missing data problem for the first time in the literature. Our calibration estimator for the copula parameter satisfies consistency and asymptotic normality under some assumptions including i.i.d. data and the MAR condition. We also derive a consistent estimator for the asymptotic covariance matrix. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models under Missing Data January 24, / 49

7 Introduction We perform Monte Carlo simulations. Our simulation design itself is a contribution to the literature, since we clear two conditions at the same time in a clever way: 1 The unconditional distribution of {Y i } should be characterized by a known and tractable copula. 2 The missing mechanism should be MAR. Our simulation results indicate that the calibration estimator is indeed consistent and sufficiently close to the true value when sample size is N 500. Other estimators that do not use X i suffer from large bias regardless of sample size N. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models under Missing Data January 24, / 49

8 Table of Contents 1) Introduction 2) Review of Copula Models and Missing Data 3) Set-up of Main Problem 4) Theory of Calibration Estimation 5) Monte Carlo Simulations 6) Conclusions Hamori, Motegi & Zhang (Kobe & RUC) Copula Models under Missing Data January 24, / 49

9 Review of Copula Models Suppose that there are N individuals and d components: Y i = [Y 1i, Y 2i,..., Y di ] T (i = 1,..., N). Suppose that we want to estimate the d-dimensional joint distribution of Y i. What can we do? There are potential problems about estimating the joint distribution directly. d = small d = large Parametric Misspecification Misspecification Parameter proliferation Nonparametric (Curse of dimensionality) Curse of dimensionality Hamori, Motegi & Zhang (Kobe & RUC) Copula Models under Missing Data January 24, / 49

10 Review of Copula Models Copula models accomplish flexible specification with a small number of parameters. Copula models follow a two-step procedure. Step 1: Model the marginal distribution of each component separately. Step 2: Combine the d marginal distributions to recover a joint distribution. Step 1 Parametric Nonparametric Nonparametric Step 2 Parametric Parametric Nonparametric Name of model Parametric copula model Semiparametric copula model (Target of our paper) Nonparametric copula model Hamori, Motegi & Zhang (Kobe & RUC) Copula Models under Missing Data January 24, / 49

11 Review of Copula Models The copula approach is justified by Sklar s (1959) theorem. Sklar s theorem ensures the existence of a unique copula function C : (0, 1) d (0, 1) that recovers a true joint distribution. Theorem (Sklar, 1959) Let {Y i } be i.i.d. random vectors with a joint distribution F : R d (0, 1). Assume that the marginal distribution of Y ji, written as F j : R (0, 1), is continuous for j {1,..., d}. Then, there exists a unique function C : (0, 1) d (0, 1) such that F (y 1,..., y d ) = C (F 1 (y 1 ),..., F d (y d )), or in terms of probability density functions, f(y 1,..., y d ) = c (f 1 (y 1 ),..., f d (y d )). Hamori, Motegi & Zhang (Kobe & RUC) Copula Models under Missing Data January 24, / 49

12 Review of Copula Models A well-known example of copula function: bivariate Clayton copula with parameter α > 0. Cumulative distribution function is C 2 (u 1, u 2 ; α) = (u α 1 + u α 2 1) 1 α, where u 1 = F 1 (y 1 ) (0, 1) and u 2 = F 2 (y 2 ) (0, 1) are marginal distribution functions of Y 1i and Y 2i, respectively. Probability density function is c 2 (u 1, u 2 ; α) = (1 + α)(u 1 u 2 ) α 1 (u α 1 + u α 2 1) 1 α 2. Kendall s rank correlation coefficient is τ = α/(α + 2). Larger α implies stronger association between Y 1i and Y 2i. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models under Missing Data January 24, / 49

13 Review of Copula Models u u u u1 1 PDF (α = or τ =.4) PDF (α = or τ =.7) Association is stronger in the lower tail than in the upper tail. Such an asymmetry matches many economic and financial phenomena (e.g. stock market contagion). Larger α implies stronger association between Y 1i and Y 2i. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models under Missing Data January 24, / 49

14 Review of Missing Data Suppose that {Y 1i,..., Y di } N i=1 are target variables. Define the missing indicator: { 1 if Y ji is observed, T ji = 0 if Y ji is missing. Suppose that {X 1i,..., X mi } N i=1 are observable covariates. There are three well-known layers of missing mechanism. 1 Missing Completely at Random (MCAR). 2 Missing at Random (MAR). 3 Missing Not at Random (MNAR). Hamori, Motegi & Zhang (Kobe & RUC) Copula Models under Missing Data January 24, / 49

15 Review of Missing Data Each concept is defined as follows. 1 Missing Completely at Random (MCAR): {T 1i,..., T di } {Y 1i,..., Y di }. 2 Missing at Random (MAR): {T 1i,..., T di } {Y 1i,..., Y di } {X 1i,..., X mi }. 3 Missing Not at Random (MNAR): {T 1i,..., T di } {Y 1i,..., Y di } {X 1i,..., X mi }. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models under Missing Data January 24, / 49

16 Review of Missing Data An illustrative example on health survey (d = m = 1): Y 1i = weight of individual i; X 1i = I(Individual i is female). MCAR requires that P[Weight of individual i is missing] should be independent of both weight and gender of individual i. MAR requires that: P[Weight of a man is missing] should be independent of his weight. P[Weight of a woman is missing] should be independent of her weight. MNAR allows for the following situations: P[Weight of a man is missing] depends on his weight. P[Weight of a woman is missing] depends on her weight. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models under Missing Data January 24, / 49

17 Review of Missing Data Probably too restrictive, since men and women may have different willingness to report their weights. MCAR MNAR MAR More plausible than MCAR, since MAR controls for gender. MAR may be still restrictive, since men (or women) with different weights may have different willingness to report their weights. Most general assumption, but hard to handle technically. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models under Missing Data January 24, / 49

18 Review of Missing Data It is well known that listwise deletion (LD) leads to consistent inference under MCAR. It is also well known that LD leads to inconsistent inference under MAR. Correct inference under MAR has been extensively studied since the semnal work of Rubin (1976). The present paper assumes MAR and elaborates the estimation of semiparametric copula models, which has not been done in the literature. MNAR is a relatively new and challenging topic. See Ai, Linton, and Zhang (2018) and references therein for recent contributions. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models under Missing Data January 24, / 49

19 Set-up of Main Problem Define propensity score functions: π j (x) = P(T ji = 1 X i = x), j {1,..., d}, η(x) = P(T 1i = 1,..., T di = 1 X i = x). The true copula parameter θ 0 is chracterized as follows. θ 0 = arg max θ Θ E [log c(f 1(Y 1i ),..., F d (Y di ); θ)]. Use the MAR condition and the law of iterated expectations: θ 0 = arg max θ Θ E [I(T 1i = 1,..., T di = 1)q(X i ) log c(f 1 (Y 1i ),..., F d (Y di ); θ)], where q(x i ) η(x i ) 1. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models under Missing Data January 24, / 49

20 Set-up of Main Problem A natural procedure of estimating θ 0 is as follows. Step 1: Estimate the marginal distributions {F 1,..., F d }. Step 2: Estimate q(x i ) (i.e. the inverse of propensity score) and then compute 1 ˆθ = arg max θ Θ N N I(T 1i = 1,..., T di = 1)ˆq(X i ) log c( ˆF 1 (Y 1i ),..., ˆF d (Y di ); θ). i=1 Calibration estimation shall be used for both { ˆF 1,..., ˆF d } and ˆq(X i ). Hamori, Motegi & Zhang (Kobe & RUC) Copula Models under Missing Data January 24, / 49

21 Set-up of Main Problem Use the MAR condition and LIE to get F j (y) = P(Y ji y) = E[I(Y ji y)] = E[I(T ji = 1)p j (X i )I(Y ji y)], where p j (x) π j (x) 1 = [P(T ji = 1 X i = x)] 1. Horvitz and Thompson s (1952) inverse probability weighting (IPW) estimator for F j is written as F j (y) = 1 N N I(T ji = 1)p j (X i )I(Y ji y). i=1 If p j (x) were known, then the IPW estimator could be computed straightforwardly. p j (x) is unknown in reality. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models under Missing Data January 24, / 49

22 Set-up of Main Problem Estimation of propensity score has been a major issue in the literature of missing data and treatment effects. Many papers attempt a direct estimation of p j (x), either parametrically or nonparametrically. Parametric approaches: Zhao and Lipsitz (1992), Robins, Rotnitzky, and Zhao (1994), and Bang and Robins (2005). The parametric approaches are notoriouly sensitive to misspecification (cf. Lawless, Kalbfleisch, and Wild, 1999). Nonparametric approaches: Hahn (1998), Hirano, Imbens, and Ridder (2003), Imbens, Newey, and Ridder (2005), and Chen, Hong, and Tarozzi (2008). The nonparametric approaches have a notoriously poor performance in small sample (the curse of dimensionality). Hamori, Motegi & Zhang (Kobe & RUC) Copula Models under Missing Data January 24, / 49

23 Theory of Calibration Estimation In the treatment effect literature, Chan, Yam, and Zhang (2016) propose an alternative approach that bypasses a direct estimation of propensity score. They construct calibration weights by balancing the moments of observed covariates among treatment, control, and whole groups. The present paper applies their method to a missing data problem for the first time. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models under Missing Data January 24, / 49

24 Theory of Calibration Estimation Under MAR, the moment matching condition holds: E [I(T ji = 1)p j (X i )u K (X i )] = E[u K (X i )], j {1,... d} for any integrable function u K : R m R K. A sample counterpart with a generic weight is written as N I(T ji = 1)p ji u K (X i ) = 1 N i=1 N u K (X i ). i=1 There are multiple values of {p j1,..., p jn } that satisfy the moment matching condition. Among them, we choose the one closest to a uniform weight given some distance measure. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models under Missing Data January 24, / 49

25 Theory of Calibration Estimation Why do we want the uniform weight? 1 If there are no missing data, then the uniform weight leads to a natural estimator ˆF j (y) = (N + 1) 1 N i=1 I(Y ji y). 2 It is well known that volatile weights cause instability in the Horvitz-Thompson estimator. Primal problem is a constrained optimization problem: minimize the distance s.t. the moment matching condition. Dual problem is written as an unconstrained optimization problem. To express the dual problem, let ρ : R R be any strictly concave function. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models under Missing Data January 24, / 49

26 Theory of Calibration Estimation Define a concave objective function: G jk (λ) = 1 N Compute Compute N [ I(Tji = 1)ρ(λ T u K (X i )) λ T u K (X i ) ], λ R K. i=1 ˆλ jk = arg max λ G jk(λ). ˆp jk (X i ) = 1 N ρ (ˆλ T jku K (X i )). Estimate the marginal distribution of the j-th component by N ˆF j (y) = I(T ji = 1)ˆp jk (X i )I(Y ji y). i=1 Hamori, Motegi & Zhang (Kobe & RUC) Copula Models under Missing Data January 24, / 49

27 Theory of Calibration Estimation A common choice of u K (X i ) is a vector of polynomials, say u K (X i ) = [1, X i, X 2 i, X 3 i ] T (m = 1, K = 4). Arbitrariness of ρ arises from the arbitrariness of the distance measure. Functional forms often used in the nonparametric literature include: Exponential Tilting: ρ(v) = exp( v). Empirical Likelihood: ρ(v) = log(1 + v). Quadratic: ρ(v) = 0.5(1 v) 2. Inverse Logistic: ρ(v) = v exp( v) Hamori, Motegi & Zhang (Kobe & RUC) Copula Models under Missing Data January 24, / 49

28 Theory of Calibration Estimation The second step of likelihood maximization can be handled analogously. Under MAR, the moment matching condition holds: where E [I(T 1i = 1,..., T di = 1)q(X i )u K (X i )] = E[u K (X i )], q(x i ) = η(x i ) 1 = [P(T 1i = 1,..., T di = 1 X i = x)] 1. Find calibration weights that satisfy the moment matching condition and are closest to the uniform weight given some distance measure. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models under Missing Data January 24, / 49

29 Theory of Calibration Estimation Define a concave objective function: H K (β) = 1 N Compute N I(T 1i = 1,..., T di = 1)ρ ( β T u K (X i ) ) 1 N i=1 ˆβ K = arg max β H K (β). Compute the calibration weight for the copula function: ˆq K (X i ) = 1 N ρ ( ˆβ T Ku K (X i )). Compute the maximum likelihood estimator ˆθ by N β T u K (X i ). i=1 max θ Θ N i=1 ( I(T 1i = 1,..., T di = 1)ˆq K (X i ) log c d ˆF1 (Y 1i ),..., ˆF ) d (Y di ); θ. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models under Missing Data January 24, / 49

30 Theory of Calibration Estimation Theorem (Consistency and Asymptotic Normality) Impose a set of assumptions including what follows: The missing mechanism is missing at random (MAR). {Y i, T i, X i } are i.i.d. across individuals i {1,..., N}. π 1 ( ),..., π d ( ), and η( ) are s-times continuously differentiable with sufficiently large s. K(N) as N, and the rate of divergence is sufficiently slow. Then, consistency and asymptotic normality follow. p 1 ˆθ θ0 as N. 2 N( ˆθ θ0 ) d N(0, V ) as N. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models under Missing Data January 24, / 49

31 Theory of Calibration Estimation The asymptotic covariance matrix V is expressed as V = B 1 ΣB 1. We can construct consistent estimators for B and Σ, and hence ˆV = ˆB 1 ˆΣ ˆB 1 p V. See the main paper for a complete set of assumptions, proofs of the consistency and asymptotic normality, and the construction of V and ˆV. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models under Missing Data January 24, / 49

32 Monte Carlo Simulations: DGP Target variables are Y i = [Y 1i, Y 2i ] T (d = 2). Consider a scalar covariate X i. Define U i = [U 1i, U 2i, U 3i ] T = [F 1 (Y 1i ), F 2 (Y 2i ), F X (X i )] T. F 1 ( ) is the marginal distribution of Y 1i, and we use χ F 2 ( ) is the marginal distribution of Y 2i, and we use χ F X ( ) is the marginal distribution of X i, and we use N(0, 1). The inverse distribution functions F 1 1 ( ), F 1 2 ( ), and F 1 X ( ) are known and tractable. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models under Missing Data January 24, / 49

33 Monte Carlo Simulations: DGP Step 1: Draw U i i.i.d. Clayton 3 (α 0 ) with α 0 = Step 2: Recover Y 1i = F1 1 (U 1i ), Y 2i = F2 1 (U 2i ), and X i = F 1 X (U 3i). Step 3: Assume that {Y 11,..., Y 1N } are all observed. Make some of {Y 21,..., Y 2N } missing according to P (T 2i = 1 X i = x i ) = (1 + exp[a + bx i ]) 1, where (a, b) are to be chosen carefully below. Step 4: Repeat Steps 1-3 J = 1000 times with sample size N {250, 500, 1000}. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models under Missing Data January 24, / 49

34 Monte Carlo Simulations: DGP Consider the unconditional probability of observing Y 2i : µ E[T 2i ] = E[E[T 2i X i = x i ]] = E[P(T 1i = 1 X i = x i )] = E [ (1 + exp[a + bx i ]) 1]. A larger µ implies less missing data on average. We can draw a contour plot of µ given (a, b) b a Hamori, Motegi & Zhang (Kobe & RUC) Copula Models under Missing Data January 24, / 49

35 Monte Carlo Simulations: DGP Case A: (a, b) = ( 1.385, 0.000) = MCAR with µ = 0.8. Case B: (a, b) = ( 1.430, 0.400) = MAR with µ = 0.8. Case C: (a, b) = ( 0.405, 0.000) = MCAR with µ = 0.6. Case D: (a, b) = ( 0.420, 0.400) = MAR with µ = 0.6. Case D is the most challenging case since the missing mechanism is MAR and there are relatively many missing data. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models under Missing Data January 24, / 49

36 Monte Carlo Simulations: Estimation Approach #1: Semiparametric estimation with listwise deletion (LD). Step 1: Estimate the marginal distribution of the j-th component by ˆF j (y) = 1 N + 1 N I(T 1i = 1, T 2i = 1)I(Y ji < y), i=1 where N = N i=1 I(T 1i = 1, T 2i = 1) is the number of individuals with complete data. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models under Missing Data January 24, / 49

37 Monte Carlo Simulations: Estimation Step 2: Compute the maximum likelihood estimator ˆα by max α (0, ) N i=1 ( I(T 1i = 1, T 2i = 1) log c 2 ˆF1 (Y 1i ), ˆF ) 2 (Y 2i ); α, where c 2 (u 1, u 2 ; α) = (1 + α)(u 1 u 2 ) α 1 (u α 1 + u α 2 1) 1 α 2 is the probability density function of Clayton 2 (α). Hamori, Motegi & Zhang (Kobe & RUC) Copula Models under Missing Data January 24, / 49

38 Monte Carlo Simulations: Estimation Approach #2: Semiparametric estimation (SP). Step 1: Estimate the marginal distribution of the j-th component by ˆF j (y) = 1 N j + 1 N I(T ji = 1)I(Y ji < y), i=1 where N j = N i=1 I(T ji = 1) is the number of individuals with the j-th component observed. Step 2: Same as the LD approach. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models under Missing Data January 24, / 49

39 Monte Carlo Simulations: Estimation Approach #3: Calibration estimation (CE). Step 1: Estimate the marginal distribution of the j-th component by ˆF j (y) = N I(T ji = 1)ˆp jk (X i )I(Y ji < y), i=1 where u K (X i ) = [1, X i, X 2 i, X 3 i ] (i.e. K = 4). Step 2: Compute the maximum likelihood estimator ˆα by max α (0, ) N i=1 ( I(T 1i = 1, T 2i = 1)ˆq K (X i ) log c 2 ˆF1 (Y 1i ), ˆF ) 2 (Y 2i ); α. Calibration weights are computed with Exponential Tilting: ρ(v) = exp( v). Hamori, Motegi & Zhang (Kobe & RUC) Copula Models under Missing Data January 24, / 49

40 Monte Carlo Simulations: Estimation Summary of Three Approaches Use all Y? Use X? Step 1 Step 2 #1. LD No No Nonparametric Parametric #2. SP Yes No Nonparametric Parametric #3. CE Yes Yes Nonparametric Parametric Hamori, Motegi & Zhang (Kobe & RUC) Copula Models under Missing Data January 24, / 49

41 Monte Carlo Simulations: Estimation Our simulation design itself is a contribution to the literature. We draw U i = [F 1 (Y 1i ), F 2 (Y 2i ), F X (X i )] T i.i.d. Clayton 3 (α 0 ), which clears two conditions simultaneously: 1 Y i and X i should be associated with each other since we are interested in MAR. 2 The unconditional bivariate distribution of Y i should be characterized by a tractable copula with a known parameter (Clayton 2 (α 0 ) here). Gumbel copula and Frank copula have the same property as Clayton in terms of marginal distributions. To our knowledge, the present simulation design is the only way of clearing the two conditions. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models under Missing Data January 24, / 49

42 Monte Carlo Simulations: Results Bias of #1. Listwise Deletion Case A Case B Case C Case D MCAR MAR MCAR MAR µ = 0.8 µ = 0.8 µ = 0.6 µ = 0.6 N = N = N = LD estimator is inconsistent under MAR. Bias under MAR gets larger as missing probability gets larger. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models under Missing Data January 24, / 49

43 Monte Carlo Simulations: Results Bias of #2. Semiparametric Estimation Case A Case B Case C Case D MCAR MAR MCAR MAR µ = 0.8 µ = 0.8 µ = 0.6 µ = 0.6 N = N = N = SP estimator is inconsistent under MAR. Bias under MAR gets larger as missing probability gets larger. Removing bias requires exploiting the information of X i. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models under Missing Data January 24, / 49

44 Monte Carlo Simulations: Results Bias of #3. Calibration Estimation Case A Case B Case C Case D MCAR MAR MCAR MAR µ = 0.8 µ = 0.8 µ = 0.6 µ = 0.6 N = N = N = Calibration estimator is consistent under MAR. Finite sample bias is sufficiently small when N 500. The consistency stems from the proper usage of X i. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models under Missing Data January 24, / 49

45 Conclusions We investigate the estimation of semiparametric copula models under missing data for the first time in the literature. There is analogy between missing data and average treatment effects since observing or not observing data is a binary phenomenon. Chan, Yam, and Zhang (2016) propose the calibration estimation for average treatment effects. We apply the calibration estimation to missing data for the first time in the literature. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models under Missing Data January 24, / 49

46 Conclusions Our calibration estimator for the copula parameter satisfies consistency and asymptotic normality under some assumptions including i.i.d. data and the missing at random (MAR) mechanism. We also derive a consistent estimator for the asymptotic covariance matrix. We design Monte Carlo simulations in a clever way that MAR holds and the true value of parameter is well defined. The simulation results indicate that the calibration estimator is indeed consistent, and it is sufficiently close to the true value when sample size is N 500. Existing estimators which do not exploit the information of covariates suffer from substantial bias for any sample size. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models under Missing Data January 24, / 49

47 References Ai, C., O. Linton, and Z. Zhang (2018). A Simple and Efficient Estimation Method for Models with Nonignorable Missing Data. Working paper at the University of Florida. Bang, H. and J. M. Robins (2005). Doubly Robust Estimation in Missing Data and Causal Inference Models. Biometrics, 61, Chan, K. C. G., S. C. P. Yam, and Z. Zhang (2016). Globally Efficient Non-Parametric Inference of Average Treatment Effects by Empirical Balancing Calibration Weighting. Journal of the Royal Statistical Society, Series B, 78, Chen, X., H. Hong, and A. Tarozzi (2008). Semiparametric Efficiency in GMM Models with Auxiliary Data. The Annals of Statistics, 36, Hahn, J. (1998). On the Role of the Propensity Score in Efficient Semiparametric Estimation of Average Treatment Effects. Econometrica, 66, Hamori, Motegi & Zhang (Kobe & RUC) Copula Models under Missing Data January 24, / 49

48 References Hirano, K., G. W. Imbens, and G. Ridder (2003). Efficient Estimation of Average Treatment Effects Using the Estimated Propensity Score. Econometrica, 71, Horvitz, D. G. and D. J. Thompson (1952). A Generalization of Sampling without Replacement from a Finite Universe. Journal of the American Statistical Association, 47, Imbens, G. W., W. K. Newey, and G. Ridder (2005). Mean-square-error Calculations for Average Treatment Effects. IEPR Working Paper No Lawless, J. F., J. D. Kalbfleisch, and C. J. Wild (1999). Semiparametric Methods for Response-Selective and Missing Data Problems in Regression. Journal of the Royal Statistical Society, Series B, 61, Hamori, Motegi & Zhang (Kobe & RUC) Copula Models under Missing Data January 24, / 49

49 References Robins, J. M., A. Rotnitzky, and L. P. Zhao (1994). Estimation of Regression Coefficients When Some Regressors are not Always Observed. Journal of the American Statistical Association, 89, Rubin, D. B. (1976). Inference and Missing Data. Biometrika, 63, Sklar, A. (1959). Fonctions de répartition à n dimensions et leurs marges. Publications de l Institut de Statistique de L Université de Paris, 8, Zhao, L. P. and S. Lipsitz (1992). Designs and Analysis of Two-Stage Studies. Statistics in Medicine, 11, Hamori, Motegi & Zhang (Kobe & RUC) Copula Models under Missing Data January 24, / 49

Calibration Estimation of Semiparametric Copula Models with Data Missing at Random

Calibration Estimation of Semiparametric Copula Models with Data Missing at Random Calibration Estimation of Semiparametric Copula Models with Data Missing at Random Shigeyuki Hamori 1 Kaiji Motegi 1 Zheng Zhang 2 1 Kobe University 2 Renmin University of China Institute of Statistics

More information

Calibration Estimation of Semiparametric Copula Models with Data Missing at Random

Calibration Estimation of Semiparametric Copula Models with Data Missing at Random Calibration Estimation of Semiparametric Copula Models with Data Missing at Random Shigeyuki Hamori 1 Kaiji Motegi 1 Zheng Zhang 2 1 Kobe University 2 Renmin University of China Econometrics Workshop UNC

More information

Calibration Estimation of Semiparametric Copula Models with Data Missing at Random

Calibration Estimation of Semiparametric Copula Models with Data Missing at Random Calibration Estimation of Semiparametric Copula Models with Data Missing at Random Shigeyuki Hamori Kaiji Motegi Zheng Zhang March 28, 2018 Abstract This paper investigates the estimation of semiparametric

More information

Calibration Estimation of Semiparametric Copula Models with Data Missing at Random

Calibration Estimation of Semiparametric Copula Models with Data Missing at Random Calibration Estimation of Semiparametric Copula Models with Data Missing at Random Shigeyuki Hamori Kaiji Motegi Zheng Zhang August 30, 208 Abstract This paper investigates the estimation of semiparametric

More information

Flexible Estimation of Treatment Effect Parameters

Flexible Estimation of Treatment Effect Parameters Flexible Estimation of Treatment Effect Parameters Thomas MaCurdy a and Xiaohong Chen b and Han Hong c Introduction Many empirical studies of program evaluations are complicated by the presence of both

More information

Modification and Improvement of Empirical Likelihood for Missing Response Problem

Modification and Improvement of Empirical Likelihood for Missing Response Problem UW Biostatistics Working Paper Series 12-30-2010 Modification and Improvement of Empirical Likelihood for Missing Response Problem Kwun Chuen Gary Chan University of Washington - Seattle Campus, kcgchan@u.washington.edu

More information

Robustness to Parametric Assumptions in Missing Data Models

Robustness to Parametric Assumptions in Missing Data Models Robustness to Parametric Assumptions in Missing Data Models Bryan Graham NYU Keisuke Hirano University of Arizona April 2011 Motivation Motivation We consider the classic missing data problem. In practice

More information

Covariate Balancing Propensity Score for General Treatment Regimes

Covariate Balancing Propensity Score for General Treatment Regimes Covariate Balancing Propensity Score for General Treatment Regimes Kosuke Imai Princeton University October 14, 2014 Talk at the Department of Psychiatry, Columbia University Joint work with Christian

More information

A Goodness-of-fit Test for Copulas

A Goodness-of-fit Test for Copulas A Goodness-of-fit Test for Copulas Artem Prokhorov August 2008 Abstract A new goodness-of-fit test for copulas is proposed. It is based on restrictions on certain elements of the information matrix and

More information

Robustness of a semiparametric estimator of a copula

Robustness of a semiparametric estimator of a copula Robustness of a semiparametric estimator of a copula Gunky Kim a, Mervyn J. Silvapulle b and Paramsothy Silvapulle c a Department of Econometrics and Business Statistics, Monash University, c Caulfield

More information

LIKELIHOOD RATIO INFERENCE FOR MISSING DATA MODELS

LIKELIHOOD RATIO INFERENCE FOR MISSING DATA MODELS LIKELIHOOD RATIO IFERECE FOR MISSIG DATA MODELS KARU ADUSUMILLI AD TAISUKE OTSU Abstract. Missing or incomplete outcome data is a ubiquitous problem in biomedical and social sciences. Under the missing

More information

The Value of Knowing the Propensity Score for Estimating Average Treatment Effects

The Value of Knowing the Propensity Score for Estimating Average Treatment Effects DISCUSSION PAPER SERIES IZA DP No. 9989 The Value of Knowing the Propensity Score for Estimating Average Treatment Effects Christoph Rothe June 2016 Forschungsinstitut zur Zukunft der Arbeit Institute

More information

Primal-dual Covariate Balance and Minimal Double Robustness via Entropy Balancing

Primal-dual Covariate Balance and Minimal Double Robustness via Entropy Balancing Primal-dual Covariate Balance and Minimal Double Robustness via (Joint work with Daniel Percival) Department of Statistics, Stanford University JSM, August 9, 2015 Outline 1 2 3 1/18 Setting Rubin s causal

More information

An Efficient Estimation Method for Longitudinal Surveys with Monotone Missing Data

An Efficient Estimation Method for Longitudinal Surveys with Monotone Missing Data An Efficient Estimation Method for Longitudinal Surveys with Monotone Missing Data Jae-Kwang Kim 1 Iowa State University June 28, 2012 1 Joint work with Dr. Ming Zhou (when he was a PhD student at ISU)

More information

Spring 2017 Econ 574 Roger Koenker. Lecture 14 GEE-GMM

Spring 2017 Econ 574 Roger Koenker. Lecture 14 GEE-GMM University of Illinois Department of Economics Spring 2017 Econ 574 Roger Koenker Lecture 14 GEE-GMM Throughout the course we have emphasized methods of estimation and inference based on the principle

More information

Financial Econometrics and Volatility Models Copulas

Financial Econometrics and Volatility Models Copulas Financial Econometrics and Volatility Models Copulas Eric Zivot Updated: May 10, 2010 Reading MFTS, chapter 19 FMUND, chapters 6 and 7 Introduction Capturing co-movement between financial asset returns

More information

Weighting Methods. Harvard University STAT186/GOV2002 CAUSAL INFERENCE. Fall Kosuke Imai

Weighting Methods. Harvard University STAT186/GOV2002 CAUSAL INFERENCE. Fall Kosuke Imai Weighting Methods Kosuke Imai Harvard University STAT186/GOV2002 CAUSAL INFERENCE Fall 2018 Kosuke Imai (Harvard) Weighting Methods Stat186/Gov2002 Fall 2018 1 / 13 Motivation Matching methods for improving

More information

ANALYSIS OF ORDINAL SURVEY RESPONSES WITH DON T KNOW

ANALYSIS OF ORDINAL SURVEY RESPONSES WITH DON T KNOW SSC Annual Meeting, June 2015 Proceedings of the Survey Methods Section ANALYSIS OF ORDINAL SURVEY RESPONSES WITH DON T KNOW Xichen She and Changbao Wu 1 ABSTRACT Ordinal responses are frequently involved

More information

What s New in Econometrics. Lecture 1

What s New in Econometrics. Lecture 1 What s New in Econometrics Lecture 1 Estimation of Average Treatment Effects Under Unconfoundedness Guido Imbens NBER Summer Institute, 2007 Outline 1. Introduction 2. Potential Outcomes 3. Estimands and

More information

Combining multiple observational data sources to estimate causal eects

Combining multiple observational data sources to estimate causal eects Department of Statistics, North Carolina State University Combining multiple observational data sources to estimate causal eects Shu Yang* syang24@ncsuedu Joint work with Peng Ding UC Berkeley May 23,

More information

A Course in Applied Econometrics Lecture 18: Missing Data. Jeff Wooldridge IRP Lectures, UW Madison, August Linear model with IVs: y i x i u i,

A Course in Applied Econometrics Lecture 18: Missing Data. Jeff Wooldridge IRP Lectures, UW Madison, August Linear model with IVs: y i x i u i, A Course in Applied Econometrics Lecture 18: Missing Data Jeff Wooldridge IRP Lectures, UW Madison, August 2008 1. When Can Missing Data be Ignored? 2. Inverse Probability Weighting 3. Imputation 4. Heckman-Type

More information

Using copulas to model time dependence in stochastic frontier models

Using copulas to model time dependence in stochastic frontier models Using copulas to model time dependence in stochastic frontier models Christine Amsler Michigan State University Artem Prokhorov Concordia University November 2008 Peter Schmidt Michigan State University

More information

A Bayesian Nonparametric Approach to Monotone Missing Data in Longitudinal Studies with Informative Missingness

A Bayesian Nonparametric Approach to Monotone Missing Data in Longitudinal Studies with Informative Missingness A Bayesian Nonparametric Approach to Monotone Missing Data in Longitudinal Studies with Informative Missingness A. Linero and M. Daniels UF, UT-Austin SRC 2014, Galveston, TX 1 Background 2 Working model

More information

ECON 3150/4150, Spring term Lecture 6

ECON 3150/4150, Spring term Lecture 6 ECON 3150/4150, Spring term 2013. Lecture 6 Review of theoretical statistics for econometric modelling (II) Ragnar Nymoen University of Oslo 31 January 2013 1 / 25 References to Lecture 3 and 6 Lecture

More information

Fractional Hot Deck Imputation for Robust Inference Under Item Nonresponse in Survey Sampling

Fractional Hot Deck Imputation for Robust Inference Under Item Nonresponse in Survey Sampling Fractional Hot Deck Imputation for Robust Inference Under Item Nonresponse in Survey Sampling Jae-Kwang Kim 1 Iowa State University June 26, 2013 1 Joint work with Shu Yang Introduction 1 Introduction

More information

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A. 1. Let P be a probability measure on a collection of sets A. (a) For each n N, let H n be a set in A such that H n H n+1. Show that P (H n ) monotonically converges to P ( k=1 H k) as n. (b) For each n

More information

Discussion of Identifiability and Estimation of Causal Effects in Randomized. Trials with Noncompliance and Completely Non-ignorable Missing Data

Discussion of Identifiability and Estimation of Causal Effects in Randomized. Trials with Noncompliance and Completely Non-ignorable Missing Data Biometrics 000, 000 000 DOI: 000 000 0000 Discussion of Identifiability and Estimation of Causal Effects in Randomized Trials with Noncompliance and Completely Non-ignorable Missing Data Dylan S. Small

More information

Estimation of the Conditional Variance in Paired Experiments

Estimation of the Conditional Variance in Paired Experiments Estimation of the Conditional Variance in Paired Experiments Alberto Abadie & Guido W. Imbens Harvard University and BER June 008 Abstract In paired randomized experiments units are grouped in pairs, often

More information

Recent Advances in the analysis of missing data with non-ignorable missingness

Recent Advances in the analysis of missing data with non-ignorable missingness Recent Advances in the analysis of missing data with non-ignorable missingness Jae-Kwang Kim Department of Statistics, Iowa State University July 4th, 2014 1 Introduction 2 Full likelihood-based ML estimation

More information

Causal Inference Lecture Notes: Selection Bias in Observational Studies

Causal Inference Lecture Notes: Selection Bias in Observational Studies Causal Inference Lecture Notes: Selection Bias in Observational Studies Kosuke Imai Department of Politics Princeton University April 7, 2008 So far, we have studied how to analyze randomized experiments.

More information

Data Integration for Big Data Analysis for finite population inference

Data Integration for Big Data Analysis for finite population inference for Big Data Analysis for finite population inference Jae-kwang Kim ISU January 23, 2018 1 / 36 What is big data? 2 / 36 Data do not speak for themselves Knowledge Reproducibility Information Intepretation

More information

A measure of radial asymmetry for bivariate copulas based on Sobolev norm

A measure of radial asymmetry for bivariate copulas based on Sobolev norm A measure of radial asymmetry for bivariate copulas based on Sobolev norm Ahmad Alikhani-Vafa Ali Dolati Abstract The modified Sobolev norm is used to construct an index for measuring the degree of radial

More information

Imputation Algorithm Using Copulas

Imputation Algorithm Using Copulas Metodološki zvezki, Vol. 3, No. 1, 2006, 109-120 Imputation Algorithm Using Copulas Ene Käärik 1 Abstract In this paper the author demonstrates how the copulas approach can be used to find algorithms for

More information

Copula-Based Random Effects Models

Copula-Based Random Effects Models Copula-Based Random Effects Models Santiago Pereda-Fernández Banca d Italia September 7, 25 PRELIMINARY, INCOMPLETE, PLEASE DO NOT CITE Abstract In a binary choice panel data framework, standard estimators

More information

Statistical Methods for Handling Incomplete Data Chapter 2: Likelihood-based approach

Statistical Methods for Handling Incomplete Data Chapter 2: Likelihood-based approach Statistical Methods for Handling Incomplete Data Chapter 2: Likelihood-based approach Jae-Kwang Kim Department of Statistics, Iowa State University Outline 1 Introduction 2 Observed likelihood 3 Mean Score

More information

Copulas. MOU Lili. December, 2014

Copulas. MOU Lili. December, 2014 Copulas MOU Lili December, 2014 Outline Preliminary Introduction Formal Definition Copula Functions Estimating the Parameters Example Conclusion and Discussion Preliminary MOU Lili SEKE Team 3/30 Probability

More information

Marginal Specifications and a Gaussian Copula Estimation

Marginal Specifications and a Gaussian Copula Estimation Marginal Specifications and a Gaussian Copula Estimation Kazim Azam Abstract Multivariate analysis involving random variables of different type like count, continuous or mixture of both is frequently required

More information

Lasso Maximum Likelihood Estimation of Parametric Models with Singular Information Matrices

Lasso Maximum Likelihood Estimation of Parametric Models with Singular Information Matrices Article Lasso Maximum Likelihood Estimation of Parametric Models with Singular Information Matrices Fei Jin 1,2 and Lung-fei Lee 3, * 1 School of Economics, Shanghai University of Finance and Economics,

More information

A weighted simulation-based estimator for incomplete longitudinal data models

A weighted simulation-based estimator for incomplete longitudinal data models To appear in Statistics and Probability Letters, 113 (2016), 16-22. doi 10.1016/j.spl.2016.02.004 A weighted simulation-based estimator for incomplete longitudinal data models Daniel H. Li 1 and Liqun

More information

Lecture 6: Discrete Choice: Qualitative Response

Lecture 6: Discrete Choice: Qualitative Response Lecture 6: Instructor: Department of Economics Stanford University 2011 Types of Discrete Choice Models Univariate Models Binary: Linear; Probit; Logit; Arctan, etc. Multinomial: Logit; Nested Logit; GEV;

More information

The Instability of Correlations: Measurement and the Implications for Market Risk

The Instability of Correlations: Measurement and the Implications for Market Risk The Instability of Correlations: Measurement and the Implications for Market Risk Prof. Massimo Guidolin 20254 Advanced Quantitative Methods for Asset Pricing and Structuring Winter/Spring 2018 Threshold

More information

AN INSTRUMENTAL VARIABLE APPROACH FOR IDENTIFICATION AND ESTIMATION WITH NONIGNORABLE NONRESPONSE

AN INSTRUMENTAL VARIABLE APPROACH FOR IDENTIFICATION AND ESTIMATION WITH NONIGNORABLE NONRESPONSE Statistica Sinica 24 (2014), 1097-1116 doi:http://dx.doi.org/10.5705/ss.2012.074 AN INSTRUMENTAL VARIABLE APPROACH FOR IDENTIFICATION AND ESTIMATION WITH NONIGNORABLE NONRESPONSE Sheng Wang 1, Jun Shao

More information

Causal Inference with a Continuous Treatment and Outcome: Alternative Estimators for Parametric Dose-Response Functions

Causal Inference with a Continuous Treatment and Outcome: Alternative Estimators for Parametric Dose-Response Functions Causal Inference with a Continuous Treatment and Outcome: Alternative Estimators for Parametric Dose-Response Functions Joe Schafer Office of the Associate Director for Research and Methodology U.S. Census

More information

MISSING or INCOMPLETE DATA

MISSING or INCOMPLETE DATA MISSING or INCOMPLETE DATA A (fairly) complete review of basic practice Don McLeish and Cyntha Struthers University of Waterloo Dec 5, 2015 Structure of the Workshop Session 1 Common methods for dealing

More information

Research Article Optimal Portfolio Estimation for Dependent Financial Returns with Generalized Empirical Likelihood

Research Article Optimal Portfolio Estimation for Dependent Financial Returns with Generalized Empirical Likelihood Advances in Decision Sciences Volume 2012, Article ID 973173, 8 pages doi:10.1155/2012/973173 Research Article Optimal Portfolio Estimation for Dependent Financial Returns with Generalized Empirical Likelihood

More information

Some Theories about Backfitting Algorithm for Varying Coefficient Partially Linear Model

Some Theories about Backfitting Algorithm for Varying Coefficient Partially Linear Model Some Theories about Backfitting Algorithm for Varying Coefficient Partially Linear Model 1. Introduction Varying-coefficient partially linear model (Zhang, Lee, and Song, 2002; Xia, Zhang, and Tong, 2004;

More information

arxiv: v2 [stat.me] 17 Jan 2017

arxiv: v2 [stat.me] 17 Jan 2017 Semiparametric Estimation with Data Missing Not at Random Using an Instrumental Variable arxiv:1607.03197v2 [stat.me] 17 Jan 2017 BaoLuo Sun 1, Lan Liu 1, Wang Miao 1,4, Kathleen Wirth 2,3, James Robins

More information

Estimation, Inference, and Hypothesis Testing

Estimation, Inference, and Hypothesis Testing Chapter 2 Estimation, Inference, and Hypothesis Testing Note: The primary reference for these notes is Ch. 7 and 8 of Casella & Berger 2. This text may be challenging if new to this topic and Ch. 7 of

More information

arxiv: v1 [stat.me] 15 May 2011

arxiv: v1 [stat.me] 15 May 2011 Working Paper Propensity Score Analysis with Matching Weights Liang Li, Ph.D. arxiv:1105.2917v1 [stat.me] 15 May 2011 Associate Staff of Biostatistics Department of Quantitative Health Sciences, Cleveland

More information

Propensity score weighting for causal inference with multi-stage clustered data

Propensity score weighting for causal inference with multi-stage clustered data Propensity score weighting for causal inference with multi-stage clustered data Shu Yang Department of Statistics, North Carolina State University arxiv:1607.07521v1 stat.me] 26 Jul 2016 Abstract Propensity

More information

Identification and Estimation of Partially Linear Censored Regression Models with Unknown Heteroscedasticity

Identification and Estimation of Partially Linear Censored Regression Models with Unknown Heteroscedasticity Identification and Estimation of Partially Linear Censored Regression Models with Unknown Heteroscedasticity Zhengyu Zhang School of Economics Shanghai University of Finance and Economics zy.zhang@mail.shufe.edu.cn

More information

Quaderni di Dipartimento. Small Sample Properties of Copula-GARCH Modelling: A Monte Carlo Study. Carluccio Bianchi (Università di Pavia)

Quaderni di Dipartimento. Small Sample Properties of Copula-GARCH Modelling: A Monte Carlo Study. Carluccio Bianchi (Università di Pavia) Quaderni di Dipartimento Small Sample Properties of Copula-GARCH Modelling: A Monte Carlo Study Carluccio Bianchi (Università di Pavia) Maria Elena De Giuli (Università di Pavia) Dean Fantazzini (Moscow

More information

Dependence Patterns across Financial Markets: a Mixed Copula Approach

Dependence Patterns across Financial Markets: a Mixed Copula Approach Dependence Patterns across Financial Markets: a Mixed Copula Approach Ling Hu This Draft: October 23 Abstract Using the concept of a copula, this paper shows how to estimate association across financial

More information

High Dimensional Propensity Score Estimation via Covariate Balancing

High Dimensional Propensity Score Estimation via Covariate Balancing High Dimensional Propensity Score Estimation via Covariate Balancing Kosuke Imai Princeton University Talk at Columbia University May 13, 2017 Joint work with Yang Ning and Sida Peng Kosuke Imai (Princeton)

More information

Generated Covariates in Nonparametric Estimation: A Short Review.

Generated Covariates in Nonparametric Estimation: A Short Review. Generated Covariates in Nonparametric Estimation: A Short Review. Enno Mammen, Christoph Rothe, and Melanie Schienle Abstract In many applications, covariates are not observed but have to be estimated

More information

A Measure of Robustness to Misspecification

A Measure of Robustness to Misspecification A Measure of Robustness to Misspecification Susan Athey Guido W. Imbens December 2014 Graduate School of Business, Stanford University, and NBER. Electronic correspondence: athey@stanford.edu. Graduate

More information

Markov Switching Regular Vine Copulas

Markov Switching Regular Vine Copulas Int. Statistical Inst.: Proc. 58th World Statistical Congress, 2011, Dublin (Session CPS057) p.5304 Markov Switching Regular Vine Copulas Stöber, Jakob and Czado, Claudia Lehrstuhl für Mathematische Statistik,

More information

Shu Yang and Jae Kwang Kim. Harvard University and Iowa State University

Shu Yang and Jae Kwang Kim. Harvard University and Iowa State University Statistica Sinica 27 (2017), 000-000 doi:https://doi.org/10.5705/ss.202016.0155 DISCUSSION: DISSECTING MULTIPLE IMPUTATION FROM A MULTI-PHASE INFERENCE PERSPECTIVE: WHAT HAPPENS WHEN GOD S, IMPUTER S AND

More information

Multivariate Non-Normally Distributed Random Variables

Multivariate Non-Normally Distributed Random Variables Multivariate Non-Normally Distributed Random Variables An Introduction to the Copula Approach Workgroup seminar on climate dynamics Meteorological Institute at the University of Bonn 18 January 2008, Bonn

More information

Chapter 5: Models used in conjunction with sampling. J. Kim, W. Fuller (ISU) Chapter 5: Models used in conjunction with sampling 1 / 70

Chapter 5: Models used in conjunction with sampling. J. Kim, W. Fuller (ISU) Chapter 5: Models used in conjunction with sampling 1 / 70 Chapter 5: Models used in conjunction with sampling J. Kim, W. Fuller (ISU) Chapter 5: Models used in conjunction with sampling 1 / 70 Nonresponse Unit Nonresponse: weight adjustment Item Nonresponse:

More information

Gaussian Process Vine Copulas for Multivariate Dependence

Gaussian Process Vine Copulas for Multivariate Dependence Gaussian Process Vine Copulas for Multivariate Dependence José Miguel Hernández-Lobato 1,2 joint work with David López-Paz 2,3 and Zoubin Ghahramani 1 1 Department of Engineering, Cambridge University,

More information

Causal Inference in Observational Studies with Non-Binary Treatments. David A. van Dyk

Causal Inference in Observational Studies with Non-Binary Treatments. David A. van Dyk Causal Inference in Observational Studies with Non-Binary reatments Statistics Section, Imperial College London Joint work with Shandong Zhao and Kosuke Imai Cass Business School, October 2013 Outline

More information

A Sampling of IMPACT Research:

A Sampling of IMPACT Research: A Sampling of IMPACT Research: Methods for Analysis with Dropout and Identifying Optimal Treatment Regimes Marie Davidian Department of Statistics North Carolina State University http://www.stat.ncsu.edu/

More information

Harvard University. Harvard University Biostatistics Working Paper Series

Harvard University. Harvard University Biostatistics Working Paper Series Harvard University Harvard University Biostatistics Working Paper Series Year 2015 Paper 197 On Varieties of Doubly Robust Estimators Under Missing Not at Random With an Ancillary Variable Wang Miao Eric

More information

Measuring Social Influence Without Bias

Measuring Social Influence Without Bias Measuring Social Influence Without Bias Annie Franco Bobbie NJ Macdonald December 9, 2015 The Problem CS224W: Final Paper How well can statistical models disentangle the effects of social influence from

More information

Topics and Papers for Spring 14 RIT

Topics and Papers for Spring 14 RIT Eric Slud Feb. 3, 204 Topics and Papers for Spring 4 RIT The general topic of the RIT is inference for parameters of interest, such as population means or nonlinearregression coefficients, in the presence

More information

A Monte Carlo Comparison of Various Semiparametric Type-3 Tobit Estimators

A Monte Carlo Comparison of Various Semiparametric Type-3 Tobit Estimators ANNALS OF ECONOMICS AND FINANCE 4, 125 136 (2003) A Monte Carlo Comparison of Various Semiparametric Type-3 Tobit Estimators Insik Min Department of Economics, Texas A&M University E-mail: i0m5376@neo.tamu.edu

More information

Probability Distributions and Estimation of Ali-Mikhail-Haq Copula

Probability Distributions and Estimation of Ali-Mikhail-Haq Copula Applied Mathematical Sciences, Vol. 4, 2010, no. 14, 657-666 Probability Distributions and Estimation of Ali-Mikhail-Haq Copula Pranesh Kumar Mathematics Department University of Northern British Columbia

More information

How to select a good vine

How to select a good vine Universitetet i Oslo ingrihaf@math.uio.no International FocuStat Workshop on Focused Information Criteria and Related Themes, May 9-11, 2016 Copulae Regular vines Model selection and reduction Limitations

More information

Econometrics of Panel Data

Econometrics of Panel Data Econometrics of Panel Data Jakub Mućk Meeting # 6 Jakub Mućk Econometrics of Panel Data Meeting # 6 1 / 36 Outline 1 The First-Difference (FD) estimator 2 Dynamic panel data models 3 The Anderson and Hsiao

More information

Empirical likelihood and self-weighting approach for hypothesis testing of infinite variance processes and its applications

Empirical likelihood and self-weighting approach for hypothesis testing of infinite variance processes and its applications Empirical likelihood and self-weighting approach for hypothesis testing of infinite variance processes and its applications Fumiya Akashi Research Associate Department of Applied Mathematics Waseda University

More information

Semi-parametric predictive inference for bivariate data using copulas

Semi-parametric predictive inference for bivariate data using copulas Semi-parametric predictive inference for bivariate data using copulas Tahani Coolen-Maturi a, Frank P.A. Coolen b,, Noryanti Muhammad b a Durham University Business School, Durham University, Durham, DH1

More information

Discussion of Missing Data Methods in Longitudinal Studies: A Review by Ibrahim and Molenberghs

Discussion of Missing Data Methods in Longitudinal Studies: A Review by Ibrahim and Molenberghs Discussion of Missing Data Methods in Longitudinal Studies: A Review by Ibrahim and Molenberghs Michael J. Daniels and Chenguang Wang Jan. 18, 2009 First, we would like to thank Joe and Geert for a carefully

More information

On the Choice of Parametric Families of Copulas

On the Choice of Parametric Families of Copulas On the Choice of Parametric Families of Copulas Radu Craiu Department of Statistics University of Toronto Collaborators: Mariana Craiu, University Politehnica, Bucharest Vienna, July 2008 Outline 1 Brief

More information

Covariate selection and propensity score specification in causal inference

Covariate selection and propensity score specification in causal inference Covariate selection and propensity score specification in causal inference Ingeborg Waernbaum Doctoral Dissertation Department of Statistics Umeå University SE-901 87 Umeå, Sweden Copyright c 2008 by Ingeborg

More information

Statistical Methods for Handling Missing Data

Statistical Methods for Handling Missing Data Statistical Methods for Handling Missing Data Jae-Kwang Kim Department of Statistics, Iowa State University July 5th, 2014 Outline Textbook : Statistical Methods for handling incomplete data by Kim and

More information

Nonparametric Identification of a Binary Random Factor in Cross Section Data - Supplemental Appendix

Nonparametric Identification of a Binary Random Factor in Cross Section Data - Supplemental Appendix Nonparametric Identification of a Binary Random Factor in Cross Section Data - Supplemental Appendix Yingying Dong and Arthur Lewbel California State University Fullerton and Boston College July 2010 Abstract

More information

MISSING or INCOMPLETE DATA

MISSING or INCOMPLETE DATA MISSING or INCOMPLETE DATA A (fairly) complete review of basic practice Don McLeish and Cyntha Struthers University of Waterloo Dec 5, 2015 Structure of the Workshop Session 1 Common methods for dealing

More information

Pairwise rank based likelihood for estimating the relationship between two homogeneous populations and their mixture proportion

Pairwise rank based likelihood for estimating the relationship between two homogeneous populations and their mixture proportion Pairwise rank based likelihood for estimating the relationship between two homogeneous populations and their mixture proportion Glenn Heller and Jing Qin Department of Epidemiology and Biostatistics Memorial

More information

Fair Inference Through Semiparametric-Efficient Estimation Over Constraint-Specific Paths

Fair Inference Through Semiparametric-Efficient Estimation Over Constraint-Specific Paths Fair Inference Through Semiparametric-Efficient Estimation Over Constraint-Specific Paths for New Developments in Nonparametric and Semiparametric Statistics, Joint Statistical Meetings; Vancouver, BC,

More information

By Bhattacharjee, Das. Published: 26 April 2018

By Bhattacharjee, Das. Published: 26 April 2018 Electronic Journal of Applied Statistical Analysis EJASA, Electron. J. App. Stat. Anal. http://siba-ese.unisalento.it/index.php/ejasa/index e-issn: 2070-5948 DOI: 10.1285/i20705948v11n1p155 Estimation

More information

A better way to bootstrap pairs

A better way to bootstrap pairs A better way to bootstrap pairs Emmanuel Flachaire GREQAM - Université de la Méditerranée CORE - Université Catholique de Louvain April 999 Abstract In this paper we are interested in heteroskedastic regression

More information

PARSIMONIOUS MULTIVARIATE COPULA MODEL FOR DENSITY ESTIMATION. Alireza Bayestehtashk and Izhak Shafran

PARSIMONIOUS MULTIVARIATE COPULA MODEL FOR DENSITY ESTIMATION. Alireza Bayestehtashk and Izhak Shafran PARSIMONIOUS MULTIVARIATE COPULA MODEL FOR DENSITY ESTIMATION Alireza Bayestehtashk and Izhak Shafran Center for Spoken Language Understanding, Oregon Health & Science University, Portland, Oregon, USA

More information

Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies

Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies Kosuke Imai Department of Politics Princeton University November 13, 2013 So far, we have essentially assumed

More information

Analysis of Matched Case Control Data in Presence of Nonignorable Missing Exposure

Analysis of Matched Case Control Data in Presence of Nonignorable Missing Exposure Biometrics DOI: 101111/j1541-0420200700828x Analysis of Matched Case Control Data in Presence of Nonignorable Missing Exposure Samiran Sinha 1, and Tapabrata Maiti 2, 1 Department of Statistics, Texas

More information

Statistical Methods. Missing Data snijders/sm.htm. Tom A.B. Snijders. November, University of Oxford 1 / 23

Statistical Methods. Missing Data  snijders/sm.htm. Tom A.B. Snijders. November, University of Oxford 1 / 23 1 / 23 Statistical Methods Missing Data http://www.stats.ox.ac.uk/ snijders/sm.htm Tom A.B. Snijders University of Oxford November, 2011 2 / 23 Literature: Joseph L. Schafer and John W. Graham, Missing

More information

The propensity score with continuous treatments

The propensity score with continuous treatments 7 The propensity score with continuous treatments Keisuke Hirano and Guido W. Imbens 1 7.1 Introduction Much of the work on propensity score analysis has focused on the case in which the treatment is binary.

More information

Improving Efficiency of Inferences in Randomized Clinical Trials Using Auxiliary Covariates

Improving Efficiency of Inferences in Randomized Clinical Trials Using Auxiliary Covariates Improving Efficiency of Inferences in Randomized Clinical Trials Using Auxiliary Covariates Anastasios (Butch) Tsiatis Department of Statistics North Carolina State University http://www.stat.ncsu.edu/

More information

General Doubly Robust Identification and Estimation

General Doubly Robust Identification and Estimation General Doubly Robust Identification and Estimation Arthur Lewbel, Jin-Young Choi, and Zhuzhu Zhou original Nov. 2016, revised August 2018 THIS VERSION IS PRELIMINARY AND INCOMPLETE Abstract Consider two

More information

arxiv: v2 [stat.me] 7 Oct 2017

arxiv: v2 [stat.me] 7 Oct 2017 Weighted empirical likelihood for quantile regression with nonignorable missing covariates arxiv:1703.01866v2 [stat.me] 7 Oct 2017 Xiaohui Yuan, Xiaogang Dong School of Basic Science, Changchun University

More information

GOODNESS-OF-FIT TESTS FOR ARCHIMEDEAN COPULA MODELS

GOODNESS-OF-FIT TESTS FOR ARCHIMEDEAN COPULA MODELS Statistica Sinica 20 (2010), 441-453 GOODNESS-OF-FIT TESTS FOR ARCHIMEDEAN COPULA MODELS Antai Wang Georgetown University Medical Center Abstract: In this paper, we propose two tests for parametric models

More information

MS&E 226: Small Data. Lecture 11: Maximum likelihood (v2) Ramesh Johari

MS&E 226: Small Data. Lecture 11: Maximum likelihood (v2) Ramesh Johari MS&E 226: Small Data Lecture 11: Maximum likelihood (v2) Ramesh Johari ramesh.johari@stanford.edu 1 / 18 The likelihood function 2 / 18 Estimating the parameter This lecture develops the methodology behind

More information

COMPARISON OF GMM WITH SECOND-ORDER LEAST SQUARES ESTIMATION IN NONLINEAR MODELS. Abstract

COMPARISON OF GMM WITH SECOND-ORDER LEAST SQUARES ESTIMATION IN NONLINEAR MODELS. Abstract Far East J. Theo. Stat. 0() (006), 179-196 COMPARISON OF GMM WITH SECOND-ORDER LEAST SQUARES ESTIMATION IN NONLINEAR MODELS Department of Statistics University of Manitoba Winnipeg, Manitoba, Canada R3T

More information

Imbens/Wooldridge, Lecture Notes 1, Summer 07 1

Imbens/Wooldridge, Lecture Notes 1, Summer 07 1 Imbens/Wooldridge, Lecture Notes 1, Summer 07 1 What s New in Econometrics NBER, Summer 2007 Lecture 1, Monday, July 30th, 9.00-10.30am Estimation of Average Treatment Effects Under Unconfoundedness 1.

More information

Empirical likelihood methods in missing response problems and causal interference

Empirical likelihood methods in missing response problems and causal interference The University of Toledo The University of Toledo Digital Repository Theses and Dissertations 2017 Empirical likelihood methods in missing response problems and causal interference Kaili Ren University

More information

Causal Inference with General Treatment Regimes: Generalizing the Propensity Score

Causal Inference with General Treatment Regimes: Generalizing the Propensity Score Causal Inference with General Treatment Regimes: Generalizing the Propensity Score David van Dyk Department of Statistics, University of California, Irvine vandyk@stat.harvard.edu Joint work with Kosuke

More information

A New Generalized Gumbel Copula for Multivariate Distributions

A New Generalized Gumbel Copula for Multivariate Distributions A New Generalized Gumbel Copula for Multivariate Distributions Chandra R. Bhat* The University of Texas at Austin Department of Civil, Architectural & Environmental Engineering University Station, C76,

More information

Statistical Analysis of Randomized Experiments with Nonignorable Missing Binary Outcomes

Statistical Analysis of Randomized Experiments with Nonignorable Missing Binary Outcomes Statistical Analysis of Randomized Experiments with Nonignorable Missing Binary Outcomes Kosuke Imai Department of Politics Princeton University July 31 2007 Kosuke Imai (Princeton University) Nonignorable

More information

The consequences of misspecifying the random effects distribution when fitting generalized linear mixed models

The consequences of misspecifying the random effects distribution when fitting generalized linear mixed models The consequences of misspecifying the random effects distribution when fitting generalized linear mixed models John M. Neuhaus Charles E. McCulloch Division of Biostatistics University of California, San

More information

A note on L convergence of Neumann series approximation in missing data problems

A note on L convergence of Neumann series approximation in missing data problems A note on L convergence of Neumann series approximation in missing data problems Hua Yun Chen Division of Epidemiology & Biostatistics School of Public Health University of Illinois at Chicago 1603 West

More information