Calibration Estimation of Semiparametric Copula Models with Data Missing at Random

Size: px
Start display at page:

Download "Calibration Estimation of Semiparametric Copula Models with Data Missing at Random"

Transcription

1 Calibration Estimation of Semiparametric Copula Models with Data Missing at Random Shigeyuki Hamori 1 Kaiji Motegi 1 Zheng Zhang 2 1 Kobe University 2 Renmin University of China Econometrics Workshop UNC Chapel Hill November 16, 2018 Hamori, Motegi & Zhang (Kobe & RUC) Copula Models with Data MAR November 16, / 49

2 Introduction Copula Model -- Flexible modelling of multivariate distributions -- Popular in business, economics, finance, etc. Missing Data We unitethem for the first time in the literature. Treatment Effect -- Common problem in all fields -- Deep literature in statistics -- Binary phenomenon (to observe or not to observe) -- Estimate unobserved outcome -- Central topic in econometrics Hamori, Motegi & Zhang (Kobe & RUC) Copula Models with Data MAR November 16, / 49

3 Introduction How to fit copula models when there are missing data? A naïve approach is listwise deletion (LD): 1 Keep individuals with all d components being observed, and discard all other individuals. 2 Treat the individuals with complete data in an equal way. i= 1 i= 2 i= 3 i= 4 i= N Y 11 Y 12 Y 13 Y 14 Y 1N Y 21 Y 22 Y 23 Y 24 Y 2N Hamori, Motegi & Zhang (Kobe & RUC) Copula Models with Data MAR November 16, / 49

4 Introduction LD leads to a consistent estimator for the copula parameter of interest if the missing mechanism is missing completely at random (MCAR). LD leads to an inconsistent estimator if the missing mechanism is missing at random (MAR). Under MAR, target variables Y i = [Y 1i,..., Y di ] and their missing status are independent of each other given observed covariates X i = [X 1i,..., X ri ]. LD treats individuals with complete data all equally, and it does not use the information of X i. That can cause substantial bias under MAR. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models with Data MAR November 16, / 49

5 Introduction How can we obtain a consistent estimator for the copula parameter when the missing mechanism is MAR? A key step is the estimation of propensity score function (i.e. conditional probability of observing data given covariates). Direct estimation of propensity score is notoriously challenging, whether it is performed parametrically or nonparametrically. Parametric approaches are haunted by misspecification problems, while nonparametric approaches are often unstable. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models with Data MAR November 16, / 49

6 Introduction We apply Chan, Yam, and Zhang s (2016) calibration estimation to a missing data problem for the first time in the literature. The calibration estimator for the copula parameter satisfies consistency and asymptotic normality under some assumptions including i.i.d. data and the MAR condition. We also derive a consistent estimator for the asymptotic covariance matrix. Our simulation results indicate that the calibration estimator dominates listwise deletion, parametric approach, and nonparametric approach. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models with Data MAR November 16, / 49

7 Table of Contents 1) Introduction 2) Review of Copula Models 3) Review of Missing Data 4) Set-up of Main Problem 5) Theory of Calibration Estimation 6) Data-Driven Selection of Tuning Parameter K 7) Monte Carlo Simulations 8) Conclusions Hamori, Motegi & Zhang (Kobe & RUC) Copula Models with Data MAR November 16, / 49

8 Review of Copula Models Suppose that there are N individuals and d components: Y i = [Y 1i, Y 2i,..., Y di ] (i = 1,..., N). Suppose that we want to estimate the d-dimensional joint distribution of Y i, assuming i.i.d. What can we do? There are potential problems on estimating the joint distribution directly. d = small d = large Parametric Misspecification Misspecification Parameter proliferation Nonparametric (Curse of dimensionality) Curse of dimensionality Hamori, Motegi & Zhang (Kobe & RUC) Copula Models with Data MAR November 16, / 49

9 Review of Copula Models Copula models accomplish flexible specification with a small number of parameters. Copula models follow a two-step procedure. Step 1: Model the marginal distribution of each of the d components separately. Step 2: Combine the d marginal distributions to recover a joint distribution. Step 1 Parametric Nonparametric Nonparametric Step 2 Parametric Parametric Nonparametric Name of model Parametric copula model Semiparametric copula model (Target of our paper) Nonparametric copula model Hamori, Motegi & Zhang (Kobe & RUC) Copula Models with Data MAR November 16, / 49

10 Review of Copula Models The copula approach is justified by Sklar s (1959) theorem. Sklar s theorem ensures the existence of a unique copula function C : (0, 1) d (0, 1) that recovers a true joint distribution. Theorem (Sklar, 1959) Let {Y i } be i.i.d. random vectors with a joint distribution F : R d (0, 1). Assume that the marginal distribution of Y ji, written as F j : R (0, 1), is continuous for j {1,..., d}. Then, there exists a unique function C : (0, 1) d (0, 1) such that F (y 1,..., y d ) = C (F 1 (y 1 ),..., F d (y d )), or in terms of probability density functions, f(y 1,..., y d ) = c (F 1 (y 1 ),..., F d (y d )) f 1 (y 1 ) f d (y d ). Hamori, Motegi & Zhang (Kobe & RUC) Copula Models with Data MAR November 16, / 49

11 Review of Copula Models A well-known example of copula function: bivariate Clayton copula with a scalar parameter α > 0. Cumulative distribution function is C 2 (u 1, u 2 ; α) = (u α 1 + u α 2 1) 1 α, where u 1 = F 1 (y 1 ) (0, 1) and u 2 = F 2 (y 2 ) (0, 1) are marginal distribution functions of Y 1i and Y 2i, respectively. Probability density function is c 2 (u 1, u 2 ; α) = (1 + α)(u 1 u 2 ) α 1 (u α 1 + u α 2 1) 1 α 2. Kendall s rank correlation coefficient is τ = α/(α + 2). Larger α implies stronger association between Y 1i and Y 2i. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models with Data MAR November 16, / 49

12 Review of Copula Models u u 1 p.d.f. (α = or τ =.4) u u 1 p.d.f. (α = or τ =.7) 1 Association is stronger in the lower tail than in the upper tail. Such an asymmetry matches many economic and financial phenomena (e.g. stock market contagion). Larger α implies stronger association between Y 1i and Y 2i. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models with Data MAR November 16, / 49

13 Review of Missing Data Suppose that {Y 1i,..., Y di } N i=1 are target variables. Define the missing indicator: { 1 if Y ji is observed, T ji = 0 if Y ji is missing. Suppose that {X 1i,..., X ri } N i=1 are observable covariates. Missing Completely at Random (MCAR): Missing at Random (MAR): {T 1i,..., T di } {Y 1i,..., Y di }. {T 1i,..., T di } {Y 1i,..., Y di } {X 1i,..., X ri }. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models with Data MAR November 16, / 49

14 Review of Missing Data An illustrative example on health survey (d = r = 1): Y i = weight of individual i, T i = I(Y i is observed), X i = I(Individual i is female). MCAR requires that P[T i = 1] = P[T i = 1 X i = x, Y i = y], (x, y). MAR requires that P[T i = 1 X i = 1] = P[T i = 1 X i = 1, Y i = y], P[T i = 1 X i = 0] = P[T i = 1 X i = 0, Y i = y], y, y. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models with Data MAR November 16, / 49

15 Review of Missing Data Probably too restrictive, since men and women may have different willingness to report their weights. MCAR MNAR MAR More realistic than MCAR, since MAR controls for gender. MAR may be still restrictive, since men (or women) with different weights may have different willingness to report their weights. MNAR is most general since it controls for both gender and weight, but MNAR is hard to handle technically. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models with Data MAR November 16, / 49

16 Review of Missing Data It is well known that listwise deletion (LD) leads to consistent inference under MCAR. It is also well known that LD leads to inconsistent inference under MAR. Correct inference under MAR has been extensively studied since the seminal work of Rubin (1976). The present paper assumes MAR and elaborates the estimation of semiparametric copula models, which has not been done in the literature. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models with Data MAR November 16, / 49

17 Set-up of Main Problem Semiparametric copula models are estimated in two steps: Step 1: Estimate the marginal distributions {F 1,..., F d } nonparametrically via F j (y) = P(Y ji y) = E[I(Y ji y)]. Step 2: Estimate the true copula parameter θ 0 via θ 0 = arg max θ Θ E [log c(f 1(Y 1i ),..., F d (Y di ); θ)]. If data were all observed, then we could simply replace the population quantities with sample counterparts. When data are Missing at Random, we need to assign some weights based on propensity score functions. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models with Data MAR November 16, / 49

18 Set-up of Main Problem Define propensity score functions: π j (x) = P(T ji = 1 X i = x), Define p j (x) = 1/π j (x). Step 1 is rewritten as j {1,..., d}. F j (y) = E[I(Y ji y)] = E[E[I(Y ji y) X i ]] ( LIE) [ [ ] ] Tji = E E π j (X i ) X i E [I(Y ji y) X i ] [ [ ]] Tji = E E π j (X i ) I(Y ji y) X i ( MAR) [ ] Tji = E π j (X i ) I(Y ji y) ( LIE) = E[T ji p j (X i ) I(Y ji y)]. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models with Data MAR November 16, / 49

19 Set-up of Main Problem We have derived: F j (y) = E[I(T ji = 1) p j (X i ) I(Y ji y)]. Horvitz and Thompson s (1952) inverse probability weighting (IPW) estimator for F j is written as F j (y) = 1 N N I(T ji = 1)p j (X i )I(Y ji y). i=1 If p j (x) were known, then it would be straightforward to compute the IPW estimator. p j (x) is unknown in reality. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models with Data MAR November 16, / 49

20 Set-up of Main Problem Define a propensity score function: Define q(x) = 1/η(x). η(x) = P(T 1i = 1,..., T di = 1 X i = x). Using MAR and LIE, Step 2 is rewritten as θ 0 = arg max θ Θ E [I(T 1i = 1,..., T di = 1)q(X i ) log c(f 1 (Y 1i ),..., F d (Y di ); θ)]. The IPW estimator for θ 0 is given by 1 θ = arg max θ Θ N N I(T 1i = 1,..., T di = 1)q(X i ) log c( F 1 (Y 1i ),..., F d (Y di ); θ). i=1 q(x) is unknown in reality. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models with Data MAR November 16, / 49

21 Set-up of Main Problem Direct estimation of propensity score functions is challenging, whether it is performed parametrically or nonparametrically. In the treatment effect literature, Chan, Yam, and Zhang (2016) propose an alternative approach that bypasses a direct estimation of propensity score. They construct calibration weights by balancing the moments of observed covariates among treatment, control, and whole groups. The present paper applies their method to a missing data problem for the first time. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models with Data MAR November 16, / 49

22 Theory of Calibration Estimation Under MAR, the moment matching condition holds: E [ I(T ji = 1)p j,kj (X i )u Kj (X i ) ] = E[u Kj (X i )], j {1,... d} for any integrable function u Kj : R r R K j called an approximation sieve. A common choice is, say, u Kj (X i ) = [1, X i, X 2 i, X 3 i ] (r = 1, K j = 4). A sample counterpart is written as 1 N N I(T ji = 1) p j,kj (X i ) u Kj (X i ) = 1 N i=1 N u Kj (X i ). Multiple values of {p j,kj (X 1 ),..., p j,kj (X N )} satisfy the moment matching condition. Among them, we choose the one closest to a uniform weight given some distance measure. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models with Data MAR November 16, / 49 i=1

23 Theory of Calibration Estimation Why do we want the uniform weight? 1 If there are no missing data, then the uniform weight leads to a natural estimator ˆF j (y) = (N + 1) 1 N i=1 I(Y ji y). 2 It is well known that volatile weights cause instability in the Horvitz-Thompson IPW estimator. Let ρ : R R be any strictly concave function. Define a concave objective function: G j,kj (λ) = 1 N Compute N [ I(Tji = 1)ρ(λ u Kj (X i )) λ u Kj (X i ) ], λ R Kj. i=1 ˆλ j,kj = arg max λ G j,k j (λ). Hamori, Motegi & Zhang (Kobe & RUC) Copula Models with Data MAR November 16, / 49

24 Theory of Calibration Estimation Compute calibration weights for marginal distributions: ˆp j,kj (X i ) = ρ (ˆλ j,k j u Kj (X i )). Estimate the marginal distribution of the j-th component by ˆF j (y) = 1 N N I(T ji = 1)ˆp j,kj (X i )I(Y ji y). i=1 Hamori, Motegi & Zhang (Kobe & RUC) Copula Models with Data MAR November 16, / 49

25 Theory of Calibration Estimation Arbitrariness of ρ arises from the arbitrariness of the distance measure. Functional forms often used in the nonparametric literature include: Exponential tilting: ρ(v) = exp( v). Empirical likelihood: ρ(v) = log(1 + v). Continuous updating GMM: ρ(v) = 0.5(1 v) 2. Inverse logistic: ρ(v) = v exp( v). Hamori, Motegi & Zhang (Kobe & RUC) Copula Models with Data MAR November 16, / 49

26 Theory of Calibration Estimation Step 2 (likelihood maximization) can be handled analogously. Under MAR, the moment matching condition holds: where E [ I(T 1i = 1,..., T di = 1)q(X i )u Kη (X i ) ] = E[u Kη (X i )], q(x i ) 1 η(x i ) = 1 P(T 1i = 1,..., T di = 1 X i = x). Find calibration weights that satisfy the moment matching condition and are closest to the uniform weight given some distance measure. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models with Data MAR November 16, / 49

27 Theory of Calibration Estimation Define a concave objective function: H Kη (β) = 1 N ( ) I(T 1i = 1,..., T di = 1)ρ β u Kη (X i ) 1 N N i=1 Compute ˆβ Kη = arg max β H Kη (β). Compute a calibration weight for the likelihood: ˆq Kη (X i ) = ρ ( ˆβ K η u Kη (X i )). Compute the maximum likelihood estimator ˆθ via N β u Kη (X i ). i=1 max θ Θ 1 N N i=1 ( I(T 1i = 1,..., T di = 1)ˆq Kη (X i ) log c d ˆF1 (Y 1i ),..., ˆF ) d (Y di ); θ. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models with Data MAR November 16, / 49

28 Theory of Calibration Estimation Theorem (Consistency and Asymptotic Normality) Impose a set of assumptions including what follows: The missing mechanism is missing at random (MAR). {Y i, T i, X i } are i.i.d. across individuals i {1,..., N}. π 1 ( ),..., π d ( ), and η( ) are sufficiently smooth. K j (N) as N, and the rate of divergence is sufficiently slow. (The same assumption applies for K η (N).) Then, consistency and asymptotic normality follow. p 1 ˆθ θ0 as N. 2 N( ˆθ θ0 ) d N(0, V ) as N. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models with Data MAR November 16, / 49

29 Theory of Calibration Estimation The asymptotic covariance matrix V is expressed as V = B 1 ΣB 1. We can construct consistent estimators for B and Σ, and hence ˆV = ˆB 1 ˆΣ ˆB 1 p V. See the main paper for a complete set of assumptions, proofs of the consistency and asymptotic normality, and the construction of V and ˆV. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models with Data MAR November 16, / 49

30 Data-Driven Selection of K A practical question is how to select {K 1,..., K d, K η }, the dimensions of approximation sieves. Focus on K j for concreteness. We select an optimal Kj from a viewpoint of covariate balancing. A natural estimator of the joint distribution of X, based on the whole group of individuals, is ˆF X (x) = 1 N N I(X 1i x 1,..., X ri x r ). i=1 An alternative estimator based on the observed group of individuals is N ˆF X,Kj (x) = T jiˆp j,kj (X i )I(X 1i x,..., X ri x r ). i=1 Hamori, Motegi & Zhang (Kobe & RUC) Copula Models with Data MAR November 16, / 49

31 Data-Driven Selection of K On one hand, we should select K j that makes ˆF X and ˆF X,Kj as close as possible. On the other hand, we should impose a penalty against raising K j in order to keep parsimony. Hence we propose to select K j {1, 2,..., K} that minimizes a penalized L 2 -distance: d Kj ( ˆF X, ˆF X,Kj ) = ] 2 [ 1 N N i=1 ˆFX (X i ) ˆF (X X,Kj i) ( 1 K 2 j /N ) 2. The proposed approach is analogous to the generalized cross-validation of Li and Racine (2007, Sec. 15.2). Hamori, Motegi & Zhang (Kobe & RUC) Copula Models with Data MAR November 16, / 49

32 Monte Carlo Simulations: DGP Target variables are Y i = [Y 1i, Y 2i ] (d = 2). Consider a scalar covariate X i. Define U i = [U 1i, U 2i, U 3i ] = [F 1 (Y 1i ), F 2 (Y 2i ), F X (X i )]. F 1 ( ) is the marginal distribution of Y 1i, and we use N(0, 1). F 2 ( ) is the marginal distribution of Y 2i, and we use N(0, 1). F X ( ) is the marginal distribution of X i, and we use N(0, 1). The inverse distribution functions F 1 1 ( ), F 1 2 ( ), and F 1 X ( ) are known and tractable. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models with Data MAR November 16, / 49

33 Monte Carlo Simulations: DGP Step 1: Draw U i i.i.d. Clayton 3 (α 0 ) with α 0 = 6 (Kendall s τ = 0.75). Step 2: Recover Y 1i = F1 1 (U 1i ), Y 2i = F2 1 (U 2i ), and X i = F 1 X (U 3i). Step 3: Assume that {Y 11,..., Y 1N } are all observed. Make some of {Y 21,..., Y 2N } missing according to P(T 2i = 1 X i = x i ) = exp[a + bx i ]. We use (a, b) = ( 0.420, 0.400), which implies MAR with E[T 2i ] = 0.6. Step 4: Repeat Steps 1-3 J = 1000 times with sample size N {500, 1000}. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models with Data MAR November 16, / 49

34 Monte Carlo Simulations: Estimation Approach #1: Listwise deletion. Step 1: Estimate the marginal distribution of the j-th component by where ˆF j (y) = 1 N + 1 N = N I(T 1i = 1, T 2i = 1)I(Y ji < y), i=1 N I(T 1i = 1, T 2i = 1) i=1 is the number of individuals with complete data. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models with Data MAR November 16, / 49

35 Monte Carlo Simulations: Estimation Step 2: Compute the maximum likelihood estimator ˆα by max α (0, ) 1 N N i=1 ( I(T 1i = 1, T 2i = 1) log c 2 ˆF1 (Y 1i ), ˆF ) 2 (Y 2i ); α, where c 2 (u 1, u 2 ; α) = (1 + α)(u 1 u 2 ) α 1 (u α 1 + u α 2 1) 1 α 2 is the probability density function of Clayton 2 (α). This likelihood function is correctly specified since any bivariate marginal distribution of Clayton 3 (α) is Clayton 2 (α). The same property holds for any Archimedean copula (e.g. Clayton, Gumbel, Frank). Hamori, Motegi & Zhang (Kobe & RUC) Copula Models with Data MAR November 16, / 49

36 Monte Carlo Simulations: Estimation Approach #2: Parametric estimation. Consider a correctly specified model for the propensity score: We estimate (a, b) via π 2 (x; a, b) = exp(a + bx). max N [T 2i log π 2 (X i ; a, b) + (1 T 2i ) log (1 π 2 (X i ; a, b))]. i=1 Hamori, Motegi & Zhang (Kobe & RUC) Copula Models with Data MAR November 16, / 49

37 Monte Carlo Simulations: Estimation Compute ˆp 2 (X i ) = ˆq(X i ) = Estimate marginal distributions by 1 π 2 (X i ; â, ˆb). ˆF j (y) = 1 N N I(T ji = 1)ˆp j (X i )I(Y ji < y). i=1 Estimate the copula parameter α via max α (0, ) 1 N N i=1 ( I(T 1i = 1, T 2i = 1)ˆq(X i ) log c 2 ˆF1 (Y 1i ), ˆF ) 2 (Y 2i ); α. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models with Data MAR November 16, / 49

38 Monte Carlo Simulations: Estimation For comparison, consider a misspecified model: π 2 (x; b) = exp(bx). This model is misspecified since the true value is a = The remaining procedure is the same. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models with Data MAR November 16, / 49

39 Monte Carlo Simulations: Estimation Approach #3: Nonparametric estimation of Hirano, Imbens, and Ridder (2003). Define the approximation sieve: u K2 (X i ) = [1, X i,..., X K 2 1 i ]. Define 1 π 2,K2 (X i ; λ) = 1 + exp[ λ u K2 (X i )]. Estimate λ via max N [T 2i log π 2,K2 (X i ; λ) + (1 T 2i ) log (1 π 2,K2 (X i ; λ))]. i=1 Compute ˆp 2,K2 (X i ) = ˆq K2 (X i ) = 1/π 2,K2 (X i ; ˆλ). We use fixed K 2 {3, 4} and the data-driven selection of K 2 {1,..., 5} based on the covariate balancing principle. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models with Data MAR November 16, / 49

40 Monte Carlo Simulations: Estimation The nonparametric approach with K 2 = 2 is essentially identical to the parametric approach with the correctly specified model. This is just a coincidence due to the fact that the true missing mechanism is characterized by a logistic function. In theory, K 2 as N and the nonparametric approach leads to a consistent estimator for any missing mechanism. Finite sample performance is another question. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models with Data MAR November 16, / 49

41 Monte Carlo Simulations: Estimation Approach #4: Calibration estimation. Calibration weights are computed with the exponential tilting: ρ(v) = exp( v). We use fixed K 2 {3, 4} and the data-driven selection of K 2 {1,..., 5} based on the covariate balancing principle. The procedure is as explained. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models with Data MAR November 16, / 49

42 Monte Carlo Simulations: Results N = 500 N = 1000 Truth: α 0 = Bias, Stdev, RMSE Bias, Stdev, RMSE Listwise Deletion 0.453, 0.594, , 0.408, Param (Correct) 0.152, 0.454, , 0.322, Param (Misspec) 1.273, 0.420, , 0.330, Nonparam (K 2 = 3) 0.152, 0.434, , 0.322, Nonparam (K 2 = 4) 0.723, 2.358, , 2.470, Nonparam (K2 ) 0.125, 0.452, , 0.333, Calibration (K 2 = 3) 0.161, 0.450, , 0.322, Calibration (K 2 = 4) 0.165, 0.456, , 0.327, Calibration (K2 ) 0.142, 0.472, , 0.315, Hamori, Motegi & Zhang (Kobe & RUC) Copula Models with Data MAR November 16, / 49

43 Monte Carlo Simulations: Results Listwise deletion produces bias under MAR as expected. The parametric approach produces enormous bias if a propensity score model is misspecified. The nonparametric approach exhibits considerable instability across K 2. The proposed calibration approach achieves strikingly sharp and stable performance. The data-driven selection of K 2 performs well for both the nonparametric and calibration estimators. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models with Data MAR November 16, / 49

44 Monte Carlo Simulations: Extension We make a further comparison of the nonparametric estimator and the calibration estimator under a more challenging scenario. Assume that there are r = 2 covariates: X i = [X 1i, X 2i ]. Assume that a true propensity score is P(T 2i = 1 X i = x i ) = exp[ x 1i + 0.2x 2i ]. This implies MAR with E[T 2i ] = 0.6 as before. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models with Data MAR November 16, / 49

45 Monte Carlo Simulations: Extension Reconstruct the approximation sieve u K2 (X) as follows: u 10 (X) = (1, X 1, X 2, X 2 1, X 2 2, X 1 X 2, X 3 1, X 3 2, X 2 1X 2, X 1 X 2 2). For K 2 {1,..., 10}, let u K2 (X) be the first K 2 elements of u 10 (X). For each estimator, we use the following fixed values: K 2 = 3 (i.e. only the first moments of X). K 2 = 6 (i.e. the second moments added). K 2 = 10 (i.e. the third moments added). We also use the data-driven selection of K 2 {1,..., 10} based on the covariate balancing principle. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models with Data MAR November 16, / 49

46 Monte Carlo Simulations: Extension N = 500 N = 1000 Truth: α 0 = Bias, Stdev, RMSE Bias, Stdev, RMSE Nonparam (K 2 = 3) 0.150, 0.461, , 0.324, Nonparam (K 2 = 6) 0.160, 0.531, , 0.476, Nonparam (K 2 = 10) 1.127, 4.028, , 3.013, Nonparam (K2 ) 0.106, 1.153, , 1.265, Calibration (K 2 = 3) 0.253, 0.458, , 0.330, Calibration (K 2 = 6) 0.147, 0.446, , 0.331, Calibration (K 2 = 10) 0.143, 0.455, , 0.331, Calibration (K2 ) 0.200, 0.458, , 0.346, Hamori, Motegi & Zhang (Kobe & RUC) Copula Models with Data MAR November 16, / 49

47 Monte Carlo Simulations: Extension The nonparametric approach exhibits considerable instability across K 2. The nonparametric estimator assisted by the data-driven selection of K 2 has small bias but large variance. The proposed calibration approach achieves sharp and stable performance. The calibration estimator assisted by the data-driven selection of K 2 has small bias and small variance. Hence the calibration estimator clearly outperforms the nonparametric estimator. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models with Data MAR November 16, / 49

48 Conclusions We investigate the estimation of semiparametric copula models under missing data for the first time in the literature. There is analogy between missing data and average treatment effects since both of them are binary in nature. We apply Chan, Yam, and Zhang s (2016) calibration estimation to missing data for the first time in the literature. The calibration estimator satisfies consistency and asymptotic normality under the i.i.d. assumption and the Missing at Random (MAR) condition. Our simulation results indicate that the calibration estimator dominates listwise deletion, parametric approach, and nonparametric approach. Hamori, Motegi & Zhang (Kobe & RUC) Copula Models with Data MAR November 16, / 49

49 References Chan, K. C. G., S. C. P. Yam, and Z. Zhang (2016). Globally Efficient Non-Parametric Inference of Average Treatment Effects by Empirical Balancing Calibration Weighting. Journal of the Royal Statistical Society, Series B, 78, Hirano, K., G. W. Imbens, and G. Ridder (2003). Efficient Estimation of Average Treatment Effects Using the Estimated Propensity Score. Econometrica, 71, Horvitz, D. G. and D. J. Thompson (1952). A Generalization of Sampling without Replacement from a Finite Universe. Journal of the American Statistical Association, 47, Li, Q. and J. S. Racine (2007). Nonparametric Econometrics: Theory and Practice. Princeton University Press. Rubin, D. B. (1976). Inference and Missing Data. Biometrika, 63, Sklar, A. (1959). Fonctions de répartition à n dimensions et leurs marges. Publications de l Institut de Statistique de L Université de Paris, 8, Hamori, Motegi & Zhang (Kobe & RUC) Copula Models with Data MAR November 16, / 49

Calibration Estimation of Semiparametric Copula Models with Data Missing at Random

Calibration Estimation of Semiparametric Copula Models with Data Missing at Random Calibration Estimation of Semiparametric Copula Models with Data Missing at Random Shigeyuki Hamori 1 Kaiji Motegi 1 Zheng Zhang 2 1 Kobe University 2 Renmin University of China Institute of Statistics

More information

Calibration Estimation for Semiparametric Copula Models under Missing Data

Calibration Estimation for Semiparametric Copula Models under Missing Data Calibration Estimation for Semiparametric Copula Models under Missing Data Shigeyuki Hamori 1 Kaiji Motegi 1 Zheng Zhang 2 1 Kobe University 2 Renmin University of China Economics and Economic Growth Centre

More information

Calibration Estimation of Semiparametric Copula Models with Data Missing at Random

Calibration Estimation of Semiparametric Copula Models with Data Missing at Random Calibration Estimation of Semiparametric Copula Models with Data Missing at Random Shigeyuki Hamori Kaiji Motegi Zheng Zhang August 30, 208 Abstract This paper investigates the estimation of semiparametric

More information

Calibration Estimation of Semiparametric Copula Models with Data Missing at Random

Calibration Estimation of Semiparametric Copula Models with Data Missing at Random Calibration Estimation of Semiparametric Copula Models with Data Missing at Random Shigeyuki Hamori Kaiji Motegi Zheng Zhang March 28, 2018 Abstract This paper investigates the estimation of semiparametric

More information

Flexible Estimation of Treatment Effect Parameters

Flexible Estimation of Treatment Effect Parameters Flexible Estimation of Treatment Effect Parameters Thomas MaCurdy a and Xiaohong Chen b and Han Hong c Introduction Many empirical studies of program evaluations are complicated by the presence of both

More information

Modification and Improvement of Empirical Likelihood for Missing Response Problem

Modification and Improvement of Empirical Likelihood for Missing Response Problem UW Biostatistics Working Paper Series 12-30-2010 Modification and Improvement of Empirical Likelihood for Missing Response Problem Kwun Chuen Gary Chan University of Washington - Seattle Campus, kcgchan@u.washington.edu

More information

A Goodness-of-fit Test for Copulas

A Goodness-of-fit Test for Copulas A Goodness-of-fit Test for Copulas Artem Prokhorov August 2008 Abstract A new goodness-of-fit test for copulas is proposed. It is based on restrictions on certain elements of the information matrix and

More information

Covariate Balancing Propensity Score for General Treatment Regimes

Covariate Balancing Propensity Score for General Treatment Regimes Covariate Balancing Propensity Score for General Treatment Regimes Kosuke Imai Princeton University October 14, 2014 Talk at the Department of Psychiatry, Columbia University Joint work with Christian

More information

Using copulas to model time dependence in stochastic frontier models

Using copulas to model time dependence in stochastic frontier models Using copulas to model time dependence in stochastic frontier models Christine Amsler Michigan State University Artem Prokhorov Concordia University November 2008 Peter Schmidt Michigan State University

More information

Robustness of a semiparametric estimator of a copula

Robustness of a semiparametric estimator of a copula Robustness of a semiparametric estimator of a copula Gunky Kim a, Mervyn J. Silvapulle b and Paramsothy Silvapulle c a Department of Econometrics and Business Statistics, Monash University, c Caulfield

More information

Financial Econometrics and Volatility Models Copulas

Financial Econometrics and Volatility Models Copulas Financial Econometrics and Volatility Models Copulas Eric Zivot Updated: May 10, 2010 Reading MFTS, chapter 19 FMUND, chapters 6 and 7 Introduction Capturing co-movement between financial asset returns

More information

Robustness to Parametric Assumptions in Missing Data Models

Robustness to Parametric Assumptions in Missing Data Models Robustness to Parametric Assumptions in Missing Data Models Bryan Graham NYU Keisuke Hirano University of Arizona April 2011 Motivation Motivation We consider the classic missing data problem. In practice

More information

A measure of radial asymmetry for bivariate copulas based on Sobolev norm

A measure of radial asymmetry for bivariate copulas based on Sobolev norm A measure of radial asymmetry for bivariate copulas based on Sobolev norm Ahmad Alikhani-Vafa Ali Dolati Abstract The modified Sobolev norm is used to construct an index for measuring the degree of radial

More information

Copulas. MOU Lili. December, 2014

Copulas. MOU Lili. December, 2014 Copulas MOU Lili December, 2014 Outline Preliminary Introduction Formal Definition Copula Functions Estimating the Parameters Example Conclusion and Discussion Preliminary MOU Lili SEKE Team 3/30 Probability

More information

Copula-Based Random Effects Models

Copula-Based Random Effects Models Copula-Based Random Effects Models Santiago Pereda-Fernández Banca d Italia September 7, 25 PRELIMINARY, INCOMPLETE, PLEASE DO NOT CITE Abstract In a binary choice panel data framework, standard estimators

More information

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A. 1. Let P be a probability measure on a collection of sets A. (a) For each n N, let H n be a set in A such that H n H n+1. Show that P (H n ) monotonically converges to P ( k=1 H k) as n. (b) For each n

More information

Probability Distributions and Estimation of Ali-Mikhail-Haq Copula

Probability Distributions and Estimation of Ali-Mikhail-Haq Copula Applied Mathematical Sciences, Vol. 4, 2010, no. 14, 657-666 Probability Distributions and Estimation of Ali-Mikhail-Haq Copula Pranesh Kumar Mathematics Department University of Northern British Columbia

More information

An Efficient Estimation Method for Longitudinal Surveys with Monotone Missing Data

An Efficient Estimation Method for Longitudinal Surveys with Monotone Missing Data An Efficient Estimation Method for Longitudinal Surveys with Monotone Missing Data Jae-Kwang Kim 1 Iowa State University June 28, 2012 1 Joint work with Dr. Ming Zhou (when he was a PhD student at ISU)

More information

Quaderni di Dipartimento. Small Sample Properties of Copula-GARCH Modelling: A Monte Carlo Study. Carluccio Bianchi (Università di Pavia)

Quaderni di Dipartimento. Small Sample Properties of Copula-GARCH Modelling: A Monte Carlo Study. Carluccio Bianchi (Università di Pavia) Quaderni di Dipartimento Small Sample Properties of Copula-GARCH Modelling: A Monte Carlo Study Carluccio Bianchi (Università di Pavia) Maria Elena De Giuli (Università di Pavia) Dean Fantazzini (Moscow

More information

Causal Inference with a Continuous Treatment and Outcome: Alternative Estimators for Parametric Dose-Response Functions

Causal Inference with a Continuous Treatment and Outcome: Alternative Estimators for Parametric Dose-Response Functions Causal Inference with a Continuous Treatment and Outcome: Alternative Estimators for Parametric Dose-Response Functions Joe Schafer Office of the Associate Director for Research and Methodology U.S. Census

More information

Weighting Methods. Harvard University STAT186/GOV2002 CAUSAL INFERENCE. Fall Kosuke Imai

Weighting Methods. Harvard University STAT186/GOV2002 CAUSAL INFERENCE. Fall Kosuke Imai Weighting Methods Kosuke Imai Harvard University STAT186/GOV2002 CAUSAL INFERENCE Fall 2018 Kosuke Imai (Harvard) Weighting Methods Stat186/Gov2002 Fall 2018 1 / 13 Motivation Matching methods for improving

More information

What s New in Econometrics. Lecture 1

What s New in Econometrics. Lecture 1 What s New in Econometrics Lecture 1 Estimation of Average Treatment Effects Under Unconfoundedness Guido Imbens NBER Summer Institute, 2007 Outline 1. Introduction 2. Potential Outcomes 3. Estimands and

More information

Imputation Algorithm Using Copulas

Imputation Algorithm Using Copulas Metodološki zvezki, Vol. 3, No. 1, 2006, 109-120 Imputation Algorithm Using Copulas Ene Käärik 1 Abstract In this paper the author demonstrates how the copulas approach can be used to find algorithms for

More information

Marginal Specifications and a Gaussian Copula Estimation

Marginal Specifications and a Gaussian Copula Estimation Marginal Specifications and a Gaussian Copula Estimation Kazim Azam Abstract Multivariate analysis involving random variables of different type like count, continuous or mixture of both is frequently required

More information

Lasso Maximum Likelihood Estimation of Parametric Models with Singular Information Matrices

Lasso Maximum Likelihood Estimation of Parametric Models with Singular Information Matrices Article Lasso Maximum Likelihood Estimation of Parametric Models with Singular Information Matrices Fei Jin 1,2 and Lung-fei Lee 3, * 1 School of Economics, Shanghai University of Finance and Economics,

More information

Statistical Data Mining and Machine Learning Hilary Term 2016

Statistical Data Mining and Machine Learning Hilary Term 2016 Statistical Data Mining and Machine Learning Hilary Term 2016 Dino Sejdinovic Department of Statistics Oxford Slides and other materials available at: http://www.stats.ox.ac.uk/~sejdinov/sdmml Naïve Bayes

More information

On the Choice of Parametric Families of Copulas

On the Choice of Parametric Families of Copulas On the Choice of Parametric Families of Copulas Radu Craiu Department of Statistics University of Toronto Collaborators: Mariana Craiu, University Politehnica, Bucharest Vienna, July 2008 Outline 1 Brief

More information

Semi-parametric predictive inference for bivariate data using copulas

Semi-parametric predictive inference for bivariate data using copulas Semi-parametric predictive inference for bivariate data using copulas Tahani Coolen-Maturi a, Frank P.A. Coolen b,, Noryanti Muhammad b a Durham University Business School, Durham University, Durham, DH1

More information

Recent Advances in the analysis of missing data with non-ignorable missingness

Recent Advances in the analysis of missing data with non-ignorable missingness Recent Advances in the analysis of missing data with non-ignorable missingness Jae-Kwang Kim Department of Statistics, Iowa State University July 4th, 2014 1 Introduction 2 Full likelihood-based ML estimation

More information

Fractional Hot Deck Imputation for Robust Inference Under Item Nonresponse in Survey Sampling

Fractional Hot Deck Imputation for Robust Inference Under Item Nonresponse in Survey Sampling Fractional Hot Deck Imputation for Robust Inference Under Item Nonresponse in Survey Sampling Jae-Kwang Kim 1 Iowa State University June 26, 2013 1 Joint work with Shu Yang Introduction 1 Introduction

More information

A Monte Carlo Comparison of Various Semiparametric Type-3 Tobit Estimators

A Monte Carlo Comparison of Various Semiparametric Type-3 Tobit Estimators ANNALS OF ECONOMICS AND FINANCE 4, 125 136 (2003) A Monte Carlo Comparison of Various Semiparametric Type-3 Tobit Estimators Insik Min Department of Economics, Texas A&M University E-mail: i0m5376@neo.tamu.edu

More information

The Instability of Correlations: Measurement and the Implications for Market Risk

The Instability of Correlations: Measurement and the Implications for Market Risk The Instability of Correlations: Measurement and the Implications for Market Risk Prof. Massimo Guidolin 20254 Advanced Quantitative Methods for Asset Pricing and Structuring Winter/Spring 2018 Threshold

More information

MS&E 226: Small Data. Lecture 11: Maximum likelihood (v2) Ramesh Johari

MS&E 226: Small Data. Lecture 11: Maximum likelihood (v2) Ramesh Johari MS&E 226: Small Data Lecture 11: Maximum likelihood (v2) Ramesh Johari ramesh.johari@stanford.edu 1 / 18 The likelihood function 2 / 18 Estimating the parameter This lecture develops the methodology behind

More information

A Course in Applied Econometrics Lecture 18: Missing Data. Jeff Wooldridge IRP Lectures, UW Madison, August Linear model with IVs: y i x i u i,

A Course in Applied Econometrics Lecture 18: Missing Data. Jeff Wooldridge IRP Lectures, UW Madison, August Linear model with IVs: y i x i u i, A Course in Applied Econometrics Lecture 18: Missing Data Jeff Wooldridge IRP Lectures, UW Madison, August 2008 1. When Can Missing Data be Ignored? 2. Inverse Probability Weighting 3. Imputation 4. Heckman-Type

More information

Combining multiple observational data sources to estimate causal eects

Combining multiple observational data sources to estimate causal eects Department of Statistics, North Carolina State University Combining multiple observational data sources to estimate causal eects Shu Yang* syang24@ncsuedu Joint work with Peng Ding UC Berkeley May 23,

More information

A New Generalized Gumbel Copula for Multivariate Distributions

A New Generalized Gumbel Copula for Multivariate Distributions A New Generalized Gumbel Copula for Multivariate Distributions Chandra R. Bhat* The University of Texas at Austin Department of Civil, Architectural & Environmental Engineering University Station, C76,

More information

A Sampling of IMPACT Research:

A Sampling of IMPACT Research: A Sampling of IMPACT Research: Methods for Analysis with Dropout and Identifying Optimal Treatment Regimes Marie Davidian Department of Statistics North Carolina State University http://www.stat.ncsu.edu/

More information

Chapter 5: Models used in conjunction with sampling. J. Kim, W. Fuller (ISU) Chapter 5: Models used in conjunction with sampling 1 / 70

Chapter 5: Models used in conjunction with sampling. J. Kim, W. Fuller (ISU) Chapter 5: Models used in conjunction with sampling 1 / 70 Chapter 5: Models used in conjunction with sampling J. Kim, W. Fuller (ISU) Chapter 5: Models used in conjunction with sampling 1 / 70 Nonresponse Unit Nonresponse: weight adjustment Item Nonresponse:

More information

Primal-dual Covariate Balance and Minimal Double Robustness via Entropy Balancing

Primal-dual Covariate Balance and Minimal Double Robustness via Entropy Balancing Primal-dual Covariate Balance and Minimal Double Robustness via (Joint work with Daniel Percival) Department of Statistics, Stanford University JSM, August 9, 2015 Outline 1 2 3 1/18 Setting Rubin s causal

More information

Estimation, Inference, and Hypothesis Testing

Estimation, Inference, and Hypothesis Testing Chapter 2 Estimation, Inference, and Hypothesis Testing Note: The primary reference for these notes is Ch. 7 and 8 of Casella & Berger 2. This text may be challenging if new to this topic and Ch. 7 of

More information

Statistical Methods for Handling Incomplete Data Chapter 2: Likelihood-based approach

Statistical Methods for Handling Incomplete Data Chapter 2: Likelihood-based approach Statistical Methods for Handling Incomplete Data Chapter 2: Likelihood-based approach Jae-Kwang Kim Department of Statistics, Iowa State University Outline 1 Introduction 2 Observed likelihood 3 Mean Score

More information

MISSING or INCOMPLETE DATA

MISSING or INCOMPLETE DATA MISSING or INCOMPLETE DATA A (fairly) complete review of basic practice Don McLeish and Cyntha Struthers University of Waterloo Dec 5, 2015 Structure of the Workshop Session 1 Common methods for dealing

More information

Multivariate Distribution Models

Multivariate Distribution Models Multivariate Distribution Models Model Description While the probability distribution for an individual random variable is called marginal, the probability distribution for multiple random variables is

More information

Does k-th Moment Exist?

Does k-th Moment Exist? Does k-th Moment Exist? Hitomi, K. 1 and Y. Nishiyama 2 1 Kyoto Institute of Technology, Japan 2 Institute of Economic Research, Kyoto University, Japan Email: hitomi@kit.ac.jp Keywords: Existence of moments,

More information

Introduction An approximated EM algorithm Simulation studies Discussion

Introduction An approximated EM algorithm Simulation studies Discussion 1 / 33 An Approximated Expectation-Maximization Algorithm for Analysis of Data with Missing Values Gong Tang Department of Biostatistics, GSPH University of Pittsburgh NISS Workshop on Nonignorable Nonresponse

More information

LIKELIHOOD RATIO INFERENCE FOR MISSING DATA MODELS

LIKELIHOOD RATIO INFERENCE FOR MISSING DATA MODELS LIKELIHOOD RATIO IFERECE FOR MISSIG DATA MODELS KARU ADUSUMILLI AD TAISUKE OTSU Abstract. Missing or incomplete outcome data is a ubiquitous problem in biomedical and social sciences. Under the missing

More information

PARSIMONIOUS MULTIVARIATE COPULA MODEL FOR DENSITY ESTIMATION. Alireza Bayestehtashk and Izhak Shafran

PARSIMONIOUS MULTIVARIATE COPULA MODEL FOR DENSITY ESTIMATION. Alireza Bayestehtashk and Izhak Shafran PARSIMONIOUS MULTIVARIATE COPULA MODEL FOR DENSITY ESTIMATION Alireza Bayestehtashk and Izhak Shafran Center for Spoken Language Understanding, Oregon Health & Science University, Portland, Oregon, USA

More information

Causal Inference with General Treatment Regimes: Generalizing the Propensity Score

Causal Inference with General Treatment Regimes: Generalizing the Propensity Score Causal Inference with General Treatment Regimes: Generalizing the Propensity Score David van Dyk Department of Statistics, University of California, Irvine vandyk@stat.harvard.edu Joint work with Kosuke

More information

Research Article Optimal Portfolio Estimation for Dependent Financial Returns with Generalized Empirical Likelihood

Research Article Optimal Portfolio Estimation for Dependent Financial Returns with Generalized Empirical Likelihood Advances in Decision Sciences Volume 2012, Article ID 973173, 8 pages doi:10.1155/2012/973173 Research Article Optimal Portfolio Estimation for Dependent Financial Returns with Generalized Empirical Likelihood

More information

Web-based Supplementary Material for. Dependence Calibration in Conditional Copulas: A Nonparametric Approach

Web-based Supplementary Material for. Dependence Calibration in Conditional Copulas: A Nonparametric Approach 1 Web-based Supplementary Material for Dependence Calibration in Conditional Copulas: A Nonparametric Approach Elif F. Acar, Radu V. Craiu, and Fang Yao Web Appendix A: Technical Details The score and

More information

Estimation of the Conditional Variance in Paired Experiments

Estimation of the Conditional Variance in Paired Experiments Estimation of the Conditional Variance in Paired Experiments Alberto Abadie & Guido W. Imbens Harvard University and BER June 008 Abstract In paired randomized experiments units are grouped in pairs, often

More information

A Measure of Robustness to Misspecification

A Measure of Robustness to Misspecification A Measure of Robustness to Misspecification Susan Athey Guido W. Imbens December 2014 Graduate School of Business, Stanford University, and NBER. Electronic correspondence: athey@stanford.edu. Graduate

More information

High Dimensional Propensity Score Estimation via Covariate Balancing

High Dimensional Propensity Score Estimation via Covariate Balancing High Dimensional Propensity Score Estimation via Covariate Balancing Kosuke Imai Princeton University Talk at Columbia University May 13, 2017 Joint work with Yang Ning and Sida Peng Kosuke Imai (Princeton)

More information

ANALYSIS OF ORDINAL SURVEY RESPONSES WITH DON T KNOW

ANALYSIS OF ORDINAL SURVEY RESPONSES WITH DON T KNOW SSC Annual Meeting, June 2015 Proceedings of the Survey Methods Section ANALYSIS OF ORDINAL SURVEY RESPONSES WITH DON T KNOW Xichen She and Changbao Wu 1 ABSTRACT Ordinal responses are frequently involved

More information

Data Integration for Big Data Analysis for finite population inference

Data Integration for Big Data Analysis for finite population inference for Big Data Analysis for finite population inference Jae-kwang Kim ISU January 23, 2018 1 / 36 What is big data? 2 / 36 Data do not speak for themselves Knowledge Reproducibility Information Intepretation

More information

Dependence Patterns across Financial Markets: a Mixed Copula Approach

Dependence Patterns across Financial Markets: a Mixed Copula Approach Dependence Patterns across Financial Markets: a Mixed Copula Approach Ling Hu This Draft: October 23 Abstract Using the concept of a copula, this paper shows how to estimate association across financial

More information

Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies

Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies Kosuke Imai Department of Politics Princeton University November 13, 2013 So far, we have essentially assumed

More information

Stat 542: Item Response Theory Modeling Using The Extended Rank Likelihood

Stat 542: Item Response Theory Modeling Using The Extended Rank Likelihood Stat 542: Item Response Theory Modeling Using The Extended Rank Likelihood Jonathan Gruhl March 18, 2010 1 Introduction Researchers commonly apply item response theory (IRT) models to binary and ordinal

More information

Markov Switching Regular Vine Copulas

Markov Switching Regular Vine Copulas Int. Statistical Inst.: Proc. 58th World Statistical Congress, 2011, Dublin (Session CPS057) p.5304 Markov Switching Regular Vine Copulas Stöber, Jakob and Czado, Claudia Lehrstuhl für Mathematische Statistik,

More information

Measuring Social Influence Without Bias

Measuring Social Influence Without Bias Measuring Social Influence Without Bias Annie Franco Bobbie NJ Macdonald December 9, 2015 The Problem CS224W: Final Paper How well can statistical models disentangle the effects of social influence from

More information

CONTAGION VERSUS FLIGHT TO QUALITY IN FINANCIAL MARKETS

CONTAGION VERSUS FLIGHT TO QUALITY IN FINANCIAL MARKETS EVA IV, CONTAGION VERSUS FLIGHT TO QUALITY IN FINANCIAL MARKETS Jose Olmo Department of Economics City University, London (joint work with Jesús Gonzalo, Universidad Carlos III de Madrid) 4th Conference

More information

Generated Covariates in Nonparametric Estimation: A Short Review.

Generated Covariates in Nonparametric Estimation: A Short Review. Generated Covariates in Nonparametric Estimation: A Short Review. Enno Mammen, Christoph Rothe, and Melanie Schienle Abstract In many applications, covariates are not observed but have to be estimated

More information

Binary Models with Endogenous Explanatory Variables

Binary Models with Endogenous Explanatory Variables Binary Models with Endogenous Explanatory Variables Class otes Manuel Arellano ovember 7, 2007 Revised: January 21, 2008 1 Introduction In Part I we considered linear and non-linear models with additive

More information

Modelling Dropouts by Conditional Distribution, a Copula-Based Approach

Modelling Dropouts by Conditional Distribution, a Copula-Based Approach The 8th Tartu Conference on MULTIVARIATE STATISTICS, The 6th Conference on MULTIVARIATE DISTRIBUTIONS with Fixed Marginals Modelling Dropouts by Conditional Distribution, a Copula-Based Approach Ene Käärik

More information

Spring 2017 Econ 574 Roger Koenker. Lecture 14 GEE-GMM

Spring 2017 Econ 574 Roger Koenker. Lecture 14 GEE-GMM University of Illinois Department of Economics Spring 2017 Econ 574 Roger Koenker Lecture 14 GEE-GMM Throughout the course we have emphasized methods of estimation and inference based on the principle

More information

Bayesian Inference. Chapter 9. Linear models and regression

Bayesian Inference. Chapter 9. Linear models and regression Bayesian Inference Chapter 9. Linear models and regression M. Concepcion Ausin Universidad Carlos III de Madrid Master in Business Administration and Quantitative Methods Master in Mathematical Engineering

More information

Some Theories about Backfitting Algorithm for Varying Coefficient Partially Linear Model

Some Theories about Backfitting Algorithm for Varying Coefficient Partially Linear Model Some Theories about Backfitting Algorithm for Varying Coefficient Partially Linear Model 1. Introduction Varying-coefficient partially linear model (Zhang, Lee, and Song, 2002; Xia, Zhang, and Tong, 2004;

More information

Trivariate copulas for characterisation of droughts

Trivariate copulas for characterisation of droughts ANZIAM J. 49 (EMAC2007) pp.c306 C323, 2008 C306 Trivariate copulas for characterisation of droughts G. Wong 1 M. F. Lambert 2 A. V. Metcalfe 3 (Received 3 August 2007; revised 4 January 2008) Abstract

More information

Lecture 6: Discrete Choice: Qualitative Response

Lecture 6: Discrete Choice: Qualitative Response Lecture 6: Instructor: Department of Economics Stanford University 2011 Types of Discrete Choice Models Univariate Models Binary: Linear; Probit; Logit; Arctan, etc. Multinomial: Logit; Nested Logit; GEV;

More information

This paper is not to be removed from the Examination Halls

This paper is not to be removed from the Examination Halls ~~ST104B ZA d0 This paper is not to be removed from the Examination Halls UNIVERSITY OF LONDON ST104B ZB BSc degrees and Diplomas for Graduates in Economics, Management, Finance and the Social Sciences,

More information

Web-based Supplementary Materials for A Robust Method for Estimating. Optimal Treatment Regimes

Web-based Supplementary Materials for A Robust Method for Estimating. Optimal Treatment Regimes Biometrics 000, 000 000 DOI: 000 000 0000 Web-based Supplementary Materials for A Robust Method for Estimating Optimal Treatment Regimes Baqun Zhang, Anastasios A. Tsiatis, Eric B. Laber, and Marie Davidian

More information

Empirical likelihood and self-weighting approach for hypothesis testing of infinite variance processes and its applications

Empirical likelihood and self-weighting approach for hypothesis testing of infinite variance processes and its applications Empirical likelihood and self-weighting approach for hypothesis testing of infinite variance processes and its applications Fumiya Akashi Research Associate Department of Applied Mathematics Waseda University

More information

Pairwise rank based likelihood for estimating the relationship between two homogeneous populations and their mixture proportion

Pairwise rank based likelihood for estimating the relationship between two homogeneous populations and their mixture proportion Pairwise rank based likelihood for estimating the relationship between two homogeneous populations and their mixture proportion Glenn Heller and Jing Qin Department of Epidemiology and Biostatistics Memorial

More information

How to select a good vine

How to select a good vine Universitetet i Oslo ingrihaf@math.uio.no International FocuStat Workshop on Focused Information Criteria and Related Themes, May 9-11, 2016 Copulae Regular vines Model selection and reduction Limitations

More information

ECON 3150/4150, Spring term Lecture 6

ECON 3150/4150, Spring term Lecture 6 ECON 3150/4150, Spring term 2013. Lecture 6 Review of theoretical statistics for econometric modelling (II) Ragnar Nymoen University of Oslo 31 January 2013 1 / 25 References to Lecture 3 and 6 Lecture

More information

Using copulas to model time dependence in stochastic frontier models Preliminary and Incomplete Please do not cite

Using copulas to model time dependence in stochastic frontier models Preliminary and Incomplete Please do not cite Using copulas to model time dependence in stochastic frontier models Preliminary and Incomplete Please do not cite Christine Amsler Michigan State University Artem Prokhorov Concordia University December

More information

Generalized Autoregressive Score Smoothers

Generalized Autoregressive Score Smoothers Generalized Autoregressive Score Smoothers Giuseppe Buccheri 1, Giacomo Bormetti 2, Fulvio Corsi 3,4, and Fabrizio Lillo 2 1 Scuola Normale Superiore, Italy 2 University of Bologna, Italy 3 University

More information

GOODNESS-OF-FIT TESTS FOR ARCHIMEDEAN COPULA MODELS

GOODNESS-OF-FIT TESTS FOR ARCHIMEDEAN COPULA MODELS Statistica Sinica 20 (2010), 441-453 GOODNESS-OF-FIT TESTS FOR ARCHIMEDEAN COPULA MODELS Antai Wang Georgetown University Medical Center Abstract: In this paper, we propose two tests for parametric models

More information

Density estimation Nonparametric conditional mean estimation Semiparametric conditional mean estimation. Nonparametrics. Gabriel Montes-Rojas

Density estimation Nonparametric conditional mean estimation Semiparametric conditional mean estimation. Nonparametrics. Gabriel Montes-Rojas 0 0 5 Motivation: Regression discontinuity (Angrist&Pischke) Outcome.5 1 1.5 A. Linear E[Y 0i X i] 0.2.4.6.8 1 X Outcome.5 1 1.5 B. Nonlinear E[Y 0i X i] i 0.2.4.6.8 1 X utcome.5 1 1.5 C. Nonlinearity

More information

Nonparametric Identification of a Binary Random Factor in Cross Section Data - Supplemental Appendix

Nonparametric Identification of a Binary Random Factor in Cross Section Data - Supplemental Appendix Nonparametric Identification of a Binary Random Factor in Cross Section Data - Supplemental Appendix Yingying Dong and Arthur Lewbel California State University Fullerton and Boston College July 2010 Abstract

More information

Econometrics of Panel Data

Econometrics of Panel Data Econometrics of Panel Data Jakub Mućk Meeting # 6 Jakub Mućk Econometrics of Panel Data Meeting # 6 1 / 36 Outline 1 The First-Difference (FD) estimator 2 Dynamic panel data models 3 The Anderson and Hsiao

More information

Graduate Econometrics I: Maximum Likelihood II

Graduate Econometrics I: Maximum Likelihood II Graduate Econometrics I: Maximum Likelihood II Yves Dominicy Université libre de Bruxelles Solvay Brussels School of Economics and Management ECARES Yves Dominicy Graduate Econometrics I: Maximum Likelihood

More information

COMPARISON OF GMM WITH SECOND-ORDER LEAST SQUARES ESTIMATION IN NONLINEAR MODELS. Abstract

COMPARISON OF GMM WITH SECOND-ORDER LEAST SQUARES ESTIMATION IN NONLINEAR MODELS. Abstract Far East J. Theo. Stat. 0() (006), 179-196 COMPARISON OF GMM WITH SECOND-ORDER LEAST SQUARES ESTIMATION IN NONLINEAR MODELS Department of Statistics University of Manitoba Winnipeg, Manitoba, Canada R3T

More information

Week 9 The Central Limit Theorem and Estimation Concepts

Week 9 The Central Limit Theorem and Estimation Concepts Week 9 and Estimation Concepts Week 9 and Estimation Concepts Week 9 Objectives 1 The Law of Large Numbers and the concept of consistency of averages are introduced. The condition of existence of the population

More information

Accounting for Missing Values in Score- Driven Time-Varying Parameter Models

Accounting for Missing Values in Score- Driven Time-Varying Parameter Models TI 2016-067/IV Tinbergen Institute Discussion Paper Accounting for Missing Values in Score- Driven Time-Varying Parameter Models André Lucas Anne Opschoor Julia Schaumburg Faculty of Economics and Business

More information

Transformation-based Nonparametric Estimation of Multivariate Densities

Transformation-based Nonparametric Estimation of Multivariate Densities Transformation-based Nonparametric Estimation of Multivariate Densities Meng-Shiuh Chang Ximing Wu March 9, 2013 Abstract We propose a probability-integral-transformation-based estimator of multivariate

More information

A Bayesian Nonparametric Approach to Monotone Missing Data in Longitudinal Studies with Informative Missingness

A Bayesian Nonparametric Approach to Monotone Missing Data in Longitudinal Studies with Informative Missingness A Bayesian Nonparametric Approach to Monotone Missing Data in Longitudinal Studies with Informative Missingness A. Linero and M. Daniels UF, UT-Austin SRC 2014, Galveston, TX 1 Background 2 Working model

More information

The consequences of misspecifying the random effects distribution when fitting generalized linear mixed models

The consequences of misspecifying the random effects distribution when fitting generalized linear mixed models The consequences of misspecifying the random effects distribution when fitting generalized linear mixed models John M. Neuhaus Charles E. McCulloch Division of Biostatistics University of California, San

More information

A Conditional Approach to Modeling Multivariate Extremes

A Conditional Approach to Modeling Multivariate Extremes A Approach to ing Multivariate Extremes By Heffernan & Tawn Department of Statistics Purdue University s April 30, 2014 Outline s s Multivariate Extremes s A central aim of multivariate extremes is trying

More information

Robust Estimation of Inverse Probability Weights for Marginal Structural Models

Robust Estimation of Inverse Probability Weights for Marginal Structural Models Robust Estimation of Inverse Probability Weights for Marginal Structural Models Kosuke IMAI and Marc RATKOVIC Marginal structural models (MSMs) are becoming increasingly popular as a tool for causal inference

More information

Causal Inference in Observational Studies with Non-Binary Treatments. David A. van Dyk

Causal Inference in Observational Studies with Non-Binary Treatments. David A. van Dyk Causal Inference in Observational Studies with Non-Binary reatments Statistics Section, Imperial College London Joint work with Shandong Zhao and Kosuke Imai Cass Business School, October 2013 Outline

More information

Nonparametric Estimation of the Dependence Function for a Multivariate Extreme Value Distribution

Nonparametric Estimation of the Dependence Function for a Multivariate Extreme Value Distribution Nonparametric Estimation of the Dependence Function for a Multivariate Extreme Value Distribution p. /2 Nonparametric Estimation of the Dependence Function for a Multivariate Extreme Value Distribution

More information

Statistics: Learning models from data

Statistics: Learning models from data DS-GA 1002 Lecture notes 5 October 19, 2015 Statistics: Learning models from data Learning models from data that are assumed to be generated probabilistically from a certain unknown distribution is a crucial

More information

Gaussian Process Vine Copulas for Multivariate Dependence

Gaussian Process Vine Copulas for Multivariate Dependence Gaussian Process Vine Copulas for Multivariate Dependence José Miguel Hernández-Lobato 1,2 joint work with David López-Paz 2,3 and Zoubin Ghahramani 1 1 Department of Engineering, Cambridge University,

More information

Generalized Autoregressive Score Models

Generalized Autoregressive Score Models Generalized Autoregressive Score Models by: Drew Creal, Siem Jan Koopman, André Lucas To capture the dynamic behavior of univariate and multivariate time series processes, we can allow parameters to be

More information

Ultra High Dimensional Variable Selection with Endogenous Variables

Ultra High Dimensional Variable Selection with Endogenous Variables 1 / 39 Ultra High Dimensional Variable Selection with Endogenous Variables Yuan Liao Princeton University Joint work with Jianqing Fan Job Market Talk January, 2012 2 / 39 Outline 1 Examples of Ultra High

More information

Conditional Least Squares and Copulae in Claims Reserving for a Single Line of Business

Conditional Least Squares and Copulae in Claims Reserving for a Single Line of Business Conditional Least Squares and Copulae in Claims Reserving for a Single Line of Business Michal Pešta Charles University in Prague Faculty of Mathematics and Physics Ostap Okhrin Dresden University of Technology

More information

Modeling Real Estate Data using Quantile Regression

Modeling Real Estate Data using Quantile Regression Modeling Real Estate Data using Semiparametric Quantile Regression Department of Statistics University of Innsbruck September 9th, 2011 Overview 1 Application: 2 3 4 Hedonic regression data for house prices

More information

Frailty Models and Copulas: Similarities and Differences

Frailty Models and Copulas: Similarities and Differences Frailty Models and Copulas: Similarities and Differences KLARA GOETHALS, PAUL JANSSEN & LUC DUCHATEAU Department of Physiology and Biometrics, Ghent University, Belgium; Center for Statistics, Hasselt

More information

arxiv: v2 [stat.me] 7 Oct 2017

arxiv: v2 [stat.me] 7 Oct 2017 Weighted empirical likelihood for quantile regression with nonignorable missing covariates arxiv:1703.01866v2 [stat.me] 7 Oct 2017 Xiaohui Yuan, Xiaogang Dong School of Basic Science, Changchun University

More information