Summary of Extending the Rank Likelihood for Semiparametric Copula Estimation, by Peter Hoff

Size: px

Start display at page:

Download "Summary of Extending the Rank Likelihood for Semiparametric Copula Estimation, by Peter Hoff"

Leonard Wilson
5 years ago
Views:

1 Summary of Extending the Rank Likelihood for Semiparametric Copula Estimation, by Peter Hoff David Gerard Department of Statistics University of Washington May 2, 2013 David Gerard (UW) May /4

2 Recall: General Social Survey data Respondent s Income Highest degree obtained CHILD Number of children P Parent s income when respondent was 16 P Max(mother s degree, father s degree, na.rm = T) PCHILD Number of siblings + 1 AGE Age of respondent David Gerard (UW) May /4

3 Recall: Which do we choose? or i =β 0 + β 1 CHILD i + β 2 i + β 3 AGE i + β 4 PCHILD i + β 5 P i + β 6 P i + ɛ i CHILD i Pois(exp{β 0 + β 1 i + β 2 i + β 3 AGE i + β 4 PCHILD i + β 5 P i + β 6 P i }) David Gerard (UW) May /4

4 Will Sandwich Help? Response CHILD NA 1.103(0.112) 7.025(<0.001) CHILD 0.005( ) NA (0.056) Response AGE PCHILD P P 0.335(<0.001) 0.284(0.407) 4.070(0.001) 1.399(0.115) CHILD 0.037(<0.001) 0.021(0.080) (0.195) (0.204) David Gerard (UW) May /4

5 Will Sandwich Help? Response CHILD NA 1.103(0.112) 7.025(<0.001) CHILD 0.005( ) NA (0.056) Response AGE PCHILD P P 0.335(<0.001) 0.284(0.407) 4.070(0.001) 1.399(0.115) CHILD 0.037(<0.001) 0.021(0.080) (0.195) (0.204) Using a sandwich estimator we have: Response CHILD... NA 1.103(0.108)... CHILD 0.005( ) NA... David Gerard (UW) May /4

6 Recall: Setup Let y i,j be the value of the j th variable taken on by the i th observational unit. Gaussian Copula Sampling Model z 1,..., z n C i.i.d. multivariate normal(0, C) y i,j = F 1 j [Φ(z i,j )] Estimate association parameters without having to estimate marginals. Do this by only using the partial ordering induced by the data: y k,j < y i,j z k,j < z i,j. A partial ordering is a total ordering without the totality condition (i.e., some elements may be incomparable). David Gerard (UW) May /4

7 The Extended Rank Likelihood Let D be the set of Z := (z i,j ) s that satisfy the partial ordering. Use the following marginal likelihood for inference: Pr(Z D C, F 1,..., F p ) = p(z C)dZ = Pr(Z D) D David Gerard (UW) May /4

8 Gibbs Sampler Full conditionals of the z i,j s are easy to derive, so we can implement a Gibb s sampler (using covariance matrix rather than correlation matrix, but doesn t matter for estimation). 1 Sample z i,j Z [ i, j], V from a truncated normal with the bounds set by the partial ordering and the conditional mean and variance found in the usual way: σ 2 j = V [j,j] V [j, j] V 1 [ j, j] V [ j,j] and µ = V [j, j] V [ j, j] Z T [i, j]. 2 Sample V from an inverse-wishart distribution (if you use the conjugate prior). 3 Let C [i,j] = V [i,j] / V [i,i] V [j,j] David Gerard (UW) May /4

9 Some Trace Plots CHILD C ij C ij scan scan David Gerard (UW) May /4

10 95% Posterior Credible Intervals CHILD P P PCHILD C ij Regression Coefficient CHILD P P PCHILD AGE CHILD P P PCHILD AGE CHILD P P PCHILD AGE CHILD P P PCHILD AGE CHILD P P PCHILD AGE CHILD P P PCHILD AGE David Gerard (UW) May /4

11 Visualizing Conditional Dependencies of GSS data P - + P + CHILD + PCHILD - Figure: Reduced conditional dependence graph for the General Social Survey data. David Gerard (UW) May /4

12 Posterior Predictive We can also sample from a posterior predictive distribution to do inference on y s. Compare to empirical distributions for model checking. 1 sample C p(c Z D); 2 sample z multivariate normal(0, C); 3 1 set y j = ˆF j (Φ(z j )). David Gerard (UW) May /4

13 Posterior Predictive and Empirical distributions of given P Pr( P = 1), n =28 Predictive Empirical Pr( P = 3), n =276 Predictive Empirical Pr( P = 5), n = Pr( P = 2), n =110 Predictive Empirical Pr( P = 4), n =97 Predictive Empirical Predictive Empirical David Gerard (UW) May /4

14 Posterior Predictive and Empirical distributions of given and P P n= Pr( Deg=0,P) P n= Pr( Deg=1,P) P n= Pr( Deg=2,P) P n= Pr( Deg=3,P) P n= Pr( Deg=4,P) David Gerard (UW) May /4

15 Partial Sufficiency The paper proves that ranks are G-sufficient and L-Sufficient when we have continuous marginals. However, when the data are discrete the partial ordering does not have either of these properties. David Gerard (UW) May /4

16 Groups Definition A collection G of 1-1 transformations of X (the sample space) is a group if 1 For all g 1, g 2 G, g 1 g 2 G 2 For all g G, g 1 G. Definition Two points x 1, x 2 X are equivalent if there exists a g G such that x 1 = gx 2. The sets of equivalent points are the orbits of G. Definition A function M is said to be maximally invariant if it is in 1-1 correspondence with the orbits of G. i.e. M(gx) = M(x) for all g G and M(x 1 ) = M(x 2 ) x 2 = gx 1 for some g G. David Gerard (UW) May /4

17 Groups G induces a group Ḡ over the parameter space. Let Ḡ = {ḡ : Ω Ω s.t. ḡθ = θ if X P θ and gx P θ }. There are also maximally invariant parameters for Ḡ. David Gerard (UW) May /4

18 G-Sufficiency Under a group of transformations, G, the maximally invariant statistic is called G-sufficient for the maximally invariant parameter. The ranks are the maximally invariant statistics under the group of continuous strictly increasing functions [Lehmann and Romano, 2005, pp ]. You can also put the correlation matrix in a 1-1 correspondence with the induced group Ḡ s orbits (but only if the marginals are continuous). Hence, the ranks are G-sufficient for the correlation matrix. Intuition: if we assume the marginals are unknown, then applying strictly increasing continuous functions to the data should not change the estimation problem David Gerard (UW) May /4

19 Simple Example Let X 1,..., X n i.i.d.n(µ, σ 2 ), Ω = {(µ, σ) : µ R, σ 2 > 0} and X = R. Let G = {g : g(x) = x + a1, a R} Then { g(x) = ( (X 1 ) + a, (..., X n + )} a) s.t. X i i.i.d. N(µ + a, σ 2 ) and µ µ + a Ḡ = ḡ : ḡ =. σ σ Orbits of Ω under Ḡ are defined by σ. So σ is the maximally invariant parameter. Orbits of X under G are defined by the differences from the mean (X 1 X,..., X n X ), so the differences are G-sufficient for σ and inference should only be based on the differences. Can extend these ideas to minimal G-sufficient ( maximal invariant reduction of minimal sufficient statistics ) [Barnard, 1963]. Here, s 2. David Gerard (UW) May /4

20 L-Sufficiency For {F 1,..., F p } F the marginal distributions, a statistic t(y) is L-sufficient for C if 1 t(y 0 ) = t(y 1 ) sup {F1,...,F p} F p(y 0 C, F 1,..., F p ) = sup {F1,...,F p} F p(y 1 C, F 1,..., F p ); and 2 p(t(y) C, F 1,..., F p ) = p(t(y) C) If there are no nuisance parameters, L-sufficiency becomes full sufficiency. Intuition: partition generated by L-sufficient statistic is at least as fine as the partition generated by the M.L.E. of C [Rémon, 1984] (i.e. MLE is function of ranks alone). David Gerard (UW) May /4

21 Future Work Develop maximum likelihood approach using the extended rank likelihood. Lots of trouble with this. [Pettitt, 1982] used an approximation for the rank likelihood in the standard multivariate normal setting. Run simulation study against the competitors [Genest et al., 1995] (just plugged in ECDF); [Olsson, 1979] (estimate threshholding values when number of categories is known) Develop technique for other copulas, then simulate and see what info is lost by using the wrong copula. David Gerard (UW) May /4

22 References Barnard, G. A. (1963). Some logical aspects of the fiducial argument. Journal of the Royal Statistical Society. Series B (Methodological), pages Genest, C., Ghoudi, K., and Rivest, L.-P. (1995). A semiparametric estimation procedure of dependence parameters in multivariate families of distributions. Biometrika, 82(3): Hoff, P. D. (2007). Extending the rank likelihood for semiparametric copula estimation. The Annals of Applied Statistics, pages Lehmann, E. E. L. and Romano, J. P. (2005). Testing statistical hypotheses. Springer Science+ Business Media. Olsson, U. (1979). Maximum likelihood estimation of the polychoric correlation coefficient. Psychometrika, 44(4): Pettitt, A. (1982). Inference for the linear model using a likelihood based on ranks. Journal of the Royal Statistical Society. Series B (Methodological), pages Rémon, M. (1984). On a concept of partial sufficiency: L-sufficiency. International Statistical Review/Revue Internationale de Statistique, pages David Gerard (UW) May /4

Marginal Specifications and a Gaussian Copula Estimation

Marginal Specifications and a Gaussian Copula Estimation Kazim Azam Abstract Multivariate analysis involving random variables of different type like count, continuous or mixture of both is frequently required