Bayesian Inference for Conditional Copula models
|
|
- Meryl Payne
- 5 years ago
- Views:
Transcription
1 Bayesian Inference for Conditional Copula models Radu Craiu Department of Statistical Sciences University of Toronto Joint with Evgeny Levi (Toronto), Avideh Sabeti (Toronto) and Mian Wei (Toronto) CRM, Montreal January 27, 2017
2 Outline Introduction Conditional Copulas Motivation for CC Understanding a Dependence Pattern Building Multivariate Distributions Prediction Copula Misspecification Calibration Estimation Frequentist methods Bayesian methods Calibration Models when dim(x ) >> 1 Bayesian Additive Model GP-SIM Numerical Illustrations Estimation Model Selection - Copula and Calibration Selection Criteria: CVML and WAIC
3 Copulas Copula functions are used to model dependence between continuous random variables. (Sklar, 59) If Y 1, Y 2 are continuous r.v. s with distribution functions (df) F, G, there exists an unique copula function C : [0, 1] [0, 1] [0, 1] such that H(t, s) = Pr(Y 1 t, Y 2 s) = C(F (t), G(s)). C is a distribution function on [0, 1] 2 with uniform margins. The copula bridges the marginal distributions with the joint distribution.
4 Inference for Copula Models Early on: abundance of theoretical developments: construction of new copula families and connections with dependence concepts (NA, PQD/NQD, PRD/NRD, etc). Joe ( 97), Nelsen ( 06). Statistical inference for constant copula models: Joint Maximum Likelihood: numerical methods Two-stage approach: Joe (JMVA, 05) Semiparametric approach: Genest, Khoudi & Rivest (Bmka, 95) Nonparametric approach: Pickands (BISI, 81); Capéraà, Fougères & Genest (Biomka, 97). Copula goodness-of-fit and selection (Genest, Remillard & Beaudoin, IME 09; Genest, Quessy & Remillard, SJS 06; Fermanian, JMVA 05; Berg, Eur. J. of Finance, 09).
5 Conditional Copula Consider a random sample {x i R d, y 1i R, y 2i R} 1 i n and suppose F X (y 1 ) and G X (y 2 ) are the unknown marginal conditional cdf s. The bivariate conditional copula (CC) of (Y 1, Y 2 ) X = x, is the conditional joint distribution function of U = F x (Y 1 ) and V = G x (Y 2 ) given X = x (Patton, Int l Econ. Rev. 06) H x (t, s) = C x (F x (t), G x (s)) The parametric bivariate CC model assumes there is a parametric family C = {C θ : θ Θ} s.t. C x (F x (t), G x (s)) = C θ(x) (F x (Y 1 ), G x (Y 2 )). The simplifying assumption: C x (F x (y 1 ), G x (y 2 )) = C(F x (y 1 ), G x (y 2 )).
6 Why CC? Understanding the Dependence Pattern We are interested in understanding the covariate effect on the dependence pattern between responses. The smoking cessation study of Liu, Daniels and Marcus (JASA 09) : Q = smoking cessation (0=No, 1=Yes) W = weight change X = time spent exercising Does exercise weaken the association between smoking status and weight gain?
7 Why CC? Building General Multivariate Distributions Joint models for multivariate data. If the joint distribution of U 1, U 2, U 3 (U i Uniform(0, 1), 1 i 3) is modelled using the pair copula model then c(u 1, u 2, u 3 ) = c 12 (u 1, u 2 )c 23 (u 2, u 3 )c 13 2 (u 1 2, u 3 2 ; u 2 ) where u k 2 = Pr(U k u k U 2 = u 2 ). As dimension increases, the bivariate conditional copulas depend on increasing number of variables.
8 Why CC? Regression-based prediction In a bivariate CC model the joint density is h x (y 1, y 2 ) = f x (y 1 )g x (y 2 )c θ(x) (F x (y 1 ), G x (y 2 )). The conditional density of Y 1 Y 2 = y 2, X = x is h x (y 1 y 2 ) = f x (y 1 )c θ(x) (F x (y 1 ), G x (y 2 )). This can be useful when for each item a subset of the response variables is much easier to measure than the rest.
9 Propagation of Errors Errors can appear from multiple sources: c θ(x)+δ3 (x)(f x (y 1 ) + δ 1 (x), G x (y 2 ) + δ 2 (x)) = error due to F x {}}{ + c (1,0,0) θ(x) (F x (y 1 ), G x (y 2 )) δ 1 (x) + error due to θ(x) {}}{ + c (0,0,1) θ(x) (F x (y 1 ), G x (y 2 ))δ 3 (x) +O( δ(x) 2 ) correct part {}}{ c θ(x) (F x (y 1 ), G x (y 2 )) error due to G x {}}{ c (0,1,0) θ(x) (F x (y 1 ), G x (y 2 ))δ 2 (x) +
10 Estimation of θ(x) - frequentist approaches Acar, Craiu & Yao (Biomcs, 2011) - semiparametric estimation. Parametric marginals, θ(x) is approximated nonparametrically via local polynomial estimation. Veraverbeke, Omelka & Gijbels (SJS, 2011) - nonparametric estimation of the copula and marginals. We observed that the copula estimator may be severely biased if any of the conditional marginal distributions change with the value of the covariate X = x (V., O. & G, 2011) Nonparametric estimates in large-ish dimensions d suffer from curse of dimensionality, unless the volume of data is huge.
11 CC: The SA Condition When d = dim(x ) >> 1 the curse of dimensionality can be alleviated with dimension reduction models. The SA leads to dimension crushing : C x (F x (y 1 ), G x (y 2 )) = C(F x (y 1 ), G x (y 2 )) but when is it justifiable? Acar, Genest & Ne slehová (JMVA, 2012) - discuss the bias incurred when SA is not justified. Acar, Craiu & Yao (EJS, 2013) - Generalized LRT to test a constant or linear null calibration against a general alternative. Gijbels, Omelka & Veraverbeke (Statistics, 2016) - nonparametric testing procedures. Derumigny & Fermanian (arxiv, 2016) - review of state-of-art and a work program around SA for the next years.
12 CC: d = dim(x ) > 1 Sabeti, Wei & Craiu (Stat, 2014) - Bayesian additive CC models. Chavez-Demoulin & Vatter (JMVA, 2015) - Generalized additive models. Lobato, Lloyd & Lobato (NIPS, 2013) - Gaussian Process models for CC in financial time series. Levi & Craiu - Gaussian Process with Single Index Models.
13 Why a Bayesian approach? Joint modelling can be a bit easier as selection of most tuning parameters is automatic and data driven. The posterior distribution accounts for all sources of variation (included in the model). The Monte Carlo samples from the posterior are used to compute finite sample variance estimates, pointwise credible regions and model selection criteria. Priors can be used to favour model sparsity.
14 Additive Model for Calibration X R d Y 1, Y 2 are continuous r.v. s, Y i N (µ i (X ), σi 2 ), for i = 1, 2. Jointly, 2 ( ) 1 Yi µ i (X ) h X (Y 1, Y 2 ) = φ σ i σ i i=1 [ ( ) ( ) Y1 µ 1 (X ) Y2 µ 2 (X ) c Φ, Φ θ(x )], where c(u, v θ) is the pdf of the conditional copula. σ 1 σ 2
15 Additive Model for Calibration Usually there is little/no information about the shape of θ(x ). Generally θ(x ) has a restricted range, so we estimate the calibration function η : R d R where g(θ(x i )) = η(x i ) (g is user-specified). We assume that η(x 1,..., x d ) = α 0 + d ηi (x i ). i=1 Additivity is not preserved when changing dependence measure.
16 Additive Model for Calibration Sabeti, Wang & Craiu (Stat, 2013) use an AM: Each η i has the form: η i (x i ) = 3 j=1 α (i) j x j K (i) i + k=1 ψ (i) k (x i γ (i) k )3 +. Number and location of knots {γ1,..., γ K (i)} is important. The knot-related choices are data driven. Partition the range of X i into K max intervals, I (i) k, and introduce auxiliary variables { 1 if there is a knot γ (i) k in I (i) k ζ (i) k = η i (x i) = 3 and ψ (i) k 0 0 if there is no knot in I (i) k and ψ k = 0 j=1 α(i) j x j i + K max k=1 ψ(i) k ζ(i) k (x i γ (i) k )3 +,
17 Gaussian Process Prior for CC GP is a flexible method when errors are reasonably approximated by Gaussians. GP for marginals when means could be nonlinear functions of X. GP for calibration function could be used in conjunction with other marginal models. Vanilla GP is not helping with the curse of dimensionality and can be expensive when n is large so modifications are needed.
18 Gaussian Process Prior GP prior for smooth f without specifying the form of f. For x [ 5, 5] n, consider f N n (0, K(x, x)) where K ij (x, x) = k(x i, x j ) and f i = f (x i ) L = 1 L = output, f(x) 0 output, f(x) input, x input, x Random functions f generated from a GP prior when n = 100 Cov(f (x i ), f (x j )) = k(x i, x j ) = exp{ 0.5 x i x j 2 L }.
19 Gaussian Process Estimation Observe {y i : 1 i n} noisy realizations of f (x i ) i=1,n, y i = f (x i ) + ɛ i, ɛ i N(0, σ 2 ). When interested in predicting f = (f (x j )) j=1,q use ( y f ) ( [ K(x, x) + σ N n+q 0, 2 I n K(x, x ) K(x, x ) K(x, x ) The conditional distribution of f is Gaussian with E(f y) = K(x, x) expensive for large n {}}{ [K(x, x) + σ 2 I q ] 1 y V (f y) = K(x, x ) K(x, x) [K(x, x) + σ 2 I q ] 1 K(x, x ) }{{} expensive for large n ])
20 Computational challenges for GP When n is large the computation effort is prohibitive so we adopt a sparse GP approach. The information about f in the data is funnelled using a smaller sample of size m << n of inducing (or latent) variables x g, 1 g m. Let f = (f ( x 1 ),..., f ( x m )) T we assume f N(0, K( X, X ). The (Gaussian) conditional distribution of f X, X involves K(X, X ) and the inverse of a m m matrix. Integrating out f yields the (Gaussian) conditional distribution of f X, X with similar computational burden.
21 Gaussian Process Estimation Suppose we know that f : R R and ( 4, 3, 1, 0, 2) f ( 2, 0, 1, 2, 1) y i f i N(f i, σ 2 ), f i = f (x i ), 1 i 5 L = 1 L = output, f(x) 0 output, f(x) input, x input, x Samples from the posterior distribution of f y, when σ 2 = 0.1.
22 Modelling η when d > 1 Assume that θ(x i ) = g 1 (f (X i )) and f = (f (X 1 ), f (X 2 ),..., f (X n )) T N (0, K(X, X ; w)), The (i, j)th element of matrix K(X, X ; w) is k(x i, x j ; w) = e w 0 exp [ ] d (x is x js ) 2. s=1 The covariance between outputs is defined as a function of inputs. The parameters w in the covariance function k determine distance between inputs with a significant difference in the outputs. e ws
23 Modelling η when d > 1 When X i R d a full GP model for the CC has 1 + d parameters. We consider instead the SIM model f (X ) = f (β T X ). GP-SIM model is invariant to nonlinear one-to-one transformations τ(θ). The parameter β is unidentifiable up to a constant so we assume β = 1. Marginals are fitted also using GP-SIM models.
24 Proof of concept Sc1 f 1 (x) = 0.6 sin(5x 1 ) 0.9 sin(2x 2 ), f 2 (x) = 0.6 sin(3x 1 + 5x 2 ), τ(x) = sin(15x T β) β = (1, 3) T / 10, σ 1 = σ 2 = 0.2 n = 400 Clayton Frank Gaussian Clayton SA Scenario IBias 2 IVar IMSE IBias 2 IVar IMSE IBias 2 IVar IMSE IBias 2 IVar IMSE Sc Integrated error for the estimator of τ(x).
25 Estimation of τ(x) X1=0.2 X1=0.8 Kendall's tau Kendall's tau X X2 X2=0.2 X2=0.8 Kendall's tau Kendall's tau X X1
26 Prediction performance If y i x N(µ i (x), σi 2 ), i = 1, 2 then 1 ( )) E x [Y 1 Y 2 = y 2 ] = µ 1 (x)+σ 1 Φ 1 y2 µ 2 (x) (z)c θ(x) (z, Φ dz. 0 σ 2 E(Y1), X1=0.2 E(Y1), X1=0.8 E(Y1) E(Y1) X X2 E(Y1), X2=0.2 E(Y1), X2=0.8 E(Y1) E(Y1) X X1
27 Model misspecification effects Marginals: f1 (x) = 0.6 sin(5x 1 ) 0.9 sin(2x 2 ) f2 (x) = 0.6 sin(3x 1 + 5x 2 ) σ 1 = σ 2 = 0.2 Copula: β = (1, 3)/ 10 τ(x) = sin(5x T β) Frank copula Y1 and X test points: Y 1 = 1.5, X = (0.3, 0.3) Y1 = 1.0, X = (0.3, 0.7) Y1 = 0.5, X = (0.7, 0.3) Y1 = 0.0, X = (0.7, 0.7) Y 1 = 0.5, X = (0.5, 0.5)
28 Model misspecification effects Y2 Y1,X Tau(X) Y Tau Y Y1 Black - true; Red - Correct model; Blue - Correct copula with SA Green - Wrong copula; Purple - Correct copula with missing covariate
29 Model misspecification effects Marginals f1 (x) = 0.6 sin(5x 1 ) 0.9 sin(2x 2 ) f 2 (x) = 0.6 sin(3x 1 + 5x 2 ) σ 1 = σ 2 = 0.2, X 1 X 2. Copula: τ(x) = 0.71 Model: A nonparametric model for marginals and CC based on only x1.
30 Model misspecification effects Marginals f1 (x) = 0.6 sin(5x 1 ) 0.9 sin(2x 2 ) f 2 (x) = 0.6 sin(3x 1 + 5x 2 ) σ 1 = σ 2 = 0.2, X 1 X 2. Copula: τ(x) = 0.71 Model: A nonparametric model for marginals and CC based on only x1. Kendalls Tau x_1
31 Model misspecification effects Suppose E[Y i X 1, X 2 ] = f i (α i X 1 + β i X 2 ), i=1,2. Then fit {}}{ f i (x 1, x 2 ) = f i (x 1, 0) +f (0,1) i (x 1, 0)β i x 2 + f (0,2) i (x 1, 0) β2 i x O( x2 3 ), i = 1, 2 The marginal residuals still contain high-order information about x 1 which varies with the distribution of x 2.
32 Model misspecification effects Suppose E[Y i X 1, X 2 ] = f i (α i X 1 + β i X 2 ), i=1,2. Then fit {}}{ f i (x 1, x 2 ) = f i (x 1, 0) +f (0,1) i (x 1, 0)β i x 2 + f (0,2) i (x 1, 0) β2 i x O( x2 3 ), i = 1, 2 The marginal residuals still contain high-order information about x 1 which varies with the distribution of x 2. Kendalls Tau Kendalls Tau Kendalls Tau x_ x_ x_1
33 Three Model Selection Problems Choice of copula family. Choice of calibration Simplifying Assumption or not? AM or GP-SIM? Covariate selection.
34 CV Marginal Likelihood (CVML) Calculates the average (over parameter values) prediction potential for model M via CVML(M) = n log (P(Y 1i, Y 2i D i, M)), i=1 D i is the data set from which the ith observation has been removed. Estimate CVML using E π [ P(Y1i, Y 2i ω, M) 1] = P(Y 1i, Y 2i D i, M) 1 where ω represents the vector of all the parameters and latent variables in the model.
35 Watanabe-Akaike Information Criterion An alternative approach to estimating the log pointwise predictive density (Watanabe, JMLR 10; Vehtari, Gelman & Gabry, Stat. Comput, 16). WAIC is defined as WAIC(M) = 2fit(M) + 2p(M) N fit(m) = i=1 log E π [P(y 1i, y 2i ω, M)] N p(m) = i=1 Var π[log P(y 1i, y 2i ω, M)]. fit(m) and the penalty p(m) are computed using the posterior samples.
36 Another Two Scenarios Sc3 f 1 (x) = 0.6 sin(5x 1 ) 0.9 sin(2x 2 ) f 2 (x) = 0.6 sin(3x 1 + 5x 2 ) τ(x) = 0.5 σ 1 = σ 2 = 0.2 SA is true Sc4 f 1 (x) = 0.6 sin(5x 1 ) 0.9 sin(2x 2 ) f 2 (x) = 0.6 sin(3x 1 + 5x 2 ) η(x) = sin(3x 3 1 ) 0.5 cos(6x 2 2 ) σ 1 = σ 2 = 0.2 AM model is true
37 Copula Selection - Results Frank Gaussian Clayton SA Scenario CVML WAIC CVML WAIC CVML WAIC Sc1 100% 100% 100% 100% 98% 98% Sc4 100% 100% 100% 100% 100% 100%
38 Calibration Selection - Results Clayton Frank Gaussian Scenario CVML WAIC CVML WAIC CVML WAIC Sc3 78% 78% 100% 100% 100% 100% Scenario CVML WAIC Sc 1 96% 96% Sc4 94% 94%
39 Red Wine 11 Physiochemical properties of 1599 varieties of red Vinho Verde Portuguese wine. We consider the dependence between fixed acidity and density. The former is strongly associated with quality of wine, while the latter is used as a measure of grape quality. Covariates: volatile acidity, citric acid, residual sugar, chlorides, free sulfur dioxide, total sulfur dioxide, ph, sulphates, alcohol.
40 Red Wine volatile.acidity citric.acid residual.sugar Kendalls Tau Kendalls Tau Kendalls Tau chlorides free.sulfur.dioxide total.sulfur.dioxide Kendalls Tau Kendalls Tau Kendalls Tau ph sulphates alcohol Kendalls Tau Kendalls Tau Kendalls Tau
41 Red Wine 2.5% quantile mean 97.5% quantile volatile acidity citric acid residual sugar chlorides free sulfur dioxide total sulfur dioxide ph sulphates alcohol Red wine data: posterior estimates and credible regions for β.
42 Conclusions Flexible modelling for CC is necessary. CC models can be useful in prediction when the dependence is strong. The SA is difficult to detect without a properly calibrated penalty for model complexity. Impact of model misspecification can be significant. Robust CC s? How to extend these methods to more than two responses (e.g. vines). CC-based models in clustering, adaptive MCMC, spatial models, statistical genetics.
arxiv: v3 [stat.me] 25 May 2017
Bayesian Inference for Conditional Copulas using Gaussian Process Single Index Models Evgeny Levi Radu V. Craiu arxiv:1603.0308v3 [stat.me] 5 May 017 Department of Statistical Sciences, University of Toronto
More informationBayesian Inference for Conditional Copula models with Continuous and Binary Responses
Bayesian Inference for Conditional Copula models with Continuous and Binary Responses Radu Craiu Department of Statistics University of Toronto Joint with Avideh Sabeti (Toronto) and Mian Wei (Toronto)
More informationWeb-based Supplementary Material for. Dependence Calibration in Conditional Copulas: A Nonparametric Approach
1 Web-based Supplementary Material for Dependence Calibration in Conditional Copulas: A Nonparametric Approach Elif F. Acar, Radu V. Craiu, and Fang Yao Web Appendix A: Technical Details The score and
More informationGaussian Process Vine Copulas for Multivariate Dependence
Gaussian Process Vine Copulas for Multivariate Dependence José Miguel Hernández-Lobato 1,2 joint work with David López-Paz 2,3 and Zoubin Ghahramani 1 1 Department of Engineering, Cambridge University,
More informationMarginal Specifications and a Gaussian Copula Estimation
Marginal Specifications and a Gaussian Copula Estimation Kazim Azam Abstract Multivariate analysis involving random variables of different type like count, continuous or mixture of both is frequently required
More informationDependence Calibration in Conditional Copulas: A Nonparametric Approach
Biometrics DOI: 10.1111/j.1541-0420.2010.01472. Dependence Calibration in Conditional Copulas: A Nonparametric Approach Elif F. Acar, Radu V. Craiu, and Fang Yao Department of Statistics, University of
More informationEstimation of Copula Models with Discrete Margins (via Bayesian Data Augmentation) Michael S. Smith
Estimation of Copula Models with Discrete Margins (via Bayesian Data Augmentation) Michael S. Smith Melbourne Business School, University of Melbourne (Joint with Mohamad Khaled, University of Queensland)
More informationGaussian Process Vine Copulas for Multivariate Dependence
Gaussian Process Vine Copulas for Multivariate Dependence José Miguel Hernández Lobato 1,2, David López Paz 3,2 and Zoubin Ghahramani 1 June 27, 2013 1 University of Cambridge 2 Equal Contributor 3 Ma-Planck-Institute
More informationDisease mapping with Gaussian processes
EUROHEIS2 Kuopio, Finland 17-18 August 2010 Aki Vehtari (former Helsinki University of Technology) Department of Biomedical Engineering and Computational Science (BECS) Acknowledgments Researchers - Jarno
More informationSTAT 518 Intro Student Presentation
STAT 518 Intro Student Presentation Wen Wei Loh April 11, 2013 Title of paper Radford M. Neal [1999] Bayesian Statistics, 6: 475-501, 1999 What the paper is about Regression and Classification Flexible
More informationPractical Bayesian Optimization of Machine Learning. Learning Algorithms
Practical Bayesian Optimization of Machine Learning Algorithms CS 294 University of California, Berkeley Tuesday, April 20, 2016 Motivation Machine Learning Algorithms (MLA s) have hyperparameters that
More informationFractional Hot Deck Imputation for Robust Inference Under Item Nonresponse in Survey Sampling
Fractional Hot Deck Imputation for Robust Inference Under Item Nonresponse in Survey Sampling Jae-Kwang Kim 1 Iowa State University June 26, 2013 1 Joint work with Shu Yang Introduction 1 Introduction
More informationAccounting for extreme-value dependence in multivariate data
Accounting for extreme-value dependence in multivariate data 38th ASTIN Colloquium Manchester, July 15, 2008 Outline 1. Dependence modeling through copulas 2. Rank-based inference 3. Extreme-value dependence
More informationLecture: Gaussian Process Regression. STAT 6474 Instructor: Hongxiao Zhu
Lecture: Gaussian Process Regression STAT 6474 Instructor: Hongxiao Zhu Motivation Reference: Marc Deisenroth s tutorial on Robot Learning. 2 Fast Learning for Autonomous Robots with Gaussian Processes
More informationGaussian kernel GARCH models
Gaussian kernel GARCH models Xibin (Bill) Zhang and Maxwell L. King Department of Econometrics and Business Statistics Faculty of Business and Economics 7 June 2013 Motivation A regression model is often
More informationOn the Choice of Parametric Families of Copulas
On the Choice of Parametric Families of Copulas Radu Craiu Department of Statistics University of Toronto Collaborators: Mariana Craiu, University Politehnica, Bucharest Vienna, July 2008 Outline 1 Brief
More informationCalibration Estimation of Semiparametric Copula Models with Data Missing at Random
Calibration Estimation of Semiparametric Copula Models with Data Missing at Random Shigeyuki Hamori 1 Kaiji Motegi 1 Zheng Zhang 2 1 Kobe University 2 Renmin University of China Econometrics Workshop UNC
More informationEfficient estimation of a semiparametric dynamic copula model
Efficient estimation of a semiparametric dynamic copula model Christian Hafner Olga Reznikova Institute of Statistics Université catholique de Louvain Louvain-la-Neuve, Blgium 30 January 2009 Young Researchers
More informationNonparametric Methods for Interpretable Copula Calibration and Sparse Functional Classification. Jialin Zou
Nonparametric Methods for Interpretable Copula Calibration and Sparse Functional Classification by Jialin Zou A thesis submitted in conformity with the requirements for the degree of Doctor of Philosophy
More informationCopulas. MOU Lili. December, 2014
Copulas MOU Lili December, 2014 Outline Preliminary Introduction Formal Definition Copula Functions Estimating the Parameters Example Conclusion and Discussion Preliminary MOU Lili SEKE Team 3/30 Probability
More informationCSci 8980: Advanced Topics in Graphical Models Gaussian Processes
CSci 8980: Advanced Topics in Graphical Models Gaussian Processes Instructor: Arindam Banerjee November 15, 2007 Gaussian Processes Outline Gaussian Processes Outline Parametric Bayesian Regression Gaussian
More informationA Bayesian Nonparametric Approach to Monotone Missing Data in Longitudinal Studies with Informative Missingness
A Bayesian Nonparametric Approach to Monotone Missing Data in Longitudinal Studies with Informative Missingness A. Linero and M. Daniels UF, UT-Austin SRC 2014, Galveston, TX 1 Background 2 Working model
More informationSemi-parametric predictive inference for bivariate data using copulas
Semi-parametric predictive inference for bivariate data using copulas Tahani Coolen-Maturi a, Frank P.A. Coolen b,, Noryanti Muhammad b a Durham University Business School, Durham University, Durham, DH1
More informationStat 5101 Lecture Notes
Stat 5101 Lecture Notes Charles J. Geyer Copyright 1998, 1999, 2000, 2001 by Charles J. Geyer May 7, 2001 ii Stat 5101 (Geyer) Course Notes Contents 1 Random Variables and Change of Variables 1 1.1 Random
More informationGaussian Processes. Le Song. Machine Learning II: Advanced Topics CSE 8803ML, Spring 2012
Gaussian Processes Le Song Machine Learning II: Advanced Topics CSE 8803ML, Spring 01 Pictorial view of embedding distribution Transform the entire distribution to expected features Feature space Feature
More informationA Significance Test for the Lasso
A Significance Test for the Lasso Lockhart R, Taylor J, Tibshirani R, and Tibshirani R Ashley Petersen June 6, 2013 1 Motivation Problem: Many clinical covariates which are important to a certain medical
More informationGaussian Process Regression
Gaussian Process Regression 4F1 Pattern Recognition, 21 Carl Edward Rasmussen Department of Engineering, University of Cambridge November 11th - 16th, 21 Rasmussen (Engineering, Cambridge) Gaussian Process
More informationChapter 1. Bayesian Inference for D-vines: Estimation and Model Selection
Chapter 1 Bayesian Inference for D-vines: Estimation and Model Selection Claudia Czado and Aleksey Min Technische Universität München, Zentrum Mathematik, Boltzmannstr. 3, 85747 Garching, Germany cczado@ma.tum.de
More informationProbabilistic machine learning group, Aalto University Bayesian theory and methods, approximative integration, model
Aki Vehtari, Aalto University, Finland Probabilistic machine learning group, Aalto University http://research.cs.aalto.fi/pml/ Bayesian theory and methods, approximative integration, model assessment and
More informationFall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.
1. Let P be a probability measure on a collection of sets A. (a) For each n N, let H n be a set in A such that H n H n+1. Show that P (H n ) monotonically converges to P ( k=1 H k) as n. (b) For each n
More informationCalibration Estimation of Semiparametric Copula Models with Data Missing at Random
Calibration Estimation of Semiparametric Copula Models with Data Missing at Random Shigeyuki Hamori 1 Kaiji Motegi 1 Zheng Zhang 2 1 Kobe University 2 Renmin University of China Institute of Statistics
More informationApproximate Bayesian inference in semiparametric copula models
Approximate Bayesian inference in semiparametric copula models Clara Grazian 1 and Brunero Liseo 2 1 Nuffield Department of Medicine, University of Oxford arxiv:1503.02912v4 [stat.me] 16 Jul 2017 2 MEMOTEF,
More informationFractional Imputation in Survey Sampling: A Comparative Review
Fractional Imputation in Survey Sampling: A Comparative Review Shu Yang Jae-Kwang Kim Iowa State University Joint Statistical Meetings, August 2015 Outline Introduction Fractional imputation Features Numerical
More informationDefault Priors and Effcient Posterior Computation in Bayesian
Default Priors and Effcient Posterior Computation in Bayesian Factor Analysis January 16, 2010 Presented by Eric Wang, Duke University Background and Motivation A Brief Review of Parameter Expansion Literature
More informationA New Method for Varying Adaptive Bandwidth Selection
IEEE TRASACTIOS O SIGAL PROCESSIG, VOL. 47, O. 9, SEPTEMBER 1999 2567 TABLE I SQUARE ROOT MEA SQUARED ERRORS (SRMSE) OF ESTIMATIO USIG THE LPA AD VARIOUS WAVELET METHODS A ew Method for Varying Adaptive
More informationNonparametric Estimation of the Dependence Function for a Multivariate Extreme Value Distribution
Nonparametric Estimation of the Dependence Function for a Multivariate Extreme Value Distribution p. /2 Nonparametric Estimation of the Dependence Function for a Multivariate Extreme Value Distribution
More informationBayesian spatial quantile regression
Brian J. Reich and Montserrat Fuentes North Carolina State University and David B. Dunson Duke University E-mail:reich@stat.ncsu.edu Tropospheric ozone Tropospheric ozone has been linked with several adverse
More informationBayesian Inference for Pair-copula Constructions of Multiple Dependence
Bayesian Inference for Pair-copula Constructions of Multiple Dependence Claudia Czado and Aleksey Min Technische Universität München cczado@ma.tum.de, aleksmin@ma.tum.de December 7, 2007 Overview 1 Introduction
More informationConditional Copula Models for Right-Censored Clustered Event Time Data
Conditional Copula Models for Right-Censored Clustered Event Time Data arxiv:1606.01385v1 [stat.me] 4 Jun 2016 Candida Geerdens 1, Elif F. Acar 2, and Paul Janssen 1 1 Center for Statistics, Universiteit
More informationFinancial Econometrics and Volatility Models Copulas
Financial Econometrics and Volatility Models Copulas Eric Zivot Updated: May 10, 2010 Reading MFTS, chapter 19 FMUND, chapters 6 and 7 Introduction Capturing co-movement between financial asset returns
More informationModel Selection for Gaussian Processes
Institute for Adaptive and Neural Computation School of Informatics,, UK December 26 Outline GP basics Model selection: covariance functions and parameterizations Criteria for model selection Marginal
More informationFitting Multidimensional Latent Variable Models using an Efficient Laplace Approximation
Fitting Multidimensional Latent Variable Models using an Efficient Laplace Approximation Dimitris Rizopoulos Department of Biostatistics, Erasmus University Medical Center, the Netherlands d.rizopoulos@erasmusmc.nl
More informationModel Specification Testing in Nonparametric and Semiparametric Time Series Econometrics. Jiti Gao
Model Specification Testing in Nonparametric and Semiparametric Time Series Econometrics Jiti Gao Department of Statistics School of Mathematics and Statistics The University of Western Australia Crawley
More informationBivariate Degradation Modeling Based on Gamma Process
Bivariate Degradation Modeling Based on Gamma Process Jinglun Zhou Zhengqiang Pan Member IAENG and Quan Sun Abstract Many highly reliable products have two or more performance characteristics (PCs). The
More informationModelling Dropouts by Conditional Distribution, a Copula-Based Approach
The 8th Tartu Conference on MULTIVARIATE STATISTICS, The 6th Conference on MULTIVARIATE DISTRIBUTIONS with Fixed Marginals Modelling Dropouts by Conditional Distribution, a Copula-Based Approach Ene Käärik
More informationCalibration Estimation for Semiparametric Copula Models under Missing Data
Calibration Estimation for Semiparametric Copula Models under Missing Data Shigeyuki Hamori 1 Kaiji Motegi 1 Zheng Zhang 2 1 Kobe University 2 Renmin University of China Economics and Economic Growth Centre
More informationDensity Estimation. Seungjin Choi
Density Estimation Seungjin Choi Department of Computer Science and Engineering Pohang University of Science and Technology 77 Cheongam-ro, Nam-gu, Pohang 37673, Korea seungjin@postech.ac.kr http://mlg.postech.ac.kr/
More informationExtreme Value Analysis and Spatial Extremes
Extreme Value Analysis and Department of Statistics Purdue University 11/07/2013 Outline Motivation 1 Motivation 2 Extreme Value Theorem and 3 Bayesian Hierarchical Models Copula Models Max-stable Models
More informationCOMP 551 Applied Machine Learning Lecture 20: Gaussian processes
COMP 55 Applied Machine Learning Lecture 2: Gaussian processes Instructor: Ryan Lowe (ryan.lowe@cs.mcgill.ca) Slides mostly by: (herke.vanhoof@mcgill.ca) Class web page: www.cs.mcgill.ca/~hvanho2/comp55
More informationLecture Quantitative Finance Spring Term 2015
on bivariate Lecture Quantitative Finance Spring Term 2015 Prof. Dr. Erich Walter Farkas Lecture 07: April 2, 2015 1 / 54 Outline on bivariate 1 2 bivariate 3 Distribution 4 5 6 7 8 Comments and conclusions
More informationBayes methods for categorical data. April 25, 2017
Bayes methods for categorical data April 25, 2017 Motivation for joint probability models Increasing interest in high-dimensional data in broad applications Focus may be on prediction, variable selection,
More informationAssessing the VaR of a portfolio using D-vine copula based multivariate GARCH models
Assessing the VaR of a portfolio using D-vine copula based multivariate GARCH models Mathias Hofmann a,, Claudia Czado b a Technische Universität München Zentrum Mathematik Lehrstuhl für Mathematische
More informationSTA 216, GLM, Lecture 16. October 29, 2007
STA 216, GLM, Lecture 16 October 29, 2007 Efficient Posterior Computation in Factor Models Underlying Normal Models Generalized Latent Trait Models Formulation Genetic Epidemiology Illustration Structural
More informationEcon 582 Nonparametric Regression
Econ 582 Nonparametric Regression Eric Zivot May 28, 2013 Nonparametric Regression Sofarwehaveonlyconsideredlinearregressionmodels = x 0 β + [ x ]=0 [ x = x] =x 0 β = [ x = x] [ x = x] x = β The assume
More informationGAUSSIAN PROCESS REGRESSION
GAUSSIAN PROCESS REGRESSION CSE 515T Spring 2015 1. BACKGROUND The kernel trick again... The Kernel Trick Consider again the linear regression model: y(x) = φ(x) w + ε, with prior p(w) = N (w; 0, Σ). The
More informationInformation geometry for bivariate distribution control
Information geometry for bivariate distribution control C.T.J.Dodson + Hong Wang Mathematics + Control Systems Centre, University of Manchester Institute of Science and Technology Optimal control of stochastic
More informationModeling conditional distributions with mixture models: Applications in finance and financial decision-making
Modeling conditional distributions with mixture models: Applications in finance and financial decision-making John Geweke University of Iowa, USA Journal of Applied Econometrics Invited Lecture Università
More informationConstruction and estimation of high dimensional copulas
Construction and estimation of high dimensional copulas Gildas Mazo PhD work supervised by S. Girard and F. Forbes Mistis, Inria and laboratoire Jean Kuntzmann, Grenoble, France Séminaire Statistiques,
More informationGaussian Processes (10/16/13)
STA561: Probabilistic machine learning Gaussian Processes (10/16/13) Lecturer: Barbara Engelhardt Scribes: Changwei Hu, Di Jin, Mengdi Wang 1 Introduction In supervised learning, we observe some inputs
More informationConstrained Maximum Likelihood Estimation for Model Calibration Using Summary-level Information from External Big Data Sources
Constrained Maximum Likelihood Estimation for Model Calibration Using Summary-level Information from External Big Data Sources Yi-Hau Chen Institute of Statistical Science, Academia Sinica Joint with Nilanjan
More informationNonparametric Bayesian Methods (Gaussian Processes)
[70240413 Statistical Machine Learning, Spring, 2015] Nonparametric Bayesian Methods (Gaussian Processes) Jun Zhu dcszj@mail.tsinghua.edu.cn http://bigml.cs.tsinghua.edu.cn/~jun State Key Lab of Intelligent
More informationBayesian non-parametric model to longitudinally predict churn
Bayesian non-parametric model to longitudinally predict churn Bruno Scarpa Università di Padova Conference of European Statistics Stakeholders Methodologists, Producers and Users of European Statistics
More informationFlexible Regression Modeling using Bayesian Nonparametric Mixtures
Flexible Regression Modeling using Bayesian Nonparametric Mixtures Athanasios Kottas Department of Applied Mathematics and Statistics University of California, Santa Cruz Department of Statistics Brigham
More informationProbability Distribution And Density For Functional Random Variables
Probability Distribution And Density For Functional Random Variables E. Cuvelier 1 M. Noirhomme-Fraiture 1 1 Institut d Informatique Facultés Universitaires Notre-Dame de la paix Namur CIL Research Contact
More informationPart 6: Multivariate Normal and Linear Models
Part 6: Multivariate Normal and Linear Models 1 Multiple measurements Up until now all of our statistical models have been univariate models models for a single measurement on each member of a sample of
More informationLatent Variable Models for Binary Data. Suppose that for a given vector of explanatory variables x, the latent
Latent Variable Models for Binary Data Suppose that for a given vector of explanatory variables x, the latent variable, U, has a continuous cumulative distribution function F (u; x) and that the binary
More informationCopula Regression RAHUL A. PARSA DRAKE UNIVERSITY & STUART A. KLUGMAN SOCIETY OF ACTUARIES CASUALTY ACTUARIAL SOCIETY MAY 18,2011
Copula Regression RAHUL A. PARSA DRAKE UNIVERSITY & STUART A. KLUGMAN SOCIETY OF ACTUARIES CASUALTY ACTUARIAL SOCIETY MAY 18,2011 Outline Ordinary Least Squares (OLS) Regression Generalized Linear Models
More informationIntroduction to Gaussian Processes
Introduction to Gaussian Processes Iain Murray murray@cs.toronto.edu CSC255, Introduction to Machine Learning, Fall 28 Dept. Computer Science, University of Toronto The problem Learn scalar function of
More informationStat 542: Item Response Theory Modeling Using The Extended Rank Likelihood
Stat 542: Item Response Theory Modeling Using The Extended Rank Likelihood Jonathan Gruhl March 18, 2010 1 Introduction Researchers commonly apply item response theory (IRT) models to binary and ordinal
More informationStatistical Methods for Handling Incomplete Data Chapter 2: Likelihood-based approach
Statistical Methods for Handling Incomplete Data Chapter 2: Likelihood-based approach Jae-Kwang Kim Department of Statistics, Iowa State University Outline 1 Introduction 2 Observed likelihood 3 Mean Score
More informationA copula goodness-of-t approach. conditional probability integral transform. Daniel Berg 1 Henrik Bakken 2
based on the conditional probability integral transform Daniel Berg 1 Henrik Bakken 2 1 Norwegian Computing Center (NR) & University of Oslo (UiO) 2 Norwegian University of Science and Technology (NTNU)
More informationA note on multiple imputation for general purpose estimation
A note on multiple imputation for general purpose estimation Shu Yang Jae Kwang Kim SSC meeting June 16, 2015 Shu Yang, Jae Kwang Kim Multiple Imputation June 16, 2015 1 / 32 Introduction Basic Setup Assume
More informationVariational Inference with Copula Augmentation
Variational Inference with Copula Augmentation Dustin Tran 1 David M. Blei 2 Edoardo M. Airoldi 1 1 Department of Statistics, Harvard University 2 Department of Statistics & Computer Science, Columbia
More informationCPSC 540: Machine Learning
CPSC 540: Machine Learning MCMC and Non-Parametric Bayes Mark Schmidt University of British Columbia Winter 2016 Admin I went through project proposals: Some of you got a message on Piazza. No news is
More informationProbabilistic & Unsupervised Learning
Probabilistic & Unsupervised Learning Gaussian Processes Maneesh Sahani maneesh@gatsby.ucl.ac.uk Gatsby Computational Neuroscience Unit, and MSc ML/CSML, Dept Computer Science University College London
More informationStatistical testing of covariate effects in conditional copula models
Electronic Journal of Statistics Vol. 7 (2013) 2822 2850 ISSN: 1935-7524 DOI: 10.1214/13-EJS866 Statistical testing of covariate effects in conditional copula models Elif F. Acar Department of Statistics
More informationReview. DS GA 1002 Statistical and Mathematical Models. Carlos Fernandez-Granda
Review DS GA 1002 Statistical and Mathematical Models http://www.cims.nyu.edu/~cfgranda/pages/dsga1002_fall16 Carlos Fernandez-Granda Probability and statistics Probability: Framework for dealing with
More informationBayesian Inference in GLMs. Frequentists typically base inferences on MLEs, asymptotic confidence
Bayesian Inference in GLMs Frequentists typically base inferences on MLEs, asymptotic confidence limits, and log-likelihood ratio tests Bayesians base inferences on the posterior distribution of the unknowns
More informationWrapped Gaussian processes: a short review and some new results
Wrapped Gaussian processes: a short review and some new results Giovanna Jona Lasinio 1, Gianluca Mastrantonio 2 and Alan Gelfand 3 1-Università Sapienza di Roma 2- Università RomaTRE 3- Duke University
More informationGaussian processes for inference in stochastic differential equations
Gaussian processes for inference in stochastic differential equations Manfred Opper, AI group, TU Berlin November 6, 2017 Manfred Opper, AI group, TU Berlin (TU Berlin) inference in SDE November 6, 2017
More informationHow to select a good vine
Universitetet i Oslo ingrihaf@math.uio.no International FocuStat Workshop on Focused Information Criteria and Related Themes, May 9-11, 2016 Copulae Regular vines Model selection and reduction Limitations
More informationA nonparametric Bayesian approach to copula estimation
A nonparametric Bayesian approach to copula estimation Abstract We propose a novel Dirichlet-based Pólya tree (D-P tree) prior on the copula and a nonparametric Bayesian inference procedure based on the
More informationComputer Vision Group Prof. Daniel Cremers. 9. Gaussian Processes - Regression
Group Prof. Daniel Cremers 9. Gaussian Processes - Regression Repetition: Regularized Regression Before, we solved for w using the pseudoinverse. But: we can kernelize this problem as well! First step:
More informationGOODNESS-OF-FIT TESTS FOR ARCHIMEDEAN COPULA MODELS
Statistica Sinica 20 (2010), 441-453 GOODNESS-OF-FIT TESTS FOR ARCHIMEDEAN COPULA MODELS Antai Wang Georgetown University Medical Center Abstract: In this paper, we propose two tests for parametric models
More informationA short introduction to INLA and R-INLA
A short introduction to INLA and R-INLA Integrated Nested Laplace Approximation Thomas Opitz, BioSP, INRA Avignon Workshop: Theory and practice of INLA and SPDE November 7, 2018 2/21 Plan for this talk
More informationCopula modeling for discrete data
Copula modeling for discrete data Christian Genest & Johanna G. Nešlehová in collaboration with Bruno Rémillard McGill University and HEC Montréal ROBUST, September 11, 2016 Main question Suppose (X 1,
More informationBayesian Learning. HT2015: SC4 Statistical Data Mining and Machine Learning. Maximum Likelihood Principle. The Bayesian Learning Framework
HT5: SC4 Statistical Data Mining and Machine Learning Dino Sejdinovic Department of Statistics Oxford http://www.stats.ox.ac.uk/~sejdinov/sdmml.html Maximum Likelihood Principle A generative model for
More informationUncertainty Quantification for Inverse Problems. November 7, 2011
Uncertainty Quantification for Inverse Problems November 7, 2011 Outline UQ and inverse problems Review: least-squares Review: Gaussian Bayesian linear model Parametric reductions for IP Bias, variance
More informationCorrelation: Copulas and Conditioning
Correlation: Copulas and Conditioning This note reviews two methods of simulating correlated variates: copula methods and conditional distributions, and the relationships between them. Particular emphasis
More informationChapter 2: Fundamentals of Statistics Lecture 15: Models and statistics
Chapter 2: Fundamentals of Statistics Lecture 15: Models and statistics Data from one or a series of random experiments are collected. Planning experiments and collecting data (not discussed here). Analysis:
More informationBayesian Semiparametric GARCH Models
Bayesian Semiparametric GARCH Models Xibin (Bill) Zhang and Maxwell L. King Department of Econometrics and Business Statistics Faculty of Business and Economics xibin.zhang@monash.edu Quantitative Methods
More informationUsing Estimating Equations for Spatially Correlated A
Using Estimating Equations for Spatially Correlated Areal Data December 8, 2009 Introduction GEEs Spatial Estimating Equations Implementation Simulation Conclusion Typical Problem Assess the relationship
More informationHierarchical Modeling for Univariate Spatial Data
Hierarchical Modeling for Univariate Spatial Data Geography 890, Hierarchical Bayesian Models for Environmental Spatial Data Analysis February 15, 2011 1 Spatial Domain 2 Geography 890 Spatial Domain This
More informationWhen is a copula constant? A test for changing relationships
When is a copula constant? A test for changing relationships Fabio Busetti and Andrew Harvey Bank of Italy and University of Cambridge November 2007 usetti and Harvey (Bank of Italy and University of Cambridge)
More informationBayesian Semiparametric GARCH Models
Bayesian Semiparametric GARCH Models Xibin (Bill) Zhang and Maxwell L. King Department of Econometrics and Business Statistics Faculty of Business and Economics xibin.zhang@monash.edu Quantitative Methods
More informationBayesian Modeling of Conditional Distributions
Bayesian Modeling of Conditional Distributions John Geweke University of Iowa Indiana University Department of Economics February 27, 2007 Outline Motivation Model description Methods of inference Earnings
More informationBayesian inference for multivariate extreme value distributions
Bayesian inference for multivariate extreme value distributions Sebastian Engelke Clément Dombry, Marco Oesting Toronto, Fields Institute, May 4th, 2016 Main motivation For a parametric model Z F θ of
More informationCTDL-Positive Stable Frailty Model
CTDL-Positive Stable Frailty Model M. Blagojevic 1, G. MacKenzie 2 1 Department of Mathematics, Keele University, Staffordshire ST5 5BG,UK and 2 Centre of Biostatistics, University of Limerick, Ireland
More information