A Fully Nonparametric Modeling Approach to. BNP Binary Regression

Size: px
Start display at page:

Download "A Fully Nonparametric Modeling Approach to. BNP Binary Regression"

Transcription

1 A Fully Nonparametric Modeling Approach to Binary Regression Maria Department of Applied Mathematics and Statistics University of California, Santa Cruz SBIES, April 27-28, 2012

2 Outline Simulation Example Atmospheric Measurements Credit Card Data 4

3 Outline Simulation Example Atmospheric Measurements Credit Card Data 4

4 Outline Simulation Example Atmospheric Measurements Credit Card Data 4

5 Outline Simulation Example Atmospheric Measurements Credit Card Data 4

6 Motivation binary responses along with covariates are present in many settings, including biometrics, econometrics, and social sciences Goal: determine the relationship between response and covariates examples: credit scoring, medicine, population dynamics, environmental sciences the response-covariate relationship is described by the regression function standard approaches involve linearity and distributional assumptions, e.g., GLMs

7 Motivation binary responses along with covariates are present in many settings, including biometrics, econometrics, and social sciences Goal: determine the relationship between response and covariates examples: credit scoring, medicine, population dynamics, environmental sciences the response-covariate relationship is described by the regression function standard approaches involve linearity and distributional assumptions, e.g., GLMs

8 Bayesian Nonparametrics Bayesian nonparametrics can be used to relax common distributional assumptions, resulting in flexible regression models with proper uncertainty quantification rather than modeling directly the regression function, model the joint distribution of response and covariates using a nonparametric mixture model (West et al., 1994, Müller et al., 1996) this implies a form for the conditional response distribution, which is implicitly modeled nonparametrically involves random covariates

9 Bayesian Nonparametrics Bayesian nonparametrics can be used to relax common distributional assumptions, resulting in flexible regression models with proper uncertainty quantification rather than modeling directly the regression function, model the joint distribution of response and covariates using a nonparametric mixture model (West et al., 1994, Müller et al., 1996) this implies a form for the conditional response distribution, which is implicitly modeled nonparametrically involves random covariates

10 Bayesian Nonparametrics Bayesian nonparametrics can be used to relax common distributional assumptions, resulting in flexible regression models with proper uncertainty quantification rather than modeling directly the regression function, model the joint distribution of response and covariates using a nonparametric mixture model (West et al., 1994, Müller et al., 1996) this implies a form for the conditional response distribution, which is implicitly modeled nonparametrically involves random covariates

11 Latent Variable Formulation introduce latent continuous random variables z that determine the binary responses y, so that y = 1 if-f z > 0 (e.g., Albert and Chib, 1993) estimate the joint distribution of latent responses and covariates f (z, x) using a nonparametric mixture model, to obtain flexible inference for the regression function pr(y = 1 x) the latent variables may be of interest in some applications, containing more information than just a 0/1 observation in biology applications, these may be thought of as maturity, latent survivorship, or measure of health

12 Latent Variable Formulation introduce latent continuous random variables z that determine the binary responses y, so that y = 1 if-f z > 0 (e.g., Albert and Chib, 1993) estimate the joint distribution of latent responses and covariates f (z, x) using a nonparametric mixture model, to obtain flexible inference for the regression function pr(y = 1 x) the latent variables may be of interest in some applications, containing more information than just a 0/1 observation in biology applications, these may be thought of as maturity, latent survivorship, or measure of health

13 Latent Variable Formulation introduce latent continuous random variables z that determine the binary responses y, so that y = 1 if-f z > 0 (e.g., Albert and Chib, 1993) estimate the joint distribution of latent responses and covariates f (z, x) using a nonparametric mixture model, to obtain flexible inference for the regression function pr(y = 1 x) the latent variables may be of interest in some applications, containing more information than just a 0/1 observation in biology applications, these may be thought of as maturity, latent survivorship, or measure of health

14 Outline Simulation Example Atmospheric Measurements Credit Card Data 4

15 DP Mixture Model The Dirichlet Process (DP) (Ferguson, 1973) generates random distributions, and can be used as a prior for spaces of distribution functions. DP constructive definition (Sethuraman, 1994): if G DP(α, G 0 ), then it is almost surely of the form l=1 p lδ νl ν l iid G0, l = 1, 2,... iid z r Beta(1, α), r = 1, 2,... l 1 define p 1 = z 1, and p l = z l r=1 (1 z r ), for l = 2, 3,... DP mixture model for the latent responses and covariates f (z, x; G) = N p+1 (z, x; µ, Σ)dG(µ, Σ) G α, ψ DP(α, G 0 (µ, Σ; ψ))

16 DP Mixture Model The Dirichlet Process (DP) (Ferguson, 1973) generates random distributions, and can be used as a prior for spaces of distribution functions. DP constructive definition (Sethuraman, 1994): if G DP(α, G 0 ), then it is almost surely of the form l=1 p lδ νl ν l iid G0, l = 1, 2,... iid z r Beta(1, α), r = 1, 2,... l 1 define p 1 = z 1, and p l = z l r=1 (1 z r ), for l = 2, 3,... DP mixture model for the latent responses and covariates f (z, x; G) = N p+1 (z, x; µ, Σ)dG(µ, Σ) G α, ψ DP(α, G 0 (µ, Σ; ψ))

17 Implied Conditional Regression From the constructive definition, the model has an a.s. representation as a countable mixture of MVNs f (z, x; G) = p l N p+1 (z, x; µ l, Σ l ) l=1 Binary regression functional: pr(y = 1 x; G) marginalize over z to obtain f (x; G) and f (y, x; G) f (x; G) = p l N p (x; µ x l, Σxx l ) l=1 And the joint distribution f (y, x; G) = ( ( p l N p (x; µ x l, Σxx l )Bern y; Φ l=1 µ z l + Σ zx l (Σ xx (Σ zz l Σ zx l )) l ) 1 (x µ x l ) ) 1 Σ xz l ) 1/2 (Σ xx l

18 Implied Conditional Regression From the constructive definition, the model has an a.s. representation as a countable mixture of MVNs f (z, x; G) = p l N p+1 (z, x; µ l, Σ l ) l=1 Binary regression functional: pr(y = 1 x; G) marginalize over z to obtain f (x; G) and f (y, x; G) f (x; G) = p l N p (x; µ x l, Σxx l ) l=1 And the joint distribution f (y, x; G) = ( ( p l N p (x; µ x l, Σxx l )Bern y; Φ l=1 µ z l + Σ zx l (Σ xx (Σ zz l Σ zx l )) l ) 1 (x µ x l ) ) 1 Σ xz l ) 1/2 (Σ xx l

19 Implied Conditional Regression From the constructive definition, the model has an a.s. representation as a countable mixture of MVNs f (z, x; G) = p l N p+1 (z, x; µ l, Σ l ) l=1 Binary regression functional: pr(y = 1 x; G) marginalize over z to obtain f (x; G) and f (y, x; G) f (x; G) = p l N p (x; µ x l, Σxx l ) l=1 And the joint distribution f (y, x; G) = ( ( p l N p (x; µ x l, Σxx l )Bern y; Φ l=1 µ z l + Σ zx l (Σ xx (Σ zz l Σ zx l )) l ) 1 (x µ x l ) ) 1 Σ xz l ) 1/2 (Σ xx l

20 The Regression Function implied regression function: pr(y = 1 x; G) = l=1 w l(x)π l (x), with covariate dependent weights and probabilities π l (x) = Φ w l (x) p l N(x; µ x l, Σxx l ) ( µ z l + Σ zx l (Σ xx (Σ zz l Σ zx l ) l ) 1 (x µ x l ) ) 1 Σ xz l ) 1/2 (Σ xx l Notice that the probabilities have the probit form with component-specific intercept and slope parameters

21 The Regression Function implied regression function: pr(y = 1 x; G) = l=1 w l(x)π l (x), with covariate dependent weights and probabilities π l (x) = Φ w l (x) p l N(x; µ x l, Σxx l ) ( µ z l + Σ zx l (Σ xx (Σ zz l Σ zx l ) l ) 1 (x µ x l ) ) 1 Σ xz l ) 1/2 (Σ xx l Notice that the probabilities have the probit form with component-specific intercept and slope parameters

22 Identifiability Can the entire covariance matrix Σ be estimated? Probit Regression: z N(x T β, 1) the binary responses are not able to inform about the scale of the latent responses retaining Σ zx is important, if we set it to 0, then π l (x) becomes just π l We have shown that if Σ zz is fixed, the remaining parameters are identifiable in the kernel of the mixture model for y and x

23 Identifiability Can the entire covariance matrix Σ be estimated? Probit Regression: z N(x T β, 1) the binary responses are not able to inform about the scale of the latent responses retaining Σ zx is important, if we set it to 0, then π l (x) becomes just π l We have shown that if Σ zz is fixed, the remaining parameters are identifiable in the kernel of the mixture model for y and x

24 Identifiability Can the entire covariance matrix Σ be estimated? Probit Regression: z N(x T β, 1) the binary responses are not able to inform about the scale of the latent responses retaining Σ zx is important, if we set it to 0, then π l (x) becomes just π l We have shown that if Σ zz is fixed, the remaining parameters are identifiable in the kernel of the mixture model for y and x

25 Identifiability Can the entire covariance matrix Σ be estimated? Probit Regression: z N(x T β, 1) the binary responses are not able to inform about the scale of the latent responses retaining Σ zx is important, if we set it to 0, then π l (x) becomes just π l We have shown that if Σ zz is fixed, the remaining parameters are identifiable in the kernel of the mixture model for y and x

26 Identifiability Can the entire covariance matrix Σ be estimated? Probit Regression: z N(x T β, 1) the binary responses are not able to inform about the scale of the latent responses retaining Σ zx is important, if we set it to 0, then π l (x) becomes just π l We have shown that if Σ zz is fixed, the remaining parameters are identifiable in the kernel of the mixture model for y and x

27 Facilitating Identifiability How to fix only one element of the covariance matrix? the usual inverse-wishart distribution will not work square-root-free Cholesky decomposition of Σ uses the relationship = βσβ T, with diagonal with all elements δ i > 0, and β lower triangular with 1 on its diagonal (Daniels and Pourahmadi, 2002; Webb and Forster, 2007) For y = (y 1,..., y m ) N(µ, Σ), with = βσβ T, the joint distribution for y can be expressed in a recursive form: y 1 N(µ 1, δ 1 ), (y k y 1,..., y k 1 ) N(µ k k 1 j=1 β k,j(y j µ j ), δ k ), k = 2,..., m useful for modeling longitudinal data and specifying conditional independence assumptions

28 Facilitating Identifiability How to fix only one element of the covariance matrix? the usual inverse-wishart distribution will not work square-root-free Cholesky decomposition of Σ uses the relationship = βσβ T, with diagonal with all elements δ i > 0, and β lower triangular with 1 on its diagonal (Daniels and Pourahmadi, 2002; Webb and Forster, 2007) For y = (y 1,..., y m ) N(µ, Σ), with = βσβ T, the joint distribution for y can be expressed in a recursive form: y 1 N(µ 1, δ 1 ), (y k y 1,..., y k 1 ) N(µ k k 1 j=1 β k,j(y j µ j ), δ k ), k = 2,..., m useful for modeling longitudinal data and specifying conditional independence assumptions

29 Facilitating Identifiability How to fix only one element of the covariance matrix? the usual inverse-wishart distribution will not work square-root-free Cholesky decomposition of Σ uses the relationship = βσβ T, with diagonal with all elements δ i > 0, and β lower triangular with 1 on its diagonal (Daniels and Pourahmadi, 2002; Webb and Forster, 2007) For y = (y 1,..., y m ) N(µ, Σ), with = βσβ T, the joint distribution for y can be expressed in a recursive form: y 1 N(µ 1, δ 1 ), (y k y 1,..., y k 1 ) N(µ k k 1 j=1 β k,j(y j µ j ), δ k ), k = 2,..., m useful for modeling longitudinal data and specifying conditional independence assumptions

30 Facilitating Identifiability How to fix only one element of the covariance matrix? the usual inverse-wishart distribution will not work square-root-free Cholesky decomposition of Σ uses the relationship = βσβ T, with diagonal with all elements δ i > 0, and β lower triangular with 1 on its diagonal (Daniels and Pourahmadi, 2002; Webb and Forster, 2007) For y = (y 1,..., y m ) N(µ, Σ), with = βσβ T, the joint distribution for y can be expressed in a recursive form: y 1 N(µ 1, δ 1 ), (y k y 1,..., y k 1 ) N(µ k k 1 j=1 β k,j(y j µ j ), δ k ), k = 2,..., m useful for modeling longitudinal data and specifying conditional independence assumptions

31 Facilitating Identifiability here, no natural ordering is present, but the paramaterization has other useful properties which we exploit δ 1 = Σ zz fix δ 1, and mix on δ 2,..., δ p+1 and p(p + 1)/2 free elements of β, denoted by vector β Then the DP mixture model becomes f (z, x; G) = N p+1 (z, x; µ, β 1 β T )dg(µ, β, ) computationally convenient: there exist conjugate prior distributions for β and δ 2,..., δ p+1, which are MVN and (independent) inverse-gamma

32 Facilitating Identifiability here, no natural ordering is present, but the paramaterization has other useful properties which we exploit δ 1 = Σ zz fix δ 1, and mix on δ 2,..., δ p+1 and p(p + 1)/2 free elements of β, denoted by vector β Then the DP mixture model becomes f (z, x; G) = N p+1 (z, x; µ, β 1 β T )dg(µ, β, ) computationally convenient: there exist conjugate prior distributions for β and δ 2,..., δ p+1, which are MVN and (independent) inverse-gamma

33 Outline Simulation Example Atmospheric Measurements Credit Card Data 4

34 Hierarchical Model Blocked Gibbs sampler: truncate G to G N ( ) = N l=1 p lδ Wl ( ), with W l = (µ l, β l, l ), and introduce configuration variables (L 1,..., L n ) taking values in 1,..., N. y i z i ind 1 (yi =1)1 (zi >0) + 1 (yi =0)1 (zi 0), i = 1,..., n ind (z i, x i ) W, L i N p+1 ((z i, x i ); µ Li, β 1 Li β T L i ), i = 1,..., n L i p N p l δ l (L i ), l=1 L i i = 1,..., n p+1 W l ψ ind N p+1 (µ l ; m, V )N q ( β l ; θ, ci) IG(δ i,l ; ν i, s i ), i=2 l = 1,..., N

35 Gibbs sampling may be used to simulate from full posterior p(w, L, p, ψ, α, z data), with the conditionally conjugate base distribution, and conjugate priors on ψ and α. The posterior for G N = (p, W ) is imputed in the MCMC, enabling full inference for any functional of f (z, x; G N ), now a finite sum Binary regression functional: for any covariate value x 0, at iteration r of the MCMC, calculate pr(y = 1 x 0 ; G (r) N ) provides point estimate and uncertainty quantification for regression function Same can be done for other functionals, such as latent response distribution f (z x 0 ; G N ) at any covariate value x 0

36 Gibbs sampling may be used to simulate from full posterior p(w, L, p, ψ, α, z data), with the conditionally conjugate base distribution, and conjugate priors on ψ and α. The posterior for G N = (p, W ) is imputed in the MCMC, enabling full inference for any functional of f (z, x; G N ), now a finite sum Binary regression functional: for any covariate value x 0, at iteration r of the MCMC, calculate pr(y = 1 x 0 ; G (r) N ) provides point estimate and uncertainty quantification for regression function Same can be done for other functionals, such as latent response distribution f (z x 0 ; G N ) at any covariate value x 0

37 Gibbs sampling may be used to simulate from full posterior p(w, L, p, ψ, α, z data), with the conditionally conjugate base distribution, and conjugate priors on ψ and α. The posterior for G N = (p, W ) is imputed in the MCMC, enabling full inference for any functional of f (z, x; G N ), now a finite sum Binary regression functional: for any covariate value x 0, at iteration r of the MCMC, calculate pr(y = 1 x 0 ; G (r) N ) provides point estimate and uncertainty quantification for regression function Same can be done for other functionals, such as latent response distribution f (z x 0 ; G N ) at any covariate value x 0

38 Outline Simulation Example Atmospheric Measurements Credit Card Data Simulation Example Atmospheric Measurements Credit Card Data 4

39 Simulated Data Simulation Example Atmospheric Measurements Credit Card Data Data {(z i, x i ) : i = 1,..., n} was simulated from a mixture of 3 bivariate normals, and y determined from z. compare inference from the binary regression model with data (y, x) to that from model which views (z, x) as data a practical prior specification approach which is appropriate when little is known about the problem is applied here to specify priors on ψ, consider only one mixture component and use an approximate center and range of the data, as well as prior simulation to induce an approximate unif( 1, 1) prior on corr(z, x)

40 Simulated Data Simulation Example Atmospheric Measurements Credit Card Data Data {(z i, x i ) : i = 1,..., n} was simulated from a mixture of 3 bivariate normals, and y determined from z. compare inference from the binary regression model with data (y, x) to that from model which views (z, x) as data a practical prior specification approach which is appropriate when little is known about the problem is applied here to specify priors on ψ, consider only one mixture component and use an approximate center and range of the data, as well as prior simulation to induce an approximate unif( 1, 1) prior on corr(z, x)

41 Simulated Data Simulation Example Atmospheric Measurements Credit Card Data Data {(z i, x i ) : i = 1,..., n} was simulated from a mixture of 3 bivariate normals, and y determined from z. compare inference from the binary regression model with data (y, x) to that from model which views (z, x) as data a practical prior specification approach which is appropriate when little is known about the problem is applied here to specify priors on ψ, consider only one mixture component and use an approximate center and range of the data, as well as prior simulation to induce an approximate unif( 1, 1) prior on corr(z, x)

42 Pr(z>0 x;g) Pr(y=1 x;g) x x The inference for pr(z > 0 x; G) (left) is compared to that for pr(y = 1 x; G) (right) and the truth (solid line).

43 f(z x=x1) f(z x=x2) f(z x=x3) z z z f(z x=x1) f(z x=x2) f(z x=x3) z z z Top row: Inference for f (z x 0 ; G) under the model which views z as observed, with true densities as dashed lines, at 3 values of x 0. Bottom: Inference from the binary regression model.

44 Outline Simulation Example Atmospheric Measurements Credit Card Data Simulation Example Atmospheric Measurements Credit Card Data 4

45 Ozone and Wind Speed Simulation Example Atmospheric Measurements Credit Card Data 111 daily measurements of wind speed (mph) and ozone concentration (parts per billion) in NYC over 4 month period objective: model the probability of exceeding a certain ozone concentration as a function of wind speed the model only sees whether or not there was an exceedance, but there is an actual ozone concentration underlying this 0/1 value

46 probability of ozone exceedence ozone concentration wind speed wind speed Left: The probability that ozone concentration (parts per billion) exceeds a threshold of 70 decreases with wind speed (mph). Right: For comparison, here are the actual non-discretized ozone measurements as a function of wind speed.

47 f(z x0) f(z x0) z z f(z x0) f(z x0) z z Estimates for f (z x 0 ; G) at wind speed values of 5, 8, 10, and 15 mph.

48 Outline Simulation Example Atmospheric Measurements Credit Card Data Simulation Example Atmospheric Measurements Credit Card Data 4

49 Credit Cards and Income Simulation Example Atmospheric Measurements Credit Card Data n = 100 subjects in a study were asked whether or not they owned a travel credit card, and their income was recorded (Agresti, 1996) In this situation, it is not clear that there is some meaningful interpretation of the latent continuous random variables, but we can still use the method for regression Does probability of owning a credit card change with income?

50 Pr(y=1 x;g) income in thousands Probability of owning a credit card appears to increase with income, with a slight dip or leveling off around income of 40-50, since all subjects in that region did not own a credit card.

51 Extensions to Ordinal Reponses similar methodology, wider range of applications for an ordinal response with C categories, assume y = j if-f γ j 1 < z γ j, for j = 1,...C, and apply the same DP mixture of MVNs for (z, x) for fixed cut-off points γ, it can be shown that all of µ and Σ are identifiable in the induced kernel for the observables the C 1 free cut-off points can be fixed to arbitrary increasing values (Kottas et al., 2005), which is an attribute in a computational sense

52 Extensions to Ordinal Reponses similar methodology, wider range of applications for an ordinal response with C categories, assume y = j if-f γ j 1 < z γ j, for j = 1,...C, and apply the same DP mixture of MVNs for (z, x) for fixed cut-off points γ, it can be shown that all of µ and Σ are identifiable in the induced kernel for the observables the C 1 free cut-off points can be fixed to arbitrary increasing values (Kottas et al., 2005), which is an attribute in a computational sense

53 Extensions to Ordinal Reponses similar methodology, wider range of applications for an ordinal response with C categories, assume y = j if-f γ j 1 < z γ j, for j = 1,...C, and apply the same DP mixture of MVNs for (z, x) for fixed cut-off points γ, it can be shown that all of µ and Σ are identifiable in the induced kernel for the observables the C 1 free cut-off points can be fixed to arbitrary increasing values (Kottas et al., 2005), which is an attribute in a computational sense

54 Extensions to Ordinal Reponses similar methodology, wider range of applications for an ordinal response with C categories, assume y = j if-f γ j 1 < z γ j, for j = 1,...C, and apply the same DP mixture of MVNs for (z, x) for fixed cut-off points γ, it can be shown that all of µ and Σ are identifiable in the induced kernel for the observables the C 1 free cut-off points can be fixed to arbitrary increasing values (Kottas et al., 2005), which is an attribute in a computational sense

55 Other Extensions multivariate ordinal responses: J ordinal responses associated with a vector of covariates for each subject; with C j categories associated with the jth response several applications, but limited existing methods for flexible inference y and z are vectors, and y j = l if-f γ j,l 1 < z j γ j,l, for j = 1,..., J, and l = 1,..., C j C j > 2 for all j, then no identifiability restrictions needed C j = 2 for some j, then (β, ) paramaterization can be used, and fixing certain elements of δ provides the necessary restrictions mixed ordinal-continuous responses

56 Other Extensions multivariate ordinal responses: J ordinal responses associated with a vector of covariates for each subject; with C j categories associated with the jth response several applications, but limited existing methods for flexible inference y and z are vectors, and y j = l if-f γ j,l 1 < z j γ j,l, for j = 1,..., J, and l = 1,..., C j C j > 2 for all j, then no identifiability restrictions needed C j = 2 for some j, then (β, ) paramaterization can be used, and fixing certain elements of δ provides the necessary restrictions mixed ordinal-continuous responses

57 Other Extensions multivariate ordinal responses: J ordinal responses associated with a vector of covariates for each subject; with C j categories associated with the jth response several applications, but limited existing methods for flexible inference y and z are vectors, and y j = l if-f γ j,l 1 < z j γ j,l, for j = 1,..., J, and l = 1,..., C j C j > 2 for all j, then no identifiability restrictions needed C j = 2 for some j, then (β, ) paramaterization can be used, and fixing certain elements of δ provides the necessary restrictions mixed ordinal-continuous responses

58 Other Extensions multivariate ordinal responses: J ordinal responses associated with a vector of covariates for each subject; with C j categories associated with the jth response several applications, but limited existing methods for flexible inference y and z are vectors, and y j = l if-f γ j,l 1 < z j γ j,l, for j = 1,..., J, and l = 1,..., C j C j > 2 for all j, then no identifiability restrictions needed C j = 2 for some j, then (β, ) paramaterization can be used, and fixing certain elements of δ provides the necessary restrictions mixed ordinal-continuous responses

59 Other Extensions multivariate ordinal responses: J ordinal responses associated with a vector of covariates for each subject; with C j categories associated with the jth response several applications, but limited existing methods for flexible inference y and z are vectors, and y j = l if-f γ j,l 1 < z j γ j,l, for j = 1,..., J, and l = 1,..., C j C j > 2 for all j, then no identifiability restrictions needed C j = 2 for some j, then (β, ) paramaterization can be used, and fixing certain elements of δ provides the necessary restrictions mixed ordinal-continuous responses

60 Conclusions Binary responses measured along with covariates represents a simple setting, but the scope of problems which lie in this category is large. This framework allows flexible, nonparametric inference to be obtained for the regression relationship in a general binary regression problem. The methodology extends easily to larger classes of problems in ordinal regression, including multivariate responses and mixed responses, making the framework much more powerful, with utility in a wide variety of applications.

Nonparametric Bayesian modeling for dynamic ordinal regression relationships

Nonparametric Bayesian modeling for dynamic ordinal regression relationships Nonparametric Bayesian modeling for dynamic ordinal regression relationships Athanasios Kottas Department of Applied Mathematics and Statistics, University of California, Santa Cruz Joint work with Maria

More information

Flexible Regression Modeling using Bayesian Nonparametric Mixtures

Flexible Regression Modeling using Bayesian Nonparametric Mixtures Flexible Regression Modeling using Bayesian Nonparametric Mixtures Athanasios Kottas Department of Applied Mathematics and Statistics University of California, Santa Cruz Department of Statistics Brigham

More information

Bayesian Nonparametric Modeling for Multivariate Ordinal Regression

Bayesian Nonparametric Modeling for Multivariate Ordinal Regression Bayesian Nonparametric Modeling for Multivariate Ordinal Regression arxiv:1408.1027v3 [stat.me] 20 Sep 2016 Maria DeYoreo Department of Statistical Science, Duke University and Athanasios Kottas Department

More information

Lecture 16: Mixtures of Generalized Linear Models

Lecture 16: Mixtures of Generalized Linear Models Lecture 16: Mixtures of Generalized Linear Models October 26, 2006 Setting Outline Often, a single GLM may be insufficiently flexible to characterize the data Setting Often, a single GLM may be insufficiently

More information

Latent Variable Models for Binary Data. Suppose that for a given vector of explanatory variables x, the latent

Latent Variable Models for Binary Data. Suppose that for a given vector of explanatory variables x, the latent Latent Variable Models for Binary Data Suppose that for a given vector of explanatory variables x, the latent variable, U, has a continuous cumulative distribution function F (u; x) and that the binary

More information

A Nonparametric Bayesian Model for Multivariate Ordinal Data

A Nonparametric Bayesian Model for Multivariate Ordinal Data A Nonparametric Bayesian Model for Multivariate Ordinal Data Athanasios Kottas, University of California at Santa Cruz Peter Müller, The University of Texas M. D. Anderson Cancer Center Fernando A. Quintana,

More information

STA 216, GLM, Lecture 16. October 29, 2007

STA 216, GLM, Lecture 16. October 29, 2007 STA 216, GLM, Lecture 16 October 29, 2007 Efficient Posterior Computation in Factor Models Underlying Normal Models Generalized Latent Trait Models Formulation Genetic Epidemiology Illustration Structural

More information

Bayes methods for categorical data. April 25, 2017

Bayes methods for categorical data. April 25, 2017 Bayes methods for categorical data April 25, 2017 Motivation for joint probability models Increasing interest in high-dimensional data in broad applications Focus may be on prediction, variable selection,

More information

Gibbs Sampling in Endogenous Variables Models

Gibbs Sampling in Endogenous Variables Models Gibbs Sampling in Endogenous Variables Models Econ 690 Purdue University Outline 1 Motivation 2 Identification Issues 3 Posterior Simulation #1 4 Posterior Simulation #2 Motivation In this lecture we take

More information

Part 6: Multivariate Normal and Linear Models

Part 6: Multivariate Normal and Linear Models Part 6: Multivariate Normal and Linear Models 1 Multiple measurements Up until now all of our statistical models have been univariate models models for a single measurement on each member of a sample of

More information

Default Priors and Effcient Posterior Computation in Bayesian

Default Priors and Effcient Posterior Computation in Bayesian Default Priors and Effcient Posterior Computation in Bayesian Factor Analysis January 16, 2010 Presented by Eric Wang, Duke University Background and Motivation A Brief Review of Parameter Expansion Literature

More information

Gibbs Sampling in Latent Variable Models #1

Gibbs Sampling in Latent Variable Models #1 Gibbs Sampling in Latent Variable Models #1 Econ 690 Purdue University Outline 1 Data augmentation 2 Probit Model Probit Application A Panel Probit Panel Probit 3 The Tobit Model Example: Female Labor

More information

A Bayesian Nonparametric Approach to Monotone Missing Data in Longitudinal Studies with Informative Missingness

A Bayesian Nonparametric Approach to Monotone Missing Data in Longitudinal Studies with Informative Missingness A Bayesian Nonparametric Approach to Monotone Missing Data in Longitudinal Studies with Informative Missingness A. Linero and M. Daniels UF, UT-Austin SRC 2014, Galveston, TX 1 Background 2 Working model

More information

Bayesian Multivariate Logistic Regression

Bayesian Multivariate Logistic Regression Bayesian Multivariate Logistic Regression Sean M. O Brien and David B. Dunson Biostatistics Branch National Institute of Environmental Health Sciences Research Triangle Park, NC 1 Goals Brief review of

More information

Motivation Scale Mixutres of Normals Finite Gaussian Mixtures Skew-Normal Models. Mixture Models. Econ 690. Purdue University

Motivation Scale Mixutres of Normals Finite Gaussian Mixtures Skew-Normal Models. Mixture Models. Econ 690. Purdue University Econ 690 Purdue University In virtually all of the previous lectures, our models have made use of normality assumptions. From a computational point of view, the reason for this assumption is clear: combined

More information

A Bayesian Nonparametric Approach to Inference for Quantile Regression

A Bayesian Nonparametric Approach to Inference for Quantile Regression A Bayesian Nonparametric Approach to Inference for Quantile Regression Matthew TADDY The University of Chicago Booth School of Business, Chicago, IL 60637 (matt.taddy@chicagogsb.edu) Athanasios KOTTAS

More information

Stat 542: Item Response Theory Modeling Using The Extended Rank Likelihood

Stat 542: Item Response Theory Modeling Using The Extended Rank Likelihood Stat 542: Item Response Theory Modeling Using The Extended Rank Likelihood Jonathan Gruhl March 18, 2010 1 Introduction Researchers commonly apply item response theory (IRT) models to binary and ordinal

More information

Bayesian linear regression

Bayesian linear regression Bayesian linear regression Linear regression is the basis of most statistical modeling. The model is Y i = X T i β + ε i, where Y i is the continuous response X i = (X i1,..., X ip ) T is the corresponding

More information

November 2002 STA Random Effects Selection in Linear Mixed Models

November 2002 STA Random Effects Selection in Linear Mixed Models November 2002 STA216 1 Random Effects Selection in Linear Mixed Models November 2002 STA216 2 Introduction It is common practice in many applications to collect multiple measurements on a subject. Linear

More information

Bayesian non-parametric model to longitudinally predict churn

Bayesian non-parametric model to longitudinally predict churn Bayesian non-parametric model to longitudinally predict churn Bruno Scarpa Università di Padova Conference of European Statistics Stakeholders Methodologists, Producers and Users of European Statistics

More information

Part 8: GLMs and Hierarchical LMs and GLMs

Part 8: GLMs and Hierarchical LMs and GLMs Part 8: GLMs and Hierarchical LMs and GLMs 1 Example: Song sparrow reproductive success Arcese et al., (1992) provide data on a sample from a population of 52 female song sparrows studied over the course

More information

MULTILEVEL IMPUTATION 1

MULTILEVEL IMPUTATION 1 MULTILEVEL IMPUTATION 1 Supplement B: MCMC Sampling Steps and Distributions for Two-Level Imputation This document gives technical details of the full conditional distributions used to draw regression

More information

arxiv: v3 [stat.me] 3 May 2016

arxiv: v3 [stat.me] 3 May 2016 A Bayesian Nonparametric Markovian Model for Nonstationary Time Series arxiv:1601.04331v3 [stat.me] 3 May 2016 Maria DeYoreo and Athanasios Kottas Abstract Stationary time series models built from parametric

More information

Bayesian Point Process Modeling for Extreme Value Analysis, with an Application to Systemic Risk Assessment in Correlated Financial Markets

Bayesian Point Process Modeling for Extreme Value Analysis, with an Application to Systemic Risk Assessment in Correlated Financial Markets Bayesian Point Process Modeling for Extreme Value Analysis, with an Application to Systemic Risk Assessment in Correlated Financial Markets Athanasios Kottas Department of Applied Mathematics and Statistics,

More information

A Nonparametric Model-based Approach to Inference for Quantile Regression

A Nonparametric Model-based Approach to Inference for Quantile Regression A Nonparametric Model-based Approach to Inference for Quantile Regression Matthew Taddy and Athanasios Kottas Department of Applied Mathematics and Statistics, University of California, Santa Cruz, CA

More information

Nonparametric Bayesian Modeling for Multivariate Ordinal. Data

Nonparametric Bayesian Modeling for Multivariate Ordinal. Data Nonparametric Bayesian Modeling for Multivariate Ordinal Data Athanasios Kottas, Peter Müller and Fernando Quintana Abstract We propose a probability model for k-dimensional ordinal outcomes, i.e., we

More information

Timevarying VARs. Wouter J. Den Haan London School of Economics. c Wouter J. Den Haan

Timevarying VARs. Wouter J. Den Haan London School of Economics. c Wouter J. Den Haan Timevarying VARs Wouter J. Den Haan London School of Economics c Wouter J. Den Haan Time-Varying VARs Gibbs-Sampler general idea probit regression application (Inverted Wishart distribution Drawing from

More information

Gibbs Sampling in Linear Models #2

Gibbs Sampling in Linear Models #2 Gibbs Sampling in Linear Models #2 Econ 690 Purdue University Outline 1 Linear Regression Model with a Changepoint Example with Temperature Data 2 The Seemingly Unrelated Regressions Model 3 Gibbs sampling

More information

Bayesian nonparametric estimation of finite population quantities in absence of design information on nonsampled units

Bayesian nonparametric estimation of finite population quantities in absence of design information on nonsampled units Bayesian nonparametric estimation of finite population quantities in absence of design information on nonsampled units Sahar Z Zangeneh Robert W. Keener Roderick J.A. Little Abstract In Probability proportional

More information

Mixture Modeling for Marked Poisson Processes

Mixture Modeling for Marked Poisson Processes Mixture Modeling for Marked Poisson Processes Matthew A. Taddy taddy@chicagobooth.edu The University of Chicago Booth School of Business 5807 South Woodlawn Ave, Chicago, IL 60637, USA Athanasios Kottas

More information

The STS Surgeon Composite Technical Appendix

The STS Surgeon Composite Technical Appendix The STS Surgeon Composite Technical Appendix Overview Surgeon-specific risk-adjusted operative operative mortality and major complication rates were estimated using a bivariate random-effects logistic

More information

Bayesian (conditionally) conjugate inference for discrete data models. Jon Forster (University of Southampton)

Bayesian (conditionally) conjugate inference for discrete data models. Jon Forster (University of Southampton) Bayesian (conditionally) conjugate inference for discrete data models Jon Forster (University of Southampton) with Mark Grigsby (Procter and Gamble?) Emily Webb (Institute of Cancer Research) Table 1:

More information

Nearest Neighbor Gaussian Processes for Large Spatial Data

Nearest Neighbor Gaussian Processes for Large Spatial Data Nearest Neighbor Gaussian Processes for Large Spatial Data Abhi Datta 1, Sudipto Banerjee 2 and Andrew O. Finley 3 July 31, 2017 1 Department of Biostatistics, Bloomberg School of Public Health, Johns

More information

Research Division Federal Reserve Bank of St. Louis Working Paper Series

Research Division Federal Reserve Bank of St. Louis Working Paper Series Research Division Federal Reserve Bank of St Louis Working Paper Series Kalman Filtering with Truncated Normal State Variables for Bayesian Estimation of Macroeconomic Models Michael Dueker Working Paper

More information

Methods for the Comparability of Scores

Methods for the Comparability of Scores Assessment and Development of new Statistical Methods for the Comparability of Scores 1 Pontificia Universidad Católica de Chile 2 Laboratorio Interdisciplinario de Estadística Social Advisor: Jorge González

More information

Supplementary Material for Analysis of Job Satisfaction: The Case of Japanese Private Companies

Supplementary Material for Analysis of Job Satisfaction: The Case of Japanese Private Companies Supplementary Material for Analysis of Job Satisfaction: The Case of Japanese Private Companies S1. Sampling Algorithms We assume that z i NX i β, Σ), i =1,,n, 1) where Σ is an m m positive definite covariance

More information

Bayesian Inference on Joint Mixture Models for Survival-Longitudinal Data with Multiple Features. Yangxin Huang

Bayesian Inference on Joint Mixture Models for Survival-Longitudinal Data with Multiple Features. Yangxin Huang Bayesian Inference on Joint Mixture Models for Survival-Longitudinal Data with Multiple Features Yangxin Huang Department of Epidemiology and Biostatistics, COPH, USF, Tampa, FL yhuang@health.usf.edu January

More information

Online Appendix to: Marijuana on Main Street? Estimating Demand in Markets with Limited Access

Online Appendix to: Marijuana on Main Street? Estimating Demand in Markets with Limited Access Online Appendix to: Marijuana on Main Street? Estating Demand in Markets with Lited Access By Liana Jacobi and Michelle Sovinsky This appendix provides details on the estation methodology for various speci

More information

Lecture 5: Spatial probit models. James P. LeSage University of Toledo Department of Economics Toledo, OH

Lecture 5: Spatial probit models. James P. LeSage University of Toledo Department of Economics Toledo, OH Lecture 5: Spatial probit models James P. LeSage University of Toledo Department of Economics Toledo, OH 43606 jlesage@spatial-econometrics.com March 2004 1 A Bayesian spatial probit model with individual

More information

Nonparametric Bayesian Methods (Gaussian Processes)

Nonparametric Bayesian Methods (Gaussian Processes) [70240413 Statistical Machine Learning, Spring, 2015] Nonparametric Bayesian Methods (Gaussian Processes) Jun Zhu dcszj@mail.tsinghua.edu.cn http://bigml.cs.tsinghua.edu.cn/~jun State Key Lab of Intelligent

More information

Index. Pagenumbersfollowedbyf indicate figures; pagenumbersfollowedbyt indicate tables.

Index. Pagenumbersfollowedbyf indicate figures; pagenumbersfollowedbyt indicate tables. Index Pagenumbersfollowedbyf indicate figures; pagenumbersfollowedbyt indicate tables. Adaptive rejection metropolis sampling (ARMS), 98 Adaptive shrinkage, 132 Advanced Photo System (APS), 255 Aggregation

More information

A Bayesian Mixture Model with Application to Typhoon Rainfall Predictions in Taipei, Taiwan 1

A Bayesian Mixture Model with Application to Typhoon Rainfall Predictions in Taipei, Taiwan 1 Int. J. Contemp. Math. Sci., Vol. 2, 2007, no. 13, 639-648 A Bayesian Mixture Model with Application to Typhoon Rainfall Predictions in Taipei, Taiwan 1 Tsai-Hung Fan Graduate Institute of Statistics National

More information

STAT Advanced Bayesian Inference

STAT Advanced Bayesian Inference 1 / 32 STAT 625 - Advanced Bayesian Inference Meng Li Department of Statistics Jan 23, 218 The Dirichlet distribution 2 / 32 θ Dirichlet(a 1,...,a k ) with density p(θ 1,θ 2,...,θ k ) = k j=1 Γ(a j) Γ(

More information

Modeling conditional distributions with mixture models: Theory and Inference

Modeling conditional distributions with mixture models: Theory and Inference Modeling conditional distributions with mixture models: Theory and Inference John Geweke University of Iowa, USA Journal of Applied Econometrics Invited Lecture Università di Venezia Italia June 2, 2005

More information

Gibbs Sampling for the Probit Regression Model with Gaussian Markov Random Field Latent Variables

Gibbs Sampling for the Probit Regression Model with Gaussian Markov Random Field Latent Variables Gibbs Sampling for the Probit Regression Model with Gaussian Markov Random Field Latent Variables Mohammad Emtiyaz Khan Department of Computer Science University of British Columbia May 8, 27 Abstract

More information

1 Data Arrays and Decompositions

1 Data Arrays and Decompositions 1 Data Arrays and Decompositions 1.1 Variance Matrices and Eigenstructure Consider a p p positive definite and symmetric matrix V - a model parameter or a sample variance matrix. The eigenstructure is

More information

Using Model Selection and Prior Specification to Improve Regime-switching Asset Simulations

Using Model Selection and Prior Specification to Improve Regime-switching Asset Simulations Using Model Selection and Prior Specification to Improve Regime-switching Asset Simulations Brian M. Hartman, PhD ASA Assistant Professor of Actuarial Science University of Connecticut BYU Statistics Department

More information

Bayesian Nonparametric Regression through Mixture Models

Bayesian Nonparametric Regression through Mixture Models Bayesian Nonparametric Regression through Mixture Models Sara Wade Bocconi University Advisor: Sonia Petrone October 7, 2013 Outline 1 Introduction 2 Enriched Dirichlet Process 3 EDP Mixtures for Regression

More information

Bayesian Inference in GLMs. Frequentists typically base inferences on MLEs, asymptotic confidence

Bayesian Inference in GLMs. Frequentists typically base inferences on MLEs, asymptotic confidence Bayesian Inference in GLMs Frequentists typically base inferences on MLEs, asymptotic confidence limits, and log-likelihood ratio tests Bayesians base inferences on the posterior distribution of the unknowns

More information

Bayesian Mixture Modeling of Significant P Values: A Meta-Analytic Method to Estimate the Degree of Contamination from H 0 : Supplemental Material

Bayesian Mixture Modeling of Significant P Values: A Meta-Analytic Method to Estimate the Degree of Contamination from H 0 : Supplemental Material Bayesian Mixture Modeling of Significant P Values: A Meta-Analytic Method to Estimate the Degree of Contamination from H 0 : Supplemental Material Quentin Frederik Gronau 1, Monique Duizer 1, Marjan Bakker

More information

Nonparametric Bayesian Modeling for Multivariate Ordinal. Data

Nonparametric Bayesian Modeling for Multivariate Ordinal. Data Nonparametric Bayesian Modeling for Multivariate Ordinal Data Athanasios Kottas, Peter Müller and Fernando Quintana August 18, 2004 Abstract We propose a probability model for k-dimensional ordinal outcomes,

More information

Copula Regression RAHUL A. PARSA DRAKE UNIVERSITY & STUART A. KLUGMAN SOCIETY OF ACTUARIES CASUALTY ACTUARIAL SOCIETY MAY 18,2011

Copula Regression RAHUL A. PARSA DRAKE UNIVERSITY & STUART A. KLUGMAN SOCIETY OF ACTUARIES CASUALTY ACTUARIAL SOCIETY MAY 18,2011 Copula Regression RAHUL A. PARSA DRAKE UNIVERSITY & STUART A. KLUGMAN SOCIETY OF ACTUARIES CASUALTY ACTUARIAL SOCIETY MAY 18,2011 Outline Ordinary Least Squares (OLS) Regression Generalized Linear Models

More information

COMPOSITIONAL IDEAS IN THE BAYESIAN ANALYSIS OF CATEGORICAL DATA WITH APPLICATION TO DOSE FINDING CLINICAL TRIALS

COMPOSITIONAL IDEAS IN THE BAYESIAN ANALYSIS OF CATEGORICAL DATA WITH APPLICATION TO DOSE FINDING CLINICAL TRIALS COMPOSITIONAL IDEAS IN THE BAYESIAN ANALYSIS OF CATEGORICAL DATA WITH APPLICATION TO DOSE FINDING CLINICAL TRIALS M. Gasparini and J. Eisele 2 Politecnico di Torino, Torino, Italy; mauro.gasparini@polito.it

More information

Bayesian Nonparametric Inference Methods for Mean Residual Life Functions

Bayesian Nonparametric Inference Methods for Mean Residual Life Functions Bayesian Nonparametric Inference Methods for Mean Residual Life Functions Valerie Poynor Department of Applied Mathematics and Statistics, University of California, Santa Cruz April 28, 212 1/3 Outline

More information

Bayesian Hypothesis Testing in GLMs: One-Sided and Ordered Alternatives. 1(w i = h + 1)β h + ɛ i,

Bayesian Hypothesis Testing in GLMs: One-Sided and Ordered Alternatives. 1(w i = h + 1)β h + ɛ i, Bayesian Hypothesis Testing in GLMs: One-Sided and Ordered Alternatives Often interest may focus on comparing a null hypothesis of no difference between groups to an ordered restricted alternative. For

More information

Modeling for Dynamic Ordinal Regression Relationships: An. Application to Estimating Maturity of Rockfish in California

Modeling for Dynamic Ordinal Regression Relationships: An. Application to Estimating Maturity of Rockfish in California Modeling for Dynamic Ordinal Regression Relationships: An Application to Estimating Maturity of Rockfish in California Maria DeYoreo and Athanasios Kottas Abstract We develop a Bayesian nonparametric framework

More information

Outline. Binomial, Multinomial, Normal, Beta, Dirichlet. Posterior mean, MAP, credible interval, posterior distribution

Outline. Binomial, Multinomial, Normal, Beta, Dirichlet. Posterior mean, MAP, credible interval, posterior distribution Outline A short review on Bayesian analysis. Binomial, Multinomial, Normal, Beta, Dirichlet Posterior mean, MAP, credible interval, posterior distribution Gibbs sampling Revisit the Gaussian mixture model

More information

Introduction to Probabilistic Graphical Models: Exercises

Introduction to Probabilistic Graphical Models: Exercises Introduction to Probabilistic Graphical Models: Exercises Cédric Archambeau Xerox Research Centre Europe cedric.archambeau@xrce.xerox.com Pascal Bootcamp Marseille, France, July 2010 Exercise 1: basics

More information

Bayesian Nonparametrics: Dirichlet Process

Bayesian Nonparametrics: Dirichlet Process Bayesian Nonparametrics: Dirichlet Process Yee Whye Teh Gatsby Computational Neuroscience Unit, UCL http://www.gatsby.ucl.ac.uk/~ywteh/teaching/npbayes2012 Dirichlet Process Cornerstone of modern Bayesian

More information

STA 4273H: Statistical Machine Learning

STA 4273H: Statistical Machine Learning STA 4273H: Statistical Machine Learning Russ Salakhutdinov Department of Computer Science! Department of Statistical Sciences! rsalakhu@cs.toronto.edu! h0p://www.cs.utoronto.ca/~rsalakhu/ Lecture 7 Approximate

More information

Analysing geoadditive regression data: a mixed model approach

Analysing geoadditive regression data: a mixed model approach Analysing geoadditive regression data: a mixed model approach Institut für Statistik, Ludwig-Maximilians-Universität München Joint work with Ludwig Fahrmeir & Stefan Lang 25.11.2005 Spatio-temporal regression

More information

UNIVERSITY OF CALIFORNIA SANTA CRUZ

UNIVERSITY OF CALIFORNIA SANTA CRUZ UNIVERSITY OF CALIFORNIA SANTA CRUZ BAYESIAN NONPARAMETRIC ANALYSIS OF CONDITIONAL DISTRIBUTIONS AND INFERENCE FOR POISSON POINT PROCESSES A dissertation submitted in partial satisfaction of the requirements

More information

Bayesian Nonparametric Autoregressive Models via Latent Variable Representation

Bayesian Nonparametric Autoregressive Models via Latent Variable Representation Bayesian Nonparametric Autoregressive Models via Latent Variable Representation Maria De Iorio Yale-NUS College Dept of Statistical Science, University College London Collaborators: Lifeng Ye (UCL, London,

More information

Stat 451 Lecture Notes Markov Chain Monte Carlo. Ryan Martin UIC

Stat 451 Lecture Notes Markov Chain Monte Carlo. Ryan Martin UIC Stat 451 Lecture Notes 07 12 Markov Chain Monte Carlo Ryan Martin UIC www.math.uic.edu/~rgmartin 1 Based on Chapters 8 9 in Givens & Hoeting, Chapters 25 27 in Lange 2 Updated: April 4, 2016 1 / 42 Outline

More information

Nonparametric Bayes Uncertainty Quantification

Nonparametric Bayes Uncertainty Quantification Nonparametric Bayes Uncertainty Quantification David Dunson Department of Statistical Science, Duke University Funded from NIH R01-ES017240, R01-ES017436 & ONR Review of Bayes Intro to Nonparametric Bayes

More information

Bayesian semiparametric modeling and inference with mixtures of symmetric distributions

Bayesian semiparametric modeling and inference with mixtures of symmetric distributions Bayesian semiparametric modeling and inference with mixtures of symmetric distributions Athanasios Kottas 1 and Gilbert W. Fellingham 2 1 Department of Applied Mathematics and Statistics, University of

More information

Assessing Toxicities in a Clinical Trial: Bayesian Inference for Ordinal Data. Nested within Categories

Assessing Toxicities in a Clinical Trial: Bayesian Inference for Ordinal Data. Nested within Categories 1 Assessing Toxicities in a Clinical Trial: Bayesian Inference for Ordinal Data Nested within Categories L.G. Leon-Novelo University of Florida. Department of Statistics 102 Griffin-Floyd Hall. P.O. Box

More information

Bayesian Statistics. Debdeep Pati Florida State University. April 3, 2017

Bayesian Statistics. Debdeep Pati Florida State University. April 3, 2017 Bayesian Statistics Debdeep Pati Florida State University April 3, 2017 Finite mixture model The finite mixture of normals can be equivalently expressed as y i N(µ Si ; τ 1 S i ), S i k π h δ h h=1 δ h

More information

Hyperparameter estimation in Dirichlet process mixture models

Hyperparameter estimation in Dirichlet process mixture models Hyperparameter estimation in Dirichlet process mixture models By MIKE WEST Institute of Statistics and Decision Sciences Duke University, Durham NC 27706, USA. SUMMARY In Bayesian density estimation and

More information

Latent Variable Models Probabilistic Models in the Study of Language Day 4

Latent Variable Models Probabilistic Models in the Study of Language Day 4 Latent Variable Models Probabilistic Models in the Study of Language Day 4 Roger Levy UC San Diego Department of Linguistics Preamble: plate notation for graphical models Here is the kind of hierarchical

More information

1. Introduction. Hang Qian 1 Iowa State University

1. Introduction. Hang Qian 1 Iowa State University Users Guide to the VARDAS Package Hang Qian 1 Iowa State University 1. Introduction The Vector Autoregression (VAR) model is widely used in macroeconomics. However, macroeconomic data are not always observed

More information

Nonparametric Bayesian Methods - Lecture I

Nonparametric Bayesian Methods - Lecture I Nonparametric Bayesian Methods - Lecture I Harry van Zanten Korteweg-de Vries Institute for Mathematics CRiSM Masterclass, April 4-6, 2016 Overview of the lectures I Intro to nonparametric Bayesian statistics

More information

Spatial Mixture Modelling for Unobserved Point Processes: Examples in Immunofluorescence Histology

Spatial Mixture Modelling for Unobserved Point Processes: Examples in Immunofluorescence Histology Spatial Mixture Modelling for Unobserved Point Processes: Examples in Immunofluorescence Histology Chunlin Ji, Daniel Merl, Thomas B. Kepler and Mike West April 12, 2009 Abstract We discuss Bayesian modelling

More information

A Nonparametric Approach Using Dirichlet Process for Hierarchical Generalized Linear Mixed Models

A Nonparametric Approach Using Dirichlet Process for Hierarchical Generalized Linear Mixed Models Journal of Data Science 8(2010), 43-59 A Nonparametric Approach Using Dirichlet Process for Hierarchical Generalized Linear Mixed Models Jing Wang Louisiana State University Abstract: In this paper, we

More information

Flexible modeling for stock-recruitment relationships using Bayesian nonparametric mixtures

Flexible modeling for stock-recruitment relationships using Bayesian nonparametric mixtures Flexible modeling for stock-recruitment relationships using Bayesian nonparametric mixtures Kassandra Fronczyk, Athanasios Kottas Department of Applied Mathematics and Statistics, University of California,

More information

Factor Analytic Models of Clustered Multivariate Data with Informative Censoring (refer to Dunson and Perreault, 2001, Biometrics 57, )

Factor Analytic Models of Clustered Multivariate Data with Informative Censoring (refer to Dunson and Perreault, 2001, Biometrics 57, ) Factor Analytic Models of Clustered Multivariate Data with Informative Censoring (refer to Dunson and Perreault, 2001, Biometrics 57, 302-308) Consider data in which multiple outcomes are collected for

More information

Large-scale Ordinal Collaborative Filtering

Large-scale Ordinal Collaborative Filtering Large-scale Ordinal Collaborative Filtering Ulrich Paquet, Blaise Thomson, and Ole Winther Microsoft Research Cambridge, University of Cambridge, Technical University of Denmark ulripa@microsoft.com,brmt2@cam.ac.uk,owi@imm.dtu.dk

More information

Quantifying the Price of Uncertainty in Bayesian Models

Quantifying the Price of Uncertainty in Bayesian Models Provided by the author(s) and NUI Galway in accordance with publisher policies. Please cite the published version when available. Title Quantifying the Price of Uncertainty in Bayesian Models Author(s)

More information

Ronald Christensen. University of New Mexico. Albuquerque, New Mexico. Wesley Johnson. University of California, Irvine. Irvine, California

Ronald Christensen. University of New Mexico. Albuquerque, New Mexico. Wesley Johnson. University of California, Irvine. Irvine, California Texts in Statistical Science Bayesian Ideas and Data Analysis An Introduction for Scientists and Statisticians Ronald Christensen University of New Mexico Albuquerque, New Mexico Wesley Johnson University

More information

Marginal Specifications and a Gaussian Copula Estimation

Marginal Specifications and a Gaussian Copula Estimation Marginal Specifications and a Gaussian Copula Estimation Kazim Azam Abstract Multivariate analysis involving random variables of different type like count, continuous or mixture of both is frequently required

More information

Metropolis-Hastings Algorithm

Metropolis-Hastings Algorithm Strength of the Gibbs sampler Metropolis-Hastings Algorithm Easy algorithm to think about. Exploits the factorization properties of the joint probability distribution. No difficult choices to be made to

More information

Lecture Notes based on Koop (2003) Bayesian Econometrics

Lecture Notes based on Koop (2003) Bayesian Econometrics Lecture Notes based on Koop (2003) Bayesian Econometrics A.Colin Cameron University of California - Davis November 15, 2005 1. CH.1: Introduction The concepts below are the essential concepts used throughout

More information

Particle Learning for General Mixtures

Particle Learning for General Mixtures Particle Learning for General Mixtures Hedibert Freitas Lopes 1 Booth School of Business University of Chicago Dipartimento di Scienze delle Decisioni Università Bocconi, Milano 1 Joint work with Nicholas

More information

MCMC algorithms for fitting Bayesian models

MCMC algorithms for fitting Bayesian models MCMC algorithms for fitting Bayesian models p. 1/1 MCMC algorithms for fitting Bayesian models Sudipto Banerjee sudiptob@biostat.umn.edu University of Minnesota MCMC algorithms for fitting Bayesian models

More information

Computationally Efficient Estimation of Multilevel High-Dimensional Latent Variable Models

Computationally Efficient Estimation of Multilevel High-Dimensional Latent Variable Models Computationally Efficient Estimation of Multilevel High-Dimensional Latent Variable Models Tihomir Asparouhov 1, Bengt Muthen 2 Muthen & Muthen 1 UCLA 2 Abstract Multilevel analysis often leads to modeling

More information

Bayesian Inference in the Multivariate Probit Model

Bayesian Inference in the Multivariate Probit Model Bayesian Inference in the Multivariate Probit Model Estimation of the Correlation Matrix by Aline Tabet A THESIS SUBMITTED IN PARTIAL FULFILMENT OF THE REQUIREMENTS FOR THE DEGREE OF Master of Science

More information

Multiparameter models (cont.)

Multiparameter models (cont.) Multiparameter models (cont.) Dr. Jarad Niemi STAT 544 - Iowa State University February 1, 2018 Jarad Niemi (STAT544@ISU) Multiparameter models (cont.) February 1, 2018 1 / 20 Outline Multinomial Multivariate

More information

Partial factor modeling: predictor-dependent shrinkage for linear regression

Partial factor modeling: predictor-dependent shrinkage for linear regression modeling: predictor-dependent shrinkage for linear Richard Hahn, Carlos Carvalho and Sayan Mukherjee JASA 2013 Review by Esther Salazar Duke University December, 2013 Factor framework The factor framework

More information

Multivariate Normal & Wishart

Multivariate Normal & Wishart Multivariate Normal & Wishart Hoff Chapter 7 October 21, 2010 Reading Comprehesion Example Twenty-two children are given a reading comprehsion test before and after receiving a particular instruction method.

More information

Bayesian Nonparametric Regression for Diabetes Deaths

Bayesian Nonparametric Regression for Diabetes Deaths Bayesian Nonparametric Regression for Diabetes Deaths Brian M. Hartman PhD Student, 2010 Texas A&M University College Station, TX, USA David B. Dahl Assistant Professor Texas A&M University College Station,

More information

Nonparametric Bayes Density Estimation and Regression with High Dimensional Data

Nonparametric Bayes Density Estimation and Regression with High Dimensional Data Nonparametric Bayes Density Estimation and Regression with High Dimensional Data Abhishek Bhattacharya, Garritt Page Department of Statistics, Duke University Joint work with Prof. D.Dunson September 2010

More information

Spatial modeling for risk assessment of extreme values from environmental time series: A Bayesian nonparametric approach

Spatial modeling for risk assessment of extreme values from environmental time series: A Bayesian nonparametric approach Spatial modeling for risk assessment of extreme values from environmental time series: A Bayesian nonparametric approach Athanasios Kottas, Ziwei Wang and Abel Rodríguez Department of Applied Mathematics

More information

Scaling up Bayesian Inference

Scaling up Bayesian Inference Scaling up Bayesian Inference David Dunson Departments of Statistical Science, Mathematics & ECE, Duke University May 1, 2017 Outline Motivation & background EP-MCMC amcmc Discussion Motivation & background

More information

Colouring and breaking sticks, pairwise coincidence losses, and clustering expression profiles

Colouring and breaking sticks, pairwise coincidence losses, and clustering expression profiles Colouring and breaking sticks, pairwise coincidence losses, and clustering expression profiles Peter Green and John Lau University of Bristol P.J.Green@bristol.ac.uk Isaac Newton Institute, 11 December

More information

Image segmentation combining Markov Random Fields and Dirichlet Processes

Image segmentation combining Markov Random Fields and Dirichlet Processes Image segmentation combining Markov Random Fields and Dirichlet Processes Jessica SODJO IMS, Groupe Signal Image, Talence Encadrants : A. Giremus, J.-F. Giovannelli, F. Caron, N. Dobigeon Jessica SODJO

More information

Hybrid Copula Bayesian Networks

Hybrid Copula Bayesian Networks Kiran Karra kiran.karra@vt.edu Hume Center Electrical and Computer Engineering Virginia Polytechnic Institute and State University September 7, 2016 Outline Introduction Prior Work Introduction to Copulas

More information

Simulating Random Variables

Simulating Random Variables Simulating Random Variables Timothy Hanson Department of Statistics, University of South Carolina Stat 740: Statistical Computing 1 / 23 R has many built-in random number generators... Beta, gamma (also

More information

Sample Size Calculations for ROC Studies: Parametric Robustness and Bayesian Nonparametrics

Sample Size Calculations for ROC Studies: Parametric Robustness and Bayesian Nonparametrics Baylor Health Care System From the SelectedWorks of unlei Cheng Spring January 30, 01 Sample Size Calculations for ROC Studies: Parametric Robustness and Bayesian Nonparametrics unlei Cheng, Baylor Health

More information

Bayesian nonparametric Poisson process modeling with applications

Bayesian nonparametric Poisson process modeling with applications Bayesian nonparametric Poisson process modeling with applications Athanasios Kottas Department of Applied Mathematics and Statistics University of California, Santa Cruz Neyman Seminar Department of Statistics

More information

Wrapped Gaussian processes: a short review and some new results

Wrapped Gaussian processes: a short review and some new results Wrapped Gaussian processes: a short review and some new results Giovanna Jona Lasinio 1, Gianluca Mastrantonio 2 and Alan Gelfand 3 1-Università Sapienza di Roma 2- Università RomaTRE 3- Duke University

More information