Quantile regression in varying coefficient models

Size: px
Start display at page:

Download "Quantile regression in varying coefficient models"

Transcription

1 Quantile regression in varying coefficient models Irène Gijbels KU Leuven Department of Mathematics and Leuven Statistics Research Centre Belgium July 6, 2017 Irène Gijbels (KU Leuven) ICORS 2017, Wollongong, Australia July 6, / 43

2 Introduction: mean and quantile regression unconditional mean and quantile Y random variable, with cumulative distribution function F Y mean or expectation of Y : E(Y ) = argmin E(Y a) 2 a quantiles of Y, with the median as a special case for 0 < τ < 1, the τ-th order quantile of Y q τ (Y ) = F 1 Y (τ) = inf{y : F Y (y) τ} Irène Gijbels (KU Leuven) ICORS 2017, Wollongong, Australia July 6, / 43

3 Introduction: mean and quantile regression q τ (Y ) = F 1 Y (τ) = inf{y : F Y (y) τ} F(x) 0 1/7 2/7 3/7 4/7 5/7 6/7 1 F 1 (p) /7 2/7 3/7 4/7 5/7 6/7 1 x Y Exponential(λ) (F Y strictly increasing) F Y (y) = 1 e λy (y > 0) F 1 τ) Y (τ) = ln(1 λ p (0 < τ < 1)) mean E(Y ) one characteristic of the distribution of Y quantile function F 1 Y (τ) fully characterizes Irène Gijbels (KU Leuven) ICORS 2017, Wollongong, Australia July 6, / 43

4 Introduction: mean and quantile regression E(Y ) = argmin a E(Y a) 2 q τ (Y ) = argmin a E [ρ τ (Y a)] q 0.5 (Y ) = argmin a E Y a with ρ τ ( ) the so-called check function, or the pinball loss function,... ρ τ (z) = { τ z if z > 0 (1 τ) z otherwise ρ τ(u) Rho_0.5 Rho_ u Irène Gijbels (KU Leuven) ICORS 2017, Wollongong, Australia July 6, / 43

5 Introduction: mean and quantile regression estimation of unconditional mean and quantile Y 1,..., Y n i.i.d. sample from Y E(Y a) 2 1 n n (Y i a) 2 i=1 population quantity mean E(Y ) = argmin a E(Y a) 2 argmin a 1 n estimator sample mean n (Y i a) 2 = 1 n i=1 non robust to outliers n i=1 Y i quantile / median sample quantile / median 1 n q τ (Y ) = argmin a E [ρ τ (Y a)] argmin a ρ τ (Y i a) n i=1 robust to outliers Irène Gijbels (KU Leuven) ICORS 2017, Wollongong, Australia July 6, / 43

6 Introduction: mean and quantile regression conditional mean and quantile / mean and quantile regression Y response variable X (1),..., X (p) covariates multiple linear regression model: Y = β 0 + β 1 X (1) β p X (p) + ε mean regression function p E(Y X (1),..., X (p) ) = β 0 + β j X (j) + E (ε X (1),..., X (p)) j=1 E ( ε X (1),..., X (p)) = 0 = E(Y X(1),..., X (p) ) = β 0 + β = (β 0, β 1,..., β p ) T argmin β estimation by least squares method E ( Y β 0 p β j X (j) j=1 p β j X (j)) 2 Irène Gijbels (KU Leuven) ICORS 2017, Wollongong, Australia July 6, / 43 j=1

7 Introduction: mean and quantile regression quantile regression function Y = β 0 + β 1 X (1) β p X (p) + ε τth conditional quantile of Y given X (1),..., X (p) q τ (Y X (1),..., X (p) ) = β 0 + d j=1 β j X j +F 1 ε X (1),...,X (p) (τ X (1),..., X (p) ) if F 1 (τ X (1),..., X (p) ) = F 1 ε X (1),...,X (p) ε (τ), then p p q τ (Y X (1),..., X (p) ) = β0 + Fε 1 (τ) + β }{{} j X (j) = β0 τ + β j X (j) =β0 τ j=1 j=1 p β τ = (β0 τ, β 1,..., β p ) T argmin β τ E ρ τ Y β0 τ β j X (j) j=1 Irène Gijbels (KU Leuven) ICORS 2017, Wollongong, Australia July 6, / 43

8 Introduction: mean and quantile regression multiple linear regression model: Y = β 0 + β 1 X (1) β p X (p) + ε mean regression quantile regression homoscedastic regression homoscedastic setting E ( ε X (1),..., X (p)) = 0 Var ( ε X (1),..., X (p)) = σ 2 (τ X (1),..., X (p) )= F 1 (τ) ε X (1),...,X (p) F 1 ε heteroscedastic regression heteroscedastic setting ε = σ ( X (1),..., X (p)) ε ε = σ ( X (1),..., X (p)) ε σ(,..., ) 0 E (ε) = 0 Var (ε) = 1 identifiability assumption(s) Var ( ε X (1),..., X (p)) ( = σ 2 X (1),..., X (p)) } {{ } to be estimated F 1 (τ X (1),..., X (p) ) ε X (1),...,X (p) = σ (X (1),..., X (p)) Fε 1 (τ) }{{}}{{} possibly to be estimated to be estimated Irène Gijbels (KU Leuven) ICORS 2017, Wollongong, Australia July 6, / 43

9 Introduction: mean and quantile regression multiple linear regression model: Y = β 0 + β 1 X (1) β p X (p) + V ( X (1),..., X (p)) ε V 0 location-scale model examples heteroscedasticity structures: ) ( λ V (X (1),..., X (p) = σ 1 + γ 1X (1) γ px ) (p) (Box and Hill (1974),...) ) V (X (1),..., X (p) = σg(x (1),..., X (p), λ, γ 1,..., γ p) G known function Bianco et al. (2000), Bianco & Boente (2002),... ) ln V (X (1),..., X (p) = γ 0 + γ 1X (1) γ px (p) Irène Gijbels (KU Leuven) ICORS 2017, Wollongong, Australia July 6, / 43

10 Introduction: mean and quantile regression example: Y = β 0 + β 1 X + V (X)ε ε independent of X V (X) 0 V (X) = γ γ > 0 q τ (Y X) = β 0 + β 1 X + γ Fε 1 (τ) = β0 τ + β 1X V (X) = γx 2 q τ (Y X) = β 0 + β 1 X + γx 2 F 1 ε (τ) = β 0 + β 1 X + γ τ X 2 V (X) = 1 + γx 2 q τ (Y X) = β 0 + β 1 X + (1 + γx 2 ) F 1 ε (τ) = β τ 0 + β 1X + γ τ X 2 Koenker & Bassett (1978),..., Koenker (2005),... Irène Gijbels (KU Leuven) ICORS 2017, Wollongong, Australia July 6, / 43

11 Introduction: mean and quantile regression Y = β 0 + β 1 X + V (X)ε ε N(0; 1) β 0 = 2 β 1 = 0.5 γ = 1 V (X) = γ V (X) = γ X 2 V (X) = 1 + γ X 2 q τ (Y X) = β τ 0 + β 1X q τ (Y X) = β 0 + β 1 X + γ τ X 2 q τ (Y X) = β τ 0 + β 1X + γ τ X 2 quantile function quantile function quantile function x x x Figure: Examples of regression quantiles. τ = Irène Gijbels (KU Leuven) ICORS 2017, Wollongong, Australia July 6, / 43

12 Introduction: mean and quantile regression example: esterase data data set involves 113 assays for the concentration of the enzyme esterase Y = the number of bindings counted by a radio-immunoassay procedure X = observed esterase concentration 1400 Quantile curves 1400 local linear Quantile curves data points global linear linear fits RIA Count RIA Count Esterase Concentration Esterase Concentration τ = Irène Gijbels (KU Leuven) ICORS 2017, Wollongong, Australia July 6, / 43

13 Introduction: mean and quantile regression example: triceps skinfold measurements triceps skinfold measurements of 892 girls and women up to age 50, recorded in three Gambian villages during the dry season of 1989 Y = skinfold measurement X = age of the girl/woman 40 Quantile curves Triceps Skinfold Age (yr) Irène Gijbels (KU Leuven) ICORS 2017, Wollongong, Australia July 6, / 43

14 Introduction: mean and quantile regression Y = β 0 + β 1 X (1) β p X (p) +error term }{{} signal part several important modeling and inference questions is the model specification about the signal part correct? is the signal part a linear function of the covariates? is there a parametric function that adequately describes the signal part? is there any structure in the error term? if yes, which heteroscedastic structure? do all covariates have a significant impact on the conditional quantile function q τ (Y X (1),..., X (p) )? how to select variables that have an impact on the quantile functions and/or on the heteroscedastic error structure? robust methods? Irène Gijbels (KU Leuven) ICORS 2017, Wollongong, Australia July 6, / 43

15 Introduction: mean and quantile regression other issues... preventing for crossing estimated quantile functions response Y covariate X F Y X : conditional distribution function of Y X theoretically: quantile curves should not cross each other: indeed, for any given conditional distribution F Y X ( x), with x fixed, and with 0 < τ 1 τ 2 < 1, and hence {y : F Y X (y x) τ 1 } {y : F Y X (y x) τ 2 } q τ1 (x) = inf{y : F Y X (y x) τ 1 } inf{y : F Y X (y x) τ 2 } = q τ2 (x) x in practice: estimated quantile curves can cross adaptations are needed to prevent this from happening... Irène Gijbels (KU Leuven) ICORS 2017, Wollongong, Australia July 6, / 43

16 Introduction: mean and quantile regression methods to prevent crossing estimated quantile curves random vector (Y, X) Y = β 0 + β 1 X + ε (Y 1, X 1 ),... (Y n, X n ) i.i.d. observations from (Y, X) n quantile regression estimation: solve min (β τ 0,β 1 ) ρ τ (Y i β0 τ β 1 X i ) investigated approaches: i=1 simultaneous quantile objective function estimation consider 0 < τ 1 < τ 2 <... τ H 1 < τ H < 1; solve min (β τ 1 0,βτ 2 0,...,βτ H 0,β 1 ) H n h=1 i=1 ρ τh (Y i β τ h 0 β 1 X i ) and introduce constraints to impose non-crossing composite quantile regression; Zou & Yuan (2008), Bondell et al. (2010), Schnabel & Eilers (2013, quantile sheet), Jiang et al. (2016), Yang & Liu (2016),... Irène Gijbels (KU Leuven) ICORS 2017, Wollongong, Australia July 6, / 43

17 Introduction: mean and quantile regression weighted simultaneous quantile objective function estimation: solve min (β τ 1 0,βτ 2 0,...,βτ H 0,β 1 ) H n W h h=1 i=1 ρ τh (Y i β τ h 0 β 1 X i ) (how to choose the weights? preferably in a data-driven way)... Koenker (2004, 2011), Lamarche (2011), Alexander et al. (2011), Zhao & Xiao (2014), Jiang et al. (2014),... stepwise procedure (Wu & Liu (2009)) approach of He (1997), in linear quantile regression settings Irène Gijbels (KU Leuven) ICORS 2017, Wollongong, Australia July 6, / 43

18 Flexible quantile regression modeling varying coefficient regression model multiple linear regression model: Y = β 0 + β 1 X (1) β p X (p) + ε complex data flexible modelling varying coefficient regression model: Y (T) = β 0 (T) + β 1 (T)X (1) (T) β p (T)X (p) (T) + ε(t) (Y (T ), X (1) (T ),..., X (p) (T ), T ) random vector Hastie & Tibshirani (1993), Hoover et al. (1998),..., Honda (2004), Kim (2006),..., Wang et al. (2008),..., Antoniadis, G. & Verhasselt (2012a), Andriyana (2015), Xie et al. (2015),... other flexible models: additive models, single-index models,... Irène Gijbels (KU Leuven) ICORS 2017, Wollongong, Australia July 6, / 43

19 Flexible quantile regression modeling additive quantile regression: q τ (Y X (1),..., X (p) ) = β 0 + p j=1 g j ( X (j)) β 0 parameter; g 1 ( ),..., g p ( ) unknown univariate functions Yu & Lu (2004), Koenker (2011),... partially linear additive quantile regression: for 1 q < p p q τ (Y X (1),..., X (p) ) = β 0 + β 1 X (1) β q X (q) + j=q+1 β 0,..., β q parameters; g q+1 ( ),..., g p ( ) univariate functions Sherwood & Wang (2016),... single index quantile regression: q τ (Y X (1),..., X (p) ) = g 0 ( β0 + β 1 X (1) β p X (p)) β 0,..., β p parameters; g 0 ( ) univariate functions g j ( X (j)) Choudhuri et al. (1997), Wu et al. (2010), Kong & Xia (2012), Ma & He (2016), Christou & Akritad (2016), Zhao et al. (2017),... Irène Gijbels (KU Leuven) ICORS 2017, Wollongong, Australia July 6, / 43

20 general setting Quantile regression in varying coefficient models Y (T ) = β 0 (T ) + β 1 (T )X (1) (T ) + + β p (T )X (p) (T ) + ε = X T (T )β(t ) + V (X(T ), T ) ε where ε is independent of (X(T ), T ) X(t) = ( 1, X (1) (t),..., X (p) (t) ) T (extended) covariate vector at time t β(t) = (β 0 (t), β 1 (t),..., β p (t)) T vector of (p + 1) unknown univariate regression coefficients at time t β 0 (t) is the baseline effect special settings: V (X(T ), T ) = V (T ) V (X(T ), T ) = V a constant simple heteroscedastic setting homoscedastic setting assumptions to ensure identifiability needed in all settings Andriyana (2015), Andriyana & G. (2017), Andriyana et al. (2017),... Irène Gijbels (KU Leuven) ICORS 2017, Wollongong, Australia July 6, / 43

21 Quantile regression in varying coefficient models heteroscedastic varying coefficient models Y (T ) = β 0 (T ) + β 1 (T )X (1) (T ) β p (T )X (p) (T ) + V (X(T ), T ) ε V (X(T ), T ) = γ 0 (T ) + γ 1 (T )X (1) (T ) γ p (T )X (p) (T ) 0 from the model and the error structure: Y (T ) = X T (T )β(t ) + X T (T )γ(t ) ε where β(t) = (β 0 (t), β 1 (t),..., β p (t)) T and γ(t) = (γ 0 (t), γ 1 (t),..., γ p (t)) T denote: F 1 ε (τ) = inf {u : F ε (u) τ} τ-th conditional quantile can be re-written as q τ (Y (t) X(t), t) = X T (t)β(t) + X T (t)γ(t) Fε 1 (τ) = X T (t) [ β(t) + γ(t)fε 1 (τ) ] Irène Gijbels (KU Leuven) ICORS 2017, Wollongong, Australia July 6, / 43

22 Quantile regression in varying coefficient models general heteroscedastic varying coefficient model Y (T ) = β 0 (T ) + β 1 (T )X (1) (T ) β p (T )X (p) (T ) + V (X(T ), T ) ε V (X(T ), T ) = γ 0 (T ) + γ 1 (T )X (1) (T ) γ p (T )X (p) (T ) q τ (Y (t) X(t), t) = X T (t) [ β(t) + γ(t)fε 1 (τ) ] = X T (t)β τ (t) aim: estimate the conditional quantile estimated quantile curves should not cross each other estimate β 0 (t), β 1 (t),..., β p (t) estimate γ 0 (t), γ 1 (t),..., γ p (t), and variability function V (, ) simple heteroscedastic model: Y (T ) = β 0 (T ) + β 1 (T )X (1) (T ) β p (T )X (p) (T ) + V (T ) ε q τ (Y (t) X(t), t) = X T (t)β(t) + V (t) Fε 1 (τ) = X T (t)β τ (t) where β τ (t) = (β τ 0 (t), β 1(t),..., β p (t)) T, with now β0 τ (t) = β 0(t) + V (t)fε 1 (τ) Irène Gijbels (KU Leuven) ICORS 2017, Wollongong, Australia July 6, / 43

23 Quantile regression in varying coefficient models observational setting: longitudinal data setup n independent subjects/individuals for each individual i: measurements repeated over a time period measurements at time points t i1,..., t ini N i different measurements for response and all explanatory variables: Y (t ij ) = Y ij X (k) (t ij ) = X (k) ij k = 1,..., p = X(t ij ) not. = X ij = (1, X (1) ij,..., X(p) ij )T total number of observations over all individuals: N = n i=1 N i Irène Gijbels (KU Leuven) ICORS 2017, Wollongong, Australia July 6, / 43

24 Real data example CD4 data example the data are a subset from the Multicenter AIDS Cohort Study (Kaslow et al. (1987)) 283 homosexual men who became HIV-positive between 1984 and 1991 data set contains repeated measurements of physical examinations, laboratory results, CD4 cell counts and CD4 percentages of each individual unequal numbers of repeated measurements and different measurement times for each individual number of repeated measurements ranges from 1 to 14 the number of distinct time points was 59 Irène Gijbels (KU Leuven) ICORS 2017, Wollongong, Australia July 6, / 43

25 Real data example response variable : Y (t) = CD4 percentage at time t after infection covariates: X (1) i the smoking status of the i-th individual (1 or 0 if the individual ever or never smoked cigarettes) X (2) i X (3) i the centered age at HIV infection for the i-th individual the centered pre-infection CD4 percentage aim: try to evaluate the median effect of cigarette smoking, pre-hiv infection CD4 cell percentage and age at HIV infection on the CD4 percentage after infection response Y (t) = β 0 (t) + β 1 (t)x (1) + β 2 (t)x (2) + β 3 (t)x (3) + V(t) ε Y (t) = V(X(t), t) ε Irène Gijbels (KU Leuven) ICORS 2017, Wollongong, Australia July 6, / 43

26 Real data example baseline CD Fitted coefficient function coefficient of smoking status Fitted coefficient function 95%CI Percentile method 95%CI Percentile method time since infection time since infection coefficient of age Fitted coefficient function coefficient of pre infection CD Fitted coefficient function 95%CI Percentile method 95%CI Percentile method time since infection time since infection Figure: τ = 0.5, γ = 1, estimated coefficient fcts, 95% bootstrap CI. Irène Gijbels (KU Leuven) ICORS 2017, Wollongong, Australia July 6, / 43

27 Real data example time since infection CD4 percentage after infection τ = 0.5 τ = 0.1 τ = time since infection CD4 percentage after infection τ = 0.5 τ = 0.1 τ = 0.9 Figure: Estimated quantile curves: τ = 0.1 (dashed curves), τ = 0.5 (solid curves) and τ = 0.9 (dotted curves) for (left) median and (right) maximum of covariate values. median covariate case: nonsmoking, 32.6 years old patient, with pre-infection CD4 of 42.3% τ = 0.5: estimated to have a CD4 percentage of 24.37% after 6 years Irène Gijbels (KU Leuven) ICORS 2017, Wollongong, Australia July 6, / 43

28 Real data example Example: CD4 data CD4 percentage after infection τ = 0.95 τ = 0.75 τ = 0.5 τ = 0.25 τ = 0.05 CD4 percentage after infection τ = 0.95 τ = 0.75 τ = 0.5 τ = 0.25 τ = time since infection (a) Individual time since infection (b) AHe approach. Irène Gijbels (KU Leuven) ICORS 2017, Wollongong, Australia July 6, / 43

29 Real data example time since infection V^(.) o V^(X(t), t) V^(t) Figure: Estimated variability functions. seems that V (T ) constant seems that V (X(T ), T ) = V (T ) suffices (simple heteroscedastic structure) R package QRegVCM (Yudhie Andriyana) Irène Gijbels (KU Leuven) ICORS 2017, Wollongong, Australia July 6, / 43

30 Quantile estimation in varying coefficient models varying coefficient models Y (T) = β 0 (T) + β 1 (T)X (1) (T) β p (T)X (p) (T) + ε(t) ε(t ) independent of (X (1) (T ),..., X (p) (T )) aim: based on longitudinal data (Y ij, X ij ), i = 1,..., n; j = 1,..., N i, estimate τth conditional quantile function (0 < τ < 1) q τ (Y (t) X(t), t) = β τ 0 (t) {}}{ β 0 (t) +β 1 (t)x (1) (t) β p (t)x (p) (t) approximate each univariate function by normalized B-splines... use penalization to prevent from overfitting... Irène Gijbels (KU Leuven) ICORS 2017, Wollongong, Australia July 6, / 43

31 Quantile estimation in varying coefficient models approximate each univariate function β k ( ) by normalized B-splines... coefficient functions β k ( ): can be of different degree of smoothness; given knots; B-splines are polynomial pieces of degree ν k joined together at each knot point; for k = 0,..., p, approximate β k (t) β k (t) α k1 B k1 (t; ν k ) α kmk B kmk (t; ν k ) = α k = (α k1,..., α kmk ) T m k = u k + ν k normalized B-splines: m k α kl B kl (t; ν k ) l=1 = α T k B k(t; ν k ) B k (t; ν k ) = (B k1 (t; ν k ),..., B kmk (t; ν k )) T u k + 1 = number of equidistant knot points m k B kl (t; ν k ) = 1 l=1 Irène Gijbels (KU Leuven) ICORS 2017, Wollongong, Australia July 6, / 43

32 Quantile estimation in varying coefficient models estimation of global vector of all unknown coefficients α = ( α T 0,..., ) T αt p quality of the fit measured via the goodness-of-fit quantity n 1 N i N i=1 i j=1 ) p m k ρ τ (Y ij α kl B kl (t ij ; ν k )X (k) ij k=0 l=1 reducing the modelling bias: use a large number of basis functions but this leads to overfitting... prevent this to happen by adding a penalty term Eilers & Marx (1996),... Irène Gijbels (KU Leuven) ICORS 2017, Wollongong, Australia July 6, / 43

33 Quantile estimation in varying coefficient models... adding a penalty term: minimize ) n N 1 i p m k ρ τ (Y ij α kl B kl (t ij ; ν k )X (k) ij + N i=1 i j=1 k=0 l=1 p m k k=0 l=d k +1 λ k d k α kl γ where γ > 0 (for example γ = 1) λ k > 0, k = 0,..., p : smoothing (regularization) parameters d k = the d k th order differencing operator of the kth variable, i.e d kα kl = d k t=0 ( 1)t( d kt ) αk(l t) with d k IN d k = 1 : d k = 2 : α kl = α kl α k,l 1 2 α kl = ( α kl ) = α kl α k,l 1 = 1 α kl 2α k,l α k,l 2 denote by α k the resulting P-splines estimator for the vector α k, k = 0,..., p estimator for the τ th conditional quantile function? Irène Gijbels (KU Leuven) ICORS 2017, Wollongong, Australia July 6, / 43

34 Quantile estimation in varying coefficient models q τ (Y (t) X(t), t) = = β τ 0 {}} (t) { β 0 (t) +β 1 (t)x (1) (t) β p (t)x (p) (t) d m k α kl B kl (t ij ; ν k )X (k) ij k=0 l=1 p α T kb k (t; ν k )X (k) (t) = X T (t)b(t)α k=0 B 01 (t, ν 0 )... B 0m0 (t, ν 0 ) B(t) = B p1 (t, ν p)... B pmp (t, ν p) P-splines estimator for β k (t) and and the conditional regression quantile : β k (t) = m k l=1 α kl B kl (t; ν k ) q τ (Y ij X ij, t ij ) = p m k k=0 l=1 α kl B kl (t ij ; ν k )X (k) ij Irène Gijbels (KU Leuven) ICORS 2017, Wollongong, Australia July 6, / 43

35 Quantile estimation in varying coefficient models simple heteroscedastic varying coefficient model q τ (Y (t) X(t), t) = X T (t)β(t) + V (t) Fε 1 (τ) = X T (t) [ β(t) + V (t)fε 1 (τ) ] general heteroscedastic varying coefficient model q τ (Y (t) X(t), t) = X T (t) [ β(t) + γ(t) Fε 1 (τ) ] adapt methods to allow/estimate (general) heteroscedastic error structure... V (t) B-splines V (t) αl V BV l (t; νv ) m V l=1 } {{ } B V (t)α V γ 0 (t) approximated by B-splines γ 1 (t) approximated by B-splines. γ p (t) approximated by B-splines V (X(t), t) = X T (t)b v (t)γ Andriyana, G. & Verhasselt (2017), Andriyana & G. (2017) R package QRegVCM (Yudhie Andriyana) Irène Gijbels (KU Leuven) ICORS 2017, Wollongong, Australia July 6, / 43

36 Real data example Example: UK employment data data from Datastream International (R-package plm under EmplUK) records of employment and remuneration (i.e. wage bill) for all United Kingdom quoted companies concerns 140 United Kingdom firms observed each year from (e.g. Arellano and Bond (1991)) data are unbalanced: some firms have more observations than others; observations correspond to different points in historical time range of observations per firm is seven to nine (i.e. min(n i ) = 7 and max(n i ) = 9) Irène Gijbels (KU Leuven) ICORS 2017, Wollongong, Australia July 6, / 43

37 Real data example response variable Y (t): log(employment) at time t = log(number of employees) at time t 3 covariates: X (1) (t) : average annual wage per employee in the company (at year t) X (2) (t) : capital defined as the book value of gross fixed assets X (3) (t) : an index of value-added output at constant cost Y (t) = β 0 (t) + β 1 (t)x (1) (t) + β 2 (t)x (2) (t) + β 3 (t)x (3) (t) + V(t) ε we also adapt the AHe approach further to allow for Y (t) = β 0 (t) + β 1 (t)x (1) (t) + β 2 (t)x (2) (t) + β 3 (t)x (3) (t) + V(X(t), t) ε Irène Gijbels (KU Leuven) ICORS 2017, Wollongong, Australia July 6, / 43

38 Real data example coefficient of capital coefficient of capital year year (a) Coefficient of capital ( β 2 (t)). (b) Coefficient of capital ( γ 2 (t)). Figure: UK employment data. The estimated influence of the covariate capital on the main quantile function (a) and on the heteroscedasticity (b)), using the general AHe approach method. Irène Gijbels (KU Leuven) ICORS 2017, Wollongong, Australia July 6, / 43

39 Real data example year V^(.) o V^(X(t), t) V^(t) Figure: UK employment data. Variability estimators: V (X(t), t) and V (t). most appropriate: general heteroscedastic structure: V (X(t), t) Irène Gijbels (KU Leuven) ICORS 2017, Wollongong, Australia July 6, / 43

40 Real data example baseline log employment coefficient of output year year (a) Basel. log-employment ( β 0 (t)). (b) Coef. of output ( β 3 (t)). coefficient of output x year (c) Coef. of output ( γ 3 (t)). Irène Gijbels (KU Leuven) ICORS 2017, Wollongong, Australia July 6, / 43

41 Shape testing in quantile regression of interest is to look into testing problems of the form H 0 : β k (t) = constant versus H 1 : β k (t) β k H 0 : β k (t) is monotone versus H 1 : β k (t) is not monotone H 0 : β k (t) is convex versus H 1 : β k (t) is not a convex function similar hypothesis for functional coefficients γ k (t) in the variability function and simultaneous testing problems (e.g. β k (t) is increasing and γ l (t) is constant) by using with B-spline approximations, tests for these problems can be derived G., Ibrahim & Verhasselt (2017) Irène Gijbels (KU Leuven) ICORS 2017, Wollongong, Australia July 6, / 43

42 Shape testing in quantile regression Thank you! Irène Gijbels (KU Leuven) ICORS 2017, Wollongong, Australia July 6, / 43

43 Shape testing in quantile regression Andriyana, Y. (2016). R package QRegVCM Andriyana, Y. & Gijbels, I. (2017). Quantile regression in heteroscedastic varying coefficient models. AStA Advances in Statistical Analysis, 101, Andriyana, Y., Gijbels, I. & Verhasselt, A. (2014). P-splines quantile regression estimation in varying coefficient models. Test, 23, Andriyana, Y., Gijbels, I. & Verhasselt, A. (2017). Quantile regression in varying coefficient models: non-crossingness and heteroscedasticity. Statistical Papers, to appear. Antoniadis, A., Gijbels, I. & Verhasselt, A. (2012). Variable selection in varying coefficient models using P-splines. Journal of Computational and Graphical Statistics, 21, Gijbels, I., Ibrahim, M. & Verhasselt, A. (2017). Shape testing in quantile varying coefficient models with heteroscedastic error. Journal of Nonparametric Statistics, 29, Irène Gijbels (KU Leuven) ICORS 2017, Wollongong, Australia July 6, / 43

44 Shape testing in quantile regression AHe s approach (adaptation of idea of He (1987)) for identifiability reasons : (H1): q 0.5 (ε(t)) = 0 (i.e. a 0.5 (t) = 0) (H2): q 0.5 ( ε(t) ) = 1 Step 1. Under (H1), the median quantile function is q 0.5 (Y (t) X(t), t) = β 0 (t)x (0) (t) + β 1 (t)x (1) (t) + + β p (t)x (p) (t) = X T (t)β(t) estimated median regression curve: q 0.5 (Y (t) X(t), t) = β(t) Irène Gijbels (KU Leuven) ICORS 2017, Wollongong, Australia July 6, / 43

45 Shape testing in quantile regression Step 2. from the model and since V (t) 0, we have Y (t) X T (t)β(t) = V (t) ε(t) using (H2) q 0.5 ( Y (t) X T (t)β(t) ) = V (t) estimate V ( ) using a P-splines estimation technique, and pseudo-observations Y (t ij ) X T (t ij ) β(t ij ) for Y (t) X T (t)β(t) V (t) αl V BV l (t; νv ) m V l=1 obtain ( α 1 V,..., αv ) by minimizing m V N n i 1 ρ 0.5 Y (tij) X T m V (t ij) β(t ij) αl V Bl V (t ij; ν V ) + N i i=1 j=1 l=1 estimated variability function : V mv (t) = l=1 m V l=d V +1 α V l BV l (t; νv ) λ V d Vα V l Irène Gijbels (KU Leuven) ICORS 2017, Wollongong, Australia July 6, / 43

46 Shape testing in quantile regression Step 3. equipped with ( β 0 (t),..., β p (t)) from Step 1 and the estimator V (t) from Step 2, and recall that Y (t) X T (t)β(t) = V (t)ε(t) substitute β(t) by β(t) and V (t) by V (t) m h approximate : a τ h(t ij ) α q h,l Bq l (t ij; ν h ) minimize l=1 n 1 N i ρ τh Y (t ij ) X T m h m h (t ij ) β(t ij ) V (t ij ) α q h,l i=1 N Bq l (t ij ; ν h ) q + λ i j=1 l=1 l=d q h dq h α q h,l h +1 estimator of a τ h(t) : estimated quantile function â τ h(t) = m h l=1 αq h,l Bq l (t; ν h) q τh (Y (t) X(t), t) = X T (t) β(t) + V (t)â τ h (t) Irène Gijbels (KU Leuven) ICORS 2017, Wollongong, Australia July 6, / 43

Smoothing and variable selection using P-splines

Smoothing and variable selection using P-splines Smoothing and variable selection using P-splines Irène Gijbels Department of Mathematics & Leuven Statistics Research Center Katholieke Universiteit Leuven, Belgium Joint work with Anestis Antoniadis,

More information

Package QRegVCM. March 16, 2018

Package QRegVCM. March 16, 2018 Type Package Package QRegVCM March 16, 2018 Title Quantile Regression in Varying-Coefficient Models Version 1.2 Date 2018-02-26 Author 'Andriyana, Y.' [aut, cre], 'Ibrahim M. A.' [aut], 'Gijbels, I.' [ctb],

More information

A DYNAMIC QUANTILE REGRESSION TRANSFORMATION MODEL FOR LONGITUDINAL DATA

A DYNAMIC QUANTILE REGRESSION TRANSFORMATION MODEL FOR LONGITUDINAL DATA Statistica Sinica 19 (2009), 1137-1153 A DYNAMIC QUANTILE REGRESSION TRANSFORMATION MODEL FOR LONGITUDINAL DATA Yunming Mu and Ying Wei Portland State University and Columbia University Abstract: This

More information

Issues on quantile autoregression

Issues on quantile autoregression Issues on quantile autoregression Jianqing Fan and Yingying Fan We congratulate Koenker and Xiao on their interesting and important contribution to the quantile autoregression (QAR). The paper provides

More information

Single Index Quantile Regression for Heteroscedastic Data

Single Index Quantile Regression for Heteroscedastic Data Single Index Quantile Regression for Heteroscedastic Data E. Christou M. G. Akritas Department of Statistics The Pennsylvania State University SMAC, November 6, 2015 E. Christou, M. G. Akritas (PSU) SIQR

More information

Estimation of cumulative distribution function with spline functions

Estimation of cumulative distribution function with spline functions INTERNATIONAL JOURNAL OF ECONOMICS AND STATISTICS Volume 5, 017 Estimation of cumulative distribution function with functions Akhlitdin Nizamitdinov, Aladdin Shamilov Abstract The estimation of the cumulative

More information

Interquantile Shrinkage in Regression Models

Interquantile Shrinkage in Regression Models Interquantile Shrinkage in Regression Models Liewen Jiang, Huixia Judy Wang, Howard D. Bondell Abstract Conventional analysis using quantile regression typically focuses on fitting the regression model

More information

Single Index Quantile Regression for Heteroscedastic Data

Single Index Quantile Regression for Heteroscedastic Data Single Index Quantile Regression for Heteroscedastic Data E. Christou M. G. Akritas Department of Statistics The Pennsylvania State University JSM, 2015 E. Christou, M. G. Akritas (PSU) SIQR JSM, 2015

More information

Quantile Regression Methods for Reference Growth Charts

Quantile Regression Methods for Reference Growth Charts Quantile Regression Methods for Reference Growth Charts 1 Roger Koenker University of Illinois at Urbana-Champaign ASA Workshop on Nonparametric Statistics Texas A&M, January 15, 2005 Based on joint work

More information

A Resampling Method on Pivotal Estimating Functions

A Resampling Method on Pivotal Estimating Functions A Resampling Method on Pivotal Estimating Functions Kun Nie Biostat 277,Winter 2004 March 17, 2004 Outline Introduction A General Resampling Method Examples - Quantile Regression -Rank Regression -Simulation

More information

Penalized estimation in additive varying coefficient models using grouped regularization

Penalized estimation in additive varying coefficient models using grouped regularization Noname manuscript No. (will be inserted by the editor) Penalized estimation in additive varying coefficient models using grouped regularization A. Antoniadis I. Gijbels S. Lambert-Lacroix Received: date

More information

Econometrics of Panel Data

Econometrics of Panel Data Econometrics of Panel Data Jakub Mućk Meeting # 6 Jakub Mućk Econometrics of Panel Data Meeting # 6 1 / 36 Outline 1 The First-Difference (FD) estimator 2 Dynamic panel data models 3 The Anderson and Hsiao

More information

Homogeneity Pursuit. Jianqing Fan

Homogeneity Pursuit. Jianqing Fan Jianqing Fan Princeton University with Tracy Ke and Yichao Wu http://www.princeton.edu/ jqfan June 5, 2014 Get my own profile - Help Amazing Follow this author Grace Wahba 9 Followers Follow new articles

More information

A direct formulation for sparse PCA using semidefinite programming

A direct formulation for sparse PCA using semidefinite programming A direct formulation for sparse PCA using semidefinite programming A. d Aspremont, L. El Ghaoui, M. Jordan, G. Lanckriet ORFE, Princeton University & EECS, U.C. Berkeley Available online at www.princeton.edu/~aspremon

More information

Efficient Estimation in Convex Single Index Models 1

Efficient Estimation in Convex Single Index Models 1 1/28 Efficient Estimation in Convex Single Index Models 1 Rohit Patra University of Florida http://arxiv.org/abs/1708.00145 1 Joint work with Arun K. Kuchibhotla (UPenn) and Bodhisattva Sen (Columbia)

More information

Quantile methods. Class Notes Manuel Arellano December 1, Let F (r) =Pr(Y r). Forτ (0, 1), theτth population quantile of Y is defined to be

Quantile methods. Class Notes Manuel Arellano December 1, Let F (r) =Pr(Y r). Forτ (0, 1), theτth population quantile of Y is defined to be Quantile methods Class Notes Manuel Arellano December 1, 2009 1 Unconditional quantiles Let F (r) =Pr(Y r). Forτ (0, 1), theτth population quantile of Y is defined to be Q τ (Y ) q τ F 1 (τ) =inf{r : F

More information

Ensemble estimation and variable selection with semiparametric regression models

Ensemble estimation and variable selection with semiparametric regression models Ensemble estimation and variable selection with semiparametric regression models Sunyoung Shin Department of Mathematical Sciences University of Texas at Dallas Joint work with Jason Fine, Yufeng Liu,

More information

SEMIPARAMETRIC QUANTILE REGRESSION WITH HIGH-DIMENSIONAL COVARIATES. Liping Zhu, Mian Huang, & Runze Li. The Pennsylvania State University

SEMIPARAMETRIC QUANTILE REGRESSION WITH HIGH-DIMENSIONAL COVARIATES. Liping Zhu, Mian Huang, & Runze Li. The Pennsylvania State University SEMIPARAMETRIC QUANTILE REGRESSION WITH HIGH-DIMENSIONAL COVARIATES Liping Zhu, Mian Huang, & Runze Li The Pennsylvania State University Technical Report Series #10-104 College of Health and Human Development

More information

Quantile Processes for Semi and Nonparametric Regression

Quantile Processes for Semi and Nonparametric Regression Quantile Processes for Semi and Nonparametric Regression Shih-Kang Chao Department of Statistics Purdue University IMS-APRM 2016 A joint work with Stanislav Volgushev and Guang Cheng Quantile Response

More information

Robust estimators for additive models using backfitting

Robust estimators for additive models using backfitting Robust estimators for additive models using backfitting Graciela Boente Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires and CONICET, Argentina Alejandra Martínez Facultad de Ciencias

More information

More formally, the Gini coefficient is defined as. with p(y) = F Y (y) and where GL(p, F ) the Generalized Lorenz ordinate of F Y is ( )

More formally, the Gini coefficient is defined as. with p(y) = F Y (y) and where GL(p, F ) the Generalized Lorenz ordinate of F Y is ( ) Fortin Econ 56 3. Measurement The theoretical literature on income inequality has developed sophisticated measures (e.g. Gini coefficient) on inequality according to some desirable properties such as decomposability

More information

Impact Evaluation Technical Workshop:

Impact Evaluation Technical Workshop: Impact Evaluation Technical Workshop: Asian Development Bank Sept 1 3, 2014 Manila, Philippines Session 19(b) Quantile Treatment Effects I. Quantile Treatment Effects Most of the evaluation literature

More information

WEIGHTED QUANTILE REGRESSION THEORY AND ITS APPLICATION. Abstract

WEIGHTED QUANTILE REGRESSION THEORY AND ITS APPLICATION. Abstract Journal of Data Science,17(1). P. 145-160,2019 DOI:10.6339/JDS.201901_17(1).0007 WEIGHTED QUANTILE REGRESSION THEORY AND ITS APPLICATION Wei Xiong *, Maozai Tian 2 1 School of Statistics, University of

More information

Analysis of Longitudinal Data with Semiparametric Estimation of Covariance Function

Analysis of Longitudinal Data with Semiparametric Estimation of Covariance Function Analysis of Longitudinal Data with Semiparametric Estimation of Covariance Function Jianqing Fan, Tao Huang and Runze Li Abstract Improving efficiency for regression coefficients and predicting trajectories

More information

A Semi-Parametric Measure for Systemic Risk

A Semi-Parametric Measure for Systemic Risk Natalia Sirotko-Sibirskaya Ladislaus von Bortkiewicz Chair of Statistics C.A.S.E. - Center for Applied Statistics and Economics Humboldt Universität zu Berlin http://lvb.wiwi.hu-berlin.de http://www.case.hu-berlin.de

More information

Goodness-of-fit tests for the cure rate in a mixture cure model

Goodness-of-fit tests for the cure rate in a mixture cure model Biometrika (217), 13, 1, pp. 1 7 Printed in Great Britain Advance Access publication on 31 July 216 Goodness-of-fit tests for the cure rate in a mixture cure model BY U.U. MÜLLER Department of Statistics,

More information

Quantile Regression for Extraordinarily Large Data

Quantile Regression for Extraordinarily Large Data Quantile Regression for Extraordinarily Large Data Shih-Kang Chao Department of Statistics Purdue University November, 2016 A joint work with Stanislav Volgushev and Guang Cheng Quantile regression Two-step

More information

Time Series and Forecasting Lecture 4 NonLinear Time Series

Time Series and Forecasting Lecture 4 NonLinear Time Series Time Series and Forecasting Lecture 4 NonLinear Time Series Bruce E. Hansen Summer School in Economics and Econometrics University of Crete July 23-27, 2012 Bruce Hansen (University of Wisconsin) Foundations

More information

18 Bivariate normal distribution I

18 Bivariate normal distribution I 8 Bivariate normal distribution I 8 Example Imagine firing arrows at a target Hopefully they will fall close to the target centre As we fire more arrows we find a high density near the centre and fewer

More information

Random Variables. Random variables. A numerically valued map X of an outcome ω from a sample space Ω to the real line R

Random Variables. Random variables. A numerically valued map X of an outcome ω from a sample space Ω to the real line R In probabilistic models, a random variable is a variable whose possible values are numerical outcomes of a random phenomenon. As a function or a map, it maps from an element (or an outcome) of a sample

More information

Local Polynomial Regression

Local Polynomial Regression VI Local Polynomial Regression (1) Global polynomial regression We observe random pairs (X 1, Y 1 ),, (X n, Y n ) where (X 1, Y 1 ),, (X n, Y n ) iid (X, Y ). We want to estimate m(x) = E(Y X = x) based

More information

Coordinate descent. Geoff Gordon & Ryan Tibshirani Optimization /

Coordinate descent. Geoff Gordon & Ryan Tibshirani Optimization / Coordinate descent Geoff Gordon & Ryan Tibshirani Optimization 10-725 / 36-725 1 Adding to the toolbox, with stats and ML in mind We ve seen several general and useful minimization tools First-order methods

More information

Approximation of Functions

Approximation of Functions Approximation of Functions Carlos Hurtado Department of Economics University of Illinois at Urbana-Champaign hrtdmrt2@illinois.edu Nov 7th, 217 C. Hurtado (UIUC - Economics) Numerical Methods On the Agenda

More information

The Simulation Study to Test the Performance of Quantile Regression Method With Heteroscedastic Error Variance

The Simulation Study to Test the Performance of Quantile Regression Method With Heteroscedastic Error Variance CAUCHY Jurnal Matematika Murni dan Aplikasi Volume 5(1)(2017), Pages 36-41 p-issn: 2086-0382; e-issn: 2477-3344 The Simulation Study to Test the Performance of Quantile Regression Method With Heteroscedastic

More information

4. Conditional risk measures and their robust representation

4. Conditional risk measures and their robust representation 4. Conditional risk measures and their robust representation We consider a discrete-time information structure given by a filtration (F t ) t=0,...,t on our probability space (Ω, F, P ). The time horizon

More information

3. QUANTILE-REGRESSION MODEL AND ESTIMATION

3. QUANTILE-REGRESSION MODEL AND ESTIMATION 03-Hao.qxd 3/13/2007 5:24 PM Page 22 22 Combining these two partial derivatives leads to: m + y m f(y)dy = F (m) (1 F (m)) = 2F (m) 1. [A.2] By setting 2F(m) 1 = 0, we solve for the value of F(m) = 1/2,

More information

arxiv: v2 [stat.me] 4 Jun 2016

arxiv: v2 [stat.me] 4 Jun 2016 Variable Selection for Additive Partial Linear Quantile Regression with Missing Covariates 1 Variable Selection for Additive Partial Linear Quantile Regression with Missing Covariates Ben Sherwood arxiv:1510.00094v2

More information

Stochastic Demand and Revealed Preference

Stochastic Demand and Revealed Preference Stochastic Demand and Revealed Preference Richard Blundell Dennis Kristensen Rosa Matzkin UCL & IFS, Columbia and UCLA November 2010 Blundell, Kristensen and Matzkin ( ) Stochastic Demand November 2010

More information

INFERENCE APPROACHES FOR INSTRUMENTAL VARIABLE QUANTILE REGRESSION. 1. Introduction

INFERENCE APPROACHES FOR INSTRUMENTAL VARIABLE QUANTILE REGRESSION. 1. Introduction INFERENCE APPROACHES FOR INSTRUMENTAL VARIABLE QUANTILE REGRESSION VICTOR CHERNOZHUKOV CHRISTIAN HANSEN MICHAEL JANSSON Abstract. We consider asymptotic and finite-sample confidence bounds in instrumental

More information

ECAS Summer Course. Quantile Regression for Longitudinal Data. Roger Koenker University of Illinois at Urbana-Champaign

ECAS Summer Course. Quantile Regression for Longitudinal Data. Roger Koenker University of Illinois at Urbana-Champaign ECAS Summer Course 1 Quantile Regression for Longitudinal Data Roger Koenker University of Illinois at Urbana-Champaign La Roche-en-Ardennes: September 2005 Part I: Penalty Methods for Random Effects Part

More information

Introduction to Regression Analysis. Dr. Devlina Chatterjee 11 th August, 2017

Introduction to Regression Analysis. Dr. Devlina Chatterjee 11 th August, 2017 Introduction to Regression Analysis Dr. Devlina Chatterjee 11 th August, 2017 What is regression analysis? Regression analysis is a statistical technique for studying linear relationships. One dependent

More information

What s New in Econometrics? Lecture 14 Quantile Methods

What s New in Econometrics? Lecture 14 Quantile Methods What s New in Econometrics? Lecture 14 Quantile Methods Jeff Wooldridge NBER Summer Institute, 2007 1. Reminders About Means, Medians, and Quantiles 2. Some Useful Asymptotic Results 3. Quantile Regression

More information

Luke B Smith and Brian J Reich North Carolina State University May 21, 2013

Luke B Smith and Brian J Reich North Carolina State University May 21, 2013 BSquare: An R package for Bayesian simultaneous quantile regression Luke B Smith and Brian J Reich North Carolina State University May 21, 2013 BSquare in an R package to conduct Bayesian quantile regression

More information

High-dimensional regression

High-dimensional regression High-dimensional regression Advanced Methods for Data Analysis 36-402/36-608) Spring 2014 1 Back to linear regression 1.1 Shortcomings Suppose that we are given outcome measurements y 1,... y n R, and

More information

CONTRIBUTIONS TO PENALIZED ESTIMATION. Sunyoung Shin

CONTRIBUTIONS TO PENALIZED ESTIMATION. Sunyoung Shin CONTRIBUTIONS TO PENALIZED ESTIMATION Sunyoung Shin A dissertation submitted to the faculty of the University of North Carolina at Chapel Hill in partial fulfillment of the requirements for the degree

More information

Restricted Likelihood Ratio Tests in Nonparametric Longitudinal Models

Restricted Likelihood Ratio Tests in Nonparametric Longitudinal Models Restricted Likelihood Ratio Tests in Nonparametric Longitudinal Models Short title: Restricted LR Tests in Longitudinal Models Ciprian M. Crainiceanu David Ruppert May 5, 2004 Abstract We assume that repeated

More information

401 Review. 6. Power analysis for one/two-sample hypothesis tests and for correlation analysis.

401 Review. 6. Power analysis for one/two-sample hypothesis tests and for correlation analysis. 401 Review Major topics of the course 1. Univariate analysis 2. Bivariate analysis 3. Simple linear regression 4. Linear algebra 5. Multiple regression analysis Major analysis methods 1. Graphical analysis

More information

Lecture 3: Statistical Decision Theory (Part II)

Lecture 3: Statistical Decision Theory (Part II) Lecture 3: Statistical Decision Theory (Part II) Hao Helen Zhang Hao Helen Zhang Lecture 3: Statistical Decision Theory (Part II) 1 / 27 Outline of This Note Part I: Statistics Decision Theory (Classical

More information

OPTIMALLY COMBINED ESTIMATION FOR TAIL QUANTILE REGRESSION

OPTIMALLY COMBINED ESTIMATION FOR TAIL QUANTILE REGRESSION Statistica Sinica 26 (2016), 295-311 doi:http://dx.doi.org/10.5705/ss.2014.051 OPTIMALLY COMBINED ESTIMATION FOR TAIL QUANTILE REGRESSION Kehui Wang and Huixia Judy Wang PPD Inc. and George Washington

More information

Basis Penalty Smoothers. Simon Wood Mathematical Sciences, University of Bath, U.K.

Basis Penalty Smoothers. Simon Wood Mathematical Sciences, University of Bath, U.K. Basis Penalty Smoothers Simon Wood Mathematical Sciences, University of Bath, U.K. Estimating functions It is sometimes useful to estimate smooth functions from data, without being too precise about the

More information

Trends in the Relative Distribution of Wages by Gender and Cohorts in Brazil ( )

Trends in the Relative Distribution of Wages by Gender and Cohorts in Brazil ( ) Trends in the Relative Distribution of Wages by Gender and Cohorts in Brazil (1981-2005) Ana Maria Hermeto Camilo de Oliveira Affiliation: CEDEPLAR/UFMG Address: Av. Antônio Carlos, 6627 FACE/UFMG Belo

More information

Part IV Extensions: Competing Risks Endpoints and Non-Parametric AUC(t) Estimation

Part IV Extensions: Competing Risks Endpoints and Non-Parametric AUC(t) Estimation Part IV Extensions: Competing Risks Endpoints and Non-Parametric AUC(t) Estimation Patrick J. Heagerty PhD Department of Biostatistics University of Washington 166 ISCB 2010 Session Four Outline Examples

More information

Efficient Estimation for the Partially Linear Models with Random Effects

Efficient Estimation for the Partially Linear Models with Random Effects A^VÇÚO 1 33 ò 1 5 Ï 2017 c 10 Chinese Journal of Applied Probability and Statistics Oct., 2017, Vol. 33, No. 5, pp. 529-537 doi: 10.3969/j.issn.1001-4268.2017.05.009 Efficient Estimation for the Partially

More information

STAT 461/561- Assignments, Year 2015

STAT 461/561- Assignments, Year 2015 STAT 461/561- Assignments, Year 2015 This is the second set of assignment problems. When you hand in any problem, include the problem itself and its number. pdf are welcome. If so, use large fonts and

More information

Quantile regression for longitudinal data using the asymmetric Laplace distribution

Quantile regression for longitudinal data using the asymmetric Laplace distribution Biostatistics (2007), 8, 1, pp. 140 154 doi:10.1093/biostatistics/kxj039 Advance Access publication on April 24, 2006 Quantile regression for longitudinal data using the asymmetric Laplace distribution

More information

NEW EFFICIENT AND ROBUST ESTIMATION IN VARYING-COEFFICIENT MODELS WITH HETEROSCEDASTICITY

NEW EFFICIENT AND ROBUST ESTIMATION IN VARYING-COEFFICIENT MODELS WITH HETEROSCEDASTICITY Statistica Sinica 22 202, 075-0 doi:http://dx.doi.org/0.5705/ss.200.220 NEW EFFICIENT AND ROBUST ESTIMATION IN VARYING-COEFFICIENT MODELS WITH HETEROSCEDASTICITY Jie Guo, Maozai Tian and Kai Zhu 2 Renmin

More information

Statistics for high-dimensional data: Group Lasso and additive models

Statistics for high-dimensional data: Group Lasso and additive models Statistics for high-dimensional data: Group Lasso and additive models Peter Bühlmann and Sara van de Geer Seminar für Statistik, ETH Zürich May 2012 The Group Lasso (Yuan & Lin, 2006) high-dimensional

More information

Assessing Model Adequacy

Assessing Model Adequacy Assessing Model Adequacy A number of assumptions were made about the model, and these need to be verified in order to use the model for inferences. In cases where some assumptions are violated, there are

More information

Robust control charts for time series data

Robust control charts for time series data Robust control charts for time series data Christophe Croux K.U. Leuven & Tilburg University Sarah Gelper Erasmus University Rotterdam Koen Mahieu K.U. Leuven Abstract This article presents a control chart

More information

SEMIPARAMETRIC LONGITUDINAL MODEL WITH IRREGULAR TIME AUTOREGRESSIVE ERROR PROCESS

SEMIPARAMETRIC LONGITUDINAL MODEL WITH IRREGULAR TIME AUTOREGRESSIVE ERROR PROCESS Statistica Sinica 25 (2015), 507-527 doi:http://dx.doi.org/10.5705/ss.2013.073 SEMIPARAMETRIC LOGITUDIAL MODEL WITH IRREGULAR TIME AUTOREGRESSIVE ERROR PROCESS Yang Bai 1,2, Jian Huang 3, Rui Li 1 and

More information

Habilitationsvortrag: Machine learning, shrinkage estimation, and economic theory

Habilitationsvortrag: Machine learning, shrinkage estimation, and economic theory Habilitationsvortrag: Machine learning, shrinkage estimation, and economic theory Maximilian Kasy May 25, 218 1 / 27 Introduction Recent years saw a boom of machine learning methods. Impressive advances

More information

Bayesian Inference on Joint Mixture Models for Survival-Longitudinal Data with Multiple Features. Yangxin Huang

Bayesian Inference on Joint Mixture Models for Survival-Longitudinal Data with Multiple Features. Yangxin Huang Bayesian Inference on Joint Mixture Models for Survival-Longitudinal Data with Multiple Features Yangxin Huang Department of Epidemiology and Biostatistics, COPH, USF, Tampa, FL yhuang@health.usf.edu January

More information

School on Modelling Tools and Capacity Building in Climate and Public Health April 2013

School on Modelling Tools and Capacity Building in Climate and Public Health April 2013 2453-14 School on Modelling Tools and Capacity Building in Climate and Public Health 15-26 April 2013 Expectile and Quantile Regression and Other Extensions KAZEMBE Lawrence University of Malawi Chancellor

More information

Problem Set 3: Bootstrap, Quantile Regression and MCMC Methods. MIT , Fall Due: Wednesday, 07 November 2007, 5:00 PM

Problem Set 3: Bootstrap, Quantile Regression and MCMC Methods. MIT , Fall Due: Wednesday, 07 November 2007, 5:00 PM Problem Set 3: Bootstrap, Quantile Regression and MCMC Methods MIT 14.385, Fall 2007 Due: Wednesday, 07 November 2007, 5:00 PM 1 Applied Problems Instructions: The page indications given below give you

More information

Covariate Adjusted Varying Coefficient Models

Covariate Adjusted Varying Coefficient Models Biostatistics Advance Access published October 26, 2005 Covariate Adjusted Varying Coefficient Models Damla Şentürk Department of Statistics, Pennsylvania State University University Park, PA 16802, U.S.A.

More information

Likelihood ratio testing for zero variance components in linear mixed models

Likelihood ratio testing for zero variance components in linear mixed models Likelihood ratio testing for zero variance components in linear mixed models Sonja Greven 1,3, Ciprian Crainiceanu 2, Annette Peters 3 and Helmut Küchenhoff 1 1 Department of Statistics, LMU Munich University,

More information

Semi-parametric estimation of non-stationary Pickands functions

Semi-parametric estimation of non-stationary Pickands functions Semi-parametric estimation of non-stationary Pickands functions Linda Mhalla 1 Joint work with: Valérie Chavez-Demoulin 2 and Philippe Naveau 3 1 Geneva School of Economics and Management, University of

More information

Stat 710: Mathematical Statistics Lecture 31

Stat 710: Mathematical Statistics Lecture 31 Stat 710: Mathematical Statistics Lecture 31 Jun Shao Department of Statistics University of Wisconsin Madison, WI 53706, USA Jun Shao (UW-Madison) Stat 710, Lecture 31 April 13, 2009 1 / 13 Lecture 31:

More information

Expecting the Unexpected: Uniform Quantile Regression Bands with an application to Investor Sentiments

Expecting the Unexpected: Uniform Quantile Regression Bands with an application to Investor Sentiments Expecting the Unexpected: Uniform Bands with an application to Investor Sentiments Boston University November 16, 2016 Econometric Analysis of Heterogeneity in Financial Markets Using s Chapter 1: Expecting

More information

Alternatives to Basis Expansions. Kernels in Density Estimation. Kernels and Bandwidth. Idea Behind Kernel Methods

Alternatives to Basis Expansions. Kernels in Density Estimation. Kernels and Bandwidth. Idea Behind Kernel Methods Alternatives to Basis Expansions Basis expansions require either choice of a discrete set of basis or choice of smoothing penalty and smoothing parameter Both of which impose prior beliefs on data. Alternatives

More information

Rejoinder. 1 Phase I and Phase II Profile Monitoring. Peihua Qiu 1, Changliang Zou 2 and Zhaojun Wang 2

Rejoinder. 1 Phase I and Phase II Profile Monitoring. Peihua Qiu 1, Changliang Zou 2 and Zhaojun Wang 2 Rejoinder Peihua Qiu 1, Changliang Zou 2 and Zhaojun Wang 2 1 School of Statistics, University of Minnesota 2 LPMC and Department of Statistics, Nankai University, China We thank the editor Professor David

More information

Testing Equality of Nonparametric Quantile Regression Functions

Testing Equality of Nonparametric Quantile Regression Functions International Journal of Statistics and Probability; Vol. 3, No. 1; 2014 ISSN 1927-7032 E-ISSN 1927-7040 Published by Canadian Center of Science and Education Testing Equality of Nonparametric Quantile

More information

Bayesian Semiparametric Methods for Joint Modeling Longitudinal and Survival Data

Bayesian Semiparametric Methods for Joint Modeling Longitudinal and Survival Data Bayesian Semiparametric Methods for Joint Modeling Longitudinal and Survival Data Tim Hanson Division of Biostatistics University of Minnesota and Adam Branscum University of Kentucky Departments of Biostatistics,

More information

Estimating Global Bank Network Connectedness

Estimating Global Bank Network Connectedness Estimating Global Bank Network Connectedness Mert Demirer (MIT) Francis X. Diebold (Penn) Laura Liu (Penn) Kamil Yılmaz (Koç) September 22, 2016 1 / 27 Financial and Macroeconomic Connectedness Market

More information

Transformation and Smoothing in Sample Survey Data

Transformation and Smoothing in Sample Survey Data Scandinavian Journal of Statistics, Vol. 37: 496 513, 2010 doi: 10.1111/j.1467-9469.2010.00691.x Published by Blackwell Publishing Ltd. Transformation and Smoothing in Sample Survey Data YANYUAN MA Department

More information

Personalized Treatment Selection Based on Randomized Clinical Trials. Tianxi Cai Department of Biostatistics Harvard School of Public Health

Personalized Treatment Selection Based on Randomized Clinical Trials. Tianxi Cai Department of Biostatistics Harvard School of Public Health Personalized Treatment Selection Based on Randomized Clinical Trials Tianxi Cai Department of Biostatistics Harvard School of Public Health Outline Motivation A systematic approach to separating subpopulations

More information

New Developments in Econometrics Lecture 16: Quantile Estimation

New Developments in Econometrics Lecture 16: Quantile Estimation New Developments in Econometrics Lecture 16: Quantile Estimation Jeff Wooldridge Cemmap Lectures, UCL, June 2009 1. Review of Means, Medians, and Quantiles 2. Some Useful Asymptotic Results 3. Quantile

More information

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data? When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data? Kosuke Imai Department of Politics Center for Statistics and Machine Learning Princeton University

More information

Chapter 3 Examining Data

Chapter 3 Examining Data Chapter 3 Examining Data This chapter discusses methods of displaying quantitative data with the objective of understanding the distribution of the data. Example During childhood and adolescence, bone

More information

Part III Measures of Classification Accuracy for the Prediction of Survival Times

Part III Measures of Classification Accuracy for the Prediction of Survival Times Part III Measures of Classification Accuracy for the Prediction of Survival Times Patrick J Heagerty PhD Department of Biostatistics University of Washington 102 ISCB 2010 Session Three Outline Examples

More information

Modelling Survival Data using Generalized Additive Models with Flexible Link

Modelling Survival Data using Generalized Additive Models with Flexible Link Modelling Survival Data using Generalized Additive Models with Flexible Link Ana L. Papoila 1 and Cristina S. Rocha 2 1 Faculdade de Ciências Médicas, Dep. de Bioestatística e Informática, Universidade

More information

MA 575 Linear Models: Cedric E. Ginestet, Boston University Non-parametric Inference, Polynomial Regression Week 9, Lecture 2

MA 575 Linear Models: Cedric E. Ginestet, Boston University Non-parametric Inference, Polynomial Regression Week 9, Lecture 2 MA 575 Linear Models: Cedric E. Ginestet, Boston University Non-parametric Inference, Polynomial Regression Week 9, Lecture 2 1 Bootstrapped Bias and CIs Given a multiple regression model with mean and

More information

Estimating subgroup specific treatment effects via concave fusion

Estimating subgroup specific treatment effects via concave fusion Estimating subgroup specific treatment effects via concave fusion Jian Huang University of Iowa April 6, 2016 Outline 1 Motivation and the problem 2 The proposed model and approach Concave pairwise fusion

More information

Computational treatment of the error distribution in nonparametric regression with right-censored and selection-biased data

Computational treatment of the error distribution in nonparametric regression with right-censored and selection-biased data Computational treatment of the error distribution in nonparametric regression with right-censored and selection-biased data Géraldine Laurent 1 and Cédric Heuchenne 2 1 QuantOM, HEC-Management School of

More information

Combining Model and Test Data for Optimal Determination of Percentiles and Allowables: CVaR Regression Approach, Part I

Combining Model and Test Data for Optimal Determination of Percentiles and Allowables: CVaR Regression Approach, Part I 9 Combining Model and Test Data for Optimal Determination of Percentiles and Allowables: CVaR Regression Approach, Part I Stan Uryasev 1 and A. Alexandre Trindade 2 1 American Optimal Decisions, Inc. and

More information

The Information Content of Capacity Utilisation Rates for Output Gap Estimates

The Information Content of Capacity Utilisation Rates for Output Gap Estimates The Information Content of Capacity Utilisation Rates for Output Gap Estimates Michael Graff and Jan-Egbert Sturm 15 November 2010 Overview Introduction and motivation Data Output gap data: OECD Economic

More information

Nonparametric Inference In Functional Data

Nonparametric Inference In Functional Data Nonparametric Inference In Functional Data Zuofeng Shang Purdue University Joint work with Guang Cheng from Purdue Univ. An Example Consider the functional linear model: Y = α + where 1 0 X(t)β(t)dt +

More information

Introduction to bivariate analysis

Introduction to bivariate analysis Introduction to bivariate analysis When one measurement is made on each observation, univariate analysis is applied. If more than one measurement is made on each observation, multivariate analysis is applied.

More information

Chapter 2: Resampling Maarten Jansen

Chapter 2: Resampling Maarten Jansen Chapter 2: Resampling Maarten Jansen Randomization tests Randomized experiment random assignment of sample subjects to groups Example: medical experiment with control group n 1 subjects for true medicine,

More information

PEP-PMMA Technical Note Series (PTN 01) Percentile Weighed Regression

PEP-PMMA Technical Note Series (PTN 01) Percentile Weighed Regression PEP-PMMA Technical Note Series (PTN 01) Percentile Weighed Regression Araar Abdelkrim University of Laval, aabd@ecn.ulaval.ca May 29, 2016 Abstract - In this brief note, we review the quantile and unconditional

More information

Introduction to bivariate analysis

Introduction to bivariate analysis Introduction to bivariate analysis When one measurement is made on each observation, univariate analysis is applied. If more than one measurement is made on each observation, multivariate analysis is applied.

More information

mboost - Componentwise Boosting for Generalised Regression Models

mboost - Componentwise Boosting for Generalised Regression Models mboost - Componentwise Boosting for Generalised Regression Models Thomas Kneib & Torsten Hothorn Department of Statistics Ludwig-Maximilians-University Munich 13.8.2008 Boosting in a Nutshell Boosting

More information

Robust Variable Selection Through MAVE

Robust Variable Selection Through MAVE Robust Variable Selection Through MAVE Weixin Yao and Qin Wang Abstract Dimension reduction and variable selection play important roles in high dimensional data analysis. Wang and Yin (2008) proposed sparse

More information

Discussion of the paper Inference for Semiparametric Models: Some Questions and an Answer by Bickel and Kwon

Discussion of the paper Inference for Semiparametric Models: Some Questions and an Answer by Bickel and Kwon Discussion of the paper Inference for Semiparametric Models: Some Questions and an Answer by Bickel and Kwon Jianqing Fan Department of Statistics Chinese University of Hong Kong AND Department of Statistics

More information

On Backtesting Risk Measurement Models

On Backtesting Risk Measurement Models On Backtesting Risk Measurement Models Hideatsu Tsukahara Department of Economics, Seijo University e-mail address: tsukahar@seijo.ac.jp 1 Introduction In general, the purpose of backtesting is twofold:

More information

Difference-in-Differences Methods

Difference-in-Differences Methods Difference-in-Differences Methods Teppei Yamamoto Keio University Introduction to Causal Inference Spring 2016 1 Introduction: A Motivating Example 2 Identification 3 Estimation and Inference 4 Diagnostics

More information

Bickel Rosenblatt test

Bickel Rosenblatt test University of Latvia 28.05.2011. A classical Let X 1,..., X n be i.i.d. random variables with a continuous probability density function f. Consider a simple hypothesis H 0 : f = f 0 with a significance

More information

Threshold Autoregressions and NonLinear Autoregressions

Threshold Autoregressions and NonLinear Autoregressions Threshold Autoregressions and NonLinear Autoregressions Original Presentation: Central Bank of Chile October 29-31, 2013 Bruce Hansen (University of Wisconsin) Threshold Regression 1 / 47 Threshold Models

More information

Local linear multiple regression with variable. bandwidth in the presence of heteroscedasticity

Local linear multiple regression with variable. bandwidth in the presence of heteroscedasticity Local linear multiple regression with variable bandwidth in the presence of heteroscedasticity Azhong Ye 1 Rob J Hyndman 2 Zinai Li 3 23 January 2006 Abstract: We present local linear estimator with variable

More information

Sparse PCA with applications in finance

Sparse PCA with applications in finance Sparse PCA with applications in finance A. d Aspremont, L. El Ghaoui, M. Jordan, G. Lanckriet ORFE, Princeton University & EECS, U.C. Berkeley Available online at www.princeton.edu/~aspremon 1 Introduction

More information