Bootstrap & Confidence/Prediction intervals

Size: px

Start display at page:

Download "Bootstrap & Confidence/Prediction intervals"

Vivien Boyd
5 years ago
Views:

1 Bootstrap & Confidence/Prediction intervals Olivier Roustant Mines Saint-Étienne 2017/11 Olivier Roustant (EMSE) Bootstrap & Confidence/Prediction intervals 2017/11 1 / 9

2 Framework Consider a model with an additive homoskedastic noise y i = g(x i ; β) + ε i i = 1,..., n with ε 1,..., ε n i.i.d. with a cdf F, and β is a vector of parameters. Olivier Roustant (EMSE) Bootstrap & Confidence/Prediction intervals 2017/11 2 / 9

3 Framework Consider a model with an additive homoskedastic noise y i = g(x i ; β) + ε i i = 1,..., n with ε 1,..., ε n i.i.d. with a cdf F, and β is a vector of parameters. Let ˆβ an estimator of β, and let x be a new site. We are interested in : The prediction mean at x : ŷ(x) = g(x; ˆβ) The prediction law at x, i.e. the law of ŷ(x) + ɛ, where ɛ F independent of the ε 1,..., ε n. Olivier Roustant (EMSE) Bootstrap & Confidence/Prediction intervals 2017/11 2 / 9

4 Confidence intervals The variability of ŷ(x) may provide a (random) confidence interval at level α of a (deterministic) statistic of interest t for g(x, β) : P(I t) = 1 α Olivier Roustant (EMSE) Bootstrap & Confidence/Prediction intervals 2017/11 3 / 9

5 Confidence intervals The variability of ŷ(x) may provide a (random) confidence interval at level α of a (deterministic) statistic of interest t for g(x, β) : Example (Gaussian linear model) P(I t) = 1 α g(x, ˆβ) ) = x ˆβ N (x β, σ 2 x (X X) 1 x This gives a 95% confidence interval of x β, the true prediction at x : I x ˆβ ± 2ˆσ x (X X) 1 x Olivier Roustant (EMSE) Bootstrap & Confidence/Prediction intervals 2017/11 3 / 9

6 Confidence intervals, bootstrap estimate Resample the data z i = (x i, y i ), i = 1,..., n (z b 1,..., z b n ), b = 1..., B Olivier Roustant (EMSE) Bootstrap & Confidence/Prediction intervals 2017/11 4 / 9

7 Confidence intervals, bootstrap estimate Resample the data z i = (x i, y i ), i = 1,..., n (z b 1,..., z b n ), b = 1..., B For each bootstrap sample (zi b ) 1 i n, refit the data ˆβ b Olivier Roustant (EMSE) Bootstrap & Confidence/Prediction intervals 2017/11 4 / 9

8 Confidence intervals, bootstrap estimate Resample the data z i = (x i, y i ), i = 1,..., n (z b 1,..., z b n ), b = 1..., B For each bootstrap sample (zi b ) 1 i n, refit the data ˆβ b Give a 95% confidence interval by computing the empirical quantiles at level 2.5% and 97.5% of g(x, ˆβ b ), b = 1,..., B Olivier Roustant (EMSE) Bootstrap & Confidence/Prediction intervals 2017/11 4 / 9

9 Prediction intervals The law of ŷ(x) + ɛ provides a (deterministic) prediction interval I at level α, which contains the (random) prediction with probability 1 α : P(ŷ(x) + ɛ I) = 1 α Olivier Roustant (EMSE) Bootstrap & Confidence/Prediction intervals 2017/11 5 / 9

10 Prediction intervals The law of ŷ(x) + ɛ provides a (deterministic) prediction interval I at level α, which contains the (random) prediction with probability 1 α : Example (Gaussian linear model) P(ŷ(x) + ɛ I) = 1 α ŷ(x) + ɛ = x ˆβ + ɛ N (x β, σ 2 (1 + x (X X) 1 x)) and I x β ± 2ˆσ 1 + x (X X) 1 x Olivier Roustant (EMSE) Bootstrap & Confidence/Prediction intervals 2017/11 5 / 9

11 Prediction intervals, bootstrap estimate Resample the data z i = (x i, y i ), i = 1,..., n (z b 1,..., z b n ), b = 1..., B For each bootstrap sample (zi b ) 1 i n, b Refit the data : ˆβ Olivier Roustant (EMSE) Bootstrap & Confidence/Prediction intervals 2017/11 6 / 9

12 Prediction intervals, bootstrap estimate Resample the data z i = (x i, y i ), i = 1,..., n (z b 1,..., z b n ), b = 1..., B For each bootstrap sample (zi b ) 1 i n, b Refit the data : ˆβ Sample ε b from the resampled residuals : ε b i = y i g(x i, ˆβ b ) Non-parametric bootstrap on the bootstrapped residuals Olivier Roustant (EMSE) Bootstrap & Confidence/Prediction intervals 2017/11 6 / 9

13 Prediction intervals, bootstrap estimate Resample the data z i = (x i, y i ), i = 1,..., n (z b 1,..., z b n ), b = 1..., B For each bootstrap sample (zi b ) 1 i n, b Refit the data : ˆβ Sample ε b from the resampled residuals : ε b i = y i g(x i, ˆβ b ) Non-parametric bootstrap on the bootstrapped residuals Give a 95% prediction interval by computing the empirical quantiles at level 2.5% and 97.5% of g(x, ˆβ b ) + ε b, b = 1,..., B Olivier Roustant (EMSE) Bootstrap & Confidence/Prediction intervals 2017/11 6 / 9

14 Prediction intervals, bootstrap estimate - Normal approximation If the residuals ε 1,..., ε n are assumed N (0, σ 2 ), one can sample ε b from N (0, (σ b ) 2 ), where σ b is the s.d. of (ε b i ) 1 i n. Parametric bootstrap on the bootstrapped residuals Olivier Roustant (EMSE) Bootstrap & Confidence/Prediction intervals 2017/11 7 / 9

15 Prediction intervals, bootstrap estimate - Normal approximation If the residuals ε 1,..., ε n are assumed N (0, σ 2 ), one can sample ε b from N (0, (σ b ) 2 ), where σ b is the s.d. of (ε b i ) 1 i n. Parametric bootstrap on the bootstrapped residuals If, in addition, the law of g(x, ˆβ) is assumed to be Gaussian, one can simply estimate the variances of g(x, ˆβ) and ε separately ˆσ 2 g(x,.) = 1 B 1 B b=1 ( ) 2 g(x, ˆβ b ) g(x,.) ˆσ ε 2 = 1 n 1 and compute the prediction intervals as g(x,.) ± 2 ˆσ g(x,.) 2 + ˆσ2 ε n i=1 ɛ 2 i Olivier Roustant (EMSE) Bootstrap & Confidence/Prediction intervals 2017/11 7 / 9

16 References References Olivier Roustant (EMSE) Bootstrap & Confidence/Prediction intervals 2017/11 8 / 9

17 References For more details, in particular about bias induced by bootstrap, one can read : ESL T. Hastie, R. Tibshirani and J. Friedman (2009), The Elements of Statistical Learning, Springer, 2nd edition, print 10. Olivier Roustant (EMSE) Bootstrap & Confidence/Prediction intervals 2017/11 9 / 9

BANA 7046 Data Mining I Lecture 2. Linear Regression, Model Assessment, and Cross-validation 1

BANA 7046 Data Mining I Lecture 2. Linear Regression, Model Assessment, and Cross-validation 1 Shaobo Li University of Cincinnati 1 Partially based on Hastie, et al. (2009) ESL, and James, et al. (2013)