Quantifying Stochastic Model Errors via Robust Optimization

Size: px
Start display at page:

Download "Quantifying Stochastic Model Errors via Robust Optimization"

Transcription

1 Quantifying Stochastic Model Errors via Robust Optimization IPAM Workshop on Uncertainty Quantification for Multiscale Stochastic Systems and Applications Jan 19, 2016 Henry Lam Industrial & Operations Engineering University of Michigan Acknowledgement: National Science Foundation under grants CMMI and CMMI

2 Overview Focus: Discrete-event stochastic systems in operations research Motivation: Model risk / model error/ input uncertainty arises when performance analysis is based on misspecified / uncertain input model assumptions Goal: Performance analysis that is robust against model error Scope: Input model: stochastic Uncertainty: nonparametric

3 Basic Question Single-server queue: i.i.d. Exp(0.8) interarrival times i.i.d. Exp(1) service times Target performance measure: probability of long waiting time incurred by the 50-th arriving customer, i.e. P W 50 > 2

4 Basic Question Single-server queue: i.i.d. Exp(0.8) interarrival times i.i.d. Exp(1) service times Target performance measure: probability of long waiting time incurred by the 50-th arriving customer, i.e. P W 50 > 2 With insufficient or unreliable historical data, What if service time distribution is not Exp(1)?

5 Basic Question Single-server queue: i.i.d. Exp(0.8) interarrival times i.i.d. Exp(1) service times Target performance measure: probability of long waiting time incurred by the 50-th arriving customer, i.e. P W 50 > 2 With insufficient or unreliable historical data, What if service time distribution is not Exp(1)? What is a reliable estimate of the performance measure, without committing to a (non-)parametric model for the service time distribution?

6 Question 2: Uncertain Dependency Single-server queue: i.i.d. Exp(0.8) interarrival times i.i.d. Exp(1) service times Target performance measure: probability of long waiting time incurred by the 50-th arriving customer, i.e. P W 50 > 2

7 Question 2: Uncertain Dependency Single-server queue: i.i.d. Exp(0.8) interarrival times i.i.d. Exp(1) service times Target performance measure: probability of long waiting time incurred by the 50-th arriving customer, i.e. P W 50 > 2 What if the interarrival time sequence is not i.i.d.?

8 Question 2: Uncertain Dependency Single-server queue: i.i.d. Exp(0.8) interarrival times i.i.d. Exp(1) service times Target performance measure: probability of long waiting time incurred by the 50-th arriving customer, i.e. P W 50 > 2 What if the interarrival time sequence is not i.i.d.? An estimate without restricting to any parametric class of time series models for the interarrival times?

9 Question 3: Input Reconstruction Single-server queue: i.i.d. Exp(0.8) interarrival times i.i.d. unknown service times What information can we extract on the service time distribution if we only have data on the waiting times?

10 Robust Stochastic Modeling Model uncertainty arises due to modeling limitations, presence of risk, lack of direct data etc. Robust perspective:

11 Robust Stochastic Modeling Model uncertainty arises due to modeling limitations, presence of risk, lack of direct data etc. Robust perspective: Element 1: Represent partial, nonparametric information on the input model via constraints uncertainty set (e.g. Ben-Tal et al. 09, Bertsimas et al. 11)

12 Robust Stochastic Modeling Model uncertainty arises due to modeling limitations, presence of risk, lack of direct data etc. Robust perspective: Element 1: Represent partial, nonparametric information on the input model via constraints uncertainty set (e.g. Ben-Tal et al. 09, Bertsimas et al. 11) Element 2: Solve optimization problem that computes the worstcase performance measure subject to the uncertainty

13 Basic Example Baseline input model: X t P 0 X = (X 1, X 2,, X T ) Performance measure: E[h X ] E.g. Single-server queue: X = sequence of service times, P 0 = Exp(1), h X = I(waiting time of 20th customer > 2) compute E 0 h X 1,, X T under the condition i.i.d. X t P 0

14 Basic Example Baseline input model: X t P 0 X = (X 1, X 2,, X T ) Performance measure: E[h X ] E.g. Single-server queue: X = sequence of service times, P 0 = Exp(1), h X = I(waiting time of 20th customer > 2) What if the distribution for X t is potentially misspecified? compute E 0 h X 1,, X T under the condition i.i.d. X t P 0

15 Basic Example Baseline input model: X t P 0 X = (X 1, X 2,, X T ) Performance measure: E[h X ] E.g. Single-server queue: X = sequence of service times, P 0 = Exp(1), h X = I(waiting time of 20th customer > 2) What if the distribution for X t is potentially misspecified? maximize E f h X 1,, X T subject to i.i.d. X t P f D P f P 0 η

16 Basic Example Baseline input model: X t P 0 X = (X 1, X 2,, X T ) Performance measure: E[h X ] E.g. Single-server queue: X = sequence of service times, P 0 = Exp(1), h X = I(waiting time of 20th customer > 2) What if the distribution for X t is potentially misspecified? The true service time distribution P f is in a neighborhood of the baseline model P 0, as measured by a nonparametric distance D of η units maximize E f h X 1,, X T subject to i.i.d. X t P f D P f P 0 η

17 Basic Example Baseline input model: X t P 0 X = (X 1, X 2,, X T ) Performance measure: E[h X ] E.g. Single-server queue: X = sequence of service times, P 0 = Exp(1), h X = I(waiting time of 20th customer > 2) What if the distribution for X t is potentially misspecified? The true service time distribution P f is in a neighborhood of the baseline model P 0, as measured by a nonparametric distance D of η units maximize E f h X 1,, X T subject to i.i.d. X t P f D P f P 0 η Decision variable: P f

18 Basic Example Baseline input model: X t P 0 X = (X 1, X 2,, X T ) Performance measure: E[h X ] E.g. Single-server queue: X = sequence of service times, P 0 = Exp(1), h X = I(waiting time of 20th customer > 2) What if the distribution for X t is potentially misspecified? The true service time distribution P f is in a neighborhood of the baseline model P 0, as measured by a nonparametric distance D of η units maximize E f h X 1,, X T subject to i.i.d. X t P f D P f P 0 η Decision variable: P f D P f P 0 can be: Kullback-Leibler (KL) divergence = log dp f dp 0 dp f φ-divergence = φ dp f dp dp 0 0 replaced/enriched by moment constraints, e.g. μ E f X t μ

19 Example 2: Uncertain Dependency Baseline input model: X t P 0 X = (X 1, X 2,, X T ) Performance measure: E[h X ] E.g. Single-server queue: X = sequence of interarrival times, P 0 = Exp(0.8), h X = I(waiting time of 20th customer > 2) What if the i.i.d. assumption for (X 1,, X T ) is potentially misspecified? compute E 0 h X 1,, X T under the condition i.i.d. X t P 0

20 Example 2: Uncertain Dependency Baseline input model: X t P 0 X = (X 1, X 2,, X T ) Performance measure: E[h X ] E.g. Single-server queue: X = sequence of interarrival times, P 0 = Exp(0.8), h X = I(waiting time of 20th customer > 2) What if the i.i.d. assumption for (X 1,, X T ) is potentially misspecified? maximize E f h X 1,, X T subject to X 1,, X T P f P f is within η-neighborhood of P 0 = P 0 P 0 P f has the same marginal as P 0 P f a class of stationary serially dependent processes

21 Example 2: Uncertain Dependency Baseline input model: X t P 0 X = (X 1, X 2,, X T ) Performance measure: E[h X ] E.g. Single-server queue: X = sequence of interarrival times, P 0 = Exp(0.8), h X = I(waiting time of 20th customer > 2) What if the i.i.d. assumption for (X 1,, X T ) is potentially misspecified? maximize E f h X 1,, X T subject to X 1,, X T P f P f is within η-neighborhood of P 0 = P 0 P 0 P f has the same marginal as P 0 P f a class of stationary serially dependent processes

22 Example 2: Uncertain Dependency Baseline input model: X t P 0 X = (X 1, X 2,, X T ) Performance measure: E[h X ] E.g. Single-server queue: X = sequence of interarrival times, P 0 = Exp(0.8), h X = I(waiting time of 20th customer > 2) What if the i.i.d. assumption for (X 1,, X T ) is potentially misspecified? maximize E f h X 1,, X T subject to X 1,, X T P f P f is within η-neighborhood of P 0 = P 0 P 0 P f has the same marginal as P 0 P f a class of stationary serially dependent processes

23 Example 2: Uncertain Dependency Baseline input model: X t P 0 X = (X 1, X 2,, X T ) Performance measure: E[h X ] E.g. Single-server queue: X = sequence of interarrival times, P 0 = Exp(0.8), h X = I(waiting time of 20th customer > 2) What if the i.i.d. assumption for (X 1,, X T ) is potentially misspecified? maximize E f h X 1,, X T subject to X 1,, X T P f P f is within η-neighborhood of P 0 = P 0 P 0 P f has the same marginal as P 0 P f a class of stationary serially dependent processes

24 Example 2: Uncertain Dependency Baseline input model: X t P 0 X = (X 1, X 2,, X T ) Performance measure: E[h X ] E.g. Single-server queue: X = sequence of interarrival times, P 0 = Exp(0.8), h X = I(waiting time of 20th customer > 2) What if the i.i.d. assumption for (X 1,, X T ) is potentially misspecified? maximize E f h X 1,, X T subject to X 1,, X T P f P f is within η-neighborhood of P 0 = P 0 P 0 P f has the same marginal as P 0 P f a class of stationary serially dependent processes Decision variable: P f

25 Example 2: Uncertain Dependency Baseline input model: X t P 0 X = (X 1, X 2,, X T ) Performance measure: E[h X ] E.g. Single-server queue: X = sequence of interarrival times, P 0 = Exp(0.8), h X = I(waiting time of 20th customer > 2) What if the i.i.d. assumption for (X 1,, X T ) is potentially misspecified? maximize E f h X 1,, X T subject to X 1,, X T P f P f is within η-neighborhood of P 0 = P 0 P 0 P f has the same marginal as P 0 P f a class of stationary 1-lag dependent processes Decision variable: P f

26 Example 2: Uncertain Dependency Baseline input model: X t P 0 X = (X 1, X 2,, X T ) Performance measure: E[h X ] E.g. Single-server queue: X = sequence of interarrival times, P 0 = Exp(0.8), h X = I(waiting time of 20th customer > 2) What if the i.i.d. assumption for (X 1,, X T ) is potentially misspecified? maximize E f h X 1,, X T subject to X 1,, X T P f D P f P 0 η P f has the same marginal as P 0 P f a class of stationary 1-lag dependent processes Decision variable: P f D P f P 0 can be: mutual information for the bivariate distribution of any two consecutive states under P f

27 Example 2: Uncertain Dependency Baseline input model: X t P 0 X = (X 1, X 2,, X T ) Performance measure: E[h X ] E.g. Single-server queue: X = sequence of interarrival times, P 0 = Exp(0.8), h X = I(waiting time of 20th customer > 2) What if the i.i.d. assumption for (X 1,, X T ) is potentially misspecified? maximize E f h X 1,, X T subject to X 1,, X T P f D P f P 0 η P f has the same marginal as P 0 P f a class of stationary 1-lag dependent processes Decision variable: P f D P f P 0 can be: mutual information for the bivariate distribution of any two consecutive states under P f Pearson s φ 2 -coefficient for the bivariate distribution of any two consecutive states under P f

28 Example 2: Uncertain Dependency Baseline input model: X t P 0 X = (X 1, X 2,, X T ) Performance measure: E[h X ] Output the worst-case performance measure among all 1-lag processes within η units of nonparametric distance from i.i.d. process, keeping marginals fixed E.g. Single-server queue: X = sequence of interarrival times, P 0 = Exp(0.8), h X = I(waiting time of 20th customer > 2) What if the i.i.d. assumption for (X 1,, X T ) is potentially misspecified? maximize E f h X 1,, X T subject to X 1,, X T P f D P f P 0 η P f has the same marginal as P 0 P f a class of stationary 1-lag dependent processes Decision variable: P f D P f P 0 can be: mutual information for the bivariate distribution of any two consecutive states under P f Pearson s φ 2 -coefficient for the bivariate distribution of any two consecutive states under P f

29 Past Literature Area Economics Hansen & Sargent 01, 08 Examples of Works Finance Glasserman & Xu 13, Lim et al. 12 Control theory/mdp/markov perturbation analysis Petersen et. al. 00, Nilim & El Ghaoui 05, Iyengar 05, Jain et. al. 10, Heidergott et. al. 10 (Distributionally) robust optimization Delage & Ye 10, Goh & Sim 10, Ben-Tal et. al. 13, Wiesemann et. al. 14, Xin et. al. 15 Robust Monte Carlo Glasserman & Xu 14, Glasserman & Yang 15 Decision analysis/moment method Smith 93, 95, Birge & Wets 87, Bertsimas & Popescu 05, Popescu 05

30 Past Literature Area Economics Hansen & Sargent 01, 08 Examples of Works Finance Glasserman & Xu 13, Lim et al. 12 Control theory/mdp/markov perturbation analysis Petersen et. al. 00, Nilim & El Ghaoui 05, Iyengar 05, Jain et. al. 10, Heidergott et. al. 10 (Distributionally) robust optimization Delage & Ye 10, Goh & Sim 10, Ben-Tal et. al. 13, Wiesemann et. al. 14, Xin et. al. 15 Robust Monte Carlo Glasserman & Xu 14, Glasserman & Yang 15 Decision analysis/moment method Typical focus in the literature: Tractable convex optimization formulation Smith 93, 95, Birge & Wets 87, Bertsimas & Popescu 05, Popescu 05

31 Past Literature Area Economics Hansen & Sargent 01, 08 Examples of Works Finance Glasserman & Xu 13, Lim et al. 12 Control theory/mdp/markov perturbation analysis Petersen et. al. 00, Nilim & El Ghaoui 05, Iyengar 05, Jain et. al. 10, Heidergott et. al. 10 (Distributionally) robust optimization Delage & Ye 10, Goh & Sim 10, Ben-Tal et. al. 13, Wiesemann et. al. 14, Xin et. al. 15 Robust Monte Carlo Glasserman & Xu 14, Glasserman & Yang 15 Decision analysis/moment method Typical focus in the literature: Tractable convex optimization formulation Challenges in the stochastic modeling contexts: Non-standard constraints/objectives High or infinite dimensionality Stochastic/simulation-based Smith 93, 95, Birge & Wets 87, Bertsimas & Popescu 05, Popescu 05

32 A Sensitivity Approach Worst-case formulation for the Basic Example: where D( ) = KL divergence maximize E f [h X ] subject to D P f P 0 X t i.i.d. P f η

33 A Sensitivity Approach Relaxed worst-case formulation for the Basic Example: maximize E f [h X ] subject to D P f P 0 Tη where D( ) = KL divergence

34 A Sensitivity Approach Relaxed worst-case formulation for the Basic Example: maximize E f [h X ] subject to D P f P 0 Too conservative Tη where D( ) = KL divergence

35 A Sensitivity Approach Worst-case formulation: maximize E f [h X ] subject to D P f P 0 X i.i.d. t P f η

36 A Sensitivity Approach Worst-case formulation: maximize E f [h X ] subject to D P f P 0 X i.i.d. t P f η Let η 0. Under mild regularity conditions, max E f h X = E 0 h X + ξ 1 P 0, h η + ξ 2 P 0, h η + where ξ 1 (P 0, h), are computationally tractable objects that depend only on the baseline model P 0

37 A Sensitivity Approach Worst-case formulation: maximize E f [h X ] subject to D P f P 0 X i.i.d. t P f η Let η 0. Under mild regularity conditions, max E f h X = E 0 h X + ξ 1 P 0, h η + ξ 2 P 0, h η + where ξ 1 (P 0, h), are computationally tractable objects that depend only on the baseline model P 0

38 A Sensitivity Approach Worst-case formulation: maximize E f [h X ] subject to D P f P 0 X i.i.d. t P f η Let η 0. Under mild regularity conditions, max E f h X = E 0 h X + ξ 1 P 0, h η + ξ 2 P 0, h η + where ξ 1 (P 0, h), are computationally tractable objects that depend only on the baseline model P 0

39 Numerical Illustration Performance measure: E h X = P W 20 > 2 Baseline model P 0 for service time: Exp(1) Worst-case formulation: maximize E f [h X ] subject to D P f P 0 η and X i.i.d. t P f minimize E f [h X ] subject to D P f P 0 X t i.i.d. P f η max E f h X E 0 h X + ξ 1 P 0, h η Asymptotic approximation min E f h X E 0 h X ξ 1 P 0, h η

40 Numerical Illustration Worst-case lower and upper bounds Baseline performance measure

41 Numerical Illustration Exp (increasing rate) Gamma (increasing shape parameter) Mixture of Exp Exp (decreasing rate) Gamma (constant mean) Mixture of Exp (constant mean)

42 Numerical Illustration Relaxed bounds from conventional approach

43 Form of Expansion Coefficients Influence function (Hampel 74): the first order effect on a statistical functional due to perturbation of the input probability distribution For a functional Z( ) acting on P, the influence function G(x) is defined such that: Z P f = Z P 0 + E f G X +

44 Form of Expansion Coefficients Influence function (Hampel 74): the first order effect on a statistical functional due to perturbation of the input probability distribution For a functional Z( ) acting on P, the influence function G(x) is defined such that: Z P f = Z P 0 + E f G X + i.i.d. For example, if Z P f = E f [h X 1,, X T ] where X t P f, G x = T t=1 E 0 [h(x 1,, X T ) X t = x] TE 0 [h(x 1,, X T )]

45 Form of Expansion Coefficients maximize E f [h X ] subject to D P f P 0 X i.i.d. t P f η T Theorem (L. 13): For KL divergence D( ), assuming h X i=1 Λ i (X i ) where each Λ i X i satisfies E 0 e θλ i X i < for θ in a neighborhood of zero, and Var 0 G X > 0, then ξ 1 P 0, h = 2Var 0 G X

46 Form of Expansion Coefficients maximize E f [h X ] subject to D P f P 0 X i.i.d. t P f η T Theorem (L. 13): For KL divergence D( ), assuming h X i=1 Λ i (X i ) where each Λ i X i satisfies E 0 e θλ i X i < for θ in a neighborhood of zero, and Var 0 G X > 0, then ξ 1 P 0, h = ξ 2 P 0, h = 2Var 0 G X 1 Var 0 G X 1 3 κ 3 G X + ν where κ 3 is the third-order cumulant and ν = E 0 G 2 X, Y E 0 G 2 X, Y G X E 0 G X G Y E 0 G Y and G 2 X, Y is a second order influence function

47 Form of Expansion Coefficients maximize E f [h X ] subject to D P f P 0 X t i.i.d. P f η T Theorem (L. 13): For KL divergence D( ), assuming h X i=1 Λ i (X i ) where each Λ i X i satisfies E 0 e θλ i X i < for θ in a neighborhood of zero, and Var 0 G X > 0, then ξ 1 P 0, h = ξ 2 P 0, h = 2Var 0 G X 1 Var 0 G X 1 3 κ 3 G X + ν where κ 3 is the third-order cumulant and ν = E 0 G 2 X, Y E 0 G 2 X, Y G X E 0 G X G Y E 0 G Y and G 2 X, Y is a second order influence function Simulation strategy: nested simulation, or nonparametric bootstrap

48 Uncertain Dependency maximize E f h X 1,, X T subject to X 1,, X T P f D P f P 0 η P f has the same marginal as P 0 P f a class of stationary 1-lag dependent processes

49 Uncertain Dependency maximize E f h X 1,, X T subject to X 1,, X T P f D P f P 0 η P f has the same marginal as P 0 P f a class of stationary 1-lag dependent processes max E f h X = E 0 h X + ξ 1 P 0, h η + ξ 2 P 0, h η +

50 Uncertain Dependency maximize E f h X 1,, X T subject to X 1,, X T P f D P f P 0 η P f has the same marginal as P 0 P f a class of stationary 1-lag dependent processes max E f h X = E 0 h X + ξ 1 P 0, h η + ξ 2 P 0, h η + Theorem (L. 13): Define a bivariate influence function G x, y = T t=2 E 0 [h(x 1,, X T ) X t 1 = x, X t = y]

51 Uncertain Dependency maximize E f h X 1,, X T subject to X 1,, X T P f D P f P 0 η P f has the same marginal as P 0 P f a class of stationary 1-lag dependent processes max E f h X = E 0 h X + ξ 1 P 0, h η + ξ 2 P 0, h η + Theorem (L. 13): Define a bivariate influence function G x, y = T t=2 E 0 [h(x 1,, X T ) X t 1 = x, X t = y] and its interaction effect R x, y = G x, y E 0 G X, Y X = x E 0 G X, Y Y = y + E 0 [ G X, Y ]

52 Uncertain Dependency maximize E f h X 1,, X T subject to X 1,, X T P f D P f P 0 η P f has the same marginal as P 0 P f a class of stationary 1-lag dependent processes max E f h X = E 0 h X + ξ 1 P 0, h η + ξ 2 P 0, h η + Theorem (L. 13): Define a bivariate influence function G x, y = T t=2 E 0 [h(x 1,, X T ) X t 1 = x, X t = y] and its interaction effect R x, y = G x, y E 0 G X, Y X = x E 0 G X, Y Y = y + E 0 [ G X, Y ] For mutual information D( ), assuming h X is bounded, and Var 0 R X, Y > 0, then i.i.d. where X, Y P 0 ξ 1 P 0, h = 2Var 0 R X, Y

53 Uncertain Dependency maximize E f h X 1,, X T subject to X 1,, X T P f D P f P 0 η P f has the same marginal as P 0 P f a class of stationary 1-lag dependent processes max E f h X = E 0 h X + ξ 1 P 0, h η + ξ 2 P 0, h η + Theorem (L. 13): Define a bivariate influence function G x, y = T t=2 E 0 [h(x 1,, X T ) X t 1 = x, X t = y] and its interaction effect R x, y = G x, y E 0 G X, Y X = x E 0 G X, Y Y = y + E 0 [ G X, Y ] For mutual information D( ), assuming h X is bounded, and Var 0 R X, Y > 0, then i.i.d. where X, Y P 0 ξ 1 P 0, h = 2Var 0 R X, Y Simulation strategy: nested simulation + two-way analysis of variance (ANOVA)

54 Numerical Illustration

55 General Formulations and Simulation Optimization Beyond asymptotic approximation? Other types of constraints/objectives?

56 General Formulations and Simulation Optimization Beyond asymptotic approximation? Other types of constraints/objectives? Example 3: Input Reconstruction Can we infer about the service time distribution from waiting time data?

57 General Formulations and Simulation Optimization Beyond asymptotic approximation? Other types of constraints/objectives? Example 3: Input Reconstruction Can we infer about the service time distribution from waiting time data? Find P f such that E f φ j h X 1,, X T = X i.i.d. t P f μ j for j = 1,, m φ j, j = 1,, m: moment functions, e.g. I( q j ), j h(x 1,, X T ): waiting time μ j : empirical moments

58 General Formulations and Simulation Find P f such that Optimization Beyond asymptotic approximation? Other types of constraints/objectives? Example 3: Input Reconstruction Can we infer about the service time distribution from waiting time data? E f φ j h X 1,, X T = X i.i.d. t P f μ j for j = 1,, m Moment matching between the simulation output and the real output data φ j, j = 1,, m: moment functions, e.g. I( q j ), j h(x 1,, X T ): waiting time μ j : empirical moments

59 General Formulations and Simulation maximize R(P f ) subject to E f φ j h X 1,, X T = Optimization Beyond asymptotic approximation? Other types of constraints/objectives? Example 3: Input Reconstruction Can we infer about the service time distribution from waiting time data? e.g., R(P f ) = entropy of P f X i.i.d. t P f μ j for j = 1,, m φ j, j = 1,, m: moment functions, e.g. I( q j ), j h(x 1,, X T ): waiting time μ j : empirical moments

60 Sequential Least-Square Minimization An optimization problem with nonconvex stochastic constraints maximize R(P f ) subject to E f φ j h X = μ j for j = 1,, m i.i.d. X t P f

61 Sequential Least-Square Minimization An optimization problem with nonconvex stochastic constraints maximize R(P f ) subject to E f φ j h X = μ j for j = 1,, m i.i.d. X t P f A sequence of optimizations with stochastic non-convex objectives but deterministic convex constraints Find the maximum η such that the optimal value of minimize Z P f subject to = m 2 j=1 E f φ j h X μ j R P f η is 0. The P that is attained by η gives the optimal solution

62 Sequential Least-Square Minimization An optimization problem with nonconvex stochastic constraints maximize R(P f ) subject to E f φ j h X = μ j for j = 1,, m i.i.d. X t P f A sequence of optimizations with stochastic non-convex objectives but deterministic convex constraints Find the maximum η such that the optimal value of minimize Z P f subject to = m 2 j=1 E f φ j h X μ j R P f η is 0. The P that is attained by η gives the optimal solution

63 Sequential Least-Square Minimization An optimization problem with nonconvex stochastic constraints maximize R(P f ) subject to E f φ j h X = μ j for j = 1,, m i.i.d. X t P f A sequence of optimizations with stochastic non-convex objectives but deterministic convex constraints Find the maximum η such that the optimal value of minimize Z P f subject to = m 2 j=1 E f φ j h X μ j R P f η is 0. The P that is attained by η gives the optimal solution

64 Sequential Least-Square Minimization An optimization problem with nonconvex stochastic constraints maximize R(P f ) subject to E f φ j h X = μ j for j = 1,, m i.i.d. X t P f A sequence of optimizations with stochastic non-convex objectives but deterministic convex constraints Find the maximum η such that the optimal value of minimize Z P f subject to = m 2 j=1 E f φ j h X μ j R P f η is 0. The P that is attained by η gives the optimal solution Similar in spirit to quadratic penalty method (Bertsekas 95)

65 Sequential Least-Square Minimization An optimization problem with nonconvex stochastic constraints maximize R(P f ) subject to E f φ j h X = μ j for j = 1,, m i.i.d. X t P f A sequence of optimizations with stochastic non-convex objectives but deterministic convex constraints Find the maximum η such that the optimal value of minimize Z P f subject to = m 2 j=1 E f φ j h X μ j R P f η is 0. The P that is attained by η gives the optimal solution Similar in spirit to quadratic penalty method (Bertsekas 95)

66 Simulation Optimization over Probability Distributions Operate on a discretized space P f as (p 1,, p k ) on fixed support points (z 1,, z k ) Constrained stochastic approximation (SA) for solving local optimum

67 Simulation Optimization over Probability Distributions Operate on a discretized space P f as (p 1,, p k ) on fixed support points (z 1,, z k ) Constrained stochastic approximation (SA) for solving local optimum Gradient estimation: Perturb P f to a mixture with point mass 1 ε P f + εe i to obtain a finitedimensional influence function A nonparametric version of the likelihood ratio / score function method (Glynn 90, Reiman & Weiss 89, L Ecuyer 90)

68 Iteration on Mirror Descent Stochastic Approximation Each iteration solves a linearization of the objective value, penalized by the KL divergence (the entropic descent algorithm; e.g., Nemirovski et. al. 09, Beck & Teboulle 03, Nemirovski & Yudin 83) Given a current solution P n, solve P n+1 = argmin Pf Z P n P f P n + 1 γ n D(P f P n ) subject to R P f η βn 1 γn Z Pn 1+βn P 0 P n 1+βn e 1+βn where Z P n is the estimated gradient γ n is the step size β n solves a one-dimensional root-finding problem Convergence rate O 1/n in terms of the expected KL divergence when γ n = θ/n for large enough θ and standard conditions hold (Goeva, L. & Zhang 14, L. & Qian 16)

69 Numerical Illustration System: M/G/1 queue with known arrival rate but unknown service time distribution Output observations: queue length over time Specifications: Match 9 quantiles of the output variable time-average queue length in a busy period Discretization grid of 50 support points # data = 100,000

70 Numerical Illustration Plot of the optimal Z P f against η True vs reconstructed distribution # matching quantiles increased from 9 to 14 # data reduced from 10 5 to 200

71 Numerical Illustration Monotone distribution Bimodal distribution

72 Discussion Motivation: Model misspecification/uncertainty Main Approach: Sensitivity/asymptotic analysis of robust optimization programs over input stochastic models Simulation optimization Future Work: Other forms of stochastic/system uncertainty Relation to Bayesian methodology

Proceedings of the 2014 Winter Simulation Conference A. Tolk, S. D. Diallo, I. O. Ryzhov, L. Yilmaz, S. Buckley, and J. A. Miller, eds.

Proceedings of the 2014 Winter Simulation Conference A. Tolk, S. D. Diallo, I. O. Ryzhov, L. Yilmaz, S. Buckley, and J. A. Miller, eds. Proceedings of the 2014 Winter Simulation Conference A. Tolk, S. D. Diallo, I. O. Ryzhov, L. Yilmaz, S. Buckley, and J. A. Miller, eds. RECONSTRUCTING INPUT MODELS VIA SIMULATION OPTIMIZATION Aleksandrina

More information

ITERATIVE METHODS FOR ROBUST ESTIMATION UNDER BIVARIATE DISTRIBUTIONAL UNCERTAINTY

ITERATIVE METHODS FOR ROBUST ESTIMATION UNDER BIVARIATE DISTRIBUTIONAL UNCERTAINTY Proceedings of the 2013 Winter Simulation Conference R. Pasupathy, S.-H. Kim, A. Tolk, R. Hill, and M. E. Kuhl, eds. ITERATIVE METHODS FOR ROBUST ESTIMATION UNDER BIVARIATE DISTRIBUTIONAL UNCERTAINTY Henry

More information

Proceedings of the 2014 Winter Simulation Conference A. Tolk, S. D. Diallo, I. O. Ryzhov, L. Yilmaz, S. Buckley, and J. A. Miller, eds.

Proceedings of the 2014 Winter Simulation Conference A. Tolk, S. D. Diallo, I. O. Ryzhov, L. Yilmaz, S. Buckley, and J. A. Miller, eds. Proceedings of the 204 Winter Simulation Conference A. Tolk, S. D. Diallo, I. O. Ryzhov, L. Yilmaz, S. Buckley, and J. A. Miller, eds. ROBUST RARE-EVENT PERFORMANCE ANALYSIS WITH NATURAL NON-CONVEX CONSTRAINTS

More information

Robust Dual-Response Optimization

Robust Dual-Response Optimization Yanıkoğlu, den Hertog, and Kleijnen Robust Dual-Response Optimization 29 May 1 June 1 / 24 Robust Dual-Response Optimization İhsan Yanıkoğlu, Dick den Hertog, Jack P.C. Kleijnen Özyeğin University, İstanbul,

More information

Distributionally Robust Stochastic Optimization with Wasserstein Distance

Distributionally Robust Stochastic Optimization with Wasserstein Distance Distributionally Robust Stochastic Optimization with Wasserstein Distance Rui Gao DOS Seminar, Oct 2016 Joint work with Anton Kleywegt School of Industrial and Systems Engineering Georgia Tech What is

More information

Optimal Transport in Risk Analysis

Optimal Transport in Risk Analysis Optimal Transport in Risk Analysis Jose Blanchet (based on work with Y. Kang and K. Murthy) Stanford University (Management Science and Engineering), and Columbia University (Department of Statistics and

More information

Proceedings of the 2015 Winter Simulation Conference L. Yilmaz, W. K. V. Chan, I. Moon, T. M. K. Roeder, C. Macal, and M. D. Rossetti, eds.

Proceedings of the 2015 Winter Simulation Conference L. Yilmaz, W. K. V. Chan, I. Moon, T. M. K. Roeder, C. Macal, and M. D. Rossetti, eds. Proceedings of the 015 Winter Simulation Conference L. Yilmaz, W. K. V. Chan, I. Moon, T. M. K. Roeder, C. Macal, and M. D. Rossetti, eds. SIMULATING TAIL EVENTS WITH UNSPECIFIED TAIL MODELS Henry Lam

More information

Distributionally Robust Convex Optimization

Distributionally Robust Convex Optimization Distributionally Robust Convex Optimization Wolfram Wiesemann 1, Daniel Kuhn 1, and Melvyn Sim 2 1 Department of Computing, Imperial College London, United Kingdom 2 Department of Decision Sciences, National

More information

Robust Sensitivity Analysis for Stochastic Systems

Robust Sensitivity Analysis for Stochastic Systems Robust Sensitivity Analysis for Stochastic Systems Henry Lam Department of Mathematics and Statistics, Boston University, 111 Cummington Street, Boston, MA 02215 email: khlam@bu.edu We study a worst-case

More information

Optimal Transport Methods in Operations Research and Statistics

Optimal Transport Methods in Operations Research and Statistics Optimal Transport Methods in Operations Research and Statistics Jose Blanchet (based on work with F. He, Y. Kang, K. Murthy, F. Zhang). Stanford University (Management Science and Engineering), and Columbia

More information

Covariance function estimation in Gaussian process regression

Covariance function estimation in Gaussian process regression Covariance function estimation in Gaussian process regression François Bachoc Department of Statistics and Operations Research, University of Vienna WU Research Seminar - May 2015 François Bachoc Gaussian

More information

Proceedings of the 2017 Winter Simulation Conference W. K. V. Chan, A. D Ambrogio, G. Zacharewicz, N. Mustafee, G. Wainer, and E. Page, eds.

Proceedings of the 2017 Winter Simulation Conference W. K. V. Chan, A. D Ambrogio, G. Zacharewicz, N. Mustafee, G. Wainer, and E. Page, eds. Proceedings of the 2017 Winter Simulation Conference W. K. V. Chan, A. D Ambrogio, G. Zacharewicz, N. Mustafee, G. Wainer, and E. Page, eds. IMPROVING PREDICTION FROM STOCHASTIC SIMULATION VIA MODEL DISCREPANCY

More information

Sensitivity to Serial Dependency of Input Processes: A Robust Approach

Sensitivity to Serial Dependency of Input Processes: A Robust Approach Sensitivity to Serial Dependency of Input Processes: A Robust Approach Henry Lam Boston University Abstract We propose a distribution-free approach to assess the sensitivity of stochastic simulation with

More information

Proceedings of the 2015 Winter Simulation Conference L. Yilmaz, W. K. V. Chan, I. Moon, T. M. K. Roeder, C. Macal, and M. D. Rossetti, eds.

Proceedings of the 2015 Winter Simulation Conference L. Yilmaz, W. K. V. Chan, I. Moon, T. M. K. Roeder, C. Macal, and M. D. Rossetti, eds. Proceedings of the 2015 Winter Simulation Conference L. Yilmaz, W. K. V. Chan, I. Moon, T. M. K. Roeder, C. Macal, and M. D. Rossetti, eds. STOCHASTIC OPTIMIZATION USING HELLINGER DISTANCE Anand N. Vidyashankar

More information

Random Convex Approximations of Ambiguous Chance Constrained Programs

Random Convex Approximations of Ambiguous Chance Constrained Programs Random Convex Approximations of Ambiguous Chance Constrained Programs Shih-Hao Tseng Eilyan Bitar Ao Tang Abstract We investigate an approach to the approximation of ambiguous chance constrained programs

More information

Proceedings of the 2016 Winter Simulation Conference T. M. K. Roeder, P. I. Frazier, R. Szechtman, E. Zhou, T. Huschka, and S. E. Chick, eds.

Proceedings of the 2016 Winter Simulation Conference T. M. K. Roeder, P. I. Frazier, R. Szechtman, E. Zhou, T. Huschka, and S. E. Chick, eds. Proceedings of the 2016 Winter Simulation Conference T. M. K. Roeder, P. I. Frazier, R. Szechtman, E. Zhou, T. Huschka, and S. E. Chick, eds. ADVANCED TUTORIAL: INPUT UNCERTAINTY AND ROBUST ANALYSIS IN

More information

Proceedings of the 2018 Winter Simulation Conference M. Rabe, A. A. Juan, N. Mustafee, A. Skoogh, S. Jain, and B. Johansson, eds.

Proceedings of the 2018 Winter Simulation Conference M. Rabe, A. A. Juan, N. Mustafee, A. Skoogh, S. Jain, and B. Johansson, eds. Proceedings of the 208 Winter Simulation Conference M. Rabe, A. A. Juan, N. Mustafee, A. Skoogh, S. Jain, and B. Johansson, eds. ACHIEVING OPTIMAL BIAS-VARIANCE TRADEOFF IN ONLINE DERIVATIVE ESTIMATION

More information

Distributionally Robust Optimization with ROME (part 1)

Distributionally Robust Optimization with ROME (part 1) Distributionally Robust Optimization with ROME (part 1) Joel Goh Melvyn Sim Department of Decision Sciences NUS Business School, Singapore 18 Jun 2009 NUS Business School Guest Lecture J. Goh, M. Sim (NUS)

More information

Robust Optimization for Risk Control in Enterprise-wide Optimization

Robust Optimization for Risk Control in Enterprise-wide Optimization Robust Optimization for Risk Control in Enterprise-wide Optimization Juan Pablo Vielma Department of Industrial Engineering University of Pittsburgh EWO Seminar, 011 Pittsburgh, PA Uncertainty in Optimization

More information

Proceedings of the 2016 Winter Simulation Conference T. M. K. Roeder, P. I. Frazier, R. Szechtman, E. Zhou, T. Huschka, and S. E. Chick, eds.

Proceedings of the 2016 Winter Simulation Conference T. M. K. Roeder, P. I. Frazier, R. Szechtman, E. Zhou, T. Huschka, and S. E. Chick, eds. Proceedings of the 216 Winter Simulation Conference T. M. K. Roeder, P. I. Frazier, R. Szechtman, E. Zhou, T. Huschka, and S. E. Chick, eds. LEARNING STOCHASTIC MODEL DISCREPANCY Matthew Plumlee Henry

More information

Distributionally Robust Discrete Optimization with Entropic Value-at-Risk

Distributionally Robust Discrete Optimization with Entropic Value-at-Risk Distributionally Robust Discrete Optimization with Entropic Value-at-Risk Daniel Zhuoyu Long Department of SEEM, The Chinese University of Hong Kong, zylong@se.cuhk.edu.hk Jin Qi NUS Business School, National

More information

Likelihood robust optimization for data-driven problems

Likelihood robust optimization for data-driven problems Comput Manag Sci (2016) 13:241 261 DOI 101007/s10287-015-0240-3 ORIGINAL PAPER Likelihood robust optimization for data-driven problems Zizhuo Wang 1 Peter W Glynn 2 Yinyu Ye 2 Received: 9 August 2013 /

More information

Introduction to Rare Event Simulation

Introduction to Rare Event Simulation Introduction to Rare Event Simulation Brown University: Summer School on Rare Event Simulation Jose Blanchet Columbia University. Department of Statistics, Department of IEOR. Blanchet (Columbia) 1 / 31

More information

Robust Backtesting Tests for Value-at-Risk Models

Robust Backtesting Tests for Value-at-Risk Models Robust Backtesting Tests for Value-at-Risk Models Jose Olmo City University London (joint work with Juan Carlos Escanciano, Indiana University) Far East and South Asia Meeting of the Econometric Society

More information

Online Companion: Risk-Averse Approximate Dynamic Programming with Quantile-Based Risk Measures

Online Companion: Risk-Averse Approximate Dynamic Programming with Quantile-Based Risk Measures Online Companion: Risk-Averse Approximate Dynamic Programming with Quantile-Based Risk Measures Daniel R. Jiang and Warren B. Powell Abstract In this online companion, we provide some additional preliminary

More information

ORIGINS OF STOCHASTIC PROGRAMMING

ORIGINS OF STOCHASTIC PROGRAMMING ORIGINS OF STOCHASTIC PROGRAMMING Early 1950 s: in applications of Linear Programming unknown values of coefficients: demands, technological coefficients, yields, etc. QUOTATION Dantzig, Interfaces 20,1990

More information

High Dimensional Empirical Likelihood for Generalized Estimating Equations with Dependent Data

High Dimensional Empirical Likelihood for Generalized Estimating Equations with Dependent Data High Dimensional Empirical Likelihood for Generalized Estimating Equations with Dependent Data Song Xi CHEN Guanghua School of Management and Center for Statistical Science, Peking University Department

More information

Distributionally Robust Optimization with Infinitely Constrained Ambiguity Sets

Distributionally Robust Optimization with Infinitely Constrained Ambiguity Sets manuscript Distributionally Robust Optimization with Infinitely Constrained Ambiguity Sets Zhi Chen Imperial College Business School, Imperial College London zhi.chen@imperial.ac.uk Melvyn Sim Department

More information

arxiv: v1 [stat.me] 21 Jun 2016

arxiv: v1 [stat.me] 21 Jun 2016 Sensitivity to Serial Dependency of Input Processes: A Robust Approach Henry Lam Department of Industrial and Operations Engineering, University of Michigan, Ann Arbor, MI 48109, khlam@umich.edu arxiv:1606.06645v1

More information

Distirbutional robustness, regularizing variance, and adversaries

Distirbutional robustness, regularizing variance, and adversaries Distirbutional robustness, regularizing variance, and adversaries John Duchi Based on joint work with Hongseok Namkoong and Aman Sinha Stanford University November 2017 Motivation We do not want machine-learned

More information

Assessing financial model risk

Assessing financial model risk Assessing financial model risk and an application to electricity prices Giacomo Scandolo University of Florence giacomo.scandolo@unifi.it joint works with Pauline Barrieu (LSE) and Angelica Gianfreda (LBS)

More information

Spatially Smoothed Kernel Density Estimation via Generalized Empirical Likelihood

Spatially Smoothed Kernel Density Estimation via Generalized Empirical Likelihood Spatially Smoothed Kernel Density Estimation via Generalized Empirical Likelihood Kuangyu Wen & Ximing Wu Texas A&M University Info-Metrics Institute Conference: Recent Innovations in Info-Metrics October

More information

Optimization Tools in an Uncertain Environment

Optimization Tools in an Uncertain Environment Optimization Tools in an Uncertain Environment Michael C. Ferris University of Wisconsin, Madison Uncertainty Workshop, Chicago: July 21, 2008 Michael Ferris (University of Wisconsin) Stochastic optimization

More information

arxiv: v3 [math.oc] 25 Apr 2018

arxiv: v3 [math.oc] 25 Apr 2018 Problem-driven scenario generation: an analytical approach for stochastic programs with tail risk measure Jamie Fairbrother *, Amanda Turner *, and Stein W. Wallace ** * STOR-i Centre for Doctoral Training,

More information

Distributionally Robust Convex Optimization

Distributionally Robust Convex Optimization Submitted to Operations Research manuscript OPRE-2013-02-060 Authors are encouraged to submit new papers to INFORMS journals by means of a style file template, which includes the journal title. However,

More information

Markov Chain Monte Carlo Methods for Stochastic

Markov Chain Monte Carlo Methods for Stochastic Markov Chain Monte Carlo Methods for Stochastic Optimization i John R. Birge The University of Chicago Booth School of Business Joint work with Nicholas Polson, Chicago Booth. JRBirge U Florida, Nov 2013

More information

STA 4273H: Statistical Machine Learning

STA 4273H: Statistical Machine Learning STA 4273H: Statistical Machine Learning Russ Salakhutdinov Department of Statistics! rsalakhu@utstat.toronto.edu! http://www.utstat.utoronto.ca/~rsalakhu/ Sidney Smith Hall, Room 6002 Lecture 7 Approximate

More information

Semidefinite and Second Order Cone Programming Seminar Fall 2012 Project: Robust Optimization and its Application of Robust Portfolio Optimization

Semidefinite and Second Order Cone Programming Seminar Fall 2012 Project: Robust Optimization and its Application of Robust Portfolio Optimization Semidefinite and Second Order Cone Programming Seminar Fall 2012 Project: Robust Optimization and its Application of Robust Portfolio Optimization Instructor: Farid Alizadeh Author: Ai Kagawa 12/12/2012

More information

Particle Filtering Approaches for Dynamic Stochastic Optimization

Particle Filtering Approaches for Dynamic Stochastic Optimization Particle Filtering Approaches for Dynamic Stochastic Optimization John R. Birge The University of Chicago Booth School of Business Joint work with Nicholas Polson, Chicago Booth. JRBirge I-Sim Workshop,

More information

Bayesian Point Process Modeling for Extreme Value Analysis, with an Application to Systemic Risk Assessment in Correlated Financial Markets

Bayesian Point Process Modeling for Extreme Value Analysis, with an Application to Systemic Risk Assessment in Correlated Financial Markets Bayesian Point Process Modeling for Extreme Value Analysis, with an Application to Systemic Risk Assessment in Correlated Financial Markets Athanasios Kottas Department of Applied Mathematics and Statistics,

More information

Robust Stochastic Frontier Analysis: a Minimum Density Power Divergence Approach

Robust Stochastic Frontier Analysis: a Minimum Density Power Divergence Approach Robust Stochastic Frontier Analysis: a Minimum Density Power Divergence Approach Federico Belotti Giuseppe Ilardi CEIS, University of Rome Tor Vergata Bank of Italy Workshop on the Econometrics and Statistics

More information

Modelling Under Risk and Uncertainty

Modelling Under Risk and Uncertainty Modelling Under Risk and Uncertainty An Introduction to Statistical, Phenomenological and Computational Methods Etienne de Rocquigny Ecole Centrale Paris, Universite Paris-Saclay, France WILEY A John Wiley

More information

arxiv: v1 [stat.me] 2 Nov 2017

arxiv: v1 [stat.me] 2 Nov 2017 On Optimization over Tail Distributions Clémentine Mottet Department of Mathematics and Statistics, Boston University, Boston, MA 2115, cmottet@bu.edu Henry Lam Department of Industrial Engineering and

More information

Uncertainty. Jayakrishnan Unnikrishnan. CSL June PhD Defense ECE Department

Uncertainty. Jayakrishnan Unnikrishnan. CSL June PhD Defense ECE Department Decision-Making under Statistical Uncertainty Jayakrishnan Unnikrishnan PhD Defense ECE Department University of Illinois at Urbana-Champaign CSL 141 12 June 2010 Statistical Decision-Making Relevant in

More information

Curve Fitting Re-visited, Bishop1.2.5

Curve Fitting Re-visited, Bishop1.2.5 Curve Fitting Re-visited, Bishop1.2.5 Maximum Likelihood Bishop 1.2.5 Model Likelihood differentiation p(t x, w, β) = Maximum Likelihood N N ( t n y(x n, w), β 1). (1.61) n=1 As we did in the case of the

More information

Foundations of Nonparametric Bayesian Methods

Foundations of Nonparametric Bayesian Methods 1 / 27 Foundations of Nonparametric Bayesian Methods Part II: Models on the Simplex Peter Orbanz http://mlg.eng.cam.ac.uk/porbanz/npb-tutorial.html 2 / 27 Tutorial Overview Part I: Basics Part II: Models

More information

DSGE Methods. Estimation of DSGE models: GMM and Indirect Inference. Willi Mutschler, M.Sc.

DSGE Methods. Estimation of DSGE models: GMM and Indirect Inference. Willi Mutschler, M.Sc. DSGE Methods Estimation of DSGE models: GMM and Indirect Inference Willi Mutschler, M.Sc. Institute of Econometrics and Economic Statistics University of Münster willi.mutschler@wiwi.uni-muenster.de Summer

More information

PRIVATE RISK AND VALUATION: A DISTRIBUTIONALLY ROBUST OPTIMIZATION VIEW

PRIVATE RISK AND VALUATION: A DISTRIBUTIONALLY ROBUST OPTIMIZATION VIEW PRIVATE RISK AND VALUATION: A DISTRIBUTIONALLY ROBUST OPTIMIZATION VIEW J. BLANCHET AND U. LALL ABSTRACT. This chapter summarizes a body of work which has as objective the quantification of risk exposures

More information

Testing Statistical Hypotheses

Testing Statistical Hypotheses E.L. Lehmann Joseph P. Romano Testing Statistical Hypotheses Third Edition 4y Springer Preface vii I Small-Sample Theory 1 1 The General Decision Problem 3 1.1 Statistical Inference and Statistical Decisions

More information

Handout 8: Dealing with Data Uncertainty

Handout 8: Dealing with Data Uncertainty MFE 5100: Optimization 2015 16 First Term Handout 8: Dealing with Data Uncertainty Instructor: Anthony Man Cho So December 1, 2015 1 Introduction Conic linear programming CLP, and in particular, semidefinite

More information

Proceedings of the 2018 Winter Simulation Conference M. Rabe, A. A. Juan, N. Mustafee, A. Skoogh, S. Jain, and B. Johansson, eds.

Proceedings of the 2018 Winter Simulation Conference M. Rabe, A. A. Juan, N. Mustafee, A. Skoogh, S. Jain, and B. Johansson, eds. Proceedings of the 2018 Winter Simulation Conference M. Rabe, A. A. Juan, N. Mustafee, A. Skoogh, S. Jain, and B. Johansson, eds. CONSTRUCTING SIMULATION OUTPUT INTERVALS UNDER INPUT UNCERTAINTY VIA DATA

More information

Dynamic System Identification using HDMR-Bayesian Technique

Dynamic System Identification using HDMR-Bayesian Technique Dynamic System Identification using HDMR-Bayesian Technique *Shereena O A 1) and Dr. B N Rao 2) 1), 2) Department of Civil Engineering, IIT Madras, Chennai 600036, Tamil Nadu, India 1) ce14d020@smail.iitm.ac.in

More information

On Bayesian Computation

On Bayesian Computation On Bayesian Computation Michael I. Jordan with Elaine Angelino, Maxim Rabinovich, Martin Wainwright and Yun Yang Previous Work: Information Constraints on Inference Minimize the minimax risk under constraints

More information

ECS171: Machine Learning

ECS171: Machine Learning ECS171: Machine Learning Lecture 3: Linear Models I (LFD 3.2, 3.3) Cho-Jui Hsieh UC Davis Jan 17, 2018 Linear Regression (LFD 3.2) Regression Classification: Customer record Yes/No Regression: predicting

More information

Econometrics I, Estimation

Econometrics I, Estimation Econometrics I, Estimation Department of Economics Stanford University September, 2008 Part I Parameter, Estimator, Estimate A parametric is a feature of the population. An estimator is a function of the

More information

Estimation and Optimization: Gaps and Bridges. MURI Meeting June 20, Laurent El Ghaoui. UC Berkeley EECS

Estimation and Optimization: Gaps and Bridges. MURI Meeting June 20, Laurent El Ghaoui. UC Berkeley EECS MURI Meeting June 20, 2001 Estimation and Optimization: Gaps and Bridges Laurent El Ghaoui EECS UC Berkeley 1 goals currently, estimation (of model parameters) and optimization (of decision variables)

More information

TWO-STAGE LIKELIHOOD ROBUST LINEAR PROGRAM WITH APPLICATION TO WATER ALLOCATION UNDER UNCERTAINTY

TWO-STAGE LIKELIHOOD ROBUST LINEAR PROGRAM WITH APPLICATION TO WATER ALLOCATION UNDER UNCERTAINTY Proceedings of the 2013 Winter Simulation Conference R. Pasupathy, S.-H. Kim, A. Tolk, R. Hill, and M. E. Kuhl, eds. TWO-STAGE LIKELIHOOD ROBUST LIEAR PROGRAM WITH APPLICATIO TO WATER ALLOCATIO UDER UCERTAITY

More information

Data-Driven Optimization under Distributional Uncertainty

Data-Driven Optimization under Distributional Uncertainty Data-Driven Optimization under Distributional Uncertainty Postdoctoral Researcher Electrical and Systems Engineering University of Pennsylvania A network of physical objects - Devices - Vehicles - Buildings

More information

Long-Run Covariability

Long-Run Covariability Long-Run Covariability Ulrich K. Müller and Mark W. Watson Princeton University October 2016 Motivation Study the long-run covariability/relationship between economic variables great ratios, long-run Phillips

More information

MEASURES OF RISK IN STOCHASTIC OPTIMIZATION

MEASURES OF RISK IN STOCHASTIC OPTIMIZATION MEASURES OF RISK IN STOCHASTIC OPTIMIZATION R. T. Rockafellar University of Washington, Seattle University of Florida, Gainesville Insitute for Mathematics and its Applications University of Minnesota

More information

Statistics: Learning models from data

Statistics: Learning models from data DS-GA 1002 Lecture notes 5 October 19, 2015 Statistics: Learning models from data Learning models from data that are assumed to be generated probabilistically from a certain unknown distribution is a crucial

More information

Probability Models in Electrical and Computer Engineering Mathematical models as tools in analysis and design Deterministic models Probability models

Probability Models in Electrical and Computer Engineering Mathematical models as tools in analysis and design Deterministic models Probability models Probability Models in Electrical and Computer Engineering Mathematical models as tools in analysis and design Deterministic models Probability models Statistical regularity Properties of relative frequency

More information

Simulation optimization via bootstrapped Kriging: Survey

Simulation optimization via bootstrapped Kriging: Survey Simulation optimization via bootstrapped Kriging: Survey Jack P.C. Kleijnen Department of Information Management / Center for Economic Research (CentER) Tilburg School of Economics & Management (TiSEM)

More information

ABC methods for phase-type distributions with applications in insurance risk problems

ABC methods for phase-type distributions with applications in insurance risk problems ABC methods for phase-type with applications problems Concepcion Ausin, Department of Statistics, Universidad Carlos III de Madrid Joint work with: Pedro Galeano, Universidad Carlos III de Madrid Simon

More information

Stochastic Networks and Parameter Uncertainty

Stochastic Networks and Parameter Uncertainty Stochastic Networks and Parameter Uncertainty Assaf Zeevi Graduate School of Business Columbia University Stochastic Processing Networks Conference, August 2009 based on joint work with Mike Harrison Achal

More information

Estimation of Dynamic Regression Models

Estimation of Dynamic Regression Models University of Pavia 2007 Estimation of Dynamic Regression Models Eduardo Rossi University of Pavia Factorization of the density DGP: D t (x t χ t 1, d t ; Ψ) x t represent all the variables in the economy.

More information

SOME HISTORY OF STOCHASTIC PROGRAMMING

SOME HISTORY OF STOCHASTIC PROGRAMMING SOME HISTORY OF STOCHASTIC PROGRAMMING Early 1950 s: in applications of Linear Programming unknown values of coefficients: demands, technological coefficients, yields, etc. QUOTATION Dantzig, Interfaces

More information

Quantitative Biology II Lecture 4: Variational Methods

Quantitative Biology II Lecture 4: Variational Methods 10 th March 2015 Quantitative Biology II Lecture 4: Variational Methods Gurinder Singh Mickey Atwal Center for Quantitative Biology Cold Spring Harbor Laboratory Image credit: Mike West Summary Approximate

More information

Markov Chain Monte Carlo Methods for Stochastic Optimization

Markov Chain Monte Carlo Methods for Stochastic Optimization Markov Chain Monte Carlo Methods for Stochastic Optimization John R. Birge The University of Chicago Booth School of Business Joint work with Nicholas Polson, Chicago Booth. JRBirge U of Toronto, MIE,

More information

On the Power of Tests for Regime Switching

On the Power of Tests for Regime Switching On the Power of Tests for Regime Switching joint work with Drew Carter and Ben Hansen Douglas G. Steigerwald UC Santa Barbara May 2015 D. Steigerwald (UCSB) Regime Switching May 2015 1 / 42 Motivating

More information

Spring 2017 Econ 574 Roger Koenker. Lecture 14 GEE-GMM

Spring 2017 Econ 574 Roger Koenker. Lecture 14 GEE-GMM University of Illinois Department of Economics Spring 2017 Econ 574 Roger Koenker Lecture 14 GEE-GMM Throughout the course we have emphasized methods of estimation and inference based on the principle

More information

Chapter 5. Statistical Models in Simulations 5.1. Prof. Dr. Mesut Güneş Ch. 5 Statistical Models in Simulations

Chapter 5. Statistical Models in Simulations 5.1. Prof. Dr. Mesut Güneş Ch. 5 Statistical Models in Simulations Chapter 5 Statistical Models in Simulations 5.1 Contents Basic Probability Theory Concepts Discrete Distributions Continuous Distributions Poisson Process Empirical Distributions Useful Statistical Models

More information

Distributed Chance-Constrained Task Allocation for Autonomous Multi-Agent Teams

Distributed Chance-Constrained Task Allocation for Autonomous Multi-Agent Teams Distributed Chance-Constrained Task Allocation for Autonomous Multi-Agent Teams Sameera S. Ponda, Luke B. Johnson and Jonathan P. How Abstract This research presents a distributed chanceconstrained task

More information

Extreme Value Analysis and Spatial Extremes

Extreme Value Analysis and Spatial Extremes Extreme Value Analysis and Department of Statistics Purdue University 11/07/2013 Outline Motivation 1 Motivation 2 Extreme Value Theorem and 3 Bayesian Hierarchical Models Copula Models Max-stable Models

More information

Kriging models with Gaussian processes - covariance function estimation and impact of spatial sampling

Kriging models with Gaussian processes - covariance function estimation and impact of spatial sampling Kriging models with Gaussian processes - covariance function estimation and impact of spatial sampling François Bachoc former PhD advisor: Josselin Garnier former CEA advisor: Jean-Marc Martinez Department

More information

Optimization Problems with Probabilistic Constraints

Optimization Problems with Probabilistic Constraints Optimization Problems with Probabilistic Constraints R. Henrion Weierstrass Institute Berlin 10 th International Conference on Stochastic Programming University of Arizona, Tucson Recommended Reading A.

More information

Approximate Bayesian computation for spatial extremes via open-faced sandwich adjustment

Approximate Bayesian computation for spatial extremes via open-faced sandwich adjustment Approximate Bayesian computation for spatial extremes via open-faced sandwich adjustment Ben Shaby SAMSI August 3, 2010 Ben Shaby (SAMSI) OFS adjustment August 3, 2010 1 / 29 Outline 1 Introduction 2 Spatial

More information

Introduction to Machine Learning

Introduction to Machine Learning Introduction to Machine Learning Introduction to Probabilistic Methods Varun Chandola Computer Science & Engineering State University of New York at Buffalo Buffalo, NY, USA chandola@buffalo.edu Chandola@UB

More information

DSGE-Models. Limited Information Estimation General Method of Moments and Indirect Inference

DSGE-Models. Limited Information Estimation General Method of Moments and Indirect Inference DSGE-Models General Method of Moments and Indirect Inference Dr. Andrea Beccarini Willi Mutschler, M.Sc. Institute of Econometrics and Economic Statistics University of Münster willi.mutschler@uni-muenster.de

More information

Additive Isotonic Regression

Additive Isotonic Regression Additive Isotonic Regression Enno Mammen and Kyusang Yu 11. July 2006 INTRODUCTION: We have i.i.d. random vectors (Y 1, X 1 ),..., (Y n, X n ) with X i = (X1 i,..., X d i ) and we consider the additive

More information

Algorithms for Uncertainty Quantification

Algorithms for Uncertainty Quantification Algorithms for Uncertainty Quantification Lecture 9: Sensitivity Analysis ST 2018 Tobias Neckel Scientific Computing in Computer Science TUM Repetition of Previous Lecture Sparse grids in Uncertainty Quantification

More information

ECON 3150/4150, Spring term Lecture 6

ECON 3150/4150, Spring term Lecture 6 ECON 3150/4150, Spring term 2013. Lecture 6 Review of theoretical statistics for econometric modelling (II) Ragnar Nymoen University of Oslo 31 January 2013 1 / 25 References to Lecture 3 and 6 Lecture

More information

Proceedings of the 2016 Winter Simulation Conference T. M. K. Roeder, P. I. Frazier, R. Szechtman, E. Zhou, T. Huschka, and S. E. Chick, eds.

Proceedings of the 2016 Winter Simulation Conference T. M. K. Roeder, P. I. Frazier, R. Szechtman, E. Zhou, T. Huschka, and S. E. Chick, eds. Proceedings of the 2016 Winter Simulation Conference T. M. K. Roeder, P. I. Frazier, R. Szechtman, E. Zhou, T. Huschka, and S. E. Chick, eds. OPTIMAL COMPUTING BUDGET ALLOCATION WITH INPUT UNCERTAINTY

More information

Robust Actuarial Risk Analysis

Robust Actuarial Risk Analysis Robust Actuarial Risk Analysis Jose Blanchet Henry Lam Qihe Tang Zhongyi Yuan Abstract This paper investigates techniques for the assessment of model error in the context of insurance risk analysis. The

More information

STATS 200: Introduction to Statistical Inference. Lecture 29: Course review

STATS 200: Introduction to Statistical Inference. Lecture 29: Course review STATS 200: Introduction to Statistical Inference Lecture 29: Course review Course review We started in Lecture 1 with a fundamental assumption: Data is a realization of a random process. The goal throughout

More information

ICML Scalable Bayesian Inference on Point processes. with Gaussian Processes. Yves-Laurent Kom Samo & Stephen Roberts

ICML Scalable Bayesian Inference on Point processes. with Gaussian Processes. Yves-Laurent Kom Samo & Stephen Roberts ICML 2015 Scalable Nonparametric Bayesian Inference on Point Processes with Gaussian Processes Machine Learning Research Group and Oxford-Man Institute University of Oxford July 8, 2015 Point Processes

More information

Robust Stochastic Lot-Sizing by Means of Histograms

Robust Stochastic Lot-Sizing by Means of Histograms Robust Stochastic Lot-Sizing by Means of Histograms The MIT Faculty has made this article openly available. Please share how this access benefits you. Your story matters. Citation As Published Publisher

More information

LIGHT ROBUSTNESS. Matteo Fischetti, Michele Monaci. DEI, University of Padova. 1st ARRIVAL Annual Workshop and Review Meeting, Utrecht, April 19, 2007

LIGHT ROBUSTNESS. Matteo Fischetti, Michele Monaci. DEI, University of Padova. 1st ARRIVAL Annual Workshop and Review Meeting, Utrecht, April 19, 2007 LIGHT ROBUSTNESS Matteo Fischetti, Michele Monaci DEI, University of Padova 1st ARRIVAL Annual Workshop and Review Meeting, Utrecht, April 19, 2007 Work supported by the Future and Emerging Technologies

More information

Time inconsistency of optimal policies of distributionally robust inventory models

Time inconsistency of optimal policies of distributionally robust inventory models Time inconsistency of optimal policies of distributionally robust inventory models Alexander Shapiro Linwei Xin Abstract In this paper, we investigate optimal policies of distributionally robust (risk

More information

Scenario-Free Stochastic Programming

Scenario-Free Stochastic Programming Scenario-Free Stochastic Programming Wolfram Wiesemann, Angelos Georghiou, and Daniel Kuhn Department of Computing Imperial College London London SW7 2AZ, United Kingdom December 17, 2010 Outline 1 Deterministic

More information

Lecture 1. Stochastic Optimization: Introduction. January 8, 2018

Lecture 1. Stochastic Optimization: Introduction. January 8, 2018 Lecture 1 Stochastic Optimization: Introduction January 8, 2018 Optimization Concerned with mininmization/maximization of mathematical functions Often subject to constraints Euler (1707-1783): Nothing

More information

Introduction to Empirical Processes and Semiparametric Inference Lecture 25: Semiparametric Models

Introduction to Empirical Processes and Semiparametric Inference Lecture 25: Semiparametric Models Introduction to Empirical Processes and Semiparametric Inference Lecture 25: Semiparametric Models Michael R. Kosorok, Ph.D. Professor and Chair of Biostatistics Professor of Statistics and Operations

More information

Information Geometry

Information Geometry 2015 Workshop on High-Dimensional Statistical Analysis Dec.11 (Friday) ~15 (Tuesday) Humanities and Social Sciences Center, Academia Sinica, Taiwan Information Geometry and Spontaneous Data Learning Shinto

More information

Ambiguous Joint Chance Constraints under Mean and Dispersion Information

Ambiguous Joint Chance Constraints under Mean and Dispersion Information Ambiguous Joint Chance Constraints under Mean and Dispersion Information Grani A. Hanasusanto 1, Vladimir Roitch 2, Daniel Kuhn 3, and Wolfram Wiesemann 4 1 Graduate Program in Operations Research and

More information

is a Borel subset of S Θ for each c R (Bertsekas and Shreve, 1978, Proposition 7.36) This always holds in practical applications.

is a Borel subset of S Θ for each c R (Bertsekas and Shreve, 1978, Proposition 7.36) This always holds in practical applications. Stat 811 Lecture Notes The Wald Consistency Theorem Charles J. Geyer April 9, 01 1 Analyticity Assumptions Let { f θ : θ Θ } be a family of subprobability densities 1 with respect to a measure µ on a measurable

More information

Surrogate loss functions, divergences and decentralized detection

Surrogate loss functions, divergences and decentralized detection Surrogate loss functions, divergences and decentralized detection XuanLong Nguyen Department of Electrical Engineering and Computer Science U.C. Berkeley Advisors: Michael Jordan & Martin Wainwright 1

More information

Chapter 2. Binary and M-ary Hypothesis Testing 2.1 Introduction (Levy 2.1)

Chapter 2. Binary and M-ary Hypothesis Testing 2.1 Introduction (Levy 2.1) Chapter 2. Binary and M-ary Hypothesis Testing 2.1 Introduction (Levy 2.1) Detection problems can usually be casted as binary or M-ary hypothesis testing problems. Applications: This chapter: Simple hypothesis

More information

Computational statistics

Computational statistics Computational statistics Markov Chain Monte Carlo methods Thierry Denœux March 2017 Thierry Denœux Computational statistics March 2017 1 / 71 Contents of this chapter When a target density f can be evaluated

More information

Approximate Bayesian Computation and Particle Filters

Approximate Bayesian Computation and Particle Filters Approximate Bayesian Computation and Particle Filters Dennis Prangle Reading University 5th February 2014 Introduction Talk is mostly a literature review A few comments on my own ongoing research See Jasra

More information

Does Better Inference mean Better Learning?

Does Better Inference mean Better Learning? Does Better Inference mean Better Learning? Andrew E. Gelfand, Rina Dechter & Alexander Ihler Department of Computer Science University of California, Irvine {agelfand,dechter,ihler}@ics.uci.edu Abstract

More information