Approximate Dynamic Programming: Solving the curses of dimensionality

Size: px
Start display at page:

Download "Approximate Dynamic Programming: Solving the curses of dimensionality"

Transcription

1 Approximate Dynamic Programming: Solving the curses of dimensionality Informs Computing Society Tutorial October, 2008 Warren Powell CASTLE Laboratory Princeton University Warren B. Powell, Princeton University 2008 Warren B. Powell Slide 1

2 2008 Warren B. Powell Slide 2

3 2008 Warren B. Powell Slide 3

4 The fractional jet ownership industry 2008 Warren B. Powell Slide 4

5 NetJets Inc Warren B. Powell Slide 5

6 Schneider National 2008 Warren B. Powell Slide 6

7 Schneider National 2008 Warren B. Powell Slide 7

8 Planning for a risky world Weather Robust design of emergency response networks. Design of financial instruments to hedge against weather emergencies to protect individuals, companies and municipalities. Design of sensor networks and communication systems to manage responses to major weather events. Disease Models of disease propagation for response planning. Management of medical personnel, equipment and vaccines to respond to a disease outbreak. Robust design of supply chains to mitigate the disruption of transportation systems Warren B. Powell Slide 8

9 Energy management Energy resource allocation What is the right mix of energy technologies? How should the use of different energy resources be coordinated over space and time? What should my energy R&D portfolio look like? Should I invest in nuclear energy? What is the impact of a carbon tax? Energy markets How should I hedge energy commodities? How do I price energy assets? What is the right price for energy futures? 2008 Warren B. Powell Slide 9

10 High value spare parts Electric Power Grid PJM oversees an aging investment in highvoltage transformers. Replacement strategy needs to anticipate a bulge in retirements and failures 1-2 year lag times on orders. Typical cost of a replacement ~ $5 million. Failures vary widely in terms of economic impact on the network. Spare parts for business jets ADP is used to determine purchasing and allocation strategies for over 400 high value spare parts. Inventory strategy has to determine what to buy, when and where to store it. Many parts are very low volume (e.g. 7 spares spread across 15 service centers). Inventories have to meet global targets on level of service and inventory costs Warren B. Powell Slide 10

11 Challenges Real-time control» Scheduling aircraft, pilots, generators, tankers» Pricing stocks, options» Electricity resource allocation Near-term tactical planning» Can I accept a customer request?» Should I lease equipment?» How much energy can I commit with my windmills? Strategic planning» What is the right equipment mix?» What is the value of this contract?» What is the value of more reliable aircraft? 2008 Warren B. Powell Slide 11

12 Outline A resource allocation model An introduction to ADP ADP and the post-decision state variable A blood management example Hierarchical aggregation Stepsizes How well does it work? Some applications» Transportation» Energy

13 Outline A resource allocation model An introduction to ADP ADP and the post-decision state variable A blood management example Hierarchical aggregation Stepsizes How well does it work? Some applications» Transportation» Energy

14 A resource allocation model Attribute vectors: a = Asset class Time invested Type Location Age Location ETA Home Experience Driving hours Location ETA A/C type Fuel level Home shop Crew Eqpt1 Eqpt Warren B. Powell Slide 14

15 A resource allocation model Modeling resources:» The attributes of a single resource: a = The attributes of a single resource a A The attribute space» The resource state vector: R» The information process: ˆ ta R ta = The number of resources with attribute ( ) Rt = Rta a A The resource state vector = The change in the number of resources with attribute a. a 2008 Warren B. Powell Slide 15

16 A resource allocation model Modeling demands:» The attributes of a single demand: b = The attributes of a demand to be served. b B The attribute space» The demand state vector: D = The number of demands with attribute Dt = Dtb The demand state vector b B» The information process: ˆ tb D tb ( ) = The change in the number of demands with attribute b. b 2008 Warren B. Powell Slide 16

17 Energy resource modeling The system state: S ( R D ρ ) =,, = System state, where: t t t t R t D t = Resource state (how much capacity, reserves) = Market demands ρ = "system parameters" t State of the technology (costs, performance) Climate, weather (temperature, rainfall, wind) Government policies (tax rebates on solar panels) Market prices (oil, coal) 2008 Warren B. Powell Slide 17

18 Energy resource modeling The decision variable: x t New capacity Retired capacity for each: = Type Location Technology 2008 Warren B. Powell Slide 18

19 Energy resource modeling Exogenous information: W = ( Rˆ Dˆ ˆ ρ ) New information =,, t t t t Rˆ t = Exogenous changes in capacity, reserves Dˆ t = New demands for energy from each source ˆ ρ = Exogenous changes in parameters. t 2008 Warren B. Powell Slide 19

20 Energy resource modeling The transition function = M t+ 1 t t t+ 1 S S ( S, x, W ) 2008 Warren B. Powell Slide 20

21 A resource allocation model Resources Demands 2008 Warren B. Powell Slide 21

22 A resource allocation model t t+1 t Warren B. Powell Slide 22

23 A resource allocation model Optimizing over time t t+1 t+2 Optimizing at a point in time 2008 Warren B. Powell Slide 23

24 Outline A resource allocation model An introduction to ADP ADP and the post-decision state variable A blood management example Hierarchical aggregation Stepsizes How well does it work? Some applications» Transportation» Energy

25 Introduction to ADP We just solved Bellman s equation: V ( S ) = max C ( S, x ) + E V ( S ) S { } t t t t t t+ 1 t+ 1 t x X» We found the value of being in each state by stepping backward through the tree Warren B. Powell Slide 25

26 Introduction to ADP The challenge of dynamic programming: ( { }) V ( S ) = max C ( S, x ) + E V ( S ) S t t t t t t+ 1 t+ 1 t x X Problem: Curse of dimensionality 2008 Warren B. Powell Slide 26

27 Introduction to ADP The challenge of dynamic programming: ( { }) V ( S ) = max C ( S, x ) + E V ( S ) S t t t t t t+ 1 t+ 1 t x X Three curses Problem: Curse of dimensionality State space Outcome space Action space (feasible region) 2008 Warren B. Powell Slide 27

28 Introduction to ADP The computational challenge: ( { }) V ( S ) = max C ( S, x ) + E V ( S ) S t t t t t t+ 1 t+ 1 t x X How do we find V ( S )? t+ 1 t+ 1 How do we compute the expectation? How do we find the optimal solution? 2008 Warren B. Powell Slide 28

29 Introduction to ADP Classical ADP» Most applications of ADP focus on the challenge of handling multidimensional state variables» Start with ( { }) V ( S ) = max C ( S, x ) + E V ( S ) S t t t t t t+ 1 t+ 1 t x X» Now replace the value function with some sort of approximation V ( S ) V ( S ) t t t t

30 Introduction to ADP Approximating the value function:» We have to exploit the structure of the value function (e.g. concavity).» We might approximate the value function using a simple polynomial V ( S θ) = θ + θ S + θ S 2 t t 0 1 t 2 t».. or a complicated one: V ( S θ) = θ + θ S + θ S + θ ln S + θ sin( S )» Sometimes, they get really messy: ( ) 2 t t 0 1 t 2 t 3 t 4 t

31 t+ 2 t+ 3 (0) (1) (1) t( θ) = θ + θtst, ' tst, ' + θtwt, ' twt, ' s t' = t w t' = t V R R R + θ R + (2) 2 (2) 2 twt, ' twt, ' tst, ' tst, ' w t' s t' + θ R s 1 (3) ts t, st t, s' t S s' t+ 1 t+ 1 (4) θts Rt, st ' Rt, s' t ' s t' = t 2S s' t' = t R 1 θ 2 R t+ 2 t+ 2 (5) θts Rt, st ' Rt, s' t ' s t' = t 3S s' t' = t w s θ R R ( ws,1) tws, twt, tst, t+ 2 ( ws,2) + θtws, Rtwt, Rtst, ' w s t' = t t+ 3 t+ 2 ( ws,3) θtws, Rtwt, ' Rtst, ' w s t' = t t' = t t+ 2 t+ 2 ( ws,4) θtws, Rtwt, ' Rtst, ' w s t ' = t t' = t w s θ 1 R R R ( ws,5) tsw, tst,, + 2 twt, twt,, ,2,3 4,5,6,

32 Introduction to ADP We can write a model of the observed value of being in a state as: vˆ = θ + θ S + θ S + θ ln S + θ sin( S ) + ε ( ) t 2 t 3 t 4 t This is often written as a generic regression model: Y = θ + θ X + θ X + θ X + θ X The ADP community refers to the independent variables as basis functions: Y = θ ϕ ( S) + θϕ ( S) + θ ϕ ( S) + θ ϕ ( S) + θ ϕ ( S) = θϕ ( S) f F f f ϕ ( R) are also known as features. f

33 Introduction to ADP Methods for estimating»» θ 1 2 N ˆ ˆ ˆ Generate observations v, v,..., v, and use traditional regression methods to fit θ. n Stochastic gradient for updating θ : ( ˆ ) θ = θ α V ( S θ ) v V ( S θ ) n n 1 n 1 n n 1 n n 1 n n 1 n 1 ( V ( S ) vˆ ) = θ α θ n 1 n 1 n n 1 n n 1 Error ϕ1( S) ϕ2( S) ϕf ( S) Basis functions

34 Introduction to ADP Methods for estimating θ» Recursive statistics Iterative equations avoid matrix inverse: n n 1 n n n n θ = θ H x ε ε = Error n 1 n 1 T H = B B is F F matrix= XX n γ ( ( ) ) n n 1 1 n 1 n n T n 1 B = B B x x B n γ γ n n T n 1 n = 1+ ( ) x B x 1

35 Introduction to ADP Other statistical methods» Regression trees Combines regression with techniques for discrete variables.» Data mining Good for categorical data» Neural networks Engineers like this for low-dimensional continuous problems

36 Introduction to ADP Notes:» When approximating value functions, we are basically drawing on the entire field of statistics.» Choosing an approximation is primarily an art.» In special cases, the resulting algorithm can produce optimal solutions.» Most of the time, we are hoping for good solutions.» In some cases, it can work terribly.» As a general rule you have to use problem structure. Value function approximations have to capture the right structure. Blind use of polynomials will rarely be successful.

37 Introduction to ADP What you will struggle with:» Stepsizes Can t live with em, can t live without em. Too small, you think you have converged but you have really just stalled ( apparent convergence ) Too large, and the system is unstable.» Stability There are two sources of randomness: The traditional exogenous randomness An evolving policy» Exploration vs. exploitation You sometimes have to choose to visit a state just to collect information about the value of being in a state.

38 Introduction to ADP But we are not out of the woods» Assume we have an approximate value function.» We still have to solve a problem that looks like Vt( St) = max Ct( St, xt) + E θφ f f ( St+ 1) x X f F» This means we still have to deal with a maximization problem (might be a linear, nonlinear or integer program) with an expectation.

39 Outline A resource allocation model An introduction to ADP ADP and the post-decision state variable A blood management example Hierarchical aggregation Stepsizes How well does it work? Some applications» Transportation» Energy

40 Action State Use weather report Do not use weather report Information Forecast rain.1 Forecast cloudy.3 Forecast sunny.6 Schedule game Cancel game State Action Schedule game Cancel game Schedule game Cancel game Schedule game Cancel game Rain.2 -$2000 Clouds.3 $1000 Sun.5 $5000 Rain.2 -$200 Clouds.3 -$200 Sun.5 -$200 Information Rain.8 -$2000 Clouds.2 $1000 Sun.0 $5000 Rain.8 -$200 Clouds.2 -$200 Sun.0 -$200 Rain.1 -$2000 Clouds.5 $1000 Sun.4 $5000 Rain.1 -$200 Clouds.5 -$200 Sun.4 -$200 Rain.1 -$2000 Clouds.2 $1000 Sun.7 $5000 Rain.1 -$200 Clouds.2 -$200 Sun.7 -$200 - Decision nodes - Outcome nodes

41 Schedule game Cancel game Schedule game Cancel game Schedule game Cancel game $2400 -$200 -$1400 -$200 $2300 -$200 $3500 -$200 Forecast rain.1 Forecast cloudy.3 Forecast sunny.6 Schedule game Cancel game Use weather report Do not use weather report

42 -$200 $2300 $3500 $2400 -$200 Forecast rain.1 Forecast cloudy.3 Forecast sunny.6 Schedule game Cancel game Use weather report Do not use weather report

43 $2770 $2400 Use weather report Do not use weather report

44 The post-decision state 2008 Warren B. Powell Slide 44

45 The post-decision state Managing blood inventories over time Week 0 Week 1 Week 2 Week 3 S 0 x 0 Rˆ, Dˆ 1 1 S 1 x 1 Rˆ, Dˆ 2 2 S 2 x 2 S x 2 Rˆ, Dˆ 3 3 x S S 3 3 x 3 t=0 t=1 t=2 t= Warren B. Powell Slide 45

46 The post-decision state What happens if use Bellman s equation?» State variable is: The supply of each type of blood 8 blood types 6 ages The demand for each type of blood 8 blood types 56 dimensional state vector» Decision variable is how much of 8 blood types to supply to 8 demand types dimensional decision vector» Random information Blood donations by week (8 types) New demands for blood (8 types) 16-dimensional information vector 2008 Warren B. Powell Slide 46

47 Rain.8 -$2000 Clouds.2 $1000 Sun.0 $5000 Rain.8 -$200 Clouds.2 -$200 Sun.0 -$200 Rain.1 -$2000 Clouds.5 $1000 Sun.4 $5000 Rain.1 -$200 Clouds.5 -$200 Sun.4 -$200 Rain.1 -$2000 Clouds.2 $1000 Sun.7 $5000 Rain.1 -$200 Clouds.2 -$200 S n 7 $200 Schedule game S t S t + 1 Cancel game Forecast rain.1 Schedule game Forecast cloudy.3 Cancel game Forecast sunny.6 Use weather report Schedule game Cancel game Rain 2 -$2000 Do not weath ( { } ) V ( S ) = max C ( S, x ) + E V ( S ) S t t t t t t+ 1 t+ 1 t x X

48 Rain.8 -$2000 Clouds.2 $1000 Sun.0 $5000 Rain.8 -$200 Clouds.2 -$200 Sun.0 -$200 Rain.1 -$2000 Clouds.5 $1000 Sun.4 $5000 Rain.1 -$200 Clouds.5 -$200 Sun.4 -$200 Rain.1 -$2000 Clouds.2 $1000 Sun.7 $5000 Rain.1 -$200 Clouds.2 -$200 Sun.7 -$200 Schedule game Cancel game Schedule game Cancel game Schedule game Cancel game Rain.2 -$2000 Clouds.3 $1000 Sun.5 $5000 Rain.2 -$200 Clouds.3 -$200 Sun.5 -$200 - Decision nodes - Outcome nodes Forecast rain.1 Forecast cloudy.3 Forecast sunny.6 Schedule game Cancel game Use weather report Do not use weather report

49 The post-decision state New concept:» The pre-decision state variable: St = The information required to make a decision x t Same as a decision node in a decision tree.» The post-decision state variable: x S t = The state of what we know immediately after we make a decision. Same as an outcome node in a decision tree Warren B. Powell Slide 49

50 The post-decision state An inventory problem:» Our basic inventory equation: R t+1 =max{0,r t + x t ˆD t+1 } where R t = Inventory at time t. x t = Amount we ordered. ˆD t+1 = Demand in next time period.» Using pre- and post-decision states: Rt x = R t + x t Post-decision state R t+1 = max{0,rt x ˆD t+1 } Pre-decision state 2008 Warren B. Powell Slide 50

51 The post-decision state Pre-decision, state-action, and post-decision Pre-decision state State Action Post-decision state 9 3 states state-action pairs 9 3 states 2008 Warren B. Powell Slide 51

52 The post-decision state Pre-decision: resources and demands S = ( R, D ) t t t 2008 Warren B. Powell Slide 52

53 The post-decision state S = S ( S, x ) x M, x t t t 2008 Warren B. Powell Slide 53

54 The post-decision state x S t S = S ( S, W ) MW, x t+ 1 t t+ 1 W = ( Rˆ, Dˆ ) t+ 1 t+ 1 t Warren B. Powell Slide 54

55 The post-decision state S t Warren B. Powell Slide 55

56 The post-decision state Classical form of Bellman s equation: { } ( ) V ( S ) = max C ( S, x ) + E V ( S ) S t t t t t t+ 1 t+ 1 t x X Bellman s equations around pre- and post-decision states:» Optimization problem (making the decision): ( x( M, x )) V ( S ) = max C ( S, x ) + V S ( S, x ) t t x t t t t t t t Note: this problem is deterministic!» Simulation problem (the effect of exogenous information): {, } V ( S ) = E V ( S ( S, W )) S x x M W x x t t t t t t 2008 Warren B. Powell Slide 56

57 The post-decision state Challenges» For most practical problems, we are not going to be x x able to compute V ( S ). t ( x x ) V ( S ) = max C ( S, x ) + V ( S ) t t x t t t t t» Concept: replace it with an approximation V ( x t St ) and solve» So now we face: What should the approximation look like? How do we estimate it? t ( x ) V ( S ) = max C ( S, x ) + V ( S ) t t x t t t t t 2008 Warren B. Powell Slide 57

58 The post-decision state Value function approximations:» Linear (in the resource state): x V ( R ) = v R x t t ta ta a A» Piecewise linear, separable: x x Vt ( Rt ) = Vta ( Rta ) a A» Indexed PWL separable: ( x ) x Vt( Rt ) = Vta Rta ( featurest) a A 2008 Warren B. Powell Slide 58

59 The post-decision state Value function approximations:» Ridge regression (Klabjan and Adelman)» Benders cuts ( ) x V ( R ) = V R R = θ R t t tf tf tf fa ta f F a A f V ( R ) t t x 1 x Warren B. Powell Slide 59

60 The post-decision state Comparison to other methods:» Classical MDP (value iteration) ( n 1 γ ) t+ 1 n V ( S) = max C( S, x) + EV ( S ) x» Classical ADP (pre-decision state): n n n n 1 vˆ t = max x Ct( St, xt) + p( s' St, xt) Vt+ 1 s' s' V ( S ) = (1 α ) V ( S ) + α vˆ n n n 1 n n t t n 1 t t n 1 t, 1» Our method (update V x n around post-decision state): (, 1, ) n n x n M x n vˆ = max C ( S, x ) + V ( S ( S, x )) t x t t t t t t V ( S ) = (1 α ) V ( S ) + α vˆ n x, n n 1 x, n n t 1 t 1 n 1 t 1 t 1 n 1 t t ( ) Expectation vˆ updates V ( S ) t t t vˆ updates V ( S ) x t t 1 t Warren B. Powell Slide 60

61 The post-decision state n Step 1: Start with a pre-decision state S t Step 2: Solve the deterministic optimization using an approximate value function: ( 1,, ))) n n n M x n vˆ = max C ( S, x ) + V ( S ( S x t x t t t t t t n x t to obtain. Step 3: Update the value function approximation n x, n n 1 x, n n t 1( t 1) = (1 αn 1) t 1 ( t 1) + αn 1ˆ t V S V S v n Step 4: Obtain Monte Carlo sample of Wt ( ω ) and compute the next pre-decision state: n M n n n St+ 1 = S ( St, xt, Wt+ 1( ω )) Step 5: Return to step 1. Deterministic optimization Recursive statistics Simulation 2008 Warren B. Powell Slide 61

62 Approximate dynamic programming Optimization Optimization Strengths Produces optimal decisions. Mature technology Weaknesses Cannot handle uncertainty. Cannot handle high levels of complexity Simulation Simulation Strengths Extremely flexible High level of detail Easily handles uncertainty Weaknesses Models decisions using user-specified rules. Low solution quality. Cplex Approximate dynamic programming combines simulation and optimization in a rigorous yet flexible framework.

63 Outline A resource allocation model An introduction to ADP ADP and the post-decision state variable A blood management example Hierarchical aggregation Stepsizes How well does it work? Some applications» Transportation» Energy

64 Blood management Managing blood inventories 2008 Warren B. Powell Slide 64

65 Blood management Managing blood inventories over time Week 0 Week 1 Week 2 Week 3 S 0 x 0 S x 0 Rˆ, Dˆ 1 1 S 1 x 1 S x 1 Rˆ, Dˆ 2 2 S 2 x 2 S x 2 Rˆ, Dˆ 3 3 x S S 3 3 x 3 t=0 t=1 t=2 t= Warren B. Powell Slide 65

66 S t = ( R, ˆ ) t Dt AB+ D ˆt, AB+ x R t AB+,0 R t,( AB+,0) AB+,0 AB- D ˆt, AB AB+,1 R t,( AB+,1) R t,( AB+,2) AB+,1 AB+,2 A+ A- D ˆt, A+ D ˆt, AB+ AB+,2 AB+,3 R R R t,( O,0) t,( O,1) t,( O,2) O-,0 O-,1 O-,2 B+ B- D ˆt, AB+ D ˆt, AB+ D ˆt, AB+ O-,0 O-,1 O-,2 O+ O-,3 O- D ˆt, AB+ Satisfy a demand Hold

67 R t x R t R t + 1 R t,( AB+,0) AB+,0 AB+,0 R ˆt+ 1, AB+ AB+,0 R t,( AB+,1) AB+,1 AB+,1 AB+,1 R t,( AB+,2) AB+,2 AB+,2 AB+,2 AB+,3 AB+,3 R t,( O,0) O-,0 R t,( O,1) O-,1 O-,0 R ˆt+ 1, O O-,0 R t,( O,2) O-,2 O-,1 O-,1 O-,2 O-,2 O-,3 O-,3 Dˆt

68 R t x R t R t,( AB+,0) R t,( AB+,1) R t,( AB+,2) R R R t,( O,0) t,( O,1) t,( O,2) AB+,0 AB+,1 AB+,2 O-,0 O-,1 O-,2 AB+,0 AB+,1 AB+,2 AB+,3 O-,0 O-,1 O-,2 O-,3 Dˆt

69 R t x R t ( ) F R t R t,( AB+,0) R t,( AB+,1) R t,( AB+,2) R R R t,( O,0) t,( O,1) t,( O,2) AB+,0 AB+,1 AB+,2 O-,0 O-,1 O-,2 AB+,0 AB+,1 AB+,2 AB+,3 O-,0 O-,1 O-,2 Solve this as a linear program. O-,3 Dˆt

70 Duals R t x R t ( ) F R t ˆt ν,( AB +,0) AB+,0 AB+,0 ˆt ν,( AB +,1) AB+,1 AB+,1 ˆt ν,( AB +,2) AB+,2 AB+,2 AB+,3 ν ˆt,( AB,0) O-,0 ν ˆt,( AB,1) O-,1 O-,0 ν ˆt,( AB,2) O-,2 O-,1 O-,2 Dual variables give value additional unit of blood.. O-,3 Dˆt

71 Updating the value function approximation Estimate the gradient at n R t ˆn ν t,( AB+,2) ( ) F R t n R t,( AB+,2) 2008 Warren B. Powell Slide 71

72 Updating the value function approximation Update the value function at xn, Rt 1 V ( R ) n 1 x t 1 t 1 ˆn ν t,( AB+,2) ( ) F R t xn, Rt 1 n R t,( AB+,2) 2008 Warren B. Powell Slide 72

73 Updating the value function approximation Update the value function at xn, Rt 1 V ( R ) n 1 x t 1 t 1 ˆn ν t,( AB+,2) xn, Rt Warren B. Powell Slide 73

74 Updating the value function approximation Update the value function at xn, Rt 1 V ( R ) n 1 x t 1 t 1 V ( R ) n x t 1 t 1 xn, Rt Warren B. Powell Slide 74

75 Iterative learning t 2008 Warren B. Powell Slide 75

76 Iterative learning 2008 Warren B. Powell Slide 76

77 Iterative learning 2008 Warren B. Powell Slide 77

78 Iterative learning 2008 Warren B. Powell Slide 78

79 Approximate dynamic programming With luck, the objective function will improve steadily Objective function Iteration

80 Approximate dynamic programming but performance can be jagged. Chart Title Objective function Iterations

81 Outline A resource allocation model An introduction to ADP ADP and the post-decision state variable A blood management example Hierarchical aggregation Stepsizes How well does it work? Some applications» Transportation» Energy

82 Hierarchical aggregation Attribute vectors: a = Asset class Time invested Blood type Age Frozen? Location ETA Bus. segment Single/team Domicile Drive hours Days from home Location ETA A/C type Fuel level Home shop Crew Eqpt1 Eqpt100

83 Aggregation Estimating value functions $2050 = (1 0.10) $ (0.10) $2500 Location Location Location Fleet v Fleet α v Fleet αvˆ Domicile = + Domicile Domicile DOThrs DaysFromHome n n 1 ( ) (1 ) ( ) ( ) Value function Approximation may have fewer attributes than driver. Drivers may have very detailed attributes

84 Hierarchical aggregation Estimating value functions» Most disaggregate level Location Location Location Fleet v Fleet α v Fleet αvˆ Domicile = + Domicile Domicile DOThrs DaysFromHome n n 1 ( ) (1 ) ( ) ( )

85 Hierarchical aggregation Estimating value functions» Middle level of aggregation Location Fleet Location Location v α v αvˆ Domicile Fleet = + Fleet DOThrs DaysFromHome n n 1 ( ) (1 ) ( ) ( )

86 Hierarchical aggregation Estimating value functions» Most aggregate level Location Fleet n n 1 v ([ Location] ) = (1 α) v ([ Location] ) + αvˆ ( Domicile ) DOThrs DaysFromHome

87 Hierarchical aggregation State-dependent weighted aggregation:» Now we have a linear regression for each attribute v = w v ( g) ( g) a a a g» We cannot solve hundreds of thousands of linear regressions. Instead use ( ( ) ( ) 2 ) β 1 w Var v + w = 1 ( g) ( g) ( g) ( g) a a a a g Estimate of variance Estimate of bias

88 Hierarchical aggregation Objective function Aggregate Disaggregate Iterations

89 Hierarchical aggregation Weighted Combination Objective function Aggregate Disaggregate Iterations

90 Outline A resource allocation model An introduction to ADP ADP and the post-decision state variable A blood management example Hierarchical aggregation Stepsizes How well does it work? Some applications» Transportation» Energy

91 Stepsizes Stepsizes:» Fundamental to ADP is an updating equation that looks like: n x n 1 1( 1) (1 1) x n t t = n t 1 ( t 1) + n 1ˆ t V S α V S α v Updated estimate Old estimate New observation The stepsize Learning rate Smoothing factor

92 Stepsizes High noise data: Best stepsize α = 1/ n n Low noise, changing signal αn

93 Stepsizes Theorem 1 (general knowledge):» If data is stationary, then 1/n is the best possible stepsize. Theorem 2 (e.g. Tsitsiklis 1994)» For general problems, 1/n is provably convergent Theorem 3 (Frazier and Powell):» 1/n works so badly it should (almost) never be used.

94 Stepsizes Lower bound on the value function after n iterations:

95 Stepsizes Bias-adjusted Kalman filter Estimate of the variance α n where: = 1 ( n 1) 2 1 ( n + λ σ + β ) 2 2 n λ = 1 α λ + α ( ) n 1 ( ) n 2 σ 2 As increases, stepsize decreases toward 1/ σ Estimate of the bias n As β increases, stepsize increases toward 1. n 2 Noise n Bias

96 Stepsizes The bias-adjusted Kalman filter Observed values BAKF stepsize rule /n stepsize rule

97 Stepsizes The bias-adjusted Kalman filter Observed values BAKF stepsize rule

98 Stepsizes BAKF formula Our newest formula» where α n = α n = ( ( ) ) n 1 2 n 2 λ σ + β ( ( ) 2 ) n n σ λ σ β ( 1 2 ( ( ) ) ) n n λ σ + γ δ c ( 2 c ) ( n ) n 1 ( n + γ λ σ + λ σ + 1 ( 1 γ) δ ) » First stepsize formula designed specifically for dynamic n programming (note: no need to estimate bias β )» Captures the natural growth of a value function through iterative learning.

99 Stepsizes I recommend:» Start with a constant stepsize Vary it, get a sense of what works best» Next try a deterministic stepsize such as a/(a+n) Choose a so that it declines to roughly the best deterministic stepsize at an iteration comparable to where your constant stepsize seems to stabilize Might be 50 iterations Might be 5000» Try an (optimal) adaptive stepsize rule Can work very well if there is not too much noise Adaptive rules work well when there is a need to keep the stepsize from declining too quickly (but you do not know how quickly)

100 Outline A resource allocation model An introduction to ADP ADP and the post-decision state variable A blood management example Hierarchical aggregation Stepsizes How well does it work? Some applications» Transportation» Energy

101 Two-stage optimization Piecewise linear, separable value function approximations: Piecewise linear, separable: Vt( Rt) = Vtl( Rtl) l L

102 Two-stage optimization Benders decomposition: VR ( ) t t x 1 x 0 Multidimensional cuts produce provably convergent, nonseparable value function approximation.

103 The competition Exact solutions using Benders: L-Shaped decomposition (Van Slyke and Wets) V 0 x 0 Stochastic decomposition (Higle and Sen) V 0 x 0 CUPPS (Chen and Powell) V 0 x 0

104 The competition 45 Percent over Percent optimal from optimal after iterations iterations Increasing problem size Percent error locations 25 locations 50 locations 100 locations SD L-shaped CUPPS SPAR Benders Separable

105 The competition Percent over Percent optimal from optimal after iterations iterations Increasing problem size 35 Percent error locations 25 locations 50 locations 100 locations SD L-shaped CUPPS SPAR Benders Separable

106 Multistage problems Deterministic, single commodity flow

107 Multistage problems Deterministic, single commodity flow» ADP solution as a percent of the optimal solution (from Cplex)

108 Multistage problems Stochastic, single commodity flow = optimal posterior bound 0 Rolling horizon ADP Percent of posterior optimal (20,100) (20,200) (20,400) (40,100) (40,200) (40,400) (80,100) (80,200) (80,400)

109 Multistage problems Deterministic, (integer) multicommodity flow

110 Multistage problems Deterministic, (integer) multicommodity flow = optimal continuous relaxation Percent of optimal Base T_30 T_90 I_10 I_40 C_II C_III C_IV R_1 R_5 R_100 R_400 C_1 C_8

111 Multistage problems Stochastic, (integer) multicommodity flow Rolling horizon ADP Percent of posterior optimal 0 Base I_10 I_40 C_II C_III C_IV R_1 R_5 R_100 R_400 C_1 C_8

112 Outline A resource allocation model An introduction to ADP ADP and the post-decision state variable A blood management example Hierarchical aggregation Stepsizes How well does it work? Some applications» Transportation» Energy

113 Schneider National 2008 Warren B. Powell Slide 113

114 2008 Warren B. Powell Slide 114

115 Schneider National 1400 Revenue per WU US_SOLO US_IC US_TEAM Capacity category Revenue per WU U tiliz atio n Historical maximum Simulation Historical minimum Calibrated Historical min model and max Utilization Historical maximum Simulation Historical minimum US_SOLO US_IC US_TEAM Capacity category 2008 Warren B. Powell Slide 115

116 Princeton team: Warren Powell Belgacem Bouzaiene-Ayari NS team: Clark Cheng Ricardo Fiorillo Junxia Chang Sourav Das

117 2008 Warren B. Powell Slide 117

118 2008 Warren B. Powell Slide 118

119 2008 Warren B. Powell Slide 119

120 2008 Warren B. Powell Slide 120

121 Convergence Train coverage

122 Model calibration Train delay curve October 2007 Delay hours/day Fleet size

123 Model calibration Train delay curve March 2008 Delay hours/day Fleet size

124 Outline A resource allocation model An introduction to ADP ADP and the post-decision state variable A blood management example Hierarchical aggregation Stepsizes How well does it work? Some applications» Transportation» Energy

125 Energy resource modeling 2008 Warren B. Powell Slide 125

126 Energy resource modeling Hourly model» Decisions at time t impact t+1 through the amount of water held in the reservoir. Hour t Hour t Warren B. Powell Slide 126

127 Energy resource modeling Hourly model» Decisions at time t impact t+1 through the amount of water held in the reservoir. Hour t Value of holding water in the reservoir for future time periods Warren B. Powell Slide 127

128 Energy resource modeling 2008 Warren B. Powell Slide 128

129 Energy resource modeling 2008 Hour Warren B. Powell Slide 129

130 Energy resource modeling 2008 Hour Warren B. Powell Slide 130

131 Annual energy model Optimal Reservoir level Precipitation HydroUsage Remaining Wastage Demand Coal Wind Demand Warren B. Powell Slide 131

132 Annual energy model Approximate dynamic programming Reservoir level Precipitation Usage Remaining Wastage Demand Coal Wind Demand Warren B. Powell Slide 132

133 Annual energy model Optimal Reservoir level Precipitation HydroUsage Remaining Wastage Demand Coal Wind Demand Warren B. Powell Slide 133

134 Annual energy model Approximate dynamic programming Reservoir level Precipitation Usage Remaining Wastage Demand Coal Wind Demand Warren B. Powell Slide 134

135 Annual energy model 9000 ADP vs optimal reservoir levels for stochastic rainfall Reservoir level ADP at last iteration Optimal for individual scenarios Time period 2008 Warren B. Powell Slide 135

136 2008 Warren B. Powell Slide 136

The Dynamic Energy Resource Model

The Dynamic Energy Resource Model The Dynamic Energy Resource Model Group Peer Review Committee Lawrence Livermore National Laboratories July, 2007 Warren Powell Alan Lamont Jeffrey Stewart Abraham George 2007 Warren B. Powell, Princeton

More information

Approximate Dynamic Programming in Rail Operations

Approximate Dynamic Programming in Rail Operations Approximate Dynamic Programming in Rail Operations June, 2007 Tristan VI Phuket Island, Thailand Warren Powell Belgacem Bouzaiene-Ayari CASTLE Laboratory Princeton University http://www.castlelab.princeton.edu

More information

The Optimizing-Simulator: Merging Optimization and Simulation Using Approximate Dynamic Programming

The Optimizing-Simulator: Merging Optimization and Simulation Using Approximate Dynamic Programming The Optimizing-Simulator: Merging Optimization and Simulation Using Approximate Dynamic Programming Winter Simulation Conference December 5, 2005 Warren Powell CASTLE Laboratory Princeton University http://www.castlelab.princeton.edu

More information

Approximate Dynamic Programming for High Dimensional Resource Allocation Problems

Approximate Dynamic Programming for High Dimensional Resource Allocation Problems Approximate Dynamic Programming for High Dimensional Resource Allocation Problems Warren B. Powell Abraham George Belgacem Bouzaiene-Ayari Hugo P. Simao Department of Operations Research and Financial

More information

On-line supplement to: SMART: A Stochastic Multiscale Model for the Analysis of Energy Resources, Technology

On-line supplement to: SMART: A Stochastic Multiscale Model for the Analysis of Energy Resources, Technology On-line supplement to: SMART: A Stochastic Multiscale Model for e Analysis of Energy Resources, Technology and Policy This online supplement provides a more detailed version of e model, followed by derivations

More information

What you should know about approximate dynamic programming

What you should know about approximate dynamic programming What you should know about approximate dynamic programming Warren B. Powell Department of Operations Research and Financial Engineering Princeton University, Princeton, NJ 08544 December 16, 2008 Abstract

More information

Tutorial: Stochastic Optimization in Energy

Tutorial: Stochastic Optimization in Energy Tutorial: Stochastic Optimization in Energy FERC, Washington, D.C. August 6, 2014 Warren B. Powell CASTLE Labs Princeton University http://www.castlelab.princeton.edu Warren B. Powell, 2014 Slide 1 Mission

More information

Approximate Dynamic Programming in Transportation and Logistics: A Unified Framework

Approximate Dynamic Programming in Transportation and Logistics: A Unified Framework Approximate Dynamic Programming in Transportation and Logistics: A Unified Framework Warren B. Powell, Hugo P. Simao and Belgacem Bouzaiene-Ayari Department of Operations Research and Financial Engineering

More information

Approximate Dynamic Programming in Transportation and Logistics: A Unified Framework

Approximate Dynamic Programming in Transportation and Logistics: A Unified Framework Approximate Dynamic Programming in Transportation and Logistics: A Unified Framework Warren B. Powell, Hugo P. Simao and Belgacem Bouzaiene-Ayari Department of Operations Research and Financial Engineering

More information

Perhaps one of the most widely used and poorly understood terms in dynamic programming is policy. A simple definition of a policy is:

Perhaps one of the most widely used and poorly understood terms in dynamic programming is policy. A simple definition of a policy is: CHAPTER 6 POLICIES Perhaps one of the most widely used and poorly understood terms in dynamic programming is policy. A simple definition of a policy is: Definition 6.0.1 A policy is a rule (or function)

More information

Bayesian Active Learning With Basis Functions

Bayesian Active Learning With Basis Functions Bayesian Active Learning With Basis Functions Ilya O. Ryzhov Warren B. Powell Operations Research and Financial Engineering Princeton University Princeton, NJ 08544, USA IEEE ADPRL April 13, 2011 1 / 29

More information

Chapter 7 Forecasting Demand

Chapter 7 Forecasting Demand Chapter 7 Forecasting Demand Aims of the Chapter After reading this chapter you should be able to do the following: discuss the role of forecasting in inventory management; review different approaches

More information

Approximate Dynamic Programming for High-Dimensional Resource Allocation Problems

Approximate Dynamic Programming for High-Dimensional Resource Allocation Problems Approximate Dynamic Programming for High-Dimensional Resource Allocation Problems Warren B. Powell Department of Operations Research and Financial Engineering Princeton University Benjamin Van Roy Departments

More information

Integrated Electricity Demand and Price Forecasting

Integrated Electricity Demand and Price Forecasting Integrated Electricity Demand and Price Forecasting Create and Evaluate Forecasting Models The many interrelated factors which influence demand for electricity cannot be directly modeled by closed-form

More information

We present a general optimization framework for locomotive models that captures different levels of detail,

We present a general optimization framework for locomotive models that captures different levels of detail, Articles in Advance, pp. 1 24 ISSN 0041-1655 (print) ISSN 1526-5447 (online) http://dx.doi.org/10.1287/trsc.2014.0536 2014 INFORMS From Single Commodity to Multiattribute Models for Locomotive Optimization:

More information

MODELING DYNAMIC PROGRAMS

MODELING DYNAMIC PROGRAMS CHAPTER 5 MODELING DYNAMIC PROGRAMS Perhaps one of the most important skills to develop in approximate dynamic programming is the ability to write down a model of the problem. Everyone who wants to solve

More information

High-dimensional Problems in Finance and Economics. Thomas M. Mertens

High-dimensional Problems in Finance and Economics. Thomas M. Mertens High-dimensional Problems in Finance and Economics Thomas M. Mertens NYU Stern Risk Economics Lab April 17, 2012 1 / 78 Motivation Many problems in finance and economics are high dimensional. Dynamic Optimization:

More information

DRIVING ROI. The Business Case for Advanced Weather Solutions for the Energy Market

DRIVING ROI. The Business Case for Advanced Weather Solutions for the Energy Market DRIVING ROI The Business Case for Advanced Weather Solutions for the Energy Market Table of Contents Energy Trading Challenges 3 Skill 4 Speed 5 Precision 6 Key ROI Findings 7 About The Weather Company

More information

SHORT TERM LOAD FORECASTING

SHORT TERM LOAD FORECASTING Indian Institute of Technology Kanpur (IITK) and Indian Energy Exchange (IEX) are delighted to announce Training Program on "Power Procurement Strategy and Power Exchanges" 28-30 July, 2014 SHORT TERM

More information

Proper Security Criteria Determination in a Power System with High Penetration of Renewable Resources

Proper Security Criteria Determination in a Power System with High Penetration of Renewable Resources Proper Security Criteria Determination in a Power System with High Penetration of Renewable Resources Mojgan Hedayati, Kory Hedman, and Junshan Zhang School of Electrical, Computer, and Energy Engineering

More information

Identifying Aggregate Liquidity Shocks with Monetary Policy Shocks: An Application using UK Data

Identifying Aggregate Liquidity Shocks with Monetary Policy Shocks: An Application using UK Data Identifying Aggregate Liquidity Shocks with Monetary Policy Shocks: An Application using UK Data Michael Ellington and Costas Milas Financial Services, Liquidity and Economic Activity Bank of England May

More information

R O B U S T E N E R G Y M AN AG E M E N T S Y S T E M F O R I S O L AT E D M I C R O G R I D S

R O B U S T E N E R G Y M AN AG E M E N T S Y S T E M F O R I S O L AT E D M I C R O G R I D S ROBUST ENERGY MANAGEMENT SYSTEM FOR ISOLATED MICROGRIDS Jose Daniel La r a Claudio Cañizares Ka nka r Bhattacharya D e p a r t m e n t o f E l e c t r i c a l a n d C o m p u t e r E n g i n e e r i n

More information

Some Fixed-Point Results for the Dynamic Assignment Problem

Some Fixed-Point Results for the Dynamic Assignment Problem Some Fixed-Point Results for the Dynamic Assignment Problem Michael Z. Spivey Department of Mathematics and Computer Science Samford University, Birmingham, AL 35229 Warren B. Powell Department of Operations

More information

Stochastic Dual Dynamic Programming with CVaR Risk Constraints Applied to Hydrothermal Scheduling. ICSP 2013 Bergamo, July 8-12, 2012

Stochastic Dual Dynamic Programming with CVaR Risk Constraints Applied to Hydrothermal Scheduling. ICSP 2013 Bergamo, July 8-12, 2012 Stochastic Dual Dynamic Programming with CVaR Risk Constraints Applied to Hydrothermal Scheduling Luiz Carlos da Costa Junior Mario V. F. Pereira Sérgio Granville Nora Campodónico Marcia Helena Costa Fampa

More information

APPROXIMATE DYNAMIC PROGRAMMING II: ALGORITHMS

APPROXIMATE DYNAMIC PROGRAMMING II: ALGORITHMS APPROXIMATE DYNAMIC PROGRAMMING II: ALGORITHMS INTRODUCTION WARREN B. POWELL Department of Operations Research and Financial Engineering, Princeton University, Princeton, New Jersey Approximate dynamic

More information

Multi-Area Stochastic Unit Commitment for High Wind Penetration

Multi-Area Stochastic Unit Commitment for High Wind Penetration Multi-Area Stochastic Unit Commitment for High Wind Penetration Workshop on Optimization in an Uncertain Environment Anthony Papavasiliou, UC Berkeley Shmuel S. Oren, UC Berkeley March 25th, 2011 Outline

More information

There has been considerable recent interest in the dynamic vehicle routing problem, but the complexities of

There has been considerable recent interest in the dynamic vehicle routing problem, but the complexities of TRANSPORTATION SCIENCE Vol. 38, No. 4, November 2004, pp. 399 419 issn 0041-1655 eissn 1526-5447 04 3804 0399 informs doi 10.1287/trsc.1030.0073 2004 INFORMS The Dynamic Assignment Problem Michael Z. Spivey,

More information

IMA Preprint Series # 2373

IMA Preprint Series # 2373 BATTERY STORAGE CONTROL FOR STEADYING RENEWABLE POWER GENERATION By Shengyuan (Mike) Chen, Emilie Danna, Kory Hedman, Mark Iwen, Wei Kang, John Marriott, Anders Nottrott, George Yin, and Qing Zhang IMA

More information

Dynamic Models for Freight Transportation

Dynamic Models for Freight Transportation Dynamic Models for Freight Transportation Warren B. Powell Belgacem Bouzaiene-Ayari Hugo P. Simao Department of Operations Research and Financial Engineering Princeton University August 13, 2003 Abstract

More information

Optimization Tools in an Uncertain Environment

Optimization Tools in an Uncertain Environment Optimization Tools in an Uncertain Environment Michael C. Ferris University of Wisconsin, Madison Uncertainty Workshop, Chicago: July 21, 2008 Michael Ferris (University of Wisconsin) Stochastic optimization

More information

OPTIMIZATION UNDER UNCERTAINTY A unified framework (Draft)

OPTIMIZATION UNDER UNCERTAINTY A unified framework (Draft) OPTIMIZATION UNDER UNCERTAINTY A unified framework (Draft) Warren B. Powell September 15, 2016 A JOHN WILEY & SONS, INC., PUBLICATION Copyright c 2016 by John Wiley & Sons, Inc. All rights reserved. Published

More information

Incorporating Demand Response with Load Shifting into Stochastic Unit Commitment

Incorporating Demand Response with Load Shifting into Stochastic Unit Commitment Incorporating Demand Response with Load Shifting into Stochastic Unit Commitment Frank Schneider Supply Chain Management & Management Science, University of Cologne, Cologne, Germany, frank.schneider@uni-koeln.de

More information

SYSTEM OPERATIONS. Dr. Frank A. Monforte

SYSTEM OPERATIONS. Dr. Frank A. Monforte SYSTEM OPERATIONS FORECASTING Dr. Frank A. Monforte Itron s Forecasting Brown Bag Seminar September 13, 2011 PLEASE REMEMBER» In order to help this session run smoothly, your phones are muted.» To make

More information

WEATHER NORMALIZATION METHODS AND ISSUES. Stuart McMenamin Mark Quan David Simons

WEATHER NORMALIZATION METHODS AND ISSUES. Stuart McMenamin Mark Quan David Simons WEATHER NORMALIZATION METHODS AND ISSUES Stuart McMenamin Mark Quan David Simons Itron Forecasting Brown Bag September 17, 2013 Please Remember» Phones are Muted: In order to help this session run smoothly,

More information

Optimal Demand Response

Optimal Demand Response Optimal Demand Response Libin Jiang Steven Low Computing + Math Sciences Electrical Engineering Caltech Oct 2011 Outline Caltech smart grid research Optimal demand response Global trends 1 Exploding renewables

More information

HEALTHCARE. 5 Components of Accurate Rolling Forecasts in Healthcare

HEALTHCARE. 5 Components of Accurate Rolling Forecasts in Healthcare HEALTHCARE 5 Components of Accurate Rolling Forecasts in Healthcare Introduction Rolling Forecasts Improve Accuracy and Optimize Decisions Accurate budgeting, planning, and forecasting are essential for

More information

A Progressive Hedging Approach to Multistage Stochastic Generation and Transmission Investment Planning

A Progressive Hedging Approach to Multistage Stochastic Generation and Transmission Investment Planning A Progressive Hedging Approach to Multistage Stochastic Generation and Transmission Investment Planning Yixian Liu Ramteen Sioshansi Integrated Systems Engineering Department The Ohio State University

More information

Multi-Area Stochastic Unit Commitment for High Wind Penetration in a Transmission Constrained Network

Multi-Area Stochastic Unit Commitment for High Wind Penetration in a Transmission Constrained Network Multi-Area Stochastic Unit Commitment for High Wind Penetration in a Transmission Constrained Network Anthony Papavasiliou Center for Operations Research and Econometrics Université catholique de Louvain,

More information

Traffic Flow Impact (TFI)

Traffic Flow Impact (TFI) Traffic Flow Impact (TFI) Michael P. Matthews 27 October 2015 Sponsor: Yong Li, FAA ATO AJV-73 Technical Analysis & Operational Requirements Distribution Statement A. Approved for public release; distribution

More information

Introduction to Reinforcement Learning. CMPT 882 Mar. 18

Introduction to Reinforcement Learning. CMPT 882 Mar. 18 Introduction to Reinforcement Learning CMPT 882 Mar. 18 Outline for the week Basic ideas in RL Value functions and value iteration Policy evaluation and policy improvement Model-free RL Monte-Carlo and

More information

APPROXIMATE dynamic programming (ADP) has

APPROXIMATE dynamic programming (ADP) has IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 60, NO. 3, MARCH 2015 743 A New Optimal Stepsize for Approximate Dynamic Programming Ilya O. Ryzhov, Member, IEEE, Peter I. Frazier, and Warren B. Powell, Member,

More information

1 The Basic RBC Model

1 The Basic RBC Model IHS 2016, Macroeconomics III Michael Reiter Ch. 1: Notes on RBC Model 1 1 The Basic RBC Model 1.1 Description of Model Variables y z k L c I w r output level of technology (exogenous) capital at end of

More information

Dynamic Models for Freight Transportation

Dynamic Models for Freight Transportation Dynamic Models for Freight Transportation Warren B. Powell Belgacem Bouzaiene-Ayari Hugo P. Simao Department of Operations Research and Financial Engineering Princeton University September 14, 2005 Abstract

More information

Project Discussions: SNL/ADMM, MDP/Randomization, Quadratic Regularization, and Online Linear Programming

Project Discussions: SNL/ADMM, MDP/Randomization, Quadratic Regularization, and Online Linear Programming Project Discussions: SNL/ADMM, MDP/Randomization, Quadratic Regularization, and Online Linear Programming Yinyu Ye Department of Management Science and Engineering Stanford University Stanford, CA 94305,

More information

An Approximate Dynamic Programming Algorithm for the Allocation of High-Voltage Transformer Spares in the Electric Grid

An Approximate Dynamic Programming Algorithm for the Allocation of High-Voltage Transformer Spares in the Electric Grid An Approximate Dynamic Programming Algorithm for the Allocation of High-Voltage Transformer Spares in the Electric Grid Johannes Enders Department of Operations Research and Financial Engineering Princeton

More information

The Knowledge Gradient for Sequential Decision Making with Stochastic Binary Feedbacks

The Knowledge Gradient for Sequential Decision Making with Stochastic Binary Feedbacks The Knowledge Gradient for Sequential Decision Making with Stochastic Binary Feedbacks Yingfei Wang, Chu Wang and Warren B. Powell Princeton University Yingfei Wang Optimal Learning Methods June 22, 2016

More information

FINAL REPORT EVALUATION REVIEW OF TVA'S LOAD FORECAST RISK

FINAL REPORT EVALUATION REVIEW OF TVA'S LOAD FORECAST RISK Memorandum from the Office of the Inspector General Robert Irvin, WT 9C-K FINAL REPORT EVALUATION 2012-14507 REVIEW OF TVA'S LOAD FORECAST RISK As part of a series of reviews to evaluate the Tennessee

More information

Real Business Cycle Model (RBC)

Real Business Cycle Model (RBC) Real Business Cycle Model (RBC) Seyed Ali Madanizadeh November 2013 RBC Model Lucas 1980: One of the functions of theoretical economics is to provide fully articulated, artificial economic systems that

More information

AESO Load Forecast Application for Demand Side Participation. Eligibility Working Group September 26, 2017

AESO Load Forecast Application for Demand Side Participation. Eligibility Working Group September 26, 2017 AESO Load Forecast Application for Demand Side Participation Eligibility Working Group September 26, 2017 Load forecasting for the Capacity Market Demand Considerations Provide further information on forecasting

More information

Planning a 100 percent renewable electricity system

Planning a 100 percent renewable electricity system Planning a 100 percent renewable electricity system Andy Philpott Electric Power Optimization Centre University of Auckland www.epoc.org.nz (Joint work with Michael Ferris) INI Open for Business Meeting,

More information

Why do we care? Measurements. Handling uncertainty over time: predicting, estimating, recognizing, learning. Dealing with time

Why do we care? Measurements. Handling uncertainty over time: predicting, estimating, recognizing, learning. Dealing with time Handling uncertainty over time: predicting, estimating, recognizing, learning Chris Atkeson 2004 Why do we care? Speech recognition makes use of dependence of words and phonemes across time. Knowing where

More information

PANEL DISCUSSION: THE ROLE OF POTENTIAL OUTPUT IN POLICYMAKING

PANEL DISCUSSION: THE ROLE OF POTENTIAL OUTPUT IN POLICYMAKING PANEL DISCUSSION: THE ROLE OF POTENTIAL OUTPUT IN POLICYMAKING James Bullard* Federal Reserve Bank of St. Louis 33rd Annual Economic Policy Conference St. Louis, MO October 17, 2008 Views expressed are

More information

Agenda today. Introduction to prescriptive modeling. Linear optimization models through three examples: Beyond linear optimization

Agenda today. Introduction to prescriptive modeling. Linear optimization models through three examples: Beyond linear optimization Agenda today Introduction to prescriptive modeling Linear optimization models through three examples: 1 Production and inventory optimization 2 Distribution system design 3 Stochastic optimization Beyond

More information

Using Static Flow Patterns in Time-Staged Resource Allocation Problems

Using Static Flow Patterns in Time-Staged Resource Allocation Problems Using Static Flow Patterns in Time-Staged Resource Allocation Problems Arun Marar Warren B. Powell Hugo P. Simão Department of Operations Research and Financial Engineering, Princeton University, Princeton,

More information

WE propose an approximate dynamic programming

WE propose an approximate dynamic programming IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL 58, NO 12, DECEMBER 2013 2995 An Optimal Approximate Dynamic Programming Algorithm for Concave, Scalar Storage Problems With Vector-Valued Controls Juliana Nascimento

More information

Battery Storage Control for Steadying Renewable Power Generation

Battery Storage Control for Steadying Renewable Power Generation Battery Storage Control for Steadying Renewable Power Generation Shengyuan (Mike) Chen, Emilie Danna, Kory Hedman, Mark Iwen, Wei Kang, John Marriott, Anders Nottrott, George Yin, Qing Zhang March 13,

More information

Approximate dynamic programming in transportation and logistics: a unified framework

Approximate dynamic programming in transportation and logistics: a unified framework EURO J Transp Logist (2012) 1:237 284 DOI 10.1007/s13676-012-0015-8 TUTORIAL Approximate dynamic programming in transportation and logistics: a unified framework Warren B. Powell Hugo P. Simao Belgacem

More information

Warwick Business School Forecasting System. Summary. Ana Galvao, Anthony Garratt and James Mitchell November, 2014

Warwick Business School Forecasting System. Summary. Ana Galvao, Anthony Garratt and James Mitchell November, 2014 Warwick Business School Forecasting System Summary Ana Galvao, Anthony Garratt and James Mitchell November, 21 The main objective of the Warwick Business School Forecasting System is to provide competitive

More information

Information Choice in Macroeconomics and Finance.

Information Choice in Macroeconomics and Finance. Information Choice in Macroeconomics and Finance. Laura Veldkamp New York University, Stern School of Business, CEPR and NBER Spring 2009 1 Veldkamp What information consumes is rather obvious: It consumes

More information

TRANSMISSION BUSINESS LOAD FORECAST AND METHODOLOGY

TRANSMISSION BUSINESS LOAD FORECAST AND METHODOLOGY Filed: September, 00 EB-00-00 Tab Schedule Page of 0 TRANSMISSION BUSINESS LOAD FORECAST AND METHODOLOGY.0 INTRODUCTION 0 This exhibit discusses Hydro One Networks transmission system load forecast and

More information

2013 WEATHER NORMALIZATION SURVEY. Industry Practices

2013 WEATHER NORMALIZATION SURVEY. Industry Practices 2013 WEATHER NORMALIZATION SURVEY Industry Practices FORECASTING SPECIALIZATION Weather Operational Forecasting Short-term Forecasting to support: System Operations and Energy Trading Hourly Load Financial/Budget

More information

Stochastic Optimization

Stochastic Optimization Chapter 27 Page 1 Stochastic Optimization Operations research has been particularly successful in two areas of decision analysis: (i) optimization of problems involving many variables when the outcome

More information

MIDAS: A Mixed Integer Dynamic Approximation Scheme

MIDAS: A Mixed Integer Dynamic Approximation Scheme MIDAS: A Mixed Integer Dynamic Approximation Scheme Andy Philpott, Faisal Wahid, Frédéric Bonnans May 7, 2016 Abstract Mixed Integer Dynamic Approximation Scheme (MIDAS) is a new sampling-based algorithm

More information

Chapter 6. Maximum Likelihood Analysis of Dynamic Stochastic General Equilibrium (DSGE) Models

Chapter 6. Maximum Likelihood Analysis of Dynamic Stochastic General Equilibrium (DSGE) Models Chapter 6. Maximum Likelihood Analysis of Dynamic Stochastic General Equilibrium (DSGE) Models Fall 22 Contents Introduction 2. An illustrative example........................... 2.2 Discussion...................................

More information

Time Series and Forecasting

Time Series and Forecasting Time Series and Forecasting Introduction to Forecasting n What is forecasting? n Primary Function is to Predict the Future using (time series related or other) data we have in hand n Why are we interested?

More information

Forecasting demand in the National Electricity Market. October 2017

Forecasting demand in the National Electricity Market. October 2017 Forecasting demand in the National Electricity Market October 2017 Agenda Trends in the National Electricity Market A review of AEMO s forecasting methods Long short-term memory (LSTM) neural networks

More information

Handout 1: Introduction to Dynamic Programming. 1 Dynamic Programming: Introduction and Examples

Handout 1: Introduction to Dynamic Programming. 1 Dynamic Programming: Introduction and Examples SEEM 3470: Dynamic Optimization and Applications 2013 14 Second Term Handout 1: Introduction to Dynamic Programming Instructor: Shiqian Ma January 6, 2014 Suggested Reading: Sections 1.1 1.5 of Chapter

More information

Clearing the Jungle of Stochastic Optimization

Clearing the Jungle of Stochastic Optimization Clearing the Jungle of Stochastic Optimization Warren B. Powell Department of Operations Research and Financial Engineering Princeton University Prepared for Informs TutORials, 2014. DRAFT April 7, 2014

More information

Dynamic Risk Measures and Nonlinear Expectations with Markov Chain noise

Dynamic Risk Measures and Nonlinear Expectations with Markov Chain noise Dynamic Risk Measures and Nonlinear Expectations with Markov Chain noise Robert J. Elliott 1 Samuel N. Cohen 2 1 Department of Commerce, University of South Australia 2 Mathematical Insitute, University

More information

1 Bewley Economies with Aggregate Uncertainty

1 Bewley Economies with Aggregate Uncertainty 1 Bewley Economies with Aggregate Uncertainty Sofarwehaveassumedawayaggregatefluctuations (i.e., business cycles) in our description of the incomplete-markets economies with uninsurable idiosyncratic risk

More information

2013 Weather Normalization Survey. Itron, Inc El Camino Real San Diego, CA

2013 Weather Normalization Survey. Itron, Inc El Camino Real San Diego, CA Itron, Inc. 11236 El Camino Real San Diego, CA 92130 2650 858 724 2620 March 2014 Weather normalization is the process of reconstructing historical energy consumption assuming that normal weather occurred

More information

CP:

CP: Adeng Pustikaningsih, M.Si. Dosen Jurusan Pendidikan Akuntansi Fakultas Ekonomi Universitas Negeri Yogyakarta CP: 08 222 180 1695 Email : adengpustikaningsih@uny.ac.id Operations Management Forecasting

More information

Birgit Rudloff Operations Research and Financial Engineering, Princeton University

Birgit Rudloff Operations Research and Financial Engineering, Princeton University TIME CONSISTENT RISK AVERSE DYNAMIC DECISION MODELS: AN ECONOMIC INTERPRETATION Birgit Rudloff Operations Research and Financial Engineering, Princeton University brudloff@princeton.edu Alexandre Street

More information

Uncertainty in energy system models

Uncertainty in energy system models Uncertainty in energy system models Amy Wilson Durham University May 2015 Table of Contents 1 Model uncertainty 2 3 Example - generation investment 4 Conclusion Model uncertainty Contents 1 Model uncertainty

More information

Weather Risk Management

Weather Risk Management Weather Risk Management David Molyneux, FCAS Introduction Weather Risk - Revenue or profits that are sensitive to weather conditions Weather Derivatives - Financial Products that allow companies to manage

More information

Operations Report. Tag B. Short, Director South Region Operations. Entergy Regional State Committee (ERSC) February 14, 2018

Operations Report. Tag B. Short, Director South Region Operations. Entergy Regional State Committee (ERSC) February 14, 2018 Operations Report Tag B. Short, Director South Region Operations Entergy Regional State Committee (ERSC) February 14, 2018 1 Winter Operations Highlights South Region Max Gen Event Regional Dispatch Transfer

More information

Optimal Demand Response

Optimal Demand Response Optimal Demand Response Libin Jiang Steven Low Computing + Math Sciences Electrical Engineering Caltech June 2011 Outline o Motivation o Demand response model o Some results Wind power over land (outside

More information

JOINT PRICING AND PRODUCTION PLANNING FOR FIXED PRICED MULTIPLE PRODUCTS WITH BACKORDERS. Lou Caccetta and Elham Mardaneh

JOINT PRICING AND PRODUCTION PLANNING FOR FIXED PRICED MULTIPLE PRODUCTS WITH BACKORDERS. Lou Caccetta and Elham Mardaneh JOURNAL OF INDUSTRIAL AND doi:10.3934/jimo.2010.6.123 MANAGEMENT OPTIMIZATION Volume 6, Number 1, February 2010 pp. 123 147 JOINT PRICING AND PRODUCTION PLANNING FOR FIXED PRICED MULTIPLE PRODUCTS WITH

More information

LOADS, CUSTOMERS AND REVENUE

LOADS, CUSTOMERS AND REVENUE EB-00-0 Exhibit K Tab Schedule Page of 0 0 LOADS, CUSTOMERS AND REVENUE The purpose of this evidence is to present the Company s load, customer and distribution revenue forecast for the test year. The

More information

Financial Factors in Economic Fluctuations. Lawrence Christiano Roberto Motto Massimo Rostagno

Financial Factors in Economic Fluctuations. Lawrence Christiano Roberto Motto Massimo Rostagno Financial Factors in Economic Fluctuations Lawrence Christiano Roberto Motto Massimo Rostagno Background Much progress made on constructing and estimating models that fit quarterly data well (Smets-Wouters,

More information

Balancing and Control of a Freely-Swinging Pendulum Using a Model-Free Reinforcement Learning Algorithm

Balancing and Control of a Freely-Swinging Pendulum Using a Model-Free Reinforcement Learning Algorithm Balancing and Control of a Freely-Swinging Pendulum Using a Model-Free Reinforcement Learning Algorithm Michail G. Lagoudakis Department of Computer Science Duke University Durham, NC 2778 mgl@cs.duke.edu

More information

SMART GRID FORECASTING

SMART GRID FORECASTING SMART GRID FORECASTING AND FINANCIAL ANALYTICS Itron Forecasting Brown Bag December 11, 2012 PLEASE REMEMBER» Phones are Muted: In order to help this session run smoothly, your phones are muted.» Full

More information

When One Size No Longer Fits All Electric Load Forecasting with a Geographic Hierarchy WHITE PAPER

When One Size No Longer Fits All Electric Load Forecasting with a Geographic Hierarchy WHITE PAPER When One Size No Longer Fits All Electric Load Forecasting with a Geographic Hierarchy WHITE PAPER SAS White Paper Table of Contents Executive Summary.... 1 Electric Load Forecasting Challenges... 1 Selected

More information

Lecture 1: March 7, 2018

Lecture 1: March 7, 2018 Reinforcement Learning Spring Semester, 2017/8 Lecture 1: March 7, 2018 Lecturer: Yishay Mansour Scribe: ym DISCLAIMER: Based on Learning and Planning in Dynamical Systems by Shie Mannor c, all rights

More information

Impact on Agriculture

Impact on Agriculture Weather Variability and the Impact on Agriculture InfoAg 2017 Copyright 2017, awhere. All Rights Reserved The Problem: The Earth s Atmosphere is a Heat Engine In transition 1 C warming of atmosphere Triples

More information

Mathematical Optimization Models and Applications

Mathematical Optimization Models and Applications Mathematical Optimization Models and Applications Yinyu Ye Department of Management Science and Engineering Stanford University Stanford, CA 94305, U.S.A. http://www.stanford.edu/ yyye Chapters 1, 2.1-2,

More information

AN INTERNATIONAL SOLAR IRRADIANCE DATA INGEST SYSTEM FOR FORECASTING SOLAR POWER AND AGRICULTURAL CROP YIELDS

AN INTERNATIONAL SOLAR IRRADIANCE DATA INGEST SYSTEM FOR FORECASTING SOLAR POWER AND AGRICULTURAL CROP YIELDS AN INTERNATIONAL SOLAR IRRADIANCE DATA INGEST SYSTEM FOR FORECASTING SOLAR POWER AND AGRICULTURAL CROP YIELDS James Hall JHTech PO Box 877 Divide, CO 80814 Email: jameshall@jhtech.com Jeffrey Hall JHTech

More information

peak half-hourly Tasmania

peak half-hourly Tasmania Forecasting long-term peak half-hourly electricity demand for Tasmania Dr Shu Fan B.S., M.S., Ph.D. Professor Rob J Hyndman B.Sc. (Hons), Ph.D., A.Stat. Business & Economic Forecasting Unit Report for

More information

Stochastic and Adaptive Optimal Control

Stochastic and Adaptive Optimal Control Stochastic and Adaptive Optimal Control Robert Stengel Optimal Control and Estimation, MAE 546 Princeton University, 2018! Nonlinear systems with random inputs and perfect measurements! Stochastic neighboring-optimal

More information

Miscellaneous. Regarding reading materials. Again, ask questions (if you have) and ask them earlier

Miscellaneous. Regarding reading materials. Again, ask questions (if you have) and ask them earlier Miscellaneous Regarding reading materials Reading materials will be provided as needed If no assigned reading, it means I think the material from class is sufficient Should be enough for you to do your

More information

Why do we care? Examples. Bayes Rule. What room am I in? Handling uncertainty over time: predicting, estimating, recognizing, learning

Why do we care? Examples. Bayes Rule. What room am I in? Handling uncertainty over time: predicting, estimating, recognizing, learning Handling uncertainty over time: predicting, estimating, recognizing, learning Chris Atkeson 004 Why do we care? Speech recognition makes use of dependence of words and phonemes across time. Knowing where

More information

A stochastic integer programming approach to the optimal thermal and wind generator scheduling problem

A stochastic integer programming approach to the optimal thermal and wind generator scheduling problem A stochastic integer programming approach to the optimal thermal and wind generator scheduling problem Presented by Michael Chen York University Industrial Optimization Seminar Fields Institute for Research

More information

Finding the Value of Information About a State Variable in a Markov Decision Process 1

Finding the Value of Information About a State Variable in a Markov Decision Process 1 05/25/04 1 Finding the Value of Information About a State Variable in a Markov Decision Process 1 Gilvan C. Souza The Robert H. Smith School of usiness, The University of Maryland, College Park, MD, 20742

More information

BEFORE THE PUBLIC UTILITIES COMMISSION OF THE STATE OF COLORADO * * * *

BEFORE THE PUBLIC UTILITIES COMMISSION OF THE STATE OF COLORADO * * * * Exhibit No. 1 BEFORE THE PUBLIC UTILITIES COMMISSION OF THE STATE OF COLORADO * * * * IN THE MATTER OF THE APPLICATION OF PUBLIC SERVICE COMPANY OF COLORADO FOR APPROVAL OF ITS 0 ELECTRIC RESOURCE PLAN

More information

Demand Forecasting Models

Demand Forecasting Models E 2017 PSE Integrated Resource Plan Demand Forecasting Models This appendix describes the econometric models used in creating the demand forecasts for PSE s 2017 IRP analysis. Contents 1. ELECTRIC BILLED

More information

Chapter 8 - Forecasting

Chapter 8 - Forecasting Chapter 8 - Forecasting Operations Management by R. Dan Reid & Nada R. Sanders 4th Edition Wiley 2010 Wiley 2010 1 Learning Objectives Identify Principles of Forecasting Explain the steps in the forecasting

More information

Speculation and the Bond Market: An Empirical No-arbitrage Framework

Speculation and the Bond Market: An Empirical No-arbitrage Framework Online Appendix to the paper Speculation and the Bond Market: An Empirical No-arbitrage Framework October 5, 2015 Part I: Maturity specific shocks in affine and equilibrium models This Appendix present

More information

Reinforcement Learning

Reinforcement Learning Reinforcement Learning March May, 2013 Schedule Update Introduction 03/13/2015 (10:15-12:15) Sala conferenze MDPs 03/18/2015 (10:15-12:15) Sala conferenze Solving MDPs 03/20/2015 (10:15-12:15) Aula Alpha

More information

REINFORCE Framework for Stochastic Policy Optimization and its use in Deep Learning

REINFORCE Framework for Stochastic Policy Optimization and its use in Deep Learning REINFORCE Framework for Stochastic Policy Optimization and its use in Deep Learning Ronen Tamari The Hebrew University of Jerusalem Advanced Seminar in Deep Learning (#67679) February 28, 2016 Ronen Tamari

More information

The AIR Bushfire Model for Australia

The AIR Bushfire Model for Australia The AIR Bushfire Model for Australia In February 2009, amid tripledigit temperatures and drought conditions, fires broke out just north of Melbourne, Australia. Propelled by high winds, as many as 400

More information