Approximate Dynamic Programming: Solving the curses of dimensionality
|
|
- Anthony Harold Turner
- 5 years ago
- Views:
Transcription
1 Approximate Dynamic Programming: Solving the curses of dimensionality Informs Computing Society Tutorial October, 2008 Warren Powell CASTLE Laboratory Princeton University Warren B. Powell, Princeton University 2008 Warren B. Powell Slide 1
2 2008 Warren B. Powell Slide 2
3 2008 Warren B. Powell Slide 3
4 The fractional jet ownership industry 2008 Warren B. Powell Slide 4
5 NetJets Inc Warren B. Powell Slide 5
6 Schneider National 2008 Warren B. Powell Slide 6
7 Schneider National 2008 Warren B. Powell Slide 7
8 Planning for a risky world Weather Robust design of emergency response networks. Design of financial instruments to hedge against weather emergencies to protect individuals, companies and municipalities. Design of sensor networks and communication systems to manage responses to major weather events. Disease Models of disease propagation for response planning. Management of medical personnel, equipment and vaccines to respond to a disease outbreak. Robust design of supply chains to mitigate the disruption of transportation systems Warren B. Powell Slide 8
9 Energy management Energy resource allocation What is the right mix of energy technologies? How should the use of different energy resources be coordinated over space and time? What should my energy R&D portfolio look like? Should I invest in nuclear energy? What is the impact of a carbon tax? Energy markets How should I hedge energy commodities? How do I price energy assets? What is the right price for energy futures? 2008 Warren B. Powell Slide 9
10 High value spare parts Electric Power Grid PJM oversees an aging investment in highvoltage transformers. Replacement strategy needs to anticipate a bulge in retirements and failures 1-2 year lag times on orders. Typical cost of a replacement ~ $5 million. Failures vary widely in terms of economic impact on the network. Spare parts for business jets ADP is used to determine purchasing and allocation strategies for over 400 high value spare parts. Inventory strategy has to determine what to buy, when and where to store it. Many parts are very low volume (e.g. 7 spares spread across 15 service centers). Inventories have to meet global targets on level of service and inventory costs Warren B. Powell Slide 10
11 Challenges Real-time control» Scheduling aircraft, pilots, generators, tankers» Pricing stocks, options» Electricity resource allocation Near-term tactical planning» Can I accept a customer request?» Should I lease equipment?» How much energy can I commit with my windmills? Strategic planning» What is the right equipment mix?» What is the value of this contract?» What is the value of more reliable aircraft? 2008 Warren B. Powell Slide 11
12 Outline A resource allocation model An introduction to ADP ADP and the post-decision state variable A blood management example Hierarchical aggregation Stepsizes How well does it work? Some applications» Transportation» Energy
13 Outline A resource allocation model An introduction to ADP ADP and the post-decision state variable A blood management example Hierarchical aggregation Stepsizes How well does it work? Some applications» Transportation» Energy
14 A resource allocation model Attribute vectors: a = Asset class Time invested Type Location Age Location ETA Home Experience Driving hours Location ETA A/C type Fuel level Home shop Crew Eqpt1 Eqpt Warren B. Powell Slide 14
15 A resource allocation model Modeling resources:» The attributes of a single resource: a = The attributes of a single resource a A The attribute space» The resource state vector: R» The information process: ˆ ta R ta = The number of resources with attribute ( ) Rt = Rta a A The resource state vector = The change in the number of resources with attribute a. a 2008 Warren B. Powell Slide 15
16 A resource allocation model Modeling demands:» The attributes of a single demand: b = The attributes of a demand to be served. b B The attribute space» The demand state vector: D = The number of demands with attribute Dt = Dtb The demand state vector b B» The information process: ˆ tb D tb ( ) = The change in the number of demands with attribute b. b 2008 Warren B. Powell Slide 16
17 Energy resource modeling The system state: S ( R D ρ ) =,, = System state, where: t t t t R t D t = Resource state (how much capacity, reserves) = Market demands ρ = "system parameters" t State of the technology (costs, performance) Climate, weather (temperature, rainfall, wind) Government policies (tax rebates on solar panels) Market prices (oil, coal) 2008 Warren B. Powell Slide 17
18 Energy resource modeling The decision variable: x t New capacity Retired capacity for each: = Type Location Technology 2008 Warren B. Powell Slide 18
19 Energy resource modeling Exogenous information: W = ( Rˆ Dˆ ˆ ρ ) New information =,, t t t t Rˆ t = Exogenous changes in capacity, reserves Dˆ t = New demands for energy from each source ˆ ρ = Exogenous changes in parameters. t 2008 Warren B. Powell Slide 19
20 Energy resource modeling The transition function = M t+ 1 t t t+ 1 S S ( S, x, W ) 2008 Warren B. Powell Slide 20
21 A resource allocation model Resources Demands 2008 Warren B. Powell Slide 21
22 A resource allocation model t t+1 t Warren B. Powell Slide 22
23 A resource allocation model Optimizing over time t t+1 t+2 Optimizing at a point in time 2008 Warren B. Powell Slide 23
24 Outline A resource allocation model An introduction to ADP ADP and the post-decision state variable A blood management example Hierarchical aggregation Stepsizes How well does it work? Some applications» Transportation» Energy
25 Introduction to ADP We just solved Bellman s equation: V ( S ) = max C ( S, x ) + E V ( S ) S { } t t t t t t+ 1 t+ 1 t x X» We found the value of being in each state by stepping backward through the tree Warren B. Powell Slide 25
26 Introduction to ADP The challenge of dynamic programming: ( { }) V ( S ) = max C ( S, x ) + E V ( S ) S t t t t t t+ 1 t+ 1 t x X Problem: Curse of dimensionality 2008 Warren B. Powell Slide 26
27 Introduction to ADP The challenge of dynamic programming: ( { }) V ( S ) = max C ( S, x ) + E V ( S ) S t t t t t t+ 1 t+ 1 t x X Three curses Problem: Curse of dimensionality State space Outcome space Action space (feasible region) 2008 Warren B. Powell Slide 27
28 Introduction to ADP The computational challenge: ( { }) V ( S ) = max C ( S, x ) + E V ( S ) S t t t t t t+ 1 t+ 1 t x X How do we find V ( S )? t+ 1 t+ 1 How do we compute the expectation? How do we find the optimal solution? 2008 Warren B. Powell Slide 28
29 Introduction to ADP Classical ADP» Most applications of ADP focus on the challenge of handling multidimensional state variables» Start with ( { }) V ( S ) = max C ( S, x ) + E V ( S ) S t t t t t t+ 1 t+ 1 t x X» Now replace the value function with some sort of approximation V ( S ) V ( S ) t t t t
30 Introduction to ADP Approximating the value function:» We have to exploit the structure of the value function (e.g. concavity).» We might approximate the value function using a simple polynomial V ( S θ) = θ + θ S + θ S 2 t t 0 1 t 2 t».. or a complicated one: V ( S θ) = θ + θ S + θ S + θ ln S + θ sin( S )» Sometimes, they get really messy: ( ) 2 t t 0 1 t 2 t 3 t 4 t
31 t+ 2 t+ 3 (0) (1) (1) t( θ) = θ + θtst, ' tst, ' + θtwt, ' twt, ' s t' = t w t' = t V R R R + θ R + (2) 2 (2) 2 twt, ' twt, ' tst, ' tst, ' w t' s t' + θ R s 1 (3) ts t, st t, s' t S s' t+ 1 t+ 1 (4) θts Rt, st ' Rt, s' t ' s t' = t 2S s' t' = t R 1 θ 2 R t+ 2 t+ 2 (5) θts Rt, st ' Rt, s' t ' s t' = t 3S s' t' = t w s θ R R ( ws,1) tws, twt, tst, t+ 2 ( ws,2) + θtws, Rtwt, Rtst, ' w s t' = t t+ 3 t+ 2 ( ws,3) θtws, Rtwt, ' Rtst, ' w s t' = t t' = t t+ 2 t+ 2 ( ws,4) θtws, Rtwt, ' Rtst, ' w s t ' = t t' = t w s θ 1 R R R ( ws,5) tsw, tst,, + 2 twt, twt,, ,2,3 4,5,6,
32 Introduction to ADP We can write a model of the observed value of being in a state as: vˆ = θ + θ S + θ S + θ ln S + θ sin( S ) + ε ( ) t 2 t 3 t 4 t This is often written as a generic regression model: Y = θ + θ X + θ X + θ X + θ X The ADP community refers to the independent variables as basis functions: Y = θ ϕ ( S) + θϕ ( S) + θ ϕ ( S) + θ ϕ ( S) + θ ϕ ( S) = θϕ ( S) f F f f ϕ ( R) are also known as features. f
33 Introduction to ADP Methods for estimating»» θ 1 2 N ˆ ˆ ˆ Generate observations v, v,..., v, and use traditional regression methods to fit θ. n Stochastic gradient for updating θ : ( ˆ ) θ = θ α V ( S θ ) v V ( S θ ) n n 1 n 1 n n 1 n n 1 n n 1 n 1 ( V ( S ) vˆ ) = θ α θ n 1 n 1 n n 1 n n 1 Error ϕ1( S) ϕ2( S) ϕf ( S) Basis functions
34 Introduction to ADP Methods for estimating θ» Recursive statistics Iterative equations avoid matrix inverse: n n 1 n n n n θ = θ H x ε ε = Error n 1 n 1 T H = B B is F F matrix= XX n γ ( ( ) ) n n 1 1 n 1 n n T n 1 B = B B x x B n γ γ n n T n 1 n = 1+ ( ) x B x 1
35 Introduction to ADP Other statistical methods» Regression trees Combines regression with techniques for discrete variables.» Data mining Good for categorical data» Neural networks Engineers like this for low-dimensional continuous problems
36 Introduction to ADP Notes:» When approximating value functions, we are basically drawing on the entire field of statistics.» Choosing an approximation is primarily an art.» In special cases, the resulting algorithm can produce optimal solutions.» Most of the time, we are hoping for good solutions.» In some cases, it can work terribly.» As a general rule you have to use problem structure. Value function approximations have to capture the right structure. Blind use of polynomials will rarely be successful.
37 Introduction to ADP What you will struggle with:» Stepsizes Can t live with em, can t live without em. Too small, you think you have converged but you have really just stalled ( apparent convergence ) Too large, and the system is unstable.» Stability There are two sources of randomness: The traditional exogenous randomness An evolving policy» Exploration vs. exploitation You sometimes have to choose to visit a state just to collect information about the value of being in a state.
38 Introduction to ADP But we are not out of the woods» Assume we have an approximate value function.» We still have to solve a problem that looks like Vt( St) = max Ct( St, xt) + E θφ f f ( St+ 1) x X f F» This means we still have to deal with a maximization problem (might be a linear, nonlinear or integer program) with an expectation.
39 Outline A resource allocation model An introduction to ADP ADP and the post-decision state variable A blood management example Hierarchical aggregation Stepsizes How well does it work? Some applications» Transportation» Energy
40 Action State Use weather report Do not use weather report Information Forecast rain.1 Forecast cloudy.3 Forecast sunny.6 Schedule game Cancel game State Action Schedule game Cancel game Schedule game Cancel game Schedule game Cancel game Rain.2 -$2000 Clouds.3 $1000 Sun.5 $5000 Rain.2 -$200 Clouds.3 -$200 Sun.5 -$200 Information Rain.8 -$2000 Clouds.2 $1000 Sun.0 $5000 Rain.8 -$200 Clouds.2 -$200 Sun.0 -$200 Rain.1 -$2000 Clouds.5 $1000 Sun.4 $5000 Rain.1 -$200 Clouds.5 -$200 Sun.4 -$200 Rain.1 -$2000 Clouds.2 $1000 Sun.7 $5000 Rain.1 -$200 Clouds.2 -$200 Sun.7 -$200 - Decision nodes - Outcome nodes
41 Schedule game Cancel game Schedule game Cancel game Schedule game Cancel game $2400 -$200 -$1400 -$200 $2300 -$200 $3500 -$200 Forecast rain.1 Forecast cloudy.3 Forecast sunny.6 Schedule game Cancel game Use weather report Do not use weather report
42 -$200 $2300 $3500 $2400 -$200 Forecast rain.1 Forecast cloudy.3 Forecast sunny.6 Schedule game Cancel game Use weather report Do not use weather report
43 $2770 $2400 Use weather report Do not use weather report
44 The post-decision state 2008 Warren B. Powell Slide 44
45 The post-decision state Managing blood inventories over time Week 0 Week 1 Week 2 Week 3 S 0 x 0 Rˆ, Dˆ 1 1 S 1 x 1 Rˆ, Dˆ 2 2 S 2 x 2 S x 2 Rˆ, Dˆ 3 3 x S S 3 3 x 3 t=0 t=1 t=2 t= Warren B. Powell Slide 45
46 The post-decision state What happens if use Bellman s equation?» State variable is: The supply of each type of blood 8 blood types 6 ages The demand for each type of blood 8 blood types 56 dimensional state vector» Decision variable is how much of 8 blood types to supply to 8 demand types dimensional decision vector» Random information Blood donations by week (8 types) New demands for blood (8 types) 16-dimensional information vector 2008 Warren B. Powell Slide 46
47 Rain.8 -$2000 Clouds.2 $1000 Sun.0 $5000 Rain.8 -$200 Clouds.2 -$200 Sun.0 -$200 Rain.1 -$2000 Clouds.5 $1000 Sun.4 $5000 Rain.1 -$200 Clouds.5 -$200 Sun.4 -$200 Rain.1 -$2000 Clouds.2 $1000 Sun.7 $5000 Rain.1 -$200 Clouds.2 -$200 S n 7 $200 Schedule game S t S t + 1 Cancel game Forecast rain.1 Schedule game Forecast cloudy.3 Cancel game Forecast sunny.6 Use weather report Schedule game Cancel game Rain 2 -$2000 Do not weath ( { } ) V ( S ) = max C ( S, x ) + E V ( S ) S t t t t t t+ 1 t+ 1 t x X
48 Rain.8 -$2000 Clouds.2 $1000 Sun.0 $5000 Rain.8 -$200 Clouds.2 -$200 Sun.0 -$200 Rain.1 -$2000 Clouds.5 $1000 Sun.4 $5000 Rain.1 -$200 Clouds.5 -$200 Sun.4 -$200 Rain.1 -$2000 Clouds.2 $1000 Sun.7 $5000 Rain.1 -$200 Clouds.2 -$200 Sun.7 -$200 Schedule game Cancel game Schedule game Cancel game Schedule game Cancel game Rain.2 -$2000 Clouds.3 $1000 Sun.5 $5000 Rain.2 -$200 Clouds.3 -$200 Sun.5 -$200 - Decision nodes - Outcome nodes Forecast rain.1 Forecast cloudy.3 Forecast sunny.6 Schedule game Cancel game Use weather report Do not use weather report
49 The post-decision state New concept:» The pre-decision state variable: St = The information required to make a decision x t Same as a decision node in a decision tree.» The post-decision state variable: x S t = The state of what we know immediately after we make a decision. Same as an outcome node in a decision tree Warren B. Powell Slide 49
50 The post-decision state An inventory problem:» Our basic inventory equation: R t+1 =max{0,r t + x t ˆD t+1 } where R t = Inventory at time t. x t = Amount we ordered. ˆD t+1 = Demand in next time period.» Using pre- and post-decision states: Rt x = R t + x t Post-decision state R t+1 = max{0,rt x ˆD t+1 } Pre-decision state 2008 Warren B. Powell Slide 50
51 The post-decision state Pre-decision, state-action, and post-decision Pre-decision state State Action Post-decision state 9 3 states state-action pairs 9 3 states 2008 Warren B. Powell Slide 51
52 The post-decision state Pre-decision: resources and demands S = ( R, D ) t t t 2008 Warren B. Powell Slide 52
53 The post-decision state S = S ( S, x ) x M, x t t t 2008 Warren B. Powell Slide 53
54 The post-decision state x S t S = S ( S, W ) MW, x t+ 1 t t+ 1 W = ( Rˆ, Dˆ ) t+ 1 t+ 1 t Warren B. Powell Slide 54
55 The post-decision state S t Warren B. Powell Slide 55
56 The post-decision state Classical form of Bellman s equation: { } ( ) V ( S ) = max C ( S, x ) + E V ( S ) S t t t t t t+ 1 t+ 1 t x X Bellman s equations around pre- and post-decision states:» Optimization problem (making the decision): ( x( M, x )) V ( S ) = max C ( S, x ) + V S ( S, x ) t t x t t t t t t t Note: this problem is deterministic!» Simulation problem (the effect of exogenous information): {, } V ( S ) = E V ( S ( S, W )) S x x M W x x t t t t t t 2008 Warren B. Powell Slide 56
57 The post-decision state Challenges» For most practical problems, we are not going to be x x able to compute V ( S ). t ( x x ) V ( S ) = max C ( S, x ) + V ( S ) t t x t t t t t» Concept: replace it with an approximation V ( x t St ) and solve» So now we face: What should the approximation look like? How do we estimate it? t ( x ) V ( S ) = max C ( S, x ) + V ( S ) t t x t t t t t 2008 Warren B. Powell Slide 57
58 The post-decision state Value function approximations:» Linear (in the resource state): x V ( R ) = v R x t t ta ta a A» Piecewise linear, separable: x x Vt ( Rt ) = Vta ( Rta ) a A» Indexed PWL separable: ( x ) x Vt( Rt ) = Vta Rta ( featurest) a A 2008 Warren B. Powell Slide 58
59 The post-decision state Value function approximations:» Ridge regression (Klabjan and Adelman)» Benders cuts ( ) x V ( R ) = V R R = θ R t t tf tf tf fa ta f F a A f V ( R ) t t x 1 x Warren B. Powell Slide 59
60 The post-decision state Comparison to other methods:» Classical MDP (value iteration) ( n 1 γ ) t+ 1 n V ( S) = max C( S, x) + EV ( S ) x» Classical ADP (pre-decision state): n n n n 1 vˆ t = max x Ct( St, xt) + p( s' St, xt) Vt+ 1 s' s' V ( S ) = (1 α ) V ( S ) + α vˆ n n n 1 n n t t n 1 t t n 1 t, 1» Our method (update V x n around post-decision state): (, 1, ) n n x n M x n vˆ = max C ( S, x ) + V ( S ( S, x )) t x t t t t t t V ( S ) = (1 α ) V ( S ) + α vˆ n x, n n 1 x, n n t 1 t 1 n 1 t 1 t 1 n 1 t t ( ) Expectation vˆ updates V ( S ) t t t vˆ updates V ( S ) x t t 1 t Warren B. Powell Slide 60
61 The post-decision state n Step 1: Start with a pre-decision state S t Step 2: Solve the deterministic optimization using an approximate value function: ( 1,, ))) n n n M x n vˆ = max C ( S, x ) + V ( S ( S x t x t t t t t t n x t to obtain. Step 3: Update the value function approximation n x, n n 1 x, n n t 1( t 1) = (1 αn 1) t 1 ( t 1) + αn 1ˆ t V S V S v n Step 4: Obtain Monte Carlo sample of Wt ( ω ) and compute the next pre-decision state: n M n n n St+ 1 = S ( St, xt, Wt+ 1( ω )) Step 5: Return to step 1. Deterministic optimization Recursive statistics Simulation 2008 Warren B. Powell Slide 61
62 Approximate dynamic programming Optimization Optimization Strengths Produces optimal decisions. Mature technology Weaknesses Cannot handle uncertainty. Cannot handle high levels of complexity Simulation Simulation Strengths Extremely flexible High level of detail Easily handles uncertainty Weaknesses Models decisions using user-specified rules. Low solution quality. Cplex Approximate dynamic programming combines simulation and optimization in a rigorous yet flexible framework.
63 Outline A resource allocation model An introduction to ADP ADP and the post-decision state variable A blood management example Hierarchical aggregation Stepsizes How well does it work? Some applications» Transportation» Energy
64 Blood management Managing blood inventories 2008 Warren B. Powell Slide 64
65 Blood management Managing blood inventories over time Week 0 Week 1 Week 2 Week 3 S 0 x 0 S x 0 Rˆ, Dˆ 1 1 S 1 x 1 S x 1 Rˆ, Dˆ 2 2 S 2 x 2 S x 2 Rˆ, Dˆ 3 3 x S S 3 3 x 3 t=0 t=1 t=2 t= Warren B. Powell Slide 65
66 S t = ( R, ˆ ) t Dt AB+ D ˆt, AB+ x R t AB+,0 R t,( AB+,0) AB+,0 AB- D ˆt, AB AB+,1 R t,( AB+,1) R t,( AB+,2) AB+,1 AB+,2 A+ A- D ˆt, A+ D ˆt, AB+ AB+,2 AB+,3 R R R t,( O,0) t,( O,1) t,( O,2) O-,0 O-,1 O-,2 B+ B- D ˆt, AB+ D ˆt, AB+ D ˆt, AB+ O-,0 O-,1 O-,2 O+ O-,3 O- D ˆt, AB+ Satisfy a demand Hold
67 R t x R t R t + 1 R t,( AB+,0) AB+,0 AB+,0 R ˆt+ 1, AB+ AB+,0 R t,( AB+,1) AB+,1 AB+,1 AB+,1 R t,( AB+,2) AB+,2 AB+,2 AB+,2 AB+,3 AB+,3 R t,( O,0) O-,0 R t,( O,1) O-,1 O-,0 R ˆt+ 1, O O-,0 R t,( O,2) O-,2 O-,1 O-,1 O-,2 O-,2 O-,3 O-,3 Dˆt
68 R t x R t R t,( AB+,0) R t,( AB+,1) R t,( AB+,2) R R R t,( O,0) t,( O,1) t,( O,2) AB+,0 AB+,1 AB+,2 O-,0 O-,1 O-,2 AB+,0 AB+,1 AB+,2 AB+,3 O-,0 O-,1 O-,2 O-,3 Dˆt
69 R t x R t ( ) F R t R t,( AB+,0) R t,( AB+,1) R t,( AB+,2) R R R t,( O,0) t,( O,1) t,( O,2) AB+,0 AB+,1 AB+,2 O-,0 O-,1 O-,2 AB+,0 AB+,1 AB+,2 AB+,3 O-,0 O-,1 O-,2 Solve this as a linear program. O-,3 Dˆt
70 Duals R t x R t ( ) F R t ˆt ν,( AB +,0) AB+,0 AB+,0 ˆt ν,( AB +,1) AB+,1 AB+,1 ˆt ν,( AB +,2) AB+,2 AB+,2 AB+,3 ν ˆt,( AB,0) O-,0 ν ˆt,( AB,1) O-,1 O-,0 ν ˆt,( AB,2) O-,2 O-,1 O-,2 Dual variables give value additional unit of blood.. O-,3 Dˆt
71 Updating the value function approximation Estimate the gradient at n R t ˆn ν t,( AB+,2) ( ) F R t n R t,( AB+,2) 2008 Warren B. Powell Slide 71
72 Updating the value function approximation Update the value function at xn, Rt 1 V ( R ) n 1 x t 1 t 1 ˆn ν t,( AB+,2) ( ) F R t xn, Rt 1 n R t,( AB+,2) 2008 Warren B. Powell Slide 72
73 Updating the value function approximation Update the value function at xn, Rt 1 V ( R ) n 1 x t 1 t 1 ˆn ν t,( AB+,2) xn, Rt Warren B. Powell Slide 73
74 Updating the value function approximation Update the value function at xn, Rt 1 V ( R ) n 1 x t 1 t 1 V ( R ) n x t 1 t 1 xn, Rt Warren B. Powell Slide 74
75 Iterative learning t 2008 Warren B. Powell Slide 75
76 Iterative learning 2008 Warren B. Powell Slide 76
77 Iterative learning 2008 Warren B. Powell Slide 77
78 Iterative learning 2008 Warren B. Powell Slide 78
79 Approximate dynamic programming With luck, the objective function will improve steadily Objective function Iteration
80 Approximate dynamic programming but performance can be jagged. Chart Title Objective function Iterations
81 Outline A resource allocation model An introduction to ADP ADP and the post-decision state variable A blood management example Hierarchical aggregation Stepsizes How well does it work? Some applications» Transportation» Energy
82 Hierarchical aggregation Attribute vectors: a = Asset class Time invested Blood type Age Frozen? Location ETA Bus. segment Single/team Domicile Drive hours Days from home Location ETA A/C type Fuel level Home shop Crew Eqpt1 Eqpt100
83 Aggregation Estimating value functions $2050 = (1 0.10) $ (0.10) $2500 Location Location Location Fleet v Fleet α v Fleet αvˆ Domicile = + Domicile Domicile DOThrs DaysFromHome n n 1 ( ) (1 ) ( ) ( ) Value function Approximation may have fewer attributes than driver. Drivers may have very detailed attributes
84 Hierarchical aggregation Estimating value functions» Most disaggregate level Location Location Location Fleet v Fleet α v Fleet αvˆ Domicile = + Domicile Domicile DOThrs DaysFromHome n n 1 ( ) (1 ) ( ) ( )
85 Hierarchical aggregation Estimating value functions» Middle level of aggregation Location Fleet Location Location v α v αvˆ Domicile Fleet = + Fleet DOThrs DaysFromHome n n 1 ( ) (1 ) ( ) ( )
86 Hierarchical aggregation Estimating value functions» Most aggregate level Location Fleet n n 1 v ([ Location] ) = (1 α) v ([ Location] ) + αvˆ ( Domicile ) DOThrs DaysFromHome
87 Hierarchical aggregation State-dependent weighted aggregation:» Now we have a linear regression for each attribute v = w v ( g) ( g) a a a g» We cannot solve hundreds of thousands of linear regressions. Instead use ( ( ) ( ) 2 ) β 1 w Var v + w = 1 ( g) ( g) ( g) ( g) a a a a g Estimate of variance Estimate of bias
88 Hierarchical aggregation Objective function Aggregate Disaggregate Iterations
89 Hierarchical aggregation Weighted Combination Objective function Aggregate Disaggregate Iterations
90 Outline A resource allocation model An introduction to ADP ADP and the post-decision state variable A blood management example Hierarchical aggregation Stepsizes How well does it work? Some applications» Transportation» Energy
91 Stepsizes Stepsizes:» Fundamental to ADP is an updating equation that looks like: n x n 1 1( 1) (1 1) x n t t = n t 1 ( t 1) + n 1ˆ t V S α V S α v Updated estimate Old estimate New observation The stepsize Learning rate Smoothing factor
92 Stepsizes High noise data: Best stepsize α = 1/ n n Low noise, changing signal αn
93 Stepsizes Theorem 1 (general knowledge):» If data is stationary, then 1/n is the best possible stepsize. Theorem 2 (e.g. Tsitsiklis 1994)» For general problems, 1/n is provably convergent Theorem 3 (Frazier and Powell):» 1/n works so badly it should (almost) never be used.
94 Stepsizes Lower bound on the value function after n iterations:
95 Stepsizes Bias-adjusted Kalman filter Estimate of the variance α n where: = 1 ( n 1) 2 1 ( n + λ σ + β ) 2 2 n λ = 1 α λ + α ( ) n 1 ( ) n 2 σ 2 As increases, stepsize decreases toward 1/ σ Estimate of the bias n As β increases, stepsize increases toward 1. n 2 Noise n Bias
96 Stepsizes The bias-adjusted Kalman filter Observed values BAKF stepsize rule /n stepsize rule
97 Stepsizes The bias-adjusted Kalman filter Observed values BAKF stepsize rule
98 Stepsizes BAKF formula Our newest formula» where α n = α n = ( ( ) ) n 1 2 n 2 λ σ + β ( ( ) 2 ) n n σ λ σ β ( 1 2 ( ( ) ) ) n n λ σ + γ δ c ( 2 c ) ( n ) n 1 ( n + γ λ σ + λ σ + 1 ( 1 γ) δ ) » First stepsize formula designed specifically for dynamic n programming (note: no need to estimate bias β )» Captures the natural growth of a value function through iterative learning.
99 Stepsizes I recommend:» Start with a constant stepsize Vary it, get a sense of what works best» Next try a deterministic stepsize such as a/(a+n) Choose a so that it declines to roughly the best deterministic stepsize at an iteration comparable to where your constant stepsize seems to stabilize Might be 50 iterations Might be 5000» Try an (optimal) adaptive stepsize rule Can work very well if there is not too much noise Adaptive rules work well when there is a need to keep the stepsize from declining too quickly (but you do not know how quickly)
100 Outline A resource allocation model An introduction to ADP ADP and the post-decision state variable A blood management example Hierarchical aggregation Stepsizes How well does it work? Some applications» Transportation» Energy
101 Two-stage optimization Piecewise linear, separable value function approximations: Piecewise linear, separable: Vt( Rt) = Vtl( Rtl) l L
102 Two-stage optimization Benders decomposition: VR ( ) t t x 1 x 0 Multidimensional cuts produce provably convergent, nonseparable value function approximation.
103 The competition Exact solutions using Benders: L-Shaped decomposition (Van Slyke and Wets) V 0 x 0 Stochastic decomposition (Higle and Sen) V 0 x 0 CUPPS (Chen and Powell) V 0 x 0
104 The competition 45 Percent over Percent optimal from optimal after iterations iterations Increasing problem size Percent error locations 25 locations 50 locations 100 locations SD L-shaped CUPPS SPAR Benders Separable
105 The competition Percent over Percent optimal from optimal after iterations iterations Increasing problem size 35 Percent error locations 25 locations 50 locations 100 locations SD L-shaped CUPPS SPAR Benders Separable
106 Multistage problems Deterministic, single commodity flow
107 Multistage problems Deterministic, single commodity flow» ADP solution as a percent of the optimal solution (from Cplex)
108 Multistage problems Stochastic, single commodity flow = optimal posterior bound 0 Rolling horizon ADP Percent of posterior optimal (20,100) (20,200) (20,400) (40,100) (40,200) (40,400) (80,100) (80,200) (80,400)
109 Multistage problems Deterministic, (integer) multicommodity flow
110 Multistage problems Deterministic, (integer) multicommodity flow = optimal continuous relaxation Percent of optimal Base T_30 T_90 I_10 I_40 C_II C_III C_IV R_1 R_5 R_100 R_400 C_1 C_8
111 Multistage problems Stochastic, (integer) multicommodity flow Rolling horizon ADP Percent of posterior optimal 0 Base I_10 I_40 C_II C_III C_IV R_1 R_5 R_100 R_400 C_1 C_8
112 Outline A resource allocation model An introduction to ADP ADP and the post-decision state variable A blood management example Hierarchical aggregation Stepsizes How well does it work? Some applications» Transportation» Energy
113 Schneider National 2008 Warren B. Powell Slide 113
114 2008 Warren B. Powell Slide 114
115 Schneider National 1400 Revenue per WU US_SOLO US_IC US_TEAM Capacity category Revenue per WU U tiliz atio n Historical maximum Simulation Historical minimum Calibrated Historical min model and max Utilization Historical maximum Simulation Historical minimum US_SOLO US_IC US_TEAM Capacity category 2008 Warren B. Powell Slide 115
116 Princeton team: Warren Powell Belgacem Bouzaiene-Ayari NS team: Clark Cheng Ricardo Fiorillo Junxia Chang Sourav Das
117 2008 Warren B. Powell Slide 117
118 2008 Warren B. Powell Slide 118
119 2008 Warren B. Powell Slide 119
120 2008 Warren B. Powell Slide 120
121 Convergence Train coverage
122 Model calibration Train delay curve October 2007 Delay hours/day Fleet size
123 Model calibration Train delay curve March 2008 Delay hours/day Fleet size
124 Outline A resource allocation model An introduction to ADP ADP and the post-decision state variable A blood management example Hierarchical aggregation Stepsizes How well does it work? Some applications» Transportation» Energy
125 Energy resource modeling 2008 Warren B. Powell Slide 125
126 Energy resource modeling Hourly model» Decisions at time t impact t+1 through the amount of water held in the reservoir. Hour t Hour t Warren B. Powell Slide 126
127 Energy resource modeling Hourly model» Decisions at time t impact t+1 through the amount of water held in the reservoir. Hour t Value of holding water in the reservoir for future time periods Warren B. Powell Slide 127
128 Energy resource modeling 2008 Warren B. Powell Slide 128
129 Energy resource modeling 2008 Hour Warren B. Powell Slide 129
130 Energy resource modeling 2008 Hour Warren B. Powell Slide 130
131 Annual energy model Optimal Reservoir level Precipitation HydroUsage Remaining Wastage Demand Coal Wind Demand Warren B. Powell Slide 131
132 Annual energy model Approximate dynamic programming Reservoir level Precipitation Usage Remaining Wastage Demand Coal Wind Demand Warren B. Powell Slide 132
133 Annual energy model Optimal Reservoir level Precipitation HydroUsage Remaining Wastage Demand Coal Wind Demand Warren B. Powell Slide 133
134 Annual energy model Approximate dynamic programming Reservoir level Precipitation Usage Remaining Wastage Demand Coal Wind Demand Warren B. Powell Slide 134
135 Annual energy model 9000 ADP vs optimal reservoir levels for stochastic rainfall Reservoir level ADP at last iteration Optimal for individual scenarios Time period 2008 Warren B. Powell Slide 135
136 2008 Warren B. Powell Slide 136
The Dynamic Energy Resource Model
The Dynamic Energy Resource Model Group Peer Review Committee Lawrence Livermore National Laboratories July, 2007 Warren Powell Alan Lamont Jeffrey Stewart Abraham George 2007 Warren B. Powell, Princeton
More informationApproximate Dynamic Programming in Rail Operations
Approximate Dynamic Programming in Rail Operations June, 2007 Tristan VI Phuket Island, Thailand Warren Powell Belgacem Bouzaiene-Ayari CASTLE Laboratory Princeton University http://www.castlelab.princeton.edu
More informationThe Optimizing-Simulator: Merging Optimization and Simulation Using Approximate Dynamic Programming
The Optimizing-Simulator: Merging Optimization and Simulation Using Approximate Dynamic Programming Winter Simulation Conference December 5, 2005 Warren Powell CASTLE Laboratory Princeton University http://www.castlelab.princeton.edu
More informationApproximate Dynamic Programming for High Dimensional Resource Allocation Problems
Approximate Dynamic Programming for High Dimensional Resource Allocation Problems Warren B. Powell Abraham George Belgacem Bouzaiene-Ayari Hugo P. Simao Department of Operations Research and Financial
More informationOn-line supplement to: SMART: A Stochastic Multiscale Model for the Analysis of Energy Resources, Technology
On-line supplement to: SMART: A Stochastic Multiscale Model for e Analysis of Energy Resources, Technology and Policy This online supplement provides a more detailed version of e model, followed by derivations
More informationWhat you should know about approximate dynamic programming
What you should know about approximate dynamic programming Warren B. Powell Department of Operations Research and Financial Engineering Princeton University, Princeton, NJ 08544 December 16, 2008 Abstract
More informationTutorial: Stochastic Optimization in Energy
Tutorial: Stochastic Optimization in Energy FERC, Washington, D.C. August 6, 2014 Warren B. Powell CASTLE Labs Princeton University http://www.castlelab.princeton.edu Warren B. Powell, 2014 Slide 1 Mission
More informationApproximate Dynamic Programming in Transportation and Logistics: A Unified Framework
Approximate Dynamic Programming in Transportation and Logistics: A Unified Framework Warren B. Powell, Hugo P. Simao and Belgacem Bouzaiene-Ayari Department of Operations Research and Financial Engineering
More informationApproximate Dynamic Programming in Transportation and Logistics: A Unified Framework
Approximate Dynamic Programming in Transportation and Logistics: A Unified Framework Warren B. Powell, Hugo P. Simao and Belgacem Bouzaiene-Ayari Department of Operations Research and Financial Engineering
More informationPerhaps one of the most widely used and poorly understood terms in dynamic programming is policy. A simple definition of a policy is:
CHAPTER 6 POLICIES Perhaps one of the most widely used and poorly understood terms in dynamic programming is policy. A simple definition of a policy is: Definition 6.0.1 A policy is a rule (or function)
More informationBayesian Active Learning With Basis Functions
Bayesian Active Learning With Basis Functions Ilya O. Ryzhov Warren B. Powell Operations Research and Financial Engineering Princeton University Princeton, NJ 08544, USA IEEE ADPRL April 13, 2011 1 / 29
More informationChapter 7 Forecasting Demand
Chapter 7 Forecasting Demand Aims of the Chapter After reading this chapter you should be able to do the following: discuss the role of forecasting in inventory management; review different approaches
More informationApproximate Dynamic Programming for High-Dimensional Resource Allocation Problems
Approximate Dynamic Programming for High-Dimensional Resource Allocation Problems Warren B. Powell Department of Operations Research and Financial Engineering Princeton University Benjamin Van Roy Departments
More informationIntegrated Electricity Demand and Price Forecasting
Integrated Electricity Demand and Price Forecasting Create and Evaluate Forecasting Models The many interrelated factors which influence demand for electricity cannot be directly modeled by closed-form
More informationWe present a general optimization framework for locomotive models that captures different levels of detail,
Articles in Advance, pp. 1 24 ISSN 0041-1655 (print) ISSN 1526-5447 (online) http://dx.doi.org/10.1287/trsc.2014.0536 2014 INFORMS From Single Commodity to Multiattribute Models for Locomotive Optimization:
More informationMODELING DYNAMIC PROGRAMS
CHAPTER 5 MODELING DYNAMIC PROGRAMS Perhaps one of the most important skills to develop in approximate dynamic programming is the ability to write down a model of the problem. Everyone who wants to solve
More informationHigh-dimensional Problems in Finance and Economics. Thomas M. Mertens
High-dimensional Problems in Finance and Economics Thomas M. Mertens NYU Stern Risk Economics Lab April 17, 2012 1 / 78 Motivation Many problems in finance and economics are high dimensional. Dynamic Optimization:
More informationDRIVING ROI. The Business Case for Advanced Weather Solutions for the Energy Market
DRIVING ROI The Business Case for Advanced Weather Solutions for the Energy Market Table of Contents Energy Trading Challenges 3 Skill 4 Speed 5 Precision 6 Key ROI Findings 7 About The Weather Company
More informationSHORT TERM LOAD FORECASTING
Indian Institute of Technology Kanpur (IITK) and Indian Energy Exchange (IEX) are delighted to announce Training Program on "Power Procurement Strategy and Power Exchanges" 28-30 July, 2014 SHORT TERM
More informationProper Security Criteria Determination in a Power System with High Penetration of Renewable Resources
Proper Security Criteria Determination in a Power System with High Penetration of Renewable Resources Mojgan Hedayati, Kory Hedman, and Junshan Zhang School of Electrical, Computer, and Energy Engineering
More informationIdentifying Aggregate Liquidity Shocks with Monetary Policy Shocks: An Application using UK Data
Identifying Aggregate Liquidity Shocks with Monetary Policy Shocks: An Application using UK Data Michael Ellington and Costas Milas Financial Services, Liquidity and Economic Activity Bank of England May
More informationR O B U S T E N E R G Y M AN AG E M E N T S Y S T E M F O R I S O L AT E D M I C R O G R I D S
ROBUST ENERGY MANAGEMENT SYSTEM FOR ISOLATED MICROGRIDS Jose Daniel La r a Claudio Cañizares Ka nka r Bhattacharya D e p a r t m e n t o f E l e c t r i c a l a n d C o m p u t e r E n g i n e e r i n
More informationSome Fixed-Point Results for the Dynamic Assignment Problem
Some Fixed-Point Results for the Dynamic Assignment Problem Michael Z. Spivey Department of Mathematics and Computer Science Samford University, Birmingham, AL 35229 Warren B. Powell Department of Operations
More informationStochastic Dual Dynamic Programming with CVaR Risk Constraints Applied to Hydrothermal Scheduling. ICSP 2013 Bergamo, July 8-12, 2012
Stochastic Dual Dynamic Programming with CVaR Risk Constraints Applied to Hydrothermal Scheduling Luiz Carlos da Costa Junior Mario V. F. Pereira Sérgio Granville Nora Campodónico Marcia Helena Costa Fampa
More informationAPPROXIMATE DYNAMIC PROGRAMMING II: ALGORITHMS
APPROXIMATE DYNAMIC PROGRAMMING II: ALGORITHMS INTRODUCTION WARREN B. POWELL Department of Operations Research and Financial Engineering, Princeton University, Princeton, New Jersey Approximate dynamic
More informationMulti-Area Stochastic Unit Commitment for High Wind Penetration
Multi-Area Stochastic Unit Commitment for High Wind Penetration Workshop on Optimization in an Uncertain Environment Anthony Papavasiliou, UC Berkeley Shmuel S. Oren, UC Berkeley March 25th, 2011 Outline
More informationThere has been considerable recent interest in the dynamic vehicle routing problem, but the complexities of
TRANSPORTATION SCIENCE Vol. 38, No. 4, November 2004, pp. 399 419 issn 0041-1655 eissn 1526-5447 04 3804 0399 informs doi 10.1287/trsc.1030.0073 2004 INFORMS The Dynamic Assignment Problem Michael Z. Spivey,
More informationIMA Preprint Series # 2373
BATTERY STORAGE CONTROL FOR STEADYING RENEWABLE POWER GENERATION By Shengyuan (Mike) Chen, Emilie Danna, Kory Hedman, Mark Iwen, Wei Kang, John Marriott, Anders Nottrott, George Yin, and Qing Zhang IMA
More informationDynamic Models for Freight Transportation
Dynamic Models for Freight Transportation Warren B. Powell Belgacem Bouzaiene-Ayari Hugo P. Simao Department of Operations Research and Financial Engineering Princeton University August 13, 2003 Abstract
More informationOptimization Tools in an Uncertain Environment
Optimization Tools in an Uncertain Environment Michael C. Ferris University of Wisconsin, Madison Uncertainty Workshop, Chicago: July 21, 2008 Michael Ferris (University of Wisconsin) Stochastic optimization
More informationOPTIMIZATION UNDER UNCERTAINTY A unified framework (Draft)
OPTIMIZATION UNDER UNCERTAINTY A unified framework (Draft) Warren B. Powell September 15, 2016 A JOHN WILEY & SONS, INC., PUBLICATION Copyright c 2016 by John Wiley & Sons, Inc. All rights reserved. Published
More informationIncorporating Demand Response with Load Shifting into Stochastic Unit Commitment
Incorporating Demand Response with Load Shifting into Stochastic Unit Commitment Frank Schneider Supply Chain Management & Management Science, University of Cologne, Cologne, Germany, frank.schneider@uni-koeln.de
More informationSYSTEM OPERATIONS. Dr. Frank A. Monforte
SYSTEM OPERATIONS FORECASTING Dr. Frank A. Monforte Itron s Forecasting Brown Bag Seminar September 13, 2011 PLEASE REMEMBER» In order to help this session run smoothly, your phones are muted.» To make
More informationWEATHER NORMALIZATION METHODS AND ISSUES. Stuart McMenamin Mark Quan David Simons
WEATHER NORMALIZATION METHODS AND ISSUES Stuart McMenamin Mark Quan David Simons Itron Forecasting Brown Bag September 17, 2013 Please Remember» Phones are Muted: In order to help this session run smoothly,
More informationOptimal Demand Response
Optimal Demand Response Libin Jiang Steven Low Computing + Math Sciences Electrical Engineering Caltech Oct 2011 Outline Caltech smart grid research Optimal demand response Global trends 1 Exploding renewables
More informationHEALTHCARE. 5 Components of Accurate Rolling Forecasts in Healthcare
HEALTHCARE 5 Components of Accurate Rolling Forecasts in Healthcare Introduction Rolling Forecasts Improve Accuracy and Optimize Decisions Accurate budgeting, planning, and forecasting are essential for
More informationA Progressive Hedging Approach to Multistage Stochastic Generation and Transmission Investment Planning
A Progressive Hedging Approach to Multistage Stochastic Generation and Transmission Investment Planning Yixian Liu Ramteen Sioshansi Integrated Systems Engineering Department The Ohio State University
More informationMulti-Area Stochastic Unit Commitment for High Wind Penetration in a Transmission Constrained Network
Multi-Area Stochastic Unit Commitment for High Wind Penetration in a Transmission Constrained Network Anthony Papavasiliou Center for Operations Research and Econometrics Université catholique de Louvain,
More informationTraffic Flow Impact (TFI)
Traffic Flow Impact (TFI) Michael P. Matthews 27 October 2015 Sponsor: Yong Li, FAA ATO AJV-73 Technical Analysis & Operational Requirements Distribution Statement A. Approved for public release; distribution
More informationIntroduction to Reinforcement Learning. CMPT 882 Mar. 18
Introduction to Reinforcement Learning CMPT 882 Mar. 18 Outline for the week Basic ideas in RL Value functions and value iteration Policy evaluation and policy improvement Model-free RL Monte-Carlo and
More informationAPPROXIMATE dynamic programming (ADP) has
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 60, NO. 3, MARCH 2015 743 A New Optimal Stepsize for Approximate Dynamic Programming Ilya O. Ryzhov, Member, IEEE, Peter I. Frazier, and Warren B. Powell, Member,
More information1 The Basic RBC Model
IHS 2016, Macroeconomics III Michael Reiter Ch. 1: Notes on RBC Model 1 1 The Basic RBC Model 1.1 Description of Model Variables y z k L c I w r output level of technology (exogenous) capital at end of
More informationDynamic Models for Freight Transportation
Dynamic Models for Freight Transportation Warren B. Powell Belgacem Bouzaiene-Ayari Hugo P. Simao Department of Operations Research and Financial Engineering Princeton University September 14, 2005 Abstract
More informationProject Discussions: SNL/ADMM, MDP/Randomization, Quadratic Regularization, and Online Linear Programming
Project Discussions: SNL/ADMM, MDP/Randomization, Quadratic Regularization, and Online Linear Programming Yinyu Ye Department of Management Science and Engineering Stanford University Stanford, CA 94305,
More informationAn Approximate Dynamic Programming Algorithm for the Allocation of High-Voltage Transformer Spares in the Electric Grid
An Approximate Dynamic Programming Algorithm for the Allocation of High-Voltage Transformer Spares in the Electric Grid Johannes Enders Department of Operations Research and Financial Engineering Princeton
More informationThe Knowledge Gradient for Sequential Decision Making with Stochastic Binary Feedbacks
The Knowledge Gradient for Sequential Decision Making with Stochastic Binary Feedbacks Yingfei Wang, Chu Wang and Warren B. Powell Princeton University Yingfei Wang Optimal Learning Methods June 22, 2016
More informationFINAL REPORT EVALUATION REVIEW OF TVA'S LOAD FORECAST RISK
Memorandum from the Office of the Inspector General Robert Irvin, WT 9C-K FINAL REPORT EVALUATION 2012-14507 REVIEW OF TVA'S LOAD FORECAST RISK As part of a series of reviews to evaluate the Tennessee
More informationReal Business Cycle Model (RBC)
Real Business Cycle Model (RBC) Seyed Ali Madanizadeh November 2013 RBC Model Lucas 1980: One of the functions of theoretical economics is to provide fully articulated, artificial economic systems that
More informationAESO Load Forecast Application for Demand Side Participation. Eligibility Working Group September 26, 2017
AESO Load Forecast Application for Demand Side Participation Eligibility Working Group September 26, 2017 Load forecasting for the Capacity Market Demand Considerations Provide further information on forecasting
More informationPlanning a 100 percent renewable electricity system
Planning a 100 percent renewable electricity system Andy Philpott Electric Power Optimization Centre University of Auckland www.epoc.org.nz (Joint work with Michael Ferris) INI Open for Business Meeting,
More informationWhy do we care? Measurements. Handling uncertainty over time: predicting, estimating, recognizing, learning. Dealing with time
Handling uncertainty over time: predicting, estimating, recognizing, learning Chris Atkeson 2004 Why do we care? Speech recognition makes use of dependence of words and phonemes across time. Knowing where
More informationPANEL DISCUSSION: THE ROLE OF POTENTIAL OUTPUT IN POLICYMAKING
PANEL DISCUSSION: THE ROLE OF POTENTIAL OUTPUT IN POLICYMAKING James Bullard* Federal Reserve Bank of St. Louis 33rd Annual Economic Policy Conference St. Louis, MO October 17, 2008 Views expressed are
More informationAgenda today. Introduction to prescriptive modeling. Linear optimization models through three examples: Beyond linear optimization
Agenda today Introduction to prescriptive modeling Linear optimization models through three examples: 1 Production and inventory optimization 2 Distribution system design 3 Stochastic optimization Beyond
More informationUsing Static Flow Patterns in Time-Staged Resource Allocation Problems
Using Static Flow Patterns in Time-Staged Resource Allocation Problems Arun Marar Warren B. Powell Hugo P. Simão Department of Operations Research and Financial Engineering, Princeton University, Princeton,
More informationWE propose an approximate dynamic programming
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL 58, NO 12, DECEMBER 2013 2995 An Optimal Approximate Dynamic Programming Algorithm for Concave, Scalar Storage Problems With Vector-Valued Controls Juliana Nascimento
More informationBattery Storage Control for Steadying Renewable Power Generation
Battery Storage Control for Steadying Renewable Power Generation Shengyuan (Mike) Chen, Emilie Danna, Kory Hedman, Mark Iwen, Wei Kang, John Marriott, Anders Nottrott, George Yin, Qing Zhang March 13,
More informationApproximate dynamic programming in transportation and logistics: a unified framework
EURO J Transp Logist (2012) 1:237 284 DOI 10.1007/s13676-012-0015-8 TUTORIAL Approximate dynamic programming in transportation and logistics: a unified framework Warren B. Powell Hugo P. Simao Belgacem
More informationWarwick Business School Forecasting System. Summary. Ana Galvao, Anthony Garratt and James Mitchell November, 2014
Warwick Business School Forecasting System Summary Ana Galvao, Anthony Garratt and James Mitchell November, 21 The main objective of the Warwick Business School Forecasting System is to provide competitive
More informationInformation Choice in Macroeconomics and Finance.
Information Choice in Macroeconomics and Finance. Laura Veldkamp New York University, Stern School of Business, CEPR and NBER Spring 2009 1 Veldkamp What information consumes is rather obvious: It consumes
More informationTRANSMISSION BUSINESS LOAD FORECAST AND METHODOLOGY
Filed: September, 00 EB-00-00 Tab Schedule Page of 0 TRANSMISSION BUSINESS LOAD FORECAST AND METHODOLOGY.0 INTRODUCTION 0 This exhibit discusses Hydro One Networks transmission system load forecast and
More information2013 WEATHER NORMALIZATION SURVEY. Industry Practices
2013 WEATHER NORMALIZATION SURVEY Industry Practices FORECASTING SPECIALIZATION Weather Operational Forecasting Short-term Forecasting to support: System Operations and Energy Trading Hourly Load Financial/Budget
More informationStochastic Optimization
Chapter 27 Page 1 Stochastic Optimization Operations research has been particularly successful in two areas of decision analysis: (i) optimization of problems involving many variables when the outcome
More informationMIDAS: A Mixed Integer Dynamic Approximation Scheme
MIDAS: A Mixed Integer Dynamic Approximation Scheme Andy Philpott, Faisal Wahid, Frédéric Bonnans May 7, 2016 Abstract Mixed Integer Dynamic Approximation Scheme (MIDAS) is a new sampling-based algorithm
More informationChapter 6. Maximum Likelihood Analysis of Dynamic Stochastic General Equilibrium (DSGE) Models
Chapter 6. Maximum Likelihood Analysis of Dynamic Stochastic General Equilibrium (DSGE) Models Fall 22 Contents Introduction 2. An illustrative example........................... 2.2 Discussion...................................
More informationTime Series and Forecasting
Time Series and Forecasting Introduction to Forecasting n What is forecasting? n Primary Function is to Predict the Future using (time series related or other) data we have in hand n Why are we interested?
More informationForecasting demand in the National Electricity Market. October 2017
Forecasting demand in the National Electricity Market October 2017 Agenda Trends in the National Electricity Market A review of AEMO s forecasting methods Long short-term memory (LSTM) neural networks
More informationHandout 1: Introduction to Dynamic Programming. 1 Dynamic Programming: Introduction and Examples
SEEM 3470: Dynamic Optimization and Applications 2013 14 Second Term Handout 1: Introduction to Dynamic Programming Instructor: Shiqian Ma January 6, 2014 Suggested Reading: Sections 1.1 1.5 of Chapter
More informationClearing the Jungle of Stochastic Optimization
Clearing the Jungle of Stochastic Optimization Warren B. Powell Department of Operations Research and Financial Engineering Princeton University Prepared for Informs TutORials, 2014. DRAFT April 7, 2014
More informationDynamic Risk Measures and Nonlinear Expectations with Markov Chain noise
Dynamic Risk Measures and Nonlinear Expectations with Markov Chain noise Robert J. Elliott 1 Samuel N. Cohen 2 1 Department of Commerce, University of South Australia 2 Mathematical Insitute, University
More information1 Bewley Economies with Aggregate Uncertainty
1 Bewley Economies with Aggregate Uncertainty Sofarwehaveassumedawayaggregatefluctuations (i.e., business cycles) in our description of the incomplete-markets economies with uninsurable idiosyncratic risk
More information2013 Weather Normalization Survey. Itron, Inc El Camino Real San Diego, CA
Itron, Inc. 11236 El Camino Real San Diego, CA 92130 2650 858 724 2620 March 2014 Weather normalization is the process of reconstructing historical energy consumption assuming that normal weather occurred
More informationCP:
Adeng Pustikaningsih, M.Si. Dosen Jurusan Pendidikan Akuntansi Fakultas Ekonomi Universitas Negeri Yogyakarta CP: 08 222 180 1695 Email : adengpustikaningsih@uny.ac.id Operations Management Forecasting
More informationBirgit Rudloff Operations Research and Financial Engineering, Princeton University
TIME CONSISTENT RISK AVERSE DYNAMIC DECISION MODELS: AN ECONOMIC INTERPRETATION Birgit Rudloff Operations Research and Financial Engineering, Princeton University brudloff@princeton.edu Alexandre Street
More informationUncertainty in energy system models
Uncertainty in energy system models Amy Wilson Durham University May 2015 Table of Contents 1 Model uncertainty 2 3 Example - generation investment 4 Conclusion Model uncertainty Contents 1 Model uncertainty
More informationWeather Risk Management
Weather Risk Management David Molyneux, FCAS Introduction Weather Risk - Revenue or profits that are sensitive to weather conditions Weather Derivatives - Financial Products that allow companies to manage
More informationOperations Report. Tag B. Short, Director South Region Operations. Entergy Regional State Committee (ERSC) February 14, 2018
Operations Report Tag B. Short, Director South Region Operations Entergy Regional State Committee (ERSC) February 14, 2018 1 Winter Operations Highlights South Region Max Gen Event Regional Dispatch Transfer
More informationOptimal Demand Response
Optimal Demand Response Libin Jiang Steven Low Computing + Math Sciences Electrical Engineering Caltech June 2011 Outline o Motivation o Demand response model o Some results Wind power over land (outside
More informationJOINT PRICING AND PRODUCTION PLANNING FOR FIXED PRICED MULTIPLE PRODUCTS WITH BACKORDERS. Lou Caccetta and Elham Mardaneh
JOURNAL OF INDUSTRIAL AND doi:10.3934/jimo.2010.6.123 MANAGEMENT OPTIMIZATION Volume 6, Number 1, February 2010 pp. 123 147 JOINT PRICING AND PRODUCTION PLANNING FOR FIXED PRICED MULTIPLE PRODUCTS WITH
More informationLOADS, CUSTOMERS AND REVENUE
EB-00-0 Exhibit K Tab Schedule Page of 0 0 LOADS, CUSTOMERS AND REVENUE The purpose of this evidence is to present the Company s load, customer and distribution revenue forecast for the test year. The
More informationFinancial Factors in Economic Fluctuations. Lawrence Christiano Roberto Motto Massimo Rostagno
Financial Factors in Economic Fluctuations Lawrence Christiano Roberto Motto Massimo Rostagno Background Much progress made on constructing and estimating models that fit quarterly data well (Smets-Wouters,
More informationBalancing and Control of a Freely-Swinging Pendulum Using a Model-Free Reinforcement Learning Algorithm
Balancing and Control of a Freely-Swinging Pendulum Using a Model-Free Reinforcement Learning Algorithm Michail G. Lagoudakis Department of Computer Science Duke University Durham, NC 2778 mgl@cs.duke.edu
More informationSMART GRID FORECASTING
SMART GRID FORECASTING AND FINANCIAL ANALYTICS Itron Forecasting Brown Bag December 11, 2012 PLEASE REMEMBER» Phones are Muted: In order to help this session run smoothly, your phones are muted.» Full
More informationWhen One Size No Longer Fits All Electric Load Forecasting with a Geographic Hierarchy WHITE PAPER
When One Size No Longer Fits All Electric Load Forecasting with a Geographic Hierarchy WHITE PAPER SAS White Paper Table of Contents Executive Summary.... 1 Electric Load Forecasting Challenges... 1 Selected
More informationLecture 1: March 7, 2018
Reinforcement Learning Spring Semester, 2017/8 Lecture 1: March 7, 2018 Lecturer: Yishay Mansour Scribe: ym DISCLAIMER: Based on Learning and Planning in Dynamical Systems by Shie Mannor c, all rights
More informationImpact on Agriculture
Weather Variability and the Impact on Agriculture InfoAg 2017 Copyright 2017, awhere. All Rights Reserved The Problem: The Earth s Atmosphere is a Heat Engine In transition 1 C warming of atmosphere Triples
More informationMathematical Optimization Models and Applications
Mathematical Optimization Models and Applications Yinyu Ye Department of Management Science and Engineering Stanford University Stanford, CA 94305, U.S.A. http://www.stanford.edu/ yyye Chapters 1, 2.1-2,
More informationAN INTERNATIONAL SOLAR IRRADIANCE DATA INGEST SYSTEM FOR FORECASTING SOLAR POWER AND AGRICULTURAL CROP YIELDS
AN INTERNATIONAL SOLAR IRRADIANCE DATA INGEST SYSTEM FOR FORECASTING SOLAR POWER AND AGRICULTURAL CROP YIELDS James Hall JHTech PO Box 877 Divide, CO 80814 Email: jameshall@jhtech.com Jeffrey Hall JHTech
More informationpeak half-hourly Tasmania
Forecasting long-term peak half-hourly electricity demand for Tasmania Dr Shu Fan B.S., M.S., Ph.D. Professor Rob J Hyndman B.Sc. (Hons), Ph.D., A.Stat. Business & Economic Forecasting Unit Report for
More informationStochastic and Adaptive Optimal Control
Stochastic and Adaptive Optimal Control Robert Stengel Optimal Control and Estimation, MAE 546 Princeton University, 2018! Nonlinear systems with random inputs and perfect measurements! Stochastic neighboring-optimal
More informationMiscellaneous. Regarding reading materials. Again, ask questions (if you have) and ask them earlier
Miscellaneous Regarding reading materials Reading materials will be provided as needed If no assigned reading, it means I think the material from class is sufficient Should be enough for you to do your
More informationWhy do we care? Examples. Bayes Rule. What room am I in? Handling uncertainty over time: predicting, estimating, recognizing, learning
Handling uncertainty over time: predicting, estimating, recognizing, learning Chris Atkeson 004 Why do we care? Speech recognition makes use of dependence of words and phonemes across time. Knowing where
More informationA stochastic integer programming approach to the optimal thermal and wind generator scheduling problem
A stochastic integer programming approach to the optimal thermal and wind generator scheduling problem Presented by Michael Chen York University Industrial Optimization Seminar Fields Institute for Research
More informationFinding the Value of Information About a State Variable in a Markov Decision Process 1
05/25/04 1 Finding the Value of Information About a State Variable in a Markov Decision Process 1 Gilvan C. Souza The Robert H. Smith School of usiness, The University of Maryland, College Park, MD, 20742
More informationBEFORE THE PUBLIC UTILITIES COMMISSION OF THE STATE OF COLORADO * * * *
Exhibit No. 1 BEFORE THE PUBLIC UTILITIES COMMISSION OF THE STATE OF COLORADO * * * * IN THE MATTER OF THE APPLICATION OF PUBLIC SERVICE COMPANY OF COLORADO FOR APPROVAL OF ITS 0 ELECTRIC RESOURCE PLAN
More informationDemand Forecasting Models
E 2017 PSE Integrated Resource Plan Demand Forecasting Models This appendix describes the econometric models used in creating the demand forecasts for PSE s 2017 IRP analysis. Contents 1. ELECTRIC BILLED
More informationChapter 8 - Forecasting
Chapter 8 - Forecasting Operations Management by R. Dan Reid & Nada R. Sanders 4th Edition Wiley 2010 Wiley 2010 1 Learning Objectives Identify Principles of Forecasting Explain the steps in the forecasting
More informationSpeculation and the Bond Market: An Empirical No-arbitrage Framework
Online Appendix to the paper Speculation and the Bond Market: An Empirical No-arbitrage Framework October 5, 2015 Part I: Maturity specific shocks in affine and equilibrium models This Appendix present
More informationReinforcement Learning
Reinforcement Learning March May, 2013 Schedule Update Introduction 03/13/2015 (10:15-12:15) Sala conferenze MDPs 03/18/2015 (10:15-12:15) Sala conferenze Solving MDPs 03/20/2015 (10:15-12:15) Aula Alpha
More informationREINFORCE Framework for Stochastic Policy Optimization and its use in Deep Learning
REINFORCE Framework for Stochastic Policy Optimization and its use in Deep Learning Ronen Tamari The Hebrew University of Jerusalem Advanced Seminar in Deep Learning (#67679) February 28, 2016 Ronen Tamari
More informationThe AIR Bushfire Model for Australia
The AIR Bushfire Model for Australia In February 2009, amid tripledigit temperatures and drought conditions, fires broke out just north of Melbourne, Australia. Propelled by high winds, as many as 400
More information