Synchronization of coupled oscillators is a game. Prashant G. Mehta 1
|
|
- Elijah Simpson
- 5 years ago
- Views:
Transcription
1 CSL COORDINATED SCIENCE LABORATORY Synchronization of coupled oscillators is a game Prashant G. Mehta 1 1 Coordinated Science Laboratory Department of Mechanical Science and Engineering University of Illinois at Urbana-Champaign GERAD seminar, April 20, 2010 Acknowledgment: AFOSR, NSF
2 Huibing Yin Sean P. Meyn Uday V. Shanbhag H. Yin, P. G. Mehta, S. P. Meyn and U. V. Shanbhag, Synchronization of coupled oscillators is a game, ACC 2010 P. G. Mehta (UIUC) GERAD Apr. 20, / 75
3 Acknowledgement Thanks to Minyi Huang, Roland P. Malhame, and Peter Caines Minyi Huang Roland P. Malhame Peter Caines M. Huang, P. Caines, and R. Malhame, Large-population cost-coupled LQG problems with nonuniform agents: Individual-mass behavior and decentralized ε-nash equilibria, IEEE TAC, 2007 P. G. Mehta (UIUC) GERAD Apr. 20, / 75
4 Millennium bridge Video of London Millennium bridge from youtube [11] S. H. Strogatz et al., Nature, 2005 P. G. Mehta (UIUC) GERAD Apr. 20, / 75
5 Classical Kuramoto model dθ i (t) = ( ω i + κ N N j=1 sin(θ j (t) θ i (t)) ) dt + σ dξ i (t), i = 1,...,N ω i taken from distribution g(ω) over [1 γ,1 + γ] γ measures the heterogeneity of the population κ measures the strength of coupling [6] Y. Kuramoto (1975) P. G. Mehta (UIUC) GERAD Apr. 20, / 75
6 Classical Kuramoto model dθ i (t) = ( ω i + κ N N j=1 sin(θ j (t) θ i (t)) ) dt + σ dξ i (t), i = 1,...,N ω i taken from distribution g(ω) over [1 γ,1 + γ] γ measures the heterogeneity of the population κ measures the strength of coupling [6] Y. Kuramoto (1975) P. G. Mehta (UIUC) GERAD Apr. 20, / 75
7 Classical Kuramoto model dθ i (t) = ( ω i + κ N N j=1 sin(θ j (t) θ i (t)) ) dt + σ dξ i (t), i = 1,...,N ω i taken from distribution g(ω) over [1 γ,1 + γ] γ measures the heterogeneity of the population κ measures the strength of coupling [6] Y. Kuramoto (1975) P. G. Mehta (UIUC) GERAD Apr. 20, / 75
8 Classical Kuramoto model dθ i (t) = ( ω i + κ N N j=1 sin(θ j (t) θ i (t)) ) dt + σ dξ i (t), i = 1,...,N ω i taken from distribution g(ω) over [1 γ,1 + γ] γ measures the heterogeneity of the population κ measures the strength of coupling κ 0.3 Synchrony Locking R κ < κ c (γ) Incoherence γ [6] Y. Kuramoto (1975) P. G. Mehta (UIUC) GERAD Apr. 20, / 75
9 Movies of incoherence and synchrony solution Incoherence Synchrony P. G. Mehta (UIUC) GERAD Apr. 20, / 75
10 Problem statement Dynamics of i th oscillator dθ i = (ω i + u i (t))dt + σ dξ i, i = 1,...,N, t 0 u i (t) control i th oscillator seeks to minimize 1 T η i (u i ;u i ) = lim E[ c(θ i ;θ i ) + 1 T T 0 }{{} 2 Ru2 i ]ds }{{} cost of anarchy cost of control θ i = (θ j ) j i R control penalty c( ) cost function c(θ i ;θ i ) = 1 N j i c (θ i,θ j ), c 0 P. G. Mehta (UIUC) GERAD Apr. 20, / 75
11 Problem statement Dynamics of i th oscillator dθ i = (ω i + u i (t))dt + σ dξ i, i = 1,...,N, t 0 u i (t) control i th oscillator seeks to minimize 1 T η i (u i ;u i ) = lim E[ c(θ i ;θ i ) + 1 T T 0 }{{} 2 Ru2 i ]ds }{{} cost of anarchy cost of control θ i = (θ j ) j i R control penalty c( ) cost function c(θ i ;θ i ) = 1 N j i c (θ i,θ j ), c 0 P. G. Mehta (UIUC) GERAD Apr. 20, / 75
12 Problem statement Dynamics of i th oscillator dθ i = (ω i + u i (t))dt + σ dξ i, i = 1,...,N, t 0 u i (t) control i th oscillator seeks to minimize 1 T η i (u i ;u i ) = lim E[ c(θ i ;θ i ) + 1 T T 0 }{{} 2 Ru2 i ]ds }{{} cost of anarchy cost of control θ i = (θ j ) j i R control penalty c( ) cost function c(θ i ;θ i ) = 1 N j i c (θ i,θ j ), c 0 P. G. Mehta (UIUC) GERAD Apr. 20, / 75
13 1 Motivation Why a game? Why Oscillators? 2 Problems and results Problem statement Main results 3 Derivation of model Overview Derivation steps PDE model 4 Analysis of phase transition Incoherence solution Bifurcation analysis Numerics 5 Learning Q-function approximation Steepest descent algorithm
14 Quiz Motivation Why a game? In the video you just watched, why were the individuals walking strangely? A. To show respect to the Queen. B. Anarchists in the crowd were trying to destabilize the bridge. C. They were stepping to the beat of the soundtrack "Walk Like an Egyptian." D. The individuals were trying to maintain their balance. P. G. Mehta (UIUC) GERAD Apr. 20, / 75
15 Quiz Motivation Why a game? In the video you just watched, why were the individuals walking strangely? A. To show respect to the Queen. B. Anarchists in the crowd were trying to destabilize the bridge. C. They were stepping to the beat of the soundtrack "Walk Like an Egyptian." D. The individuals were trying to maintain their balance. P. G. Mehta (UIUC) GERAD Apr. 20, / 75
16 Quiz Motivation Why a game? In the video you just watched, why were the individuals walking strangely? A. To show respect to the Queen. B. Anarchists in the crowd were trying to destabilize the bridge. C. They were stepping to the beat of the soundtrack "Walk Like an Egyptian." D. The individuals were trying to maintain their balance. P. G. Mehta (UIUC) GERAD Apr. 20, / 75
17 Quiz Motivation Why a game? In the video you just watched, why were the individuals walking strangely? A. To show respect to the Queen. B. Anarchists in the crowd were trying to destabilize the bridge. C. They were stepping to the beat of the soundtrack "Walk Like an Egyptian." D. The individuals were trying to maintain their balance. P. G. Mehta (UIUC) GERAD Apr. 20, / 75
18 Quiz Motivation Why a game? In the video you just watched, why were the individuals walking strangely? A. To show respect to the Queen. B. Anarchists in the crowd were trying to destabilize the bridge. C. They were stepping to the beat of the soundtrack "Walk Like an Egyptian." D. The individuals were trying to maintain their balance. P. G. Mehta (UIUC) GERAD Apr. 20, / 75
19 Rational irrationality Motivation Why a game? behavior that, on the individual level, is perfectly reasonable but that, when aggregated in the marketplace, produces calamity. Examples Millennium bridge Financial market John Cassidy, Rational Irrationality: The real reason that capitalism is so crash-prone, The New Yorker, 2009 P. G. Mehta (UIUC) GERAD Apr. 20, / 75
20 Rational irrationality Motivation Why a game? behavior that, on the individual level, is perfectly reasonable but that, when aggregated in the marketplace, produces calamity. Examples Millennium bridge Financial market John Cassidy, Rational Irrationality: The real reason that capitalism is so crash-prone, The New Yorker, 2009 P. G. Mehta (UIUC) GERAD Apr. 20, / 75
21 1 Motivation Why a game? Why Oscillators? 2 Problems and results Problem statement Main results 3 Derivation of model Overview Derivation steps PDE model 4 Analysis of phase transition Incoherence solution Bifurcation analysis Numerics 5 Learning Q-function approximation Steepest descent algorithm
22 Motivation Why Oscillators? Hodgkin-Huxley type Neuron model C dv dt = g T m 2 (V) h (V E T ) g h r (V E h )... dh dt = h (V) h τ h (V) dr dt = r (V) r τ r (V) Voltage Neural spike train time [4] J. Guckenheimer, J. Math. Biol., 1975; [2] J. Moehlis et al., Neural Computation, 2004 P. G. Mehta (UIUC) GERAD Apr. 20, / 75
23 Motivation Why Oscillators? Hodgkin-Huxley type Neuron model C dv dt = g T m 2 (V) h (V E T ) g h r (V E h )... dh dt = h (V) h τ h (V) dr dt = r (V) r τ r (V) Voltage Neural spike train time [4] J. Guckenheimer, J. Math. Biol., 1975; [2] J. Moehlis et al., Neural Computation, 2004 P. G. Mehta (UIUC) GERAD Apr. 20, / 75
24 r Motivation Why Oscillators? Hodgkin-Huxley type Neuron model C dv dt = g T m 2 (V) h (V E T ) g h r (V E h )... dh dt = h (V) h τ h (V) dr dt = r (V) r τ r (V) Voltage Neural spike train time r Limit cyle h h V 0 v [4] J. Guckenheimer, J. Math. Biol., 1975; [2] J. Moehlis et al., Neural Computation, 2004 P. G. Mehta (UIUC) GERAD Apr. 20, / 75
25 r Motivation Why Oscillators? Hodgkin-Huxley type Neuron model C dv dt = g T m 2 (V) h (V E T ) g h r (V E h )... dh dt = h (V) h τ h (V) dr dt = r (V) r τ r (V) Voltage Neural spike train time r Limit cyle Normal form reduction h h V 0 v θ i = ω i + u i Φ(θ i ) [4] J. Guckenheimer, J. Math. Biol., 1975; [2] J. Moehlis et al., Neural Computation, 2004 P. G. Mehta (UIUC) GERAD Apr. 20, / 75
26 1 Motivation Why a game? Why Oscillators? 2 Problems and results Problem statement Main results 3 Derivation of model Overview Derivation steps PDE model 4 Analysis of phase transition Incoherence solution Bifurcation analysis Numerics 5 Learning Q-function approximation Steepest descent algorithm
27 Problems and results Finite oscillator model Problem statement Dynamics of i th oscillator dθ i = (ω i + u i (t))dt + σ dξ i, i = 1,...,N, t 0 u i (t) control i th oscillator seeks to minimize 1 T η i (u i ;u i ) = lim E[ c(θ i ;θ i ) + 1 T T 0 }{{} 2 Ru2 i ]ds }{{} cost of anarchy cost of control θ i = (θ j ) j i R control penalty c( ) cost function c(θ i ;θ i ) = 1 N j i c (θ i,θ j ), c 0 P. G. Mehta (UIUC) GERAD Apr. 20, / 75
28 1 Motivation Why a game? Why Oscillators? 2 Problems and results Problem statement Main results 3 Derivation of model Overview Derivation steps PDE model 4 Analysis of phase transition Incoherence solution Bifurcation analysis Numerics 5 Learning Q-function approximation Steepest descent algorithm
29 Problems and results Main results 1. Synchronization is a solution of game R 1/ Synchrony Locking γ R > R c (γ) Incoherence γ dθ i = (ω i + u i )dt + σ dξ i 1 T η i (u i ;u i ) = lim E[c(θ i ;θ i ) + 1 T T 2 Ru2 i ]ds 0 Yin et al., ACC 2010 Strogatz et al., J. Stat. Phy., 1992 P. G. Mehta (UIUC) GERAD Apr. 20, / 75
30 Problems and results Main results 1. Synchronization is a solution of game R 1/ Synchrony Locking κ 0.3 Synchrony Locking 0.25 R γ R > R c (γ) Incoherence γ κ < κ c (γ) Incoherence γ dθ i = (ω i + u i )dt + σ dξ i 1 T η i (u i ;u i ) = lim E[c(θ i ;θ i ) + 1 T T 2 Ru2 i ]ds 0 ( ) dθ i = ω i + κ N N sin(θ j θ i ) dt + σ dξ i j=1 Yin et al., ACC 2010 Strogatz et al., J. Stat. Phy., 1992 P. G. Mehta (UIUC) GERAD Apr. 20, / 75
31 Problems and results Main results 2. Kuramoto control is approximately optimal 6 k = 0.01; R = Population Density 0 π 2 π θ u i = A i R 1 N sin(θ θ j (t)) j i Control laws ω = 1 Kuramoto A * A i t Learning algorithm: da i = ε... dt Yin et.al. CDC 2010 P. G. Mehta (UIUC) GERAD Apr. 20, / 75
32 1 Motivation Why a game? Why Oscillators? 2 Problems and results Problem statement Main results 3 Derivation of model Overview Derivation steps PDE model 4 Analysis of phase transition Incoherence solution Bifurcation analysis Numerics 5 Learning Q-function approximation Steepest descent algorithm
33 Derivation of model Overview of model derivation Overview dθ i = (ω i + u i (t))dt + σ dξ i 1 η i (u i ;u i ) = lim T T T 0 E[ c(θ i,t) Ru2 i ]ds Mass Influence Influence 1 Mean-field approximation Assumption: N c(θ i ;θ i (t)) = 1 N c (θ i,θ j ) c(θ,t) j i 2 Optimal control of single oscillator Decentralized control structure [5] M. Huang, P. Caines, and R. Malhame, IEEE TAC, 2007 [HCM] P. G. Mehta (UIUC) GERAD Apr. 20, / 75
34 1 Motivation Why a game? Why Oscillators? 2 Problems and results Problem statement Main results 3 Derivation of model Overview Derivation steps PDE model 4 Analysis of phase transition Incoherence solution Bifurcation analysis Numerics 5 Learning Q-function approximation Steepest descent algorithm
35 Derivation of model Derivation steps Single oscillator with given cost Dynamics of the oscillator dθ i = (ω i + u i (t))dt + σ dξ i, t 0 The cost function is assumed known 1 T η i (u i ; c) = lim E[ c(θ i ;θ i ) + 1 T T 2 Ru2 i (s)]ds 0 c(θ i (s),s) HJB equation: t h i + ω i θ h i = 1 2R ( θ h i ) 2 c(θ,t) + η i σ θθ h i Optimal control law: u i (t) = ϕ i (θ,t) = 1 R θ h i (θ,t) [1] D. P. Bertsekas (1995); [9] S. P. Meyn, IEEE TAC, 1997 P. G. Mehta (UIUC) GERAD Apr. 20, / 75
36 Derivation of model Derivation steps Single oscillator with optimal control Dynamics of the oscillator ( dθ i (t) = ω i 1 ) R θ h i (θ i,t) dt + σ dξ i (t) Fokker-Planck equation for pdf p(θ,t,ω i ) FPK: t p + ω i θ p = 1 R θ [p( θ h)] + σ θθ p [7] A. Lasota and M. C. Mackey, Chaos, Fractals and Noise, Springer 1994 P. G. Mehta (UIUC) GERAD Apr. 20, / 75
37 Derivation of model Mean-field Approximation Derivation steps HJB equation for population t h + ω θ h = 1 2R ( θ h) 2 c(θ,t) + η(ω) σ θθ h h(θ,t,ω) Population density t p + ω θ p = 1 R θ [p( θ h)] + σ 2 Enforce cost consistency 2π c(θ,t) = c (θ,ϑ)p(ϑ,t,ω)g(ω)dϑ dω Ω 0 1 N j i c (θ,ϑ) 2 2 θθ p p(θ,t,ω) P. G. Mehta (UIUC) GERAD Apr. 20, / 75
38 1 Motivation Why a game? Why Oscillators? 2 Problems and results Problem statement Main results 3 Derivation of model Overview Derivation steps PDE model 4 Analysis of phase transition Incoherence solution Bifurcation analysis Numerics 5 Learning Q-function approximation Steepest descent algorithm
39 Summary Derivation of model PDE model HJB: t h + ω θ h = 1 2R ( θ h) 2 c(θ,t) + η σ θθ h h(θ,t,ω) FPK: t p + ω θ p = 1 R θ [p( θ h )] + σ 2 2 θθ 2 p p(θ,t,ω) 2π Mean-field approx.: c(ϑ, t) = c (ϑ,θ) p(θ,t,ω) g(ω)dθ dω Ω 0 1 Bellman s optimality principle (H,J,B) 2 Propagation of chaos (F,P,K, Mckean, Vlasov,... ) 3 Mean-field approximation (Boltzmann, Kac,... ) 4 Connection to Nash game (Weintraub, HCM, Altman,... ) P. G. Mehta (UIUC) GERAD Apr. 20, / 75
40 Summary Derivation of model PDE model HJB: t h + ω θ h = 1 2R ( θ h) 2 c(θ,t) + η σ θθ h h(θ,t,ω) FPK: t p + ω θ p = 1 R θ [p( θ h )] + σ 2 2 θθ 2 p p(θ,t,ω) 2π Mean-field approx.: c(ϑ, t) = c (ϑ,θ) p(θ,t,ω) g(ω)dθ dω Ω 0 1 Bellman s optimality principle (H,J,B) 2 Propagation of chaos (F,P,K, Mckean, Vlasov,... ) 3 Mean-field approximation (Boltzmann, Kac,... ) 4 Connection to Nash game (Weintraub, HCM, Altman,... ) P. G. Mehta (UIUC) GERAD Apr. 20, / 75
41 Summary Derivation of model PDE model HJB: t h + ω θ h = 1 2R ( θ h) 2 c(θ,t) + η σ θθ h h(θ,t,ω) FPK: t p + ω θ p = 1 R θ [p( θ h )] + σ 2 2 θθ 2 p p(θ,t,ω) 2π Mean-field approx.: c(ϑ, t) = c (ϑ,θ) p(θ,t,ω) g(ω)dθ dω Ω 0 1 Bellman s optimality principle (H,J,B) 2 Propagation of chaos (F,P,K, Mckean, Vlasov,... ) 3 Mean-field approximation (Boltzmann, Kac,... ) 4 Connection to Nash game (Weintraub, HCM, Altman,... ) P. G. Mehta (UIUC) GERAD Apr. 20, / 75
42 Summary Derivation of model PDE model HJB: t h + ω θ h = 1 2R ( θ h) 2 c(θ,t) + η σ θθ h h(θ,t,ω) FPK: t p + ω θ p = 1 R θ [p( θ h )] + σ 2 2 θθ 2 p p(θ,t,ω) 2π Mean-field approx.: c(ϑ, t) = c (ϑ,θ) p(θ,t,ω) g(ω)dθ dω Ω 0 1 Bellman s optimality principle (H,J,B) 2 Propagation of chaos (F,P,K, Mckean, Vlasov,... ) 3 Mean-field approximation (Boltzmann, Kac,... ) 4 Connection to Nash game (Weintraub, HCM, Altman,... ) P. G. Mehta (UIUC) GERAD Apr. 20, / 75
43 Summary Derivation of model PDE model HJB: t h + ω θ h = 1 2R ( θ h) 2 c(θ,t) + η σ θθ h h(θ,t,ω) FPK: t p + ω θ p = 1 R θ [p( θ h )] + σ 2 2 θθ 2 p p(θ,t,ω) 2π Mean-field approx.: c(ϑ, t) = c (ϑ,θ) p(θ,t,ω) g(ω)dθ dω Ω 0 1 Bellman s optimality principle (H,J,B) 2 Propagation of chaos (F,P,K, Mckean, Vlasov,... ) 3 Mean-field approximation (Boltzmann, Kac,... ) 4 Connection to Nash game (Weintraub, HCM, Altman,... ) P. G. Mehta (UIUC) GERAD Apr. 20, / 75
44 Derivation of model PDE model 1. Solution of PDE gives ε-nash equilibrium Optimal control law ε-nash property (as N ) u o i = 1 R θ h(θ(t),t,ω) ω=ωi η i (u o i ;u o i) η i (u i ;u o i) + O( 1 N ), i = 1,...,N. So, we look for solutions of PDEs. P. G. Mehta (UIUC) GERAD Apr. 20, / 75
45 Derivation of model PDE model 1. Solution of PDE gives ε-nash equilibrium Optimal control law ε-nash property (as N ) u o i = 1 R θ h(θ(t),t,ω) ω=ωi η i (u o i ;u o i) η i (u i ;u o i) + O( 1 N ), i = 1,...,N. So, we look for solutions of PDEs. P. G. Mehta (UIUC) GERAD Apr. 20, / 75
46 Derivation of model PDE model 1. Solution of PDE gives ε-nash equilibrium Optimal control law ε-nash property (as N ) u o i = 1 R θ h(θ(t),t,ω) ω=ωi η i (u o i ;u o i) η i (u i ;u o i) + O( 1 N ), i = 1,...,N. So, we look for solutions of PDEs. P. G. Mehta (UIUC) GERAD Apr. 20, / 75
47 Derivation of model PDE model 2. Incoherence solution (PDE) Incoherence solution h(θ,t,ω) = h 0 (θ) := 0 p(θ,t,ω) = p 0 (θ) := 1 2π incoherence h(θ,t,ω) = 0 t h + ω θ h = 1 2R ( θ h) 2 c(θ,t) + η σ θθ h t p + ω θ p = 1 R θ [p( θ h)] + σ θθ p 2π c(θ,t) = c (θ,ϑ)p(ϑ,t,ω)g(ω)dϑ dω Ω 0 P. G. Mehta (UIUC) GERAD Apr. 20, / 75
48 Derivation of model PDE model 2. Incoherence solution (PDE) Incoherence solution h(θ,t,ω) = h 0 (θ) := 0 p(θ,t,ω) = p 0 (θ) := 1 2π incoherence h(θ,t,ω) = 0 t h + ω θ h = 1 2R ( θ h) 2 c(θ,t) + η σ θθ h p(θ,t,ω) = 1 2π tp + ω θ p = 1 R θ [p( θ h)] + σ θθ p 2π c(θ,t) = c (θ,ϑ)p(ϑ,t,ω)g(ω)dϑ dω Ω 0 P. G. Mehta (UIUC) GERAD Apr. 20, / 75
49 Derivation of model PDE model 2. Incoherence solution (PDE) ( ) ϑ θ Assume c (ϑ,θ) = c (ϑ θ) = 1 2 sin2 2 Incoherence solution h(θ,t,ω) = h 0 (θ) := 0 p(θ,t,ω) = p 0 (θ) := 1 2π Optimal control u = 1 R θ h = 0 Average cost 2π ( ) 1 θ ϑ 1 c(θ,t) = 2 sin2 g(ω)dϑ dω Ω 0 2 2π η (ω) = c(θ,t) = 1 4 =: η 0 for all ω Ω incoherence soln. No cost of control P. G. Mehta (UIUC) GERAD Apr. 20, / 75
50 Derivation of model PDE model 2. Incoherence solution (Finite population) Closed-loop dynamics dθ i = (ω i + u }{{} i )dt + σ dξ i (t) =0 Average cost 1 T η i = lim E[c(θ i ;θ i ) + 1 T T 2 Ru2 i ]dt 0 }{{} =0 1 = lim T N 1 T ( E[ 1 θi (t) θ j (t) j i T 2 sin2 0 2 = 1 N 2π ( E[ 1 θi (t) ϑ 2 sin2 j i 0 2 ) ]dt ) ] 1 2π dϑ = N N η incoherence 1 ε-nash property η i (u o i ;u o i) η i (u i ;u o i) + O( 1 N ), i = 1,...,N. P. G. Mehta (UIUC) GERAD Apr. 20, / 75
51 Derivation of model PDE model 2. Incoherence solution (Finite population) Closed-loop dynamics dθ i = (ω i + u }{{} i )dt + σ dξ i (t) =0 Average cost 1 T η i = lim E[c(θ i ;θ i ) + 1 T T 2 Ru2 i ]dt 0 }{{} =0 1 = lim T N 1 T ( E[ 1 θi (t) θ j (t) j i T 2 sin2 0 2 = 1 N 2π ( E[ 1 θi (t) ϑ 2 sin2 j i 0 2 ) ]dt ) ] 1 2π dϑ = N N η incoherence 1 ε-nash property η i (u o i ;u o i) η i (u i ;u o i) + O( 1 N ), i = 1,...,N. P. G. Mehta (UIUC) GERAD Apr. 20, / 75
52 Derivation of model PDE model 3. Synchronization is a solution of game R 1/ Synchrony Locking η(ω) η(ω) = η0 c R < Rc ω= 0.95 ω= 1 ω= 1.05 γ R > R c (γ) 0.15 Incoherence γ R > Rc 0.15 η(ω) < η R 1/ 2 dθ i = (ω i + u i )dt + σ dξ i 1 T η i (u i ;u i ) = lim E[c(θ i ;θ i ) + 1 T T 2 Ru2 i ]ds 0 η(ω) = minη i (u i ;u o u i i) t = Synchrony solution of P. G. Mehta (UIUC) GERAD Apr. 20, / 75
53 Derivation of model PDE model 3. Synchronization is a solution of game R 1/ Synchrony Locking η(ω) η(ω) = η0 incoherence soln. c R < Rc ω= 0.95 ω= 1 ω= 1.05 γ R > R c (γ) 0.15 Incoherence γ R > Rc 0.15 η(ω) < η R 1/ 2 dθ i = (ω i + u i )dt + σ dξ i 1 T η i (u i ;u i ) = lim E[c(θ i ;θ i ) + 1 T T 2 Ru2 i ]ds 0 η(ω) = minη i (u i ;u o u i i) t = Synchrony solution of P. G. Mehta (UIUC) GERAD Apr. 20, / 75
54 Derivation of model PDE model 3. Synchronization is a solution of game γ R 1/ 2 Synchrony Locking R > R c (γ) 0.15 Incoherence γ η(ω) R > Rc η(ω) = η0 R < Rc η(ω) < η R 1/ 2 c ω= 0.95 ω= 1 ω= 1.05 synchrony soln. dθ i = (ω i + u i )dt + σ dξ i 1 T η i (u i ;u i ) = lim E[c(θ i ;θ i ) + 1 T T 2 Ru2 i ]ds 0 η(ω) = minη i (u i ;u o u i i) t = Synchrony solution of P. G. Mehta (UIUC) GERAD Apr. 20, / 75
55 1 Motivation Why a game? Why Oscillators? 2 Problems and results Problem statement Main results 3 Derivation of model Overview Derivation steps PDE model 4 Analysis of phase transition Incoherence solution Bifurcation analysis Numerics 5 Learning Q-function approximation Steepest descent algorithm
56 Analysis of phase transition Overview of the steps Incoherence solution HJB: t h + ω θ h = 1 2R ( θ h) 2 c(θ,t) + η σ θθ h h(θ,t,ω) FPK: t p + ω θ p = 1 R θ [p( θ h )] + σ 2 2 θθ 2 p p(θ,t,ω) 2π c(ϑ,t) = c (ϑ,θ) p(θ,t,ω) g(ω)dθ dω Ω 0 ( ) ϑ θ Assume c (ϑ,θ) = c (ϑ θ) = 1 2 sin2 2 Incoherence solution h(θ,t,ω) = h 0 (θ) := 0 p(θ,t,ω) = p 0 (θ) := 1 2π P. G. Mehta (UIUC) GERAD Apr. 20, / 75
57 1 Motivation Why a game? Why Oscillators? 2 Problems and results Problem statement Main results 3 Derivation of model Overview Derivation steps PDE model 4 Analysis of phase transition Incoherence solution Bifurcation analysis Numerics 5 Learning Q-function approximation Steepest descent algorithm
58 Analysis of phase transition Linearization and spectra Bifurcation analysis Linearized PDE (about incoherence solution) ( t z(θ,t,ω) = ω θ h c σ 2 2 θθ 2 h ) ω θ p + 1 2πR θθ 2 h + σ 2 2 θθ 2 p =: L R z(θ,t,ω) Spectrum of the linear operator 1 Continuous spectrum {S (k) } + k= S (k) := {λ C λ = ± σ 2 } 2 k2 kωi for all ω Ω 2 Discrete spectrum Characteristic eqn: 1 8R Ω g(ω) (λ σ ωi)(λ + dω + 1 = 0. σ ωi) P. G. Mehta (UIUC) GERAD Apr. 20, / 75
59 Analysis of phase transition Linearization and spectra Bifurcation analysis Linearized PDE (about incoherence solution) ( t z(θ,t,ω) = ω θ h c σ 2 2 θθ 2 h ) ω θ p + 1 2πR θθ 2 h + σ 2 2 θθ 2 p =: L R z(θ,t,ω) Spectrum of the linear operator 1 Continuous spectrum {S (k) } + k= S (k) := {λ C λ = ± σ 2 } 2 k2 kωi for all ω Ω 2 Discrete spectrum Characteristic eqn: 1 8R Ω g(ω) (λ σ ωi)(λ + dω + 1 = 0. σ ωi) P. G. Mehta (UIUC) GERAD Apr. 20, / 75
60 Analysis of phase transition Linearization and spectra Bifurcation analysis Linearized PDE (about incoherence solution) ( t z(θ,t,ω) = ω θ h c σ 2 2 θθ 2 h ) ω θ p + 1 2πR θθ 2 h + σ 2 2 θθ 2 p =: L R z(θ,t,ω) Spectrum of the linear operator 1 Continuous spectrum {S (k) } + k= S (k) := {λ C λ = ± σ 2 } 2 k2 kωi for all ω Ω 2 Discrete spectrum imag γ = 0.1 R decreases k=1 k=1 k=2 k= real Characteristic eqn: 1 8R Ω g(ω) (λ σ ωi)(λ + dω + 1 = 0. σ ωi) P. G. Mehta (UIUC) GERAD Apr. 20, / 75
61 Analysis of phase transition Linearization and spectra Bifurcation analysis Linearized PDE (about incoherence solution) ( t z(θ,t,ω) = ω θ h c σ 2 2 θθ 2 h ) ω θ p + 1 2πR θθ 2 h + σ 2 2 θθ 2 p =: L R z(θ,t,ω) Spectrum of the linear operator 1 Continuous spectrum {S (k) } + k= S (k) := {λ C λ = ± σ 2 } 2 k2 kωi for all ω Ω 2 Discrete spectrum imag γ = 0.1 R decreases k=1 k=1 k=2 k= real Characteristic eqn: 1 8R Ω g(ω) (λ σ ωi)(λ + dω + 1 = 0. σ ωi) P. G. Mehta (UIUC) GERAD Apr. 20, / 75
62 Analysis of phase transition Bifurcation analysis Bifurcation diagram (Hamiltonian Hopf) Characteristic eqn: 1 8R Ω g(ω) (λ σ ωi)(λ + dω + 1 = 0. σ ωi) Stability proof [3] Dellnitz et al., Int. Series Num. Math., 1992 P. G. Mehta (UIUC) GERAD Apr. 20, / 75
63 Analysis of phase transition Bifurcation analysis Bifurcation diagram (Hamiltonian Hopf) Characteristic eqn: 1 8R Ω g(ω) (λ σ ωi)(λ + dω + 1 = 0. σ ωi) Stability proof Bifurcation point (a) imag Cont. spectrum; ind. of R Disc. spectrum; fn. of R real [3] Dellnitz et al., Int. Series Num. Math., 1992 P. G. Mehta (UIUC) GERAD Apr. 20, / 75
64 Analysis of phase transition Bifurcation analysis Bifurcation diagram (Hamiltonian Hopf) Characteristic eqn: 1 8R Ω g(ω) (λ σ ωi)(λ + dω + 1 = 0. σ ωi) Stability proof Bifurcation point (a) ) R R > Rc(γ) (c) Incoherence imag Cont. spectrum; ind. of R Disc. spectrum; fn. of R real Synchrony γ [3] Dellnitz et al., Int. Series Num. Math., 1992 P. G. Mehta (UIUC) GERAD Apr. 20, / 75
65 1 Motivation Why a game? Why Oscillators? 2 Problems and results Problem statement Main results 3 Derivation of model Overview Derivation steps PDE model 4 Analysis of phase transition Incoherence solution Bifurcation analysis Numerics 5 Learning Q-function approximation Steepest descent algorithm
66 Analysis of phase transition Numerical solution of PDEs Numerics incoherence Incoherence; R = 60 incoherence P. G. Mehta (UIUC) GERAD Apr. 20, / 75
67 Analysis of phase transition Numerical solution of PDEs Numerics incoherence Incoherence; R = 60 incoherence synchrony Synchrony; R = 10 synchrony P. G. Mehta (UIUC) GERAD Apr. 20, / 75
68 Bifurcation diagram Analysis of phase transition Numerics R 1/ Synchrony Locking η(ω) 0.25 R < Rc η(ω) < η0 ω = 0.95 ω = 1 ω = γ R > R c (γ) 0.15 Incoherence γ 0.15 R > Rc η(ω) = η R 1/2 dθ i = (ω i + u i )dt + σ dξ i 1 η i (u i ;u i ) = lim T T T 0 E[c(θ i ;θ i ) Ru2 i ]ds P. G. Mehta (UIUC) GERAD Apr. 20, / 75
69 Bifurcation diagram Analysis of phase transition Numerics R 1/ Synchrony Locking η(ω) 0.25 incoherence soln. R < Rc η(ω) < η0 ω = 0.95 ω = 1 ω = γ R > R c (γ) 0.15 Incoherence γ 0.15 R > Rc η(ω) = η R 1/2 dθ i = (ω i + u i )dt + σ dξ i 1 η i (u i ;u i ) = lim T T T 0 E[c(θ i ;θ i ) Ru2 i ]ds P. G. Mehta (UIUC) GERAD Apr. 20, / 75
70 Bifurcation diagram Analysis of phase transition Numerics R 1/ Synchrony Locking η(ω) 0.25 R < Rc η(ω) < η0 ω = 0.95 ω = 1 ω = 1.05 γ 0.2 R > R c (γ) 0.15 Incoherence γ R > Rc η(ω) = η synchrony soln. R 1/2 dθ i = (ω i + u i )dt + σ dξ i 1 η i (u i ;u i ) = lim T T T 0 E[c(θ i ;θ i ) Ru2 i ]ds P. G. Mehta (UIUC) GERAD Apr. 20, / 75
71 p R = θ Analysis of phase transition Numerics Bifurcation diagram with another cost c (θ,ϑ) = 0.25(1 cos(θ ϑ)) R 1/2 P. G. Mehta (UIUC) GERAD Apr. 20, / 75
72 p R = θ Analysis of phase transition Numerics Bifurcation diagram with another cost c (θ,ϑ) = 0.25(1 cos(θ ϑ)) R = p R 1/ θ P. G. Mehta (UIUC) GERAD Apr. 20, / 75
73 p R = θ Analysis of phase transition Numerics Bifurcation diagram with another cost c (θ,ϑ) = 0.25(1 cos(θ ϑ)) c (θ,ϑ) = 1 cos(θ ϑ) cos3(θ ϑ) R 1/ R 1/2 P. G. Mehta (UIUC) GERAD Apr. 20, / 75
74 p R = θ Analysis of phase transition Numerics Bifurcation diagram with another cost c (θ,ϑ) = 0.25(1 cos(θ ϑ)) c (θ,ϑ) = 1 cos(θ ϑ) cos3(θ ϑ) p R 1/ θ P. G. Mehta (UIUC) GERAD Apr. 20, / 75
75 1 Motivation Why a game? Why Oscillators? 2 Problems and results Problem statement Main results 3 Derivation of model Overview Derivation steps PDE model 4 Analysis of phase transition Incoherence solution Bifurcation analysis Numerics 5 Learning Q-function approximation Steepest descent algorithm
76 Learning Comparison to Kuramoto law Q-function approximation Control law u = ϕ(θ,t,ω) = 1 R θ h(θ,t,ω) Population Density Control laws ω = 0.95 ω = 1 ω = π 2π θ Equivalent control law in Kuramoto oscillator u (Kur) i = κ N N j=1 sin(θ j (t) θ i ) N κ 0 sin(ϑ 0 + t θ i ) P. G. Mehta (UIUC) GERAD Apr. 20, / 75
77 Learning Comparison to Kuramoto law Q-function approximation Control law u = ϕ(θ,t,ω) = 1 R θ h(θ,t,ω) Population Density π 2 π θ Control laws ω = 0.95 ω = 1 ω = 1.05 Kuramoto Equivalent control law in Kuramoto oscillator u (Kur) i = κ N N j=1 sin(θ j (t) θ i ) N κ 0 sin(ϑ 0 + t θ i ) P. G. Mehta (UIUC) GERAD Apr. 20, / 75
78 Learning Q-function approximation Optimality equation Optimal control law u i = 1 R θ h i (θ,t) Parameterization: min{c(θ;θ i (t)) + 1 u i 2 Ru2 i + D ui h i (θ,t)} = ηi }{{} =: H i (θ,u i ;θ i (t)) Kuramoto law u (Kur) i = κ N j i sin(θ i θ j (t)) H (A i,φ i ) i (θ,u i ;θ i (t)) = c(θ;θ i (t))+ 1 2 Ru2 i +(ω i 1+u i )A i S (φ i) + σ 2 2 A ic (φ i) where S (φ) (θ,θ i ) = 1 N sin(θ θ j φ), C (φ) (θ,θ i ) = 1 j i N cos(θ θ j φ) j i Approx. optimal control: u (A i,φ i ) i = argmin{h (A i,φ i ) i (θ,u i ;θ i (t))} = A i u i R S(φ i) (θ,θ i ) P. G. Mehta (UIUC) GERAD Apr. 20, / 75
79 Learning Q-function approximation Optimality equation Optimal control law u i = 1 R θ h i (θ,t) Parameterization: min{c(θ;θ i (t)) + 1 u i 2 Ru2 i + D ui h i (θ,t)} = ηi }{{} =: H i (θ,u i ;θ i (t)) Kuramoto law u (Kur) i = κ N j i sin(θ i θ j (t)) H (A i,φ i ) i (θ,u i ;θ i (t)) = c(θ;θ i (t))+ 1 2 Ru2 i +(ω i 1+u i )A i S (φ i) + σ 2 2 A ic (φ i) where S (φ) (θ,θ i ) = 1 N sin(θ θ j φ), C (φ) (θ,θ i ) = 1 j i N cos(θ θ j φ) j i Approx. optimal control: u (A i,φ i ) i = argmin{h (A i,φ i ) i (θ,u i ;θ i (t))} = A i u i R S(φ i) (θ,θ i ) P. G. Mehta (UIUC) GERAD Apr. 20, / 75
80 Learning Q-function approximation Optimality equation Optimal control law u i = 1 R θ h i (θ,t) Parameterization: min{c(θ;θ i (t)) + 1 u i 2 Ru2 i + D ui h i (θ,t)} = ηi }{{} =: H i (θ,u i ;θ i (t)) Kuramoto law u (Kur) i = κ N j i sin(θ i θ j (t)) H (A i,φ i ) i (θ,u i ;θ i (t)) = c(θ;θ i (t))+ 1 2 Ru2 i +(ω i 1+u i )A i S (φ i) + σ 2 2 A ic (φ i) where S (φ) (θ,θ i ) = 1 N sin(θ θ j φ), C (φ) (θ,θ i ) = 1 j i N cos(θ θ j φ) j i Approx. optimal control: u (A i,φ i ) i = argmin{h (A i,φ i ) i (θ,u i ;θ i (t))} = A i u i R S(φ i) (θ,θ i ) P. G. Mehta (UIUC) GERAD Apr. 20, / 75
81 Learning Q-function approximation Optimality equation Optimal control law u i = 1 R θ h i (θ,t) Parameterization: min{c(θ;θ i (t)) + 1 u i 2 Ru2 i + D ui h i (θ,t)} = ηi }{{} =: H i (θ,u i ;θ i (t)) Kuramoto law u (Kur) i = κ N j i sin(θ i θ j (t)) H (A i,φ i ) i (θ,u i ;θ i (t)) = c(θ;θ i (t))+ 1 2 Ru2 i +(ω i 1+u i )A i S (φ i) + σ 2 2 A ic (φ i) where S (φ) (θ,θ i ) = 1 N sin(θ θ j φ), C (φ) (θ,θ i ) = 1 j i N cos(θ θ j φ) j i Approx. optimal control: u (A i,φ i ) i = argmin{h (A i,φ i ) i (θ,u i ;θ i (t))} = A i u i R S(φ i) (θ,θ i ) P. G. Mehta (UIUC) GERAD Apr. 20, / 75
82 1 Motivation Why a game? Why Oscillators? 2 Problems and results Problem statement Main results 3 Derivation of model Overview Derivation steps PDE model 4 Analysis of phase transition Incoherence solution Bifurcation analysis Numerics 5 Learning Q-function approximation Steepest descent algorithm
83 Learning Steepest descent algorithm Bellman error: Pointwise: Simple gradient descent algorithm ẽ(a i,φ i ) = 2 k=1 L (A i,φ i ) (θ,t) = min{h (A i,φ i ) u i } η (A i,φ i i L (A i,φ i ), ϕ k (θ) 2 da i = ε dẽ(a i,φ i ), dt da i Theorem (Convergence) dφ i dt = ε dẽ(a i,φ i ) dφ i Assume population is in synchrony. The i th oscillator updates according to ( ). Then A i (t) A = 1 2σ 2 The pointwise Bellman error L (A i,0) (θ,t) = ε(r)cos2(θ t) where ε(r) = 1 16Rσ 4 P. G. Mehta (UIUC) GERAD Apr. 20, / 75 i ) ( )
84 Phase transition Learning Steepest descent algorithm Suppose all oscillators use approx. optimal control law: u i = A R 1 N sin(θ i θ j (t)) j i then the phase transition boundary is { 1 if γ = 0 2σ R c (γ) = 4 ( ) 1 2γ 4σ 2 γ tan 1 if γ > 0 σ k = 0.01; R = PDE Learning 5 A * A i R Synchrony Incoherence t γ P. G. Mehta (UIUC) GERAD Apr. 20, / 75
85 Learning Comparison of control laws Steepest descent algorithm From PDE: u i = ϕ(θ,t,ω) = 1 ω=ωi R θ h(θ,t,ω) From learning: u i = A R 1 N sin(θ i θ j (t) φ i ) j i ω=ωi Population Density Population Density θ π 2 π 0 π 2 π θ Control laws PDE ω = 0.95 ω = 0.1 ω = 1.05 Learning ω = 0.95 ω = 0.1 ω = 1.05 P. G. Mehta (UIUC) GERAD Apr. 20, / 75
86 Learning Comparison of average cost Steepest descent algorithm Average Cost PDE Learning ω = 0.95 ω = 0.1 ω = P. G. Mehta (UIUC) GERAD Apr. 20, / 75
87 Thank you! Website: Huibing Yin Sean P. Meyn Uday V. Shanbhag H. Yin, P. G. Mehta, S. P. Meyn and U. V. Shanbhag, Synchronization of coupled oscillators is a game, ACC 2010
88 Part I Apendix
89 6 Appendix Game-theoretic results (ε-nash) Stability Numerics Linearization/Spectra Modeling Multiple oscillators Learning The New Yorker excerpt 7 Bibliography
90 Infinit population limit Appendix Game-theoretic results (ε-nash) Recall the PDE model t h + ω θ h = 1 2R ( θ h) 2 c(θ,t) + η σ θθ h t p + ω θ p = 1 R θ [p( θ h)] + σ 2 2 θθ 2 p 2π c(ϑ,t) = c (ϑ,θ)p(θ,t,ω)g(ω)dθ dω Ω 0 Solution h(θ,t,ω) Control u o i (t) := 1 R θ h(θ,t,ω) ω=ωi By construction: η i (u o i ; c) η i (u i ; c) P. G. Mehta (UIUC) GERAD Apr. 20, / 75
91 Finite population Appendix Game-theoretic results (ε-nash) Finite population cost [ c (N) 1 i (ϑ,t) := E N c ( θ i (t);θ i(t) o θ i (t) = ϑ )] j i θ i o = (θ1 o,...,θo i 1,θo i+1,...,θo N) with θi o (t) obtained from dynamics with control u i = u o i (t) Lemma 1: for each i = 1,...,N [ 1 T ] max lim sup c(ϑ,s) c (N) i (ϑ,s) ds ϑ [0,2π] T T 0 = O( 1 N ) P. G. Mehta (UIUC) GERAD Apr. 20, / 75
92 ε-nash equilibrium Appendix Game-theoretic results (ε-nash) Theorem The control {u o i (t) = 1 R θ h(θ(t),t,ω) ω=ωi } N i=1 equilibrium with respect to the finite population cost is an ε-nash η (POP) i (u i ;u o 1 T i) = lim T T 0 i.e., for any adapted control u i, c (N) i Ru2 i ds; η (POP) i (u o i ;u o i) η (POP) i (u i ;u o i) + O( 1 ), N i = 1,...,N. c (N) i (ϑ,t) := E[c(θ i (t);θ o i(t)) θ i (t) = ϑ] P. G. Mehta (UIUC) GERAD Apr. 20, / 75
93 6 Appendix Game-theoretic results (ε-nash) Stability Numerics Linearization/Spectra Modeling Multiple oscillators Learning The New Yorker excerpt 7 Bibliography
94 Stability proof Appendix Stability dx = ax + by, a > 0 dt dy = cx + ay, bc < 0 dt FPK eqn HJB eqn with boundary conditions x(0) = x 0 and y( ) = 0 t x(t) = e at x 0 + b e a(t τ) y(τ)dτ, y(t) = c t 0 e a(t τ) x(τ)dτ. For x 0 = 0, equilibrium solution x(t) 0, y(t) 0 Characteristic equation σ 2 = a 2 + bc. Stable in the sense that The equilibrium solution is asymptotically stable if for any initial perturbation x 0, the solution x(t) 0 as t. Return to spectrum P. G. Mehta (UIUC) GERAD Apr. 20, / 75
95 6 Appendix Game-theoretic results (ε-nash) Stability Numerics Linearization/Spectra Modeling Multiple oscillators Learning The New Yorker excerpt 7 Bibliography
96 Appendix Algorithm for solving PDEs Numerics HJB: t h + ω θ h = 1 2R ( θ h) 2 c(θ,t) + η σ θθ h h(θ,t,ω) FPK: t p + ω θ p = 1 R θ [p( θ h )] + σ 2 2 θθ 2 p p(θ,t,ω) 2π c(ϑ,t) = c (ϑ,θ) p(θ,t,ω) g(ω)dθ dω Ω 0 V(θ,t,ω) = h(θ,t s,ω) Iterative algorithm s Backward time Forward time t P. G. Mehta (UIUC) GERAD Apr. 20, / 75
97 Appendix Algorithm for solving PDEs Numerics HJB: t h + ω θ h = 1 2R ( θ h) 2 c(θ,t) + η σ θθ h h(θ,t,ω) FPK: t p + ω θ p = 1 R θ [p( θ h )] + σ 2 2 θθ 2 p p(θ,t,ω) 2π c(ϑ,t) = c (ϑ,θ) p(θ,t,ω) g(ω)dθ dω Ω 0 V(θ,t,ω) = h(θ,t s,ω) Iterative algorithm s Backward time Forward time t P. G. Mehta (UIUC) GERAD Apr. 20, / 75
98 Appendix Algorithm for solving PDEs Numerics HJB: t h + ω θ h = 1 2R ( θ h) 2 c(θ,t) + η σ θθ h h(θ,t,ω) FPK: t p + ω θ p = 1 R θ [p( θ h )] + σ 2 2 θθ 2 p p(θ,t,ω) 2π c(ϑ,t) = c (ϑ,θ) p(θ,t,ω) g(ω)dθ dω Ω 0 V(θ,t,ω) = h(θ,t s,ω) Iterative algorithm s Backward time Forward time t P. G. Mehta (UIUC) GERAD Apr. 20, / 75
99 Appendix Numerical solution of PDEs Numerics Incoherence; R = 60 incoherence ) R R > Rc(γ) (c) Incoherence 20 Synchrony γ p(θ,t,ω) at given position θ 0.2 γ = 0.01; Iter = γ = 0.01; Iter = γ = 0.01; Iter = p(θ,t,ω) p(θ,t,ω) p(θ,t,ω) t t t p(θ,t,ω) p(θ,t,ω) p(θ,t,ω) t t t Iter = 1 Iter=4 Iter=19 P. G. Mehta (UIUC) GERAD Apr. 20, / 75
100 Appendix Numerical solution of PDEs Numerics synchrony Synchrony; R = 10 synchrony p(θ,t,ω) at given position θ 0.25 γ = 0.01; Iter = γ = 0.01; Iter = 4 1 γ = 0.01; Iter = 19 p(θ,t,ω) p(θ,t,ω) p(θ,t,ω) t t t 1 p(θ,t,ω) p(θ,t,ω) p(θ,t,ω) t t t Iter = 1 Iter=4 Iter=19 P. G. Mehta (UIUC) GERAD Apr. 20, / 75
101 6 Appendix Game-theoretic results (ε-nash) Stability Numerics Linearization/Spectra Modeling Multiple oscillators Learning The New Yorker excerpt 7 Bibliography
102 Linearization Appendix Linearization/Spectra ( t z(θ,t,ω) = Eigenvector problem λz = L R Z ω θ h c σ θθ h ω θ p + 1 2πR 2 θθ h + σ θθ p Fourier series expansion with respect to θ H = + k= H k (ω)e ikθ, P = + k= P k (ω)e ikθ Decomposition of the linear operator L R = k L (k) R ) P. G. Mehta (UIUC) GERAD Apr. 20, / 75
103 Linearization (cont.) Appendix Linearization/Spectra Individual operator ( σ 2 L (1) R := 2 ωi π 4 Ω g(ω)dω L (k) R := 1 2πR σ 2 2 ( ωi σ 2 2 k2 kωi 0 ) k2 2πR σ 2 2 k2 kωi L ( k) R = L (k) R Continuous spectrum {S (k) } + k= S (k) := {λ C λ = ± σ 2 } 2 k2 kωi for all ω Ω ), k 2 Discrete spectrum coincides with the discrete spectrum of L (±1) R P. G. Mehta (UIUC) GERAD Apr. 20, / 75
104 6 Appendix Game-theoretic results (ε-nash) Stability Numerics Linearization/Spectra Modeling Multiple oscillators Learning The New Yorker excerpt 7 Bibliography
105 Neuron Appendix Modeling P. G. Mehta (UIUC) GERAD Apr. 20, / 75
106 Neuron Appendix Modeling Equivalent circuits P. G. Mehta (UIUC) GERAD Apr. 20, / 75
107 Finite oscillator model Appendix Modeling Dynamics of i th oscillator [10]: dθ i = (ω i + u i (t))dt + σ dξ i, i = 1,...,N, t 0 (1) θ i phase of the oscillator {ξ i } mutually independent Wiener processes ω i constant and chosen independently according to a fixed distribution with density g u i (t) control P. G. Mehta (UIUC) GERAD Apr. 20, / 75
108 Appendix Finite oscillator model (cont.) Modeling i th oscillator seeks to minimize η (POP) 1 T i (u i ;u i ) = lim E[ c(θ i ;θ i ) + 1 T T 0 }{{} 2 Ru2 i ]ds }{{} cost of anarchy cost of control θ i = (θ j ) j i R control penalty c( ) cost function c(θ i ;θ i ) = 1 N j i c (θ i,θ j ), c 0 {u i } N i=1 Nash equilibrium if u i minimizes η (POP) i (u i ;u i), for i = 1,...,N. P. G. Mehta (UIUC) GERAD Apr. 20, / 75
109 Intuition Appendix Modeling i th oscillator seeks to minimize η (POP) 1 T i (u i ;u i ) = lim E[ c(θ i ;θ i ) + 1 T T 0 }{{} 2 Ru2 i ]ds }{{} cost of anarchy cost of control Trade-off between reducing the two costs Cost associated with θ i θ j : c(θ i ;θ i ) = 1 N j i c (θ i θ j ) Cost associated with control: 1 2 Ru2 i Qualitative macro-behavior change when R varies ) R R > Rc(γ) (c) Incoherence Synchrony P. G. Mehta (UIUC) GERAD Apr. 20, / 75 γ
110 6 Appendix Game-theoretic results (ε-nash) Stability Numerics Linearization/Spectra Modeling Multiple oscillators Learning The New Yorker excerpt 7 Bibliography
111 PDE model (as N ) Appendix Multiple oscillators HJB equation obtain optimal control t h + ω θ h = 1 2R ( θ h) 2 c(θ,t) + η σ θθ h (2) h(θ,t,ω) relative value function η (ω) average optimal cost Optimal feed-back control law ϕ(θ,t,ω) := 1 R θ h(θ,t,ω) P. G. Mehta (UIUC) GERAD Apr. 20, / 75
112 PDE model (as N ) Appendix Multiple oscillators HJB equation obtain optimal control t h + ω θ h = 1 2R ( θ h) 2 c(θ,t) + η σ θθ h (2) h(θ,t,ω) relative value function η (ω) average optimal cost Optimal feed-back control law ϕ(θ,t,ω) := 1 R θ h(θ,t,ω) Fokker-Planck-Kolmogorov (FPK) equation dθ i (t) = (ω i + u i (t))dt + σ dξ i, i = 1,...,N ) replaced by: t p + θ ((ω + ϕ )p = σ 2 2 θθ 2 p p(θ,t,ω) probability density function [10] Strogatz et al., Journal of Statistical Physics, 1991 P. G. Mehta (UIUC) GERAD Apr. 20, / 75
113 PDE model (as N ) Appendix Multiple oscillators HJB equation obtain optimal control t h + ω θ h = 1 2R ( θ h) 2 c(θ,t) + η σ θθ h (2) h(θ,t,ω) relative value function η (ω) average optimal cost Optimal feed-back control law ϕ(θ,t,ω) := 1 R θ h(θ,t,ω) Fokker-Planck-Kolmogorov (FPK) equation dθ i (t) = (ω i + u i (t))dt + σ dξ i, i = 1,...,N ) replaced by: t p + θ ((ω + ϕ )p = σ 2 2 θθ 2 p p(θ,t,ω) probability density function [10] Strogatz et al., Journal of Statistical Physics, 1991 P. G. Mehta (UIUC) GERAD Apr. 20, / 75
114 PDE model (as N ) Appendix Multiple oscillators HJB equation obtain optimal control t h + ω θ h = 1 2R ( θ h) 2 c(θ,t) + η σ θθ h (2) h(θ,t,ω) relative value function η (ω) average optimal cost Optimal feed-back control law ϕ(θ,t,ω) := 1 R θ h(θ,t,ω) FPK equation evolution of population phase angles t p + ω θ p = 1 R θ [p( θ h)] + σ θθ p (3) P. G. Mehta (UIUC) GERAD Apr. 20, / 75
115 PDE model (as N ) Appendix Multiple oscillators HJB equation obtain optimal control t h + ω θ h = 1 2R ( θ h) 2 c(θ,t) + η σ θθ h (2) h(θ,t,ω) relative value function η (ω) average optimal cost Optimal feed-back control law ϕ(θ,t,ω) := 1 R θ h(θ,t,ω) FPK equation evolution of population phase angles t p + ω θ p = 1 R θ [p( θ h)] + σ 2 2 θθ 2 p (3) 2π Generate cost: c (ϑ,θ)p(θ,t,ω)g(ω)dθ dω Ω 0 P. G. Mehta (UIUC) GERAD Apr. 20, / 75
116 Consistency Appendix Multiple oscillators Consistency requirement on deterministic mass influence 2π c(ϑ,t) = c (ϑ,θ)p(θ,t,ω)g(ω)dθ dω Ω 0 Recall: c(θ i ;θ i ) = 1 N j i c (θ i,θ j ) P. G. Mehta (UIUC) GERAD Apr. 20, / 75
117 6 Appendix Game-theoretic results (ε-nash) Stability Numerics Linearization/Spectra Modeling Multiple oscillators Learning The New Yorker excerpt 7 Bibliography
118 Q-learning Appendix Learning Dynamics of i th d agent dt x i = a i x i + b i u i i th agent seeks to minimize Control u i = k i xx i k i zz T η i = lim e γs ( (x i (s) z(s)) 2 + u 2 i (s) ) ds T 0 (individual state) (ensemble state) Agent [8] P. Mehta and S. Meyn, CDC, 2009 P. G. Mehta (UIUC) GERAD Apr. 20, / 75
119 Control comparison Appendix Learning PDE Learning ω = 0.95 ω = 0.1 ω = Control Amplitude Control Phase P. G. Mehta (UIUC) GERAD Apr. 20, / 75
120 6 Appendix Game-theoretic results (ε-nash) Stability Numerics Linearization/Spectra Modeling Multiple oscillators Learning The New Yorker excerpt 7 Bibliography
121 Rational irrationality Appendix The New Yorker excerpt The Prisoner s Dilemma is the obverse of Adam Smith s theory of the invisible hand, in which the free market coördinates the behavior of self-seeking individuals to the benefit of all. Each businessman intends only his own gain, Smith wrote in The Wealth of Nations, and he is in this, as in many other cases, led by an invisible hand to promote an end which was no part of his intention. But in a market environment the individual pursuit of self-interest, however rational, can give way to collective disaster. The invisible hand becomes a fist. John Cassidy, Rational Irrationality: The real reason that capitalism is so crash-prone, The New Yorker, 2009 P. G. Mehta (UIUC) GERAD Apr. 20, / 75
122 Bibliography Dimitri P. Bertsekas. Dynamic Programming and Optimal Control, volume 1. Athena Scientific, Belmont, Massachusetts, Eric Brown, Jeff Moehlis, and Philip Holmes. On the phase reduction and response dynamics of neural oscillator populations. Neural Computation, 16(4): , M. Dellnitz, J.E. Marsden, I. Melbourne, and J. Scheurle. Generic bifurcations of pendula. Int. Series Num. Math., 104: , J. Guckenheimer. Isochrons and phaseless sets. J. Math. Biol., 1: , Minyi Huang, Peter E. Caines, and Roland P. Malhame. P. G. Mehta (UIUC) GERAD Apr. 20, / 75
123 Bibliography Large-population cost-coupled LQG problems with nonuniform agents: Individual-mass behavior and decentralized ε-nash equilibria. IEEE transactions on automatic control, 52(9): , Y. Kuramoto. International Symposium on Mathematical Problems in Theoretical Physics, volume 39 of Lecture Notes in Physics. Springer-Verlag, Andrzej Lasota and Michael C. Mackey. Chaos, Fractals and Noise. Springer, P. Mehta and S. Meyn. Q-learning and Pontryagin s Minimum Principle. To appear, 48th IEEE Conference on Decision and Control, December Sean P. Meyn. P. G. Mehta (UIUC) GERAD Apr. 20, / 75
124 Bibliography The policy iteration algorithm for average reward markov decision processes with general state space. IEEE Transactions on Automatic Control, 42(12): , December S. H. Strogatz and R. E. Mirollo. Stability of incoherence in a population of coupled oscillators. Journal of Statistical Physics, 63: , May Steven H. Strogatz, Daniel M. Abrams, Bruno Eckhardt, and Edward Ott. Theoretical mechanics: Crowd synchrony on the millennium bridge. Nature, 438:43 44, P. G. Mehta (UIUC) GERAD Apr. 20, / 75
(1) I. INTRODUCTION (2)
920 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 57, NO. 4, APRIL 2012 Synchronization of Coupled Oscillators is a Game Huibing Yin, Member, IEEE, Prashant G. Mehta, Member, IEEE, Sean P. Meyn, Fellow,
More informationBayesian Inference with Oscillator Models: A Possible Role of Neural Rhythms
Bayesian Inference with Oscillator Models: A Possible Role of Neural Rhythms Qualcomm/Brain Corporation/INC Lecture Series on Computational Neuroscience University of California, San Diego, March 5, 2012
More informationc 2012 by Huibing Yin. All rights reserved.
c 212 by Huibing Yin. All rights reserved. NONCOOPERATIVE STATIC AND DYNAMIC GAMES: ADDRESSING SHARED CONSTRAINTS AND PHASE TRANSITIONS BY HUIBING YIN DISSERTATION Submitted in partial fulfillment of the
More informationDynamical systems in neuroscience. Pacific Northwest Computational Neuroscience Connection October 1-2, 2010
Dynamical systems in neuroscience Pacific Northwest Computational Neuroscience Connection October 1-2, 2010 What do I mean by a dynamical system? Set of state variables Law that governs evolution of state
More informationFeedback Particle Filter and its Application to Coupled Oscillators Presentation at University of Maryland, College Park, MD
Feedback Particle Filter and its Application to Coupled Oscillators Presentation at University of Maryland, College Park, MD Prashant Mehta Dept. of Mechanical Science and Engineering and the Coordinated
More informationLarge-population, dynamical, multi-agent, competitive, and cooperative phenomena occur in a wide range of designed
Definition Mean Field Game (MFG) theory studies the existence of Nash equilibria, together with the individual strategies which generate them, in games involving a large number of agents modeled by controlled
More informationRisk-Sensitive and Robust Mean Field Games
Risk-Sensitive and Robust Mean Field Games Tamer Başar Coordinated Science Laboratory Department of Electrical and Computer Engineering University of Illinois at Urbana-Champaign Urbana, IL - 6181 IPAM
More informationQuasi Stochastic Approximation American Control Conference San Francisco, June 2011
Quasi Stochastic Approximation American Control Conference San Francisco, June 2011 Sean P. Meyn Joint work with Darshan Shirodkar and Prashant Mehta Coordinated Science Laboratory and the Department of
More informationSynchronization and Phase Oscillators
1 Synchronization and Phase Oscillators Richard Bertram Department of Mathematics and Programs in Neuroscience and Molecular Biophysics Florida State University Tallahassee, Florida 32306 Synchronization
More informationPhase Synchronization
Phase Synchronization Lecture by: Zhibin Guo Notes by: Xiang Fan May 10, 2016 1 Introduction For any mode or fluctuation, we always have where S(x, t) is phase. If a mode amplitude satisfies ϕ k = ϕ k
More informationMath 46, Applied Math (Spring 2008): Final
Math 46, Applied Math (Spring 2008): Final 3 hours, 80 points total, 9 questions, roughly in syllabus order (apart from short answers) 1. [16 points. Note part c, worth 7 points, is independent of the
More informationSpiral Wave Chimeras
Spiral Wave Chimeras Carlo R. Laing IIMS, Massey University, Auckland, New Zealand Erik Martens MPI of Dynamics and Self-Organization, Göttingen, Germany Steve Strogatz Dept of Maths, Cornell University,
More information5 Applying the Fokker-Planck equation
5 Applying the Fokker-Planck equation We begin with one-dimensional examples, keeping g = constant. Recall: the FPE for the Langevin equation with η(t 1 )η(t ) = κδ(t 1 t ) is = f(x) + g(x)η(t) t = x [f(x)p
More informationControlled Diffusions and Hamilton-Jacobi Bellman Equations
Controlled Diffusions and Hamilton-Jacobi Bellman Equations Emo Todorov Applied Mathematics and Computer Science & Engineering University of Washington Winter 2014 Emo Todorov (UW) AMATH/CSE 579, Winter
More informationSynchronization Transitions in Complex Networks
Synchronization Transitions in Complex Networks Y. Moreno 1,2,3 1 Institute for Biocomputation and Physics of Complex Systems (BIFI) University of Zaragoza, Zaragoza 50018, Spain 2 Department of Theoretical
More informationMean-Field optimization problems and non-anticipative optimal transport. Beatrice Acciaio
Mean-Field optimization problems and non-anticipative optimal transport Beatrice Acciaio London School of Economics based on ongoing projects with J. Backhoff, R. Carmona and P. Wang Robust Methods in
More informationStochastic differential equations in neuroscience
Stochastic differential equations in neuroscience Nils Berglund MAPMO, Orléans (CNRS, UMR 6628) http://www.univ-orleans.fr/mapmo/membres/berglund/ Barbara Gentz, Universität Bielefeld Damien Landon, MAPMO-Orléans
More informationChimera states in networks of biological neurons and coupled damped pendulums
in neural models in networks of pendulum-like elements in networks of biological neurons and coupled damped pendulums J. Hizanidis 1, V. Kanas 2, A. Bezerianos 3, and T. Bountis 4 1 National Center for
More informationHJB equations. Seminar in Stochastic Modelling in Economics and Finance January 10, 2011
Department of Probability and Mathematical Statistics Faculty of Mathematics and Physics, Charles University in Prague petrasek@karlin.mff.cuni.cz Seminar in Stochastic Modelling in Economics and Finance
More informationPhase Oscillators. and at r, Hence, the limit cycle at r = r is stable if and only if Λ (r ) < 0.
1 Phase Oscillators Richard Bertram Department of Mathematics and Programs in Neuroscience and Molecular Biophysics Florida State University Tallahassee, Florida 32306 2 Phase Oscillators Oscillations
More informationDynamical modelling of systems of coupled oscillators
Dynamical modelling of systems of coupled oscillators Mathematical Neuroscience Network Training Workshop Edinburgh Peter Ashwin University of Exeter 22nd March 2009 Peter Ashwin (University of Exeter)
More informationNeuroscience applications: isochrons and isostables. Alexandre Mauroy (joint work with I. Mezic)
Neuroscience applications: isochrons and isostables Alexandre Mauroy (joint work with I. Mezic) Outline Isochrons and phase reduction of neurons Koopman operator and isochrons Isostables of excitable systems
More informationMEAN FIELD GAMES WITH MAJOR AND MINOR PLAYERS
MEAN FIELD GAMES WITH MAJOR AND MINOR PLAYERS René Carmona Department of Operations Research & Financial Engineering PACM Princeton University CEMRACS - Luminy, July 17, 217 MFG WITH MAJOR AND MINOR PLAYERS
More informationChaotic motion. Phys 420/580 Lecture 10
Chaotic motion Phys 420/580 Lecture 10 Finite-difference equations Finite difference equation approximates a differential equation as an iterative map (x n+1,v n+1 )=M[(x n,v n )] Evolution from time t
More informationTraveling waves dans le modèle de Kuramoto quenched
Traveling waves dans le modèle de Kuramoto quenched Eric Luçon MAP5 - Université Paris Descartes Grenoble - Journées MAS 2016 31 août 2016 Travail en commun avec Christophe Poquet (Lyon 1) [Giacomin, L.,
More informationCollective and Stochastic Effects in Arrays of Submicron Oscillators
DYNAMICS DAYS: Long Beach, 2005 1 Collective and Stochastic Effects in Arrays of Submicron Oscillators Ron Lifshitz (Tel Aviv), Jeff Rogers (HRL, Malibu), Oleg Kogan (Caltech), Yaron Bromberg (Tel Aviv),
More informationON MEAN FIELD GAMES. Pierre-Louis LIONS. Collège de France, Paris. (joint project with Jean-Michel LASRY)
Collège de France, Paris (joint project with Jean-Michel LASRY) I INTRODUCTION II A REALLY SIMPLE EXAMPLE III GENERAL STRUCTURE IV TWO PARTICULAR CASES V OVERVIEW AND PERSPECTIVES I. INTRODUCTION New class
More informationStochastic Gradient Descent in Continuous Time
Stochastic Gradient Descent in Continuous Time Justin Sirignano University of Illinois at Urbana Champaign with Konstantinos Spiliopoulos (Boston University) 1 / 27 We consider a diffusion X t X = R m
More informationControl Theory in Physics and other Fields of Science
Michael Schulz Control Theory in Physics and other Fields of Science Concepts, Tools, and Applications With 46 Figures Sprin ger 1 Introduction 1 1.1 The Aim of Control Theory 1 1.2 Dynamic State of Classical
More informationMulti-dimensional Feedback Particle Filter for Coupled Oscillators
3 American Control Conference (ACC) Washington, DC, USA, June 7-9, 3 Multi-dimensional Feedback Particle Filter for Coupled Oscillators Adam K. Tilton, Prashant G. Mehta and Sean P. Meyn Abstract This
More informationMean field equilibria of multiarmed bandit games
Mean field equilibria of multiarmed bandit games Ramesh Johari Stanford University Joint work with Ramki Gummadi (Stanford University) and Jia Yuan Yu (IBM Research) A look back: SN 2000 2 Overview What
More informationExternal Periodic Driving of Large Systems of Globally Coupled Phase Oscillators
External Periodic Driving of Large Systems of Globally Coupled Phase Oscillators T. M. Antonsen Jr., R. T. Faghih, M. Girvan, E. Ott and J. Platig Institute for Research in Electronics and Applied Physics
More informationApproximation of Top Lyapunov Exponent of Stochastic Delayed Turning Model Using Fokker-Planck Approach
Approximation of Top Lyapunov Exponent of Stochastic Delayed Turning Model Using Fokker-Planck Approach Henrik T. Sykora, Walter V. Wedig, Daniel Bachrathy and Gabor Stepan Department of Applied Mechanics,
More informationA conjecture on sustained oscillations for a closed-loop heat equation
A conjecture on sustained oscillations for a closed-loop heat equation C.I. Byrnes, D.S. Gilliam Abstract The conjecture in this paper represents an initial step aimed toward understanding and shaping
More informationLecture 8: Policy Gradient
Lecture 8: Policy Gradient Hado van Hasselt Outline 1 Introduction 2 Finite Difference Policy Gradient 3 Monte-Carlo Policy Gradient 4 Actor-Critic Policy Gradient Introduction Vapnik s rule Never solve
More informationNonlinear and Collective Effects in Mesoscopic Mechanical Oscillators
Dynamics Days Asia-Pacific: Singapore, 2004 1 Nonlinear and Collective Effects in Mesoscopic Mechanical Oscillators Alexander Zumdieck (Max Planck, Dresden), Ron Lifshitz (Tel Aviv), Jeff Rogers (HRL,
More informationProceedings of Neural, Parallel, and Scientific Computations 4 (2010) xx-xx PHASE OSCILLATOR NETWORK WITH PIECEWISE-LINEAR DYNAMICS
Proceedings of Neural, Parallel, and Scientific Computations 4 (2010) xx-xx PHASE OSCILLATOR NETWORK WITH PIECEWISE-LINEAR DYNAMICS WALTER GALL, YING ZHOU, AND JOSEPH SALISBURY Department of Mathematics
More informationMean Field Stochastic Control
Mean Field Stochastic Control Peter E. Caines 50 Years of Nonlinear Control and Optimization IFAC Workshop The Royal Society London, 2010 McGill University 1 / 72 Co-Authors Minyi Huang Roland Malhamé
More informationEntrainment and Chaos in the Hodgkin-Huxley Oscillator
Entrainment and Chaos in the Hodgkin-Huxley Oscillator Kevin K. Lin http://www.cims.nyu.edu/ klin Courant Institute, New York University Mostly Biomath - 2005.4.5 p.1/42 Overview (1) Goal: Show that the
More informationNew Homoclinic and Heteroclinic Solutions for Zakharov System
Commun. Theor. Phys. 58 (2012) 749 753 Vol. 58, No. 5, November 15, 2012 New Homoclinic and Heteroclinic Solutions for Zakharov System WANG Chuan-Jian ( ), 1 DAI Zheng-De (à ), 2, and MU Gui (½ ) 3 1 Department
More information10.1 Review of Parametric Equations
10.1 Review of Parametric Equations Recall that often, instead of representing a curve using just x and y (called a Cartesian equation), it is more convenient to define x and y using parametric equations
More informationUNIVERSITY OF MANITOBA
Question Points Score INSTRUCTIONS TO STUDENTS: This is a 6 hour examination. No extra time will be given. No texts, notes, or other aids are permitted. There are no calculators, cellphones or electronic
More informationMean Field Consumption-Accumulation Optimization with HAR
Mean Field Consumption-Accumulation Optimization with HARA Utility School of Mathematics and Statistics Carleton University, Ottawa Workshop on Mean Field Games and Related Topics 2 Padova, September 4-7,
More informationWeakly interacting particle systems on graphs: from dense to sparse
Weakly interacting particle systems on graphs: from dense to sparse Ruoyu Wu University of Michigan (Based on joint works with Daniel Lacker and Kavita Ramanan) USC Math Finance Colloquium October 29,
More informationMathematical Methods for Neurosciences. ENS - Master MVA Paris 6 - Master Maths-Bio ( )
Mathematical Methods for Neurosciences. ENS - Master MVA Paris 6 - Master Maths-Bio (2014-2015) Etienne Tanré - Olivier Faugeras INRIA - Team Tosca November 26th, 2014 E. Tanré (INRIA - Team Tosca) Mathematical
More informationDynamic Bertrand and Cournot Competition
Dynamic Bertrand and Cournot Competition Effects of Product Differentiation Andrew Ledvina Department of Operations Research and Financial Engineering Princeton University Joint with Ronnie Sircar Princeton-Lausanne
More informationPontryagin s maximum principle
Pontryagin s maximum principle Emo Todorov Applied Mathematics and Computer Science & Engineering University of Washington Winter 2012 Emo Todorov (UW) AMATH/CSE 579, Winter 2012 Lecture 5 1 / 9 Pontryagin
More informationBIFURCATION PHENOMENA Lecture 1: Qualitative theory of planar ODEs
BIFURCATION PHENOMENA Lecture 1: Qualitative theory of planar ODEs Yuri A. Kuznetsov August, 2010 Contents 1. Solutions and orbits. 2. Equilibria. 3. Periodic orbits and limit cycles. 4. Homoclinic orbits.
More informationIntroduction to asymptotic techniques for stochastic systems with multiple time-scales
Introduction to asymptotic techniques for stochastic systems with multiple time-scales Eric Vanden-Eijnden Courant Institute Motivating examples Consider the ODE {Ẋ = Y 3 + sin(πt) + cos( 2πt) X() = x
More informationCDS 101/110a: Lecture 2.1 Dynamic Behavior
CDS 11/11a: Lecture 2.1 Dynamic Behavior Richard M. Murray 6 October 28 Goals: Learn to use phase portraits to visualize behavior of dynamical systems Understand different types of stability for an equilibrium
More informationThe Hamiltonian Mean Field Model: Effect of Network Structure on Synchronization Dynamics. Yogesh Virkar University of Colorado, Boulder.
The Hamiltonian Mean Field Model: Effect of Network Structure on Synchronization Dynamics Yogesh Virkar University of Colorado, Boulder. 1 Collaborators Prof. Juan G. Restrepo Prof. James D. Meiss Department
More informationIntroduction to Mean Field Game Theory Part I
School of Mathematics and Statistics Carleton University Ottawa, Canada Workshop on Game Theory, Institute for Math. Sciences National University of Singapore, June 2018 Outline of talk Background and
More informationStochastic Processes. M. Sami Fadali Professor of Electrical Engineering University of Nevada, Reno
Stochastic Processes M. Sami Fadali Professor of Electrical Engineering University of Nevada, Reno 1 Outline Stochastic (random) processes. Autocorrelation. Crosscorrelation. Spectral density function.
More informationA Barrier Version of the Russian Option
A Barrier Version of the Russian Option L. A. Shepp, A. N. Shiryaev, A. Sulem Rutgers University; shepp@stat.rutgers.edu Steklov Mathematical Institute; shiryaev@mi.ras.ru INRIA- Rocquencourt; agnes.sulem@inria.fr
More informationOn continuous time contract theory
Ecole Polytechnique, France Journée de rentrée du CMAP, 3 octobre, 218 Outline 1 2 Semimartingale measures on the canonical space Random horizon 2nd order backward SDEs (Static) Principal-Agent Problem
More informationLINEAR DISPERSIVE WAVES
LINEAR DISPERSIVE WAVES Nathaniel Egwu December 16, 2009 Outline 1 INTRODUCTION Dispersion Relations Definition of Dispersive Waves 2 SOLUTION AND ASYMPTOTIC ANALYSIS The Beam Equation General Solution
More informationSystems Analysis and Control
Systems Analysis and Control Matthew M. Peet Illinois Institute of Technology Lecture 12: Overview In this Lecture, you will learn: Review of Feedback Closing the Loop Pole Locations Changing the Gain
More informationy = x 3 and y = 2x 2 x. 2x 2 x = x 3 x 3 2x 2 + x = 0 x(x 2 2x + 1) = 0 x(x 1) 2 = 0 x = 0 and x = (x 3 (2x 2 x)) dx
Millersville University Name Answer Key Mathematics Department MATH 2, Calculus II, Final Examination May 4, 2, 8:AM-:AM Please answer the following questions. Your answers will be evaluated on their correctness,
More informationNETWORKS Lecture 2: Synchronization Fundamentals
Short-course: Complex Systems Beyond the Metaphor UNSW, February 2007 NETWORKS Lecture 2: Synchronization Fundamentals David J Hill Research School of Information Sciences and Engineering The ANU 8/2/2007
More informationInverse Optimality Design for Biological Movement Systems
Inverse Optimality Design for Biological Movement Systems Weiwei Li Emanuel Todorov Dan Liu Nordson Asymtek Carlsbad CA 921 USA e-mail: wwli@ieee.org. University of Washington Seattle WA 98195 USA Google
More informationMultisoliton Interaction of Perturbed Manakov System: Effects of External Potentials
Multisoliton Interaction of Perturbed Manakov System: Effects of External Potentials Michail D. Todorov Faculty of Applied Mathematics and Computer Science Technical University of Sofia, Bulgaria (Work
More informationRobotics. Control Theory. Marc Toussaint U Stuttgart
Robotics Control Theory Topics in control theory, optimal control, HJB equation, infinite horizon case, Linear-Quadratic optimal control, Riccati equations (differential, algebraic, discrete-time), controllability,
More informationTopic 5: The Difference Equation
Topic 5: The Difference Equation Yulei Luo Economics, HKU October 30, 2017 Luo, Y. (Economics, HKU) ME October 30, 2017 1 / 42 Discrete-time, Differences, and Difference Equations When time is taken to
More informationA new approach for investment performance measurement. 3rd WCMF, Santa Barbara November 2009
A new approach for investment performance measurement 3rd WCMF, Santa Barbara November 2009 Thaleia Zariphopoulou University of Oxford, Oxford-Man Institute and The University of Texas at Austin 1 Performance
More informationBasics of reinforcement learning
Basics of reinforcement learning Lucian Buşoniu TMLSS, 20 July 2018 Main idea of reinforcement learning (RL) Learn a sequential decision policy to optimize the cumulative performance of an unknown system
More informationDynamical Systems and Chaos Part II: Biology Applications. Lecture 10: Coupled Systems. Ilya Potapov Mathematics Department, TUT Room TD325
Dynamical Systems and Chaos Part II: Biology Applications Lecture 10: Coupled Systems. Ilya Potapov Mathematics Department, TUT Room TD325 Foreword In order to model populations of physical/biological
More informationCheap talk theoretic tools for distributed sensing in the presence of strategic sensors
1 / 37 Cheap talk theoretic tools for distributed sensing in the presence of strategic sensors Cédric Langbort Department of Aerospace Engineering & Coordinated Science Laboratory UNIVERSITY OF ILLINOIS
More informationReinforcement Learning
1 Reinforcement Learning Chris Watkins Department of Computer Science Royal Holloway, University of London July 27, 2015 2 Plan 1 Why reinforcement learning? Where does this theory come from? Markov decision
More informationCDS 101/110a: Lecture 2.1 Dynamic Behavior
CDS 11/11a: Lecture.1 Dynamic Behavior Richard M. Murray 6 October 8 Goals: Learn to use phase portraits to visualize behavior of dynamical systems Understand different types of stability for an equilibrium
More informationThe Theory of Weakly Coupled Oscillators
The Theory of Weakly Coupled Oscillators Michael A. Schwemmer and Timothy J. Lewis Department of Mathematics, One Shields Ave, University of California Davis, CA 95616 1 1 Introduction 2 3 4 5 6 7 8 9
More informationQ-Learning in Continuous State Action Spaces
Q-Learning in Continuous State Action Spaces Alex Irpan alexirpan@berkeley.edu December 5, 2015 Contents 1 Introduction 1 2 Background 1 3 Q-Learning 2 4 Q-Learning In Continuous Spaces 4 5 Experimental
More informationMathematical Foundations of Neuroscience - Lecture 7. Bifurcations II.
Mathematical Foundations of Neuroscience - Lecture 7. Bifurcations II. Filip Piękniewski Faculty of Mathematics and Computer Science, Nicolaus Copernicus University, Toruń, Poland Winter 2009/2010 Filip
More informationFINAL EXAM MATH303 Theory of Ordinary Differential Equations. Spring dx dt = x + 3y dy dt = x y.
FINAL EXAM MATH0 Theory of Ordinary Differential Equations There are 5 problems on 2 pages. Spring 2009. 25 points Consider the linear plane autonomous system x + y x y. Find a fundamental matrix of the
More informationLecture 11 : Overview
Lecture 11 : Overview Error in Assignment 3 : In Eq. 1, Hamiltonian should be H = p2 r 2m + p2 ϕ 2mr + (p z ea z ) 2 2 2m + eφ (1) Error in lecture 10, slide 7, Eq. (21). Should be S(q, α, t) m Q = β =
More informationMajid Sodagar, 1 Patrick Chang, 1 Edward Coyler, 1 and John Parke 1 School of Physics, Georgia Institute of Technology, Atlanta, Georgia 30332, USA
Experimental Characterization of Chua s Circuit Majid Sodagar, 1 Patrick Chang, 1 Edward Coyler, 1 and John Parke 1 School of Physics, Georgia Institute of Technology, Atlanta, Georgia 30332, USA (Dated:
More informationPHASE-LOCKED SOLUTIONS IN A HUB CONNECTED OSCILLATOR RING NETWORK
Copyright c 29 by ABCM PHASE-LOCKED SOLUTIONS IN A HUB CONNECTED OSCILLATOR RING NETWORK Jacqueline Bridge, Jacqueline.Bridge@sta.uwi.edu Department of Mechanical Engineering, The University of the West
More informationAdvanced Heat and Mass Transfer by Amir Faghri, Yuwen Zhang, and John R. Howell
Advanced Heat and Mass Transfer by Amir Faghri, Yuwen Zhang, and John R. Howell 9.2 The Blackbody as the Ideal Radiator A material that absorbs 100 percent of the energy incident on it from all directions
More information{ } is an asymptotic sequence.
AMS B Perturbation Methods Lecture 3 Copyright by Hongyun Wang, UCSC Recap Iterative method for finding asymptotic series requirement on the iteration formula to make it work Singular perturbation use
More informationNeural Excitability in a Subcritical Hopf Oscillator with a Nonlinear Feedback
Neural Excitability in a Subcritical Hopf Oscillator with a Nonlinear Feedback Gautam C Sethia and Abhijit Sen Institute for Plasma Research, Bhat, Gandhinagar 382 428, INDIA Motivation Neural Excitability
More informationMean Field Competitive Binary MDPs and Structured Solutions
Mean Field Competitive Binary MDPs and Structured Solutions Minyi Huang School of Mathematics and Statistics Carleton University Ottawa, Canada MFG217, UCLA, Aug 28 Sept 1, 217 1 / 32 Outline of talk The
More informationLearning Cycle Linear Hybrid Automata for Excitable Cells
Learning Cycle Linear Hybrid Automata for Excitable Cells Sayan Mitra Joint work with Radu Grosu, Pei Ye, Emilia Entcheva, I V Ramakrishnan, and Scott Smolka HSCC 2007 Pisa, Italy Excitable Cells Outline
More information46th IEEE CDC, New Orleans, USA, Dec , where θ rj (0) = j 1. Re 1 N. θ =
Proceedings of the 46th IEEE Conference on Decision and Control New Orleans, LA, USA, Dec. 12-14, 2007 INVITED SESSION NUMBER: 157 Event-Based Feedback Control of Nonlinear Oscillators Using Phase Response
More informationOn the Stability of the Best Reply Map for Noncooperative Differential Games
On the Stability of the Best Reply Map for Noncooperative Differential Games Alberto Bressan and Zipeng Wang Department of Mathematics, Penn State University, University Park, PA, 68, USA DPMMS, University
More informationAnalysis of Systems with State-Dependent Delay
Analysis of Systems with State-Dependent Delay Matthew M. Peet Arizona State University Tempe, AZ USA American Institute of Aeronautics and Astronautics Guidance, Navigation and Control Conference Boston,
More informationAMATH 353 Lecture 9. Weston Barger. How to classify PDEs as linear/nonlinear, order, homogeneous or non-homogeneous.
AMATH 353 ecture 9 Weston Barger 1 Exam What you need to know: How to classify PDEs as linear/nonlinear, order, homogeneous or non-homogeneous. The definitions for traveling wave, standing wave, wave train
More informationOn the Response of Neurons to Sinusoidal Current Stimuli: Phase Response Curves and Phase-Locking
On the Response of Neurons to Sinusoidal Current Stimuli: Phase Response Curves and Phase-Locking Michael J. Schaus and Jeff Moehlis Abstract A powerful technique for analyzing mathematical models for
More informationMulti-dimensional Stochastic Singular Control Via Dynkin Game and Dirichlet Form
Multi-dimensional Stochastic Singular Control Via Dynkin Game and Dirichlet Form Yipeng Yang * Under the supervision of Dr. Michael Taksar Department of Mathematics University of Missouri-Columbia Oct
More informationAlmost Sure Convergence of Two Time-Scale Stochastic Approximation Algorithms
Almost Sure Convergence of Two Time-Scale Stochastic Approximation Algorithms Vladislav B. Tadić Abstract The almost sure convergence of two time-scale stochastic approximation algorithms is analyzed under
More informationMean Field Control: Selected Topics and Applications
Mean Field Control: Selected Topics and Applications School of Mathematics and Statistics Carleton University Ottawa, Canada University of Michigan, Ann Arbor, Feb 2015 Outline of talk Background and motivation
More informationControl of McKean-Vlasov Dynamics versus Mean Field Games
Control of McKean-Vlasov Dynamics versus Mean Field Games René Carmona (a),, François Delarue (b), and Aimé Lachapelle (a), (a) ORFE, Bendheim Center for Finance, Princeton University, Princeton, NJ 8544,
More informationLecture 5. Outline: Limit Cycles. Definition and examples How to rule out limit cycles. Poincare-Bendixson theorem Hopf bifurcations Poincare maps
Lecture 5 Outline: Limit Cycles Definition and examples How to rule out limit cycles Gradient systems Liapunov functions Dulacs criterion Poincare-Bendixson theorem Hopf bifurcations Poincare maps Limit
More informationDynamical Systems & Lyapunov Stability
Dynamical Systems & Lyapunov Stability Harry G. Kwatny Department of Mechanical Engineering & Mechanics Drexel University Outline Ordinary Differential Equations Existence & uniqueness Continuous dependence
More informationEffects of Various Uncertainty Sources on Automatic Generation Control Systems
Effects of Various Uncertainty Sources on Automatic Generation Control Systems D. Apostolopoulou, Y. C. Chen, J. Zhang, A. D. Domínguez-García, and P. W. Sauer University of Illinois at Urbana-Champaign
More informationOrdinary Differential Equations (ODEs) Background. Video 17
Ordinary Differential Equations (ODEs) Background Video 17 Daniel J. Bodony Department of Aerospace Engineering University of Illinois at Urbana-Champaign In this video you will learn... 1 What ODEs are
More informationLinear-Quadratic-Gaussian (LQG) Controllers and Kalman Filters
Linear-Quadratic-Gaussian (LQG) Controllers and Kalman Filters Emo Todorov Applied Mathematics and Computer Science & Engineering University of Washington Winter 204 Emo Todorov (UW) AMATH/CSE 579, Winter
More informationChaotic motion. Phys 750 Lecture 9
Chaotic motion Phys 750 Lecture 9 Finite-difference equations Finite difference equation approximates a differential equation as an iterative map (x n+1,v n+1 )=M[(x n,v n )] Evolution from time t =0to
More informationThe correlation between stochastic resonance and the average phase-synchronization time of a bistable system driven by colour-correlated noises
Chin. Phys. B Vol. 19, No. 1 (010) 01050 The correlation between stochastic resonance and the average phase-synchronization time of a bistable system driven by colour-correlated noises Dong Xiao-Juan(
More informationSolutions 2: Simple Harmonic Oscillator and General Oscillations
Massachusetts Institute of Technology MITES 2017 Physics III Solutions 2: Simple Harmonic Oscillator and General Oscillations Due Wednesday June 21, at 9AM under Rene García s door Preface: This problem
More informationActor-critic methods. Dialogue Systems Group, Cambridge University Engineering Department. February 21, 2017
Actor-critic methods Milica Gašić Dialogue Systems Group, Cambridge University Engineering Department February 21, 2017 1 / 21 In this lecture... The actor-critic architecture Least-Squares Policy Iteration
More informationMean-Field optimization problems and non-anticipative optimal transport. Beatrice Acciaio
Mean-Field optimization problems and non-anticipative optimal transport Beatrice Acciaio London School of Economics based on ongoing projects with J. Backhoff, R. Carmona and P. Wang Thera Stochastics
More information