Synchronization of coupled oscillators is a game. Prashant G. Mehta 1

Size: px
Start display at page:

Download "Synchronization of coupled oscillators is a game. Prashant G. Mehta 1"

Transcription

1 CSL COORDINATED SCIENCE LABORATORY Synchronization of coupled oscillators is a game Prashant G. Mehta 1 1 Coordinated Science Laboratory Department of Mechanical Science and Engineering University of Illinois at Urbana-Champaign GERAD seminar, April 20, 2010 Acknowledgment: AFOSR, NSF

2 Huibing Yin Sean P. Meyn Uday V. Shanbhag H. Yin, P. G. Mehta, S. P. Meyn and U. V. Shanbhag, Synchronization of coupled oscillators is a game, ACC 2010 P. G. Mehta (UIUC) GERAD Apr. 20, / 75

3 Acknowledgement Thanks to Minyi Huang, Roland P. Malhame, and Peter Caines Minyi Huang Roland P. Malhame Peter Caines M. Huang, P. Caines, and R. Malhame, Large-population cost-coupled LQG problems with nonuniform agents: Individual-mass behavior and decentralized ε-nash equilibria, IEEE TAC, 2007 P. G. Mehta (UIUC) GERAD Apr. 20, / 75

4 Millennium bridge Video of London Millennium bridge from youtube [11] S. H. Strogatz et al., Nature, 2005 P. G. Mehta (UIUC) GERAD Apr. 20, / 75

5 Classical Kuramoto model dθ i (t) = ( ω i + κ N N j=1 sin(θ j (t) θ i (t)) ) dt + σ dξ i (t), i = 1,...,N ω i taken from distribution g(ω) over [1 γ,1 + γ] γ measures the heterogeneity of the population κ measures the strength of coupling [6] Y. Kuramoto (1975) P. G. Mehta (UIUC) GERAD Apr. 20, / 75

6 Classical Kuramoto model dθ i (t) = ( ω i + κ N N j=1 sin(θ j (t) θ i (t)) ) dt + σ dξ i (t), i = 1,...,N ω i taken from distribution g(ω) over [1 γ,1 + γ] γ measures the heterogeneity of the population κ measures the strength of coupling [6] Y. Kuramoto (1975) P. G. Mehta (UIUC) GERAD Apr. 20, / 75

7 Classical Kuramoto model dθ i (t) = ( ω i + κ N N j=1 sin(θ j (t) θ i (t)) ) dt + σ dξ i (t), i = 1,...,N ω i taken from distribution g(ω) over [1 γ,1 + γ] γ measures the heterogeneity of the population κ measures the strength of coupling [6] Y. Kuramoto (1975) P. G. Mehta (UIUC) GERAD Apr. 20, / 75

8 Classical Kuramoto model dθ i (t) = ( ω i + κ N N j=1 sin(θ j (t) θ i (t)) ) dt + σ dξ i (t), i = 1,...,N ω i taken from distribution g(ω) over [1 γ,1 + γ] γ measures the heterogeneity of the population κ measures the strength of coupling κ 0.3 Synchrony Locking R κ < κ c (γ) Incoherence γ [6] Y. Kuramoto (1975) P. G. Mehta (UIUC) GERAD Apr. 20, / 75

9 Movies of incoherence and synchrony solution Incoherence Synchrony P. G. Mehta (UIUC) GERAD Apr. 20, / 75

10 Problem statement Dynamics of i th oscillator dθ i = (ω i + u i (t))dt + σ dξ i, i = 1,...,N, t 0 u i (t) control i th oscillator seeks to minimize 1 T η i (u i ;u i ) = lim E[ c(θ i ;θ i ) + 1 T T 0 }{{} 2 Ru2 i ]ds }{{} cost of anarchy cost of control θ i = (θ j ) j i R control penalty c( ) cost function c(θ i ;θ i ) = 1 N j i c (θ i,θ j ), c 0 P. G. Mehta (UIUC) GERAD Apr. 20, / 75

11 Problem statement Dynamics of i th oscillator dθ i = (ω i + u i (t))dt + σ dξ i, i = 1,...,N, t 0 u i (t) control i th oscillator seeks to minimize 1 T η i (u i ;u i ) = lim E[ c(θ i ;θ i ) + 1 T T 0 }{{} 2 Ru2 i ]ds }{{} cost of anarchy cost of control θ i = (θ j ) j i R control penalty c( ) cost function c(θ i ;θ i ) = 1 N j i c (θ i,θ j ), c 0 P. G. Mehta (UIUC) GERAD Apr. 20, / 75

12 Problem statement Dynamics of i th oscillator dθ i = (ω i + u i (t))dt + σ dξ i, i = 1,...,N, t 0 u i (t) control i th oscillator seeks to minimize 1 T η i (u i ;u i ) = lim E[ c(θ i ;θ i ) + 1 T T 0 }{{} 2 Ru2 i ]ds }{{} cost of anarchy cost of control θ i = (θ j ) j i R control penalty c( ) cost function c(θ i ;θ i ) = 1 N j i c (θ i,θ j ), c 0 P. G. Mehta (UIUC) GERAD Apr. 20, / 75

13 1 Motivation Why a game? Why Oscillators? 2 Problems and results Problem statement Main results 3 Derivation of model Overview Derivation steps PDE model 4 Analysis of phase transition Incoherence solution Bifurcation analysis Numerics 5 Learning Q-function approximation Steepest descent algorithm

14 Quiz Motivation Why a game? In the video you just watched, why were the individuals walking strangely? A. To show respect to the Queen. B. Anarchists in the crowd were trying to destabilize the bridge. C. They were stepping to the beat of the soundtrack "Walk Like an Egyptian." D. The individuals were trying to maintain their balance. P. G. Mehta (UIUC) GERAD Apr. 20, / 75

15 Quiz Motivation Why a game? In the video you just watched, why were the individuals walking strangely? A. To show respect to the Queen. B. Anarchists in the crowd were trying to destabilize the bridge. C. They were stepping to the beat of the soundtrack "Walk Like an Egyptian." D. The individuals were trying to maintain their balance. P. G. Mehta (UIUC) GERAD Apr. 20, / 75

16 Quiz Motivation Why a game? In the video you just watched, why were the individuals walking strangely? A. To show respect to the Queen. B. Anarchists in the crowd were trying to destabilize the bridge. C. They were stepping to the beat of the soundtrack "Walk Like an Egyptian." D. The individuals were trying to maintain their balance. P. G. Mehta (UIUC) GERAD Apr. 20, / 75

17 Quiz Motivation Why a game? In the video you just watched, why were the individuals walking strangely? A. To show respect to the Queen. B. Anarchists in the crowd were trying to destabilize the bridge. C. They were stepping to the beat of the soundtrack "Walk Like an Egyptian." D. The individuals were trying to maintain their balance. P. G. Mehta (UIUC) GERAD Apr. 20, / 75

18 Quiz Motivation Why a game? In the video you just watched, why were the individuals walking strangely? A. To show respect to the Queen. B. Anarchists in the crowd were trying to destabilize the bridge. C. They were stepping to the beat of the soundtrack "Walk Like an Egyptian." D. The individuals were trying to maintain their balance. P. G. Mehta (UIUC) GERAD Apr. 20, / 75

19 Rational irrationality Motivation Why a game? behavior that, on the individual level, is perfectly reasonable but that, when aggregated in the marketplace, produces calamity. Examples Millennium bridge Financial market John Cassidy, Rational Irrationality: The real reason that capitalism is so crash-prone, The New Yorker, 2009 P. G. Mehta (UIUC) GERAD Apr. 20, / 75

20 Rational irrationality Motivation Why a game? behavior that, on the individual level, is perfectly reasonable but that, when aggregated in the marketplace, produces calamity. Examples Millennium bridge Financial market John Cassidy, Rational Irrationality: The real reason that capitalism is so crash-prone, The New Yorker, 2009 P. G. Mehta (UIUC) GERAD Apr. 20, / 75

21 1 Motivation Why a game? Why Oscillators? 2 Problems and results Problem statement Main results 3 Derivation of model Overview Derivation steps PDE model 4 Analysis of phase transition Incoherence solution Bifurcation analysis Numerics 5 Learning Q-function approximation Steepest descent algorithm

22 Motivation Why Oscillators? Hodgkin-Huxley type Neuron model C dv dt = g T m 2 (V) h (V E T ) g h r (V E h )... dh dt = h (V) h τ h (V) dr dt = r (V) r τ r (V) Voltage Neural spike train time [4] J. Guckenheimer, J. Math. Biol., 1975; [2] J. Moehlis et al., Neural Computation, 2004 P. G. Mehta (UIUC) GERAD Apr. 20, / 75

23 Motivation Why Oscillators? Hodgkin-Huxley type Neuron model C dv dt = g T m 2 (V) h (V E T ) g h r (V E h )... dh dt = h (V) h τ h (V) dr dt = r (V) r τ r (V) Voltage Neural spike train time [4] J. Guckenheimer, J. Math. Biol., 1975; [2] J. Moehlis et al., Neural Computation, 2004 P. G. Mehta (UIUC) GERAD Apr. 20, / 75

24 r Motivation Why Oscillators? Hodgkin-Huxley type Neuron model C dv dt = g T m 2 (V) h (V E T ) g h r (V E h )... dh dt = h (V) h τ h (V) dr dt = r (V) r τ r (V) Voltage Neural spike train time r Limit cyle h h V 0 v [4] J. Guckenheimer, J. Math. Biol., 1975; [2] J. Moehlis et al., Neural Computation, 2004 P. G. Mehta (UIUC) GERAD Apr. 20, / 75

25 r Motivation Why Oscillators? Hodgkin-Huxley type Neuron model C dv dt = g T m 2 (V) h (V E T ) g h r (V E h )... dh dt = h (V) h τ h (V) dr dt = r (V) r τ r (V) Voltage Neural spike train time r Limit cyle Normal form reduction h h V 0 v θ i = ω i + u i Φ(θ i ) [4] J. Guckenheimer, J. Math. Biol., 1975; [2] J. Moehlis et al., Neural Computation, 2004 P. G. Mehta (UIUC) GERAD Apr. 20, / 75

26 1 Motivation Why a game? Why Oscillators? 2 Problems and results Problem statement Main results 3 Derivation of model Overview Derivation steps PDE model 4 Analysis of phase transition Incoherence solution Bifurcation analysis Numerics 5 Learning Q-function approximation Steepest descent algorithm

27 Problems and results Finite oscillator model Problem statement Dynamics of i th oscillator dθ i = (ω i + u i (t))dt + σ dξ i, i = 1,...,N, t 0 u i (t) control i th oscillator seeks to minimize 1 T η i (u i ;u i ) = lim E[ c(θ i ;θ i ) + 1 T T 0 }{{} 2 Ru2 i ]ds }{{} cost of anarchy cost of control θ i = (θ j ) j i R control penalty c( ) cost function c(θ i ;θ i ) = 1 N j i c (θ i,θ j ), c 0 P. G. Mehta (UIUC) GERAD Apr. 20, / 75

28 1 Motivation Why a game? Why Oscillators? 2 Problems and results Problem statement Main results 3 Derivation of model Overview Derivation steps PDE model 4 Analysis of phase transition Incoherence solution Bifurcation analysis Numerics 5 Learning Q-function approximation Steepest descent algorithm

29 Problems and results Main results 1. Synchronization is a solution of game R 1/ Synchrony Locking γ R > R c (γ) Incoherence γ dθ i = (ω i + u i )dt + σ dξ i 1 T η i (u i ;u i ) = lim E[c(θ i ;θ i ) + 1 T T 2 Ru2 i ]ds 0 Yin et al., ACC 2010 Strogatz et al., J. Stat. Phy., 1992 P. G. Mehta (UIUC) GERAD Apr. 20, / 75

30 Problems and results Main results 1. Synchronization is a solution of game R 1/ Synchrony Locking κ 0.3 Synchrony Locking 0.25 R γ R > R c (γ) Incoherence γ κ < κ c (γ) Incoherence γ dθ i = (ω i + u i )dt + σ dξ i 1 T η i (u i ;u i ) = lim E[c(θ i ;θ i ) + 1 T T 2 Ru2 i ]ds 0 ( ) dθ i = ω i + κ N N sin(θ j θ i ) dt + σ dξ i j=1 Yin et al., ACC 2010 Strogatz et al., J. Stat. Phy., 1992 P. G. Mehta (UIUC) GERAD Apr. 20, / 75

31 Problems and results Main results 2. Kuramoto control is approximately optimal 6 k = 0.01; R = Population Density 0 π 2 π θ u i = A i R 1 N sin(θ θ j (t)) j i Control laws ω = 1 Kuramoto A * A i t Learning algorithm: da i = ε... dt Yin et.al. CDC 2010 P. G. Mehta (UIUC) GERAD Apr. 20, / 75

32 1 Motivation Why a game? Why Oscillators? 2 Problems and results Problem statement Main results 3 Derivation of model Overview Derivation steps PDE model 4 Analysis of phase transition Incoherence solution Bifurcation analysis Numerics 5 Learning Q-function approximation Steepest descent algorithm

33 Derivation of model Overview of model derivation Overview dθ i = (ω i + u i (t))dt + σ dξ i 1 η i (u i ;u i ) = lim T T T 0 E[ c(θ i,t) Ru2 i ]ds Mass Influence Influence 1 Mean-field approximation Assumption: N c(θ i ;θ i (t)) = 1 N c (θ i,θ j ) c(θ,t) j i 2 Optimal control of single oscillator Decentralized control structure [5] M. Huang, P. Caines, and R. Malhame, IEEE TAC, 2007 [HCM] P. G. Mehta (UIUC) GERAD Apr. 20, / 75

34 1 Motivation Why a game? Why Oscillators? 2 Problems and results Problem statement Main results 3 Derivation of model Overview Derivation steps PDE model 4 Analysis of phase transition Incoherence solution Bifurcation analysis Numerics 5 Learning Q-function approximation Steepest descent algorithm

35 Derivation of model Derivation steps Single oscillator with given cost Dynamics of the oscillator dθ i = (ω i + u i (t))dt + σ dξ i, t 0 The cost function is assumed known 1 T η i (u i ; c) = lim E[ c(θ i ;θ i ) + 1 T T 2 Ru2 i (s)]ds 0 c(θ i (s),s) HJB equation: t h i + ω i θ h i = 1 2R ( θ h i ) 2 c(θ,t) + η i σ θθ h i Optimal control law: u i (t) = ϕ i (θ,t) = 1 R θ h i (θ,t) [1] D. P. Bertsekas (1995); [9] S. P. Meyn, IEEE TAC, 1997 P. G. Mehta (UIUC) GERAD Apr. 20, / 75

36 Derivation of model Derivation steps Single oscillator with optimal control Dynamics of the oscillator ( dθ i (t) = ω i 1 ) R θ h i (θ i,t) dt + σ dξ i (t) Fokker-Planck equation for pdf p(θ,t,ω i ) FPK: t p + ω i θ p = 1 R θ [p( θ h)] + σ θθ p [7] A. Lasota and M. C. Mackey, Chaos, Fractals and Noise, Springer 1994 P. G. Mehta (UIUC) GERAD Apr. 20, / 75

37 Derivation of model Mean-field Approximation Derivation steps HJB equation for population t h + ω θ h = 1 2R ( θ h) 2 c(θ,t) + η(ω) σ θθ h h(θ,t,ω) Population density t p + ω θ p = 1 R θ [p( θ h)] + σ 2 Enforce cost consistency 2π c(θ,t) = c (θ,ϑ)p(ϑ,t,ω)g(ω)dϑ dω Ω 0 1 N j i c (θ,ϑ) 2 2 θθ p p(θ,t,ω) P. G. Mehta (UIUC) GERAD Apr. 20, / 75

38 1 Motivation Why a game? Why Oscillators? 2 Problems and results Problem statement Main results 3 Derivation of model Overview Derivation steps PDE model 4 Analysis of phase transition Incoherence solution Bifurcation analysis Numerics 5 Learning Q-function approximation Steepest descent algorithm

39 Summary Derivation of model PDE model HJB: t h + ω θ h = 1 2R ( θ h) 2 c(θ,t) + η σ θθ h h(θ,t,ω) FPK: t p + ω θ p = 1 R θ [p( θ h )] + σ 2 2 θθ 2 p p(θ,t,ω) 2π Mean-field approx.: c(ϑ, t) = c (ϑ,θ) p(θ,t,ω) g(ω)dθ dω Ω 0 1 Bellman s optimality principle (H,J,B) 2 Propagation of chaos (F,P,K, Mckean, Vlasov,... ) 3 Mean-field approximation (Boltzmann, Kac,... ) 4 Connection to Nash game (Weintraub, HCM, Altman,... ) P. G. Mehta (UIUC) GERAD Apr. 20, / 75

40 Summary Derivation of model PDE model HJB: t h + ω θ h = 1 2R ( θ h) 2 c(θ,t) + η σ θθ h h(θ,t,ω) FPK: t p + ω θ p = 1 R θ [p( θ h )] + σ 2 2 θθ 2 p p(θ,t,ω) 2π Mean-field approx.: c(ϑ, t) = c (ϑ,θ) p(θ,t,ω) g(ω)dθ dω Ω 0 1 Bellman s optimality principle (H,J,B) 2 Propagation of chaos (F,P,K, Mckean, Vlasov,... ) 3 Mean-field approximation (Boltzmann, Kac,... ) 4 Connection to Nash game (Weintraub, HCM, Altman,... ) P. G. Mehta (UIUC) GERAD Apr. 20, / 75

41 Summary Derivation of model PDE model HJB: t h + ω θ h = 1 2R ( θ h) 2 c(θ,t) + η σ θθ h h(θ,t,ω) FPK: t p + ω θ p = 1 R θ [p( θ h )] + σ 2 2 θθ 2 p p(θ,t,ω) 2π Mean-field approx.: c(ϑ, t) = c (ϑ,θ) p(θ,t,ω) g(ω)dθ dω Ω 0 1 Bellman s optimality principle (H,J,B) 2 Propagation of chaos (F,P,K, Mckean, Vlasov,... ) 3 Mean-field approximation (Boltzmann, Kac,... ) 4 Connection to Nash game (Weintraub, HCM, Altman,... ) P. G. Mehta (UIUC) GERAD Apr. 20, / 75

42 Summary Derivation of model PDE model HJB: t h + ω θ h = 1 2R ( θ h) 2 c(θ,t) + η σ θθ h h(θ,t,ω) FPK: t p + ω θ p = 1 R θ [p( θ h )] + σ 2 2 θθ 2 p p(θ,t,ω) 2π Mean-field approx.: c(ϑ, t) = c (ϑ,θ) p(θ,t,ω) g(ω)dθ dω Ω 0 1 Bellman s optimality principle (H,J,B) 2 Propagation of chaos (F,P,K, Mckean, Vlasov,... ) 3 Mean-field approximation (Boltzmann, Kac,... ) 4 Connection to Nash game (Weintraub, HCM, Altman,... ) P. G. Mehta (UIUC) GERAD Apr. 20, / 75

43 Summary Derivation of model PDE model HJB: t h + ω θ h = 1 2R ( θ h) 2 c(θ,t) + η σ θθ h h(θ,t,ω) FPK: t p + ω θ p = 1 R θ [p( θ h )] + σ 2 2 θθ 2 p p(θ,t,ω) 2π Mean-field approx.: c(ϑ, t) = c (ϑ,θ) p(θ,t,ω) g(ω)dθ dω Ω 0 1 Bellman s optimality principle (H,J,B) 2 Propagation of chaos (F,P,K, Mckean, Vlasov,... ) 3 Mean-field approximation (Boltzmann, Kac,... ) 4 Connection to Nash game (Weintraub, HCM, Altman,... ) P. G. Mehta (UIUC) GERAD Apr. 20, / 75

44 Derivation of model PDE model 1. Solution of PDE gives ε-nash equilibrium Optimal control law ε-nash property (as N ) u o i = 1 R θ h(θ(t),t,ω) ω=ωi η i (u o i ;u o i) η i (u i ;u o i) + O( 1 N ), i = 1,...,N. So, we look for solutions of PDEs. P. G. Mehta (UIUC) GERAD Apr. 20, / 75

45 Derivation of model PDE model 1. Solution of PDE gives ε-nash equilibrium Optimal control law ε-nash property (as N ) u o i = 1 R θ h(θ(t),t,ω) ω=ωi η i (u o i ;u o i) η i (u i ;u o i) + O( 1 N ), i = 1,...,N. So, we look for solutions of PDEs. P. G. Mehta (UIUC) GERAD Apr. 20, / 75

46 Derivation of model PDE model 1. Solution of PDE gives ε-nash equilibrium Optimal control law ε-nash property (as N ) u o i = 1 R θ h(θ(t),t,ω) ω=ωi η i (u o i ;u o i) η i (u i ;u o i) + O( 1 N ), i = 1,...,N. So, we look for solutions of PDEs. P. G. Mehta (UIUC) GERAD Apr. 20, / 75

47 Derivation of model PDE model 2. Incoherence solution (PDE) Incoherence solution h(θ,t,ω) = h 0 (θ) := 0 p(θ,t,ω) = p 0 (θ) := 1 2π incoherence h(θ,t,ω) = 0 t h + ω θ h = 1 2R ( θ h) 2 c(θ,t) + η σ θθ h t p + ω θ p = 1 R θ [p( θ h)] + σ θθ p 2π c(θ,t) = c (θ,ϑ)p(ϑ,t,ω)g(ω)dϑ dω Ω 0 P. G. Mehta (UIUC) GERAD Apr. 20, / 75

48 Derivation of model PDE model 2. Incoherence solution (PDE) Incoherence solution h(θ,t,ω) = h 0 (θ) := 0 p(θ,t,ω) = p 0 (θ) := 1 2π incoherence h(θ,t,ω) = 0 t h + ω θ h = 1 2R ( θ h) 2 c(θ,t) + η σ θθ h p(θ,t,ω) = 1 2π tp + ω θ p = 1 R θ [p( θ h)] + σ θθ p 2π c(θ,t) = c (θ,ϑ)p(ϑ,t,ω)g(ω)dϑ dω Ω 0 P. G. Mehta (UIUC) GERAD Apr. 20, / 75

49 Derivation of model PDE model 2. Incoherence solution (PDE) ( ) ϑ θ Assume c (ϑ,θ) = c (ϑ θ) = 1 2 sin2 2 Incoherence solution h(θ,t,ω) = h 0 (θ) := 0 p(θ,t,ω) = p 0 (θ) := 1 2π Optimal control u = 1 R θ h = 0 Average cost 2π ( ) 1 θ ϑ 1 c(θ,t) = 2 sin2 g(ω)dϑ dω Ω 0 2 2π η (ω) = c(θ,t) = 1 4 =: η 0 for all ω Ω incoherence soln. No cost of control P. G. Mehta (UIUC) GERAD Apr. 20, / 75

50 Derivation of model PDE model 2. Incoherence solution (Finite population) Closed-loop dynamics dθ i = (ω i + u }{{} i )dt + σ dξ i (t) =0 Average cost 1 T η i = lim E[c(θ i ;θ i ) + 1 T T 2 Ru2 i ]dt 0 }{{} =0 1 = lim T N 1 T ( E[ 1 θi (t) θ j (t) j i T 2 sin2 0 2 = 1 N 2π ( E[ 1 θi (t) ϑ 2 sin2 j i 0 2 ) ]dt ) ] 1 2π dϑ = N N η incoherence 1 ε-nash property η i (u o i ;u o i) η i (u i ;u o i) + O( 1 N ), i = 1,...,N. P. G. Mehta (UIUC) GERAD Apr. 20, / 75

51 Derivation of model PDE model 2. Incoherence solution (Finite population) Closed-loop dynamics dθ i = (ω i + u }{{} i )dt + σ dξ i (t) =0 Average cost 1 T η i = lim E[c(θ i ;θ i ) + 1 T T 2 Ru2 i ]dt 0 }{{} =0 1 = lim T N 1 T ( E[ 1 θi (t) θ j (t) j i T 2 sin2 0 2 = 1 N 2π ( E[ 1 θi (t) ϑ 2 sin2 j i 0 2 ) ]dt ) ] 1 2π dϑ = N N η incoherence 1 ε-nash property η i (u o i ;u o i) η i (u i ;u o i) + O( 1 N ), i = 1,...,N. P. G. Mehta (UIUC) GERAD Apr. 20, / 75

52 Derivation of model PDE model 3. Synchronization is a solution of game R 1/ Synchrony Locking η(ω) η(ω) = η0 c R < Rc ω= 0.95 ω= 1 ω= 1.05 γ R > R c (γ) 0.15 Incoherence γ R > Rc 0.15 η(ω) < η R 1/ 2 dθ i = (ω i + u i )dt + σ dξ i 1 T η i (u i ;u i ) = lim E[c(θ i ;θ i ) + 1 T T 2 Ru2 i ]ds 0 η(ω) = minη i (u i ;u o u i i) t = Synchrony solution of P. G. Mehta (UIUC) GERAD Apr. 20, / 75

53 Derivation of model PDE model 3. Synchronization is a solution of game R 1/ Synchrony Locking η(ω) η(ω) = η0 incoherence soln. c R < Rc ω= 0.95 ω= 1 ω= 1.05 γ R > R c (γ) 0.15 Incoherence γ R > Rc 0.15 η(ω) < η R 1/ 2 dθ i = (ω i + u i )dt + σ dξ i 1 T η i (u i ;u i ) = lim E[c(θ i ;θ i ) + 1 T T 2 Ru2 i ]ds 0 η(ω) = minη i (u i ;u o u i i) t = Synchrony solution of P. G. Mehta (UIUC) GERAD Apr. 20, / 75

54 Derivation of model PDE model 3. Synchronization is a solution of game γ R 1/ 2 Synchrony Locking R > R c (γ) 0.15 Incoherence γ η(ω) R > Rc η(ω) = η0 R < Rc η(ω) < η R 1/ 2 c ω= 0.95 ω= 1 ω= 1.05 synchrony soln. dθ i = (ω i + u i )dt + σ dξ i 1 T η i (u i ;u i ) = lim E[c(θ i ;θ i ) + 1 T T 2 Ru2 i ]ds 0 η(ω) = minη i (u i ;u o u i i) t = Synchrony solution of P. G. Mehta (UIUC) GERAD Apr. 20, / 75

55 1 Motivation Why a game? Why Oscillators? 2 Problems and results Problem statement Main results 3 Derivation of model Overview Derivation steps PDE model 4 Analysis of phase transition Incoherence solution Bifurcation analysis Numerics 5 Learning Q-function approximation Steepest descent algorithm

56 Analysis of phase transition Overview of the steps Incoherence solution HJB: t h + ω θ h = 1 2R ( θ h) 2 c(θ,t) + η σ θθ h h(θ,t,ω) FPK: t p + ω θ p = 1 R θ [p( θ h )] + σ 2 2 θθ 2 p p(θ,t,ω) 2π c(ϑ,t) = c (ϑ,θ) p(θ,t,ω) g(ω)dθ dω Ω 0 ( ) ϑ θ Assume c (ϑ,θ) = c (ϑ θ) = 1 2 sin2 2 Incoherence solution h(θ,t,ω) = h 0 (θ) := 0 p(θ,t,ω) = p 0 (θ) := 1 2π P. G. Mehta (UIUC) GERAD Apr. 20, / 75

57 1 Motivation Why a game? Why Oscillators? 2 Problems and results Problem statement Main results 3 Derivation of model Overview Derivation steps PDE model 4 Analysis of phase transition Incoherence solution Bifurcation analysis Numerics 5 Learning Q-function approximation Steepest descent algorithm

58 Analysis of phase transition Linearization and spectra Bifurcation analysis Linearized PDE (about incoherence solution) ( t z(θ,t,ω) = ω θ h c σ 2 2 θθ 2 h ) ω θ p + 1 2πR θθ 2 h + σ 2 2 θθ 2 p =: L R z(θ,t,ω) Spectrum of the linear operator 1 Continuous spectrum {S (k) } + k= S (k) := {λ C λ = ± σ 2 } 2 k2 kωi for all ω Ω 2 Discrete spectrum Characteristic eqn: 1 8R Ω g(ω) (λ σ ωi)(λ + dω + 1 = 0. σ ωi) P. G. Mehta (UIUC) GERAD Apr. 20, / 75

59 Analysis of phase transition Linearization and spectra Bifurcation analysis Linearized PDE (about incoherence solution) ( t z(θ,t,ω) = ω θ h c σ 2 2 θθ 2 h ) ω θ p + 1 2πR θθ 2 h + σ 2 2 θθ 2 p =: L R z(θ,t,ω) Spectrum of the linear operator 1 Continuous spectrum {S (k) } + k= S (k) := {λ C λ = ± σ 2 } 2 k2 kωi for all ω Ω 2 Discrete spectrum Characteristic eqn: 1 8R Ω g(ω) (λ σ ωi)(λ + dω + 1 = 0. σ ωi) P. G. Mehta (UIUC) GERAD Apr. 20, / 75

60 Analysis of phase transition Linearization and spectra Bifurcation analysis Linearized PDE (about incoherence solution) ( t z(θ,t,ω) = ω θ h c σ 2 2 θθ 2 h ) ω θ p + 1 2πR θθ 2 h + σ 2 2 θθ 2 p =: L R z(θ,t,ω) Spectrum of the linear operator 1 Continuous spectrum {S (k) } + k= S (k) := {λ C λ = ± σ 2 } 2 k2 kωi for all ω Ω 2 Discrete spectrum imag γ = 0.1 R decreases k=1 k=1 k=2 k= real Characteristic eqn: 1 8R Ω g(ω) (λ σ ωi)(λ + dω + 1 = 0. σ ωi) P. G. Mehta (UIUC) GERAD Apr. 20, / 75

61 Analysis of phase transition Linearization and spectra Bifurcation analysis Linearized PDE (about incoherence solution) ( t z(θ,t,ω) = ω θ h c σ 2 2 θθ 2 h ) ω θ p + 1 2πR θθ 2 h + σ 2 2 θθ 2 p =: L R z(θ,t,ω) Spectrum of the linear operator 1 Continuous spectrum {S (k) } + k= S (k) := {λ C λ = ± σ 2 } 2 k2 kωi for all ω Ω 2 Discrete spectrum imag γ = 0.1 R decreases k=1 k=1 k=2 k= real Characteristic eqn: 1 8R Ω g(ω) (λ σ ωi)(λ + dω + 1 = 0. σ ωi) P. G. Mehta (UIUC) GERAD Apr. 20, / 75

62 Analysis of phase transition Bifurcation analysis Bifurcation diagram (Hamiltonian Hopf) Characteristic eqn: 1 8R Ω g(ω) (λ σ ωi)(λ + dω + 1 = 0. σ ωi) Stability proof [3] Dellnitz et al., Int. Series Num. Math., 1992 P. G. Mehta (UIUC) GERAD Apr. 20, / 75

63 Analysis of phase transition Bifurcation analysis Bifurcation diagram (Hamiltonian Hopf) Characteristic eqn: 1 8R Ω g(ω) (λ σ ωi)(λ + dω + 1 = 0. σ ωi) Stability proof Bifurcation point (a) imag Cont. spectrum; ind. of R Disc. spectrum; fn. of R real [3] Dellnitz et al., Int. Series Num. Math., 1992 P. G. Mehta (UIUC) GERAD Apr. 20, / 75

64 Analysis of phase transition Bifurcation analysis Bifurcation diagram (Hamiltonian Hopf) Characteristic eqn: 1 8R Ω g(ω) (λ σ ωi)(λ + dω + 1 = 0. σ ωi) Stability proof Bifurcation point (a) ) R R > Rc(γ) (c) Incoherence imag Cont. spectrum; ind. of R Disc. spectrum; fn. of R real Synchrony γ [3] Dellnitz et al., Int. Series Num. Math., 1992 P. G. Mehta (UIUC) GERAD Apr. 20, / 75

65 1 Motivation Why a game? Why Oscillators? 2 Problems and results Problem statement Main results 3 Derivation of model Overview Derivation steps PDE model 4 Analysis of phase transition Incoherence solution Bifurcation analysis Numerics 5 Learning Q-function approximation Steepest descent algorithm

66 Analysis of phase transition Numerical solution of PDEs Numerics incoherence Incoherence; R = 60 incoherence P. G. Mehta (UIUC) GERAD Apr. 20, / 75

67 Analysis of phase transition Numerical solution of PDEs Numerics incoherence Incoherence; R = 60 incoherence synchrony Synchrony; R = 10 synchrony P. G. Mehta (UIUC) GERAD Apr. 20, / 75

68 Bifurcation diagram Analysis of phase transition Numerics R 1/ Synchrony Locking η(ω) 0.25 R < Rc η(ω) < η0 ω = 0.95 ω = 1 ω = γ R > R c (γ) 0.15 Incoherence γ 0.15 R > Rc η(ω) = η R 1/2 dθ i = (ω i + u i )dt + σ dξ i 1 η i (u i ;u i ) = lim T T T 0 E[c(θ i ;θ i ) Ru2 i ]ds P. G. Mehta (UIUC) GERAD Apr. 20, / 75

69 Bifurcation diagram Analysis of phase transition Numerics R 1/ Synchrony Locking η(ω) 0.25 incoherence soln. R < Rc η(ω) < η0 ω = 0.95 ω = 1 ω = γ R > R c (γ) 0.15 Incoherence γ 0.15 R > Rc η(ω) = η R 1/2 dθ i = (ω i + u i )dt + σ dξ i 1 η i (u i ;u i ) = lim T T T 0 E[c(θ i ;θ i ) Ru2 i ]ds P. G. Mehta (UIUC) GERAD Apr. 20, / 75

70 Bifurcation diagram Analysis of phase transition Numerics R 1/ Synchrony Locking η(ω) 0.25 R < Rc η(ω) < η0 ω = 0.95 ω = 1 ω = 1.05 γ 0.2 R > R c (γ) 0.15 Incoherence γ R > Rc η(ω) = η synchrony soln. R 1/2 dθ i = (ω i + u i )dt + σ dξ i 1 η i (u i ;u i ) = lim T T T 0 E[c(θ i ;θ i ) Ru2 i ]ds P. G. Mehta (UIUC) GERAD Apr. 20, / 75

71 p R = θ Analysis of phase transition Numerics Bifurcation diagram with another cost c (θ,ϑ) = 0.25(1 cos(θ ϑ)) R 1/2 P. G. Mehta (UIUC) GERAD Apr. 20, / 75

72 p R = θ Analysis of phase transition Numerics Bifurcation diagram with another cost c (θ,ϑ) = 0.25(1 cos(θ ϑ)) R = p R 1/ θ P. G. Mehta (UIUC) GERAD Apr. 20, / 75

73 p R = θ Analysis of phase transition Numerics Bifurcation diagram with another cost c (θ,ϑ) = 0.25(1 cos(θ ϑ)) c (θ,ϑ) = 1 cos(θ ϑ) cos3(θ ϑ) R 1/ R 1/2 P. G. Mehta (UIUC) GERAD Apr. 20, / 75

74 p R = θ Analysis of phase transition Numerics Bifurcation diagram with another cost c (θ,ϑ) = 0.25(1 cos(θ ϑ)) c (θ,ϑ) = 1 cos(θ ϑ) cos3(θ ϑ) p R 1/ θ P. G. Mehta (UIUC) GERAD Apr. 20, / 75

75 1 Motivation Why a game? Why Oscillators? 2 Problems and results Problem statement Main results 3 Derivation of model Overview Derivation steps PDE model 4 Analysis of phase transition Incoherence solution Bifurcation analysis Numerics 5 Learning Q-function approximation Steepest descent algorithm

76 Learning Comparison to Kuramoto law Q-function approximation Control law u = ϕ(θ,t,ω) = 1 R θ h(θ,t,ω) Population Density Control laws ω = 0.95 ω = 1 ω = π 2π θ Equivalent control law in Kuramoto oscillator u (Kur) i = κ N N j=1 sin(θ j (t) θ i ) N κ 0 sin(ϑ 0 + t θ i ) P. G. Mehta (UIUC) GERAD Apr. 20, / 75

77 Learning Comparison to Kuramoto law Q-function approximation Control law u = ϕ(θ,t,ω) = 1 R θ h(θ,t,ω) Population Density π 2 π θ Control laws ω = 0.95 ω = 1 ω = 1.05 Kuramoto Equivalent control law in Kuramoto oscillator u (Kur) i = κ N N j=1 sin(θ j (t) θ i ) N κ 0 sin(ϑ 0 + t θ i ) P. G. Mehta (UIUC) GERAD Apr. 20, / 75

78 Learning Q-function approximation Optimality equation Optimal control law u i = 1 R θ h i (θ,t) Parameterization: min{c(θ;θ i (t)) + 1 u i 2 Ru2 i + D ui h i (θ,t)} = ηi }{{} =: H i (θ,u i ;θ i (t)) Kuramoto law u (Kur) i = κ N j i sin(θ i θ j (t)) H (A i,φ i ) i (θ,u i ;θ i (t)) = c(θ;θ i (t))+ 1 2 Ru2 i +(ω i 1+u i )A i S (φ i) + σ 2 2 A ic (φ i) where S (φ) (θ,θ i ) = 1 N sin(θ θ j φ), C (φ) (θ,θ i ) = 1 j i N cos(θ θ j φ) j i Approx. optimal control: u (A i,φ i ) i = argmin{h (A i,φ i ) i (θ,u i ;θ i (t))} = A i u i R S(φ i) (θ,θ i ) P. G. Mehta (UIUC) GERAD Apr. 20, / 75

79 Learning Q-function approximation Optimality equation Optimal control law u i = 1 R θ h i (θ,t) Parameterization: min{c(θ;θ i (t)) + 1 u i 2 Ru2 i + D ui h i (θ,t)} = ηi }{{} =: H i (θ,u i ;θ i (t)) Kuramoto law u (Kur) i = κ N j i sin(θ i θ j (t)) H (A i,φ i ) i (θ,u i ;θ i (t)) = c(θ;θ i (t))+ 1 2 Ru2 i +(ω i 1+u i )A i S (φ i) + σ 2 2 A ic (φ i) where S (φ) (θ,θ i ) = 1 N sin(θ θ j φ), C (φ) (θ,θ i ) = 1 j i N cos(θ θ j φ) j i Approx. optimal control: u (A i,φ i ) i = argmin{h (A i,φ i ) i (θ,u i ;θ i (t))} = A i u i R S(φ i) (θ,θ i ) P. G. Mehta (UIUC) GERAD Apr. 20, / 75

80 Learning Q-function approximation Optimality equation Optimal control law u i = 1 R θ h i (θ,t) Parameterization: min{c(θ;θ i (t)) + 1 u i 2 Ru2 i + D ui h i (θ,t)} = ηi }{{} =: H i (θ,u i ;θ i (t)) Kuramoto law u (Kur) i = κ N j i sin(θ i θ j (t)) H (A i,φ i ) i (θ,u i ;θ i (t)) = c(θ;θ i (t))+ 1 2 Ru2 i +(ω i 1+u i )A i S (φ i) + σ 2 2 A ic (φ i) where S (φ) (θ,θ i ) = 1 N sin(θ θ j φ), C (φ) (θ,θ i ) = 1 j i N cos(θ θ j φ) j i Approx. optimal control: u (A i,φ i ) i = argmin{h (A i,φ i ) i (θ,u i ;θ i (t))} = A i u i R S(φ i) (θ,θ i ) P. G. Mehta (UIUC) GERAD Apr. 20, / 75

81 Learning Q-function approximation Optimality equation Optimal control law u i = 1 R θ h i (θ,t) Parameterization: min{c(θ;θ i (t)) + 1 u i 2 Ru2 i + D ui h i (θ,t)} = ηi }{{} =: H i (θ,u i ;θ i (t)) Kuramoto law u (Kur) i = κ N j i sin(θ i θ j (t)) H (A i,φ i ) i (θ,u i ;θ i (t)) = c(θ;θ i (t))+ 1 2 Ru2 i +(ω i 1+u i )A i S (φ i) + σ 2 2 A ic (φ i) where S (φ) (θ,θ i ) = 1 N sin(θ θ j φ), C (φ) (θ,θ i ) = 1 j i N cos(θ θ j φ) j i Approx. optimal control: u (A i,φ i ) i = argmin{h (A i,φ i ) i (θ,u i ;θ i (t))} = A i u i R S(φ i) (θ,θ i ) P. G. Mehta (UIUC) GERAD Apr. 20, / 75

82 1 Motivation Why a game? Why Oscillators? 2 Problems and results Problem statement Main results 3 Derivation of model Overview Derivation steps PDE model 4 Analysis of phase transition Incoherence solution Bifurcation analysis Numerics 5 Learning Q-function approximation Steepest descent algorithm

83 Learning Steepest descent algorithm Bellman error: Pointwise: Simple gradient descent algorithm ẽ(a i,φ i ) = 2 k=1 L (A i,φ i ) (θ,t) = min{h (A i,φ i ) u i } η (A i,φ i i L (A i,φ i ), ϕ k (θ) 2 da i = ε dẽ(a i,φ i ), dt da i Theorem (Convergence) dφ i dt = ε dẽ(a i,φ i ) dφ i Assume population is in synchrony. The i th oscillator updates according to ( ). Then A i (t) A = 1 2σ 2 The pointwise Bellman error L (A i,0) (θ,t) = ε(r)cos2(θ t) where ε(r) = 1 16Rσ 4 P. G. Mehta (UIUC) GERAD Apr. 20, / 75 i ) ( )

84 Phase transition Learning Steepest descent algorithm Suppose all oscillators use approx. optimal control law: u i = A R 1 N sin(θ i θ j (t)) j i then the phase transition boundary is { 1 if γ = 0 2σ R c (γ) = 4 ( ) 1 2γ 4σ 2 γ tan 1 if γ > 0 σ k = 0.01; R = PDE Learning 5 A * A i R Synchrony Incoherence t γ P. G. Mehta (UIUC) GERAD Apr. 20, / 75

85 Learning Comparison of control laws Steepest descent algorithm From PDE: u i = ϕ(θ,t,ω) = 1 ω=ωi R θ h(θ,t,ω) From learning: u i = A R 1 N sin(θ i θ j (t) φ i ) j i ω=ωi Population Density Population Density θ π 2 π 0 π 2 π θ Control laws PDE ω = 0.95 ω = 0.1 ω = 1.05 Learning ω = 0.95 ω = 0.1 ω = 1.05 P. G. Mehta (UIUC) GERAD Apr. 20, / 75

86 Learning Comparison of average cost Steepest descent algorithm Average Cost PDE Learning ω = 0.95 ω = 0.1 ω = P. G. Mehta (UIUC) GERAD Apr. 20, / 75

87 Thank you! Website: Huibing Yin Sean P. Meyn Uday V. Shanbhag H. Yin, P. G. Mehta, S. P. Meyn and U. V. Shanbhag, Synchronization of coupled oscillators is a game, ACC 2010

88 Part I Apendix

89 6 Appendix Game-theoretic results (ε-nash) Stability Numerics Linearization/Spectra Modeling Multiple oscillators Learning The New Yorker excerpt 7 Bibliography

90 Infinit population limit Appendix Game-theoretic results (ε-nash) Recall the PDE model t h + ω θ h = 1 2R ( θ h) 2 c(θ,t) + η σ θθ h t p + ω θ p = 1 R θ [p( θ h)] + σ 2 2 θθ 2 p 2π c(ϑ,t) = c (ϑ,θ)p(θ,t,ω)g(ω)dθ dω Ω 0 Solution h(θ,t,ω) Control u o i (t) := 1 R θ h(θ,t,ω) ω=ωi By construction: η i (u o i ; c) η i (u i ; c) P. G. Mehta (UIUC) GERAD Apr. 20, / 75

91 Finite population Appendix Game-theoretic results (ε-nash) Finite population cost [ c (N) 1 i (ϑ,t) := E N c ( θ i (t);θ i(t) o θ i (t) = ϑ )] j i θ i o = (θ1 o,...,θo i 1,θo i+1,...,θo N) with θi o (t) obtained from dynamics with control u i = u o i (t) Lemma 1: for each i = 1,...,N [ 1 T ] max lim sup c(ϑ,s) c (N) i (ϑ,s) ds ϑ [0,2π] T T 0 = O( 1 N ) P. G. Mehta (UIUC) GERAD Apr. 20, / 75

92 ε-nash equilibrium Appendix Game-theoretic results (ε-nash) Theorem The control {u o i (t) = 1 R θ h(θ(t),t,ω) ω=ωi } N i=1 equilibrium with respect to the finite population cost is an ε-nash η (POP) i (u i ;u o 1 T i) = lim T T 0 i.e., for any adapted control u i, c (N) i Ru2 i ds; η (POP) i (u o i ;u o i) η (POP) i (u i ;u o i) + O( 1 ), N i = 1,...,N. c (N) i (ϑ,t) := E[c(θ i (t);θ o i(t)) θ i (t) = ϑ] P. G. Mehta (UIUC) GERAD Apr. 20, / 75

93 6 Appendix Game-theoretic results (ε-nash) Stability Numerics Linearization/Spectra Modeling Multiple oscillators Learning The New Yorker excerpt 7 Bibliography

94 Stability proof Appendix Stability dx = ax + by, a > 0 dt dy = cx + ay, bc < 0 dt FPK eqn HJB eqn with boundary conditions x(0) = x 0 and y( ) = 0 t x(t) = e at x 0 + b e a(t τ) y(τ)dτ, y(t) = c t 0 e a(t τ) x(τ)dτ. For x 0 = 0, equilibrium solution x(t) 0, y(t) 0 Characteristic equation σ 2 = a 2 + bc. Stable in the sense that The equilibrium solution is asymptotically stable if for any initial perturbation x 0, the solution x(t) 0 as t. Return to spectrum P. G. Mehta (UIUC) GERAD Apr. 20, / 75

95 6 Appendix Game-theoretic results (ε-nash) Stability Numerics Linearization/Spectra Modeling Multiple oscillators Learning The New Yorker excerpt 7 Bibliography

96 Appendix Algorithm for solving PDEs Numerics HJB: t h + ω θ h = 1 2R ( θ h) 2 c(θ,t) + η σ θθ h h(θ,t,ω) FPK: t p + ω θ p = 1 R θ [p( θ h )] + σ 2 2 θθ 2 p p(θ,t,ω) 2π c(ϑ,t) = c (ϑ,θ) p(θ,t,ω) g(ω)dθ dω Ω 0 V(θ,t,ω) = h(θ,t s,ω) Iterative algorithm s Backward time Forward time t P. G. Mehta (UIUC) GERAD Apr. 20, / 75

97 Appendix Algorithm for solving PDEs Numerics HJB: t h + ω θ h = 1 2R ( θ h) 2 c(θ,t) + η σ θθ h h(θ,t,ω) FPK: t p + ω θ p = 1 R θ [p( θ h )] + σ 2 2 θθ 2 p p(θ,t,ω) 2π c(ϑ,t) = c (ϑ,θ) p(θ,t,ω) g(ω)dθ dω Ω 0 V(θ,t,ω) = h(θ,t s,ω) Iterative algorithm s Backward time Forward time t P. G. Mehta (UIUC) GERAD Apr. 20, / 75

98 Appendix Algorithm for solving PDEs Numerics HJB: t h + ω θ h = 1 2R ( θ h) 2 c(θ,t) + η σ θθ h h(θ,t,ω) FPK: t p + ω θ p = 1 R θ [p( θ h )] + σ 2 2 θθ 2 p p(θ,t,ω) 2π c(ϑ,t) = c (ϑ,θ) p(θ,t,ω) g(ω)dθ dω Ω 0 V(θ,t,ω) = h(θ,t s,ω) Iterative algorithm s Backward time Forward time t P. G. Mehta (UIUC) GERAD Apr. 20, / 75

99 Appendix Numerical solution of PDEs Numerics Incoherence; R = 60 incoherence ) R R > Rc(γ) (c) Incoherence 20 Synchrony γ p(θ,t,ω) at given position θ 0.2 γ = 0.01; Iter = γ = 0.01; Iter = γ = 0.01; Iter = p(θ,t,ω) p(θ,t,ω) p(θ,t,ω) t t t p(θ,t,ω) p(θ,t,ω) p(θ,t,ω) t t t Iter = 1 Iter=4 Iter=19 P. G. Mehta (UIUC) GERAD Apr. 20, / 75

100 Appendix Numerical solution of PDEs Numerics synchrony Synchrony; R = 10 synchrony p(θ,t,ω) at given position θ 0.25 γ = 0.01; Iter = γ = 0.01; Iter = 4 1 γ = 0.01; Iter = 19 p(θ,t,ω) p(θ,t,ω) p(θ,t,ω) t t t 1 p(θ,t,ω) p(θ,t,ω) p(θ,t,ω) t t t Iter = 1 Iter=4 Iter=19 P. G. Mehta (UIUC) GERAD Apr. 20, / 75

101 6 Appendix Game-theoretic results (ε-nash) Stability Numerics Linearization/Spectra Modeling Multiple oscillators Learning The New Yorker excerpt 7 Bibliography

102 Linearization Appendix Linearization/Spectra ( t z(θ,t,ω) = Eigenvector problem λz = L R Z ω θ h c σ θθ h ω θ p + 1 2πR 2 θθ h + σ θθ p Fourier series expansion with respect to θ H = + k= H k (ω)e ikθ, P = + k= P k (ω)e ikθ Decomposition of the linear operator L R = k L (k) R ) P. G. Mehta (UIUC) GERAD Apr. 20, / 75

103 Linearization (cont.) Appendix Linearization/Spectra Individual operator ( σ 2 L (1) R := 2 ωi π 4 Ω g(ω)dω L (k) R := 1 2πR σ 2 2 ( ωi σ 2 2 k2 kωi 0 ) k2 2πR σ 2 2 k2 kωi L ( k) R = L (k) R Continuous spectrum {S (k) } + k= S (k) := {λ C λ = ± σ 2 } 2 k2 kωi for all ω Ω ), k 2 Discrete spectrum coincides with the discrete spectrum of L (±1) R P. G. Mehta (UIUC) GERAD Apr. 20, / 75

104 6 Appendix Game-theoretic results (ε-nash) Stability Numerics Linearization/Spectra Modeling Multiple oscillators Learning The New Yorker excerpt 7 Bibliography

105 Neuron Appendix Modeling P. G. Mehta (UIUC) GERAD Apr. 20, / 75

106 Neuron Appendix Modeling Equivalent circuits P. G. Mehta (UIUC) GERAD Apr. 20, / 75

107 Finite oscillator model Appendix Modeling Dynamics of i th oscillator [10]: dθ i = (ω i + u i (t))dt + σ dξ i, i = 1,...,N, t 0 (1) θ i phase of the oscillator {ξ i } mutually independent Wiener processes ω i constant and chosen independently according to a fixed distribution with density g u i (t) control P. G. Mehta (UIUC) GERAD Apr. 20, / 75

108 Appendix Finite oscillator model (cont.) Modeling i th oscillator seeks to minimize η (POP) 1 T i (u i ;u i ) = lim E[ c(θ i ;θ i ) + 1 T T 0 }{{} 2 Ru2 i ]ds }{{} cost of anarchy cost of control θ i = (θ j ) j i R control penalty c( ) cost function c(θ i ;θ i ) = 1 N j i c (θ i,θ j ), c 0 {u i } N i=1 Nash equilibrium if u i minimizes η (POP) i (u i ;u i), for i = 1,...,N. P. G. Mehta (UIUC) GERAD Apr. 20, / 75

109 Intuition Appendix Modeling i th oscillator seeks to minimize η (POP) 1 T i (u i ;u i ) = lim E[ c(θ i ;θ i ) + 1 T T 0 }{{} 2 Ru2 i ]ds }{{} cost of anarchy cost of control Trade-off between reducing the two costs Cost associated with θ i θ j : c(θ i ;θ i ) = 1 N j i c (θ i θ j ) Cost associated with control: 1 2 Ru2 i Qualitative macro-behavior change when R varies ) R R > Rc(γ) (c) Incoherence Synchrony P. G. Mehta (UIUC) GERAD Apr. 20, / 75 γ

110 6 Appendix Game-theoretic results (ε-nash) Stability Numerics Linearization/Spectra Modeling Multiple oscillators Learning The New Yorker excerpt 7 Bibliography

111 PDE model (as N ) Appendix Multiple oscillators HJB equation obtain optimal control t h + ω θ h = 1 2R ( θ h) 2 c(θ,t) + η σ θθ h (2) h(θ,t,ω) relative value function η (ω) average optimal cost Optimal feed-back control law ϕ(θ,t,ω) := 1 R θ h(θ,t,ω) P. G. Mehta (UIUC) GERAD Apr. 20, / 75

112 PDE model (as N ) Appendix Multiple oscillators HJB equation obtain optimal control t h + ω θ h = 1 2R ( θ h) 2 c(θ,t) + η σ θθ h (2) h(θ,t,ω) relative value function η (ω) average optimal cost Optimal feed-back control law ϕ(θ,t,ω) := 1 R θ h(θ,t,ω) Fokker-Planck-Kolmogorov (FPK) equation dθ i (t) = (ω i + u i (t))dt + σ dξ i, i = 1,...,N ) replaced by: t p + θ ((ω + ϕ )p = σ 2 2 θθ 2 p p(θ,t,ω) probability density function [10] Strogatz et al., Journal of Statistical Physics, 1991 P. G. Mehta (UIUC) GERAD Apr. 20, / 75

113 PDE model (as N ) Appendix Multiple oscillators HJB equation obtain optimal control t h + ω θ h = 1 2R ( θ h) 2 c(θ,t) + η σ θθ h (2) h(θ,t,ω) relative value function η (ω) average optimal cost Optimal feed-back control law ϕ(θ,t,ω) := 1 R θ h(θ,t,ω) Fokker-Planck-Kolmogorov (FPK) equation dθ i (t) = (ω i + u i (t))dt + σ dξ i, i = 1,...,N ) replaced by: t p + θ ((ω + ϕ )p = σ 2 2 θθ 2 p p(θ,t,ω) probability density function [10] Strogatz et al., Journal of Statistical Physics, 1991 P. G. Mehta (UIUC) GERAD Apr. 20, / 75

114 PDE model (as N ) Appendix Multiple oscillators HJB equation obtain optimal control t h + ω θ h = 1 2R ( θ h) 2 c(θ,t) + η σ θθ h (2) h(θ,t,ω) relative value function η (ω) average optimal cost Optimal feed-back control law ϕ(θ,t,ω) := 1 R θ h(θ,t,ω) FPK equation evolution of population phase angles t p + ω θ p = 1 R θ [p( θ h)] + σ θθ p (3) P. G. Mehta (UIUC) GERAD Apr. 20, / 75

115 PDE model (as N ) Appendix Multiple oscillators HJB equation obtain optimal control t h + ω θ h = 1 2R ( θ h) 2 c(θ,t) + η σ θθ h (2) h(θ,t,ω) relative value function η (ω) average optimal cost Optimal feed-back control law ϕ(θ,t,ω) := 1 R θ h(θ,t,ω) FPK equation evolution of population phase angles t p + ω θ p = 1 R θ [p( θ h)] + σ 2 2 θθ 2 p (3) 2π Generate cost: c (ϑ,θ)p(θ,t,ω)g(ω)dθ dω Ω 0 P. G. Mehta (UIUC) GERAD Apr. 20, / 75

116 Consistency Appendix Multiple oscillators Consistency requirement on deterministic mass influence 2π c(ϑ,t) = c (ϑ,θ)p(θ,t,ω)g(ω)dθ dω Ω 0 Recall: c(θ i ;θ i ) = 1 N j i c (θ i,θ j ) P. G. Mehta (UIUC) GERAD Apr. 20, / 75

117 6 Appendix Game-theoretic results (ε-nash) Stability Numerics Linearization/Spectra Modeling Multiple oscillators Learning The New Yorker excerpt 7 Bibliography

118 Q-learning Appendix Learning Dynamics of i th d agent dt x i = a i x i + b i u i i th agent seeks to minimize Control u i = k i xx i k i zz T η i = lim e γs ( (x i (s) z(s)) 2 + u 2 i (s) ) ds T 0 (individual state) (ensemble state) Agent [8] P. Mehta and S. Meyn, CDC, 2009 P. G. Mehta (UIUC) GERAD Apr. 20, / 75

119 Control comparison Appendix Learning PDE Learning ω = 0.95 ω = 0.1 ω = Control Amplitude Control Phase P. G. Mehta (UIUC) GERAD Apr. 20, / 75

120 6 Appendix Game-theoretic results (ε-nash) Stability Numerics Linearization/Spectra Modeling Multiple oscillators Learning The New Yorker excerpt 7 Bibliography

121 Rational irrationality Appendix The New Yorker excerpt The Prisoner s Dilemma is the obverse of Adam Smith s theory of the invisible hand, in which the free market coördinates the behavior of self-seeking individuals to the benefit of all. Each businessman intends only his own gain, Smith wrote in The Wealth of Nations, and he is in this, as in many other cases, led by an invisible hand to promote an end which was no part of his intention. But in a market environment the individual pursuit of self-interest, however rational, can give way to collective disaster. The invisible hand becomes a fist. John Cassidy, Rational Irrationality: The real reason that capitalism is so crash-prone, The New Yorker, 2009 P. G. Mehta (UIUC) GERAD Apr. 20, / 75

122 Bibliography Dimitri P. Bertsekas. Dynamic Programming and Optimal Control, volume 1. Athena Scientific, Belmont, Massachusetts, Eric Brown, Jeff Moehlis, and Philip Holmes. On the phase reduction and response dynamics of neural oscillator populations. Neural Computation, 16(4): , M. Dellnitz, J.E. Marsden, I. Melbourne, and J. Scheurle. Generic bifurcations of pendula. Int. Series Num. Math., 104: , J. Guckenheimer. Isochrons and phaseless sets. J. Math. Biol., 1: , Minyi Huang, Peter E. Caines, and Roland P. Malhame. P. G. Mehta (UIUC) GERAD Apr. 20, / 75

123 Bibliography Large-population cost-coupled LQG problems with nonuniform agents: Individual-mass behavior and decentralized ε-nash equilibria. IEEE transactions on automatic control, 52(9): , Y. Kuramoto. International Symposium on Mathematical Problems in Theoretical Physics, volume 39 of Lecture Notes in Physics. Springer-Verlag, Andrzej Lasota and Michael C. Mackey. Chaos, Fractals and Noise. Springer, P. Mehta and S. Meyn. Q-learning and Pontryagin s Minimum Principle. To appear, 48th IEEE Conference on Decision and Control, December Sean P. Meyn. P. G. Mehta (UIUC) GERAD Apr. 20, / 75

124 Bibliography The policy iteration algorithm for average reward markov decision processes with general state space. IEEE Transactions on Automatic Control, 42(12): , December S. H. Strogatz and R. E. Mirollo. Stability of incoherence in a population of coupled oscillators. Journal of Statistical Physics, 63: , May Steven H. Strogatz, Daniel M. Abrams, Bruno Eckhardt, and Edward Ott. Theoretical mechanics: Crowd synchrony on the millennium bridge. Nature, 438:43 44, P. G. Mehta (UIUC) GERAD Apr. 20, / 75

(1) I. INTRODUCTION (2)

(1) I. INTRODUCTION (2) 920 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 57, NO. 4, APRIL 2012 Synchronization of Coupled Oscillators is a Game Huibing Yin, Member, IEEE, Prashant G. Mehta, Member, IEEE, Sean P. Meyn, Fellow,

More information

Bayesian Inference with Oscillator Models: A Possible Role of Neural Rhythms

Bayesian Inference with Oscillator Models: A Possible Role of Neural Rhythms Bayesian Inference with Oscillator Models: A Possible Role of Neural Rhythms Qualcomm/Brain Corporation/INC Lecture Series on Computational Neuroscience University of California, San Diego, March 5, 2012

More information

c 2012 by Huibing Yin. All rights reserved.

c 2012 by Huibing Yin. All rights reserved. c 212 by Huibing Yin. All rights reserved. NONCOOPERATIVE STATIC AND DYNAMIC GAMES: ADDRESSING SHARED CONSTRAINTS AND PHASE TRANSITIONS BY HUIBING YIN DISSERTATION Submitted in partial fulfillment of the

More information

Dynamical systems in neuroscience. Pacific Northwest Computational Neuroscience Connection October 1-2, 2010

Dynamical systems in neuroscience. Pacific Northwest Computational Neuroscience Connection October 1-2, 2010 Dynamical systems in neuroscience Pacific Northwest Computational Neuroscience Connection October 1-2, 2010 What do I mean by a dynamical system? Set of state variables Law that governs evolution of state

More information

Feedback Particle Filter and its Application to Coupled Oscillators Presentation at University of Maryland, College Park, MD

Feedback Particle Filter and its Application to Coupled Oscillators Presentation at University of Maryland, College Park, MD Feedback Particle Filter and its Application to Coupled Oscillators Presentation at University of Maryland, College Park, MD Prashant Mehta Dept. of Mechanical Science and Engineering and the Coordinated

More information

Large-population, dynamical, multi-agent, competitive, and cooperative phenomena occur in a wide range of designed

Large-population, dynamical, multi-agent, competitive, and cooperative phenomena occur in a wide range of designed Definition Mean Field Game (MFG) theory studies the existence of Nash equilibria, together with the individual strategies which generate them, in games involving a large number of agents modeled by controlled

More information

Risk-Sensitive and Robust Mean Field Games

Risk-Sensitive and Robust Mean Field Games Risk-Sensitive and Robust Mean Field Games Tamer Başar Coordinated Science Laboratory Department of Electrical and Computer Engineering University of Illinois at Urbana-Champaign Urbana, IL - 6181 IPAM

More information

Quasi Stochastic Approximation American Control Conference San Francisco, June 2011

Quasi Stochastic Approximation American Control Conference San Francisco, June 2011 Quasi Stochastic Approximation American Control Conference San Francisco, June 2011 Sean P. Meyn Joint work with Darshan Shirodkar and Prashant Mehta Coordinated Science Laboratory and the Department of

More information

Synchronization and Phase Oscillators

Synchronization and Phase Oscillators 1 Synchronization and Phase Oscillators Richard Bertram Department of Mathematics and Programs in Neuroscience and Molecular Biophysics Florida State University Tallahassee, Florida 32306 Synchronization

More information

Phase Synchronization

Phase Synchronization Phase Synchronization Lecture by: Zhibin Guo Notes by: Xiang Fan May 10, 2016 1 Introduction For any mode or fluctuation, we always have where S(x, t) is phase. If a mode amplitude satisfies ϕ k = ϕ k

More information

Math 46, Applied Math (Spring 2008): Final

Math 46, Applied Math (Spring 2008): Final Math 46, Applied Math (Spring 2008): Final 3 hours, 80 points total, 9 questions, roughly in syllabus order (apart from short answers) 1. [16 points. Note part c, worth 7 points, is independent of the

More information

Spiral Wave Chimeras

Spiral Wave Chimeras Spiral Wave Chimeras Carlo R. Laing IIMS, Massey University, Auckland, New Zealand Erik Martens MPI of Dynamics and Self-Organization, Göttingen, Germany Steve Strogatz Dept of Maths, Cornell University,

More information

5 Applying the Fokker-Planck equation

5 Applying the Fokker-Planck equation 5 Applying the Fokker-Planck equation We begin with one-dimensional examples, keeping g = constant. Recall: the FPE for the Langevin equation with η(t 1 )η(t ) = κδ(t 1 t ) is = f(x) + g(x)η(t) t = x [f(x)p

More information

Controlled Diffusions and Hamilton-Jacobi Bellman Equations

Controlled Diffusions and Hamilton-Jacobi Bellman Equations Controlled Diffusions and Hamilton-Jacobi Bellman Equations Emo Todorov Applied Mathematics and Computer Science & Engineering University of Washington Winter 2014 Emo Todorov (UW) AMATH/CSE 579, Winter

More information

Synchronization Transitions in Complex Networks

Synchronization Transitions in Complex Networks Synchronization Transitions in Complex Networks Y. Moreno 1,2,3 1 Institute for Biocomputation and Physics of Complex Systems (BIFI) University of Zaragoza, Zaragoza 50018, Spain 2 Department of Theoretical

More information

Mean-Field optimization problems and non-anticipative optimal transport. Beatrice Acciaio

Mean-Field optimization problems and non-anticipative optimal transport. Beatrice Acciaio Mean-Field optimization problems and non-anticipative optimal transport Beatrice Acciaio London School of Economics based on ongoing projects with J. Backhoff, R. Carmona and P. Wang Robust Methods in

More information

Stochastic differential equations in neuroscience

Stochastic differential equations in neuroscience Stochastic differential equations in neuroscience Nils Berglund MAPMO, Orléans (CNRS, UMR 6628) http://www.univ-orleans.fr/mapmo/membres/berglund/ Barbara Gentz, Universität Bielefeld Damien Landon, MAPMO-Orléans

More information

Chimera states in networks of biological neurons and coupled damped pendulums

Chimera states in networks of biological neurons and coupled damped pendulums in neural models in networks of pendulum-like elements in networks of biological neurons and coupled damped pendulums J. Hizanidis 1, V. Kanas 2, A. Bezerianos 3, and T. Bountis 4 1 National Center for

More information

HJB equations. Seminar in Stochastic Modelling in Economics and Finance January 10, 2011

HJB equations. Seminar in Stochastic Modelling in Economics and Finance January 10, 2011 Department of Probability and Mathematical Statistics Faculty of Mathematics and Physics, Charles University in Prague petrasek@karlin.mff.cuni.cz Seminar in Stochastic Modelling in Economics and Finance

More information

Phase Oscillators. and at r, Hence, the limit cycle at r = r is stable if and only if Λ (r ) < 0.

Phase Oscillators. and at r, Hence, the limit cycle at r = r is stable if and only if Λ (r ) < 0. 1 Phase Oscillators Richard Bertram Department of Mathematics and Programs in Neuroscience and Molecular Biophysics Florida State University Tallahassee, Florida 32306 2 Phase Oscillators Oscillations

More information

Dynamical modelling of systems of coupled oscillators

Dynamical modelling of systems of coupled oscillators Dynamical modelling of systems of coupled oscillators Mathematical Neuroscience Network Training Workshop Edinburgh Peter Ashwin University of Exeter 22nd March 2009 Peter Ashwin (University of Exeter)

More information

Neuroscience applications: isochrons and isostables. Alexandre Mauroy (joint work with I. Mezic)

Neuroscience applications: isochrons and isostables. Alexandre Mauroy (joint work with I. Mezic) Neuroscience applications: isochrons and isostables Alexandre Mauroy (joint work with I. Mezic) Outline Isochrons and phase reduction of neurons Koopman operator and isochrons Isostables of excitable systems

More information

MEAN FIELD GAMES WITH MAJOR AND MINOR PLAYERS

MEAN FIELD GAMES WITH MAJOR AND MINOR PLAYERS MEAN FIELD GAMES WITH MAJOR AND MINOR PLAYERS René Carmona Department of Operations Research & Financial Engineering PACM Princeton University CEMRACS - Luminy, July 17, 217 MFG WITH MAJOR AND MINOR PLAYERS

More information

Chaotic motion. Phys 420/580 Lecture 10

Chaotic motion. Phys 420/580 Lecture 10 Chaotic motion Phys 420/580 Lecture 10 Finite-difference equations Finite difference equation approximates a differential equation as an iterative map (x n+1,v n+1 )=M[(x n,v n )] Evolution from time t

More information

Traveling waves dans le modèle de Kuramoto quenched

Traveling waves dans le modèle de Kuramoto quenched Traveling waves dans le modèle de Kuramoto quenched Eric Luçon MAP5 - Université Paris Descartes Grenoble - Journées MAS 2016 31 août 2016 Travail en commun avec Christophe Poquet (Lyon 1) [Giacomin, L.,

More information

Collective and Stochastic Effects in Arrays of Submicron Oscillators

Collective and Stochastic Effects in Arrays of Submicron Oscillators DYNAMICS DAYS: Long Beach, 2005 1 Collective and Stochastic Effects in Arrays of Submicron Oscillators Ron Lifshitz (Tel Aviv), Jeff Rogers (HRL, Malibu), Oleg Kogan (Caltech), Yaron Bromberg (Tel Aviv),

More information

ON MEAN FIELD GAMES. Pierre-Louis LIONS. Collège de France, Paris. (joint project with Jean-Michel LASRY)

ON MEAN FIELD GAMES. Pierre-Louis LIONS. Collège de France, Paris. (joint project with Jean-Michel LASRY) Collège de France, Paris (joint project with Jean-Michel LASRY) I INTRODUCTION II A REALLY SIMPLE EXAMPLE III GENERAL STRUCTURE IV TWO PARTICULAR CASES V OVERVIEW AND PERSPECTIVES I. INTRODUCTION New class

More information

Stochastic Gradient Descent in Continuous Time

Stochastic Gradient Descent in Continuous Time Stochastic Gradient Descent in Continuous Time Justin Sirignano University of Illinois at Urbana Champaign with Konstantinos Spiliopoulos (Boston University) 1 / 27 We consider a diffusion X t X = R m

More information

Control Theory in Physics and other Fields of Science

Control Theory in Physics and other Fields of Science Michael Schulz Control Theory in Physics and other Fields of Science Concepts, Tools, and Applications With 46 Figures Sprin ger 1 Introduction 1 1.1 The Aim of Control Theory 1 1.2 Dynamic State of Classical

More information

Multi-dimensional Feedback Particle Filter for Coupled Oscillators

Multi-dimensional Feedback Particle Filter for Coupled Oscillators 3 American Control Conference (ACC) Washington, DC, USA, June 7-9, 3 Multi-dimensional Feedback Particle Filter for Coupled Oscillators Adam K. Tilton, Prashant G. Mehta and Sean P. Meyn Abstract This

More information

Mean field equilibria of multiarmed bandit games

Mean field equilibria of multiarmed bandit games Mean field equilibria of multiarmed bandit games Ramesh Johari Stanford University Joint work with Ramki Gummadi (Stanford University) and Jia Yuan Yu (IBM Research) A look back: SN 2000 2 Overview What

More information

External Periodic Driving of Large Systems of Globally Coupled Phase Oscillators

External Periodic Driving of Large Systems of Globally Coupled Phase Oscillators External Periodic Driving of Large Systems of Globally Coupled Phase Oscillators T. M. Antonsen Jr., R. T. Faghih, M. Girvan, E. Ott and J. Platig Institute for Research in Electronics and Applied Physics

More information

Approximation of Top Lyapunov Exponent of Stochastic Delayed Turning Model Using Fokker-Planck Approach

Approximation of Top Lyapunov Exponent of Stochastic Delayed Turning Model Using Fokker-Planck Approach Approximation of Top Lyapunov Exponent of Stochastic Delayed Turning Model Using Fokker-Planck Approach Henrik T. Sykora, Walter V. Wedig, Daniel Bachrathy and Gabor Stepan Department of Applied Mechanics,

More information

A conjecture on sustained oscillations for a closed-loop heat equation

A conjecture on sustained oscillations for a closed-loop heat equation A conjecture on sustained oscillations for a closed-loop heat equation C.I. Byrnes, D.S. Gilliam Abstract The conjecture in this paper represents an initial step aimed toward understanding and shaping

More information

Lecture 8: Policy Gradient

Lecture 8: Policy Gradient Lecture 8: Policy Gradient Hado van Hasselt Outline 1 Introduction 2 Finite Difference Policy Gradient 3 Monte-Carlo Policy Gradient 4 Actor-Critic Policy Gradient Introduction Vapnik s rule Never solve

More information

Nonlinear and Collective Effects in Mesoscopic Mechanical Oscillators

Nonlinear and Collective Effects in Mesoscopic Mechanical Oscillators Dynamics Days Asia-Pacific: Singapore, 2004 1 Nonlinear and Collective Effects in Mesoscopic Mechanical Oscillators Alexander Zumdieck (Max Planck, Dresden), Ron Lifshitz (Tel Aviv), Jeff Rogers (HRL,

More information

Proceedings of Neural, Parallel, and Scientific Computations 4 (2010) xx-xx PHASE OSCILLATOR NETWORK WITH PIECEWISE-LINEAR DYNAMICS

Proceedings of Neural, Parallel, and Scientific Computations 4 (2010) xx-xx PHASE OSCILLATOR NETWORK WITH PIECEWISE-LINEAR DYNAMICS Proceedings of Neural, Parallel, and Scientific Computations 4 (2010) xx-xx PHASE OSCILLATOR NETWORK WITH PIECEWISE-LINEAR DYNAMICS WALTER GALL, YING ZHOU, AND JOSEPH SALISBURY Department of Mathematics

More information

Mean Field Stochastic Control

Mean Field Stochastic Control Mean Field Stochastic Control Peter E. Caines 50 Years of Nonlinear Control and Optimization IFAC Workshop The Royal Society London, 2010 McGill University 1 / 72 Co-Authors Minyi Huang Roland Malhamé

More information

Entrainment and Chaos in the Hodgkin-Huxley Oscillator

Entrainment and Chaos in the Hodgkin-Huxley Oscillator Entrainment and Chaos in the Hodgkin-Huxley Oscillator Kevin K. Lin http://www.cims.nyu.edu/ klin Courant Institute, New York University Mostly Biomath - 2005.4.5 p.1/42 Overview (1) Goal: Show that the

More information

New Homoclinic and Heteroclinic Solutions for Zakharov System

New Homoclinic and Heteroclinic Solutions for Zakharov System Commun. Theor. Phys. 58 (2012) 749 753 Vol. 58, No. 5, November 15, 2012 New Homoclinic and Heteroclinic Solutions for Zakharov System WANG Chuan-Jian ( ), 1 DAI Zheng-De (à ), 2, and MU Gui (½ ) 3 1 Department

More information

10.1 Review of Parametric Equations

10.1 Review of Parametric Equations 10.1 Review of Parametric Equations Recall that often, instead of representing a curve using just x and y (called a Cartesian equation), it is more convenient to define x and y using parametric equations

More information

UNIVERSITY OF MANITOBA

UNIVERSITY OF MANITOBA Question Points Score INSTRUCTIONS TO STUDENTS: This is a 6 hour examination. No extra time will be given. No texts, notes, or other aids are permitted. There are no calculators, cellphones or electronic

More information

Mean Field Consumption-Accumulation Optimization with HAR

Mean Field Consumption-Accumulation Optimization with HAR Mean Field Consumption-Accumulation Optimization with HARA Utility School of Mathematics and Statistics Carleton University, Ottawa Workshop on Mean Field Games and Related Topics 2 Padova, September 4-7,

More information

Weakly interacting particle systems on graphs: from dense to sparse

Weakly interacting particle systems on graphs: from dense to sparse Weakly interacting particle systems on graphs: from dense to sparse Ruoyu Wu University of Michigan (Based on joint works with Daniel Lacker and Kavita Ramanan) USC Math Finance Colloquium October 29,

More information

Mathematical Methods for Neurosciences. ENS - Master MVA Paris 6 - Master Maths-Bio ( )

Mathematical Methods for Neurosciences. ENS - Master MVA Paris 6 - Master Maths-Bio ( ) Mathematical Methods for Neurosciences. ENS - Master MVA Paris 6 - Master Maths-Bio (2014-2015) Etienne Tanré - Olivier Faugeras INRIA - Team Tosca November 26th, 2014 E. Tanré (INRIA - Team Tosca) Mathematical

More information

Dynamic Bertrand and Cournot Competition

Dynamic Bertrand and Cournot Competition Dynamic Bertrand and Cournot Competition Effects of Product Differentiation Andrew Ledvina Department of Operations Research and Financial Engineering Princeton University Joint with Ronnie Sircar Princeton-Lausanne

More information

Pontryagin s maximum principle

Pontryagin s maximum principle Pontryagin s maximum principle Emo Todorov Applied Mathematics and Computer Science & Engineering University of Washington Winter 2012 Emo Todorov (UW) AMATH/CSE 579, Winter 2012 Lecture 5 1 / 9 Pontryagin

More information

BIFURCATION PHENOMENA Lecture 1: Qualitative theory of planar ODEs

BIFURCATION PHENOMENA Lecture 1: Qualitative theory of planar ODEs BIFURCATION PHENOMENA Lecture 1: Qualitative theory of planar ODEs Yuri A. Kuznetsov August, 2010 Contents 1. Solutions and orbits. 2. Equilibria. 3. Periodic orbits and limit cycles. 4. Homoclinic orbits.

More information

Introduction to asymptotic techniques for stochastic systems with multiple time-scales

Introduction to asymptotic techniques for stochastic systems with multiple time-scales Introduction to asymptotic techniques for stochastic systems with multiple time-scales Eric Vanden-Eijnden Courant Institute Motivating examples Consider the ODE {Ẋ = Y 3 + sin(πt) + cos( 2πt) X() = x

More information

CDS 101/110a: Lecture 2.1 Dynamic Behavior

CDS 101/110a: Lecture 2.1 Dynamic Behavior CDS 11/11a: Lecture 2.1 Dynamic Behavior Richard M. Murray 6 October 28 Goals: Learn to use phase portraits to visualize behavior of dynamical systems Understand different types of stability for an equilibrium

More information

The Hamiltonian Mean Field Model: Effect of Network Structure on Synchronization Dynamics. Yogesh Virkar University of Colorado, Boulder.

The Hamiltonian Mean Field Model: Effect of Network Structure on Synchronization Dynamics. Yogesh Virkar University of Colorado, Boulder. The Hamiltonian Mean Field Model: Effect of Network Structure on Synchronization Dynamics Yogesh Virkar University of Colorado, Boulder. 1 Collaborators Prof. Juan G. Restrepo Prof. James D. Meiss Department

More information

Introduction to Mean Field Game Theory Part I

Introduction to Mean Field Game Theory Part I School of Mathematics and Statistics Carleton University Ottawa, Canada Workshop on Game Theory, Institute for Math. Sciences National University of Singapore, June 2018 Outline of talk Background and

More information

Stochastic Processes. M. Sami Fadali Professor of Electrical Engineering University of Nevada, Reno

Stochastic Processes. M. Sami Fadali Professor of Electrical Engineering University of Nevada, Reno Stochastic Processes M. Sami Fadali Professor of Electrical Engineering University of Nevada, Reno 1 Outline Stochastic (random) processes. Autocorrelation. Crosscorrelation. Spectral density function.

More information

A Barrier Version of the Russian Option

A Barrier Version of the Russian Option A Barrier Version of the Russian Option L. A. Shepp, A. N. Shiryaev, A. Sulem Rutgers University; shepp@stat.rutgers.edu Steklov Mathematical Institute; shiryaev@mi.ras.ru INRIA- Rocquencourt; agnes.sulem@inria.fr

More information

On continuous time contract theory

On continuous time contract theory Ecole Polytechnique, France Journée de rentrée du CMAP, 3 octobre, 218 Outline 1 2 Semimartingale measures on the canonical space Random horizon 2nd order backward SDEs (Static) Principal-Agent Problem

More information

LINEAR DISPERSIVE WAVES

LINEAR DISPERSIVE WAVES LINEAR DISPERSIVE WAVES Nathaniel Egwu December 16, 2009 Outline 1 INTRODUCTION Dispersion Relations Definition of Dispersive Waves 2 SOLUTION AND ASYMPTOTIC ANALYSIS The Beam Equation General Solution

More information

Systems Analysis and Control

Systems Analysis and Control Systems Analysis and Control Matthew M. Peet Illinois Institute of Technology Lecture 12: Overview In this Lecture, you will learn: Review of Feedback Closing the Loop Pole Locations Changing the Gain

More information

y = x 3 and y = 2x 2 x. 2x 2 x = x 3 x 3 2x 2 + x = 0 x(x 2 2x + 1) = 0 x(x 1) 2 = 0 x = 0 and x = (x 3 (2x 2 x)) dx

y = x 3 and y = 2x 2 x. 2x 2 x = x 3 x 3 2x 2 + x = 0 x(x 2 2x + 1) = 0 x(x 1) 2 = 0 x = 0 and x = (x 3 (2x 2 x)) dx Millersville University Name Answer Key Mathematics Department MATH 2, Calculus II, Final Examination May 4, 2, 8:AM-:AM Please answer the following questions. Your answers will be evaluated on their correctness,

More information

NETWORKS Lecture 2: Synchronization Fundamentals

NETWORKS Lecture 2: Synchronization Fundamentals Short-course: Complex Systems Beyond the Metaphor UNSW, February 2007 NETWORKS Lecture 2: Synchronization Fundamentals David J Hill Research School of Information Sciences and Engineering The ANU 8/2/2007

More information

Inverse Optimality Design for Biological Movement Systems

Inverse Optimality Design for Biological Movement Systems Inverse Optimality Design for Biological Movement Systems Weiwei Li Emanuel Todorov Dan Liu Nordson Asymtek Carlsbad CA 921 USA e-mail: wwli@ieee.org. University of Washington Seattle WA 98195 USA Google

More information

Multisoliton Interaction of Perturbed Manakov System: Effects of External Potentials

Multisoliton Interaction of Perturbed Manakov System: Effects of External Potentials Multisoliton Interaction of Perturbed Manakov System: Effects of External Potentials Michail D. Todorov Faculty of Applied Mathematics and Computer Science Technical University of Sofia, Bulgaria (Work

More information

Robotics. Control Theory. Marc Toussaint U Stuttgart

Robotics. Control Theory. Marc Toussaint U Stuttgart Robotics Control Theory Topics in control theory, optimal control, HJB equation, infinite horizon case, Linear-Quadratic optimal control, Riccati equations (differential, algebraic, discrete-time), controllability,

More information

Topic 5: The Difference Equation

Topic 5: The Difference Equation Topic 5: The Difference Equation Yulei Luo Economics, HKU October 30, 2017 Luo, Y. (Economics, HKU) ME October 30, 2017 1 / 42 Discrete-time, Differences, and Difference Equations When time is taken to

More information

A new approach for investment performance measurement. 3rd WCMF, Santa Barbara November 2009

A new approach for investment performance measurement. 3rd WCMF, Santa Barbara November 2009 A new approach for investment performance measurement 3rd WCMF, Santa Barbara November 2009 Thaleia Zariphopoulou University of Oxford, Oxford-Man Institute and The University of Texas at Austin 1 Performance

More information

Basics of reinforcement learning

Basics of reinforcement learning Basics of reinforcement learning Lucian Buşoniu TMLSS, 20 July 2018 Main idea of reinforcement learning (RL) Learn a sequential decision policy to optimize the cumulative performance of an unknown system

More information

Dynamical Systems and Chaos Part II: Biology Applications. Lecture 10: Coupled Systems. Ilya Potapov Mathematics Department, TUT Room TD325

Dynamical Systems and Chaos Part II: Biology Applications. Lecture 10: Coupled Systems. Ilya Potapov Mathematics Department, TUT Room TD325 Dynamical Systems and Chaos Part II: Biology Applications Lecture 10: Coupled Systems. Ilya Potapov Mathematics Department, TUT Room TD325 Foreword In order to model populations of physical/biological

More information

Cheap talk theoretic tools for distributed sensing in the presence of strategic sensors

Cheap talk theoretic tools for distributed sensing in the presence of strategic sensors 1 / 37 Cheap talk theoretic tools for distributed sensing in the presence of strategic sensors Cédric Langbort Department of Aerospace Engineering & Coordinated Science Laboratory UNIVERSITY OF ILLINOIS

More information

Reinforcement Learning

Reinforcement Learning 1 Reinforcement Learning Chris Watkins Department of Computer Science Royal Holloway, University of London July 27, 2015 2 Plan 1 Why reinforcement learning? Where does this theory come from? Markov decision

More information

CDS 101/110a: Lecture 2.1 Dynamic Behavior

CDS 101/110a: Lecture 2.1 Dynamic Behavior CDS 11/11a: Lecture.1 Dynamic Behavior Richard M. Murray 6 October 8 Goals: Learn to use phase portraits to visualize behavior of dynamical systems Understand different types of stability for an equilibrium

More information

The Theory of Weakly Coupled Oscillators

The Theory of Weakly Coupled Oscillators The Theory of Weakly Coupled Oscillators Michael A. Schwemmer and Timothy J. Lewis Department of Mathematics, One Shields Ave, University of California Davis, CA 95616 1 1 Introduction 2 3 4 5 6 7 8 9

More information

Q-Learning in Continuous State Action Spaces

Q-Learning in Continuous State Action Spaces Q-Learning in Continuous State Action Spaces Alex Irpan alexirpan@berkeley.edu December 5, 2015 Contents 1 Introduction 1 2 Background 1 3 Q-Learning 2 4 Q-Learning In Continuous Spaces 4 5 Experimental

More information

Mathematical Foundations of Neuroscience - Lecture 7. Bifurcations II.

Mathematical Foundations of Neuroscience - Lecture 7. Bifurcations II. Mathematical Foundations of Neuroscience - Lecture 7. Bifurcations II. Filip Piękniewski Faculty of Mathematics and Computer Science, Nicolaus Copernicus University, Toruń, Poland Winter 2009/2010 Filip

More information

FINAL EXAM MATH303 Theory of Ordinary Differential Equations. Spring dx dt = x + 3y dy dt = x y.

FINAL EXAM MATH303 Theory of Ordinary Differential Equations. Spring dx dt = x + 3y dy dt = x y. FINAL EXAM MATH0 Theory of Ordinary Differential Equations There are 5 problems on 2 pages. Spring 2009. 25 points Consider the linear plane autonomous system x + y x y. Find a fundamental matrix of the

More information

Lecture 11 : Overview

Lecture 11 : Overview Lecture 11 : Overview Error in Assignment 3 : In Eq. 1, Hamiltonian should be H = p2 r 2m + p2 ϕ 2mr + (p z ea z ) 2 2 2m + eφ (1) Error in lecture 10, slide 7, Eq. (21). Should be S(q, α, t) m Q = β =

More information

Majid Sodagar, 1 Patrick Chang, 1 Edward Coyler, 1 and John Parke 1 School of Physics, Georgia Institute of Technology, Atlanta, Georgia 30332, USA

Majid Sodagar, 1 Patrick Chang, 1 Edward Coyler, 1 and John Parke 1 School of Physics, Georgia Institute of Technology, Atlanta, Georgia 30332, USA Experimental Characterization of Chua s Circuit Majid Sodagar, 1 Patrick Chang, 1 Edward Coyler, 1 and John Parke 1 School of Physics, Georgia Institute of Technology, Atlanta, Georgia 30332, USA (Dated:

More information

PHASE-LOCKED SOLUTIONS IN A HUB CONNECTED OSCILLATOR RING NETWORK

PHASE-LOCKED SOLUTIONS IN A HUB CONNECTED OSCILLATOR RING NETWORK Copyright c 29 by ABCM PHASE-LOCKED SOLUTIONS IN A HUB CONNECTED OSCILLATOR RING NETWORK Jacqueline Bridge, Jacqueline.Bridge@sta.uwi.edu Department of Mechanical Engineering, The University of the West

More information

Advanced Heat and Mass Transfer by Amir Faghri, Yuwen Zhang, and John R. Howell

Advanced Heat and Mass Transfer by Amir Faghri, Yuwen Zhang, and John R. Howell Advanced Heat and Mass Transfer by Amir Faghri, Yuwen Zhang, and John R. Howell 9.2 The Blackbody as the Ideal Radiator A material that absorbs 100 percent of the energy incident on it from all directions

More information

{ } is an asymptotic sequence.

{ } is an asymptotic sequence. AMS B Perturbation Methods Lecture 3 Copyright by Hongyun Wang, UCSC Recap Iterative method for finding asymptotic series requirement on the iteration formula to make it work Singular perturbation use

More information

Neural Excitability in a Subcritical Hopf Oscillator with a Nonlinear Feedback

Neural Excitability in a Subcritical Hopf Oscillator with a Nonlinear Feedback Neural Excitability in a Subcritical Hopf Oscillator with a Nonlinear Feedback Gautam C Sethia and Abhijit Sen Institute for Plasma Research, Bhat, Gandhinagar 382 428, INDIA Motivation Neural Excitability

More information

Mean Field Competitive Binary MDPs and Structured Solutions

Mean Field Competitive Binary MDPs and Structured Solutions Mean Field Competitive Binary MDPs and Structured Solutions Minyi Huang School of Mathematics and Statistics Carleton University Ottawa, Canada MFG217, UCLA, Aug 28 Sept 1, 217 1 / 32 Outline of talk The

More information

Learning Cycle Linear Hybrid Automata for Excitable Cells

Learning Cycle Linear Hybrid Automata for Excitable Cells Learning Cycle Linear Hybrid Automata for Excitable Cells Sayan Mitra Joint work with Radu Grosu, Pei Ye, Emilia Entcheva, I V Ramakrishnan, and Scott Smolka HSCC 2007 Pisa, Italy Excitable Cells Outline

More information

46th IEEE CDC, New Orleans, USA, Dec , where θ rj (0) = j 1. Re 1 N. θ =

46th IEEE CDC, New Orleans, USA, Dec , where θ rj (0) = j 1. Re 1 N. θ = Proceedings of the 46th IEEE Conference on Decision and Control New Orleans, LA, USA, Dec. 12-14, 2007 INVITED SESSION NUMBER: 157 Event-Based Feedback Control of Nonlinear Oscillators Using Phase Response

More information

On the Stability of the Best Reply Map for Noncooperative Differential Games

On the Stability of the Best Reply Map for Noncooperative Differential Games On the Stability of the Best Reply Map for Noncooperative Differential Games Alberto Bressan and Zipeng Wang Department of Mathematics, Penn State University, University Park, PA, 68, USA DPMMS, University

More information

Analysis of Systems with State-Dependent Delay

Analysis of Systems with State-Dependent Delay Analysis of Systems with State-Dependent Delay Matthew M. Peet Arizona State University Tempe, AZ USA American Institute of Aeronautics and Astronautics Guidance, Navigation and Control Conference Boston,

More information

AMATH 353 Lecture 9. Weston Barger. How to classify PDEs as linear/nonlinear, order, homogeneous or non-homogeneous.

AMATH 353 Lecture 9. Weston Barger. How to classify PDEs as linear/nonlinear, order, homogeneous or non-homogeneous. AMATH 353 ecture 9 Weston Barger 1 Exam What you need to know: How to classify PDEs as linear/nonlinear, order, homogeneous or non-homogeneous. The definitions for traveling wave, standing wave, wave train

More information

On the Response of Neurons to Sinusoidal Current Stimuli: Phase Response Curves and Phase-Locking

On the Response of Neurons to Sinusoidal Current Stimuli: Phase Response Curves and Phase-Locking On the Response of Neurons to Sinusoidal Current Stimuli: Phase Response Curves and Phase-Locking Michael J. Schaus and Jeff Moehlis Abstract A powerful technique for analyzing mathematical models for

More information

Multi-dimensional Stochastic Singular Control Via Dynkin Game and Dirichlet Form

Multi-dimensional Stochastic Singular Control Via Dynkin Game and Dirichlet Form Multi-dimensional Stochastic Singular Control Via Dynkin Game and Dirichlet Form Yipeng Yang * Under the supervision of Dr. Michael Taksar Department of Mathematics University of Missouri-Columbia Oct

More information

Almost Sure Convergence of Two Time-Scale Stochastic Approximation Algorithms

Almost Sure Convergence of Two Time-Scale Stochastic Approximation Algorithms Almost Sure Convergence of Two Time-Scale Stochastic Approximation Algorithms Vladislav B. Tadić Abstract The almost sure convergence of two time-scale stochastic approximation algorithms is analyzed under

More information

Mean Field Control: Selected Topics and Applications

Mean Field Control: Selected Topics and Applications Mean Field Control: Selected Topics and Applications School of Mathematics and Statistics Carleton University Ottawa, Canada University of Michigan, Ann Arbor, Feb 2015 Outline of talk Background and motivation

More information

Control of McKean-Vlasov Dynamics versus Mean Field Games

Control of McKean-Vlasov Dynamics versus Mean Field Games Control of McKean-Vlasov Dynamics versus Mean Field Games René Carmona (a),, François Delarue (b), and Aimé Lachapelle (a), (a) ORFE, Bendheim Center for Finance, Princeton University, Princeton, NJ 8544,

More information

Lecture 5. Outline: Limit Cycles. Definition and examples How to rule out limit cycles. Poincare-Bendixson theorem Hopf bifurcations Poincare maps

Lecture 5. Outline: Limit Cycles. Definition and examples How to rule out limit cycles. Poincare-Bendixson theorem Hopf bifurcations Poincare maps Lecture 5 Outline: Limit Cycles Definition and examples How to rule out limit cycles Gradient systems Liapunov functions Dulacs criterion Poincare-Bendixson theorem Hopf bifurcations Poincare maps Limit

More information

Dynamical Systems & Lyapunov Stability

Dynamical Systems & Lyapunov Stability Dynamical Systems & Lyapunov Stability Harry G. Kwatny Department of Mechanical Engineering & Mechanics Drexel University Outline Ordinary Differential Equations Existence & uniqueness Continuous dependence

More information

Effects of Various Uncertainty Sources on Automatic Generation Control Systems

Effects of Various Uncertainty Sources on Automatic Generation Control Systems Effects of Various Uncertainty Sources on Automatic Generation Control Systems D. Apostolopoulou, Y. C. Chen, J. Zhang, A. D. Domínguez-García, and P. W. Sauer University of Illinois at Urbana-Champaign

More information

Ordinary Differential Equations (ODEs) Background. Video 17

Ordinary Differential Equations (ODEs) Background. Video 17 Ordinary Differential Equations (ODEs) Background Video 17 Daniel J. Bodony Department of Aerospace Engineering University of Illinois at Urbana-Champaign In this video you will learn... 1 What ODEs are

More information

Linear-Quadratic-Gaussian (LQG) Controllers and Kalman Filters

Linear-Quadratic-Gaussian (LQG) Controllers and Kalman Filters Linear-Quadratic-Gaussian (LQG) Controllers and Kalman Filters Emo Todorov Applied Mathematics and Computer Science & Engineering University of Washington Winter 204 Emo Todorov (UW) AMATH/CSE 579, Winter

More information

Chaotic motion. Phys 750 Lecture 9

Chaotic motion. Phys 750 Lecture 9 Chaotic motion Phys 750 Lecture 9 Finite-difference equations Finite difference equation approximates a differential equation as an iterative map (x n+1,v n+1 )=M[(x n,v n )] Evolution from time t =0to

More information

The correlation between stochastic resonance and the average phase-synchronization time of a bistable system driven by colour-correlated noises

The correlation between stochastic resonance and the average phase-synchronization time of a bistable system driven by colour-correlated noises Chin. Phys. B Vol. 19, No. 1 (010) 01050 The correlation between stochastic resonance and the average phase-synchronization time of a bistable system driven by colour-correlated noises Dong Xiao-Juan(

More information

Solutions 2: Simple Harmonic Oscillator and General Oscillations

Solutions 2: Simple Harmonic Oscillator and General Oscillations Massachusetts Institute of Technology MITES 2017 Physics III Solutions 2: Simple Harmonic Oscillator and General Oscillations Due Wednesday June 21, at 9AM under Rene García s door Preface: This problem

More information

Actor-critic methods. Dialogue Systems Group, Cambridge University Engineering Department. February 21, 2017

Actor-critic methods. Dialogue Systems Group, Cambridge University Engineering Department. February 21, 2017 Actor-critic methods Milica Gašić Dialogue Systems Group, Cambridge University Engineering Department February 21, 2017 1 / 21 In this lecture... The actor-critic architecture Least-Squares Policy Iteration

More information

Mean-Field optimization problems and non-anticipative optimal transport. Beatrice Acciaio

Mean-Field optimization problems and non-anticipative optimal transport. Beatrice Acciaio Mean-Field optimization problems and non-anticipative optimal transport Beatrice Acciaio London School of Economics based on ongoing projects with J. Backhoff, R. Carmona and P. Wang Thera Stochastics

More information