Learning Automata in Games with Memory with Application to Circuit-Switched Routing

Size: px
Start display at page:

Download "Learning Automata in Games with Memory with Application to Circuit-Switched Routing"

Transcription

1 Learning Automata in Games with Memory with Application to Circuit-Switche Routing Murat Alanyali Abstract A general setting is consiere in which autonomous users interact by means of a finite-state controlle Marov process. This process is riven by the collective actions of all users, an iniviual users receive separate rewars accoring to its state. It is assume that each user chooses its actions via a reinforcement learning algorithm base on its local information. The ynamic behavior of user strategies is characterize for small values of a step-size parameter aopte in learning. The general form of equilibria is obtaine an is shown to be analogous to Warrop equilibria if users upate their strategies on a faster time-scale compare to the unerlying process. The results are illustrate in the context of routing in circuit-switche communication networs. I. INTRODUCTION We consier a ynamic setting in which a collection of users interact by means of an unerlying Marov process that is controlle by the collective actions of all users. The state space of this process is assume to be finite but it is arbitrary otherwise. The process may, for example, represent a suitable state escriptor for a share resource. Iniviual users receive possibly ifferent rewars epening on the state of this process. The users are not require to be aware of the unerlying process, or the presence of other users. In particular it is assume that each user autonomously follows a ynamic ranomize strategy that amounts to reinforcement learning base on its local information, which is the history of its own actions an own rewars. The outline scenario is motivate by istribute moels of routing an flow control in communication networs, where an iniviual networ user may gain by eviating from socially oriente principles. The framewor aopte in this paper iffers from that of classical game theory in two aspects. Namely, it is assume here that rewars are etermine by the whole history of actions ue to the Marovian nature of the unerlying process, an that users operate with ite information about the nature of their interaction. In view of the latter assumption we restrict attention to user strategies that involve estimating consequences of available actions, an moel iniviual user behavior by reinforcement learning which arises as a plausible moel in a variety of relate contexts [5], [11]. The main contribution of the paper is an approximation of user strategies that is asymptotically exact for small values of a step-size parameter aopte in learning. In particular This wor was supporte by the NSF CAREER Program uner grant ANI Department of Electrical an Computer Engineering, an The Center for Information an Systems Engineering, Boston University, 8 Saint Mary s Street, Boston, MA 02215, USA. alanyali@bu.eu the transient behavior of user strategies is approximate via a system of orinary ifferential equations. The it ynamics amits a more intuitive interpretation when users upate their strategies on a fast time scale. In particular it is shown that equilibria of the it ynamics are then analogous to Warrop equilibria [14] that arise in the context of transportation networs. Interpretation of the aopte moel an the obtaine results is illustrate on an aaptation of a istribute circuit-switche networ architecture ue to Kelly [7]. We show that when autonomous source-estination pairs employ reinforcement learning to aaptively route incoming calls, the it ynamics locally maximizes a social welfare function, namely rate of networ-wie revenue generation ue to amitte calls. The outline of the paper is as follows. Section II escribes the general moel, an the main results are given in Section III. Application to aaptive routing in circuit-switche networs is iscusse in Section IV. The paper conclues with final remars in Section V. II. MODEL Consier a set U of users that share a collection of resources. We refer to the totality of resources as the networ. Suppose that each user has a finite number of alternative ways to access the networ. We refer to each alternative way as an action an enote by Au the set of actions available to user u U. A U-tuple αu : u U that specifies one action αu Au from the action set of each user u is calle an action profile. Let A = u U Au enote the set of action profiles. Consier further a ynamic situation in which the state of the networ evolves accoring to the actions taen by the users. Let Ξ enote the state space of the networ an let X t Ξ enote the state of the networ at time t 0. We assume that Ξ is finite but arbitrary otherwise, an that the networ process X t : t 0 is a controlle Marov process. To provie a complete specification of X t : t 0 let α t u enote the action exercise by user u U at time t an let α t = α t u : u U. For each action profile α A there is a generator matrix Gα = [q ξ,ξ α] Ξ Ξ such that 1 h 0 h P X t+h = ξ X t = ξ, α t = α = q ξ,ξ α, ξ ξ. We shall assume that Gα is irreucible for each α A. Suppose that each user receives a ranom rewar whose istribution epens on the networ state an the exercise action by that user. More precisely, for each user u U an

2 t 0 let θ t u enote the total rewar receive by user u over the time interval [0, t. Let g + u, g u : Ξ Au R + be positive value functions an set g u ξ, a = g + u ξ, a g u ξ, a for ξ Ξ, a Au. We shall assume that for each ξ Ξ, a Au an integer, 1 h 0 h P θ t+hu = + j θ t u =, X t = ξ, α t u = a g u + ξ, a if j = 1 = gu ξ, a if j = 1 0 if j { 1, 0, 1}. In particular g u ξ, a is the mean rewar receive by user u per unit of time uring which the networ is at state ξ an the user exercises action a. We concentrate here on the case when users follow ranomize strategies in exercising their actions. Some efinitions are neee before giving a formal escription of user strategies. For each u U let S u be the space of probability vectors on the set Au. Set S = u U S u, so that S = pu, a : u U, a Au, pu, a 0 an a Au pu, a = 1 for u U. For p S let pu enote the probability vector pu, a : a Au S u. Suppose that each user u U maintains a ata structure zu, Qu where zu = zu, a : a Au R Au an Qu = Qu, a : a Au S u. This ata structure is ynamically maintaine by upating at iscrete time instants. The upate proceure is mae precise in the following paragraphs. At each upate instant the user also selects an action ranomly with respect to the istribution Qu, an it exercises this action until the next upate instant. Hence the ata structure Qu bears a ranomize action policy for user u. The ata structure zu will be seen to represent estimates of mean rewars ue to ifferent actions. Let z t u, Q t u enote the content of zu, Qu at time t 0. Each user u U upates its ata structure at the jump instants of a Poisson process with rate γu > 0. The Poisson clocs of ifferent users are mutually inepenent; so users operate asynchronously. Let t u enote the time of th upate of user u with t u 0 = 0. For each integer 1 let I u = I u, a : a Au be a binary ranom vector with exactly one nonzero entry. Given Q t u 1 u the vector I u is rawn inepenently of the prior history before time t u 1, so that P I u, a = 1 Q t u 1 u = Q t u 1 u, a, a Au. The vector I u ientifies the action exercise by user u over the interval [t u, tu +1, that is, I u, a = 1 if an only if α t u = a for t [t u, tu +1. It is assume that an arbitrary action is exercise in [0, t u 1 an for completeness let I 0 u ientify that action. For t 0 let Θ t u enote the total rewar receive by user u since its last policy upate before time t. In particular for 0 Θ t u +1 u = θ t u +1 u θ t u u. Let β 0, 1 be a fixe constant. The ata structure zu is upate at time t u +1 by setting z 0u, a = 0 an z t u +1 u, a = z t u u, a+β Θ t u +1 u z t u u, a I u, a 1 for each a Au. Note that zu, a changes its value at time t u +1 only if a is the action exercise over the interval [t u, tu +1. As I u, a = 1 an the action yiels rewar Θ t u +1 u in that case, the upate rule is inee exponential averaging of the associate rewar. In turn zu, a aims to trac the expecte rewar ue to action a, which is typically non-stationary owing to the ynamic strategies followe by other users. Upating the ranomize strategy Qu involves two entries. For positive integer let J u = J u, a : a Au be a ranom binary vector such that given Q t u 1 u, J u is conitionally inepenent with an ientically istribute as I u. Let n > 0 be a real parameter common to all users. The probability vector Qu is upate at time t u +1 by setting { ˆQt u Q t u +1 u, a = u, a if ˆQ +1 t u +1 u S u 2 Q t u u, a otherwise, where ˆQu, a is an auxiliary variable that satisfies ˆQ t u +1 u, a = Q t u u, a + 1 n z t u u, a z t u u, a 3 if I +1 u, a = J +1 u, a = 1 or I +1 u, a = J +1 u, a = 1. In wors, the vectors I +1 u an J +1 u ientify the two entries of Qu that are upate at time t u +1. If a, a Au are such that I +1 u, a = 1 an J +1 u, a = 1 then Qu, a is increase by an amount n 1 z t u u, a z t u u, a an Qu, a is ecrease by the same amount, provie that all entries of the resulting probability vector are strictly positive. Otherwise Qu is not upate. The ecision expresse by equality 2 assures that Qu represents a probability vector at all times. The algorithm ientifie by equalities 1 2 amounts to reinforcement learning on the part of each user: If the value of Qu, a changes its value ue to either I +1 u, a = 1 or J +1 u, a = 1 then the expecte change, conitione on Q t u u, is given by n 1 z t u u, a a Au z t uu, a Q t u u, a. Hence if z t u u is a reliable estimate of the expecte rewars of iniviual actions in Au, then the upate rule 3 tens to increase the probabilities of actions that yiel larger-than-average expecte rewars. Reliability of the estimates in zu : u U is achieve by choosing the parameter n > 0 large. In fact large values of n lea to separation in the time scales of the estimates zu : u U an the policies Qu : u U. Namely, each probability istribution in Qu : u U changes by an amount On 1 per upate; therefore the estimates zu : u U are upate many times before the policies Qu : u U change their value significantly, an in turn Qu : u U are upate base on reliable estimates.

3 The following section quantifies this interplay between the estimates an the ranomize policies. III. MAIN RESULTS For t 0 efine the vector-value variable Q t = Q t u : u U. We next characterize the time-scale process Q nt : t 0 in the large n it. For u U an a Au efine the mapping T u,a : A A so that the image α = T u,a α of α A satisfies α u = a an α u = αu for u u. For p S let ϕ p enote an equilibrium istribution for the Marov process that taes values in Ξ A, an has generator Ĝp = [ˆq ξ,α,ξ,α ] Ξ A Ξ A with off-iagonal entries q ξ,ξ α if ξ ξ, α = α ˆq ξ,α,ξ,α = γupu, a if ξ = ξ, α = T u,a α 0 otherwise. 4 Note that ϕ p is uniquely ientifie ue to the irreucibility of generators Gα : α A. Let 1{ } be the inicator function whose value is 1 if its argument is correct, an it is 0 otherwise. Define the probability istribution νp u on Ξ Au by setting ν u p ξ, a = α A 1{αu = a}ϕ p ξ, α, ξ Ξ, a Au. Given π o S, let π t : t 0 be the trajectory in S such that for each u U an a Au t π tu, a = 2 µ πt u, a π t u, a µ πt u, a, with π 0 u, a = π o u, a an a Au µ πt u, a = ξ Ξ g u ξ, aν u π t ξ, a. The right han sie of equality 5 is well-efine an Lipschitz continuous, an the trajectory π t : t 0 is uniquely ientifie. Let S be enowe with metric such that p, p = max{ pu, a p u, a : u U, a Au} for p, p S. Define Q n t = Q nt for t 0. The following theorem states that, for large values of n, the trajectories of Q n t : t 0 are well-approximate by the solution of 5. Theorem 3.1: Let π o S an let π t : t 0 solve equations 5. If n Qn 0 = π o then for each ε, T > 0 n P in probability, sup Q n t, π t > ε 0 t T = 0. It appears ifficult to obtain an explicit expression for the solutions of equations 5, or to mae general qualitative observations about their ynamic behavior. However the following example suggests the possibility of sophisticate 5 behavior, by illustrating that for certain choices of parameters 5 reuces to the replicator ynamics, which may possess unstable equilibria an it cycles [4], [12]. Example 3.1: Replicator ynamics Suppose that the state of the networ is ientical to the current action profile. That is, Ξ = A an for α A the generator matrix Gα satisfies q α,ξ α = 0 an q ξ,α α 1 for ξ Ξ, ξ α, so that, for the purposes of this example, we may tae X t = α t for t 0. Then for p S, ξ Ξ, α A, u U an a Au ϕ p ξ, α = 1{ξ = α} u U pu, αu ν u p ξ, a = pu, a1{ξu = a} µ p u, a = pu, a α A:αu=a u U {u} g u α, a Let the mapping g u : A R be such that pu, ξu u U {u} g u α = g u α, αu, α A. pu, αu. Let µ p u, a be the expecte value of g u when user u exercises action a an each user u u inepenently ranomizes its action accoring to the probability istribution pu. That is, µ p u, a = g u α pu, αu. α A:αu=a Direct substitution in 5 yiels t π tu, a = 2π t u, a µ πt u, a a Au u U {u} π t u, a µ πt u, a, 6 which is the replicator ynamics associate with the normalform game specifie by action sets Au : u U an payoff functions g u : u U. A. Special case: Agile users We next consier the it ynamics 5 when the users are agile in the sense that they upate their strategies on a faster time-scale than the networ process evolves. Namely in this section it is assume that γu max q ξ,ξα for all users u U. 7 ξ Ξ, α A This assumption leas to a more intuitive interpretation of 5, an it appears suitable when, for example, the networ process is comprise of ynamically maintaine estimates that are obtaine via averaging. Such estimates are employe in typical rate-base control mechanisms in pacet ata networs, as well as in the circuit-switche networ application examine in Section IV.

4 Suppose that conition 7 hols, so that for p S the generator matrix Ĝp efine by equality 4 is nearlyecomposable [3]. That is, Ĝp has a roughly blociagonal structure in which each iagonal bloc is associate with a istinct ξ Ξ, an the entries of noniagonal blocs are small in magnitue relative to those of the iagonal blocs. For p S the probability istribution ϕ p is then well-approximate by ϕ p ξ, α ν p ξ u U pu, αu, 8 where ν p is the unique equilibrium istribution of the Marov process that taes values in Ξ an is generate by α A Gα u U pu, αu. In other wors ν p is the equilibrium istribution of the networ state in the case when each user u U inepenently ranomizes its actions accoring to the static istribution pu. The approximation 8 reflects a separation of time scales where one component of the process fluctuates much faster than the other, so that transition probabilities of the slow component are etermine by the equilibrium istribution of the fast component. The reaer is referre to [3] for a etaile iscussion an explicit error bouns that apply to the approximation 8. Here we appeal in particular to [3, Section 2.1] to point out that the approximation error is vanishingly small for large values of min{γu : u U}. Let g u ν p, a enote the expecte value of g u, a with respect to the istribution ν p. That is, g u ν p, a = ξ Ξ g u ξ, aν p ξ. 9 Let the trajectory ˆπ t : t 0 in S be efine by t ˆπ tu, a = 2ˆπ t u, a g u νˆπt, a a Au ˆπ t u, a g u νˆπt, a, 10 with ˆπ 0 u, a = π o u, a, for u U, a Au. The approximation 8 leas via irect substitution to ν u p ξ, a pu, aν p ξ µ p u, a pu, a ξ Ξ g u ξ, aν p ξ, an equation 5 reuces to equation 10 provie that the approximations above are exact. This observation is formalize in the following proposition. Proposition 3.1: Given ε, T > 0 there exists γ > 0 such that if min u U γu > γ then sup ˆπ t, π t ε. 0 t T Proposition 3.1, together with Theorem 3.1, leas to the following conclusion: Corollary 3.1: Let ˆπ o S an let ˆπ t : t 0 solve equations 10. If n Q n 0 = ˆπ o in probability, then for each ε, T > 0 there exists γ > 0 such that if min u U γu > γ then P sup Q n t, ˆπ t > ε = 0. n 0 t T In interpreting equations 10 note that g u νˆπt, a is the instantaneous rate that user u accumulates rewars per unit of time uring which it exercises action a an the networ process is in statistical equilibrium uner static strategies ˆπ t u : u U. Hence the probability of exercising an action tens to increase if an only if the action has a betterthan-average rewar rate, in the sense escribe above, for the user. Although Corollary 3.1 establishes a rigorous connection between Q n t : t 0 an ˆπ t : t 0 only over finite intervals, we continue by examining asymptotic properties of ˆπ t : t 0 in orer to gain insight on the equilibrium regime of Q n t : t 0 1. The following lemma provies a necessary conition for Lyapunov stability. Lemma 3.1: If p S is Lyapunov stable uner 10 then for each u U g u ν p, a = max g uν p, a whenever pu, a > a Au Conition 11 is reminiscent of Warrop equilibrium [14], which, in informal terms, refers to an assignment of flows to routes on a graph so that all flows between a given pair of noes experience the same elay an no other route for that pair has smaller elay. We next appeal to the literature on potential games an aapt [13, Lemma 4.1] to obtain a sufficient conition for stability. The following proposition ientifies rewar rates g u : u U so that the ynamics 10 amits a prescribe Lyapunov function. Proposition 3.2: Suppose that there exists a continuously ifferentiable function V : S R such that for each p S, V p pu, a = g uν p, a, u U, a Au. 12 Then t V ˆπ t 0, with equality if an only if ˆπ t is an equilibrium for 10. Isolate local maxima of V are asymptotically stable, an p S is a local maximum of V if it satisfies conition 11. IV. APPLICATION: ADAPTIVE ROUTING This section aims to illustrate interpretation of the results in a circuit-switche communication networ comprise of finite-capacity lins see, for example, [1] for another illustration in the context of pacet routing. The set of lins is enote by L, an users U of the networ are source-estination pairs that provie ata traffic. For each user u U, the action set Au enotes a set of alternate 1 A rigorous connection between the equilibrium istribution of Q n t : t 0 an stable equilibria of 5 entails more conservative reflection at the bounary of S. This irection is not pursue here in orer to eep the exposition simple.

5 routes for user u, in particular each a Au is a subset of L. The capacity κl of lin l L enotes the number of channels available at that lin. Suppose that each user u U receives a Poisson stream of connection requests at rate λu > 0 requests per unit time, an each request is to be route along one of the alternate routes in Au. A request that arrives at time t is assigne to the alternate route ientifie by α t u. If each lin on that route has at least one free channel, then the request is route along the route, otherwise the request is bloce hence a request may be bloce even though there are free alternate routes at the time of arrival an it is lost. Once assigne to a route, a connection remains in the system for the uration of its holing time uring which it simultaneously reserves one channel from each lin on its original route. Connection holing times are exponentially istribute with mean 1, inepenently of the history prior to the arrival time. The traffic moel escribe above has been rigourously stuie in the context of telephone networs. In particular Kelly [6] consiers the case when each Au is a singleton, in an asymptotic regime where for positive numbers λ o u : u U an κ o l : l L λu = mλ o u, κl = mκ o l, m 1, 13 an provies an approximation for the connection-blocing probabilities that is asymptotically exact in the large m it. This approximation is referre to as the reuce loa approximation an it will be restate in the following paragraphs. When alternative routes are present, it is nown that intuitively appealing routing rules may lea to instability in the networ [8], as well as to instances of Braess paraox [2]. Reinforcement-learning base ecentralize routing schemes were consiere for alternate routing in telephone networs by [9], [10]. The empirical evience therein suggests that employing learning automata leas to equalization of blocing probabilities on alternate routes. A connection between networ-wie optimality an user behavior was establishe by [7] in terms of certain parameters, referre to as shaow prices, which were envisione to be compute by the networ an then conveye to the users who execute ecentralize hill-escent algorithms base on these parameters. The istribute architecture outline in this section is also base on shaow prices. The reaer may fin the review [8] of circuit-switching networs helpful in following the evelopment of ieas in this section. We first recite two relevant results from the literature of capacity-ite circuit-switching networs, in the notation of the present section. The first result concerns asymptotic exactness of an approximate expression for the connectionblocing probabilities: Let p S an suppose for the moment that each user u U aopts the static ranomize routing policy pu. That is, each user u U routes an arriving request on route a Au with probability pu, a, inepenently of the prior history. The connection-blocing probability on each route can then be etermine in the framewor of [6] in the following fashion. Define κ 1 Bρ, κ = ρκ ρ i, ρ > 0, κ Z +, κ! i! i=0 an ientify a real value vector b p l, ρ p l : l L that solves the equalities b p l = Bρ p l, κl ρ p l = 1{l a}λupu, a u U a Au l a {l} 1 b p l. Brouwer s fixe point theorem guarantees existence of solutions, furthermore the vector b p l : l L is unique [6]. The reuce loa approximation refers to approximating the connection acceptance probability on route a by the prouct l a 1 b pl. In essence, this approximation is base on the assumption that lin loas are statistically inepenent in equilibrium. Although this assumption is generally inaccurate, the reuce loa approximation is asymptotically exact in the large m it in 13 [6]. For u U, a Au set λ p u, a = λupu, a l a1 b p l, which is interprete as the rate of accepte requests on route a uner the reuce loa approximation. The secon recite result pertains to the sensitivity of the approximate blocing probabilities arising from the reuce loa approximation: Let wu, a : u U, a Au be arbitrary but fixe nonnegative numbers, an let s p l : l L solve s p l = h p l 1{l a}λupu, a u U a Au wu, a s p l, l L, where l a {l} h p l = Bρ pl, κl 1 Bρ p l, κl. 1 b p l The quantities s p l : l L are coine as shaow prices by Kelly [7]. We refer the reaer to [7] for an intuitive interpretation of shaow prices. For the purposes of the present iscussion it is enough to note that [7, Equation 1.6] translates to wu, a λ p u, a = pu, a u U a Au wu, a l a s p l λu l a1 b p l 14 for u U, a Au. We now turn to aaptive routing an escribe the networ architecture. Suppose that each lin l L ynamically estimates the shaow price s Q n t l : t 0. See for example

6 [7, Sections 4-5] for a recursive proceure for locally estimating the shaow price at each lin, an a iscussion of relate implementation issues. Let Ŝtl enote the estimate of the shaow price s Q n t l for lin l at time t. We shall assume that each estimator is consistent, in the sense that it eventually ientifies the associate shaow price correctly provie that the strategies of all users are ept constant. That is, for each fixe strategy profile p S, 1{ŝl = s p l}ν p ξ = 1, l L. 15 ξ Ξ Let ˆNt u, a be the number of connections on route a of user u that have been establishe but not expire by time t. Aopt the networ process efine by X t = Ŝtl, ˆN t u, a : l L, u U, a Au for t 0. 2 Suppose that for each connection that user u successfully routes along route a, the user charges that connection an amount wu, a, an it is charge by each lin on route a an amount that equals the current estimate of the shaow price of that lin. Users o not receive any rewar for bloce requests. The instantaneous rewar rate of user u ue to choosing route a when the networ is at state ξ = ŝ, ˆn, with ŝ = ŝl : l L an ˆn = ˆnu, a : u U, a Au, is given by g u ξ, a = wu, a l a ŝl λuψu, a, ξ, where ψu, a, ξ is the conitional probability that each lin on route a of user u has at least one free channel given that the networ state is ξ. In particular ψu, a, ξ = 1 if each lin on route a has free capacity uner route assignment ˆn, 0 else. To mae further progress, appeal to conition 13 to assume that the reuce loa approximation is exact, so that the acceptance probability on route a uner the static strategy profile p is given by 1 b p l, p S. 16 ξ Ξ ψu, a, ξν p ξ = l a Equality 9, together with conitions 15-16, implies that g u ν p, a = wu, a l a s p l in turn by equality 14 the mapping V p = wu, aλ p u, a, u U a Au λu l a1 b p l; p S satisfies conition 12. Note that uner the reuce loa approximation the expression for V p enotes the total rate of revenue generation by the users when they aopt the static policies pu : u U, an by Proposition 3.2 the ynamics 10 locally maximizes the total revenue generate by the users in the long-term. We appeal to [7, Section 6] to point out that, in the large m it in 13, local maxima of the mapping V are global maximizers. V. CONCLUSION This paper provies a ynamic escription of user strategies when each user employs a tractable reinforcement learning algorithm to pursue own interests subject to local information. The present setting is motivate by large resource sharing systems, such as ata networs, where control is istribute to users uner the assumption that they are socially responsible. The results of the paper thereby provie a framewor to assess robustness of such architectures against selfish behavior of partially informe users. The results may also be interprete from a control perspective to assess efficiency of autonomous reinforcement learning as a ecentralize control mechanism. In this respect conition 12 provies a guieline for engineering the networ process so that system ynamics amits a prescribe Lyapunov function. REFERENCES [1] M. Alanyali, On non-cooperative interaction via reinforcement learning an its control, 41st Allerton Conference on Communications, Control an Computing. Champaign, [2] N. G. Bean, F. P. Kelly, an P. G. Taylor, Braess paraox in a loss networ, Journal of Applie Probability, vol. 34, pp , [3] P. J. Courtois, Decomposability: Queueing an computer system applications, Acaemic Press, New Yor, [4] J. Hofbauer an K. Siegmun, Evolutionary games an replicator ynamics, Cambrige University Press, [5] E. Hopins, Two competing moels of how people learn in games, Econometrica, vol. 70, no. 6, pp , [6] F. P. Kelly, Blocing probabilities in large circuit-switche networs, Avances in Applie Probability, vol. 18, pp , [7] F. P. Kelly, Routing in circuit-switche networs: optimization, shaow prices an ecentralization, Avances in Applie Probability, vol. 20, pp , [8] F. P. Kelly, Loss networs, The Annals of Applie Probability, vol. 1, no. 3, pp , [9] K. S. Narenra, E. A. Wright, an L. G. Mason, Application of learning automata to telephone traffic routing an control, IEEE Transactions on Systems, Man, an Cybernetics, vol. 7, no. 11, pp , [10] K. S. Narenra an P. Mars, The use of learning algorithms in telephone traffic routing a methoology, Automatica, vol. 19, no. 5, pp , [11] K. S. Narenra an M. A. L. Thathachar, Learning automata: An introuction, Prentice-Hall, New Jersey, [12] K. Ritzberger an J.W. Weibull, Evolutionary selection in normalform games, Econometrica, vol. 63, no. 6, pp , [13] W. B. Sanholm, Potential games with continuous player sets, Journal of Economic Theory, vol. 97, pp , [14] J. G. Warrop, Some theoretical aspects of roa traffic research, Proceeings of the Institute of Civil Engineering, Part 2, vol. 1, pp , [15] J. Weibull, Evolutionary game theory, MIT Press, Cambrige, If the estimator of [7] is employe then a number of auxiliary variables shoul be inclue in the state escriptor of the networ in orer that the networ process complies with the conitions set forth in Section II. Such variables are omitte here for notational convenience.

Equilibrium in Queues Under Unknown Service Times and Service Value

Equilibrium in Queues Under Unknown Service Times and Service Value University of Pennsylvania ScholarlyCommons Finance Papers Wharton Faculty Research 1-2014 Equilibrium in Queues Uner Unknown Service Times an Service Value Laurens Debo Senthil K. Veeraraghavan University

More information

On Characterizing the Delay-Performance of Wireless Scheduling Algorithms

On Characterizing the Delay-Performance of Wireless Scheduling Algorithms On Characterizing the Delay-Performance of Wireless Scheuling Algorithms Xiaojun Lin Center for Wireless Systems an Applications School of Electrical an Computer Engineering, Purue University West Lafayette,

More information

Lyapunov Functions. V. J. Venkataramanan and Xiaojun Lin. Center for Wireless Systems and Applications. School of Electrical and Computer Engineering,

Lyapunov Functions. V. J. Venkataramanan and Xiaojun Lin. Center for Wireless Systems and Applications. School of Electrical and Computer Engineering, On the Queue-Overflow Probability of Wireless Systems : A New Approach Combining Large Deviations with Lyapunov Functions V. J. Venkataramanan an Xiaojun Lin Center for Wireless Systems an Applications

More information

Capacity Analysis of MIMO Systems with Unknown Channel State Information

Capacity Analysis of MIMO Systems with Unknown Channel State Information Capacity Analysis of MIMO Systems with Unknown Channel State Information Jun Zheng an Bhaskar D. Rao Dept. of Electrical an Computer Engineering University of California at San Diego e-mail: juzheng@ucs.eu,

More information

Time-of-Arrival Estimation in Non-Line-Of-Sight Environments

Time-of-Arrival Estimation in Non-Line-Of-Sight Environments 2 Conference on Information Sciences an Systems, The Johns Hopkins University, March 2, 2 Time-of-Arrival Estimation in Non-Line-Of-Sight Environments Sinan Gezici, Hisashi Kobayashi an H. Vincent Poor

More information

How to Minimize Maximum Regret in Repeated Decision-Making

How to Minimize Maximum Regret in Repeated Decision-Making How to Minimize Maximum Regret in Repeate Decision-Making Karl H. Schlag July 3 2003 Economics Department, European University Institute, Via ella Piazzuola 43, 033 Florence, Italy, Tel: 0039-0-4689, email:

More information

6 General properties of an autonomous system of two first order ODE

6 General properties of an autonomous system of two first order ODE 6 General properties of an autonomous system of two first orer ODE Here we embark on stuying the autonomous system of two first orer ifferential equations of the form ẋ 1 = f 1 (, x 2 ), ẋ 2 = f 2 (, x

More information

Lower Bounds for the Smoothed Number of Pareto optimal Solutions

Lower Bounds for the Smoothed Number of Pareto optimal Solutions Lower Bouns for the Smoothe Number of Pareto optimal Solutions Tobias Brunsch an Heiko Röglin Department of Computer Science, University of Bonn, Germany brunsch@cs.uni-bonn.e, heiko@roeglin.org Abstract.

More information

19 Eigenvalues, Eigenvectors, Ordinary Differential Equations, and Control

19 Eigenvalues, Eigenvectors, Ordinary Differential Equations, and Control 19 Eigenvalues, Eigenvectors, Orinary Differential Equations, an Control This section introuces eigenvalues an eigenvectors of a matrix, an iscusses the role of the eigenvalues in etermining the behavior

More information

Lower bounds on Locality Sensitive Hashing

Lower bounds on Locality Sensitive Hashing Lower bouns on Locality Sensitive Hashing Rajeev Motwani Assaf Naor Rina Panigrahy Abstract Given a metric space (X, X ), c 1, r > 0, an p, q [0, 1], a istribution over mappings H : X N is calle a (r,

More information

Slide10 Haykin Chapter 14: Neurodynamics (3rd Ed. Chapter 13)

Slide10 Haykin Chapter 14: Neurodynamics (3rd Ed. Chapter 13) Slie10 Haykin Chapter 14: Neuroynamics (3r E. Chapter 13) CPSC 636-600 Instructor: Yoonsuck Choe Spring 2012 Neural Networks with Temporal Behavior Inclusion of feeback gives temporal characteristics to

More information

Lecture Introduction. 2 Examples of Measure Concentration. 3 The Johnson-Lindenstrauss Lemma. CS-621 Theory Gems November 28, 2012

Lecture Introduction. 2 Examples of Measure Concentration. 3 The Johnson-Lindenstrauss Lemma. CS-621 Theory Gems November 28, 2012 CS-6 Theory Gems November 8, 0 Lecture Lecturer: Alesaner Mąry Scribes: Alhussein Fawzi, Dorina Thanou Introuction Toay, we will briefly iscuss an important technique in probability theory measure concentration

More information

Lectures - Week 10 Introduction to Ordinary Differential Equations (ODES) First Order Linear ODEs

Lectures - Week 10 Introduction to Ordinary Differential Equations (ODES) First Order Linear ODEs Lectures - Week 10 Introuction to Orinary Differential Equations (ODES) First Orer Linear ODEs When stuying ODEs we are consiering functions of one inepenent variable, e.g., f(x), where x is the inepenent

More information

Nested Saturation with Guaranteed Real Poles 1

Nested Saturation with Guaranteed Real Poles 1 Neste Saturation with Guarantee Real Poles Eric N Johnson 2 an Suresh K Kannan 3 School of Aerospace Engineering Georgia Institute of Technology, Atlanta, GA 3332 Abstract The global stabilization of asymptotically

More information

The total derivative. Chapter Lagrangian and Eulerian approaches

The total derivative. Chapter Lagrangian and Eulerian approaches Chapter 5 The total erivative 51 Lagrangian an Eulerian approaches The representation of a flui through scalar or vector fiels means that each physical quantity uner consieration is escribe as a function

More information

Evolutionary Stability of Pure-Strategy Equilibria in Finite Games

Evolutionary Stability of Pure-Strategy Equilibria in Finite Games GAMES AND ECONOMIC BEHAVIOR 2, 253265 997 ARTICLE NO GA970533 Evolutionary Stability of Pure-Strategy Equilibria in Finite Games E Somanathan* Emory Uniersity, Department of Economics, Atlanta, Georgia

More information

Survey Sampling. 1 Design-based Inference. Kosuke Imai Department of Politics, Princeton University. February 19, 2013

Survey Sampling. 1 Design-based Inference. Kosuke Imai Department of Politics, Princeton University. February 19, 2013 Survey Sampling Kosuke Imai Department of Politics, Princeton University February 19, 2013 Survey sampling is one of the most commonly use ata collection methos for social scientists. We begin by escribing

More information

Closed and Open Loop Optimal Control of Buffer and Energy of a Wireless Device

Closed and Open Loop Optimal Control of Buffer and Energy of a Wireless Device Close an Open Loop Optimal Control of Buffer an Energy of a Wireless Device V. S. Borkar School of Technology an Computer Science TIFR, umbai, Inia. borkar@tifr.res.in A. A. Kherani B. J. Prabhu INRIA

More information

inflow outflow Part I. Regular tasks for MAE598/494 Task 1

inflow outflow Part I. Regular tasks for MAE598/494 Task 1 MAE 494/598, Fall 2016 Project #1 (Regular tasks = 20 points) Har copy of report is ue at the start of class on the ue ate. The rules on collaboration will be release separately. Please always follow the

More information

IPA Derivatives for Make-to-Stock Production-Inventory Systems With Backorders Under the (R,r) Policy

IPA Derivatives for Make-to-Stock Production-Inventory Systems With Backorders Under the (R,r) Policy IPA Derivatives for Make-to-Stock Prouction-Inventory Systems With Backorers Uner the (Rr) Policy Yihong Fan a Benamin Melame b Yao Zhao c Yorai Wari Abstract This paper aresses Infinitesimal Perturbation

More information

Chapter 6: Energy-Momentum Tensors

Chapter 6: Energy-Momentum Tensors 49 Chapter 6: Energy-Momentum Tensors This chapter outlines the general theory of energy an momentum conservation in terms of energy-momentum tensors, then applies these ieas to the case of Bohm's moel.

More information

NOTES ON EULER-BOOLE SUMMATION (1) f (l 1) (n) f (l 1) (m) + ( 1)k 1 k! B k (y) f (k) (y) dy,

NOTES ON EULER-BOOLE SUMMATION (1) f (l 1) (n) f (l 1) (m) + ( 1)k 1 k! B k (y) f (k) (y) dy, NOTES ON EULER-BOOLE SUMMATION JONATHAN M BORWEIN, NEIL J CALKIN, AND DANTE MANNA Abstract We stuy a connection between Euler-MacLaurin Summation an Boole Summation suggeste in an AMM note from 196, which

More information

Least-Squares Regression on Sparse Spaces

Least-Squares Regression on Sparse Spaces Least-Squares Regression on Sparse Spaces Yuri Grinberg, Mahi Milani Far, Joelle Pineau School of Computer Science McGill University Montreal, Canaa {ygrinb,mmilan1,jpineau}@cs.mcgill.ca 1 Introuction

More information

Fluid model for a data network with α-fair bandwidth sharing and general document size distributions: two examples of stability

Fluid model for a data network with α-fair bandwidth sharing and general document size distributions: two examples of stability IMS Lecture Notes Monograph Series??? Vol. 0 (0000) 1 c Institute of Mathematical Statistics, 0000 Flui moel for a ata network with α-fair banwith sharing an general ocument size istributions: two examples

More information

'HVLJQ &RQVLGHUDWLRQ LQ 0DWHULDO 6HOHFWLRQ 'HVLJQ 6HQVLWLYLW\,1752'8&7,21

'HVLJQ &RQVLGHUDWLRQ LQ 0DWHULDO 6HOHFWLRQ 'HVLJQ 6HQVLWLYLW\,1752'8&7,21 Large amping in a structural material may be either esirable or unesirable, epening on the engineering application at han. For example, amping is a esirable property to the esigner concerne with limiting

More information

Generalized Tractability for Multivariate Problems

Generalized Tractability for Multivariate Problems Generalize Tractability for Multivariate Problems Part II: Linear Tensor Prouct Problems, Linear Information, an Unrestricte Tractability Michael Gnewuch Department of Computer Science, University of Kiel,

More information

Math 342 Partial Differential Equations «Viktor Grigoryan

Math 342 Partial Differential Equations «Viktor Grigoryan Math 342 Partial Differential Equations «Viktor Grigoryan 6 Wave equation: solution In this lecture we will solve the wave equation on the entire real line x R. This correspons to a string of infinite

More information

Predictive Control of a Laboratory Time Delay Process Experiment

Predictive Control of a Laboratory Time Delay Process Experiment Print ISSN:3 6; Online ISSN: 367-5357 DOI:0478/itc-03-0005 Preictive Control of a aboratory ime Delay Process Experiment S Enev Key Wors: Moel preictive control; time elay process; experimental results

More information

Robust Forward Algorithms via PAC-Bayes and Laplace Distributions. ω Q. Pr (y(ω x) < 0) = Pr A k

Robust Forward Algorithms via PAC-Bayes and Laplace Distributions. ω Q. Pr (y(ω x) < 0) = Pr A k A Proof of Lemma 2 B Proof of Lemma 3 Proof: Since the support of LL istributions is R, two such istributions are equivalent absolutely continuous with respect to each other an the ivergence is well-efine

More information

TOEPLITZ AND POSITIVE SEMIDEFINITE COMPLETION PROBLEM FOR CYCLE GRAPH

TOEPLITZ AND POSITIVE SEMIDEFINITE COMPLETION PROBLEM FOR CYCLE GRAPH English NUMERICAL MATHEMATICS Vol14, No1 Series A Journal of Chinese Universities Feb 2005 TOEPLITZ AND POSITIVE SEMIDEFINITE COMPLETION PROBLEM FOR CYCLE GRAPH He Ming( Λ) Michael K Ng(Ξ ) Abstract We

More information

Games and Economic Behavior

Games and Economic Behavior Games an Economic Behavior 80 013 39 67 Contents lists available at SciVerse ScienceDirect Games an Economic Behavior www.elsevier.com/locate/geb Fast convergence in evolutionary equilibrium selection

More information

Constraint Reformulation and a Lagrangian Relaxation based Solution Algorithm for a Least Expected Time Path Problem Abstract 1.

Constraint Reformulation and a Lagrangian Relaxation based Solution Algorithm for a Least Expected Time Path Problem Abstract 1. Constraint Reformulation an a Lagrangian Relaation base Solution Algorithm for a Least Epecte Time Path Problem Liing Yang State Key Laboratory of Rail Traffic Control an Safety, Being Jiaotong University,

More information

Improved Rate-Based Pull and Push Strategies in Large Distributed Networks

Improved Rate-Based Pull and Push Strategies in Large Distributed Networks Improve Rate-Base Pull an Push Strategies in Large Distribute Networks Wouter Minnebo an Benny Van Hout Department of Mathematics an Computer Science University of Antwerp - imins Mielheimlaan, B-00 Antwerp,

More information

Switching Time Optimization in Discretized Hybrid Dynamical Systems

Switching Time Optimization in Discretized Hybrid Dynamical Systems Switching Time Optimization in Discretize Hybri Dynamical Systems Kathrin Flaßkamp, To Murphey, an Sina Ober-Blöbaum Abstract Switching time optimization (STO) arises in systems that have a finite set

More information

The canonical controllers and regular interconnection

The canonical controllers and regular interconnection Systems & Control Letters ( www.elsevier.com/locate/sysconle The canonical controllers an regular interconnection A.A. Julius a,, J.C. Willems b, M.N. Belur c, H.L. Trentelman a Department of Applie Mathematics,

More information

arxiv: v3 [cs.lg] 3 Dec 2017

arxiv: v3 [cs.lg] 3 Dec 2017 Context-Aware Generative Aversarial Privacy Chong Huang, Peter Kairouz, Xiao Chen, Lalitha Sankar, an Ram Rajagopal arxiv:1710.09549v3 [cs.lg] 3 Dec 2017 Abstract Preserving the utility of publishe atasets

More information

Power Generation and Distribution via Distributed Coordination Control

Power Generation and Distribution via Distributed Coordination Control Power Generation an Distribution via Distribute Coorination Control Byeong-Yeon Kim, Kwang-Kyo Oh, an Hyo-Sung Ahn arxiv:407.4870v [math.oc] 8 Jul 204 Abstract This paper presents power coorination, power

More information

Adaptive Gain-Scheduled H Control of Linear Parameter-Varying Systems with Time-Delayed Elements

Adaptive Gain-Scheduled H Control of Linear Parameter-Varying Systems with Time-Delayed Elements Aaptive Gain-Scheule H Control of Linear Parameter-Varying Systems with ime-delaye Elements Yoshihiko Miyasato he Institute of Statistical Mathematics 4-6-7 Minami-Azabu, Minato-ku, okyo 6-8569, Japan

More information

A note on asymptotic formulae for one-dimensional network flow problems Carlos F. Daganzo and Karen R. Smilowitz

A note on asymptotic formulae for one-dimensional network flow problems Carlos F. Daganzo and Karen R. Smilowitz A note on asymptotic formulae for one-imensional network flow problems Carlos F. Daganzo an Karen R. Smilowitz (to appear in Annals of Operations Research) Abstract This note evelops asymptotic formulae

More information

Abstract A nonlinear partial differential equation of the following form is considered:

Abstract A nonlinear partial differential equation of the following form is considered: M P E J Mathematical Physics Electronic Journal ISSN 86-6655 Volume 2, 26 Paper 5 Receive: May 3, 25, Revise: Sep, 26, Accepte: Oct 6, 26 Eitor: C.E. Wayne A Nonlinear Heat Equation with Temperature-Depenent

More information

A PAC-Bayesian Approach to Spectrally-Normalized Margin Bounds for Neural Networks

A PAC-Bayesian Approach to Spectrally-Normalized Margin Bounds for Neural Networks A PAC-Bayesian Approach to Spectrally-Normalize Margin Bouns for Neural Networks Behnam Neyshabur, Srinah Bhojanapalli, Davi McAllester, Nathan Srebro Toyota Technological Institute at Chicago {bneyshabur,

More information

Mark J. Machina CARDINAL PROPERTIES OF "LOCAL UTILITY FUNCTIONS"

Mark J. Machina CARDINAL PROPERTIES OF LOCAL UTILITY FUNCTIONS Mark J. Machina CARDINAL PROPERTIES OF "LOCAL UTILITY FUNCTIONS" This paper outlines the carinal properties of "local utility functions" of the type use by Allen [1985], Chew [1983], Chew an MacCrimmon

More information

ECE 422 Power System Operations & Planning 7 Transient Stability

ECE 422 Power System Operations & Planning 7 Transient Stability ECE 4 Power System Operations & Planning 7 Transient Stability Spring 5 Instructor: Kai Sun References Saaat s Chapter.5 ~. EPRI Tutorial s Chapter 7 Kunur s Chapter 3 Transient Stability The ability of

More information

Introduction to the Vlasov-Poisson system

Introduction to the Vlasov-Poisson system Introuction to the Vlasov-Poisson system Simone Calogero 1 The Vlasov equation Consier a particle with mass m > 0. Let x(t) R 3 enote the position of the particle at time t R an v(t) = ẋ(t) = x(t)/t its

More information

SYNCHRONOUS SEQUENTIAL CIRCUITS

SYNCHRONOUS SEQUENTIAL CIRCUITS CHAPTER SYNCHRONOUS SEUENTIAL CIRCUITS Registers an counters, two very common synchronous sequential circuits, are introuce in this chapter. Register is a igital circuit for storing information. Contents

More information

VIRTUAL STRUCTURE BASED SPACECRAFT FORMATION CONTROL WITH FORMATION FEEDBACK

VIRTUAL STRUCTURE BASED SPACECRAFT FORMATION CONTROL WITH FORMATION FEEDBACK AIAA Guiance, Navigation, an Control Conference an Exhibit 5-8 August, Monterey, California AIAA -9 VIRTUAL STRUCTURE BASED SPACECRAT ORMATION CONTROL WITH ORMATION EEDBACK Wei Ren Ranal W. Bear Department

More information

Stability Analysis of Nonlinear Networks via M-matrix Theory: Beyond Linear Consensus

Stability Analysis of Nonlinear Networks via M-matrix Theory: Beyond Linear Consensus 2012 American Control Conference Fairmont Queen Elizabeth, Montréal, Canaa June 27-June 29, 2012 Stability Analysis of Nonlinear Networks via M-matrix Theory: Beyon Linear Consensus Airlie Chapman an Mehran

More information

Markov Chains in Continuous Time

Markov Chains in Continuous Time Chapter 23 Markov Chains in Continuous Time Previously we looke at Markov chains, where the transitions betweenstatesoccurreatspecifietime- steps. That it, we mae time (a continuous variable) avance in

More information

Simultaneous Input and State Estimation with a Delay

Simultaneous Input and State Estimation with a Delay 15 IEEE 5th Annual Conference on Decision an Control (CDC) December 15-18, 15. Osaa, Japan Simultaneous Input an State Estimation with a Delay Sze Zheng Yong a Minghui Zhu b Emilio Frazzoli a Abstract

More information

Online ICA: Understanding Global Dynamics of Nonconvex Optimization via Diffusion Processes

Online ICA: Understanding Global Dynamics of Nonconvex Optimization via Diffusion Processes Online ICA: Unerstaning Global Dynamics of Nonconvex Optimization via Diffusion Processes Chris Junchi Li Zhaoran Wang Han Liu Department of Operations Research an Financial Engineering, Princeton University

More information

ensembles When working with density operators, we can use this connection to define a generalized Bloch vector: v x Tr x, v y Tr y

ensembles When working with density operators, we can use this connection to define a generalized Bloch vector: v x Tr x, v y Tr y Ph195a lecture notes, 1/3/01 Density operators for spin- 1 ensembles So far in our iscussion of spin- 1 systems, we have restricte our attention to the case of pure states an Hamiltonian evolution. Toay

More information

Sliding mode approach to congestion control in connection-oriented communication networks

Sliding mode approach to congestion control in connection-oriented communication networks JOURNAL OF APPLIED COMPUTER SCIENCE Vol. xx. No xx (200x), pp. xx-xx Sliing moe approach to congestion control in connection-oriente communication networks Anrzej Bartoszewicz, Justyna Żuk Technical University

More information

7.1 Support Vector Machine

7.1 Support Vector Machine 67577 Intro. to Machine Learning Fall semester, 006/7 Lecture 7: Support Vector Machines an Kernel Functions II Lecturer: Amnon Shashua Scribe: Amnon Shashua 7. Support Vector Machine We return now to

More information

ON THE OPTIMALITY SYSTEM FOR A 1 D EULER FLOW PROBLEM

ON THE OPTIMALITY SYSTEM FOR A 1 D EULER FLOW PROBLEM ON THE OPTIMALITY SYSTEM FOR A D EULER FLOW PROBLEM Eugene M. Cliff Matthias Heinkenschloss y Ajit R. Shenoy z Interisciplinary Center for Applie Mathematics Virginia Tech Blacksburg, Virginia 46 Abstract

More information

The Exact Form and General Integrating Factors

The Exact Form and General Integrating Factors 7 The Exact Form an General Integrating Factors In the previous chapters, we ve seen how separable an linear ifferential equations can be solve using methos for converting them to forms that can be easily

More information

DEGREE DISTRIBUTION OF SHORTEST PATH TREES AND BIAS OF NETWORK SAMPLING ALGORITHMS

DEGREE DISTRIBUTION OF SHORTEST PATH TREES AND BIAS OF NETWORK SAMPLING ALGORITHMS DEGREE DISTRIBUTION OF SHORTEST PATH TREES AND BIAS OF NETWORK SAMPLING ALGORITHMS SHANKAR BHAMIDI 1, JESSE GOODMAN 2, REMCO VAN DER HOFSTAD 3, AND JÚLIA KOMJÁTHY3 Abstract. In this article, we explicitly

More information

Experimental Robustness Study of a Second-Order Sliding Mode Controller

Experimental Robustness Study of a Second-Order Sliding Mode Controller Experimental Robustness Stuy of a Secon-Orer Sliing Moe Controller Anré Blom, Bram e Jager Einhoven University of Technology Department of Mechanical Engineering P.O. Box 513, 5600 MB Einhoven, The Netherlans

More information

Hyperbolic Moment Equations Using Quadrature-Based Projection Methods

Hyperbolic Moment Equations Using Quadrature-Based Projection Methods Hyperbolic Moment Equations Using Quarature-Base Projection Methos J. Koellermeier an M. Torrilhon Department of Mathematics, RWTH Aachen University, Aachen, Germany Abstract. Kinetic equations like the

More information

Exponential Tracking Control of Nonlinear Systems with Actuator Nonlinearity

Exponential Tracking Control of Nonlinear Systems with Actuator Nonlinearity Preprints of the 9th Worl Congress The International Feeration of Automatic Control Cape Town, South Africa. August -9, Exponential Tracking Control of Nonlinear Systems with Actuator Nonlinearity Zhengqiang

More information

Throughput Optimal Control of Cooperative Relay Networks

Throughput Optimal Control of Cooperative Relay Networks hroughput Optimal Control of Cooperative Relay Networks Emun Yeh Dept. of Electrical Engineering Yale University New Haven, C 06520, USA E-mail: emun.yeh@yale.eu Ranall Berry Dept. of Electrical an Computer

More information

Sparse Reconstruction of Systems of Ordinary Differential Equations

Sparse Reconstruction of Systems of Ordinary Differential Equations Sparse Reconstruction of Systems of Orinary Differential Equations Manuel Mai a, Mark D. Shattuck b,c, Corey S. O Hern c,a,,e, a Department of Physics, Yale University, New Haven, Connecticut 06520, USA

More information

Nonlinear Adaptive Ship Course Tracking Control Based on Backstepping and Nussbaum Gain

Nonlinear Adaptive Ship Course Tracking Control Based on Backstepping and Nussbaum Gain Nonlinear Aaptive Ship Course Tracking Control Base on Backstepping an Nussbaum Gain Jialu Du, Chen Guo Abstract A nonlinear aaptive controller combining aaptive Backstepping algorithm with Nussbaum gain

More information

Robustness and Perturbations of Minimal Bases

Robustness and Perturbations of Minimal Bases Robustness an Perturbations of Minimal Bases Paul Van Dooren an Froilán M Dopico December 9, 2016 Abstract Polynomial minimal bases of rational vector subspaces are a classical concept that plays an important

More information

Discrete Hamilton Jacobi Theory and Discrete Optimal Control

Discrete Hamilton Jacobi Theory and Discrete Optimal Control 49th IEEE Conference on Decision an Control December 15-17, 2010 Hilton Atlanta Hotel, Atlanta, GA, USA Discrete Hamilton Jacobi Theory an Discrete Optimal Control Tomoi Ohsawa, Anthony M. Bloch, an Melvin

More information

THE VAN KAMPEN EXPANSION FOR LINKED DUFFING LINEAR OSCILLATORS EXCITED BY COLORED NOISE

THE VAN KAMPEN EXPANSION FOR LINKED DUFFING LINEAR OSCILLATORS EXCITED BY COLORED NOISE Journal of Soun an Vibration (1996) 191(3), 397 414 THE VAN KAMPEN EXPANSION FOR LINKED DUFFING LINEAR OSCILLATORS EXCITED BY COLORED NOISE E. M. WEINSTEIN Galaxy Scientific Corporation, 2500 English Creek

More information

A NONLINEAR SOURCE SEPARATION APPROACH FOR THE NICOLSKY-EISENMAN MODEL

A NONLINEAR SOURCE SEPARATION APPROACH FOR THE NICOLSKY-EISENMAN MODEL 6th European Signal Processing Conference EUSIPCO 28, Lausanne, Switzerlan, August 25-29, 28, copyright by EURASIP A NONLINEAR SOURCE SEPARATION APPROACH FOR THE NICOLSKY-EISENMAN MODEL Leonaro Tomazeli

More information

One-dimensional I test and direction vector I test with array references by induction variable

One-dimensional I test and direction vector I test with array references by induction variable Int. J. High Performance Computing an Networking, Vol. 3, No. 4, 2005 219 One-imensional I test an irection vector I test with array references by inuction variable Minyi Guo School of Computer Science

More information

An M/G/1 Retrial Queue with Priority, Balking and Feedback Customers

An M/G/1 Retrial Queue with Priority, Balking and Feedback Customers Journal of Convergence Information Technology Volume 5 Number April 1 An M/G/1 Retrial Queue with Priority Balking an Feeback Customers Peishu Chen * 1 Yiuan Zhu 1 * 1 Faculty of Science Jiangsu University

More information

TEMPORAL AND TIME-FREQUENCY CORRELATION-BASED BLIND SOURCE SEPARATION METHODS. Yannick DEVILLE

TEMPORAL AND TIME-FREQUENCY CORRELATION-BASED BLIND SOURCE SEPARATION METHODS. Yannick DEVILLE TEMPORAL AND TIME-FREQUENCY CORRELATION-BASED BLIND SOURCE SEPARATION METHODS Yannick DEVILLE Université Paul Sabatier Laboratoire Acoustique, Métrologie, Instrumentation Bât. 3RB2, 8 Route e Narbonne,

More information

A Review of Multiple Try MCMC algorithms for Signal Processing

A Review of Multiple Try MCMC algorithms for Signal Processing A Review of Multiple Try MCMC algorithms for Signal Processing Luca Martino Image Processing Lab., Universitat e València (Spain) Universia Carlos III e Mari, Leganes (Spain) Abstract Many applications

More information

IERCU. Institute of Economic Research, Chuo University 50th Anniversary Special Issues. Discussion Paper No.210

IERCU. Institute of Economic Research, Chuo University 50th Anniversary Special Issues. Discussion Paper No.210 IERCU Institute of Economic Research, Chuo University 50th Anniversary Special Issues Discussion Paper No.210 Discrete an Continuous Dynamics in Nonlinear Monopolies Akio Matsumoto Chuo University Ferenc

More information

Optimistic Planning with Long Sequences of Identical Actions: An Extended Theoretical and Experimental Study

Optimistic Planning with Long Sequences of Identical Actions: An Extended Theoretical and Experimental Study Volume 56, Number 1-2, 2015 27 Optimistic Planning with Long Sequences of Ientical Actions: An Extene Theoretical an Experimental Stuy Koppány Máthé, Lucian Bușoniu, Liviu Miclea epartment of Automation,

More information

arxiv: v2 [math.pr] 27 Nov 2018

arxiv: v2 [math.pr] 27 Nov 2018 Range an spee of rotor wals on trees arxiv:15.57v [math.pr] 7 Nov 1 Wilfrie Huss an Ecaterina Sava-Huss November, 1 Abstract We prove a law of large numbers for the range of rotor wals with ranom initial

More information

Optimal Control of Spatially Distributed Systems

Optimal Control of Spatially Distributed Systems Optimal Control of Spatially Distribute Systems Naer Motee an Ali Jababaie Abstract In this paper, we stuy the structural properties of optimal control of spatially istribute systems. Such systems consist

More information

CONTROL CHARTS FOR VARIABLES

CONTROL CHARTS FOR VARIABLES UNIT CONTOL CHATS FO VAIABLES Structure.1 Introuction Objectives. Control Chart Technique.3 Control Charts for Variables.4 Control Chart for Mean(-Chart).5 ange Chart (-Chart).6 Stanar Deviation Chart

More information

Conservation Laws. Chapter Conservation of Energy

Conservation Laws. Chapter Conservation of Energy 20 Chapter 3 Conservation Laws In orer to check the physical consistency of the above set of equations governing Maxwell-Lorentz electroynamics [(2.10) an (2.12) or (1.65) an (1.68)], we examine the action

More information

Statics. There are four fundamental quantities which occur in mechanics:

Statics. There are four fundamental quantities which occur in mechanics: Statics Mechanics isabranchofphysicsinwhichwestuythestate of rest or motion of boies subject to the action of forces. It can be ivie into two logical parts: statics, where we investigate the equilibrium

More information

Perturbation Analysis and Optimization of Stochastic Flow Networks

Perturbation Analysis and Optimization of Stochastic Flow Networks IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. XX, NO. Y, MMM 2004 1 Perturbation Analysis an Optimization of Stochastic Flow Networks Gang Sun, Christos G. Cassanras, Yorai Wari, Christos G. Panayiotou,

More information

I m Doing as Well as I Can: Modeling People as Rational Finite Automata

I m Doing as Well as I Can: Modeling People as Rational Finite Automata I m Doing as Well as I Can: Moeling People as Rational Finite Automata Joseph Y Halpern Rafael Pass Lior Seeman Computer Science Dept Cornell University Ithaca, NY E-mail: halpern rafael lseeman@cscornelleu

More information

Tractability results for weighted Banach spaces of smooth functions

Tractability results for weighted Banach spaces of smooth functions Tractability results for weighte Banach spaces of smooth functions Markus Weimar Mathematisches Institut, Universität Jena Ernst-Abbe-Platz 2, 07740 Jena, Germany email: markus.weimar@uni-jena.e March

More information

Analyzing Tensor Power Method Dynamics in Overcomplete Regime

Analyzing Tensor Power Method Dynamics in Overcomplete Regime Journal of Machine Learning Research 18 (2017) 1-40 Submitte 9/15; Revise 11/16; Publishe 4/17 Analyzing Tensor Power Metho Dynamics in Overcomplete Regime Animashree Ananumar Department of Electrical

More information

Minimum-time constrained velocity planning

Minimum-time constrained velocity planning 7th Meiterranean Conference on Control & Automation Makeonia Palace, Thessaloniki, Greece June 4-6, 9 Minimum-time constraine velocity planning Gabriele Lini, Luca Consolini, Aurelio Piazzi Università

More information

Leaving Randomness to Nature: d-dimensional Product Codes through the lens of Generalized-LDPC codes

Leaving Randomness to Nature: d-dimensional Product Codes through the lens of Generalized-LDPC codes Leaving Ranomness to Nature: -Dimensional Prouct Coes through the lens of Generalize-LDPC coes Tavor Baharav, Kannan Ramchanran Dept. of Electrical Engineering an Computer Sciences, U.C. Berkeley {tavorb,

More information

LATTICE-BASED D-OPTIMUM DESIGN FOR FOURIER REGRESSION

LATTICE-BASED D-OPTIMUM DESIGN FOR FOURIER REGRESSION The Annals of Statistics 1997, Vol. 25, No. 6, 2313 2327 LATTICE-BASED D-OPTIMUM DESIGN FOR FOURIER REGRESSION By Eva Riccomagno, 1 Rainer Schwabe 2 an Henry P. Wynn 1 University of Warwick, Technische

More information

Perfect Matchings in Õ(n1.5 ) Time in Regular Bipartite Graphs

Perfect Matchings in Õ(n1.5 ) Time in Regular Bipartite Graphs Perfect Matchings in Õ(n1.5 ) Time in Regular Bipartite Graphs Ashish Goel Michael Kapralov Sanjeev Khanna Abstract We consier the well-stuie problem of fining a perfect matching in -regular bipartite

More information

Optimal Control of Spatially Distributed Systems

Optimal Control of Spatially Distributed Systems Optimal Control of Spatially Distribute Systems Naer Motee an Ali Jababaie Abstract In this paper, we stuy the structural properties of optimal control of spatially istribute systems. Such systems consist

More information

Transmission Line Matrix (TLM) network analogues of reversible trapping processes Part B: scaling and consistency

Transmission Line Matrix (TLM) network analogues of reversible trapping processes Part B: scaling and consistency Transmission Line Matrix (TLM network analogues of reversible trapping processes Part B: scaling an consistency Donar e Cogan * ANC Eucation, 308-310.A. De Mel Mawatha, Colombo 3, Sri Lanka * onarecogan@gmail.com

More information

Quantum mechanical approaches to the virial

Quantum mechanical approaches to the virial Quantum mechanical approaches to the virial S.LeBohec Department of Physics an Astronomy, University of Utah, Salt Lae City, UT 84112, USA Date: June 30 th 2015 In this note, we approach the virial from

More information

The Impact of Collusion on the Price of Anarchy in Nonatomic and Discrete Network Games

The Impact of Collusion on the Price of Anarchy in Nonatomic and Discrete Network Games The Impact of Collusion on the Price of Anarchy in Nonatomic an Discrete Network Games Tobias Harks Institute of Mathematics, Technical University Berlin, Germany harks@math.tu-berlin.e Abstract. Hayrapetyan,

More information

1 dx. where is a large constant, i.e., 1, (7.6) and Px is of the order of unity. Indeed, if px is given by (7.5), the inequality (7.

1 dx. where is a large constant, i.e., 1, (7.6) and Px is of the order of unity. Indeed, if px is given by (7.5), the inequality (7. Lectures Nine an Ten The WKB Approximation The WKB metho is a powerful tool to obtain solutions for many physical problems It is generally applicable to problems of wave propagation in which the frequency

More information

Hyperbolic Systems of Equations Posed on Erroneous Curved Domains

Hyperbolic Systems of Equations Posed on Erroneous Curved Domains Hyperbolic Systems of Equations Pose on Erroneous Curve Domains Jan Norström a, Samira Nikkar b a Department of Mathematics, Computational Mathematics, Linköping University, SE-58 83 Linköping, Sween (

More information

Topic Modeling: Beyond Bag-of-Words

Topic Modeling: Beyond Bag-of-Words Hanna M. Wallach Cavenish Laboratory, University of Cambrige, Cambrige CB3 0HE, UK hmw26@cam.ac.u Abstract Some moels of textual corpora employ text generation methos involving n-gram statistics, while

More information

A Polynomial Rooting Approach for Analysis of Competition among Secondary Users in Cognitive Radio Networks

A Polynomial Rooting Approach for Analysis of Competition among Secondary Users in Cognitive Radio Networks A Polynomial Rooting Approach for Analysis of Competition among Seconary Users in Cognitive Raio Networs Xiaohua Li, Chengyu Xiong an Wenel Caeau Department of Electrical an Computer Engineering State

More information

Situation awareness of power system based on static voltage security region

Situation awareness of power system based on static voltage security region The 6th International Conference on Renewable Power Generation (RPG) 19 20 October 2017 Situation awareness of power system base on static voltage security region Fei Xiao, Zi-Qing Jiang, Qian Ai, Ran

More information

On the Aloha throughput-fairness tradeoff

On the Aloha throughput-fairness tradeoff On the Aloha throughput-fairness traeoff 1 Nan Xie, Member, IEEE, an Steven Weber, Senior Member, IEEE Abstract arxiv:1605.01557v1 [cs.it] 5 May 2016 A well-known inner boun of the stability region of

More information

On the number of isolated eigenvalues of a pair of particles in a quantum wire

On the number of isolated eigenvalues of a pair of particles in a quantum wire On the number of isolate eigenvalues of a pair of particles in a quantum wire arxiv:1812.11804v1 [math-ph] 31 Dec 2018 Joachim Kerner 1 Department of Mathematics an Computer Science FernUniversität in

More information

Convergence of Random Walks

Convergence of Random Walks Chapter 16 Convergence of Ranom Walks This lecture examines the convergence of ranom walks to the Wiener process. This is very important both physically an statistically, an illustrates the utility of

More information

All s Well That Ends Well: Supplementary Proofs

All s Well That Ends Well: Supplementary Proofs All s Well That Ens Well: Guarantee Resolution of Simultaneous Rigi Boy Impact 1:1 All s Well That Ens Well: Supplementary Proofs This ocument complements the paper All s Well That Ens Well: Guarantee

More information

Lie symmetry and Mei conservation law of continuum system

Lie symmetry and Mei conservation law of continuum system Chin. Phys. B Vol. 20, No. 2 20 020 Lie symmetry an Mei conservation law of continuum system Shi Shen-Yang an Fu Jing-Li Department of Physics, Zhejiang Sci-Tech University, Hangzhou 3008, China Receive

More information

arxiv: v1 [math-ph] 5 May 2014

arxiv: v1 [math-ph] 5 May 2014 DIFFERENTIAL-ALGEBRAIC SOLUTIONS OF THE HEAT EQUATION VICTOR M. BUCHSTABER, ELENA YU. NETAY arxiv:1405.0926v1 [math-ph] 5 May 2014 Abstract. In this work we introuce the notion of ifferential-algebraic

More information