Convergence analysis of ant colony learning

Size: px
Start display at page:

Download "Convergence analysis of ant colony learning"

Transcription

1 Delft University of Technology Delft Center for Systems and Control Technical report Convergence analysis of ant colony learning J van Ast R Babška and B De Schtter If yo want to cite this report please se the following reference instead: J van Ast R Babška and B De Schtter Convergence analysis of ant colony learning Proceedings of the 18th IFAC World Congress Milan Italy pp Ag Sept 2011 Delft Center for Systems and Control Delft University of Technology Mekelweg CD Delft The etherlands phone: (secretary) fax: URL: This report can also be downloaded viahttp://pbdeschtterinfo/abs/11_012html

2 Convergence Analysis of Ant Colony Learning Jelmer van Ast Robert Babška Bart De Schtter Delft Center for Systems and Control of the Delft University of Technology Mekelweg CD Delft The etherlands ( jmvanast@tdelftnl rbabska@tdelftnl bdeschtter@tdelftnl) Abstract: In this paper we stdy the convergence of the pheromone levels of Ant Colony Learning (ACL) in the setting of discrete state spaces and noiseless state transitions ACL is a mlti-agent approach for learning control policies that combines some of the principles fond in ant colony optimization and reinforcement learning Convergence of the pheromone levels in expected vale is a necessary reqirement for the convergence of the learning process to optimal control policies In this paper we derive pper and lower bonds for the pheromone levels and relate those to the learning parameters and the nmber of ants sed in the algorithm We also derive pper and lower bonds on the expected vale of the pheromone levels Keywords: Evoltionary algorithms in control and identification; Reinforcement learning control 1 ITRODUCTIO Ant Colony Learning (ACL) is an algorithmic framework for the atomatic learning of control policies for non-linear systems with continos or discrete state spaces (van Ast et al ) The framework is based on the Ant Colony Optimization (ACO) class of algorithms which is inspired by the foraging behavior of ants (Dorigo and Blm 2005) In ACL a collection of agents called ants jointly interact with the system at hand in order to find an optimal control policy ie an optimal mapping between states and actions Throgh the stigmergic interaction by means of pheromones the ants are gided by each other s experience towards better control policies In discrete time the control policy will lead to a seqence of state-action pairs starting in a given initial state and terminating with the action that brings the state of the system to the desired vale A seqence of state-action pairs is called a soltion A predefined cost fnction evalates soltions and a soltion is then called optimal if it has the lowest cost compared to the cost of all possible soltion trajectories from the initial state to the goal state A policy is called optimal if for all states in the state space the soltions are optimal In this paper we present a theoretical stdy on the convergence of the pheromone levels of ACL This is a step towards a convergence proof of the algorithm to optimal control policies We derive pper and lower bonds on the pheromone levels as well as on the expected vale on the pheromone levels Frthermore we relate those bonds to the learning parameters and the nmber of ants sed in the algorithm Convergence analysis of ACL is different from the convergence analysis of the ACO gbτmin class of ACO algorithms (Stützle and Dorigo 2002) The convergence proof of Stützle and Dorigo (2002) mainly relies on the global-best pdate rle with which the pheromones are only pdated if they belong to the best soltion fond so far With ACL all soltions fond in a trial are sed to pdate the pheromone levels This reqires a different type of convergence analysis This paper is strctred as follows In Section 2 the ACL framework is presented Section 3 presents a closed expression for the total pheromone pdate that takes place dring a trial of the algorithm In Section 4 pper and lower bonds are derived for the pheromone levels Based on these reslts the pper and lower bonds on the expected vale of the pheromone levels is derived in Section 5 The paper concldes with Section 6 2 AT COLOY LEARIG 21 The Optimal Policy Learning Problem Assme that we have a nonlinear dynamic system characterized by a discrete-valed state vector q = [q 1 q 2 q n T Q where Q has a finite nmber of elements Also assme that the system can be controlled by an inpt U that can only take a finite nmber of vales and that the state can be measred at discrete time steps with a sampling timet s witht the discrete time index The sampled system is denoted as: q(t+1) p(q(t)(t)) with p a probability distribtion fnction over the state-action space The nonlinear mapping fnction h is called the control policy: (t) = h(q(t)) The optimal control problem we consider is to control the system from any given initial state q(0) = q 0 to a desired goal state q(t) = q g in at most t T steps and in an optimal way where optimality is defined by minimizing a certain cost fnction: J(s) = J( qũ) (1) with s a soltion and q = q(1)q(t) and ũ = (0)(T 1) respectively the seqences of states and actions in that soltion The problem is to find a control policy that when applied to the system in q 0 reslts in a seqence of state-action pairs ((0)q(1)) ((1)q(2)) ((T 1)q(T)) that minimizes this cost fnction In or case we aim at finding control policies for non-linear systems which in general cannot be derived analytically from the system description and the cost fnction

3 The cost fnction mst satisfy for any soltion s: 0 < τ 0 ρ J max J (s) J min (2) with J max = maxj(s) and J min = minj(s) respectively the s s largest and smallest possible vale of the cost fnction The parameter τ 0 represents the initial vale of the pheromone levels and ρ denoted the global pheromone decay rate These parameters will be frther explained in Section 22 and Section 25 respectively ote that this reqirement is not at all restrictive since adding a constant τ0 ρ to the cost fnction renders this reqirement satisfied withot changing the optimal soltion Also note that we can trivially extend the optimal control problem that we consider here to inclde a set of goal states denoted byq g In that case we can inclde a single virtal goal state to which all states q g Q g lead with probability one and a cost of zero 22 Otline of the ACL Algorithm The general otline of the ACL algorithm is as follows Initially all M ants are distribted randomly over the state space of the system and the pheromone levels τ q associated with each state-action pair (q) are set to an initial vale τ q (0) = τ 0 In what is called a trial all ants make interaction steps with the system First they decide based on the pheromone levels which action to perform after which they apply this action to their own copy of the system They store the state-action pair to their personal record called their partial soltion s p and the pheromone level at that state-action pair τ q is annealed according to (4) throgh the local pheromone pdate Each copy of the system responds to the inpt by changing its state after which the ants repeat the process by choosing a new action ntil they reach the goal state q g which terminates the trial After all ants have terminated their trial or after a predefined nmber of trials all partial soltions are added to the mltiset S trial This set is sed in the global pheromone pdate step where all soltions in the set are evalated over the cost fnction and the state-action pairs contained in the soltions receive a pheromone pdate accordingly From the otpt of the algorithm which are the pheromone vales the control policy can be derived In the following we explicitly distingish between the steps in the inner loop and the steps in the oter loop of the algorithm In the inner loop the iterations are indexed by t while in the oter loop the iterations are indexed by k In order to nderstand the timing of the pheromone pdates nambigosly the pheromone variables in the inner loop receive the sperscript local : τq local Before starting the inner loop the crrent pheromone levels are copied to the local pheromone levels: τq local (0) = τ q (k) for all stateaction pairs The first step in the inner loop is the selection of the action 23 Action Selection In the action selection step each ant c determines which action to apply to the system in a given state q c There are varios possible forms for the action selection bt the analysis in this paper is based on the ǫ-greedy action selection rle Here the amont of exploration is kept constant de to the inclsion of an explicit exploration probability ǫ: ( ) arg max τq local c = (t) cl with probability 1 ǫ l U qc random(u qc ) with probability ǫ whererandom(u qc ) denotes the random selection of an action from the action setu qc in stateq c sing a niform distribtion 24 Local Pheromone Update After every step each antcperforms a local pheromone pdate for the(q c c )-pair jst visited: τ local q c c (t+1) = (1 γ)τ local q c c (t)+γτ 0 (4) with γ [01) the local pheromone decay rate The prpose of the local pheromone pdate is to stimlate exploration of the state-action space by making it less attractive for an ant to choose the same action in a certain state as its predecessor When all ants have reached the goal or when the inner loop has timed-ot the algorithm contines with the global pheromone step in the oter loop 25 Global Pheromone Update After completion of the trial (which let s assme happens when t = T ) the pheromone levels are pdated according to the following global pheromone pdate step: τ q (k +1) =(1 ρ)τ local q (T)+ρ s S trial (k): (q) s J (s) (q) : s S trial (k) : (q) s (5) with S trial the mltiset of all candidate soltions fond in the trial and ρ (01 the global pheromone decay rate ote that elitism in the global pheromone pdate is not possible since the best soltion wold then always be the soltion starting jst prior to the goal state with the action taking the system to the goal state immediately Since we aim at learning optimal control policies from any initial state we mst also inclde every soltion fond in the pdate The pheromone deposit is eqal to J (s) = J ( qũ) the inverse of the cost fnction over the seqence of discrete state-action pairs in s ote that minimizing the cost corresponds to maximizing the pheromone levels corresponding to the optimal soltion After the global pheromone pdate the algorithm contines fork+1 at the start of the oter loop ntil the maximal nmber of trials have taken place (ie when k = K) 26 Control Policy The control policy can be extracted from the pheromone levels as follows: = h(q) = argmax(τ ql ) (6) l U q in which ties are broken randomly This eqation means that the control policy assigns the action to a given state that maximizes the associated pheromone levels 3 TOTAL PHEROMOE UPDATE Let s now derive an expression for the total effect of the pheromone pdates dring one trial At the start of a trial k the crrent pheromone levels are copied to the local pheromone levels τq local (0) = τ q (k) for all (q)-pairs In the first (3)

4 trial when τ local q (0) = τ q (0) = τ 0 for all (q)-pairs the local pheromone pdate (4) has no effect Only after some pheromone variables have received a pheromone deposit from the global pheromone pdate (5) these pheromone levels can become larger than τ 0 Let s assme that the trial is ended when t = T and that M q (k) ants have visited the (q)-pair in the given trial It is then easy to verify that the pheromone levels have then been pdated according to: τ local q (T) = (1 γ) Mq(k) (τ local q (0) τ 0 )+τ 0 = (1 γ) Mq(k) (τ q (k) τ 0 )+τ 0 (7) The global pheromone pdate is applied toτq local (T) at the end of a trial if (q) is an element of the soltion of one or more ants We can aggregate (7) and (5) to get an expression for the total pheromone pdate at the end of a trial: τ q (k +1) = (1 ρ) {(1 γ) Mq(k) (τ q (k) τ 0 )+τ 0 +ρ J (s) ifm q (k) > 0 (8) s S trial (k): (q) s τ q (k +1) = τ q (k) otherwise (9) withm q (k) the nmber of ants that have visited(q) dring trialk From (8) - (9) we can see that: (1) If a(q) is not visited by any of the ants in a given trial the pheromone level τ q will not be pdated in that trial (2) If a (q) is visited by one or more ants in a given trial τ q will be pdated in that trial and may increase or decrease in vale depending on the pheromone deposits of the ants and the vales ofγ and ρ By introdcingκas the conter of the nmber of trials in which the pheromone level of a state-action pair (q) receives a global pheromone pdate we can write (8) - (9) as: { τq pd (κ+1) = (1 ρ) (1 γ) Mq(κ) (τq pd (κ) τ 0 )+τ 0 +ρ J (s) (10) s S trial (κ); (q) s in which the sperscript pd is sed to avoid confsion between pheromone pdates indexed withk and κ 4 BOUDS O THE PHEROMOE LEVELS We will proceed or analysis by deriving lower and pper bonds for the pheromone levels Starting with the lower bond we prove the following proposition: Proposition 41 The lower bond on the pheromone levels is τ 0 Proof By indction we can show that a pheromone level can never become smaller than τ 0 when sing the pheromone pdate expression from (10): τq pd (0) =τ 0 { q (κ+1) =(1 ρ) (1 γ) Mq(κ) (τq pd (κ) τ 0 )+τ 0 +ρ J (s) s S trial (κ): (q) s (1 ρ) (1 γ)mq(κ) (τq pd (κ) τ 0 ) +τ 0 {{ by indction +ρm q (κ)jmax (1 ρ)τ 0 +ρjmax (1 ρ)τ 0 +τ 0 τ 0 in which we have sed the condition from (2) In order to derive the pper bond on the pheromone levels we se the following lemma: Lemma 42 Consider a first-order scalar difference eqation: y(k +1) = ay(k)+b witha [01)b R and an initial pointy(0) Ify(0) b 1 a then y(k) is non-decreasing with the final vale: lim y(k) = b k 1 a Proof The final vale follows trivially from the final vale theorem of the z-transform (Åström and Wittenmark 1990) We can prove that the seqence y(k) is non-decreasing as follows Using: y(k) = a k y(0)+(a k ++a+1)b y(k +1) = a k+1 y(0)+(a k +a k ++a+1)b we can derive the following expression for the difference between two consective time steps of this difference eqation: y(k +1) y(k) = (a k+1 a k )y(0)+a k b = a k (b+ay(0) y(0)) Since a [01) we have a k 0 So in order to make y(k) non-decreasing we need that: (b +ay(0) y(0)) 0 which is eqal to reqiring that y(0) b 1 a Using this lemma we can prove the following proposition: Proposition 43 For any (q )-pair the pheromone levels are bonded from above: τq pd (κ) β pper +ρmj min 1 α pper with α pper = (1 ρ)(1 γ) β pper = (1 ρ) [ (1 γ) M ( τ 0 )+τ 0 and with J min = min s J(s) the smallest possible vale of the cost fnction ote that for this theoretical analysis it is not necessary to know the vale ofj min Proof Considering a given (q )-pair let s rewrite (10) as follows: τq pd (κ+1) =(1 ρ)(1 γ) Mq(κ) τq pd (κ) +(1 ρ) [(1 γ) Mq(κ) ( τ 0 )+τ 0 +ρ J (s) s S trial (κ): (q) s

5 q q = = * q This eqation is of the form: q (κ+1) = α(κ) q (κ)+β(κ)+δ(κ) Let s now introdce θ q (κ) which satisfies the following difference eqation: θ q (κ+1) = α pper θ q (κ)+β pper +δ pper in which α pper β pper and δ pper are pper bonds of α(κ) β(κ) and δ(κ) respectively: α(κ) α pper = (1 ρ)(1 γ) β(κ) β pper = (1 ρ) [ (1 γ) M ( τ 0 )+τ 0 δ(κ) δ pper = ρmj min The pper bond for α(κ) is obtained by taking M q (κ) = 1 for allκ while the pper bonds forβ(κ) andδ(κ) are obtained for M q (κ) = M for all κ We take the same initial vales for q and θ q so θ q (0) = τ 0 Using Lemma 42 we can now show that θ q (κ) is a non-decreasing fnction of κ We immediately see that α pper 0 and we mst show that: β pper +δ pper 1 α pper θ q (0) = τ 0 This condition is satisfied if (1 ρ) [ (1 γ) M ( τ 0 )+τ 0 +ρmj min τ 0 1 (1 ρ)(1 γ) or (1 ρ) [ (1 γ) M ( τ 0 )+τ 0 +ρmj min τ 0 +(1 ρ)(1 γ)τ 0 0 Recalling the ranges for ρ viz (01 and for γ viz [01) we can show that the latter ineqality holds as follows: (1 ρ) (1 γ) M ( τ 0 )+τ 0 {{ 0 +(1 ρ)(1 γ)τ 0 +ρmj min τ 0 τ 0 Moreover by indction it follows that q (κ) θ q (κ): τq pd (0) = θ q (0) τq pd (κ+1) = α(κ)τq pd (κ)+β(κ)+δ(κ) α pper θ q (κ)+β pper +δ pper = θ q (κ+1) We can se Lemma 42 to get lim θ q(κ) = β pper +δ pper κ 1 α pper Since we have shown that θ q (κ) is non-decreasing we have now arrived at the conclsion that: τq pd (κ) θ q (κ) β pper +ρmj min 1 α pper ote that this pper bond is only tight form = 1 5 BOUDS O THE EXPECTED VALUE OF THE PHEROMOE LEVELS The bonds derived in the previos section are sefl as they give s information abot the range in which the pheromone levels reside Moreover the bonds give the relations between the global and local pheromone decay ratesρandγ the nmber of ants M and the initial vale of the pheromone levels τ 0 However two shortcomings of these bonds prevent s from drawing conclsions abot convergence of the algorithm: (1) Or pper bond is not tight form > 1 (2) Or pper bond is the same for all pheromone levels associated with all (q)-pairs Especially the second isse prevents s from analyzing whether and when the pheromone levels associated with the optimal state-action pairs become larger than the pheromone levels associated with sboptimal state-action pairs In this section or aim is to find expressions for the evoltion of individal pheromone levels However two factors in the algorithm in particlar complicate sch an analysis: (1) The total pheromone pdate from (10) contains M q (κ) the nmber of ants that have visited the(q )-pair in trial κ which is dependent on many nknown factors sch as the other pheromone levels and the exploration process (2) The total pheromone pdate of τq pd (κ) depends on the pheromone deposits J (s) from all soltions fond in trial κ The pdate of a particlar pheromone level ths depends on all state-action pairs prior to (q) and all state-action pairs following (q ) which also depends on many nknown factors sch as the other pheromone levels and the exploration process There are too many ncertainties involved in the algorithm in order to find a closed expression of the final pheromone levels This is inherent to learning algorithms that contain random variables sch as exploration and in which the credit assignment sch as the distribtion of rewards in reinforcement learning (Stton and Barto 1998) or the distribtion of the pheromone deposits in ACL depends on a series of decision variables In the following we eliminate the ncertainty arising from exploration by looking at the expected vale of the pheromone levels We eliminate the ncertainty arising from the stateaction pairs prior to (q ) and the state-action pairs following (q) by considering the sitation depicted in Figre 1 Fig 1 From a state q the action will take the system to another state q from which there are possible actions The action in this state will bring the system to the goal state optimally The action ū does so sb-optimally bt still with a lower cost than the other actions Here we regard the M q (κ) ants to start in state q and all choose the action All ants are taken toq after which they can choose between possible actions Withot loss of generality we assme that the action 1 = then takes an ant to the goal state immediately and with the lowest cost compared to the other available actions It is ths considered to be the optimal action The action 2 = ū is the second-best action It takes an ant to the goal state with a higher cost compared to 1 bt with a lower cost compared to all the other actions Let s also g

6 assme that the inverse of the cost reslting from all possible actions is ordered as follows: J minq = J (q 1 = ) > J (q 2 = ū) = J secondq > > J (q ) = J maxq Here J q is a short-hand expression for the cost that reslts from an action chosen in q and possible other state-action pairs following q The sbscripts min second etc then denote respectively the lowest or next to lowest possible cost to reslting in this manner By definition J (q 1 = ) = J minq and J (q ) = J maxq We will analyze the behavior of the expected vale ofτq pd (κ) Since the cost reslting from the state-action pair (q ) is independent from the action chosen inq any constant cost for J (q) will do for or analysis Withot loss of generality and for the sake of simplicity we takej (q) = 0 althogh formally this is impossible since J (q) Jmax > 0 for any (q)-pair ote that it ths also holds that J J minq minq = J secondq = J secondq etc We assme in this section that the optimal action from q is crrently also associated with the highest pheromone level and is ths also designated to be optimal Dring learning this does not have to be the case since other actions may be associated with higher pheromone levels and is ths not yet known to be the optimal action Compting the expected vale of τq pd (κ) involves taking the expected vale of the nmber of ants M q (κ) in the exponent which severely complicates deriving an analytical expression for the expected vale of τq pd (κ) We mst ths shift or aim by choosing to derive pper and lower bonds on the expected vale of τq pd (κ) instead Proposition 51 For any state-action pair (q ) the expected vale of the pheromone levels is bonded from above: with q (κ) β pper +ρmjexpq (11) 1 α pper E[ α pper = (1 ρ)(1 γ) β pper = (1 ρ) [ (1 γ) M ( τ 0 )+τ 0 Jexpq = (1 ǫ)j minq +ǫj avgq and J avgq the inverse of the average cost expected to reslt when moving fromqto the goal Proof When choosing the pheromone level q is increased the most when: q (κ+1) =(1 ρ)(1 γ) α pper q (κ)+ρmj minq +(1 ρ) [ (1 γ) M ( τ 0 )+τ 0 β pper When choosing ū the pheromone level q is increased the most when: q (κ+1) =(1 ρ)(1 γ) α pper q (κ)+ρmj secondq +(1 ρ) [ (1 γ) M ( τ 0 )+τ 0 β pper The largest increase of τq pd for the other actions follows in a similar manner The probability that the optimal action is chosen is: p( ) = 1 ǫ+ ǫ ( ) = 1 ǫ namely the probability of not exploring pls the probability of selecting that action while exploring (which is niformly distribted) The probability of choosing any of the other actions is p( i ) = ǫ for i The expected vale of τq pd when increasing the most can now be compted as follows: E pper [τq pd (κ+1) [ ( ) = 1 ǫ + (α pper q (κ)+β pper +ρmj minq ) (α pper q (κ)+β pper +ρmj secondq ) (α pper + q (κ)+β pper +ρmj [ ( ) =α pper τq pd (κ)+β pper + 1 ǫ + ρmj secondq ++ {{ terms =α pper τq pd (κ)+β pper +[1 ǫρmj + maxq) ρmj maxq minq ρm(j minq +J secondq ++J maxq) ρmj minq =α pper τq pd (κ)+β pper +ρmjexpq where Jexpq = (1 ǫ)j minq + ǫj avgq is the expected pheromone deposit on τq pd Since E pper [τq pd (κ) E[τq pd (κ) for all κ the following difference eqation describes the evoltion of the pper bond of the expected vale of τq pd (κ): E pper [τq pd (κ+1) =α pper E pper [τq pd (κ)+β pper Since we can show that +ρmj expq E pper [τq pd (0) = τ 0 β pper +ρmjexpq 1 α pper we know from Lemma 42 that E pper [τq pd (κ) is nondecreasing and that: E[τq pd (κ) E pper [τq pd (κ) β pper +ρmjexpq 1 α pper ote that J avgq is generally not known althogh it might be possible to estimate it Similar to the pper bond we can derive a lower bond on the expected vale of pheromone levels Proposition 52 For any state-action pair (q ) in the limit for κ the expected vale of the pheromone levels is bonded from below: with lim κ E[τpd q (κ) β lower +ρj expq 1 α lower (12) α lower = (1 ρ)(1 γ) M β lower = (1 ρ)[(1 γ)( τ 0 )+τ 0 Jexpq = (1 ǫ)j minq +ǫj avgq

7 Proof When choosing q is increased the least when: q (κ+1) =(1 ρ)(1 γ) M q (κ) {{ α lower +(1 ρ)[(1 γ)( τ 0 )+τ 0 {{ β lower +ρjminq since for allκα(κ) = (1 ρ)(1 γ) Mq(κ) is the smallest for M q (κ) = M andβ(κ) = (1 ρ) [ (1 γ) Mq(κ) ( τ 0 )+τ 0 is the smallest for M q (κ) = 1 The rest of the proof follows easily along the same lines as the proof of Proposition 51 The expected vale of the pheromone levels lies between these bonds E lower [τq pd (κ) E[τq pd (κ) E pper [τq pd (κ) for all κ In the limit for κ the expected vale of the pheromone levels lies between the derived pper and lower bonds: β lower +ρjexpq lim 1 α lower κ E[τpd q (κ) β pper +ρmjexpq 1 α pper The expressions for α(κ) α pper α lower β(κ) β pper and β lower are presented in Table 1 Table 1 The expressions for α(κ) and β(κ) and their bonds ICIS project Self-Organizing Moving Agents (grant no BSIK03024) REFERECES Åström KJ and Wittenmark B (1990) Compter Controlled Systems Theory and Design Prentice-Hall Englewood Cliffs J Dorigo M and Blm C (2005) Ant colony optimization theory: a srvey Theoretical Compter Science 344(2-3) Stützle T and Dorigo M (2002) A short convergence proof for a class of ant colony optimization algorithms IEEE Transactions on Evoltionary Comptation 6(4) Stton RS and Barto AG (1998) Reinforcement Learning: An Introdction MIT Press Cambridge MA van Ast JM Babška R and De Schtter B (2009) ovel ant colony optimization approach to optimal control International Jornal of Intelligent Compting and Cybernetics 2(3) van Ast JM Babška R and De Schtter B (2010) Ant colony learning algorithm for optimal control In R Babška and FCA Groen (eds) Interactive Collaborative Information Systems volme 281 of Stdies in Comptational Intelligence Springer Berlin / Heidelberg α(κ) and its pper and lower bond α(κ) = (1 ρ)(1 γ) Mq(κ) α pper = (1 ρ)(1 γ) α lower = (1 ρ)(1 γ) M β(κ) and its pper and lower bond β(κ) = (1 ρ) [ (1 γ) Mq(κ) ( τ 0 )+τ 0 β pper = (1 ρ) [ (1 γ) M ( τ 0 )+τ 0 β lower = (1 ρ)[(1 γ)( τ 0 )+τ 0 = τ 0 γ(1 ρ) 6 COCLUSIOS In this paper we have analyzed the evoltion of the pheromone levels in Ant Colony Learning In particlar we have stdied the sitation in which from a state q an action will take the system to another state q from which there are possible actions There the pheromone level τ q was dependent on the action chosen in the state q frther downstream towards the goal The goal wold be reached after choosing the optimal action in q Becase of the npredictability of the nmber of ants that visit a particlar state we have derived pper and lower bonds on the pheromone levels followed by pper and lower bonds on the expected vale of the pheromone levels In ftre research we will extend the convergence analysis towards a convergence proof of the algorithm to optimal control policies The convergence analysis of the pheromone levels as presented in this paper is an essential step for this Ftre research mst also focs on the applicability of the convergence reslts to real-world control problems where the strctre of the control problem mst be taken into accont ACKOWLEDGMET This research is financially spported by Senter Ministry of Economic Affairs of the etherlands within the BSIK-

A Model-Free Adaptive Control of Pulsed GTAW

A Model-Free Adaptive Control of Pulsed GTAW A Model-Free Adaptive Control of Plsed GTAW F.L. Lv 1, S.B. Chen 1, and S.W. Dai 1 Institte of Welding Technology, Shanghai Jiao Tong University, Shanghai 00030, P.R. China Department of Atomatic Control,

More information

UNCERTAINTY FOCUSED STRENGTH ANALYSIS MODEL

UNCERTAINTY FOCUSED STRENGTH ANALYSIS MODEL 8th International DAAAM Baltic Conference "INDUSTRIAL ENGINEERING - 19-1 April 01, Tallinn, Estonia UNCERTAINTY FOCUSED STRENGTH ANALYSIS MODEL Põdra, P. & Laaneots, R. Abstract: Strength analysis is a

More information

Lecture Notes On THEORY OF COMPUTATION MODULE - 2 UNIT - 2

Lecture Notes On THEORY OF COMPUTATION MODULE - 2 UNIT - 2 BIJU PATNAIK UNIVERSITY OF TECHNOLOGY, ODISHA Lectre Notes On THEORY OF COMPUTATION MODULE - 2 UNIT - 2 Prepared by, Dr. Sbhend Kmar Rath, BPUT, Odisha. Tring Machine- Miscellany UNIT 2 TURING MACHINE

More information

Stability of Model Predictive Control using Markov Chain Monte Carlo Optimisation

Stability of Model Predictive Control using Markov Chain Monte Carlo Optimisation Stability of Model Predictive Control sing Markov Chain Monte Carlo Optimisation Elilini Siva, Pal Golart, Jan Maciejowski and Nikolas Kantas Abstract We apply stochastic Lyapnov theory to perform stability

More information

CHANNEL SELECTION WITH RAYLEIGH FADING: A MULTI-ARMED BANDIT FRAMEWORK. Wassim Jouini and Christophe Moy

CHANNEL SELECTION WITH RAYLEIGH FADING: A MULTI-ARMED BANDIT FRAMEWORK. Wassim Jouini and Christophe Moy CHANNEL SELECTION WITH RAYLEIGH FADING: A MULTI-ARMED BANDIT FRAMEWORK Wassim Joini and Christophe Moy SUPELEC, IETR, SCEE, Avene de la Bolaie, CS 47601, 5576 Cesson Sévigné, France. INSERM U96 - IFR140-

More information

The Linear Quadratic Regulator

The Linear Quadratic Regulator 10 The Linear Qadratic Reglator 10.1 Problem formlation This chapter concerns optimal control of dynamical systems. Most of this development concerns linear models with a particlarly simple notion of optimality.

More information

Optimal Control of a Heterogeneous Two Server System with Consideration for Power and Performance

Optimal Control of a Heterogeneous Two Server System with Consideration for Power and Performance Optimal Control of a Heterogeneos Two Server System with Consideration for Power and Performance by Jiazheng Li A thesis presented to the University of Waterloo in flfilment of the thesis reqirement for

More information

System identification of buildings equipped with closed-loop control devices

System identification of buildings equipped with closed-loop control devices System identification of bildings eqipped with closed-loop control devices Akira Mita a, Masako Kamibayashi b a Keio University, 3-14-1 Hiyoshi, Kohok-k, Yokohama 223-8522, Japan b East Japan Railway Company

More information

4.2 First-Order Logic

4.2 First-Order Logic 64 First-Order Logic and Type Theory The problem can be seen in the two qestionable rles In the existential introdction, the term a has not yet been introdced into the derivation and its se can therefore

More information

Universal Scheme for Optimal Search and Stop

Universal Scheme for Optimal Search and Stop Universal Scheme for Optimal Search and Stop Sirin Nitinawarat Qalcomm Technologies, Inc. 5775 Morehose Drive San Diego, CA 92121, USA Email: sirin.nitinawarat@gmail.com Vengopal V. Veeravalli Coordinated

More information

Sensitivity Analysis in Bayesian Networks: From Single to Multiple Parameters

Sensitivity Analysis in Bayesian Networks: From Single to Multiple Parameters Sensitivity Analysis in Bayesian Networks: From Single to Mltiple Parameters Hei Chan and Adnan Darwiche Compter Science Department University of California, Los Angeles Los Angeles, CA 90095 {hei,darwiche}@cs.cla.ed

More information

Sources of Non Stationarity in the Semivariogram

Sources of Non Stationarity in the Semivariogram Sorces of Non Stationarity in the Semivariogram Migel A. Cba and Oy Leangthong Traditional ncertainty characterization techniqes sch as Simple Kriging or Seqential Gassian Simlation rely on stationary

More information

QUANTILE ESTIMATION IN SUCCESSIVE SAMPLING

QUANTILE ESTIMATION IN SUCCESSIVE SAMPLING Jornal of the Korean Statistical Society 2007, 36: 4, pp 543 556 QUANTILE ESTIMATION IN SUCCESSIVE SAMPLING Hosila P. Singh 1, Ritesh Tailor 2, Sarjinder Singh 3 and Jong-Min Kim 4 Abstract In sccessive

More information

Assignment Fall 2014

Assignment Fall 2014 Assignment 5.086 Fall 04 De: Wednesday, 0 December at 5 PM. Upload yor soltion to corse website as a zip file YOURNAME_ASSIGNMENT_5 which incldes the script for each qestion as well as all Matlab fnctions

More information

Discussion of The Forward Search: Theory and Data Analysis by Anthony C. Atkinson, Marco Riani, and Andrea Ceroli

Discussion of The Forward Search: Theory and Data Analysis by Anthony C. Atkinson, Marco Riani, and Andrea Ceroli 1 Introdction Discssion of The Forward Search: Theory and Data Analysis by Anthony C. Atkinson, Marco Riani, and Andrea Ceroli Søren Johansen Department of Economics, University of Copenhagen and CREATES,

More information

E ect Of Quadrant Bow On Delta Undulator Phase Errors

E ect Of Quadrant Bow On Delta Undulator Phase Errors LCLS-TN-15-1 E ect Of Qadrant Bow On Delta Undlator Phase Errors Zachary Wolf SLAC Febrary 18, 015 Abstract The Delta ndlator qadrants are tned individally and are then assembled to make the tned ndlator.

More information

Imprecise Continuous-Time Markov Chains

Imprecise Continuous-Time Markov Chains Imprecise Continos-Time Markov Chains Thomas Krak *, Jasper De Bock, and Arno Siebes t.e.krak@.nl, a.p.j.m.siebes@.nl Utrecht University, Department of Information and Compting Sciences, Princetonplein

More information

Pulses on a Struck String

Pulses on a Struck String 8.03 at ESG Spplemental Notes Plses on a Strck String These notes investigate specific eamples of transverse motion on a stretched string in cases where the string is at some time ndisplaced, bt with a

More information

FOUNTAIN codes [3], [4] provide an efficient solution

FOUNTAIN codes [3], [4] provide an efficient solution Inactivation Decoding of LT and Raptor Codes: Analysis and Code Design Francisco Lázaro, Stdent Member, IEEE, Gianligi Liva, Senior Member, IEEE, Gerhard Bach, Fellow, IEEE arxiv:176.5814v1 [cs.it 19 Jn

More information

Discontinuous Fluctuation Distribution for Time-Dependent Problems

Discontinuous Fluctuation Distribution for Time-Dependent Problems Discontinos Flctation Distribtion for Time-Dependent Problems Matthew Hbbard School of Compting, University of Leeds, Leeds, LS2 9JT, UK meh@comp.leeds.ac.k Introdction For some years now, the flctation

More information

Move Blocking Strategies in Receding Horizon Control

Move Blocking Strategies in Receding Horizon Control Move Blocking Strategies in Receding Horizon Control Raphael Cagienard, Pascal Grieder, Eric C. Kerrigan and Manfred Morari Abstract In order to deal with the comptational brden of optimal control, it

More information

Research Article Permanence of a Discrete Predator-Prey Systems with Beddington-DeAngelis Functional Response and Feedback Controls

Research Article Permanence of a Discrete Predator-Prey Systems with Beddington-DeAngelis Functional Response and Feedback Controls Hindawi Pblishing Corporation Discrete Dynamics in Natre and Society Volme 2008 Article ID 149267 8 pages doi:101155/2008/149267 Research Article Permanence of a Discrete Predator-Prey Systems with Beddington-DeAngelis

More information

BLOOM S TAXONOMY. Following Bloom s Taxonomy to Assess Students

BLOOM S TAXONOMY. Following Bloom s Taxonomy to Assess Students BLOOM S TAXONOMY Topic Following Bloom s Taonomy to Assess Stdents Smmary A handot for stdents to eplain Bloom s taonomy that is sed for item writing and test constrction to test stdents to see if they

More information

arxiv: v1 [cs.sy] 22 Nov 2018

arxiv: v1 [cs.sy] 22 Nov 2018 AAS 18-253 ROBUST MYOPIC CONTROL FOR SYSTEMS WITH IMPERFECT OBSERVATIONS Dantong Ge, Melkior Ornik, and Ufk Topc arxiv:1811.09000v1 [cs.sy] 22 Nov 2018 INTRODUCTION Control of systems operating in nexplored

More information

Nonlinear parametric optimization using cylindrical algebraic decomposition

Nonlinear parametric optimization using cylindrical algebraic decomposition Proceedings of the 44th IEEE Conference on Decision and Control, and the Eropean Control Conference 2005 Seville, Spain, December 12-15, 2005 TC08.5 Nonlinear parametric optimization sing cylindrical algebraic

More information

Classify by number of ports and examine the possible structures that result. Using only one-port elements, no more than two elements can be assembled.

Classify by number of ports and examine the possible structures that result. Using only one-port elements, no more than two elements can be assembled. Jnction elements in network models. Classify by nmber of ports and examine the possible strctres that reslt. Using only one-port elements, no more than two elements can be assembled. Combining two two-ports

More information

Gradient Projection Anti-windup Scheme on Constrained Planar LTI Systems. Justin Teo and Jonathan P. How

Gradient Projection Anti-windup Scheme on Constrained Planar LTI Systems. Justin Teo and Jonathan P. How 1 Gradient Projection Anti-windp Scheme on Constrained Planar LTI Systems Jstin Teo and Jonathan P. How Technical Report ACL1 1 Aerospace Controls Laboratory Department of Aeronatics and Astronatics Massachsetts

More information

4 Exact laminar boundary layer solutions

4 Exact laminar boundary layer solutions 4 Eact laminar bondary layer soltions 4.1 Bondary layer on a flat plate (Blasis 1908 In Sec. 3, we derived the bondary layer eqations for 2D incompressible flow of constant viscosity past a weakly crved

More information

Theoretical and Experimental Implementation of DC Motor Nonlinear Controllers

Theoretical and Experimental Implementation of DC Motor Nonlinear Controllers Theoretical and Experimental Implementation of DC Motor Nonlinear Controllers D.R. Espinoza-Trejo and D.U. Campos-Delgado Facltad de Ingeniería, CIEP, UASLP, espinoza trejo dr@aslp.mx Facltad de Ciencias,

More information

Model Discrimination of Polynomial Systems via Stochastic Inputs

Model Discrimination of Polynomial Systems via Stochastic Inputs Model Discrimination of Polynomial Systems via Stochastic Inpts D. Georgiev and E. Klavins Abstract Systems biologists are often faced with competing models for a given experimental system. Unfortnately,

More information

Worst-case analysis of the LPT algorithm for single processor scheduling with time restrictions

Worst-case analysis of the LPT algorithm for single processor scheduling with time restrictions OR Spectrm 06 38:53 540 DOI 0.007/s009-06-043-5 REGULAR ARTICLE Worst-case analysis of the LPT algorithm for single processor schedling with time restrictions Oliver ran Fan Chng Ron Graham Received: Janary

More information

Data-Efficient Control Policy Search using Residual Dynamics Learning

Data-Efficient Control Policy Search using Residual Dynamics Learning Data-Efficient Control Policy Search sing Residal Dynamics Learning Matteo Saveriano 1, Ychao Yin 1, Pietro Falco 1 and Donghei Lee 1,2 Abstract In this work, we propose a model-based and data efficient

More information

Control Performance Monitoring of State-Dependent Nonlinear Processes

Control Performance Monitoring of State-Dependent Nonlinear Processes Control Performance Monitoring of State-Dependent Nonlinear Processes Lis F. Recalde*, Hong Ye Wind Energy and Control Centre, Department of Electronic and Electrical Engineering, University of Strathclyde,

More information

Bridging the Gap Between Multigrid, Hierarchical, and Receding-Horizon Control

Bridging the Gap Between Multigrid, Hierarchical, and Receding-Horizon Control Bridging the Gap Between Mltigrid, Hierarchical, and Receding-Horizon Control Victor M. Zavala Mathematics and Compter Science Division Argonne National Laboratory Argonne, IL 60439 USA (e-mail: vzavala@mcs.anl.gov).

More information

Formal Methods for Deriving Element Equations

Formal Methods for Deriving Element Equations Formal Methods for Deriving Element Eqations And the importance of Shape Fnctions Formal Methods In previos lectres we obtained a bar element s stiffness eqations sing the Direct Method to obtain eact

More information

Discussion Papers Department of Economics University of Copenhagen

Discussion Papers Department of Economics University of Copenhagen Discssion Papers Department of Economics University of Copenhagen No. 10-06 Discssion of The Forward Search: Theory and Data Analysis, by Anthony C. Atkinson, Marco Riani, and Andrea Ceroli Søren Johansen,

More information

Lecture Notes: Finite Element Analysis, J.E. Akin, Rice University

Lecture Notes: Finite Element Analysis, J.E. Akin, Rice University 9. TRUSS ANALYSIS... 1 9.1 PLANAR TRUSS... 1 9. SPACE TRUSS... 11 9.3 SUMMARY... 1 9.4 EXERCISES... 15 9. Trss analysis 9.1 Planar trss: The differential eqation for the eqilibrim of an elastic bar (above)

More information

Technical Note. ODiSI-B Sensor Strain Gage Factor Uncertainty

Technical Note. ODiSI-B Sensor Strain Gage Factor Uncertainty Technical Note EN-FY160 Revision November 30, 016 ODiSI-B Sensor Strain Gage Factor Uncertainty Abstract Lna has pdated or strain sensor calibration tool to spport NIST-traceable measrements, to compte

More information

Sareban: Evaluation of Three Common Algorithms for Structure Active Control

Sareban: Evaluation of Three Common Algorithms for Structure Active Control Engineering, Technology & Applied Science Research Vol. 7, No. 3, 2017, 1638-1646 1638 Evalation of Three Common Algorithms for Strctre Active Control Mohammad Sareban Department of Civil Engineering Shahrood

More information

Setting The K Value And Polarization Mode Of The Delta Undulator

Setting The K Value And Polarization Mode Of The Delta Undulator LCLS-TN-4- Setting The Vale And Polarization Mode Of The Delta Undlator Zachary Wolf, Heinz-Dieter Nhn SLAC September 4, 04 Abstract This note provides the details for setting the longitdinal positions

More information

Modelling by Differential Equations from Properties of Phenomenon to its Investigation

Modelling by Differential Equations from Properties of Phenomenon to its Investigation Modelling by Differential Eqations from Properties of Phenomenon to its Investigation V. Kleiza and O. Prvinis Kanas University of Technology, Lithania Abstract The Panevezys camps of Kanas University

More information

Study on the impulsive pressure of tank oscillating by force towards multiple degrees of freedom

Study on the impulsive pressure of tank oscillating by force towards multiple degrees of freedom EPJ Web of Conferences 80, 0034 (08) EFM 07 Stdy on the implsive pressre of tank oscillating by force towards mltiple degrees of freedom Shigeyki Hibi,* The ational Defense Academy, Department of Mechanical

More information

A New Approach to Direct Sequential Simulation that Accounts for the Proportional Effect: Direct Lognormal Simulation

A New Approach to Direct Sequential Simulation that Accounts for the Proportional Effect: Direct Lognormal Simulation A ew Approach to Direct eqential imlation that Acconts for the Proportional ffect: Direct ognormal imlation John Manchk, Oy eangthong and Clayton Detsch Department of Civil & nvironmental ngineering University

More information

Prandl established a universal velocity profile for flow parallel to the bed given by

Prandl established a universal velocity profile for flow parallel to the bed given by EM 0--00 (Part VI) (g) The nderlayers shold be at least three thicknesses of the W 50 stone, bt never less than 0.3 m (Ahrens 98b). The thickness can be calclated sing Eqation VI-5-9 with a coefficient

More information

Mathematical Models of Physiological Responses to Exercise

Mathematical Models of Physiological Responses to Exercise Mathematical Models of Physiological Responses to Exercise Somayeh Sojodi, Benjamin Recht, and John C. Doyle Abstract This paper is concerned with the identification of mathematical models for physiological

More information

Essentials of optimal control theory in ECON 4140

Essentials of optimal control theory in ECON 4140 Essentials of optimal control theory in ECON 4140 Things yo need to know (and a detail yo need not care abot). A few words abot dynamic optimization in general. Dynamic optimization can be thoght of as

More information

RESGen: Renewable Energy Scenario Generation Platform

RESGen: Renewable Energy Scenario Generation Platform 1 RESGen: Renewable Energy Scenario Generation Platform Emil B. Iversen, Pierre Pinson, Senior Member, IEEE, and Igor Ardin Abstract Space-time scenarios of renewable power generation are increasingly

More information

FRTN10 Exercise 12. Synthesis by Convex Optimization

FRTN10 Exercise 12. Synthesis by Convex Optimization FRTN Exercise 2. 2. We want to design a controller C for the stable SISO process P as shown in Figre 2. sing the Yola parametrization and convex optimization. To do this, the control loop mst first be

More information

Step-Size Bounds Analysis of the Generalized Multidelay Adaptive Filter

Step-Size Bounds Analysis of the Generalized Multidelay Adaptive Filter WCE 007 Jly - 4 007 London UK Step-Size onds Analysis of the Generalized Mltidelay Adaptive Filter Jnghsi Lee and Hs Chang Hang Abstract In this paper we analyze the bonds of the fixed common step-size

More information

Typed Kleene Algebra with Products and Iteration Theories

Typed Kleene Algebra with Products and Iteration Theories Typed Kleene Algebra with Prodcts and Iteration Theories Dexter Kozen and Konstantinos Mamoras Compter Science Department Cornell University Ithaca, NY 14853-7501, USA {kozen,mamoras}@cs.cornell.ed Abstract

More information

REINFORCEMENT LEARNING AND OPTIMAL ADAPTIVE CONTROL

REINFORCEMENT LEARNING AND OPTIMAL ADAPTIVE CONTROL Lewis c11.tex V1-10/19/2011 4:10pm Page 461 11 REINFORCEMENT LEARNING AND OPTIMAL ADAPTIVE CONTROL In this book we have presented a variety of methods for the analysis and design of optimal control systems.

More information

Queueing analysis of service deferrals for load management in power systems

Queueing analysis of service deferrals for load management in power systems Qeeing analysis of service deferrals for load management in power systems Andrés Ferragt and Fernando Paganini Universidad ORT Urgay Abstract With the advent of renewable sorces and Smart- Grid deployments,

More information

Joint Transfer of Energy and Information in a Two-hop Relay Channel

Joint Transfer of Energy and Information in a Two-hop Relay Channel Joint Transfer of Energy and Information in a Two-hop Relay Channel Ali H. Abdollahi Bafghi, Mahtab Mirmohseni, and Mohammad Reza Aref Information Systems and Secrity Lab (ISSL Department of Electrical

More information

Decoder Error Probability of MRD Codes

Decoder Error Probability of MRD Codes Decoder Error Probability of MRD Codes Maximilien Gadolea Department of Electrical and Compter Engineering Lehigh University Bethlehem, PA 18015 USA E-mail: magc@lehighed Zhiyan Yan Department of Electrical

More information

1 Undiscounted Problem (Deterministic)

1 Undiscounted Problem (Deterministic) Lectre 9: Linear Qadratic Control Problems 1 Undisconted Problem (Deterministic) Choose ( t ) 0 to Minimize (x trx t + tq t ) t=0 sbject to x t+1 = Ax t + B t, x 0 given. x t is an n-vector state, t a

More information

Second-Order Wave Equation

Second-Order Wave Equation Second-Order Wave Eqation A. Salih Department of Aerospace Engineering Indian Institte of Space Science and Technology, Thirvananthapram 3 December 016 1 Introdction The classical wave eqation is a second-order

More information

Scheduling parallel jobs to minimize the makespan

Scheduling parallel jobs to minimize the makespan J Sched (006) 9:433 45 DOI 10.1007/s10951-006-8497-6 Schedling parallel jobs to minimize the makespan Berit Johannes C Springer Science + Bsiness Media, LLC 006 Abstract We consider the NP-hard problem

More information

Chapter 4 Supervised learning:

Chapter 4 Supervised learning: Chapter 4 Spervised learning: Mltilayer Networks II Madaline Other Feedforward Networks Mltiple adalines of a sort as hidden nodes Weight change follows minimm distrbance principle Adaptive mlti-layer

More information

EOQ Problem Well-Posedness: an Alternative Approach Establishing Sufficient Conditions

EOQ Problem Well-Posedness: an Alternative Approach Establishing Sufficient Conditions pplied Mathematical Sciences, Vol. 4, 2, no. 25, 23-29 EOQ Problem Well-Posedness: an lternative pproach Establishing Sfficient Conditions Giovanni Mingari Scarpello via Negroli, 6, 226 Milano Italy Daniele

More information

Information Source Detection in the SIR Model: A Sample Path Based Approach

Information Source Detection in the SIR Model: A Sample Path Based Approach Information Sorce Detection in the SIR Model: A Sample Path Based Approach Kai Zh and Lei Ying School of Electrical, Compter and Energy Engineering Arizona State University Tempe, AZ, United States, 85287

More information

Decoder Error Probability of MRD Codes

Decoder Error Probability of MRD Codes Decoder Error Probability of MRD Codes Maximilien Gadolea Department of Electrical and Compter Engineering Lehigh University Bethlehem, PA 18015 USA E-mail: magc@lehigh.ed Zhiyan Yan Department of Electrical

More information

We automate the bivariate change-of-variables technique for bivariate continuous random variables with

We automate the bivariate change-of-variables technique for bivariate continuous random variables with INFORMS Jornal on Compting Vol. 4, No., Winter 0, pp. 9 ISSN 09-9856 (print) ISSN 56-558 (online) http://dx.doi.org/0.87/ijoc.046 0 INFORMS Atomating Biariate Transformations Jeff X. Yang, John H. Drew,

More information

Intelligent Positioning Plate Predictive Control and Concept of Diagnosis System Design

Intelligent Positioning Plate Predictive Control and Concept of Diagnosis System Design Intelligent Positioning Plate Predictive Control and Concept of Diagnosis System Design Matej Oravec - Anna Jadlovská Department of Cybernetics and Artificial Intelligence, Faclty of Electrical Engineering

More information

RELIABILITY ASPECTS OF PROPORTIONAL MEAN RESIDUAL LIFE MODEL USING QUANTILE FUNC- TIONS

RELIABILITY ASPECTS OF PROPORTIONAL MEAN RESIDUAL LIFE MODEL USING QUANTILE FUNC- TIONS RELIABILITY ASPECTS OF PROPORTIONAL MEAN RESIDUAL LIFE MODEL USING QUANTILE FUNC- TIONS Athors: N.UNNIKRISHNAN NAIR Department of Statistics, Cochin University of Science Technology, Cochin, Kerala, INDIA

More information

Department of Industrial Engineering Statistical Quality Control presented by Dr. Eng. Abed Schokry

Department of Industrial Engineering Statistical Quality Control presented by Dr. Eng. Abed Schokry Department of Indstrial Engineering Statistical Qality Control presented by Dr. Eng. Abed Schokry Department of Indstrial Engineering Statistical Qality Control C and U Chart presented by Dr. Eng. Abed

More information

Adaptive Fault-tolerant Control with Control Allocation for Flight Systems with Severe Actuator Failures and Input Saturation

Adaptive Fault-tolerant Control with Control Allocation for Flight Systems with Severe Actuator Failures and Input Saturation 213 American Control Conference (ACC) Washington, DC, USA, Jne 17-19, 213 Adaptive Falt-tolerant Control with Control Allocation for Flight Systems with Severe Actator Failres and Inpt Satration Man Wang,

More information

Multi-Voltage Floorplan Design with Optimal Voltage Assignment

Multi-Voltage Floorplan Design with Optimal Voltage Assignment Mlti-Voltage Floorplan Design with Optimal Voltage Assignment ABSTRACT Qian Zaichen Department of CSE The Chinese University of Hong Kong Shatin,N.T., Hong Kong zcqian@cse.chk.ed.hk In this paper, we stdy

More information

Effects Of Symmetry On The Structural Controllability Of Neural Networks: A Perspective

Effects Of Symmetry On The Structural Controllability Of Neural Networks: A Perspective 16 American Control Conference (ACC) Boston Marriott Copley Place Jly 6-8, 16. Boston, MA, USA Effects Of Symmetry On The Strctral Controllability Of Neral Networks: A Perspective Andrew J. Whalen 1, Sean

More information

5. The Bernoulli Equation

5. The Bernoulli Equation 5. The Bernolli Eqation [This material relates predominantly to modles ELP034, ELP035] 5. Work and Energy 5. Bernolli s Eqation 5.3 An example of the se of Bernolli s eqation 5.4 Pressre head, velocity

More information

Evaluation of the Fiberglass-Reinforced Plastics Interfacial Behavior by using Ultrasonic Wave Propagation Method

Evaluation of the Fiberglass-Reinforced Plastics Interfacial Behavior by using Ultrasonic Wave Propagation Method 17th World Conference on Nondestrctive Testing, 5-8 Oct 008, Shanghai, China Evalation of the Fiberglass-Reinforced Plastics Interfacial Behavior by sing Ultrasonic Wave Propagation Method Jnjie CHANG

More information

Applying Fuzzy Set Approach into Achieving Quality Improvement for Qualitative Quality Response

Applying Fuzzy Set Approach into Achieving Quality Improvement for Qualitative Quality Response Proceedings of the 007 WSES International Conference on Compter Engineering and pplications, Gold Coast, stralia, Janary 17-19, 007 5 pplying Fzzy Set pproach into chieving Qality Improvement for Qalitative

More information

Upper Bounds on the Spanning Ratio of Constrained Theta-Graphs

Upper Bounds on the Spanning Ratio of Constrained Theta-Graphs Upper Bonds on the Spanning Ratio of Constrained Theta-Graphs Prosenjit Bose and André van Renssen School of Compter Science, Carleton University, Ottaa, Canada. jit@scs.carleton.ca, andre@cg.scs.carleton.ca

More information

Subcritical bifurcation to innitely many rotating waves. Arnd Scheel. Freie Universitat Berlin. Arnimallee Berlin, Germany

Subcritical bifurcation to innitely many rotating waves. Arnd Scheel. Freie Universitat Berlin. Arnimallee Berlin, Germany Sbcritical bifrcation to innitely many rotating waves Arnd Scheel Institt fr Mathematik I Freie Universitat Berlin Arnimallee 2-6 14195 Berlin, Germany 1 Abstract We consider the eqation 00 + 1 r 0 k2

More information

FEA Solution Procedure

FEA Solution Procedure EA Soltion Procedre (demonstrated with a -D bar element problem) EA Procedre for Static Analysis. Prepare the E model a. discretize (mesh) the strctre b. prescribe loads c. prescribe spports. Perform calclations

More information

Decision Oriented Bayesian Design of Experiments

Decision Oriented Bayesian Design of Experiments Decision Oriented Bayesian Design of Experiments Farminder S. Anand*, Jay H. Lee**, Matthew J. Realff*** *School of Chemical & Biomoleclar Engineering Georgia Institte of echnology, Atlanta, GA 3332 USA

More information

Prediction of Transmission Distortion for Wireless Video Communication: Analysis

Prediction of Transmission Distortion for Wireless Video Communication: Analysis Prediction of Transmission Distortion for Wireless Video Commnication: Analysis Zhifeng Chen and Dapeng W Department of Electrical and Compter Engineering, University of Florida, Gainesville, Florida 326

More information

Cuckoo hashing: Further analysis

Cuckoo hashing: Further analysis Information Processing Letters 86 (2003) 215 219 www.elsevier.com/locate/ipl Cckoo hashing: Frther analysis Lc Devroye,PatMorin School of Compter Science, McGill University, 3480 University Street, Montreal,

More information

OPTIMUM EXPRESSION FOR COMPUTATION OF THE GRAVITY FIELD OF A POLYHEDRAL BODY WITH LINEARLY INCREASING DENSITY 1

OPTIMUM EXPRESSION FOR COMPUTATION OF THE GRAVITY FIELD OF A POLYHEDRAL BODY WITH LINEARLY INCREASING DENSITY 1 OPTIMUM EXPRESSION FOR COMPUTATION OF THE GRAVITY FIEL OF A POLYHERAL BOY WITH LINEARLY INCREASING ENSITY 1 V. POHÁNKA2 Abstract The formla for the comptation of the gravity field of a polyhedral body

More information

Multivariable Ripple-Free Deadbeat Control

Multivariable Ripple-Free Deadbeat Control Scholarly Jornal of Mathematics and Compter Science Vol. (), pp. 9-9, October Available online at http:// www.scholarly-jornals.com/sjmcs ISS 76-8947 Scholarly-Jornals Fll Length Research aper Mltivariable

More information

1. State-Space Linear Systems 2. Block Diagrams 3. Exercises

1. State-Space Linear Systems 2. Block Diagrams 3. Exercises LECTURE 1 State-Space Linear Sstems This lectre introdces state-space linear sstems, which are the main focs of this book. Contents 1. State-Space Linear Sstems 2. Block Diagrams 3. Exercises 1.1 State-Space

More information

Mathematical Analysis of Nipah Virus Infections Using Optimal Control Theory

Mathematical Analysis of Nipah Virus Infections Using Optimal Control Theory Jornal of Applied Mathematics and Physics, 06, 4, 099- Pblished Online Jne 06 in SciRes. http://www.scirp.org/jornal/jamp http://dx.doi.org/0.436/jamp.06.464 Mathematical Analysis of Nipah Virs nfections

More information

Chapter 2 Difficulties associated with corners

Chapter 2 Difficulties associated with corners Chapter Difficlties associated with corners This chapter is aimed at resolving the problems revealed in Chapter, which are cased b corners and/or discontinos bondar conditions. The first section introdces

More information

Designing of Virtual Experiments for the Physics Class

Designing of Virtual Experiments for the Physics Class Designing of Virtal Experiments for the Physics Class Marin Oprea, Cristina Miron Faclty of Physics, University of Bcharest, Bcharest-Magrele, Romania E-mail: opreamarin2007@yahoo.com Abstract Physics

More information

Bayes and Naïve Bayes Classifiers CS434

Bayes and Naïve Bayes Classifiers CS434 Bayes and Naïve Bayes Classifiers CS434 In this lectre 1. Review some basic probability concepts 2. Introdce a sefl probabilistic rle - Bayes rle 3. Introdce the learning algorithm based on Bayes rle (ths

More information

Dynamic Optimization of First-Order Systems via Static Parametric Programming: Application to Electrical Discharge Machining

Dynamic Optimization of First-Order Systems via Static Parametric Programming: Application to Electrical Discharge Machining Dynamic Optimization of First-Order Systems via Static Parametric Programming: Application to Electrical Discharge Machining P. Hgenin*, B. Srinivasan*, F. Altpeter**, R. Longchamp* * Laboratoire d Atomatiqe,

More information

L 1 -smoothing for the Ornstein-Uhlenbeck semigroup

L 1 -smoothing for the Ornstein-Uhlenbeck semigroup L -smoothing for the Ornstein-Uhlenbeck semigrop K. Ball, F. Barthe, W. Bednorz, K. Oleszkiewicz and P. Wolff September, 00 Abstract Given a probability density, we estimate the rate of decay of the measre

More information

EXPT. 5 DETERMINATION OF pk a OF AN INDICATOR USING SPECTROPHOTOMETRY

EXPT. 5 DETERMINATION OF pk a OF AN INDICATOR USING SPECTROPHOTOMETRY EXPT. 5 DETERMITIO OF pk a OF IDICTOR USIG SPECTROPHOTOMETRY Strctre 5.1 Introdction Objectives 5.2 Principle 5.3 Spectrophotometric Determination of pka Vale of Indicator 5.4 Reqirements 5.5 Soltions

More information

B-469 Simplified Copositive and Lagrangian Relaxations for Linearly Constrained Quadratic Optimization Problems in Continuous and Binary Variables

B-469 Simplified Copositive and Lagrangian Relaxations for Linearly Constrained Quadratic Optimization Problems in Continuous and Binary Variables B-469 Simplified Copositive and Lagrangian Relaxations for Linearly Constrained Qadratic Optimization Problems in Continos and Binary Variables Naohiko Arima, Snyong Kim and Masakaz Kojima October 2012,

More information

3 2D Elastostatic Problems in Cartesian Coordinates

3 2D Elastostatic Problems in Cartesian Coordinates D lastostatic Problems in Cartesian Coordinates Two dimensional elastostatic problems are discssed in this Chapter, that is, static problems of either plane stress or plane strain. Cartesian coordinates

More information

Study on the Mathematic Model of Product Modular System Orienting the Modular Design

Study on the Mathematic Model of Product Modular System Orienting the Modular Design Natre and Science, 2(, 2004, Zhong, et al, Stdy on the Mathematic Model Stdy on the Mathematic Model of Prodct Modlar Orienting the Modlar Design Shisheng Zhong 1, Jiang Li 1, Jin Li 2, Lin Lin 1 (1. College

More information

CDS 110b: Lecture 1-2 Introduction to Optimal Control

CDS 110b: Lecture 1-2 Introduction to Optimal Control CDS 110b: Lectre 1-2 Introdction to Optimal Control Richard M. Mrray 4 Janary 2006 Goals: Introdce the problem of optimal control as method of trajectory generation State the maimm principle and give eamples

More information

Performance analysis of GTS allocation in Beacon Enabled IEEE

Performance analysis of GTS allocation in Beacon Enabled IEEE 1 Performance analysis of GTS allocation in Beacon Enabled IEEE 8.15.4 Pangn Park, Carlo Fischione, Karl Henrik Johansson Abstract Time-critical applications for wireless sensor networks (WSNs) are an

More information

Optimal search: a practical interpretation of information-driven sensor management

Optimal search: a practical interpretation of information-driven sensor management Optimal search: a practical interpretation of information-driven sensor management Fotios Katsilieris, Yvo Boers and Hans Driessen Thales Nederland B.V. Hengelo, the Netherlands Email: {Fotios.Katsilieris,

More information

Mechanisms and topology determination of complex chemical and biological network systems from first-passage theoretical approach

Mechanisms and topology determination of complex chemical and biological network systems from first-passage theoretical approach Mechanisms and topology determination of complex chemical and biological network systems from first-passage theoretical approach Xin Li and Anatoly B. Kolomeisky Citation: J. Chem. Phys. 39, 4406 (203);

More information

Robust Tracking and Regulation Control of Uncertain Piecewise Linear Hybrid Systems

Robust Tracking and Regulation Control of Uncertain Piecewise Linear Hybrid Systems ISIS Tech. Rept. - 2003-005 Robst Tracking and Reglation Control of Uncertain Piecewise Linear Hybrid Systems Hai Lin Panos J. Antsaklis Department of Electrical Engineering, University of Notre Dame,

More information

Linear and Nonlinear Model Predictive Control of Quadruple Tank Process

Linear and Nonlinear Model Predictive Control of Quadruple Tank Process Linear and Nonlinear Model Predictive Control of Qadrple Tank Process P.Srinivasarao Research scholar Dr.M.G.R.University Chennai, India P.Sbbaiah, PhD. Prof of Dhanalaxmi college of Engineering Thambaram

More information

Complex Variables. For ECON 397 Macroeconometrics Steve Cunningham

Complex Variables. For ECON 397 Macroeconometrics Steve Cunningham Comple Variables For ECON 397 Macroeconometrics Steve Cnningham Open Disks or Neighborhoods Deinition. The set o all points which satis the ineqalit

More information

Constructive Root Bound for k-ary Rational Input Numbers

Constructive Root Bound for k-ary Rational Input Numbers Constrctive Root Bond for k-ary Rational Inpt Nmbers Sylvain Pion, Chee Yap To cite this version: Sylvain Pion, Chee Yap. Constrctive Root Bond for k-ary Rational Inpt Nmbers. 19th Annal ACM Symposim on

More information

A Theory of Markovian Time Inconsistent Stochastic Control in Discrete Time

A Theory of Markovian Time Inconsistent Stochastic Control in Discrete Time A Theory of Markovian Time Inconsistent Stochastic Control in Discrete Time Tomas Björk Department of Finance, Stockholm School of Economics tomas.bjork@hhs.se Agatha Mrgoci Department of Economics Aarhs

More information

Optimization in Predictive Control Algorithm

Optimization in Predictive Control Algorithm Latest rends in Circits, Systems, Sinal Processin and Atomatic Control Optimization in Predictive Control Alorithm JAN ANOŠ, MAREK KUBALČÍK omas Bata University in Zlín, Faclty of Applied Informatics Nám..

More information