Optimistic Planning with Long Sequences of Identical Actions: An Extended Theoretical and Experimental Study

Size: px
Start display at page:

Download "Optimistic Planning with Long Sequences of Identical Actions: An Extended Theoretical and Experimental Study"

Transcription

1 Volume 56, Number 1-2, Optimistic Planning with Long Sequences of Ientical Actions: An Extene Theoretical an Experimental Stuy Koppány Máthé, Lucian Bușoniu, Liviu Miclea epartment of Automation, Technical University of Cluj-Napoca, Abstract - Optimistic planning for eterministic systems (OP) fins near-optimal control solutions for general, non-linear systems. OP iteratively explores a search tree of action sequences by always expaning further the most promising sequence, where each expansion appens all possible one-step actions. However, the generality of the algorithm comes at a high computational cost. We aim to alleviate this complexity in a subclass of control problems where longer ranges of constant actions are preferre, by aapting OP to this class of problems. The novel algorithm is calle optimistic planning with K ientical actions (), an it creates sequences by appening to them up to K repetitions of each possible action. In our analysis we show that inee, offers a ilar a posteriori performance as an in certain cases the tree epth reache (a measure of the performance) is increase compare to OP. Our experiments, performe on the inverte penulum an HIV infection treatment control, confirm that for suitable control problems can perform better than OP, for properly tune parameter K. Keywors online planning; piecewise constant control; preictive control; optimization 1. INTROUCTION We consier optimal control problems moele as Marov ecision processes, in which a nonlinear system must be controlle in iscrete time so as to imize a iscounte sum of rewars along an infinite horizon (the return). Examples of such performance inices inclue processing times, energy or resource usage, etc. Many algorithms have been propose to solve this type of problem, coming from a variety of fiels incluing reinforcement learning [1], approximate ynamic programming [2], optimal control, stochastic programming, etc. In this paper, the rewars are assume to be boune, the control actions (inputs) must resie in a finite set with M elements. We focus on online planning algorithms, which wor by searching at each step for an action sequence that is best locally for the current state of the system, an then applying the first action of this sequence. Specifically, we employ an exten the algorithm Optimistic Planning for eterministic Systems (OP) [3], which iteratively expans a tree representation of the possible sequences of actions. At each iteration, OP further refines a sequence that has the largest upper boun on the iscounte return, hence the ajective 'optimistic'. Refinement is one by appening all possible, singlestep actions to the en of the sequence, resulting in M chil noes (sequences). Then, at the en, a safe choice is mae by returning the sequence with the largest lower boun. In certain optimal control problems, sequences that eep the same action constant for a larger number of steps are preferre. This is e.g. a feature of bangbang solutions to time-optimal control problems (e.g. in vehicle path planning, aerospace) [4]. Switche systems where the actions are moe selections are another example, since switches between actions are often energetically costly (e.g., rising or lowering a canal barrier [5]), or switching too often may estabilize the system [6]. In networe controlle systems, changing the action implies sening it over the networ an is therefore costly in terms of communication. Our aim in this paper is to aapt optimistic planning to such problems, where repeate actions are preferable. The main iea is ple: rather than appening single actions to the optimistic sequence, we appen up to K repetitions of the same action, where K is the main parameter of the algorithm. Thus, if such repeate-action solutions are better, the algorithm will have the opportunity to iscover them earlier than OP. The other elements of the algorithm remain the same as OP (namely, an optimistic sequence is refine, an a sequence imizing the lower boun is returne at the en). We call the moifie algorithm Optimistic Planning with K ientical actions,. We provie analytical insight into the behavior of the algorithm. We start by showing that the nearoptimality of the solution is still ictate by the eepest expane sequence, lie for OP (the eeper, the closer to the optimum). However, this epth will be ifferent 2015 Meiamira Science Publisher. All rights reserve

2 28 ACTA ELECTROTEHNICA from that in OP ue to the ifferent expansion rule. In particular, we analyze two interesting special cases illustrating that in certain problems the epth will be larger an using pays off; but that in other problems, OP will expan eeper trees an so it remains preferable. Experimental results are given for two optimal control problems: swinging up an uneractuate inverte penulum, an treatment of HIV infection by optimal switching of rugs on an off. In our ulations, leas to shallower trees than OP, so we cannot tae avantage of the analytical results. Nevertheless, in practice outperforms OP for the control problems investigate, an we expect that for properly tune parameter K this will hol true in a wier class of problems. Several other types of optimistic planning algorithms have been propose [7], such as for stochastic problems [8], [9] an continuous action spaces [10], [11]. Closer to the present wor is our approach from [12], where planning only explores sequences that switch between actions a limite number of times. In contrast, the approach from here oes not limit the number of switches, instea only exploring repeate-action solutions earlier. In the control fiel, relate approaches inclue piecewise-constant control [13] an specifically piecewise-constant moelpreictive control [14], [15], as well as ilar ieas in sample-ata systems [16]. References [17] [19] focus on improving performance an reucing computation, as we o. However, whereas stability is the main concern in these wors, our theoretical iscussion explicitly focuses on the relation between computation an performance of the algorithm. Unlie [19], our algorithm oes not constrain where the action shoul be change (although it wors better when switches happen after aroun K steps). Note that online planning in general is a type of moel-preictive control. This article is a revise an extene version of the conference paper [20]. Aitional contributions with respect to the conference version inclue the analysis of two illustrative cases to provie more insight, an a proof of the main result; the new HIV infection example; an a more etaile experimental investigation in the inverte penulum problem. The rest of the paper is structure as follows. Section II escribes the optimal control problem an OP. Section III presents with its analysis, an Section IV shows our ulation results. Section V gives our conclusions. 2. OPTIMAL CONTROL AN OPTIMISTIC PLANNING FOR ETERMINISTIC SYSTEMS Optimal control problems are often efine as Marov ecision processes (MP) using states x X, actions u U, a state transition function f ( x, u) = x' an a corresponing rewar function ρ : X U R, r ( x, u) = r. Besies escribing the system ynamics by function f (i.e. the transition from state x to x when applying control action u), MPs also provie a characterization of the quality of each transition, by means of ρ ( x, u). iscounte optimal control fins for any given initial state x 0 an infinite action sequence h = ( u 0, u 1,...) that imizes the iscounte sum of rewars: v( h ) = ρ( x, u ) (1) = 0 where [0,1) is the iscount factor. Function v is calle the value function, an the optimal value is enote v = supv( h ). h An optimal control metho, optimistic planning for eterministic systems (OP) [3] esigns control problems as MPs, assuming a finite an iscrete action space U = { u 1,..., u M }, the system ynamics f an the rewar function ρ to be nown, an the rewars to be boune, ρ ( x, u) [0,1], x, u. With these assumptions, OP loos for the optimizer of a problem by constructing search trees of action sequences. Given a computational buget n of allowe number of expansions, OP starts from the empty sequence an iteratively appens in each expansion step all possible actions from U to an existing sequence. In this manner, a tree is constructe where the noes correspons to the control action taen. Each expane noe will have M chilren. Each leaf state etermines an action sequence by reaing all actions from the root to a given leaf. An example OP tree is shown in Fig. 1. Fig. 1. OP search tree after three expansions, with M=2 action space size. Noes contain actions, an each expane noe has M chilren, one corresponing to each action from the action space. A sample action sequence, leaing to the bol leaf from tree epth = is h 3 = ( u, u, u ). The tree expansion is performe base on the following rule: for each leaf (i.e. ulate, finitely long action sequence h ), an upper boun b( h ) is

3 Volume 56, Number 1-2, calculate on the value function of all the infinite action sequences h that pass through that leaf: b( h ) = l( h ) v( h ) (2) 1 l( h l efines a lower boun on h ) where h ) 1 ( v : ( ) = ρ( x, u ) v( h ) (3) = 0 switche systems. In networe control systems, banwith limitations favour fewer transmissions of control action upates, an thus less action switches in the action sequences. Switche systems commonly have high costs for changing the current moe of the system (e.g. opening a barrier, or altering the state of a mechanical switch) an thus prefer longer ranges of constant actions. For such control problems, moifies the principle of OP as follows: uring the optimistic search tree construction, as with each expansion step M K chilren to a noe, taing each possible action with recurrence 1 to K. Taing K=1 reuces to OP. Because ρ( x, u ) taes values between 0 an 1, these are vali bouns on v( h ) from (1). Having these bouns calculate, the leaf with the highest upper boun (also calle b-value) is selecte for further expansion, after which the proceure is repeate. The algorithm is calle optimistic as it always maes the selection for expansion by assuming the best possible value for the rewars that have not yet been observe. At the en of the search, after the n expansions have been exhauste, the leaf with the highest lower boun is chosen as the near-optimal solution: h ' = arg l( h ) (4) h Woring in a receing horizon fashion, usually, OP applies to the system only the first action from h ', after which the algorithm is repeate for the new state the system reache. Proposition 1. OP expans only noes that satisfy v l( h ) /(1 ). Further, OP is /(1 ) -optimal, with the epth of the eepest expane noe. In other wors, the cumulative rewar of the chosen action sequence h ) is at most /(1 ) ( smaller than the optimal solution. Note that this information is available only a posteriori, i.e. after running the algorithm. A formulation of this a posteriori guarantee with a etaile proof, as well as an a priori boun, are provie in [3]. 3. OPTIMISTIC PLANNING WITH K IENTICAL ACTIONS Optimistic planning with K ientical actions () consiers the subclass of control problems where ranges of ientical control actions are preferre, lie in the case of networe control systems or Fig. 2. search tree with M=3 possible actions an K=2 imum repetitions. The left graph is a compact representation of m the tree, with u, m inicating the sequence of action u repeate K times. The same tree is unwrappe in the OP representation in the right graph. uring our analysis, we will use both representations. The novelty of the algorithm is the aitional evaluation of sequences of repeate actions. In case of control problems where ranges of ientical control actions are preferre, intuitively, shoul reach a near-optimal solution with fewer expansions than OP. We enote sequences from the compact tree as h', while corresponing unwrappe sequences are enote h. Note also the change of notation for the epths. An example sequence from the compact tree 3,1 1,2 from epth =2 is h ' 2 = [ u, u ] that correspons to the unwrappe action sequence h 3 = [ u, u, u ] with =3. Note that the unwrappe tree epths moify the b-value efinition as follows: b( h ) = l( h ) v( h ) 1 with the lower boun efine as: l( h 1 (5) ) = ρ( x, u ) v( h ) (6) = 0 These relations hol since v ( h ) is calculate for the unwrappe sequences in the same manner as in case of OP.

4 30 ACTA ELECTROTEHNICA The tree contains uplicate unwrappe 3,2 sequences. For instance, sequences h ' = [ ] an 1 u 3,1 3,1 h ' = [ u, ] from Fig. 2 contain the same 2 u 3 3 unwrappe action sequences, h 2 = [ u, u ]. This reunancy is unwante an shoul thus be avoie. The way this is performe oes not influence the analysis an remains a etail of implementation. Proposition 2. expans only noes that satisfy v l( h ) /(1 ) with the cumulative epth of the search tree. Therefore, is /(1 ) -optimal, with the largest cumulative epth of the eepest expane noe h. ' Proof. We follow ilar line to the proof for the OP a posteriori guarantee from Proposition 1, etaile in [3]. At any iteration, there exists a noe h in the tree that is the initial subsequence of an optimal sequence. Hence, b( h ) v by efinition (5). While we expan a possibly ifferent noe h, because we imize b-values we have b( h ) b( h ) v. Taing (5), equivalently v l( h ) /(1 ). enote now the noe returne by the algorithm by h ret. Since the lower bouns only increase with further expansions an h is alreay containe in the search tree, we have l ( h ) l( h ). So finally, ret /(1 ) from which by efinition of l( hret ) v the lower boun from (6), v( hret ) /(1 ) v or v v( h ret ) /(1 ). We tae the most favorable, which means the epth of the eepest expane noe. Besies the a posteriori guarantee, another performance inicator is the a priori boun that relates the available computational buget n to the expansion epth reache, an so to the near-optimality of the algorithm. In case of, obtaining such a boun is more ifficult, as the increase of the tree epth is relate both to the available computational buget an the factor K, where the latter influences the epth increase arbitrarily (between 1 an K with each expansion). etaile iscussion can be foun in [20]. The performance of can thus be evaluate only a posteriori. The obtaine boun cannot be irectly compare to the one of OP, as given the same control problem the two algorithms expan ifferent search trees. As expans more noes with each step, <, from where one cannot conclue a OP irect relation between an, the real OP action sequence lengths. In certain cases outperforms OP, an these are the types of problems we are intereste in. In other cases OP will wor better. Consier two special cases, presente in Fig. 3. In the first case (left graph), there exists a single path with rewars always equal to one, an all the other branches have zero rewar. In the secon case (right graph), uniform zero rewars are obtaine except for one path, where each K-th action will have a rewar of one. In both cases, we evaluate the tree epth the algorithms reach (as an inicator of the near-optimality), consiering the computational buget n as the number of noes ae to the search tree. Note that this is a more fair measure of computation, as the most resource consuming operation is the ulation of transitions (i.e. applying an action, which correspons to the aition of a noe). We enote the eepest expane noe by OP, respectively for the two algorithms. In the first case of a path of rewars equal to one, both algorithms expan only on this path. OP expans thus at epth = ( n 1) / M. For we will have with each expansion M K noes ae with a epth increase of K. Now, since each expane noe on the optimal path will have the same b- value, b ( h ) = 1/(1 ), the algorithm will have to apply a tie breaing rule. Commonly, when multiple caniates appear for expansion, the first create noe is selecte for further expansion, which in our case will be the noe with the fewest repetitions. Thus, eeps on expaning on the optimal path noe by noe, in increasing orer of epth, an will expan until epth = ( n 1) /( M K). Therefore, in this case < OP. In case of the secon graph from Fig. 3, OP will perform uniform expansion until epth K after which continue with uniform expansion only from the noe that has the rewar of one, an so on. In this manner, OP will expan subtrees having 2 K 1 K 1 M M... M 1 = ( M 1) /( M 1) 1 noes. For reaching a epth that is multiple of K, OP will therefore require a buget of K n = / K (( M 1) /( M 1) 1). On the other han, with repetition K will always fin the rewars of one with one expansion an thus explore only on the optimal path. Thus n / M n / M 1 But K OP n /(( M M 2) /( M 1)) K from where K 1 K 2 OP n /(( M M 2) / M ) K ( n / M ) K Therefore, for large enough K we have >. OP

5 Volume 56, Number 1-2, Fig. 3. Search tree with: all the rewars zero except for the most-right path (left); all the rewars zero except for every K-th transition on the most-right path, with K=2 (right). Besies these special cases, the experiments from the sequel show how OP compares with in further situations. 4. EXPERIMENTAL RESULTS The algorithms are evaluate consiering two control problems: the inverte penulum an HIV infection treatment. etaile test results an iscussions are provie with the inverte penulum, while the treatment control problem is consiere as it is a more complex, high-imensionality example Inverte penulum swing-up In the inverte penulum problem, optimal control aims to swing up an stabilize an uneractuate penulum. The control power is lower than it woul be sufficient to swing up the penulum using a single rotation. Thus, several swings are require to stabilize the penulum in the pointing up state. As the swings can be obtaine by sequences of constant actions, an the multiple swings require long planning horizon, this control problem matches the problem class where is expecte to provie goo performance. Fig. 4. Inverte penulum x = ] with [, ) ra T [ x1, x2 x1 = α π π x2 = α [ 15π,15π ra/s. an ] The angles cover the entire circle with 0 for the pointing up state, whereas the angular velocity is boune by saturation to the given interval. The actions are taen in the iscrete space U u,0, u } { = V, representing the voltage applie to the motors. Except for the last set of experiments, we tae u 2 V. The = rewar function is consiere in its unnormalize form T T as ρ ( x, u) = x Qx u Ru, with parameters 1 Q = an R=0.1. The normalization of the rewar function to the interval [0, 1] is performe base on the state bouns. The iscount factor use for calculating the cumulative rewar is set to = We specify computational bugets using n, the number of noes ae to the search tree (corresponing to the number of ulate transitions). In case of OP n = M n 1, whereas for n = K M n 1. Another remar is that the uplicate elimination in case of leas to smaller search trees than those of OP. For this reason, an equal treesize variant of the algorithm will be consiere as well, which in place of the eliminate uplicates as further noes to the tree until reaching the same treesize n as with OP. We consier two types of experiments. In the first, we calculate offline regrets that show the nearoptimality of the algorithm. We tae a set of initial states { π, 5 / 6 π,..., π} ra { 15π, 14π,...,15π } ra/s, an for each state calculate a single control action. Finally, we evaluate the average of the real regret 1. The secon set of experiments calculates the online return, that is the cumulative rewar obtaine while applying a sequence of actions in close-loop. In these experiments, the penulum is starte from the initial T state x 0 = [ π,0] (penulum pointing own, with zero angular velocity) an the algorithms calculate a nearoptimal control action an apply to the system, reaching a new state, after which the searches are repeate. We allow the experiments to run for T=4s, which is usually enough for swinging up an stabilizing the penulum. The penulum is moele using a state vector that consists of the angle of the penulum an the angular velocity: 1 The real regret calculates the near-optimality of the algorithm with respect to the optimum v, value available from another algorithm at much higher computational costs. The formula is r = v l( h ' ). n

6 32 ACTA ELECTROTEHNICA The sampling time is taen TS = s, which results in 160 search tree constructions. We first analyze the effect of varying the value of K for a given computational buget. After several preliminary tests, we observe that OP reaches a nearoptimal solution (i.e. is able to swing up an stabilize the penulum) using a buget of n = 1500 noes ae to the tree per ulation. Taing the algorithm parameters in the set K {1,2,4,5,8,10,16,20,32}, we select n so to have all noes completely expane, i.e. in case to be the multiple of M K for any K. Therefore, we tae n = Fig. 5 shows the results from the offline tests, while Fig. 6 compares online performance. for both of its variants. Comparing the two variants of, there is no evience that the equal treesize algorithm that expans more noes than the other one woul result in the same or better guarantees. Nevertheless, there is no ecreasing tren of the regrets with respect to the computational buget for any of the algorithms in general. Fig. 7. Regret obtaine with K=16 Further ulations analyze the tree epth increase in case of offline tests. The tree epth is etermine by the epth of the eepest expane noe, which in case of OP matches the epth introuce in Section II, while for it is the cumulative epth also in Section III., use Fig. 5. Average regret for variable K an n = 1920 From Fig. 5, the best (smallest) regret is obtaine for K=16 an one may observe that smaller values of K o not yet reach the performance of OP, while higher values of K alreay provie weaer performance. This confirms that the choice of K is subject to fine tuning an that higher values than the chosen one weaen the performance. All these conclusions are confirme by the obtaine cumulative rewars in the online experiments as well, presente in Fig. 6. Fig. 6. Return for variable K an n = 1920 Now, taing in the sequel K=16 for, we perform offline an online tests for a set of computational bugets. Fig. 7 presents the offline test results. The first conclusion is that offers better performance than OP, having lower regret than OP Fig. 8. Tree epth with K=16 Fig. 8 confirms that higher bugets allow for the construction of eeper search trees an the increase of the tree epth is monotonic. Comparing the two variants, as the equal treesize variant expans more noes than the other one, a small increase in epth is also observe. Nevertheless, a greater ifference is observe between the epths reache by OP an the variants. OP constructs much eeper search trees, i.e. OP >, unlie the expectations from the analysis. This confirms our iscussions from Section III regaring the epths reache by the algorithms an invaliates thus the initial intuition of constructing eeper trees with, at least in case of the current control problem. Nevertheless, the actual performance obtaine for the inverte penulum are better for, as shown in Fig. 7. Thus, for suitable control problems, the real performance of might

7 Volume 56, Number 1-2, be better than the performance of OP, as shown in the sequel with the online experiments as well. Fig. 9. Returns obtaine for K=16. Fig. 9 shows the online experimental results, i.e. the returns obtaine by the algorithms when performing the penulum swing-up, using ifferent values for the computational buget. The near-optimal solution is aroun a return of 30, which is alreay obtaine by all the algorithms for n = Two important conclusions can be rawn. First, the algorithm obtains overall higher return than OP, i.e. has a better performance than OP. Secon, reaches the nearoptimal solutions sooner than OP. This can be seen for small values of n where OP is sub-optimal but alreay provies near-optimal solutions. Finally, eeping K=16 for, we compare the online performance of the algorithms when varying the control input. strategies for their use. In the structure treatment interruptions strategy, the patient is cycle on an off rugs, see e.g. [21]. The HIV infection ynamics are escribe by a six-imensional nonlinear moel with two binary inputs corresponing to the application of the two rugs, so there are 4 iscrete actions; the sampling time is 5 ays. The objective is to rive the system from an unhealthy equilibrium, where the infection has taen hol, to the basin of attraction of a healthy equilibrium where the patient controls the infection without the nee for rugs. For the moel, parameters, an rewar function, see [21]. In our experiments, we consier using the two rugs in combinations {{ 0,0},{0,0.03},{0.7,0},{0.7,0.03}}. These combinations refer to the quantity from the two rugs consiere uring each time sample of 5 ays. After several tests, we obtain that OP is able to fin near-optimal solutions with trajectory lengths of 200 ays, i.e. it can fin combinations that ai healing when ulating treatments that woul last 200 ays. Taing this trajectory length, we consier an vary the number of repetitions K to verify our previous results. Fig. 11 shows the return obtaine when applying the algorithms online, in ulations. Fig. 11. HIV infection treatment: evaluating for variable K Fig. 10. Returns obtaine for K=16 an state spaces U = { 1.5,0,1.5} V (left) an U = { 3,0,3} V (right). Figures 9 an 10 present the results for three ifferent values of u from the action space U: Fig. 9 consiers u = 2 V, while Fig. 10 shows results for u = 1.5 V an u = 3 V. Looing at these results, taing ifferent control inputs, eeps its performance compare to OP. However, its improvement compare to OP varies base on which MP is consiere, fact that confirms that the value of K is subject to fine-tuning accoring to the given control problem HIV infection treatment Prevalent HIV treatment strategies involve two types of rugs. These rugs have negative sie effects in the long term, which motivates research into optimal Looing at Fig. 11, one may observe that increasing the value of K oes not necessarily result in better performance. Thus, this problem confirms too that the value of K has to be tune. However, one can see again that for certain values, is able to provie better near-optimal solutions than OP. 5. CONCLUSIONS In this paper the novel algorithm calle was introuce, that aresses the complexity concerns of the OP algorithm for a subclass of control problems by evaluating longer ranges of constant action sequences in aition to the one-step actions. In the class of problems where the control action is preferre to be rarely change (e.g. switche systems lie barriers or mechanical switches) was expecte to provie closer solutions to the optimum than OP is able to. Our analysis showe that inee, has ilar performance compare to OP an it even outperforms

8 34 ACTA ELECTROTEHNICA the latter in certain problem types, though a general relation between the performance of the two algorithms coul not be rawn. The experiments confirme that for the aresse subclass of control problems, when choosing parameter K properly, outperforms OP in terms of near-optimality. In this paper a novel algorithm calle optimistic planning with K ientical actions () was introuce, that extens optimistic planning for eterministic systems (OP) by evaluating longer ranges of constant action sequences in aition to onestep actions, whenever an action sequence is further refine. In the class of problems where the control action shoul preferably be change rarely, such as switche systems or networe control systems, is expecte to provie closer solutions to the optimum than OP is able to. Our analysis showe that, has ilar performance guarantees compare to OP an it can outperform the latter in certain types of problems, although a general relation between the performance of the two algorithms coul not be rawn. Experiments confirme that, for an inverte penulum an an HIV infection problem, outperforms OP when choosing parameter K properly. An interesting irection for future wor is a more complete analytical characterization of the class of problems where is expecte to wor well, an an a priori guarantee relating buget to near-optimality, lie the one available for OP. ACKNOWLEGEMENTS This paper is supporte by the Sectoral Operational Programme Human Resources evelopment (SOP HR), I (POSRU/159/1.5/S/137516) finance by the European Social Fun an by the Romanian Government; an by a grant from the Romanian National Authority for Scientific Research, CNCS- UEFISCI, project number PNII-RU-TE REFERENCES 1. R. S. Sutton an A. G. Barto, Reinforcement Learning: An Introuction. MIT Press, P. Bertseas, ynamic Programming an Optimal Control, 4 th e. Athena Scientific, 2012, vol J.-F. Hren an R. Munos, Optimistic planning of eterministic systems, in Proceeings 8th European Worshop on Reinforcement Learning (EWRL-08), Villeneuve Ascq, France, 30 June 3 July 2008, pp R. Vinter, Optimal control. Springer Science Business Meia, H. van Eeren, R. Negenborn, P. van Overloop, an B. e Schutter, Time-instant optimization for hybri moel preictive control of the Rhine-Meuse elta. Journal of Hyroinformatics, vol. 15, no. 2, pp , J. C. Geromel an P. Colaneri, Stability an stabilization of iscrete time switche systems. International Journal of Control, vol. 79, no. 7, pp , R. Munos, The optimistic principle applie to games, optimization an planning: Towars founations of Monte- Carlo tree search, Founations an Trens in Machine Learning, vol. 7, no. 1, pp , S. Bubec an R. Munos, Open loop optimistic planning, in Proceeings 23r Annual Conference on Learning Theory, Haifa, Israel, June 2010, pp L. Busoniu, R. Munos et al., Optimistic planning for Marov ecision processes, in Proceeings of the 15th International Conference on Artificial Intelligence an Statistics, AISTATS- 12, vol. 22, 2012, pp L. Busoniu, A. aniels, R. Munos, an R. Babusa, Optimistic planning for continuous action eterministic systems, in 2013 IEEE International Symposium on Aaptive ynamic Programming an Reinforcement Learning (APRL-13), Singapore, April A. Weinstein an M. L. Littman, Banit-base planning an learning in continuous-action Marov ecision processes, in Proceeings of the 22 n International Conference on Automate Planning an Scheuling (ICAPS), K. Mathe, L. Busoniu, R. Munos, an B. e Schutter, Optimistic planning with a limite number of action switches for near-optimal nonlinear control, in ecision an Control (CC), 2014 IEEE 53 r Annual Conference on, ec 2014, pp M. Quincampoix an N. Seube, Stabilization of uncertain control systems through piecewise constant feebac, Journal of mathematical analysis an applications, vol. 218, no. 1, pp , L. Magni an R. Scattolini, Moel preictive control of continuous-time nonlinear systems with piecewise constant control, Automatic Control, IEEE Transactions on, vol. 49, no. 6, pp , , Tracing of non-square nonlinear continuous time systems with piecewise constant moel preictive control, Journal of Process Control, vol. 17, no. 8, pp , Y.-Y. Cao, L. Hu, an P. Fran, Moel preictive control via piecewise constant output feebac for multirate sample-ata systems, in ecision an Control, Proceeings of the 39th IEEE Conference on, vol. 1. IEEE, 2000, pp R. Fineisen an F. Allgower, Computational elay in nonlinear moel preictive control, in Proceeings International Symposium on the Avance Control of Chemical Processes, 2004, pp X. Yang an L. T. Biegler, Avance-multi-step nonlinear moel preictive control, Journal of Process Control, vol. 23, no. 8, pp , C. Liu, W.-H. Chen, an J. Anrews, Piecewise constant moel preictive control for autonomous helicopters, Robotics an Autonomous Systems, vol. 59, no. 7, pp , K. Mathe, L. Busoniu, an L. Miclea, Optimistic planning with long sequences of ientical actions for near-optimal nonlinear control, in International Conference on Automation, Quality an Testing, Robotics. IEEE, 2014, pp B. Aams, H. Bans, H.-. Kwon, an H. Tran, ynamic multirug therapies for HIV: Optimal an STI control approaches, Mathematical Biosciences an Engineering, vol. 1, no. 2, pp , Koppany Mathe epartment of Automation, Technical University of Cluj- Napoca, Memoranumului 28, Cluj-Napoca, Romania Koppany.Mathe@aut.utcluj.ro

Switching Time Optimization in Discretized Hybrid Dynamical Systems

Switching Time Optimization in Discretized Hybrid Dynamical Systems Switching Time Optimization in Discretize Hybri Dynamical Systems Kathrin Flaßkamp, To Murphey, an Sina Ober-Blöbaum Abstract Switching time optimization (STO) arises in systems that have a finite set

More information

A simple model for the small-strain behaviour of soils

A simple model for the small-strain behaviour of soils A simple moel for the small-strain behaviour of soils José Jorge Naer Department of Structural an Geotechnical ngineering, Polytechnic School, University of São Paulo 05508-900, São Paulo, Brazil, e-mail:

More information

Sliding mode approach to congestion control in connection-oriented communication networks

Sliding mode approach to congestion control in connection-oriented communication networks JOURNAL OF APPLIED COMPUTER SCIENCE Vol. xx. No xx (200x), pp. xx-xx Sliing moe approach to congestion control in connection-oriente communication networks Anrzej Bartoszewicz, Justyna Żuk Technical University

More information

Time-of-Arrival Estimation in Non-Line-Of-Sight Environments

Time-of-Arrival Estimation in Non-Line-Of-Sight Environments 2 Conference on Information Sciences an Systems, The Johns Hopkins University, March 2, 2 Time-of-Arrival Estimation in Non-Line-Of-Sight Environments Sinan Gezici, Hisashi Kobayashi an H. Vincent Poor

More information

On the Enumeration of Double-Base Chains with Applications to Elliptic Curve Cryptography

On the Enumeration of Double-Base Chains with Applications to Elliptic Curve Cryptography On the Enumeration of Double-Base Chains with Applications to Elliptic Curve Cryptography Christophe Doche Department of Computing Macquarie University, Australia christophe.oche@mq.eu.au. Abstract. The

More information

6 General properties of an autonomous system of two first order ODE

6 General properties of an autonomous system of two first order ODE 6 General properties of an autonomous system of two first orer ODE Here we embark on stuying the autonomous system of two first orer ifferential equations of the form ẋ 1 = f 1 (, x 2 ), ẋ 2 = f 2 (, x

More information

Predictive Control of a Laboratory Time Delay Process Experiment

Predictive Control of a Laboratory Time Delay Process Experiment Print ISSN:3 6; Online ISSN: 367-5357 DOI:0478/itc-03-0005 Preictive Control of a aboratory ime Delay Process Experiment S Enev Key Wors: Moel preictive control; time elay process; experimental results

More information

u t v t v t c a u t b a v t u t v t b a

u t v t v t c a u t b a v t u t v t b a Nonlinear Dynamical Systems In orer to iscuss nonlinear ynamical systems, we must first consier linear ynamical systems. Linear ynamical systems are just systems of linear equations like we have been stuying

More information

How to Minimize Maximum Regret in Repeated Decision-Making

How to Minimize Maximum Regret in Repeated Decision-Making How to Minimize Maximum Regret in Repeate Decision-Making Karl H. Schlag July 3 2003 Economics Department, European University Institute, Via ella Piazzuola 43, 033 Florence, Italy, Tel: 0039-0-4689, email:

More information

A NONLINEAR SOURCE SEPARATION APPROACH FOR THE NICOLSKY-EISENMAN MODEL

A NONLINEAR SOURCE SEPARATION APPROACH FOR THE NICOLSKY-EISENMAN MODEL 6th European Signal Processing Conference EUSIPCO 28, Lausanne, Switzerlan, August 25-29, 28, copyright by EURASIP A NONLINEAR SOURCE SEPARATION APPROACH FOR THE NICOLSKY-EISENMAN MODEL Leonaro Tomazeli

More information

BEYOND THE CONSTRUCTION OF OPTIMAL SWITCHING SURFACES FOR AUTONOMOUS HYBRID SYSTEMS. Mauro Boccadoro Magnus Egerstedt Paolo Valigi Yorai Wardi

BEYOND THE CONSTRUCTION OF OPTIMAL SWITCHING SURFACES FOR AUTONOMOUS HYBRID SYSTEMS. Mauro Boccadoro Magnus Egerstedt Paolo Valigi Yorai Wardi BEYOND THE CONSTRUCTION OF OPTIMAL SWITCHING SURFACES FOR AUTONOMOUS HYBRID SYSTEMS Mauro Boccaoro Magnus Egerstet Paolo Valigi Yorai Wari {boccaoro,valigi}@iei.unipg.it Dipartimento i Ingegneria Elettronica

More information

JUST THE MATHS UNIT NUMBER DIFFERENTIATION 2 (Rates of change) A.J.Hobson

JUST THE MATHS UNIT NUMBER DIFFERENTIATION 2 (Rates of change) A.J.Hobson JUST THE MATHS UNIT NUMBER 10.2 DIFFERENTIATION 2 (Rates of change) by A.J.Hobson 10.2.1 Introuction 10.2.2 Average rates of change 10.2.3 Instantaneous rates of change 10.2.4 Derivatives 10.2.5 Exercises

More information

05 The Continuum Limit and the Wave Equation

05 The Continuum Limit and the Wave Equation Utah State University DigitalCommons@USU Founations of Wave Phenomena Physics, Department of 1-1-2004 05 The Continuum Limit an the Wave Equation Charles G. Torre Department of Physics, Utah State University,

More information

On Characterizing the Delay-Performance of Wireless Scheduling Algorithms

On Characterizing the Delay-Performance of Wireless Scheduling Algorithms On Characterizing the Delay-Performance of Wireless Scheuling Algorithms Xiaojun Lin Center for Wireless Systems an Applications School of Electrical an Computer Engineering, Purue University West Lafayette,

More information

Lagrangian and Hamiltonian Mechanics

Lagrangian and Hamiltonian Mechanics Lagrangian an Hamiltonian Mechanics.G. Simpson, Ph.. epartment of Physical Sciences an Engineering Prince George s Community College ecember 5, 007 Introuction In this course we have been stuying classical

More information

'HVLJQ &RQVLGHUDWLRQ LQ 0DWHULDO 6HOHFWLRQ 'HVLJQ 6HQVLWLYLW\,1752'8&7,21

'HVLJQ &RQVLGHUDWLRQ LQ 0DWHULDO 6HOHFWLRQ 'HVLJQ 6HQVLWLYLW\,1752'8&7,21 Large amping in a structural material may be either esirable or unesirable, epening on the engineering application at han. For example, amping is a esirable property to the esigner concerne with limiting

More information

Cascaded redundancy reduction

Cascaded redundancy reduction Network: Comput. Neural Syst. 9 (1998) 73 84. Printe in the UK PII: S0954-898X(98)88342-5 Cascae reunancy reuction Virginia R e Sa an Geoffrey E Hinton Department of Computer Science, University of Toronto,

More information

Multi-robot Formation Control Using Reinforcement Learning Method

Multi-robot Formation Control Using Reinforcement Learning Method Multi-robot Formation Control Using Reinforcement Learning Metho Guoyu Zuo, Jiatong Han, an Guansheng Han School of Electronic Information & Control Engineering, Beijing University of Technology, Beijing

More information

Closed and Open Loop Optimal Control of Buffer and Energy of a Wireless Device

Closed and Open Loop Optimal Control of Buffer and Energy of a Wireless Device Close an Open Loop Optimal Control of Buffer an Energy of a Wireless Device V. S. Borkar School of Technology an Computer Science TIFR, umbai, Inia. borkar@tifr.res.in A. A. Kherani B. J. Prabhu INRIA

More information

SYNCHRONOUS SEQUENTIAL CIRCUITS

SYNCHRONOUS SEQUENTIAL CIRCUITS CHAPTER SYNCHRONOUS SEUENTIAL CIRCUITS Registers an counters, two very common synchronous sequential circuits, are introuce in this chapter. Register is a igital circuit for storing information. Contents

More information

θ x = f ( x,t) could be written as

θ x = f ( x,t) could be written as 9. Higher orer PDEs as systems of first-orer PDEs. Hyperbolic systems. For PDEs, as for ODEs, we may reuce the orer by efining new epenent variables. For example, in the case of the wave equation, (1)

More information

Lecture 2 Lagrangian formulation of classical mechanics Mechanics

Lecture 2 Lagrangian formulation of classical mechanics Mechanics Lecture Lagrangian formulation of classical mechanics 70.00 Mechanics Principle of stationary action MATH-GA To specify a motion uniquely in classical mechanics, it suffices to give, at some time t 0,

More information

Optimization of Geometries by Energy Minimization

Optimization of Geometries by Energy Minimization Optimization of Geometries by Energy Minimization by Tracy P. Hamilton Department of Chemistry University of Alabama at Birmingham Birmingham, AL 3594-140 hamilton@uab.eu Copyright Tracy P. Hamilton, 1997.

More information

Survey Sampling. 1 Design-based Inference. Kosuke Imai Department of Politics, Princeton University. February 19, 2013

Survey Sampling. 1 Design-based Inference. Kosuke Imai Department of Politics, Princeton University. February 19, 2013 Survey Sampling Kosuke Imai Department of Politics, Princeton University February 19, 2013 Survey sampling is one of the most commonly use ata collection methos for social scientists. We begin by escribing

More information

Computing Exact Confidence Coefficients of Simultaneous Confidence Intervals for Multinomial Proportions and their Functions

Computing Exact Confidence Coefficients of Simultaneous Confidence Intervals for Multinomial Proportions and their Functions Working Paper 2013:5 Department of Statistics Computing Exact Confience Coefficients of Simultaneous Confience Intervals for Multinomial Proportions an their Functions Shaobo Jin Working Paper 2013:5

More information

The canonical controllers and regular interconnection

The canonical controllers and regular interconnection Systems & Control Letters ( www.elsevier.com/locate/sysconle The canonical controllers an regular interconnection A.A. Julius a,, J.C. Willems b, M.N. Belur c, H.L. Trentelman a Department of Applie Mathematics,

More information

The Exact Form and General Integrating Factors

The Exact Form and General Integrating Factors 7 The Exact Form an General Integrating Factors In the previous chapters, we ve seen how separable an linear ifferential equations can be solve using methos for converting them to forms that can be easily

More information

THE VAN KAMPEN EXPANSION FOR LINKED DUFFING LINEAR OSCILLATORS EXCITED BY COLORED NOISE

THE VAN KAMPEN EXPANSION FOR LINKED DUFFING LINEAR OSCILLATORS EXCITED BY COLORED NOISE Journal of Soun an Vibration (1996) 191(3), 397 414 THE VAN KAMPEN EXPANSION FOR LINKED DUFFING LINEAR OSCILLATORS EXCITED BY COLORED NOISE E. M. WEINSTEIN Galaxy Scientific Corporation, 2500 English Creek

More information

Quantum Search on the Spatial Grid

Quantum Search on the Spatial Grid Quantum Search on the Spatial Gri Matthew D. Falk MIT 2012, 550 Memorial Drive, Cambrige, MA 02139 (Date: December 11, 2012) This paper explores Quantum Search on the two imensional spatial gri. Recent

More information

A Path Planning Method Using Cubic Spiral with Curvature Constraint

A Path Planning Method Using Cubic Spiral with Curvature Constraint A Path Planning Metho Using Cubic Spiral with Curvature Constraint Tzu-Chen Liang an Jing-Sin Liu Institute of Information Science 0, Acaemia Sinica, Nankang, Taipei 5, Taiwan, R.O.C., Email: hartree@iis.sinica.eu.tw

More information

Lower Bounds for the Smoothed Number of Pareto optimal Solutions

Lower Bounds for the Smoothed Number of Pareto optimal Solutions Lower Bouns for the Smoothe Number of Pareto optimal Solutions Tobias Brunsch an Heiko Röglin Department of Computer Science, University of Bonn, Germany brunsch@cs.uni-bonn.e, heiko@roeglin.org Abstract.

More information

Exponential Tracking Control of Nonlinear Systems with Actuator Nonlinearity

Exponential Tracking Control of Nonlinear Systems with Actuator Nonlinearity Preprints of the 9th Worl Congress The International Feeration of Automatic Control Cape Town, South Africa. August -9, Exponential Tracking Control of Nonlinear Systems with Actuator Nonlinearity Zhengqiang

More information

Chapter 6: Energy-Momentum Tensors

Chapter 6: Energy-Momentum Tensors 49 Chapter 6: Energy-Momentum Tensors This chapter outlines the general theory of energy an momentum conservation in terms of energy-momentum tensors, then applies these ieas to the case of Bohm's moel.

More information

Nonlinear Adaptive Ship Course Tracking Control Based on Backstepping and Nussbaum Gain

Nonlinear Adaptive Ship Course Tracking Control Based on Backstepping and Nussbaum Gain Nonlinear Aaptive Ship Course Tracking Control Base on Backstepping an Nussbaum Gain Jialu Du, Chen Guo Abstract A nonlinear aaptive controller combining aaptive Backstepping algorithm with Nussbaum gain

More information

Equilibrium in Queues Under Unknown Service Times and Service Value

Equilibrium in Queues Under Unknown Service Times and Service Value University of Pennsylvania ScholarlyCommons Finance Papers Wharton Faculty Research 1-2014 Equilibrium in Queues Uner Unknown Service Times an Service Value Laurens Debo Senthil K. Veeraraghavan University

More information

Table of Common Derivatives By David Abraham

Table of Common Derivatives By David Abraham Prouct an Quotient Rules: Table of Common Derivatives By Davi Abraham [ f ( g( ] = [ f ( ] g( + f ( [ g( ] f ( = g( [ f ( ] g( g( f ( [ g( ] Trigonometric Functions: sin( = cos( cos( = sin( tan( = sec

More information

Technion - Computer Science Department - M.Sc. Thesis MSC Constrained Codes for Two-Dimensional Channels.

Technion - Computer Science Department - M.Sc. Thesis MSC Constrained Codes for Two-Dimensional Channels. Technion - Computer Science Department - M.Sc. Thesis MSC-2006- - 2006 Constraine Coes for Two-Dimensional Channels Keren Censor Technion - Computer Science Department - M.Sc. Thesis MSC-2006- - 2006 Technion

More information

A Review of Multiple Try MCMC algorithms for Signal Processing

A Review of Multiple Try MCMC algorithms for Signal Processing A Review of Multiple Try MCMC algorithms for Signal Processing Luca Martino Image Processing Lab., Universitat e València (Spain) Universia Carlos III e Mari, Leganes (Spain) Abstract Many applications

More information

Optimal Cooperative Spectrum Sensing in Cognitive Sensor Networks

Optimal Cooperative Spectrum Sensing in Cognitive Sensor Networks Optimal Cooperative Spectrum Sensing in Cognitive Sensor Networks Hai Ngoc Pham, an Zhang, Paal E. Engelsta,,3, Tor Skeie,, Frank Eliassen, Department of Informatics, University of Oslo, Norway Simula

More information

Regular tree languages definable in FO and in FO mod

Regular tree languages definable in FO and in FO mod Regular tree languages efinable in FO an in FO mo Michael Beneikt Luc Segoufin Abstract We consier regular languages of labele trees. We give an effective characterization of the regular languages over

More information

Discrete Mathematics

Discrete Mathematics Discrete Mathematics 309 (009) 86 869 Contents lists available at ScienceDirect Discrete Mathematics journal homepage: wwwelseviercom/locate/isc Profile vectors in the lattice of subspaces Dániel Gerbner

More information

A simplified macroscopic urban traffic network model for model-based predictive control

A simplified macroscopic urban traffic network model for model-based predictive control Delft University of Technology Delft Center for Systems an Control Technical report 9-28 A simplifie macroscopic urban traffic network moel for moel-base preictive control S. Lin, B. De Schutter, Y. Xi,

More information

A Novel Decoupled Iterative Method for Deep-Submicron MOSFET RF Circuit Simulation

A Novel Decoupled Iterative Method for Deep-Submicron MOSFET RF Circuit Simulation A Novel ecouple Iterative Metho for eep-submicron MOSFET RF Circuit Simulation CHUAN-SHENG WANG an YIMING LI epartment of Mathematics, National Tsing Hua University, National Nano evice Laboratories, an

More information

Optimal Variable-Structure Control Tracking of Spacecraft Maneuvers

Optimal Variable-Structure Control Tracking of Spacecraft Maneuvers Optimal Variable-Structure Control racking of Spacecraft Maneuvers John L. Crassiis 1 Srinivas R. Vaali F. Lanis Markley 3 Introuction In recent years, much effort has been evote to the close-loop esign

More information

Linear First-Order Equations

Linear First-Order Equations 5 Linear First-Orer Equations Linear first-orer ifferential equations make up another important class of ifferential equations that commonly arise in applications an are relatively easy to solve (in theory)

More information

Polynomial Inclusion Functions

Polynomial Inclusion Functions Polynomial Inclusion Functions E. e Weert, E. van Kampen, Q. P. Chu, an J. A. Muler Delft University of Technology, Faculty of Aerospace Engineering, Control an Simulation Division E.eWeert@TUDelft.nl

More information

Approximate reduction of dynamic systems

Approximate reduction of dynamic systems Systems & Control Letters 57 2008 538 545 www.elsevier.com/locate/sysconle Approximate reuction of ynamic systems Paulo Tabuaa a,, Aaron D. Ames b, Agung Julius c, George J. Pappas c a Department of Electrical

More information

TIME-DELAY ESTIMATION USING FARROW-BASED FRACTIONAL-DELAY FIR FILTERS: FILTER APPROXIMATION VS. ESTIMATION ERRORS

TIME-DELAY ESTIMATION USING FARROW-BASED FRACTIONAL-DELAY FIR FILTERS: FILTER APPROXIMATION VS. ESTIMATION ERRORS TIME-DEAY ESTIMATION USING FARROW-BASED FRACTIONA-DEAY FIR FITERS: FITER APPROXIMATION VS. ESTIMATION ERRORS Mattias Olsson, Håkan Johansson, an Per öwenborg Div. of Electronic Systems, Dept. of Electrical

More information

Thermal conductivity of graded composites: Numerical simulations and an effective medium approximation

Thermal conductivity of graded composites: Numerical simulations and an effective medium approximation JOURNAL OF MATERIALS SCIENCE 34 (999)5497 5503 Thermal conuctivity of grae composites: Numerical simulations an an effective meium approximation P. M. HUI Department of Physics, The Chinese University

More information

Generalizing Kronecker Graphs in order to Model Searchable Networks

Generalizing Kronecker Graphs in order to Model Searchable Networks Generalizing Kronecker Graphs in orer to Moel Searchable Networks Elizabeth Boine, Babak Hassibi, Aam Wierman California Institute of Technology Pasaena, CA 925 Email: {eaboine, hassibi, aamw}@caltecheu

More information

Total Energy Shaping of a Class of Underactuated Port-Hamiltonian Systems using a New Set of Closed-Loop Potential Shape Variables*

Total Energy Shaping of a Class of Underactuated Port-Hamiltonian Systems using a New Set of Closed-Loop Potential Shape Variables* 51st IEEE Conference on Decision an Control December 1-13 212. Maui Hawaii USA Total Energy Shaping of a Class of Uneractuate Port-Hamiltonian Systems using a New Set of Close-Loop Potential Shape Variables*

More information

VIRTUAL STRUCTURE BASED SPACECRAFT FORMATION CONTROL WITH FORMATION FEEDBACK

VIRTUAL STRUCTURE BASED SPACECRAFT FORMATION CONTROL WITH FORMATION FEEDBACK AIAA Guiance, Navigation, an Control Conference an Exhibit 5-8 August, Monterey, California AIAA -9 VIRTUAL STRUCTURE BASED SPACECRAT ORMATION CONTROL WITH ORMATION EEDBACK Wei Ren Ranal W. Bear Department

More information

Quantum mechanical approaches to the virial

Quantum mechanical approaches to the virial Quantum mechanical approaches to the virial S.LeBohec Department of Physics an Astronomy, University of Utah, Salt Lae City, UT 84112, USA Date: June 30 th 2015 In this note, we approach the virial from

More information

Lower bounds on Locality Sensitive Hashing

Lower bounds on Locality Sensitive Hashing Lower bouns on Locality Sensitive Hashing Rajeev Motwani Assaf Naor Rina Panigrahy Abstract Given a metric space (X, X ), c 1, r > 0, an p, q [0, 1], a istribution over mappings H : X N is calle a (r,

More information

Learning Automata in Games with Memory with Application to Circuit-Switched Routing

Learning Automata in Games with Memory with Application to Circuit-Switched Routing Learning Automata in Games with Memory with Application to Circuit-Switche Routing Murat Alanyali Abstract A general setting is consiere in which autonomous users interact by means of a finite-state controlle

More information

Applications of First Order Equations

Applications of First Order Equations Applications of First Orer Equations Viscous Friction Consier a small mass that has been roppe into a thin vertical tube of viscous flui lie oil. The mass falls, ue to the force of gravity, but falls more

More information

An Iterative Incremental Learning Algorithm for Complex-Valued Hopfield Associative Memory

An Iterative Incremental Learning Algorithm for Complex-Valued Hopfield Associative Memory An Iterative Incremental Learning Algorithm for Complex-Value Hopfiel Associative Memory aoki Masuyama (B) an Chu Kiong Loo Faculty of Computer Science an Information Technology, University of Malaya,

More information

KNN Particle Filters for Dynamic Hybrid Bayesian Networks

KNN Particle Filters for Dynamic Hybrid Bayesian Networks KNN Particle Filters for Dynamic Hybri Bayesian Networs H. D. Chen an K. C. Chang Dept. of Systems Engineering an Operations Research George Mason University MS 4A6, 4400 University Dr. Fairfax, VA 22030

More information

Experimental Robustness Study of a Second-Order Sliding Mode Controller

Experimental Robustness Study of a Second-Order Sliding Mode Controller Experimental Robustness Stuy of a Secon-Orer Sliing Moe Controller Anré Blom, Bram e Jager Einhoven University of Technology Department of Mechanical Engineering P.O. Box 513, 5600 MB Einhoven, The Netherlans

More information

Systems & Control Letters

Systems & Control Letters Systems & ontrol Letters ( ) ontents lists available at ScienceDirect Systems & ontrol Letters journal homepage: www.elsevier.com/locate/sysconle A converse to the eterministic separation principle Jochen

More information

Non-deterministic Social Laws

Non-deterministic Social Laws Non-eterministic Social Laws Michael H. Coen MIT Artificial Intelligence Lab 55 Technology Square Cambrige, MA 09 mhcoen@ai.mit.eu Abstract The paper generalizes the notion of a social law, the founation

More information

Improved Rate-Based Pull and Push Strategies in Large Distributed Networks

Improved Rate-Based Pull and Push Strategies in Large Distributed Networks Improve Rate-Base Pull an Push Strategies in Large Distribute Networks Wouter Minnebo an Benny Van Hout Department of Mathematics an Computer Science University of Antwerp - imins Mielheimlaan, B-00 Antwerp,

More information

inflow outflow Part I. Regular tasks for MAE598/494 Task 1

inflow outflow Part I. Regular tasks for MAE598/494 Task 1 MAE 494/598, Fall 2016 Project #1 (Regular tasks = 20 points) Har copy of report is ue at the start of class on the ue ate. The rules on collaboration will be release separately. Please always follow the

More information

Role of parameters in the stochastic dynamics of a stick-slip oscillator

Role of parameters in the stochastic dynamics of a stick-slip oscillator Proceeing Series of the Brazilian Society of Applie an Computational Mathematics, v. 6, n. 1, 218. Trabalho apresentao no XXXVII CNMAC, S.J. os Campos - SP, 217. Proceeing Series of the Brazilian Society

More information

7.1 Support Vector Machine

7.1 Support Vector Machine 67577 Intro. to Machine Learning Fall semester, 006/7 Lecture 7: Support Vector Machines an Kernel Functions II Lecturer: Amnon Shashua Scribe: Amnon Shashua 7. Support Vector Machine We return now to

More information

Acute sets in Euclidean spaces

Acute sets in Euclidean spaces Acute sets in Eucliean spaces Viktor Harangi April, 011 Abstract A finite set H in R is calle an acute set if any angle etermine by three points of H is acute. We examine the maximal carinality α() of

More information

Proof of SPNs as Mixture of Trees

Proof of SPNs as Mixture of Trees A Proof of SPNs as Mixture of Trees Theorem 1. If T is an inuce SPN from a complete an ecomposable SPN S, then T is a tree that is complete an ecomposable. Proof. Argue by contraiction that T is not a

More information

Situation awareness of power system based on static voltage security region

Situation awareness of power system based on static voltage security region The 6th International Conference on Renewable Power Generation (RPG) 19 20 October 2017 Situation awareness of power system base on static voltage security region Fei Xiao, Zi-Qing Jiang, Qian Ai, Ran

More information

ensembles When working with density operators, we can use this connection to define a generalized Bloch vector: v x Tr x, v y Tr y

ensembles When working with density operators, we can use this connection to define a generalized Bloch vector: v x Tr x, v y Tr y Ph195a lecture notes, 1/3/01 Density operators for spin- 1 ensembles So far in our iscussion of spin- 1 systems, we have restricte our attention to the case of pure states an Hamiltonian evolution. Toay

More information

Time-Optimal Motion Control of Piezoelectric Actuator: STM Application

Time-Optimal Motion Control of Piezoelectric Actuator: STM Application Time-Optimal Motion Control of Piezoelectric Actuator: STM Application Yongai Xu, Peter H. Mecl Abstract This paper exaes the problem of time-optimal motion control in the context of Scanning Tunneling

More information

Minimum-time constrained velocity planning

Minimum-time constrained velocity planning 7th Meiterranean Conference on Control & Automation Makeonia Palace, Thessaloniki, Greece June 4-6, 9 Minimum-time constraine velocity planning Gabriele Lini, Luca Consolini, Aurelio Piazzi Università

More information

An Optimal Algorithm for Bandit and Zero-Order Convex Optimization with Two-Point Feedback

An Optimal Algorithm for Bandit and Zero-Order Convex Optimization with Two-Point Feedback Journal of Machine Learning Research 8 07) - Submitte /6; Publishe 5/7 An Optimal Algorithm for Banit an Zero-Orer Convex Optimization with wo-point Feeback Oha Shamir Department of Computer Science an

More information

F 1 x 1,,x n. F n x 1,,x n

F 1 x 1,,x n. F n x 1,,x n Chapter Four Dynamical Systems 4. Introuction The previous chapter has been evote to the analysis of systems of first orer linear ODE s. In this chapter we will consier systems of first orer ODE s that

More information

A study on ant colony systems with fuzzy pheromone dispersion

A study on ant colony systems with fuzzy pheromone dispersion A stuy on ant colony systems with fuzzy pheromone ispersion Louis Gacogne LIP6 104, Av. Kenney, 75016 Paris, France gacogne@lip6.fr Sanra Sanri IIIA/CSIC Campus UAB, 08193 Bellaterra, Spain sanri@iiia.csic.es

More information

A Note on Exact Solutions to Linear Differential Equations by the Matrix Exponential

A Note on Exact Solutions to Linear Differential Equations by the Matrix Exponential Avances in Applie Mathematics an Mechanics Av. Appl. Math. Mech. Vol. 1 No. 4 pp. 573-580 DOI: 10.4208/aamm.09-m0946 August 2009 A Note on Exact Solutions to Linear Differential Equations by the Matrix

More information

CONTROL CHARTS FOR VARIABLES

CONTROL CHARTS FOR VARIABLES UNIT CONTOL CHATS FO VAIABLES Structure.1 Introuction Objectives. Control Chart Technique.3 Control Charts for Variables.4 Control Chart for Mean(-Chart).5 ange Chart (-Chart).6 Stanar Deviation Chart

More information

Lecture Introduction. 2 Examples of Measure Concentration. 3 The Johnson-Lindenstrauss Lemma. CS-621 Theory Gems November 28, 2012

Lecture Introduction. 2 Examples of Measure Concentration. 3 The Johnson-Lindenstrauss Lemma. CS-621 Theory Gems November 28, 2012 CS-6 Theory Gems November 8, 0 Lecture Lecturer: Alesaner Mąry Scribes: Alhussein Fawzi, Dorina Thanou Introuction Toay, we will briefly iscuss an important technique in probability theory measure concentration

More information

Dissipative numerical methods for the Hunter-Saxton equation

Dissipative numerical methods for the Hunter-Saxton equation Dissipative numerical methos for the Hunter-Saton equation Yan Xu an Chi-Wang Shu Abstract In this paper, we present further evelopment of the local iscontinuous Galerkin (LDG) metho esigne in [] an a

More information

TRACKING CONTROL OF MULTIPLE MOBILE ROBOTS: A CASE STUDY OF INTER-ROBOT COLLISION-FREE PROBLEM

TRACKING CONTROL OF MULTIPLE MOBILE ROBOTS: A CASE STUDY OF INTER-ROBOT COLLISION-FREE PROBLEM 265 Asian Journal of Control, Vol. 4, No. 3, pp. 265-273, September 22 TRACKING CONTROL OF MULTIPLE MOBILE ROBOTS: A CASE STUDY OF INTER-ROBOT COLLISION-FREE PROBLEM Jurachart Jongusuk an Tsutomu Mita

More information

Perturbation Analysis and Optimization of Stochastic Flow Networks

Perturbation Analysis and Optimization of Stochastic Flow Networks IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. XX, NO. Y, MMM 2004 1 Perturbation Analysis an Optimization of Stochastic Flow Networks Gang Sun, Christos G. Cassanras, Yorai Wari, Christos G. Panayiotou,

More information

APPROXIMATE SOLUTION FOR TRANSIENT HEAT TRANSFER IN STATIC TURBULENT HE II. B. Baudouy. CEA/Saclay, DSM/DAPNIA/STCM Gif-sur-Yvette Cedex, France

APPROXIMATE SOLUTION FOR TRANSIENT HEAT TRANSFER IN STATIC TURBULENT HE II. B. Baudouy. CEA/Saclay, DSM/DAPNIA/STCM Gif-sur-Yvette Cedex, France APPROXIMAE SOLUION FOR RANSIEN HEA RANSFER IN SAIC URBULEN HE II B. Bauouy CEA/Saclay, DSM/DAPNIA/SCM 91191 Gif-sur-Yvette Ceex, France ABSRAC Analytical solution in one imension of the heat iffusion equation

More information

Discounted near-optimal control of general continuous-action nonlinear systems using optimistic planning

Discounted near-optimal control of general continuous-action nonlinear systems using optimistic planning Discounted near-optimal control of general continuous-action nonlinear systems using optimistic planning Lucian Buşoniu, Előd Páll, Rémi Munos Abstract We propose an optimistic planning method to search

More information

Math 1271 Solutions for Fall 2005 Final Exam

Math 1271 Solutions for Fall 2005 Final Exam Math 7 Solutions for Fall 5 Final Eam ) Since the equation + y = e y cannot be rearrange algebraically in orer to write y as an eplicit function of, we must instea ifferentiate this relation implicitly

More information

Laplacian Cooperative Attitude Control of Multiple Rigid Bodies

Laplacian Cooperative Attitude Control of Multiple Rigid Bodies Laplacian Cooperative Attitue Control of Multiple Rigi Boies Dimos V. Dimarogonas, Panagiotis Tsiotras an Kostas J. Kyriakopoulos Abstract Motivate by the fact that linear controllers can stabilize the

More information

Least-Squares Regression on Sparse Spaces

Least-Squares Regression on Sparse Spaces Least-Squares Regression on Sparse Spaces Yuri Grinberg, Mahi Milani Far, Joelle Pineau School of Computer Science McGill University Montreal, Canaa {ygrinb,mmilan1,jpineau}@cs.mcgill.ca 1 Introuction

More information

ELEC3114 Control Systems 1

ELEC3114 Control Systems 1 ELEC34 Control Systems Linear Systems - Moelling - Some Issues Session 2, 2007 Introuction Linear systems may be represente in a number of ifferent ways. Figure shows the relationship between various representations.

More information

Math 342 Partial Differential Equations «Viktor Grigoryan

Math 342 Partial Differential Equations «Viktor Grigoryan Math 342 Partial Differential Equations «Viktor Grigoryan 6 Wave equation: solution In this lecture we will solve the wave equation on the entire real line x R. This correspons to a string of infinite

More information

arxiv:hep-th/ v1 3 Feb 1993

arxiv:hep-th/ v1 3 Feb 1993 NBI-HE-9-89 PAR LPTHE 9-49 FTUAM 9-44 November 99 Matrix moel calculations beyon the spherical limit arxiv:hep-th/93004v 3 Feb 993 J. Ambjørn The Niels Bohr Institute Blegamsvej 7, DK-00 Copenhagen Ø,

More information

PREPARATION OF THE NATIONAL MAGNETIC FIELD STANDARD IN CROATIA

PREPARATION OF THE NATIONAL MAGNETIC FIELD STANDARD IN CROATIA n IMEKO TC 11 International Symposium METROLOGICAL INFRASTRUCTURE June 15-17, 11, Cavtat, Dubrovni Riviera, Croatia PREPARATION OF THE NATIONAL MAGNETIC FIELD STANDARD IN CROATIA A. Pavić 1, L.Ferović,

More information

Topic Modeling: Beyond Bag-of-Words

Topic Modeling: Beyond Bag-of-Words Hanna M. Wallach Cavenish Laboratory, University of Cambrige, Cambrige CB3 0HE, UK hmw26@cam.ac.u Abstract Some moels of textual corpora employ text generation methos involving n-gram statistics, while

More information

LATTICE-BASED D-OPTIMUM DESIGN FOR FOURIER REGRESSION

LATTICE-BASED D-OPTIMUM DESIGN FOR FOURIER REGRESSION The Annals of Statistics 1997, Vol. 25, No. 6, 2313 2327 LATTICE-BASED D-OPTIMUM DESIGN FOR FOURIER REGRESSION By Eva Riccomagno, 1 Rainer Schwabe 2 an Henry P. Wynn 1 University of Warwick, Technische

More information

Approximate Reduction of Dynamical Systems

Approximate Reduction of Dynamical Systems Proceeings of the 4th IEEE Conference on Decision & Control Manchester Gran Hyatt Hotel San Diego, CA, USA, December 3-, 6 FrIP.7 Approximate Reuction of Dynamical Systems Paulo Tabuaa, Aaron D. Ames,

More information

Nested Saturation with Guaranteed Real Poles 1

Nested Saturation with Guaranteed Real Poles 1 Neste Saturation with Guarantee Real Poles Eric N Johnson 2 an Suresh K Kannan 3 School of Aerospace Engineering Georgia Institute of Technology, Atlanta, GA 3332 Abstract The global stabilization of asymptotically

More information

Constraint Reformulation and a Lagrangian Relaxation based Solution Algorithm for a Least Expected Time Path Problem Abstract 1.

Constraint Reformulation and a Lagrangian Relaxation based Solution Algorithm for a Least Expected Time Path Problem Abstract 1. Constraint Reformulation an a Lagrangian Relaation base Solution Algorithm for a Least Epecte Time Path Problem Liing Yang State Key Laboratory of Rail Traffic Control an Safety, Being Jiaotong University,

More information

ECE 422 Power System Operations & Planning 7 Transient Stability

ECE 422 Power System Operations & Planning 7 Transient Stability ECE 4 Power System Operations & Planning 7 Transient Stability Spring 5 Instructor: Kai Sun References Saaat s Chapter.5 ~. EPRI Tutorial s Chapter 7 Kunur s Chapter 3 Transient Stability The ability of

More information

Diophantine Approximations: Examining the Farey Process and its Method on Producing Best Approximations

Diophantine Approximations: Examining the Farey Process and its Method on Producing Best Approximations Diophantine Approximations: Examining the Farey Process an its Metho on Proucing Best Approximations Kelly Bowen Introuction When a person hears the phrase irrational number, one oes not think of anything

More information

Balancing Expected and Worst-Case Utility in Contracting Models with Asymmetric Information and Pooling

Balancing Expected and Worst-Case Utility in Contracting Models with Asymmetric Information and Pooling Balancing Expecte an Worst-Case Utility in Contracting Moels with Asymmetric Information an Pooling R.B.O. erkkamp & W. van en Heuvel & A.P.M. Wagelmans Econometric Institute Report EI2018-01 9th January

More information

Optimized Schwarz Methods with the Yin-Yang Grid for Shallow Water Equations

Optimized Schwarz Methods with the Yin-Yang Grid for Shallow Water Equations Optimize Schwarz Methos with the Yin-Yang Gri for Shallow Water Equations Abessama Qaouri Recherche en prévision numérique, Atmospheric Science an Technology Directorate, Environment Canaa, Dorval, Québec,

More information

Math 1B, lecture 8: Integration by parts

Math 1B, lecture 8: Integration by parts Math B, lecture 8: Integration by parts Nathan Pflueger 23 September 2 Introuction Integration by parts, similarly to integration by substitution, reverses a well-known technique of ifferentiation an explores

More information

Calculus in the AP Physics C Course The Derivative

Calculus in the AP Physics C Course The Derivative Limits an Derivatives Calculus in the AP Physics C Course The Derivative In physics, the ieas of the rate change of a quantity (along with the slope of a tangent line) an the area uner a curve are essential.

More information