CHANNEL SELECTION WITH RAYLEIGH FADING: A MULTI-ARMED BANDIT FRAMEWORK. Wassim Jouini and Christophe Moy

Size: px
Start display at page:

Download "CHANNEL SELECTION WITH RAYLEIGH FADING: A MULTI-ARMED BANDIT FRAMEWORK. Wassim Jouini and Christophe Moy"

Transcription

1 CHANNEL SELECTION WITH RAYLEIGH FADING: A MULTI-ARMED BANDIT FRAMEWORK Wassim Joini and Christophe Moy SUPELEC, IETR, SCEE, Avene de la Bolaie, CS 47601, 5576 Cesson Sévigné, France. INSERM U96 - IFR140- FacltédeMédecine Université de Rennes1-50 Rennes France. ABSTRACT Channel Selection in fading environments with no prior information on the channels qality is a challenging isse. In the case of Rayleigh channels the measred Signal-To-Noise Ratio follows exponential distribtions. Ths, we sggest in this paper a simple algorithm that deals with resorce selection when the measred samples are drawn from exponential distribtions. This strategy, referred to as Mltiplicative Upper Confidence Bond Algorithm MUCB, associates a tility index to every available arm, and then selects the arm with the highest index. For every arm, the associated index is eqal to the prodct of a mltiplicative factor by the sample mean of the rewards collected by this arm. We show that MUCB policies are order optimal. Moreover simlations illstrate and validate the stated theoretical reslts. 1. INTRODUCTION Several seqential decision maing problems face a dilemma between the exploration of a space of choices, or soltions, and the exploitation of the information available to the decision maer. The problem described herein is nown as seqential decision maing nder ncertainty. In this paper we focs on a sb-class of this problem, where the decision maer has a discrete set of stateless choices and the added information is a real valed seqence of feedbacs, or rewards that qantifies how well the decision maer behaved in the previos time steps. This particlar instance of seqential decision maing problems is generally nown as the mltiarmed bandit MAB problem [1, 2]. A common approach to solving the exploration verss exploitation dilemma within MAB problems consists in assigning an tility vale to every arm. An arm s tility aggregates all the past information abot the lever and qantifies the gambler s interest in plling it. Sch tilities are called indexes. Agrawal et al. [2] emphasized the family of indexes minimizing the expected cmlated loss and called them Upper Confidence Bond UCB indexes. UCB indexes provide an optimistic estimation of the arms performances while ensring a The athors wold lie to than Damien Ernst, Raphael Fontenea and Emmanel Rachelson for their many helpfl comments and answers regarding this wor. rapidly decreasing probability of selecting a sboptimal arm. The decision maer bilds its policy by greedily selecting the largest index. Recently, Aer et al. [] proved that a simple additive form, of the rewards sample mean and a bias, nown as UCB 1 can achieve order optimality over time when dealing with rewards drawn from bonded distribtions. Tacling exponentially distribted rewards, as it sally occrs when measring Signal-to-Noise Ratios SNR in Fading environments remains a challenge. Asymptotically, optimal, or order optimal, algorithms exists [1, 2]. These algorithms are nown to be complex to compte. Recently, an algorithm for locally sb-gassian distribtions has been sggested 1. Unfortnately, the algorithm needs prior nowledge on the distribtions parameters. Sch nowledge, on the fading channels, is not available prior to the learning process. This paper is inspired from the aforementioned wor [1, 2] and is motivated by the problematic of channel selection when the channels as sbject to Rayleigh fading. However, we sggest the analysis of a mltiplicative rather than an additive expression for the index. The main contribtion of this paper is to design and analyze a simple, deterministic, mltiplicative index-based policy. The decision maing strategy comptes an index associated to every available arm, and then selects the arm with the highest index. Every index associated to an arm is eqal to the prodct of the sample mean of the reward collected by this arm and a scaling factor. The scaling factor is chosen so as to provide an optimistic estimation of the considered arm s performance. We show that or decision policy has both a low comptational complexity and can lead to a logarithmic loss over time nder some non-restrictive conditions. For the rest of this paper we will refer to or sggested policy as Mltiplicative Upper Confidence Bond index MUCB. The otline of this paper is the following: We start by presenting some general notions on the mlti-armed bandit framewor with exponentially distribted rewards in Section 2. Then, Section introdces or index policy and Section 4 analyzes its behavior, proving the order optimality of the sggested algorithm. Finally, Section 6 concldes. 1 Fond online on arxiv:

2 2. MULTI-ARMED BANDITS A K-armed bandit K N is a machine learning problem based on an analogy with the traditional slot machine onearmed bandit bt with more than one lever. Sch a problem is defined by the K-tple θ 1,θ 2,..., θ K Θ K, Θ being the set of all positive reward distribtions. When plled at a time t N, each lever 2 1,K where 1,K = 1,..., K} provides a reward r t drawn from a distribtion θ associated to that specific lever. The objective of the gambler is to maximize the cmlated sm of rewards throgh iterative plls. It is generally assmed that the gambler has no or partial initial nowledge abot the levers. The crcial tradeoff the gambler faces at each trial is between exploitation of the lever that has the highest expected payoff and exploration to get more information abot the expected payoffs of the other levers. In this paper, we assme that the different exponentially distribted payoffs drawn from a machine are independent and identically distribted i.i.d. and that the independence of the rewards holds between the machines. However the different machines reward distribtions θ 1,θ 2,..., θ K are not spposed to be the same. Let I t 1,K denote the machine selected at a time t, and let H t be the history vector available to the gambler at instant t, i.e., H t =[I 0,r 0,I 1,r 1,...,I t1,r t1 ] We assme that the gambler ses a policy π to select arm I t at instant t, sch that I t = πh t. We shall also write Δ 1,K, μ = 1 Δ λ =E[θ ], where λ refers to the parameter of the considered exponential distribtion with pdf f θ x =λ e λx,x 0, and we assme that μ > 0, 1,K. The cmlated regret of a policy π at time t after t plls is defined as follows: R t = tμ t1 μ = m=0 r m, where max μ } refers to the expected reward of the opti- 1,K mal arm. We see to find a policy that minimizes the expected cmlated regret Eqation 1, E [R t ]= = Δ E [T,t ], 1 where Δ = μ μ is the expected loss of playing arm, and T,t refers to the nmber of times the machine has been played from instant 0 to instant t 1.. MULTIPLICATIVE UPPER CONFIDENCE BOUND ALGORITHMS This section presents or main contribtion, the introdction of a new mltiplicative index. Let B,t T,t denote the index of arm at time t after being plled T,t. We refer to as Mltiplicative Upper Confidence Bond algorithms MUCB the family of indexes that can be written in the form: B,t T,t =X,t T,t M,t T,t, 2 We se indifferently the words lever, arm, or machine. where X,t T,t is the sample mean of machine at step t after T,t plls, i.e., X,t T,t = 1 t1 T,t i=0 1 I i=}r i and M,t is an pper confidence scaling factor chosen to insre that the index B,t T,t is an increasing fnction of the nmber of ronds t. This last property insres that the index of an arm that has not been plled for a long time will increase, ths eventally leading to the sampling of this arm. We introdce a particlar parametric class of MUCB indexes, which we call MUCBα, given as follows : α 0, M,t T,t = 1 } 2 max 0; 1 α lnt T,t We adopt the convention that 1 0 =+. Given a history H t, one can compte the vales of T,t and M,t and derive an index-based policy π as follows: I t = πh t arg max B,t T,t }. 1,K 4. ANALYSIS OF MUCBα POLICIES This section analyses the theoretical properties of MUCBα algorithms. More specifically, it focses on determining how fast is the optimal arm identified and what are the probabilities of anomalies, that is sb-optimal plls Consistency and order optimality of MUCB indexes Definition 1 β-consistency Consider the set Θ K of K-armed bandit problems. A policy π is said to be β-consistent, 0 <β 1, with respect to Θ K, if and only if: θ 1,...,θ K Θ K E[R t ], lim t t β =0 4 We expect good policies to be at least 1-consistent. Asa matter of fact, 1-consistency ensres that, asymptotically, the average expected reward is optimal. From the expression of Eqation 1 one can remar that its is sfficient to pper bond the expected nmber of times E[T,t ] one plays a sboptimal machine after t ronds, to obtain an pper bond on the expected cmlated regret. This leads to the main reslt of this paper in the form of the following theorem. Theorem 1 Order optimality of MUCBα policies Let ρ = μ /μ, 1,K }. For all K 2, if policy MUCBα > 4 isrn onk machines having rewards drawn from exponential distribtions θ 1,..., θ K then: E [R t ] :Δ >0 4μ α 1 ρ lnt+o lnt 5 This form offers a compact mathematical formla. However practically speaing, a machine is played when T,t α lnt. Otherwise the machine with largest finite index is played.

3 Proving Theorem 1 relies on three lemmas that we analyze and prove in the next sbsection Learning Anomalies and Consistency of MUCB policies Let s introdce the set S = N R; then, one can write S,t =T,t,B,t S the decision state of arm at time t. We associate the prodct order to the set S: for a pair of states S =T,B S and S =T,B S, we write S S if and only if T T and B B. Definition 2 Anomaly of type 1 We assme that there exists at least one sboptimal machine, i.e., 1,K } =. We call anomaly of type 1, denoted by φ 1 } π,t,fora sboptimal machine 1,K }, and with parameter N, the following event: φ 1 } π,t = S,t,μ }. Definition Anomaly of type 2 We refer to as anomaly of type 2, denoted by φ 2 } π t, associated to the optimal machine, the following event: φ 2 } π t = S,t <,μ T,t 1}. Lemma 1 Expected cmlated regret. Proof in 8.2 Given a policy π and a MAB problem, let =[ 1,..., K ] represent a set of integers, then the expected cmlated regret is pper bonded by: E[R t ] Δ + Δ P t = = with, P t = t m= +1 P φ 2 } π m+p φ 1 } π,m We consider the following vales for the set, for all sboptimal arms, t = lnt. 4α 1ρ 2 We show in the two following lemmas that for the defined set the anomalies are pper bonded by exponentially decreasing fnctions of the nmber of iterations. Lemma 2 Upper bond of Anomaly 1. Proof in 8. For all K 2, if policy MUCBα is rn on K machines having rewards drawn from exponential distribtions θ 1,..., θ K then 1,K }: We end this paper by the proof of Theorem 1 [Proof of Theorem 1] For α>4, relying on Lemmas 1, 2 and we can write: E[R t ] 4α 1 ρ 2 lnt + olnt = Δ with, = Δ P t = olnt. Finally, since Δ = μ 1 ρ and t = 4α lnt+olnt, we find the 1ρ 2 stated reslt in Theorem SIMULATION RESULTS For illstration prpose we consider a SU willing to evalate the qality of K = 10 channels. The SU relies on the measre of the channels SNR to evalate the best channel. We assme that the SU sffers Rayleigh fading. Conseqently, for every channel, the measred SNR follows an exponential distribtion. The presented simlation consider the following parameters μ = μ 1,,μ 10 } for the channels, where μ 1 μ 10 withot loss of generality: and μ = 0.1; 0.2; 0.; 0.4; 0.5; 0.6; 0.7; 0.8; 0.9; 1}. The simlations compare three MUCB policies for α eqal to respectively, 1; 2; 4.01}. Theses algorithms are referred to as MUCB1, MUCB2 and MUCB4 respectively. Notice that MUCB4 is chosen so as to respect the condition imposed in Theorem 1, i.e., α>4. MUCB1 and MUCB2 on the contrary are considered as possibly risy by Theorem 1. The simlations consider a time horizon of 10 6 iterations. Figre 1 plots the cmlated averaged regret of MUCB policies. In order to obtain relevant reslts, the crves were averaged over 100 experiments. All crves show a similar behavior: first an exploration phase were the regret grows qicly. Then the crves tend to confirm that the regret of MUCB policies grow as a logarithmic fnction of the nmber of iterations. As matter of fact, we notice that after the first exploration phase, on a logarithmic scale, the regret grows as a linear fnction. Moreover, since MUCB1 and MUCB2 seem to respect this trend, these crves sggest that the imposed condition in Theorem 1, α>4, might be improvable. P φ 1 } π,t t α/ CONCLUSION Lemma Upper bond of Anomaly 2. Proof in 8.4 For all K 2, if policy MUCBα is rn on K machines having rewards drawn from exponential distribtions θ 1,..., θ K then: P φ 2 } π t t α/2+1 7 A new low complexity algorithm for MAB problems is sggested and analyzed in this paper: MUCB. The analysis of its regret proves that the algorithm is order optimal over time. In order to qantify it performance compared to optimal algorithms, frther empirical evalations are needed and are crrently nder investigation.

4 Regret Averaged Regret over 100 experiments MUCB1 MUCB2 MUCB4 This reslt was initially proposed and proved in [4]. The bonds provided by this lemma are called Large Deviations Ineqalities LDIs in this paper. In the case of exponential distribtions this theorem can be applied and LDI fnctions have the following expressions: 2 l 1 β =l 2 β = β β 1 β E[X] 1 ln E[X] E[X] β E[X] Iterations Fig. 1. Average Regret Over 100 experiments: Illstration of Theorem REFERENCES [1] T.L. Lai and H. Robbins. Asymptotically efficient adaptive allocation rles. Advances in Applied Mathematics, 6:4 22, [2] R. Agrawal. Sample mean based index policies with Ologn regret for the mlti-armed bandit problem. Advances in Applied Probability, 27: , [] P. Aer, N. Cesa-Bianchi, and P. Fischer. Finite time analysis of mlti-armed bandit problems. Machine learning, 472/:25 256, [4] H. Chernoff. A measre of asymptotic efficiency fo tests of a hypothesis based on the sm of observations. The Annals of Mathematical Statistics, pages , APPENDIX 8.1. Large deviations ineqalities Assmption 1 Cramer condition Let X be a real random variable. X satisfies the Cramer condition if and only if γ >0: η 0,γ, E [ e ηx] <. Lemma 4 Cramer-Chernoff Lemma for the sample mean Let X 1,...,X n n N be a seqence of i.i.d. real random variables satisfying the Cramer condition with expected vale E[X]. We denote by X n the sample mean X n = 1 n n i=1 X i. Then, there exist two fnctions l 1 and l 2 sch that: β 1 > E[X], PX n β 1 e l1β1n, β 2 < E[X], PX n β 2 e l2β2n. Fnctions l 1 and l 2 do not depend on the sample size n and are continos non-negative, strictly increasing respectively strictly-decreasing for all β 1 > EX respectively β 2 < EX, both nll for β 1 = β 2 = EX Proof of Lemma 1 According to Eqation 1: E[Rt π ]= Δ E [T,t ]. Per definition T,t = t1 = 1 Im=. Then, E[T,t ]= t1 E [1 Im=]. m=0 m=0 After playing an arm times, bonding the first terms by 1 yields: E[T,t ] + m= +1 P I m = } T,m > } 8 Then we can notice that the following events are eqivalent: } I m = } = B,m > max B,m = Moreover we can notice that: } B,m > max B,m B,m >B,m } = Which can be frther inclded in the following nion of events: B,m >B,m } B,m >μ } μ >B,m } Conseqently we can write: I m = } T,m > } Φ 1 } π,m Φ 2} π m 9 Finally, we apply the probability operator: E[T,t ] + m= +1 PΦ 1 } π,m +PΦ 2} π m 10 The combination of Eqation 1 - given at the beginning of this proof - and Eqation 10 concldes this proof. 8.. Proof of Lemma 2 From the definition of φ 1 } π,t we can write that: P φ 1 } π,t = P S,t,μ, S,t S = P B,t μ.

5 In the case of MUCB policies, we have: t, P B,t μ =P X,t μ M,t Conseqently, we can pper bond the probability of occrrence of type 1 anomalies by: P φ 1 } π,t P X,t μ 11 M =,t μ M,t T,t Let s define β,t T,t = Since we are dealing with exponential distribtions, the rewards provided by the arm satisfy the Cramer condition. As a matter of fact, since α lnt then: 1ρ 2 β,t λ = ρ 1 1 α lnt 1 So, according to the large deviation ineqality for X,t T,t given by Lemma 4 with T,t and large enogh, there exists a continos, non-decreasing, non-negative fnction l 1, sch that: P X,t T,t β,t T,t T,t = e l 1,β,t Finally: P φ 1 } π,t t1 = e l 1,β,t The end of this proof aims at proving that for: : l 1, β,t α lnt 2. Note that since we are dealing with exponential distribtions we can write: l 1, β,t 1β,tλ β,t λ. Moreover since α lnt then: 1ρ 2 β,t λ = ρ 1 1 α lnt ρ 1 Conseqently it is sfficient to prove that: 1 β,t λ ρ 1 α lnt 2 Let s define ht as a fnction of time: ht = α lnt [0, 1]. We analyze the sign of the fnction: gt = ρ 1 ht ρ ρ 1 ht 2 12 Conseqently we need to prove that for, g has positive vales. Factorizing last eqation leads to the following two terms: ρ ρ ρ ρ ht ρ 1 1 ht ρ Since per definition: ht [0, 1] and ρ 1 ht ρ 1 ρ ρ 1 then, 1 0. Conseqently, g is positive only if the second term of Eqation 1 is negative, i.e., α lnt ρ 1 1. Since,the ρ ρ 1 last ineqation is verified. Finally pper bonding Eqation 11 for : P φ 1 } π,t e α ln/2 1 1 α/2 t α/21 = = 8.4. Proof of Lemma This proof follows the same steps as the the proof in Sbsection 8.. From the definition of φ 1 } π,t we can write that: P φ 2 } π t t1 =1 P B,t μ In the case of MUCB policies, we have: t, P B,t μ μ =P X,t M,t Conseqently, we can pper bond the probability of occrrence of type 2 anomalies by: P φ 2 } π t P =1 X,t μ max 0; 1 } α lnt T,t } Since μ max 0; 1 α lnt T,t μ Cramer s condition is verified. Moreover since the machine is played when the maximal of the previos term is eqal to 0, we can consider that α lnt and that: μ max 0; 1 } α lnt = μ 1 T,t α lnt T,t Conseqently, we can pper-bond the occrrence of Anomaly 2: P φ 2 } π t =α lnt e l2β,t 14 Where, l 2 β,t verifies the LDI as defined in Appendix 8.1. Ths, after mild simplifications we can write, l 2 β,t 2 α lnt 1+21 α lnt α lnt 2 Conseqently, inclding this last ineqality into Eqation 14 ends the proof.

Sources of Non Stationarity in the Semivariogram

Sources of Non Stationarity in the Semivariogram Sorces of Non Stationarity in the Semivariogram Migel A. Cba and Oy Leangthong Traditional ncertainty characterization techniqes sch as Simple Kriging or Seqential Gassian Simlation rely on stationary

More information

The Linear Quadratic Regulator

The Linear Quadratic Regulator 10 The Linear Qadratic Reglator 10.1 Problem formlation This chapter concerns optimal control of dynamical systems. Most of this development concerns linear models with a particlarly simple notion of optimality.

More information

Stability of Model Predictive Control using Markov Chain Monte Carlo Optimisation

Stability of Model Predictive Control using Markov Chain Monte Carlo Optimisation Stability of Model Predictive Control sing Markov Chain Monte Carlo Optimisation Elilini Siva, Pal Golart, Jan Maciejowski and Nikolas Kantas Abstract We apply stochastic Lyapnov theory to perform stability

More information

CHARACTERIZATIONS OF EXPONENTIAL DISTRIBUTION VIA CONDITIONAL EXPECTATIONS OF RECORD VALUES. George P. Yanev

CHARACTERIZATIONS OF EXPONENTIAL DISTRIBUTION VIA CONDITIONAL EXPECTATIONS OF RECORD VALUES. George P. Yanev Pliska Std. Math. Blgar. 2 (211), 233 242 STUDIA MATHEMATICA BULGARICA CHARACTERIZATIONS OF EXPONENTIAL DISTRIBUTION VIA CONDITIONAL EXPECTATIONS OF RECORD VALUES George P. Yanev We prove that the exponential

More information

Characterizations of probability distributions via bivariate regression of record values

Characterizations of probability distributions via bivariate regression of record values Metrika (2008) 68:51 64 DOI 10.1007/s00184-007-0142-7 Characterizations of probability distribtions via bivariate regression of record vales George P. Yanev M. Ahsanllah M. I. Beg Received: 4 October 2006

More information

Multi-armed bandit based policies for cognitive radio s decision making issues

Multi-armed bandit based policies for cognitive radio s decision making issues Multi-armed bandit based policies for cognitive radio s decision making issues Wassim Jouini SUPELEC/IETR wassim.jouini@supelec.fr Damien Ernst University of Liège dernst@ulg.ac.be Christophe Moy SUPELEC/IETR

More information

Nonlinear parametric optimization using cylindrical algebraic decomposition

Nonlinear parametric optimization using cylindrical algebraic decomposition Proceedings of the 44th IEEE Conference on Decision and Control, and the Eropean Control Conference 2005 Seville, Spain, December 12-15, 2005 TC08.5 Nonlinear parametric optimization sing cylindrical algebraic

More information

Admissibility under the LINEX loss function in non-regular case. Hidekazu Tanaka. Received November 5, 2009; revised September 2, 2010

Admissibility under the LINEX loss function in non-regular case. Hidekazu Tanaka. Received November 5, 2009; revised September 2, 2010 Scientiae Mathematicae Japonicae Online, e-2012, 427 434 427 Admissibility nder the LINEX loss fnction in non-reglar case Hidekaz Tanaka Received November 5, 2009; revised September 2, 2010 Abstract. In

More information

Decision Oriented Bayesian Design of Experiments

Decision Oriented Bayesian Design of Experiments Decision Oriented Bayesian Design of Experiments Farminder S. Anand*, Jay H. Lee**, Matthew J. Realff*** *School of Chemical & Biomoleclar Engineering Georgia Institte of echnology, Atlanta, GA 3332 USA

More information

Essentials of optimal control theory in ECON 4140

Essentials of optimal control theory in ECON 4140 Essentials of optimal control theory in ECON 4140 Things yo need to know (and a detail yo need not care abot). A few words abot dynamic optimization in general. Dynamic optimization can be thoght of as

More information

Optimal Control of a Heterogeneous Two Server System with Consideration for Power and Performance

Optimal Control of a Heterogeneous Two Server System with Consideration for Power and Performance Optimal Control of a Heterogeneos Two Server System with Consideration for Power and Performance by Jiazheng Li A thesis presented to the University of Waterloo in flfilment of the thesis reqirement for

More information

1 Undiscounted Problem (Deterministic)

1 Undiscounted Problem (Deterministic) Lectre 9: Linear Qadratic Control Problems 1 Undisconted Problem (Deterministic) Choose ( t ) 0 to Minimize (x trx t + tq t ) t=0 sbject to x t+1 = Ax t + B t, x 0 given. x t is an n-vector state, t a

More information

Discussion of The Forward Search: Theory and Data Analysis by Anthony C. Atkinson, Marco Riani, and Andrea Ceroli

Discussion of The Forward Search: Theory and Data Analysis by Anthony C. Atkinson, Marco Riani, and Andrea Ceroli 1 Introdction Discssion of The Forward Search: Theory and Data Analysis by Anthony C. Atkinson, Marco Riani, and Andrea Ceroli Søren Johansen Department of Economics, University of Copenhagen and CREATES,

More information

Lecture Notes On THEORY OF COMPUTATION MODULE - 2 UNIT - 2

Lecture Notes On THEORY OF COMPUTATION MODULE - 2 UNIT - 2 BIJU PATNAIK UNIVERSITY OF TECHNOLOGY, ODISHA Lectre Notes On THEORY OF COMPUTATION MODULE - 2 UNIT - 2 Prepared by, Dr. Sbhend Kmar Rath, BPUT, Odisha. Tring Machine- Miscellany UNIT 2 TURING MACHINE

More information

ON THE SHAPES OF BILATERAL GAMMA DENSITIES

ON THE SHAPES OF BILATERAL GAMMA DENSITIES ON THE SHAPES OF BILATERAL GAMMA DENSITIES UWE KÜCHLER, STEFAN TAPPE Abstract. We investigate the for parameter family of bilateral Gamma distribtions. The goal of this paper is to provide a thorogh treatment

More information

LOS Component-Based Equal Gain Combining for Ricean Links in Uplink Massive MIMO

LOS Component-Based Equal Gain Combining for Ricean Links in Uplink Massive MIMO 016 Sixth International Conference on Instrmentation & Measrement Compter Commnication and Control LOS Component-Based Eqal Gain Combining for Ricean Lins in plin Massive MIMO Dian-W Ye College of Information

More information

Optimal search: a practical interpretation of information-driven sensor management

Optimal search: a practical interpretation of information-driven sensor management Optimal search: a practical interpretation of information-driven sensor management Fotios Katsilieris, Yvo Boers and Hans Driessen Thales Nederland B.V. Hengelo, the Netherlands Email: {Fotios.Katsilieris,

More information

QUANTILE ESTIMATION IN SUCCESSIVE SAMPLING

QUANTILE ESTIMATION IN SUCCESSIVE SAMPLING Jornal of the Korean Statistical Society 2007, 36: 4, pp 543 556 QUANTILE ESTIMATION IN SUCCESSIVE SAMPLING Hosila P. Singh 1, Ritesh Tailor 2, Sarjinder Singh 3 and Jong-Min Kim 4 Abstract In sccessive

More information

Universal Scheme for Optimal Search and Stop

Universal Scheme for Optimal Search and Stop Universal Scheme for Optimal Search and Stop Sirin Nitinawarat Qalcomm Technologies, Inc. 5775 Morehose Drive San Diego, CA 92121, USA Email: sirin.nitinawarat@gmail.com Vengopal V. Veeravalli Coordinated

More information

Convergence analysis of ant colony learning

Convergence analysis of ant colony learning Delft University of Technology Delft Center for Systems and Control Technical report 11-012 Convergence analysis of ant colony learning J van Ast R Babška and B De Schtter If yo want to cite this report

More information

Discussion Papers Department of Economics University of Copenhagen

Discussion Papers Department of Economics University of Copenhagen Discssion Papers Department of Economics University of Copenhagen No. 10-06 Discssion of The Forward Search: Theory and Data Analysis, by Anthony C. Atkinson, Marco Riani, and Andrea Ceroli Søren Johansen,

More information

Technical Note. ODiSI-B Sensor Strain Gage Factor Uncertainty

Technical Note. ODiSI-B Sensor Strain Gage Factor Uncertainty Technical Note EN-FY160 Revision November 30, 016 ODiSI-B Sensor Strain Gage Factor Uncertainty Abstract Lna has pdated or strain sensor calibration tool to spport NIST-traceable measrements, to compte

More information

Typed Kleene Algebra with Products and Iteration Theories

Typed Kleene Algebra with Products and Iteration Theories Typed Kleene Algebra with Prodcts and Iteration Theories Dexter Kozen and Konstantinos Mamoras Compter Science Department Cornell University Ithaca, NY 14853-7501, USA {kozen,mamoras}@cs.cornell.ed Abstract

More information

Subcritical bifurcation to innitely many rotating waves. Arnd Scheel. Freie Universitat Berlin. Arnimallee Berlin, Germany

Subcritical bifurcation to innitely many rotating waves. Arnd Scheel. Freie Universitat Berlin. Arnimallee Berlin, Germany Sbcritical bifrcation to innitely many rotating waves Arnd Scheel Institt fr Mathematik I Freie Universitat Berlin Arnimallee 2-6 14195 Berlin, Germany 1 Abstract We consider the eqation 00 + 1 r 0 k2

More information

FOUNTAIN codes [3], [4] provide an efficient solution

FOUNTAIN codes [3], [4] provide an efficient solution Inactivation Decoding of LT and Raptor Codes: Analysis and Code Design Francisco Lázaro, Stdent Member, IEEE, Gianligi Liva, Senior Member, IEEE, Gerhard Bach, Fellow, IEEE arxiv:176.5814v1 [cs.it 19 Jn

More information

VIBRATION MEASUREMENT UNCERTAINTY AND RELIABILITY DIAGNOSTICS RESULTS IN ROTATING SYSTEMS

VIBRATION MEASUREMENT UNCERTAINTY AND RELIABILITY DIAGNOSTICS RESULTS IN ROTATING SYSTEMS VIBRATIO MEASUREMET UCERTAITY AD RELIABILITY DIAGOSTICS RESULTS I ROTATIG SYSTEMS. Introdction M. Eidkevicite, V. Volkovas anas University of Technology, Lithania The rotating machinery technical state

More information

Imprecise Continuous-Time Markov Chains

Imprecise Continuous-Time Markov Chains Imprecise Continos-Time Markov Chains Thomas Krak *, Jasper De Bock, and Arno Siebes t.e.krak@.nl, a.p.j.m.siebes@.nl Utrecht University, Department of Information and Compting Sciences, Princetonplein

More information

Section 7.4: Integration of Rational Functions by Partial Fractions

Section 7.4: Integration of Rational Functions by Partial Fractions Section 7.4: Integration of Rational Fnctions by Partial Fractions This is abot as complicated as it gets. The Method of Partial Fractions Ecept for a few very special cases, crrently we have no way to

More information

The Replenishment Policy for an Inventory System with a Fixed Ordering Cost and a Proportional Penalty Cost under Poisson Arrival Demands

The Replenishment Policy for an Inventory System with a Fixed Ordering Cost and a Proportional Penalty Cost under Poisson Arrival Demands Scientiae Mathematicae Japonicae Online, e-211, 161 167 161 The Replenishment Policy for an Inventory System with a Fixed Ordering Cost and a Proportional Penalty Cost nder Poisson Arrival Demands Hitoshi

More information

Elements of Coordinate System Transformations

Elements of Coordinate System Transformations B Elements of Coordinate System Transformations Coordinate system transformation is a powerfl tool for solving many geometrical and kinematic problems that pertain to the design of gear ctting tools and

More information

A Characterization of the Domain of Beta-Divergence and Its Connection to Bregman Variational Model

A Characterization of the Domain of Beta-Divergence and Its Connection to Bregman Variational Model entropy Article A Characterization of the Domain of Beta-Divergence and Its Connection to Bregman Variational Model Hyenkyn Woo School of Liberal Arts, Korea University of Technology and Edcation, Cheonan

More information

EOQ Problem Well-Posedness: an Alternative Approach Establishing Sufficient Conditions

EOQ Problem Well-Posedness: an Alternative Approach Establishing Sufficient Conditions pplied Mathematical Sciences, Vol. 4, 2, no. 25, 23-29 EOQ Problem Well-Posedness: an lternative pproach Establishing Sfficient Conditions Giovanni Mingari Scarpello via Negroli, 6, 226 Milano Italy Daniele

More information

Analytical Value-at-Risk and Expected Shortfall under Regime Switching *

Analytical Value-at-Risk and Expected Shortfall under Regime Switching * Working Paper 9-4 Departamento de Economía Economic Series 14) Universidad Carlos III de Madrid March 9 Calle Madrid, 16 893 Getafe Spain) Fax 34) 91649875 Analytical Vale-at-Risk and Expected Shortfall

More information

L 1 -smoothing for the Ornstein-Uhlenbeck semigroup

L 1 -smoothing for the Ornstein-Uhlenbeck semigroup L -smoothing for the Ornstein-Uhlenbeck semigrop K. Ball, F. Barthe, W. Bednorz, K. Oleszkiewicz and P. Wolff September, 00 Abstract Given a probability density, we estimate the rate of decay of the measre

More information

A New Approach to Direct Sequential Simulation that Accounts for the Proportional Effect: Direct Lognormal Simulation

A New Approach to Direct Sequential Simulation that Accounts for the Proportional Effect: Direct Lognormal Simulation A ew Approach to Direct eqential imlation that Acconts for the Proportional ffect: Direct ognormal imlation John Manchk, Oy eangthong and Clayton Detsch Department of Civil & nvironmental ngineering University

More information

UNCERTAINTY FOCUSED STRENGTH ANALYSIS MODEL

UNCERTAINTY FOCUSED STRENGTH ANALYSIS MODEL 8th International DAAAM Baltic Conference "INDUSTRIAL ENGINEERING - 19-1 April 01, Tallinn, Estonia UNCERTAINTY FOCUSED STRENGTH ANALYSIS MODEL Põdra, P. & Laaneots, R. Abstract: Strength analysis is a

More information

Sensitivity Analysis in Bayesian Networks: From Single to Multiple Parameters

Sensitivity Analysis in Bayesian Networks: From Single to Multiple Parameters Sensitivity Analysis in Bayesian Networks: From Single to Mltiple Parameters Hei Chan and Adnan Darwiche Compter Science Department University of California, Los Angeles Los Angeles, CA 90095 {hei,darwiche}@cs.cla.ed

More information

Model Discrimination of Polynomial Systems via Stochastic Inputs

Model Discrimination of Polynomial Systems via Stochastic Inputs Model Discrimination of Polynomial Systems via Stochastic Inpts D. Georgiev and E. Klavins Abstract Systems biologists are often faced with competing models for a given experimental system. Unfortnately,

More information

A Theory of Markovian Time Inconsistent Stochastic Control in Discrete Time

A Theory of Markovian Time Inconsistent Stochastic Control in Discrete Time A Theory of Markovian Time Inconsistent Stochastic Control in Discrete Time Tomas Björk Department of Finance, Stockholm School of Economics tomas.bjork@hhs.se Agatha Mrgoci Department of Economics Aarhs

More information

The Dual of the Maximum Likelihood Method

The Dual of the Maximum Likelihood Method Department of Agricltral and Resorce Economics University of California, Davis The Dal of the Maximm Likelihood Method by Qirino Paris Working Paper No. 12-002 2012 Copyright @ 2012 by Qirino Paris All

More information

Advanced Machine Learning

Advanced Machine Learning Advanced Machine Learning Bandit Problems MEHRYAR MOHRI MOHRI@ COURANT INSTITUTE & GOOGLE RESEARCH. Multi-Armed Bandit Problem Problem: which arm of a K-slot machine should a gambler pull to maximize his

More information

Step-Size Bounds Analysis of the Generalized Multidelay Adaptive Filter

Step-Size Bounds Analysis of the Generalized Multidelay Adaptive Filter WCE 007 Jly - 4 007 London UK Step-Size onds Analysis of the Generalized Mltidelay Adaptive Filter Jnghsi Lee and Hs Chang Hang Abstract In this paper we analyze the bonds of the fixed common step-size

More information

Chapter 2 Difficulties associated with corners

Chapter 2 Difficulties associated with corners Chapter Difficlties associated with corners This chapter is aimed at resolving the problems revealed in Chapter, which are cased b corners and/or discontinos bondar conditions. The first section introdces

More information

B-469 Simplified Copositive and Lagrangian Relaxations for Linearly Constrained Quadratic Optimization Problems in Continuous and Binary Variables

B-469 Simplified Copositive and Lagrangian Relaxations for Linearly Constrained Quadratic Optimization Problems in Continuous and Binary Variables B-469 Simplified Copositive and Lagrangian Relaxations for Linearly Constrained Qadratic Optimization Problems in Continos and Binary Variables Naohiko Arima, Snyong Kim and Masakaz Kojima October 2012,

More information

On relative errors of floating-point operations: optimal bounds and applications

On relative errors of floating-point operations: optimal bounds and applications On relative errors of floating-point operations: optimal bonds and applications Clade-Pierre Jeannerod, Siegfried M. Rmp To cite this version: Clade-Pierre Jeannerod, Siegfried M. Rmp. On relative errors

More information

On relative errors of floating-point operations: optimal bounds and applications

On relative errors of floating-point operations: optimal bounds and applications On relative errors of floating-point operations: optimal bonds and applications Clade-Pierre Jeannerod, Siegfried M. Rmp To cite this version: Clade-Pierre Jeannerod, Siegfried M. Rmp. On relative errors

More information

Prediction of Transmission Distortion for Wireless Video Communication: Analysis

Prediction of Transmission Distortion for Wireless Video Communication: Analysis Prediction of Transmission Distortion for Wireless Video Commnication: Analysis Zhifeng Chen and Dapeng W Department of Electrical and Compter Engineering, University of Florida, Gainesville, Florida 326

More information

Applying Fuzzy Set Approach into Achieving Quality Improvement for Qualitative Quality Response

Applying Fuzzy Set Approach into Achieving Quality Improvement for Qualitative Quality Response Proceedings of the 007 WSES International Conference on Compter Engineering and pplications, Gold Coast, stralia, Janary 17-19, 007 5 pplying Fzzy Set pproach into chieving Qality Improvement for Qalitative

More information

4.2 First-Order Logic

4.2 First-Order Logic 64 First-Order Logic and Type Theory The problem can be seen in the two qestionable rles In the existential introdction, the term a has not yet been introdced into the derivation and its se can therefore

More information

Move Blocking Strategies in Receding Horizon Control

Move Blocking Strategies in Receding Horizon Control Move Blocking Strategies in Receding Horizon Control Raphael Cagienard, Pascal Grieder, Eric C. Kerrigan and Manfred Morari Abstract In order to deal with the comptational brden of optimal control, it

More information

Department of Industrial Engineering Statistical Quality Control presented by Dr. Eng. Abed Schokry

Department of Industrial Engineering Statistical Quality Control presented by Dr. Eng. Abed Schokry Department of Indstrial Engineering Statistical Qality Control presented by Dr. Eng. Abed Schokry Department of Indstrial Engineering Statistical Qality Control C and U Chart presented by Dr. Eng. Abed

More information

Introdction Finite elds play an increasingly important role in modern digital commnication systems. Typical areas of applications are cryptographic sc

Introdction Finite elds play an increasingly important role in modern digital commnication systems. Typical areas of applications are cryptographic sc A New Architectre for a Parallel Finite Field Mltiplier with Low Complexity Based on Composite Fields Christof Paar y IEEE Transactions on Compters, Jly 996, vol 45, no 7, pp 856-86 Abstract In this paper

More information

Bandit models: a tutorial

Bandit models: a tutorial Gdt COS, December 3rd, 2015 Multi-Armed Bandit model: general setting K arms: for a {1,..., K}, (X a,t ) t N is a stochastic process. (unknown distributions) Bandit game: a each round t, an agent chooses

More information

arxiv: v1 [math.pr] 14 Jul 2016

arxiv: v1 [math.pr] 14 Jul 2016 PHASE TRANSITIONS IN EDGE-WEIGHTED EXPONENTIAL RANDOM GRAPHS MEI YIN Jly 15, 2016 arxiv:1607.04084v1 [math.pr] 14 Jl 2016 Abstract. The exponential family of random graphs represents an important and challenging

More information

4 Exact laminar boundary layer solutions

4 Exact laminar boundary layer solutions 4 Eact laminar bondary layer soltions 4.1 Bondary layer on a flat plate (Blasis 1908 In Sec. 3, we derived the bondary layer eqations for 2D incompressible flow of constant viscosity past a weakly crved

More information

Study of the diffusion operator by the SPH method

Study of the diffusion operator by the SPH method IOSR Jornal of Mechanical and Civil Engineering (IOSR-JMCE) e-issn: 2278-684,p-ISSN: 2320-334X, Volme, Isse 5 Ver. I (Sep- Oct. 204), PP 96-0 Stdy of the diffsion operator by the SPH method Abdelabbar.Nait

More information

RELIABILITY ASPECTS OF PROPORTIONAL MEAN RESIDUAL LIFE MODEL USING QUANTILE FUNC- TIONS

RELIABILITY ASPECTS OF PROPORTIONAL MEAN RESIDUAL LIFE MODEL USING QUANTILE FUNC- TIONS RELIABILITY ASPECTS OF PROPORTIONAL MEAN RESIDUAL LIFE MODEL USING QUANTILE FUNC- TIONS Athors: N.UNNIKRISHNAN NAIR Department of Statistics, Cochin University of Science Technology, Cochin, Kerala, INDIA

More information

A Regulator for Continuous Sedimentation in Ideal Clarifier-Thickener Units

A Regulator for Continuous Sedimentation in Ideal Clarifier-Thickener Units A Reglator for Continos Sedimentation in Ideal Clarifier-Thickener Units STEFAN DIEHL Centre for Mathematical Sciences, Lnd University, P.O. Box, SE- Lnd, Sweden e-mail: diehl@maths.lth.se) Abstract. The

More information

An Investigation into Estimating Type B Degrees of Freedom

An Investigation into Estimating Type B Degrees of Freedom An Investigation into Estimating Type B Degrees of H. Castrp President, Integrated Sciences Grop Jne, 00 Backgrond The degrees of freedom associated with an ncertainty estimate qantifies the amont of information

More information

arxiv: v1 [cs.sy] 22 Nov 2018

arxiv: v1 [cs.sy] 22 Nov 2018 AAS 18-253 ROBUST MYOPIC CONTROL FOR SYSTEMS WITH IMPERFECT OBSERVATIONS Dantong Ge, Melkior Ornik, and Ufk Topc arxiv:1811.09000v1 [cs.sy] 22 Nov 2018 INTRODUCTION Control of systems operating in nexplored

More information

Performance analysis of the MAP equalizer within an iterative receiver including a channel estimator

Performance analysis of the MAP equalizer within an iterative receiver including a channel estimator Performance analysis of the MAP eqalizer within an iterative receiver inclding a channel estimator Nora Sellami ISECS Rote Menzel Chaer m 0.5 B.P 868, 308 Sfax, Tnisia Aline Romy IRISA-INRIA Camps de Bealie

More information

Stochastic Contextual Bandits with Known. Reward Functions

Stochastic Contextual Bandits with Known. Reward Functions Stochastic Contextual Bandits with nown 1 Reward Functions Pranav Sakulkar and Bhaskar rishnamachari Ming Hsieh Department of Electrical Engineering Viterbi School of Engineering University of Southern

More information

We automate the bivariate change-of-variables technique for bivariate continuous random variables with

We automate the bivariate change-of-variables technique for bivariate continuous random variables with INFORMS Jornal on Compting Vol. 4, No., Winter 0, pp. 9 ISSN 09-9856 (print) ISSN 56-558 (online) http://dx.doi.org/0.87/ijoc.046 0 INFORMS Atomating Biariate Transformations Jeff X. Yang, John H. Drew,

More information

Wireless Networks Without Edges : Dynamic Radio Resource Clustering and User Scheduling

Wireless Networks Without Edges : Dynamic Radio Resource Clustering and User Scheduling Wireless Networks Withot Edges : Dynamic Radio Resorce Clstering and User Schedling Yhan D and Gstavo de Veciana Department of Electrical and Compter Engineering, The University of Texas at Astin Email:

More information

MAXIMUM AND ANTI-MAXIMUM PRINCIPLES FOR THE P-LAPLACIAN WITH A NONLINEAR BOUNDARY CONDITION. 1. Introduction. ν = λ u p 2 u.

MAXIMUM AND ANTI-MAXIMUM PRINCIPLES FOR THE P-LAPLACIAN WITH A NONLINEAR BOUNDARY CONDITION. 1. Introduction. ν = λ u p 2 u. 2005-Ojda International Conference on Nonlinear Analysis. Electronic Jornal of Differential Eqations, Conference 14, 2006, pp. 95 107. ISSN: 1072-6691. URL: http://ejde.math.txstate.ed or http://ejde.math.nt.ed

More information

The Lehmer matrix and its recursive analogue

The Lehmer matrix and its recursive analogue The Lehmer matrix and its recrsive analoge Emrah Kilic, Pantelimon Stănică TOBB Economics and Technology University, Mathematics Department 0660 Sogtoz, Ankara, Trkey; ekilic@etedtr Naval Postgradate School,

More information

Network Coding for Multiple Unicasts: An Approach based on Linear Optimization

Network Coding for Multiple Unicasts: An Approach based on Linear Optimization Network Coding for Mltiple Unicasts: An Approach based on Linear Optimization Danail Traskov, Niranjan Ratnakar, Desmond S. Ln, Ralf Koetter, and Mriel Médard Abstract In this paper we consider the application

More information

Classify by number of ports and examine the possible structures that result. Using only one-port elements, no more than two elements can be assembled.

Classify by number of ports and examine the possible structures that result. Using only one-port elements, no more than two elements can be assembled. Jnction elements in network models. Classify by nmber of ports and examine the possible strctres that reslt. Using only one-port elements, no more than two elements can be assembled. Combining two two-ports

More information

ECON3120/4120 Mathematics 2, spring 2009

ECON3120/4120 Mathematics 2, spring 2009 University of Oslo Department of Economics Arne Strøm ECON3/4 Mathematics, spring 9 Problem soltions for Seminar 4, 6 Febrary 9 (For practical reasons some of the soltions may inclde problem parts that

More information

Queueing analysis of service deferrals for load management in power systems

Queueing analysis of service deferrals for load management in power systems Qeeing analysis of service deferrals for load management in power systems Andrés Ferragt and Fernando Paganini Universidad ORT Urgay Abstract With the advent of renewable sorces and Smart- Grid deployments,

More information

A Single Species in One Spatial Dimension

A Single Species in One Spatial Dimension Lectre 6 A Single Species in One Spatial Dimension Reading: Material similar to that in this section of the corse appears in Sections 1. and 13.5 of James D. Mrray (), Mathematical Biology I: An introction,

More information

BIOSTATISTICAL METHODS

BIOSTATISTICAL METHODS BIOSTATISTICAL METHOS FOR TRANSLATIONAL & CLINICAL RESEARCH ROC Crve: IAGNOSTIC MEICINE iagnostic tests have been presented as alwas having dichotomos otcomes. In some cases, the reslt of the test ma be

More information

Multi-armed bandit models: a tutorial

Multi-armed bandit models: a tutorial Multi-armed bandit models: a tutorial CERMICS seminar, March 30th, 2016 Multi-Armed Bandit model: general setting K arms: for a {1,..., K}, (X a,t ) t N is a stochastic process. (unknown distributions)

More information

Simulation investigation of the Z-source NPC inverter

Simulation investigation of the Z-source NPC inverter octoral school of energy- and geo-technology Janary 5 20, 2007. Kressaare, Estonia Simlation investigation of the Z-sorce NPC inverter Ryszard Strzelecki, Natalia Strzelecka Gdynia Maritime University,

More information

Remarks on strongly convex stochastic processes

Remarks on strongly convex stochastic processes Aeqat. Math. 86 (01), 91 98 c The Athor(s) 01. This article is pblished with open access at Springerlink.com 0001-9054/1/010091-8 pblished online November 7, 01 DOI 10.1007/s00010-01-016-9 Aeqationes Mathematicae

More information

Fixed points for discrete logarithms

Fixed points for discrete logarithms Fixed points for discrete logarithms Mariana Levin 1, Carl Pomerance 2, and and K. Sondararajan 3 1 Gradate Grop in Science and Mathematics Edcation University of California Berkeley, CA 94720, USA levin@berkeley.ed

More information

Chapter 4 Supervised learning:

Chapter 4 Supervised learning: Chapter 4 Spervised learning: Mltilayer Networks II Madaline Other Feedforward Networks Mltiple adalines of a sort as hidden nodes Weight change follows minimm distrbance principle Adaptive mlti-layer

More information

CRITERIA FOR TOEPLITZ OPERATORS ON THE SPHERE. Jingbo Xia

CRITERIA FOR TOEPLITZ OPERATORS ON THE SPHERE. Jingbo Xia CRITERIA FOR TOEPLITZ OPERATORS ON THE SPHERE Jingbo Xia Abstract. Let H 2 (S) be the Hardy space on the nit sphere S in C n. We show that a set of inner fnctions Λ is sfficient for the prpose of determining

More information

Bayes and Naïve Bayes Classifiers CS434

Bayes and Naïve Bayes Classifiers CS434 Bayes and Naïve Bayes Classifiers CS434 In this lectre 1. Review some basic probability concepts 2. Introdce a sefl probabilistic rle - Bayes rle 3. Introdce the learning algorithm based on Bayes rle (ths

More information

DYNAMICAL LOWER BOUNDS FOR 1D DIRAC OPERATORS. 1. Introduction We consider discrete, resp. continuous, Dirac operators

DYNAMICAL LOWER BOUNDS FOR 1D DIRAC OPERATORS. 1. Introduction We consider discrete, resp. continuous, Dirac operators DYNAMICAL LOWER BOUNDS FOR D DIRAC OPERATORS ROBERTO A. PRADO AND CÉSAR R. DE OLIVEIRA Abstract. Qantm dynamical lower bonds for continos and discrete one-dimensional Dirac operators are established in

More information

Chem 4501 Introduction to Thermodynamics, 3 Credits Kinetics, and Statistical Mechanics. Fall Semester Homework Problem Set Number 10 Solutions

Chem 4501 Introduction to Thermodynamics, 3 Credits Kinetics, and Statistical Mechanics. Fall Semester Homework Problem Set Number 10 Solutions Chem 4501 Introdction to Thermodynamics, 3 Credits Kinetics, and Statistical Mechanics Fall Semester 2017 Homework Problem Set Nmber 10 Soltions 1. McQarrie and Simon, 10-4. Paraphrase: Apply Eler s theorem

More information

Multivariable Ripple-Free Deadbeat Control

Multivariable Ripple-Free Deadbeat Control Scholarly Jornal of Mathematics and Compter Science Vol. (), pp. 9-9, October Available online at http:// www.scholarly-jornals.com/sjmcs ISS 76-8947 Scholarly-Jornals Fll Length Research aper Mltivariable

More information

Decoder Error Probability of MRD Codes

Decoder Error Probability of MRD Codes Decoder Error Probability of MRD Codes Maximilien Gadolea Department of Electrical and Compter Engineering Lehigh University Bethlehem, PA 18015 USA E-mail: magc@lehigh.ed Zhiyan Yan Department of Electrical

More information

A new integral transform on time scales and its applications

A new integral transform on time scales and its applications Agwa et al. Advances in Difference Eqations 202, 202:60 http://www.advancesindifferenceeqations.com/content/202//60 RESEARCH Open Access A new integral transform on time scales and its applications Hassan

More information

Research Article Permanence of a Discrete Predator-Prey Systems with Beddington-DeAngelis Functional Response and Feedback Controls

Research Article Permanence of a Discrete Predator-Prey Systems with Beddington-DeAngelis Functional Response and Feedback Controls Hindawi Pblishing Corporation Discrete Dynamics in Natre and Society Volme 2008 Article ID 149267 8 pages doi:101155/2008/149267 Research Article Permanence of a Discrete Predator-Prey Systems with Beddington-DeAngelis

More information

Collaborative Filtering with Low Regret

Collaborative Filtering with Low Regret Collaborative Filtering with Low Regret Gy Bresler IDSS/LIDS/EECS, MIT 3 Vassar Street Cambridge, Massachsetts gy@mit.ed Devavrat Shah IDSS/LIDS/EECS, MIT 3 Vassar Street Cambridge, Massachsetts devavrat@mit.ed

More information

Scheduling parallel jobs to minimize the makespan

Scheduling parallel jobs to minimize the makespan J Sched (006) 9:433 45 DOI 10.1007/s10951-006-8497-6 Schedling parallel jobs to minimize the makespan Berit Johannes C Springer Science + Bsiness Media, LLC 006 Abstract We consider the NP-hard problem

More information

Worst-case analysis of the LPT algorithm for single processor scheduling with time restrictions

Worst-case analysis of the LPT algorithm for single processor scheduling with time restrictions OR Spectrm 06 38:53 540 DOI 0.007/s009-06-043-5 REGULAR ARTICLE Worst-case analysis of the LPT algorithm for single processor schedling with time restrictions Oliver ran Fan Chng Ron Graham Received: Janary

More information

Control Performance Monitoring of State-Dependent Nonlinear Processes

Control Performance Monitoring of State-Dependent Nonlinear Processes Control Performance Monitoring of State-Dependent Nonlinear Processes Lis F. Recalde*, Hong Ye Wind Energy and Control Centre, Department of Electronic and Electrical Engineering, University of Strathclyde,

More information

Thompson Sampling for the non-stationary Corrupt Multi-Armed Bandit

Thompson Sampling for the non-stationary Corrupt Multi-Armed Bandit European Worshop on Reinforcement Learning 14 (2018 October 2018, Lille, France. Thompson Sampling for the non-stationary Corrupt Multi-Armed Bandit Réda Alami Orange Labs 2 Avenue Pierre Marzin 22300,

More information

On the circuit complexity of the standard and the Karatsuba methods of multiplying integers

On the circuit complexity of the standard and the Karatsuba methods of multiplying integers On the circit complexity of the standard and the Karatsba methods of mltiplying integers arxiv:1602.02362v1 [cs.ds] 7 Feb 2016 Igor S. Sergeev The goal of the present paper is to obtain accrate estimates

More information

On the Complexity of Best Arm Identification in Multi-Armed Bandit Models

On the Complexity of Best Arm Identification in Multi-Armed Bandit Models On the Complexity of Best Arm Identification in Multi-Armed Bandit Models Aurélien Garivier Institut de Mathématiques de Toulouse Information Theory, Learning and Big Data Simons Institute, Berkeley, March

More information

Dynamic Optimization of First-Order Systems via Static Parametric Programming: Application to Electrical Discharge Machining

Dynamic Optimization of First-Order Systems via Static Parametric Programming: Application to Electrical Discharge Machining Dynamic Optimization of First-Order Systems via Static Parametric Programming: Application to Electrical Discharge Machining P. Hgenin*, B. Srinivasan*, F. Altpeter**, R. Longchamp* * Laboratoire d Atomatiqe,

More information

Effects of Soil Spatial Variability on Bearing Capacity of Shallow Foundations

Effects of Soil Spatial Variability on Bearing Capacity of Shallow Foundations Geotechnical Safety and Risk V T. Schweckendiek et al. (Eds.) 2015 The athors and IOS Press. This article is pblished online with Open Access by IOS Press and distribted nder the terms of the Creative

More information

9. Tensor product and Hom

9. Tensor product and Hom 9. Tensor prodct and Hom Starting from two R-modles we can define two other R-modles, namely M R N and Hom R (M, N), that are very mch related. The defining properties of these modles are simple, bt those

More information

Learning to play K-armed bandit problems

Learning to play K-armed bandit problems Learning to play K-armed bandit problems Francis Maes 1, Louis Wehenkel 1 and Damien Ernst 1 1 University of Liège Dept. of Electrical Engineering and Computer Science Institut Montefiore, B28, B-4000,

More information

Information Source Detection in the SIR Model: A Sample Path Based Approach

Information Source Detection in the SIR Model: A Sample Path Based Approach Information Sorce Detection in the SIR Model: A Sample Path Based Approach Kai Zh and Lei Ying School of Electrical, Compter and Energy Engineering Arizona State University Tempe, AZ, United States, 85287

More information

Control Systems

Control Systems 6.5 Control Systems Last Time: Introdction Motivation Corse Overview Project Math. Descriptions of Systems ~ Review Classification of Systems Linear Systems LTI Systems The notion of state and state variables

More information

Stratégies bayésiennes et fréquentistes dans un modèle de bandit

Stratégies bayésiennes et fréquentistes dans un modèle de bandit Stratégies bayésiennes et fréquentistes dans un modèle de bandit thèse effectuée à Telecom ParisTech, co-dirigée par Olivier Cappé, Aurélien Garivier et Rémi Munos Journées MAS, Grenoble, 30 août 2016

More information

Gradient Projection Anti-windup Scheme on Constrained Planar LTI Systems. Justin Teo and Jonathan P. How

Gradient Projection Anti-windup Scheme on Constrained Planar LTI Systems. Justin Teo and Jonathan P. How 1 Gradient Projection Anti-windp Scheme on Constrained Planar LTI Systems Jstin Teo and Jonathan P. How Technical Report ACL1 1 Aerospace Controls Laboratory Department of Aeronatics and Astronatics Massachsetts

More information