Temporal-Difference Learning

Size: px
Start display at page:

Download "Temporal-Difference Learning"

Transcription

1 .997 Decision-Making in Lage-Scale Systems Mach 17 MIT, Sping 004 Handout #17 Lectue Note 13 1 Tempoal-Diffeence Leaning We now conside the poblem of computing an appopiate paamete, so that, given an appoximation achitectue J (x, ), J (, ) J ( ). A class of iteative methods ae the so-called tempoal-diffeence leaning algoithms, which geneates a seies of appoximations J k = J (, k ) as follows. Conside geneating a tajectoy (x 1, u 1,..., x k, u k ), whee u k is the geedy policy with espect to J k. We then have the eo/tempoal diffeences d k = g uk (x k ) + αj k (x k+1, k ) J k (x k, k ), which epesent an appoximation to the Bellman eo (T J k )(x k ) J k (x k ) at state x k. Based on the tempoal diffeences, an intuitive way of updating the paametes k is to make updates popotional to the obseved Bellman eo/tempoal diffeence: k+1 = k + γ k d k z k, whee γ k is the step size and z k is called an eligibility vecto it measues how much updates to each component of the vecto k would affect the Bellman eo. To gathe moe intuition about how to choose the eligibility vecto, we will conside the case of autonomous systems, i.e., systems that do not involve contol. In this case, we can estimate the cost-to-go function via sampling as follows. Suppose that we have a tajectoy x 1,..., x n. Then we have J (x n 1 1 ) α g(x t ) t=1 J (x n ) α g(x t ). t= In othe wods, fom a tajectoy x 1,..., x n, we can deive pais (x i, Ĵ(x i )), whee Ĵ(x i ) is a noisy and biased estimate of J (x i ). Theefoe we may conside fitting the appoximation J (x, ) by minimizing the empiical squaed eo: ( min Ĵ n (x t ) J(x t, )) (1) t=1 We deive an incemental, appoximate vesion of (1). Fist note that Ĵ n (x t ) could be updated incementally as follows: n+1 t Ĵ n+1 (x t ) = Ĵ n (x t ) + α g(x n+1 ) () Altenatively, we may use a small-step update of the fom n +1 Ĵ n+1 (x t ) = Ĵ n (x t ) + γ α j t g(x j ) Jˆ n (x t ), (3) j=t 1

2 which makes Ĵ n+1 (x t ) an aveage of the old estimate Ĵ n (x t ) and the new estimate (). Finally, we may appoximate (3) to have Ĵ n (x t ) function d 1, d,..., d n : α j t g(x j ) Jˆ n (x t ) = g(x t ) + αĵ n (x t+1 ) Jˆ n (x t ) + α(g(x t+1 ) + αĵ n (x t+ ) Jˆ n (x t+1 )) +... j=t Hence n t (g(x n ) + αĵ n (x n+1 ) Ĵ n (x n )) α n+1 t Ĵ n (x +α n+1 α j t d t. j=t n +1 Ĵ n+1 (x t ) = Ĵ n (x t ) + γ α j t d j. (4) Finally, we may conside having the sum in (1) implemented incementally, so that the pevious tempoal diffeences do not have to be stoed: Stating fom the solution n to the poblem at stage n, we can appoximate the solution of the poblem at stage n + 1 by updating n+1 along the gadient of (5). This leads to ( ) α t n n+1 = + γ J ( n, x t ) d j=t Ĵ n+1 t n+1 (x t ) = Ĵ n (x t ) + γα dn+1. Hence, in each time stage, we would like to find n minimizing ( min Ĵ n (x t ) + γα n t d n J(x t, )). (5) t=1 We can also have an incemental vesion, given by n t=1 k+1 = k + γ k z k d k z k = αz k 1 + J (x k, k ) The algoithm above is known as T D(1). We have the genealization T D(λ), λ [0, 1]. n+1. k+1 = k + γ k z k d k TD(λ) z k = αλz k 1 + J (x k, k ) Befoe analyzing the behavio of T D(λ), we ae going to study a elated, deteministic algoithm appoximate value iteation. The analysis of T D(λ) will be based on intepeting it as a stochastic appoximation vesion of appoximate value iteation. Appoximate Value Iteation Define the opeato T λ T λ J = (1 λ) λ m T m+1 J, fo λ [0, 1) m=0 T λ J = J, fo λ = 1. We can show that T λ satisfies the same popeties as T :

3 Lemma 1 T λ J T λ J α(1 λ) 1 αλ J J, J = T λ J J, J The motivation fo T λ is as follows. Recall that, in value iteation, we have J k+1 = T J k. Howeve, we could also implement value iteation with J k+1 = T L J k, which implies L steps look ahead. Finally, we can have an update that is a weighted aveage ove all possible values of L; J k+1 = T λ J k gives one such update. In what follows, we ae going to estict attention to linea appoximation achitectues. Let P J (x, ) = φ i (x) i, and Φ = i=1 J = Φ φ 1 (1) φ (1)... φ P (1) φ 1 () φ ()... φ P () φ 1 (n) φ (n)... φ P (n) Moeove, we ae going to conside only autonomous systems. We denote by P the tansition matix associated with the system. Let us intoduce some notation. Fist, we have d(1) d()... 0 D = d(n) whee d : S (0, 1) S is a pobability distibution ove states. Define the weighted Euclidean noms J,D = J T DJ = d(x)j (x) x S < J J > D = J T DJ = d(x)j(x)j (x) Fo simplicity, we assume that φ i, i = 1,..., p is an othonomal basis to the subspace J = Φ, i.e., x S φ i,d = 1 and < φ i, φ j >= 0, i = j In matix notation, we have Φ T DΦ = I. We ae going to use the following pojection opeato Π: ΠJ = Φ J, whee J = ag min Φ J,D 3

4 State J T λ Φ k J = Φ ΠJ Φ k+1 = ΠT λ Φ k State 1 Φ k State 3 Figue 1: Appoximate Value Iteation 4

5 We can chaacteize Π explicitly by solving the associated minimizing poblem. We have J = ag min Φ J,D = ag min (Φ J) T D(Φ J) ( = Φ T DΦ ) 1 Φ T DJ = < Φ, J > D Hence, we have ΠJ = Φ < Φ, J > D. Lemma Fo all J, ΠJ = Φ < Φ, J > D (6) < ΠJ, J ΠJ > D = 0 (7) J = ΠJ,D + J ΠJ,D (8),D Note that Φ k+1 = ΠT λ Φ k. We know that the pojection Π is a nonexpansion fom ΠJ ΠJ,D = Π(J J ),D J J,D. Moeove, T λ is a contaction: T λ J T λ J K J J. Howeve, the fact that Π and T λ ae a non-expansion and a contaction with espect to diffeent noms implies that convegence of appoximate value iteation cannot be guaanteed by a contaction agument, as was the case with exact value iteation. Indeed, as illustated in Figue, ΠT λ is not necessaily a contaction with espect to any nom, and one can find counteexamples whee T D(λ) fails to convege. As it tuns out, thee is a special choice of D that ensues convegence of T D(λ) fo all λ [0, 1]. Befoe poving that, we need the following auxiliay esult. Fist, we pesent two definitions involving Makov chains. Definition 1 A Makov chain is called ieducible if, fo evey pai of states x and y, thee is k such that P k (x, y) > 0. Definition A state x is called peiodic if thee is m such that P k (x, x) > 0 iff k = mn, fo some n {0, 1,,... }. A Makov chain is called apeiodic if none of its states is peiodic. Lemma 3 Given a tansition matix P and assume that P is ieducible and apeiodic. Then thee exists a unique π such that π T P = π T and π T π T P n... π T 5

6 Φ k T λ Φ k ΠT λ Φ k J Figue : T λ Φ k must be inside the smalle squae and ΠT λ Φ k must be inside the cicle, but ΠT λ Φ k may be outside the lage squae and futhe away fom J than Φ k. 6

7 This lemma was poved in Poblem Set, fo the special case whee P (x, x) > 0 fo some x. We ae now poised to pove the following cental esult used to deive a convegent vesion of T D(λ): Lemma 4 Suppose that the tansition matix P is ieducible and apeiodic. Let π π... 0 D =, π S whee π is the stationay distibution associated with P. Then P J,D J,D. Poof: P J =,D = = ( ) π(x) P (x, y)j(y) x S π(x) y P (x, y)j (y) x S y π(x)p (x, y)j (y) y x π(y)j (y) y J =,D The fist inequality follows the Jensen s inequality and the thid equality holds because π is a stationay distibution. Based on the pevious lemma, we can show that T λ is a contaction with espect to,dπ, whee π π... 0 D π = π S and π is the stationay distibution of the tansition matix P. It follows that, if the pojection Π is pefomed with espect to,dπ, ΠT λ becomes a contaction with espect to the same nom, and convegence of T D(λ) is guaanteed. Lemma 5 (i) T J T J,Dπ α J J,Dπ α(1 α) (ii) T λ J T λ J,Dπ 1 αλ J J α(1 α) (iii) ΠT λ J ΠT λ J,Dπ J J 1 αλ,d π,d π 7

8 Poof of (1) T J T J,Dπ = g + αp J (g + α J ),Dπ = α P J P J,Dπ α J J,Dπ Theoem 1 Let Φ k+1 = ΠT λ Φ k and Then k with π π... 0 D π = π S Φ J,Dπ K α,λ ΠJ J,Dπ. Poof: Convegence follows fom (iii). We have Φ = ΠT λ Φ and J T λ J. Then Φ J,D π = Φ ΠJ + ΠJ J,D π = Φ ΠJ,D π + ΠJ J,D π (othogonal) = ΠT λ Φ ΠT λ J,D π + ΠJ J,D π α (1 λ) (1 αλ) }{{ Φ J,D π + ΠJ J,D π } γ Theefoe 1 Φ J,Dπ ΠJ J,Dπ 1 γ 8

1 Explicit Explore or Exploit (E 3 ) Algorithm

1 Explicit Explore or Exploit (E 3 ) Algorithm 2.997 Decision-Making in Lage-Scale Systems Mach 3 MIT, Sping 2004 Handout #2 Lectue Note 9 Explicit Exploe o Exploit (E 3 ) Algoithm Last lectue, we studied the Q-leaning algoithm: [ ] Q t+ (x t, a t

More information

MULTILAYER PERCEPTRONS

MULTILAYER PERCEPTRONS Last updated: Nov 26, 2012 MULTILAYER PERCEPTRONS Outline 2 Combining Linea Classifies Leaning Paametes Outline 3 Combining Linea Classifies Leaning Paametes Implementing Logical Relations 4 AND and OR

More information

3.1 Random variables

3.1 Random variables 3 Chapte III Random Vaiables 3 Random vaiables A sample space S may be difficult to descibe if the elements of S ae not numbes discuss how we can use a ule by which an element s of S may be associated

More information

Performance Loss Bounds for Approximate Value Iteration with State Aggregation

Performance Loss Bounds for Approximate Value Iteration with State Aggregation MATHEMATICS OF OPERATIONS RESEARCH Vol. 31, No. 2, May 2006, pp. 234 244 issn 0364-765X eissn 1526-5471 06 3102 0234 infoms doi 10.1287/moo.1060.0188 2006 INFORMS Pefomance Loss Bounds fo Appoximate Value

More information

Lecture 28: Convergence of Random Variables and Related Theorems

Lecture 28: Convergence of Random Variables and Related Theorems EE50: Pobability Foundations fo Electical Enginees July-Novembe 205 Lectue 28: Convegence of Random Vaiables and Related Theoems Lectue:. Kishna Jagannathan Scibe: Gopal, Sudhasan, Ajay, Swamy, Kolla An

More information

Chapter 8: Generalization and Function Approximation

Chapter 8: Generalization and Function Approximation Chapte 8: Genealization and Function Appoximation Objectives of this chapte: Look at how expeience with a limited pat of the state set be used to poduce good behavio ove a much lage pat. Oveview of function

More information

Stanford University CS259Q: Quantum Computing Handout 8 Luca Trevisan October 18, 2012

Stanford University CS259Q: Quantum Computing Handout 8 Luca Trevisan October 18, 2012 Stanfod Univesity CS59Q: Quantum Computing Handout 8 Luca Tevisan Octobe 8, 0 Lectue 8 In which we use the quantum Fouie tansfom to solve the peiod-finding poblem. The Peiod Finding Poblem Let f : {0,...,

More information

Suggested Solutions to Homework #4 Econ 511b (Part I), Spring 2004

Suggested Solutions to Homework #4 Econ 511b (Part I), Spring 2004 Suggested Solutions to Homewok #4 Econ 5b (Pat I), Sping 2004. Conside a neoclassical gowth model with valued leisue. The (epesentative) consume values steams of consumption and leisue accoding to P t=0

More information

Value Prediction with FA. Chapter 8: Generalization and Function Approximation. Adapt Supervised Learning Algorithms. Backups as Training Examples [ ]

Value Prediction with FA. Chapter 8: Generalization and Function Approximation. Adapt Supervised Learning Algorithms. Backups as Training Examples [ ] Chapte 8: Genealization and Function Appoximation Objectives of this chapte:! Look at how expeience with a limited pat of the state set be used to poduce good behavio ove a much lage pat.! Oveview of function

More information

EM Boundary Value Problems

EM Boundary Value Problems EM Bounday Value Poblems 10/ 9 11/ By Ilekta chistidi & Lee, Seung-Hyun A. Geneal Desciption : Maxwell Equations & Loentz Foce We want to find the equations of motion of chaged paticles. The way to do

More information

Many Electron Atoms. Electrons can be put into approximate orbitals and the properties of the many electron systems can be catalogued

Many Electron Atoms. Electrons can be put into approximate orbitals and the properties of the many electron systems can be catalogued Many Electon Atoms The many body poblem cannot be solved analytically. We content ouselves with developing appoximate methods that can yield quite accuate esults (but usually equie a compute). The electons

More information

New problems in universal algebraic geometry illustrated by boolean equations

New problems in universal algebraic geometry illustrated by boolean equations New poblems in univesal algebaic geomety illustated by boolean equations axiv:1611.00152v2 [math.ra] 25 Nov 2016 Atem N. Shevlyakov Novembe 28, 2016 Abstact We discuss new poblems in univesal algebaic

More information

Internet Appendix for A Bayesian Approach to Real Options: The Case of Distinguishing Between Temporary and Permanent Shocks

Internet Appendix for A Bayesian Approach to Real Options: The Case of Distinguishing Between Temporary and Permanent Shocks Intenet Appendix fo A Bayesian Appoach to Real Options: The Case of Distinguishing Between Tempoay and Pemanent Shocks Steven R. Genadie Gaduate School of Business, Stanfod Univesity Andey Malenko Gaduate

More information

Quasi-Randomness and the Distribution of Copies of a Fixed Graph

Quasi-Randomness and the Distribution of Copies of a Fixed Graph Quasi-Randomness and the Distibution of Copies of a Fixed Gaph Asaf Shapia Abstact We show that if a gaph G has the popety that all subsets of vetices of size n/4 contain the coect numbe of tiangles one

More information

THE CONE THEOREM JOEL A. TROPP. Abstract. We prove a fixed point theorem for functions which are positive with respect to a cone in a Banach space.

THE CONE THEOREM JOEL A. TROPP. Abstract. We prove a fixed point theorem for functions which are positive with respect to a cone in a Banach space. THE ONE THEOEM JOEL A. TOPP Abstact. We pove a fixed point theoem fo functions which ae positive with espect to a cone in a Banach space. 1. Definitions Definition 1. Let X be a eal Banach space. A subset

More information

4/18/2005. Statistical Learning Theory

4/18/2005. Statistical Learning Theory Statistical Leaning Theoy Statistical Leaning Theoy A model of supevised leaning consists of: a Envionment - Supplying a vecto x with a fixed but unknown pdf F x (x b Teache. It povides a desied esponse

More information

working pages for Paul Richards class notes; do not copy or circulate without permission from PGR 2004/11/3 10:50

working pages for Paul Richards class notes; do not copy or circulate without permission from PGR 2004/11/3 10:50 woking pages fo Paul Richads class notes; do not copy o ciculate without pemission fom PGR 2004/11/3 10:50 CHAPTER7 Solid angle, 3D integals, Gauss s Theoem, and a Delta Function We define the solid angle,

More information

Information Retrieval Advanced IR models. Luca Bondi

Information Retrieval Advanced IR models. Luca Bondi Advanced IR models Luca Bondi Advanced IR models 2 (LSI) Pobabilistic Latent Semantic Analysis (plsa) Vecto Space Model 3 Stating point: Vecto Space Model Documents and queies epesented as vectos in the

More information

Method for Approximating Irrational Numbers

Method for Approximating Irrational Numbers Method fo Appoximating Iational Numbes Eic Reichwein Depatment of Physics Univesity of Califonia, Santa Cuz June 6, 0 Abstact I will put foth an algoithm fo poducing inceasingly accuate ational appoximations

More information

10/04/18. P [P(x)] 1 negl(n).

10/04/18. P [P(x)] 1 negl(n). Mastemath, Sping 208 Into to Lattice lgs & Cypto Lectue 0 0/04/8 Lectues: D. Dadush, L. Ducas Scibe: K. de Boe Intoduction In this lectue, we will teat two main pats. Duing the fist pat we continue the

More information

Bounds on the performance of back-to-front airplane boarding policies

Bounds on the performance of back-to-front airplane boarding policies Bounds on the pefomance of bac-to-font aiplane boading policies Eitan Bachmat Michael Elin Abstact We povide bounds on the pefomance of bac-to-font aiplane boading policies. In paticula, we show that no

More information

16 Modeling a Language by a Markov Process

16 Modeling a Language by a Markov Process K. Pommeening, Language Statistics 80 16 Modeling a Language by a Makov Pocess Fo deiving theoetical esults a common model of language is the intepetation of texts as esults of Makov pocesses. This model

More information

Goodness-of-fit for composite hypotheses.

Goodness-of-fit for composite hypotheses. Section 11 Goodness-of-fit fo composite hypotheses. Example. Let us conside a Matlab example. Let us geneate 50 obsevations fom N(1, 2): X=nomnd(1,2,50,1); Then, unning a chi-squaed goodness-of-fit test

More information

PROBLEM SET #1 SOLUTIONS by Robert A. DiStasio Jr.

PROBLEM SET #1 SOLUTIONS by Robert A. DiStasio Jr. POBLM S # SOLUIONS by obet A. DiStasio J. Q. he Bon-Oppenheime appoximation is the standad way of appoximating the gound state of a molecula system. Wite down the conditions that detemine the tonic and

More information

ON INDEPENDENT SETS IN PURELY ATOMIC PROBABILITY SPACES WITH GEOMETRIC DISTRIBUTION. 1. Introduction. 1 r r. r k for every set E A, E \ {0},

ON INDEPENDENT SETS IN PURELY ATOMIC PROBABILITY SPACES WITH GEOMETRIC DISTRIBUTION. 1. Introduction. 1 r r. r k for every set E A, E \ {0}, ON INDEPENDENT SETS IN PURELY ATOMIC PROBABILITY SPACES WITH GEOMETRIC DISTRIBUTION E. J. IONASCU and A. A. STANCU Abstact. We ae inteested in constucting concete independent events in puely atomic pobability

More information

MONTE CARLO STUDY OF PARTICLE TRANSPORT PROBLEM IN AIR POLLUTION. R. J. Papancheva, T. V. Gurov, I. T. Dimov

MONTE CARLO STUDY OF PARTICLE TRANSPORT PROBLEM IN AIR POLLUTION. R. J. Papancheva, T. V. Gurov, I. T. Dimov Pliska Stud. Math. Bulga. 14 (23), 17 116 STUDIA MATHEMATICA BULGARICA MOTE CARLO STUDY OF PARTICLE TRASPORT PROBLEM I AIR POLLUTIO R. J. Papancheva, T. V. Guov, I. T. Dimov Abstact. The actual tanspot

More information

PAPER 39 STOCHASTIC NETWORKS

PAPER 39 STOCHASTIC NETWORKS MATHEMATICAL TRIPOS Pat III Tuesday, 2 June, 2015 1:30 pm to 4:30 pm PAPER 39 STOCHASTIC NETWORKS Attempt no moe than FOUR questions. Thee ae FIVE questions in total. The questions cay equal weight. STATIONERY

More information

arxiv: v1 [math.na] 8 Feb 2013

arxiv: v1 [math.na] 8 Feb 2013 A mixed method fo Diichlet poblems with adial basis functions axiv:1302.2079v1 [math.na] 8 Feb 2013 Nobet Heue Thanh Tan Abstact We pesent a simple discetization by adial basis functions fo the Poisson

More information

On Computing Optimal (Q, r) Replenishment Policies under Quantity Discounts

On Computing Optimal (Q, r) Replenishment Policies under Quantity Discounts Annals of Opeations Reseach manuscipt No. will be inseted by the edito) On Computing Optimal, ) Replenishment Policies unde uantity Discounts The all - units and incemental discount cases Michael N. Katehakis

More information

As is natural, our Aerospace Structures will be described in a Euclidean three-dimensional space R 3.

As is natural, our Aerospace Structures will be described in a Euclidean three-dimensional space R 3. Appendix A Vecto Algeba As is natual, ou Aeospace Stuctues will be descibed in a Euclidean thee-dimensional space R 3. A.1 Vectos A vecto is used to epesent quantities that have both magnitude and diection.

More information

Solution to HW 3, Ma 1a Fall 2016

Solution to HW 3, Ma 1a Fall 2016 Solution to HW 3, Ma a Fall 206 Section 2. Execise 2: Let C be a subset of the eal numbes consisting of those eal numbes x having the popety that evey digit in the decimal expansion of x is, 3, 5, o 7.

More information

Bayesian Congestion Control over a Markovian Network Bandwidth Process

Bayesian Congestion Control over a Markovian Network Bandwidth Process Bayesian Congestion Contol ove a Makovian Netwok Bandwidth Pocess Paisa Mansouifad,, Bhaska Kishnamachai, Taa Javidi Ming Hsieh Depatment of Electical Engineeing, Univesity of Southen Califonia, Los Angeles,

More information

Fractional Zero Forcing via Three-color Forcing Games

Fractional Zero Forcing via Three-color Forcing Games Factional Zeo Focing via Thee-colo Focing Games Leslie Hogben Kevin F. Palmowski David E. Robeson Michael Young May 13, 2015 Abstact An -fold analogue of the positive semidefinite zeo focing pocess that

More information

On the Poisson Approximation to the Negative Hypergeometric Distribution

On the Poisson Approximation to the Negative Hypergeometric Distribution BULLETIN of the Malaysian Mathematical Sciences Society http://mathusmmy/bulletin Bull Malays Math Sci Soc (2) 34(2) (2011), 331 336 On the Poisson Appoximation to the Negative Hypegeometic Distibution

More information

JENSEN S INEQUALITY FOR DISTRIBUTIONS POSSESSING HIGHER MOMENTS, WITH APPLICATION TO SHARP BOUNDS FOR LAPLACE-STIELTJES TRANSFORMS

JENSEN S INEQUALITY FOR DISTRIBUTIONS POSSESSING HIGHER MOMENTS, WITH APPLICATION TO SHARP BOUNDS FOR LAPLACE-STIELTJES TRANSFORMS J. Austal. Math. Soc. Se. B 40(1998), 80 85 JENSEN S INEQUALITY FO DISTIBUTIONS POSSESSING HIGHE MOMENTS, WITH APPLICATION TO SHAP BOUNDS FO LAPLACE-STIELTJES TANSFOMS B. GULJAŠ 1,C.E.M.PEACE 2 and J.

More information

Directed Regression. Benjamin Van Roy Stanford University Stanford, CA Abstract

Directed Regression. Benjamin Van Roy Stanford University Stanford, CA Abstract Diected Regession Yi-hao Kao Stanfod Univesity Stanfod, CA 94305 yihaoao@stanfod.edu Benjamin Van Roy Stanfod Univesity Stanfod, CA 94305 bv@stanfod.edu Xiang Yan Stanfod Univesity Stanfod, CA 94305 xyan@stanfod.edu

More information

Estimation of the Correlation Coefficient for a Bivariate Normal Distribution with Missing Data

Estimation of the Correlation Coefficient for a Bivariate Normal Distribution with Missing Data Kasetsat J. (Nat. Sci. 45 : 736-74 ( Estimation of the Coelation Coefficient fo a Bivaiate Nomal Distibution with Missing Data Juthaphon Sinsomboonthong* ABSTRACT This study poposes an estimato of the

More information

6 PROBABILITY GENERATING FUNCTIONS

6 PROBABILITY GENERATING FUNCTIONS 6 PROBABILITY GENERATING FUNCTIONS Cetain deivations pesented in this couse have been somewhat heavy on algeba. Fo example, detemining the expectation of the Binomial distibution (page 5.1 tuned out to

More information

Deterministic vs Non-deterministic Graph Property Testing

Deterministic vs Non-deterministic Graph Property Testing Deteministic vs Non-deteministic Gaph Popety Testing Lio Gishboline Asaf Shapia Abstact A gaph popety P is said to be testable if one can check whethe a gaph is close o fa fom satisfying P using few andom

More information

Unobserved Correlation in Ascending Auctions: Example And Extensions

Unobserved Correlation in Ascending Auctions: Example And Extensions Unobseved Coelation in Ascending Auctions: Example And Extensions Daniel Quint Univesity of Wisconsin Novembe 2009 Intoduction In pivate-value ascending auctions, the winning bidde s willingness to pay

More information

Compactly Supported Radial Basis Functions

Compactly Supported Radial Basis Functions Chapte 4 Compactly Suppoted Radial Basis Functions As we saw ealie, compactly suppoted functions Φ that ae tuly stictly conditionally positive definite of ode m > do not exist The compact suppot automatically

More information

A Power Method for Computing Square Roots of Complex Matrices

A Power Method for Computing Square Roots of Complex Matrices JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS 13, 39345 1997 ARTICLE NO. AY975517 A Powe Method fo Computing Squae Roots of Complex Matices Mohammed A. Hasan Depatment of Electical Engineeing, Coloado

More information

Parameter identification in Markov chain choice models

Parameter identification in Markov chain choice models Poceedings of Machine Leaning Reseach 76:1 11, 2017 Algoithmic Leaning Theoy 2017 Paamete identification in Makov chain choice models Aushi Gupta Daniel Hsu Compute Science Depatment Columbia Univesity

More information

Analytical time-optimal trajectories for an omni-directional vehicle

Analytical time-optimal trajectories for an omni-directional vehicle Analytical time-optimal tajectoies fo an omni-diectional vehicle Weifu Wang and Devin J. Balkcom Abstact We pesent the fist analytical solution method fo finding a time-optimal tajectoy between any given

More information

Analysis of Loss Networks with Routing

Analysis of Loss Networks with Routing Analysis of Loss Netwoks with Routing Nelson Antunes, Chistine Ficke, Philippe Robet, Danielle Tibi To cite this vesion: Nelson Antunes, Chistine Ficke, Philippe Robet, Danielle Tibi. Analysis of Loss

More information

6 Matrix Concentration Bounds

6 Matrix Concentration Bounds 6 Matix Concentation Bounds Concentation bounds ae inequalities that bound pobabilities of deviations by a andom vaiable fom some value, often its mean. Infomally, they show the pobability that a andom

More information

Alternative Tests for the Poisson Distribution

Alternative Tests for the Poisson Distribution Chiang Mai J Sci 015; 4() : 774-78 http://epgsciencecmuacth/ejounal/ Contibuted Pape Altenative Tests fo the Poisson Distibution Manad Khamkong*[a] and Pachitjianut Siipanich [b] [a] Depatment of Statistics,

More information

Green s function Monte Carlo algorithms for elliptic problems

Green s function Monte Carlo algorithms for elliptic problems Mathematics and Computes in Simulation 63 (2003) 587 604 Geen s function Monte Calo algoithms fo elliptic poblems I.T. Dimov, R.Y. Papancheva Cental Laboatoy fo Paallel Pocessing, Depatment of Paallel

More information

Psychometric Methods: Theory into Practice Larry R. Price

Psychometric Methods: Theory into Practice Larry R. Price ERRATA Psychometic Methods: Theoy into Pactice Lay R. Pice Eos wee made in Equations 3.5a and 3.5b, Figue 3., equations and text on pages 76 80, and Table 9.1. Vesions of the elevant pages that include

More information

Probablistically Checkable Proofs

Probablistically Checkable Proofs Lectue 12 Pobablistically Checkable Poofs May 13, 2004 Lectue: Paul Beame Notes: Chis Re 12.1 Pobablisitically Checkable Poofs Oveview We know that IP = PSPACE. This means thee is an inteactive potocol

More information

Quantum theory of angular momentum

Quantum theory of angular momentum Quantum theoy of angula momentum Igo Mazets igo.mazets+e141@tuwien.ac.at (Atominstitut TU Wien, Stadionallee 2, 1020 Wien Time: Fiday, 13:00 14:30 Place: Feihaus, Sem.R. DA gün 06B (exception date 18 Nov.:

More information

COLLAPSING WALLS THEOREM

COLLAPSING WALLS THEOREM COLLAPSING WALLS THEOREM IGOR PAK AND ROM PINCHASI Abstact. Let P R 3 be a pyamid with the base a convex polygon Q. We show that when othe faces ae collapsed (otated aound the edges onto the plane spanned

More information

A Multivariate Normal Law for Turing s Formulae

A Multivariate Normal Law for Turing s Formulae A Multivaiate Nomal Law fo Tuing s Fomulae Zhiyi Zhang Depatment of Mathematics and Statistics Univesity of Noth Caolina at Chalotte Chalotte, NC 28223 Abstact This pape establishes a sufficient condition

More information

Localization of Eigenvalues in Small Specified Regions of Complex Plane by State Feedback Matrix

Localization of Eigenvalues in Small Specified Regions of Complex Plane by State Feedback Matrix Jounal of Sciences, Islamic Republic of Ian (): - () Univesity of Tehan, ISSN - http://sciencesutaci Localization of Eigenvalues in Small Specified Regions of Complex Plane by State Feedback Matix H Ahsani

More information

q i i=1 p i ln p i Another measure, which proves a useful benchmark in our analysis, is the chi squared divergence of p, q, which is defined by

q i i=1 p i ln p i Another measure, which proves a useful benchmark in our analysis, is the chi squared divergence of p, q, which is defined by CSISZÁR f DIVERGENCE, OSTROWSKI S INEQUALITY AND MUTUAL INFORMATION S. S. DRAGOMIR, V. GLUŠČEVIĆ, AND C. E. M. PEARCE Abstact. The Ostowski integal inequality fo an absolutely continuous function is used

More information

Energy Levels Of Hydrogen Atom Using Ladder Operators. Ava Khamseh Supervisor: Dr. Brian Pendleton The University of Edinburgh August 2011

Energy Levels Of Hydrogen Atom Using Ladder Operators. Ava Khamseh Supervisor: Dr. Brian Pendleton The University of Edinburgh August 2011 Enegy Levels Of Hydogen Atom Using Ladde Opeatos Ava Khamseh Supeviso: D. Bian Pendleton The Univesity of Edinbugh August 11 1 Abstact The aim of this pape is to fist use the Schödinge wavefunction methods

More information

State tracking control for Takagi-Sugeno models

State tracking control for Takagi-Sugeno models State tacing contol fo Taagi-Sugeno models Souad Bezzaoucha, Benoît Max,3,DidieMaquin,3 and José Ragot,3 Abstact This wo addesses the model efeence tacing contol poblem It aims to highlight the encouteed

More information

Multiple Criteria Secretary Problem: A New Approach

Multiple Criteria Secretary Problem: A New Approach J. Stat. Appl. Po. 3, o., 9-38 (04 9 Jounal of Statistics Applications & Pobability An Intenational Jounal http://dx.doi.og/0.785/jsap/0303 Multiple Citeia Secetay Poblem: A ew Appoach Alaka Padhye, and

More information

Math 124B February 02, 2012

Math 124B February 02, 2012 Math 24B Febuay 02, 202 Vikto Gigoyan 8 Laplace s equation: popeties We have aleady encounteed Laplace s equation in the context of stationay heat conduction and wave phenomena. Recall that in two spatial

More information

C e f paamete adaptation f (' x) ' ' d _ d ; ; e _e K p K v u ^M() RBF NN ^h( ) _ obot s _ s n W ' f x x xm xm f x xm d Figue : Block diagam of comput

C e f paamete adaptation f (' x) ' ' d _ d ; ; e _e K p K v u ^M() RBF NN ^h( ) _ obot s _ s n W ' f x x xm xm f x xm d Figue : Block diagam of comput A Neual-Netwok Compensato with Fuzzy Robustication Tems fo Impoved Design of Adaptive Contol of Robot Manipulatos Y.H. FUNG and S.K. TSO Cente fo Intelligent Design, Automation and Manufactuing City Univesity

More information

Journal of Inequalities in Pure and Applied Mathematics

Journal of Inequalities in Pure and Applied Mathematics Jounal of Inequalities in Pue and Applied Mathematics COEFFICIENT INEQUALITY FOR A FUNCTION WHOSE DERIVATIVE HAS A POSITIVE REAL PART S. ABRAMOVICH, M. KLARIČIĆ BAKULA AND S. BANIĆ Depatment of Mathematics

More information

Absorption Rate into a Small Sphere for a Diffusing Particle Confined in a Large Sphere

Absorption Rate into a Small Sphere for a Diffusing Particle Confined in a Large Sphere Applied Mathematics, 06, 7, 709-70 Published Online Apil 06 in SciRes. http://www.scip.og/jounal/am http://dx.doi.og/0.46/am.06.77065 Absoption Rate into a Small Sphee fo a Diffusing Paticle Confined in

More information

Tight Upper Bounds for the Expected Loss of Lexicographic Heuristics in Binary Multi-attribute Choice

Tight Upper Bounds for the Expected Loss of Lexicographic Heuristics in Binary Multi-attribute Choice Tight Uppe Bounds fo the Expected Loss of Lexicogaphic Heuistics in Binay Multi-attibute Choice Juan A. Caasco, Manel Baucells Except fo fomatting details and the coection of some eata, this vesion matches

More information

CALCULUS II Vectors. Paul Dawkins

CALCULUS II Vectors. Paul Dawkins CALCULUS II Vectos Paul Dawkins Table of Contents Peface... ii Vectos... 3 Intoduction... 3 Vectos The Basics... 4 Vecto Aithmetic... 8 Dot Poduct... 13 Coss Poduct... 21 2007 Paul Dawkins i http://tutoial.math.lama.edu/tems.aspx

More information

556: MATHEMATICAL STATISTICS I

556: MATHEMATICAL STATISTICS I 556: MATHEMATICAL STATISTICS I CHAPTER 5: STOCHASTIC CONVERGENCE The following efinitions ae state in tems of scala anom vaiables, but exten natually to vecto anom vaiables efine on the same obability

More information

Chapter 5 Linear Equations: Basic Theory and Practice

Chapter 5 Linear Equations: Basic Theory and Practice Chapte 5 inea Equations: Basic Theoy and actice In this chapte and the next, we ae inteested in the linea algebaic equation AX = b, (5-1) whee A is an m n matix, X is an n 1 vecto to be solved fo, and

More information

Encapsulation theory: the transformation equations of absolute information hiding.

Encapsulation theory: the transformation equations of absolute information hiding. 1 Encapsulation theoy: the tansfomation equations of absolute infomation hiding. Edmund Kiwan * www.edmundkiwan.com Abstact This pape descibes how the potential coupling of a set vaies as the set is tansfomed,

More information

Weighted least-squares estimators of parametric functions of the regression coefficients under a general linear model

Weighted least-squares estimators of parametric functions of the regression coefficients under a general linear model Ann Inst Stat Math (2010) 62:929 941 DOI 10.1007/s10463-008-0199-8 Weighted least-squaes estimatos of paametic functions of the egession coefficients unde a geneal linea model Yongge Tian Received: 9 Januay

More information

QIP Course 10: Quantum Factorization Algorithm (Part 3)

QIP Course 10: Quantum Factorization Algorithm (Part 3) QIP Couse 10: Quantum Factoization Algoithm (Pat 3 Ryutaoh Matsumoto Nagoya Univesity, Japan Send you comments to yutaoh.matsumoto@nagoya-u.jp Septembe 2018 @ Tokyo Tech. Matsumoto (Nagoya U. QIP Couse

More information

2 Governing Equations

2 Governing Equations 2 Govening Equations This chapte develops the govening equations of motion fo a homogeneous isotopic elastic solid, using the linea thee-dimensional theoy of elasticity in cylindical coodinates. At fist,

More information

Chapter 3 Optical Systems with Annular Pupils

Chapter 3 Optical Systems with Annular Pupils Chapte 3 Optical Systems with Annula Pupils 3 INTRODUCTION In this chapte, we discuss the imaging popeties of a system with an annula pupil in a manne simila to those fo a system with a cicula pupil The

More information

Notes on McCall s Model of Job Search. Timothy J. Kehoe March if job offer has been accepted. b if searching

Notes on McCall s Model of Job Search. Timothy J. Kehoe March if job offer has been accepted. b if searching Notes on McCall s Model of Job Seach Timothy J Kehoe Mach Fv ( ) pob( v), [, ] Choice: accept age offe o eceive b and seach again next peiod An unemployed oke solves hee max E t t y t y t if job offe has

More information

1.2 Differential cross section

1.2 Differential cross section .2. DIFFERENTIAL CROSS SECTION Febuay 9, 205 Lectue VIII.2 Diffeential coss section We found that the solution to the Schodinge equation has the fom e ik x ψ 2π 3/2 fk, k + e ik x and that fk, k = 2 m

More information

Efficiency Loss in a Network Resource Allocation Game: The Case of Elastic Supply

Efficiency Loss in a Network Resource Allocation Game: The Case of Elastic Supply Efficiency Loss in a Netwok Resouce Allocation Game: The Case of Elastic Supply axiv:cs/0506054v1 [cs.gt] 14 Jun 2005 Ramesh Johai (johai@stanfod.edu) Shie Manno (shie@mit.edu) John N. Tsitsiklis (jnt@mit.edu)

More information

A STUDY OF HAMMING CODES AS ERROR CORRECTING CODES

A STUDY OF HAMMING CODES AS ERROR CORRECTING CODES AGU Intenational Jounal of Science and Technology A STUDY OF HAMMING CODES AS ERROR CORRECTING CODES Ritu Ahuja Depatment of Mathematics Khalsa College fo Women, Civil Lines, Ludhiana-141001, Punjab, (India)

More information

ANA BERRIZBEITIA, LUIS A. MEDINA, ALEXANDER C. MOLL, VICTOR H. MOLL, AND LAINE NOBLE

ANA BERRIZBEITIA, LUIS A. MEDINA, ALEXANDER C. MOLL, VICTOR H. MOLL, AND LAINE NOBLE THE p-adic VALUATION OF STIRLING NUMBERS ANA BERRIZBEITIA, LUIS A. MEDINA, ALEXANDER C. MOLL, VICTOR H. MOLL, AND LAINE NOBLE Abstact. Let p > 2 be a pime. The p-adic valuation of Stiling numbes of the

More information

Preliminary Exam: Quantum Physics 1/14/2011, 9:00-3:00

Preliminary Exam: Quantum Physics 1/14/2011, 9:00-3:00 Peliminay Exam: Quantum Physics /4/ 9:-: Answe a total of SIX questions of which at least TWO ae fom section A and at least THREE ae fom section B Fo you answes you can use eithe the blue books o individual

More information

High Dimensional Clustering with r-nets

High Dimensional Clustering with r-nets High Dimensional Clusteing with -nets Geogia Avaikioti, Alain Ryse, Yuyi Wang, Roge Wattenhofe ETH Zuich, Switzeland {zetava,ayse,yuwang,wattenhofe}@ethz.ch Abstact Clusteing, a fundamental task in data

More information

LET a random variable x follows the two - parameter

LET a random variable x follows the two - parameter INTERNATIONAL JOURNAL OF MATHEMATICS AND SCIENTIFIC COMPUTING ISSN: 2231-5330, VOL. 5, NO. 1, 2015 19 Shinkage Bayesian Appoach in Item - Failue Gamma Data In Pesence of Pio Point Guess Value Gyan Pakash

More information

FOURIER-BESSEL SERIES AND BOUNDARY VALUE PROBLEMS IN CYLINDRICAL COORDINATES

FOURIER-BESSEL SERIES AND BOUNDARY VALUE PROBLEMS IN CYLINDRICAL COORDINATES FOURIER-BESSE SERIES AND BOUNDARY VAUE PROBEMS IN CYINDRICA COORDINATES The paametic Bessel s equation appeas in connection with the aplace opeato in pola coodinates. The method of sepaation of vaiables

More information

Model and Controller Order Reduction for Infinite Dimensional Systems

Model and Controller Order Reduction for Infinite Dimensional Systems IT J. Eng. Sci., Vol. 4, No.,, -6 Model and Contolle Ode Reduction fo Infinite Dimensional Systems Fatmawati,*, R. Saagih,. Riyanto 3 & Y. Soehayadi Industial and Financial Mathematics Goup email: fatma47@students.itb.ac.id;

More information

To Feel a Force Chapter 7 Static equilibrium - torque and friction

To Feel a Force Chapter 7 Static equilibrium - torque and friction To eel a oce Chapte 7 Chapte 7: Static fiction, toque and static equilibium A. Review of foce vectos Between the eath and a small mass, gavitational foces of equal magnitude and opposite diection act on

More information

Efficiency Loss in a Network Resource Allocation Game

Efficiency Loss in a Network Resource Allocation Game Efficiency Loss in a Netwok Resouce Allocation Game Ramesh Johai johai@mit.edu) John N. Tsitsiklis jnt@mit.edu) June 11, 2004 Abstact We exploe the popeties of a congestion game whee uses of a congested

More information

Math 301: The Erdős-Stone-Simonovitz Theorem and Extremal Numbers for Bipartite Graphs

Math 301: The Erdős-Stone-Simonovitz Theorem and Extremal Numbers for Bipartite Graphs Math 30: The Edős-Stone-Simonovitz Theoem and Extemal Numbes fo Bipatite Gaphs May Radcliffe The Edős-Stone-Simonovitz Theoem Recall, in class we poved Tuán s Gaph Theoem, namely Theoem Tuán s Theoem Let

More information

MASSACHUSETTS INSTITUTE OF TECHNOLOGY Physics Department. Problem Set 10 Solutions. r s

MASSACHUSETTS INSTITUTE OF TECHNOLOGY Physics Department. Problem Set 10 Solutions. r s MASSACHUSETTS INSTITUTE OF TECHNOLOGY Physics Depatment Physics 8.033 Decembe 5, 003 Poblem Set 10 Solutions Poblem 1 M s y x test paticle The figue above depicts the geomety of the poblem. The position

More information

Estimation of entropy rate and Rényi entropy rate for Markov chains

Estimation of entropy rate and Rényi entropy rate for Markov chains 06 IEEE Intenational Symposium on Infomation Theoy Estimation of entopy ate and Rényi entopy ate fo Makov chains Sudeep Kamath, Segio Vedú Depatment of Electical Engineeing Pinceton Univesity e-mail: sukamath,

More information

The Substring Search Problem

The Substring Search Problem The Substing Seach Poblem One algoithm which is used in a vaiety of applications is the family of substing seach algoithms. These algoithms allow a use to detemine if, given two chaacte stings, one is

More information

Chapter 7-8 Rotational Motion

Chapter 7-8 Rotational Motion Chapte 7-8 Rotational Motion What is a Rigid Body? Rotational Kinematics Angula Velocity ω and Acceleation α Unifom Rotational Motion: Kinematics Unifom Cicula Motion: Kinematics and Dynamics The Toque,

More information

Root-Exponential Accuracy for Coarse

Root-Exponential Accuracy for Coarse Root-Exponential Accuacy fo Coase 1 Quantization of Finite Fame Expansions Felix Kahme, Rayan Saab, and Rachel Wad Abstact In this note, we show that by quantizing the N-dimensional fame coefficients of

More information

arxiv: v2 [physics.data-an] 15 Jul 2015

arxiv: v2 [physics.data-an] 15 Jul 2015 Limitation of the Least Squae Method in the Evaluation of Dimension of Factal Bownian Motions BINGQIANG QIAO,, SIMING LIU, OUDUN ZENG, XIANG LI, and BENZONG DAI Depatment of Physics, Yunnan Univesity,

More information

Pearson s Chi-Square Test Modifications for Comparison of Unweighted and Weighted Histograms and Two Weighted Histograms

Pearson s Chi-Square Test Modifications for Comparison of Unweighted and Weighted Histograms and Two Weighted Histograms Peason s Chi-Squae Test Modifications fo Compaison of Unweighted and Weighted Histogams and Two Weighted Histogams Univesity of Akueyi, Bogi, v/noduslód, IS-6 Akueyi, Iceland E-mail: nikolai@unak.is Two

More information

Research Design - - Topic 17 Multiple Regression & Multiple Correlation: Two Predictors 2009 R.C. Gardner, Ph.D.

Research Design - - Topic 17 Multiple Regression & Multiple Correlation: Two Predictors 2009 R.C. Gardner, Ph.D. Reseach Design - - Topic 7 Multiple Regession & Multiple Coelation: Two Pedictos 009 R.C. Gadne, Ph.D. Geneal Rationale and Basic Aithmetic fo two pedictos Patial and semipatial coelation Regession coefficients

More information

ON THE INVERSE SIGNED TOTAL DOMINATION NUMBER IN GRAPHS. D.A. Mojdeh and B. Samadi

ON THE INVERSE SIGNED TOTAL DOMINATION NUMBER IN GRAPHS. D.A. Mojdeh and B. Samadi Opuscula Math. 37, no. 3 (017), 447 456 http://dx.doi.og/10.7494/opmath.017.37.3.447 Opuscula Mathematica ON THE INVERSE SIGNED TOTAL DOMINATION NUMBER IN GRAPHS D.A. Mojdeh and B. Samadi Communicated

More information

A Simple Nonparametric Approach to Estimating the Distribution of Random Coefficients in Structural Models

A Simple Nonparametric Approach to Estimating the Distribution of Random Coefficients in Structural Models A Simple Nonpaametic Appoach to Estimating the Distibution of Random Coefficients in Stuctual Models Jeemy T. Fox Rice Univesity & NBER Kyoo il Kim Michigan State Univesity May 2016 Chenyu Yang Univesity

More information

COUPLED MODELS OF ROLLING, SLIDING AND WHIRLING FRICTION

COUPLED MODELS OF ROLLING, SLIDING AND WHIRLING FRICTION ENOC 008 Saint Petesbug Russia June 30-July 4 008 COUPLED MODELS OF ROLLING SLIDING AND WHIRLING FRICTION Alexey Kieenkov Ins ti tu te fo P ob le ms in Me ch an ic s Ru ss ia n Ac ad em y of Sc ie nc es

More information

Chaos and bifurcation of discontinuous dynamical systems with piecewise constant arguments

Chaos and bifurcation of discontinuous dynamical systems with piecewise constant arguments Malaya Jounal of Matematik ()(22) 4 8 Chaos and bifucation of discontinuous dynamical systems with piecewise constant aguments A.M.A. El-Sayed, a, and S. M. Salman b a Faculty of Science, Aleandia Univesity,

More information

Brief summary of functional analysis APPM 5440 Fall 2014 Applied Analysis

Brief summary of functional analysis APPM 5440 Fall 2014 Applied Analysis Bief summay of functional analysis APPM 5440 Fall 014 Applied Analysis Stephen Becke, stephen.becke@coloado.edu Standad theoems. When necessay, I used Royden s and Keyzsig s books as a efeence. Vesion

More information

Mean Curvature and Shape Operator of Slant Immersions in a Sasakian Space Form

Mean Curvature and Shape Operator of Slant Immersions in a Sasakian Space Form Mean Cuvatue and Shape Opeato of Slant Immesions in a Sasakian Space Fom Muck Main Tipathi, Jean-Sic Kim and Son-Be Kim Abstact Fo submanifolds, in a Sasakian space fom, which ae tangential to the stuctue

More information

arxiv: v1 [math.co] 1 Apr 2011

arxiv: v1 [math.co] 1 Apr 2011 Weight enumeation of codes fom finite spaces Relinde Juius Octobe 23, 2018 axiv:1104.0172v1 [math.co] 1 Ap 2011 Abstact We study the genealized and extended weight enumeato of the - ay Simplex code and

More information

On the global uniform asymptotic stability of time-varying dynamical systems

On the global uniform asymptotic stability of time-varying dynamical systems Stud. Univ. Babeş-Bolyai Math. 59014), No. 1, 57 67 On the global unifom asymptotic stability of time-vaying dynamical systems Zaineb HajSalem, Mohamed Ali Hammami and Mohamed Mabouk Abstact. The objective

More information