Temporal-Difference Learning
|
|
- Adam Brown
- 5 years ago
- Views:
Transcription
1 .997 Decision-Making in Lage-Scale Systems Mach 17 MIT, Sping 004 Handout #17 Lectue Note 13 1 Tempoal-Diffeence Leaning We now conside the poblem of computing an appopiate paamete, so that, given an appoximation achitectue J (x, ), J (, ) J ( ). A class of iteative methods ae the so-called tempoal-diffeence leaning algoithms, which geneates a seies of appoximations J k = J (, k ) as follows. Conside geneating a tajectoy (x 1, u 1,..., x k, u k ), whee u k is the geedy policy with espect to J k. We then have the eo/tempoal diffeences d k = g uk (x k ) + αj k (x k+1, k ) J k (x k, k ), which epesent an appoximation to the Bellman eo (T J k )(x k ) J k (x k ) at state x k. Based on the tempoal diffeences, an intuitive way of updating the paametes k is to make updates popotional to the obseved Bellman eo/tempoal diffeence: k+1 = k + γ k d k z k, whee γ k is the step size and z k is called an eligibility vecto it measues how much updates to each component of the vecto k would affect the Bellman eo. To gathe moe intuition about how to choose the eligibility vecto, we will conside the case of autonomous systems, i.e., systems that do not involve contol. In this case, we can estimate the cost-to-go function via sampling as follows. Suppose that we have a tajectoy x 1,..., x n. Then we have J (x n 1 1 ) α g(x t ) t=1 J (x n ) α g(x t ). t= In othe wods, fom a tajectoy x 1,..., x n, we can deive pais (x i, Ĵ(x i )), whee Ĵ(x i ) is a noisy and biased estimate of J (x i ). Theefoe we may conside fitting the appoximation J (x, ) by minimizing the empiical squaed eo: ( min Ĵ n (x t ) J(x t, )) (1) t=1 We deive an incemental, appoximate vesion of (1). Fist note that Ĵ n (x t ) could be updated incementally as follows: n+1 t Ĵ n+1 (x t ) = Ĵ n (x t ) + α g(x n+1 ) () Altenatively, we may use a small-step update of the fom n +1 Ĵ n+1 (x t ) = Ĵ n (x t ) + γ α j t g(x j ) Jˆ n (x t ), (3) j=t 1
2 which makes Ĵ n+1 (x t ) an aveage of the old estimate Ĵ n (x t ) and the new estimate (). Finally, we may appoximate (3) to have Ĵ n (x t ) function d 1, d,..., d n : α j t g(x j ) Jˆ n (x t ) = g(x t ) + αĵ n (x t+1 ) Jˆ n (x t ) + α(g(x t+1 ) + αĵ n (x t+ ) Jˆ n (x t+1 )) +... j=t Hence n t (g(x n ) + αĵ n (x n+1 ) Ĵ n (x n )) α n+1 t Ĵ n (x +α n+1 α j t d t. j=t n +1 Ĵ n+1 (x t ) = Ĵ n (x t ) + γ α j t d j. (4) Finally, we may conside having the sum in (1) implemented incementally, so that the pevious tempoal diffeences do not have to be stoed: Stating fom the solution n to the poblem at stage n, we can appoximate the solution of the poblem at stage n + 1 by updating n+1 along the gadient of (5). This leads to ( ) α t n n+1 = + γ J ( n, x t ) d j=t Ĵ n+1 t n+1 (x t ) = Ĵ n (x t ) + γα dn+1. Hence, in each time stage, we would like to find n minimizing ( min Ĵ n (x t ) + γα n t d n J(x t, )). (5) t=1 We can also have an incemental vesion, given by n t=1 k+1 = k + γ k z k d k z k = αz k 1 + J (x k, k ) The algoithm above is known as T D(1). We have the genealization T D(λ), λ [0, 1]. n+1. k+1 = k + γ k z k d k TD(λ) z k = αλz k 1 + J (x k, k ) Befoe analyzing the behavio of T D(λ), we ae going to study a elated, deteministic algoithm appoximate value iteation. The analysis of T D(λ) will be based on intepeting it as a stochastic appoximation vesion of appoximate value iteation. Appoximate Value Iteation Define the opeato T λ T λ J = (1 λ) λ m T m+1 J, fo λ [0, 1) m=0 T λ J = J, fo λ = 1. We can show that T λ satisfies the same popeties as T :
3 Lemma 1 T λ J T λ J α(1 λ) 1 αλ J J, J = T λ J J, J The motivation fo T λ is as follows. Recall that, in value iteation, we have J k+1 = T J k. Howeve, we could also implement value iteation with J k+1 = T L J k, which implies L steps look ahead. Finally, we can have an update that is a weighted aveage ove all possible values of L; J k+1 = T λ J k gives one such update. In what follows, we ae going to estict attention to linea appoximation achitectues. Let P J (x, ) = φ i (x) i, and Φ = i=1 J = Φ φ 1 (1) φ (1)... φ P (1) φ 1 () φ ()... φ P () φ 1 (n) φ (n)... φ P (n) Moeove, we ae going to conside only autonomous systems. We denote by P the tansition matix associated with the system. Let us intoduce some notation. Fist, we have d(1) d()... 0 D = d(n) whee d : S (0, 1) S is a pobability distibution ove states. Define the weighted Euclidean noms J,D = J T DJ = d(x)j (x) x S < J J > D = J T DJ = d(x)j(x)j (x) Fo simplicity, we assume that φ i, i = 1,..., p is an othonomal basis to the subspace J = Φ, i.e., x S φ i,d = 1 and < φ i, φ j >= 0, i = j In matix notation, we have Φ T DΦ = I. We ae going to use the following pojection opeato Π: ΠJ = Φ J, whee J = ag min Φ J,D 3
4 State J T λ Φ k J = Φ ΠJ Φ k+1 = ΠT λ Φ k State 1 Φ k State 3 Figue 1: Appoximate Value Iteation 4
5 We can chaacteize Π explicitly by solving the associated minimizing poblem. We have J = ag min Φ J,D = ag min (Φ J) T D(Φ J) ( = Φ T DΦ ) 1 Φ T DJ = < Φ, J > D Hence, we have ΠJ = Φ < Φ, J > D. Lemma Fo all J, ΠJ = Φ < Φ, J > D (6) < ΠJ, J ΠJ > D = 0 (7) J = ΠJ,D + J ΠJ,D (8),D Note that Φ k+1 = ΠT λ Φ k. We know that the pojection Π is a nonexpansion fom ΠJ ΠJ,D = Π(J J ),D J J,D. Moeove, T λ is a contaction: T λ J T λ J K J J. Howeve, the fact that Π and T λ ae a non-expansion and a contaction with espect to diffeent noms implies that convegence of appoximate value iteation cannot be guaanteed by a contaction agument, as was the case with exact value iteation. Indeed, as illustated in Figue, ΠT λ is not necessaily a contaction with espect to any nom, and one can find counteexamples whee T D(λ) fails to convege. As it tuns out, thee is a special choice of D that ensues convegence of T D(λ) fo all λ [0, 1]. Befoe poving that, we need the following auxiliay esult. Fist, we pesent two definitions involving Makov chains. Definition 1 A Makov chain is called ieducible if, fo evey pai of states x and y, thee is k such that P k (x, y) > 0. Definition A state x is called peiodic if thee is m such that P k (x, x) > 0 iff k = mn, fo some n {0, 1,,... }. A Makov chain is called apeiodic if none of its states is peiodic. Lemma 3 Given a tansition matix P and assume that P is ieducible and apeiodic. Then thee exists a unique π such that π T P = π T and π T π T P n... π T 5
6 Φ k T λ Φ k ΠT λ Φ k J Figue : T λ Φ k must be inside the smalle squae and ΠT λ Φ k must be inside the cicle, but ΠT λ Φ k may be outside the lage squae and futhe away fom J than Φ k. 6
7 This lemma was poved in Poblem Set, fo the special case whee P (x, x) > 0 fo some x. We ae now poised to pove the following cental esult used to deive a convegent vesion of T D(λ): Lemma 4 Suppose that the tansition matix P is ieducible and apeiodic. Let π π... 0 D =, π S whee π is the stationay distibution associated with P. Then P J,D J,D. Poof: P J =,D = = ( ) π(x) P (x, y)j(y) x S π(x) y P (x, y)j (y) x S y π(x)p (x, y)j (y) y x π(y)j (y) y J =,D The fist inequality follows the Jensen s inequality and the thid equality holds because π is a stationay distibution. Based on the pevious lemma, we can show that T λ is a contaction with espect to,dπ, whee π π... 0 D π = π S and π is the stationay distibution of the tansition matix P. It follows that, if the pojection Π is pefomed with espect to,dπ, ΠT λ becomes a contaction with espect to the same nom, and convegence of T D(λ) is guaanteed. Lemma 5 (i) T J T J,Dπ α J J,Dπ α(1 α) (ii) T λ J T λ J,Dπ 1 αλ J J α(1 α) (iii) ΠT λ J ΠT λ J,Dπ J J 1 αλ,d π,d π 7
8 Poof of (1) T J T J,Dπ = g + αp J (g + α J ),Dπ = α P J P J,Dπ α J J,Dπ Theoem 1 Let Φ k+1 = ΠT λ Φ k and Then k with π π... 0 D π = π S Φ J,Dπ K α,λ ΠJ J,Dπ. Poof: Convegence follows fom (iii). We have Φ = ΠT λ Φ and J T λ J. Then Φ J,D π = Φ ΠJ + ΠJ J,D π = Φ ΠJ,D π + ΠJ J,D π (othogonal) = ΠT λ Φ ΠT λ J,D π + ΠJ J,D π α (1 λ) (1 αλ) }{{ Φ J,D π + ΠJ J,D π } γ Theefoe 1 Φ J,Dπ ΠJ J,Dπ 1 γ 8
1 Explicit Explore or Exploit (E 3 ) Algorithm
2.997 Decision-Making in Lage-Scale Systems Mach 3 MIT, Sping 2004 Handout #2 Lectue Note 9 Explicit Exploe o Exploit (E 3 ) Algoithm Last lectue, we studied the Q-leaning algoithm: [ ] Q t+ (x t, a t
More informationMULTILAYER PERCEPTRONS
Last updated: Nov 26, 2012 MULTILAYER PERCEPTRONS Outline 2 Combining Linea Classifies Leaning Paametes Outline 3 Combining Linea Classifies Leaning Paametes Implementing Logical Relations 4 AND and OR
More information3.1 Random variables
3 Chapte III Random Vaiables 3 Random vaiables A sample space S may be difficult to descibe if the elements of S ae not numbes discuss how we can use a ule by which an element s of S may be associated
More informationPerformance Loss Bounds for Approximate Value Iteration with State Aggregation
MATHEMATICS OF OPERATIONS RESEARCH Vol. 31, No. 2, May 2006, pp. 234 244 issn 0364-765X eissn 1526-5471 06 3102 0234 infoms doi 10.1287/moo.1060.0188 2006 INFORMS Pefomance Loss Bounds fo Appoximate Value
More informationLecture 28: Convergence of Random Variables and Related Theorems
EE50: Pobability Foundations fo Electical Enginees July-Novembe 205 Lectue 28: Convegence of Random Vaiables and Related Theoems Lectue:. Kishna Jagannathan Scibe: Gopal, Sudhasan, Ajay, Swamy, Kolla An
More informationChapter 8: Generalization and Function Approximation
Chapte 8: Genealization and Function Appoximation Objectives of this chapte: Look at how expeience with a limited pat of the state set be used to poduce good behavio ove a much lage pat. Oveview of function
More informationStanford University CS259Q: Quantum Computing Handout 8 Luca Trevisan October 18, 2012
Stanfod Univesity CS59Q: Quantum Computing Handout 8 Luca Tevisan Octobe 8, 0 Lectue 8 In which we use the quantum Fouie tansfom to solve the peiod-finding poblem. The Peiod Finding Poblem Let f : {0,...,
More informationSuggested Solutions to Homework #4 Econ 511b (Part I), Spring 2004
Suggested Solutions to Homewok #4 Econ 5b (Pat I), Sping 2004. Conside a neoclassical gowth model with valued leisue. The (epesentative) consume values steams of consumption and leisue accoding to P t=0
More informationValue Prediction with FA. Chapter 8: Generalization and Function Approximation. Adapt Supervised Learning Algorithms. Backups as Training Examples [ ]
Chapte 8: Genealization and Function Appoximation Objectives of this chapte:! Look at how expeience with a limited pat of the state set be used to poduce good behavio ove a much lage pat.! Oveview of function
More informationEM Boundary Value Problems
EM Bounday Value Poblems 10/ 9 11/ By Ilekta chistidi & Lee, Seung-Hyun A. Geneal Desciption : Maxwell Equations & Loentz Foce We want to find the equations of motion of chaged paticles. The way to do
More informationMany Electron Atoms. Electrons can be put into approximate orbitals and the properties of the many electron systems can be catalogued
Many Electon Atoms The many body poblem cannot be solved analytically. We content ouselves with developing appoximate methods that can yield quite accuate esults (but usually equie a compute). The electons
More informationNew problems in universal algebraic geometry illustrated by boolean equations
New poblems in univesal algebaic geomety illustated by boolean equations axiv:1611.00152v2 [math.ra] 25 Nov 2016 Atem N. Shevlyakov Novembe 28, 2016 Abstact We discuss new poblems in univesal algebaic
More informationInternet Appendix for A Bayesian Approach to Real Options: The Case of Distinguishing Between Temporary and Permanent Shocks
Intenet Appendix fo A Bayesian Appoach to Real Options: The Case of Distinguishing Between Tempoay and Pemanent Shocks Steven R. Genadie Gaduate School of Business, Stanfod Univesity Andey Malenko Gaduate
More informationQuasi-Randomness and the Distribution of Copies of a Fixed Graph
Quasi-Randomness and the Distibution of Copies of a Fixed Gaph Asaf Shapia Abstact We show that if a gaph G has the popety that all subsets of vetices of size n/4 contain the coect numbe of tiangles one
More informationTHE CONE THEOREM JOEL A. TROPP. Abstract. We prove a fixed point theorem for functions which are positive with respect to a cone in a Banach space.
THE ONE THEOEM JOEL A. TOPP Abstact. We pove a fixed point theoem fo functions which ae positive with espect to a cone in a Banach space. 1. Definitions Definition 1. Let X be a eal Banach space. A subset
More information4/18/2005. Statistical Learning Theory
Statistical Leaning Theoy Statistical Leaning Theoy A model of supevised leaning consists of: a Envionment - Supplying a vecto x with a fixed but unknown pdf F x (x b Teache. It povides a desied esponse
More informationworking pages for Paul Richards class notes; do not copy or circulate without permission from PGR 2004/11/3 10:50
woking pages fo Paul Richads class notes; do not copy o ciculate without pemission fom PGR 2004/11/3 10:50 CHAPTER7 Solid angle, 3D integals, Gauss s Theoem, and a Delta Function We define the solid angle,
More informationInformation Retrieval Advanced IR models. Luca Bondi
Advanced IR models Luca Bondi Advanced IR models 2 (LSI) Pobabilistic Latent Semantic Analysis (plsa) Vecto Space Model 3 Stating point: Vecto Space Model Documents and queies epesented as vectos in the
More informationMethod for Approximating Irrational Numbers
Method fo Appoximating Iational Numbes Eic Reichwein Depatment of Physics Univesity of Califonia, Santa Cuz June 6, 0 Abstact I will put foth an algoithm fo poducing inceasingly accuate ational appoximations
More information10/04/18. P [P(x)] 1 negl(n).
Mastemath, Sping 208 Into to Lattice lgs & Cypto Lectue 0 0/04/8 Lectues: D. Dadush, L. Ducas Scibe: K. de Boe Intoduction In this lectue, we will teat two main pats. Duing the fist pat we continue the
More informationBounds on the performance of back-to-front airplane boarding policies
Bounds on the pefomance of bac-to-font aiplane boading policies Eitan Bachmat Michael Elin Abstact We povide bounds on the pefomance of bac-to-font aiplane boading policies. In paticula, we show that no
More information16 Modeling a Language by a Markov Process
K. Pommeening, Language Statistics 80 16 Modeling a Language by a Makov Pocess Fo deiving theoetical esults a common model of language is the intepetation of texts as esults of Makov pocesses. This model
More informationGoodness-of-fit for composite hypotheses.
Section 11 Goodness-of-fit fo composite hypotheses. Example. Let us conside a Matlab example. Let us geneate 50 obsevations fom N(1, 2): X=nomnd(1,2,50,1); Then, unning a chi-squaed goodness-of-fit test
More informationPROBLEM SET #1 SOLUTIONS by Robert A. DiStasio Jr.
POBLM S # SOLUIONS by obet A. DiStasio J. Q. he Bon-Oppenheime appoximation is the standad way of appoximating the gound state of a molecula system. Wite down the conditions that detemine the tonic and
More informationON INDEPENDENT SETS IN PURELY ATOMIC PROBABILITY SPACES WITH GEOMETRIC DISTRIBUTION. 1. Introduction. 1 r r. r k for every set E A, E \ {0},
ON INDEPENDENT SETS IN PURELY ATOMIC PROBABILITY SPACES WITH GEOMETRIC DISTRIBUTION E. J. IONASCU and A. A. STANCU Abstact. We ae inteested in constucting concete independent events in puely atomic pobability
More informationMONTE CARLO STUDY OF PARTICLE TRANSPORT PROBLEM IN AIR POLLUTION. R. J. Papancheva, T. V. Gurov, I. T. Dimov
Pliska Stud. Math. Bulga. 14 (23), 17 116 STUDIA MATHEMATICA BULGARICA MOTE CARLO STUDY OF PARTICLE TRASPORT PROBLEM I AIR POLLUTIO R. J. Papancheva, T. V. Guov, I. T. Dimov Abstact. The actual tanspot
More informationPAPER 39 STOCHASTIC NETWORKS
MATHEMATICAL TRIPOS Pat III Tuesday, 2 June, 2015 1:30 pm to 4:30 pm PAPER 39 STOCHASTIC NETWORKS Attempt no moe than FOUR questions. Thee ae FIVE questions in total. The questions cay equal weight. STATIONERY
More informationarxiv: v1 [math.na] 8 Feb 2013
A mixed method fo Diichlet poblems with adial basis functions axiv:1302.2079v1 [math.na] 8 Feb 2013 Nobet Heue Thanh Tan Abstact We pesent a simple discetization by adial basis functions fo the Poisson
More informationOn Computing Optimal (Q, r) Replenishment Policies under Quantity Discounts
Annals of Opeations Reseach manuscipt No. will be inseted by the edito) On Computing Optimal, ) Replenishment Policies unde uantity Discounts The all - units and incemental discount cases Michael N. Katehakis
More informationAs is natural, our Aerospace Structures will be described in a Euclidean three-dimensional space R 3.
Appendix A Vecto Algeba As is natual, ou Aeospace Stuctues will be descibed in a Euclidean thee-dimensional space R 3. A.1 Vectos A vecto is used to epesent quantities that have both magnitude and diection.
More informationSolution to HW 3, Ma 1a Fall 2016
Solution to HW 3, Ma a Fall 206 Section 2. Execise 2: Let C be a subset of the eal numbes consisting of those eal numbes x having the popety that evey digit in the decimal expansion of x is, 3, 5, o 7.
More informationBayesian Congestion Control over a Markovian Network Bandwidth Process
Bayesian Congestion Contol ove a Makovian Netwok Bandwidth Pocess Paisa Mansouifad,, Bhaska Kishnamachai, Taa Javidi Ming Hsieh Depatment of Electical Engineeing, Univesity of Southen Califonia, Los Angeles,
More informationFractional Zero Forcing via Three-color Forcing Games
Factional Zeo Focing via Thee-colo Focing Games Leslie Hogben Kevin F. Palmowski David E. Robeson Michael Young May 13, 2015 Abstact An -fold analogue of the positive semidefinite zeo focing pocess that
More informationOn the Poisson Approximation to the Negative Hypergeometric Distribution
BULLETIN of the Malaysian Mathematical Sciences Society http://mathusmmy/bulletin Bull Malays Math Sci Soc (2) 34(2) (2011), 331 336 On the Poisson Appoximation to the Negative Hypegeometic Distibution
More informationJENSEN S INEQUALITY FOR DISTRIBUTIONS POSSESSING HIGHER MOMENTS, WITH APPLICATION TO SHARP BOUNDS FOR LAPLACE-STIELTJES TRANSFORMS
J. Austal. Math. Soc. Se. B 40(1998), 80 85 JENSEN S INEQUALITY FO DISTIBUTIONS POSSESSING HIGHE MOMENTS, WITH APPLICATION TO SHAP BOUNDS FO LAPLACE-STIELTJES TANSFOMS B. GULJAŠ 1,C.E.M.PEACE 2 and J.
More informationDirected Regression. Benjamin Van Roy Stanford University Stanford, CA Abstract
Diected Regession Yi-hao Kao Stanfod Univesity Stanfod, CA 94305 yihaoao@stanfod.edu Benjamin Van Roy Stanfod Univesity Stanfod, CA 94305 bv@stanfod.edu Xiang Yan Stanfod Univesity Stanfod, CA 94305 xyan@stanfod.edu
More informationEstimation of the Correlation Coefficient for a Bivariate Normal Distribution with Missing Data
Kasetsat J. (Nat. Sci. 45 : 736-74 ( Estimation of the Coelation Coefficient fo a Bivaiate Nomal Distibution with Missing Data Juthaphon Sinsomboonthong* ABSTRACT This study poposes an estimato of the
More information6 PROBABILITY GENERATING FUNCTIONS
6 PROBABILITY GENERATING FUNCTIONS Cetain deivations pesented in this couse have been somewhat heavy on algeba. Fo example, detemining the expectation of the Binomial distibution (page 5.1 tuned out to
More informationDeterministic vs Non-deterministic Graph Property Testing
Deteministic vs Non-deteministic Gaph Popety Testing Lio Gishboline Asaf Shapia Abstact A gaph popety P is said to be testable if one can check whethe a gaph is close o fa fom satisfying P using few andom
More informationUnobserved Correlation in Ascending Auctions: Example And Extensions
Unobseved Coelation in Ascending Auctions: Example And Extensions Daniel Quint Univesity of Wisconsin Novembe 2009 Intoduction In pivate-value ascending auctions, the winning bidde s willingness to pay
More informationCompactly Supported Radial Basis Functions
Chapte 4 Compactly Suppoted Radial Basis Functions As we saw ealie, compactly suppoted functions Φ that ae tuly stictly conditionally positive definite of ode m > do not exist The compact suppot automatically
More informationA Power Method for Computing Square Roots of Complex Matrices
JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS 13, 39345 1997 ARTICLE NO. AY975517 A Powe Method fo Computing Squae Roots of Complex Matices Mohammed A. Hasan Depatment of Electical Engineeing, Coloado
More informationParameter identification in Markov chain choice models
Poceedings of Machine Leaning Reseach 76:1 11, 2017 Algoithmic Leaning Theoy 2017 Paamete identification in Makov chain choice models Aushi Gupta Daniel Hsu Compute Science Depatment Columbia Univesity
More informationAnalytical time-optimal trajectories for an omni-directional vehicle
Analytical time-optimal tajectoies fo an omni-diectional vehicle Weifu Wang and Devin J. Balkcom Abstact We pesent the fist analytical solution method fo finding a time-optimal tajectoy between any given
More informationAnalysis of Loss Networks with Routing
Analysis of Loss Netwoks with Routing Nelson Antunes, Chistine Ficke, Philippe Robet, Danielle Tibi To cite this vesion: Nelson Antunes, Chistine Ficke, Philippe Robet, Danielle Tibi. Analysis of Loss
More information6 Matrix Concentration Bounds
6 Matix Concentation Bounds Concentation bounds ae inequalities that bound pobabilities of deviations by a andom vaiable fom some value, often its mean. Infomally, they show the pobability that a andom
More informationAlternative Tests for the Poisson Distribution
Chiang Mai J Sci 015; 4() : 774-78 http://epgsciencecmuacth/ejounal/ Contibuted Pape Altenative Tests fo the Poisson Distibution Manad Khamkong*[a] and Pachitjianut Siipanich [b] [a] Depatment of Statistics,
More informationGreen s function Monte Carlo algorithms for elliptic problems
Mathematics and Computes in Simulation 63 (2003) 587 604 Geen s function Monte Calo algoithms fo elliptic poblems I.T. Dimov, R.Y. Papancheva Cental Laboatoy fo Paallel Pocessing, Depatment of Paallel
More informationPsychometric Methods: Theory into Practice Larry R. Price
ERRATA Psychometic Methods: Theoy into Pactice Lay R. Pice Eos wee made in Equations 3.5a and 3.5b, Figue 3., equations and text on pages 76 80, and Table 9.1. Vesions of the elevant pages that include
More informationProbablistically Checkable Proofs
Lectue 12 Pobablistically Checkable Poofs May 13, 2004 Lectue: Paul Beame Notes: Chis Re 12.1 Pobablisitically Checkable Poofs Oveview We know that IP = PSPACE. This means thee is an inteactive potocol
More informationQuantum theory of angular momentum
Quantum theoy of angula momentum Igo Mazets igo.mazets+e141@tuwien.ac.at (Atominstitut TU Wien, Stadionallee 2, 1020 Wien Time: Fiday, 13:00 14:30 Place: Feihaus, Sem.R. DA gün 06B (exception date 18 Nov.:
More informationCOLLAPSING WALLS THEOREM
COLLAPSING WALLS THEOREM IGOR PAK AND ROM PINCHASI Abstact. Let P R 3 be a pyamid with the base a convex polygon Q. We show that when othe faces ae collapsed (otated aound the edges onto the plane spanned
More informationA Multivariate Normal Law for Turing s Formulae
A Multivaiate Nomal Law fo Tuing s Fomulae Zhiyi Zhang Depatment of Mathematics and Statistics Univesity of Noth Caolina at Chalotte Chalotte, NC 28223 Abstact This pape establishes a sufficient condition
More informationLocalization of Eigenvalues in Small Specified Regions of Complex Plane by State Feedback Matrix
Jounal of Sciences, Islamic Republic of Ian (): - () Univesity of Tehan, ISSN - http://sciencesutaci Localization of Eigenvalues in Small Specified Regions of Complex Plane by State Feedback Matix H Ahsani
More informationq i i=1 p i ln p i Another measure, which proves a useful benchmark in our analysis, is the chi squared divergence of p, q, which is defined by
CSISZÁR f DIVERGENCE, OSTROWSKI S INEQUALITY AND MUTUAL INFORMATION S. S. DRAGOMIR, V. GLUŠČEVIĆ, AND C. E. M. PEARCE Abstact. The Ostowski integal inequality fo an absolutely continuous function is used
More informationEnergy Levels Of Hydrogen Atom Using Ladder Operators. Ava Khamseh Supervisor: Dr. Brian Pendleton The University of Edinburgh August 2011
Enegy Levels Of Hydogen Atom Using Ladde Opeatos Ava Khamseh Supeviso: D. Bian Pendleton The Univesity of Edinbugh August 11 1 Abstact The aim of this pape is to fist use the Schödinge wavefunction methods
More informationState tracking control for Takagi-Sugeno models
State tacing contol fo Taagi-Sugeno models Souad Bezzaoucha, Benoît Max,3,DidieMaquin,3 and José Ragot,3 Abstact This wo addesses the model efeence tacing contol poblem It aims to highlight the encouteed
More informationMultiple Criteria Secretary Problem: A New Approach
J. Stat. Appl. Po. 3, o., 9-38 (04 9 Jounal of Statistics Applications & Pobability An Intenational Jounal http://dx.doi.og/0.785/jsap/0303 Multiple Citeia Secetay Poblem: A ew Appoach Alaka Padhye, and
More informationMath 124B February 02, 2012
Math 24B Febuay 02, 202 Vikto Gigoyan 8 Laplace s equation: popeties We have aleady encounteed Laplace s equation in the context of stationay heat conduction and wave phenomena. Recall that in two spatial
More informationC e f paamete adaptation f (' x) ' ' d _ d ; ; e _e K p K v u ^M() RBF NN ^h( ) _ obot s _ s n W ' f x x xm xm f x xm d Figue : Block diagam of comput
A Neual-Netwok Compensato with Fuzzy Robustication Tems fo Impoved Design of Adaptive Contol of Robot Manipulatos Y.H. FUNG and S.K. TSO Cente fo Intelligent Design, Automation and Manufactuing City Univesity
More informationJournal of Inequalities in Pure and Applied Mathematics
Jounal of Inequalities in Pue and Applied Mathematics COEFFICIENT INEQUALITY FOR A FUNCTION WHOSE DERIVATIVE HAS A POSITIVE REAL PART S. ABRAMOVICH, M. KLARIČIĆ BAKULA AND S. BANIĆ Depatment of Mathematics
More informationAbsorption Rate into a Small Sphere for a Diffusing Particle Confined in a Large Sphere
Applied Mathematics, 06, 7, 709-70 Published Online Apil 06 in SciRes. http://www.scip.og/jounal/am http://dx.doi.og/0.46/am.06.77065 Absoption Rate into a Small Sphee fo a Diffusing Paticle Confined in
More informationTight Upper Bounds for the Expected Loss of Lexicographic Heuristics in Binary Multi-attribute Choice
Tight Uppe Bounds fo the Expected Loss of Lexicogaphic Heuistics in Binay Multi-attibute Choice Juan A. Caasco, Manel Baucells Except fo fomatting details and the coection of some eata, this vesion matches
More informationCALCULUS II Vectors. Paul Dawkins
CALCULUS II Vectos Paul Dawkins Table of Contents Peface... ii Vectos... 3 Intoduction... 3 Vectos The Basics... 4 Vecto Aithmetic... 8 Dot Poduct... 13 Coss Poduct... 21 2007 Paul Dawkins i http://tutoial.math.lama.edu/tems.aspx
More information556: MATHEMATICAL STATISTICS I
556: MATHEMATICAL STATISTICS I CHAPTER 5: STOCHASTIC CONVERGENCE The following efinitions ae state in tems of scala anom vaiables, but exten natually to vecto anom vaiables efine on the same obability
More informationChapter 5 Linear Equations: Basic Theory and Practice
Chapte 5 inea Equations: Basic Theoy and actice In this chapte and the next, we ae inteested in the linea algebaic equation AX = b, (5-1) whee A is an m n matix, X is an n 1 vecto to be solved fo, and
More informationEncapsulation theory: the transformation equations of absolute information hiding.
1 Encapsulation theoy: the tansfomation equations of absolute infomation hiding. Edmund Kiwan * www.edmundkiwan.com Abstact This pape descibes how the potential coupling of a set vaies as the set is tansfomed,
More informationWeighted least-squares estimators of parametric functions of the regression coefficients under a general linear model
Ann Inst Stat Math (2010) 62:929 941 DOI 10.1007/s10463-008-0199-8 Weighted least-squaes estimatos of paametic functions of the egession coefficients unde a geneal linea model Yongge Tian Received: 9 Januay
More informationQIP Course 10: Quantum Factorization Algorithm (Part 3)
QIP Couse 10: Quantum Factoization Algoithm (Pat 3 Ryutaoh Matsumoto Nagoya Univesity, Japan Send you comments to yutaoh.matsumoto@nagoya-u.jp Septembe 2018 @ Tokyo Tech. Matsumoto (Nagoya U. QIP Couse
More information2 Governing Equations
2 Govening Equations This chapte develops the govening equations of motion fo a homogeneous isotopic elastic solid, using the linea thee-dimensional theoy of elasticity in cylindical coodinates. At fist,
More informationChapter 3 Optical Systems with Annular Pupils
Chapte 3 Optical Systems with Annula Pupils 3 INTRODUCTION In this chapte, we discuss the imaging popeties of a system with an annula pupil in a manne simila to those fo a system with a cicula pupil The
More informationNotes on McCall s Model of Job Search. Timothy J. Kehoe March if job offer has been accepted. b if searching
Notes on McCall s Model of Job Seach Timothy J Kehoe Mach Fv ( ) pob( v), [, ] Choice: accept age offe o eceive b and seach again next peiod An unemployed oke solves hee max E t t y t y t if job offe has
More information1.2 Differential cross section
.2. DIFFERENTIAL CROSS SECTION Febuay 9, 205 Lectue VIII.2 Diffeential coss section We found that the solution to the Schodinge equation has the fom e ik x ψ 2π 3/2 fk, k + e ik x and that fk, k = 2 m
More informationEfficiency Loss in a Network Resource Allocation Game: The Case of Elastic Supply
Efficiency Loss in a Netwok Resouce Allocation Game: The Case of Elastic Supply axiv:cs/0506054v1 [cs.gt] 14 Jun 2005 Ramesh Johai (johai@stanfod.edu) Shie Manno (shie@mit.edu) John N. Tsitsiklis (jnt@mit.edu)
More informationA STUDY OF HAMMING CODES AS ERROR CORRECTING CODES
AGU Intenational Jounal of Science and Technology A STUDY OF HAMMING CODES AS ERROR CORRECTING CODES Ritu Ahuja Depatment of Mathematics Khalsa College fo Women, Civil Lines, Ludhiana-141001, Punjab, (India)
More informationANA BERRIZBEITIA, LUIS A. MEDINA, ALEXANDER C. MOLL, VICTOR H. MOLL, AND LAINE NOBLE
THE p-adic VALUATION OF STIRLING NUMBERS ANA BERRIZBEITIA, LUIS A. MEDINA, ALEXANDER C. MOLL, VICTOR H. MOLL, AND LAINE NOBLE Abstact. Let p > 2 be a pime. The p-adic valuation of Stiling numbes of the
More informationPreliminary Exam: Quantum Physics 1/14/2011, 9:00-3:00
Peliminay Exam: Quantum Physics /4/ 9:-: Answe a total of SIX questions of which at least TWO ae fom section A and at least THREE ae fom section B Fo you answes you can use eithe the blue books o individual
More informationHigh Dimensional Clustering with r-nets
High Dimensional Clusteing with -nets Geogia Avaikioti, Alain Ryse, Yuyi Wang, Roge Wattenhofe ETH Zuich, Switzeland {zetava,ayse,yuwang,wattenhofe}@ethz.ch Abstact Clusteing, a fundamental task in data
More informationLET a random variable x follows the two - parameter
INTERNATIONAL JOURNAL OF MATHEMATICS AND SCIENTIFIC COMPUTING ISSN: 2231-5330, VOL. 5, NO. 1, 2015 19 Shinkage Bayesian Appoach in Item - Failue Gamma Data In Pesence of Pio Point Guess Value Gyan Pakash
More informationFOURIER-BESSEL SERIES AND BOUNDARY VALUE PROBLEMS IN CYLINDRICAL COORDINATES
FOURIER-BESSE SERIES AND BOUNDARY VAUE PROBEMS IN CYINDRICA COORDINATES The paametic Bessel s equation appeas in connection with the aplace opeato in pola coodinates. The method of sepaation of vaiables
More informationModel and Controller Order Reduction for Infinite Dimensional Systems
IT J. Eng. Sci., Vol. 4, No.,, -6 Model and Contolle Ode Reduction fo Infinite Dimensional Systems Fatmawati,*, R. Saagih,. Riyanto 3 & Y. Soehayadi Industial and Financial Mathematics Goup email: fatma47@students.itb.ac.id;
More informationTo Feel a Force Chapter 7 Static equilibrium - torque and friction
To eel a oce Chapte 7 Chapte 7: Static fiction, toque and static equilibium A. Review of foce vectos Between the eath and a small mass, gavitational foces of equal magnitude and opposite diection act on
More informationEfficiency Loss in a Network Resource Allocation Game
Efficiency Loss in a Netwok Resouce Allocation Game Ramesh Johai johai@mit.edu) John N. Tsitsiklis jnt@mit.edu) June 11, 2004 Abstact We exploe the popeties of a congestion game whee uses of a congested
More informationMath 301: The Erdős-Stone-Simonovitz Theorem and Extremal Numbers for Bipartite Graphs
Math 30: The Edős-Stone-Simonovitz Theoem and Extemal Numbes fo Bipatite Gaphs May Radcliffe The Edős-Stone-Simonovitz Theoem Recall, in class we poved Tuán s Gaph Theoem, namely Theoem Tuán s Theoem Let
More informationMASSACHUSETTS INSTITUTE OF TECHNOLOGY Physics Department. Problem Set 10 Solutions. r s
MASSACHUSETTS INSTITUTE OF TECHNOLOGY Physics Depatment Physics 8.033 Decembe 5, 003 Poblem Set 10 Solutions Poblem 1 M s y x test paticle The figue above depicts the geomety of the poblem. The position
More informationEstimation of entropy rate and Rényi entropy rate for Markov chains
06 IEEE Intenational Symposium on Infomation Theoy Estimation of entopy ate and Rényi entopy ate fo Makov chains Sudeep Kamath, Segio Vedú Depatment of Electical Engineeing Pinceton Univesity e-mail: sukamath,
More informationThe Substring Search Problem
The Substing Seach Poblem One algoithm which is used in a vaiety of applications is the family of substing seach algoithms. These algoithms allow a use to detemine if, given two chaacte stings, one is
More informationChapter 7-8 Rotational Motion
Chapte 7-8 Rotational Motion What is a Rigid Body? Rotational Kinematics Angula Velocity ω and Acceleation α Unifom Rotational Motion: Kinematics Unifom Cicula Motion: Kinematics and Dynamics The Toque,
More informationRoot-Exponential Accuracy for Coarse
Root-Exponential Accuacy fo Coase 1 Quantization of Finite Fame Expansions Felix Kahme, Rayan Saab, and Rachel Wad Abstact In this note, we show that by quantizing the N-dimensional fame coefficients of
More informationarxiv: v2 [physics.data-an] 15 Jul 2015
Limitation of the Least Squae Method in the Evaluation of Dimension of Factal Bownian Motions BINGQIANG QIAO,, SIMING LIU, OUDUN ZENG, XIANG LI, and BENZONG DAI Depatment of Physics, Yunnan Univesity,
More informationPearson s Chi-Square Test Modifications for Comparison of Unweighted and Weighted Histograms and Two Weighted Histograms
Peason s Chi-Squae Test Modifications fo Compaison of Unweighted and Weighted Histogams and Two Weighted Histogams Univesity of Akueyi, Bogi, v/noduslód, IS-6 Akueyi, Iceland E-mail: nikolai@unak.is Two
More informationResearch Design - - Topic 17 Multiple Regression & Multiple Correlation: Two Predictors 2009 R.C. Gardner, Ph.D.
Reseach Design - - Topic 7 Multiple Regession & Multiple Coelation: Two Pedictos 009 R.C. Gadne, Ph.D. Geneal Rationale and Basic Aithmetic fo two pedictos Patial and semipatial coelation Regession coefficients
More informationON THE INVERSE SIGNED TOTAL DOMINATION NUMBER IN GRAPHS. D.A. Mojdeh and B. Samadi
Opuscula Math. 37, no. 3 (017), 447 456 http://dx.doi.og/10.7494/opmath.017.37.3.447 Opuscula Mathematica ON THE INVERSE SIGNED TOTAL DOMINATION NUMBER IN GRAPHS D.A. Mojdeh and B. Samadi Communicated
More informationA Simple Nonparametric Approach to Estimating the Distribution of Random Coefficients in Structural Models
A Simple Nonpaametic Appoach to Estimating the Distibution of Random Coefficients in Stuctual Models Jeemy T. Fox Rice Univesity & NBER Kyoo il Kim Michigan State Univesity May 2016 Chenyu Yang Univesity
More informationCOUPLED MODELS OF ROLLING, SLIDING AND WHIRLING FRICTION
ENOC 008 Saint Petesbug Russia June 30-July 4 008 COUPLED MODELS OF ROLLING SLIDING AND WHIRLING FRICTION Alexey Kieenkov Ins ti tu te fo P ob le ms in Me ch an ic s Ru ss ia n Ac ad em y of Sc ie nc es
More informationChaos and bifurcation of discontinuous dynamical systems with piecewise constant arguments
Malaya Jounal of Matematik ()(22) 4 8 Chaos and bifucation of discontinuous dynamical systems with piecewise constant aguments A.M.A. El-Sayed, a, and S. M. Salman b a Faculty of Science, Aleandia Univesity,
More informationBrief summary of functional analysis APPM 5440 Fall 2014 Applied Analysis
Bief summay of functional analysis APPM 5440 Fall 014 Applied Analysis Stephen Becke, stephen.becke@coloado.edu Standad theoems. When necessay, I used Royden s and Keyzsig s books as a efeence. Vesion
More informationMean Curvature and Shape Operator of Slant Immersions in a Sasakian Space Form
Mean Cuvatue and Shape Opeato of Slant Immesions in a Sasakian Space Fom Muck Main Tipathi, Jean-Sic Kim and Son-Be Kim Abstact Fo submanifolds, in a Sasakian space fom, which ae tangential to the stuctue
More informationarxiv: v1 [math.co] 1 Apr 2011
Weight enumeation of codes fom finite spaces Relinde Juius Octobe 23, 2018 axiv:1104.0172v1 [math.co] 1 Ap 2011 Abstact We study the genealized and extended weight enumeato of the - ay Simplex code and
More informationOn the global uniform asymptotic stability of time-varying dynamical systems
Stud. Univ. Babeş-Bolyai Math. 59014), No. 1, 57 67 On the global unifom asymptotic stability of time-vaying dynamical systems Zaineb HajSalem, Mohamed Ali Hammami and Mohamed Mabouk Abstact. The objective
More information