Online Learning & Game Theory
- Dorothy Stevens
- 5 years ago
Transcription
1 Online Learning & Game Theory: A quick overview with recent results. Vianney Perchet, Laboratoire Probabilités et Modèles Aléatoires, Univ. Paris-Diderot. Journées MAS, August 2014
2 Starting Examples
3 Starting Examples
4 Outline
1. First, in a stochastic environment (i.i.d. processes)
2. Then, in an adversarial environment (or individual sequences)
3. Finally, some links with game theory
5 Stochastic Environment (Regret, Regret Minimization, Extensions)
First Part: Stochastic environment
6 Estimation of Means
- $K = 2$ discrete-time processes: $X_n^{(1)}, X_n^{(2)} \in [0, 1]$, the payoff of ad 1/2 on query $n$.
- Estimate the means $\mu^{(1)}, \mu^{(2)}$.
- Hoeffding inequality (exponential decay): $|\bar X_n^{(k)} - \mu^{(k)}| > \varepsilon$ with probability at most $2\exp(-2n\varepsilon^2)$.
- Finite number of mistakes: $\mathbb{E} \sum_{n \in \mathbb{N}} \mathbf{1}\{|\bar X_n^{(k)} - \mu^{(k)}| > \varepsilon\} \le 1/\varepsilon^2$.
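The "finite number of mistakes" bound follows from summing the Hoeffding tail over all stages. A short derivation, assuming $\bar X_n^{(k)}$ is the empirical mean of $n$ i.i.d. samples in $[0,1]$ (the indexing from $n = 1$ is an assumption of this sketch):

```latex
\begin{align*}
\mathbb{E} \sum_{n \ge 1} \mathbf{1}\{|\bar X_n^{(k)} - \mu^{(k)}| > \varepsilon\}
  &= \sum_{n \ge 1} \mathbb{P}\{|\bar X_n^{(k)} - \mu^{(k)}| > \varepsilon\} \\
  &\le \sum_{n \ge 1} 2 \exp(-2 n \varepsilon^2)
   = \frac{2}{e^{2\varepsilon^2} - 1} \\
  &\le \frac{2}{2\varepsilon^2} = \frac{1}{\varepsilon^2},
\end{align*}
% the last step uses e^x - 1 >= x with x = 2 epsilon^2
```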
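7 Regret Minimization
- Choose one ad to display, $k_n$. Reward: $X_n^{(k_n)}$.
- Maximize the cumulative reward $\sum_{m=1}^n X_m^{(k_m)}$ or $\sum_{m=1}^n \mu^{(k_m)}$.
- Minimize the regret [Hannan 56]: $R_n = n\mu^\star - \sum_{m=1}^n \mu^{(k_m)}$, with $\mu^\star = \max\{\mu^{(1)}, \mu^{(2)}\}$.
- Equivalent formulation with $\Delta = \mu^\star - \mu^{(k)}$ for the suboptimal arm $k$: $R_n = \Delta \sum_{m=1}^n \mathbf{1}\{k_m \ne k^\star\}$.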
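8 Stochastic & Full Monitoring
- Full monitoring: all values $X_n^{(1)}, X_n^{(2)}$ are observed.
- Optimal algorithm: $k_{n+1} = \arg\max_k \bar X_n^{(k)}$, with $\mathbb{E} R_n \lesssim 1/\Delta$ and, for small $\Delta$, $\mathbb{E} R_n \lesssim \Delta n$.
- Bounded regret, uniformly in $n$! Given $n$, the worst $\Delta$ is $1/\sqrt{n}$ and $\mathbb{E} R_n \lesssim \sqrt{n}$.
- But in the examples, only $X_n^{(k_n)}$ is observed (bandit monitoring)!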
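9 Stochastic & Bandit Monitoring
- $\bar X_n^{(k)} = \frac{1}{n}\sum_{m=1}^n X_m^{(k)}$ is not available, only $\hat X_n^{(k)} = \frac{\sum_{m \le n:\, k_m = k} X_m^{(k)}}{|\{m \le n : k_m = k\}|}$; with $k_{n+1} = \arg\max_k \hat X_n^{(k)}$, $\mathbb{E} R_n = \Theta(n)$.
- Balance exploitation (play the arg max) and exploration (play the arg min) to get information.
- Upper Confidence Bound [Auer, Cesa-Bianchi, Fischer 02]: $k_{n+1} = \arg\max_k \hat X_n^{(k)} + \sqrt{2\log(n) / |\{m \le n : k_m = k\}|}$, with $\mathbb{E} R_n \lesssim \log(n)/\Delta$.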
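The UCB index rule can be sketched in a few lines. This is a minimal illustration under assumptions not in the slides: Bernoulli rewards, one initial pull per arm, and a hypothetical function name `ucb`; it returns the pseudo-regret $\sum_m (\mu^\star - \mu^{(k_m)})$.

```python
import math
import random


def ucb(means, horizon, seed=0):
    """Run the UCB index policy on Bernoulli arms (assumed reward model).

    Returns (pseudo_regret, pull_counts).
    """
    rng = random.Random(seed)
    n_arms = len(means)
    counts = [0] * n_arms   # number of pulls of each arm
    sums = [0.0] * n_arms   # cumulative observed reward of each arm
    best = max(means)
    regret = 0.0
    for n in range(1, horizon + 1):
        if n <= n_arms:
            k = n - 1       # pull every arm once so the index is defined
        else:
            # index = empirical mean + exploration bonus sqrt(2 log n / pulls)
            k = max(
                range(n_arms),
                key=lambda a: sums[a] / counts[a]
                + math.sqrt(2.0 * math.log(n) / counts[a]),
            )
        reward = 1.0 if rng.random() < means[k] else 0.0
        counts[k] += 1
        sums[k] += reward
        regret += best - means[k]
    return regret, counts
```

For instance, `ucb([0.3, 0.7], 5000)` should pull the second arm far more often than the first, so the pseudo-regret grows much more slowly than the horizon.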
10 New policy: Explore Then Commit [P, Rigollet 13]
- Finite horizon $N \in \mathbb{N}$ given.
- 1) Play arms 1 and 2 alternately as long as $|\hat X_n^{(1)} - \hat X_n^{(2)}| \le 2\sqrt{2\log(4n/\ldots)/\ldots}$.
- 2) Then play the best arm forever.
- $\mathbb{E} R_N \lesssim \log(N\Delta^2)/\Delta$.
- Worst case ($\Delta \approx 1/\sqrt{N}$): full monitoring & ETC, $\sqrt{N}$; UCB, $\sqrt{N\log(N)}$. Otherwise, $\log(N\Delta^2)/\Delta$ vs $1/\Delta$ with full information.
- Bandit vs full monitoring: logarithmic vs bounded regret; same worst case.
11 Bounded Regret? [Lai, Robbins 84], [Bubeck, P, Rigollet 13]
- Without additional assumption, no: lower bound in $\log(n)/\Delta$.
- With any given intermediate value $\mu \in (\mu^{(1)}, \mu^{(2)})$, yes: if $\hat X^{(1)}$ or $\hat X^{(2)}$ is above $\mu$, then $k = \arg\max_k \hat X^{(k)}$. Otherwise, play both arms alternately.
- $\hat X^\star < \mu$ on at most about $1/(\mu^\star - \mu)^2$ stages (same argument for the other arm).
- If $\mu^\star$ and $\Delta$ are known: $\mathbb{E} R_n \lesssim 1/\Delta$, as with full monitoring.
- If only $\mu^\star$ is known: $\mathbb{E} R_n \lesssim \log(1/\Delta^2)/\Delta$.
12 More General Frameworks & Results
Results in the worst case ("distribution independent bounds"):
- Multi-armed bandit [Auer, Cesa-Bianchi, Freund, Schapire 02], [Audibert, Bubeck 09]: $K > 2$ arms, $\mathbb{E} R_n \lesssim \sqrt{Kn}$.
- Continuous bandit [Kleinberg 08], [Bubeck, Munos, Stoltz, Szepesvári 11]: infinite set of arms, $x \in [0, 1]^d$ and $\mu(\cdot)$ Lipschitz, $\mathbb{E} R_n \lesssim n^{\frac{d+1}{d+2}}$.
- Linear bandit [Dani, Hayes, Kakade 08], [Zinkevich 02], [Abernethy, Hazan, Rakhlin 08]: $x \in [0, 1]^d$ and $\mu(\cdot)$ linear, $\mathbb{E} R_n \lesssim \sqrt{n}$.
- Bandit with covariates (cf. the Google example) [P, Rigollet 13], [Bull 14]: covariates $\omega \in [0, 1]^d$, $\mathbb{E}[X^{(k)} \mid \omega] = \mu^{(k)}(\omega)$ 1-Lipschitz, $\mathbb{E} R_n \lesssim n^{\frac{d+1}{d+2}}$.
- Higher-order bounds / small losses / sparsity [Hazan, Kale 10], [Gerchinovitz 13], [Cappé, Garivier, Maillard, Munos, Stoltz 13], [Gaillard, Stoltz, van Erven 14]: $n$ vs $\sum_{m=1}^n (X_m^{(k)} - \mu^{(k)})^2$, $\sum_{m=1}^n \sum_{k=1}^K p_m^{(k)} (X_m^{(k)})^2$.
13 Adversarial Environment (Distribution Independent, An Algorithm, Internal Regret)
Second Part: Adversarial environment
What we have learned so far:
- In worst-case analysis: regret minimization in $\sqrt{n\log(K)}$ with full monitoring; up to $\sqrt{K}$, learning is as fast with bandit monitoring as with full monitoring.
- In distribution-dependent (not worst-case) analysis: bounded regret in $\sum_k 1/\Delta_k$; an additional assumption is required to learn as fast with bandit monitoring.
14 Adversarial World
- In the examples, data are not i.i.d. Spam senders can even adapt to spam filters, that is: the law of $X_{n+1}^{(k)}$ can depend on $X_1^{(1)}, \ldots, X_n^{(1)}, \ldots, X_1^{(K)}, \ldots, X_n^{(K)}$, and even on the previous choices $k_1, \ldots, k_n$.
- The environment can adapt and choose rewards strategically.
- Same definition of regret (except that the arg max changes with time): $R_n = \max_k \sum_{m=1}^n X_m^{(k)} - \sum_{m=1}^n X_m^{(k_m)}$.
- Goal: a policy with sublinear regret $o(n)$ against ANY possible strategy of the environment (in particular, any sequence $X_n^{(k)}$).
15 A Popular Algorithm with Full Monitoring
- With $k_{n+1} = \arg\max_k \bar X_n^{(k)}$, $\mathbb{E} R_n = \Theta(n)$. With any deterministic policy, $\mathbb{E} R_n = \Theta(n)$.
- Play $k_{n+1} = k$ with probability $\frac{\exp(\eta \sum_{m=1}^n X_m^{(k)})}{\sum_{j=1}^K \exp(\eta \sum_{m=1}^n X_m^{(j)})}$; temperature $\eta \approx \sqrt{\log(K)/n}$.
- Regret of exponential weights [Auer, Cesa-Bianchi, Freund, Schapire 02]: $\mathbb{E} R_n \lesssim \sqrt{n\log(K)}$, $\forall n \in \mathbb{N}$.
- Same dependency in $n$ as worst-case i.i.d., optimal in $K$.
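The exponential weights rule can be sketched as follows. This is a minimal illustration, assuming rewards in $[0, 1]$ supplied as a table `rewards[m][k]` (a hypothetical input format), with the function name `exponential_weights` chosen here for illustration.

```python
import math
import random


def exponential_weights(rewards, eta, seed=0):
    """Play exponential weights on a reward table rewards[m][k] in [0, 1].

    Returns (regret, choices), with regret measured against the best fixed arm.
    """
    rng = random.Random(seed)
    n_arms = len(rewards[0])
    cumulative = [0.0] * n_arms   # sum of past rewards of each arm
    earned = 0.0
    choices = []
    for row in rewards:
        # subtract the max before exponentiating, for numerical stability
        top = max(cumulative)
        weights = [math.exp(eta * (c - top)) for c in cumulative]
        k = rng.choices(range(n_arms), weights=weights)[0]
        choices.append(k)
        earned += row[k]
        # full monitoring: the reward of every arm is observed
        cumulative = [c + x for c, x in zip(cumulative, row)]
    return max(cumulative) - earned, choices
```

On an alternating sequence such as `[[1, 0], [0, 1], [1, 0], ...]`, which defeats "follow the leader", the randomized rule keeps the regret small compared with the number of stages.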
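16 Optimality and Bandit Monitoring
- Optimality: $\mathbb{E} R_n \gtrsim \sqrt{n\log(K)}$ if $X_n^{(k)} = \pm 1$ w.p. 1/2: then $\mathbb{E} \sum_{m=1}^n X_m^{(k_m)} = 0$ but $\mathbb{E} \max_k \sum_{m=1}^n X_m^{(k)} \approx \sqrt{n\log(K)}$.
- Bandit monitoring: exponential weights w.r.t. $\tilde X_n^{(k)} = \frac{\mathbf{1}\{k_n = k\}}{\mathbb{P}\{k_n = k\}} X_n^{(k)}$, an unbiased estimate of $X_n^{(k)}$: $\mathbb{E} R_n \lesssim \sqrt{nK\log(K)}$.
- Remark: optimal bounds are $\sqrt{nK}$.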
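The unbiasedness of the bandit estimate can be checked by direct enumeration. A small sketch, with hypothetical helper names `importance_weighted_estimate` and `expected_estimate`, assuming every arm has positive probability:

```python
def importance_weighted_estimate(row, k_played, probs):
    """Bandit estimate X^(k) * 1{k_n = k} / P{k_n = k}, for every arm k."""
    return [
        row[k] / probs[k] if k == k_played else 0.0
        for k in range(len(row))
    ]


def expected_estimate(row, probs):
    """Average the estimate over the random choice k_n ~ probs."""
    n_arms = len(row)
    mean = [0.0] * n_arms
    for played in range(n_arms):
        estimate = importance_weighted_estimate(row, played, probs)
        for k in range(n_arms):
            mean[k] += probs[played] * estimate[k]
    return mean
```

Averaging over the played arm recovers the full reward vector, which is why the estimate can be fed to exponential weights even though only one coordinate is observed.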
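17 Discrete/Continuous Time
- $\left(\frac{\exp(\eta \sum_{m=1}^n X_m^{(k)})}{\sum_{j=1}^K \exp(\eta \sum_{m=1}^n X_m^{(j)})}\right)_{k} = \nabla\Phi(V_n)$, with $V_n^{(k)} = \sum_{m=1}^n X_m^{(k)} - X_m^{(k_m)}$ and $\Phi(V) := \frac{1}{\eta}\log\left(\sum_{k=1}^K \exp(\eta V^{(k)})\right)$.
- Deterministic continuous approximation of a stochastic discrete process [Benaïm, Hofbauer, Sorin 06], [Benaïm, Faure 13]: $\mathbb{E}[V_{n+1}] - V_n = \left(X_{n+1}^{(k)} - \langle\nabla\Phi(V_n), X_{n+1}\rangle\right)_{k=1,\ldots,K}$.
- Stochastic approximation of $\dot V \in F(V) := \{U - \langle\nabla\Phi(V), U\rangle \mathbf{1};\ U \in \mathbb{R}^K\}$.
- Differential inclusion with Lyapunov function $\Phi(V)$: $\frac{d}{dt}\Phi(V) = \langle\dot V, \nabla\Phi(V)\rangle = \langle U - \langle U, \nabla\Phi(V)\rangle \mathbf{1}, \nabla\Phi(V)\rangle = 0$.
- $\lim R \le \lim \Phi(V) = \Phi(V(+\infty)) = \Phi(V(0)) = \log(K)/\eta$.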
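The Lyapunov property rests on the fact that the coordinates of $\nabla\Phi(V)$ sum to one. A short worked version, using only the definition of $\Phi$ as the log-sum-exp potential (the sandwich inequality at the end is a standard property of that potential, added here for context):

```latex
\begin{align*}
\big(\nabla\Phi(V)\big)_k
  &= \frac{\exp(\eta V^{(k)})}{\sum_{j=1}^K \exp(\eta V^{(j)})},
  \qquad \langle \nabla\Phi(V), \mathbf{1} \rangle = 1, \\
\frac{d}{dt}\Phi(V)
  &= \langle \nabla\Phi(V), U \rangle
   - \langle \nabla\Phi(V), U \rangle \, \langle \nabla\Phi(V), \mathbf{1} \rangle = 0
  \quad \text{for } \dot V = U - \langle \nabla\Phi(V), U \rangle \mathbf{1}, \\
\max_k V^{(k)} &\le \Phi(V) \le \max_k V^{(k)} + \frac{\log K}{\eta},
  \qquad \Phi(0) = \frac{\log K}{\eta}.
\end{align*}
```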
18 Refined Regret: Internal and Swap
- Regret: as well as the best constant strategy.
- Internal: on the stages where $k_n = k$, $k$ was the best choice [Foster, Vohra 99]: $R_n^{\mathrm{int}} = \max_k \max_j \sum_{m:\, k_m = k} X_m^{(j)} - X_m^{(k)}$.
- Swap: as well as $\varphi(k)$ instead of $k$, $\varphi : [K] \to [K]$ [Blum, Mansour 07]: $R_n^{\mathrm{swap}} = \max_{\varphi : [K] \to [K]} \sum_{m=1}^n X_m^{(\varphi(k_m))} - X_m^{(k_m)}$.
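The three notions of regret can be compared on a recorded play history. A minimal sketch, assuming a reward table `rewards[m][k]` and the list of played arms `choices` (hypothetical input format and function names); it uses the fact that the maximum over $\varphi$ decomposes arm by arm.

```python
def _gain(rewards, choices, k, j):
    """Gain from playing j on the stages where k was played."""
    return sum(row[j] - row[k] for row, c in zip(rewards, choices) if c == k)


def external_regret(rewards, choices):
    """Best fixed arm in hindsight minus the reward actually earned."""
    n_arms = len(rewards[0])
    earned = sum(row[c] for row, c in zip(rewards, choices))
    return max(sum(row[j] for row in rewards) for j in range(n_arms)) - earned


def internal_regret(rewards, choices):
    """Largest gain from replacing one arm k by one arm j."""
    n_arms = len(rewards[0])
    return max(
        _gain(rewards, choices, k, j)
        for k in range(n_arms)
        for j in range(n_arms)
    )


def swap_regret(rewards, choices):
    """Best map phi: [K] -> [K]; the max over phi splits into one max per arm."""
    n_arms = len(rewards[0])
    return sum(
        max(_gain(rewards, choices, k, j) for j in range(n_arms))
        for k in range(n_arms)
    )
```

With rewards alternating between `[1, 0]` and `[0, 1]` and a player who always picks the wrong arm, the external and internal regrets are equal while the swap regret is twice as large.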
19 General regret
- Regret: as well as the best constant strategy.
- General: as well as $\xi(k_1, \ldots, k_n)$ instead of $k_n$, $\xi \in \Xi$ [Lehrer 02]: $R_n^{\mathrm{gen}} = \max_{\xi \in \Xi} \sum_{m=1}^n X_m^{(\xi(k_1, \ldots, k_m))} - X_m^{(k_m)}$.
- Generalized version of exponential weights [P 14]: $\mathbb{E} R_n^{\mathrm{gen}} \lesssim \sqrt{n\log(|\Xi|)}$.
- Internal regret: $\sqrt{n\log(K)}$; swap regret: $\sqrt{nK\log(K)}$.
20 Game Theory (Nash Equilibria, Other equilibria)
Third Part: Links with Game Theory
What we have learned in the previous section:
- In worst-case analysis: learning is as fast in an adversarial as in a stochastic environment.
- In the adversarial framework: refined notions of regret can be minimized.
21 Against Opponents - Game Theory
- $X_n^{(k)}$ is not arbitrary, but induced by the choices of another player.
- TWO players, simultaneous actions in $\{1, \ldots, K\}$ and $\{1, \ldots, L\}$.
- Payoffs are defined by two matrices $A \in \mathbb{R}^{K \times L}$ and $B \in \mathbb{R}^{K \times L}$.
- Player 1 picks a row $k \in \{1, \ldots, K\}$ and Player 2 a column $l \in \{1, \ldots, L\}$. Player 1 gets $A_{k,l}$ and Player 2 gets $B_{k,l}$.
- Choices can be random: $p \in \Delta([K])$ and $q \in \Delta([L])$. Player 1 gets $\sum_{k,l} p_k q_l A_{k,l} = p^T A q$; Player 2 gets $p^T B q$.
- Online learning: $X_n^{(k)} = A_{k,l_n}$ and $Y_n^{(l)} = B_{k_n,l}$.
- Assume both players minimize regret independently. Do they learn a solution concept from game theory?
22 Nash Equilibria
- A Nash equilibrium is a situation where no player has an interest in changing his action [Nash 50], [Nash 51].
- A Nash equilibrium is a pair $(p^\star, q^\star) \in \Delta([K]) \times \Delta([L])$ such that:
- Player 1 has no interest in changing, given $q^\star$: $(p^\star)^T A q^\star \ge p^T A q^\star$, $\forall p \in \Delta([K])$.
- Player 2 has no interest in changing, given $p^\star$: $(p^\star)^T B q^\star \ge (p^\star)^T B q$, $\forall q \in \Delta([L])$.
- There always exist Nash equilibria; generically an odd number [Nash 50], [Nash 51], [Shapley 74].
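The two equilibrium conditions can be checked numerically. Since payoffs are linear in each player's own mixed strategy, it is enough to test pure deviations. A minimal sketch with hypothetical function names, illustrated on matching pennies (an example game not taken from the slides):

```python
def expected_payoff(p, M, q):
    """Bilinear payoff p^T M q for mixed strategies p and q."""
    return sum(
        p[k] * M[k][l] * q[l]
        for k in range(len(p))
        for l in range(len(q))
    )


def is_nash(p, q, A, B, tol=1e-9):
    """True if no pure deviation improves either player's expected payoff."""
    K, L = len(A), len(A[0])
    u1 = expected_payoff(p, A, q)
    u2 = expected_payoff(p, B, q)
    # best pure row against q, best pure column against p
    best_row = max(sum(A[k][l] * q[l] for l in range(L)) for k in range(K))
    best_col = max(sum(p[k] * B[k][l] for k in range(K)) for l in range(L))
    return best_row <= u1 + tol and best_col <= u2 + tol


# Matching pennies: Player 1 wants to match, Player 2 wants to mismatch.
PENNIES_A = [[1, -1], [-1, 1]]
PENNIES_B = [[-1, 1], [1, -1]]
```

Here `is_nash([0.5, 0.5], [0.5, 0.5], PENNIES_A, PENNIES_B)` holds, while any pure profile fails because one of the two players always wants to switch.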
23 Are Nash Equilibria Learnable?
- Both players minimize their regret independently: $k_n \sim p_n \in \Delta([K])$, $l_n \sim q_n \in \Delta([L])$.
- Learning Nash equilibria could mean:
- $(p_n, q_n) \in \Delta([K]) \times \Delta([L])$ converges to a NE, or to the set of NE;
- $\left(\frac{1}{n}\sum_{m=1}^n \delta_{k_m}, \frac{1}{n}\sum_{m=1}^n \delta_{l_m}\right) \in \Delta([K]) \times \Delta([L])$ converges to a NE, or to the set of NE;
- $\frac{1}{n}\sum_{m=1}^n \delta_{k_m, l_m} \in \Delta([K] \times [L])$ converges to a NE, or to the set of NE.
- Nash equilibria are not learnable (independently) [Hart, Mas-Colell 04]: there always exists a game such that none of these convergences occurs.
- What is learnable? Correlated equilibria, minmax value, potential equilibria [Coucheney, Gaujal, Mertikopoulos].
24 Correlated Equilibria
- Players use an external device to correlate (such as traffic lights); when they are told to take an action (such as stop or go), it is optimal to do so.
- A correlated equilibrium is a distribution $\pi \in \Delta([K] \times [L])$. Draw $(k, l) \sim \pi$; P1 is told secretly to play $k$, P2 to play $l$.
- If P1 plays $k \in [K]$, he gets $\sum_{l \in [L]} \pi_{k,l} A_{k,l}$. If he plays $j \in [K]$ instead, he would get $\sum_{l \in [L]} \pi_{k,l} A_{j,l}$.
- Condition: $\sum_{l \in [L]} \pi_{k,l} A_{k,l} \ge \sum_{l \in [L]} \pi_{k,l} A_{j,l}$, for all $k, j \in [K]$.
- Similar to no internal regret! If both players minimize internal regret, the empirical distribution of actions converges to the set of correlated equilibria [Foster, Vohra 99].
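The obedience constraints can be checked directly on a joint distribution. A minimal sketch: the slide states the constraint for P1, and the symmetric constraint for P2 (with matrix $B$ and deviations over columns) is assumed here. The crossroads payoffs below are hypothetical numbers chosen to mimic the traffic-light story.

```python
def is_correlated_equilibrium(pi, A, B, tol=1e-9):
    """Check both players' obedience constraints for a joint law pi[k][l]."""
    K, L = len(A), len(A[0])
    # Player 1: when told k, deviating to j must not pay.
    for k in range(K):
        obey = sum(pi[k][l] * A[k][l] for l in range(L))
        for j in range(K):
            if sum(pi[k][l] * A[j][l] for l in range(L)) > obey + tol:
                return False
    # Player 2: when told l, deviating to j must not pay.
    for l in range(L):
        obey = sum(pi[k][l] * B[k][l] for k in range(K))
        for j in range(L):
            if sum(pi[k][l] * B[k][j] for k in range(K)) > obey + tol:
                return False
    return True


# Hypothetical crossroads game: action 0 = stop, action 1 = go.
CROSS_A = [[0, 0], [1, -10]]    # row player's payoffs
CROSS_B = [[0, 1], [0, -10]]    # column player's payoffs
TRAFFIC_LIGHT = [[0.0, 0.5], [0.5, 0.0]]   # exactly one driver is told to go
BOTH_GO = [[0.0, 0.0], [0.0, 1.0]]         # both drivers are told to go
```

The traffic-light distribution satisfies every constraint, whereas telling both drivers to go does not, since each would rather stop.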
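25 Minmax Theory
- In zero-sum games, players have optimal strategies.
- Zero-sum: $B = -A$; P1 maximizes and P2 minimizes $p^T A q$.
- Value $= \max_{p \in \Delta([K])} \min_{q \in \Delta([L])} p^T A q = \min_{q \in \Delta([L])} \max_{p \in \Delta([K])} p^T A q$.
- $p^\star$ is optimal if $(p^\star)^T A q \ge$ Value for all $q \in \Delta([L])$.
- $R_n/n \to 0 \implies \frac{1}{n}\sum_{m=1}^n X_m^{(k_m)} \to$ Value.
- $\left(\frac{1}{n}\sum_{m=1}^n \delta_{k_m}, \frac{1}{n}\sum_{m=1}^n \delta_{l_m}\right)$ converges to optimal strategies, i.e. to a NE.
- NE are quickly learnable in zero-sum games, at rate $O(1/\sqrt{n})$ [Harris 98].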
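Convergence of the time-averaged strategies can be observed in self-play. The sketch below is a simplification and not the procedure of the slides: both players run exponential weights on expected payoffs against the opponent's current mixed strategy (rather than on sampled actions), which makes the run deterministic. The function name `self_play` and the example matrix are hypothetical.

```python
import math


def _softmax(scores, eta):
    """Exponential weights distribution, shifted by the max for stability."""
    top = max(scores)
    weights = [math.exp(eta * (s - top)) for s in scores]
    total = sum(weights)
    return [w / total for w in weights]


def self_play(A, n_rounds):
    """Both players run exponential weights in the zero-sum game A.

    Returns (avg_p, avg_q, lower, upper), where lower <= Value <= upper are
    the payoffs guaranteed by the averaged strategies.
    """
    K, L = len(A), len(A[0])
    eta = math.sqrt(math.log(max(K, L)) / n_rounds)
    cum_row = [0.0] * K   # cumulative payoff of each row (Player 1)
    cum_col = [0.0] * L   # cumulative payoff of each column (Player 2, -A)
    avg_p = [0.0] * K
    avg_q = [0.0] * L
    for _ in range(n_rounds):
        p = _softmax(cum_row, eta)
        q = _softmax(cum_col, eta)
        for k in range(K):
            avg_p[k] += p[k] / n_rounds
            cum_row[k] += sum(A[k][l] * q[l] for l in range(L))
        for l in range(L):
            avg_q[l] += q[l] / n_rounds
            cum_col[l] -= sum(p[k] * A[k][l] for k in range(K))
    upper = max(sum(A[k][l] * avg_q[l] for l in range(L)) for k in range(K))
    lower = min(sum(avg_p[k] * A[k][l] for k in range(K)) for l in range(L))
    return avg_p, avg_q, lower, upper
```

For `A = [[2, -1], [-1, 1]]`, whose value is 0.2 with optimal strategies (0.4, 0.6) for both players, the gap `upper - lower` is bounded by the sum of the two average regrets and shrinks as the number of rounds grows.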
26 Conclusion
- In worst-case analysis: with full monitoring, learning is as fast in an adversarial as in a stochastic environment; up to $\sqrt{K}$, learning is as fast with bandit monitoring as with full monitoring.
- In distribution-dependent (not worst-case) analysis: an additional assumption is required to learn as fast with bandit as with full monitoring.
- In the game-theoretic framework: Nash equilibria are not learnable in general; correlated equilibria are learnable (by minimizing internal regret); in zero-sum and potential games, equilibria are learnable.
- Fundamental textbook: [Cesa-Bianchi, Lugosi 06]
More information1 Introduction to reducing variance in Monte Carlo simulations
Copyright c 010 by Karl Sigma 1 Itroductio to reducig variace i Mote Carlo simulatios 11 Review of cofidece itervals for estimatig a mea I statistics, we estimate a ukow mea µ = E(X) of a distributio by
More informationMAT1026 Calculus II Basic Convergence Tests for Series
MAT026 Calculus II Basic Covergece Tests for Series Egi MERMUT 202.03.08 Dokuz Eylül Uiversity Faculty of Sciece Departmet of Mathematics İzmir/TURKEY Cotets Mootoe Covergece Theorem 2 2 Series of Real
More informationSolutions to HW Assignment 1
Solutios to HW: 1 Course: Theory of Probability II Page: 1 of 6 Uiversity of Texas at Austi Solutios to HW Assigmet 1 Problem 1.1. Let Ω, F, {F } 0, P) be a filtered probability space ad T a stoppig time.
More informationSolutions: Homework 3
Solutios: Homework 3 Suppose that the radom variables Y,...,Y satisfy Y i = x i + " i : i =,..., IID where x,...,x R are fixed values ad ",...," Normal(0, )with R + kow. Fid ˆ = MLE( ). IND Solutio: Observe
More informationSTA Object Data Analysis - A List of Projects. January 18, 2018
STA 6557 Jauary 8, 208 Object Data Aalysis - A List of Projects. Schoeberg Mea glaucomatous shape chages of the Optic Nerve Head regio i aimal models 2. Aalysis of VW- Kedall ati-mea shapes with a applicatio
More informationMachine Learning Brett Bernstein
Machie Learig Brett Berstei Week Lecture: Cocept Check Exercises Starred problems are optioal. Statistical Learig Theory. Suppose A = Y = R ad X is some other set. Furthermore, assume P X Y is a discrete
More informationsin(n) + 2 cos(2n) n 3/2 3 sin(n) 2cos(2n) n 3/2 a n =
60. Ratio ad root tests 60.1. Absolutely coverget series. Defiitio 13. (Absolute covergece) A series a is called absolutely coverget if the series of absolute values a is coverget. The absolute covergece
More informationEFFECTIVE WLLN, SLLN, AND CLT IN STATISTICAL MODELS
EFFECTIVE WLLN, SLLN, AND CLT IN STATISTICAL MODELS Ryszard Zieliński Ist Math Polish Acad Sc POBox 21, 00-956 Warszawa 10, Polad e-mail: rziel@impagovpl ABSTRACT Weak laws of large umbers (W LLN), strog
More informationTHE CLOSED FORMS OF CONVERGENT INFINITE SERIES ESTIMATION OF THE SERIES SUM OF NON-CLOSED FORM ALTERNATING SERIES TO A HIGH DEGREE OF PRECISION.
THE CLSED FRMS F CNERGENT INFINITE SERIES ESTIMATIN F THE SERIES SUM F NN-CLSED FRM ALTERNATING SERIES T A HIGH DEGREE F PRECISIN. Peter G.Bass. PGBass M er..0.0. www.relativityoais.co May 0 Abstract This
More informationLecture 19. Curve fitting I. 1 Introduction. 2 Fitting a constant to measured data
Lecture 9 Curve fittig I Itroductio Suppose we are preseted with eight poits of easured data (x i, y j ). As show i Fig. o the left, we could represet the uderlyig fuctio of which these data are saples
More informationOn Order of a Function of Several Complex Variables Analytic in the Unit Polydisc
ISSN 746-7659, Eglad, UK Joural of Iforatio ad Coutig Sciece Vol 6, No 3, 0, 95-06 O Order of a Fuctio of Several Colex Variables Aalytic i the Uit Polydisc Rata Kuar Dutta + Deartet of Matheatics, Siliguri
More informationDiscrete-Time Systems, LTI Systems, and Discrete-Time Convolution
EEL5: Discrete-Time Sigals ad Systems. Itroductio I this set of otes, we begi our mathematical treatmet of discrete-time s. As show i Figure, a discrete-time operates or trasforms some iput sequece x [
More informationENGI Series Page 6-01
ENGI 3425 6 Series Page 6-01 6. Series Cotets: 6.01 Sequeces; geeral term, limits, covergece 6.02 Series; summatio otatio, covergece, divergece test 6.03 Stadard Series; telescopig series, geometric series,
More informationIntroduction to Extreme Value Theory Laurens de Haan, ISM Japan, Erasmus University Rotterdam, NL University of Lisbon, PT
Itroductio to Extreme Value Theory Laures de Haa, ISM Japa, 202 Itroductio to Extreme Value Theory Laures de Haa Erasmus Uiversity Rotterdam, NL Uiversity of Lisbo, PT Itroductio to Extreme Value Theory
More information