Gärtner-Ellis Theorem and applications.
|
|
- Marjory Randall
- 5 years ago
- Views:
Transcription
1 Gärtner-Ellis Theorem and applications. Elena Kosygina July 25, 208 In this lecture we turn to the non-i.i.d. case and discuss Gärtner-Ellis theorem. As an application, we study Curie-Weiss model with random external magnetic fields. We have discussed Cramér s theorem for empirical means of i.i.d. random variables which take values in R. This theorem under exactly the same condition can be shown to hold for empirical means of i.i.d. random vectors with values in R d (see 2 Corollary 6..6). The result we consider in this note is much more general: measures µ n are not required to be distributions of empirical means, they can be arbitrary distributions on (R d, B(R d )) as long as the limit Λ(t) := lim n n ln Λ n(nt) := lim n n ln e nt,x dµ n [, ] () R d exists for all t R d. A typical situation which arises in applications is the following: we have a sequence of random vectors (Y n ) n N on possibly different probability spaces (Ω n, G n, P n ), Y n : Ω n R d, and µ n (A) := P n (Y n A) for every A B(R d ). In the special case when µ n is the distribution of empirical means of i.i.d. random vectors (X i ) i N with a common distribution µ we have Λ(t) = n ln Λ n(nt) = [E (e )] n ln t,x n i = ln e t,x dµ R d - just the familiar logarithmic MGF of µ. The condition is 0 DΛ o, where Λ is the logarithmic MGF of the common distribution µ on R d. 2 Amir Dembo and Ofer Zeitouni. Large deviations techniques and applications, volume 38 of Applications of Mathematics. Springer-Verlag, New York, second edition, 998 We can always think of these random variables as defined on a common probability space so that their distributions are exactly those that we want (e.g. construct them on an infinite product space) but it is not necessary. Gärtner-Ellis Theorem Let (µ n ) n N be a sequence of probability measures on (R d, B(R d )). Assume that for all t R d a possibly infinite limit Λ(t) in () exists. The convexity of Λ n (t) := ln e t,x dµ n R d for each n N and the limit definition of Λ immediately imply that Λ is convex. Similarly to the assumption in Cramér s theorem, we shall assume throughout this note that 0 DΛ o. This will ensure, in particular, that
2 gärtner-ellis theorem and applications. 2 Λ >. Indeed, note that as Λ n (0) = 0 for all n, so Λ(0) = 0. If for some t we had Λ(t) = then by convexity we would have for all α (0, ] But then Λ(αt) = Λ(αt + ( α)0) αλ(t) + ( α)λ(0) =. 0 = Λ(0) = Λ ( 2 (αt) + ) 2 ( αt) 2 Λ(αt) + 2 Λ( αt), and we would also have Λ( αt) = for all α (0, ]. This contradicts the assumption 0 D o Λ. We shall also need the following definition. Definition. Let I be the Legendre-Fenchel transform of Λ, i.e. I(x) = sup x R d ( t, x Λ(t)). A point x D I := {x R d : I(x) < } is said to be exposed for I if there is a η R d such that I(y) I(x) > η, y x for all y = x y The hyperplane h x (y) = I(x) + η, y x is called an exposing hyperplane to the graph of I at x. For a given (x, I(x)), it is characterized by its normal η. With a slight abuse of terminology, η itself will be referred to as an exposing hyperplane. Figure : Points x (0, ] are not exposed for the pictured function. x Since I is the Legendre-Fenchel transform of Λ, I is convex and satisfies all conditions of a rate function, i.e. it is non-negative and has compact sub-level sets. A justification of the last two claims can be given along the same lines as in lecture notes (right after Cramér theorem). Theorem (Gärtner-Ellis). Let (µ n ) n N be a sequence of probability measures on (R d, B(R d )). Assume that for all t R d a possibly infinite limit Λ(t) in () exists and that 0 D o Λ. Then (i) for every closed set C R d, lim sup n n ln µ n(c) inf x C I(x). Jurgen Gärtner. On large deviations from an invariant measure. Teor. Verojatnost. i Primenen., 22():27 42, 977 Richard S. Ellis. Large deviations for a general class of random vectors. Ann. Probab., 2(): 2, 984
3 gärtner-ellis theorem and applications. 3 (ii) for every open set O R d, lim sup n n ln µ n(o) inf I(x), x O E where E is the set of those exposed points for I which have an exposing hyperplane in D o Λ. Suppose in addition that Λ is lower semi-continuous on R d, differentiable on D o Λ, and either D Λ = R d or Λ is steep, i.e. lim Λ(t n) = n whenever t n D o Λ, t n t D o Λ as n. Then (µ n) n N satisfies the LDP with rate function I. This theorem is rather general but still it does not capture all the cases in which a sequence of measures on R d satisfies a LDP. Our concern is, of course, only about the lower bound. Here is a simple example borrowed from 3, p. 45. Example. Let µ n ((, x]) = ( e nx ) [0, ) (x) (exponential distribution with parameter n), x R. Then ( ) Λ(t) = lim n n ln ne ntx nx 0, if t < ; dx = 0, if t. x, if x 0; I(x) = sup(tx Λ(t)) = t R, if x < 0. We see that E = {0} while D I = [0, ), and for each open set O with O E = Gärtner-Ellis theorem gives only a trivial lower bound. But it is easy to see directly that for every open set O for which O D I = lim n n ln µ n(o) = lim n n ln O [0, ) ne nx dx inf{x, x O [0, )} = inf x O I(x). This says that (µ n ) n N satisfy a LDP with rate I. 3 Amir Dembo and Ofer Zeitouni. Large deviations techniques and applications, volume 38 of Applications of Mathematics. Springer-Verlag, New York, second edition, 998 Exercise. Give the details of this computation. Hint: Every open set in R is a countable union of disjoint open intervals. Use the leftmost interval which intersects D I. We note that Gärtner-Ellis Theorem readily implies Cramér theorem on R d, d, when the logarithmic MGF of µ is finite on all of R d. Otherwise we have to impose an additional condition of
4 gärtner-ellis theorem and applications. 4 steepness (see Theorem ). As we have already mentioned above, the inclusion 0 DΛ o is sufficient for the result in Cramér theorem to hold. The following exercise shows that Theorem does not fully include Cramér theorem if D Λ = R d. Exercise (Exercise 2.3.7(a) in A. Dembo, O. Zeitouni). Let µ be a probability measure on (R d, B(R d )) with a density proportional to e x /( + x d+2 ) and Λ be its logarithmic MGF. (a) Find D o Λ. (b) Show that Λ is not steep. Conclude that in this example Cramér theorem yields the full LDP for empirical means while Gärtner-Ellis theorem does not. A general limitation of Gärtner-Ellis theorem is that the differentiability and steepness conditions on Λ are often hard to check in applications. This is due to the fact that the existence of the limit which defines Λ is often obtained by soft methods which do not give a usable formula for Λ. Let us look again at the already familiar Curie-Weiss model and see what Gärtner-Ellis theorem can and can not do in this case. This example also provides a good illustration of notions introduced in this note. Curie-Weiss model II: what does Gärtner-Ellis theorem give us? For each spin configuration σ = (σ, σ 2,..., σ n ) Σ n = {, } n we define P n,β (σ) = e βh n(σ) Z n,β, where H n (σ) = J 2n n σ i σ j and Z n,β = e βhn(σ). i,j= σ Σ n For each n we have a different probability space (Σ n, G n, P n,β ). We study random variables σ n = n i= σ i [, ]. Thus, the relevant probability measures µ n,β (the distributions of σ n under P n,β ), n N, are measures on the common measurable space ([, ], B([, ])). Thus, µ n,β (A) = P n,β (σ : σ n A), A B([, ]).
5 gärtner-ellis theorem and applications. 5 We obtained two main results for (µ n,β ) n N : the LDP with the rate function I β (x) = I 0 (x) βj [ 2 x2 inf I 0 (y) βj ] y [,] 2 y2, where +x 2 ln +x 2 + x 2 ln x 2, if x < ; I 0 (x) = ln 2, if x = ;, if x >. and weak convergence to a limiting measure. These results clearly show the existence of a phase transition as β crosses J. Let us apply Gärtner-Ellis theorem for this case and draw conclusions. We know from the start that when β > J we should run into trouble, since in this regime I β is not convex while every rate function in Gärtner-Ellis theorem is convex. To start, we need to compute Λ β (t). As we did before, we rewrite everything in terms of µ n,0 (β = 0) which assigns a binomial probability to each possible value of σ n {, + 2/n,..., 2/n, }, ( ) n µ n,0 ({ + 2m/n}) = 2 n m and use the LDP for these measures (with the rate function I 0 ) and Varadhan s lemma to get Λ β (t) = lim n n R ln E = lim n n ln µn,0 = sup x [,] [ tx + e ntx dµ n,β (e n(tσ n+ βj 2 (σ n) 2) E µn,0 ( e n βj 2 (σ n) 2) = sup x [,] ( βj 2 x2 I 0 (x) sup y [,] (2) ( tx + βj ) 2 x2 I 0 (x) Since we take the supremum of continuous functions over [, ], Λ β (t) < for all t, i.e. D Λβ = R. By Theorem (i) we get a LDP upper bound with the rate function Λ β. If β J then I β is convex and Λ β = I = I β with D Iβ = [, ]. Moreover, I β is strictly convex so that every point in (, ) is exposed with an exposing hyperplane in D Λβ simply because D Λβ = R. By Theorem (ii) we get a LDP lower bound: for every open set O lim inf n n ln µ n,β(o) inf I β(x) x O (,) = inf x O [,] I β(x) = inf x O I β(x). sup y [,] ( ) βj 2 y2 I 0 (y) ( ) )] βj 2 y2 I 0 (y) = sup (tx I β (x)). x [,]
6 gärtner-ellis theorem and applications. 6 The next to the last equality is due to continuity of I β on [, ]. The last equality holds because I β (x) = outside of [, ]. We recover an earlier obtained LDP in this case. Alternatively, we could have used the last statement of Theorem : Λ β satisfies all additional conditions needed for a full LDP. If β > J then Λ β is convex while I β is not and Λ β (x) = I β, if x m β ; 0, if x < m β, where ±m β (, 0) (0, ) are the points where I β attains its minimal value 0. In lecture note 3 we have shown that m β is the unique solution in (0, ) of the equation x = tanh(βjx). Graph of I β (x) Graph of Λ (x) Construction of Λ(t) 0. y 0. y 0. y y = tx y = Λ (x) m β m β x m β m β x m β m β Λ(t) x Our first observation is that Λ β < I β on ( m β, m β ) and an upper bound provided by Theorem is strictly larger than the one we obtained earlier: for every closed set G ( m β, m β ) inf x G I β(x) < inf x G Λ β (x). We also see that Λ β 0 on [ m β, m β ]. This implies that Λ β (t) = Iβ (t) = Λ β (t) t m β and, since Λ β (0) = 0, we conclude that Λ β has a corner at the origin. The differentiability condition fails at 0, and Theorem only gives us the following lower bound: for all open sets O lim inf n n ln µ n,β(o) inf x O E Λ β (x), where the set of exposed points E is (, m β ) (m β, ) (we have to exclude the set where Λ β is not strictly convex). If O [ m β, m β ] then Theorem gives us only a trivial lower bound (the infimum is taken over the empty set), while our earlier results give us more information in this case. Now it is the right time for an example where Gärtner-Ellis theorem provides an efficient and quick way of getting a LDP z Λ(t) m β t Figure 2: Graph of Λ(t) = Λ (t). t
7 gärtner-ellis theorem and applications. 7 Curie-Weiss model III: adding a random external field This subsection follows 4 and shows how to get a LDP for Curie- Weiss model in the case when a constant global external magnetic field h is replaced by a random local field h := (h i ) i N satisfying an appropriate averaging assumption (see Assumption below). Let (Ω, F, P) be a probability space and h = (h i ) i N be a sequence of random variables on it. We shall consider random probability measures on (Σ n, G n ) defined by 4 Matthias Löwe, Raphael Meiners, and Felipe Torres. Large deviations principle for Curie-Weiss models with random fields. J. Phys. A, 46(2):25004, 0 pp., 203 P n,β,h (σ) := e βh n,h(σ) Z n,β,h, where H n,h (σ) = J 2n n i,j= σ i σ j n i= h i σ i, Z n,β,h = σ Σ n e βh n,h(σ). An important difference with the original deterministic model is that the measure P n,β,h (σ) is not anymore completely determined by the value of σ n. For each ω Ω, let µ n,β,h be the distribution of σ n. It is a measure on [, ]. Our goal will be to state and prove a LDP for (µ n,β,h ) n N. For this we shall need an assumption on a random field h. Assumption: Let h = (h i ) i N be a sequence of random variables on some probability space (Ω, F, P). We assume that for almost every ω Ω f n (t, ω) := n n ln cosh(t + βh i (ω)) f (t) as n i= for every t R and some differentiable function f : R R. Exercise 2. Prove that if f satisfies the above assumption then necessarily f (t) for all t R and, therefore, f (x) := sup t R (tx f (t)) is equal to for all x >. The most basic example is given by a sequence (h i ) i N of i.i.d. random variables with a finite first moment, E[ h i ] <. The convergence follows by the strong law of large numbers (and a small additional argument, see the exercise below) and f (t) = E[ln cosh(t + βh i )]. Note that ln cosh t t and, thus, E[ ln cosh(t + βh i ) ] t + βe[ h i ] <.
8 gärtner-ellis theorem and applications. 8 Exercise 3. The strong law of large numbers implies that for every t the convergence holds outside of a set of probability 0. But R is uncountable. Start with rational t and show rigorously that the assumption is indeed satisfied. Another example, now with dependence, arises when (h i ) i N is an irreducible positive recurrent Markov chain on a countable subset S of R. Such Markov chain has a unique stationary distribution π on S. Assuming that ln cosh(t + βs) π(s) <, s S the ergodic theorem for Markov chains (see Theorem 4. in 5 ) implies that for any initial distribution with probability lim n n n ln cosh(t + βh i ) = f (t) := ln cosh(t + βs)π(s). i= s S Note that when the initial distribution is different from π the random variables (h i ) i N are not identically distributed. 5 Pierre Brémaud. Markov chains, volume 3 of Texts in Applied Mathematics. Springer-Verlag, New York, 999. Gibbs fields, Monte Carlo simulation, and queues For our last example let (h i ) i N {0} be a stationary ergodic sequence on some probability space (Ω, F, P). More precisely, let T : Ω Ω be a measure preserving transformation, i.e. T be F- measurable and P(T A) = P(A) for all A F. In addition, assume that T is ergodic, that is T A = A implies that P(A) {0, }. Let h be any random variable on (Ω, F, P) with a finite first moment. Define h i (ω) = h(t i ω), i N {0}. The ergodicity assumption and the finiteness of the first moment of h ensure (by Birkhoff ergodic theorem) that for almost every ω Ω n lim n n i=0 ln cosh(t + βh i (ω)) = f (t) := E[ln cosh(t + βh(ω))]. Now we are ready to state the main theorem of this subsection. A sequence (h i ) i N {0} is said to be stationary if for every k N {0} and n N the random vector (h n, h n+,..., h n+k ) has the same distribution as (h 0, h,..., h n ). In simple words, ergodicity of T means that there are no non-trivial T-invariant sets. This general framework, in fact, includes the previous two examples provided that the Markov chain in the second example starts from the stationary distribution. Theorem 2. Suppose that h = (h i ) i N satisfies the assumption above. Then for almost every ω Ω measures (µ n,β,h ) n N satisfy a LDP with deterministic rate function I β, f (x) := f (x) βj [ 2 x2 inf f (y) βj ] y [,] 2 y2, where f (x) := sup t R (tx f (t)).
9 gärtner-ellis theorem and applications. 9 Remark. Note that if h i h then f (t) = ln cosh (t + βh) and after a calculation we get that f (x) = I 0 (x) βhx, where I 0 is given by (2). Combining this result with Theorem 2 we get the rate function for the last exercise in lecture notes 3. A useful identity: ln cosh(arctanhx) = 2 ln( x2 ). Proof. The plan is to introduce a family of convenient reference measures which satisfies a LDP and then use tilting to get a desired LDP for the original measures. In our previous considerations measures (µ n,0 ) n N were such reference measures. They were the distributions of σ n under the uniform measure P n,0 on all spin configurations {, } n. Under this uniform measure spins σ i, i n, were independent and identically distributed with P n,0 (σ : σ i = ) = /2. We consider measures Q n,β,h under which individual spins are still independent but not identically distributed: Q n,β,h (σ Σ n : σ i = ±) = e ±βh i 2 cosh(βh i ). Let ν n,β,h be the distribution of σ n under Q n,β,h. These are our new reference measures. We are going to apply Gärtner-Ellis theorem to (ν n,β,h ) n N to get a LDP. We need to compute Λ β (t). Using our assumption we find that for almost every ω Ω Λ β (t) := lim n n R ln e ntx dν n,β,h = lim n n ln e t n i= σ n i Q n,β,h (σ) = lim n σ Σ n n ln e βhi+t + e βh i t 2 cosh(βh i= i ) for all t R. Therefore, D Λβ = R and Λ β is differentiable on R. By Gärtner-Ellis theorem (ν n,β,h ) n N satisfy a LDP with rate = lim n ( f n (t) f n (0)) = f (t) f (0) Λ β (x) = f (x) + f (0). According to Exercise 2, D Λ β [, ]. Our next step is tilting. For every A B([, ]) A simpler way to see this is to note that σ n [, ], and, therefore, the rate function has to be infinite outside of [, ]. P n,β,h (σ) µ n,β,h (A) = P n,β,h (σ) = Q σ: σ n A σ: σ n A n,β,h (σ) Q n,β,h(σ) = 2n n i= cosh(h iσ i ) Z n,β,h = 2n n i= cosh(h iσ i ) Z n,β,h σ: σ n A e nβj 2 (σ n) 2 Q n,β,h (σ) e nβj 2 x2 dν n,β,h =: A Z G n,β,h G(x) : = βj 2 x2, x [, ]; Z G n,β,h := Z n,β,h 2 n n i= cosh(h iσ i ) = e ng(x) dν n,β,h, where A [,] eng(x) dν n,β,h.
10 gärtner-ellis theorem and applications. 0 By part (b) of Lemma 0.3 from lecture notes 3 we conclude that for almost every ω Ω measures (µ n,β,h ) n N satisfy a LDP with rate function I G (x) = Λ β (x) G(x) inf y [,] (Λ β (y) G(y)) = I β, f (x) as claimed. References Pierre Brémaud. Markov chains, volume 3 of Texts in Applied Mathematics. Springer-Verlag, New York, 999. Gibbs fields, Monte Carlo simulation, and queues. Amir Dembo and Ofer Zeitouni. Large deviations techniques and applications, volume 38 of Applications of Mathematics. Springer-Verlag, New York, second edition, 998. Richard S. Ellis. Large deviations for a general class of random vectors. Ann. Probab., 2(): 2, 984. Jurgen Gärtner. On large deviations from an invariant measure. Teor. Verojatnost. i Primenen., 22():27 42, 977. Matthias Löwe, Raphael Meiners, and Felipe Torres. Large deviations principle for Curie-Weiss models with random fields. J. Phys. A, 46 (2):25004, 0 pp., 203.
Large Deviations for Weakly Dependent Sequences: The Gärtner-Ellis Theorem
Chapter 34 Large Deviations for Weakly Dependent Sequences: The Gärtner-Ellis Theorem This chapter proves the Gärtner-Ellis theorem, establishing an LDP for not-too-dependent processes taking values in
More information2a Large deviation principle (LDP) b Contraction principle c Change of measure... 10
Tel Aviv University, 2007 Large deviations 5 2 Basic notions 2a Large deviation principle LDP......... 5 2b Contraction principle................ 9 2c Change of measure................. 10 The formalism
More information2.2 Some Consequences of the Completeness Axiom
60 CHAPTER 2. IMPORTANT PROPERTIES OF R 2.2 Some Consequences of the Completeness Axiom In this section, we use the fact that R is complete to establish some important results. First, we will prove that
More informationReal Analysis Math 131AH Rudin, Chapter #1. Dominique Abdi
Real Analysis Math 3AH Rudin, Chapter # Dominique Abdi.. If r is rational (r 0) and x is irrational, prove that r + x and rx are irrational. Solution. Assume the contrary, that r+x and rx are rational.
More informationGeneral Theory of Large Deviations
Chapter 30 General Theory of Large Deviations A family of random variables follows the large deviations principle if the probability of the variables falling into bad sets, representing large deviations
More informationErgodic Theorems. Samy Tindel. Purdue University. Probability Theory 2 - MA 539. Taken from Probability: Theory and examples by R.
Ergodic Theorems Samy Tindel Purdue University Probability Theory 2 - MA 539 Taken from Probability: Theory and examples by R. Durrett Samy T. Ergodic theorems Probability Theory 1 / 92 Outline 1 Definitions
More informationThe Lebesgue Integral
The Lebesgue Integral Brent Nelson In these notes we give an introduction to the Lebesgue integral, assuming only a knowledge of metric spaces and the iemann integral. For more details see [1, Chapters
More informationCHAPTER 7. Connectedness
CHAPTER 7 Connectedness 7.1. Connected topological spaces Definition 7.1. A topological space (X, T X ) is said to be connected if there is no continuous surjection f : X {0, 1} where the two point set
More informationProbability and Measure
Part II Year 2018 2017 2016 2015 2014 2013 2012 2011 2010 2009 2008 2007 2006 2005 2018 84 Paper 4, Section II 26J Let (X, A) be a measurable space. Let T : X X be a measurable map, and µ a probability
More informationMATHS 730 FC Lecture Notes March 5, Introduction
1 INTRODUCTION MATHS 730 FC Lecture Notes March 5, 2014 1 Introduction Definition. If A, B are sets and there exists a bijection A B, they have the same cardinality, which we write as A, #A. If there exists
More information3 Measurable Functions
3 Measurable Functions Notation A pair (X, F) where F is a σ-field of subsets of X is a measurable space. If µ is a measure on F then (X, F, µ) is a measure space. If µ(x) < then (X, F, µ) is a probability
More informationMATH41011/MATH61011: FOURIER SERIES AND LEBESGUE INTEGRATION. Extra Reading Material for Level 4 and Level 6
MATH41011/MATH61011: FOURIER SERIES AND LEBESGUE INTEGRATION Extra Reading Material for Level 4 and Level 6 Part A: Construction of Lebesgue Measure The first part the extra material consists of the construction
More informationn [ F (b j ) F (a j ) ], n j=1(a j, b j ] E (4.1)
1.4. CONSTRUCTION OF LEBESGUE-STIELTJES MEASURES In this section we shall put to use the Carathéodory-Hahn theory, in order to construct measures with certain desirable properties first on the real line
More informationLarge Deviations Techniques and Applications
Amir Dembo Ofer Zeitouni Large Deviations Techniques and Applications Second Edition With 29 Figures Springer Contents Preface to the Second Edition Preface to the First Edition vii ix 1 Introduction 1
More informationExamples of Dual Spaces from Measure Theory
Chapter 9 Examples of Dual Spaces from Measure Theory We have seen that L (, A, µ) is a Banach space for any measure space (, A, µ). We will extend that concept in the following section to identify an
More informationChapter 1. Measure Spaces. 1.1 Algebras and σ algebras of sets Notation and preliminaries
Chapter 1 Measure Spaces 1.1 Algebras and σ algebras of sets 1.1.1 Notation and preliminaries We shall denote by X a nonempty set, by P(X) the set of all parts (i.e., subsets) of X, and by the empty set.
More informationConsequences of the Completeness Property
Consequences of the Completeness Property Philippe B. Laval KSU Today Philippe B. Laval (KSU) Consequences of the Completeness Property Today 1 / 10 Introduction In this section, we use the fact that R
More informationHOMEWORK ASSIGNMENT 6
HOMEWORK ASSIGNMENT 6 DUE 15 MARCH, 2016 1) Suppose f, g : A R are uniformly continuous on A. Show that f + g is uniformly continuous on A. Solution First we note: In order to show that f + g is uniformly
More informationGeneral Glivenko-Cantelli theorems
The ISI s Journal for the Rapid Dissemination of Statistics Research (wileyonlinelibrary.com) DOI: 10.100X/sta.0000......................................................................................................
More informationWeek 2: Sequences and Series
QF0: Quantitative Finance August 29, 207 Week 2: Sequences and Series Facilitator: Christopher Ting AY 207/208 Mathematicians have tried in vain to this day to discover some order in the sequence of prime
More informationImportant Properties of R
Chapter 2 Important Properties of R The purpose of this chapter is to explain to the reader why the set of real numbers is so special. By the end of this chapter, the reader should understand the difference
More information02. Measure and integral. 1. Borel-measurable functions and pointwise limits
(October 3, 2017) 02. Measure and integral Paul Garrett garrett@math.umn.edu http://www.math.umn.edu/ garrett/ [This document is http://www.math.umn.edu/ garrett/m/real/notes 2017-18/02 measure and integral.pdf]
More informationLebesgue Measure on R n
CHAPTER 2 Lebesgue Measure on R n Our goal is to construct a notion of the volume, or Lebesgue measure, of rather general subsets of R n that reduces to the usual volume of elementary geometrical sets
More information1.1. MEASURES AND INTEGRALS
CHAPTER 1: MEASURE THEORY In this chapter we define the notion of measure µ on a space, construct integrals on this space, and establish their basic properties under limits. The measure µ(e) will be defined
More informationInformation Theory and Statistics Lecture 3: Stationary ergodic processes
Information Theory and Statistics Lecture 3: Stationary ergodic processes Łukasz Dębowski ldebowsk@ipipan.waw.pl Ph. D. Programme 2013/2014 Measurable space Definition (measurable space) Measurable space
More informationProblem Set 2: Solutions Math 201A: Fall 2016
Problem Set 2: s Math 201A: Fall 2016 Problem 1. (a) Prove that a closed subset of a complete metric space is complete. (b) Prove that a closed subset of a compact metric space is compact. (c) Prove that
More informationABSTRACT INTEGRATION CHAPTER ONE
CHAPTER ONE ABSTRACT INTEGRATION Version 1.1 No rights reserved. Any part of this work can be reproduced or transmitted in any form or by any means. Suggestions and errors are invited and can be mailed
More informationCHAPTER 6. Differentiation
CHPTER 6 Differentiation The generalization from elementary calculus of differentiation in measure theory is less obvious than that of integration, and the methods of treating it are somewhat involved.
More information106 CHAPTER 3. TOPOLOGY OF THE REAL LINE. 2. The set of limit points of a set S is denoted L (S)
106 CHAPTER 3. TOPOLOGY OF THE REAL LINE 3.3 Limit Points 3.3.1 Main Definitions Intuitively speaking, a limit point of a set S in a space X is a point of X which can be approximated by points of S other
More information5 Birkhoff s Ergodic Theorem
5 Birkhoff s Ergodic Theorem Birkhoff s Ergodic Theorem extends the validity of Kolmogorov s strong law to the class of stationary sequences of random variables. Stationary sequences occur naturally even
More informationREAL AND COMPLEX ANALYSIS
REAL AND COMPLE ANALYSIS Third Edition Walter Rudin Professor of Mathematics University of Wisconsin, Madison Version 1.1 No rights reserved. Any part of this work can be reproduced or transmitted in any
More informationFilters in Analysis and Topology
Filters in Analysis and Topology David MacIver July 1, 2004 Abstract The study of filters is a very natural way to talk about convergence in an arbitrary topological space, and carries over nicely into
More informationDivision of the Humanities and Social Sciences. Supergradients. KC Border Fall 2001 v ::15.45
Division of the Humanities and Social Sciences Supergradients KC Border Fall 2001 1 The supergradient of a concave function There is a useful way to characterize the concavity of differentiable functions.
More informationThe large deviation principle for the Erdős-Rényi random graph
The large deviation principle for the Erdős-Rényi random graph (Courant Institute, NYU) joint work with S. R. S. Varadhan Main objective: how to count graphs with a given property Only consider finite
More informationMAT 570 REAL ANALYSIS LECTURE NOTES. Contents. 1. Sets Functions Countability Axiom of choice Equivalence relations 9
MAT 570 REAL ANALYSIS LECTURE NOTES PROFESSOR: JOHN QUIGG SEMESTER: FALL 204 Contents. Sets 2 2. Functions 5 3. Countability 7 4. Axiom of choice 8 5. Equivalence relations 9 6. Real numbers 9 7. Extended
More informationg 2 (x) (1/3)M 1 = (1/3)(2/3)M.
COMPACTNESS If C R n is closed and bounded, then by B-W it is sequentially compact: any sequence of points in C has a subsequence converging to a point in C Conversely, any sequentially compact C R n is
More information1. Supremum and Infimum Remark: In this sections, all the subsets of R are assumed to be nonempty.
1. Supremum and Infimum Remark: In this sections, all the subsets of R are assumed to be nonempty. Let E be a subset of R. We say that E is bounded above if there exists a real number U such that x U for
More informationZdzis law Brzeźniak and Tomasz Zastawniak
Basic Stochastic Processes by Zdzis law Brzeźniak and Tomasz Zastawniak Springer-Verlag, London 1999 Corrections in the 2nd printing Version: 21 May 2005 Page and line numbers refer to the 2nd printing
More informationMAGIC010 Ergodic Theory Lecture Entropy
7. Entropy 7. Introduction A natural question in mathematics is the so-called isomorphism problem : when are two mathematical objects of the same class the same (in some appropriately defined sense of
More informationIn N we can do addition, but in order to do subtraction we need to extend N to the integers
Chapter The Real Numbers.. Some Preliminaries Discussion: The Irrationality of 2. We begin with the natural numbers N = {, 2, 3, }. In N we can do addition, but in order to do subtraction we need to extend
More informationLecture Notes Introduction to Ergodic Theory
Lecture Notes Introduction to Ergodic Theory Tiago Pereira Department of Mathematics Imperial College London Our course consists of five introductory lectures on probabilistic aspects of dynamical systems,
More informationAppendix B Convex analysis
This version: 28/02/2014 Appendix B Convex analysis In this appendix we review a few basic notions of convexity and related notions that will be important for us at various times. B.1 The Hausdorff distance
More information3 (Due ). Let A X consist of points (x, y) such that either x or y is a rational number. Is A measurable? What is its Lebesgue measure?
MA 645-4A (Real Analysis), Dr. Chernov Homework assignment 1 (Due ). Show that the open disk x 2 + y 2 < 1 is a countable union of planar elementary sets. Show that the closed disk x 2 + y 2 1 is a countable
More informationIndeed, if we want m to be compatible with taking limits, it should be countably additive, meaning that ( )
Lebesgue Measure The idea of the Lebesgue integral is to first define a measure on subsets of R. That is, we wish to assign a number m(s to each subset S of R, representing the total length that S takes
More informationWe are going to discuss what it means for a sequence to converge in three stages: First, we define what it means for a sequence to converge to zero
Chapter Limits of Sequences Calculus Student: lim s n = 0 means the s n are getting closer and closer to zero but never gets there. Instructor: ARGHHHHH! Exercise. Think of a better response for the instructor.
More information13. Examples of measure-preserving tranformations: rotations of a torus, the doubling map
3. Examples of measure-preserving tranformations: rotations of a torus, the doubling map 3. Rotations of a torus, the doubling map In this lecture we give two methods by which one can show that a given
More informationIntroduction and Preliminaries
Chapter 1 Introduction and Preliminaries This chapter serves two purposes. The first purpose is to prepare the readers for the more systematic development in later chapters of methods of real analysis
More informationADVANCED CALCULUS - MTH433 LECTURE 4 - FINITE AND INFINITE SETS
ADVANCED CALCULUS - MTH433 LECTURE 4 - FINITE AND INFINITE SETS 1. Cardinal number of a set The cardinal number (or simply cardinal) of a set is a generalization of the concept of the number of elements
More informationCorrections. ir j r mod n 2 ). (i r j r ) mod n should be d. p. 83, l. 2 [p. 80, l. 5]: d. p. 92, l. 1 [p. 87, l. 2]: Λ Z d should be Λ Z d.
Corrections We list here typos and errors that have been found in the published version of the book. We welcome any additional corrections you may find! We indicate page and line numbers for the published
More informationINTRODUCTION TO STATISTICAL MECHANICS Exercises
École d hiver de Probabilités Semaine 1 : Mécanique statistique de l équilibre CIRM - Marseille 4-8 Février 2012 INTRODUCTION TO STATISTICAL MECHANICS Exercises Course : Yvan Velenik, Exercises : Loren
More informationconsists of two disjoint copies of X n, each scaled down by 1,
Homework 4 Solutions, Real Analysis I, Fall, 200. (4) Let be a topological space and M be a σ-algebra on which contains all Borel sets. Let m, µ be two positive measures on M. Assume there is a constant
More informationLarge deviation theory and applications
Large deviation theory and applications Peter Mörters November 0, 2008 Abstract Large deviation theory deals with the decay of the probability of increasingly unlikely events. It is one of the key techniques
More informationNotes 1 : Measure-theoretic foundations I
Notes 1 : Measure-theoretic foundations I Math 733-734: Theory of Probability Lecturer: Sebastien Roch References: [Wil91, Section 1.0-1.8, 2.1-2.3, 3.1-3.11], [Fel68, Sections 7.2, 8.1, 9.6], [Dur10,
More informationHOPF S DECOMPOSITION AND RECURRENT SEMIGROUPS. Josef Teichmann
HOPF S DECOMPOSITION AND RECURRENT SEMIGROUPS Josef Teichmann Abstract. Some results of ergodic theory are generalized in the setting of Banach lattices, namely Hopf s maximal ergodic inequality and the
More informationChapter 3 Continuous Functions
Continuity is a very important concept in analysis. The tool that we shall use to study continuity will be sequences. There are important results concerning the subsets of the real numbers and the continuity
More information5.5 Deeper Properties of Continuous Functions
5.5. DEEPER PROPERTIES OF CONTINUOUS FUNCTIONS 195 5.5 Deeper Properties of Continuous Functions 5.5.1 Intermediate Value Theorem and Consequences When one studies a function, one is usually interested
More informationLarge-deviation theory and coverage in mobile phone networks
Weierstrass Institute for Applied Analysis and Stochastics Large-deviation theory and coverage in mobile phone networks Paul Keeler, Weierstrass Institute for Applied Analysis and Stochastics, Berlin joint
More informationB. Appendix B. Topological vector spaces
B.1 B. Appendix B. Topological vector spaces B.1. Fréchet spaces. In this appendix we go through the definition of Fréchet spaces and their inductive limits, such as they are used for definitions of function
More informationMathematical Statistical Physics Solution Sketch of the math part of the Exam
Prof. B. Paredes, Prof. P. Pickl Summer Term 2016 Lukas Nickel, Carlos Velasco July 22, 2016 Mathematical Statistical Physics Solution Sketch of the math part of the Exam Family Name: First name: Student
More informationII - REAL ANALYSIS. This property gives us a way to extend the notion of content to finite unions of rectangles: we define
1 Measures 1.1 Jordan content in R N II - REAL ANALYSIS Let I be an interval in R. Then its 1-content is defined as c 1 (I) := b a if I is bounded with endpoints a, b. If I is unbounded, we define c 1
More informationIntroduction to Real Analysis Alternative Chapter 1
Christopher Heil Introduction to Real Analysis Alternative Chapter 1 A Primer on Norms and Banach Spaces Last Updated: March 10, 2018 c 2018 by Christopher Heil Chapter 1 A Primer on Norms and Banach Spaces
More information1 Stat 605. Homework I. Due Feb. 1, 2011
The first part is homework which you need to turn in. The second part is exercises that will not be graded, but you need to turn it in together with the take-home final exam. 1 Stat 605. Homework I. Due
More informationLarge deviation convergence of generated pseudo measures
Large deviation convergence of generated pseudo measures Ljubo Nedović 1, Endre Pap 2, Nebojša M. Ralević 1, Tatjana Grbić 1 1 Faculty of Engineering, University of Novi Sad Trg Dositeja Obradovića 6,
More informationLecture 17 Brownian motion as a Markov process
Lecture 17: Brownian motion as a Markov process 1 of 14 Course: Theory of Probability II Term: Spring 2015 Instructor: Gordan Zitkovic Lecture 17 Brownian motion as a Markov process Brownian motion is
More information6.2 Deeper Properties of Continuous Functions
6.2. DEEPER PROPERTIES OF CONTINUOUS FUNCTIONS 69 6.2 Deeper Properties of Continuous Functions 6.2. Intermediate Value Theorem and Consequences When one studies a function, one is usually interested in
More information6 CARDINALITY OF SETS
6 CARDINALITY OF SETS MATH10111 - Foundations of Pure Mathematics We all have an idea of what it means to count a finite collection of objects, but we must be careful to define rigorously what it means
More information25.1 Ergodicity and Metric Transitivity
Chapter 25 Ergodicity This lecture explains what it means for a process to be ergodic or metrically transitive, gives a few characterizes of these properties (especially for AMS processes), and deduces
More informationMath 564 Homework 1. Solutions.
Math 564 Homework 1. Solutions. Problem 1. Prove Proposition 0.2.2. A guide to this problem: start with the open set S = (a, b), for example. First assume that a >, and show that the number a has the properties
More information5 Set Operations, Functions, and Counting
5 Set Operations, Functions, and Counting Let N denote the positive integers, N 0 := N {0} be the non-negative integers and Z = N 0 ( N) the positive and negative integers including 0, Q the rational numbers,
More informationFerromagnets and the classical Heisenberg model. Kay Kirkpatrick, UIUC
Ferromagnets and the classical Heisenberg model Kay Kirkpatrick, UIUC Ferromagnets and the classical Heisenberg model: asymptotics for a mean-field phase transition Kay Kirkpatrick, Urbana-Champaign June
More informationOn asymptotically efficient simulation of large deviation probabilities
On asymptotically efficient simulation of large deviation probabilities A. B. Dieker and M. Mandjes CWI P.O. Box 94079 1090 GB Amsterdam, the Netherlands and University of Twente Faculty of Mathematical
More informationPart V. 17 Introduction: What are measures and why measurable sets. Lebesgue Integration Theory
Part V 7 Introduction: What are measures and why measurable sets Lebesgue Integration Theory Definition 7. (Preliminary). A measure on a set is a function :2 [ ] such that. () = 2. If { } = is a finite
More informationElementary Analysis Math 140D Fall 2007
Elementary Analysis Math 140D Fall 2007 Bernard Russo Contents 1 Friday September 28, 2007 1 1.1 Course information............................ 1 1.2 Outline of the course........................... 1
More informationX n D X lim n F n (x) = F (x) for all x C F. lim n F n(u) = F (u) for all u C F. (2)
14:17 11/16/2 TOPIC. Convergence in distribution and related notions. This section studies the notion of the so-called convergence in distribution of real random variables. This is the kind of convergence
More informationMeasure Theory on Topological Spaces. Course: Prof. Tony Dorlas 2010 Typset: Cathal Ormond
Measure Theory on Topological Spaces Course: Prof. Tony Dorlas 2010 Typset: Cathal Ormond May 22, 2011 Contents 1 Introduction 2 1.1 The Riemann Integral........................................ 2 1.2 Measurable..............................................
More informationAn inverse of Sanov s theorem
An inverse of Sanov s theorem Ayalvadi Ganesh and Neil O Connell BRIMS, Hewlett-Packard Labs, Bristol Abstract Let X k be a sequence of iid random variables taking values in a finite set, and consider
More informationMetric Spaces and Topology
Chapter 2 Metric Spaces and Topology From an engineering perspective, the most important way to construct a topology on a set is to define the topology in terms of a metric on the set. This approach underlies
More information6 Markov Chain Monte Carlo (MCMC)
6 Markov Chain Monte Carlo (MCMC) The underlying idea in MCMC is to replace the iid samples of basic MC methods, with dependent samples from an ergodic Markov chain, whose limiting (stationary) distribution
More informationA Note On Large Deviation Theory and Beyond
A Note On Large Deviation Theory and Beyond Jin Feng In this set of notes, we will develop and explain a whole mathematical theory which can be highly summarized through one simple observation ( ) lim
More informationGeneral Notation. Exercises and Problems
Exercises and Problems The text contains both Exercises and Problems. The exercises are incorporated into the development of the theory in each section. Additional Problems appear at the end of most sections.
More information18.175: Lecture 2 Extension theorems, random variables, distributions
18.175: Lecture 2 Extension theorems, random variables, distributions Scott Sheffield MIT Outline Extension theorems Characterizing measures on R d Random variables Outline Extension theorems Characterizing
More informationMATH31011/MATH41011/MATH61011: FOURIER ANALYSIS AND LEBESGUE INTEGRATION. Chapter 2: Countability and Cantor Sets
MATH31011/MATH41011/MATH61011: FOURIER ANALYSIS AND LEBESGUE INTEGRATION Chapter 2: Countability and Cantor Sets Countable and Uncountable Sets The concept of countability will be important in this course
More informationSet, functions and Euclidean space. Seungjin Han
Set, functions and Euclidean space Seungjin Han September, 2018 1 Some Basics LOGIC A is necessary for B : If B holds, then A holds. B A A B is the contraposition of B A. A is sufficient for B: If A holds,
More informationMeasure and integration
Chapter 5 Measure and integration In calculus you have learned how to calculate the size of different kinds of sets: the length of a curve, the area of a region or a surface, the volume or mass of a solid.
More informationTopology. Xiaolong Han. Department of Mathematics, California State University, Northridge, CA 91330, USA address:
Topology Xiaolong Han Department of Mathematics, California State University, Northridge, CA 91330, USA E-mail address: Xiaolong.Han@csun.edu Remark. You are entitled to a reward of 1 point toward a homework
More informationProblem set 1, Real Analysis I, Spring, 2015.
Problem set 1, Real Analysis I, Spring, 015. (1) Let f n : D R be a sequence of functions with domain D R n. Recall that f n f uniformly if and only if for all ɛ > 0, there is an N = N(ɛ) so that if n
More informationof the set A. Note that the cross-section A ω :={t R + : (t,ω) A} is empty when ω/ π A. It would be impossible to have (ψ(ω), ω) A for such an ω.
AN-1 Analytic sets For a discrete-time process {X n } adapted to a filtration {F n : n N}, the prime example of a stopping time is τ = inf{n N : X n B}, the first time the process enters some Borel set
More informationChapter 8. P-adic numbers. 8.1 Absolute values
Chapter 8 P-adic numbers Literature: N. Koblitz, p-adic Numbers, p-adic Analysis, and Zeta-Functions, 2nd edition, Graduate Texts in Mathematics 58, Springer Verlag 1984, corrected 2nd printing 1996, Chap.
More informationP i [B k ] = lim. n=1 p(n) ii <. n=1. V i :=
2.7. Recurrence and transience Consider a Markov chain {X n : n N 0 } on state space E with transition matrix P. Definition 2.7.1. A state i E is called recurrent if P i [X n = i for infinitely many n]
More informationMarkov Chains CK eqns Classes Hitting times Rec./trans. Strong Markov Stat. distr. Reversibility * Markov Chains
Markov Chains A random process X is a family {X t : t T } of random variables indexed by some set T. When T = {0, 1, 2,... } one speaks about a discrete-time process, for T = R or T = [0, ) one has a continuous-time
More informationMATH 102 INTRODUCTION TO MATHEMATICAL ANALYSIS. 1. Some Fundamentals
MATH 02 INTRODUCTION TO MATHEMATICAL ANALYSIS Properties of Real Numbers Some Fundamentals The whole course will be based entirely on the study of sequence of numbers and functions defined on the real
More informationFrom now on, we will represent a metric space with (X, d). Here are some examples: i=1 (x i y i ) p ) 1 p, p 1.
Chapter 1 Metric spaces 1.1 Metric and convergence We will begin with some basic concepts. Definition 1.1. (Metric space) Metric space is a set X, with a metric satisfying: 1. d(x, y) 0, d(x, y) = 0 x
More informationFinding a Needle in a Haystack: Conditions for Reliable. Detection in the Presence of Clutter
Finding a eedle in a Haystack: Conditions for Reliable Detection in the Presence of Clutter Bruno Jedynak and Damianos Karakos October 23, 2006 Abstract We study conditions for the detection of an -length
More informationFunctional Limit theorems for the quadratic variation of a continuous time random walk and for certain stochastic integrals
Functional Limit theorems for the quadratic variation of a continuous time random walk and for certain stochastic integrals Noèlia Viles Cuadros BCAM- Basque Center of Applied Mathematics with Prof. Enrico
More informationLegendre-Fenchel transforms in a nutshell
1 2 3 Legendre-Fenchel transforms in a nutshell Hugo Touchette School of Mathematical Sciences, Queen Mary, University of London, London E1 4NS, UK Started: July 11, 2005; last compiled: August 14, 2007
More informationON THE ZERO-ONE LAW AND THE LAW OF LARGE NUMBERS FOR RANDOM WALK IN MIXING RAN- DOM ENVIRONMENT
Elect. Comm. in Probab. 10 (2005), 36 44 ELECTRONIC COMMUNICATIONS in PROBABILITY ON THE ZERO-ONE LAW AND THE LAW OF LARGE NUMBERS FOR RANDOM WALK IN MIXING RAN- DOM ENVIRONMENT FIRAS RASSOUL AGHA Department
More informationAN EXPLORATION OF THE METRIZABILITY OF TOPOLOGICAL SPACES
AN EXPLORATION OF THE METRIZABILITY OF TOPOLOGICAL SPACES DUSTIN HEDMARK Abstract. A study of the conditions under which a topological space is metrizable, concluding with a proof of the Nagata Smirnov
More informationConnectedness. Proposition 2.2. The following are equivalent for a topological space (X, T ).
Connectedness 1 Motivation Connectedness is the sort of topological property that students love. Its definition is intuitive and easy to understand, and it is a powerful tool in proofs of well-known results.
More informationSome Background Material
Chapter 1 Some Background Material In the first chapter, we present a quick review of elementary - but important - material as a way of dipping our toes in the water. This chapter also introduces important
More informationIntroduction to Proofs in Analysis. updated December 5, By Edoh Y. Amiran Following the outline of notes by Donald Chalice INTRODUCTION
Introduction to Proofs in Analysis updated December 5, 2016 By Edoh Y. Amiran Following the outline of notes by Donald Chalice INTRODUCTION Purpose. These notes intend to introduce four main notions from
More information