Continuum Probability and Sets of Measure Zero



Chapter 3

In this chapter, we motivate the use of measure theory as a foundation for probability. We use the example of random coin tossing to explain why we need to move beyond discrete probability theory and to work out what the new foundation (which has yet to be developed) must provide. Presuming that the necessary theoretical foundation can indeed be created, we show some important consequences that result. This is intended to justify the investment in rigorous analysis that we make in the following chapters.

We do not show in this chapter that the required theoretical foundation exists! The chapter is meant to be a fun and engaging introduction to the thought processes involved in measure theoretic probability. Moreover, it shows that even a vague idea of measure makes it possible to state and prove deep results. Enjoy this chapter, as the next chapter provides all the heavy-going theory and proof a reader could want! After developing rigorous measure theory, we revisit the material in this chapter to verify that everything discussed is indeed rigorously justified.

3.1 Probability and sets of real numbers

We begin by developing a connection between a probability space with an infinite number of points and an interval of real numbers. With this correspondence, we can develop a systematic method for computing probabilities of events in the probability space by measuring the sizes of the corresponding sets of real numbers. However, it turns out that perfectly reasonable probability questions correspond to very complicated sets of real numbers. Thus, we first need to develop a way to measure the size of rather unusual sets of real numbers.

Bernoulli sequences and the unit interval

Definition. Suppose an experiment has two possible outcomes and the probabilities of these outcomes are fixed. A finite number of independent trials of the experiment is called a Bernoulli trial. An infinite sequence of independent trials is called a Bernoulli sequence.

Example. Let the experiment be the toss of a two-sided coin, with heads denoted H and tails denoted T. An example of a Bernoulli sequence is

H, T, T, H, H, H, H, T, H, T, T, H, H, T, T, T, T, H, T, H, T, T, ...

Definition. We define the space of Bernoulli sequences,

$$B = \{\text{all Bernoulli sequences generated by a particular experiment}\}.$$

We use H and T to denote the two outcomes. For simplicity, we mostly treat the case where the outcomes have equal probability of occurring, i.e., the case of a fair coin. In general, the two outcomes may have different probabilities.

We show that $B$ can (almost) be represented by the real numbers in $(0,1]$, which implies that $B$ is uncountable.

Theorem. If we delete a countable subset of $B$, we can index the remaining points using the numbers in $(0,1]$.

Recall that by index, we mean there is a 1-1 correspondence between the two sets.

Proof. We construct a map from $(0,1]$ to $B$ that fails to be onto by a countable subset. Any point $x \in [0,1]$ can be written as an expansion in base 2, or binary expansion,

$$x = \sum_{i=1}^{\infty} \frac{a_i}{2^i}, \qquad a_i = 0 \text{ or } 1.$$

Each such expansion corresponds to a Bernoulli sequence. To see this, define the $n$th term of the Bernoulli sequence to be H when $a_n = 1$ and T when $a_n = 0$.

Example. The expansion $.10111001001\ldots$ corresponds to H, T, H, H, H, T, T, H, T, T, H, ...

A problem with using real numbers as an index set is that some numbers do not have a unique binary expansion, whereas we consider two Bernoulli sequences with different members to be distinct.

Example. $\frac{1}{2} = .1000\ldots$ and $\frac{1}{2} = .0111\ldots$, but H T T T ... $\neq$ T H H H ...

Thus, the method used above to generate a Bernoulli sequence does not define a function into $B$. To avoid this trouble, we adopt the convention that if the real number $x$ has both terminating and non-terminating binary expansions, we use the non-terminating expansion. This is the reason for using $(0,1]$ instead of $[0,1]$.

With this convention, the method above defines a 1-1 map into $B$ that is not onto, because it does not produce Bernoulli sequences ending in all T's. We claim that the set $B_T$ of such Bernoulli sequences is countable. Let $B_T^k$ be the finite set of Bernoulli sequences that have only T's after the $k$th term. We have

$$B_T = \bigcup_{k=1}^{\infty} B_T^k. \tag{3.1}$$

This implies that $B_T$ is countable and that there is a 1-1 and onto correspondence between $(0,1]$ and $B \setminus B_T$.
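The indexing map in the proof can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `bernoulli_prefix` is hypothetical, exact rational arithmetic (`fractions.Fraction`) stands in for real numbers, and only a finite prefix of the sequence is produced. The strict comparison `x > 1` is what selects the non-terminating expansion.

```python
from fractions import Fraction

def bernoulli_prefix(x, n):
    """First n outcomes of the Bernoulli sequence indexed by x in (0, 1],
    read off the non-terminating binary expansion of x."""
    x = Fraction(x)
    if not 0 < x <= 1:
        raise ValueError("x must lie in (0, 1]")
    outcomes = []
    for _ in range(n):
        x *= 2
        if x > 1:            # strict: a remainder of exactly 1 continues as .111...
            outcomes.append("H")
            x -= 1
        else:
            outcomes.append("T")
    return "".join(outcomes)

print(bernoulli_prefix(Fraction(1, 2), 6))   # 1/2 is read as .0111...
print(bernoulli_prefix(Fraction(3, 4), 6))   # 3/4 is read as .10111...
```

With this convention no input ever yields a sequence ending in all T's, which is the countable set removed in the theorem.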

Proof Comment 3.1. The decomposition of a countable set as a countable union of finite sets in (3.1) is a standard measure theory argument.

Initial encounter with measure

Since $B$ is uncountable and $B_T$ is countable, we would like to ignore $B_T$ for all practical purposes and identify $B$ with $I = (0,1]$. Likewise, it turns out to be convenient to treat the size of any finite or countable subset of $I$ as negligible compared to the size of $I$, which has a number of important ramifications. This is the first motivating example for devising a way to measure the size of sets of real numbers that applies to complicated sets.

Lebesgue developed an approach to measuring the sizes of complicated sets of real numbers that is the basis for measure theory. Measure theory can be developed in a very abstract way that applies to spaces of many different kinds of objects, though we focus on spaces consisting of real numbers in this book. In that context, it is initially reasonable to think of measure as a generalization of length in one dimension, and of area and volume in higher dimensions. But we also caution that measures can have other interpretations. For example, we use measure to quantify probability later on.

To fit common conceptions of measuring the sizes of sets, a measure $\mu$ should, at a minimum, satisfy some properties.

Definition (First Wish List for Measures). A measure $\mu$ is a real-valued function defined on a collection of subsets of a space called the measurable sets. If $A$ is a measurable set, $\mu(A)$ is the measure of $A$. At a minimum, the structure must satisfy:

(Non-negativity) $\mu$ should be non-negative.

(Closed under finite unions) If $\{A_i\}_{i=1}^{n}$ is a finite collection of disjoint measurable sets, then $\bigcup_{i=1}^{n} A_i$ is measurable.

(Finite additivity) If $\{A_i\}_{i=1}^{n}$ is a collection of disjoint measurable sets, then

$$\mu\Big(\bigcup_{i=1}^{n} A_i\Big) = \sum_{i=1}^{n} \mu(A_i).$$

Thus, a measure is a non-negative, finitely additive set function, just like a probability function. There should be a connection here.

We pay particular attention to the case of real numbers:

Definition. If the space is an interval of real numbers and the measurable sets include intervals for which

$$\mu((a,b)) = \mu([a,b]) = \mu((a,b]) = \mu([a,b)) = b - a, \qquad a, b \in \mathbb{R},\ a \le b,$$

we call $\mu$ the Lebesgue measure on $\mathbb{R}$ and write $\mu = \mu_L$.

Note that this implies that the measure of a set consisting of a single point is zero, i.e., $\mu_L(\{a\}) = 0$.

Assigning probabilities to events in B

So far, we have identified $B$ with the interval of real numbers $I$, and we have introduced the desirability of a general way to measure the sizes of sets in $I$ along with some properties that such a measure should have. The next step is to set up a system for computing probabilities of events in $B$ using the measure. For simplicity, we consider the case when T and H occur with equal probability.

To start from what we know, we first consider the space consisting of Bernoulli trials of finite length $n$. The probability of H as the first outcome in any trial is $.5$, and likewise for the probability of T as the first outcome. This can be computed using simple counting over all possible trials of length $n$. Unfortunately, we cannot make a counting argument in the case of $B$, though intuition suggests that the probabilities are also $.5$.

Switching to sets of real numbers, if $A_H$ is the event in $B$ consisting of sequences where H is the first outcome, the corresponding set in $I = (0,1]$ is

$$I_{A_H} = \{x \in I : x = .1a_2a_3a_4\ldots,\ a_i = 0 \text{ or } 1\} = (.5, 1].$$

(Note that the largest number not in $I_{A_H}$ is $\frac{1}{2} = .1000\ldots$, while the largest number in $I_{A_H}$ is $1 = .1111\ldots$.) We do not include $1/2$ because we use non-terminating expansions, and the non-terminating expansion $.0111\ldots$ of $1/2$ begins with 0. Likewise, if $A_T$ is the event where T occurs as the first outcome, then $I_{A_T} = (0, .5]$. We have

$$\mu_L(I_{A_H}) = \mu_L(I_{A_T}) = .5.$$

In this case, based on the fact that $I_{A_H}$ and $I_{A_T}$ have equal measures, it seems reasonable to assign the probabilities

$$P(A_H) = \mu_L(I_{A_H}) = .5 \quad\text{and}\quad P(A_T) = \mu_L(I_{A_T}) = .5.$$

Next, if we consider the events $A_{HH}$, $A_{HT}$, $A_{TH}$, $A_{TT}$ in $B$ in which the first two outcomes HH, HT, TH, TT are specified, the corresponding intervals are

$$I_{A_{TT}} = (0, .25], \quad I_{A_{TH}} = (.25, .5], \quad I_{A_{HT}} = (.5, .75], \quad I_{A_{HH}} = (.75, 1].$$

Since these intervals have equal length, we assign the probability $.25$ to each interval and to each corresponding event.
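The passage from a specified prefix of outcomes to its interval can be sketched as follows. The helper name `prefix_interval` is hypothetical, and exact fractions are assumed so that the endpoints and lengths can be compared with the values listed above.

```python
from fractions import Fraction

def prefix_interval(prefix):
    """Return (left, right) such that the event 'the sequence starts with
    prefix' corresponds to the half-open interval (left, right]."""
    left = Fraction(0)
    for k, outcome in enumerate(prefix, start=1):
        if outcome == "H":               # an H in position k contributes 2**-k
            left += Fraction(1, 2**k)
    return left, left + Fraction(1, 2**len(prefix))

for p in ["TT", "TH", "HT", "HH"]:
    a, b = prefix_interval(p)
    print(p, (a, b), "length =", b - a)
```

Every prefix of length n produces an interval of length 2**-n, which is the probability assigned to the corresponding event.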
We can continue with this argument, considering the events corresponding to specification of the first three outcomes, then the first four outcomes, and so on. Considering the events in which the first $n$ outcomes are specified, we obtain $2^n$ intervals of equal length, and we assign equal probability $2^{-n}$ to each interval and thus to each event.

In this way, we obtain a sequence of binary partitions $\mathcal{T}_n$ of $I$ into $2^n$ nonoverlapping subintervals $I_{n,j}$ of equal length such that

$$I = \bigcup_{j=1}^{2^n} I_{n,j},$$

see Fig. 3.1. We assign equal probabilities to each subinterval in a given partition and to the corresponding events. Moreover, it appears that any interval $(a,b] \subset I$ can be approximated arbitrarily well by

$$\bigcup_{I_{n,j} \subset (a,b]} I_{n,j},$$

in the sense that the intervals of points not in the approximation,

$$(a,b] \setminus \bigcup_{I_{n,j} \subset (a,b]} I_{n,j},$$

shrink in size as $n$ increases, see Fig. 3.1.

[Figure 3.1. Illustration of the sequence of binary partitions $\mathcal{T}_n$ of $I$. We illustrate an approximation of the interval $(a,b)$ by subintervals in $\mathcal{T}_5$.]

In view of the Wish List and the fact that $\mu_L(I) = 1$, we extend these observations to a general principle of modeling.

Axiom 3.1.1 (The Measure Theory Model for Probability on $B$). If $A$ is an event in $B$, we let $I_A$ denote the corresponding set of real numbers in $(0,1]$. Then, we assign the probability of $A$, denoted by $P(A)$, to be $\mu_L(I_A)$.

All of this discussion is terribly vague, since we have not defined $\mu_L$, described the collection of measurable sets, or quantified the sense of approximation of sets observed above! But we verify that these ideas are useful in some simple examples below, and we show that they lead to stating and proving important theorems in the next couple of sections.

Example. Consider the event $A$ in which H is the $n$th outcome. Then,

$$I_A = \{x \in I : x = .a_1a_2\ldots a_{n-1}1a_{n+1}a_{n+2}\ldots,\ a_i = 0 \text{ or } 1\}.$$

Let $s = .a_1a_2\ldots a_{n-1}1$, so $I_A$ contains $(s, s + 2^{-n}]$. We can choose $a_1, a_2, \ldots, a_{n-1}$ in $2^{n-1}$ different ways, and the resulting intervals are disjoint from one another, so we use finite additivity to conclude that

$$P(A) = \mu_L(I_A) = 2^{n-1} \cdot 2^{-n} = 1/2.$$

As a concrete example, consider $n = 3$. Then we have the cases HTH, HHH, THH, TTH, corresponding to 4 disjoint intervals of length $1/2^3$, and $P(A) = 4/8 = 1/2$.

Example. Let $A$ be the event where exactly $i$ of the first $n$ outcomes are H, so

$$I_A = \{x \in I : x = .a_1a_2\ldots a_na_{n+1}\ldots,\ \text{exactly } i \text{ of the first } n \text{ digits are 1 and the remaining digits are 0 or 1}\}.$$

Choose $a_1, \ldots, a_n$ so that exactly $i$ are 1 and set $s = .a_1a_2\ldots a_n$. Then $I_A$ contains $(s, s + 2^{-n}]$. The intervals corresponding to different choices of $a_1, \ldots, a_n$ are disjoint, and there are exactly

$$\binom{n}{i} = \frac{n!}{i!\,(n-i)!}$$

such intervals. So

$$P(A) = \mu_L(I_A) = \binom{n}{i} \frac{1}{2^n}.$$
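Both examples amount to adding up the lengths of disjoint dyadic intervals. The sketch below checks them by brute-force enumeration for a small n; the helper `event_measure` is a hypothetical name, and the approach only works for events determined by the first n outcomes.

```python
from fractions import Fraction
from itertools import product
from math import comb

def event_measure(n, predicate):
    """Sum the lengths 2**-n of the dyadic intervals whose length-n prefix
    satisfies predicate (finite additivity over disjoint intervals)."""
    return sum(Fraction(1, 2**n)
               for prefix in product("HT", repeat=n) if predicate(prefix))

n = 6
p_third_head = event_measure(n, lambda w: w[2] == "H")         # H is the 3rd outcome
p_two_heads = event_measure(n, lambda w: w.count("H") == 2)    # exactly 2 heads in 6

print(p_third_head)                              # compare with 1/2
print(p_two_heads, Fraction(comb(n, 2), 2**n))   # compare with C(6,2)/2**6
```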

Recapping the construction of the model

We note that there are actually two modeling steps involved in Axiom 3.1.1:

Step 1. The adoption of the measure formulation for probability, which gives a procedure for computing probabilities of events;

Step 2. The assignment of specific probabilities to events in $B$, i.e., $P(A) = \mu_L(I_A)$ for $A \subset B$.

Step 1 is a proposal for how to carry out stochastic computations in a probability space with an infinite number of points. This use of measure theory is not entirely free from controversy, and alternative frameworks have been proposed. But it is fair to say that Kolmogorov's proposal of measure theory as a foundation for probability stands as one of the great mathematical achievements of the twentieth century. The worthiness of measure theory as a framework for probability is demonstrated in part by the ability to state and prove important probabilistic results. We present a couple of examples in the next two sections and many examples in later chapters.

The assignment of probabilities in Step 2 is subject to perhaps a greater degree of controversy. Partly, this is because randomness is used to model various situations, including systems that are truly stochastic in nature and systems whose state is unknown but not truly stochastic. Even if a system is random, there may be limited information on the probabilities of different events, and when there is information, it is often based on a finite set of observations. Above, we extrapolated from a finite set of examples to define $P(E) = \mu_L(I_E)$.

We conclude by noting that the model derived in this section can be applied to a variety of situations.

Example. We can use $I$ as an index set for the points in the space corresponding to the random throw of a dart onto the interval $I$, and it can index the time of arrival of a single $\alpha$ particle during a unit interval of time.
We can also extend these ideas to higher dimensions, e.g., by considering a square dart board.

3.2 The Weak Law of Large Numbers

Continuing the program of motivating measure theory as a model for probability in $B$, we use it to state and prove some important results in probability. Of course, we have not yet shown that it is possible to construct measures, and we have only described properties of measures under many restrictions. We tackle those issues later.

In the meantime, we begin by revisiting the Law of Large Numbers. Recall that intuition suggests that it should be possible to detect the probabilities of H and T in $B$ by examining the outcomes of many repetitions of the experiment. In particular, the number of times that H occurs in a large number of trials should be related to the probability of H. However, as discussed earlier, a precise statement of this intuition is difficult to formulate.

Assume the probability of H is $p$ and $S_n$ is the number of H's that occur in the first $n$ trials. If we could show that

$$\lim_{n \to \infty} \frac{S_n}{n} = p,$$

then this would be a mathematical statement expressing the intuition. But such a result cannot hold for every sequence: a sequence of experiments could yield outcomes of all H's, for example. So, we need to create a careful formulation.

To make things simple, we assume that the probabilities of H and T are both $.5$. To state and prove the desired result, we introduce some functions.

Definition. A random variable is a function on the outcomes of an experiment.

The name random variable is a rather disconcerting name to assign to a function! Expressing and proving results in probability by using random variables is a supremely important technique.

Definition. For $x \in I$, define the random variable

$$S_n(x) = a_1 + \cdots + a_n, \quad\text{where } x = .a_1a_2a_3\ldots$$

$S_n$ gives the number of heads in the first $n$ experiments of the Bernoulli sequence corresponding to $x$.

Definition. Given $\delta > 0$, define

$$I_n = \Big\{x \in I : \Big|\frac{S_n(x)}{n} - \frac{1}{2}\Big| > \delta\Big\}. \tag{3.2}$$

Roughly speaking, this is the event consisting of outcomes for which the numbers of H and T after $n$ trials are not approximately the same, where $\delta$ quantifies the discrepancy. We prove

Theorem 3.2.1 (Weak Law of Large Numbers for Bernoulli Sequences). For fixed $\delta > 0$,

$$\mu_L(I_n) \to 0 \quad\text{as } n \to \infty. \tag{3.3}$$

An observant reader should be uncomfortable with this conclusion, because $I_n$ is an apparently complicated set, and we have not yet specified a procedure for computing the measure $\mu_L$ of complicated sets. Fortunately, during the proof, it becomes apparent that $I_n$ is actually a finite collection of nonoverlapping intervals, for which $\mu_L$ is defined.

By definition, (3.3) means that for any fixed $\delta > 0$ and any $\varepsilon > 0$,

$$\mu_L\Big(\Big\{x \in I : \Big|\frac{S_n(x)}{n} - \frac{1}{2}\Big| > \delta\Big\}\Big) < \varepsilon$$

for all sufficiently large $n$. Identifying $\mu_L$ with $P$, we see that (3.3) extends the earlier Law of Large Numbers (2.4) to $B$.
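Because the set in (3.2) depends only on the first n outcomes, its measure can be computed exactly by counting prefixes with k heads, each prefix contributing an interval of length 2**-n. The following sketch does this for a fair coin; the function name `measure_I_n` and the sample values of n and the discrepancy 1/10 are illustrative assumptions, not taken from the text.

```python
from fractions import Fraction
from math import comb

def measure_I_n(n, delta):
    """Exact measure of {x : |S_n(x)/n - 1/2| > delta} for a fair coin,
    obtained by counting the length-n prefixes with k heads."""
    delta = Fraction(delta)
    bad = sum(comb(n, k) for k in range(n + 1)
              if abs(Fraction(k, n) - Fraction(1, 2)) > delta)
    return Fraction(bad, 2**n)

for n in [10, 100, 1000]:
    print(n, float(measure_I_n(n, Fraction(1, 10))))
```

The printed values decrease as n grows, consistent with (3.3); the sketch is a numerical check for particular n, not a proof.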

[Figure 3.2. Plots of the first three Rademacher functions $R_1$, $R_2$, $R_3$.]

Remark 3.1. The idea of measuring the size of the set where a function takes a specified range of values is central to measure theory. However, such a set is not a finite collection of disjoint intervals in general.

To prove the result, we reformulate it using two new random variables.

Definition. For $x \in I$, we define the $i$th Rademacher function by

$$R_i(x) = 2a_i - 1, \qquad x = .a_1a_2a_3\ldots$$

Equivalently,

$$R_i(x) = \begin{cases} 1, & a_i = 1, \\ -1, & a_i = 0. \end{cases}$$

We plot some of these functions in Fig. 3.2. $R_i$ has a useful interpretation. Suppose we bet on a sequence of coin tosses such that at each toss, we win \$1 if it is heads and lose \$1 if it is tails. Then $R_i(x)$ is the amount won or lost at the $i$th toss in the sequence of tosses represented by $x$.

The next random variable is:

Definition. We define

$$W_n(x) = \sum_{i=1}^{n} R_i(x).$$

Following the interpretation of $R_i$, $W_n$ gives the total amount won or lost after the $n$th toss in the betting game described above.

By the definition of $R_i$,

$$W_n(x) = 2(a_1 + a_2 + \cdots + a_n) - n = 2S_n(x) - n, \qquad x = .a_1a_2a_3\ldots$$

Now, for $x \in I$,

$$\Big|\frac{S_n(x)}{n} - \frac{1}{2}\Big| > \varepsilon$$

if and only if $|2S_n(x) - n| > 2\varepsilon n$, or in other words, if and only if

$$|W_n(x)| > 2\varepsilon n. \tag{3.4}$$
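The identity relating the winnings to the head count is easy to check on individual points. The sketch below is an illustration only: the helper names `digits`, `R`, `S`, and `W` are chosen to mirror the notation, and rational inputs with the non-terminating expansion convention are assumed.

```python
from fractions import Fraction

def digits(x, n):
    """First n digits a_1..a_n of the non-terminating binary expansion of x in (0, 1]."""
    x = Fraction(x)
    out = []
    for _ in range(n):
        x *= 2
        if x > 1:
            out.append(1)
            x -= 1
        else:
            out.append(0)
    return out

def R(i, x):
    """i-th Rademacher function: +1 if the i-th digit is 1, otherwise -1."""
    return 2 * digits(x, i)[i - 1] - 1

def S(n, x):
    """Number of heads (digits equal to 1) among the first n digits."""
    return sum(digits(x, n))

def W(n, x):
    """Net winnings after n tosses in the betting game."""
    return sum(R(i, x) for i in range(1, n + 1))

x = Fraction(5, 8)
print([R(i, x) for i in range(1, 6)], S(5, x), W(5, x))
```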

Note that since $\varepsilon$ is arbitrary, the factor 2 is immaterial.

Definition. We define

$$A_n = \{x \in I : |W_n(x)| > n\varepsilon\}.$$

We can prove Theorem 3.2.1 by showing that

$$\mu_L(A_n) \to 0 \quad\text{as } n \to \infty. \tag{3.5}$$

To do this, we use a special version of an important result.

Theorem 3.2.2 (Special Case of Chebyshev's Inequality). Let $f$ be a non-negative, piecewise constant function on $I$ and let $\alpha > 0$ be a real number. Then,

$$\mu_L(\{x \in I : f(x) > \alpha\}) \le \frac{1}{\alpha} \int_0^1 f(x)\,dx,$$

where the integral is the standard Riemann integral, which is well defined for piecewise constant, non-negative functions. The inequality is strict whenever the set on the left is nonempty.

We illustrate the theorem in Fig. 3.3.

[Figure 3.3. We illustrate a typical set in Chebyshev's inequality: the portions of the graph of $f$ lying above the level $\alpha$ are included in the set.]

Proof. [Theorem 3.2.2] Since $f$ is piecewise constant, there is a mesh $0 = x_1 < x_2 < \cdots < x_n = 1$ such that $f(x) = c_i$ for $x_i < x \le x_{i+1}$, $1 \le i \le n-1$. Then, since $f$ is non-negative,

$$\int_0^1 f(x)\,dx = \sum_{i=1}^{n-1} c_i (x_{i+1} - x_i) \ge \sum_{c_i > \alpha} c_i (x_{i+1} - x_i) \ge \alpha \sum_{c_i > \alpha} (x_{i+1} - x_i) = \alpha\,\mu_L(\{x \in I : f(x) > \alpha\}),$$

where the second inequality is strict if at least one $c_i > \alpha$.

Now we are ready to prove Theorem 3.2.1.

Proof. We can also describe the set $A_n$ as

$$A_n = \{x \in I : W_n^2(x) > n^2\varepsilon^2\},$$

where $W_n^2(x)$ is piecewise constant and non-negative. By Theorem 3.2.2,

$$\mu_L(A_n) \le \frac{1}{n^2\varepsilon^2} \int_0^1 W_n^2(x)\,dx.$$

We compute

$$\int_0^1 W_n^2(x)\,dx = \int_0^1 \Big(\sum_{i=1}^{n} R_i(x)\Big)^2 dx = \sum_{i=1}^{n} \int_0^1 R_i^2(x)\,dx + \sum_{\substack{i,j=1 \\ i \ne j}}^{n} \int_0^1 R_i(x)R_j(x)\,dx.$$

The first sum on the right is easy, since $R_i^2(x) = 1$ for all $x$, so

$$\sum_{i=1}^{n} \int_0^1 R_i^2(x)\,dx = n.$$

We consider $\int_0^1 R_i(x)R_j(x)\,dx$ when $i \ne j$. Without loss of generality, we assume $i < j$. Set $J$ to be the interval

$$J = \Big(\frac{l}{2^i}, \frac{l+1}{2^i}\Big], \qquad 0 \le l < 2^i.$$

$R_i$ is constant on $J$, while $R_j$ alternates between $-1$ and $+1$ on the $2^{j-i}$ subintervals of $J$ of length $2^{-j}$. Because this is an even number of subintervals of equal length, cancellation implies

$$\int_J R_i(x)R_j(x)\,dx = R_i(x) \int_J R_j(x)\,dx = 0.$$
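The cancellation claim, and the resulting value of the integral of the squared winnings, can be checked exactly for small indices. The sketch below assumes that a function depending only on the first `depth` binary digits is integrated by summing its values over all 2**depth dyadic intervals; the helper names are hypothetical.

```python
from fractions import Fraction
from itertools import product

def integral(g, depth):
    """Exact integral over (0, 1] of x -> g(a_1, ..., a_depth), a function
    that depends only on the first `depth` binary digits of x."""
    return sum(Fraction(g(a), 2**depth) for a in product((0, 1), repeat=depth))

def rademacher_product_integral(i, j):
    """Integral of R_i * R_j, where R_k = 2*a_k - 1."""
    return integral(lambda a: (2 * a[i - 1] - 1) * (2 * a[j - 1] - 1), max(i, j))

def w_squared_integral(n):
    """Integral of W_n**2, using W_n = 2*(a_1 + ... + a_n) - n."""
    return integral(lambda a: (2 * sum(a) - n) ** 2, n)

print(rademacher_product_integral(2, 5))   # distinct indices
print(rademacher_product_integral(3, 3))   # equal indices
print([w_squared_integral(n) for n in range(1, 7)])
```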

Therefore,

$$\int_0^1 R_i(x)R_j(x)\,dx = 0, \qquad i \ne j.$$

Thus,

$$\int_0^1 W_n^2(x)\,dx = n,$$

and

$$\mu_L(A_n) \le \frac{1}{n^2\varepsilon^2}\, n = \frac{1}{n\varepsilon^2} \to 0 \quad\text{as } n \to \infty.$$

By (3.4), this gives $\mu_L(I_n) \to 0$ as $n \to \infty$.

The random variables introduced for this proof can be used to quantify other interesting questions.

Example. Suppose that in the betting game above, we start with $M$ dollars. We compute an expression that yields the probability that we lose all the money. If $A_n$ is the event where we lose the money on the $n$th toss, then the corresponding set of numbers is

$$I_{A_n} = \{x \in I : W_i(x) > -M \text{ for } i < n \text{ and } W_n(x) = -M\}.$$

The set $I_{A_n}$, determined by where a function has prescribed values, is generally complicated. The event $A$ of losing all the money, given by

$$I_A = \bigcup_{n=1}^{\infty} I_{A_n},$$

is even more complicated. The probability of $A$ is $\mu_L(I_A)$, once we figure out how that is computed.

3.3 Sets of measure zero

Theorem 3.2.1 states that the size of the event consisting of Bernoulli sequences for a fair coin for which the relative frequency of H's in the first $N$ trials differs from $1/2$ by more than a fixed amount tends to 0 as $N \to \infty$. But this leaves open the question: For a fair coin and a typical $x$, does

$$\lim_{n \to \infty} \frac{S_n(x)}{n} = \frac{1}{2}\,? \tag{3.6}$$

This is an important question from the point of view of numerical simulation, as it is quite common that we have only one numerical sequence, corresponding to a single choice of $x$, in hand. Can we reliably use the computed example to approximate the answer to statistical questions?

Definition. The set of normal numbers in $I$ is

$$N = \Big\{x \in I : \frac{S_n(x)}{n} \to \frac{1}{2} \text{ as } n \to \infty\Big\}.$$
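For a few explicitly known points, the relative frequency of heads can be computed directly. The sketch below uses rational numbers, whose binary expansions are eventually periodic, so the limiting frequency is visible after finitely many digits; the function name `head_frequency` is hypothetical, and these examples say nothing about a typical point.

```python
from fractions import Fraction

def head_frequency(x, n):
    """S_n(x)/n, computed from the non-terminating binary expansion of x in (0, 1]."""
    x = Fraction(x)
    heads = 0
    for _ in range(n):
        x *= 2
        if x > 1:
            heads += 1
            x -= 1
    return Fraction(heads, n)

print(head_frequency(Fraction(1, 3), 1000))   # 1/3 = .010101...
print(head_frequency(Fraction(1, 7), 999))    # 1/7 = .001001...
print(head_frequency(1, 50))                  # 1   = .111111...
```

The first point has frequency 1/2 along even n, while the other two have frequencies 1/3 and 1, so both the set of normal numbers and its complement contain rational points.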

Another way to state the intuition behind the Law of Large Numbers is that the non-normal numbers should be atypical in some sense.

Definition. An event in $B$ is atypical if it has probability zero, or equivalently, if the corresponding set of real numbers has Lebesgue measure zero.

Thus, the intuition behind the Law of Large Numbers is that $N^c$ should have Lebesgue measure zero.

In this section, we characterize sets with Lebesgue measure zero. We noted above that the Lebesgue measure of a single point is zero. It follows immediately that finite collections of points also have Lebesgue measure zero. Infinite collections are apparently more complicated. For example, $I$ is the uncountable union of single points and does not have Lebesgue measure zero.

Working from the assumptions about measure we have made so far, we develop a general method for characterizing sets with Lebesgue measure zero. In doing so, we actually motivate several key aspects of measure theory.

The characterization is based on a fundamentally important concept for metric spaces.

Definition. Given a subset $A \subset \mathbb{R}^n$, a countable cover of $A$ is a countable collection of sets $\{A_i\}_{i=1}^{\infty}$ in $\mathbb{R}^n$ such that

$$A \subset \bigcup_{i=1}^{\infty} A_i.$$

If the sets in a countable cover are open, we call it an open cover.

We emphasize that the requirement of being countable is important.

Definition. A set $A \subset \mathbb{R}$ has Lebesgue measure zero if for every $\varepsilon > 0$, there is a countable cover $\{A_i\}_{i=1}^{\infty}$ of $A$, where each $A_i$ consists of a finite union of open intervals, such that

$$\sum_{i=1}^{\infty} \mu_L(A_i) < \varepsilon.$$

We also say that $A$ has measure zero.

Note that because each $A_i$ in the countable cover consists of a finite union of open intervals, its Lebesgue measure is computable. In this way, we sidestep the issue of computing $\mu_L(A)$ directly. This definition also uses (implicitly) another property of Lebesgue measure:

Definition. If $(c,d) \subset (a,b)$, then $\mu_L((c,d)) \le \mu_L((a,b))$. We say that Lebesgue measure is monotone.

We could use half-open or closed intervals in the definition instead of open intervals, but open intervals turn out to be convenient for compactness arguments.

Example. We show that a closed interval $[a,b]$ with $a < b$ cannot have measure zero. If $[a,b]$ is covered by countably many open intervals, we can extract a finite number of them that still cover $[a,b]$ (a finite subcover), because $[a,b]$ is compact. The sum of the lengths of these intervals must be at least $b - a$.

We describe some sets of measure zero.

Theorem.

1. A measurable subset of a set of measure zero has measure zero.

2. If $\{A_i\}_{i=1}^{\infty}$ is a countable collection of sets of measure zero, then $\bigcup_{i=1}^{\infty} A_i$ has measure zero.

3. Any finite or countable set of numbers has measure zero.

Result 2 states that a countable union of sets of measure zero is a set of measure zero. In contrast, uncountable unions of sets of measure zero can have nonzero measure. The assumption in 1. that the subset of the set of measure zero is measurable is an important point that we address in later chapters.

Proof. Result 1. This follows from the definition, since any countable cover of the larger set is also a cover of the smaller set.

Result 2. We choose $\varepsilon > 0$. Since $A_n$ has measure zero, there is a countable collection of open intervals $B_{n,1}, B_{n,2}, \ldots$ covering $A_n$ with

$$\sum_{i=1}^{\infty} \mu_L(B_{n,i}) \le \frac{\varepsilon}{2^n}.$$

The collection $\{B_{n,i}\}_{n,i=1}^{\infty}$ is countable and covers $\bigcup_{n=1}^{\infty} A_n$. Moreover,

$$\sum_{n,i=1}^{\infty} \mu_L(B_{n,i}) = \sum_{n=1}^{\infty} \Big(\sum_{i=1}^{\infty} \mu_L(B_{n,i})\Big) \le \sum_{n=1}^{\infty} \frac{\varepsilon}{2^n} = \varepsilon.$$

Since $\varepsilon$ is arbitrary, the total can be made strictly smaller than any prescribed positive number. Note that we use non-negativity to switch the order of summation in this argument.

Result 3. This follows from 2. and the observation that a point has measure zero.

Proof Comment 3.2. This is a classic measure theory argument that the reader should study until it is familiar.

An interesting question is whether or not there are any interesting sets of measure zero. We next show that there are uncountable sets of measure zero. In particular, we describe the construction of a special example that is used frequently in measure theory. The set is constructed by an iterative process.

Definition.

Step 1. Beginning with the unit interval $F_0 = [0,1]$, divide $F_0$ into 3 equal parts and remove the open middle third $\big(\frac{1}{3}, \frac{2}{3}\big)$ to get

$$F_1 = \Big[0, \frac{1}{3}\Big] \cup \Big[\frac{2}{3}, 1\Big].$$

See Fig. 3.4.

[Figure 3.4. The first step in the construction of the Cantor set.]

Step 2. Working on $F_1$ next, divide each of its two pieces into equal thirds and remove the open middle third from each to get

$$F_2 = \Big[0, \frac{1}{9}\Big] \cup \Big[\frac{2}{9}, \frac{1}{3}\Big] \cup \Big[\frac{2}{3}, \frac{7}{9}\Big] \cup \Big[\frac{8}{9}, 1\Big].$$

This has $2^2$ closed intervals of length $3^{-2}$, see Fig. 3.5.

[Figure 3.5. The second step in the construction of the Cantor set.]

Step i. Divide each of the $2^{i-1}$ pieces remaining after step $i-1$ into equal thirds and remove the open middle piece from each to get $F_i$. $F_i$ has $2^i$ closed intervals of length $3^{-i}$.

End result. This procedure yields a sequence of closed sets $\{F_i\}_{i=0}^{\infty}$, where each $F_i$ is a finite union of $2^i$ closed intervals of length $3^{-i}$. The Cantor (Middle Third) Set $C$ is defined by

$$C = \bigcap_{i=0}^{\infty} F_i.$$

Theorem. Let $C$ be the Cantor set in $\mathbb{R}$. Then,

1. $C$ is closed.

2. Every point in $C$ is a limit of a sequence of points in $C$.

3. $C$ has measure zero.

4. $C$ is uncountable.

Proof. Result 1. Exercise.

Result 2. Exercise.

Result 3. $C$ is contained in $F_i$ for any $i$. $F_i$ is a union of disjoint intervals whose lengths sum to $(2/3)^i$, and for any $\varepsilon > 0$, $(2/3)^i < \varepsilon$ for all sufficiently large $i$. Enlarging each closed interval slightly to an open interval keeps the total length below $\varepsilon$, so $C$ has measure zero.
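The construction steps and the length count used for Result 3 can be reproduced exactly for small i. In this sketch the function names `cantor_step` and `cantor_level` are hypothetical, and each closed interval is stored as a pair of exact endpoints.

```python
from fractions import Fraction

def cantor_step(intervals):
    """Remove the open middle third from each closed interval [a, b]."""
    out = []
    for a, b in intervals:
        third = (b - a) / 3
        out.append((a, a + third))
        out.append((b - third, b))
    return out

def cantor_level(i):
    """Closed intervals that remain after i removal steps, starting from [0, 1]."""
    intervals = [(Fraction(0), Fraction(1))]
    for _ in range(i):
        intervals = cantor_step(intervals)
    return intervals

for i in range(6):
    level = cantor_level(i)
    print(i, len(level), sum(b - a for a, b in level))   # count and total length
```

The total length at level i agrees with (2/3)**i, which becomes smaller than any prescribed positive number as i grows.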

Result 4. We show that every point $x \in C$ can be represented uniquely by a series of the form

$$x = \sum_{i=1}^{\infty} \frac{a_i}{3^i},$$

where $a_i = 0$ or $2$. This can be recognized as a base 3 expansion. To show uniqueness, if

$$\sum_{i=1}^{\infty} \frac{a_i}{3^i} = \sum_{i=1}^{\infty} \frac{b_i}{3^i}$$

for $a_i, b_i = 0$ or $2$, we show that $a_i = b_i$ for all $i$. Suppose $a_i \ne b_i$ for some $i$. Let $n$ be the smallest number with $a_n \ne b_n$, so $|a_n - b_n| = 2$. Since $|a_i - b_i| \le 2$ for all $i$,

$$0 = \Big|\sum_{i=1}^{\infty} \frac{a_i - b_i}{3^i}\Big| = \Big|\sum_{i=n}^{\infty} \frac{a_i - b_i}{3^i}\Big| \ge \frac{|a_n - b_n|}{3^n} - \sum_{i=n+1}^{\infty} \frac{|a_i - b_i|}{3^i} \ge \frac{2}{3^n} - 2\sum_{i=n+1}^{\infty} \frac{1}{3^i} = \frac{2}{3^n} - \frac{1}{3^n} = \frac{1}{3^n}.$$

This is a contradiction, and so every number in $C$ has a unique base 3 expansion with digits 0 and 2.

Now let $\{G_{i,j},\ j = 1, 2, \ldots, 2^{i-1}\}$ be the open middle third intervals removed to obtain $F_i$. Then, a number given by the base 3 expansion $.b_1b_2b_3\ldots$, $b_i = 0, 1, 2$, is in $G_{i,j}$ for some $j$ if and only if:

$b_j = 0$ or $2$ for each $j < i$, because it is in $F_{i-1}$;

$b_i = 1$, because it is in one of the discarded open intervals at this stage;

the $b_j$'s are not all 0 and not all 2 for $j > i$, because it is not an endpoint of the discarded interval.

It is a good exercise to use a variation of the Cantor diagonal argument to show that $C$ is uncountable.

To give some idea of the importance of the concept of sets of measure zero, we quote a beautiful result of Lebesgue that gives necessary and sufficient conditions for a function to be Riemann integrable. Recall that two aspects of Riemann integration provided significant impetus to the development of measure theory. First, there was a long search for minimal conditions on a function that are equivalent to the function being Riemann integrable. Second, the Riemann integral has some annoying flaws. We provide a theory for Riemann integration and discuss these issues in Appendix A. Here, we simply quote one of the most important results.

To explain the idea, we begin with a canonical example. First,

Definition. A property that holds except on a set of measure zero is said to hold almost everywhere (a.e.). We say that almost all points in a set have a property if all the points except those in a set of measure zero have the property.

Now, the example.

Definition. Dirichlet's function is defined by

$$D(x) = \begin{cases} 1, & \text{if } x \in \mathbb{Q}, \\ 0, & \text{if } x \notin \mathbb{Q}. \end{cases}$$

From the definition, $D$ is a bounded function and $D(x) = 0$ a.e. It is a simple exercise to show that $D$ is discontinuous at every point in $I$, and therefore $D$ is not continuous a.e.

We prove the following result in Appendix A.

Theorem (Lebesgue's Theorem on Riemann Integration). A bounded function is Riemann integrable on a closed interval if and only if it is continuous a.e. on the interval.

3.4 The Strong Law of Large Numbers

We return to analyzing the set of normal numbers $N$.

Theorem (Strong Law of Large Numbers for Bernoulli Sequences). $N^c$ is an uncountable set with Lebesgue measure zero.

Unlike the Weak Law of Large Numbers, Theorem 3.2.1, this theorem is a statement that requires measure theory. This version of the Law of Large Numbers is called strong because it implies the Weak Law, Theorem 3.2.1. This is a consequence of a general result on different kinds of convergence that we prove later on.

Proof. We first show that $N^c$ is uncountable and contains a Cantor-like set. Consider the map $f : I \to I$,

$$f(x) = .a_1 11 a_2 11 a_3 11 \ldots, \quad\text{for } x = .a_1a_2a_3\ldots$$

The map is 1-1, so its image is uncountable. Moreover, $f(I)$ is contained in $N^c$. In fact, if $y = f(x)$, then

$$S_{3n}(y) \ge 2n, \quad\text{and}\quad \frac{S_{3n}(y)}{3n} \ge \frac{2}{3}.$$

Such $y$'s clearly violate the Law of Large Numbers. The image set $f(I)$ is Cantor-like in that it is a countable nested intersection of sets, each consisting of a finite number of well-separated, disjoint intervals.

We cover the complicated set $N^c$ using a countable cover of much simpler sets. Recall the set $A_n = \{x \in I : |W_n(x)| > \varepsilon n\}$ used in the proof of the Weak Law of Large Numbers. We use an equivalent definition,

$$A_n = \{x \in I : W_n^4(x) > \varepsilon^4 n^4\}.$$

By Chebyshev's Inequality, Theorem 3.2.2,

$$\mu_L(A_n) \le \frac{1}{\varepsilon^4 n^4} \int_0^1 W_n^4\,dx = \frac{1}{\varepsilon^4 n^4} \int_0^1 \Big(\sum_{i=1}^{n} R_i\Big)^4 dx.$$

The integrand yields 5 kinds of terms:

1. $R_i^4$ for $i = 1, \ldots, n$.

2. $R_i^2 R_j^2$ for $i \ne j$.

3. $R_i^2 R_j R_k$ for $i \ne j \ne k$.

4. $R_i^3 R_j$ for $i \ne j$.

5. $R_i R_j R_k R_l$ for $i \ne j \ne k \ne l$.

Since $R_i^4(x) = 1$ and $R_i^2(x) R_j^2(x) = 1$ for all $i, j$,

$$\int_0^1 R_i^4\,dx = \int_0^1 R_i^2 R_j^2\,dx = 1.$$

We show that the other terms integrate to zero because of cancellation. Two follow from the proof of the Weak Law of Large Numbers:

$$\int_0^1 R_i^2 R_j R_k\,dx = \int_0^1 R_j R_k\,dx = 0, \quad i \ne j \ne k,$$

$$\int_0^1 R_i^3 R_j\,dx = \int_0^1 R_i R_j\,dx = 0, \quad i \ne j.$$

Finally, assume $i < j < k < l$, and consider an interval of the form

$$J = \Big(\frac{m}{2^k}, \frac{m+1}{2^k}\Big].$$

$R_i R_j R_k$ is constant on $J$. However, $R_l$ alternates between $-1$ and $+1$ on the $2^{l-k}$ subintervals of $J$ of length $2^{-l}$, which is an even number, so

$$\int_0^1 R_i R_j R_k R_l\,dx = 0.$$

There are $n$ terms of the first kind and $3n(n-1)$ terms of the second kind, so

$$\int_0^1 W_n^4(x)\,dx = 3n^2 - 2n \le 3n^2,$$

and

$$\mu_L(A_n) \le \frac{3}{n^2 \varepsilon^4}.$$
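The count of surviving terms can be cross-checked by computing the fourth moment directly. The sketch below assumes the identity that the winnings after n tosses equal twice the head count minus n, so the integrand depends only on the first n binary digits and the integral is a finite sum over 2**n dyadic intervals; `fourth_moment` is a hypothetical name.

```python
from fractions import Fraction
from itertools import product

def fourth_moment(n):
    """Exact integral over (0, 1] of W_n**4, where W_n = 2*S_n - n depends
    only on the first n binary digits of x."""
    return sum(Fraction((2 * sum(a) - n) ** 4, 2**n)
               for a in product((0, 1), repeat=n))

for n in range(1, 9):
    print(n, fourth_moment(n), 3 * n * n - 2 * n)
```

For each n tried, the two printed values coincide, matching the formula obtained from the term count.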

We cover $N^c$ using a collection of sets of the form $A_n$ for increasing $n$ and decreasing $\varepsilon$, chosen in such a way that the cover has arbitrarily small measure. For a constant $C$, set $\varepsilon_n^4 = C n^{-1/2}$, so

$$\sum_{n=1}^{\infty} \frac{3}{\varepsilon_n^4 n^2} = \frac{3}{C} \sum_{n=1}^{\infty} \frac{1}{n^{3/2}}.$$

The last series converges, and the quantity can be made smaller than any $\delta > 0$ by choosing $C$ sufficiently large. Hence, given $\delta > 0$, there is a sequence $\{\varepsilon_n\}$ with $\varepsilon_n \to 0$ such that

$$\sum_{n=1}^{\infty} \frac{3}{\varepsilon_n^4 n^2} \le \delta.$$

For each $n$, set

$$\tilde{A}_n = \{x \in I : |W_n(x)| > \varepsilon_n n\}.$$

Note that $\tilde{A}_n$ is a finite union of intervals, since $W_n$ is piecewise constant. We have

$$\mu_L(\tilde{A}_n) \le \frac{3}{\varepsilon_n^4 n^2},$$

and

$$\sum_{n=1}^{\infty} \mu_L(\tilde{A}_n) \le \delta.$$

If we show that $N^c \subset \bigcup_{n=1}^{\infty} \tilde{A}_n$, then we are done. This holds if $N \supset \bigcap_{n=1}^{\infty} \tilde{A}_n^c$. If $x \in \bigcap_{n=1}^{\infty} \tilde{A}_n^c$, then for each $n$,

$$|W_n(x)| \le \varepsilon_n n, \quad\text{or}\quad \Big|\frac{W_n(x)}{n}\Big| \le \varepsilon_n.$$

Since $\varepsilon_n \to 0$, we have $W_n(x)/n = 2\big(S_n(x)/n - 1/2\big) \to 0$, or $x \in N$.

The proof of the Strong Law can be used to draw stronger conclusions. For example, a normal number has the property that no finite sequence of digits occurs more frequently than any other finite sequence of digits of the same length.

3.5 A second wish list for measure theory

With some informal experience with measure theory ideas, we make a second attempt at a wish list of desirable properties for a measure theory. We consider the measure on $\mathbb{R}^n$ that extends the standard notions of length, area, and volume. If $E \subset \mathbb{R}^n$ for some $n$, let $\mu(E)$ denote its measure.

1. $\mu$ should be a non-negative set function from sets in $\mathbb{R}^n$ into the extended reals $\mathbb{R} \cup \{\infty\}$. $\mu(\{x\}) = 0$ for a single point. $\mu(A) = \infty$ should be possible for unbounded sets.

2. In $\mathbb{R}$, we should have $\mu([a,b]) = b - a$. In $\mathbb{R}^n$, we should have

$$\mu(Q) = (b_1 - a_1)(b_2 - a_2) \cdots (b_n - a_n)$$

for generalized rectangles (multi-intervals)

$$Q = \{x \in \mathbb{R}^n : a_i \le x_i \le b_i,\ 1 \le i \le n\}.$$

3. If $\{A_1, A_2, \dots, A_n\}$ are disjoint sets, then

\[ \mu(A_1 \cup A_2 \cup \dots \cup A_n) = \sum_{i=1}^n \mu(A_i). \]

What about infinite collections? Well, $\mu(\{x\}) = 0$. But in $\mathbb{R}$, $(0,1) = \bigcup_{x \in (0,1)} \{x\}$. This is a problem because we cannot have

\[ 1 = \mu((0,1)) = \mu\Big( \bigcup_{x \in (0,1)} \{x\} \Big) = \sum_{x \in (0,1)} \mu(\{x\}) = 0. \]

So, uncountable collections of sets are a problem and we avoid them. What about countable collections? Countable disjoint collections of sets of measure zero should have measure zero. Also,

\[ (0,1] = \left( \tfrac{1}{2}, 1 \right] \cup \left( \tfrac{1}{3}, \tfrac{1}{2} \right] \cup \left( \tfrac{1}{4}, \tfrac{1}{3} \right] \cup \cdots, \]

and

\[ 1 = \mu((0,1]) = \mu\left( \left( \tfrac{1}{2}, 1 \right] \right) + \mu\left( \left( \tfrac{1}{3}, \tfrac{1}{2} \right] \right) + \mu\left( \left( \tfrac{1}{4}, \tfrac{1}{3} \right] \right) + \cdots. \]

So we would like to say that if $\{A_i\}$ is a countable collection of disjoint sets, then

\[ \mu\Big( \bigcup_{i=1}^\infty A_i \Big) = \sum_{i=1}^\infty \mu(A_i). \]

4. If $A \subset B$ are sets, then $\mu(A) \le \mu(B)$; that is, $\mu$ should be monotone.

5. For the standard volume measure on $\mathbb{R}^n$, if a set $A$ is obtained from another set $B$ by rotation, translation, or reflection maps, then $\mu(A) = \mu(B)$.

It turns out that we cannot construct a desirable measure that satisfies all of these properties. We have to give up something, so we do not require that the measure be defined on all subsets of $\mathbb{R}^n$. We settle for a measure defined on a class of subsets.
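The countable additivity requirement in item 3 can be illustrated with a small exact computation. The sketch below is not from the original notes. It assumes the partition of $(0,1]$ into the disjoint half-open intervals $(1/(k+1), 1/k]$ for $k = 1, 2, \dots$, and it assumes that each interval is assigned its length as in item 2. The names `interval_length` and `partial_sum` are hypothetical.

```python
from fractions import Fraction


def interval_length(k):
    """Length of the half-open interval (1/(k+1), 1/k]."""
    return Fraction(1, k) - Fraction(1, k + 1)


def partial_sum(K):
    """Total length of the first K intervals, computed exactly."""
    return sum(interval_length(k) for k in range(1, K + 1))


for K in (1, 10, 100, 1000):
    # The sum telescopes to 1 - 1/(K+1), which increases to 1 as K grows.
    assert partial_sum(K) == 1 - Fraction(1, K + 1)
    print(K, float(partial_sum(K)))
```

The partial sums never reach 1 for any finite $K$. Only the infinite sum equals $\mu((0,1]) = 1$, which is why finite additivity alone does not give the desired identity.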


More information

Recap of Basic Probability Theory

Recap of Basic Probability Theory 02407 Stochastic Processes Recap of Basic Probability Theory Uffe Høgsbro Thygesen Informatics and Mathematical Modelling Technical University of Denmark 2800 Kgs. Lyngby Denmark Email:

More information

6.2 Fubini s Theorem. (µ ν)(c) = f C (x) dµ(x). (6.2) Proof. Note that (X Y, A B, µ ν) must be σ-finite as well, so that.

6.2 Fubini s Theorem. (µ ν)(c) = f C (x) dµ(x). (6.2) Proof. Note that (X Y, A B, µ ν) must be σ-finite as well, so that. 6.2 Fubini s Theorem Theorem 6.2.1. (Fubini s theorem - first form) Let (, A, µ) and (, B, ν) be complete σ-finite measure spaces. Let C = A B. Then for each µ ν- measurable set C C the section x C is

More information

Chapter 8: An Introduction to Probability and Statistics

Chapter 8: An Introduction to Probability and Statistics Course S3, 200 07 Chapter 8: An Introduction to Probability and Statistics This material is covered in the book: Erwin Kreyszig, Advanced Engineering Mathematics (9th edition) Chapter 24 (not including

More information

MATHS 730 FC Lecture Notes March 5, Introduction

MATHS 730 FC Lecture Notes March 5, Introduction 1 INTRODUCTION MATHS 730 FC Lecture Notes March 5, 2014 1 Introduction Definition. If A, B are sets and there exists a bijection A B, they have the same cardinality, which we write as A, #A. If there exists

More information