MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 11 10/15/2008 ABSTRACT INTEGRATION I

Similar documents
A := A i : {A i } S. is an algebra. The same object is obtained when the union in required to be disjoint.

3.8 Three Types of Convergence

Prerequisites. We recall: Theorem 2 A subset of a countably innite set is countable.

Solutions 1. Introduction to Coding Theory - Spring 2010 Solutions 1. Exercise 1.1. See Examples 1.2 and 1.11 in the course notes.

Some Classical Ergodic Theorems

New upper bound for the B-spline basis condition number II. K. Scherer. Institut fur Angewandte Mathematik, Universitat Bonn, Bonn, Germany.

Measure and integration

Question 1. Question 3. Question 4. Graduate Analysis I Exercise 4

Physics 215 Winter The Density Matrix

4 = (0.02) 3 13, = 0.25 because = 25. Simi-

1 Measurable Functions

ABSTRACT EXPECTATION

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 8 10/1/2008 CONTINUOUS RANDOM VARIABLES

Math Real Analysis The Henstock-Kurzweil Integral

Computational and Statistical Learning Theory

E0 370 Statistical Learning Theory Lecture 5 (Aug 25, 2011)

E0 370 Statistical Learning Theory Lecture 6 (Aug 30, 2011) Margin Analysis

Lecture 21: Expectation of CRVs, Fatou s Lemma and DCT Integration of Continuous Random Variables

The Weierstrass Approximation Theorem

Uniform Approximation and Bernstein Polynomials with Coefficients in the Unit Interval

On Process Complexity

Spring 2014 Advanced Probability Overview. Lecture Notes Set 1: Course Overview, σ-fields, and Measures

18.175: Lecture 3 Integration

In this chapter, we consider several graph-theoretic and probabilistic models

Supplement to: Subsampling Methods for Persistent Homology

LORENTZ SPACES AND REAL INTERPOLATION THE KEEL-TAO APPROACH

36-752: Lecture 1. We will use measures to say how large sets are. First, we have to decide which sets we will measure.

The isomorphism problem of Hausdorff measures and Hölder restrictions of functions

Finite fields. and we ve used it in various examples and homework problems. In these notes I will introduce more finite fields

The Euler-Maclaurin Formula and Sums of Powers

3.3 Variational Characterization of Singular Values

Lectures 8 & 9: The Z-transform.

Proof of Radon-Nikodym theorem

Math Reviews classifications (2000): Primary 54F05; Secondary 54D20, 54D65

On Conditions for Linearity of Optimal Estimation

4 Integration 4.1 Integration of non-negative simple functions

Lecture 5: Expectation

STOPPING SIMULATED PATHS EARLY

Compendium and Solutions to exercises TMA4225 Foundation of analysis

The Frequent Paucity of Trivial Strings

Lebesgue measure and integration

arxiv: v2 [math.co] 3 Dec 2008

Probability and Measure

Block designs and statistics

Lecture 21 Principle of Inclusion and Exclusion

Lecture 3: Expected Value. These integrals are taken over all of Ω. If we wish to integrate over a measurable subset A Ω, we will write

Integration on Measure Spaces

FAST DYNAMO ON THE REAL LINE

Machine Learning Basics: Estimators, Bias and Variance

The degree of a typical vertex in generalized random intersection graph models

1.4 Outer measures 10 CHAPTER 1. MEASURE

Course Notes for EE227C (Spring 2018): Convex Optimization and Approximation

Lecture 7. Sums of random variables

Approximation in Stochastic Scheduling: The Power of LP-Based Priority Policies

Probability and Measure

Polygonal Designs: Existence and Construction

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 9 10/2/2013. Conditional expectations, filtration and martingales

Plasma-Wall Interaction: Sheath and Pre-sheath

arxiv: v1 [math.pr] 17 May 2009

MATH 418: Lectures on Conditional Expectation

On Certain C-Test Words for Free Groups

Understanding Machine Learning Solution Manual

If Y and Y 0 satisfy (1-2), then Y = Y 0 a.s.

A1. Find all ordered pairs (a, b) of positive integers for which 1 a + 1 b = 3

Chapter 6 1-D Continuous Groups

Hermite s Rule Surpasses Simpson s: in Mathematics Curricula Simpson s Rule. Should be Replaced by Hermite s

MULTIPLAYER ROCK-PAPER-SCISSORS

arxiv: v1 [math.nt] 14 Sep 2014

Asynchronous Gossip Algorithms for Stochastic Optimization

Product measures, Tonelli s and Fubini s theorems For use in MAT4410, autumn 2017 Nadia S. Larsen. 17 November 2017.

Max-Product Shepard Approximation Operators

Learnability and Stability in the General Learning Setting

Chapter 6. Integration. 1. Integrals of Nonnegative Functions. a j µ(e j ) (ca j )µ(e j ) = c X. and ψ =

13.2 Fully Polynomial Randomized Approximation Scheme for Permanent of Random 0-1 Matrices

A Simple Regression Problem

Figure 1: Equivalent electric (RC) circuit of a neurons membrane

Effects of an Inhomogeneous Magnetic Field (E =0)

Constrained Consensus and Optimization in Multi-Agent Networks arxiv: v2 [math.oc] 17 Dec 2008

( f ^ M _ M 0 )dµ (5.1)

Computable Shell Decomposition Bounds

Testing Properties of Collections of Distributions

Lecture 22: Variance and Covariance

Independent random variables

Probability and Random Processes

4 Expectation & the Lebesgue Theorems

Research Article On the Isolated Vertices and Connectivity in Random Intersection Graphs

Probability Distributions

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 3 9/10/2008 CONDITIONING AND INDEPENDENCE

arxiv: v1 [math.co] 19 Apr 2017

The Lebesgue Integral

Standard & Canonical Forms

Introduction to Optimization Techniques. Nonlinear Programming

#A52 INTEGERS 10 (2010), COMBINATORIAL INTERPRETATIONS OF BINOMIAL COEFFICIENT ANALOGUES RELATED TO LUCAS SEQUENCES

The Methods of Solution for Constrained Nonlinear Programming

Fourier Series Summary (From Salivahanan et al, 2002)

12 Towards hydrodynamic equations J Nonlinear Dynamics II: Continuum Systems Lecture 12 Spring 2015

Asymptotics of weighted random sums

This article appeared in a journal published by Elsevier. The attached copy is furnished to the author for internal non-commercial research and

Three hours THE UNIVERSITY OF MANCHESTER. 24th January

Distributional transformations, orthogonal polynomials, and Stein characterizations

Transcription:

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 11 10/15/2008 ABSTRACT INTEGRATION I Contents 1. Preliinaries 2. The ain result 3. The Rieann integral 4. The integral of a nonnegative siple function 5. The integral of a nonnegative function 6. The general case The aterial in these notes can be found in practically every textboo that includes basic easure theory, although the order with which various properties are proved can be soewhat different in different sources. 1 PRELIMINARIES The objective of these notes is to define the integral g dµ [soeties also denoted g(ω) dµ(ω)] of a easurable function g : Ω R, defined on a easure space (Ω, F, µ). Special cases: (a) If (Ω, F, P) is a probability space and X : Ω R is easurable (i.e., an extended-valued rando variable), the integral X dp is also denoted E[X], and is called the expectation of X. (b) If we are dealing with the easure space (R, B, λ), where B is the Borel σ-field and λ is the Borel easure, the integral g dλ is often denoted as g(x) dx, and is eant to be a generalization of the usual integral encountered in calculus. 1

The progra: We will define the integral g dµ for progressively general classes of easurable functions g: (a) Finite nonnegative functions g that tae finitely any values ( siple functions ). In this case, the integral is just a suitably weighted su of the values of g. (b) Nonnegative functions g. Here, the integral will be defined by approxiating g fro below by a sequence of siple functions. (c) General functions g. This is done by decoposing g in the for g = g + g, where g + and g are nonnegative functions, and letting g dµ = g + dµ g dµ. We will be focusing on the integral over the entire set Ω. The integral over a (easurable) subset B of Ω will be defined by letting g dµ = (1 B g) dµ. Here 1 B is the indicator function of the set g, so that { g(ω), if ω B, (1 B g)(ω) = 0, if ω / B. B Throughout, we will use the ter alost everywhere to ean for all ω outside a zero-easure subset of Ω. For the special case of probability easures, we will often use the alternative terinology alost surely, or a.s. for short. Thus, if X and ( Y are rando variables, ) we have X = Y, a.s., if and only if P(X = Y ) = P {ω : X(ω) Y (ω)} = 0. In the sequel, an inequality g h between two functions will be interpreted as g(ω) h(ω), for all ω. Siilarly, g h, a.e., eans that g(ω) h(ω), for all ω outside a zero-easure set. The notation g n g will ean that for every ω, the sequence g n (ω) is onotone nondecreasing and converges to g(ω). Finally, g n g, a.e., will ean that the onotonic convergence of g n (ω) to g(ω) holds for all ω outside a zero-easure set. 2 THE MAIN RESULT Once the construction is carried out, integrals of nonnegative functions will always be well-defined. For general functions, integrals will be left undefined only when an expression of the for is encountered. The following properties will turn out to be true, whenever the integrals or expectations involved are well-defined. On the left, we show the general version; 2

on the right, we show the sae property, specialized to the case of probability easures. In property 8, the convention 0 will be in effect when needed. 1. 2. 3. 4. 4 1 B dµ = µ(b) g 0 g dµ 0 g = 0, a.e. g dµ = 0 g h g dµ h dµ g h, a.e. g dµ h dµ E[1 B ] = P(B) X 0 E[X] 0 X = 0, a.s. E[X] = 0 X Y E[X] E[Y ] X Y, a.s. E[X] E[Y ] 5. g = h, a.e. g dµ = h dµ X = Y, a.s. E[X] = E[Y ] 6. [g 0, a.e., and g dµ = 0] g = 0, a.e. [X 0, a.s., and E[X] 0] X = 0, a.s. 7. 8. (g + h) dµ = g dµ + h dµ (ag) dµ = a g dµ E[X + Y ] = E[X] + E[Y ] E[aX] = ae[x] 9. 0 g n g g n dµ g dµ 0 X n X, E[X n ] E[X] 9. 0 g n g, a.e. g n dµ g dµ 0 X n X, a.s. E[X n ] E[X] 10. g 0 ν(b) = B g dµ is a easure [f 0 and f dp = 1] ν(b) = f dp is a probability easure Property 9, and its generalization, property 9, is nown as the Monotone Convergence Theore (MCT), and is a cornerstone of integration theory. 3 THE RIEMANN INTEGRAL Before proceeding, it is worth understanding why the traditional integral en countered in calculus is not adequate for our purposes. Let us recall the defini b tion of the (Rieann) integral a g(x) dx. We subdivide the interval [a, b] using a finite sequence σ = (x 1, x 2,..., x n ) of points that satisfy a = x 1 < x 2 < < x n = b, and define n 1 ( ) U(σ) = ax g(x) (x i+1 x i ), x i x<x i+1 i=1 n 1 ( ) L(σ) = in g(x) (x i+1 x i ). x i x<x i+1 i=1 B 3

Thus, U(σ) and L(σ) are approxiations of the area under the curve g, fro b above and fro below, respectively. We say that the integral a g(x) dx is welldefined, and equal to a constant c, if sup L(σ) = inf U(σ) = c. σ σ In this case, we also say that g is Rieann-integrable over [a, b]. Intuitively, we want the upper and lower approxiants U(σ) and L(σ) to agree, in the liit of very fine subdivisions of the interval [a, b]. It is nown that if g : R R is Rieann-integrable over every interval [a, b], then g is continuous alost everywhere (i.e., there exists a set S of Lebesgue easure zero, such that g is continuous at every x / S). This is a severe liitation on the class of Rieann-integrable functions. Exaple. Let Q be the set of rational nubers in [0, 1]. Let g = 1 Q. For any σ = (x 1, x 2,..., x n ), and every i, the interval [x i, x i+1 ) contains a rational nuber, and also an irrational nuber. Thus, ax xi x<x i+1 g(x) = 1 and in xi x<x i+1 g(x) = 0. It follows that U(σ) = 1 and L(σ) = 0, for all σ, and sup σ L(σ) = inf σ U(σ). Therefore 1 Q is not Rieann integrable. On the other hand if we consider a unifor distribution over [0, 1], and the binary rando variable 1 Q, we have P(1 Q = 1) = 0, and we would lie to be able to say that E[X] = [0,1] 1 Q(x) dx = 0. This indicates that a different definition is in order. 4 THE INTEGRAL OF A NONNEGATIVE SIMPLE FUNCTION A function g : Ω R is called siple if it is easurable, finite-value, and taes finitely any different values. In particular, a siple function can be written as g(ω) = a i 1 Ai (ω), ω, (1) i=1 where is a (finite) nonnegative integer, the coefficients a i are nonzero and finite, and the A i are easurable sets. Note that a siple function can have several representations of the for (1). For exaple, 1 [0,2] and 1 [0,1] + 1 (1,2] are two representations of the sae function. For another exaple, note that 1 [0,2] + 1 [1,2] = 1 [0,1) + 2 1 [1,2]. On the other hand, if we require the a i to be distinct and the sets A i to be disjoint, it is not hard to see that there is only one possible representation, which we will call the canonical representation. More concretely, in the canonical representation, we let {a 1,..., a } be the range of g, where the a i are distinct, and A i = {ω g(ω) = a i }. 4

Definition 1. If g is a siple function, of the for (1), its integral is defined by g dµ = a i µ(a i ). i=1 Before continuing, we need to ae sure that Definition 1 is sound, in the following sense. If we consider two alternative representations of the sae siple function, we need to ensure that the resulting value of g dµ is the sae. Technically, we need to show the following: if a i 1 Ai = b i 1 Bi, then a i µ(a i ) = b i µ(b i ). i=1 i=1 i=1 i=1 This is left as an exercise for the reader. Exaple. We have 1 [0,2] + 1 [1,2] = 1 [0,1) + 2 1 [1,2]. The first representation leads to µ([0, 2]) + µ([1, 2], the second to µ([0, 1)) + 2µ([1, 2]). Using the fact µ([0, 2]) = µ([0, 1)) + µ([1, 2]) (finite additivity), we see that the two values are indeed equal. For the case where the underlying easure is a probability easure P, a siple function X : Ω R is called a siple rando variable, and its integral X dp is also denoted as E[X]. We then have E[X] = a i P(A i ). i=1 If the coefficients a i are distinct, equal to the possible values of X, and by taing A i = {ω X(ω) = a i }, we obtain E[X] = a i P({ω X(ω) = a i }) = a i P(X = a i ), i=1 i=1 which agrees with the eleentary definition of E[X] for discrete rando variables. Note, for future reference, that the su or difference of two siple functions is also siple. 4.1 Verification of various properties for the case of siple functions For the various properties listed in Section 2, we will use the shorthand property S-A and property N-A, to refer to property A for the special case of siple functions and property A for nonnegative easurable functions, respectively. 5

We note a few iediate consequences of the definition. For any B F, The function 1 B is siple and 1 B dµ = µ(b), which verifies property 1. In particular, when Q is the set of rational nubers and µ is Lebesgue easure, we have 1 Q dµ = µ(q) = 0, as desired. Note that a nonnegative siple function has a representation of the for (1) with all a i positive. It follows that g dµ 0, which verifies property S-2. Suppose now that a siple function satisfies g = 0, a.e. Then, it has a canonical representation of the for g = i=1 a i1 Ai, where µ(a i ) = 0, for every i. Definition 1 iplies that g dµ = 0, which verifies property S-3. Let us now verify the linearity property S-7. Let g and h be nonnegative siple functions. Using canonical representations, we can write g = a i 1 Ai, h = b j 1 Bj, i=1 j=1 where the sets A i are disjoint, and the sets B j are also disjoint. Then, the sets A i B j are disjoint, and Therefore, g + h = (a i + b j )1 Ai B j. i=1 j=1 (g + h) dµ = (a i + b j )µ(a i B j ) i=1 j=1 = a i µ(a i B j ) + b j µ(a i B j ) i=1 j=1 j=1 i=1 = a i µ(a i ) + b j µ(b j ) i=1 j=1 = g dµ + h dµ. (The first and fourth equalities follow fro Definition 1. The third equality ade use of finite additivity for µ.) Property S-8 is an iediate consequence of Definition 1. We only need to be careful for the case where g dµ = and a = 0. Using the convention 0, we see that ag = 0, so that (ag) dµ = 0 = 0 = a g dµ, and the 6

property indeed holds. By cobining properties S-7 and S-8, with a = 1 we see that, for siple functions g and h we have (g h) dµ = g dµ h dµ. We now verify property S-4, which also iplies property S-4 as a special case. Suppose that g h, a.e. We then have h = g + q, for a siple function q such that q 0, a.e. In particular, q = q q, where q + 0, q 0, and q = 0, a.e. Thus, h = g + q q. Note that q, q +, and q are all siple functions. Using the linearity property S-7and then properties S-3, S-2, we obtain h dµ = g dµ + q + dµ q dµ = g dµ + q + dµ g dµ. We next verify property S-5. If g = h, a.e., then we have both g h, a.e., and h g, a.e. Thus, g dµ h dµ, and g dµ h dµ, which iplies that g dµ = h dµ. We finally verify property S-6. Suppose that g 0, a.e., and g dµ = 0. We write g = g + g, where g + 0 and g 0. Then, g = 0, a.e., and g dµ = 0. Thus, using property S-7, g + dµ = g dµ + g dµ = 0. Note that g + is siple. Hence, its canonical representation is of the for g + = i=1 a i1 Ai, with a i > 0. Since i=1 a iµ(a i ) = 0, it follows that µ(a i ) = 0, for every i. Fro finite additivity, we conclude that µ( i=1 A i ) = 0. Therefore, g + = 0, a.e., and also g = 0, a.e. 5 THE INTEGRAL OF A NONNEGATIVE FUNCTION The integral of a nonnegative function g will be defined by approxiating g fro below, using siple functions. Definition 2. For a easurable function g : Ω [0, ], we let S(g) be the set of all nonnegative siple (hence autoatically easurable) functions q that satisfy q g, and define g dµ = sup q dµ. q S(g) We will now verify that with this definition, properties N-2 to N-10 are all satisfied. This is easy for soe (e.g., property N-2). Most of our effort will be devoted to establishing properties N-7 (linearity) and N-9 (onotone convergence theore). The arguents that follow will ae occasional use of the following continuity property for onotonic sequences of easurable sets B i : If B i B, then 7

µ(b i ) µ(b). This property was established in the notes for Lecture 1, for the special case where µ is a probability easure, but the sae proof applies to the general case. 5.1 Verification of soe easy properties Throughout this subsection, we assue that g is easurable and nonnegative. Property N-2: For every q S(g), we have q dµ 0 (property S-2). Thus, g dµ = sup q S(g) q dµ 0. Property N-3: If g = 0, a.e, and 0 q g, then q = 0, a.e. Therefore, q dµ = 0 for every q S(g) (by property S-3), which iplies that g dµ = 0. Property N-4: Suppose that 0 g h. Then, S(g) S(h), which iplies that g dµ = sup q dµ sup q dµ = h dµ. q S(g) q S(h) Property N-5: Suppose that g = h, a.e. Let A = {ω g(ω) = h(ω)}, and note that the copleent of A has zero easure, so that q = 1 A q, a.e., for any function q. Then, g dµ = sup q dµ = sup 1 A q dµ sup q dµ q S(g) q S(g) q S(1 A g) sup q dµ = h dµ. q S(h) A syetrical arguent yields h, dµ g dµ. Exercise: Justify the above sequence of equalities and inequalities. Property N-4 : Suppose that g h, a.e. Then, there exists a function g such that g h and g = g, a.e. Property N-5 yields g dµ = g dµ. Property N-4 yields g dµ h dµ. These iply that g dµ h dµ. Property N-6: Suppose that g 0 but the relation g = 0, a.e., is not true. We will show that g dµ > 0. Let B = {ω g(ω) > 0}. Then, µ(b) > 0. Let B n = {ω g(ω) > 1/n}. Then, B n B and, therefore, µ(b n ) µ(b) > 0. This shows that for soe n we have µ(b n ) > 0. Note that g (1/n)1 Bn. Then, properties S-4, S-8, and 1 yield 1 1 1 g dµ n 1 B n dµ = 1 Bn dµ = µ(b n ) > 0. n n 8

Property N-8, when a 0: If a = 0, the result is iediate. Assue that a > 0. It is not hard to see that q S(g) if and only if aq S(ag). Thus, (ag) dµ = sup q dµ = sup (aq) dµ = sup (aq) dµ = a q dµ. q S(ag) aq S(ag) q S(g) 5.2 Proof of the Monotone Convergence Theore We first provide the proof of property 9, for the special case where g is equal to a siple function q, and then generalize. Let q be a nonnegative siple function, represented in the for q = i=1 a i1 Ai, where the a i are finite positive nubers, and the sets A i are easurable and disjoint. Let g n be a sequence of nonnegative easurable functions such that g n q. We distinguish between two different cases, depending on whether q dµ is finite or infinite. (i) Suppose that q dµ =. This iplies that µ(a i ) = for soe i. It follows that µ(a i ) = for soe i. Fix such an i, and let B n = {ω A i g n (ω) > a i /2}. For every ω A i, there exists soe n such that g n (ω) > a i /2. Therefore, B n A i. Fro the continuity of easures, we obtain µ(b n ). Now, note that g n (a i /2)1 Bn. Then, using property N-4, we have g n dµ a i µ(bn ) = q dµ. 2 (ii) Suppose now that q dµ <. Then, µ(a i ) <, for all i S. Let A = i=1 A i. By finite additivity, we have µ(a) <. Let us fix a positive integer r such that 1/r < a. Let { } B n = ω A g n (ω) q(ω) (1/r). We observe that B n A and, by continuity, µ(b n ) µ(a). Since µ(a) = µ(b n ) + µ(a \ B n ), and µ(a) <, this also yields µ(a \ B n ) 0. Note that 1 A q = q, a.e. Using properties, S-5 and S-7, we have q dµ = 1 A q dµ = 1 Bn q dµ + 1 A\Bn q dµ. (2) 9

For ω B n, we have g n (ω) + (1/r) q(ω). Thus, g n + (1/r)1 Bn 1 Bn q. Using properties N-4 and S-7, together with Eq. (2), we have 1 g n dµ + 1 Bn dµ 1 Bn q dµ = q dµ 1 A\Bn q dµ r q dµ aµ(a \ B n ). By taing the liit as n, we obtain 1 li g n dµ + µ(a) q dµ. n r Since this is true for every r > 1/a, we ust have li g n dµ q dµ. n On the other hand, we have g n q, so that g n dµ q dµ, and li n g n dµ q dµ. We now turn to the general case. We assue that 0 g n g. Suppose that q S(g), so that 0 q g. We have Therefore, 0 in{g n, q} in{g, q} = q. li g n dµ li in{g n, q} dµ = q dµ. n n (The inequality above uses property N-4; the equality relies on the fact that we already proved the MCT for the case where the liit function is siple.) By taing the supreu over q S(g), we obtain li g n dµ sup q dµ = g dµ. n q S(g) On the other hand, we have g n g, so that g n dµ g dµ. Therefore, li n g n dµ g dµ, which concludes the proof of property N-9. To prove property 9, suppose that g n g, a.e. Then, there exist functions g n and g, such that g n = g n, a.e., g = g, a.e., and g n g. By cobining properties N-5 and N-9, we obtain li g n dµ = li g n dµ = g dµ = g dµ. n n 10

5.3 Approxiating g fro below using special siple functions Let g be a nonnegative easurable function. Fro the definition of g dµ, it follows that there exists a sequence q n S(g) such that q n dµ q dµ. This does not provide us with uch inforation on the sequence q n. In contrast, the construction that follows provides us with a concrete way of approxiating g dµ. For any positive integer r, we define a function g r : Ω R by letting { r, if g(ω) r g r (ω) = i i i + 1 2 r, if 2 r g(ω) < 2 r, i = 0, 1,..., r2 r 1 In words, the function g r is a quantized version of g. For every ω, the value of g(ω) is first capped at r, and then rounded down to the nearest ultiple of 2 r. We note a few properties of g r that are direct consequences of its definition. (a) For every r, the function g r is siple (and, in particular, easurable). (b) We have 0 g r g; that is, for every ω, we have g r (ω) g(ω). (c) If g is bounded above by c and r c, then g r (ω) g(ω) 1/2 r, for every ω. Stateent (b) above gives us a transparent characterization of the set of easurable functions. Naely, a nonnegative function is easurable if and only if it is the onotonic and pointwise liit of siple functions. Furtherore, the MCT indicates that g r dµ g dµ, for this particular choice of siple functions g r. (In an alternative line of developent of the subject, soe texts start by defining g dµ as the liit of g r dµ.) 5.4 Linearity We now prove linearity (property N-7). Let g r and h r be the approxiants of g and h, respectively, defined in Section 5.3. Since g r g and h r h, we have (g r + h r ) (g + h). Therefore, using the MCT and property S-7 (linearity for siple functions), (g + h) dµ = li (g r + h r ) dµ r ( ) = li g r dµ + h r dµ r = li g r dµ + li h r dµ r r = g dµ + h dµ. 11

6 THE GENERAL CASE Consider now a easurable function g : Ω R. Let A + = {ω g(ω) > 0} and A = {ω g(ω) < 0}; note that these are easurable sets. Let g + = g 1 A+ and g = 1 A g; note that these are nonnegative easurable functions. We then have g = g + g, and we define g dµ = g + dµ g dµ. The integral g dµ is well-defined, as long as we do not have both g + dµ and g dµ equal to infinity. With this definition, verifying properties 3-8 is not too difficult, and there are no surprises. We decopose the functions g and h into negative and positive parts, and apply the properties already proved for the nonnegative case. The details are left as an exercise. In order to prove the last property (property 10) one uses countable additivity for the easure µ, a liiting arguent based on the approxiation of g by siple functions, and the MCT. The detailed proof is left as an exercise for the reader. 12

MIT OpenCourseWare http://ocw.it.edu 6.436J / 15.085J Fundaentals of Probability Fall 2008 For inforation about citing these aterials or our Ters of Use, visit: http://ocw.it.edu/ters.