arxiv:math/ v1 [math.pr] 21 Aug 2006
|
|
- Anissa O’Neal’
- 5 years ago
- Views:
Transcription
1 Measure Concentration of Markov Tree Processes arxiv:math/ v1 [math.pr] 21 Aug 2006 Leonid Kontorovich School of Computer Science Carnegie Mellon University Pittsburgh, PA USA May 23, 2008 Abstract We prove an apparently novel concentration of measure result for Markov tree processes. The bound we derive reduces to the known bounds for Markov processes when the tree is a chain, thus strictly generalizing the known Markov process concentration results. We employ several techniques of potential independent interest, especially for obtaining similar results for more general directed acyclic graphical models. 1 Introduction An emerging paradigm for proving concentration results for nonproduct measures is to quantify the dependence between the variables and state the bounds in terms of that dependence. A process (measure) particularly amenable to this approach is the Markov process. Using different techniques, Marton (coupling method [4], 1996), Samson (log-sobolev inequality [6], 2000) and Kontorovich and Ramanan (martingale differences [3], 2006) have obtained qualitatively similar concentration of measure results for Markov processes. One natural generalization of the Markov process is the hidden Markov process; we proved a concentration result for this class in [2]. A different way to generalize the Markov process is via the Markov tree process, which we address in the present paper. If (S n, d) is a metric space and (X i ) 1 i n, X i S is a random process, a measure concentration result (for the purposes of this paper) is an inequality stating that for any Lipschitz (with respect to d) function f : S n R, we have P f(x) Ef(X) > t} α(t), (1) where α(t) is rapidly decaying to 0 as t gets large. The quantity η ij, defined below, has proved useful for obtaining concentration results. For 1 i < j n, y S i 1 and w S, let L(X n j Xi 1 1 = y, X i = w) be the law of X n j conditioned on Xi 1 1 = y and X i = w. Define η ij (y, w, w ) = L(X n j X i 1 1 = y, X i = w) L(X n j X i 1 1 = y, X i = w ) TV (2) 1
2 and η ij = sup sup y S i 1 w,w S η ij (y, w, w ) where TV is the total variation norm (see 2.1 to clarify notation). Let Γ and be upper-triangular n n matrices, with Γ ii = ii = 1 and Γ ij = η ij, ij = η ij for 1 i < j n. For the case where S = [0, 1] and d is the Euclidean metric on R n, Samson [6] showed that if f : [0, 1] n R is convex and Lipschitz with f Lip 1, then P f(x) Ef(X) > t} 2 exp ( t2 2 Γ 2 2 where Γ 2 is the l 2 operator norm of the matrix Γ; Marton [5] has a comparable result. For the case where S is countable and d is the (normalized) Hamming metric on S n, d(x, y) = 1 n n ½ xi y i}, i=1 ) (3) Kontorovich and Ramanan [3] showed that if f : S n R is Lipschitz with f Lip 1, then ( ) P f(x) Ef(X) > t} 2 exp t2 2 2 (4) where is the l operator norm of the matrix, also given by = max 1 i<n (1 + η i,i η i,n ). (5) This leads to a strengthening of the Markov measure concentration result in Marton [4]. The sharpest currently known Markov measure concentration result (for the normalized Hamming metric) was obtained in [3], in terms of the contraction coefficients (θ i ) 1 i<n of the Markov process: η ij θ i θ i+1 θ j 1. (6) In this paper, we prove a bound on η ij in terms of the contraction coefficients of the Markov tree process (Theorem 2.1). This bound is cumbersome to state without preliminary definitions, but it reduces to (6) in the case where the Markov tree is a chain. 2 Bounding η ij for Markov tree processes 2.1 Notational preliminaries Random variables are capitalized (X), specified state sequences are written in lowercase (x), the shorthand X j i X i... X j is used for all sequences, and brackets denote sequence concatenation: [x j i xk j+1 ] = xk i. Another way to index collections of variables is by subset: if I = i 1, i 2,...,i m } then we write x I x i1, x i2,..., x im }. Thus, if u R SI, then u xi = u (xi1,x i2,...,x im ) R 2
3 for each x I S I. We will use to denote set cardinalities. Sums will range over the entire space of the summation variable; thus x j i f(x j i ), and f(x I ) is shorthand for f(x I ). x j i Sj i+1 x I x I S I f(x j i ) stands for The probability operator P } is defined with respect the measure space specified in context. We will write [n] for the set 1,..., n}. Anytime appears without a subscript, it will always denote the total variation norm TV. Regarding the latter, we recall that if τ is a signed, balanced measure on a countable set X (i.e., τ(x) = x X τ(x) = 0), then τ TV = 1 2 τ x X τ(x). (7) If G = (V, E) is a graph, we will frequently abuse notation and write u G instead of u V, blurring the distinction between a graph and its vertex set. This notation will carry over to set-theoretic operations (G = G 1 G 2 ) and indexing of variables (e.g., X G ). Unless we will need to refer explicitly to a σ-algebra, we will suppress it in the probability space notation, using less precise formulations, such as Let µ be a measure on S n. Furthermore, to avoid the technical but inessential complications associated with infinite sets, we will take S to be finite in this paper, noting only that the bounds carry over unchanged to the countable case (as done in [3] and [2]). To extend the results to the continuous case, some mild measure-theoretic assumptions are needed (see [5]). 2.2 Definition of Markov tree process Graph-theoretic preliminaries Consider a directed acyclic graph G = (V, E), and define a partial order G on G by the transitive closure of the relation u G v if (u, v) E. We define the parents and children of v V in the natural way: and parents(v) = u V : (u, v) E} children(v) = w V : (v, w) E}. If G is connected and each v V has at most one parent, G is called a (directed) tree. In a tree, whenever u G v there is a unique directed path from u to v. A tree T always has a unique minimal (w.r.t. T ) element r 0 V, called its root. Thus, for every v V there is a unique directed path r 0 T r 1 T... T r d = v; define the depth of v, dep T (v) = d, to be the length (i.e., number of edges) of this path. Note that dep T (r 0 ) = 0. We define the depth of the tree by dep(t) = sup v T dep T (v). For d = 0, 1,... define the d th level of the tree T by lev T (d) = v V : dep T (v) = d}; note that the levels induce a disjoint partition on V : V = dep(t) d=1 lev T (d). 3
4 We define the width of a tree as the greatest number of nodes in any level: wid(t) = sup lev T (d). (8) 1 d dep(t) We will consistently take V = n for finite V. An ordering J : V N of the nodes is said to be breadth-first if dep T (u) < dep T (v) = J(u) < J(v). (9) Since every directed tree T = (V, E) has some breadth-first ordering, 1 we shall henceforth blur the distinction between v V and J(v), simply taking V = [n] (or V = N) and assuming that dep T (u) < dep T (v) u < v holds. This will allow us to write S V simply as S n for any set S. Note that we have two orders on V : the partial order T, induced by the tree topology, and the total order <, given by the breadth-first enumeration. Observe that i T j implies i < j but not the other way around. If T = (V, E) is a tree and u V, we define the subtree induced by u, T u = (V u, E u ) by V u = v V : u T v}, E u = (v, w) E : v, w V u } Markov tree measure If S is a finite set, a Markov tree measure µ is defined on S n by a tree T = (V, E) and transition kernels p 0, p ij ( )} (i,j) E. Continuing our convention in 2.2.1, we have a breadthfirst order < and the total order T on V, and take V = 1,...,n}. Together, the topology of T and the transition kernels determine the measure µ on S n : µ(x) = p 0 (x 1 ) p ij (x j x i ). (10) (i,j) E A measure on S n satisfying (10) for some T and p ij } is said to be compatible with tree T; a measure is a Markov tree measure if it is compatible with some tree. Suppose S is a finite set and (X i ) i N, X i S is a random process defined on (S N, P). If for each n > 0 there is a tree T (n) = ([n], E (n) ) and a Markov tree measure µ n compatible with T (n) such that for all x S n we have P X n 1 = x} = µ n (x) then we call X a Markov tree process. The trees T (n) } are easily seen to be consistent in the sense that T (n) is an induced subgraph of T (n+1). So corresponding to any Markov tree process is the unique infinite tree T = (N, E). The uniqueness of T is easy to see, since for v > 1, the parent of v is the smallest u N such that P X v = x v X u 1 = x u 1 } = P X v = x v X u = x u }; thus P determines the topology of T. It is straightforward to verify that a Markov tree process X v } v T compatible with tree T has the following Markov property: if v and v are children of u in T, then P X Tv = x, X Tv = x X u = y } = P X Tv = x X u = y} P X Tv = x X u = y }. In other words, the subtrees induced by the children are conditionally independent given the parent; this follows directly from the definition of the Markov tree measure in (10). 1 One can easily construct a breadth-first ordering on a given tree by ordering the nodes arbitrarily within each level and listing the levels in ascending order: lev T(1),lev T(2),.... 4
5 2.3 Statement of result Theorem 2.1. Let S be a finite set and let (X i ) 1 i n, X i S be a Markov tree process, defined by a tree T = (V, E) and transition kernels p 0, p uv ( )} (u,v) E. Define the (u, v)- contraction coefficient θ uv by θ uv = sup p uv ( y) p uv ( y ) TV. (11) y,y S Suppose max (u,v) E θ uv θ < 1 for some θ and wid(t) L. Then for the Markov tree process X we have η ij ( 1 (1 θ) L) (j i)/l (12) for 1 i < j n. To cast (12) in more usable form, we first note that for L N and k N, if k L then k k (13) L 2L 1 (we omit the elementary number-theoretic proof). Using (13), we have η ij θ j i, for j i + L (14) where θ = (1 (1 θ) L ) 1/(2L 1). The bounds in (3) and (4) are for different metric spaces and therefore not readily comparable (the result in (3) has the additional convexity assumption). For the case where (14) holds, Samson s bound [6] yields Γ θ, (15) 1 2 and the approximation k=0 θ k = 1 1 θ (16) holds trivially via (5). 2 In the (degenerate) case where the Markov tree is a chain, we have L = 1 and therefore θ = θ; thus we recover the Markov chain concentration results in [3, 4, 6] and the approximations in (15,16) become precise inequalities. 2.4 Proof of main result The proof of Theorem 2.1 is combination of elementary graph theory and tensor algebra. We start with a graph-theoretic lemma: 2 The statement is approximate because (14) does not hold for all j > i but only starting with j i + L. The difference between ( 1 (1 θ) L) (j i)/l = 1 and θj i for i < j < i + L is at most 1 θ L 1 and affects only a fixed finite number (L 1) of entries in each row of Γ and. Since 2 and are continuous functionals, we are justified in claiming the approximate bound, which may be quantified if an application calls for it. The statements in (15) and (16) are only meant to convey an order of magnitude. 5
6 Lemma 2.2. Let T = ([n], E) be a tree and fix 1 i < j n. Suppose (X i ) n 1 is a Markov tree process whose law P on S n is compatible with T. Define the set T j i = T i j, j + 1,..., n}, consisting of those nodes in the subtree T i whose breadth-first numbering does not precede j. Then, for y S i 1 and w, w S, we have η ij (y, w, w ) = 0 T j i = η ij0 (y, w, w ) otherwise, (17) where j 0 is the minimum (with respect to <) element of T j i. Remark 2.3. This lemma tells us that when computing η ij it is sufficient to restrict our attention to the subtree induced by i. Proof. The case j T i implies j 0 = j and is trivial; thus we assume j / T i. In this case, the subtrees T i and T j are disjoint. Putting T i = T i \ i}, we have by the Markov property, P X Ti = x Ti, X Tj = x Tj X i 1 = [w y]} = P X Ti = x Ti X i = w } P X Tj = x Tj X i 1 1 = y }. Then from (2) and (7), and by marginalizing out the X Tj, we have η ij = 1 2 P X n j = x n j Xi 1 = [y w i] } P Xj n = xn j Xi 1 = [y w i ]} = 1 2 x n j P x T j i } X T j = x i T j X i = w P X i T j = x i T j X i = w }. i If T j i = then obviously η ij = 0; otherwise, η ij = η ij0, since j 0 is the first element of T j i. Next we develop some basic results for tensor norms; recall that unless specified otherwise, the norm used in this paper is the total variation norm defined in (7). If A is an M N columnstochastic matrix: (A ij 0 for 1 i M, 1 j N and M i=1 A ij = 1 for all 1 j N) and u R N is balanced in the sense that N j=1 u j = 0, we have, by the contraction lemma in [3], where Au A u, (18) A = max 1 j,j N A,j A,j, (19) and A,j denotes the j th column of A. An immediate consequence of (18) is that satisfies for column-stochastic matrices A R M N and B R N P. AB A B (20) Remark 2.4. Note that if A is a column-stochastic matrix then A 1, and if additionally u is balanced then Au is also balanced. If u R M and v R N, define their tensor product w = v u by w (i,j) = u i v j, 6
7 where the notation (v u) (i,j) is used to distinguish the 2-tensor w from an M N matrix. The tensor w is a vector in R MN indexed by pairs (i, j) [M] [N]; its norm is naturally defined to be w = 1 2 w(i,j). (21) (i,j) [M] [N] The following result will play a key role in deriving our bound (we suppress the boldfaced vector notation for readability): Lemma 2.5. Consider two finite sets X, Y, with probability measures p, p on X and q, q on Y. Then p q p q p p + q q p p q q. (22) Remark 2.6. Note that p q is a 2-tensor in R X Y and a probability measure on X Y. Proof. Fix q, q and define the function F(u, v) = x X u x v x + q q ( 2 x X u x v x ) x X,y Y over the convex polytope U R X R X, U = (u, v) : u x, v x 0, u x = } v x = 1 ; note that proving the claim is equivalent to showing that F 0 on U. For any σ 1, +1} X, let U σ = (u, v) U : sgn(u x v x ) = σ x }; note that U σ is a convex polytope and that U = σ 1,+1} U X σ. 3 Pick an arbitrary τ 1, +1} X Y and define ( ) F σ (u, v) = x σ x (u x v x ) + q q 2 x σ x (u x v x ) x,y u x q y v x q y τ xy (u x q y v x q y ) over U σ. Since σ x (u x v x ) = u x v x and τ xy can be chosen (for any given u, v, q, q ) so that τ xy (u x q y v x q y) = ux q y v x q y, the claim that F 0 on U will follow if we can show that F σ 0 on U σ. Observe that F σ is affine in its arguments (u, v) and recall that an affine function achieves its extreme values on the extreme points of a convex domain. Thus to verify that F σ 0 on U σ, we need only check the value of F σ on the extreme points of U σ. The extreme points of U σ are pairs (u, v) such that, for some x, x X, u = δ(x ) and v = δ(x ), where δ(x 0 ) R X is given by [δ(x 0 )] x = ½ x=x0}. Let (û, ˆv) be an extreme point of U σ. The case û = ˆv is trivial, so assume û ˆv. In this case, x X σ x(û x ˆv x ) = 2 and τ xy (û x q y ˆv x q y) ûx q y ˆv x q y x X,y Y 2. x X,y Y This shows that F σ 0 on U σ and completes the proof. 3 We define sgn(z) = ½ z 0} ½ z<0}. Note that the constraint x X ux = x X vx = 1 forces Uσ = (u, v) U : u x = v x} when σ x = +1 for all x X and U σ = when σ x = 1 for all x X. Both of these cases are trivial. 7
8 To develop a convenient tensor notation, we will fix the index set V = 1,..., n}. For I V, a tensor indexed by I is a vector u R SI. A special case of such an I-tensor is the product u = i I v(i), where v (i) R S and u xi = i I(v (i) ) xi. To gain more familiarity with the notation, let us write the total variation norm of an I-tensor: u = 1 2 x I S I u xi. In order to extend Lemma 2.5 to product tensors, we will need to define the function α k : R k R and state some of its properties: Lemma 2.7. Define α k : R k R recursively as α 1 (x) = x and Then α k+1 (x 1, x 2,..., x k+1 ) = x k+1 + (1 x k+1 )α k (x 1, x 2,..., x k ). (23) (a) α k is symmetric in its k arguments, so it is well-defined as a mapping from finite real sets to the reals α : x i : 1 i k} R (b) α k takes [0, 1] k to [0, 1] and is monotonically increasing in each argument on [0, 1] k (c) If B C [0, 1] then α(b) α(c) (d) α k (x, x,..., x) = 1 (1 x) k (e) if 1 B [0, 1] then α(b) = 1. Remark 2.8. In light of (a), we will use the notation α k (x 1, x 2,..., x k ) and α(x i : 1 i k}) interchangeably, as dictated by convenience. Proof. Claims (a), (b), (e) are straightforward to verify from the recursive definition of α and induction. Claim (c) follows from (b) since α k+1 (x 1, x 2,..., x k, 0) = α k (x 1, x 2,..., x k ) and (d) is easily derived from the binomial expansion of (1 x) k. The function α k is the natural generalization of α 2 (x 1, x 2 ) = x 1 + x 2 x 1 x 2 to k variables, and it is what we need for the analogue of Lemma 2.5 for a product of k tensors: Corollary 2.9. Let u (i) } i I and v (i) } i I be two sets of tensors and assume that each of u (i),v (i) is a probability measure on S. Then we have u (i) u v (i) α (i) v (i) } : i I. (24) i I i I Proof. Pick an i 0 I and let p = u (i0), q = v (i0), p = u (i), q = i 0 i I i 0 i I v (i). Apply Lemma 2.5 to p q p q and proceed by induction. 8
9 Our final generalization concerns linear operators over I-tensors. An I, J-matrix A has dimensions S J S I and takes an I-tensor u to a J-tensor v: for each x J S J, we have v xj = x I S I A xj,x I u xi, (25) which we write as Au = v. If A is an I, J-matrix and B is a J, K-matrix, the matrix product BA is defined analogously to (25). As a special case, an I, J-matrix might factorize as a tensor product of S S matrices A (i,j) R S S. We will write such a factorization in terms of a bipartite graph G = (I + J, E), where E I J and the factors A (i,j) are indexed by (i, j) E: A = A (i,j), (26) (i,j) E where A xj,x I = (i,j) E A (i,j) x j,x i for all x I S I and x J S J. The norm of an I, J-matrix is a natural generalization of the matrix norm defined in (19): A = where A,xI is the J-tensor given by max A,xI A,x x I,x I (27) I SI u xj = A xj,x I ; (27) is well-defined via the tensor norm in (21). Since I, J matrices act on I-tensors by ordinary matrix multiplication, Au A u continues to hold when A is a column-stochastic I, J- matrix and u is a balanced I-tensor; if, additionally, B is a column-stochastic J, K-matrix, BA B A also holds. Likewise, since another way of writing (26) is A,xI = (i,j) E A (i,j),x i, Corollary 2.9 extends to tensor products of matrices: Lemma Fix index sets I, J and a bipartite graph (I + J, E). Let A (i,j)} (i,j) E be a collection of column-stochastic S S matrices, whose tensor product is the I, J matrix A = A (i,j). (i,j) E Then A (i,j) A α } : (i, j) E. We are now in a position to state the main technical lemma, from which Theorem 2.1 will follow straightforwardly: Lemma Let S be a finite set and let (X i ) 1 i n, X i S be a Markov tree process, defined by a tree T = (V, E) and transition kernels p 0, p uv ( )} (u,v) E. Let the (u, v)-contraction coefficient θ uv be as defined in (11). 9
10 Fix 1 i < j n and let j 0 = j 0 (i, j) be as defined in Lemma 2.2 (we are assuming its existence, for otherwise η ij = 0). Then we have η ij dep T (j 0) d=dep T (i)+1 α θ uv : v lev T (d)}. (28) Proof. For y S i 1 and w, w S, we have η ij (y, w, w ) = 1 2 P X n j = x n j X1 i = [y w] } P Xj n = x n j X1 i = [y w ] } (29) = 1 2 x n j ( x n j z j 1 i+1 } P Xi+1 n = [zj 1 i+1 xn j ] Xi 1 = [y w] ) P Xi+1 n = [zj 1 i+1 xn j ] Xi 1 ]} = [y w. (30) Let T i be the subtree induced by i and Z = T i i + 1,..., j 0 1} and C = v T i : (u, v) E, u < j 0, v j 0 }. (31) Then by Lemma 2.2 and the Markov property, we get η ij (y, w, w ) = 1 ( ) 2 P X C Z = x C Z X i = w} P X C Z = x C Z X i = w } x C x Z (the sum indexed by j 0,...,n} \ C marginalizes out). Define D = d k : k = 0,..., D } with d 0 = dep T (i), d D = dep T (j 0 ) and d k+1 = d k + 1 for 0 k < D. For d D, let I d = T i lev T (d) and G d = (I d 1 + I d, E d ) be the bipartite graph consisting of the nodes in I d 1 and I d, and the edges in E joining them (note that I d0 = i}). For (u, v) E, let A (u,v) be the S S matrix given by A (u,v) x,x = p uv(x x ) and note that A (u,v) = θuv. Then by the Markov property, for each x Id S I d and x Id 1 S I d 1, d D \ d 0 }, we have (32) P X Id = x Id X Id 1 = x Id 1 } = A (d) x Id,x Id 1, where A (d) = (u,v) E d A (u,v). Likewise, for d D \ d 0 }, P X Id = x Id X i = w} = x I1 x I2 x Id 1 P X I1 = x I1 X i = w} P X I2 = x I2 X I1 = x I1 } P X Id = x Id X Id 1 = x Id 1 } = (A (d) A (d 1) A (d1) ) xid,w. (33) 10
11 Define the (balanced) I d1 -tensor the I d D -tensor h = A (d1),w A(d1),w, (34) f = A (d D ) A (d D 1) A (d2) h, (35) and C 0, C 1, Z 0 1,..., n}: C 0 = C I dept (j 0), C 1 = C \ C 0, Z 0 = I dept (j 0) \ C 0, (36) where C and Z are defined in (31). For readability we will write p(x U ) instead of P X U = x U } below; no ambiguity should arise. Combining (32) and (33), we have η ij (y, w, w ) = 1 2 (p(x C Z X i = w) p(x C Z X i = w )) (37) x C x Z = 1 2 p(x C1 x Z0 )f C0 Z 0 (38) x Z0 x C0 x C1 = Bf (39) where B is the S C0 C1 S C0 Z0 column-stochastic matrix given by B (xc0 x C1 ),(x C 0 x Z0 ) = ½ x C0 =x C 0 } p(x C1 x Z0 ) with the convention that p(x C1 x Z0 ) = 1 if either of Z 0,C 1 is empty. The claim now follows by reading off the results previously obtained: Bf B f Eq. (7) f Remark 2.4 h D k=2 A (d k) Eqs. (20,35) D k=1 α A (u,v) : (u, v) Edk } Lemma Proof of Theorem 2.1. We will borrow the definitions from the proof of Lemma To upperbound η ij we first bound α A (u,v) : (u, v) Edk }. Since E dk wid(t) L (because every node in I dk has exactly one parent in I dk 1 ) and A (u,v) = θuv θ < 1, we appeal to Lemma 2.7 to obtain α A (u,v) : (u, v) E dk } 1 (1 θ) L. (40) Now we must lower-bound the quantity h = dep T (j 0 ) dep T (i). Since every level can have up to L nodes, we have j 0 i hl and so h (j 0 i)/l (j i)/l. 11
12 The calculations in Lemma 2.11 yield considerably more information than the simple bound in (12). For example, suppose the tree T has levels I d : d = 0, 1,...} with the property that the levels are growing at most linearly: I d cd for some c > 0. Let d i = dep T (i), d j = dep T (j 0 ), and h = d j d i. Then so which yields the bound j i j 0 i c d j d i+1 k = c 2 (d j(d j + 1) d i (d i + 1)) < c 2 ((d j + 1) 2 d 2 i) < c 2 (d i + h + 1) 2 h > 2(j i)/c d i 1, η ij h (1 (1 θ k ) ck ) (41) where θ k maxθ uv : (u, v) E k }. When θ k is small (ckθ k θ < 1), this becomes η ij < k=1 h (ckθ k ) (42) k=1 2(j i)/c di 1 k=1 (ckθ k ) (43) θ 2(j i)/c di 1. (44) This is a non-trivial bound for trees with linearly growing levels: recall that to bound (4,5), we must bound the series η ij. j=i+1 By the limit comparison test with the series j=1 1/j2, we have that θ 2(j i)/c di 1 j=i+1 converges for θ < 1. Similar techniques may be applied when the level growth is bounded by other slowly increasing functions. 3 Discussion We have presented a concentration of measure bound for Markov tree processes; to our knowledge, this is the first such result. 4 In the simple case of the contracting, bounded-width Markov 4 In a 2003 paper, Dembo et al. [1] presented large deviation bounds for typed Markov trees, which is a more general class of processes than the Markov tree processes defined here. The techniques used and bounds obtained in [1] are of a rather different flavor than here; this is not surprising since measure concentration and large deviations, while pursuing similar goals, tend to use different methods and state results that are often not immediately comparable. 12
13 tree processes (i.e., those for which wid(t) L < and sup u,v θ uv θ < 1), the bound takes on a particularly tractable form (12), and in the degenerate case L = 1 it reduces to the sharpest known bound for Markov chains. The techniques we develop extend well beyond the somewhat restrictive contracting-bounded-width case, as demonstrated in the calculation in (44). The technical results in 2.4, particularly Lemma 2.5 and its generalizations, might be of independent interest. It is hoped that these techniques will be extended to obtain concentration bounds for larger classes of directed acyclic graphical models. Acknowledgements I thank Kavita Ramanan for useful discussions. References [1] Amir Dembo, Peter Morters, Scott Sheffield, A large-deviation theorem for tree-indexed Markov chains [2] Leonid Kontorovich, Measure Concentration of Hidden Markov Processes [3] Leonid Kontorovich and Kavita Ramanan, A concentration inequality for weakly contracting Markov chains. Paper in preparation, [4] Katalin Marton, Bounding d-distance by informational divergence: a method to prove measure concentration. Ann. Probab., Vol. 24, No. 2, , [5] Katalin Marton, A measure concentration inequality for contracting Markov chains. Geom. Funct. Anal., Vol. 6, , [6] Paul-Marie Samson, Concentration of measure inequalities for Markov chains and Φ-mixing processes. Ann. Probab., Vol. 28, No. 1, ,
arxiv:math/ v5 [math.pr] 1 Oct 2006
Measure Concentration of Markov Tree Processes arxiv:math/0608511v5 [math.pr] 1 Oct 2006 Leonid Kontorovich School of Computer Science Carnegie Mellon University Pittsburgh, PA 15213 USA lkontor@cs.cmu.edu
More informationConcentration Inequalities for Dependent Random Variables via the Martingale Method
Concentration Inequalities for Dependent Random Variables via the Martingale Method Leonid Kontorovich, and Kavita Ramanan Carnegie Mellon University L. Kontorovich School of Computer Science Carnegie
More informationPartial cubes: structures, characterizations, and constructions
Partial cubes: structures, characterizations, and constructions Sergei Ovchinnikov San Francisco State University, Mathematics Department, 1600 Holloway Ave., San Francisco, CA 94132 Abstract Partial cubes
More informationBasic Properties of Metric and Normed Spaces
Basic Properties of Metric and Normed Spaces Computational and Metric Geometry Instructor: Yury Makarychev The second part of this course is about metric geometry. We will study metric spaces, low distortion
More informationStanford Mathematics Department Math 205A Lecture Supplement #4 Borel Regular & Radon Measures
2 1 Borel Regular Measures We now state and prove an important regularity property of Borel regular outer measures: Stanford Mathematics Department Math 205A Lecture Supplement #4 Borel Regular & Radon
More informationAutomorphism groups of wreath product digraphs
Automorphism groups of wreath product digraphs Edward Dobson Department of Mathematics and Statistics Mississippi State University PO Drawer MA Mississippi State, MS 39762 USA dobson@math.msstate.edu Joy
More informationA matrix over a field F is a rectangular array of elements from F. The symbol
Chapter MATRICES Matrix arithmetic A matrix over a field F is a rectangular array of elements from F The symbol M m n (F ) denotes the collection of all m n matrices over F Matrices will usually be denoted
More informationTree sets. Reinhard Diestel
1 Tree sets Reinhard Diestel Abstract We study an abstract notion of tree structure which generalizes treedecompositions of graphs and matroids. Unlike tree-decompositions, which are too closely linked
More informationAn Algebraic View of the Relation between Largest Common Subtrees and Smallest Common Supertrees
An Algebraic View of the Relation between Largest Common Subtrees and Smallest Common Supertrees Francesc Rosselló 1, Gabriel Valiente 2 1 Department of Mathematics and Computer Science, Research Institute
More informationUndirected Graphical Models
Undirected Graphical Models 1 Conditional Independence Graphs Let G = (V, E) be an undirected graph with vertex set V and edge set E, and let A, B, and C be subsets of vertices. We say that C separates
More information5 Flows and cuts in digraphs
5 Flows and cuts in digraphs Recall that a digraph or network is a pair G = (V, E) where V is a set and E is a multiset of ordered pairs of elements of V, which we refer to as arcs. Note that two vertices
More informationCHAPTER 7. Connectedness
CHAPTER 7 Connectedness 7.1. Connected topological spaces Definition 7.1. A topological space (X, T X ) is said to be connected if there is no continuous surjection f : X {0, 1} where the two point set
More informationHilbert spaces. 1. Cauchy-Schwarz-Bunyakowsky inequality
(October 29, 2016) Hilbert spaces Paul Garrett garrett@math.umn.edu http://www.math.umn.edu/ garrett/ [This document is http://www.math.umn.edu/ garrett/m/fun/notes 2016-17/03 hsp.pdf] Hilbert spaces are
More informationON COST MATRICES WITH TWO AND THREE DISTINCT VALUES OF HAMILTONIAN PATHS AND CYCLES
ON COST MATRICES WITH TWO AND THREE DISTINCT VALUES OF HAMILTONIAN PATHS AND CYCLES SANTOSH N. KABADI AND ABRAHAM P. PUNNEN Abstract. Polynomially testable characterization of cost matrices associated
More informationExtreme Point Solutions for Infinite Network Flow Problems
Extreme Point Solutions for Infinite Network Flow Problems H. Edwin Romeijn Dushyant Sharma Robert L. Smith January 3, 004 Abstract We study capacitated network flow problems with supplies and demands
More informationGeneral Notation. Exercises and Problems
Exercises and Problems The text contains both Exercises and Problems. The exercises are incorporated into the development of the theory in each section. Additional Problems appear at the end of most sections.
More informationAnalysis III. Exam 1
Analysis III Math 414 Spring 27 Professor Ben Richert Exam 1 Solutions Problem 1 Let X be the set of all continuous real valued functions on [, 1], and let ρ : X X R be the function ρ(f, g) = sup f g (1)
More informationMetric Spaces and Topology
Chapter 2 Metric Spaces and Topology From an engineering perspective, the most important way to construct a topology on a set is to define the topology in terms of a metric on the set. This approach underlies
More informationThe Lebesgue Integral
The Lebesgue Integral Brent Nelson In these notes we give an introduction to the Lebesgue integral, assuming only a knowledge of metric spaces and the iemann integral. For more details see [1, Chapters
More informationUNIVERSAL DERIVED EQUIVALENCES OF POSETS
UNIVERSAL DERIVED EQUIVALENCES OF POSETS SEFI LADKANI Abstract. By using only combinatorial data on two posets X and Y, we construct a set of so-called formulas. A formula produces simultaneously, for
More informationIntroduction and Preliminaries
Chapter 1 Introduction and Preliminaries This chapter serves two purposes. The first purpose is to prepare the readers for the more systematic development in later chapters of methods of real analysis
More informationChapter 1. Measure Spaces. 1.1 Algebras and σ algebras of sets Notation and preliminaries
Chapter 1 Measure Spaces 1.1 Algebras and σ algebras of sets 1.1.1 Notation and preliminaries We shall denote by X a nonempty set, by P(X) the set of all parts (i.e., subsets) of X, and by the empty set.
More informationLebesgue Measure on R n
CHAPTER 2 Lebesgue Measure on R n Our goal is to construct a notion of the volume, or Lebesgue measure, of rather general subsets of R n that reduces to the usual volume of elementary geometrical sets
More informationCourse 212: Academic Year Section 1: Metric Spaces
Course 212: Academic Year 1991-2 Section 1: Metric Spaces D. R. Wilkins Contents 1 Metric Spaces 3 1.1 Distance Functions and Metric Spaces............. 3 1.2 Convergence and Continuity in Metric Spaces.........
More informationMathematics Course 111: Algebra I Part I: Algebraic Structures, Sets and Permutations
Mathematics Course 111: Algebra I Part I: Algebraic Structures, Sets and Permutations D. R. Wilkins Academic Year 1996-7 1 Number Systems and Matrix Algebra Integers The whole numbers 0, ±1, ±2, ±3, ±4,...
More informationThe fundamental group of a locally finite graph with ends
1 The fundamental group of a locally finite graph with ends Reinhard Diestel and Philipp Sprüssel Abstract We characterize the fundamental group of a locally finite graph G with ends combinatorially, as
More informationAN ELEMENTARY PROOF OF THE SPECTRAL RADIUS FORMULA FOR MATRICES
AN ELEMENTARY PROOF OF THE SPECTRAL RADIUS FORMULA FOR MATRICES JOEL A. TROPP Abstract. We present an elementary proof that the spectral radius of a matrix A may be obtained using the formula ρ(a) lim
More informationDynkin (λ-) and π-systems; monotone classes of sets, and of functions with some examples of application (mainly of a probabilistic flavor)
Dynkin (λ-) and π-systems; monotone classes of sets, and of functions with some examples of application (mainly of a probabilistic flavor) Matija Vidmar February 7, 2018 1 Dynkin and π-systems Some basic
More informationExtreme points of compact convex sets
Extreme points of compact convex sets In this chapter, we are going to show that compact convex sets are determined by a proper subset, the set of its extreme points. Let us start with the main definition.
More informationVectors. January 13, 2013
Vectors January 13, 2013 The simplest tensors are scalars, which are the measurable quantities of a theory, left invariant by symmetry transformations. By far the most common non-scalars are the vectors,
More informationSeveral variables. x 1 x 2. x n
Several variables Often we have not only one, but several variables in a problem The issues that come up are somewhat more complex than for one variable Let us first start with vector spaces and linear
More informationImmerse Metric Space Homework
Immerse Metric Space Homework (Exercises -2). In R n, define d(x, y) = x y +... + x n y n. Show that d is a metric that induces the usual topology. Sketch the basis elements when n = 2. Solution: Steps
More informationB. Appendix B. Topological vector spaces
B.1 B. Appendix B. Topological vector spaces B.1. Fréchet spaces. In this appendix we go through the definition of Fréchet spaces and their inductive limits, such as they are used for definitions of function
More informationSupplemental for Spectral Algorithm For Latent Tree Graphical Models
Supplemental for Spectral Algorithm For Latent Tree Graphical Models Ankur P. Parikh, Le Song, Eric P. Xing The supplemental contains 3 main things. 1. The first is network plots of the latent variable
More informationWe simply compute: for v = x i e i, bilinearity of B implies that Q B (v) = B(v, v) is given by xi x j B(e i, e j ) =
Math 395. Quadratic spaces over R 1. Algebraic preliminaries Let V be a vector space over a field F. Recall that a quadratic form on V is a map Q : V F such that Q(cv) = c 2 Q(v) for all v V and c F, and
More informationEC 521 MATHEMATICAL METHODS FOR ECONOMICS. Lecture 1: Preliminaries
EC 521 MATHEMATICAL METHODS FOR ECONOMICS Lecture 1: Preliminaries Murat YILMAZ Boğaziçi University In this lecture we provide some basic facts from both Linear Algebra and Real Analysis, which are going
More informationAn introduction to some aspects of functional analysis
An introduction to some aspects of functional analysis Stephen Semmes Rice University Abstract These informal notes deal with some very basic objects in functional analysis, including norms and seminorms
More informationELEMENTARY LINEAR ALGEBRA
ELEMENTARY LINEAR ALGEBRA K R MATTHEWS DEPARTMENT OF MATHEMATICS UNIVERSITY OF QUEENSLAND First Printing, 99 Chapter LINEAR EQUATIONS Introduction to linear equations A linear equation in n unknowns x,
More informationContents: 1. Minimization. 2. The theorem of Lions-Stampacchia for variational inequalities. 3. Γ -Convergence. 4. Duality mapping.
Minimization Contents: 1. Minimization. 2. The theorem of Lions-Stampacchia for variational inequalities. 3. Γ -Convergence. 4. Duality mapping. 1 Minimization A Topological Result. Let S be a topological
More informationMeasure Theory on Topological Spaces. Course: Prof. Tony Dorlas 2010 Typset: Cathal Ormond
Measure Theory on Topological Spaces Course: Prof. Tony Dorlas 2010 Typset: Cathal Ormond May 22, 2011 Contents 1 Introduction 2 1.1 The Riemann Integral........................................ 2 1.2 Measurable..............................................
More informationGraph coloring, perfect graphs
Lecture 5 (05.04.2013) Graph coloring, perfect graphs Scribe: Tomasz Kociumaka Lecturer: Marcin Pilipczuk 1 Introduction to graph coloring Definition 1. Let G be a simple undirected graph and k a positive
More informationCopyright 2013 Springer Science+Business Media New York
Meeks, K., and Scott, A. (2014) Spanning trees and the complexity of floodfilling games. Theory of Computing Systems, 54 (4). pp. 731-753. ISSN 1432-4350 Copyright 2013 Springer Science+Business Media
More informationSZEMERÉDI S REGULARITY LEMMA FOR MATRICES AND SPARSE GRAPHS
SZEMERÉDI S REGULARITY LEMMA FOR MATRICES AND SPARSE GRAPHS ALEXANDER SCOTT Abstract. Szemerédi s Regularity Lemma is an important tool for analyzing the structure of dense graphs. There are versions of
More informationLecture 1 and 2: Random Spanning Trees
Recent Advances in Approximation Algorithms Spring 2015 Lecture 1 and 2: Random Spanning Trees Lecturer: Shayan Oveis Gharan March 31st Disclaimer: These notes have not been subjected to the usual scrutiny
More informationNotes on the Matrix-Tree theorem and Cayley s tree enumerator
Notes on the Matrix-Tree theorem and Cayley s tree enumerator 1 Cayley s tree enumerator Recall that the degree of a vertex in a tree (or in any graph) is the number of edges emanating from it We will
More informationLinear Algebra. The analysis of many models in the social sciences reduces to the study of systems of equations.
POLI 7 - Mathematical and Statistical Foundations Prof S Saiegh Fall Lecture Notes - Class 4 October 4, Linear Algebra The analysis of many models in the social sciences reduces to the study of systems
More informationa 11 x 1 + a 12 x a 1n x n = b 1 a 21 x 1 + a 22 x a 2n x n = b 2.
Chapter 1 LINEAR EQUATIONS 11 Introduction to linear equations A linear equation in n unknowns x 1, x,, x n is an equation of the form a 1 x 1 + a x + + a n x n = b, where a 1, a,, a n, b are given real
More informationKernels of Directed Graph Laplacians. J. S. Caughman and J.J.P. Veerman
Kernels of Directed Graph Laplacians J. S. Caughman and J.J.P. Veerman Department of Mathematics and Statistics Portland State University PO Box 751, Portland, OR 97207. caughman@pdx.edu, veerman@pdx.edu
More informationWhere is matrix multiplication locally open?
Linear Algebra and its Applications 517 (2017) 167 176 Contents lists available at ScienceDirect Linear Algebra and its Applications www.elsevier.com/locate/laa Where is matrix multiplication locally open?
More informationUnit 2, Section 3: Linear Combinations, Spanning, and Linear Independence Linear Combinations, Spanning, and Linear Independence
Linear Combinations Spanning and Linear Independence We have seen that there are two operations defined on a given vector space V :. vector addition of two vectors and. scalar multiplication of a vector
More informationOn the Properties of Positive Spanning Sets and Positive Bases
Noname manuscript No. (will be inserted by the editor) On the Properties of Positive Spanning Sets and Positive Bases Rommel G. Regis Received: May 30, 2015 / Accepted: date Abstract The concepts of positive
More informationx log x, which is strictly convex, and use Jensen s Inequality:
2. Information measures: mutual information 2.1 Divergence: main inequality Theorem 2.1 (Information Inequality). D(P Q) 0 ; D(P Q) = 0 iff P = Q Proof. Let ϕ(x) x log x, which is strictly convex, and
More informationFinite-Dimensional Cones 1
John Nachbar Washington University March 28, 2018 1 Basic Definitions. Finite-Dimensional Cones 1 Definition 1. A set A R N is a cone iff it is not empty and for any a A and any γ 0, γa A. Definition 2.
More informationSolution. 1 Solution of Homework 7. Sangchul Lee. March 22, Problem 1.1
Solution Sangchul Lee March, 018 1 Solution of Homework 7 Problem 1.1 For a given k N, Consider two sequences (a n ) and (b n,k ) in R. Suppose that a n b n,k for all n,k N Show that limsup a n B k :=
More informationIntegral Jensen inequality
Integral Jensen inequality Let us consider a convex set R d, and a convex function f : (, + ]. For any x,..., x n and λ,..., λ n with n λ i =, we have () f( n λ ix i ) n λ if(x i ). For a R d, let δ a
More informationABELIAN SELF-COMMUTATORS IN FINITE FACTORS
ABELIAN SELF-COMMUTATORS IN FINITE FACTORS GABRIEL NAGY Abstract. An abelian self-commutator in a C*-algebra A is an element of the form A = X X XX, with X A, such that X X and XX commute. It is shown
More informationSolution. 1 Solutions of Homework 1. 2 Homework 2. Sangchul Lee. February 19, Problem 1.2
Solution Sangchul Lee February 19, 2018 1 Solutions of Homework 1 Problem 1.2 Let A and B be nonempty subsets of R + :: {x R : x > 0} which are bounded above. Let us define C = {xy : x A and y B} Show
More information2. The Concept of Convergence: Ultrafilters and Nets
2. The Concept of Convergence: Ultrafilters and Nets NOTE: AS OF 2008, SOME OF THIS STUFF IS A BIT OUT- DATED AND HAS A FEW TYPOS. I WILL REVISE THIS MATE- RIAL SOMETIME. In this lecture we discuss two
More informationMath 341: Convex Geometry. Xi Chen
Math 341: Convex Geometry Xi Chen 479 Central Academic Building, University of Alberta, Edmonton, Alberta T6G 2G1, CANADA E-mail address: xichen@math.ualberta.ca CHAPTER 1 Basics 1. Euclidean Geometry
More informationLinear Algebra Lecture Notes-I
Linear Algebra Lecture Notes-I Vikas Bist Department of Mathematics Panjab University, Chandigarh-6004 email: bistvikas@gmail.com Last revised on February 9, 208 This text is based on the lectures delivered
More informationChapter 4. Measure Theory. 1. Measure Spaces
Chapter 4. Measure Theory 1. Measure Spaces Let X be a nonempty set. A collection S of subsets of X is said to be an algebra on X if S has the following properties: 1. X S; 2. if A S, then A c S; 3. if
More information7. Baker-Campbell-Hausdorff formula
7. Baker-Campbell-Hausdorff formula 7.1. Formulation. Let G GL(n,R) be a matrix Lie group and let g = Lie(G). The exponential map is an analytic diffeomorphim of a neighborhood of 0 in g with a neighborhood
More informationSection Summary. Relations and Functions Properties of Relations. Combining Relations
Chapter 9 Chapter Summary Relations and Their Properties n-ary Relations and Their Applications (not currently included in overheads) Representing Relations Closures of Relations (not currently included
More informationGärtner-Ellis Theorem and applications.
Gärtner-Ellis Theorem and applications. Elena Kosygina July 25, 208 In this lecture we turn to the non-i.i.d. case and discuss Gärtner-Ellis theorem. As an application, we study Curie-Weiss model with
More informationMATH 205C: STATIONARY PHASE LEMMA
MATH 205C: STATIONARY PHASE LEMMA For ω, consider an integral of the form I(ω) = e iωf(x) u(x) dx, where u Cc (R n ) complex valued, with support in a compact set K, and f C (R n ) real valued. Thus, I(ω)
More informationi c Robert C. Gunning
c Robert C. Gunning i ii MATHEMATICS 218: NOTES Robert C. Gunning January 27, 2010 ii Introduction These are notes of honors courses on calculus of several variables given at Princeton University during
More informationLinear Algebra March 16, 2019
Linear Algebra March 16, 2019 2 Contents 0.1 Notation................................ 4 1 Systems of linear equations, and matrices 5 1.1 Systems of linear equations..................... 5 1.2 Augmented
More informationACO Comprehensive Exam October 14 and 15, 2013
1. Computability, Complexity and Algorithms (a) Let G be the complete graph on n vertices, and let c : V (G) V (G) [0, ) be a symmetric cost function. Consider the following closest point heuristic for
More informationLecture Notes 1 Basic Concepts of Mathematics MATH 352
Lecture Notes 1 Basic Concepts of Mathematics MATH 352 Ivan Avramidi New Mexico Institute of Mining and Technology Socorro, NM 87801 June 3, 2004 Author: Ivan Avramidi; File: absmath.tex; Date: June 11,
More informationMATH 326: RINGS AND MODULES STEFAN GILLE
MATH 326: RINGS AND MODULES STEFAN GILLE 1 2 STEFAN GILLE 1. Rings We recall first the definition of a group. 1.1. Definition. Let G be a non empty set. The set G is called a group if there is a map called
More informationEmbeddings of finite metric spaces in Euclidean space: a probabilistic view
Embeddings of finite metric spaces in Euclidean space: a probabilistic view Yuval Peres May 11, 2006 Talk based on work joint with: Assaf Naor, Oded Schramm and Scott Sheffield Definition: An invertible
More informationChapter 1 Preliminaries
Chapter 1 Preliminaries 1.1 Conventions and Notations Throughout the book we use the following notations for standard sets of numbers: N the set {1, 2,...} of natural numbers Z the set of integers Q the
More informationOn the decay of elements of inverse triangular Toeplitz matrix
On the decay of elements of inverse triangular Toeplitz matrix Neville Ford, D. V. Savostyanov, N. L. Zamarashkin August 03, 203 arxiv:308.0724v [math.na] 3 Aug 203 Abstract We consider half-infinite triangular
More informationk-distinct In- and Out-Branchings in Digraphs
k-distinct In- and Out-Branchings in Digraphs Gregory Gutin 1, Felix Reidl 2, and Magnus Wahlström 1 arxiv:1612.03607v2 [cs.ds] 21 Apr 2017 1 Royal Holloway, University of London, UK 2 North Carolina State
More informationLecture 5. If we interpret the index n 0 as time, then a Markov chain simply requires that the future depends only on the present and not on the past.
1 Markov chain: definition Lecture 5 Definition 1.1 Markov chain] A sequence of random variables (X n ) n 0 taking values in a measurable state space (S, S) is called a (discrete time) Markov chain, if
More informationIntroduction to Real Analysis Alternative Chapter 1
Christopher Heil Introduction to Real Analysis Alternative Chapter 1 A Primer on Norms and Banach Spaces Last Updated: March 10, 2018 c 2018 by Christopher Heil Chapter 1 A Primer on Norms and Banach Spaces
More informationDistance-Divergence Inequalities
Distance-Divergence Inequalities Katalin Marton Alfréd Rényi Institute of Mathematics of the Hungarian Academy of Sciences Motivation To find a simple proof of the Blowing-up Lemma, proved by Ahlswede,
More informationFunctional Analysis. Franck Sueur Metric spaces Definitions Completeness Compactness Separability...
Functional Analysis Franck Sueur 2018-2019 Contents 1 Metric spaces 1 1.1 Definitions........................................ 1 1.2 Completeness...................................... 3 1.3 Compactness......................................
More informationReal Analysis Math 131AH Rudin, Chapter #1. Dominique Abdi
Real Analysis Math 3AH Rudin, Chapter # Dominique Abdi.. If r is rational (r 0) and x is irrational, prove that r + x and rx are irrational. Solution. Assume the contrary, that r+x and rx are rational.
More information1 Topology Definition of a topology Basis (Base) of a topology The subspace topology & the product topology on X Y 3
Index Page 1 Topology 2 1.1 Definition of a topology 2 1.2 Basis (Base) of a topology 2 1.3 The subspace topology & the product topology on X Y 3 1.4 Basic topology concepts: limit points, closed sets,
More informationZaslavsky s Theorem. As presented by Eric Samansky May 11, 2002
Zaslavsky s Theorem As presented by Eric Samansky May, 2002 Abstract This paper is a retelling of the proof of Zaslavsky s Theorem. For any arrangement of hyperplanes, there is a corresponding semi-lattice
More informationIntroduction to Bases in Banach Spaces
Introduction to Bases in Banach Spaces Matt Daws June 5, 2005 Abstract We introduce the notion of Schauder bases in Banach spaces, aiming to be able to give a statement of, and make sense of, the Gowers
More informationInfinite-Dimensional Triangularization
Infinite-Dimensional Triangularization Zachary Mesyan March 11, 2018 Abstract The goal of this paper is to generalize the theory of triangularizing matrices to linear transformations of an arbitrary vector
More informationMetric Spaces Lecture 17
Metric Spaces Lecture 17 Homeomorphisms At the end of last lecture an example was given of a bijective continuous function f such that f 1 is not continuous. For another example, consider the sets T =
More informationMath Camp Lecture 4: Linear Algebra. Xiao Yu Wang. Aug 2010 MIT. Xiao Yu Wang (MIT) Math Camp /10 1 / 88
Math Camp 2010 Lecture 4: Linear Algebra Xiao Yu Wang MIT Aug 2010 Xiao Yu Wang (MIT) Math Camp 2010 08/10 1 / 88 Linear Algebra Game Plan Vector Spaces Linear Transformations and Matrices Determinant
More informationA Lower Bound for the Size of Syntactically Multilinear Arithmetic Circuits
A Lower Bound for the Size of Syntactically Multilinear Arithmetic Circuits Ran Raz Amir Shpilka Amir Yehudayoff Abstract We construct an explicit polynomial f(x 1,..., x n ), with coefficients in {0,
More informationELEMENTARY LINEAR ALGEBRA
ELEMENTARY LINEAR ALGEBRA K. R. MATTHEWS DEPARTMENT OF MATHEMATICS UNIVERSITY OF QUEENSLAND Corrected Version, 7th April 013 Comments to the author at keithmatt@gmail.com Chapter 1 LINEAR EQUATIONS 1.1
More informationLinear Algebra Lecture Notes
Linear Algebra Lecture Notes Lecturers: Inna Capdeboscq and Damiano Testa Warwick, January 2017 Contents 1 Number Systems and Fields 3 1.1 Axioms for number systems............................ 3 2 Vector
More informationAnalysis and Linear Algebra. Lectures 1-3 on the mathematical tools that will be used in C103
Analysis and Linear Algebra Lectures 1-3 on the mathematical tools that will be used in C103 Set Notation A, B sets AcB union A1B intersection A\B the set of objects in A that are not in B N. Empty set
More informationMAT 570 REAL ANALYSIS LECTURE NOTES. Contents. 1. Sets Functions Countability Axiom of choice Equivalence relations 9
MAT 570 REAL ANALYSIS LECTURE NOTES PROFESSOR: JOHN QUIGG SEMESTER: FALL 204 Contents. Sets 2 2. Functions 5 3. Countability 7 4. Axiom of choice 8 5. Equivalence relations 9 6. Real numbers 9 7. Extended
More informationarxiv: v1 [math.pr] 14 Aug 2017
Uniqueness of Gibbs Measures for Continuous Hardcore Models arxiv:1708.04263v1 [math.pr] 14 Aug 2017 David Gamarnik Kavita Ramanan Abstract We formulate a continuous version of the well known discrete
More informationEfficient Approximation for Restricted Biclique Cover Problems
algorithms Article Efficient Approximation for Restricted Biclique Cover Problems Alessandro Epasto 1, *, and Eli Upfal 2 ID 1 Google Research, New York, NY 10011, USA 2 Department of Computer Science,
More informationAPPENDIX A. Background Mathematics. A.1 Linear Algebra. Vector algebra. Let x denote the n-dimensional column vector with components x 1 x 2.
APPENDIX A Background Mathematics A. Linear Algebra A.. Vector algebra Let x denote the n-dimensional column vector with components 0 x x 2 B C @. A x n Definition 6 (scalar product). The scalar product
More informationCIRCULAR CHROMATIC NUMBER AND GRAPH MINORS. Xuding Zhu 1. INTRODUCTION
TAIWANESE JOURNAL OF MATHEMATICS Vol. 4, No. 4, pp. 643-660, December 2000 CIRCULAR CHROMATIC NUMBER AND GRAPH MINORS Xuding Zhu Abstract. This paper proves that for any integer n 4 and any rational number
More informationDiscrete Geometry. Problem 1. Austin Mohr. April 26, 2012
Discrete Geometry Austin Mohr April 26, 2012 Problem 1 Theorem 1 (Linear Programming Duality). Suppose x, y, b, c R n and A R n n, Ax b, x 0, A T y c, and y 0. If x maximizes c T x and y minimizes b T
More informationCourse 311: Michaelmas Term 2005 Part III: Topics in Commutative Algebra
Course 311: Michaelmas Term 2005 Part III: Topics in Commutative Algebra D. R. Wilkins Contents 3 Topics in Commutative Algebra 2 3.1 Rings and Fields......................... 2 3.2 Ideals...............................
More informationEnumeration of subtrees of trees
Enumeration of subtrees of trees Weigen Yan a,b 1 and Yeong-Nan Yeh b a School of Sciences, Jimei University, Xiamen 36101, China b Institute of Mathematics, Academia Sinica, Taipei 1159. Taiwan. Theoretical
More informationReaching a Consensus in a Dynamically Changing Environment A Graphical Approach
Reaching a Consensus in a Dynamically Changing Environment A Graphical Approach M. Cao Yale Univesity A. S. Morse Yale University B. D. O. Anderson Australia National University and National ICT Australia
More informationTHE INVERSE FUNCTION THEOREM FOR LIPSCHITZ MAPS
THE INVERSE FUNCTION THEOREM FOR LIPSCHITZ MAPS RALPH HOWARD DEPARTMENT OF MATHEMATICS UNIVERSITY OF SOUTH CAROLINA COLUMBIA, S.C. 29208, USA HOWARD@MATH.SC.EDU Abstract. This is an edited version of a
More informationALGEBRAIC GEOMETRY (NMAG401) Contents. 2. Polynomial and rational maps 9 3. Hilbert s Nullstellensatz and consequences 23 References 30
ALGEBRAIC GEOMETRY (NMAG401) JAN ŠŤOVÍČEK Contents 1. Affine varieties 1 2. Polynomial and rational maps 9 3. Hilbert s Nullstellensatz and consequences 23 References 30 1. Affine varieties The basic objects
More information