Normal Factor Graphs, Linear Algebra and Probabilistic Modeling

Size: px

Start display at page:

Download "Normal Factor Graphs, Linear Algebra and Probabilistic Modeling"

Aron Allison
5 years ago
Views:

1 Normal Factor Graphs, Linear Algebra and Probabilistic Modeling Yongyi Mao University of Ottawa Workshop on Counting, Inference, and Optimization on Graphs Princeton University, NJ, USA November 3, 2011

2 It started on day one of my PhD program...

3 August 1, PhD advisor Frank Kschischang. Codes on graphs: normal realizations [Forney, 2001] Kschischang: Go read it! and went on a vacation...

4 August 1, PhD advisor Frank Kschischang. Codes on graphs: normal realizations [Forney, 2001] Kschischang: Go read it! and went on a vacation...

5 August 1, PhD advisor Frank Kschischang. Codes on graphs: normal realizations [Forney, 2001] Kschischang: Go read it! and went on a vacation...

6 August 1, PhD advisor Frank Kschischang. Codes on graphs: normal realizations [Forney, 2001] Kschischang: Go read it! and went on a vacation...

7 Normal graph duality x 1 C 1 ^x 1 C 2 C K 1 C K 2 x 2 C 3 C 4 ^x 2 C K 3 C K 4 C 5 C K 5 Duality theorem: Dual normal graphs represent dual codes

8 One month later... Kschischang: Why do normal graphs have duality, but factor graphs do not? Two weeks later... Me: We need... Fourier... we need convolution..." Given f(x, y) and g(y, z) f(x, y) g(y, z) = w f(x, y w)g(w, z)

9 One month later... Kschischang: Why do normal graphs have duality, but factor graphs do not? Two weeks later... Me: We need... Fourier... we need convolution..." Given f(x, y) and g(y, z) f(x, y) g(y, z) = w f(x, y w)g(w, z)

10 One month later... Kschischang: Why do normal graphs have duality, but factor graphs do not? Two weeks later... Me: We need... Fourier... we need convolution..." Given f(x, y) and g(y, z) f(x, y) g(y, z) = w f(x, y w)g(w, z)

11 Convolutional factor graph bipartite graph: function nodes and variables nodes encodes a convolutional product f 1 x 1 f 2 x 2 f 3 x 3 f 1 (x 1, x 2 ) f 2 (x 1, x 3 ) f 3 (x 1, x 3 )

12 Factor graph duality Dual FGs: (G, {f i }, ) and (G, {^f i }, ) F Dual FGs encode a FT pair Dual FGs of codes: (G, {δ Ci }, ) and (G, {δ C K i }, ) δ C F δc K Dual FGs represent dual codes Normal graph duality reinterpreted On factor graphs and the Fourier transform [Mao & Kschischang, ISIT2001, IT2005]

13 Factor graph duality Dual FGs: (G, {f i }, ) and (G, {^f i }, ) F Dual FGs encode a FT pair Dual FGs of codes: (G, {δ Ci }, ) and (G, {δ C K i }, ) δ C F δc K Dual FGs represent dual codes Normal graph duality reinterpreted On factor graphs and the Fourier transform [Mao & Kschischang, ISIT2001, IT2005]

14 Factor graph duality Dual FGs: (G, {f i }, ) and (G, {^f i }, ) F Dual FGs encode a FT pair Dual FGs of codes: (G, {δ Ci }, ) and (G, {δ C K i }, ) δ C F δc K Dual FGs represent dual codes Normal graph duality reinterpreted On factor graphs and the Fourier transform [Mao & Kschischang, ISIT2001, IT2005]

15 Factor graph duality Dual FGs: (G, {f i }, ) and (G, {^f i }, ) F Dual FGs encode a FT pair Dual FGs of codes: (G, {δ Ci }, ) and (G, {δ C K i }, ) δ C F δc K Dual FGs represent dual codes Normal graph duality reinterpreted On factor graphs and the Fourier transform [Mao & Kschischang, ISIT2001, IT2005]

16 Factor graph duality Dual FGs: (G, {f i }, ) and (G, {^f i }, ) F Dual FGs encode a FT pair Dual FGs of codes: (G, {δ Ci }, ) and (G, {δ C K i }, ) δ C F δc K Dual FGs represent dual codes Normal graph duality reinterpreted On factor graphs and the Fourier transform [Mao & Kschischang, ISIT2001, IT2005]

17 Factor graph duality Dual FGs: (G, {f i }, ) and (G, {^f i }, ) F Dual FGs encode a FT pair Dual FGs of codes: (G, {δ Ci }, ) and (G, {δ C K i }, ) δ C F δc K Dual FGs represent dual codes Normal graph duality reinterpreted On factor graphs and the Fourier transform [Mao & Kschischang, ISIT2001, IT2005]

18 Factor graph duality Dual FGs: (G, {f i }, ) and (G, {^f i }, ) F Dual FGs encode a FT pair Dual FGs of codes: (G, {δ Ci }, ) and (G, {δ C K i }, ) δ C F δc K Dual FGs represent dual codes Normal graph duality reinterpreted On factor graphs and the Fourier transform [Mao & Kschischang, ISIT2001, IT2005]

19 The reviewer... Dave Forney 9 pages of review Forney: It may take years for people to take it [convolutional FG] and run

20 The reviewer... Dave Forney 9 pages of review Forney: It may take years for people to take it [convolutional FG] and run

21 The reviewer... Dave Forney 9 pages of review Forney: It may take years for people to take it [convolutional FG] and run

22 Going probabilistic... Bredan Frey Convolutional FGs as probabilistic models [Mao, Kschischang & Frey, UAI 2004] Each node: joint distribution of a collection of (latent) variables All collections are independent Convoltional CFG represents the joint distribution of observed variables that are linearly transformed from the latent variables

23 Going probabilistic... Bredan Frey Convolutional FGs as probabilistic models [Mao, Kschischang & Frey, UAI 2004] Each node: joint distribution of a collection of (latent) variables All collections are independent Convoltional CFG represents the joint distribution of observed variables that are linearly transformed from the latent variables

24 Going probabilistic... Bredan Frey Convolutional FGs as probabilistic models [Mao, Kschischang & Frey, UAI 2004] Each node: joint distribution of a collection of (latent) variables All collections are independent Convoltional CFG represents the joint distribution of observed variables that are linearly transformed from the latent variables

25 Going probabilistic... Bredan Frey Convolutional FGs as probabilistic models [Mao, Kschischang & Frey, UAI 2004] Each node: joint distribution of a collection of (latent) variables All collections are independent Convoltional CFG represents the joint distribution of observed variables that are linearly transformed from the latent variables

26 Linear characteristic graphical models December 2010, from Danny Bickson: Inference with multivariate heavy-tails in linear models [Bickson & Guestrin, NIPS2010] modelling heavy-tail (stable) distributions applications and inference algorithms based on convolutional FG and duality

27 Linear characteristic graphical models December 2010, from Danny Bickson: Inference with multivariate heavy-tails in linear models [Bickson & Guestrin, NIPS2010] modelling heavy-tail (stable) distributions applications and inference algorithms based on convolutional FG and duality

28 Linear characteristic graphical models December 2010, from Danny Bickson: Inference with multivariate heavy-tails in linear models [Bickson & Guestrin, NIPS2010] modelling heavy-tail (stable) distributions applications and inference algorithms based on convolutional FG and duality

29 Linear characteristic graphical models December 2010, from Danny Bickson: Inference with multivariate heavy-tails in linear models [Bickson & Guestrin, NIPS2010] modelling heavy-tail (stable) distributions applications and inference algorithms based on convolutional FG and duality

30 Missed opportunity... Year met Frey in Toronto Frey: How about a model for CDF? Cumulative distribution networks and derivativesum-product algorithm [Huang & Frey, UAI2008] F X1X 2 x 1 F X1X 3 x 2 F X1X 3 x 3 CDN: multiplicative FG each function node: a CDF the global function is a CDF efficiently solves structured ranking problems

31 Missed opportunity... Year met Frey in Toronto Frey: How about a model for CDF? Cumulative distribution networks and derivativesum-product algorithm [Huang & Frey, UAI2008] F X1X 2 x 1 F X1X 3 x 2 F X1X 3 x 3 CDN: multiplicative FG each function node: a CDF the global function is a CDF efficiently solves structured ranking problems

32 Missed opportunity... Year met Frey in Toronto Frey: How about a model for CDF? Cumulative distribution networks and derivativesum-product algorithm [Huang & Frey, UAI2008] F X1X 2 x 1 F X1X 3 x 2 F X1X 3 x 3 CDN: multiplicative FG each function node: a CDF the global function is a CDF efficiently solves structured ranking problems

33 Missed opportunity... Year met Frey in Toronto Frey: How about a model for CDF? Cumulative distribution networks and derivativesum-product algorithm [Huang & Frey, UAI2008] F X1X 2 x 1 F X1X 3 x 2 F X1X 3 x 3 CDN: multiplicative FG each function node: a CDF the global function is a CDF efficiently solves structured ranking problems

34 Missed opportunity... Year met Frey in Toronto Frey: How about a model for CDF? Cumulative distribution networks and derivativesum-product algorithm [Huang & Frey, UAI2008] F X1X 2 x 1 F X1X 3 x 2 F X1X 3 x 3 CDN: multiplicative FG each function node: a CDF the global function is a CDF efficiently solves structured ranking problems

35 Conditional/marginal independence properties f 1 x 1 f 2 x 2 f 3 x 3 Multiplicative FG: conditional indepdence; Convolutional FG: marginal independence; CDN: marginal independence

36 Again, it is Frank Kschischang... Early Kschischang: "Look at this!" Constrained coding as networks of relations [Schwartz & Bruck 2008] Holographic algorithms [Valiant 2004] Counting problem ± ± Holographic reduction: Holant theorem Apparent normal structure Do something with holographic algorithm? My question: the magic of Holant theorem...

37 Again, it is Frank Kschischang... Early Kschischang: "Look at this!" Constrained coding as networks of relations [Schwartz & Bruck 2008] Holographic algorithms [Valiant 2004] Counting problem ± ± Holographic reduction: Holant theorem Apparent normal structure Do something with holographic algorithm? My question: the magic of Holant theorem...

38 Again, it is Frank Kschischang... Early Kschischang: "Look at this!" Constrained coding as networks of relations [Schwartz & Bruck 2008] Holographic algorithms [Valiant 2004] Counting problem ± ± Holographic reduction: Holant theorem Apparent normal structure Do something with holographic algorithm? My question: the magic of Holant theorem...

39 Again, it is Frank Kschischang... Early Kschischang: "Look at this!" Constrained coding as networks of relations [Schwartz & Bruck 2008] Holographic algorithms [Valiant 2004] Counting problem ± ± Holographic reduction: Holant theorem Apparent normal structure Do something with holographic algorithm? My question: the magic of Holant theorem...

40 Again, it is Frank Kschischang... Early Kschischang: "Look at this!" Constrained coding as networks of relations [Schwartz & Bruck 2008] Holographic algorithms [Valiant 2004] Counting problem ± ± Holographic reduction: Holant theorem Apparent normal structure Do something with holographic algorithm? My question: the magic of Holant theorem...

41 Give it the student... Ali Al-Bashebsheh Reformulated normal factor graphs Generalized Holant theorem Unified normal graph duality theorem and Holant theorem

42 Give it the student... Ali Al-Bashebsheh Reformulated normal factor graphs Generalized Holant theorem Unified normal graph duality theorem and Holant theorem

43 Two concurrent submissions... Editor Pascal Vontobel Normal factor graphs and holographic transformations " [Al-Bashebsheh & Mao 2011] Codes on graphs: duality and MacWilliams identities [Forney 2011] Extensive comments, rounds of rewriting

44 Two concurrent submissions... Editor Pascal Vontobel Normal factor graphs and holographic transformations " [Al-Bashebsheh & Mao 2011] Codes on graphs: duality and MacWilliams identities [Forney 2011] Extensive comments, rounds of rewriting

45 Normal factor graphs under new semantics... x 1 f x 3 1 f 2 x 4 x 5 x6 Z G (x 1, x 2 ) := x 2 f 3 f 4 x7 x 3,...,x 7 f 1 (x 1, x 3, x 4 )f 2 (x 3, x 5, x 6 )f 3 (x 2, x 4, x 5, x 7 )f 4 (x 6, x 7 ). := xf 1, f 2, f 3, f 4 y. := xf 1 (x 1, x 3, x 4 ), f 2 (x 3, x 5, x 6 ), f 3 (x 2, x 4, x 5, x 7 ), f 4 (x 6, x 7 ).y

46 Functions as multidimensional arrays X 1 = 2, X 2 = 3, and X 3 = 5. f x 1 x 2 x 3 f x 1 x 2 x 3 f x 1 x 2 x 3

47 Functions as multidimensional arrays X 1 = 2, X 2 = 3, and X 3 = 5. f x 1 x 2 x 3 f x 1 x 2 x 3 f x 1 x 2 x 3

48 Functions as multidimensional arrays X 1 = 2, X 2 = 3, and X 3 = 5. f x 1 x 2 x 3 f x 1 x 2 x 3 f x 1 x 2 x 3

49 Example (Equality Indicator) { 1, s = s δ(s, s 1 1 ) := 0, otherwise δ(, ): identity matrix Example ( Transformer ) Function Φ : χ χ C is called a transformer if it corresponds to an invertible matrix. Dual pair of transformers Φ and p Φ: an inverse pair of matrices xφ(s, t), p Φ(t, s 1 )y = δ(s, s 1 )

50 Example (Equality Indicator) { 1, s = s δ(s, s 1 1 ) := 0, otherwise δ(, ): identity matrix Example ( Transformer ) Function Φ : χ χ C is called a transformer if it corresponds to an invertible matrix. Dual pair of transformers Φ and p Φ: an inverse pair of matrices xφ(s, t), p Φ(t, s 1 )y = δ(s, s 1 )

51 Example (Equality Indicator) { 1, s = s δ(s, s 1 1 ) := 0, otherwise δ(, ): identity matrix Example ( Transformer ) Function Φ : χ χ C is called a transformer if it corresponds to an invertible matrix. Dual pair of transformers Φ and p Φ: an inverse pair of matrices xφ(s, t), p Φ(t, s 1 )y = δ(s, s 1 )

52 Example (Equality Indicator) { 1, s = s δ(s, s 1 1 ) := 0, otherwise δ(, ): identity matrix Example ( Transformer ) Function Φ : χ χ C is called a transformer if it corresponds to an invertible matrix. Dual pair of transformers Φ and p Φ: an inverse pair of matrices xφ(s, t), p Φ(t, s 1 )y = δ(s, s 1 )

53 Example (Equality Indicator) { 1, s = s δ(s, s 1 1 ) := 0, otherwise δ(, ): identity matrix Example ( Transformer ) Function Φ : χ χ C is called a transformer if it corresponds to an invertible matrix. Dual pair of transformers Φ and p Φ: an inverse pair of matrices xφ(s, t), p Φ(t, s 1 )y = δ(s, s 1 )

54 Functions as linear maps xf, gy: xf(s), g(s)y: vector-vector dot product xf(s), g(s, t)y: vector-matrix product xf(s, t), g(t, u)y: matrix-matrix product xf(s), g(t)y: vector outer product, matrix Kronecker product, tensor product etc NFGs are linear algebraic expressions written graphically.

55 Functions as linear maps xf, gy: xf(s), g(s)y: vector-vector dot product xf(s), g(s, t)y: vector-matrix product xf(s, t), g(t, u)y: matrix-matrix product xf(s), g(t)y: vector outer product, matrix Kronecker product, tensor product etc NFGs are linear algebraic expressions written graphically.

56 Functions as linear maps xf, gy: xf(s), g(s)y: vector-vector dot product xf(s), g(s, t)y: vector-matrix product xf(s, t), g(t, u)y: matrix-matrix product xf(s), g(t)y: vector outer product, matrix Kronecker product, tensor product etc NFGs are linear algebraic expressions written graphically.

57 Functions as linear maps xf, gy: xf(s), g(s)y: vector-vector dot product xf(s), g(s, t)y: vector-matrix product xf(s, t), g(t, u)y: matrix-matrix product xf(s), g(t)y: vector outer product, matrix Kronecker product, tensor product etc NFGs are linear algebraic expressions written graphically.

58 Functions as linear maps xf, gy: xf(s), g(s)y: vector-vector dot product xf(s), g(s, t)y: vector-matrix product xf(s, t), g(t, u)y: matrix-matrix product xf(s), g(t)y: vector outer product, matrix Kronecker product, tensor product etc NFGs are linear algebraic expressions written graphically.

59 Opening/closing the box [Vontobel, Loeliger, 2002] f xf, gy g X X X Y X Opening/closing the box is a powerful technique

60 Opening/closing the box [Vontobel, Loeliger, 2002] f xf, gy g X X X Y X Opening/closing the box is a powerful technique

61 Holographic transformation x 1 y 1 f 1 f 2 f 1 f 2 x 2 f 3 f 4 y 2 f 3 f 4 f 5 f 5 Z G Z G H Theorem (Generalized Holant Theorem) Z G H is externally transformed version of Z G.

62 Generalized Holant Theorem When external transformers are δ(, ) reduces to Valiant s Holant Theorem When all transformers are Fourier kernels reduces to General NFG Duality Theorem when each function is a subgroup-indicator function reduces to Forney s Normal Graph Duality Theorem

63 Generalized Holant Theorem When external transformers are δ(, ) reduces to Valiant s Holant Theorem When all transformers are Fourier kernels reduces to General NFG Duality Theorem when each function is a subgroup-indicator function reduces to Forney s Normal Graph Duality Theorem

64 Generalized Holant Theorem When external transformers are δ(, ) reduces to Valiant s Holant Theorem When all transformers are Fourier kernels reduces to General NFG Duality Theorem when each function is a subgroup-indicator function reduces to Forney s Normal Graph Duality Theorem

65 General NFG Duality Theorem x 1 ^x 1 f 1 f 2 F[f1] F[f2] x 2 f 3 f 4 ^x 2 F[f3] F[f4] f 5 F[f5] x 1 f 1 f 2 ^x 1 f 1 f 2 x 2 f 3 f 4 ^x 2 f 3 f 4 f 5 f 5

66 Holographic algorithm: PerfMatch(H) H: weighted graph PerfMatch(H) π(h) := ¹ MPQ(H) epm w(e), where Q(H) the set of all perfect matchings of H. If H is planar, π(h) is poly. solvable by FKT algorithm Matchgate (G, U), where G is a weighted graph, U V(G). Signature of matchgate (G, U) is a function µ(x U )... a? 1 2 b? 3 c a 1{2 1 b 2 3 c a a? 1? 1{ b c b c 2

67 Holographic algorithm: PerfMatch(H) H: weighted graph PerfMatch(H) π(h) := ¹ MPQ(H) epm w(e), where Q(H) the set of all perfect matchings of H. If H is planar, π(h) is poly. solvable by FKT algorithm Matchgate (G, U), where G is a weighted graph, U V(G). Signature of matchgate (G, U) is a function µ(x U )... a? 1 2 b? 3 c a 1{2 1 b 2 3 c a a? 1? 1{ b c b c 2

68 Holographic algorithm: PerfMatch(H) H: weighted graph PerfMatch(H) π(h) := ¹ MPQ(H) epm w(e), where Q(H) the set of all perfect matchings of H. If H is planar, π(h) is poly. solvable by FKT algorithm Matchgate (G, U), where G is a weighted graph, U V(G). Signature of matchgate (G, U) is a function µ(x U )... a? 1 2 b? 3 c a 1{2 1 b 2 3 c a a? 1? 1{ b c b c 2

69 Holographic algorithm: a graph-theoretic property Lemma: π(h) = x E ± m i=1 µ i(x E(i) ). 1 a a? 1? 1{ b c b c 2 1 a? 1 2 b? 3 c a 1{2 1 3 b c a a? 1? 1{ b c b c 2 1 a? 1 2 b? 3 c a 1{2 1 3 b c 2

70 Holographic algorithm Solve Z := x E ±v f v(x E(v) ) a? 1 2 b? 3 c a 1{2 1 3 b c 2 a? 1 2 b? 3 c a 1{2 1 3 b c 2 FKT

71 Don t forget the mathematicians... Vontobel pointed to Trace Diagram, Birdtracks... Unshackling linear algebra from linear notation [Peterson 2009] Group Theory: Birdtracks, Lie s, and Exceptional Group, [Cvitanovic, 2008] Trace diagram is contained in NFG framework Trace diagrams are NFGs with degree 1, 2, or χ NFGs: a diagrammatic approach to linear algebra [Al-Bashebsheh, Mao & Vontobel ISIT2011]

72 Don t forget the mathematicians... Vontobel pointed to Trace Diagram, Birdtracks... Unshackling linear algebra from linear notation [Peterson 2009] Group Theory: Birdtracks, Lie s, and Exceptional Group, [Cvitanovic, 2008] Trace diagram is contained in NFG framework Trace diagrams are NFGs with degree 1, 2, or χ NFGs: a diagrammatic approach to linear algebra [Al-Bashebsheh, Mao & Vontobel ISIT2011]

73 Don t forget the mathematicians... Vontobel pointed to Trace Diagram, Birdtracks... Unshackling linear algebra from linear notation [Peterson 2009] Group Theory: Birdtracks, Lie s, and Exceptional Group, [Cvitanovic, 2008] Trace diagram is contained in NFG framework Trace diagrams are NFGs with degree 1, 2, or χ NFGs: a diagrammatic approach to linear algebra [Al-Bashebsheh, Mao & Vontobel ISIT2011]

74 Don t forget the mathematicians... Vontobel pointed to Trace Diagram, Birdtracks... Unshackling linear algebra from linear notation [Peterson 2009] Group Theory: Birdtracks, Lie s, and Exceptional Group, [Cvitanovic, 2008] Trace diagram is contained in NFG framework Trace diagrams are NFGs with degree 1, 2, or χ NFGs: a diagrammatic approach to linear algebra [Al-Bashebsheh, Mao & Vontobel ISIT2011]

75 Ciliated function nodes Draw the edges of f counter-clockwise, with first edge ciliated x n x n 1 x k+1... f... x n x n 1 x k+1... f... x 1 x 2 x k x 1 x 2 x k f(x 1,..., x n ) f(x k+1,..., x n, x 1,..., x k )

76 Ciliated function nodes x 1 x 1 x 1 x 1 A A A A B B B B x 2 x 2 x 2 x 2 A B A B T A T B T A T B

77 Trace A i Z = i A(i, i) = tr(a)

78 tr(a B) = tr(b A) Proof: A B Read the graph in two ways: Z = tr(a B) and Z = tr(b A). l

79 Levi-Civita symbol S n : the permutation group on {1,..., n}, n = χ. Levi-Civita symbol, ε : {1,..., n} n C s.t. ( ) ( ) 1 n 1 n sgn, ε(x 1,..., x n )= x 1 x n x 1 x n 0, otherwise Contraction (n = 3). t P S n ε(y 1, y 2, t)ε(t, x 2, x 1 ) = δ(x 1, y 2 )δ(x 2, y 1 ) δ(x 1, y 1 )δ(x 2, y 2 ). x 1 x 2 ε t ε = x 1 x 2 y 1 y 2 x 1 x 2 y 1 y 2 y 1 y 2

80 Levi-Civita symbol S n : the permutation group on {1,..., n}, n = χ. Levi-Civita symbol, ε : {1,..., n} n C s.t. ( ) ( ) 1 n 1 n sgn, ε(x 1,..., x n )= x 1 x n x 1 x n 0, otherwise Contraction (n = 3). t P S n ε(y 1, y 2, t)ε(t, x 2, x 1 ) = δ(x 1, y 2 )δ(x 2, y 1 ) δ(x 1, y 1 )δ(x 2, y 2 ). x 1 x 2 ε t ε = x 1 x 2 y 1 y 2 x 1 x 2 y 1 y 2 y 1 y 2

81 Levi-Civita symbol S n : the permutation group on {1,..., n}, n = χ. Levi-Civita symbol, ε : {1,..., n} n C s.t. ( ) ( ) 1 n 1 n sgn, ε(x 1,..., x n )= x 1 x n x 1 x n 0, otherwise Contraction (n = 3). t P S n ε(y 1, y 2, t)ε(t, x 2, x 1 ) = δ(x 1, y 2 )δ(x 2, y 1 ) δ(x 1, y 1 )δ(x 2, y 2 ). x 1 x 2 ε t ε = x 1 x 2 y 1 y 2 x 1 x 2 y 1 y 2 y 1 y 2

82 Vector cross product u, v : {1, 2, 3} F u, v P F 3. Cross Product. Z(x) = t 1,t 2 u(t 1 )v(t 2 )ε(t 1, t 2, x). Z(1) = u(2)v(3) u(3)v(2) Z(2) = u(3)v(1) u(1)v(3) Z(3) = u(1)v(2) u(2)v(1) x ε t 1 t 2 u v Z = u v.

83 Vector cross product u, v : {1, 2, 3} F u, v P F 3. Cross Product. Z(x) = t 1,t 2 u(t 1 )v(t 2 )ε(t 1, t 2, x). Z(1) = u(2)v(3) u(3)v(2) Z(2) = u(3)v(1) u(1)v(3) Z(3) = u(1)v(2) u(2)v(1) x ε t 1 t 2 u v Z = u v.

84 Vector cross product u, v : {1, 2, 3} F u, v P F 3. Cross Product. Z(x) = t 1,t 2 u(t 1 )v(t 2 )ε(t 1, t 2, x). Z(1) = u(2)v(3) u(3)v(2) Z(2) = u(3)v(1) u(1)v(3) Z(3) = u(1)v(2) u(2)v(1) x ε t 1 t 2 u v Z = u v.

85 Vector cross product u, v : {1, 2, 3} F u, v P F 3. Cross Product. Z(x) = t 1,t 2 u(t 1 )v(t 2 )ε(t 1, t 2, x). Z(1) = u(2)v(3) u(3)v(2) Z(2) = u(3)v(1) u(1)v(3) Z(3) = u(1)v(2) u(2)v(1) x ε t 1 t 2 u v Z = u v.

86 Vector cross product u, v : {1, 2, 3} F u, v P F 3. Cross Product. Z(x) = t 1,t 2 u(t 1 )v(t 2 )ε(t 1, t 2, x). Z(1) = u(2)v(3) u(3)v(2) Z(2) = u(3)v(1) u(1)v(3) Z(3) = u(1)v(2) u(2)v(1) x ε t 1 t 2 u v Z = u v.

87 Cross product identities (u v) (s w) = ((u v) s) w = (w (u v)) s = ((s w) u) v = (v (s w)) u = (u s)(v w) (u w)(v s). Proof: w s ε ε u v = w s ε ε u v = w s ε ε u v = w s ε ε u v = w s ε ε u v = w s ε ε u v = w s u v w s u v

88 Determinant ± n A := sgn(σ) σps n j=1 A(j, σ(j)). Lemma: ε = A ε A... A...

89 A B = A B Proof: AB ε... = AB ε... AB = B A ε A... B = A B ε... B = A B ε...

90 A T = A Proof: ε ε ε ε A T... =... A T A T =... A A = A... ε ε ε ε

91 Pfaffian A is a 2n 2n skew-symmetric matrix. ± Pf(A) = 1 2 n n! sgn(σ) n A ( σ(2i 1), σ(2i) ). σps 2n i=1 ε A A... A Z = n! 2 n Pf(A) Affirms a conjecture of Peterson [Peterson, 2009].

92 Pfaffian and Determinant ( ) 0 In J :=, A := R T J R for some R. Then Pf(A) 2 = A. I n 0 Proof: Pf(J) = 1 Pf(A) ε J J... J = ε A A... A R R R = ε J J... J R R R R R R = ε J J... J R R R = R ε J J... J Pf(A) = R Pf(A) 2 = R 2 = A.

93 Going probabilistic... NFGs as probabilistic model [Al-Bashebsheh & Mao, 2011, soon available on arxiv] Internal edges: latent variabels External edges: observed variables Exterior function: joint distribution of observed variables (up to scale) Function: compatibility/constraints/interaction protential source of randomness, i.e., distribution Bipartite normal: WLOG

94 Going probabilistic... NFGs as probabilistic model [Al-Bashebsheh & Mao, 2011, soon available on arxiv] Internal edges: latent variabels External edges: observed variables Exterior function: joint distribution of observed variables (up to scale) Function: compatibility/constraints/interaction protential source of randomness, i.e., distribution Bipartite normal: WLOG

95 Going probabilistic... NFGs as probabilistic model [Al-Bashebsheh & Mao, 2011, soon available on arxiv] Internal edges: latent variabels External edges: observed variables Exterior function: joint distribution of observed variables (up to scale) Function: compatibility/constraints/interaction protential source of randomness, i.e., distribution Bipartite normal: WLOG

96 Special functions x 1 x 1 x 1 Σ max x 2 x n x 2 x n x 2 x n

97 Cumulus and difference transformers A D χ: an ordered set (integers) with well defined. A(x, x 1 ) = [x 1 x] [ ] 1 0 E.g. χ = 2, A = ; χ = 3, A = xf X (x 1 ), A(x, x 1 )y = F X (x) D := A 1, e.g.d = for χ =

98 Bipartite NFG reduces to FG One partitiion: arbitrary potential functions The other partition: equality indicators NFG model reduces to FG model f1 x 1 f 1 x 1 f2 x 2 f 2 x 2 f3 x 3 f 3 x 3

99 Bipartite NFG reduces to convolutional FG One side: distributions of indep. collections of latent RVs The other side: sum indicators No RVs from the same collection connect to the same indicator NFG model reduces to convolutional FG model f 1 x 1 f 1 x 1 f2 x 2 f 2 x 2 x f 3 3 f 3 x 3 f(x, u)g(v, y)[w = u + v] = f(x, u)g(w u, y) = f(x, w) g(w, y) u,v u

100 CDN is a holographic transformed NFG Lemma x 1 A max x 2... xn = A x 1 = x 2... A x n Corollary x 1 A D max x 2 D x n = x 1 x 2 x n

101 CDN is a holographic transformed NFG One side: distributions of indep. collections of latent RVs The other side: max indicators No RVs from the same collection connect to the same indicator f 1 max x 1 f 2 x 2 f 3 max x 3

102 CDN is a holographic transformed NFG x 2 A f 1 f 2 f 3 max max x 1 x 2 x 3 f1 f 2 f3 A A A A A D D D D D max max A A x 1 x 3 F X1 X 2 x 1 F X1 X 2 x 1 F X1 X 3 x 2 F X1 X 3 x 2 F X1 X 3 x 3 F X1 X 3 x 3

103 The independence coincidence of Convolutional FG and CDN f 1 max x 1 f 1 x 1 f 2 x 2 f2 x 2 f 3 max x 3 f 3 x 3 Both are generative models In fact, the independence property holds when changing indicator to conditional distributions Changing the right-side functions gives rise to a family of infinite models... what applications?

104 The End NFGs are linear algebraic expressions written graphically Opening/closing the box is a powerful technique NFGs are potential tools for inference Outlook: continuous alphabets

105 The End NFGs are linear algebraic expressions written graphically Opening/closing the box is a powerful technique NFGs are potential tools for inference Outlook: continuous alphabets

106 The End NFGs are linear algebraic expressions written graphically Opening/closing the box is a powerful technique NFGs are potential tools for inference Outlook: continuous alphabets

107 The End NFGs are linear algebraic expressions written graphically Opening/closing the box is a powerful technique NFGs are potential tools for inference Outlook: continuous alphabets

108 The End NFGs are linear algebraic expressions written graphically Opening/closing the box is a powerful technique NFGs are potential tools for inference Outlook: continuous alphabets

109

Codes on Graphs, Normal Realizations, and Partition Functions

Codes on Graphs, Normal Realizations, and Partition Functions G. David Forney, Jr. 1 Workshop on Counting, Inference and Optimization Princeton, NJ November 3, 2011 1 Joint work with: Heide Gluesing-Luerssen,