From Arabidopsis roots to bilinear equations

1 From Arabidopsis roots to bilinear equations. Dustin Cartwright, October 22. Joint work with Philip Benfey, Siobhan Brady, David Orlando (Duke University) and Bernd Sturmfels (UC Berkeley); research supported by the DARPA project Fundamental Laws of Biology.

2 Arabidopsis root

3 Arabidopsis root Gene expression microarrays are a tool to understand dynamics and regulatory processes.

4 Arabidopsis root Gene expression microarrays are a tool to understand dynamics and regulatory processes. Two ways of separating cells in the lab: (1) chemically, using 18 markers (colors in diagram A)

5 Arabidopsis root Gene expression microarrays are a tool to understand dynamics and regulatory processes. Two ways of separating cells in the lab: (1) chemically, using 18 markers (colors in diagram A); (2) physically, using 13 longitudinal sections (red lines in diagram B)

6 Measurement along two axes Markers measure variation among cell types.

7 Measurement along two axes Markers measure variation among cell types. Longitudinal sections measure variation along developmental stage.

8 Measurement along two axes Markers measure variation among cell types. Longitudinal sections measure variation along developmental stage. A naïve approach would use the variation within each set of experiments as a proxy for variation along the corresponding axis.

9 Problem with naïve approach Correspondence between markers and cell types is imperfect.

10 Problem with naïve approach Correspondence between markers and cell types is imperfect. For example, the sample labelled APL consists of a mixture of two cell types, phloem and phloem companion cells. [Table on slide: contributions of the cell types phloem, phloem companion cells, and columella by section; numerical entries not recovered.]

11 Problem with naïve approach Similarly, the longitudinal sections do not have the same mixture of cells. For example: In each of sections 1-5, 30-50% of the cells are lateral root cap cells.

12 Problem with naïve approach Similarly, the longitudinal sections do not have the same mixture of cells. For example: In each of sections 1-5, 30-50% of the cells are lateral root cap cells. In sections 6-12, there are no lateral root cap cells.

13 Problem with naïve approach Similarly, the longitudinal sections do not have the same mixture of cells. For example: In each of sections 1-5, 30-50% of the cells are lateral root cap cells. In sections 6-12, there are no lateral root cap cells. Conclusion: We need to analyze each transcript across all 31 (= 18 + 13) experiments to model the expression pattern in the whole root.

14 Model A cluster consists of cells of the same type in the same section. Each cluster has an expression level.

15 Model A cluster consists of cells of the same type in the same section. Each cluster has an expression level. For each marker and each longitudinal section, we have a measurement functional, a linear combination of the expression levels in different clusters.

16 Model A cluster consists of cells of the same type in the same section. Each cluster has an expression level. For each marker and each longitudinal section, we have a measurement functional, a linear combination of the expression levels in different clusters. The coefficients of these functionals can be determined from: (1) the numbers of cells present in each section; (2) the marker selection patterns.

17 Model A cluster consists of cells of the same type in the same section. Each cluster has an expression level. For each marker and each longitudinal section, we have a measurement functional, a linear combination of the expression levels in different clusters. The coefficients of these functionals can be determined from: (1) the numbers of cells present in each section; (2) the marker selection patterns. Under-constrained system: 31 (= 18 + 13) functionals and 129 clusters.

18 Assumption Since the system is under-constrained, we make the following assumption.

19 Assumption Since the system is under-constrained, we make the following assumption. The dependence of the expression level on the section is independent of its dependence on the cell type.

20 Assumption Since the system is under-constrained, we make the following assumption. The dependence of the expression level on the section is independent of its dependence on the cell type. More precisely, the expression level of the cluster in section i and cell type j is x_i y_j for some vectors x and y.

21 Assumption Since the system is under-constrained, we make the following assumption. The dependence of the expression level on the section is independent of its dependence on the cell type. More precisely, the expression level of the cluster in section i and cell type j is x_i y_j for some vectors x and y. Example: If the expression level is either 0 or 1 (off or on), then our assumption says that it is 1 exactly on the combination of some subset of the sections with some subset of the cell types.
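For concreteness, here is a toy sketch (not from the talk) of this separability assumption in NumPy: the full table of cluster expression levels is the outer product of a per-section vector x and a per-cell-type vector y, and in the on/off case it equals 1 exactly on a product of subsets.

```python
import numpy as np

# Toy illustration of the separability assumption: the expression level of the
# cluster in section i and cell type j is x_i * y_j, so the full table of
# cluster expression levels is the outer product of x and y.
x = np.array([0, 1, 1, 0, 1])   # on/off pattern over 5 hypothetical sections
y = np.array([1, 0, 1])         # on/off pattern over 3 hypothetical cell types
expression = np.outer(x, y)     # equals 1 exactly where both section and type are "on"
print(expression)
```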

22 Non-negative bilinear equations
A^(1), ..., A^(k): n × m non-negative matrices (cell mixture)
o_1, ..., o_k: non-negative scalars (expression levels)
Solve (approximately)
f_1(x, y) := x^t A^(1) y = o_1
...
f_k(x, y) := x^t A^(k) y = o_k
x_1 + ... + x_n = 1

23 Non-negative bilinear equations
A^(1), ..., A^(k): n × m non-negative matrices (cell mixture)
o_1, ..., o_k: non-negative scalars (expression levels)
Solve (approximately)
f_1(x, y) := x^t A^(1) y = o_1
...
f_k(x, y) := x^t A^(k) y = o_k
x_1 + ... + x_n = 1
for x and y non-negative vectors of dimensions n × 1 and m × 1, respectively.
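As a minimal numerical sketch (toy sizes and random data, not the root data set), each f_l is the bilinear form x^t A^(l) y, so all k measurements can be evaluated with a single einsum:

```python
import numpy as np

def bilinear_functionals(A, x, y):
    """Evaluate f_l(x, y) = x^t A^(l) y for l = 1, ..., k; A has shape (k, n, m)."""
    return np.einsum('i,lij,j->l', x, A, y)

rng = np.random.default_rng(0)
k, n, m = 4, 5, 3                      # toy sizes, not the 31 experiments / 129 clusters
A = rng.random((k, n, m))              # non-negative mixture matrices A^(1), ..., A^(k)
x = rng.dirichlet(np.ones(n))          # non-negative, with x_1 + ... + x_n = 1
y = rng.random(m)                      # non-negative expression profile over cell types
o = bilinear_functionals(A, x, y)      # the k (noise-free) measurements o_1, ..., o_k
print(o)
```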

24 Probabilistic interpretation
f_l(x, y) := Σ_{i,j} A^(l)_{ij} x_i y_j for l = 1, ..., k
Up to scaling, this vector has the form of the family of probability distributions (depending on vectors x and y)

25 Probabilistic interpretation
f_l(x, y) := Σ_{i,j} A^(l)_{ij} x_i y_j for l = 1, ..., k
Up to scaling, this vector has the form of the family of probability distributions (depending on vectors x and y) coming from the following process:
1. Pick a pair of integers from {1, ..., n} × {1, ..., m}, with (i, j) having probability proportional to (Σ_l A^(l)_{ij}) x_i y_j

26 Probabilistic interpretation
f_l(x, y) := Σ_{i,j} A^(l)_{ij} x_i y_j for l = 1, ..., k
Up to scaling, this vector has the form of the family of probability distributions (depending on vectors x and y) coming from the following process:
1. Pick a pair of integers from {1, ..., n} × {1, ..., m}, with (i, j) having probability proportional to (Σ_l A^(l)_{ij}) x_i y_j
2. Output an integer from {1, ..., k}. Conditional on having picked i and j in the previous step, the probability of outputting l is A^(l)_{ij} / Σ_l A^(l)_{ij}
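A sketch of this two-step process as code, assuming arrays shaped as in the previous snippet; `sample_observation` is a hypothetical helper, not part of the talk (e.g. call it as sample_observation(A, x, y, np.random.default_rng(0))):

```python
import numpy as np

def sample_observation(A, x, y, rng):
    """Draw one output l from the two-step process described above."""
    k, n, m = A.shape
    Abar = A.sum(axis=0)                               # Σ_l A^(l)_ij
    joint = Abar * np.outer(x, y)                      # unnormalized probability of the pair (i, j)
    idx = rng.choice(n * m, p=(joint / joint.sum()).ravel())
    i, j = np.unravel_index(idx, (n, m))               # step 1: the chosen pair (i, j)
    return rng.choice(k, p=A[:, i, j] / Abar[i, j])    # step 2: l with prob. A^(l)_ij / Σ_l A^(l)_ij
```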

27 Maximum Likelihood Estimation
Rescaling both sides of our system of equations:
f_l(x, y) / Σ_{l'} f_{l'}(x, y) = o_l / Σ_{l'} o_{l'} for l = 1, ..., k

28 Maximum Likelihood Estimation
Rescaling both sides of our system of equations:
f_l(x, y) / Σ_{l'} f_{l'}(x, y) = o_l / Σ_{l'} o_{l'} for l = 1, ..., k
Finding an approximate solution to these equations is known as Maximum Likelihood Estimation.

29 Kullback-Leibler divergence
Kullback-Leibler divergence gives a way of comparing two probability distributions:
D(z ‖ f(x, y)) := Σ_l z_l log( z_l / f_l(x, y) )

30 Kullback-Leibler divergence
Kullback-Leibler divergence gives a way of comparing two probability distributions:
D(z ‖ f(x, y)) := Σ_l ( z_l log( z_l / f_l(x, y) ) − z_l + f_l(x, y) )
We generalize the divergence to any pair of non-negative vectors.

31 Kullback-Leibler divergence
Kullback-Leibler divergence gives a way of comparing two probability distributions:
D(z ‖ f(x, y)) := Σ_l ( z_l log( z_l / f_l(x, y) ) − z_l + f_l(x, y) )
We generalize the divergence to any pair of non-negative vectors.
By an approximate solution to a system, we will mean a solution which minimizes the Kullback-Leibler divergence.
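The generalized divergence between non-negative vectors is easy to state in code; this is a sketch of the formula above (with the usual convention 0 log 0 = 0), not the authors' implementation:

```python
import numpy as np

def generalized_kl(z, f):
    """Generalized Kullback-Leibler divergence D(z || f) for non-negative vectors:
    D(z || f) = Σ_l ( z_l log(z_l / f_l) - z_l + f_l ), with 0 log 0 = 0.
    It reduces to the ordinary KL divergence when both z and f sum to 1."""
    z, f = np.asarray(z, dtype=float), np.asarray(f, dtype=float)
    log_term = np.where(z > 0, z * np.log(np.where(z > 0, z, 1.0) / f), 0.0)
    return float(np.sum(log_term - z + f))
```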

32 Expectation Maximization
Want to solve:
Σ_{i,j} A^(l)_{ij} x_i y_j = o_l for l = 1, ..., k   (1)

33 Expectation Maximization
Want to solve:
Σ_{i,j} A^(l)_{ij} x_i y_j = o_l for l = 1, ..., k   (1)
Start with guesses x̃, ỹ

34 Expectation Maximization
Want to solve:
Σ_{i,j} A^(l)_{ij} x_i y_j = o_l for l = 1, ..., k   (1)
Start with guesses x̃, ỹ
Estimate the contribution of the (i, j) term on the left side of equation (1) needed to obtain equality:
( x̃_i ỹ_j A^(l)_{ij} / Σ_{i',j'} A^(l)_{i'j'} x̃_{i'} ỹ_{j'} ) · o_l =: e_{ijl}

35 Expectation Maximization
Want to solve:
Σ_{i,j} A^(l)_{ij} x_i y_j = o_l for l = 1, ..., k   (1)
Start with guesses x̃, ỹ
Estimate the contribution of the (i, j) term on the left side of equation (1) needed to obtain equality:
( x̃_i ỹ_j A^(l)_{ij} / Σ_{i',j'} A^(l)_{i'j'} x̃_{i'} ỹ_{j'} ) · o_l =: e_{ijl}
Find an approximate solution to the system:
( Σ_l A^(l)_{ij} ) x_i y_j = Σ_l e_{ijl} =: e_{ij}

36 Expectation Maximization
Want to solve:
Σ_{i,j} A^(l)_{ij} x_i y_j = o_l for l = 1, ..., k   (1)
Start with guesses x̃, ỹ
Estimate the contribution of the (i, j) term on the left side of equation (1) needed to obtain equality:
( x̃_i ỹ_j A^(l)_{ij} / Σ_{i',j'} A^(l)_{i'j'} x̃_{i'} ỹ_{j'} ) · o_l =: e_{ijl}
Find an approximate solution to the system:
( Σ_l A^(l)_{ij} ) x_i y_j = Σ_l e_{ijl} =: e_{ij}
Repeat until convergence
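In code, the E-step above amounts to spreading each observation o_l over the (i, j) terms in proportion to their current contribution; this is a sketch under the toy setup from the earlier snippets, not the authors' implementation. The inner fit of (Σ_l A^(l)_ij) x_i y_j to e_ij is done by the iterative proportional fitting sketched after slide 41.

```python
import numpy as np

def em_e_step(A, o, x_t, y_t):
    """E-step: the table e_ij = Σ_l e_ijl of estimated contributions, where
    e_ijl = ( x~_i y~_j A^(l)_ij / Σ_{i',j'} A^(l)_{i'j'} x~_{i'} y~_{j'} ) * o_l."""
    denom = np.einsum('i,lij,j->l', x_t, A, y_t)               # current value of each f_l(x~, y~)
    return np.einsum('i,j,lij,l->ij', x_t, y_t, A, o / denom)  # e_ij
```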

37 Likelihood maximization for monomial models
g : R^n × R^m → R^{nm}
(x_i), (y_j) ↦ ( A_{ij} x_i y_j )
where A_{ij} = Σ_l A^(l)_{ij}.

38 Likelihood maximization for monomial models
g : R^n × R^m → R^{nm}
(x_i), (y_j) ↦ ( A_{ij} x_i y_j )
where A_{ij} = Σ_l A^(l)_{ij}.
Moment map (taking row sums and column sums):
μ : R^{nm} → R^n × R^m
( b_{ij} ) ↦ ( (Σ_j b_{ij}), (Σ_i b_{ij}) )

39 Likelihood maximization for monomial models
g : R^n × R^m → R^{nm}
(x_i), (y_j) ↦ ( A_{ij} x_i y_j )
where A_{ij} = Σ_l A^(l)_{ij}.
Moment map (taking row sums and column sums):
μ : R^{nm} → R^n × R^m
( b_{ij} ) ↦ ( (Σ_j b_{ij}), (Σ_i b_{ij}) )
Theorem. The Kullback-Leibler divergence D(z ‖ g(x, y)) is minimized over all x and y when μ(z) equals μ(g(x, y)).
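For concreteness, the moment map is just the pair of row sums and column sums; the theorem says the KL-optimal (x, y) is exactly where μ(g(x, y)) matches μ(z), which is the fixed-point condition the IPF updates on the next slides enforce. A two-line sketch:

```python
import numpy as np

def moment_map(b):
    """μ(b) for a table b in R^{nm}: its vector of row sums and vector of column sums."""
    return b.sum(axis=1), b.sum(axis=0)
```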

40 Inverting the moment map: Iterative Proportional Fitting
[Diagram: b and g(x, y) lie in R^{nm}; the moment map μ sends them to μ(b) and μ(g(x, y)) in R^n × R^m.]

41 Inverting the moment map: Iterative Proportional Fitting
Adjust x̃_i: x̃_i ← x̃_i · (Σ_j b_{ij}) / (Σ_j A_{ij} x̃_i ỹ_j)
Adjust ỹ_j: ỹ_j ← ỹ_j · (Σ_i b_{ij}) / (Σ_i A_{ij} x̃_i ỹ_j)
Iterate until convergence
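A sketch of these IPF updates (not the authors' code); here `A_bar` plays the role of A_ij = Σ_l A^(l)_ij and `b` is the target table, e.g. the e_ij produced by the E-step sketched earlier. Alternating em_e_step and ipf_fit until convergence gives the full algorithm; if the convention x_1 + ... + x_n = 1 is wanted, rescale x to sum to 1 afterwards and absorb the factor into y.

```python
import numpy as np

def ipf_fit(A_bar, b, x_t, y_t, iters=100):
    """Iterative proportional fitting: rescale x~ and y~ until the row and column
    sums of A_bar_ij * x~_i * y~_j match those of the target table b."""
    x_t, y_t = x_t.astype(float).copy(), y_t.astype(float).copy()
    for _ in range(iters):
        fit = A_bar * np.outer(x_t, y_t)
        x_t *= b.sum(axis=1) / fit.sum(axis=1)   # x~_i <- x~_i * (Σ_j b_ij) / (Σ_j A_bar_ij x~_i y~_j)
        fit = A_bar * np.outer(x_t, y_t)
        y_t *= b.sum(axis=0) / fit.sum(axis=0)   # y~_j <- y~_j * (Σ_i b_ij) / (Σ_i A_bar_ij x~_i y~_j)
    return x_t, y_t
```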

42 Validation: Preliminary results On the left is a visual representation of the reconstructed expression levels. On the right, the expression levels for the same transcript are visualized using GFP.
