Genome duplications (polyploidy) / ancient genome duplications (paleopolyploidy) How to detect paleoploidy? e.g. a diploid cell undergoes failed meiosis, producing diploid gametes, which selffertilize to produce a tetraploid zygote.
Timing of duplication by trees (phylogenetic timing)
Phylogenetic timing of duplicates b
Paramecium genome duplications
Comparison of two scaffolds originating from a common ancestor at the recent WGD
Saccharomyces cerevisiae
Just before genome duplication
Just after genome duplication
More time after genome duplication
Unaligned view (removing gaps just like in cerev has occurred)
Saccharomyces cerevisiae
Problem reciprocal gene loss (extreme case); how to solve?
Problem reciprocal gene loss (extreme case); how to solve?
Just before genome duplication Outgroup!
Just after genome duplication Outgroup
Just after genome duplication Outgroup
More time after genome duplication Outgroup
Problem (extreme case); how to solve? Outgroup
Outgroup
Outgroup
Outgroup
Outgroup
Using other genomes Wong et al. 2002 PNAS
Centromeres
Vertebrate genome duplication
Nature. 2011 Apr 10. [Epub ahead of print] Ancestral polyploidy in seed plants and angiosperms. Jiao Y, Wickett NJ, Ayyampalayam S, Chanderbali AS, Landherr L, Ralph PE, Tomsho LP, Hu Y, Liang H, Soltis PS, Soltis DE, Clifton SW, Schlarbaum SE, Schuster SC, Ma H, Leebens-Mack J, Depamphilis CW.
Teleosts S. serevisiae and close relatives Paramecium MOSS Flowering plants Vertebrates
Reconstructed map of genome duplications allows unprecedented mapping of evolutionary history of genes in genomes
Reconstructed map of genome duplications allows unprecedented mapping of evolutionary history of genes in genomes
Major fate is gene loss
In paramecium 3 or 4 genome duplications!
Massive gene loss
Correlation to (adaptive) radiation?
Correlation to (adaptive) radiation?
Explanations for adaptive radiation? Incompatibility at essential loci
Explanations for adaptive radiation? Regulatory innovations Plants
Acknowledgements John van Dam (Bioinformatics, UU) Like Fokkens (Bioinformatics, UU) Peer Bork (EMBL, Heidelberg) Martijn Huynen (CMBI, Nijmegen) Hans Bos (UMCU) Holger Rehmann (UMCU) Fried Swartkruis (UMCU) Geert Kops (UMCU)
How we use the WGD As a controlled experiment! How do ks / ka / expression diverge after duplication and how does this correlate with epigenetic modifications
A. thaliana s history Traces of (at least) 3 whole-genome duplications The latest (α or 3R) is the most wellannotated 3821 paralogous pairs Van de Peer et al. (2009), Trends in Plant Science
Fate of duplicated genes (paralogs) Loss Depends on scale of duplication and function of the gene (Maere et al., 2005) Subfunctionalization: e.g. divergence of expression paterns between paralogs Increases with evolutionary time Depends on mode/scale of duplication and function of the gene (Casneuf et al., 2006)
Fate of duplicated genes (paralogs) Loss Depends on scale of duplication and function of the gene (Maere et al., 2005) Subfunctionalization: e.g. divergence of expression paterns between paralogs Increases with evolutionary time Depends on mode/scale of duplication and function of the gene (Casneuf et al., 2006) Blanc G, Wolfe KH (2004), The Plant Cell
Fate of duplicated genes (paralogs) Why do expression patterns of paralogs diverge at different rates? It might be epigenetics Blanc G, Wolfe KH (2004), The Plant Cell.
Histones and their modifications Array of modifications: Acetylation Phosphorylation Methylation Ubiquitination And many more... Epigenetic marks Role: regulation Zhang Y, Reinberg D (2001), Genes & Development
H3K27me3 Regulation of expression: Repressive mark Gene-specific (unlike in animals) Tissue- or developmental stagespecific genes (at least) 16% of Arabidopsis genome ChIP-chip data for seedlings Zhang X et al. (2007), PLoS Biology
RESULTS
Paralogs from the latest Arabidopsis WGD (Bowers et al., 2003) Genes of the same age Genes with the H3K27me3 mark Union of ChIP-chip experiments (Zhang et al., 2003; Turck, unpublished) Significant overlap (p<1.0e-99) Datasets
Datasets Paralogs from the latest Arabidopsis WGD (Bowers et al., 2003) Genes with the H3K27me3 mark Possible situations: Both paralogs with mark One paralog with mark No paralogs with mark gene1 gene2 gene1 gene2 gene1 gene2
Rate of sequence divergence (Ks) Ks: number of synonymous substitutions per synonymous site (Often assumed not to be under evolutionary pressure) No H3K27me3: slower rate of sequence evolution Genes with H3K27me3 are tightly regulated and have lower average expression Sequences of genes with lower expression evolve faster Two sample Kolmogorov-Smirnov test: none is significantly different (p<2.2e-16), partial-both (p = 4.6e-05)
Expression divergence MAs from a wide range of experiments (TAIR)
Expression divergence MAs from a wide range of experiments (TAIR)
Expression divergence MAs from a wide range of experiments (TAIR)
Expression divergence MAs from a wide range of experiments (TAIR) Paralogs sharing mark status (0 or 2) have higher correlation Not significantly different Partial inches towards random distribution Different from both (p = 3.863e-07) and none (p = 4.44e-16)
Expression divergence
Expression divergence: TFs Paralogous pairs both keeping the mark have more similar expression patterns p-values both-partial: 0.0014, both-none: 0.0269