Stage 1: Karyotype Stage 2: Gene content & order Step 3

Size: px
Start display at page:

Download "Stage 1: Karyotype Stage 2: Gene content & order Step 3"

Transcription

1 Supplementary Figure Method used for ancestral genome reconstruction. MRCA (Most Recent Common Ancestor), AMK (Ancestral Monocot Karyotype), AEK (Ancestral Eudicot Karyotype), AGK (Ancestral Grass Karyotype) were reconstructed according to a two-stage procedure. While the ancestral karyotype is reconstructed in the first stage, the ancestral gene content of such karyotypes, as the gene order, are inferred in the second stage. Schematic representation below of the two-steps method involving: () karyotyping (based on core-ppgs), yielding the ancestral protochromosomes, and (2) gene order enrichment (based on ppgs) defining the complete set of opgs on the protochromosomes, using CIP/CALP Blast parameters as well as DRIMM-Synteny, MGRA tools as detailed in the Methods section of the manuscript. Step Genome alignment (ppg) Blast - CIP/CALP PROTOGENES Step 2 ppg conserved in all species (core-ppg). Stage : Karyotype Stage 2: Gene content & order Step 3 Step 4 Step 5 Syntenic Blocks with ppg (DRIMM Synteny) Enrichment of ancestral karyotype with ppg Step 9 Step 6 Syntenic Blocks (SB) (DRIMM Synteny) SB filtering (<5 core-ppg) SB merging (MGRA) Ancestral chromosome number Step Step 8 Ancestral gene order (opg) CHROMOSOMES GENE ORDER ANCESTRAL GENOME Nature Genetics: doi:0.038/ng.383

2 CAR 5 (A5 grasses) CAR /5 paralogy CAR (A grasses) Ancestor Ancestor Ancestor b2 Supplementary Figure 2 Schematic representation of ancestral genome reconstruction. The ancestral genome reconstruction procedure is illustrated for AGK (Ancestral Grass Karyotype chromosome, right) deriving from the orthologous chromosomes r (rice), s3 (sorghum), b2 (Brachypodium with telomere, tel.) defining A (cf dotplots and chromosome schema entitled CAR, left top) and orthologous chromosomes r5, s9, b2 (centromere, cent.) defining A5 (cf dotplots and chromosome schema entitled CAR5, left bottom), based on ppg and core-pg defining opg (color code). The post-duplication A5 and A protochromosomes are ultimately fused into a pre-duplication AGK protochromosome based on the paralogy observed between r-r5, s3-s9, b2 tel-cent (cf dotplots entitled CAR /5 paralogy, middle left), based on ppg and core-pg defining opg (cf color code) as defined in the Methods section of the manuscript. Modern synteny/paralogy Ancestors post-wgd Ancestor pre-wgd r s3 b2 (tel.) r s3 b2 (tel.) opg r r5 r s3 b2 (tel.) s3 s9 r5 s9 b2 (cent.) T C T T C T b2 ppg core-ppg A ppg core-ppg A5 Ancestor pre-wgd AGK core-ppg ppg opg opg r r5 s3 s9 b2 (tel. / cent.) r5 s9 b2 (cent.) r5 s9 b2 (cent.) 2 Nature Genetics: doi:0.038/ng.383

3 Supplementary Figure 3 AGK (pre-ρ / post-ρ) reconstruction. AGK was reconstructed in comparing sorghum, rice and Brachypodium, following the strategy detailed in the Methods section of the manuscript and delivering a post-ρ AGK with 2 chromosomes and 424 genes and pre-ρ AGK with chromosomes and 00 genes. a. Detailed procedure for AGK reconstruction from modern genomes (top) and the identification of ppgs, core-ppgs and opgs following karyotyping and enrichment procedures for pre-ρ and post-ρ ancestors. These newly defined AGKs (pre-ρ / post-ρ) clearly refine previously reported grass ancestors based on solely 6464 ppgs by Murat et al. (Ref 25, cf manuscript), corresponding to 84 % of the 9430 ppgs defined here for the new 2-protochromosomes AGK and 6246 opgs by Murat et al. (Ref 25, cf manuscript), corresponding to 89 % of the 00 opgs defined here for the new -protochromosomes AGK. b. Dotplot representation of the comparison between AGK ancestors (n=, n=2; y-axis) and modern species (x-axis) with a five-colors code illuminating AMK chromosomes. The dotplot representation of post-ρ AGK and pre-ρ AGK against the sorghum-rice-brachypodium genomes demonstrated that the reconstructed AGKs are robust and accurate with all chromosomes from either the modern species or from the ρ-block pairs entirely covered by the inferred karyotypes (i.e. 00% modern genomes/chromosomes integrated into the post-ρ and preρ AGK protochromosomes). c. The inferred n= AGKs (pre-ρ, AGK-) defines precise and exhaustive syntenic chromosome relationships between the modern grasses as detailed in the table. Following the proposed method applied on rice, Brachypodium and sorghum, we inferred a post-ρ AGK consisting of 2 protochromosomes containing 424 ordered protogenes (opgs). Alignment of the two subgenomes generated by ρ made it possible to reconstruct a pre-ρ AGK, consisting of 00 ordered protogenes (opgs) on protochromosomes. The transition from the pre-ρ to post-ρ AGK involved seven known paralogous ancestral chromosome pairs (using the rice chromosome nomenclature): A-A5, A2-A4, A2-A6, A3-A, A3-A0, A8-A9, A-A2. a Oryza sativa v.0 2 chromosomes genes Ancestor Post-ρ Karyotyping Step I.9430 Potential protogenes (ppg) Step II. 6 ppg conserved in the three species (Core-pPG). Step III. 33 ppg In Syntenic Blocks (SB). Step IV. 93 ppg after filtering Step V. 93 protogenes (opg) in 2 Protochromosomes (SB). Brachypodium distachyon v2. 5 chromosomes genes Sorghum bicolor v.0 0 chromosomes 242 genes Ancestor Pre-ρ Karyotyping Step II. 835 ppg (duplicated) Step III. 94 ppg / 88 SB. Step IV. 55 ppg / 68 SB. Step V. 55 opg / 68 SB Protochromosomes. Enrichment Step VI ppg non core-ppg. Step VII. 522 ppg In Syntenic Blocks. Step VIII. 424 opg in 2 Protochromosomes. Enrichment Step VI. 60 opg / 68 groups Protochromosomes. Step VII. 00 opg in Protochromosomes. 3 Nature Genetics: doi:0.038/ng.383

4 Ancestor Post-ρ Ancestor Pre-ρ b Ancestor Post-ρ 2 Rice 2 Brachy 5 Sorghum 0 2 Rice 2 Brachy 5 Sorghum 0 c AGK Pre-ρ AGK Post-ρ Rice Brachypodium Sorghum AGK AGK AGK AGK AGK AGK AGK Nature Genetics: doi:0.038/ng.383

5 Supplementary Figure 4 Oil palm (pre-p / pre-τ) ancestor reconstruction. The oil palm paleohistory was reconstructed following the strategy detailed in the Methods section of the manuscript and delivering a pre-p oil palm ancestor of 0 chromosomes with 396 genes and a pre-τ oil palm (or AMK) of 5 chromosomes with 60 genes. a. Detailed procedure for AMK (pre-p / pre-τ) reconstruction from the oil palm genome, leading to the identification of ppgs, core-ppgs and opgs following the karyotyping and enrichment procedures for pre-τ and post-τ ancestors. The analysis unveiled the p duplication (p-p6, p3-p, p4-p, p2-p6, p5-p4, p2-p0, p5-p0, p3-p5, p8-p0, p2-p8, p2-p9, p-p8, p3-p0, p3-p paralogous blocks) as well as the τ duplication (p-p6/p3-p, p2-p8-p0/p2-p-p8-p9, p4-p/p2-p6, p5-p4/p2-p5-p0, p3-p-p0/p3-p5 paralogous blocks) b. Dotplot representation of the comparison between oil palm (6 chromosomes), oil palm pre-p (n=0), AEK (pre-γ n=, post-γ n=2) and AGK (pre-ρ n=, post- ρ n=2) with a five-colors code illuminating AMK chromosomes. The observed synteny and paralogy are entirely integrated in 5 CARs (diagonal color code) delivering a pre-p oil palm ancestor of 0 chromosomes (396 ordered protogenes, opgs) and a pre-τ AMK of 5 chromosomes (60 ordered protogenes, opgs). a Elaeis guinensis (20965 genes). Ancestor Pre-p Karyotyping Step I.2998 Potential proto-genes (ppg) Ancestor Pre-τ Karyotyping Step I.064 Potential proto-genes (ppg) Step II ppg Step II. 064 ppg Step III ppg In Syntenic Blocks (SB). Step III. 30 ppg In Syntenic Blocks. Step IV. 286 ppg after filtering. Step IV. 30 ppg after filtering. Step V opg in 0 Protochromosomes. Step V. 30 opg in 5 Protochromosomes. Enrichment Step VI. 24 opg (duplicated) Step VII. 396 opg in 0 ProtoChromosomes Enrichment Step VI. 30 opg (duplicated) Step VII. 60 opg in 5 Protochromosomes 5 Nature Genetics: doi:0.038/ng.383

6 AEK (post-γ) AGK (post-ρ) Oil palm pre-p Oil palm Oil palm pre-p b opg opg opg 2 0 AEK (pre-γ) Oil palm pre-p AGK (pre-ρ) 6284 opg 00 opg 2 AEK (pre-γ) AGK (pre-ρ). 6 Nature Genetics: doi:0.038/ng.383

7 Oil Palm Oil Palm Supplementary Figure 5 Oil palm (pre-p) and AMK (pre-τ) ancestors. The oil palm genome allowed the characterization of the shared ancestral τ duplication through a mixed approach including Ks and synteny inference of paralogs for the lineage-specific p duplication (see also previous Supplementary Figure 4) as well as the monocot τ duplication. a. Schematic representation of Ks distribution separating two WGDs (recent p WGD in red and ancient τ WGD in blue, top illustration) in the oil palm genome; and the corresponding gene pairs defining paralogous blocks (diagonals) on the oil palm-oil palm dotplot comparison (bottom illustration) with p duplication in red (p-p6, p3-p, p4-p, p2-p6, p5-p4, p2-p0, p5-p0, p3-p5, p8- p0, p2-p8, p2-p9, p-p8, p3-p0, p3-p paralogous blocks) and τ duplication in blue (p-p6/p3-p, p2-p8-p0/p2-p-p8-p9, p4- p/p2-p6, p5-p4/p2-p5-p0, p3-p-p0/p3-p5 paralogous blocks). b. Schematic representation of the deduced most parsimonious (reduced number of fusion and fission events) evolutionary model from a pre-τ AMK structured in 5 chromosomes (with 60 genes) deriving a n=0 post-τ AMK (396 genes) followed by a chromosomal fusion (between A-A9) to derive a n=9 intermediate. The second WGD (p) derives a n=8 (2x9) intermediate followed by 5 chromosomal fissions (breaks) and fusions to reach the modern oil palm genome (n=6), top illustration. Dotplot diagonals correspond to the p and τ WGDs from the oil palm-oil palm comparison with a colour code highlighting the five AMK (oil palm pre- τ) protochromosomes (right), and entirely covering the modern oil pam genome (with pre-τ AMK ancestor as color code at the center). a b #Paralogous pairs ks Oil Palm Oil Palm Nature Genetics: doi:0.038/ng.383

8 Supplementary Figure 6 Dotplot-based deconvolution of the AMK (pre-τ), AGK (pre-ρ) and oil palm genomes into 5 CARs. The complete dotplot-based comparative genomics deconvolution into reconstructed CARs of the observed synteny and paralogy between the oil palm (pre-p with 0 chromosomes and 396 genes), AMK (pre-τ with 5 chromosomes and 60 genes) and AGK (preρ with chromosomes and 00 genes) genomes validates the five proposed protochromosomes as the origin of monocots, i.e. AMK. AGK-AMK (right), AGK-oil palm (left) and oil palm-amk (centre) dotplot comparisons illustrate the complete deconvolution of synteny/paralogy relationships (diagonals) into 5 CARs (color code), as illuminated as a case example for AK (external red arrows) deriving orthologous/paralogous blocks on AGK (chromosomes A-4-6) and oil palm (chromosomes -3-6-). Oil pam (6 chromosomes, z-axis) and AGK ( ancestral chromosomes from rice, Brachypodium and sorghum comparison, y-axis) genomes are entirely covered with 5 independent ancestral chromosome (AMK CARs, x-axis) that do not share any orthologous or paralogous relationship. AMK AK AK2 AK3 AK4 AK5 AGK AMK Oil Palm 8 Nature Genetics: doi:0.038/ng.383

9 AMK Supplementary Figure Dotplot-based deconvolution of the AMK (pre-τ), pineapple genomes into 5 CARs. The dotplot-based comparative genomics of the pineapple (25 chromosomes and genes) genome and the pre-τ AMK (5 chromosomes and 60 genes) genome revealed the precise nature of the σ event as a hexaploidization event, with a clear -to-6 (two regions inherited from τ and three from σ) chromosomal relationships between AMK and pineapple genomes. Dotplot (top) comparison of AMK (post-τ, y-axis) and pineapple (x-axis) illustrating a -to-6 relationship (horizontal) observed between the 5 AMK protochromosomes (or CARs, diagonal color code) and pineapple chromosomes. As a case example for AK (bottom), the number of identified conserved genes (y-axis) is shown as a distribution of 6 pineapple orthologous chromosomes (x-axis). The -to-6 relationship observed between AMK and pineapple involves τ (reported as a tetraploidization) and σ, a paleohexaploidization event. The observed difference in ancestral gene retention between the triplicated regions inherited from σ in pineapple (chromosomes -2-6 and chromosomes -5-0 as case example) is in favour of a post-polyploidy subgenome dominance with chromosomes 0- as the least fractionated (LF), chromosomes 5-6 as the medium fractionated (MF) and chromosomes -2 as the most fractionated (MF2) compartments. The comparison of the pineapple genome and AMK (pre-τ, 5 protochromosomes with 60 ordered protogenes) allowed us to unveil the precise nature of the σ event as a hexaploidization with a clear -to-6 (two regions inherited from τ and three from σ) chromosome relationships between the investigated genomes, then deriving the AMK pre-σ with 9 chromosomes (from the fusion of two ancestral A-A9 post-τ chromosomes) and the AMK post-σ with 2 chromosomes. The σ hexaploidization event shows the characteristic of the subgenome dominance phenomenon following hexaploidization involving MF2 (most fractionated), MF (medium fractionated) and LF (least fractionated) compartments as reported in the case of the ancestral hexaploidization in rosids by Murat et al. (Ref. 24, cf manuscript) and Brassicacae by Murat et al. (Ref. 32, cf manuscript). 5 4 τ 3 LF MF 2 σ Pine Apple LF MF MF2 LF MF MF2 PA PA5 PA0 PA PA2 PA6 PA PA6 PA2 PA0 PA5 PA 9 Nature Genetics: doi:0.038/ng.383

10 Supplementary Figure 8 Dotplot-based deconvolution of the AMK (pre-τ and post-σ) and oil palm (pre-p) genomes into 5 CARs. The complete dotplot-based comparative genomics deconvolution of the observed synteny between the oil palm (pre-p, 0 chromosomes and 396 genes), AMK pre-τ (5 chromosomes and 60 genes), and AMK post-σ (2 chromosomes and 208 genes) genomes into five independent CARs validated the origin of monocots from the five proposed protochromosomes, the tetraploid and hexaploid nature of respectively the τ (referenced as x2) and σ (referenced as x3) polyploidization events. a. AMK (pre-τ, 5 chromosomes)/amk (post-σ, 2 chromosomes) (right), AMK (post-σ, 2 chromosomes)/oil palm (pre-p, 0 chromosomes) (left) and AMK (pre-τ, 5 chromosomes)/ oil palm (pre-p, 0 chromosomes) (centre) dotplot comparisons illustrate the complete deconvolution of synteny/paralogy relationships (diagonals) into 5 CARs (color code, bottom tight). b. AMK pre-τ, AMK post-σ and oil palm pre-p, are entirely covered with 5 independent ancestral chromosomes (AMK CARs) defining a -to-6 (τ(x2) and σ(x3)) chromosomal relationship between AMK pre-τ / AMK post-σ, a -to-6 (τ(x2) and σ(x3)) chromosomal relationship between AMK post-σ / oil palm pre-p and a -to- 2 (τ(x2)) chromosomal relationship between AMK pre-τ / oil palm pre-p as AMK post-σ / AMK post-σ and detailed in the table with the ancestral chromosomes referred to as AK X and modern chromosome as X'. a AMK post-σ x6 = τ(x2) + σ(x3) Pre-P x6 = τ(x2) + σ(x3) AMK AMK AK AK2 AK3 AK4 AK5 b AMK Pre-τ AMK Post-τ/Pre-p (x2) Oil palm (x4) AMK Post-σ (x6) AGK Pre-ρ (oil palm) (x6) Pineapple (x6) AK AK AK AK AK Nature Genetics: doi:0.038/ng.383

11 Supplementary Figure 9 AEK (pre-γ, post- γ) reconstruction. AEK was reconstructed in comparing cocoa, grape and peach, following the strategy detailed in the Methods section of the manuscript, delivering a post-γ AEK with 2 chromosomes with 9022 genes and pre-γ AEK with chromosomes with 6284 genes. a. Detailed procedure for AEK reconstruction from modern genomes (top) and the identification of ppgs, core-ppgs and opgs following the karyotyping and enrichment procedures for pre-γ and post-γ AEK ancestors. The proposed pre-γ and post-γ AEK ancestors considerably refine previous reported AEK based on solely 02 ppgs by Murat et al. (Ref. 24, cf manuscript), corresponding to 48 % of the 430 newly defined ppgs for the 2-protochromosome AEK, and 626 opgs by Murat et al. (Ref. 24, cf manuscript), corresponding to 0 % of the 6284 newly defined opgs for the -protochromosomes AEK b. Dotplot representation of the comparison between AEK ancestors (n=, n=2; y-axis) and modern species (x-axis) with a five-colors code illuminating the AMK chromosomes. The accuracy of the reconstructed post-γ AEK and pre-γ AEK is validated by the dotplot comparisons, which unveil 00% coverage of all chromosomes either from modern species or from the γ-block triplets into the reconstructed n= and n=2 AEK protochromosomes. c. The inferred n= AEKs (pre- γ, AEK-) defines exhaustive and accurate syntenic chromosome relationships between the modern eudicots as detailed in the table. The transition from the pre-γ AEK to the post-γ AEK involved seven known paralogous ancestral chromosome triplets (using the grape chromosome nomenclature): A-A4-A, A2-A5-A2-A6, A3-A4-A-A8, A4-A9-A, A5-A- A4, A6-A8-A3, A0-A2-A9. a Theobroma cacao CocoaGen DB 0 chromosomes genes Ancestor Post- γ Karyotyping Step I. 430 Potential protogenes (ppg) Prunus persica v.0 8 chromosomes 2256 genes Vitis vinifera Genoscope 2X 9 chromosomes 2364 genes Ancestor Pre- γ Karyotyping Step II. 500 ppg conserved in the three species (core-ppg). Step III. 269 ppg in Syntenic Blocks (SB). Step IV. 580 ppg after filtering. Step V. 520 protogenes (opg) in 2 Protochromosomes. Enrichment Step VI ppg non core-ppg. Step VII. 30 ppg In Syntenic Blocks. Step VIII opg in 2 Protochromosomes. Step II. 499 ppg (duplicated) Step III. 36 ppg / 42 SB. Step IV. 36 ppg / 42 SB. Step V. 352 opg / 4 SB Protochromosomes. Enrichment Step VI. 335 opg / 4 groups Protochromosomes. Step VII opg in Protochromosomes. Nature Genetics: doi:0.038/ng.383

12 Ancestor Post-γ Ancestor Pre-γ b 2 2 Ancestor Post-γ Grape 9 Peach 8 Cacao 0 c Grape 9 Peach 8 Cacao 0 AEK Pre-γ AEK Post-γ Grape Peach Cocoa AEK AEK AEK AEK AEK AEK AEK Nature Genetics: doi:0.038/ng.383

13 Supplementary Figure 0 Dotplot-based deconvolution of the AMK, AEK and Amborella genomes into 5 CARs. In order to identify the most parsimonious structure of the MRCA for eudicots/monocots from the comparison of AEK and AMK, an outgroup species is required. Amborella trichopoda is a basal angiosperm with no specific WGD event making it the most interesting candidate in reconstructing eudicots/monocots MRCA. Amborella genome contains 3 chromosomes for an estimated size of 80 Mb. The available genome is assembled into 545 scaffolds (06 Mb) and annotated genes. 38 scaffolds are assigned by FISH to 2 over the 3 chromosomes, representing 34% of the genome (i.e annotated genes, 204 Mb). The dotplot-based comparative genomics of the reconstructed AEK and AMK unveiled a synteny signal between these two principal families of flowering plants, in the form of 5 CARs recovered from 5 orthologs that appeared to display a higher degree of structural conservation (with one-to-one chromosome relationships on average, i.e.3) between the AMK and the Amborella genome, than between the AEK and Amborella (with one-to-two chromosome relationships on average, i.e.9). The complete deconvolution of the observed syntenies between the Amborella, AMK (pre-τ) and AEK (pre- γ) genomes into independent CARs ultimately validated the existence of a angiosperm progenitor with a minimum of fifteen proposed protochromosomes. a. AMK (pre-τ, 5 chromosomes)/aek (pre-γ, chromosomes) (right), AMK (pre-τ, 5 chromosomes)/amborella (3 chromosomes) (left) and AEK (pre-γ, chromosomes)/amborella (3 chromosomes) (centre) dotplot comparisons illustrate the complete deconvolution of synteny/paralogy relationships (diagonals, color code from the 5 AMK CARs, top) into 5 CARs (rectangles). 3-D dotplot comparison is provided at the centre with the associated 2-D dotplots provided for details. b. chromosome-to-chromosome synteny relationships (table lines corresponding to colored arrows on the panel A) illustrate a.3 ratio between Amborella and AMK chromosomes and a.9 ratio between Amborella and AEK chromosomes. c. Illustration of the close structural relationship at the chromosome level between Amborella and AMK compared to Amborella and AEK with Amborella chromosome 4, structurally conserved with AMK chromosome 4, whereas two orthologous regions are identified in AEK chromosomes 5 (CAR number 4 on the panel A) and 6 (CAR number 8 on the panel A). d. Illustration of the synteny relationships (5 CARs from the panel A and detailed in the table from panel B) as colored blocks (reflecting the 5 AMK CARs) between Amborella AEK and AMK, suggesting AMK as closest representative of the angiosperm MRCA of 5 CARs. The comparison of the reconstructed AEK (pre-γ with chromosomes and 6284 protogenes) and AMK (pre-τ with 5 chromosomes and 60 protogenes) deliver a syntenic signal between these two main families of flowering plants consisting in 5 CARs recovered from 5 orthologs. a AMK 5 AK AK2 AK3 AK4 AK AEK 3 0 Amborella trichopoda Nature Genetics: doi:0.038/ng.383

14 B b CARs AEK- AMK AEK chr (n=) AMK chr (n=5) Amborella trichopoda chr (n=3) Nb AEK regions Nb AMK regions NA NA NA NA c C AMK chr4 AEK chr5 Amborella chr4 chr6 CAR 4 CAR 8 D Average.9.3 Angiosperms d Angiosperms Basal angiosperms Eudicots 5 MRCA E-M Monocots Basal angiosperms Eudicots 5 MRCA Monocots x Amborella trichopoda AEK AMK x Amborella trichopoda AEK AMK 4 Nature Genetics: doi:0.038/ng.383

15 Supplementary Figure Gene Ontology enrichment comparing MRCA and extant genomes. We compared 3 plant species (3 monocots, 20 eudicots, one basal angiosperm (Amborella trichopoda), and three outgroup species (Picea abies as a representative of gymnosperms, Physcomitrella patens as a representative of mosses and Chlamydomonas reinhardtii as a representative of single-celled green algae)) to identify orthologous gene clusters representative of the angiosperm MRCA gene pool with 0263 gene groups specific to flowering plants (absent from the outgroup species). We performed a GO (Gene Ontology) enrichment analysis at the Biological process, Cellular component and Molecular function levels. Enriched GO terms (at the Biological process, Cellular component and Molecular function levels) in comparing 0263 angiosperm-specific gene clusters to the MRCA gene pool (a), in comparing the MRCA gene pool to that of modern species, considering Arabidopisis thaliana as a reference with the most robust GO terms and classification (b), and in comparing the modern genomes to the MRCA gene pool (c). GO analysis from the panel a delivers enriched terms in angiosperms compared to outgroups. GO analysis from the panel b delivers enriched terms in MRCA compared to extant species, i.e. basic functions or processes already present in the ancestor. Finally, GO analysis from panel c delivers enriched terms in extant species compared to MRCA, i.e. functions amplified during evolution. a 0263 angiosperm-specific genes compared to MRCA genes Biological process Cellular component Molecular function b MRCA genes compared to modern genomes (Arabidopsis) Biological process Cellular component Molecular function 5 Nature Genetics: doi:0.038/ng.383

16 c Modern genomes (Arabidopsis) compared to MRCA genes Biological process Cellular component Molecular function 6 Nature Genetics: doi:0.038/ng.383

17 Supplementary Figure 2 Genomic evolutionary plasticity of the AEK and AGK genomes. Comparisons of the genomes of the MRCA and extant modern species provided insight into the structural plasticity of angiosperm genomes and diversification in the response to polyploidy where: () post-duplication chromosomes have similar gene distributions, from the gene-rich telomeric regions to the gene-poor centromeric regions; (2) lineage-specific genes (i.e. not mapped onto the ancestral karyotypes and referred to as non-ak genes ) are preferentially located in pericentromeric and subtelomeric regions; (3) the ancestral gene pool is partitioned between paralogous blocks, forming the MF (most fractionated (MF) also described as sensitive (S)) and LF (least fractionated (LF) also described as dominant (D)) chromosomal compartments; (4) gene pairs evolved differentially between rapidly evolving and slowly evolving species (such as rice for the monocots and grape for the dicots); (5) ancestral genes tend to have larger numbers of exons than non-ancestral genes; (6) genes conserved in grasses have a higher GC content than those conserved in eudicots. Detailed analysis of the genomic features (gene number, AK genes, Ks, exon numbers, GC content) are reported in columns, for the monocot and eudicot species investigated (in rows). Statistically significant differences (t-test p<0.05) are indicated by red stars for each genomic feature tested (columns). Avg = average, SD = standard deviation, % = percentage, spe=specific, LF=least fractionated, MF=most fractionated. Clade Monocots Eudicots Species Partitionning Genes AK genes Ks (median) Exons number (avg) % GC content (avg) Rice LF ,4 5,8 5 9, MF ,8 5 9,8 Spe , Brachy LF , MF Spe 964 4, 56 9 Sorghum LF ,3 5,8 5 9,6 MF , 5 9, Spe ,5 5 0 Grape LF ,32,4 46 3,3 MF 2086,2 46 3,4 MF2 52,5 46 3, Spe 92 5, Cacao LF , MF 854 6, MF2 326, Spe ,3 4 5,5 Peach LF ,65 6, 46 3, MF 8 6,6 46 3, MF Spe ,4 45 4,8 % GC content (SD) P-value <0,05 Nature Genetics: doi:0.038/ng.383

18 Supplementary Figure 3 Dating the Angiosperm origin. 286 orthologous gene clusters, from the 5 angiosperm MRCA genes, containing single-gene copies from peach, grape, cocoa, Brachypodium, rice and sorghum were used to assess the age of angiosperm origin (considered here as the monocot/eudicot speciation date) using BEAUTi, BEAST and visualized in FigTree and DensiTree (see Methods section of the main manuscript) with a minimum age calibration from fossils of 65 mya for grasses and 25 mya for the eudicots. We inferred that angiosperms originated 24 (between ) mya. The figure illustrates the maximum clade credibility tree from divergence time estimates of angiosperms based on 286 ancestral genes conserved in peach, grape, cocoa, Brachypodium, rice and sorghum. The branch length illustrates the average substitutions per sites according to the time scale (in million years, my) associated with major geological epochs (bottom). The 95% highest posterior density (HPD) estimates for each well-supported speciation events are represented by horizontal blue bars and with age intervals mentioned in brackets. Red asterisks represent minimum age fossil calibrations with 65 my for grasses and 25 my for the eudicots. Dashed horizontal black lines represent age estimates of the angiosperm origin from the literature (references cited) and timetree ( Sorghum Smith et al. (200) [82-25] Seng et al. (204) [ ] Grasses [65-8] * 3 68 [60-5] Rice Angiosperms [90-238] 24 Brachy Magalloon and Sanderson (2005) [6-3] Timetree [68-94] Eudicots [8-09] 98 * 3 [64-8] Grape Peach Cocoa Million years CARBON. PERMIAN TRIASSIC JURASSIC CRETACEOUS TERTIARY QUAT Paleozoic Mesozoic Cenozoic 8 Nature Genetics: doi:0.038/ng.383

19 Supplementary Table 5 Synteny relationships between AEK, AGK, AMK based on MRCA. Chromosomal conservation (rows) between MRCA (CARs), AMK (AK-AK5), AEK (n= pre-γ and n=2 post-γ), AGK (n= pre-ρ and n=2 post-ρ) and the modern monocot (rice as representative, with chromosomes to 2) and eudicot (grape as representative, with chromosomes to 9) species (columns). MRCA Pre-γ (AEK n=) Post-γ (AEK n=2) Grape (reference) AMK (n=5) Pre-ρ (AGK n=) Post-ρ (AGK n=2) Rice (reference) CAR 4 Ch Ch-2-3 Ch2-5-6 Ch Ch-5 Ch-5 CAR 2 Ch2 Ch4-5-6 Ch4-9- AK Ch4 Ch2-6 Ch2-6 CAR Ch6 Ch6--8 Ch-4- Ch6 Ch8-9 Ch8-9 CAR 2 Ch Ch Ch0-2-9 Ch3 Ch3- Ch3- CAR 3 Ch Ch-2-3 Ch2-5-6 Ch2 Ch3-0 Ch3-0 CAR 0 Ch3 Ch-8-9 Ch5--4 AK2 CAR Ch4 Ch0--2 Ch6-8-3 Ch5 Ch2-4 Ch2-4 CAR 6 Ch5 Ch3-4-5 Ch Ch Ch-5 Ch-5 CAR 3 Ch3 Ch-8-9 Ch5--4 AK3 Ch3 Ch3- Ch3- CAR Ch4 Ch0--2 Ch6-8-3 Ch Ch-2 Ch3- Ch2 Ch3-0 Ch3-0 CAR 5 Ch2 Ch4-5-6 Ch4-9- CAR 4 Ch5 Ch3-4-5 Ch CAR 8 Ch6 Ch6--8 Ch-4- Ch3 Ch3- Ch3- AK4 Ch4 Ch2-6 Ch2-6 Ch5 Ch2-4 Ch2-4 Ch6 Ch8-9 Ch8-9 CAR 5 Ch2 Ch4-5-6 Ch4-9- CAR 9 Ch6 Ch6--8 Ch-4- AK5 Ch Ch-2 Ch-2 Ch4 Ch2-6 Ch2-6 Ch5 Ch2-4 Ch2-4 Ch6 Ch8-9 Ch8-9 Ch Ch-2 Ch-2 9 Nature Genetics: doi:0.038/ng.383

Paleo-evolutionary plasticity of plant disease resistance genes

Paleo-evolutionary plasticity of plant disease resistance genes Zhang et al. BMC Genomics 2014, 15:187 RESEARCH ARTICLE Paleo-evolutionary plasticity of plant disease resistance genes Rongzhi Zhang 1,2, Florent Murat 1, Caroline Pont 1, Thierry Langin 1 and Jerome

More information

Potato Genome Analysis

Potato Genome Analysis Potato Genome Analysis Xin Liu Deputy director BGI research 2016.1.21 WCRTC 2016 @ Nanning Reference genome construction???????????????????????????????????????? Sequencing HELL RIEND WELCOME BGI ZHEN LLOFRI

More information

Understanding Brassicaceae evolution through ancestral genome reconstruction

Understanding Brassicaceae evolution through ancestral genome reconstruction Murat et al. Genome Biology (2015) 16:262 DOI 10.1186/s13059-015-0814-y RESEARCH Open Access Understanding Brassicaceae evolution through ancestral genome reconstruction Florent Murat 1, Alexandra Louis

More information

Big Questions. Is polyploidy an evolutionary dead-end? If so, why are all plants the products of multiple polyploidization events?

Big Questions. Is polyploidy an evolutionary dead-end? If so, why are all plants the products of multiple polyploidization events? Plant of the Day Cyperus esculentus - Cyperaceae Chufa (tigernut) 8,000 kg/ha, 720 kcal/sq m per month Top Crop for kcal productivity! One of the world s worst weeds Big Questions Is polyploidy an evolutionary

More information

Evolution by duplication: paleopolyploidy events in plants reconstructed by deciphering the evolutionary history of VOZ transcription factors

Evolution by duplication: paleopolyploidy events in plants reconstructed by deciphering the evolutionary history of VOZ transcription factors Gao et al. BMC Plant Biology (2018) 18:256 https://doi.org/10.1186/s12870-018-1437-8 RESEARCH ARTICLE Evolution by duplication: paleopolyploidy events in plants reconstructed by deciphering the evolutionary

More information

Whole-genome duplications and paleopolyploidy

Whole-genome duplications and paleopolyploidy Whole-genome duplications and paleopolyploidy Whole-genome duplications in protozoa Aury et al. (2006) analyzed the unicellular eukaryote Paramecium tetraurelia most of 40,000 genes arose through at least

More information

Phylogenetic relationship among S. castellii, S. cerevisiae and C. glabrata.

Phylogenetic relationship among S. castellii, S. cerevisiae and C. glabrata. Supplementary Note S2 Phylogenetic relationship among S. castellii, S. cerevisiae and C. glabrata. Phylogenetic trees reconstructed by a variety of methods from either single-copy orthologous loci (Class

More information

Bioinformatics tools to analyze complex genomes. Yves Van de Peer Ghent University/VIB

Bioinformatics tools to analyze complex genomes. Yves Van de Peer Ghent University/VIB Bioinformatics tools to analyze complex genomes Yves Van de Peer Ghent University/VIB Detecting colinearity and large-scale gene duplications A 1 2 3 4 5 6 7 8 9 10 11 Speciation/Duplicatio n S1 S2 1

More information

Lineage specific conserved noncoding sequences in plants

Lineage specific conserved noncoding sequences in plants Lineage specific conserved noncoding sequences in plants Nilmini Hettiarachchi Department of Genetics, SOKENDAI National Institute of Genetics, Mishima, Japan 20 th June 2014 Conserved Noncoding Sequences

More information

Chapter 26: Phylogeny and the Tree of Life Phylogenies Show Evolutionary Relationships

Chapter 26: Phylogeny and the Tree of Life Phylogenies Show Evolutionary Relationships Chapter 26: Phylogeny and the Tree of Life You Must Know The taxonomic categories and how they indicate relatedness. How systematics is used to develop phylogenetic trees. How to construct a phylogenetic

More information

Supporting Information

Supporting Information SI Material and Methods Nucleic acid sequence alignments. Supporting Information Three new parameters were recently defined in Salse et al. (1) to increase the stringency and significance of BLAST sequence

More information

Supplementary Material

Supplementary Material Supplementary Material Supplementary Table S1. Genomes available in build 47 Supplementary Table S2. Counts of putative contiguous gene split models in 39 plant reference genomes in build 47 Supplementary

More information

Chapter 27: Evolutionary Genetics

Chapter 27: Evolutionary Genetics Chapter 27: Evolutionary Genetics Student Learning Objectives Upon completion of this chapter you should be able to: 1. Understand what the term species means to biology. 2. Recognize the various patterns

More information

Fractionation resistance of duplicate genes following whole genome duplication in plants as a function of gene ontology category and expression level

Fractionation resistance of duplicate genes following whole genome duplication in plants as a function of gene ontology category and expression level Fractionation resistance of duplicate genes following whole genome duplication in plants as a function of gene ontology category and expression level Eric Chun-Hung Chen Thesis submitted to the Faculty

More information

Plant Genome Sequencing

Plant Genome Sequencing Plant Genome Sequencing Traditional Sanger Sequencing Genome Sequencing Approach 1. Create sequencing libraries of different insert sizes 2kb o Bulk of sequencing is performed on these libraries 10kb o

More information

8/23/2014. Phylogeny and the Tree of Life

8/23/2014. Phylogeny and the Tree of Life Phylogeny and the Tree of Life Chapter 26 Objectives Explain the following characteristics of the Linnaean system of classification: a. binomial nomenclature b. hierarchical classification List the major

More information

Ancestral grass karyotype reconstruction unravels new mechanisms of genome shuffling as a source of plant evolution

Ancestral grass karyotype reconstruction unravels new mechanisms of genome shuffling as a source of plant evolution Research Ancestral grass karyotype reconstruction unravels new mechanisms of genome shuffling as a source of plant evolution Florent Murat, 1,4 Jian-Hong Xu, 2,4 Eric Tannier, 3,4 Michael Abrouk, 1 Nicolas

More information

Comparative Genomics II

Comparative Genomics II Comparative Genomics II Advances in Bioinformatics and Genomics GEN 240B Jason Stajich May 19 Comparative Genomics II Slide 1/31 Outline Introduction Gene Families Pairwise Methods Phylogenetic Methods

More information

PaleoGenomics & Evolution (group PaleoEVO). UMR1095 INRA UBP Génétique, Diversité & Ecophysiologie des Céréales

PaleoGenomics & Evolution (group PaleoEVO). UMR1095 INRA UBP Génétique, Diversité & Ecophysiologie des Céréales TranscriptomePartitioning Following Polyploidization Events in Grasses Journée Transcriptome URGV 1 Mai 13 Plant and Animal Genomes SPECIES COMMON NAME CHROMOSOMES GENOME (MB) ANNOTATED GENES SYNTENY DUPLICATION

More information

Bio 1B Lecture Outline (please print and bring along) Fall, 2007

Bio 1B Lecture Outline (please print and bring along) Fall, 2007 Bio 1B Lecture Outline (please print and bring along) Fall, 2007 B.D. Mishler, Dept. of Integrative Biology 2-6810, bmishler@berkeley.edu Evolution lecture #5 -- Molecular genetics and molecular evolution

More information

New insights into the evolution and functional divergence of the SWEET family in Saccharum based on comparative genomics

New insights into the evolution and functional divergence of the SWEET family in Saccharum based on comparative genomics Hu et al. BMC Plant Biology (2018) 18:270 https://doi.org/10.1186/s12870-018-1495-y RESEARCH ARTICLE Open Access New insights into the evolution and functional divergence of the SWEET family in Saccharum

More information

Supplemental Figure 1. Comparisons of GC3 distribution computed with raw EST data, bi-beta fits and complete genome sequences for 6 species.

Supplemental Figure 1. Comparisons of GC3 distribution computed with raw EST data, bi-beta fits and complete genome sequences for 6 species. Supplemental Figure 1. Comparisons of GC3 distribution computed with raw EST data, bi-beta fits and complete genome sequences for 6 species. Filled distributions: GC3 computed with raw EST data. Dashed

More information

UoN, CAS, DBSC BIOL102 lecture notes by: Dr. Mustafa A. Mansi. The Phylogenetic Systematics (Phylogeny and Systematics)

UoN, CAS, DBSC BIOL102 lecture notes by: Dr. Mustafa A. Mansi. The Phylogenetic Systematics (Phylogeny and Systematics) - Phylogeny? - Systematics? The Phylogenetic Systematics (Phylogeny and Systematics) - Phylogenetic systematics? Connection between phylogeny and classification. - Phylogenetic systematics informs the

More information

Identification and Characterization of Shared Duplications between Rice and Wheat Provide New Insight into Grass Genome Evolution W

Identification and Characterization of Shared Duplications between Rice and Wheat Provide New Insight into Grass Genome Evolution W The Plant Cell, Vol. 20: 11 24, January 2008, www.plantcell.org ª 2008 American Society of Plant Biologists RESEARCH ARTICLES Identification and Characterization of Shared Duplications between Rice and

More information

Annotation and Nomenclature: A Zebrafish Example. Ingo Braasch, Julian Catchen and John Postlethwait

Annotation and Nomenclature: A Zebrafish Example. Ingo Braasch, Julian Catchen and John Postlethwait Annotation and Nomenclature: A Zebrafish Example Ingo Braasch, Julian Catchen and John Postlethwait Annotation and Nomenclature: An Example: Zebrafish The goal Solutions Annotation and Nomenclature: An

More information

molecular evolution and phylogenetics

molecular evolution and phylogenetics molecular evolution and phylogenetics Charlotte Darby Computational Genomics: Applied Comparative Genomics 2.13.18 https://www.thinglink.com/scene/762084640000311296 Internal node Root TIME Branch Leaves

More information

Beaming in your answers

Beaming in your answers Bio 112 Handout for Evolution 8 This handout contains: Today s iclicker Questions Handouts for today s lecture iclicker Question #5A - before lecture Consider the hypothetical creatures on the last page

More information

Supplemental Data. Vanneste et al. (2015). Plant Cell /tpc

Supplemental Data. Vanneste et al. (2015). Plant Cell /tpc Supplemental Figure 1: Representation of the structure of an orthogroup. Each orthogroup consists out of a paralogous pair representing the WGD (denoted by Equisetum 1 and Equisetum 2) supplemented with

More information

Cladistics and Bioinformatics Questions 2013

Cladistics and Bioinformatics Questions 2013 AP Biology Name Cladistics and Bioinformatics Questions 2013 1. The following table shows the percentage similarity in sequences of nucleotides from a homologous gene derived from five different species

More information

Bio94 Discussion Activity week 3: Chapter 27 Phylogenies and the History of Life

Bio94 Discussion Activity week 3: Chapter 27 Phylogenies and the History of Life Bio94 Discussion Activity week 3: Chapter 27 Phylogenies and the History of Life 1. Constructing a phylogenetic tree using a cladistic approach Construct a phylogenetic tree using the following table:

More information

Comparative genomics: Overview & Tools + MUMmer algorithm

Comparative genomics: Overview & Tools + MUMmer algorithm Comparative genomics: Overview & Tools + MUMmer algorithm Urmila Kulkarni-Kale Bioinformatics Centre University of Pune, Pune 411 007. urmila@bioinfo.ernet.in Genome sequence: Fact file 1995: The first

More information

Impact of recurrent gene duplication on adaptation of plant genomes

Impact of recurrent gene duplication on adaptation of plant genomes Fischer et al. BMC Plant Biology 2014, 14:151 RESEARCH ARTICLE Open Access Impact of recurrent gene duplication on adaptation of plant genomes Iris Fischer 1,2*, Jacques Dainat 3,6, Vincent Ranwez 3, Sylvain

More information

Fossils Biology 2 Thursday, January 31, 2013

Fossils Biology 2 Thursday, January 31, 2013 Fossils Biology 2 Evolution Change in the genetic composition of a group of organisms over time. Causes: Natural Selection Artificial Selection Genetic Engineering Genetic Drift Hybridization Mutation

More information

Anatomy of a tree. clade is group of organisms with a shared ancestor. a monophyletic group shares a single common ancestor = tapirs-rhinos-horses

Anatomy of a tree. clade is group of organisms with a shared ancestor. a monophyletic group shares a single common ancestor = tapirs-rhinos-horses Anatomy of a tree outgroup: an early branching relative of the interest groups sister taxa: taxa derived from the same recent ancestor polytomy: >2 taxa emerge from a node Anatomy of a tree clade is group

More information

How to detect paleoploidy?

How to detect paleoploidy? Genome duplications (polyploidy) / ancient genome duplications (paleopolyploidy) How to detect paleoploidy? e.g. a diploid cell undergoes failed meiosis, producing diploid gametes, which selffertilize

More information

A PARSIMONY APPROACH TO ANALYSIS OF HUMAN SEGMENTAL DUPLICATIONS

A PARSIMONY APPROACH TO ANALYSIS OF HUMAN SEGMENTAL DUPLICATIONS A PARSIMONY APPROACH TO ANALYSIS OF HUMAN SEGMENTAL DUPLICATIONS CRYSTAL L. KAHN and BENJAMIN J. RAPHAEL Box 1910, Brown University Department of Computer Science & Center for Computational Molecular Biology

More information

Phylogeny and the Tree of Life

Phylogeny and the Tree of Life LECTURE PRESENTATIONS For CAMPBELL BIOLOGY, NINTH EDITION Jane B. Reece, Lisa A. Urry, Michael L. Cain, Steven A. Wasserman, Peter V. Minorsky, Robert B. Jackson Chapter 26 Phylogeny and the Tree of Life

More information

C3020 Molecular Evolution. Exercises #3: Phylogenetics

C3020 Molecular Evolution. Exercises #3: Phylogenetics C3020 Molecular Evolution Exercises #3: Phylogenetics Consider the following sequences for five taxa 1-5 and the known outgroup O, which has the ancestral states (note that sequence 3 has changed from

More information

Orthology Part I: concepts and implications Toni Gabaldón Centre for Genomic Regulation (CRG), Barcelona

Orthology Part I: concepts and implications Toni Gabaldón Centre for Genomic Regulation (CRG), Barcelona Orthology Part I: concepts and implications Toni Gabaldón Centre for Genomic Regulation (CRG), Barcelona (tgabaldon@crg.es) http://gabaldonlab.crg.es Homology the same organ in different animals under

More information

Lecture 7 Mutation and genetic variation

Lecture 7 Mutation and genetic variation Lecture 7 Mutation and genetic variation Thymidine dimer Natural selection at a single locus 2. Purifying selection a form of selection acting to eliminate harmful (deleterious) alleles from natural populations.

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION doi:10.1038/nature23897 Supplementary Notes 1 Evolution of gene families 1.1 Evolution of gene family sizes We determined the expansion and contraction of orthologous gene families using CAFÉ 2.2 1. One

More information

CHAPTERS 24-25: Evidence for Evolution and Phylogeny

CHAPTERS 24-25: Evidence for Evolution and Phylogeny CHAPTERS 24-25: Evidence for Evolution and Phylogeny 1. For each of the following, indicate how it is used as evidence of evolution by natural selection or shown as an evolutionary trend: a. Paleontology

More information

BMI/CS 776 Lecture #20 Alignment of whole genomes. Colin Dewey (with slides adapted from those by Mark Craven)

BMI/CS 776 Lecture #20 Alignment of whole genomes. Colin Dewey (with slides adapted from those by Mark Craven) BMI/CS 776 Lecture #20 Alignment of whole genomes Colin Dewey (with slides adapted from those by Mark Craven) 2007.03.29 1 Multiple whole genome alignment Input set of whole genome sequences genomes diverged

More information

Homology and Information Gathering and Domain Annotation for Proteins

Homology and Information Gathering and Domain Annotation for Proteins Homology and Information Gathering and Domain Annotation for Proteins Outline Homology Information Gathering for Proteins Domain Annotation for Proteins Examples and exercises The concept of homology The

More information

Supporting Information

Supporting Information Supporting Information Fawcett et al. 10.1073/pnas.0900906106 SI Text Estimating the Age of Gene Duplication Events: The Use of K S Values. One of the most common methods used to study and visualize gene

More information

A Methodological Framework for the Reconstruction of Contiguous Regions of Ancestral Genomes and Its Application to Mammalian Genomes

A Methodological Framework for the Reconstruction of Contiguous Regions of Ancestral Genomes and Its Application to Mammalian Genomes A Methodological Framework for the Reconstruction of Contiguous Regions of Ancestral Genomes and Its Application to Mammalian Genomes Cedric Chauve 1, Eric Tannier 2,3,4,5 * 1 Department of Mathematics,

More information

Supplementary Information for: The genome of the extremophile crucifer Thellungiella parvula

Supplementary Information for: The genome of the extremophile crucifer Thellungiella parvula Supplementary Information for: The genome of the extremophile crucifer Thellungiella parvula Maheshi Dassanayake 1,9, Dong-Ha Oh 1,9, Jeffrey S. Haas 1,2, Alvaro Hernandez 3, Hyewon Hong 1,4, Shahjahan

More information

Phylogeny and the Tree of Life

Phylogeny and the Tree of Life Chapter 26 Phylogeny and the Tree of Life PowerPoint Lecture Presentations for Biology Eighth Edition Neil Campbell and Jane Reece Lectures by Chris Romero, updated by Erin Barley with contributions from

More information

Comparative genomics of gene families in relation with metabolic pathways for gene candidates highlighting

Comparative genomics of gene families in relation with metabolic pathways for gene candidates highlighting Comparative genomics of gene families in relation with metabolic pathways for gene candidates highlighting Delphine Larivière & David Couvin Under the supervision of Dominique This, Jean-François Dufayard

More information

Using phylogenetics to estimate species divergence times... Basics and basic issues for Bayesian inference of divergence times (plus some digression)

Using phylogenetics to estimate species divergence times... Basics and basic issues for Bayesian inference of divergence times (plus some digression) Using phylogenetics to estimate species divergence times... More accurately... Basics and basic issues for Bayesian inference of divergence times (plus some digression) "A comparison of the structures

More information

Supplementary Figure 1. Number of CC- and TIR- type NBS- LRR genes and presence of mir482/2118 on sequenced plant genomes.

Supplementary Figure 1. Number of CC- and TIR- type NBS- LRR genes and presence of mir482/2118 on sequenced plant genomes. Number of CC- NBS and CC- NBS- LRR R- genes Number of TIR- NBS and TIR- NBS- LRR R- genes 0 50 100 150 200 250 0 50 100 150 200 250 300 350 400 450 mir482 and mir2118 Cajanus cajan Glycine max Hevea brasiliensis

More information

3. SEQUENCE ANALYSIS BIOINFORMATICS COURSE MTAT

3. SEQUENCE ANALYSIS BIOINFORMATICS COURSE MTAT 3. SEQUENCE ANALYSIS BIOINFORMATICS COURSE MTAT.03.239 25.09.2012 SEQUENCE ANALYSIS IS IMPORTANT FOR... Prediction of function Gene finding the process of identifying the regions of genomic DNA that encode

More information

P OLYPLOIDY AND ANGIOSPERM DIVERSIFICATION 1

P OLYPLOIDY AND ANGIOSPERM DIVERSIFICATION 1 American Journal of Botany 96(1): 336 348. 2009. P OLYPLOIDY AND ANGIOSPERM DIVERSIFICATION 1 Douglas E. Soltis,2,11 Victor A. Albert, 3 Jim Leebens-Mack,4 Charles D. Bell, 5 Andrew H. Paterson, 4,6 Chunfang

More information

The Origin of Species

The Origin of Species The Origin of Species Introduction A species can be defined as a group of organisms whose members can breed and produce fertile offspring, but who do not produce fertile offspring with members of other

More information

LAB 4: PHYLOGENIES & MAPPING TRAITS

LAB 4: PHYLOGENIES & MAPPING TRAITS LAB 4: PHYLOGENIES & MAPPING TRAITS *This is a good day to check your Physcomitrella (protonema, buds, gametophores?) and Ceratopteris cultures (embryos, young sporophytes?)* Phylogeny Introduction The

More information

Computational analyses of ancient polyploidy

Computational analyses of ancient polyploidy Computational analyses of ancient polyploidy Kevin P. Byrne 1 and Guillaume Blanc 2* 1 Department of Genetics, Smurfit Institute, University of Dublin, Trinity College, Dublin 2, Ireland. 2 Laboratoire

More information

Computational approaches for functional genomics

Computational approaches for functional genomics Computational approaches for functional genomics Kalin Vetsigian October 31, 2001 The rapidly increasing number of completely sequenced genomes have stimulated the development of new methods for finding

More information

Case Study. Who s the daddy? TEACHER S GUIDE. James Clarkson. Dean Madden [Ed.] Polyploidy in plant evolution. Version 1.1. Royal Botanic Gardens, Kew

Case Study. Who s the daddy? TEACHER S GUIDE. James Clarkson. Dean Madden [Ed.] Polyploidy in plant evolution. Version 1.1. Royal Botanic Gardens, Kew TEACHER S GUIDE Case Study Who s the daddy? Polyploidy in plant evolution James Clarkson Royal Botanic Gardens, Kew Dean Madden [Ed.] NCBE, University of Reading Version 1.1 Polypoidy in plant evolution

More information

Phylogeny and systematics. Why are these disciplines important in evolutionary biology and how are they related to each other?

Phylogeny and systematics. Why are these disciplines important in evolutionary biology and how are they related to each other? Phylogeny and systematics Why are these disciplines important in evolutionary biology and how are they related to each other? Phylogeny and systematics Phylogeny: the evolutionary history of a species

More information

Computational methods for predicting protein-protein interactions

Computational methods for predicting protein-protein interactions Computational methods for predicting protein-protein interactions Tomi Peltola T-61.6070 Special course in bioinformatics I 3.4.2008 Outline Biological background Protein-protein interactions Computational

More information

Evolution by duplication

Evolution by duplication 6.095/6.895 - Computational Biology: Genomes, Networks, Evolution Lecture 18 Nov 10, 2005 Evolution by duplication Somewhere, something went wrong Challenges in Computational Biology 4 Genome Assembly

More information

(Paleo)Polyploidy When Things Get Bigger

(Paleo)Polyploidy When Things Get Bigger (Paleo)Polyploidy When Things Get Bigger Hegarty and Hiscock 2008, Current Biology 18 Whole-genome duplications AUTOPOLYPLOIDY ALLOPOLYPLOIDY Examples of allopolyploid speciation Hegarty and Hiscock 2008,

More information

Ancient polyploidization predating divergence of the cereals, and its consequences for comparative genomics

Ancient polyploidization predating divergence of the cereals, and its consequences for comparative genomics Ancient polyploidization predating divergence of the cereals, and its consequences for comparative genomics A. H. Paterson*, J. E. Bowers, and B. A. Chapman Plant Genome Mapping Laboratory, University

More information

Bio 2 Plant and Animal Biology

Bio 2 Plant and Animal Biology Bio 2 Plant and Animal Biology Evolution Evolution as the explanation for life s unity and diversity Darwinian Revolution Two main Points Descent with Modification Natural Selection Biological Species

More information

Phylogeny and the Tree of Life

Phylogeny and the Tree of Life Chapter 26 Phylogeny and the Tree of Life PowerPoint Lecture Presentations for Biology Eighth Edition Neil Campbell and Jane Reece Lectures by Chris Romero, updated by Erin Barley with contributions from

More information

BINF6201/8201. Molecular phylogenetic methods

BINF6201/8201. Molecular phylogenetic methods BINF60/80 Molecular phylogenetic methods 0-7-06 Phylogenetics Ø According to the evolutionary theory, all life forms on this planet are related to one another by descent. Ø Traditionally, phylogenetics

More information

Origin and diversification of leucine-rich repeat receptor-like protein kinase (LRR-RLK) genes in plants

Origin and diversification of leucine-rich repeat receptor-like protein kinase (LRR-RLK) genes in plants Liu et al. BMC Evolutionary Biology (2017) 17:47 DOI 10.1186/s12862-017-0891-5 RESEARCH ARTICLE Origin and diversification of leucine-rich repeat receptor-like protein kinase (LRR-RLK) genes in plants

More information

Figure S1: Mitochondrial gene map for Pythium ultimum BR144. Arrows indicate transcriptional orientation, clockwise for the outer row and

Figure S1: Mitochondrial gene map for Pythium ultimum BR144. Arrows indicate transcriptional orientation, clockwise for the outer row and Figure S1: Mitochondrial gene map for Pythium ultimum BR144. Arrows indicate transcriptional orientation, clockwise for the outer row and counterclockwise for the inner row, with green representing coding

More information

Reconstructing the history of lineages

Reconstructing the history of lineages Reconstructing the history of lineages Class outline Systematics Phylogenetic systematics Phylogenetic trees and maps Class outline Definitions Systematics Phylogenetic systematics/cladistics Systematics

More information

Spring th Grade

Spring th Grade Spring 2015 8 th Grade The geologic time scale is a record of the major events and diversity of life forms present in Earth s history. The geologic time scale began when Earth was formed and goes on until

More information

Chapters 25 and 26. Searching for Homology. Phylogeny

Chapters 25 and 26. Searching for Homology. Phylogeny Chapters 25 and 26 The Origin of Life as we know it. Phylogeny traces evolutionary history of taxa Systematics- analyzes relationships (modern and past) of organisms Figure 25.1 A gallery of fossils The

More information

Chapter 1-Plants in Our World

Chapter 1-Plants in Our World Chapter 1-Plants in Our World Formation of earth-4.5-4.6 billion years ago Evidence of life from organic material-3.8 billion years ago Many cyanobacteria are photosynthetic, but these microscopic organisms

More information

Phylogenetics - Orthology, phylogenetic experimental design and phylogeny reconstruction. Lesser Tenrec (Echinops telfairi)

Phylogenetics - Orthology, phylogenetic experimental design and phylogeny reconstruction. Lesser Tenrec (Echinops telfairi) Phylogenetics - Orthology, phylogenetic experimental design and phylogeny reconstruction Lesser Tenrec (Echinops telfairi) Goals: 1. Use phylogenetic experimental design theory to select optimal taxa to

More information

Genome-wide discovery of G-quadruplex forming sequences and their functional

Genome-wide discovery of G-quadruplex forming sequences and their functional *Correspondence and requests for materials should be addressed to R.G. (rohini@nipgr.ac.in) Genome-wide discovery of G-quadruplex forming sequences and their functional relevance in plants Rohini Garg*,

More information

1 Genomes with odd ploidy are generally deemed to be infertile because

1 Genomes with odd ploidy are generally deemed to be infertile because BIOINFORMAIC Vol. 23 IMB/ECCB 2007, pages i433 i439 doi:10.1093/bioinformatics/btm169 Polyploids, genome halving and phylogeny David ankoff 1, *, Chunfang Zheng 2 and Qian Zhu 3 1 Department of Mathematics

More information

Gene order in Rosid phylogeny, inferred from pairwise syntenies among extant genomes

Gene order in Rosid phylogeny, inferred from pairwise syntenies among extant genomes Gene order in Rosid phylogeny, inferred from pairwise syntenies among extant genomes Chunfang Zheng 1,2 and David Sanko 2 1 Département d informatique et de recherche opérationnelle, Université de Montréal

More information

Phylogenetic inference

Phylogenetic inference Phylogenetic inference Bas E. Dutilh Systems Biology: Bioinformatic Data Analysis Utrecht University, March 7 th 016 After this lecture, you can discuss (dis-) advantages of different information types

More information

Supplementary Materials for

Supplementary Materials for advances.sciencemag.org/cgi/content/full/1/8/e1500527/dc1 Supplementary Materials for A phylogenomic data-driven exploration of viral origins and evolution The PDF file includes: Arshan Nasir and Gustavo

More information

Genome-wide Identification of Lineage Specific Genes in Arabidopsis, Oryza and Populus

Genome-wide Identification of Lineage Specific Genes in Arabidopsis, Oryza and Populus Genome-wide Identification of Lineage Specific Genes in Arabidopsis, Oryza and Populus Xiaohan Yang Sara Jawdy Timothy Tschaplinski Gerald Tuskan Environmental Sciences Division Oak Ridge National Laboratory

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION SUPPLEMENTARY INFORMATION doi:10.1038/nature13082 Supplementary Table 1. Examination of nectar production in wild-type and atsweet9 flowers. No. of flowers with detectable nectar out of the total observed

More information

Chapter 26 Phylogeny and the Tree of Life

Chapter 26 Phylogeny and the Tree of Life Chapter 26 Phylogeny and the Tree of Life Chapter focus Shifting from the process of how evolution works to the pattern evolution produces over time. Phylogeny Phylon = tribe, geny = genesis or origin

More information

Elements of Bioinformatics 14F01 TP5 -Phylogenetic analysis

Elements of Bioinformatics 14F01 TP5 -Phylogenetic analysis Elements of Bioinformatics 14F01 TP5 -Phylogenetic analysis 10 December 2012 - Corrections - Exercise 1 Non-vertebrate chordates generally possess 2 homologs, vertebrates 3 or more gene copies; a Drosophila

More information

Reading for Lecture 13 Release v10

Reading for Lecture 13 Release v10 Reading for Lecture 13 Release v10 Christopher Lee November 15, 2011 Contents 1 Evolutionary Trees i 1.1 Evolution as a Markov Process...................................... ii 1.2 Rooted vs. Unrooted Trees........................................

More information

Chapter 26: Phylogeny and the Tree of Life

Chapter 26: Phylogeny and the Tree of Life Chapter 26: Phylogeny and the Tree of Life 1. Key Concepts Pertaining to Phylogeny 2. Determining Phylogenies 3. Evolutionary History Revealed in Genomes 1. Key Concepts Pertaining to Phylogeny PHYLOGENY

More information

Genome-wide analysis of the MYB transcription factor superfamily in soybean

Genome-wide analysis of the MYB transcription factor superfamily in soybean Du et al. BMC Plant Biology 2012, 12:106 RESEARCH ARTICLE Open Access Genome-wide analysis of the MYB transcription factor superfamily in soybean Hai Du 1,2,3, Si-Si Yang 1,2, Zhe Liang 4, Bo-Run Feng

More information

Processes of Evolution

Processes of Evolution 15 Processes of Evolution Forces of Evolution Concept 15.4 Selection Can Be Stabilizing, Directional, or Disruptive Natural selection can act on quantitative traits in three ways: Stabilizing selection

More information

Evolutionary trends and mechanisms of chromosome number variation

Evolutionary trends and mechanisms of chromosome number variation Evolutionary trends and mechanisms of chromosome number variation Chromosome number variation Chromosome numbers of plants vary enormously over a 360-fold range. n = 2 in five angiosperm species n = 630

More information

Genomes Comparision via de Bruijn graphs

Genomes Comparision via de Bruijn graphs Genomes Comparision via de Bruijn graphs Student: Ilya Minkin Advisor: Son Pham St. Petersburg Academic University June 4, 2012 1 / 19 Synteny Blocks: Algorithmic challenge Suppose that we are given two

More information

GENOME DUPLICATION AND GENE ANNOTATION: AN EXAMPLE FOR A REFERENCE PLANT SPECIES.

GENOME DUPLICATION AND GENE ANNOTATION: AN EXAMPLE FOR A REFERENCE PLANT SPECIES. GENOME DUPLICATION AND GENE ANNOTATION: AN EXAMPLE FOR A REFERENCE PLANT SPECIES. Alessandra Vigilante, Mara Sangiovanni, Chiara Colantuono, Luigi Frusciante and Maria Luisa Chiusano Dept. of Soil, Plant,

More information

Phylogenetic analysis. Characters

Phylogenetic analysis. Characters Typical steps: Phylogenetic analysis Selection of taxa. Selection of characters. Construction of data matrix: character coding. Estimating the best-fitting tree (model) from the data matrix: phylogenetic

More information

Supplementary Information

Supplementary Information Supplementary Information Supplementary Figure 1. Schematic pipeline for single-cell genome assembly, cleaning and annotation. a. The assembly process was optimized to account for multiple cells putatively

More information

Dr. Amira A. AL-Hosary

Dr. Amira A. AL-Hosary Phylogenetic analysis Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut University-Egypt Phylogenetic Basics: Biological

More information

Gene Families part 2. Review: Gene Families /727 Lecture 8. Protein family. (Multi)gene family

Gene Families part 2. Review: Gene Families /727 Lecture 8. Protein family. (Multi)gene family Review: Gene Families Gene Families part 2 03 327/727 Lecture 8 What is a Case study: ian globin genes Gene trees and how they differ from species trees Homology, orthology, and paralogy Last tuesday 1

More information

A Phylogenomic Assessment of Ancient Polyploidy and Genome Evolution Across the Poales

A Phylogenomic Assessment of Ancient Polyploidy and Genome Evolution Across the Poales Kennesaw State University DigitalCommons@Kennesaw State University Faculty Publications 3-7-2016 A Phylogenomic Assessment of Ancient Polyploidy and Genome Evolution Across the Poales Michael R. McKain

More information

Concepts and Methods in Molecular Divergence Time Estimation

Concepts and Methods in Molecular Divergence Time Estimation Concepts and Methods in Molecular Divergence Time Estimation 26 November 2012 Prashant P. Sharma American Museum of Natural History Overview 1. Why do we date trees? 2. The molecular clock 3. Local clocks

More information

Phylogenetics. Applications of phylogenetics. Unrooted networks vs. rooted trees. Outline

Phylogenetics. Applications of phylogenetics. Unrooted networks vs. rooted trees. Outline Phylogenetics Todd Vision iology 522 March 26, 2007 pplications of phylogenetics Studying organismal or biogeographic history Systematics ating events in the fossil record onservation biology Studying

More information

Quantitative Genetics & Evolutionary Genetics

Quantitative Genetics & Evolutionary Genetics Quantitative Genetics & Evolutionary Genetics (CHAPTER 24 & 26- Brooker Text) May 14, 2007 BIO 184 Dr. Tom Peavy Quantitative genetics (the study of traits that can be described numerically) is important

More information

Reconstruction of Ancestral Genome subject to Whole Genome Duplication, Speciation, Rearrangement and Loss

Reconstruction of Ancestral Genome subject to Whole Genome Duplication, Speciation, Rearrangement and Loss Reconstruction of Ancestral Genome subject to Whole Genome Duplication, Speciation, Rearrangement and Loss Denis Bertrand, Yves Gagnon, Mathieu Blanchette 2, and Nadia El-Mabrouk DIRO, Université de Montréal,

More information

The History of Life. Fossils and Ancient Life (page 417) How Fossils Form (page 418) Interpreting Fossil Evidence (pages ) Chapter 17

The History of Life. Fossils and Ancient Life (page 417) How Fossils Form (page 418) Interpreting Fossil Evidence (pages ) Chapter 17 Chapter 17 The History of Life Section 17 1 The Fossil Record (pages 417 422) This section explains how fossils form and how they can be interpreted. It also describes the geologic time scale that is used

More information

Evolutionary Tree Analysis. Overview

Evolutionary Tree Analysis. Overview CSI/BINF 5330 Evolutionary Tree Analysis Young-Rae Cho Associate Professor Department of Computer Science Baylor University Overview Backgrounds Distance-Based Evolutionary Tree Reconstruction Character-Based

More information