Two Ancient Classes of MIKC-type MADS-box Genes are Present in the Moss Physcomitrella patens

Size: px
Start display at page:

Download "Two Ancient Classes of MIKC-type MADS-box Genes are Present in the Moss Physcomitrella patens"

Transcription

1 Two Ancient Classes of MIKC-type MADS-box Genes are Present in the Moss Physcomitrella patens Katrin Henschel,* Rumiko Kofuji, Mitsuyasu Hasebe, Heinz Saedler,* Thomas Münster,* and Günter Theißen* 1 *Department of Molecular Plant Genetics, Max Planck Institute for Breeding Research, Cologne, Germany; Faculty of Science, Kanazawa University, Kanazawa, Japan; and National Institute for Basic Biology, Okazaki, Japan Characterization of seven MADS-box genes, termed PPM1 PPM4 and PpMADS1 PpMADS3, from the moss model species Physcomitrella patens is reported. Phylogeny reconstructions and comparison of exon-intron structures revealed that the genes described here represent two different classes of homologous, yet distinct, MIKC-type MADS-box genes, termed MIKC c -type genes c stands for classic (PPM1, PPM2, PpMADS1) and MIKC*- type genes (PPM3, PPM4, PpMADS2, PpMADS3). The two gene classes deviate from each other in a characteristic way, especially in a sequence stretch termed intervening region. MIKC c -type genes are abundantly present in all land plants which have been investigated in this respect, and give rise to well-known gene types such as floral meristem and organ identity genes. In contrast, LAMB1 from the clubmoss Lycopodium annotinum was identified as the only other MIKC*-type gene published so far. Our findings strongly suggest that the most recent common ancestor of mosses and vascular plants contained at least one MIKC c -type and one MIKC*-type gene. Our studies thus reveal an ancient duplication of an MIKC-type gene that occurred before the separation of the lineages that led to extant mosses and vascular plants more than about 450 MYA. The identification of bona fide K-domains in both MIKC*-type and MIKC c -type proteins suggests that the K-domain is more ancient than is suggested by a recent alternative hypothesis. MIKC*-type genes may have escaped identification in ferns and seed plants so far. It seems more likely, however, that they represent a class of genes which has been lost in the lineage which led to extant ferns and seed plants. The high number of P. patens MADS-box genes and the presence of a K-box in the coding region and of some potential binding sites for MADS-domain proteins and other transcription factors in the putative promoter regions of these genes suggest that MADS-box genes in mosses are involved in complex gene regulatory networks similar to those in flowering plants. Introduction Although comparative morphological and phylogenetic analyses may have revealed the major patterns of land plant evolution, the mechanisms which generated the enormous diversity of land plant body plans have remained elusive. However, promising approaches are currently made to solve the mysteries of macroevolution, one of which is evolutionary developmental genetics ( evo-devo ). In brief, evo-devo assumes that there is a close interrelationship between developmental and evolutionary processes (Gilbert, Opitz, and Raff 1996). This is so because even the most complex organisms are generated anew in each generation by developmental processes that usually start with a single cell, the fertilized egg cell. In the case of multicellular organisms, evolution of form is thus always the evolution of developmental processes. Because development is to a very large extent under genetic control, changes in developmental control genes may be a major aspect of evolutionary changes in morphology (Theißen and Saedler 1995; Gilbert, Opitz, and Raff 1996; Theißen et 1 Present address: Lehrstuhl for Genetics, Friedrich Schiller University, Philosophenweg 12, D Jena, Germany. Abbreviations: aa, amino acid(s); C-domain, C-terminal domain; I, intervening; K, keratin-like; M-domain, MADS domain; RACE, rapid amplification of cdna ends; RAGE, rapid amplification of genomic ends; UTR, untranslated region. Key words: MADS-box gene, Physcomitrella, moss, evolution. Address for correspondence and reprints: Günter Theißen, Lehrstuhl for Genetics, Friedrich Schiller University, Philosophenweg 12, D Jena, Germany. guenter.theissen@uni-jena.de. Mol. Biol. Evol. 19(6): by the Society for Molecular Biology and Evolution. ISSN: al. 2000; Carroll, Grenier, and Weatherbee 2001, p. 13; Davidson 2001, p. 18). Understanding the phylogeny of developmental control genes may thus significantly contribute to understanding the evolution of plant and animal form. In recent years it has been demonstrated that many of the key developmental control genes of plants and animals are members of multigene families which encode transcription factors. One of the most important gene families for understanding the interrelationships between development and evolution in green plants is arguably the family of MADS-box genes (Theißen and Saedler 1995; Theißen, Kim, and Saedler 1996; Hasebe and Banks 1997; Hasebe 1999; Theißen et al. 2000; Vergara-Silva, Martinez-Castilla, and Alvarez-Buylla 2000). The defining characteristic of all MADS-box genes is the presence of a highly conserved, approximately 180-nucleotides-long DNA sequence, termed the MADS-box. It encodes the DNA-binding domain of the respective MADS-domain (M-domain) transcription factors (Theißen, Kim, and Saedler 1996; Theißen et al. 2000). Outside the DNA-binding domains, MADS-proteins are structurally quite diverse (Alvarez-Buylla et al. 2000b). However, like many other eukaryotic transcription factors, M-domain proteins have a modular structural organization, which provides the basis for a further subdivision of these proteins and their genes. The majority of the plant M-domain proteins known so far belong to a single clade with a conserved structural organization, the so-called MIKC-type domain structure. It comprises (from N- to C-terminal) an N- terminal domain, which is, however, present only in a 801

2 802 Henschel et al. minority of proteins; an M-domain, which is the major determinant of DNA-binding but which also performs dimerization and accessory factor binding functions; an intervening (I) domain, which constitutes a key molecular determinant for the selective formation of DNAbinding dimers; a keratin-like (K) domain, which promotes protein dimerization; and a C-terminal domain (C-domain), which is involved in transcriptional activation or in the formation of ternary or quaternary protein complexes (Riechmann and Meyerowitz 1997; Egea-Cortines, Saedler, and Sommer 1999; Becker et al. 2000; Theißen et al. 2000; Honma and Goto 2001). By phylogeny reconstruction, MIKC-type genes can be further subdivided into a considerable number of defined gene clades. Most clade members share highly related functions and similar expression patterns during vegetative or reproductive growth (Theißen, Kim, and Saedler 1996; Theißen et al. 2000). The most well-known MIKC-type MADS-box genes function as homeotic selector genes in the specification of floral organ identity (Schwarz-Sommer et al. 1990; Weigel and Meyerowitz 1994; Theißen and Saedler 1999, 2001; Theißen 2001). These floral organ identity genes are involved in differentiating different plant organs from each other. They do so by encoding transcription factors which bind to regulatory DNA sequences of their target genes and either activate or repress these genes as appropriate for the development of the respective floral organs. The target genes, or genes further downstream, function as realizator genes, meaning that they encode the enzymes and structural proteins needed for any floral organ to develop its characteristic identity. The floral organ identity genes are generally expressed in those organs whose identity they specify, such as sepals and petals, petals and stamens, or stamens and carpels in case of the class A, class B, and class C floral organ identity genes, respectively (Weigel and Meyerowitz 1994; Theißen and Saedler 1999; Theißen 2001; Theißen and Saedler 2001). However, the flowering plant Arabidopsis alone contains more than 80 different MADS-box genes (Riechmann et al. 2000), many of which are involved in other developmental processes inside and outside of the flower, including fruit, leaf, and root development (Alvarez-Buylla et al. 2000a; Liljegren et al. 2000; Theißen et al. 2000). Because MADS-box genes play such important roles in diverse aspects of flowering plant development, understanding their phylogeny may strongly improve our understanding of the origin and evolution of flowering plant architecture. For example, floral organ identity totally depends on the activity of some MADS-box genes. The origin of the hallmark of the flowering plants, the flower, can therefore only be understood in the context of MADS-box gene phylogeny. The analogous may be true for more ancient structural features, such as mega- and microsporophylls or even vegetative organs. Thus, the question arises as to when and how the genes arose during evolution that today control the development of flowering plants. To answer this, the phylogeny of MADS-box genes has to be reconstructed and superimposed on the phylogeny of land plants. Toward that goal, MADS-box genes have to be studied in phylogenetically informative taxa, including all major groups of land plants (embryophytes). Extant land plants very likely represent a monophyletic group of organisms whose closest relatives are a group of green freshwater algae termed stoneworts (charophytes) (Bhattacharya and Medlin 1998; Graham and Wilcox 2001, pp ). Land plants comprise structurally relatively simple bryophytes as well as more complex tracheophytes (vascular plants) (Kenrick and Crane 1997). Extant bryophytes comprise liverworts, hornworts, and mosses. Extant vascular plants range from clubmosses (lycophytes) and ferns and their allies (such as whisk ferns and horsetails) to complex seed plants (spermatophytes), comprising gymnosperms and angiosperms (Kenrick and Crane 1997; Nickrent et al. 2000; Pryer et al. 2001). Bryophytes are very likely not a monophyletic group, but the question as to which of the bryophyte lineages is the most basal is still highly controversial (Qiu and Palmer 1999; Nickrent et al. 2000). A widely held view places liverworts as the first branch of the land plant tree, whereas some studies suggest that hornworts are the earliest land plants, with mosses and liverworts jointly forming the second deepest lineage (Kenrick and Crane 1997; Nishiyama and Kato 1999; Nickrent et al. 2000). A detailed characterization of the MADS-box gene family in the pteridophytes Ceratopteris pteroides and C. richardii (leptosporangiate ferns) and Ophioglossum pedunculosum (eusporangiate fern) (Kofuji and Yamaguchi 1997; Münster et al. 1997; Hasebe et al. 1998; Theißen et al. 2000; Münster, Faigl, and Theißen, personal communication) suggested that the most recent common ancestor of ferns and seed plants about 400 MYA already contained at least two different MIKCtype MADS-box genes (Münster et al. 1997; Hasebe et al. 1998; Theißen et al. 2000). These genes probably had expression patterns and functions that were more ubiquitous than those of the highly specialized floral organ identity genes from extant flowering plants, as suggested by the expression patterns of genes from extant ferns. No clear evidence has been obtained so far that there are orthologs of floral homeotic genes in ferns. Thus, the most recent common ancestors of extant ferns and seed plants possibly did not have orthologs of floral homeotic genes yet (Theißen et al. 2000). However, the deep branching of the MADS-box gene tree is not well resolved, and molecular clock estimates based on angiosperm sequences suggested that the floral homeotic genes diverged before the seed plant fern split (Purugganan 1997). It cannot be ruled out, therefore, that floral homeotic gene orthologs existed already in the most recent common ancestor of ferns and seed plants but have either been lost near the base of the lineage that led to extant ferns or exist in ferns but have escaped identification so far (Hasebe 1999). It would be interesting to see, therefore, what kind of MADS-box genes are present in even more basal, nonvascular plants, such as mosses, which separated from the lineage that led to extant vascular plants about 450 MYA.

3 Because of its unprecedented technical advantages as a green plant model system (Cove and Knight 1993; Schaefer and Zryd 1997; Reski 1999; Schaefer 2001), we and others (Krogan and Ashton 2000) have chosen Physcomitrella patens for a characterization of moss MADS-box genes. In the following, we outline structural data for seven different MADS-box genes, but we also report that the total number of MADS-box genes in P. patens is even higher. These MADS-box genes can be subdivided into at least two different classes, classical MIKC-type genes (henceforth termed MIKC c -type genes) and a deviant type of MIKC-type genes with structurally abnormal I- and K-regions, termed MIKC*- type genes. We identified LAMB1 from the lycophyte Lycopodium annotinum (Svensson, Johannesson, and Engström 2000) as the only other MIKC* gene published so far. Our data imply that the complexity of the MADS-box gene family in the moss Physcomitrella is higher than was previously assumed (Krogan and Ashton 2000). The implications of our findings for a reconstruction of MADS-box gene phylogeny and the prospects of future research are discussed. Materials and Methods Plant Material Physcomitrella patens (Hedw.) B. S. G. was grown under standard conditions as described by D. G. Schaefer (protocol available via the World Wide Web at http: // DNA Synthesis and Sequencing Oligonucleotides were purchased from LifeTech. All DNA sequences corresponding to PPM and Pp- MADS genes were determined by the MPIZ DNA core facility and the NIBB Center for Analytical Instruments, respectively, on Applied Biosystems (Weiterstadt, Germany) Abi Prism 377 and 3700 sequencers using BigDye-terminator chemistry. Premixed reagents were from Applied Biosystems. Isolation of cdnas For the isolation of cdnas of PPM1 PPM4, poly(a) -RNA was isolated from plant material, including all major stages of the P. patens life cycle (protonema, gametophores, and sporophytes of different ages), according to a method published by Chomczynski and Sacchi (1987), using Total RNA Reagent (BIOMOL). Partial cdnas of PPM2c and PPM3 were isolated by 3 rapid amplification of cdna ends (RACE) as generally described (Frohman, Dush, and Martin 1988; Münster et al. 1997). Oligonucleotide P RQVT2 (5 -CGR CAR GTG ACS TTC TSC AAR CG-3 ), which had been derived from fern, gymnosperm, and angiosperm sequences, was used as an MADS-box specific primer. RACE products were cloned into a standard plasmid vector, pgem-t (Promega, Mannheim, Germany). The PPM4 cdna was isolated by 3 RACE with primers derived from genomic sequence fragments of PPM4 (see later) and a RACE-adapter primer. Physcomitrella MADS-box Genes 803 Additional cdnas of PPM3 and partial cdnas of PPM1 were isolated by screening of a cdna phage library representing P. patens protonema mrna (kindly provided by R. Strepp and R. Reski) under conditions of moderate stringency. Radioactively labeled cdnas of PPM2c and PPM3 were used as hybridization probes. Upstream sequences overlapping with the 3 fragments of PPM1, PPM2c and PPM3 were isolated by 5 RACE, employing a commercially available kit (5 /3 - RACE Kit; Boehringer Mannheim, Mannheim, Germany). cdnas of PpMADS1 PpMADS3 were isolated in the following way. Poly(A) -RNA was isolated from a mixture of protonemata and gametophores cultured at 25 C and gametophores cultured at 16 C using Dynabeads mrna Direct Kit (DYNAL, Oslo, Norway) and further purified with ISOGEN-LS (Wako Pure Chemical, Osaka, Japan). The Marathon cdna Amplification Kit (Clontech, Palo Alto, Calif.) and the 3 RACE System (Life Technologies, Inc., Rockville, Md.) were used to synthesize cdna of each poly(a) -RNA, using nested M-domain specific primers (M1: 5 -SAR MTN AAR MGG ATM GAG AAC-3 and dumads2: 5 -CAU CAU CAU CAU AAR AAR GCI TAY GAR CTI TCN TCN GT-3 ) and modified poly-t primers supplied by the kits. The PCR-amplified candidate cdnas were separated on 1% agarose gels, and fragments of over 500 bp were cloned into the pamp1 vector (Life Technologies). From candidate clones obtained by 3 -RACE, 480 clones (288 clones for the mixture of cdna from protonemata and gametophores cultured at 25 C and 192 clones for cdna from gametophores cultured at 16 C) were characterized. All clones were digested with Sau3A1, and the 288 and 192 clones were classified into groups of 46 and 32, respectively. In the group of 46, subsets of 2, 7, and 37 corresponded to PpMADS1, PpMADS2, and unrelated DNA fragments, respectively. In the group of 32, subsets of 7, 8, and 17 corresponded to PpMADS2, PpMADS3, and unrelated DNA fragments, respectively. The 5 region of each gene was obtained using the Marathon cdna Amplification Kit and sequenced. The nucleotide sequences of the gene-specific primers are deposited in the DNA database, with the sequence of each gene as additional information. Nucleotide sequence of each gene was confirmed by sequencing PCR products using two gene-specific primers located close to each end of the putative mrna. Nucleotide sequence data of the cdnas have been deposited in the EMBL, GenBank, and DDBJ Nucleotide Sequence Databases under the accession numbers AF (PPM1; Krogan and Ashton 2000), AJ (PPM2c), AJ (PPM3), AJ (PPM4), AB (PpMADS1), AB (Pp- MADS2), and AB (PpMADS3). Genomic sequences are available under AF (PPM1; Krogan and Ashton 2000), AF (PPM2; Krogan and Ashton 2000), AJ (PPM3), and AJ (PPM4). Genomic DNA-based PCR Techniques Genomic DNA was isolated from 3-week-old gametophytes using a DIECA-based protocol. Genomic

4 804 Henschel et al. DNA fragments corresponding to isolated cdnas were amplified with primers derived from the known cdna sequences. DNA (400 ng) was used as template in a standard PCR program (denaturation at 94 C, annealing at 66 C, elongation at 72 C) of 40 cycles. Sequence information of putative promoter regions and 5 untranslated regions (UTRs) was obtained by the rapid amplification of genomic ends (RAGE) technique using primers of the 5 parts of known sequences and an adapter-specific oligonucleotide, as described elsewhere (Siebert et al. 1995; Cormack and Somssich 1997). Second rounds of PCRs with nested primers were carried out employing primary amplification products as templates. A PPM4 fragment was isolated by 3 RAGE, running the PCR with oligonucleotides P KH05 (MADS-box primer) (5 -AGG CAR GTG ACS TWC TSC AAR MG- 3 ), which had been derived from known MADS-box sequences, and an adapter-specific oligonucleotide P AP1 (5 -GTA ATA CGA CTC ACT ATA GGG C-3 ). PCR products were cloned into the plasmid vector pgem-t (Promega). To complete the genomic 5 part of PPM4, 5 RAGE was performed. After cloning the PPM4 cdna, the corresponding genomic DNA fragments were amplified with primers derived from the cdna sequences. Southern Analysis Southern blots were prepared by standard methods with 10 g of total DNA from P. patens gametophytes per lane, digested with different restriction enzymes. Gene-specific hybridization probes were mainly obtained from the region downstream of the MADS-box to avoid cross-hybridization with other gene family members even under stringent hybridization conditions. All probes were radioactively labeled by oligo-probe labeling using Klenow enzyme and 32 P-dCTP (Sambrook, Fritsch, and Maniatis 1989, pp ). The filters were hybridized at 68 C (high stringency) or at 60 C (moderate stringency) for 16 h in 5 SSC, 5 Denhardt s solution, 0.5% SDS, and 1 mg/ml herring sperm DNA and washed at 68 C in0.1 SSPE/0.1% SDS (high stringency) or at 55 C in1 SSPE/0.1% SDS (moderate stringency). Sequence Alignments and Construction of Phylogenetic Trees Multiple alignments of conceptual amino acid sequences were generated by using the Genetics Computer Group program PILEUP (version 10.0), with a gap weight of 8 and a gap length weight of 2 (default parameters), and were optimized by hand, if necessary. Only the M-domains (amino acid [aa] position 1 60) and part of the K-domains (aa position ; numbering according to the SQUA protein), fused together, were used for the reconstruction of phylogenetic trees because these represent the only sequences which could be unambiguously aligned in the case of the data sets used. In addition, the 51st amino acid of LAMB1, which represents an autapomorphic insertion (fig. 1; Svensson, Johannesson, and Engström 2000), was removed for phylogeny reconstructions. Trees were constructed by the neighbor-joining method and statistically evaluated by bootstrap analysis as described (Münster et al. 1997). Accession numbers of the sequences used can be found at the MADS homepage ( de/mads). Results Cloning of MADS-box Genes from P. patens Employing a diversity of molecular cloning techniques, sequence information about seven different MADS-box genes of the moss P. patens was obtained. Two of the genes are identical with PPM1 and PPM2 as described by Krogan and Ashton (2000), so we adopt these names here. In the case of PPM2, however, a novel cdna isoform (PPM2c) was found; its sequence is identical to that of the PPM2 cdna (sensu Krogan and Ashton 2000), except for the sequence downstream of the 10th exon where PPM2c is identical to the cdna termed PPM2b (Krogan and Ashton 2000). Complete sequencing of representative cdnas and conceptual translation of the open reading frames yielded proteins of lengths of 283 aa (PPM1), 284 aa (PPM2c), 372 aa (PPM3), 380 aa (PPM4), 281 aa (PpMADS1), 306 aa (PpMADS2), and 320 aa (PpMADS3) (fig. 1). Hybridization of Southern blots containing genomic DNA of P. patens, with different probes specific for the moss genes PPM1 PPM4, under stringent conditions gave single, but different, bands for each gene, indicating that PPM1 PPM4 represent four different single-copy genes (fig. 2). Sequence Comparisons Reveal Two Distinct Classes of MIKC-type Genes in P. patens Multiple sequence alignments of the conceptual PPM and PpMADS amino acid sequences with sequences of diverse MIKC-type M-domain proteins from ferns and seed plants demonstrated that all the M-domain proteins from P. patens have recognizable M- and K-domains linked together by an I-domain and followed by a C-domain (fig. 1). Thus, all these conceptual gene products may be considered as MIKC-type proteins. However, the sequence alignment also revealed that two different classes of P. patens proteins can be distinguished. PPM1, PPM2, and PpMADS1 have a primary structure which is very similar to that of the other MIKC-type proteins known (fig. 1). We refer here to proteins with such a structure as MIKC c -type proteins (or, in brief, MIKC c proteins), where c stands for classic. In contrast, PPM3, PPM4, PpMADS2, and PpMADS3 share some features which distinguish them from the MIKC c -type proteins. Proteins with at least some of these features will henceforth be termed MIKC*-type (or MIKC*) proteins. The unusual features of the MIKC*-type proteins concern mainly the I-domain, but the K-domain also shows some peculiarities. The I-domains of the MIKC*- type proteins are considerably longer than those of the MIKC c -type proteins. For example, in the alignment

5 Physcomitrella MADS-box Genes 805 FIG. 1. Sequence alignment of M-domain proteins. Conceptual amino acid sequences of MADS-box genes from P. patens (PPM and PpMADS sequences) were aligned to representatives of different MADS-box gene subfamilies. A sign indicates that the sequence of AGL17 is incomplete at the N terminus. The N-, M-, I-, K-, and C-domains are marked; the hydrophobic amino acids (L, I, V, and M) of the K-domain are highlighted in bold. Brackets below the sequences indicate the positions of introns which are located between codons (zero-phase introns). Triangles indicate the position of introns which are located within codons. Except for the last intron of PPM1, PPM2, and PpMADS1, they are all phase 2 introns, i.e., they are located between the second and the third codon positions. DEF, GLO, PLE, and SQUA are derived from Antirrhinum majus; AGL2, AGL6, AGL12, and AGL17 from Arabidopsis thaliana; AGL15-1 from Brassica napus; TM8 from Lycopersicon esculentum; TOBMADS1 from Nicotiana tabacum; CRM1 and CRM3 from C. richardii; and LAMB1 from L. annotinum.

6 806 Henschel et al. FIG. 2. Genomic Southern blot analysis of some of the P. patens MADS-box genes. Genomic DNA was digested with the restriction enzymes EcoRI, EcoRV, HindII, or HindIII, as indicated above the lanes, separated on an agarose gel, and blotted onto a nylon membrane. Probes were used which were specific for the different PPM genes under the high-stringency hybridization and washing conditions applied. presented here the I-domain of PPM3 is 62 aa long, and the I-domain of PpMADS2 is even 87 aa long, whereas that of the typical MIKC c -type protein SQUAMOSA (SQUA) from Antirrhinum majus has a length of 35 aa (Huijser et al. 1992) (fig. 1). The K-domains of MIKC*-type proteins also show regularly spaced hydrophobic amino acids as the K-domains of the MIKC c -type proteins do, but some of the hydrophobic amino acids may be shifted by one or two positions or may be less conserved (fig. 1). In addition, there are three indels where the K-domains of the P. patens MIKC*-type proteins are longer than the K-domains of the MIKC c -type proteins. The indels are very small: two comprise two amino acids and one just one amino acid residue. Interestingly, one insertion of two amino acids is also found in two closely related MIKC c - type proteins, CRM3 and CRM9, from the fern Ceratopteris (fig. 1; Münster et al. 1997; Theißen et al. 2000). Taken together, a longer I-domain seems to be the most characteristic hallmark of the MIKC*-type proteins, whereas a deviating K-domain, with respect to both sequence and length, is a typical, but less diagnostic, feature. There is one published M-domain protein, LAMB1, from the lycopod (clubmoss) L. annotinum (Svensson, Johannesson, and Engström 2000) that shares the most remarkable feature of PPM3, PPM4, PpMADS2, and PpMADS3 (fig. 1): the I-domain of LAMB1 is significantly longer than those of the MIKC c -type proteins (fig. 1). This raises the intriguing question as to whether this similarity between the P. patens MIKC*-type proteins and LAMB1 indicates a close evolutionary relationship between the respective genes. In the following, we consider LAMB1 tentatively as an MIKC*-type protein. Exon-intron Structures of MIKC c - and MIKC*-type Genes Interpretations of structural differences between MIKC c - and MIKC*-type proteins critically depend on proper recognition of the different domains and thus on assumptions about homologies. The hypothesis about the domain structures of the MIKC-type proteins implied by figure 1 (henceforth termed longer I-domain hypothesis ) was not easy to corroborate by simple sequence comparisons. There was an alternative hypothesis ( K-domain insertion hypothesis ) which suggested that the length difference between MIKC c - and MIKC*-type proteins upstream of the C-domain is mainly because of an insertion within the K-domain rather than an elongation of the I-domain. Determination of sequence similarities between both sets of putative K-domains of MIKC*-type proteins and the K-domains of MIKC c -type proteins, as well as the analysis of patterns of conserved amino acids (data not shown), slightly favored the longer I-domain hypothesis, as did pairwise and multiple sequence alignments involving different samples of MIKC-type proteins (an example is given in fig. 1). However, these analyses could not completely rule out the K-domain insertion hypothesis being true. To distinguish between the two hypotheses and to better understand the differences between the two classes of MIKC-type genes, the sequences and exon-intron structures of the genomic loci of the P. patens MADSbox genes were determined. The positions of all introns with respect to the protein sequences are depicted in figure 1. The exon-intron structures of PPM1 PPM4 are compared with those of SQUA and LAMB1 in figure 3. It becomes obvious that under the longer I-domain hypothesis, the exon-intron structures of both classes of MIKC-type proteins appear generally quite conserved. Most important, the positions of introns within the K- domains are identical for all MIKC-type genes under that hypothesis, except that one intron is missing in LAMB1 (fig. 1). In contrast, under the K-domain insertion hypothesis, intron positions differ dramatically in MIKC c - and MIKC*-type genes (data not shown). Moreover, the K-domain insertion hypothesis requires the split of a domain assumed to be of some functional

7 Physcomitrella MADS-box Genes 807 FIG. 3. Analysis of the exon-intron structures of the P. patens MADS-box genes PPM1 PPM4. Exons are shown as colored boxes, the 5 and 3 -UTRs as white boxes, and the introns as bold black lines. Homologous exons are connected with thin vertical lines. In addition, the genomic structure of SQUAMOSA as a representative of an MIKC c -type and of LAMB1 as an MIKC*-type MADS-box gene are presented. relevance (for protein-protein interactions). For these reasons we currently strongly favor the longer I-domain hypothesis over its alternative. Under the longer I-domain hypothesis, the major structural difference between the two subtypes of MIKC genes is the following: whereas the I-domain of MIKC c - type proteins is largely encoded by just one exon, that of MIKC*-type proteins is encoded by four (PPM3, PPM4, LAMB1) or five (PpMADS2, PpMADS3) exons, thus explaining the increase in I-domain length in MIKC*-type proteins (figs. 1 and 3). As already described for LAMB1 (Svensson, Johannesson, and Engström 2000), the last and third last of the I-domain exons of each MIKC*-type gene are remarkably small, encoding only a few amino acids (figs. 1 and 3). The analysis of exon-intron structures thus supports the view that a deviant I-domain is the most consistent feature that distinguishes the MIKC*-type proteins from the MIKC c - type proteins. The similarity in exon-intron structures between the P. patens MIKC*-type genes and LAMB1 in the I-domain (figs. 1 and 3) strongly suggests a relatively close relationship between the respective genes. Despite the high structural similarity of the P. patens MIKC* proteins and LAMB1, the overall sequence identity between these proteins is quite low. PPM3 and LAMB1, for example, show an overall identity of only 26.9% (a similarity of 37.3%), probably reflecting the fact that the lineages that led to extant mosses and vascular plants (including lycophytes) were already separated about 450 MYA. In line with this, and despite the general similarity among the MIKC*-type genes, LAMB1 shows some unique features which distinguishes it from the P. patens MIKC* genes. Downstream of the first intron within the K-domain, all other introns are missing (figs. 1 and 3); the three indels found in the K-domain of the P. patens MIKC* proteins are not found; there is an exceptionally long C-domain that seems to be composed of three quite imperfect repeat units (Svensson, Johannesson, and Engström 2000). MIKC*-type Proteins are Closely Related The similarities between PPM3, PPM4, PpMADS2, PpMADS3 and LAMB1 suggest that they are more closely related to each other than to any other MADS-box gene known. To test this hypothesis, the evolutionary relationships between the moss genes and the other known MADS-box genes were determined by phylogeny reconstructions involving a diverse set of MIKCtype proteins from vascular plants (including LAMB1) and the conceptual gene products of all the P. patens MADS-box genes reported here. The set of seed plant MIKC c proteins used comprised representatives of all the major subfamilies of plant M-domain proteins

8 808 Henschel et al. FIG. 4. Phylogenetic tree showing the relationships between some MIKC c -type and all MIKC*-type M-domain proteins known. Names of species from which the respective genes were isolated are given in parentheses behind the protein names. Proteins from the moss P. patens are shown by inverted boxes, the LAMB1 protein from the lycophyte L. annotinum is highlighted by a shaded box, and those of ferns by open boxes. All other genes have been isolated from seed plants (angiosperms or gymnosperms). The numbers next to some nodes give bootstrap percentages, shown only for relevant nodes (mainly those defining gene subfamilies or clades of special interest for this work). Subfamilies and gene classes (MIKC* or MIKC c ) are labeled by brackets at the right margin. known (Theißen, Kim, and Saedler 1996; Becker et al. 2000; Theißen et al. 2000). Only sequences of M- and K-domains were used for these kinds of studies because they produce the most unambiguous alignments. A representative example of the phylogenetic trees obtained is shown in figure 4. It indicates that both the MIKC c - and the MIKC*-type genes cluster together with reasonable bootstrap support. Given that the tree in figure 4 is unrooted, this suggests that at least one of the two types of genes (MIKC* or MIKC c ) represents a clade, meaning that the respective genes share a most recent common ancestor not shared with any of the other MIKC-type genes known. In addition, it becomes apparent that the P. patens MIKC proteins constitute two strongly supported clades (one containing MIKC c, the other containing MIKC* proteins). This suggests that the respective genes within each clade originated by relatively recent gene, chromosome, or genome duplications in the lineage that led to extant mosses. It cannot be completely ruled out, however, that the different loci within one clade are more ancient than it seems but were kept similar by gene conversion events. In addition, it is noteworthy that the phylogeny reconstructions could not be used to assign the P. patens MIKC c genes to any of the well-defined subfamilies of ferns or seed plant MADS-box genes (fig. 4). In figure 4 the P. patens MIKC c proteins form a sister clade to CRM1-like proteins from the fern Ceratopteris, but in several phylogeny reconstructions with different data sets, this was never supported by high bootstrap values (data not shown). The significance of the observation thus remains unclear. Therefore, orthology relationships between the MADS-box genes from the moss P. patens on the one hand and from seed plants on the other could not be identified. All the features outlined here were not only supported by neighbor-joining trees as the one shown in figure 4 but also by maximum-likelihood trees (unpublished data). Discussion Homology of P. patens MADS-box Genes We report here the cdna and genomic sequences of seven different MADS-box genes from the moss

9 model system P. patens. Their conceptual gene products are all recognized as MIKC-type proteins (fig. 1). As suggested by their characteristic and unique domain structure, all MIKC-type genes are probably homologous genes which originated from a single common ancestor gene; they were probably generated by gene duplications, sequence diversification, and fixation events, whereas domain swapping played only a minor role, if any (Theißen, Kim, and Saedler 1996; Theißen et al. 2000). This implies that the MADS-box genes reported here are genuine homologs of the many well-characterized MADS-box genes from flowering plants, including almost all of the floral homeotic genes and some of the floral meristem identity genes. However, by phylogeny reconstructions, these genes could not be assigned to any of the well-defined MIKC-type gene subfamilies known from seed plants (fig. 4; Theißen, Kim, and Saedler 1996; Becker et al. 2000). Orthology between the moss genes and any MADS-box gene (or clades of genes) from seed plants could not thus be identified here. The long-independent evolutionary history of mosses and seed plants, and many independent gene duplications both in the lineage that led to extant mosses and in that which led to extant seed plants, may well have obscured any traces of orthology. A Novel Class of MIKC-type Genes Despite the homology of all the P. patens MADSbox genes reported here, the structures of four proteins PPM3, PPM4, PpMADS2, and PpMADS3 showed some unusual features, which we interpret as a longer I-domain and a less strict conservation of the length and the hydrophobic amino acids of the K-domain. Phylogeny reconstructions (fig. 4) corroborate the view that the respective genes, together with LAMB1 from the lycophyte L. annotinum, are a hitherto unknown class of MIKC-type genes, termed MIKC*-type genes. Sequence comparisons and the analysis of exon-intron structures (figs. 1 and 3) suggest that an extended I-domain, encoded by four or five rather than one exon (as in the case of MIKC c genes), is the most characteristic feature of the MIKC*-type genes. Are MIKC*-type Genes Restricted to Lower Land Plants? PPM3, PPM4, PpMADS2, PpMADS3 and LAMB1 are the only MIKC*-type genes that have been identified so far, implying that this class of genes has not been reported so far for ferns and seed plants (fig. 5). This raises the intriguing question as to whether MIKC*-type genes are really absent in these taxa or whether they have just not been identified yet. Given the intensive characterization of the MADS-box gene family in some ferns, gymnosperms, and flowering plants (for references, see Introduction and Theißen et al. 2000), we think it is quite unlikely that MIKC* genes exist in these taxa but have not been isolated or recognized yet. However, a low expression level could have helped to hide MIKC* genes, because most of the MADS-box genes were first identified at the cdna level. Physcomitrella MADS-box Genes 809 In the case of the flowering plant Arabidopsis thaliana, where almost the complete genome has been sequenced, the presence or absence of MIKC* genes can now be checked more rigorously. However, there are still limitations with respect to the identification of these genes. MIKC* genes may be recognized by their membership in the respective phylogenetic cluster or by the number of exons encoding the I-domain. Both require reliable information about the exon-intron and domain structures of the genes, which is available for many, but by far not all, A. thaliana MADS-box genes. The computer prediction of the short exons in the I- and K-regions of MIKC-type genes often fails (unpublished data), so that in general a cdna sequence is required to accurately determine the exon-intron structure of an MIKC-type gene. There are at least 82 MADS-box genes in the A. thaliana genome (Riechmann et al. 2000), at least 32 of which are Type II (plant MIKC or animal and fungal MEF2) genes and thus have to be considered here (Alvarez-Buylla et al. 2000b). For all of these, MADS- and K-domains could be predicted with sufficient accuracy, aligned, and used for phylogeny reconstructions analogous to the one shown in figure 4. None of these was identified as an MIKC*-type protein (unpublished data). Thus, there is no evidence so far that corresponding genes exist in A. thaliana. More rigorous statements about the absence or presence of MIKC*-type genes in A. thaliana require the determination of the exon-intron structure for all the genes in the genome. An Ancient Duplication of an MIKC-type Gene The data presented here demonstrate that MIKC c - type as well as MIKC*-type genes are present in both a moss and a clubmoss (lycophyte). Recently, dramatic progress has been made in understanding the relationships between the major land plant taxa, including mosses and clubmosses (fig. 5). There is evidence that there are three clades of extant vascular plants, lycophytes, seed plants, and ferns plus allies such as horsetails (equisetophytes) and whisk ferns (psilophytes) (Pryer et al. 2001). Ferns (sensu lato) and seed plants form a clade, so that lycophytes appear as the most basal vascular plants; mosses represent an outgroup of the vascular plants (fig. 5). The presence of MIKC c and MIKC* genes in both mosses and clubmosses thus strongly suggests that the last common ancestor of mosses and vascular plants, that probably existed about 450 MYA, contained already an MIKC c as well as an MIKC* gene (fig. 5). Our studies, therefore, reveal an ancient duplication of an MIKC-type gene that occurred before the separation of the lineages that led to extant mosses and vascular plants more than about 450 MYA. But which kind of MIKC gene was duplicated at that time? Are MIKC c or MIKC* Genes Ancestral? Because MIKC c as well as MIKC* genes have been isolated from both mosses and vascular plants, it is difficult to say which class of genes is ancestral. In principle, MIKC* genes may have originated from MIKC c

10 810 Henschel et al. FIG. 5. The ancestry of MIKC c - and MIKC*-type genes in the evolution of land plants. A phylogenetic tree of some major taxa of land plants is shown; the topology of the tree is according to recent publications (Kenrick and Crane 1997; Nickrent et al. 2000; Pryer et al. 2001). Seed plants comprise angiosperms (flowering plants) and gymnosperms. Together with ferns (including allies such as horsetails and whisk ferns) and the lycophytes, they constitute the vascular plants. At the terminal branches the identification of MIKC c and MIKC* genes in representative taxa is indicated. At the root of the tree a minimal set of MADS-box genes that was very likely present in the last common ancestor of mosses and vascular plants is shown. Branches are not drawn to scale. The separation of the lineages that led to extant mosses from the other taxa (vascular plants) occurred about 450 MYA. The separation of the lineage that led to extant ferns from the lineage that led to seed plants occurred about 400 MYA, as revealed by fossil and molecular evidence. genes, or vice versa, or both genes may have originated from a common ancestor which was neither a clear MIKC c nor an MIKC* gene. We will restrict our considerations here mainly to the I-domain, the most important character distinguishing both gene types. Svensson, Johannesson, and Engström (2000) argued that the MIKC* genes may represent the ancestral gene type and that MIKC c genes evolved under the loss of three exons in the I-region. However, the fact that MIKC c, but not MIKC*, genes have been isolated from a grade of green algae (Y. Tanabe, M. Hasebe, and M. Ito, personal communication) makes it appear much more likely that MIKC c rather than MIKC* genes are ancestral. Together with the phylogeny reconstructions shown in figure 4, this would imply that the MIKC* genes are monophyletic (a clade), whereas the MIKC c genes are paraphyletic. It cannot be completely ruled out, however, that MIKC* genes have just been overlooked in algae so far. Under the hypothesis that MIKC c genes are ancestral, an increase in the number of I-domain exons from one to four or five has to be postulated. A formal third possibility postulates the root between the MIKC* and the MIKC c genes. In that case, it seems also not unreasonable that the most recent common ancestor of MIKC c and MIKC* genes had neither one nor four I-domain exons. Rather, it may have had two, a relatively long and a short exon, and duplication of that arrangement may have led to the situation in extant MIKC* genes, whereas loss of the short exon occurred in the lineage that led to extant MIKC c genes. However, sequence similarity between the different exons of the I-domain does not support this scenario. In contrast, similarity between the first and second exon of the I-domain of PpMADS2 and PpMADS3 indicates that these originated by an exon duplication, implying that four rather than five I-domain exons is the ancestral state within the MIKC* genes. Have MIKC* Genes Been Lost During the Evolution of Higher Vascular Plants? If MIKC*-type genes are really absent from extant ferns and seed plants and are not just awaiting isolation from these taxa, this class of genes must have been lost in the lineage that led to extant ferns (sensu lato) and

11 seed plants; alternatively (but less likely), this class of genes was lost several times independently, e.g., at the base of extant ferns and seed plants. It can be assumed that not many, probably only one, MIKC*-type genes were present in the respective lineage, to make the loss of a complete class of genes a likely event. Even then, it raises the intriguing question as to what function MIKC* genes had and have, so that this gene type was conserved in mosses and clubmosses for about 450 MYA but became dispensable in the higher vascular plants. Definition, Ancestry, and Functional Importance of the K-domain The K-domain is a domain of about 70 amino acids roughly spanning positions of MIKC c -type M-domain proteins. It is shown here that MIKC*-type proteins too contain a K-domain and that the K-box was thus combined with the MADS-box before the gene duplication which generated both classes of genes more than 450 MYA. In line with this, MIKC-type genes with bona fide K-domains have even been isolated from charophyte algae (Y. Tanabe, M. Hasebe, and M. Ito, personal communication), suggesting that this gene type existed even before green plants colonized the land. Alvarez-Buylla et al. (2000b) have maintained that some Type II MADS-box genes (comprising MIKC c - type genes according to our definition and MEF2-like genes from fungi and animals) from Arabidopsis, such as AGL12 and AGL25-like genes, do not contain a K- box. They concluded that the K-box was acquired by an MADS-box gene during the course of Type II gene evolution, after the gene lineages that led to AGL12 and AGL25-like genes had already split off, because phylogeny reconstructions suggested that these genes were basal within the MIKC c gene tree (Alvarez-Buylla et al. 2000b). However, one should note that different definitions of K-domains are used in different studies. The K-domain (or box) was originally defined as a domain in some early-identified M-domain proteins which has a low but significant similarity to a portion of keratin sequences (Ma, Yanofsky, and Meyerowitz 1991). It was known that the region of keratin with similarity to the K-domains of the plant proteins is part of the coiledcoil sequence that constitutes the central rod-shaped domain of keratin, and thus it was postulated (but not demanded by definition!) that the K-domain adopts a coiled-coil structure. It was noted that K-domains can potentially form two amphipathic helices (Ma, Yanofsky, and Meyerowitz 1991) because of a regular spacing of hydrophobic amino acids. In the following years, sequence similarity to the early-identified K-domains and the regular spacing of hydrophobic amino acids were taken as criteria for identifying K-domains by other researchers, including ourselves (e.g., Davies and Schwarz-Sommer 1994; Rounsley, Ditta, and Yanofsky 1995; Theißen et al. 1995; Münster et al. 1997), partly because these criteria, which also clearly apply to AGL12 (fig. 1), are relatively straightforward. Physcomitrella MADS-box Genes 811 Whether the similarity between keratin and K-domains reflects common ancestry (homology) is unclear. We can be quite confident, however, that the similarity among the different K-domains of plant MADS-box genes is based on homology, because phylogenetic reconstructions employing M-domain sequences or I- and K-domain sequences give almost the same results (Theißen, Kim, and Saedler 1996). Thus, the original definition of the K-box (or domain) refers to a homologous stretch of DNA (or protein). We stick to that definition here because homology is certainly an appropriate concept in evolutionary considerations. For example, according to our definition and findings, there were two sequence elements, M-domain and K-domain, present in the most recent ancestor of MIKC c and MIKC* genes more than 450 MYA that have since then been coinherited through many gene duplications, sequence divergence, fixation, and speciation events, which yielded extant MIKC gene diversity (including AGL12 and AGL25-like genes). Nevertheless, there may have been rare cases of recombination (Alvarez-Buylla et al. 2000b). Note that these statements are independent of the actual higher-order structure that the different K-domains adopt. In contrast, Alvarez-Buylla et al. (2000b) use the formation of a coiled-coil structure as the defining character of a K-domain. However, because few mutations may be able to change that structure (as suggested for AGL12), this may not yield an especially stable phylogenetic signal. But even worse, there is not one K-domain for which the structure has been determined by physical methods, so that there is currently no way to assess the reliability of the computer methods used. And computer predictions of protein structures are notoriously unreliable when applied to novel types of sequences, as in this case, so that it may appear premature to switch already to a novel definition of the K-domain. But let us assume that computer modeling is able to accurately predict the structure of K-domain like sequences and use the formation of a coiled-coil structure for a definition of the K-domain. Even that would not imply a relatively recent origin of the M- and K-domain connection because preliminary attempts to root the MIKC c gene tree employing algae sequences did not place AGL12 near the base of the tree (Y. Tanabe, M. Hasebe, and M. Ito, personal communication). Interestingly, the programs that do not predict coiled-coil structures in AGL12 strongly predict such structures in all MIKC*-type proteins except LAMB1 (unpublished data). All this would argue that the situation in AGL12 (and in possibly some other proteins) is derived rather than basal. One should also take into consideration, however, that the computer programs predicting coiled-coils may need some refinements, especially when applied to principally novel types of sequences. Physical determination of K-domain structures (e.g., via X-ray diffraction or high-resolution NMR) will be required to clarify that case. Although the three-dimensional structure of the K- domain is highly speculative, its functional importance

Floral homeotic genes were recruited from homologous MADS-box genes preexisting in the common ancestor of ferns and seed plants

Floral homeotic genes were recruited from homologous MADS-box genes preexisting in the common ancestor of ferns and seed plants Proc. Natl. Acad. Sci. USA Vol. 94, pp. 2415 2420, March 1997 Evolution Floral homeotic genes were recruited from homologous MADS-box genes preexisting in the common ancestor of ferns and seed plants THOMAS

More information

The MADS-Box Floral Homeotic Gene Lineages Predate the Origin of Seed Plants: Phylogenetic and Molecular Clock Estimates

The MADS-Box Floral Homeotic Gene Lineages Predate the Origin of Seed Plants: Phylogenetic and Molecular Clock Estimates J Mol Evol (1997) 45:392 396 Springer-Verlag New York Inc. 1997 The MADS-Box Floral Homeotic Gene Lineages Predate the Origin of Seed Plants: Phylogenetic and Molecular Clock Estimates Michael D. Purugganan

More information

LAB 4: PHYLOGENIES & MAPPING TRAITS

LAB 4: PHYLOGENIES & MAPPING TRAITS LAB 4: PHYLOGENIES & MAPPING TRAITS *This is a good day to check your Physcomitrella (protonema, buds, gametophores?) and Ceratopteris cultures (embryos, young sporophytes?)* Phylogeny Introduction The

More information

8/23/2014. Phylogeny and the Tree of Life

8/23/2014. Phylogeny and the Tree of Life Phylogeny and the Tree of Life Chapter 26 Objectives Explain the following characteristics of the Linnaean system of classification: a. binomial nomenclature b. hierarchical classification List the major

More information

Chapter 18 Lecture. Concepts of Genetics. Tenth Edition. Developmental Genetics

Chapter 18 Lecture. Concepts of Genetics. Tenth Edition. Developmental Genetics Chapter 18 Lecture Concepts of Genetics Tenth Edition Developmental Genetics Chapter Contents 18.1 Differentiated States Develop from Coordinated Programs of Gene Expression 18.2 Evolutionary Conservation

More information

PHYLOGENY AND SYSTEMATICS

PHYLOGENY AND SYSTEMATICS AP BIOLOGY EVOLUTION/HEREDITY UNIT Unit 1 Part 11 Chapter 26 Activity #15 NAME DATE PERIOD PHYLOGENY AND SYSTEMATICS PHYLOGENY Evolutionary history of species or group of related species SYSTEMATICS Study

More information

Chapter 26: Phylogeny and the Tree of Life Phylogenies Show Evolutionary Relationships

Chapter 26: Phylogeny and the Tree of Life Phylogenies Show Evolutionary Relationships Chapter 26: Phylogeny and the Tree of Life You Must Know The taxonomic categories and how they indicate relatedness. How systematics is used to develop phylogenetic trees. How to construct a phylogenetic

More information

Bio 1B Lecture Outline (please print and bring along) Fall, 2007

Bio 1B Lecture Outline (please print and bring along) Fall, 2007 Bio 1B Lecture Outline (please print and bring along) Fall, 2007 B.D. Mishler, Dept. of Integrative Biology 2-6810, bmishler@berkeley.edu Evolution lecture #5 -- Molecular genetics and molecular evolution

More information

The Plant Cell, November. 2017, American Society of Plant Biologists. All rights reserved

The Plant Cell, November. 2017, American Society of Plant Biologists. All rights reserved The Genetics of Floral Development Teaching Guide Overview The development of flowers in angiosperm plants provided a critical evolutionary advantage, allowing more options for pollen dispersal and seed

More information

Phylogeny and systematics. Why are these disciplines important in evolutionary biology and how are they related to each other?

Phylogeny and systematics. Why are these disciplines important in evolutionary biology and how are they related to each other? Phylogeny and systematics Why are these disciplines important in evolutionary biology and how are they related to each other? Phylogeny and systematics Phylogeny: the evolutionary history of a species

More information

METHODS FOR DETERMINING PHYLOGENY. In Chapter 11, we discovered that classifying organisms into groups was, and still is, a difficult task.

METHODS FOR DETERMINING PHYLOGENY. In Chapter 11, we discovered that classifying organisms into groups was, and still is, a difficult task. Chapter 12 (Strikberger) Molecular Phylogenies and Evolution METHODS FOR DETERMINING PHYLOGENY In Chapter 11, we discovered that classifying organisms into groups was, and still is, a difficult task. Modern

More information

BIOLOGY 366 PLANT SYSTEMATICS EXAM POINTS TOTAL (LECTURE 100, LAB PRACTICAL 50)

BIOLOGY 366 PLANT SYSTEMATICS EXAM POINTS TOTAL (LECTURE 100, LAB PRACTICAL 50) BIOLOGY 366 PLANT SYSTEMATICS EXAM 1 150 POINTS TOTAL (LECTURE 100, LAB PRACTICAL 50) SECTION 1 (Short answer; 80 points total): Be as precise as possible in your answers. 1. What makes the alternation

More information

Characterization of MADS homeotic genes in the fern Ceratopteris richardii

Characterization of MADS homeotic genes in the fern Ceratopteris richardii Proc. Natl. Acad. Sci. USA Vol. 95, pp. 6222 6227, May 1998 Evolution Characterization of MADS homeotic genes in the fern Ceratopteris richardii MITSUYASU HASEBE*, CHI-KUANG WEN, MASAHIRO KATO, AND JO

More information

Processes of Evolution

Processes of Evolution 15 Processes of Evolution Forces of Evolution Concept 15.4 Selection Can Be Stabilizing, Directional, or Disruptive Natural selection can act on quantitative traits in three ways: Stabilizing selection

More information

Type I MADS-box genes have experienced faster birth-and-death evolution than type II MADS-box genes in angiosperms

Type I MADS-box genes have experienced faster birth-and-death evolution than type II MADS-box genes in angiosperms Type I MADS-box genes have experienced faster birth-and-death evolution than type II MADS-box genes in angiosperms Jongmin Nam, Joonyul Kim, Shinyoung Lee, Gynheung An, Hong Ma, and Masatoshi Nei Institute

More information

Dr. Amira A. AL-Hosary

Dr. Amira A. AL-Hosary Phylogenetic analysis Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut University-Egypt Phylogenetic Basics: Biological

More information

Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut

Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut University-Egypt Phylogenetic analysis Phylogenetic Basics: Biological

More information

C3020 Molecular Evolution. Exercises #3: Phylogenetics

C3020 Molecular Evolution. Exercises #3: Phylogenetics C3020 Molecular Evolution Exercises #3: Phylogenetics Consider the following sequences for five taxa 1-5 and the known outgroup O, which has the ancestral states (note that sequence 3 has changed from

More information

673 Comparative Genomics of Angiosperm MADS Box Genes Yale University, New Haven, CT. 674 The evolution of plant architecture in Brassicaceae

673 Comparative Genomics of Angiosperm MADS Box Genes Yale University, New Haven, CT. 674 The evolution of plant architecture in Brassicaceae 673 Comparative Genomics of Angiosperm MADS Box Genes Vivian F. Irish Yale University, New Haven, CT. MADS box genes encode key transcriptional regulators that have been implicated in the control of various

More information

Ms.Sastry, AP Biology Unit 4/Chp 26 to 34/Diversity 1 Chapter in class follow along lecture notes

Ms.Sastry, AP Biology Unit 4/Chp 26 to 34/Diversity 1 Chapter in class follow along lecture notes Ms.Sastry, AP Biology Unit 4/Chp 26 to 34/Diversity 1 Chapter 26 34 in class follow along lecture notes Chp 26 Origin of life: 1) When did earth form? 2) What is the order of evolution of life forms on

More information

SPECIATION. REPRODUCTIVE BARRIERS PREZYGOTIC: Barriers that prevent fertilization. Habitat isolation Populations can t get together

SPECIATION. REPRODUCTIVE BARRIERS PREZYGOTIC: Barriers that prevent fertilization. Habitat isolation Populations can t get together SPECIATION Origin of new species=speciation -Process by which one species splits into two or more species, accounts for both the unity and diversity of life SPECIES BIOLOGICAL CONCEPT Population or groups

More information

Lecture Notes: BIOL2007 Molecular Evolution

Lecture Notes: BIOL2007 Molecular Evolution Lecture Notes: BIOL2007 Molecular Evolution Kanchon Dasmahapatra (k.dasmahapatra@ucl.ac.uk) Introduction By now we all are familiar and understand, or think we understand, how evolution works on traits

More information

Cladistics and Bioinformatics Questions 2013

Cladistics and Bioinformatics Questions 2013 AP Biology Name Cladistics and Bioinformatics Questions 2013 1. The following table shows the percentage similarity in sequences of nucleotides from a homologous gene derived from five different species

More information

Reproductive Morphology

Reproductive Morphology Week 3; Wednesday Announcements: 1 st lab quiz TODAY Reproductive Morphology Reproductive morphology - any portion of a plant that is involved with or a direct product of sexual reproduction Example: cones,

More information

Gametophyte. (n) (2n) Sporangium. Sporophyte. (2n) N N

Gametophyte. (n) (2n) Sporangium. Sporophyte. (2n) N N Tetrads of spores (n) Spores (n) Gametophyte (n) Archegonium with egg (n) Antheridium with sperm (n) fertilization Meiosis Zygote (2n) Spore mother cell (2n) Sporangium (2n) Sporophyte (2n) Embryo (2n)

More information

What is a Plant? Plant Life Cycle. What did they evolve from? Original Habitat 1/15/2018. Plant Life Cycle Alternation of Generations

What is a Plant? Plant Life Cycle. What did they evolve from? Original Habitat 1/15/2018. Plant Life Cycle Alternation of Generations What is a Plant? Multicellular Eukaryotic Autotrophic (photosynthesis) Has cell walls containing cellulose Lack mobility (sessile) Display Alternation of Generations in their life cycle Introduction to

More information

CHAPTERS 24-25: Evidence for Evolution and Phylogeny

CHAPTERS 24-25: Evidence for Evolution and Phylogeny CHAPTERS 24-25: Evidence for Evolution and Phylogeny 1. For each of the following, indicate how it is used as evidence of evolution by natural selection or shown as an evolutionary trend: a. Paleontology

More information

POPULATION GENETICS Winter 2005 Lecture 17 Molecular phylogenetics

POPULATION GENETICS Winter 2005 Lecture 17 Molecular phylogenetics POPULATION GENETICS Winter 2005 Lecture 17 Molecular phylogenetics - in deriving a phylogeny our goal is simply to reconstruct the historical relationships between a group of taxa. - before we review the

More information

Chapter 27: Evolutionary Genetics

Chapter 27: Evolutionary Genetics Chapter 27: Evolutionary Genetics Student Learning Objectives Upon completion of this chapter you should be able to: 1. Understand what the term species means to biology. 2. Recognize the various patterns

More information

Evolution of gene network controlling plant reproductive development.

Evolution of gene network controlling plant reproductive development. 1 / 8 Evolution of floral development gene network. V Villimová. Evolution of gene network controlling plant reproductive development. Veronika Villimová. Master BioSciences, Département de Biologie, Ecole

More information

Chapter 26 Phylogeny and the Tree of Life

Chapter 26 Phylogeny and the Tree of Life Chapter 26 Phylogeny and the Tree of Life Chapter focus Shifting from the process of how evolution works to the pattern evolution produces over time. Phylogeny Phylon = tribe, geny = genesis or origin

More information

UoN, CAS, DBSC BIOL102 lecture notes by: Dr. Mustafa A. Mansi. The Phylogenetic Systematics (Phylogeny and Systematics)

UoN, CAS, DBSC BIOL102 lecture notes by: Dr. Mustafa A. Mansi. The Phylogenetic Systematics (Phylogeny and Systematics) - Phylogeny? - Systematics? The Phylogenetic Systematics (Phylogeny and Systematics) - Phylogenetic systematics? Connection between phylogeny and classification. - Phylogenetic systematics informs the

More information

Unit 7: Evolution Guided Reading Questions (80 pts total)

Unit 7: Evolution Guided Reading Questions (80 pts total) AP Biology Biology, Campbell and Reece, 10th Edition Adapted from chapter reading guides originally created by Lynn Miriello Name: Unit 7: Evolution Guided Reading Questions (80 pts total) Chapter 22 Descent

More information

Unit 7: Plant Evolution, Structure and Function

Unit 7: Plant Evolution, Structure and Function Time: 7 Days (some time spent working over breaks on this topic) and then an exam 16% of the AP Exam is on this material. Topics Covered: Reproduction, growth, and development Structural, physiological,

More information

Supporting Information

Supporting Information Supporting Information Das et al. 10.1073/pnas.1302500110 < SP >< LRRNT > < LRR1 > < LRRV1 > < LRRV2 Pm-VLRC M G F V V A L L V L G A W C G S C S A Q - R Q R A C V E A G K S D V C I C S S A T D S S P E

More information

Plant Systematics and Plant/Pollinator Interactions. Jacob Landis

Plant Systematics and Plant/Pollinator Interactions. Jacob Landis Plant Systematics and Plant/Pollinator Interactions Jacob Landis Why study plants Important for food Clothing Drugs Bryophytes Lycophytes,Monilophytes Gymnosperms Angiosperms Mosses, Liverworts, Hornworts

More information

Algorithms in Bioinformatics FOUR Pairwise Sequence Alignment. Pairwise Sequence Alignment. Convention: DNA Sequences 5. Sequence Alignment

Algorithms in Bioinformatics FOUR Pairwise Sequence Alignment. Pairwise Sequence Alignment. Convention: DNA Sequences 5. Sequence Alignment Algorithms in Bioinformatics FOUR Sami Khuri Department of Computer Science San José State University Pairwise Sequence Alignment Homology Similarity Global string alignment Local string alignment Dot

More information

SCIENTIFIC EVIDENCE TO SUPPORT THE THEORY OF EVOLUTION. Using Anatomy, Embryology, Biochemistry, and Paleontology

SCIENTIFIC EVIDENCE TO SUPPORT THE THEORY OF EVOLUTION. Using Anatomy, Embryology, Biochemistry, and Paleontology SCIENTIFIC EVIDENCE TO SUPPORT THE THEORY OF EVOLUTION Using Anatomy, Embryology, Biochemistry, and Paleontology Scientific Fields Different fields of science have contributed evidence for the theory of

More information

O 3 O 4 O 5. q 3. q 4. Transition

O 3 O 4 O 5. q 3. q 4. Transition Hidden Markov Models Hidden Markov models (HMM) were developed in the early part of the 1970 s and at that time mostly applied in the area of computerized speech recognition. They are first described in

More information

Outline. Classification of Living Things

Outline. Classification of Living Things Outline Classification of Living Things Chapter 20 Mader: Biology 8th Ed. Taxonomy Binomial System Species Identification Classification Categories Phylogenetic Trees Tracing Phylogeny Cladistic Systematics

More information

Evolutionary Morphology of Land Plants

Evolutionary Morphology of Land Plants MHRD Scheme on Global Initiative on Academic Network (GIAN) Evolutionary Morphology of Land Plants Overview This century will be dominated by Life Sciences for well reasons of understanding evolution,

More information

Genome-wide analysis of the MYB transcription factor superfamily in soybean

Genome-wide analysis of the MYB transcription factor superfamily in soybean Du et al. BMC Plant Biology 2012, 12:106 RESEARCH ARTICLE Open Access Genome-wide analysis of the MYB transcription factor superfamily in soybean Hai Du 1,2,3, Si-Si Yang 1,2, Zhe Liang 4, Bo-Run Feng

More information

75 Development of floral organ identity: stories from the MADS house Günter Theißen Recent studies on AGAMOUS-LIKE2-, DEFICIENS- and GLOBOSA-like MADS

75 Development of floral organ identity: stories from the MADS house Günter Theißen Recent studies on AGAMOUS-LIKE2-, DEFICIENS- and GLOBOSA-like MADS 75 Development of floral organ identity: stories from the MADS house Günter Theißen Recent studies on AGAMOUS-LIKE2-, DEFICIENS- and GLOBOSA-like MADS-box genes in diverse seed plant species have provided

More information

Organization of Genes Differs in Prokaryotic and Eukaryotic DNA Chapter 10 p

Organization of Genes Differs in Prokaryotic and Eukaryotic DNA Chapter 10 p Organization of Genes Differs in Prokaryotic and Eukaryotic DNA Chapter 10 p.110-114 Arrangement of information in DNA----- requirements for RNA Common arrangement of protein-coding genes in prokaryotes=

More information

Phylogenetics. Applications of phylogenetics. Unrooted networks vs. rooted trees. Outline

Phylogenetics. Applications of phylogenetics. Unrooted networks vs. rooted trees. Outline Phylogenetics Todd Vision iology 522 March 26, 2007 pplications of phylogenetics Studying organismal or biogeographic history Systematics ating events in the fossil record onservation biology Studying

More information

Announcements. Lab Quiz #1 on Monday: (30pts) conifers + cones, vegetative morphology. Study: Display case outside HCK 132 with labeled conifers

Announcements. Lab Quiz #1 on Monday: (30pts) conifers + cones, vegetative morphology. Study: Display case outside HCK 132 with labeled conifers Announcements Lab Quiz #1 on Monday: (30pts) conifers + cones, vegetative morphology Study: Display case outside HCK 132 with labeled conifers Movie: Sexual Encounters of the Floral Kind Intro to Keying/Greenhouse

More information

Anatomy of a tree. clade is group of organisms with a shared ancestor. a monophyletic group shares a single common ancestor = tapirs-rhinos-horses

Anatomy of a tree. clade is group of organisms with a shared ancestor. a monophyletic group shares a single common ancestor = tapirs-rhinos-horses Anatomy of a tree outgroup: an early branching relative of the interest groups sister taxa: taxa derived from the same recent ancestor polytomy: >2 taxa emerge from a node Anatomy of a tree clade is group

More information

Using phylogenetics to estimate species divergence times... Basics and basic issues for Bayesian inference of divergence times (plus some digression)

Using phylogenetics to estimate species divergence times... Basics and basic issues for Bayesian inference of divergence times (plus some digression) Using phylogenetics to estimate species divergence times... More accurately... Basics and basic issues for Bayesian inference of divergence times (plus some digression) "A comparison of the structures

More information

"PRINCIPLES OF PHYLOGENETICS: ECOLOGY AND EVOLUTION" Integrative Biology 200B Spring 2011

PRINCIPLES OF PHYLOGENETICS: ECOLOGY AND EVOLUTION Integrative Biology 200B Spring 2011 "PRINCIPLES OF PHYLOGENETICS: ECOLOGY AND EVOLUTION" Integrative Biology 200B Spring 2011 Evolution and development ("evo-devo") The last frontier in our understanding of biological forms is an understanding

More information

Phylogenetic inference

Phylogenetic inference Phylogenetic inference Bas E. Dutilh Systems Biology: Bioinformatic Data Analysis Utrecht University, March 7 th 016 After this lecture, you can discuss (dis-) advantages of different information types

More information

Session 5: Phylogenomics

Session 5: Phylogenomics Session 5: Phylogenomics B.- Phylogeny based orthology assignment REMINDER: Gene tree reconstruction is divided in three steps: homology search, multiple sequence alignment and model selection plus tree

More information

C.DARWIN ( )

C.DARWIN ( ) C.DARWIN (1809-1882) LAMARCK Each evolutionary lineage has evolved, transforming itself, from a ancestor appeared by spontaneous generation DARWIN All organisms are historically interconnected. Their relationships

More information

Unit 2B- The Plants. Plants can be classified according to the presence or absence of vascular tissue.

Unit 2B- The Plants. Plants can be classified according to the presence or absence of vascular tissue. Unit 2B- The Plants Botany is the study of plants. All plants are said to have a common ancestor; (ie.) it is thought that plants have evolved from an ancient group of green algae. Plants and green algae

More information

A PARSIMONY APPROACH TO ANALYSIS OF HUMAN SEGMENTAL DUPLICATIONS

A PARSIMONY APPROACH TO ANALYSIS OF HUMAN SEGMENTAL DUPLICATIONS A PARSIMONY APPROACH TO ANALYSIS OF HUMAN SEGMENTAL DUPLICATIONS CRYSTAL L. KAHN and BENJAMIN J. RAPHAEL Box 1910, Brown University Department of Computer Science & Center for Computational Molecular Biology

More information

BLAST. Varieties of BLAST

BLAST. Varieties of BLAST BLAST Basic Local Alignment Search Tool (1990) Altschul, Gish, Miller, Myers, & Lipman Uses short-cuts or heuristics to improve search speed Like speed-reading, does not examine every nucleotide of database

More information

Kingdom Plantae. Biology : A Brief Survey of Plants. Jun 22 7:09 PM

Kingdom Plantae. Biology : A Brief Survey of Plants. Jun 22 7:09 PM Kingdom Plantae Biology 2201 6.1 6.2 : A Brief Survey of Plants The study of plants is called botany. Plants are believed to have evolved from green algae. The main plant (land) characteristics are as

More information

Name: Class: Date: ID: A

Name: Class: Date: ID: A Class: _ Date: _ Ch 17 Practice test 1. A segment of DNA that stores genetic information is called a(n) a. amino acid. b. gene. c. protein. d. intron. 2. In which of the following processes does change

More information

GENETICS - CLUTCH CH.22 EVOLUTIONARY GENETICS.

GENETICS - CLUTCH CH.22 EVOLUTIONARY GENETICS. !! www.clutchprep.com CONCEPT: OVERVIEW OF EVOLUTION Evolution is a process through which variation in individuals makes it more likely for them to survive and reproduce There are principles to the theory

More information

InDel 3-5. InDel 8-9. InDel 3-5. InDel 8-9. InDel InDel 8-9

InDel 3-5. InDel 8-9. InDel 3-5. InDel 8-9. InDel InDel 8-9 Lecture 5 Alignment I. Introduction. For sequence data, the process of generating an alignment establishes positional homologies; that is, alignment provides the identification of homologous phylogenetic

More information

Reconstructing the history of lineages

Reconstructing the history of lineages Reconstructing the history of lineages Class outline Systematics Phylogenetic systematics Phylogenetic trees and maps Class outline Definitions Systematics Phylogenetic systematics/cladistics Systematics

More information

A Phylogenetic Network Construction due to Constrained Recombination

A Phylogenetic Network Construction due to Constrained Recombination A Phylogenetic Network Construction due to Constrained Recombination Mohd. Abdul Hai Zahid Research Scholar Research Supervisors: Dr. R.C. Joshi Dr. Ankush Mittal Department of Electronics and Computer

More information

AP Biology. Cladistics

AP Biology. Cladistics Cladistics Kingdom Summary Review slide Review slide Classification Old 5 Kingdom system Eukaryote Monera, Protists, Plants, Fungi, Animals New 3 Domain system reflects a greater understanding of evolution

More information

AP Biology. Evolution of Land Plants. Kingdom: Plants. Plant Diversity. Animal vs. Plant life cycle. Bryophytes: mosses & liverworts

AP Biology. Evolution of Land Plants. Kingdom: Plants. Plant Diversity. Animal vs. Plant life cycle. Bryophytes: mosses & liverworts Kingdom: Plants Domain Eukarya Domain Domain Domain Eubacteria Archaea Eukarya 2007-2008 Common ancestor Evolution of Land Plants 500 mya land plants evolved special adaptations for life on dry land protection

More information

Classification, Phylogeny yand Evolutionary History

Classification, Phylogeny yand Evolutionary History Classification, Phylogeny yand Evolutionary History The diversity of life is great. To communicate about it, there must be a scheme for organization. There are many species that would be difficult to organize

More information

"Nothing in biology makes sense except in the light of evolution Theodosius Dobzhansky

Nothing in biology makes sense except in the light of evolution Theodosius Dobzhansky MOLECULAR PHYLOGENY "Nothing in biology makes sense except in the light of evolution Theodosius Dobzhansky EVOLUTION - theory that groups of organisms change over time so that descendeants differ structurally

More information

BINF6201/8201. Molecular phylogenetic methods

BINF6201/8201. Molecular phylogenetic methods BINF60/80 Molecular phylogenetic methods 0-7-06 Phylogenetics Ø According to the evolutionary theory, all life forms on this planet are related to one another by descent. Ø Traditionally, phylogenetics

More information

MACROEVOLUTION Student Packet SUMMARY EVOLUTION IS A CHANGE IN THE GENETIC MAKEUP OF A POPULATION OVER TIME Macroevolution refers to large-scale

MACROEVOLUTION Student Packet SUMMARY EVOLUTION IS A CHANGE IN THE GENETIC MAKEUP OF A POPULATION OVER TIME Macroevolution refers to large-scale MACROEVOLUTION Student Packet SUMMARY EVOLUTION IS A CHANGE IN THE GENETIC MAKEUP OF A POPULATION OVER TIME Macroevolution refers to large-scale evolutionary changes such as speciation events, origin of

More information

Diversity of Plants How Plants Colonized the Land

Diversity of Plants How Plants Colonized the Land Chapter 29, 30. Diversity of Plants How Plants Colonized the Land 1 The first plants For more than 3 billion years, Earth s terrestrial surface was lifeless life evolved in the seas 1st photosynthetic

More information

Kingdom Plantae. A Brief Survey of Plants

Kingdom Plantae. A Brief Survey of Plants Kingdom Plantae A Brief Survey of Plants The study of plants is called botany. Plants are believed to have evolved from green algae. The main plant (land) characteristics are as follows: 1. Common cellular

More information

Phylogenetic analysis. Characters

Phylogenetic analysis. Characters Typical steps: Phylogenetic analysis Selection of taxa. Selection of characters. Construction of data matrix: character coding. Estimating the best-fitting tree (model) from the data matrix: phylogenetic

More information

Constructing Evolutionary/Phylogenetic Trees

Constructing Evolutionary/Phylogenetic Trees Constructing Evolutionary/Phylogenetic Trees 2 broad categories: istance-based methods Ultrametric Additive: UPGMA Transformed istance Neighbor-Joining Character-based Maximum Parsimony Maximum Likelihood

More information

Kingdom: Plantae. Domain Archaea. Domain Eukarya. Domain Bacteria. Common ancestor

Kingdom: Plantae. Domain Archaea. Domain Eukarya. Domain Bacteria. Common ancestor Kingdom: Plantae Domain Eukarya Domain Bacteria Domain Archaea Domain Eukarya Common ancestor The First Plants For more than 3 billion years, Earth s terrestrial surface was lifeless life evolved in the

More information

Genetic transcription and regulation

Genetic transcription and regulation Genetic transcription and regulation Central dogma of biology DNA codes for DNA DNA codes for RNA RNA codes for proteins not surprisingly, many points for regulation of the process DNA codes for DNA replication

More information

Illegitimate translation causes unexpected gene expression from on-target out-of-frame alleles

Illegitimate translation causes unexpected gene expression from on-target out-of-frame alleles Illegitimate translation causes unexpected gene expression from on-target out-of-frame alleles created by CRISPR-Cas9 Shigeru Makino, Ryutaro Fukumura, Yoichi Gondo* Mutagenesis and Genomics Team, RIKEN

More information

Phylogenetic relationship among S. castellii, S. cerevisiae and C. glabrata.

Phylogenetic relationship among S. castellii, S. cerevisiae and C. glabrata. Supplementary Note S2 Phylogenetic relationship among S. castellii, S. cerevisiae and C. glabrata. Phylogenetic trees reconstructed by a variety of methods from either single-copy orthologous loci (Class

More information

Intraspecific gene genealogies: trees grafting into networks

Intraspecific gene genealogies: trees grafting into networks Intraspecific gene genealogies: trees grafting into networks by David Posada & Keith A. Crandall Kessy Abarenkov Tartu, 2004 Article describes: Population genetics principles Intraspecific genetic variation

More information

Evolutionary analysis of the well characterized endo16 promoter reveals substantial variation within functional sites

Evolutionary analysis of the well characterized endo16 promoter reveals substantial variation within functional sites Evolutionary analysis of the well characterized endo16 promoter reveals substantial variation within functional sites Paper by: James P. Balhoff and Gregory A. Wray Presentation by: Stephanie Lucas Reviewed

More information

Genetic Variation: The genetic substrate for natural selection. Horizontal Gene Transfer. General Principles 10/2/17.

Genetic Variation: The genetic substrate for natural selection. Horizontal Gene Transfer. General Principles 10/2/17. Genetic Variation: The genetic substrate for natural selection What about organisms that do not have sexual reproduction? Horizontal Gene Transfer Dr. Carol E. Lee, University of Wisconsin In prokaryotes:

More information

Plants Review 1. List the 6 general characteristics of plants. 2. What did plants probably evolve from? 3. What are some advantages for life on land

Plants Review 1. List the 6 general characteristics of plants. 2. What did plants probably evolve from? 3. What are some advantages for life on land Plants Review 1. List the 6 general characteristics of plants. 2. What did plants probably evolve from? 3. What are some advantages for life on land for a plant? 4. What are the 3 main groups of plants?

More information

Investigation 3: Comparing DNA Sequences to Understand Evolutionary Relationships with BLAST

Investigation 3: Comparing DNA Sequences to Understand Evolutionary Relationships with BLAST Investigation 3: Comparing DNA Sequences to Understand Evolutionary Relationships with BLAST Introduction Bioinformatics is a powerful tool which can be used to determine evolutionary relationships and

More information

Review of molecular biology

Review of molecular biology Review of molecular biology DNA is into RNA, which is into protein. What mrna sequence would be transcribed from the DNA template CTA? What sequence of trna would be attracted by the above mrna sequence?

More information

Genomes and Their Evolution

Genomes and Their Evolution Chapter 21 Genomes and Their Evolution PowerPoint Lecture Presentations for Biology Eighth Edition Neil Campbell and Jane Reece Lectures by Chris Romero, updated by Erin Barley with contributions from

More information

Inferring phylogeny. Constructing phylogenetic trees. Tõnu Margus. Bioinformatics MTAT

Inferring phylogeny. Constructing phylogenetic trees. Tõnu Margus. Bioinformatics MTAT Inferring phylogeny Constructing phylogenetic trees Tõnu Margus Contents What is phylogeny? How/why it is possible to infer it? Representing evolutionary relationships on trees What type questions questions

More information

Bioinformatics tools for phylogeny and visualization. Yanbin Yin

Bioinformatics tools for phylogeny and visualization. Yanbin Yin Bioinformatics tools for phylogeny and visualization Yanbin Yin 1 Homework assignment 5 1. Take the MAFFT alignment http://cys.bios.niu.edu/yyin/teach/pbb/purdue.cellwall.list.lignin.f a.aln as input and

More information

Chapters 25 and 26. Searching for Homology. Phylogeny

Chapters 25 and 26. Searching for Homology. Phylogeny Chapters 25 and 26 The Origin of Life as we know it. Phylogeny traces evolutionary history of taxa Systematics- analyzes relationships (modern and past) of organisms Figure 25.1 A gallery of fossils The

More information

BIOINFORMATICS: An Introduction

BIOINFORMATICS: An Introduction BIOINFORMATICS: An Introduction What is Bioinformatics? The term was first coined in 1988 by Dr. Hwa Lim The original definition was : a collective term for data compilation, organisation, analysis and

More information

BIOLOGY 317 Spring First Hourly Exam 4/20/12

BIOLOGY 317 Spring First Hourly Exam 4/20/12 Name: Lab sect. (TA name/time): BIOLOGY 317 Spring 2012 First Hourly Exam 4/20/12 1) (24 pts) Match the letter of the family given on the right with the characteristics for a plant described on the left.

More information

Theory of Evolution Charles Darwin

Theory of Evolution Charles Darwin Theory of Evolution Charles arwin 858-59: Origin of Species 5 year voyage of H.M.S. eagle (83-36) Populations have variations. Natural Selection & Survival of the fittest: nature selects best adapted varieties

More information

USING BLAST TO IDENTIFY PROTEINS THAT ARE EVOLUTIONARILY RELATED ACROSS SPECIES

USING BLAST TO IDENTIFY PROTEINS THAT ARE EVOLUTIONARILY RELATED ACROSS SPECIES USING BLAST TO IDENTIFY PROTEINS THAT ARE EVOLUTIONARILY RELATED ACROSS SPECIES HOW CAN BIOINFORMATICS BE USED AS A TOOL TO DETERMINE EVOLUTIONARY RELATIONSHPS AND TO BETTER UNDERSTAND PROTEIN HERITAGE?

More information

MiGA: The Microbial Genome Atlas

MiGA: The Microbial Genome Atlas December 12 th 2017 MiGA: The Microbial Genome Atlas Jim Cole Center for Microbial Ecology Dept. of Plant, Soil & Microbial Sciences Michigan State University East Lansing, Michigan U.S.A. Where I m From

More information

Three types of RNA polymerase in eukaryotic nuclei

Three types of RNA polymerase in eukaryotic nuclei Three types of RNA polymerase in eukaryotic nuclei Type Location RNA synthesized Effect of α-amanitin I Nucleolus Pre-rRNA for 18,.8 and 8S rrnas Insensitive II Nucleoplasm Pre-mRNA, some snrnas Sensitive

More information

Integrative Biology 200 "PRINCIPLES OF PHYLOGENETICS" Spring 2018 University of California, Berkeley

Integrative Biology 200 PRINCIPLES OF PHYLOGENETICS Spring 2018 University of California, Berkeley Integrative Biology 200 "PRINCIPLES OF PHYLOGENETICS" Spring 2018 University of California, Berkeley B.D. Mishler Feb. 14, 2018. Phylogenetic trees VI: Dating in the 21st century: clocks, & calibrations;

More information

Evolution and Development of Flower Diversity. Kelsey Galimba Di Stilio Lab Department of Biology University of Washington

Evolution and Development of Flower Diversity. Kelsey Galimba Di Stilio Lab Department of Biology University of Washington Evolution and Development of Flower Diversity Kelsey Galimba Di Stilio Lab Department of Biology University of Washington Lecture Outline Evolution of Angiosperms! The first flower! Morphological Diversity

More information

Mathangi Thiagarajan Rice Genome Annotation Workshop May 23rd, 2007

Mathangi Thiagarajan Rice Genome Annotation Workshop May 23rd, 2007 -2 Transcript Alignment Assembly and Automated Gene Structure Improvements Using PASA-2 Mathangi Thiagarajan mathangi@jcvi.org Rice Genome Annotation Workshop May 23rd, 2007 About PASA PASA is an open

More information

Organizing Life s Diversity

Organizing Life s Diversity 17 Organizing Life s Diversity section 2 Modern Classification Classification systems have changed over time as information has increased. What You ll Learn species concepts methods to reveal phylogeny

More information

CHAPTER 29 PLANT DIVERSITY I: HOW PLANTS COLONIZED LAND. Section A: An Overview of Land Plant Evolution

CHAPTER 29 PLANT DIVERSITY I: HOW PLANTS COLONIZED LAND. Section A: An Overview of Land Plant Evolution CHAPTER 29 PLANT DIVERSITY I: HOW PLANTS COLONIZED LAND Section A: An Overview of Land Plant Evolution 1. Evolutionary adaptations to terrestrial living characterize the four main groups of land plants

More information

Macroevolution Part I: Phylogenies

Macroevolution Part I: Phylogenies Macroevolution Part I: Phylogenies Taxonomy Classification originated with Carolus Linnaeus in the 18 th century. Based on structural (outward and inward) similarities Hierarchal scheme, the largest most

More information

Chapter 26 Phylogeny and the Tree of Life

Chapter 26 Phylogeny and the Tree of Life Chapter 26 Phylogeny and the Tree of Life Biologists estimate that there are about 5 to 100 million species of organisms living on Earth today. Evidence from morphological, biochemical, and gene sequence

More information

22 1 Introduction to Plants Slide 2 of 33

22 1 Introduction to Plants Slide 2 of 33 2 of 33 What Is a Plant? What is a plant? 3 of 33 What Is a Plant? What Is a Plant? Plants are multicellular eukaryotes that have cell walls made of cellulose. Plants develop from multicellular embryos

More information

Biol/Env St 204 Quiz 2 Spring 2008

Biol/Env St 204 Quiz 2 Spring 2008 Biol/Env St 204 Quiz 2 Spring 2008 Name: 40 points Short-Answer Section (20 points total) 1. In the reading Interview with a Fungus, why did Mr. Pilobolus conclude that fungi characterize mankind as expendable?

More information