Complete Sequence, Gene Arrangement, and Genetic Code of Mitochondrial DNA of the Cephalochordate Branchiostoma floridae (Amphioxus)


Jeffrey L. Boore, L. Lynne Daehler, and Wesley M. Brown
Department of Biology, University of Michigan, Ann Arbor

We have determined the 15,083-nucleotide (nt) sequence of the mitochondrial DNA (mtDNA) of the lancelet Branchiostoma floridae (Chordata: Cephalochordata). As is typical in metazoans, the mtDNA encodes 13 protein, 2 rRNA, and 22 tRNA genes. The gene arrangement differs from the common vertebrate arrangement by only four tRNA gene positions. Three of these are unique to Branchiostoma, but the fourth is in a position that is primitive for chordates. It shares the genetic code variations found in vertebrate mtDNAs except that AGA specifies serine, a code variation found in many invertebrate phyla but not in vertebrates (the related codon AGG was not found). Branchiostoma mtDNA lacks a vertebrate-like control region; its largest noncoding region (129 nt) is unremarkable in sequence or base composition, and its location between ND5 and tRNA-G differs from that usually found in vertebrates. It also lacks a potential hairpin DNA structure like those found in many (though not all) vertebrates to serve as the second-strand (i.e., L-strand) origin of replication. Perhaps related to this, the sequence corresponding to the DHU arm of tRNA-C cannot form a helical stem, a condition found in a few vertebrate mtDNAs that also lack a canonical L-strand origin of replication. ATG and GTG codons appear to initiate translation in 11 and 2 of the protein-encoding genes, respectively. Protein genes end with complete (TAA or TAG) or incomplete (T or TA) stop codons; the latter are presumably converted to TAA by post-transcriptional polyadenylation.

Introduction

Complete mitochondrial DNA (mtDNA) sequences have been determined for 36 vertebrate species and partial sequences for hundreds of others.
All are circular DNA molecules containing 37 genes: 13 for proteins (COI–III, ND1–6, ND4L, Cytb, A6, A8); two for rRNAs (srRNA and lrRNA); and 22 for tRNAs (designated by the one-letter amino acid code, with the two S and two L tRNAs differentiated by the codons recognized [AGN/UCN and CUN/UUR, respectively]). The genes are arranged very compactly, with no introns and few intergenic nucleotides. However, all vertebrate mtDNAs examined have a single, large, noncoding region, highly variable in size among (and sometimes within) species, that contains signalling elements for regulating transcription and replication (reviewed by Shadel and Clayton 1997).

Comparisons of mitochondrial systems are useful for modeling genome evolution and for phylogenetic inference. Many complex features are available for comparison: modes of replication and transcription; RNA processing; protein, tRNA, and rRNA secondary structures; patterns of transcript editing; genetic code variations; and the relative arrangements of genes (Sankoff et al. 1992; Smith et al. 1993; Boore and Brown 1994; Boore et al. 1995; Kumazawa and Nishida 1995; Boore 1996; Boore, Lavrov, and Brown 1998). While these (and many additional) features are also present in nuclear genomes, they are currently more accessible for study in the much smaller and simpler mitochondrial genomes (Garesse et al. 1997). Here we describe and analyze the complete sequence of Branchiostoma floridae mtDNA (GenBank accession number AF098298). This species belongs to the Cephalochordata, a chordate lineage that diverged before the vertebrate radiation.

Key words: Branchiostoma, amphioxus, mitochondria, evolution, chordate, genome.
Address for correspondence and reprints: J. L. Boore, Department of Biology, University of Michigan, 830 N. University Avenue, Ann Arbor, Michigan jboore@umich.edu.
Mol. Biol. Evol. 16(3): by the Society for Molecular Biology and Evolution. ISSN:

Materials and Methods

Live specimens of B. floridae were purchased from Gulf Specimen Co., Panacea, Fla. Mitochondrial DNA was purified by cesium chloride–ethidium bromide centrifugation as in Wright, Spolsky, and Brown (1983). A detailed restriction map was determined, separating radiolabeled fragments on both 1% agarose and 3.5% acrylamide gels, using conditions that allowed detection of fragments as small as 40 base pairs (bp) (Brown 1980). The entire Branchiostoma mtDNA was then cloned into the lambda vector EMBL4 (Stratagene) using the unique EcoRI site at position 8,834–8,839 (numbering from the first nucleotide of COI; see figs. 1 and 2). This clone was verified by comparing restriction enzyme fragment sizes with those expected from the cleavage map of the native mtDNA. After subsequent digestion with other restriction enzymes, mtDNA fragments were subcloned into pBluescript plasmids (Stratagene) and sequenced; some of the longer fragments required additional exonuclease deletion cloning steps (Erase-a-Base; Promega). Sequences were determined on both strands using dideoxynucleotide terminators and radiolabeled nucleotides (Sanger, Nicklen, and Coulson 1977); oligonucleotide sequencing primers were designed as necessary. Protein-encoding genes were identified by sequence similarity of open reading frames to mitochondrial gene sequences of Cyprinus carpio (Chang, Huang, and Lo 1994). Transfer RNA genes were identified by their potential to form tRNA-like secondary structures; specific identifications were made according to anticodon sequences.
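The cloning strategy depends on EcoRI cutting the circular mtDNA exactly once. As a minimal sketch of how such a check could be run on a circular sequence, the Python below scans for the EcoRI recognition sequence GAATTC, including matches that span the arbitrary origin. The function name `find_sites_circular` and the toy sequences are hypothetical illustrations, not part of the original methods.

```python
def find_sites_circular(seq: str, site: str = "GAATTC") -> list[int]:
    """Return 1-based start positions of `site` in a circular sequence."""
    seq = seq.upper()
    n = len(seq)
    # Append the first len(site) - 1 bases so that a site spanning the
    # origin of the circular molecule is still found.
    extended = seq + seq[: len(site) - 1]
    positions = []
    start = extended.find(site)
    while start != -1 and start < n:
        positions.append(start + 1)  # convert to 1-based numbering
        start = extended.find(site, start + 1)
    return positions


# Toy sequences (hypothetical, not taken from the B. floridae genome).
internal_site = "ACGTGAATTCACGT"   # one site fully inside the sequence
origin_spanning = "TTCACGTGAA"     # one site wrapping around the origin

for name, toy in [("internal", internal_site), ("spanning", origin_spanning)]:
    hits = find_sites_circular(toy)
    print(name, hits, "unique" if len(hits) == 1 else "not unique")
```

A single hit indicates that digestion would linearize the molecule without losing any fragment, which is what makes a unique site convenient for cloning the whole genome.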

FIG. 1. The gene map of Branchiostoma floridae mtDNA. Genes are abbreviated as in the text and scaling is only approximate; NC refers to the largest noncoding region. Asterisks mark those genes whose positions differ from those of the basic vertebrate arrangement as exemplified by human mtDNA (Anderson et al. 1981) (the choice to mark M rather than Q is arbitrary, as these are in switched positions). Transfer RNA genes identified outside of the circle are transcribed clockwise in this figure, as are all other genes except ND6; ND6 and the tRNAs marked inside are transcribed from the opposite strand. An arc marks a homologous portion previously determined for a congeneric species, Branchiostoma lanceolatum (Delarbre et al. 1997).

Results and Discussion

Gene Content and Organization

Complete mtDNA sequences have been published for 36 vertebrate species. The 37 encoded genes are arranged identically in most, but minor variations of the basic arrangement have been found in some animals (sea lamprey [Lee and Kocher 1995], some frogs [Yoneyama 1987; Fujii et al. 1988], reptiles [Seutin et al. 1994; Kumazawa and Nishida 1995; Kumazawa et al. 1996; Quinn and Mindell 1996; Janke and Arnason 1997; Macey et al. 1997], birds [Glaus et al. 1980; Desjardins, Ramirez, and Morais 1990; Desjardins and Morais 1990, 1991; Quinn and Wilson 1993; Ramirez, Savoie, and Morais 1993; Harlid, Janke, and Arnason 1997], and marsupials [Pääbo et al. 1991; Janke et al. 1994]). Each of these variations appears to be derived independently, because the basic arrangement is shared among several vertebrate classes and none of the variations are shared among distantly related groups. This inference is strengthened further by the gene arrangement in Branchiostoma mtDNA, which, except for four tRNA gene positions, is identical to the basic vertebrate arrangement (as exemplified in mammals [Anderson et al. 1981], Xenopus [Roe et al. 1985], and bony fish [Chang, Huang, and Lo 1994]). None of the four variant positions is shared by the corresponding gene in any vertebrate species.

The four tRNA genes whose arrangement in Branchiostoma mtDNA differs from the basic vertebrate arrangement are those for glycine (G), phenylalanine (F), methionine (M), and asparagine (N) (fig. 1). Two of the differences (tRNA-M and tRNA-N) are the same as noted in the sequenced portion of Branchiostoma lanceolatum mtDNA (Delarbre et al. 1997). Because the positions of tRNA-G and tRNA-M are identical in the basic vertebrate arrangement and in the mtDNA of Drosophila (Clary and Wolstenholme 1985) and because the position of tRNA-F is similar in the basic arrangement and in the mtDNAs of echinoderms (Jacobs et al. 1988; Smith et al. 1989), it can be argued that the positions of these three tRNAs are derived in Branchiostoma and that their positions in the basic arrangement represent the primitive state for chordates.

From published data it is not possible to determine the primitive chordate position of tRNA-N. The primitive arrangement could be as in Branchiostoma mtDNA (between ND2 and tRNA-W), with its translocation to a position between tRNA-A and tRNA-C being a derived condition for vertebrates; alternatively, the basic vertebrate arrangement could be primitive, with an independent translocation in the lineage leading to Branchiostoma. This is resolved, however, by noting that the position of tRNA-N in a hemichordate mtDNA (S. Pääbo, personal communication) is identical to that in Branchiostoma. Assuming that Hemichordata is an outgroup to a clade of Cephalochordata + Vertebrata, the most parsimonious explanation is that the tRNA-N position in Branchiostoma mtDNA is primitive for chordates.

Branchiostoma mtDNA is arranged very compactly, even for a mitochondrial genome.
In total, there appear to be only 154 noncoding nucleotides (nt): 129 in a single region between ND5 and tRNA-G; 8 between tRNA-S(UCN) and tRNA-D; 2 each between tRNA-R and ND4L and between tRNA-F and tRNA-V; 1 each between tRNA-Q and ND2, ND6 and tRNA-E, and tRNA-E and Cytb; and 10 between tRNA-Y and COI (fig. 2). Several protein-encoding genes are predicted to end with abbreviated stop codons (see below). In six cases, adjacent genes overlap (COI–tRNA-S(UCN), A8–A6, tRNA-G–ND6, ND2–tRNA-N, tRNA-N–tRNA-W, and tRNA-W–tRNA-A); however, in all except one (A8–A6, discussed below) the overlapping genes are encoded on opposite DNA strands. The lack of overlap of the ND4L–ND4 genes is unusual.

Base Composition

Overall, B. floridae mtDNA is 63% A+T, slightly higher than other chordate mtDNAs (e.g., Petromyzon marinus 62% [Lee and Kocher 1995], C. carpio 57% [Chang, Huang, and Lo 1994], and Protopterus dolloi 58% [Zardoya and Meyer 1996]). The 11,262 nt making up the protein-encoding genes are 62% A+T, nearly identical to the mtDNA overall (table 1). The A+T content of first, second, and third codon positions is 54%, 62%, and 70%, respectively. As is typical of metazoan mtDNAs (Cardon et al. 1994), the dinucleotide CpG is significantly underrepresented in Branchiostoma mtDNA.

In earlier studies (Naylor and Brown 1997, 1998), comparison of the protein-encoding portions of the B. floridae mtDNA sequence with those of other metazoans failed to confirm Branchiostoma as the sister taxon to vertebrates, suggesting instead that it is the sister taxon to (echinoderms + vertebrates) (which was viewed as artifactual by the authors). From review of the original data used for these studies, 13 sequencing errors were detected, resulting in frameshifts in COI, ND2, ND4L, and ND6. A total of 79 amino acids were inferred in error for those previous studies, about 2% of the total amino acids analyzed; whether this would significantly affect their conclusions awaits further analysis.

FIG. 2. A partly schematic representation of the mtDNA sequence of Branchiostoma floridae. Numbers within the slash marks indicate omitted nucleotides. For two genes, the inferred initiation codon is GTG; the corresponding amino acid (M) is in parentheses here to indicate presumed nonconformity with the generally employed genetic code. (Mitochondrial proteins appear to initiate with formyl-methionine [Smith and Marcker 1968] as do those of their bacterial progenitors.) Stop codons, including those inferred to be abbreviated, are marked by an asterisk and the single large noncoding region by a row of dots. A dart marks the last nucleotide of each gene and indicates the direction of transcription. The EcoRI restriction enzyme site used for cloning (positions 8,826–8,831) is underlined.

Table 1
Number of Occurrences and Percentage of Total of the 3,754 Codons in the 13 Protein-Encoding Genes of Branchiostoma floridae mtDNA

Amino Acid   [Anticodon] a   Codons
Phe (F)      [GAA]           TTT TTC
Leu (L)      [UAA]           TTA TTG
Leu (L)      [UAG]           CTT CTC CTA CTG
Ile (I)      [GAU]           ATT ATC
Met (M)      [CAU]           ATA ATG
Val (V)      [UAC]           GTT GTC GTA GTG
Ser (S)      [UGA]           TCT TCC TCA TCG
Pro (P)      [UGG]           CCT CCC CCA CCG
Thr (T)      [UGU]           ACT ACC ACA ACG
Ala (A)      [UGC]           GCT GCC GCA GCG
Tyr (Y)      [GUA]           TAT TAC
TER b                        TAA TAG
His (H)      [GUG]           CAT CAC
Gln (Q)      [UUG]           CAA CAG
Asn (N)      [GUU]           AAT AAC
Lys (K)      [UUU]           AAA AAG
Asp (D)      [GUC]           GAT GAC
Glu (E)      [UUC]           GAA GAG
Cys (C)      [GCA]           TGT TGC
Trp (W)      [UCA]           TGA TGG
Arg (R)      [UCG]           CGT CGC CGA CGG
Ser (S)      [GCU]           AGT AGC AGA AGG
Gly (G)      [UCC]           GGT GGC GGA GGG

a The anticodon of the corresponding tRNA is shown in brackets.
b Stop codons are omitted from this analysis.

Initiation and Termination of Protein-Encoding Genes

The mitochondrial protein genes of Branchiostoma correspond well in size and sequence to those of other metazoans (table 2). ATG codons initiate 11 of the 13, COI and ND1 being the exceptions.
The inferred initiation codon for COI is GTG, as no ATG or other initiation codon employed in metazoan mitochondrial systems is nearby, and because the sequence of the protein initiated at this position is highly similar to the amino-terminal sequences of COI in other metazoans. There is an in-frame TAG stop codon 6 nt upstream of this GTG, and Delarbre et al. (1997) found GTG at the corresponding position in B. lanceolatum, which they inferred as the initiation codon for COI. GTG has been inferred to initiate this and other genes in many metazoan mtDNAs (Wolstenholme 1992). Inference of the initiation codon for ND1 is less certain, as two commonly used initiation codons, GTG and ATA, are in-frame and immediately adjacent at the probable ND1 start site (positions 12,576–12,581 in fig. 2). The GTG codon directly abuts the upstream tRNA-L(UUR) gene, followed immediately by ATA. The same ambiguity is present at the 5′ end of ND1 in B. lanceolatum, for which Delarbre et al. (1997) arbitrarily designated ATA as the initiation codon.

Complete TAG stop codons are present in COI, Cytb, ND2, ND5, and ND6, none of which overlaps a downstream gene having the same transcriptional orientation, and a complete TAA stop codon is present in COIII (table 2). Also, A8 almost certainly ends at the TAG codon following the highly conserved sequence (WPW) at the 3′ end of A8, where A8 and A6 overlap. These genes commonly overlap in chordate mtDNAs, where they are known to be translated from the same bicistronic mRNA (Fearnley and Walker 1986). The other genes end at incomplete (i.e., T or TA) codons. After transcription and processing, mRNAs ending in T or TA are converted to TAA by polyadenylation (Ojala, Montoya, and Attardi 1981). This is surely the case for COII; if this transcript extended to the first in-frame stop codon, it would overlap the adjacent gene by 29 nt.
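The completion of an abbreviated stop codon by polyadenylation can be illustrated with a short Python sketch. It assumes a coding sequence given in DNA letters whose length need not be a multiple of three; the function name `terminal_codon_after_polyadenylation` and the toy sequences are hypothetical and are not taken from the B. floridae genome.

```python
STOP_CODONS = {"TAA", "TAG"}


def terminal_codon_after_polyadenylation(cds: str) -> str:
    """Return the last codon read once a poly(A) tail follows the transcript.

    If the coding sequence ends with a partial codon (1 or 2 nt), the
    missing positions are filled with A residues from the tail.
    """
    cds = cds.upper()
    remainder = len(cds) % 3
    if remainder == 0:
        return cds[-3:]            # already a complete final codon
    partial = cds[-remainder:]     # "T" or "TA" for an abbreviated stop
    return (partial + "AA")[:3]    # pad with tail A's up to three bases


# Toy transcripts (hypothetical): complete, T-terminated, and TA-terminated.
for toy in ("ATGGCTTAG", "ATGGCTT", "ATGGCTTA"):
    codon = terminal_codon_after_polyadenylation(toy)
    print(toy, "->", codon, "stop" if codon in STOP_CODONS else "not a stop")
```

Both abbreviated forms resolve to TAA because every added base is an A, which is why only T and TA (and not, for example, TG) can serve as incomplete termination signals under this model.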
A6, ND1, ND3, ND4, and ND4L are all inferred to end at incomplete stop codons that directly abut their adjacent, downstream genes; for each, however, allowing the transcript to overlap the downstream gene by 1 or 2 nt would complete the termination codon. Delarbre et al. (1997) sequenced a cDNA to the ND1 mRNA of B. lanceolatum and found that it terminated with TAA, thus confirming the incomplete codon hypothesis for that gene (overlap would have resulted in a TAG stop codon). However, it seems likely that the presence, at least prior to processing, of potentially complete termination codons is not merely coincidence and may be a mechanism for preventing translational readthrough in cases where correct transcript processing fails.

Table 2
Comparisons of the Mitochondrial Protein-Coding Genes of a Lancelet (Branchiostoma floridae), a Carp (Cyprinus carpio; Chang, Huang, and Lo 1994), and a Sea Urchin (Paracentrotus lividus; Cantatore et al. 1989)

Columns: Protein; Number of Amino Acids a (Lancelet, Carp, Sea urchin); Percent Amino Acid Identity b (Lancelet/carp, Lancelet/sea urchin, Sea urchin/carp); Predicted Initiation and Termination c Codons (Lancelet, Carp, Sea urchin).
Rows: A6, A8, COI, COII, COIII, Cytb, ND1, ND2, ND3, ND4, ND4L, ND5, ND6.

a Gene lengths are as inferred in the text and depicted in figure 1 or obtained from GenBank. Actual gene length could be slightly different due to ambiguity in determining start and stop codons.
b Percent identity is the number of identical inferred amino acids in a pairwise alignment divided by the mean length of the two compared sequences.
c For predicted stop codons the parentheses indicate the potential of a complete stop codon overlapping a downstream gene with the same transcriptional orientation. The asterisks indicate that no such potential reasonably exists and that the stop codon is incomplete, presumably completed by polyadenylation of the mRNA (see text).

Transfer RNAs

Twenty-two sequences can be folded into tRNA-like structures (fig. 3). In these, the sequences corresponding in position to the anticodons are identical to those for the mitochondrial tRNA genes of human, chicken, frog, and fish mtDNAs (Anderson et al. 1981; Roe et al. 1985; Desjardins and Morais 1990; Chang, Huang, and Lo 1994). All B. floridae mitochondrial tRNA genes have a TΨC loop of 3–7 nt and a TΨC stem of 3–6 nt; two (tRNA-R and tRNA-Q) have a single mismatch in this stem. All but tRNA-M, tRNA-F, and tRNA-W have a fully paired, seven-member acceptor stem, and all but tRNA-Q and tRNA-L(UUR) have a fully paired five-member anticodon stem. All except tRNA-T have a four-member extra arm.
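Whether a stem is "fully paired" or carries a mismatch can be checked mechanically. The sketch below is a minimal illustration, assuming both strands of a stem are written 5′ to 3′ in DNA letters and that T–G wobble pairs count as paired (the text itself treats T–G as a pair when comparing the two lancelet species). The function name `stem_mismatches` and the example strands are hypothetical, not sequences from fig. 3.

```python
# Watson-Crick pairs plus the G-T wobble pair, written in DNA letters.
PAIRS = {("A", "T"), ("T", "A"), ("G", "C"), ("C", "G"), ("G", "T"), ("T", "G")}


def stem_mismatches(five_prime: str, three_prime: str) -> int:
    """Count unpaired positions in a stem; both strands are given 5'->3'."""
    if len(five_prime) != len(three_prime):
        raise ValueError("stem strands must have equal length")
    # The 3' strand is antiparallel, so it is read in reverse when pairing.
    return sum(
        (a, b) not in PAIRS
        for a, b in zip(five_prime.upper(), reversed(three_prime.upper()))
    )


# Hypothetical seven-member acceptor stems.
print(stem_mismatches("GCATTCA", "TGAATGC"))  # fully paired stem
print(stem_mismatches("GCATTCA", "TGAACGC"))  # one A-C mismatch
print(stem_mismatches("GGATTCA", "TGAATTC"))  # G-T wobble counted as paired
```

Dropping the two wobble entries from `PAIRS` would give a stricter Watson-Crick-only count, which may be preferable depending on how a given analysis defines a mismatch.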
The dinucleotide between the acceptor and DHU arms is TpA in all tRNA genes except tRNA-S(UCN) and tRNA-Y, where it is TpG, and in tRNA-V, where it is GpA. In all of the tRNA genes, the 2 nt preceding the anticodon are pyrimidines and the nucleotide following it is a purine. Except for tRNA-C and tRNA-S(AGN), all have DHU arms with a stem of 3–5 nt and a loop of 3–11 nt. For all but tRNA-G, the 2 most proximal unpaired nt in the DHU loop are purines (almost always A's). The unpaired replacement for the DHU arm of tRNA-S(AGN) is typical of metazoan mtDNAs, as is the potential for additional pairing at the end of the anticodon stem.

An unpaired DHU arm in tRNA-C is unusual but has precedents among vertebrates; in some (but not all) cases (for examples, see Seutin et al. 1994; Macey et al. 1997) it is correlated with the loss of the immediately adjacent stem-loop structure that functions, in many vertebrate mtDNAs, as the second-strand (i.e., L-strand) origin of replication. We speculate that this aberrant tRNA-C might represent a compromise structure, serving as both an origin of replication and a functional tRNA gene. A similar condition appears in the mtDNA of P. dolloi (Zardoya and Meyer 1996), where tRNA-C and this stem-loop structure partially share the same sequence.

Unassigned DNA

The largest noncoding region in B. floridae mtDNA is only 129 nt. By contrast, the largest noncoding region is 198 nt in P. marinus (Lee and Kocher 1995), 928 nt in C. carpio (Chang, Huang, and Lo 1994), and 1,183 nt in P. dolloi (Zardoya and Meyer 1996) mtDNAs. A search of all vertebrate mtDNA sequences identified no obvious similarity with this noncoding sequence, and its location, between ND5 and tRNA-G, differs from that usually found in vertebrates. In B. floridae, this region is slightly less A+T-rich (59%) than the overall mtDNA (63%).
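The noncoding totals quoted in this paper can be cross-checked with simple arithmetic. The Python sketch below tallies the intergenic spacer lengths listed under Gene Content and Organization and includes a small helper for A+T content; the dictionary keys, the variable names, and the `at_fraction` helper are hypothetical labels chosen for illustration.

```python
# Intergenic spacer lengths (nt) as listed under Gene Content and Organization.
spacers = {
    "ND5/tRNA-G": 129,
    "tRNA-S(UCN)/tRNA-D": 8,
    "tRNA-R/ND4L": 2,
    "tRNA-F/tRNA-V": 2,
    "tRNA-Q/ND2": 1,
    "ND6/tRNA-E": 1,
    "tRNA-E/Cytb": 1,
    "tRNA-Y/COI": 10,
}

GENOME_LENGTH = 15083  # nt, from the abstract

total_noncoding = sum(spacers.values())
largest = max(spacers.values())
outside_largest = total_noncoding - largest
noncoding_percent = 100 * total_noncoding / GENOME_LENGTH


def at_fraction(seq: str) -> float:
    """Fraction of A and T bases in a DNA sequence (0.0 for an empty string)."""
    seq = seq.upper()
    return (seq.count("A") + seq.count("T")) / len(seq) if seq else 0.0


print(total_noncoding, largest, outside_largest)
print(f"{noncoding_percent:.2f}% of the genome is unassigned")
```

The tally reproduces the 154 nt total, and subtracting the 129-nt region leaves the nucleotides that lie in the short spacers of 1 to 10 nt.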
Other than in this region, there are only 25 nt, distributed in blocks of 1–10 nt, that are unassigned to genes, and the composition of these appears unremarkable.

FIG. 3. The potential secondary structures of the 22 inferred tRNAs of Branchiostoma floridae mtDNA. Nomenclature for tRNA arms is shown for tRNA-V. The five additional nucleotides in parentheses outside of the structures and accompanied by an arrow indicate the only differences in comparing the eight sequenced tRNA genes of Branchiostoma lanceolatum (Delarbre et al. 1997).

Genetic Code: AGA Specifies Serine, Not Glycine, in Branchiostoma mtDNA

In vertebrate mtDNAs only AGY specifies serine, with AGR codons being absent or, when present, used as stop codons. In all invertebrate mtDNAs except those of cnidarians both AGR and AGY appear to specify serine (reviewed by Wolstenholme 1992). (AGR specifies arginine in cnidarian mtDNA, as in the universal code, presumably due to the use of imported, nuclear-encoded tRNAs.) Based on a single AGA and no AGG codons, Delarbre et al. (1997) suggested a variation for the lancelet mitochondrial genetic code in which AGR specifies glycine and AGY, serine. Given the paucity of data available to them, that suggestion may have been reasonable. However, with the much larger data set presented here (and noting that no AGG codons are present), it is clear that AGA (along with AGY) specifies serine in Branchiostoma mtDNA.

There are 12 AGA codons in B. floridae mtDNA, 1 of which is identical in position to the single AGA codon found by Delarbre et al. (1997) in the ND2 gene of B. lanceolatum. In alignments with the corresponding gene sequences of P. marinus (Lee and Kocher 1995), C. carpio (Chang, Huang, and Lo 1994), P. dolloi (Zardoya and Meyer 1996), Gadus morhua (Johansen, Guddal, and Johansen 1990), and Crossostoma lacustre (Tzeng et al. 1992), the Branchiostoma AGA codons correspond most frequently to serine codons, with the correspondence to TCN codons being even more frequent than to AGY codons. No tRNA genes in B. floridae mtDNA have a TCT anticodon, as would be needed to discriminate AGR from the AGN codon family; moreover, only tRNA-S(AGN) has an NCT anticodon. Even though we believe that there is unequivocal evidence that AGN specifies serine in Branchiostoma mtDNA, the absence of AGG and reduced usage of AGA codons in this mtDNA can be viewed as a precondition for codon reassignment, as shown even better for the mtDNA of the hemichordate Balanoglossus (Castresana, Feldmaier-Fuchs, and Pääbo 1998).

AGR codons appear to specify glycine in the urochordate Halocynthia roretzi mtDNA, because of their frequent alignment with glycine (GGN) codons in other metazoan mtDNAs (Yokobori, Ueda, and Watanabe 1993). The assignment of glycine as the amino acid specified by AGR codons in Halocynthia mtDNA is based on 19 occurrences in a 1,263-nt fragment of COI (Yokobori, Ueda, and Watanabe 1993). No AGR codons appear in COI of B. floridae; however, 11 of the 12 AGA codons and 5 of the 7 AGG codons in Halocynthia COI align with glycine codons (GGN) in B. floridae COI, providing further evidence for the reassignment of AGR in the urochordate lineage, after it split from the lineage leading to cephalochordates.
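The three AGR assignments discussed here can be compared side by side with a small translation sketch in Python. It starts from the universal code, applies the two changes shared by these mitochondrial codes according to table 1 (TGA read as tryptophan and ATA as methionine), and then varies only the meaning of AGA/AGG. The names `mito_code`, `CODES`, and `translate` are hypothetical, and assigning AGG to serine in the lancelet table is an assumption, since no AGG codon occurs in the B. floridae genes.

```python
from itertools import product

BASES = "TCAG"
# Universal genetic code in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
UNIVERSAL_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
UNIVERSAL = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), UNIVERSAL_AA)
}


def mito_code(agr: str) -> dict[str, str]:
    """Universal code with TGA=W, ATA=M, and AGA/AGG set to `agr`."""
    table = dict(UNIVERSAL)
    table.update({"TGA": "W", "ATA": "M", "AGA": agr, "AGG": agr})
    return table


CODES = {
    "vertebrate": mito_code("*"),  # AGR absent or used as stop
    "lancelet": mito_code("S"),    # AGA = serine (AGG assumed, never observed)
    "ascidian": mito_code("G"),    # AGR = glycine in Halocynthia
}


def translate(seq: str, code: dict[str, str]) -> str:
    """Translate complete codons of `seq`; trailing partial codons are ignored."""
    seq = seq.upper()
    usable = len(seq) - len(seq) % 3
    return "".join(code[seq[i:i + 3]] for i in range(0, usable, 3))


toy = "ATGAGATGA"  # hypothetical sequence: ATG, AGA, TGA
for name, code in CODES.items():
    print(name, translate(toy, code))
```

Translating the same alignment column under each table and counting which reading best matches the residues found in other taxa is, in spirit, how the serine and glycine assignments above were inferred.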
Sequence alignments of the B. floridae and vertebrate mitochondrial proteins suggest that there are no other differences between its genetic code and that used in vertebrate mtDNAs.

Nucleotide Sequence Comparisons with B. lanceolatum mtDNA

The 2,562 nt previously determined for B. lanceolatum (Delarbre et al. 1997) are remarkably similar in sequence to the corresponding region of B. floridae mtDNA, with only 73 nt differences between the two (about 97% sequence identity). This common region includes complete genes for ND1, ND2, and eight tRNAs, and parts of genes for COI and a ninth tRNA. There is evidence for only one insertion/deletion event. This results in a single-nucleotide difference in the lengths of the noncoding regions between tRNA-Y and COI (10 nt in B. floridae, 9 nt in B. lanceolatum), which are otherwise identical. In the aggregate, the tRNA gene sequences of the two species differ by five substitutions (fig. 3). Of these, four are in loops and one is in a stem (which changes a T–G pair to a C–G pair); all five substitutions are transitions. The ND1 genes differ by 15 substitutions (disregarding whether initiated by GTG or ATA), of which 13 are synonymous and 2 nonsynonymous; 11 are transitions and 4 are transversions. The ND2 genes differ by 52 substitutions, of which 47 are synonymous and 5 nonsynonymous; 50 are transitions and 2 are transversions. However, the ND2 situation is made complex by Delarbre et al.'s (1997) report of a second ND2 copy, which could be either mitochondrial or nuclear. The sequence of the second copy was not specifically reported, but they noted that the two differed by 42 substitutions, 8 of which cause amino acid replacements, which they describe. Of these eight, the B. floridae ND2 amino acid sequence is identical to three in the first (reported) copy and to five in the second. We found no evidence for a second ND2 copy in the B.
floridae mtdna sequence, and the size of the purified mtdna, estimated from restriction fragment length summations, is clearly insufficient to accommodate a second copy of this gene. Acknowledgments Thanks to Susan Fuerstenberg, Kevin Helfenbein, Gavin Naylor, and Alan Wolf for helpful comments on the manuscript and to Svante Pääbo, Jose Castresana, and coworkers for sharing the Balanoglossus data with us prior to publication. This work was supported by NSF grant DEB to W.M.B. LITERATURE CITED ANDERSON, S., A. T. BANKIER, B.G.BARRELL et al. (14 coauthors) Sequence and organization of the human mitochondrial genome. Nature 290: BOORE, J. L Ancient patterns of arthropod evolution are recorded in mitochondrial genome rearrangements. Pp in M. NEI and N. TAKAHATA, eds. Current topics in molecular evolution: proceedings of the U.S. Japan Binational Workshop on Molecular Evolution. Graduate School for Advanced Studies, Hayama, Japan. BOORE, J. L., and W. M. BROWN Mitochondrial genomes and the phylogeny of mollusks. Nautilus 108(Suppl. 2): BOORE, J. L., T. M. COLLINS, D. STANTON, L. L. DAEHLER, and W. M. BROWN Deducing the pattern of arthropod phylogeny from mitochondrial DNA rearrangements. Nature 376: BOORE, J. L., D. V. LAVROV, and W. M. BROWN Gene translocation links insects and crustaceans. Nature 393: BROWN, W. M Polymorphism in mitochondrial DNA of humans as revealed by restriction endonuclease analysis. Proc. Natl. Acad. Sci. USA 77: CANTATORE, P., M. ROBERTI, G. RAINALDI, M. N. GADALETA, and C. SACCONE The complete nucleotide sequence, gene order and genetic code of the mitochondrial genome of Paracentrotus lividus. J. Biol. Chem. 264: CARDON, L. R., C. BURGE, D. A. CLAYTON, and S. KARLIN Pervasive CpG suppression in animal mitochondrial genomes. Proc. Natl. Acad. Sci. USA 91: CASTRESANA, J., G. FELDMAIER-FUCHS, and S. PÄÄBO Codon reassignment and amino acid composition in hemi-

8 Lancelet Mitochondrial DNA 417 chordate mitochondria. Proc. Natl. Acad. Sci. USA 95: CHANG, Y.-S., F.-L. HUANG, and T.-B. LO The complete nucleotide sequence and gene organization of carp (Cyprinus carpio) mitochondrial genome. J. Mol. Evol. 38: CLARY, D. O., and D. R. WOLSTENHOLME The mitochondrial DNA molecule of Drosophila yakuba: nucleotide sequence, gene organization, and genetic code. J. Mol. Evol. 22: DELARBRE, C., V. BARRIEL, S. TILLIER, P. JANVIER, and G. GACHELIN The main features of the craniate mitochondrial DNA between the ND1 and the COI genes were established in the common ancestor with the lancelet. Mol. Biol. Evol. 14: DESJARDINS, P., and R. MORAIS Sequence and gene organization of the chicken mitochondrial genome: a novel gene order in higher vertebrates. J. Mol. Biol. 212: Nucleotide sequence and evolution of coding and noncoding regions of a quail mitochondrial genome. J. Mol. Evol. 32: DESJARDINS, P., V. RAMIREZ, and R. MORAIS Gene organization of the Peking duck mitochondrial genome. Curr. Genet. 17: FEARNLEY, I. M., and J. E. WALKER Two overlapping genes in bovine mitochondrial DNA encode membrane components of ATP synthase. EMBO J. 5: FUJII, H., T. SHIMADA, Y. GOTO, and T. OKAZAKI Cloning of the mitochondrial genome of Rana catesbeiana and the nucleotide sequences of the ND2 and five trna genes J. Biochem. 103: GARESSE, R., J. A. CARRODEGUAS, J. SANTIAGO, M. L. PEREZ, R. MARCO, and C. G. VALLEJO Artemia mitochondrial genome: molecular biology and evolutive considerations. Comp. Biochem. Physiol. B. Biochem. Mol. Biol. 117: GLAUS, K. R., H. P. ZASSENHAUS, N. S. FECHHEIMER, and P. S. PERLMAN Avian mtdna: structure, organization and evolution. Pp in A. M. KROON and C. SAC- CONE, eds. The organization and expression of the mitochondrial genome. Elsevier/North-Holland Biomedical Press, Amsterdam. HARLID, A., A. JANKE, and U. ARNASON The mtdna sequence of the ostrich and the divergence between paleognathous and neognathous birds. Mol. Biol. 
Evol. 14: JACOBS, H. T., D. J. ELLIOTT, V.B.MATH, and A. FARQUAR- SON Nucleotide sequence and gene organization of sea urchin mitochondrial DNA. J. Mol. Biol. 202: JANKE, A., and U. ARNASON The complete mitochondrial genome of Alligator mississippiensis and the separation between recent Archosauria (birds and crocodiles). Mol. Biol. Evol. 14: JANKE, A., G. FELDMAIER-FUCHS, W. K. THOMAS, A.VON HAESELER, and S. PÄÄBO The marsupial mitochondrial genome and the evolution of placental mammals. Genetics 137: JOHANSEN, H., P. H. GUDDAL, and T. JOHANSEN Organization of the mitochondrial genome of Atlantic cod, Gadus morhua. Nucleic Acids Res. 18: KUMAZAWA, Y., and M. NISHIDA Variations in mitochondrial trna gene organization of reptiles as phylogenetic markers. Mol. Biol. Evol. 12: KUMAZAWA, Y., H. OTA, M. NISHIDA, and T. OZAWA Gene rearrangements in a snake mitochondrial genomes: highly concerted evolution of control-region-like sequences duplicated and inserted into a trna gene cluster. Mol. Biol. Evol. 13: LEE W.-J., and T. KOCHER Complete sequence of a sea lamprey (Petromyzon marinus) mitochondrial genome: early establishment of the vertebrate genome organization. Genetics 139: MACEY, J. R., A. LARSON, N.B.ANANJEVA, and T. PAPENFUSS Evolutionary shifts in three major structural features of the mitochondrial genome among iguanian lizards. J. Mol. Evol. 44: NAYLOR, G. J. P., and W. M. BROWN Structural biology and phylogenetic estimation. Nature 388: Amphioxus mitochondrial DNA, chordate phylogeny, and the limits of inference based on comparisons of sequences. Syst. Biol. 47: OJALA, D., J. MONTOYA, and G. ATTARDI trna punctuation model of RNA processing in human mitochondria. Nature 290: PÄÄBO, S., W. K. THOMAS, K. M. WHITFIELD, Y. KUMAZAWA, and A. C. WILSON Rearrangements of mitochondrial transfer RNA genes in marsupials. J. Mol. Evol. 33: QUINN, T. W., and D. P. MINDELL Mitochondrial gene order adjacent to the control region in crocodile, turtle, and tuatara. Mol. 
Phylogenet. Evol. 5: QUINN, T. W., and A. C. WILSON Sequence evolution in and around the mitochondrial control region in birds. J. Mol. Evol. 37: RAMIREZ, V., P. SAVOIE, and R. MORAIS Molecular characterization and evolution of a duck mitochondrial genome. J. Mol. Evol. 37: ROE, B. A., D.-P. MA, R. K. WILSON, and J. J.-H. WONG The complete nucleotide sequence of the Xenopus laevis mitochondrial genome. J. Biol. Chem. 260: SANGER, F., S. NICKLEN, and A. R. COULSON DNA sequencing with chain-terminating inhibitors. Proc. Natl. Acad. Sci. USA 74: SANKOFF, D., G. LEDUC, N. ANTOINE, B. PAQUIN, B. F. LANG, and R. J. CEDERGREN Gene order comparisons for phylogenetic inference: evolution of the mitochondrial genome. Proc. Natl. Acad. Sci. USA 89: SEUTIN, G., B. F. LANG, D. P. MINDELL, and R. MORAIS Evolution of the WANCY region in amniote mitochondrial DNA. Mol. Biol. Evol. 11: SHADEL, G. S., and D. A. CLAYTON Mitochondrial DNA maintenance in vertebrates. Annu. Rev. Biochem. 66: SMITH, M. J., A. ARNDT, S.GORSKI, and E. FAJBER The phylogeny of echinoderm classes based on mitochondrial gene arrangements. J. Mol. Evol. 36: SMITH, M. J., D. K. BANFIELD, K. DOTEVAL, S. GORSKI, and D. J. KOWBEL Gene arrangement in sea star mitochondrial DNA demonstrates a major inversion event during echinoderm evolution. Gene 76: SMITH, A. E., and K. A. MARCKER N-formylmethionyl transfer RNA in mitochondria from yeast and rat liver. J. Mol. Biol. 38: TZENG, C.-S., C.-F. HUI, S.-C. SHEN, and P. C. HUANG The complete nucleotide sequence of the Crossostome lacustre mitochondrial genome: conservation and variations among vertebrates. Nucleic Acids Res. 20: WOLSTENHOLME, D. R Animal mitochondrial DNA: structure and evolution. Int. Rev. Cytol. 141: WRIGHT, J. W., C. SPOLSKY, and W. M. BROWN The origin of the parthenogenetic lizard Cnemidophorus laredoensis inferred from mitochondrial DNA analysis. Herpetologica 39:

YOKOBORI, S.-I., T. UEDA, and K. WATANABE. Codons AGA and AGG are read as glycine in ascidian mitochondria. J. Mol. Evol. 36:1–8.
YONEYAMA, Y. The nucleotide sequences of the heavy and light strand replication origins of the Rana catesbeiana mitochondrial genome. J. Nippon Med. Sch. (Nippon Ika Daigaku Zasshi) 54: [in Japanese].
ZARDOYA, R., and A. MEYER. The complete nucleotide sequence of the mitochondrial genome of the lungfish (Protopterus dolloi) supports its phylogenetic position as a close relative of land vertebrates. Genetics 142:

STEPHEN PALUMBI, reviewing editor
Accepted December 9, 1998

Practical Bioinformatics

Practical Bioinformatics 5/2/2017 Dictionaries d i c t i o n a r y = { A : T, T : A, G : C, C : G } d i c t i o n a r y [ G ] d i c t i o n a r y [ N ] = N d i c t i o n a r y. h a s k e y ( C ) Dictionaries g e n e t i c C o

More information

SEQUENCE ALIGNMENT BACKGROUND: BIOINFORMATICS. Prokaryotes and Eukaryotes. DNA and RNA

SEQUENCE ALIGNMENT BACKGROUND: BIOINFORMATICS. Prokaryotes and Eukaryotes. DNA and RNA SEQUENCE ALIGNMENT BACKGROUND: BIOINFORMATICS 1 Prokaryotes and Eukaryotes 2 DNA and RNA 3 4 Double helix structure Codons Codons are triplets of bases from the RNA sequence. Each triplet defines an amino-acid.

More information

Objective: You will be able to justify the claim that organisms share many conserved core processes and features.

Objective: You will be able to justify the claim that organisms share many conserved core processes and features. Objective: You will be able to justify the claim that organisms share many conserved core processes and features. Do Now: Read Enduring Understanding B Essential knowledge: Organisms share many conserved

More information

SUPPORTING INFORMATION FOR. SEquence-Enabled Reassembly of β-lactamase (SEER-LAC): a Sensitive Method for the Detection of Double-Stranded DNA

SUPPORTING INFORMATION FOR. SEquence-Enabled Reassembly of β-lactamase (SEER-LAC): a Sensitive Method for the Detection of Double-Stranded DNA SUPPORTING INFORMATION FOR SEquence-Enabled Reassembly of β-lactamase (SEER-LAC): a Sensitive Method for the Detection of Double-Stranded DNA Aik T. Ooi, Cliff I. Stains, Indraneel Ghosh *, David J. Segal

More information

Aoife McLysaght Dept. of Genetics Trinity College Dublin

Aoife McLysaght Dept. of Genetics Trinity College Dublin Aoife McLysaght Dept. of Genetics Trinity College Dublin Evolution of genome arrangement Evolution of genome content. Evolution of genome arrangement Gene order changes Inversions, translocations Evolution

More information

Supplementary Information for

Supplementary Information for Supplementary Information for Evolutionary conservation of codon optimality reveals hidden signatures of co-translational folding Sebastian Pechmann & Judith Frydman Department of Biology and BioX, Stanford

More information

Advanced topics in bioinformatics

Advanced topics in bioinformatics Feinberg Graduate School of the Weizmann Institute of Science Advanced topics in bioinformatics Shmuel Pietrokovski & Eitan Rubin Spring 2003 Course WWW site: http://bioinformatics.weizmann.ac.il/courses/atib

More information

SUPPLEMENTARY DATA - 1 -

SUPPLEMENTARY DATA - 1 - - 1 - SUPPLEMENTARY DATA Construction of B. subtilis rnpb complementation plasmids For complementation, the B. subtilis rnpb wild-type gene (rnpbwt) under control of its native rnpb promoter and terminator

More information

NSCI Basic Properties of Life and The Biochemistry of Life on Earth

NSCI Basic Properties of Life and The Biochemistry of Life on Earth NSCI 314 LIFE IN THE COSMOS 4 Basic Properties of Life and The Biochemistry of Life on Earth Dr. Karen Kolehmainen Department of Physics CSUSB http://physics.csusb.edu/~karen/ WHAT IS LIFE? HARD TO DEFINE,

More information

Supplemental data. Pommerrenig et al. (2011). Plant Cell /tpc

Supplemental data. Pommerrenig et al. (2011). Plant Cell /tpc Supplemental Figure 1. Prediction of phloem-specific MTK1 expression in Arabidopsis shoots and roots. The images and the corresponding numbers showing absolute (A) or relative expression levels (B) of

More information

High throughput near infrared screening discovers DNA-templated silver clusters with peak fluorescence beyond 950 nm

High throughput near infrared screening discovers DNA-templated silver clusters with peak fluorescence beyond 950 nm Electronic Supplementary Material (ESI) for Nanoscale. This journal is The Royal Society of Chemistry 2018 High throughput near infrared screening discovers DNA-templated silver clusters with peak fluorescence

More information

Crick s early Hypothesis Revisited

Crick s early Hypothesis Revisited Crick s early Hypothesis Revisited Or The Existence of a Universal Coding Frame Ryan Rossi, Jean-Louis Lassez and Axel Bernal UPenn Center for Bioinformatics BIOINFORMATICS The application of computer

More information

3. Evolution makes sense of homologies. 3. Evolution makes sense of homologies. 3. Evolution makes sense of homologies

3. Evolution makes sense of homologies. 3. Evolution makes sense of homologies. 3. Evolution makes sense of homologies Richard Owen (1848) introduced the term Homology to refer to structural similarities among organisms. To Owen, these similarities indicated that organisms were created following a common plan or archetype.

More information

SSR ( ) Vol. 48 No ( Microsatellite marker) ( Simple sequence repeat,ssr),

SSR ( ) Vol. 48 No ( Microsatellite marker) ( Simple sequence repeat,ssr), 48 3 () Vol. 48 No. 3 2009 5 Journal of Xiamen University (Nat ural Science) May 2009 SSR,,,, 3 (, 361005) : SSR. 21 516,410. 60 %96. 7 %. (),(Between2groups linkage method),.,, 11 (),. 12,. (, ), : 0.

More information

Characterization of Pathogenic Genes through Condensed Matrix Method, Case Study through Bacterial Zeta Toxin

Characterization of Pathogenic Genes through Condensed Matrix Method, Case Study through Bacterial Zeta Toxin International Journal of Genetic Engineering and Biotechnology. ISSN 0974-3073 Volume 2, Number 1 (2011), pp. 109-114 International Research Publication House http://www.irphouse.com Characterization of

More information

Supplementary Information

Supplementary Information Electronic Supplementary Material (ESI) for RSC Advances. This journal is The Royal Society of Chemistry 2014 Directed self-assembly of genomic sequences into monomeric and polymeric branched DNA structures

More information

Clay Carter. Department of Biology. QuickTime and a TIFF (Uncompressed) decompressor are needed to see this picture.

Clay Carter. Department of Biology. QuickTime and a TIFF (Uncompressed) decompressor are needed to see this picture. QuickTime and a TIFF (Uncompressed) decompressor are needed to see this picture. Clay Carter Department of Biology QuickTime and a TIFF (LZW) decompressor are needed to see this picture. Ornamental tobacco

More information

Codon Distribution in Error-Detecting Circular Codes

Codon Distribution in Error-Detecting Circular Codes life Article Codon Distribution in Error-Detecting Circular Codes Elena Fimmel, * and Lutz Strüngmann Institute for Mathematical Biology, Faculty of Computer Science, Mannheim University of Applied Sciences,

More information

Electronic supplementary material

Electronic supplementary material Applied Microbiology and Biotechnology Electronic supplementary material A family of AA9 lytic polysaccharide monooxygenases in Aspergillus nidulans is differentially regulated by multiple substrates and

More information

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1 Supplementary Figure 1 Zn 2+ -binding sites in USP18. (a) The two molecules of USP18 present in the asymmetric unit are shown. Chain A is shown in blue, chain B in green. Bound Zn 2+ ions are shown as

More information

6.047 / Computational Biology: Genomes, Networks, Evolution Fall 2008

6.047 / Computational Biology: Genomes, Networks, Evolution Fall 2008 MIT OpenCourseWare http://ocw.mit.edu 6.047 / 6.878 Computational Biology: Genomes, Networks, Evolution Fall 2008 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms.

More information

Number-controlled spatial arrangement of gold nanoparticles with

Number-controlled spatial arrangement of gold nanoparticles with Electronic Supplementary Material (ESI) for RSC Advances. This journal is The Royal Society of Chemistry 2016 Number-controlled spatial arrangement of gold nanoparticles with DNA dendrimers Ping Chen,*

More information

Why do more divergent sequences produce smaller nonsynonymous/synonymous

Why do more divergent sequences produce smaller nonsynonymous/synonymous Genetics: Early Online, published on June 21, 2013 as 10.1534/genetics.113.152025 Why do more divergent sequences produce smaller nonsynonymous/synonymous rate ratios in pairwise sequence comparisons?

More information

Using an Artificial Regulatory Network to Investigate Neural Computation

Using an Artificial Regulatory Network to Investigate Neural Computation Using an Artificial Regulatory Network to Investigate Neural Computation W. Garrett Mitchener College of Charleston January 6, 25 W. Garrett Mitchener (C of C) UM January 6, 25 / 4 Evolution and Computing

More information

The Trigram and other Fundamental Philosophies

The Trigram and other Fundamental Philosophies The Trigram and other Fundamental Philosophies by Weimin Kwauk July 2012 The following offers a minimal introduction to the trigram and other Chinese fundamental philosophies. A trigram consists of three

More information

Regulatory Sequence Analysis. Sequence models (Bernoulli and Markov models)

Regulatory Sequence Analysis. Sequence models (Bernoulli and Markov models) Regulatory Sequence Analysis Sequence models (Bernoulli and Markov models) 1 Why do we need random models? Any pattern discovery relies on an underlying model to estimate the random expectation. This model

More information

Supporting Information for. Initial Biochemical and Functional Evaluation of Murine Calprotectin Reveals Ca(II)-

Supporting Information for. Initial Biochemical and Functional Evaluation of Murine Calprotectin Reveals Ca(II)- Supporting Information for Initial Biochemical and Functional Evaluation of Murine Calprotectin Reveals Ca(II)- Dependence and Its Ability to Chelate Multiple Nutrient Transition Metal Ions Rose C. Hadley,

More information

Modelling and Analysis in Bioinformatics. Lecture 1: Genomic k-mer Statistics

Modelling and Analysis in Bioinformatics. Lecture 1: Genomic k-mer Statistics 582746 Modelling and Analysis in Bioinformatics Lecture 1: Genomic k-mer Statistics Juha Kärkkäinen 06.09.2016 Outline Course introduction Genomic k-mers 1-Mers 2-Mers 3-Mers k-mers for Larger k Outline

More information

Protein Threading. Combinatorial optimization approach. Stefan Balev.

Protein Threading. Combinatorial optimization approach. Stefan Balev. Protein Threading Combinatorial optimization approach Stefan Balev Stefan.Balev@univ-lehavre.fr Laboratoire d informatique du Havre Université du Havre Stefan Balev Cours DEA 30/01/2004 p.1/42 Outline

More information

Evolutionary Analysis of Viral Genomes

Evolutionary Analysis of Viral Genomes University of Oxford, Department of Zoology Evolutionary Biology Group Department of Zoology University of Oxford South Parks Road Oxford OX1 3PS, U.K. Fax: +44 1865 271249 Evolutionary Analysis of Viral

More information

Supplemental Figure 1.

Supplemental Figure 1. A wt spoiiiaδ spoiiiahδ bofaδ B C D E spoiiiaδ, bofaδ Supplemental Figure 1. GFP-SpoIVFA is more mislocalized in the absence of both BofA and SpoIIIAH. Sporulation was induced by resuspension in wild-type

More information

Building a Multifunctional Aptamer-Based DNA Nanoassembly for Targeted Cancer Therapy

Building a Multifunctional Aptamer-Based DNA Nanoassembly for Targeted Cancer Therapy Supporting Information Building a Multifunctional Aptamer-Based DNA Nanoassembly for Targeted Cancer Therapy Cuichen Wu,, Da Han,, Tao Chen,, Lu Peng, Guizhi Zhu,, Mingxu You,, Liping Qiu,, Kwame Sefah,

More information

Table S1. Primers and PCR conditions used in this paper Primers Sequence (5 3 ) Thermal conditions Reference Rhizobacteria 27F 1492R

Table S1. Primers and PCR conditions used in this paper Primers Sequence (5 3 ) Thermal conditions Reference Rhizobacteria 27F 1492R Table S1. Primers and PCR conditions used in this paper Primers Sequence (5 3 ) Thermal conditions Reference Rhizobacteria 27F 1492R AAC MGG ATT AGA TAC CCK G GGY TAC CTT GTT ACG ACT T Detection of Candidatus

More information

Supporting Information

Supporting Information Supporting Information T. Pellegrino 1,2,3,#, R. A. Sperling 1,#, A. P. Alivisatos 2, W. J. Parak 1,2,* 1 Center for Nanoscience, Ludwig Maximilians Universität München, München, Germany 2 Department of

More information

Introduction to Molecular Phylogeny

Introduction to Molecular Phylogeny Introduction to Molecular Phylogeny Starting point: a set of homologous, aligned DNA or protein sequences Result of the process: a tree describing evolutionary relationships between studied sequences =

More information

Edinburgh Research Explorer

Edinburgh Research Explorer Edinburgh Research Explorer Codon usage patterns in Escherichia coli, Bacillus subtilis, Saccharomyces cerevisiae, Schizosaccharomyces pombe, Drosophila melanogaster and Homo sapiens; a review of the considerable

More information

Genetic Code, Attributive Mappings and Stochastic Matrices

Genetic Code, Attributive Mappings and Stochastic Matrices Genetic Code, Attributive Mappings and Stochastic Matrices Matthew He Division of Math, Science and Technology Nova Southeastern University Ft. Lauderdale, FL 33314, USA Email: hem@nova.edu Abstract: In

More information

(Lys), resulting in translation of a polypeptide without the Lys amino acid. resulting in translation of a polypeptide without the Lys amino acid.

(Lys), resulting in translation of a polypeptide without the Lys amino acid. resulting in translation of a polypeptide without the Lys amino acid. 1. A change that makes a polypeptide defective has been discovered in its amino acid sequence. The normal and defective amino acid sequences are shown below. Researchers are attempting to reproduce the

More information

Supplemental Table 1. Primers used for cloning and PCR amplification in this study

Supplemental Table 1. Primers used for cloning and PCR amplification in this study Supplemental Table 1. Primers used for cloning and PCR amplification in this study Target Gene Primer sequence NATA1 (At2g393) forward GGG GAC AAG TTT GTA CAA AAA AGC AGG CTT CAT GGC GCC TCC AAC CGC AGC

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION SUPPLEMENTARY INFORMATION DOI:.38/NCHEM.246 Optimizing the specificity of nucleic acid hyridization David Yu Zhang, Sherry Xi Chen, and Peng Yin. Analytic framework and proe design 3.. Concentration-adjusted

More information

TM1 TM2 TM3 TM4 TM5 TM6 TM bp

TM1 TM2 TM3 TM4 TM5 TM6 TM bp a 467 bp 1 482 2 93 3 321 4 7 281 6 21 7 66 8 176 19 12 13 212 113 16 8 b ATG TCA GGA CAT GTA ATG GAG GAA TGT GTA GTT CAC GGT ACG TTA GCG GCA GTA TTG CGT TTA ATG GGC GTA GTG M S G H V M E E C V V H G T

More information

In previous lecture. Shannon s information measure x. Intuitive notion: H = number of required yes/no questions.

In previous lecture. Shannon s information measure x. Intuitive notion: H = number of required yes/no questions. In previous lecture Shannon s information measure H ( X ) p log p log p x x 2 x 2 x Intuitive notion: H = number of required yes/no questions. The basic information unit is bit = 1 yes/no question or coin

More information

A p-adic Model of DNA Sequence and Genetic Code 1

A p-adic Model of DNA Sequence and Genetic Code 1 ISSN 2070-0466, p-adic Numbers, Ultrametric Analysis and Applications, 2009, Vol. 1, No. 1, pp. 34 41. c Pleiades Publishing, Ltd., 2009. RESEARCH ARTICLES A p-adic Model of DNA Sequence and Genetic Code

More information

part 3: analysis of natural selection pressure

part 3: analysis of natural selection pressure part 3: analysis of natural selection pressure markov models are good phenomenological codon models do have many benefits: o principled framework for statistical inference o avoiding ad hoc corrections

More information

Genetic code on the dyadic plane

Genetic code on the dyadic plane Genetic code on the dyadic plane arxiv:q-bio/0701007v3 [q-bio.qm] 2 Nov 2007 A.Yu.Khrennikov, S.V.Kozyrev June 18, 2018 Abstract We introduce the simple parametrization for the space of codons (triples

More information

Lecture IV A. Shannon s theory of noisy channels and molecular codes

Lecture IV A. Shannon s theory of noisy channels and molecular codes Lecture IV A Shannon s theory of noisy channels and molecular codes Noisy molecular codes: Rate-Distortion theory S Mapping M Channel/Code = mapping between two molecular spaces. Two functionals determine

More information

Protein Synthesis. Unit 6 Goal: Students will be able to describe the processes of transcription and translation.

Protein Synthesis. Unit 6 Goal: Students will be able to describe the processes of transcription and translation. Protein Synthesis Unit 6 Goal: Students will be able to describe the processes of transcription and translation. Protein Synthesis: Protein synthesis uses the information in genes to make proteins. 2 Steps

More information

A Minimum Principle in Codon-Anticodon Interaction

A Minimum Principle in Codon-Anticodon Interaction A Minimum Principle in Codon-Anticodon Interaction A. Sciarrino a,b,, P. Sorba c arxiv:0.480v [q-bio.qm] 9 Oct 0 Abstract a Dipartimento di Scienze Fisiche, Università di Napoli Federico II Complesso Universitario

More information

Evolutionary dynamics of abundant stop codon readthrough in Anopheles and Drosophila

Evolutionary dynamics of abundant stop codon readthrough in Anopheles and Drosophila biorxiv preprint first posted online May. 3, 2016; doi: http://dx.doi.org/10.1101/051557. The copyright holder for this preprint (which was not peer-reviewed) is the author/funder. All rights reserved.

More information

Biology 155 Practice FINAL EXAM

Biology 155 Practice FINAL EXAM Biology 155 Practice FINAL EXAM 1. Which of the following is NOT necessary for adaptive evolution? a. differential fitness among phenotypes b. small population size c. phenotypic variation d. heritability

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION DOI:.8/NCHEM. Conditionally Fluorescent Molecular Probes for Detecting Single Base Changes in Double-stranded DNA Sherry Xi Chen, David Yu Zhang, Georg Seelig. Analytic framework and probe design.. Design

More information

Complete sequence of the amphioxus (Branchiostoma lanceolatum) mitochondrial genome: relations to vertebrates

Complete sequence of the amphioxus (Branchiostoma lanceolatum) mitochondrial genome: relations to vertebrates 1998 Oxford University Press Nucleic Acids Research, 1998, Vol. 26, No. 13 3279 3285 Complete sequence of the amphioxus (Branchiostoma lanceolatum) mitochondrial genome: relations to vertebrates Nathalie

More information

Sequence Divergence & The Molecular Clock. Sequence Divergence

Sequence Divergence & The Molecular Clock. Sequence Divergence Sequence Divergence & The Molecular Clock Sequence Divergence v simple genetic distance, d = the proportion of sites that differ between two aligned, homologous sequences v given a constant mutation/substitution

More information

Evolvable Neural Networks for Time Series Prediction with Adaptive Learning Interval

Evolvable Neural Networks for Time Series Prediction with Adaptive Learning Interval Evolvable Neural Networs for Time Series Prediction with Adaptive Learning Interval Dong-Woo Lee *, Seong G. Kong *, and Kwee-Bo Sim ** *Department of Electrical and Computer Engineering, The University

More information

Encoding of Amino Acids and Proteins from a Communications and Information Theoretic Perspective

Encoding of Amino Acids and Proteins from a Communications and Information Theoretic Perspective Jacobs University Bremen Encoding of Amino Acids and Proteins from a Communications and Information Theoretic Perspective Semester Project II By: Dawit Nigatu Supervisor: Prof. Dr. Werner Henkel Transmission

More information

PROTEIN SYNTHESIS INTRO

PROTEIN SYNTHESIS INTRO MR. POMERANTZ Page 1 of 6 Protein synthesis Intro. Use the text book to help properly answer the following questions 1. RNA differs from DNA in that RNA a. is single-stranded. c. contains the nitrogen

More information

From Gene to Protein

From Gene to Protein From Gene to Protein Gene Expression Process by which DNA directs the synthesis of a protein 2 stages transcription translation All organisms One gene one protein 1. Transcription of DNA Gene Composed

More information

evoglow - express N kit distributed by Cat.#: FP product information broad host range vectors - gram negative bacteria

evoglow - express N kit distributed by Cat.#: FP product information broad host range vectors - gram negative bacteria evoglow - express N kit broad host range vectors - gram negative bacteria product information distributed by Cat.#: FP-21020 Content: Product Overview... 3 evoglow express N -kit... 3 The evoglow -Fluorescent

More information

A modular Fibonacci sequence in proteins

A modular Fibonacci sequence in proteins A modular Fibonacci sequence in proteins P. Dominy 1 and G. Rosen 2 1 Hagerty Library, Drexel University, Philadelphia, PA 19104, USA 2 Department of Physics, Drexel University, Philadelphia, PA 19104,

More information

How Molecules Evolve. Advantages of Molecular Data for Tree Building. Advantages of Molecular Data for Tree Building

How Molecules Evolve. Advantages of Molecular Data for Tree Building. Advantages of Molecular Data for Tree Building How Molecules Evolve Guest Lecture: Principles and Methods of Systematic Biology 11 November 2013 Chris Simon Approaching phylogenetics from the point of view of the data Understanding how sequences evolve

More information

Biosynthesis of Bacterial Glycogen: Primary Structure of Salmonella typhimurium ADPglucose Synthetase as Deduced from the

Biosynthesis of Bacterial Glycogen: Primary Structure of Salmonella typhimurium ADPglucose Synthetase as Deduced from the JOURNAL OF BACTERIOLOGY, Sept. 1987, p. 4355-4360 0021-9193/87/094355-06$02.00/0 Copyright X) 1987, American Society for Microbiology Vol. 169, No. 9 Biosynthesis of Bacterial Glycogen: Primary Structure

More information

From DNA to protein, i.e. the central dogma

From DNA to protein, i.e. the central dogma From DNA to protein, i.e. the central dogma DNA RNA Protein Biochemistry, chapters1 5 and Chapters 29 31. Chapters 2 5 and 29 31 will be covered more in detail in other lectures. ph, chapter 1, will be

More information

evoglow - express N kit Cat. No.: product information broad host range vectors - gram negative bacteria

evoglow - express N kit Cat. No.: product information broad host range vectors - gram negative bacteria evoglow - express N kit broad host range vectors - gram negative bacteria product information Cat. No.: 2.1.020 evocatal GmbH 2 Content: Product Overview... 4 evoglow express N kit... 4 The evoglow Fluorescent

More information

THE MATHEMATICAL STRUCTURE OF THE GENETIC CODE: A TOOL FOR INQUIRING ON THE ORIGIN OF LIFE

THE MATHEMATICAL STRUCTURE OF THE GENETIC CODE: A TOOL FOR INQUIRING ON THE ORIGIN OF LIFE STATISTICA, anno LXIX, n. 2 3, 2009 THE MATHEMATICAL STRUCTURE OF THE GENETIC CODE: A TOOL FOR INQUIRING ON THE ORIGIN OF LIFE Diego Luis Gonzalez CNR-IMM, Bologna Section, Via Gobetti 101, I-40129, Bologna,

More information

UNIVERSITY OF CAMBRIDGE INTERNATIONAL EXAMINATIONS General Certifi cate of Education Advanced Subsidiary Level and Advanced Level

UNIVERSITY OF CAMBRIDGE INTERNATIONAL EXAMINATIONS General Certifi cate of Education Advanced Subsidiary Level and Advanced Level *1166350738* UNIVERSITY OF CAMBRIDGE INTERNATIONAL EXAMINATIONS General Certifi cate of Education Advanced Subsidiary Level and Advanced Level CEMISTRY 9701/43 Paper 4 Structured Questions October/November

More information

Sex-Linked Inheritance in Macaque Monkeys: Implications for Effective Population Size and Dispersal to Sulawesi

Sex-Linked Inheritance in Macaque Monkeys: Implications for Effective Population Size and Dispersal to Sulawesi Supporting Information http://www.genetics.org/cgi/content/full/genetics.110.116228/dc1 Sex-Linked Inheritance in Macaque Monkeys: Implications for Effective Population Size and Dispersal to Sulawesi Ben

More information

Advanced Topics in RNA and DNA. DNA Microarrays Aptamers

Advanced Topics in RNA and DNA. DNA Microarrays Aptamers Quiz 1 Advanced Topics in RNA and DNA DNA Microarrays Aptamers 2 Quantifying mrna levels to asses protein expression 3 The DNA Microarray Experiment 4 Application of DNA Microarrays 5 Some applications

More information

Lect. 19. Natural Selection I. 4 April 2017 EEB 2245, C. Simon

Lect. 19. Natural Selection I. 4 April 2017 EEB 2245, C. Simon Lect. 19. Natural Selection I 4 April 2017 EEB 2245, C. Simon Last Time Gene flow reduces among population variability, reduces structure Interaction of climate, ecology, bottlenecks, drift, and gene flow

More information

Translation. A ribosome, mrna, and trna.

Translation. A ribosome, mrna, and trna. Translation The basic processes of translation are conserved among prokaryotes and eukaryotes. Prokaryotic Translation A ribosome, mrna, and trna. In the initiation of translation in prokaryotes, the Shine-Dalgarno

More information

Midterm Review Guide. Unit 1 : Biochemistry: 1. Give the ph values for an acid and a base. 2. What do buffers do? 3. Define monomer and polymer.

Midterm Review Guide. Unit 1 : Biochemistry: 1. Give the ph values for an acid and a base. 2. What do buffers do? 3. Define monomer and polymer. Midterm Review Guide Name: Unit 1 : Biochemistry: 1. Give the ph values for an acid and a base. 2. What do buffers do? 3. Define monomer and polymer. 4. Fill in the Organic Compounds chart : Elements Monomer

More information

The 3 Genomic Numbers Discovery: How Our Genome Single-Stranded DNA Sequence Is Self-Designed as a Numerical Whole

The 3 Genomic Numbers Discovery: How Our Genome Single-Stranded DNA Sequence Is Self-Designed as a Numerical Whole Applied Mathematics, 2013, 4, 37-53 http://dx.doi.org/10.4236/am.2013.410a2004 Published Online October 2013 (http://www.scirp.org/journal/am) The 3 Genomic Numbers Discovery: How Our Genome Single-Stranded

More information

Videos. Bozeman, transcription and translation: https://youtu.be/h3b9arupxzg Crashcourse: Transcription and Translation - https://youtu.

Videos. Bozeman, transcription and translation: https://youtu.be/h3b9arupxzg Crashcourse: Transcription and Translation - https://youtu. Translation Translation Videos Bozeman, transcription and translation: https://youtu.be/h3b9arupxzg Crashcourse: Transcription and Translation - https://youtu.be/itsb2sqr-r0 Translation Translation The

More information

Newly made RNA is called primary transcript and is modified in three ways before leaving the nucleus:

Newly made RNA is called primary transcript and is modified in three ways before leaving the nucleus: m Eukaryotic mrna processing Newly made RNA is called primary transcript and is modified in three ways before leaving the nucleus: Cap structure a modified guanine base is added to the 5 end. Poly-A tail

More information

Timing molecular motion and production with a synthetic transcriptional clock

Timing molecular motion and production with a synthetic transcriptional clock Timing molecular motion and production with a synthetic transcriptional clock Elisa Franco,1, Eike Friedrichs 2, Jongmin Kim 3, Ralf Jungmann 2, Richard Murray 1, Erik Winfree 3,4,5, and Friedrich C. Simmel

More information

CHEMISTRY 9701/42 Paper 4 Structured Questions May/June hours Candidates answer on the Question Paper. Additional Materials: Data Booklet

CHEMISTRY 9701/42 Paper 4 Structured Questions May/June hours Candidates answer on the Question Paper. Additional Materials: Data Booklet Cambridge International Examinations Cambridge International Advanced Level CHEMISTRY 9701/42 Paper 4 Structured Questions May/June 2014 2 hours Candidates answer on the Question Paper. Additional Materials:

More information

The role of the FliD C-terminal domain in pentamer formation and

The role of the FliD C-terminal domain in pentamer formation and The role of the FliD C-terminal domain in pentamer formation and interaction with FliT Hee Jung Kim 1,2,*, Woongjae Yoo 3,*, Kyeong Sik Jin 4, Sangryeol Ryu 3,5 & Hyung Ho Lee 1, 1 Department of Chemistry,

More information

In this article, we investigate the possible existence of errordetection/correction

In this article, we investigate the possible existence of errordetection/correction EYEWIRE BY DIEGO LUIS GONZALEZ, SIMONE GIANNERINI, AND RODOLFO ROSA In this article, we investigate the possible existence of errordetection/correction mechanisms in the genetic machinery by means of a

More information

The degeneracy of the genetic code and Hadamard matrices. Sergey V. Petoukhov

The degeneracy of the genetic code and Hadamard matrices. Sergey V. Petoukhov The degeneracy of the genetic code and Hadamard matrices Sergey V. Petoukhov Department of Biomechanics, Mechanical Engineering Research Institute of the Russian Academy of Sciences petoukhov@hotmail.com,

More information

Re- engineering cellular physiology by rewiring high- level global regulatory genes

Re- engineering cellular physiology by rewiring high- level global regulatory genes Re- engineering cellular physiology by rewiring high- level global regulatory genes Stephen Fitzgerald 1,2,, Shane C Dillon 1, Tzu- Chiao Chao 2, Heather L Wiencko 3, Karsten Hokamp 3, Andrew DS Cameron

More information

METHODS FOR DETERMINING PHYLOGENY. In Chapter 11, we discovered that classifying organisms into groups was, and still is, a difficult task.

METHODS FOR DETERMINING PHYLOGENY. In Chapter 11, we discovered that classifying organisms into groups was, and still is, a difficult task. Chapter 12 (Strikberger) Molecular Phylogenies and Evolution METHODS FOR DETERMINING PHYLOGENY In Chapter 11, we discovered that classifying organisms into groups was, and still is, a difficult task. Modern

More information

THE deuterostomes are a major group of metazoans which are worm-like, marine animals that live buried

THE deuterostomes are a major group of metazoans which are worm-like, marine animals that live buried Copyright 1998 by the Genetics Society of America The Mitochondrial Genome of the Hemichordate Balanoglossus carnosus and the Evolution of Deuterostome Mitochondria Jose Castresana,*,1 Gertraud Feldmaier-Fuchs,*

More information

Sequence analysis and comparison

Sequence analysis and comparison The aim with sequence identification: Sequence analysis and comparison Marjolein Thunnissen Lund September 2012 Is there any known protein sequence that is homologous to mine? Are there any other species

More information

Reducing Redundancy of Codons through Total Graph

Reducing Redundancy of Codons through Total Graph American Journal of Bioinformatics Original Research Paper Reducing Redundancy of Codons through Total Graph Nisha Gohain, Tazid Ali and Adil Akhtar Department of Mathematics, Dibrugarh University, Dibrugarh-786004,

More information
