arab Gene and Nucleotide Sequence of the arac Gene of Erwinia carotovora

Size: px
Start display at page:

Download "arab Gene and Nucleotide Sequence of the arac Gene of Erwinia carotovora"

Transcription

1 JOURNAL OF BACTERIOLOGY, Nov. 1985, p /85/ $02.00/0 Copyright 1985, American Society for Microbiology Vol. 164, No. 2 arab Gene and Nucleotide Sequence of the arac Gene of Erwinia carotovora SHAU-PING LEI, HUN-CHI LIN, LAUREL HEFFERNAN, AND GARY WILCOX* Department of Microbiology and Molecular Biology Institute, University of California, Los Angeles, California Received 7 May 1985/Accepted 7 August 1985 The arab and arac genes of Erwinia carotovora were expressed in Escherichia coli and Salmonella typhimurium. The arab and arac genes in E. coli,, and S. typhimurium were transcribed in divergent directions. In, the arab and arac genes were separated by 3.5 kilobase pairs, whereas in E. coli and S. typhimurium they were separated by 147 base pairs. The nucleotide sequence of the E. carotovora arac gene was determined. The predicted sequence of AraC protein of was 18 and 29 amino acids longer than that of AraC protein of E. coli and S. typhimurium, respectively. The DNA sequence of the arac gene of was 58% homologous to that of E. coli and 59% homologous to that of S. typhimunum, with respect to the common region they share. The predicted amino acid sequence of AraC protein was 57% homologous to that of E. coli and 58% homologous to that of S. typhimurium. The 5' noncoding regions of the arab and ardc genes of had little homology to either of the other two species. The L-arabinose gene-enzyme complex in both Escherichia coli (8, 9, 16) and Salmonella typhimurium (13, 19) has been characterized. The arac gene is a positive and negative regulator of the arabad operon and also a negative regulator of its own synthesis (4, 12, 17, 21). The regulation and organization of the arac and arabad genes in the two species are very similar. The DNA sequences of the arac genes are 82% homologous, and the predicted amino acid sequences of the AraC proteins are 92% conserved (6). Biochemical studies (3, 10) demonstrate that at the DNA level the gram-negative enteric bacterium Erwinia sp. is only 50% homologous to E. coli (10). To initiate a study of the organization and regulation of the ara genes in other members of the family Enterobacteriaceae, we cloned the arab and arac genes from EC. The distance between the arab and arac genes was determined, and the arac gene was sequenced. MATERIALS AND METHODS Strains and plasmids. The E. coli bacterial strains and plasmids used in this study are listed in Table 1. wild-type EC was obtained from A. K. Chatterjee, Kansas State University. Plasmids were introduced into E. coli strains by transformation (7). Plasmid DNA was prepared as described previously (18). Media, chemicals, and enzymes. MacConkey agar base (Difco Laboratories) was supplemented with 1% L-arabinose (Sigma Chemical Co.). Restriction enzymes were purchased from Bethesda Research Laboratories, Inc., and T4 DNA ligase was purchased from Boehringer Mannheim Biochemicals. [a-32p]datp and L-[35S]methionihe were purchased from Amersham, Corp. When required, ampicillin was added to the medium at 50,ug/ml. Plasmid constructions. chromosomal DNA was partially digested with EcoRI and ligated with EcoRIlinearized pbr322. Ligated DNA was introduced into arab or arac mutants of E. coli by transformation. The cells were spread on MacConkey-agar-arabinose-ampicillin plates, and * Corresponding author. 717 TABLE 1. E. coli strains and plasmids Strain or plasmid ara genotype Origin or reference E. coli strainsa HB101 ara-14 1 LA310 A(araB) LA3 A(araOC) LA840 arac::lacz Laura Cass, this laboratory MC1061 A(araDABPOC) 5 SB7223 A(araABPOC) Plasmidsb psh100 arac+b+a' This paper pspl8 arac+ This paper pspl10 arac+ This paper pspl11 arab+ This paper pspl147 arac+ This paper pspl148 arac+ This paper pjb8 15 pbr322 2 phl2 arac+b+a+d+ 19 ptb1 arac+b+ 11 puc13 25 a All the LA strains are derivatives of E. coli K-12 RR1 and contain the ara region from E. coli B/r. SB7223 is also a derivative of E. coli K-12 that contains the ara region from E. coli B/r. b The ara genes of phl2 and ptb1 are from S. typhimurium and E. coli 83/r, respectively. the plates were incubated at 37 C. Colonies that appeared red after 20 h of incubation were scored as complementation positive. The ara genes were also detected by complementation of ara mutants after transformation with the cosmid library of that was prepared previously (18). Construction of BAL 31 asymmetric deletion mutants. Plasmid pspl147 (containing the arac gene) was first digested with EcoRV (Fig. 1). The linearized DNA was then digested with BAL 31, and samples were removed at different time points. After phenol extraction and ethanol precipitation, the DNA was digested with HpaI and ligated with HincII-digested puc13. Ligated DNA was introduced into the E. coli strain containing the A(araOC)719 mutation

2 718 LEI ET AL. AprTCr ;:lc EcoRI I EcoRI BomHI Ligation (pspl147 AprTcJr EcoRI ElcRV EcoRV E rri CssHJ EcoRV j(corv + EcoRV BssHI EcoRI HpaI s BAL31 tpoi AvaI A#bI EcoRV Sco I /_EoRI pbr322 BomHI AprvoI MVI EcoRV BssHI EcoRI /OfpI ITI Ligation FIG. 1. Plasmid constructions. The thin curved lines indicate vector DNA, and the thick curved lines indicate chromosomal DNA. The curved arrow indicates transcription orientation. by transforniation. The cells were spread on MacConkey aga~-arabinose-ampicillin indicator medium, and the plates were.incubated at 37 C for 20 h. Ot-ganization of the arab and arac genes. Southern blot hybridizations were used to define the organization of the arab and arac genes in. A 1.4-kilobase-pair (kb) DNA' fragment that contains the entire arac gene was isolated from pspl8 (Fig. 1 and Fig. 2) and used as an arac probe. A 1.5-kb EcoRI fragment that contains the arab gene was isolated from pspl1i and used as an arab J. BACTERIOL. probe. Both the arac and arab probes were `nick translated as described previously (22). Plasmids psi4100 (arab and arac), pspl10 (arac), and pspl11 (arab) w'ere digested with various restriction enzymes, and the fragments were separated on a 1% agarose gel. The DNA was bi-blotted (24) onto nitrocellulose paper and hybridized with the arac or arab probes, respectively. DNA sequence analysis. The DNA sequence was determined by the Sanger dideoxy chain-te'rmination method (23). Plasmid pspl8 (Fig. 1) was digested with EcoRI and HindIII and ligated with EcoRI- and HindIII-digested M13mplO or M13mpll to obtain bpth orientations. Nonrandom deletions were kenerated in the M13mpf10 and M13mpll derivatives (20). The DNA sequence beyond the arac gtructural gene anfd the HpaI site was determined with an M13mp8 derivative containing a 900-base-pair taql DNA fragment (Fig. 2). In vitro anaiysis of plasmid-enioded proteins. E. coli SB7223 was used to prepare an S-30 fraction as described previously (27) except that the cells were broken in a tirench pressure cell at 5,000 lb/in2. The in vitro protein synthesis reactions were performed as described previously (19). RESULTS Cloning of arac and arib genes. An library was constructed in pbr322 from a partial EcoRI digestion of total DNA as deecribed in Materials and Methods. Plasmid pspl10 containing the arac,, gene was identified from the library by its ability to-complement an E. coli strain containihg the A(araOC)719 mutation. The insert in pspl10 is 16 kb and contains an internal EcoRI site (Fig. 1). Each of the EcoRI fragments derived from the pspl10 insert was subcloned in pbr322. Neither subclone could complement the arac mutation. This suggested that the EcoRI site was within the arac gene. A 3-kb Aval fragment from pspl10 was subcloned into the AvaI site of pbr322. The resulting plasmid, designated pspl147 (Fig.. 1), complemented the A(araOC)719 mutation (Table 2). Plasmid pspl147 could also repress lacz expression when the plasmid existed in trans to an E. coli arac::lacz chromosomal fusion strain (Table 2). An arab-containing plasmid (pspl11) was identified from the EcoRI library by its ability to complement an E. coli strain containing the A(araB)809 mutation (Table 2). Plasmid pspl11 contained a 3.5-kb EcoRI insert. Plasmid psh100 was identified from a cosmid library of E. carotovora (18) by its ability to complement the E. coli A(araOC)719 mutation. Subsequently, plasmid psh100 was also found to complement the A(araB)809 mutation (Table 2). The DNA insert in psh100 is approximately 35 kb. However, none of the plasmids from either library was able TABLE 2. Identification and chatacterization of the cldned ara genes of a Ability of LA840 (arac::i1acz) to Ability of strain [genotype] to utilize arabinose utilize: Plasmid MC1061 LA3 LA310 A(araDABPOC) tl(araoc)719 A(araB)809 Arabinose Lactose No plasmid pspl11 (arab) pspl147 (arac) psh100 (arab arac) NDb ND a The cells transformed with the plasmids were plated on MacConkey agar-arabinose-ampicillin or MacConkey agar-lactose-ampicillin plates. Symbols: + ability to utilize arabinose or lactose; -, inability to utilize these carbohydrates. b ND, Not determined.

3 VOL. 164, 1985 arab AND arac GENES OF E. CAROTOVORA 719 uruc OIarB OQB_probe are_ I probei E carotovlria ci cr c cr m V - D1 111 I II ZI I I(IcT I EccoliDovA B A' FIG. 2. Somparison fragment wa,s'derived from plasm,id psh100. The DNA fra'gment derived fr,om B of the organization of the ara operon and arac gene of, E. coli, and S. typhimurium. The relative position of each gene is indicated by A, B, C, and D; the transcription orientation is indicated by an arrow. The 21-kb DNA pspl8 and used as the arac probe for Southern blot hybridizations is indicated. 1~~~~~~~~~~~~~~~~~~~~~~~1k BamHI and HindIII sites on the vector psfil8 (tfig. 1) were used to excise the probe. The DNA fragment used a,s thetarab probe was from pspl11. The arab probe is t'he leftmqst 3..S-kb EeoRI fragment. The restriction map of S. typhimurium is from Lin et al. (19), and that of E. coli is from Greenfield et al. (11). F.oGlyai GCCTCGCGCATGACCAGCGGCC AGCCtiATGACGTCCATCGGGTATGiAGCTGATCGTTATTTCTGCCTGTGTGCTGGGTGGGGTATCGGTAAAGGCGGGATCGGTAAAATTTCCTACGTGGTGGCTGGCGTACTGAT TTTAGGTACGGTCGAGAACGCGATGAACCTGCTAAATATCTCGCCGTTCTCTCAGTATGTGGTACGTGGCCTGATTCTGCTGGCGGCGGTTATCT,TCGACCGCTACAAACAACTG GCAAAACGGTATAAATtCCTGTG(G- CAGGTAAGAGCTTATCCCAGTAGGCGTTATTGGTGCAGCCAGTTTGGACACGGACAGCGCGCAGAAACCGGAGCGTACACGTAGT ACGTGAGGATTTTGuAGCACTGCCCAGGTTCAAAATGGCAAATAAAATAGTCCTAATGGGATGGCTCTAAGTcATTCTGCCGCGTCTTATCATTACTAGCGTGTACCggrCCTG 1 ~~~~~~~~~~~~~~50 ATG TA'r CAC CGT ATG GCG CAT GAA TCT CAG CCT AAT CCA CTG TTG CCG GGA TAT TCG TTC AAC GCT TAC CTC GTT GCA GGT TTG ACG l4et Tyr Hi s Arj AXet Ala His Glu Ser Gln Pro Asn Pro Leu Leu Pro Gly Tyr Saer Phe Asn Ala Tyr Leu Val Ala Gly Leu Thr CCG ATT CTG GCA GAA GGG CCA CTC GAT TTC TTT Arc GAT CGC CCT GAC GGC ATG AAA GGC TAC-ATC ATC AAT CTC ACC ATG AAA GGA Pro Ile Leu Ala Glu Gly Pro Leu Asp Phe Phe Ile Asp Arg Pro Asp Gly Met Lys Gly Tyr Ile Ile Asn Leu Thr Met Lys Gly CAu GGC CAG ATT TTT GAT GuT GAT GMA ACT TTT TTC TGT AAT CCC GGC GAT CTA CTG TTG TTT CCG CCG AAA TCG 'ACG CAT TTT TAT lnf tlh eorg Asp Gly Asp Glu Thr Phe Phe Cys Asn Pro Gly Asp Leu Leu Leu Phe Pro Pro Lys Ser Thr His Phe Tyr.59 ~~~~~~~~~300 higg CGT CCa AtC TCA AGC GAC TGH TGlil TAT CAT CGC TGh GTC TAT TTT CGA CCA CGC GCC TAT TGG GCA GAT TGG CTG GAG TGG CAT aelpy Arjf er ProSer Ser Asp Cys Trp Tyr His Arg Trp Val Tyr Phe Arg Pro Arg Ala Tyr Trp Ala AspTrip Leu Glu Trp His A; AAA AGC AGC GTiG ATT GGG CGC ATG AGT TTG CCG AAT MC CAG TTG TTA TTG GAA TTC GAC AGG CTG TTT GCC MT ATC GAG CAG Tnr Lys r Ser er Gly Ile Gly Arg letser Leu Pro Asn Asn Gln Leu Leu Leu Glu Phe Asp A r Leu VAla Asn Ile Glu Gln ACCTCAG C AT TCC GGCGi CGC TTC TCG GAG TTAr CTG GGT ATG CC CTG CTG GMCGA CTG TTG CTC AGA GCG ATG GAA GAA GAT CCA Tnr Gl n Arj Ser Gly Arj ArLPhe Ser Glu Glu Leu Gly Met Asn Leu Leu Glu Arg TyrLeLeu Arg i Ala Met Glu Glu Asp Pro I CAG AGC CCA CAG AAA ATTAT GC CA GC GTT TT GT MCCC G GAT ACC AGT TTT CTG GCT GGA GAG TTG CGT ATT GAC -Gln Ser Pro Gln Lys AIle Get Asp Pro ArT Val Ile Glu Ala Cys Gln Phe Ile Thr Ser Asn Leu Ala Gly Glu Leu Arg Ile Asp GCCA GTG GCG G CAC GCT TCC CTG TCG CCG TCG CGCTG GTG CTT CTG TTC CGT GAA CAG GTG GGT ATT GAT T TTG CGC TGG CGT GIu Val Ala Arg His Val Cys Alsa Leu Ser Pro Ser Arg Leu 1is Leu Phe Arg AluGln Val Gly Ile Asn Ile Leu Arg Trp Arg GAA GAT CAG CGC GT( ATT CGC GCC AAG CTG TT G CAG ACG ACG CAG GAA TCC ATC GCC AT ATT GGT CGC GTG MTG GGG TAT GAC Tlu Asp Gln Arg Val Ile Arg Ala Lys Leu Leu Ler Asn Asn Gin lu GirI heasp,a,sn IlePe G AlaVas lie Gly Tyr Asp GAT CAG CTC TAT TTC TCA CGC GTA TTC CGTAAG CGA GTC GGT GTC AGC CCT AGC GAT TTT CGT CGC CGC AGC AGC GAA ATC MC TAT AspGT n Leu Tyr Pne Ser Arg Val Phe Arg Lys Arg Val Gly Val Ser Pro Ser PAr Asp ALrL e Are Arg Ser Ser Glu Ile Asn Tyr CCA GCA GCC AAA ACG CTG CCC GTC GCG TGG GAG CAG ATA CCC CAT GT ATG AGC AGT TAA CGGGCACAAAA iro Ala Ala Lys Thr Leu Pro Val Ale Trp Gly Glu Gln Ile Pro PHes Ala Val Ser Ser 289 FIG. 3. DNA sequence and predicted amino acid sequence of the arac gene of. The DNA sequence of the arac gene was determined from a 1.4-kb DNA fragment of pspl8 by a method described previously (20). Numbering starts from the first nucleotide of the first methionine coding for AraC protein. The GGAG sequence in the box indicates the presumed ribosome-binding site.

4 720 LEI ET AL. Erwlin/a arac- Salmonella a roac - co C\i N% _j J I UC) F- m~ Q a. qmw -r co/i orac FIG. 4. In vitro analysis of the molecular weight of AraC protein of. Autoradiograph of the protein synthesized in vitro. Each S-30 reaction mixture (6,ul) prepared as described in Materials and Methods was mixed with loading buffer, heated at 100 C for 5 min, and loaded on a 10% sodium dodecyl sulfatepolyacrylamide (acrylamide:bis 30:0.8) gel. Plasmid phl2 = contains the arac gene from S. typhimurium which codes for a protein of 281 amino acids with a molecular weight of 32,034 (6). This plasmid codes for P-lactamase. Plasmid pspl148 contains the arac gene from and does not code for 3-lactamase. Plasmid ptb1 contains the arac gene from E. coli which codes for a protein of 292 amino acids and has a molecular weight of 33,315 (11). This plasmid does not produce P-lactamase. The location of AraC protein of S. typhimurium in this gel system was determined previously (19). The location of each AraC protein is indicated by the horizontal lines. to complement the A(araDABPOC) mutation in strain MC1061, indicating that they do not contain all of araa and arad (Table 2). Organization and restriction map of the arab and arac genes. To determine the location of the arac and arab genes on plasmid psh100, Southern blot hybridizations were performed (26). The restriction maps of the arab and arac genes are shown in Fig. 2. A 6-kb BssHII fragmnent in psh100 was identified that hybridized with both the arab probe (the 3.5-kb EcoRI fragment of pspl11) and the arac probe (the 1.4-kb HindIII-BamHI fragment of pspl8) (Fig. 2). The locations of the BssHII sites were identified in both pspl147 and pspl11. The location and orientation of the arab and araa genes were determined by DNA sequence analysis of the termini of the 3.5-kb EcoRI fragment of pspl11 (data not shown). By comparing the sequence of each end of this fragment to the sequence of the arab and araa genes of S. typhimurium, regions of homology in arab (base pairs 1 to 90 from the beginning of the arab gene) and araa (base pairs 1099 to 1449 from the beginning of the araa gene) were found. The sequence of part of the 2-kb EcoRI fragment of pspl10 was homologous to the arac gene of S. J. BACTERIOL. typhimurium (see below). From the location of the BssHII 5ites, the orientation of the arab gene in pspl11, and the sequence homology, we conclude that there is about 3.5 kb between the arab and arac genes in (Fig. 2) and that they are transcribed in opposite directions. DNA sequence analysis of the arac gene. The complementation analysis of plasmid pspl10 and its EcoRI subclones suggested that the EcoRI site was within the arac gene. The DNA sequence from the EcoRI site towards the BssHII site was determined and was homologous to the arac gene of E. coli and S. typhimurium, confirming that the EcoRI site was within arac. DNA sequence analysis was carried out in both directions from the EcoRI site. The DNA sequence of the arac gene of and the predicted amino acid sequence of AraC protein are shown in Fig. 3. A presumed ribosome-binding site, GGAG, was followed by an open reading frame that is homologous to the arac gene from E. coli and S. typhimurium. The predicted AraC protein contains 310 amino acid residues and has a calculated molecular weight of 35,144. In vitro analysis of plasmid-endoded protein. The size of E. carotovora AraC protein made in vitro was estimated by analysis on sodium dodecyl sulfate-polyacrylamide gels. Attempts to identify AraC protein with plasmid pspl147 as the DNA template were unsuccessful, probably because the location of AraC protein was obscured by,b-lactamase. Therefore, the arac-containing AvaI fragment of pspl147 (Fig. 1) was blunt ended with the large fragment of DNA polymerase I and then cloned into the ScaI site of pbr322, thus, destroying the bla gene. The resulting plasmid, designated pspl148 (Fig. 1), was used to direct protein synthesis. Subsequent gel electrophoresis and autoradiography allowed us to identify AraC protein and determine that its molecular weight was 36,000 (Fig. 4). DISCUSSION The arab and arac genes of were identified from both the pbr322 and cosmid libraries. The location of the arac gene in pspl10 indicates that approximately 1.5 kb separates arac from the EcoRI site immediately to the left of arac in Fig. 1. The observation that expression of the arab gene in pspl11 requires the arac regulatory protein indicates that pspl11 contains a functional promoter for the arab gene. In conjunction with the results of the Southern blot hybridization of psh100 with arab or arac probes, we concluded that the arac and arab genes in are separated by approximately 3.5 kb. A comparison of the DNA sequence of the arac and part of the arab genes of E. carotovora with the DNA sequence of S. typhimurium (6, 19) indicates that the orientation of arac relative to arab genes is identical in all three organisms. Thus, the genes must also be divergently transcribed in (Fig. 2). The approximately 3.5 kb that separates the arac and arab genes in is quite different from the situation for E. coli and S. typhimuriurh in which the genes are adjacent to one another and share a common controlling region of 147 base pairs. The arac gene of E, carotovora and its flanking regions were sequenced. An open reading frame that followed a presumed ribosome-binding site was identified. This open reading frame codes for a protein of 310 amino acids that is homologous to the predicted AraC protein sequence of S. typhimurium and E. coli. The methionine at position five corresponds to the N-terminal anmino acid of the AraC protein of both E. coli and S. typhimurium. We tentatively conclude that the AraC protein of starts at the

5 VOL. 164,1985 T. coli S. typhimurium E. F. carotovora coli S. typhimurium 'E. coli 9. typhiimurium T. coli 9. typhimurium!. coli l. typhimurium arab AND arac GENES OF E. CAROTOVORA 721 MY HR MA H ES Q P N AP L L PG Y SF NAY LV AG L T PI LA E G P L D F F ID R PD GM KG Y II R: :A:A QG:D: E :::I::H:: ::E:N:Y:::G:I:H:H:L:A:RE:L NLTM KGQGQIFDGD E TFFCNPGDLLLFPP KS THFYGRSPSSDCWYHRWVYFRP VIR: : :VVK N QGR E: V R: ::I:::::G E I :H: ::H:EARE::::: :::IR:E:V:NNNG:Q:V:R:::I:::::GEI:H:::H:DASE:::Q:::::: RAYWADWLEWHTKSSGIGR M S LPNNQLLLEFDRLFANA IEQTQRSGRRFSE E L : : : :HE: : N : P S I F A N T :F F R :D E A H Q P H: SD: : G Q I :N A G :GE: : Y: :L: :QE: :T: P: IFAQT:FFR A:DEARQPH: SE: :GQI :SAG:GE: A:Y: :L: GM N L L E R L L L RAM E E D P Q S P Q KIM D PRV I E A C Q FITS N LA AG E L RID E VAR H V AI::::Q ::::R :AINE :LHPP ::N:: R : : : : Y :SDH ::DSNFD :AS:: Q AI::::Q ::::R :AVINE :LHPP ::S:: RD : : : Y :SDH ::DSHFD : AS:: Q CLSPSRLAHLFREQVGINILRWREDQRVIRA K LLLQTTQESIANIGRVVGYDD :::: Q :L::SV:S::::::IS Q :::::S:: R M P ::TV:: N ::F ::::S::::Q :L : :SV :S::::::ISQ :::::S:: RMP ::TV: :N::F Q L Y F S R V F R K R V G V S P S D F R R R S S E I N Y P A A K T L P V A W G E Q I P H A V S S F. coli :::::::: K:CT:A:::E::AGCE: K V NDV:V K : S Ltyphimurium. :::::::: K: C T: A::: E:: A G C E FIG. 5. Comparison of the amino acid sequences of the arac genes of, E. coli, and S. typhimurium. Amino acids are given in the single letter code: A, alanine; C, cysteine; D, aspartic acid; E, glutamic acid; F, phenylalanine; G, glycine; H, histidine; I, isoleucine; K, lysine; L, leucine; M, methionine; N, asparagine; P, proline; Q, glutamine; R, arginine; S, serine; T, threonine; V, valine; W, tryptophan; and Y, tyrosine. Symbols: A, deletion; :, identical amino acid; -, no comparable region because initiation and termination are at different locations. first methionine of the open reading frame, because there is no Shine-Dalgarno sequence upstream from the methionine at position five. The predicted amino acid sequences of the AraC protein from E. coli, S. typhimurium, and are com- TABLE 3. Codon usage for the arac genes of, E. coli, and S. typhimuriuma AA Codon EC E S AA Codon EC E S Phe TTC Ala GCG TIT GCT Leu CTG GCA TTG GCC CTC Tyr TAT CTA TAC TTA His CAC CTT CAT Ile ATC Arg CGT ATT CGC ATA CGA Met ATG AGG Val GOT AGA GTC CGG GTA Lys AAA GTG AAG Ser TCC Asp GAC TCT GAT TCG Glu GAA AGC GAG TCA Cys TGT AGT TGC Pro CCT Trp TGG CCA Gly GGA CCG GGT CCC GGC Thr ACG GGG ACC Asn AAT ACT AAC ACA End TAA Gln CAG TAG CAA TGA a AA, Amino acid; EC, ; E, E. coli; and S, S. typhimurium. pared in Fig. 5. In the common regions, 57% of the sequences are conserved between and E. coli, compared with 58% between and S. typhimurium. The regions which had more differences are located at amino acids 56 to 72, 90 to 95, 114 to 153, 172 to 180, and 191 to 204. The sequence between amino acids 114 and 153 is especially variable. Since there is very little sequence homology and two deletions and one insertion, perhaps this region is not important for AraC function except as a spacer. However, the regions between amino acids 13 and 55, 73 and 89, 96 and 113, 154 and 174, 181 and 190, and especially 205 and 283 are conserved in these three species. This is consistent with an earlier report (6) indicating sequence conservation in the amino- and carboxy-terminal regions of the AraC proteins from E. coli and S. typhimurium. In addition, several non-self-regulating arac mutants have been identified in S.typhimurium. These mutations are located in the carboxy-terminal region of the AraC protein and are at residues which are conserved in all three organisms (P. Clarke, J.-H. Lee, K. Burke, and G. Wilcox, unpublished data). Thus, this region might contain sequences required for DNA binding. The structures of the ara genes in different organisms were compared. The 5' noncoding regions of the arac gene in both E. coli and S. typhimurium were highly conserved. Surprisingly, in the DNA sequence upstream of the predicted translation initiation site has very little homology to either E. coli or S. typhimurium sequences. However, the arac gene and part of the arab and araa (S.-P. Lei, unpublished result) genes are about 56% homologous between and S. typhimurium. In addition, the codon usage in the arac genes of these three species is not very different except that in there is no CTT codon used for leucine, and the AGT and the CCA codons are more frequently used for serine and proline, respectively (Table 3). The predicted amino terminus of the AraC protein from is four amino acid residues longer than that from E. coli and S. typhimurium. The predicted carboxy terminus of the AraC protein from E. carotovora is 14 and 25 amino acid residues longer than that from E. coli and S. typhimurium, respectively.

6 722 LEI ET AL. Although the degree of heterogeneity of the arac DNA sequences is greater between and E. coli than between E. coli and S. typhimurium, the ara genes from E. carotovora can be expressed in E. coli (Table 2) and S. typhimurium (data not shown). The results shown in Table 2 indicate that the AraC regulatory protein of has activator function on the arabad operon and represses arac expression. Thus, the regulatory functions of AraC protein remain the same, even though the degree of heterogeneity of the DNA sequence of the arac gene is high. ACKNOWLEDGMENTS We thank Patrick Clarke for providing the S-30 used in the in vitro analysis and for discussion of his unpublished data. We thank Lori Stoltzfus, Laura Cass, and Patrick Clarke for comments on the manuscript. This research was supported by Public Health Service grants GM and GM from the National Institute of General Medical Sciences. LITERATURE CITED 1. Bolivar, F., and K. Backman Plasmids of Escherichia coli as cloning vectors. Methods Enzymol. 68: Bolivar, F., R. L. Rodriguez, P. J. Greene, M. C. Betlach, H. L. Heyneker, H. W. Boyer, J. H. Crosa, and S. Falkow Construction and characterization of new cloning vehicles. II. A multipurpose cloning system. Gene 2: Brenner, D. J Deoxyribonucleic acid reassociation in the taxonomy of enteric bacteria. Int. J. Syst. Bacteriol. 23: Casadaban, M Regulation of the regulatory gene for the arabinose pathway, arac. J. Mol. Biol. 104: Cdaban, M. J., and S. N. Cohen Analysis of gene control signals by DNA fusion and cloning in Escherichia coli. J. Mol. Biol. 138: Clarke, P., H. C. Lin, and G. Wilcox The nucleotide sequence of the arac regulatory gene in Salmonella typhimurium LT2. Gene 18: Cohen, S. N., A. C. Y. Chang, and L. Hen Nonchromosomal antibiotic resistance in bacteria: genetic transformation of Escherichia coli by R-factor DNA. Proc. Natl. Acad. Sci. USA 69: Englesberg, E Regulation in the L-arabinose system, p In H. J. Vogel (ed.), Metabolic regulation, vol 5. Academic Press, Inc., New York. 9. Englesberg, E., and G. Wilcox Regulation: positive control. Annu. Rev. Genet. 8: Filer, D., R. Dhar, and A. V. Furano The conservation of DNA sequences over very long periods of evolutionary time. Eur. J. Biochem. 120: Greenfield, L., T. Boone, and G. Wilcox DNA sequence of the arabad promoter in Escherichia coli B/r. Proc. Natl. Acad. J. BACTERIOL. Sci. USA 75: Hendrickson, W., and R. Schleif A dimer of AraC protein contacts three adjacent major groove regions of the arai DNA site. Proc. Natl. Acad. Sci. USA 82: Horwitz, A. H., L. Heffernan, C. Morandi, J. H. Lee, J. Timko, and G. Wilcox DNA sequence of the arabad-arac controlling site region in Salmonella typhimurium LT2. Gene 14: Horwitz, A. H., C. Morandi, and G. Wilcox Deoxyribonucleic acid sequence of arabad promoter mutants of Escherichia coli. J. Bacteriol. 142: Ish-Horowicz, D., and J. F. Burke Rapid and efficient cosmid cloning. Nucleic. Acids Res. 9: Lee, N Molecular aspects of ara regulation, p In J. H. Miller and W. S. Reznikoff (ed.), The operon. Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y. 17. Lee, N. L., W. 0. Gielow, and R. G. Wailace Mechanism of arac autoregulation and the domains of two overlapping promoters, P, and PBAD, in the L-arabinose regulatory region of Escherichia coli. Proc. Natl. Acad. Sci. USA 78: Lei, S.-P, H.-C. Lin, L. Heffernan, and G. Wilcox Cloning of the pectate lyase genes from Erwinia carotovora and their expression in Escherichia coli. Gene 35: Lin, H. -C., S. -P. Lei, and G. Wilcox The arabad operon from Salmonella typhimurium LT2. I. Nucleotide sequence of arab and primary structure of its product, ribulokinase. Gene 34: Lin, H.-C, S.-P Lei, and G. Wilcox An improved DNA sequencing strategy. Anal. Biochem. 147: Miyada, C. G., L. Stoltzfus, and G. Wilcox Regulation of the arac gene of Escherichia coli: catabolite repression, autoregulation, and effect on arabad expression. Proc. Natl. Acad. Sci. USA 81: Rigby, P. W. J., M. Dieckmann, C. Rhodes, and P. Berg Labeling deoxyribonucleic acid to high specific activity in vitro by nick translation with DNA polymerase I. J. Mol. Biol. 113: Sanger, F., S. Nicklen, and A. R. Coulson DNA sequencing with chain-terminating inhibitors. Proc. Natl. Acad. Sci. USA 74: Smith, G. E., and M. D. Summers The bidirectional transfer of DNA and RNA to nitrocellulose or diazobenzyloxymethyl paper. Anal. Biochem. 109: Vieira, J., and J. Messing The puc plasmids, an M13mp7-derived system for insertion mutagenesis and sequencing with synthetic universal primer. Gene 19: Wahl, G. M., M. Stern, and G. R. Stank Efficient transfer of large DNA fragments from agarose gels to diazobenzyloxymethyl-dextran sulfate. Proc. Natl. Acad. Sci. USA 76: Wilcox, G., P. Meuris, R. Bass, and E. Englesberg Regulation of the L-arabinose operon BAD in vitro. J. Biol. Chem. 249:

SEQUENCE ALIGNMENT BACKGROUND: BIOINFORMATICS. Prokaryotes and Eukaryotes. DNA and RNA

SEQUENCE ALIGNMENT BACKGROUND: BIOINFORMATICS. Prokaryotes and Eukaryotes. DNA and RNA SEQUENCE ALIGNMENT BACKGROUND: BIOINFORMATICS 1 Prokaryotes and Eukaryotes 2 DNA and RNA 3 4 Double helix structure Codons Codons are triplets of bases from the RNA sequence. Each triplet defines an amino-acid.

More information

Practical Bioinformatics

Practical Bioinformatics 5/2/2017 Dictionaries d i c t i o n a r y = { A : T, T : A, G : C, C : G } d i c t i o n a r y [ G ] d i c t i o n a r y [ N ] = N d i c t i o n a r y. h a s k e y ( C ) Dictionaries g e n e t i c C o

More information

SUPPORTING INFORMATION FOR. SEquence-Enabled Reassembly of β-lactamase (SEER-LAC): a Sensitive Method for the Detection of Double-Stranded DNA

SUPPORTING INFORMATION FOR. SEquence-Enabled Reassembly of β-lactamase (SEER-LAC): a Sensitive Method for the Detection of Double-Stranded DNA SUPPORTING INFORMATION FOR SEquence-Enabled Reassembly of β-lactamase (SEER-LAC): a Sensitive Method for the Detection of Double-Stranded DNA Aik T. Ooi, Cliff I. Stains, Indraneel Ghosh *, David J. Segal

More information

NSCI Basic Properties of Life and The Biochemistry of Life on Earth

NSCI Basic Properties of Life and The Biochemistry of Life on Earth NSCI 314 LIFE IN THE COSMOS 4 Basic Properties of Life and The Biochemistry of Life on Earth Dr. Karen Kolehmainen Department of Physics CSUSB http://physics.csusb.edu/~karen/ WHAT IS LIFE? HARD TO DEFINE,

More information

SUPPLEMENTARY DATA - 1 -

SUPPLEMENTARY DATA - 1 - - 1 - SUPPLEMENTARY DATA Construction of B. subtilis rnpb complementation plasmids For complementation, the B. subtilis rnpb wild-type gene (rnpbwt) under control of its native rnpb promoter and terminator

More information

Supplemental data. Pommerrenig et al. (2011). Plant Cell /tpc

Supplemental data. Pommerrenig et al. (2011). Plant Cell /tpc Supplemental Figure 1. Prediction of phloem-specific MTK1 expression in Arabidopsis shoots and roots. The images and the corresponding numbers showing absolute (A) or relative expression levels (B) of

More information

Advanced topics in bioinformatics

Advanced topics in bioinformatics Feinberg Graduate School of the Weizmann Institute of Science Advanced topics in bioinformatics Shmuel Pietrokovski & Eitan Rubin Spring 2003 Course WWW site: http://bioinformatics.weizmann.ac.il/courses/atib

More information

High throughput near infrared screening discovers DNA-templated silver clusters with peak fluorescence beyond 950 nm

High throughput near infrared screening discovers DNA-templated silver clusters with peak fluorescence beyond 950 nm Electronic Supplementary Material (ESI) for Nanoscale. This journal is The Royal Society of Chemistry 2018 High throughput near infrared screening discovers DNA-templated silver clusters with peak fluorescence

More information

Biosynthesis of Bacterial Glycogen: Primary Structure of Salmonella typhimurium ADPglucose Synthetase as Deduced from the

Biosynthesis of Bacterial Glycogen: Primary Structure of Salmonella typhimurium ADPglucose Synthetase as Deduced from the JOURNAL OF BACTERIOLOGY, Sept. 1987, p. 4355-4360 0021-9193/87/094355-06$02.00/0 Copyright X) 1987, American Society for Microbiology Vol. 169, No. 9 Biosynthesis of Bacterial Glycogen: Primary Structure

More information

Crick s early Hypothesis Revisited

Crick s early Hypothesis Revisited Crick s early Hypothesis Revisited Or The Existence of a Universal Coding Frame Ryan Rossi, Jean-Louis Lassez and Axel Bernal UPenn Center for Bioinformatics BIOINFORMATICS The application of computer

More information

Number-controlled spatial arrangement of gold nanoparticles with

Number-controlled spatial arrangement of gold nanoparticles with Electronic Supplementary Material (ESI) for RSC Advances. This journal is The Royal Society of Chemistry 2016 Number-controlled spatial arrangement of gold nanoparticles with DNA dendrimers Ping Chen,*

More information

Characterization of Pathogenic Genes through Condensed Matrix Method, Case Study through Bacterial Zeta Toxin

Characterization of Pathogenic Genes through Condensed Matrix Method, Case Study through Bacterial Zeta Toxin International Journal of Genetic Engineering and Biotechnology. ISSN 0974-3073 Volume 2, Number 1 (2011), pp. 109-114 International Research Publication House http://www.irphouse.com Characterization of

More information

SSR ( ) Vol. 48 No ( Microsatellite marker) ( Simple sequence repeat,ssr),

SSR ( ) Vol. 48 No ( Microsatellite marker) ( Simple sequence repeat,ssr), 48 3 () Vol. 48 No. 3 2009 5 Journal of Xiamen University (Nat ural Science) May 2009 SSR,,,, 3 (, 361005) : SSR. 21 516,410. 60 %96. 7 %. (),(Between2groups linkage method),.,, 11 (),. 12,. (, ), : 0.

More information

Supplementary Information for

Supplementary Information for Supplementary Information for Evolutionary conservation of codon optimality reveals hidden signatures of co-translational folding Sebastian Pechmann & Judith Frydman Department of Biology and BioX, Stanford

More information

Electronic supplementary material

Electronic supplementary material Applied Microbiology and Biotechnology Electronic supplementary material A family of AA9 lytic polysaccharide monooxygenases in Aspergillus nidulans is differentially regulated by multiple substrates and

More information

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1 Supplementary Figure 1 Zn 2+ -binding sites in USP18. (a) The two molecules of USP18 present in the asymmetric unit are shown. Chain A is shown in blue, chain B in green. Bound Zn 2+ ions are shown as

More information

Clay Carter. Department of Biology. QuickTime and a TIFF (Uncompressed) decompressor are needed to see this picture.

Clay Carter. Department of Biology. QuickTime and a TIFF (Uncompressed) decompressor are needed to see this picture. QuickTime and a TIFF (Uncompressed) decompressor are needed to see this picture. Clay Carter Department of Biology QuickTime and a TIFF (LZW) decompressor are needed to see this picture. Ornamental tobacco

More information

3. Evolution makes sense of homologies. 3. Evolution makes sense of homologies. 3. Evolution makes sense of homologies

3. Evolution makes sense of homologies. 3. Evolution makes sense of homologies. 3. Evolution makes sense of homologies Richard Owen (1848) introduced the term Homology to refer to structural similarities among organisms. To Owen, these similarities indicated that organisms were created following a common plan or archetype.

More information

Supplemental Figure 1.

Supplemental Figure 1. A wt spoiiiaδ spoiiiahδ bofaδ B C D E spoiiiaδ, bofaδ Supplemental Figure 1. GFP-SpoIVFA is more mislocalized in the absence of both BofA and SpoIIIAH. Sporulation was induced by resuspension in wild-type

More information

Proteins: Characteristics and Properties of Amino Acids

Proteins: Characteristics and Properties of Amino Acids SBI4U:Biochemistry Macromolecules Eachaminoacidhasatleastoneamineandoneacidfunctionalgroupasthe nameimplies.thedifferentpropertiesresultfromvariationsinthestructuresof differentrgroups.thergroupisoftenreferredtoastheaminoacidsidechain.

More information

Supporting Information for. Initial Biochemical and Functional Evaluation of Murine Calprotectin Reveals Ca(II)-

Supporting Information for. Initial Biochemical and Functional Evaluation of Murine Calprotectin Reveals Ca(II)- Supporting Information for Initial Biochemical and Functional Evaluation of Murine Calprotectin Reveals Ca(II)- Dependence and Its Ability to Chelate Multiple Nutrient Transition Metal Ions Rose C. Hadley,

More information

Codon Distribution in Error-Detecting Circular Codes

Codon Distribution in Error-Detecting Circular Codes life Article Codon Distribution in Error-Detecting Circular Codes Elena Fimmel, * and Lutz Strüngmann Institute for Mathematical Biology, Faculty of Computer Science, Mannheim University of Applied Sciences,

More information

Protein Threading. Combinatorial optimization approach. Stefan Balev.

Protein Threading. Combinatorial optimization approach. Stefan Balev. Protein Threading Combinatorial optimization approach Stefan Balev Stefan.Balev@univ-lehavre.fr Laboratoire d informatique du Havre Université du Havre Stefan Balev Cours DEA 30/01/2004 p.1/42 Outline

More information

The Trigram and other Fundamental Philosophies

The Trigram and other Fundamental Philosophies The Trigram and other Fundamental Philosophies by Weimin Kwauk July 2012 The following offers a minimal introduction to the trigram and other Chinese fundamental philosophies. A trigram consists of three

More information

Building a Multifunctional Aptamer-Based DNA Nanoassembly for Targeted Cancer Therapy

Building a Multifunctional Aptamer-Based DNA Nanoassembly for Targeted Cancer Therapy Supporting Information Building a Multifunctional Aptamer-Based DNA Nanoassembly for Targeted Cancer Therapy Cuichen Wu,, Da Han,, Tao Chen,, Lu Peng, Guizhi Zhu,, Mingxu You,, Liping Qiu,, Kwame Sefah,

More information

Using an Artificial Regulatory Network to Investigate Neural Computation

Using an Artificial Regulatory Network to Investigate Neural Computation Using an Artificial Regulatory Network to Investigate Neural Computation W. Garrett Mitchener College of Charleston January 6, 25 W. Garrett Mitchener (C of C) UM January 6, 25 / 4 Evolution and Computing

More information

Supplementary Information

Supplementary Information Electronic Supplementary Material (ESI) for RSC Advances. This journal is The Royal Society of Chemistry 2014 Directed self-assembly of genomic sequences into monomeric and polymeric branched DNA structures

More information

Supporting Information

Supporting Information Supporting Information T. Pellegrino 1,2,3,#, R. A. Sperling 1,#, A. P. Alivisatos 2, W. J. Parak 1,2,* 1 Center for Nanoscience, Ludwig Maximilians Universität München, München, Germany 2 Department of

More information

Lecture 15: Realities of Genome Assembly Protein Sequencing

Lecture 15: Realities of Genome Assembly Protein Sequencing Lecture 15: Realities of Genome Assembly Protein Sequencing Study Chapter 8.10-8.15 1 Euler s Theorems A graph is balanced if for every vertex the number of incoming edges equals to the number of outgoing

More information

Regulatory Sequence Analysis. Sequence models (Bernoulli and Markov models)

Regulatory Sequence Analysis. Sequence models (Bernoulli and Markov models) Regulatory Sequence Analysis Sequence models (Bernoulli and Markov models) 1 Why do we need random models? Any pattern discovery relies on an underlying model to estimate the random expectation. This model

More information

Translation. A ribosome, mrna, and trna.

Translation. A ribosome, mrna, and trna. Translation The basic processes of translation are conserved among prokaryotes and eukaryotes. Prokaryotic Translation A ribosome, mrna, and trna. In the initiation of translation in prokaryotes, the Shine-Dalgarno

More information

6.047 / Computational Biology: Genomes, Networks, Evolution Fall 2008

6.047 / Computational Biology: Genomes, Networks, Evolution Fall 2008 MIT OpenCourseWare http://ocw.mit.edu 6.047 / 6.878 Computational Biology: Genomes, Networks, Evolution Fall 2008 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms.

More information

Table S1. Primers and PCR conditions used in this paper Primers Sequence (5 3 ) Thermal conditions Reference Rhizobacteria 27F 1492R

Table S1. Primers and PCR conditions used in this paper Primers Sequence (5 3 ) Thermal conditions Reference Rhizobacteria 27F 1492R Table S1. Primers and PCR conditions used in this paper Primers Sequence (5 3 ) Thermal conditions Reference Rhizobacteria 27F 1492R AAC MGG ATT AGA TAC CCK G GGY TAC CTT GTT ACG ACT T Detection of Candidatus

More information

160, and 220 bases, respectively, shorter than pbr322/hag93. (data not shown). The DNA sequence of approximately 100 bases of each

160, and 220 bases, respectively, shorter than pbr322/hag93. (data not shown). The DNA sequence of approximately 100 bases of each JOURNAL OF BACTEROLOGY, JUlY 1988, p. 3305-3309 0021-9193/88/073305-05$02.00/0 Copyright 1988, American Society for Microbiology Vol. 170, No. 7 Construction of a Minimum-Size Functional Flagellin of Escherichia

More information

evoglow - express N kit Cat. No.: product information broad host range vectors - gram negative bacteria

evoglow - express N kit Cat. No.: product information broad host range vectors - gram negative bacteria evoglow - express N kit broad host range vectors - gram negative bacteria product information Cat. No.: 2.1.020 evocatal GmbH 2 Content: Product Overview... 4 evoglow express N kit... 4 The evoglow Fluorescent

More information

Modelling and Analysis in Bioinformatics. Lecture 1: Genomic k-mer Statistics

Modelling and Analysis in Bioinformatics. Lecture 1: Genomic k-mer Statistics 582746 Modelling and Analysis in Bioinformatics Lecture 1: Genomic k-mer Statistics Juha Kärkkäinen 06.09.2016 Outline Course introduction Genomic k-mers 1-Mers 2-Mers 3-Mers k-mers for Larger k Outline

More information

evoglow - express N kit distributed by Cat.#: FP product information broad host range vectors - gram negative bacteria

evoglow - express N kit distributed by Cat.#: FP product information broad host range vectors - gram negative bacteria evoglow - express N kit broad host range vectors - gram negative bacteria product information distributed by Cat.#: FP-21020 Content: Product Overview... 3 evoglow express N -kit... 3 The evoglow -Fluorescent

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION DOI:.8/NCHEM. Conditionally Fluorescent Molecular Probes for Detecting Single Base Changes in Double-stranded DNA Sherry Xi Chen, David Yu Zhang, Georg Seelig. Analytic framework and probe design.. Design

More information

TM1 TM2 TM3 TM4 TM5 TM6 TM bp

TM1 TM2 TM3 TM4 TM5 TM6 TM bp a 467 bp 1 482 2 93 3 321 4 7 281 6 21 7 66 8 176 19 12 13 212 113 16 8 b ATG TCA GGA CAT GTA ATG GAG GAA TGT GTA GTT CAC GGT ACG TTA GCG GCA GTA TTG CGT TTA ATG GGC GTA GTG M S G H V M E E C V V H G T

More information

Supplemental Table 1. Primers used for cloning and PCR amplification in this study

Supplemental Table 1. Primers used for cloning and PCR amplification in this study Supplemental Table 1. Primers used for cloning and PCR amplification in this study Target Gene Primer sequence NATA1 (At2g393) forward GGG GAC AAG TTT GTA CAA AAA AGC AGG CTT CAT GGC GCC TCC AAC CGC AGC

More information

ydci GTC TGT TTG AAC GCG GGC GAC TGG GCG CGC AAT TAA CGG TGT GTA GGC TGG AGC TGC TTC

ydci GTC TGT TTG AAC GCG GGC GAC TGG GCG CGC AAT TAA CGG TGT GTA GGC TGG AGC TGC TTC Table S1. DNA primers used in this study. Name ydci P1ydcIkd3 Sequence GTC TGT TTG AAC GCG GGC GAC TGG GCG CGC AAT TAA CGG TGT GTA GGC TGG AGC TGC TTC Kd3ydcIp2 lacz fusion YdcIendP1 YdcItrgP2 GAC AGC

More information

The role of the FliD C-terminal domain in pentamer formation and

The role of the FliD C-terminal domain in pentamer formation and The role of the FliD C-terminal domain in pentamer formation and interaction with FliT Hee Jung Kim 1,2,*, Woongjae Yoo 3,*, Kyeong Sik Jin 4, Sangryeol Ryu 3,5 & Hyung Ho Lee 1, 1 Department of Chemistry,

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION SUPPLEMENTARY INFORMATION DOI:.38/NCHEM.246 Optimizing the specificity of nucleic acid hyridization David Yu Zhang, Sherry Xi Chen, and Peng Yin. Analytic framework and proe design 3.. Concentration-adjusted

More information

Objective: You will be able to justify the claim that organisms share many conserved core processes and features.

Objective: You will be able to justify the claim that organisms share many conserved core processes and features. Objective: You will be able to justify the claim that organisms share many conserved core processes and features. Do Now: Read Enduring Understanding B Essential knowledge: Organisms share many conserved

More information

part 3: analysis of natural selection pressure

part 3: analysis of natural selection pressure part 3: analysis of natural selection pressure markov models are good phenomenological codon models do have many benefits: o principled framework for statistical inference o avoiding ad hoc corrections

More information

Evolvable Neural Networks for Time Series Prediction with Adaptive Learning Interval

Evolvable Neural Networks for Time Series Prediction with Adaptive Learning Interval Evolvable Neural Networs for Time Series Prediction with Adaptive Learning Interval Dong-Woo Lee *, Seong G. Kong *, and Kwee-Bo Sim ** *Department of Electrical and Computer Engineering, The University

More information

Introduction to Molecular Phylogeny

Introduction to Molecular Phylogeny Introduction to Molecular Phylogeny Starting point: a set of homologous, aligned DNA or protein sequences Result of the process: a tree describing evolutionary relationships between studied sequences =

More information

Why do more divergent sequences produce smaller nonsynonymous/synonymous

Why do more divergent sequences produce smaller nonsynonymous/synonymous Genetics: Early Online, published on June 21, 2013 as 10.1534/genetics.113.152025 Why do more divergent sequences produce smaller nonsynonymous/synonymous rate ratios in pairwise sequence comparisons?

More information

Aoife McLysaght Dept. of Genetics Trinity College Dublin

Aoife McLysaght Dept. of Genetics Trinity College Dublin Aoife McLysaght Dept. of Genetics Trinity College Dublin Evolution of genome arrangement Evolution of genome content. Evolution of genome arrangement Gene order changes Inversions, translocations Evolution

More information

Supplementary Information

Supplementary Information Supplementary Information Arginine-rhamnosylation as new strategy to activate translation elongation factor P Jürgen Lassak 1,2,*, Eva Keilhauer 3, Max Fürst 1,2, Kristin Wuichet 4, Julia Gödeke 5, Agata

More information

Diversity of Chlamydia trachomatis Major Outer Membrane

Diversity of Chlamydia trachomatis Major Outer Membrane JOURNAL OF ACTERIOLOGY, Sept. 1987, p. 3879-3885 Vol. 169, No. 9 0021-9193/87/093879-07$02.00/0 Copyright 1987, American Society for Microbiology Diversity of Chlamydia trachomatis Major Outer Membrane

More information

Re- engineering cellular physiology by rewiring high- level global regulatory genes

Re- engineering cellular physiology by rewiring high- level global regulatory genes Re- engineering cellular physiology by rewiring high- level global regulatory genes Stephen Fitzgerald 1,2,, Shane C Dillon 1, Tzu- Chiao Chao 2, Heather L Wiencko 3, Karsten Hokamp 3, Andrew DS Cameron

More information

THE MATHEMATICAL STRUCTURE OF THE GENETIC CODE: A TOOL FOR INQUIRING ON THE ORIGIN OF LIFE

THE MATHEMATICAL STRUCTURE OF THE GENETIC CODE: A TOOL FOR INQUIRING ON THE ORIGIN OF LIFE STATISTICA, anno LXIX, n. 2 3, 2009 THE MATHEMATICAL STRUCTURE OF THE GENETIC CODE: A TOOL FOR INQUIRING ON THE ORIGIN OF LIFE Diego Luis Gonzalez CNR-IMM, Bologna Section, Via Gobetti 101, I-40129, Bologna,

More information

Identification of a Locus Involved in the Utilization of Iron by Haemophilus influenzae

Identification of a Locus Involved in the Utilization of Iron by Haemophilus influenzae INFECrION AND IMMUNITY, OCt. 1994, p. 4515-4525 0019-9567/94/$04.00+0 Copyright 1994, American Society for Microbiology Vol. 62, No. 10 Identification of a Locus Involved in the Utilization of Iron by

More information

FliZ Is a Posttranslational Activator of FlhD 4 C 2 -Dependent Flagellar Gene Expression

FliZ Is a Posttranslational Activator of FlhD 4 C 2 -Dependent Flagellar Gene Expression JOURNAL OF BACTERIOLOGY, July 2008, p. 4979 4988 Vol. 190, No. 14 0021-9193/08/$08.00 0 doi:10.1128/jb.01996-07 Copyright 2008, American Society for Microbiology. All Rights Reserved. FliZ Is a Posttranslational

More information

Evolutionary Analysis of Viral Genomes

Evolutionary Analysis of Viral Genomes University of Oxford, Department of Zoology Evolutionary Biology Group Department of Zoology University of Oxford South Parks Road Oxford OX1 3PS, U.K. Fax: +44 1865 271249 Evolutionary Analysis of Viral

More information

Properties of amino acids in proteins

Properties of amino acids in proteins Properties of amino acids in proteins one of the primary roles of DNA (but not the only one!) is to code for proteins A typical bacterium builds thousands types of proteins, all from ~20 amino acids repeated

More information

Amino Acids and Peptides

Amino Acids and Peptides Amino Acids Amino Acids and Peptides Amino acid a compound that contains both an amino group and a carboxyl group α-amino acid an amino acid in which the amino group is on the carbon adjacent to the carboxyl

More information

Using Higher Calculus to Study Biologically Important Molecules Julie C. Mitchell

Using Higher Calculus to Study Biologically Important Molecules Julie C. Mitchell Using Higher Calculus to Study Biologically Important Molecules Julie C. Mitchell Mathematics and Biochemistry University of Wisconsin - Madison 0 There Are Many Kinds Of Proteins The word protein comes

More information

Encoding of Amino Acids and Proteins from a Communications and Information Theoretic Perspective

Encoding of Amino Acids and Proteins from a Communications and Information Theoretic Perspective Jacobs University Bremen Encoding of Amino Acids and Proteins from a Communications and Information Theoretic Perspective Semester Project II By: Dawit Nigatu Supervisor: Prof. Dr. Werner Henkel Transmission

More information

Solutions In each case, the chirality center has the R configuration

Solutions In each case, the chirality center has the R configuration CAPTER 25 669 Solutions 25.1. In each case, the chirality center has the R configuration. C C 2 2 C 3 C(C 3 ) 2 D-Alanine D-Valine 25.2. 2 2 S 2 d) 2 25.3. Pro,, Trp, Tyr, and is, Trp, Tyr, and is Arg,

More information

The 3 Genomic Numbers Discovery: How Our Genome Single-Stranded DNA Sequence Is Self-Designed as a Numerical Whole

The 3 Genomic Numbers Discovery: How Our Genome Single-Stranded DNA Sequence Is Self-Designed as a Numerical Whole Applied Mathematics, 2013, 4, 37-53 http://dx.doi.org/10.4236/am.2013.410a2004 Published Online October 2013 (http://www.scirp.org/journal/am) The 3 Genomic Numbers Discovery: How Our Genome Single-Stranded

More information

Sex-Linked Inheritance in Macaque Monkeys: Implications for Effective Population Size and Dispersal to Sulawesi

Sex-Linked Inheritance in Macaque Monkeys: Implications for Effective Population Size and Dispersal to Sulawesi Supporting Information http://www.genetics.org/cgi/content/full/genetics.110.116228/dc1 Sex-Linked Inheritance in Macaque Monkeys: Implications for Effective Population Size and Dispersal to Sulawesi Ben

More information

Pathways and Controls of N 2 O Production in Nitritation Anammox Biomass

Pathways and Controls of N 2 O Production in Nitritation Anammox Biomass Supporting Information for Pathways and Controls of N 2 O Production in Nitritation Anammox Biomass Chun Ma, Marlene Mark Jensen, Barth F. Smets, Bo Thamdrup, Department of Biology, University of Southern

More information

DNA sequence analysis of the imp UV protection and mutation operon of the plasmid TP110: identification of a third gene

DNA sequence analysis of the imp UV protection and mutation operon of the plasmid TP110: identification of a third gene QD) 1990 Oxford University Press Nucleic Acids Research, Vol. 18, No. 17 5045 DNA sequence analysis of the imp UV protection and mutation operon of the plasmid TP110: identification of a third gene David

More information

Evolutionary dynamics of abundant stop codon readthrough in Anopheles and Drosophila

Evolutionary dynamics of abundant stop codon readthrough in Anopheles and Drosophila biorxiv preprint first posted online May. 3, 2016; doi: http://dx.doi.org/10.1101/051557. The copyright holder for this preprint (which was not peer-reviewed) is the author/funder. All rights reserved.

More information

PROTEIN SYNTHESIS INTRO

PROTEIN SYNTHESIS INTRO MR. POMERANTZ Page 1 of 6 Protein synthesis Intro. Use the text book to help properly answer the following questions 1. RNA differs from DNA in that RNA a. is single-stranded. c. contains the nitrogen

More information

Sequence Divergence & The Molecular Clock. Sequence Divergence

Sequence Divergence & The Molecular Clock. Sequence Divergence Sequence Divergence & The Molecular Clock Sequence Divergence v simple genetic distance, d = the proportion of sites that differ between two aligned, homologous sequences v given a constant mutation/substitution

More information

It is the author's version of the article accepted for publication in the journal "Biosystems" on 03/10/2015.

It is the author's version of the article accepted for publication in the journal Biosystems on 03/10/2015. It is the author's version of the article accepted for publication in the journal "Biosystems" on 03/10/2015. The system-resonance approach in modeling genetic structures Sergey V. Petoukhov Institute

More information

373 The Evidence of how DNA and the Scriptures have Identical Numeric Signatures

373 The Evidence of how DNA and the Scriptures have Identical Numeric Signatures 4 373 The Evidence of how DNA and the Scriptures have Identical Numeric Signatures How are Gen 1.1, Deut 6.4 & John 1.1 encoded in the DNA molecule Peter Bluer A note on Neighbourhood values using Prime

More information

evoglow yeast kit distributed by product information Cat.#: FP-21040

evoglow yeast kit distributed by product information Cat.#: FP-21040 evoglow yeast kit product information distributed by Cat.#: FP-21040 Flavin-mononucleotide-based Fluorescent Protein (FbFP) evoglow basic kit Cat.# FP-21010 Quantity 20 µg each General Information Fluorescent

More information

evoglow basic kit product information

evoglow basic kit product information evoglow basic kit product information Cat. No.: 2.1.010 Flavin-mononucleotide-based Fluorescent Protein (FbFP) evoglow basic kit Catalog No. evo-2.1.010 Quantity 20 µg each General Information Fluorescent

More information

Protein structure. Protein structure. Amino acid residue. Cell communication channel. Bioinformatics Methods

Protein structure. Protein structure. Amino acid residue. Cell communication channel. Bioinformatics Methods Cell communication channel Bioinformatics Methods Iosif Vaisman Email: ivaisman@gmu.edu SEQUENCE STRUCTURE DNA Sequence Protein Sequence Protein Structure Protein structure ATGAAATTTGGAAACTTCCTTCTCACTTATCAGCCACCT...

More information

Cloning, Nucleotide Sequencing, and Expression of the Clostridium perfringens Enterotoxin Gene in Eschenichia coli

Cloning, Nucleotide Sequencing, and Expression of the Clostridium perfringens Enterotoxin Gene in Eschenichia coli INFECTION AND IMMUNIrY, Aug. 1993, p. 3429-3439 0019-9567/93/083429-11$02.00/0 Copyright 1993, American Society for Microbiology Vol. 61, No. 8 Cloning, Nucleotide Sequencing, and Expression of the Clostridium

More information

Edinburgh Research Explorer

Edinburgh Research Explorer Edinburgh Research Explorer Codon usage patterns in Escherichia coli, Bacillus subtilis, Saccharomyces cerevisiae, Schizosaccharomyces pombe, Drosophila melanogaster and Homo sapiens; a review of the considerable

More information

ChemiScreen CaS Calcium Sensor Receptor Stable Cell Line

ChemiScreen CaS Calcium Sensor Receptor Stable Cell Line PRODUCT DATASHEET ChemiScreen CaS Calcium Sensor Receptor Stable Cell Line CATALOG NUMBER: HTS137C CONTENTS: 2 vials of mycoplasma-free cells, 1 ml per vial. STORAGE: Vials are to be stored in liquid N

More information

Translation - Prokaryotes

Translation - Prokaryotes 1 Translation - Prokaryotes Shine-Dalgarno (SD) Sequence rrna 3 -GAUACCAUCCUCCUUA-5 mrna...ggagg..(5-7bp)...aug Influences: Secondary structure!! SD and AUG in unstructured region Start AUG 91% GUG 8 UUG

More information

Supplemental Figure 1. Phenotype of ProRGA:RGAd17 plants under long day

Supplemental Figure 1. Phenotype of ProRGA:RGAd17 plants under long day Supplemental Figure 1. Phenotype of ProRGA:RGAd17 plants under long day conditions. Photo was taken when the wild type plant started to bolt. Scale bar represents 1 cm. Supplemental Figure 2. Flowering

More information

Bacillus subtilis Succinate Dehydrogenase Complex

Bacillus subtilis Succinate Dehydrogenase Complex JOURNAL OF BACTERIOLOGY, June 1986, p. 1067-1071 0021-9193/86/061067-05$02.00/0 Copyright 1986, American Society for Microbiology Vol. 166, No. 3 Nucleotide Sequence of the Gene for Cytochrome b558 of

More information

Chemistry Chapter 22

Chemistry Chapter 22 hemistry 2100 hapter 22 Proteins Proteins serve many functions, including the following. 1. Structure: ollagen and keratin are the chief constituents of skin, bone, hair, and nails. 2. atalysts: Virtually

More information

Genetic code on the dyadic plane

Genetic code on the dyadic plane Genetic code on the dyadic plane arxiv:q-bio/0701007v3 [q-bio.qm] 2 Nov 2007 A.Yu.Khrennikov, S.V.Kozyrev June 18, 2018 Abstract We introduce the simple parametrization for the space of codons (triples

More information

Viewing and Analyzing Proteins, Ligands and their Complexes 2

Viewing and Analyzing Proteins, Ligands and their Complexes 2 2 Viewing and Analyzing Proteins, Ligands and their Complexes 2 Overview Viewing the accessible surface Analyzing the properties of proteins containing thousands of atoms is best accomplished by representing

More information

Timing molecular motion and production with a synthetic transcriptional clock

Timing molecular motion and production with a synthetic transcriptional clock Timing molecular motion and production with a synthetic transcriptional clock Elisa Franco,1, Eike Friedrichs 2, Jongmin Kim 3, Ralf Jungmann 2, Richard Murray 1, Erik Winfree 3,4,5, and Friedrich C. Simmel

More information

Near-instant surface-selective fluorogenic protein quantification using sulfonated

Near-instant surface-selective fluorogenic protein quantification using sulfonated Electronic Supplementary Material (ESI) for rganic & Biomolecular Chemistry. This journal is The Royal Society of Chemistry 2014 Supplemental nline Materials for ear-instant surface-selective fluorogenic

More information

From DNA to protein, i.e. the central dogma

From DNA to protein, i.e. the central dogma From DNA to protein, i.e. the central dogma DNA RNA Protein Biochemistry, chapters1 5 and Chapters 29 31. Chapters 2 5 and 29 31 will be covered more in detail in other lectures. ph, chapter 1, will be

More information

part 4: phenomenological load and biological inference. phenomenological load review types of models. Gαβ = 8π Tαβ. Newton.

part 4: phenomenological load and biological inference. phenomenological load review types of models. Gαβ = 8π Tαβ. Newton. 2017-07-29 part 4: and biological inference review types of models phenomenological Newton F= Gm1m2 r2 mechanistic Einstein Gαβ = 8π Tαβ 1 molecular evolution is process and pattern process pattern MutSel

More information

Exam III. Please read through each question carefully, and make sure you provide all of the requested information.

Exam III. Please read through each question carefully, and make sure you provide all of the requested information. 09-107 onors Chemistry ame Exam III Please read through each question carefully, and make sure you provide all of the requested information. 1. A series of octahedral metal compounds are made from 1 mol

More information

Slide 1 / 54. Gene Expression in Eukaryotic cells

Slide 1 / 54. Gene Expression in Eukaryotic cells Slide 1 / 54 Gene Expression in Eukaryotic cells Slide 2 / 54 Central Dogma DNA is the the genetic material of the eukaryotic cell. Watson & Crick worked out the structure of DNA as a double helix. According

More information

7.05 Spring 2004 February 27, Recitation #2

7.05 Spring 2004 February 27, Recitation #2 Recitation #2 Contact Information TA: Victor Sai Recitation: Friday, 3-4pm, 2-132 E-mail: sai@mit.edu ffice ours: Friday, 4-5pm, 2-132 Unit 1 Schedule Recitation/Exam Date Lectures covered Recitation #2

More information

Chapter 16 Lecture. Concepts Of Genetics. Tenth Edition. Regulation of Gene Expression in Prokaryotes

Chapter 16 Lecture. Concepts Of Genetics. Tenth Edition. Regulation of Gene Expression in Prokaryotes Chapter 16 Lecture Concepts Of Genetics Tenth Edition Regulation of Gene Expression in Prokaryotes Chapter Contents 16.1 Prokaryotes Regulate Gene Expression in Response to Environmental Conditions 16.2

More information

ANALYZING THE DIVERSITY OF A SMALL ANTIBODY MIMIC LIBRARY. Nick Empey. Chapel Hill 2010

ANALYZING THE DIVERSITY OF A SMALL ANTIBODY MIMIC LIBRARY. Nick Empey. Chapel Hill 2010 ANALYZING THE DIVERSITY OF A SMALL ANTIBODY MIMIC LIBRARY Nick Empey A thesis submitted to the faculty of the University of North Carolina at Chapel Hill in partial fulfillment of the requirements for

More information

Biology 112 Practice Midterm Questions

Biology 112 Practice Midterm Questions Biology 112 Practice Midterm Questions 1. Identify which statement is true or false I. Bacterial cell walls prevent osmotic lysis II. All bacterial cell walls contain an LPS layer III. In a Gram stain,

More information

AtTIL-P91V. AtTIL-P92V. AtTIL-P95V. AtTIL-P98V YFP-HPR

AtTIL-P91V. AtTIL-P92V. AtTIL-P95V. AtTIL-P98V YFP-HPR Online Resource 1. Primers used to generate constructs AtTIL-P91V, AtTIL-P92V, AtTIL-P95V and AtTIL-P98V and YFP(HPR) using overlapping PCR. pentr/d- TOPO-AtTIL was used as template to generate the constructs

More information

UNIT TWELVE. a, I _,o "' I I I. I I.P. l'o. H-c-c. I ~o I ~ I / H HI oh H...- I II I II 'oh. HO\HO~ I "-oh

UNIT TWELVE. a, I _,o ' I I I. I I.P. l'o. H-c-c. I ~o I ~ I / H HI oh H...- I II I II 'oh. HO\HO~ I -oh UNT TWELVE PROTENS : PEPTDE BONDNG AND POLYPEPTDES 12 CONCEPTS Many proteins are important in biological structure-for example, the keratin of hair, collagen of skin and leather, and fibroin of silk. Other

More information

Supporting Information. An Electric Single-Molecule Hybridisation Detector for short DNA Fragments

Supporting Information. An Electric Single-Molecule Hybridisation Detector for short DNA Fragments Supporting Information An Electric Single-Molecule Hybridisation Detector for short DNA Fragments A.Y.Y. Loh, 1 C.H. Burgess, 2 D.A. Tanase, 1 G. Ferrari, 3 M.A. Maclachlan, 2 A.E.G. Cass, 1 T. Albrecht*

More information

The effects of leader peptide sequence and length on attenuation control of the trp operon of E.coli

The effects of leader peptide sequence and length on attenuation control of the trp operon of E.coli Nucleic Acids Research, Vol. 19, No. 4 795 The effects of leader peptide sequence and length on attenuation control of the trp operon of E.coli James R.Roesser + and Charles Yanofsky* Department of Biological

More information

Chain-like assembly of gold nanoparticles on artificial DNA templates via Click Chemistry

Chain-like assembly of gold nanoparticles on artificial DNA templates via Click Chemistry Electronic Supporting Information: Chain-like assembly of gold nanoparticles on artificial DNA templates via Click Chemistry Monika Fischler, Alla Sologubenko, Joachim Mayer, Guido Clever, Glenn Burley,

More information

Insects act as vectors for a number of important diseases of

Insects act as vectors for a number of important diseases of pubs.acs.org/synthbio Novel Synthetic Medea Selfish Genetic Elements Drive Population Replacement in Drosophila; a Theoretical Exploration of Medea- Dependent Population Suppression Omar S. Abari,,# Chun-Hong

More information

Molecular Biology, Genetic Engineering & Biotechnology Operons ???

Molecular Biology, Genetic Engineering & Biotechnology Operons ??? 1 Description of Module Subject Name?? Paper Name Module Name/Title XV- 04: 2 OPERONS OBJECTIVES To understand how gene is expressed and regulated in prokaryotic cell To understand the regulation of Lactose

More information

Biochemistry Quiz Review 1I. 1. Of the 20 standard amino acids, only is not optically active. The reason is that its side chain.

Biochemistry Quiz Review 1I. 1. Of the 20 standard amino acids, only is not optically active. The reason is that its side chain. Biochemistry Quiz Review 1I A general note: Short answer questions are just that, short. Writing a paragraph filled with every term you can remember from class won t improve your answer just answer clearly,

More information