Bioinformatics Study On Mirror DNA In Mycobacterium Tuberculosis Using Fast Parallel Complement Blast Strategy.

Similar documents
Research & Reviews: Journal of Microbiology and Biotechnology

GCD3033:Cell Biology. Transcription

Supplemental Materials

Genomics and bioinformatics summary. Finding genes -- computer searches

Comparative genomics: Overview & Tools + MUMmer algorithm

Flow of Genetic Information

Introduction to Molecular and Cell Biology

2012 Univ Aguilera Lecture. Introduction to Molecular and Cell Biology

BME 5742 Biosystems Modeling and Control

Genome Annotation. Bioinformatics and Computational Biology. Genome sequencing Assembly. Gene prediction. Protein targeting.

This document describes the process by which operons are predicted for genes within the BioHealthBase database.

What is the central dogma of biology?

Bio 119 Bacterial Genomics 6/26/10

(Lys), resulting in translation of a polypeptide without the Lys amino acid. resulting in translation of a polypeptide without the Lys amino acid.

Sequence analysis and comparison

UNIT 5. Protein Synthesis 11/22/16

Translation and Operons

Introduction to Microbiology BIOL 220 Summer Session I, 1996 Exam # 1

Biology I Fall Semester Exam Review 2014

Chapters 12&13 Notes: DNA, RNA & Protein Synthesis

Number of questions TEK (Learning Target) Biomolecules & Enzymes

Giri Narasimhan. CAP 5510: Introduction to Bioinformatics CGS 5166: Bioinformatics Tools. Evaluation. Course Homepage.

Biology 105/Summer Bacterial Genetics 8/12/ Bacterial Genomes p Gene Transfer Mechanisms in Bacteria p.

BIOINFORMATICS LAB AP BIOLOGY

Algorithms in Bioinformatics FOUR Pairwise Sequence Alignment. Pairwise Sequence Alignment. Convention: DNA Sequences 5. Sequence Alignment

Bioinformatics for Biologists

Is inversion symmetry of chromosomes a law of nature?

- conserved in Eukaryotes. - proteins in the cluster have identifiable conserved domains. - human gene should be included in the cluster.

VCE BIOLOGY Relationship between the key knowledge and key skills of the Study Design and the Study Design

The nature of genomes. Viral genomes. Prokaryotic genome. Nonliving particle. DNA or RNA. Compact genomes with little spacer DNA

2. Cellular and Molecular Biology

Bioinformatics Exercises

Introduction to molecular biology. Mitesh Shrestha

Review. Membrane proteins. Membrane transport

SUPPLEMENTARY INFORMATION

Multiple Choice Review- Eukaryotic Gene Expression

08/21/2017 BLAST. Multiple Sequence Alignments: Clustal Omega

Microbiology / Active Lecture Questions Chapter 10 Classification of Microorganisms 1 Chapter 10 Classification of Microorganisms

2 Genome evolution: gene fusion versus gene fission

METHODS FOR DETERMINING PHYLOGENY. In Chapter 11, we discovered that classifying organisms into groups was, and still is, a difficult task.

GSBHSRSBRSRRk IZTI/^Q. LlML. I Iv^O IV I I I FROM GENES TO GENOMES ^^^H*" ^^^^J*^ ill! BQPIP. illt. goidbkc. itip31. li4»twlil FIFTH EDITION

Objective 3.01 (DNA, RNA and Protein Synthesis)

Grundlagen der Bioinformatik Summer semester Lecturer: Prof. Daniel Huson

15.2 Prokaryotic Transcription *

Microbial Genetics, Mutation and Repair. 2. State the function of Rec A proteins in homologous genetic recombination.

Genomes and Their Evolution

Biology 112 Practice Midterm Questions

2015 FALL FINAL REVIEW

CELL BIOLOGY, BIOINFORMATICS AND SYSTEMS BIOLOGY BIO160 (GU); UMF012 (Chalmers) schedule 2010

Bioinformatics Chapter 1. Introduction

DNA Technology, Bacteria, Virus and Meiosis Test REVIEW

A DNA Sequence 2017/12/6 1

Chapter 15 Active Reading Guide Regulation of Gene Expression

DNA, Chromosomes, and Genes

Computational Biology: Basics & Interesting Problems

GENETICS - CLUTCH CH.1 INTRODUCTION TO GENETICS.

Comparing whole genomes

2. What was the Avery-MacLeod-McCarty experiment and why was it significant? 3. What was the Hershey-Chase experiment and why was it significant?

CS612 - Algorithms in Bioinformatics

Ti plasmid derived plant vector systems: binary and co - integrative vectors transformation process; regeneration of the transformed lines

Cell Division: the process of copying and dividing entire cells The cell grows, prepares for division, and then divides to form new daughter cells.

Variation of Traits. genetic variation: the measure of the differences among individuals within a population

Table 5. Genes unique to G. thermodenitrificans NG80-2 Gene ID Gene name Gene product COG functional category

Hands-On Nine The PAX6 Gene and Protein

Newly made RNA is called primary transcript and is modified in three ways before leaving the nucleus:

From Gene to Protein

Reading Assignments. A. Genes and the Synthesis of Polypeptides. Lecture Series 7 From DNA to Protein: Genotype to Phenotype

Sequences, Structures, and Gene Regulatory Networks

Biology Notes 2. Mitosis vs Meiosis

Introduction to Bioinformatics Integrated Science, 11/9/05

Bacterial Genetics & Operons

SPRINGFIELD TECHNICAL COMMUNITY COLLEGE ACADEMIC AFFAIRS

Plant and animal cells (eukaryotic cells) have a cell membrane, cytoplasm and genetic material enclosed in a nucleus.

Visit to BPRC. Data is crucial! Case study: Evolution of AIRE protein 6/7/13

Algorithms in Computational Biology (236522) spring 2008 Lecture #1

Motifs and Logos. Six Introduction to Bioinformatics. Importance and Abundance of Motifs. Getting the CDS. From DNA to Protein 6.1.

Topic 1 - The building blocks of. cells! Name:!

Name: Date: Hour: Unit Four: Cell Cycle, Mitosis and Meiosis. Monomer Polymer Example Drawing Function in a cell DNA

Introduction to Bioinformatics

Cell Division. OpenStax College. 1 Genomic DNA

Bioinformatics. Dept. of Computational Biology & Bioinformatics

Chapter 1. DNA is made from the building blocks adenine, guanine, cytosine, and. Answer: d

Organization of Genes Differs in Prokaryotic and Eukaryotic DNA Chapter 10 p

A genomic insight into evolution and virulence of Corynebacterium diphtheriae

COMPARING DNA SEQUENCES TO UNDERSTAND EVOLUTIONARY RELATIONSHIPS WITH BLAST

Big Idea 3: Living systems store, retrieve, transmit, and respond to information essential to life processes.

GO ID GO term Number of members GO: translation 225 GO: nucleosome 50 GO: calcium ion binding 76 GO: structural

SUPPLEMENTARY INFORMATION

MCB 110. "Molecular Biology: Macromolecular Synthesis and Cellular Function" Spring, 2018

Cellular Organization

Computational Cell Biology Lecture 4

Related Courses He who asks is a fool for five minutes, but he who does not ask remains a fool forever.

Introduction to Microbiology. CLS 212: Medical Microbiology Miss Zeina Alkudmani

Tiffany Samaroo MB&B 452a December 8, Take Home Final. Topic 1

Computational Molecular Biology (

Degeneracy. Two types of degeneracy:

GENE ACTIVITY Gene structure Transcription Transcript processing mrna transport mrna stability Translation Posttranslational modifications

INTERACTIVE CLUSTERING FOR EXPLORATION OF GENOMIC DATA

What Organelle Makes Proteins According To The Instructions Given By Dna

Cellular Neuroanatomy I The Prototypical Neuron: Soma. Reading: BCP Chapter 2

Transcription:

INTERNATIONAL JOURNAL OF SCIENTIFIC & TECHNOLOGY RESEARCH VOLUME 4, ISSUE 2, DECEMBER 205 ISSN 2277-866 Bioinformatics Study On Mirror DNA In Mycobacterium Tuberculosis Using Fast Parallel Complement Blast Strategy. Munzali S. Zubair, Musa Hassan.Muhammad, R.G. Ahmad, Umar A.A Abstract: Many softwares that have been used for identification DNA mirror repeats were not simple and effective for finding mirror repeats, so a new method for the identification of mirror repeats within the gene sequences in both eukaryotic and prokaryotic organisms has been developed and now is available. The method is called FAST PARALLEL COMPLEMENT BLAST which is the easiest method for observing mirror repeats in bacterial genome and other eukaryotic genome. This simple method involves only three steps: Downloading sequence of mycobacterium tuberculosis gene using NCBI in FASTA format, Making a Parallel complement, and searching for mirror repeats using BLAST which shows the homology between the original sequence and the parallel complement sequence. We explored twenty genes (20) of Mycobacterium tuberculosis H37Rv for DNA mirror repeats and the result shows that mirror repeats of 8 genes were successfully found. Furthermore, nobody knows the exact role of mirror repeats yet and detailed mechanisms and functions of most repeats are still unknown. Keywords: Fast Parallel Complement Blast, Bioinformatics, DNA, mirror repeats, mycobacterium tuberculosis H37Rv, and Sequences. INTRODUCTION A DNA mirror repeats can be identified when a sequence shares homology with the other part of the sequence within a gene or genome. A DNA mirror repeat is a sequence segment delimited on the basis of its containing a center of symmetry on a single strand and identical terminal nucleotides (Dorothy M. Lang, 2007). This implies that one part of the sequence is a mirror image of the other sequence. For example topa DNA mirror repeat 5 CAGCGGCCGGCGAC3, the sequence in red is a mirror image to sequence in black sequence. Most different type of repeats are found in bacterial genomes such as direct repeats, inverted repeats, complement repeats, and reverse complement (palindromic) repeats. Direct repeats are two or more repeating DNA fragments appeared in the same orientation and on the same strand. For example, a sequence direct repeats fragmenttggaa and another sequence fragment ACCTT and are direct to each other. Inverted repeats are two or more repeating DNA fragments appeared in inverted orientation and on the same strand. For example, a sequence fragment GGAATCGATCTT and another sequence fragment AAGATCGATTCC are reverse repeats. Palindromic repeats are also called Complement repeats are two or more repeating DNA fragments looked in the same orientation but on the complement strand, i.e. in complement orientation on the same strand. For example, a sequence fragment CAGCGGC and another sequence fragmentgtcgccg are reverse repeats. Reverse complement repeats are two or more repeating DNA fragments appeared in inverted orientation but on the complement strand, i.e. in reverse complement orientation on the same strand. For example a sequence fragmentatggc and another sequence fragment GCCAT are reverse complement repeats (Yo-Cheng Chang,203). 5 TGGAATGGAA 3 direct repeats 3 ACCTTACCTT5 5 GGAATCGATCTTAAGATCGATTCC 3 inverted repeats 3 CCTTAGCTAGAATTCTAGCTAAGG 5 5 CAGCGGCCGGCGAC3 DNA mirror repeats 5 CAGCGGCCGGCGAC 3 The aim of this work is to find the mirror repeats in the Mycobacterium Tuberculosis H37Rv genome. Methodology Sequences The gene sequences or coding sequences were downloaded in FASTA format from NCBI link: http://www.ncbi.nlm.nih.gov Munzali S. Zubair, Musa Hassan.Muhammad, R.G. Ahmad, Umar A.A Phone Number:+234706882387 E-Mail: musachakla@yahoo.com Affiliation: Federal University Dutse, Department of Biological Science IJSTR 205 300

INTERNATIONAL JOURNAL OF SCIENTIFIC & TECHNOLOGY RESEARCH VOLUME 4, ISSUE 2, DECEMBER 205 ISSN 2277-866 Figure : shows home page Parallel Complement Sequence Parallel complement sequences of the twenty (20) were downloaded from reverse Complement program (http://www.bioinformatics.org/sms/rev_comp.html) Making Mirror Repeats Mirror repeats of the downloaded sequences were found using BLAST _N. In which both FASTA format of Nucleotide sequence and its parallel complement were pasted and alignedfor BLAST homology search using BLAST tool: (http://blast.ncbi.nlm.nih.gov/blast.cgi?page=megablast&p ROGRAM=blastn&BLAST_PROGRAMS=megaBlast&PAG E_TYPE=BlastSearch&BLAST_SPEC=blast2seq&QUERY =&SUBJECTS=) Figure 2: shows NCBI BLAST IJSTR 205 30

INTERNATIONAL JOURNAL OF SCIENTIFIC & TECHNOLOGY RESEARCH VOLUME 4, ISSUE 2, DECEMBER 205 ISSN 2277-866 >Mirror repeats analysis: It is mirror repeats when the numbers (equal) on subject and query reverse in opposite direction but it is not mirror repeat if the numbers on subject and query are different. Find the gene sequence from NCBI in FASTA format Make the parallel complement by putting the original sequence at reverse complement tool http://www.bioinformatics.org/ ms/rev comp.html Align the original gene sequence with parallel complement sequence using blast tool http://blast.ncbi.nlm.nih.gov/blastalign.cgi The mirror repeats can be identified if the number of nucleotide is reversed in both subject and query. Example Query 529 GCCGGGGGCCG 539 Sbjct 539 GCCGGGGGCCG 529 The figure 3: Depicts the simple strategy for the identification of mirror repeat (Pattabiraman, 204) S/N s name ID DNA topoisomerase I topa 885608 2 bacterioferritin bfrb 88676 3 iron-regulated lsr2 885580 4 phosphoserine/ threonine phosphatase PstP 887070 5 isocitrate lyase icl 88629 6 membrane protein MmpS5 888233 7 chromosomal replication initiator protein dnaa 88504 8 acyl-coa reductase acra 887950 9 recn DNA repair protein 885805 0 bacterioferritin bfra Rv876 885767 transmembrane transport protein mmpl5 88829 2 isochorismate synthase entc 888824 3 katg catalase-peroxidase 885638 4 lipoprotein LprG 886700 5 beta-glucosidase BglS 886780 6 rpsl 30S ribosomal protein S2 888259 7 fabg 3-oxoacyl-ACP reductase FabG 88655 8 DNA replication and repair protein recf 887089 9 inha NADH-dependent enoyl-[acp] reductase 886523 20 bacterioferritin bfra 885767 Table : Various types of mirror repeats observed in Mycobacterium tuberculosis H37Rv using FPCB strategy. Result and Discussion This study applies bioinformatic methods to the identification and function of mirror repeats in the M. tuberculosis H37Rv genomes. Mycobacterium tuberculosis, the causative agent of tuberculosis, is one of the most successful and deadly pathogens that kills millions of people. Mycobacterium tuberculosis, H37Rv strain was analyzed in order to improve our understanding of the biology of Mycobacterium tuberculosis in causing the disease and to help develop of new therapeutic interventions. Using computer applications, mirror repeats were found in mycobacterium tuberculosis H37Rv strain from 8 different genes out of 20 genes. It is known that a single gene may perform different function and a gene may be found in multiple species which may perform either the same or different function. Most of the genes (8 out of 20 genes) analyzed using FPCB strategy in this study were found to have the presence of the mirror repeats. It will be a high significant to find the role of mirror DNA at molecular level in respect to transcription and translation process occurring in a cell. Up till now the exact function of mirror repeats has not been elucidated. Future studies may be required to determine the exact role of mirror repeats. IJSTR 205 302

INTERNATIONAL JOURNAL OF SCIENTIFIC & TECHNOLOGY RESEARCH VOLUME 4, ISSUE 2, DECEMBER 205 ISSN 2277-866 MIRROR REPEATS OF MYCOBACTERIUM TUBERCULOSIS Sl/no. /cds Sequence reference No. of mirror repeats Mirror repeats CAGCGGCCGGCGAC CGCGCCGCCGCCGAGC TGGAAGAAGGT CGCCGCGCCGC DNA topoisomeras e I topa 885608 CGCGCCGGCGCCGCGC ACCCGCAAGGTGAAGAACGCCCA GTCGGCGGCGAGGCTGACGATGCCGAGCGGCGACTG GGCAAGAACGG CTCAGCGACTC GGAGTGGACCCCGCCTCGGGTGAGG CCGGCAAAACGGCC 2 bacterioferritin bfrb 88676 GCCGGGGGCCG 3 iron-regulated lsr2 885580 GCCGGCGCGGCCG 4 phosphoserin e/ threonine phosphatase PstP 887070 CCGGCTCGGCC 5 isocitrate lyase icl 88629 2 GGGCGTTGCGGG TGCCACCACCGT 6 membrane protein 888233 GAACTCTCAAG IJSTR 205 303

INTERNATIONAL JOURNAL OF SCIENTIFIC & TECHNOLOGY RESEARCH VOLUME 4, ISSUE 2, DECEMBER 205 ISSN 2277-866 MmpS5 GGCCAGTGGGGTGACCGG 7 acyl-coa reductase acra 887950 5 CGCAACGGCAACGC GCCGGCGCGGCCG CGGCGCCGCGGC 8 9 chromosomal replication initiator protein dnaa stearoyl-coa 9-desaturas desa3 88504 2 88882 GGTGGTGGTGG CTCAGCCGCCGACTC CTTCCACACCTTC GTGGACAGGTG ACGTGGTGCA 0 recn DNA repair protein 885805 5 CCGCGGCGCC GCTCAACTCG GTCGGCGGCTG transmembran e transport protein mmpl5 88829 CGGGCGGCGGCTGGC 2 GGCCAGGACCGG CTGGGCTCGGGTC GACCGGGGCCAG CTGGCCCGGTC 2 betaglucosidase BglS 886780 7 TCAACGTGCAACT ATCGGCGGCTA GGCGGAGGCGG GGTGGCGGTGG 3 isochorismate synthase entc 888824 2 GCCAAGAACCG CGACGCGCAGC 4 katg catalaseperoxidase 885638 3 GGCCGCGGCGGCGCCGG CAAGCGCGGCGGCGCCAAC 5 fabg 3- oxoacyl-acp reductase FabG 88655 GGCAAGGTGAAGTGGACCGG 0 No mirror repeats IJSTR 205 304

INTERNATIONAL JOURNAL OF SCIENTIFIC & TECHNOLOGY RESEARCH VOLUME 4, ISSUE 2, DECEMBER 205 ISSN 2277-866 6 lipoprotein LprG 886700 2 GGCTGCTCGTCGG CTGGGTGGGTC 7 rpsl 30S ribosomal protein S2 888259 CCGAAGAAGCC 8 DNA replication and repair proteinrecf 887089 0 No mirror repeats 9 885767 bacterioferritin bfra Rv876 GGCG-GCCCACACCCGCGCGG 20 inha NADHdependent enoyl-[acp] reductase 886523 TTCGTATGCTT Table 2: Shows summary of different types of mirror repeats observed in various mycobacterium tuberculosis using FPCB strategy their NCBI number. The red sequences are mirror to the blue sequences. References [] Pattabiraman N: Can the Double Helix Be Parallel? Biopolymers. 986; 25(9): 603 606 [2] Dorothy M. Lang, (2007). Imperfect DNA mirror repeats in the gag gene of HIV- (HXB2)identify key functional domains and coincide with protein structuralelements in each of the mature proteins. Virology Journal 2007. (2007). [5] http://www.bioinformatics.org/sms/rev_comp.html [6] http://blast.ncbi.nlm.nih.gov/blast.cgi?page=mega Blast&PROGRAM=blastn&BLAS_PROGRAMS=m egablast&page_type=blastsearch&blast_sp EC=blast2seq&QUERY=&SUBJECTS=) [3] Yo-Cheng Chang Chuan-Hsiung Chang, (2003). Common Repeat Sequences in Bacterial Genomes. Journal of Medical and Biological Engineering, 23(2): 65-72 [4] http://www.ncbi.nlm.nih.gov IJSTR 205 305