Chapter # EVOLUTION AND ORIGIN OF NEUROFIBROMIN, THE PRODUCT OF THE NEUROFIBROMATOSIS TYPE 1 (NF1) TUMOR-SUPRESSOR GENE

Similar documents
Sequence Alignment Techniques and Their Uses

Small RNA in rice genome

Ensembl focuses on metazoan (animal) genomes. The genomes currently available at the Ensembl site are:

08/21/2017 BLAST. Multiple Sequence Alignments: Clustal Omega

Hands-On Nine The PAX6 Gene and Protein

Giri Narasimhan. CAP 5510: Introduction to Bioinformatics CGS 5166: Bioinformatics Tools. Evaluation. Course Homepage.

BIOINFORMATICS LAB AP BIOLOGY

Bioinformatics. Dept. of Computational Biology & Bioinformatics

Genomes and Their Evolution

Computational Biology: Basics & Interesting Problems

Example of Function Prediction

BLAST. Varieties of BLAST

Taxonomy. Content. How to determine & classify a species. Phylogeny and evolution

Supplemental Data. Perea-Resa et al. Plant Cell. (2012) /tpc

Visit to BPRC. Data is crucial! Case study: Evolution of AIRE protein 6/7/13

SUPPLEMENTARY INFORMATION

Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut

USING BLAST TO IDENTIFY PROTEINS THAT ARE EVOLUTIONARILY RELATED ACROSS SPECIES

THEORY. Based on sequence Length According to the length of sequence being compared it is of following two types

InDel 3-5. InDel 8-9. InDel 3-5. InDel 8-9. InDel InDel 8-9

Dr. Amira A. AL-Hosary

Introduction to Bioinformatics. Shifra Ben-Dor Irit Orr

Procedure to Create NCBI KOGS

Bioinformatics Chapter 1. Introduction

Bioinformatics Exercises

Comparing Genomes! Homologies and Families! Sequence Alignments!

Algorithms in Computational Biology (236522) spring 2008 Lecture #1

Introduction to Bioinformatics

BLAST Database Searching. BME 110: CompBio Tools Todd Lowe April 8, 2010

Homology Modeling. Roberto Lins EPFL - summer semester 2005

BSC 4934: QʼBIC Capstone Workshop" Giri Narasimhan. ECS 254A; Phone: x3748

Multiple Sequence Alignment. Sequences

Bioinformatics (GLOBEX, Summer 2015) Pairwise sequence alignment

Basic Local Alignment Search Tool

SUPPLEMENTARY INFORMATION

Cladistics and Bioinformatics Questions 2013

Processes of Evolution

Sequences, Structures, and Gene Regulatory Networks

A profile-based protein sequence alignment algorithm for a domain clustering database

Phylogeny and systematics. Why are these disciplines important in evolutionary biology and how are they related to each other?

Name: Class: Date: ID: A

CGS 5991 (2 Credits) Bioinformatics Tools

BIOINFORMATICS: An Introduction

- conserved in Eukaryotes. - proteins in the cluster have identifiable conserved domains. - human gene should be included in the cluster.

Phylogenetics in the Age of Genomics: Prospects and Challenges

Statistical Machine Learning Methods for Biomedical Informatics II. Hidden Markov Model for Biological Sequences

Supplementary text for the section Interactions conserved across species: can one select the conserved interactions?

Map of AP-Aligned Bio-Rad Kits with Learning Objectives

Introduction to protein alignments

Homology and Information Gathering and Domain Annotation for Proteins

Genome Annotation. Bioinformatics and Computational Biology. Genome sequencing Assembly. Gene prediction. Protein targeting.

MiGA: The Microbial Genome Atlas

Principles of Genetics

An Efficient Algorithm for Protein-Protein Interaction Network Analysis to Discover Overlapping Functional Modules

Chapter 5. Proteomics and the analysis of protein sequence Ⅱ

Biology Scope and Sequence Student Outcomes (Objectives Skills/Verbs)

METHODS FOR DETERMINING PHYLOGENY. In Chapter 11, we discovered that classifying organisms into groups was, and still is, a difficult task.

In-Depth Assessment of Local Sequence Alignment

Tiffany Samaroo MB&B 452a December 8, Take Home Final. Topic 1

Hereditary Hemochromatosis

CSCE555 Bioinformatics. Protein Function Annotation

A A A A B B1

Investigation 3: Comparing DNA Sequences to Understand Evolutionary Relationships with BLAST

I. Molecules and Cells: Cells are the structural and functional units of life; cellular processes are based on physical and chemical changes.

Grundlagen der Bioinformatik Summer semester Lecturer: Prof. Daniel Huson

"Nothing in biology makes sense except in the light of evolution Theodosius Dobzhansky

I. Molecules & Cells. A. Unit One: The Nature of Science. B. Unit Two: The Chemistry of Life. C. Unit Three: The Biology of the Cell.

MULTIPLE SEQUENCE ALIGNMENT FOR CONSTRUCTION OF PHYLOGENETIC TREE

Algorithms in Bioinformatics FOUR Pairwise Sequence Alignment. Pairwise Sequence Alignment. Convention: DNA Sequences 5. Sequence Alignment

Tools and Algorithms in Bioinformatics

AP Biology Essential Knowledge Cards BIG IDEA 1

Chapter 16: Reconstructing and Using Phylogenies

Group activities: Making animal model of human behaviors e.g. Wine preference model in mice

NIH Public Access Author Manuscript Immunogenetics. Author manuscript; available in PMC 2006 May 31.

Course Descriptions Biology

Big Idea 3: Living systems store, retrieve, transmit, and respond to information essential to life processes.

Tandem repeat 16,225 20,284. 0kb 5kb 10kb 15kb 20kb 25kb 30kb 35kb

Phylogeny 9/8/2014. Evolutionary Relationships. Data Supporting Phylogeny. Chapter 26

Computational approaches for functional genomics

Early History up to Schedule. Proteins DNA & RNA Schwann and Schleiden Cell Theory Charles Darwin publishes Origin of Species

BIOLOGY Grades Summer Units: 10 high school credits UC Requirement Category: d. General Description:

C3020 Molecular Evolution. Exercises #3: Phylogenetics

The nonsynonymous/synonymous substitution rate ratio versus the radical/conservative replacement rate ratio in the evolution of mammalian genes

86 Part 4 SUMMARY INTRODUCTION

Comparative Bioinformatics Midterm II Fall 2004

Statistical Machine Learning Methods for Bioinformatics II. Hidden Markov Model for Biological Sequences

RELATIONSHIPS BETWEEN GENES/PROTEINS HOMOLOGUES

GENETICS - CLUTCH CH.1 INTRODUCTION TO GENETICS.

SCIENTIFIC EVIDENCE TO SUPPORT THE THEORY OF EVOLUTION. Using Anatomy, Embryology, Biochemistry, and Paleontology

Comparison and Analysis of Heat Shock Proteins in Organisms of the Kingdom Viridiplantae. Emily Germain, Rensselaer Polytechnic Institute

Analytical Study of Hexapod mirnas using Phylogenetic Methods

8/23/2014. Phylogeny and the Tree of Life

Name Block Date Final Exam Study Guide

AP Curriculum Framework with Learning Objectives

Sequence alignment methods. Pairwise alignment. The universe of biological sequence analysis

Bioinformatics tools for phylogeny and visualization. Yanbin Yin

Honors Biology Reading Guide Chapter 11

Effects of Gap Open and Gap Extension Penalties

How to read and make phylogenetic trees Zuzana Starostová

Introduction to Bioinformatics Integrated Science, 11/9/05

Transcription:

142 Part 5 Chapter # EVOLUTION AND ORIGIN OF NEUROFIBROMIN, THE PRODUCT OF THE NEUROFIBROMATOSIS TYPE 1 (NF1) TUMOR-SUPRESSOR GENE Golovnina K. *1, Blinov A. 1, Chang L.-S. 2 1 Institute of Cytology and Genetics, SB RAS, Novosibirsk, Russia; 2 Center for Childhood Cancer, Children s Research Institute, Children s Hospital and Department of Pediatrics, The Ohio State University, USA * Corresponding author: e-mail: ksu@bionet.nsc.ru Key words: neurofibromin, the Neurofibromatosis type 1 (NF1) gene, RasGAP protein family, GTPase-activator proteins for Ras-like GTPase, phylogeny, BLAST SUMMARY Motivation: Neurofibromatosis type 1 (NF1) is a common genetic disorder, which predisposes affected individuals to a variety of clinical features including tumors of the central and peripheral nervous systems. The product of the NF1 gene, neurofibromin, is a tumor suppressor which most likely acts through the interaction of its GTPase activating protein (GAP) related domain (GRD) with RAS to regulate cellular growth. While clinical features of NF1 as well as functional activity of human neurofibromin are intensively studied now little is understood about its evolution, diversity, and overall distribution among different taxa. Results: By combining bioinformatic and phylogenetic approaches, we demonstrated that NF1 homologs are present across a wide range of eukaryotic lineages. We observed 26 similar to NF1 amino-acid sequences from Chordata, Echinodermata, Arthropoda, Platyhelmintes and Fungi taxons. Taking into account a presence of NF1 gene in fungi, we can suggest the derivation of this gene before the Metazoan origin. In this case, an absence of NF1 in Nematoda and Mollusca should be a result of this gene elimination. INTRODUCTION Clinical features of NF1. Neurofibromatosis type I (NF1) or von Recklinghausen neurofibromatosis is the most common cancer predisposition syndrome affecting the nervous system with the incidence of 1 in 3000 worldwide (Gutmann, 2001). Typical manifestations include café au lait spots (hyperpigmented macules), cutaneous and subcutaneous neurofibromas (benign tumors), and malignancies of the central and peripheral nervous systems. The less common abnormalities observed in NF1 patients include learning disabilities (although frank mental retardation is rare) and skeletal abnormalities such as scoliosisand pseudoarthrosis (Skuse, Cappione, 1997). In addition to these features, children with NF1 can present within the first 6 years of life with lowgrade glial tumors involving the optic parthway. Two intriguing features of NF1 are the wide range of potentially affected tissues and the great variation in expressivity of disease traits across those affected. To date, the underlying source of this variation remains somewhat unclear, but evidence suggests that aberrations in normal NF1 RNA processing may be involved (Skuse, Cappione, 1997).

Comparative and evolutionary genomics and proteomics 143 Genetic features of NF1. Early insights into the patogenesis of NF1-associated tumors began with the identification of the NF1 gene in 1990 (Wallace et al., 1990). The NF1 gene itself, located at 17q11.2, encompasses >300 kb of human chromosome 17 (Li et al., 1995). The 60 exons which constitute the human NF1 gene give rise to several alternatively spliced transcripts. Within the central portion of the NF1 encoded protein, neurofibromin, lies a region with homology with the mammalian GTPase activating proteins (GAP) and the yeast inhibitor of RAS proteins 1 and 2 termed the GAP related domain (GRD). Neurofibromatoses. Discussions about neurofibromatoses include two the most common forms of this disorder: neurofibromatosis 1(NF1) and neurofibromatosis 2 (NF2). Individuals with two inherited cancer syndromes, NF1 and NF2 develop both benign and malignant tumors. The corresponding genes mutated in these two disorders encode tumor suppressor proteins, termed neurofibromin (NF1) and merlin (NF2), which have a similar function to regulate cell growth and differentiation. While both genes are not related and located on the different chromosomes in human, nothing is known about their relationships, distribution and function in other organism as well as about their origin. In the previous study we investigated evolution of merlin and postulated its origin in early metazoan (Golovnina et al., 2005). Here we represent the initial phylogenetic research of NF1 protein and its homologs based on bioinformatics approaches. METHODS AND ALGORITHMS BLAST search. Initial sequences of genes and proteins of interest from various organisms were identified by performing multiple TBLASTN and BLASTP (Altschul et al., 1997) searches against GenBank (http://www.ncbi.nlm.nih.gov/genomes/), Ensembl (http://www.ensembl.org ) and wormbase (www.worbase.org) databases. In each case putative H.sapiens NF1 protein were used as the query sequence. Only NF1-like representatives of insects were located by D. melanogaster NF1 protein homology. To obtain more information we then searched the desirable sequences across genomic databases of completely or partially sequenced genomes available at The Sanger Institute (http://www.sanger.ac.uk/datasearch) and The Institute for Genomic Research (TIGR) (http://tigrblast.tigr.org/tgi/), Doe Joint Genome Institute (JGI) (http://www.jgi.doe.gov), The Broad Institute (http://www.broad.mit.edu). The predicted nucleotide and aminoacid sequences of many species were assembled manually using available contigs and assemblies of genomes by homology to query sequence. Alignments and phylogeny. The Clustal X program (Thompson et al., 1997) was used to align the characterized or predicted protein sequences from different species. All alignments were corrected for obvious alignment ambiguity The resultant alignment contained 3888 aligned positions and was used to construct phylogenetic tree. Phylogenetic analysis was carried out using the Neighbor-Joining method in MEGA 3.1 program (Kumar et al., 2004). RESULTS AND DISCUSSION In total, 22 NF1-like sequences, that have overall homology, from Deuterostomia, Arthropoda, Platyhelmintes have been located in the present investigation. The main criterion for identification these sequences was their evident homology to known NF1 proteins. Moreover, we have found four Fungi proteins that show not so strongly similarity to NF1-like sequences. Five of 22 NF1-like proteins are experimentally annotated, namely H. sapiens NF1, M. musculus NF1, R. norvegicus NF1, T. rubripes NF1, and D. melanogaster NF1. Eight homologs were bioinformatics predicted previously and were located by initial BLASTP search across NCBI database

144 Part 5 (P. troglodytes, C. familiaris, G. gallus, A. gambiae), as well as NF1-related protein of N. crassa and three other RasGAPs of fungi. The rest sequences were assembled manually by parts using available paired scaffolds, contigs, and assemblies of the sequenced genomes together with protein and EST databases. Based on the obtained NF1 sequences and the close related proteins the phylogenetic tree was constructed (Fig. 1). RasGap family members from another groups were included in our analysis as an outgroup to show a homology of all obtained NF1- like sequences. We conducted phylogenetic analysis including only RasGap domain and corresponding sequences of newly identified proteins in view of possible different origin of some other domains except RasGap (Fig. 2) Both trees are similar in topology with small alterations in Fungi group. Figure 1. Phylogenetic analysis of NF1 homologs and its close related proteins from RasGAP group. NF1 clade is shown by solid line on the right. The dashed line denotes observed fungi proteins that are similar to NF1. UniProt accession numbers for full length sequences are represented in bold letters, GenBank in regular.

Comparative and evolutionary genomics and proteomics 145 Figure 2. Phylogenetic analysis of NF1 homologs and its close related proteins from RasGAP group based on RasGap domain sequences. Fungi proteins that have similarity with NF1-like sequences are marked with dark round. Phylogenetic analysis demonstrate a distribution of NF1 homologs across all investigated and available to date the major metazoan (chordata, urochordata, echinodermata, insects, and platyhelmintes) and fungi lineages. Among all species investigated, no NF1 sequences have been found in round worms (C. elegans, C. briggsae, C. remanie, B. malayi) and mollusk (B. glabrata). There are two possible explanations: (i) the derivation of NF1 gene occured after a separation of both Nematoda and mollusca or (ii) it was lost specifically in these groups. In C. elegans genome were previously annotated RasGAP family members (GAP1 and GAP2) that have homologs in C. briggsae genome. They are clasterized together with synras group of proteins that means their evolutionary relatedness. The presence of NF1-related sequences in fungi is disputable. NF1-like proteins have been found in four species of fungi kingdom, belonging to both Ascomycota and Basidiomycota. These amino-acid sequences have overall similarity to other NF1-like proteins but not so strong besides sequences corresponding to RasGap domain are more similar to other fungi RasGap proteins (Fig. 2). One of them (N. crassa) was previously annotated as NF1-related protein. On the other hands, no NF1-like sequences were detected in the rest fungi species examined.

146 Part 5 Therefore, the root of neurofibromin origin is unclear now and with the great progress in genome sequencing projects it promised to be more definite. Previously, no merlin homologs (another tumor suppressor) in fungi were observed. ACKNOWLEDGEMENTS This study was supported by grants from the US Department of Defense Neurofibromatosis Research Program. REFERENCES Altschul S.F., Madden T.L., Schaffer A.A., Zhang J., Zhang Z., Miller W., Lipman D.J. (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucl. Acids Res., 25, 3389 3402. Golovnina K., Blinov A., Akhmametyeva E.M., Omelyanchuk L.V., Chang L.-S. (2005) Evolution and origin of merlin, the product of the neurofibromatosis type 2 (NF2) tumor-suppressor gene. BMC Evolutionary Biology, 5, 69. Gutmann D.H. (2001) The neurofibromatosis: when less is more. Hum. Mol. Genet., 10, 747 755. Kumar S., Tamura K., Nei M. (2004) MEGA3: integrated software for molecular evolutionary genetics analysis and sequence alignment. Briefings in Bioinformatics, 5, 150 163. Li Y., O Connell P., Breidenbach H.H., Cawthon R., Stevens J., Xu G., Neil S., Robertson M., White R., Viskochil D. (1995) Genomic organization of the neurofibromatosis 1 gene (NF1). Genomics, 25, 9 18. Skuse G.R., Cappione A.J. (1997) RNA processing and clinical variability in neurofibromatosis type I (NF1). Hum. Mol. Genet., 6, 1707 1712. Thompson J.D., Gibson T.J., Plewniak F., Jeanmougin F., Higgins D.G. (1997) The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucl. Acids Res., 15, 4876 4882. Wallace M.R., Marchuk D.A., Andersen L.B., Letcher R., Odeh H.M., Saulino A.M., Fountain J.W., Brereton A., Nicholson J., Mitchell A.L. et al. (1990) Type 1 neurofibromatosis gene: identification of a large transcript disrupted in three NF1 patients. Science, 249, 181 186.