Molecular Population Genetics of Arabidopsis thaliana Ferulate-5-Hydroxylase and Flavanone-3-Hydroxylase Genes
PLSC 731 Plant Molecular Genetics

Due: April 13, 2006, 11am

DNA sequence data allow for the detailed population genetic analysis of genes. For this assignment, you will work in your groups to analyze the ferulate-5-hydroxylase and flavanone-3-hydroxylase genes of Arabidopsis thaliana. The appropriate data files for this analysis are available for download from the class WWW site. You are to use these data sets and prepare a report that:

1. presents data that support your description of the degree of diversity and polymorphism among the individuals in this population for the two genes and their expressed proteins;
2. illustrates and discusses the relationship among the individuals in this population for these two genes and their expressed proteins, with particular reference to geographical location; and
3. uses population genetics data to support a discussion regarding the evolution of these two loci.

A report that contains two to three written pages (standard format) should be sufficient. It is up to each group to determine what they feel is sufficient detail. But remember, you are writing a report for a graduate class, so it is expected to be representative of a short professional report. (This page length does not include the coversheet, references, and tables and figures you should also prepare for your report.)

In addition, you are supplied with a number of papers that describe nucleotide variation among a number of different genes in A. thaliana. You should use these manuscripts as a guide for your report. You should also compare the results you obtain for these two genes with the genes described in these papers. In particular, the results in the Genetics (2000) 155:863 article should be of interest to you. Your group should also consult the sequence diversity papers that we discussed in class.
1. Downloading the appropriate software

You will need three software programs for this assignment. All of these are available from the WWW. Here are the programs and where you can obtain them.

CLUSTALX 1.8.1: sent as an attachment
TreeView 1.6.6:
DnaSP:

2. Downloading the datafiles

You will need four datafiles for this assignment. These are available from the following WWW site for this assignment.

3. Gathering basic information about the sequence diversity within your population

a. To perform a comparative analysis of a collection of DNA or protein sequences, it is best to use a tool that aligns the sequences and readily shows the differences. The best way to identify the regions of homology is to align all of the sequences for each gene or protein. To accomplish this, navigate to the Multalin WWW site. Multalin is an online program that will align your sequences. The URL is:

b. Open your nucleotide or amino acid file for a particular gene and copy all of the records. Paste the data into the box below the "Cut and paste your sequences here below" statement.

c. Now you need to set the following parameters:

If you are going to perform a DNA comparison, go to the drop-down menu below Symbol comparison table and set the parameter to DNA-5-0. (Click elsewhere on the WWW page. Sometimes this is necessary to ensure the form records your symbol selection.)

If you are going to perform an amino acid comparison, use the default Blosum setting.

Next go to the bottom of the page, find Maximum line length, and set the value to 100. (This will ensure that you can see all of the alignment without needing a scroll bar.)

d. Now click on the Start Multalin! button.

e. In a short time, a new page will appear with your alignment. For the nucleotide data, it will be informative to collect data on the number of nucleotide changes. Remember that you are using Arabidopsis lyrata (Aly-etc) as a reference. The remainder of the records, of the form Ath-etc, refer to different A. thaliana genotypes.

f. It will be informative to determine whether your differences occur in the exons or introns of the gene. In the file fah-f3h-cdna-genomic.txt, you will find the entire genomic and cDNA sequence for these two genes. You can add these to the alignment (paste them into the data box along with the other sequences). This alignment should provide you with the information necessary to determine where the introns are located, and thus the distribution of the variation relative to the exons and introns.

4. Performing multiple alignment and building a phylogenetic tree

Multalin is a great program for visualizing your differences, yet it is not well suited to tree building. A good program for that task is CLUSTALX. Building your tree involves three steps. First, you need to perform a multiple alignment in CLUSTALX. Next, you need to build the tree. Finally, you need to perform a bootstrap analysis of your tree. There are a number of different tree-building algorithms, but CLUSTALX supports only the neighbor-joining distance method.

a. You will first align the DNA nucleotide sequences. To accomplish this task, you will use the CLUSTALX software. This widely used software performs multiple alignments. The underlying algorithm is CLUSTALW; CLUSTALX is a window-based interface to CLUSTALW. The alignment is developed in three steps. The first step involves a pairwise alignment of each pair of sequences. These alignments are then used to develop a guide tree. Finally, the guide tree is used to create the multiple alignment.

b. Open the CLUSTALX software from the location in which you placed it. Go to File/Load Sequence in the drop-down menus. Find your sequence file (f3h-nt.txt, for example) and select Open.

c. Go to Alignment/Output Format Options in the drop-down menus and check the following boxes:

CLUSTAL format
GCG/MSF format
NBRF/PIR format

Leave the remainder of the options at their default settings and click CLOSE. Each of these output formats is useful in a number of programs. The CLUSTAL format is used by CLUSTALX, whereas the NBRF/PIR format will be used in a later analysis. The GCG/MSF formatted output will be used in analyses beyond the scope of this assignment.
d. Go to Alignment/Alignment Parameters/Pairwise Alignment Parameters from the drop-down menus. These are the options used by the program for the first step, pairwise alignment. Here you can set the penalty for introducing a gap and the penalty for extending a gap. For DNA alignments, set the following parameters to these values. These may be your default settings.

Gap Opening:
Gap Extension: 6.66

Using the DNA Weight Matrix IUB is also appropriate. Click CLOSE.

e. Go to Alignment/Alignment Parameters/Multiple Alignment Parameters from the drop-down menus. Again, set the following parameters to these values. These may be your default settings.

Gap Opening:
Gap Extension: 6.66

Using the DNA Weight Matrix IUB is also appropriate. Click CLOSE.

f. Go to Alignment/Do Complete Alignment. The Output Guide Tree File: and Output Alignment Files: fields show the names that the files written by CLUSTALX will be given. Unless you change these settings, the files will have the same prefix as the input file (f3h-nt, for example) and be given the extensions .dnd for the guide tree, .aln for the Clustal file, .pir for the NBRF/PIR file, and .msf for the GCG/MSF file. Click ALIGN.

g. You can follow the progress of the alignment process at the very bottom of the interface.

h. Phylogenetic trees are essential tools that depict the relationship between the different sequences in your analysis. The trees show which genes are most closely related and which are more distant. All methods are built under the assumption that those species with the fewest differences are most alike, whereas those with the greatest differences are the most different.

There are three basic types of tree-building methods. Distance methods use a distance matrix to find the sequences with the smallest distance from each other and then calculate the distance of each sequence to the node that joins them. The result is called a distance tree because the lengths of the branches represent defined distances. Distance methods, such as Neighbor-Joining, produce a single tree. Parsimony methods search for the tree that requires the fewest evolutionary steps. This procedure can generate a number of trees that require the same number of evolutionary steps for their construction. Maximum likelihood procedures attempt to discover the tree that maximizes the probability of observing the data.

We will use CLUSTALX to develop a Neighbor-Joining distance tree. Go to Trees/Output Format Options. Change the Bootstrap labels on: setting from BRANCH to NODE. This produces a cosmetic change in the manner in which the tree will be labeled. This will be important when you actually display the trees you develop. Click CLOSE.

i. Go to Trees/Draw N-J Tree. The SAVE PHYLIP TREE AS: field shows the name that will be given to the N-J tree. It uses the same file naming convention as described in 4.f, except that the file extension is .ph. Click OK.

j. Trees are built based on the alignment, but it is important to attach a level of statistical confidence to the trees that you build. The standard statistical analysis used to provide a confidence level is called bootstrapping. Bootstrapping is an iterative process in which the alignment columns are resampled (with replacement) and the tree is rebuilt to determine how frequently certain entries are grouped together. The more often entries are grouped together during the bootstrap process, the higher the confidence you can have that those sequences are related.

Go to Trees/Bootstrap N-J Tree. The Random number generator seed [1-1000] is a value that seeds the beginning of the bootstrap analysis. (If you are reanalyzing the same data set over and over, you should change this number each time. It can be any number between 1 and 1000.) The Number of bootstrap trials [1-1000] is self-explanatory. Before the advent of computers with high-speed processors, it was computationally expensive to perform a large number of bootstrap trials. That is not the case today. Therefore, you should use the default value of 1000 trials. A second output tree is generated. This tree will contain the bootstrap values on the nodes. This tree is given a .phb extension.

k. Be sure you perform this analysis for both nucleotide data sets.

5. Performing protein amino acid multiple alignments and tree building

a. CLUSTALX is also used to align protein amino acid sequences. Steps a through g are nearly identical to those described in section 4 for the DNA nucleotide alignments. Here are the differences you need to implement:

i. Load sequence f3h-aa.txt, for example.

ii. Go to Alignment/Alignment Parameters/Pairwise Alignment Parameters from the drop-down menus. Set the

Gap Opening to 35
Gap Extension

Select BLOSUM in the Protein Weight Matrix.

iii. Go to Alignment/Alignment Parameters/Multiple Alignment Parameters from the drop-down menus. Set the

Gap Opening to 15
Gap Extension to 0.30

Select BLOSUM in the Protein Weight Matrix.

b. CLUSTALX is also an appropriate tool for creating a phylogenetic tree for amino acid sequences. Follow steps 4h through 4j above to create your amino acid tree.

c. Be sure to analyze both amino acid data sets.

6. Viewing the phylogenetic trees

a. To see the trees that you have developed, you need to use the TreeView program. This program is specifically designed to view phylogenetic trees calculated by programs such as CLUSTALX. To view the trees, open the TreeView program.

b. Go to File/Open and select the f3h-nt.phb file (for example).

c. By default, the tree appears as a Slanted Cladogram. To depict this as a distance tree, go to Tree/Phylogram. (You can also click the rightmost tree shape in the group of tree shapes on the menu bar.) To show your bootstrap values, go to Tree/Show internal edge labels. (Alternatively, you can click the menu bar button with 12 surrounded by a tree cluster.)

d. Lastly, you need to save the tree in a format that can be viewed by drawing programs. To do this, go to File/Save as graphic. In the menu box, type the file name f3h-nt-tree-graphic (as an example). Leave the file type as .emf.

e. Now you need to view and save the protein tree. Open the appropriate .phb file for each protein data set.
f. Repeat steps c and d above for the protein file. Save the file as f3h-aa-tree-graphic (as an example).

g. Be sure that you perform the tree analysis for both nucleotide and amino acid data sets.

7. Population genetic analysis of the gene

a. The last step of the assignment is to perform a comparative analysis of the data. This will be done using the DnaSP software. This software can calculate several important population genetics statistics that give an indication of the variability of the gene sequences and the types of selection pressure the gene is undergoing. Open the DnaSP software from the location in which you placed it.

b. Go to File/Open Data File. For example, choose the file f3h-nt.pir. Click Close on the Data Information box.

c. Rather than recording the results of each analysis as you go, DnaSP can create an output file in which all of your analyses are stored. You can then go back to this text file later to collect the data you need for your report. Go to File/Send All Output to File. Name your file f3h-dna, for example, and hit Save. The program will append the file type extension .out to this file name. Click OK in the pop-up box.

d. To view the data, go to Display/View Data. This will display your data. The display is restricted to just 28 nucleotides.

e. Go to Analysis/DNA Polymorphism. Click OK. Study this output (but remember the data will also be found in your output file). You will also use this information as you fill in the table described in section 9 below.

Haplotypes can be defined as a unique set of sequences. From a genetics perspective, these are nucleotide variants that reside so close to each other that they are inherited as a unit. If two sequences are identical, then they are considered to be the same haplotype.

Nucleotide variation can be expressed as diversity or polymorphism. Diversity (π) is estimated as the average number of nucleotide substitutions per site when two sequences are compared. The value DnaSP reports is the average over all pairwise sequence comparisons. [To read more about this and other analyses performed by DnaSP, go to Help/Contents (or hit the F2 key) and click on the analysis of interest.] Polymorphism (θ) is estimated from the proportion of nucleotide sites that are polymorphic, corrected for the number of sequences sampled. The statistic considers all of the data as a group. Because θ is expressed on a per site basis, it is appropriate to use this statistic to compare variation at two different genes.

f. Go to Analysis/Tajima's Tests. Click OK. This analysis indicates whether the gene sequence is undergoing selection or whether the variation was generated by mutation and drift (the neutral theory). Typically, this is performed within a species, but it is such an important test that you should perform it on your data set to gain some understanding of the important principles underlying it.

Darwin's landmark book made the case for natural selection as the driving force for the generation of variation. This theory states that in a large population experiencing fitness and viability constraints because of limited resources in the environment, those alleles that provide the population with an advantage in the environment will appear at a higher frequency (be selected) in the next generation. After many generations, that allele will become the predominant allele in that population. This is in contrast to the neutral theory, which states that a balance between mutation and genetic drift leads to the fixation of nucleotide changes. This theory suggests that the random processes that drive genetic drift (and not selection) lead to the increase in frequency of specific alleles in the subsequent generation. The cumulative sum of these effects over generations leads to the fixation of certain mutation events.

A number of statistical tests have been developed to determine whether a gene is evolving neutrally. One popular test of neutrality based on nucleotide data is Tajima's D. Without going into great depth, this statistic is a normalized difference between diversity and polymorphism. The value can be either positive (if π > θ) or negative (if π < θ). If the D value is too large or too small (based on the probability of obtaining a specific value), then the assumption of neutrality at the locus is rejected. It is then generally assumed that this is one indication that the locus is evolving by selection. It is important to note, however, that other population effects, such as bottlenecks, can also lead to significant D values.

g. Go to Analysis/Fu and Li's (and other) Tests. Click OK. This is another test of neutrality. We discussed this in class, and you have read about it in the papers we discussed in class.

h. Go to Analysis/Recombination. Click OK. This is a test for the number of recombination events, which can be defined based on four-gamete types. The four gametes represent the two parental and the two recombinant chromosome types. A computer simulation is used to determine the minimum number of recombination events. Evidence of recombination indicates that ancestral populations must have existed that represented the variation observed in this population.
i. Go to File/Close Output File: f3h-nt.out. This closes your data output file. You can now read that file in TextPad (or another program that reads .txt files).

j. Make sure you complete these analyses for both genes. You only need to analyze the nucleotide data.

8. Important geographic information

a. You are to determine whether there is some geographic relationship among your genotypes for each nucleotide or amino acid sequence. The following table provides information regarding the collection location for each of the genotypes analyzed in this data set.

Table 1. Geographic collection site for each of the Arabidopsis genotypes analyzed in this assignment.

Genotype               Collection site and country of origin
A. lyrata              Russia
A. thaliana CAN-0      Canary Islands, Spain
A. thaliana CHA-0      Champex, Switzerland
A. thaliana COL-2      Landsberg, Poland
A. thaliana COND       Condara, Khurmatov, Tadjikistan
A. thaliana CVI-0      Cape Verde
A. thaliana GR-5       Graz, Austria
A. thaliana ITA-0      Ibel Tazekka, Morocco
A. thaliana KAS-1      Kashmir, India
A. thaliana LA-0       Landsberg, Poland
A. thaliana ME-0       Mechtshausen, Germany
A. thaliana MH-0       Muehlen, Poland
A. thaliana MR-0       Monte/Tosso, Italy
A. thaliana NC-1       Ville-en-Vermois, France
A. thaliana PER-1      Perm, Russia
A. thaliana RI-0       Richmond, British Columbia, Canada
A. thaliana RSCH-0     Rschew/Starizd, Russia
A. thaliana RUB-1      Rubezhnoe, Ukraine
A. thaliana TUL-0      Turk Lake, Florida
A. thaliana WS-0       Vasljevici/Drijept, Byelorussia
A. thaliana YO-0       Yosemite, California, USA
9. Complete this data table

Your output from the DnaSP program will contain a tremendous amount of data that will assist with your analysis of these data sets. From the output file, you will be able to complete the following table.

                                      f3h        fah
Number of sequences
Number of total sites
Polymorphic sites
Haplotypes
Nucleotide variation
  Diversity (π/bp)
  Polymorphism (θ/bp)
Neutrality tests
  Tajima's D
  Fu and Li's D*
  Fu and Li's F*
Recombination
  # four-gamete-type site pairs
  R_M
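
To make the quantities in this table more concrete, the following Python sketch computes several of them for a toy alignment. It is an illustration only and not DnaSP's implementation: the function names and the toy sequences are hypothetical, the alignment is assumed to be gap-free with all sequences of equal length, θ is computed as Watterson's estimator, and D follows Tajima's (1989) formula. DnaSP's handling of gaps, missing data, and sites with more than two nucleotides is not reproduced, so values may differ from the program's output on real data.

```python
from itertools import combinations
from math import sqrt


def segregating_sites(seqs):
    """Indices of alignment columns that contain more than one nucleotide."""
    return [i for i, column in enumerate(zip(*seqs)) if len(set(column)) > 1]


def mean_pairwise_differences(seqs):
    """Average number of differing sites over all pairs of sequences (k)."""
    pairs = list(combinations(seqs, 2))
    total = sum(sum(a != b for a, b in zip(s, t)) for s, t in pairs)
    return total / len(pairs)


def harmonic(n, power=1):
    """Sum of 1/i**power for i = 1 .. n-1 (a1 when power=1, a2 when power=2)."""
    return sum(1 / i ** power for i in range(1, n))


def diversity_per_site(seqs):
    """Nucleotide diversity (pi): k divided by the alignment length."""
    return mean_pairwise_differences(seqs) / len(seqs[0])


def theta_per_site(seqs):
    """Watterson's theta: S / a1, divided by the alignment length."""
    s = len(segregating_sites(seqs))
    return s / harmonic(len(seqs)) / len(seqs[0])


def tajimas_d(seqs):
    """Tajima's D: (k - S/a1) scaled by its estimated standard deviation."""
    n = len(seqs)
    s = len(segregating_sites(seqs))
    if n < 4 or s == 0:
        raise ValueError("need at least 4 sequences and 1 segregating site")
    a1, a2 = harmonic(n), harmonic(n, 2)
    b1 = (n + 1) / (3 * (n - 1))
    b2 = 2 * (n * n + n + 3) / (9 * n * (n - 1))
    c1 = b1 - 1 / a1
    c2 = b2 - (n + 2) / (a1 * n) + a2 / a1 ** 2
    e1 = c1 / a1
    e2 = c2 / (a1 ** 2 + a2)
    return (mean_pairwise_differences(seqs) - s / a1) / sqrt(e1 * s + e2 * s * (s - 1))


def four_gamete_pairs(seqs):
    """Count pairs of two-state sites at which all four gametic types occur."""
    biallelic = [i for i in segregating_sites(seqs)
                 if len({s[i] for s in seqs}) == 2]
    return sum(1 for i, j in combinations(biallelic, 2)
               if len({(s[i], s[j]) for s in seqs}) == 4)


if __name__ == "__main__":
    # Hypothetical toy alignment: four sequences, six sites, no gaps.
    toy = ["ACGTAA", "ACGTAT", "ACGTTA", "ACGTTT"]
    print("Polymorphic sites:", len(segregating_sites(toy)))
    print("Haplotypes:", len(set(toy)))
    print("Diversity (pi/bp):", diversity_per_site(toy))
    print("Polymorphism (theta/bp):", theta_per_site(toy))
    print("Tajima's D:", tajimas_d(toy))
    print("Four-gamete site pairs:", four_gamete_pairs(toy))
```

In the toy alignment π exceeds θ, so D is positive, and the two variable sites show all four gametic combinations, which is the pattern the recombination analysis looks for. The significance of D is not assessed here; that still requires the critical values DnaSP reports.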
More informationQuantifying sequence similarity
Quantifying sequence similarity Bas E. Dutilh Systems Biology: Bioinformatic Data Analysis Utrecht University, February 16 th 2016 After this lecture, you can define homology, similarity, and identity
More informationUsing Phylogenomics to Predict Novel Fungal Pathogenicity Genes
Using Phylogenomics to Predict Novel Fungal Pathogenicity Genes David DeCaprio, Ying Li, Hung Nguyen (sequenced Ascomycetes genomes courtesy of the Broad Institute) Phylogenomics Combining whole genome
More informationQ1) Explain how background selection and genetic hitchhiking could explain the positive correlation between genetic diversity and recombination rate.
OEB 242 Exam Practice Problems Answer Key Q1) Explain how background selection and genetic hitchhiking could explain the positive correlation between genetic diversity and recombination rate. First, recall
More informationAP Biology Concepts and Connections. Reading Guide. Your Name: ! Chapter 13 How Populations Evolve. Key Terms
AP Biology Concepts and Connections Chapter 13 How Populations Evolve Reading Guide Key Terms adaptation fossils microevolution artificial selection founder effect molecular biology balancing selection
More informationOECD QSAR Toolbox v.4.0. Tutorial on how to predict Skin sensitization potential taking into account alert performance
OECD QSAR Toolbox v.4.0 Tutorial on how to predict Skin sensitization potential taking into account alert performance Outlook Background Objectives Specific Aims Read across and analogue approach The exercise
More informationThe neutral theory of molecular evolution
The neutral theory of molecular evolution Introduction I didn t make a big deal of it in what we just went over, but in deriving the Jukes-Cantor equation I used the phrase substitution rate instead of
More informationConstructing Evolutionary/Phylogenetic Trees
Constructing Evolutionary/Phylogenetic Trees 2 broad categories: istance-based methods Ultrametric Additive: UPGMA Transformed istance Neighbor-Joining Character-based Maximum Parsimony Maximum Likelihood
More informationInferring phylogeny. Constructing phylogenetic trees. Tõnu Margus. Bioinformatics MTAT
Inferring phylogeny Constructing phylogenetic trees Tõnu Margus Contents What is phylogeny? How/why it is possible to infer it? Representing evolutionary relationships on trees What type questions questions
More informationLecture Notes: BIOL2007 Molecular Evolution
Lecture Notes: BIOL2007 Molecular Evolution Kanchon Dasmahapatra (k.dasmahapatra@ucl.ac.uk) Introduction By now we all are familiar and understand, or think we understand, how evolution works on traits
More informationThe Phylogenetic Handbook
The Phylogenetic Handbook A Practical Approach to DNA and Protein Phylogeny Edited by Marco Salemi University of California, Irvine and Katholieke Universiteit Leuven, Belgium and Anne-Mieke Vandamme Rega
More informationTree Building Activity
Tree Building Activity Introduction In this activity, you will construct phylogenetic trees using a phenotypic similarity (cartoon microbe pictures) and genotypic similarity (real microbe sequences). For
More informationStatistical Machine Learning Methods for Bioinformatics II. Hidden Markov Model for Biological Sequences
Statistical Machine Learning Methods for Bioinformatics II. Hidden Markov Model for Biological Sequences Jianlin Cheng, PhD Department of Computer Science University of Missouri 2008 Free for Academic
More informationHomework Assignment, Evolutionary Systems Biology, Spring Homework Part I: Phylogenetics:
Homework Assignment, Evolutionary Systems Biology, Spring 2009. Homework Part I: Phylogenetics: Introduction. The objective of this assignment is to understand the basics of phylogenetic relationships
More informationJournal of Proteomics & Bioinformatics - Open Access
Abstract Methodology for Phylogenetic Tree Construction Kudipudi Srinivas 2, Allam Appa Rao 1, GR Sridhar 3, Srinubabu Gedela 1* 1 International Center for Bioinformatics & Center for Biotechnology, Andhra
More information1.A- Natural Selection
1.A- Natural Selection Big Idea 1: The process of evolution drives the diversity and unity of life. EU 1.A- Evolution is change in the genetic makeup of a population over time. EU 1.B- Organisms are linked
More informationWhat is Phylogenetics
What is Phylogenetics Phylogenetics is the area of research concerned with finding the genetic connections and relationships between species. The basic idea is to compare specific characters (features)
More informationPhylogenetic Trees. What They Are Why We Do It & How To Do It. Presented by Amy Harris Dr Brad Morantz
Phylogenetic Trees What They Are Why We Do It & How To Do It Presented by Amy Harris Dr Brad Morantz Overview What is a phylogenetic tree Why do we do it How do we do it Methods and programs Parallels
More informationFull file at CHAPTER 2 Genetics
CHAPTER 2 Genetics MULTIPLE CHOICE 1. Chromosomes are a. small linear bodies. b. contained in cells. c. replicated during cell division. 2. A cross between true-breeding plants bearing yellow seeds produces
More informationEvolution AP Biology
Darwin s Theory of Evolution How do biologists use evolutionary theory to develop better flu vaccines? Theory: Evolutionary Theory: Why do we need to understand the Theory of Evolution? Charles Darwin:
More informationBioinformatics 1. Sepp Hochreiter. Biology, Sequences, Phylogenetics Part 4. Bioinformatics 1: Biology, Sequences, Phylogenetics
Bioinformatics 1 Biology, Sequences, Phylogenetics Part 4 Sepp Hochreiter Klausur Mo. 30.01.2011 Zeit: 15:30 17:00 Raum: HS14 Anmeldung Kusss Contents Methods and Bootstrapping of Maximum Methods Methods
More informationDarwin, Mendel, and Genetics
Darwin, Mendel, and Genetics The age old questions Who am I? In particular, what traits define me? How (and why) did I get to be who I am, that is, how were these traits passed on to me? Pre-Science (and
More informationPhylogenetic Trees. Phylogenetic Trees Five. Phylogeny: Inference Tool. Phylogeny Terminology. Picture of Last Quagga. Importance of Phylogeny 5.
Five Sami Khuri Department of Computer Science San José State University San José, California, USA sami.khuri@sjsu.edu v Distance Methods v Character Methods v Molecular Clock v UPGMA v Maximum Parsimony
More informationICM-Chemist How-To Guide. Version 3.6-1g Last Updated 12/01/2009
ICM-Chemist How-To Guide Version 3.6-1g Last Updated 12/01/2009 ICM-Chemist HOW TO IMPORT, SKETCH AND EDIT CHEMICALS How to access the ICM Molecular Editor. 1. Click here 2. Start sketching How to sketch
More informationEvolution. Species Changing over time
Evolution Species Changing over time Objectives I can differentiate between natural selection and artificial selection and I can give examples of each. I can explain several reasons for genetic variation
More informationNCEA Level 2 Biology (91157) 2017 page 1 of 5 Assessment Schedule 2017 Biology: Demonstrate understanding of genetic variation and change (91157)
NCEA Level 2 Biology (91157) 2017 page 1 of 5 Assessment Schedule 2017 Biology: Demonstrate understanding of genetic variation and change (91157) Evidence Statement Q1 Expected coverage Merit Excellence
More informationChapter 5 Evolution of Biodiversity. Sunday, October 1, 17
Chapter 5 Evolution of Biodiversity CHAPTER INTRO: The Dung of the Devil Read and Answer Questions Provided Module 14 The Biodiversity of Earth After reading this module you should be able to understand
More informationMETHODS FOR DETERMINING PHYLOGENY. In Chapter 11, we discovered that classifying organisms into groups was, and still is, a difficult task.
Chapter 12 (Strikberger) Molecular Phylogenies and Evolution METHODS FOR DETERMINING PHYLOGENY In Chapter 11, we discovered that classifying organisms into groups was, and still is, a difficult task. Modern
More informationAlgorithms in Bioinformatics
Algorithms in Bioinformatics Sami Khuri Department of Computer Science San José State University San José, California, USA khuri@cs.sjsu.edu www.cs.sjsu.edu/faculty/khuri Distance Methods Character Methods
More informationLab 1 Uniform Motion - Graphing and Analyzing Motion
Lab 1 Uniform Motion - Graphing and Analyzing Motion Objectives: < To observe the distance-time relation for motion at constant velocity. < To make a straight line fit to the distance-time data. < To interpret
More informationDraft document version 0.6; ClustalX version 2.1(PC), (Mac); NJplot version 2.3; 3/26/2012
Comparing DNA Sequences to Determine Evolutionary Relationships of Molluscs This activity serves as a supplement to the online activity Biodiversity and Evolutionary Trees: An Activity on Biological Classification
More informationSingle alignment: Substitution Matrix. 16 march 2017
Single alignment: Substitution Matrix 16 march 2017 BLOSUM Matrix BLOSUM Matrix [2] (Blocks Amino Acid Substitution Matrices ) It is based on the amino acids substitutions observed in ~2000 conserved block
More informationOECD QSAR Toolbox v.3.4. Step-by-step example of how to build and evaluate a category based on mechanism of action with protein and DNA binding
OECD QSAR Toolbox v.3.4 Step-by-step example of how to build and evaluate a category based on mechanism of action with protein and DNA binding Outlook Background Objectives Specific Aims The exercise Workflow
More informationInDel 3-5. InDel 8-9. InDel 3-5. InDel 8-9. InDel InDel 8-9
Lecture 5 Alignment I. Introduction. For sequence data, the process of generating an alignment establishes positional homologies; that is, alignment provides the identification of homologous phylogenetic
More informationHaplotyping as Perfect Phylogeny: A direct approach
Haplotyping as Perfect Phylogeny: A direct approach Vineet Bafna Dan Gusfield Giuseppe Lancia Shibu Yooseph February 7, 2003 Abstract A full Haplotype Map of the human genome will prove extremely valuable
More informationBMI/CS 776 Lecture #20 Alignment of whole genomes. Colin Dewey (with slides adapted from those by Mark Craven)
BMI/CS 776 Lecture #20 Alignment of whole genomes Colin Dewey (with slides adapted from those by Mark Craven) 2007.03.29 1 Multiple whole genome alignment Input set of whole genome sequences genomes diverged
More informationReview of molecular biology
Review of molecular biology DNA is into RNA, which is into protein. What mrna sequence would be transcribed from the DNA template CTA? What sequence of trna would be attracted by the above mrna sequence?
More informationA. Correct! Genetically a female is XX, and has 22 pairs of autosomes.
MCAT Biology - Problem Drill 08: Meiosis and Genetic Variability Question No. 1 of 10 1. A human female has pairs of autosomes and her sex chromosomes are. Question #01 (A) 22, XX. (B) 23, X. (C) 23, XX.
More informationMichael Yaffe Lecture #5 (((A,B)C)D) Database Searching & Molecular Phylogenetics A B C D B C D
7.91 Lecture #5 Database Searching & Molecular Phylogenetics Michael Yaffe B C D B C D (((,B)C)D) Outline Distance Matrix Methods Neighbor-Joining Method and Related Neighbor Methods Maximum Likelihood
More informationPhylogeny: traditional and Bayesian approaches
Phylogeny: traditional and Bayesian approaches 5-Feb-2014 DEKM book Notes from Dr. B. John Holder and Lewis, Nature Reviews Genetics 4, 275-284, 2003 1 Phylogeny A graph depicting the ancestor-descendent
More informationChapter 6 Linkage Disequilibrium & Gene Mapping (Recombination)
12/5/14 Chapter 6 Linkage Disequilibrium & Gene Mapping (Recombination) Linkage Disequilibrium Genealogical Interpretation of LD Association Mapping 1 Linkage and Recombination v linkage equilibrium ²
More informationCHAPTERS 24-25: Evidence for Evolution and Phylogeny
CHAPTERS 24-25: Evidence for Evolution and Phylogeny 1. For each of the following, indicate how it is used as evidence of evolution by natural selection or shown as an evolutionary trend: a. Paleontology
More informationSequence analysis and comparison
The aim with sequence identification: Sequence analysis and comparison Marjolein Thunnissen Lund September 2012 Is there any known protein sequence that is homologous to mine? Are there any other species
More informationNOTES CH 17 Evolution of. Populations
NOTES CH 17 Evolution of Vocabulary Fitness Genetic Drift Punctuated Equilibrium Gene flow Adaptive radiation Divergent evolution Convergent evolution Gradualism Populations 17.1 Genes & Variation Darwin
More information6 Introduction to Population Genetics
Grundlagen der Bioinformatik, SoSe 14, D. Huson, May 18, 2014 67 6 Introduction to Population Genetics This chapter is based on: J. Hein, M.H. Schierup and C. Wuif, Gene genealogies, variation and evolution,
More informationHow to read and make phylogenetic trees Zuzana Starostová
How to read and make phylogenetic trees Zuzana Starostová How to make phylogenetic trees? Workflow: obtain DNA sequence quality check sequence alignment calculating genetic distances phylogeny estimation
More informationPhylogeny and systematics. Why are these disciplines important in evolutionary biology and how are they related to each other?
Phylogeny and systematics Why are these disciplines important in evolutionary biology and how are they related to each other? Phylogeny and systematics Phylogeny: the evolutionary history of a species
More informationprotein interaction analysis bulletin 6300
protein interaction analysis bulletin 6300 Guide to SPR Data Analysis on the ProteOn XPR36 System Ruben Luo, Bio-Rad Laboratories, Inc., 2000 Alfred Nobel Drive, Hercules, CA 94547 Kinetic Analysis To
More informationName Class Date. KEY CONCEPT Gametes have half the number of chromosomes that body cells have.
Section 1: Chromosomes and Meiosis KEY CONCEPT Gametes have half the number of chromosomes that body cells have. VOCABULARY somatic cell autosome fertilization gamete sex chromosome diploid homologous
More informationPopulation Genetics I. Bio
Population Genetics I. Bio5488-2018 Don Conrad dconrad@genetics.wustl.edu Why study population genetics? Functional Inference Demographic inference: History of mankind is written in our DNA. We can learn
More informationLecture 14: Multiple Sequence Alignment (Gene Finding, Conserved Elements) Scribe: John Ekins
Lecture 14: Multiple Sequence Alignment (Gene Finding, Conserved Elements) 2 19 2015 Scribe: John Ekins Multiple Sequence Alignment Given N sequences x 1, x 2,, x N : Insert gaps in each of the sequences
More information