Genome-wide analysis of nucleotide-binding site disease resistance genes in Medicago truncatula

Size: px
Start display at page:

Download "Genome-wide analysis of nucleotide-binding site disease resistance genes in Medicago truncatula"

Transcription

1 Chin. Sci. Bull. (2014) 59(11): DOI /s Article csb.scichina.com Bioinformatics Genome-wide analysis of nucleotide-binding site disease resistance genes in Medicago truncatula Hui Song Zhibiao Nan Received: 6 May 2013 / Accepted: 13 November 2013 / Published online: 23 February 2014 Ó Science China Press and Springer-Verlag Berlin Heidelberg 2014 Abstract The class of nucleotide-binding site (NBS)- Leucine-rich repeat (LRR) disease resistance genes play an important role in defending plants from a variety of pathogens and insect pests. Consequently, many NBS-LRR genes have been identified in various plant species. In this study, we identified 617 NBS-encoding genes in the Medicago truncatula genome (Mt3.5v5) and divided them into two groups, regular (490) and non-regular (127) NBS- LRR genes. The regular NBS-LRR genes were characterized on the bases of structural diversity, chromosomal location, gene duplication, conserved protein motifs, and EST expression profiling. According to N-terminal motifs and LRR motifs, the 490 regular NBS-LRR genes were then classified into 10 types: CC-NBS (4), CC-NBS-LRR (212), TIR-NBS (20), TIR-NBS-LRR (160), TIR-NBS-TIR (1), TIR-NBS-TIR-LRR (2), NBS-TIR (7), NBS-TIR-LRR (1), NBS (10), and NBS-LRR (73). Analysis of the physical location and duplications of the regular NBS-LRR genes revealed that the M. truncatula genome is similar to rice. Interestingly, we found that TIR-type genes are more frequently expressed than non-tir-type genes in M. truncatula, whereas the number of non-tir-type regular NBS- LRR genes was greater than TIR-type genes, suggesting the gene expression was not associated with the total number of NBS-LRR genes. Moreover, we found that the phylogenetic tree supported our division of the regular Electronic supplementary material The online version of this article (doi: /s ) contains supplementary material, which is available to authorized users. H. Song Z. Nan (&) Key Laboratory of Grassland Agro-Ecosystems of Ministry of Agriculture, College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou , China zhibiao@lzu.edu.cn NBS-LRR genes into two distinct clades (TIR-type and non-tir-type), but some of the non-tir-type lineages contain TIR-type genes. These analyses provide a robust database of NBS-LRR genes in M. truncatula that will facilitate the isolation of new resistance genes and breeding strategies to engineer disease resistance in leguminous crop. Keywords Medicago truncatula Disease resistance genes Nucleotide-binding sites Phylogenetic tree 1 Introduction Diseases are one of plant s major threats and may be caused by diverse microorganisms, including bacteria, fungi, viruses, nematodes, and insects. Over evolutionary time plants have developed various sophisticated mechanisms to protect themselves. One of the roles of the plant immune response is characterized by gene-for-gene interactions between a host disease resistance (R) gene and a pathogen avirulence (Avr) gene [1]. The R genes may be divided into at least five classes based on the structure of the encoded proteins [2 5] and the biggest category is NBS-LRR proteins [6]. Although how NBS-LRR proteins function in disease resistance is not completely clear, previous studies show that plants contain a large number of NBS-LRR proteins to confer resistance to diverse pathogens [7]. The NBS-LRR protein contains two major domains, the nucleotide-binding site (NBS) and the leucine-rich repeat (LRR) [8]. However, many plant NBS-LRR proteins also contain coiled-coil (CC) or a Toll/mammalian interleukin-1 receptor (TIR) domain at their N-terminal end [8, 9].

2 1130 Chin. Sci. Bull. (2014) 59(11): Therefore, NBS-LRR proteins are classified into TIR-NBS- LRR (TNL) and CC-NBS-LRR (CNL) groups [10]. In general, CNL genes are found in both monocot and dicot genomes, whereas TNL genes have been identified only in dicots [11]. Generally, the NBS domain contains several conserved motifs, which have been confirmed to play a role in binding and hydrolysis of ATP and GTP [2, 12, 13]. In addition, over-expression of the TIR or CC domain of some NBS-LRR proteins can induce hypersensitive response [14 16]. Recently, Sayar-Turet et al. [17] used NBS conserved motifs as a molecular marker to assess variation associated with potentially functional regions of the genome underlying specific phenotypes. Previous studies found that the LRR domain mediated direct or indirect interactions with pathogen molecules [18]. In addition, the LRR domain seems to act as a negative regulator to block inappropriate NBS activation [6]. Medicago truncatula (2n = 16) is an annual diploid in the tribe Trifolieae, cultivated as a forage crop and closely related to tetraploid alfalfa (Medicago sativa). However, M. truncatula is susceptible to a variety of pathogens. As an important economic crop, it is essential that we find ways to improve the disease resistance in M. truncatula. In recent years, more and more draft sequences of M. truncatula have been described [19], providing a unique opportunity for analysis of NBS-LRR genes in M. truncatula. NBS-LRR genes belong to one of the largest gene families in plant genomes and have been detected in a wide variety of plants, including nonvascular plants and angiosperms (i.e., Physcomitrella patens and Arabidopsis thaliana) [20, 21]. Although NBS-LRR genes have been identified in the M. truncatula draft genome (Mt1.0) and analyzed according to their phylogenetic pattern and genomic organization [22], the M. truncatula genome was resequenced in 2011 and this draft have better coverage of the genome [23]. Therefore, in this study, we identified 617 NBS-LRR genes in the recently sequenced genome (Mt3.5v5), including 490 regular and 127 non-regular NBS-LRR genes. We analyzed structural diversity, phylogenetic relationships, gene duplication, chromosomal location, conserved motif, and EST expression profiling in the NBS-LRR gene family. 2 Materials and methods 2.1 Database search and sequence retrieval The M. truncatula genome sequence (Mt3.5v5) was downloaded from The complete set of NBS-LRR gene sequences was identified using a deliberative process. First, a Hidden Markov Model (HMM) profile of NBS domain (PF00931) was downloaded from the Pfam database ( ac.uk/) [24]. We employed the NBS domain as a query to identify all possible NBS-LRR sequences in the M. truncatula genome database using the BLASTP program (P = 0.001), according to the method from previous studies [10, 25]. To insure that there were no additional related genes missing from the NBS-LRR gene prediction, another NBS domain was generated from alignments of A. thaliana NBS-LRR sequences downloaded from niblrrs.ucdavis.edu. Subsequently, searches on the Pfam database and COILS ( were used to confirm and classify each predicted NBS-LRR sequence. We located overlapping genes by aligning all of the candidate NBS-LRR sequences using Clustal W [27] and manually checking the alignment. Only the nonoverlapping NBS-LRR sequences were used for further analysis. 2.2 Identification of conserved motifs The program MEME (version 4.8, [28] was used to elucidate motifs in M. truncatula NBS- LRR protein sequences with the following parameters as described in previous methods [29]: (1) optimum motif widths were constrained to residues; (2) the number of repetitions was set to any, and (3) the maximum number of motifs was set to 20. Structural motif annotation was performed using the Pfam and COILS databases, as well as the SMART program ( de) [30]. 2.3 Phylogenetic analysis It is well known that amino acid sequences of NBS resistance genes are highly conservative [10]. Multiple sequence alignments of the highly conserved NBS domain were performed by Clustal X (version 1.83) [31]. The tree was analyzed and displayed in MEGA (version 4.0) software [32] with the neighbor-joining method and 1,000 bootstrap replicates. NBS-LRR genes are known to be subdivided into two clades: TIR-type and non-tir-type. Therefore, to perform phylogenetic analyses, we constructed TIR-type and non-tir-type phylogenetic trees between M. truncatula and Glycine max using the whole protein sequences by the same method. 2.4 Chromosomal locations of NBS-LRR genes and gene duplication The chromosomal locations of NBS-LRR genes were identified by methods described by Meyers [10]. Genome

3 Chin. Sci. Bull. (2014) 59(11): Pixelizer software was used to draw the location images of the M. truncatula NBS genes ( edu/genomepixelizer/genomepixelizer_w-elcome.html). To detect potential gene duplication, we aligned and calculated all of the relevant genes identified in M. truncatula genomes. We defined gene duplication such that between any two loci [25]: (1) the alignable nucleotide sequence covered [70 % of the longer sequence; (2) the amino acid identity between the sequences was [70 % identical. 2.5 Digital expression analysis: EST expression profiling Medicago truncatula EST sequences were downloaded from GenBank ( The NBS-LRR genes were used as query to perform a local BLASTN search against all of the ESTs. The following parameters were employed [29]: maximum sequence similarity [96 %, length [ 200 bp, and E-value \ Results 3.1 Identification and classification of NBS-LRR genes In this study, we identified 617 NBS-LRR genes with the 375 Mb Mt3.5v5 draft genome [19], including 490 regular and 127 non-regular NBS-LRR genes (Table 1, Table S1). The length of non-regular NBS domain is shorter (B2/3 of normal NBS domain) than that of normal NBS domain [25, 33, 34]. TIR-only and TIR-X sequences were excluded in this work. The 617 regular and non-regular NBS-LRR genes were analyzed against the Pfam database. In 490 out of 617 NBS-LRR genes, all of the characteristic NBS domain motifs are conserved. The non-regular genes are totally different in their structures and lack specific motifs and/or domains. In the regular NBS-LRR genes, we detected 191 TIRtype resistance genes, including NBS-TIR (7), TIR-NBS (20), NBS-TIR-LRR (1), TIR-NBS-LRR (160), TIR-NBS- TIR (1), and TIR-NBS-TIR-LRR (2). We found 216 non- TIR-type genes, including CC-NBS (4) and CC-NBS-LRR (212). Eighty-three NBS-LRR genes lacked CC and TIR motif regions (10 NBS and 73 NBS-LRR; Table 1). In addition, we found 22 potential pseudogenes among the 127 non-regular NBS-LRR genes with either a premature stop codon or a frameshift mutation (Table S2). The non-regular NBS-LRR genes include CC (25), TIR (31), and LRR (70) motif regions and 37 non-regular NBS-LRR genes contained only a conserved NBS motif (Table 1). Noticeably, the TIR motif was not detected in regular Table 1 Numbers of NBS-LRR gene in M. truncatula and rice genomes Predicted Letter code Medicago Rice a Regular NBS genes NBS N NBS-LRR NL CC-NBS CN 4 7 CC-NBS-LRR CNL NBS-TIR NT 7 0 TIR-NBS TN 20 0 NBS-TIR-LRR NTL 1 0 TIR-NBS-LRR TNL TIR-NBS-TIR TNT 1 0 TIR-NBS-TIR-LRR TNTL 2 0 Total regular NBS genes Non-regular NBS genes NBS NBS-LRR CC-NBS 3 0 CC-NBS-LRR 22 0 TIR-NBS 15 3 NBS-TIR 2 0 TIR-NBS-LRR 14 0 Total non-regular NBS genes a Data from Zhou et al. [25] NBS-LRR genes of the rice genome, except in non-regular NBS-LRR genes (3TIR-NBS; Table 1). 3.2 Identification of conserved motifs To analyze the conserved motifs of NBS-LRR genes in M. truncatula, we divided CNL (212) and TNL (160) groups into three parts defined by their N-terminal, NBS, and LRR domains (Table 1). The detailed motif sequences are shown in Table S3. The N-terminal domain in NBS-LRR genes covers amino acids. The N-terminal CC domain has been identified as a representative motif in the amino terminal of CNL proteins [35], and the presence or absence of a CC motif can be anticipated on the basis of characteristic motifs present in the NBS-LRR [21, 36]. In this study, we identified 20 motifs in the N-terminal region from CNL genes (Table S3). Compared with the CNL gene, motifs found in A. thaliana and rice [10, 25], the CNL genes in M. truncatula are highly variable and show many different patterns. The CNL genes all contained a Q (L/I/V) RD motif at the N-terminal [37] and we found similar motifs in the N-terminal domain (Table S3, motif 10).

4 1132 Chin. Sci. Bull. (2014) 59(11): Several conserved motifs were confirmed as present in the TIR domain of plant NBS-LRR genes and related proteins (Table S3). Sequence analysis showed high similarity between TIR motifs of M. truncatula and counterparts in A. thaliana [10]. A previous study in A. thaliana identified eight major NBS motifs [10], including P-loop, RNBS-A, Kinase 2, RNBS-B, RNBS-C, GLPL, RNBS-D, and MHDV. The sequences of these motifs differ in CNL and TNL proteins. In this study, MEME analyses identified seven consensus sequence motifs, but lack MHDV motif in CNL and TNL genes of M. truncatula (Table 2). The MHDV motif in different plant species has been classified as MHDL, MHSL, QHDV, or QHDL [11, 25, 34, 38]. The MHDV motif was not detected in the M. truncatula draft genome (Mt3.5v5) using MEME software, prompting us to consider several possibilities. (1) MEME may not have been able to recognize MHDV motif in M. truncatula if the MHDV motif possesses high levels of diversity in different plants. (2) MHDV motif in M. truncatula may have originated through a different origin of CNL and TNL genes. (3) An ancestral MHDV motif may have been lost during the evolution of M. truncatula or evolved into other structures through mutation. The LRR regions were characterized and found to be variable in sequence, size, and occurrence per gene [11]. High sequence variability in LRR is consistent with their role in determining the recognition specificity of pathogen encoded Avr proteins [38]. To investigate structural features in the LRR domain, we analyzed those domains using the COILS program, excluding some of the CNL and TNL genes that had a short LRR domain (\300 amino acids). 3.3 Phylogenetic analysis A phylogenetic analysis of 490 regular NBS-LRR sequences was performed to classify genes into groups and identify relationships between genes. One hundred and twenty-seven genes (non-regular genes) were excluded due to their excessively short length (B2/3 the normal length). Since a previous study showed that phylogenetic analyses of NBS domains allow groups of R genes to be distinguished [21], we constructed a composite phylogenetic tree for all candidate genes using the NBS domain sequence. There was a clear separation between the TIR-type and non-tir-type clades with well-supported bootstrap values, reflecting ancient differentiation of NBS-LRR genes into two major groups (Figs. 1, S1). However, in some cases, the distribution of M. truncatula NBS-LRR proteins reveals a high level of sequence divergence within TIRtype and non-tir-type clades. For example, a clade of non- TIR-type sequences resolves within eight TIR-type proteins (Fig. 1). Moreover, NBS/NBS-LRR genes were distributed in both TIR and non-tir groups. This topology indicates a paraphyletic origin of NBS/NBS-LRR genes and suggests that they may be distinguished by a loss of a domain [38]. Based on the major division evident in the NBS domain tree, two neighbor-joining trees were constructed from the aligned whole protein sequences of G. max and M. Table 2 Consensus sequences of major NBS domain motif in predicted M. truncatula CNL and TNL proteins Class Motif Consensus sequence a TIR-NBS-LRR P-loop IWGMSGIGKTTIAKQLFAKLF RNBS-A LVHLQEQLLSKLLGEK Kinase 2 RLxRKKVLIVLDDVD RNBS-B WFGPGSRIIVTTRDKHLLxSH RNBS-C YEVKELNETEALELFCWKAF GLPL YAGGLPLALKVLGSNLFGKSIEEWKSALDKLEKIPN RNBS-D EKSIFLDIACFFNGx CC-NBS-LRR P-loop VISIVGMGGLGKTTLAQLVYN RNBS-A QKHFDLKIWVCVSED Kinase 2 KRFLLVLDDVW RNBS-B DGSKGSKVLVTTRSEKVAxIM RNBS-C HxLxGLSEEEAWSLFKKHAFG GLPL LExIGKEIAKKCGGLPLAIKTLGGLLRSK RNBS-D CFLYCALFPEDYEIxKKxLIRLWIAEGFIxS a Consensus amino acid sequence derived from MEME. Related motifs in the NBS domains of CNL and TNL proteins are aligned. X indicates a non-conserved residue. Underlined residues indicate consensus sequences identical to the motifs detected in Arabidopsis NBS domains by Meyers et al. [10]

5 Chin. Sci. Bull. (2014) 59(11): Fig. 1 Phylogenetic relationship of the TIR-type and non-tir-type NBS-LRR proteins in M. truncatula. The neighbor-joining tree was constructed using the NBS domains. Black circle, red circle, and diamond indicate the TIR-type, non-tir-type, and NBS/NBS-LRR genes, respectively, truncatula, resulting in trees of TIR-type and non-tir-type NBS-LRR genes (Figs. 2, S2, and S3). In some cases, one clade includes many genes from a single species, indicating that some gene clades are species-specific, and these sequences were highly conserved by the gene duplication. However, in some cases, clades contain genes from G. max and M. truncatula, indicating the same origin or evolution of NBS-LRR genes from G. max and M. truncatula. The phylogeny-based method was often used to identify orthologs or paralogs [39]. In this study, we applied this method to identify 10 possible orthologs pair between G. max and M. truncatula (Fig. 2). The research of the NBS- LRR orthologs of G. max and M. truncatula contributes to the study on genetic transformation between G. max and M. truncatula. 3.4 Chromosomal location of NBS-LRR genes and gene duplication Figure 3 shows the locations of the NBS-LRR genes on the chromosomes of M. truncatula. The chromosomal distribution of the genes is unevenly (Fig. 3). Chromosome 3 contains the largest number of NBS-LRR genes (129), while chromosome 1 contains 15 NBS-LRR genes. There was no obvious difference between the distributions of the TIR-type and non-tir-type genes on the chromosomes (Fig. 3). However, chromosome 3 contributes roughly 89 (44.5 %) of all non-tir-type genes, while chromosome 6 contributes roughly 44 (27.5 %) of all TIR-type genes. Studies in A. thaliana and rice have shown the most NBS-LRR genes are found clustered on certain chromosomes [10, 25]. Holub [40] defined a gene cluster as a chromosome region with four or more genes within 200 kb. Using this window size, we found that 309 NBS- LRR genes reside in 78 gene clusters, and that the average number of NBS-LRR genes in a cluster was The ratio is more than that in A. thaliana (3.21), rice (3.48), maize (2.77), respectively (Table 3). The largest cluster contains 10 NBS-LRR genes located Chromosome 6 and 8, respectively (Table S4). In addition, 181 NBS-LRR singletons were dispersed over all of the chromosomes (Table 3). In this study, we confirmed previously hypothesized genome duplications by BLAST comparison [37]. A total

6 1134 Chin. Sci. Bull. (2014) 59(11): Fig. 2 Phylogenetic comparison of G. max (blue triangle) and M. truncatula (red circle) NBS-LRR genes. Each NBS-LRR predicted orthologous gene was indicated in green color. a Phylogenetic tree base on the whole protein sequence from G. max (blue triangle) and M. truncatula (red circle) CC-NBS-LRR genes. b Phylogenetic tree base on the whole protein sequence from G. max (blue triangle) and M. truncatula (red circle) TIR-NBS-LRR genes of 250 out of the 490 regular NBS-LRR genes with duplication events were detected in 86 gene families in the M. truncatula genome (Table 3). The percentage of NBS-LRR genes in M. truncatula multigene genes (51.0 %; n = 86) and the total number of genes was similar in rice (53.7 %; n = 93), while the number in M. truncatula was greater than A. thaliana (46.6 %; n = 25), maize (37.6 %; n = 17), or sorghum (47.3 %; n = 20). The largest number of genes in a family in M. truncatula was 14 and the average number of family members was Moreover, the average number of genes in each NBS-LRR gene family was 3.24, 3.00, 2.41, and 6.10 in A. thaliana, rice, maize, and sorghum, respectively. In addition, these results suggest that the number of NBS- LRR gene duplications in M. truncatula is similar to those found in rice. 3.5 Digital expression analysis: EST expression profiling Publicly available Expressed Sequence Tags (ESTs) were considered a useful means of studying gene expression profiles (a type of digital northern) [41]. A total of 269,501 M. truncatula EST sequences were downloaded from GenBank. The digital expression profiles of 255 regular NBS-LRR genes were acquired from the EST database at NCBI, which resulted in the assignment of NBS-LRR genes of M. truncatula to eight groups on the basis of tissue and organ types, including shoot, stem, leaf, root, flower, seed, glandular trichome, and multiple tissues. The EST database did not contain sequences for the rest of the 235 NBS-LRR genes (Table S5). EST expression profiling revealed that the maximum number of NBS-LRR genes expressed in multiple tissues was 205 (80.4 %), but only 12 (4.7 %) of the NBS-LRR genes were expressed in seeds. In addition, we discovered that the number of NBS-LRR genes expressed in shoots, stems, leaves, roots, flowers, and glandular trichomes was 34 (13.3 %), 47 (18.4 %), 25 (9.8 %), 77 (30.2 %), 14 (5.5 %), and 23 (9.0 %), respectively. Furthermore, we detected 112 TIR-type, 103 non-tir-type, and 40 NBS/ NBS-LRR genes in the expression database (Table S5). TIR-type genes are more prevalent in the expression data than non-tir-type genes in M. truncatula. 4 Discussion Many aspects of the NBS-LRR disease resistance gene family have been studied extensively and this gene family has been described in monocotyledonous and dicotyledonous plants, including A. thaliana [10, 42], Oryza sativa [25, 43], Vitis vinifera [44], Brassica rapa [11], G. max [45], Sorghum bicolor [46], Brachypodium distachyon [37], Zea mays [34], and Solanum tuberosum [47]. Although the complete genome of M. truncatula had not

7 Chin. Sci. Bull. (2014) 59(11): Fig. 3 Chromosomal locations of M. truncatula NBS-LRR genes. The sizes of chromosome can be estimated using the scale at the top. Boxes above and below each M. truncatula chromosome (chrm; gray bars) designate the approximate locations of each gene. Unmapped NBS-LRR genes are show on the chromosome 0. The arrow on MtChr 5 indicates the TIR-type proteins distributed in non-tir-type clade. Color code: CN (yellow), CNL (orange), N (pink), NL (red), NT (brown), NTL (gray), TN (cyan), TNL (violet), TNT (blue), and TNTL (green) Table 3 Organization of NBS-LRR genes in the five plant genomes Organization Medicago Arabidopsis a Rice a Maize b Sorghum b Single-genes Multi-genes Gene family number Maximal member of a family Average members of a family Percentage of multi-genes Singleton genes Cluster genes Cluster number Maximal members of a cluster Mean members per cluster a Data from Yang et al. [44] Data from Cheng et al. [34]

8 1136 Chin. Sci. Bull. (2014) 59(11): been fully sequenced, approximately 94 % of the genome is assembled. In this study, we identified 617 NBS-LRR genes in the Mt3.5v5 draft genome, which represents 0.99 % of all the predicted proteins. The number of NBS- LRR genes was similar to NBS-LRR genes found in rice (1.00 %) [43] and Populus trichocarpa (1.00 %) [38], while the number of NBS-LRR genes in M. truncatula was higher compared to A. thaliana (0.43 %) [10], B. distachyon (0.77 %) [37], and Carica papaya (0.22 %) [48]. Interestingly, the ratio of NBS-LRR genes in the Mt3.5v5 draft genome was lower than that (1.33 %) of the Mt1.0 draft genome [22], suggesting that the number of NBS- LRR genes would no longer increase with the increase of genome size. In addition, our results show that in M. truncatula there are more non-tir-type genes (241) than TIR-type genes (222), suggesting that the gene duplication of non-tir-type experienced more duplications and more recent duplications than that of TIR-type. This result is similar to that found based on the Mt1.0 genome (152 non-tir-type genes and 122 TIRtype genes) [22]. A previous study analyzed A. thaliana NBS domain regions and estimated a tree that formed two clades [49], corresponding to TIR and non-tir-nbs-lrr genes. However, NBS-LRR genes from rice did not cluster into two clades and generally lacked topological resolution [25]. In this study, the phylogenetic analysis found two distinct clades of 490 NBS-LRR genes. The tree showed multigene genes clustered together (Fig. 1; Table S4), but also recovered clades of a mixture of cluster gene types (Fig. 3, the arrow), suggesting chromosomal rearrangement, to transposition, or to large-scale genomic duplications have occurred in the Medicago lineage. We note that eight TIR-type genes (Medtr5g , Medtr5g , Medtr5g , Medtr5g , Medtr5g , Medtr5g , Medtr5g , and Medtr5g ) were resolved within a clade of non-tir-type genes, suggesting non-tir-type NBS-LRR genes were more conservative than that of the TIR-type. Similarly, the interesting results were also found in grapevine, but the two CNL genes nested in TIR-NBS clade [44]. McHale et al. [7] and Yang et al. [44] confirmed that the CNL group originated before the divergence of the monocots and dicots, while the TNL group appeared more recently in the eudicots, after the divergence of the monocots. Recently, however, a study suggested that TNL genes actually originated earlier than CNL genes [50]. In this study we identified both TNL and CNL genes, and we predicted that the loss of CC domain and fusion of TIR domain could lead to the composition of eight TIR-type genes based on the analysis of phylogentic tree. Therefore, we support an earlier origin of CNL genes by phylogenetic tree. Further study of these eight TIR-type genes may give us additional insights in NBS-LRR evolution. For example, phylogenetic analysis showed that seven TIR-type genes formed a clade, to the exclusion of Medtr5g Chromosomal localization results showed that the seven closely related TIR-type genes are arranged in a cluster on chromosome 5 (Fig. 3, the arrow). This suggests that Medtr5g has a different origin from the other seven TIR-type genes. We attribute above two different conclusions to the fact that the studies had different research objectives. McHale et al. [7] and Yang et al. [44] sampled flowering plants, whereas Yue et al. [50] sampled fungi and non-vascular plants. Although some of the TNL genes were identified in bryophytes through degenerate primer-based PCR and BLAST-based homologous searches, these genes were not obtained as full-length sequences [20, 36]. Therefore, we can not verify the function of TNL genes in bryophytes by Yue et al. s experimental methods [50]. In addition, bryophytes (e.g., P. patens) may be not susceptible to diseases during their relatively short life span, suggesting this may result in rapid divergence of NBS-LRR genes due to natural selection. NBS-LRR contains three main domains, including an N-terminal variable domain, a central nucleotide-binding site (NBS) domain, and a C-terminal leucine-rich repeat (LRR) domain. Depending on whether they also encoded an N-terminal TIR domain, NBS-LRR genes could be further divided into two groups, the TNL group and the CNL group [21]. Previous studies have shown that the TIR, NBS, and LRR domains originated before the split of prokaryotes and eukaryotes [50]. Moreover, some studies have revealed that the TIR domain is more conserved across prokaryotes and eukaryotes than the CC domain [10, 25]. Our results are consistent with that generalization. The M. truncatula LRR regions were highly variable in sequence and size, which we deemed to be evidence that LRR plays an important role in the pathogen recognition process and activation of signal transduction in response to pathogen attack [8, 51, 52]. Although three primary LRR motif patterns were detected in A. thaliana [10], the highly diverse LRR repeats in M. truncatula could not classified. Table S3 demonstrates how difficult it is to identify the motif. According to previous studies, most plant NBS-LRR genes are unevenly distributed and clustered in certain chromosomes [22, 43]. Gene duplication, including tandem and whole-genome duplication (WGD), play a crucial role in increasing gene copy numbers. Nobuta et al. [53] and Yu et al. [54] found that the ratio of NBS-LRR genes in WGD blocks in A. thaliana and O. sativa was 10 % and 0 %, respectively. Yang et al. [44] compared four plants of A. thaliana, O. sativa, P. trichocarpa, and V. vinifera and found that in plant genomes tandem duplication played a primary role in NBS-LRR gene family expansion, but

9 Chin. Sci. Bull. (2014) 59(11): WGD does not. The same conclusions were found in B. distachyon [37]. This study classified 78 gene clusters, including 309 NBS-LRR genes and found that they were randomly distributed on chromosomes 0 8. The results indicted NBS-LRR genes in M. truncatula genome were formed by tandem duplication. We used an EST database to test the expression levels of the regular NBS-LRR from different tissues and organs of M. truncatula. We found approximately 52 % of the regular NBS-LRR genes were expressed in various tissues and organs. The result is similar to that found by Ameline- Torregrosa et al. [22] study. Interestingly, Cheng et al. [34] and Kohler et al. [38] showed that most of the R genes present in a plant genome are not expressed in the transcriptome during a pathogen infection [34, 38]. On the contrary, NBS-LRR genes were only expressed under specific conditions in specific tissues and not necessarily during an infection [55]. More interestingly, some of the pseudogenes have been detected in other plants [33, 43]. Therefore, expression of the NBS-LRR genes of M. truncatula should be confirmed by further experimental transcriptomic studies. Acknowledgments This work was supported by the National Basic Research Program of China (2014CB138702). We thank members of the State Key Laboratory of Grassland Agro-ecosystems for their assistance in this study. References 1. Flor HH (1971) Current status of the gene-for-gene concept. Annu Rev Phytopathol 9: Dangl JL, Jones JD (2001) Plant pathogens and integrated defence responses to infection. Nature 411: Hammond-Kosack KE, Jones JDG (1997) Plant disease resistance genes. Annu Rev Plant Physiol Plant Mol Biol 48: Martin GB, Bogdanove AJ, Sessa G (2003) Understanding the functions of plant disease resistance proteins. Annu Rev Plant Biol 54: Staskawicz BJ, Mudgett MB, Dangl JL et al (2001) Common and contrasting themes of plant and animal diseases. Science 292: Jones JD, Dangl JL (2006) The plant immune system. Nature 444: McHale L, Tan X, Koehl P et al (2006) Plant NBS LRR proteins: adaptable guards. Genome Biol 7: Dodds PN, Rathjen JP (2010) Plant immunity: towards an integrated view of plant pathogen interactions. Nat Rev Genet 11: Gay NJ, Gangloff M (2007) Structure and function of Toll receptors and their ligands. Annu Rev Biochem 76: Meyers BC, Kozik A, Griego A et al (2003) Genome-wide analysis of NBS-LRR-encoding genes in Arabidopsis. Plant Cell 15: Mun JH, Yu HJ, Park S et al (2009) Genome-wide identification of NBS-encoding resistance genes in Brassica rapa. Mol Genet Genomics 282: Tameling WI, Elzinga SD, Darmin PS et al (2002) The tomato R gene products I-2 and MI-1 are functional ATP binding proteins with ATPase activity. Plant Cell 14: Tameling WI, Vossen JH, Albrecht M et al (2006) Mutations in the NB-ARC domain of I-2 that impair ATP hydrolysis cause autoactivation. Plant Physiol 140: Swiderski MR, Birker D, Jones JD (2009) The TIR domain of TIR-NB-LRR resistance proteins is a signaling domain involved in cell death induction. Mol Plant Microbe Interact 22: Krasileva KV, Dahlbeck D, Staskawicz BJ (2010) Activation of an Arabidopsis resistance protein is specified by the in planta association of its leucine-rich repeat domain with the cognate oomycete effector. Plant Cell 22: Maekawa T, Cheng W, Spiridon LN et al (2011) Coiled-coil domain-dependent homodimerization of intracellular barley immune receptors defines a minimal functional module for triggering cell death. Cell Host Microbe 9: Sayar-Turet M, Dreisigacker S, Braun HJ et al (2011) Genetic variation within and between winter wheat genotypes from Turkey, Kazakhstan, and Europe as determined by nucleotide-binding-site profiling. Genome 54: Belkhadir Y, Nimchuk Z, Hubert DA et al (2004) Arabidopsis RIN4 negatively regulates disease resistance mediated by RPS2 and RPM1 downstream or independent of the NDR1 signal modulator and is not required for the virulence functions of bacterial type III effectors AvrRpt2 or AvrRpm1. Plant Cell 16: Young ND, Debellé F, Oldroyd GE et al (2011) The Medicago genome provides insight into the evolution of rhizobial symbioses. Nature 480: Akita M, Valkonen JP (2002) A novel gene family in moss (Physcomitrella patens) shows sequence homology and a phylogenetic relationship with the TIR-NBS class of plant disease resistance genes. J Mol Evol 55: Meyers BC, Dickrman AW, Michelmore RW et al (1999) Plant disease resistance genes encode members of an ancient and diverse protein family within the nucleotide-binding superfamily. Plant J 20: Ameline-Torregrosa C, Wang BB, O Bleness MS et al (2008) Identification and characterization of nucleotide-binding siteleucine-rich repeat genes in the model plant Medicago truncatula. Plant Physiol 146: Branca A, Paape TD, Zhou P et al (2011) Whole-genome nucleotide diversity, recombination, and linkage disequilibrium in the model legume Medicago truncatula. Proc Natl Acad Sci USA 108:E864 E Finn RD, Mistry J, Schuster-Böckler B et al (2006) Pfam:clan, web tools and services. Nucleic Acids Res 34: Zhou T, Wang Y, Chen JQ et al (2004) Genome-wide identification of NBS genes in japonica rice reveals significant expansion of divergent non-tir NBS-LRR genes. Mol Genet Genomics 271: Berger B, Wilson DB, Wolf E et al (1995) Predicting coiled coils by use of pairwise residue correlations. Proc Natl Acad Sci USA 92: Thompson JD, Higgings DG, Gibson TJ (1994) CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res 22: Bailey TL, Elkan C (1995) The value of prior knowledge in discovering motifs with MEME. Proc Int Conf Intell Syst Mol Biol 3: Zhao Y, Li X, Chen W et al (2011) Whole-genome survey and characterization of MADS-box gene family in maize and sorghum. Plant Cell Tiss Organ Cult 105:

10 1138 Chin. Sci. Bull. (2014) 59(11): Letunic I, Doerks T, Bork P (2012) SMART 7: recent updates to the protein domain annotation resource. Nucleic Acids Res 40: Thompson JD, Gibson TJ, Plewniak F et al (1997) The CLUS- TAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Res 25: Tamura K, Dudley J, Nei M et al (2007) MEGA4: molecular evolutionary genetics analysis (MEGA) software version 4.0. Mol Biol Evol 24: Li X, Cheng Y, Ma W et al (2010) Identification and characterization of NBS-encoding disease resistance genes in Lotus japonicus. Plant Syst Evol 289: Cheng Y, Li X, Jiang H et al (2012) Systematic analysis and comparison of nucleotide-binding site disease resistance genes in maize. FEBS J 279: Pan Q, Wendel J, Fluhr R (2000) Divergent evolution of plant NBS-LRR resistance gene homologues in dicot and cereal genomes. J Mol Evol 50: Meyers BC, Morgante M, Michelmore RW (2002) TIR-X and TIR-NBS proteins: two new families related to disease resistance TIR-NBS-LRR proteins encoded in Arabidopsis and other plant genomes. Plant J 32: Tan S, Wu S (2012) Genome wide analysis of nucleotide-binding site disease resistance genes in Brachypodium distachyon. Comp Funct Genomics 2012: Kohler A, Rinaldi C, Duplessis S et al (2008) Genome-wide identification of NBS resistance genes in Populus trichocarpa. Plant Mol Biol 66: Li L, Stoeckert CJ Jr, Roos DS (2003) OrthoMCL: identification of ortholog groups for eukaryotic genomes. Genome Res 13: Holub EB (2001) The arms race is ancient history in Arabidopsis, the wildflower. Nat Rev Genet 2: Ohlrogge J, Benning C (2000) Unraveling plant metabolism by EST analysis. Curr Opin Plant Biol 3: Bai J, Pennill LA, Ning J et al (2002) Diversity in nucleotide binding site-leucine-rich repeat genes in cereals. Genome Res 12: Monosi B, Wisser RJ, Pennill L et al (2004) Full-genome analysis of resistance gene homologues in rice. Theor Appl Genet 109: Yang S, Zhang X, Yue JX et al (2008) Recent duplications dominate NBS-encoding gene expansion in two woody species. Mol Genet Genomics 280: Zhang X, Feng Y, Cheng H et al (2011) Relative evolutionary rates of NBS-encoding genes revealed by soybean segmental duplication. Mol Genet Genomics 285: Luo S, Zhang Y, Hu Q et al (2012) Dynamic nucleotide-binding site and leucine-rich repeat-encoding genes in the grass family. Plant Physiol 159: Lozano R, Ponce O, Ramirez M et al (2012) Genome-wide identification and mapping of NBS-encoding resistance genes in Solanum tuberosum group phureja. PLoS ONE 7:e Porter BW, Paidi M, Ming R et al (2009) Genome-wide analysis of Carica papaya reveals a small NBS resistance gene family. Mol Genet Genomics 281: Richly E, Kurth J, Leister D (2002) Mode of amplification and reorganization of resistance genes during recent Arabidopsis thaliana evolution. Mol Biol Evol 19: Yue JX, Meyers BC, Chen JQ et al (2012) Tracing the origin and evolutionary history of plant nucleotide-binding site-leucine-rich repeat (NBS-LRR) genes. New Phytol 193: Shen QH, Zhou F, Birei S et al (2003) Recognition specificity and RAR1/SGT1 dependence in barley Mla disease resistance genes to the powdery mildew fungus. Plant Cell 15: Rairdan GJ, Moffett P (2006) Distinct domains in the ARC region of the potato resistance protein Rx mediate LRR binding and inhibition of activation. Plant Cell 18: Nobuta K, Ashfield T, Kim S et al (2005) Diversification of non- TIR class NB-LRR genes in relation to whole-genome duplication events in Arabidopsis. Mol Plant Microbe Interact 18: Yu J, Wang J, Lin W et al (2005) The genomes of Oryza sativa:a history of duplications. PLoS Biol 3:e Tan X, Meyers BC, Kozik A et al (2007) Global expression analysis of nucleotide binding site-leucine rich repeat-encoding and related genes in Arabidopsis. BMC Plant Biol 7:56

A genomic analysis of disease-resistance genes encoding nucleotide binding sites in Sorghum bicolor

A genomic analysis of disease-resistance genes encoding nucleotide binding sites in Sorghum bicolor Research Article Genetics and Molecular Biology, 33, 2, 292-297 (2010) Copyright 2010, Sociedade Brasileira de Genética. Printed in Brazil www.sbg.org.br A genomic analysis of disease-resistance genes

More information

Genomewide analysis of NBS-encoding genes in kiwi fruit (Actinidia chinensis)

Genomewide analysis of NBS-encoding genes in kiwi fruit (Actinidia chinensis) c Indian Academy of Sciences RESEARCH NOTE Genomewide analysis of NBS-encoding genes in kiwi fruit (Actinidia chinensis) YINGJUN LI, YAN ZHONG, KAIHUI HUANG and ZONG-MING CHENG College of Horticulture,

More information

Systematic Analysis and Comparison of Nucleotide-Binding Site Disease Resistance Genes in a Diploid Cotton Gossypium raimondii

Systematic Analysis and Comparison of Nucleotide-Binding Site Disease Resistance Genes in a Diploid Cotton Gossypium raimondii Systematic Analysis and Comparison of Nucleotide-Binding Site Disease Resistance Genes in a Diploid Cotton Gossypium raimondii Hengling Wei 1,2, Wei Li 1, Xiwei Sun 1, Shuijin Zhu 1 *, Jun Zhu 1 * 1 Key

More information

Genome-Wide Analysis of NBS-LRR Encoding Genes in Arabidopsis

Genome-Wide Analysis of NBS-LRR Encoding Genes in Arabidopsis The Plant Cell, Vol. 15, 809 834, April 2003, www.plantcell.org 2003 American Society of Plant Biologists GENOMICS ARTICLE Genome-Wide Analysis of NBS-LRR Encoding Genes in Arabidopsis Blake C. Meyers,

More information

In silico analysis of the NBS protein family in Ectocarpus siliculosus

In silico analysis of the NBS protein family in Ectocarpus siliculosus Indian Journal of Biotechnology Vol 12, January 2013, pp 98-102 In silico analysis of the NBS protein family in Ectocarpus siliculosus Niaz Mahmood 1 * and Mahdi Muhammad Moosa 1,2 1 Molecular Biology

More information

Potato Genome Analysis

Potato Genome Analysis Potato Genome Analysis Xin Liu Deputy director BGI research 2016.1.21 WCRTC 2016 @ Nanning Reference genome construction???????????????????????????????????????? Sequencing HELL RIEND WELCOME BGI ZHEN LLOFRI

More information

Research Article Genome Wide Analysis of Nucleotide-Binding Site Disease Resistance Genes in Brachypodium distachyon

Research Article Genome Wide Analysis of Nucleotide-Binding Site Disease Resistance Genes in Brachypodium distachyon Comparative and Functional Genomics Volume, Article ID, pages doi:.// Research Article Genome Wide Analysis of Nucleotide-Binding Site Disease Resistance Genes in Brachypodium distachyon Shenglong Tan,

More information

Resistance gene analogues of Arabidopsis thaliana: recognition by structure

Resistance gene analogues of Arabidopsis thaliana: recognition by structure J. Appl. Genet. 44(3), 2003, pp. 311-321 Resistance gene analogues of Arabidopsis thaliana: recognition by structure Jerzy CHE KOWSKI, Grzegorz KOCZYK Institute of Plant Genetics, Polish Academy of Sciences,

More information

Supplementary Information for: The genome of the extremophile crucifer Thellungiella parvula

Supplementary Information for: The genome of the extremophile crucifer Thellungiella parvula Supplementary Information for: The genome of the extremophile crucifer Thellungiella parvula Maheshi Dassanayake 1,9, Dong-Ha Oh 1,9, Jeffrey S. Haas 1,2, Alvaro Hernandez 3, Hyewon Hong 1,4, Shahjahan

More information

Supplemental Data. Perea-Resa et al. Plant Cell. (2012) /tpc

Supplemental Data. Perea-Resa et al. Plant Cell. (2012) /tpc Supplemental Data. Perea-Resa et al. Plant Cell. (22)..5/tpc.2.3697 Sm Sm2 Supplemental Figure. Sequence alignment of Arabidopsis LSM proteins. Alignment of the eleven Arabidopsis LSM proteins. Sm and

More information

Host-Pathogen interaction-ii. Pl Path 604 PN Sharma Department of Plant Pathology CSK HPKV, Palampur

Host-Pathogen interaction-ii. Pl Path 604 PN Sharma Department of Plant Pathology CSK HPKV, Palampur Host-Pathogen interaction-ii Pl Path 604 PN Sharma Department of Plant Pathology CSK HPKV, Palampur-176062 It was originally believed that gene-for-gene resistance was conferred by a direct interaction

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION Supplementary information S1 (box). Supplementary Methods description. Prokaryotic Genome Database Archaeal and bacterial genome sequences were downloaded from the NCBI FTP site (ftp://ftp.ncbi.nlm.nih.gov/genomes/all/)

More information

Supplementary Figure 3

Supplementary Figure 3 Supplementary Figure 3 7.0 Col Kas-1 Line FTH1A 8.4 F3PII3 8.9 F26H11 ATQ1 T9I22 PLS8 F26B6-B 9.6 F27L4 9.81 F27D4 9.92 9.96 10.12 10.14 10.2 11.1 0.5 Mb T1D16 Col % RGR 83.3 101 227 93.5 75.9 132 90 375

More information

Small RNA in rice genome

Small RNA in rice genome Vol. 45 No. 5 SCIENCE IN CHINA (Series C) October 2002 Small RNA in rice genome WANG Kai ( 1, ZHU Xiaopeng ( 2, ZHONG Lan ( 1,3 & CHEN Runsheng ( 1,2 1. Beijing Genomics Institute/Center of Genomics and

More information

Nature Genetics: doi: /ng Supplementary Figure 1. Icm/Dot secretion system region I in 41 Legionella species.

Nature Genetics: doi: /ng Supplementary Figure 1. Icm/Dot secretion system region I in 41 Legionella species. Supplementary Figure 1 Icm/Dot secretion system region I in 41 Legionella species. Homologs of the effector-coding gene lega15 (orange) were found within Icm/Dot region I in 13 Legionella species. In four

More information

Paleo-evolutionary plasticity of plant disease resistance genes

Paleo-evolutionary plasticity of plant disease resistance genes Zhang et al. BMC Genomics 2014, 15:187 RESEARCH ARTICLE Paleo-evolutionary plasticity of plant disease resistance genes Rongzhi Zhang 1,2, Florent Murat 1, Caroline Pont 1, Thierry Langin 1 and Jerome

More information

Bioinformatics tools for phylogeny and visualization. Yanbin Yin

Bioinformatics tools for phylogeny and visualization. Yanbin Yin Bioinformatics tools for phylogeny and visualization Yanbin Yin 1 Homework assignment 5 1. Take the MAFFT alignment http://cys.bios.niu.edu/yyin/teach/pbb/purdue.cellwall.list.lignin.f a.aln as input and

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION Supplementary information S3 (box) Methods Methods Genome weighting The currently available collection of archaeal and bacterial genomes has a highly biased distribution of isolates across taxa. For example,

More information

Supplemental Data. Yang et al. (2012). Plant Cell /tpc

Supplemental Data. Yang et al. (2012). Plant Cell /tpc Supplemental Figure 1. Mature flowers of P. heterotricha. (A) An inflorescence of P. heterotricha showing the front view of a zygomorphic flower characterized by two small dorsal petals and only two fertile

More information

Plant Nucleotide Binding Site Leucine-Rich Repeat (NBS-LRR) Genes: Active Guardians in Host Defense Responses

Plant Nucleotide Binding Site Leucine-Rich Repeat (NBS-LRR) Genes: Active Guardians in Host Defense Responses Int. J. Mol. Sci. 2013, 14, 7302-7326; doi:10.3390/ijms14047302 Review OPEN ACCESS International Journal of Molecular Sciences ISSN 1422-0067 www.mdpi.com/journal/ijms Plant Nucleotide Binding Site Leucine-Rich

More information

Genome-wide Identification of Lineage Specific Genes in Arabidopsis, Oryza and Populus

Genome-wide Identification of Lineage Specific Genes in Arabidopsis, Oryza and Populus Genome-wide Identification of Lineage Specific Genes in Arabidopsis, Oryza and Populus Xiaohan Yang Sara Jawdy Timothy Tschaplinski Gerald Tuskan Environmental Sciences Division Oak Ridge National Laboratory

More information

Computational approaches for functional genomics

Computational approaches for functional genomics Computational approaches for functional genomics Kalin Vetsigian October 31, 2001 The rapidly increasing number of completely sequenced genomes have stimulated the development of new methods for finding

More information

Genome-wide analysis of the MYB transcription factor superfamily in soybean

Genome-wide analysis of the MYB transcription factor superfamily in soybean Du et al. BMC Plant Biology 2012, 12:106 RESEARCH ARTICLE Open Access Genome-wide analysis of the MYB transcription factor superfamily in soybean Hai Du 1,2,3, Si-Si Yang 1,2, Zhe Liang 4, Bo-Run Feng

More information

8/23/2014. Phylogeny and the Tree of Life

8/23/2014. Phylogeny and the Tree of Life Phylogeny and the Tree of Life Chapter 26 Objectives Explain the following characteristics of the Linnaean system of classification: a. binomial nomenclature b. hierarchical classification List the major

More information

TNL genes in peach: insights into the post- LRR domain

TNL genes in peach: insights into the post- LRR domain Van Ghelder and Esmenjaud BMC Genomics (2016) 17:317 DOI 10.1186/s12864-016-2635-0 RESEARCH ARTICLE TNL genes in peach: insights into the post- LRR domain Cyril Van Ghelder 1,2,3* and Daniel Esmenjaud

More information

Origin and diversification of leucine-rich repeat receptor-like protein kinase (LRR-RLK) genes in plants

Origin and diversification of leucine-rich repeat receptor-like protein kinase (LRR-RLK) genes in plants Liu et al. BMC Evolutionary Biology (2017) 17:47 DOI 10.1186/s12862-017-0891-5 RESEARCH ARTICLE Origin and diversification of leucine-rich repeat receptor-like protein kinase (LRR-RLK) genes in plants

More information

MiGA: The Microbial Genome Atlas

MiGA: The Microbial Genome Atlas December 12 th 2017 MiGA: The Microbial Genome Atlas Jim Cole Center for Microbial Ecology Dept. of Plant, Soil & Microbial Sciences Michigan State University East Lansing, Michigan U.S.A. Where I m From

More information

Tracing the origin and evolutionary history of plant nucleotide-binding site leucine-rich repeat (NBS-LRR) genes

Tracing the origin and evolutionary history of plant nucleotide-binding site leucine-rich repeat (NBS-LRR) genes Research Tracing the origin and evolutionary history of plant nucleotide-binding site leucine-rich repeat (NBS-LRR) genes Jia-Xing Yue 1,2, Blake C. Meyers 3, Jian-Qun Chen 1, Dacheng Tian 1 and Sihai

More information

Comparative genomics: Overview & Tools + MUMmer algorithm

Comparative genomics: Overview & Tools + MUMmer algorithm Comparative genomics: Overview & Tools + MUMmer algorithm Urmila Kulkarni-Kale Bioinformatics Centre University of Pune, Pune 411 007. urmila@bioinfo.ernet.in Genome sequence: Fact file 1995: The first

More information

Bioinformatics tools to analyze complex genomes. Yves Van de Peer Ghent University/VIB

Bioinformatics tools to analyze complex genomes. Yves Van de Peer Ghent University/VIB Bioinformatics tools to analyze complex genomes Yves Van de Peer Ghent University/VIB Detecting colinearity and large-scale gene duplications A 1 2 3 4 5 6 7 8 9 10 11 Speciation/Duplicatio n S1 S2 1

More information

Bio 1B Lecture Outline (please print and bring along) Fall, 2007

Bio 1B Lecture Outline (please print and bring along) Fall, 2007 Bio 1B Lecture Outline (please print and bring along) Fall, 2007 B.D. Mishler, Dept. of Integrative Biology 2-6810, bmishler@berkeley.edu Evolution lecture #5 -- Molecular genetics and molecular evolution

More information

Supplemental Figure 1. Comparison of Tiller Bud Formation between the Wild Type and d27. (A) and (B) Longitudinal sections of shoot apex in wild-type

Supplemental Figure 1. Comparison of Tiller Bud Formation between the Wild Type and d27. (A) and (B) Longitudinal sections of shoot apex in wild-type A B 2 3 3 2 1 1 Supplemental Figure 1. Comparison of Tiller Bud Formation between the Wild Type and d27. (A) and (B) Longitudinal sections of shoot apex in wild-type (A) and d27 (B) seedlings at the four

More information

Figure S1: Mitochondrial gene map for Pythium ultimum BR144. Arrows indicate transcriptional orientation, clockwise for the outer row and

Figure S1: Mitochondrial gene map for Pythium ultimum BR144. Arrows indicate transcriptional orientation, clockwise for the outer row and Figure S1: Mitochondrial gene map for Pythium ultimum BR144. Arrows indicate transcriptional orientation, clockwise for the outer row and counterclockwise for the inner row, with green representing coding

More information

Sequence Alignment Techniques and Their Uses

Sequence Alignment Techniques and Their Uses Sequence Alignment Techniques and Their Uses Sarah Fiorentino Since rapid sequencing technology and whole genomes sequencing, the amount of sequence information has grown exponentially. With all of this

More information

08/21/2017 BLAST. Multiple Sequence Alignments: Clustal Omega

08/21/2017 BLAST. Multiple Sequence Alignments: Clustal Omega BLAST Multiple Sequence Alignments: Clustal Omega What does basic BLAST do (e.g. what is input sequence and how does BLAST look for matches?) Susan Parrish McDaniel College Multiple Sequence Alignments

More information

Genomes and Their Evolution

Genomes and Their Evolution Chapter 21 Genomes and Their Evolution PowerPoint Lecture Presentations for Biology Eighth Edition Neil Campbell and Jane Reece Lectures by Chris Romero, updated by Erin Barley with contributions from

More information

Genome-wide analysis of immunophilin FKBP genes and expression patterns in Zea mays

Genome-wide analysis of immunophilin FKBP genes and expression patterns in Zea mays Genome-wide analysis of immunophilin FKBP genes and expression patterns in Zea mays W.-W. Wang 1,2, Q. Ma 2, Y. Xiang 2, S.-W. Zhu 2 and B.-J. Cheng 2 1 School of Chemistry and Life Sciences, Suzhou University,

More information

Dr. Amira A. AL-Hosary

Dr. Amira A. AL-Hosary Phylogenetic analysis Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut University-Egypt Phylogenetic Basics: Biological

More information

PAMP-triggered immunity (PTI)

PAMP-triggered immunity (PTI) PAMP-triggered immunity (PTI) PAMP-triggered immunity (PTI) Recognition of danger signals - Distinguish self or damaged self versus non-self fundamental to any immune system - PAMP or MAMP pathogen/microbe-associated

More information

Genômica comparativa. João Carlos Setubal IQ-USP outubro /5/2012 J. C. Setubal

Genômica comparativa. João Carlos Setubal IQ-USP outubro /5/2012 J. C. Setubal Genômica comparativa João Carlos Setubal IQ-USP outubro 2012 11/5/2012 J. C. Setubal 1 Comparative genomics There are currently (out/2012) 2,230 completed sequenced microbial genomes publicly available

More information

Supporting Information

Supporting Information Supporting Information Das et al. 10.1073/pnas.1302500110 < SP >< LRRNT > < LRR1 > < LRRV1 > < LRRV2 Pm-VLRC M G F V V A L L V L G A W C G S C S A Q - R Q R A C V E A G K S D V C I C S S A T D S S P E

More information

GENOME-WIDE IDENTIFICATION OF DISEASE RESISTANCE GENES IN AEGILOPS TAUSCHII COSS. (POACEAE)

GENOME-WIDE IDENTIFICATION OF DISEASE RESISTANCE GENES IN AEGILOPS TAUSCHII COSS. (POACEAE) Proceedings of the South Dakota Academy of Science, Vol. 94 (2015) 281 GENOME-WIDE IDENTIFICATION OF DISEASE RESISTANCE GENES IN AEGILOPS TAUSCHII COSS. (POACEAE) Ethan J. Andersen, Samantha R. Shaw, and

More information

Bioinformatics. Dept. of Computational Biology & Bioinformatics

Bioinformatics. Dept. of Computational Biology & Bioinformatics Bioinformatics Dept. of Computational Biology & Bioinformatics 3 Bioinformatics - play with sequences & structures Dept. of Computational Biology & Bioinformatics 4 ORGANIZATION OF LIFE ROLE OF BIOINFORMATICS

More information

Statistical Machine Learning Methods for Bioinformatics II. Hidden Markov Model for Biological Sequences

Statistical Machine Learning Methods for Bioinformatics II. Hidden Markov Model for Biological Sequences Statistical Machine Learning Methods for Bioinformatics II. Hidden Markov Model for Biological Sequences Jianlin Cheng, PhD Department of Computer Science University of Missouri 2008 Free for Academic

More information

Basic Local Alignment Search Tool

Basic Local Alignment Search Tool Basic Local Alignment Search Tool Alignments used to uncover homologies between sequences combined with phylogenetic studies o can determine orthologous and paralogous relationships Local Alignment uses

More information

Research Article Identification of Immune Related LRR-Containing Genes in Maize (Zea mays L.) by Genome-Wide Sequence Analysis

Research Article Identification of Immune Related LRR-Containing Genes in Maize (Zea mays L.) by Genome-Wide Sequence Analysis International Journal of Genomics Volume 2015, Article ID 231358, 11 pages http://dx.doi.org/10.1155/2015/231358 Research Article Identification of Immune Related -Containing Genes in Maize (Zea mays L.)

More information

CSCE555 Bioinformatics. Protein Function Annotation

CSCE555 Bioinformatics. Protein Function Annotation CSCE555 Bioinformatics Protein Function Annotation Why we need to do function annotation? Fig from: Network-based prediction of protein function. Molecular Systems Biology 3:88. 2007 What s function? The

More information

Overview of tomato (Solanum lycopersicum) candidate pathogen recognition genes reveals important Solanum R locus dynamics

Overview of tomato (Solanum lycopersicum) candidate pathogen recognition genes reveals important Solanum R locus dynamics Research Overview of tomato (Solanum lycopersicum) candidate pathogen recognition genes reveals important Solanum R locus dynamics G. Andolfo 1 *, W. Sanseverino 1 *, S. Rombauts 2, Y. Van de Peer 2, J.

More information

Nature Genetics: doi: /ng Supplementary Figure 1. The phenotypes of PI , BR121, and Harosoy under short-day conditions.

Nature Genetics: doi: /ng Supplementary Figure 1. The phenotypes of PI , BR121, and Harosoy under short-day conditions. Supplementary Figure 1 The phenotypes of PI 159925, BR121, and Harosoy under short-day conditions. (a) Plant height. (b) Number of branches. (c) Average internode length. (d) Number of nodes. (e) Pods

More information

Cladistics and Bioinformatics Questions 2013

Cladistics and Bioinformatics Questions 2013 AP Biology Name Cladistics and Bioinformatics Questions 2013 1. The following table shows the percentage similarity in sequences of nucleotides from a homologous gene derived from five different species

More information

Host_microbe_PPI - R package to analyse intra-species and interspecies protein-protein interactions in the model plant Arabidopsis thaliana

Host_microbe_PPI - R package to analyse intra-species and interspecies protein-protein interactions in the model plant Arabidopsis thaliana Host_microbe_PPI - R package to analyse intra-species and interspecies protein-protein interactions in the model plant Arabidopsis thaliana Thomas Nussbaumer 1,2 1 Institute of Network Biology (INET),

More information

Case Study. Who s the daddy? TEACHER S GUIDE. James Clarkson. Dean Madden [Ed.] Polyploidy in plant evolution. Version 1.1. Royal Botanic Gardens, Kew

Case Study. Who s the daddy? TEACHER S GUIDE. James Clarkson. Dean Madden [Ed.] Polyploidy in plant evolution. Version 1.1. Royal Botanic Gardens, Kew TEACHER S GUIDE Case Study Who s the daddy? Polyploidy in plant evolution James Clarkson Royal Botanic Gardens, Kew Dean Madden [Ed.] NCBE, University of Reading Version 1.1 Polypoidy in plant evolution

More information

InDel 3-5. InDel 8-9. InDel 3-5. InDel 8-9. InDel InDel 8-9

InDel 3-5. InDel 8-9. InDel 3-5. InDel 8-9. InDel InDel 8-9 Lecture 5 Alignment I. Introduction. For sequence data, the process of generating an alignment establishes positional homologies; that is, alignment provides the identification of homologous phylogenetic

More information

METHODS FOR DETERMINING PHYLOGENY. In Chapter 11, we discovered that classifying organisms into groups was, and still is, a difficult task.

METHODS FOR DETERMINING PHYLOGENY. In Chapter 11, we discovered that classifying organisms into groups was, and still is, a difficult task. Chapter 12 (Strikberger) Molecular Phylogenies and Evolution METHODS FOR DETERMINING PHYLOGENY In Chapter 11, we discovered that classifying organisms into groups was, and still is, a difficult task. Modern

More information

Comparative Genomics II

Comparative Genomics II Comparative Genomics II Advances in Bioinformatics and Genomics GEN 240B Jason Stajich May 19 Comparative Genomics II Slide 1/31 Outline Introduction Gene Families Pairwise Methods Phylogenetic Methods

More information

The Plant Cell, November. 2017, American Society of Plant Biologists. All rights reserved

The Plant Cell, November. 2017, American Society of Plant Biologists. All rights reserved The Genetics of Floral Development Teaching Guide Overview The development of flowers in angiosperm plants provided a critical evolutionary advantage, allowing more options for pollen dispersal and seed

More information

Comprehensive Analysis of NAC Domain Transcription Factor Gene Family in Populus trichocarpa

Comprehensive Analysis of NAC Domain Transcription Factor Gene Family in Populus trichocarpa RESEARCH ARTICLE Open Access Comprehensive Analysis of NAC Domain Transcription Factor Gene Family in Populus trichocarpa Ruibo Hu 1, Guang Qi 1, Yingzhen Kong 1,2, Dejing Kong 1, Qian Gao 1, Gongke Zhou

More information

Phylogeny and systematics. Why are these disciplines important in evolutionary biology and how are they related to each other?

Phylogeny and systematics. Why are these disciplines important in evolutionary biology and how are they related to each other? Phylogeny and systematics Why are these disciplines important in evolutionary biology and how are they related to each other? Phylogeny and systematics Phylogeny: the evolutionary history of a species

More information

Sequence analysis and comparison

Sequence analysis and comparison The aim with sequence identification: Sequence analysis and comparison Marjolein Thunnissen Lund September 2012 Is there any known protein sequence that is homologous to mine? Are there any other species

More information

Chapter 5 Evolution of Biodiversity. Sunday, October 1, 17

Chapter 5 Evolution of Biodiversity. Sunday, October 1, 17 Chapter 5 Evolution of Biodiversity CHAPTER INTRO: The Dung of the Devil Read and Answer Questions Provided Module 14 The Biodiversity of Earth After reading this module you should be able to understand

More information

Host-Pathogen Interaction. PN Sharma Department of Plant Pathology CSK HPKV, Palampur

Host-Pathogen Interaction. PN Sharma Department of Plant Pathology CSK HPKV, Palampur Host-Pathogen Interaction PN Sharma Department of Plant Pathology CSK HPKV, Palampur-176062 PATHOGEN DEFENCE IN PLANTS A BIOLOGICAL AND MOLECULAR VIEW Two types of plant resistance response to potential

More information

MOST of the disease resistance (R) genes in plants

MOST of the disease resistance (R) genes in plants Copyright Ó 2006 by the Genetics Society of America DOI: 10.1534/genetics.105.047290 Unique Evolutionary Mechanism in R-Genes Under the Presence/Absence Polymorphism in Arabidopsis thaliana Jingdan Shen,*,1

More information

Supplemental Table 1. Primers used for cloning and PCR amplification in this study

Supplemental Table 1. Primers used for cloning and PCR amplification in this study Supplemental Table 1. Primers used for cloning and PCR amplification in this study Target Gene Primer sequence NATA1 (At2g393) forward GGG GAC AAG TTT GTA CAA AAA AGC AGG CTT CAT GGC GCC TCC AAC CGC AGC

More information

Introduction to Bioinformatics Integrated Science, 11/9/05

Introduction to Bioinformatics Integrated Science, 11/9/05 1 Introduction to Bioinformatics Integrated Science, 11/9/05 Morris Levy Biological Sciences Research: Evolutionary Ecology, Plant- Fungal Pathogen Interactions Coordinator: BIOL 495S/CS490B/STAT490B Introduction

More information

P LANT P ATHOLOGY REVIEW Evolutionary Dynamics of Plant R-Genes. Joy Bergelson,* Martin Kreitman, Eli A. Stahl, Dacheng Tian

P LANT P ATHOLOGY REVIEW Evolutionary Dynamics of Plant R-Genes. Joy Bergelson,* Martin Kreitman, Eli A. Stahl, Dacheng Tian P LANT REVIEW Evolutionary Dynamics of Plant R-Genes Joy Bergelson,* Martin Kreitman, Eli A. Stahl, Dacheng Tian Plant R-genes involved in gene-for-gene interactions with pathogens are expected to undergo

More information

Phylogenetic relationship among S. castellii, S. cerevisiae and C. glabrata.

Phylogenetic relationship among S. castellii, S. cerevisiae and C. glabrata. Supplementary Note S2 Phylogenetic relationship among S. castellii, S. cerevisiae and C. glabrata. Phylogenetic trees reconstructed by a variety of methods from either single-copy orthologous loci (Class

More information

Microbial Taxonomy and the Evolution of Diversity

Microbial Taxonomy and the Evolution of Diversity 19 Microbial Taxonomy and the Evolution of Diversity Copyright McGraw-Hill Global Education Holdings, LLC. Permission required for reproduction or display. 1 Taxonomy Introduction to Microbial Taxonomy

More information

USDA-DOE Plant Feedstock Genomics for Bioenergy

USDA-DOE Plant Feedstock Genomics for Bioenergy USDA-DOE Plant Feedstock Genomics for Bioenergy BERAC Thursday, June 7, 2012 Cathy Ronning, DOE-BER Ed Kaleikau, USDA-NIFA Plant Feedstock Genomics for Bioenergy Joint competitive grants program initiated

More information

Investigation 3: Comparing DNA Sequences to Understand Evolutionary Relationships with BLAST

Investigation 3: Comparing DNA Sequences to Understand Evolutionary Relationships with BLAST Investigation 3: Comparing DNA Sequences to Understand Evolutionary Relationships with BLAST Introduction Bioinformatics is a powerful tool which can be used to determine evolutionary relationships and

More information

The Phylogenetic Reconstruction of the Grass Family (Poaceae) Using matk Gene Sequences

The Phylogenetic Reconstruction of the Grass Family (Poaceae) Using matk Gene Sequences The Phylogenetic Reconstruction of the Grass Family (Poaceae) Using matk Gene Sequences by Hongping Liang Dissertation submitted to the Faculty of the Virginia Polytechnic Institute and State University

More information

Tools and Algorithms in Bioinformatics

Tools and Algorithms in Bioinformatics Tools and Algorithms in Bioinformatics GCBA815, Fall 2015 Week-4 BLAST Algorithm Continued Multiple Sequence Alignment Babu Guda, Ph.D. Department of Genetics, Cell Biology & Anatomy Bioinformatics and

More information

Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut

Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut University-Egypt Phylogenetic analysis Phylogenetic Basics: Biological

More information

Drosophila melanogaster and D. simulans, two fruit fly species that are nearly

Drosophila melanogaster and D. simulans, two fruit fly species that are nearly Comparative Genomics: Human versus chimpanzee 1. Introduction The chimpanzee is the closest living relative to humans. The two species are nearly identical in DNA sequence (>98% identity), yet vastly different

More information

Prediction and Classif ication of Human G-protein Coupled Receptors Based on Support Vector Machines

Prediction and Classif ication of Human G-protein Coupled Receptors Based on Support Vector Machines Article Prediction and Classif ication of Human G-protein Coupled Receptors Based on Support Vector Machines Yun-Fei Wang, Huan Chen, and Yan-Hong Zhou* Hubei Bioinformatics and Molecular Imaging Key Laboratory,

More information

Rapid speciation following recent host shift in the plant pathogenic fungus Rhynchosporium

Rapid speciation following recent host shift in the plant pathogenic fungus Rhynchosporium Rapid speciation following recent host shift in the plant pathogenic fungus Rhynchosporium Tiziana Vonlanthen, Laurin Müller 27.10.15 1 Second paper: Origin and Domestication of the Fungal Wheat Pathogen

More information

Lineage specific conserved noncoding sequences in plants

Lineage specific conserved noncoding sequences in plants Lineage specific conserved noncoding sequences in plants Nilmini Hettiarachchi Department of Genetics, SOKENDAI National Institute of Genetics, Mishima, Japan 20 th June 2014 Conserved Noncoding Sequences

More information

Stage 1: Karyotype Stage 2: Gene content & order Step 3

Stage 1: Karyotype Stage 2: Gene content & order Step 3 Supplementary Figure Method used for ancestral genome reconstruction. MRCA (Most Recent Common Ancestor), AMK (Ancestral Monocot Karyotype), AEK (Ancestral Eudicot Karyotype), AGK (Ancestral Grass Karyotype)

More information

Supplementary text for the section Interactions conserved across species: can one select the conserved interactions?

Supplementary text for the section Interactions conserved across species: can one select the conserved interactions? 1 Supporting Information: What Evidence is There for the Homology of Protein-Protein Interactions? Anna C. F. Lewis, Nick S. Jones, Mason A. Porter, Charlotte M. Deane Supplementary text for the section

More information

BIOINFORMATICS LAB AP BIOLOGY

BIOINFORMATICS LAB AP BIOLOGY BIOINFORMATICS LAB AP BIOLOGY Bioinformatics is the science of collecting and analyzing complex biological data. Bioinformatics combines computer science, statistics and biology to allow scientists to

More information

BLAST. Varieties of BLAST

BLAST. Varieties of BLAST BLAST Basic Local Alignment Search Tool (1990) Altschul, Gish, Miller, Myers, & Lipman Uses short-cuts or heuristics to improve search speed Like speed-reading, does not examine every nucleotide of database

More information

O 3 O 4 O 5. q 3. q 4. Transition

O 3 O 4 O 5. q 3. q 4. Transition Hidden Markov Models Hidden Markov models (HMM) were developed in the early part of the 1970 s and at that time mostly applied in the area of computerized speech recognition. They are first described in

More information

Frequently Asked Questions (FAQs)

Frequently Asked Questions (FAQs) Frequently Asked Questions (FAQs) Q1. What is meant by Satellite and Repetitive DNA? Ans: Satellite and repetitive DNA generally refers to DNA whose base sequence is repeated many times throughout the

More information

Non-host resistance to wheat stem rust in Brachypodium species

Non-host resistance to wheat stem rust in Brachypodium species Non-host resistance to wheat stem rust in Brachypodium species Dr. Melania Figueroa Assistant Professor Department of Plant Pathology and Stakman-Borlaug Center for Sustainable Plant Health University

More information

Bioinformatics Exercises

Bioinformatics Exercises Bioinformatics Exercises AP Biology Teachers Workshop Susan Cates, Ph.D. Evolution of Species Phylogenetic Trees show the relatedness of organisms Common Ancestor (Root of the tree) 1 Rooted vs. Unrooted

More information

Chapter 5. Proteomics and the analysis of protein sequence Ⅱ

Chapter 5. Proteomics and the analysis of protein sequence Ⅱ Proteomics Chapter 5. Proteomics and the analysis of protein sequence Ⅱ 1 Pairwise similarity searching (1) Figure 5.5: manual alignment One of the amino acids in the top sequence has no equivalent and

More information

Unit G: Pest Management. Lesson 2: Managing Crop Diseases

Unit G: Pest Management. Lesson 2: Managing Crop Diseases Unit G: Pest Management Lesson 2: Managing Crop Diseases 1 Terms Abiotic disease Bacteria Biotic disease Cultural disease control Disease avoidance Disease resistance Disease tolerance Fungi Infectious

More information

PGA: A Program for Genome Annotation by Comparative Analysis of. Maximum Likelihood Phylogenies of Genes and Species

PGA: A Program for Genome Annotation by Comparative Analysis of. Maximum Likelihood Phylogenies of Genes and Species PGA: A Program for Genome Annotation by Comparative Analysis of Maximum Likelihood Phylogenies of Genes and Species Paulo Bandiera-Paiva 1 and Marcelo R.S. Briones 2 1 Departmento de Informática em Saúde

More information

Hands-On Nine The PAX6 Gene and Protein

Hands-On Nine The PAX6 Gene and Protein Hands-On Nine The PAX6 Gene and Protein Main Purpose of Hands-On Activity: Using bioinformatics tools to examine the sequences, homology, and disease relevance of the Pax6: a master gene of eye formation.

More information

MOLECULAR PHYLOGENY AND GENETIC DIVERSITY ANALYSIS. Masatoshi Nei"

MOLECULAR PHYLOGENY AND GENETIC DIVERSITY ANALYSIS. Masatoshi Nei MOLECULAR PHYLOGENY AND GENETIC DIVERSITY ANALYSIS Masatoshi Nei" Abstract: Phylogenetic trees: Recent advances in statistical methods for phylogenetic reconstruction and genetic diversity analysis were

More information

Evaluation of Genome Sequencing Quality in Selected Plant Species Using Expressed Sequence Tags

Evaluation of Genome Sequencing Quality in Selected Plant Species Using Expressed Sequence Tags Evaluation of Genome Sequencing Quality in Selected Plant Species Using Expressed Sequence Tags Lingfei Shangguan 1, Jian Han 1, Emrul Kayesh 1, Xin Sun 1, Changqing Zhang 2, Tariq Pervaiz 1, Xicheng Wen

More information

Genetic Variation: The genetic substrate for natural selection. Horizontal Gene Transfer. General Principles 10/2/17.

Genetic Variation: The genetic substrate for natural selection. Horizontal Gene Transfer. General Principles 10/2/17. Genetic Variation: The genetic substrate for natural selection What about organisms that do not have sexual reproduction? Horizontal Gene Transfer Dr. Carol E. Lee, University of Wisconsin In prokaryotes:

More information

1 Abstract. 2 Introduction. 3 Requirements. 4 Procedure

1 Abstract. 2 Introduction. 3 Requirements. 4 Procedure 1 Abstract None 2 Introduction The archaeal core set is used in testing the completeness of the archaeal draft genomes. The core set comprises of conserved single copy genes from 25 genomes. Coverage statistic

More information

The genetic architecture of resistance Nevin Dale Young

The genetic architecture of resistance Nevin Dale Young 285 The genetic architecture of resistance Nevin Dale Young Plant resistance genes (R genes), especially the nucleotide binding site leucine-rich repeat (NBS-LRR) family of sequences, have been extensively

More information

Impact of recurrent gene duplication on adaptation of plant genomes

Impact of recurrent gene duplication on adaptation of plant genomes Fischer et al. BMC Plant Biology 2014, 14:151 RESEARCH ARTICLE Open Access Impact of recurrent gene duplication on adaptation of plant genomes Iris Fischer 1,2*, Jacques Dainat 3,6, Vincent Ranwez 3, Sylvain

More information

Origins of Life. Fundamental Properties of Life. Conditions on Early Earth. Evolution of Cells. The Tree of Life

Origins of Life. Fundamental Properties of Life. Conditions on Early Earth. Evolution of Cells. The Tree of Life The Tree of Life Chapter 26 Origins of Life The Earth formed as a hot mass of molten rock about 4.5 billion years ago (BYA) -As it cooled, chemically-rich oceans were formed from water condensation Life

More information

Prospecting for Green Revolution Genes

Prospecting for Green Revolution Genes Learning Objectives: Prospecting for Green Revolution Genes 1) Discover how changes in individual genes produce phenotypic change 2) Learn to apply bioinformatics tools to identify groups of related genes

More information

Supplementary Information

Supplementary Information Supplementary Information Rice APC/C TE controls tillering through mediating the degradation of MONOCULM 1 Qibing Lin 1*, Dan Wang 1*, Hui Dong 2*, Suhai Gu 1, Zhijun Cheng 1, Jie Gong 2, Ruizhen Qin 1,

More information

17 Non-collinear alignment Motivation A B C A B C A B C A B C D A C. This exposition is based on:

17 Non-collinear alignment Motivation A B C A B C A B C A B C D A C. This exposition is based on: 17 Non-collinear alignment This exposition is based on: 1. Darling, A.E., Mau, B., Perna, N.T. (2010) progressivemauve: multiple genome alignment with gene gain, loss and rearrangement. PLoS One 5(6):e11147.

More information

MATHEMATICAL MODELS - Vol. III - Mathematical Modeling and the Human Genome - Hilary S. Booth MATHEMATICAL MODELING AND THE HUMAN GENOME

MATHEMATICAL MODELS - Vol. III - Mathematical Modeling and the Human Genome - Hilary S. Booth MATHEMATICAL MODELING AND THE HUMAN GENOME MATHEMATICAL MODELING AND THE HUMAN GENOME Hilary S. Booth Australian National University, Australia Keywords: Human genome, DNA, bioinformatics, sequence analysis, evolution. Contents 1. Introduction:

More information

Classical Selection, Balancing Selection, and Neutral Mutations

Classical Selection, Balancing Selection, and Neutral Mutations Classical Selection, Balancing Selection, and Neutral Mutations Classical Selection Perspective of the Fate of Mutations All mutations are EITHER beneficial or deleterious o Beneficial mutations are selected

More information