South Green Bioinformatics activities at CIRAD

Size: px
Start display at page:

Download "South Green Bioinformatics activities at CIRAD"

Transcription

1 South Green Bioinformatics activities at CIRAD Data Integration Team of the research unit DAP Manuel Ruiz, CIP, Lima, 23rd january

2 The Joint Research Unit DAP (Développement et Amélioration des Plantes = Plant Development and Genetic Improvement) : The 2 main research thematics focus on genetics and plant improvement & development and adaptation.

3 Studied species: rice, wheat, sorghum, sugarcane, banana, coconut, oil palm, yam, coffee, rubber tree, cocoa, cotton, apple and olive

4 Web portal information systems (IS) IS in development Haplophyle MS-DMind (very soon)

5 Information systems a database that manages genetic and genomic information about tropical crops banana, cocoa, coconut, coffee, cotton, oil palm, rice, rubber, sugarcane, sorghum an interactive information system for rice reverse genetics a database for the phenotypic characterization of the Génoplante rice insertion line library a Web portal for crossing cocoa phenotypic, genetic and genomic data rice rice cocoa

6 a database that manages genetic and genomic information about tropical crops Version 1.0 genetic map QTL data marker : RFLP, RAPD, SSR, etc. genotype data phenotype data germplasm data

7 a database that manages genetic and genomic information about tropical crops Chantal Hamelin Version 1.0 genetic map QTL data marker : RFLP, RAPD, SSR, etc. genotype data phenotype data germplasm data Xavier Argout,

8

9

10

11

12 Gaétan Droc

13

14 Pierre Larmande

15 Analysis A methodology for genome-wide searches for orthologs in plants Christophe Périn Matthieu Conte Mathieu Rouard (Bioversity) Analysis pipeline for cdna (secure access) Application for SSR marker development

16 Xavier Argout,

17 Xavier Argout, Jean-François Rami Claire Billot

18 Plants genomes sequencing species Genome size (Mb) Chromosomes number Arabidopsis thaliana Complete Oryza sativa Complete Populus trichocarpa Complete Vitis vinifera Complete Chlamydomonas reinhardtii Complete Sorghum bicolor Complete Medicago truncatula Complete Physcomitrella patens Complete Solanum lycopersicum In progress Triticum aestivum In progress Zea mays In progress C Périn

19 Analysis of plant genomes Species Clone Feature Musa acuminata Calcutta 4 Cavendish Grande Naine Pisang Musa balbisiana Klutuk Wulung wild banana Cultivated bananas are sterile, parthenocarpic, vegetatvely propagated plants wild banana Ploidy, Heterozygosity Heterozygous diploid AA Heterozygous triploid AAA Heterozygous diploid BB 40 BAC 6 BAC 18 BAC Musa acuminata Pahang doubled haploid Homozygous AA 2*600 Mb (2*11Χ) Roux & D'Hont Saccharum hybrid R570 Oryza sativa Japonica sugarcane Rice is a model organism for monocotyledon Heterozygous dodecaploid aneuploid (spontaneum and officinarum parents) diploid 12 BAC Data Organization Manager project Status Organism Family 2*430 Mb (2*12Χ) International Consortium for Sugarcane Biotechnology (ICSB) International Rice Genome Sequencing Project (IRGSP) D'Hont Droc Complete genome JGI Genoscope Submitted BAC Genoscope In progress complete genome complete Sorghum bicolor Sorghum diploid 2*800 Mb (2*10Χ) Rami genome JGI Elaeis guineensis Jacq African oil palm diploid 2*1000 Mb (2*16Χ) Billotte complete genome Arabidopsis thaliana Col-0 Thale cress is a model organism for dicotyledon diploid Theobroma cacao Criollo Cacao tree Homozygous diploid Global Musa Genomics consortium Rouard & Baurens 2*115 Mb (2*5Χ) International Collaboration international 2*380 Mb (2*10Χ) consortium of Lanaud cocoa genomics BAC NIAS complete genome complete genome In progress Done In progress To submit Done To submit Monocotyledon Dicotyledon Musaceae Poaceae Arecaceae Brassicaceae Malvaceae

20 Manual curation is not sufficient Function comment fields for all proteins in Swiss-Prot over time. Baumgartner bioinformatics 2007

21 project Develop a platform of structural and functional annotation supported by comparative genomics Dedicated to plant and bio-aggressor genomes Allowing both automatic predictions and manual curation of genes and transposable elements User-friendly, generic, modular, portable, sustainable, upgradable et compatible GDEC BIVI Spo

22 Instances

23 Instances Stéphanie Sidibe-Bocs Valentin Guignon Gaetan Droc

24 Information system model

25

26 Apollo Editors x

27

28 Find homologs using phylogenomic analysis GreenPhylDB use phylogenomic method to identify homologous genes I suggest that functional predictions can be greatly improved by focusing on how the genes became similar in sequence (i.e., evolution) rather than on the sequence similarity itself. Jonathan A. Eisen 1998

29 Homologs genes: orthologs and paralogs Orthologous genes are homologous genes that are descended from the last common ancestor through speciation and most probably encode proteins with a similar function in different species Speciation event Arabidopsis gene Rice gene A Orthologs Paralogs Gene duplication event Rice gene B Paralogous genes are referred as homologous genes that evolved through duplications and may encode proteins with more divergent functions

30 GreenPhylDB V1.0 Oryza sativa and Arabidopsis thaliana model plants Full genome available Gene annotation quality TAIR gene database release 8: gene ID like At1g12345 TIGR gene database release 5: gene ID like Os01g12345 Most of functional evidence.

31 Pipeline phylogénomique: GreenPhyl Low-complexity masking CAST FILTERING Splices selection Filtering procedure SS* LEON* Gene id indexing GI* Alignment MAFFT MULTIALIGNEMENT Alignment refinement Alignment masking Rascal AL2CO TREE CONSTRUCTION Bootstrapping alignement (x100) Genetic distance (x100) Tree construction (x100) SeqBoot ProtDist PHYML Rooting tree (x100) SDI ORTHOLOGS INFERENCE Set Bootstrap values on PHYML tree Gene id indexing Orthologs Inference SB* GI* DoRIO Output: Orthologs predictions (.txt & NHX files)

32 GreenPhyl phylogenomic pipeline Arabidopsis genes TAIR rice genes TIGR Automatic clustering procedure 6420 manually validated gene families 4400 phylogeneticaly analyzed gene families orthologs relationships between rice and Arabidopsis Probable same function

33 i-gost (Iterative GreenPhyl Orthog Search Tool) 2 objectives Get Add information on to a rice new or sequenced gene using information on to a rice new or sequenced arath with a gene using from information a new species available particularly from studied rice or arabidopsis? Gene with KNOWN? Query biological information Query Add biological information?? Gene with UNKNOWN function

34 GreenPhylDB V2.0 in progress Objectives 10 news fully sequenced genomes are now available (Populus alba, Glycine max, sorghum bicolor, Medicago truncatula, Vitis vinifera, Selaginella moellendorffii, Physcomitrella patens, Ostreococcus Tauri, Chlamydomonas reinhardtii, Cyanidioschyzon merolae ) Why do you integrate these species? 1. Complete sequencing and gene prediction 2. Will provide the complete list of plant gene families! 3. Use functional information available on these species 4. Reinforce phylogenomic signal and then orthologs predictions 5. Have a good taxonomy sampling

35 GreenPhylDB V2.0 A huge database ~390,000 sequences ~ 25,000 clusters 10 news species ~300,000 sequences GreenPhyl Database v2.0 2 species 81,000 genes 21,400 clusters 6,400 genes families GreenPhyl Database V1.0

36 Functional Annotation

37 Annotation fonctionnelle

38 Knowledge modeling of the structure-function relationships

39 The insulin receptor pathway

40 Knowledge modeling of the sequence-structure relationships

41 project

42 GCP Generation Challenge Programme A global consortium of crop research institutes established in 2003 with an approximately 10 year mandate to integrate comparative genomics and genetic resources molecular characterisation into plant breeding for stress tolerance, in particular, in drought-prone environments.

43 Generation Challenge Program GenDiversity is a query and analysis application combining genotyping data from diverse data sources, developed in support of diversity studies. Gautier Sarah Haplophyle, Methodology development for reconstruction of Genealogies based on Haplotypes related to geographic patterns (HaploPhyle: graphical haplotype network in the light of external data)

44 Data Integration

45 Data Integration non GCP DB Software analysis Platform Scientists GCP DB CIMMYT, CIRAD, IRRI, CIP, ICISAT, etc. Raw data

46 Platform architecture

47 Partnership SEG Agropolis Equipes biométrie Cirad X. Perrier L. Baudouin France SRG GS DIA-PC GDP INRA, Genoscope,CNG DGB GCP program Bioversity CIP, IRRI, CIMMYT, EMBRAPA, International ID DAR Swissprot GMOD consortium Biotec (Thailand) LIRMM O. Gascuel I. Mougenot CINES

48 Agropolis Plants Bioinformatics (genetics and genomics) UMRs DAP, DIAPC, BGPI, LSTM, SPO, RPB, BIVI, LGDP ATGC: LIRMM Bioinformatics platform Evolution Sequence analysis Analysis of gene expression Biological Ressources Genomics Genetic diversity Genetic Ressources New algorithms

49 High Power Computing CINES, Montpellier

Supplementary Material

Supplementary Material Supplementary Material Supplementary Table S1. Genomes available in build 47 Supplementary Table S2. Counts of putative contiguous gene split models in 39 plant reference genomes in build 47 Supplementary

More information

The Origin of Species

The Origin of Species The Origin of Species Introduction A species can be defined as a group of organisms whose members can breed and produce fertile offspring, but who do not produce fertile offspring with members of other

More information

Supplemental Figure 1. Comparison of Tiller Bud Formation between the Wild Type and d27. (A) and (B) Longitudinal sections of shoot apex in wild-type

Supplemental Figure 1. Comparison of Tiller Bud Formation between the Wild Type and d27. (A) and (B) Longitudinal sections of shoot apex in wild-type A B 2 3 3 2 1 1 Supplemental Figure 1. Comparison of Tiller Bud Formation between the Wild Type and d27. (A) and (B) Longitudinal sections of shoot apex in wild-type (A) and d27 (B) seedlings at the four

More information

Supplementary Figure 3

Supplementary Figure 3 Supplementary Figure 3 7.0 Col Kas-1 Line FTH1A 8.4 F3PII3 8.9 F26H11 ATQ1 T9I22 PLS8 F26B6-B 9.6 F27L4 9.81 F27D4 9.92 9.96 10.12 10.14 10.2 11.1 0.5 Mb T1D16 Col % RGR 83.3 101 227 93.5 75.9 132 90 375

More information

Comparative genomics of gene families in relation with metabolic pathways for gene candidates highlighting

Comparative genomics of gene families in relation with metabolic pathways for gene candidates highlighting Comparative genomics of gene families in relation with metabolic pathways for gene candidates highlighting Delphine Larivière & David Couvin Under the supervision of Dominique This, Jean-François Dufayard

More information

Impact of recurrent gene duplication on adaptation of plant genomes

Impact of recurrent gene duplication on adaptation of plant genomes Fischer et al. BMC Plant Biology 2014, 14:151 RESEARCH ARTICLE Open Access Impact of recurrent gene duplication on adaptation of plant genomes Iris Fischer 1,2*, Jacques Dainat 3,6, Vincent Ranwez 3, Sylvain

More information

Potato Genome Analysis

Potato Genome Analysis Potato Genome Analysis Xin Liu Deputy director BGI research 2016.1.21 WCRTC 2016 @ Nanning Reference genome construction???????????????????????????????????????? Sequencing HELL RIEND WELCOME BGI ZHEN LLOFRI

More information

Stage 1: Karyotype Stage 2: Gene content & order Step 3

Stage 1: Karyotype Stage 2: Gene content & order Step 3 Supplementary Figure Method used for ancestral genome reconstruction. MRCA (Most Recent Common Ancestor), AMK (Ancestral Monocot Karyotype), AEK (Ancestral Eudicot Karyotype), AGK (Ancestral Grass Karyotype)

More information

USDA-DOE Plant Feedstock Genomics for Bioenergy

USDA-DOE Plant Feedstock Genomics for Bioenergy USDA-DOE Plant Feedstock Genomics for Bioenergy BERAC Thursday, June 7, 2012 Cathy Ronning, DOE-BER Ed Kaleikau, USDA-NIFA Plant Feedstock Genomics for Bioenergy Joint competitive grants program initiated

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION SUPPLEMENTARY INFORMATION doi:10.1038/nature13082 Supplementary Table 1. Examination of nectar production in wild-type and atsweet9 flowers. No. of flowers with detectable nectar out of the total observed

More information

Session 5: Phylogenomics

Session 5: Phylogenomics Session 5: Phylogenomics B.- Phylogeny based orthology assignment REMINDER: Gene tree reconstruction is divided in three steps: homology search, multiple sequence alignment and model selection plus tree

More information

AtTIL-P91V. AtTIL-P92V. AtTIL-P95V. AtTIL-P98V YFP-HPR

AtTIL-P91V. AtTIL-P92V. AtTIL-P95V. AtTIL-P98V YFP-HPR Online Resource 1. Primers used to generate constructs AtTIL-P91V, AtTIL-P92V, AtTIL-P95V and AtTIL-P98V and YFP(HPR) using overlapping PCR. pentr/d- TOPO-AtTIL was used as template to generate the constructs

More information

Supplementary Information for: The genome of the extremophile crucifer Thellungiella parvula

Supplementary Information for: The genome of the extremophile crucifer Thellungiella parvula Supplementary Information for: The genome of the extremophile crucifer Thellungiella parvula Maheshi Dassanayake 1,9, Dong-Ha Oh 1,9, Jeffrey S. Haas 1,2, Alvaro Hernandez 3, Hyewon Hong 1,4, Shahjahan

More information

Bioinformatics tools to analyze complex genomes. Yves Van de Peer Ghent University/VIB

Bioinformatics tools to analyze complex genomes. Yves Van de Peer Ghent University/VIB Bioinformatics tools to analyze complex genomes Yves Van de Peer Ghent University/VIB Detecting colinearity and large-scale gene duplications A 1 2 3 4 5 6 7 8 9 10 11 Speciation/Duplicatio n S1 S2 1

More information

Chapter 14 The Origin of Species

Chapter 14 The Origin of Species Chapter 14 The Origin of Species PowerPoint Lectures for Biology: Concepts & Connections, Sixth Edition Campbell, Reece, Taylor, Simon, and Dickey Copyright 2009 Pearson Education, Inc. Lecture by Joan

More information

Browsing Genomic Information with Ensembl Plants

Browsing Genomic Information with Ensembl Plants Browsing Genomic Information with Ensembl Plants Etienne de Villiers, PhD (Adapted from slides by Bert Overduin EMBL-EBI) Outline of workshop Brief introduction to Ensembl Plants History Content Tutorial

More information

Wheat Genetics and Molecular Genetics: Past and Future. Graham Moore

Wheat Genetics and Molecular Genetics: Past and Future. Graham Moore Wheat Genetics and Molecular Genetics: Past and Future Graham Moore 1960s onwards Wheat traits genetically dissected Chromosome pairing and exchange (Ph1) Height (Rht) Vernalisation (Vrn1) Photoperiodism

More information

Regulatory Change in YABBY-like Transcription Factor Led to Evolution of Extreme Fruit Size during Tomato Domestication

Regulatory Change in YABBY-like Transcription Factor Led to Evolution of Extreme Fruit Size during Tomato Domestication SUPPORTING ONLINE MATERIALS Regulatory Change in YABBY-like Transcription Factor Led to Evolution of Extreme Fruit Size during Tomato Domestication Bin Cong, Luz Barrero, & Steven Tanksley 1 SUPPORTING

More information

Supplementary Figure 1. Number of CC- and TIR- type NBS- LRR genes and presence of mir482/2118 on sequenced plant genomes.

Supplementary Figure 1. Number of CC- and TIR- type NBS- LRR genes and presence of mir482/2118 on sequenced plant genomes. Number of CC- NBS and CC- NBS- LRR R- genes Number of TIR- NBS and TIR- NBS- LRR R- genes 0 50 100 150 200 250 0 50 100 150 200 250 300 350 400 450 mir482 and mir2118 Cajanus cajan Glycine max Hevea brasiliensis

More information

Homology. and. Information Gathering and Domain Annotation for Proteins

Homology. and. Information Gathering and Domain Annotation for Proteins Homology and Information Gathering and Domain Annotation for Proteins Outline WHAT IS HOMOLOGY? HOW TO GATHER KNOWN PROTEIN INFORMATION? HOW TO ANNOTATE PROTEIN DOMAINS? EXAMPLES AND EXERCISES Homology

More information

Supplemental Data. Perea-Resa et al. Plant Cell. (2012) /tpc

Supplemental Data. Perea-Resa et al. Plant Cell. (2012) /tpc Supplemental Data. Perea-Resa et al. Plant Cell. (22)..5/tpc.2.3697 Sm Sm2 Supplemental Figure. Sequence alignment of Arabidopsis LSM proteins. Alignment of the eleven Arabidopsis LSM proteins. Sm and

More information

Origin and diversification of leucine-rich repeat receptor-like protein kinase (LRR-RLK) genes in plants

Origin and diversification of leucine-rich repeat receptor-like protein kinase (LRR-RLK) genes in plants Liu et al. BMC Evolutionary Biology (2017) 17:47 DOI 10.1186/s12862-017-0891-5 RESEARCH ARTICLE Origin and diversification of leucine-rich repeat receptor-like protein kinase (LRR-RLK) genes in plants

More information

Genome Annotation. Bioinformatics and Computational Biology. Genome sequencing Assembly. Gene prediction. Protein targeting.

Genome Annotation. Bioinformatics and Computational Biology. Genome sequencing Assembly. Gene prediction. Protein targeting. Genome Annotation Bioinformatics and Computational Biology Genome Annotation Frank Oliver Glöckner 1 Genome Analysis Roadmap Genome sequencing Assembly Gene prediction Protein targeting trna prediction

More information

Model plants and their Role in genetic manipulation. Mitesh Shrestha

Model plants and their Role in genetic manipulation. Mitesh Shrestha Model plants and their Role in genetic manipulation Mitesh Shrestha Definition of Model Organism Specific species or organism Extensively studied in research laboratories Advance our understanding of Cellular

More information

Evaluation of Genome Sequencing Quality in Selected Plant Species Using Expressed Sequence Tags

Evaluation of Genome Sequencing Quality in Selected Plant Species Using Expressed Sequence Tags Evaluation of Genome Sequencing Quality in Selected Plant Species Using Expressed Sequence Tags Lingfei Shangguan 1, Jian Han 1, Emrul Kayesh 1, Xin Sun 1, Changqing Zhang 2, Tariq Pervaiz 1, Xicheng Wen

More information

Bioinformatics. Dept. of Computational Biology & Bioinformatics

Bioinformatics. Dept. of Computational Biology & Bioinformatics Bioinformatics Dept. of Computational Biology & Bioinformatics 3 Bioinformatics - play with sequences & structures Dept. of Computational Biology & Bioinformatics 4 ORGANIZATION OF LIFE ROLE OF BIOINFORMATICS

More information

Identification and Characterization of Shared Duplications between Rice and Wheat Provide New Insight into Grass Genome Evolution W

Identification and Characterization of Shared Duplications between Rice and Wheat Provide New Insight into Grass Genome Evolution W The Plant Cell, Vol. 20: 11 24, January 2008, www.plantcell.org ª 2008 American Society of Plant Biologists RESEARCH ARTICLES Identification and Characterization of Shared Duplications between Rice and

More information

Cao, J, K Schneeberger, S Ossowski, et al Whole genome sequencing of multiple Arabidopsis thaliana populations. Nat Genet 43:

Cao, J, K Schneeberger, S Ossowski, et al Whole genome sequencing of multiple Arabidopsis thaliana populations. Nat Genet 43: Figure S1. Syntenic map of SAE1B duplication. We have used the nucleotide sequences of Arabidopsis thaliana Col-0 gene tandem duplicates AT5G50580 and AT5G506800 as queries in independent BLASTN searches

More information

Processes of Evolution

Processes of Evolution 15 Processes of Evolution Forces of Evolution Concept 15.4 Selection Can Be Stabilizing, Directional, or Disruptive Natural selection can act on quantitative traits in three ways: Stabilizing selection

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION doi:1.138/nature111 cytosol Model: PILS function in cellular auxin homeostasis ER nucleus IAA degradation? sequestration? conjugation? storage? signalling? PILS IAA ER cytosol Supplemental Figure 1 Model

More information

Bioinformatics tools for phylogeny and visualization. Yanbin Yin

Bioinformatics tools for phylogeny and visualization. Yanbin Yin Bioinformatics tools for phylogeny and visualization Yanbin Yin 1 Homework assignment 5 1. Take the MAFFT alignment http://cys.bios.niu.edu/yyin/teach/pbb/purdue.cellwall.list.lignin.f a.aln as input and

More information

Homology and Information Gathering and Domain Annotation for Proteins

Homology and Information Gathering and Domain Annotation for Proteins Homology and Information Gathering and Domain Annotation for Proteins Outline Homology Information Gathering for Proteins Domain Annotation for Proteins Examples and exercises The concept of homology The

More information

Phylogeny and systematics. Why are these disciplines important in evolutionary biology and how are they related to each other?

Phylogeny and systematics. Why are these disciplines important in evolutionary biology and how are they related to each other? Phylogeny and systematics Why are these disciplines important in evolutionary biology and how are they related to each other? Phylogeny and systematics Phylogeny: the evolutionary history of a species

More information

Genome-wide discovery of G-quadruplex forming sequences and their functional

Genome-wide discovery of G-quadruplex forming sequences and their functional *Correspondence and requests for materials should be addressed to R.G. (rohini@nipgr.ac.in) Genome-wide discovery of G-quadruplex forming sequences and their functional relevance in plants Rohini Garg*,

More information

Araport, a community portal for Arabidopsis. Data integration, sharing and reuse. sergio contrino University of Cambridge

Araport, a community portal for Arabidopsis. Data integration, sharing and reuse. sergio contrino University of Cambridge Araport, a community portal for Arabidopsis. Data integration, sharing and reuse sergio contrino University of Cambridge Acknowledgements J Craig Venter Institute Chris Town Agnes Chan Vivek Krishnakumar

More information

Mathangi Thiagarajan Rice Genome Annotation Workshop May 23rd, 2007

Mathangi Thiagarajan Rice Genome Annotation Workshop May 23rd, 2007 -2 Transcript Alignment Assembly and Automated Gene Structure Improvements Using PASA-2 Mathangi Thiagarajan mathangi@jcvi.org Rice Genome Annotation Workshop May 23rd, 2007 About PASA PASA is an open

More information

Francisco M. Couto Mário J. Silva Pedro Coutinho

Francisco M. Couto Mário J. Silva Pedro Coutinho Francisco M. Couto Mário J. Silva Pedro Coutinho DI FCUL TR 03 29 Departamento de Informática Faculdade de Ciências da Universidade de Lisboa Campo Grande, 1749 016 Lisboa Portugal Technical reports are

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION Figure S1. Haploid plant produced by centromere-mediated genome elimination Chromosomes containing altered CENH3 in their centromeres (green dots) are eliminated after fertilization in a cross to wild

More information

Heredity and Genetics WKSH

Heredity and Genetics WKSH Chapter 6, Section 3 Heredity and Genetics WKSH KEY CONCEPT Mendel s research showed that traits are inherited as discrete units. Vocabulary trait purebred law of segregation genetics cross MAIN IDEA:

More information

From BBCC Conference 2017 Naples, Italy December 2017

From BBCC Conference 2017 Naples, Italy December 2017 Ambrosino et al. BMC Bioinformatics 2018, 19(Suppl 15):435 https://doi.org/10.1186/s12859-018-2420-y RESEARCH Multilevel comparative bioinformatics to investigate evolutionary relationships and specificities

More information

Systematic Analysis and Comparison of Nucleotide-Binding Site Disease Resistance Genes in a Diploid Cotton Gossypium raimondii

Systematic Analysis and Comparison of Nucleotide-Binding Site Disease Resistance Genes in a Diploid Cotton Gossypium raimondii Systematic Analysis and Comparison of Nucleotide-Binding Site Disease Resistance Genes in a Diploid Cotton Gossypium raimondii Hengling Wei 1,2, Wei Li 1, Xiwei Sun 1, Shuijin Zhu 1 *, Jun Zhu 1 * 1 Key

More information

Chapter 2. Gene Orthology Assessment with OrthologID. Mary Egan, Ernest K. Lee, Joanna C. Chiu, Gloria Coruzzi, and Rob DeSalle.

Chapter 2. Gene Orthology Assessment with OrthologID. Mary Egan, Ernest K. Lee, Joanna C. Chiu, Gloria Coruzzi, and Rob DeSalle. Chapter 2 Gene Orthology Assessment with OrthologID Mary Egan, Ernest K. Lee, Joanna C. Chiu, Gloria Coruzzi, and Rob DeSalle Abstract OrthologID (http://nypg.bio.nyu.edu/orthologid/) allows for the rapid

More information

Meiosis and Mendel. Chapter 6

Meiosis and Mendel. Chapter 6 Meiosis and Mendel Chapter 6 6.1 CHROMOSOMES AND MEIOSIS Key Concept Gametes have half the number of chromosomes that body cells have. Body Cells vs. Gametes You have body cells and gametes body cells

More information

Evolution by duplication: paleopolyploidy events in plants reconstructed by deciphering the evolutionary history of VOZ transcription factors

Evolution by duplication: paleopolyploidy events in plants reconstructed by deciphering the evolutionary history of VOZ transcription factors Gao et al. BMC Plant Biology (2018) 18:256 https://doi.org/10.1186/s12870-018-1437-8 RESEARCH ARTICLE Evolution by duplication: paleopolyploidy events in plants reconstructed by deciphering the evolutionary

More information

Managing segregating populations

Managing segregating populations Managing segregating populations Aim of the module At the end of the module, we should be able to: Apply the general principles of managing segregating populations generated from parental crossing; Describe

More information

Principles of QTL Mapping. M.Imtiaz

Principles of QTL Mapping. M.Imtiaz Principles of QTL Mapping M.Imtiaz Introduction Definitions of terminology Reasons for QTL mapping Principles of QTL mapping Requirements For QTL Mapping Demonstration with experimental data Merit of QTL

More information

Genetic diversity and population structure in rice. S. Kresovich 1,2 and T. Tai 3,5. Plant Breeding Dept, Cornell University, Ithaca, NY

Genetic diversity and population structure in rice. S. Kresovich 1,2 and T. Tai 3,5. Plant Breeding Dept, Cornell University, Ithaca, NY Genetic diversity and population structure in rice S. McCouch 1, A. Garris 1,2, J. Edwards 1, H. Lu 1,3 M Redus 4, J. Coburn 1, N. Rutger 4, S. Kresovich 1,2 and T. Tai 3,5 1 Plant Breeding Dept, Cornell

More information

SUPPLEMENTARY MATERIAL SUPPLEMENTARY TABLES

SUPPLEMENTARY MATERIAL SUPPLEMENTARY TABLES SUPPLEMENTARY MATERIAL SUPPLEMENTARY TABLES Supplementary Table 1. Genomes available in Gramene build 38 Supplementary Table 2. Ontology associations in Gramene build 38 Supplementary Table 3. Synteny

More information

Supplemental Table 1. Primers used for cloning and PCR amplification in this study

Supplemental Table 1. Primers used for cloning and PCR amplification in this study Supplemental Table 1. Primers used for cloning and PCR amplification in this study Target Gene Primer sequence NATA1 (At2g393) forward GGG GAC AAG TTT GTA CAA AAA AGC AGG CTT CAT GGC GCC TCC AAC CGC AGC

More information

A computational analysis of Salt Overly Sensitive 1 homologs in halophytes and glycophytes

A computational analysis of Salt Overly Sensitive 1 homologs in halophytes and glycophytes A computational analysis of Salt Overly Sensitive 1 homologs in halophytes and glycophytes Cherin E. Kim 1 and Ray A. Bressan 2 1 West Lafayette Jr./Sr. High School, West Lafayette, IN, USA 2 Department

More information

How to connect to CGIAR wheat (CIMMYT and ICARDA) CRP??- Public wheat breeding for developing world

How to connect to CGIAR wheat (CIMMYT and ICARDA) CRP??- Public wheat breeding for developing world Wheat breeding only exploits 10% of the diversity available The public sector can t breed elite varieties-how to connect to private sector breeders?? How to connect to CGIAR wheat (CIMMYT and ICARDA) CRP??-

More information

BLAST. Varieties of BLAST

BLAST. Varieties of BLAST BLAST Basic Local Alignment Search Tool (1990) Altschul, Gish, Miller, Myers, & Lipman Uses short-cuts or heuristics to improve search speed Like speed-reading, does not examine every nucleotide of database

More information

Computational approaches for functional genomics

Computational approaches for functional genomics Computational approaches for functional genomics Kalin Vetsigian October 31, 2001 The rapidly increasing number of completely sequenced genomes have stimulated the development of new methods for finding

More information

Polyploidy so many options

Polyploidy so many options Polyploidy so many options Impacts of Ploidy Changes Changes in chromosome number and structure can have major health impacts e.g. trisomy 21 Polyploidy in cultivated and domesticated plants is widespread

More information

UON, CAS, DBSC, General Biology II (BIOL102) Dr. Mustafa. A. Mansi. The Origin of Species

UON, CAS, DBSC, General Biology II (BIOL102) Dr. Mustafa. A. Mansi. The Origin of Species The Origin of Species Galápagos Islands, landforms newly emerged from the sea, despite their geologic youth, are filled with plants and animals known no-where else in the world, Speciation: The origin

More information

PGA: A Program for Genome Annotation by Comparative Analysis of. Maximum Likelihood Phylogenies of Genes and Species

PGA: A Program for Genome Annotation by Comparative Analysis of. Maximum Likelihood Phylogenies of Genes and Species PGA: A Program for Genome Annotation by Comparative Analysis of Maximum Likelihood Phylogenies of Genes and Species Paulo Bandiera-Paiva 1 and Marcelo R.S. Briones 2 1 Departmento de Informática em Saúde

More information

Supplemental Figure 1. Comparisons of GC3 distribution computed with raw EST data, bi-beta fits and complete genome sequences for 6 species.

Supplemental Figure 1. Comparisons of GC3 distribution computed with raw EST data, bi-beta fits and complete genome sequences for 6 species. Supplemental Figure 1. Comparisons of GC3 distribution computed with raw EST data, bi-beta fits and complete genome sequences for 6 species. Filled distributions: GC3 computed with raw EST data. Dashed

More information

Microevolutionary changes show us how populations change over time. When do we know that distinctly new species have evolved?

Microevolutionary changes show us how populations change over time. When do we know that distinctly new species have evolved? Microevolutionary changes show us how populations change over time. When do we know that distinctly new species have evolved? Critical to determining the limits of a species is understanding if two populations

More information

Case Study. Who s the daddy? TEACHER S GUIDE. James Clarkson. Dean Madden [Ed.] Polyploidy in plant evolution. Version 1.1. Royal Botanic Gardens, Kew

Case Study. Who s the daddy? TEACHER S GUIDE. James Clarkson. Dean Madden [Ed.] Polyploidy in plant evolution. Version 1.1. Royal Botanic Gardens, Kew TEACHER S GUIDE Case Study Who s the daddy? Polyploidy in plant evolution James Clarkson Royal Botanic Gardens, Kew Dean Madden [Ed.] NCBE, University of Reading Version 1.1 Polypoidy in plant evolution

More information

Bioinformatics Exercises

Bioinformatics Exercises Bioinformatics Exercises AP Biology Teachers Workshop Susan Cates, Ph.D. Evolution of Species Phylogenetic Trees show the relatedness of organisms Common Ancestor (Root of the tree) 1 Rooted vs. Unrooted

More information

BIOINFORMATICS: An Introduction

BIOINFORMATICS: An Introduction BIOINFORMATICS: An Introduction What is Bioinformatics? The term was first coined in 1988 by Dr. Hwa Lim The original definition was : a collective term for data compilation, organisation, analysis and

More information

Ch 11.Introduction to Genetics.Biology.Landis

Ch 11.Introduction to Genetics.Biology.Landis Nom Section 11 1 The Work of Gregor Mendel (pages 263 266) This section describes how Gregor Mendel studied the inheritance of traits in garden peas and what his conclusions were. Introduction (page 263)

More information

Quantitative Genetics & Evolutionary Genetics

Quantitative Genetics & Evolutionary Genetics Quantitative Genetics & Evolutionary Genetics (CHAPTER 24 & 26- Brooker Text) May 14, 2007 BIO 184 Dr. Tom Peavy Quantitative genetics (the study of traits that can be described numerically) is important

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION doi:1.138/nature11394 Supplementary Note Our study is based on two different tracriptome indices. Both combine tracriptional information with an important evolutionary parameter. For the tracriptome age

More information

Add Up and Cross Over Sordaria Genetics Simulation

Add Up and Cross Over Sordaria Genetics Simulation Introduction Add Up and Cross Over Sordaria Genetics Simulation Publication No. Crossing over occurs during metaphase I of meiosis. During crossing over, homologous pairs of chromosomes exchange sections

More information

Fei Lu. Post doctoral Associate Cornell University

Fei Lu. Post doctoral Associate Cornell University Fei Lu Post doctoral Associate Cornell University http://www.maizegenetics.net Genotyping by sequencing (GBS) is simple and cost effective 1. Digest DNA 2. Ligate adapters with barcodes 3. Pool DNAs 4.

More information

Miloš Duchoslav and Lukáš Fischer *

Miloš Duchoslav and Lukáš Fischer * Duchoslav and Fischer BMC Plant Biology (2015) 15:133 DOI 10.1186/s12870-015-0523-4 RESEARCH ARTICLE Open Access Parallel subfunctionalisation of PsbO protein isoforms in angiosperms revealed by phylogenetic

More information

Phylogenetics - Orthology, phylogenetic experimental design and phylogeny reconstruction. Lesser Tenrec (Echinops telfairi)

Phylogenetics - Orthology, phylogenetic experimental design and phylogeny reconstruction. Lesser Tenrec (Echinops telfairi) Phylogenetics - Orthology, phylogenetic experimental design and phylogeny reconstruction Lesser Tenrec (Echinops telfairi) Goals: 1. Use phylogenetic experimental design theory to select optimal taxa to

More information

Computational methods for predicting protein-protein interactions

Computational methods for predicting protein-protein interactions Computational methods for predicting protein-protein interactions Tomi Peltola T-61.6070 Special course in bioinformatics I 3.4.2008 Outline Biological background Protein-protein interactions Computational

More information

Genomewide Selection in Oil Palm: Increasing Selection Gain per Unit Time and Cost with Small Populations

Genomewide Selection in Oil Palm: Increasing Selection Gain per Unit Time and Cost with Small Populations Genomewide Selection in Oil Palm: Increasing Selection Gain per Unit Time and Cost with Small Populations C.K. Wong R. Bernardo 1 ABSTRACT Oil palm (Elaeis guineensis Jacq.) requires 19 years per cycle

More information

Supporting Online Material for

Supporting Online Material for www.sciencemag.org/cgi/content/full/331/6019/876/dc1 Supporting Online Material for Synthetic Clonal Reproduction Through Seeds Mohan P. A. Marimuthu, Sylvie Jolivet, Maruthachalam Ravi, Lucie Pereira,

More information

Evolutionary Patterns, Rates, and Trends

Evolutionary Patterns, Rates, and Trends Evolutionary Patterns, Rates, and Trends Macroevolution Major patterns and trends among lineages Rates of change in geologic time Comparative Morphology Comparing body forms and structures of major lineages

More information

Slovene Plant Gene Bank (SPGB) and Genetic Resources Programme

Slovene Plant Gene Bank (SPGB) and Genetic Resources Programme Slovene Plant Gene Bank (SPGB) and Genetic Resources Programme Second Meeting of the ECPGR Working Group on Leafy Vegetables 8 9 October, Ljubljana, Slovenia Vladimir MEGLIČ, Jelka ŠUŠTAR VOZLIČ Slovene

More information

Comparative genomics: Overview & Tools + MUMmer algorithm

Comparative genomics: Overview & Tools + MUMmer algorithm Comparative genomics: Overview & Tools + MUMmer algorithm Urmila Kulkarni-Kale Bioinformatics Centre University of Pune, Pune 411 007. urmila@bioinfo.ernet.in Genome sequence: Fact file 1995: The first

More information

Diseases of cacao in Colombia: What we know and what we need to know.

Diseases of cacao in Colombia: What we know and what we need to know. Diseases of cacao in Colombia: What we know and what we need to know. Bryan A. Bailey a, Shahin S. Ali a, Mary D. Strem a, Alina Campbell b, Osman GuAerrez b, Dapeng Zhang a, Lyndel W. Meinhardt a a Sustainable

More information

Supporting Information

Supporting Information Supporting Information Fawcett et al. 10.1073/pnas.0900906106 SI Text Estimating the Age of Gene Duplication Events: The Use of K S Values. One of the most common methods used to study and visualize gene

More information

Orthologs Detection and Applications

Orthologs Detection and Applications Orthologs Detection and Applications Marcus Lechner Bioinformatics Leipzig 2009-10-23 Marcus Lechner (Bioinformatics Leipzig) Orthologs Detection and Applications 2009-10-23 1 / 25 Table of contents 1

More information

Genome-wide Identification of Lineage Specific Genes in Arabidopsis, Oryza and Populus

Genome-wide Identification of Lineage Specific Genes in Arabidopsis, Oryza and Populus Genome-wide Identification of Lineage Specific Genes in Arabidopsis, Oryza and Populus Xiaohan Yang Sara Jawdy Timothy Tschaplinski Gerald Tuskan Environmental Sciences Division Oak Ridge National Laboratory

More information

genome a specific characteristic that varies from one individual to another gene the passing of traits from one generation to the next

genome a specific characteristic that varies from one individual to another gene the passing of traits from one generation to the next genetics the study of heredity heredity sequence of DNA that codes for a protein and thus determines a trait genome a specific characteristic that varies from one individual to another gene trait the passing

More information

Computational Structural Bioinformatics

Computational Structural Bioinformatics Computational Structural Bioinformatics ECS129 Instructor: Patrice Koehl http://koehllab.genomecenter.ucdavis.edu/teaching/ecs129 koehl@cs.ucdavis.edu Learning curve Math / CS Biology/ Chemistry Pre-requisite

More information

Impact of recurrent gene duplication on adaptation of plant genomes

Impact of recurrent gene duplication on adaptation of plant genomes Impact of recurrent gene duplication on adaptation of plant genomes Iris Fischer, Jacques Dainat, Vincent Ranwez, Sylvain Glémin, Jacques David, Jean-François Dufayard, Nathalie Chantret Plant Genomes

More information

Life Cycles, Meiosis and Genetic Variability24/02/2015 2:26 PM

Life Cycles, Meiosis and Genetic Variability24/02/2015 2:26 PM Life Cycles, Meiosis and Genetic Variability iclicker: 1. A chromosome just before mitosis contains two double stranded DNA molecules. 2. This replicated chromosome contains DNA from only one of your parents

More information

Name Class Date. KEY CONCEPT Gametes have half the number of chromosomes that body cells have.

Name Class Date. KEY CONCEPT Gametes have half the number of chromosomes that body cells have. Section 1: Chromosomes and Meiosis KEY CONCEPT Gametes have half the number of chromosomes that body cells have. VOCABULARY somatic cell autosome fertilization gamete sex chromosome diploid homologous

More information

Host_microbe_PPI - R package to analyse intra-species and interspecies protein-protein interactions in the model plant Arabidopsis thaliana

Host_microbe_PPI - R package to analyse intra-species and interspecies protein-protein interactions in the model plant Arabidopsis thaliana Host_microbe_PPI - R package to analyse intra-species and interspecies protein-protein interactions in the model plant Arabidopsis thaliana Thomas Nussbaumer 1,2 1 Institute of Network Biology (INET),

More information

16. Methods of breeding introduction and acclimatization

16. Methods of breeding introduction and acclimatization 16. Methods of breeding introduction and acclimatization The following are the methods of breeding autogamous plants. 1. Introduction 2. Selection a) Pure line selection b) Mass selection 3. Hybridization

More information

Elements of Bioinformatics 14F01 TP5 -Phylogenetic analysis

Elements of Bioinformatics 14F01 TP5 -Phylogenetic analysis Elements of Bioinformatics 14F01 TP5 -Phylogenetic analysis 10 December 2012 - Corrections - Exercise 1 Non-vertebrate chordates generally possess 2 homologs, vertebrates 3 or more gene copies; a Drosophila

More information

ELIXIR French Node. J-F. Gibrat

ELIXIR French Node. J-F. Gibrat ELIXIR French Node J-F. Gibrat Unité Mixte de Service IFB-core, CNRS, Gif-sur-Yvette and Unité Mathématique, Informatique et Génome, INRA, Jouy-en-Josas Transplant-ELIXIR workshop, Hinxton July 1-2 2014

More information

PREDICTING COMPLEX PHENOTYPE- GENOTYPE RELATIONSHIPS IN GRASSES: A SYSTEMS GENETICS APPROACH

PREDICTING COMPLEX PHENOTYPE- GENOTYPE RELATIONSHIPS IN GRASSES: A SYSTEMS GENETICS APPROACH Clemson University TigerPrints All Dissertations Dissertations 5-2013 PREDICTING COMPLEX PHENOTYPE- GENOTYPE RELATIONSHIPS IN GRASSES: A SYSTEMS GENETICS APPROACH Stephen Ficklin Clemson University, spficklin@gmail.com

More information

Genotyping By Sequencing (GBS) Method Overview

Genotyping By Sequencing (GBS) Method Overview enotyping By Sequencing (BS) Method Overview Sharon E Mitchell Institute for enomic Diversity Cornell University http://wwwmaizegeneticsnet/ Topics Presented Background/oals BS lab protocol Illumina sequencing

More information

Review of Plant Cytogenetics

Review of Plant Cytogenetics Review of Plant Cytogenetics Updated 2/13/06 Reading: Richards, A.J. and R.K. Dawe. 1998. Plant centromeres: structure and control. Current Op. Plant Biol. 1: 130-135. R.K. Dawe. 2005. Centromere renewal

More information

Intraspecific gene genealogies: trees grafting into networks

Intraspecific gene genealogies: trees grafting into networks Intraspecific gene genealogies: trees grafting into networks by David Posada & Keith A. Crandall Kessy Abarenkov Tartu, 2004 Article describes: Population genetics principles Intraspecific genetic variation

More information

UNIT 3: GENETICS 1. Inheritance and Reproduction Genetics inheritance Heredity parent to offspring chemical code genes specific order traits allele

UNIT 3: GENETICS 1. Inheritance and Reproduction Genetics inheritance Heredity parent to offspring chemical code genes specific order traits allele UNIT 3: GENETICS 1. Inheritance and Reproduction Genetics the study of the inheritance of biological traits Heredity- the passing of traits from parent to offspring = Inheritance - heredity is controlled

More information

METODOLOGIE INTEGRATE PER LA SELEZIONE GENOMICA DI PIANTE ORTIVE SELEZIONE DELLE RISORSE GENOMICHE

METODOLOGIE INTEGRATE PER LA SELEZIONE GENOMICA DI PIANTE ORTIVE SELEZIONE DELLE RISORSE GENOMICHE CORSO GENHORT METODOLOGIE INTEGRATE PER LA SELEZIONE GENOMICA DI PIANTE ORTIVE SELEZIONE DELLE RISORSE GENOMICHE Marzo 2014 Docente: e-mail: Pasquale Termolino termolin@unina.it QUALI SONO LE FONTI DI

More information

Introduction to Bioinformatics. Shifra Ben-Dor Irit Orr

Introduction to Bioinformatics. Shifra Ben-Dor Irit Orr Introduction to Bioinformatics Shifra Ben-Dor Irit Orr Lecture Outline: Technical Course Items Introduction to Bioinformatics Introduction to Databases This week and next week What is bioinformatics? A

More information

Hands-On Nine The PAX6 Gene and Protein

Hands-On Nine The PAX6 Gene and Protein Hands-On Nine The PAX6 Gene and Protein Main Purpose of Hands-On Activity: Using bioinformatics tools to examine the sequences, homology, and disease relevance of the Pax6: a master gene of eye formation.

More information

Homology Modeling. Roberto Lins EPFL - summer semester 2005

Homology Modeling. Roberto Lins EPFL - summer semester 2005 Homology Modeling Roberto Lins EPFL - summer semester 2005 Disclaimer: course material is mainly taken from: P.E. Bourne & H Weissig, Structural Bioinformatics; C.A. Orengo, D.T. Jones & J.M. Thornton,

More information

Prospecting for Green Revolution Genes

Prospecting for Green Revolution Genes Learning Objectives: Prospecting for Green Revolution Genes 1) Discover how changes in individual genes produce phenotypic change 2) Learn to apply bioinformatics tools to identify groups of related genes

More information

Genetics 275 Notes Week 7

Genetics 275 Notes Week 7 Cytoplasmic Inheritance Genetics 275 Notes Week 7 Criteriafor recognition of cytoplasmic inheritance: 1. Reciprocal crosses give different results -mainly due to the fact that the female parent contributes

More information

Small RNA in rice genome

Small RNA in rice genome Vol. 45 No. 5 SCIENCE IN CHINA (Series C) October 2002 Small RNA in rice genome WANG Kai ( 1, ZHU Xiaopeng ( 2, ZHONG Lan ( 1,3 & CHEN Runsheng ( 1,2 1. Beijing Genomics Institute/Center of Genomics and

More information

Doubled haploid ramets via embryogenesis of haploid tissue cultures

Doubled haploid ramets via embryogenesis of haploid tissue cultures Doubled haploid ramets via embryogenesis of haploid tissue cultures Harry E. Iswandar 1, J. M. Dunwell 2, Brian P. Forster 3, Stephen P. C. Nelson 1,4 and Peter D. S. Caligari,3,4,5 ABSTRACT Tissue culture

More information