Slide 1. Slide 2. Slide 3. BIOINFORMATIKA doc.dr.gregor Anderluh Oddelek za biologijo, Biotehniška fakulteta, Ljubljana.

Size: px
Start display at page:

Download "Slide 1. Slide 2. Slide 3. BIOINFORMATIKA doc.dr.gregor Anderluh Oddelek za biologijo, Biotehniška fakulteta, Ljubljana."

Transcription

1 Slide 1 BIOINFORMATIKA doc.dr.gregor Anderluh Oddelek za biologijo, Biotehniška fakulteta, Ljubljana gregor.anderluh@bf.uni-lj.si Slide 2 Slide 3 Vihinen M (2001) Biomol. Eng. 18:

2 Slide 4 BIOKEMIJA, MOL. BIOLOGIJA IN VARSTVO OKOLJA biokemijski procesi, encimi OSNOVNI CELIČNI GRADNIKI DNA, beljakovine, zgradba beljakovin, zgradba DNA, razvoj molekularne biologije (PRE-GENOMIKA) high-throughput biologija, določanje nukleotidnega zaporedja DNA genoma (GENOMIKA) BIOINFORMATIKA razvoj računalništva, pomoč pri delu biokemika, podatkovne zbirke (proteini, DNA, 3D strukture), programi POST-GENOMIKA strukturna genomika, funkcionalna genomika, transkriptom, proteom, interaktom, metabolom, integrirane podatkovne zbirke, specializirani strežniki Slide 5 Slide 6

3 Slide 7 Slide 8 KONKRETEN PRIMER Sprememba amoniaka v dušik (denitrifikacija) V nekaj korakih 1. Nitrosomonas (bakterija) spremeni NH 4+ in NH 3 v NO 2-2. Nitrobacter winogradskyi spremeni NO 2- v NO 3 - nitritna oksidoreduktaza NO - 2 NO O 2 3. NO 3- se nato spremeni v O 2 in N 2 N. europaea Uporablja amoniak kot vir energije Lahko razgrajuje tudi nekatere halogenirane organske spojine, npr. trikloretilen, benzen in vinil klorid Slide 9

4 Slide 10 Toksična spojina Netoksična spojina Mikroorganizem (bakterija, alga, gliva...) Encim (protein, beljakovina) Okoljska biotehnologija, Proteinsko inženirstvo Slide 11 The central dogma states that once information has passed into protein it cannot get out again. The transfer of information form nucleic acid to nucleic acid, or from nucleic acid to protein, may be possible, but transfer from protein to protein, or from protein to nucleic acid, is impossible. Information means here the precise determination of sequence, either of basis in the nucleic acid or of amino acid residues in the protein. Francis Crick, 1958 Slide 12 DNA Osnovni gradniki so NUKLEOTIDI Adenin Gvanin Citozin Timidin >gi gb CF CF tac36g08.y1 Hydra EST -IV Hydra magnipapillata cdna 5' similar to TR:Q9Y1U9 Q9Y1U9 EQUINATOXIN IV PRECURSOR. ;, mrna sequence AACTAGCAAAGCAGACGCTGGAGGAGGCGCTGGAATAGCAATTTTAGGAGTATTAGCAAAAGTCGGCGTA GAAGCTGCGTTACAGCAAATTGATAATATTTGGAAAGGAGATGTAGTGAGGTATTGGAAATGCGCTGTAG AAAACAGATCCGATAAAACACTTTATGCTTACGGAACAACTACGGAATCGGGAAGTATGGGTACAGTTTT TGCCGATATTCCCCCTGGAAGCACAGGAATTTTTGTGTGGGAAAAATCTAGAGGAGCTGCAACTGGAGCA AGTGGTGTAGTTCATTATCGTTATGGAGACAAGATACTTAATTTAATGGCATCCATTCCATATGATTGGA ACTTGTATCAATCTTGGGCTAATGCACGAGTTTCGAATGAAAAGGAAAGCTTTTACAACTTGTATAACGG GTTAAATGGTGCGAAACCTGCTACAAGAGGAGGAAACTGGGGCGACGTTGATGGTGCAAAATTTTTTCTT ACTGAAAAAAGTCACGCAGAGTTTAAAGTTATTTTTTCTGGTTAACCAAATGTTGTCAAAAAAATTGTTC TTTTTCTGTAAA Protein Osnovni gradniki so AMINOKISLINE

5 Slide 13 Slide 14 Slide 15 The proteins are among the most complicated and enigmatic substances in Nature and appear to be particularly closely related to all that we call Life. Presentation speech, Nobel prize for Chemistry 1958

6 Slide 16 >SLIT_DROME (P24014): MAAPSRTTLMPPPFRLQLRLLILPILLLLRHDAVHAEPYSGGFGSSAVS SGGLGSVGIHIPGGGVGVITEARCPRVCSCTGLNVDCSHRGLTSVPRKI SADVERLELQGNNLTVIYETDFQRLTKLRMLQLTDNQIHTIERNSFQDL VSLERLDISNNVITTVGRRVFKGAQSLRSLQLDNNQITCLDEHAFKGLV ELEILTLNNNNLTSLPHNIFGGLGRLRALRLSDNPFACDCHLSWLSRFL RSATRLAPYTRCQSPSQLKGQNVADLHDQEFKCSGLTEHAPMECGAENS CPHPCRCADGIVDCREKSLTSVPVTLPDDTTDVRLEQNFITELPPKSFS SFRRLRRIDLSNNNISRIAHDALSGLKQLTTLVLYGNKIKDLPSGVFKG LGSLRLLLLNANEISCIRKDAFRDLHSLSLLSLYDNNIQSLANGTFDAM KSMKTVHLAKNPFICDCNLRWLADYLHKNPIETSGARCESPKRMHRRRI ESLREEKFKCSWGELRMKLSGECRMDSDCPAMCHCEGTTVDCTGRRLKE IPRDIPLHTTELLLNDNELGRISSDGLFGRLPHLVKLELKRNQLTGIEP NAFEGASHIQELQLGENKIKEISNKMFLGLHQLKTLNLYDNQISCVMPG SFEHLNSLTSLNLASNPFNCNCHLAWFAECVRKKSLNGGAARCGAPSKV RDVQIKDLPHSEFKCSSENSEGCLGDGYCPPSCTCTGTVVACSRNQLKE IPRGIPAETSELYLESNEIEQIHYERIRHLRSLTRLDLSNNQITILSNY TFANLTKLSTLIISYNKLQCLQRHALSGLNNLRVVSLHGNRISMLPEGS FEDLKSLTHIALGSNPLYCDCGLKWFSDWIKLDYVEPGIARCAEPEQMK DKLILSTPSSSFVCRGRVRNDILAKCNACFEQPCQNQAQCVALPQREYQ CLCQPGYHGKHCEFMIDACYGNPCRNNATCTVLEEGRFSCQCAPGYTGA RCETNIDDCLGEIKCQNNATCIDGVESYKCECQPGFSGEFCDTKIQFCS PEFNPCANGAKCMDHFTHYSCDCQAGFHGTNCTDNIDDCQNHMCQNGGT CVDGINDYQCRCPDDYTGKYCEGHNMISMMYPQTSPCQNHECKHGVCFQ PNAQGSDYLCRCHPGYTGKWCEYLTSISFVHNNSFVELEPLRTRPEANV TIVFSSAEQNGILMYDGQDAHLAVELFNGRIRVSYDVGNHPVSTMYSFE MVADGKYHAVELLAIKKNFTLRVDRGLARSIINEGSNDYLKLTTPMFLG GLPVDPAQQAYKNWQIRNLTSFKGCMKEVWINHKLVDFGNAQRQQKITP GCALLEGEQQEEEDDEQDFMDETPHIKEEPVDPCLENKCRRGSRCVPNS NARDGYQCKCKHGQRGRYCDQGEGSTEPPTVTAASTCRKEQVREYYTEN DCRSRQPLKYAKCVGGCGNQCCAAKIVRRRKVRMVCSNNRKYIKNLDIV RKCGCTKKCY Slide Struktura α-heliksa in β-ploskev Pauling and Corey (1951) Proc. Natl. Acad. Sci. USA 27, ; Proc. Natl. Acad. Sci. USA 37, Slide 18 The Nobel Prize in Chemistry 1954 Linus Carl Pauling "for his research into the nature of the chemical bond and its application to the elucidation of the structure of complex substances " Pauling has tried this method in his studies of the structure of proteins with which he has been occupied during recent years. To make a direct determination of the structure of a protein by X-ray methods is out of the question for the present, owing to the enormous number of atoms in the molecule. A molecule of the coloured blood constituent hemoglobin, which is a protein, contains for example more than 8,000 atoms. On this basis Pauling deduced some possible structures of the fundamental units in proteins, and the problem was then to examine whether these could explain the X-ray data obtained. It has thus become apparent that one of these structures, the so-called alpha-helix, probably exists in several proteins.

7 Slide Prvo amino kislinsko zaporedje proteina (insulin) Sanger, F., Thompson, E.O.P. and Kitai, R. (1955) Biochem. J. 59, 509 Slide 20 The Nobel Prize in Chemistry 1958 Frederick Sanger "for his work on the structure of proteins, especially that of insulin" Doctor Frederick Sanger. It sometimes happens that an important scientific discovery is made so to say "overnight" - if the time is ripe and the necessary background is there. Yours is not of that kind. The first successful determination of the structure of a protein is the result of many years of persistent and zealous work, in which the final solution of the problem has been approached step by step. You knew when you began to look into the structure of the insulin molecule 15 years ago that the problem was a formidable one. Slide 21 The Nobel Prize in Chemistry 1962 Max Ferdinand Perutz But even if the path was now open for a direct structure determination of haemoglobin and myoglobin, there was still an enormous amount of data to be processed. Myoglobin, the smaller of the two molecules, contains about 2,600 atoms, and the positions of most of these are now known. But for this purpose, Kendrew had to examine 110 crystals and measure the intensities of about 250,000 X-ray reflections. The calculations would not have been practicable if he had not had access to a very large electronic computer. The haemoglobin molecule is four times as large, and its structure is known less thoroughly. In both cases, however, Kendrew and Perutz are currently collecting and processing an even greater number of reflections in order to obtain a more detailed picture. John Cowdery Kendrew "for their studies of the structures of globular proteins"

8 Slide 22 Slide Slide 24 The Nobel Prize in Physiology or Medicine 1962 Francis Harry Compton Crick James Dewey Watson Maurice Hugh Frederick Wilkins "for their discoveries concerning the molecular structure of nucleic acids and its significance for information transfer in living material" Today no one can really ascertain the consequences of this new exact knowledge of the mechanisms of heredity. We can foresee new possibilities to conquer disease and to gain better knowledge of the interaction of heredity and environment and a greater understanding for the mechanisms of the origin of life. In whatever direction we look we see new vistas. We can, through the discovery by Crick, Watson and Wilkins, to quote John Kendrew, see «the first glimpses of a new world».

9 Slide 25 Slide 26 The Nobel Prize in Physiology or Medicine 1968 Robert W. Holley Har Gobind Khorana Marshall W. Nirenberg "for their interpretation of the genetic code and its function in protein synthesis" Dr. Holley, Dr. Khorana, Dr. Nirenberg. At the end of his Nobel lecture, Edward Tatum in 1958 looked into his crystal ball and tried to predict some of the future developments in molecular biology. He suggested among other things that the solution of the genetic code might come during the lifetime of at least some of the members of his audience. This appeared to be a bold prophecy at that time. In reality it took less than three years before the first letters of the code were deciphered and, because of the ingenuity of you three, the nature of the code and much of its function in protein synthesis were known within less than eight years. Together you have written the most exciting chapter in modern biology. Slide 27

10 Slide 28 The Nobel Prize in Chemistry 1980 Walter Gilbert Frederick Sanger "for their contributions concerning the determination of base sequences in nucleic acids" Sanger is responsible for the first complete determination of the sequence of a DNA molecule. He has established the sequence of the 5375 building blocks in DNA from a bacterial virus called phi-x174. Sequence investigations with the methods of Gilbert and Sanger together with the recombinant-dna technique make excellent tools for continued investigations of the structure and function of the genetic material. Slide Avtomatizacija določanja nukleotidnih zaporedij in uporaba fluorescentnih barvil Leroy Hood in Mike Hunkapiller Slide 30

11 Slide 31 Genomics is the study of an organism's genome and the use of the genes. It deals with the systematic use of genome information, associated with other data, to provide answers in biology, medicine, and industry. Genomics has the potential of offering new therapeutic methods for the treatment of some diseases, as well as new diagnostic methods. Other applications are in the food and agriculture sectors. The major tools and methods related to genomics are bioinformatics, genetic analysis, measurement of gene expression, and determination of gene function. In biology the genome of an organism is the whole hereditary information of an organism that is encoded in the DNA (or, for some viruses, RNA). This includes both the genes and the non-coding sequences. The term was first coined, in 1920, by Hans Winkler, Professor of Botany at the University of Hamburg. Wikipedia 1986 revija GENOMICS (Roderick Thomas) Slide 32 Bakteriofag φx174 Mycoplasma genitalium Hemophilus influenzae M. tuberculosis Escherichia coli Saccharomyces cerevisiae Caenorhabditis elegans Drosophila melanogaster Arabidopsis thaliana Človek Število baznih parov Število genov Določen evkariontskih, 243 evbakterijskih, 24 arhebakterijskih KEGG: Kyoto Encyclopedia of Genes and Genomes Slide 33

12 Slide ABI PRISM 3700 DNA Analyzers, Applied Biosystems 800 povezanih Compaq Alpha-based 64-bit postaj, vsaka sposobna več kot 250 bilijonov primerjav zaporedij na uro. Slide 35 BIOINFORMATIKA Vmesnik med biologijo in računalništvom. Matematične, statistične in računalniške metode za reševanje bioloških problemov z uporabo DNA in proteinskih zaporedij in povezane informacije. Organiziranje, analiza in distribucija biološke informacije pri opisovanju in reševanju bioloških problemov. Zbiranje, organiziranje, shranjevanje in iskanje biološke informacije v podatkovnih zbirkah. NI ISKANJE ČLANKOV IN BRSKANJE ZA REVIJAMI!!! Slide 36 RAZVOJ RAČUNALNIŠTVA DO KONCA 90. LET 1969 ARPANET, povezava računalnikov na Stanfordu in UCLA UNIX 1974 povezava omrežij računalnikov v internet, protokol TCP (Transmission Control Protocol) 1975 Microsoft Corporation bitni procesorji 1981 Prvi osebni računalnik na tržišču (IBM), DOS 1983 Zgoščenka (CD ROM) 1984 Macintosh (Apple) 1985 Microsoft Windows 1991 Protokoli, ki omogočijo WWW (CERN) 1993 Pentium 32 bitni procesor 1995 Windows 1995

13 Slide 37 Količina Leto Ninety-nine percent of people don t have an inkling about how fast this revolution is coming. Steve Fodor, president of Affimetrix Slide 38 POPLAVA PODATKOV!!!! GenBank nukleotidna zaporedja baz v zaporedjih- tradicionalne zbirke baz v zaporedjih- WGS zbirka iz več kot organizmov UniProt proteinska zaporedja vpisov, ki predstavljajo aminokislin iz zaporedij iz 9495 organizmov PDB- Protein Data Bank 3D strukture makromolekul struktur (1514 nukleinskih kislin, proteinov, ostalo sladkorji, kompleksi...) Slide 39 >gi sp P02023 HBB_HUMAN HEMOGLOBIN BETA CHAIN MVHLTPEEKSAVTALWGKVNVDEVGGEALGRLLVVYPWTQRFFESFGDLSTPDAVMG NPVKAHGKKVLGAFSDGLAHLDNLKGTFATLSELHCDKLHVDPENFRLLGNVLVCVL AHHFGKEFTPPVQAAYQKVVAGVANALAHKYH

14 Slide 40 SRPASTOCELIČNA ANEMIJA MVHLTPEEKSAVTALWGKVNVDEVGGEALGRLLVVYPWTQRFFESFGDLSTPDAVMGNPVK AHGKKVLGAFSDGLAHLDNLKGTFATLSELHCDKLHVDPENFRLLGNVLVCVLAHHFGKEF TPPVQAAYQKVVAGVANALAHKYH HbB HbS CCT GAG GAG P E E CCT GTG GAG P V E Slide 41 ORTOLOGI rezultat speciacije PARALOGI rezultat podvojitve genov po speciaciji HOMOLOGI Slide MVLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHFDLSH-----G-SAQVKGHGKKVADALTNAVAHVDDMPNA ALPHA1 80 ALPHA2 -MVLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHFDLSH-----G-SAQVKGHGKKVADALTNAVAHVDDMPNA 80 THETA -MALSAEDRALVRALWKKLGSNVGVYTTEALERTFLAFPATKTYFSHLDLSP-----G-SSQVRAHGQKVADALSLAVERLDDLPHA 80 GAMMA MGHFTEEDKATITSLWGKVN--VEDAGGETLGRLLVVYPWTQRFFDSFGNLSSASAIMGNPKVKAHGKKVLTSLGDAIKHLDDLKGT 85 BETA MVHLTPEEKSAVTALWGKVN--VDEVGGEALGRLLVVYPWTQRFFESFGDLSTPDAVMGNPKVKAHGKKVLGAFSDGLAHLDNLKGT 85 EPSILON MVHFTAEEKAAVTSLWSKMN--VEEAGGEALGRLLVVYPWTQRFFDSFGNLSSPSAILGNPKVKAHGKKVLTSFGDAIKNMDNLKPA 85 DELTA MVHLTPEEKTAVNALWGKVN--VDAVGGEALGRLLVVYPWTQRFFESFGDLSSPDAVMGNPKVKAHGKKVLGAFSDGLAHLDNLKGT 85 MYOGLOBIN -MGLSDGEWQLVLNVWGKVEADIPGHGQEVLIRLFKGHPETLEKFDKFKHLKSEDEMKASEDLKKHGATVLTALGGILKKKGHHEAE LSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLPAEFTPAVHASLDKFLASVSTVLTSKYR ALPHA1 142 ALPHA2 LSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLPAEFTPAVHASLDKFLASVSTVLTSKYR THETA LSALSHLHACQLRVDPASFQLLGHCLLVTLARHYPGDFSPALQASLDKFLSHVISALVSEYR GAMMA FAQLSELHCDKLHVDPENFKLLGNVLVTVLAIHFGKEFTPEVQASWQKMVTGVASALSSRYH BETA FATLSELHCDKLHVDPENFRLLGNVLVCVLAHHFGKEFTPPVQAAYQKVVAGVANALAHKYH EPSILON FAKLSELHCDKLHVDPENFKLLGNVMVIILATHFGKEFTPEVQAAWQKLVSAVAIALAHKYH DELTA FSQLSELHCDKLHVDPENFRLLGNVLVCVLARNFGKEFTPQMQAAYQKVVAGVANALAHKYH MYOGLOBIN IKPLAQSHATKHKIPVKYLEFISECIIQVLQSKHPGDFGADAQGAMNKALELFRKDMASNYKELGFQG 154

15 Slide MVHLTPEEKSAVTALWGKVNVDEVGGEALGRLLVVYPWTQRFFESFGDLSTPDAVMGNP- Človek 59 Šimpanz MVHLTPEEKSAVTALWGKVNVDEVGGEALGRLLVVYPWTQRFFESFGDLSTPDAVMGNPK 60 Zajec MVHLSSEEKSAVTALWGKVNVEEVGGEALGRLLVVYPWTQRFFESFGDLSSANAVMNNPK 60 Zlata ribica -VEWTDAERSAIIGLWGKLNPDELGPQALARCLIVYPWTQRYFATFGNLSSPAAIMGNPK 59 Lastovka -VQWTAEEKQLITGLWGKVNVAECGGEALARLLIVYPWTQRFFASFGNLSSPTAVLGNPK 59 Krokodil -ASFDPHEKQLIGDLWHKVDVAHCGGEALSRMLIVYPWKRRYFENFGDISNAQAIMHNEK VKAHGKKVLGAFSDGLAHLDNLKGTFATLSELHCDKLHVDPENFRLLGNVLVCVLAHHFG Človek 119 Šimpanz VKAHGKKVLGAFSDGLAHLDNLKGTFATLSELHCDKLHVDPENFRLLGNVLVCVLAHHFG 120 Zajec VKAHGKKVLAAFSEGLSHLDNLKGTFAKLSELHCDKLHVDPENFRLLGNVLVIVLSHHFG 120 Zlata ribica VAAHGRTVMGGLERAIKNMDNIKATYAPLSVMHSEKLHVDPDNFRLLADCITVCAAMKFG 119 Lastovka VQAHGKKVLTSFGEAVKNLDSIKNTFSQLSELHCDKLHVDPENFRLLGDILVVVLAAHFG 119 Krokodil VQAHGKKVLASFGEAVCHLDGIRAHFANLSKLHCEKLHVDPENFKLLGDIIIIVLAAHYP KEFTPPVQAAYQKVVAGVANALAHKYH Človek 146 Šimpanz -KEFTPPVQAAYQKVVAGVANALAHKYH 147 Zajec -KEFTPQVQAAYQKVVAGVANALAHKYH 147 Zlata ribica PSGFNADVQEAWQKFLSVVVSALCRQYH 147 Lastovka -KDFTPDCQAAWQKLVRVVAHALARKYH 146 Krokodil -KDFGLECHAAYQKLVRQVAAALAAEYH 146 Slide 44 Slide 45 primarna zgradba sekundarna zgradba terciarna, kvartarna zgradba nukleotidno zaporedje proteinsko zaporedje motiv domena ATGGTGCACCTGAC TCCTGAGGAGAAG... MVHLTPEEK... M-[VIL]-H-[VIL]- -[ST]-P-E2-K... Podatkovna zbirka primarna GeneBank, EMBL, DDBJ primarna Swiss-Prot, PIR, MIPS sekundarna Prosite, Profiles, Pfam, BLOCKS strukturna PDB, SCOP, CATH

16 Slide 46 KEGG: Kyoto Encyclopedia of Genes and Genomes Release 20.1, December 2001 (plus daily updates) Kyoto Encyclopedia of Genes and Genomes (KEGG) is an effort to computerize current knowledge of molecular and cellular biology in terms of the information pathways that consist of interacting molecules or genes and to provide links from the gene catalogs produced by genome sequencing projects. The KEGG project is undertaken in the Bioinformatics Center, Institute for Chemical Research, Kyoto University with supports from the Ministry of Education, Culture, Sports, Science and Technology and the Japan Society for the Promotion of Science. Slide 47 NCBI- The National Center for Biotechnology Information 1988 Kot del National Library of Medicine Namen: razvoj novih informacijskih tehnologij, ki pomagajo razumeti molekularne in genetske procese 1992 Upravljanje z GenBank Entrez Brskalnik za iskanje po bioloških podatkovnih zbirkah, ki jih ureja NCBI (proteinska zaporedja, nukleotidna zaporedja, gensko mapiranje, OMIM, 3D strukture iz PDB, PubMed) Slide 48

17 Slide 49 BIBLIOGRAFSKE PODATKOVNE ZBIRKE NCBI- tekstovne podatkovne zbirke PubMed OMIM BOOK-SHELF PubMed Central PubMed Na NCBI. Prosto dostopna podatkovna zbirka. Vsebuje citate iz MEDLINE in še dodatno informacijo (nekatere druge revije, ki jih v MEDLINE ni) MEDLINE: vsebuje bibliografsko informacijo 4300 revij iz 70 držav; vsebuje preko 11 mio citatov od 1960 naprej. (OLDMEDLINE ) PubMed Journal Browser- informacija o revijah, iz katerih so članki v PubMed; JournalLinkOutProvider- povezave na spletne strani založnikov, ki imajo revije na spletu; MeSH browser (Medical Subject Heading)- pojmi, ki jih PubMed uporablja za indeksiranje člankov Slide 50 OMIM On-line Mendelian Inheritance in Man Katalog človeških genov in bolezni s katerimi so povezani. (=fenotipski dodatek človeškemu genomu) Ureja dr. Victor A McKusick, John Hopkins University Tekstovna informacija in reference, povezave na podatkovne zbirke znotraj NCBI in ostale. Skupaj opisov ( ) Iskanje s ključnimi besedami Slide 51 ExPASy Expert Protein Analysis System Swiss Institute of bioinformatics (SIB) The ExPASy (Expert Protein Analysis System) proteomics server from the Swiss Institute of Bioinformatics (SIB) is dedicated to molecular biology with an emphasis on data relevant to proteins. Avgust 1993 Appel R.D., Bairoch A., Hochstrasser D.F. A new generation of information retrieval tools for biologists: the example of the ExPASy WWW server. Trends Biochem. Sci. 19: (1994). Podatkovne zbirke: SwissProt TrEMBL PROSITE SWISS 2D-PAGE Programi: Orodja za proteomiko Melanie (analiza 2D gelov) Biochemical Pathways

18 Slide 52 Slide 53 Slide 54

19 Slide 55 Slide 56 Slide 57

20 Slide 58 OBLIKA ZAPISA Računalnik vs človek GBFF ASN.1 FASTA(Pearson) najbolj enostaven zapis >P00695 MKALIVLGLVLLSVTVQGKVFERCELARTLKRLGMDGYRGISLANWMCLAKWESGYNTRATNYNAGDRS TDYGIFQINSRYWCNDGKTPGAVNACHLSCSALLQDNIADAVACAKRVVRDPQGIRAWVAWRNRCQNRD VRQYVQGCGV >gi ref NP_ lysozyme precursor [Homo sapiens] MKALIVLGLVLLSVTVQGKVFERCELARTLKRLGMDGYRGISLANWMCLAKWESGYNTRATNYNAGDRS TDYGIFQINSRYWCNDGKTPGAVNACHLSCSALLQDNIADAVACAKRVVRDPQGIRAWVAWRNRCQNRD VRQYVQGCGV >gi ref NM_ Homo sapiens lysozyme (renal amyloidosis) (LYZ), mrna CTAGCACTCTGACCTAGCAGTCAACATGAAGGCTCTCATTGTTCTGGGGCTTGTCCTCCTTTCTGTTACG GTCCAGGGCAAGGTCTTTGAAAGGTGTGAGTTGGCCAGAACTCTGAAAAGATTGGGAATGGATGGCTACA GGGGAATCAGCCTAGCAAACTGGATGTGTTTGGCCAAATGGGAGAGTGGTTACAACACACGAGCTACAAA CTACAATGCTGGAGACAGAAGCACTGATTATGGGATATTTCAGATCAATAGCCGCTACTGGTGTAATGAT GGCAAAACCCCAGGAGCAGTTAATGCCTGTCATTTATCCTGCAGTGCTTTGCTGCAAGATAACATCGCTG Slide 59 Slide 60

21 Slide 61 Slide 62 Slide 63 Score = 90.9 bits (224), Expect = 1e-18 Identities =55/144 (38%), Positives =84/144 (58%),Gaps =7/144(4%) Beta 4 LTPEEKSAVTALWGKV--NVDEVGGEALGRLLVVYPWTQRFFESFGDLSTPDAVMGNPVK 61 L+ E+++ V ALW K+ NV EAL R + +P T+ +F DLS + V+ Theta 3 LSAEDRALVRALWKKLGSNVGVYTTEALERTFLAFPATKTYFSHL-DLSPGSS----QVR 57 Beta 62 AHGKKVLGAFSDGLAHLDNLKGTFATLSELHCDKLHVDPENFRLLGNVLVCVLAHHFGKE 121 AHG+KV A S + LD+L + LS LH +L VDP +F+LLG+ L+ LA H+ + Theta 58 AHGQKVADALSLAVERLDDLPHALSALSHLHACQLRVDPASFQLLGHCLLVTLARHYPGD 117 Beta 122 FTPPVQAAYQKVVAGVANALAHKY 145 F+P +QA+ K ++ V +AL +Y Theta 118 FSPALQASLDKFLSHVISALVSEY 141

22 Slide 64 Vihinen M (2001) Biomol. Eng. 18: FUNKCIONALNA GENOMIKA TRANSKRIPTOMIKA PROTEOMIKA INTERAKTOMIKA METABOLOMIKA Slide 65 TRANSKRIPTOM 1991 EST- expressed sequence tags (Venter JC et al. (1991) Science 252: ) Craig J Venter TIGR The Institute for Genomic Research dbest Slide Affimetrix proizvede prve DNA čipe

23 Slide 67 SLIKA TRANSFORMACIJA 1 MATRIKA TRANSFORMACIJA 2 BIOLOŠKI POJAV Slide 68 Slide 69

24 Slide 70 Environmental genome shotgun sequencing of the Sargasso Sea. Venter, J.C. et al Science 304: milijona zaporedij, ki so predstavljala več kot 1.2 milijona novih genov z računalniško analizo so uspeli sestaviti 12 kompletnih bakterijskih genomov. iz podatkov ocenjujejo, da je v vzorcu Sargaškega morja vsaj 1800 bakterijskih vrst in preko povsem novih genov. od teh je pet vrst zastopanih v večjem številu. Slide 71 Katalog vseh genov na zemeljski obli? ersion1/html/main.htm Sargasso samples generated over 1 billion bp of ocean genomes and 1.3 million new genes Sampling from 300 additional sites can provide hundreds of billions of base pairs of DNA sequence and up to 1 billion new genes that will permit cost effective analysis of ocean microbial diversity and environmental impact One time cost for DNA sequencing and database $50-70 Million ($15 million in hand) Slide 72 VIRI Spletna stran o bioinformatiki Literatura Popoln seznam na zgornjih spletnih straneh Attwood TK, Parry-Smith DJ (1999) Introduction to Bioinformatics. Prentice Hall, Harlow, United Kingdom gregor.anderluh@bf.uni-lj.si

Bioinformatics. Dept. of Computational Biology & Bioinformatics

Bioinformatics. Dept. of Computational Biology & Bioinformatics Bioinformatics Dept. of Computational Biology & Bioinformatics 3 Bioinformatics - play with sequences & structures Dept. of Computational Biology & Bioinformatics 4 ORGANIZATION OF LIFE ROLE OF BIOINFORMATICS

More information

Giri Narasimhan. CAP 5510: Introduction to Bioinformatics CGS 5166: Bioinformatics Tools. Evaluation. Course Homepage.

Giri Narasimhan. CAP 5510: Introduction to Bioinformatics CGS 5166: Bioinformatics Tools. Evaluation. Course Homepage. CAP 5510: Introduction to Bioinformatics CGS 5166: Bioinformatics Tools Giri Narasimhan ECS 389; Phone: x3748 giri@cis.fiu.edu www.cis.fiu.edu/~giri/teach/bioinfs06.html 1/12/06 CAP5510/CGS5166 1 Evaluation

More information

CGS 5991 (2 Credits) Bioinformatics Tools

CGS 5991 (2 Credits) Bioinformatics Tools CAP 5991 (3 Credits) Introduction to Bioinformatics CGS 5991 (2 Credits) Bioinformatics Tools Giri Narasimhan 8/26/03 CAP/CGS 5991: Lecture 1 1 Course Schedules CAP 5991 (3 credit) will meet every Tue

More information

Algorithmics and Bioinformatics

Algorithmics and Bioinformatics Algorithmics and Bioinformatics Gregory Kucherov and Philippe Gambette LIGM/CNRS Université Paris-Est Marne-la-Vallée, France Schedule Course webpage: https://wikimpri.dptinfo.ens-cachan.fr/doku.php?id=cours:c-1-32

More information

Genome Annotation. Bioinformatics and Computational Biology. Genome sequencing Assembly. Gene prediction. Protein targeting.

Genome Annotation. Bioinformatics and Computational Biology. Genome sequencing Assembly. Gene prediction. Protein targeting. Genome Annotation Bioinformatics and Computational Biology Genome Annotation Frank Oliver Glöckner 1 Genome Analysis Roadmap Genome sequencing Assembly Gene prediction Protein targeting trna prediction

More information

GENETICS I: Introduction to Genetics. Assist. Prof. Dr. Betul Akcesme

GENETICS I: Introduction to Genetics. Assist. Prof. Dr. Betul Akcesme GENETICS I: Introduction to Genetics Assist. Prof. Dr. Betul Akcesme 2 NS209 Genetics I Office: F1.7 bakcesme@ius.edu.ba Monday-Wednesday 13:30-14:45 Classroom: A F1. 10 Books: Genetics From Genes to genomes.

More information

BIOINFORMATICS LAB AP BIOLOGY

BIOINFORMATICS LAB AP BIOLOGY BIOINFORMATICS LAB AP BIOLOGY Bioinformatics is the science of collecting and analyzing complex biological data. Bioinformatics combines computer science, statistics and biology to allow scientists to

More information

Related Courses He who asks is a fool for five minutes, but he who does not ask remains a fool forever.

Related Courses He who asks is a fool for five minutes, but he who does not ask remains a fool forever. CSE 527 Computational Biology http://www.cs.washington.edu/527 Lecture 1: Overview & Bio Review Autumn 2004 Larry Ruzzo Related Courses He who asks is a fool for five minutes, but he who does not ask remains

More information

Bioinformatics Exercises

Bioinformatics Exercises Bioinformatics Exercises AP Biology Teachers Workshop Susan Cates, Ph.D. Evolution of Species Phylogenetic Trees show the relatedness of organisms Common Ancestor (Root of the tree) 1 Rooted vs. Unrooted

More information

Analysis and Prediction of Protein Structure (I)

Analysis and Prediction of Protein Structure (I) Analysis and Prediction of Protein Structure (I) Jianlin Cheng, PhD School of Electrical Engineering and Computer Science University of Central Florida 2006 Free for academic use. Copyright @ Jianlin Cheng

More information

CS612 - Algorithms in Bioinformatics

CS612 - Algorithms in Bioinformatics Fall 2017 Databases and Protein Structure Representation October 2, 2017 Molecular Biology as Information Science > 12, 000 genomes sequenced, mostly bacterial (2013) > 5x10 6 unique sequences available

More information

BIOINFORMATICS: An Introduction

BIOINFORMATICS: An Introduction BIOINFORMATICS: An Introduction What is Bioinformatics? The term was first coined in 1988 by Dr. Hwa Lim The original definition was : a collective term for data compilation, organisation, analysis and

More information

BSC 4934: QʼBIC Capstone Workshop" Giri Narasimhan. ECS 254A; Phone: x3748

BSC 4934: QʼBIC Capstone Workshop Giri Narasimhan. ECS 254A; Phone: x3748 BSC 4934: QʼBIC Capstone Workshop" Giri Narasimhan ECS 254A; Phone: x3748 giri@cs.fiu.edu http://www.cs.fiu.edu/~giri/teach/bsc4934_su10.html July 2010 7/12/10 Q'BIC Bioinformatics 1 Overview of Course"

More information

Statistical Machine Learning Methods for Bioinformatics IV. Neural Network & Deep Learning Applications in Bioinformatics

Statistical Machine Learning Methods for Bioinformatics IV. Neural Network & Deep Learning Applications in Bioinformatics Statistical Machine Learning Methods for Bioinformatics IV. Neural Network & Deep Learning Applications in Bioinformatics Jianlin Cheng, PhD Department of Computer Science University of Missouri, Columbia

More information

EBI web resources II: Ensembl and InterPro. Yanbin Yin Spring 2013

EBI web resources II: Ensembl and InterPro. Yanbin Yin Spring 2013 EBI web resources II: Ensembl and InterPro Yanbin Yin Spring 2013 1 Outline Intro to genome annotation Protein family/domain databases InterPro, Pfam, Superfamily etc. Genome browser Ensembl Hands on Practice

More information

Sugars, such as glucose or fructose are the basic building blocks of more complex carbohydrates. Which of the following

Sugars, such as glucose or fructose are the basic building blocks of more complex carbohydrates. Which of the following Name: Score: / Quiz 2 on Lectures 3 &4 Part 1 Sugars, such as glucose or fructose are the basic building blocks of more complex carbohydrates. Which of the following foods is not a significant source of

More information

Biology Tutorial. Aarti Balasubramani Anusha Bharadwaj Massa Shoura Stefan Giovan

Biology Tutorial. Aarti Balasubramani Anusha Bharadwaj Massa Shoura Stefan Giovan Biology Tutorial Aarti Balasubramani Anusha Bharadwaj Massa Shoura Stefan Giovan Viruses A T4 bacteriophage injecting DNA into a cell. Influenza A virus Electron micrograph of HIV. Cone-shaped cores are

More information

Computational Structural Bioinformatics

Computational Structural Bioinformatics Computational Structural Bioinformatics ECS129 Instructor: Patrice Koehl http://koehllab.genomecenter.ucdavis.edu/teaching/ecs129 koehl@cs.ucdavis.edu Learning curve Math / CS Biology/ Chemistry Pre-requisite

More information

Introduction to Bioinformatics Integrated Science, 11/9/05

Introduction to Bioinformatics Integrated Science, 11/9/05 1 Introduction to Bioinformatics Integrated Science, 11/9/05 Morris Levy Biological Sciences Research: Evolutionary Ecology, Plant- Fungal Pathogen Interactions Coordinator: BIOL 495S/CS490B/STAT490B Introduction

More information

Unit 5- Concept 1 THE DNA DISCOVERY

Unit 5- Concept 1 THE DNA DISCOVERY Unit 5- Concept 1 THE DNA DISCOVERY Inheritance has always puzzled people No one really knew how it worked Mendel wasn t known till the late 1800 s He didn t even know what chromosomes were! DNA was discovered

More information

full file at

full file at Chapter 1 1. Genetics contribute to advances in: Answer: E A. agriculture. B. pharmaceuticals. C. medicine. D. modern biology. E. All of the above. 2. Genetic information can be carried in which of the

More information

Development of a Cell Signaling Networks Database. Division of Chem-Bio Informatics, National Institute of Health Sciences,

Development of a Cell Signaling Networks Database. Division of Chem-Bio Informatics, National Institute of Health Sciences, Development of a Cell Signaling Networks Database TAKAKO IGARASHI and TSUGUCHIKA KAMINUMA Division of Chem-Bio Informatics, National Institute of Health Sciences, 1-18-1 Kami-Yoga, Setagaya-ku, Tokyo 158,

More information

The physicist's guide to biology

The physicist's guide to biology Welcome guest from Kyoto Daigaku! Your institution Kyoto Daigaku has provided you with access to Chemistry World. However, there are lots of reasons for you to register a personal account with us today,

More information

Welcome to BIOL 572: Recombinant DNA techniques

Welcome to BIOL 572: Recombinant DNA techniques Lecture 1: 1 Welcome to BIOL 572: Recombinant DNA techniques Agenda 1: Introduce yourselves Agenda 2: Course introduction Agenda 3: Some logistics for BIOL 572 Agenda 4: Q&A section Agenda 1: Introduce

More information

Molecular and cellular biology is about studying cell structure and function

Molecular and cellular biology is about studying cell structure and function Chapter 1 Exploring the World of the Cell In This Chapter Discovering the microscopic world Getting matter and energy Reading the genetic code Molecular and cellular biology is about studying cell structure

More information

Syllabus of BIOINF 528 (2017 Fall, Bioinformatics Program)

Syllabus of BIOINF 528 (2017 Fall, Bioinformatics Program) Syllabus of BIOINF 528 (2017 Fall, Bioinformatics Program) Course Name: Structural Bioinformatics Course Description: Instructor: This course introduces fundamental concepts and methods for structural

More information

Sequencing alignment Ameer Effat M. Elfarash

Sequencing alignment Ameer Effat M. Elfarash Sequencing alignment Ameer Effat M. Elfarash Dept. of Genetics Fac. of Agriculture, Assiut Univ. aelfarash@aun.edu.eg Why perform a multiple sequence alignment? MSAs are at the heart of comparative genomics

More information

Introduction Biology before Systems Biology: Reductionism Reduce the study from the whole organism to inner most details like protein or the DNA.

Introduction Biology before Systems Biology: Reductionism Reduce the study from the whole organism to inner most details like protein or the DNA. Systems Biology-Models and Approaches Introduction Biology before Systems Biology: Reductionism Reduce the study from the whole organism to inner most details like protein or the DNA. Taxonomy Study external

More information

CAP 5510: Introduction to Bioinformatics CGS 5166: Bioinformatics Tools

CAP 5510: Introduction to Bioinformatics CGS 5166: Bioinformatics Tools CAP 5510: to : Tools ECS 254A / EC 2474; Phone x3748; Email: giri@cis.fiu.edu My Homepage: http://www.cs.fiu.edu/~giri http://www.cs.fiu.edu/~giri/teach/bioinfs15.html Office ECS 254 (and EC 2474); Phone:

More information

Cells and the Stuff They re Made of. Indiana University P575 1

Cells and the Stuff They re Made of. Indiana University P575 1 Cells and the Stuff They re Made of Indiana University P575 1 Goal: Establish hierarchy of spatial and temporal scales, and governing processes at each scale of cellular function: o Where does emergent

More information

G4120: Introduction to Computational Biology

G4120: Introduction to Computational Biology ICB Fall 2009 G4120: Introduction to Computational Biology Oliver Jovanovic, Ph.D. Columbia University Department of Microbiology & Immunology Copyright 2008 Oliver Jovanovic, All Rights Reserved. Genome

More information

Topic 1 - The building blocks of. cells! Name:!

Topic 1 - The building blocks of. cells! Name:! B2 - Revision Topic 1 - The building blocks of Lesson cells Name: Topic B2.1 Plant and Animal Cells B2.2 Inside Bacteria B2.3 DNA B2.4 Extracting DNA: PCA B2.5 DNA Discovery B2.6 Genetic Engineering B2.7

More information

11/24/13. Science, then, and now. Computational Structural Bioinformatics. Learning curve. ECS129 Instructor: Patrice Koehl

11/24/13. Science, then, and now. Computational Structural Bioinformatics. Learning curve. ECS129 Instructor: Patrice Koehl Computational Structural Bioinformatics ECS129 Instructor: Patrice Koehl http://www.cs.ucdavis.edu/~koehl/teaching/ecs129/index.html koehl@cs.ucdavis.edu Learning curve Math / CS Biology/ Chemistry Pre-requisite

More information

THERE ARE SIMILARITIES BETWEEN ANGELS AND HUMANS BASED ON THE MOLECULAR STRUCTURE OF NUCLEIC ACIDS. Ahmad Sudirman

THERE ARE SIMILARITIES BETWEEN ANGELS AND HUMANS BASED ON THE MOLECULAR STRUCTURE OF NUCLEIC ACIDS. Ahmad Sudirman THERE ARE SIMILARITIES BETWEEN ANGELS AND HUMANS BASED ON THE MOLECULAR STRUCTURE OF NUCLEIC ACIDS Ahmad Sudirman CAD, CAM and CNC Technology Education with quality (3CTEQ) STOCKHOLM, January 09, 2015

More information

Sequencing alignment Ameer Effat M. Elfarash

Sequencing alignment Ameer Effat M. Elfarash Sequencing alignment Ameer Effat M. Elfarash Dept. of Genetics Fac. of Agriculture, Assiut Univ. amir_effat@yahoo.com Why perform a multiple sequence alignment? MSAs are at the heart of comparative genomics

More information

Introduction to Bioinformatics. Shifra Ben-Dor Irit Orr

Introduction to Bioinformatics. Shifra Ben-Dor Irit Orr Introduction to Bioinformatics Shifra Ben-Dor Irit Orr Lecture Outline: Technical Course Items Introduction to Bioinformatics Introduction to Databases This week and next week What is bioinformatics? A

More information

Tiffany Samaroo MB&B 452a December 8, Take Home Final. Topic 1

Tiffany Samaroo MB&B 452a December 8, Take Home Final. Topic 1 Tiffany Samaroo MB&B 452a December 8, 2003 Take Home Final Topic 1 Prior to 1970, protein and DNA sequence alignment was limited to visual comparison. This was a very tedious process; even proteins with

More information

Overview of I519 & Introduction to Bioinformatics. Yuzhen Ye School of Informatics and Computing, IUB

Overview of I519 & Introduction to Bioinformatics. Yuzhen Ye School of Informatics and Computing, IUB Overview of I519 & Introduction to Bioinformatics Yuzhen Ye (yye@indiana.edu) School of Informatics and Computing, IUB Structure of I519 Two classes and one lab each week Python & C (& R) Textbook: Understanding

More information

Cracking the genetic code: replicating a scientific discovery

Cracking the genetic code: replicating a scientific discovery Cracking the genetic code: replicating a scientific discovery Image courtesy of Caroline Davis2010; image source: flickr By Jordi Domènech-Casal In 1953, James Watson and Francis Crick discovered the structure

More information

Bioinformatics in the post-sequence era

Bioinformatics in the post-sequence era Bioinformatics in the post-sequence era review Minoru Kanehisa 1 & Peer Bork 2 doi:10.1038/ng1109 In the past decade, bioinformatics has become an integral part of research and development in the biomedical

More information

Grundlagen der Bioinformatik Summer semester Lecturer: Prof. Daniel Huson

Grundlagen der Bioinformatik Summer semester Lecturer: Prof. Daniel Huson Grundlagen der Bioinformatik, SS 10, D. Huson, April 12, 2010 1 1 Introduction Grundlagen der Bioinformatik Summer semester 2010 Lecturer: Prof. Daniel Huson Office hours: Thursdays 17-18h (Sand 14, C310a)

More information

Lecture 4. What is Life? 1944 book by Erwin Schrodinger

Lecture 4. What is Life? 1944 book by Erwin Schrodinger Lecture 4 What is Life? 1944 book by Erwin Schrodinger What is What is Life?" A little general-reader book published by Erwin Schrodinger in 1944 An influential book on biology written by a theoretical

More information

EBI web resources II: Ensembl and InterPro

EBI web resources II: Ensembl and InterPro EBI web resources II: Ensembl and InterPro Yanbin Yin http://www.ebi.ac.uk/training/online/course/ 1 Homework 3 Go to http://www.ebi.ac.uk/interpro/training.htmland finish the second online training course

More information

Homology. and. Information Gathering and Domain Annotation for Proteins

Homology. and. Information Gathering and Domain Annotation for Proteins Homology and Information Gathering and Domain Annotation for Proteins Outline WHAT IS HOMOLOGY? HOW TO GATHER KNOWN PROTEIN INFORMATION? HOW TO ANNOTATE PROTEIN DOMAINS? EXAMPLES AND EXERCISES Homology

More information

"Omics" - Experimental Approachs 11/18/05

Omics - Experimental Approachs 11/18/05 "Omics" - Experimental Approachs Bioinformatics Seminars "Omics" Experimental Approaches Nov 18 Fri 12:10 BCB Seminar in E164 Lago Using P-Values for the Planning and Analysis of Microarray Experiments

More information

Sequences, Structures, and Gene Regulatory Networks

Sequences, Structures, and Gene Regulatory Networks Sequences, Structures, and Gene Regulatory Networks Learning Outcomes After this class, you will Understand gene expression and protein structure in more detail Appreciate why biologists like to align

More information

Homology and Information Gathering and Domain Annotation for Proteins

Homology and Information Gathering and Domain Annotation for Proteins Homology and Information Gathering and Domain Annotation for Proteins Outline Homology Information Gathering for Proteins Domain Annotation for Proteins Examples and exercises The concept of homology The

More information

ProtoNet 4.0: A hierarchical classification of one million protein sequences

ProtoNet 4.0: A hierarchical classification of one million protein sequences ProtoNet 4.0: A hierarchical classification of one million protein sequences Noam Kaplan 1*, Ori Sasson 2, Uri Inbar 2, Moriah Friedlich 2, Menachem Fromer 2, Hillel Fleischer 2, Elon Portugaly 2, Nathan

More information

Laying the Foundation of Molecular Biology:

Laying the Foundation of Molecular Biology: Laying the Foundation of Molecular Biology: Max Delbrück and Salvador Luria; Ed Tatum and Joshua Lederberg; Marsha Chase and Al Hershey; Jim Watson and Francis Crick The great speed at which the basic

More information

Hie-Joon Kim. Professor Emeritus Seoul National University. Experience. Representative Publications

Hie-Joon Kim. Professor Emeritus Seoul National University. Experience. Representative Publications Hie-Joon Kim Professor Emeritus Seoul National University B.S. Chemistry, Seoul National University, Korea, 1970 Ph.D. Chemistry, University of Chicago, USA, 1977 Experience Professor, Department of Chemistry

More information

GENETICS - CLUTCH CH.1 INTRODUCTION TO GENETICS.

GENETICS - CLUTCH CH.1 INTRODUCTION TO GENETICS. !! www.clutchprep.com CONCEPT: HISTORY OF GENETICS The earliest use of genetics was through of plants and animals (8000-1000 B.C.) Selective breeding (artificial selection) is the process of breeding organisms

More information

MATHEMATICAL MODELS - Vol. III - Mathematical Modeling and the Human Genome - Hilary S. Booth MATHEMATICAL MODELING AND THE HUMAN GENOME

MATHEMATICAL MODELS - Vol. III - Mathematical Modeling and the Human Genome - Hilary S. Booth MATHEMATICAL MODELING AND THE HUMAN GENOME MATHEMATICAL MODELING AND THE HUMAN GENOME Hilary S. Booth Australian National University, Australia Keywords: Human genome, DNA, bioinformatics, sequence analysis, evolution. Contents 1. Introduction:

More information

Procedure to Create NCBI KOGS

Procedure to Create NCBI KOGS Procedure to Create NCBI KOGS full details in: Tatusov et al (2003) BMC Bioinformatics 4:41. 1. Detect and mask typical repetitive domains Reason: masking prevents spurious lumping of non-orthologs based

More information

Microbiome: 16S rrna Sequencing 3/30/2018

Microbiome: 16S rrna Sequencing 3/30/2018 Microbiome: 16S rrna Sequencing 3/30/2018 Skills from Previous Lectures Central Dogma of Biology Lecture 3: Genetics and Genomics Lecture 4: Microarrays Lecture 12: ChIP-Seq Phylogenetics Lecture 13: Phylogenetics

More information

Lecture 1. Introduction to X-ray Crystallography. Tuesday, February 1, 2011

Lecture 1. Introduction to X-ray Crystallography. Tuesday, February 1, 2011 Lecture 1 Introduction to X-ray Crystallography Tuesday, February 1, 2011 Protein Crystallography Crystal Structure Determination in Principle: From Crystal to Structure Dr. Susan Yates Contact Information

More information

Proteomics. 2 nd semester, Department of Biotechnology and Bioinformatics Laboratory of Nano-Biotechnology and Artificial Bioengineering

Proteomics. 2 nd semester, Department of Biotechnology and Bioinformatics Laboratory of Nano-Biotechnology and Artificial Bioengineering Proteomics 2 nd semester, 2013 1 Text book Principles of Proteomics by R. M. Twyman, BIOS Scientific Publications Other Reference books 1) Proteomics by C. David O Connor and B. David Hames, Scion Publishing

More information

6.096 Algorithms for Computational Biology. Prof. Manolis Kellis

6.096 Algorithms for Computational Biology. Prof. Manolis Kellis 6.096 Algorithms for Computational Biology Prof. Manolis Kellis Today s Goals Introduction Class introduction Challenges in Computational Biology Gene Regulation: Regulatory Motif Discovery Exhaustive

More information

Chem Lecture 3 Hemoglobin & Myoglobin

Chem Lecture 3 Hemoglobin & Myoglobin Chem 452 - Lecture 3 Hemoglobin & Myoglobin 111003 Hemoglobin (Hb) and Myoglobin (Mb) function as oxygen transport and storage molecules in higher organisms. There functions have been long studied and,

More information

Early History up to Schedule. Proteins DNA & RNA Schwann and Schleiden Cell Theory Charles Darwin publishes Origin of Species

Early History up to Schedule. Proteins DNA & RNA Schwann and Schleiden Cell Theory Charles Darwin publishes Origin of Species Schedule Bioinformatics and Computational Biology: History and Biological Background (JH) 0.0 he Parsimony criterion GKN.0 Stochastic Models of Sequence Evolution GKN 7.0 he Likelihood criterion GKN 0.0

More information

Today in Astronomy 106: the long molecules of life

Today in Astronomy 106: the long molecules of life Today in Astronomy 106: the long molecules of life Wet chemistry of nucleobases Nuances of polymerization Replication or mass production of nucleic acids Transcription Codons The protein hemoglobin. From

More information

Protein Structure Prediction II Lecturer: Serafim Batzoglou Scribe: Samy Hamdouche

Protein Structure Prediction II Lecturer: Serafim Batzoglou Scribe: Samy Hamdouche Protein Structure Prediction II Lecturer: Serafim Batzoglou Scribe: Samy Hamdouche The molecular structure of a protein can be broken down hierarchically. The primary structure of a protein is simply its

More information

Exploring Evolution & Bioinformatics

Exploring Evolution & Bioinformatics Chapter 6 Exploring Evolution & Bioinformatics Jane Goodall The human sequence (red) differs from the chimpanzee sequence (blue) in only one amino acid in a protein chain of 153 residues for myoglobin

More information

Bio 4337 Seminal papers in Molecular Biology

Bio 4337 Seminal papers in Molecular Biology Bio 4337 Seminal papers in Molecular Biology Vincent P. Cirillo Spring, 2007, MW 11:00 AM-12:45 PM Seminal Papers is a history course. Selected theoretical and experimental papers in the history of Molecular

More information

What is Systems Biology

What is Systems Biology What is Systems Biology 2 CBS, Department of Systems Biology 3 CBS, Department of Systems Biology Data integration In the Big Data era Combine different types of data, describing different things or the

More information

Flow of Genetic Information

Flow of Genetic Information presents Flow of Genetic Information A Montagud E Navarro P Fernández de Córdoba JF Urchueguía Elements Nucleic acid DNA RNA building block structure & organization genome building block types Amino acid

More information

Homology Modeling (Comparative Structure Modeling) GBCB 5874: Problem Solving in GBCB

Homology Modeling (Comparative Structure Modeling) GBCB 5874: Problem Solving in GBCB Homology Modeling (Comparative Structure Modeling) Aims of Structural Genomics High-throughput 3D structure determination and analysis To determine or predict the 3D structures of all the proteins encoded

More information

Chapter 2 Evolution: Constructing a Fundamental Scientific Theory

Chapter 2 Evolution: Constructing a Fundamental Scientific Theory Chapter 2 Evolution: Constructing a Fundamental Scientific Theory TRUE/FALSE 1. Organisms classified in two different biological orders can still belong to the same genus. 2. Before 1700, most Western

More information

Computational Molecular Modeling

Computational Molecular Modeling Computational Molecular Modeling Lecture 1: Structure Models, Properties Chandrajit Bajaj Today s Outline Intro to atoms, bonds, structure, biomolecules, Geometry of Proteins, Nucleic Acids, Ribosomes,

More information

X-Ray structure analysis

X-Ray structure analysis X-Ray structure analysis Kay Diederichs kay.diederichs@uni-konstanz.de Analysis of what? Proteins ( /ˈproʊˌtiːnz/ or /ˈproʊti.ɨnz/) are biochemical compounds consisting of one or more polypeptides typically

More information

I. Molecules & Cells. A. Unit One: The Nature of Science. B. Unit Two: The Chemistry of Life. C. Unit Three: The Biology of the Cell.

I. Molecules & Cells. A. Unit One: The Nature of Science. B. Unit Two: The Chemistry of Life. C. Unit Three: The Biology of the Cell. I. Molecules & Cells A. Unit One: The Nature of Science a. How is the scientific method used to solve problems? b. What is the importance of controls? c. How does Darwin s theory of evolution illustrate

More information

Protein function prediction based on sequence analysis

Protein function prediction based on sequence analysis Performing sequence searches Post-Blast analysis, Using profiles and pattern-matching Protein function prediction based on sequence analysis Slides from a lecture on MOL204 - Applied Bioinformatics 18-Oct-2005

More information

Genom, transkriptom in proteom

Genom, transkriptom in proteom Genom, transkriptom in proteom The conceptual relationship of the genome, transcriptome, proteome and metabolome Fenotipi/Phenotypes Morphological phenotypes are fixed in distinct dog breeds. Fizikalna

More information

The focus of this study is whether evolutionary medicine can be considered a distinct scientific discipline.

The focus of this study is whether evolutionary medicine can be considered a distinct scientific discipline. The purpose of this study is to characterize trends in the application of evolutionary biology in health and disease. The focus of this study is whether evolutionary medicine can be considered a distinct

More information

Proteins: Structure & Function. Ulf Leser

Proteins: Structure & Function. Ulf Leser Proteins: Structure & Function Ulf Leser This Lecture Proteins Structure Function Databases Predicting Protein Secondary Structure Many figures from Zvelebil, M. and Baum, J. O. (2008). "Understanding

More information

Biological Concepts and Information Technology (Systems Biology)

Biological Concepts and Information Technology (Systems Biology) Biological Concepts and Information Technology (Systems Biology) Janaina de Andréa Dernowsek Postdoctoral at Center for Information Technology Renato Archer Janaina.dernowsek@cti.gov.br Division of 3D

More information

Small RNA in rice genome

Small RNA in rice genome Vol. 45 No. 5 SCIENCE IN CHINA (Series C) October 2002 Small RNA in rice genome WANG Kai ( 1, ZHU Xiaopeng ( 2, ZHONG Lan ( 1,3 & CHEN Runsheng ( 1,2 1. Beijing Genomics Institute/Center of Genomics and

More information

You are required to know all terms defined in lecture. EXPLORE THE COURSE WEB SITE 1/6/2010 MENDEL AND MODELS

You are required to know all terms defined in lecture. EXPLORE THE COURSE WEB SITE 1/6/2010 MENDEL AND MODELS 1/6/2010 MENDEL AND MODELS!!! GENETIC TERMINOLOGY!!! Essential to the mastery of genetics is a thorough knowledge and understanding of the vocabulary of this science. New terms will be introduced and defined

More information

2012 Univ Aguilera Lecture. Introduction to Molecular and Cell Biology

2012 Univ Aguilera Lecture. Introduction to Molecular and Cell Biology 2012 Univ. 1301 Aguilera Lecture Introduction to Molecular and Cell Biology Molecular biology seeks to understand the physical and chemical basis of life. and helps us answer the following? What is the

More information

Sequence Database Search Techniques I: Blast and PatternHunter tools

Sequence Database Search Techniques I: Blast and PatternHunter tools Sequence Database Search Techniques I: Blast and PatternHunter tools Zhang Louxin National University of Singapore Outline. Database search 2. BLAST (and filtration technique) 3. PatternHunter (empowered

More information

Introduction to Bioinformatics

Introduction to Bioinformatics CSCI8980: Applied Machine Learning in Computational Biology Introduction to Bioinformatics Rui Kuang Department of Computer Science and Engineering University of Minnesota kuang@cs.umn.edu History of Bioinformatics

More information

Lecture 5. How DNA governs protein synthesis. Primary goal: How does sequence of A,G,T, and C specify the sequence of amino acids in a protein?

Lecture 5. How DNA governs protein synthesis. Primary goal: How does sequence of A,G,T, and C specify the sequence of amino acids in a protein? Lecture 5 (FW) February 4, 2009 Translation, trna adaptors, and the code Reading.Chapters 8 and 9 Lecture 5. How DNA governs protein synthesis. Primary goal: How does sequence of A,G,T, and C specify the

More information

Bioinformatics Practical for Biochemists

Bioinformatics Practical for Biochemists Bioinformatics Practical for Biochemists Andrei Lupas, Birte Höcker, Steffen Schmidt WS 2013/2014 01. DNA & Genomics!! 1 Description Lectures about general topics in Bioinformatics & History Tutorials

More information

Short Course Mathematical Molecular Biology Bob Eisenberg Shanghai Jiao Tong University Sponsor Zhenli Xu

Short Course Mathematical Molecular Biology Bob Eisenberg Shanghai Jiao Tong University Sponsor Zhenli Xu u Short Course Mathematical Molecular Biology Bob Eisenberg Shanghai Jiao Tong University Sponsor Zhenli Xu 1 Mathematics in Molecular and Cellular Biology Many thanks to Zhenli for inviting me!! Page

More information

Bioinformatics Chapter 1. Introduction

Bioinformatics Chapter 1. Introduction Bioinformatics Chapter 1. Introduction Outline! Biological Data in Digital Symbol Sequences! Genomes Diversity, Size, and Structure! Proteins and Proteomes! On the Information Content of Biological Sequences!

More information

Comparative genomics: Overview & Tools + MUMmer algorithm

Comparative genomics: Overview & Tools + MUMmer algorithm Comparative genomics: Overview & Tools + MUMmer algorithm Urmila Kulkarni-Kale Bioinformatics Centre University of Pune, Pune 411 007. urmila@bioinfo.ernet.in Genome sequence: Fact file 1995: The first

More information

I. Molecules and Cells: Cells are the structural and functional units of life; cellular processes are based on physical and chemical changes.

I. Molecules and Cells: Cells are the structural and functional units of life; cellular processes are based on physical and chemical changes. I. Molecules and Cells: Cells are the structural and functional units of life; cellular processes are based on physical and chemical changes. A. Chemistry of Life B. Cells 1. Water How do the unique chemical

More information

Implications of Structural Genomics Target Selection Strategies: Pfam5000, Whole Genome, and Random Approaches

Implications of Structural Genomics Target Selection Strategies: Pfam5000, Whole Genome, and Random Approaches PROTEINS: Structure, Function, and Bioinformatics 58:166 179 (2005) Implications of Structural Genomics Target Selection Strategies: Pfam5000, Whole Genome, and Random Approaches John-Marc Chandonia 1

More information

Electronegativity and Bond Polarity

Electronegativity and Bond Polarity Electronegativity and Bond Polarity Although we defined covalent bonding as electron sharing, the electrons in a covalent bond are not always shared equally by the two bonded atoms. Unless the bond connects

More information

Large-Scale Genomic Surveys

Large-Scale Genomic Surveys Bioinformatics Subtopics Fold Recognition Secondary Structure Prediction Docking & Drug Design Protein Geometry Protein Flexibility Homology Modeling Sequence Alignment Structure Classification Gene Prediction

More information

Introduction to Molecular and Cell Biology

Introduction to Molecular and Cell Biology Introduction to Molecular and Cell Biology Molecular biology seeks to understand the physical and chemical basis of life. and helps us answer the following? What is the molecular basis of disease? What

More information

G4120: Introduction to Computational Biology

G4120: Introduction to Computational Biology ICB Fall 2003 G4120: Introduction to Computational Biology Oliver Jovanovic, Ph.D. Columbia University Department of Microbiology Copyright 2003 Oliver Jovanovic, All Rights Reserved. Bioinformatics and

More information

Genomics and bioinformatics summary. Finding genes -- computer searches

Genomics and bioinformatics summary. Finding genes -- computer searches Genomics and bioinformatics summary 1. Gene finding: computer searches, cdnas, ESTs, 2. Microarrays 3. Use BLAST to find homologous sequences 4. Multiple sequence alignments (MSAs) 5. Trees quantify sequence

More information

Updated: 10/11/2018 Page 1 of 5

Updated: 10/11/2018 Page 1 of 5 A. Academic Division: Health Sciences B. Discipline: Biology C. Course Number and Title: BIOL1230 Biology I MASTER SYLLABUS 2018-2019 D. Course Coordinator: Justin Tickhill Assistant Dean: Melinda Roepke,

More information

Introduction to Bioinformatics

Introduction to Bioinformatics Introduction to Bioinformatics Lecture : p he biological problem p lobal alignment p Local alignment p Multiple alignment 6 Background: comparative genomics p Basic question in biology: what properties

More information

Genomes and Their Evolution

Genomes and Their Evolution Chapter 21 Genomes and Their Evolution PowerPoint Lecture Presentations for Biology Eighth Edition Neil Campbell and Jane Reece Lectures by Chris Romero, updated by Erin Barley with contributions from

More information

Lecture 1: Introduction to Bioinformatics September 22, 2014

Lecture 1: Introduction to Bioinformatics September 22, 2014 ICQB Introduction to Computational & Quantitative Biology (G4120) Fall 2014 Oliver Jovanovic, Ph.D. Columbia University Department of Microbiology & Immunology The Genomics Era Archaea 176 Bacteria 3,021

More information

ALL LECTURES IN SB Introduction

ALL LECTURES IN SB Introduction 1. Introduction 2. Molecular Architecture I 3. Molecular Architecture II 4. Molecular Simulation I 5. Molecular Simulation II 6. Bioinformatics I 7. Bioinformatics II 8. Prediction I 9. Prediction II ALL

More information

Journal of Proteomics & Bioinformatics - Open Access

Journal of Proteomics & Bioinformatics - Open Access Abstract Methodology for Phylogenetic Tree Construction Kudipudi Srinivas 2, Allam Appa Rao 1, GR Sridhar 3, Srinubabu Gedela 1* 1 International Center for Bioinformatics & Center for Biotechnology, Andhra

More information

Curriculum Links. AQA GCE Biology. AS level

Curriculum Links. AQA GCE Biology. AS level Curriculum Links AQA GCE Biology Unit 2 BIOL2 The variety of living organisms 3.2.1 Living organisms vary and this variation is influenced by genetic and environmental factors Causes of variation 3.2.2

More information

Bioinformatics. Part 8. Sequence Analysis An introduction. Mahdi Vasighi

Bioinformatics. Part 8. Sequence Analysis An introduction. Mahdi Vasighi Bioinformatics Sequence Analysis An introduction Part 8 Mahdi Vasighi Sequence analysis Some of the earliest problems in genomics concerned how to measure similarity of DNA and protein sequences, either

More information