Gre C G G A T T A T T C A T A T A A T T G T T A T A C C A G A C G G T C G C
|
|
- Curtis Newton
- 6 years ago
- Views:
Transcription
1 2.1.1 PAH cdna Sequence Compiled by S.Byck, P.M.Nowacki, and Max Caccione. This Reference Sequence (simplified) is deposited in GenBank: U This sequence is derived from the following: Partial 5'UTR sequence, Konecki et al., Exon sequences, Kwok et al., Partial Intron sequences, Svensson (thesis) and others. Other DiLella 1986, Kwok (1992). Polymorphisms are indicated in color above the reference sequence where available. Polymorphisms were obtained from the following sources: red from Gulberg primer sequence. purple from Kalaydjieva et al., blue from DiLella intron data. dark green from Abadie et al., light blue from Woo primer sequence. olive green from Svensson 5'UTR sequence. NOTE: Where a nucleotide is inserted, the reference and inserted nucleotides are written above the reference. Where a nucleotide is deleted, 'del' is written above the reference nucleotide This document is color coded, the color version is in MS Excel format and is available at Pvu II C A G C T G G G G G T A A G G G G G G Gre C G G A T T A T T C A T A T A A T T G T T A T A C C A G A C G G T C G C CAAT Gre Ap-2 A G G C T T A G T C C A A T T G C A G A G A A C T C G C T T C C C A G G C T T C T G A G A G T C C C G G A A G T G C C T A A A C C T G T C T A A CACCC Ap-2 T C G A C G G G G C T T G G G T G G C C C G T C G C T C C C T G G C T T Ap-2 C T T C C C T T T A C C C A G G G C G G G C A G C G A A G T G G T G C C CACCC T C C T G C G T C C C C C A C A C C C T C C C T C A G C C C C T C C C C T C C G G C C C G T C C T G G G C A G G T G A C C T G G A G C A T C C G Cre GG G GA G C A G G C T G C C C T G G C C T C C T G C G T C A G G A C A A G C C C C T del A C G A G G G G C G T T A C T G T G C G G A G A T G C A C C A C G C A A del G A G A C A C C C T T T G T A A C T C T C T T C T C C T C C C T A G T G C G A G G T T A A A A C C T T C A G C C C C A C G T G C T G T T T G C A A A C C T G C C T G T A C C T G A G G C C C T A A A A A G C C A G A G A START M e t S e r T h r A l a V a l C C T C A C T C C C G G G G A G C C A G C A T G T C C A C T G C G G T C 'UTR E1 <- -> L e u G l u A s n P r o G l y L e u G l y A r g L y s L e u S e r A s p C T G G A A A A C C C A G G C T T G G G C A G G A A A C T C T C T G A C
2 P h e G l y G l n T T T G G A C A G g t g a g c c a c g g c a g c c t g a g c t g c t c a g t t a g g g g a a t t t g g g c c t c c a g a g a a a g a g a t c c g a a g a c t g c t g g t g c t t c c t g g t t t c a t a a g c t c a g t a a g a a g t c t g a a t t c g g a a g g c t g c c t g g c c t c t g c g t c a g g a c a g c c a g a g g g c g t t a c t g t g c g g a g a t g c a c a g a a g a g a c a c c t t t g t a c t c t c t c t c t c c t a g t g c g a g t a a t c a g c c a g t g t c g t t g a a c g t c g t c t a c t a g g c t a a g a t t a t g c a a g a t t t a a c t a g a a g a a a g g a g t t c a t g c t t g c t t t g t c c a t g g a g g t t t a a c a g g a a t g a a t t g c t a a a c t g t g g a a a a t g t t t t a a a c a G l u T h r S e r T y r I l e G l a a t g c a t c t t a t c c t g t a g G A A A C A A G C T A T A T T G A 61 E u A s p A s n C y s A s n G l n A s n G l y A l a I l e S e r L e u I l A G A C A A C T G C A A T C A A A A T G G T G C C A T A T C A C T G A T e P h e S e r L e u L y s G l u G l u V a l G l y A l a L e u A l a L y C T T C T C A C T C A A A G A A G A A G T T G G T G C A T T G G C C A A s V a l L e u A r g L e u P h e G l u a A G T A T T G C G C T T A T T T G A G g t c a g t g c t a c a a t c a t g t t t g t c t t g g a t a a t g t c g t a g c a a a c t t c c a t g t t c t t t t c t a g t t a g a t g c a a t g a a a a g a a c a c a g g a t c t g g a a c a g g c t g a a c t c t c c a t t t g t t g c g t t a g g t t t t c c t g t t c t g g t t c t g c a t c t t t g g c c t g c g t t a g t t c c t g t g a c t g t c t c c t c a c c c t c c c c a t t c t t G l u A s n A s p V a l A s n L e u T h r H i s I l e c t c g t c t a g G A G A A T G A T G T A A A C C T G A C C C A C A T T E G l u S e r A r g P r o S e r A r g L e u L y s L y s A s p G l u T y r G A A T C T A G A C C T T C T C G T T T A A A G A A A G A T G A G T A T G l u P h e P h e T h r H i s L e u A s p L y s A r g S e r L e u P r o G A A T T T T T C A C C C A T T T G G A T A A A C G T A G C C T G C C T A l a L e u T h r A s n I l e I l e L y s I l e L e u A r g H i s A s p G C T C T G A C A A A C A T C A T C A A G A T C T T G A G G C A T G A C I l e G l y A l a T h r V a l H i s G l u L e u S e r A r g A s p L y s A T T G G T G C C A C T G T C C A T G A G C T T T C A C G A G A T A A G L y s L y s A s p T h r V (a l) A A G A A A G A C A C A G g t a a g a a t t a g a g g a a t t t t g c a
3 a c a t a a g t a a c t c a c a t g t c t t a t a a t g t t c t g c c a a 118 V a t c t g t a c t c a g g a c g t t g c c t t c t c t g t g t t t c a g T E l P r o T r p P h e P r o A r g T h r I l e G l n G l u L e u A s p A r G C C C T G G T T C C C A A G A A C C A T T C A A G A G C T G G A C A G g P h e A l a A s n G l n I l e L e u S e r T y r G l y A l a G l u L e A T T T G C C A A T C A G A T T C T C A G C T A T G G A G C G G A A C T u A s p A l a A s p H i s P r o t g t c G G A T G C T G A C C A C C C T g t g a g t c c a t g g c c c g t a g a t g t g a g a t t t t t t g a c a a a t c c a c a g c g a g g a g g c t c a a g g t a a a t c a g g g a a g t t a c t a a t a a g g t g t c a c t t a c a a t a t c t g g t a g g a g a g c t a a g t t t a a c c c g a g a c t t t a g t t t t a g g g t a t a c c c a a g g g a a g g a g a g a t g c a c t g t c a t g g c t t t a g a g c c c c c a t t c a a a g c a t t c a t a a a g g t a c c a g a c c t c t t c c t a t g a a g c c t t g G l y P h e L y s a a a a a t c a g g t g t c t c t t t t c t c c t a g G G T T T T A A A E A s p P r o V a l T y r A r g A l a A r g A r g L y s G l n P h e A l a G A T C C T G T G T A C C G T G C A A G A C G G A A G C A G T T T G C T A s p I l e A l a T y r A s n T y r A r g H i (s) G A C A T T G C C T A C A A C T A C C G C C A g t a a g t c t g c c t t g c t t g t t g a g g g g a a g a g a g a a a a a c a c a c c c c t a g del c c t g c t t c t c c c t t g c c c t c a t c c a g t t g a g g a t g g a a a a t a c c a g c a t g a a g a g t a a t c t t g c a c g c c c t c g c t c a a g c c t t c g t c a c t g g t c c c g c c a c c a a a c g t t t c g g c g a g a a g c a g g c c a t t a t c g c c g g c a t g g c g g c c g a c g c g c t g g g c t a c t t c t t g c t g c g t t c c g a c g c g a g g c t g g a t g g c c t t c c c c a t t a t g a t t a g a a t t t t c t c t t c a a a t a a t g g t a g t a a a t t c a c t g t a g c a a g t g a t g g c a g c t c a c a g g t t c t g g t c c c c g a c t c c c t c t g c t a a c c t a a c c t g c a t t c t g c t g t g c c c c t del ga tt g c c c t g c c t t g a g c a c c t a t t t t g t g c c t g t a t t c t (H i) s G l y G l n P r o I l e P r o A r g V a l G l u T y r M e t G l u a g T G G G C A G C C C A T C C C T C G A G T G G A A T A C A T G G A G 510 E G l u G l u L y s L y s T h r T r p G l y T h r V a l P h e L y s T h r G A A G A A A A G A A A A C A T G G G G C A C A G T G T T C A A G A C T L e u L y s S e r L e u T y r L y s T h r H i s A l a C y s T y r G l u C T G A A G T C C T T G T A T A A A A C C C A T G C T T G C T A T G A G T y r A s n H i s I l e P h e P r o L e u L e u G l u L y s T y r C y s T A C A A T C A C A T T T T T C C A C T T C T T G A A A A G T A C T G T
4 G l y P h e H i s G l u A s p A s n I l e P r o G l n L e u G l u A s p G G C T T C C A T G A A G A T A A C A T T C C C C A G C T G G A A G A C V a l S e r G l n P h e L e u G l n T (h r) G T T T C T C A A T T C C T G C A G A g t a a g t c c a c a t c a g g g gg gc del t c a a t g c c c t g a g a a a g t t g g g g g a g g a t t g a g g c a g a g g a g a g g g a a t t a t g t g a c t a c t c c a c t a c c a a a g gc g t c t c c t a g t g c c t c t g a c t g a g t g g t g a t g a g c t t t (T) h r C y s T h r t g a g t t t t c t t t c t t c t t t t c a t c c c a g C T T G C A C T 707 E G l y P h e A r g L e u A r g P r o V a l A l a G l y L e u L e u S e r G G T T T C C G C C T C C G A C C T G T G G C T G G C C T G C T T T C C S e r A r g A s p P h e L e u G l y G l y L e u A l a P h e A r g V a l T C T C G G G A T T T C T T G G G T G G C C T G G C C T T C C G A G T C P h e H i s C y s T h r G l n T y r I l e A r g H i s G l y S e r L y s T T C C A C T G C A C A C A G T A C A T C A G A C A T G G A T C C A A G P r o M e t T y r T h r P r o G l u P r (o) C C C A T G T A T A C C C C C G A A C C g t g a g t a c t g t c c t c c c t a g c t a c c a g t t g c c a g g c a c a a t g a g c g c c a t c t t t t t c c t g c t g c a a g a a t g a g g t t t g g g t t c a g t t g c t g g c t g g t c c a c a g g c t a g t t g t g t a g a c t g t t t g c t c g t a g t a c t a a g a a c a t t c a t c t a t t c t g a g t t t g c t c a t c a g a g a g c a g c a t t c t t t c t g c c c a t t c c t c a t g t a g a a a g a c t g a g t c t g g c t t g g c t t a a a c c t c c t (P r) r A c c c c t c c g a g c t c t c t g t g c t t t c t g t c t t t c a g T G 843 E s p I l e C y s H i s G l u L e u L e u G l y H i s V a l P r o L e u P A C A T C T G C C A T G A G C T G T T G G G A C A T G T G C C C T T G T h e S e r A s p A r g S e r P h e A l a G l n P h e S e r G l n T T T C A G A T C G C A G C T T T G C C C A G T T T T C C C A G g t a a g g a a t g g a t t t t t t a g c c t t c t a g t t a t a g g t c t g t g a c c t g a a t t t c t c a a a t g a g t t g a g c c c a g g g a g g g g t c c t c a t g c c c t c t g c a a g a g c g g a a a c c a g g t a c a g t t c t a t g a t c c c a c c t a a a t g g g t g t g t g g t t c t g g g a a g c t a a a t a c c c a t t t g g a g a t c a a g c a t g g g g t t g g t g a a t c a c t c a t c c t c t g a a g a c t g g t a t t g g t g a a t c a c t c a t c c t c t g a a t c a t a t g t g t g t t c t t c t a c g t g a c t a a t c t c a t t t t t c c t g c t g a g a c c
5 t t t a c t a c c t g a a g c c t c c t g a t t a g t t c t g t g a t t c t a t a a c a t a t g g g c a g g a a a g a c t t g a t a a c a c a c t c a g g g t c t a t g t g g g c t g t t c t g a a g g c a t c t g g c c a c c c a t c a c c t t t t t a t g g c c a a g t a c t a g g t t g g t t c t g t g g t t c c a a t t a c a g g a a c a g a a c a g g t t c t G l u I l e G l y L e u A l a S e r a t t t t c c c c c a a t t a c a g G A A A T T G G C C T T G C C T C T 913 E L e u G l y A l a P r o A s p G l u T y r I l e G l u L y s L e u A l a C T G G G T G C A C C T G A T G A A T A C A T T G A A A A G C T C G C C T h r A C A g t a a g t c c c t t c t c t c c c t g g g t g g a t g g t g g a gt g t g c t a t a g g c t a t g g c c c t c a g g t c t t t g a a a c t c t c t a c t c c t g g a g c a g g t a t c t g g g a a a a a t a c a g t g t g t a a c t t c a a g t a t c a t c c c a g t c a a g g t g a c a c cg a t a a a a t t a a c c a t c a t a g a g t g t g c t c t c a g a t t g del I l e T y r T r p P h e T h r V a l G l u P a c t t t c c a t t c c a g A T T T A C T G G T T T A C T G T G G A G T 970 E h e G l y L e u C y s L y s G l n G l y A s p S e r I l e L y s V a l T T T G G G C T C T G C A A A C A A G G A G A C T C C A T A A A G G C A T y r G l y A l a G l y L e u L e u S e r S e r P h e G l y G l u L e u G A T G G T G C T G G G C T C C T G T C A T C C T T T G G T G A A T T A C l n A G g t a t g a c c t t c a c a g g a a c c a a g g a t a g a t t t a a a a g t g g t g g g t a c a g a a a a c c a t t a t t g t g c t t t c a a t a t t g t t g a a a c c c t a t t t g t a t c c g t t t t g a t a t g c a a c c t g g g a a c t c a t t c t c c a g t g a a t t c t c c c t t g g c t g t a g t t t t a g a t a g g c t a t a a g g t a g a a c c t a t a t g t g c a a a t a a t t t g g c t c a c a g c a t t t g g g a c t g t g g t g t a g a a g g a a t c g g g t t g a g a t g a g a g a a g g g g c a c a a a t g g c c t a t g g g a t g c a g c a g g g a a t a c t g a t c c t g a t t t a a c a g t g a t a a t a a c t t t t c a c t t a T y r C y s.l e u S e r G l u L y s P r o L y s L g g g g c c t a c a g T A C T G C T T A T C A G A G A A G C C A A A G C 1066 E e u L e u P r o L e u G l u L e u G l u L y s T h r A l a I l e T l n A T T C T C C C C C T G G A G C T G G A G A A G A C A G C C A T C C A A A s n T y r T h r V a l T h r G l u P h e G l n P r o L e u T y r T y r V A T T A C A C T G T C A C G G A G T T C C A G C C C C T G T A T T A C G a l A l a G l u S e r P h e A s n A s p A l a L y s G l u L y s V a l A
6 T G G C A G A G A G T T T T A A T G A T G C C A A G G A G A A A G T A A r (g) G g t g a g g t g g t g a c a a a g g t g a g c c a c t a g c t c t g g 1199 g g g c c t c c t g a c t g g t g c c a c t c a t c t g t g g g t g g t tgg del c aac t c c a g g a g a g t g g a c t c c a a t g t c t a c a g t a t t t g t cc a c a t a g a c a g t t t c t c t c c a t g t t c t c a c c t c t t t g g t c g t t a a g g a t c c c g t c t a g g g a g g t g t c c t g t t t c c t a a a a a a g a a g t a a a a t g c c a c t g a g a a c t c t c t t a a g a c t a c c t t t c t c c a a a t g g t g c c c c t t c a c t c (A r) g A s n P h e A l a A l a T a a g c c t g t g g t t t t g g t c t t a g G A A C T T T G C T G C C A 1200 E h r I l e P r o A r g P r o P h e S e r V a l A r g T y r A s p P r o T C A A T A C C T C G G C C C T T C T C A G T T C G C T A C G A C C C A T y r T h r G l n A r g I l e G l u V a l L e u A s p A s n T h r G l n G A C A C C C A A A G G A T T G A G G T C T T G G A C A A T A C C C A G C l n L e u L y s I l e L e u A l a A s p S e r I l e A s n S (e r) A G C T T A A G A T T T T G G C T G A T T C C A T T A A C A g t a a g t del del del del a a t t t a c a c c t t a c g a g g c c a c t c g g t t t c t c a g t a a t c g a a g a c t g t c t t t c c c t a c c a t c g c c a t a g g a g c a c c c a g c t c a t c c a a g a a g g c c c a c t t a t c c c c t a g t c t t t c a c t a g g a c a c t t g a a g a g t t t t t g c t t a t gt/ga t a t a c a a g t g g c c c a t t t t g a t g g t g t t t t t c t t t g (S) e r G l u I l e G l y I l e L e u C y s S e r A l a L e u G l u L t a g G T G A A A T T G G A A T C C T T T G C A G T G C C C T C C A G A 1316 E STOP y s I l e L y s T e r A A A T A A A G T A A A G C C A T G G A C A G A A T G T G G T C T G T C 'UTR -> A G C T G T G A A T C T G T T G A T G G A G A T C C A A C T A T T T C T T T C A T C A G A A A A A G T C C G A A A A G C A A A C C T T A A T T T G A A A T A A C A G C C T T A A A T C C T T T A C A A G A T G G A G A A A C A A C A A A T A A G T C A A A A T A A T C T G A A A T G A C A G G A T A T G A G T A C A T A C T C A A G A G C A T A A T G G T A A A T C T T T T G G G G T C A T C T T T G A T T T A G A G A T G A T A A T C C C A T A C T C T C A A T T G A G T T A A A T C A G T A A T C T G T C G C A T T T C A T C A A G A T T A A T T A A A A T T T G G G A C C T G C T T C A T T C A A G C T T C A T A T A T G C T T T G C A G A G A A C T C A T A A A G G A G C A T A T A A G G C T A A A T G T A A A A C A C A A G A C T G T C A T T A G A A T T G A A T T A T T G G G C T T A A T A T A A A T C G T
7 A A C C T A T G A A G T T T A T T T T C T A T T T T A G T T A A C T A T G A T T C C A A T T A C T A C T T T G T T A T T G T A C C T A A G T A A A T T T T C T T T A G G T C A G A A G C C C A T T A A A A T A G T T A C A A G C A T T G A A C T T C T T T A G T A T T A T A T T A A T A T A A A A A C A T T T T T G T A T G T T T T A T T G T A A T C A T A A A T A C T G C T G T A T A A G G T A A T A A A A C T C T G C A C C T A A T C C C C A T A A C T T C C A G T A T C A T T T T C C A A T T A A T T A T C A A G T C T G T T T G G G A A A C A C T T T G A G G A C A A T T T T G A T G C A G C A G A T G T T G A C T A A A G G C T T G G T T G G T A G A T A T T C A G G A A A T G T T C A C T G A A T A A A A T A A G T A A A T A C A T T A T T G A A A A G C A A A T C T G T A T A A A T G T G A A A T T T T T A T T T G T A T T A G T A A T A A A A C A T T A G T A G T T T
Lecture 3: Markov chains.
1 BIOINFORMATIK II PROBABILITY & STATISTICS Summer semester 2008 The University of Zürich and ETH Zürich Lecture 3: Markov chains. Prof. Andrew Barbour Dr. Nicolas Pétrélis Adapted from a course by Dr.
More informationIntroduction to Hidden Markov Models (HMMs)
Introduction to Hidden Markov Models (HMMs) But first, some probability and statistics background Important Topics 1.! Random Variables and Probability 2.! Probability Distributions 3.! Parameter Estimation
More informationSupporting Information
Supporting Information Das et al. 10.1073/pnas.1302500110 < SP >< LRRNT > < LRR1 > < LRRV1 > < LRRV2 Pm-VLRC M G F V V A L L V L G A W C G S C S A Q - R Q R A C V E A G K S D V C I C S S A T D S S P E
More informationMarkov Models & DNA Sequence Evolution
7.91 / 7.36 / BE.490 Lecture #5 Mar. 9, 2004 Markov Models & DNA Sequence Evolution Chris Burge Review of Markov & HMM Models for DNA Markov Models for splice sites Hidden Markov Models - looking under
More informationEvolutionary analysis of the well characterized endo16 promoter reveals substantial variation within functional sites
Evolutionary analysis of the well characterized endo16 promoter reveals substantial variation within functional sites Paper by: James P. Balhoff and Gregory A. Wray Presentation by: Stephanie Lucas Reviewed
More informationComputational Biology and Chemistry
Computational Biology and Chemistry 33 (2009) 245 252 Contents lists available at ScienceDirect Computational Biology and Chemistry journal homepage: www.elsevier.com/locate/compbiolchem Research Article
More informationNature Genetics: doi:0.1038/ng.2768
Supplementary Figure 1: Graphic representation of the duplicated region at Xq28 in each one of the 31 samples as revealed by acgh. Duplications are represented in red and triplications in blue. Top: Genomic
More informationBiology 644: Bioinformatics
A stochastic (probabilistic) model that assumes the Markov property Markov property is satisfied when the conditional probability distribution of future states of the process (conditional on both past
More informationSEQUENCE ALIGNMENT BACKGROUND: BIOINFORMATICS. Prokaryotes and Eukaryotes. DNA and RNA
SEQUENCE ALIGNMENT BACKGROUND: BIOINFORMATICS 1 Prokaryotes and Eukaryotes 2 DNA and RNA 3 4 Double helix structure Codons Codons are triplets of bases from the RNA sequence. Each triplet defines an amino-acid.
More informationRNA Processing: Eukaryotic mrnas
RNA Processing: Eukaryotic mrnas Eukaryotic mrnas have three main parts (Figure 13.8): 5! untranslated region (5! UTR), varies in length. The coding sequence specifies the amino acid sequence of the protein
More informationEVOLUTIONARY DISTANCE MODEL BASED ON DIFFERENTIAL EQUATION AND MARKOV PROCESS
August 0 Vol 4 No 005-0 JATIT & LLS All rights reserved ISSN: 99-8645 wwwjatitorg E-ISSN: 87-95 EVOLUTIONAY DISTANCE MODEL BASED ON DIFFEENTIAL EUATION AND MAKOV OCESS XIAOFENG WANG College of Mathematical
More informationO 3 O 4 O 5. q 3. q 4. Transition
Hidden Markov Models Hidden Markov models (HMM) were developed in the early part of the 1970 s and at that time mostly applied in the area of computerized speech recognition. They are first described in
More informationSupplementary Figure S1. High Resolution Melting temperature (Tm) profiles of DNA/DNA and RNA/DNA duplexes for the tree variants of IL2 gene.
Supplementary Figure S1. High Resolution Melting temperature (Tm) profiles of DNA/DNA and RNA/DNA duplexes for the tree variants of IL2 gene. a. Normalized fluorescence intensity (n.f.) of intercalated
More informationSSR ( ) Vol. 48 No ( Microsatellite marker) ( Simple sequence repeat,ssr),
48 3 () Vol. 48 No. 3 2009 5 Journal of Xiamen University (Nat ural Science) May 2009 SSR,,,, 3 (, 361005) : SSR. 21 516,410. 60 %96. 7 %. (),(Between2groups linkage method),.,, 11 (),. 12,. (, ), : 0.
More informationIllegitimate translation causes unexpected gene expression from on-target out-of-frame alleles
Illegitimate translation causes unexpected gene expression from on-target out-of-frame alleles created by CRISPR-Cas9 Shigeru Makino, Ryutaro Fukumura, Yoichi Gondo* Mutagenesis and Genomics Team, RIKEN
More informationModelling and Analysis in Bioinformatics. Lecture 1: Genomic k-mer Statistics
582746 Modelling and Analysis in Bioinformatics Lecture 1: Genomic k-mer Statistics Juha Kärkkäinen 06.09.2016 Outline Course introduction Genomic k-mers 1-Mers 2-Mers 3-Mers k-mers for Larger k Outline
More informationIs KIT locus polymorphism rs related to white belt phenotype in Krškopolje pig?
Is KIT locus polymorphism rs328592739 related to white belt phenotype in Krškopolje pig? Jernej Ogorevc, Minja Zorc, Martin Škrlep, Riccardo Bozzi, Matthias Petig, Luca Fontanesi, Marjeta Čandek-Potokar,
More informationLecture 4: Evolutionary Models and Substitution Matrices (PAM and BLOSUM)
Bioinformatics II Probability and Statistics Universität Zürich and ETH Zürich Spring Semester 2009 Lecture 4: Evolutionary Models and Substitution Matrices (PAM and BLOSUM) Dr Fraser Daly adapted from
More informationSupplementary Information for Discovery and characterization of indel and point mutations
Supplementary Information for Discovery and characterization of indel and point mutations using DeNovoGear Avinash Ramu 1 Michiel J. Noordam 1 Rachel S. Schwartz 2 Arthur Wuster 3 Matthew E. Hurles 3 Reed
More informationCellular Neuroanatomy I The Prototypical Neuron: Soma. Reading: BCP Chapter 2
Cellular Neuroanatomy I The Prototypical Neuron: Soma Reading: BCP Chapter 2 Functional Unit of the Nervous System The functional unit of the nervous system is the neuron. Neurons are cells specialized
More informationHow much non-coding DNA do eukaryotes require?
How much non-coding DNA do eukaryotes require? Andrei Zinovyev UMR U900 Computational Systems Biology of Cancer Institute Curie/INSERM/Ecole de Mine Paritech Dr. Sebastian Ahnert Dr. Thomas Fink Bioinformatics
More informationSupplemental data. Pommerrenig et al. (2011). Plant Cell /tpc
Supplemental Figure 1. Prediction of phloem-specific MTK1 expression in Arabidopsis shoots and roots. The images and the corresponding numbers showing absolute (A) or relative expression levels (B) of
More informationAS A SERVICE TO THE RESEARCH COMMUNITY, GENOME BIOLOGY PROVIDES A 'PREPRINT' DEPOSITORY
http://genomebiology.com/2002/3/12/preprint/0011.1 This information has not been peer-reviewed. Responsibility for the findings rests solely with the author(s). Deposited research article MRD: a microsatellite
More informationMotifs and Logos. Six Introduction to Bioinformatics. Importance and Abundance of Motifs. Getting the CDS. From DNA to Protein 6.1.
Motifs and Logos Six Discovering Genomics, Proteomics, and Bioinformatics by A. Malcolm Campbell and Laurie J. Heyer Chapter 2 Genome Sequence Acquisition and Analysis Sami Khuri Department of Computer
More informationTowards More Effective Formulations of the Genome Assembly Problem
Towards More Effective Formulations of the Genome Assembly Problem Alexandru Tomescu Department of Computer Science University of Helsinki, Finland DACS June 26, 2015 1 / 25 2 / 25 CENTRAL DOGMA OF BIOLOGY
More informationIntroduction to Linkage Disequilibrium
Introduction to September 10, 2014 Suppose we have two genes on a single chromosome gene A and gene B such that each gene has only two alleles Aalleles : A 1 and A 2 Balleles : B 1 and B 2 Suppose we have
More information3. SEQUENCE ANALYSIS BIOINFORMATICS COURSE MTAT
3. SEQUENCE ANALYSIS BIOINFORMATICS COURSE MTAT.03.239 25.09.2012 SEQUENCE ANALYSIS IS IMPORTANT FOR... Prediction of function Gene finding the process of identifying the regions of genomic DNA that encode
More informationLecture 7 Mutation and genetic variation
Lecture 7 Mutation and genetic variation Thymidine dimer Natural selection at a single locus 2. Purifying selection a form of selection acting to eliminate harmful (deleterious) alleles from natural populations.
More informationNucleotide substitution models
Nucleotide substitution models Alexander Churbanov University of Wyoming, Laramie Nucleotide substitution models p. 1/23 Jukes and Cantor s model [1] The simples symmetrical model of DNA evolution All
More information1. In most cases, genes code for and it is that
Name Chapter 10 Reading Guide From DNA to Protein: Gene Expression Concept 10.1 Genetics Shows That Genes Code for Proteins 1. In most cases, genes code for and it is that determine. 2. Describe what Garrod
More information6.047 / Computational Biology: Genomes, Networks, Evolution Fall 2008
MIT OpenCourseWare http://ocw.mit.edu 6.047 / 6.878 Computational Biology: Genomes, Networks, Evolution Fall 2008 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms.
More informationПредсказание и анализ промотерных последовательностей. Татьяна Татаринова
Предсказание и анализ промотерных последовательностей Татьяна Татаринова Eukaryotic Transcription 2 Initiation Promoter: the DNA sequence that initially binds the RNA polymerase The structure of promoter-polymerase
More informationInterpolated Markov Models for Gene Finding. BMI/CS 776 Spring 2015 Colin Dewey
Interpolated Markov Models for Gene Finding BMI/CS 776 www.biostat.wisc.edu/bmi776/ Spring 2015 Colin Dewey cdewey@biostat.wisc.edu Goals for Lecture the key concepts to understand are the following the
More informationLevels of genetic variation for a single gene, multiple genes or an entire genome
From previous lectures: binomial and multinomial probabilities Hardy-Weinberg equilibrium and testing HW proportions (statistical tests) estimation of genotype & allele frequencies within population maximum
More informationLecture 2: Pairwise Alignment. CG Ron Shamir
Lecture 2: Pairwise Alignment 1 Main source 2 Why compare sequences? Human hexosaminidase A vs Mouse hexosaminidase A 3 www.mathworks.com/.../jan04/bio_genome.html Sequence Alignment עימוד רצפים The problem:
More informationLecture 15: Realities of Genome Assembly Protein Sequencing
Lecture 15: Realities of Genome Assembly Protein Sequencing Study Chapter 8.10-8.15 1 Euler s Theorems A graph is balanced if for every vertex the number of incoming edges equals to the number of outgoing
More informationSupplemental Data. Hou et al. (2016). Plant Cell /tpc
Supplemental Data. Hou et al. (216). Plant Cell 1.115/tpc.16.295 A Distance to 1 st nt of start codon Distance to 1 st nt of stop codon B Normalized PARE abundance 8 14 nt 17 nt Frame1 Arabidopsis inflorescence
More informationRegulatory Sequence Analysis. Sequence models (Bernoulli and Markov models)
Regulatory Sequence Analysis Sequence models (Bernoulli and Markov models) 1 Why do we need random models? Any pattern discovery relies on an underlying model to estimate the random expectation. This model
More informationRegulatory Change in YABBY-like Transcription Factor Led to Evolution of Extreme Fruit Size during Tomato Domestication
SUPPORTING ONLINE MATERIALS Regulatory Change in YABBY-like Transcription Factor Led to Evolution of Extreme Fruit Size during Tomato Domestication Bin Cong, Luz Barrero, & Steven Tanksley 1 SUPPORTING
More informationAnnotation of Drosophila grimashawi Contig12
Annotation of Drosophila grimashawi Contig12 Marshall Strother April 27, 2009 Contents 1 Overview 3 2 Genes 3 2.1 Genscan Feature 12.4............................................. 3 2.1.1 Genome Browser:
More informationConjugate Gradient algorithm. Storage: fixed, independent of number of steps.
Conjugate Gradient algorithm Need: A symmetric positive definite; Cost: 1 matrix-vector product per step; Storage: fixed, independent of number of steps. The CG method minimizes the A norm of the error,
More informationToday s Lecture: HMMs
Today s Lecture: HMMs Definitions Examples Probability calculations WDAG Dynamic programming algorithms: Forward Viterbi Parameter estimation Viterbi training 1 Hidden Markov Models Probability models
More informationDNA Feature Sensors. B. Majoros
DNA Feature Sensors B. Majoros What is Feature Sensing? A feature is any DNA subsequence of biological significance. For practical reasons, we recognize two broad classes of features: signals short, fixed-length
More informationTable S1. Summary statistics from weighted, grouped logistic regressions of SNP frequency against latitude (0.060)
Table S1. Summary statistics from weighted, grouped logistic regressions of SNP frequency against latitude. Gene SNP number β 0 (± S. E.), q-value* pseudo-r 2 β 1# (± S. E.) BRh opsin 6 0.133 (0.046) 0.008
More informationComputational Design of New and Recombinant Selenoproteins
Computational Design of ew and Recombinant Selenoproteins Rolf Backofen and Friedrich-Schiller-University Jena Institute of Computer Science Chair for Bioinformatics 1 Computational Design of ew and Recombinant
More informationSUPPLEMENTARY INFORMATION
SUPPLEMENTARY INFORMATION Extensive genomic and transcriptional diversity identified through massively parallel DNA and RNA sequencing of eighteen Korean individuals Young Seok Ju, Jong-Il Kim, Sheehyun
More informationLecture Notes: BIOL2007 Molecular Evolution
Lecture Notes: BIOL2007 Molecular Evolution Kanchon Dasmahapatra (k.dasmahapatra@ucl.ac.uk) Introduction By now we all are familiar and understand, or think we understand, how evolution works on traits
More informationGoodbye to Large G? Marek Karliner. Cambridge University and Tel Aviv University. with John Ellis, hep-ph/ ph/
Goodbye to Large G? Marek Karliner Cambridge University and Tel Aviv University with John Ellis, hep-ph/0501115 ph/0501115 LC2005, Cairns, 7/2005 nucleon spin quark helicities: q = Z 1 0 h q (x) q (x)
More informationBio nformatics. Lecture 16. Saad Mneimneh
Bio nformatics Lecture 16 DNA sequencing To sequence a DNA is to obtain the string of bases that it contains. It is impossible to sequence the whole DNA molecule directly. We may however obtain a piece
More informationSOLiD Data 2 Base Encoding Sequencing by Oligonucleotide Ligation and Detection
SOLiD Data 2 Base Encoding Sequencing by Oligonucleotide Ligation and Detection What is two base encoding? Rather than a probe, reading out the single base present at the 5 th position, a two base encoded
More informationThe Computational Problem. We are given a sequence of DNA and we wish to know which subsequence or concatenation of subsequences constitutes a gene.
GENE FINDING The Computational Problem We are given a sequence of DNA and we wish to know which subsequence or concatenation of subsequences constitutes a gene. The Computational Problem Confounding Realities:
More informationDepartment of Forensic Psychiatry, School of Medicine & Forensics, Xi'an Jiaotong University, Xi'an, China;
Title: Evaluation of genetic susceptibility of common variants in CACNA1D with schizophrenia in Han Chinese Author names and affiliations: Fanglin Guan a,e, Lu Li b, Chuchu Qiao b, Gang Chen b, Tinglin
More informationHidden Markov Models (I)
GLOBEX Bioinformatics (Summer 2015) Hidden Markov Models (I) a. The model b. The decoding: Viterbi algorithm Hidden Markov models A Markov chain of states At each state, there are a set of possible observables
More informationROBI POLIKAR. ECE 402/504 Lecture Hidden Markov Models IGNAL PROCESSING & PATTERN RECOGNITION ROWAN UNIVERSITY
BIOINFORMATICS Lecture 11-12 Hidden Markov Models ROBI POLIKAR 2011, All Rights Reserved, Robi Polikar. IGNAL PROCESSING & PATTERN RECOGNITION LABORATORY @ ROWAN UNIVERSITY These lecture notes are prepared
More informationFUNDAMENTALS OF MOLECULAR EVOLUTION
FUNDAMENTALS OF MOLECULAR EVOLUTION Second Edition Dan Graur TELAVIV UNIVERSITY Wen-Hsiung Li UNIVERSITY OF CHICAGO SINAUER ASSOCIATES, INC., Publishers Sunderland, Massachusetts Contents Preface xiii
More informationSequence Analysis, WS 14/15, D. Huson & R. Neher (this part by D. Huson) February 5,
Sequence Analysis, WS 14/15, D. Huson & R. Neher (this part by D. Huson) February 5, 2015 31 11 Motif Finding Sources for this section: Rouchka, 1997, A Brief Overview of Gibbs Sapling. J. Buhler, M. Topa:
More informationPROTEIN SYNTHESIS INTRO
MR. POMERANTZ Page 1 of 6 Protein synthesis Intro. Use the text book to help properly answer the following questions 1. RNA differs from DNA in that RNA a. is single-stranded. c. contains the nitrogen
More informationc. Covalent bonds involve the transfer of electrons between atoms, and ionic bonds involve the sharing of neutrons between atoms. d.
Final Exam Review *Disclaimer I do not have a PHD. Everything here is just speculation for what I think will be on your test. Your professor is going over everything that will be on your test 12/02/17
More informationSupplementary Fig. 1. Expression profiles of gap genes in large and small embryos. Shown are the mean mrna expression profiles (as a function of
Supplementary Fig. 1. Expression profiles of gap genes in large and small embryos. Shown are the mean mrna expression profiles (as a function of fractional embryo length ξ = x/l) of the indicated six gap
More informationRNA- seq read mapping
RNA- seq read mapping Pär Engström SciLifeLab RNA- seq workshop October 216 IniDal steps in RNA- seq data processing 1. Quality checks on reads 2. Trim 3' adapters (opdonal (for species with a reference
More informationCover Requirements: Name of Unit Colored picture representing something in the unit
Name: Period: Cover Requirements: Name of Unit Colored picture representing something in the unit Biology B1 1 Target # Biology Unit B1 (Genetics & Meiosis) Learning Targets Genetics & Meiosis I can explain
More informationLecture 15. Saad Mneimneh
Computat onal Biology Lecture 15 DNA sequencing Shortest common superstring SCS An elegant theoretical abstraction, but fundamentally flawed R. Karp Given a set of fragments F, Find the shortest string
More informationDNA sequencing. Bad example (repeats) Lecture 15. Shortest common superstring SCS. Given a set of fragments F,
Computat onal Biology Lecture 15 DNA sequencing Shortest common superstring SCS An elegant theoretical abstraction, but fundamentally flawed R. Karp Given a set of fragments F, Find the shortest string
More informationEukaryotic vs. Prokaryotic genes
BIO 5099: Molecular Biology for Computer Scientists (et al) Lecture 18: Eukaryotic genes http://compbio.uchsc.edu/hunter/bio5099 Larry.Hunter@uchsc.edu Eukaryotic vs. Prokaryotic genes Like in prokaryotes,
More informationTE content correlates positively with genome size
TE content correlates positively with genome size Mb 3000 Genomic DNA 2500 2000 1500 1000 TE DNA Protein-coding DNA 500 0 Feschotte & Pritham 2006 Transposable elements. Variation in gene numbers cannot
More informationStochastic processes and
Stochastic processes and Markov chains (part I) Wessel van Wieringen w.n.van.wieringen@vu.nl wieringen@vu nl Department of Epidemiology and Biostatistics, VUmc & Department of Mathematics, VU University
More informationHarvard Life Science Outreach December 7, 2017 Measuring ecosystem carbon fluxes using eddy covariance data ACTIVITIES I. NAME THAT ECOSYSTEM!
Harvard Life Science Outreach December 7, 2017 Measuring ecosystem carbon fluxes using eddy covariance data ACTIVITIES I. NAME THAT ECOSYSTEM! Objective: Distinguish ecosystems (tropical forest vs. temperate
More informationMultiple Choice Review- Eukaryotic Gene Expression
Multiple Choice Review- Eukaryotic Gene Expression 1. Which of the following is the Central Dogma of cell biology? a. DNA Nucleic Acid Protein Amino Acid b. Prokaryote Bacteria - Eukaryote c. Atom Molecule
More informationobjective functions...
objective functions... COFFEE (Notredame et al. 1998) measures column by column similarity between pairwise and multiple sequence alignments assumes that the pairwise alignments are optimal assumes a set
More informationBLAST. Varieties of BLAST
BLAST Basic Local Alignment Search Tool (1990) Altschul, Gish, Miller, Myers, & Lipman Uses short-cuts or heuristics to improve search speed Like speed-reading, does not examine every nucleotide of database
More informationCSE : Computational Issues in Molecular Biology. Lecture 6. Spring 2004
CSE 397-497: Computational Issues in Molecular Biology Lecture 6 Spring 2004-1 - Topics for today Based on premise that algorithms we've studied are too slow: Faster method for global comparison when sequences
More informationUnit 3 - Molecular Biology & Genetics - Review Packet
Name Date Hour Unit 3 - Molecular Biology & Genetics - Review Packet True / False Questions - Indicate True or False for the following statements. 1. Eye color, hair color and the shape of your ears can
More informationG4120: Introduction to Computational Biology
ICB Fall 2009 G4120: Introduction to Computational Biology Oliver Jovanovic, Ph.D. Columbia University Department of Microbiology & Immunology Copyright 2008 Oliver Jovanovic, All Rights Reserved. Genome
More informationHMMs and biological sequence analysis
HMMs and biological sequence analysis Hidden Markov Model A Markov chain is a sequence of random variables X 1, X 2, X 3,... That has the property that the value of the current state depends only on the
More informationTranslation Part 2 of Protein Synthesis
Translation Part 2 of Protein Synthesis IN: How is transcription like making a jello mold? (be specific) What process does this diagram represent? A. Mutation B. Replication C.Transcription D.Translation
More informationAnalytic Pattern Matching: From DNA to Twitter. AxA Workshop, Venice, 2016 Dedicated to Alberto Apostolico
Analytic Pattern Matching: From DNA to Twitter Wojciech Szpankowski Purdue University W. Lafayette, IN 47907 June 19, 2016 AxA Workshop, Venice, 2016 Dedicated to Alberto Apostolico Joint work with Philippe
More informationHidden Markov Models (HMMs) November 14, 2017
Hidden Markov Models (HMMs) November 14, 2017 inferring a hidden truth 1) You hear a static-filled radio transmission. how can you determine what did the sender intended to say? 2) You know that genes
More information(Lys), resulting in translation of a polypeptide without the Lys amino acid. resulting in translation of a polypeptide without the Lys amino acid.
1. A change that makes a polypeptide defective has been discovered in its amino acid sequence. The normal and defective amino acid sequences are shown below. Researchers are attempting to reproduce the
More informationAP CHEMISTRY 2008 SCORING GUIDELINES (Form B)
AP CHEMISTRY 2008 SCORING GUIDELINES (Form B) Question 2 A(g) + B(g) C(g) + D(g) For the gas-phase reaction represented above, the following experimental data were obtained. Experiment Initial [A] (mol
More informationSUPPORTING INFORMATION FOR. SEquence-Enabled Reassembly of β-lactamase (SEER-LAC): a Sensitive Method for the Detection of Double-Stranded DNA
SUPPORTING INFORMATION FOR SEquence-Enabled Reassembly of β-lactamase (SEER-LAC): a Sensitive Method for the Detection of Double-Stranded DNA Aik T. Ooi, Cliff I. Stains, Indraneel Ghosh *, David J. Segal
More informationBiology. Biology. Slide 1 of 26. End Show. Copyright Pearson Prentice Hall
Biology Biology 1 of 26 Fruit fly chromosome 12-5 Gene Regulation Mouse chromosomes Fruit fly embryo Mouse embryo Adult fruit fly Adult mouse 2 of 26 Gene Regulation: An Example Gene Regulation: An Example
More informationSupplementary Figure 1
Supplementary Figure 1 Supplementary Figure 1. HSP21 expression in 35S:HSP21 and hsp21 knockdown plants. (a) Since no T- DNA insertion line for HSP21 is available in the publicly available T-DNA collections,
More informationComposite gluino at the LHC
Composite gluino at the LHC Thomas Grégoire University of Edinburgh work in progress with Ami Katz What will we see at the LHC? Natural theory of EWSB? Supersymmetry? Higgs as PGSB (LH, RS-like)? Extra-
More informationLifted Polynomials Over F 16 and Their Applications to DNA Codes
Filomat 27:3 (2013), 459 466 DOI 10.2298/FIL1303459O Published by Faculty of Sciences and Mathematics, University of Niš, Serbia Available at: http://www.pmf.ni.ac.rs/filomat Lifted Polynomials Over F
More informationPhylogenetics. Andreas Bernauer, March 28, Expected number of substitutions using matrix algebra 2
Phylogenetics Andreas Bernauer, andreas@carrot.mcb.uconn.edu March 28, 2004 Contents 1 ts:tr rate ratio vs. ts:tr ratio 1 2 Expected number of substitutions using matrix algebra 2 3 Why the GTR model can
More informationGlobal Mineral Sand Industry: Trends and Opportunities ( ) February 2014
Global Mineral Sand Industry: Trends and Opportunities (2013-2018) February 2014 Scope of the Report The report titled Global Mineral Sand Industry: Trends and Opportunities (2013-2018) provides an in-depth
More informationAP CHEMISTRY LAB RATES OF CHEMICAL REACTIONS (II)
PURPOSE: Observe a redox reaction. AP CHEMISTRY LAB RATES OF CHEMICAL REACTIONS (II) Apply graphing techniques to analyze data. Practice computer skills to develop a data table. Determine the order of
More informationthat a w r o n g has been committed recognize where the responsibility
Z- XX q q J U Y ' G G, w, 142 16, U, j J ' B ' B B k - J, 5 6 5:30 7:00, $125;, -, 10:00 $300;, 4:00, 6:00, B C U 000 2:00, J $125; ' :, q C, k w G x q k w w w w q ' 60,, w q, w w k w w - G w z w w w C
More informationStochastic processes and
Stochastic processes and Markov chains (part II) Wessel van Wieringen w.n.van.wieringen@vu.nl wieringen@vu nl Department of Epidemiology and Biostatistics, VUmc & Department of Mathematics, VU University
More informationWhat Is Conservation?
What Is Conservation? Lee A. Newberg February 22, 2005 A Central Dogma Junk DNA mutates at a background rate, but functional DNA exhibits conservation. Today s Question What is this conservation? Lee A.
More informationMore Codon Usage Bias
.. CSC448 Bioinformatics Algorithms Alexander Dehtyar.. DA Sequence Evaluation Part II More Codon Usage Bias Scaled χ 2 χ 2 measure. In statistics, the χ 2 statstic computes how different the distribution
More information<excelunusual.com> Building a live Excel clock by George Lungu
Building a live Excel clock by George Lungu Here are a few date and time functions in VBA: Now Date Time Timer TimeValue() DateValue() Current date and time. Example: 7/5/00 3:16:38 PM returned by Now
More informationMRC-Holland MLPA. Description version 14; 21 January 2015
SALSA MLPA probemix P229-B2 OPA1 Lot B2-0412. As compared to version B1-0809, two reference probes and the 88 and 96 nt control fragments have been replaced (QDX2). The OPA1 gene product is a nuclear-encoded
More informationReading Assignments. A. Genes and the Synthesis of Polypeptides. Lecture Series 7 From DNA to Protein: Genotype to Phenotype
Lecture Series 7 From DNA to Protein: Genotype to Phenotype Reading Assignments Read Chapter 7 From DNA to Protein A. Genes and the Synthesis of Polypeptides Genes are made up of DNA and are expressed
More informationReview sheet for the material covered by exam III
Review sheet for the material covered by exam III WARNING: Like last time, I have tried to be complete, but I may have missed something. You are responsible for all the material discussed in class. This
More informationBioinformatics (GLOBEX, Summer 2015) Pairwise sequence alignment
Bioinformatics (GLOBEX, Summer 2015) Pairwise sequence alignment Substitution score matrices, PAM, BLOSUM Needleman-Wunsch algorithm (Global) Smith-Waterman algorithm (Local) BLAST (local, heuristic) E-value
More informationGenomes Comparision via de Bruijn graphs
Genomes Comparision via de Bruijn graphs Student: Ilya Minkin Advisor: Son Pham St. Petersburg Academic University June 4, 2012 1 / 19 Synteny Blocks: Algorithmic challenge Suppose that we are given two
More informationSupplementary Information
Supplementary Information LINE-1-like retrotransposons contribute to RNA-based gene duplication in dicots Zhenglin Zhu 1, Shengjun Tan 2, Yaqiong Zhang 2, Yong E. Zhang 2,3 1. School of Life Sciences,
More informationChapter 12. Genes: Expression and Regulation
Chapter 12 Genes: Expression and Regulation 1 DNA Transcription or RNA Synthesis produces three types of RNA trna carries amino acids during protein synthesis rrna component of ribosomes mrna directs protein
More informationSUPPLEMENTARY INFORMATION
DOI:.8/NCHEM. Conditionally Fluorescent Molecular Probes for Detecting Single Base Changes in Double-stranded DNA Sherry Xi Chen, David Yu Zhang, Georg Seelig. Analytic framework and probe design.. Design
More information