C h a p t e r 2 A n a l y s i s o f s o m e S e q u e n c e... methods use different attributes related to mis sense mutations such as

Size: px
Start display at page:

Download "C h a p t e r 2 A n a l y s i s o f s o m e S e q u e n c e... methods use different attributes related to mis sense mutations such as"

Transcription

1

2 C h a p t e r 2 A n a l y s i s o f s o m e S e q u e n c e Introduction smentionedinchapter1,severalmethodsareavailabletoclassifyhuman missensemutationsintoeitherbenignorpathogeniccategoriesandthese methodsusedifferentattributesrelatedtomissensemutationssuchas sequencebased(ngandhenikoff,2001,2003;ferrercostaetal.,2004;thomasetal., 2004;Capriottietal.,2006;Tianetal.,2007;Calabreseetal.,2009),evolutionarybased (Fayetal.,2001),physiochemicalproperties(StoneandSidow,2005),combinationof structuralandevolutionaryinformation(sunyaevetal.,2001;chasmanandadams, 2001;FerrerCostaetal.,2002;BrombergandRost,2007;Ramenskyetal.,2002;Stitziel etal.,2004;reumersetal.,2005;cavalloandmartin,2005;baoetal.,2005;ferrer Costaetal.,2005;Yueetal.,2006;Matheetal.,2006;Lietal.,2009;Adzhubeietal., 2010). Recently,aSVMbasedmethodhasbeendevelopedwhichusesanewsetoffeatures which refer to as neutraldisease missense mutation discriminatory (NDMSMD) features.thendmsmdfeaturesincludepositionspecificprobabilityscorescalculated usingdirichletmixtureofpriorinformationaswellasgribskov sapproach,predicted solventaccessibilityandsecondarystructuralfeatures,blosum62substitutionscores andchangeinfreeenergychangesassociatedwithbothwildtypeandmutantamino acidresidues.beforeutilizingthese10featuresinsvm,thedistributionpatternsofeach Page38

3 C h a p t e r 2 A n a l y s i s o f s o m e S e q u e n c e... ofthefeaturesinknowndiseaseandneutralmissensemutationswerestudiedandthe detailsaregiveninthischapter. 2.2MaterialsandMethods The Neutral Disease MisSense Mutation Discriminatory (NDMSMD) Features ThetenNDMSMDfeaturesincludesixpositionspecificfeatures,twoproteinstructure basedandtwoaminoacidresiduebasedfeatures(table2.1).thedetailsofcalculation ofthesefeaturesaregivenbelow ThePositionSpecificFeatures Thesefeaturescorrespondtopositionspecificpreferencesofaminoacidresiduesatthe missensemutationsites.oneofthefeaturesisthepositionspecificprobabilityscore andtheotheristhegribskov sscore(gribskovetal.,1987). scores (Tianetal.,2007)werecalculatedusingDirichletmixtureasthepriorinformation availableintheformof20componentdirichletmixtures(dms)aswellastheobserved aminoacidfrequenciesfromthemultiplesequencealignmentofthetargetprotein sequenceanditshomologues(sjölanderetal.,1996,tianetal.,2007).thedetailsof calculationaregivenbelow. Page39

4 C h a p t e r 2 A n a l y s i s o f s o m e S e q u e n c e... Table2.1:Thelistof10NDMSMDfeaturesanalyzedinthepresentstudy. Features Positionspecific Sequence & Structure based Amino acid residue based Description Positionspecificprobabilityscoreofthewildtypeaminoacid Positionspecificprobabilityscoreofthemutanttypeaminoacid residues( Differencebetweenposition specificprobabilitiesscoreofthe Wild and the Mutant amino acid residues i.e., diff Gribskov sscoreofthewildtypeaminoacidresidues * Gribskov s Score of the mutanttype amino acid residues * DifferencebetweenGribskov sscoreofthewildtypeandthe mutantaminoacidresidues i.e.,diff( ) Solventaccessibilitystatusoftheaminoacidatthemutation site&&;1ifitisburied(solventaccessiblesurfaceareais<10%); 0ifitisexposed. Secondarystructuralstatusoftheaminoacidresidueatthe mutationsite**;1ifitisapartofalphahelix;2ifitisapartof extendedstrandor0forothertypes. Difference in transfer free energy values of wild type and mutatedtypefrominsidetosurfaceoftheprotein@@ BLOSUM62Substitutionscoresfor scores were calculated using the perl script psap.pl available at *ThesescoreswerecalculatedusingthePROPHECYprogramavailableintheEMBOSS suite(riceetal.,2000). &&Solventaccessibility calculatedfromaccpro4.0(chengetal.,2005). **Secondarystructureprediction frominsidetooutsideofaglobularprotein(janin, 1979). Page40

5 C h a p t e r 2 A n a l y s i s o f s o m e S e q u e n c e... isanestimationofthepositionspecificprobabilityofaminoacid a atthemis sensemutationsite b ofagivenhumanproteinsequence(query)andiscalculatedas follows (2.1) Where (2.2) Where isthemixturecoefficientofthecomponent m fromadirichletmixtureof priors, isthecorrespondingalphaparameter, t isthetotalnumberofcomponents whichis20, isthebetafunction,istheaminoacidcountvectorthatisobservedin themultiplesequencealignmentofthehumansequenceanditshomologues. And in the equation 2.2 is calculated by multiplication of total number of sequences S with (2.3) Where issumofthehenikoff spositionspecificweights(henikoffandhenikoff, 1996)calculatedforaminoacid a ataparticularposition b withoutgapsinthe multiplesequencealignmentofthequeryanditshomologues. (2.4) Page41

6 C h a p t e r 2 A n a l y s i s o f s o m e S e q u e n c e... Where isthefrequencyofanyaminoacid a observedinthenature(jonesetal., 1992)andistheweightedcolumnobtainedfromthemultiplesequencealignment ofthequeryanditshomologueswithnogapsandiscalculatedasgivenbelowand calculatedasinequation2.5exceptitcontaingaps (2.5) Where N isthetotalnumberofalignedsequencesand (2.6) Where isthesequenceweightattheposition b ; isthenumberofdifferent aminoacidresiduetypesthatarepresentatposition b inthealignmentand isthe numberoftimesthataparticularresiduefromqueryproteinappearsinthealignment column. Gribskov sscore(gribskovetal.,1987)wascalculatedusingtheprophecy programavailableintheembosssuite(riceetal.,2000)usingmultiplesequence alignmentsusedforcalculating.theisdefinedasaweightedaverageofthe similarityscoresforanaminoacid a attheposition b, (2.7) Page42

7 C h a p t e r 2 A n a l y s i s o f s o m e S e q u e n c e... WhereSM(a,aa)isthesimilaritymatrix(BLOSUM62)scoreforaminoacid a replacing anotheraminoacid aa and (2.8) Where isdefinedasthenumberofaminoacidresidues aa dividedbythetotal numberofaminoacidresiduespresentataposition b includinggapspresentinthe alignmentcolumn. Themultiplesequencealignments(MSAs)ofhumanproteinwithitshomologueswere calculatedusingclustalw2(chennaetal.,2003). The humanproteinhomologswere identifiedusingpsiblastsearchesagainstthenonredundant(nr)databasewithane value 1e15 (Mooney and Klein, 2002) with threefour rounds of iteration until convergenceisreached. Whilesearchingforhomologs,onlytherelevantdomaincontainingagivenmutation was given as the query. Domain boundaries were identified using ProDom ( Bru et al., 2005). From PSI BLASThits,humanproteinswereremoved.Furthermore,thehomologuesshorterthan 70%ofthequerysequencelengthwerealsoremovedbeforedoingtheMSAswith CLUSTALW2. Page43

8 C h a p t e r 2 A n a l y s i s o f s o m e S e q u e n c e StructurebasedFeatures Twostructurebasedfeatureswereconsidered.Theyarethesolventaccessibilitystatus andsecondarystructuralstatus.accpro(chengetal.,2005)wasusedtopredictthe solventaccessibilityvaluesofaminoacidresidues.predictedsolventaccessibilityvalue< 10%wastakenasindicativeofaburiedresiduewhereasavalue>10%wasconsidered asexposedresidue.ssprov4.5ofthescratchsuite(chengetal.,2005)wasusedto predictthesecondarystructurestatusoftheaminoacidresidues Freeenergyandpairwisesubstitutionscores Thedifferenceintransferfreeenergyvalues(Table2.2)frominsidetosurfaceofthe protein(janin,1979)ofwildtypeandmutantaminoacidswasusedasoneofthetwo amino acid features. The other feature was the pairwise substitution scores of BLOSUM62substitutionmatrix(HenikoffandHenikoff,1992)(Figure2.1)forwildtype andmutantresidues. 2.3Thedatasetofknownneutralanddiseasemissensemutations As mentioned in Chapter 1, various databases are available for disease/neutral variations.forthepurposesofthepresentanalysisandalsoforthetrainingandtesting ofsvmasreportedinchapter3,acurateddatasetknownashumvardataset(capriotti etal.,2006;tianetal.,2007;adzhubeietal.,2010)wasused.thisdatasethasbeen derivedfromtheswissprot/uniprotdatabase(capriottietal.,2006)andcomprisesof Page44

9 C h a p t e r 2 A n a l y s i s o f s o m e S e q u e n c e... two categories of missense mutations viz., Disease and Polymorphism. This datasetatthetimeofthestudycomprisedof13,032 Disease from1111proteinsand 8,946 Polymorphism from3484proteins.theresultsreportedinthepresentstudyas wellasintheotherchapterspertaintothisdataset.recently,thedatasethasbeen revised and comprises of 12,598 Disease from 1101 proteins and 8,638 Polymorphism from 3399 proteins. All the missense mutations designated as PolymorphismwereconsideredasNeutralinthepresentstudy.Themainreasonfor usingthisdatasetinthepresentstudystemsfromthefactthatthisdatasethasbeen usedasabenchmarksetbyothergroups(capriottietal.,2006;tianetal.,2007; Adzhubeietal.,2010)whiledevelopingtheirmethodsforpredictionofpathogenic effectsofmissensemutations.thehumvardatasetisavailableattwowebsites:(a) ftp://genetics.bwh.harvard.edu/datasets/pph2/humvar tar.gz (Azdubehi et al., 2010)and(b) al.,2006).theoneusedinthepresentthesiswasdownloadedfromthesite(a). Page45

10 C h a p t e r 2 A n a l y s i s o f s o m e S e q u e n c e... Table2.2:Transferfreeenergyvaluesof20aminoacidsfrominsidetooutsideofa protein(takenfromjanin,1979). Aminoacids Alanine Arginine Asparagine Aspartate Cysteine Glutamine Glutamate Glycine Histidine Isoleucine Leucine Lysine Methionine Phenylalanine Proline Serine Threonine Trypsin Tyrosine Valine Freeenergyoftransferfrominsidetooutside ofaprotein Page46

11 C h a p t e r 2 A n a l y s i s o f s o m e S e q u e n c e... A R N D C Q E G H I L K M F P S T W Y V B Z X * A R N D C Q E G H I L K M F P S T W Y V B Z X * Figure2.1:BLOSUM62aminoacidsubstitutionscores(HenikoffandHenikoff,1992). Page47

12 C h a p t e r 2 A n a l y s i s o f s o m e S e q u e n c e ResultsandDiscussion 2.4.1Distributionofpositionspecificfeatures Scores Itshouldbenotedthatthescores andforagivenaminoacidresidue a at position b essentiallyindicateitspreferenceorotherwiseinaprotein.preferenceis indicatedbyscoresof >0.05and>0.0andnonpreferenceisindicatedby scoresof <0.05and<0.0.Needlesstomentionallwildtype(WT)amino acidresiduesaswellasneutralmutationsshouldthereforeshowscorescorresponding topreferenceswhereasdiseasemutationsshouldshowscorescorrespondingtonon preferences. Thedistributionofscoresfordiseasemutationsaswellasneutralmutationsfromthe HumVardatasetshasbeeninvestigated.(Figure2.2(a)).Amongthediseasemutations, about91%inhumvardatasetsshowedscores<0.05indicatingtheexcellentutilityof thesepositionspecificscoresforpredictingdiseasemutations.however,inthecaseof neutralmutations,only43%showedscores>0.05.thisindicatedthatnotalltheneutral mutationsarethepreferredsubstitutions.tomakesurethatscoresindicatepreferred aminoacidresiduescorrectly,ihaveinvestigatedthedistributionof scoresforthe wildtype(wt)aminoacidresiduesfromthehumvardatasets(figure2.2(b)).more than 95% of the WTs show scores >0.05. The fact that about half of the neutral Page48

13 C h a p t e r 2 A n a l y s i s o f s o m e S e q u e n c e... mutations of the HumVar dataset show scores <0.05 (indicative nonpreference) indicate that these may be the mutations occurring infrequently though their phenotypic effect is benign. This could also be due to annotation mistakes in the SWISSPROTdatabasefromwhereHumVardatasetwascurated. Thedistributionofdifferencesbetween scoresofwildtype(wt)andmutanttype (MT)(Figure2.2(c))wasalsoexaminedwhichshowedtwodistinct(butoverlapping) distributionsfordiseaseandbenignmutations Scores Ascorelessthan0indicatesthatresidue a atposition b isnotpreferredand henceisindicativeofapathogenicmutation.ontheotherhand,ascore>0.0indicates preferenceandisindicativeofaneutralmutation.fromfigure2.3(a),itcanbeseen that77%ofthediseasemutationscorrespondtoscores<0.0andabout50%ofneutral mutations have scores >0.0. Although this result indicates thatscore is less discriminating than however, it is still a good discriminator. For comparison purposes,distributionofwildtypeaminoacidresidueswasexamined(figure2.3(b)). About 90% of them showed values >0 further indicating utility of the score. The differencescoresbetweenofwildtypeandmutanttype(figure2.3(c))show distinctbutoverlappingdistributionfordiseaseandbenignmutations. Page49

14 C h a p t e r 2 A n a l y s i s o f s o m e S e q u e n c e % 90% 91% 80% 70% 60% 57% 50% 40% 43% 30% 20% 10% 9% 0% MT<0.05 MT>0.05 NEUTRAL DISEASE Figure2.2(a):Thedistributionof scoreofmutatedaminoacid(mt)fordiseaseand benignmutationsinthehumvardatasets. Page50

15 C h a p t e r 2 A n a l y s i s o f s o m e S e q u e n c e WT<0.05 WT>0.05 Percentage Figure2.2(b):Thedistributionof scoreofwildtypeaminoacid(wt)forhumvar datasets. Page51

16 C h a p t e r 2 A n a l y s i s o f s o m e S e q u e n c e percentage N D diffpab(wtmt) Figure2.2(c):Distributionofdifferencein (WT)and (MT)scoresformissense mutationsinhumvardataset.n=neutralandd=diseasemutations. Page52

17 C h a p t e r 2 A n a l y s i s o f s o m e S e q u e n c e % 90% 80% 77% 70% 60% 50% 40% 50% 47% 30% 20% 23% 10% 0% MT<0 MT>0 NEUTRAL DISEASE Figure2.3(a):DistributionofscoresforDisease(D)andNeutral(N)mutations. N=NeutralandD=Diseasemutations. Page53

18 C h a p t e r 2 A n a l y s i s o f s o m e S e q u e n c e WT<0 WT>0 Percentage Figure2.3(b):Distributionofscorescalculatedforthewildtype(WT)amino acidresiduesinthehumvardataset. Page54

19 C h a p t e r 2 A n a l y s i s o f s o m e S e q u e n c e percentage N D diffgab(wtmt) Figure2.3(c):Distributionofdifferencein(WT)and(MT)scoresforthe mutationsinhumvardataset(n=neutralandd=diseasemutations). Page55

20 C h a p t e r 2 A n a l y s i s o f s o m e S e q u e n c e StructureandAminoacidbasedfeatures Solventaccessibilitystatusofthemissensemutations Asalreadymentioned,thesolventaccessibilityvaluesofthemissensemutationswere predictedusingaccproprogram(chengetal.,2005)availableasapartofscratch package.ifthepredicted%accessiblesurfaceareaislessthan10thenthemutationwas consideredasburiedandifthe%accessiblesurfaceareaismorethan10thenthe mutationwasconsideredexposed. Figure2.4showsthedistributionoftheHumVarmutationsintoburiedandexposed categories.about77%oftheneutralmutationsarepredictedtobeexposedtosolvent indicatingthatthemutationsoftheexposedresidueshaveaneutraleffect.among diseasemutationsabout52%arepredictedtobeburiedandtheremaining(48%)are predictedtobeexposedtosolvent.onewouldhaveexpectedamajorityofdisease causingmutationstoburiedaschangesattheburiedpositionwoulddestabilizeprotein structureleadingtopathogeniceffects(wangandmoult,2001;sunyaevetal.,2000; GongandBlundell,2010).However,thecurrentanalysisindicatesthatevenmutations attheexposedpositionstoocanleadtopathogeniceffects(kowarschetal.,2010).this resultisnotentirelysurprisinggiventhefactthatsomeofthesurfacepositionscould form parts of binding surfaces involved in intermolecular interactions and hence mutationsatsuchfunctionallyimportantsiteswouldinvariablyyieldpathogeniceffects. Page56

21 C h a p t e r 2 A n a l y s i s o f s o m e S e q u e n c e percentage NEUTRAL DISEASE Buried Exposed StatusofSolventAccessibility Figure 2.4: The distribution of status of mutations as buried or exposed for the mutationsinthehumvardatasets. Page57

22 C h a p t e r 2 A n a l y s i s o f s o m e S e q u e n c e Secondarystructuralstatusofthemissensemutations Distributionofdiseaseandneutralmutationsinthethreesecondarystructurestates viz.,helices,strandsandcoilregionsareshowninfigure2.5.ascanbeseen, diseaseaswellasneutralmutationsdoshowsomelocalizationtendency.forexample, diseasemutationsarepredictedtobelocalizedmoreonhelicesandbetastrandsthan neutral mutations. Comparatively, coil regions harbor more number of neutral mutationsthandiseasemutations Aminoacidbasedfeatures. Differenceinfreeenergytransfervaluesbetweenthewildtypeandthemutationwas calculatedforeachentryinthehumvardataset.asmentionedalready,thefreeenergy transfervaluesofthe20aminoacidresidues(table2.2)weretakenfromjanin(1979). Althoughforeveryintervalstartingfrom2.0to+2.0(Figure2.6),bothdiseaseand neutral mutations can be seen, however, their proportions vary along the entire spectrum. The proportion of disease mutations increases at every interval for differences greater than 0.2 indicating energetically unfavoured situations in those cases. DistributionofBLOSUM62substitutionscoresforbothdiseaseandneutralmutations (Figure2.7)werealsoexamined.Ascoreof<1and>+1indicatespreferredandnon Page58

23 C h a p t e r 2 A n a l y s i s o f s o m e S e q u e n c e percentage NEUTRAL DISEASE 10 0 Helix Strand Rest StatusofSecondaryStructure Figure2.5:ThedistributionofsecondarystructurestatusfortheHumVardatasets. Page59

24 C h a p t e r 2 A n a l y s i s o f s o m e S e q u e n c e perecntage NEUTRAL DISEASE Difffreeenergy(WTMT) Figure2.6:Thedistributionoftransferfreeenergyvaluesfrominsidetosurfaceofthe proteinfordifferenceinfreeenergyfrominsidetooutsideoftheprotein. Page60

25 C h a p t e r 2 A n a l y s i s o f s o m e S e q u e n c e percentage Neutral Disease <1 >1 BLOSUM62SCORE Figure 2.7: The BLOSUM62 substitution score distribution for disease and neutral mutations Page61

26 C h a p t e r 2 A n a l y s i s o f s o m e S e q u e n c e... preferred substitutions respectively. Of the disease mutations, 65% correspond to scores<1whereas60%ofneutralmutationscorrespondtoscores>+1.thisresult reconfirms the observations made by earlier reports (Cargill et al., 1999; Balasubramanianetal.,2005). 2.5Summary Inthischapter,Ihaveinvestigatedsomesequenceandstructurebasedfeaturestobe usedinthesvmbasedmethodrelatedtothediseaseandbenignmutations.ihave used 10 new set of features refer to as neutraldisease missense mutation discriminatory(ndmsmd)features.thetenndmsmdfeaturesincludesixposition specific features, two protein structure based and two amino acid residue based features.thisstudyalsorevealedthediscriminatorypowerofalltenindividualfeatures tobeusedinthepredictionofpathogenicmutations.thenextchapterdealswiththe usageof10ndmsmdfeaturesinthenewlydevelopedsvmbasedmethodforthe predictionofpathogenicmutations. Page62

Proteins: Characteristics and Properties of Amino Acids

Proteins: Characteristics and Properties of Amino Acids SBI4U:Biochemistry Macromolecules Eachaminoacidhasatleastoneamineandoneacidfunctionalgroupasthe nameimplies.thedifferentpropertiesresultfromvariationsinthestructuresof differentrgroups.thergroupisoftenreferredtoastheaminoacidsidechain.

More information

Sequence comparison: Score matrices. Genome 559: Introduction to Statistical and Computational Genomics Prof. James H. Thomas

Sequence comparison: Score matrices. Genome 559: Introduction to Statistical and Computational Genomics Prof. James H. Thomas Sequence comparison: Score matrices Genome 559: Introduction to Statistical and omputational Genomics Prof James H Thomas Informal inductive proof of best alignment path onsider the last step in the best

More information

Sequence comparison: Score matrices

Sequence comparison: Score matrices Sequence comparison: Score matrices http://facultywashingtonedu/jht/gs559_2013/ Genome 559: Introduction to Statistical and omputational Genomics Prof James H Thomas FYI - informal inductive proof of best

More information

Sequence comparison: Score matrices. Genome 559: Introduction to Statistical and Computational Genomics Prof. James H. Thomas

Sequence comparison: Score matrices. Genome 559: Introduction to Statistical and Computational Genomics Prof. James H. Thomas Sequence comparison: Score matrices Genome 559: Introduction to Statistical and omputational Genomics Prof James H Thomas FYI - informal inductive proof of best alignment path onsider the last step in

More information

Protein Secondary Structure Prediction

Protein Secondary Structure Prediction part of Bioinformatik von RNA- und Proteinstrukturen Computational EvoDevo University Leipzig Leipzig, SS 2011 the goal is the prediction of the secondary structure conformation which is local each amino

More information

Range of Certified Values in Reference Materials. Range of Expanded Uncertainties as Disseminated. NMI Service

Range of Certified Values in Reference Materials. Range of Expanded Uncertainties as Disseminated. NMI Service Calibration and Capabilities Amount of substance,, Russian Federation (Ural Scientific and Research Institiute Metrology, Rosstandart) (D.I. Mendeleyev Institute Metrology, Rosstandart) The uncertainty

More information

PROTEIN STRUCTURE AMINO ACIDS H R. Zwitterion (dipolar ion) CO 2 H. PEPTIDES Formal reactions showing formation of peptide bond by dehydration:

PROTEIN STRUCTURE AMINO ACIDS H R. Zwitterion (dipolar ion) CO 2 H. PEPTIDES Formal reactions showing formation of peptide bond by dehydration: PTEI STUTUE ydrolysis of proteins with aqueous acid or base yields a mixture of free amino acids. Each type of protein yields a characteristic mixture of the ~ 20 amino acids. AMI AIDS Zwitterion (dipolar

More information

Translation. A ribosome, mrna, and trna.

Translation. A ribosome, mrna, and trna. Translation The basic processes of translation are conserved among prokaryotes and eukaryotes. Prokaryotic Translation A ribosome, mrna, and trna. In the initiation of translation in prokaryotes, the Shine-Dalgarno

More information

Properties of amino acids in proteins

Properties of amino acids in proteins Properties of amino acids in proteins one of the primary roles of DNA (but not the only one!) is to code for proteins A typical bacterium builds thousands types of proteins, all from ~20 amino acids repeated

More information

Proteome Informatics. Brian C. Searle Creative Commons Attribution

Proteome Informatics. Brian C. Searle Creative Commons Attribution Proteome Informatics Brian C. Searle searleb@uw.edu Creative Commons Attribution Section structure Class 1 Class 2 Homework 1 Mass spectrometry and de novo sequencing Database searching and E-value estimation

More information

Using Higher Calculus to Study Biologically Important Molecules Julie C. Mitchell

Using Higher Calculus to Study Biologically Important Molecules Julie C. Mitchell Using Higher Calculus to Study Biologically Important Molecules Julie C. Mitchell Mathematics and Biochemistry University of Wisconsin - Madison 0 There Are Many Kinds Of Proteins The word protein comes

More information

Periodic Table. 8/3/2006 MEDC 501 Fall

Periodic Table. 8/3/2006 MEDC 501 Fall Periodic Table 8/3/2006 MEDC 501 Fall 2006 1 rbitals Shapes of rbitals s - orbital p -orbital 8/3/2006 MEDC 501 Fall 2006 2 Ionic Bond - acl Electronic Structure 11 a :: 1s 2 2s 2 2p x2 2p y2 2p z2 3s

More information

Lecture 14 - Cells. Astronomy Winter Lecture 14 Cells: The Building Blocks of Life

Lecture 14 - Cells. Astronomy Winter Lecture 14 Cells: The Building Blocks of Life Lecture 14 Cells: The Building Blocks of Life Astronomy 141 Winter 2012 This lecture describes Cells, the basic structural units of all life on Earth. Basic components of cells: carbohydrates, lipids,

More information

CHAPTER 29 HW: AMINO ACIDS + PROTEINS

CHAPTER 29 HW: AMINO ACIDS + PROTEINS CAPTER 29 W: AMI ACIDS + PRTEIS For all problems, consult the table of 20 Amino Acids provided in lecture if an amino acid structure is needed; these will be given on exams. Use natural amino acids (L)

More information

Protein structure. Protein structure. Amino acid residue. Cell communication channel. Bioinformatics Methods

Protein structure. Protein structure. Amino acid residue. Cell communication channel. Bioinformatics Methods Cell communication channel Bioinformatics Methods Iosif Vaisman Email: ivaisman@gmu.edu SEQUENCE STRUCTURE DNA Sequence Protein Sequence Protein Structure Protein structure ATGAAATTTGGAAACTTCCTTCTCACTTATCAGCCACCT...

More information

PROTEIN SECONDARY STRUCTURE PREDICTION: AN APPLICATION OF CHOU-FASMAN ALGORITHM IN A HYPOTHETICAL PROTEIN OF SARS VIRUS

PROTEIN SECONDARY STRUCTURE PREDICTION: AN APPLICATION OF CHOU-FASMAN ALGORITHM IN A HYPOTHETICAL PROTEIN OF SARS VIRUS Int. J. LifeSc. Bt & Pharm. Res. 2012 Kaladhar, 2012 Research Paper ISSN 2250-3137 www.ijlbpr.com Vol.1, Issue. 1, January 2012 2012 IJLBPR. All Rights Reserved PROTEIN SECONDARY STRUCTURE PREDICTION:

More information

Chemistry Chapter 22

Chemistry Chapter 22 hemistry 2100 hapter 22 Proteins Proteins serve many functions, including the following. 1. Structure: ollagen and keratin are the chief constituents of skin, bone, hair, and nails. 2. atalysts: Virtually

More information

Protein Identification Using Tandem Mass Spectrometry. Nathan Edwards Informatics Research Applied Biosystems

Protein Identification Using Tandem Mass Spectrometry. Nathan Edwards Informatics Research Applied Biosystems Protein Identification Using Tandem Mass Spectrometry Nathan Edwards Informatics Research Applied Biosystems Outline Proteomics context Tandem mass spectrometry Peptide fragmentation Peptide identification

More information

The Select Command and Boolean Operators

The Select Command and Boolean Operators The Select Command and Boolean Operators Part of the Jmol Training Guide from the MSOE Center for BioMolecular Modeling Interactive version available at http://cbm.msoe.edu/teachingresources/jmol/jmoltraining/boolean.html

More information

Part 4 The Select Command and Boolean Operators

Part 4 The Select Command and Boolean Operators Part 4 The Select Command and Boolean Operators http://cbm.msoe.edu/newwebsite/learntomodel Introduction By default, every command you enter into the Console affects the entire molecular structure. However,

More information

Patrick: An Introduction to Medicinal Chemistry 5e Chapter 03

Patrick: An Introduction to Medicinal Chemistry 5e Chapter 03 01) Which of the following statements is not true regarding the active site of an enzyme? a. An active site is normally on the surface of an enzyme. b. An active site is normally hydrophobic in nature.

More information

Enzyme Catalysis & Biotechnology

Enzyme Catalysis & Biotechnology L28-1 Enzyme Catalysis & Biotechnology Bovine Pancreatic RNase A Biochemistry, Life, and all that L28-2 A brief word about biochemistry traditionally, chemical engineers used organic and inorganic chemistry

More information

Hypergraphs, Metabolic Networks, Bioreaction Systems. G. Bastin

Hypergraphs, Metabolic Networks, Bioreaction Systems. G. Bastin Hypergraphs, Metabolic Networks, Bioreaction Systems. G. Bastin PART 1 : Metabolic flux analysis and minimal bioreaction modelling PART 2 : Dynamic metabolic flux analysis of underdetermined networks 2

More information

Lecture 15: Realities of Genome Assembly Protein Sequencing

Lecture 15: Realities of Genome Assembly Protein Sequencing Lecture 15: Realities of Genome Assembly Protein Sequencing Study Chapter 8.10-8.15 1 Euler s Theorems A graph is balanced if for every vertex the number of incoming edges equals to the number of outgoing

More information

EXAM 1 Fall 2009 BCHS3304, SECTION # 21734, GENERAL BIOCHEMISTRY I Dr. Glen B Legge

EXAM 1 Fall 2009 BCHS3304, SECTION # 21734, GENERAL BIOCHEMISTRY I Dr. Glen B Legge EXAM 1 Fall 2009 BCHS3304, SECTION # 21734, GENERAL BIOCHEMISTRY I 2009 Dr. Glen B Legge This is a Scantron exam. All answers should be transferred to the Scantron sheet using a #2 pencil. Write and bubble

More information

12/6/12. Dr. Sanjeeva Srivastava IIT Bombay. Primary Structure. Secondary Structure. Tertiary Structure. Quaternary Structure.

12/6/12. Dr. Sanjeeva Srivastava IIT Bombay. Primary Structure. Secondary Structure. Tertiary Structure. Quaternary Structure. Dr. anjeeva rivastava Primary tructure econdary tructure Tertiary tructure Quaternary tructure Amino acid residues α Helix Polypeptide chain Assembled subunits 2 1 Amino acid sequence determines 3-D structure

More information

Exam III. Please read through each question carefully, and make sure you provide all of the requested information.

Exam III. Please read through each question carefully, and make sure you provide all of the requested information. 09-107 onors Chemistry ame Exam III Please read through each question carefully, and make sure you provide all of the requested information. 1. A series of octahedral metal compounds are made from 1 mol

More information

Proteomics. November 13, 2007

Proteomics. November 13, 2007 Proteomics November 13, 2007 Acknowledgement Slides presented here have been borrowed from presentations by : Dr. Mark A. Knepper (LKEM, NHLBI, NIH) Dr. Nathan Edwards (Center for Bioinformatics and Computational

More information

Protein Struktur (optional, flexible)

Protein Struktur (optional, flexible) Protein Struktur (optional, flexible) 22/10/2009 [ 1 ] Andrew Torda, Wintersemester 2009 / 2010, AST nur für Informatiker, Mathematiker,.. 26 kt, 3 ov 2009 Proteins - who cares? 22/10/2009 [ 2 ] Most important

More information

Protein Structure Bioinformatics Introduction

Protein Structure Bioinformatics Introduction 1 Swiss Institute of Bioinformatics Protein Structure Bioinformatics Introduction Basel, 27. September 2004 Torsten Schwede Biozentrum - Universität Basel Swiss Institute of Bioinformatics Klingelbergstr

More information

1. Amino Acids and Peptides Structures and Properties

1. Amino Acids and Peptides Structures and Properties 1. Amino Acids and Peptides Structures and Properties Chemical nature of amino acids The!-amino acids in peptides and proteins (excluding proline) consist of a carboxylic acid ( COOH) and an amino ( NH

More information

INTRODUCTION. Amino acids occurring in nature have the general structure shown below:

INTRODUCTION. Amino acids occurring in nature have the general structure shown below: Biochemistry I Laboratory Amino Acid Thin Layer Chromatography INTRODUCTION The primary importance of amino acids in cell structure and metabolism lies in the fact that they serve as building blocks for

More information

Amino Acids and Peptides

Amino Acids and Peptides Amino Acids Amino Acids and Peptides Amino acid a compound that contains both an amino group and a carboxyl group α-amino acid an amino acid in which the amino group is on the carbon adjacent to the carboxyl

More information

Solutions In each case, the chirality center has the R configuration

Solutions In each case, the chirality center has the R configuration CAPTER 25 669 Solutions 25.1. In each case, the chirality center has the R configuration. C C 2 2 C 3 C(C 3 ) 2 D-Alanine D-Valine 25.2. 2 2 S 2 d) 2 25.3. Pro,, Trp, Tyr, and is, Trp, Tyr, and is Arg,

More information

Scoring Matrices. Shifra Ben-Dor Irit Orr

Scoring Matrices. Shifra Ben-Dor Irit Orr Scoring Matrices Shifra Ben-Dor Irit Orr Scoring matrices Sequence alignment and database searching programs compare sequences to each other as a series of characters. All algorithms (programs) for comparison

More information

Evidence from Evolution Activity 75 Points. Fossils Use your textbook and the diagrams on the next page to answer the following questions.

Evidence from Evolution Activity 75 Points. Fossils Use your textbook and the diagrams on the next page to answer the following questions. Name(s): Biology Evidence from Evolution Activity 75 Points Fossils Use your textbook and the diagrams on the next page to answer the following questions. 1. What are fossils? How are most fossils formed?

More information

A rapid and highly selective colorimetric method for direct detection of tryptophan in proteins via DMSO acceleration

A rapid and highly selective colorimetric method for direct detection of tryptophan in proteins via DMSO acceleration A rapid and highly selective colorimetric method for direct detection of tryptophan in proteins via DMSO acceleration Yanyan Huang, Shaoxiang Xiong, Guoquan Liu, Rui Zhao Beijing National Laboratory for

More information

How did they form? Exploring Meteorite Mysteries

How did they form? Exploring Meteorite Mysteries Exploring Meteorite Mysteries Objectives Students will: recognize that carbonaceous chondrite meteorites contain amino acids, the first step towards living plants and animals. conduct experiments that

More information

Chapter 5. Proteomics and the analysis of protein sequence Ⅱ

Chapter 5. Proteomics and the analysis of protein sequence Ⅱ Proteomics Chapter 5. Proteomics and the analysis of protein sequence Ⅱ 1 Pairwise similarity searching (1) Figure 5.5: manual alignment One of the amino acids in the top sequence has no equivalent and

More information

Discussion Section (Day, Time):

Discussion Section (Day, Time): Chemistry 27 Spring 2005 Exam 1 Chemistry 27 Professor Gavin MacBeath arvard University Spring 2005 our Exam 1 Friday, February 25, 2005 11:07 AM 12:00 PM Discussion Section (Day, Time): TF: Directions:

More information

Protein Struktur. Biologen und Chemiker dürfen mit Handys spielen (leise) go home, go to sleep. wake up at slide 39

Protein Struktur. Biologen und Chemiker dürfen mit Handys spielen (leise) go home, go to sleep. wake up at slide 39 Protein Struktur Biologen und Chemiker dürfen mit Handys spielen (leise) go home, go to sleep wake up at slide 39 Andrew Torda, Wintersemester 2016/ 2017 Andrew Torda 17.10.2016 [ 1 ] Proteins - who cares?

More information

SEQUENCE ALIGNMENT BACKGROUND: BIOINFORMATICS. Prokaryotes and Eukaryotes. DNA and RNA

SEQUENCE ALIGNMENT BACKGROUND: BIOINFORMATICS. Prokaryotes and Eukaryotes. DNA and RNA SEQUENCE ALIGNMENT BACKGROUND: BIOINFORMATICS 1 Prokaryotes and Eukaryotes 2 DNA and RNA 3 4 Double helix structure Codons Codons are triplets of bases from the RNA sequence. Each triplet defines an amino-acid.

More information

Finding the Best Biological Pairwise Alignment Through Genetic Algorithm Determinando o Melhor Alinhamento Biológico Através do Algoritmo Genético

Finding the Best Biological Pairwise Alignment Through Genetic Algorithm Determinando o Melhor Alinhamento Biológico Através do Algoritmo Genético Finding the Best Biological Pairwise Alignment Through Genetic Algorithm Determinando o Melhor Alinhamento Biológico Através do Algoritmo Genético Paulo Mologni 1, Ailton Akira Shinoda 2, Carlos Dias Maciel

More information

Viewing and Analyzing Proteins, Ligands and their Complexes 2

Viewing and Analyzing Proteins, Ligands and their Complexes 2 2 Viewing and Analyzing Proteins, Ligands and their Complexes 2 Overview Viewing the accessible surface Analyzing the properties of proteins containing thousands of atoms is best accomplished by representing

More information

Read more about Pauling and more scientists at: Profiles in Science, The National Library of Medicine, profiles.nlm.nih.gov

Read more about Pauling and more scientists at: Profiles in Science, The National Library of Medicine, profiles.nlm.nih.gov 2018 Biochemistry 110 California Institute of Technology Lecture 2: Principles of Protein Structure Linus Pauling (1901-1994) began his studies at Caltech in 1922 and was directed by Arthur Amos oyes to

More information

Discussion Section (Day, Time): TF:

Discussion Section (Day, Time): TF: ame: Chemistry 27 Professor Gavin MacBeath arvard University Spring 2004 Final Exam Thursday, May 28, 2004 2:15 PM - 5:15 PM Discussion Section (Day, Time): Directions: TF: 1. Do not write in red ink.

More information

Chemical Properties of Amino Acids

Chemical Properties of Amino Acids hemical Properties of Amino Acids Protein Function Make up about 15% of the cell and have many functions in the cell 1. atalysis: enzymes 2. Structure: muscle proteins 3. Movement: myosin, actin 4. Defense:

More information

Discussion Section (Day, Time):

Discussion Section (Day, Time): Chemistry 27 Spring 2005 Exam 3 Chemistry 27 Professor Gavin MacBeath arvard University Spring 2005 our Exam 3 Friday April 29 th, 2005 11:07 AM 12:00 PM Discussion Section (Day, Time): TF: Directions:

More information

National Nutrient Database for Standard Reference Release 28 slightly revised May, 2016

National Nutrient Database for Standard Reference Release 28 slightly revised May, 2016 National base for Standard Reference Release 28 slightly revised May, 206 Full Report (All s) 005, Cheese, cottage, lowfat, 2% milkfat Report Date: February 23, 208 02:7 EST values and weights are for

More information

All Proteins Have a Basic Molecular Formula

All Proteins Have a Basic Molecular Formula All Proteins Have a Basic Molecular Formula Homa Torabizadeh Abstract This study proposes a basic molecular formula for all proteins. A total of 10,739 proteins belonging to 9 different protein groups

More information

DATA MINING OF ELECTROSTATIC INTERACTIONS BETWEEN AMINO ACIDS IN COILED-COIL PROTEINS USING THE STABLE COIL ALGORITHM ANKUR S.

DATA MINING OF ELECTROSTATIC INTERACTIONS BETWEEN AMINO ACIDS IN COILED-COIL PROTEINS USING THE STABLE COIL ALGORITHM ANKUR S. University of Colorado at Colorado Springs i DATA MINING OF ELECTROSTATIC INTERACTIONS BETWEEN AMINO ACIDS IN COILED-COIL PROTEINS USING THE STABLE COIL ALGORITHM BY ANKUR S. DESHMUKH A project submitted

More information

Structures in equilibrium at point A: Structures in equilibrium at point B: (ii) Structure at the isoelectric point:

Structures in equilibrium at point A: Structures in equilibrium at point B: (ii) Structure at the isoelectric point: ame 21 F10-Final Exam Page 2 I. (42 points) (1) (16 points) The titration curve for L-lysine is shown below. Provide (i) the main structures in equilibrium at each of points A and B indicated below and

More information

8 Grundlagen der Bioinformatik, SS 09, D. Huson, April 28, 2009

8 Grundlagen der Bioinformatik, SS 09, D. Huson, April 28, 2009 8 Grundlagen der Bioinformatik, SS 09, D. Huson, April 28, 2009 2 Pairwise alignment We will discuss: 1. Strings 2. Dot matrix method for comparing sequences 3. Edit distance and alignment 4. The number

More information

8 Grundlagen der Bioinformatik, SoSe 11, D. Huson, April 18, 2011

8 Grundlagen der Bioinformatik, SoSe 11, D. Huson, April 18, 2011 8 Grundlagen der Bioinformatik, SoSe 11, D. Huson, April 18, 2011 2 Pairwise alignment We will discuss: 1. Strings 2. Dot matrix method for comparing sequences 3. Edit distance and alignment 4. The number

More information

Studies Leading to the Development of a Highly Selective. Colorimetric and Fluorescent Chemosensor for Lysine

Studies Leading to the Development of a Highly Selective. Colorimetric and Fluorescent Chemosensor for Lysine Supporting Information for Studies Leading to the Development of a Highly Selective Colorimetric and Fluorescent Chemosensor for Lysine Ying Zhou, a Jiyeon Won, c Jin Yong Lee, c * and Juyoung Yoon a,

More information

Protein Structure Marianne Øksnes Dalheim, PhD candidate Biopolymers, TBT4135, Autumn 2013

Protein Structure Marianne Øksnes Dalheim, PhD candidate Biopolymers, TBT4135, Autumn 2013 Protein Structure Marianne Øksnes Dalheim, PhD candidate Biopolymers, TBT4135, Autumn 2013 The presentation is based on the presentation by Professor Alexander Dikiy, which is given in the course compedium:

More information

Systematic approaches to study cancer cell metabolism

Systematic approaches to study cancer cell metabolism Systematic approaches to study cancer cell metabolism Kivanc Birsoy Laboratory of Metabolic Regulation and Genetics The Rockefeller University, NY Cellular metabolism is complex ~3,000 metabolic genes

More information

The Structure of Enzymes!

The Structure of Enzymes! The Structure of Enzymes Levels of Protein Structure 0 order amino acid composition Primary Secondary Motifs Tertiary Domains Quaternary ther sequence repeating structural patterns defined by torsion angles

More information

The Structure of Enzymes!

The Structure of Enzymes! The Structure of Enzymes Levels of Protein Structure 0 order amino acid composition Primary Secondary Motifs Tertiary Domains Quaternary ther sequence repeating structural patterns defined by torsion angles

More information

Protein Structure. Role of (bio)informatics in drug discovery. Bioinformatics

Protein Structure. Role of (bio)informatics in drug discovery. Bioinformatics Bioinformatics Protein Structure Principles & Architecture Marjolein Thunnissen Dep. of Biochemistry & Structural Biology Lund University September 2011 Homology, pattern and 3D structure searches need

More information

BCH 4053 Exam I Review Spring 2017

BCH 4053 Exam I Review Spring 2017 BCH 4053 SI - Spring 2017 Reed BCH 4053 Exam I Review Spring 2017 Chapter 1 1. Calculate G for the reaction A + A P + Q. Assume the following equilibrium concentrations: [A] = 20mM, [Q] = [P] = 40fM. Assume

More information

Lecture'18:'April'2,'2013

Lecture'18:'April'2,'2013 CM'224' 'rganic'chemistry'ii Spring'2013,'Des'Plaines' 'Prof.'Chad'Landrie 2 3 N cysteine (Cys) S oxidation S S 3 N cystine N 3 Lecture'18:'April'2,'2013 Disaccharides+&+Polysaccharides Amino+acids++(26.1926.3)

More information

Generation Date: 12/07/2015 Generated By: Tristan Wiley Title: Bio I Winter Packet

Generation Date: 12/07/2015 Generated By: Tristan Wiley Title: Bio I Winter Packet Generation Date: 12/07/2015 Generated By: Tristan Wiley Title: Bio I Winter Packet 1. Many natural ecosystems have been destroyed by human activity. To better manage our remaining natural ecosystems, we

More information

Performing pka Calculations in Proteins Using Free Energy Perturbation Adiabatic Charging (FEP/AC)

Performing pka Calculations in Proteins Using Free Energy Perturbation Adiabatic Charging (FEP/AC) Performing pka Calculations in Proteins Using Free Energy Perturbation Adiabatic Charging (FEP/AC) Lynn Kamerlin Uppsala University Purpose of the workshop: The aim of this workshop will be to provide

More information

1. Wings 5.. Jumping legs 2. 6 Legs 6. Crushing mouthparts 3. Segmented Body 7. Legs 4. Double set of wings 8. Curly antennae

1. Wings 5.. Jumping legs 2. 6 Legs 6. Crushing mouthparts 3. Segmented Body 7. Legs 4. Double set of wings 8. Curly antennae Biology Cladogram practice Name Per Date What is a cladogram? It is a diagram that depicts evolutionary relationships among groups. It is based on PHYLOGENY, which is the study of evolutionary relationships.

More information

Lecture 7. Protein Secondary Structure Prediction. Secondary Structure DSSP. Master Course DNA/Protein Structurefunction.

Lecture 7. Protein Secondary Structure Prediction. Secondary Structure DSSP. Master Course DNA/Protein Structurefunction. C N T R F O R N T G R A T V B O N F O R M A T C S V U Master Course DNA/Protein Structurefunction Analysis and Prediction Lecture 7 Protein Secondary Structure Prediction Protein primary structure 20 amino

More information

Discussion Section (Day, Time):

Discussion Section (Day, Time): Chemistry 27 pring 2005 Exam 3 Chemistry 27 Professor Gavin MacBeath arvard University pring 2005 our Exam 3 Friday April 29 th, 2005 11:07 AM 12:00 PM Discussion ection (Day, Time): TF: Directions: 1.

More information

1 large 50g. 1 extra large 56g

1 large 50g. 1 extra large 56g National base for Standard Reference Release 28 slightly revised May, 206 Full Report (All s) 023, Egg, whole, raw, fresh Report Date: February 08, 208 06:8 EST values and weights are for edible portion.

More information

Protein Secondary Structure Prediction

Protein Secondary Structure Prediction C E N T R F O R I N T E G R A T I V E B I O I N F O R M A T I C S V U E Master Course DNA/Protein Structurefunction Analysis and Prediction Lecture 7 Protein Secondary Structure Prediction Protein primary

More information

Molecular Selective Binding of Basic Amino Acids by a Water-soluble Pillar[5]arene

Molecular Selective Binding of Basic Amino Acids by a Water-soluble Pillar[5]arene Electronic supplementary information Molecular Selective Binding of Basic Amino Acids y a Water-solule Pillar[5]arene Chunju Li,* a, Junwei Ma, a Liu Zhao, a Yanyan Zhang, c Yihua Yu, c Xiaoyan Shu, a

More information

NSCI Basic Properties of Life and The Biochemistry of Life on Earth

NSCI Basic Properties of Life and The Biochemistry of Life on Earth NSCI 314 LIFE IN THE COSMOS 4 Basic Properties of Life and The Biochemistry of Life on Earth Dr. Karen Kolehmainen Department of Physics CSUSB http://physics.csusb.edu/~karen/ WHAT IS LIFE? HARD TO DEFINE,

More information

4. The Michaelis-Menten combined rate constant Km, is defined for the following kinetic mechanism as k 1 k 2 E + S ES E + P k -1

4. The Michaelis-Menten combined rate constant Km, is defined for the following kinetic mechanism as k 1 k 2 E + S ES E + P k -1 Fall 2000 CH 595C Exam 1 Answer Key Multiple Choice 1. One of the reasons that enzymes are such efficient catalysts is that a) the energy level of the enzyme-transition state complex is much higher than

More information

BENG 183 Trey Ideker. Protein Sequencing

BENG 183 Trey Ideker. Protein Sequencing BENG 183 Trey Ideker Protein Sequencing The following slides borrowed from Hong Li s Biochemistry Course: www.sb.fsu.edu/~hongli/4053notes Introduction to Proteins Proteins are of vital importance to biological

More information

CHEM 3653 Exam # 1 (03/07/13)

CHEM 3653 Exam # 1 (03/07/13) 1. Using phylogeny all living organisms can be divided into the following domains: A. Bacteria, Eukarya, and Vertebrate B. Archaea and Eukarya C. Bacteria, Eukarya, and Archaea D. Eukarya and Bacteria

More information

Bioinformatics: Network Analysis

Bioinformatics: Network Analysis Bioinformatics: Network Analysis Flux Balance Analysis and Metabolic Control Analysis COMP 572 (BIOS 572 / BIOE 564) - Fall 2013 Luay Nakhleh, Rice University 1 Flux Balance Analysis (FBA) Flux balance

More information

Potentiometric Titration of an Amino Acid. Introduction

Potentiometric Titration of an Amino Acid. Introduction NAME: Course: DATE Sign-Off: Performed: Potentiometric Titration of an Amino Acid Introduction In previous course-work, you explored the potentiometric titration of a weak acid (HOAc). In this experiment,

More information

Introduction to graph theory and molecular networks

Introduction to graph theory and molecular networks Introduction to graph theory and molecular networks Sushmita Roy sroy@biostat.wisc.edu Computational Network Biology Biostatistics & Medical Informatics 826 https://compnetbiocourse.discovery.wisc.edu

More information

A Theoretical Inference of Protein Schemes from Amino Acid Sequences

A Theoretical Inference of Protein Schemes from Amino Acid Sequences A Theoretical Inference of Protein Schemes from Amino Acid Sequences Angel Villahoz-Baleta angel_villahozbaleta@student.uml.edu ABSTRACT Proteins are based on tri-dimensional dispositions generated from

More information

PROTEIN SECONDARY STRUCTURE PREDICTION USING NEURAL NETWORKS AND SUPPORT VECTOR MACHINES

PROTEIN SECONDARY STRUCTURE PREDICTION USING NEURAL NETWORKS AND SUPPORT VECTOR MACHINES PROTEIN SECONDARY STRUCTURE PREDICTION USING NEURAL NETWORKS AND SUPPORT VECTOR MACHINES by Lipontseng Cecilia Tsilo A thesis submitted to Rhodes University in partial fulfillment of the requirements for

More information

ANSWERS TO CASE STUDIES Chapter 2: Drug Design and Relationship of Functional Groups to Pharmacologic Activity

ANSWERS TO CASE STUDIES Chapter 2: Drug Design and Relationship of Functional Groups to Pharmacologic Activity ANSWERS TO CASE STUDIES Chapter 2: Drug Design and Relationship of Functional Groups to Pharmacologic Activity Absorption/Acid-Base Case (p. 42) Question #1: Drug Cetirizine Clemastin e Functional groups

More information

Basic Principles of Protein Structures

Basic Principles of Protein Structures Basic Principles of Protein Structures Proteins Proteins: The Molecule of Life Proteins: Building Blocks Proteins: Secondary Structures Proteins: Tertiary and Quartenary Structure Proteins: Geometry Proteins

More information

A modular Fibonacci sequence in proteins

A modular Fibonacci sequence in proteins A modular Fibonacci sequence in proteins P. Dominy 1 and G. Rosen 2 1 Hagerty Library, Drexel University, Philadelphia, PA 19104, USA 2 Department of Physics, Drexel University, Philadelphia, PA 19104,

More information

Lecture 14 Secondary Structure Prediction

Lecture 14 Secondary Structure Prediction C N T R F O R I N T G R A T I V Protein structure B I O I N F O R M A T I C S V U Lecture 14 Secondary Structure Prediction Bioinformatics Center IBIVU James Watson & Francis Crick (1953) Linus Pauling

More information

Collision Cross Section: Ideal elastic hard sphere collision:

Collision Cross Section: Ideal elastic hard sphere collision: Collision Cross Section: Ideal elastic hard sphere collision: ( r r 1 ) Where is the collision cross-section r 1 r ) ( 1 Where is the collision distance r 1 r These equations negate potential interactions

More information

Using an Artificial Regulatory Network to Investigate Neural Computation

Using an Artificial Regulatory Network to Investigate Neural Computation Using an Artificial Regulatory Network to Investigate Neural Computation W. Garrett Mitchener College of Charleston January 6, 25 W. Garrett Mitchener (C of C) UM January 6, 25 / 4 Evolution and Computing

More information

Supplementary Table S1. Pathological analysis for non-synonymous sequence changes

Supplementary Table S1. Pathological analysis for non-synonymous sequence changes Abu-Amero and Bosley, Page 1 Supplementary Table S1. Pathological analysis for non-synonymous sequence changes 1393 G > A 3460 G > A 4040 C > G 4216 T > C This sequence change is located in the 12S rrna

More information

CHEMISTRY ATAR COURSE DATA BOOKLET

CHEMISTRY ATAR COURSE DATA BOOKLET CHEMISTRY ATAR COURSE DATA BOOKLET 2018 2018/2457 Chemistry ATAR Course Data Booklet 2018 Table of contents Periodic table of the elements...3 Formulae...4 Units...4 Constants...4 Solubility rules for

More information

Module No. 31: Peptide Synthesis: Definition, Methodology & applications

Module No. 31: Peptide Synthesis: Definition, Methodology & applications PAPER 9: TECHNIQUES USED IN MOLECULAR BIOPHYSICS I Module No. 31: Peptide Synthesis: Definition, Methodology & applications Objectives: 1. Introduction 2. Synthesis of peptide 2.1. N-terminal protected

More information

In eukaryotes the most important regulatory genes contain homeobox sequences and are called homeotic genes.

In eukaryotes the most important regulatory genes contain homeobox sequences and are called homeotic genes. 1 rowth and development in organisms is controlled by a number of mechanisms that operate at the cellular level. The control elements involved in these mechanisms include hormones, the second messenger

More information

Modelinig and Simulation of Amino Acide

Modelinig and Simulation of Amino Acide Modelinig and Simulation of Amino Acide Basil Younis Thanoon Mathmathics Dept College of Computer Sciences & Mathematics Mosul, Iraq Fatima Mahmood Hasan Zamzwm Mathmathics Dept College of Computer Sciences

More information

Separation of Large and Small Peptides by Supercritical Fluid Chromatography and Detection by Mass Spectrometry

Separation of Large and Small Peptides by Supercritical Fluid Chromatography and Detection by Mass Spectrometry Separation of Large and Small Peptides by Supercritical Fluid Chromatography and Detection by Mass Spectrometry Application Note Biologics and Biosimilars Author Edgar Naegele Agilent Technologies, Inc.

More information

Analysis of Relevant Physicochemical Properties in Obligate and Non-obligate Protein-protein Interactions

Analysis of Relevant Physicochemical Properties in Obligate and Non-obligate Protein-protein Interactions Analysis of Relevant Physicochemical Properties in Obligate and Non-obligate Protein-protein Interactions Mina Maleki, Md. Mominul Aziz, Luis Rueda School of Computer Science, University of Windsor 401

More information

LS1a Fall 2014 Problem Set #2 Due Monday 10/6 at 6 pm in the drop boxes on the Science Center 2 nd Floor

LS1a Fall 2014 Problem Set #2 Due Monday 10/6 at 6 pm in the drop boxes on the Science Center 2 nd Floor LS1a Fall 2014 Problem Set #2 Due Monday 10/6 at 6 pm in the drop boxes on the Science Center 2 nd Floor Note: Adequate space is given for each answer. Questions that require a brief explanation should

More information

ARE BINDING RESIDUES CONSERVED?

ARE BINDING RESIDUES CONSERVED? ARE BINDING RESIDUES CONSERVED? CHRISTOS OUZOUNIS 1, CAROL PEREZ-IRRATXETA2, CHRIS SANDER1, ALFONSO VALENCIA2,c 1The European Bioinfonnatics Institute, EMBL Outstation, Cambridge UK 2Protein Design Group,

More information

7 Protein secondary structure

7 Protein secondary structure 78 Grundlagen der Bioinformatik, SS 1, D. Huson, June 17, 21 7 Protein secondary structure Sources for this chapter, which are all recommended reading: Introduction to Protein Structure, Branden & Tooze,

More information

Biophysical Society On-line Textbook

Biophysical Society On-line Textbook Biophysical Society On-line Textbook PROTEINS CHAPTER 1. PROTEIN STRUCTURE Section 1. Primary structure, secondary motifs, tertiary architecture, and quaternary organization Jannette Carey* and Vanessa

More information

8 Protein secondary structure

8 Protein secondary structure Grundlagen der Bioinformatik, SoSe 11, D. Huson, June 6, 211 13 8 Protein secondary structure Sources for this chapter, which are all recommended reading: Introduction to Protein Structure, Branden & Tooze,

More information

A Model for Protein Secondary Structure Prediction Meta - Classifiers

A Model for Protein Secondary Structure Prediction Meta - Classifiers A Model for Protein Secondary Structure Prediction Meta - Classifiers Anita Wasilewska Computer Science Department Stony Brook University Stony Brook, NY, USA Overview Introduction to proteins Four levels

More information

12 Protein secondary structure

12 Protein secondary structure Grundlagen der Bioinformatik, SoSe 14, D. Huson, July 2, 214 147 12 Protein secondary structure Sources for this chapter, which are all recommended reading: Introduction to Protein Structure, Branden &

More information

Principles of Biochemistry

Principles of Biochemistry Principles of Biochemistry Fourth Edition Donald Voet Judith G. Voet Charlotte W. Pratt Chapter 4 Amino Acids: The Building Blocks of proteins (Page 76-90) Chapter Contents 1- Amino acids Structure: 2-

More information