Statistical Mechanics of the Combinatorial Synthesis and Analysis of Folding Macromolecules

Size: px
Start display at page:

Download "Statistical Mechanics of the Combinatorial Synthesis and Analysis of Folding Macromolecules"

Transcription

1 J. Phys. Che. B 1997, 101, Statistical Mechanics of the Cobinatorial Synthesis and Analysis of Folding Macroolecules Jeffery G. Saven*, and Peter G. Wolynes School of Cheical Sciences, UniVersity of Illinois at UrbanasChapaign, Urbana, Illinois ReceiVed: May 22, 1997; In Final For: July 25, 1997 X Cobinatorial cheistry techniques provide a proising route to the design of acroolecules that acquire predeterined folded conforations. A library of sequences based on a pool of different onoer types can be synthesized, where the sequences are partially designed so as to be consistent with a particular target conforation. The library is screened for folding olecules. The nuber of sequences grows rapidly with the length of the polyer, however, and both the experiental and coputational tabulation of sequences becoe infeasible. For polyers and libraries of arbitrary size, we present a self-consistent, ean-field theory that can be used to estiate the nuber of sequences as a function of the energy in a target structure. The theory also yields the probabilities that each position in the sequence is occupied by a particular onoer type. The theory is tested using a siple lattice odel of proteins, and excellent agreeent between the theory and the results of exact enuerations are observed. The theory ay be used to quantify particular design strategies and the facility of finding low-energy sequences for particular structures. The theory is discussed with an eye toward protein design and the utability of particular residues in known proteins. 1. Introduction Cheists have long been able to design and synthesize sall copounds, even when the olecular echaniss of such syntheses are not well-understood. Using such synthetic ethods, researchers can craft new olecules and probe their cheical properties, thus shedding light on the echaniss involved. For exaple, the physical organic cheist ight ake a series of copounds all based on a coon thee in her search for quantifiable trends. 1 The cheist is guided by cheical theory both in her choice of olecules and in her interpretation of the experients, experients which in turn infor the theory. This sybiotic pairing of synthetic cheistry and cheical theory has proven fruitful in such areas as cheical bonding, conforational analysis, and reactivity. Clearly, our cheist is ore likely to arrive at soe sort of generalized understanding if she can access and analyze a large nuber of copounds. Facilitated by recent advances in anaging cheical diversity, researchers can now synthesize, catalog, and assay huge libraries of olecules, where each eber of the library is a different cobination of a particular set of cheical building blocks. These cobinatorial cheistry experients are typically used to search for olecules having a desired property, i.e., the binding of a drug candidate to a receptor. By noting the nubers and coon features of olecules with the desired property, researchers can also use these sae experients to reveal uch about the underlying cheistry. Soe researchers in protein cheistry have begun to take a siilar cobinatorial approach to the design of particular folded architectures. For ost proteins, the polyer s three-diensional structure is deterined by its aino acid sequence. The protein cheist should be able then to design sequences that fold to a particular structure, once the physical cheistry of the folding is understood. While it is straightforward to ake linear polyeric sequences, the protein cheist is troubled by a nuber of coplications. Due to the size and coplexity of proteins, any Present address: Departent of Cheistry, University of Pennsylvania, Philadelphia, PA X Abstract published in AdVance ACS Abstracts, Septeber 15, S (97) CCC: $14.00 of the theoretical ethods used to study sall olecules, e.g., ab initio ethods and coplete conforational search, becoe infeasible. Siply synthesizing a representative nuber of sequences is a daunting task, since the nuber of possible peptide sequences is exponentially dependent on, the nuber of residues. Choosing an appropriate target structure is also not without abiguity. One recent study has found that although nearly half of all nonhoologous, globular proteins adopt one of nine failies of folds, any protein folds have little siilarity with any of the reaining protein structures. 2 In addition, in ost proteins no single interaction doinates the folding energetics; hydrogen bonds, hydrophobic effects, van der Waals interactions, and steric packing effects all act in concert to stabilize the folded structure, but the relative agnitudes of their respective contributions are still controversial. Much has been done in the design of particular sequences, but as the size of the peptides in the library grows, the explicit tabulation of all sequences quickly becoes intractable, experientally as well as coputationally. Yet surveying the whole library can reveal trends as to what interactions are likely to destabilize as well as stabilize particular structures. We discuss in this report a workable theory that subtends all ebers of a given library. The theory can aid in the analysis and partial (cobinatorial) design of proteins and other intraolecularly self-organizing olecules, and the theory can be used as a tool to winnow such libraries to a anageable size. In the design of foldable protein sequences, we ust first address why a particular aino acid sequence should fold to a unique structure. We consider this question fro the viewpoint of the energy landscape theory of protein folding. 3-6 Aino acid sequences are heteropolyeric and typically coprise a large nuber of the available aino acids in an apparently rando order. Upon nonspecific collapse of the rando heteropolyer, incoensurate residues are likely to be brought into contact with one another. In addition, charged residues ay be transiently buried in the interior of the globule, or hydrophobic residues ay be exposed to solvent. For nearly all such rando, collapsed states, not all of the interactions within the globule will be satisfied. The energy surface is said 1997 Aerican Cheical Society

2 8376 J. Phys. Che. B, Vol. 101, o. 41, 1997 Saven and Wolynes to be frustrated, and the connectivity of the polyer chain is largely responsible for this frustration. 7 In addition, the nuber of collapsed conforational inia is large; it is exponential in the length of the chain. As in spin glasses, the lowest energy states for such rando, frustrated systes need not be structurally siilar. 8 At low teperatures the rando heteropolyer ay becoe trapped in any one of these low-energy inia. On the other hand, proteins are able to negotiate this rugged terrain and do fold reversibly to unique structures. At physiological teperatures, a protein ust therefore have an additional feature to its energy landscape. The energy of the native (folded) state is sufficiently low such that it copensates for the free energy of the nonnative conforational states of the polyer. The free energy of the unfolded states is a function of the average energy of these conforations, the distribution of their energies, and their nuber (entropy). Due to the energetic ruggedness and the entropy of the nonfolded states, the energy of the folded structure ust be uch lower in energy than is typical for a collapsed, rando heteropolyer of the sae coposition. The structure of the folded state is low in energy because it has a large nuber of favorable interactions, both between residues and between a particular residue type and its local environent. This energetic consistency between a protein sequence and its folded conforation has been tered the principle of inial frustration. 7,9 A protein s aino acid sequence deterines its structure, and at present any different protein topologies are known. Given this inherent plasticity of peptides, researchers have attepted to sculpt particular protein architectures by careful choice of the aino acid sequence. Soe researchers interested in de novo protein design have focused on iniizing frustration, through hierarchical design schees. 10 For exaple, in designing four helix bundles, workers have used peptide sequences that are known to for stable helixes. These helixes are then engineered to have copleentary hydrophobic faces that interact upon aggregation. Appropriate linker sequences are added to connect the these helixes into one contiguous sequence. In one case, a disulfide bond connecting two helixes has been introduced to yield a conforationally defined, though sall, two-helix peptide. 11 Using these ethods, researchers have attepted to engineer helical bundles, R/β proteins, and β sheet proteins. In each case the global tertiary fold being designed is siilar to a naturally occurring one, but recently, Dolgikh et al. have aied for a truly novel protein structure. 12 The proteins so developed have any of the properties of natural proteins in that they are copact and have substantial aounts of secondary structure, but typically what fors is a olten globule state that has no well-defined tertiary fold. 13 otwithstanding, these efforts appear very proising, and this de novo design of particular sequences is likely to lead to artificial proteins soon. Alongside these experiental efforts, researchers have been tackling the issue of protein design fro a coputational viewpoint. Methods for scoring trial sequences are required, and researchers have developed a nuber of ways of quantifying which aino acid sequences ight be copatible with a chosen structure. The chosen structure specifies a three-diensional skeleton for the trial sequences. Richards and co-workers have deterined what utations are copatible with a given structure. 14,15 These authors used an atoistic approach in which they utated side chain residues and then iniized the energy. These ethods reveal uch about the interactions between side chains and the effect of utation on the packing of the individual residues. The coputational deands of such ethods, however, liit the nuber of sequences that ay be considered. Other researchers have developed eans of scoring trial sequences based upon the coparison of sequences that are known to fold to a particular structure in the Protein Data Bank (PDB) Focusing priarily on backbone degrees of freedo, researchers have also considered siplified odels of proteins, where reduced energy functions that involve effective residue-residue interaction energies have been used. The ost coonly used energy functions are the inforationbased potentials, those that are derived statistically fro the probability of occurrence of a particular residue contact pair in the PDB While uch concern reains about the reliability, eaning, and accuracy of such potentials, these potentials are at present the ost useful reduced description for the free energies involved in folding. General issues regarding design using reduced descriptions have been addressed More concretely, soe workers have used such potentials to verify design algoriths using both off-lattice, 32 and ore coonly, lattice odels of proteins. 33,34 Generally, such algoriths search for low-energy structures using a siple energy function, where utations are randoly introduced along the chain. The search ay be done by screening against a pool of known sequences 17 by using genetic algoriths 35,36 or by using Monte Carlo ethods. 33,37-41 Other researchers have considered the polyerization of sequences on a lattice in the presence of an object to which the polyer binds as a route to designed ( iprinted ) structures. 42 In one design schee, the energy of the sequence is iniized at constant coposition, under the assuption that the free energy of the unfolded enseble of states is solely a function of the total nubers of each type of residue. 33 (If the coposition were not constrained, a hoopolyer would be the lowest energy sequence, which we know does not fold to a unique structure.) We ust ention that a sequence with a low-energy in the target structure does not necessarily fold uniquely to that structure; that sequence ay acquire a different conforation that is still lower in energy. 43 Researchers have addressed these issues by using alternate design criteria that take into account structures other than the target ,40 All these ethods are liited, however, in that they rely upon the explicit tabulation of sequences and the subsequent evaluation of each sequence s energy in a particular structure (and in soe studies nonnative structures). While uch can be learned about the sequence-structure apping, 44,45 with the exception of very siple systes, only a sparse sapling of sequence space of real proteins is possible. Our knowledge of the principles of protein design reains incoplete, a fact which otivates the use of a cobinatorial approach to protein design. By synthesizing large nubers of peptide sequences, researchers not only enhance their chance of discovering sequences that fold to a particular structure but they also stand to learn what properties foldable sequences in a library share. Cobinatorial ethods can reveal generalizable principles about the forces that stabilize protein structures. The design of specific sequences typically only probes portions of the sequence landscape for a chosen target structure. Given that any sequences can share a coon structure, we seek to understand as large a tract of the sequence energy landscape as possible. In addition, in designing novel proteins, dictating the specific contacts between residues or the precise packing aong residues ay be unnecessary. This fact is underscored by the presence of peptides with a nuber of protein-like properties that have been isolated fro rando sequence libraries. 46,47 Cobinatorial surveys have also been used to search for sequences that are copatible with a given structure. Katekar et al. have synthesized a library of peptide sequences, where a

3 Folding Macroolecules J. Phys. Che. B, Vol. 101, o. 41, binary patterning of hydrophobic and hydrophilic residues was chosen to be consistent with a four-helix bundle. 48 The sites on the interior (exterior) of the structure were chosen to be hydrophobic (hydrophilic). The precise aino acids at the these positions, however, was allowed to fluctuate. At each hydrophobic and hydrophilic position, five and six possible residue types, respectively, were peritted. Of a sapling of 48 sequences that were correctly expressed, 29 of these were protein-like in that they were resistant to proteases. Soe of these sequences also folded to copact structures and had significant secondary structure. This partial design, based on a siple binary pattern, not only yielded olecules with proteinlike properties but the experient confired that nonspecific hydrophobic interactions, with concoitant secondary structure foration, are iportant deterinants in specifying the folded structure. The issue of which aino acids in a sequence are necessary to for a stable structure and which are not ay be addressed by studying utations in naturally occurring proteins. The Matthews group has synthesized any stable utants of T4 lysozye 49 and even a few stable double utants. 50 The core residues of λ repressor can be very robust with respect to utation; 70% of 125 cobinations of the hydrophobic residues yield biologically active proteins. Axe et al. have found that 12 of the 13 core residues of barnase ay be substituted and it still retains its biological activity, 51 thus indicating that siply aintaining a hydrophobic core is a doinant factor in the stability of the enzye. Itzhaki et al. have found that chyotrypsin inhibitor 2 can support 100 different utations. 52 In addition to laboratory utagenesis, researchers have also considered the utability of the residues in structures contained in the PDB. 53 Often, though not always, these conserved sites are crucial to the function or stability of the protein. Fro the viewpoint of designing new proteins and understanding utational variability using cobinatorial libraries, the synthetic protein cheist would prefer to know a priori the nuber of sequences that are likely to fold to a given structure and the identities of those sequencessat least in soe average sense. Given the nuber of possible sequences of even a oderately sized protein of 100 residues, , obtaining an understanding of even this library sees at first glance to be an insuperable task. onetheless, a serviceable theory for the distribution of the energies of different sequences in the target structure can guide the cheist in his design and interpretation of his experients. Katekar et al. used qualitative cheical concepts to guide the design of their four-helix bundle library, but a quantitative theory has the potential to have even a stronger and ore detailed predictive power. In this paper, we present such a theory. Counting the nuber of sequences as a function of the energy is a task well-suited to statistical echanics. We are further aided by the observation that the protein design proble has uch in coon with the Ising agnet of condensed atter theory. 33 A coonly used approach in condensed atter physics is ean-field theory, wherein we consider average local energies associated with each site that are deterined by that site s local environent. Meanfield theory has seen extensive application to protein folding and the exploration of conforations, but here we use it to quantify the characteristics of sequence space. Here the internal variables are not the conforational states of the onoers but, rather, the type of aino acid that is present at each sequence position. The theory can also accoodate restrictions iposed by the researcher, such as constraints on the nuber of each type of onoer and on the residue identities allowed at particular positions in the sequence. The theory provides us with a way to estiate not only the nuber of sequences for a given overall energy but also the probability that each sequence position is occupied by a given onoer type. The theory provides a convenient eans to evaluate different cobinatorial design strategies, where we can copare the effects of changing residue pattern and target structure. In this report, we present the odel and copare it with results of an exactly solvable lattice polyer. 2. Theory for the uber and Coposition of Sequences Copatible with a Chosen Structure In this section we present a ean-field theory for counting the nuber of sequences copatible with a particular structure. The theory also yields the probability that each site is occupied by a particular type of onoer. Here we consider only the distribution of energies for different sequences in a chosen structure. Although siply finding the iniu energy sequence can be a isleading design strategy for soe energy functions, 58 sequences that fold to the chosen structure ust surely reside aong those that are sufficiently low in energy. Thus the theory is useful in that, for a particular design strategy, it provides estiates of the nuber and average identities of the low energy sequences. In our developent of the theory, we neglect sapling issues. All possible sequences of a typical protein cannot be synthesized, e.g., a polypeptide of length of 100 residues has ore than possible sequences. Thus any laboratory experient ust reflect a sapling of all possible realizations. In what follows, we assue that the space of all possible sequences is sapled uniforly, so that the probabilities presented below should iic those seen experientally. Since we are interested in characterizing the full sequence space, we neglect variation in the nubers of sequences due to incoplete sapling of the sequence space and non-unifor distribution of the onoers available at each sequence position that ay be present in actual experients. 48 We outline the presentation of the theory. In section 2.1 we show how the probability that sequences are less than a certain energy ay be calculated using the nuber of sequences as a function of the energy. Thus we need to deterine a icrocanonical entropy S(E), which is generally an involved calculation. For one-body energy functions, however, we can write siple expressions for the entropy (section 2.2) that are valid for arbitrary (section 2.2.1) and fixed (section 2.2.2) total copositions. We review how effective one-body energies ay be obtained using a siple ean-field theory in section 2.3. Last, we discuss the quantities of cheical interest that the theory yields, the ost iportant of which are S(E), the logarith of the nuber of sequences, and the individual probability w i (R) that a site i is occupied by a particular onoer type R Distribution of Energies Over Sequences. Here we let E be the energy of a sequence when it assues the folded or target conforation. Ω s (E) is the nuber of sequences having energy E in the chosen structure, and Ω s is the total nuber of sequences. Both Ω s (E) and Ω s satisfy any constraints concerning the total nuber of each type of onoer or the allowed onoer identities at particular sites. The sequence entropy S(E) is the logarith of the nuber of states having energy E in the folded structure. S(E) ) k B ln Ω s (E) (1) The sequence entropy S(E) is defined in a way that is exactly analogous to Boltzann s epitaphic equation for the entropy.

4 8378 J. Phys. Che. B, Vol. 101, o. 41, 1997 Saven and Wolynes In calculating the nuber of sequences of a particular energy, we focus on estiating S(E), exploiting the tools of statistical therodynaics. For low energies, S(E) an increasing function of E. AsE increases, the frustration present in the folded structure increases. As the nuber of unfavorable interactions increases, there are ore ways to distribute the, and the sequence entropy increases. To find the probability of finding a sequence with energy less than E in the chosen target structure, we integrate the Ω s (E) up to the chosen energy E. E f(e) ) - de Ωs (E )/Ω s (2) Alternately, if the allowed energies are discrete, as they are in a lattice odel, we su over allowed energies E < E. E f(e) ) Ω s (E )/Ω s (3) E ) The Sequence Entropy for S(E) for One-Body Energy Functions. We now turn to the estiation of S(E). Generally, for systes coprising any interacting sites, calculating the icrocanonical entropy exactly is nontrivial. If the energy of the syste is the su of individual energies of each coponent, however, then equations for the entropy ay be straightforwardly obtained. The ideal gas is one exaple. In our case, the interacting eleents are the individual residues, as they occur the target structure. Effective energy functions, i.e., the profile scoring functions, have been developed by Bowie et al. that depend only on the identity and location of a residue in a particular structure. 16 Hence these types of energy functions are of a purely one-body for, for which the theory presented here is exact. For ore coplicated energy functions that involve any-body interactions, we can use the ean-field theory presented in section 2.3 to obtain a self-consistent effective one-body energy function. For such an energy function, the energy of a particular sequence in the chosen structure is given by E ) ɛ i (R i ) (4) Here is the length of the polyer in onoer units, and ɛ i is the effective one-body energy at site i in the structure. The sequence is denoted by an ordered list of onoer identities {R 1... R }, where R i is the onoer type present at sequence position i. Since the conforation of the polyer is fixed, the index i labels both a particular onoer s (one-diensional) position in the sequence and its (three-diensional) position in the target structure. ɛ i is a function of the onoer identity R i present at site i. The sequence entropy is obtained by axiizing the entropy S(E) with respect to any unconstrained internal paraeters. Since the energy ay be written as a su of individual one-body ters, the S(E) ay be expressed as S(E)/k B )- R w i (R i )lnw i (R i ) (5) where k B is Boltzann s constant, and is the nuber of onoer types. In the case of peptides, is just the nuber of different aino acids used in synthesizing the sequences. Here w i (R) is the probability that residue type R is at position i in the structure. The sequence entropy S(E) is axiized subject to the constraint that the total energy is conserved. We incorporate this constraint by restricting the value of the internal energy U such that U ) E, where U ) R ɛ i (R i )w i (R i ) (6) An additional constraint is that the su of the identity probabilities on each site is unity, i.e., each site ust be occupied by at least one onoer 1 ) w i (R) (7) R)1 If by design, onoer R is precluded fro occupying site i, then we have additional constraints of the for w i (R) ) 0, if R is not allowed at site i (8) There are - i such constraints for each sequence position i, where i is the nuber of allowed residue identities at position i. As is done in conventional statistical therodynaics, 59 the values of the individual identity probabilities are those that axiize the entropy, eq 5, subject to the constraints in eqs 6-8. We perfor this axiization using the calculus of variations and introduce Lagrange ultipliers for each of the constraints. The ultipliers that arise fro the constraints given in eq 7 are easily evaluated using the constraint conditions eq Arbitrary Total Coposition. Here we consider the case where the only restrictions on the total nubers of each type of aino acid in a sequence are those dictated by the allowed aino acids at each site (see eq 8). For such a syste, after axiization of eq 5 subject to the constraints eqs 6-8, the sequence entropy ay be written as S(E) k B ) β U + ln z i (9) where β is an unevaluated Lagrange ultiplier, and z i ) ξ i (R i ) exp(-β ɛ i (R i )) (10) R i )1 ξ i (R i ) arises due to possible constraints on the allowed residue types at each sequence location (see eq 8), and it obeys ξ i (R i ) ) { 1 if residue R i is allowed at site i 0 if not (11) The site identity probabilities are given by a for that looks very uch like a Boltzann weight, w i (R) ) z ī 1 ξ i (R) exp(-β ɛ i (R)) (12) The Lagrange ultiplier β satisfies the constant energy constraint (see eq 6). ɛ i (R i ) exp(-β ɛ i (R i )) U ) R ξ i (R i ) (13) z i The constant energy constraint eq 6 introduces the Lagrange

5 Folding Macroolecules J. Phys. Che. B, Vol. 101, o. 41, ultiplier β. Using the analogy to statistical therodynaics, we see that β -1 has all the properties of an effective teperature, the conjugate variable of the energy in therodynaics, e.g., k B β ) S/ E. As the energy increases, so does β -1. Other researchers have referred to β -1 as the selection teperature. 29,31 In the language of therodynaics, β -1 is the teperature at which the average internal energy of the syste is U. For large Ω s (E), sequences sharing the sae effective teperature β -1 will have the sae internal energy. For large β -1 (high E near the axiu of S(E)), any different rearrangeents of the onoer types are consistent with these high energies, and S(E) is relatively insensitive to the precise location of each onoer type. At low effective teperatures β -1 (low E), the requireents on the sequences having these low energies are severe, and the energy of a sequence in the chosen structure is acutely sensitive to the particular ordering of the onoer types. We noralize the icrocanonical entropy so that it yields the correct total nuber of possible sequences. For an arbitrary total coposition, when no restrictions are placed on the nuber of each type of onoer, Recall that i is the nuber of onoer identities allowed at position i. The sequence entropy is noralized so that ) Ω s (15) Here E in and E ax are the iniu and axiu allowed values for a given energy function Fixed Coposition. Here we treat the case where the nuber of each type of onoer n(r) is the sae for each sequence. This we refer to as the constraint of constant coposition. The locations of the individual identities, however, can fluctuate and are constrained only by eq 8. Let n(r) bethe total nuber of residues of onoer type R. Recall that w i (R) is the probability that site i has residue type R. In applying this constraint, we see that for each sequence the su of the individual identity probabilities over sequence positions ust equal the nuber of onoers of that type. Recall that if the residue identity R is not allowed at site i, i.e., ξ i (R) ) 0, then w i (R) ) 0, as in eq 12. As in section 2.2.1, we axiize the entropy S(E) with respect to the constraints of constant energy (eq 6) and probability conservation (eq 7). We also include the constant coposition constraints eq 16, each of which has a corresponding Lagrange ultiplier we denote as the product β µ R. The resulting icrocanonical entropy is S(E) is where Ω s ) i (14) E axde Ein exp ( S(E) k B ) n(r) ) w i (R) (16) S(E)/k B ) β U + ln z i + β µ Rn(R) (17) R)1 z i ) ξ i (R) exp(-β (ɛ i (R) + µ R)) (18) R)1 The sequence identity probabilities becoe w i (R) ) z ī 1 ξ i (R) exp(-β (ɛ i (R) + µ R)) (19) where z i is given in eq 18. Using the constant coposition constraint in eq 16 we can write down a self-consistent equation that µ R ust satisfy. β µ R ) ln[ z 1 ī exp(-β ɛ i - ln n(r) (20) (R))] We can again draw the analogy to statistical therodynaics. As we discussed previously, β -1 has the properties of an effective teperature. The constant coposition constraints eq 16 require that the nuber of each onoer type is constant. The therodynaic conjugate variables of the nubers of each coponent are their effective cheical potentials µ R. The effective cheical potential µ R is an effective free energy per particle for onoer type R. For a specified overall coposition, soe onoer types ay be ore favorable than others on average. The effective cheical potentials readjust to aintain constant coposition in such cases. When the nuber of each onoer type is predeterined, the total nuber of sequences is given siply by the corresponding ultinoial coefficient 59,60 Ω s )! n(r)! R)1 (21) where the reader will recall that is the chain length in onoers, is the total nuber of available residues, and n(r) is the nuber of residues of type R. Equation 21 holds when all onoer types are allowed at each of the sequence positions, i.e., i ) for all i. The sequence entropy S(E) in eq 17 is noralized according to eq 15, where eq 21 is used to define Ω s Mean Field Treatent of the Molecular Energy Function for a Given Target Structure. In section 2.2, we presented the sequence entropy S(E) and the sequence identity probabilities w i (R) for arbitrary and constant copositions. The foralis applies when the energy ay be written as a su of one body ters. Though soe authors have chosen to describe sequence-structure copatibility in ters of one-body functions 16 for which the theory is exact, the ajority of energy functions involve two-body interactions in the for of a contact potential. 17,19,22 These are paraetrized such that the probability that two residues are within a prescribed distance of one another yields an effective contact energy. Typically, such potentials are derived fro the frequencies with which each type of contact occurs in a representative set of known protein structures. Soe authors have also even included three-body interactions between residues. 17,61 The siple foralis presented below ay be easily extended to energy functions involving arbitrary anybody interactions. For the sake of siplicity, we consider here only energy functions that involve a su of one- and two-body ters. The siple route to an effective one-body energy is based upon a well-known tool in condensed atter physics, that of a ean-field theory. The interactions aong the coponents of a particular syste ake the syste s statistical behavior nontrivial; the energy present at a particular site depends on the states, identities, and positions of the other coponents with which it interacts. In a siple for of ean-field theory, 62-65

6 8380 J. Phys. Che. B, Vol. 101, o. 41, 1997 Saven and Wolynes the fluctuating local energy at a particular position is replaced by the average energy of the interactions of the particle with its neighbors. The averaging is done over different realizations that are consistent with a particular set of therodynaic conditions. We take a siilar approach here. We note that any other researchers have applied the technique to biocheical probles, but usually with the intent of understanding conforational statistics Shakhnovich and Gutin have noted that protein design has any features in coon with the rando field Ising agnet, 33 and this approach otivated their design algorith. Other physicists have developed design algoriths with this analogy in ind. 37,40 Though ean-field theory does poorly in the critical region for three-diensional systes, 64,65 the critical region is not of interest in design probles where we are concerned priarily with configurations (sequences) of low energy or low effective teperature. Furtherore, ean-field theory is known to iprove as the diensionality of the syste increases, that is as each particle interacts with a larger nuber of the reaining particles. The average coordination nuber c i in proteins can fluctuate and depends on the prescribed interaction radius r c for two-body interactions, but in general it can be quite large: when r c ) 12 Å, as it does for soe energy functions, 22,61 then c i 20. (In this crude exaple, r c is the distance between alpha carbons, and we have used the PDB structures of yoheerythrin, yoglobin, and ribonuclease A). The energy function ost coonly applied to proteins includes one- and two-body ters. For a sequence whose site identities are specified by the configuration {R 1... R }, where R i is the identity of site i, the energy of the sequence when it takes on the native conforation is E ) γ i (R i ) + γ ij (R i,r j ;r ij ) (22) i*j where is the total nuber of residues, γ i (R i ) is an energy contribution due to the presence of residue type R i at site i, and γ ij (R i,r j ;r ij ) is the interaction between sites i and j when their onoer types are R i and R j, respectively. Typically, γ ij is a function of r ij the distance of separation between sites i and j. The one-body ter γ i quantifies the propensity of onoer types to reside in a particular structural context. 16,22 For exaple, different aino acids will have different propensities to reside in an alpha helix or other secondary structure. Also included in γ i are any propensities for onoers to be buried within the globule or exposed to solvent at the target structure s surface, the surface accessibility. The two-body ter γ i,j can be used to quantify interresidue contact propensities, 19,20,22,23 as well as excluded volue interactions. For the sequence design proble discussed here, all the r ij are deterined by the chosen target conforation and do not fluctuate. Therefore we will suppress the dependence of γ ij on r ij. We define ɛ i (R i ) as the local field (energy contribution) at site i when it is occupied by residue type R i due to any local energy contribution plus that is due to its interaction with its neighbors. ɛ i (R) ) γ i (R) + γ ij (R,R j ) (23) j)1 In our ean field treatent, we assue that ɛ i (R i ) can be replaced by its average value. The best choice for an effective one-body site energy is well-known and ay be obtained using the Gibbs-Bogobiulov inequality 66 ɛ i (R) ) γ i (R) + j)1 The average denoted by... is over all the possible identities of the neighboring residues at a given energy. The joint probability for a particular arrangeent of onoer identities aong site i s neighbors is siply the product of the individual site identity probabilities w j (R j ). Since w j (R j ) is noralized to unity (see eq 7), the average contribution due to the two-body ter is γ ij (R,R j ) j)1 ) j)1 Recall that the γ ij are nonzero only if residues i and j interact with one another. Therefore, the su only has nonzero contributions fro the neighbors of residue i. The resulting effective energy function is a su of one-body ters ɛ i (R), ɛ i (R) ) γ i (R) + j)1 Equation 26 provides us with an expression for the effective local field in ters of the putative one- and two-body interactions γ i and γ ij. The reader will notice, however, that the w j (R j ) also are functions of the ɛ i (R) (see eqs 12 and 19). Thus for a given energy function, we solve self-consistently for the weights w i (R) and the effective site energies ɛ i (R). ote that, with this for of the effective site energy eq 26, the contribution due to two-body interactions in eq 6 is double counted. 54 Therefore, we ust subtract one-half the value of this two-body energy contribution. E ) U - 1 / 2 R j)1 where U is defined as in eq 6. Alternately, we could have also subtracted a constant ter fro each of the energies ɛ i (R). Regardless, no correction is necessary for the individual site identity probabilities w i (R). ote that this developent of an effective one-body energy function is siilar in spirit to the quasi-cheical approach in the theory of the Ising agnet and binary alloys. 63,64,66 In the protein case considered here, however, all the sites are not equivalent. The equivalence aong sites in the lattice Ising agnet siplifies the solution of the that proble, but it is the nonequivalence of the sites in the protein proble, the heterogeneity of the sequence and structure, that is paraount for the ability of the protein to fold to a unique structure Suary of Theory. We now suarize the ipleentation of the theory. In section 2.3 we presented how an effective one-body energy (eq 26) ay be obtained fro an energy function that involves a one-body (profile) and twobody (interonoer contact) interactions. The effective site energies ɛ i (R) and site identity probabilities w i (R) can be solved for self-consistently using eqs 26 and 12, or, if the coposition is fixed, eqs 26 and 19. The icrocanonical sequence entropy γ ij (R,R j (24) ) Rj)1 γ ij (R,R j )w j (R j ) (25) Rj)1 γ ij (R,R j )w j (R j ) (26) Rj)1 γ ij (R i,r j )w i (R i )w j (R j ) (27) U ) R w i (R i ) ɛ i (R i ) (28)

7 Folding Macroolecules J. Phys. Che. B, Vol. 101, o. 41, S(E) is then obtained using eq 9 or eq 17. In each case, the sequence entropy is noralized according to the total nuber of possible sequences (see eq 21). We ay then use eq 2 to obtain the cuulative probability that a sequence having energy less than E in the target structure exists. In practice, we solve for the ɛ i (R k ) and µ k for a given value of β, and then use eq 27 to deterine the corresponding energy for these paraeters. In therodynaic ters, this is equivalent to choosing a teperature for the syste β -1 such that the internal energy is equal to the energy of interest. For a given target structure, the theory yields the probability w i (R) that a site i in the sequence is occupied by the onoer type R at a given energy. The theory also provides the logarith of the nuber of sequences S(E) that have a particular energy in the target structure. The theory ay be used to calculate a quantity that reflects the utational variability of each site. We will refer to this quantity s i as the local sequence entropy. 45,53 s i )- w i (R)lnw i (R) (29) R)1 where the su is over the residue identities at site i. Depending upon the structure, soe sites are ore likely to have a particular identity than others. For a given value of the overall energy E, the local sequence entropy s i is a easure of how likely utations are at a particular residue position. ote that if only one residue type is peritted at site i, then s i ) 0. If all possible residue types are equally likely at site i (w i (R) ) 1/ i ), then s i ) ln i. Recall that i is the nuber of onoer identities allowed at site i. At a given total energy E, a site having a sall value of s i has an identity that is likely to be conserved across different sequences. 3. Results and Applications of the Model to Lattice Models of Proteins The focus of this study is to illustrate the validity of the selfconsistent ean-field theory presented in section 2. For natural protein structures, there is currently soe controversy about the proper choice of energy functions, 27 and the nuber of available onoers (the aino acids) is large (20), iplying that the nuber of possible sequences is gigantic (20 ). For these reasons, we initially apply the theory to a syste that ay be uch ore easily understood. We choose the well-studied 27- er cubic lattice polyer. 34,67-70 In such lattice odels, siple energy functions ay be used for which soe sequences exhibit protein-like behavior: any conforations are possible but one conforation is therodynaically preferred. The siplicity of the odel akes it easy to discuss quantities that appear in the theory in ters of particular structural features. Furtherore, if the nuber of allowed onoer types is sufficiently sall, all possible sequences ay be enuerated for any given target structure, and hence the lattice polyer provides an exactly solvable odel that we can copare with the theory presented in section 2. We choose energy functions that are of a pure two-body (contact) for. In the theory presented in section 2, no siplifying approxiations are ade with regard to the onebody ter in a given energy function (see eq 22). In addition, ost of the energy functions that have been used for the lattice odels involve only interresidue contacts. Therefore, in illustrating the theory, we consider energy functions that involve only two-body interactions, i.e., all the γ i (R) ) 0. In the inialist odel we consider here, there are just two types of residues, H and P, which crudely iic hydrophobic and Figure 1. Copact structures of the 27-er cubic lattice odel. (A) The structure of Li et al. that is the conforational ground state of 3794 sequences. 70 (B) The ground state conforation of the 002 sequence of Socci and Onuchic. 71 hydrophilic residues in proteins. 5 we consider is of the for E ) i j>i Each energy function that γ ij (R i,r j ) (30) where R i, the aino acid identity, is either H or P. The γ ij are nonzero only if the sequence positions i and j are nearest neighbors in the target structure and if i and j are not adjacent in the sequence ( i - j > 1). In addition, the contact energies between dissiilar residues are syetric γ ij (H,P) ) γ ij (P,H). Thus the energy of a particular sequence is solely a function of the nuber of H-H, H-P, and P-P contacts that are ade within the target structure. The two energy functions that we consider in sections 3.1 and 3.2 differ only in the relative strengths of these three types of contacts. In each case, the energy function is known to possess therodynaically foldable sequences. That is, for soe sequences, at sufficiently low teperatures one conforation has the doinant Boltzann weight. Due to the discrete nature of both the energy functions and the lattice conforations, the exact total energy of each sequence ay only take on specific discrete values. We consider two copact lattice conforations of recent interest. For a specific choice of the contact energy function, a conforation recently highlighted by Li et al. (structure A in Figure 1) is the lowest energy conforation for 3794 sequences. 70 In contrast, soe conforations are the lowest energy state of only a few sequences, or none at all. For a different choice of energy function, structure B in Figure 1 is the lowest energy conforation of just two sequences. One of these, the 002 sequence, has been studied extensively with respect to its folding kinetics and therodynaics. 71,72 In the reainder of this section, we copare the exact results with those of the theory, which takes as input only the target structure and a given energy function. The exact results are obtained by enuerating all sequences for each target conforation. With only two types of onoer, the total nuber of possible sequences for the 27-er is large, 2 27 ) , but is still easy to enuerate coputationally. 67,70,71 For a given target structure, we group sequences according to their energy. In so doing, we obtain the exact S(E) that ay be copared with the self-consistent theory. Fro the explicit tabulation, we ay also obtain the exact individual site identity probabilities w i (R) and, using the w i (R), the exact local sequence entropies s i. We copare these three quantities with the values that are estiated by the theory Energy Function that Favors Buried Hydrophobic Residues. In this section we consider an energy function of the pure contact for in eq 30. Here we choose a for of the

8 8382 J. Phys. Che. B, Vol. 101, o. 41, 1997 Saven and Wolynes Figure 2. The sequence entropy S(E)vsEfor structure A. The energy function is that specified by eq 31. The heterogeneity of the sequences are varied by changing the ratio of polar (P) to hydrophobic (H) onoers. The following total P:H copositions were used: 14:13 (circles), 17:10 (squares), 20:7 (diaonds), 24:3 (triangles), and 26:1 (inverted triangles). The curves are the theoretical results for each coposition. energy function that is very siilar to one used recently by Li et al. 70 γ ij (H,H) )-3ɛ, γ ij (P,P) ) 0ɛ, and γ ij (H,P) )-ɛ (31) For this choice of the energy function, copact conforations have lower energies than extended ones. In addition, it satisfies the inequality γ ij (H,H) < γ ij (H,P) < γ ij (P,P), and the energy decreases for those sequences that have larger nuber of contacts involving H onoers. Hence, the energy function favors placing H onoers in the interior of a particular structure, since positions in the interior have the largest coordination nubers. Sequences with any H onoers in the interior are ore likely to have large nubers of the lowest energy H-H contacts. Thus the energy function favors placing H onoers on the interior of a conforation, which confors with the burial of hydrophobic residues seen in folded protein structures. This energy function also satisfies γ ij (H,H) + γ ij (P,P) < 2γ ij (H,P), which iplies that dissiilar onoers favor segregation within the collapsed globule. In the study of Li et al., an effective H-H contact strength of γ ij (H,H) )-2.3ɛ was used, but these authors ention that their results were not sensitive to the precise value of γ ij (H,H), as long as the above inequalities are satisfied. For structure A, we consider the nuber of sequences having a given energy in this target conforation. In Figure 2, we plot the sequence entropy S(E) for five different overall onoer copositions. The heterogeneity of the sequences decreases as the ratio of polar to hydrophobic residues (P:H) increases. The copositions considered have the following values (P:H): 14:13, 17:10, 20:7, 24:3, and 26:1. In each case there is excellent agreeent between the theory and the exact results. The predicted entropy S(E) is slightly lower than the exact result for low and high energies, but it covers the sae range of energies as the exact result with rearkable fidelity. The axiu value of S(E) decreases with increasing hoogeneity, since there are fewer sequences for the ore hoogeneous copositions (see eq 21). ote that, as the sequences gain increasing P content and becoe ore like a hoopolyer, the energies at the axia of these distributions increases. With Figure 3. The fraction of sequences f(e) having energy less than E for structure A. The energy function in eq 31 was used. The following total P:H copositions were used: 14:13 (circles), 17:10 (squares), 20:7 (diaonds), 24:3 (triangles), and 26:1 (inverted triangles). The curves are the theoretical results for each coposition. increasing P content, the nuber of stabilizing H-P and H-H contacts decreases. Furtherore, as the heterogeneity of the sequences decreases, the width of each distribution also decreases. The ost heteropolyeric coposition (14:13) spans the largest range of energy values. The corresponding P- hoopolyer distribution (27:0) has zero width at E ) 0 (not shown). In Figure 3 we present the cuulative probability f(e) for each of the five chosen copositions (see eq 3). Since f(e) is siply the integral of S(E), close agreeent with the exact result is again obtained. ote that at low energies, S(E) drops rapidly as the energy decreases. It is this low-energy region that is ost likely to contain sequences that fold to the target structure. The identity probabilities w i (R) reveal which residue types at each position are responsible for stabilizing these low energy sequences. In Figure 4, we plot the probability w i (P) that each site of conforation A is occupied by a polar residue for different energies. The P:H coposition in this case is 14:13. By coparing with Figure 2, we see that the nubers of sequences at each energy are Ω s (E )-50ɛ) ) 118, Ω s (E )-45ɛ) ) , and Ω s (E )-45ɛ) ) ote that at each of the energies, there is excellent agreeent between the theory and the exact tabulation. At the lowest energy considered E ) -50ɛ, any of the sites are predoinantly occupied by one type of residue; w i (P) 1 or 0. For the given coposition, there are severe restrictions on the allowed residues at each position if a sequence is to have this low energy in the target structure. As the energy is increased, these restrictions are relaxed, and a few high-energy contacts are present. The site identity probabilities take on values interediate between 0 and 1. At the energy that axiizes S(E) nearly all residue types are allowed at each position, subject only to the constraint of constant coposition: w i (P) 14/ (not shown). On the low-energy side of S(E), the residues that are ost likely to reain hydrophobic with increasing energy are those in the center of this conforation, residues 1, 26, and 27 (see Figure 1). Each of these residues is coordinated with four other nonbonded neighbors, a coordination nuber that is larger than that of any of the other residues. For low-energy sequences, their high coordination nubers iply that these residues are likely to be hydrophobic (H). Hence w i (P) should be sall. The energy function favors hydrophobes at sites of high coordination, which are likely to be buried within the structure.

Optimizing energy potentials for success in protein tertiary structure prediction Ting-Lan Chiu 1 and Richard A Goldstein 1,2

Optimizing energy potentials for success in protein tertiary structure prediction Ting-Lan Chiu 1 and Richard A Goldstein 1,2 Research Paper 223 Optiizing energy potentials for success in protein tertiary structure prediction Ting-Lan Chiu 1 and Richard A Goldstein 1,2 Background: Success in solving the protein structure prediction

More information

Using EM To Estimate A Probablity Density With A Mixture Of Gaussians

Using EM To Estimate A Probablity Density With A Mixture Of Gaussians Using EM To Estiate A Probablity Density With A Mixture Of Gaussians Aaron A. D Souza adsouza@usc.edu Introduction The proble we are trying to address in this note is siple. Given a set of data points

More information

Chapter 6 1-D Continuous Groups

Chapter 6 1-D Continuous Groups Chapter 6 1-D Continuous Groups Continuous groups consist of group eleents labelled by one or ore continuous variables, say a 1, a 2,, a r, where each variable has a well- defined range. This chapter explores:

More information

Kinetic Theory of Gases: Elementary Ideas

Kinetic Theory of Gases: Elementary Ideas Kinetic Theory of Gases: Eleentary Ideas 17th February 2010 1 Kinetic Theory: A Discussion Based on a Siplified iew of the Motion of Gases 1.1 Pressure: Consul Engel and Reid Ch. 33.1) for a discussion

More information

Kinetic Theory of Gases: Elementary Ideas

Kinetic Theory of Gases: Elementary Ideas Kinetic Theory of Gases: Eleentary Ideas 9th February 011 1 Kinetic Theory: A Discussion Based on a Siplified iew of the Motion of Gases 1.1 Pressure: Consul Engel and Reid Ch. 33.1) for a discussion of

More information

The Thermal Dependence and Urea Concentration Dependence of Rnase A Denaturant Transition

The Thermal Dependence and Urea Concentration Dependence of Rnase A Denaturant Transition The Theral Dependence and Urea Concentration Dependence of Rnase A Denaturant Transition Bin LI Departent of Physics & Astronoy, University of Pittsburgh, Pittsburgh, PA 15260, U.S.A Feb.20 th, 2001 Abstract:

More information

Nonuniqueness of canonical ensemble theory. arising from microcanonical basis

Nonuniqueness of canonical ensemble theory. arising from microcanonical basis onuniueness of canonical enseble theory arising fro icrocanonical basis arxiv:uant-ph/99097 v2 25 Oct 2000 Suiyoshi Abe and A. K. Rajagopal 2 College of Science and Technology, ihon University, Funabashi,

More information

Protein Structure Prediction in a 210-Type Lattice Model: Parameter Optimization in the Genetic Algorithm Using Orthogonal Array

Protein Structure Prediction in a 210-Type Lattice Model: Parameter Optimization in the Genetic Algorithm Using Orthogonal Array Journal of rotein Cheistry, Vol. 8, No., 999 rotein Structure rediction in a 0-Type Lattice Model: araeter Optiization in the Genetic Algorith Using Orthogonal Array Zhirong Sun, Xiaofeng Xia, Qing Guo,

More information

Model Fitting. CURM Background Material, Fall 2014 Dr. Doreen De Leon

Model Fitting. CURM Background Material, Fall 2014 Dr. Doreen De Leon Model Fitting CURM Background Material, Fall 014 Dr. Doreen De Leon 1 Introduction Given a set of data points, we often want to fit a selected odel or type to the data (e.g., we suspect an exponential

More information

Statistical properties of contact maps

Statistical properties of contact maps PHYSICAL REVIEW E VOLUME 59, NUMBER 1 JANUARY 1999 Statistical properties of contact aps Michele Vendruscolo, 1 Balakrishna Subraanian, 2 Ido Kanter, 3 Eytan Doany, 1 and Joel Lebowitz 2 1 Departent of

More information

This model assumes that the probability of a gap has size i is proportional to 1/i. i.e., i log m e. j=1. E[gap size] = i P r(i) = N f t.

This model assumes that the probability of a gap has size i is proportional to 1/i. i.e., i log m e. j=1. E[gap size] = i P r(i) = N f t. CS 493: Algoriths for Massive Data Sets Feb 2, 2002 Local Models, Bloo Filter Scribe: Qin Lv Local Models In global odels, every inverted file entry is copressed with the sae odel. This work wells when

More information

Intelligent Systems: Reasoning and Recognition. Perceptrons and Support Vector Machines

Intelligent Systems: Reasoning and Recognition. Perceptrons and Support Vector Machines Intelligent Systes: Reasoning and Recognition Jaes L. Crowley osig 1 Winter Seester 2018 Lesson 6 27 February 2018 Outline Perceptrons and Support Vector achines Notation...2 Linear odels...3 Lines, Planes

More information

Quantum algorithms (CO 781, Winter 2008) Prof. Andrew Childs, University of Waterloo LECTURE 15: Unstructured search and spatial search

Quantum algorithms (CO 781, Winter 2008) Prof. Andrew Childs, University of Waterloo LECTURE 15: Unstructured search and spatial search Quantu algoriths (CO 781, Winter 2008) Prof Andrew Childs, University of Waterloo LECTURE 15: Unstructured search and spatial search ow we begin to discuss applications of quantu walks to search algoriths

More information

2 Q 10. Likewise, in case of multiple particles, the corresponding density in 2 must be averaged over all

2 Q 10. Likewise, in case of multiple particles, the corresponding density in 2 must be averaged over all Lecture 6 Introduction to kinetic theory of plasa waves Introduction to kinetic theory So far we have been odeling plasa dynaics using fluid equations. The assuption has been that the pressure can be either

More information

Annealing contour Monte Carlo algorithm for structure optimization in an off-lattice protein model

Annealing contour Monte Carlo algorithm for structure optimization in an off-lattice protein model JOURNAL OF CHEMICAL PHYSICS VOLUME 120, NUMBER 14 8 APRIL 2004 Annealing contour Monte Carlo algorith for structure optiization in an off-lattice protein odel Faing Liang a) Departent of Statistics, Texas

More information

McMillan Mayer theory for solvent effects in inhomogeneous systems: Calculation of interaction pressure in aqueous electrical double layers

McMillan Mayer theory for solvent effects in inhomogeneous systems: Calculation of interaction pressure in aqueous electrical double layers McMillan Mayer theory for solvent effects in inhoogeneous systes: Calculation of interaction pressure in aqueous electrical double layers Roland Kjellander, Alexander P. Lyubartsev, and Stjepan Marčelja

More information

Probability Distributions

Probability Distributions Probability Distributions In Chapter, we ephasized the central role played by probability theory in the solution of pattern recognition probles. We turn now to an exploration of soe particular exaples

More information

arxiv:cond-mat/ v1 [cond-mat.stat-mech] 22 Oct 1998

arxiv:cond-mat/ v1 [cond-mat.stat-mech] 22 Oct 1998 arxiv:cond-at/9810285v1 [cond-at.stat-ech] 22 Oct 1998 Statistical Properties of Contact Maps Michele Vendruscolo (1), Balakrishna Subraanian (2), Ido Kanter (3), Eytan Doany (1) and Joel Lebowitz (2)

More information

Measuring Temperature with a Silicon Diode

Measuring Temperature with a Silicon Diode Measuring Teperature with a Silicon Diode Due to the high sensitivity, nearly linear response, and easy availability, we will use a 1N4148 diode for the teperature transducer in our easureents 10 Analysis

More information

arxiv: v1 [cs.ds] 3 Feb 2014

arxiv: v1 [cs.ds] 3 Feb 2014 arxiv:40.043v [cs.ds] 3 Feb 04 A Bound on the Expected Optiality of Rando Feasible Solutions to Cobinatorial Optiization Probles Evan A. Sultani The Johns Hopins University APL evan@sultani.co http://www.sultani.co/

More information

Interactive Markov Models of Evolutionary Algorithms

Interactive Markov Models of Evolutionary Algorithms Cleveland State University EngagedScholarship@CSU Electrical Engineering & Coputer Science Faculty Publications Electrical Engineering & Coputer Science Departent 2015 Interactive Markov Models of Evolutionary

More information

13 Harmonic oscillator revisited: Dirac s approach and introduction to Second Quantization

13 Harmonic oscillator revisited: Dirac s approach and introduction to Second Quantization 3 Haronic oscillator revisited: Dirac s approach and introduction to Second Quantization. Dirac cae up with a ore elegant way to solve the haronic oscillator proble. We will now study this approach. The

More information

Polygonal Designs: Existence and Construction

Polygonal Designs: Existence and Construction Polygonal Designs: Existence and Construction John Hegean Departent of Matheatics, Stanford University, Stanford, CA 9405 Jeff Langford Departent of Matheatics, Drake University, Des Moines, IA 5011 G

More information

General Properties of Radiation Detectors Supplements

General Properties of Radiation Detectors Supplements Phys. 649: Nuclear Techniques Physics Departent Yarouk University Chapter 4: General Properties of Radiation Detectors Suppleents Dr. Nidal M. Ershaidat Overview Phys. 649: Nuclear Techniques Physics Departent

More information

Block designs and statistics

Block designs and statistics Bloc designs and statistics Notes for Math 447 May 3, 2011 The ain paraeters of a bloc design are nuber of varieties v, bloc size, nuber of blocs b. A design is built on a set of v eleents. Each eleent

More information

Spine Fin Efficiency A Three Sided Pyramidal Fin of Equilateral Triangular Cross-Sectional Area

Spine Fin Efficiency A Three Sided Pyramidal Fin of Equilateral Triangular Cross-Sectional Area Proceedings of the 006 WSEAS/IASME International Conference on Heat and Mass Transfer, Miai, Florida, USA, January 18-0, 006 (pp13-18) Spine Fin Efficiency A Three Sided Pyraidal Fin of Equilateral Triangular

More information

In this chapter, we consider several graph-theoretic and probabilistic models

In this chapter, we consider several graph-theoretic and probabilistic models THREE ONE GRAPH-THEORETIC AND STATISTICAL MODELS 3.1 INTRODUCTION In this chapter, we consider several graph-theoretic and probabilistic odels for a social network, which we do under different assuptions

More information

PHY 171. Lecture 14. (February 16, 2012)

PHY 171. Lecture 14. (February 16, 2012) PHY 171 Lecture 14 (February 16, 212) In the last lecture, we looked at a quantitative connection between acroscopic and icroscopic quantities by deriving an expression for pressure based on the assuptions

More information

Optical Properties of Plasmas of High-Z Elements

Optical Properties of Plasmas of High-Z Elements Forschungszentru Karlsruhe Techni und Uwelt Wissenschaftlishe Berichte FZK Optical Properties of Plasas of High-Z Eleents V.Tolach 1, G.Miloshevsy 1, H.Würz Project Kernfusion 1 Heat and Mass Transfer

More information

Extension of CSRSM for the Parametric Study of the Face Stability of Pressurized Tunnels

Extension of CSRSM for the Parametric Study of the Face Stability of Pressurized Tunnels Extension of CSRSM for the Paraetric Study of the Face Stability of Pressurized Tunnels Guilhe Mollon 1, Daniel Dias 2, and Abdul-Haid Soubra 3, M.ASCE 1 LGCIE, INSA Lyon, Université de Lyon, Doaine scientifique

More information

The Weierstrass Approximation Theorem

The Weierstrass Approximation Theorem 36 The Weierstrass Approxiation Theore Recall that the fundaental idea underlying the construction of the real nubers is approxiation by the sipler rational nubers. Firstly, nubers are often deterined

More information

E0 370 Statistical Learning Theory Lecture 6 (Aug 30, 2011) Margin Analysis

E0 370 Statistical Learning Theory Lecture 6 (Aug 30, 2011) Margin Analysis E0 370 tatistical Learning Theory Lecture 6 (Aug 30, 20) Margin Analysis Lecturer: hivani Agarwal cribe: Narasihan R Introduction In the last few lectures we have seen how to obtain high confidence bounds

More information

Feature Extraction Techniques

Feature Extraction Techniques Feature Extraction Techniques Unsupervised Learning II Feature Extraction Unsupervised ethods can also be used to find features which can be useful for categorization. There are unsupervised ethods that

More information

Support Vector Machine Classification of Uncertain and Imbalanced data using Robust Optimization

Support Vector Machine Classification of Uncertain and Imbalanced data using Robust Optimization Recent Researches in Coputer Science Support Vector Machine Classification of Uncertain and Ibalanced data using Robust Optiization RAGHAV PAT, THEODORE B. TRAFALIS, KASH BARKER School of Industrial Engineering

More information

EXPLORING PHASE SPACES OF BIOMOLECULES WITH MONTE CARLO METHODS

EXPLORING PHASE SPACES OF BIOMOLECULES WITH MONTE CARLO METHODS Journal of Optoelectronics and Advanced Materials Vol. 7, o. 3, June 2005, p. 1563-1571 EXPLORIG PHASE SPACES OF BIOMOLECULES WITH MOTE CARLO METHODS A. Bu u * ational Institute for Research and Developent

More information

Supporting information for Self-assembly of multicomponent structures in and out of equilibrium

Supporting information for Self-assembly of multicomponent structures in and out of equilibrium Supporting inforation for Self-assebly of ulticoponent structures in and out of equilibriu Stephen Whitela 1, Rebecca Schulan 2, Lester Hedges 1 1 Molecular Foundry, Lawrence Berkeley National Laboratory,

More information

OBJECTIVES INTRODUCTION

OBJECTIVES INTRODUCTION M7 Chapter 3 Section 1 OBJECTIVES Suarize data using easures of central tendency, such as the ean, edian, ode, and idrange. Describe data using the easures of variation, such as the range, variance, and

More information

ASSUME a source over an alphabet size m, from which a sequence of n independent samples are drawn. The classical

ASSUME a source over an alphabet size m, from which a sequence of n independent samples are drawn. The classical IEEE TRANSACTIONS ON INFORMATION THEORY Large Alphabet Source Coding using Independent Coponent Analysis Aichai Painsky, Meber, IEEE, Saharon Rosset and Meir Feder, Fellow, IEEE arxiv:67.7v [cs.it] Jul

More information

Proc. of the IEEE/OES Seventh Working Conference on Current Measurement Technology UNCERTAINTIES IN SEASONDE CURRENT VELOCITIES

Proc. of the IEEE/OES Seventh Working Conference on Current Measurement Technology UNCERTAINTIES IN SEASONDE CURRENT VELOCITIES Proc. of the IEEE/OES Seventh Working Conference on Current Measureent Technology UNCERTAINTIES IN SEASONDE CURRENT VELOCITIES Belinda Lipa Codar Ocean Sensors 15 La Sandra Way, Portola Valley, CA 98 blipa@pogo.co

More information

Qualitative Modelling of Time Series Using Self-Organizing Maps: Application to Animal Science

Qualitative Modelling of Time Series Using Self-Organizing Maps: Application to Animal Science Proceedings of the 6th WSEAS International Conference on Applied Coputer Science, Tenerife, Canary Islands, Spain, Deceber 16-18, 2006 183 Qualitative Modelling of Tie Series Using Self-Organizing Maps:

More information

Physics 215 Winter The Density Matrix

Physics 215 Winter The Density Matrix Physics 215 Winter 2018 The Density Matrix The quantu space of states is a Hilbert space H. Any state vector ψ H is a pure state. Since any linear cobination of eleents of H are also an eleent of H, it

More information

THERMAL ENDURANCE OF UNREINFORCED UNSATURATED POLYESTERS AND VINYL ESTER RESINS = (1) ln = COMPOSITES & POLYCON 2009

THERMAL ENDURANCE OF UNREINFORCED UNSATURATED POLYESTERS AND VINYL ESTER RESINS = (1) ln = COMPOSITES & POLYCON 2009 Aerican Coposites Manufacturers Association January 15-17, 29 Tapa, FL USA Abstract THERMAL ENDURANCE OF UNREINFORCED UNSATURATED POLYESTERS AND VINYL ESTER RESINS by Thore M. Klaveness, Reichhold AS In

More information

Kernel Methods and Support Vector Machines

Kernel Methods and Support Vector Machines Intelligent Systes: Reasoning and Recognition Jaes L. Crowley ENSIAG 2 / osig 1 Second Seester 2012/2013 Lesson 20 2 ay 2013 Kernel ethods and Support Vector achines Contents Kernel Functions...2 Quadratic

More information

Physics 139B Solutions to Homework Set 3 Fall 2009

Physics 139B Solutions to Homework Set 3 Fall 2009 Physics 139B Solutions to Hoework Set 3 Fall 009 1. Consider a particle of ass attached to a rigid assless rod of fixed length R whose other end is fixed at the origin. The rod is free to rotate about

More information

Classical systems in equilibrium

Classical systems in equilibrium 35 Classical systes in equilibriu Ideal gas Distinguishable particles Here we assue that every particle can be labeled by an index i... and distinguished fro any other particle by its label if not by any

More information

Non-Parametric Non-Line-of-Sight Identification 1

Non-Parametric Non-Line-of-Sight Identification 1 Non-Paraetric Non-Line-of-Sight Identification Sinan Gezici, Hisashi Kobayashi and H. Vincent Poor Departent of Electrical Engineering School of Engineering and Applied Science Princeton University, Princeton,

More information

Chaotic Coupled Map Lattices

Chaotic Coupled Map Lattices Chaotic Coupled Map Lattices Author: Dustin Keys Advisors: Dr. Robert Indik, Dr. Kevin Lin 1 Introduction When a syste of chaotic aps is coupled in a way that allows the to share inforation about each

More information

Synchronization in large directed networks of coupled phase oscillators

Synchronization in large directed networks of coupled phase oscillators CHAOS 16, 015107 2005 Synchronization in large directed networks of coupled phase oscillators Juan G. Restrepo a Institute for Research in Electronics and Applied Physics, University of Maryland, College

More information

Inspection; structural health monitoring; reliability; Bayesian analysis; updating; decision analysis; value of information

Inspection; structural health monitoring; reliability; Bayesian analysis; updating; decision analysis; value of information Cite as: Straub D. (2014). Value of inforation analysis with structural reliability ethods. Structural Safety, 49: 75-86. Value of Inforation Analysis with Structural Reliability Methods Daniel Straub

More information

are equal to zero, where, q = p 1. For each gene j, the pairwise null and alternative hypotheses are,

are equal to zero, where, q = p 1. For each gene j, the pairwise null and alternative hypotheses are, Page of 8 Suppleentary Materials: A ultiple testing procedure for ulti-diensional pairwise coparisons with application to gene expression studies Anjana Grandhi, Wenge Guo, Shyaal D. Peddada S Notations

More information

A note on the multiplication of sparse matrices

A note on the multiplication of sparse matrices Cent. Eur. J. Cop. Sci. 41) 2014 1-11 DOI: 10.2478/s13537-014-0201-x Central European Journal of Coputer Science A note on the ultiplication of sparse atrices Research Article Keivan Borna 12, Sohrab Aboozarkhani

More information

Chapter 12. Quantum gases Microcanonical ensemble

Chapter 12. Quantum gases Microcanonical ensemble Chapter 2 Quantu gases In classical statistical echanics, we evaluated therodynaic relations often for an ideal gas, which approxiates a real gas in the highly diluted liit. An iportant difference between

More information

Experimental Design For Model Discrimination And Precise Parameter Estimation In WDS Analysis

Experimental Design For Model Discrimination And Precise Parameter Estimation In WDS Analysis City University of New York (CUNY) CUNY Acadeic Works International Conference on Hydroinforatics 8-1-2014 Experiental Design For Model Discriination And Precise Paraeter Estiation In WDS Analysis Giovanna

More information

m potential kinetic forms of energy.

m potential kinetic forms of energy. Spring, Chapter : A. near the surface of the earth. The forces of gravity and an ideal spring are conservative forces. With only the forces of an ideal spring and gravity acting on a ass, energy F F will

More information

Chapter 4: Hypothesis of Diffusion-Limited Growth

Chapter 4: Hypothesis of Diffusion-Limited Growth Suary This section derives a useful equation to predict quantu dot size evolution under typical organoetallic synthesis conditions that are used to achieve narrow size distributions. Assuing diffusion-controlled

More information

Topic 5a Introduction to Curve Fitting & Linear Regression

Topic 5a Introduction to Curve Fitting & Linear Regression /7/08 Course Instructor Dr. Rayond C. Rup Oice: A 337 Phone: (95) 747 6958 E ail: rcrup@utep.edu opic 5a Introduction to Curve Fitting & Linear Regression EE 4386/530 Coputational ethods in EE Outline

More information

Combining Classifiers

Combining Classifiers Cobining Classifiers Generic ethods of generating and cobining ultiple classifiers Bagging Boosting References: Duda, Hart & Stork, pg 475-480. Hastie, Tibsharini, Friedan, pg 246-256 and Chapter 10. http://www.boosting.org/

More information

Fast Structural Similarity Search of Noncoding RNAs Based on Matched Filtering of Stem Patterns

Fast Structural Similarity Search of Noncoding RNAs Based on Matched Filtering of Stem Patterns Fast Structural Siilarity Search of Noncoding RNs Based on Matched Filtering of Ste Patterns Byung-Jun Yoon Dept. of Electrical Engineering alifornia Institute of Technology Pasadena, 91125, S Eail: bjyoon@caltech.edu

More information

Illustration of transition path theory on a collection of simple examples

Illustration of transition path theory on a collection of simple examples THE JOURNAL OF CHEMICAL PHYSICS 125, 084110 2006 Illustration of transition path theory on a collection of siple exaples Philipp Metzner a and Christof Schütte b Departent of Matheatics and Coputer Science,

More information

Sequence Analysis, WS 14/15, D. Huson & R. Neher (this part by D. Huson) February 5,

Sequence Analysis, WS 14/15, D. Huson & R. Neher (this part by D. Huson) February 5, Sequence Analysis, WS 14/15, D. Huson & R. Neher (this part by D. Huson) February 5, 2015 31 11 Motif Finding Sources for this section: Rouchka, 1997, A Brief Overview of Gibbs Sapling. J. Buhler, M. Topa:

More information

Graphical Models in Local, Asymmetric Multi-Agent Markov Decision Processes

Graphical Models in Local, Asymmetric Multi-Agent Markov Decision Processes Graphical Models in Local, Asyetric Multi-Agent Markov Decision Processes Ditri Dolgov and Edund Durfee Departent of Electrical Engineering and Coputer Science University of Michigan Ann Arbor, MI 48109

More information

12 Towards hydrodynamic equations J Nonlinear Dynamics II: Continuum Systems Lecture 12 Spring 2015

12 Towards hydrodynamic equations J Nonlinear Dynamics II: Continuum Systems Lecture 12 Spring 2015 18.354J Nonlinear Dynaics II: Continuu Systes Lecture 12 Spring 2015 12 Towards hydrodynaic equations The previous classes focussed on the continuu description of static (tie-independent) elastic systes.

More information

On the approximation of Feynman-Kac path integrals

On the approximation of Feynman-Kac path integrals On the approxiation of Feynan-Kac path integrals Stephen D. Bond, Brian B. Laird, and Benedict J. Leikuhler University of California, San Diego, Departents of Matheatics and Cheistry, La Jolla, CA 993,

More information

e-companion ONLY AVAILABLE IN ELECTRONIC FORM

e-companion ONLY AVAILABLE IN ELECTRONIC FORM OPERATIONS RESEARCH doi 10.1287/opre.1070.0427ec pp. ec1 ec5 e-copanion ONLY AVAILABLE IN ELECTRONIC FORM infors 07 INFORMS Electronic Copanion A Learning Approach for Interactive Marketing to a Custoer

More information

TEST OF HOMOGENEITY OF PARALLEL SAMPLES FROM LOGNORMAL POPULATIONS WITH UNEQUAL VARIANCES

TEST OF HOMOGENEITY OF PARALLEL SAMPLES FROM LOGNORMAL POPULATIONS WITH UNEQUAL VARIANCES TEST OF HOMOGENEITY OF PARALLEL SAMPLES FROM LOGNORMAL POPULATIONS WITH UNEQUAL VARIANCES S. E. Ahed, R. J. Tokins and A. I. Volodin Departent of Matheatics and Statistics University of Regina Regina,

More information

Compression and Predictive Distributions for Large Alphabet i.i.d and Markov models

Compression and Predictive Distributions for Large Alphabet i.i.d and Markov models 2014 IEEE International Syposiu on Inforation Theory Copression and Predictive Distributions for Large Alphabet i.i.d and Markov odels Xiao Yang Departent of Statistics Yale University New Haven, CT, 06511

More information

Research Article Rapidly-Converging Series Representations of a Mutual-Information Integral

Research Article Rapidly-Converging Series Representations of a Mutual-Information Integral International Scholarly Research Network ISRN Counications and Networking Volue 11, Article ID 5465, 6 pages doi:1.54/11/5465 Research Article Rapidly-Converging Series Representations of a Mutual-Inforation

More information

Lower Bounds for Quantized Matrix Completion

Lower Bounds for Quantized Matrix Completion Lower Bounds for Quantized Matrix Copletion Mary Wootters and Yaniv Plan Departent of Matheatics University of Michigan Ann Arbor, MI Eail: wootters, yplan}@uich.edu Mark A. Davenport School of Elec. &

More information

Ensemble Based on Data Envelopment Analysis

Ensemble Based on Data Envelopment Analysis Enseble Based on Data Envelopent Analysis So Young Sohn & Hong Choi Departent of Coputer Science & Industrial Systes Engineering, Yonsei University, Seoul, Korea Tel) 82-2-223-404, Fax) 82-2- 364-7807

More information

IN A SENSE, every material is a composite, even if the

IN A SENSE, every material is a composite, even if the IEEE TRANSACTIONS ON MICROWAVE THEORY AND TECHNIQUES, VOL. 47, NO. 11, NOVEMBER 1999 2075 Magnetis fro Conductors and Enhanced Nonlinear Phenoena J. B. Pendry, A. J. Holden, D. J. Robbins, and W. J. Stewart,

More information

AP Physics Thermodynamics Wrap-up

AP Physics Thermodynamics Wrap-up AP Physics herodynaics Wrap-up Here are your basic equations for therodynaics. here s a bunch of the. 3 his equation converts teperature fro Fahrenheit to Celsius. his is the rate of heat transfer for

More information

Fairness via priority scheduling

Fairness via priority scheduling Fairness via priority scheduling Veeraruna Kavitha, N Heachandra and Debayan Das IEOR, IIT Bobay, Mubai, 400076, India vavitha,nh,debayan}@iitbacin Abstract In the context of ulti-agent resource allocation

More information

The Wilson Model of Cortical Neurons Richard B. Wells

The Wilson Model of Cortical Neurons Richard B. Wells The Wilson Model of Cortical Neurons Richard B. Wells I. Refineents on the odgkin-uxley Model The years since odgkin s and uxley s pioneering work have produced a nuber of derivative odgkin-uxley-like

More information

The Transactional Nature of Quantum Information

The Transactional Nature of Quantum Information The Transactional Nature of Quantu Inforation Subhash Kak Departent of Coputer Science Oklahoa State University Stillwater, OK 7478 ABSTRACT Inforation, in its counications sense, is a transactional property.

More information

An analytical relation between relaxation time spectrum and molecular weight distribution

An analytical relation between relaxation time spectrum and molecular weight distribution An analytical relation between relaxation tie spectru and olecular weight distribution Wolfgang Thi, Christian Friedrich, a) Michael Marth, and Josef Honerkap b) Freiburger Materialforschungszentru, Stefan-Meier-Straße

More information

Upper bound on false alarm rate for landmine detection and classification using syntactic pattern recognition

Upper bound on false alarm rate for landmine detection and classification using syntactic pattern recognition Upper bound on false alar rate for landine detection and classification using syntactic pattern recognition Ahed O. Nasif, Brian L. Mark, Kenneth J. Hintz, and Nathalia Peixoto Dept. of Electrical and

More information

Machine Learning Basics: Estimators, Bias and Variance

Machine Learning Basics: Estimators, Bias and Variance Machine Learning Basics: Estiators, Bias and Variance Sargur N. srihari@cedar.buffalo.edu This is part of lecture slides on Deep Learning: http://www.cedar.buffalo.edu/~srihari/cse676 1 Topics in Basics

More information

About the definition of parameters and regimes of active two-port networks with variable loads on the basis of projective geometry

About the definition of parameters and regimes of active two-port networks with variable loads on the basis of projective geometry About the definition of paraeters and regies of active two-port networks with variable loads on the basis of projective geoetry PENN ALEXANDR nstitute of Electronic Engineering and Nanotechnologies "D

More information

A Low-Complexity Congestion Control and Scheduling Algorithm for Multihop Wireless Networks with Order-Optimal Per-Flow Delay

A Low-Complexity Congestion Control and Scheduling Algorithm for Multihop Wireless Networks with Order-Optimal Per-Flow Delay A Low-Coplexity Congestion Control and Scheduling Algorith for Multihop Wireless Networks with Order-Optial Per-Flow Delay Po-Kai Huang, Xiaojun Lin, and Chih-Chun Wang School of Electrical and Coputer

More information

Recovering Data from Underdetermined Quadratic Measurements (CS 229a Project: Final Writeup)

Recovering Data from Underdetermined Quadratic Measurements (CS 229a Project: Final Writeup) Recovering Data fro Underdeterined Quadratic Measureents (CS 229a Project: Final Writeup) Mahdi Soltanolkotabi Deceber 16, 2011 1 Introduction Data that arises fro engineering applications often contains

More information

ln P 1 saturation = T ln P 2 saturation = T

ln P 1 saturation = T ln P 2 saturation = T More Tutorial at www.littledubdoctor.co Physical Cheistry Answer each question in the space provided; use back of page if extra space is needed. Answer questions so the grader can READILY understand your

More information

On the Analysis of the Quantum-inspired Evolutionary Algorithm with a Single Individual

On the Analysis of the Quantum-inspired Evolutionary Algorithm with a Single Individual 6 IEEE Congress on Evolutionary Coputation Sheraton Vancouver Wall Centre Hotel, Vancouver, BC, Canada July 16-1, 6 On the Analysis of the Quantu-inspired Evolutionary Algorith with a Single Individual

More information

1 Proof of learning bounds

1 Proof of learning bounds COS 511: Theoretical Machine Learning Lecturer: Rob Schapire Lecture #4 Scribe: Akshay Mittal February 13, 2013 1 Proof of learning bounds For intuition of the following theore, suppose there exists a

More information

A Note on the Applied Use of MDL Approximations

A Note on the Applied Use of MDL Approximations A Note on the Applied Use of MDL Approxiations Daniel J. Navarro Departent of Psychology Ohio State University Abstract An applied proble is discussed in which two nested psychological odels of retention

More information

COS 424: Interacting with Data. Written Exercises

COS 424: Interacting with Data. Written Exercises COS 424: Interacting with Data Hoework #4 Spring 2007 Regression Due: Wednesday, April 18 Written Exercises See the course website for iportant inforation about collaboration and late policies, as well

More information

Lecture #8-3 Oscillations, Simple Harmonic Motion

Lecture #8-3 Oscillations, Simple Harmonic Motion Lecture #8-3 Oscillations Siple Haronic Motion So far we have considered two basic types of otion: translation and rotation. But these are not the only two types of otion we can observe in every day life.

More information

Soft Computing Techniques Help Assign Weights to Different Factors in Vulnerability Analysis

Soft Computing Techniques Help Assign Weights to Different Factors in Vulnerability Analysis Soft Coputing Techniques Help Assign Weights to Different Factors in Vulnerability Analysis Beverly Rivera 1,2, Irbis Gallegos 1, and Vladik Kreinovich 2 1 Regional Cyber and Energy Security Center RCES

More information

KINETIC THEORY. Contents

KINETIC THEORY. Contents KINETIC THEORY This brief paper on inetic theory deals with three topics: the hypotheses on which the theory is founded, the calculation of pressure and absolute teperature of an ideal gas and the principal

More information

Using a De-Convolution Window for Operating Modal Analysis

Using a De-Convolution Window for Operating Modal Analysis Using a De-Convolution Window for Operating Modal Analysis Brian Schwarz Vibrant Technology, Inc. Scotts Valley, CA Mark Richardson Vibrant Technology, Inc. Scotts Valley, CA Abstract Operating Modal Analysis

More information

A Simple Regression Problem

A Simple Regression Problem A Siple Regression Proble R. M. Castro March 23, 2 In this brief note a siple regression proble will be introduced, illustrating clearly the bias-variance tradeoff. Let Y i f(x i ) + W i, i,..., n, where

More information

Intelligent Systems: Reasoning and Recognition. Artificial Neural Networks

Intelligent Systems: Reasoning and Recognition. Artificial Neural Networks Intelligent Systes: Reasoning and Recognition Jaes L. Crowley MOSIG M1 Winter Seester 2018 Lesson 7 1 March 2018 Outline Artificial Neural Networks Notation...2 Introduction...3 Key Equations... 3 Artificial

More information

Meta-Analytic Interval Estimation for Bivariate Correlations

Meta-Analytic Interval Estimation for Bivariate Correlations Psychological Methods 2008, Vol. 13, No. 3, 173 181 Copyright 2008 by the Aerican Psychological Association 1082-989X/08/$12.00 DOI: 10.1037/a0012868 Meta-Analytic Interval Estiation for Bivariate Correlations

More information

Example A1: Preparation of a Calibration Standard

Example A1: Preparation of a Calibration Standard Suary Goal A calibration standard is prepared fro a high purity etal (cadiu) with a concentration of ca.1000 g l -1. Measureent procedure The surface of the high purity etal is cleaned to reove any etal-oxide

More information

Pattern Recognition and Machine Learning. Artificial Neural networks

Pattern Recognition and Machine Learning. Artificial Neural networks Pattern Recognition and Machine Learning Jaes L. Crowley ENSIMAG 3 - MMIS Fall Seester 2016 Lessons 7 14 Dec 2016 Outline Artificial Neural networks Notation...2 1. Introduction...3... 3 The Artificial

More information

26 Impulse and Momentum

26 Impulse and Momentum 6 Ipulse and Moentu First, a Few More Words on Work and Energy, for Coparison Purposes Iagine a gigantic air hockey table with a whole bunch of pucks of various asses, none of which experiences any friction

More information

1 Bounding the Margin

1 Bounding the Margin COS 511: Theoretical Machine Learning Lecturer: Rob Schapire Lecture #12 Scribe: Jian Min Si March 14, 2013 1 Bounding the Margin We are continuing the proof of a bound on the generalization error of AdaBoost

More information

School of Chemistry, University of Nottingham, University Park, Nottingham, NG7 2RD, United Kingdom

School of Chemistry, University of Nottingham, University Park, Nottingham, NG7 2RD, United Kingdom pubs.acs.org/jpcb Calculation of Partition Functions and Free Energies of a Binary Mixture Using the Energy Partitioning Method: Application to Carbon Dioxide and Methane Haina Do,* Jonathan D. Hirst,

More information

SIMULATION OF THE HEATING STEP WITHIN THE THERMOFORMING PROCESS USING THE FINITE DIFFERENCE METHOD

SIMULATION OF THE HEATING STEP WITHIN THE THERMOFORMING PROCESS USING THE FINITE DIFFERENCE METHOD SIMULATION OF THE HEATING STEP WITHIN THE THERMOFORMING PROCESS USING THE FINITE DIFFERENCE METHOD A. Fertschej 1 *, G.R. Langecker 1 University of Leoben artur.fertschej@u-leoben.at; Franz-Josef Strasse

More information

UNCERTAINTIES IN THE APPLICATION OF ATMOSPHERIC AND ALTITUDE CORRECTIONS AS RECOMMENDED IN IEC STANDARDS

UNCERTAINTIES IN THE APPLICATION OF ATMOSPHERIC AND ALTITUDE CORRECTIONS AS RECOMMENDED IN IEC STANDARDS Paper Published on the16th International Syposiu on High Voltage Engineering, Cape Town, South Africa, 2009 UNCERTAINTIES IN THE APPLICATION OF ATMOSPHERIC AND ALTITUDE CORRECTIONS AS RECOMMENDED IN IEC

More information

Analyzing Simulation Results

Analyzing Simulation Results Analyzing Siulation Results Dr. John Mellor-Cruey Departent of Coputer Science Rice University johnc@cs.rice.edu COMP 528 Lecture 20 31 March 2005 Topics for Today Model verification Model validation Transient

More information