Roles for the Two-hybrid System in Exploration of the Yeast Protein Interactome*

Size: px
Start display at page:

Download "Roles for the Two-hybrid System in Exploration of the Yeast Protein Interactome*"

Transcription

1 Reviews/Perspectives Roles for the Two-hybrid System in Exploration of the Yeast Protein Interactome* Takashi Ito, Kazuhisa Ota, Hiroyuki Kubota, Yoshihiro Yamaguchi, Tomoko Chiba, Kazumi Sakuraba, and Mikio Yoshida ** Comprehensive analysis of protein-protein interactions is a challenging endeavor of functional proteomics and has been best explored in the budding yeast. The yeast protein interactome analysis was achieved first by using the yeast two-hybrid system in a proteome-wide scale and next by large-scale mass spectrometric analysis of affinity-purified protein complexes. While these interaction data have led to a number of novel findings and the emergence of a single huge network containing thousands of proteins, they suffer many false signals and fall short of grasping the entire interactome. Thus, continuous efforts are necessary in both bioinformatics and experimentation to fully exploit these data and to proceed another step forward to the goal. Computational tools to integrate existing biological knowledge buried in literature and various functional genomic data with the interactome data are required for biological interpretation of the huge protein interaction network. Novel experimental methods have to be developed to detect weak, transient interactions involving low abundance proteins as well as to obtain clues to the biological role for each interaction. Since the yeast two-hybrid system can be used for the mapping of the interaction domains and the isolation of interactiondefective mutants, it would serve as a technical basis for the latter purpose, thereby playing another important role in the next phase of protein interactome research. Molecular & Cellular Proteomics 1: , WHY PROTEIN INTERACTOME? Proteins rarely work by themselves. They almost always interact with other biomolecules to execute their functions. Networks of such biomolecular interactions constitute the basis for life, and those occurring between proteins play extremely important roles. Thus deciphering of entire protein interaction networks or protein interactome is vital to our From the Division of Genome Biology, Cancer Research Institute, Kanazawa University, Kanazawa , Japan, the Institute for Bioinformatics Research and Development (BIRD), Japan Science and Technology Corporation (JST), Tokyo , Japan, and **INTEC Web and Genome Informatics Corporation, Tokyo , Japan Received, September 3, 2002, and in revised form, September 16, 2002 Published, MCP Papers in Press, September 16, 2002, DOI /mcp.R MCP200 understanding of life as a system of molecules. Although identification of novel protein interactions is an integral part of conventional or target-oriented studies, these are severely skewed to the neighbors of proteins with current popularity. It is thus necessary to perform a complementary, hypothesisfree approach in protein interactome analysis to obtain more unbiased representation. Such an analysis would also help us guess the functions of numerous novel proteins revealed by the genome projects, which are currently lacking any clue as to their specific functions. If a novel protein is found to bind a well characterized one, the former is likely involved in the same functional category as the latter (i.e. guilt by association). At the same time, the novel protein indicates a previously unrecognized aspect of the molecular pathways involving the known one, thereby expanding our knowledge on that pathway. Pioneering works toward this goal were first undertaken using the two-hybrid system to analyze the yeast protein interactome. THE PRINCIPLE OF THE YEAST TWO-HYBRID SYSTEM The yeast two-hybrid (Y2H) 1 system was developed by Stanley Fields (1) on the basis of modular domain structure of the transcription factor GAL4, comprised of a DNA binding domain and transcription activation domain. In the Y2H system, one of the proteins of one s interest, termed X, is expressed as a hybrid protein with the GAL4 DNA binding domain, whereas the other, termed Y, is expressed with the activation domain. If X and Y interact, the two hybrid proteins, often coined as bait and prey, respectively, are assembled onto GAL4 binding sites in the yeast genome. The assembly functionally reconstitutes the GAL4 transcription factor and induces the expression of reporter genes integrated in the region downstream of the GAL4 binding sites. The Y2H system enables highly sensitive detection of protein-protein interactions in vivo without handling any protein molecules. It also allows one to screen a library of activation domain fusions or preys for the binding partners of one s favorite protein expressed as a DNA binding domain fusion or bait, and it can be used to pinpoint protein regions mediating the interactions. On the other hand, the Y2H system has limitations. First, in principle, it cannot detect interactions requiring three or more 1 The abbreviations used are: Y2H, yeast two-hybrid; ORF, open reading frame; IST, interaction sequence tag; MS, mass spectrometric or mass spectrometry by The American Society for Biochemistry and Molecular Biology, Inc. Molecular & Cellular Proteomics This paper is available on line at

2 proteins and those depending on posttranslational modifications. However, note that, when applied to the budding yeast itself, it can occasionally detect interactions involving three proteins or posttranslational modification by the aid of endogenous third proteins or modifying enzymes. Second, the Y2H system is not suitable for the detection of interactions involving membrane proteins, although a substantial number of such interactions have been detected via an unexplained mechanism. Finally, the Y2H interaction does not guarantee that the inferred interactions are of physiological relevance. Despite these and other limitations, the power of the Y2H system is so tremendous that it is now established as a standard technique in molecular biology. The Y2H system has been successfully used to examine an interaction between the two proteins of one s interest and also to screen for unknown binding partners of one s favorite protein. It can be, in principle, used in a more comprehensive fashion to examine all possible binary combinations between the proteins encoded by any single genome. Three groups (ours, CuraGen s, and Fields s) launched such ambitious projects using the budding yeast as the target. GENOME-WIDE Y2H ANALYSIS OF THE BUDDING YEAST We amplified all open reading frames (ORFs) of the budding yeast by means of PCR and cloned them into two types of vectors, one expressing each ORF as bait and the other as prey (2). The bait and prey plasmids were introduced into Y2H host strains one by one: the former and the latter were transformed to Y2H hosts bearing mating type a and, respectively. Bearing opposite mating types, bait clone and prey clone can mate to form diploid cells. Consequently each diploid cell has a unique combination of bait and prey. If they interact, the reporter genes are activated to allow the cells to survive the selection. In other words, each survivor should bear a pair of mutually interacting bait and prey, which can be revealed by tag sequencing of the cohabiting plasmids to generate an interaction sequence tag (IST). These ISTs can then be used for the data base search to decode the inferred protein-protein interactions. We prepared pools for screening, each containing 96 bait or prey clones, performed the mating-based screening described above in all possible combinations between the pools, and finally revealed 4,549 independent two-hybrid interactions (3). Of these, 841 were detected more than three times and were assumed to be of high relevance. Hence we call these interactions as our core data. Notably more than 80% of these interactions were the ones never described before. A similar IST project was conducted by CuraGen (4), who screened a pool of 6,000 preys with each unique bait. They revealed 691 interactions in total, most of which were also novel. Comparison between the two data sets revealed an unexpectedly small overlap: they share 141 interactions, which correspond to 10% of the total independent interactions (Fig. 1) (3). There would be a number of plausible reasons for FIG. 1.Overlap between interactome data. The upper and lower Venn diagrams indicate the overlap between the two data sets of high-throughput two-hybrid analysis and the overlap between the two large-scale co-precipitation/ms analyses, respectively. The two-hybrid data were taken from the high-throughput approach of Uetz et al. (4) and the core data of our analysis (3). The MS data were taken from supplementary Table S1 of Gavin et al. (7) and Table S2 of Ho et al. (8). the small overlap. The systems used by the two groups were different: we used multicopy vectors in the host bearing multiple reporter genes, whereas they used single-copy vectors but used only a single reporter gene. Since both groups PCR-amplified the ORFs, some would inevitably bear mutations that affect interactions. Although both groups pooled clones, the screen does not seem saturated: two-thirds of our 4,549 interactions (3), and one-third of CuraGen s 691 interactions were identified only once (4). Of course, any twohybrid screen contains false signals (see below). These and other unidentified factors are assumed to contribute to the small overlap observed between the two IST projects. The group led by Fields (4) took a different approach in which an array of 6,000 prey clones was mated with each unique bait strain, and the diploid cells formed were replicaplated onto the selection medium to decode interactions from the coordinates of the survivors. This approach is rather slow and tedious but is highly sensitive and free from the problem of unsaturated screening. They examined 142 baits to reveal 281 interactions, which again failed to largely overlap with those by IST approaches. FALSE POSITIVES One of the major concerns for the Y2H system is so-called false positives, which actually include two different categories, namely technical and biological ones. The technical false 562 Molecular & Cellular Proteomics 1.8

3 positive is an apparent two-hybrid interaction that is not based on the assembly of two hybrid proteins: expression of some baits or preys seems to induce unexplained events leading to artificial induction of reporter genes. Use of multiple reporter genes driven by different GAL4-responsive promoters, as we did in our project (3), was reported to minimize such technical false positives. The biological false positive means a bona fide two-hybrid interaction with no physiological relevance and is discussed below. What fraction of the genome-wide Y2H data is biologically relevant? To estimate the reliability of our data, we inspected a subset of our core data composed of 415 interactions because these interactions occur between two known proteins and hence can be, more or less, evaluated for their biological relevance. This analysis indicated that 50% of the interactions can be assumed to be biologically relevant (3). More recently an interesting method was developed to evaluate the validity of interaction data based on the similarity of the gene expression profile between the genes for the bait and prey displaying a two-hybrid interaction (5). The analysis of our data by this method indicates that interactions with more than three IST hits, or our core data, are expected to be 60% reliable (5). While these two independent estimates may illustrate the overall quality of genome-wide two-hybrid data, users of these data still have to evaluate each interaction of their interest. Even in our non-core data, one does find a number of intriguing interactions. On the other hand, those with high IST hits may well contain a substantial number of biologically meaningless interactions. Therefore, bioinformatics tools to assist such evaluation are critical to fully exploit these genome-wide data (see below). FALSE NEGATIVES It should be also noted that the genome-wide Y2H projects missed most (as much as 90%) of known interactions (i.e. false negatives) (3). Recently an interesting result was reported by Vidal s group (6), who tried to recapitulate twohybrid interactions reported by the three groups (i.e. Ito, CuraGen, and Fields). They amplified yeast ORFs by themselves, cloned them into their own two-hybrid vectors, and examined the interactions in their own Y2H system. Of the 72 interactions examined, 19 (26%) were recapitulated in their study. We further analyzed their data by examining the origin of each interaction examined in their study. The analysis revealed that 9 of the 19 interactions reproduced were originally detected by at least two of the three groups, whereas more than 90% of the interactions that they failed to recapitulate were those detected only by a single group. Although such irreproducible interactions may well be technical false positives discussed above, some interactions seem to be sensitive to subtle difference in the constructs and Y2H system used, whereas others are largely insensitive and easily reproduced by anyone. Such a tendency may become more prominent when using full-length ORFs in the Y2H system because it is known that full-length proteins often show much weaker signals than the appropriately trimmed protein regions containing the interaction domains. These features are inherent to the Y2H system and seem to have contributed to the small overlap observed between the different genome-wide screen data. Y2H AND OTHER INTERACTOME DATA Two impressive studies were published to report largescale mass spectrometric (MS) analyses of affinity-purified yeast protein complexes to demonstrate the power of proteomics (7, 8). They, however, also illustrate the difficulty and limitations of the approach. For instance, Gavin et al. (7) and Ho et al. (8) purified 589 and 493 complexes, respectively, of which 93 were purified by both groups using the same proteins as baits. Comparison of MS analysis on these 93 complexes between the two groups revealed that 48 complexes (52%) contain at least one protein detected by both groups, whereas the other 45 (48%) failed to share any. With respect to the entire proteins detected in these complexes, Gavin et al. (7) and Ho et al. (8) revealed 577 and 877, respectively. The overlap between these proteins was only 133, thereby comprising 10% of the 1,321 proteins collectively reported by the two groups (Fig. 1). Even in the 48 complexes described above, the proteins detected in both studies comprise 14% of the total. Thus the rate of overlap is similar to the one observed in the two-hybrid projects. Although the strategies of the two groups are different and the comparison at the level of protein nexuses revealed by several different baits improves the overlap, it should be noted that even these proteomic studies contain substantial false signals. It is also interesting to note that the interactions revealed by these approaches are somewhat complementary to those by the Y2H system. The Y2H projects essentially detect binary interactions including those of rather weak or transient nature. On the other hand, the MS studies reveal more complex interactions, which are inevitably biased toward those with high abundance and stability (9). Novel analytical platforms are thus required for the detection of weak or fast interactions by means of MS. One of the promising approaches would be an integration of MS with biomolecular interaction analysis based on the principle of surface plasmon resonance (10). Intriguing features of these data sets are also revealed by the integration of gene expression data (11). The data set by Gavin et al. (7) based on genomic integration of tandem affinity purification tags displays strong co-expression among the genes encoding the identified proteins. In contrast, those by Ho et al. (8) based on episomal overexpression of epitopetagged proteins shows rather weak co-expression patterns similar to those of Y2H projects. Recent analysis of accumulated protein interaction data provides further detail on the various aspects of both Y2H and MS data sets (9). Molecular & Cellular Proteomics

4 HUGE PROTEIN INTERACTION NETWORK The Y2H projects led to the explosion of yeast protein interaction data, and integration of binary interaction data in silico has generated a single huge nexus of proteins including up to 4,000 proteins (3, 12, 13). The additional interaction data by co-precipitation/ms studies would further expand the largest network. The entire network is obviously too complex for the human brain to understand. We need a method to extract biologically meaningful clusters or subnetworks from the huge nexus to formulate a novel hypothesis for further experimentation. However, one should note that the network has become too complex due to the lack of spatial and temporal resolution. For instance, while RNA polymerase I, II, and III are distinct entities, they would be linked into a single huge nexus in silico because of common subunits shared by the three. Thus, we have to integrate existing knowledge on yeast proteins with the massive interactome data. In addition, we should evaluate the relevance of each interaction provided by any large-scale projects. Ideally each edge of the complex graph should be weighed to help the evaluation of each interaction. As discussed above, the number of IST hits may serve as a good measure for the reliability of Y2H data (3, 5). The independent lines of evidence for the interaction, such as coincidence between Y2H and MS data, presence of genetic interaction, similar mutant phenotype, shared subcellular localization, and co-expression of the genes, would be more important. Even provided with these valuable data, construction of the protein interaction network model is still a tedious task that requires many trials and errors. We thus developed a bioinformatics tool to visualize and help one estimate the structure of networks by referring to existing knowledge and other data (Fig. 2) (3, 14). Such tools would become critical to fully exploit interactome data as well as a plethora of other functional genomic data. TOP-DOWN APPROACH TO INTERPRET HUGE INTERACTION NETWORK The approach described above is a knowledge-based, bottom-up approach to construct biologically meaningful subnetworks around the protein of one s interest. However, a totally different approach might be undertaken to interpret the huge network. Recent studies have revealed a scale-free nature of various complex networks of both artificial and natural origins (15). These networks are composed of a small number of highly connected nodes and numerous poorly connected ones, and the number of nodes and the interaction partners of each node follow a power-law distribution (15). The yeast protein interaction network revealed by the interactome analysis shares a similar structure (16). Intriguingly, a highly biased distribution of connectivity has proved to ensure a robustness of the network against random perturbations. On the other hand, such networks are extremely vulnerable to targeted attacks to the highly connected nodes or hubs. Consistent with this, such hubs of the huge protein interaction networks tend to be the products of essential genes (16). Thus, the protein interactome seems to share a basic design principle with other complex networks. It is also conceivable that highly connected nodes lacking apparent homologs in mammals may serve as good targets for antifungal drug development. The complex network has a heterogeneous organization with local clustering or communities, which may represent functional modules to be identified in the case of protein interactome. Recently an intriguing approach was developed to dissect complex network into communities based solely on the topology of the network and was successfully applied to biological and social networks (17). This approach calculates the shortest paths between every combination of the nodes in the complex graph and identifies the most frequently used one. The edge with the highest usage is just like the main traffic connecting one town to the other, and cutting the network at the edge can properly split the networks into two communities. This method can be applied to the dissection of protein interaction networks. Indeed an application of a similar idea to the complex protein network, which was constructed by the conserved co-occurrence of genes in operons, was reported to successfully split a single huge network into clusters composed of functionally more homogeneous proteins (18). FROM CATALOGING TO FUNCTIONAL INTERPRETATION The catalog of protein-protein interactions is still growing to increase the complexity of the largest network, which would be split by various bioinformatics approaches into clusters or individual functional modules (see above). Each cluster is represented as a group of proteins mutually connected by interaction arrows. However, each arrow has a unique biological meaning. Some activate the binding partners, and others suppress. Some are stable, and others are transient. Thus, to biologically understand these clusters, we have to assign additional information onto each edge. Although it can be extracted from literature or existing knowledge, most interactions are revealed by recent high-throughput methods and hence are not associated with functional data. Therefore, one of the next challenges in protein interactome analysis would be to develop strategies for systematic collection of functional data on each of the cataloged interactions. One key would be profiling of the interaction. It is quite informative to learn when and where an interaction occurs. Combinatorial use of recent proteomic techniques for protein complex purification and expression profiling will play a major role for this purpose (19 21). The set of yeast strains bearing tandem affinity purification-tagged ORFs would serve as a valuable resource to perform these analyses. Although the interaction profiling is of particular importance, it provides us with nothing more than an indirect hint for the role of an interaction. To unequivocally uncover it, one 564 Molecular & Cellular Proteomics 1.8

5 FIG. 2.Tools for analyzing protein interactome data. Analysis of protein interaction networks is substantially facilitated by the use of specialized bioinformatics tools such as WebGenNet (genome.c.kanazawa-u.ac.jp/ webgen/webgen.html). In this system, one can select and display the proteins of one s interest with their interaction partners (left windows). Note that interactions are indicated by arrows with different colors and thickness according to their origins and reliability (e.g. numbers of IST hits), respectively (left upper window). One can also retrieve information on each protein or node of the graph (right window). The system can be used for integrative analysis with other functional genomic data. For instance, red and green circles surrounding each node in this figure indicate the expression profile of the gene encoding each protein at a particular time point of the cell cycle. has to examine what happens if the interaction is specifically disrupted. To perform such an interaction targeting, one has to know the protein regions that mediate the interaction. Once such interaction domains are pinpointed, they can be overexpressed as dominant negatives to disrupt the cognate interaction between the endogenous proteins. Furthermore, they can be used for the isolation of interaction-defective mutants. The most versatile method for the mapping of interaction domains is obviously the Y2H system. Notably it can also be used as so-called reverse two-hybrid selection to select against interaction (22, 23). Using the reverse Y2H system, one can select interaction-defective alleles from a randomly mutagenized population. Once identified, responsible mutations can be easily introduced into the genome using a standard technique of yeast molecular genetics, and phenotypes of such interaction-defective mutants are expected to tell the biology of the interaction. A pitfall of the reverse Y2H system is the lack of discrimination between missense and nonsense mutations, the latter of which abolish all of the functions born by the regions C-terminal to the mutations and hence have to be avoided in interaction targeting. Similarly, missense mutations destabilizing the protein should be eliminated. To achieve this goal, we applied the dual bait Y2H method (24, 25) to guarantee that the introduced mutations induce neither truncation nor destabilization of the protein to be analyzed (26). This guaranteed reverse Y2H system is ideal for the identification of missense mutations suitable for interaction targeting to clarify the biological role for the interaction per se. Molecular & Cellular Proteomics

6 The applications described above exemplify the potential of the Y2H system as a tool for functional characterization of protein-protein interactions. It should be noted that even the interactions originally detected by other means can be similarly analyzed if they are successfully recapitulated by the Y2H system. However, the current Y2H system is prone to show false negative signals (see above). Hence the development of the Y2H system with low false negatives would be of particular significance in accelerating the functional analysis of cataloged protein-protein interactions to proceed into the next stage of protein interactome analysis. CONCLUSION The Y2H system has been a major player in the cataloging phase of protein interactome analysis. If proteomics intends not to stay as a mere cataloging effort but to proceed into biology or functional analyses, the Y2H system will again play a key role in the near feature. * This work was supported in part by research grants from the Ministry of Education, Culture, Sports, Science and Technology of Japan (MEXT), the Japan Society for the Promotion of Science (JSPS), and the New Energy and Industrial Technology Development Organization (NEDO). The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact. To whom correspondence should be addressed: Division of Genome Biology, Cancer Research Inst., Kanazawa University, 13-1 Takaramachi, Kanazawa , Japan. Tel.: ; Fax: ; titolab@kenroku.kanazawa-u.ac.jp. Recipient of the postdoctoral fellowship from JSPS. REFERENCES 1. Fields, S., and Song, O. (1989) A novel genetic system to detect proteinprotein interactions. Nature 340, Ito, T., Tashiro, K., Muta, S., Ozawa, R., Chiba, T., Nishizawa, M., Yamamoto, K., Kuhara, S., and Sakaki, Y. (2000) Toward a proteinprotein interaction map of the budding yeast: a comprehensive system to examine two-hybrid interactions in all possible combinations between the yeast proteins. Proc. Natl. Acad. Sci. U. S. A. 97, Ito, T., Chiba, T., Ozawa, R., Yoshida, M., Hattori, M., and Sakaki, Y. (2001) A comprehensive two-hybrid analysis to explore the yeast protein interactome. Proc. Natl. Acad. Sci. U. S. A. 98, Uetz, P., Giot, L., Cagney, G., Mansfield, T. A., Judson, R. S., Knight, J. R., Lockshon, D., Narayan, V., Srinivasan, M., Pochart, P., Qureshi-Emilli, A., Li, Y., Godwin, B., Conover, D., Kalbfleisch, T., Vijayadamodar, G., Yang, M., Johnston, M., Fields, S., and Rothberg, J. M. (2000) A comprehensive analysis of protein-protein interactions in Saccharomyces cerevisiae. Nature 403, Deane, C. M., Salwinski, L., Xenarios, I., and Eisenberg, D. (2002) Protein interactions: two methods for assessment of the reliability of high throughput observations. Mol. Cell. Proteomics 1, Matthews, L. R., Vaglio, P., Reboul, J., Ge, H., Davis, B. P., Garrels, J., Vincent, S., and Vidal, M. (2001) Identification of potential interaction networks using sequence-based searches for conserved protein-protein interactions or interologs. Genome Res. 11, Gavin, A.-C., Bosche, M., Krause, R., Grandi, P., Marzioch, M., Bauer, A., Schultz, J., Rick, J. M., Michon, A. M., Cruciat, C. M., Remor, M., Hofert, C., Schelder, M., Brajenovic, M., Ruffner, H., Merino, A., Klein, K., Hudak, M., Dickson, D., Rudi, T., Gnau, V., Bauch, A., Bastuck, S., Huhse, B., Leutwein, C., Heurtier, M. A., Copley, R. R., Edelmann, A., Querfurth, E., Rybin, V., Drewes, G., Raida, M., Bouwmeester, T., Bork, P., Seraphin, B., Kuster, B., Neubauer, G., and Superti-Furga, G. (2002) Functional organization of the yeast proteome by systematic analysis of protein complexes. Nature 415, Ho, Y., Gruhler, A., Heilbut, A., Bader, G. D., Moore, L., Adams, S. L., Millar, A., Taylor, P., Bennett, K., Boutilier, K., Yang, L., Wolting, C., Donaldson, I., Schandorff, S., Shewnarane, J., Vo, M., Taggart, J., Goudreault, M., Muskat, B., Alfarano, C., Dewar, D., Lin, Z., Michalickova, K., Willems, A. R., Sassi, H., Nielsen, P. A., Rasmussen, K. J., Andersen, J, R., Johansen, L. E., Hansen, L. H., Jespersen, H., Podtelejnikov, A., Nielsen, E., Crawford, J., Poulsen, V., Sorensen, B. D., Matthiesen, J., Hendrickson, R. C., Gleeson, F., Pawson, T., Moran, M. F., Durocher, D., Mann, M., Hogue, C. W., Figeys, D., and Tyers, M. (2002) Systematic identification of protein complexes in Saccharomyces cerevisiae by mass spectrometry. Nature 415, von Mering, C., Krause, R., Snel, B., Cornell, M., Oliver, S. G., Fields, S., and Bork, P. (2002) Comparative assessment of large-scale data sets of protein-protein interactions. Nature 417, Natsume, T., Nakayama, H., and Isobe, T. (2001) BIA-MS-MS: biomolecular interaction analysis for functional proteomics. Trends Biotechnol. 19, S28 S Kemmeren, P., van Berkum, N. L., Vilo, J., Bijma, T., Donders, R., Brazma, A., and Holstege, F. C. (2002) Protein interaction verification and functional annotation by integrated analysis of genome-scale data. Mol. Cell 9, Fellenberg, M., Albermann, K., Zollner, A., Mewes, H. W., and Hani, J. (2000) Integrative analysis of protein interaction data. Proc. Int. Conf. Intell. Syst. Mol. Biol. 8, Schwikowski, B., Uetz, P., and Fields, S. (2000) A network of proteinprotein interactions in yeast. Nat. Biotechnol. 18, Ito, T., Chiba, T., and Yoshida, M. (2001) Exploring the protein interactome using comprehensive two-hybrid projects. Trends Biotechnol. 19, S23 S Strogatz, S. H. (2001) Exploring complex networks. Nature 410, Jeong, H., Mason, S. P., Barabási, A.-L., and Oltvai, Z. N. (2001) Lethality and centrality in protein networks. Nature 411, Girvan, M., and Newman, M. E. (2002) Community structure in social and biological networks. Proc. Natl. Acad. Sci. U. S. A. 99, Snel, B., Bork, P., and Huynen, M. A. (2002) The identification of functional modules from the genomic association of genes. Proc. Natl. Acad. Sci. U. S. A. 99, Rigaout, G., Shevchenko, A., Rutz, B., Wilm, M., Mann, M., and Seraphin, B. (1999) A generic protein purification method for protein complex characterization and proteome exploration. Nat. Biotechnol. 17, Oda, Y., Huang, K., Cross, F. R., Cowburn, D., and Chait, B. T. (1999) Accurate quantitation of protein expression and site-specific phosphorylation. Proc. Natl. Acad. Sci. U. S. A. 96, Gypi, S. P., Rist, B., Gerber, S. A., Turecek, F., Gelb, M. H., and Aebersold, R. (1999) Quantitative analysis of complex protein mixtures using isotope-coded affinity tags. Nat. Biotechnol. 17, Le Dourain, B., Pierrat, B., vom Baur, E., Chambon, P., and Losson, R. (1995) A new version of the two-hybrid assay for detection of proteinprotein interactions. Nucleic Acids Res. 23, Vidal, M., Brachman, R. K., Fattaey, A., Harlow, E., and Boeke, J. D. (1996) Reverse two-hybrid and one-hybrid systems to detect dissociation of protein-protein and DNA-protein interactions. Proc. Natl. Acad. Sci. U. S. A. 93, Inouye, C., Dhillon, N., Durfee, T., Zambryski, P. C., and Thorner, J. (1997) Mutational analysis of STE5 in the yeast Saccharomyces cerevisiae: application of a differential interaction trap assay for examining proteinprotein interactions. Genetics 147, Serebriiskii, I. G., Mitina, O., Chernoff, J., and Golemis, E. (2001) Twohybrid dual bait system to discriminate specificity of protein interactions in small GTPases. Methods Enzymol. 332, Kubota, H., Ota, K., Sakaki, Y., and Ito, T. (2001) Budding yeast GCN1 binds the GI domain to activate the eif2 kinase GCN2. J. Biol. Chem. 276, Molecular & Cellular Proteomics 1.8

Physical and Functional Modularity of the Protein Network in Yeast*

Physical and Functional Modularity of the Protein Network in Yeast* Research Physical and Functional Modularity of the Protein Network in Yeast* Thomas Wilhelm, Heinz-Peter Nasheuer, and Sui Huang While protein-protein interactions have been studied largely as a network

More information

Lecture 10: May 19, High-Throughput technologies for measuring proteinprotein

Lecture 10: May 19, High-Throughput technologies for measuring proteinprotein Analysis of Gene Expression Data Spring Semester, 2005 Lecture 10: May 19, 2005 Lecturer: Roded Sharan Scribe: Daniela Raijman and Igor Ulitsky 10.1 Protein Interaction Networks In the past we have discussed

More information

A Method for Assessing the Statistical Significance of Mass Spectrometry-Based Protein Identifications Using General Scoring Schemes

A Method for Assessing the Statistical Significance of Mass Spectrometry-Based Protein Identifications Using General Scoring Schemes Anal. Chem. 2003, 75, 768-774 A Method for Assessing the Statistical Significance of Mass Spectrometry-Based Protein Identifications Using General Scoring Schemes David Fenyo1 and Ronald C. Beavis* Genomic

More information

Biological Pathway Completion Using Network Motifs and Random Walks on Graphs

Biological Pathway Completion Using Network Motifs and Random Walks on Graphs Biological Pathway Completion Using Network Motifs and Random Walks on Graphs Maya El Dayeh and Michael Hahsler Department of Computer Science and Engineering Southern Methodist University Dallas, TX,

More information

Systems biology and biological networks

Systems biology and biological networks Systems Biology Workshop Systems biology and biological networks Center for Biological Sequence Analysis Networks in electronics Radio kindly provided by Lazebnik, Cancer Cell, 2002 Systems Biology Workshop,

More information

Evidence for dynamically organized modularity in the yeast protein-protein interaction network

Evidence for dynamically organized modularity in the yeast protein-protein interaction network Evidence for dynamically organized modularity in the yeast protein-protein interaction network Sari Bombino Helsinki 27.3.2007 UNIVERSITY OF HELSINKI Department of Computer Science Seminar on Computational

More information

I-DIRT, A General Method for Distinguishing between Specific and Nonspecific Protein Interactions

I-DIRT, A General Method for Distinguishing between Specific and Nonspecific Protein Interactions I-DIRT, A General Method for Distinguishing between Specific and Nonspecific Protein Interactions Alan J. Tackett, Jeffrey A. DeGrasse, Matthew D. Sekedat, Marlene Oeffinger, Michael P. Rout, and Brian

More information

Cell biology traditionally identifies proteins based on their individual actions as catalysts, signaling

Cell biology traditionally identifies proteins based on their individual actions as catalysts, signaling Lethality and centrality in protein networks Cell biology traditionally identifies proteins based on their individual actions as catalysts, signaling molecules, or building blocks of cells and microorganisms.

More information

Divergence Pattern of Duplicate Genes in Protein-Protein Interactions Follows the Power Law

Divergence Pattern of Duplicate Genes in Protein-Protein Interactions Follows the Power Law Divergence Pattern of Duplicate Genes in Protein-Protein Interactions Follows the Power Law Ze Zhang,* Z. W. Luo,* Hirohisa Kishino,à and Mike J. Kearsey *School of Biosciences, University of Birmingham,

More information

A Two-Step Approach for Clustering Proteins based on Protein Interaction Profile

A Two-Step Approach for Clustering Proteins based on Protein Interaction Profile A Two-Step Approach for Clustering Proteins based on Protein Interaction Profile Pengjun Pei and Aidong Zhang Department of Computer Science and Engineering State University of New York at Buffalo Buffalo,

More information

Proteomics. Yeast two hybrid. Proteomics - PAGE techniques. Data obtained. What is it?

Proteomics. Yeast two hybrid. Proteomics - PAGE techniques. Data obtained. What is it? Proteomics What is it? Reveal protein interactions Protein profiling in a sample Yeast two hybrid screening High throughput 2D PAGE Automatic analysis of 2D Page Yeast two hybrid Use two mating strains

More information

Bioinformatics 2. Yeast two hybrid. Proteomics. Proteomics

Bioinformatics 2. Yeast two hybrid. Proteomics. Proteomics GENOME Bioinformatics 2 Proteomics protein-gene PROTEOME protein-protein METABOLISM Slide from http://www.nd.edu/~networks/ Citrate Cycle Bio-chemical reactions What is it? Proteomics Reveal protein Protein

More information

Constructing More Reliable Protein-Protein Interaction Maps

Constructing More Reliable Protein-Protein Interaction Maps Constructing More Reliable Protein-Protein Interaction Maps Limsoon Wong National University of Singapore email: wongls@comp.nus.edu.sg 1 Introduction Progress in high-throughput experimental techniques

More information

Lecture 4: Yeast as a model organism for functional and evolutionary genomics. Part II

Lecture 4: Yeast as a model organism for functional and evolutionary genomics. Part II Lecture 4: Yeast as a model organism for functional and evolutionary genomics Part II A brief review What have we discussed: Yeast genome in a glance Gene expression can tell us about yeast functions Transcriptional

More information

Network Biology: Understanding the cell s functional organization. Albert-László Barabási Zoltán N. Oltvai

Network Biology: Understanding the cell s functional organization. Albert-László Barabási Zoltán N. Oltvai Network Biology: Understanding the cell s functional organization Albert-László Barabási Zoltán N. Oltvai Outline: Evolutionary origin of scale-free networks Motifs, modules and hierarchical networks Network

More information

The geneticist s questions

The geneticist s questions The geneticist s questions a) What is consequence of reduced gene function? 1) gene knockout (deletion, RNAi) b) What is the consequence of increased gene function? 2) gene overexpression c) What does

More information

Biological Networks. Gavin Conant 163B ASRC

Biological Networks. Gavin Conant 163B ASRC Biological Networks Gavin Conant 163B ASRC conantg@missouri.edu 882-2931 Types of Network Regulatory Protein-interaction Metabolic Signaling Co-expressing General principle Relationship between genes Gene/protein/enzyme

More information

Towards Detecting Protein Complexes from Protein Interaction Data

Towards Detecting Protein Complexes from Protein Interaction Data Towards Detecting Protein Complexes from Protein Interaction Data Pengjun Pei 1 and Aidong Zhang 1 Department of Computer Science and Engineering State University of New York at Buffalo Buffalo NY 14260,

More information

Bayesian Inference of Protein and Domain Interactions Using The Sum-Product Algorithm

Bayesian Inference of Protein and Domain Interactions Using The Sum-Product Algorithm Bayesian Inference of Protein and Domain Interactions Using The Sum-Product Algorithm Marcin Sikora, Faruck Morcos, Daniel J. Costello, Jr., and Jesús A. Izaguirre Department of Electrical Engineering,

More information

Computational Analyses of High-Throughput Protein-Protein Interaction Data

Computational Analyses of High-Throughput Protein-Protein Interaction Data Current Protein and Peptide Science, 2003, 4, 159-181 159 Computational Analyses of High-Throughput Protein-Protein Interaction Data Yu Chen 1, 2 and Dong Xu 1, 2 * 1 Protein Informatics Group, Life Sciences

More information

Interaction Network Topologies

Interaction Network Topologies Proceedings of International Joint Conference on Neural Networks, Montreal, Canada, July 31 - August 4, 2005 Inferrng Protein-Protein Interactions Using Interaction Network Topologies Alberto Paccanarot*,

More information

GRAPH-THEORETICAL COMPARISON REVEALS STRUCTURAL DIVERGENCE OF HUMAN PROTEIN INTERACTION NETWORKS

GRAPH-THEORETICAL COMPARISON REVEALS STRUCTURAL DIVERGENCE OF HUMAN PROTEIN INTERACTION NETWORKS 141 GRAPH-THEORETICAL COMPARISON REVEALS STRUCTURAL DIVERGENCE OF HUMAN PROTEIN INTERACTION NETWORKS MATTHIAS E. FUTSCHIK 1 ANNA TSCHAUT 2 m.futschik@staff.hu-berlin.de tschaut@zedat.fu-berlin.de GAUTAM

More information

networks in molecular biology Wolfgang Huber

networks in molecular biology Wolfgang Huber networks in molecular biology Wolfgang Huber networks in molecular biology Regulatory networks: components = gene products interactions = regulation of transcription, translation, phosphorylation... Metabolic

More information

Lecture Notes for Fall Network Modeling. Ernest Fraenkel

Lecture Notes for Fall Network Modeling. Ernest Fraenkel Lecture Notes for 20.320 Fall 2012 Network Modeling Ernest Fraenkel In this lecture we will explore ways in which network models can help us to understand better biological data. We will explore how networks

More information

Approximation Algorithms and Hardness Results for Shortest Path Based Graph Orientations

Approximation Algorithms and Hardness Results for Shortest Path Based Graph Orientations Approximation Algorithms and Hardness Results for Shortest Path Based Graph Orientations Dima Blokh 1, Danny Segev 2, and Roded Sharan 1 1 Blavatnik School of Computer Science, Tel Aviv University, Tel

More information

Discrete Applied Mathematics. Integration of topological measures for eliminating non-specific interactions in protein interaction networks

Discrete Applied Mathematics. Integration of topological measures for eliminating non-specific interactions in protein interaction networks Discrete Applied Mathematics 157 (2009) 2416 2424 Contents lists available at ScienceDirect Discrete Applied Mathematics journal homepage: www.elsevier.com/locate/dam Integration of topological measures

More information

Types of biological networks. I. Intra-cellurar networks

Types of biological networks. I. Intra-cellurar networks Types of biological networks I. Intra-cellurar networks 1 Some intra-cellular networks: 1. Metabolic networks 2. Transcriptional regulation networks 3. Cell signalling networks 4. Protein-protein interaction

More information

Tiffany Samaroo MB&B 452a December 8, Take Home Final. Topic 1

Tiffany Samaroo MB&B 452a December 8, Take Home Final. Topic 1 Tiffany Samaroo MB&B 452a December 8, 2003 Take Home Final Topic 1 Prior to 1970, protein and DNA sequence alignment was limited to visual comparison. This was a very tedious process; even proteins with

More information

Cellular Biophysics SS Prof. Manfred Radmacher

Cellular Biophysics SS Prof. Manfred Radmacher SS 20007 Manfred Radmacher Ch. 12 Systems Biology Let's recall chemotaxis in Dictiostelium synthesis of camp excretion of camp external camp gradient detection cell polarity cell migration 2 Single cells

More information

Comparison of Protein-Protein Interaction Confidence Assignment Schemes

Comparison of Protein-Protein Interaction Confidence Assignment Schemes Comparison of Protein-Protein Interaction Confidence Assignment Schemes Silpa Suthram 1, Tomer Shlomi 2, Eytan Ruppin 2, Roded Sharan 2, and Trey Ideker 1 1 Department of Bioengineering, University of

More information

Discovering Temporal Relations in Molecular Pathways Using Protein-Protein Interactions

Discovering Temporal Relations in Molecular Pathways Using Protein-Protein Interactions Discovering Temporal Relations in Molecular Pathways Using Protein-Protein Interactions Martin Farach-Colton Department of Computer Science, Rutgers University Piscataway, NJ 08854, USA farach@cs.rutgers.edu

More information

Protein interactions: Two methods for assessment of the. reliability of high-throughput observations

Protein interactions: Two methods for assessment of the. reliability of high-throughput observations MCP Papers in Press. Published on April 4, 2002 as Manuscript M100037-MCP200 Protein interactions: Two methods for assessment of the reliability of high-throughput observations Charlotte M. Deane 1, Lukasz

More information

Basic modeling approaches for biological systems. Mahesh Bule

Basic modeling approaches for biological systems. Mahesh Bule Basic modeling approaches for biological systems Mahesh Bule The hierarchy of life from atoms to living organisms Modeling biological processes often requires accounting for action and feedback involving

More information

Robust Community Detection Methods with Resolution Parameter for Complex Detection in Protein Protein Interaction Networks

Robust Community Detection Methods with Resolution Parameter for Complex Detection in Protein Protein Interaction Networks Robust Community Detection Methods with Resolution Parameter for Complex Detection in Protein Protein Interaction Networks Twan van Laarhoven and Elena Marchiori Institute for Computing and Information

More information

Comparison of Human Protein-Protein Interaction Maps

Comparison of Human Protein-Protein Interaction Maps Comparison of Human Protein-Protein Interaction Maps Matthias E. Futschik 1, Gautam Chaurasia 1,2, Erich Wanker 2 and Hanspeter Herzel 1 1 Institute for Theoretical Biology, Charité, Humboldt-Universität

More information

arxiv: v1 [q-bio.mn] 5 Feb 2008

arxiv: v1 [q-bio.mn] 5 Feb 2008 Uncovering Biological Network Function via Graphlet Degree Signatures Tijana Milenković and Nataša Pržulj Department of Computer Science, University of California, Irvine, CA 92697-3435, USA Technical

More information

Genome-Scale Gene Function Prediction Using Multiple Sources of High-Throughput Data in Yeast Saccharomyces cerevisiae ABSTRACT

Genome-Scale Gene Function Prediction Using Multiple Sources of High-Throughput Data in Yeast Saccharomyces cerevisiae ABSTRACT OMICS A Journal of Integrative Biology Volume 8, Number 4, 2004 Mary Ann Liebert, Inc. Genome-Scale Gene Function Prediction Using Multiple Sources of High-Throughput Data in Yeast Saccharomyces cerevisiae

More information

7.06 Problem Set #4, Spring 2005

7.06 Problem Set #4, Spring 2005 7.06 Problem Set #4, Spring 2005 1. You re doing a mutant hunt in S. cerevisiae (budding yeast), looking for temperaturesensitive mutants that are defective in the cell cycle. You discover a mutant strain

More information

Written Exam 15 December Course name: Introduction to Systems Biology Course no

Written Exam 15 December Course name: Introduction to Systems Biology Course no Technical University of Denmark Written Exam 15 December 2008 Course name: Introduction to Systems Biology Course no. 27041 Aids allowed: Open book exam Provide your answers and calculations on separate

More information

A critical and integrated view of the yeast interactome

A critical and integrated view of the yeast interactome Comparative and Functional Genomics Comp Funct Genom 2004; 5: 382 402. Published online in Wiley InterScience (www.interscience.wiley.com). DOI: 10.1002/cfg.412 Research Article A critical and integrated

More information

Modeling Interactome: Scale-Free or Geometric?

Modeling Interactome: Scale-Free or Geometric? Modeling Interactome: Scale-Free or Geometric? Pržulj, N., Department of Computer Science, University of Toronto, Toronto, M5S 3G4, Canada Corneil, D. G., Department of Computer Science, University of

More information

Computational methods for predicting protein-protein interactions

Computational methods for predicting protein-protein interactions Computational methods for predicting protein-protein interactions Tomi Peltola T-61.6070 Special course in bioinformatics I 3.4.2008 Outline Biological background Protein-protein interactions Computational

More information

Protein-protein interaction networks Prof. Peter Csermely

Protein-protein interaction networks Prof. Peter Csermely Protein-Protein Interaction Networks 1 Department of Medical Chemistry Semmelweis University, Budapest, Hungary www.linkgroup.hu csermely@eok.sote.hu Advantages of multi-disciplinarity Networks have general

More information

Computational approaches for functional genomics

Computational approaches for functional genomics Computational approaches for functional genomics Kalin Vetsigian October 31, 2001 The rapidly increasing number of completely sequenced genomes have stimulated the development of new methods for finding

More information

Analysis of Biological Networks: Network Robustness and Evolution

Analysis of Biological Networks: Network Robustness and Evolution Analysis of Biological Networks: Network Robustness and Evolution Lecturer: Roded Sharan Scribers: Sasha Medvedovsky and Eitan Hirsh Lecture 14, February 2, 2006 1 Introduction The chapter is divided into

More information

2. Yeast two-hybrid system

2. Yeast two-hybrid system 2. Yeast two-hybrid system I. Process workflow a. Mating of haploid two-hybrid strains on YPD plates b. Replica-plating of diploids on selective plates c. Two-hydrid experiment plating on selective plates

More information

The geneticist s questions. Deleting yeast genes. Functional genomics. From Wikipedia, the free encyclopedia

The geneticist s questions. Deleting yeast genes. Functional genomics. From Wikipedia, the free encyclopedia From Wikipedia, the free encyclopedia Functional genomics..is a field of molecular biology that attempts to make use of the vast wealth of data produced by genomic projects (such as genome sequencing projects)

More information

Predicting Protein Functions and Domain Interactions from Protein Interactions

Predicting Protein Functions and Domain Interactions from Protein Interactions Predicting Protein Functions and Domain Interactions from Protein Interactions Fengzhu Sun, PhD Center for Computational and Experimental Genomics University of Southern California Outline High-throughput

More information

Improved peptide sequencing using isotope information inherent in tandem mass spectra

Improved peptide sequencing using isotope information inherent in tandem mass spectra RAPID COMMUNICATIONS IN MASS SPECTROMETRY Rapid Commun. Mass Spectrom. 2003; 17: 1793 1801 Published online in Wiley InterScience (www.interscience.wiley.com). DOI: 10.1002/rcm.1119 Improved peptide sequencing

More information

Models of transcriptional regulation

Models of transcriptional regulation Models of transcriptional regulation We have already discussed four simple mechanisms of transcriptional regulation, nuclear exclusion nuclear concentration modification of bound activator redirection

More information

MTopGO: a tool for module identification in PPI Networks

MTopGO: a tool for module identification in PPI Networks MTopGO: a tool for module identification in PPI Networks Danila Vella 1,2, Simone Marini 3,4, Francesca Vitali 5,6,7, Riccardo Bellazzi 1,4 1 Clinical Scientific Institute Maugeri, Pavia, Italy, 2 Department

More information

Measuring TF-DNA interactions

Measuring TF-DNA interactions Measuring TF-DNA interactions How is Biological Complexity Achieved? Mediated by Transcription Factors (TFs) 2 Regulation of Gene Expression by Transcription Factors TF trans-acting factors TF TF TF TF

More information

PROTEOMICS. Is the intrinsic disorder of proteins the cause of the scale-free architecture of protein-protein interaction networks?

PROTEOMICS. Is the intrinsic disorder of proteins the cause of the scale-free architecture of protein-protein interaction networks? Is the intrinsic disorder of proteins the cause of the scale-free architecture of protein-protein interaction networks? Journal: Manuscript ID: Wiley - Manuscript type: Date Submitted by the Author: Complete

More information

Identifying Signaling Pathways

Identifying Signaling Pathways These slides, excluding third-party material, are licensed under CC BY-NC 4.0 by Anthony Gitter, Mark Craven, Colin Dewey Identifying Signaling Pathways BMI/CS 776 www.biostat.wisc.edu/bmi776/ Spring 2018

More information

Eukaryotic Gene Expression

Eukaryotic Gene Expression Eukaryotic Gene Expression Lectures 22-23 Several Features Distinguish Eukaryotic Processes From Mechanisms in Bacteria 123 Eukaryotic Gene Expression Several Features Distinguish Eukaryotic Processes

More information

Noisy PPI Data: Alarmingly High False Positive and False Negative Rates

Noisy PPI Data: Alarmingly High False Positive and False Negative Rates xix Preface The time has come for modern biology to move from the study of single molecules to networked systems. The recent technological advances in genomics and proteomics have resulted in the identification

More information

Map of AP-Aligned Bio-Rad Kits with Learning Objectives

Map of AP-Aligned Bio-Rad Kits with Learning Objectives Map of AP-Aligned Bio-Rad Kits with Learning Objectives Cover more than one AP Biology Big Idea with these AP-aligned Bio-Rad kits. Big Idea 1 Big Idea 2 Big Idea 3 Big Idea 4 ThINQ! pglo Transformation

More information

An Efficient Algorithm for Protein-Protein Interaction Network Analysis to Discover Overlapping Functional Modules

An Efficient Algorithm for Protein-Protein Interaction Network Analysis to Discover Overlapping Functional Modules An Efficient Algorithm for Protein-Protein Interaction Network Analysis to Discover Overlapping Functional Modules Ying Liu 1 Department of Computer Science, Mathematics and Science, College of Professional

More information

Graph Theory Approaches to Protein Interaction Data Analysis

Graph Theory Approaches to Protein Interaction Data Analysis Graph Theory Approaches to Protein Interaction Data Analysis Nataša Pržulj September 8, 2003 Contents 1 Introduction 2 1.1 Graph Theoretic Terminology................................. 3 1.2 Biological

More information

Comparative Network Analysis

Comparative Network Analysis Comparative Network Analysis BMI/CS 776 www.biostat.wisc.edu/bmi776/ Spring 2016 Anthony Gitter gitter@biostat.wisc.edu These slides, excluding third-party material, are licensed under CC BY-NC 4.0 by

More information

Analysis and modelling of protein interaction networks

Analysis and modelling of protein interaction networks Analysis and modelling of protein interaction networks - A study of the two-hybrid experiment Karin Stibius Jensen Analysis and modelling of protein interaction networks - A study of the two-hybrid experiment

More information

Modeling Mass Spectrometry-Based Protein Analysis

Modeling Mass Spectrometry-Based Protein Analysis Chapter 8 Jan Eriksson and David Fenyö Abstract The success of mass spectrometry based proteomics depends on efficient methods for data analysis. These methods require a detailed understanding of the information

More information

CHAPTER 13 PROKARYOTE GENES: E. COLI LAC OPERON

CHAPTER 13 PROKARYOTE GENES: E. COLI LAC OPERON PROKARYOTE GENES: E. COLI LAC OPERON CHAPTER 13 CHAPTER 13 PROKARYOTE GENES: E. COLI LAC OPERON Figure 1. Electron micrograph of growing E. coli. Some show the constriction at the location where daughter

More information

Introduction. Gene expression is the combined process of :

Introduction. Gene expression is the combined process of : 1 To know and explain: Regulation of Bacterial Gene Expression Constitutive ( house keeping) vs. Controllable genes OPERON structure and its role in gene regulation Regulation of Eukaryotic Gene Expression

More information

Introduction to Bioinformatics

Introduction to Bioinformatics CSCI8980: Applied Machine Learning in Computational Biology Introduction to Bioinformatics Rui Kuang Department of Computer Science and Engineering University of Minnesota kuang@cs.umn.edu History of Bioinformatics

More information

Ensemble Non-negative Matrix Factorization Methods for Clustering Protein-Protein Interactions

Ensemble Non-negative Matrix Factorization Methods for Clustering Protein-Protein Interactions Belfield Campus Map Ensemble Non-negative Matrix Factorization Methods for Clustering Protein-Protein Interactions

More information

Connectivity and expression in protein networks: Proteins in a complex are uniformly expressed

Connectivity and expression in protein networks: Proteins in a complex are uniformly expressed Connectivity and expression in protein networks: Proteins in a complex are uniformly expressed Shai Carmi, 1 Erez Y. Levanon, 2 Shlomo Havlin, 1 and Eli Eisenberg 3 1 Minerva Center and Department of Physics,

More information

Constructing Signal Transduction Networks Using Multiple Signaling Feature Data

Constructing Signal Transduction Networks Using Multiple Signaling Feature Data Constructing Signal Transduction Networks Using Multiple Signaling Feature Data Thanh-Phuong Nguyen 1, Kenji Satou 2, Tu-Bao Ho 1 and Katsuhiko Takabayashi 3 1 Japan Advanced Institute of Science and Technology

More information

Biological Knowledge Discovery Through Mining Multiple Sources of High-Throughput Data

Biological Knowledge Discovery Through Mining Multiple Sources of High-Throughput Data University of Tennessee, Knoxville Trace: Tennessee Research and Creative Exchange Doctoral Dissertations Graduate School 8-2004 Biological Knowledge Discovery Through Mining Multiple Sources of High-Throughput

More information

An introduction to SYSTEMS BIOLOGY

An introduction to SYSTEMS BIOLOGY An introduction to SYSTEMS BIOLOGY Paolo Tieri CNR Consiglio Nazionale delle Ricerche, Rome, Italy 10 February 2015 Universidade Federal de Minas Gerais, Belo Horizonte, Brasil Course outline Day 1: intro

More information

AP Bio Module 16: Bacterial Genetics and Operons, Student Learning Guide

AP Bio Module 16: Bacterial Genetics and Operons, Student Learning Guide Name: Period: Date: AP Bio Module 6: Bacterial Genetics and Operons, Student Learning Guide Getting started. Work in pairs (share a computer). Make sure that you log in for the first quiz so that you get

More information

Journal of Biology. BioMed Central

Journal of Biology. BioMed Central Journal of Biology BioMed Central Open Access Research article Comprehensive curation and analysis of global interaction networks in Saccharomyces cerevisiae Teresa Reguly *, Ashton Breitkreutz *, Lorrie

More information

AP Curriculum Framework with Learning Objectives

AP Curriculum Framework with Learning Objectives Big Ideas Big Idea 1: The process of evolution drives the diversity and unity of life. AP Curriculum Framework with Learning Objectives Understanding 1.A: Change in the genetic makeup of a population over

More information

Introduction to Bioinformatics Integrated Science, 11/9/05

Introduction to Bioinformatics Integrated Science, 11/9/05 1 Introduction to Bioinformatics Integrated Science, 11/9/05 Morris Levy Biological Sciences Research: Evolutionary Ecology, Plant- Fungal Pathogen Interactions Coordinator: BIOL 495S/CS490B/STAT490B Introduction

More information

2 Genome evolution: gene fusion versus gene fission

2 Genome evolution: gene fusion versus gene fission 2 Genome evolution: gene fusion versus gene fission Berend Snel, Peer Bork and Martijn A. Huynen Trends in Genetics 16 (2000) 9-11 13 Chapter 2 Introduction With the advent of complete genome sequencing,

More information

COMBINATORIAL CHEMISTRY IN A HISTORICAL PERSPECTIVE

COMBINATORIAL CHEMISTRY IN A HISTORICAL PERSPECTIVE NUE FEATURE T R A N S F O R M I N G C H A L L E N G E S I N T O M E D I C I N E Nuevolution Feature no. 1 October 2015 Technical Information COMBINATORIAL CHEMISTRY IN A HISTORICAL PERSPECTIVE A PROMISING

More information

DISCOVERING PROTEIN COMPLEXES IN DENSE RELIABLE NEIGHBORHOODS OF PROTEIN INTERACTION NETWORKS

DISCOVERING PROTEIN COMPLEXES IN DENSE RELIABLE NEIGHBORHOODS OF PROTEIN INTERACTION NETWORKS 1 DISCOVERING PROTEIN COMPLEXES IN DENSE RELIABLE NEIGHBORHOODS OF PROTEIN INTERACTION NETWORKS Xiao-Li Li Knowledge Discovery Department, Institute for Infocomm Research, Heng Mui Keng Terrace, 119613,

More information

BIOINF 4120 Bioinformatics 2 - Structures and Systems - Oliver Kohlbacher Summer Systems Biology Exp. Methods

BIOINF 4120 Bioinformatics 2 - Structures and Systems - Oliver Kohlbacher Summer Systems Biology Exp. Methods BIOINF 4120 Bioinformatics 2 - Structures and Systems - Oliver Kohlbacher Summer 2013 14. Systems Biology Exp. Methods Overview Transcriptomics Basics of microarrays Comparative analysis Interactomics:

More information

A direct comparison of protein interaction confidence assignment schemes

A direct comparison of protein interaction confidence assignment schemes BMC Bioinformatics This Provisional PDF corresponds to the article as it appeared upon acceptance. Copyedited and fully formatted PDF and full text (HTML) versions will be made available soon. A direct

More information

STRING: Protein association networks. Lars Juhl Jensen

STRING: Protein association networks. Lars Juhl Jensen STRING: Protein association networks Lars Juhl Jensen interaction networks association networks guilt by association protein networks STRING 9.6 million proteins common foundation Exercise 1 Go to http://string-db.org/

More information

How Scale-free Type-based Networks Emerge from Instance-based Dynamics

How Scale-free Type-based Networks Emerge from Instance-based Dynamics How Scale-free Type-based Networks Emerge from Instance-based Dynamics Tom Lenaerts, Hugues Bersini and Francisco C. Santos IRIDIA, CP 194/6, Université Libre de Bruxelles, Avenue Franklin Roosevelt 50,

More information

Bioinformatics Chapter 1. Introduction

Bioinformatics Chapter 1. Introduction Bioinformatics Chapter 1. Introduction Outline! Biological Data in Digital Symbol Sequences! Genomes Diversity, Size, and Structure! Proteins and Proteomes! On the Information Content of Biological Sequences!

More information

BME 5742 Biosystems Modeling and Control

BME 5742 Biosystems Modeling and Control BME 5742 Biosystems Modeling and Control Lecture 24 Unregulated Gene Expression Model Dr. Zvi Roth (FAU) 1 The genetic material inside a cell, encoded in its DNA, governs the response of a cell to various

More information

Random Boolean Networks

Random Boolean Networks Random Boolean Networks Boolean network definition The first Boolean networks were proposed by Stuart A. Kauffman in 1969, as random models of genetic regulatory networks (Kauffman 1969, 1993). A Random

More information

INTERACTIVE CLUSTERING FOR EXPLORATION OF GENOMIC DATA

INTERACTIVE CLUSTERING FOR EXPLORATION OF GENOMIC DATA INTERACTIVE CLUSTERING FOR EXPLORATION OF GENOMIC DATA XIUFENG WAN xw6@cs.msstate.edu Department of Computer Science Box 9637 JOHN A. BOYLE jab@ra.msstate.edu Department of Biochemistry and Molecular Biology

More information

Big Idea 1: The process of evolution drives the diversity and unity of life.

Big Idea 1: The process of evolution drives the diversity and unity of life. Big Idea 1: The process of evolution drives the diversity and unity of life. understanding 1.A: Change in the genetic makeup of a population over time is evolution. 1.A.1: Natural selection is a major

More information

Analysis and Simulation of Biological Systems

Analysis and Simulation of Biological Systems Analysis and Simulation of Biological Systems Dr. Carlo Cosentino School of Computer and Biomedical Engineering Department of Experimental and Clinical Medicine Università degli Studi Magna Graecia Catanzaro,

More information

Towards the Prediction of Protein Abundance from Tandem Mass Spectrometry Data

Towards the Prediction of Protein Abundance from Tandem Mass Spectrometry Data Towards the Prediction of Protein Abundance from Tandem Mass Spectrometry Data Anthony J Bonner Han Liu Abstract This paper addresses a central problem of Proteomics: estimating the amounts of each of

More information

A A A A B B1

A A A A B B1 LEARNING OBJECTIVES FOR EACH BIG IDEA WITH ASSOCIATED SCIENCE PRACTICES AND ESSENTIAL KNOWLEDGE Learning Objectives will be the target for AP Biology exam questions Learning Objectives Sci Prac Es Knowl

More information

The architecture of transcription elongation A crystal structure explains how transcription factors enhance elongation and pausing

The architecture of transcription elongation A crystal structure explains how transcription factors enhance elongation and pausing The architecture of transcription elongation A crystal structure explains how transcription factors enhance elongation and pausing By Thomas Fouqueau and Finn Werner The molecular machines that carry out

More information

V 6 Network analysis

V 6 Network analysis V 6 Network analysis - Dijkstra algorithm: compute shortest pathways - Graph layout - Network robustness - Graph modularity Fri, May 4, 2018 Bioinformatics 3 SS 18 V 6 1 The Shortest Path Problem Problem:

More information

Enduring understanding 1.A: Change in the genetic makeup of a population over time is evolution.

Enduring understanding 1.A: Change in the genetic makeup of a population over time is evolution. The AP Biology course is designed to enable you to develop advanced inquiry and reasoning skills, such as designing a plan for collecting data, analyzing data, applying mathematical routines, and connecting

More information

A complementation test would be done by crossing the haploid strains and scoring the phenotype in the diploids.

A complementation test would be done by crossing the haploid strains and scoring the phenotype in the diploids. Problem set H answers 1. To study DNA repair mechanisms, geneticists isolated yeast mutants that were sensitive to various types of radiation; for example, mutants that were more sensitive to UV light.

More information

Biological Concepts and Information Technology (Systems Biology)

Biological Concepts and Information Technology (Systems Biology) Biological Concepts and Information Technology (Systems Biology) Janaina de Andréa Dernowsek Postdoctoral at Center for Information Technology Renato Archer Janaina.dernowsek@cti.gov.br Division of 3D

More information

INCORPORATING GRAPH FEATURES FOR PREDICTING PROTEIN-PROTEIN INTERACTIONS

INCORPORATING GRAPH FEATURES FOR PREDICTING PROTEIN-PROTEIN INTERACTIONS INCORPORATING GRAPH FEATURES FOR PREDICTING PROTEIN-PROTEIN INTERACTIONS Martin S. R. Paradesi Department of Computing and Information Sciences, Kansas State University 234 Nichols Hall, Manhattan, KS

More information

V 5 Robustness and Modularity

V 5 Robustness and Modularity Bioinformatics 3 V 5 Robustness and Modularity Mon, Oct 29, 2012 Network Robustness Network = set of connections Failure events: loss of edges loss of nodes (together with their edges) loss of connectivity

More information

Dr. Amira A. AL-Hosary

Dr. Amira A. AL-Hosary Phylogenetic analysis Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut University-Egypt Phylogenetic Basics: Biological

More information

Bioinformatics I. CPBS 7711 October 29, 2015 Protein interaction networks. Debra Goldberg

Bioinformatics I. CPBS 7711 October 29, 2015 Protein interaction networks. Debra Goldberg Bioinformatics I CPBS 7711 October 29, 2015 Protein interaction networks Debra Goldberg debra@colorado.edu Overview Networks, protein interaction networks (PINs) Network models What can we learn from PINs

More information

Overview. Overview. Social networks. What is a network? 10/29/14. Bioinformatics I. Networks are everywhere! Introduction to Networks

Overview. Overview. Social networks. What is a network? 10/29/14. Bioinformatics I. Networks are everywhere! Introduction to Networks Bioinformatics I Overview CPBS 7711 October 29, 2014 Protein interaction networks Debra Goldberg debra@colorado.edu Networks, protein interaction networks (PINs) Network models What can we learn from PINs

More information

Peeling the yeast protein network

Peeling the yeast protein network 444 DOI 10.1002/pmic.200400962 Proteomics 2005, 5, 444 449 REGULAR ARTICLE Peeling the yeast protein network Stefan Wuchty and Eivind Almaas Department of Physics, University of Notre Dame, Notre Dame,

More information