Bioinformatics Transcriptome Jacques.van.Helden@ulb.ac.be Université Libre de Bruxelles, Belgique Laboratoire de Bioinformatique des Génomes et des Réseaux (BiGRe) http://www.bigre.ulb.ac.be/
Bioinformatics Transcriptome Jacques.van.Helden@ulb.ac.be Université Libre de Bruxelles, Belgique Laboratoire de Bioinformatique des Génomes et des Réseaux (BiGRe) http://www.bigre.ulb.ac.be/
Measuring the expression of all the genes of a genome derisi et al. (1997). Science 278: 680-686 In 1997, derisi and co-workers develop a method to measure the level of transcription of all the genes of a genome. The method allows to compare the concentrations of mrna of each gene between two experimental conditions Green channel: reference Red channel: test The intensity of a spot indicates the average concentration of the corresponding mrna in the two samples. The color of a spot indicates regulation: Red: up-regulated in the test, relative to the reference condition Green: down-regulated
DNA chip technology Cell culture, tissue,... RNA extraction Synthesis of fluorescent cdna Sample 1 Sample 2 RNA cdna RNA cdna Brightness Quantity Color Specificity yellowish reddish greenish not specific sample 1 - specific sample 2 - specific DNA chip Source: derisi et al., Science 1997
Scanning result slide from Peter Sterk
Complete microarray Source:DeRisi et al. (1997) Science, 278(5338), 680-6. derisi et al. (1997). Science 278: 680-686
DNA chips raw measurements Raw measurements Red intensity Red background Green intensity Green background Intensity background = level of expression Red in experimental conditions Green in control
DNA chips useful metrics The level of regulation is represented by the ratio r = red " red.bg green " green.bg r >1 r < 1 up-regulated down-regulated The log-ratio provides a more convenient statistic (we will see why during the course) log 2 is even more convenient because the scale is intuitive # red " red.bg & R = log 2 % ( $ green " green.bg' R < 0 down-regulated R > 0 up-regulated R > 1 regulated by a factor of 2 R > 2 regulated by a factor of 4 R > w regulated by a factor of 2 w
Time series At each time point, the expression level is compared to the control (log-ratio) Example: Nitrogen depletion ORF Gene 30 min 1 hour 2 hours 4 hours 8 hours 12 hours 1 days 2 days 3 days 5 days YAL001C TFC3 0.33 0.37 0.18 0.06 0.32 0.09-0.09 0.02-0.01-0.06 YAL002W VPS8 0.22 0.29-0.04-0.3 0.18 0.04 0.3-0.06-0.07-0.21 YAL003W EFB1-1.6-1.43-0.84-0.25-1.22-0.92-1.22-0.97-0.84-0.89 YAL004W YAL004W -0.81-0.63-1.43-0.85-0.88-0.68-0.59-1.01-0.68-0.71 YAL005C SSA1-0.81-0.74-1.36-0.97-0.94-0.86-1.56-1.89-1.89-1.06 YAL007C ERP2-0.42-0.3-0.38-0.22-0.81-1.15-1.6-1.89-2.06-2.4 YAL008W FUN14 0.23 0.25-0.01-0.42 0.58 0.52 0.34 0.23 0.7 0.24 YAL009W SPO7-0.1-0.01-0.13 0.02 0.15 0.26 0.59 0.26 0.38 1.1 YAL010C MDM10 0.39 0.26 0.18-0.14 0.31 0.28 0.11-0.04 0.11-0.14 YAL011W YAL011W -0.23-0.06 0.2-0.04 0.19-0.17 0.31-0.23 0.25-0.23 YAL012W CYS3 1.94 1.62 0.32 0.04-2.06-2.64-2.84-2.47-2.47-2.12 YAL013W DEP1-0.94-0.22-0.64-0.74-0.32-0.3 0.24-0.62-0.45-0.69 YAL014C YAL014C -0.01 0.28-0.45-0.58-0.06 0.16 0.73 0.54 0.61 0.11 YAL015C NTG1 0.37 0.56 0.01-0.14-0.15-0.14-0.25-0.07 0.16 0.03.................................... Source: Gasch et al (2000) Molecular Biology of the Cell 11:4241-4257
Examples of experimental conditions Presence/absence of a metabolite gal vs glucose Transcription factor mutants Yap1p over-expression TUP1 deletion Massive environmental changes rich versus minimal medium diauxic shift (7 time points during the shift) Cell differentiation sporulation mating type Cell cycle
Temporal profiles of expression derisi et al measured the level of expression of all the genes at 7 time points during the diauxic shit. The figure shows groups of genes show similar expression profiles, Some of these groups contain genes with similar function (e.g. coding for ribosomal proteins) Some of these groups have a common regulatory element in their promoter (e.g. stress response element). derisi et al. (1997). Science 278: 680-686
Cell cycle In 1998, Spellman and colleagues measure the expression of all yeast genes during the cell cycle. They detect 800 genes showing periodical fluctuations of expression. These genes can be sorted according to the peak of expression, in order to group genes induced during the different phases of the cell cycle (G1, S, G2, M). Spellman et al. (1998) Molecular Biology of the Cell 9:3273-3297
Gene expression data: hierarchical clustering Alpha cdc15 cdc28 Elu MCM CLB2 SIC1 MAT CLN2 Y' On the image, genes are clustered according to expression profiles, using Michael Eisen s software cluster (Eisen et al., PNAS 1998: 95, 14863-8). Strengths The profiles and the clusters are visible together Familiar to biologists (frequently used for phylogeny) Weaknesses Isomorphism: each node of the tree can be permuted vertical distance between genes does not reflect the real distance Where to set the cluster boundaries? The tree does not reflect the combinatorial aspect of regulation MET Spellman et al. (1998). Mol Biol Cell 9(12), 3273-97.
Gasch (2000) - gene response to environmental changes Gasch et al. (2000) measure the transcriptional response of yeast genes to various environmental changes 173 microarrays ~6000 genes per microarray
Classification of cancer types Microarrays are also used to select genes which will serve as molecular signatures to classify cancer types. These genes can then be used to establish a diagnostic for new patients. Golub et al. (1999). Science 286: 531-537