Gordon Smyth, WEHI Analysis of Replicated Experiments. January 2004 IMS NUS Microarray Tutorial. Some Web Sites. Analysis of Replicated Experiments

Size: px
Start display at page:

Download "Gordon Smyth, WEHI Analysis of Replicated Experiments. January 2004 IMS NUS Microarray Tutorial. Some Web Sites. Analysis of Replicated Experiments"

Transcription

1 Some Web Sites Statistical Methods in Microarray Analysis Tutorial Institute for Mathematical Sciences National University of Sinapore January, 004 Gordon Smyth Walter and Eliza Hall Institute Technical reports etc Statistical software: R - Packaes within R environment - Bioconductor - LIMMA - SPOT (cdna imae analysis) 1 Five Statistical Issues Desinin ene expression experiments Acquirin the raw data: imae analysis Summarizin and removin artefacts from the data Discoverin which enes are differentially expressed Discoverin which enes exhibit interestin expression patterns For a review see Smyth, Yan and Speed, Statistical issues in microarray data analysis, In: Functional Genomics: Methods and Protocols, Methods in Molecular Bioloy, Humana Press, March 003 Lots of other bioinformatics issues 3 Imae Analysis Yan, Buckley, Dudoit and Speed, Comparison of methods for imae analysis on cdna microarray data, Journal of Computational and Graphical Statistics, January Imae Analysis Software If you re usin Affymetrix arrays, the imae analysis will be done as part of the Affy system. If you re usin spotted arrays, you ll scan your arrays to produce TIFF imaes. The imaes will be processed by a proram such as SPOT, GenePix, Imaene or Quantarray to acquire intensity measurements 5 Printin custom library on lass slide Array Printin 6 1

2 Microarray Imae False-coloured imae with two channels overlaid: more hihly expressed in mutant equally expressed more hihly expressed in normal 6 x 6 spots in each pin roup Genes printed in duplicate pairs Arrows indicate print direction AGRF NIA 15k mouse array: 1 x 4 pin roups 7 8 Steps in Imae Processin Sementation 1. Addressin: locate centers. Sementation: divide pixels into foreround (part of spot) or backround 3. Information extraction: calculate sinal intensity pairs, backround and quality measures for each spot Seeded reion rowin SPOT Fixed circle method Genepix: adaptive circles 9 Foreround measured from pixels inside the circle backround measured from ambient intensity around spot 10 Local Backrounds Measures Morpholoical Backround One channel rey scale Genepix: reion includin pink diamonds 11 Measures SPOT estimates overall backround baseline backround usin a nonlinear level morpholoical filter a lower, less variable measure 1

3 Quantification of Expression: cdna Foreround red, reen Backround red, reen Backround corrected lo-ratio ( Minus ) Averae intensity ( Add ) Rf, Gf Rb, Gb R = Rf Rb, G = Gf Gb M = lo R lo G A= (lo R+ lo G) / Example of a quality weihtin: downweiht spots smaller or larer than nominal size Many other possibilities Spot Quality Weihts Lots of issues: which b is best, to subtract b or not, quality, filterin Quantification of Expression: Affymetrix 1-40,000 probe sets/chip, each consistin of 11-0 Perfect Match (PM) and MisMatch (MM) probe pairs. Consider one set: chips i=1,,.. probes j=1,,.. Affymetrix summary: lo(sinal Intensity) = TukeyBiweiht{lo(PM j -MM j *)} dchip model (Li, Schadt & Won): PM ij -MM ij = θ i ϕ j + ε ij Robust Multichip Analysis (RMA): Use median polish or robust reression to fit Exploratory Data Display lo (PM-*BG) ij = chip i effect + probe j effect + ε ij See Bioconductor packaes: affy, AffyPLM Spatial bias in cdna arrays Spatial Plots: Backround Intensities Lo-ratios Print-tip roups Boxplots of M = lo R/G by print-tip roup (1-16) Same slide Locations of spots with extreme 5% M: hih, low

4 MA Plot Backround Affects Variability 19 From Spot: note lack of fannin (not same data) From GenePix: note fannin out at low intensities 0 MA plots of 10 Affymetrix replicate chip pairs Normalization Each of the 5 chips was scanned on a different scanner. 1 Smyth and Speed Normalization of cdna microarray data, in: METHODS: Selectin Candidate Genes from DNA Array Screens, December 003 No Absolute Measurement Scale MA Plot The spotted DNA sequences differ in bindin efficiency, so absolute intensities are not directly comparable between enes The response of the dyes is subject to arbitrary rescalin for each dye on each array need to normalize the red/reen balance for each array (to make M-values unbiased) 3 M = lo R lo G A = 1 / ( lo R + lo G ) 4 4

5 Print-tip-loess Normalization 6 x 4 pin-roups M = lo R lo G 5 A = 1 / ( lo R + lo G ) 6 After Normalization Before normalization M = lo R lo G A = 1 / ( lo R + lo G ) 7 8 After normalization Which Genes are Differentially Expressed? Smyth, Linear models and empirical Bayes methods for assessin differential expression in microarray experiments, July

6 Replicate Arrays Need replication to do statistical analysis of differential expression. Most basic possible is series of arrays all comparin the same two RNA sources: A A A } B B B n arrays 31 Gene-wise Summaries Each ene ives a series of lo-ratios Summarize lo-ratios by averae and standard deviation for each ene, or robust versions of these: M,, 1 Mn M = ave M s = std.dev M 3 Rankin Criteria Moderated t-statistics Averae lo fold chane. Problem: non DE enes with lare variances have too much chance of bein selected. t-statistics Problem: enes with very small sample variances are suspect Moderated t-statistics. We use a happy compromise between the two M t t M = s n M = s n 33 Shrunk standard deviations sd + sd 0 0 s = d + d0 Moderated t-statistics t M = s c Eliminates lare t-statistics merely from very small s 34 Shrinkae of Standard Deviations Posterior Odds s 1 s 1 s s s 0 The data decides whether t,pooled or to s n s n fold chane t t t should be closer to d d 0 Posterior probability of differential expression for any ene is 1+ d+ d0 1/ 0 = c c + c 0 p( β 0 M, s ) p c t + d + d p( β = 0 M, s ) 1 p c + c t d d Monotonic function of t 35 Reparametrization of Lönnstedt and Speed t for constant d 6

7 Estimatin Hyper-Parameters Simulations Closed form estimators with ood properties are available: for s 0 and d 0 in terms of the first two moments of lo s for c 0 in terms of quantiles of the t N False Discoveries Fold Chane SMA Efron t Moderated t Ordinary t N False Discoveries Fold Chane SMA Efron t Moderated t Ordinary t N Genes Selected N Genes Selected 37 σ similar σ very different 38 How many enes are differentially expressed? Rankin Easier Than Testin Assinin absolute sinificance levels on the basis of probability models is problematic: Lo-ratios don t appear to be normally distributed, hard to check Lo-ratios for different enes are correlated in unknown way Hih level of multiple testin means that very small p-values are required distributional assumptions must hold in extreme tail Little opportunity for usual CLT results to apply 39 If there was only one ene, a t-test would ive a reliable P-value for judin whether the true lo-ratio was zero With many enes, computed P-values cannot be trusted (unless have > 16 arrays) It is more realistic to rank the enes in order of evidence for differential expression 40 In Search of Truth We treat moderated t and posterior odds as rankin criteria rather than as providers of absolute sinificance or posterior odds To riorously estimate type I or type II error rates or to compare competin analysis methods, need to construct microarray data with known truth Spike-Ins Spike-in artificial RNA cocktail to induce known fold chanes in control enes Spike-ins can ive an objective basis for choosin cut-off for differential expression

8 Spike-In Controls MA-Plot with spike-in controls hihlihted Print artificial enes on microarray, then add spike-ins of correspondin RNA in known quantities to sample RNA before labellin and hybridization ScoreCard and SpotReport are commercial systems in use at the AGRF Scorecard Controls 40 How well do moderated t- statistics distinuish ratio (DE) controls from calibration (non- DE) controls? Moderated t-statistics Calib01 Calib0 Calib03 Calib04 Calib05 Calib06 Calib07 Calib08 Calib09 Calib10 Ratio 3/1 Low Ratio 1/3 Low Ratio 3/1 Hih Ratio 1/3 Hih Ratio 10/1 Low Ratio 1/10 Low Ratio 10/1 Hih Ratio 1/10 Hih 46 Empirical Cut Off t > 4 appears a conservative rule for differential expression with very small false discovery rate for this data 47 Other Data with Constructed Truth Assessin Precision: Replicate arrays Same vs same hybridizations Whole library pool titration series dynamic rane but not differentially expressed Assessin Bias: Independent measurement of differential expression PCR, independent arrays Spike-in ratio and calibration controls Mixture and dilution experiments 48 8

9 LIMMA Packae for R limma GUI Linear models for microarray data. A software packae for the R prorammin environment. Focus is differential expression includin - moderated t-statistics - methods for duplicate spots - classifyin F-tests - stemmed heat diarams Available from Acknowledements WEHI Bioinformatics Terry Speed Matt Ritchie Natalie Thorne James Wettenhall everyone else WEHI Scott Lab Joelle Michaud Catherine Carmichael Robert Escher Hamish Scott AGRF Steve Wilcox Cathy Jensen Melanie O Keefe WEHI Immunoloy Steve Nutt Lynn Corcoran Mirielle Lahoud Melissa Holmes Berkeley Jean Yan Speed Lab Nai Lab 51 When the chips are done WEHI Bioinformatics 5 9

Low-Level Analysis of High- Density Oligonucleotide Microarray Data

Low-Level Analysis of High- Density Oligonucleotide Microarray Data Low-Level Analysis of High- Density Oligonucleotide Microarray Data Ben Bolstad http://www.stat.berkeley.edu/~bolstad Biostatistics, University of California, Berkeley UC Berkeley Feb 23, 2004 Outline

More information

cdna Microarray Analysis

cdna Microarray Analysis cdna Microarray Analysis with BioConductor packages Nolwenn Le Meur Copyright 2007 Outline Data acquisition Pre-processing Quality assessment Pre-processing background correction normalization summarization

More information

January 2004 IMS NUS Microarray Tutorial. Gordon Smyth, WEHI Analysis of Complex Experiments

January 2004 IMS NUS Microarray Tutorial. Gordon Smyth, WEHI Analysis of Complex Experiments nalsis of Complex Experiments Januar 4 nalsis of Complex Experiments Statistical Methods in Microarra nalsis Tutorial Institute for Mathematical Sciences National Universit of Sinapore Januar, 4 Gordon

More information

Lesson 11. Functional Genomics I: Microarray Analysis

Lesson 11. Functional Genomics I: Microarray Analysis Lesson 11 Functional Genomics I: Microarray Analysis Transcription of DNA and translation of RNA vary with biological conditions 3 kinds of microarray platforms Spotted Array - 2 color - Pat Brown (Stanford)

More information

SPOTTED cdna MICROARRAYS

SPOTTED cdna MICROARRAYS SPOTTED cdna MICROARRAYS Spot size: 50um - 150um SPOTTED cdna MICROARRAYS Compare the genetic expression in two samples of cells PRINT cdna from one gene on each spot SAMPLES cdna labelled red/green e.g.

More information

Probe-Level Analysis of Affymetrix GeneChip Microarray Data

Probe-Level Analysis of Affymetrix GeneChip Microarray Data Probe-Level Analysis of Affymetrix GeneChip Microarray Data Ben Bolstad http://www.stat.berkeley.edu/~bolstad Biostatistics, University of California, Berkeley University of Minnesota Mar 30, 2004 Outline

More information

GS Analysis of Microarray Data

GS Analysis of Microarray Data GS01 0163 Analysis of Microarray Data Keith Baggerly and Bradley Broom Department of Bioinformatics and Computational Biology UT M. D. Anderson Cancer Center kabagg@mdanderson.org bmbroom@mdanderson.org

More information

GS Analysis of Microarray Data

GS Analysis of Microarray Data GS01 0163 Analysis of Microarray Data Keith Baggerly and Kevin Coombes Section of Bioinformatics Department of Biostatistics and Applied Mathematics UT M. D. Anderson Cancer Center kabagg@mdanderson.org

More information

GS Analysis of Microarray Data

GS Analysis of Microarray Data GS01 0163 Analysis of Microarray Data Keith Baggerly and Kevin Coombes Section of Bioinformatics Department of Biostatistics and Applied Mathematics UT M. D. Anderson Cancer Center kabagg@mdanderson.org

More information

Lecture 5 Processing microarray data

Lecture 5 Processing microarray data Lecture 5 Processin microarray data (1)Transform the data into a scale suitable for analysis ()Remove the effects of systematic and obfuscatin sources of variation (3)Identify discrepant observations Preprocessin

More information

Probe-Level Analysis of Affymetrix GeneChip Microarray Data

Probe-Level Analysis of Affymetrix GeneChip Microarray Data Probe-Level Analysis of Affymetrix GeneChip Microarray Data Ben Bolstad http://www.stat.berkeley.edu/~bolstad Biostatistics, University of California, Berkeley Memorial Sloan-Kettering Cancer Center July

More information

GS Analysis of Microarray Data

GS Analysis of Microarray Data GS01 0163 Analysis of Microarray Data Keith Baggerly and Kevin Coombes Section of Bioinformatics Department of Biostatistics and Applied Mathematics UT M. D. Anderson Cancer Center kabagg@mdanderson.org

More information

Robust Semiparametric Optimal Testing Procedure for Multiple Normal Means

Robust Semiparametric Optimal Testing Procedure for Multiple Normal Means Veterinary Dianostic and Production Animal Medicine Publications Veterinary Dianostic and Production Animal Medicine 01 Robust Semiparametric Optimal Testin Procedure for Multiple Normal Means Pen Liu

More information

GS Analysis of Microarray Data

GS Analysis of Microarray Data GS01 0163 Analysis of Microarray Data Keith Baggerly and Kevin Coombes Section of Bioinformatics Department of Biostatistics and Applied Mathematics UT M. D. Anderson Cancer Center kabagg@mdanderson.org

More information

Linear Models and Empirical Bayes Methods for. Assessing Differential Expression in Microarray Experiments

Linear Models and Empirical Bayes Methods for. Assessing Differential Expression in Microarray Experiments Linear Models and Empirical Bayes Methods for Assessing Differential Expression in Microarray Experiments by Gordon K. Smyth (as interpreted by Aaron J. Baraff) STAT 572 Intro Talk April 10, 2014 Microarray

More information

Normalization. Example of Replicate Data. Biostatistics Rafael A. Irizarry

Normalization. Example of Replicate Data. Biostatistics Rafael A. Irizarry This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this

More information

Microarray Preprocessing

Microarray Preprocessing Microarray Preprocessing Normaliza$on Normaliza$on is needed to ensure that differences in intensi$es are indeed due to differen$al expression, and not some prin$ng, hybridiza$on, or scanning ar$fact.

More information

Limma = linear models for microarray data. Linear models and Limma. Expression measures. Questions of Interest. y ga = log 2 (R/G) y ga =

Limma = linear models for microarray data. Linear models and Limma. Expression measures. Questions of Interest. y ga = log 2 (R/G) y ga = Linear models and Limma Københavns Universitet, 19 August 2009 Mark D. Robinson Bioinformatics, Walter+Eliza Hall Institute Epigenetics Laboratory, Garvan Institute (with many slides taken from Gordon

More information

Biochip informatics-(i)

Biochip informatics-(i) Biochip informatics-(i) : biochip normalization & differential expression Ju Han Kim, M.D., Ph.D. SNUBI: SNUBiomedical Informatics http://www.snubi snubi.org/ Biochip Informatics - (I) Biochip basics Preprocessing

More information

Linear Models and Empirical Bayes Methods for Microarrays

Linear Models and Empirical Bayes Methods for Microarrays Methods for Microarrays by Gordon Smyth Alex Sánchez and Carme Ruíz de Villa Department d Estadística Universitat de Barcelona 16-12-2004 Outline 1 An introductory example Paper overview 2 3 Lönnsted and

More information

Session 06 (A): Microarray Basic Data Analysis

Session 06 (A): Microarray Basic Data Analysis 1 SJTU-Bioinformatics Summer School 2017 Session 06 (A): Microarray Basic Data Analysis Maoying,Wu ricket.woo@gmail.com Dept. of Bioinformatics & Biostatistics Shanghai Jiao Tong University Summer, 2017

More information

Design of Microarray Experiments. Xiangqin Cui

Design of Microarray Experiments. Xiangqin Cui Design of Microarray Experiments Xiangqin Cui Experimental design Experimental design: is a term used about efficient methods for planning the collection of data, in order to obtain the maximum amount

More information

Lecture 1: The Structure of Microarray Data

Lecture 1: The Structure of Microarray Data INTRODUCTION TO MICROARRAYS 1 Lecture 1: The Structure of Microarray Data So, Why are we here? What are we trying to measure, and how? Mechanics the data files produced and used Getting numbers: normalization

More information

GS Analysis of Microarray Data

GS Analysis of Microarray Data GS01 0163 Analysis of Microarray Data Keith Baggerly and Kevin Coombes Section of Bioinformatics Department of Biostatistics and Applied Mathematics UT M. D. Anderson Cancer Center kabagg@mdanderson.org

More information

Seminar Microarray-Datenanalyse

Seminar Microarray-Datenanalyse Seminar Microarray- Normalization Hans-Ulrich Klein Christian Ruckert Institut für Medizinische Informatik WWU Münster SS 2011 Organisation 1 09.05.11 Normalisierung 2 10.05.11 Bestimmen diff. expr. Gene,

More information

Empirical Bayes Moderation of Asymptotically Linear Parameters

Empirical Bayes Moderation of Asymptotically Linear Parameters Empirical Bayes Moderation of Asymptotically Linear Parameters Nima Hejazi Division of Biostatistics University of California, Berkeley stat.berkeley.edu/~nhejazi nimahejazi.org twitter/@nshejazi github/nhejazi

More information

Data Preprocessing. Data Preprocessing

Data Preprocessing. Data Preprocessing Data Preprocessing 1 Data Preprocessing Normalization: the process of removing sampleto-sample variations in the measurements not due to differential gene expression. Bringing measurements from the different

More information

SUPPLEMENTAL DATA: ROBUST ESTIMATORS FOR EXPRESSION ANALYSIS EARL HUBBELL, WEI-MIN LIU, AND RUI MEI

SUPPLEMENTAL DATA: ROBUST ESTIMATORS FOR EXPRESSION ANALYSIS EARL HUBBELL, WEI-MIN LIU, AND RUI MEI SUPPLEMENTAL DATA: ROBUST ESTIMATORS FOR EXPRESSION ANALYSIS EARL HUBBELL, WEI-MIN LIU, AND RUI MEI ABSTRACT. This is supplemental data extracted from the paper Robust Estimators for Expression Analysis

More information

Chapter 3: Statistical methods for estimation and testing. Key reference: Statistical methods in bioinformatics by Ewens & Grant (2001).

Chapter 3: Statistical methods for estimation and testing. Key reference: Statistical methods in bioinformatics by Ewens & Grant (2001). Chapter 3: Statistical methods for estimation and testing Key reference: Statistical methods in bioinformatics by Ewens & Grant (2001). Chapter 3: Statistical methods for estimation and testing Key reference:

More information

GS Analysis of Microarray Data

GS Analysis of Microarray Data GS01 0163 Analysis of Microarray Data Keith Baggerly and Bradley Broom Department of Bioinformatics and Computational Biology UT M. D. Anderson Cancer Center kabagg@mdanderson.org bmbroom@mdanderson.org

More information

GS Analysis of Microarray Data

GS Analysis of Microarray Data GS01 0163 Analysis of Microarray Data Keith Baggerly and Bradley Broom Department of Bioinformatics and Computational Biology UT M. D. Anderson Cancer Center kabagg@mdanderson.org bmbroom@mdanderson.org

More information

Stat260: Bayesian Modeling and Inference Lecture Date: March 10, 2010

Stat260: Bayesian Modeling and Inference Lecture Date: March 10, 2010 Stat60: Bayesian Modelin and Inference Lecture Date: March 10, 010 Bayes Factors, -priors, and Model Selection for Reression Lecturer: Michael I. Jordan Scribe: Tamara Broderick The readin for this lecture

More information

Error models and normalization. Wolfgang Huber DKFZ Heidelberg

Error models and normalization. Wolfgang Huber DKFZ Heidelberg Error models and normalization Wolfgang Huber DKFZ Heidelberg Acknowledgements Anja von Heydebreck, Martin Vingron Andreas Buness, Markus Ruschhaupt, Klaus Steiner, Jörg Schneider, Katharina Finis, Anke

More information

Empirical Bayes Moderation of Asymptotically Linear Parameters

Empirical Bayes Moderation of Asymptotically Linear Parameters Empirical Bayes Moderation of Asymptotically Linear Parameters Nima Hejazi Division of Biostatistics University of California, Berkeley stat.berkeley.edu/~nhejazi nimahejazi.org twitter/@nshejazi github/nhejazi

More information

Technologie w skali genomowej 2/ Algorytmiczne i statystyczne aspekty sekwencjonowania DNA

Technologie w skali genomowej 2/ Algorytmiczne i statystyczne aspekty sekwencjonowania DNA Technologie w skali genomowej 2/ Algorytmiczne i statystyczne aspekty sekwencjonowania DNA Expression analysis for RNA-seq data Ewa Szczurek Instytut Informatyki Uniwersytet Warszawski 1/35 The problem

More information

Statistical analysis of microarray data: a Bayesian approach

Statistical analysis of microarray data: a Bayesian approach Biostatistics (003), 4, 4,pp. 597 60 Printed in Great Britain Statistical analysis of microarray data: a Bayesian approach RAPHAEL GTTARD University of Washington, Department of Statistics, Box 3543, Seattle,

More information

Statistical Applications in Genetics and Molecular Biology

Statistical Applications in Genetics and Molecular Biology Statistical Applications in Genetics and Molecular Biology Volume 4, Issue 1 2005 Article 30 Weighted Analysis of Paired Microarray Experiments Erik Kristiansson Anders Sjögren Mats Rudemo Olle Nerman

More information

Experimental Design. Experimental design. Outline. Choice of platform Array design. Target samples

Experimental Design. Experimental design. Outline. Choice of platform Array design. Target samples Experimental Design Credit for some of today s materials: Jean Yang, Terry Speed, and Christina Kendziorski Experimental design Choice of platform rray design Creation of probes Location on the array Controls

More information

Use of Agilent Feature Extraction Software (v8.1) QC Report to Evaluate Microarray Performance

Use of Agilent Feature Extraction Software (v8.1) QC Report to Evaluate Microarray Performance Use of Agilent Feature Extraction Software (v8.1) QC Report to Evaluate Microarray Performance Anthea Dokidis Glenda Delenstarr Abstract The performance of the Agilent microarray system can now be evaluated

More information

REPLICATED MICROARRAY DATA

REPLICATED MICROARRAY DATA Statistica Sinica 1(), 31-46 REPLICATED MICROARRAY DATA Ingrid Lönnstedt and Terry Speed Uppsala University, University of California, Berkeley and Walter and Eliza Hall Institute Abstract: cdna microarrays

More information

David M. Rocke Division of Biostatistics and Department of Biomedical Engineering University of California, Davis

David M. Rocke Division of Biostatistics and Department of Biomedical Engineering University of California, Davis David M. Rocke Division of Biostatistics and Department of Biomedical Engineering University of California, Davis March 18, 2016 UVA Seminar RNA Seq 1 RNA Seq Gene expression is the transcription of the

More information

changes in gene expression, we developed and tested several models. Each model was

changes in gene expression, we developed and tested several models. Each model was Additional Files Additional File 1 File format: PDF Title: Experimental design and linear models Description: This additional file describes in detail the experimental design and linear models used to

More information

Androgen-independent prostate cancer

Androgen-independent prostate cancer The following tutorial walks through the identification of biological themes in a microarray dataset examining androgen-independent. Visit the GeneSifter Data Center (www.genesifter.net/web/datacenter.html)

More information

Single gene analysis of differential expression

Single gene analysis of differential expression Single gene analysis of differential expression Giorgio Valentini DSI Dipartimento di Scienze dell Informazione Università degli Studi di Milano valentini@dsi.unimi.it Comparing two conditions Each condition

More information

Expression arrays, normalization, and error models

Expression arrays, normalization, and error models 1 Epression arrays, normalization, and error models There are a number of different array technologies available for measuring mrna transcript levels in cell populations, from spotted cdna arrays to in

More information

DETECTING DIFFERENTIALLY EXPRESSED GENES WHILE CONTROLLING THE FALSE DISCOVERY RATE FOR MICROARRAY DATA

DETECTING DIFFERENTIALLY EXPRESSED GENES WHILE CONTROLLING THE FALSE DISCOVERY RATE FOR MICROARRAY DATA University of Nebraska - Lincoln DigitalCommons@University of Nebraska - Lincoln Dissertations and Theses in Statistics Statistics, Department of 2009 DETECTING DIFFERENTIALLY EXPRESSED GENES WHILE CONTROLLING

More information

Improving the identification of differentially expressed genes in cdna microarray experiments

Improving the identification of differentially expressed genes in cdna microarray experiments Improving the identification of differentially expressed genes in cdna microarray experiments Databionics Research Group University of Marburg 33 Marburg, Germany Alfred Ultsch Abstract. The identification

More information

Improved Statistical Tests for Differential Gene Expression by Shrinking Variance Components Estimates

Improved Statistical Tests for Differential Gene Expression by Shrinking Variance Components Estimates Improved Statistical Tests for Differential Gene Expression by Shrinking Variance Components Estimates September 4, 2003 Xiangqin Cui, J. T. Gene Hwang, Jing Qiu, Natalie J. Blades, and Gary A. Churchill

More information

Topics on statistical design and analysis. of cdna microarray experiment

Topics on statistical design and analysis. of cdna microarray experiment Topics on statistical design and analysis of cdna microarray experiment Ximin Zhu A Dissertation Submitted to the University of Glasgow for the degree of Doctor of Philosophy Department of Statistics May

More information

Design of microarray experiments

Design of microarray experiments Design of microarray experiments Ulrich ansmann mansmann@imbi.uni-heidelberg.de Practical microarray analysis September Heidelberg Heidelberg, September otivation The lab biologist and theoretician need

More information

DIRECT VERSUS INDIRECT DESIGNS FOR edna MICROARRAY EXPERIMENTS

DIRECT VERSUS INDIRECT DESIGNS FOR edna MICROARRAY EXPERIMENTS Sankhyā : The Indian Journal of Statistics Special issue in memory of D. Basu 2002, Volume 64, Series A, Pt. 3, pp 706-720 DIRECT VERSUS INDIRECT DESIGNS FOR edna MICROARRAY EXPERIMENTS By TERENCE P. SPEED

More information

University of California, Berkeley

University of California, Berkeley University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 2004 Paper 147 Multiple Testing Methods For ChIP-Chip High Density Oligonucleotide Array Data Sunduz

More information

Practical Applications and Properties of the Exponentially. Modified Gaussian (EMG) Distribution. A Thesis. Submitted to the Faculty

Practical Applications and Properties of the Exponentially. Modified Gaussian (EMG) Distribution. A Thesis. Submitted to the Faculty Practical Applications and Properties of the Exponentially Modified Gaussian (EMG) Distribution A Thesis Submitted to the Faculty of Drexel University by Scott Haney in partial fulfillment of the requirements

More information

GS Analysis of Microarray Data

GS Analysis of Microarray Data GS01 0163 Analysis of Microarray Data Keith Baggerly and Kevin Coombes Section of Bioinformatics Department of Biostatistics and Applied Mathematics UT M. D. Anderson Cancer Center kabagg@mdanderson.org

More information

Design of microarray experiments

Design of microarray experiments Design of microarray experiments Ulrich Mansmann mansmann@imbi.uni-heidelberg.de Practical microarray analysis March 23 Heidelberg Heidelberg, March 23 Experiments Scientists deal mostly with experiments

More information

Laurent Gautier 1, 2. August 18 th, 2009

Laurent Gautier 1, 2. August 18 th, 2009 s Processing of Laurent Gautier 1, 2 1 DTU Multi-Assay Core(DMAC) 2 Center for Biological Sequence (CBS) August 18 th, 2009 s 1 s 2 3 4 s 1 s 2 3 4 Photolithography s Courtesy of The Science Creative Quarterly,

More information

Tools and topics for microarray analysis

Tools and topics for microarray analysis Tools and topics for microarray analysis USSES Conference, Blowing Rock, North Carolina, June, 2005 Jason A. Osborne, osborne@stat.ncsu.edu Department of Statistics, North Carolina State University 1 Outline

More information

Overview - MS Proteomics in One Slide. MS masses of peptides. MS/MS fragments of a peptide. Results! Match to sequence database

Overview - MS Proteomics in One Slide. MS masses of peptides. MS/MS fragments of a peptide. Results! Match to sequence database Overview - MS Proteomics in One Slide Obtain protein Digest into peptides Acquire spectra in mass spectrometer MS masses of peptides MS/MS fragments of a peptide Results! Match to sequence database 2 But

More information

Microarray time course experiments. Terry Speed UC Berkeley and WEHI

Microarray time course experiments. Terry Speed UC Berkeley and WEHI Microarray time course experiments Terry Speed UC Berkeley and WEHI Functional Genomics: Principles of Statistics and Biology Firenze, May 9-13, 2005 2 Outline Types and features Design, including replication

More information

Advanced Statistical Methods: Beyond Linear Regression

Advanced Statistical Methods: Beyond Linear Regression Advanced Statistical Methods: Beyond Linear Regression John R. Stevens Utah State University Notes 3. Statistical Methods II Mathematics Educators Worshop 28 March 2009 1 http://www.stat.usu.edu/~jrstevens/pcmi

More information

Non-specific filtering and control of false positives

Non-specific filtering and control of false positives Non-specific filtering and control of false positives Richard Bourgon 16 June 2009 bourgon@ebi.ac.uk EBI is an outstation of the European Molecular Biology Laboratory Outline Multiple testing I: overview

More information

Analysis of visibility level in road lighting using image processing techniques

Analysis of visibility level in road lighting using image processing techniques Scientific Research and Essays Vol. 5 (18), pp. 2779-2785, 18 September, 2010 Available online at http://www.academicjournals.or/sre ISSN 1992-2248 2010 Academic Journals Full enth Research Paper Analysis

More information

SPH 247 Statistical Analysis of Laboratory Data. April 28, 2015 SPH 247 Statistics for Laboratory Data 1

SPH 247 Statistical Analysis of Laboratory Data. April 28, 2015 SPH 247 Statistics for Laboratory Data 1 SPH 247 Statistical Analysis of Laboratory Data April 28, 2015 SPH 247 Statistics for Laboratory Data 1 Outline RNA-Seq for differential expression analysis Statistical methods for RNA-Seq: Structure and

More information

Solution to the take home exam for ECON 3150/4150

Solution to the take home exam for ECON 3150/4150 Solution to the tae home exam for ECO 350/450 Jia Zhiyan and Jo Thori Lind April 2004 General comments Most of the copies we ot were quite ood, and it seems most of you have done a real effort on the problem

More information

Multiplicative background correction for spotted. microarrays to improve reproducibility

Multiplicative background correction for spotted. microarrays to improve reproducibility Multiplicative background correction for spotted microarrays to improve reproducibility DABAO ZHANG,, MIN ZHANG, MARTIN T. WELLS, March 12, 2006 Department of Statistics, Purdue University, West Lafayette,

More information

Statistics for Differential Expression in Sequencing Studies. Naomi Altman

Statistics for Differential Expression in Sequencing Studies. Naomi Altman Statistics for Differential Expression in Sequencing Studies Naomi Altman naomi@stat.psu.edu Outline Preliminaries what you need to do before the DE analysis Stat Background what you need to know to understand

More information

Bioconductor Project Working Papers

Bioconductor Project Working Papers Bioconductor Project Working Papers Bioconductor Project Year 2004 Paper 6 Error models for microarray intensities Wolfgang Huber Anja von Heydebreck Martin Vingron Department of Molecular Genome Analysis,

More information

ABSSeq: a new RNA-Seq analysis method based on modelling absolute expression differences

ABSSeq: a new RNA-Seq analysis method based on modelling absolute expression differences ABSSeq: a new RNA-Seq analysis method based on modelling absolute expression differences Wentao Yang October 30, 2018 1 Introduction This vignette is intended to give a brief introduction of the ABSSeq

More information

Package plw. R topics documented: May 7, Type Package

Package plw. R topics documented: May 7, Type Package Type Package Package plw May 7, 2018 Title Probe level Locally moderated Weighted t-tests. Version 1.40.0 Date 2009-07-22 Author Magnus Astrand Maintainer Magnus Astrand

More information

Manual: R package HTSmix

Manual: R package HTSmix Manual: R package HTSmix Olga Vitek and Danni Yu May 2, 2011 1 Overview High-throughput screens (HTS) measure phenotypes of thousands of biological samples under various conditions. The phenotypes are

More information

Multiple testing: Intro & FWER 1

Multiple testing: Intro & FWER 1 Multiple testing: Intro & FWER 1 Mark van de Wiel mark.vdwiel@vumc.nl Dep of Epidemiology & Biostatistics,VUmc, Amsterdam Dep of Mathematics, VU 1 Some slides courtesy of Jelle Goeman 1 Practical notes

More information

Single gene analysis of differential expression. Giorgio Valentini

Single gene analysis of differential expression. Giorgio Valentini Single gene analysis of differential expression Giorgio Valentini valenti@disi.unige.it Comparing two conditions Each condition may be represented by one or more RNA samples. Using cdna microarrays, samples

More information

Optimal normalization of DNA-microarray data

Optimal normalization of DNA-microarray data Optimal normalization of DNA-microarray data Daniel Faller 1, HD Dr. J. Timmer 1, Dr. H. U. Voss 1, Prof. Dr. Honerkamp 1 and Dr. U. Hobohm 2 1 Freiburg Center for Data Analysis and Modeling 1 F. Hoffman-La

More information

Design and Analysis of Gene Expression Experiments

Design and Analysis of Gene Expression Experiments Design and Analysis of Gene Expression Experiments Guilherme J. M. Rosa Department of Animal Sciences Department of Biostatistics & Medical Informatics University of Wisconsin - Madison OUTLINE Æ Linear

More information

Causal Bayesian Networks

Causal Bayesian Networks Causal Bayesian Networks () Ste7 (2) (3) Kss Fus3 Ste2 () Fiure : Simple Example While Bayesian networks should typically be viewed as acausal, it is possible to impose a causal interpretation on these

More information

DATA ANALYSIS FOR MICROARRAY EXPERIMENT AND DNA BARCODE OF LIFE

DATA ANALYSIS FOR MICROARRAY EXPERIMENT AND DNA BARCODE OF LIFE DATA ANALYSIS FOR MICROARRAY EXPERIMENT AND DNA BARCODE OF LIFE BY CHING-RAY YU A dissertation submitted to the Graduate School New Brunswick Rutgers, The State University of New Jersey in partial fulfillment

More information

Optimal design of microarray experiments

Optimal design of microarray experiments University of Groningen e.c.wit@rug.nl http://www.math.rug.nl/ ernst 7 June 2011 What is a cdna Microarray Experiment? GREEN (Cy3) Cancer Tissue mrna Mix tissues in equal amounts G G G R R R R R G R G

More information

Correlated Component Regression: A Fast Parsimonious Approach for Predicting Outcome Variables from a Large Number of Predictors

Correlated Component Regression: A Fast Parsimonious Approach for Predicting Outcome Variables from a Large Number of Predictors Correlated Component Reression: A Fast Parsimonious Approach for Predictin Outcome Variables from a Lare Number of Predictors Jay Maidson, Ph.D. Statistical Innovations COMPTSTAT 2010, Paris, France 1

More information

DEGseq: an R package for identifying differentially expressed genes from RNA-seq data

DEGseq: an R package for identifying differentially expressed genes from RNA-seq data DEGseq: an R package for identifying differentially expressed genes from RNA-seq data Likun Wang Zhixing Feng i Wang iaowo Wang * and uegong Zhang * MOE Key Laboratory of Bioinformatics and Bioinformatics

More information

THE SIGNAL ESTIMATOR LIMIT SETTING METHOD

THE SIGNAL ESTIMATOR LIMIT SETTING METHOD ' THE SIGNAL ESTIMATOR LIMIT SETTING METHOD Shan Jin, Peter McNamara Department of Physics, University of Wisconsin Madison, Madison, WI 53706 Abstract A new method of backround subtraction is presented

More information

g A n(a, g) n(a, ḡ) = n(a) n(a, g) n(a) B n(b, g) n(a, ḡ) = n(b) n(b, g) n(b) g A,B A, B 2 RNA-seq (D) RNA mrna [3] RNA 2. 2 NGS 2 A, B NGS n(

g A n(a, g) n(a, ḡ) = n(a) n(a, g) n(a) B n(b, g) n(a, ḡ) = n(b) n(b, g) n(b) g A,B A, B 2 RNA-seq (D) RNA mrna [3] RNA 2. 2 NGS 2 A, B NGS n( ,a) RNA-seq RNA-seq Cuffdiff, edger, DESeq Sese Jun,a) Abstract: Frequently used biological experiment technique for observing comprehensive gene expression has been changed from microarray using cdna

More information

Modelling the inhalation of drug particles in a human nasal cavity

Modelling the inhalation of drug particles in a human nasal cavity J. Biomedical Science and Enineerin, 2010, 3, 52-58 doi: 10.4236/jbise.2010.31008 Published Online January 2010 (http://www.scirp.or/journal/jbise/). Modellin the inhalation of dru particles in a human

More information

EE 435 Lecture 13. Cascaded Amplifiers. -- Two-Stage Op Amp Design

EE 435 Lecture 13. Cascaded Amplifiers. -- Two-Stage Op Amp Design EE 435 Lecture 13 ascaded Amplifiers -- Two-Stae Op Amp Desin Review from Last Time Routh-Hurwitz Stability riteria: A third-order polynomial s 3 +a 2 s 2 +a 1 s+a 0 has all poles in the LHP iff all coefficients

More information

Estimation of Transformations for Microarray Data Using Maximum Likelihood and Related Methods

Estimation of Transformations for Microarray Data Using Maximum Likelihood and Related Methods Estimation of Transformations for Microarray Data Using Maximum Likelihood and Related Methods Blythe Durbin, Department of Statistics, UC Davis, Davis, CA 95616 David M. Rocke, Department of Applied Science,

More information

Disclaimer: This lab write-up is not

Disclaimer: This lab write-up is not Disclaimer: This lab write-up is not to be copied, in whole or in part, unless a proper reference is made as to the source. (It is stronly recommended that you use this document only to enerate ideas,

More information

Gene Expression an Overview of Problems & Solutions: 3&4. Utah State University Bioinformatics: Problems and Solutions Summer 2006

Gene Expression an Overview of Problems & Solutions: 3&4. Utah State University Bioinformatics: Problems and Solutions Summer 2006 Gene Expression an Overview of Problems & Solutions: 3&4 Utah State University Bioinformatics: Problems and Solutions Summer 006 Review Considering several problems & solutions with gene expression data

More information

Bias in RNA sequencing and what to do about it

Bias in RNA sequencing and what to do about it Bias in RNA sequencing and what to do about it Walter L. (Larry) Ruzzo Computer Science and Engineering Genome Sciences University of Washington Fred Hutchinson Cancer Research Center Seattle, WA, USA

More information

Two-step Anomalous Diffusion Tensor Imaging

Two-step Anomalous Diffusion Tensor Imaging Two-step Anomalous Diffusion Tensor Imain Thomas R. Barrick 1, Matthew G. Hall 2 1 Centre for Stroke and Dementia, Division of Cardiac and Vascular Sciences, St. Geore s University of London, 2 Department

More information

Probabilistic Inference for Multiple Testing

Probabilistic Inference for Multiple Testing This is the title page! This is the title page! Probabilistic Inference for Multiple Testing Chuanhai Liu and Jun Xie Department of Statistics, Purdue University, West Lafayette, IN 47907. E-mail: chuanhai,

More information

High-throughput Testing

High-throughput Testing High-throughput Testing Noah Simon and Richard Simon July 2016 1 / 29 Testing vs Prediction On each of n patients measure y i - single binary outcome (eg. progression after a year, PCR) x i - p-vector

More information

GS Analysis of Microarray Data

GS Analysis of Microarray Data GS01 0163 Analysis of Microarray Data Keith Baggerly and Kevin Coombes Department of Bioinformatics and Computational Biology UT M. D. Anderson Cancer Center kabagg@mdanderson.org kcoombes@mdanderson.org

More information

Design of Screening Experiments with Partial Replication

Design of Screening Experiments with Partial Replication Design of Screening Experiments with Partial Replication David J. Edwards Department of Statistical Sciences & Operations Research Virginia Commonwealth University Robert D. Leonard Department of Information

More information

Targeted Learning for High-Dimensional Variable Importance

Targeted Learning for High-Dimensional Variable Importance Targeted Learning for High-Dimensional Variable Importance Alan Hubbard, Nima Hejazi, Wilson Cai, Anna Decker Division of Biostatistics University of California, Berkeley July 27, 2016 for Centre de Recherches

More information

Example 1: Two-Treatment CRD

Example 1: Two-Treatment CRD Introduction to Mixed Linear Models in Microarray Experiments //0 Copyright 0 Dan Nettleton Statistical Models A statistical model describes a formal mathematical data generation mechanism from which an

More information

Advanced Methods Development for Equilibrium Cycle Calculations of the RBWR. Andrew Hall 11/7/2013

Advanced Methods Development for Equilibrium Cycle Calculations of the RBWR. Andrew Hall 11/7/2013 Advanced Methods Development for Equilibrium Cycle Calculations of the RBWR Andrew Hall 11/7/2013 Outline RBWR Motivation and Desin Why use Serpent Cross Sections? Modelin the RBWR Axial Discontinuity

More information

BLAST Database Searching. BME 110: CompBio Tools Todd Lowe April 8, 2010

BLAST Database Searching. BME 110: CompBio Tools Todd Lowe April 8, 2010 BLAST Database Searching BME 110: CompBio Tools Todd Lowe April 8, 2010 Admin Reading: Read chapter 7, and the NCBI Blast Guide and tutorial http://www.ncbi.nlm.nih.gov/blast/why.shtml Read Chapter 8 for

More information

Altitude measurement for model rocketry

Altitude measurement for model rocketry Altitude measurement for model rocketry David A. Cauhey Sibley School of Mechanical Aerospace Enineerin, Cornell University, Ithaca, New York 14853 I. INTRODUCTION In his book, Rocket Boys, 1 Homer Hickam

More information

Electronic Supplementary Information Increasing Hybridization Rate and Sensitivity of DNA Microarrays Using Isotachophoresis

Electronic Supplementary Information Increasing Hybridization Rate and Sensitivity of DNA Microarrays Using Isotachophoresis Electronic Supplementary Material (ESI) for Lab on a Chip. This journal is The Royal Society of Chemistry 2014 Electronic Supplementary Information Increasing Hybridization Rate and Sensitivity of DNA

More information