SUPPLEMENTARY MATERIAL GaGa: a simple and flexible hierarchical model for microarray data analysis
|
|
- Rosalind Washington
- 6 years ago
- Views:
Transcription
1 SUPPLEMENTARY MATERIAL GaGa: a simple and flexible hierarchical mdel fr micrarray data analysis David Rssell Department f Bistatistics M.D. Andersn Cancer Center, Hustn, TX 77030, USA rsselldavid@gmail.cm Cntents 1 Bayesian prcedure fr gene differential expressin analysis 2 2 Frequentist perating characteristics f the Bayesian prcedure 3 3 Frequentist FDR fr the Armstrng dataset 5 4 Gdness f fit in the Armstrng dataset 6 1
2 1 Bayesian prcedure fr gene differential expressin analysis Here we detail the Bayesian prcedure that minimizes the Bayesian FNR subject t the Bayesian FDR being belw a threshld α. Let δ 1,..., δ n dente the expressin pattern that each gene fllws, e.g. if there are nly tw grups t be cmpared δ i = 0 dentes the null hypthesis that bth grups are equal and δ i = 1 dentes that they are different. Dente as d i = d i (x) the pattern that gene i is assigned t, i.e. d i = 0 means that we declare the gene as equally expressed (EE) and d i 0 that we declare it differentially expressed (DE). The false negative (FNP) and false discvery prprtins (FDP) can be written as: FNP = FDP = i=1 I(d i = 0)I(δ i 0) i=1 I(d i = 0) i=1 I(d i = 1)I(δ i = 0) i=1 I(d, (1) i = 1) where I( ) is the indicatr functin. That is, the FNP is the prprtin f genes declared EE that are actually DE, and the FDP is the prprtin f genes declared DE that are actually EE. The Bayesian FNR and Bayesian FDR are defined as the expected FNP and FDP, respectively, where the expectatin is taken with respect t the psterir distributin f δ 1,..., δ n (1). Fr any fixed decisins d 1,..., d n, ne can evaluate the Bayesian FNR and FDR simply as BFNR = BFDR = i=1 I(d i = 0)(1 v i0 ) i=1 I(d i = 0) i=1 I(d i = 1)v i0 n i=1 I(d i = 1), (2) where v i0 = P (δ i = 0 x) is the psterir prbability that gene i is EE. (2) shwed that, in a setup with nly tw hyptheses, the ptimal rule t minimize BFNR subject t BFDR< α is t declare as DE all genes with v i0 belw a certain threshld t, i.e. d i = I(v i0 < t), where t is the minimum value such that BFDR α. Nte that BFNR and BFDR nly take int accunt whether a gene was classified int pattern 0 r nt. Therefre, when minimizing BFNR subject t BFDR α it makes n difference whether a particular gene is assigned t pattern 1 r pattern 2, say. We prpse the
3 bvius: given that a gene is declared DE, we assign it t the pattern with highest psterir prbability, i.e. δ i = I(v i0 < t) argmax k {1,...,H 1} (v ik ). It is straightfrward t see that, fr any fixed BFNR and BFDR, this rule maximizes the expected number f genes crrectly classified int their expressin pattern. 2 Frequentist perating characteristics f the Bayesian prcedure In this sectin we review the algrithm that (3) prpsed t estimate the frequentist FDR fr any given prcedure, i.e. the expected FDP in (1) when the prcedure is applied under repeated sampling. As a first step, ne btains btstrap samples frm the riginal data, in such a way that the sample mean and variance f each gene are rughly preserved and that it represents a sample under the cmplete null hypthesis that n genes are DE. Then ne repeatedly applies the prcedure t each btstrap dataset, btaining an estimate f the number f false psitives, and cmpares that t the number f genes fund in the riginal dataset. Mre specifically, the algrithm is as fllws: Algrithm 1 1. Apply the prcedure t the riginal dataset, and dente the number f genes declared t be DE as P. Dente as X i and S i the sample mean and standard deviatin f the gene expressin measurements fr gene i = 1... n. 2. Fr b = 1... B, d: Cmpute z ij = (x ij X i )/S i, i = 1... n, j = 1... J. Fr each gene, btain a sample f size J with replacement frm the cllectin f all z ij. Dente this sampled values as z (b) ij. Then cmpute x (b) ij = S i z (b) ij X i. Apply the prcedure t find differentially expressed genes t the btstrap dataset. Since all discveries are false psitives, dente the number f genes declared DE as FP b. P B b=1 FP 3. Estimate the frequentist FDR as FDR = ˆπ b 0, where ˆπ B P 0 is an estimate f the prprtin f EE genes.
4 Grup 1 Grup Table 1: Hypthetical expressin values fr a gene, 10 arrays and 2 grups There is an imprtant remark t make here: there are ther ways t simulate data under the null hypthesis. Fr instance, ne culd simply permute the grup labels r btstrap data within each gene, but this culd be trublesme when the sample size is t small t prvide an accurate representatin f the null. Determining what sample size is large enugh is nt a simple matter. Fr example, suppse that we have a gene fr which mst expressin values are small, but fr ne f the grups it ccasinally presents large expressin values. Table 1 10 presents hypthetical values fr a single gene and 2 grups. Nte that there are ( ) 10 5 = 184, 756 ways t permute the grup labels, which at first sight may seem a large enugh number. Hwever, in grup 2 there is an utlying value. Fr any permutatin, whatever grup the utlier is assigned t will tend t be declared t have higher expressin levels than the ther grup, and it will be cunted as a false psitive. If ne btains a btstrap sample, there is sme prbability that the effect f the utlier will be mitigated (the utlier may nt be sampled at all, r be sampled the same number f times in bth grups), but it will still tend t increase t false psitive cunt. If ne btstraps residuals frm ther genes, such as Algrithm 1 des, the value f 10.5 may nt be utlying at all anymre, since ther genes may present values that are als far away frm the mean. Hence, we see that three reasnable strategies t sample under the null may result in three quite different sampling distributins and estimated FDR. The main issue is whether the utlying value shuld be cnsidered t be an errr r nt. In ur experience, in micrarrays it is nt unfrequent t encunter data such as that in Table 1. A pssible bilgical explanatin is that a small prprtin f individuals frm grup 2 experience sme kind f
5 GaGa mdel MiGaGa mdel (M=2) Estimated frequentist FDR 5 arrays 10 arrays 15 arrays All data Estimated frequentist FDR 5 arrays 10 arrays 15 arrays All data Bayesian FDR Bayesian FDR Figure 1: Armstrng dataset. Frequentist FDR fr the GaGa (a) and Mi- GaGa (b) mdels. mutatin, which causes the expressin f a particular gene t raise cnsiderably. If such a discvery is bilgically meaningful ne shuld nt cunt it as a false psitive, and hence methds based n permuting r btstrapping each gene separately wuld nt be apprpriate. We agree with (3) that mre research is needed regarding this tpic, but we feel that unless the number f arrays is quite large it may be beneficial t use a resampling scheme that uses data frm several genes at nce. 3 Frequentist FDR fr the Armstrng dataset We nw estimate the frequentist FDR f the Bayesian prcedure utlined in Sectin 1 by applying Algrithm 1 t the Armstrng dataset. Figure 1(a) displays the estimated frequentist FDR fr target Bayesian FDRs ranging frm 0 t 0.1, bth fr the GaGa and MiGaGa mdels and increasing amunts f data. Fr Bayesian FDR at the 0.05 level, the estimated frequentist FDR is always belw The nly exceptin is the MiGaGa mdel applied t the full dataset, fr which the frequentist FDR is estimated t be 6.4%. Figure 2 prvides the analgus estimates fr the mntnically transfrmed data, which was analyzed with a GaGa mdel. Fr a Bayesian FDR f 0.05, the estimated frequentist FDR is belw 0.05, as desired.
6 (a) (b) GaGa mdel, transfrmed data Estimated frequentist FDR 5 arrays 10 arrays 15 arrays All data Density Observed data GaGa Bayesian FDR Expressin levels (lg scale) Figure 2: Armstrng dataset after mntnic transfrmatin. (a): frequentist FDR ft the GaGa mdel; (b): marginal distributin f data vs. prir predictive f GaGa mdel with ω = ˆω. We cnclude that the Bayesian prcedure frm Sectin 1 has reasnably gd frequentist perating characteristics when applied t the Armstrng dataset. 4 Gdness f fit in the Armstrng dataset In the riginal paper, Sectin 6.2.1, we evaluated sme aspects f the verall gdness-f-fit. Figure 2(b) cmpares the marginal distributin f the mntnically transfrmed data with draws frm the prir-predictive GaGa mdel, setting the hyper-parameters t their psterir mean. The mntnic transfrmatin imprves the fit f the GaGa mdel substantially (cmpare with Figure 4(a) in the riginal paper). We nw assess the fit fr sme genes individually. First, we select the tw genes with the highest prbability f being DE accrding t the Ga mdel. Figure 3(a) cmpares their bserved expressin values with draws frm their psterir predictive distributin based n the Ga mdel. We see that, even thugh the mdel underestimates the variability fr the MLL grup, the tw genes d actually seem t be differentially expressed. Figure 3(b) shws hw draws frm the GaGa mdel psterir predictive mre apprpriately capture
7 (a) (b) Ga mdel GaGa mdel Prbe 37809_at ALL MLL Prbe 37809_at CI CII Prbe 1914_at (c) Ga mdel Prbe 1914_at (d) GaGa mdel Prbe 2087_s_at ALL MLL Prbe 2087_s_at CI CII Prbe 1369_s_at Prbe 1369_s_at Figure 3: Observed expressin values vs. predictive distributin. Large black symbls are actual bservatins, small gray symbls are draws frm the psterir predictive. (a),(b): the tw genes with highest prbability f being DE accrding t the Ga mdel; (c),(d): tw genes declared DE by the Ga mdel and declared EE by the GaGa mdel
8 the variability f the data. Fr these tw genes in particular, hwever, the result f the inference is the same: bth mdels declare prbes 1914 at and at t be differentially expressed. We nw select tw genes that are declared DE by the Ga mdel and EE by the GaGa mdel, and again cmpare the bserved values t their psterir predictive distributin. Figure 3(c) reveals that the Ga mdel underestimates the variability in the data, while in panel (d) we see that GaGa represents it mre satisfactrily. In this case the inference abut the prbes 1369 s at and 2087 s at frm bth mdels is radically different, fr the GaGa mdel assigns a psterir prbability < 0.01 that each gene is differentially expressed. The pr Ga fit t these tw genes and the fact that n strng differences between grups are bserved in Figure 3(c)-(d) suggest that the inference prvided by the GaGa mdel is mre realiable. Althugh nt presented here, we bserve a similar favrable behavir f the MiGaGa mdel with M = 2 cmpnents. We cnclude that the psterir distributin f the GaGa and MiGaGa mdels present an adequate fit t the data. This is in cntrast with the prir-predictive plt in Figure 3(b) in the riginal paper, which suggested the GaGa fit t be f limited quality. This shuld nt be t surprising, it merely reflects that the psterir distributin incrprates the infrmatin abut bimdality present in the data. References [1] P. Müller, G. Parmigiani, and K. Rice. FDR and Bayesian Multiple Cmparisns Rules. Oxfrd University Press, [2] P. Müller, G. Parmigiani, C. Rbert, and J. Russeau. Optimal sample size fr multiple testing: the case f gene expressin micrarrays. Jurnal f the American Statistical Assciatin, 99: , [3] J.D. Strey. The ptimal discvery prcedure: A new apprach t simultaneus significance testing. Jurnal f the Ryal Statistical Sciety B, 69: , 2007.
Bootstrap Method > # Purpose: understand how bootstrap method works > obs=c(11.96, 5.03, 67.40, 16.07, 31.50, 7.73, 11.10, 22.38) > n=length(obs) >
Btstrap Methd > # Purpse: understand hw btstrap methd wrks > bs=c(11.96, 5.03, 67.40, 16.07, 31.50, 7.73, 11.10, 22.38) > n=length(bs) > mean(bs) [1] 21.64625 > # estimate f lambda > lambda = 1/mean(bs);
More informationGaGa: a Parsimonious and Flexible Hierarchical Model for Microarray Data Analysis
GaGa: a Parsimnius and Fleible Hierarchical Mdel fr Micrarray Data Analysis David Rssell Department f Bistatistics M.D. Andersn Cancer Center, Hustn, TX 77030, USA rsselldavid@gmail.cm Abstract Bayesian
More informationHypothesis Tests for One Population Mean
Hypthesis Tests fr One Ppulatin Mean Chapter 9 Ala Abdelbaki Objective Objective: T estimate the value f ne ppulatin mean Inferential statistics using statistics in rder t estimate parameters We will be
More informationCHAPTER 4 DIAGNOSTICS FOR INFLUENTIAL OBSERVATIONS
CHAPTER 4 DIAGNOSTICS FOR INFLUENTIAL OBSERVATIONS 1 Influential bservatins are bservatins whse presence in the data can have a distrting effect n the parameter estimates and pssibly the entire analysis,
More informationCS 477/677 Analysis of Algorithms Fall 2007 Dr. George Bebis Course Project Due Date: 11/29/2007
CS 477/677 Analysis f Algrithms Fall 2007 Dr. Gerge Bebis Curse Prject Due Date: 11/29/2007 Part1: Cmparisn f Srting Algrithms (70% f the prject grade) The bjective f the first part f the assignment is
More informationModelling of Clock Behaviour. Don Percival. Applied Physics Laboratory University of Washington Seattle, Washington, USA
Mdelling f Clck Behaviur Dn Percival Applied Physics Labratry University f Washingtn Seattle, Washingtn, USA verheads and paper fr talk available at http://faculty.washingtn.edu/dbp/talks.html 1 Overview
More informationName: Block: Date: Science 10: The Great Geyser Experiment A controlled experiment
Science 10: The Great Geyser Experiment A cntrlled experiment Yu will prduce a GEYSER by drpping Ments int a bttle f diet pp Sme questins t think abut are: What are yu ging t test? What are yu ging t measure?
More informationLab 1 The Scientific Method
INTRODUCTION The fllwing labratry exercise is designed t give yu, the student, an pprtunity t explre unknwn systems, r universes, and hypthesize pssible rules which may gvern the behavir within them. Scientific
More informationAP Statistics Notes Unit Two: The Normal Distributions
AP Statistics Ntes Unit Tw: The Nrmal Distributins Syllabus Objectives: 1.5 The student will summarize distributins f data measuring the psitin using quartiles, percentiles, and standardized scres (z-scres).
More informationPSU GISPOPSCI June 2011 Ordinary Least Squares & Spatial Linear Regression in GeoDa
There are tw parts t this lab. The first is intended t demnstrate hw t request and interpret the spatial diagnstics f a standard OLS regressin mdel using GeDa. The diagnstics prvide infrmatin abut the
More information, which yields. where z1. and z2
The Gaussian r Nrmal PDF, Page 1 The Gaussian r Nrmal Prbability Density Functin Authr: Jhn M Cimbala, Penn State University Latest revisin: 11 September 13 The Gaussian r Nrmal Prbability Density Functin
More information4th Indian Institute of Astrophysics - PennState Astrostatistics School July, 2013 Vainu Bappu Observatory, Kavalur. Correlation and Regression
4th Indian Institute f Astrphysics - PennState Astrstatistics Schl July, 2013 Vainu Bappu Observatry, Kavalur Crrelatin and Regressin Rahul Ry Indian Statistical Institute, Delhi. Crrelatin Cnsider a tw
More informationTree Structured Classifier
Tree Structured Classifier Reference: Classificatin and Regressin Trees by L. Breiman, J. H. Friedman, R. A. Olshen, and C. J. Stne, Chapman & Hall, 98. A Medical Eample (CART): Predict high risk patients
More informationSimple Linear Regression (single variable)
Simple Linear Regressin (single variable) Intrductin t Machine Learning Marek Petrik January 31, 2017 Sme f the figures in this presentatin are taken frm An Intrductin t Statistical Learning, with applicatins
More informationWe say that y is a linear function of x if. Chapter 13: The Correlation Coefficient and the Regression Line
Chapter 13: The Crrelatin Cefficient and the Regressin Line We begin with a sme useful facts abut straight lines. Recall the x, y crdinate system, as pictured belw. 3 2 1 y = 2.5 y = 0.5x 3 2 1 1 2 3 1
More informationHow do scientists measure trees? What is DBH?
Hw d scientists measure trees? What is DBH? Purpse Students develp an understanding f tree size and hw scientists measure trees. Students bserve and measure tree ckies and explre the relatinship between
More informationMATCHING TECHNIQUES. Technical Track Session VI. Emanuela Galasso. The World Bank
MATCHING TECHNIQUES Technical Track Sessin VI Emanuela Galass The Wrld Bank These slides were develped by Christel Vermeersch and mdified by Emanuela Galass fr the purpse f this wrkshp When can we use
More informationPipetting 101 Developed by BSU CityLab
Discver the Micrbes Within: The Wlbachia Prject Pipetting 101 Develped by BSU CityLab Clr Cmparisns Pipetting Exercise #1 STUDENT OBJECTIVES Students will be able t: Chse the crrect size micrpipette fr
More informationComputational modeling techniques
Cmputatinal mdeling techniques Lecture 4: Mdel checing fr ODE mdels In Petre Department f IT, Åb Aademi http://www.users.ab.fi/ipetre/cmpmd/ Cntent Stichimetric matrix Calculating the mass cnservatin relatins
More informationComparing Several Means: ANOVA. Group Means and Grand Mean
STAT 511 ANOVA and Regressin 1 Cmparing Several Means: ANOVA Slide 1 Blue Lake snap beans were grwn in 12 pen-tp chambers which are subject t 4 treatments 3 each with O 3 and SO 2 present/absent. The ttal
More informationChapter Summary. Mathematical Induction Strong Induction Recursive Definitions Structural Induction Recursive Algorithms
Chapter 5 1 Chapter Summary Mathematical Inductin Strng Inductin Recursive Definitins Structural Inductin Recursive Algrithms Sectin 5.1 3 Sectin Summary Mathematical Inductin Examples f Prf by Mathematical
More informationSequential Allocation with Minimal Switching
In Cmputing Science and Statistics 28 (1996), pp. 567 572 Sequential Allcatin with Minimal Switching Quentin F. Stut 1 Janis Hardwick 1 EECS Dept., University f Michigan Statistics Dept., Purdue University
More informationBiplots in Practice MICHAEL GREENACRE. Professor of Statistics at the Pompeu Fabra University. Chapter 13 Offprint
Biplts in Practice MICHAEL GREENACRE Prfessr f Statistics at the Pmpeu Fabra University Chapter 13 Offprint CASE STUDY BIOMEDICINE Cmparing Cancer Types Accrding t Gene Epressin Arrays First published:
More informationKinematic transformation of mechanical behavior Neville Hogan
inematic transfrmatin f mechanical behavir Neville Hgan Generalized crdinates are fundamental If we assume that a linkage may accurately be described as a cllectin f linked rigid bdies, their generalized
More informationResampling Methods. Chapter 5. Chapter 5 1 / 52
Resampling Methds Chapter 5 Chapter 5 1 / 52 1 51 Validatin set apprach 2 52 Crss validatin 3 53 Btstrap Chapter 5 2 / 52 Abut Resampling An imprtant statistical tl Pretending the data as ppulatin and
More informationA Matrix Representation of Panel Data
web Extensin 6 Appendix 6.A A Matrix Representatin f Panel Data Panel data mdels cme in tw brad varieties, distinct intercept DGPs and errr cmpnent DGPs. his appendix presents matrix algebra representatins
More informationCAUSAL INFERENCE. Technical Track Session I. Phillippe Leite. The World Bank
CAUSAL INFERENCE Technical Track Sessin I Phillippe Leite The Wrld Bank These slides were develped by Christel Vermeersch and mdified by Phillippe Leite fr the purpse f this wrkshp Plicy questins are causal
More informationA NOTE ON BAYESIAN ANALYSIS OF THE. University of Oxford. and. A. C. Davison. Swiss Federal Institute of Technology. March 10, 1998.
A NOTE ON BAYESIAN ANALYSIS OF THE POLY-WEIBULL MODEL F. Luzada-Net University f Oxfrd and A. C. Davisn Swiss Federal Institute f Technlgy March 10, 1998 Summary We cnsider apprximate Bayesian analysis
More informationLecture 2: Supervised vs. unsupervised learning, bias-variance tradeoff
Lecture 2: Supervised vs. unsupervised learning, bias-variance tradeff Reading: Chapter 2 STATS 202: Data mining and analysis September 27, 2017 1 / 20 Supervised vs. unsupervised learning In unsupervised
More informationMODULE FOUR. This module addresses functions. SC Academic Elementary Algebra Standards:
MODULE FOUR This mdule addresses functins SC Academic Standards: EA-3.1 Classify a relatinship as being either a functin r nt a functin when given data as a table, set f rdered pairs, r graph. EA-3.2 Use
More informationECEN 4872/5827 Lecture Notes
ECEN 4872/5827 Lecture Ntes Lecture #5 Objectives fr lecture #5: 1. Analysis f precisin current reference 2. Appraches fr evaluating tlerances 3. Temperature Cefficients evaluatin technique 4. Fundamentals
More informationDifferentiation Applications 1: Related Rates
Differentiatin Applicatins 1: Related Rates 151 Differentiatin Applicatins 1: Related Rates Mdel 1: Sliding Ladder 10 ladder y 10 ladder 10 ladder A 10 ft ladder is leaning against a wall when the bttm
More informationUNIV1"'RSITY OF NORTH CAROLINA Department of Statistics Chapel Hill, N. C. CUMULATIVE SUM CONTROL CHARTS FOR THE FOLDED NORMAL DISTRIBUTION
UNIV1"'RSITY OF NORTH CAROLINA Department f Statistics Chapel Hill, N. C. CUMULATIVE SUM CONTROL CHARTS FOR THE FOLDED NORMAL DISTRIBUTION by N. L. Jlmsn December 1962 Grant N. AFOSR -62..148 Methds f
More informationLecture 2: Supervised vs. unsupervised learning, bias-variance tradeoff
Lecture 2: Supervised vs. unsupervised learning, bias-variance tradeff Reading: Chapter 2 STATS 202: Data mining and analysis September 27, 2017 1 / 20 Supervised vs. unsupervised learning In unsupervised
More informationmaking triangle (ie same reference angle) ). This is a standard form that will allow us all to have the X= y=
Intrductin t Vectrs I 21 Intrductin t Vectrs I 22 I. Determine the hrizntal and vertical cmpnents f the resultant vectr by cunting n the grid. X= y= J. Draw a mangle with hrizntal and vertical cmpnents
More informationMATCHING TECHNIQUES Technical Track Session VI Céline Ferré The World Bank
MATCHING TECHNIQUES Technical Track Sessin VI Céline Ferré The Wrld Bank When can we use matching? What if the assignment t the treatment is nt dne randmly r based n an eligibility index, but n the basis
More informationVerification of Quality Parameters of a Solar Panel and Modification in Formulae of its Series Resistance
Verificatin f Quality Parameters f a Slar Panel and Mdificatin in Frmulae f its Series Resistance Sanika Gawhane Pune-411037-India Onkar Hule Pune-411037- India Chinmy Kulkarni Pune-411037-India Ojas Pandav
More informationOn Huntsberger Type Shrinkage Estimator for the Mean of Normal Distribution ABSTRACT INTRODUCTION
Malaysian Jurnal f Mathematical Sciences 4(): 7-4 () On Huntsberger Type Shrinkage Estimatr fr the Mean f Nrmal Distributin Department f Mathematical and Physical Sciences, University f Nizwa, Sultanate
More informationPattern Recognition 2014 Support Vector Machines
Pattern Recgnitin 2014 Supprt Vectr Machines Ad Feelders Universiteit Utrecht Ad Feelders ( Universiteit Utrecht ) Pattern Recgnitin 1 / 55 Overview 1 Separable Case 2 Kernel Functins 3 Allwing Errrs (Sft
More informationx 1 Outline IAML: Logistic Regression Decision Boundaries Example Data
Outline IAML: Lgistic Regressin Charles Suttn and Victr Lavrenk Schl f Infrmatics Semester Lgistic functin Lgistic regressin Learning lgistic regressin Optimizatin The pwer f nn-linear basis functins Least-squares
More informationWhat is Statistical Learning?
What is Statistical Learning? Sales 5 10 15 20 25 Sales 5 10 15 20 25 Sales 5 10 15 20 25 0 50 100 200 300 TV 0 10 20 30 40 50 Radi 0 20 40 60 80 100 Newspaper Shwn are Sales vs TV, Radi and Newspaper,
More informationInternal vs. external validity. External validity. This section is based on Stock and Watson s Chapter 9.
Sectin 7 Mdel Assessment This sectin is based n Stck and Watsn s Chapter 9. Internal vs. external validity Internal validity refers t whether the analysis is valid fr the ppulatin and sample being studied.
More informationPart 3 Introduction to statistical classification techniques
Part 3 Intrductin t statistical classificatin techniques Machine Learning, Part 3, March 07 Fabi Rli Preamble ØIn Part we have seen that if we knw: Psterir prbabilities P(ω i / ) Or the equivalent terms
More informationSections 15.1 to 15.12, 16.1 and 16.2 of the textbook (Robbins-Miller) cover the materials required for this topic.
Tpic : AC Fundamentals, Sinusidal Wavefrm, and Phasrs Sectins 5. t 5., 6. and 6. f the textbk (Rbbins-Miller) cver the materials required fr this tpic.. Wavefrms in electrical systems are current r vltage
More informationSIZE BIAS IN LINE TRANSECT SAMPLING: A FIELD TEST. Mark C. Otto Statistics Research Division, Bureau of the Census Washington, D.C , U.S.A.
SIZE BIAS IN LINE TRANSECT SAMPLING: A FIELD TEST Mark C. Ott Statistics Research Divisin, Bureau f the Census Washingtn, D.C. 20233, U.S.A. and Kenneth H. Pllck Department f Statistics, Nrth Carlina State
More informationLead/Lag Compensator Frequency Domain Properties and Design Methods
Lectures 6 and 7 Lead/Lag Cmpensatr Frequency Dmain Prperties and Design Methds Definitin Cnsider the cmpensatr (ie cntrller Fr, it is called a lag cmpensatr s K Fr s, it is called a lead cmpensatr Ntatin
More informationPhysics 2B Chapter 23 Notes - Faraday s Law & Inductors Spring 2018
Michael Faraday lived in the Lndn area frm 1791 t 1867. He was 29 years ld when Hand Oersted, in 1820, accidentally discvered that electric current creates magnetic field. Thrugh empirical bservatin and
More informationNUMBERS, MATHEMATICS AND EQUATIONS
AUSTRALIAN CURRICULUM PHYSICS GETTING STARTED WITH PHYSICS NUMBERS, MATHEMATICS AND EQUATIONS An integral part t the understanding f ur physical wrld is the use f mathematical mdels which can be used t
More informationREADING STATECHART DIAGRAMS
READING STATECHART DIAGRAMS Figure 4.48 A Statechart diagram with events The diagram in Figure 4.48 shws all states that the bject plane can be in during the curse f its life. Furthermre, it shws the pssible
More informationPerfrmance f Sensitizing Rules n Shewhart Cntrl Charts with Autcrrelated Data Key Wrds: Autregressive, Mving Average, Runs Tests, Shewhart Cntrl Chart
Perfrmance f Sensitizing Rules n Shewhart Cntrl Charts with Autcrrelated Data Sandy D. Balkin Dennis K. J. Lin y Pennsylvania State University, University Park, PA 16802 Sandy Balkin is a graduate student
More informationDistributions, spatial statistics and a Bayesian perspective
Distributins, spatial statistics and a Bayesian perspective Dug Nychka Natinal Center fr Atmspheric Research Distributins and densities Cnditinal distributins and Bayes Thm Bivariate nrmal Spatial statistics
More informationActivity Guide Loops and Random Numbers
Unit 3 Lessn 7 Name(s) Perid Date Activity Guide Lps and Randm Numbers CS Cntent Lps are a relatively straightfrward idea in prgramming - yu want a certain chunk f cde t run repeatedly - but it takes a
More informationA New Evaluation Measure. J. Joiner and L. Werner. The problems of evaluation and the needed criteria of evaluation
III-l III. A New Evaluatin Measure J. Jiner and L. Werner Abstract The prblems f evaluatin and the needed criteria f evaluatin measures in the SMART system f infrmatin retrieval are reviewed and discussed.
More informationLecture 17: Free Energy of Multi-phase Solutions at Equilibrium
Lecture 17: 11.07.05 Free Energy f Multi-phase Slutins at Equilibrium Tday: LAST TIME...2 FREE ENERGY DIAGRAMS OF MULTI-PHASE SOLUTIONS 1...3 The cmmn tangent cnstructin and the lever rule...3 Practical
More informationCOMP 551 Applied Machine Learning Lecture 11: Support Vector Machines
COMP 551 Applied Machine Learning Lecture 11: Supprt Vectr Machines Instructr: (jpineau@cs.mcgill.ca) Class web page: www.cs.mcgill.ca/~jpineau/cmp551 Unless therwise nted, all material psted fr this curse
More informationLecture 13: Markov Chain Monte Carlo. Gibbs sampling
Lecture 13: Markv hain Mnte arl Gibbs sampling Gibbs sampling Markv chains 1 Recall: Apprximate inference using samples Main idea: we generate samples frm ur Bayes net, then cmpute prbabilities using (weighted)
More informationCHAPTER 24: INFERENCE IN REGRESSION. Chapter 24: Make inferences about the population from which the sample data came.
MATH 1342 Ch. 24 April 25 and 27, 2013 Page 1 f 5 CHAPTER 24: INFERENCE IN REGRESSION Chapters 4 and 5: Relatinships between tw quantitative variables. Be able t Make a graph (scatterplt) Summarize the
More informationENG2410 Digital Design Sequential Circuits: Part B
ENG24 Digital Design Sequential Circuits: Part B Fall 27 S. Areibi Schl f Engineering University f Guelph Analysis f Sequential Circuits Earlier we learned hw t analyze cmbinatinal circuits We will extend
More informationResampling Methods. Cross-validation, Bootstrapping. Marek Petrik 2/21/2017
Resampling Methds Crss-validatin, Btstrapping Marek Petrik 2/21/2017 Sme f the figures in this presentatin are taken frm An Intrductin t Statistical Learning, with applicatins in R (Springer, 2013) with
More informationChapter 3: Cluster Analysis
Chapter 3: Cluster Analysis } 3.1 Basic Cncepts f Clustering 3.1.1 Cluster Analysis 3.1. Clustering Categries } 3. Partitining Methds 3..1 The principle 3.. K-Means Methd 3..3 K-Medids Methd 3..4 CLARA
More informationInference in the Multiple-Regression
Sectin 5 Mdel Inference in the Multiple-Regressin Kinds f hypthesis tests in a multiple regressin There are several distinct kinds f hypthesis tests we can run in a multiple regressin. Suppse that amng
More informationHow T o Start A n Objective Evaluation O f Your Training Program
J O U R N A L Hw T Start A n Objective Evaluatin O f Yur Training Prgram DONALD L. KIRKPATRICK, Ph.D. Assistant Prfessr, Industrial Management Institute University f Wiscnsin Mst training m e n agree that
More informationIntroduction to Quantitative Genetics II: Resemblance Between Relatives
Intrductin t Quantitative Genetics II: Resemblance Between Relatives Bruce Walsh 8 Nvember 006 EEB 600A The heritability f a trait, a central cncept in quantitative genetics, is the prprtin f variatin
More informationPhysics 2010 Motion with Constant Acceleration Experiment 1
. Physics 00 Mtin with Cnstant Acceleratin Experiment In this lab, we will study the mtin f a glider as it accelerates dwnhill n a tilted air track. The glider is supprted ver the air track by a cushin
More informationLesson Plan. Recode: They will do a graphic organizer to sequence the steps of scientific method.
Lessn Plan Reach: Ask the students if they ever ppped a bag f micrwave ppcrn and nticed hw many kernels were unppped at the bttm f the bag which made yu wnder if ther brands pp better than the ne yu are
More informationCESAR Science Case The differential rotation of the Sun and its Chromosphere. Introduction. Material that is necessary during the laboratory
Teacher s guide CESAR Science Case The differential rtatin f the Sun and its Chrmsphere Material that is necessary during the labratry CESAR Astrnmical wrd list CESAR Bklet CESAR Frmula sheet CESAR Student
More informationLCAO APPROXIMATIONS OF ORGANIC Pi MO SYSTEMS The allyl system (cation, anion or radical).
Principles f Organic Chemistry lecture 5, page LCAO APPROIMATIONS OF ORGANIC Pi MO SYSTEMS The allyl system (catin, anin r radical).. Draw mlecule and set up determinant. 2 3 0 3 C C 2 = 0 C 2 3 0 = -
More information5 th grade Common Core Standards
5 th grade Cmmn Cre Standards In Grade 5, instructinal time shuld fcus n three critical areas: (1) develping fluency with additin and subtractin f fractins, and develping understanding f the multiplicatin
More informationCHM112 Lab Graphing with Excel Grading Rubric
Name CHM112 Lab Graphing with Excel Grading Rubric Criteria Pints pssible Pints earned Graphs crrectly pltted and adhere t all guidelines (including descriptive title, prperly frmatted axes, trendline
More informationExperiment #3. Graphing with Excel
Experiment #3. Graphing with Excel Study the "Graphing with Excel" instructins that have been prvided. Additinal help with learning t use Excel can be fund n several web sites, including http://www.ncsu.edu/labwrite/res/gt/gt-
More informationRevision: August 19, E Main Suite D Pullman, WA (509) Voice and Fax
.7.4: Direct frequency dmain circuit analysis Revisin: August 9, 00 5 E Main Suite D Pullman, WA 9963 (509) 334 6306 ice and Fax Overview n chapter.7., we determined the steadystate respnse f electrical
More informationIf (IV) is (increased, decreased, changed), then (DV) will (increase, decrease, change) because (reason based on prior research).
Science Fair Prject Set Up Instructins 1) Hypthesis Statement 2) Materials List 3) Prcedures 4) Safety Instructins 5) Data Table 1) Hw t write a HYPOTHESIS STATEMENT Use the fllwing frmat: If (IV) is (increased,
More informationChecking the resolved resonance region in EXFOR database
Checking the reslved resnance regin in EXFOR database Gttfried Bertn Sciété de Calcul Mathématique (SCM) Oscar Cabells OECD/NEA Data Bank JEFF Meetings - Sessin JEFF Experiments Nvember 0-4, 017 Bulgne-Billancurt,
More informationParticle Size Distributions from SANS Data Using the Maximum Entropy Method. By J. A. POTTON, G. J. DANIELL AND B. D. RAINFORD
3 J. Appl. Cryst. (1988). 21,3-8 Particle Size Distributins frm SANS Data Using the Maximum Entrpy Methd By J. A. PTTN, G. J. DANIELL AND B. D. RAINFRD Physics Department, The University, Suthamptn S9
More informationLHS Mathematics Department Honors Pre-Calculus Final Exam 2002 Answers
LHS Mathematics Department Hnrs Pre-alculus Final Eam nswers Part Shrt Prblems The table at the right gives the ppulatin f Massachusetts ver the past several decades Using an epnential mdel, predict the
More informationSAMPLING DYNAMICAL SYSTEMS
SAMPLING DYNAMICAL SYSTEMS Melvin J. Hinich Applied Research Labratries The University f Texas at Austin Austin, TX 78713-8029, USA (512) 835-3278 (Vice) 835-3259 (Fax) hinich@mail.la.utexas.edu ABSTRACT
More informationarxiv:hep-ph/ v1 2 Jun 1995
WIS-95//May-PH The rati F n /F p frm the analysis f data using a new scaling variable S. A. Gurvitz arxiv:hep-ph/95063v1 Jun 1995 Department f Particle Physics, Weizmann Institute f Science, Rehvt 76100,
More informationNUROP CONGRESS PAPER CHINESE PINYIN TO CHINESE CHARACTER CONVERSION
NUROP Chinese Pinyin T Chinese Character Cnversin NUROP CONGRESS PAPER CHINESE PINYIN TO CHINESE CHARACTER CONVERSION CHIA LI SHI 1 AND LUA KIM TENG 2 Schl f Cmputing, Natinal University f Singapre 3 Science
More informationCLASS. Fractions and Angles. Teacher Report. No. of test takers: 25. School Name: EI School. City: Ahmedabad CLASS 6 B 8709
SEPTEMBER 07 Math Fractins and Angles CLASS 6 Teacher Reprt Test Taken 4 5 6 7 8 Schl Name: EI Schl City: Ahmedabad CLASS SECTION EXAM CODE 6 B 8709 N. f test takers: 5 6.5 Average.5 9.0 Range (Scres are
More informationCOMP 551 Applied Machine Learning Lecture 5: Generative models for linear classification
COMP 551 Applied Machine Learning Lecture 5: Generative mdels fr linear classificatin Instructr: Herke van Hf (herke.vanhf@mail.mcgill.ca) Slides mstly by: Jelle Pineau Class web page: www.cs.mcgill.ca/~hvanh2/cmp551
More informationPhys. 344 Ch 7 Lecture 8 Fri., April. 10 th,
Phys. 344 Ch 7 Lecture 8 Fri., April. 0 th, 009 Fri. 4/0 8. Ising Mdel f Ferrmagnets HW30 66, 74 Mn. 4/3 Review Sat. 4/8 3pm Exam 3 HW Mnday: Review fr est 3. See n-line practice test lecture-prep is t
More informationApplication of ILIUM to the estimation of the T eff [Fe/H] pair from BP/RP
Applicatin f ILIUM t the estimatin f the T eff [Fe/H] pair frm BP/RP prepared by: apprved by: reference: issue: 1 revisin: 1 date: 2009-02-10 status: Issued Cryn A.L. Bailer-Jnes Max Planck Institute fr
More informationLecture 24: Flory-Huggins Theory
Lecture 24: 12.07.05 Flry-Huggins Thery Tday: LAST TIME...2 Lattice Mdels f Slutins...2 ENTROPY OF MIXING IN THE FLORY-HUGGINS MODEL...3 CONFIGURATIONS OF A SINGLE CHAIN...3 COUNTING CONFIGURATIONS FOR
More informationUnit Project Descriptio
Unit Prject Descriptin: Using Newtn s Laws f Mtin and the scientific methd, create a catapult r trebuchet that will sht a marshmallw at least eight feet. After building and testing yur machine at hme,
More informationAdmissibility Conditions and Asymptotic Behavior of Strongly Regular Graphs
Admissibility Cnditins and Asympttic Behavir f Strngly Regular Graphs VASCO MOÇO MANO Department f Mathematics University f Prt Oprt PORTUGAL vascmcman@gmailcm LUÍS ANTÓNIO DE ALMEIDA VIEIRA Department
More informationBOUNDED UNCERTAINTY AND CLIMATE CHANGE ECONOMICS. Christopher Costello, Andrew Solow, Michael Neubert, and Stephen Polasky
BOUNDED UNCERTAINTY AND CLIMATE CHANGE ECONOMICS Christpher Cstell, Andrew Slw, Michael Neubert, and Stephen Plasky Intrductin The central questin in the ecnmic analysis f climate change plicy cncerns
More informationThis section is primarily focused on tools to aid us in finding roots/zeros/ -intercepts of polynomials. Essentially, our focus turns to solving.
Sectin 3.2: Many f yu WILL need t watch the crrespnding vides fr this sectin n MyOpenMath! This sectin is primarily fcused n tls t aid us in finding rts/zers/ -intercepts f plynmials. Essentially, ur fcus
More informationRevisiting the Socrates Example
Sectin 1.6 Sectin Summary Valid Arguments Inference Rules fr Prpsitinal Lgic Using Rules f Inference t Build Arguments Rules f Inference fr Quantified Statements Building Arguments fr Quantified Statements
More informationCSE4334/5334 Data Mining Association Rule Mining. Chengkai Li University of Texas at Arlington Fall 2017
CSE4334/5334 Data Mining Assciatin Rule Mining Chengkai Li University f Texas at Arlingtn Fall 27 Assciatin Rule Mining Given a set f transactins, find rules that will predict the ccurrence f an item based
More informationTHE LIFE OF AN OBJECT IT SYSTEMS
THE LIFE OF AN OBJECT IT SYSTEMS Persns, bjects, r cncepts frm the real wrld, which we mdel as bjects in the IT system, have "lives". Actually, they have tw lives; the riginal in the real wrld has a life,
More informationCHAPTER 3 INEQUALITIES. Copyright -The Institute of Chartered Accountants of India
CHAPTER 3 INEQUALITIES Cpyright -The Institute f Chartered Accuntants f India INEQUALITIES LEARNING OBJECTIVES One f the widely used decisin making prblems, nwadays, is t decide n the ptimal mix f scarce
More informationPerformance Bounds for Detect and Avoid Signal Sensing
Perfrmance unds fr Detect and Avid Signal Sensing Sam Reisenfeld Real-ime Infrmatin etwrks, University f echnlgy, Sydney, radway, SW 007, Australia samr@uts.edu.au Abstract Detect and Avid (DAA) is a Cgnitive
More informationInverse Document Frequency (IDF): A Measure of Deviations from Poisson
Inverse Dcument Frequency (IDF): A Measure f Deviatins frm Pissn Kenneth W. Church William A. Gale AT&T Bell Labratries Murray Hill, NJ, USA 07974 kwc@research.att.cm Abstract Lw frequency wrds tend t
More informationSubject description processes
Subject representatin 6.1.2. Subject descriptin prcesses Overview Fur majr prcesses r areas f practice fr representing subjects are classificatin, subject catalging, indexing, and abstracting. The prcesses
More informationBASD HIGH SCHOOL FORMAL LAB REPORT
BASD HIGH SCHOOL FORMAL LAB REPORT *WARNING: After an explanatin f what t include in each sectin, there is an example f hw the sectin might lk using a sample experiment Keep in mind, the sample lab used
More informationMaximum A Posteriori (MAP) CS 109 Lecture 22 May 16th, 2016
Maximum A Psteriri (MAP) CS 109 Lecture 22 May 16th, 2016 Previusly in CS109 Game f Estimatrs Maximum Likelihd Nn spiler: this didn t happen Side Plt argmax argmax f lg Mther f ptimizatins? Reviving an
More informationChapter 13: The Correlation Coefficient and the Regression Line. We begin with a some useful facts about straight lines.
Chapter 13: The Crrelatin Cefficient and the Regressin Line We begin with a sme useful facts abut straight lines. Recall the x, y crdinate system, as pictured belw. 3 2 y = 2.5 1 y = 0.5x 3 2 1 1 2 3 1
More informationALE 21. Gibbs Free Energy. At what temperature does the spontaneity of a reaction change?
Name Chem 163 Sectin: Team Number: ALE 21. Gibbs Free Energy (Reference: 20.3 Silberberg 5 th editin) At what temperature des the spntaneity f a reactin change? The Mdel: The Definitin f Free Energy S
More informationEquilibrium of Stress
Equilibrium f Stress Cnsider tw perpendicular planes passing thrugh a pint p. The stress cmpnents acting n these planes are as shwn in ig. 3.4.1a. These stresses are usuall shwn tgether acting n a small
More informationDetermining the Accuracy of Modal Parameter Estimation Methods
Determining the Accuracy f Mdal Parameter Estimatin Methds by Michael Lee Ph.D., P.E. & Mar Richardsn Ph.D. Structural Measurement Systems Milpitas, CA Abstract The mst cmmn type f mdal testing system
More information