Advanced Differential Expression Analysis

Size: px
Start display at page:

Download "Advanced Differential Expression Analysis"

Transcription

1 Ths work s lcensed under a Creatve Commons Attrbuton-NonCommercal-ShareAlke Lcense. Your use of ths materal consttutes accetance of that lcense and the condtons of use of materals on ths ste. Coyrght 2006, The Johns Hokns Unversty and Rafael A. Irzarry. All rghts reserved. Use of these materals ermtted only n accordance wth lcense rghts granted. Materals rovded AS IS ; no reresentatons or warrantes rovded. User assumes all resonsblty for use, and all lablty related thereto, and must ndeendently revew all materals for accuracy and effcacy. May contan materals owned by others. User s resonsble for obtanng ermssons for use from thrd artes as needed.

2 BIOINFORMATICS AND COMPUTATIONAL BIOLOGY SOLUTIONS USING R AND BIOCONDUCTOR Bostatstcs Rafael A. Irzarry Advanced Dfferental Exresson Analyss Outlne Revew of the basc deas Introducton to (Emrcal Bayesan Statstcs The multle comarson roblem SAM 1

3 Quantfyng Dfferentally Exresson Two questons Can we order genes by nterest? One goal s to assgn a one number summary and consder large values nterestng. We wll refer to ths number as a score How nterestng are the most nterestng genes? How do ther scores comare to the those of genes known not to be nterestng? Examle Consder a case were we have observed two genes wth fold changes of 2 Is ths worth reortng? Are they both as nterestng? Some journals requre statstcal sgnfcance. What does ths mean? * * 2

4 Reeated Exerment Reeated Exerment Revew of Statstcal Inference Let Y-X be our measurement reresentng dfferental exresson. What s the tycal null hyothess? P-value s Prob(Y-X as extreme under null and s a way to summarze how nterestng a gene s. Poular assumton: Under the null,y-x follows a normal dstrbuton wth mean 0 and standard devaton σ. Wthout σ we do not know the -value. We can estmate σ by takng a samle and usng the samle standard devaton s. Note: Dfferent genes have dfferent σ, 3

5 Samle Summares Observatons: X 1,K, X M Y 1,K,Y N Averages: SD 2 or varances: M X = 1 M M " =1 X Y = 1 N 1 s 2 X = #(X " X 2 s 2 M "1 Y = 1 N "1 =1 N # =1 N Y " =1 (Y "Y 2 The t-statstc t - statstc: Y " X 2 2 s Y N + s X M Proertes of t-statstc If the number of relcates s very large the t- statstc s normally dstrbuted wth mean 0 and and SD of 1 If the observed data,.e. Y-X, are normally dstrbuted then the t-statstc follows a t dstrbuton regardless of samle sze Wth one of these two we can comute - values wth one R command 4

6 Data Show Problems Data Show Problems Data Show Problems 5

7 Problems Problem 1: T-statstc bgger for genes wth smaller standard errors estmates Imlcaton: Rankng mght not be otmal Problem 2: T-statstc not t-dstrbuted. Imlcaton: -values/nference ncorrect Problem 1 Wth few relcates SD estmates are unstable Emrcal Bayes methodology and Sten estmators rovdes a statstcally rgorous way of mrovng ths estmate SAM, a more ad-hoc rocedure, works well n ractce Note: We won t talk about Sten estmators. See a aer by Gary Churchll for detals Problem 2 Even f we use a arametrc model to mrove standard error estmates, the assumtons mght not be good enough to rovde trust-worthy -values We wll descrbe non-arametrc aroaches for obtanng -values Note: We stll haven t dscussed the multle comarson roblem. That comes later. 6

8 Introducton to Emrcal Bayes Outlne General Introducton Models for relatve exresson Models for absolute exresson 7

9 8

10 Borrowng Strength An advantage of havng tens of thousands of genes s that we can try to learn about tycal standard devatons by lookng at all genes Emrcal Bayes gves us a formal way of dong ths Modelng Relatve Exresson Courtesy of Gordon Smyth Herarchcal Model Normal Model Pror Rearametrzaton of Lönnstedt and Seed 2002 Normalty, ndeendence assumtons are wrong but convenent, resultng methods are useful 9

11 Posteror Statstcs Posteror varance estmators Moderated t-statstcs Elmnates large t-statstcs merely from very small s Margnal Dstrbutons The margnal dstrbutons of the samle varances and moderated t-statstcs are mutually ndeendent Degrees of freedom add Shrnkage of Standard Devatons The data decdes whether should be closer to t g,ooled or to t g 10

12 Posteror Odds Posteror robablty of dfferental exresson for any gene s Monotonc functon of for constant d Rearametrzaton of Lönnstedt and Seed 2002 Modelng the Absolute Exresson Courtesy of Chrstna Kendzorsk Herarchcal Model for Exresson Data (One condton y µ f µ ( g1, k g ~ 1 g1 y µ ( g2, k µ g ~ f 2 g2 y g µ ( g3, k µ ~ f g 3 3 µ g 1 µ µ g g 2 3 µ g ~ f (" µ g1 µ g2 µ g3 11

13 Herarchcal Model for Exresson Data (Two condtons Let x = [ x c1,x c 2 ] denote data (one gene n condtons C1 and C2. Two atterns of exresson: # P0 (EE : P1 (DE: ( f µ µ c1 = µ c 2 µ c1 " µ c 2 For P0, x ~ f x µ ( dµ " f 0 ( x For P1, x ~ " f ( x µ c1,µ c2 f ( µ c1,µ c2 dµ c1 dµ c 2 " # f ( x c1 µ c1 f ( µ c1 dµ c1 # f ( x c2 µ c 2 f ( µ c 2 dµ c 2 " f 1 ( x ( f 0 ( x c 2 f 0 x c1 Herarchcal Mxture Model for Exresson Data Two condtons: x ~ 0 f 0 ( x + 1 f 1 x Multle condtons: K ( => ( P1 x = x ~ " k f k ( x => Pk' x k=1 Parameter estmates va EM 1 f ( x P1 ( + 1 f ( x P1 0 f x P0 ( = k' f ( x Pk' # k f ( x Pk Bayes rule determnes threshold here; could target secfc FDR. k"k' For every transcrt, two condtons => two atterns (DE, EE EE: m 1 = m 2 DE: m 1 m 2 ( DE y ( EE y P f odds g = = P f ( yg DE P( DE ( y EE P( EE Emrcal Bayes methods make use all of the data to make gene secfc nferences. g 12

14 Odds lot: SCD knockout vs. SV129 (Atte lab EBarrays: Contour Plots of Odds Comments on Emrcal Bayes Aroach(EBarrays Herarchcal model s used to estmate osteror robabltes of atterns of exresson. The model accounts for the measurement error rocess and for fluctuatons n absolute exresson levels. Multle condtons are handled n the same way as two condtons (no extra work requred. Posteror robabltes of exresson atterns are calculated for every transcrt. Threshold can be adjusted to target a secfc FDR. In Boconductor 13

15 Emrcal Bayes for Mcroarrays (EBarrays On Dfferental Varablty of Exresson Ratos: Imrovng Statstcal Inference About Gene Exresson Changes from Mcroarray Data by M.A. Newton, C.M. Kendzorsk, C.S. Rchmond, F.R. Blattner, and K.W. Tsu Journal of Comutatonal Bology 8: 37-52, On Parametrc Emrcal Bayes Methods for Comarng Multle Grous Usng Relcated Gene Exresson Profles by C.M. Kendzorsk, M.A. Newton, H. Lan and M.N. Gould Statstcs n Medcne, to aear, Inference and the Multle Comarson Problem Many sldes courtesy of John Storey Hyothess testng Once you have a gven score for each gene, how do you decde on a cut-off? -values are oular. But how do we decde on a cut-off? Are 0.05 and 0.01 arorate? Are the -values correct? 14

16 P-values by ermutaton It s common for the assumtons used to derve the statstcs used to summarze nterest are not aroxmate enough to yeld useful -values An alternatve s to use ermutatons -values by ermutatons We focus on one gene only. For the bth teraton, b = 1,, B; 1. Permute the n data onts for the gene (x. The frst n 1 are referred to as treatments, the second n 2 as controls. 2. For each gene, calculate the corresondng two samle t-statstc, t b. After all the B ermutatons are done; 3. Put = #{b: t b t observed }/B ( lower f we use >. Multle Comarson Problem If we do have useful aroxmatons of our -values, we stll face the multle comarson roblem When erformng many ndeendent tests -values no longer have the same nterretaton 15

17 Hyothess Testng Test for each gene null hyothess: no dfferental exresson. Two tyes of errors can be commtted Tye I error or false ostve (say that a gene s dfferentally exressed when t s not,.e., reject a true null hyothess. Tye II error or false negatve (fal to dentfy a truly dfferentally exressed gene,.e.,fal to reject a false null hyothess Multle Hyothess Testng What haens f we call all genes sgnfcant wth -values 0.05, for examle? Null True Called Sgnfcant V Not Called Sgnfcant m 0 V Total m 0 Altern.True S m 1 S m 1 Total R m R m Null = Equvalent Exresson; Alternatve = Dfferental Exresson Other ways of thnkng of P-values A -value s defned to be the mnmum false ostve rate at whch an observed statstc can be called sgnfcant If the null hyothess s smle, then a null -value s unformly dstrbuted 16

18 Multle Hyothess Test Error Controllng Procedure Suose m hyotheses are tested wth - values 1, 2,, m A multle hyothess error controllng rocedure s a functon T(; α such that rejectng all nulls wth T(; α mles that Error α Error s a oulaton quantty (not random Weak and Strong Control If T(; α s such Error α only when m 0 = m, then the rocedure rovdes weak control of the error measure If T(; α s such Error α for any value of m 0, then the rocedure rovdes strong control of the error measure note that m 0 s not an argument of T(; α Error Rates Per comarson error rate (PCER: the exected value of the number of Tye I errors over the number of hyotheses PCER = E(V/m Per famly error rate (PFER: the exected number of Tye I errors PFER = E(V Famly-wse error rate: the robablty of at least one Tye I error FEWR = Pr(V 1 False dscovery rate (FDR rate that false dscoveres occur FDR = E(V/R; R>0 = E(V/R R>0Pr(R>0 Postve false dscovery rate (FDR: rate that dscoveres are false FDR = E(V/R R>0. 17

19 18 Bonferron Bonferron Procedure Procedure m m H m H m V m T m C " = # $ % & ' ( # $ % & ' ( * +, -. / 0 = 1 = Pr mn Pr 1 Pr( : max ; ( Provdes strong control.. Sdak Sdak Procedure Procedure { } ( ( = " " > " = " " # # $ " " # = % = (1 1 Pr 1 (1 1 mn Pr 1 Pr( (1 1 : max ; ( 1 0 1/ 0 1/ 1/ m m C m m H H V T Requres ndeendence for strong control Holm Procedure Holm Procedure ( { } 1 1/( ( ( ( ( ( (2 (1 1 1 : mn ; ( 1 : mn ; ( values Order the - + > = " # $ % & ' + > = ( ( ( m m T m T L Requres ndeendence for strong control

20 Hochberg Procedure % " ( T(;" = max& ( : ( # ' m $ +1*...the ste-u analogue of Holm Smes/BH Procedure & T (; = max% ( : ( $ ' # ( " m Weak controls the FWER (Smes 1986 Strongly controls FDR (Benjamn & Hochberg 1995 Both requre the null -values to be ndeendent False Dscovery Rate The false dscovery rate measures the roorton of false ostves among all genes called sgnfcant: #false ostves V V = = #called sgnfcant V + S R Ths s usually arorate because one wants to fnd as many truly dfferentally exressed genes as ossble wth relatvely few false ostves The false dscovery rate gves the rate at whch further bologcal verfcaton wll result n dead-ends 19

21 False Postve Rate versus False Dscovery Rate False ostve rate s the rate at whch truly null genes are called sgnfcant #false ostves V FPR = # truly null m False dscovery rate s the rate at whch sgnfcant genes are truly null FDR #false ostves #called sgnfcant = 0 V R False Postve Rate and P-values The -value s a measure of sgnfcance n terms of the false ostve rate (aka Tye I error rate P-value s defned to be the mnmum false ostve rate at whch the statstc can be called sgnfcant Can be descrbed as the robablty a truly null statstc s as or more extreme than the observed one False Dscovery Rate and Q-values The q-value s a measure of sgnfcance n terms of the false dscovery rate Q-value s defned to be the mnmum false dscovery rate at whch the statstc can be called sgnfcant Can be descrbed as the robablty a statstc as or more extreme s truly null 20

22 21

23 Bayesan Connectons Ths allows Bayesans to estmate FDR as well: ( = Pr( = 0 X = x f ( x x #" FDR " H dx Ths motvates the name q-value drectly: " value( x = Pr q " value( x = Pr ( X x H = 0 ( H = 0 X x All the estmaton resented below can be vewed as an emrcal Bayes aroach Possble FDR Goals 1. For some re-chosen α, estmate a sgnfcance cut-off so that on average FDR α 2. For some re-chosen sgnfcance cut-off, estmate FDR so that E FD ˆ R [ ] " FDR 3. Estmate FDR so that t s smultaneously conservatve over all sgnfcance cut-offs 4. Estmate q-values for all genes that are smultaneously conservatve Unversal Goal 1. The q-value, an FDR-based measure of sgnfcance, s assocated wth each gene 2. The estmated q-values are conservatve over all genes smultaneously In dong so, all four otons wll be met 22

24 Estmate of FDR We begn by estmatng FDR when callng all genes sgnfcant wth - values t Heurstc motvaton: E FDR( t " E [ V ( t ] [ R( t ] [ t} ] E [#{ t} ] E #{null = mˆ " t Dˆ R( t = #{ t} F 0 =m 0 t Estmate of π 0 We frst estmate the more easly nterreted π 0 = m 0 /m, the roorton of truly null (non-dfferentally exressed genes: #{ > } " ˆ0( = m $ (1 # Then clearly m ˆ " m 0 = ˆ 0 #{ > } " ˆ0 ( = m $ (1 # ˆ0 23

25 Choose λ by Balancng Bas and Varance ˆ0 = 0.67 Overall FDR Estmate The overall estmate of FDR(t s ˆ m" t Dˆ R( t = # #{ t} F 0 The mlct estmate used n the orgnal FDR aer s a secal case of the above estmate wth π 0 estmated as 1. Numercal Examle Suose we call all genes sgnfcant wth -values 0.03 The estmate of the FDR s F Dˆ R = = = Could use any threshold 0 t 1 24

26 Q-value Estmate The mathematcal defnton of the q- value of gene s q - value( = mn t Snce FDR FDR, we estmate the q-value of gene by qˆ ( = mn t FDR( t FDˆ R( t Q-Plots Theoretcal Results Suose that the emrcal dstrbuton functons of the null statstcs and of the alternatve statstcs converge as the number of genes m gets large The FDR estmates are asymtotcally conservatve smultaneously over all sgnfcance regons The estmated q-values are smultaneously conservatve over all genes Ths s equvalent to controllng the FDR at all levels α smultaneously 25

27 26

28 27

29 Smulaton Study Performed 3000 hyothess tests of H 0 :N(0,1 versus H 1 :N(2,1 The statstcs had correlaton 0.40 n blocks of 50 Two conclusons: 1. The true q-values under ths deendence structure are the same as those gven under the ndeendence model 2. The estmated q-values are smultaneously conservatve 28

30 3000 Deendent Tests of N(0,1 versus N(2,1 Power Comarson Power Comarson FDR Level # Sgnfcant BH # Sgnfcant PP ˆ0 =1 ˆ0 =

31 & ˆ0 FDˆ R( " = & ˆ ("' = 0 B b= 1 SAM Verson B 0b 0b 0b #{ d : d % l( " or d $ r( "}/ b= 1 #{ d : d % l( " or d $ r( "} # # { d : d > l( "' or d < r( "'} 0b 0b 0b { d : d > l( "' or d < r( "'}/ B & ˆ0 # avg no. nulls called sgnfcant = no. observed called sgnfcant B What should one look for n a multle testng rocedure? As we wll see, there s a bewlderng varety of multle testng rocedures. How can we choose whch to use? There s no smle answer here, but each can be judged accordng to a number of crtera: Interretaton: does the rocedure answer a relevant queston for you? Tye of control: strong or weak? Valdty: are the assumtons under whch the rocedure ales clear and defntely or lausbly true, or are they unclear and most robably not true? Comutablty: are the rocedure s calculatons straghtforward to calculate accurately, or s there ossbly numercal or smulaton uncertanty, or dscreteness? Selected references Westfall, PH and SS Young (1993 Resamlng-based multle testng: Examles and methods for -value adjustment, John Wley & Sons, Inc Benjamn, Y & Y Hochberg (1995 Controllng the false dscovery rate: a ractcal and owerful aroach to multle testng JRSS B 57: J Storey (2001: 3 aers (some wth other authors, www-stat.stanford.edu/~jstorey/ The ostve false dscovery rate: a Bayesan nterretaton and the q-value. A drect aroach to false dscovery rates Estmatng false dscovery rates under deendence, wth alcatons to mcroarrays Y Ge et al (2001 Fast algorthm for resamlng based -value adjustment for multle testng 30

32 Sgnfcance analyss of mcroarrays (SAM A clever adataton of the t-rato to borrow nformaton across genes In Boconductor, sggenes acakge s avalable SAM-statstc For gene d = y " x s + s 0 y = mean of Irradated samles x = mean of Unrradated samles = s = s 0 Standard devaton of resduals for gene a assumng same varance Exchangeablty factor estmated usng all genes The exchangeablty factor Chosen to make sgnal-to-nose ratos ndeendent of sgnal Comutaton Let s be the ercentle of the s values. Let d = r /( s s + Comute the 100 quantles of the values, denoted by q 1 < q2 < L < q100 " ( 0,0.05,0.10, K,1.0 For " Comute v j = mad( d s [ q j, q j+ 1, j = 1,2, K,99, where mad s the medan absolute devaton from the medan, dvded by 0.64 Comute cv( = coeffcent of varaton of the Choose ˆ = arg mn[ cv( ]. ˆ s ˆ = s. and v j 0 s 31

33 Scatter lots of relatve dfference Random fluctuatons n the data, measured by balanced ermutatons (for cell lne 1 and 2 Relatve dfference for a ermutaton of the data that was balanced betwe en cell lnes 1 and 2. The reference dstrbuton Order the values of (could be any stat d ( 1 d(2 L d( Permute the treatment labels, and comute a new set of ordered values * * * d ( 1 d(2 L d( Reeat ste 2 for, say, 100 ermutatons: *1 *1 *1 d d L d d (1 *2 (1 *100 *100 *100 d(1 d(2 L d( From these, comute the average largest, average second largest etc. d d (2 *2 (2 M L M d ( *2 ( Selected genes 32

34 exected d( Delta Ave # falsely sgnfcant # called sgnfcant False dscovery rate Delta s the half-wdth of the bar around the 45-degree lne. More general versons of SAM More than two grous Pared data Survval data, wth censored resonse 33

35 Lmtatons of SAM Solutons for s_0 are often at the extremes and senstve to the resoluton of the quantle grd. Permutaton analyss throws all genes n the same bag Requres a monotone sgnal-tonose relatonsh 34

Advanced Differential Expression Analysis. Outline. Quantifying Differentially Expression. Review of the basic ideas

Advanced Differential Expression Analysis. Outline. Quantifying Differentially Expression. Review of the basic ideas Advanced Dfferental Expresson Analyss Outlne Revew of the basc deas Introducton to (Emprcal) Bayesan Statstcs The multple comparson problem SAM Quantfyng Dfferentally Expresson 1 Two questons Can we order

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analyss of Varance and Desgn of Exerments-I MODULE II LECTURE - GENERAL LINEAR HYPOTHESIS AND ANALYSIS OF VARIANCE Dr. Shalabh Deartment of Mathematcs and Statstcs Indan Insttute of Technology Kanur 3.

More information

Comparing two Quantiles: the Burr Type X and Weibull Cases

Comparing two Quantiles: the Burr Type X and Weibull Cases IOSR Journal of Mathematcs (IOSR-JM) e-issn: 78-578, -ISSN: 39-765X. Volume, Issue 5 Ver. VII (Se. - Oct.06), PP 8-40 www.osrjournals.org Comarng two Quantles: the Burr Tye X and Webull Cases Mohammed

More information

Machine Learning. Classification. Theory of Classification and Nonparametric Classifier. Representing data: Hypothesis (classifier) Eric Xing

Machine Learning. Classification. Theory of Classification and Nonparametric Classifier. Representing data: Hypothesis (classifier) Eric Xing Machne Learnng 0-70/5 70/5-78, 78, Fall 008 Theory of Classfcaton and Nonarametrc Classfer Erc ng Lecture, Setember 0, 008 Readng: Cha.,5 CB and handouts Classfcaton Reresentng data: M K Hyothess classfer

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analyss of Varance and Desgn of Exerments-I MODULE III LECTURE - 2 EXPERIMENTAL DESIGN MODELS Dr. Shalabh Deartment of Mathematcs and Statstcs Indan Insttute of Technology Kanur 2 We consder the models

More information

Logistic regression with one predictor. STK4900/ Lecture 7. Program

Logistic regression with one predictor. STK4900/ Lecture 7. Program Logstc regresson wth one redctor STK49/99 - Lecture 7 Program. Logstc regresson wth one redctor 2. Maxmum lkelhood estmaton 3. Logstc regresson wth several redctors 4. Devance and lkelhood rato tests 5.

More information

2E Pattern Recognition Solutions to Introduction to Pattern Recognition, Chapter 2: Bayesian pattern classification

2E Pattern Recognition Solutions to Introduction to Pattern Recognition, Chapter 2: Bayesian pattern classification E395 - Pattern Recognton Solutons to Introducton to Pattern Recognton, Chapter : Bayesan pattern classfcaton Preface Ths document s a soluton manual for selected exercses from Introducton to Pattern Recognton

More information

Confidence intervals for weighted polynomial calibrations

Confidence intervals for weighted polynomial calibrations Confdence ntervals for weghted olynomal calbratons Sergey Maltsev, Amersand Ltd., Moscow, Russa; ur Kalambet, Amersand Internatonal, Inc., Beachwood, OH e-mal: kalambet@amersand-ntl.com htt://www.chromandsec.com

More information

Statistics for Economics & Business

Statistics for Economics & Business Statstcs for Economcs & Busness Smple Lnear Regresson Learnng Objectves In ths chapter, you learn: How to use regresson analyss to predct the value of a dependent varable based on an ndependent varable

More information

Correlation and Regression. Correlation 9.1. Correlation. Chapter 9

Correlation and Regression. Correlation 9.1. Correlation. Chapter 9 Chapter 9 Correlaton and Regresson 9. Correlaton Correlaton A correlaton s a relatonshp between two varables. The data can be represented b the ordered pars (, ) where s the ndependent (or eplanator) varable,

More information

Pattern Recognition. Approximating class densities, Bayesian classifier, Errors in Biometric Systems

Pattern Recognition. Approximating class densities, Bayesian classifier, Errors in Biometric Systems htt://.cubs.buffalo.edu attern Recognton Aromatng class denstes, Bayesan classfer, Errors n Bometrc Systems B. W. Slverman, Densty estmaton for statstcs and data analyss. London: Chaman and Hall, 986.

More information

On New Selection Procedures for Unequal Probability Sampling

On New Selection Procedures for Unequal Probability Sampling Int. J. Oen Problems Comt. Math., Vol. 4, o. 1, March 011 ISS 1998-66; Coyrght ICSRS Publcaton, 011 www.-csrs.org On ew Selecton Procedures for Unequal Probablty Samlng Muhammad Qaser Shahbaz, Saman Shahbaz

More information

Department of Statistics University of Toronto STA305H1S / 1004 HS Design and Analysis of Experiments Term Test - Winter Solution

Department of Statistics University of Toronto STA305H1S / 1004 HS Design and Analysis of Experiments Term Test - Winter Solution Department of Statstcs Unversty of Toronto STA35HS / HS Desgn and Analyss of Experments Term Test - Wnter - Soluton February, Last Name: Frst Name: Student Number: Instructons: Tme: hours. Ads: a non-programmable

More information

Economics 130. Lecture 4 Simple Linear Regression Continued

Economics 130. Lecture 4 Simple Linear Regression Continued Economcs 130 Lecture 4 Contnued Readngs for Week 4 Text, Chapter and 3. We contnue wth addressng our second ssue + add n how we evaluate these relatonshps: Where do we get data to do ths analyss? How do

More information

x = , so that calculated

x = , so that calculated Stat 4, secton Sngle Factor ANOVA notes by Tm Plachowsk n chapter 8 we conducted hypothess tests n whch we compared a sngle sample s mean or proporton to some hypotheszed value Chapter 9 expanded ths to

More information

Algorithms for factoring

Algorithms for factoring CSA E0 235: Crytograhy Arl 9,2015 Instructor: Arta Patra Algorthms for factorng Submtted by: Jay Oza, Nranjan Sngh Introducton Factorsaton of large ntegers has been a wdely studed toc manly because of

More information

Homework 10 Stat 547. Problem ) Z D!

Homework 10 Stat 547. Problem ) Z D! Homework 0 Stat 547 Problem 74 Notaton: h s the hazard rate for the aneulod grou, h s the hazard rate for the dlod grou (a Log-rank test s erformed: H 0 : h (t = h (t Sgnfcance level α = 005 Test statstc

More information

Basic Statistical Analysis and Yield Calculations

Basic Statistical Analysis and Yield Calculations October 17, 007 Basc Statstcal Analyss and Yeld Calculatons Dr. José Ernesto Rayas Sánchez 1 Outlne Sources of desgn-performance uncertanty Desgn and development processes Desgn for manufacturablty A general

More information

Statistics II Final Exam 26/6/18

Statistics II Final Exam 26/6/18 Statstcs II Fnal Exam 26/6/18 Academc Year 2017/18 Solutons Exam duraton: 2 h 30 mn 1. (3 ponts) A town hall s conductng a study to determne the amount of leftover food produced by the restaurants n the

More information

UNIVERSITY OF TORONTO Faculty of Arts and Science. December 2005 Examinations STA437H1F/STA1005HF. Duration - 3 hours

UNIVERSITY OF TORONTO Faculty of Arts and Science. December 2005 Examinations STA437H1F/STA1005HF. Duration - 3 hours UNIVERSITY OF TORONTO Faculty of Arts and Scence December 005 Examnatons STA47HF/STA005HF Duraton - hours AIDS ALLOWED: (to be suppled by the student) Non-programmable calculator One handwrtten 8.5'' x

More information

Statistical Inference. 2.3 Summary Statistics Measures of Center and Spread. parameters ( population characteristics )

Statistical Inference. 2.3 Summary Statistics Measures of Center and Spread. parameters ( population characteristics ) Ismor Fscher, 8//008 Stat 54 / -8.3 Summary Statstcs Measures of Center and Spread Dstrbuton of dscrete contnuous POPULATION Random Varable, numercal True center =??? True spread =???? parameters ( populaton

More information

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA 4 Analyss of Varance (ANOVA) 5 ANOVA 51 Introducton ANOVA ANOVA s a way to estmate and test the means of multple populatons We wll start wth one-way ANOVA If the populatons ncluded n the study are selected

More information

Pattern Classification

Pattern Classification attern Classfcaton All materals n these sldes were taken from attern Classfcaton nd ed by R. O. Duda,. E. Hart and D. G. Stork, John Wley & Sons, 000 wth the ermsson of the authors and the ublsher Chater

More information

Multiple Regression Analysis

Multiple Regression Analysis Multle Regresson Analss Roland Szlág Ph.D. Assocate rofessor Correlaton descres the strength of a relatonsh, the degree to whch one varale s lnearl related to another Regresson shows us how to determne

More information

UCLA STAT 13 Introduction to Statistical Methods for the Life and Health Sciences. Chapter 11 Analysis of Variance - ANOVA. Instructor: Ivo Dinov,

UCLA STAT 13 Introduction to Statistical Methods for the Life and Health Sciences. Chapter 11 Analysis of Variance - ANOVA. Instructor: Ivo Dinov, UCLA STAT 3 ntroducton to Statstcal Methods for the Lfe and Health Scences nstructor: vo Dnov, Asst. Prof. of Statstcs and Neurology Chapter Analyss of Varance - ANOVA Teachng Assstants: Fred Phoa, Anwer

More information

Chapter 3 Describing Data Using Numerical Measures

Chapter 3 Describing Data Using Numerical Measures Chapter 3 Student Lecture Notes 3-1 Chapter 3 Descrbng Data Usng Numercal Measures Fall 2006 Fundamentals of Busness Statstcs 1 Chapter Goals To establsh the usefulness of summary measures of data. The

More information

Multiple Contrasts (Simulation)

Multiple Contrasts (Simulation) Chapter 590 Multple Contrasts (Smulaton) Introducton Ths procedure uses smulaton to analyze the power and sgnfcance level of two multple-comparson procedures that perform two-sded hypothess tests of contrasts

More information

STAT 3008 Applied Regression Analysis

STAT 3008 Applied Regression Analysis STAT 3008 Appled Regresson Analyss Tutoral : Smple Lnear Regresson LAI Chun He Department of Statstcs, The Chnese Unversty of Hong Kong 1 Model Assumpton To quantfy the relatonshp between two factors,

More information

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4) I. Classcal Assumptons Econ7 Appled Econometrcs Topc 3: Classcal Model (Studenmund, Chapter 4) We have defned OLS and studed some algebrac propertes of OLS. In ths topc we wll study statstcal propertes

More information

/ n ) are compared. The logic is: if the two

/ n ) are compared. The logic is: if the two STAT C141, Sprng 2005 Lecture 13 Two sample tests One sample tests: examples of goodness of ft tests, where we are testng whether our data supports predctons. Two sample tests: called as tests of ndependence

More information

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation Statstcs for Managers Usng Mcrosoft Excel/SPSS Chapter 13 The Smple Lnear Regresson Model and Correlaton 1999 Prentce-Hall, Inc. Chap. 13-1 Chapter Topcs Types of Regresson Models Determnng the Smple Lnear

More information

A Quadratic Cumulative Production Model for the Material Balance of Abnormally-Pressured Gas Reservoirs F.E. Gonzalez M.S.

A Quadratic Cumulative Production Model for the Material Balance of Abnormally-Pressured Gas Reservoirs F.E. Gonzalez M.S. Natural as Engneerng A Quadratc Cumulatve Producton Model for the Materal Balance of Abnormally-Pressured as Reservors F.E. onale M.S. Thess (2003) T.A. Blasngame, Texas A&M U. Deartment of Petroleum Engneerng

More information

Non-Ideality Through Fugacity and Activity

Non-Ideality Through Fugacity and Activity Non-Idealty Through Fugacty and Actvty S. Patel Deartment of Chemstry and Bochemstry, Unversty of Delaware, Newark, Delaware 19716, USA Corresondng author. E-mal: saatel@udel.edu 1 I. FUGACITY In ths dscusson,

More information

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6 Department of Quanttatve Methods & Informaton Systems Tme Seres and Ther Components QMIS 30 Chapter 6 Fall 00 Dr. Mohammad Zanal These sldes were modfed from ther orgnal source for educatonal purpose only.

More information

A General Class of Selection Procedures and Modified Murthy Estimator

A General Class of Selection Procedures and Modified Murthy Estimator ISS 684-8403 Journal of Statstcs Volume 4, 007,. 3-9 A General Class of Selecton Procedures and Modfed Murthy Estmator Abdul Bast and Muhammad Qasar Shahbaz Abstract A new selecton rocedure for unequal

More information

Basic Business Statistics, 10/e

Basic Business Statistics, 10/e Chapter 13 13-1 Basc Busness Statstcs 11 th Edton Chapter 13 Smple Lnear Regresson Basc Busness Statstcs, 11e 009 Prentce-Hall, Inc. Chap 13-1 Learnng Objectves In ths chapter, you learn: How to use regresson

More information

A Quadratic Cumulative Production Model for the Material Balance of Abnormally-Pressured Gas Reservoirs F.E. Gonzalez M.S.

A Quadratic Cumulative Production Model for the Material Balance of Abnormally-Pressured Gas Reservoirs F.E. Gonzalez M.S. Formaton Evaluaton and the Analyss of Reservor Performance A Quadratc Cumulatve Producton Model for the Materal Balance of Abnormally-Pressured as Reservors F.E. onale M.S. Thess (2003) T.A. Blasngame,

More information

Bayesian Decision Theory

Bayesian Decision Theory No.4 Bayesan Decson Theory Hu Jang Deartment of Electrcal Engneerng and Comuter Scence Lassonde School of Engneerng York Unversty, Toronto, Canada Outlne attern Classfcaton roblems Bayesan Decson Theory

More information

Web-Mining Agents Probabilistic Information Retrieval

Web-Mining Agents Probabilistic Information Retrieval Web-Mnng Agents Probablstc Informaton etreval Prof. Dr. alf Möller Unverstät zu Lübeck Insttut für Informatonssysteme Karsten Martny Übungen Acknowledgements Sldes taken from: Introducton to Informaton

More information

MIMA Group. Chapter 2 Bayesian Decision Theory. School of Computer Science and Technology, Shandong University. Xin-Shun SDU

MIMA Group. Chapter 2 Bayesian Decision Theory. School of Computer Science and Technology, Shandong University. Xin-Shun SDU Group M D L M Chapter Bayesan Decson heory Xn-Shun Xu @ SDU School of Computer Scence and echnology, Shandong Unversty Bayesan Decson heory Bayesan decson theory s a statstcal approach to data mnng/pattern

More information

Bayesian networks for scenario analysis of nuclear waste repositories

Bayesian networks for scenario analysis of nuclear waste repositories Bayesan networks for scenaro analyss of nuclear waste reostores Edoardo Toson ab Aht Salo a Enrco Zo bc a. Systems Analyss Laboratory Det of Mathematcs and Systems Analyss - Aalto Unversty b. Laboratory

More information

18. SIMPLE LINEAR REGRESSION III

18. SIMPLE LINEAR REGRESSION III 8. SIMPLE LINEAR REGRESSION III US Domestc Beers: Calores vs. % Alcohol Ftted Values and Resduals To each observed x, there corresponds a y-value on the ftted lne, y ˆ ˆ = α + x. The are called ftted values.

More information

BIO Lab 2: TWO-LEVEL NORMAL MODELS with school children popularity data

BIO Lab 2: TWO-LEVEL NORMAL MODELS with school children popularity data Lab : TWO-LEVEL NORMAL MODELS wth school chldren popularty data Purpose: Introduce basc two-level models for normally dstrbuted responses usng STATA. In partcular, we dscuss Random ntercept models wthout

More information

Here is the rationale: If X and y have a strong positive relationship to one another, then ( x x) will tend to be positive when ( y y)

Here is the rationale: If X and y have a strong positive relationship to one another, then ( x x) will tend to be positive when ( y y) Secton 1.5 Correlaton In the prevous sectons, we looked at regresson and the value r was a measurement of how much of the varaton n y can be attrbuted to the lnear relatonshp between y and x. In ths secton,

More information

Definition. Measures of Dispersion. Measures of Dispersion. Definition. The Range. Measures of Dispersion 3/24/2014

Definition. Measures of Dispersion. Measures of Dispersion. Definition. The Range. Measures of Dispersion 3/24/2014 Measures of Dsperson Defenton Range Interquartle Range Varance and Standard Devaton Defnton Measures of dsperson are descrptve statstcs that descrbe how smlar a set of scores are to each other The more

More information

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands Content. Inference on Regresson Parameters a. Fndng Mean, s.d and covarance amongst estmates.. Confdence Intervals and Workng Hotellng Bands 3. Cochran s Theorem 4. General Lnear Testng 5. Measures of

More information

where I = (n x n) diagonal identity matrix with diagonal elements = 1 and off-diagonal elements = 0; and σ 2 e = variance of (Y X).

where I = (n x n) diagonal identity matrix with diagonal elements = 1 and off-diagonal elements = 0; and σ 2 e = variance of (Y X). 11.4.1 Estmaton of Multple Regresson Coeffcents In multple lnear regresson, we essentally solve n equatons for the p unnown parameters. hus n must e equal to or greater than p and n practce n should e

More information

Digital PI Controller Equations

Digital PI Controller Equations Ver. 4, 9 th March 7 Dgtal PI Controller Equatons Probably the most common tye of controller n ndustral ower electroncs s the PI (Proortonal - Integral) controller. In feld orented motor control, PI controllers

More information

Hidden Markov Model Cheat Sheet

Hidden Markov Model Cheat Sheet Hdden Markov Model Cheat Sheet (GIT ID: dc2f391536d67ed5847290d5250d4baae103487e) Ths document s a cheat sheet on Hdden Markov Models (HMMs). It resembles lecture notes, excet that t cuts to the chase

More information

BOOTSTRAP METHOD FOR TESTING OF EQUALITY OF SEVERAL MEANS. M. Krishna Reddy, B. Naveen Kumar and Y. Ramu

BOOTSTRAP METHOD FOR TESTING OF EQUALITY OF SEVERAL MEANS. M. Krishna Reddy, B. Naveen Kumar and Y. Ramu BOOTSTRAP METHOD FOR TESTING OF EQUALITY OF SEVERAL MEANS M. Krshna Reddy, B. Naveen Kumar and Y. Ramu Department of Statstcs, Osmana Unversty, Hyderabad -500 007, Inda. nanbyrozu@gmal.com, ramu0@gmal.com

More information

The Small Noise Arbitrage Pricing Theory

The Small Noise Arbitrage Pricing Theory The Small Nose Arbtrage Prcng Theory S. Satchell Trnty College Unversty of Cambrdge and School of Fnance and Economcs Unversty of Technology, Sydney December 998 Ths aer was wrtten when the Author was

More information

Lecture 6 More on Complete Randomized Block Design (RBD)

Lecture 6 More on Complete Randomized Block Design (RBD) Lecture 6 More on Complete Randomzed Block Desgn (RBD) Multple test Multple test The multple comparsons or multple testng problem occurs when one consders a set of statstcal nferences smultaneously. For

More information

Basically, if you have a dummy dependent variable you will be estimating a probability.

Basically, if you have a dummy dependent variable you will be estimating a probability. ECON 497: Lecture Notes 13 Page 1 of 1 Metropoltan State Unversty ECON 497: Research and Forecastng Lecture Notes 13 Dummy Dependent Varable Technques Studenmund Chapter 13 Bascally, f you have a dummy

More information

Fuzzy approach to solve multi-objective capacitated transportation problem

Fuzzy approach to solve multi-objective capacitated transportation problem Internatonal Journal of Bonformatcs Research, ISSN: 0975 087, Volume, Issue, 00, -0-4 Fuzzy aroach to solve mult-objectve caactated transortaton roblem Lohgaonkar M. H. and Bajaj V. H.* * Deartment of

More information

7.1. Single classification analysis of variance (ANOVA) Why not use multiple 2-sample 2. When to use ANOVA

7.1. Single classification analysis of variance (ANOVA) Why not use multiple 2-sample 2. When to use ANOVA Sngle classfcaton analyss of varance (ANOVA) When to use ANOVA ANOVA models and parttonng sums of squares ANOVA: hypothess testng ANOVA: assumptons A non-parametrc alternatve: Kruskal-Walls ANOVA Power

More information

28. SIMPLE LINEAR REGRESSION III

28. SIMPLE LINEAR REGRESSION III 8. SIMPLE LINEAR REGRESSION III Ftted Values and Resduals US Domestc Beers: Calores vs. % Alcohol To each observed x, there corresponds a y-value on the ftted lne, y ˆ = βˆ + βˆ x. The are called ftted

More information

Journal of Multivariate Analysis

Journal of Multivariate Analysis Journal of Multvarate Analyss 07 (202) 232 243 Contents lsts avalable at ScVerse ScenceDrect Journal of Multvarate Analyss journal homeage: www.elsever.com/locate/jmva James Sten tye estmators of varances

More information

Comparison of the Population Variance Estimators. of 2-Parameter Exponential Distribution Based on. Multiple Criteria Decision Making Method

Comparison of the Population Variance Estimators. of 2-Parameter Exponential Distribution Based on. Multiple Criteria Decision Making Method Appled Mathematcal Scences, Vol. 7, 0, no. 47, 07-0 HIARI Ltd, www.m-hkar.com Comparson of the Populaton Varance Estmators of -Parameter Exponental Dstrbuton Based on Multple Crtera Decson Makng Method

More information

4.3 Poisson Regression

4.3 Poisson Regression of teratvely reweghted least squares regressons (the IRLS algorthm). We do wthout gvng further detals, but nstead focus on the practcal applcaton. > glm(survval~log(weght)+age, famly="bnomal", data=baby)

More information

Advanced Topics in Optimization. Piecewise Linear Approximation of a Nonlinear Function

Advanced Topics in Optimization. Piecewise Linear Approximation of a Nonlinear Function Advanced Tocs n Otmzaton Pecewse Lnear Aroxmaton of a Nonlnear Functon Otmzaton Methods: M8L Introducton and Objectves Introducton There exsts no general algorthm for nonlnear rogrammng due to ts rregular

More information

First Year Examination Department of Statistics, University of Florida

First Year Examination Department of Statistics, University of Florida Frst Year Examnaton Department of Statstcs, Unversty of Florda May 7, 010, 8:00 am - 1:00 noon Instructons: 1. You have four hours to answer questons n ths examnaton.. You must show your work to receve

More information

Chapter 5 Multilevel Models

Chapter 5 Multilevel Models Chapter 5 Multlevel Models 5.1 Cross-sectonal multlevel models 5.1.1 Two-level models 5.1.2 Multple level models 5.1.3 Multple level modelng n other felds 5.2 Longtudnal multlevel models 5.2.1 Two-level

More information

6. Hamilton s Equations

6. Hamilton s Equations 6. Hamlton s Equatons Mchael Fowler A Dynamcal System s Path n Confguraton Sace and n State Sace The story so far: For a mechancal system wth n degrees of freedom, the satal confguraton at some nstant

More information

AN ASYMMETRIC GENERALIZED FGM COPULA AND ITS PROPERTIES

AN ASYMMETRIC GENERALIZED FGM COPULA AND ITS PROPERTIES Pa. J. Statst. 015 Vol. 31(1), 95-106 AN ASYMMETRIC GENERALIZED FGM COPULA AND ITS PROPERTIES Berzadeh, H., Parham, G.A. and Zadaram, M.R. Deartment of Statstcs, Shahd Chamran Unversty, Ahvaz, Iran. Corresondng

More information

x i1 =1 for all i (the constant ).

x i1 =1 for all i (the constant ). Chapter 5 The Multple Regresson Model Consder an economc model where the dependent varable s a functon of K explanatory varables. The economc model has the form: y = f ( x,x,..., ) xk Approxmate ths by

More information

Spatial Statistics and Analysis Methods (for GEOG 104 class).

Spatial Statistics and Analysis Methods (for GEOG 104 class). Spatal Statstcs and Analyss Methods (for GEOG 104 class). Provded by Dr. An L, San Dego State Unversty. 1 Ponts Types of spatal data Pont pattern analyss (PPA; such as nearest neghbor dstance, quadrat

More information

Chapter 11: I = 2 samples independent samples paired samples Chapter 12: I 3 samples of equal size J one-way layout two-way layout

Chapter 11: I = 2 samples independent samples paired samples Chapter 12: I 3 samples of equal size J one-way layout two-way layout Serk Sagtov, Chalmers and GU, February 0, 018 Chapter 1. Analyss of varance Chapter 11: I = samples ndependent samples pared samples Chapter 1: I 3 samples of equal sze one-way layout two-way layout 1

More information

Resource Allocation and Decision Analysis (ECON 8010) Spring 2014 Foundations of Regression Analysis

Resource Allocation and Decision Analysis (ECON 8010) Spring 2014 Foundations of Regression Analysis Resource Allocaton and Decson Analss (ECON 800) Sprng 04 Foundatons of Regresson Analss Readng: Regresson Analss (ECON 800 Coursepak, Page 3) Defntons and Concepts: Regresson Analss statstcal technques

More information

Classification Bayesian Classifiers

Classification Bayesian Classifiers lassfcaton Bayesan lassfers Jeff Howbert Introducton to Machne Learnng Wnter 2014 1 Bayesan classfcaton A robablstc framework for solvng classfcaton roblems. Used where class assgnment s not determnstc,.e.

More information

Joint Statistical Meetings - Biopharmaceutical Section

Joint Statistical Meetings - Biopharmaceutical Section Iteratve Ch-Square Test for Equvalence of Multple Treatment Groups Te-Hua Ng*, U.S. Food and Drug Admnstraton 1401 Rockvlle Pke, #200S, HFM-217, Rockvlle, MD 20852-1448 Key Words: Equvalence Testng; Actve

More information

Lecture 4 Hypothesis Testing

Lecture 4 Hypothesis Testing Lecture 4 Hypothess Testng We may wsh to test pror hypotheses about the coeffcents we estmate. We can use the estmates to test whether the data rejects our hypothess. An example mght be that we wsh to

More information

Departure Process from a M/M/m/ Queue

Departure Process from a M/M/m/ Queue Dearture rocess fro a M/M// Queue Q - (-) Q Q3 Q4 (-) Knowledge of the nature of the dearture rocess fro a queue would be useful as we can then use t to analyze sle cases of queueng networs as shown. The

More information

Linear Approximation with Regularization and Moving Least Squares

Linear Approximation with Regularization and Moving Least Squares Lnear Approxmaton wth Regularzaton and Movng Least Squares Igor Grešovn May 007 Revson 4.6 (Revson : March 004). 5 4 3 0.5 3 3.5 4 Contents: Lnear Fttng...4. Weghted Least Squares n Functon Approxmaton...

More information

Chapter 11: Simple Linear Regression and Correlation

Chapter 11: Simple Linear Regression and Correlation Chapter 11: Smple Lnear Regresson and Correlaton 11-1 Emprcal Models 11-2 Smple Lnear Regresson 11-3 Propertes of the Least Squares Estmators 11-4 Hypothess Test n Smple Lnear Regresson 11-4.1 Use of t-tests

More information

Using Genetic Algorithms in System Identification

Using Genetic Algorithms in System Identification Usng Genetc Algorthms n System Identfcaton Ecaterna Vladu Deartment of Electrcal Engneerng and Informaton Technology, Unversty of Oradea, Unverstat, 410087 Oradea, Româna Phone: +40259408435, Fax: +40259408408,

More information

Chapter 14 Simple Linear Regression

Chapter 14 Simple Linear Regression Chapter 4 Smple Lnear Regresson Chapter 4 - Smple Lnear Regresson Manageral decsons often are based on the relatonshp between two or more varables. Regresson analss can be used to develop an equaton showng

More information

+, where 0 x N - n. k k

+, where 0 x N - n. k k CO 745, Mdterm Len Cabrera. A multle choce eam has questons, each of whch has ossble answers. A student nows the correct answer to n of these questons. For the remanng - n questons, he checs the answers

More information

Naïve Bayes Classifier

Naïve Bayes Classifier 9/8/07 MIST.6060 Busness Intellgence and Data Mnng Naïve Bayes Classfer Termnology Predctors: the attrbutes (varables) whose values are used for redcton and classfcaton. Predctors are also called nut varables,

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analyss of Varance and Desgn of Experment-I MODULE VII LECTURE - 3 ANALYSIS OF COVARIANCE Dr Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur Any scentfc experment s performed

More information

Lecture 3 Stat102, Spring 2007

Lecture 3 Stat102, Spring 2007 Lecture 3 Stat0, Sprng 007 Chapter 3. 3.: Introducton to regresson analyss Lnear regresson as a descrptve technque The least-squares equatons Chapter 3.3 Samplng dstrbuton of b 0, b. Contnued n net lecture

More information

Statistics Chapter 4

Statistics Chapter 4 Statstcs Chapter 4 "There are three knds of les: les, damned les, and statstcs." Benjamn Dsrael, 1895 (Brtsh statesman) Gaussan Dstrbuton, 4-1 If a measurement s repeated many tmes a statstcal treatment

More information

Supplementary Material for Spectral Clustering based on the graph p-laplacian

Supplementary Material for Spectral Clustering based on the graph p-laplacian Sulementary Materal for Sectral Clusterng based on the grah -Lalacan Thomas Bühler and Matthas Hen Saarland Unversty, Saarbrücken, Germany {tb,hen}@csun-sbde May 009 Corrected verson, June 00 Abstract

More information

Managing Capacity Through Reward Programs. on-line companion page. Byung-Do Kim Seoul National University College of Business Administration

Managing Capacity Through Reward Programs. on-line companion page. Byung-Do Kim Seoul National University College of Business Administration Managng Caacty Through eward Programs on-lne comanon age Byung-Do Km Seoul Natonal Unversty College of Busness Admnstraton Mengze Sh Unversty of Toronto otman School of Management Toronto ON M5S E6 Canada

More information

Parametric fractional imputation for missing data analysis. Jae Kwang Kim Survey Working Group Seminar March 29, 2010

Parametric fractional imputation for missing data analysis. Jae Kwang Kim Survey Working Group Seminar March 29, 2010 Parametrc fractonal mputaton for mssng data analyss Jae Kwang Km Survey Workng Group Semnar March 29, 2010 1 Outlne Introducton Proposed method Fractonal mputaton Approxmaton Varance estmaton Multple mputaton

More information

Probabilistic Variation Mode and Effect Analysis: A Case Study of an Air Engine Component

Probabilistic Variation Mode and Effect Analysis: A Case Study of an Air Engine Component Probablstc Varaton Mode and Effect Analyss: A Case Study of an Ar Engne Comonent Pär Johannesson Fraunhofer-Chalmers Research Centre for Industral Mathematcs, Sweden; Par.Johannesson@fcc.chalmers.se Thomas

More information

Durban Watson for Testing the Lack-of-Fit of Polynomial Regression Models without Replications

Durban Watson for Testing the Lack-of-Fit of Polynomial Regression Models without Replications Durban Watson for Testng the Lack-of-Ft of Polynomal Regresson Models wthout Replcatons Ruba A. Alyaf, Maha A. Omar, Abdullah A. Al-Shha ralyaf@ksu.edu.sa, maomar@ksu.edu.sa, aalshha@ksu.edu.sa Department

More information

Statistics MINITAB - Lab 2

Statistics MINITAB - Lab 2 Statstcs 20080 MINITAB - Lab 2 1. Smple Lnear Regresson In smple lnear regresson we attempt to model a lnear relatonshp between two varables wth a straght lne and make statstcal nferences concernng that

More information

A Robust Method for Calculating the Correlation Coefficient

A Robust Method for Calculating the Correlation Coefficient A Robust Method for Calculatng the Correlaton Coeffcent E.B. Nven and C. V. Deutsch Relatonshps between prmary and secondary data are frequently quantfed usng the correlaton coeffcent; however, the tradtonal

More information

Composite Hypotheses testing

Composite Hypotheses testing Composte ypotheses testng In many hypothess testng problems there are many possble dstrbutons that can occur under each of the hypotheses. The output of the source s a set of parameters (ponts n a parameter

More information

STAT 511 FINAL EXAM NAME Spring 2001

STAT 511 FINAL EXAM NAME Spring 2001 STAT 5 FINAL EXAM NAME Sprng Instructons: Ths s a closed book exam. No notes or books are allowed. ou may use a calculator but you are not allowed to store notes or formulas n the calculator. Please wrte

More information

Michael Batty. Alan Wilson Plenary Session Entropy, Complexity, & Information in Spatial Analysis

Michael Batty. Alan Wilson Plenary Session Entropy, Complexity, & Information in Spatial Analysis Alan Wlson Plenary Sesson Entroy, Comlexty, & Informaton n Satal Analyss Mchael Batty m.batty@ucl.ac.uk @jmchaelbatty htt://www.comlexcty.nfo/ htt://www.satalcomlexty.nfo/ for Advanced Satal Analyss CentreCentre

More information

ANOMALIES OF THE MAGNITUDE OF THE BIAS OF THE MAXIMUM LIKELIHOOD ESTIMATOR OF THE REGRESSION SLOPE

ANOMALIES OF THE MAGNITUDE OF THE BIAS OF THE MAXIMUM LIKELIHOOD ESTIMATOR OF THE REGRESSION SLOPE P a g e ANOMALIES OF THE MAGNITUDE OF THE BIAS OF THE MAXIMUM LIKELIHOOD ESTIMATOR OF THE REGRESSION SLOPE Darmud O Drscoll ¹, Donald E. Ramrez ² ¹ Head of Department of Mathematcs and Computer Studes

More information

Independent Component Analysis

Independent Component Analysis Indeendent Comonent Analyss Mture Data Data that are mngled from multle sources May not now how many sources May not now the mng mechansm Good Reresentaton Uncorrelated, nformaton-bearng comonents PCA

More information

MATH 829: Introduction to Data Mining and Analysis The EM algorithm (part 2)

MATH 829: Introduction to Data Mining and Analysis The EM algorithm (part 2) 1/16 MATH 829: Introducton to Data Mnng and Analyss The EM algorthm (part 2) Domnque Gullot Departments of Mathematcal Scences Unversty of Delaware Aprl 20, 2016 Recall 2/16 We are gven ndependent observatons

More information

Chapter 12 Analysis of Covariance

Chapter 12 Analysis of Covariance Chapter Analyss of Covarance Any scentfc experment s performed to know somethng that s unknown about a group of treatments and to test certan hypothess about the correspondng treatment effect When varablty

More information

2016 Wiley. Study Session 2: Ethical and Professional Standards Application

2016 Wiley. Study Session 2: Ethical and Professional Standards Application 6 Wley Study Sesson : Ethcal and Professonal Standards Applcaton LESSON : CORRECTION ANALYSIS Readng 9: Correlaton and Regresson LOS 9a: Calculate and nterpret a sample covarance and a sample correlaton

More information

Stat 543 Exam 2 Spring 2016

Stat 543 Exam 2 Spring 2016 Stat 543 Exam 2 Sprng 2016 I have nether gven nor receved unauthorzed assstance on ths exam. Name Sgned Date Name Prnted Ths Exam conssts of 11 questons. Do at least 10 of the 11 parts of the man exam.

More information

Lecture 06 Multinomials and CMR Models for closed populations

Lecture 06 Multinomials and CMR Models for closed populations WILD 750 - Analyss of Wldlfe Poulatons of 4 Lecture 06 Multnomals and CMR Models for closed oulatons Readngs Wllams, B, JD Nchols, and MJ Conroy 00 Analyss and Management of Anmal Poulatons Academc Press

More information

Derivatives of Value at Risk and Expected Shortfall

Derivatives of Value at Risk and Expected Shortfall Dervatves of Value at Rsk and Exected Shortfall November 2003 Hans Rau-Bredow hans.rau-bredow@mal.un-wuerzburg.de Dr. Hans Rau-Bredow Unversty of Cologne; Unversty of Würzburg Leo Wesmantel Str. 4 D-97074

More information

Uncertainty as the Overlap of Alternate Conditional Distributions

Uncertainty as the Overlap of Alternate Conditional Distributions Uncertanty as the Overlap of Alternate Condtonal Dstrbutons Olena Babak and Clayton V. Deutsch Centre for Computatonal Geostatstcs Department of Cvl & Envronmental Engneerng Unversty of Alberta An mportant

More information