Additional Notes and Computational Formulas CHAPTER 3
|
|
- Beatrix Small
- 5 years ago
- Views:
Transcription
1 Additioal Notes ad Computatioal Formulas APPENDIX CHAPTER 3 1 The Greek capital sigma is the mathematical sig for summatio If we have a sample of observatios say y 1 y 2 y 3 y their sum is y 1 + y 2 + y y This ca also be writte y i or y i which is read as the sum of the y i for i ruig from 1 to The computatioal formula for the sample mea which we shall deote yis y = 1 y i 2 Usig the same otatio the variace is s 2 = 1 1 y i y = 1 1 where d i = y i y This is the best formula to use o a computer that ca store all the y i i memory If a calculator is beig used it is usually more coveiet to use the mathematically equivalet formula s 2 = 1 1 y 2 i 1 d 2 i y i Basic Biostatistics for Geeticists ad Epidemiologists: A Practical Approach 2008 Joh Wiley & Sos Ltd ISBN: R Elsto ad W Johso
2 354 APPENDIX First y i is calculated squared ad divided by ; this gives the correctio factor 1 y 2 i The y2 i is calculated ad the correctio factor subtracted from it The result is divided by 1 3 The coefficiets of skewess ad kurtosis are respectively 1/2 y i y 3 [ y i y] 3/2 ad y i y 4 [ y i y] 2 CHAPTER 4 1 Bayes theorem ca be derived quite simply from the basic laws of probability We have PD j S = PD j ad S PS ad PS = PD 1 ad S ord 2 ad S or D k ad S = PD 1 ad S + PD 2 ad S + + PD k ad S because D 1 D 2 D k are mutually exclusive It follows immediately that PD j S = PD j ad S PD 1 ad S + PD 2 ad S + +PD k ad S 2 Data collected by a paterity testig laboratory ca be used to estimate the proportio of past cases i which the alleged father was the true father without ay kowledge of whether i each case the alleged father was or was ot the true father Briefly oe method of doig this is as follows From a large series of cases that have come to the laboratory we ca calculate the proportio of cases that resulted i a exclusio which we shall call PE Usig geetic theory ad a kowledge of the gee frequecies i the populatio we ca calculate PE D 2 the probability of exclusio give that a radom ma is the true father see for example MacCluer ad Schull 1963 If we assume oly two mutually exclusive
3 APPENDIX 355 possibilities exist D 1 the alleged father is the true father or D 2 a radom ma is the true father we must have PE = PD 1 ad E + PD 2 ad E = PD 1 PE D 1 + PD 2 PE D 2 But PE D 1 the probability of exclusio give that the alleged father is the true father is zero wheever the alleged father is the true father he will ot be excluded by the blood testig Therefore PE = PD 2 PE D 2 ad so we ca calculate PD 2 = PE/PE D 2 ad hece PD 1 = 1 PD 2 I situatios i which this prior probability of paterity has bee estimated it has bee foud to be typically betwee 065 ad 09 CHAPTER 6 1 The asymptotic properties of ubiasedess efficiecy ad ormality hold i geeral oly for maximum likelihood estimators if the likelihood is made a maximum i the mathematical sese ie whe the estimates are substituted for the parameters the likelihood has to be larger tha whe ay other eighborig values are substituted A likelihood just like a probability desity fuctio ca have oe mode several modes or o modes; each modal value is a particular maximum likelihood estimate of the parameter Thus maximum likelihood estimates may ot be uique or eed they eve exist for permissible values of the parameters A variace for example must be positive ad yet its maximum likelihood estimate may sometimes be egative I a situatio such as this the maximum likelihood estimate is sometimes said to be zero if at zero the likelihood is largest for all permissible values of the parameter I geetics the recombiatio fractio caot be greater tha 05 For this reaso the maximum likelihood estimate is ofte said to be 05 if the true maximum occurs at some value greater tha 05 Sice i these cases the likelihood is ot however at a mathematical maximum ie it is ot at a mode such estimators do ot possess the same good properties asymptotically that true maximum likelihood estimators possess 2 Eve if a maximum likelihood estimate is obtaied as a mathematical maximum of the likelihood there are still certai cases i which however large the sample it is still ot ubiased This occurs if as the sample becomes larger ad larger so does the umber of ukow parameters I such situatios the problem is ofte overcome by maximizig a so-called coditioal likelihood You are most
4 356 APPENDIX likely to see this term i coectio with matched-pair desigs You may also ecouter the term partial likelihood ; estimators that maximize this have the same usual asymptotic properties of maximum likelihood estimators 3 To show the equivalece betwee ad P 2 Y μ σ Y 2 2 = 095 P Y 2σ Y μ Y + 2σ Y = 095 first ote that withi each probability statemet there are two iequalities We ca maipulate these iequalities by the ordiary rules of algebra without chagig the probabilities as follows read the sig <=> as is equilvalet to : 2 Y μ σ Y 2 <=> 2 Y μ σ Y ad Y μ σ Y 2 <=> 2σ Y Y μ ad Y μ 2σ Y <=>μ 2σ Y Y ad Y μ + 2σ Y <=>μ Y + 2σ Y ad Y 2σ Y μ <=> Y 2σ Y μ Y + 2σ Y CHAPTER 7 1 The ull hypothesis must be very specific sice we eed to kow the distributio of the test criterio uder it If our research hypothesis is for example < 06 we wat to disprove the alterative 06 But to be specific we take as our ull hypothesis the specific value of π that is least favorable to our research hypothesis; amely = 06 Clearly if we reject the hypothesis = 06 i favor of <06 we must reject with eve greater covictio the possibility that >06 I other words wheever a iequality is ivolved i what we try to disprove we take as our specific ull hypothesis the equality that is closest to the research hypothesis; for if we disprove that particular equality eg = 06 we shall have automatically disproved the whole iequality eg 06
5 APPENDIX The mea of the statistic T for the rak sum test ca be derived as follows First ote that the average of N umbers 1 2 N is N + 1/2 Uder the ull hypothesis T is the sum of 1 radomly picked umbers from the set of umbers So puttig N = their average is /2; therefore the sum of 1 radom umbers from the set would be expected o a average to be /2 If T is less tha this we would suspect that the media of the first populatio is less tha that of the secod ad coversely if T is greater tha this It is ot so simple to derive the stadard deviatio of T 3 Notice that throughout this chapter we talk about testig the ull hypothesis hopig to disprove it i favor of the alterative hypothesis Sometimes we wat to show that two treatmets are equivalet hopig to disprove the hypothesis that they are differet We might wish to show for example that two differet drugs lead to the same chages i blood pressure 1 ad 2 However it is doubtful whether we would ever be i a situatio where the equality 1 = 2 holds exactly ad we would probably be more iterested i kowig whether 1 2 > where is the largest value of the mea differece that is ot cliically sigificat I this case we let the ull hypothesis be 1 2 = that is 1 2 = 0 ad the alterative is two-sided: 1 2 < 0 or 1 2 > 0 which is the same as 1 2 > CHAPTER 8 Let the two hypotheses be H 0 ad H 1 with prior probabilities PH 0 = PH 0 is true ad PH 1 = PH 0 is false respectively ad deote the data D Thus we write: PH 1 D = PH 1 PD H 1 PH 1 PD H 1 + PH 0 PD H 0 First ote that because we assume PH 0 + PH 1 = 1 the deomiator of the fractio o the right is equal to PD that is we have PH 1 D = PH 1PD H 1 PD Now divide both sides by PH 0 D ad we obtai: PH 1 D PH 0 D = PH 1PD H 1 PDPH 0 D
6 358 APPENDIX But the deomiator o the right is PDPH 0 D = PD H 0 = PH 0 PD H 0 so that we have PH 1 D PH 0 D = PH 1PD H 1 PH 0 PD H 0 that is the posterior odds = the prior odds the likelihood ratio 1 For ay quatity x 2 we have Thus CHAPTER 9 x 2 = 1 1 x x 2 y1 π 1 π 1 1 π 1 = 1 π y1 π 1 1 π 1 1 π 1 + π y1 π 1 1 π 1 1 π 1 y1 π 1 y1 π 1 = π π 1 Now substitute y 1 = y 2 ad 1 = 1 2 i the umerator of the secod term; we obtai [ y1 π 1 y2 1 π2 ] 2 y1 π 1 y2 + π 2 + = + π 1 1 π 1 π 1 π 2 y1 π 1 y2 π 2 = + π 1 π 2 2 There is aother mathematically idetical formula for calculatig the Pearso chi-square statistic from a 2 2 cotigecy table Suppose we write the table as follows: The formula is the a b a+ b c d c+ d a + c b+ d N= a + b + c + d ad bc N a + bc + da + cb + d
7 APPENDIX 359 Sometimes this formula is modified to iclude a so-called correctio for cotiuity that makes the resultig chi-square smaller Provided each expected value is at least 5 however there is o eed for this modificatio 3 Suppose we calculate the usual cotigecy table chi-square to test for idepedece betwee two respose traits whe we have matched pairs As i Chapter 9 suppose each pair cosists of a ma ad a woma matched for age ad the respose variables are the cholesterol level of the ma ad the cholesterol level of the woma i each pair The the usual cotigecy table chi-square would be relevat for aswerig the questio: is the cholesterol level of the woma i the pair idepedet of the cholesterol level of the ma i the pair? That is does age the matchig variable have a commo effect o the cholesterol level of both me ad wome? I other words i this type of study the usual cotigecy table chi-square tests whether the matchig was ecessary a sigificat result idicatig that it was a osigificat result idicatig it was ot CHAPTER 10 1 For a sample of pairs let x ad y be the sample meas Defie the quatities: SS x = SS y = SS xy = x i x = x 2 i 1 2 x i yi y 2 = y 2 i 1 2 y i x i x y i y = x i y i 1 x i y i Thus SS x is the total sum of the squared deviatios of the x i from the mea x ad SS y is the total sum of the squared deviatios of the y i from the mea yss xy is the aalogous total sum of the cross-products; ad the same cosideratios gover which formula to use for calculatig this as gover the calculatio of SS x ad SS y See ote 2 for Chapter 3 i this Appedix Usig these quatities we successively calculate the estimated regressio coefficiet b 1 ad itercept b 0 for the regressio lie ŷ i = b 0 + b 1 x i as follows: b 1 = SS xy SS x
8 360 APPENDIX b 0 = y SS xy SS x x = y b 1 x The sum of squares i Table 101 ca the be calculated as SS R = SS2 xy SS x = b 1 SS xy SS E = SS y SS2 xy SS x = SS y b 1 SS xy ad as explaied the mea squares are obtaied by dividig the sums of the squares by their respective degrees of freedom that is Note that the total sum of squares is MS R = SS R 1 = SS R MS E = SS E 2 SS T = SS R + SS E = SS y ie the total sum of squares of the respose variable Y about its mea The stadard error of b 1 is MS E /SS x 2 The estimated regressio lie of X o y is give by b 1 = SS xy SS y b 0 = x SS xy SS y y = x b 1 y The covariace betwee X ad Y is estimated by ad the correlatio by SS xy 1 SS xy SSx SS y
9 APPENDIX It is istructive to check i the tables of the t-distributio ad F-distributio that the square of the 975th percetile of Studet s t with k df is equal to the 95th percetile of F with 1 ad k df Similarly the square of the 95th percetile of t with k df is equal to the 90th percetile of F with 1 ad k df Just as χ 2 with 1 df is the square of a ormal radom variable so F with 1 ad k df is the square of a radom variable distributed as t with k df Thus articles you read i the literature may use either of these equivalet test statistics CHAPTER 11 1 It is possible to calculate the mea squares i Table 113 from the data preseted i Table 112 The mea square amog drug groups is 10 the umber of patiets i each group times the variace of the four group meas ie 10 times the variace of the set of four umbers ad 90 The mea square withi drug groups is the pooled withi-group variace Sice each group cotais the same umber of patiets this is the simple average of the four variaces There are several multiple compariso procedures that ca be used to test which pairs of a set of meas are sigificatly differet The simplest Fisher s least sigificat differece method is to perform Studet s t-test o all the pairwise comparisos but to require i additio before makig the pairwise comparisos a sigificat overall F-test for the equality of the meas beig cosidered The t-tests are performed usig the pooled estimate of 2 give by the error mea square of the aalysis of variace Aother procedure the Fisher Boferroi method similarly uses multiple t-tests but each test must reach sigificace at the /c level where c is the total umber of comparisos made to esure a overall sigificace level of Several procedures due to Tukey Newma ad Keuls ad Duca begi by comparig the largest mea with the smallest ad cotiue with the ext largest differece ad so o util either a osigificat result is ecoutered or util all pairwise comparisos have bee made At each step the differece is compared to a appropriate ull distributio A further method Scheffé s allows for testig more complex cotrasts such as H 0 : = 0 as well as all pairwise comparisos of meas Fially there is a procedure Duett s aimed specifically at idetifyig group meas that are sigificatly differet from the mea of a cotrol group
10 362 APPENDIX All these tests are aimed at maitaiig the overall sigificace level at some fixed value ie esurig that if the ull hypothesis is true the probability of oe or more sigificat differeces beig foud is 3 We ow give simple rules for writig dow formulas for sums of squares that apply i the case of ay balaced desig ie ay desig i which all groups of a particular type are the same size The rules are easy to apply if the degrees of freedom are kow for each source of variatio Cosider first the amog groups sums of squares i Table The degrees of freedom are a 1 2 Write dow a pair of paretheses ad a square superscript 2 for each term i the degrees of freedom: 3 I frot of each pair of paretheses write the appropriate sig correspodig to the term i the degrees of freedom ad a summatio sig for each letter i the degrees of freedom: a 4 Put summatio sigs iside the paretheses for all factors ad replicates that are ot summed over outside the paretheses followed by the symbol for a typical observatio: a a y ik k=1 y ik 5 Fially divide each term by the umber of observatios summed withi the paretheses: SS A = a y 2 k=1 ik k=1 a k=1 y ik Applyig this same sequece of rules we obtai the sum of squares withi groups for Table 111 otig that we multiply out the expressio for the degrees of freedom 1 a a 2 a 3 a k=1 a
11 APPENDIX 363 a 4 y ik a 2 y ik k=1 k=1 a a k=1 yik 5 SS R = y 2 k=1 ik 1 Similarly for the ested factor aalysis of variace i Table 114 we obtai SS A = SS B = SS R = a b a b j=1 a b j=1 k=1 y j=1 k=1 ijk b y 2 k=1 ijk a y 2 ijk a b j=1 b j=1 k=1 y ijk a b For the two-way aalysis i Table 116 we obtai SS A = a b a b j=1 j=1 k=1 y ijk b y 2 k=1 ijk ab y j=1 k=1 ijk b y 2 k=1 ijk a b y j=1 k=1 ijk ab SS B = SS AB = a SS R = b a j=1 a b j=1 a y 2 b k=1 ijk y j=1 k=1 ijk a ab y 2 a b k=1 ijk y j=1 k=1 ijk b a y 2 b k=1 ijk y j=1 k=1 ijk + a ab a b y 2 ijk y 2 j=1 k=1 ijk b a j=1 b j=1 k=1 REFERENCE MacCluer JW ad Schull WJ 1963 O the estimatio of the frequecy of opaterity America Joural of Huma Geetics 15:
Topic 9: Sampling Distributions of Estimators
Topic 9: Samplig Distributios of Estimators Course 003, 2016 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be
More information1 Inferential Methods for Correlation and Regression Analysis
1 Iferetial Methods for Correlatio ad Regressio Aalysis I the chapter o Correlatio ad Regressio Aalysis tools for describig bivariate cotiuous data were itroduced. The sample Pearso Correlatio Coefficiet
More informationTopic 9: Sampling Distributions of Estimators
Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be
More informationProperties and Hypothesis Testing
Chapter 3 Properties ad Hypothesis Testig 3.1 Types of data The regressio techiques developed i previous chapters ca be applied to three differet kids of data. 1. Cross-sectioal data. 2. Time series data.
More informationTopic 9: Sampling Distributions of Estimators
Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be
More informationChapter 13, Part A Analysis of Variance and Experimental Design
Slides Prepared by JOHN S. LOUCKS St. Edward s Uiversity Slide 1 Chapter 13, Part A Aalysis of Variace ad Eperimetal Desig Itroductio to Aalysis of Variace Aalysis of Variace: Testig for the Equality of
More informationBecause it tests for differences between multiple pairs of means in one test, it is called an omnibus test.
Math 308 Sprig 018 Classes 19 ad 0: Aalysis of Variace (ANOVA) Page 1 of 6 Itroductio ANOVA is a statistical procedure for determiig whether three or more sample meas were draw from populatios with equal
More informationAgreement of CI and HT. Lecture 13 - Tests of Proportions. Example - Waiting Times
Sigificace level vs. cofidece level Agreemet of CI ad HT Lecture 13 - Tests of Proportios Sta102 / BME102 Coli Rudel October 15, 2014 Cofidece itervals ad hypothesis tests (almost) always agree, as log
More informationRandom Variables, Sampling and Estimation
Chapter 1 Radom Variables, Samplig ad Estimatio 1.1 Itroductio This chapter will cover the most importat basic statistical theory you eed i order to uderstad the ecoometric material that will be comig
More informationStat 200 -Testing Summary Page 1
Stat 00 -Testig Summary Page 1 Mathematicias are like Frechme; whatever you say to them, they traslate it ito their ow laguage ad forthwith it is somethig etirely differet Goethe 1 Large Sample Cofidece
More informationStatistics 511 Additional Materials
Cofidece Itervals o mu Statistics 511 Additioal Materials This topic officially moves us from probability to statistics. We begi to discuss makig ifereces about the populatio. Oe way to differetiate probability
More informationChapter 6 Sampling Distributions
Chapter 6 Samplig Distributios 1 I most experimets, we have more tha oe measuremet for ay give variable, each measuremet beig associated with oe radomly selected a member of a populatio. Hece we eed to
More information10. Comparative Tests among Spatial Regression Models. Here we revisit the example in Section 8.1 of estimating the mean of a normal random
Part III. Areal Data Aalysis 0. Comparative Tests amog Spatial Regressio Models While the otio of relative likelihood values for differet models is somewhat difficult to iterpret directly (as metioed above),
More informationSNAP Centre Workshop. Basic Algebraic Manipulation
SNAP Cetre Workshop Basic Algebraic Maipulatio 8 Simplifyig Algebraic Expressios Whe a expressio is writte i the most compact maer possible, it is cosidered to be simplified. Not Simplified: x(x + 4x)
More informationRecall the study where we estimated the difference between mean systolic blood pressure levels of users of oral contraceptives and non-users, x - y.
Testig Statistical Hypotheses Recall the study where we estimated the differece betwee mea systolic blood pressure levels of users of oral cotraceptives ad o-users, x - y. Such studies are sometimes viewed
More informationOverview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions
Chapter 9 Slide Ifereces from Two Samples 9- Overview 9- Ifereces about Two Proportios 9- Ifereces about Two Meas: Idepedet Samples 9-4 Ifereces about Matched Pairs 9-5 Comparig Variatio i Two Samples
More informationThis is an introductory course in Analysis of Variance and Design of Experiments.
1 Notes for M 384E, Wedesday, Jauary 21, 2009 (Please ote: I will ot pass out hard-copy class otes i future classes. If there are writte class otes, they will be posted o the web by the ight before class
More informationEcon 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara
Poit Estimator Eco 325 Notes o Poit Estimator ad Cofidece Iterval 1 By Hiro Kasahara Parameter, Estimator, ad Estimate The ormal probability desity fuctio is fully characterized by two costats: populatio
More informationt distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference
EXST30 Backgroud material Page From the textbook The Statistical Sleuth Mea [0]: I your text the word mea deotes a populatio mea (µ) while the work average deotes a sample average ( ). Variace [0]: The
More information6.3 Testing Series With Positive Terms
6.3. TESTING SERIES WITH POSITIVE TERMS 307 6.3 Testig Series With Positive Terms 6.3. Review of what is kow up to ow I theory, testig a series a i for covergece amouts to fidig the i= sequece of partial
More informationResampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.
Jauary 1, 2019 Resamplig Methods Motivatio We have so may estimators with the property θ θ d N 0, σ 2 We ca also write θ a N θ, σ 2 /, where a meas approximately distributed as Oce we have a cosistet estimator
More informationCommon Large/Small Sample Tests 1/55
Commo Large/Small Sample Tests 1/55 Test of Hypothesis for the Mea (σ Kow) Covert sample result ( x) to a z value Hypothesis Tests for µ Cosider the test H :μ = μ H 1 :μ > μ σ Kow (Assume the populatio
More informationLecture 5: Parametric Hypothesis Testing: Comparing Means. GENOME 560, Spring 2016 Doug Fowler, GS
Lecture 5: Parametric Hypothesis Testig: Comparig Meas GENOME 560, Sprig 2016 Doug Fowler, GS (dfowler@uw.edu) 1 Review from last week What is a cofidece iterval? 2 Review from last week What is a cofidece
More informationA quick activity - Central Limit Theorem and Proportions. Lecture 21: Testing Proportions. Results from the GSS. Statistics and the General Population
A quick activity - Cetral Limit Theorem ad Proportios Lecture 21: Testig Proportios Statistics 10 Coli Rudel Flip a coi 30 times this is goig to get loud! Record the umber of heads you obtaied ad calculate
More informationPower and Type II Error
Statistical Methods I (EXST 7005) Page 57 Power ad Type II Error Sice we do't actually kow the value of the true mea (or we would't be hypothesizig somethig else), we caot kow i practice the type II error
More informationBIOSTATISTICS. Lecture 5 Interval Estimations for Mean and Proportion. dr. Petr Nazarov
Microarray Ceter BIOSTATISTICS Lecture 5 Iterval Estimatios for Mea ad Proportio dr. Petr Nazarov 15-03-013 petr.azarov@crp-sate.lu Lecture 5. Iterval estimatio for mea ad proportio OUTLINE Iterval estimatios
More informationStatistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.
Statistical Iferece (Chapter 10) Statistical iferece = lear about a populatio based o the iformatio provided by a sample. Populatio: The set of all values of a radom variable X of iterest. Characterized
More informationCEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering
CEE 5 Autum 005 Ucertaity Cocepts for Geotechical Egieerig Basic Termiology Set A set is a collectio of (mutually exclusive) objects or evets. The sample space is the (collectively exhaustive) collectio
More informationIf, for instance, we were required to test whether the population mean μ could be equal to a certain value μ
STATISTICAL INFERENCE INTRODUCTION Statistical iferece is that brach of Statistics i which oe typically makes a statemet about a populatio based upo the results of a sample. I oesample testig, we essetially
More informationChapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc.
Chapter 22 Comparig Two Proportios Copyright 2010, 2007, 2004 Pearso Educatio, Ic. Comparig Two Proportios Read the first two paragraphs of pg 504. Comparisos betwee two percetages are much more commo
More informationMath 140 Introductory Statistics
8.2 Testig a Proportio Math 1 Itroductory Statistics Professor B. Abrego Lecture 15 Sectios 8.2 People ofte make decisios with data by comparig the results from a sample to some predetermied stadard. These
More informationTopic 18: Composite Hypotheses
Toc 18: November, 211 Simple hypotheses limit us to a decisio betwee oe of two possible states of ature. This limitatio does ot allow us, uder the procedures of hypothesis testig to address the basic questio:
More informationFACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures
FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING Lectures MODULE 5 STATISTICS II. Mea ad stadard error of sample data. Biomial distributio. Normal distributio 4. Samplig 5. Cofidece itervals
More informationLecture 2: Monte Carlo Simulation
STAT/Q SCI 43: Itroductio to Resamplig ethods Sprig 27 Istructor: Ye-Chi Che Lecture 2: ote Carlo Simulatio 2 ote Carlo Itegratio Assume we wat to evaluate the followig itegratio: e x3 dx What ca we do?
More informationMOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND.
XI-1 (1074) MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND. R. E. D. WOOLSEY AND H. S. SWANSON XI-2 (1075) STATISTICAL DECISION MAKING Advaced
More information3/3/2014. CDS M Phil Econometrics. Types of Relationships. Types of Relationships. Types of Relationships. Vijayamohanan Pillai N.
3/3/04 CDS M Phil Old Least Squares (OLS) Vijayamohaa Pillai N CDS M Phil Vijayamoha CDS M Phil Vijayamoha Types of Relatioships Oly oe idepedet variable, Relatioship betwee ad is Liear relatioships Curviliear
More informationS Y Y = ΣY 2 n. Using the above expressions, the correlation coefficient is. r = SXX S Y Y
1 Sociology 405/805 Revised February 4, 004 Summary of Formulae for Bivariate Regressio ad Correlatio Let X be a idepedet variable ad Y a depedet variable, with observatios for each of the values of these
More informationMATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4
MATH 30: Probability ad Statistics 9. Estimatio ad Testig of Parameters Estimatio ad Testig of Parameters We have bee dealig situatios i which we have full kowledge of the distributio of a radom variable.
More informationLecture 7: Properties of Random Samples
Lecture 7: Properties of Radom Samples 1 Cotiued From Last Class Theorem 1.1. Let X 1, X,...X be a radom sample from a populatio with mea µ ad variace σ
More informationDescribing the Relation between Two Variables
Copyright 010 Pearso Educatio, Ic. Tables ad Formulas for Sulliva, Statistics: Iformed Decisios Usig Data 010 Pearso Educatio, Ic Chapter Orgaizig ad Summarizig Data Relative frequecy = frequecy sum of
More information2 1. The r.s., of size n2, from population 2 will be. 2 and 2. 2) The two populations are independent. This implies that all of the n1 n2
Chapter 8 Comparig Two Treatmets Iferece about Two Populatio Meas We wat to compare the meas of two populatios to see whether they differ. There are two situatios to cosider, as show i the followig examples:
More informationChapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc.
Chapter 22 Comparig Two Proportios Copyright 2010 Pearso Educatio, Ic. Comparig Two Proportios Comparisos betwee two percetages are much more commo tha questios about isolated percetages. Ad they are more
More informationST 305: Exam 3 ( ) = P(A)P(B A) ( ) = P(A) + P(B) ( ) = 1 P( A) ( ) = P(A) P(B) σ X 2 = σ a+bx. σ ˆp. σ X +Y. σ X Y. σ X. σ Y. σ n.
ST 305: Exam 3 By hadig i this completed exam, I state that I have either give or received assistace from aother perso durig the exam period. I have used o resources other tha the exam itself ad the basic
More informationStatistics 20: Final Exam Solutions Summer Session 2007
1. 20 poits Testig for Diabetes. Statistics 20: Fial Exam Solutios Summer Sessio 2007 (a) 3 poits Give estimates for the sesitivity of Test I ad of Test II. Solutio: 156 patiets out of total 223 patiets
More informationGG313 GEOLOGICAL DATA ANALYSIS
GG313 GEOLOGICAL DATA ANALYSIS 1 Testig Hypothesis GG313 GEOLOGICAL DATA ANALYSIS LECTURE NOTES PAUL WESSEL SECTION TESTING OF HYPOTHESES Much of statistics is cocered with testig hypothesis agaist data
More informationEcon 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1.
Eco 325/327 Notes o Sample Mea, Sample Proportio, Cetral Limit Theorem, Chi-square Distributio, Studet s t distributio 1 Sample Mea By Hiro Kasahara We cosider a radom sample from a populatio. Defiitio
More informationImportant Formulas. Expectation: E (X) = Σ [X P(X)] = n p q σ = n p q. P(X) = n! X1! X 2! X 3! X k! p X. Chapter 6 The Normal Distribution.
Importat Formulas Chapter 3 Data Descriptio Mea for idividual data: X = _ ΣX Mea for grouped data: X= _ Σf X m Stadard deviatio for a sample: _ s = Σ(X _ X ) or s = 1 (Σ X ) (Σ X ) ( 1) Stadard deviatio
More informationLet us give one more example of MLE. Example 3. The uniform distribution U[0, θ] on the interval [0, θ] has p.d.f.
Lecture 5 Let us give oe more example of MLE. Example 3. The uiform distributio U[0, ] o the iterval [0, ] has p.d.f. { 1 f(x =, 0 x, 0, otherwise The likelihood fuctio ϕ( = f(x i = 1 I(X 1,..., X [0,
More information1 Models for Matched Pairs
1 Models for Matched Pairs Matched pairs occur whe we aalyse samples such that for each measuremet i oe of the samples there is a measuremet i the other sample that directly relates to the measuremet i
More informationInvestigating the Significance of a Correlation Coefficient using Jackknife Estimates
Iteratioal Joural of Scieces: Basic ad Applied Research (IJSBAR) ISSN 2307-4531 (Prit & Olie) http://gssrr.org/idex.php?joural=jouralofbasicadapplied ---------------------------------------------------------------------------------------------------------------------------
More informationSample Size Determination (Two or More Samples)
Sample Sie Determiatio (Two or More Samples) STATGRAPHICS Rev. 963 Summary... Data Iput... Aalysis Summary... 5 Power Curve... 5 Calculatios... 6 Summary This procedure determies a suitable sample sie
More informationSequences. Notation. Convergence of a Sequence
Sequeces A sequece is essetially just a list. Defiitio (Sequece of Real Numbers). A sequece of real umbers is a fuctio Z (, ) R for some real umber. Do t let the descriptio of the domai cofuse you; it
More informationUnderstanding Samples
1 Will Moroe CS 109 Samplig ad Bootstrappig Lecture Notes #17 August 2, 2017 Based o a hadout by Chris Piech I this chapter we are goig to talk about statistics calculated o samples from a populatio. We
More informationMathematical Notation Math Introduction to Applied Statistics
Mathematical Notatio Math 113 - Itroductio to Applied Statistics Name : Use Word or WordPerfect to recreate the followig documets. Each article is worth 10 poits ad ca be prited ad give to the istructor
More informationCorrelation Regression
Correlatio Regressio While correlatio methods measure the stregth of a liear relatioship betwee two variables, we might wish to go a little further: How much does oe variable chage for a give chage i aother
More informationLinear Regression Demystified
Liear Regressio Demystified Liear regressio is a importat subject i statistics. I elemetary statistics courses, formulae related to liear regressio are ofte stated without derivatio. This ote iteds to
More informationTopic 10: Introduction to Estimation
Topic 0: Itroductio to Estimatio Jue, 0 Itroductio I the simplest possible terms, the goal of estimatio theory is to aswer the questio: What is that umber? What is the legth, the reactio rate, the fractio
More informationTABLES AND FORMULAS FOR MOORE Basic Practice of Statistics
TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics Explorig Data: Distributios Look for overall patter (shape, ceter, spread) ad deviatios (outliers). Mea (use a calculator): x = x 1 + x 2 + +
More informationApril 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE
April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE TERRY SOO Abstract These otes are adapted from whe I taught Math 526 ad meat to give a quick itroductio to cofidece
More information7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals
7-1 Chapter 4 Part I. Samplig Distributios ad Cofidece Itervals 1 7- Sectio 1. Samplig Distributio 7-3 Usig Statistics Statistical Iferece: Predict ad forecast values of populatio parameters... Test hypotheses
More informationPSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 9
Hypothesis testig PSYCHOLOGICAL RESEARCH (PYC 34-C Lecture 9 Statistical iferece is that brach of Statistics i which oe typically makes a statemet about a populatio based upo the results of a sample. I
More informationEXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY
EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE IN STATISTICS, 017 MODULE 4 : Liear models Time allowed: Oe ad a half hours Cadidates should aswer THREE questios. Each questio carries
More informationSample Size Estimation in the Proportional Hazards Model for K-sample or Regression Settings Scott S. Emerson, M.D., Ph.D.
ample ie Estimatio i the Proportioal Haards Model for K-sample or Regressio ettigs cott. Emerso, M.D., Ph.D. ample ie Formula for a Normally Distributed tatistic uppose a statistic is kow to be ormally
More informationLECTURE 14 NOTES. A sequence of α-level tests {ϕ n (x)} is consistent if
LECTURE 14 NOTES 1. Asymptotic power of tests. Defiitio 1.1. A sequece of -level tests {ϕ x)} is cosistet if β θ) := E θ [ ϕ x) ] 1 as, for ay θ Θ 1. Just like cosistecy of a sequece of estimators, Defiitio
More informationEconomics Spring 2015
1 Ecoomics 400 -- Sprig 015 /17/015 pp. 30-38; Ch. 7.1.4-7. New Stata Assigmet ad ew MyStatlab assigmet, both due Feb 4th Midterm Exam Thursday Feb 6th, Chapters 1-7 of Groeber text ad all relevat lectures
More informationOutput Analysis and Run-Length Control
IEOR E4703: Mote Carlo Simulatio Columbia Uiversity c 2017 by Marti Haugh Output Aalysis ad Ru-Legth Cotrol I these otes we describe how the Cetral Limit Theorem ca be used to costruct approximate (1 α%
More informationTable 12.1: Contingency table. Feature b. 1 N 11 N 12 N 1b 2 N 21 N 22 N 2b. ... a N a1 N a2 N ab
Sectio 12 Tests of idepedece ad homogeeity I this lecture we will cosider a situatio whe our observatios are classified by two differet features ad we would like to test if these features are idepedet
More information11 Correlation and Regression
11 Correlatio Regressio 11.1 Multivariate Data Ofte we look at data where several variables are recorded for the same idividuals or samplig uits. For example, at a coastal weather statio, we might record
More informationGoodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen)
Goodess-of-Fit Tests ad Categorical Data Aalysis (Devore Chapter Fourtee) MATH-252-01: Probability ad Statistics II Sprig 2019 Cotets 1 Chi-Squared Tests with Kow Probabilities 1 1.1 Chi-Squared Testig................
More informationThis chapter focuses on two experimental designs that are crucial to comparative studies: (1) independent samples and (2) matched pair samples.
Chapter 9 & : Comparig Two Treatmets: This chapter focuses o two eperimetal desigs that are crucial to comparative studies: () idepedet samples ad () matched pair samples Idepedet Radom amples from Two
More informationDS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10
DS 00: Priciples ad Techiques of Data Sciece Date: April 3, 208 Name: Hypothesis Testig Discussio #0. Defie these terms below as they relate to hypothesis testig. a) Data Geeratio Model: Solutio: A set
More informationDirection: This test is worth 150 points. You are required to complete this test within 55 minutes.
Term Test 3 (Part A) November 1, 004 Name Math 6 Studet Number Directio: This test is worth 10 poits. You are required to complete this test withi miutes. I order to receive full credit, aswer each problem
More information[412] A TEST FOR HOMOGENEITY OF THE MARGINAL DISTRIBUTIONS IN A TWO-WAY CLASSIFICATION
[412] A TEST FOR HOMOGENEITY OF THE MARGINAL DISTRIBUTIONS IN A TWO-WAY CLASSIFICATION BY ALAN STUART Divisio of Research Techiques, Lodo School of Ecoomics 1. INTRODUCTION There are several circumstaces
More informationLecture 6 Simple alternatives and the Neyman-Pearson lemma
STATS 00: Itroductio to Statistical Iferece Autum 06 Lecture 6 Simple alteratives ad the Neyma-Pearso lemma Last lecture, we discussed a umber of ways to costruct test statistics for testig a simple ull
More informationRegression, Inference, and Model Building
Regressio, Iferece, ad Model Buildig Scatter Plots ad Correlatio Correlatio coefficiet, r -1 r 1 If r is positive, the the scatter plot has a positive slope ad variables are said to have a positive relatioship
More informationFinal Review. Fall 2013 Prof. Yao Xie, H. Milton Stewart School of Industrial Systems & Engineering Georgia Tech
Fial Review Fall 2013 Prof. Yao Xie, yao.xie@isye.gatech.edu H. Milto Stewart School of Idustrial Systems & Egieerig Georgia Tech 1 Radom samplig model radom samples populatio radom samples: x 1,..., x
More informationContinuous Data that can take on any real number (time/length) based on sample data. Categorical data can only be named or categorised
Questio 1. (Topics 1-3) A populatio cosists of all the members of a group about which you wat to draw a coclusio (Greek letters (μ, σ, Ν) are used) A sample is the portio of the populatio selected for
More informationStatistical Hypothesis Testing. STAT 536: Genetic Statistics. Statistical Hypothesis Testing - Terminology. Hardy-Weinberg Disequilibrium
Statistical Hypothesis Testig STAT 536: Geetic Statistics Kari S. Dorma Departmet of Statistics Iowa State Uiversity September 7, 006 Idetify a hypothesis, a idea you wat to test for its applicability
More informationACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 1 MATH00030 SEMESTER / Statistics
ACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 1 MATH00030 SEMESTER 1 018/019 DR. ANTHONY BROWN 8. Statistics 8.1. Measures of Cetre: Mea, Media ad Mode. If we have a series of umbers the
More informationFrequentist Inference
Frequetist Iferece The topics of the ext three sectios are useful applicatios of the Cetral Limit Theorem. Without kowig aythig about the uderlyig distributio of a sequece of radom variables {X i }, for
More informationLesson 2. Projects and Hand-ins. Hypothesis testing Chaptre 3. { } x=172.0 = 3.67
Lesso 7--7 Chaptre 3 Projects ad Had-is Project I: latest ovember Project I: latest december Laboratio Measuremet systems aalysis I: latest december Project - are volutary. Laboratio is obligatory. Give
More informationStatistical inference: example 1. Inferential Statistics
Statistical iferece: example 1 Iferetial Statistics POPULATION SAMPLE A clothig store chai regularly buys from a supplier large quatities of a certai piece of clothig. Each item ca be classified either
More informationLecture Notes 15 Hypothesis Testing (Chapter 10)
1 Itroductio Lecture Notes 15 Hypothesis Testig Chapter 10) Let X 1,..., X p θ x). Suppose we we wat to kow if θ = θ 0 or ot, where θ 0 is a specific value of θ. For example, if we are flippig a coi, we
More informationMidtermII Review. Sta Fall Office Hours Wednesday 12:30-2:30pm Watch linear regression videos before lab on Thursday
Aoucemets MidtermII Review Sta 101 - Fall 2016 Duke Uiversity, Departmet of Statistical Sciece Office Hours Wedesday 12:30-2:30pm Watch liear regressio videos before lab o Thursday Dr. Abrahamse Slides
More informationChapter 23: Inferences About Means
Chapter 23: Ifereces About Meas Eough Proportios! We ve spet the last two uits workig with proportios (or qualitative variables, at least) ow it s time to tur our attetios to quatitative variables. For
More informationCS284A: Representations and Algorithms in Molecular Biology
CS284A: Represetatios ad Algorithms i Molecular Biology Scribe Notes o Lectures 3 & 4: Motif Discovery via Eumeratio & Motif Represetatio Usig Positio Weight Matrix Joshua Gervi Based o presetatios by
More informationUNIVERSITY OF TORONTO Faculty of Arts and Science APRIL/MAY 2009 EXAMINATIONS ECO220Y1Y PART 1 OF 2 SOLUTIONS
PART of UNIVERSITY OF TORONTO Faculty of Arts ad Sciece APRIL/MAY 009 EAMINATIONS ECO0YY PART OF () The sample media is greater tha the sample mea whe there is. (B) () A radom variable is ormally distributed
More informationActivity 3: Length Measurements with the Four-Sided Meter Stick
Activity 3: Legth Measuremets with the Four-Sided Meter Stick OBJECTIVE: The purpose of this experimet is to study errors ad the propagatio of errors whe experimetal data derived usig a four-sided meter
More informationInfinite Sequences and Series
Chapter 6 Ifiite Sequeces ad Series 6.1 Ifiite Sequeces 6.1.1 Elemetary Cocepts Simply speakig, a sequece is a ordered list of umbers writte: {a 1, a 2, a 3,...a, a +1,...} where the elemets a i represet
More informationParameter, Statistic and Random Samples
Parameter, Statistic ad Radom Samples A parameter is a umber that describes the populatio. It is a fixed umber, but i practice we do ot kow its value. A statistic is a fuctio of the sample data, i.e.,
More informationDiscrete Mathematics for CS Spring 2008 David Wagner Note 22
CS 70 Discrete Mathematics for CS Sprig 2008 David Wager Note 22 I.I.D. Radom Variables Estimatig the bias of a coi Questio: We wat to estimate the proportio p of Democrats i the US populatio, by takig
More informationChapter 11: Asking and Answering Questions About the Difference of Two Proportions
Chapter 11: Askig ad Aswerig Questios About the Differece of Two Proportios These otes reflect material from our text, Statistics, Learig from Data, First Editio, by Roxy Peck, published by CENGAGE Learig,
More informationThe variance of a sum of independent variables is the sum of their variances, since covariances are zero. Therefore. V (xi )= n n 2 σ2 = σ2.
SAMPLE STATISTICS A radom sample x 1,x,,x from a distributio f(x) is a set of idepedetly ad idetically variables with x i f(x) for all i Their joit pdf is f(x 1,x,,x )=f(x 1 )f(x ) f(x )= f(x i ) The sample
More informationMath 152. Rumbos Fall Solutions to Review Problems for Exam #2. Number of Heads Frequency
Math 152. Rumbos Fall 2009 1 Solutios to Review Problems for Exam #2 1. I the book Experimetatio ad Measuremet, by W. J. Youde ad published by the by the Natioal Sciece Teachers Associatio i 1962, the
More informationOutput Analysis (2, Chapters 10 &11 Law)
B. Maddah ENMG 6 Simulatio Output Aalysis (, Chapters 10 &11 Law) Comparig alterative system cofiguratio Sice the output of a simulatio is radom, the comparig differet systems via simulatio should be doe
More informationSTA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to:
STA 2023 Module 10 Comparig Two Proportios Learig Objectives Upo completig this module, you should be able to: 1. Perform large-sample ifereces (hypothesis test ad cofidece itervals) to compare two populatio
More informationMATH/STAT 352: Lecture 15
MATH/STAT 352: Lecture 15 Sectios 5.2 ad 5.3. Large sample CI for a proportio ad small sample CI for a mea. 1 5.2: Cofidece Iterval for a Proportio Estimatig proportio of successes i a biomial experimet
More informationInferential Statistics. Inference Process. Inferential Statistics and Probability a Holistic Approach. Inference Process.
Iferetial Statistics ad Probability a Holistic Approach Iferece Process Chapter 8 Poit Estimatio ad Cofidece Itervals This Course Material by Maurice Geraghty is licesed uder a Creative Commos Attributio-ShareAlike
More informationSTATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS. Comments:
Recall: STATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS Commets:. So far we have estimates of the parameters! 0 ad!, but have o idea how good these estimates are. Assumptio: E(Y x)! 0 +! x (liear coditioal
More informationConfidence intervals summary Conservative and approximate confidence intervals for a binomial p Examples. MATH1005 Statistics. Lecture 24. M.
MATH1005 Statistics Lecture 24 M. Stewart School of Mathematics ad Statistics Uiversity of Sydey Outlie Cofidece itervals summary Coservative ad approximate cofidece itervals for a biomial p The aïve iterval
More information