Inference and Regression

Size: px
Start display at page:

Download "Inference and Regression"

Transcription

1 Inference and Regression Assignment 4 - Solutions Department of IOMS Professor William Greene Phone: Office: KMC 7-90 Home page: wgreene@stern.nyu.edu Course web page: 1. The following 16 numbers came from normal random number generator on a computer: a. What would you guess the mean and variance (µ and σ 2 ) of the generating normal distribution were? b. Give 90%, 95%, and 99% confidence intervals for µ. c. How much larger a sample do you think you would need to halve the length of the confidence interval for µ? a. The sample mean is The sample variance is b. Confidence intervals (based on the t distribution with 15 degrees of freedom). The standard error of the mean is /sqr() where =16. The 90% confidence interval is / ( /4) = The 95% confidence interval is / ( /4) = The 90% confidence interval is / ( /4) = c. Confidence intervals for σ. The confidence intervals for σ 2 are from (-1)s 2 /χ 2 < σ 2 < (-1)s 2 /χ 2 Where the terminal chi squareds are for (1-α/2) and α/2. =16, s 2 = The critical chi squared values are For 1%, (4.60,32.80), 5%, (6.26,27.49), 10%(7.26,25.00) The intervals for σ 2 are as follows: s = The 99% confidence interval 15(s 2 )/32.80 to 15(s 2 )/4.60 = The 95% confidence interval 15(s 2 )/27.49 to 15(s 2 )/6.26 = The 90% confidence interval 15(s 2 )/25.00 to 15(s 2 )/7.26 = Use the square roots for the interval for σ. ( ), ( ), ( ). d. The length of the confidence interval is 2(t*)s/4 where s is the sample standard deviation and 4 is the square root of the sample size. To halve the length of the interval, that 4 must become an 8, so must go from 16 to 64 up by a factor of Suppose that X 1, X 2,..., X n are i.i.d. (µ, σ 2 ), where µ and σ are unknown. How should the constant c be chosen so that the interval (, X + c) is a 95% confidence interval for µ; that is, c should be chosen so that P( < μ X + c) =.95.

2 c should be chosen so that the probability to the left of c is.95. That would be standard deviations above x-bar. So that the probability to the right of c is A coin is thrown independently 10 times to test the hypothesis that the probability of heads is 1/2 versus the alternative that the probability is not 1/2. The test rejects if either 0 or 10 heads are observed. a. What is the significance level of the test? b. If in fact the probability of heads is.1, what is the power of the test? a. The probability of a type 1 error is the Prob(X=0 or 10 p=.5) = 2*1/1024 = 1/512. b. The power of the test is the probability it will reject p=.5 given that p really is.1. This is the probability of 0 p=.1 + probability of 10 p=.1 = = True or false, and state why: a. The significance level of a statistical test is equal to the probability that the null hypothesis is true. b. If the significance level of a test is decreased, the power would be expected to increase. c. If a hypothesis is rejected at the significance level α, the probability that the null hypothesis is true equals α. d. The probability that the null hypothesis is falsely rejected is equal to the power of the test. e. A type I error occurs when the test statistic falls in the rejection region of the test. f. A type II error is more serious than a type I error. g. The power of a test is determined by the null distribution of the test statistic. h. The likelihood ratio is a random variable. a. False. The significance level is the probability that the test will reject the null hypothesis when it is true. b. If the significance level of a test is decreased, the power would be expected to increase. The significance level, α is the probability that the null hypothesis will be rejected when it is true. The power of the test will be the probability that the hypothesis is rejected when it is false. The power of the test is a function of the alternative hypothesis. In general, decreasing the significance level of a test makes it harder to reject the hypothesis. This should operate similarly on the power. Take an example. Suppose the hypothesis is that the mean of a normal distribution is zero. If the significance level is.05, the test is rejected if the sample mean is less than standard errors or greater than standard errors. Suppose the specific alternative is that the mean is.1. Then, the power of the test is the probability that the mean will be greater than 1.86 standard errors from.1 or less than standard errors from.1. The power is ow, reduce the significance level to.01. The probability of the type I error is now.01. The probability that the null will be rejected when the alternative, µ =.1, is true becomes Reducing the significance level reduces the power of the test as well. What does increase when the significance is reduced is the probability of a type II error, not rejecting the null when it is false. This is one minus the power. The probabiliy of a type II error increases when the probability of a type I error decreases. The power of the test is one minus the probability of a type II error. So, the answer to the question is FALSE. c. False. The probability is 1 - α, not α. d. False. The probability that the null is falsely rejected is 1 - α e. True, but it must be qualified that the null hypothesis must also be true. f. ot necessarily. It depends on the application. g. False. The power is determined by the alternative. h. True. It is a function of a statistic computed from the data. 5. Let X 1,..., X 25 be a sample from a normal distribution having a variance of 100. Find the rejection region for a test at level α =.10 of H 0 : μ = 0 versus H A : μ = 1.5. What is the power of the test? Repeat for α =.01.

3 The rejection region is the values of x-bar that are greater than 1.28 standard errors above zero That is, where the probability that x-bar is above the value is.1. That is 1.28 standard errors. The standard error is σ/sqr(n) = 10/5 = 2. So, the rejection region is values of x-bar above The power of the test is the probability that it will reject H 0 when it is false. That is the probability that x-bar will exceed 2.56 when µ is 1.5. If µ is 1.5, then this is the probability that z is greater than ( )/2 = prob(z >.53) = If α is.01, then the 1.28 becomes 2.33 so the critical value is The probability that z exceeds 4.66 if µ is 1.5 is Prob(z > ( )/2) = Prob(z > 1.58) = Let X 1,..., X n be a random sample from an exponential distribution with the density function f (x θ) = θexp[ θx]. Derive a likelihood ratio test of H 0 : θ = θ 0 versus H A : θ θ 0, and show that the rejection region is of the form { x exp[ θ 0 x ] c}. The likelihood under the null hypothesis is θ exp( θ Σ x ). Under the alternative, 0 0 the likelihood is maximized by setting θ = 1/x. So, the likelihood under the alternative hypothesis is (1/x) exp( 1/ x Σ x ). But, Σ x = x, so the likelihood i i i i under the alternative is (1/x) exp( ). The likelihood ratio is 0 exp( 0x) exp( 0x) 0 = θ 0 = θ0 θ θ θ θ L = ( x) ( x exp( x) ). (1/x) exp( ) exp( 1) exp( 1) ( ) The rejection region is values of x for which x exp( θ x) is small. 7. a. In 1965, a newspaper carried a story about a high school student who reported getting 9207 heads and 8743 tails in 17,950 coin tosses. Is this a significant discrepancy from the null hypothesis H 0 : p = ½? b. Jack Youden, a statistician at the ational Bureau of Standards, contacted the student and asked him exactly how he had performed the experiment (Youden 1974). To save time, the student had tossed groups of five coins at a time, and a younger brother had recorded the results, shown in the following table: umber of Heads Frequency Are the data consistent with the hypothesis that all the coins were fair (p = ½)? a. The mean would be.5(17950) = 8975 and the variance =.5(.5)17950 = The standard deviation is heads is ( )/ = standard deviations from the mean. Using the normal approximation, this seems like a significant difference. b. In tosses of 5 coins, assuming they are fair, the probabilities are 1/32, 5/32, 10/32, 10/32, 5/32, 1/32 = (.03125,.15625,.3125,.3125,.15625,.03125). The expected values in 3590 tosses of 5 coins are (112.18, , , , , ). Computing the chi-squared = Σ [(Observed Expected) 2 /Expected] = with 5 degrees of freedom. The critical value is about Something wrong in here. i 0 i 8. If X ~ (μ x, σ x 2 ) and Y is independent (μ y, σ y 2 ), what is π = P(X < Y ) in terms of μ x, μ y, σ x, and σ y?

4 Prob(X < Y) = Prob(X Y < 0). Z = X Y is normally distributed with mean µ X - µ Y and variance σ X 2 + σ Y 2. So, the desired probability is prob(z < [0 (µ x - µ y )]/sqr(σ X 2 + σ Y 2 )]. 9. Lin, Sutton, and Qurashi (1979) compared microbiological and hydroxylamine methods for the analysis of ampicillin dosages. In one series of experiments, pairs of tablets were analyzed by the two methods. The data in the following table give the percentages of claimed amount of ampicillin found by the two methods in several pairs of tablets. What are E[X Y] and the standard deviation of X Y? If the pairing had been erroneously ignored and it had been assumed that the two samples were independent, what would have been the estimate of the standard deviation of X Y? Analyze the data to determine if there is a systematic difference between the two methods. Microbiological Method Hydroxylamine Method The means are and The difference is The standard deviation of x-bar y-bar is the square root of Var(x-bar)+var(ybar)-2Cov(xbar,ybar) = sqr( ( ))/15 =.308. If it were assumed that the pairings were independent, the covariance term in the result above would be omitted, and the standard deviation of xbar-ybar would be estimated by sqr( )/15 = Are they different? Rice seems to think not. Whichever method is used, the difference in the means is not statistically different from zero. The independence assumption seems dubious given that the correlation of the two sets of values is.98.

5 10. A market research team conducted a survey to investigate the relationship of personality to attitude toward small cars. A sample of 250 adults in a metropolitan area were asked to fill out a 16-item selfperception questionnaire, on the bases of which they were classified into three types: cautions conservative, middle of the roader and confident explorer. There were then asked to five their overall opionion of small cars, favorable, neutral, or unfavorable. Is there a relationship between personality type and attitude toward small cars? If so, what is the nature of the relationship? Personality types Attitude Cautious Midroad Explorer Favorable eutral Unfavorable There is an error in the problem. The table contains 299 responses, not 250. Use the chi squared test of independence. The observed frequencies are in the table. I convert these to proportions Cautious Midroad Explorer Total Favorable.264 (.205).194 (.208).164 (.208).622 eutral.033 (.030).027 (.030).030 (.030).090 Unfavorable.033 (.095).114 (.096).141* (.096).288 (* rounded up) Total Expected proportions are the products of the marginals, given in parentheses. The chi squared is 299 times the sum over the 9 cells of [(observed expected) 2 /expected] matrix ; list ; a=[.264,.194,.164/.033,.027,.030/.033,.114,.141] $ A matrix ; list ; cs = 1'a$ matrix ; list ; rs = a*[1/1/1] $ matrix ; list ; f = rs*cs $ F matrix ; list ; d = a-f $ Observed - expected matrix ; list ; dd = dirp(d,d) $ Square element by element matrix ; list ; fi = diri(f)$ Reciprocals of elements matrix ; list ; c = dirp(dd,fi)$ Product, element by element matrix ; list ; c2 = 299* 1'c*[1/1/1]$ Sum all elements * 299 result is This is a chi squared with (3-1)(3-1) = 4 degrees of freedom. This is much larger than the critical chi squared with 4 degrees of freedom of The hypothesis of independence is rejected. The test does not imply the nature of the relationship; it merely suggests that the two opinions are correlated. 11. A random variable X is said to follow a lognormal distribution if Y = ln(x) follows a normal distribution. The lognormal is sometimes used as a model for heavy tailed skewed distributions. The following data are the dorsal lengths in millimeters of taxonomically distinct octopods. a. Calculate the density function of the lognormal distribution. (What are the parameters µ and σ.) b. Examine whether the lognoral distribution roughly fits the data.

6 Hint: According to the problem, the data given come from a lognormal population. Thus, logs (base e) will be normally distributed. Take the logs, then sort the data. Divide the observed data into a set of five ranges. Observe the proportions of observations that fall in those ranges. You can compute the counterparts to these proportions as the predicted probabilities for the ranges from a normal distribution with the mean and variance that occur in the data. A chi squared goodness of fit can then be used. These are the logs of the sorted observations. The mean is The standard deviation is a. If x ~ [µ,σ 2 ] then x µ f( x) = exp σ 2π 2 σ x y = e, x = log y, dx = 1/ y dy log y µ f( y) = exp yσ 2π 2 σ b. (Hint: According to the problem, the data given come from a lognormal population. Thus, logs (base e) will be normally distributed. Take the logs, then sort the data. Divide the observed data into a set of ranges. Observe the proportions of observations that fall in those ranges. You can compute the counterparts to these proportions as the predicted probabilities for the ranges from a normal distribution with the mean and variance that occur in the data. A chi squared goodness of fit can then be used. The sorted data on the logs of the variables are listed below:

7 The mean and standard deviation of the data are and I divided the data into the 5 sets above. The boundaries of 5 regions are [-, ], ( , ], ( , ], ( , ], ( , - ]. Standardized (by taking (value )/.78543), these are (-,-.86), (-.86,-.32), (-.32,+.32), (+.32,+.77),(+.77,+ ) Using the normal table, the probabilities for these 5 intervals are.195,.180,.251,.154,.220. (Rounding error of.001, so I reduced the top cell to make these add to 1.000). Based on my division of the data (yours might be different), the 5 sample proportions are.191,.191,.191,.191,.236. (again, a bit of rounding error in the top cell). The chi squared is Σ [(Observed Expected) 2 /Expected] = The number of degrees if 5-1 = 4. The critical chi squared for 4 degrees of freedom is 9.49 (5% significance), so the normality hypothesis is not rejected.

Inference and Regression

Inference and Regression Inference and Regression Assignment 3 Department of IOMS Professor William Greene Phone:.998.0876 Office: KMC 7-90 Home page:www.stern.nyu.edu/~wgreene Email: wgreene@stern.nyu.edu Course web page: www.stern.nyu.edu/~wgreene/econometrics/econometrics.htm.

More information

Quantitative Introduction ro Risk and Uncertainty in Business Module 5: Hypothesis Testing

Quantitative Introduction ro Risk and Uncertainty in Business Module 5: Hypothesis Testing Quantitative Introduction ro Risk and Uncertainty in Business Module 5: Hypothesis Testing M. Vidyasagar Cecil & Ida Green Chair The University of Texas at Dallas Email: M.Vidyasagar@utdallas.edu October

More information

STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots. March 8, 2015

STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots. March 8, 2015 STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots March 8, 2015 The duality between CI and hypothesis testing The duality between CI and hypothesis

More information

EC2001 Econometrics 1 Dr. Jose Olmo Room D309

EC2001 Econometrics 1 Dr. Jose Olmo Room D309 EC2001 Econometrics 1 Dr. Jose Olmo Room D309 J.Olmo@City.ac.uk 1 Revision of Statistical Inference 1.1 Sample, observations, population A sample is a number of observations drawn from a population. Population:

More information

Chapter 10. Hypothesis Testing (I)

Chapter 10. Hypothesis Testing (I) Chapter 10. Hypothesis Testing (I) Hypothesis Testing, together with statistical estimation, are the two most frequently used statistical inference methods. It addresses a different type of practical problems

More information

Inference for Proportions, Variance and Standard Deviation

Inference for Proportions, Variance and Standard Deviation Inference for Proportions, Variance and Standard Deviation Sections 7.10 & 7.6 Cathy Poliak, Ph.D. cathy@math.uh.edu Office Fleming 11c Department of Mathematics University of Houston Lecture 12 Cathy

More information

Probability & Statistics - FALL 2008 FINAL EXAM

Probability & Statistics - FALL 2008 FINAL EXAM 550.3 Probability & Statistics - FALL 008 FINAL EXAM NAME. An urn contains white marbles and 8 red marbles. A marble is drawn at random from the urn 00 times with replacement. Which of the following is

More information

Master s Written Examination - Solution

Master s Written Examination - Solution Master s Written Examination - Solution Spring 204 Problem Stat 40 Suppose X and X 2 have the joint pdf f X,X 2 (x, x 2 ) = 2e (x +x 2 ), 0 < x < x 2

More information

Fundamental Probability and Statistics

Fundamental Probability and Statistics Fundamental Probability and Statistics "There are known knowns. These are things we know that we know. There are known unknowns. That is to say, there are things that we know we don't know. But there are

More information

6.4 Type I and Type II Errors

6.4 Type I and Type II Errors 6.4 Type I and Type II Errors Ulrich Hoensch Friday, March 22, 2013 Null and Alternative Hypothesis Neyman-Pearson Approach to Statistical Inference: A statistical test (also known as a hypothesis test)

More information

This does not cover everything on the final. Look at the posted practice problems for other topics.

This does not cover everything on the final. Look at the posted practice problems for other topics. Class 7: Review Problems for Final Exam 8.5 Spring 7 This does not cover everything on the final. Look at the posted practice problems for other topics. To save time in class: set up, but do not carry

More information

Chapter 7: Hypothesis Testing

Chapter 7: Hypothesis Testing Chapter 7: Hypothesis Testing *Mathematical statistics with applications; Elsevier Academic Press, 2009 The elements of a statistical hypothesis 1. The null hypothesis, denoted by H 0, is usually the nullification

More information

Lecture 10: Generalized likelihood ratio test

Lecture 10: Generalized likelihood ratio test Stat 200: Introduction to Statistical Inference Autumn 2018/19 Lecture 10: Generalized likelihood ratio test Lecturer: Art B. Owen October 25 Disclaimer: These notes have not been subjected to the usual

More information

Sampling, Confidence Interval and Hypothesis Testing

Sampling, Confidence Interval and Hypothesis Testing Sampling, Confidence Interval and Hypothesis Testing Christopher Grigoriou Executive MBA HEC Lausanne 2007-2008 1 Sampling : Careful with convenience samples! World War II: A statistical study to decide

More information

ECONOMETRICS I. Assignment 5 Estimation

ECONOMETRICS I. Assignment 5 Estimation ECONOMETRICS I Professor William Greene Phone: 212.998.0876 Office: KMC 7-90 Home page: people.stern.nyu.edu/wgreene Email: wgreene@stern.nyu.edu Course web page: people.stern.nyu.edu/wgreene/econometrics/econometrics.htm

More information

Business Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing

Business Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing Business Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing Agenda Introduction to Estimation Point estimation Interval estimation Introduction to Hypothesis Testing Concepts en terminology

More information

Inference and Regression

Inference and Regression Inference and Regression Midterm Examination, 016 Department of IOMS Instructions Please write your name at the top of this page. Please answer all questions on this question book. Do not turn in a blue

More information

Institute of Actuaries of India

Institute of Actuaries of India Institute of Actuaries of India Subject CT3 Probability & Mathematical Statistics May 2011 Examinations INDICATIVE SOLUTION Introduction The indicative solution has been written by the Examiners with the

More information

1. Let A be a 2 2 nonzero real matrix. Which of the following is true?

1. Let A be a 2 2 nonzero real matrix. Which of the following is true? 1. Let A be a 2 2 nonzero real matrix. Which of the following is true? (A) A has a nonzero eigenvalue. (B) A 2 has at least one positive entry. (C) trace (A 2 ) is positive. (D) All entries of A 2 cannot

More information

Probability and Statistics

Probability and Statistics Probability and Statistics Kristel Van Steen, PhD 2 Montefiore Institute - Systems and Modeling GIGA - Bioinformatics ULg kristel.vansteen@ulg.ac.be CHAPTER 4: IT IS ALL ABOUT DATA 4a - 1 CHAPTER 4: IT

More information

Statistical inference provides methods for drawing conclusions about a population from sample data.

Statistical inference provides methods for drawing conclusions about a population from sample data. Introduction to inference Confidence Intervals Statistical inference provides methods for drawing conclusions about a population from sample data. 10.1 Estimating with confidence SAT σ = 100 n = 500 µ

More information

Ling 289 Contingency Table Statistics

Ling 289 Contingency Table Statistics Ling 289 Contingency Table Statistics Roger Levy and Christopher Manning This is a summary of the material that we ve covered on contingency tables. Contingency tables: introduction Odds ratios Counting,

More information

MTH U481 : SPRING 2009: PRACTICE PROBLEMS FOR FINAL

MTH U481 : SPRING 2009: PRACTICE PROBLEMS FOR FINAL MTH U481 : SPRING 2009: PRACTICE PROBLEMS FOR FINAL 1). Two urns are provided as follows: urn 1 contains 2 white chips and 4 red chips, while urn 2 contains 5 white chips and 3 red chips. One chip is chosen

More information

Basic Probability Reference Sheet

Basic Probability Reference Sheet February 27, 2001 Basic Probability Reference Sheet 17.846, 2001 This is intended to be used in addition to, not as a substitute for, a textbook. X is a random variable. This means that X is a variable

More information

Course: ESO-209 Home Work: 1 Instructor: Debasis Kundu

Course: ESO-209 Home Work: 1 Instructor: Debasis Kundu Home Work: 1 1. Describe the sample space when a coin is tossed (a) once, (b) three times, (c) n times, (d) an infinite number of times. 2. A coin is tossed until for the first time the same result appear

More information

Naïve Bayes classification

Naïve Bayes classification Naïve Bayes classification 1 Probability theory Random variable: a variable whose possible values are numerical outcomes of a random phenomenon. Examples: A person s height, the outcome of a coin toss

More information

Chapter 9 Inferences from Two Samples

Chapter 9 Inferences from Two Samples Chapter 9 Inferences from Two Samples 9-1 Review and Preview 9-2 Two Proportions 9-3 Two Means: Independent Samples 9-4 Two Dependent Samples (Matched Pairs) 9-5 Two Variances or Standard Deviations Review

More information

Statistical Data Analysis Stat 3: p-values, parameter estimation

Statistical Data Analysis Stat 3: p-values, parameter estimation Statistical Data Analysis Stat 3: p-values, parameter estimation London Postgraduate Lectures on Particle Physics; University of London MSci course PH4515 Glen Cowan Physics Department Royal Holloway,

More information

The Logit Model: Estimation, Testing and Interpretation

The Logit Model: Estimation, Testing and Interpretation The Logit Model: Estimation, Testing and Interpretation Herman J. Bierens October 25, 2008 1 Introduction to maximum likelihood estimation 1.1 The likelihood function Consider a random sample Y 1,...,

More information

Homework for 4/9 Due 4/16

Homework for 4/9 Due 4/16 Name: ID: Homework for 4/9 Due 4/16 1. [ 13-6] It is covetioal wisdom i military squadros that pilots ted to father more girls tha boys. Syder 1961 gathered data for military fighter pilots. The sex of

More information

Notes on the Multivariate Normal and Related Topics

Notes on the Multivariate Normal and Related Topics Version: July 10, 2013 Notes on the Multivariate Normal and Related Topics Let me refresh your memory about the distinctions between population and sample; parameters and statistics; population distributions

More information

Testing Hypothesis. Maura Mezzetti. Department of Economics and Finance Università Tor Vergata

Testing Hypothesis. Maura Mezzetti. Department of Economics and Finance Università Tor Vergata Maura Department of Economics and Finance Università Tor Vergata Hypothesis Testing Outline It is a mistake to confound strangeness with mystery Sherlock Holmes A Study in Scarlet Outline 1 The Power Function

More information

Swarthmore Honors Exam 2012: Statistics

Swarthmore Honors Exam 2012: Statistics Swarthmore Honors Exam 2012: Statistics 1 Swarthmore Honors Exam 2012: Statistics John W. Emerson, Yale University NAME: Instructions: This is a closed-book three-hour exam having six questions. You may

More information

Purposes of Data Analysis. Variables and Samples. Parameters and Statistics. Part 1: Probability Distributions

Purposes of Data Analysis. Variables and Samples. Parameters and Statistics. Part 1: Probability Distributions Part 1: Probability Distributions Purposes of Data Analysis True Distributions or Relationships in the Earths System Probability Distribution Normal Distribution Student-t Distribution Chi Square Distribution

More information

Salt Lake Community College MATH 1040 Final Exam Fall Semester 2011 Form E

Salt Lake Community College MATH 1040 Final Exam Fall Semester 2011 Form E Salt Lake Community College MATH 1040 Final Exam Fall Semester 011 Form E Name Instructor Time Limit: 10 minutes Any hand-held calculator may be used. Computers, cell phones, or other communication devices

More information

Ch. 7 Statistical Intervals Based on a Single Sample

Ch. 7 Statistical Intervals Based on a Single Sample Ch. 7 Statistical Intervals Based on a Single Sample Before discussing the topics in Ch. 7, we need to cover one important concept from Ch. 6. Standard error The standard error is the standard deviation

More information

Naïve Bayes classification. p ij 11/15/16. Probability theory. Probability theory. Probability theory. X P (X = x i )=1 i. Marginal Probability

Naïve Bayes classification. p ij 11/15/16. Probability theory. Probability theory. Probability theory. X P (X = x i )=1 i. Marginal Probability Probability theory Naïve Bayes classification Random variable: a variable whose possible values are numerical outcomes of a random phenomenon. s: A person s height, the outcome of a coin toss Distinguish

More information

Statistical Inference

Statistical Inference Statistical Inference Bernhard Klingenberg Institute of Statistics Graz University of Technology Steyrergasse 17/IV, 8010 Graz www.statistics.tugraz.at February 12, 2008 Outline Estimation: Review of concepts

More information

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages: Glossary The ISI glossary of statistical terms provides definitions in a number of different languages: http://isi.cbs.nl/glossary/index.htm Adjusted r 2 Adjusted R squared measures the proportion of the

More information

Statistical Methods in Particle Physics

Statistical Methods in Particle Physics Statistical Methods in Particle Physics Lecture 11 January 7, 2013 Silvia Masciocchi, GSI Darmstadt s.masciocchi@gsi.de Winter Semester 2012 / 13 Outline How to communicate the statistical uncertainty

More information

Robustness and Distribution Assumptions

Robustness and Distribution Assumptions Chapter 1 Robustness and Distribution Assumptions 1.1 Introduction In statistics, one often works with model assumptions, i.e., one assumes that data follow a certain model. Then one makes use of methodology

More information

VTU Edusat Programme 16

VTU Edusat Programme 16 VTU Edusat Programme 16 Subject : Engineering Mathematics Sub Code: 10MAT41 UNIT 8: Sampling Theory Dr. K.S.Basavarajappa Professor & Head Department of Mathematics Bapuji Institute of Engineering and

More information

375 PU M Sc Statistics

375 PU M Sc Statistics 375 PU M Sc Statistics 1 of 100 193 PU_2016_375_E For the following 2x2 contingency table for two attributes the value of chi-square is:- 20/36 10/38 100/21 10/18 2 of 100 120 PU_2016_375_E If the values

More information

UNIVERSITY OF TORONTO Faculty of Arts and Science

UNIVERSITY OF TORONTO Faculty of Arts and Science UNIVERSITY OF TORONTO Faculty of Arts and Science December 2013 Final Examination STA442H1F/2101HF Methods of Applied Statistics Jerry Brunner Duration - 3 hours Aids: Calculator Model(s): Any calculator

More information

For use only in [the name of your school] 2014 S4 Note. S4 Notes (Edexcel)

For use only in [the name of your school] 2014 S4 Note. S4 Notes (Edexcel) s (Edexcel) Copyright www.pgmaths.co.uk - For AS, A2 notes and IGCSE / GCSE worksheets 1 Copyright www.pgmaths.co.uk - For AS, A2 notes and IGCSE / GCSE worksheets 2 Copyright www.pgmaths.co.uk - For AS,

More information

Space Telescope Science Institute statistics mini-course. October Inference I: Estimation, Confidence Intervals, and Tests of Hypotheses

Space Telescope Science Institute statistics mini-course. October Inference I: Estimation, Confidence Intervals, and Tests of Hypotheses Space Telescope Science Institute statistics mini-course October 2011 Inference I: Estimation, Confidence Intervals, and Tests of Hypotheses James L Rosenberger Acknowledgements: Donald Richards, William

More information

MAS223 Statistical Inference and Modelling Exercises

MAS223 Statistical Inference and Modelling Exercises MAS223 Statistical Inference and Modelling Exercises The exercises are grouped into sections, corresponding to chapters of the lecture notes Within each section exercises are divided into warm-up questions,

More information

The t-distribution. Patrick Breheny. October 13. z tests The χ 2 -distribution The t-distribution Summary

The t-distribution. Patrick Breheny. October 13. z tests The χ 2 -distribution The t-distribution Summary Patrick Breheny October 13 Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/25 Introduction Introduction What s wrong with z-tests? So far we ve (thoroughly!) discussed how to carry out hypothesis

More information

Bayesian Inference. STA 121: Regression Analysis Artin Armagan

Bayesian Inference. STA 121: Regression Analysis Artin Armagan Bayesian Inference STA 121: Regression Analysis Artin Armagan Bayes Rule...s! Reverend Thomas Bayes Posterior Prior p(θ y) = p(y θ)p(θ)/p(y) Likelihood - Sampling Distribution Normalizing Constant: p(y

More information

4 Hypothesis testing. 4.1 Types of hypothesis and types of error 4 HYPOTHESIS TESTING 49

4 Hypothesis testing. 4.1 Types of hypothesis and types of error 4 HYPOTHESIS TESTING 49 4 HYPOTHESIS TESTING 49 4 Hypothesis testing In sections 2 and 3 we considered the problem of estimating a single parameter of interest, θ. In this section we consider the related problem of testing whether

More information

Review of Statistics

Review of Statistics Review of Statistics Topics Descriptive Statistics Mean, Variance Probability Union event, joint event Random Variables Discrete and Continuous Distributions, Moments Two Random Variables Covariance and

More information

exp{ (x i) 2 i=1 n i=1 (x i a) 2 (x i ) 2 = exp{ i=1 n i=1 n 2ax i a 2 i=1

exp{ (x i) 2 i=1 n i=1 (x i a) 2 (x i ) 2 = exp{ i=1 n i=1 n 2ax i a 2 i=1 4 Hypothesis testing 4. Simple hypotheses A computer tries to distinguish between two sources of signals. Both sources emit independent signals with normally distributed intensity, the signals of the first

More information

STATISTICS 3A03. Applied Regression Analysis with SAS. Angelo J. Canty

STATISTICS 3A03. Applied Regression Analysis with SAS. Angelo J. Canty STATISTICS 3A03 Applied Regression Analysis with SAS Angelo J. Canty Office : Hamilton Hall 209 Phone : (905) 525-9140 extn 27079 E-mail : cantya@mcmaster.ca SAS Labs : L1 Friday 11:30 in BSB 249 L2 Tuesday

More information

Stat 704 Data Analysis I Probability Review

Stat 704 Data Analysis I Probability Review 1 / 39 Stat 704 Data Analysis I Probability Review Dr. Yen-Yi Ho Department of Statistics, University of South Carolina A.3 Random Variables 2 / 39 def n: A random variable is defined as a function that

More information

Section 9.4. Notation. Requirements. Definition. Inferences About Two Means (Matched Pairs) Examples

Section 9.4. Notation. Requirements. Definition. Inferences About Two Means (Matched Pairs) Examples Objective Section 9.4 Inferences About Two Means (Matched Pairs) Compare of two matched-paired means using two samples from each population. Hypothesis Tests and Confidence Intervals of two dependent means

More information

M(t) = 1 t. (1 t), 6 M (0) = 20 P (95. X i 110) i=1

M(t) = 1 t. (1 t), 6 M (0) = 20 P (95. X i 110) i=1 Math 66/566 - Midterm Solutions NOTE: These solutions are for both the 66 and 566 exam. The problems are the same until questions and 5. 1. The moment generating function of a random variable X is M(t)

More information

Inferential statistics

Inferential statistics Inferential statistics Inference involves making a Generalization about a larger group of individuals on the basis of a subset or sample. Ahmed-Refat-ZU Null and alternative hypotheses In hypotheses testing,

More information

Test of Hypothesis for Small and Large Samples and Test of Goodness of Fit

Test of Hypothesis for Small and Large Samples and Test of Goodness of Fit Quest Journals Journal of Software Engineering and Simulation Volume 2 ~ Issue 7(2014) pp: 08-15 ISSN(Online) :2321-3795 ISSN (Print):2321-3809 www.questjournals.org Research Paper Test of Hypothesis for

More information

Hypothesis Testing Chap 10p460

Hypothesis Testing Chap 10p460 Hypothesis Testing Chap 1p46 Elements of a statistical test p462 - Null hypothesis - Alternative hypothesis - Test Statistic - Rejection region Rejection Region p462 The rejection region (RR) specifies

More information

The Chi-Square Distributions

The Chi-Square Distributions MATH 183 The Chi-Square Distributions Dr. Neal, WKU The chi-square distributions can be used in statistics to analyze the standard deviation σ of a normally distributed measurement and to test the goodness

More information

Chapters 10. Hypothesis Testing

Chapters 10. Hypothesis Testing Chapters 10. Hypothesis Testing Some examples of hypothesis testing 1. Toss a coin 100 times and get 62 heads. Is this coin a fair coin? 2. Is the new treatment on blood pressure more effective than the

More information

AP Statistics Ch 12 Inference for Proportions

AP Statistics Ch 12 Inference for Proportions Ch 12.1 Inference for a Population Proportion Conditions for Inference The statistic that estimates the parameter p (population proportion) is the sample proportion p ˆ. p ˆ = Count of successes in the

More information

Mathematics Ph.D. Qualifying Examination Stat Probability, January 2018

Mathematics Ph.D. Qualifying Examination Stat Probability, January 2018 Mathematics Ph.D. Qualifying Examination Stat 52800 Probability, January 2018 NOTE: Answers all questions completely. Justify every step. Time allowed: 3 hours. 1. Let X 1,..., X n be a random sample from

More information

INTERVAL ESTIMATION AND HYPOTHESES TESTING

INTERVAL ESTIMATION AND HYPOTHESES TESTING INTERVAL ESTIMATION AND HYPOTHESES TESTING 1. IDEA An interval rather than a point estimate is often of interest. Confidence intervals are thus important in empirical work. To construct interval estimates,

More information

Mock Exam - 2 hours - use of basic (non-programmable) calculator is allowed - all exercises carry the same marks - exam is strictly individual

Mock Exam - 2 hours - use of basic (non-programmable) calculator is allowed - all exercises carry the same marks - exam is strictly individual Mock Exam - 2 hours - use of basic (non-programmable) calculator is allowed - all exercises carry the same marks - exam is strictly individual Question 1. Suppose you want to estimate the percentage of

More information

Economics 520. Lecture Note 19: Hypothesis Testing via the Neyman-Pearson Lemma CB 8.1,

Economics 520. Lecture Note 19: Hypothesis Testing via the Neyman-Pearson Lemma CB 8.1, Economics 520 Lecture Note 9: Hypothesis Testing via the Neyman-Pearson Lemma CB 8., 8.3.-8.3.3 Uniformly Most Powerful Tests and the Neyman-Pearson Lemma Let s return to the hypothesis testing problem

More information

Review of Statistics 101

Review of Statistics 101 Review of Statistics 101 We review some important themes from the course 1. Introduction Statistics- Set of methods for collecting/analyzing data (the art and science of learning from data). Provides methods

More information

Solutions to the Spring 2015 CAS Exam ST

Solutions to the Spring 2015 CAS Exam ST Solutions to the Spring 2015 CAS Exam ST (updated to include the CAS Final Answer Key of July 15) There were 25 questions in total, of equal value, on this 2.5 hour exam. There was a 10 minute reading

More information

Introductory Econometrics. Review of statistics (Part II: Inference)

Introductory Econometrics. Review of statistics (Part II: Inference) Introductory Econometrics Review of statistics (Part II: Inference) Jun Ma School of Economics Renmin University of China October 1, 2018 1/16 Null and alternative hypotheses Usually, we have two competing

More information

Test 3 Practice Test A. NOTE: Ignore Q10 (not covered)

Test 3 Practice Test A. NOTE: Ignore Q10 (not covered) Test 3 Practice Test A NOTE: Ignore Q10 (not covered) MA 180/418 Midterm Test 3, Version A Fall 2010 Student Name (PRINT):............................................. Student Signature:...................................................

More information

Stat 135 Fall 2013 FINAL EXAM December 18, 2013

Stat 135 Fall 2013 FINAL EXAM December 18, 2013 Stat 135 Fall 2013 FINAL EXAM December 18, 2013 Name: Person on right SID: Person on left There will be one, double sided, handwritten, 8.5in x 11in page of notes allowed during the exam. The exam is closed

More information

2.830J / 6.780J / ESD.63J Control of Manufacturing Processes (SMA 6303) Spring 2008

2.830J / 6.780J / ESD.63J Control of Manufacturing Processes (SMA 6303) Spring 2008 MIT OpenCourseWare http://ocw.mit.edu 2.830J / 6.780J / ESD.63J Control of Processes (SMA 6303) Spring 2008 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms.

More information

This exam contains 13 pages (including this cover page) and 10 questions. A Formulae sheet is provided with the exam.

This exam contains 13 pages (including this cover page) and 10 questions. A Formulae sheet is provided with the exam. Probability and Statistics FS 2017 Session Exam 22.08.2017 Time Limit: 180 Minutes Name: Student ID: This exam contains 13 pages (including this cover page) and 10 questions. A Formulae sheet is provided

More information

DS-GA 1003: Machine Learning and Computational Statistics Homework 7: Bayesian Modeling

DS-GA 1003: Machine Learning and Computational Statistics Homework 7: Bayesian Modeling DS-GA 1003: Machine Learning and Computational Statistics Homework 7: Bayesian Modeling Due: Tuesday, May 10, 2016, at 6pm (Submit via NYU Classes) Instructions: Your answers to the questions below, including

More information

(a) (3 points) Construct a 95% confidence interval for β 2 in Equation 1.

(a) (3 points) Construct a 95% confidence interval for β 2 in Equation 1. Problem 1 (21 points) An economist runs the regression y i = β 0 + x 1i β 1 + x 2i β 2 + x 3i β 3 + ε i (1) The results are summarized in the following table: Equation 1. Variable Coefficient Std. Error

More information

STT 843 Key to Homework 1 Spring 2018

STT 843 Key to Homework 1 Spring 2018 STT 843 Key to Homework Spring 208 Due date: Feb 4, 208 42 (a Because σ = 2, σ 22 = and ρ 2 = 05, we have σ 2 = ρ 2 σ σ22 = 2/2 Then, the mean and covariance of the bivariate normal is µ = ( 0 2 and Σ

More information

KDF2C QUANTITATIVE TECHNIQUES FOR BUSINESSDECISION. Unit : I - V

KDF2C QUANTITATIVE TECHNIQUES FOR BUSINESSDECISION. Unit : I - V KDF2C QUANTITATIVE TECHNIQUES FOR BUSINESSDECISION Unit : I - V Unit I: Syllabus Probability and its types Theorems on Probability Law Decision Theory Decision Environment Decision Process Decision tree

More information

Math 416 Lecture 3. The average or mean or expected value of x 1, x 2, x 3,..., x n is

Math 416 Lecture 3. The average or mean or expected value of x 1, x 2, x 3,..., x n is Math 416 Lecture 3 Expected values The average or mean or expected value of x 1, x 2, x 3,..., x n is x 1 x 2... x n n x 1 1 n x 2 1 n... x n 1 n 1 n x i p x i where p x i 1 n is the probability of x i

More information

Chapter 5 Statistical Inference

Chapter 5 Statistical Inference Chapter 5 Statistical Inference CHAPTER OUTLINE Section 1 Why Do We Need Statistics? Section 2 Inference Using a Probability Model Section 3 Statistical Models Section 4 Data Collection Section 5 Some

More information

Exam Empirical Methods VU University Amsterdam, Faculty of Exact Sciences h, February 12, 2015

Exam Empirical Methods VU University Amsterdam, Faculty of Exact Sciences h, February 12, 2015 Exam Empirical Methods VU University Amsterdam, Faculty of Exact Sciences 18.30 21.15h, February 12, 2015 Question 1 is on this page. Always motivate your answers. Write your answers in English. Only the

More information

ME3620. Theory of Engineering Experimentation. Spring Chapter IV. Decision Making for a Single Sample. Chapter IV

ME3620. Theory of Engineering Experimentation. Spring Chapter IV. Decision Making for a Single Sample. Chapter IV Theory of Engineering Experimentation Chapter IV. Decision Making for a Single Sample Chapter IV 1 4 1 Statistical Inference The field of statistical inference consists of those methods used to make decisions

More information

Chapter Three. Hypothesis Testing

Chapter Three. Hypothesis Testing 3.1 Introduction The final phase of analyzing data is to make a decision concerning a set of choices or options. Should I invest in stocks or bonds? Should a new product be marketed? Are my products being

More information

Chapter 5: HYPOTHESIS TESTING

Chapter 5: HYPOTHESIS TESTING MATH411: Applied Statistics Dr. YU, Chi Wai Chapter 5: HYPOTHESIS TESTING 1 WHAT IS HYPOTHESIS TESTING? As its name indicates, it is about a test of hypothesis. To be more precise, we would first translate

More information

You can compute the maximum likelihood estimate for the correlation

You can compute the maximum likelihood estimate for the correlation Stat 50 Solutions Comments on Assignment Spring 005. (a) _ 37.6 X = 6.5 5.8 97.84 Σ = 9.70 4.9 9.70 75.05 7.80 4.9 7.80 4.96 (b) 08.7 0 S = Σ = 03 9 6.58 03 305.6 30.89 6.58 30.89 5.5 (c) You can compute

More information

Module 03 Lecture 14 Inferential Statistics ANOVA and TOI

Module 03 Lecture 14 Inferential Statistics ANOVA and TOI Introduction of Data Analytics Prof. Nandan Sudarsanam and Prof. B Ravindran Department of Management Studies and Department of Computer Science and Engineering Indian Institute of Technology, Madras Module

More information

Introduction: MLE, MAP, Bayesian reasoning (28/8/13)

Introduction: MLE, MAP, Bayesian reasoning (28/8/13) STA561: Probabilistic machine learning Introduction: MLE, MAP, Bayesian reasoning (28/8/13) Lecturer: Barbara Engelhardt Scribes: K. Ulrich, J. Subramanian, N. Raval, J. O Hollaren 1 Classifiers In this

More information

Quiz 1. Name: Instructions: Closed book, notes, and no electronic devices.

Quiz 1. Name: Instructions: Closed book, notes, and no electronic devices. Quiz 1. Name: Instructions: Closed book, notes, and no electronic devices. 1. What is the difference between a deterministic model and a probabilistic model? (Two or three sentences only). 2. What is the

More information

Uniformly Most Powerful Bayesian Tests and Standards for Statistical Evidence

Uniformly Most Powerful Bayesian Tests and Standards for Statistical Evidence Uniformly Most Powerful Bayesian Tests and Standards for Statistical Evidence Valen E. Johnson Texas A&M University February 27, 2014 Valen E. Johnson Texas A&M University Uniformly most powerful Bayes

More information

If we want to analyze experimental or simulated data we might encounter the following tasks:

If we want to analyze experimental or simulated data we might encounter the following tasks: Chapter 1 Introduction If we want to analyze experimental or simulated data we might encounter the following tasks: Characterization of the source of the signal and diagnosis Studying dependencies Prediction

More information

Econ 325: Introduction to Empirical Economics

Econ 325: Introduction to Empirical Economics Econ 325: Introduction to Empirical Economics Chapter 9 Hypothesis Testing: Single Population Ch. 9-1 9.1 What is a Hypothesis? A hypothesis is a claim (assumption) about a population parameter: population

More information

Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institution of Technology, Kharagpur

Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institution of Technology, Kharagpur Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institution of Technology, Kharagpur Lecture No. # 36 Sampling Distribution and Parameter Estimation

More information

Introduction to Statistical Inference

Introduction to Statistical Inference Introduction to Statistical Inference Dr. Fatima Sanchez-Cabo f.sanchezcabo@tugraz.at http://www.genome.tugraz.at Institute for Genomics and Bioinformatics, Graz University of Technology, Austria Introduction

More information

Statistical Inference

Statistical Inference Statistical Inference Classical and Bayesian Methods Revision Class for Midterm Exam AMS-UCSC Th Feb 9, 2012 Winter 2012. Session 1 (Revision Class) AMS-132/206 Th Feb 9, 2012 1 / 23 Topics Topics We will

More information

Chapter 26: Comparing Counts (Chi Square)

Chapter 26: Comparing Counts (Chi Square) Chapter 6: Comparing Counts (Chi Square) We ve seen that you can turn a qualitative variable into a quantitative one (by counting the number of successes and failures), but that s a compromise it forces

More information

This paper is not to be removed from the Examination Halls

This paper is not to be removed from the Examination Halls ~~ST104B ZA d0 This paper is not to be removed from the Examination Halls UNIVERSITY OF LONDON ST104B ZB BSc degrees and Diplomas for Graduates in Economics, Management, Finance and the Social Sciences,

More information

F79SM STATISTICAL METHODS

F79SM STATISTICAL METHODS F79SM STATISTICAL METHODS SUMMARY NOTES 9 Hypothesis testing 9.1 Introduction As before we have a random sample x of size n of a population r.v. X with pdf/pf f(x;θ). The distribution we assign to X is

More information

Statistical Hypothesis Testing

Statistical Hypothesis Testing Statistical Hypothesis Testing Dr. Phillip YAM 2012/2013 Spring Semester Reference: Chapter 7 of Tests of Statistical Hypotheses by Hogg and Tanis. Section 7.1 Tests about Proportions A statistical hypothesis

More information

CENTRAL LIMIT THEOREM (CLT)

CENTRAL LIMIT THEOREM (CLT) CENTRAL LIMIT THEOREM (CLT) A sampling distribution is the probability distribution of the sample statistic that is formed when samples of size n are repeatedly taken from a population. If the sample statistic

More information

z and t tests for the mean of a normal distribution Confidence intervals for the mean Binomial tests

z and t tests for the mean of a normal distribution Confidence intervals for the mean Binomial tests z and t tests for the mean of a normal distribution Confidence intervals for the mean Binomial tests Chapters 3.5.1 3.5.2, 3.3.2 Prof. Tesler Math 283 Fall 2018 Prof. Tesler z and t tests for mean Math

More information

Originality in the Arts and Sciences: Lecture 2: Probability and Statistics

Originality in the Arts and Sciences: Lecture 2: Probability and Statistics Originality in the Arts and Sciences: Lecture 2: Probability and Statistics Let s face it. Statistics has a really bad reputation. Why? 1. It is boring. 2. It doesn t make a lot of sense. Actually, the

More information