Stat 135 Fall 2013 FINAL EXAM December 18, 2013

Similar documents
Statistics 135 Fall 2007 Midterm Exam

Problem #1 #2 #3 #4 #5 #6 Total Points /6 /8 /14 /10 /8 /10 /56

WISE International Masters

WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, Academic Year Exam Version: A

Confidence Intervals for Comparing Means

STAT FINAL EXAM

WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, Academic Year Exam Version: A

The scatterplot is the basic tool for graphically displaying bivariate quantitative data.

Correlation 1. December 4, HMS, 2017, v1.1

Section 7.1 How Likely are the Possible Values of a Statistic? The Sampling Distribution of the Proportion

Statistics 135 Fall 2008 Final Exam

INTERVAL ESTIMATION AND HYPOTHESES TESTING

Statistics 135: Fall 2004 Final Exam

4. Nonlinear regression functions

You have 3 hours to complete the exam. Some questions are harder than others, so don t spend too long on any one question.

STAT 525 Fall Final exam. Tuesday December 14, 2010

Correlation and Regression

WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, Academic Year Exam Version: A

Ecn Analysis of Economic Data University of California - Davis February 23, 2010 Instructor: John Parman. Midterm 2. Name: ID Number: Section:

Direction: This test is worth 250 points and each problem worth points. DO ANY SIX

Final Exam - Solutions

Lecture 6: Linear Regression

Wooldridge, Introductory Econometrics, 4th ed. Appendix C: Fundamentals of mathematical statistics

Final Exam. 1. (6 points) True/False. Please read the statements carefully, as no partial credit will be given.

Statistic: a that can be from a sample without making use of any unknown. In practice we will use to establish unknown parameters.

Lecture 10: Generalized likelihood ratio test

Probability and Statistics Notes

STAT 263/363: Experimental Design Winter 2016/17. Lecture 1 January 9. Why perform Design of Experiments (DOE)? There are at least two reasons:

Point Estimation and Confidence Interval

CHAPTER 4 & 5 Linear Regression with One Regressor. Kazu Matsuda IBEC PHBU 430 Econometrics

, 0 x < 2. a. Find the probability that the text is checked out for more than half an hour but less than an hour. = (1/2)2

*Karle Laska s Sections: There is no class tomorrow and Friday! Have a good weekend! Scores will be posted in Compass early Friday morning

STAT420 Midterm Exam. University of Illinois Urbana-Champaign October 19 (Friday), :00 4:15p. SOLUTIONS (Yellow)

Stat 101 Exam 1 Important Formulas and Concepts 1

STA442/2101: Assignment 5

DSST Principles of Statistics

Midterm 2 - Solutions

ST430 Exam 1 with Answers

Margin of Error. What is margin of error and why does it exist?

First Year Examination Department of Statistics, University of Florida

1 Binomial Probability [15 points]

Exam #2 Results (as percentages)

2.1 Linear regression with matrices

Psych 230. Psychological Measurement and Statistics

STATS 200: Introduction to Statistical Inference. Lecture 29: Course review

Bivariate Data: Graphical Display The scatterplot is the basic tool for graphically displaying bivariate quantitative data.

Announcements. Lecture 10: Relationship between Measurement Variables. Poverty vs. HS graduate rate. Response vs. explanatory

Statistical Inference: Estimation and Confidence Intervals Hypothesis Testing

Concordia University (5+5)Q 1.

Lecture 6: Linear Regression (continued)

EXAM # 2. Total 100. Please show all work! Problem Points Grade. STAT 301, Spring 2013 Name

Estimation and Confidence Intervals

ACMS Statistics for Life Sciences. Chapter 13: Sampling Distributions

MATH 1150 Chapter 2 Notation and Terminology

Circle the single best answer for each multiple choice question. Your choice should be made clearly.

Review for Final Exam Stat 205: Statistics for the Life Sciences

This is a multiple choice and short answer practice exam. It does not count towards your grade. You may use the tables in your book.

Swarthmore Honors Exam 2012: Statistics

UNIVERSITY OF TORONTO. Faculty of Arts and Science APRIL - MAY 2005 EXAMINATIONS STA 248 H1S. Duration - 3 hours. Aids Allowed: Calculator

16.400/453J Human Factors Engineering. Design of Experiments II

Econometrics Problem Set 6

STAT763: Applied Regression Analysis. Multiple linear regression. 4.4 Hypothesis testing

(ii) Scan your answer sheets INTO ONE FILE only, and submit it in the drop-box.

This gives us an upper and lower bound that capture our population mean.

Multiple Choice Circle the letter corresponding to the best answer for each of the problems below (4 pts each)

Business Statistics. Lecture 9: Simple Regression

Ron Heck, Fall Week 3: Notes Building a Two-Level Model

Problem 1 (20) Log-normal. f(x) Cauchy

Mathematical Notation Math Introduction to Applied Statistics

Unit 6 - Simple linear regression

Post-exam 2 practice questions 18.05, Spring 2014

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY (formerly the Examinations of the Institute of Statisticians) HIGHER CERTIFICATE IN STATISTICS, 2004

Lecture 14: Introduction to Poisson Regression

Modelling counts. Lecture 14: Introduction to Poisson Regression. Overview

MA : Introductory Probability

UNIVERSITY OF MASSACHUSETTS Department of Mathematics and Statistics Basic Exam - Applied Statistics Thursday, August 30, 2018

Section 2: Estimation, Confidence Intervals and Testing Hypothesis

Conditional Probability Solutions STAT-UB.0103 Statistics for Business Control and Regression Models

Tribhuvan University Institute of Science and Technology 2065

Linear Regression With Special Variables

Chapter 22. Comparing Two Proportions. Bin Zou STAT 141 University of Alberta Winter / 15

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.

9 Correlation and Regression

Mock Exam - 2 hours - use of basic (non-programmable) calculator is allowed - all exercises carry the same marks - exam is strictly individual

Sampling Distribution Models. Chapter 17

Mathematical Notation Math Introduction to Applied Statistics

Lecture 28 Chi-Square Analysis

Analysis of Variance. Contents. 1 Analysis of Variance. 1.1 Review. Anthony Tanbakuchi Department of Mathematics Pima Community College

Solutions to First Midterm Exam, Stat 371, Spring those values: = Or, we can use Rule 6: = 0.63.

For more information about how to cite these materials visit

Homework 2: Simple Linear Regression

Harvard University. Rigorous Research in Engineering Education

Section I Questions with parts

Assumptions, Diagnostics, and Inferences for the Simple Linear Regression Model with Normal Residuals

Two-Sample Inference for Proportions and Inference for Linear Regression

Midterm 1 and 2 results

TA: Sheng Zhgang (Th 1:20) / 342 (W 1:20) / 343 (W 2:25) / 344 (W 12:05) Haoyang Fan (W 1:20) / 346 (Th 12:05) FINAL EXAM

Analysis of Covariance. The following example illustrates a case where the covariate is affected by the treatments.

Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z).

Ch 2: Simple Linear Regression

Transcription:

Stat 135 Fall 2013 FINAL EXAM December 18, 2013 Name: Person on right SID: Person on left There will be one, double sided, handwritten, 8.5in x 11in page of notes allowed during the exam. The exam is closed book and will be 2 hours and 50 minutes long. You are allowed the use of a non-programmable calculator, that cannot communicate with any other device. Relevant tables are provided. The test begins on the next page. Please do not write below this line. ALWAYS SHOW YOUR REASONING. No work, no credit. Page Maximum score Score achieved 2 10 3 10 4 10 5 10 6 15 7 10 8 15 9 10 Total 90 1

1. One year, there were about 3,000 institutions of higher learning in the US, and as part of a study, the Carnegie Commission took a simple random sample of 400 of these. The average enrollment in the 400 sample schools was 3,700; and the sample standard deviation was 6,500. (a) True or false: An approximate 68% confidence interval for the average enrollment of all 3,000 schools runs from about 3,400 to 4,000. (b) True or false: If a statistician takes a simple random sample of 400 institutions out of 3,000, and goes 1 SE in either direction from the sample average enrollment, there is about a 68% chance that the resultant interval will contain the average enrollment of all 3,000 schools. (Here SE indicates the estimated standard error of the sample mean). (c) True or false: It is estimated that about 68% of the 3,000 schools enrolled between 3,000 and 4,000 students. (d) Sketch (roughly) the following distributions, and approximately mark the mean and SD of the distribution. (2 points each) i. The sampling distribution of the average enrollment of the 400 sample schools. ii. The enrollments of all 3,000 schools. 2

2. Lost Springs is a town in Converse County, WY. As of the 2010 census, the population of Lost Springs was 4. The ages of the 4 residents are {50,54,65,80}. We want to take a sample of size 2 from this population. Suppose, instead of a simple random sample, the census official decides to sample from the following pairs, with each pair being equally likely: {50,54},{54,80},{65,80},{54,65} (a) Write down the distribution of the sample mean. (3 points) (b) Is the sample mean unbiased? 3. Two large populations are independently surveyed using simple random samples of size n, and two proportions are estimated: ˆp 1 = 0.15, and ˆp 2 = 0.12. How large should n be so that the standard error of the difference will be less than 0.05? (3 points) 4. According to a recent Gallup poll, 51% of Americans disapprove of the Affordable Care Act ( Obamacare ). The poll also states that the results for this Gallup poll are based on telephone interviews conducted Dec. 11-12, 2013, on the Gallup Daily tracking survey, with a random sample of 1,011 adults, aged 18 and older, living in all 50 U.S. states and the District of Columbia, and that the margin of sampling error is ±3 percentage points at the 95% confidence level. How is this margin of error (±3) arrived at? 3

5. Statisticians A and B obtain independent samples X 1, X 2,... X 10 and Y 1, Y 2,..., Y 17 respectively, both from a N(µ, σ 2 ) distribution, with both µ and σ 2 unknown. They estimate (µ, σ 2 ) by ( X, SX 2 ) and (Ȳ, S2 Y ) respectively. Given that X = 5.5 and Ȳ = 5.8 respectively, which statistician s estimate of σ2 is more probable to have exceeded the true value by more than 50%? Explain! 6. A random sample of 59 people from the planet Krypton yielded the following results: Gender Eye Color Blue Brown Male 19 10 Female 9 21 Professor A had believed that sex and eye-color are independent factors on Krypton. After doing a χ 2 test of his hypothesis against the alternative H 1 : i j π ij = 1, he finds that he has to reject his null hypothesis at the 5% significance level. Professor B has always believed the much stronger hypothesis that π ij = 1/4 for all i and j. After doing a χ 2 test of his hypothesis against H 1, he finds that he does not need to reject it at the 5% level. Please check their computations. They both are doing χ 2 tests, so why do they get different results? 4

7. Suppose that X is a discrete random variable with: P (X = 0) = 2 3 θ P (X = 1) = 1 3 θ P (X = 2) = 2 (1 θ) 3 P (X = 3) = 1 (1 θ) 3 where 0 θ 1 is a parameter. We have 10 independent observations from this distribution:(3, 0, 2, 1, 3, 2, 1, 0, 2, 1). (a) Find the method of moments estimate of θ. (b) Find an approximate standard error for your estimate. 5

(c) Find the maximum likelihood estimate for θ. (d) What is an approximate standard error of the maximum likelihood estimate? (10 points) 6

8. Let Y i = β 0 + β 1 x i + e i, i = 1, 2,..., n be the simple linear model for the response Y, given the predictor x. Let ˆβ 0 and ˆβ 1 be the least squares estimates for β 0 and β 1 respectively. Show that the residuals ê i = Y i Ŷi sum to 0. (The Ŷi = ˆβ 0 + ˆβ 1 x i are the fitted values.) 9. At a large engineering school, students are required to take aptitude tests in various subjects, including mathematics and physics. Students who do well on the mathematics test also tend to score high on the physics tests. On both tests, the average score is 60, and the spreads are about the same for both tests. The data are homoscedastic, and the distributions of the scores for both tests are approximately normal. Of the students scoring about 75 on the mathematics test: (a) just about half scored over 75 on the physics test. (b) more than half scored over 75 on the physics test. (c) les than half scored over 75 on the physics test. Choose one option, and explain. 10. Several different regression lines are used to predict the price of a stock (from different independent variables). Histograms for the residuals from each line are sketched below. Match the description with the histogram, and explain your choices. (3 points) (a) s = $5 (b) s = $15 (c) something s wrong 7

11. Let X 1, X 2,..., X n be independent and identically distributed Bernoulli(p) random variables. That is, for x = 0, 1 f(x; p) = p x (1 p) 1 x and f(x; p) is 0 elsewhere. (a) Find the best critical region (corresponding to the best test) for testing H 0 : p = 1 2 against H 1 : p = 1 is of the { } 3 form Xi c. (7 points) (b) Use the Central Limit Theorem to find n and c so that the significance level is approximately 0.1, and the power is approximately 0.8. (8 points) 8

12. (a) The following numbers give the percent extension under a given load for two random samples of lengths of yarn, the first sample being taken before washing, and the second after six washings. Before washing: 12.3 13.7 10.4 11.4 14.9 12.6 After 6 washings: 15.7 10.3 12.6 14.5 12.6 13.8 11.9 Do they provide any justification for concluding that extensibility is affected by washing? (You may assume that the standard deviation is unaltered by the washings.) (b) In another experiment with the same type of yarn, six lengths of yarn were selected at random and each length cut into two halves. One of the halves was tested for extension without washing, and the other after six washings, giving the following percent extensions: Length: A B C D E F Before washing: 13.9 12.5 11.0 11.8 10.8 14.6 After 6 washings: 14.7 12.1 13.2 13.6 11.5 15.4 What evidence do these provide as to the effect of washing on extensibility? analysed differently from the previous one? Why should this experiment be 9

10