n but for a small sample of the population, the mean is defined as: n 2. For a lognormal distribution, the median equals the mean.

Similar documents
CE3502 Environmental Monitoring, Measurements, and Data Analysis (EMMA) Spring 2008 Final Review

1 Inferential Methods for Correlation and Regression Analysis

Regression, Inference, and Model Building

Assessment and Modeling of Forests. FR 4218 Spring Assignment 1 Solutions

Final Examination Solutions 17/6/2010

Stat 139 Homework 7 Solutions, Fall 2015

Grant MacEwan University STAT 252 Dr. Karen Buro Formula Sheet

Chapters 5 and 13: REGRESSION AND CORRELATION. Univariate data: x, Bivariate data (x,y).

STP 226 EXAMPLE EXAM #1

Comparing your lab results with the others by one-way ANOVA

Linear Regression Models

Response Variable denoted by y it is the variable that is to be predicted measure of the outcome of an experiment also called the dependent variable

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics

ST 305: Exam 3 ( ) = P(A)P(B A) ( ) = P(A) + P(B) ( ) = 1 P( A) ( ) = P(A) P(B) σ X 2 = σ a+bx. σ ˆp. σ X +Y. σ X Y. σ X. σ Y. σ n.

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics

Formulas and Tables for Gerstman

Worksheet 23 ( ) Introduction to Simple Linear Regression (continued)

UNIVERSITY OF TORONTO Faculty of Arts and Science APRIL/MAY 2009 EXAMINATIONS ECO220Y1Y PART 1 OF 2 SOLUTIONS

Simple Regression. Acknowledgement. These slides are based on presentations created and copyrighted by Prof. Daniel Menasce (GMU) CS 700

Error & Uncertainty. Error. More on errors. Uncertainty. Page # The error is the difference between a TRUE value, x, and a MEASURED value, x i :

Interval Estimation (Confidence Interval = C.I.): An interval estimate of some population parameter is an interval of the form (, ),

Simple Linear Regression

Statistical Fundamentals and Control Charts

Economics 250 Assignment 1 Suggested Answers. 1. We have the following data set on the lengths (in minutes) of a sample of long-distance phone calls

Correlation. Two variables: Which test? Relationship Between Two Numerical Variables. Two variables: Which test? Contingency table Grouped bar graph

Final Review. Fall 2013 Prof. Yao Xie, H. Milton Stewart School of Industrial Systems & Engineering Georgia Tech

Chapter 4 - Summarizing Numerical Data

II. Descriptive Statistics D. Linear Correlation and Regression. 1. Linear Correlation

11 Correlation and Regression

Open book and notes. 120 minutes. Cover page and six pages of exam. No calculators.

S Y Y = ΣY 2 n. Using the above expressions, the correlation coefficient is. r = SXX S Y Y

SIMPLE LINEAR REGRESSION AND CORRELATION ANALYSIS

(6) Fundamental Sampling Distribution and Data Discription

Continuous Data that can take on any real number (time/length) based on sample data. Categorical data can only be named or categorised

Sample Size Determination (Two or More Samples)

Because it tests for differences between multiple pairs of means in one test, it is called an omnibus test.

University of California, Los Angeles Department of Statistics. Simple regression analysis

Properties and Hypothesis Testing

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.

ENGI 4421 Confidence Intervals (Two Samples) Page 12-01

Pearson Edexcel Level 3 Advanced Subsidiary and Advanced GCE in Statistics

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

Chapter 13, Part A Analysis of Variance and Experimental Design

3/3/2014. CDS M Phil Econometrics. Types of Relationships. Types of Relationships. Types of Relationships. Vijayamohanan Pillai N.

Chapter 23: Inferences About Means

Parameter, Statistic and Random Samples

Chapter 1 (Definitions)

[ ] ( ) ( ) [ ] ( ) 1 [ ] [ ] Sums of Random Variables Y = a 1 X 1 + a 2 X 2 + +a n X n The expected value of Y is:

Paired Data and Linear Correlation

Statistics 511 Additional Materials

INSTRUCTIONS (A) 1.22 (B) 0.74 (C) 4.93 (D) 1.18 (E) 2.43

Exam 2 Instructions not multiple versions

BIOS 4110: Introduction to Biostatistics. Breheny. Lab #9

Circle the single best answer for each multiple choice question. Your choice should be made clearly.

(all terms are scalars).the minimization is clearer in sum notation:

STA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to:

Correlation and Regression

Linear Regression Analysis. Analysis of paired data and using a given value of one variable to predict the value of the other

Statistics 20: Final Exam Solutions Summer Session 2007

Read through these prior to coming to the test and follow them when you take your test.

DAWSON COLLEGE DEPARTMENT OF MATHEMATICS 201-BZS-05 PROBABILITY AND STATISTICS FALL 2015 FINAL EXAM

Estimating the Population Mean - when a sample average is calculated we can create an interval centered on this average

Lesson 2. Projects and Hand-ins. Hypothesis testing Chaptre 3. { } x=172.0 = 3.67

Topic 10: Introduction to Estimation

Chapter 12 Correlation

7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals

October 25, 2018 BIM 105 Probability and Statistics for Biomedical Engineers 1

MA238 Assignment 4 Solutions (part a)

MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND.

Correlation Regression

Describing the Relation between Two Variables

Sampling, Sampling Distribution and Normality

Sampling Error. Chapter 6 Student Lecture Notes 6-1. Business Statistics: A Decision-Making Approach, 6e. Chapter Goals

LESSON 20: HYPOTHESIS TESTING

5. A formulae page and two tables are provided at the end of Part A of the examination PART A

This chapter focuses on two experimental designs that are crucial to comparative studies: (1) independent samples and (2) matched pair samples.

Mathacle. PSet Stats, Concepts In Statistics Level Number Name: Date:

Simple Linear Regression

Comparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading

STAT 350 Handout 19 Sampling Distribution, Central Limit Theorem (6.6)

University of California, Los Angeles Department of Statistics. Practice problems - simple regression 2 - solutions

Statistical Intervals for a Single Sample

Regression. Correlation vs. regression. The parameters of linear regression. Regression assumes... Random sample. Y = α + β X.

Topic 9: Sampling Distributions of Estimators

Example: Find the SD of the set {x j } = {2, 4, 5, 8, 5, 11, 7}.

Statistics Lecture 27. Final review. Administrative Notes. Outline. Experiments. Sampling and Surveys. Administrative Notes

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4

Exam II Covers. STA 291 Lecture 19. Exam II Next Tuesday 5-7pm Memorial Hall (Same place as exam I) Makeup Exam 7:15pm 9:15pm Location CB 234

Statistics. Chapter 10 Two-Sample Tests. Copyright 2013 Pearson Education, Inc. publishing as Prentice Hall. Chap 10-1

Lecture 6 Simple alternatives and the Neyman-Pearson lemma

STP 226 ELEMENTARY STATISTICS

Topic 9: Sampling Distributions of Estimators

Chapter 3: Other Issues in Multiple regression (Part 1)

Chapter 6 Part 5. Confidence Intervals t distribution chi square distribution. October 23, 2008

A statistical method to determine sample size to estimate characteristic value of soil parameters

TAMS24: Notations and Formulas

Chapter 11 Output Analysis for a Single Model. Banks, Carson, Nelson & Nicol Discrete-Event System Simulation

t distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference

Agenda: Recap. Lecture. Chapter 12. Homework. Chapt 12 #1, 2, 3 SAS Problems 3 & 4 by hand. Marquette University MATH 4740/MSCS 5740

Lecture 24 Floods and flood frequency

Transcription:

Sectio. True or False Questios (2 pts each). For a populatio the meas is defied as i= μ = i but for a small sample of the populatio, the mea is defied as: = i= i 2. For a logormal distributio, the media equals the mea. 3. If the media equals the mea, the data must be ormally distributed. 4. The stadard error is always bigger tha the stadard deviatio. 5. A r 2 value of.85 always idicates that the correlatio is statistically sigificat. 6. A systematic error ca lead to bias i the data. 7. Precisio depeds o the magitudes of both systematic ad radom errors. ( /4)

Sectio 2. Multiple choice questios: circle all correct aswers. (3 pts each) 8. Circle all that are true or correct. a. The total error equals the square root of the sums of the squares of the systematic ad radom errors; b. The total error equals the product of the systematic ad radom errors; c. The total error equals the sum of the systematic ad radom errors; d. The total error equals the larger of the systematic ad radom errors; e. Noe of the above; 9. Circle all that are true or correct. a. The COV is the costraits o variatio; b. The COV is the coefficiet of veracular; c. The COV is the coefficiet of variatio; d. The COV equals the stadard deviatio divided by the mea; e. Noe of the above.. Circle all that are true or correct. a. Radom errors are predictable; b. I a large sample, radom errors are likely to be ormally distributed; c. All radom errors are beyod the cotrol of the data collector; d. Radom errors occur relatively seldom; e. Radom errors are likely i all data sets.. Circle all that are true or correct. a. Accurate implies freedom from all errors; b. Accurate implies freedom from radom errors; c. Accurate implies freedom from systematic errors; d. Accurate implies freedom from bias; e. Accurate implies highly precise. 2. Circle all that are true or correct. a. High precisio implies that all errors are small; b. High precisio implies that radom errors are small; c. High precisio implies that systematic errors are small; d. High precisio implies that bias is small; e. High precisio implies high accuracy. ( /5) 2

Sectio 3. Short-aswer questios 3. The results of 7 replicate measuremets of Mied Liquor Volatile Solids (MLVS) cocetratios are show i the graph below. The idividual measuremets are show as black circles; the mea is show as a hollow bo, ad the error bars represet S.D. You wish to determie whether the highest value should be cosidered a outlier. MLVS (mg/l) 8 6 4 2 8 6 4 2 a. What is the criterio that you would use to determie if poit 7 (the highest value) should be regarded as a outlier? (5 pts) b. Based o the data tabulated below, is the highest poit a outlier? Show all calculatios. (8 pts) Group MLVS (mg/l) Mea ( to i ) S.D. ( to i ) 865 865 NA 2 956 9 64 3 46 956 9 4 87 989 99 5 39 9 9 6 372 78 74 7 585 5 249 --------------------------------------------------------------------------------------------- ( /3) 3

4. A histogram of all of the idividual measuremets of mied liquor suspeded solids cocetratios is show below. You wish to determie if the data are ormally distributed. Number of measuremets 4 3 2 5 7 9 3 5 7 9 2 23 25 MLSS (mg/l) You costruct a probability plot (show below) to help determie if the data are ormally distributed. Q (i) or Normal MLSS (mg/l) 25 2 5 5 y =.97 + 49.98 R 2 =.95 5 5 2 25 MLSS (mg/l) a. What should the slope of the regressio lie i the probability plot be if the data are ormally distributed? (4 pts) a..5 b..75 c.. d..25 e. There is o fied value that the slope should have f. Caot tell from the iformatio give b. What two criteria must the probability plot meet for you to decide that the data are ormally distributed? (2 pts each). 2. ( /8) 4

5. List four tests that you ca apply to determie if a set of data is ormally distributed. (8 pts). 2. 3. 4. 6. The results from your measuremets of SRP i the ifluet to the wastewater treatmet plat are tabulated below. Ca you say with 95% cofidece that the mea is less tha 4. mg/l? Eplai ad show clearly the basis for your aswer. (5 pts) Coc (mgp/l) 3.7 2.27 2.24 3.26 3.36 3.25 2.87 3. 6.8 Mea 3.3 S.D..6 N 9 ( /23) 5

7. The historical record for aual sowfall i Houghto Couty is show below. Output from a regressio aalysis i ecel is show below the graph. Base your aswers to the followig questios o either the graph or the regressio aalysis. Yearly Sowfall for Houghto, MI (Oct-May) Sowfall (i) 4 35 3 25 2 5 5 88 9 92 94 96 98 2 22 Year Regressio Statistics Multiple R.652 R Square.425 Adjusted R Squ.42 Stadard Error 44.48 Observatios 7 ANOVA df SS MS F igificace F Regressio 68456 68456 85.443.64E-5 Residual 5 227524 978 Total 6 39598 Coefficietstadard Erro t Stat P-value Lower 95%Upper 95% Itercept -24 237-8.48685 8.47E-4-2484 -544 X Variable.2.2 9.23.64E-5.88.37 a. Based o the statistics table provided, is the tred toward icreasig sowfall statistically sigificat? Eplai the basis for your aswer. ( pts) b. Is the rate of icrease i aual sowfall sigificatly differet from zero? Eplai the basis for your aswer. (8 pts) c. Based o the regressio, how may iches of sow do you predict will fall i 225? (4 pts) d. What is the error (iches) i your predictio? (5 pts) 6

EQUATIONS X i i = = σ = ( ) 2 i SE = σ CI = t να, s ε = ε + ε 2 2 2 total radom systematic t t k j = t k + = j j t = ( φφ ) t j j= S DL = mi mi Bl m S = Bl+ K s Bl MDL = t s να, m f( ) = e σ 2π ( μ ) 2 2 2σ 2 2 2 2 2 2 2 = a + b + c ε ε ε ε a b c P= + zi s UTL = + ki s k = tνα, i + 7

Pearso Product-Momet Correlatio Coefficiet Table of Critical Values df= N-2 Level of sigificace for two-tailed test N = umber of pairs of data..5.2..988.997.9995.9999 2.9.95.98.99 3.85.878.934.959 4.729.8.882.97 5.669.754.833.874 6.622.77.789.834 7.582.666.75.798 8.549.632.76.765 9.52.62.685.735.497.576.658.78.476.553.634.684 2.458.532.62.66 3.44.54.592.64 4.426.497.574.628 5.42.482.558.66 6.4.468.542.59 7.389.456.528.575 8.378.444.56.56 9.369.433.53.549 2.36.423.492.537 2.352.43.482.526 22.344.44.472.55 23.337.396.462.55 24.33.388.453.495 25.323.38.445.487 26.37.374.437.479 27.3.367.43.47 28.36.36.423.463 29.3.355.46.456 3.296.349.49.449 35.275.325.38.48 4.257.34.358.393 5.23.273.322.354 6.2.25.295.325 7.95.232.274.32 8.83.27.256.284 9.73.25.242.267.64.95.23.254 5.78.234 8

9