Q1: Calculate the mean, median, sample variance, and standard deviation of 25, 40, 05, 70, 05, 40, 70.

Similar documents
: 5: ) A

Chapter 3 Describing Data Using Numerical Measures

AS-Level Maths: Statistics 1 for Edexcel

Correlation and Regression. Correlation 9.1. Correlation. Chapter 9

Statistical Inference. 2.3 Summary Statistics Measures of Center and Spread. parameters ( population characteristics )

Statistics MINITAB - Lab 2

Definition. Measures of Dispersion. Measures of Dispersion. Definition. The Range. Measures of Dispersion 3/24/2014

Chapter 9: Statistical Inference and the Relationship between Two Variables

PhysicsAndMathsTutor.com

ISQS 6348 Final Open notes, no books. Points out of 100 in parentheses. Y 1 ε 2

Lecture 3: Probability Distributions

/ n ) are compared. The logic is: if the two

28. SIMPLE LINEAR REGRESSION III

SIMPLE LINEAR REGRESSION

[The following data appear in Wooldridge Q2.3.] The table below contains the ACT score and college GPA for eight college students.

Measuring the Strength of Association

18. SIMPLE LINEAR REGRESSION III

Introduction to Regression

Lecture 3 Stat102, Spring 2007

Scatter Plot x

Linear Correlation. Many research issues are pursued with nonexperimental studies that seek to establish relationships among 2 or more variables

x yi In chapter 14, we want to perform inference (i.e. calculate confidence intervals and perform tests of significance) in this setting.

The Fundamental Theorem of Algebra. Objective To use the Fundamental Theorem of Algebra to solve polynomial equations with complex solutions

Rao IIT Academy/ SSC - Board Exam 2018 / Mathematics Code-A / QP + Solutions JEE MEDICAL-UG BOARDS KVPY NTSE OLYMPIADS SSC - BOARD

Propagation of error for multivariable function

e i is a random error

The topics in this section concern with the second course objective. Correlation is a linear relation between two random variables.

Statistics for Economics & Business

Negative Binomial Regression

Chapter 14 Simple Linear Regression

Sociology 470. Bivariate Regression. Extra Points. Regression. Liying Luo Job talk on Thursday 11/3 at Pond 302

Linear Regression Analysis: Terminology and Notation

Linear regression. Regression Models. Chapter 11 Student Lecture Notes Regression Analysis is the

Chapter 11: Simple Linear Regression and Correlation

Expected Value and Variance

THE ROYAL STATISTICAL SOCIETY 2006 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE

Chapter 3. Descriptive Statistics Numerical Methods

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands

Resource Allocation and Decision Analysis (ECON 8010) Spring 2014 Foundations of Regression Analysis

β0 + β1xi. You are interested in estimating the unknown parameters β

BIO Lab 2: TWO-LEVEL NORMAL MODELS with school children popularity data

Cathy Walker March 5, 2010

Methods in Epidemiology. Medical statistics 02/11/2014

Statistics Spring MIT Department of Nuclear Engineering

First Year Examination Department of Statistics, University of Florida

Exercise 1 The General Linear Model : Answers

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

STAT 3008 Applied Regression Analysis

Statistical analysis using matlab. HY 439 Presented by: George Fortetsanakis

MOUNT SAINT JOSEPH MILPERRA

β0 + β1xi and want to estimate the unknown

Physics 2A Chapter 9 HW Solutions

However, since P is a symmetric idempotent matrix, of P are either 0 or 1 [Eigen-values

Physics 2A Chapter 3 HW Solutions

Multiple Choice. Choose the one that best completes the statement or answers the question.

UNIVERSITY OF TORONTO Faculty of Arts and Science. December 2005 Examinations STA437H1F/STA1005HF. Duration - 3 hours

Limited Dependent Variables

A random variable is a function which associates a real number to each element of the sample space

AP Statistics Ch 3 Examining Relationships

WINTER 2017 EXAMINATION

CinChE Problem-Solving Strategy Chapter 4 Development of a Mathematical Model. formulation. procedure

Dummy variables in multiple variable regression model

CORRELATION AND REGRESSION

10-701/ Machine Learning, Fall 2005 Homework 3

Y = β 0 + β 1 X 1 + β 2 X β k X k + ε

Comparison of Regression Lines

Physics 114 Exam 2 Fall 2014 Solutions. Name:

EXAMINATION. N0028N Econometrics. Luleå University of Technology. Date: (A1016) Time: Aid: Calculator and dictionary

The Ordinary Least Squares (OLS) Estimator

CONDUCTORS AND INSULATORS

= z 20 z n. (k 20) + 4 z k = 4

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X

Module 2. Random Processes. Version 2 ECE IIT, Kharagpur

LECTURE 9 CANONICAL CORRELATION ANALYSIS

a. (All your answers should be in the letter!

CIS526: Machine Learning Lecture 3 (Sept 16, 2003) Linear Regression. Preparation help: Xiaoying Huang. x 1 θ 1 output... θ M x M

Check off these skills when you feel that you have mastered them. List and describe two types of distributions for a histogram.

Lecture 6: Introduction to Linear Regression

STATISTICS QUESTIONS. Step by Step Solutions.

Chapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise.

Answers Problem Set 2 Chem 314A Williamsen Spring 2000

Economics 130. Lecture 4 Simple Linear Regression Continued

Engineering Risk Benefit Analysis

Probability and Samples. Sampling. Point Estimates

PROBABILITY PRIMER. Exercise Solutions

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Rockefeller College University at Albany

STAT 511 FINAL EXAM NAME Spring 2001

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Week3, Chapter 4. Position and Displacement. Motion in Two Dimensions. Instantaneous Velocity. Average Velocity

Introduction to Random Variables

CORRELATION AND REGRESSION

Biostatistics. Chapter 11 Simple Linear Correlation and Regression. Jing Li

Basic Business Statistics, 10/e

Central tendency. mean for metric data. The mean. "I say what I means and I means what I say!."

β0 + β1xi. You are interested in estimating the unknown parameters β

Here is the rationale: If X and y have a strong positive relationship to one another, then ( x x) will tend to be positive when ( y y)

STAT 3340 Assignment 1 solutions. 1. Find the equation of the line which passes through the points (1,1) and (4,5).

j) = 1 (note sigma notation) ii. Continuous random variable (e.g. Normal distribution) 1. density function: f ( x) 0 and f ( x) dx = 1

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction

Transcription:

Q1: Calculate the mean, medan, sample varance, and standard devaton of 5, 40, 05, 70, 05, 40, 70. Q: The frequenc dstrbuton for a data set s gven below. Measurements 0 1 3 4 Frequenc 3 5 8 3 1 a) What happens to the mean f we double the frequences? (Now we have 40 data ponts) The mean does not change f the frequences are doubled. In such a case, the sum of the data ponts wll get doubled, and the number of data ponts s doubled also. Therefore rato of the sum of the data ponts to the number of data ponts wll sta the same. b) What happens to the mean f we double the measurement values? (There are stll 0 data ponts) The mean gets doubled. The sum of the data ponts get doubled because ou have doubled each measurement value, but the number of data ponts stas the same, causng the mean to get doubled. c) Does the standard devaton ncrease, decrease, or sta the same f we double the frequences? n The term gets doubled because snce the frequences are doubled, ou have to add the n square of each number twce. The term gets doubled also, as eplaned n part a. When

the frequences are doubled, because of the term, the standard devaton decreases ver slghtl. d) What happens to the standard devaton f we double the measurement values? n The term gets multpled b 4, as each value s twce the orgnal value, so ther squares wll be 4 tmes the squares of the orgnal values. The term n gets doubled as eplaned n part b. The varance where the measurement values are NOT doubled s: 0 0 [ ( ) ] The varance where the measurement values ARE doubled s: 0 [ ( ) ], whch s equvalent to 0 [ 0 ( 0 ) ] Therefore the varance s 4 tmes the ntal varance. Snce the standard devaton s the square root of varance, the standard devaton gets doubled. Q3: What s the 85 th percentle of the data set consstng of all natural numbers between 1 and 140? The data set s: { }. There are 140 numbers n the data set. Snce the 85 th percentle s asked, p= 0.85. Np= 11. The 85 th percentle s gven b. s 11, and s 10. Therefore the answer s 11.5. Q5: a) The stem-and-leaf dagram of the age of a group of tenns plaers s shown below: 1 7 0 3 4 0 8 0 3 8 1 6 1 6 1 8 3 4 4 6 Fnd the mode, IQR, and relatve frequenc of 4. b) 37 students receved a hgher grade than Nuran on mdterm 1. whch was taken b 183 students. What s the percentle of Nuran s grade? c) The average weght of a group of 40 Sumo wrestlers s 167.5 kg, and the medan s 141 kg. What can ou sa about the dstrbuton of weght among ths group? Eplan!

Q6: A bvarate numercal data set conssts of the pars of values (-,-3), (-1,-3), (0,-), (1,-), (,-1), (3,0), (4,1) for the varables (,). a) Fnd the correlaton coeffcent b) Fnd the least square ftted lne c) What s the vertcal error E 3 for the thrd pont (0,-)?

Q7 The weght and heght of female students was measured. Let 1,..., be the weght measurements, and let 1,..., be the heght measurements. The sample averages are 58 kg and 163 cm, and the sample standard devatons are 6.1 kg and 4.8 cm. The correlaton coeffcent between weght and heght measurements s 0.4 a) Fnd and b) Fnd ( )( ) What s gven n the queston: = 58 = 163 = 6.1 = 4.8 r = 0.4 c) Fnd

a) ( ) We know that s 6.1, and that the sum of the values of (the sum of the weghts) s the average of the weghts tmes the number of people. can easl be calculated. ( ( = 30 573.7 Smlarl, ( ( ( ) ) ) ) ) = 3 305.3 b) ( )( ), so we can fnd b fndng and. (r s alread known to be 0.4). We can fnd and snce and. and Therefore,

c) = 85 306.1 Q: The number of goals scored per game b Lverpool at Anfeld Road n the last three seasons s gven b the followng table: Goals per game 0 1 3 4 5 6 7 8 Frequenc 8 11 37 1 6 4 A 0 1 Rel. Freq. 0.088 a) What s A? What s the relatve frequenc of 3 goals scored per game? b) Do a Boplot of the data. (If ou could not fnd A n part (a), assume ts 0) Q10: There are three -3 s, four -1 s, two 0 s, four s, and one 3 n a data set of temperatures (n degrees Celsus). a) Draw a relatve frequenc hstogram

b) Fnd the mode, medan, mean, and 70 th percentle. Mode = -1 and Medan = = 70 th percentle: p= 0.7 Np=.8 k=10 The 70 th percentle s equal to, whch s. c) Repeat (b) after addng two -3 s and two 3 s to the temperature data set. The mode becomes -3, snce there are more -3 s than anthng else now. d) Repeat (b) after convertng the temperature data n Celsus to a temperature data n Fahrenhet. ( Degrees Fahrenhet = 1.8 Degrees Celsus + 3. Eample: 41 F 5 C )

Q11: 0 data ponts, some sums and the correlalaton coeffcent are gven below. X 15 60 10 36 50 3 18 71 5 7 85 5 1 5 51 6 74 5 5 56 66 75 0 1 84 44 1 5 45 80 3 43 60 40 34 45 71 71 61 31 0 j1 56, 0 110, 0 6071, 0 665, 0 A, r 0.03. (a) What s the value of S? (b) What does r = - 0.03 tell ou? Eplan. (c) What s A (the sum of the cross products of the and )? (d) Fnd the mean and standard devaton for the, and the mean for the.

Q1: There are 5 data ponts, (, ), = 1,...,5. Also, 30 and. The ftted least squares lne (regresson lne) of the data set s ˆ 1.8 1. 6. (a) Fnd S. (b) Fnd S. (c) Fnd. 5

Q 13: The hstogram of a dscrete data set consstng of 100 measurements s gven below. Hstogram Relatve Frequenc 0,4 0,3 0, 0,1 0 0,35 0,5 0,15 0,15 0,1 1, 3,3 5, 6,3 (a) Fnd the mode, medan and mean. (b) Fnd the nterquartle range. (c) Fnd the frequenc of. (d) Fnd the sum of data ponts. N (e) The varance s.687. What s?