Mean is only appropriate for interval or ratio scales, not ordinal or nominal.

Similar documents
Lecture Notes Types of economic variables

Summary of the lecture in Biostatistics

Econometric Methods. Review of Estimation

Section l h l Stem=Tens. 8l Leaf=Ones. 8h l 03. 9h 58

b. There appears to be a positive relationship between X and Y; that is, as X increases, so does Y.

is the score of the 1 st student, x

Simple Linear Regression

Statistics MINITAB - Lab 5

Parameter, Statistic and Random Samples

The expected value of a sum of random variables,, is the sum of the expected values:

MEASURES OF DISPERSION

Lecture 7. Confidence Intervals and Hypothesis Tests in the Simple CLR Model

Correlation and Simple Linear Regression

Lecture 3. Sampling, sampling distributions, and parameter estimation

ESS Line Fitting

Chapter 5 Properties of a Random Sample

Functions of Random Variables

Simulation Output Analysis

Chapter Business Statistics: A First Course Fifth Edition. Learning Objectives. Correlation vs. Regression. In this chapter, you learn:

Chapter 8: Statistical Analysis of Simulated Data

Continuous Distributions

CHAPTER VI Statistical Analysis of Experimental Data

Module 7. Lecture 7: Statistical parameter estimation

X ε ) = 0, or equivalently, lim

f f... f 1 n n (ii) Median : It is the value of the middle-most observation(s).

= 1. UCLA STAT 13 Introduction to Statistical Methods for the Life and Health Sciences. Parameters and Statistics. Measures of Centrality

Multiple Regression. More than 2 variables! Grade on Final. Multiple Regression 11/21/2012. Exam 2 Grades. Exam 2 Re-grades

STA 105-M BASIC STATISTICS (This is a multiple choice paper.)

Simple Linear Regression

best estimate (mean) for X uncertainty or error in the measurement (systematic, random or statistical) best

UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS

ENGI 4421 Propagation of Error Page 8-01

Lecture 1 Review of Fundamental Statistical Concepts

1. The weight of six Golden Retrievers is 66, 61, 70, 67, 92 and 66 pounds. The weight of six Labrador Retrievers is 54, 60, 72, 78, 84 and 67.

Chapter 8. Inferences about More Than Two Population Central Values

Measures of Dispersion

Example: Multiple linear regression. Least squares regression. Repetition: Simple linear regression. Tron Anders Moger

Ordinary Least Squares Regression. Simple Regression. Algebra and Assumptions.

Lecture 3 Probability review (cont d)

hp calculators HP 30S Statistics Averages and Standard Deviations Average and Standard Deviation Practice Finding Averages and Standard Deviations

Lesson 3. Group and individual indexes. Design and Data Analysis in Psychology I English group (A) School of Psychology Dpt. Experimental Psychology

Statistics: Unlocking the Power of Data Lock 5

C. Statistics. X = n geometric the n th root of the product of numerical data ln X GM = or ln GM = X 2. X n X 1

Chapter Statistics Background of Regression Analysis

UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS

{ }{ ( )} (, ) = ( ) ( ) ( ) Chapter 14 Exercises in Sampling Theory. Exercise 1 (Simple random sampling): Solution:

Uncertainty, Data, and Judgment

12.2 Estimating Model parameters Assumptions: ox and y are related according to the simple linear regression model

Chapter 13 Student Lecture Notes 13-1

2.28 The Wall Street Journal is probably referring to the average number of cubes used per glass measured for some population that they have chosen.

ENGI 3423 Simple Linear Regression Page 12-01

Handout #1. Title: Foundations of Econometrics. POPULATION vs. SAMPLE

Arithmetic Mean Suppose there is only a finite number N of items in the system of interest. Then the population arithmetic mean is

STK3100 and STK4100 Autumn 2018

LECTURE - 4 SIMPLE RANDOM SAMPLING DR. SHALABH DEPARTMENT OF MATHEMATICS AND STATISTICS INDIAN INSTITUTE OF TECHNOLOGY KANPUR

( ) = ( ) ( ) Chapter 13 Asymptotic Theory and Stochastic Regressors. Stochastic regressors model

ENGI 4421 Joint Probability Distributions Page Joint Probability Distributions [Navidi sections 2.5 and 2.6; Devore sections

Can we take the Mysticism Out of the Pearson Coefficient of Linear Correlation?

Class 13,14 June 17, 19, 2015


Statistics Descriptive and Inferential Statistics. Instructor: Daisuke Nagakura

Probability and. Lecture 13: and Correlation

Previous lecture. Lecture 8. Learning outcomes of this lecture. Today. Statistical test and Scales of measurement. Correlation

Law of Large Numbers

Point Estimation: definition of estimators

STK3100 and STK4100 Autumn 2017

Chapter 4 Multiple Random Variables

STA302/1001-Fall 2008 Midterm Test October 21, 2008

2SLS Estimates ECON In this case, begin with the assumption that E[ i

: At least two means differ SST

Chapter 14 Logistic Regression Models

Linear Regression with One Regressor

IFYMB002 Mathematics Business Appendix C Formula Booklet

Improving coverage probabilities of confidence intervals in random effects meta-analysis with publication bias

THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE

Quantitative analysis requires : sound knowledge of chemistry : possibility of interferences WHY do we need to use STATISTICS in Anal. Chem.?

Chapter 4 Multiple Random Variables

Descriptive Statistics

(Monte Carlo) Resampling Technique in Validity Testing and Reliability Testing

ECON 482 / WH Hong The Simple Regression Model 1. Definition of the Simple Regression Model

STATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS. x, where. = y - ˆ " 1

Multiple Choice Test. Chapter Adequacy of Models for Regression

Lecture 9: Tolerant Testing

Reaction Time VS. Drug Percentage Subject Amount of Drug Times % Reaction Time in Seconds 1 Mary John Carl Sara William 5 4

Lecture 8: Linear Regression

Chapter Two. An Introduction to Regression ( )

The equation is sometimes presented in form Y = a + b x. This is reasonable, but it s not the notation we use.

Comparison of Dual to Ratio-Cum-Product Estimators of Population Mean

Analyzing Two-Dimensional Data. Analyzing Two-Dimensional Data

Chapter -2 Simple Random Sampling

Chapter 13, Part A Analysis of Variance and Experimental Design. Introduction to Analysis of Variance. Introduction to Analysis of Variance

Bootstrap Method for Testing of Equality of Several Coefficients of Variation

Multiple Linear Regression Analysis

X X X E[ ] E X E X. is the ()m n where the ( i,)th. j element is the mean of the ( i,)th., then

CLASS NOTES. for. PBAF 528: Quantitative Methods II SPRING Instructor: Jean Swanson. Daniel J. Evans School of Public Affairs

ε. Therefore, the estimate

ECONOMETRIC THEORY. MODULE VIII Lecture - 26 Heteroskedasticity

r y Simple Linear Regression How To Study Relation Between Two Quantitative Variables? Scatter Plot Pearson s Sample Correlation Correlation

Laboratory I.10 It All Adds Up

Special Instructions / Useful Data

Transcription:

Mea Same as ordary average Sum all the data values ad dvde by the sample sze. x = ( x + x +... + x Usg summato otato, we wrte ths as x = x = x = = ) x Mea s oly approprate for terval or rato scales, ot ordal or omal. Meda Value that dvdes the sample so that a equal umber of cases are above ad below. Sort the cases by magtude of x: x [ ] x[]... x[ ] If s odd, the meda s the mddle value: e.g. x =,, 5, 0,. Meda s 5. If s eve, meda s the average of the two mddle values: e.g., x =,, 5, 0. Meda s (+5)/ = 3.5 For test score example, sorted values are 3 8 9 0 4 4 7 7 8 8 8 8 8 9 9 3 3 3 33 33 35 So meda s 8. Meda vs. Mea Ulke the meda, the mea s sestve to extreme scores (outlers):,, 3, 4, 5 => mea=3, meda=3,, 3, 4, 00 => mea=, meda=3 I symmetrcal dstrbutos, mea ad meda wll be the same. I skewed dstrbutos, they wll be dfferet. (more later). So the meda s ofte preferred for varables lke come whch have a relatvely small umber of extremely hgh scores.

Varace How dfferet s each score from the mea? x x What s the average of these dffereces? ( x x ) = 0 Postve devatos cacel out the egatve Mea s the oly score for whch ths s true. How to fx?. Take absolute values before averagg: Mea absolute devato. x x. Square the devatos before averagg: Varace. ( x x) = s The square root of ths s called the stadard devato: s = ( x x) For ay ormal dstrbuto, the followg rule holds: 68% of the cases fall wth s.d. of the mea. 95% fall wth s.d.s of the mea. 99.7% fall wth 3 s.d.s of the mea.

Y 0.8 0.7 0.6 0.5 0.4 0.3 0. 0. 0.0-4 -3 - - 0 3 4 X DENSITY 0.40 0.35 0.30 0.5 0.0 0.5 0.0 0.05 0.00-4 -3 - - 0 3 4 X 3

Stadard Error Every statstc has a stadard error assocated wth t. Not always reported ad ot always easy to calculate. Examples: Watg tmes, compaes A measure of the ()accuracy of the statstc. A stadard error of 0 meas that the statstc has o radom error. The bgger the stadard error, the less accurate the statstc. Implct ths the dea that aythg we calculate a sample of data s subject to radom errors. The mea we calculated for the watg tmes s ot the true mea, but oly a estmate of the true mea. Eve f we could perfectly replcate our study, we would get a dfferet value for the mea. What are the sources of error? Classc approach statstcs: our data set may be oly a radom sample from some larger populato. We may make errors of measuremet. There are lots of other radom factors affectg our outcome that we ca t cotrol. The stadard error of a statstc s the stadard devato of that statstc across hypothetcal repeated samples. Example: 00 replcatos of watg tme study. I theory, eed to replcate a fte umber of tmes. 4

The stadard errors that are reported computer output are oly estmates of the true stadard errors. Remarkably, we ca estmate the varablty across repeated samples by usg the varablty wth samples. The more varablty wth the sample, the more varablty betwee samples. The formula for the stadard error of the mea s by the square root of the sample sze. s,.e., the stadard devato dvded I geeral, the bgger the sample, the smaller the stadard error. Why? Bg samples gve us more formato to estmate the quatty we re terested. The stadard error geerally goes dow wth the square root of the sample sze. Thus, f you quadruple the sample sze, you cut the stadard error half. Cofdece Itervals The stadard error s ofte used to costruct cofdece tervals. To costruct a 95 percet cofdece terval aroud the mea, add two stadard errors ad subtract two stadard errors. E.g., for the watg tme example, the mea was approx. 0 ad ts stadard error was. The the upper cofdece lmt s ad the lower cofdece lmt s 8. Iterpretato: we ca be 95 percet cofdet that the true mea s somewhere betwee 8 ad. Further terpretato: Suppose we could replcate our study may tmes. For each replcato we could costruct a 95 percet cofdece terval by addg ad subtractg stadard errors from the mea. The 95 percet of those cofdece tervals would cota the true mea. Why two stadard errors? Remember our rule for ormal dstrbutos: 95% of the cases fall wth two stadard devatos of the mea. 5

Eve though the orgal dstrbuto of watg tmes was ot well approxmated by a ormal dstrbuto, the dstrbuto of meas across repeated samples s approxmately ormal. Why? Cetral lmt theorem: Wheever you average a buch of thgs together, the resultg average teds to be approxmately ormally dstrbuted. The more thgs you add together, the closer the approxmato. I large samples, most statstcs have approxmately a ormal dstrbuto across repeated samples. Correlato A measure of the stregth of the relatoshp betwee two varables. There are may measures of correlato. The most commo s Pearso s product-momet correlato coeffcet, usually deoted by r. Facts about the correlato: Whe r s 0, there s o correlato betwee the two varables. Whe r s or, there s a perfect lear relatoshp. r caot be greater tha or less tha. The correlato measures the degree of scatter aroud a straght le. Correlato oly measures the lear relatoshp betwee two varables. Correlato s symmetrc: the correlato betwee x ad y s the same as the correlato betwee y ad x. 6