Uncertainty, Data, and Judgment

Similar documents
Mean is only appropriate for interval or ratio scales, not ordinal or nominal.

Chapter 13 Student Lecture Notes 13-1

Simple Linear Regression - Scalar Form

Summary of the lecture in Biostatistics

Lecture 7. Confidence Intervals and Hypothesis Tests in the Simple CLR Model

Lecture 3. Sampling, sampling distributions, and parameter estimation

STA 105-M BASIC STATISTICS (This is a multiple choice paper.)

Econometric Methods. Review of Estimation

ENGI 4421 Joint Probability Distributions Page Joint Probability Distributions [Navidi sections 2.5 and 2.6; Devore sections

Continuous Distributions

The expected value of a sum of random variables,, is the sum of the expected values:

Parameter, Statistic and Random Samples

ENGI 3423 Simple Linear Regression Page 12-01

The Mathematics of Portfolio Theory

Lecture Notes Types of economic variables

Econometrics. 3) Statistical properties of the OLS estimator

Law of Large Numbers

UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS

Example: Multiple linear regression. Least squares regression. Repetition: Simple linear regression. Tron Anders Moger

C. Statistics. X = n geometric the n th root of the product of numerical data ln X GM = or ln GM = X 2. X n X 1

Class 13,14 June 17, 19, 2015

Special Instructions / Useful Data

ESS Line Fitting

b. There appears to be a positive relationship between X and Y; that is, as X increases, so does Y.

Simulation Output Analysis

UNIVERSITY OF EAST ANGLIA. Main Series UG Examination

THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE

CHAPTER VI Statistical Analysis of Experimental Data

CLASS NOTES. for. PBAF 528: Quantitative Methods II SPRING Instructor: Jean Swanson. Daniel J. Evans School of Public Affairs

Chapter 4 Multiple Random Variables

Linear Regression with One Regressor

{ }{ ( )} (, ) = ( ) ( ) ( ) Chapter 14 Exercises in Sampling Theory. Exercise 1 (Simple random sampling): Solution:


Chapter Business Statistics: A First Course Fifth Edition. Learning Objectives. Correlation vs. Regression. In this chapter, you learn:

Simple Linear Regression

Ordinary Least Squares Regression. Simple Regression. Algebra and Assumptions.

Statistics MINITAB - Lab 5

Chapter 4: Elements of Statistics

Simple Linear Regression

1. The weight of six Golden Retrievers is 66, 61, 70, 67, 92 and 66 pounds. The weight of six Labrador Retrievers is 54, 60, 72, 78, 84 and 67.

Chapter 8: Statistical Analysis of Simulated Data

22 Nonparametric Methods.

Module 7. Lecture 7: Statistical parameter estimation

Arithmetic Mean Suppose there is only a finite number N of items in the system of interest. Then the population arithmetic mean is

residual. (Note that usually in descriptions of regression analysis, upper-case

Handout #8. X\Y f(x) 0 1/16 1/ / /16 3/ / /16 3/16 0 3/ /16 1/16 1/8 g(y) 1/16 1/4 3/8 1/4 1/16 1

ρ < 1 be five real numbers. The

ENGI 4421 Propagation of Error Page 8-01

Statistics: Unlocking the Power of Data Lock 5

12.2 Estimating Model parameters Assumptions: ox and y are related according to the simple linear regression model

Functions of Random Variables

Multiple Regression. More than 2 variables! Grade on Final. Multiple Regression 11/21/2012. Exam 2 Grades. Exam 2 Re-grades

STA302/1001-Fall 2008 Midterm Test October 21, 2008

Objectives of Multiple Regression

Chapter 4 Multiple Random Variables

Chapter 5 Properties of a Random Sample

Random Variables and Probability Distributions

Chapter 3 Sampling For Proportions and Percentages

Quantitative analysis requires : sound knowledge of chemistry : possibility of interferences WHY do we need to use STATISTICS in Anal. Chem.?

Correlation and Simple Linear Regression

Statistics Review Part 3. Hypothesis Tests, Regression

Lecture 1 Review of Fundamental Statistical Concepts

STATISTICAL INFERENCE

Mathematics HL and Further mathematics HL Formula booklet

Handout #1. Title: Foundations of Econometrics. POPULATION vs. SAMPLE

: At least two means differ SST

( ) = ( ) ( ) Chapter 13 Asymptotic Theory and Stochastic Regressors. Stochastic regressors model

Previous lecture. Lecture 8. Learning outcomes of this lecture. Today. Statistical test and Scales of measurement. Correlation

UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS

STATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS. x, where. = y - ˆ " 1

is the score of the 1 st student, x

IFYMB002 Mathematics Business Appendix C Formula Booklet

Qualifying Exam Statistical Theory Problem Solutions August 2005

Line Fitting and Regression

1 Solution to Problem 6.40

X ε ) = 0, or equivalently, lim

Lecture 8: Linear Regression

PROPERTIES OF GOOD ESTIMATORS

Multivariate Transformation of Variables and Maximum Likelihood Estimation

THE ROYAL STATISTICAL SOCIETY 2009 EXAMINATIONS SOLUTIONS GRADUATE DIPLOMA MODULAR FORMAT MODULE 2 STATISTICAL INFERENCE

Midterm Exam 1, section 1 (Solution) Thursday, February hour, 15 minutes

Chapter Two. An Introduction to Regression ( )

X X X E[ ] E X E X. is the ()m n where the ( i,)th. j element is the mean of the ( i,)th., then

Module 7: Probability and Statistics

Correlation and Regression Analysis

Chapter 5 Elementary Statistics, Empirical Probability Distributions, and More on Simulation

GOALS The Samples Why Sample the Population? What is a Probability Sample? Four Most Commonly Used Probability Sampling Methods

Simple Linear Regression and Correlation. Applied Statistics and Probability for Engineers. Chapter 11 Simple Linear Regression and Correlation

ECON 482 / WH Hong The Simple Regression Model 1. Definition of the Simple Regression Model

Measures of Dispersion

Section 2 Notes. Elizabeth Stone and Charles Wang. January 15, Expectation and Conditional Expectation of a Random Variable.

MEASURES OF DISPERSION

THE ROYAL STATISTICAL SOCIETY GRADUATE DIPLOMA

LECTURE - 4 SIMPLE RANDOM SAMPLING DR. SHALABH DEPARTMENT OF MATHEMATICS AND STATISTICS INDIAN INSTITUTE OF TECHNOLOGY KANPUR

Fitting models to data.

STA 108 Applied Linear Models: Regression Analysis Spring Solution for Homework #1

BASICS ON DISTRIBUTIONS

Topic 4: Simple Correlation and Regression Analysis

Analyzing Two-Dimensional Data. Analyzing Two-Dimensional Data

Midterm Exam 1, section 2 (Solution) Thursday, February hour, 15 minutes

Transcription:

Ucertaty, Data, ad Judgmet Sesso 06 Structure of the Course Topc Sesso Probablty -5 Estmato 6-8 Hypothess Testg 9-10 Regresso 11-16 1

Mcrosoft AND Itel (50-50) You vest $,500 MSFT ad $,500 INTC X = Aual retur o $1 vested Mcrosoft: X ~ N(1.1,0.1 ) Y = Aual retur o $1 vested Itel : Y ~ N(1.1,0.1 ) The your et proft s W = -5,000 +,500 X +,500 Y Questos: What s your average et proft? What s the varace of your et proft? How lkely are you to make postve et proft? MSFT vs. INTC weekly

Correlato Plots Facts About Covarace Idcates the drecto of a relatoshp If COV[X,Y] < 0 X ad Y are versely related If COV[X,Y] > 0 X ad Y are postvely related If X ad Y are depedet COV[X,Y] = 0 3

Facts About Correlato Ut-free measure of the stregth of a lear relatoshp Les betwee -1 ad 1. If X ad Y are depedet, the CORR[X,Y] = 0. CORR[X,Y] COV[X,Y] X Y Correlato vs. Causato 4

Gettg Dzzy. W = a + b X + c Y E(W) = a + be(x) + ce(y) Var(W) = b Var(X) + c Var(Y) + bc[cov(x, Y)] Mcrosoft AND Itel (50-50) You vest $,500 MSFT ad $,500 INTC X ~ N(1.1,0.1 ), Y ~ N(1.1,0.1 ) Your et proft s W = -5,000 +,500 X +,500 Y W ~ N( W, w ) W = E(W)= -5,000 +,500 X +,500 Y = 500 W = Var(W) =,500 X +,500 Y + *,500*,500 * COV (X,Y) =,500 X +,500 Y + *,500*,500 * X Y CORR(X,Y) 5

INVESTING IN MICROSOFT AND/OR INTEL Sx Cases Case Acto Correlato E(W) St.Dev(W) 1 All Mcrosoft N/A ½ Mcrosoft + ½ Itel 0 3 ½ Mcrosoft + ½ Itel 0.6 4 ½ Mcrosoft + ½ Itel -0.6 5 ½ Mcrosoft + ½ Itel 1 6 ½ Mcrosoft + ½ Itel -1 Ivestmet: Dstrbuto of et profts Correlato E(W) St.Dev(W) 0 500 353.55 0.6 500 447.1-0.6 500 3.61 1 500 500-1 500 0 6

Fuctos of Radom Varables Today s materal s relevat to several mportat practcal problems: Moder Portfolo Theory Rsk Maagemet Reveue Plag Producto Plag Ad we oly scratched the surface. Key Questos UDJ 1. How to look at Data? (1-3) Judgmet ssues Measures of locato & dsperso Measures of extremes (Chebyshev s T., Emprcal Rule) How to model Ucertaty? (4-5) Dstrbutos: Bomal, Posso, Normal Fuctos of Radom Varables. How to make Estmatos? (6-8) Make educated guesses about populato parameters Say how cofdet you are those guesses 3. How to make Decsos based o data? (9-10) Hypothess testg / Correct Observatos/Theores 4. How to make Predctos based o data? (11-16) Regresso / forecastg 7

Some practcal questos What s the average salary of a INSEAD MBA after 5 years? What s the expected market sze for the Pad/a ew razor? What s the proporto of defectve cars produced by Toyota? What s the average rate of arrvals at a bak/hosptal? What percetage of techology startups go bust? What s the average watg tme for a tax? What s the average umber of vewers of a TV ad? What s the uemploymet rate the US? What s the average alcohol cosumpto at a INSEAD party? Populato x 1, x,..., x N Radom sample of small sze Parameters = Populato mea = Populato stadard devato Estmates Sample x 1,x,...,x X = Sample mea s = Sample stadard devato 8

Our Goal 1. Estmate the populato mea usg X. Quatfy the possble error we make usg X as a estmate of Mea ad Varace Populato Sample Mea Varace SD N x x x x x N s x 1 x x x N s s 1 9

What s the Populato Hstogram? The Cetral Lmt Theorem* For a radom sample of sze 30 take from a populato wth mea ad stadard devato : The sample mea X ~ Normal hstogram X = X = / * Ths holds for ay populato hstogram of X. s use as a estmate of If we do t kow sgma, the IF 30 10

Key Questo What s the max. possble error we make? Buldg a cofdece terval A Illustrato of the dervato of 95% cofdece tervals. 95% 1. 96 1. 96 X 3 0 3 Z 1.96 11

95% Cofdece Itervals From the dagram, we ca wrte: P 1.96 X 1.96 0.95 Algebra P X 1.96 X 1.96 0.95 Example The WSJ reports that from a study of 300 subscrbers they fd that the average come s 80.000 ad the std s 15.000. How close to that would the real average (of ALL readers) be wth 95% probablty/cofdece? 1

A Illustrato of the dervato of 95% cofdece tervals. 95% 1. 96 1. 96 X 3 0 3 Z 1.96 100(1-a)% Za Za X 0 Z Z a/ 13

100(1-a)% Cofdece Itervals From the dagram, we ca wrte: P Z X Z 1 a a a Algebra P X Z X Z 1 a a a Case 1: s kow, 30 X - / ~ Z Case : s ukow, 30 X - s/ ~ Z Cofdece Iterval Lmts X Z a X Z a s 14

Example: 99% Cofdece Aother way If you sample 100 subscrbers, what s the probablty that the error of your estmated average salary from these 100 subscrbers s less tha 1000? 15

Aother way The std of the come of ALL subscrbers to the WSJ s 15.000 How may subscrbers should we sample to be 95% cofdet that the average come of ALL the subscrbers s wth X of the sample mea? 16