Business Statistics: 41000

Size: px
Start display at page:

Download "Business Statistics: 41000"

Transcription

1 Business Statistics: Week 3: Learning from Data: Estimation Nick Polson The University of Chicago Booth School of Business Suggested Reading OpenIntro Statistics, Chapters 4,5&6

2 Why R? R is free! All the code for the course and assignments are in Rexamples.R! Most Data Scientists are using R. R s syntax is very simple. There are a large number of add-on statistical packages that can be installed and run in R. You don t need to code anything. You can use your favorite software: Excel, Stata, Matlab.

3 Statistics with R You can download R and Rstudio for Windows, Linux, Mac at Rstudio is a very useful front-end interface to R. Links, code for class, video tutorials are up on the course-page. A list of books on doing statistics in R is at Start by watching R videos for Libraries and Packages/Files and Data.

4 Standard R Commands Given vectors x and y we can apply mean(... ) computes the sample mean median(... ) computes the median var(... sd(... cov(... cor(... ) computes the sample variance ) computes the sample standard deviation ) computes the sample covariance ) computes the sample correlation pnorm(... hist(... ) calculates normal probabilities ) makes histograms lm(... ) for linear model (a.k.a regression) summary(... ) provides a summary analysis of the output

5 Stats in Excel average(... median(... ) computes the sample mean ) computes the median var(... ) computes the sample variance stdev(... ) computes the sample standard deviation covar(...,... ) computes the sample covariance normsdist(... histogram(... ) calculates normal probabilities ) makes histograms Regression commands include linest(yrange,xrange) for linear model (a.k.a multiple regression) slope(yrange,xrange) provides the regression β

6 Google 2016 Let X be daily returns. We assume that returns are independent and identically distributed as X N (µ, σ 2) Question: What are the values of µ and σ? Let s assume that each observation in the random sample {x 1, x 2,..., x n } is independent and distributed according to the model above, X i N(µ, σ 2 ). Then we use the sample mean, x, and sample variance, s 2, x = 1 n n i=1 s 2 = 1 n 1 x i n (x i x) 2 i=1

7 Google 2018 Prices and Returns

8 Google 2018 Assume that I invest all my money in Google! Your job is to tell me the following: 1. What is my expected one year return? 2. How about the risk? 3. What s the probability that I lose 3% in a day?

9 Google 2018 We can use our model and R to answer our questions >mean(ret) [1] >sd(ret) [1] Annual expected return = 17.8% a year Daily risk or volatility of 1.86% How about losing money? >pnorm(-0.03, , ) [1] There s a 4.9% chance I can lose 3% in a day

10 Sampling Distribution of Sample Mean Consider the mean for an iid sample of n observations of a random variable, denoted by {x 1,..., x n } If n is large enough ) X N (µ, σ2 n This is called the sampling distribution of the mean... Also known as the Central Limit Theorem (CLT)

11 Sampling Distribution of Sample Mean The sampling distribution explains how much our estimate X will vary over different datasets of size n We ll use s 2 to estimate σ 2 and use ) X N (µ, s2 n The quantity s 2 X = s2 n defines the standard deviation s X = s n.

12 Confidence Intervals In summary, our best guess at µ is x How large a mistake can we make? ( ) The distribution is X N µ, s 2 X [ X ± 1.96s X ] give us a 95% range of plausible values for µ We call it a 95% Confidence Interval.

13 Prediction intervals 1. A confidence interval estimates the mean, µ. 2. A prediction interval answers the question: What range of values are plausible for a single future observation? 100(1 α%) of the data lie in the interval x ± z α/2 s 1 + 1/n. For example, if α = 0.05, we have z 0.05/2 = 1.96.

14 Standard Error and Confidence Interval for a Proportion Suppose that we have a proportion ˆp = X /n rather than a mean We can compute the standard error for the proportion: ˆp(1 ˆp) sˆp = n Then we can calculate a 95% Confidence Interval ˆp ± 1.96 sˆp

15 Example: Google Search Algorithm Google is testing a new search algorithm. They experiment with 2, 500 searches and check how often people clicked through. Here s our data: Table: Google Search Algorithm Algo1 Algo2 success failure The probability of success is estimated to be ˆp 1 = for the current algorithm and ˆp 2 = for the new algorithm. Is the new algorithm better? For sure??

16 Example: Google Search We could calculate 95% confidence intervals separately For Algo 1: For Algo 2: ± 1.96 = (0.693, 0.711) ± 1.96 = (0.718, 0.735) 2500 What do you think? Maybe its more accurate to do the difference!!

17 Standard Error for the Difference in Means Suppose that we have two samples We can compute the standard error for the difference in means: s X1 X2 = s 2 X1 n 1 + s2 X2 n 2 We can compute the standard error for the difference in proportions: sˆp1 ˆp 2 = ˆp 1 (1 ˆp 1 ) n 1 + ˆp 2(1 ˆp 2 ) n 2

18 Confidence Interval for the Difference in Means & Proportions We can now compute the confidence interval for the difference in means: ( ˆX1 ˆX2 ) ± 1.96 s X1 X2 or the confidence interval for the difference in proportions: (ˆp 1 ˆp 2 ) ± 1.96 sˆp1 ˆp 2

19 Example: Google Search Now we get sˆp1 ˆp 2 = = The Confidence Interval becomes ( ) ± = ( 0.05, 0.00) What s our conclusion now?

20 Let s revisit the Patriots example... The data tells us that the Patriots have won 19 out 25 tosses. Assume they were using the same coin the entire time and that the Patriots always choose heads... What is the data telling us about the probability of heads in this coin?

21 Patriots and Coin Tosses Our best guess at p is ˆp = = 0.76 We have a 95% Confidence Interval given by ± = 0.76 ± The interval is (0.592, 0.927). Why so wide? What do we conclude??

22 Conclusions There s a number of things: 1. Estimates are based on random samples and therefore random (uncertain) themselves We need to account for this uncertainty! 2. Standard Error measures the uncertainty of an estimate 3. We construct 95% Confidence Intervals This provides us with a plausible range for the quantity we are trying to estimate.

23 Poisson Distribution The Poisson distribution counts the occurrence of events Given the rate λ we calculate probabilities as follows P(X = x) = e λ λ x x! where x = 0, 1, 2, 3,... The Poisson Mean and Variance are: Poisson Distribution Parameters Expected value µ = E(X) = λ Variance σ 2 = Var(X) = λ λ is the rate of occurrence of an event.

24 Poisson Distribution Frequency lambda=5 lambda=2 Distn Poisson Poisson Figure: Comparing Poisson Distributions

25 Poisson Probabilities Suppose that there are 1.5 homicides a week. The parameter λ is the rate, so λ = 1.5 Poisson distribution tells us that there is a still a 1.4% chance of seeing 5 homicides in a week P(X = 5) = e 1.5 (1.5) 5 5! = On average this will happen once every 1/0.014 = 71 weeks

26 Poisson Probabilities What s the chance of having no homicides in a week? Poisson distribution gives P(X = 0) = e 1.5 (1.5) 0 0! = 0.22 So the probability is as high as 22 % You shouldn t be surprised to see this happening The underlying rate is still 1.5 per week. In R: dpois(x,lambda) and rpois(1000,lambda)

27 English Premier League (EPL) How do we calculate odds for the possible scores in a match? 0 0, 1 0, 0 1, 1 1, 2 0,... Let X = Goals scored by Arsenal and Y = Goals scored by Liverpool What s the odds of a team winning? P (X > Y ) Odds s of a draw? P (X = Y ) z1 = rpois(100,0.6) z2 = rpois(100,1.4) sum(z1<z2)/100 sum(z1=z2)/100 # Team 2 wins # Draw

28 Chelsea EPL 2014 Let s take a historical set of data on scores Then estimate λ with the sample mean 1 Chelsea 2 0 Hull City 2 Chelsea 2 1 Aston Villa 3 Manchester United 0 0 Chelsea 4 Everton 1 0 Chelsea 5 Chelsea 2 0 Fulham 6 Tottenham Hotspur 1 1 Chelsea...

29 EPL Chelsea Figure: Poisson Distribution for Goals Scored Our Poisson model fits the empirical data!!

30 EPL Chelsea Figure: Poisson Distribution for Goals Against Chelsea has a great defence Again our model fits the outcomes

31 EPL: Poisson Distribution Given λ the average number of goals scored

32 EPL Attack and Defence Strength Each team gets an attack strength and defence weakness rating Adjust home and away average goal estimates

33 Modeling Hull vs ManU Poisson Distribution ManU Average away goals = Prediction: = 2.12 Attack strength times Hull s defense weakness times average Hull Average home goals = Prediction: = Simulation

34 EPL Predictions A model is only as good as its predictions

35 Business Statistics: Week 4: A/B Testing Nick Polson

36 A/B Testing Scientific Testing is the underpinning of many things... Controlled experiment (designed) or observational study 1. A Statistic. Sample mean, proportion, difference in means, A Null hypothesis: H 0 : µ = µ 0, H 0 : p = p 0,... A level of significance α 3. Sampling distribution: The probability distribution of the statistics values (Normal, t)

37 Sampling distribution of a Proportion Suppose the r.v. X is the number of successes in a binomial sample of n trials, whose probability of success is p. The sample proportion is ˆp = X /n 1. E(ˆp) = p 2. It has standard error s.e.( ˆp) = p(1 p) n. 3. If n is large (np(1 p) > 9 or roughly n 40), ˆp N(p, σ 2ˆp ) = N(p, p(1 p)/n).

38 Z and T-Score How many standard deviations are you away from the mean? We ll address this with a Z or T -score If σ is unknown, we ll use s instead Z = X µ σ/ n T = X µ s/ n For small samples, Student s t distribution with (n 1) degrees of freedom. Guinness: Testing the quality of beer in 1908.

39 Coke versus Pepsi The most famous hypothesis test in history in whether people can decide the difference between Coke and Pepsi Double Blind: neither the experimenter or subject know the allocation Pepsi claimed that more than 1 2 of Diet Coke drinkers said they preferred to drink Diet Pepsi Suppose we take a random sample of 100 drinkers and find that 56 favor Pepsi?

40 Coke versus Pepsi This is a hypothesis test about the proportion of drinkers who prefer Pepsi H 0 : p = 1 2 and H 1 : p > 1 2 ˆp = X /n = 56/100 = 0.56 This is my best estimate of the true p se(ˆp) The standard error of my statistic ˆp(1 ˆp)/n = The 95% confidence interval is then 0.56 ± 1.96(0.0496) = 0.56 ± = (0.463, 0.657) p = 0.5 lies inside the confidence interval. Conclusion?

41 Testing The Z -score now with sˆp = p 0 (1 p 0 )/n = 0.05 Let s take the usual α = 0.05 Z = ˆp p = = 1.2 < 1.64 sˆp 0.05 Don t reject H 0 for a one-sided test at 5% level. We need a larger n to come to a more definitive conclusion. Common to take n = 1000, 2000 As a Hypothesis test: prop.test(56,100,alternative=''greater'', conf.level = 0.95)

42 A/B Testing The key elements of hypothesis testing are: 1. Setting Up a Hypothesis Test 2. Significance Levels 3. p-values 4. Testing a Mean and Difference in Means 5. Testing a Proportion 6. Tests for Small Samples

43 What is Hypothesis Testing? A hypothesis is a statement about a population developed for the purpose of testing with data Step 1: Null Hypothesis (H 0 ): assume to be true unless there is sufficient evidence to the contrary. Alternative Hypothesis (H 1 ): test against the null. If there is evidence that H 0 is false, we accept H 1. Step 2: Select the significance level α. α = 0.05 (the 5% level) is the most commonly used. α = 0.01 (the 1% level) is prevalent in medical and quality assurance examples.

44 Hypothesis Testing Making a decision Step 3: Compute the Test Statistic (Z or T ) Step 4: Formulate the Decision Rule For example, reject the Null hypothesis if Z > 1.96 Step 5: Make a Decision, Compute the p-value. p-value: The smallest significance level at which a null hypothesis can be rejected.

45 Type I and II Errors There are two types of errors you can make when testing 1. Type I Error: Rejecting a true H Type II Error: Not rejecting a false H 0. Significance Level: P(rejectH 0 H 0 true) = P(type I error).

46 Type I and II Errors Economist

47 Scientific Reporting Higgs Boson The five-sigma concept is somewhat counter-intuitive. It has to do with a one-in-3.5-million probability. That is not the probability that the Higgs boson doesn t exist. It is, rather, the inverse: If the particle doesn t exist, one in 3.5 million is the chance an experiment just like the one announced this week would nevertheless come up with a result appearing to confirm it does exist. In other words, one in 3.5 million is the likelihood of finding a false positive a fluke produced by random statistical fluctuation that seems as definitive as the findings released by two teams of researchers at the CERN laboratory in Geneva. p-value one-in-3.5-million and Z-score Z = 5.

48 Biogen and Avonex Biogen made the following assertion: Avonex delivers the highest rate of satisfaction: 95% among patients 1. The U.S. Food and Drug Administration (FDA) on October 30th, 2002 informed the biotech company Biogen to stop publishing misleading promotions for its multiple sclerosis drug Avonex. 2. The FDA found that in a random sample of 75 patients surveyed, only 60% said they were very satisfied with Avonex. Who is Right?

49 Setting up a Hypothesis Test Step 1: The null hypothesis: H 0 : p = 0.95 = p 0. The alternative hypothesis: H 1 : p < A 1-sided alternative. Step 2: We ll use a small significance level, 1%. Step 3: The appropriate test statistic is where ˆp = 0.60, p 0 = 0.95 and n = 75. Hence Z = = 14. Z = ˆp p 0 p 0 (1 p 0 ) n

50 Critical Region Now lets find the critical region and p-value Step 4: The critical region is Z < As the observed test statistic Z falls well within the rejection region. We reject H 0 at the 1% level. Step 5: The p-value of the test is P(Z < 14) = Again the statistical evidence is that the FDA is right and Biogen is not.

51 Avonex: Testing Proportions in R Null Hypothesis: Biogen is innocent prop.test(45,75,0.95) data: 45 out of 75, null probability percent confidence interval: sample estimates: p 0.6 The p-value is !

52 Pfizer Pfizer introduced Viagra in early 1998 During 1998 of the 6 million Viagra users 77 died from coronary problems such as heart attacks. Pfizer claimed that this rate is no more than the general population. A clinical study found 11 out of 1, 500, 000 men who were not on Viagra died of coronary problems during the same length of time as the 77 Viagra users who died in Is this statistically significant?

53 Confidence Interval A 95% confidence interval for a difference in proportions p 1 p 2 is (ˆp 1 ˆp 2 ) ± 1.96 ˆp 1 (1 ˆp 1 ) n 1 + ˆp 2(1 ˆp 2 ) n 2 1. Can do a confidence interval or a Z -score test. 2. With Viagra, ˆp 1 = 77/ = and without Viagra ˆp 2 = 11/ = Need to test whether these are equal.

54 Confidence Interval With a 95% confidence interval for (p 1 p 2 ) you get an interval ( , ) This doesn t contain zero. The evidence is that the proportion is higher. 1. Measured very accurately as n is large even though p is small. 2. With testing might use a one-sided test and an α of 0.01.

55 Viagra Difference of proportions: prop.test(x=c(11,77), n=c( , ), alternative= greater,conf.level=.95) 2-sample test for equality of proportions with continuity correction data: c(11, 77) out of c( , 6e+06) X-squared = , df = 1, p-value = percent confidence interval: e e+00 sample estimates: prop 1 prop e e-05

56 Discovering Argon Lord Rayleigh won the Nobel Prize for discovering Argon. This discovery occurred when he noticed a small discrepancy between two sets of measurements on nitrogen gas that he had extracted from the air and one he had made in the lab. 1. First, he removed all oxygen from a sample of air. He measured the density of the remaining gas in a fixed volume at constant temperature and pressure. 2. Second, he prepared the same volume of pure nitrogen by the chemical decomposition of nitrous oxide (N 2 O) and nitric oxide NO. Here s the results

57 Data Lord Rayleigh''s Measured Masses of Nitrogen Lord Rayleigh s data... Mass (g) Nitrogen from Air Nitrogen from Chemical Decomposition From air From chemical decomposition average st dev

58 Discovering Argon in R t.test(a,b,var.equal=t) t = , df = 13, p-value = 3.321e-11 alternative hypothesis: true difference in means is not equal to 0 95 percent confidence interval: sample estimates: mean of x mean of y There s a T = 20.21! We ve found Argon!

59 Electronic Arts EA SimCity 5, one of EA s most popular video games, sold 1.1 million copies in the first two weeks of its launch last year. 50% of digital sales due to a A/B testing strategy. Promotion banner or not? Surprising results: The variation with no offer messaging whatsoever drove 43.4% more purchases. Confounding variables?

60 R: abtest Examine whether a black or pink background results in more purchases Run experiment for one week: Pink background: 40% purchase rate with 500 visitors Black background: 30% purchase rate with 550 visitors abtest to see which is more effective

61 R: abtest CI: conversion rates for each variation of site abtestfunc function calculates CIs (80% significance, Z = 1.28) site1 = c(.40, 500) # pink site2 = c(.30, 550) # black abtestfunc <- function(ad1, ad2){ sterror1 = sqrt( ad1[1] * (1-ad1[1]) / ad1[2] ) sterror2 = sqrt( ad2[1] * (1-ad2[1]) / ad2[2] ) minmax1 = c((ad1[1] *sterror1) * 100, (ad1[1] *sterror1) * 100) minmax2 = c((ad2[1] *sterror2) * 100, (ad2[1] *sterror2) * 100) print( round(minmax1,2) ) print( round(minmax2,2) ) } abtestfunc(site1, site2) Purchase rate for the pink background is significantly higher

62 Google Google offers A/B Testing Figure: Steps to create A/B testing on Google

63 Netflix Netflix Every product change Netflix considers goes through a rigorous A/B testing process before becoming the default user experience. Team of data scientists constantly A/B test their innovations to adjust adaptive streaming and content delivery network algorithms.

64 Summary Learning from Data Estimating Parameters and Fitting Distributions Confidence and Prediction Intervals Means, Proportions, Differences A/B Testing

Section 2: Estimation, Confidence Intervals and Testing Hypothesis

Section 2: Estimation, Confidence Intervals and Testing Hypothesis Section 2: Estimation, Confidence Intervals and Testing Hypothesis Carlos M. Carvalho The University of Texas at Austin McCombs School of Business http://faculty.mccombs.utexas.edu/carlos.carvalho/teaching/

More information

University of Chicago Graduate School of Business. Business 41000: Business Statistics

University of Chicago Graduate School of Business. Business 41000: Business Statistics Name: OUTLINE SOLUTION University of Chicago Graduate School of Business Business 41000: Business Statistics Special Notes: 1. This is a closed-book exam. You may use an 8 11 piece of paper for the formulas.

More information

Probability and Probability Distributions. Dr. Mohammed Alahmed

Probability and Probability Distributions. Dr. Mohammed Alahmed Probability and Probability Distributions 1 Probability and Probability Distributions Usually we want to do more with data than just describing them! We might want to test certain specific inferences about

More information

Final Exam Bus 320 Spring 2000 Russell

Final Exam Bus 320 Spring 2000 Russell Name Final Exam Bus 320 Spring 2000 Russell Do not turn over this page until you are told to do so. You will have 3 hours minutes to complete this exam. The exam has a total of 100 points and is divided

More information

Expectations and Variance

Expectations and Variance 4. Model parameters and their estimates 4.1 Expected Value and Conditional Expected Value 4. The Variance 4.3 Population vs Sample Quantities 4.4 Mean and Variance of a Linear Combination 4.5 The Covariance

More information

Econ 325: Introduction to Empirical Economics

Econ 325: Introduction to Empirical Economics Econ 325: Introduction to Empirical Economics Chapter 9 Hypothesis Testing: Single Population Ch. 9-1 9.1 What is a Hypothesis? A hypothesis is a claim (assumption) about a population parameter: population

More information

First we look at some terms to be used in this section.

First we look at some terms to be used in this section. 8 Hypothesis Testing 8.1 Introduction MATH1015 Biostatistics Week 8 In Chapter 7, we ve studied the estimation of parameters, point or interval estimates. The construction of CI relies on the sampling

More information

Homework Exercises. 1. You want to conduct a test of significance for p the population proportion.

Homework Exercises. 1. You want to conduct a test of significance for p the population proportion. Homework Exercises 1. You want to conduct a test of significance for p the population proportion. The test you will run is H 0 : p = 0.4 Ha: p > 0.4, n = 80. you decide that the critical value will be

More information

16.400/453J Human Factors Engineering. Design of Experiments II

16.400/453J Human Factors Engineering. Design of Experiments II J Human Factors Engineering Design of Experiments II Review Experiment Design and Descriptive Statistics Research question, independent and dependent variables, histograms, box plots, etc. Inferential

More information

Are data normally normally distributed?

Are data normally normally distributed? Standard Normal Image source Are data normally normally distributed? Sample mean: 66.78 Sample standard deviation: 3.37 (66.78-1 x 3.37, 66.78 + 1 x 3.37) (66.78-2 x 3.37, 66.78 + 2 x 3.37) (66.78-3 x

More information

Mock Exam - 2 hours - use of basic (non-programmable) calculator is allowed - all exercises carry the same marks - exam is strictly individual

Mock Exam - 2 hours - use of basic (non-programmable) calculator is allowed - all exercises carry the same marks - exam is strictly individual Mock Exam - 2 hours - use of basic (non-programmable) calculator is allowed - all exercises carry the same marks - exam is strictly individual Question 1. Suppose you want to estimate the percentage of

More information

Business Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing

Business Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing Business Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing Agenda Introduction to Estimation Point estimation Interval estimation Introduction to Hypothesis Testing Concepts en terminology

More information

CONTINUOUS RANDOM VARIABLES

CONTINUOUS RANDOM VARIABLES the Further Mathematics network www.fmnetwork.org.uk V 07 REVISION SHEET STATISTICS (AQA) CONTINUOUS RANDOM VARIABLES The main ideas are: Properties of Continuous Random Variables Mean, Median and Mode

More information

Probability and Statistics

Probability and Statistics Probability and Statistics Kristel Van Steen, PhD 2 Montefiore Institute - Systems and Modeling GIGA - Bioinformatics ULg kristel.vansteen@ulg.ac.be CHAPTER 4: IT IS ALL ABOUT DATA 4a - 1 CHAPTER 4: IT

More information

Probability Distribution. Stat Camp for the MBA Program. Debbon Air Seat Release

Probability Distribution. Stat Camp for the MBA Program. Debbon Air Seat Release Stat Camp for the MBA Program Daniel Solow Lecture 3 Random Variables and Distributions 136 Probability Distribution Recall that a random variable is a quantity of interest whose value is uncertain and

More information

The t-distribution. Patrick Breheny. October 13. z tests The χ 2 -distribution The t-distribution Summary

The t-distribution. Patrick Breheny. October 13. z tests The χ 2 -distribution The t-distribution Summary Patrick Breheny October 13 Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/25 Introduction Introduction What s wrong with z-tests? So far we ve (thoroughly!) discussed how to carry out hypothesis

More information

Hypothesis testing. Data to decisions

Hypothesis testing. Data to decisions Hypothesis testing Data to decisions The idea Null hypothesis: H 0 : the DGP/population has property P Under the null, a sample statistic has a known distribution If, under that that distribution, the

More information

CENTRAL LIMIT THEOREM (CLT)

CENTRAL LIMIT THEOREM (CLT) CENTRAL LIMIT THEOREM (CLT) A sampling distribution is the probability distribution of the sample statistic that is formed when samples of size n are repeatedly taken from a population. If the sample statistic

More information

z and t tests for the mean of a normal distribution Confidence intervals for the mean Binomial tests

z and t tests for the mean of a normal distribution Confidence intervals for the mean Binomial tests z and t tests for the mean of a normal distribution Confidence intervals for the mean Binomial tests Chapters 3.5.1 3.5.2, 3.3.2 Prof. Tesler Math 283 Fall 2018 Prof. Tesler z and t tests for mean Math

More information

CS 361: Probability & Statistics

CS 361: Probability & Statistics February 19, 2018 CS 361: Probability & Statistics Random variables Markov s inequality This theorem says that for any random variable X and any value a, we have A random variable is unlikely to have an

More information

Advanced Herd Management Probabilities and distributions

Advanced Herd Management Probabilities and distributions Advanced Herd Management Probabilities and distributions Anders Ringgaard Kristensen Slide 1 Outline Probabilities Conditional probabilities Bayes theorem Distributions Discrete Continuous Distribution

More information

Exam Empirical Methods VU University Amsterdam, Faculty of Exact Sciences h, February 12, 2015

Exam Empirical Methods VU University Amsterdam, Faculty of Exact Sciences h, February 12, 2015 Exam Empirical Methods VU University Amsterdam, Faculty of Exact Sciences 18.30 21.15h, February 12, 2015 Question 1 is on this page. Always motivate your answers. Write your answers in English. Only the

More information

Review of Statistics 101

Review of Statistics 101 Review of Statistics 101 We review some important themes from the course 1. Introduction Statistics- Set of methods for collecting/analyzing data (the art and science of learning from data). Provides methods

More information

Business Analytics and Data Mining Modeling Using R Prof. Gaurav Dixit Department of Management Studies Indian Institute of Technology, Roorkee

Business Analytics and Data Mining Modeling Using R Prof. Gaurav Dixit Department of Management Studies Indian Institute of Technology, Roorkee Business Analytics and Data Mining Modeling Using R Prof. Gaurav Dixit Department of Management Studies Indian Institute of Technology, Roorkee Lecture - 04 Basic Statistics Part-1 (Refer Slide Time: 00:33)

More information

Math 143: Introduction to Biostatistics

Math 143: Introduction to Biostatistics Math 143: Introduction to Biostatistics R Pruim Spring 2012 0.2 Last Modified: April 2, 2012 Math 143 : Spring 2012 : Pruim Contents 10 More About Random Variables 1 10.1 The Mean of a Random Variable....................................

More information

Expectations and Variance

Expectations and Variance 4. Model parameters and their estimates 4.1 Expected Value and Conditional Expected Value 4. The Variance 4.3 Population vs Sample Quantities 4.4 Mean and Variance of a Linear Combination 4.5 The Covariance

More information

Discrete probability distributions

Discrete probability distributions Discrete probability s BSAD 30 Dave Novak Fall 08 Source: Anderson et al., 05 Quantitative Methods for Business th edition some slides are directly from J. Loucks 03 Cengage Learning Covered so far Chapter

More information

Business Statistics Midterm Exam Fall 2015 Russell. Please sign here to acknowledge

Business Statistics Midterm Exam Fall 2015 Russell. Please sign here to acknowledge Business Statistics Midterm Exam Fall 5 Russell Name Do not turn over this page until you are told to do so. You will have hour and 3 minutes to complete the exam. There are a total of points divided into

More information

Chapter 5: HYPOTHESIS TESTING

Chapter 5: HYPOTHESIS TESTING MATH411: Applied Statistics Dr. YU, Chi Wai Chapter 5: HYPOTHESIS TESTING 1 WHAT IS HYPOTHESIS TESTING? As its name indicates, it is about a test of hypothesis. To be more precise, we would first translate

More information

Hypothesis Testing and Confidence Intervals (Part 2): Cohen s d, Logic of Testing, and Confidence Intervals

Hypothesis Testing and Confidence Intervals (Part 2): Cohen s d, Logic of Testing, and Confidence Intervals Hypothesis Testing and Confidence Intervals (Part 2): Cohen s d, Logic of Testing, and Confidence Intervals Lecture 9 Justin Kern April 9, 2018 Measuring Effect Size: Cohen s d Simply finding whether a

More information

Hypothesis tests

Hypothesis tests 6.1 6.4 Hypothesis tests Prof. Tesler Math 186 February 26, 2014 Prof. Tesler 6.1 6.4 Hypothesis tests Math 186 / February 26, 2014 1 / 41 6.1 6.2 Intro to hypothesis tests and decision rules Hypothesis

More information

PHP2510: Principles of Biostatistics & Data Analysis. Lecture X: Hypothesis testing. PHP 2510 Lec 10: Hypothesis testing 1

PHP2510: Principles of Biostatistics & Data Analysis. Lecture X: Hypothesis testing. PHP 2510 Lec 10: Hypothesis testing 1 PHP2510: Principles of Biostatistics & Data Analysis Lecture X: Hypothesis testing PHP 2510 Lec 10: Hypothesis testing 1 In previous lectures we have encountered problems of estimating an unknown population

More information

Statistics Boot Camp. Dr. Stephanie Lane Institute for Defense Analyses DATAWorks 2018

Statistics Boot Camp. Dr. Stephanie Lane Institute for Defense Analyses DATAWorks 2018 Statistics Boot Camp Dr. Stephanie Lane Institute for Defense Analyses DATAWorks 2018 March 21, 2018 Outline of boot camp Summarizing and simplifying data Point and interval estimation Foundations of statistical

More information

MATH 10 INTRODUCTORY STATISTICS

MATH 10 INTRODUCTORY STATISTICS MATH 10 INTRODUCTORY STATISTICS Ramesh Yapalparvi Week 2 Chapter 4 Bivariate Data Data with two/paired variables, Pearson correlation coefficient and its properties, general variance sum law Chapter 6

More information

SYSM 6303: Quantitative Introduction to Risk and Uncertainty in Business Lecture 4: Fitting Data to Distributions

SYSM 6303: Quantitative Introduction to Risk and Uncertainty in Business Lecture 4: Fitting Data to Distributions SYSM 6303: Quantitative Introduction to Risk and Uncertainty in Business Lecture 4: Fitting Data to Distributions M. Vidyasagar Cecil & Ida Green Chair The University of Texas at Dallas Email: M.Vidyasagar@utdallas.edu

More information

Class 26: review for final exam 18.05, Spring 2014

Class 26: review for final exam 18.05, Spring 2014 Probability Class 26: review for final eam 8.05, Spring 204 Counting Sets Inclusion-eclusion principle Rule of product (multiplication rule) Permutation and combinations Basics Outcome, sample space, event

More information

STAT 4385 Topic 01: Introduction & Review

STAT 4385 Topic 01: Introduction & Review STAT 4385 Topic 01: Introduction & Review Xiaogang Su, Ph.D. Department of Mathematical Science University of Texas at El Paso xsu@utep.edu Spring, 2016 Outline Welcome What is Regression Analysis? Basics

More information

Medical statistics part I, autumn 2010: One sample test of hypothesis

Medical statistics part I, autumn 2010: One sample test of hypothesis Medical statistics part I, autumn 2010: One sample test of hypothesis Eirik Skogvoll Consultant/ Professor Faculty of Medicine Dept. of Anaesthesiology and Emergency Medicine 1 What is a hypothesis test?

More information

STAT 135 Lab 5 Bootstrapping and Hypothesis Testing

STAT 135 Lab 5 Bootstrapping and Hypothesis Testing STAT 135 Lab 5 Bootstrapping and Hypothesis Testing Rebecca Barter March 2, 2015 The Bootstrap Bootstrap Suppose that we are interested in estimating a parameter θ from some population with members x 1,...,

More information

Lecture 1: Probability Fundamentals

Lecture 1: Probability Fundamentals Lecture 1: Probability Fundamentals IB Paper 7: Probability and Statistics Carl Edward Rasmussen Department of Engineering, University of Cambridge January 22nd, 2008 Rasmussen (CUED) Lecture 1: Probability

More information

CSE 103 Homework 8: Solutions November 30, var(x) = np(1 p) = P r( X ) 0.95 P r( X ) 0.

CSE 103 Homework 8: Solutions November 30, var(x) = np(1 p) = P r( X ) 0.95 P r( X ) 0. () () a. X is a binomial distribution with n = 000, p = /6 b. The expected value, variance, and standard deviation of X is: E(X) = np = 000 = 000 6 var(x) = np( p) = 000 5 6 666 stdev(x) = np( p) = 000

More information

Introduction and Overview STAT 421, SP Course Instructor

Introduction and Overview STAT 421, SP Course Instructor Introduction and Overview STAT 421, SP 212 Prof. Prem K. Goel Mon, Wed, Fri 3:3PM 4:48PM Postle Hall 118 Course Instructor Prof. Goel, Prem E mail: goel.1@osu.edu Office: CH 24C (Cockins Hall) Phone: 614

More information

LECTURE 12 CONFIDENCE INTERVAL AND HYPOTHESIS TESTING

LECTURE 12 CONFIDENCE INTERVAL AND HYPOTHESIS TESTING LECTURE 1 CONFIDENCE INTERVAL AND HYPOTHESIS TESTING INTERVAL ESTIMATION Point estimation of : The inference is a guess of a single value as the value of. No accuracy associated with it. Interval estimation

More information

GEOMETRIC -discrete A discrete random variable R counts number of times needed before an event occurs

GEOMETRIC -discrete A discrete random variable R counts number of times needed before an event occurs STATISTICS 4 Summary Notes. Geometric and Exponential Distributions GEOMETRIC -discrete A discrete random variable R counts number of times needed before an event occurs P(X = x) = ( p) x p x =,, 3,...

More information

Statistics for IT Managers

Statistics for IT Managers Statistics for IT Managers 95-796, Fall 2012 Module 2: Hypothesis Testing and Statistical Inference (5 lectures) Reading: Statistics for Business and Economics, Ch. 5-7 Confidence intervals Given the sample

More information

Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institution of Technology, Kharagpur

Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institution of Technology, Kharagpur Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institution of Technology, Kharagpur Lecture No. # 36 Sampling Distribution and Parameter Estimation

More information

CS 160: Lecture 16. Quantitative Studies. Outline. Random variables and trials. Random variables. Qualitative vs. Quantitative Studies

CS 160: Lecture 16. Quantitative Studies. Outline. Random variables and trials. Random variables. Qualitative vs. Quantitative Studies Qualitative vs. Quantitative Studies CS 160: Lecture 16 Professor John Canny Qualitative: What we ve been doing so far: * Contextual Inquiry: trying to understand user s tasks and their conceptual model.

More information

Multiple Regression Analysis

Multiple Regression Analysis Multiple Regression Analysis y = β 0 + β 1 x 1 + β 2 x 2 +... β k x k + u 2. Inference 0 Assumptions of the Classical Linear Model (CLM)! So far, we know: 1. The mean and variance of the OLS estimators

More information

Data Mining. CS57300 Purdue University. March 22, 2018

Data Mining. CS57300 Purdue University. March 22, 2018 Data Mining CS57300 Purdue University March 22, 2018 1 Hypothesis Testing Select 50% users to see headline A Unlimited Clean Energy: Cold Fusion has Arrived Select 50% users to see headline B Wedding War

More information

Example 1: Dear Abby. Stat Camp for the MBA Program

Example 1: Dear Abby. Stat Camp for the MBA Program Stat Camp for the MBA Program Daniel Solow Lecture 4 The Normal Distribution and the Central Limit Theorem 187 Example 1: Dear Abby You wrote that a woman is pregnant for 266 days. Who said so? I carried

More information

Bayesian modelling of football outcomes. 1 Introduction. Synopsis. (using Skellam s Distribution)

Bayesian modelling of football outcomes. 1 Introduction. Synopsis. (using Skellam s Distribution) Karlis & Ntzoufras: Bayesian modelling of football outcomes 3 Bayesian modelling of football outcomes (using Skellam s Distribution) Dimitris Karlis & Ioannis Ntzoufras e-mails: {karlis, ntzoufras}@aueb.gr

More information

Statistical Inference

Statistical Inference Statistical Inference Bernhard Klingenberg Institute of Statistics Graz University of Technology Steyrergasse 17/IV, 8010 Graz www.statistics.tugraz.at February 12, 2008 Outline Estimation: Review of concepts

More information

Samples and Populations Confidence Intervals Hypotheses One-sided vs. two-sided Statistical Significance Error Types. Statistiek I.

Samples and Populations Confidence Intervals Hypotheses One-sided vs. two-sided Statistical Significance Error Types. Statistiek I. Statistiek I Sampling John Nerbonne CLCG, Rijksuniversiteit Groningen http://www.let.rug.nl/nerbonne/teach/statistiek-i/ John Nerbonne 1/41 Overview 1 Samples and Populations 2 Confidence Intervals 3 Hypotheses

More information

Last few slides from last time

Last few slides from last time Last few slides from last time Example 3: What is the probability that p will fall in a certain range, given p? Flip a coin 50 times. If the coin is fair (p=0.5), what is the probability of getting an

More information

Business Statistics 41000: Homework # 5

Business Statistics 41000: Homework # 5 Business Statistics 41000: Homework # 5 Drew Creal Due date: Beginning of class in week # 10 Remarks: These questions cover Lectures #7, 8, and 9. Question # 1. Condence intervals and plug-in predictive

More information

Two sample Hypothesis tests in R.

Two sample Hypothesis tests in R. Example. (Dependent samples) Two sample Hypothesis tests in R. A Calculus professor gives their students a 10 question algebra pretest on the first day of class, and a similar test towards the end of the

More information

Uncertainty. Michael Peters December 27, 2013

Uncertainty. Michael Peters December 27, 2013 Uncertainty Michael Peters December 27, 20 Lotteries In many problems in economics, people are forced to make decisions without knowing exactly what the consequences will be. For example, when you buy

More information

Confidence Intervals with σ unknown

Confidence Intervals with σ unknown STAT 141 Confidence Intervals and Hypothesis Testing 10/26/04 Today (Chapter 7): CI with σ unknown, t-distribution CI for proportions Two sample CI with σ known or unknown Hypothesis Testing, z-test Confidence

More information

Robustness and Distribution Assumptions

Robustness and Distribution Assumptions Chapter 1 Robustness and Distribution Assumptions 1.1 Introduction In statistics, one often works with model assumptions, i.e., one assumes that data follow a certain model. Then one makes use of methodology

More information

Swarthmore Honors Exam 2012: Statistics

Swarthmore Honors Exam 2012: Statistics Swarthmore Honors Exam 2012: Statistics 1 Swarthmore Honors Exam 2012: Statistics John W. Emerson, Yale University NAME: Instructions: This is a closed-book three-hour exam having six questions. You may

More information

Gov 2000: 6. Hypothesis Testing

Gov 2000: 6. Hypothesis Testing Gov 2000: 6. Hypothesis Testing Matthew Blackwell October 11, 2016 1 / 55 1. Hypothesis Testing Examples 2. Hypothesis Test Nomenclature 3. Conducting Hypothesis Tests 4. p-values 5. Power Analyses 6.

More information

LC OL - Statistics. Types of Data

LC OL - Statistics. Types of Data LC OL - Statistics Types of Data Question 1 Characterise each of the following variables as numerical or categorical. In each case, list any three possible values for the variable. (i) Eye colours in a

More information

1. If X has density. cx 3 e x ), 0 x < 0, otherwise. Find the value of c that makes f a probability density. f(x) =

1. If X has density. cx 3 e x ), 0 x < 0, otherwise. Find the value of c that makes f a probability density. f(x) = 1. If X has density f(x) = { cx 3 e x ), 0 x < 0, otherwise. Find the value of c that makes f a probability density. 2. Let X have density f(x) = { xe x, 0 < x < 0, otherwise. (a) Find P (X > 2). (b) Find

More information

You may use your calculator and a single page of notes. The room is crowded. Please be careful to look only at your own exam.

You may use your calculator and a single page of notes. The room is crowded. Please be careful to look only at your own exam. LAST NAME (Please Print): KEY FIRST NAME (Please Print): HONOR PLEDGE (Please Sign): Statistics 111 Midterm 1 This is a closed book exam. You may use your calculator and a single page of notes. The room

More information

AMS 132: Discussion Section 2

AMS 132: Discussion Section 2 Prof. David Draper Department of Applied Mathematics and Statistics University of California, Santa Cruz AMS 132: Discussion Section 2 All computer operations in this course will be described for the Windows

More information

One-sample categorical data: approximate inference

One-sample categorical data: approximate inference One-sample categorical data: approximate inference Patrick Breheny October 6 Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/25 Introduction It is relatively easy to think about the distribution

More information

Common Discrete Distributions

Common Discrete Distributions Common Discrete Distributions Statistics 104 Autumn 2004 Taken from Statistics 110 Lecture Notes Copyright c 2004 by Mark E. Irwin Common Discrete Distributions There are a wide range of popular discrete

More information

Content by Week Week of October 14 27

Content by Week Week of October 14 27 Content by Week Week of October 14 27 Learning objectives By the end of this week, you should be able to: Understand the purpose and interpretation of confidence intervals for the mean, Calculate confidence

More information

Homework 1 Solutions

Homework 1 Solutions Homework 1 Solutions January 18, 2012 Contents 1 Normal Probability Calculations 2 2 Stereo System (SLR) 2 3 Match Histograms 3 4 Match Scatter Plots 4 5 Housing (SLR) 4 6 Shock Absorber (SLR) 5 7 Participation

More information

Section 4.6 Simple Linear Regression

Section 4.6 Simple Linear Regression Section 4.6 Simple Linear Regression Objectives ˆ Basic philosophy of SLR and the regression assumptions ˆ Point & interval estimation of the model parameters, and how to make predictions ˆ Point and interval

More information

Chapter 18. Sampling Distribution Models. Copyright 2010, 2007, 2004 Pearson Education, Inc.

Chapter 18. Sampling Distribution Models. Copyright 2010, 2007, 2004 Pearson Education, Inc. Chapter 18 Sampling Distribution Models Copyright 2010, 2007, 2004 Pearson Education, Inc. Normal Model When we talk about one data value and the Normal model we used the notation: N(μ, σ) Copyright 2010,

More information

An introduction to biostatistics: part 1

An introduction to biostatistics: part 1 An introduction to biostatistics: part 1 Cavan Reilly September 6, 2017 Table of contents Introduction to data analysis Uncertainty Probability Conditional probability Random variables Discrete random

More information

18.05 Practice Final Exam

18.05 Practice Final Exam No calculators. 18.05 Practice Final Exam Number of problems 16 concept questions, 16 problems. Simplifying expressions Unless asked to explicitly, you don t need to simplify complicated expressions. For

More information

Chapter 23: Inferences About Means

Chapter 23: Inferences About Means Chapter 3: Inferences About Means Sample of Means: number of observations in one sample the population mean (theoretical mean) sample mean (observed mean) is the theoretical standard deviation of the population

More information

Chapter 7. Inference for Distributions. Introduction to the Practice of STATISTICS SEVENTH. Moore / McCabe / Craig. Lecture Presentation Slides

Chapter 7. Inference for Distributions. Introduction to the Practice of STATISTICS SEVENTH. Moore / McCabe / Craig. Lecture Presentation Slides Chapter 7 Inference for Distributions Introduction to the Practice of STATISTICS SEVENTH EDITION Moore / McCabe / Craig Lecture Presentation Slides Chapter 7 Inference for Distributions 7.1 Inference for

More information

Statistical Distribution Assumptions of General Linear Models

Statistical Distribution Assumptions of General Linear Models Statistical Distribution Assumptions of General Linear Models Applied Multilevel Models for Cross Sectional Data Lecture 4 ICPSR Summer Workshop University of Colorado Boulder Lecture 4: Statistical Distributions

More information

1 Hypothesis testing for a single mean

1 Hypothesis testing for a single mean This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this

More information

Chapter 3 Multiple Regression Complete Example

Chapter 3 Multiple Regression Complete Example Department of Quantitative Methods & Information Systems ECON 504 Chapter 3 Multiple Regression Complete Example Spring 2013 Dr. Mohammad Zainal Review Goals After completing this lecture, you should be

More information

Stats + Homework 2 Review. CS100 TAs

Stats + Homework 2 Review. CS100 TAs Stats + Homework 2 Review CS100 TAs What s on Homework 2? Confidence/Confidence intervals (mean, proportion, difference of each [all Z based]) CLT, LOLN Some hypothesis testing (p-values) Statistical significance

More information

23. MORE HYPOTHESIS TESTING

23. MORE HYPOTHESIS TESTING 23. MORE HYPOTHESIS TESTING The Logic Behind Hypothesis Testing For simplicity, consider testing H 0 : µ = µ 0 against the two-sided alternative H A : µ µ 0. Even if H 0 is true (so that the expectation

More information

Carolyn Anderson & YoungShil Paek (Slide contributors: Shuai Wang, Yi Zheng, Michael Culbertson, & Haiyan Li)

Carolyn Anderson & YoungShil Paek (Slide contributors: Shuai Wang, Yi Zheng, Michael Culbertson, & Haiyan Li) Carolyn Anderson & YoungShil Paek (Slide contributors: Shuai Wang, Yi Zheng, Michael Culbertson, & Haiyan Li) Department of Educational Psychology University of Illinois at Urbana-Champaign 1 Inferential

More information

Math 124: Modules Overall Goal. Point Estimations. Interval Estimation. Math 124: Modules Overall Goal.

Math 124: Modules Overall Goal. Point Estimations. Interval Estimation. Math 124: Modules Overall Goal. What we will do today s David Meredith Department of Mathematics San Francisco State University October 22, 2009 s 1 2 s 3 What is a? Decision support Political decisions s s Goal of statistics: optimize

More information

Quantitative Introduction ro Risk and Uncertainty in Business Module 5: Hypothesis Testing

Quantitative Introduction ro Risk and Uncertainty in Business Module 5: Hypothesis Testing Quantitative Introduction ro Risk and Uncertainty in Business Module 5: Hypothesis Testing M. Vidyasagar Cecil & Ida Green Chair The University of Texas at Dallas Email: M.Vidyasagar@utdallas.edu October

More information

F79SM STATISTICAL METHODS

F79SM STATISTICAL METHODS F79SM STATISTICAL METHODS SUMMARY NOTES 9 Hypothesis testing 9.1 Introduction As before we have a random sample x of size n of a population r.v. X with pdf/pf f(x;θ). The distribution we assign to X is

More information

Interval estimation. October 3, Basic ideas CLT and CI CI for a population mean CI for a population proportion CI for a Normal mean

Interval estimation. October 3, Basic ideas CLT and CI CI for a population mean CI for a population proportion CI for a Normal mean Interval estimation October 3, 2018 STAT 151 Class 7 Slide 1 Pandemic data Treatment outcome, X, from n = 100 patients in a pandemic: 1 = recovered and 0 = not recovered 1 1 1 0 0 0 1 1 1 0 0 1 0 1 0 0

More information

This gives us an upper and lower bound that capture our population mean.

This gives us an upper and lower bound that capture our population mean. Confidence Intervals Critical Values Practice Problems 1 Estimation 1.1 Confidence Intervals Definition 1.1 Margin of error. The margin of error of a distribution is the amount of error we predict when

More information

18.05 Final Exam. Good luck! Name. No calculators. Number of problems 16 concept questions, 16 problems, 21 pages

18.05 Final Exam. Good luck! Name. No calculators. Number of problems 16 concept questions, 16 problems, 21 pages Name No calculators. 18.05 Final Exam Number of problems 16 concept questions, 16 problems, 21 pages Extra paper If you need more space we will provide some blank paper. Indicate clearly that your solution

More information

Introduction to Statistics

Introduction to Statistics MTH4106 Introduction to Statistics Notes 6 Spring 2013 Testing Hypotheses about a Proportion Example Pete s Pizza Palace offers a choice of three toppings. Pete has noticed that rather few customers ask

More information

Sociology 6Z03 Review II

Sociology 6Z03 Review II Sociology 6Z03 Review II John Fox McMaster University Fall 2016 John Fox (McMaster University) Sociology 6Z03 Review II Fall 2016 1 / 35 Outline: Review II Probability Part I Sampling Distributions Probability

More information

PASS Sample Size Software. Poisson Regression

PASS Sample Size Software. Poisson Regression Chapter 870 Introduction Poisson regression is used when the dependent variable is a count. Following the results of Signorini (99), this procedure calculates power and sample size for testing the hypothesis

More information

Week 6, 9/24/12-9/28/12, Notes: Bernoulli, Binomial, Hypergeometric, and Poisson Random Variables

Week 6, 9/24/12-9/28/12, Notes: Bernoulli, Binomial, Hypergeometric, and Poisson Random Variables Week 6, 9/24/12-9/28/12, Notes: Bernoulli, Binomial, Hypergeometric, and Poisson Random Variables 1 Monday 9/24/12 on Bernoulli and Binomial R.V.s We are now discussing discrete random variables that have

More information

Probability and Discrete Distributions

Probability and Discrete Distributions AMS 7L LAB #3 Fall, 2007 Objectives: Probability and Discrete Distributions 1. To explore relative frequency and the Law of Large Numbers 2. To practice the basic rules of probability 3. To work with the

More information

Lecture 18: Central Limit Theorem. Lisa Yan August 6, 2018

Lecture 18: Central Limit Theorem. Lisa Yan August 6, 2018 Lecture 18: Central Limit Theorem Lisa Yan August 6, 2018 Announcements PS5 due today Pain poll PS6 out today Due next Monday 8/13 (1:30pm) (will not be accepted after Wed 8/15) Programming part: Java,

More information

Probabilities & Statistics Revision

Probabilities & Statistics Revision Probabilities & Statistics Revision Christopher Ting Christopher Ting http://www.mysmu.edu/faculty/christophert/ : christopherting@smu.edu.sg : 6828 0364 : LKCSB 5036 January 6, 2017 Christopher Ting QF

More information

Business Statistics. Lecture 5: Confidence Intervals

Business Statistics. Lecture 5: Confidence Intervals Business Statistics Lecture 5: Confidence Intervals Goals for this Lecture Confidence intervals The t distribution 2 Welcome to Interval Estimation! Moments Mean 815.0340 Std Dev 0.8923 Std Error Mean

More information

Section 3: Simple Linear Regression

Section 3: Simple Linear Regression Section 3: Simple Linear Regression Carlos M. Carvalho The University of Texas at Austin McCombs School of Business http://faculty.mccombs.utexas.edu/carlos.carvalho/teaching/ 1 Regression: General Introduction

More information

EXAM 4. Practice #1. There are 110 possible points on this exam. The test is out of 100.

EXAM 4. Practice #1. There are 110 possible points on this exam. The test is out of 100. Name: BSAD 210 Montgomery College David Youngberg EXAM 4 Practice #1 There are 110 possible points on this exam. The test is out of 100. You have two hours to complete this exam, but you should be able

More information

Chapter 9. Hypothesis testing. 9.1 Introduction

Chapter 9. Hypothesis testing. 9.1 Introduction Chapter 9 Hypothesis testing 9.1 Introduction Confidence intervals are one of the two most common types of statistical inference. Use them when our goal is to estimate a population parameter. The second

More information

Applied Regression Analysis. Section 2: Multiple Linear Regression

Applied Regression Analysis. Section 2: Multiple Linear Regression Applied Regression Analysis Section 2: Multiple Linear Regression 1 The Multiple Regression Model Many problems involve more than one independent variable or factor which affects the dependent or response

More information

Null Hypothesis Significance Testing p-values, significance level, power, t-tests Spring 2017

Null Hypothesis Significance Testing p-values, significance level, power, t-tests Spring 2017 Null Hypothesis Significance Testing p-values, significance level, power, t-tests 18.05 Spring 2017 Understand this figure f(x H 0 ) x reject H 0 don t reject H 0 reject H 0 x = test statistic f (x H 0

More information