Probability. Hosung Sohn

Size: px
Start display at page:

Download "Probability. Hosung Sohn"

Transcription

1 Probability Hosung Sohn Department of Public Administration and International Affairs Maxwell School of Citizenship and Public Affairs Syracuse University Lecture Slide 4-3 (October 8, 2015) 1/ 43

2 Table of Contents 1 Means and Variances of Random Variables 2/ 43

3 Announcement Revised Lecture Note 4. Lecture Note 5 will be posted by weekend. Please submit midterm evaluation forms you received via . Problem Set 2 will be posted on Friday (October 9, 2015): = Due on October 20, 2015! A mistake in the syllabus: = The deadline for Problem Set 4 is December 8, 2015, not December 1, / 43

4 Review of Previous Lecture Random variable: = A variable X is a random variable if the value that X takes at the conclusion of an experiment is a chance or random occurrence that cannot be predicted with certainty in advance. Two types of random variables: 1. Discrete random variables. 2. Continuous random variables. 4/ 43

5 Review of Previous Lecture Discrete random variables: = A random variable X is discrete if X can take only a finite number of different values. Discrete probability distributions: = A discrete probability distribution is a table, graph, or rule that associates a probability P (X = x i ) with each possible value x i that the discrete random variable X can take. 5/ 43

6 Review of Previous Lecture Discrete probability distribution in tables: Value of X x 1 x 2 x 3 x 4 Probability P (X = x 1) P (X = x 2) P (X = x 3) P (X = x 4) Discrete probability distributions in figures: 6/ 43

7 Review of Previous Lecture Continuous random variables: = A random variable X is continuous if X can take all the values in some interval. Continuous probability distributions: = A continuous probability distribution of a continuous random variable X is described by a density curve. The probability of any event is the area under the density curve and above the values of X that make up the event. 7/ 43

8 Review of Previous Lecture If X is a continuous random variable, P (X x i ) = P (X > x i ): = P (X = x i ) = 0. One example of continuous probability distributions: = Uniform distributions. Another example of continuous random variables: = Normal distributions. 8/ 43

9 Review Exercise 1 Suppose the population proportion of Internet users who say they use Twitter to post updates about themselves is 19%. Think about selecting random samples from a population in which 19% are Twitter users. 1. What is the sample space for selecting a single person? = S = {Y, N}. 2. If you select three people, what is the sample space? = S = {Y Y Y, NY Y, Y NY, Y Y N, NNY, Y NN, NY N, NNN}. 3. Define the sample space for the random variable that expresses the number of Twitter users in the sample of size 3. = X = {0, 1, 2, 3}. 4. What information is contained in the sample space for Question 2 that is not contained in the sample space for Question 3? = The sample space in Question (2) reveals which of the three people use Twitter. 9/ 43

10 Review Exercise 2 The Twitter example continued: 1. Assign probabilities for S = {Y, N}? = P (Y ) = 0.19 and P (N) = For S = {Y Y Y, NY Y, Y NY, Y Y N, NNY, Y NN, NY N, NNN}? = P (Y Y Y ) = P (Y ) P (Y ) P (Y ) = = (why?). = P (NY Y ) = P (N) P (Y ) P (Y ) = = = P (NNY ) = P (N) P (N) P (Y ) = = = P (NNN) = P (N) P (N) P (N) = = Probability distributions for the random variable X. Outcome YYY NYY YNY YYN NNY YNN NYN NNN Value of X Probability = = / 43

11 Introduction When describing data, we used tables and graphs (e.g., histograms, scatterplots etc.). Similarly, when describing random variables, we used tables and graphs. = We used tables and graphs to describe probability distributions of discrete or continuous random variables. On the other hand, we also used numerical measures to describe data (e.g., means, variance, etc.). We can also use numerical measures to describe random variables. = We can estimate the mean or the standard deviation of random variables. 11/ 43

12 The Expected Value (or Mean) of a Random Variable When talking about the mean of a random variable, we use the term expected value of a random variable. The expected value of a random variable is used as a measure of the center of the probability distribution of the random variable X. And it is denoted as E(X) or µ X. Recall that a statistic such as x can be considered as a random variable. = So we can define the expected value of x; i.e., E( x). Difference between E( x) and x? 12/ 43

13 The Expected Value (or Mean) of a Random Variable E( x) vs. x 13/ 43

14 The Expected Value (or Mean) of a Random Variable The expected value of a discrete random variable X is E(X) = µ X = k x i p i i=1 = x 1 p 1 + x 2 p x k p k where x i is a value of X and p i is the corresponding probability: = i.e.) p i = P (X = x i ). The mean is called the expected value because it denotes the average value that we would expect to occur if the experiment were repeated a large number of times. Another way to think about the expected value: = A weighted average in which each outcome (i.e., x i ) is weighted by its probability. 14/ 43

15 The Expected Value (or Mean) of a Random Variable Example: Suppose a random variable X denotes the years of education before entering the MPA program for our class. And assume that we have the following probability distribution for X: Value of X (i.e., Years of Education) Probability What is E(X)? Using the formula for the expected value, E(X) = 3 x i p i i=1 = x 1 p 1 + x 2 p 2 + x 3 p 3 = = / 43

16 The Expected Value (or Mean) of a Random Variable The above example illustrates the calculation of the expected value of a discrete random variable. How do we calculate E(X) if X is a continuous random variable. The formula for the expected value of a continuous random variable is E(X) = b a xf(x)dx, where f(x) is a probability function of a continuous variable. Intuitively, the expected value of a continuous random variable is the point at which the area under density curve would balance. 16/ 43

17 The Expected Value (or Mean) of a Random Variable 17/ 43

18 Statistical Estimation and the Law of Large Numbers Our goal in using statistical science: = Estimate the population parameter using a statistic! Suppose we want to estimate the mean height µ of the population of all American women between the ages of 18 and 24 years. To estimate µ: 1. We draw an SRS of young women. 2. Use the sample mean x to estimate µ. To reiterate, µ is a parameter and x is a statistic. 18/ 43

19 Statistical Estimation and the Law of Large Numbers Statistics such as x obtained from probability sampling designs are random variables. Why? = We don t know their values until we draw an SRS, and their values vary in repeated sampling. Thus, we can think of the sampling distributions of these statistics as the probability distributions of these random variables. 19/ 43

20 Statistical Estimation and the Law of Large Numbers We also learned that it is reasonable to use x to estimate µ. = An SRS should fairly represent the population, so x should be somewhat near µ. But we don t expect x to be exactly equal to µ. And we know that if we draw another SRS, then it would give us a different x. 20/ 43

21 Statistical Estimation and the Law of Large Numbers If x is rarely right and varies from sample to sample, why are we using this to estimate µ? One answer we learned is that it is because x is an unbiased estimator for µ. Another reason: if we keep on increasing the sample size when we draw an SRS, the statistic x is guaranteed to get as close as we wish to the parameter µ. = This fact is called the law of large numbers (LLN). LLN is very useful law because the law holds for any population, regardless of the shape or spread of the distribution of population data. 21/ 43

22 Statistical Estimation and the Law of Large Numbers Law of Large Numbers (LLN) Definition The law of large numbers (LLN) states that as the number of observations drawn increases in a single SRS, the mean x eventually approaches the mean µ of the population. 22/ 43

23 Statistical Estimation and the Law of Large Numbers Suppose that the mean of all women is 64.5 inches; i.e., µ = Figure below shows the behavior of the mean height x of n women chosen at random from a population. 23/ 43

24 Statistical Estimation and the Law of Large Numbers At first, the graph shows that the mean of the sample changes as we take more observations. Eventually, however, the mean gets close to the population mean µ = 64.5 and settles down at that value. = LLN says that this always happens. 24/ 43

25 Statistical Estimation and the Law of Large Numbers LLN is intuitively clear. Suppose our population size is 10, If you take an SRS of size 100, then x based on this 100 observations is not exactly equal to µ that is based on 10, What if you take an SRS of 9,500. Then x based on this 9,500 observations would be almost equal to µ. 3. What if you take an SRS of 9,999. Then it is almost certain that x based on this 9,999 observations is equal to µ. This is what LLN is telling us about. 25/ 43

26 Statistical Estimation and the Law of Large Numbers So LLN tells us that if we draw a large number of observations in a single SRS, then x is almost equal to µ. But we can ask a question: how large is a large number? = The answer depends on the variability of the population. If our outcome of interest in population is so variable, then we need more observations. If our outcome of interest in population is not so variable, then LLN holds even if we don t have that many observations. 26/ 43

27 Statistical Estimation and the Law of Large Numbers Suppose we would like to estimate the mean salary of all the people in the US. = The salary level is so variable, so we need quite a large number of observations to exactly estimate the population mean salary. Suppose we would like to estimate the mean number of cars that the households in the US possess. = In general, the number of cars possessed by households does not vary to a great extent (maybe around one to three). = So we don t need a large number of observations to estimate the mean number of cars. 27/ 43

28 Rules for Means Sometimes, there are instances in which we want to find out the expected value of two or more random variables. There are some rules that come in handy when we calculate the expected value of such random variables. We will study four rules. 28/ 43

29 Rules for Means Rule 1: If X is a random variable and a and b are fixed numbers (i.e., constants), then E(a + bx) = a + be(x). = We say that a + bx is a linear transformation of the random variable X. Rule 2: If X and Y are random variables, then E(X + Y ) = E(X) + E(Y ). 29/ 43

30 Rules for Means Rule 3: If X and Y are random variables, then E(X Y ) = E(X) E(Y ). Rule 4: If we combine Rule 1 and Rule 2 or 3, then we have the following rule: E(a + bx + cy ) = a + be(x) + ce(y ) or E(a bx + cy ) = a be(x) + ce(y ) 30/ 43

31 Rules for Means But!!! The rule in general doesn t hold hold for multiplication and division. That is E(XY ) E(X)E(Y ) and E(X/Y ) E(X)/E(Y ) 31/ 43

32 Rules for Means Example: Let X and Y be random variables that denote the number of courses taken in the fall and spring semester, respectively, by students at the Maxwell School. And we have the following probability distributions. Value of X Probability Value of Y Probability If we pick a student randomly, what is the expected value of the number of courses in the both semesters? 32/ 43

33 Rules for Means Value of X Probability Solution: Value of Y Probability The question asks to solve E(X + Y ). So using Rule 2 above, we know that E(X + Y ) = E(X) + E(Y ). So the expected value is E(X + Y ) = E(X) + E(Y ) = ( ) + ( ) = = / 43

34 The Variance of a Random Variable The expected value is a numerical measure of the center of a probability distribution. We need another measure; i.e., the spread or variability of the probability distribution. = We use the variance and standard deviation of a random variable. We write the variance of a random variable X as V ar(x) or σ 2 X. 34/ 43

35 The Variance of a Random Variable The variance of a discrete random variable X is the expected value of the squared deviations from the mean and is given by the formula V ar(x) = σx 2 = E [(X E(X)) 2] = [x 1 E(X)] 2 p 1 + [x 2 E(X)] 2 p [x k E(X)] 2 p k k = (x i E(X)) 2 p i. i=1 Note that the variance can also be calculated by the following formula: V ar(x) = E(X 2 ) [E(X)] 2. 35/ 43

36 The Variance of a Random Variable Let s prove the alternative formula: V ar(x) = E(X 2 ) [E(X)] 2. Proof = E [(X E(X)) 2] = E [ X 2 2XE(X) + [E(X)] 2] = E ( X 2) 2E(X)E(X) + [E(X)] 2 (by the rules of the mean) = E(X 2 ) [E(X)] 2. In some cases, especially when the mean is not an integer, it may be easier to calculate the variance by using the alternative formula rather than the original formula. 36/ 43

37 The Standard Deviation of a Random Variable The standard deviation of a discrete random variable X is given by the formula SD(X) = σ X = V ar(x) Question: can you tell the difference between V ar(x) and s 2? Answer: 1. V ar(x) indicates the variability that arises from repeated sampling. 2. s 2 indicates the variability in the values among observations in a single sample. 37/ 43

38 The Standard Deviation of a Random Variable Example: Find the variance and the standard deviation of the random variable X that has the following probability distribution: Solutions: Value of X 0 3 Probability First, E(X) = = To calculate the variance, I will use the alternative formula: = V ar(x) = E(X 2 ) [E(X)] E(X 2 ) = = V ar(x) = E(X 2 ) [E(X)] 2 = = SD(X) = V ar(x) = / 43

39 The Rules of the Variance Some rules for the variance. Rule 1: If X is a random variable, and a and b are fixed numbers, then V ar(a + bx) = b 2 V ar(x). = Notice that the constant a disappears and b comes out as a squared term. Rule 2: If X and Y are random variables, then V ar(x + Y ) = V ar(x) + V ar(y ) + 2Cov(X, Y ) and V ar(x Y ) = V ar(x) + V ar(y ) 2Cov(X, Y ). = Notice the sign right before the covariance term. 39/ 43

40 The Rules of the Variance Rule 3: If X and Y are independent random variables, then V ar(x + Y ) = V ar(x) + V ar(y ) and V ar(x Y ) = V ar(x) + V ar(y ). = Notice that the covariance term disappears, as well as the sign (both are positive). = Rule 3 above implies that if X and Y are independent, then Cov(X, Y ) = 0 or Corr(X, Y ) = 0. 40/ 43

41 Independence Between Two Random Variables The notion of independence between two random variables is very important in statistics especially when you learn Econometrics. Suppose X is a random variable that indicates whether the students in the Maxwell School are taking PAI 721 Introduction to Statistics. And suppose that Y is a random variable that denotes whether the students are in the MPA program or not. Are the two random variables X and Y independent? 41/ 43

42 Independence Between Two Random Variables Students in the MPA program are required to take PAI 721. So if you know the value of Y for Student A, then you are more likely to know the value of X for this student. That is information you have regarding the random variable Y is helpful for determining the information of the random variable X. Hence, in this case, we say that the two random variables X and Y are not independent, or we say that the two random variables are dependent. 42/ 43

43 Independence Between Two Random Variables On the other hand, suppose X denotes the toe size of students in the Maxwell School. Knowing whether a student is in the MPA program will not help us from determining the toe size of students in the Maxwell School. So in this case, we can reasonably assume that X and Y are independent. 43/ 43

Lecture 13 (Part 2): Deviation from mean: Markov s inequality, variance and its properties, Chebyshev s inequality

Lecture 13 (Part 2): Deviation from mean: Markov s inequality, variance and its properties, Chebyshev s inequality Lecture 13 (Part 2): Deviation from mean: Markov s inequality, variance and its properties, Chebyshev s inequality Discrete Structures II (Summer 2018) Rutgers University Instructor: Abhishek Bhrushundi

More information

Lecture 22: Variance and Covariance

Lecture 22: Variance and Covariance EE5110 : Probability Foundations for Electrical Engineers July-November 2015 Lecture 22: Variance and Covariance Lecturer: Dr. Krishna Jagannathan Scribes: R.Ravi Kiran In this lecture we will introduce

More information

Properties of Summation Operator

Properties of Summation Operator Econ 325 Section 003/004 Notes on Variance, Covariance, and Summation Operator By Hiro Kasahara Properties of Summation Operator For a sequence of the values {x 1, x 2,..., x n, we write the sum of x 1,

More information

Lecture 2: Review of Probability

Lecture 2: Review of Probability Lecture 2: Review of Probability Zheng Tian Contents 1 Random Variables and Probability Distributions 2 1.1 Defining probabilities and random variables..................... 2 1.2 Probability distributions................................

More information

Math 510 midterm 3 answers

Math 510 midterm 3 answers Math 51 midterm 3 answers Problem 1 (1 pts) Suppose X and Y are independent exponential random variables both with parameter λ 1. Find the probability that Y < 7X. P (Y < 7X) 7x 7x f(x, y) dy dx e x e

More information

Joint Probability Distributions and Random Samples (Devore Chapter Five)

Joint Probability Distributions and Random Samples (Devore Chapter Five) Joint Probability Distributions and Random Samples (Devore Chapter Five) 1016-345-01: Probability and Statistics for Engineers Spring 2013 Contents 1 Joint Probability Distributions 2 1.1 Two Discrete

More information

Chapter 15. Probability Rules! Copyright 2012, 2008, 2005 Pearson Education, Inc.

Chapter 15. Probability Rules! Copyright 2012, 2008, 2005 Pearson Education, Inc. Chapter 15 Probability Rules! Copyright 2012, 2008, 2005 Pearson Education, Inc. The General Addition Rule When two events A and B are disjoint, we can use the addition rule for disjoint events from Chapter

More information

Examples of frequentist probability include games of chance, sample surveys, and randomized experiments. We will focus on frequentist probability sinc

Examples of frequentist probability include games of chance, sample surveys, and randomized experiments. We will focus on frequentist probability sinc FPPA-Chapters 13,14 and parts of 16,17, and 18 STATISTICS 50 Richard A. Berk Spring, 1997 May 30, 1997 1 Thinking about Chance People talk about \chance" and \probability" all the time. There are many

More information

Regression and correlation. Correlation & Regression, I. Regression & correlation. Regression vs. correlation. Involve bivariate, paired data, X & Y

Regression and correlation. Correlation & Regression, I. Regression & correlation. Regression vs. correlation. Involve bivariate, paired data, X & Y Regression and correlation Correlation & Regression, I 9.07 4/1/004 Involve bivariate, paired data, X & Y Height & weight measured for the same individual IQ & exam scores for each individual Height of

More information

Lecture 27. DATA 8 Spring Sample Averages. Slides created by John DeNero and Ani Adhikari

Lecture 27. DATA 8 Spring Sample Averages. Slides created by John DeNero and Ani Adhikari DATA 8 Spring 2018 Lecture 27 Sample Averages Slides created by John DeNero (denero@berkeley.edu) and Ani Adhikari (adhikari@berkeley.edu) Announcements Questions for This Week How can we quantify natural

More information

Review of probability and statistics 1 / 31

Review of probability and statistics 1 / 31 Review of probability and statistics 1 / 31 2 / 31 Why? This chapter follows Stock and Watson (all graphs are from Stock and Watson). You may as well refer to the appendix in Wooldridge or any other introduction

More information

Vectors and Matrices Statistics with Vectors and Matrices

Vectors and Matrices Statistics with Vectors and Matrices Vectors and Matrices Statistics with Vectors and Matrices Lecture 3 September 7, 005 Analysis Lecture #3-9/7/005 Slide 1 of 55 Today s Lecture Vectors and Matrices (Supplement A - augmented with SAS proc

More information

Random Variables and Expectations

Random Variables and Expectations Inside ECOOMICS Random Variables Introduction to Econometrics Random Variables and Expectations A random variable has an outcome that is determined by an experiment and takes on a numerical value. A procedure

More information

STOR Lecture 16. Properties of Expectation - I

STOR Lecture 16. Properties of Expectation - I STOR 435.001 Lecture 16 Properties of Expectation - I Jan Hannig UNC Chapel Hill 1 / 22 Motivation Recall we found joint distributions to be pretty complicated objects. Need various tools from combinatorics

More information

PROBABILITY THEORY REVIEW

PROBABILITY THEORY REVIEW PROBABILITY THEORY REVIEW CMPUT 466/551 Martha White Fall, 2017 REMINDERS Assignment 1 is due on September 28 Thought questions 1 are due on September 21 Chapters 1-4, about 40 pages If you are printing,

More information

1 Correlation between an independent variable and the error

1 Correlation between an independent variable and the error Chapter 7 outline, Econometrics Instrumental variables and model estimation 1 Correlation between an independent variable and the error Recall that one of the assumptions that we make when proving the

More information

Section 7.2 Homework Answers

Section 7.2 Homework Answers 25.5 30 Sample Mean P 0.1226 sum n b. The two z-scores are z 25 20(1.7) n 1.0 20 sum n 2.012 and z 30 20(1.7) n 1.0 0.894, 20 so the probability is approximately 0.1635 (0.1645 using Table A). P14. a.

More information

Problem Set #5. Econ 103. Solution: By the complement rule p(0) = 1 p q. q, 1 x 0 < 0 1 p, 0 x 0 < 1. Solution: E[X] = 1 q + 0 (1 p q) + p 1 = p q

Problem Set #5. Econ 103. Solution: By the complement rule p(0) = 1 p q. q, 1 x 0 < 0 1 p, 0 x 0 < 1. Solution: E[X] = 1 q + 0 (1 p q) + p 1 = p q Problem Set #5 Econ 103 Part I Problems from the Textbook Chapter 4: 1, 3, 5, 7, 9, 11, 13, 15, 25, 27, 29 Chapter 5: 1, 3, 5, 9, 11, 13, 17 Part II Additional Problems 1. Suppose X is a random variable

More information

Probability. Paul Schrimpf. January 23, Definitions 2. 2 Properties 3

Probability. Paul Schrimpf. January 23, Definitions 2. 2 Properties 3 Probability Paul Schrimpf January 23, 2018 Contents 1 Definitions 2 2 Properties 3 3 Random variables 4 3.1 Discrete........................................... 4 3.2 Continuous.........................................

More information

Continuous Probability Distributions

Continuous Probability Distributions 1 Chapter 5 Continuous Probability Distributions 5.1 Probability density function Example 5.1.1. Revisit Example 3.1.1. 11 12 13 14 15 16 21 22 23 24 25 26 S = 31 32 33 34 35 36 41 42 43 44 45 46 (5.1.1)

More information

STAT 414: Introduction to Probability Theory

STAT 414: Introduction to Probability Theory STAT 414: Introduction to Probability Theory Spring 2016; Homework Assignments Latest updated on April 29, 2016 HW1 (Due on Jan. 21) Chapter 1 Problems 1, 8, 9, 10, 11, 18, 19, 26, 28, 30 Theoretical Exercises

More information

An Introduction to Parameter Estimation

An Introduction to Parameter Estimation Introduction Introduction to Econometrics An Introduction to Parameter Estimation This document combines several important econometric foundations and corresponds to other documents such as the Introduction

More information

SUMMARY OF PROBABILITY CONCEPTS SO FAR (SUPPLEMENT FOR MA416)

SUMMARY OF PROBABILITY CONCEPTS SO FAR (SUPPLEMENT FOR MA416) SUMMARY OF PROBABILITY CONCEPTS SO FAR (SUPPLEMENT FOR MA416) D. ARAPURA This is a summary of the essential material covered so far. The final will be cumulative. I ve also included some review problems

More information

ACMS Statistics for Life Sciences. Chapter 13: Sampling Distributions

ACMS Statistics for Life Sciences. Chapter 13: Sampling Distributions ACMS 20340 Statistics for Life Sciences Chapter 13: Sampling Distributions Sampling We use information from a sample to infer something about a population. When using random samples and randomized experiments,

More information

Econ 371 Problem Set #1 Answer Sheet

Econ 371 Problem Set #1 Answer Sheet Econ 371 Problem Set #1 Answer Sheet 2.1 In this question, you are asked to consider the random variable Y, which denotes the number of heads that occur when two coins are tossed. a. The first part of

More information

Lecture 10. Variance and standard deviation

Lecture 10. Variance and standard deviation 18.440: Lecture 10 Variance and standard deviation Scott Sheffield MIT 1 Outline Defining variance Examples Properties Decomposition trick 2 Outline Defining variance Examples Properties Decomposition

More information

STAT 111 Recitation 7

STAT 111 Recitation 7 STAT 111 Recitation 7 Xin Lu Tan xtan@wharton.upenn.edu October 25, 2013 1 / 13 Miscellaneous Please turn in homework 6. Please pick up homework 7 and the graded homework 5. Please check your grade and

More information

Week 2: Review of probability and statistics

Week 2: Review of probability and statistics Week 2: Review of probability and statistics Marcelo Coca Perraillon University of Colorado Anschutz Medical Campus Health Services Research Methods I HSMP 7607 2017 c 2017 PERRAILLON ALL RIGHTS RESERVED

More information

Probability Theory and Statistics. Peter Jochumzen

Probability Theory and Statistics. Peter Jochumzen Probability Theory and Statistics Peter Jochumzen April 18, 2016 Contents 1 Probability Theory And Statistics 3 1.1 Experiment, Outcome and Event................................ 3 1.2 Probability............................................

More information

ECON Fundamentals of Probability

ECON Fundamentals of Probability ECON 351 - Fundamentals of Probability Maggie Jones 1 / 32 Random Variables A random variable is one that takes on numerical values, i.e. numerical summary of a random outcome e.g., prices, total GDP,

More information

Bivariate Distributions. Discrete Bivariate Distribution Example

Bivariate Distributions. Discrete Bivariate Distribution Example Spring 7 Geog C: Phaedon C. Kyriakidis Bivariate Distributions Definition: class of multivariate probability distributions describing joint variation of outcomes of two random variables (discrete or continuous),

More information

EC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix)

EC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix) 1 EC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix) Taisuke Otsu London School of Economics Summer 2018 A.1. Summation operator (Wooldridge, App. A.1) 2 3 Summation operator For

More information

Lecture 16 : Independence, Covariance and Correlation of Discrete Random Variables

Lecture 16 : Independence, Covariance and Correlation of Discrete Random Variables Lecture 6 : Independence, Covariance and Correlation of Discrete Random Variables 0/ 3 Definition Two discrete random variables X and Y defined on the same sample space are said to be independent if for

More information

Fourier and Stats / Astro Stats and Measurement : Stats Notes

Fourier and Stats / Astro Stats and Measurement : Stats Notes Fourier and Stats / Astro Stats and Measurement : Stats Notes Andy Lawrence, University of Edinburgh Autumn 2013 1 Probabilities, distributions, and errors Laplace once said Probability theory is nothing

More information

Chapter 18. Sampling Distribution Models /51

Chapter 18. Sampling Distribution Models /51 Chapter 18 Sampling Distribution Models 1 /51 Homework p432 2, 4, 6, 8, 10, 16, 17, 20, 30, 36, 41 2 /51 3 /51 Objective Students calculate values of central 4 /51 The Central Limit Theorem for Sample

More information

LIST OF FORMULAS FOR STK1100 AND STK1110

LIST OF FORMULAS FOR STK1100 AND STK1110 LIST OF FORMULAS FOR STK1100 AND STK1110 (Version of 11. November 2015) 1. Probability Let A, B, A 1, A 2,..., B 1, B 2,... be events, that is, subsets of a sample space Ω. a) Axioms: A probability function

More information

Class 8 Review Problems solutions, 18.05, Spring 2014

Class 8 Review Problems solutions, 18.05, Spring 2014 Class 8 Review Problems solutions, 8.5, Spring 4 Counting and Probability. (a) Create an arrangement in stages and count the number of possibilities at each stage: ( ) Stage : Choose three of the slots

More information

AP Statistics Ch 7 Random Variables

AP Statistics Ch 7 Random Variables Ch 7.1 Discrete and Continuous Random Variables Introduction A random variable is a variable whose value is a numerical outcome of a random phenomenon. If an experiment or sample survey is repeated, different

More information

Math/Stat 3850 Exam 1

Math/Stat 3850 Exam 1 2/21/18 Name: Math/Stat 3850 Exam 1 There are 10 questions, worth a total of 100 points. You may use R, your calculator, and any written or internet resources on this test, although you are not allowed

More information

CSE 21 Practice Exam for Midterm 2 Fall 2017

CSE 21 Practice Exam for Midterm 2 Fall 2017 CSE 1 Practice Exam for Midterm Fall 017 These practice problems should help prepare you for the second midterm, which is on monday, November 11 This is longer than the actual exam will be, but good practice

More information

Problem Set 0 Solutions

Problem Set 0 Solutions CS446: Machine Learning Spring 2017 Problem Set 0 Solutions Handed Out: January 25 th, 2017 Handed In: NONE 1. [Probability] Assume that the probability of obtaining heads when tossing a coin is λ. a.

More information

1. Regressions and Regression Models. 2. Model Example. EEP/IAS Introductory Applied Econometrics Fall Erin Kelley Section Handout 1

1. Regressions and Regression Models. 2. Model Example. EEP/IAS Introductory Applied Econometrics Fall Erin Kelley Section Handout 1 1. Regressions and Regression Models Simply put, economists use regression models to study the relationship between two variables. If Y and X are two variables, representing some population, we are interested

More information

f X, Y (x, y)dx (x), where f(x,y) is the joint pdf of X and Y. (x) dx

f X, Y (x, y)dx (x), where f(x,y) is the joint pdf of X and Y. (x) dx INDEPENDENCE, COVARIANCE AND CORRELATION Independence: Intuitive idea of "Y is independent of X": The distribution of Y doesn't depend on the value of X. In terms of the conditional pdf's: "f(y x doesn't

More information

MS&E 226: Small Data

MS&E 226: Small Data MS&E 226: Small Data Lecture 6: Bias and variance (v5) Ramesh Johari ramesh.johari@stanford.edu 1 / 49 Our plan today We saw in last lecture that model scoring methods seem to be trading off two different

More information

HW5 Solutions. (a) (8 pts.) Show that if two random variables X and Y are independent, then E[XY ] = E[X]E[Y ] xy p X,Y (x, y)

HW5 Solutions. (a) (8 pts.) Show that if two random variables X and Y are independent, then E[XY ] = E[X]E[Y ] xy p X,Y (x, y) HW5 Solutions 1. (50 pts.) Random homeworks again (a) (8 pts.) Show that if two random variables X and Y are independent, then E[XY ] = E[X]E[Y ] Answer: Applying the definition of expectation we have

More information

Notes on Random Processes

Notes on Random Processes otes on Random Processes Brian Borchers and Rick Aster October 27, 2008 A Brief Review of Probability In this section of the course, we will work with random variables which are denoted by capital letters,

More information

STAT 418: Probability and Stochastic Processes

STAT 418: Probability and Stochastic Processes STAT 418: Probability and Stochastic Processes Spring 2016; Homework Assignments Latest updated on April 29, 2016 HW1 (Due on Jan. 21) Chapter 1 Problems 1, 8, 9, 10, 11, 18, 19, 26, 28, 30 Theoretical

More information

Statistics 511 Additional Materials

Statistics 511 Additional Materials Sampling Distributions and Central Limit Theorem In previous topics we have discussed taking a single observation from a distribution. More accurately, we looked at the probability of a single variable

More information

PageRank: The Math-y Version (Or, What To Do When You Can t Tear Up Little Pieces of Paper)

PageRank: The Math-y Version (Or, What To Do When You Can t Tear Up Little Pieces of Paper) PageRank: The Math-y Version (Or, What To Do When You Can t Tear Up Little Pieces of Paper) In class, we saw this graph, with each node representing people who are following each other on Twitter: Our

More information

Week 12-13: Discrete Probability

Week 12-13: Discrete Probability Week 12-13: Discrete Probability November 21, 2018 1 Probability Space There are many problems about chances or possibilities, called probability in mathematics. When we roll two dice there are possible

More information

Basic Probability. Introduction

Basic Probability. Introduction Basic Probability Introduction The world is an uncertain place. Making predictions about something as seemingly mundane as tomorrow s weather, for example, is actually quite a difficult task. Even with

More information

Statistic: a that can be from a sample without making use of any unknown. In practice we will use to establish unknown parameters.

Statistic: a that can be from a sample without making use of any unknown. In practice we will use to establish unknown parameters. Chapter 9: Sampling Distributions 9.1: Sampling Distributions IDEA: How often would a given method of sampling give a correct answer if it was repeated many times? That is, if you took repeated samples

More information

STA 2201/442 Assignment 2

STA 2201/442 Assignment 2 STA 2201/442 Assignment 2 1. This is about how to simulate from a continuous univariate distribution. Let the random variable X have a continuous distribution with density f X (x) and cumulative distribution

More information

Preliminary Statistics Lecture 3: Probability Models and Distributions (Outline) prelimsoas.webs.com

Preliminary Statistics Lecture 3: Probability Models and Distributions (Outline) prelimsoas.webs.com 1 School of Oriental and African Studies September 2015 Department of Economics Preliminary Statistics Lecture 3: Probability Models and Distributions (Outline) prelimsoas.webs.com Gujarati D. Basic Econometrics,

More information

CS 246 Review of Proof Techniques and Probability 01/14/19

CS 246 Review of Proof Techniques and Probability 01/14/19 Note: This document has been adapted from a similar review session for CS224W (Autumn 2018). It was originally compiled by Jessica Su, with minor edits by Jayadev Bhaskaran. 1 Proof techniques Here we

More information

Midterm, Form A 07MAR2000

Midterm, Form A 07MAR2000 STA 110B Spring 2000 Name Section Midterm, Form A 07MAR2000 1. This test is CLOSED BOOK; however, you may use one 8.5 11 sheet (front and back) of notes. 2. You may use a calculator. 3. You have 75 minutes

More information

Lecture 10: Bayes' Theorem, Expected Value and Variance Lecturer: Lale Özkahya

Lecture 10: Bayes' Theorem, Expected Value and Variance Lecturer: Lale Özkahya BBM 205 Discrete Mathematics Hacettepe University http://web.cs.hacettepe.edu.tr/ bbm205 Lecture 10: Bayes' Theorem, Expected Value and Variance Lecturer: Lale Özkahya Resources: Kenneth Rosen, Discrete

More information

Unit 22: Sampling Distributions

Unit 22: Sampling Distributions Unit 22: Sampling Distributions Summary of Video If we know an entire population, then we can compute population parameters such as the population mean or standard deviation. However, we generally don

More information

M(t) = 1 t. (1 t), 6 M (0) = 20 P (95. X i 110) i=1

M(t) = 1 t. (1 t), 6 M (0) = 20 P (95. X i 110) i=1 Math 66/566 - Midterm Solutions NOTE: These solutions are for both the 66 and 566 exam. The problems are the same until questions and 5. 1. The moment generating function of a random variable X is M(t)

More information

Chapter 18 Sampling Distribution Models

Chapter 18 Sampling Distribution Models Chapter 18 Sampling Distribution Models The histogram above is a simulation of what we'd get if we could see all the proportions from all possible samples. The distribution has a special name. It's called

More information

Midterm Exam 1 Solution

Midterm Exam 1 Solution EECS 126 Probability and Random Processes University of California, Berkeley: Fall 2015 Kannan Ramchandran September 22, 2015 Midterm Exam 1 Solution Last name First name SID Name of student on your left:

More information

Chapter 12 - Lecture 2 Inferences about regression coefficient

Chapter 12 - Lecture 2 Inferences about regression coefficient Chapter 12 - Lecture 2 Inferences about regression coefficient April 19th, 2010 Facts about slope Test Statistic Confidence interval Hypothesis testing Test using ANOVA Table Facts about slope In previous

More information

Lecture 1: Probability Fundamentals

Lecture 1: Probability Fundamentals Lecture 1: Probability Fundamentals IB Paper 7: Probability and Statistics Carl Edward Rasmussen Department of Engineering, University of Cambridge January 22nd, 2008 Rasmussen (CUED) Lecture 1: Probability

More information

Regression Analysis. Ordinary Least Squares. The Linear Model

Regression Analysis. Ordinary Least Squares. The Linear Model Regression Analysis Linear regression is one of the most widely used tools in statistics. Suppose we were jobless college students interested in finding out how big (or small) our salaries would be 20

More information

Correlation and Regression

Correlation and Regression Correlation and Regression October 25, 2017 STAT 151 Class 9 Slide 1 Outline of Topics 1 Associations 2 Scatter plot 3 Correlation 4 Regression 5 Testing and estimation 6 Goodness-of-fit STAT 151 Class

More information

What Is a Sampling Distribution? DISTINGUISH between a parameter and a statistic

What Is a Sampling Distribution? DISTINGUISH between a parameter and a statistic Section 8.1A What Is a Sampling Distribution? Learning Objectives After this section, you should be able to DISTINGUISH between a parameter and a statistic DEFINE sampling distribution DISTINGUISH between

More information

2 (Statistics) Random variables

2 (Statistics) Random variables 2 (Statistics) Random variables References: DeGroot and Schervish, chapters 3, 4 and 5; Stirzaker, chapters 4, 5 and 6 We will now study the main tools use for modeling experiments with unknown outcomes

More information

18.600: Lecture 24 Covariance and some conditional expectation exercises

18.600: Lecture 24 Covariance and some conditional expectation exercises 18.600: Lecture 24 Covariance and some conditional expectation exercises Scott Sheffield MIT Outline Covariance and correlation Paradoxes: getting ready to think about conditional expectation Outline Covariance

More information

Basic Data Analysis. Stephen Turnbull Business Administration and Public Policy Lecture 5: May 9, Abstract

Basic Data Analysis. Stephen Turnbull Business Administration and Public Policy Lecture 5: May 9, Abstract Basic Data Analysis Stephen Turnbull Business Administration and Public Policy Lecture 5: May 9, 2013 Abstract We discuss random variables and probability distributions. We introduce statistical inference,

More information

Lecture 7: Confidence interval and Normal approximation

Lecture 7: Confidence interval and Normal approximation Lecture 7: Confidence interval and Normal approximation 26th of November 2015 Confidence interval 26th of November 2015 1 / 23 Random sample and uncertainty Example: we aim at estimating the average height

More information

LECTURE 1. Introduction to Econometrics

LECTURE 1. Introduction to Econometrics LECTURE 1 Introduction to Econometrics Ján Palguta September 20, 2016 1 / 29 WHAT IS ECONOMETRICS? To beginning students, it may seem as if econometrics is an overly complex obstacle to an otherwise useful

More information

The Simple Linear Regression Model

The Simple Linear Regression Model The Simple Linear Regression Model Lesson 3 Ryan Safner 1 1 Department of Economics Hood College ECON 480 - Econometrics Fall 2017 Ryan Safner (Hood College) ECON 480 - Lesson 3 Fall 2017 1 / 77 Bivariate

More information

Homework (due Wed, Oct 27) Chapter 7: #17, 27, 28 Announcements: Midterm exams keys on web. (For a few hours the answer to MC#1 was incorrect on

Homework (due Wed, Oct 27) Chapter 7: #17, 27, 28 Announcements: Midterm exams keys on web. (For a few hours the answer to MC#1 was incorrect on Homework (due Wed, Oct 27) Chapter 7: #17, 27, 28 Announcements: Midterm exams keys on web. (For a few hours the answer to MC#1 was incorrect on Version A.) No grade disputes now. Will have a chance to

More information

Chapter 8 Sampling Distributions Defn Defn

Chapter 8 Sampling Distributions Defn Defn 1 Chapter 8 Sampling Distributions Defn: Sampling error is the error resulting from using a sample to infer a population characteristic. Example: We want to estimate the mean amount of Pepsi-Cola in 12-oz.

More information

STATISTICS 1 REVISION NOTES

STATISTICS 1 REVISION NOTES STATISTICS 1 REVISION NOTES Statistical Model Representing and summarising Sample Data Key words: Quantitative Data This is data in NUMERICAL FORM such as shoe size, height etc. Qualitative Data This is

More information

Stat 20 Midterm 1 Review

Stat 20 Midterm 1 Review Stat 20 Midterm Review February 7, 2007 This handout is intended to be a comprehensive study guide for the first Stat 20 midterm exam. I have tried to cover all the course material in a way that targets

More information

, (1) e i = ˆσ 1 h ii. c 2016, Jeffrey S. Simonoff 1

, (1) e i = ˆσ 1 h ii. c 2016, Jeffrey S. Simonoff 1 Regression diagnostics As is true of all statistical methodologies, linear regression analysis can be a very effective way to model data, as along as the assumptions being made are true. For the regression

More information

Discrete Mathematics and Probability Theory Spring 2014 Anant Sahai Note 10

Discrete Mathematics and Probability Theory Spring 2014 Anant Sahai Note 10 EECS 70 Discrete Mathematics and Probability Theory Spring 2014 Anant Sahai Note 10 Introduction to Basic Discrete Probability In the last note we considered the probabilistic experiment where we flipped

More information

COSC 341 Human Computer Interaction. Dr. Bowen Hui University of British Columbia Okanagan

COSC 341 Human Computer Interaction. Dr. Bowen Hui University of British Columbia Okanagan COSC 341 Human Computer Interaction Dr. Bowen Hui University of British Columbia Okanagan 1 Last Topic Distribution of means When it is needed How to build one (from scratch) Determining the characteristics

More information

7.1: What is a Sampling Distribution?!?!

7.1: What is a Sampling Distribution?!?! 7.1: What is a Sampling Distribution?!?! Section 7.1 What Is a Sampling Distribution? After this section, you should be able to DISTINGUISH between a parameter and a statistic DEFINE sampling distribution

More information

Welcome to Physics 211! General Physics I

Welcome to Physics 211! General Physics I Welcome to Physics 211! General Physics I Physics 211 Fall 2015 Lecture 01-1 1 Physics 215 Honors & Majors Are you interested in becoming a physics major? Do you have a strong background in physics and

More information

Overview. Overview. Overview. Specific Examples. General Examples. Bivariate Regression & Correlation

Overview. Overview. Overview. Specific Examples. General Examples. Bivariate Regression & Correlation Bivariate Regression & Correlation Overview The Scatter Diagram Two Examples: Education & Prestige Correlation Coefficient Bivariate Linear Regression Line SPSS Output Interpretation Covariance ou already

More information

Open book, but no loose leaf notes and no electronic devices. Points (out of 200) are in parentheses. Put all answers on the paper provided to you.

Open book, but no loose leaf notes and no electronic devices. Points (out of 200) are in parentheses. Put all answers on the paper provided to you. ISQS 5347 Final Exam Spring 2017 Open book, but no loose leaf notes and no electronic devices. Points (out of 200) are in parentheses. Put all answers on the paper provided to you. 1. Recall the commute

More information

Math 381 Midterm Practice Problem Solutions

Math 381 Midterm Practice Problem Solutions Math 381 Midterm Practice Problem Solutions Notes: -Many of the exercises below are adapted from Operations Research: Applications and Algorithms by Winston. -I have included a list of topics covered on

More information

Chapter 7: Sampling Distributions

Chapter 7: Sampling Distributions Chapter 7: Sampling Distributions Section 7.1 What is a Sampling Distribution? The Practice of Statistics, 4 th edition For AP* STARNES, YATES, MOORE Chapter 7 Sampling Distributions 7.1 What is a Sampling

More information

Math 31 Lesson Plan. Day 2: Sets; Binary Operations. Elizabeth Gillaspy. September 23, 2011

Math 31 Lesson Plan. Day 2: Sets; Binary Operations. Elizabeth Gillaspy. September 23, 2011 Math 31 Lesson Plan Day 2: Sets; Binary Operations Elizabeth Gillaspy September 23, 2011 Supplies needed: 30 worksheets. Scratch paper? Sign in sheet Goals for myself: Tell them what you re going to tell

More information

Disjointness and Additivity

Disjointness and Additivity Midterm 2: Format Midterm 2 Review CS70 Summer 2016 - Lecture 6D David Dinh 28 July 2016 UC Berkeley 8 questions, 190 points, 110 minutes (same as MT1). Two pages (one double-sided sheet) of handwritten

More information

STAT Chapter 3: Probability

STAT Chapter 3: Probability Basic Definitions STAT 515 --- Chapter 3: Probability Experiment: A process which leads to a single outcome (called a sample point) that cannot be predicted with certainty. Sample Space (of an experiment):

More information

32. SOLVING LINEAR EQUATIONS IN ONE VARIABLE

32. SOLVING LINEAR EQUATIONS IN ONE VARIABLE get the complete book: /getfulltextfullbook.htm 32. SOLVING LINEAR EQUATIONS IN ONE VARIABLE classifying families of sentences In mathematics, it is common to group together sentences of the same type

More information

Problem 1. Problem 2. Problem 3. Problem 4

Problem 1. Problem 2. Problem 3. Problem 4 Problem Let A be the event that the fungus is present, and B the event that the staph-bacteria is present. We have P A = 4, P B = 9, P B A =. We wish to find P AB, to do this we use the multiplication

More information

MS&E 226: Small Data. Lecture 6: Bias and variance (v2) Ramesh Johari

MS&E 226: Small Data. Lecture 6: Bias and variance (v2) Ramesh Johari MS&E 226: Small Data Lecture 6: Bias and variance (v2) Ramesh Johari ramesh.johari@stanford.edu 1 / 47 Our plan today We saw in last lecture that model scoring methods seem to be trading o two di erent

More information

Interval estimation. October 3, Basic ideas CLT and CI CI for a population mean CI for a population proportion CI for a Normal mean

Interval estimation. October 3, Basic ideas CLT and CI CI for a population mean CI for a population proportion CI for a Normal mean Interval estimation October 3, 2018 STAT 151 Class 7 Slide 1 Pandemic data Treatment outcome, X, from n = 100 patients in a pandemic: 1 = recovered and 0 = not recovered 1 1 1 0 0 0 1 1 1 0 0 1 0 1 0 0

More information

Then the distribution of X is called the standard normal distribution, denoted by N(0, 1).

Then the distribution of X is called the standard normal distribution, denoted by N(0, 1). Normal distributions Normal distributions are commonly used because distributions of averages of IID random variables can often be approximated well by normal distributions The standard normal distribution

More information

STA 2101/442 Assignment 2 1

STA 2101/442 Assignment 2 1 STA 2101/442 Assignment 2 1 These questions are practice for the midterm and final exam, and are not to be handed in. 1. A polling firm plans to ask a random sample of registered voters in Quebec whether

More information

Expectation is linear. So far we saw that E(X + Y ) = E(X) + E(Y ). Let α R. Then,

Expectation is linear. So far we saw that E(X + Y ) = E(X) + E(Y ). Let α R. Then, Expectation is linear So far we saw that E(X + Y ) = E(X) + E(Y ). Let α R. Then, E(αX) = ω = ω (αx)(ω) Pr(ω) αx(ω) Pr(ω) = α ω X(ω) Pr(ω) = αe(x). Corollary. For α, β R, E(αX + βy ) = αe(x) + βe(y ).

More information

MAS113 Introduction to Probability and Statistics. Proofs of theorems

MAS113 Introduction to Probability and Statistics. Proofs of theorems MAS113 Introduction to Probability and Statistics Proofs of theorems Theorem 1 De Morgan s Laws) See MAS110 Theorem 2 M1 By definition, B and A \ B are disjoint, and their union is A So, because m is a

More information

Figure Figure

Figure Figure Figure 4-12. Equal probability of selection with simple random sampling of equal-sized clusters at first stage and simple random sampling of equal number at second stage. The next sampling approach, shown

More information

Mathematics E-15 Seminar on Limits Suggested Lesson Topics

Mathematics E-15 Seminar on Limits Suggested Lesson Topics Mathematics E-15 Seminar on Limits Suggested Lesson Topics Lesson Presentation Guidelines Each lesson should last approximately 45 minutes. This will leave us with some time at the end for constructive

More information

Single Maths B: Introduction to Probability

Single Maths B: Introduction to Probability Single Maths B: Introduction to Probability Overview Lecturer Email Office Homework Webpage Dr Jonathan Cumming j.a.cumming@durham.ac.uk CM233 None! http://maths.dur.ac.uk/stats/people/jac/singleb/ 1 Introduction

More information

Raquel Prado. Name: Department of Applied Mathematics and Statistics AMS-131. Spring 2010

Raquel Prado. Name: Department of Applied Mathematics and Statistics AMS-131. Spring 2010 Raquel Prado Name: Department of Applied Mathematics and Statistics AMS-131. Spring 2010 Final Exam (Type B) The midterm is closed-book, you are only allowed to use one page of notes and a calculator.

More information