Chapter 5: Probability in Our Daily Lives

Similar documents
Chapter 7: Sampling Distributions

Week 2. Section Texas A& M University. Department of Mathematics Texas A& M University, College Station 22 January-24 January 2019

Chapter 6: Probability The Study of Randomness

4. Probability of an event A for equally likely outcomes:

Probability- describes the pattern of chance outcomes

Econ 325: Introduction to Empirical Economics

Chapter 1 (Basic Probability)

Statistics 251: Statistical Methods

Probability Theory and Applications

Recap. The study of randomness and uncertainty Chances, odds, likelihood, expected, probably, on average,... PROBABILITY INFERENTIAL STATISTICS

1 Probability Theory. 1.1 Introduction

The probability of an event is viewed as a numerical measure of the chance that the event will occur.

Discrete Mathematics and Probability Theory Spring 2016 Rao and Walrand Note 14

Math 1313 Experiments, Events and Sample Spaces

Chapter 5 : Probability. Exercise Sheet. SHilal. 1 P a g e

ORF 245 Fundamentals of Statistics Chapter 5 Probability

MA : Introductory Probability

ORF 245 Fundamentals of Statistics Chapter 1 Probability

Chapter 14. From Randomness to Probability. Copyright 2012, 2008, 2005 Pearson Education, Inc.

4 Lecture 4 Notes: Introduction to Probability. Probability Rules. Independence and Conditional Probability. Bayes Theorem. Risk and Odds Ratio

Chapter 8: Sampling Variability and Sampling Distributions

Venn Diagrams; Probability Laws. Notes. Set Operations and Relations. Venn Diagram 2.1. Venn Diagrams; Probability Laws. Notes

Probability: Terminology and Examples Class 2, Jeremy Orloff and Jonathan Bloom

Probability: Part 1 Naima Hammoud

Properties of Probability

Last time. Numerical summaries for continuous variables. Center: mean and median. Spread: Standard deviation and inter-quartile range

STAT 111 Recitation 1

3.2 Probability Rules

Discrete Probability. Mark Huiskes, LIACS Probability and Statistics, Mark Huiskes, LIACS, Lecture 2

Probability (special topic)

Preliminary Statistics Lecture 2: Probability Theory (Outline) prelimsoas.webs.com

Fundamentals of Probability CE 311S

Probability. Chapter 1 Probability. A Simple Example. Sample Space and Probability. Sample Space and Event. Sample Space (Two Dice) Probability

Conditional Probability

Probability (Devore Chapter Two)

STAT 201 Chapter 5. Probability

The enumeration of all possible outcomes of an experiment is called the sample space, denoted S. E.g.: S={head, tail}

1 Preliminaries Sample Space and Events Interpretation of Probability... 13

Announcements. Topics: To Do:

The Practice of Statistics Third Edition

3.1 Events, Sample Spaces, and Probability

Probabilistic models

ELEG 3143 Probability & Stochastic Process Ch. 1 Probability

PERMUTATIONS, COMBINATIONS AND DISCRETE PROBABILITY

I - Probability. What is Probability? the chance of an event occuring. 1classical probability. 2empirical probability. 3subjective probability

Single Maths B: Introduction to Probability

MATH 3C: MIDTERM 1 REVIEW. 1. Counting

Conditional Probability and Independence

Axioms of Probability

AMS7: WEEK 2. CLASS 2

Intermediate Math Circles November 8, 2017 Probability II

CS626 Data Analysis and Simulation

Lecture 1. Chapter 1. (Part I) Material Covered in This Lecture: Chapter 1, Chapter 2 ( ). 1. What is Statistics?

Statistics for Financial Engineering Session 2: Basic Set Theory March 19 th, 2006

Elements of probability theory

CMPSCI 240: Reasoning about Uncertainty

CIVL Why are we studying probability and statistics? Learning Objectives. Basic Laws and Axioms of Probability

Probability Rules. MATH 130, Elements of Statistics I. J. Robert Buchanan. Fall Department of Mathematics

4/17/2012. NE ( ) # of ways an event can happen NS ( ) # of events in the sample space

Probability deals with modeling of random phenomena (phenomena or experiments whose outcomes may vary)

STA 247 Solutions to Assignment #1

LECTURE 1. 1 Introduction. 1.1 Sample spaces and events

Lecture 2: Probability. Readings: Sections Statistical Inference: drawing conclusions about the population based on a sample

STATISTICAL INDEPENDENCE AND AN INVITATION TO THE Art OF CONDITIONING

Lecture 1: Probability Fundamentals

2.6 Tools for Counting sample points

Recitation 2: Probability

Deep Learning for Computer Vision

MAT2377. Ali Karimnezhad. Version September 9, Ali Karimnezhad

Lecture 1: Basics of Probability

Probability: Sets, Sample Spaces, Events

Mathematical Foundations of Computer Science Lecture Outline October 18, 2018

Probability Experiments, Trials, Outcomes, Sample Spaces Example 1 Example 2

Notes 1 Autumn Sample space, events. S is the number of elements in the set S.)

3 PROBABILITY TOPICS

STA 291 Lecture 8. Probability. Probability Rules. Joint and Marginal Probability. STA Lecture 8 1

Chapter 7 Wednesday, May 26th

ECE 340 Probabilistic Methods in Engineering M/W 3-4:15. Lecture 2: Random Experiments. Prof. Vince Calhoun

STA Module 4 Probability Concepts. Rev.F08 1

STA111 - Lecture 1 Welcome to STA111! 1 What is the difference between Probability and Statistics?

ELEG 3143 Probability & Stochastic Process Ch. 1 Experiments, Models, and Probabilities

2.4. Conditional Probability

Compound Events. The event E = E c (the complement of E) is the event consisting of those outcomes which are not in E.

CHAPTER 3 PROBABILITY TOPICS

Business Statistics. Lecture 3: Random Variables and the Normal Distribution

Lecture 6 Probability

ENGI 4421 Introduction to Probability; Sets & Venn Diagrams Page α 2 θ 1 u 3. wear coat. θ 2 = warm u 2 = sweaty! θ 1 = cold u 3 = brrr!

Probabilistic models

P [(E and F )] P [F ]

Term Definition Example Random Phenomena

STAT Chapter 3: Probability

Intro to Stats Lecture 11

Lecture Lecture 5

Probability and Sample space

Probability Notes. Definitions: The probability of an event is the likelihood of choosing an outcome from that event.

Mutually Exclusive Events

What is the probability of getting a heads when flipping a coin

MATH MW Elementary Probability Course Notes Part I: Models and Counting

1 The Basic Counting Principles

Chapter 2 PROBABILITY SAMPLE SPACE

Transcription:

Chapter 5: in Our Daily Lives These notes reflect material from our text, Statistics: The Art and Science of Learning from Data, Third Edition, by Alan Agresti and Catherine Franklin, published by Pearson, 2013. quantifies randomness. It is a formal framework with a very specific vocabulary and notation. Imagine an experiment with a specific set of outcomes (say, flipping a fair coin twice). S is the sample space of all possible outcomes. Subsets of S are called events and are denoted with letters like A and B. The empty set, φ, is the event that contains no outcomes. Two events are disjoint if their intersection is empty. The Russian mathematician Kolmogorov helped to clarify the essential properties of a probability function, P P (S) = 1 for the entire sample space S 0 P (A) 1 for any event A S P ( n i=1 A i) = n i=1 P (A i) for disjoint events A i First examples : flip a coin, flip three coins, roll a die, roll two dice If you roll a die once the result is completely uncertain, because the individual outcomes are equally likely. But now begin to methodically roll the die and after each toss calculate the total number of 6 s observed so far divided by the total number of rolls at this point. Call this a cumulative proportion and graph these cumulative proportions for a large number of rolls of the die, say 100,000 rolls. A computer did this and displayed the following graph. In this particular simulation, the first ten rolls of the die produced the sequence 0001010010, where 1 means a 6 was rolled and 0 means something else appeared. Calculate the first ten cumulative sums for this short sequence and compare your results to the following chart. What is the height of the dotted red line? 0.30 0.25 0.20 p^n 0.15 0.10 0.05 0.00 1 10 100 1,000 10,000 100,000 n (number of rolls) Fig. Cumulative proportions of a 6 in 100,000 rolls of a fair die, from OpenIntro Statistics, chapter 2 Display discrete probabilities in a table Flip a fair coin outcome h t probability 0.5 0.5 Spring 2015 Page 1 of 7

Venn diagram A B Rules of Mutually exclusive events. A B = φ Unions. P (A B) = P (A) + P (B) P (A B). Complements. P (A c ) = 1 P (A). Independent events. P (A B) = P (A)P (B) when A and B are independent. Conditional probability. P (A B) = P (A B)/P (B) when P (B) 0 Intersections. P (A B) = P (A B)P (B) Spring 2015 Page 2 of 7

Contingency tables and conditional probabilities Vocabulary for diagnostic testing, S medical state present, P OS test positive : sensitivity P (P OS S), specificity P (NEG S c ), incidence P (S) Consider the Triple Blood Test for Down Syndrome (Agresti and Franklin, chapter 5, pp.232-233) Blood Test Status P OS NEG T otal D (Down) 48 6 54 D c (unaffected) 1307 3921 5228 T otal 1355 3927 5282 Calculate the following probabilities based on the figures in this study: sensitivity P (P OS D), specificity P (NEG D c ), incidence P (D) false positives P (P OS D c ), false negatives P (NEG D) An individual being tested would be most concerned about P (D P OS). What is this probability? Why is it so small? Hint: Calculate P (D c P OS). Again, an individual being tested would want to know P (D NEG). How would that probability compare to the a priori P (D)? Triple Blood Test POS NEG status unaffected Down blood test Spring 2015 Page 3 of 7

Using R to Compute Conditional Probabilities Construct a data frame named down to represent the Down Syndrome contingency table, and then use addmargins(down) to compute its row and column totals. down <- c(48, 1307, 6, 3921) dim(down) <- c(2, 2) dimnames(down) <- list(status=c("down", "unaffected"), "blood test"=c("pos", "neg")) down # status pos neg # down 48 6 # unaffected 1307 3921 addmargins(down) # status pos neg Sum # down 48 6 54 # unaffected 1307 3921 5228 # Sum 1355 3927 5282 Then prop.table(down, 1) will divide each row by its row sum. The numbers in each row are conditional probabilities. And prop.table(down, 2) will divide each column by its column sum. The numbers in each column are conditional probabilities. Therefore, each of the eight numbers shown below is a conditional probability of the form P (A B) for some A and B. Identify the correct A and B for each number. prop.table(down, 1) # status pos neg # down 0.8888889 0.1111111 # unaffected 0.2500000 0.7500000 prop.table(down, 2) # status pos neg # down 0.03542435 0.001527884 # unaffected 0.96457565 0.998472116 What values do these tables indicate for P (pos down) and P (down pos)? Spring 2015 Page 4 of 7

Boston Smallpox Epidemic of 1721 The following contingency table (OpenIntro Statistics, pp.83 87) refers to the Boston smallpox epidemic of 1721. A total of 6224 residents of Boston contracted smallpox in this epidemic and 850 of them died. The epidemic was marked by vigorous public debate of the value (or lack thereof) of a type of inoculation known as variolation (which was dangerous). The Reverend Cotton Mather advocated inoculation but the physician William Douglass was firmly against it. See the article in Harvard s Contagion for more details. An effective smallpox vaccination procedure was eventually demonstrated by Edward Jenner in England in 1796, and succeeding efforts to eradicate smallpox from the world were finally declared to be successful in 1980 by the World Health Organization. Cotton Mather, on the other hand, lives on in infamy for his role in the Salem witch trials. Inoculated Result yes no T otal lived 238 5136 5374 died 6 844 850 T otal 244 5980 6224 Smallpox Epidemic, Boston, 1721 yes no died result lived innoculated Spring 2015 Page 5 of 7

Tree Diagrams The following tree diagram, generated by OpenIntro software, summarizes the relevant statistics for the Boston smallpox epidemic of 1721. Here Inoculated is a categorical explanatory variable with levels yes and no. In the Inoculated column of the tree diagram are the probabilities P (yes) and P (no). The categorical response variable Result has levels lived and died. The conditional probabilities in the Result column are P (lived yes), P (died yes), P (lived no), P (died no). The probabilities calculated by the software in the third column are P (lived and yes), P (died and yes), P (lived and no), P (died and no), because P (A) P (B A) = P (A B). Innoculated yes, 0.0392 Result lived, 0.9754 died, 0.0246 0.0392*0.9754 = 0.03824 0.0392*0.0246 = 0.00096 no, 0.9608 lived, 0.8589 died, 0.1411 0.9608*0.8589 = 0.82523 0.9608*0.1411 = 0.13557 Fig. Smallpox in Boston, 1721, from OpenIntro Statistics, chapter 2, pp.83-87 Spring 2015 Page 6 of 7

Exercises We will attempt to solve some of the following exercises as a community project in class today. Finish these solutions as homework exercises, write them up carefully and clearly, and hand them in at the beginning of class next Friday. Exercises for Chapter 5: 5.10 (coin), 5.12 (stock market), 5.21 (risky behavior), 5.23 (seat belts), 5.34 (free throws), 5.37 (Down), 5.38 (job), 5.40 (serves), 5.50 (Masters), 5.57 (mammogram) Class work 5a probability Exercises from Chapter 5: 5.10 (coin), 5.12 (stock market), 5.21 (risky behavior), 5.23 (seat belts), 5.34 (free throws) Class work 5b probability Exercises from Chapter 5: 5.37 (Down), 5.38 (job), 5.40 (serves), 5.50 (Masters), 5.57 (mammogram) Spring 2015 Page 7 of 7