Review. More Review. Things to know about Probability: Let Ω be the sample space for a probability measure P.

Similar documents
Probability and Probability Distributions. Dr. Mohammed Alahmed

Probability. Chapter 1 Probability. A Simple Example. Sample Space and Probability. Sample Space and Event. Sample Space (Two Dice) Probability

Properties of Probability

Lecture 01: Introduction

Lecture 3: Probability

Probability and Statistics. Terms and concepts

Axioms of Probability

Base Rates and Bayes Theorem

Today we ll discuss ways to learn how to think about events that are influenced by chance.

3.1 Events, Sample Spaces, and Probability

Discrete Probability

Online supplement. Absolute Value of Lung Function (FEV 1 or FVC) Explains the Sex Difference in. Breathlessness in the General Population

Basic Statistics and Probability Chapter 3: Probability

Formal Modeling in Cognitive Science

Formal Modeling in Cognitive Science Lecture 19: Application of Bayes Theorem; Discrete Random Variables; Distributions. Background.

Conditional Probabilities

Announcements. Topics: To Do:

Conditional probability

Bayes Formula. MATH 107: Finite Mathematics University of Louisville. March 26, 2014

Probability. Introduction to Biostatistics

Probability deals with modeling of random phenomena (phenomena or experiments whose outcomes may vary)

Probability and Conditional Probability

( ) P A B : Probability of A given B. Probability that A happens

Intelligent Systems: Reasoning and Recognition. Reasoning with Bayesian Networks

Elements of probability theory

Chapter Six: Two Independent Samples Methods 1/51

The Bayes Theorema. Converting pre-diagnostic odds into post-diagnostic odds. Prof. Dr. F. Vanstapel, MD PhD Laboratoriumgeneeskunde UZ KULeuven

Probability. Patrick Breheny. September 1. Events and probability Working with probabilities Additional theorems/rules Summary

Statistics in medicine

2.6 Tools for Counting sample points

Person-Time Data. Incidence. Cumulative Incidence: Example. Cumulative Incidence. Person-Time Data. Person-Time Data

Probability and Statistics. Joyeeta Dutta-Moscato June 29, 2015

2. AXIOMATIC PROBABILITY

Bayes Theorem & Diagnostic Tests Screening Tests

Econ 325: Introduction to Empirical Economics

Data Analysis and Statistical Methods Statistics 651

4. Probability of an event A for equally likely outcomes:

PROBABILITY.

1 The Basic Counting Principles

Discrete Probability. Chemistry & Physics. Medicine

Lecture Stat 302 Introduction to Probability - Slides 5

Probability II. Patrick Breheny. February 16. Basic rules (cont d) Advanced rules Summary

2.3 Conditional Probability

Chapter 6. Probability

MATH 10 INTRODUCTORY STATISTICS

Probability, Conditional Probability and Bayes Rule IE231 - Lecture Notes 3 Mar 6, 2018

where Female = 0 for males, = 1 for females Age is measured in years (22, 23, ) GPA is measured in units on a four-point scale (0, 1.22, 3.45, etc.

A survey of Probability concepts. Chapter 5

PHP 2510 Probability Part 3. Calculate probability by counting (cont ) Conditional probability. Law of total probability. Bayes Theorem.

3 Conditional Probability

Math 140 Introductory Statistics

Math 140 Introductory Statistics

Data Mining: Concepts and Techniques. (3 rd ed.) Chapter 8. Chapter 8. Classification: Basic Concepts

Probability & Random Variables

Performance evaluation of binary classifiers

Notes for Math 324, Part 12

Random Signals and Systems. Chapter 1. Jitendra K Tugnait. Department of Electrical & Computer Engineering. James B Davis Professor.

STAC51: Categorical data Analysis

Bayesian Decision Theory

4 Lecture 4 Notes: Introduction to Probability. Probability Rules. Independence and Conditional Probability. Bayes Theorem. Risk and Odds Ratio

Introduction. Dipankar Bandyopadhyay, Ph.D. Department of Biostatistics, Virginia Commonwealth University

It applies to discrete and continuous random variables, and a mix of the two.

P (E) = P (A 1 )P (A 2 )... P (A n ).

Lecture 2. Conditional Probability

Event A: at least one tail observed A:

Chapter 6. Logistic Regression. 6.1 A linear model for the log odds

Unit 2 Introduction to Probability

Probability Calculus. p.1

Probability Theory and Statistics

MAT 2379, Introduction to Biostatistics, Sample Calculator Questions 1. MAT 2379, Introduction to Biostatistics

CS 246 Review of Proof Techniques and Probability 01/14/19

Chapter 2: Describing Contingency Tables - I

Conditional Probability (cont...) 10/06/2005

When to use Bayes Rule. Bayes Rule for two choices. Bayes Rule for multiple choices. Will Murray s Probability, VI. Bayes Rule 1. VI.

Topic -2. Probability. Larson & Farber, Elementary Statistics: Picturing the World, 3e 1

Relative Risks (RR) and Odds Ratios (OR) 20

P (B)P (A B) = P (AB) = P (A)P (B A), and dividing both sides by P (B) gives Bayes rule: P (A B) = P (A) P (B A) P (B),

Chapter 2 Exercise Solutions. Chapter 2

Statistics for Engineers Lecture 1 Introduction to Probability Theory

Compound Events. The event E = E c (the complement of E) is the event consisting of those outcomes which are not in E.

COMP61011! Probabilistic Classifiers! Part 1, Bayes Theorem!

What is the probability of getting a heads when flipping a coin

lecture notes October 22, Probability Theory

Math 243 Section 3.1 Introduction to Probability Lab

Lecture Notes. 15MA301-Probability & Statistics. Prepared by S ATHITHAN

Conditional Probability (cont'd)

Tests for Population Proportion(s)

Lecture 3: Measures of effect: Risk Difference Attributable Fraction Risk Ratio and Odds Ratio

The Central Limit Theorem

Probability. Lecture Notes. Adolfo J. Rumbos

Lecture 10: Introduction to reasoning under uncertainty. Uncertainty

1. Let A and B be two events such that P(A)=0.6 and P(B)=0.6. Which of the following MUST be true?

An AI-ish view of Probability, Conditional Probability & Bayes Theorem

10/18/2017. An AI-ish view of Probability, Conditional Probability & Bayes Theorem. Making decisions under uncertainty.

Probability Notes (A) , Fall 2010

Given a experiment with outcomes in sample space: Ω Probability measure applied to subsets of Ω: P[A] 0 P[A B] = P[A] + P[B] P[AB] = P(AB)

Matching Problems. Roberto Lucchetti. Politecnico di Milano

Lecture 2. October 21, Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University.

Examples of frequentist probability include games of chance, sample surveys, and randomized experiments. We will focus on frequentist probability sinc

Introduction to Graphical Models. Srikumar Ramalingam School of Computing University of Utah

Transcription:

1 2 Review Data for assessing the sensitivity and specificity of a test are usually of the form disease category test result diseased (+) nondiseased ( ) + A B C D Sensitivity: is the proportion of diseased people who would be correctly classified (estimated by Sens A/(A + C)). Specificity: is the proportion of nondiseased people who would be correctly classified (estimated by Spec D/(B + D)). The prevalence: of a disease is the percent of the population with the disease (estimated by R (A + C)/(A + B + C + D)). Note that a random sample is required to estimate prevalence. Positive Predictive Value: is the proportion of people who tested positive that truly are positive. (estimated by PPV A/(A + B)). Negative Predictive Value: is the proportion of people who tested negative that truly are negative. (estimated by NPV D/(C + D)). False Negative: The probability of a false negative is the probability of testing negative given a truly positive condition. Serum Creat. (mg %) Sens Spec 1.2 0.939 0.123 1.3 0.939 0.203 1.4 0.909 0.281 1.5 0.818 0.380 1.6 0.758 0.461 1.7 0.727 0.535 1.8 0.636 0.649 1.9 0.636 0.711 2.0 0.545 0.766 2.1 0.485 0.773 2.2 0.485 0.803 2.3 0.394 0.811 2.4 0.394 0.843 2.5 0.364 0.870 2.6 0.333 0.891 2.7 0.333 0.894 2.8 0.333 0.896 2.9 0.303 0.909 Sensitivity and Specificity of serum creatinine level for predicting transplant rejection. False Positive: The probability of a false positive is the probability of testing positive given a truly negative condition. An ROC curve is a graph that plots the sensitivity as a function of the false positive (1 - spec) probability. sens 0.0 0.2 0.4 0.6 0.8 1.0 1-spec 0.0 0.2 0.4 0.6 0.8 1.0 3 More Review A random experiment is an experiment for which the outcome cannot be predicted with certainty, but all possible outcomes can be identified prior to its performance, and it may be repeated under the same conditions. The set of all possible outcomes of a random experiment is called the sample space, denoted Ω; subsets of the sample space are events, often denoted by A or A k for k 1, 2,..., n. Things to know about Probability: Let Ω be the sample space for a probability measure P. 1. 0 P (A) 1, for all events A, and P (Ω) 1. 2. P ( ) 0 3. For each A Ω, P (A) 1 P (Ā) 1 P (Ac ). 4. If A 1 A 2 Ω, P (A 1 ) P (A 2 ) 5. Two events A 1 and A 2 are mutually exclusive if they cannot both happen at the same time : (A 1 A 2 ). In this case P (A 1 A 2 ) 0. 4 6. Two events are independent if P (A 1 A 2 ) P (A 1 )P (A 2 )

5 6 Things to know about Probability (continued): 7. For any events A 1 and A 2, P (A 1 A 2 ) P (A 1 ) + P (A 2 ) P (A 1 A 2 ) (note for mutually exclusive events, the probability of the union is the sum of the probabilities - this extends to more than just two groups - see notes from last time. This is sometimes called the addition law of probability). 8. For any events A 1 and A 2, P (A 2 A 1 ) P (A 2 A 1 ) P (A 1 ) (note that for mutually exclusive events, P (A 2 A 1 ) 0; for independent events P (A 2 A 1 ) P (A 2 )). Relationships between Sens, Spec, PPV, NPV, FN and FP. Since 9.1 % of the truly positive people tested negative (1,000 out of 11,000), then (100-9.1 90.9) % of these truly positive people tested positive. Note that 10,000/11,000 0.909. Prob(false neg) 0.091 and Sens is 1-0.091 0.909. Convince yourself that Spec 1 - PFP where PFP is the probability of a false positive. 9. The above can be written as what is sometimes called the multiplication rule of probability: P (A 2 A 1 ) P (A 2 A 1 )P (A 1 ). 10. Law of Total Probability: Suppose events A 1, A 2,... A n are mutually exclusive and also exhaustive - exhaustive means n i1a i Ω. Then, P (B) n P (A i B) n P (B A i )P (A i ) i1 i1 7 8 P (D + test + ) P (D+ test + ) P (test + ) P ((test + D + ) (test + D )) P (test + D + ) + P (test + D ) + P (test + D )P (D ) Sens x Sens x + (1 Spec) (1 x) where x P (D + ) is the disease prevalence. Bayes Rule The example above is an application of Bayes Rule. Bayes Rule: If A 1, A 2,..., A n are n mutually exclusive events (A i A j for i j) that are also exhaustive ( n i1 A i Ω), then for any event B where P (B) > 0, P (A i B) P (B A i )P (A i ) n j1 P (B A j )P (A j ) Note that a person randomly selected from a population (according to the probabilities estimated in our hypothetical study) has an 11% chance of being HIV positive. This probability (0.11) is called the prior probability. In contrast, the posterior probability takes into account additional information. In this case, the additional information is the result of an HIV test. P (HIV + test + ) is a posterior probability.

Suppose that a 60 year old man who has never smoked cigarettes presents to a physician with symptoms consisting of a chronic cough and occasional breathlessness. The physician expects that the patient is either if normal (A 1 ), has lung cancer (A 2 ), or has sarcoidosis (A 3 ) and a biopsy is ordered. Suppose results of biopsy are positive or negative and that a positive result is consistent with either lung cancer or sarcoidosis. Outcomes are: (A 1 test + ), (A 1 test ), (A 2 test + ),..., (A 3 test ) What is P (A 1 test + )? Note that (A i test + ) (A i test ) A i for i 1, 2, 3 are mutually exclusive events and that 3 i1 A i Ω. Bayes rule! P (A 1 test + ) P (test + A 1 )P (A 1 ) 3 i1 P (test + A i )P (A i ) 9 Suppose that P (test + A 1 ) 0.001, P (test + A 2 ) 0.9 and P (test + A 3 ) 0.9 and that in 60 year old non-smoking men, P (A 1 ) 0.99, P (A 2 ) 0.001 and P (A 3 ) 0.009. The first set of probabilities could be obtained from clinical experimence. The second set would have to be obtained from agesex-smoking specific prevalence rates for the conditions in question. P (A 1 test + ) P (test + A 1 )P (A 1 ) 3 i1 P (test + A i )P (A i ) 0.001 0.99 0.001 0.99 + 0.9 0.001 + 0.9 0.009 0.099 We could work out P (A 2 test + ) 0.090 and P (A 3 test + ) 0.811 Although the unconditional (prior) probability of sarcoidosis is very low (0.009), the conditional probability of the disease given these symptoms and this age/sex/smoking group is 0.811. 10 How would P (A 1 test + ) change if the man was a smoker? In this case, P (A 1 ) 0.98, P (A 2 ) 0.015, P (A 3 ) 0.005. Convince yourself that if the man was a smoker, P (A 1 test + ) 0.052, P (A 2 test + ) 0.711, and P (A 3 test + ) 0.237. Now, lung cancer is the most likely diagnosis. 11 Consider a manuscript published in the Journal of Ultrasound in Medicine (1995). Q1: Using the information in Table 3 of that paper, what is the probability that a 42 year old will have a child with autosomal trisomy? How does that probability change if you know the woman has had a test indicating that trisomy is not present? 12 Q2: What is the probability that a 35 year old woman will have a child with autosomal trisomy?

In the manuscript, sonographic scores were reviewed for 97 trisomic (T + ) and 694 non-trisomic (T ) infants. These infants were not randomly sampled from the population. A positive test implies trisomy was detected. Of the 97, 83 tested positive; of the 694, 606 tested negative. trisomy Sonographic test (T + ) (T ) total + 83 88 171-14 606 620 total 97 694 791 13 Q: What is the probability of a 42 year old woman having a child with trisomy, P (T + 42)? We cannot get an estimate from the table on the previous page for two main reasons: (1) The table incorporates infants from women of varying ages and (2) The table is NOT a random sample of 791 infants. We can get an estimate of P (T + 42) from previous studies. Data from other studies is shown in Table 3 of the manuscript. 14 Sens 83 97 0.85567 Spec 606 694 0.87319 Q: What is the probability of a 42 year old woman having a child with trisomy given she has had a normal sonogram? P (T + 42 test ) P (test T + 42)P (T + 42) P (test T + 42)P (T + 42) + P (test T 42)P (T 42) P (test T + 42) and P (test T 42) are taken from the study data given in the table shown previously (where sensitivity and specificity are calculated). One could argue that these probabilities are NOT affected by a woman s age, they only depend on the accuracy of the test. P (T 42) and P (T + 42) are affected by a woman s age. 15 Good estimates are available in the literature. These estimates are given in Table 3 of the manuscript. The study was done to evaluate the effectiveness of the sonogram to accurately detect trisomy. To do this, cases of known trisomy were obtained and how well the test performed was recorded. Similarly, cases of known non-trisomy were obtained and test performance was recorded. Using this information, sensitivity and specificity estimates were obtained. Because this was not a random sample of a population, an accurate estimate of the prevalence could not be obtained. This information was obtained from the literature. **** Bayes rule was used to combine the estimates of sensitivity and specificity with the estimate of prevalence to answer the question: What is the probability of a 42 year old woman having a child with trisomy given she has had a normal sonogram?. We could answer the question for any child bearing aged woman (age n) by adjusting our estimate of the prevalence. 16 (1 0.85567) 0.0306 (1 0.85567) 0.0306 + (0.87319) 0.9694 0.00519 P (T + n test ) We just looked at P (test T + n)p (T + n) P (test T + n)p (T + n) + P (test T n)p (T n) P (d + test ) P (test d + )P (d + ) P (test d + )P (d + ) + P (test d )P (d )

Again, two reasons Bayes rule is important: It allows you to incorporate extra information into probability estimates. For example, the prior probability P (d + ) is adjusted to account for additional information resulting in the posterior probability, P (d + test + ) It is a formula that relates conditional probabilities. Many times, certain conditional probabilities are well known (e.g. in the literature). You can use these to obtain conditional probabilities that you re interested in. 17 Other important conditional probabilities: Relative Risk and Odds Ratio The relative risk (RR) is the probability that a member of an exposed group will develop a disease relative to the probability that a member of an unexposed group will develop that same disease. RR P (disease exposed) P (disease unexposed) If an event takes place with probability p, the odds in favor of the event are p 1 p to 1. p 1 2 implies 1 to 1 odds; p 2 3 implies 2 to 1 odds. 18 In this class, the odds ratio (OR) is the odds of disease among exposed individuals divided by the odds of disease among unexposed. OR P (disease exposed)/(1 P (disease exposed)) P (disease unexposed)/(1 P (disease unexposed))