An Informal Introduction to Statistics in 2h. Tim Kraska

Size: px
Start display at page:

Download "An Informal Introduction to Statistics in 2h. Tim Kraska"

Transcription

1 An Informal Introduction to Statistics in 2h Tim Kraska

2 Goal of this Lecture This is not a replacement for a proper introduction to probability and statistics Instead, it only tries to convey the very basic intuition behind some of the ideas The risk of this lecture: Half knowledge can be dangerous Most slides are based on CS 155 (big thanks to Eli)

3 The Very Basic

4 Statistics Probability Probability: mathematical theory that describes uncertainty. Statistics: set of techniques for extracting useful information from data.

5 Probability Space

6 Probability Function

7 Tossing a (Fair) Coin Ω = { H,T} F = 2 Ω = 2 2 = 4Events { { } F = { }, { H}, { T}, H,T Pr ({ })= 0 Pr Pr Pr ({ H} )= 0.5 ({ T} )= 0.5 ({ H,T} )= 1

8 Rolling a Dice Ω = { 1,2,3,4,5,6 } F = 2 Ω = 2 6 Events Pr ({ })= 0 Pr { 1} { } { } { } { } { } ( )= Pr( 2 )= Pr( 3 )= Pr( 4 )= Pr( 5 )= Pr( 6 )= 1 6 Pr { 1,2}... { } { } { } { } ( )= Pr( 1,3 )= Pr( 1,4 )= Pr( 1,5 )= Pr( 1,6 )= 2 6

9 Independent Events

10 Tossing a (Fair) Coin Twice Ω = { HH, HT,TH,TT } F = 2 Ω = 2 4 Events Pr ({ })= 0 Pr { HH} Pr { HT} Pr { HT,TT} Pr { HH, HT}... { } { } { } { } { } { } ( )= Pr( H )Pr ( H )= = 0.25 ( )= Pr( TH )= Pr( TT )= 0.25 ( )= Pr( HH,TH )= 0.5 ( )= Pr( TH,TT )= 0.5

11 Conditional Probability

12 Computing Conditional Probabilities

13 Example - a posteriori probability

14

15 Law of Total Probability

16 In Class Exercises 1. A fair coin was tossed 10 times and always ended up on its head. What is the likelihood that it will end up tail next? 2. Stan has two kids. One of his kids is a boy. What is the likelihood that the other one is also a boy

17 Bayesian Statistics

18 Bayes Law

19 Bayes Theorem Likelihood Probability of collecting this data when our hypothesis is true P(H D) = P(D H) P(H) P(D) Prior The probability of the hypothesis being true before collecting data Posterior The probability of our hypothesis being true given the data collected Marginal What is the probability of collecting this data under all possible hypotheses?

20 Deriving Bayes Law B P(A B) ~B A ~A U P(A B) = P(A B) * P(B) = P(B A) * P(A) U P(A B) = P(B A) * P(A) P(B)

21 Application: Finding a Biased Coin

22

23 Class Example: Drug Test 0.4% of the Rhode Island population use Marijuana* Drug Test: The test will produce 99% true positive results for drug users and 99% true negative results for non-drug users. If a randomly selected individual is tested positive, what is the probability he or she is a user?

24 Class Example: Drug Test 0.4% of the Rhode Island population use Marijuana* Drug Test: The test will produce 99% true positive results for drug users and 99% true negative results for non-drug users. If a randomly selected individual is tested positive, what is the probability he or she is a user? P( User + )= P ( + User )P( User) P + = ( ) ( ) P ( + User)P User P ( + User)P ( User)+ P ( +!User )P!User = = 28.4% ( )

25 Spam Filtering with Naïve Bayes ( ) P( spam words)= P ( spam )P words spam P( words) P( spam viagra,rich,..., friend)= P ( spam )P viagra,rich,..., friend spam P viagra,rich,..., friend ( ) ( ) P( spam words) P ( spam )P( viagra spam)p( rich spam) P friend spam P viagra,rich,..., friend ( ) ( ) 9/12/13 Bill Howe, UW 25

26 Bayesian Inference ( ) ( ) P( H E)= P ( E H )P H P E P( Θ E α)= P ( E Θ α )P Θ α P E α ( ) ( ) H Hypothesis ( ) Prior Probability P H P( H E) Posterior Probability ( ) Probability of observing E given H, likelihood P E H ( ) Model Evidence (marginal likelihood) P E

27 Random Variables

28 How to Model A Simple Game I get $5 from you You get $10 from me

29 Random Variables

30 Independence

31 Expectation µ

32 Linearity of Expectation

33 How to Model A Simple Game I get $5 from you You get $10 from me Would you play this game?

34 Variance

35 Variance

36 So far we knew the distribution What if we do not?

37 Red/Blue/Green Lottery Population N

38 Empirical Probability Population N f i = n i N = n in i i f blue = f green = 4 20 f red = 6 20

39 Population Mean µ = i N x i Variance σ 2 = i ( x i µ ) N 2

40 Red/Blue/Green Lottery Population N Sample n

41 Population vs. Sample Mean Population (parameter) µ = i N x i Sample (Statistic) àestimates x i i x = n Variance σ 2 = i ( x i µ ) N 2 Biased Estimate Un-Biased Estimate S N 2 = 2 S N 1 i = ( x i µ ) i 2 n ( x i µ ) n 1 2

42 Big Data How to calculate the Variance in 1-Pass 2 S N 1 2 ( x i µ ) i = n 1 = 1 x 2 i 1 n 1 i n = 1 n 1 i i x i ( x i x) 2 1 n 2 i ( x i x) 2

43 Law of Large Numbers

44 Law of Large Numbers Draw independent observations at random from any population with finite mean μ. As the number of observations increases, the sample mean approaches mean μ of the population. The more variation in the outcomes, the more trials are needed to ensure that is close to μ.

45 Weak Law of large numbers Strong law of large numbers X µ Pr( lim X n = µ )= 1 n

46 Central Limit Theorem

47 Law of Large Numbers (Coin)

48 Convolution Tossing 2 Dice Die 1: X Die 2: Y Dice 1+2 : Z = X Y k= P( Z = z)= P(X = k)p(y = z k)

49 Distribution of X 1 : Die 1 or Die

50 Distribution of S 2 : 2 Dice

51 Distribution of S

52 Distribution of S

53 Distribution of S

54 Distribution of S

55 Distribution of X

56 Distribution of S

57 Distribution of S

58 Distribution of S

59 Distribution of S

60 Distribution of S

61 Distribution of X

62 Distribution of S

63 Distribution of S

64 Distribution of S

65 Distribution of S

66 Distribution of S

67 Normal Distribution Probability Density Function 1 f ( x) = e x σ σ 2π 2 ( µ ) / ( 2 2) Ν ( µ,σ 2 ) Probability Density Function (PDF) Cumulative Distribution Function (CDF)

68 The Central Limit Theorem 1. The distribution of means will be approximately a normal distribution for larger sample sizes 2. The mean of the distribution of means approaches the population mean, μ, for large sample sizes 3. The standard deviation of the distribution of means approaches σ/ n for large sample sizes, where σ is the standard deviation of the population and n is the sample size

69 The Central Limit Theorem Side Notes 1. For practical purposes, the distribution of means will be nearly normal if the sample size is larger than If the original population is normally distributed, then the sample means will remain normally distributed for any sample size n, and it will become narrower 3. The original variable can have any distribution, it does not have to be a normal distribution

70 Shapes of Distributions as Sample Size Increases

71 Testing

72 Hypothesis Testing The FDA or science needs to decide on a new theory, drug, treatment H 0 : The null hypothesis - the current theory, drug, treatment, is as good or better H 1 : The alternative hypothesis - the new theory, drug, treatment, should replace the old one Researchers do not know which hypothesis is true. They must make a decision on the basis of evidence presented.

73 What is a Hypothesis? A hypothesis is a claim (assumption) about a population parameter: population mean Example: The mean monthly cell phone bill of this city is μ = $42 population proportion Example: The proportion of adults in this city with cell phones is p =.68 Statistics for Business and Economics, 6e 2007 Pearson Education, Inc. Chap 10-73

74 The Null Hypothesis, H 0 H 0 : μ = 3 H 0 : μ = 3 H 0 : X = 3 Statistics for Business and Economics, 6e 2007 Pearson Education, Inc.

75 Hypothesis Testing Process Is Claim: the population mean age is 50. (Null Hypothesis: H 0 : μ = 50 ) X=20 likely if μ = 50? If not likely, REJECT Null Hypothesis Suppose the sample mean age is 20: X = 20 Population Now select a random sample Sample

76 Reason for Rejecting H 0 76

77 Outcomes and Probabilities Possible Hypothesis Test Outcomes Key: Outcome (Probability) Decision Do Not Reject H 0 Reject H 0 Actual Situation H 0 True No error (1 - α ) Type I Error ( ) α H 0 False Type II Error ( β ) No Error ( 1 - β ) Statistics for Business and Economics, 6e 2007 Pearson Education, Inc. Chap 10-77

78 Level of Significance and the Rejection Region Level of significance = α H 0 : μ = 3 H 1 : μ 3 H 0 : μ 3 H 1 : μ > 3 Two-tail test Upper-tail test α/2 0 0 α /2 α Represents critical value Rejection region is shaded H 0 : μ 3 H 1 : μ < 3 α Statistics for Business and Economics, 6e 2007 Pearson Education, Inc. Lower-tail test Chap

79 p-value Approach to Testing p-value: Probability of obtaining a test statistic more extreme ( or ) than the observed sample value given H 0 is true Also called observed level of significance Smallest value of α for which H 0 can be rejected Statistics for Business and Economics, 6e 2007 Pearson Education, Inc. Chap 10-79

80 p-value Calculate the p-value and compare to α p-value Reject H 0 α =.10 0 Do not reject H 0 Reject H 0 Statistics for Business and Economics, 6e 2007 Pearson Education, Inc. Chap 10-80

81 Jonah Lehrer, 2010, The New Yorker The Truth Wears off John Davis, University of Illinois Davis has a forthcoming analysis demonstrating that the efficacy of antidepressants has gone down as much as threefold in recent decades. Anders Pape Møller, 1991 female barn swallows were far more likely to mate with male birds that had long, symmetrical feathers Between 1992 and 1997, the average effect size shrank by eighty per cent. Jonathan Schooler, 1990 subjects shown a face and asked to describe it were much less likely to recognize the face when shown it later than those who had simply looked at it. The effect became increasingly difficult to measure. Joseph Rhine, 1930s, coiner of the term extrasensory perception Tested individuals with card-guessing experiments. A few students achieved multiple low-probability streaks. But there was a decline effect their performance became worse over time. 9/12/13 Bill Howe, Data Science, Autumn

82 Reason 1: Publication Bias In the last few years, several meta-analyses have reappraised the efficacy and safety of antidepressants and concluded that the therapeutic value of these drugs may have been significantly overestimated. Although publication bias has been documented in the literature for decades and its origins and consequences debated extensively, there is evidence suggesting that this bias is increasing. A case in point is the field of biomedical research in autism spectrum disorder (ASD), which suggests that in some areas negative results are completely absent (emphasis mine) a highly significant correlation (R 2 = 0.13, p < 0.001) between impact factor and overestimation of effect sizes has been reported. Publication bias: What are the challenges and can they be overcome? Ridha Joober, Norbert Schmitz, Lawrence Annable, and Patricia Boksa 9/12/13 Bill Howe, Data Science, Autumn J Psychiatry Neurosci May; 37(3): doi: /jpn

83 Publication Bias decline effect 9/12/13 Bill Howe, UW 83

84 decline effect = publication bias! 9/12/13 Bill Howe, UW 84

85 Background: Effect Size Effect size = [Mean of experimental group] [Mean of control group] standard deviation Expressed in relevant units Not just significant how significant? Used prolifically in meta-analysis to combine results from multiple studies But be careful averaging results from different experiments can produce nonsense Caveat: Other definitions of effect size exist: odds-ratio, correlation coefficient Robert Coe, 2002, Annual Conference of the British Educational Research Association It's 9/12/13 the Effect Size, Stupid: What effect size is Bill and Howe, why UW it is important. 85

86 Effect Size Standardized Mean Difference Lots of ways to estimate the pooled standard deviation Glass, 1976 e.g., Hartung et al., /12/2013 Bill Howe, UW 86

87 Effect size: Cohen s Heuristic Standardized mean difference effect size small = 0.20 medium = 0.50 large = /12/2013 Bill Howe, UW 87

88 Reason 3: Multiple Hypothesis Testing If you perform experiments over and over, you re bound to find something This is a bit different than the publication bias problem: Same sample, different hypotheses Significance level must be adjusted down when performing multiple hypothesis tests 9/12/13 Bill Howe, UW 88

89 P(detecting an effect when there is none) = α = 0.05 P(detecting an effect when it exists) = 1 α P(detecting an effect when it exists on every experiment) = (1 α) k P(detecting an effect when there is none on at least one experiment) = 1 (1 α) k α = 0.05 Familywise Error Rate 9/12/13 Bill Howe, UW 89

90 Familywise Error Rate Corrections Bonferroni Correction Just divide by the number of hypotheses Šidák Correction Asserts independence 09/12/2013 Bill Howe, UW 90

91 Summary Stochastic Variables Basics in Statistics Bayes Law Central Limit Theorem Law of Large Numbers Testing

Preliminary Statistics Lecture 2: Probability Theory (Outline) prelimsoas.webs.com

Preliminary Statistics Lecture 2: Probability Theory (Outline) prelimsoas.webs.com 1 School of Oriental and African Studies September 2015 Department of Economics Preliminary Statistics Lecture 2: Probability Theory (Outline) prelimsoas.webs.com Gujarati D. Basic Econometrics, Appendix

More information

Discrete Mathematics and Probability Theory Spring 2016 Rao and Walrand Note 14

Discrete Mathematics and Probability Theory Spring 2016 Rao and Walrand Note 14 CS 70 Discrete Mathematics and Probability Theory Spring 2016 Rao and Walrand Note 14 Introduction One of the key properties of coin flips is independence: if you flip a fair coin ten times and get ten

More information

Lecture 9: Naive Bayes, SVM, Kernels. Saravanan Thirumuruganathan

Lecture 9: Naive Bayes, SVM, Kernels. Saravanan Thirumuruganathan Lecture 9: Naive Bayes, SVM, Kernels Instructor: Outline 1 Probability basics 2 Probabilistic Interpretation of Classification 3 Bayesian Classifiers, Naive Bayes 4 Support Vector Machines Probability

More information

The probability of an event is viewed as a numerical measure of the chance that the event will occur.

The probability of an event is viewed as a numerical measure of the chance that the event will occur. Chapter 5 This chapter introduces probability to quantify randomness. Section 5.1: How Can Probability Quantify Randomness? The probability of an event is viewed as a numerical measure of the chance that

More information

PROBABILITY: AN INTERACTIVE REVIEW, II CS1951A INTRO TO DATA SCIENCE DAN POTTER

PROBABILITY: AN INTERACTIVE REVIEW, II CS1951A INTRO TO DATA SCIENCE DAN POTTER PROBABILITY: AN INTERACTIVE REVIEW, II CS1951A INTRO TO DATA SCIENCE DAN POTTER 1 CONTENT CREDIT Many of of these slides include content produced by Jeremy Orloff Jonathan Bloom MIT Course Number 18.05

More information

Business Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing

Business Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing Business Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing Agenda Introduction to Estimation Point estimation Interval estimation Introduction to Hypothesis Testing Concepts en terminology

More information

Econ 325: Introduction to Empirical Economics

Econ 325: Introduction to Empirical Economics Econ 325: Introduction to Empirical Economics Lecture 2 Probability Copyright 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 3-1 3.1 Definition Random Experiment a process leading to an uncertain

More information

I - Probability. What is Probability? the chance of an event occuring. 1classical probability. 2empirical probability. 3subjective probability

I - Probability. What is Probability? the chance of an event occuring. 1classical probability. 2empirical probability. 3subjective probability What is Probability? the chance of an event occuring eg 1classical probability 2empirical probability 3subjective probability Section 2 - Probability (1) Probability - Terminology random (probability)

More information

LECTURE 1. 1 Introduction. 1.1 Sample spaces and events

LECTURE 1. 1 Introduction. 1.1 Sample spaces and events LECTURE 1 1 Introduction The first part of our adventure is a highly selective review of probability theory, focusing especially on things that are most useful in statistics. 1.1 Sample spaces and events

More information

Announcements. Lecture 5: Probability. Dangling threads from last week: Mean vs. median. Dangling threads from last week: Sampling bias

Announcements. Lecture 5: Probability. Dangling threads from last week: Mean vs. median. Dangling threads from last week: Sampling bias Recap Announcements Lecture 5: Statistics 101 Mine Çetinkaya-Rundel September 13, 2011 HW1 due TA hours Thursday - Sunday 4pm - 9pm at Old Chem 211A If you added the class last week please make sure to

More information

Topic -2. Probability. Larson & Farber, Elementary Statistics: Picturing the World, 3e 1

Topic -2. Probability. Larson & Farber, Elementary Statistics: Picturing the World, 3e 1 Topic -2 Probability Larson & Farber, Elementary Statistics: Picturing the World, 3e 1 Probability Experiments Experiment : An experiment is an act that can be repeated under given condition. Rolling a

More information

Probability Rules. MATH 130, Elements of Statistics I. J. Robert Buchanan. Fall Department of Mathematics

Probability Rules. MATH 130, Elements of Statistics I. J. Robert Buchanan. Fall Department of Mathematics Probability Rules MATH 130, Elements of Statistics I J. Robert Buchanan Department of Mathematics Fall 2018 Introduction Probability is a measure of the likelihood of the occurrence of a certain behavior

More information

Lecture 1: Probability Fundamentals

Lecture 1: Probability Fundamentals Lecture 1: Probability Fundamentals IB Paper 7: Probability and Statistics Carl Edward Rasmussen Department of Engineering, University of Cambridge January 22nd, 2008 Rasmussen (CUED) Lecture 1: Probability

More information

18.05 Practice Final Exam

18.05 Practice Final Exam No calculators. 18.05 Practice Final Exam Number of problems 16 concept questions, 16 problems. Simplifying expressions Unless asked to explicitly, you don t need to simplify complicated expressions. For

More information

Be able to define the following terms and answer basic questions about them:

Be able to define the following terms and answer basic questions about them: CS440/ECE448 Section Q Fall 2017 Final Review Be able to define the following terms and answer basic questions about them: Probability o Random variables, axioms of probability o Joint, marginal, conditional

More information

PROBABILITY THEORY. Prof. S. J. Soni. Assistant Professor Computer Engg. Department SPCE, Visnagar

PROBABILITY THEORY. Prof. S. J. Soni. Assistant Professor Computer Engg. Department SPCE, Visnagar PROBABILITY THEORY By Prof. S. J. Soni Assistant Professor Computer Engg. Department SPCE, Visnagar Introduction Signals whose values at any instant t are determined by their analytical or graphical description

More information

2011 Pearson Education, Inc

2011 Pearson Education, Inc Statistics for Business and Economics Chapter 3 Probability Contents 1. Events, Sample Spaces, and Probability 2. Unions and Intersections 3. Complementary Events 4. The Additive Rule and Mutually Exclusive

More information

Probability and Probability Distributions. Dr. Mohammed Alahmed

Probability and Probability Distributions. Dr. Mohammed Alahmed Probability and Probability Distributions 1 Probability and Probability Distributions Usually we want to do more with data than just describing them! We might want to test certain specific inferences about

More information

Chapter 7: Section 7-1 Probability Theory and Counting Principles

Chapter 7: Section 7-1 Probability Theory and Counting Principles Chapter 7: Section 7-1 Probability Theory and Counting Principles D. S. Malik Creighton University, Omaha, NE D. S. Malik Creighton University, Omaha, NE Chapter () 7: Section 7-1 Probability Theory and

More information

Elementary Discrete Probability

Elementary Discrete Probability Elementary Discrete Probability MATH 472 Financial Mathematics J Robert Buchanan 2018 Objectives In this lesson we will learn: the terminology of elementary probability, elementary rules of probability,

More information

Chapter 7 Wednesday, May 26th

Chapter 7 Wednesday, May 26th Chapter 7 Wednesday, May 26 th Random event A random event is an event that the outcome is unpredictable. Example: There are 45 students in this class. What is the probability that if I select one student,

More information

6.4 Type I and Type II Errors

6.4 Type I and Type II Errors 6.4 Type I and Type II Errors Ulrich Hoensch Friday, March 22, 2013 Null and Alternative Hypothesis Neyman-Pearson Approach to Statistical Inference: A statistical test (also known as a hypothesis test)

More information

Introductory Econometrics. Review of statistics (Part II: Inference)

Introductory Econometrics. Review of statistics (Part II: Inference) Introductory Econometrics Review of statistics (Part II: Inference) Jun Ma School of Economics Renmin University of China October 1, 2018 1/16 Null and alternative hypotheses Usually, we have two competing

More information

Lecture Lecture 5

Lecture Lecture 5 Lecture 4 --- Lecture 5 A. Basic Concepts (4.1-4.2) 1. Experiment: A process of observing a phenomenon that has variation in its outcome. Examples: (E1). Rolling a die, (E2). Drawing a card form a shuffled

More information

Computational Perception. Bayesian Inference

Computational Perception. Bayesian Inference Computational Perception 15-485/785 January 24, 2008 Bayesian Inference The process of probabilistic inference 1. define model of problem 2. derive posterior distributions and estimators 3. estimate parameters

More information

Quantitative Introduction ro Risk and Uncertainty in Business Module 5: Hypothesis Testing

Quantitative Introduction ro Risk and Uncertainty in Business Module 5: Hypothesis Testing Quantitative Introduction ro Risk and Uncertainty in Business Module 5: Hypothesis Testing M. Vidyasagar Cecil & Ida Green Chair The University of Texas at Dallas Email: M.Vidyasagar@utdallas.edu October

More information

MAT 271E Probability and Statistics

MAT 271E Probability and Statistics MAT 271E Probability and Statistics Spring 2011 Instructor : Class Meets : Office Hours : Textbook : Supp. Text : İlker Bayram EEB 1103 ibayram@itu.edu.tr 13.30 16.30, Wednesday EEB? 10.00 12.00, Wednesday

More information

18.05 Final Exam. Good luck! Name. No calculators. Number of problems 16 concept questions, 16 problems, 21 pages

18.05 Final Exam. Good luck! Name. No calculators. Number of problems 16 concept questions, 16 problems, 21 pages Name No calculators. 18.05 Final Exam Number of problems 16 concept questions, 16 problems, 21 pages Extra paper If you need more space we will provide some blank paper. Indicate clearly that your solution

More information

Probability Year 10. Terminology

Probability Year 10. Terminology Probability Year 10 Terminology Probability measures the chance something happens. Formally, we say it measures how likely is the outcome of an event. We write P(result) as a shorthand. An event is some

More information

Deep Learning for Computer Vision

Deep Learning for Computer Vision Deep Learning for Computer Vision Lecture 3: Probability, Bayes Theorem, and Bayes Classification Peter Belhumeur Computer Science Columbia University Probability Should you play this game? Game: A fair

More information

Econ 325: Introduction to Empirical Economics

Econ 325: Introduction to Empirical Economics Econ 325: Introduction to Empirical Economics Chapter 9 Hypothesis Testing: Single Population Ch. 9-1 9.1 What is a Hypothesis? A hypothesis is a claim (assumption) about a population parameter: population

More information

Review. December 4 th, Review

Review. December 4 th, Review December 4 th, 2017 Att. Final exam: Course evaluation Friday, 12/14/2018, 10:30am 12:30pm Gore Hall 115 Overview Week 2 Week 4 Week 7 Week 10 Week 12 Chapter 6: Statistics and Sampling Distributions Chapter

More information

2. AXIOMATIC PROBABILITY

2. AXIOMATIC PROBABILITY IA Probability Lent Term 2. AXIOMATIC PROBABILITY 2. The axioms The formulation for classical probability in which all outcomes or points in the sample space are equally likely is too restrictive to develop

More information

Course: ESO-209 Home Work: 1 Instructor: Debasis Kundu

Course: ESO-209 Home Work: 1 Instructor: Debasis Kundu Home Work: 1 1. Describe the sample space when a coin is tossed (a) once, (b) three times, (c) n times, (d) an infinite number of times. 2. A coin is tossed until for the first time the same result appear

More information

With Question/Answer Animations. Chapter 7

With Question/Answer Animations. Chapter 7 With Question/Answer Animations Chapter 7 Chapter Summary Introduction to Discrete Probability Probability Theory Bayes Theorem Section 7.1 Section Summary Finite Probability Probabilities of Complements

More information

Probability 5-4 The Multiplication Rules and Conditional Probability

Probability 5-4 The Multiplication Rules and Conditional Probability Outline Lecture 8 5-1 Introduction 5-2 Sample Spaces and 5-3 The Addition Rules for 5-4 The Multiplication Rules and Conditional 5-11 Introduction 5-11 Introduction as a general concept can be defined

More information

Frequentist Statistics and Hypothesis Testing Spring

Frequentist Statistics and Hypothesis Testing Spring Frequentist Statistics and Hypothesis Testing 18.05 Spring 2018 http://xkcd.com/539/ Agenda Introduction to the frequentist way of life. What is a statistic? NHST ingredients; rejection regions Simple

More information

Presentation on Theo e ry r y o f P r P o r bab a il i i l t i y

Presentation on Theo e ry r y o f P r P o r bab a il i i l t i y Presentation on Theory of Probability Meaning of Probability: Chance of occurrence of any event In practical life we come across situation where the result are uncertain Theory of probability was originated

More information

Null Hypothesis Significance Testing p-values, significance level, power, t-tests Spring 2017

Null Hypothesis Significance Testing p-values, significance level, power, t-tests Spring 2017 Null Hypothesis Significance Testing p-values, significance level, power, t-tests 18.05 Spring 2017 Understand this figure f(x H 0 ) x reject H 0 don t reject H 0 reject H 0 x = test statistic f (x H 0

More information

Entropy. Probability and Computing. Presentation 22. Probability and Computing Presentation 22 Entropy 1/39

Entropy. Probability and Computing. Presentation 22. Probability and Computing Presentation 22 Entropy 1/39 Entropy Probability and Computing Presentation 22 Probability and Computing Presentation 22 Entropy 1/39 Introduction Why randomness and information are related? An event that is almost certain to occur

More information

6.2 Introduction to Probability. The Deal. Possible outcomes: STAT1010 Intro to probability. Definitions. Terms: What are the chances of?

6.2 Introduction to Probability. The Deal. Possible outcomes: STAT1010 Intro to probability. Definitions. Terms: What are the chances of? 6.2 Introduction to Probability Terms: What are the chances of?! Personal probability (subjective) " Based on feeling or opinion. " Gut reaction.! Empirical probability (evidence based) " Based on experience

More information

What is Probability? Probability. Sample Spaces and Events. Simple Event

What is Probability? Probability. Sample Spaces and Events. Simple Event What is Probability? Probability Peter Lo Probability is the numerical measure of likelihood that the event will occur. Simple Event Joint Event Compound Event Lies between 0 & 1 Sum of events is 1 1.5

More information

ORF 245 Fundamentals of Statistics Chapter 9 Hypothesis Testing

ORF 245 Fundamentals of Statistics Chapter 9 Hypothesis Testing ORF 245 Fundamentals of Statistics Chapter 9 Hypothesis Testing Robert Vanderbei Fall 2014 Slides last edited on November 24, 2014 http://www.princeton.edu/ rvdb Coin Tossing Example Consider two coins.

More information

Monty Hall Puzzle. Draw a tree diagram of possible choices (a possibility tree ) One for each strategy switch or no-switch

Monty Hall Puzzle. Draw a tree diagram of possible choices (a possibility tree ) One for each strategy switch or no-switch Monty Hall Puzzle Example: You are asked to select one of the three doors to open. There is a large prize behind one of the doors and if you select that door, you win the prize. After you select a door,

More information

Mutually Exclusive Events

Mutually Exclusive Events 172 CHAPTER 3 PROBABILITY TOPICS c. QS, 7D, 6D, KS Mutually Exclusive Events A and B are mutually exclusive events if they cannot occur at the same time. This means that A and B do not share any outcomes

More information

Event A: at least one tail observed A:

Event A: at least one tail observed A: Chapter 3 Probability 3.1 Events, sample space, and probability Basic definitions: An is an act of observation that leads to a single outcome that cannot be predicted with certainty. A (or simple event)

More information

HST.582J / 6.555J / J Biomedical Signal and Image Processing Spring 2007

HST.582J / 6.555J / J Biomedical Signal and Image Processing Spring 2007 MIT OpenCourseWare http://ocw.mit.edu HST.582J / 6.555J / 16.456J Biomedical Signal and Image Processing Spring 2007 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms.

More information

Example. χ 2 = Continued on the next page. All cells

Example. χ 2 = Continued on the next page. All cells Section 11.1 Chi Square Statistic k Categories 1 st 2 nd 3 rd k th Total Observed Frequencies O 1 O 2 O 3 O k n Expected Frequencies E 1 E 2 E 3 E k n O 1 + O 2 + O 3 + + O k = n E 1 + E 2 + E 3 + + E

More information

2. A Basic Statistical Toolbox

2. A Basic Statistical Toolbox . A Basic Statistical Toolbo Statistics is a mathematical science pertaining to the collection, analysis, interpretation, and presentation of data. Wikipedia definition Mathematical statistics: concerned

More information

4.2 Probability Models

4.2 Probability Models 4.2 Probability Models Ulrich Hoensch Tuesday, February 19, 2013 Sample Spaces Examples 1. When tossing a coin, the sample space is S = {H, T }, where H = heads, T = tails. 2. When randomly selecting a

More information

Exam 2 Practice Questions, 18.05, Spring 2014

Exam 2 Practice Questions, 18.05, Spring 2014 Exam 2 Practice Questions, 18.05, Spring 2014 Note: This is a set of practice problems for exam 2. The actual exam will be much shorter. Within each section we ve arranged the problems roughly in order

More information

4. Conditional Probability P( ) CSE 312 Autumn 2012 W.L. Ruzzo

4. Conditional Probability P( ) CSE 312 Autumn 2012 W.L. Ruzzo 4. Conditional Probability P( ) CSE 312 Autumn 2012 W.L. Ruzzo 1 conditional probability Conditional probability of E given F: probability that E occurs given that F has occurred. Conditioning on F S Written

More information

Discrete Mathematics and Probability Theory Spring 2014 Anant Sahai Note 10

Discrete Mathematics and Probability Theory Spring 2014 Anant Sahai Note 10 EECS 70 Discrete Mathematics and Probability Theory Spring 2014 Anant Sahai Note 10 Introduction to Basic Discrete Probability In the last note we considered the probabilistic experiment where we flipped

More information

Hypothesis Testing. ) the hypothesis that suggests no change from previous experience

Hypothesis Testing. ) the hypothesis that suggests no change from previous experience Hypothesis Testing Definitions Hypothesis a claim about something Null hypothesis ( H 0 ) the hypothesis that suggests no change from previous experience Alternative hypothesis ( H 1 ) the hypothesis that

More information

Discrete Structures for Computer Science

Discrete Structures for Computer Science Discrete Structures for Computer Science William Garrison bill@cs.pitt.edu 6311 Sennott Square Lecture #24: Probability Theory Based on materials developed by Dr. Adam Lee Not all events are equally likely

More information

Probability and Statistics. Terms and concepts

Probability and Statistics. Terms and concepts Probability and Statistics Joyeeta Dutta Moscato June 30, 2014 Terms and concepts Sample vs population Central tendency: Mean, median, mode Variance, standard deviation Normal distribution Cumulative distribution

More information

Statistical Preliminaries. Stony Brook University CSE545, Fall 2016

Statistical Preliminaries. Stony Brook University CSE545, Fall 2016 Statistical Preliminaries Stony Brook University CSE545, Fall 2016 Random Variables X: A mapping from Ω to R that describes the question we care about in practice. 2 Random Variables X: A mapping from

More information

Chap 4 Probability p227 The probability of any outcome in a random phenomenon is the proportion of times the outcome would occur in a long series of

Chap 4 Probability p227 The probability of any outcome in a random phenomenon is the proportion of times the outcome would occur in a long series of Chap 4 Probability p227 The probability of any outcome in a random phenomenon is the proportion of times the outcome would occur in a long series of repetitions. (p229) That is, probability is a long-term

More information

The enumeration of all possible outcomes of an experiment is called the sample space, denoted S. E.g.: S={head, tail}

The enumeration of all possible outcomes of an experiment is called the sample space, denoted S. E.g.: S={head, tail} Random Experiment In random experiments, the result is unpredictable, unknown prior to its conduct, and can be one of several choices. Examples: The Experiment of tossing a coin (head, tail) The Experiment

More information

Probability the chance that an uncertain event will occur (always between 0 and 1)

Probability the chance that an uncertain event will occur (always between 0 and 1) Quantitative Methods 2013 1 Probability as a Numerical Measure of the Likelihood of Occurrence Probability the chance that an uncertain event will occur (always between 0 and 1) Increasing Likelihood of

More information

2.4. Conditional Probability

2.4. Conditional Probability 2.4. Conditional Probability Objectives. Definition of conditional probability and multiplication rule Total probability Bayes Theorem Example 2.4.1. (#46 p.80 textbook) Suppose an individual is randomly

More information

Probability Theory and Applications

Probability Theory and Applications Probability Theory and Applications Videos of the topics covered in this manual are available at the following links: Lesson 4 Probability I http://faculty.citadel.edu/silver/ba205/online course/lesson

More information

Probability Notes (A) , Fall 2010

Probability Notes (A) , Fall 2010 Probability Notes (A) 18.310, Fall 2010 We are going to be spending around four lectures on probability theory this year. These notes cover approximately the first three lectures on it. Probability theory

More information

Topic 5 Basics of Probability

Topic 5 Basics of Probability Topic 5 Basics of Probability Equally Likely Outcomes and the Axioms of Probability 1 / 13 Outline Equally Likely Outcomes Axioms of Probability Consequences of the Axioms 2 / 13 Introduction A probability

More information

Formalizing Probability. Choosing the Sample Space. Probability Measures

Formalizing Probability. Choosing the Sample Space. Probability Measures Formalizing Probability Choosing the Sample Space What do we assign probability to? Intuitively, we assign them to possible events (things that might happen, outcomes of an experiment) Formally, we take

More information

Conditional Probability. CS231 Dianna Xu

Conditional Probability. CS231 Dianna Xu Conditional Probability CS231 Dianna Xu 1 Boy or Girl? A couple has two children, one of them is a girl. What is the probability that the other one is also a girl? Assuming 50/50 chances of conceiving

More information

CSC Discrete Math I, Spring Discrete Probability

CSC Discrete Math I, Spring Discrete Probability CSC 125 - Discrete Math I, Spring 2017 Discrete Probability Probability of an Event Pierre-Simon Laplace s classical theory of probability: Definition of terms: An experiment is a procedure that yields

More information

COVENANT UNIVERSITY NIGERIA TUTORIAL KIT OMEGA SEMESTER PROGRAMME: ECONOMICS

COVENANT UNIVERSITY NIGERIA TUTORIAL KIT OMEGA SEMESTER PROGRAMME: ECONOMICS COVENANT UNIVERSITY NIGERIA TUTORIAL KIT OMEGA SEMESTER PROGRAMME: ECONOMICS COURSE: CBS 221 DISCLAIMER The contents of this document are intended for practice and leaning purposes at the undergraduate

More information

MAT 271E Probability and Statistics

MAT 271E Probability and Statistics MAT 7E Probability and Statistics Spring 6 Instructor : Class Meets : Office Hours : Textbook : İlker Bayram EEB 3 ibayram@itu.edu.tr 3.3 6.3, Wednesday EEB 6.., Monday D. B. Bertsekas, J. N. Tsitsiklis,

More information

Statistics for Managers Using Microsoft Excel (3 rd Edition)

Statistics for Managers Using Microsoft Excel (3 rd Edition) Statistics for Managers Using Microsoft Excel (3 rd Edition) Chapter 4 Basic Probability and Discrete Probability Distributions 2002 Prentice-Hall, Inc. Chap 4-1 Chapter Topics Basic probability concepts

More information

Probability Theory and Simulation Methods

Probability Theory and Simulation Methods Feb 28th, 2018 Lecture 10: Random variables Countdown to midterm (March 21st): 28 days Week 1 Chapter 1: Axioms of probability Week 2 Chapter 3: Conditional probability and independence Week 4 Chapters

More information

Statistical Methods for the Social Sciences, Autumn 2012

Statistical Methods for the Social Sciences, Autumn 2012 Statistical Methods for the Social Sciences, Autumn 2012 Review Session 3: Probability. Exercises Ch.4. More on Stata TA: Anastasia Aladysheva anastasia.aladysheva@graduateinstitute.ch Office hours: Mon

More information

Today. Statistical Learning. Coin Flip. Coin Flip. Experiment 1: Heads. Experiment 1: Heads. Which coin will I use? Which coin will I use?

Today. Statistical Learning. Coin Flip. Coin Flip. Experiment 1: Heads. Experiment 1: Heads. Which coin will I use? Which coin will I use? Today Statistical Learning Parameter Estimation: Maximum Likelihood (ML) Maximum A Posteriori (MAP) Bayesian Continuous case Learning Parameters for a Bayesian Network Naive Bayes Maximum Likelihood estimates

More information

Contents. Decision Making under Uncertainty 1. Meanings of uncertainty. Classical interpretation

Contents. Decision Making under Uncertainty 1. Meanings of uncertainty. Classical interpretation Contents Decision Making under Uncertainty 1 elearning resources Prof. Ahti Salo Helsinki University of Technology http://www.dm.hut.fi Meanings of uncertainty Interpretations of probability Biases in

More information

Lecture notes for probability. Math 124

Lecture notes for probability. Math 124 Lecture notes for probability Math 124 What is probability? Probabilities are ratios, expressed as fractions, decimals, or percents, determined by considering results or outcomes of experiments whose result

More information

Bayesian Inference and MCMC

Bayesian Inference and MCMC Bayesian Inference and MCMC Aryan Arbabi Partly based on MCMC slides from CSC412 Fall 2018 1 / 18 Bayesian Inference - Motivation Consider we have a data set D = {x 1,..., x n }. E.g each x i can be the

More information

MAE 493G, CpE 493M, Mobile Robotics. 6. Basic Probability

MAE 493G, CpE 493M, Mobile Robotics. 6. Basic Probability MAE 493G, CpE 493M, Mobile Robotics 6. Basic Probability Instructor: Yu Gu, Fall 2013 Uncertainties in Robotics Robot environments are inherently unpredictable; Sensors and data acquisition systems are

More information

L2: Review of probability and statistics

L2: Review of probability and statistics Probability L2: Review of probability and statistics Definition of probability Axioms and properties Conditional probability Bayes theorem Random variables Definition of a random variable Cumulative distribution

More information

Lecture 6 Probability

Lecture 6 Probability Lecture 6 Probability Example: When you toss a coin, there are only two possible outcomes, heads and tails. What if we toss a coin 4 times? Figure below shows the results of tossing a coin 5000 times twice.

More information

STAT200 Elementary Statistics for applications

STAT200 Elementary Statistics for applications STAT200 Elementary Statistics for applications Lecture # 12 Dr. Ruben Zamar Winter 2011 / 2012 http://www4.agr.gc.ca/aafc-aac/display-afficher.do?id=1256763623482 Randomness Randomness is unpredictable

More information

Lecture 10: Probability distributions TUESDAY, FEBRUARY 19, 2019

Lecture 10: Probability distributions TUESDAY, FEBRUARY 19, 2019 Lecture 10: Probability distributions DANIEL WELLER TUESDAY, FEBRUARY 19, 2019 Agenda What is probability? (again) Describing probabilities (distributions) Understanding probabilities (expectation) Partial

More information

MATH 19B FINAL EXAM PROBABILITY REVIEW PROBLEMS SPRING, 2010

MATH 19B FINAL EXAM PROBABILITY REVIEW PROBLEMS SPRING, 2010 MATH 9B FINAL EXAM PROBABILITY REVIEW PROBLEMS SPRING, 00 This handout is meant to provide a collection of exercises that use the material from the probability and statistics portion of the course The

More information

Business Statistics: A Decision-Making Approach, 6e. Chapter Goals

Business Statistics: A Decision-Making Approach, 6e. Chapter Goals Chapter 4 Student Lecture Notes 4-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter 4 Using Probability and Probability Distributions Fundamentals of Business Statistics Murali Shanker

More information

Introduction to probability theory

Introduction to probability theory Introduction to probability theory Fátima Sánchez Cabo Institute for Genomics and Bioinformatics, TUGraz f.sanchezcabo@tugraz.at 07/03/2007 - p. 1/35 Outline Random and conditional probability (7 March)

More information

Chapter 7: Hypothesis Testing

Chapter 7: Hypothesis Testing Chapter 7: Hypothesis Testing *Mathematical statistics with applications; Elsevier Academic Press, 2009 The elements of a statistical hypothesis 1. The null hypothesis, denoted by H 0, is usually the nullification

More information

Naïve Bayes classification

Naïve Bayes classification Naïve Bayes classification 1 Probability theory Random variable: a variable whose possible values are numerical outcomes of a random phenomenon. Examples: A person s height, the outcome of a coin toss

More information

(a) Fill in the missing probabilities in the table. (b) Calculate P(F G). (c) Calculate P(E c ). (d) Is this a uniform sample space?

(a) Fill in the missing probabilities in the table. (b) Calculate P(F G). (c) Calculate P(E c ). (d) Is this a uniform sample space? Math 166 Exam 1 Review Sections L.1-L.2, 1.1-1.7 Note: This review is more heavily weighted on the new material this week: Sections 1.5-1.7. For more practice problems on previous material, take a look

More information

Topic 2 Probability. Basic probability Conditional probability and independence Bayes rule Basic reliability

Topic 2 Probability. Basic probability Conditional probability and independence Bayes rule Basic reliability Topic 2 Probability Basic probability Conditional probability and independence Bayes rule Basic reliability Random process: a process whose outcome can not be predicted with certainty Examples: rolling

More information

Probability Experiments, Trials, Outcomes, Sample Spaces Example 1 Example 2

Probability Experiments, Trials, Outcomes, Sample Spaces Example 1 Example 2 Probability Probability is the study of uncertain events or outcomes. Games of chance that involve rolling dice or dealing cards are one obvious area of application. However, probability models underlie

More information

Probability theory basics

Probability theory basics Probability theory basics Michael Franke Basics of probability theory: axiomatic definition, interpretation, joint distributions, marginalization, conditional probability & Bayes rule. Random variables:

More information

Mathematical Foundations of Computer Science Lecture Outline October 18, 2018

Mathematical Foundations of Computer Science Lecture Outline October 18, 2018 Mathematical Foundations of Computer Science Lecture Outline October 18, 2018 The Total Probability Theorem. Consider events E and F. Consider a sample point ω E. Observe that ω belongs to either F or

More information

Statistical Theory 1

Statistical Theory 1 Statistical Theory 1 Set Theory and Probability Paolo Bautista September 12, 2017 Set Theory We start by defining terms in Set Theory which will be used in the following sections. Definition 1 A set is

More information

Introduction: MLE, MAP, Bayesian reasoning (28/8/13)

Introduction: MLE, MAP, Bayesian reasoning (28/8/13) STA561: Probabilistic machine learning Introduction: MLE, MAP, Bayesian reasoning (28/8/13) Lecturer: Barbara Engelhardt Scribes: K. Ulrich, J. Subramanian, N. Raval, J. O Hollaren 1 Classifiers In this

More information

Answers and expectations

Answers and expectations Answers and expectations For a function f(x) and distribution P(x), the expectation of f with respect to P is The expectation is the average of f, when x is drawn from the probability distribution P E

More information

Statistical Inference: Uses, Abuses, and Misconceptions

Statistical Inference: Uses, Abuses, and Misconceptions Statistical Inference: Uses, Abuses, and Misconceptions Michael W. Trosset Indiana Statistical Consulting Center Department of Statistics ISCC is part of IU s Department of Statistics, chaired by Stanley

More information

STAT 516: Basic Probability and its Applications

STAT 516: Basic Probability and its Applications Lecture 3: Conditional Probability and Independence Prof. Michael September 29, 2015 Motivating Example Experiment ξ consists of rolling a fair die twice; A = { the first roll is 6 } amd B = { the sum

More information

Business Statistics. Lecture 3: Random Variables and the Normal Distribution

Business Statistics. Lecture 3: Random Variables and the Normal Distribution Business Statistics Lecture 3: Random Variables and the Normal Distribution 1 Goals for this Lecture A little bit of probability Random variables The normal distribution 2 Probability vs. Statistics Probability:

More information

Lecture Testing Hypotheses: The Neyman-Pearson Paradigm

Lecture Testing Hypotheses: The Neyman-Pearson Paradigm Math 408 - Mathematical Statistics Lecture 29-30. Testing Hypotheses: The Neyman-Pearson Paradigm April 12-15, 2013 Konstantin Zuev (USC) Math 408, Lecture 29-30 April 12-15, 2013 1 / 12 Agenda Example:

More information

Classical and Bayesian inference

Classical and Bayesian inference Classical and Bayesian inference AMS 132 Claudia Wehrhahn (UCSC) Classical and Bayesian inference January 8 1 / 11 The Prior Distribution Definition Suppose that one has a statistical model with parameter

More information

CS 361: Probability & Statistics

CS 361: Probability & Statistics February 12, 2018 CS 361: Probability & Statistics Random Variables Monty hall problem Recall the setup, there are 3 doors, behind two of them are indistinguishable goats, behind one is a car. You pick

More information