Data Analysis and Statistical Methods Statistics 651

Size: px
Start display at page:

Download "Data Analysis and Statistical Methods Statistics 651"

Transcription

1 Data Analysis and Statistical Methods Statistics Lecture 5 (MWF) Probabilities and the rules Suhasini Subba Rao

2 Review of previous lecture We looked at the interquartile range. The concept of a variance and standard deviation was introduced. The standard deviation is the square root of the variance. Like the mean, the standard deviation (which is the square root of the variance) is a measure of spread of the population, and is not random. On the other hand, the sample variance and standard deviation are random. In statistics we have: Populations, Random variables (the measurements we are interested in the population) and Probabilities which are allocated 1

3 to the random variables. In this lecture we will introduce the notion of a probability and the rules that go with it. 2

4 Probability Suppose there is a population of people and I select one person at random. What is the chance (probability) that the selected person s height lies in the interval [5.5, 5.75]feet? How do we calculate this chance? The probability that the randomly selected person s height lies in the interval [5.5, 5.75] is: Total number of people in population with height in the interval [5.5, 5.75]. Total number of people in population 3

5 Probability and random variables A probability is always positive and is greater than or equal to zero and less than or equal to one (lies in the interval [0, 1]). Random variables and probabilities Recall a random variable X denotes a measurement on a randomly selected individual (it can be the height of a randomly selected student or the gender of a random selected person (for the sake of simplicity we will assume either male or female)). A random variable can take any one of several different outcomes, to each outcome we allocate a probability. We now consider some examples to familiarize ourselves with this notation. 4

6 Examples of random variables and their probabilities For example if X is the gender of a randomly chosen in the world population. Then clearly X = {Male or Female}. We write P (X =Female) to denote the probability a random chosen person is female. Similar X can denote the height of a randomly chosen person. Then P (5 < X < 5.25), denotes the probability of that a randomly selected students height is in the interval [5, 5.25]. 5

7 Illustration:Data from a 651 class Summary of information from a 651 class. 6

8 Probabilities and random variables: Illustration 1 (i) The measurement of interest may be the height of the randomly chosen person. Therefore X = height of randomly selected person. The set of all possible outcomes is any number in the interval [5, 7]. Therefore, P (5 X 7) = 1. P (5.75 X < 6.25) = P (5.75 X < 6) + P (6 X < 6.25) = It is impossible from this plot to evaluate P (5.9 X < 6.15). (ii) The measurement of interest may be the gender of the randomly chosen person. Therefore Y = gender of randomly chosen person. The set of all possible outcomes is {male, female}. To each outcome we assign a probability. Since {male, female} is the set of all outcomes (there are no other), we have that P (Y = male or female) =1 and P (Y = Male) =

9 Mutually exclusive events Two events (outcomes) are mutually exclusive, if the occurrence of one event excludes the occurrence of the other event. In the height example, the interval A = [5, 5.5) is an (this is the event that a randomly chosen person s height lies in the interval A = [5, 5.5)). Similarly B = [5.75, 6) is an event. A = [5, 5.5) and B = [5.75, 6) are mutually exclusive events. This means if the randomly chosen person s height lies in the interval A = [5, 5.5), then their height cannot lie in the interval B = [5.75, 6]. Similarly if we know that X (the height of a randomly chosen person) lies in B = [5.75, 6) then it cannot lie in A = [5, 5.5). On the other hand, the events A = [5, 5.5) and C = [5.25, 5.75) are not mutually exclusive. If X = 5.3, then it lies in both events A and C. 8

10 Suppose X is the time of the day I wake up. Let A=night time and B=day time, then A and B are mutually exclusive events. In anyone day, I cannot get up at both night and day time. 9

11 Mutually exclusive events and probabilities Benefits of mutually exclusive events If two events A and B are mutually exclusive, then P (A or B) = P (A) + P (B) (often P (A or B) = P (A B)). Height Example Suppose X is the height of a randomly chosen person. If A = [5, 5.5) and B = [5.75, 6), then P (X in A or X in B) = P (5 X < 5.5 or 5.75 X < 6). Since A and B are mutually exclusive then P (X in A or X in B) = P (5 X < 5.5 or 5.75 X < 6) = P (5 X < 5.5) + P (5.75 X < 6). Plot these two events on a density plot. 10

12 Example 2 Suppose that X is the height of a randomly chosen person. Suppose that the probability P (X t) is known for every single number t. That is I know P (X 4), P (X 4.5) P (X 6) etc. Using this information, how can I evaluate: (i) P (4 < X 7). (ii) P (X 8) (iii) P (4 < X 7 or 8 < X 9) (iv) P (4 < X 7 or 6 < X 8) Solutions in probabilities_lecture5.pdf 11

13 Reasons for the above exercise Soon you will need to understand how to calculate the chance of a certain event happening (such as the sample mean being less than a certain value). This will require you to use the normal tables and look up probabilites and the above will be useful. It will give you some idea how percentiles are calculated. Let us return to the doctor office example. Suppose your blood level lies between 13-15, then given the chances the distribution of blood levels (based on) P (X t), you can calculate the chance P (13 < X 15). 12

14 Motivation: Conditional probabilities Is there a relationship between binge drinking (excessive consumption of alcohol) and gender students were surveyed (7180 males and 9916 females). The data is given below. Male Female Binge Not Binge In order to analyze this type of data we need to introduce the notion of a conditional probability. This is a probability calculated using not the entire population but the conditioned subpopulation. 13

15 Motivation: cont A conditional probability is the probability the probability of an event given that we already have some (possibly partial) information about. Male Female Binge Not Binge Subtotals The probability of binge drinking given that you know person is female is 1684/ %, whereas the probability of binge drinking given that you know person is male is 1630/ %. We can write these conditional probabilities as P(binge drink Male) = and P(binge drink Female) = Based on the above information, does gender change the chance of 14

16 binge drinking? In other words does information about the gender of the person change the chance of bring drinking. This example alludes to the notion of independence, which we discuss at the end of this lecture. 15

17 Conditional probability Example Suppose X is the height of a randomly selected person and we know that X lies in the interval [5, 6], then what is the probability X lies in [5, 5.5]? This probability is Number of people whose height is in [5, 5.5] Number of people whose height is in [5, 6]. We write this probability as: the probability that X lies in [5, 5.5] given that X lies in [5, 6]. Using probability notation this is written as: P (5 X X 6) = P (5 X 5.5 }{{} 5 X 6) denotes given 16

18 Compare this with the the probability a random person lies in the interval [5, 5.5], written as P (5 X 5.5). In this case P (5 X 5.5) will be smaller than P (5 X X 6). What is P (5 X 5.5 X 6)? 17

19 Conditional probability: Heights and gender of 18 people Height Gender F M F M M F F M F Height Gender M M F M F M M F F Suppose we randomly select a person and let X denote their height and Y their gender. Calculate (i) P (X 5.5). (ii) P (X 5.5 Y = M). (iii) P (X 5.5 Y = F ). What do we observe? Is there a difference in the distribution of male and female heights. 18

20 Example - heights and gender (general) Suppose we randomly choose a person, and let X denote their height and Y their gender. Then, P (5 X 5.5 Y = female), is the probability that a randomly person s height lies in interval [5, 5.5] given that we know the person is female. Clearly P (5 X 5.5 Y = female) P (5 X 5.5) and P (5 X 5.5 Y = female) P (5 X 5.5 Y = male). Question In general, which do you think is the larger probability; P (5 X 5.5 Y = female) or P (5 X 5.5 Y = male). Answer Since females, in general, tends to be smaller than males, and [5, 5.5] feet is quite small. It is more likely that a female should have a height in the interval in [5, 5.5] than a male to have a height in [5, 5.5]. 19

21 This is saying that the probability a randomly chosen person s height lies in [5, 5.5] given that the person is female is greater than the probability a randomly chosen persons height lies in [5, 5.5] given that the person is male. In more succinct notation, this is saying that P (5 X 5.5 Y = female) > P (5 X 5.5 Y = male). Suppose that X denotes the height of a randomly chosen person in the 651 data and Y denotes their gender. How to evaluate: (i) P (5 X 5.5) (ii) P (5 X 5.5 Y = female) (iii) P (5 X 5.5 Y = male)? We will not be using conditional probabilities much in this course. 20

22 However, the idea is important to understand. This is because when we do statistical testing we will use the idea of probability of observing some event given that A is true. 21

23 Independence and conditional probabilities Definition Suppose that we have two events A and B. The events A and B are independent of each other if P (A B) = P (A). This means the event B has no influence what so ever on the chance of event A occurring. Let us return to the example of heights and gender. Do you think height and gender are independent? In other words, does information about the gender of a person play no role in the chance of their being a certain height. Intuitively this seems not be the case. We showed this in the above example. Where we showed that P (5 X 5.5 Y = female) > P (5 X 5.5 Y = male). Which means that P (5 X 5.5 Y = female) P (5 X 5.5) Hence gender and height are not independent. 22

24 Random variables X and Y are independent if for any outcome of X and and outcome of Y, P (X = event A Y = event B) = P (X = event A). Independent and mutually exclusive (they not the same) Example If A is the event a height is in the interval [5, 5.5] and B is the event a height is in the interval [6, 6.5], then P (X in [5, 5.5] X in [6, 6.5]) = 0 P (X in [5, 5.5]). A and B are not independent events. 23

25 Example: Binge drinking We recall that data set: Male Female Binge Not Binge Subtotals We recall the probability of binge drinking given that you know the person is female = P(binge drink Female) = 0.17, whereas the probability of binge drinking given that you know the person is male = P(binge drink Male) = As these conditional probabilities are different, there is a dependence/association between gender and binge drinking. 24

26 Examples: Independent events Define the random variables. X is the season in college station. Y are the number of hair dressers in college station. Z is the temperature in college station. It is unlikely that the number of hair dressers in college station have an influence on the temperature. Therefore P (Z [25, 28] Y ) = P (X [25, 28]). 25

27 On the other hand season does have an unfluence on temperature, therefore P (X [25, 28] X = summer ) P (X [25, 28]) or equivalently, P (X [25, 28] and Z = summer ) P (X [25, 28])P (Z = summer). We will understand the above statement in lecture 6. 26

28 Dependence does not imply causality A study was done in the early 90s to see if the mortality rates between left and right handed people were the same. To do this the psychologists collected the death records of 2000 people who died in May, 1990, in Southern California. They rang the families of all these people and asked whether the were left or right handed. They also categorized the people who died into those above 60 and below 60 years old. This is the data they collected. Out of the 2000, 400 were left handed. Out of the 2000, 150 were left handed and died below 60. Out of the 2000, 300 were right handed and died below

29 (a) A person dies below 60? Lecture 5 (MWF) Random variables, Probability and conditional probabilities (b) A person dies below 60 given that they are left handed? (c) Does the data suggest there is an association/dependence between left handedess and early mortality? 28

30 Solution It instructive to summarize the data as a contingency table: Died before 60 Died after 60 totals Left Right Totals (a) The chance a person dies below 60. Calculate the total proportion who died before 60 P (before 60) = (b) A person dies below 60 given that they are left handed. 29

31 Focus on people who are only left handed. There are 400 of these. Of these, 150 died before 60. Therefore the proportion of left handed people in the sample who died before 60 is P (before 60 left handed) = = 3 8 = 37.5% Using the same argument the proportion of right handed people in the sample who died before 60 is P (before 60 right handed) = = 3 16 = 18.75%. (c) As these proportions are different, it suggests there is some sort of dependence/association between lefthandedness and early mortality. 30

32 Why the dependence? Can we conclude from this data that being left handed increases the chance of early mortality? This statement is a causal statement, it asks whether being left handed causes early mortality. This data set cannot answer this question, does not take into account other variables associated with being left handed which are causing the effect you are seeing. The most plausible explanation for the difference in proportions is that the incidence of left-handedness has increased over the past 50 years (because people are no longer forced to be right handed). The unobserved variable is 31

33 Example: Before 1930, no one was left handed, what would the numbers in the table and proportions be? 32

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 Data Analysis and Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasini/teaching.html https://www.openintro.org/stat/textbook.php?stat_book=os (Chapter 2) Lecture 5 (MWF) Probabilities

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 Data Analysis and Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasini/teaching.html Lecture 6 (MWF) Conditional probabilities and associations Suhasini Subba Rao Review of previous lecture

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 Data Analysis and Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasini/teaching.html Lecture 31 (MWF) Review of test for independence and starting with linear regression Suhasini Subba

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 Data Analysis and Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasini/teaching.html Boxplots and standard deviations Suhasini Subba Rao Review of previous lecture In the previous lecture

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 Data Analysis and Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasini/teaching/ Suhasini Subba Rao Review In the previous lecture we looked at the statistics of M&Ms. This example illustrates

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 Data Analysis and Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasini/teaching.html Lecture 9 (MWF) Calculations for the normal distribution Suhasini Subba Rao Evaluating probabilities

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 y 1 2 3 4 5 6 7 x Data Analysis and Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasini/teaching.html Lecture 32 Suhasini Subba Rao Previous lecture We are interested in whether a dependent

More information

Conditional Probability Solutions STAT-UB.0103 Statistics for Business Control and Regression Models

Conditional Probability Solutions STAT-UB.0103 Statistics for Business Control and Regression Models Conditional Probability Solutions STAT-UB.0103 Statistics for Business Control and Regression Models Counting (Review) 1. There are 10 people in a club. How many ways are there to choose the following:

More information

ECON Semester 1 PASS Mock Mid-Semester Exam ANSWERS

ECON Semester 1 PASS Mock Mid-Semester Exam ANSWERS ECON1310 2006 Semester 1 PASS Mock Mid-Semester Exam ANSWERS MULTIPLE CHOICE QUESTIONS 1. Unemployment rates are an example of: a. Cross-sectional, quantitative, continuous data b. Time-series, quantitative,

More information

Lecture 1: Description of Data. Readings: Sections 1.2,

Lecture 1: Description of Data. Readings: Sections 1.2, Lecture 1: Description of Data Readings: Sections 1.,.1-.3 1 Variable Example 1 a. Write two complete and grammatically correct sentences, explaining your primary reason for taking this course and then

More information

DE CHAZAL DU MEE BUSINESS SCHOOL AUGUST 2003 MOCK EXAMINATIONS IOP 201-Q (INDUSTRIAL PSYCHOLOGICAL RESEARCH)

DE CHAZAL DU MEE BUSINESS SCHOOL AUGUST 2003 MOCK EXAMINATIONS IOP 201-Q (INDUSTRIAL PSYCHOLOGICAL RESEARCH) DE CHAZAL DU MEE BUSINESS SCHOOL AUGUST 003 MOCK EXAMINATIONS IOP 01-Q (INDUSTRIAL PSYCHOLOGICAL RESEARCH) Time: hours READ THE INSTRUCTIONS BELOW VERY CAREFULLY. Do not open this question paper until

More information

Sampling. Module II Chapter 3

Sampling. Module II Chapter 3 Sampling Module II Chapter 3 Topics Introduction Terms in Sampling Techniques of Sampling Essentials of Good Sampling Introduction In research terms a sample is a group of people, objects, or items that

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 Data Analysis and Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasini/teaching.html Suhasini Subba Rao Review Our objective: to make confident statements about a parameter (aspect) in

More information

Sets and Set notation. Algebra 2 Unit 8 Notes

Sets and Set notation. Algebra 2 Unit 8 Notes Sets and Set notation Section 11-2 Probability Experimental Probability experimental probability of an event: Theoretical Probability number of time the event occurs P(event) = number of trials Sample

More information

ST Presenting & Summarising Data Descriptive Statistics. Frequency Distribution, Histogram & Bar Chart

ST Presenting & Summarising Data Descriptive Statistics. Frequency Distribution, Histogram & Bar Chart ST2001 2. Presenting & Summarising Data Descriptive Statistics Frequency Distribution, Histogram & Bar Chart Summary of Previous Lecture u A study often involves taking a sample from a population that

More information

Statistical Theory 1

Statistical Theory 1 Statistical Theory 1 Set Theory and Probability Paolo Bautista September 12, 2017 Set Theory We start by defining terms in Set Theory which will be used in the following sections. Definition 1 A set is

More information

Two-sample Categorical data: Testing

Two-sample Categorical data: Testing Two-sample Categorical data: Testing Patrick Breheny April 1 Patrick Breheny Introduction to Biostatistics (171:161) 1/28 Separate vs. paired samples Despite the fact that paired samples usually offer

More information

Chapter 5 : Probability. Exercise Sheet. SHilal. 1 P a g e

Chapter 5 : Probability. Exercise Sheet. SHilal. 1 P a g e 1 P a g e experiment ( observing / measuring ) outcomes = results sample space = set of all outcomes events = subset of outcomes If we collect all outcomes we are forming a sample space If we collect some

More information

Psych 230. Psychological Measurement and Statistics

Psych 230. Psychological Measurement and Statistics Psych 230 Psychological Measurement and Statistics Pedro Wolf December 9, 2009 This Time. Non-Parametric statistics Chi-Square test One-way Two-way Statistical Testing 1. Decide which test to use 2. State

More information

3 PROBABILITY TOPICS

3 PROBABILITY TOPICS Chapter 3 Probability Topics 135 3 PROBABILITY TOPICS Figure 3.1 Meteor showers are rare, but the probability of them occurring can be calculated. (credit: Navicore/flickr) Introduction It is often necessary

More information

Chapter. Probability

Chapter. Probability Chapter 3 Probability Section 3.1 Basic Concepts of Probability Section 3.1 Objectives Identify the sample space of a probability experiment Identify simple events Use the Fundamental Counting Principle

More information

Probability and Discrete Distributions

Probability and Discrete Distributions AMS 7L LAB #3 Fall, 2007 Objectives: Probability and Discrete Distributions 1. To explore relative frequency and the Law of Large Numbers 2. To practice the basic rules of probability 3. To work with the

More information

STAT 135 Lab 11 Tests for Categorical Data (Fisher s Exact test, χ 2 tests for Homogeneity and Independence) and Linear Regression

STAT 135 Lab 11 Tests for Categorical Data (Fisher s Exact test, χ 2 tests for Homogeneity and Independence) and Linear Regression STAT 135 Lab 11 Tests for Categorical Data (Fisher s Exact test, χ 2 tests for Homogeneity and Independence) and Linear Regression Rebecca Barter April 20, 2015 Fisher s Exact Test Fisher s Exact Test

More information

Statistics 1L03 - Midterm #2 Review

Statistics 1L03 - Midterm #2 Review Statistics 1L03 - Midterm # Review Atinder Bharaj Made with L A TEX October, 01 Introduction As many of you will soon find out, I will not be holding the next midterm review. To make it a bit easier on

More information

Topic -2. Probability. Larson & Farber, Elementary Statistics: Picturing the World, 3e 1

Topic -2. Probability. Larson & Farber, Elementary Statistics: Picturing the World, 3e 1 Topic -2 Probability Larson & Farber, Elementary Statistics: Picturing the World, 3e 1 Probability Experiments Experiment : An experiment is an act that can be repeated under given condition. Rolling a

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 Data Analysis and Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasini/teaching.html Suhasini Subba Rao Review of previous lecture We showed if S n were a binomial random variable, where

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 Data Analysis and Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasini/teaching.html Lecture 26 (MWF) Tests and CI based on two proportions Suhasini Subba Rao Comparing proportions in

More information

Review of Multiple Regression

Review of Multiple Regression Ronald H. Heck 1 Let s begin with a little review of multiple regression this week. Linear models [e.g., correlation, t-tests, analysis of variance (ANOVA), multiple regression, path analysis, multivariate

More information

Potential Outcomes Model (POM)

Potential Outcomes Model (POM) Potential Outcomes Model (POM) Relationship Between Counterfactual States Causality Empirical Strategies in Labor Economics, Angrist Krueger (1999): The most challenging empirical questions in economics

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 Data Analysis and Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasini/teaching.html Lecture 26 (MWF) Tests and CI based on two proportions Suhasini Subba Rao Comparing proportions in

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 Data Analysis and Statistical Methods Statistics 65 http://www.stat.tamu.edu/~suhasini/teaching.html Suhasini Subba Rao Comparing populations Suppose I want to compare the heights of males and females

More information

Lesson 8: Graphs of Simple Non Linear Functions

Lesson 8: Graphs of Simple Non Linear Functions Student Outcomes Students examine the average rate of change for non linear functions and learn that, unlike linear functions, non linear functions do not have a constant rate of change. Students determine

More information

2 Chapter 2: Conditional Probability

2 Chapter 2: Conditional Probability STAT 421 Lecture Notes 18 2 Chapter 2: Conditional Probability Consider a sample space S and two events A and B. For example, suppose that the equally likely sample space is S = {0, 1, 2,..., 99} and A

More information

Ec1123 Section 7 Instrumental Variables

Ec1123 Section 7 Instrumental Variables Ec1123 Section 7 Instrumental Variables Andrea Passalacqua Harvard University andreapassalacqua@g.harvard.edu November 16th, 2017 Andrea Passalacqua (Harvard) Ec1123 Section 7 Instrumental Variables November

More information

Lecture 14. Analysis of Variance * Correlation and Regression. The McGraw-Hill Companies, Inc., 2000

Lecture 14. Analysis of Variance * Correlation and Regression. The McGraw-Hill Companies, Inc., 2000 Lecture 14 Analysis of Variance * Correlation and Regression Outline Analysis of Variance (ANOVA) 11-1 Introduction 11-2 Scatter Plots 11-3 Correlation 11-4 Regression Outline 11-5 Coefficient of Determination

More information

Lecture 14. Outline. Outline. Analysis of Variance * Correlation and Regression Analysis of Variance (ANOVA)

Lecture 14. Outline. Outline. Analysis of Variance * Correlation and Regression Analysis of Variance (ANOVA) Outline Lecture 14 Analysis of Variance * Correlation and Regression Analysis of Variance (ANOVA) 11-1 Introduction 11- Scatter Plots 11-3 Correlation 11-4 Regression Outline 11-5 Coefficient of Determination

More information

Probability (special topic)

Probability (special topic) Chapter 2 Probability (special topic) Probability forms a foundation for statistics. You may already be familiar with many aspects of probability, however, formalization of the concepts is new for most.

More information

Binomial and Poisson Probability Distributions

Binomial and Poisson Probability Distributions Binomial and Poisson Probability Distributions Esra Akdeniz March 3, 2016 Bernoulli Random Variable Any random variable whose only possible values are 0 or 1 is called a Bernoulli random variable. What

More information

Chapter 15. General Probability Rules /42

Chapter 15. General Probability Rules /42 Chapter 15 General Probability Rules 1 Homework p361 2, 3, 4, 10, 11, 12, 30, 36, 38, 40, 42 2 3 Objective Students use the general addition and multiplication rules to find probabilities of random events.

More information

( ) P A B : Probability of A given B. Probability that A happens

( ) P A B : Probability of A given B. Probability that A happens A B A or B One or the other or both occurs At least one of A or B occurs Probability Review A B A and B Both A and B occur ( ) P A B : Probability of A given B. Probability that A happens given that B

More information

Section 5.1: Probability and area

Section 5.1: Probability and area Section 5.1: Probability and area Review Normal Distribution s z = x - m s Standard Normal Distribution s=1 m x m=0 z The area that falls in the interval under the nonstandard normal curve is the same

More information

FCE 3900 EDUCATIONAL RESEARCH LECTURE 8 P O P U L A T I O N A N D S A M P L I N G T E C H N I Q U E

FCE 3900 EDUCATIONAL RESEARCH LECTURE 8 P O P U L A T I O N A N D S A M P L I N G T E C H N I Q U E FCE 3900 EDUCATIONAL RESEARCH LECTURE 8 P O P U L A T I O N A N D S A M P L I N G T E C H N I Q U E OBJECTIVE COURSE Understand the concept of population and sampling in the research. Identify the type

More information

Chapter 5 Random vectors, Joint distributions. Lectures 18-23

Chapter 5 Random vectors, Joint distributions. Lectures 18-23 Chapter 5 Random vectors, Joint distributions Lectures 18-23 In many real life problems, one often encounter multiple random objects. For example, if one is interested in the future price of two different

More information

Announcements. Lecture 1 - Data and Data Summaries. Data. Numerical Data. all variables. continuous discrete. Homework 1 - Out 1/15, due 1/22

Announcements. Lecture 1 - Data and Data Summaries. Data. Numerical Data. all variables. continuous discrete. Homework 1 - Out 1/15, due 1/22 Announcements Announcements Lecture 1 - Data and Data Summaries Statistics 102 Colin Rundel January 13, 2013 Homework 1 - Out 1/15, due 1/22 Lab 1 - Tomorrow RStudio accounts created this evening Try logging

More information

Math 140 Introductory Statistics

Math 140 Introductory Statistics Math 140 Introductory Statistics Professor Silvia Fernández Lecture 8 Based on the book Statistics in Action by A. Watkins, R. Scheaffer, and G. Cobb. 5.1 Models of Random Behavior Outcome: Result or answer

More information

STAT Chapter 3: Probability

STAT Chapter 3: Probability Basic Definitions STAT 515 --- Chapter 3: Probability Experiment: A process which leads to a single outcome (called a sample point) that cannot be predicted with certainty. Sample Space (of an experiment):

More information

CHAPTER 10 Comparing Two Populations or Groups

CHAPTER 10 Comparing Two Populations or Groups CHAPTER 10 Comparing Two Populations or Groups 10.1 Comparing Two Proportions The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers Comparing Two Proportions

More information

Probability. Hosung Sohn

Probability. Hosung Sohn Probability Hosung Sohn Department of Public Administration and International Affairs Maxwell School of Citizenship and Public Affairs Syracuse University Lecture Slide 4-3 (October 8, 2015) 1/ 43 Table

More information

SESSION 5 Descriptive Statistics

SESSION 5 Descriptive Statistics SESSION 5 Descriptive Statistics Descriptive statistics are used to describe the basic features of the data in a study. They provide simple summaries about the sample and the measures. Together with simple

More information

Resistant Measure - A statistic that is not affected very much by extreme observations.

Resistant Measure - A statistic that is not affected very much by extreme observations. Chapter 1.3 Lecture Notes & Examples Section 1.3 Describing Quantitative Data with Numbers (pp. 50-74) 1.3.1 Measuring Center: The Mean Mean - The arithmetic average. To find the mean (pronounced x bar)

More information

Data Collection: What Is Sampling?

Data Collection: What Is Sampling? Project Planner Data Collection: What Is Sampling? Title: Data Collection: What Is Sampling? Originally Published: 2017 Publishing Company: SAGE Publications, Inc. City: London, United Kingdom ISBN: 9781526408563

More information

Math 223 Lecture Notes 3/15/04 From The Basic Practice of Statistics, bymoore

Math 223 Lecture Notes 3/15/04 From The Basic Practice of Statistics, bymoore Math 223 Lecture Notes 3/15/04 From The Basic Practice of Statistics, bymoore Chapter 3 continued Describing distributions with numbers Measuring spread of data: Quartiles Definition 1: The interquartile

More information

Nicole Dalzell. July 2, 2014

Nicole Dalzell. July 2, 2014 UNIT 1: INTRODUCTION TO DATA LECTURE 3: EDA (CONT.) AND INTRODUCTION TO STATISTICAL INFERENCE VIA SIMULATION STATISTICS 101 Nicole Dalzell July 2, 2014 Teams and Announcements Team1 = Houdan Sai Cui Huanqi

More information

Lesson 19: Understanding Variability When Estimating a Population Proportion

Lesson 19: Understanding Variability When Estimating a Population Proportion Lesson 19: Understanding Variability When Estimating a Population Proportion Student Outcomes Students understand the term sampling variability in the context of estimating a population proportion. Students

More information

MATH 1150 Chapter 2 Notation and Terminology

MATH 1150 Chapter 2 Notation and Terminology MATH 1150 Chapter 2 Notation and Terminology Categorical Data The following is a dataset for 30 randomly selected adults in the U.S., showing the values of two categorical variables: whether or not the

More information

Slide 1 Math 1520, Lecture 21

Slide 1 Math 1520, Lecture 21 Slide 1 Math 1520, Lecture 21 This lecture is concerned with a posteriori probability, which is the probability that a previous event had occurred given the outcome of a later event. Slide 2 Conditional

More information

Objectives. 2.1 Scatterplots. Scatterplots Explanatory and response variables. Interpreting scatterplots Outliers

Objectives. 2.1 Scatterplots. Scatterplots Explanatory and response variables. Interpreting scatterplots Outliers Objectives 2.1 Scatterplots Scatterplots Explanatory and response variables Interpreting scatterplots Outliers Adapted from authors slides 2012 W.H. Freeman and Company Relationships A very important aspect

More information

Probability: Why do we care? Lecture 2: Probability and Distributions. Classical Definition. What is Probability?

Probability: Why do we care? Lecture 2: Probability and Distributions. Classical Definition. What is Probability? Probability: Why do we care? Lecture 2: Probability and Distributions Sandy Eckel seckel@jhsph.edu 22 April 2008 Probability helps us by: Allowing us to translate scientific questions into mathematical

More information

Statistics for Business and Economics

Statistics for Business and Economics Statistics for Business and Economics Basic Probability Learning Objectives In this lecture(s), you learn: Basic probability concepts Conditional probability To use Bayes Theorem to revise probabilities

More information

Lecture 9. Selected material from: Ch. 12 The analysis of categorical data and goodness of fit tests

Lecture 9. Selected material from: Ch. 12 The analysis of categorical data and goodness of fit tests Lecture 9 Selected material from: Ch. 12 The analysis of categorical data and goodness of fit tests Univariate categorical data Univariate categorical data are best summarized in a one way frequency table.

More information

Chapter 2 Solutions Page 15 of 28

Chapter 2 Solutions Page 15 of 28 Chapter Solutions Page 15 of 8.50 a. The median is 55. The mean is about 105. b. The median is a more representative average" than the median here. Notice in the stem-and-leaf plot on p.3 of the text that

More information

6 THE NORMAL DISTRIBUTION

6 THE NORMAL DISTRIBUTION CHAPTER 6 THE NORMAL DISTRIBUTION 341 6 THE NORMAL DISTRIBUTION Figure 6.1 If you ask enough people about their shoe size, you will find that your graphed data is shaped like a bell curve and can be described

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 Data Analysis and Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasini/teaching.html Suhasini Subba Rao Motivations for the ANOVA We defined the F-distribution, this is mainly used in

More information

Lecture 6. Probability events. Definition 1. The sample space, S, of a. probability experiment is the collection of all

Lecture 6. Probability events. Definition 1. The sample space, S, of a. probability experiment is the collection of all Lecture 6 1 Lecture 6 Probability events Definition 1. The sample space, S, of a probability experiment is the collection of all possible outcomes of an experiment. One such outcome is called a simple

More information

Chapter 7 Wednesday, May 26th

Chapter 7 Wednesday, May 26th Chapter 7 Wednesday, May 26 th Random event A random event is an event that the outcome is unpredictable. Example: There are 45 students in this class. What is the probability that if I select one student,

More information

Regression Analysis. Ordinary Least Squares. The Linear Model

Regression Analysis. Ordinary Least Squares. The Linear Model Regression Analysis Linear regression is one of the most widely used tools in statistics. Suppose we were jobless college students interested in finding out how big (or small) our salaries would be 20

More information

Announcements. Lecture 5: Probability. Dangling threads from last week: Mean vs. median. Dangling threads from last week: Sampling bias

Announcements. Lecture 5: Probability. Dangling threads from last week: Mean vs. median. Dangling threads from last week: Sampling bias Recap Announcements Lecture 5: Statistics 101 Mine Çetinkaya-Rundel September 13, 2011 HW1 due TA hours Thursday - Sunday 4pm - 9pm at Old Chem 211A If you added the class last week please make sure to

More information

Topic 4 Probability. Terminology. Sample Space and Event

Topic 4 Probability. Terminology. Sample Space and Event Topic 4 Probability The Sample Space is the collection of all possible outcomes Experimental outcome An outcome from a sample space with one characteristic Event May involve two or more outcomes simultaneously

More information

AP Final Review II Exploring Data (20% 30%)

AP Final Review II Exploring Data (20% 30%) AP Final Review II Exploring Data (20% 30%) Quantitative vs Categorical Variables Quantitative variables are numerical values for which arithmetic operations such as means make sense. It is usually a measure

More information

Chapter 2. Mean and Standard Deviation

Chapter 2. Mean and Standard Deviation Chapter 2. Mean and Standard Deviation The median is known as a measure of location; that is, it tells us where the data are. As stated in, we do not need to know all the exact values to calculate the

More information

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. MGF 1106 Math for Liberal Arts I Summer 2008 - Practice Final Exam Dr. Schnackenberg If you do not agree with the given answers, answer "E" for "None of the above". MULTIPLE CHOICE. Choose the one alternative

More information

Exercise 1. Exercise 2. Lesson 2 Theoretical Foundations Probabilities Solutions You ip a coin three times.

Exercise 1. Exercise 2. Lesson 2 Theoretical Foundations Probabilities Solutions You ip a coin three times. Lesson 2 Theoretical Foundations Probabilities Solutions monia.ranalli@uniroma3.it Exercise 1 You ip a coin three times. 1. Use a tree diagram to show the possible outcome patterns. How many outcomes are

More information

Math 140 Introductory Statistics

Math 140 Introductory Statistics 5. Models of Random Behavior Math 40 Introductory Statistics Professor Silvia Fernández Chapter 5 Based on the book Statistics in Action by A. Watkins, R. Scheaffer, and G. Cobb. Outcome: Result or answer

More information

Homework (due Wed, Oct 27) Chapter 7: #17, 27, 28 Announcements: Midterm exams keys on web. (For a few hours the answer to MC#1 was incorrect on

Homework (due Wed, Oct 27) Chapter 7: #17, 27, 28 Announcements: Midterm exams keys on web. (For a few hours the answer to MC#1 was incorrect on Homework (due Wed, Oct 27) Chapter 7: #17, 27, 28 Announcements: Midterm exams keys on web. (For a few hours the answer to MC#1 was incorrect on Version A.) No grade disputes now. Will have a chance to

More information

Chapter 1 - Lecture 3 Measures of Location

Chapter 1 - Lecture 3 Measures of Location Chapter 1 - Lecture 3 of Location August 31st, 2009 Chapter 1 - Lecture 3 of Location General Types of measures Median Skewness Chapter 1 - Lecture 3 of Location Outline General Types of measures What

More information

Econ 113. Lecture Module 2

Econ 113. Lecture Module 2 Econ 113 Lecture Module 2 Contents 1. Experiments and definitions 2. Events and probabilities 3. Assigning probabilities 4. Probability of complements 5. Conditional probability 6. Statistical independence

More information

Module 03 Lecture 14 Inferential Statistics ANOVA and TOI

Module 03 Lecture 14 Inferential Statistics ANOVA and TOI Introduction of Data Analytics Prof. Nandan Sudarsanam and Prof. B Ravindran Department of Management Studies and Department of Computer Science and Engineering Indian Institute of Technology, Madras Module

More information

Lecture 2. Conditional Probability

Lecture 2. Conditional Probability Math 408 - Mathematical Statistics Lecture 2. Conditional Probability January 18, 2013 Konstantin Zuev (USC) Math 408, Lecture 2 January 18, 2013 1 / 9 Agenda Motivation and Definition Properties of Conditional

More information

Lecture 2: Probability and Distributions

Lecture 2: Probability and Distributions Lecture 2: Probability and Distributions Ani Manichaikul amanicha@jhsph.edu 17 April 2007 1 / 65 Probability: Why do we care? Probability helps us by: Allowing us to translate scientific questions info

More information

7.1: What is a Sampling Distribution?!?!

7.1: What is a Sampling Distribution?!?! 7.1: What is a Sampling Distribution?!?! Section 7.1 What Is a Sampling Distribution? After this section, you should be able to DISTINGUISH between a parameter and a statistic DEFINE sampling distribution

More information

tossing a coin selecting a card from a deck measuring the commuting time on a particular morning

tossing a coin selecting a card from a deck measuring the commuting time on a particular morning 2 Probability Experiment An experiment or random variable is any activity whose outcome is unknown or random upfront: tossing a coin selecting a card from a deck measuring the commuting time on a particular

More information

University of Jordan Fall 2009/2010 Department of Mathematics

University of Jordan Fall 2009/2010 Department of Mathematics handouts Part 1 (Chapter 1 - Chapter 5) University of Jordan Fall 009/010 Department of Mathematics Chapter 1 Introduction to Introduction; Some Basic Concepts Statistics is a science related to making

More information

STT 315 Problem Set #3

STT 315 Problem Set #3 1. A student is asked to calculate the probability that x = 3.5 when x is chosen from a normal distribution with the following parameters: mean=3, sd=5. To calculate the answer, he uses this command: >

More information

Conditional Probability 2 Solutions COR1-GB.1305 Statistics and Data Analysis

Conditional Probability 2 Solutions COR1-GB.1305 Statistics and Data Analysis Conditional Probability 2 Solutions COR-GB.305 Statistics and Data Analysis The Birthday Problem. A class has 50 students. What is the probability that at least two students have the same birthday? Assume

More information

The enumeration of all possible outcomes of an experiment is called the sample space, denoted S. E.g.: S={head, tail}

The enumeration of all possible outcomes of an experiment is called the sample space, denoted S. E.g.: S={head, tail} Random Experiment In random experiments, the result is unpredictable, unknown prior to its conduct, and can be one of several choices. Examples: The Experiment of tossing a coin (head, tail) The Experiment

More information

Intro to Probability Day 3 (Compound events & their probabilities)

Intro to Probability Day 3 (Compound events & their probabilities) Intro to Probability Day 3 (Compound events & their probabilities) Compound Events Let A, and B be two event. Then we can define 3 new events as follows: 1) A or B (also A B ) is the list of all outcomes

More information

Discrete Mathematics and Probability Theory Spring 2016 Rao and Walrand Note 14

Discrete Mathematics and Probability Theory Spring 2016 Rao and Walrand Note 14 CS 70 Discrete Mathematics and Probability Theory Spring 2016 Rao and Walrand Note 14 Introduction One of the key properties of coin flips is independence: if you flip a fair coin ten times and get ten

More information

Section 7.2 Homework Answers

Section 7.2 Homework Answers 25.5 30 Sample Mean P 0.1226 sum n b. The two z-scores are z 25 20(1.7) n 1.0 20 sum n 2.012 and z 30 20(1.7) n 1.0 0.894, 20 so the probability is approximately 0.1635 (0.1645 using Table A). P14. a.

More information

MATH 10 INTRODUCTORY STATISTICS

MATH 10 INTRODUCTORY STATISTICS MATH 10 INTRODUCTORY STATISTICS Tommy Khoo Your friendly neighbourhood graduate student. Week 1 Chapter 1 Introduction What is Statistics? Why do you need to know Statistics? Technical lingo and concepts:

More information

Review of probability. Nuno Vasconcelos UCSD

Review of probability. Nuno Vasconcelos UCSD Review of probability Nuno Vasconcelos UCSD robability probability is the language to deal with processes that are non-deterministic examples: if I flip a coin 00 times how many can I expect to see heads?

More information

Chapter 11. Correlation and Regression

Chapter 11. Correlation and Regression Chapter 11. Correlation and Regression The word correlation is used in everyday life to denote some form of association. We might say that we have noticed a correlation between foggy days and attacks of

More information

Analysis of Variance. Contents. 1 Analysis of Variance. 1.1 Review. Anthony Tanbakuchi Department of Mathematics Pima Community College

Analysis of Variance. Contents. 1 Analysis of Variance. 1.1 Review. Anthony Tanbakuchi Department of Mathematics Pima Community College Introductory Statistics Lectures Analysis of Variance 1-Way ANOVA: Many sample test of means Department of Mathematics Pima Community College Redistribution of this material is prohibited without written

More information

Probability deals with modeling of random phenomena (phenomena or experiments whose outcomes may vary)

Probability deals with modeling of random phenomena (phenomena or experiments whose outcomes may vary) Chapter 14 From Randomness to Probability How to measure a likelihood of an event? How likely is it to answer correctly one out of two true-false questions on a quiz? Is it more, less, or equally likely

More information

Bemidji Area Schools Outcomes in Mathematics Algebra 2 Applications. Based on Minnesota Academic Standards in Mathematics (2007) Page 1 of 7

Bemidji Area Schools Outcomes in Mathematics Algebra 2 Applications. Based on Minnesota Academic Standards in Mathematics (2007) Page 1 of 7 9.2.1.1 Understand the definition of a function. Use functional notation and evaluate a function at a given point in its domain. For example: If f x 1, find f(-4). x2 3 Understand the concept of function,

More information

From Bayes Theorem to Pattern Recognition via Bayes Rule

From Bayes Theorem to Pattern Recognition via Bayes Rule From Bayes Theorem to Pattern Recognition via Bayes Rule Slecture by Varun Vasudevan (partially based on Prof. Mireille Boutin s ECE 662 lecture) February 12, 2014 What will you learn from this slecture?

More information

the yellow gene from each of the two parents he wrote Experiments in Plant

the yellow gene from each of the two parents he wrote Experiments in Plant CHAPTER PROBLEM Did Mendel s results from plant hybridization experiments contradict his theory? Gregor Mendel conducted original experiments offspring can have a yellow pod only if it inherits to study

More information

Lecture 4 Scatterplots, Association, and Correlation

Lecture 4 Scatterplots, Association, and Correlation Lecture 4 Scatterplots, Association, and Correlation Previously, we looked at Single variables on their own One or more categorical variable In this lecture: We shall look at two quantitative variables.

More information

green green green/green green green yellow green/yellow green yellow green yellow/green green yellow yellow yellow/yellow yellow

green green green/green green green yellow green/yellow green yellow green yellow/green green yellow yellow yellow/yellow yellow CHAPTER PROBLEM Did Mendel s results from plant hybridization experiments contradict his theory? Gregor Mendel conducted original experiments to study the genetic traits of pea plants. In 1865 he wrote

More information

What is statistics? Statistics is the science of: Collecting information. Organizing and summarizing the information collected

What is statistics? Statistics is the science of: Collecting information. Organizing and summarizing the information collected What is statistics? Statistics is the science of: Collecting information Organizing and summarizing the information collected Analyzing the information collected in order to draw conclusions Two types

More information

Psych Jan. 5, 2005

Psych Jan. 5, 2005 Psych 124 1 Wee 1: Introductory Notes on Variables and Probability Distributions (1/5/05) (Reading: Aron & Aron, Chaps. 1, 14, and this Handout.) All handouts are available outside Mija s office. Lecture

More information