Introduction to Statistics

Size: px
Start display at page:

Download "Introduction to Statistics"

Transcription

1 Chapter 1 Introduction to Statistics 1.1 Preliminary Definitions Definition 1.1. Data are observations (such as measurements, genders, survey responses) that have been collected. Definition 1.2. Statistics is a collection of methods for planning studies and experiments, obtaining data, and then organizing, summarizing, presenting, analyzing, interpreting, and drawing conclusions based on data. Definition 1.3. A Population is the entire collection of individuals or measurements about which information is desired. Definition 1.4. A Sample is a subset of the population that has been selected for study. Definition 1.5. A statistic is a numerical description of a SAMPLE. Definition 1.6. A parameter is a numerical description of a POPULATION. Definition 1.7. Statistical Inference consists of methods of techniques for generalizing from a sample to the population from which the sample is selected. Definition 1.8. Sampling Variability describe the extent to which samples differ from one another. 1

2 2 1.2 Framework of Statistics Population Sample Parameter statistic

3 3 Idea for a Confidence Interval

4 4 Idea for a Hypothesis Test

5 Chapter 2 Probability Remark 2.1. The information regarding probability can be found in Chapter 4 of your textbook. How do we measure likeliness? How do we determine what is considered (un)likely? 1 Definition 2.1. The Probability of an event is 0 Definition 2.2. A significance level α is the largest probability an unlikely event can have. 5

6 6 2.1 Definitions & Examples Definition 2.3. The result of a single trial of a given procedure is called an outcome. Definition 2.4. An event is any collection of results of outcomes of a procedure. Definition 2.5. A simple event is an outcome or an event that cannot be further broken down into simpler components. Definition 2.6. The sample space for a procedure consists of all possible simple events. Example 2.1. A bucket contains some numbered balls. Eventually, one ball will be removed at random. 1. Find the sample space for this procedure. 2. Let A denote the event that the outcome is even. Describe A in terms of simple events. Example 2.2. Two numbered balls are removed individually from a bucket. Replacing each after they are removed. The numbers on the balls are written down. 1. Find the sample space for this procedure. 2. Let B denote the event that the outcome at least one ball is a three. Describe B in terms of simple events. 3. Is the event at least one ball is a three a simple event? 4. If the balls were added together, what would be the sample space?

7 7 2.2 Some Methods for Computing Probabilities of an Event There are three approaches to determining the probability of an event: 1. Subjective Probabilities 2. Relative Frequency Approximation 3. Classical Approach Theorem 2.1. The Law of Large Numbers states that as a procedure is repeated again and again, the relative frequency approximation for the probability of an event tends to approach the actual probability.

8 8 Example men and women were surveyed. They were asked Which do you like better: Pollen or Propolis? The answers are tallied below. Pollen Propolis Total Men Women Total a. What is the probability that a randomly selected survey respondent will be a woman? Pollen Propolis Total Men Women Total b. What is the probability that a randomly selected survey respondent will prefer pollen? Pollen Propolis Total Men Women Total c. If you consider only the female responses, what is the probability that you would randomly select one of the women that prefer pollen? Pollen Propolis Total Men Women Total Example 2.4. A colored ball is removed, at random, from a bucket. What is the probability that the ball will be green?

9 9 Example 2.5. Two fair four-sided dice is rolled. What is the probability that both numbers will be even? Example fair four-sided dice are rolled. What is the probability that all the numbers will be even?

10 Counting Fundamental Counting Rule Given two sequential events, if the first can occur m ways, and the second event can occur n ways, then the number of ways both events can occur in sequence is equal to m n. Example 2.7. An airline has 6 routes from city A to City B, and 9 routes from City B to City C. If you were to take this use this airline, how many routes could you take from City A to City C? Example 2.8. How many ways can a family with 6 members be lined up to take a family portrait?

11 Order or no order? Repeats or not? Definition 2.7. Permutations of items are arrangements in which different sequences of the same items are counted separately. Definition 2.8. Combinations of items are arrangements in which different sequences of the same items are not counted separately. Selecting r of n distinct objects. Unordered Repeats No Repeats nc r Ordered n r np r

12 12 Example 2.9. You have 4 extra tickets for a concert and 7 friends. How many different groups of your friends could accompany you to the concert? Example You have three astronauts, Anna, George, and Michele, on the first Mission to Mars. For the first Marswalk, two of them will be allowed to leave their flying saucer, and walk on the planet; one will have to remain behind. How many different ways can they be assigned a job for their first landing? If they are randomly given their assignment, what is the probability that George will be left on the ship? Example How many five letter words can be made with the letters F, S, H, E. A letter can be used more than once. What is the probability that a five letter word will start with the letter F? Only the letters F, S, H, E can be used. Letters can be repeated. When some Items are Identical to Others - Another Permutation Rule Example How many different ways can the letters in TENNESSEE be arranged? If these letters are randomly arranged, what it the probability that they will spell TENNESSEE?

13 The Addition Rule for Probabilities Definition 2.9. A compound event is any event combining two or more simple events. Notation 2.1. More notation that will be used (A or B) = (A and B) = Formal Addition Rule P (A or B) = P (A) + P (B) P (A and B) Example Suppose the following: P (A) =.9, P (B) =.8, P (A and B) =.77. Find P (A or B). Example In a group of 101 students 40 are juniors, 50 are female, and 22 are female juniors. Find the probability that a student picked from this group at random is either a junior or female.

14 14 Example A family of 6 is going to have their picture taken. The photographer is going to randomly line everyone up. What is the probability that the mother ends up in the first chair or the father ends up in the sixth chair? Example A single card is chosen at random from a standard deck of 52 playing cards. What is the probability of choosing a king or a club? Example Two dice are rolled. The first is a fair 6-sided die. The second is a fair 4-sided die. Once they are rolled, the two numbers on the two disc are used to create a 2 digit number. The number from the six sided die is used to make the 10s digit. The number from the 4-sided die is used to make the ones digit. What it the probability that the resulting number is odd or begins with an even number?

15 15 Definition Events A and B are disjoint( or mutually exclusive) if they cannot occur at the same time. (That is, they do not overlap.) Probability of the Intersection of Two Disjoint Events If events A and B are disjoint, then P (A and B) = Addition Rule for DISJOINT Events If events A and B are disjoint, then P (A or B) = P (A) + P (B) P (A and B) = Example Suppose that A and B are disjoint events such that the following is true: P (A) =.9, P (B) =.06. Find P (A or B). Example In a group of 201 students 70 are freshmen, 41 are sophomores, 30 are junior, 50 are seniors, and 10 are graduate students. Find the probability that a student picked from this group at random is either a freshman or sophomore. Example A family of 6 is going to have their picture taken. The photographer is going to randomly line everyone up. What is the probability that the mother ends up in the first chair or the father ends up in the first chair?

16 16 Example Two dice are rolled. The first is a fair 6-sided die. The second is a fair 4-sided die. Once they are rolled, the two numbers on the two disc are used to create a 2 digit number. The number from the six sided die is used to make the 10s digit. The number from the 4-sided die is used to make the ones digit. What it the probability that the resulting number is odd or ends with a 2? Example A bucket contains some bouncy balls that are colored as well as numbered. The following table indicates the number of each kind of ball in the bucket. Yellow Green Orange Red Blue Brown Purple Total Odd Even Total If a ball is randomly chosen, what is the probability that the ball will be blue even ball or a purple odd ball? 2. If a ball is randomly chosen, what is the probability that the ball will be blue or purple? 3. If a ball is randomly chosen, what is the probability that the ball will be even, or purple?

17 17 Rule for Complimentary Events Example Find the indicated probabilities. 1. Suppose P (A) =.23. Find P (Ā). 2. Suppose P (Ā) =.12, P ( B) =.21, P ( C) =.22. Find P (B). Example Same bucket as used in Example What is the probability that a randomly selected ball is neither brown nor even? Yellow Green Orange Red Blue Brown Purple Total Odd Even Total

18 18 Example Two dice are rolled. The first is a fair 6-sided die. The second is a fair 4-sided die. Once they are rolled, the two numbers on the two disc are used to create a 2 digit number. The number from the six sided die is used to make the 10s digit. The number from the 4-sided die is used to make the ones digit. What it the probability that the resulting number is not 42? Example A single card is chosen at random from a standard deck of 52 playing cards. What is the probability of choosing neither a king nor a club? Example In a group of 700 families, 75 had more than 3 children, 125 had exactly 3 children, 300 had 2 children, and 100 had only a single child. If one family is randomly selected, what is the probability that it will have no children?

19 Conditional Probability & The Multiplication Rule Definition Let A and B be two events. The conditional probability of A given B, P (A B), is the probability that A happens given the information that B occurs. It is the probability of an event with the additional information that some other event has already occurred. Denoted by P (B A). Example A bucket contains some bouncy balls that are colored as well as numbered. The following table indicates the number of each kind of ball in the bucket. The contents of the bucket are separated into two buckets, odds and evens. If we randomly select a single ball from the odd bucket, what is the probability that the ball is red? Yellow Green Orange Red Blue Brown Purple Total Odd Even Total Example Two dice are rolled. The first is a fair 6-sided die. The second is a fair 4-sided die. Once they are rolled, the two numbers on the two disc are used to create a 2 digit number. The number from the six sided die is used to make the 10s digit. The number from the 4-sided die is used to make the ones digit. If a two appeared on the 6-sided die, what it the probability that the resulting number is odd? Example Five cards are dealt from a freshly shuffled deck of cards. Suppose the first four cards are kings, what it the probability that the fifth card will be an ace?

20 20 Definition Two events A and B are independent if the occurrence of one event does not affect the probability of the occurrence of the other event. If A and B are not independent, they are said to be dependent. If two events A and B are independent, then P (A B) = P (A) P (B A) = P (B) P (B and A) = P (A)P (B) Example Given two events A and B. Suppose that P (A B) =.8 and P (A) =.81. Are the events A and B independent? Example Given two independent events A and B. Suppose that P (B) =.8 and P (A) =.42. Find P (B and A). Example An urn contains 2 colored balls: 1 blue & 1 red. If two balls are removed, one at a time, replacing each after it is drawn. What is the probability that the second ball is red, if the first was blue? Example An urn contains 2 colored balls: 1 blue & 1 red. If two balls are removed, one at a time, without replacing each after it is drawn. What is the probability that the second ball is red, if the first was blue? Remark 2.2. The method used for selecting, or sampling items, is very important and can determine whether two events are independent or dependent. Selections (Sampling) without replacement: Dependent events. Selections (Sampling) with replacement: Independent events.

21 21 Formal Multiplication Rule P (A and B) = P (A) P (B A) Example A bucket contains several colored bouncy balls, red,yellow and blue. One at a time, two balls are removed from the bucket. After the first ball is removed, it will not be replaced. What is the probability that the first ball is red and the second bouncy ball is green. Example A bucket contains several colored bouncy balls, red,yellow and blue. One at a time, two balls are removed from the bucket. After the first ball is removed, it is replaced. What is the probability that the first ball is red and the second bouncy ball is green. Example If two cards are dealt from a deck without replacing them, what is the probability that an ace will be dealt first and a two will be dealt second? Example If two cards are dealt from a deck with replacement, what is the probability that an ace will be dealt first and a two will be dealt second?

22 More Conditional Probability Definition Let A and B be two events. The conditional probability of A given B, P (A B), is the probability that A happens given the information that B occurs. It is the probability of an event with the additional information that some other event has already occurred. P (B A) = P (A and B) P (A) Example A statistics professor tosses two coins that cannot be seen by any of the students. One student asks: Did one of the coins turn up heads? Suppose the professor answered yes, find the probability that both coins turned up heads. Example An urn contains 3 colored balls: 2 blue & 1 red. If two balls are removed, one at a time, without replacing each after it is drawn. What is the probability that the second ball is red, if the first was blue?

23 23 Example A student answers a multiple choice examination question that has 4 possible answers. Suppose that the probability that the student knows the answer to the question is 0.80 and the probability that the student guesses is Also, If the student guesses, the probability of a correct guess is If the question is answered correctly, what is the probability that the student really knew the correct answer?

24 Chapter 3 Probability & Random Variables Remark 3.1. This is chapter 5 in the textbook. Our goal is to compute probabilities for Random Procedures/Phenomenon whose outcomes are numbers Definition 3.1. A random variable is a variable (typically represented by x) that has a single numerical value, determined by chance, for each outcome of a procedure. A random variable is a variable whose value is a numerical outcome of a random procedure/phenomenon. Example 3.1. Examples of Random Variables. The weight of a randomly selected package taken from the post office. The amount of time it takes to walk from the first floor to the fourth floor. The temperature of a randomly selected popsicle. The amount of money you spend on your next tank of gas. The number of lunches served in the cafeteria on a given day. The color of a ball pulled out of a bucket. 24

25 25 There are two ways to assign probabilities to a random variable. These provide two types of random variables: Definition 3.2. A Continuous Random Variable has infinitely many values, and the collection of values is not countable. Definition 3.3. A Discrete Random Variable has a collection of possible values that is finite or countable. Random variables will usually ( but not always ) be denoted by capital letters from the end of the alphabet. When a random variable describes a random phenomenon, the sample space S lists the possible values of the random variable. Definition 3.4. A Probability Distribution is a description that gives the probability for each possible value of a random variable. It is often expressed as a table, a formula, or a graph. Examples of Probability Distributions Example 3.2. A bucket contains 4 green, 3 brown and 3 purple bouncy balls. A ball is randomly selected from the bucket. We check the color of the ball. (We could say that we count the number of green balls observed.)

26 26 Example 3.3. A bucket contains 4 green, 3 brown and 3 purple bouncy balls. One at a time, four balls are randomly removed, and replaced, from the bucket. We count the number of green balls observed. Definition 3.5. A Binomial Probability Distribution results from a procedure that meets all the following requirements: a.) The procedure has a fixed number of trials. A trial is a single observation. b.) The trials must be independent. The outcome of any one trial has no affect on the probabilities in the other trials. c.) Each trial must have all outcomes classified into two categories (commonly referred to as success and failure). d.) The probability of a success remains the same for all trials. If X has the Binomial distribution B(n, p) with n observations and probability p of success on each experiment, or observation, the possible values of X are 0, 1, 2,..., n. If k is any one of these values, the binomial probability is P (X = k) = n C k p k (1 p) n k. The mean and standard deviation of a binomial random variable X is µ = np σ = np(1 p)

27 27 Example 3.4. A coin is tossed four times. 1. What is the probability distribution of the discrete random variable X that counts the number of heads? 2. Find P (X > 1). 3. Find P (X 1). 4. Find P (X 1). Remark 3.2. A Binomial Probability Distribution results from a procedure that meets all the following requirements: a.) The procedure has a fixed number of trials. A trial is a single observation. b.) The trials must be independent. The outcome of any one trial has no affect on the probabilities in the other trials. c.) Each trial must have all outcomes classified into two categories (commonly referred to as success and failure). d.) The probability of a success remains the same for all trials.

28 28 Definition 3.6. If X has the Poisson distribution, P oisson(µ), with mean number of occurrences equal to µ, the possible values of X are 0, 1, 2, 3,.... If k is any one of these values, the Poisson probability is P (x) = µx e µ. x! The mean is µ. The standard deviation of a Poisson random variable X is σ = µ. Remark 3.3. A Poisson Probability Distribution results from a procedure that meets all the following requirements: a.) The random variable counts the number of occurrences of an event over a time interval; b.) The occurrences must be random, independent, and uniformly distributed over the time interval. Example 3.5. Assume that the mean number of aircraft accidents in the United States is 8.5 per month. Use the Poisson distribution to find the probability that in a month there will be a.) 6 aircraft accidents. b.) at least 5 aircraft accidents., c.) no more than 7 aircraft accidents. d.) Over a one year period, how many aircraft accidents would you expect there to be?

29 29 PDF vs CDF More Examples of Random Variables - Continuous The probability distribution of X is described by a density curve (a graph). The probability of any event is the area under the density curve and above the x axis, and between the values of X that make up the event. The total area under a density curve is equal to 1, and a density curve never goes below the x-axis. Every individual outcome for a continuous random variable has probability zero.

30 30 Definition 3.7. A continuous random variable has a uniform distribution if its values are spread evenly over the range of possible values. The density curve (graph) of a uniformly distributed random variable is a rectangle. Example 3.6. The amount of time a particular subway train will wait at a station is uniformly distributed between 5 and 10 minutes. Find the probability that the train will wait 1. exactly 6 minutes. 2. at most 6 minutes. 3. at least 7 minutes

31 31 Definition 3.8. A continuous random variable X has a normal distribution with mean µ and standard deviation σ if its density curve is given by y = 1 2πσ e 1 2( x µ σ ) 2. Normal Distribution Density µ µ 3σ µ 2σ µ 1σ x value µ+1σ µ+2σ µ+3σ Normal Distribution Density µ µ 3σ µ 2σ µ 1σ x value µ+1σ µ+2σ µ+3σ Normal Distribution Density µ µ 3σ µ 2σ µ 1σ x value µ+1σ µ+2σ µ+3σ Normal Distribution Density µ µ 3σ µ 2σ µ 1σ x value µ+1σ µ+2σ µ+3σ The probability distribution of X is described by a density curve (a graph). The probability of any event is the area under the density curve and above the x axis, and between the values of X that make up the event. The total area under a density curve is equal to 1, and a density curve never goes below the x-axis. Every individual outcome for a continuous random variable has probability zero.

32 32 Example 3.7. The heights of fully grown white oak trees are normally distributed with a mean height of 90 feet and standard deviation of 3.5 feet. 1. What is the probability that a randomly selected fully grown white oak tree is less than 87 feet tall? 2. What is the probability that a randomly selected fully grown white oak tree is greater than 94 feet tall? Example 3.8. The ACT is an exam used by colleges and universities to evaluate undergraduate applicants. The test scores are normally distributed. In a recent year, the mean test score was 20.1 and the standard deviation was What is the probability that a randomly selected ACT score is between 16 and 24? 2. What is the probability that a randomly selected ACT score is greater then 22.5?

33 t distributions X Definition 3.9. A continuous random variable X has a t-distribution with k degrees of freedom, if its density curve is given by y = Γ ( ) k+1 2 ( kπγ k ) 2 ) k+1 (1 + x2 2. k t distributions X t distributions X t distributions X t distributions X The probability distribution of X is described by a density curve (a graph). The probability of any event is the area under the density curve and above the x axis, and between the values of X that make up the event. The total area under a density curve is equal to 1, and a density curve never goes below the x-axis. Every individual outcome for a continuous random variable has probability zero.

34 chi square distributions X Definition A continuous random variable X has a χ 2 -distribution with k degrees of freedom, if its density curve is given by y = 1 2 k 2 Γ ( )x k 2 1 e x 2 k. 2 chi square distributions X chi square distributions X chi square distributions X The probability distribution of X is described by a density curve (a graph). The probability of any event is the area under the density curve and above the x axis, and between the values of X that make up the event. The total area under a density curve is equal to 1, and a density curve never goes below the x-axis. Every individual outcome for a continuous random variable has probability zero.

35 Measuring the Center of a Distribution p = 0.1 p = 0.25 p = 0.5 p = x x x x Definition The mean of a probability distribution, or the mean of a random variable, is a number that indicates the center, or location, of the random variables distribution. If X is a discrete random variable whose distribution is Possible Value of X x 1 x 2... x k Probability P (x 1 ) P (x 2 )... P (x k ) then mean of X is computed as follows: µ X = x 1 P (x 1 ) + x 2 P (x 2 ) + + x k P (x k ) The mean for a random variable X is also called the EXPECTED VALUE OF X. If you repeat a random procedure an extreme number of times, and average the observed random variable will be very close to the mean of the random variable. The mean is what you expect to see on average. If a random variable X has a Binomial Distribution with n trials and probability of success p, then µ X = np. You will not need to compute the mean for a continuous random variable X

36 Measuring the Spread of a Distribution Definition The standard deviation of a probability distribution, or the standard deviation of a random variable, is a number that indicates the spread, or dispersion, of the random variables distribution. If X is a discrete random variable with mean µ, and distribution Possible Value of X x 1 x 2... x k Probability P (x 1 ) P (x 2 )... P (x k ) then the standard deviation of X is σ = (x 1 µ) 2 P (x 1 ) + (x 2 µ) 2 P (x 2 ) + + (x k µ) 2 P (x k ) Example 3.9. Determine the mean, standard deviation, and variance for the following distribution: X P (X) If a random variable X has a Binomial Distribution with n trials and probability of success p, then σ X = np(1 p). You will not need to compute the mean for a continuous random variable. Variance X The Variance of a random variable X is its standard deviation squared. The Variance of a random variable is another measure of the spread of a random variables distribution.

37 Percentiles & Critical Values Percentiles Definition The 100α th -percentile is a number, P 100α, that divides the probability distribution of a random variable X into two parts where P (X P α ) α and P (X P α ) 1 α. The 100α th -percentile is a number, P 100α, that separates the bottom 100α% of a distribution from the top 100(1 α)%. Normal Chi Square t distribution InvNorm(α, µ, σ) InvT (α, df) MATH Solver... MATH Solver... 0 = α χ 2 cdf(0, X, df) 0 = α tcdf( 2 99, X, df) ENTER ALPHA ENTER ENTER ALPHA ENTER

38 38 Normal Chi Square t distribution InvNorm(α, µ, σ) InvT (α, df) MATH Solver... MATH Solver... 0 = α χ 2 cdf(0, X, df) 0 = α tcdf( 2 99, X, df) ENTER ALPHA ENTER ENTER ALPHA ENTER Example Find P 99 for a t distributed random variable with 5 degrees of freedom. Example Find P 95 for a χ 2 -square distributed random variable with 3 degrees of freedom. Example Find P 90 for a normally distributed random variable with µ = 5, and σ = 3. Example In a large section of a statistics class, the points for the final exam are normally distributed with a mean of 72 and a standard deviation of 9. Find the lowest score on the final exam that would qualify a student for an A, if an A should include the top 10% of the class. Example The annual per capita utilization of apples (in pounds) in the United States can be approximated by a normal distribution with µ = 17.4 lb. and σ = 4 lb. What annual per capita utilization of apples represents the 10th percentile?

39 39 Critical Values Definition A critical value is a number that is used to separate unusual ( unlikely ) values for a random variable from those values that are expected ( likely ) to occur. The placement of a critical value will depend on: the distribution of the random variable; the significance level α used to define what it means for an event to be unlikely. Some questions will require the determination of two critical values. ( Usual, Expected, Common, Likely ) values will generally be considered values close to the mean. ( Unusual, Unexpected, Surprising, Unlikely ) values will generally be considered values far to the mean.

40 40 Critical Values for Specific Distributions Notation 3.1. z α, or z, denotes a critical value for a Standard Normal Random variable with an area, or probability, of α to its right. Example Find z.05 standard normal Notation 3.2. t α,k, or t, denotes a critical value for a t-random Variable, with k degrees of freedom, with an area, or probability, of α to its right. Example Find t.05,3 t distribution Notation 3.3. χ 2 α,k denotes a critical value for a χ2 -Random Variable, with k degrees of freedom, with an area, or probability, of α to its right. Example Find χ 2.05,4 chi square The critical values given above define ( Unusual, Unexpected, Surprising, Unlikely ) values to be numbers that are far from zero. Later, we will define these values to be the distance between what we expect to happen, and what actually happens. This translates into the idea that unlikely values are those that are a great distance (relatively) from what we expect.

41 41 Tail Events & Tail Probabilities Definition A one-tail event for a random variable X is an event such as {X t}, {X t}, where t is any number. Definition A two-tail event for a random variable X is an event such as {X > t or X < r}, where r < t are any numbers. Definition A tail probability is the probability of a ( two ) tail event. Percentiles and Critical Values are defined in terms of tail events. If a tail probability is smaller than a given significance level, α, then the tail event will be considered unlikely. If a tail probability is smaller than a given significance level, α, then any outcome within that tail event will be considered ( Unusual, Unexpected, Surprising, Unlikely ).

42 42 Depending upon the situation, and significance level α, we may define ( Unusual, Unexpected, Surprising, Unlikely ) values to be values that are Far from the mean AND too small µ Far from the mean AND too big µ Far from the mean AND either too big or too small µ

43 Chapter 4 Samples Population Sample Parameter statistic 43

44 44 Remark 4.1. You should read Chapter 1 from your textbook. We will cover only the information necessary for the procedures that will be introduced later. 4.1 Goals: Describe a population s unknown distribution; Describe a population s unknown parameters; Describe the nature of the relationship between populations. 4.2 Collecting Data Definition 4.1. SAMPLE: 1. VERB To sample a population is the act of selecting individuals, items, object, or members of a population. 2. NOUN A Sample is the subset of the population that has been selected. Definition 4.2. A simple random sample of n subjects is selected in such a way that every possible sample of the same size n has the same probability of being selected. All of the procedures that will be discussed later will use a simple random sample. A simple random sample is a selection of n subjects without replacement. This means we have dependent selections from a finite population. If the sample size is no more than 5% of the overall population, we will treat the selections as being independent. We will think of our samples, as selections make with replacement. For examples in class, we will take samples ( make selections ) with replacement.

45 45 Other Sample Types Definition 4.3. In systematic sample, we select some starting point and then select every k th element in a population. Definition 4.4. In stratified sample, we subdivide the population into at least two different subgroups ( or strata ) so that subjects within the same subgroup share the same characteristics. Then we draw a sample from each subgroup (or stratum). Definition 4.5. In cluster sampling, we first divide the population area into sections ( or clusters ). Then we randomly select some of those clusters and choose all the members from those selected clusters. Definition 4.6. With convenience sample, we simply use results that are very easy to get. Definition 4.7. In an observational study, we observe and measure specific characteristics, but do not attempt to modify the subjects being studied. Definition 4.8. In an experiment, we apply some treatment and then proceed to observe its effects on the subjects. ( Subjects in experiments are called experimental units.) Type of Observational Studies Definition 4.9. In a cross-sectional study, data are observed measured, and collected at one point in time. Definition In a retrospective study, data are collected from the past by going back in time (through examination of records, interviews, and so on. Definition In a prospective study, data are collected in the future from groups sharing common factors.

46 Describing Populations using Graphs of Sample Data Graphs of Sample ( Quantitative ) data can be used to make guesses about the distribution of a population. We will look at the graphs to determine whether they appear to be : Normal Uniform Symmetric Skewed Definition A ( relative ) frequency histogram is a graph consisting of bars of equal width drawn adjacent to each other ( unless there are gaps in the data). The horizontal scale represents classes of quantitative data value and the vertical represents ( relative )frequencies. The heights of the bars correspond to the ( relative ) frequency values.

47 47 Remark 4.2. Having a guess about the SHAPE of a distribution, allows you make a guess about how to compute probabilities about future samples from the same type of distribution. If we do not know the SHAPE of a distribution, we CAN NOT make any GOOD guesses about the probability of an event. Assessing Normality with a Small Data Set With a small data set, the shape of a distribution may not be very clear. It is very important to us to be able to identify populations with Normal Distributions. A normal quantile plot can assist us with this. Normal Distribution Non-Normal Distribution

48 48 Stemplot A Stemplot (Stem & Leaf plot) is a quick way to look at the SHAPE of a distribution, if your working by hand, and have a relatively small data set. Stem Leaf Other Types of Graphics Definition A scatterplot is a plot of paired (x, y) quantitative data with a horizontal x-axis and vertical y-axis.

49 49 Definition A time-series graph is a graph of times-series data, which are quantitative data that have been collected over a period of time. Definition A Pareto chart is a bar graph for categorical data, with the bars arranged in descending order according to frequencies. Definition A Pie Chart is a graph that depicts categorical data as slices of a circle, in which each slice is proportional to the frequency count for the category.

50 Estimating Population Parameters using Sample Data With a probability distribution for a random variable, defined several numbers that could be used to describe the characteristics of the distribution. Center Mean Spread Standard Deviation Proportion of Successes Percentiles If we have a population, but don t know its distribution, we probably don t know some of these parameters. We will need a method to estimate these parameters, based on samples that we take. Remark 4.3. Not every parameter is interesting for every population.

51 Estimating a Population Mean Definition The sample mean is an estimate of the mean of a probability distribution. It can be found by adding all the sample data values together, and dividing by the sample size. x = x 1 + x x n n Example 4.1. Find the mean of the following sample values: It is a statistic. It is one possible measure of the center of a SAMPLE. It is an estimate of a center of a probability distribution. Its value will change depending upon the sample taken. one extreme value can change the value of the mean substantially. Sample means drawn from the same population tend to vary less than other measures of center.

52 52 Estimating the SAMPLE MEAN from a Frequency Distribution # Frequency N 106 Estimating the SAMPLE MEAN from a Relative Frequency Distribution # Frequency

53 Estimating a Population Standard Deviation Definition The sample standard deviation is an estimate of the standard deviation of a probability distribution. It is denoted by s and is a measure of how much the sample data deviates away from the sample mean x. s = (x x) 2 n 1 Example 4.2. Find the sample standard deviation of the following sample values: Facts about the sample standard deviation s 0 s = 0 only if all if the data values are the same. s will increase greatly if only one additional data value is added that looks very different from the others. The units for s are the same as the units on the original data. s 2 = the sample variance is another measure of variation. It is the square of the sample standard deviation.

54 54 Estimating the STANDARD DEVIATION from a Dataset # Frequency N 106 Definition The range of a data set is the measure of spread found by subtracting the smallest data value from the largest data value. Range Rule of Thumb σ Range Estimating a Proportion of Successes Definition The sample proportion is an estimate of the probability of a success p for some random procedure. It is denoted by ˆp. It is also called a sample proportion. ˆp = # of successes n Example 4.3. Find the sample proportion for the following samples:

55 Estimating Percentiles Definition The 100α-Percentile of a dataset, P 100α, is a number that breaks the ordered dataset into two groups with about 100α% of the dataset less than, or equal to, P 100α and about 100(1 α)% of the dataset greater than, or equal to, P 100α. Finding the Percentile of a Data Value Percentile of x = # of data values < x n Example 4.4. Find the percentile of 18 for the following data: 100 (Round up) 2, 3, 4, 6, 7, 7, 8, 8, 9, 10, 13, 13, 14, 16, 18, 22, 22, 34, 56, 78 Converting a Percentile to a Data Value L = ( ) k 100 n Example 4.5. Find the value of the 20 th percentile, P 20, for the following data: 2, 3, 4, 6, 7, 7, 8, 8, 9, 10, 13, 13, 14, 16, 18, 22, 22, 34, 56, 78 Example 4.6. Find the value of the 33 rd percentile, P 33, for the following data: 2, 3, 4, 6, 7, 7, 8, 8, 9, 10, 13, 13, 14, 16, 18, 22, 22, 34, 56, 78

56 Boxplot - Using Sample Percentiles Definition For a set of data, the 5-number summary consists of these five values: Minimum, Q 1, Q 2, Q 3, Maximum Example 4.7. Give the 5-number summary for the following data: 2, 3, 4, 6, 7, 7, 8, 8, 9, 10, 13, 13, 14, 16, 18, 22, 22, 34, 56, 78 Definition A boxplot is a graph of a data set that consists of a number line extending from the minimum to the maximum data value, and a box drawn at the first, second and third quartiles. Example 4.8. Construct a boxplot for the following data: 2, 3, 4, 6, 7, 7, 8, 8, 9, 10, 13, 13, 14, 16, 18, 22, 22, 34, 56, 78

57 IQR Guideline for outliers It is always important to look for data values that don t apparently fit with the rest. Potential outliers can be identifies as those data values that are less than Q IQR. greater than Q IQR. Example 4.9. Identify any potential outliers for the following data: 2, 3, 4, 6, 7, 7, 7, 8, 9, 10, 13, 13, 14, 16, 18, 22, 22, 34, 56, 78 This rule helps identify values that are far away from the central 50% of the data values Relative Distance From the Center Definition A z-score or standardized value is the number of standard deviations that a given value x is above or below the mean. A z-score is calculated as follows:

58 58 Facts about z-scores A z-score allows a comparison of distances between two distributions that are spread out in different manners. In many cases, a z-score will represent the relative distance between an observation and a distributions expected value. Large z-scores will represent observations that are far to what is expected. These observations would be considered ( Unusual, Unexpected, Surprising, Unlikely ). Small z-scores will represent observations that are close to what we expect. These observations would be considered ( Usual, Expected, Common, Likely ).

59 59 Example Two statistics classes take an exam. The distribution of the test scores looked relatively normal. Class A has a mean of 72 and a standard deviation of 3. Class B had a mean of 83 and a standard deviation of 6. Michele is in Class A. She received a score of 81. Elaine is in Class B. She received a 91. Elaine obviously has the higher overall score, but who did better with respect to their class? Does either one of them have an unusually high score compared to their class?

60 Probability distribution of a z-score The observation used in the computation of a z-score are generally the outcome of some random procedure. The observation represents the outcome of some random variable. If the probability distribution of the observation has a Normal distribution, then the z-score is a random variable, has a standard normal distribution. If X Normal(µ X, σ X ) then z = X µ X σ X Normal(0, 1) We can use this idea to make estimates about the probabilities of future events, or about proportions of a dataset. Example A sample was taken and the following histogram was made. Estimate the proportion of the data that was within 1 standard deviations of the mean. Which data values appear to be within 1 standard deviations of the mean?

61 61 Example A sample was taken and the following histogram was made. Estimate the proportion of the data that was within 1 standard deviations of the mean. Which data values appear to be within one standard deviations of the mean? Example A sample was taken and the following histogram was made. Estimate the proportion of the data that was within 2 standard deviations of the mean. Which data values appear to be within 2 standard deviations of the mean?

62 62 Example A sample was taken and the following histogram was made. Estimate the proportion of the data that was within 3 standard deviations of the mean. Which data values appear to be within 3 standard deviations of the mean? Empirical Rule: Normal Distribution Density µ 3σ µ 2σ µ 1σ µ x value µ+1σ µ+2σ µ+3σ

63 Sampling Distributions Definition The sampling distribution of a statistic is the distribution of that statistic based on a fixed sample size. Recall. The following statistics are random variables: Sample Mean x Sample Proportion ˆp Sample Standard Deviation s Remark 4.4. Many other statistics exist Central Limit Theorem Theorem 4.1. Central Limit Theorem Suppose that a random variable X has a mean µ X and a standard deviation σ X <, then the (sampling) distribution ( based on a simple random sample of size n ) of x will be: Normally distributed with mean µ X and standard deviation σ/ n, if X has a normal distribution. Approximately Normally distributed with mean µ X and standard deviation σ X / n, if the n > 30 and the distribution of X is not heavily skewed. x Normal ( ) σ µ X, n

64 64 Example The height of adult females is normally distributed with a mean of cm and a standard deviation of 8.6 cm. 1. What is the probability that a randomly selected female will be taller than 210 cm? 2. What is the probability that the average height of 25 randomly selected females will be taller than 210 cm? 3. (α =.01) What heights of females would be considered unusually tall? 4. (α =.01) If 25 women are randomly selected, what would be considered an unusually high average height?

65 65 Example Suppose that the amount of time that you will wait for a bus, at a particular bus stop, has a mean of 10 minutes with a standard deviation of 1 minute? 1. What is the probability that on a randomly selected day, you will wait longer than 12 minutes? 2. What is the probability that over 31 randomly selected days you will wait longer than 12 minutes on average? 3. (α =.05) What would be considered an unusually long wait time? 4. (α =.05) Over the course of 31 randomly selected days, what would be considered an unusually long average wait?

66 66 Corollary 4.2. If a population can be split into two disjoint groups, success and failure, and the proportion of success is equal to p and a sample of size n is taken, where np 5 and n(1 p) 5 then ( ) p(1 p) ˆp Normal p, n Example Seventy percent of a town is republican. A random sample of 100 residents will be taken. What is the probability more than 71% of those sampled will be republicans? Example A coin is flipped 25 times, what is the probability that more than 60% of the flips will be tails?

67 Chapter 5 Inference: Confidence Intervals Idea for a Confidence Interval

68 Confidence Intervals for a Single Population Definition 5.1. A Confidence Level 100(1 α)% indicates that there is a 1 α probability that a random procedure produced an acceptable result. Definition 5.2. An Interval Estimate is a range of numbers, determined by following a random procedure, used to estimate an unknown population parameter. Definition 5.3. A 100(1 α)% Confidence Interval is an Interval Estimate produced by following a procedure that correctly estimates an unknown population parameter at least 100(1 α)% of the time, i.e. the procedure has a 100(1 α)% Confidence Level.

69 69 General Procedure for Constructing a Confidence Interval for a Mean or Proportion 1. Decide how confident you want to be in your interval estimate. 2. Decide how precise you want your estimate to be. 3. Using Step 1 and Step 2, determine the necessary sample size n. 4. If necessary, revisit Step 1 and Step 2, if the sample size determined in Step 3 is too large to manage. 5. Take a sample of at least size n. 6. Compute x or ˆp. 7. Compute your margin of error E. 8. Construct your Confidence Interval. (Estimate Margin of Error, Estimate + Margin of Error) 9. State with 100(1 α)% Confidence that the unknown parameter is captured by the confidence interval.

70 Confidence Interval for a Population Mean One possible way to produce a confidence interval for a mean. However, it is unrealistic. It assumes that we know a population standard deviation x z α 2 σ n < µ < x + z α 2 σ n z α 2 z = x µ σ/ n 0 z α 2

71 71 Real Life We don t know the distribution. In real life, we don t know σ. We estimate σ with s. We estimate the z-score with a t-score: t = x µ s/ n t = x µ s/ n t α 0 t α (1 α)% Confidence Interval for µ x t α 2 s n < µ < x + t α 2 s n

72 Confidence Interval for a Population Proportion In a similar manner to the mean, we can make an estimate for a population proportion. p(1 p) p(1 p) ˆp z α 2 n < p < ˆp + z α 2 n z α 2 z = ˆp p p(1 p) n 0 z α 2 We ended with a method for estimating the unknown population proportion p. This has the problem that we need to know the population proportion in order to estimate the population proportion. 100(1 α)% Confidence Interval for p ˆp(1 ˆp) ˆp(1 ˆp) ˆp z α 2 n < p < ˆp + z α 2 n

73 Examples Example 5.1. Twelve leaves were randomly selected from the ground below a single tree and their length (cm) was measured. Use the following information to estimate the mean length of all leaves found under this tree. (95% Confidence) x = s = Normal Q Q Plot Histogram of Data Sample Quantiles Frequency Theoretical Quantiles Data

74 74 Example 5.2. A survey of 17 randomly selected UTM students was conducted. (Not really) They were each asked if they had ever seen an episode of The Walking Dead. Their responses are recorded below. A 1 indicates that they said yes. A 0 indicates that they said no. Estimate with 99% Confidence the true proportion of UTM students that have seen an episode of The Walking Dead

75 Precision A short Confidence Interval gives a more precise estimate for the unknown population parameter. Precision is controlled by three things: The desired and acceptable precision The Confidence Level The Sample Size Example 5.3. A moving company is asked to move 10,000 identical blocks. The moving company wants to know how much each box weighs in order to determine what equipment is needed to move the blocks. The owner of the blocks knows that they all weigh about the same amount. Which would be a more useful guess? Between 2 and 300 pounds; Between 30 and 40 pounds.

76 76 Sample Size for Estimating a Population Mean ( zα/2 σ ) 2 n = ( round up ) E where σ is the known population standard deviation, an estimate of the population standard deviation taken from a previous study, estimated using the range rule of thumb, Sample Size for Estimating a Population Proportion When an estimate of p is known: n = ˆp(1 ˆp) ( zα/2 E ) 2 ( round up ) When an estimate of p is unknown: n = 0.25 ( zα/2 E ) 2 ( round up )

77 77 Example 5.4. You want to estimate the mean SAT score of all college applicants. Possible SAT scores range from 600 to How many scores must be sampled if you would like to estimate the population mean score to within 100 points with 98% confidence? Example 5.5. Find the sample size needed to estimate the percentage of Republicans among registered voters in California to within 3 percentage points with 90% confidence. Example 5.6. A prior Pew Research Center report suggests that 15% of adults have consulted fortune tellers. Determine the sample size necessary to estimate the percentage of adults that consult fortune tellers within 3 percentage points with 98% confidence.

Elementary Statistics

Elementary Statistics Elementary Statistics Q: What is data? Q: What does the data look like? Q: What conclusions can we draw from the data? Q: Where is the middle of the data? Q: Why is the spread of the data important? Q:

More information

STAT 200 Chapter 1 Looking at Data - Distributions

STAT 200 Chapter 1 Looking at Data - Distributions STAT 200 Chapter 1 Looking at Data - Distributions What is Statistics? Statistics is a science that involves the design of studies, data collection, summarizing and analyzing the data, interpreting the

More information

2.6 Tools for Counting sample points

2.6 Tools for Counting sample points 2.6 Tools for Counting sample points When the number of simple events in S is too large, manual enumeration of every sample point in S is tedious or even impossible. (Example) If S contains N equiprobable

More information

Glossary for the Triola Statistics Series

Glossary for the Triola Statistics Series Glossary for the Triola Statistics Series Absolute deviation The measure of variation equal to the sum of the deviations of each value from the mean, divided by the number of values Acceptance sampling

More information

AP Final Review II Exploring Data (20% 30%)

AP Final Review II Exploring Data (20% 30%) AP Final Review II Exploring Data (20% 30%) Quantitative vs Categorical Variables Quantitative variables are numerical values for which arithmetic operations such as means make sense. It is usually a measure

More information

Unit 4 Probability. Dr Mahmoud Alhussami

Unit 4 Probability. Dr Mahmoud Alhussami Unit 4 Probability Dr Mahmoud Alhussami Probability Probability theory developed from the study of games of chance like dice and cards. A process like flipping a coin, rolling a die or drawing a card from

More information

Chapter 2: Tools for Exploring Univariate Data

Chapter 2: Tools for Exploring Univariate Data Stats 11 (Fall 2004) Lecture Note Introduction to Statistical Methods for Business and Economics Instructor: Hongquan Xu Chapter 2: Tools for Exploring Univariate Data Section 2.1: Introduction What is

More information

Name: Exam 2 Solutions. March 13, 2017

Name: Exam 2 Solutions. March 13, 2017 Department of Mathematics University of Notre Dame Math 00 Finite Math Spring 07 Name: Instructors: Conant/Galvin Exam Solutions March, 07 This exam is in two parts on pages and contains problems worth

More information

MATH 1150 Chapter 2 Notation and Terminology

MATH 1150 Chapter 2 Notation and Terminology MATH 1150 Chapter 2 Notation and Terminology Categorical Data The following is a dataset for 30 randomly selected adults in the U.S., showing the values of two categorical variables: whether or not the

More information

Statistics 100 Exam 2 March 8, 2017

Statistics 100 Exam 2 March 8, 2017 STAT 100 EXAM 2 Spring 2017 (This page is worth 1 point. Graded on writing your name and net id clearly and circling section.) PRINT NAME (Last name) (First name) net ID CIRCLE SECTION please! L1 (MWF

More information

STT 315 This lecture is based on Chapter 2 of the textbook.

STT 315 This lecture is based on Chapter 2 of the textbook. STT 315 This lecture is based on Chapter 2 of the textbook. Acknowledgement: Author is thankful to Dr. Ashok Sinha, Dr. Jennifer Kaplan and Dr. Parthanil Roy for allowing him to use/edit some of their

More information

M & M Project. Think! Crunch those numbers! Answer!

M & M Project. Think! Crunch those numbers! Answer! M & M Project Think! Crunch those numbers! Answer! Chapters 1-2 Exploring Data and Describing Location in a Distribution Univariate Data: Length Stemplot and Frequency Table Stem (Units Digit) 0 1 1 Leaf

More information

1-1. Chapter 1. Sampling and Descriptive Statistics by The McGraw-Hill Companies, Inc. All rights reserved.

1-1. Chapter 1. Sampling and Descriptive Statistics by The McGraw-Hill Companies, Inc. All rights reserved. 1-1 Chapter 1 Sampling and Descriptive Statistics 1-2 Why Statistics? Deal with uncertainty in repeated scientific measurements Draw conclusions from data Design valid experiments and draw reliable conclusions

More information

Section 1.1. Data - Collections of observations (such as measurements, genders, survey responses, etc.)

Section 1.1. Data - Collections of observations (such as measurements, genders, survey responses, etc.) Section 1.1 Statistics - The science of planning studies and experiments, obtaining data, and then organizing, summarizing, presenting, analyzing, interpreting, and drawing conclusions based on the data.

More information

Sets and Set notation. Algebra 2 Unit 8 Notes

Sets and Set notation. Algebra 2 Unit 8 Notes Sets and Set notation Section 11-2 Probability Experimental Probability experimental probability of an event: Theoretical Probability number of time the event occurs P(event) = number of trials Sample

More information

are the objects described by a set of data. They may be people, animals or things.

are the objects described by a set of data. They may be people, animals or things. ( c ) E p s t e i n, C a r t e r a n d B o l l i n g e r 2016 C h a p t e r 5 : E x p l o r i n g D a t a : D i s t r i b u t i o n s P a g e 1 CHAPTER 5: EXPLORING DATA DISTRIBUTIONS 5.1 Creating Histograms

More information

Math 223 Lecture Notes 3/15/04 From The Basic Practice of Statistics, bymoore

Math 223 Lecture Notes 3/15/04 From The Basic Practice of Statistics, bymoore Math 223 Lecture Notes 3/15/04 From The Basic Practice of Statistics, bymoore Chapter 3 continued Describing distributions with numbers Measuring spread of data: Quartiles Definition 1: The interquartile

More information

The point value of each problem is in the left-hand margin. You must show your work to receive any credit, except in problem 1. Work neatly.

The point value of each problem is in the left-hand margin. You must show your work to receive any credit, except in problem 1. Work neatly. Introduction to Statistics Math 1040 Sample Final Exam - Chapters 1-11 6 Problem Pages Time Limit: 1 hour and 50 minutes Open Textbook Calculator Allowed: Scientific Name: The point value of each problem

More information

Basic Concepts of Probability. Section 3.1 Basic Concepts of Probability. Probability Experiments. Chapter 3 Probability

Basic Concepts of Probability. Section 3.1 Basic Concepts of Probability. Probability Experiments. Chapter 3 Probability Chapter 3 Probability 3.1 Basic Concepts of Probability 3.2 Conditional Probability and the Multiplication Rule 3.3 The Addition Rule 3.4 Additional Topics in Probability and Counting Section 3.1 Basic

More information

Properties of Probability

Properties of Probability Econ 325 Notes on Probability 1 By Hiro Kasahara Properties of Probability In statistics, we consider random experiments, experiments for which the outcome is random, i.e., cannot be predicted with certainty.

More information

CHAPTER 5: EXPLORING DATA DISTRIBUTIONS. Individuals are the objects described by a set of data. These individuals may be people, animals or things.

CHAPTER 5: EXPLORING DATA DISTRIBUTIONS. Individuals are the objects described by a set of data. These individuals may be people, animals or things. (c) Epstein 2013 Chapter 5: Exploring Data Distributions Page 1 CHAPTER 5: EXPLORING DATA DISTRIBUTIONS 5.1 Creating Histograms Individuals are the objects described by a set of data. These individuals

More information

Class 26: review for final exam 18.05, Spring 2014

Class 26: review for final exam 18.05, Spring 2014 Probability Class 26: review for final eam 8.05, Spring 204 Counting Sets Inclusion-eclusion principle Rule of product (multiplication rule) Permutation and combinations Basics Outcome, sample space, event

More information

Probability Experiments, Trials, Outcomes, Sample Spaces Example 1 Example 2

Probability Experiments, Trials, Outcomes, Sample Spaces Example 1 Example 2 Probability Probability is the study of uncertain events or outcomes. Games of chance that involve rolling dice or dealing cards are one obvious area of application. However, probability models underlie

More information

Chapter 1. Looking at Data

Chapter 1. Looking at Data Chapter 1 Looking at Data Types of variables Looking at Data Be sure that each variable really does measure what you want it to. A poor choice of variables can lead to misleading conclusions!! For example,

More information

Probabilistic models

Probabilistic models Kolmogorov (Andrei Nikolaevich, 1903 1987) put forward an axiomatic system for probability theory. Foundations of the Calculus of Probabilities, published in 1933, immediately became the definitive formulation

More information

Review for Exam #1. Chapter 1. The Nature of Data. Definitions. Population. Sample. Quantitative data. Qualitative (attribute) data

Review for Exam #1. Chapter 1. The Nature of Data. Definitions. Population. Sample. Quantitative data. Qualitative (attribute) data Review for Exam #1 1 Chapter 1 Population the complete collection of elements (scores, people, measurements, etc.) to be studied Sample a subcollection of elements drawn from a population 11 The Nature

More information

Salt Lake Community College MATH 1040 Final Exam Fall Semester 2011 Form E

Salt Lake Community College MATH 1040 Final Exam Fall Semester 2011 Form E Salt Lake Community College MATH 1040 Final Exam Fall Semester 011 Form E Name Instructor Time Limit: 10 minutes Any hand-held calculator may be used. Computers, cell phones, or other communication devices

More information

3 PROBABILITY TOPICS

3 PROBABILITY TOPICS Chapter 3 Probability Topics 135 3 PROBABILITY TOPICS Figure 3.1 Meteor showers are rare, but the probability of them occurring can be calculated. (credit: Navicore/flickr) Introduction It is often necessary

More information

4. Suppose that we roll two die and let X be equal to the maximum of the two rolls. Find P (X {1, 3, 5}) and draw the PMF for X.

4. Suppose that we roll two die and let X be equal to the maximum of the two rolls. Find P (X {1, 3, 5}) and draw the PMF for X. Math 10B with Professor Stankova Worksheet, Midterm #2; Wednesday, 3/21/2018 GSI name: Roy Zhao 1 Problems 1.1 Bayes Theorem 1. Suppose a test is 99% accurate and 1% of people have a disease. What is the

More information

QUIZ 1 (CHAPTERS 1-4) SOLUTIONS MATH 119 FALL 2012 KUNIYUKI 105 POINTS TOTAL, BUT 100 POINTS

QUIZ 1 (CHAPTERS 1-4) SOLUTIONS MATH 119 FALL 2012 KUNIYUKI 105 POINTS TOTAL, BUT 100 POINTS QUIZ 1 (CHAPTERS 1-4) SOLUTIONS MATH 119 FALL 2012 KUNIYUKI 105 POINTS TOTAL, BUT 100 POINTS = 100% Show all work, simplify as appropriate, and use good form and procedure (as in class). Box in your final

More information

3.2 Probability Rules

3.2 Probability Rules 3.2 Probability Rules The idea of probability rests on the fact that chance behavior is predictable in the long run. In the last section, we used simulation to imitate chance behavior. Do we always need

More information

2. AXIOMATIC PROBABILITY

2. AXIOMATIC PROBABILITY IA Probability Lent Term 2. AXIOMATIC PROBABILITY 2. The axioms The formulation for classical probability in which all outcomes or points in the sample space are equally likely is too restrictive to develop

More information

Men. Women. Men. Men. Women. Women

Men. Women. Men. Men. Women. Women Math 203 Topics for second exam Statistics: the science of data Chapter 5: Producing data Statistics is all about drawing conclusions about the opinions/behavior/structure of large populations based on

More information

Further Mathematics 2018 CORE: Data analysis Chapter 2 Summarising numerical data

Further Mathematics 2018 CORE: Data analysis Chapter 2 Summarising numerical data Chapter 2: Summarising numerical data Further Mathematics 2018 CORE: Data analysis Chapter 2 Summarising numerical data Extract from Study Design Key knowledge Types of data: categorical (nominal and ordinal)

More information

Math 10 - Compilation of Sample Exam Questions + Answers

Math 10 - Compilation of Sample Exam Questions + Answers Math 10 - Compilation of Sample Exam Questions + Sample Exam Question 1 We have a population of size N. Let p be the independent probability of a person in the population developing a disease. Answer the

More information

Intermediate Math Circles November 8, 2017 Probability II

Intermediate Math Circles November 8, 2017 Probability II Intersection of Events and Independence Consider two groups of pairs of events Intermediate Math Circles November 8, 017 Probability II Group 1 (Dependent Events) A = {a sales associate has training} B

More information

If S = {O 1, O 2,, O n }, where O i is the i th elementary outcome, and p i is the probability of the i th elementary outcome, then

If S = {O 1, O 2,, O n }, where O i is the i th elementary outcome, and p i is the probability of the i th elementary outcome, then 1.1 Probabilities Def n: A random experiment is a process that, when performed, results in one and only one of many observations (or outcomes). The sample space S is the set of all elementary outcomes

More information

TOPIC: Descriptive Statistics Single Variable

TOPIC: Descriptive Statistics Single Variable TOPIC: Descriptive Statistics Single Variable I. Numerical data summary measurements A. Measures of Location. Measures of central tendency Mean; Median; Mode. Quantiles - measures of noncentral tendency

More information

STAT/SOC/CSSS 221 Statistical Concepts and Methods for the Social Sciences. Random Variables

STAT/SOC/CSSS 221 Statistical Concepts and Methods for the Social Sciences. Random Variables STAT/SOC/CSSS 221 Statistical Concepts and Methods for the Social Sciences Random Variables Christopher Adolph Department of Political Science and Center for Statistics and the Social Sciences University

More information

STP 420 INTRODUCTION TO APPLIED STATISTICS NOTES

STP 420 INTRODUCTION TO APPLIED STATISTICS NOTES INTRODUCTION TO APPLIED STATISTICS NOTES PART - DATA CHAPTER LOOKING AT DATA - DISTRIBUTIONS Individuals objects described by a set of data (people, animals, things) - all the data for one individual make

More information

Lecture 8: Conditional probability I: definition, independence, the tree method, sampling, chain rule for independent events

Lecture 8: Conditional probability I: definition, independence, the tree method, sampling, chain rule for independent events Lecture 8: Conditional probability I: definition, independence, the tree method, sampling, chain rule for independent events Discrete Structures II (Summer 2018) Rutgers University Instructor: Abhishek

More information

AP Statistics Cumulative AP Exam Study Guide

AP Statistics Cumulative AP Exam Study Guide AP Statistics Cumulative AP Eam Study Guide Chapters & 3 - Graphs Statistics the science of collecting, analyzing, and drawing conclusions from data. Descriptive methods of organizing and summarizing statistics

More information

REVIEW: Midterm Exam. Spring 2012

REVIEW: Midterm Exam. Spring 2012 REVIEW: Midterm Exam Spring 2012 Introduction Important Definitions: - Data - Statistics - A Population - A census - A sample Types of Data Parameter (Describing a characteristic of the Population) Statistic

More information

Compound Events. The event E = E c (the complement of E) is the event consisting of those outcomes which are not in E.

Compound Events. The event E = E c (the complement of E) is the event consisting of those outcomes which are not in E. Compound Events Because we are using the framework of set theory to analyze probability, we can use unions, intersections and complements to break complex events into compositions of events for which it

More information

CHAPTER 3 PROBABILITY TOPICS

CHAPTER 3 PROBABILITY TOPICS CHAPTER 3 PROBABILITY TOPICS 1. Terminology In this chapter, we are interested in the probability of a particular event occurring when we conduct an experiment. The sample space of an experiment is the

More information

MAT2377. Ali Karimnezhad. Version September 9, Ali Karimnezhad

MAT2377. Ali Karimnezhad. Version September 9, Ali Karimnezhad MAT2377 Ali Karimnezhad Version September 9, 2015 Ali Karimnezhad Comments These slides cover material from Chapter 1. In class, I may use a blackboard. I recommend reading these slides before you come

More information

Section 4.2 Basic Concepts of Probability

Section 4.2 Basic Concepts of Probability Section 4.2 Basic Concepts of Probability 2012 Pearson Education, Inc. All rights reserved. 1 of 88 Section 4.2 Objectives Identify the sample space of a probability experiment Identify simple events Use

More information

M 225 Test 1 B Name SHOW YOUR WORK FOR FULL CREDIT! Problem Max. Points Your Points Total 75

M 225 Test 1 B Name SHOW YOUR WORK FOR FULL CREDIT! Problem Max. Points Your Points Total 75 M 225 Test 1 B Name SHOW YOUR WORK FOR FULL CREDIT! Problem Max. Points Your Points 1-13 13 14 3 15 8 16 4 17 10 18 9 19 7 20 3 21 16 22 2 Total 75 1 Multiple choice questions (1 point each) 1. Look at

More information

Introduction to Statistics

Introduction to Statistics Introduction to Statistics Data and Statistics Data consists of information coming from observations, counts, measurements, or responses. Statistics is the science of collecting, organizing, analyzing,

More information

PROBABILITY.

PROBABILITY. PROBABILITY PROBABILITY(Basic Terminology) Random Experiment: If in each trial of an experiment conducted under identical conditions, the outcome is not unique, but may be any one of the possible outcomes,

More information

University of Jordan Fall 2009/2010 Department of Mathematics

University of Jordan Fall 2009/2010 Department of Mathematics handouts Part 1 (Chapter 1 - Chapter 5) University of Jordan Fall 009/010 Department of Mathematics Chapter 1 Introduction to Introduction; Some Basic Concepts Statistics is a science related to making

More information

QUIZ 1 (CHAPTERS 1-4) SOLUTIONS MATH 119 SPRING 2013 KUNIYUKI 105 POINTS TOTAL, BUT 100 POINTS = 100%

QUIZ 1 (CHAPTERS 1-4) SOLUTIONS MATH 119 SPRING 2013 KUNIYUKI 105 POINTS TOTAL, BUT 100 POINTS = 100% QUIZ 1 (CHAPTERS 1-4) SOLUTIONS MATH 119 SPRING 2013 KUNIYUKI 105 POINTS TOTAL, BUT 100 POINTS = 100% 1) (6 points). A college has 32 course sections in math. A frequency table for the numbers of students

More information

Exercises from Chapter 3, Section 1

Exercises from Chapter 3, Section 1 Exercises from Chapter 3, Section 1 1. Consider the following sample consisting of 20 numbers. (a) Find the mode of the data 21 23 24 24 25 26 29 30 32 34 39 41 41 41 42 43 48 51 53 53 (b) Find the median

More information

Chapter 26: Comparing Counts (Chi Square)

Chapter 26: Comparing Counts (Chi Square) Chapter 6: Comparing Counts (Chi Square) We ve seen that you can turn a qualitative variable into a quantitative one (by counting the number of successes and failures), but that s a compromise it forces

More information

ACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 2 MATH00040 SEMESTER / Probability

ACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 2 MATH00040 SEMESTER / Probability ACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 2 MATH00040 SEMESTER 2 2017/2018 DR. ANTHONY BROWN 5.1. Introduction to Probability. 5. Probability You are probably familiar with the elementary

More information

MATH 10 INTRODUCTORY STATISTICS

MATH 10 INTRODUCTORY STATISTICS MATH 10 INTRODUCTORY STATISTICS Tommy Khoo Your friendly neighbourhood graduate student. Week 1 Chapter 1 Introduction What is Statistics? Why do you need to know Statistics? Technical lingo and concepts:

More information

Probabilistic models

Probabilistic models Probabilistic models Kolmogorov (Andrei Nikolaevich, 1903 1987) put forward an axiomatic system for probability theory. Foundations of the Calculus of Probabilities, published in 1933, immediately became

More information

Topic 3: Introduction to Statistics. Algebra 1. Collecting Data. Table of Contents. Categorical or Quantitative? What is the Study of Statistics?!

Topic 3: Introduction to Statistics. Algebra 1. Collecting Data. Table of Contents. Categorical or Quantitative? What is the Study of Statistics?! Topic 3: Introduction to Statistics Collecting Data We collect data through observation, surveys and experiments. We can collect two different types of data: Categorical Quantitative Algebra 1 Table of

More information

You may use your calculator and a single page of notes. The room is crowded. Please be careful to look only at your own exam.

You may use your calculator and a single page of notes. The room is crowded. Please be careful to look only at your own exam. LAST NAME (Please Print): KEY FIRST NAME (Please Print): HONOR PLEDGE (Please Sign): Statistics 111 Midterm 1 This is a closed book exam. You may use your calculator and a single page of notes. The room

More information

DSST Principles of Statistics

DSST Principles of Statistics DSST Principles of Statistics Time 10 Minutes 98 Questions Each incomplete statement is followed by four suggested completions. Select the one that is best in each case. 1. Which of the following variables

More information

Learning Objectives for Stat 225

Learning Objectives for Stat 225 Learning Objectives for Stat 225 08/20/12 Introduction to Probability: Get some general ideas about probability, and learn how to use sample space to compute the probability of a specific event. Set Theory:

More information

Chapter 8: Confidence Intervals

Chapter 8: Confidence Intervals Chapter 8: Confidence Intervals Introduction Suppose you are trying to determine the mean rent of a two-bedroom apartment in your town. You might look in the classified section of the newspaper, write

More information

Keystone Exams: Algebra

Keystone Exams: Algebra KeystoneExams:Algebra TheKeystoneGlossaryincludestermsanddefinitionsassociatedwiththeKeystoneAssessmentAnchorsand Eligible Content. The terms and definitions included in the glossary are intended to assist

More information

What is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty.

What is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty. What is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty. Statistics is a field of study concerned with the data collection,

More information

4. Probability of an event A for equally likely outcomes:

4. Probability of an event A for equally likely outcomes: University of California, Los Angeles Department of Statistics Statistics 110A Instructor: Nicolas Christou Probability Probability: A measure of the chance that something will occur. 1. Random experiment:

More information

LC OL - Statistics. Types of Data

LC OL - Statistics. Types of Data LC OL - Statistics Types of Data Question 1 Characterise each of the following variables as numerical or categorical. In each case, list any three possible values for the variable. (i) Eye colours in a

More information

(6, 1), (5, 2), (4, 3), (3, 4), (2, 5), (1, 6)

(6, 1), (5, 2), (4, 3), (3, 4), (2, 5), (1, 6) Section 7.3: Compound Events Because we are using the framework of set theory to analyze probability, we can use unions, intersections and complements to break complex events into compositions of events

More information

Statistical Theory 1

Statistical Theory 1 Statistical Theory 1 Set Theory and Probability Paolo Bautista September 12, 2017 Set Theory We start by defining terms in Set Theory which will be used in the following sections. Definition 1 A set is

More information

Sampling, Frequency Distributions, and Graphs (12.1)

Sampling, Frequency Distributions, and Graphs (12.1) 1 Sampling, Frequency Distributions, and Graphs (1.1) Design: Plan how to obtain the data. What are typical Statistical Methods? Collect the data, which is then subjected to statistical analysis, which

More information

Course: ESO-209 Home Work: 1 Instructor: Debasis Kundu

Course: ESO-209 Home Work: 1 Instructor: Debasis Kundu Home Work: 1 1. Describe the sample space when a coin is tossed (a) once, (b) three times, (c) n times, (d) an infinite number of times. 2. A coin is tossed until for the first time the same result appear

More information

Random processes. Lecture 17: Probability, Part 1. Probability. Law of large numbers

Random processes. Lecture 17: Probability, Part 1. Probability. Law of large numbers Random processes Lecture 17: Probability, Part 1 Statistics 10 Colin Rundel March 26, 2012 A random process is a situation in which we know what outcomes could happen, but we don t know which particular

More information

Basic Concepts of Probability

Basic Concepts of Probability Probability Probability theory is the branch of math that deals with random events Probability is used to describe how likely a particular outcome is in a random event the probability of obtaining heads

More information

Math P (A 1 ) =.5, P (A 2 ) =.6, P (A 1 A 2 ) =.9r

Math P (A 1 ) =.5, P (A 2 ) =.6, P (A 1 A 2 ) =.9r Math 3070 1. Treibergs σιι First Midterm Exam Name: SAMPLE January 31, 2000 (1. Compute the sample mean x and sample standard deviation s for the January mean temperatures (in F for Seattle from 1900 to

More information

Vocabulary: Samples and Populations

Vocabulary: Samples and Populations Vocabulary: Samples and Populations Concept Different types of data Categorical data results when the question asked in a survey or sample can be answered with a nonnumerical answer. For example if we

More information

Lesson One Hundred and Sixty-One Normal Distribution for some Resolution

Lesson One Hundred and Sixty-One Normal Distribution for some Resolution STUDENT MANUAL ALGEBRA II / LESSON 161 Lesson One Hundred and Sixty-One Normal Distribution for some Resolution Today we re going to continue looking at data sets and how they can be represented in different

More information

MATH 10 INTRODUCTORY STATISTICS

MATH 10 INTRODUCTORY STATISTICS MATH 10 INTRODUCTORY STATISTICS Ramesh Yapalparvi Week 2 Chapter 4 Bivariate Data Data with two/paired variables, Pearson correlation coefficient and its properties, general variance sum law Chapter 6

More information

Math 1040 Final Exam Form A Introduction to Statistics Fall Semester 2010

Math 1040 Final Exam Form A Introduction to Statistics Fall Semester 2010 Math 1040 Final Exam Form A Introduction to Statistics Fall Semester 2010 Instructor Name Time Limit: 120 minutes Any calculator is okay. Necessary tables and formulas are attached to the back of the exam.

More information

1 True/False. Math 10B with Professor Stankova Worksheet, Discussion #9; Thursday, 2/15/2018 GSI name: Roy Zhao

1 True/False. Math 10B with Professor Stankova Worksheet, Discussion #9; Thursday, 2/15/2018 GSI name: Roy Zhao Math 10B with Professor Stankova Worksheet, Discussion #9; Thursday, 2/15/2018 GSI name: Roy Zhao 1 True/False 1. True False When we solve a problem one way, it is not useful to try to solve it in a second

More information

The probability of an event is viewed as a numerical measure of the chance that the event will occur.

The probability of an event is viewed as a numerical measure of the chance that the event will occur. Chapter 5 This chapter introduces probability to quantify randomness. Section 5.1: How Can Probability Quantify Randomness? The probability of an event is viewed as a numerical measure of the chance that

More information

(a) Fill in the missing probabilities in the table. (b) Calculate P(F G). (c) Calculate P(E c ). (d) Is this a uniform sample space?

(a) Fill in the missing probabilities in the table. (b) Calculate P(F G). (c) Calculate P(E c ). (d) Is this a uniform sample space? Math 166 Exam 1 Review Sections L.1-L.2, 1.1-1.7 Note: This review is more heavily weighted on the new material this week: Sections 1.5-1.7. For more practice problems on previous material, take a look

More information

Example. If 4 tickets are drawn with replacement from ,

Example. If 4 tickets are drawn with replacement from , Example. If 4 tickets are drawn with replacement from 1 2 2 4 6, what are the chances that we observe exactly two 2 s? Exactly two 2 s in a sequence of four draws can occur in many ways. For example, (

More information

Chapter. Probability

Chapter. Probability Chapter 3 Probability Section 3.1 Basic Concepts of Probability Section 3.1 Objectives Identify the sample space of a probability experiment Identify simple events Use the Fundamental Counting Principle

More information

Chapter 2 Class Notes

Chapter 2 Class Notes Chapter 2 Class Notes Probability can be thought of in many ways, for example as a relative frequency of a long series of trials (e.g. flips of a coin or die) Another approach is to let an expert (such

More information

Problem # Number of points 1 /20 2 /20 3 /20 4 /20 5 /20 6 /20 7 /20 8 /20 Total /150

Problem # Number of points 1 /20 2 /20 3 /20 4 /20 5 /20 6 /20 7 /20 8 /20 Total /150 Name Student ID # Instructor: SOLUTION Sergey Kirshner STAT 516 Fall 09 Practice Midterm #1 January 31, 2010 You are not allowed to use books or notes. Non-programmable non-graphic calculators are permitted.

More information

Example. What is the sample space for flipping a fair coin? Rolling a 6-sided die? Find the event E where E = {x x has exactly one head}

Example. What is the sample space for flipping a fair coin? Rolling a 6-sided die? Find the event E where E = {x x has exactly one head} Chapter 7 Notes 1 (c) Epstein, 2013 CHAPTER 7: PROBABILITY 7.1: Experiments, Sample Spaces and Events Chapter 7 Notes 2 (c) Epstein, 2013 What is the sample space for flipping a fair coin three times?

More information

Math 140 Introductory Statistics

Math 140 Introductory Statistics Math 140 Introductory Statistics Professor Silvia Fernández Chapter 2 Based on the book Statistics in Action by A. Watkins, R. Scheaffer, and G. Cobb. Visualizing Distributions Recall the definition: The

More information

Math 140 Introductory Statistics

Math 140 Introductory Statistics Visualizing Distributions Math 140 Introductory Statistics Professor Silvia Fernández Chapter Based on the book Statistics in Action by A. Watkins, R. Scheaffer, and G. Cobb. Recall the definition: The

More information

Outline Conditional Probability The Law of Total Probability and Bayes Theorem Independent Events. Week 4 Classical Probability, Part II

Outline Conditional Probability The Law of Total Probability and Bayes Theorem Independent Events. Week 4 Classical Probability, Part II Week 4 Classical Probability, Part II Week 4 Objectives This week we continue covering topics from classical probability. The notion of conditional probability is presented first. Important results/tools

More information

Counting principles, including permutations and combinations.

Counting principles, including permutations and combinations. 1 Counting principles, including permutations and combinations. The binomial theorem: expansion of a + b n, n ε N. THE PRODUCT RULE If there are m different ways of performing an operation and for each

More information

Example. χ 2 = Continued on the next page. All cells

Example. χ 2 = Continued on the next page. All cells Section 11.1 Chi Square Statistic k Categories 1 st 2 nd 3 rd k th Total Observed Frequencies O 1 O 2 O 3 O k n Expected Frequencies E 1 E 2 E 3 E k n O 1 + O 2 + O 3 + + O k = n E 1 + E 2 + E 3 + + E

More information

Chapter 4. Displaying and Summarizing. Quantitative Data

Chapter 4. Displaying and Summarizing. Quantitative Data STAT 141 Introduction to Statistics Chapter 4 Displaying and Summarizing Quantitative Data Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter 2015 1 / 31 4.1 Histograms 1 We divide the range

More information

Additional practice with these ideas can be found in the problems for Tintle Section P.1.1

Additional practice with these ideas can be found in the problems for Tintle Section P.1.1 Psych 10 / Stats 60, Practice Problem Set 3 (Week 3 Material) Part 1: Decide if each variable below is quantitative, ordinal, or categorical. If the variable is categorical, also decide whether or not

More information

Chapter 7: Statistics Describing Data. Chapter 7: Statistics Describing Data 1 / 27

Chapter 7: Statistics Describing Data. Chapter 7: Statistics Describing Data 1 / 27 Chapter 7: Statistics Describing Data Chapter 7: Statistics Describing Data 1 / 27 Categorical Data Four ways to display categorical data: 1 Frequency and Relative Frequency Table 2 Bar graph (Pareto chart)

More information

What is statistics? Statistics is the science of: Collecting information. Organizing and summarizing the information collected

What is statistics? Statistics is the science of: Collecting information. Organizing and summarizing the information collected What is statistics? Statistics is the science of: Collecting information Organizing and summarizing the information collected Analyzing the information collected in order to draw conclusions Two types

More information

PRACTICE PROBLEMS FOR EXAM 1

PRACTICE PROBLEMS FOR EXAM 1 PRACTICE PROBLEMS FOR EXAM 1 Math 3160Q Spring 01 Professor Hohn Below is a list of practice questions for Exam 1. Any quiz, homework, or example problem has a chance of being on the exam. For more practice,

More information

Part 3: Parametric Models

Part 3: Parametric Models Part 3: Parametric Models Matthew Sperrin and Juhyun Park August 19, 2008 1 Introduction There are three main objectives to this section: 1. To introduce the concepts of probability and random variables.

More information

Ch 14 Randomness and Probability

Ch 14 Randomness and Probability Ch 14 Randomness and Probability We ll begin a new part: randomness and probability. This part contain 4 chapters: 14-17. Why we need to learn this part? Probability is not a portion of statistics. Instead

More information

Units. Exploratory Data Analysis. Variables. Student Data

Units. Exploratory Data Analysis. Variables. Student Data Units Exploratory Data Analysis Bret Larget Departments of Botany and of Statistics University of Wisconsin Madison Statistics 371 13th September 2005 A unit is an object that can be measured, such as

More information

The area under a probability density curve between any two values a and b has two interpretations:

The area under a probability density curve between any two values a and b has two interpretations: Chapter 7 7.1 The Standard Normal Curve Introduction Probability density curve: The area under a probability density curve between any two values a and b has two interpretations: 1. 2. The region above

More information

1 Basic continuous random variable problems

1 Basic continuous random variable problems Name M362K Final Here are problems concerning material from Chapters 5 and 6. To review the other chapters, look over previous practice sheets for the two exams, previous quizzes, previous homeworks and

More information