Introduction to Statistics

Size: px
Start display at page:

Download "Introduction to Statistics"

Transcription

1 Introduction to Statistics Class Overheads for APA 3381 Measurement and Data Analysis in Human Kinetics by D. Gordon E. Robertson, PhD, FCSB School of Human Kinetics University of Ottawa Copyright D.G.E. Robertson, September 2015

2 Introduction to Statistics 2 Parameter: measurable characteristic of a population. Population: all members of a definable group. For statistical purposes a population must have definable characteristics even if it is not possible to measure the variable or even count the number of members in the population. Sample: subset or subgroup of a population. Usually obtained by random sampling of a single population. Statistic: measurable characteristic of a sample. E.g., height, weight, political affiliation, ethnicity, aerobic capacity, strength, power,... Data or Data Set: collection of numerical and/or nonnumerical values (plural of datum). Datum: single measured value (singular of data).

3 Statistics 3 Statistics: 1. plural of statistic, 2. science of conducting studies to collect, organize, summarize, analyze and draw conclusions from data. Descriptive statistics: collection, description, organization, presentation and analysis of data. Inferential statistics: generalizing from samples to populations, testing of hypotheses, determining relationships among variables and making decisions, uses probability theory to make decisions. Hypothesis: less than a thesis, a testable conjecture based on a theory. Thesis: a dissertation or learned argument which defends a particular proposition or theory. Qualitative measurements: typically non-numerical, subjectively measured, judgmentally determined, categorical. E.g., religious affiliation, teacher/professor evaluations, emotional states, flavour, gender. Quantitative measurements: typically numerical, objectively measured, reliability (repeatability or precision) and validity (accuracy) can be evaluated against a criterion. E.g., salary, course grade, foot size, IQ, age, girth.

4 Types of quantitative measures: Constants: quantities with fixed characteristics. Physical constants: G, c, h (Planck s constant) Mathematical constants:, e, i Variables: quantities whose characteristics vary. Discrete variables: numerical variables that have finitely many possibilities (usually integers), countable many possible values Examples: value of $ bills or coins, card count Continuous variables: numerical variables that have infinitely many possible values within a range of values (numbers between 1 and +1) or unbounded (Real numbers, numbers greater than 0). Examples: height, duration, angle (only a fixed number of significant figures are reported). Significant Figures: When reporting numerical information, especially when obtained by a calculator, usually only 3 or 4 digits are required. The general rule that is accurate to 0.5% holds that only 4 significant figures are needed if the first nonzero number is a 1 and 3 when it is not. Examples: , 1.234, 2.45, , , 8910, and Exceptions are frequencies and counts when all digits are reported and financial numbers, hich are too nearest dollar or nearest cent depending on the amount. 4

5 Measurement Scales 5 Nominal: classifies data into mutually exclusive (nonoverlapping), exhaustive categories in which no ordering or ranking of the categories is implied. E.g., colour, flavour, religion, gender, sex, nationality, county of residence, postal code. Ordinal: classifies data into categories that can be ordered or ranked (highest to lowest or vice versa), precise differences between categories does not exist. E.g., teaching evaluations, letter grade (A+, A, A,... F), judges scores (0 10), preferences (polls), skill rankings. Interval: numerical data with precise differences between categories but with no true zero (i.e., zero implies absence of quantity). E.g., IQ (0 means could not be measured), temperature (degrees Celsius), z-scores (0 is average value), acidity (ph, 7 is neutral). Ratio: interval data with a true zero, true ratios exist E.g., height, weight, temperature (in Kelvins), strength, price, age, duration.

6 Methods of Sampling 6 Random: subjects are randomly selected from a population, all subjects have equal probability of being selected, subjects may not be selected twice. Systematic: subjects are numbered sequentially and th every n subject is selected to obtain a sample of N/n subjects (N is number of people in population). Stratified: population is divided into identifiable groups (strata) by some relevant variable (income, gender, age, education) and each strata is sampled randomly in proportion to the strata s relative size in the population. Cluster: subjects are randomly sampled from representative clusters or regions of the population. Economical method if subjects are widely dispersed geographically. Convenience: typically used in student projects and by journalists, uses subjects that can be conveniently polled or tested. Not suitable for pollsters or medical research.

7 7 Graphing 1 Types of graphs: Pictogram: numeric data are represented by pictures, usually only nominal data are depicted in this way Example: milk production increases by 200% Before Biassed way: After height of cow is doubled but two-dimensionally cow is four times bigger, three-dimensionally it is eight times bigger Unbiassed way: increase is correctly depicted as two times greater

8 8 Graphing 2 Pie chart: used with nominal or frequency data Example: number of students by province and country a segment can be emphasized by separating it two-dimensional pie cannot create a biassed view three-dimensional pies can bias a slice depending on its position - a slice in front appears - put a slice in the back larger to reduce its size - separating it creates even more emphasis

9 Graphing 3 Bar graph: used for nominal data, usually frequency counts, are depicted by bars proportional to their magnitudes 9 bars are separated extreme length bars can be split Histogram: used for ordinal data bars are adjacent, no gaps one axis is ordered, first or last bars may include extremes

10 10 Line graph: used with interval and ratio data scaling can create a bias use large scales to hide changes Graphing 4 truncated axis reduces white space scaling to minimum and maximum emphasizes changes

11 11 Graphing 5 Ogive or cumulative frequency: (pronounced 0- jive) line starts at zero and accumulates to 100% useful for determining percentages (by interpolation)

12 Rules for Constructing a Frequency Histogram 1. There should be between 5 and 20 classes. this is strictly for aesthetic purposes 2. The class width should be an odd number. this ensures that the midpoint has the same number of decimal places as the original data The classes must be mutually exclusive. each datum must fall into one class and one class only 4. The classes must be continuous. there should be no gaps in the number line even if a class has no members 5. The classes must be exhaustive. all possible data must fit into one of the classes 6. The classes must have equal width. if not there will be a bias among the classes you can have open-ended classes at the ends (i.e., for ages you may use 10 and under or 65 and over, etc.)

13 13 Types of Frequency Distributions Categorical Ungrouped - for nominal types of data - for numerical data with few scores Grouped - for numerical data with many scores Example: Distribution of the number of hours that boat batteries lasted. Class Class Tally Frequency Cumulative Cumulative Limits Bounds frequency percentages /// 3 3 3/25*100= 12% +))) / 1 1+3= 4 4/25*100= 16% +))) ///// 5 5+4= 9 9/25*100= 36% +))) ///// //// 9 9+9= 18 18/25*100= 72% +))) ///// / = 24 24/25*100= 96% +)))) / = 25 25/25*100= 100% Total % +))))))))))))))))))))))))))))))))- +)))))))- Use these numbers for frequency polygon. Use these numbers for constructing cumulative frequency polygon, also called an ogive.

14 Frequency Polygon and Ogive 14 Frequency polygon: Cumulative frequency or ogive:

15 Measures of Central Tendency Mode: most frequent score. best average for nominal data sometimes none or more than one mode in a sample bimodal or multimodal distributions indicate several groups included in sample easy to determine 15 Midrange: mean of highest and lowest scores. easy to compute, rough estimate, rarely used Median: value that divides distribution in half. best average for ordinal data more appropriate average for skewed ratio or interval data or data on salaries difficult to compute because data must be sorted unaffected by extreme data Arithmetic mean: centre of balance of data. sum of numbers divided by n best average for unskewed ratio or interval data easy to compute sample mean= population mean = Other measures: harmonic mean, geometric mean and quadratic mean, also called root mean square (RMS)

16 Skewed Data 16 direction of skew is the direction of the tail positive direction of a number line is to the right, left is negative direction mean, mode and median (MD) are the same for symmetrical distributions notice mean is closest to the tail (i.e., more influenced by extreme values)

17 Measures of Variation 17 Range: highest minus lowest values. used for ordinal data R = highest lowest th th Interquartile range: 75 minus 25 percentile. used for determining outliers IQR = Q Q 3 1 Variance: mean of squared differences between scores and the mean value ( ). used on ratio or interval data used for advanced statistical analysis (ANOVAs) Standard deviation: has same units as raw data. used on ratio or interval data most commonly used measure of variation Coefficient of variation: percentage of standard deviation to mean. used to compare variability among data with different units of measure. Not suitable for interval data.

18 Biased and Unbiased Estimators 18 sample mean is an unbiased estimate of the population mean ( ) variances and standard deviations are biased estimators because mean is used in their computation Why? Last score can be determined from mean and all other scores, therefore, it is not free to vary or add to variability. To compensate divide sums of squares by n 1 instead of n. Instead of using the standard formula a computing formula is used so that running totals of scores and scores squared may be used to compute variability. Computing Formulae 2 Variance: s = sample variance Standard deviation: S = sample standard deviation

19 Measures of Position 19 Percentile: score which exceeds a specified percentage of the population. suitable for ordinal, ratio or interval data th median (MD or Q 2) is 50 percentile th th first and third quartiles (Q 1 and Q 3) are 25 and 75 percentiles easier for non-statisticians to understand than z-scores scores are all positive numbers Standard or z-scores: based on mean and standard deviation and the normal distribution. suitable for ratio and interval numbers approximately 68% of scores are within 1 standard deviation of the mean, approximately 95% are within 2 standard deviations and approximately 99% are within 3 standard deviations half the scores are negative numbers mean score is zero excellent way of comparing measures or scores which have different units (i.e., heights vs. weights, metric vs. Imperial units, psychological vs. physiological measures)

20 Measures of Position and Outliers 20 Other measures of position: th th th Deciles: 10, 20, percentiles (D 1, D 2,...D 10) often used in education or demographic studies th th th Quartiles: 25, 50 and 75 percentiles (Q 1, Q 2, Q 3) frequently used for exploratory statistics and to determine outliers (Q is same as median) 2 Outliers: extreme values that adversely affect statistical measures of central tendency and variation. Method of determining outliers: compute interquartile range (IRQ) multiply IRQ by 1.5 lower bound is Q 1 minus 1.5 IRQ upper bound is Q 3 plus 1.5 IRQ values outside these bounds are outliers and may be removed from the data set it is assumed that outliers are the result of errors in measurement or recording or were taken from an unrepresentative individual Alternate method for normally distributed data: +/ 4 or 5 standard deviations

21 Counting Techniques 21 Fundamental Counting Rule: In a sequence of n events with each event having k possibilities, the total number of outcomes is: n k = k k k... k Examples: How many 6 digit student ID numbers = How many ways of throwing 5 dice (Yahtze) = How many ways of selecting 3 letters = In a sequence of n events in which there are k 1 possibilities for the first event, k 2 possibilities for the second event and k 3 for the third, etc., the total number of possible outcomes is: k 1 k 2 k 3... kn Examples: How many 3 digit phone exchanges before = 160 (minus 911, 311, 411, 511) How many 3 digit phone exchanges after = 800 (minus 911, 311, 411, 511) How many 7 character Ontario licence plates (4 letters, 3 numbers) =

22 Factorial Notation 22 Factorials: Factorial numbers are identified with an exclamation mark (n!). They are defined: n! = n n 1 n ! is defined to be 1 Examples: 1! = 1 2! = 2 1 = 2 5! = = ! = !

23 Permutations and Combinations 1 23 Permutations: Rule 1: How many ways are there of arranging ALL n unique items if replacement is NOT allowed? n! = n n 1 n Examples: How many ways of arranging 6 items in a display. n = 6! = = 720 How many ways of ordering three experiments. n = 3! = 6 Rule 2: How many ways are there of selecting r items from n unique possibilities if replacement is NOT allowed? n P = n! / (n-r)! r Examples: How many ways of selecting 2 items from group of 6. How many ways of selecting committee of four from a staff of 20 if order of selection is significant.

24 Permutations and Combinations 2 24 Rule 3:How many ways are there of selecting n items if replacement is NOT allowed but k 1, k 2,... k n items are identical? n! / (k 1! k 2!... k n!) Examples: How many words from the letters in OTTAWA. n = 6! / (2! 2!) = 180 How many words from the letters in TORONTO. n = 7! / (2! 3!) = 420 Combinations: How many ways are there of selecting r items from n unique possibilities if replacement is NOT allowed and order is not important? C = n! / [(n-r)! r!] = P /r! n r n r Examples: How many ways are there of selecting 2 items from a group of 6. How many ways of selecting committee of four from a staff of 20 if order of selection is unimportant.

25 Probability 25 Basic Concepts: Probability experiment: process that leads to welldefined results, called outcomes. Outcome: result of a single trial of a probability experiment (a datum). Sample space: all the possible outcomes of a probability experiment, i.e., the population of outcomes. Event: consists of one or more outcomes of a probability experiment, i.e., a sample of outcomes. I. Classical Probability: ratio of number of ways for an event to occur over total number of outcomes in the sample space, i.e., where E is the event, S is the sample space, P(E) is the probability of the event E occurring, n(e) is the number of possible outcomes of event E, n(s) is the number of outcomes in the sample space To obtain a probability using classical methods the event and sample space must be countable. Often the rules for determining permutations and combinations are required.

26 Classical Probability 1 26 Examples: Probability of rolling snake eyes (1, 1) with two dice. Sample space for rolling two dice (one white, one black): sevens elevens n(s) = 36 possible outcomes Notice, there are four different ways of reporting a probability (proper fraction, ratio, decimal and percentage). Probability of 7 or 11 from rolling two dice. Probability of doubles from two dice. Probability of drawing a queen from a card deck.

27 Classical Probability 2 Examples: Probability of drawing a spade. 27 Probability of drawing a red card. Probability of flipping heads in a coin toss. Probability of flipping heads after 10 coin tosses of heads in a row. coin cannot remember its history of outcomes Probability of red on a double zero roulette wheel. wheel has numbers 1 to 36, half are red and half are black, plus green zero and double zero (n=38) Probability of not getting a red on a roulette wheel.

28 Rules of Probability 28 Rule 1: all probabilities range from 0 to 1 inclusively 0 P(E) 1 Rule 2: probability that an event will never occur is zero P(E) = 0 Rule 3: probability that an event will always occur is one P(E) = 1 Rule 4: if P(E) is the probability that an event will occur, the probability that the event will not occur is (also called the complement of an event): Venn diagrams: P (not E) = 1 P(E) Rule 5: the sum of probabilities of all outcomes in a sample space is one P(S) = P(E) = 1

29 Empirical Probability 29 II. Empirical Probability: obtained empirically by sampling a population and creating a representative frequency distribution. for a given frequency distribution, probability is the ratio of frequency of an event class to the total number of data in the frequency distribution, i.e., Examples: Probability of a girl baby. Assume that a population has a blood type distribution of: 2% AB, 5% B, 23% A, and 70% O Probability of a person having type B or AB blood. P(AB or B) = 2% + 5% = 7.00% = Probability of strongly left-handed person. P(strongly left-handed) = = 5.00% Probability of natural blues eyes. P(blues eyes) = = 6.50%

30 Addition Rules 1 Addition Rule 1: if two events are mutually exclusive, (i.e., no outcomes in common) then the probability of A or B occurring is: 30 Venn diagrams of events that are mutually exclusive: If three events are mutually exclusive: Examples: Probability of selecting a spade or a red card. Probability of drawing a face card (king or queen or jack). Probability of 7, 11 or doubles with 2 dice.

31 Addition Rules 2 31 Addition Rule 2: if two events are not mutually exclusive, the probability of A or B occurring is: Venn diagrams of events that are NOT mutually exclusive: When three events are not mutually exclusive there is a region that is common to all three events. This area gets added and subtracted three times and therefore must be added back once. That is,

32 Addition Rules 2, cont d Examples: Probability of selecting a spade or a face card. 13 spades 12 face cards (3 per suit) 32 Probability of selecting a female student or a third-year student from 103 students. 53 are female 70 students in third year 45 females in third year Probability of selecting a male or person with type O blood from 100 people. half are males 70% are O-type blood Probability of selecting a left-handed person or a Liberal from 1000 people. 32% Liberals, 5% left-handed

33 Independence Definition: two events are independent if the occurrence of one event has no effect on the occurrence of the other event. In probability experiments where there is no memory from one event to the next, the events are called independent. Examples of Independent Events: Coin tosses. Even when 10 heads are flipped in a row the next coin toss still as a 50:50 chance of being a head. Roulette wheel spins. Each spin of the wheel is theoretically independent. Each number on the wheel has equal probability of occurring at each spin. Rolling dice repeatedly. The dice cannot remember what they rolled from one toss to another. Drawing cards with replacement. With replacement means after a card is drawn it is put back in the deck; thus all cards are equally likely to be drawn each time. Examples of Dependent Events: Drawing cards without replacement. Once a card is drawn it cannot be drawn a second time. This changes the characteristics of the remaining deck of cards. Bingo numbers. Once a ball is drawn it is not replaced. Lottery 6/49. All numbers (1 49) are equally likely to be chosen but can only be chosen once. 33

34 Multiplication Rules 1 34 Multiplication Rule 1: if two events are independent (i.e., have NO on influence of each other s probability) then the probability of A and B occurring is: Venn diagram: Examples: Coin and dice tossing, lotteries, slot machines, roulette wheels etc., any game or experiment where knowledge of an outcome is not remembered by the next game or experiment. Probability of tossing heads twice. Probability of rolling seven twice with two dice. Probability of having nine daughters in a row.

35 Multiplication Rules 2 35 Multiplication Rule 2: if two events are dependent then the probability of A and B occurring is: where P(B A) means the probability of B occurring given that A occurs or occurred, also called the conditional probability Examples: Card games where the cards are not replaced or selections where replacement is not allowed. Results from one experiment affect outcome of next. Probability of a drawing a two then a three. Probability of a drawing an ace then a face card. Probability of a drawing a pair. Probability of a drawing a pair of aces.

36 Probability Distributions 36 Definition: distribution of the values of a random variable and their probability of occurrence. Random variable: discrete or continuous variable whose values are determined by chance. Examples: 1. Probability distribution of a coin toss (approximately 1 half) 2. Probability distribution of a th fair die toss (each 1/6 ) 3. Probability distribution of polls (correct 19 times out of 20)

37 Mean, Variance and Expectation 37 Mean: of a probability distribution (weighted average) th where X is the i outcome and P(X ) is its probability. i Examples: 1. Mean number of heads for tossing two coins i 2. Mean number of spots for tossing a single die Notice that the answer does not have to be possible. Variance and Standard Deviation: Expectation: the expectation or expected value of a probability distribution is equal to the mean. for predicting the cost of playing games and lotteries

38 Expectation cont d 38 Examples: 1. Compute the expectation of playing a lottery where 100 tickets are sold for $1 and the winning prize is worth $100. This is considered a fair game. If the prize was $50 the expectation would be $0.50. Any negative value is a loser for the player; any positive value is a good game for the player. 2. Compute the profit or loss of playing a lottery where the cost of a ticket is $10, there are 1000 tickets sold and the prizes are: st 1 place wins $1000, nd 2 place wins $500 and rd five 3 places win $100

39 Binomial Distribution Definition: probability distribution in which there are only two outcomes, or can be reduced to only two by some rule ( an event occurs and the event does not occur ). Examples: heads and tails, true and false, success and failure, boy or girl, equal to a value and not equal, roll a 1 and not roll an 1 with a die. Rules: - only two outcomes per trial - fixed number of trials - independence from trial to trial - probability same from trial to trial Notation: p = probability of success q = probability of failure n = number of trials x = number of successes where 0 x n x n x P(x) = nc x p q Note, since p + q = 1 therefore q = 1 p Examples: 1. Probability of 4 sixes in 4 tosses of a die Probability of tossing five heads in seven tosses.

40 Binomial Distributions 40 Examples: 1. Tossing of a fair coin (1 trial and 4 trials) 2. Rolling a six with a fair die (rolling a die is multinomial) 3. Answering a four-choice multiple choice question correctly

41 41 Normal Distribution Many biological and physical processes exhibit a distribution called the normal or Gaussian distribution. Values tend to cluster around a mean and extreme values are relatively rare. Each normal distribution has different units of measure but they can be normalized using the z-score transform (z = (X - )/ ). This defines the standard normal distribution or z-distribution. Definition: x 1 e is called the exponential function. e, called Euler s number, is a transcendental number equal to: 1/n! Properties of Standard Normal Distribution: bell shaped, unimodal distribution mean, median and mode are the same and equal to 0 standard deviation is equal to 1 area under curve equals 1 symmetrical about the mean continuous function (no gaps & for every x there is a single y) asymptotic to the x-axis at both ends (y values approach but never become zero)

42 42 Applications of Normal Distribution 1 Uses: computing areas and percentiles of scores that are normally distributed testing hypotheses concerning means of different populations (are they the same or different?) Examples: 1. Find the percentage of scores between +/ 1 standard deviations. area between 0 and +1 = area between 0 and 1 = area between 1 and +1 = = = 68.3% 2. Find the percentage of scores between +/ 2 and +/ 3 standard deviations. area between 0 and +2 = area between 2 and +2 = = = 95.4% area between 3 and +3 = = = 99.7% 3. Find the z-score that defines 95% of scores around the mean.

43 43 Applications of Normal Distribution 2 th 4. Find z-score that defines the lower 90 percent. th th 5. Find 25 and 75 percentile z-scores. 6. College wants top 15% of students who take a test which has a mean of 125 and standard deviation of Determine the 5 and 95 percentile heights of a th th population that has a mean of ±20.0 cm.

44 44 Central Limit Theorem 1 Sampling Distribution of Sample Means: distribution based on the means of random samples of a specified size (n=constant) taken from a population. Example: Test scores from a class of four students. Scores were 2,4,6,8 (uniform distribution) List all possible samples of size n=2, allowing replacement. Sample Mean Sample Mean 2,2 2 6,2 4 2,4 3 6,4 5 2,6 4 6,6 6 2,8 5 6,8 7 4,2 3 8,2 5 4,4 4 8,4 6 4,6 5 8,6 7 4,8 6 8,8 8 Sampling distribution of means: Mean frequency

45 45 Central Limit Theorem 2 As sample size (n) increases the shape of the sampling distribution of sample means taken from a population with mean,, and standard deviation,, will approach a normal distribution, with mean,, and standard deviation,. the standard deviation of the sampling distribution is called the standard error of the mean (notice, by definition, it is always less than sample s standard deviation when n > 1) Note, whenever the sample size (n) exceeds 5% of the population size (N) the standard error must be adjusted by the Finite Population Correction Factor: That is, Example: What is the standard error of the mean for a sampling distribution given a sample of size of 100 and s.d. of 5.00 taken from a population of size, 1000.

46 Confidence Intervals 1 46 Point Estimate: a specific value that estimates a parameter e.g., a sample mean ( ) is best estimator of the population mean ( ) problem is that there is no way to determine how close a point estimate is to the parameter Properties of a Good Estimator: 1. must be an unbiased estimator -expected value of estimator or mean obtained from samples of a given size must be equal to the parameter 2. must be consistent -as sample size increases estimator approaches value of the parameter 3. must be relatively efficient -estimator must have smallest variance of all other estimators Interval Estimate: range of values that estimate a parameter th e.g., mean +/- standard deviation ( ), 25 ± 10 kg, 5 to th 95 %ile precise probabilities can be assigned the validity of the interval

47 Confidence Intervals 2 47 Confidence Interval: interval estimate based on sample data and a given confidence level Confidence Level: probability that a parameter will fall within an interval estimate related to alpha ( ) level, that is, Confidence Level = 1 - E.g., C.L. = 95% means = 0.05 C.L. = 99% means = 0.01 Formula for Computing Confidence Intervals where z /2 is the z-score that places the area /2 in the right tail of the normal distribution For example if is 5% then z /2 is the z-score that places 2.5% in the right tail and z /2 2.5% in the left tail. That is z = /2

48 Confidence Intervals 3 48 Maximum Error of Estimate (E): Example: th Compute the 95 percentile confidence interval from a sample of size 30 which has a standard deviation of 5.00 and a mean of Since is unknown use s. From z-table: z = /2 Therefore, Confidence interval = C.I. = 25.0 ±1.789 = 23.2 to 26.8

49 49 Confidence Intervals when is Unknown and n is Small When population standard deviation is known, always use the z-distribution (normal distribution). If not known, which is often the case, use sample standard deviation, s, as long as n is 30 or greater. If not, use the t-distribution with n 1 degrees of freedom. Decision Tree for Selecting Statistical Method Formula for Computing Confidence Intervals for Small Sample Sizes

50 50 Example: Confidence Intervals when is Unknown and n is Small th Compute the 95 percentile confidence interval from a sample of size of 10 that has a standard deviation of 5.00 and a mean of (Similar to previous example.) From t-table with degrees of freedom = df = n 1 = 9: t = /2 Therefore, Confidence interval = C.I. = 25.0 ± 3.58 = 21.4 to 28.6

51 Sample Size 51 Minimum Sample Size for Interval Estimate of Population Mean: where n is sample size, is the population standard deviation and E is the maximum error of estimate. When is unknown it may be estimated from the sample standard deviation, s. Example: Calculate the sample size needed to estimate muscle strength from a population that has a standard deviation of newtons if you want to be 95% confident and within 50.0 newtons. 2 n = ( / 50.0) = 30.1 You will need a sample size of 31. Note, always round up to the next highest integer when there is a fraction.

52 Hypothesis Testing Hypothesis: conjecture, proposition or statement based on published literature, data or a theory which may or may not be true. Statistical Hypothesis: conjecture about a population parameter. usually stated in mathematical terms two types, null and alternate Null Hypothesis (H 0): states that there is NO difference between a parameter and a specific value or among several different parameters Alternate Hypothesis (H 1): states that there is a significant difference between a parameter and a specific value or among several different parameters Examples: H 0: = 82 kg H 1: 82 kg* H 0: 150 cm H 1: > 150 cm H 0: 65.0 s H 1: < 65.0 s H 0: 0 = 1 H 1: ì 0 ì 1* H : H : < Notice that the equality symbols are always with the null hypotheses. * These are called two-tailed tests; others are all directional or one-tailed tests. 52

53 Two-tailed vs One-tailed Tests Two-tailed: also called a non-directional test. null hypothesis is disproved if sample mean falls in either tail most appropriate test especially with no previous experimentation less powerful than onetailed 53 One-tailed: also called a directional test. researcher must have reason that permits selecting in which tail the test will be done, i.e., will the experimental protocol increase or decrease the sample statistic more powerful than two-tailed since it is easier to achieve a significant difference fails to handle the situation when the sample means falls in the wrong tail One-tailed, left One-tailed, right

54 Statistical Testing 54 To determine the veracity (truth) of an hypothesis a statistical test must be undertaken that yields a test value. This value is then evaluated to determine if it falls in the critical region of a appropriate probability distribution for a given significance or alpha ( ) level. The critical region is the region of the probability distribution which rejects the null hypothesis. Its limit(s), called the critical value(s), are defined by the specified confidence level. The confidence level must be selected in advance of computing the test value. To do otherwise is statistical dishonesty. When in doubt one should always use a two-tailed test. Instead of reporting significance levels ( = 0.05) or equivalent probabilities (P<0.05) many researchers report the test values as probabilities or P- values (e.g., P = , P = 0.253, P < 0.001). Advanced statistical programs report P-values, if not, use P<0.05 or P<0.01. If test shows P=0.000 then report P< Truth table: H 0 is true and H is false Test rejects H 0 (accepts H ) Test does not reject H (accepts H ) Error ( ) Type I error Correct (1 - ) (experiment failed) H 0 is false and H is true 1 Correct (1 - ) (experiment succeeded) Error ( ) Type II error

55 z-test and t-test 55 Test for a Single Mean: used to test a single sample mean ( ) when the population mean ( ) is known Is the sample taken from the population or is it different (greater, lesser or either)? z-test: when population s.d. ( ) is known Test value: if z is in critical region defined by critical value(s) then sample mean is significantly different from population mean, if is unknown then use sample, s, as long as sample size is 30 or greater Test value: t-test: if is unknown and n < 30 then use t-test and t-distribution with d.f. = n-1 Test value:

56 Flow Diagram for Choosing the Correct Statistical Test 56 Same as flow diagram used for confidence intervals. Generally the sample s mean and standard deviation are used with the t-distribution. The t-distribution becomes indistinguishable from the z-distribution (normal distribution) when n is 30 or greater.

57 Power of a Statistical Test 57 Power: ability of a statistical test to detect a real difference. probability of rejecting the null hypothesis when it is false (i.e., there is a real difference) equal to 1 (1 probability of Type II error) Ways of increasing power Increasing will increase power but it also increases chance of a Type I error Increasing sample size (increases costs) Using ratio or interval data versus nominal or ordinal. Tests involving ratio/interval data are called parametric tests. Those involving nominal and ordinal data are called nonparametric tests. Using repeated measures tests, such as, the repeated measures t-test or ANOVA. By using the same subjects repeatedly, variability is reduced. If variances are equal use pooled estimates of variance (e.g., Independent groups t-test) Using samples that represent extremes. Reduces generalizability of experiment results. Standardizing testing procedures reduces variability. Using one-tailed vs. two-tailed tests. Problem occurs if results are in wrong tail. Not recommended.

58 Testing Differences between Two Means Large Independent Sample Means: used to test whether the data from two samples come from the same population or two different populations. Assumptions: data were randomly sampled samples are independent, i.e., there can be no relationship between the two samples standard deviations known and if sample size < 30 population(s) is/are normally distributed if more than two sample means are tested adjustments must be made to significance level (e.g., Bonferroni correction, Bonferroni = /number of tests) z-test: Test value: Critical value comes from standard normal (z) distribution. Use one- or two-tailed test. Conservatively, choose the two-tailed test. Values are also available at bottom of t-distribution. 58

59 The Step-by-Step Approach 59 Step 1: State hypotheses Two-tailed: One-tailed: H : = H : or H : H : H : > or H : < Step 2: Find critical value Look up z-score for specified significance ( ) level and for one- or two-tailed test (selected in advance). Usually use = 0.05 and two-tailed test, i.e., z critical = ± For one- tailed use z = ± critical Step 3: Compute test value Step 4: Make decision Draw diagram of normal distribution and critical regions. If test value is in critical region reject the null hypothesis otherwise do not reject. Step 5: Summarize results Restate hypothesis (null or alternate) accepted in step 4.

60 60 If null is rejected: There is enough evidence to reject the null hypothesis. If null is not rejected: There is not enough evidence to reject the null hypothesis. Optionally, reword hypothesis in lay terms. E.g., There is or is not a difference between the two populations or one population is greater/lesser than the other for the independent variable.

61 Testing Differences between Two Means Small Independent Sample Means: when population standard deviations are unknown and sample size is < 30 use t-distribution for critical values and t-test for test values. Use F-ratio to determine whether sample variances are equal or unequal. Then choose the appropriate t-test. Assumptions samples are random samples two samples must be independent, i.e., different subjects, if not, use dependent groups t-test population must be normally distributed If sample variances are NOT equal: Use test value: for degrees of freedom (df) use smaller of n 1 1 and n 2 1 (i.e., conservative choice, higher critical value) If sample variances are equal: Use test value: and df = n 1 + n Uses a pooled estimate of variance that combined with reduced degrees of freedom increases the test s power.

62 Test for Equal Variances 62 Also called Homogeneity of Variance used primarily to determine which t-test to use uses F-distribution and F-test (later used for ANOVA) assume variances are equal and test if unequal SPSS uses Levine s Test for Equality of Variances. If P (Sig.) < variances are NOT equal. Step 1: Always a two-tailed test. 2 2 H 0: s 1 = s2 2 2 H 1: s 1 s2 Step 2: Find critical value (F CV) from F-distribution. Use degrees of freedom of larger variance (df N = n larger 1) as numerator and degrees of freedom of smaller variance as denominator (df D = n 1). smaller Step 3: Compute test value: Note, F TV will always be 1. Steps 4 and 5: If F TV > F CV then reject H 0 and conclude variances are unequal. If F TV F CV then do NOT reject H 0 and conclude variances are equal. I.e., you have homogeneity of variances. You can now select the appropriate Independent Groups t-test.

63 Flow Diagram for Choosing the Correct Independent Samples t-test 63 Similar to flow diagram used for single sample means. But requires a test for equality of variances (homogeneity of variance). Generally the sample s mean and standard deviation are used with the t-distribution. The t- distribution becomes indistinguishable from the z- distribution (i.e., normal distribution) when n is 30 or greater. Samples must be independent.

64 Testing Differences between Two Means Dependent Sample Means: Used when two samples are not independent. More powerful than independent groups t-test and easier to perform (no variance test required). Simplifies research protocol (i.e., fewer subjects) but dependence may limit generalizability. Examples: repeated measures (test/retest, before/after) matched pairs t-test (subjects matched by a relevant variable: height, weight, shoe size, IQ score, age) twin studies (identical, heterozygotic, living apart) Step 1: Two-tailed: One-tailed: H 0: D = 0 H 0: D 0 or H 0: D 0 H 1: D 0 H 1: D > 0 or H 1: D < 0 Step 2: Critical value from t-distribution with degrees of freedom equal to number of data pairs minus one (df = n 1). Step 3: Compute differences between pairs (D) then mean difference ( ) and s : D Test value: Step 4 and 5: If test value > critical value reject H 0 otherwise there is no difference between the two trials/groups. 64

65 Correlation and Regression 65 Linear Correlation: Does one variable increase or decrease linearly with another? Is there a linear relationship between two or more variables? Types of linear relationships: Positive linear Negative linear No relationship None or weak

66 Scattergrams 66 Weak linear Strong Linear Other relationships: Nonlinear or Curvilinear Linear and Exponential?

67 Correlation 67 Pearson Product Moment Correlation Coefficient: simply called correlation coefficient, PPMC, or r-value linear correlation between two variables Examples: Weight increases with height. IQ with brain size?! Used for calibration of instruments, force transducers, spring scales, electrogoniometers (measure joint angles). Multiple Correlation: used when several independent variables influence a dependent variable R-value (capital R vs r) Defined as: Y = A + B 1 X 1 + B 2 X 2 + B 3 X B n Xn Examples: Heart disease is affected by family history, obesity, smoking, diet, etc. Academic performance is affected by intelligence, economics, experience, memory, etc. Lean body mass is predicted by a combination of body mass, thigh, triceps and abdominal skinfold measures.

68 Significance of Correlation Coefficient 68 Method 1 Step 1: H 0: 0 Step 2: Look up r crit for n 2 degrees of freedom (Table I) Step 3: Compute sample r (as above) Step 4: Sample r is significant if it is greater than r crit Step 5: If significance occurs data are linearly correlated otherwise they are not. If table of significant correlation coefficients is not available or significance level ( ) is not 0.05 or 0.01 use Method 2. Method 2 Step 1: H 0: 0 Step 2: Look up t crit for n 2 degrees of freedom Step 3: Compute sample r then t Step 4: Sample t is significant if it is greater than t crit Step 5: If significance occurs data are linearly correlated otherwise they are not.

69 Regression 69 Regression: can only be done if a significant correlation exists. Equation of line or curve which defines the relationship between variables. The line of best fit. Mathematical technique is called least squares method. This technique computes the line that minimizes the squares of the deviations of the data from the line.

70 Coefficient of Determination and Standard Error of Estimate 70 Coefficient of Determination Measures the strength of the relationship between the two variables. Equal to the explained variation divided by the total 2 variation = r. Usually given as a percentage, i.e., 2 coefficient of determination = r 100% For example, an r of 0.90 has 81% of total variation explained but an r of 0.60 has only 36% of its variation. A correlation may be significant but explain very little. Standard Error Of Estimate Measure of the variability of the observed values about the regression line Can be used to compute a confidence interval for a predicted value standard error of estimate:

71 71 Possible Reasons for a Significant Correlation 1. There is a direct cause-and-effect relationship between the variables. That is, x causes y. For example, positive reinforcement improves learning, smoking causes lung cancer and heat causes ice to melt. 2. There is a reverse cause-and-effect relationship between the variables. That is, y causes x. For example, suppose a researcher believes excessive coffee consumption causes nervousness, but the researcher fails to consider that the reverse situation may occur. That is, it may be that an nervous people crave coffee. 3. The relationship between the variables may be caused by a third variable. For example, if a statistician correlated the number of deaths due to drowning and the number of cans of soft drinks consumed during the summer, he or she would probably find a significant relationship. However, the soft drink is not necessarily responsible for the deaths, since both variables may be related to heat and humidity. 4. There may be a complexity of interrelationships among many variables. For example, a researcher may find a significant relationship between students high school grades and college grades. But there probably are many other variables involved, such as IQ, hours of study, influence of parents, motivation, age and instructors. 5. The relationship may be coincidental. For example, a researcher may be able to find a significant relationship between the increase in the number of people who are exercising and the increase in the number of people who are committing crimes. But common sense dictates that any relationship between these two variables must be due to coincidence.

72 Comparing Frequencies using Chi-square 72 Chi-square or 2 : pronounced ki squared. Used to test whether the frequency of nominal data fit a certain pattern (goodness of fit) or whether two variables have a dependency (test for independence). Can be used to test whether data are normally distributed and for homogeneity of proportions Frequency of each nominal category is computed and compared to an expected frequency. Goodness of Fit: Need to know expected pattern of frequencies. If not known assume equal distribution among all categories. Assumptions: data are from a random sample expected frequency for each category must be 5 or more Examples: test for product / procedure preference (each is assumed equally likely to be selected) test for fairness of coin, die, roulette wheel (expect each outcome equally) test for expected frequency distribution (need theoretically expected pattern)

73 Goodness of Fit Test 73 Step 1 H 0: data fit the expected pattern H : data do not fit expected pattern 1 Step 2 2 Find critical value from table. Test is always a one-tailed right test with n-1 degrees of freedom, where n is number of categories. Step 3 Compute test value from: O = observed freq. E = expected frequency Step 4 Step 5 2 Make decision. If > critical value reject H 0. Summarize the results. E.g., There is (not) enough evidence to accept/reject the claim that there is a preference for. E.g., Coin is fair / unfair Die is fair / loaded Wheel is fair / flawed

74 Step 1 Test for Independence H 0: two variables are independent H : two variables are dependent 1 74 Step 2 2 Find critical value from table. Test is always one-tailed right with (n row 1)(n col. 1) degrees of freedom, where n row and n col. are the number of categories of each variable. These correspond to the number of rows and columns in the contingency table. Step 3 Create the contingency table to derive the expected values (see next page). Compute test value from: O = observed freq. E = expected frequency Step 4 2 Make decision. If > critical value reject H 0. Step 5 Summarize the results. E.g. Getting a cold is dependent upon whether you took a cold vaccine. - Smoking and lung disease are dependent. - Is a cure dependent upon placebo vs. drug.

75 75 Contingency Table First, enter observed (O) scores and compute row and column totals. Col.1 Col.2 Col.3 totals Row Row Row totals * / 125* * Notice sums of row and column totals must be equal. Second, compute expected (E) values based of row and column totals. Col.1 Col.2 Col.3 Row 1 40x40/125=E11 40x60/125=E12 40x25/125=E13 Row 2 35x40/125=E21 35x60/125=E22 35x25/125=E23 Row 3 50x40/125=E31 50x60/125=E32 50x25/125=E33 Finally, compute the test value:

76 Analysis of Variance (ANOVA) 76 One-way ANOVA: used to test for significant differences among sample means differs from t-test since more than 2 groups are tested, simultaneously one factor (independent variable) is analyzed, also called the grouping variable dependent variable should be interval or ratio but factor is nominal Factorial Design: - groups must be independent (i.e., subjects in each group are different and unrelated) Assumptions: data must be normally distributed or nearly variances must be equal (i.e., homogeneity of variance) Examples: Does fitness level (VO2Max) depend on province of residence? Fitness level is a ratio variable, residence is a nominal variable. Does statistics grade depend of highest level of mathematics course taken? Does hand grip strength vary with gender? (Can be done with t-test. t-test can handle equal or unequal variances.)

77 One-way ANOVA 77 An ANOVA tests whether one or more samples means are significantly different from each other. To determine which or how many sample means are different requires post hoc testing. Two samples where means are significantly different. These two sample means are NOT significantly different due to smaller difference and high variability. Even with same difference between means, if variances are reduced the means can be significantly different.

78 One-way ANOVA 78 Step 1 Step 2 Step 3 H 0: all sample means are equal H : at least one mean is different 1 Find critical value from F table (Table H). Tables are for one-tailed test. ANOVA is always one-tailed. Compute test value from: Step 4 Step 5 Make decision. If F > critical value reject H. Summarize the results with ANOVA table. 0 All means are the same, i.e., come from the same population or at least one mean is significantly different. Step 6 If a significant difference is found, perform post hoc testing to determine which mean(s) is/are different.

79 ANOVA Summary Table 79 Source Sums of d.f. Mean F P squares square 2 2 Between SSB k-1 SS B/(k-1) =sb sb /s 2 W (also called Main effect) Within SSW N k SS W/(N k) =s 2 W (also called Error term) Total SS +SS (k 1)+(N k)= N 1 Examples: B W One-way Factorial Source Sums of d.f. Mean F P squares square Between <0.01 Within Total Two-way Factorial Source Sums of d.f. Mean F P squares square Factor A NS Factor B <0.025 A x B <0.005 Within Total

80 Post Hoc Testing 80 Post Hoc testing used to determine which mean or group of means is/are significantly different from the others many different choices depending upon research design and research question (Duncan s, Scheffé s, Tukey s HSD,...) only done when ANOVA yields a significant F Scheffé test: when sample sizes are unequal conservative test is desirable when all comparisons are to be tested Critical value: Use critical value from ANOVA and multiply by k 1. k = number of groups (means) F' = (k 1) F critical critical Test value: Decision: If F s > F' critical, then the two means are significantly different. Summary: graph the sample means

81 Post Hoc Testing 2 81 Tukey HSD test: sample sizes must be equal used when less conservative test is desirable, i.e., more powerful when only some comparisons are to be tested Critical value: Use Table N, where k = number of groups and v = degrees of freedom of s 2 W Test value: Decision: If q > critical value, then the means are significantly different. Summary: Graph the results and summarize.

Correlation and Regression

Correlation and Regression Correlation and Regression Linear Correlation: Does one variable increase or decrease linearly with another? Is there a linear relationship between two or more variables? Types of linear relationships:

More information

Midrange: mean of highest and lowest scores. easy to compute, rough estimate, rarely used

Midrange: mean of highest and lowest scores. easy to compute, rough estimate, rarely used Measures of Central Tendency Mode: most frequent score. best average for nominal data sometimes none or multiple modes in a sample bimodal or multimodal distributions indicate several groups included in

More information

Glossary for the Triola Statistics Series

Glossary for the Triola Statistics Series Glossary for the Triola Statistics Series Absolute deviation The measure of variation equal to the sum of the deviations of each value from the mean, divided by the number of values Acceptance sampling

More information

DETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics

DETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics DETAILED CONTENTS About the Author Preface to the Instructor To the Student How to Use SPSS With This Book PART I INTRODUCTION AND DESCRIPTIVE STATISTICS 1. Introduction to Statistics 1.1 Descriptive and

More information

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages: Glossary The ISI glossary of statistical terms provides definitions in a number of different languages: http://isi.cbs.nl/glossary/index.htm Adjusted r 2 Adjusted R squared measures the proportion of the

More information

Hypothesis Testing. Hypothesis: conjecture, proposition or statement based on published literature, data, or a theory that may or may not be true

Hypothesis Testing. Hypothesis: conjecture, proposition or statement based on published literature, data, or a theory that may or may not be true Hypothesis esting Hypothesis: conjecture, proposition or statement based on published literature, data, or a theory that may or may not be true Statistical Hypothesis: conjecture about a population parameter

More information

Lecture 14. Analysis of Variance * Correlation and Regression. The McGraw-Hill Companies, Inc., 2000

Lecture 14. Analysis of Variance * Correlation and Regression. The McGraw-Hill Companies, Inc., 2000 Lecture 14 Analysis of Variance * Correlation and Regression Outline Analysis of Variance (ANOVA) 11-1 Introduction 11-2 Scatter Plots 11-3 Correlation 11-4 Regression Outline 11-5 Coefficient of Determination

More information

Lecture 14. Outline. Outline. Analysis of Variance * Correlation and Regression Analysis of Variance (ANOVA)

Lecture 14. Outline. Outline. Analysis of Variance * Correlation and Regression Analysis of Variance (ANOVA) Outline Lecture 14 Analysis of Variance * Correlation and Regression Analysis of Variance (ANOVA) 11-1 Introduction 11- Scatter Plots 11-3 Correlation 11-4 Regression Outline 11-5 Coefficient of Determination

More information

Introduction to Statistics

Introduction to Statistics Why Statistics? Introduction to Statistics To develop an appreciation for variability and how it effects products and processes. Study methods that can be used to help solve problems, build knowledge and

More information

MIDTERM EXAMINATION (Spring 2011) STA301- Statistics and Probability

MIDTERM EXAMINATION (Spring 2011) STA301- Statistics and Probability STA301- Statistics and Probability Solved MCQS From Midterm Papers March 19,2012 MC100401285 Moaaz.pk@gmail.com Mc100401285@gmail.com PSMD01 MIDTERM EXAMINATION (Spring 2011) STA301- Statistics and Probability

More information

Chapter 10. Correlation and Regression. McGraw-Hill, Bluman, 7th ed., Chapter 10 1

Chapter 10. Correlation and Regression. McGraw-Hill, Bluman, 7th ed., Chapter 10 1 Chapter 10 Correlation and Regression McGraw-Hill, Bluman, 7th ed., Chapter 10 1 Example 10-2: Absences/Final Grades Please enter the data below in L1 and L2. The data appears on page 537 of your textbook.

More information

What is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty.

What is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty. What is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty. Statistics is a field of study concerned with the data collection,

More information

Chapter 10. Correlation and Regression. McGraw-Hill, Bluman, 7th ed., Chapter 10 1

Chapter 10. Correlation and Regression. McGraw-Hill, Bluman, 7th ed., Chapter 10 1 Chapter 10 Correlation and Regression McGraw-Hill, Bluman, 7th ed., Chapter 10 1 Chapter 10 Overview Introduction 10-1 Scatter Plots and Correlation 10- Regression 10-3 Coefficient of Determination and

More information

Final Exam STAT On a Pareto chart, the frequency should be represented on the A) X-axis B) regression C) Y-axis D) none of the above

Final Exam STAT On a Pareto chart, the frequency should be represented on the A) X-axis B) regression C) Y-axis D) none of the above King Abdul Aziz University Faculty of Sciences Statistics Department Final Exam STAT 0 First Term 49-430 A 40 Name No ID: Section: You have 40 questions in 9 pages. You have 90 minutes to solve the exam.

More information

Dover- Sherborn High School Mathematics Curriculum Probability and Statistics

Dover- Sherborn High School Mathematics Curriculum Probability and Statistics Mathematics Curriculum A. DESCRIPTION This is a full year courses designed to introduce students to the basic elements of statistics and probability. Emphasis is placed on understanding terminology and

More information

University of Jordan Fall 2009/2010 Department of Mathematics

University of Jordan Fall 2009/2010 Department of Mathematics handouts Part 1 (Chapter 1 - Chapter 5) University of Jordan Fall 009/010 Department of Mathematics Chapter 1 Introduction to Introduction; Some Basic Concepts Statistics is a science related to making

More information

Statistics 100 Exam 2 March 8, 2017

Statistics 100 Exam 2 March 8, 2017 STAT 100 EXAM 2 Spring 2017 (This page is worth 1 point. Graded on writing your name and net id clearly and circling section.) PRINT NAME (Last name) (First name) net ID CIRCLE SECTION please! L1 (MWF

More information

DSST Principles of Statistics

DSST Principles of Statistics DSST Principles of Statistics Time 10 Minutes 98 Questions Each incomplete statement is followed by four suggested completions. Select the one that is best in each case. 1. Which of the following variables

More information

Contents. Acknowledgments. xix

Contents. Acknowledgments. xix Table of Preface Acknowledgments page xv xix 1 Introduction 1 The Role of the Computer in Data Analysis 1 Statistics: Descriptive and Inferential 2 Variables and Constants 3 The Measurement of Variables

More information

Unit 4 Probability. Dr Mahmoud Alhussami

Unit 4 Probability. Dr Mahmoud Alhussami Unit 4 Probability Dr Mahmoud Alhussami Probability Probability theory developed from the study of games of chance like dice and cards. A process like flipping a coin, rolling a die or drawing a card from

More information

Learning Objectives for Stat 225

Learning Objectives for Stat 225 Learning Objectives for Stat 225 08/20/12 Introduction to Probability: Get some general ideas about probability, and learn how to use sample space to compute the probability of a specific event. Set Theory:

More information

The point value of each problem is in the left-hand margin. You must show your work to receive any credit, except in problem 1. Work neatly.

The point value of each problem is in the left-hand margin. You must show your work to receive any credit, except in problem 1. Work neatly. Introduction to Statistics Math 1040 Sample Final Exam - Chapters 1-11 6 Problem Pages Time Limit: 1 hour and 50 minutes Open Textbook Calculator Allowed: Scientific Name: The point value of each problem

More information

Chapter 01 : What is Statistics?

Chapter 01 : What is Statistics? Chapter 01 : What is Statistics? Feras Awad Data: The information coming from observations, counts, measurements, and responses. Statistics: The science of collecting, organizing, analyzing, and interpreting

More information

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007)

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007) FROM: PAGANO, R. R. (007) I. INTRODUCTION: DISTINCTION BETWEEN PARAMETRIC AND NON-PARAMETRIC TESTS Statistical inference tests are often classified as to whether they are parametric or nonparametric Parameter

More information

Lecture Slides. Elementary Statistics Twelfth Edition. by Mario F. Triola. and the Triola Statistics Series. Section 3.1- #

Lecture Slides. Elementary Statistics Twelfth Edition. by Mario F. Triola. and the Triola Statistics Series. Section 3.1- # Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series by Mario F. Triola Chapter 3 Statistics for Describing, Exploring, and Comparing Data 3-1 Review and Preview 3-2 Measures

More information

Probability. Chapter 1 Probability. A Simple Example. Sample Space and Probability. Sample Space and Event. Sample Space (Two Dice) Probability

Probability. Chapter 1 Probability. A Simple Example. Sample Space and Probability. Sample Space and Event. Sample Space (Two Dice) Probability Probability Chapter 1 Probability 1.1 asic Concepts researcher claims that 10% of a large population have disease H. random sample of 100 people is taken from this population and examined. If 20 people

More information

AP Statistics Cumulative AP Exam Study Guide

AP Statistics Cumulative AP Exam Study Guide AP Statistics Cumulative AP Eam Study Guide Chapters & 3 - Graphs Statistics the science of collecting, analyzing, and drawing conclusions from data. Descriptive methods of organizing and summarizing statistics

More information

Statistics Primer. A Brief Overview of Basic Statistical and Probability Principles. Essential Statistics for Data Analysts Using Excel

Statistics Primer. A Brief Overview of Basic Statistical and Probability Principles. Essential Statistics for Data Analysts Using Excel Statistics Primer A Brief Overview of Basic Statistical and Probability Principles Liberty J. Munson, PhD 9/19/16 Essential Statistics for Data Analysts Using Excel Table of Contents What is a Variable?...

More information

Macomb Community College Department of Mathematics. Review for the Math 1340 Final Exam

Macomb Community College Department of Mathematics. Review for the Math 1340 Final Exam Macomb Community College Department of Mathematics Review for the Math 0 Final Exam WINTER 0 MATH 0 Practice Final Exam WI0 Math0PF/lm Page of MATH 0 Practice Final Exam MATH 0 DEPARTMENT REVIEW FOR THE

More information

SESSION 5 Descriptive Statistics

SESSION 5 Descriptive Statistics SESSION 5 Descriptive Statistics Descriptive statistics are used to describe the basic features of the data in a study. They provide simple summaries about the sample and the measures. Together with simple

More information

The science of learning from data.

The science of learning from data. STATISTICS (PART 1) The science of learning from data. Numerical facts Collection of methods for planning experiments, obtaining data and organizing, analyzing, interpreting and drawing the conclusions

More information

Sets and Set notation. Algebra 2 Unit 8 Notes

Sets and Set notation. Algebra 2 Unit 8 Notes Sets and Set notation Section 11-2 Probability Experimental Probability experimental probability of an event: Theoretical Probability number of time the event occurs P(event) = number of trials Sample

More information

3.2 Probability Rules

3.2 Probability Rules 3.2 Probability Rules The idea of probability rests on the fact that chance behavior is predictable in the long run. In the last section, we used simulation to imitate chance behavior. Do we always need

More information

Lecture Slides. Elementary Statistics Eleventh Edition. by Mario F. Triola. and the Triola Statistics Series 4.1-1

Lecture Slides. Elementary Statistics Eleventh Edition. by Mario F. Triola. and the Triola Statistics Series 4.1-1 Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by Mario F. Triola 4.1-1 4-1 Review and Preview Chapter 4 Probability 4-2 Basic Concepts of Probability 4-3 Addition

More information

AP Final Review II Exploring Data (20% 30%)

AP Final Review II Exploring Data (20% 30%) AP Final Review II Exploring Data (20% 30%) Quantitative vs Categorical Variables Quantitative variables are numerical values for which arithmetic operations such as means make sense. It is usually a measure

More information

Salt Lake Community College MATH 1040 Final Exam Fall Semester 2011 Form E

Salt Lake Community College MATH 1040 Final Exam Fall Semester 2011 Form E Salt Lake Community College MATH 1040 Final Exam Fall Semester 011 Form E Name Instructor Time Limit: 10 minutes Any hand-held calculator may be used. Computers, cell phones, or other communication devices

More information

Descriptive Statistics-I. Dr Mahmoud Alhussami

Descriptive Statistics-I. Dr Mahmoud Alhussami Descriptive Statistics-I Dr Mahmoud Alhussami Biostatistics What is the biostatistics? A branch of applied math. that deals with collecting, organizing and interpreting data using well-defined procedures.

More information

Intermediate Math Circles November 8, 2017 Probability II

Intermediate Math Circles November 8, 2017 Probability II Intersection of Events and Independence Consider two groups of pairs of events Intermediate Math Circles November 8, 017 Probability II Group 1 (Dependent Events) A = {a sales associate has training} B

More information

Lecture Notes for BUSINESS STATISTICS - BMGT 571. Chapters 1 through 6. Professor Ahmadi, Ph.D. Department of Management

Lecture Notes for BUSINESS STATISTICS - BMGT 571. Chapters 1 through 6. Professor Ahmadi, Ph.D. Department of Management Lecture Notes for BUSINESS STATISTICS - BMGT 571 Chapters 1 through 6 Professor Ahmadi, Ph.D. Department of Management Revised May 005 Glossary of Terms: Statistics Chapter 1 Data Data Set Elements Variable

More information

Ø Set of mutually exclusive categories. Ø Classify or categorize subject. Ø No meaningful order to categorization.

Ø Set of mutually exclusive categories. Ø Classify or categorize subject. Ø No meaningful order to categorization. Statistical Tools in Evaluation HPS 41 Fall 213 Dr. Joe G. Schmalfeldt Types of Scores Continuous Scores scores with a potentially infinite number of values. Discrete Scores scores limited to a specific

More information

Chapter 2: Tools for Exploring Univariate Data

Chapter 2: Tools for Exploring Univariate Data Stats 11 (Fall 2004) Lecture Note Introduction to Statistical Methods for Business and Economics Instructor: Hongquan Xu Chapter 2: Tools for Exploring Univariate Data Section 2.1: Introduction What is

More information

MIDTERM EXAMINATION STA301- Statistics and Probability (Session - 4) Question No: 1 (Marks: 1) - Please choose one 10! =. 362880 3628800 362280 362800 Question No: 2 (Marks: 1) - Please choose one If a

More information

A SHORT INTRODUCTION TO PROBABILITY

A SHORT INTRODUCTION TO PROBABILITY A Lecture for B.Sc. 2 nd Semester, Statistics (General) A SHORT INTRODUCTION TO PROBABILITY By Dr. Ajit Goswami Dept. of Statistics MDKG College, Dibrugarh 19-Apr-18 1 Terminology The possible outcomes

More information

Ø Set of mutually exclusive categories. Ø Classify or categorize subject. Ø No meaningful order to categorization.

Ø Set of mutually exclusive categories. Ø Classify or categorize subject. Ø No meaningful order to categorization. Statistical Tools in Evaluation HPS 41 Dr. Joe G. Schmalfeldt Types of Scores Continuous Scores scores with a potentially infinite number of values. Discrete Scores scores limited to a specific number

More information

Basic Statistical Analysis

Basic Statistical Analysis indexerrt.qxd 8/21/2002 9:47 AM Page 1 Corrected index pages for Sprinthall Basic Statistical Analysis Seventh Edition indexerrt.qxd 8/21/2002 9:47 AM Page 656 Index Abscissa, 24 AB-STAT, vii ADD-OR rule,

More information

20 Hypothesis Testing, Part I

20 Hypothesis Testing, Part I 20 Hypothesis Testing, Part I Bob has told Alice that the average hourly rate for a lawyer in Virginia is $200 with a standard deviation of $50, but Alice wants to test this claim. If Bob is right, she

More information

Statistics Handbook. All statistical tables were computed by the author.

Statistics Handbook. All statistical tables were computed by the author. Statistics Handbook Contents Page Wilcoxon rank-sum test (Mann-Whitney equivalent) Wilcoxon matched-pairs test 3 Normal Distribution 4 Z-test Related samples t-test 5 Unrelated samples t-test 6 Variance

More information

If S = {O 1, O 2,, O n }, where O i is the i th elementary outcome, and p i is the probability of the i th elementary outcome, then

If S = {O 1, O 2,, O n }, where O i is the i th elementary outcome, and p i is the probability of the i th elementary outcome, then 1.1 Probabilities Def n: A random experiment is a process that, when performed, results in one and only one of many observations (or outcomes). The sample space S is the set of all elementary outcomes

More information

TOPIC: Descriptive Statistics Single Variable

TOPIC: Descriptive Statistics Single Variable TOPIC: Descriptive Statistics Single Variable I. Numerical data summary measurements A. Measures of Location. Measures of central tendency Mean; Median; Mode. Quantiles - measures of noncentral tendency

More information

Review for Exam #1. Chapter 1. The Nature of Data. Definitions. Population. Sample. Quantitative data. Qualitative (attribute) data

Review for Exam #1. Chapter 1. The Nature of Data. Definitions. Population. Sample. Quantitative data. Qualitative (attribute) data Review for Exam #1 1 Chapter 1 Population the complete collection of elements (scores, people, measurements, etc.) to be studied Sample a subcollection of elements drawn from a population 11 The Nature

More information

Chapter Fifteen. Frequency Distribution, Cross-Tabulation, and Hypothesis Testing

Chapter Fifteen. Frequency Distribution, Cross-Tabulation, and Hypothesis Testing Chapter Fifteen Frequency Distribution, Cross-Tabulation, and Hypothesis Testing Copyright 2010 Pearson Education, Inc. publishing as Prentice Hall 15-1 Internet Usage Data Table 15.1 Respondent Sex Familiarity

More information

Introduction to Statistics

Introduction to Statistics Introduction to Statistics Data and Statistics Data consists of information coming from observations, counts, measurements, or responses. Statistics is the science of collecting, organizing, analyzing,

More information

Section 1.1. Data - Collections of observations (such as measurements, genders, survey responses, etc.)

Section 1.1. Data - Collections of observations (such as measurements, genders, survey responses, etc.) Section 1.1 Statistics - The science of planning studies and experiments, obtaining data, and then organizing, summarizing, presenting, analyzing, interpreting, and drawing conclusions based on the data.

More information

COVENANT UNIVERSITY NIGERIA TUTORIAL KIT OMEGA SEMESTER PROGRAMME: ECONOMICS

COVENANT UNIVERSITY NIGERIA TUTORIAL KIT OMEGA SEMESTER PROGRAMME: ECONOMICS COVENANT UNIVERSITY NIGERIA TUTORIAL KIT OMEGA SEMESTER PROGRAMME: ECONOMICS COURSE: CBS 221 DISCLAIMER The contents of this document are intended for practice and leaning purposes at the undergraduate

More information

One-Way ANOVA. Some examples of when ANOVA would be appropriate include:

One-Way ANOVA. Some examples of when ANOVA would be appropriate include: One-Way ANOVA 1. Purpose Analysis of variance (ANOVA) is used when one wishes to determine whether two or more groups (e.g., classes A, B, and C) differ on some outcome of interest (e.g., an achievement

More information

AIM HIGH SCHOOL. Curriculum Map W. 12 Mile Road Farmington Hills, MI (248)

AIM HIGH SCHOOL. Curriculum Map W. 12 Mile Road Farmington Hills, MI (248) AIM HIGH SCHOOL Curriculum Map 2923 W. 12 Mile Road Farmington Hills, MI 48334 (248) 702-6922 www.aimhighschool.com COURSE TITLE: Statistics DESCRIPTION OF COURSE: PREREQUISITES: Algebra 2 Students will

More information

16.400/453J Human Factors Engineering. Design of Experiments II

16.400/453J Human Factors Engineering. Design of Experiments II J Human Factors Engineering Design of Experiments II Review Experiment Design and Descriptive Statistics Research question, independent and dependent variables, histograms, box plots, etc. Inferential

More information

Overview of Dispersion. Standard. Deviation

Overview of Dispersion. Standard. Deviation 15.30 STATISTICS UNIT II: DISPERSION After reading this chapter, students will be able to understand: LEARNING OBJECTIVES To understand different measures of Dispersion i.e Range, Quartile Deviation, Mean

More information

Probability and Statistics Notes

Probability and Statistics Notes Probability and Statistics Notes Chapter One Jesse Crawford Department of Mathematics Tarleton State University (Tarleton State University) Chapter One Notes 1 / 71 Outline 1 A Sketch of Probability and

More information

Event A: at least one tail observed A:

Event A: at least one tail observed A: Chapter 3 Probability 3.1 Events, sample space, and probability Basic definitions: An is an act of observation that leads to a single outcome that cannot be predicted with certainty. A (or simple event)

More information

Discrete Probability. Chemistry & Physics. Medicine

Discrete Probability. Chemistry & Physics. Medicine Discrete Probability The existence of gambling for many centuries is evidence of long-running interest in probability. But a good understanding of probability transcends mere gambling. The mathematics

More information

STA301- Statistics and Probability Solved Subjective From Final term Papers. STA301- Statistics and Probability Final Term Examination - Spring 2012

STA301- Statistics and Probability Solved Subjective From Final term Papers. STA301- Statistics and Probability Final Term Examination - Spring 2012 STA30- Statistics and Probability Solved Subjective From Final term Papers Feb 6,03 MC004085 Moaaz.pk@gmail.com Mc004085@gmail.com PSMD0 STA30- Statistics and Probability Final Term Examination - Spring

More information

Topic 3: Introduction to Probability

Topic 3: Introduction to Probability Topic 3: Introduction to Probability 1 Contents 1. Introduction 2. Simple Definitions 3. Types of Probability 4. Theorems of Probability 5. Probabilities under conditions of statistically independent events

More information

Probability and Probability Distributions. Dr. Mohammed Alahmed

Probability and Probability Distributions. Dr. Mohammed Alahmed Probability and Probability Distributions 1 Probability and Probability Distributions Usually we want to do more with data than just describing them! We might want to test certain specific inferences about

More information

Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami

Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami Parametric Assumptions The observations must be independent. Dependent variable should be continuous

More information

A is one of the categories into which qualitative data can be classified.

A is one of the categories into which qualitative data can be classified. Chapter 2 Methods for Describing Sets of Data 2.1 Describing qualitative data Recall qualitative data: non-numerical or categorical data Basic definitions: A is one of the categories into which qualitative

More information

Marquette University Executive MBA Program Statistics Review Class Notes Summer 2018

Marquette University Executive MBA Program Statistics Review Class Notes Summer 2018 Marquette University Executive MBA Program Statistics Review Class Notes Summer 2018 Chapter One: Data and Statistics Statistics A collection of procedures and principles

More information

PROBABILITY.

PROBABILITY. PROBABILITY PROBABILITY(Basic Terminology) Random Experiment: If in each trial of an experiment conducted under identical conditions, the outcome is not unique, but may be any one of the possible outcomes,

More information

Introduction to Basic Statistics Version 2

Introduction to Basic Statistics Version 2 Introduction to Basic Statistics Version 2 Pat Hammett, Ph.D. University of Michigan 2014 Instructor Comments: This document contains a brief overview of basic statistics and core terminology/concepts

More information

dates given in your syllabus.

dates given in your syllabus. Slide 2-1 For exams (MD1, MD2, and Final): You may bring one 8.5 by 11 sheet of paper with formulas and notes written or typed on both sides to each exam. For the rest of the quizzes, you will take your

More information

LC OL - Statistics. Types of Data

LC OL - Statistics. Types of Data LC OL - Statistics Types of Data Question 1 Characterise each of the following variables as numerical or categorical. In each case, list any three possible values for the variable. (i) Eye colours in a

More information

Chapter # classifications of unlikely, likely, or very likely to describe possible buying of a product?

Chapter # classifications of unlikely, likely, or very likely to describe possible buying of a product? A. Attribute data B. Numerical data C. Quantitative data D. Sample data E. Qualitative data F. Statistic G. Parameter Chapter #1 Match the following descriptions with the best term or classification given

More information

Math 1040 Final Exam Form A Introduction to Statistics Fall Semester 2010

Math 1040 Final Exam Form A Introduction to Statistics Fall Semester 2010 Math 1040 Final Exam Form A Introduction to Statistics Fall Semester 2010 Instructor Name Time Limit: 120 minutes Any calculator is okay. Necessary tables and formulas are attached to the back of the exam.

More information

Class 26: review for final exam 18.05, Spring 2014

Class 26: review for final exam 18.05, Spring 2014 Probability Class 26: review for final eam 8.05, Spring 204 Counting Sets Inclusion-eclusion principle Rule of product (multiplication rule) Permutation and combinations Basics Outcome, sample space, event

More information

GLOSSARY. a n + n. a n 1 b + + n. a n r b r + + n C 1. C r. C n

GLOSSARY. a n + n. a n 1 b + + n. a n r b r + + n C 1. C r. C n GLOSSARY A absolute cell referencing A spreadsheet feature that blocks automatic adjustment of cell references when formulas are moved or copied. References preceded by a dollar sign $A$1, for example

More information

Elisha Mae Kostka 243 Assignment Mock Test 1 due 02/11/2015 at 09:01am PST

Elisha Mae Kostka 243 Assignment Mock Test 1 due 02/11/2015 at 09:01am PST Elisha Mae Kostka 243 Assignment Mock Test 1 due 02/11/2015 at 09:01am PST 1. (1 pt) Luis Gonzalez began his career as a major league baseball player in 1990. You are given a sample of the number of homeruns

More information

Math 221, REVIEW, Instructor: Susan Sun Nunamaker

Math 221, REVIEW, Instructor: Susan Sun Nunamaker Math 221, REVIEW, Instructor: Susan Sun Nunamaker Good Luck & Contact me through through e-mail if you have any questions. 1. Bar graphs can only be vertical. a. true b. false 2.

More information

Probability deals with modeling of random phenomena (phenomena or experiments whose outcomes may vary)

Probability deals with modeling of random phenomena (phenomena or experiments whose outcomes may vary) Chapter 14 From Randomness to Probability How to measure a likelihood of an event? How likely is it to answer correctly one out of two true-false questions on a quiz? Is it more, less, or equally likely

More information

Discrete Random Variables

Discrete Random Variables Discrete Random Variables An Undergraduate Introduction to Financial Mathematics J. Robert Buchanan 2014 Introduction The markets can be thought of as a complex interaction of a large number of random

More information

8 Probability. An event occurs if, when the experiment is performed, the outcome is in that event.

8 Probability. An event occurs if, when the experiment is performed, the outcome is in that event. 8 Probability When an experiment is performed, an outcome is observed. A set of possible outcomes is an event. The sample space for the experiment is the set of all possible outcomes. An experiment must

More information

Chapter 4. Displaying and Summarizing. Quantitative Data

Chapter 4. Displaying and Summarizing. Quantitative Data STAT 141 Introduction to Statistics Chapter 4 Displaying and Summarizing Quantitative Data Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter 2015 1 / 31 4.1 Histograms 1 We divide the range

More information

Note: Solve these papers by yourself This VU Group is not responsible for any solved content. Paper 1. Question No: 3 ( Marks: 1 ) - Please choose one

Note: Solve these papers by yourself This VU Group is not responsible for any solved content. Paper 1. Question No: 3 ( Marks: 1 ) - Please choose one Paper Composed & Solved STA30 Finalterm Papers 7 Papers Solved.. By Arman Makhani Statistic and Probability STA30 7Final term paper Question No: ( Marks: ) - Please choose one Mean deviation is always:

More information

Histograms allow a visual interpretation

Histograms allow a visual interpretation Chapter 4: Displaying and Summarizing i Quantitative Data s allow a visual interpretation of quantitative (numerical) data by indicating the number of data points that lie within a range of values, called

More information

Lesson One Hundred and Sixty-One Normal Distribution for some Resolution

Lesson One Hundred and Sixty-One Normal Distribution for some Resolution STUDENT MANUAL ALGEBRA II / LESSON 161 Lesson One Hundred and Sixty-One Normal Distribution for some Resolution Today we re going to continue looking at data sets and how they can be represented in different

More information

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1 Lecture Slides Elementary Statistics Tenth Edition and the Triola Statistics Series by Mario F. Triola Slide 1 4-1 Overview 4-2 Fundamentals 4-3 Addition Rule Chapter 4 Probability 4-4 Multiplication Rule:

More information

Math 2000 Practice Final Exam: Homework problems to review. Problem numbers

Math 2000 Practice Final Exam: Homework problems to review. Problem numbers Math 2000 Practice Final Exam: Homework problems to review Pages: Problem numbers 52 20 65 1 181 14 189 23, 30 245 56 256 13 280 4, 15 301 21 315 18 379 14 388 13 441 13 450 10 461 1 553 13, 16 561 13,

More information

Chapter 4 Probability

Chapter 4 Probability 4-1 Review and Preview Chapter 4 Probability 4-2 Basic Concepts of Probability 4-3 Addition Rule 4-4 Multiplication Rule: Basics 4-5 Multiplication Rule: Complements and Conditional Probability 4-6 Counting

More information

P(A) = Definitions. Overview. P - denotes a probability. A, B, and C - denote specific events. P (A) - Chapter 3 Probability

P(A) = Definitions. Overview. P - denotes a probability. A, B, and C - denote specific events. P (A) - Chapter 3 Probability Chapter 3 Probability Slide 1 Slide 2 3-1 Overview 3-2 Fundamentals 3-3 Addition Rule 3-4 Multiplication Rule: Basics 3-5 Multiplication Rule: Complements and Conditional Probability 3-6 Probabilities

More information

Probability 5-4 The Multiplication Rules and Conditional Probability

Probability 5-4 The Multiplication Rules and Conditional Probability Outline Lecture 8 5-1 Introduction 5-2 Sample Spaces and 5-3 The Addition Rules for 5-4 The Multiplication Rules and Conditional 5-11 Introduction 5-11 Introduction as a general concept can be defined

More information

Chapter 3 Data Description

Chapter 3 Data Description Chapter 3 Data Description Section 3.1: Measures of Central Tendency Section 3.2: Measures of Variation Section 3.3: Measures of Position Section 3.1: Measures of Central Tendency Definition of Average

More information

Chapter 1 Statistical Inference

Chapter 1 Statistical Inference Chapter 1 Statistical Inference causal inference To infer causality, you need a randomized experiment (or a huge observational study and lots of outside information). inference to populations Generalizations

More information

Using SPSS for One Way Analysis of Variance

Using SPSS for One Way Analysis of Variance Using SPSS for One Way Analysis of Variance This tutorial will show you how to use SPSS version 12 to perform a one-way, between- subjects analysis of variance and related post-hoc tests. This tutorial

More information

KDF2C QUANTITATIVE TECHNIQUES FOR BUSINESSDECISION. Unit : I - V

KDF2C QUANTITATIVE TECHNIQUES FOR BUSINESSDECISION. Unit : I - V KDF2C QUANTITATIVE TECHNIQUES FOR BUSINESSDECISION Unit : I - V Unit I: Syllabus Probability and its types Theorems on Probability Law Decision Theory Decision Environment Decision Process Decision tree

More information

This gives us an upper and lower bound that capture our population mean.

This gives us an upper and lower bound that capture our population mean. Confidence Intervals Critical Values Practice Problems 1 Estimation 1.1 Confidence Intervals Definition 1.1 Margin of error. The margin of error of a distribution is the amount of error we predict when

More information

3 PROBABILITY TOPICS

3 PROBABILITY TOPICS Chapter 3 Probability Topics 135 3 PROBABILITY TOPICS Figure 3.1 Meteor showers are rare, but the probability of them occurring can be calculated. (credit: Navicore/flickr) Introduction It is often necessary

More information

STA Module 4 Probability Concepts. Rev.F08 1

STA Module 4 Probability Concepts. Rev.F08 1 STA 2023 Module 4 Probability Concepts Rev.F08 1 Learning Objectives Upon completing this module, you should be able to: 1. Compute probabilities for experiments having equally likely outcomes. 2. Interpret

More information

Module 1. Probability

Module 1. Probability Module 1 Probability 1. Introduction In our daily life we come across many processes whose nature cannot be predicted in advance. Such processes are referred to as random processes. The only way to derive

More information

Chapter. Probability

Chapter. Probability Chapter 3 Probability Section 3.1 Basic Concepts of Probability Section 3.1 Objectives Identify the sample space of a probability experiment Identify simple events Use the Fundamental Counting Principle

More information

STAT 200 Chapter 1 Looking at Data - Distributions

STAT 200 Chapter 1 Looking at Data - Distributions STAT 200 Chapter 1 Looking at Data - Distributions What is Statistics? Statistics is a science that involves the design of studies, data collection, summarizing and analyzing the data, interpreting the

More information

STP 420 INTRODUCTION TO APPLIED STATISTICS NOTES

STP 420 INTRODUCTION TO APPLIED STATISTICS NOTES INTRODUCTION TO APPLIED STATISTICS NOTES PART - DATA CHAPTER LOOKING AT DATA - DISTRIBUTIONS Individuals objects described by a set of data (people, animals, things) - all the data for one individual make

More information