Contingency Tables. Contingency tables are used when we want to looking at two (or more) factors. Each factor might have two more or levels.
|
|
- Silvester Richards
- 5 years ago
- Views:
Transcription
1 Contingency Tables Definition & Examples. Contingency tables are used when we want to looking at two (or more) factors. Each factor might have two more or levels. (Using more than two factors gets complicated, so we won't be looking at this in this class). A factor can be defined as a categorical variable ; examples might be sex, color, age, etc. We already looked at test where we only have one factor - goodness of fit tests. A level cab be defined as the different values for the factor; examples matching the above might be {male, female}, {red, blue, green}, {0-4, 5-9, 10-14, etc.}. For each combination of levels, we have a certain count, which tells us how many individuals (or whatever we re measuring) are in that category. An example (data from 1988, Florida): Safety equipment in use Injury Fatal Non-fatal Total None 1, , ,128 Seat belt , ,878 Total 2, , ,006 We have two factors, injury and safety equipment used, each with two levels. So now that we have an example, what are we interested in? Depends. In the example above, what would be interesting to know? Do seat-belts save lives? Specifically, is p 1 = p 2?, where: p 1 = proportion of fatalities for people not wearing a seat belt (estimated by p 1 ). p 2 = proportion of fatalities for people wearing a seat belt (estimated by p 2 ). Incidentally, we re obviously interested in a one-sided alternative here (why?).
2 But let's do from our old textbook. This one's kind of interesting as it was based on a study done in 1899, and then re-analyzed in 1954 by Goodman and Kruskal. Kruskal is famous for being one of the people to develop the Kruskal-Wallis test, which we discuss in slightly more advanced classes: Hair color dark light Total Eye dark color light Total Now what do we want to know? Does it make sense to figure out if the proportion of people with dark eyes is the same if they have dark hair or light hair? Well, maybe. It might be much more interesting to ask the question like this: Does eye color influence hair color, or vice-versa? Or in statistical language: Is eye color independent of hair color? So, depending on the kind of data, we re either interested in: 1) Comparing proportions 2) Establishing independence/dependence So here's an outline of how to do our test: We phrase our hypotheses accordingly: 1) H 0 : p 1 = p 2, where the p's are the true population proportions, H 1 : p 1 p 2 (or p 1 > p 2, etc.) 2) H 0 : Factor 1 and factor 2 are independent H 1 : Factor 1 and factor 2 are dependent We'll notice that the math will be the same regardless of which set of hypotheses we use. Now let s choose α, just as always Calculate our test statistic: χ 2 * = i=1 c (O i E i ) 2 E i
3 Note that this is identical to that used for the goodness of fit test. But now c is the number of cells in our table (four in both of our examples so far). We ll figure out how to get expected values below. Look up the tabulated χ 2 value Our degrees of freedom are not c-1 anymore. Instead, they are (r-1) x (k-1), where: r = # of rows k = # of columns So for both our examples so far we have: (2-1) x (2-1) = 1 x 1 = 1 Compare our χ 2* with χ 2 table, and if it s larger (or equal to), then reject H 0. State our conclusion in terms of our original hypothesis. So what about our expected values? Let's work with the proportion of people with dark eyes and suppose that hair color has no effect on eye color. This implies that the proportion of people with dark eyes is the same in both columns. Therefore, we figure out what our overall proportion of people with dark eyes using the row totals: Add up all people with dark eyes, and divide this by the total number of people in our sample (in other words, we use the row totals). Now we note that if it doesn't make any difference if you have light or dark hair, then the proportion of people with dark eyes should be the same in both columns. In other words, we now multiply the column totals by the proportion of people with dark eyes. This give us the expected number of people with dark eyes for each column. If we think about this a bit, it gives us the following (easy to remember) formula: Expected value = (Row total) (Column total) (Grand total)
4 So now let's do a few examples: Seat belts and fatalities: State hypotheses: H 0 : The proportion of people killed is the same whether or not they are wearing a seatbelt or: H 0 : p 1 = p 2 (as defined above) H 1 : The proportions are not the same (we ll stick with a two-sided test for the moment) Let's use α =.05 We calculate our expected values from our observed values: Safety equipment in use Injury Fatal Non-fatal Total None 1, , ,128 Seat belt , ,878 Total 2, , ,006 So, our first expected value (fatal, none): (2,111 x 167,128)/580,006 = Our second expected value (non-fatal, none): (577,895 x 167,128)/580,006 = 166, Our third expected value (fatal, seat belt): (2,111 x 412,878)/580,006 = 1, And finally, (non-fatal, seat belt): (577,895 x 412,878)/580,006 = 411,375.28
5 We can put this in a table if we want (to keep it straight): Expected values: Safety equipment in use Injury Fatal Non-fatal Total None , ,128 Seat belt 1, , ,878 Total 2, , ,006 Now let s calculate our χ 2* : χ 2 * = ( ) (165, ,519.72)2 166, ( ) (412, ,375)2 411,375,28 = 2, If we look up our critical value of χ 2 in the table (1 d.f., and α =.05), we get: χ 2.05,1 = 3.84 So we reject H 0 and conclude that seat belts do affect the outcome of a traffic accident. Incidentally, p < 1 x , though I sort of doubt the accuracy of that figure. However, we suspect seatbelts save lives, so what we really wanted was a one sided test: Proceed as above, though now you re alternative hypothesis is H 1 : p 1 > p 2 (p 1 is the proportion of fatalities for folks not wearing seatbelts). Make sure the data deviate from the null hypothesis in the direction of your alternative, otherwise STOP. In other words, make sure p 1 p 2 (notice that p 1 = , p 2 = , so we're okay) Now just use the appropriate column in your χ 2 tables (divide by two): 2 table 2 =.05,1 = 2.71 And again we get to reject
6 Now let s do our second example: Notice that the χ 2 table value is lower, which makes rejection easier and gives us more power (not really needed in this example, but still true!). Hair color dark light Total Eye dark color light Total H 0 : Eye color and Hair color are independent H 1 : They are not independent α =.05 Calculate expected values (I m skipping the details, they re in your text, and I went through them above): Hair color dark light Eye dark color light Calculate χ 2* (the same as usual, I m skipping the details): Our tabulated χ 2 = 3.84 χ 2* = So we reject our H 0, and conclude H 1. Do we want to do anything else?? Yes, we can see which direction our sample deviates from our H 0. In other words, since eye color and hair color are not independent, we can look and see if dark eyes occurs more commonly with dark hair, or if dark eyes occurs more commonly with light hair: If a person has dark hair, the proportion of dark eyes is 729 / 3858 = If a person has light hair, the proportion of dark eyes is 131 / 2945 = From that information, we can conclude that dark hair goes with dark eyes, and therefore light hair with light eyes. (No surprise: blond, blue-eyed, etc. etc. )
7 Note that to calculate the proportions above we used the column totals in the denominator of both calculations. You can also use the row totals - as long as you're consistent. You'd be calculating different proportions, but you should still be able to interpret them: E.g., if you have dark eyes, the proportion of people with dark hair is, and then if you have light eyes the proportion of people with dark hair is, and so on. R x K tables. We refer to any tables bigger than 2 x 2 as R x K This is pretty easy to deal with. With the possible exception of figuring out your hypotheses, you know how to do this. Let's look at three different species of squirrel and compare food preferences: Peanuts Walnuts TOTAL Species A Species B Species C TOTAL An obvious thing to compare is if there's a difference in food preference between the three species. H 0 : The proportions of peanuts and walnuts is the same for all three species of squirrel. H 1 : The proportions are not the same. (Note that we can't do a directional or one sided alternative once we're bigger than 2 x 2 tables). α =.05 χ 2* = (calculated the same way as always!) df = ν = 2 (r-1)(k-1) = 2 x 1 = 2 From our table, the critical value of chi-square with 2 d.f. and an α of.05 = 5.991, so we reject. Our conclusion is that the proportions of blood types is not the same in the two groups (some blood types are more prone to getting ulcers - Biological note: what actually causes ulcers?).
8 IV) Comments: One can calculate things like Odds ratios and relative risk for tables like this. These are actually very important in medical trials. We don't have the time to go into the details, but notice that the relative risk isn't that difficult to calculate: For example, let's look at the risk of dying in a car accident when not wearing a seatbelt as opposed to wearing a seatbelt: ^ RR = ^p 1 ^p 2 = = 7.73 Where ^ RR is the estimated relative risk. This tells us that the risk of dying in a car crash is 7.73 times higher than if you're not wearing a seatbelt!! The odds ratio is a similar measure, but requires rather more explanation. As mentioned, both are used extensively in medical trials (e.g., the risk of getting lung cancer if you smoke is about 40 times that of a non-smoker (figures are approximately correct!)) Finally, a quick word about the assumptions. They are identical to those for a goodness of fit test: Random data Smallest expected value 5
Contingency Tables. Safety equipment in use Fatal Non-fatal Total. None 1, , ,128 Seat belt , ,878
Contingency Tables I. Definition & Examples. A) Contingency tables are tables where we are looking at two (or more - but we won t cover three or more way tables, it s way too complicated) factors, each
More informationCategorical Data Analysis. The data are often just counts of how many things each category has.
Categorical Data Analysis So far we ve been looking at continuous data arranged into one or two groups, where each group has more than one observation. E.g., a series of measurements on one or two things.
More informationHypothesis testing I. - In particular, we are talking about statistical hypotheses. [get everyone s finger length!] n =
Hypothesis testing I I. What is hypothesis testing? [Note we re temporarily bouncing around in the book a lot! Things will settle down again in a week or so] - Exactly what it says. We develop a hypothesis,
More informationOne sided tests. An example of a two sided alternative is what we ve been using for our two sample tests:
One sided tests So far all of our tests have been two sided. While this may be a bit easier to understand, this is often not the best way to do a hypothesis test. One simple thing that we can do to get
More informationRegression, Part I. - In correlation, it would be irrelevant if we changed the axes on our graph.
Regression, Part I I. Difference from correlation. II. Basic idea: A) Correlation describes the relationship between two variables, where neither is independent or a predictor. - In correlation, it would
More informationHypothesis Testing: Chi-Square Test 1
Hypothesis Testing: Chi-Square Test 1 November 9, 2017 1 HMS, 2017, v1.0 Chapter References Diez: Chapter 6.3 Navidi, Chapter 6.10 Chapter References 2 Chi-square Distributions Let X 1, X 2,... X n be
More informationViolating the normal distribution assumption. So what do you do if the data are not normal and you still need to perform a test?
Violating the normal distribution assumption So what do you do if the data are not normal and you still need to perform a test? Remember, if your n is reasonably large, don t bother doing anything. Your
More informationModule 03 Lecture 14 Inferential Statistics ANOVA and TOI
Introduction of Data Analytics Prof. Nandan Sudarsanam and Prof. B Ravindran Department of Management Studies and Department of Computer Science and Engineering Indian Institute of Technology, Madras Module
More informationRegression, part II. I. What does it all mean? A) Notice that so far all we ve done is math.
Regression, part II I. What does it all mean? A) Notice that so far all we ve done is math. 1) One can calculate the Least Squares Regression Line for anything, regardless of any assumptions. 2) But, if
More informationChapter 10: Chi-Square and F Distributions
Chapter 10: Chi-Square and F Distributions Chapter Notes 1 Chi-Square: Tests of Independence 2 4 & of Homogeneity 2 Chi-Square: Goodness of Fit 5 6 3 Testing & Estimating a Single Variance 7 10 or Standard
More informationThis is particularly true if you see long tails in your data. What are you testing? That the two distributions are the same!
Two sample tests (part II): What to do if your data are not distributed normally: Option 1: if your sample size is large enough, don't worry - go ahead and use a t-test (the CLT will take care of non-normal
More informationLecture 9. Selected material from: Ch. 12 The analysis of categorical data and goodness of fit tests
Lecture 9 Selected material from: Ch. 12 The analysis of categorical data and goodness of fit tests Univariate categorical data Univariate categorical data are best summarized in a one way frequency table.
More informationMITOCW ocw f99-lec09_300k
MITOCW ocw-18.06-f99-lec09_300k OK, this is linear algebra lecture nine. And this is a key lecture, this is where we get these ideas of linear independence, when a bunch of vectors are independent -- or
More informationDescriptive Statistics (And a little bit on rounding and significant digits)
Descriptive Statistics (And a little bit on rounding and significant digits) Now that we know what our data look like, we d like to be able to describe it numerically. In other words, how can we represent
More informationCorrelation. We don't consider one variable independent and the other dependent. Does x go up as y goes up? Does x go down as y goes up?
Comment: notes are adapted from BIOL 214/312. I. Correlation. Correlation A) Correlation is used when we want to examine the relationship of two continuous variables. We are not interested in prediction.
More information- measures the center of our distribution. In the case of a sample, it s given by: y i. y = where n = sample size.
Descriptive Statistics: One of the most important things we can do is to describe our data. Some of this can be done graphically (you should be familiar with histograms, boxplots, scatter plots and so
More informationChapter 26: Comparing Counts (Chi Square)
Chapter 6: Comparing Counts (Chi Square) We ve seen that you can turn a qualitative variable into a quantitative one (by counting the number of successes and failures), but that s a compromise it forces
More informationTwo sided, two sample t-tests. a) IQ = 100 b) Average height for men = c) Average number of white blood cells per cubic millimeter is 7,000.
Two sided, two sample t-tests. I. Brief review: 1) We are interested in how a sample compares to some pre-conceived notion. For example: a) IQ = 100 b) Average height for men = 5 10. c) Average number
More informationStatistics 3858 : Contingency Tables
Statistics 3858 : Contingency Tables 1 Introduction Before proceeding with this topic the student should review generalized likelihood ratios ΛX) for multinomial distributions, its relation to Pearson
More informationChapter 10. Chapter 10. Multinomial Experiments and. Multinomial Experiments and Contingency Tables. Contingency Tables.
Chapter 10 Multinomial Experiments and Contingency Tables 1 Chapter 10 Multinomial Experiments and Contingency Tables 10-1 1 Overview 10-2 2 Multinomial Experiments: of-fitfit 10-3 3 Contingency Tables:
More informationIs there a connection between gender, maths grade, hair colour and eye colour? Contents
5 Sample project This Maths Studies project has been graded by a moderator. As you read through it, you will see comments from the moderator in boxes like this: At the end of the sample project is a summary
More informationChapter 1 Review of Equations and Inequalities
Chapter 1 Review of Equations and Inequalities Part I Review of Basic Equations Recall that an equation is an expression with an equal sign in the middle. Also recall that, if a question asks you to solve
More informationSTAC51: Categorical data Analysis
STAC51: Categorical data Analysis Mahinda Samarakoon January 26, 2016 Mahinda Samarakoon STAC51: Categorical data Analysis 1 / 32 Table of contents Contingency Tables 1 Contingency Tables Mahinda Samarakoon
More informationChi Square Analysis M&M Statistics. Name Period Date
Chi Square Analysis M&M Statistics Name Period Date Have you ever wondered why the package of M&Ms you just bought never seems to have enough of your favorite color? Or, why is it that you always seem
More informationMITOCW ocw f99-lec23_300k
MITOCW ocw-18.06-f99-lec23_300k -- and lift-off on differential equations. So, this section is about how to solve a system of first order, first derivative, constant coefficient linear equations. And if
More informationLecture 41 Sections Mon, Apr 7, 2008
Lecture 41 Sections 14.1-14.3 Hampden-Sydney College Mon, Apr 7, 2008 Outline 1 2 3 4 5 one-proportion test that we just studied allows us to test a hypothesis concerning one proportion, or two categories,
More informationNote that we are looking at the true mean, μ, not y. The problem for us is that we need to find the endpoints of our interval (a, b).
Confidence Intervals 1) What are confidence intervals? Simply, an interval for which we have a certain confidence. For example, we are 90% certain that an interval contains the true value of something
More informationMITOCW ocw f99-lec05_300k
MITOCW ocw-18.06-f99-lec05_300k This is lecture five in linear algebra. And, it will complete this chapter of the book. So the last section of this chapter is two point seven that talks about permutations,
More informationDealing with the assumption of independence between samples - introducing the paired design.
Dealing with the assumption of independence between samples - introducing the paired design. a) Suppose you deliberately collect one sample and measure something. Then you collect another sample in such
More informationRama Nada. -Ensherah Mokheemer. 1 P a g e
- 9 - Rama Nada -Ensherah Mokheemer - 1 P a g e Quick revision: Remember from the last lecture that chi square is an example of nonparametric test, other examples include Kruskal Wallis, Mann Whitney and
More informationSTAT 705: Analysis of Contingency Tables
STAT 705: Analysis of Contingency Tables Timothy Hanson Department of Statistics, University of South Carolina Stat 705: Analysis of Contingency Tables 1 / 45 Outline of Part I: models and parameters Basic
More informationNote that we are looking at the true mean, μ, not y. The problem for us is that we need to find the endpoints of our interval (a, b).
Confidence Intervals 1) What are confidence intervals? Simply, an interval for which we have a certain confidence. For example, we are 90% certain that an interval contains the true value of something
More informationFog Chamber Testing the Label: Photo of Fog. Joshua Gutwill 10/29/1999
Fog Chamber Testing the Label: Photo of Fog Joshua Gutwill 10/29/1999 Keywords: < > formative heat&temp exhibit interview 1 Goals/Context Fog Chamber Interview Results Testing Label: Photo of fog on Golden
More information11-2 Multinomial Experiment
Chapter 11 Multinomial Experiments and Contingency Tables 1 Chapter 11 Multinomial Experiments and Contingency Tables 11-11 Overview 11-2 Multinomial Experiments: Goodness-of-fitfit 11-3 Contingency Tables:
More informationExplanation of R 2, and Other Stories
Explanation of R 2, and Other Stories I accidentally gave an incorrect formula for R 2 in class. This summary was initially just going to be me correcting my error, but I've taken the opportunity to clarify
More information- a value calculated or derived from the data.
Descriptive statistics: Note: I'm assuming you know some basics. If you don't, please read chapter 1 on your own. It's pretty easy material, and it gives you a good background as to why we need statistics.
More informationComparing p s Dr. Don Edwards notes (slightly edited and augmented) The Odds for Success
Comparing p s Dr. Don Edwards notes (slightly edited and augmented) The Odds for Success When the experiment consists of a series of n independent trials, and each trial may end in either success or failure,
More informationWe're in interested in Pr{three sixes when throwing a single dice 8 times}. => Y has a binomial distribution, or in official notation, Y ~ BIN(n,p).
Sampling distributions and estimation. 1) A brief review of distributions: We're in interested in Pr{three sixes when throwing a single dice 8 times}. => Y has a binomial distribution, or in official notation,
More informationMITOCW ocw f99-lec01_300k
MITOCW ocw-18.06-f99-lec01_300k Hi. This is the first lecture in MIT's course 18.06, linear algebra, and I'm Gilbert Strang. The text for the course is this book, Introduction to Linear Algebra. And the
More informationLecture 5: ANOVA and Correlation
Lecture 5: ANOVA and Correlation Ani Manichaikul amanicha@jhsph.edu 23 April 2007 1 / 62 Comparing Multiple Groups Continous data: comparing means Analysis of variance Binary data: comparing proportions
More informationMITOCW ocw lec8
MITOCW ocw-5.112-lec8 The following content is provided by MIT OpenCourseWare under a Creative Commons license. Additional information about our license and MIT OpenCourseWare in general is available at
More informationBasic Business Statistics, 10/e
Chapter 1 1-1 Basic Business Statistics 11 th Edition Chapter 1 Chi-Square Tests and Nonparametric Tests Basic Business Statistics, 11e 009 Prentice-Hall, Inc. Chap 1-1 Learning Objectives In this chapter,
More informationLesson 6: Algebra. Chapter 2, Video 1: "Variables"
Lesson 6: Algebra Chapter 2, Video 1: "Variables" Algebra 1, variables. In math, when the value of a number isn't known, a letter is used to represent the unknown number. This letter is called a variable.
More informationLecture 41 Sections Wed, Nov 12, 2008
Lecture 41 Sections 14.1-14.3 Hampden-Sydney College Wed, Nov 12, 2008 Outline 1 2 3 4 5 6 7 one-proportion test that we just studied allows us to test a hypothesis concerning one proportion, or two categories,
More informationMITOCW MIT18_01SCF10Rec_24_300k
MITOCW MIT18_01SCF10Rec_24_300k JOEL LEWIS: Hi. Welcome back to recitation. In lecture, you've been doing related rates problems. I've got another example for you, here. So this one's a really tricky one.
More informationSolving Equations by Adding and Subtracting
SECTION 2.1 Solving Equations by Adding and Subtracting 2.1 OBJECTIVES 1. Determine whether a given number is a solution for an equation 2. Use the addition property to solve equations 3. Determine whether
More informationPsych 230. Psychological Measurement and Statistics
Psych 230 Psychological Measurement and Statistics Pedro Wolf December 9, 2009 This Time. Non-Parametric statistics Chi-Square test One-way Two-way Statistical Testing 1. Decide which test to use 2. State
More informationChapter 2: Describing Contingency Tables - I
: Describing Contingency Tables - I Dipankar Bandyopadhyay Department of Biostatistics, Virginia Commonwealth University BIOS 625: Categorical Data & GLM [Acknowledgements to Tim Hanson and Haitao Chu]
More informationStatistics Handbook. All statistical tables were computed by the author.
Statistics Handbook Contents Page Wilcoxon rank-sum test (Mann-Whitney equivalent) Wilcoxon matched-pairs test 3 Normal Distribution 4 Z-test Related samples t-test 5 Unrelated samples t-test 6 Variance
More informationThe following are generally referred to as the laws or rules of exponents. x a x b = x a+b (5.1) 1 x b a (5.2) (x a ) b = x ab (5.
Chapter 5 Exponents 5. Exponent Concepts An exponent means repeated multiplication. For instance, 0 6 means 0 0 0 0 0 0, or,000,000. You ve probably noticed that there is a logical progression of operations.
More informationStatistics for Managers Using Microsoft Excel
Statistics for Managers Using Microsoft Excel 7 th Edition Chapter 1 Chi-Square Tests and Nonparametric Tests Statistics for Managers Using Microsoft Excel 7e Copyright 014 Pearson Education, Inc. Chap
More informationChapter 10. Prof. Tesler. Math 186 Winter χ 2 tests for goodness of fit and independence
Chapter 10 χ 2 tests for goodness of fit and independence Prof. Tesler Math 186 Winter 2018 Prof. Tesler Ch. 10: χ 2 goodness of fit tests Math 186 / Winter 2018 1 / 26 Multinomial test Consider a k-sided
More informationChapter 6. Logistic Regression. 6.1 A linear model for the log odds
Chapter 6 Logistic Regression In logistic regression, there is a categorical response variables, often coded 1=Yes and 0=No. Many important phenomena fit this framework. The patient survives the operation,
More informationLing 289 Contingency Table Statistics
Ling 289 Contingency Table Statistics Roger Levy and Christopher Manning This is a summary of the material that we ve covered on contingency tables. Contingency tables: introduction Odds ratios Counting,
More informationTo: Amanda From: Daddy Date: 2004 February 19 About: How to solve math problems
to Amanda p.1 To: Amanda From: Daddy Date: 2004 February 19 About: How to solve math problems There are 4 steps in solving the kind of math problem you showed me. I'll list the steps first, then explain
More information11 CHI-SQUARED Introduction. Objectives. How random are your numbers? After studying this chapter you should
11 CHI-SQUARED Chapter 11 Chi-squared Objectives After studying this chapter you should be able to use the χ 2 distribution to test if a set of observations fits an appropriate model; know how to calculate
More informationConfidence intervals
Confidence intervals We now want to take what we ve learned about sampling distributions and standard errors and construct confidence intervals. What are confidence intervals? Simply an interval for which
More informationTwo-sample Categorical data: Testing
Two-sample Categorical data: Testing Patrick Breheny April 1 Patrick Breheny Introduction to Biostatistics (171:161) 1/28 Separate vs. paired samples Despite the fact that paired samples usually offer
More informationStatistics 1L03 - Midterm #2 Review
Statistics 1L03 - Midterm # Review Atinder Bharaj Made with L A TEX October, 01 Introduction As many of you will soon find out, I will not be holding the next midterm review. To make it a bit easier on
More information6: Polynomials and Polynomial Functions
6: Polynomials and Polynomial Functions 6-1: Polynomial Functions Okay you know what a variable is A term is a product of constants and powers of variables (for example: x ; 5xy ) For now, let's restrict
More informationMITOCW ocw-18_02-f07-lec17_220k
MITOCW ocw-18_02-f07-lec17_220k The following content is provided under a Creative Commons license. Your support will help MIT OpenCourseWare continue to offer high quality educational resources for free.
More informationTesting Independence
Testing Independence Dipankar Bandyopadhyay Department of Biostatistics, Virginia Commonwealth University BIOS 625: Categorical Data & GLM 1/50 Testing Independence Previously, we looked at RR = OR = 1
More information( )( b + c) = ab + ac, but it can also be ( )( a) = ba + ca. Let s use the distributive property on a couple of
Factoring Review for Algebra II The saddest thing about not doing well in Algebra II is that almost any math teacher can tell you going into it what s going to trip you up. One of the first things they
More informationCHI SQUARE ANALYSIS 8/18/2011 HYPOTHESIS TESTS SO FAR PARAMETRIC VS. NON-PARAMETRIC
CHI SQUARE ANALYSIS I N T R O D U C T I O N T O N O N - P A R A M E T R I C A N A L Y S E S HYPOTHESIS TESTS SO FAR We ve discussed One-sample t-test Dependent Sample t-tests Independent Samples t-tests
More informationParametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami
Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami Parametric Assumptions The observations must be independent. Dependent variable should be continuous
More informationMITOCW ocw-18_02-f07-lec02_220k
MITOCW ocw-18_02-f07-lec02_220k The following content is provided under a Creative Commons license. Your support will help MIT OpenCourseWare continue to offer high quality educational resources for free.
More informationChapter 9. Inferences from Two Samples. Objective. Notation. Section 9.2. Definition. Notation. q = 1 p. Inferences About Two Proportions
Chapter 9 Inferences from Two Samples 9. Inferences About Two Proportions 9.3 Inferences About Two s (Independent) 9.4 Inferences About Two s (Matched Pairs) 9.5 Comparing Variation in Two Samples Objective
More informationFrequency Distribution Cross-Tabulation
Frequency Distribution Cross-Tabulation 1) Overview 2) Frequency Distribution 3) Statistics Associated with Frequency Distribution i. Measures of Location ii. Measures of Variability iii. Measures of Shape
More informationLecture 28 Chi-Square Analysis
Lecture 28 STAT 225 Introduction to Probability Models April 23, 2014 Whitney Huang Purdue University 28.1 χ 2 test for For a given contingency table, we want to test if two have a relationship or not
More informationWe know from STAT.1030 that the relevant test statistic for equality of proportions is:
2. Chi 2 -tests for equality of proportions Introduction: Two Samples Consider comparing the sample proportions p 1 and p 2 in independent random samples of size n 1 and n 2 out of two populations which
More informationExamples of frequentist probability include games of chance, sample surveys, and randomized experiments. We will focus on frequentist probability sinc
FPPA-Chapters 13,14 and parts of 16,17, and 18 STATISTICS 50 Richard A. Berk Spring, 1997 May 30, 1997 1 Thinking about Chance People talk about \chance" and \probability" all the time. There are many
More informationElementary Algebra Study Guide Some Basic Facts This section will cover the following topics
Elementary Algebra Study Guide Some Basic Facts This section will cover the following topics Notation Order of Operations Notation Math is a language of its own. It has vocabulary and punctuation (notation)
More informationTopic 1. Definitions
S Topic. Definitions. Scalar A scalar is a number. 2. Vector A vector is a column of numbers. 3. Linear combination A scalar times a vector plus a scalar times a vector, plus a scalar times a vector...
More informationMITOCW MITRES18_006F10_26_0602_300k-mp4
MITOCW MITRES18_006F10_26_0602_300k-mp4 FEMALE VOICE: The following content is provided under a Creative Commons license. Your support will help MIT OpenCourseWare continue to offer high quality educational
More informationBusiness Statistics 41000: Homework # 5
Business Statistics 41000: Homework # 5 Drew Creal Due date: Beginning of class in week # 10 Remarks: These questions cover Lectures #7, 8, and 9. Question # 1. Condence intervals and plug-in predictive
More informationMITOCW ocw f99-lec17_300k
MITOCW ocw-18.06-f99-lec17_300k OK, here's the last lecture in the chapter on orthogonality. So we met orthogonal vectors, two vectors, we met orthogonal subspaces, like the row space and null space. Now
More informationWe're in interested in Pr{three sixes when throwing a single dice 8 times}. => Y has a binomial distribution, or in official notation, Y ~ BIN(n,p).
Sampling distributions and estimation. 1) A brief review of distributions: We're in interested in Pr{three sixes when throwing a single dice 8 times}. => Y has a binomial distribution, or in official notation,
More information10: Crosstabs & Independent Proportions
10: Crosstabs & Independent Proportions p. 10.1 P Background < Two independent groups < Binary outcome < Compare binomial proportions P Illustrative example ( oswege.sav ) < Food poisoning following church
More informationProperties of Arithmetic
Excerpt from "Prealgebra" 205 AoPS Inc. 4 6 7 4 5 8 22 23 5 7 0 Arithmetic is being able to count up to twenty without taking o your shoes. Mickey Mouse CHAPTER Properties of Arithmetic. Why Start with
More informationChi-Squared Tests. Semester 1. Chi-Squared Tests
Semester 1 Goodness of Fit Up to now, we have tested hypotheses concerning the values of population parameters such as the population mean or proportion. We have not considered testing hypotheses about
More informationThe Cycloid. and the Kinematic Circumference. by Miles Mathis
return to updates The Cycloid and the Kinematic Circumference First published August 31, 2016 by Miles Mathis Those of you who have read my papers on π=4 will know I have explained that problem using many
More informationQuestion. Hypothesis testing. Example. Answer: hypothesis. Test: true or not? Question. Average is not the mean! μ average. Random deviation or not?
Hypothesis testing Question Very frequently: what is the possible value of μ? Sample: we know only the average! μ average. Random deviation or not? Standard error: the measure of the random deviation.
More informationANOVA - analysis of variance - used to compare the means of several populations.
12.1 One-Way Analysis of Variance ANOVA - analysis of variance - used to compare the means of several populations. Assumptions for One-Way ANOVA: 1. Independent samples are taken using a randomized design.
More informationStatistics and Quantitative Analysis U4320
Statistics and Quantitative Analysis U3 Lecture 13: Explaining Variation Prof. Sharyn O Halloran Explaining Variation: Adjusted R (cont) Definition of Adjusted R So we'd like a measure like R, but one
More informationMITOCW R11. Double Pendulum System
MITOCW R11. Double Pendulum System The following content is provided under a Creative Commons license. Your support will help MIT OpenCourseWare continue to offer high quality educational resources for
More informationExample. χ 2 = Continued on the next page. All cells
Section 11.1 Chi Square Statistic k Categories 1 st 2 nd 3 rd k th Total Observed Frequencies O 1 O 2 O 3 O k n Expected Frequencies E 1 E 2 E 3 E k n O 1 + O 2 + O 3 + + O k = n E 1 + E 2 + E 3 + + E
More informationLesson 11-1: Parabolas
Lesson -: Parabolas The first conic curve we will work with is the parabola. You may think that you ve never really used or encountered a parabola before. Think about it how many times have you been going
More informationCHAPTER 9: HYPOTHESIS TESTING
CHAPTER 9: HYPOTHESIS TESTING THE SECOND LAST EXAMPLE CLEARLY ILLUSTRATES THAT THERE IS ONE IMPORTANT ISSUE WE NEED TO EXPLORE: IS THERE (IN OUR TWO SAMPLES) SUFFICIENT STATISTICAL EVIDENCE TO CONCLUDE
More information22s:152 Applied Linear Regression. Chapter 8: 1-Way Analysis of Variance (ANOVA) 2-Way Analysis of Variance (ANOVA)
22s:152 Applied Linear Regression Chapter 8: 1-Way Analysis of Variance (ANOVA) 2-Way Analysis of Variance (ANOVA) We now consider an analysis with only categorical predictors (i.e. all predictors are
More informationMathematical Notation Math Introduction to Applied Statistics
Mathematical Notation Math 113 - Introduction to Applied Statistics Name : Use Word or WordPerfect to recreate the following documents. Each article is worth 10 points and should be emailed to the instructor
More informationTHE SIMPLE PROOF OF GOLDBACH'S CONJECTURE. by Miles Mathis
THE SIMPLE PROOF OF GOLDBACH'S CONJECTURE by Miles Mathis miles@mileswmathis.com Abstract Here I solve Goldbach's Conjecture by the simplest method possible. I do this by first calculating probabilites
More informationIntroducing Statistics for Bioscientists
Introducing Statistics for Bioscientists Keith Gregson February 14, 2007 Contents 1 Why statistics? 2 2 Basic Statistics 6 3 The Statistical Method 8 4 The Normal Frequency Distribution 9 5 The t-test
More informationMultiple Regression Theory 2006 Samuel L. Baker
MULTIPLE REGRESSION THEORY 1 Multiple Regression Theory 2006 Samuel L. Baker Multiple regression is regression with two or more independent variables on the right-hand side of the equation. Use multiple
More informationPS5: Two Variable Statistics LT3: Linear regression LT4: The test of independence.
PS5: Two Variable Statistics LT3: Linear regression LT4: The test of independence. Example by eye. On a hot day, nine cars were left in the sun in a car parking lot. The length of time each car was left
More informationAnalysis of variance (ANOVA) Comparing the means of more than two groups
Analysis of variance (ANOVA) Comparing the means of more than two groups Example: Cost of mating in male fruit flies Drosophila Treatments: place males with and without unmated (virgin) females Five treatments
More information13.1 Categorical Data and the Multinomial Experiment
Chapter 13 Categorical Data Analysis 13.1 Categorical Data and the Multinomial Experiment Recall Variable: (numerical) variable (i.e. # of students, temperature, height,). (non-numerical, categorical)
More informationUni- and Bivariate Power
Uni- and Bivariate Power Copyright 2002, 2014, J. Toby Mordkoff Note that the relationship between risk and power is unidirectional. Power depends on risk, but risk is completely independent of power.
More informationIntroduction to Survey Analysis!
Introduction to Survey Analysis! Professor Ron Fricker! Naval Postgraduate School! Monterey, California! Reading Assignment:! 2/22/13 None! 1 Goals for this Lecture! Introduction to analysis for surveys!
More informationDiscrete Multivariate Statistics
Discrete Multivariate Statistics Univariate Discrete Random variables Let X be a discrete random variable which, in this module, will be assumed to take a finite number of t different values which are
More informationMITOCW ocw f99-lec30_300k
MITOCW ocw-18.06-f99-lec30_300k OK, this is the lecture on linear transformations. Actually, linear algebra courses used to begin with this lecture, so you could say I'm beginning this course again by
More information