ph: 5.2, 5.6, 5.8, 6.4, 6.5, 6.8, 6.9, 7.2, 7.5 sample mean = sample sd = sample size, n = 9
|
|
- Kathleen Atkins
- 5 years ago
- Views:
Transcription
1 Name: SOLUTIONS Final Part 1 (100 pts) and Final Part 2 (120 pts) For all of the questions below, please show enough work that it is completely clear how your final solution was derived. Sit at least one seat apart and put away laptops and cell phones. The quiz is open book, open notes, open cheat sheets, and open calculator. *1 4 pts) Sally decides to see if she can make her cat s saliva non neutral, i.e. acidic, ph < 7, or basic, ph > 7, by putting her beloved pet on a snow cone diet for a week (see figure to left). At the end of the week she measures the ph of the cat s saliva 10 times. After removing an outlier, she analyzes the following 9 measures. ph: 5.2, 5.6, 5.8, 6.4, 6.5, 6.8, 6.9, 7.2, 7.5 sample mean = sample sd = sample size, n = 9 image used without permission from 13a39e2aaaff36f190d5258ae960 Write the null and alternative hypothesis for a two sided test of this question. Remember to use Greek letters for parameters. Ho: µ ph = 7. Ha: µ ph 7. *2 8 pts) Referring to question 1, calculate an appropriate test statistic to three decimal places. Remember, don t round your numbers during calculations. Round at the very end to get three decimals. ( ) / ( / sqrt(9) ) = / = = In R: > x < c( 5.2, 5.6, 5.8, 6.4, 6.5, 6.8, 6.9, 7.2, 7.5 ) > t.test( x, mu=7 )$statistic t
2 *3 4 pts) Referring to questions 1 2, what is the appropriate critical value for a 5% significance level two sided test? Provide critical value to 3 decimal places. t 8, = *4 4 pts) Referring to question 1 3, what is an appropriate p value for this test? t 8, 0.95 = 1.860, thus the p value < To be explicit, 0.05 < p < In R: > t.test( x, mu=7 )$p.value [1] *5 8 pts) Referring to questions 1 4, calculate a 95% confidence interval for the mean ph of Sally s cat s saliva after exposure to the snow cone diet. Write your answer to three decimals * / sqrt(9) = * = ( , ) = ( 5.844, ) Wrong answer: * = ( 5.933, ) 2
3 *6 3 pts) Referring to question 1, the outlier Sally dropped was a 2.1. Here is the data with the outlier included and corresponding R output. (The question is at the bottom of the page.) > ph < c(2.1, 5.2, 5.6, 5.8, 6.4, 6.5, 6.8, 6.9, 7.2, 7.5) > mean(ph) [1] 6 > sd(ph) [1] > t.test(ph, mu=7) One Sample t test data: ph t = , df = 9, p value = alternative hypothesis: true mean is not equal to 7 95 percent confidence interval: sample estimates: mean of x 6 > wilcox.test(ph, mu=7) Wilcoxon signed rank test with continuity correction data: ph V = 7, p value = alternative hypothesis: true location is not equal to 7 If analyzing all 10 points of data, which test should Sally use and why? Sally should use the Wilcoxon signed rank test because the t test is sensitive to outliers, especially with a small sample size. The outlier makes the assumption of normality questionable, thus a non parametric test is more appropriate. 3
4 *7 4 pts) In a one sample case, the Wilcoxon signed rank test is for testing the null hypothesis, Ho: F(X) is symmetric about µ o, which is often simplified to the easier to understand though technically inaccurate Ho: median(x) = µ o, where µ o depends on the context of the problem. In the context of question 6, what is an appropriate null and alternative hypothesis for the problem? Best answer: Ho: F(X ph ) is symmetric about 7. Ha: F(X ph ) is not symmetric about 7. Acceptable though technically inaccurate answer: Ho: median(x ph ) = 7 Ha: median(x ph ) 7 *8 3 pts) Referring to questions 6 7, write a brief conclusion for the analysis of all 10 data points using a 5% significance level. Include an appropriate p value in your conclusion. We reject the null hypothesis that the distribution of ph measures is symmetric about 7, p value = The data suggests that the cat s saliva is slightly acidic. 4
5 Type -- Color Blue Red Orange Yellow Green Brown Total Milk chocolate Dark *9 8 pts) During class we collected the following M&Ms data. Create a 95% confidence interval for the risk difference (RD) for the risk of blue for Milk Chocolate M&Ms minus the risk of blue for Dark M&Ms. Write your answer to 3 decimal places, remembering to only round your numbers after you re done with all your calculations. We haven t learned the Wilson interval for the risk difference, so we ll use a Wald interval. image used without permission from com/blog/uploaded_images/unsi gneduser_662.unsignedchar_ png ( p milk p dark ) * sqrt( p milk * (1 p milk ) / n milk + p dark * (1 p dark ) / n dark ) ( 86/405 57/417 ) * sqrt( 86/405 * 319/405 / /417 * 360/417 / 417 ) * sqrt( ) * ( , ) ( 0.024, ) 5
6 *10 10 pts) Referring to the data in question 9, create a 95% confidence interval for the odds ratio (OR) for the odds of blue for milk chocolate M&Ms versus the odds of blue for dark M&Ms. Write your answer to 3 decimal places, remembering to only round your numbers after you re done with all your calculations. Rewriting the data in our a, b, c, d table format we have Blue Not Blue Milk a = 86 b = 319 Dark c = 57 d = 360 Odds ratio = ad/bc = (86*360)/(57*319) = To double check that we ve set up the table correctly, we can calculate the odds first and then calculate the ratio. Odds blue for milk = (86/405) / (319/405) = Odds blue for dark = (57/417) / (360/417) = Odds ratio = / = % CI assuming the n s are big enough such that the log(or) is roughly normally distributed. exp( ln(ad/bc) * sqrt(1/a + 1/b + 1/c + 1/d) ) = exp( ln( (86*360)/(57*319) ) * sqrt(1/86 + 1/ /57 + 1/360) ) = exp( ln( ) * sqrt( ) ) = exp( * ) = exp( ) = exp( , ) = ( , ) = ( 1.179, 2.458). 6
7 image used without permission from stories/2009/03/app_store_church_lady.jpg *11 4 pts) In order to ensure that patient examinations are of high quality, a hospital employs a team of reviewers to do routine chart reviews on samples of exams. This team of reviewers rate the examinations as passing or failing. Two new reviewers, Pat and Terry, are being testing on their rating accuracy. They both rate 300 exams for which a gold standard rating has been done. Thus, it is known if their ratings are correct for these 300 exams. The results follow. Pat Correct Pat Incorrect Terry Correct Terry Incorrect Given that Pat and Terry are both rating the same exams, what is an appropriate statistical test with which to test whether Pat or Terry is the better reviewer? Provide the test s name. McNemar s Test for paired categorical data. **12 10 pts) Calculate a p value for a two sided test of the question in problem 11. Write your answer to 4 decimal places, remembering to only round your numbers after you re done with all your calculations. n D, the number of dichordant pairs = 80, which is > 20. Therefore, we can use the normal theory test. Similar rules of thumb, n*po*qo = 80*0.5*0.5 = 20 > 5. Also, n*phat*qhat = 80*48/80*32/80 = 19.2 > 5. So by all rules of thumb, the normal approximation is fine. With continuity correction: ( n A n D /2 ½ ) 2 / (n D /4) = ( 48 80/2 ½ ) 2 / (80/4) = 7.5^2 / 20 = ( n A n B 1 ) 2 / (n A + n B ) = ( ) 2 / 80 = 15^2 / 80 = χ 2 1,.90 = χ 2 1,.95 = Therefore, the p value < Explicitly, 0.05 < p < Remember the χ 2 is for the two sided test, so you don t need to double the p values here. Because χ 2 1 is Z 2, we can be more precise if we wish. z = sqrt(2.8125) = = Using Rosner table 3, p value = 2* = Because we used the Z curve, we did need to double the p value for the two sided test. Without continuity correction: ( n A n B ) 2 / (n A + n B ) = ( ) 2 / 80 = 16^2 / 80 = 3.2 Therefore, 0.05 < p < More precisely, z = sqrt(3.2) = = 1.79, gives us p value = 2* =
8 **13 10 pts) We saw in question 12 that reviewers sometimes give incorrect ratings. In an effort to improve the accuracy of the ratings, the hospital uses the following system. An examination is rated by two reviewers. If the two reviewers agree (both rate pass or both rate fail) then the agreed upon rating is given. If the two reviewers disagree (one rates pass and the other fail), then a third reviewer reviews the examination with their rating breaking the tie. Suppose the probability of a single reviewer correctly rating an exam is 70% for all reviewers. What is the probability that the hospital s method will yield the correct exam rating? Give your answer to three decimal places. Think through all of the scenarios carefully. P(correct rating given) = P(1 st correct & 2 nd correct) + P(1 st correct & 2 nd wrong & 3 rd correct) + P(1 st wrong & 2 nd correct & 3 rd correct) Assuming the reviewers ratings are independent, P(correct rating given) = P(1 st correct ) * P(2 nd correct) + P(1 st correct) * P(2 nd wrong) * P(3 rd correct) + P(1 st wrong) * P(2 nd correct) * P(3 rd correct) = 0.7 * * 0.3 * * 0.7 * 0.7 =
9 **14 10 pts) Sally s parents form a mini IRB and tell their daughter that any of her future experiments on her pets need to be sufficiently powered. Sally wants to redo her snow cone experiment from question 1. If she assumes that the ph of her cat s saliva after the diet will be normally distributed with a mean of 6.5 and a standard deviation of 1, how many ph measures will she need to take to have her study powered at 80%? In other words, what sample size will yield a power of 80%? Assume Sally will be doing a two sided test at a 5% significance level with the assumed standard deviation treated as known. The power for a one sample Z test for the mean of a population with a two sided alternative with a 5% significance level is approximately φ[ µo µ1 / ( σ/sqrt(n) ) ]. We want to choose n such that φ[ µo µ1 / ( σ/sqrt(n) ) ] = We solve as follows. φ[ µo µ1 / ( σ/sqrt(n) ) ] = µo µ1 / ( σ/sqrt(n) ) = 0.85, by choosing the appropriate value from the normal table / ( 1/sqrt(n) ) = 0.85, by inserting the assumed values. 0.5 / ( 1/sqrt(n) ) = sqrt(n) = ( ) / 0.5 sqrt(n) = 5.62 n = 31.6 Therefore, n = 32. 9
10 ***15 10 pts) Suppose you have two normally distributed samples, both of sample size n, as follows. X1, X2,, Xn ~iid Normal( µ=10, σ=1 ) Y1, Y2,, Yn ~iid Normal( µ=50, σ=1 ) Consider performing a two sided t test on X vs Y and a two sided Wilcoxon rank sum test on X vs Y. What is the smallest sample size n such that the t test p value and the Wilcoxon test p value will agree to four decimal places with near certainty? In other words, choose the smallest n such that P( p value t test p value wilcoxon < ) > Include proof that your n is the correct answer. First notice that the means of X and Y are 40 units apart, while their standard deviations are only 1. Thus, with probability all of the values of X and Y will be within 5 standard deviations of the true mean. All values of X will be within 5 to 15, and all of the values of Y will be within 45 to 55. Consequently, by the time we get to n = 4, the p value for the t test will be essentially 0. Consider as an upper bound for probable t test p values the p value from a random sample X = {5, 5, 15, 15} and Y = {45, 45, 55, 55}, p value = If n = 6, X = {5, 5, 5, 15, 15, 15} and Y = {45, 45, 45, 55, 55, 55} yields p value = Better yet, notice that the power = 1 for the t test when n = 3 and alpha = , in other words, the p value for the t test will be 0 to four decimals with probability 1 with n > 3. > power.t.test(n = 3, delta = 40, sd = 1, sig.level = , type = "two.sample", alternative = "two.sided")$power = 1. So if the t test p value goes to 0 very quickly, this question is essentially, what s the smallest n where the p value of the Wilcoxon test is 0 to four decimals. P( p value t test p value wilcoxon < ) > is equivalent to asking P( p value wilcoxon < ) > We ve already observed that samples from X and Y will essentially never overlap. So the Wilcoxon test will always take its smallest possible p value. Thus we solve for the smallest n where p value wilcoxon < In general, the two sided p value will equal 2 / choose(2n, n). n p value Thus, we ve shown 9 is the smallest n such that P( p value t test p value wilcoxon < ) >
11 ***16 5 pts) Using Calculus, show that the maximum likelihood estimator for the proportion of blue M&Ms from a sample of size n with x blue M&Ms observed is equal to x/n, i.e. show that x/n is the maximum likelihood estimator of θ. 1 The likelihood has a single maximum for θ in the range (0, 1). To solve for that maximum, we can take the first derivative of L(θ x) with respect to θ, set it equal to zero, and solve for θ ***17 5 pts) Referring to question 16, when using a beta prior distribution, B(α, β), show that the mean posterior estimator for the proportion of blue M&Ms is equal to x/n only when α = 1 and β = 1. From the Bayes workshop, we know that with a Beta(α, β) prior, we have a posterior f(θ x) = θ ^(α 1 + x) * (1 θ)^(β 1 + n x) / B(α 1 + x, β 1 + n x). This is also a beta distribution, B(α 1 + x, β 1 + n x). The mean of a beta distribution, B(a, b) is a/(a+b). Thus, the mean of the posterior is (α 1 + x) / (α 1 + x+ β 1 + n x) = (x + α 1) / (n + α+β 2). Therefore, the posterior mean will only equal x/n if α = 1 and β = 1. 11
12 *18 3 pts) Referring to question 9, create a 95% confidence interval for the relative risk (RR) for the risk of blue for Milk Chocolate M&Ms versus the risk of blue for Dark M&Ms. Write your answer to 3 decimal places. Rewriting the data in our a, b, c, d table format we have Blue Not Blue Milk a = 86 b = 319 Dark c = 57 d = 360 Relative Risk = ( a*(c+d) ) / ( (a+b) * c ) = ( 86 * (57+360) ) / ( (86+319) * 57 ) = I m tired, so I m going to make R do this. install.packages('epitools') library('epitools') MMs < matrix( c(86,57,319,360), 2, 2) dimnames(mms) < list( "Type" = c("milk", "Dark"), "Color" = c("blue", "NotBlue") ) MMs epitab(mms, method="riskratio", rev='b') ( 1.144, ), or ( 1.14, 2.11 ) *19 3 pts) Referring to question 9, create a 1/8 th support interval for the relative risk (RR) for the risk of blue for Milk Chocolate M&Ms versus the risk of blue for Dark M&Ms. Write your answer to 2 3 decimal places. rr.lik (86, 405, 57, 417) ( 1.14, 2.14 ) *20 3 pts) Referring to question 9, create a 95% credible interval for the relative risk (RR) for the risk of blue for Milk Chocolate M&Ms versus the risk of blue for Dark M&Ms using independent Beta(1, 1) prior distribution for the proportion of blue M&Ms in Milk Chocolate and Dark Chocolate. Write your answer to 2 3 decimal places. posterior1 < rbeta( 3*10^6, , ) posterior2 < rbeta( 3*10^6, , ) posteriorrr < posterior1 / posterior2 round( quantile( posteriorrr, probs=c(0.025, 0.975) ), 3 ) 2.5% 97.5% ( 1.150, ) Note, the last decimal place isn t stable, even with 3 million samples from each posterior. ( 1.15, 2.13 ) is more stable. *21 1 pts) Referring to questions 18 20, how do the intervals for the RR from the three statistical paradigms compare to each other? They are very similar. 12
Name: SOLUTIONS Final Part 1 (In class, solo work, open book and notes)
Name: SOLUTIONS Final Part 1 (In class, solo work, open book and notes) Throughout the exam, show your work and, unless specified otherwise, round all your final answers to 3 decimal places, e.g. 1.0015
More informationBlue Not Blue
Name: SOLUTIONS Final Exam (take home, open everything) One mouse study showed Brilliant Blue G, the food coloring used in blue M&Ms, accelerated healing of spinal injuries. Is the proportion of blue M&Ms
More informationStatistics: revision
NST 1B Experimental Psychology Statistics practical 5 Statistics: revision Rudolf Cardinal & Mike Aitken 29 / 30 April 2004 Department of Experimental Psychology University of Cambridge Handouts: Answers
More information18.05 Practice Final Exam
No calculators. 18.05 Practice Final Exam Number of problems 16 concept questions, 16 problems. Simplifying expressions Unless asked to explicitly, you don t need to simplify complicated expressions. For
More informationSwarthmore Honors Exam 2012: Statistics
Swarthmore Honors Exam 2012: Statistics 1 Swarthmore Honors Exam 2012: Statistics John W. Emerson, Yale University NAME: Instructions: This is a closed-book three-hour exam having six questions. You may
More information18.05 Final Exam. Good luck! Name. No calculators. Number of problems 16 concept questions, 16 problems, 21 pages
Name No calculators. 18.05 Final Exam Number of problems 16 concept questions, 16 problems, 21 pages Extra paper If you need more space we will provide some blank paper. Indicate clearly that your solution
More informationChapter 26: Comparing Counts (Chi Square)
Chapter 6: Comparing Counts (Chi Square) We ve seen that you can turn a qualitative variable into a quantitative one (by counting the number of successes and failures), but that s a compromise it forces
More informationPower and the computation of sample size
9 Power and the computation of sample size A statistical test will not be able to detect a true difference if the sample size is too small compared with the magnitude of the difference. When designing
More informationLab #11. Variable B. Variable A Y a b a+b N c d c+d a+c b+d N = a+b+c+d
BIOS 4120: Introduction to Biostatistics Breheny Lab #11 We will explore observational studies in today s lab and review how to make inferences on contingency tables. We will only use 2x2 tables for today
More informationE509A: Principle of Biostatistics. GY Zou
E509A: Principle of Biostatistics (Week 4: Inference for a single mean ) GY Zou gzou@srobarts.ca Example 5.4. (p. 183). A random sample of n =16, Mean I.Q is 106 with standard deviation S =12.4. What
More informationMultiple Regression Analysis
Multiple Regression Analysis y = β 0 + β 1 x 1 + β 2 x 2 +... β k x k + u 2. Inference 0 Assumptions of the Classical Linear Model (CLM)! So far, we know: 1. The mean and variance of the OLS estimators
More informationExam 2 Practice Questions, 18.05, Spring 2014
Exam 2 Practice Questions, 18.05, Spring 2014 Note: This is a set of practice problems for exam 2. The actual exam will be much shorter. Within each section we ve arranged the problems roughly in order
More informationTA: Sheng Zhgang (Th 1:20) / 342 (W 1:20) / 343 (W 2:25) / 344 (W 12:05) Haoyang Fan (W 1:20) / 346 (Th 12:05) FINAL EXAM
STAT 301, Fall 2011 Name Lec 4: Ismor Fischer Discussion Section: Please circle one! TA: Sheng Zhgang... 341 (Th 1:20) / 342 (W 1:20) / 343 (W 2:25) / 344 (W 12:05) Haoyang Fan... 345 (W 1:20) / 346 (Th
More informationAlgebra Exam. Solutions and Grading Guide
Algebra Exam Solutions and Grading Guide You should use this grading guide to carefully grade your own exam, trying to be as objective as possible about what score the TAs would give your responses. Full
More informationStat 135, Fall 2006 A. Adhikari HOMEWORK 6 SOLUTIONS
Stat 135, Fall 2006 A. Adhikari HOMEWORK 6 SOLUTIONS 1a. Under the null hypothesis X has the binomial (100,.5) distribution with E(X) = 50 and SE(X) = 5. So P ( X 50 > 10) is (approximately) two tails
More informationSummary of Chapters 7-9
Summary of Chapters 7-9 Chapter 7. Interval Estimation 7.2. Confidence Intervals for Difference of Two Means Let X 1,, X n and Y 1, Y 2,, Y m be two independent random samples of sizes n and m from two
More informationCategorical Data Analysis. The data are often just counts of how many things each category has.
Categorical Data Analysis So far we ve been looking at continuous data arranged into one or two groups, where each group has more than one observation. E.g., a series of measurements on one or two things.
More informationSampling Distributions: Central Limit Theorem
Review for Exam 2 Sampling Distributions: Central Limit Theorem Conceptually, we can break up the theorem into three parts: 1. The mean (µ M ) of a population of sample means (M) is equal to the mean (µ)
More informationName: SOLUTIONS Exam 01 (Midterm Part 2 take home, open everything)
Name: SOLUTIONS Exam 01 (Midterm Part 2 take home, open everything) To help you budget your time, questions are marked with *s. One * indicates a straightforward question testing foundational knowledge.
More informationT-tests for 2 Independent Means
T-tests for 2 Independent Means February 22, 208 Contents t-test for 2 independent means Tutorial Example : one-tailed test for independent means, equal sample sizes Error Bars Example 2: two-tailed test
More informationT-tests for 2 Independent Means
T-tests for 2 Independent Means January 15, 2019 Contents t-test for 2 independent means Tutorial Example 1: one-tailed test for independent means, equal sample sizes Error Bars Example 2: two-tailed test
More informationSTAT 512 MidTerm I (2/21/2013) Spring 2013 INSTRUCTIONS
STAT 512 MidTerm I (2/21/2013) Spring 2013 Name: Key INSTRUCTIONS 1. This exam is open book/open notes. All papers (but no electronic devices except for calculators) are allowed. 2. There are 5 pages in
More informationPower of a hypothesis test
Power of a hypothesis test Scenario #1 Scenario #2 H 0 is true H 0 is not true test rejects H 0 type I error test rejects H 0 OK test does not reject H 0 OK test does not reject H 0 type II error Power
More informationBayesian Inference for Normal Mean
Al Nosedal. University of Toronto. November 18, 2015 Likelihood of Single Observation The conditional observation distribution of y µ is Normal with mean µ and variance σ 2, which is known. Its density
More informationContingency Tables. Safety equipment in use Fatal Non-fatal Total. None 1, , ,128 Seat belt , ,878
Contingency Tables I. Definition & Examples. A) Contingency tables are tables where we are looking at two (or more - but we won t cover three or more way tables, it s way too complicated) factors, each
More informationMath 5a Reading Assignments for Sections
Math 5a Reading Assignments for Sections 4.1 4.5 Due Dates for Reading Assignments Note: There will be a very short online reading quiz (WebWork) on each reading assignment due one hour before class on
More informationTests for the Odds Ratio in Logistic Regression with One Binary X (Wald Test)
Chapter 861 Tests for the Odds Ratio in Logistic Regression with One Binary X (Wald Test) Introduction Logistic regression expresses the relationship between a binary response variable and one or more
More informationFundamental Statistical Concepts and Methods Needed in a Test-and-Evaluator s Toolkit. Air Academy Associates
Fundamental Statistical Concepts and Methods Needed in a Test-and-Evaluator s Toolkit Mark Kiemele Air Academy Associates mkiemele@airacad.com ITEA 010 Symposium Glendale, AZ 13 September 010 INTRODUCTIONS
More informationConfidence intervals
Confidence intervals We now want to take what we ve learned about sampling distributions and standard errors and construct confidence intervals. What are confidence intervals? Simply an interval for which
More informationChapter 4 HOMEWORK ASSIGNMENTS. 4.1 Homework #1
Chapter 4 HOMEWORK ASSIGNMENTS These homeworks may be modified as the semester progresses. It is your responsibility to keep up to date with the correctly assigned homeworks. There may be some errors in
More informationMy data doesn t look like that..
Testing assumptions My data doesn t look like that.. We have made a big deal about testing model assumptions each week. Bill Pine Testing assumptions Testing assumptions We have made a big deal about testing
More informationmu_(x Y) = E[ X Y ] by definition. E[ X Y ] = E[ X ] E[ Y ], regardless of the independence of X & Y. = mu_x mu_y by definition.
Your Full Name: SOLUTIONS Bios 311 Exam 2, final exam part 1 (100 pts) This quiz is open book, notes, and calculator; and closed laptop, cellphone, classmates, etc. Where necessary, work your solution
More informationOne sided tests. An example of a two sided alternative is what we ve been using for our two sample tests:
One sided tests So far all of our tests have been two sided. While this may be a bit easier to understand, this is often not the best way to do a hypothesis test. One simple thing that we can do to get
More informationBusiness Statistics 41000: Homework # 5
Business Statistics 41000: Homework # 5 Drew Creal Due date: Beginning of class in week # 10 Remarks: These questions cover Lectures #7, 8, and 9. Question # 1. Condence intervals and plug-in predictive
More informationAppendix A. Review of Basic Mathematical Operations. 22Introduction
Appendix A Review of Basic Mathematical Operations I never did very well in math I could never seem to persuade the teacher that I hadn t meant my answers literally. Introduction Calvin Trillin Many of
More informationSolution E[sum of all eleven dice] = E[sum of ten d20] + E[one d6] = 10 * E[one d20] + E[one d6]
Name: SOLUTIONS Midterm (take home version) To help you budget your time, questions are marked with *s. One * indicates a straight forward question testing foundational knowledge. Two ** indicate a more
More informationPower. January 12, 2019
Power January 12, 2019 Contents Definition of power Z-test example If H 0 is true If H 0 is false The 2x2 matrix of All Things That Can Happen Pr(Type I error) = α Pr(Type II error) = β power = 1 β Things
More informationHypothesis testing I. - In particular, we are talking about statistical hypotheses. [get everyone s finger length!] n =
Hypothesis testing I I. What is hypothesis testing? [Note we re temporarily bouncing around in the book a lot! Things will settle down again in a week or so] - Exactly what it says. We develop a hypothesis,
More informationHi, my name is Dr. Ann Weaver of Argosy University. This WebEx is about something in statistics called z-
Hi, my name is Dr. Ann Weaver of Argosy University. This WebEx is about something in statistics called z- Scores. I have two purposes for this WebEx, one, I just want to show you how to use z-scores in
More informationAdditional Problems Additional Problem 1 Like the http://www.stat.umn.edu/geyer/5102/examp/rlike.html#lmax example of maximum likelihood done by computer except instead of the gamma shape model, we will
More informationBasic Math Problems Unit 1
Basic Math Problems Unit 1 Name Period Using fractions: When you are using fractions in science, we need to convert them into decimals. You can do this by dividing the top number by the bottom number.
More informationGov Univariate Inference II: Interval Estimation and Testing
Gov 2000-5. Univariate Inference II: Interval Estimation and Testing Matthew Blackwell October 13, 2015 1 / 68 Large Sample Confidence Intervals Confidence Intervals Example Hypothesis Tests Hypothesis
More informationGov 2000: 6. Hypothesis Testing
Gov 2000: 6. Hypothesis Testing Matthew Blackwell October 11, 2016 1 / 55 1. Hypothesis Testing Examples 2. Hypothesis Test Nomenclature 3. Conducting Hypothesis Tests 4. p-values 5. Power Analyses 6.
More informationMachine Learning, Midterm Exam: Spring 2008 SOLUTIONS. Q Topic Max. Score Score. 1 Short answer questions 20.
10-601 Machine Learning, Midterm Exam: Spring 2008 Please put your name on this cover sheet If you need more room to work out your answer to a question, use the back of the page and clearly mark on the
More informationLecture 10: Generalized likelihood ratio test
Stat 200: Introduction to Statistical Inference Autumn 2018/19 Lecture 10: Generalized likelihood ratio test Lecturer: Art B. Owen October 25 Disclaimer: These notes have not been subjected to the usual
More informationChapter 7 Comparison of two independent samples
Chapter 7 Comparison of two independent samples 7.1 Introduction Population 1 µ σ 1 1 N 1 Sample 1 y s 1 1 n 1 Population µ σ N Sample y s n 1, : population means 1, : population standard deviations N
More informationMock Exam - 2 hours - use of basic (non-programmable) calculator is allowed - all exercises carry the same marks - exam is strictly individual
Mock Exam - 2 hours - use of basic (non-programmable) calculator is allowed - all exercises carry the same marks - exam is strictly individual Question 1. Suppose you want to estimate the percentage of
More informationRegression, part II. I. What does it all mean? A) Notice that so far all we ve done is math.
Regression, part II I. What does it all mean? A) Notice that so far all we ve done is math. 1) One can calculate the Least Squares Regression Line for anything, regardless of any assumptions. 2) But, if
More informationBayesian Regression (1/31/13)
STA613/CBB540: Statistical methods in computational biology Bayesian Regression (1/31/13) Lecturer: Barbara Engelhardt Scribe: Amanda Lea 1 Bayesian Paradigm Bayesian methods ask: given that I have observed
More informationBackground to Statistics
FACT SHEET Background to Statistics Introduction Statistics include a broad range of methods for manipulating, presenting and interpreting data. Professional scientists of all kinds need to be proficient
More informationChapter 24. Comparing Means. Copyright 2010 Pearson Education, Inc.
Chapter 24 Comparing Means Copyright 2010 Pearson Education, Inc. Plot the Data The natural display for comparing two groups is boxplots of the data for the two groups, placed side-by-side. For example:
More informationStat 5102 Final Exam May 14, 2015
Stat 5102 Final Exam May 14, 2015 Name Student ID The exam is closed book and closed notes. You may use three 8 1 11 2 sheets of paper with formulas, etc. You may also use the handouts on brand name distributions
More informationStatistical Modeling and Analysis of Scientific Inquiry: The Basics of Hypothesis Testing
Statistical Modeling and Analysis of Scientific Inquiry: The Basics of Hypothesis Testing So, What is Statistics? Theory and techniques for learning from data How to collect How to analyze How to interpret
More informationSTATS 200: Introduction to Statistical Inference. Lecture 29: Course review
STATS 200: Introduction to Statistical Inference Lecture 29: Course review Course review We started in Lecture 1 with a fundamental assumption: Data is a realization of a random process. The goal throughout
More informationLecture 7: The χ 2 and t distributions, frequentist confidence intervals, the Neyman-Pearson paradigm, and introductory frequentist hypothesis testing
Lecture 7: The χ 2 and t distributions, frequentist confidence intervals, the Neyman-Pearson paradigm, and introductory frequentist hypothesis testing 17 October 2007 In this lecture we ll learn the following:
More informationPart Possible Score Base 5 5 MC Total 50
Stat 220 Final Exam December 16, 2004 Schafer NAME: ANDREW ID: Read This First: You have three hours to work on the exam. The other questions require you to work out answers to the questions; be sure to
More informationDescriptive Statistics-I. Dr Mahmoud Alhussami
Descriptive Statistics-I Dr Mahmoud Alhussami Biostatistics What is the biostatistics? A branch of applied math. that deals with collecting, organizing and interpreting data using well-defined procedures.
More informationChapter 7. Practice Exam Questions and Solutions for Final Exam, Spring 2009 Statistics 301, Professor Wardrop
Practice Exam Questions and Solutions for Final Exam, Spring 2009 Statistics 301, Professor Wardrop Chapter 6 1. A random sample of size n = 452 yields 113 successes. Calculate the 95% confidence interval
More informationTwo sample Hypothesis tests in R.
Example. (Dependent samples) Two sample Hypothesis tests in R. A Calculus professor gives their students a 10 question algebra pretest on the first day of class, and a similar test towards the end of the
More informationThe t-distribution. Patrick Breheny. October 13. z tests The χ 2 -distribution The t-distribution Summary
Patrick Breheny October 13 Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/25 Introduction Introduction What s wrong with z-tests? So far we ve (thoroughly!) discussed how to carry out hypothesis
More informationMotion in 1 Dimension. By Prof. Massimiliano Galeazzi, University of Miami
Motion in 1 Dimension By Prof. Massimiliano Galeazzi, University of Miami When you throw a pebble straight up, how high does it go? How fast is it when it gets back? If you are in your car at a red light
More informationOn Assumptions. On Assumptions
On Assumptions An overview Normality Independence Detection Stem-and-leaf plot Study design Normal scores plot Correction Transformation More complex models Nonparametric procedure e.g. time series Robustness
More informationWritten Exam (2 hours)
M. Müller Applied Analysis of Variance and Experimental Design Summer 2015 Written Exam (2 hours) General remarks: Open book exam. Switch off your mobile phone! Do not stay too long on a part where you
More informationSAMPLE CHAPTER. Avi Pfeffer. FOREWORD BY Stuart Russell MANNING
SAMPLE CHAPTER Avi Pfeffer FOREWORD BY Stuart Russell MANNING Practical Probabilistic Programming by Avi Pfeffer Chapter 9 Copyright 2016 Manning Publications brief contents PART 1 INTRODUCING PROBABILISTIC
More information1 A simple example. A short introduction to Bayesian statistics, part I Math 217 Probability and Statistics Prof. D.
probabilities, we ll use Bayes formula. We can easily compute the reverse probabilities A short introduction to Bayesian statistics, part I Math 17 Probability and Statistics Prof. D. Joyce, Fall 014 I
More informationGeneral Certificate of Education Advanced Level Examination June 2013
General Certificate of Education Advanced Level Examination June 2013 Human Biology HBI6T/P13/task Unit 6T A2 Investigative Skills Assignment Task Sheet Investigation into whether reaction time is different
More informationexp{ (x i) 2 i=1 n i=1 (x i a) 2 (x i ) 2 = exp{ i=1 n i=1 n 2ax i a 2 i=1
4 Hypothesis testing 4. Simple hypotheses A computer tries to distinguish between two sources of signals. Both sources emit independent signals with normally distributed intensity, the signals of the first
More informationDETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics
DETAILED CONTENTS About the Author Preface to the Instructor To the Student How to Use SPSS With This Book PART I INTRODUCTION AND DESCRIPTIVE STATISTICS 1. Introduction to Statistics 1.1 Descriptive and
More informationPurposes of Data Analysis. Variables and Samples. Parameters and Statistics. Part 1: Probability Distributions
Part 1: Probability Distributions Purposes of Data Analysis True Distributions or Relationships in the Earths System Probability Distribution Normal Distribution Student-t Distribution Chi Square Distribution
More informationCh. 7. One sample hypothesis tests for µ and σ
Ch. 7. One sample hypothesis tests for µ and σ Prof. Tesler Math 18 Winter 2019 Prof. Tesler Ch. 7: One sample hypoth. tests for µ, σ Math 18 / Winter 2019 1 / 23 Introduction Data Consider the SAT math
More informationPolitical Science 236 Hypothesis Testing: Review and Bootstrapping
Political Science 236 Hypothesis Testing: Review and Bootstrapping Rocío Titiunik Fall 2007 1 Hypothesis Testing Definition 1.1 Hypothesis. A hypothesis is a statement about a population parameter The
More informationHYPOTHESIS TESTING II TESTS ON MEANS. Sorana D. Bolboacă
HYPOTHESIS TESTING II TESTS ON MEANS Sorana D. Bolboacă OBJECTIVES Significance value vs p value Parametric vs non parametric tests Tests on means: 1 Dec 14 2 SIGNIFICANCE LEVEL VS. p VALUE Materials and
More informationGrade 2 MATHEMATICS WINTER STUDENT ENRICHMENT PACKET
Grade 2 MATHEMATICS WINTER STUDENT ENRICHMENT PACKET Detroit Public Schools Community District Department of Curriculum and Instruction Office of Mathematics Directions and Notes for Students and Parents
More informationResampling Methods. Lukas Meier
Resampling Methods Lukas Meier 20.01.2014 Introduction: Example Hail prevention (early 80s) Is a vaccination of clouds really reducing total energy? Data: Hail energy for n clouds (via radar image) Y i
More informationLast few slides from last time
Last few slides from last time Example 3: What is the probability that p will fall in a certain range, given p? Flip a coin 50 times. If the coin is fair (p=0.5), what is the probability of getting an
More information(1) Introduction to Bayesian statistics
Spring, 2018 A motivating example Student 1 will write down a number and then flip a coin If the flip is heads, they will honestly tell student 2 if the number is even or odd If the flip is tails, they
More informationComplete Week 18 Package
Complete Week 18 Package Jeanette Stein Table of Contents Unit 4 Pacing Chart -------------------------------------------------------------------------------------------- 1 Day 86 Bellringer --------------------------------------------------------------------------------------------
More informationMath 31 Lesson Plan. Day 2: Sets; Binary Operations. Elizabeth Gillaspy. September 23, 2011
Math 31 Lesson Plan Day 2: Sets; Binary Operations Elizabeth Gillaspy September 23, 2011 Supplies needed: 30 worksheets. Scratch paper? Sign in sheet Goals for myself: Tell them what you re going to tell
More informationConfidence intervals CE 311S
CE 311S PREVIEW OF STATISTICS The first part of the class was about probability. P(H) = 0.5 P(T) = 0.5 HTTHHTTTTHHTHTHH If we know how a random process works, what will we see in the field? Preview of
More informationOne-sample categorical data: approximate inference
One-sample categorical data: approximate inference Patrick Breheny October 6 Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/25 Introduction It is relatively easy to think about the distribution
More informationPsych 10 / Stats 60, Practice Problem Set 5 (Week 5 Material) Part 1: Power (and building blocks of power)
Psych 10 / Stats 60, Practice Problem Set 5 (Week 5 Material) Part 1: Power (and building blocks of power) 1. A researcher plans to do a two-tailed hypothesis test with a sample of n = 100 people and a
More informationPower and nonparametric methods Basic statistics for experimental researchersrs 2017
Faculty of Health Sciences Outline Power and nonparametric methods Basic statistics for experimental researchersrs 2017 Statistical power Julie Lyng Forman Department of Biostatistics, University of Copenhagen
More informationPARAMETER ESTIMATION: BAYESIAN APPROACH. These notes summarize the lectures on Bayesian parameter estimation.
PARAMETER ESTIMATION: BAYESIAN APPROACH. These notes summarize the lectures on Bayesian parameter estimation.. Beta Distribution We ll start by learning about the Beta distribution, since we end up using
More informationSoc 3811 Basic Social Statistics Second Midterm Exam Spring Your Name [50 points]: ID #: ANSWERS
Soc 3811 Basic Social Statistics Second idterm Exam Spring 010 our Name [50 points]: ID #: INSTRUCTIONS: ANSERS (A) rite your name on the line at top front of every sheet. (B) If you use a page of notes
More informationProbability and Statistics qualifying exam, May 2015
Probability and Statistics qualifying exam, May 2015 Name: Instructions: 1. The exam is divided into 3 sections: Linear Models, Mathematical Statistics and Probability. You must pass each section to pass
More informationHYPOTHESIS TESTING: FREQUENTIST APPROACH.
HYPOTHESIS TESTING: FREQUENTIST APPROACH. These notes summarize the lectures on (the frequentist approach to) hypothesis testing. You should be familiar with the standard hypothesis testing from previous
More informationGalileo Educator Network
Galileo Educator Network D1.3 Moons of Jupiter (1 hour and 45 minutes + 15 minute Break) 1. Observing Jupiter s Moons (15 minutes) Explain how Galileo used the telescope to learn more about objects in
More informationANOVA - analysis of variance - used to compare the means of several populations.
12.1 One-Way Analysis of Variance ANOVA - analysis of variance - used to compare the means of several populations. Assumptions for One-Way ANOVA: 1. Independent samples are taken using a randomized design.
More informationData Analysis and Statistical Methods Statistics 651
Data Analysis and Statistical Methods Statistics 65 http://www.stat.tamu.edu/~suhasini/teaching.html Suhasini Subba Rao Review In the previous lecture we considered the following tests: The independent
More informationStat 135 Fall 2013 FINAL EXAM December 18, 2013
Stat 135 Fall 2013 FINAL EXAM December 18, 2013 Name: Person on right SID: Person on left There will be one, double sided, handwritten, 8.5in x 11in page of notes allowed during the exam. The exam is closed
More informationContents 1. Contents
Contents 1 Contents 1 One-Sample Methods 3 1.1 Parametric Methods.................... 4 1.1.1 One-sample Z-test (see Chapter 0.3.1)...... 4 1.1.2 One-sample t-test................. 6 1.1.3 Large sample
More information(4) One-parameter models - Beta/binomial. ST440/550: Applied Bayesian Statistics
Estimating a proportion using the beta/binomial model A fundamental task in statistics is to estimate a proportion using a series of trials: What is the success probability of a new cancer treatment? What
More informationBayesian Updating with Continuous Priors Class 13, Jeremy Orloff and Jonathan Bloom
Bayesian Updating with Continuous Priors Class 3, 8.05 Jeremy Orloff and Jonathan Bloom Learning Goals. Understand a parameterized family of distributions as representing a continuous range of hypotheses
More informationINFINITE SUMS. In this chapter, let s take that power to infinity! And it will be equally natural and straightforward.
EXPLODING DOTS CHAPTER 7 INFINITE SUMS In the previous chapter we played with the machine and saw the power of that machine to make advanced school algebra so natural and straightforward. In this chapter,
More informationBEGINNING BAYES IN R. Bayes with discrete models
BEGINNING BAYES IN R Bayes with discrete models Beginning Bayes in R Survey on eating out What is your favorite day for eating out? Construct a prior for p Define p: proportion of all students who answer
More informationAnalysis of Categorical Data. Nick Jackson University of Southern California Department of Psychology 10/11/2013
Analysis of Categorical Data Nick Jackson University of Southern California Department of Psychology 10/11/2013 1 Overview Data Types Contingency Tables Logit Models Binomial Ordinal Nominal 2 Things not
More informationSample Size. Vorasith Sornsrivichai, MD., FETP Epidemiology Unit, Faculty of Medicine Prince of Songkla University
Sample Size Vorasith Sornsrivichai, MD., FETP Epidemiology Unit, Faculty of Medicine Prince of Songkla University All nature is but art, unknown to thee; All chance, direction, which thou canst not see;
More information1.1 The Language of Mathematics Expressions versus Sentences
The Language of Mathematics Expressions versus Sentences a hypothetical situation the importance of language Study Strategies for Students of Mathematics characteristics of the language of mathematics
More information6-6 Solving Systems of Linear Inequalities 6-6. Solving Systems of Linear Inequalities
6-6 Solving Systems of Linear Inequalities Warm Up Lesson Presentation Lesson Quiz 1 2 pts 3 pts 5 pts Bell Quiz 6-6 Solve each inequality for y. 1. 8x + y < 6 2. 3x 2y > 10 3. Graph the solutions of 4x
More information" M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2
Notation and Equations for Final Exam Symbol Definition X The variable we measure in a scientific study n The size of the sample N The size of the population M The mean of the sample µ The mean of the
More information