= X+(z.95)SE(X) X-(z.95)SE(X)
|
|
- Gervase Porter
- 6 years ago
- Views:
Transcription
1 BIOSTATS 540 Fall 05 Introductory Biostatistics Page of 6 Unit 6 Estimation Homework #0 (Unit 6 Estimation, part of ) SOLUTIONS. Alzheimers disease has a poorer prognosis when it is diagnosed at a relatively young age. Suppose we want to estimate the age at which the disease was first diagnosed using a 90% confidence interval. Under the assumption that the distribution of age at diagnosis is normal, if the population variance is 85, how large a sample size is required if we want a confidence interval that is 0 years wide? Answer: n0 Recall: The 90% confidence interval is given by ( ) X± z SE(X), à Confidence interval width [upper limit of CI] - [lower limit of CI] X+(z.95)SE(X) X-(z.95)SE(X) X+(z )SE(X) - X + (z )SE(X) ()(z.95)se(x) ()(z.95) n.95 Setting confidence interval width on the left hand side to width 0 allows us to write 0 ()(z.95) n à 5 (z.95) n à (z.95) n 5 à (z.95) n 5 n à (.645) (85) Rounding up yields the required sample size of 0. sol_estimation_.doc
2 BIOSTATS 540 Fall 05 Introductory Biostatistics Page of 6. The National Health and Nutrition Examination Survey of give the following data on serum cholesterol levels in US males. Group Age, years Population Mean, µ Population Standard Deviation, Suppose the distribution of serum cholesterol is normal in each age group. If you draw simple random samples of size 50 from each of the two groups, what is the probability that the difference between the two sample means (Group mean Group mean) will be more than 5? Answer:.57 On page 3 of the lecture notes, at the bottom of the page, we learn that (X-X ) is distributed Normal[ (µ -µ), ( + )] n n Let X Average among age 0-4. It is distributed Normal(µ 80, X Average among age It is distributed Normal(µ 99, 43 ) X n X n 50 ) Thus, Y(X X ). is distributed Normal with µ (µ -µ ) 9 Y X X Y + 85 n n Now we use the z-score method that we learned in Unit 5, Normal Distribution, and in particular the z-score standardization that is found on page 9 under (3), we have that Probability group mean group mean will be more than 5 Pr[Y > 5] Y-µ Y 5-9 Pr > Y 9.95 Pr[Normal(0,) > 0.65].57 sol_estimation_.doc
3 BIOSTATS 540 Fall 05 Introductory Biostatistics Page 3 of 6 3 The objectives of a study by Kennedy and Bhambhani (99) were to use physiological measurements to determine the test-retest reliability of the Baltimore Therapeutic Equipment Work Simulator during three simulated tasks performed at light, medium, and heavy work intensities, and to examine the criterion validity of these tasks by comparing them to real tasks performed in a controlled laboratory setting. Subjects were 30 healthy men between the ages of 8 and 35. The investigators reported a standard deviation of s0.57 for the variable peak oxygen consumption (/min) during one of the procedures. Assuming normality, compute a 95% confidence interval for the population variance for the oxygen consumption variable. Answer: (.,.59) (n-) 9 S 0.57 χ χ α/.975; DF9 χ α/ χ.05; DF Lower limit Upper limit (n-)s χ - α/;df(n-) (n-)s χ α /;df(n-) (9)(0.57 ) (9)(0.57 ) The purpose of an investigation by Alahuhta et al (99) was to evaluate the influence of extradural block for elective caesarian section simultaneously on several maternal and fetal hemodynamic variables and to determine if the block modified fetal myocardial function. The study subjects were eight healthy parturient in gestational weeks 38-4 with uncomplicated singleton pregnancies undergoing elective caesarian section under extradural anesthesia. Among the measurements taken, were maternal diastolic arterial pressure during two stages of the study. The following are the lowest values of this variable at the two stages. Compute a 95% confidence interval for the difference in diastolic blood pressure between the two stages. Patient ID Stage Stage sol_estimation_.doc
4 BIOSTATS 540 Fall 05 Introductory Biostatistics Page 4 of 6 Answer: (-0.06, +6.6) Because two measurements are made on each patient, at stages and, these data fit the definition of paired. The analysis focuses on the differences, per the table below: Patient ID, i d i Stage The actual calculations required to complete the solution are similar to the example on pp 4-44 of the unit 6 notes. For this exercise, we have d3.5 Sd Sd d S SE(d).3983 n 8 df(n-)7 t- α /;df t.975; % CI for µ d ± (t ) SE(d) d.975;df7 (3.5) ± (.365)(.3983) (-0.057, 6.557) 5. A possible environmental determinant of lung function in children is the amount of cigarette smoking in the home. To study this question, two groups of children were studied. Group consisted of 3 nonsmoking children aged 5-9 both of whose parents smoke in the home. Group consisted of 0 nonsmoking children aged 5-9 neither of whose parents smoke. The sample mean (sample SD) of FEV for group is. L (0.7) and for the Group children, the sample mean (sample SD) of FEV is.3 L (0.4). Under the assumption of normality, construct a 95% confidence interval for the ratio of the variance of the two groups. What is your conclusion regarding the reasonableness of the assumption of equality of variances? Answer: (.4, 7.37) Use the F-distribution calculator from the Daniel Soper site: sol_estimation_.doc
5 BIOSTATS 540 Fall 05 Introductory Biostatistics Page 5 of 6 Remember this calculator provides RIGHT tail areas only. Therefore, the.5 th and 97.5 th percentiles are obtained as follows when using this site. S 0.7 S 0.4 (n ) (3 ) (n ) (0 ) 9 Solution for.5 th percentile of F-distribution Numerator df and denominator df9 Solution for 97.5 th percentile of F-distn Numerator df and denominator df9 The area to the RIGHT of the.5 th percentile is à The area to the RIGHT of the 97.5 th percentile is à Fn,n ; α / F,9; Fn,n ; α / F,9; S 0.7 Lower limit.357 F n ;n ;( α /) S S 0.7 Upper limit F n ;n ; S / α Since the confidence interval has lower limit equal to.357, a number that is above, these data are not consistent with the assumption of equal variances. (Logic if the variances are equal, then their ratio is equal to. It then follows that, if the confidence interval for the ratio does not include, then the data are not consistent with the assumption of equal variances). sol_estimation_.doc
6 BIOSTATS 540 Fall 05 Introductory Biostatistics Page 6 of 6 6. For the same data in problem #5 and drawing upon your answer to #5 (regarding the reasonableness of equality of variances), compute a 95% confidence interval for the true mean difference in FEV between 5-9 year old children whose parents smoke and comparable children whose parents do not smoke. Answer: (-0.55, +0.5) This solution assumes that the variances are Unequal because the confidence interval obtained for #9 does not include a ratio of variances value. Therefore, the correct standard error formula to use is Solution 3 on page 48 of the notes. X X S S SEˆ X X n n 3 0 S S n n f by rounding DOWN S S n n n 9 n t t.03 -α/;f.975;df35 ( ) ( ) ˆ.975;DF35 ( ) 95%CI X -X ± t SE X -X (-0.)±(.03)(0.7) ( , ) sol_estimation_.doc
Unit 9 ONE Sample Inference SOLUTIONS
BIOSTATS 540 Fall 017 Introductory Biostatistics Solutions Unit 9 ONE Sample Inference SOLUTIONS 1. This exercise gives you practice calculating a confidence interval for the mean of a Normal distribution
More informationUnit 6 Estimation Week #10 - Practice Problems SOLUTIONS
PubHlth 540 Itroductory Biostatistics Page of 7 Uit 6 Estimatio Week #0 - Practice Problems SOLUTIONS. A etomologist samples a field for egg masses of a harmful isect by placig a yardsquare frame at radom
More informationUnit 1 Review of BIOSTATS 540 Practice Problems SOLUTIONS - Stata Users
BIOSTATS 640 Spring 2017 Review of Introductory Biostatistics STATA solutions Page 1 of 16 Unit 1 Review of BIOSTATS 540 Practice Problems SOLUTIONS - Stata Users #1. The following table lists length of
More informationBE540W Estimation Page 1 of 72. Topic 6 Estimation
BE540W Estimation Page 1 of 7 Topic 6 Estimation Topics 1. Introduction......... Goals of Estimation.... 3. Some notation and definitions,..... 4. How to Interpret a Confidence Interval..... 5. Normal:
More informationPubHlth 540 Estimation Page 1 of 69. Unit 6 Estimation
PubHlth 540 Estimation Page 1 of 69 Unit 6 Estimation Topic 1. Introduction........ a. Goals of Estimation. b. Notation and Definitions. c. How to Interpret a Confidence Interval. Preliminaries: Some Useful
More information7 Estimation. 7.1 Population and Sample (P.91-92)
7 Estimation MATH1015 Biostatistics Week 7 7.1 Population and Sample (P.91-92) Suppose that we wish to study a particular health problem in Australia, for example, the average serum cholesterol level for
More informationBIO5312 Biostatistics Lecture 6: Statistical hypothesis testings
BIO5312 Biostatistics Lecture 6: Statistical hypothesis testings Yujin Chung October 4th, 2016 Fall 2016 Yujin Chung Lec6: Statistical hypothesis testings Fall 2016 1/30 Previous Two types of statistical
More informationMAT 2379, Introduction to Biostatistics, Sample Calculator Questions 1. MAT 2379, Introduction to Biostatistics
MAT 2379, Introduction to Biostatistics, Sample Calculator Questions 1 MAT 2379, Introduction to Biostatistics Sample Calculator Problems for the Final Exam Note: The exam will also contain some problems
More informationPHP2510: Principles of Biostatistics & Data Analysis. Lecture X: Hypothesis testing. PHP 2510 Lec 10: Hypothesis testing 1
PHP2510: Principles of Biostatistics & Data Analysis Lecture X: Hypothesis testing PHP 2510 Lec 10: Hypothesis testing 1 In previous lectures we have encountered problems of estimating an unknown population
More informationCentral Limit Theorem Confidence Intervals Worked example #6. July 24, 2017
Central Limit Theorem Confidence Intervals Worked example #6 July 24, 2017 10 8 Raw scores 6 4 Mean=71.4% 2 0 0-10 10-20 20-30 30-40 40-50 50-60 60-70 70-80 80-90 90+ Scaling is to add 3.6% to bring mean
More informationInference for Distributions Inference for the Mean of a Population
Inference for Distributions Inference for the Mean of a Population PBS Chapter 7.1 009 W.H Freeman and Company Objectives (PBS Chapter 7.1) Inference for the mean of a population The t distributions The
More informationSection 9.4. Notation. Requirements. Definition. Inferences About Two Means (Matched Pairs) Examples
Objective Section 9.4 Inferences About Two Means (Matched Pairs) Compare of two matched-paired means using two samples from each population. Hypothesis Tests and Confidence Intervals of two dependent means
More informationBIOSTATISTICS NURS 3324
Simple Linear Regression and Correlation Introduction Previously, our attention has been focused on one variable which we designated by x. Frequently, it is desirable to learn something about the relationship
More informationBINF 702 SPRING Chapter 8 Hypothesis Testing: Two-Sample Inference. BINF702 SPRING 2014 Chapter 8 Hypothesis Testing: Two- Sample Inference 1
BINF 702 SPRING 2014 Chapter 8 Hypothesis Testing: Two-Sample Inference Two- Sample Inference 1 A Poster Child for two-sample hypothesis testing Ex 8.1 Obstetrics In the birthweight data in Example 7.2,
More informationPubH 7470: STATISTICS FOR TRANSLATIONAL & CLINICAL RESEARCH
PubH 7470: STATISTICS FOR TRANSLATIONAL & CLINICAL RESEARCH The First Step: SAMPLE SIZE DETERMINATION THE ULTIMATE GOAL The most important, ultimate step of any of clinical research is to do draw inferences;
More informationCOMPARING GROUPS PART 1CONTINUOUS DATA
COMPARING GROUPS PART 1CONTINUOUS DATA Min Chen, Ph.D. Assistant Professor Quantitative Biomedical Research Center Department of Clinical Sciences Bioinformatics Shared Resource Simmons Comprehensive Cancer
More informationOnline supplement. Absolute Value of Lung Function (FEV 1 or FVC) Explains the Sex Difference in. Breathlessness in the General Population
Online supplement Absolute Value of Lung Function (FEV 1 or FVC) Explains the Sex Difference in Breathlessness in the General Population Table S1. Comparison between patients who were excluded or included
More informationOne-sample categorical data: approximate inference
One-sample categorical data: approximate inference Patrick Breheny October 6 Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/25 Introduction It is relatively easy to think about the distribution
More informationPubH 5450 Biostatistics I Prof. Carlin. Lecture 13
PubH 5450 Biostatistics I Prof. Carlin Lecture 13 Outline Outline Sample Size Counts, Rates and Proportions Part I Sample Size Type I Error and Power Type I error rate: probability of rejecting the null
More informationThe t-distribution. Patrick Breheny. October 13. z tests The χ 2 -distribution The t-distribution Summary
Patrick Breheny October 13 Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/25 Introduction Introduction What s wrong with z-tests? So far we ve (thoroughly!) discussed how to carry out hypothesis
More informationSampling Distributions
Sampling Error As you may remember from the first lecture, samples provide incomplete information about the population In particular, a statistic (e.g., M, s) computed on any particular sample drawn from
More informationT-Test QUESTION T-TEST GROUPS = sex(1 2) /MISSING = ANALYSIS /VARIABLES = quiz1 quiz2 quiz3 quiz4 quiz5 final total /CRITERIA = CI(.95).
QUESTION 11.1 GROUPS = sex(1 2) /MISSING = ANALYSIS /VARIABLES = quiz2 quiz3 quiz4 quiz5 final total /CRITERIA = CI(.95). Group Statistics quiz2 quiz3 quiz4 quiz5 final total sex N Mean Std. Deviation
More informationIdentify the scale of measurement most appropriate for each of the following variables. (Use A = nominal, B = ordinal, C = interval, D = ratio.
Answers to Items from Problem Set 1 Item 1 Identify the scale of measurement most appropriate for each of the following variables. (Use A = nominal, B = ordinal, C = interval, D = ratio.) a. response latency
More informationSampling Variability and Confidence Intervals. John McGready Johns Hopkins University
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this
More informationClass 24. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700
Class 4 Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science Copyright 013 by D.B. Rowe 1 Agenda: Recap Chapter 9. and 9.3 Lecture Chapter 10.1-10.3 Review Exam 6 Problem Solving
More informationMeasurement Error. Martin Bland. Accuracy and precision. Error. Measurement in Health and Disease. Professor of Health Statistics University of York
Measurement in Health and Disease Measurement Error Martin Bland Professor of Health Statistics University of York http://martinbland.co.uk/ Accuracy and precision In this lecture: measurements which are
More information1. Poisson distribution is widely used in statistics for modeling rare events.
Discrete probability distributions - Class 5 January 20, 2014 Debdeep Pati Poisson distribution 1. Poisson distribution is widely used in statistics for modeling rare events. 2. Ex. Infectious Disease
More informationSampling Distributions: Central Limit Theorem
Review for Exam 2 Sampling Distributions: Central Limit Theorem Conceptually, we can break up the theorem into three parts: 1. The mean (µ M ) of a population of sample means (M) is equal to the mean (µ)
More informationPopulation Variance. Concepts from previous lectures. HUMBEHV 3HB3 one-sample t-tests. Week 8
Concepts from previous lectures HUMBEHV 3HB3 one-sample t-tests Week 8 Prof. Patrick Bennett sampling distributions - sampling error - standard error of the mean - degrees-of-freedom Null and alternative/research
More information6. Multiple regression - PROC GLM
Use of SAS - November 2016 6. Multiple regression - PROC GLM Karl Bang Christensen Department of Biostatistics, University of Copenhagen. http://biostat.ku.dk/~kach/sas2016/ kach@biostat.ku.dk, tel: 35327491
More informationImportant note: Transcripts are not substitutes for textbook assignments. 1
In this lesson we will cover correlation and regression, two really common statistical analyses for quantitative (or continuous) data. Specially we will review how to organize the data, the importance
More informationClinical Research Module: Biostatistics
Clinical Research Module: Biostatistics Lecture 1 Alberto Nettel-Aguirre, PhD, PStat These lecture notes based on others developed by Drs. Peter Faris, Sarah Rose Luz Palacios-Derflingher and myself Who
More informationLecture 12: Effect modification, and confounding in logistic regression
Lecture 12: Effect modification, and confounding in logistic regression Ani Manichaikul amanicha@jhsph.edu 4 May 2007 Today Categorical predictor create dummy variables just like for linear regression
More informationPubH 7405: REGRESSION ANALYSIS. MLR: INFERENCES, Part I
PubH 7405: REGRESSION ANALYSIS MLR: INFERENCES, Part I TESTING HYPOTHESES Once we have fitted a multiple linear regression model and obtained estimates for the various parameters of interest, we want to
More information171:162 Design and Analysis of Biomedical Studies, Summer 2011 Exam #3, July 16th
Name 171:162 Design and Analysis of Biomedical Studies, Summer 2011 Exam #3, July 16th Use the selected SAS output to help you answer the questions. The SAS output is all at the back of the exam on pages
More informationCohen s s Kappa and Log-linear Models
Cohen s s Kappa and Log-linear Models HRP 261 03/03/03 10-11 11 am 1. Cohen s Kappa Actual agreement = sum of the proportions found on the diagonals. π ii Cohen: Compare the actual agreement with the chance
More informationTwo sample hypothesis testing
Statistics February 26, 2014 Debdeep Pati Two sample hypothesis testing 1. Suppose we want to study the relationship between use of oral contraceptives (OC) and level of blood pressure (BP) in women. 2.
More informationInstrumentation (cont.) Statistics vs. Parameters. Descriptive Statistics. Types of Numerical Data
Norm-Referenced vs. Criterion- Referenced Instruments Instrumentation (cont.) October 1, 2007 Note: Measurement Plan Due Next Week All derived scores give meaning to individual scores by comparing them
More informationContent by Week Week of October 14 27
Content by Week Week of October 14 27 Learning objectives By the end of this week, you should be able to: Understand the purpose and interpretation of confidence intervals for the mean, Calculate confidence
More informationBIOL 51A - Biostatistics 1 1. Lecture 1: Intro to Biostatistics. Smoking: hazardous? FEV (l) Smoke
BIOL 51A - Biostatistics 1 1 Lecture 1: Intro to Biostatistics Smoking: hazardous? FEV (l) 1 2 3 4 5 No Yes Smoke BIOL 51A - Biostatistics 1 2 Box Plot a.k.a box-and-whisker diagram or candlestick chart
More informationBiostatistics for biomedical profession. BIMM34 Karin Källen & Linda Hartman November-December 2015
Biostatistics for biomedical profession BIMM34 Karin Källen & Linda Hartman November-December 2015 12015-11-02 Who needs a course in biostatistics? - Anyone who uses quntitative methods to interpret biological
More informationHypothesis testing: Steps
Review for Exam 2 Hypothesis testing: Steps Repeated-Measures ANOVA 1. Determine appropriate test and hypotheses 2. Use distribution table to find critical statistic value(s) representing rejection region
More informationStatistical methods for comparing multiple groups. Lecture 7: ANOVA. ANOVA: Definition. ANOVA: Concepts
Statistical methods for comparing multiple groups Lecture 7: ANOVA Sandy Eckel seckel@jhsph.edu 30 April 2008 Continuous data: comparing multiple means Analysis of variance Binary data: comparing multiple
More informationLecture 3: Measures of effect: Risk Difference Attributable Fraction Risk Ratio and Odds Ratio
Lecture 3: Measures of effect: Risk Difference Attributable Fraction Risk Ratio and Odds Ratio Dankmar Böhning Southampton Statistical Sciences Research Institute University of Southampton, UK March 3-5,
More informationChapter 16 One-way Analysis of Variance
Chapter 16 One-way Analysis of Variance I am assuming that most people would prefer to see the solutions to these problems as computer printout. (I will use R and SPSS for consistency.) 16.1 Analysis of
More informationLecture 18: Simple Linear Regression
Lecture 18: Simple Linear Regression BIOS 553 Department of Biostatistics University of Michigan Fall 2004 The Correlation Coefficient: r The correlation coefficient (r) is a number that measures the strength
More informationProblem Set 4 - Solutions
Problem Set 4 - Solutions Econ-310, Spring 004 8. a. If we wish to test the research hypothesis that the mean GHQ score for all unemployed men exceeds 10, we test: H 0 : µ 10 H a : µ > 10 This is a one-tailed
More informationThe t-test Pivots Summary. Pivots and t-tests. Patrick Breheny. October 15. Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/18
and t-tests Patrick Breheny October 15 Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/18 Introduction The t-test As we discussed previously, W.S. Gossett derived the t-distribution as a way of
More informationPS 203 Spring 2002 Homework One - Answer Key
PS 203 Spring 2002 Homework One - Answer Key 1. If you have a home or office computer, download and install WinBUGS. If you don t have your own computer, try running WinBUGS in the Department lab. 2. The
More informationCorrelation and Regression Bangkok, 14-18, Sept. 2015
Analysing and Understanding Learning Assessment for Evidence-based Policy Making Correlation and Regression Bangkok, 14-18, Sept. 2015 Australian Council for Educational Research Correlation The strength
More informationChapter 11 Sampling Distribution. Stat 115
Chapter 11 Sampling Distribution Stat 115 1 Definition 11.1 : Random Sample (finite population) Suppose we select n distinct elements from a population consisting of N elements, using a particular probability
More informationLecture 12: Small Sample Intervals Based on a Normal Population Distribution
Lecture 12: Small Sample Intervals Based on a Normal Population MSU-STT-351-Sum-17B (P. Vellaisamy: MSU-STT-351-Sum-17B) Probability & Statistics for Engineers 1 / 24 In this lecture, we will discuss (i)
More informationProbability and Probability Distributions. Dr. Mohammed Alahmed
Probability and Probability Distributions 1 Probability and Probability Distributions Usually we want to do more with data than just describing them! We might want to test certain specific inferences about
More information10.4 Hypothesis Testing: Two Independent Samples Proportion
10.4 Hypothesis Testing: Two Independent Samples Proportion Example 3: Smoking cigarettes has been known to cause cancer and other ailments. One politician believes that a higher tax should be imposed
More informationOne-Way ANOVA. Some examples of when ANOVA would be appropriate include:
One-Way ANOVA 1. Purpose Analysis of variance (ANOVA) is used when one wishes to determine whether two or more groups (e.g., classes A, B, and C) differ on some outcome of interest (e.g., an achievement
More informationTECHNICAL APPENDIX WITH ADDITIONAL INFORMATION ON METHODS AND APPENDIX EXHIBITS. Ten health risks in this and the previous study were
Goetzel RZ, Pei X, Tabrizi MJ, Henke RM, Kowlessar N, Nelson CF, Metz RD. Ten modifiable health risk factors are linked to more than one-fifth of employer-employee health care spending. Health Aff (Millwood).
More informationINTRODUCTION TO ANALYSIS OF VARIANCE
CHAPTER 22 INTRODUCTION TO ANALYSIS OF VARIANCE Chapter 18 on inferences about population means illustrated two hypothesis testing situations: for one population mean and for the difference between two
More informationSimple Linear Regression: One Qualitative IV
Simple Linear Regression: One Qualitative IV 1. Purpose As noted before regression is used both to explain and predict variation in DVs, and adding to the equation categorical variables extends regression
More informationT test for two Independent Samples. Raja, BSc.N, DCHN, RN Nursing Instructor Acknowledgement: Ms. Saima Hirani June 07, 2016
T test for two Independent Samples Raja, BSc.N, DCHN, RN Nursing Instructor Acknowledgement: Ms. Saima Hirani June 07, 2016 Q1. The mean serum creatinine level is measured in 36 patients after they received
More informationUNIVERSITY OF MASSACHUSETTS Department of Biostatistics and Epidemiology BioEpi 540W - Introduction to Biostatistics Fall 2004
UNIVERSITY OF MASSACHUSETTS Department of Biostatistics and Epidemiology BioEpi 50W - Introduction to Biostatistics Fall 00 Exercises with Solutions Topic Summarizing Data Due: Monday September 7, 00 READINGS.
More informationContinuous Probability Distributions
Continuous Probability Distributions Called a Probability density function. The probability is interpreted as "area under the curve." 1) The random variable takes on an infinite # of values within a given
More informationLecture 10: The Normal Distribution. So far all the random variables have been discrete.
Lecture 10: The Normal Distribution 1. Continuous Random Variables So far all the random variables have been discrete. We need a different type of model (called a probability density function) for continuous
More informationHypothesis testing: Steps
Review for Exam 2 Hypothesis testing: Steps Exam 2 Review 1. Determine appropriate test and hypotheses 2. Use distribution table to find critical statistic value(s) representing rejection region 3. Compute
More informationExploring, summarizing and presenting data. Berghold, IMI, MUG
Exploring, summarizing and presenting data Example Patient Nr Gender Age Weight Height PAVK-Grade W alking Distance Physical Functioning Scale Total Cholesterol Triglycerides 01 m 65 90 185 II b 200 70
More informationTwo sided, two sample t-tests. a) IQ = 100 b) Average height for men = c) Average number of white blood cells per cubic millimeter is 7,000.
Two sided, two sample t-tests. I. Brief review: 1) We are interested in how a sample compares to some pre-conceived notion. For example: a) IQ = 100 b) Average height for men = 5 10. c) Average number
More informationAMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015
AMS7: WEEK 7. CLASS 1 More on Hypothesis Testing Monday May 11th, 2015 Testing a Claim about a Standard Deviation or a Variance We want to test claims about or 2 Example: Newborn babies from mothers taking
More informationSuppose that we are concerned about the effects of smoking. How could we deal with this?
Suppose that we want to study the relationship between coffee drinking and heart attacks in adult males under 55. In particular, we want to know if there is an association between coffee drinking and heart
More informationTests for Population Proportion(s)
Tests for Population Proportion(s) Esra Akdeniz April 6th, 2016 Motivation We are interested in estimating the prevalence rate of breast cancer among 50- to 54-year-old women whose mothers have had breast
More informationLecture 10: Comparing two populations: proportions
Lecture 10: Comparing two populations: proportions Problem: Compare two sets of sample data: e.g. is the proportion of As in this semester 152 the same as last Fall? Methods: Extend the methods introduced
More informationInference for Single Proportions and Means T.Scofield
Inference for Single Proportions and Means TScofield Confidence Intervals for Single Proportions and Means A CI gives upper and lower bounds between which we hope to capture the (fixed) population parameter
More information11: Comparing Group Variances. Review of Variance
11: Comparing Group Variances Review of Variance Parametric measures of variability are often based on sum of squares (SS) around e mean: (1) For e data set {3, 4, 5, 8}, = 5 and SS = (3 5) + (4 5) + (5
More informationChapter 23: Inferences About Means
Chapter 3: Inferences About Means Sample of Means: number of observations in one sample the population mean (theoretical mean) sample mean (observed mean) is the theoretical standard deviation of the population
More informationTruck prices - linear model? Truck prices - log transform of the response variable. Interpreting models with log transformation
Background Regression so far... Lecture 23 - Sta 111 Colin Rundel June 17, 2014 At this point we have covered: Simple linear regression Relationship between numerical response and a numerical or categorical
More informationDescriptive statistics
Patrick Breheny February 6 Patrick Breheny to Biostatistics (171:161) 1/25 Tables and figures Human beings are not good at sifting through large streams of data; we understand data much better when it
More informationUnit 9 Regression and Correlation Homework #14 (Unit 9 Regression and Correlation) SOLUTIONS. X = cigarette consumption (per capita in 1930)
BIOSTATS 540 Fall 2015 Introductory Biostatistics Page 1 of 10 Unit 9 Regression and Correlation Homework #14 (Unit 9 Regression and Correlation) SOLUTIONS Consider the following study of the relationship
More informationStatistics for IT Managers
Statistics for IT Managers 95-796, Fall 2012 Module 2: Hypothesis Testing and Statistical Inference (5 lectures) Reading: Statistics for Business and Economics, Ch. 5-7 Confidence intervals Given the sample
More informationWhile you wait: Enter the following in your calculator. Find the mean and sample variation of each group. Bluman, Chapter 12 1
While you wait: Enter the following in your calculator. Find the mean and sample variation of each group. Bluman, Chapter 12 1 Chapter 12 Analysis of Variance McGraw-Hill, Bluman, 7th ed., Chapter 12 2
More informationRegression so far... Lecture 21 - Logistic Regression. Odds. Recap of what you should know how to do... At this point we have covered: Sta102 / BME102
Background Regression so far... Lecture 21 - Sta102 / BME102 Colin Rundel November 18, 2014 At this point we have covered: Simple linear regression Relationship between numerical response and a numerical
More informationLecture 26: Chapter 10, Section 2 Inference for Quantitative Variable Confidence Interval with t
Lecture 26: Chapter 10, Section 2 Inference for Quantitative Variable Confidence Interval with t t Confidence Interval for Population Mean Comparing z and t Confidence Intervals When neither z nor t Applies
More informationBasic Statistics and Probability Chapter 9: Inferences Based on Two Samples: Confidence Intervals and Tests of Hypotheses
Basic Statistics and Probability Chapter 9: Inferences Based on Two Samples: Confidence Intervals and Tests of Hypotheses Identifying the Target Parameter Comparing Two Population Means: Independent Sampling
More information9 One-Way Analysis of Variance
9 One-Way Analysis of Variance SW Chapter 11 - all sections except 6. The one-way analysis of variance (ANOVA) is a generalization of the two sample t test to k 2 groups. Assume that the populations of
More informationLinear models Analysis of Covariance
Esben Budtz-Jørgensen April 22, 2008 Linear models Analysis of Covariance Confounding Interactions Parameterizations Analysis of Covariance group comparisons can become biased if an important predictor
More informationZ score indicates how far a raw score deviates from the sample mean in SD units. score Mean % Lower Bound
1 EDUR 8131 Chat 3 Notes 2 Normal Distribution and Standard Scores Questions Standard Scores: Z score Z = (X M) / SD Z = deviation score divided by standard deviation Z score indicates how far a raw score
More information5 Basic Steps in Any Hypothesis Test
5 Basic Steps in Any Hypothesis Test Step 1: Determine hypotheses (H0 and Ha). H0: μ d = 0 (μ 1 μ 2 =0) Ha: μ d > 0 (μ 1 μ 2 >0) upper-sided Ha: : μ d 0 (μ 1 μ 2 0) two-sided Step 2: Verify necessary conditions,
More informationSTP 226 EXAMPLE EXAM #3 INSTRUCTOR:
STP 226 EXAMPLE EXAM #3 INSTRUCTOR: Honor Statement: I have neither given nor received information regarding this exam, and I will not do so until all exams have been graded and returned. Signed Date PRINTED
More informationDISCRETE PROBABILITY DISTRIBUTIONS
DISCRETE PROBABILITY DISTRIBUTIONS REVIEW OF KEY CONCEPTS SECTION 41 Random Variable A random variable X is a numerically valued quantity that takes on specific values with different probabilities The
More informationLinear models Analysis of Covariance
Esben Budtz-Jørgensen November 20, 2007 Linear models Analysis of Covariance Confounding Interactions Parameterizations Analysis of Covariance group comparisons can become biased if an important predictor
More informationDisadvantages of using many pooled t procedures. The sampling distribution of the sample means. The variability between the sample means
Stat 529 (Winter 2011) Analysis of Variance (ANOVA) Reading: Sections 5.1 5.3. Introduction and notation Birthweight example Disadvantages of using many pooled t procedures The analysis of variance procedure
More informationLatent Growth Models 1
1 We will use the dataset bp3, which has diastolic blood pressure measurements at four time points for 256 patients undergoing three types of blood pressure medication. These are our observed variables:
More informationChapter 9. Hypothesis testing. 9.1 Introduction
Chapter 9 Hypothesis testing 9.1 Introduction Confidence intervals are one of the two most common types of statistical inference. Use them when our goal is to estimate a population parameter. The second
More informationFaculty of Health Sciences. Regression models. Counts, Poisson regression, Lene Theil Skovgaard. Dept. of Biostatistics
Faculty of Health Sciences Regression models Counts, Poisson regression, 27-5-2013 Lene Theil Skovgaard Dept. of Biostatistics 1 / 36 Count outcome PKA & LTS, Sect. 7.2 Poisson regression The Binomial
More informationGeneral Linear Model (Chapter 4)
General Linear Model (Chapter 4) Outcome variable is considered continuous Simple linear regression Scatterplots OLS is BLUE under basic assumptions MSE estimates residual variance testing regression coefficients
More informationLecture 7 Time-dependent Covariates in Cox Regression
Lecture 7 Time-dependent Covariates in Cox Regression So far, we ve been considering the following Cox PH model: λ(t Z) = λ 0 (t) exp(β Z) = λ 0 (t) exp( β j Z j ) where β j is the parameter for the the
More informationEssential Question: How are the mean and the standard deviation determined from a discrete probability distribution?
Probability and Statistics The Binomial Probability Distribution and Related Topics Chapter 5 Section 1 Introduction to Random Variables and Probability Distributions Essential Question: How are the mean
More informationLecture 14. Analysis of Variance * Correlation and Regression. The McGraw-Hill Companies, Inc., 2000
Lecture 14 Analysis of Variance * Correlation and Regression Outline Analysis of Variance (ANOVA) 11-1 Introduction 11-2 Scatter Plots 11-3 Correlation 11-4 Regression Outline 11-5 Coefficient of Determination
More informationLecture 14. Outline. Outline. Analysis of Variance * Correlation and Regression Analysis of Variance (ANOVA)
Outline Lecture 14 Analysis of Variance * Correlation and Regression Analysis of Variance (ANOVA) 11-1 Introduction 11- Scatter Plots 11-3 Correlation 11-4 Regression Outline 11-5 Coefficient of Determination
More informationPubH 7405: REGRESSION ANALYSIS. Correlation Analysis
PubH 7405: REGRESSION ANALYSIS Correlation Analysis CORRELATION & REGRESSION We have continuous measurements made on each subject, one is the response variable Y, the other predictor X. There are two types
More informationDr. Junchao Xia Center of Biophysics and Computational Biology. Fall /8/2016 1/38
BIO5312 Biostatistics Lecture 11: Multisample Hypothesis Testing II Dr. Junchao Xia Center of Biophysics and Computational Biology Fall 2016 11/8/2016 1/38 Outline In this lecture, we will continue to
More informationHypothesis Testing, Power, Sample Size and Confidence Intervals (Part 2)
Hypothesis Testing, Power, Sample Size and Confidence Intervals (Part 2) B.H. Robbins Scholars Series June 23, 2010 1 / 29 Outline Z-test χ 2 -test Confidence Interval Sample size and power Relative effect
More informationChapter 2. Mean and Standard Deviation
Chapter 2. Mean and Standard Deviation The median is known as a measure of location; that is, it tells us where the data are. As stated in, we do not need to know all the exact values to calculate the
More information