7.2 Inference for comparing means of two populations where the samples are independent
|
|
- Jade Jenkins
- 6 years ago
- Views:
Transcription
1 Objectives 7.2 Inference for comaring means of two oulations where the samles are indeendent Two-samle t significance test (we give three examles) Two-samle t confidence interval htt://onlinestatbook.com/2/tests_of_means/difference_means.ht ml
2 Toics: Indeendent two samle t-test Be able to construct the aroriate hyothesis for comaring two oulations based on what researchers want to rove. When given a data sets and the story behind it, be able to identify when to use an indeendent two samle t-test. Understand the statcrunch outut for an indeendent two samle t- test and confidence interval. Be able to construct confidence intervals and hyothesis tests based on a art of the outut. Understand the standard error for the indeendent two samles t-test and confidence interval. Understand what combination of the samles sizes yields the smallest standard error and why. Be able to check the validity (accuracy) of the -values and confidence intervals.
3 Standard errors We have learnt that standard errors are crucial in constructing both confidence intervals and also statistical testing. Do not get mixed u between standard error of an estimator and standard deviation of the samle. The amount of variation in the samle (the average (suared) distance between each estimate and the oulation mean) is measured by its standard error, which is s.e. = s n = (amount of variation in the samle) (suare root of the samle size) You can imagine that the unknown oulation mean should be in some roximity to the known samle mean. The roximity is measured by the standard error. The samle mean tends to get closer in roximity/recision to the oulation mean as you increase the samle size. As we continue with the course, the standard errors will become more comlex, but the underlying ideas are the same.
4 Comarisons everywhere! You will see comarisons being made all over the lace. Just look at some of the roducts you have at home: Dentex floss sticks are clinically roven to remove more laue than regular floss. What does this mean how on earth do they rove this? This is an examle of where they rove the results statistically. It is done via clinical trials, by collecting data: Aim: to see if it is ossible to rove that on average the amount of laue removed using floss sticks is more than the average amount of laue removed using regular floss. They state their hyothesis as H 0 : μ FP -μ F 0 against H A : μ FP μ F >0. where μ FP = mean floss removed using Floss stick and μ F = mean laue removed using regular loss.
5 Designing the floss study There are two ways the data could have been collected: Either one simle random samle of individuals is taken. For each individual (on searate far aart days) is asked to use a floss stick and regular floss and the amount of laue removed for each treatment is measured. This is an examle of a matched air study where the same individual is used in both treatments. In this case a matched aired-test is done which we covered in the revious lectures. The advantage of this design is that it avoids confounding because the same individual is used for both exeriments. The disadvantage is that it takes time and effort because we need to do it over several days. Alternatively a simle random samle is taken and randomly slit into two grous. Some are asked to use floss and others are asked to use floss sticks. The individuals in both grous are comletely indeendent of each other and there isn t any matching. The advantage it that it is uick to do this exeriment. Disadvantage: larger standard errors.
6 Indeendent samles inference The urose of most studies is to comare the effects of different treatments or conditions. Using matching to design an exeriment is very useful way to make comarisons between oulations since it tends to reduce confounding factors (such as the ability of a erson to floss). If we have reason to believe that there is matching between subjects, then we should use a matched aired t-test. However, in many situations it is imossible to have any matching between the samles. If we want to see whether a drug works, we need to comare a SRS (simle random samle) of atients treated with the drug with a SRS of atients gives the lacebo.
7 Examle 1: Floss A simle random grou of individuals are chosen and the amount of laue removed using both regular floss and floss icks is measured. Secifically, 50 individuals were given floss and another 50 were given floss sticks. The average amount of laue removed in the grous using floss sticks is 3.16mg whereas the average amount of laue removed in the grous using regular floss is 2.99mg.
8 Examle 1: Floss The data is on the left. Note that each row does not corresond to the same individual. These are two different individuals with no airing/matching. They are two indeendent samles. It is hard to understand these numbers, so the data is summarized in the table below.
9 Does the difference of 3.16 vs 2.99 automatically rove that floss sticks are better? No it doesn t. The roducts makers cannot use this as roof. They need to show that given the data it is imlausible that over the entire oulation there is no difference between the floss and floss sticks (they need to show that the null is not lausible). Technically, this means calculating the chance of observing a difference of 3.16mg 2.99mg by fluke. This is the -value. If the -value is large (over a re-determined significance level, say 5%), then we cannot reject the null. This means that there isn t any evidence in the collected data that that floss icks are better than regular floww. If the -value is small (below a re-determined significance level), then the data collected suggests that floss stick function better than floss. Fortunately, comuter software does the calculation. However, the usual assumtions of normality of the samle means still aly. So these still have to be checked:
10 The indeendent samle t-test in Statcrunch Go to Stat -> T Stats -> Two Samle-> With Data We test the hyothesis H 0 : μ FP -μ F 0 against H A : μ FP μ F >0.
11 Anaylsis of data The green lines in the dot lot corresond to the samle means calculated from the data. We test the hyothesis H 0 : μ FP -μ F 0 against H A : μ FP μ F >0. We see that the samle mean for FlossSticks is greater than the samle mean of regular floss. But just looking at two data sets it see there is a large overla in numbers. This makes it visually hard to discriminate between the two data sets and determine the -value. However, the -value can be calculated and it is 3.15%
12 The -value of 3.15% (which is less than 5%) tell us that there is some evidence in the claim that floss sticks remove more laue than floss. To find out on average how much more laue floss sticks remove comared with regular floss we calculate the 95% confidence interval. This can be done using the outut and the t-calculator in Statcrunch gives us the critical value to use in construction of the 95% confidence interval for the mean difference. [ , ] =[ , 0.329]
13 Thus with 95% confidence the mean difference difference lies somewhere between and Note that zero lies in this interval, this is because we can reject the null using a one-sided test, but not a 2 sided one. Alternatively, use Statcrunch to give the confidence interval: Observe that these numbers are identical to the calculation we made.
14 Since Dentex found a statistically significant difference and they can make the claim that Floss sticks remove more laue than Floss. Checking the normality assumtion of the samle means: Samle size is relatively large and the distribution of the two samle means (look at the green lots) look uite normal. Therefore we can say the 3.15% -value is close to the truth -value based on normality of the samle means.
15 Summary In the situation discussed above the samles are comletely indeendent of each other there isn t any matching. In this situation we need to use an indeendent t-test. In general, subjects are often observed searately under the different conditions, resulting in samles that are indeendent. That is, the subjects of each samle are obtained and observed searately from, and without any regard to, the subjects of the other samles. As in the matched airs design, subjects should be samled randomly from the oulation of interest. By the end of the class you should be able to identify which test to aly give the situation. You should look to see if there is any matching in the data, if there is matching never do an indeendent samle t-test (this will give the wrong standard errors and can lead to unreliable results). If the samles aear to comletely indeendent of each other use an indeendent samle t-test.
16 Examle 2: Heights Consider the following roblem that we already know the answer to: In general, do males students tend to be taller than female students? In terms of a hyothesis test we to see if there is evidence to suort: H 0 : μ M - μ F 0 against H A : μ M - μ F > 0. A matched design is ossible, by random samling male and female student siblings. Such data may be hard to come by. In addition, we exclude the sub-oulation of eole with same-sex or no siblings. Instead, a random samle of students was drawn and an indeendent samle t-test is done. Statcrunch instructions: Stat -> T-stat > Two Samle -> With data. Then lace the relevant columns in each box and uncheck the box that says ooled variance. You have the otion of doing a test (one or two sided) or constructing a confidence interval.
17 In this samle there were 27 males and 37 females, there is clearly no matching. The difference in samle means is Visually, there seems to be a large difference between the data sets. Meaning that it is unlikely they share the same mean (small - value).
18 We see that the -value is less 0.01% (we do the test at the 5% level) which means there is strong evidence to suggest that males are on average taller than females. t-value = =7.27 We can use the same outut to construct a 99% confidence interval for the mean difference. The only difference is that the degrees of freedom is unusual it is 48.29%. However, we do exactly the same as before, we either look-u tables (rovided by me in the exam aer) or use software such as Statcrunch [0.466 ± ] = [0.29, 0.64] With 99% confidence we believe the mean difference in height lies between 0.29 to 0.64 feet.
19
20 Examle 3: Diets We want to know whether there is any difference between two different diets. 20 randomly samled eole are randomly laced into two grous of 10. The first grou goes on Diet I and the second grou on Diet 2. The weight loss for each grou (after dieting for one month) is given below We need to use an indeendent two samle t-rocedure (no matching between individuals). As we have no reason to believe one diet is better than another, our hyothesis of interest is: H 0 : μ 1 - μ 2 = 0 against H A : μ 1 - μ 2 0 The samle means are different, but there is a large overla in the dots.
21 The 95% confidence interval is [-2.23,0.598]. This tells us with 95% confidence the mean difference between the diets is somewhere in this interval. As this contains the mean difference of 0, we cannot reject the null (for the two sided test). The -value is greater for H 0 : μ 1 - μ 2 = 0 against H A : μ 1 - μ 2 0 is greater than 5% for the two sided test. To calculate the recise -value we use the t-transform t-value = = 1.22 Using Statcrunch we see that the smallest area is the area to the LEFT of -1.22, this is 12%. Thus the -value for the two-sided test is 24%. From the data there is no evidence to suggest there is any difference between the means of the diets.
22 Examle 4: Does calcium interact with iron absortion? It is believed that too much calcium in a diet can reduce the absortion of iron. To test this, 20 randomly samled eole were ut into two grous of 10. One grou was given a calcium high diet and the iron absortion recorded. The other grou was given a calcium low diet and iron recorded. The differences from their revious level is given below (this is why you see some negative numbers). The data and summary statistics is given below: We observe that for this grou there those in a calcium low grou absorb more iron, is this statistically significant?
23 The hyothesis of interest is H 0 : μ CH - μ CL 0 against H A : μ CH - μ CL < 0. The hyothesis given in the outut above is oosite of what we want to test. However, from the outut we immediately see that the -value for H 0 : μ CH - μ CL 0 against H A : μ CH - μ CL < 0 is the area to the LEFT of which is = 0.26%. As this -value is less than 5% there is evidence to reject the null and conclude that high calcium decreases iron absortion (comared with low calcium). The 95% confidence interval for the mean difference is [ ± ]
24 Examle 5: Calf treatments Comaring the weights of calves and different treatments Treatment A vs B Is there is evidence in the data to suggest there is a difference between treatments A and B. This means we are testing H 0: μ A μ B =0 against H A: μ A μ B 0. Eyeballing the numbers, we see that the is not much difference in the samle means and It is very hard to discriminate between the two data sets. This corresonds to a large -value.
25 Examle 5: Calf treatments The -value for H 0: μ A μ B =0 against H A: μ A μ B 0 is 93%. This tells us that obtaining a differences seen in the two grous when there is no difference in the treatments (in terms of weight) is highly likely. Thus there is no evidence to reject the null Note: To analyze the calf data in Statcrunch you need to slit each grou into their own columns. To do this go to Data -> Arrange -> Slit -> Select Column data you want to analyze (for examle Wt 8) and Select the grou you want (for examle TRT)
26 Treatment A vsd From the summary statistics, the difference between treatment A and B aears uite large (7.7), can this difference be exlained by random chance? We test the hyothesis H 0: μ A μ D =0 against H A: μ A μ D 0. There is a 7.7 oint difference in the treatments but a large overla in the data sets (both have a large standard deviation).
27 The mean difference may be -7.7 but the -value is 34%, this tells us there is over a 1/3 chance of observing a difference of 7.7 in the samle means when there is in fact no difference in the treatments. This is uite large over the 5% significant level, so there is no evidence to reject the null We now construct a 95% confidence interval. To do this we use statcrunch to find the critical value of a t-distribution with df The 95% confidence interval for the difference in mean weights for the treatments in [ , ] = [-24,9.2]. This is an interval where we believe the mean difference should lie and exlains why we were not able to reject the null, desite 7.7 being subjectively large. The difference this interval is wide is that the standard error is large, due to small samle size and large standard deviation of calf weights.
28 The standard error, what is that? We illustrate the idea with the female and male height examle For every samle the difference in samle means X M XF will vary. If the samle size is large enough X M XF will have a normal distribution (thanks to the central limit theorem). The normal distribution will be centered about the true mean μ M - μ F (oulation male mean minus oulation female mean) and but it will have a comlicated standard error: r 2M F 34 Where σ M = standard deviation of heights and σ F = standard deviation of female heights.
29 Therefore, just like in the one-samle case, in order to do the test we simly take the z-transform under the null that the mean male and female height is the same (μ M - μ F = 0). z = M 27 + F 2 34 At this oint we encounter a roblem. We do not know the oulation standard deviations σ M and σ F. But we see from the summary statistics that we do have estimates for them. Thus we can relace the true oulation standard deviations by its estimates. And obtain the transformation: t =
30 The distribution of this ratio? Having exchanged the unknown true standard deviations with their estimators (calculated from the data) it seems reasonable to suose that extra variability has been added to this ratio and we need to correct for it by changing from a normal distribution to another distribution. Previously for the one samle case, the new distribution which took into account of this variability was the t- distribution. In the two samle case, the ratio t = X M XF s 2 M 27 + s2 F 34 This ratio has aroximately a t-distribution with a very strange number of degrees of freedom. 2 This is why using software is imortant, you don t want to calculate this stuff!! 2 s 1 + s 2 $ 2 # & " n df= 1 n 2 % 2 1! 2 s 1 $ # & + 1! # n 1 1 " % n 2 1 " n 1 2 s 2 n 2 $ & % 2
31 We are testing H 0 : μ M - μ F = 0 against H A : μ M - μ F > 0 and have the t-transform =7.27 Which we know has degree of freedom. Now going to Statcrunch -> Stat -> Calculators -> T we get The area to the right of 7.11 for a t-distribution with degrees of freedom is tiny. So at both the 5% and 1% significance level we would reject the null. This means there is lenty of evidence to reject the null and conclude the mean height of males is greater than females. Remember If the samle sizes are both over 15, and the data not too skewed, using the t-distribution reasonable.
32 Summary of Analysis: Significant effect Remember: Significance means the evidence of the data is sufficient to reject the null hyothesis (at our stated level α). Only data, and the statistics we calculate from the data, can be statistically significant. We can say that the samle means are significantly different or that the observed effect is significant. But the conclusion about the oulation means is simly they are different. The observed effect of 0.46 between male and female height is significant so we conclude that the true effect μ M-- -μ F is greater than zero. Having made this conclusion, or even if we have not, we can always estimate the difference using the confidence interval [0.33,0.58].
33 Standard errors In the one-samle case the standard error is s(standard deviation of oulation) n(samle size) = r s 2 n In the indeendent two-samle case the standard error is s s 2 1 (variance of oulation one) n(samle size) + s2 2 (variance of oulation two) m(samle size) These two different standard errors are for different situations but the ideas are the same. Remember, that a smaller standard error leads to more reliable estimators. Therefore if we are designing the exeriment to decrease the samle size we observe that: For the one-samle case, we can decrease the standard error by increasing the samle size (it is usually imossible to decrease the standard deviation) For the two-samle case, we can decrease the standard error by increasing the size of both samles (again it is usually imossible to decrease the standard deviation of the oulations).
34 Choosing the samle size We now consider how to distribute the samle sizes in the case that the standard deviations for both samles are about the same. In this case the standard error is: r r s 2 n + s2 1 m = s n + 1 m Remember the standard deviation is fixed we cannot change this value. Suose that we only have enough funds to include 200 subjects in our exeriment, how to distribute them amongst the two grous: It makes no sense to have on subject in grou 1 and 199 in grou 2. For examle, if we are comaring male and female heights, this would be using one male height to estimate the mean height of males and 199 females heights to estimate the mean height of females. Clearly this is wrong, and we r can understand why from the standard which is 1 s =1.002s On the other hand if we distributed them evenly, 100 and 100, the standard error is a lot smaller r 1 s =0.141s
35 Which tye of test? One samle, aired samles or two indeendent samles? Comaring vitamin content of bread immediately after baking vs. 3 days later (the same loaves are used on day one and 3 days later). Comaring vitamin content of bread immediately after baking vs. 3 days later (tests made on indeendent loaves). Average fuel efficiency for 2005 vehicles is 21 miles er gallon. Is average fuel efficiency higher in the new generation green vehicles? Is blood ressure altered by use of an oral contracetive? Comaring a grou of women not using an oral contracetive with a grou taking it. Review insurance records for dollar amount aid after fire damage in houses euied with a fire extinguisher vs. houses without one. Was there a difference in the average dollar amount aid?
36 Cautions about the two samle t-test or interval Using the correct standard error and degrees of freedom is critical. As in the one samle t-test, the method assumes simle random samles. Likewise, it also assumes the oulations have normal distributions. Skewness and outliers can make the methods inaccurate (that is, having confidence/significance level other that what they are suosed to have). The larger the samle sizes, the less this is a roblem. It also is less of a roblem if the oulations have similar skewness and the two samles are close to the same size. Significant effect merely means we have sufficient evidence to say the two true means are different. It does not exlain why they are different or how meaningful/imortant the difference is. A confidence interval is needed to determine how big the effect is.
37 Summary: Distribution of two samle means In order to do statistical inference, we must know a few things about the samling distribution of our statistic. The samling distribution of has standard deviation (Mathematically, the variance of the difference is the sum of s n + s n the variances of the two samle means.) This is estimated by the standard error 1 2 If the samle sizes are both over 15, and the data not too skewed, using the t-distribution reasonable. Then the two-samle t statistic is x - x t = ( x1-x2) -( µ 1-µ 2). 2 2 s1 s2 + n n s2 s n SE = + n. This statistic has an aroximate t-distribution on which we will base our inferences. But the degrees of freedom is comlicated
38 Two-samle t confidence interval Recall that we have two indeendent samles and we use the difference between the samle averages ( ) to estimate (μ 1 μ 2 ) s This estimate has standard error 1 s2 SE = +. n1 n2 The margin of error for a confidence interval of μ 1 μ 2 is We find t* is found using the comuter. The confidence interval is then comuted as 2 2 * s1 s2 * m= t + = t SE n n ( x - x ) ± m x - x The interretation of confidence is the same as before: it is the roortion of ossible samles for which the method leads to a true statement about the arameters.
39 Two-samle t significance test The null hyothesis is that both oulation means μ 1 and μ 2 are eual, thus their difference is eual to zero. H 0 : μ 1 = μ 2 Û H 0 : μ 1 μ 2 = 0. Either a one-sided or a two-sided alternative hyothesis can be tested. Using the value (μ 1 μ 2 ) = 0 given in H 0, the test statistic becomes t = ( x1-x2) s2 s n + n 1 2 To find the P-value, we look u the aroriate robability of the t-distribution using the df given by Statcrunch or me.
40 Statistics in the media Look at this article and the data they describe: htt:// What is the data that Dr. Carrasco has? If we did a indeendent samle t-test to see whether those with Alzeheimer s had more fungal cells than those who did not Alzheimer s what would be the -value (give a rough estimate)?
41 Accomanying roblems associated with this Chater Quiz 14 Homework 7 (Questions 5,6 and 7)
The one-sample t test for a population mean
Objectives Constructing and assessing hyotheses The t-statistic and the P-value Statistical significance The one-samle t test for a oulation mean One-sided versus two-sided tests Further reading: OS3,
More informationMeasuring center and spread for density curves. Calculating probabilities using the standard Normal Table (CIS Chapter 8, p 105 mainly p114)
Objectives 1.3 Density curves and Normal distributions Density curves Measuring center and sread for density curves Normal distributions The 68-95-99.7 (Emirical) rule Standardizing observations Calculating
More informationMeasuring center and spread for density curves. Calculating probabilities using the standard Normal Table (CIS Chapter 8, p 105 mainly p114)
Objectives Density curves Measuring center and sread for density curves Normal distributions The 68-95-99.7 (Emirical) rule Standardizing observations Calculating robabilities using the standard Normal
More informationObjectives. 6.1, 7.1 Estimating with confidence (CIS: Chapter 10) CI)
Objectives 6.1, 7.1 Estimating with confidence (CIS: Chater 10) Statistical confidence (CIS gives a good exlanation of a 95% CI) Confidence intervals. Further reading htt://onlinestatbook.com/2/estimation/confidence.html
More informationObjectives. Estimating with confidence Confidence intervals.
Objectives Estimating with confidence Confidence intervals. Sections 6.1 and 7.1 in IPS. Page 174-180 OS3. Choosing the samle size t distributions. Further reading htt://onlinestatbook.com/2/estimation/t_distribution.html
More informationCHAPTER 5 STATISTICAL INFERENCE. 1.0 Hypothesis Testing. 2.0 Decision Errors. 3.0 How a Hypothesis is Tested. 4.0 Test for Goodness of Fit
Chater 5 Statistical Inference 69 CHAPTER 5 STATISTICAL INFERENCE.0 Hyothesis Testing.0 Decision Errors 3.0 How a Hyothesis is Tested 4.0 Test for Goodness of Fit 5.0 Inferences about Two Means It ain't
More informationHotelling s Two- Sample T 2
Chater 600 Hotelling s Two- Samle T Introduction This module calculates ower for the Hotelling s two-grou, T-squared (T) test statistic. Hotelling s T is an extension of the univariate two-samle t-test
More informationOne-way ANOVA Inference for one-way ANOVA
One-way ANOVA Inference for one-way ANOVA IPS Chater 12.1 2009 W.H. Freeman and Comany Objectives (IPS Chater 12.1) Inference for one-way ANOVA Comaring means The two-samle t statistic An overview of ANOVA
More informationTests for Two Proportions in a Stratified Design (Cochran/Mantel-Haenszel Test)
Chater 225 Tests for Two Proortions in a Stratified Design (Cochran/Mantel-Haenszel Test) Introduction In a stratified design, the subects are selected from two or more strata which are formed from imortant
More informationChapter 7 Sampling and Sampling Distributions. Introduction. Selecting a Sample. Introduction. Sampling from a Finite Population
Chater 7 and s Selecting a Samle Point Estimation Introduction to s of Proerties of Point Estimators Other Methods Introduction An element is the entity on which data are collected. A oulation is a collection
More informationEcon 3790: Business and Economics Statistics. Instructor: Yogesh Uppal
Econ 379: Business and Economics Statistics Instructor: Yogesh Ual Email: yual@ysu.edu Chater 9, Part A: Hyothesis Tests Develoing Null and Alternative Hyotheses Tye I and Tye II Errors Poulation Mean:
More informationIntroduction to Probability and Statistics
Introduction to Probability and Statistics Chater 8 Ammar M. Sarhan, asarhan@mathstat.dal.ca Deartment of Mathematics and Statistics, Dalhousie University Fall Semester 28 Chater 8 Tests of Hyotheses Based
More informationMonte Carlo Studies. Monte Carlo Studies. Sampling Distribution
Monte Carlo Studies Do not let yourself be intimidated by the material in this lecture This lecture involves more theory but is meant to imrove your understanding of: Samling distributions and tests of
More informationEcon 3790: Business and Economics Statistics. Instructor: Yogesh Uppal
Econ 379: Business and Economics Statistics Instructor: Yogesh Ual Email: yual@ysu.edu Chater 9, Part A: Hyothesis Tests Develoing Null and Alternative Hyotheses Tye I and Tye II Errors Poulation Mean:
More informationSlides Prepared by JOHN S. LOUCKS St. Edward s s University Thomson/South-Western. Slide
s Preared by JOHN S. LOUCKS St. Edward s s University 1 Chater 11 Comarisons Involving Proortions and a Test of Indeendence Inferences About the Difference Between Two Poulation Proortions Hyothesis Test
More informationObjectives. Displaying data and distributions with graphs. Variables Types of variables (CIS p40-41) Distribution of a variable
Objectives Dislaying data and distributions with grahs Variables Tyes of variables (CIS 40-41) Distribution of a variable Bar grahs for categorical variables (CIS 42) Histograms for quantitative variables
More informationMATH 2710: NOTES FOR ANALYSIS
MATH 270: NOTES FOR ANALYSIS The main ideas we will learn from analysis center around the idea of a limit. Limits occurs in several settings. We will start with finite limits of sequences, then cover infinite
More informationHypothesis Test-Confidence Interval connection
Hyothesis Test-Confidence Interval connection Hyothesis tests for mean Tell whether observed data are consistent with μ = μ. More secifically An hyothesis test with significance level α will reject the
More information2. Sample representativeness. That means some type of probability/random sampling.
1 Neuendorf Cluster Analysis Assumes: 1. Actually, any level of measurement (nominal, ordinal, interval/ratio) is accetable for certain tyes of clustering. The tyical methods, though, require metric (I/R)
More informationMorten Frydenberg Section for Biostatistics Version :Friday, 05 September 2014
Morten Frydenberg Section for Biostatistics Version :Friday, 05 Setember 204 All models are aroximations! The best model does not exist! Comlicated models needs a lot of data. lower your ambitions or get
More information4. Score normalization technical details We now discuss the technical details of the score normalization method.
SMT SCORING SYSTEM This document describes the scoring system for the Stanford Math Tournament We begin by giving an overview of the changes to scoring and a non-technical descrition of the scoring rules
More informationSplit the integral into two: [0,1] and (1, )
. A continuous random variable X has the iecewise df f( ) 0, 0, 0, where 0 is a ositive real number. - (a) For any real number such that 0, rove that the eected value of h( X ) X is E X. (0 ts) Solution:
More informationPlotting the Wilson distribution
, Survey of English Usage, University College London Setember 018 1 1. Introduction We have discussed the Wilson score interval at length elsewhere (Wallis 013a, b). Given an observed Binomial roortion
More informationDownloaded from jhs.mazums.ac.ir at 9: on Monday September 17th 2018 [ DOI: /acadpub.jhs ]
Iranian journal of health sciences 013; 1(): 56-60 htt://jhs.mazums.ac.ir Original Article Comaring Two Formulas of Samle Size Determination for Prevalence Studies Hamed Tabesh 1 *Azadeh Saki Fatemeh Pourmotahari
More informationThe Poisson Regression Model
The Poisson Regression Model The Poisson regression model aims at modeling a counting variable Y, counting the number of times that a certain event occurs during a given time eriod. We observe a samle
More information¼ ¼ 6:0. sum of all sample means in ð8þ 25
1. Samling Distribution of means. A oulation consists of the five numbers 2, 3, 6, 8, and 11. Consider all ossible samles of size 2 that can be drawn with relacement from this oulation. Find the mean of
More informationReal Analysis 1 Fall Homework 3. a n.
eal Analysis Fall 06 Homework 3. Let and consider the measure sace N, P, µ, where µ is counting measure. That is, if N, then µ equals the number of elements in if is finite; µ = otherwise. One usually
More informationNotes on Instrumental Variables Methods
Notes on Instrumental Variables Methods Michele Pellizzari IGIER-Bocconi, IZA and frdb 1 The Instrumental Variable Estimator Instrumental variable estimation is the classical solution to the roblem of
More informationBrownian Motion and Random Prime Factorization
Brownian Motion and Random Prime Factorization Kendrick Tang June 4, 202 Contents Introduction 2 2 Brownian Motion 2 2. Develoing Brownian Motion.................... 2 2.. Measure Saces and Borel Sigma-Algebras.........
More informationTopic 7: Using identity types
Toic 7: Using identity tyes June 10, 2014 Now we would like to learn how to use identity tyes and how to do some actual mathematics with them. By now we have essentially introduced all inference rules
More informationRobustness of multiple comparisons against variance heterogeneity Dijkstra, J.B.
Robustness of multile comarisons against variance heterogeneity Dijkstra, J.B. Published: 01/01/1983 Document Version Publisher s PDF, also known as Version of Record (includes final age, issue and volume
More informationUse of Transformations and the Repeated Statement in PROC GLM in SAS Ed Stanek
Use of Transformations and the Reeated Statement in PROC GLM in SAS Ed Stanek Introduction We describe how the Reeated Statement in PROC GLM in SAS transforms the data to rovide tests of hyotheses of interest.
More informationSTA 250: Statistics. Notes 7. Bayesian Approach to Statistics. Book chapters: 7.2
STA 25: Statistics Notes 7. Bayesian Aroach to Statistics Book chaters: 7.2 1 From calibrating a rocedure to quantifying uncertainty We saw that the central idea of classical testing is to rovide a rigorous
More informationEstimation of Separable Representations in Psychophysical Experiments
Estimation of Searable Reresentations in Psychohysical Exeriments Michele Bernasconi (mbernasconi@eco.uninsubria.it) Christine Choirat (cchoirat@eco.uninsubria.it) Raffaello Seri (rseri@eco.uninsubria.it)
More informationSoci Data Analysis in Sociological Research. Homework 4 Computer Handout. Chapter 19 Confidence Intervals for Proportions
University of North Carolina Chael Hill Soci252-002 Data Analysis in Sociological Research Sring 2013 Professor François Nielsen Homework 4 Comuter Handout Readings This handout covers comuter issues related
More information0.6 Factoring 73. As always, the reader is encouraged to multiply out (3
0.6 Factoring 7 5. The G.C.F. of the terms in 81 16t is just 1 so there is nothing of substance to factor out from both terms. With just a difference of two terms, we are limited to fitting this olynomial
More informationChapter 7 Rational and Irrational Numbers
Chater 7 Rational and Irrational Numbers In this chater we first review the real line model for numbers, as discussed in Chater 2 of seventh grade, by recalling how the integers and then the rational numbers
More information8 STOCHASTIC PROCESSES
8 STOCHASTIC PROCESSES The word stochastic is derived from the Greek στoχαστικoς, meaning to aim at a target. Stochastic rocesses involve state which changes in a random way. A Markov rocess is a articular
More informationOn split sample and randomized confidence intervals for binomial proportions
On slit samle and randomized confidence intervals for binomial roortions Måns Thulin Deartment of Mathematics, Usala University arxiv:1402.6536v1 [stat.me] 26 Feb 2014 Abstract Slit samle methods have
More informationECON 4130 Supplementary Exercises 1-4
HG Set. 0 ECON 430 Sulementary Exercises - 4 Exercise Quantiles (ercentiles). Let X be a continuous random variable (rv.) with df f( x ) and cdf F( x ). For 0< < we define -th quantile (or 00-th ercentile),
More informationOn the Rank of the Elliptic Curve y 2 = x(x p)(x 2)
On the Rank of the Ellitic Curve y = x(x )(x ) Jeffrey Hatley Aril 9, 009 Abstract An ellitic curve E defined over Q is an algebraic variety which forms a finitely generated abelian grou, and the structure
More informationStatistics II Logistic Regression. So far... Two-way repeated measures ANOVA: an example. RM-ANOVA example: the data after log transform
Statistics II Logistic Regression Çağrı Çöltekin Exam date & time: June 21, 10:00 13:00 (The same day/time lanned at the beginning of the semester) University of Groningen, Det of Information Science May
More informationLecture 1.2 Units, Dimensions, Estimations 1. Units To measure a quantity in physics means to compare it with a standard. Since there are many
Lecture. Units, Dimensions, Estimations. Units To measure a quantity in hysics means to comare it with a standard. Since there are many different quantities in nature, it should be many standards for those
More informationMath 4400/6400 Homework #8 solutions. 1. Let P be an odd integer (not necessarily prime). Show that modulo 2,
MATH 4400 roblems. Math 4400/6400 Homework # solutions 1. Let P be an odd integer not necessarily rime. Show that modulo, { P 1 0 if P 1, 7 mod, 1 if P 3, mod. Proof. Suose that P 1 mod. Then we can write
More informationRANDOM WALKS AND PERCOLATION: AN ANALYSIS OF CURRENT RESEARCH ON MODELING NATURAL PROCESSES
RANDOM WALKS AND PERCOLATION: AN ANALYSIS OF CURRENT RESEARCH ON MODELING NATURAL PROCESSES AARON ZWIEBACH Abstract. In this aer we will analyze research that has been recently done in the field of discrete
More information15-451/651: Design & Analysis of Algorithms October 23, 2018 Lecture #17: Prediction from Expert Advice last changed: October 25, 2018
5-45/65: Design & Analysis of Algorithms October 23, 208 Lecture #7: Prediction from Exert Advice last changed: October 25, 208 Prediction with Exert Advice Today we ll study the roblem of making redictions
More information18. Two-sample problems for population means (σ unknown)
8. Two-samle roblems for oulatio meas (σ ukow) The Practice of Statistics i the Life Scieces Third Editio 04 W.H. Freema ad Comay Objectives (PSLS Chater 8) Comarig two meas (σ ukow) Two-samle situatios
More informationUniversal Finite Memory Coding of Binary Sequences
Deartment of Electrical Engineering Systems Universal Finite Memory Coding of Binary Sequences Thesis submitted towards the degree of Master of Science in Electrical and Electronic Engineering in Tel-Aviv
More informationInference for Distributions Inference for the Mean of a Population
Inference for Distributions Inference for the Mean of a Population PBS Chapter 7.1 009 W.H Freeman and Company Objectives (PBS Chapter 7.1) Inference for the mean of a population The t distributions The
More informationBiostat Methods STAT 5500/6500 Handout #12: Methods and Issues in (Binary Response) Logistic Regression
Biostat Methods STAT 5500/6500 Handout #12: Methods and Issues in (Binary Resonse) Logistic Regression Recall general χ 2 test setu: Y 0 1 Trt 0 a b Trt 1 c d I. Basic logistic regression Previously (Handout
More informationFeedback-error control
Chater 4 Feedback-error control 4.1 Introduction This chater exlains the feedback-error (FBE) control scheme originally described by Kawato [, 87, 8]. FBE is a widely used neural network based controller
More informationNUMERICAL AND THEORETICAL INVESTIGATIONS ON DETONATION- INERT CONFINEMENT INTERACTIONS
NUMERICAL AND THEORETICAL INVESTIGATIONS ON DETONATION- INERT CONFINEMENT INTERACTIONS Tariq D. Aslam and John B. Bdzil Los Alamos National Laboratory Los Alamos, NM 87545 hone: 1-55-667-1367, fax: 1-55-667-6372
More informationHiggs Modeling using EXPER and Weak Fusion. by Woody Stanford (c) 2016 Stanford Systems.
iggs Modeling using EXPER and Weak Fusion by Woody Stanford (c) 2016 Stanford Systems. Introduction The EXPER roject, even though its original findings were inconclusive has lead to various ideas as to
More informationdn i where we have used the Gibbs equation for the Gibbs energy and the definition of chemical potential
Chem 467 Sulement to Lectures 33 Phase Equilibrium Chemical Potential Revisited We introduced the chemical otential as the conjugate variable to amount. Briefly reviewing, the total Gibbs energy of a system
More informationAn Analysis of Reliable Classifiers through ROC Isometrics
An Analysis of Reliable Classifiers through ROC Isometrics Stijn Vanderlooy s.vanderlooy@cs.unimaas.nl Ida G. Srinkhuizen-Kuyer kuyer@cs.unimaas.nl Evgueni N. Smirnov smirnov@cs.unimaas.nl MICC-IKAT, Universiteit
More informationUniform Law on the Unit Sphere of a Banach Space
Uniform Law on the Unit Shere of a Banach Sace by Bernard Beauzamy Société de Calcul Mathématique SA Faubourg Saint Honoré 75008 Paris France Setember 008 Abstract We investigate the construction of a
More informationEstimation of the large covariance matrix with two-step monotone missing data
Estimation of the large covariance matrix with two-ste monotone missing data Masashi Hyodo, Nobumichi Shutoh 2, Takashi Seo, and Tatjana Pavlenko 3 Deartment of Mathematical Information Science, Tokyo
More informationTowards understanding the Lorenz curve using the Uniform distribution. Chris J. Stephens. Newcastle City Council, Newcastle upon Tyne, UK
Towards understanding the Lorenz curve using the Uniform distribution Chris J. Stehens Newcastle City Council, Newcastle uon Tyne, UK (For the Gini-Lorenz Conference, University of Siena, Italy, May 2005)
More informationMATHEMATICAL MODELLING OF THE WIRELESS COMMUNICATION NETWORK
Comuter Modelling and ew Technologies, 5, Vol.9, o., 3-39 Transort and Telecommunication Institute, Lomonosov, LV-9, Riga, Latvia MATHEMATICAL MODELLIG OF THE WIRELESS COMMUICATIO ETWORK M. KOPEETSK Deartment
More informationarxiv:cond-mat/ v2 25 Sep 2002
Energy fluctuations at the multicritical oint in two-dimensional sin glasses arxiv:cond-mat/0207694 v2 25 Se 2002 1. Introduction Hidetoshi Nishimori, Cyril Falvo and Yukiyasu Ozeki Deartment of Physics,
More informationLecture: Condorcet s Theorem
Social Networs and Social Choice Lecture Date: August 3, 00 Lecture: Condorcet s Theorem Lecturer: Elchanan Mossel Scribes: J. Neeman, N. Truong, and S. Troxler Condorcet s theorem, the most basic jury
More informationPositivity, local smoothing and Harnack inequalities for very fast diffusion equations
Positivity, local smoothing and Harnack inequalities for very fast diffusion equations Dedicated to Luis Caffarelli for his ucoming 60 th birthday Matteo Bonforte a, b and Juan Luis Vázquez a, c Abstract
More informationI have not proofread these notes; so please watch out for typos, anything misleading or just plain wrong.
hermodynamics I have not roofread these notes; so lease watch out for tyos, anything misleading or just lain wrong. Please read ages 227 246 in Chater 8 of Kittel and Kroemer and ay attention to the first
More informationTopic: Lower Bounds on Randomized Algorithms Date: September 22, 2004 Scribe: Srinath Sridhar
15-859(M): Randomized Algorithms Lecturer: Anuam Guta Toic: Lower Bounds on Randomized Algorithms Date: Setember 22, 2004 Scribe: Srinath Sridhar 4.1 Introduction In this lecture, we will first consider
More informationSTK4900/ Lecture 7. Program
STK4900/9900 - Lecture 7 Program 1. Logistic regression with one redictor 2. Maximum likelihood estimation 3. Logistic regression with several redictors 4. Deviance and likelihood ratio tests 5. A comment
More informationSupplementary Materials for Robust Estimation of the False Discovery Rate
Sulementary Materials for Robust Estimation of the False Discovery Rate Stan Pounds and Cheng Cheng This sulemental contains roofs regarding theoretical roerties of the roosed method (Section S1), rovides
More information1 Random Experiments from Random Experiments
Random Exeriments from Random Exeriments. Bernoulli Trials The simlest tye of random exeriment is called a Bernoulli trial. A Bernoulli trial is a random exeriment that has only two ossible outcomes: success
More informationCompletely Randomized Design
CHAPTER 4 Comletely Randomized Design 4.1 Descrition of the Design Chaters 1 to 3 introduced some basic concets and statistical tools that are used in exerimental design. In this and the following chaters,
More informationMetrics Performance Evaluation: Application to Face Recognition
Metrics Performance Evaluation: Alication to Face Recognition Naser Zaeri, Abeer AlSadeq, and Abdallah Cherri Electrical Engineering Det., Kuwait University, P.O. Box 5969, Safat 6, Kuwait {zaery, abeer,
More informationCombining Logistic Regression with Kriging for Mapping the Risk of Occurrence of Unexploded Ordnance (UXO)
Combining Logistic Regression with Kriging for Maing the Risk of Occurrence of Unexloded Ordnance (UXO) H. Saito (), P. Goovaerts (), S. A. McKenna (2) Environmental and Water Resources Engineering, Deartment
More informationResearch Note REGRESSION ANALYSIS IN MARKOV CHAIN * A. Y. ALAMUTI AND M. R. MESHKANI **
Iranian Journal of Science & Technology, Transaction A, Vol 3, No A3 Printed in The Islamic Reublic of Iran, 26 Shiraz University Research Note REGRESSION ANALYSIS IN MARKOV HAIN * A Y ALAMUTI AND M R
More informationSAS for Bayesian Mediation Analysis
Paer 1569-2014 SAS for Bayesian Mediation Analysis Miočević Milica, Arizona State University; David P. MacKinnon, Arizona State University ABSTRACT Recent statistical mediation analysis research focuses
More informationOn the Toppling of a Sand Pile
Discrete Mathematics and Theoretical Comuter Science Proceedings AA (DM-CCG), 2001, 275 286 On the Toling of a Sand Pile Jean-Christohe Novelli 1 and Dominique Rossin 2 1 CNRS, LIFL, Bâtiment M3, Université
More informationTempo effect in first marriage table: Japan and China. Kiyosi Hirosima (Shimane University, Japan)
Temo effect in first marriage table: Jaan and China Kiyosi Hirosima (Shimane University, Jaan) Introduction Recently many East Asian countries and regions are undergoing the raid increase in age at marriage.
More informationRECIPROCITY LAWS JEREMY BOOHER
RECIPROCITY LAWS JEREMY BOOHER 1 Introduction The law of uadratic recirocity gives a beautiful descrition of which rimes are suares modulo Secial cases of this law going back to Fermat, and Euler and Legendre
More informationApplied Statistical Mechanics Lecture Note - 4 Quantum Mechanics Molecular Structure
Alied Statistical Mechanics Lecture Note - 4 Quantum Mechanics Molecular Structure Jeong Won Kang Deartment of Chemical Engineering Korea University Subjects Structure of Comlex Atoms - Continued Molecular
More informationComparing Variabilities in Clinical Research
Comaring Variabilities in Clinical Research Yonghee Lee University of Wisconsin Madison, Madison, Wisconsin, U.S.A. Hansheng Wang Peking University, Beijing, Peole s Reublic of China Shein-Chung Chow Millennium
More informationLower Confidence Bound for Process-Yield Index S pk with Autocorrelated Process Data
Quality Technology & Quantitative Management Vol. 1, No.,. 51-65, 15 QTQM IAQM 15 Lower onfidence Bound for Process-Yield Index with Autocorrelated Process Data Fu-Kwun Wang * and Yeneneh Tamirat Deartment
More informationHEAT, WORK, AND THE FIRST LAW OF THERMODYNAMICS
HET, ORK, ND THE FIRST L OF THERMODYNMIS 8 EXERISES Section 8. The First Law of Thermodynamics 5. INTERPRET e identify the system as the water in the insulated container. The roblem involves calculating
More informationt 0 Xt sup X t p c p inf t 0
SHARP MAXIMAL L -ESTIMATES FOR MARTINGALES RODRIGO BAÑUELOS AND ADAM OSȨKOWSKI ABSTRACT. Let X be a suermartingale starting from 0 which has only nonnegative jums. For each 0 < < we determine the best
More informationarxiv: v1 [physics.data-an] 26 Oct 2012
Constraints on Yield Parameters in Extended Maximum Likelihood Fits Till Moritz Karbach a, Maximilian Schlu b a TU Dortmund, Germany, moritz.karbach@cern.ch b TU Dortmund, Germany, maximilian.schlu@cern.ch
More informationA New Asymmetric Interaction Ridge (AIR) Regression Method
A New Asymmetric Interaction Ridge (AIR) Regression Method by Kristofer Månsson, Ghazi Shukur, and Pär Sölander The Swedish Retail Institute, HUI Research, Stockholm, Sweden. Deartment of Economics and
More informationTrading OTC and Incentives to Clear Centrally
Trading OTC and Incentives to Clear Centrally Gaetano Antinolfi Francesca Caraella Francesco Carli March 1, 2013 Abstract Central counterparties CCPs have been art of the modern financial system since
More informationAnalysis of some entrance probabilities for killed birth-death processes
Analysis of some entrance robabilities for killed birth-death rocesses Master s Thesis O.J.G. van der Velde Suervisor: Dr. F.M. Sieksma July 5, 207 Mathematical Institute, Leiden University Contents Introduction
More informationPreconditioning techniques for Newton s method for the incompressible Navier Stokes equations
Preconditioning techniques for Newton s method for the incomressible Navier Stokes equations H. C. ELMAN 1, D. LOGHIN 2 and A. J. WATHEN 3 1 Deartment of Comuter Science, University of Maryland, College
More information16. CHARACTERISTICS OF SHOCK-WAVE UNDER LORENTZ FORCE AND ENERGY EXCHANGE
16. CHARACTERISTICS OF SHOCK-WAVE UNDER LORENTZ FORCE AND ENERGY EXCHANGE H. Yamasaki, M. Abe and Y. Okuno Graduate School at Nagatsuta, Tokyo Institute of Technology 459, Nagatsuta, Midori-ku, Yokohama,
More informationBayesian Spatially Varying Coefficient Models in the Presence of Collinearity
Bayesian Satially Varying Coefficient Models in the Presence of Collinearity David C. Wheeler 1, Catherine A. Calder 1 he Ohio State University 1 Abstract he belief that relationshis between exlanatory
More informationA New Perspective on Learning Linear Separators with Large L q L p Margins
A New Persective on Learning Linear Searators with Large L q L Margins Maria-Florina Balcan Georgia Institute of Technology Christoher Berlind Georgia Institute of Technology Abstract We give theoretical
More informationThe non-stochastic multi-armed bandit problem
Submitted for journal ublication. The non-stochastic multi-armed bandit roblem Peter Auer Institute for Theoretical Comuter Science Graz University of Technology A-8010 Graz (Austria) auer@igi.tu-graz.ac.at
More informationClass 24. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700
Class 4 Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science Copyright 013 by D.B. Rowe 1 Agenda: Recap Chapter 9. and 9.3 Lecture Chapter 10.1-10.3 Review Exam 6 Problem Solving
More informationPretest (Optional) Use as an additional pacing tool to guide instruction. August 21
Trimester 1 Pretest (Otional) Use as an additional acing tool to guide instruction. August 21 Beyond the Basic Facts In Trimester 1, Grade 8 focus on multilication. Daily Unit 1: Rational vs. Irrational
More informationAn Introduction to Information Theory: Notes
An Introduction to Information Theory: Notes Jon Shlens jonshlens@ucsd.edu 03 February 003 Preliminaries. Goals. Define basic set-u of information theory. Derive why entroy is the measure of information
More informationPHYS 301 HOMEWORK #9-- SOLUTIONS
PHYS 0 HOMEWORK #9-- SOLUTIONS. We are asked to use Dirichlet' s theorem to determine the value of f (x) as defined below at x = 0, ± /, ± f(x) = 0, - < x
More informationUsing the Divergence Information Criterion for the Determination of the Order of an Autoregressive Process
Using the Divergence Information Criterion for the Determination of the Order of an Autoregressive Process P. Mantalos a1, K. Mattheou b, A. Karagrigoriou b a.deartment of Statistics University of Lund
More informationSome Measures of Agreement Between Close Partitions
Some Measures of Agreement Between Close Partitions Genane Youness and Gilbert Saorta CEDRIC CNAM, BP - 466, Beirut, Lebanon, genane99@hotmail.com Chaire de Statistique Aliquée- CEDRIC, CNAM, 9 rue Saint
More informationChapter 10. Supplemental Text Material
Chater 1. Sulemental Tet Material S1-1. The Covariance Matri of the Regression Coefficients In Section 1-3 of the tetbook, we show that the least squares estimator of β in the linear regression model y=
More informationChapter 13 Variable Selection and Model Building
Chater 3 Variable Selection and Model Building The comlete regsion analysis deends on the exlanatory variables ent in the model. It is understood in the regsion analysis that only correct and imortant
More informationSampling. Inferential statistics draws probabilistic conclusions about populations on the basis of sample statistics
Samling Inferential statistics draws robabilistic conclusions about oulations on the basis of samle statistics Probability models assume that every observation in the oulation is equally likely to be observed
More informationMODELING THE RELIABILITY OF C4ISR SYSTEMS HARDWARE/SOFTWARE COMPONENTS USING AN IMPROVED MARKOV MODEL
Technical Sciences and Alied Mathematics MODELING THE RELIABILITY OF CISR SYSTEMS HARDWARE/SOFTWARE COMPONENTS USING AN IMPROVED MARKOV MODEL Cezar VASILESCU Regional Deartment of Defense Resources Management
More informationStatics and dynamics: some elementary concepts
1 Statics and dynamics: some elementary concets Dynamics is the study of the movement through time of variables such as heartbeat, temerature, secies oulation, voltage, roduction, emloyment, rices and
More information