psychological statistics

Size: px
Start display at page:

Download "psychological statistics"

Transcription

1 psychological statistics B Sc. Counselling Psychology 011 Admission onwards III SEMESTER COMPLEMENTARY COURSE UNIVERSITY OF CALICUT SCHOOL OF DISTANCE EDUCATION CALICUT UNIVERSITY.P.O., MALAPPURAM, KERALA, INDIA

2 UNIVERSITY OF CALICUT SCHOOL OF DISTANCE EDUCATION STUDY MATERIAL B Sc. COUNSELLING PSYCHOLOGY III Semester COMPLEMENTARY COURSE PSYCHOLOGICAL STATISTICS Prepared by: Dr.Vijayakumari.K, Associate Professor, Farook Teacher Training College, Farook College.P.O. Feroke Scrutinised by: Prof.C.Jayan, Department of Psychology, University of Calicut. Layout & Settings Computer Section, SDE Reserved Psychological Statistics Page

3 CONTENTS MODULE I CORRELATION MODULE II PARAMETRIC AND NON-PARAMETRIC TESTS 14-0 MODULE III MEDIAN TEST 1-6 Psychological Statistics Page 3

4 MODULE I CORRELATION Objectives: The student will be - acquainted with knowledge of correlation, types of correlation and different methods of calculating correlation; - able to calculate coefficient of correlation using different methods. - able to draw scatter diagram and interpret it. Correlation In Psychology, there are situations where two or more variables are involved. Some of them will be related to each other and some will be independent. To know whether these variables are related or not, one has to be thorough with the concept of correlation. Correlation is a statistical technique that is used to measure and describe relationship between two variables. Two variables are said to be related, if for a change in one variable, there is a change in the other. That is, the two quantities vary in such a way that movements in one are accompanied by movements in the other. For example, the scholastic achievement and general intelligence of a child will be related to each other. The extent of relationship and the nature of relationship will be measured using a correlation coefficient. If an increase in one variable brings a corresponding increase in the other variable or a decrease in one variable brings a corresponding decrease in the other variable, the two variables are said to be positively related. For example, it is supposed that as height of individuals increase weight also increases. Hence the two variables height and weight are positively related. If increase (or decrease) in one variable brings a corresponding decrease (or increase) in the other variable the two variable are negatively related. The relation between performance of an individual in a test and test anxiety will be negative as for an increase in test anxiety, there will be a decrease in the performance or vice versa. If there is no corresponding change in one variable for the changes in the other variable, the two variables are said to be not related or there is zero correlation between the variables. The height of an individual and his intelligence are independent (or not related to each other) as there will not be any changes in intelligence accompanied with changes in height or vice versa. It is to be noted that correlation gives idea about the relation between the variables, but not the causation. That is, the change in one variable is not the cause for change in the other variable, but due to some reasons, the two variables vary together. Psychological Statistics Page 4

5 The relation between two variables can be linear or curvilinear. If the relation between two variables can be represented graphically by a straight line, the relation is linear. If the graph is not a straight line, but a curve, the relation is curvilinear. In other words, if the amount of change in one variable tends to bear constant ratio to the amount of change in the other variable, then the correlation is said to be linear. If the change in one variable does not bear a constant ratio to the amount of change in the other variable, the relation is non-linear or curvilinear. In common purposes, usually linear correlation is calculated, and it reveals how the change in one variable is accompanied by a change in the other variable. If two variables are linearly related, the degree of relationship and the nature of relation can be measured using coefficient of correlation. It is a ratio which expresses the extent to which changes in one variable are accompanied by changes in the other variable. A coefficient of correlation ranges from -1 to +1. Two important methods of computing coefficient of correlation are 1. Pearson's Product Moment method and. Spearman's Rank different method. Pearson's Product Moment Method This method is the most common method of calculating the linear relationship between the variables (usually with interval or ratio measure) Pearson's Coefficient of Correlation r = X X Y Y X X Y Y Where X & Y are the arithmetic means of the two sets of values X and Y respectively. Machine formula for Pearson's coefficient of correlation is r N XY X Y N X X N Y Y Spearman's Rank difference method Pearson's product moment correlation is used when the variables are in interval or ratio scale and if the relation between them is linear. But in practice one may face situation when the data are not in interval or ratio scale or the expected relationship does not fit a linear form. Spearman's method is one alternative to Pearson's method when the variables are in ordinal form (When X and Y values are ranks) or when the expected relationship is not linear. (When one wants to measure the consistency of a relationship between X and Y, independent of the specific form of the relationship). Psychological Statistics Page 5

6 Spearman's coefficient of correlation n 6 D 1 Where D is the difference between X rank and the Y rank for n 1 each individual and n-number of individuals. If X and Y are not given as ranks, convert X and Y into ranks. Rank can be assigned by taking either highest value as 1 or the lowest value as 1, but the same method must be followed while ranking the other variable. If two or more entries are equal, each entry is given an average rank. For example if two scores are ranked equal at 6 th place, they are each given the rank , giving the 8 th rank for the next item. If three scores are at 6 th place, the average rank 7 will be given to the three values, the next item with rank 9. 3 Interpretation of correlation co-efficient A correlation coefficient tells whether there is any relation between the variables and if a relationship exists between variables, is it positive or negative. It also indicates the degree of relationship, ie., closeness of relationship. Thus a correlation coefficient can be interpreted with respect to 1) The sign : If the coefficient of correlation computed is positive, there is a positive relationship between the variables and if it is negative, the relationship between the variables is negative. ) The magnitude: The value ranges from -1 to +1. One way of interpreting the correlation coefficient is, Illustration 0 - zero relation or no relationship ± Negligible to low relation ±0.0 - ±.40 - Low to moderate relation. ± ±.60 - Moderate to high relation. ± ±.80 - High to very high relation ± ± 1 - Very high to perfect relation. Pearson's Method Following are the marks obtained by five students in two tests. Pearson's Product moment coefficient of correlation and interpret it. Sl. No. Test 1 Test Calculate Psychological Statistics Page 6

7 This can be written as X Y X X Y Y X X Y Y X XY Y X Y 5 r = X XY Y X X Y Y = x r = Negative sign of 'r' indicates a negative correlation between the two sets of test scores. That is, for an increase in the first test score, there will be a decrease in the second test score or vice versa. The magnitude is which is in between ±0.0 and ± 0.40 and hence the relationship can be considered as moderate or substantial. Thus, the two sets of test scores are negatively related and the relationship is moderate. Using Machine formula X Y X Y XY Psychological Statistics Page 7

8 r = N XY X Y N X X N Y Y = 5x74 6x5 5x x = Illustration Spearman's Rank difference method 74x The ranks given by two judges for ten items of a Thurston's scale of attitude are given below. Find the correlation between the two ranks. Item Judge 1 Judge Item Item 8 6 Item Item 4 1 Item Item Item 7 1 Item Item Item R 1 R D=R 1 -R D Psychological Statistics Page 8

9 p = 6 n n 1 D 1 6x4 144 p = x = Here n 10 The correlation coefficient obtained is The positive sign indicates the direction of correlation, that is the two sets of ranks are positively related. The magnitude is which denote a high correlation between the two sets of data. Hence the two judges have agreement in the ranks given to the items. Illustration 3 X Y R R 1 D=R 1 -R D Here the value 5 repeats two times in X-set and the rank for the two values will 5 6 be five and six. Hence the average ie 5. 5 is given as rank for both the values. The next lower value 4 is given the rank '7'. Similarly, '3' repeats twice and the corresponding ranks are 8 and 9, the rank 8 9 given is 8. 5 for the 3's. The next lower value is given the rank 10. In the second set of values (y - values), '10' occurs three times occupying the positions 4, 5 and 6. Hence the average 5 is given as rank for 10, the next 3 lower value '9' getting the rank '7'. Similarly, '8' has got the rank 8.5 n 6 D p = 1 n 1 6x = ) 990 = = -0.0 Psychological Statistics Page 9

10 The obtained value shows that the two sets of values are negatively related but the relationship is negligible. Scatter diagram Scatter diagram or scatter plot is the graphical representation of correlation between two variables. It gives the pattern of relationship between the two variables at a glance itself. To construct a scatter diagram the following steps can be followed. Draw the axes and decide which variable goes on which axis. Two perpendicular axes are drawn and one variable is taken on the horizontal axis and the other on the vertical axis. Determine the range of values for each variable and mark them on the axes. Usually the point at which the axes meet is taken as 'zero' and the points are marked on the axes by taking an appropriate scale. The lowest and highest values of both variables must be included in the respective axes. It is conventional that the scatter diagrams are roughly square in shape and hence the scales must be taken so that the horizontal and vertical axes have almost the same length (1:1 ratio). Mark a 'dot' for each pair of scores. Plot points on the paper by taking the score of the variable on horizontal axis as the X-co-ordinate and that of the vertical axis as the Y-co-ordinate. Each pair of scores can be represented as a point by following this method. Examples The scores obtained by 10 individuals in two tests are given below. Draw a scatter diagram. Individual Test A Test B Score Test B Scatter diagram Score Test A Psychological Statistics Page 10

11 Here the points plotted tend to lie on a straight line and hence the two variables are linearly related (or the relation is linear). The points are from left bottom to right top which indicate a positive relationship between the variables. In the following diagram the points are arranged from the left top to right bottom indicating a negative relationship between the variables. Individuals. X Y A 4 B 3 3 C 3 D 1 4 E Y X If the variables are not related or if there is zero relation between the variables, there will not be any pattern in the distribution of points, the points will be scattered in the plane. Indls. X Y A 1 B 4 C 3 1 D 4 3 E 5 Psychological Statistics Page 11

12 Y X But scatter diagram do not give any clear picture about the strength of the relation as the correlation coefficient give. Any how, whether the variables are highly related or the relationship is low can be identified by looking into how do the points fall to a straight line. That is, if the points more tend to a straight line, the variables are highly related. If they fall far away from the straight line, the relationship is zero or very low. The correlation is moderate, if the pattern of dots is somewhere between a low and a high correlation. If the points lie exactly on a straight line (as shown in the figure, the correlation is perfect). Indls. X Y A 1 1 B C 3 3 D 4 4 E 5 5 Y X Psychological Statistics Page 1

13 Scatter diagram showing positive perfect linear relationship Indls. X Y A 1 5 B 4 C 3 3 D 4 E Y X Scatter diagram showing negative perfect linear relationship Scatter diagrams are important because they clearly exhibit the nature of relationship, whether linear or curvilinear. If the variables are found to be linearly related, then only, coefficients of correlation can be calculated. When the relation is curvilinear, calculating coefficient of correlation and interpreting it will be misleading or will be an error. Psychological Statistics Page 13

14 Objectives MODULE PARAMETRIC AND NON-PARAMETRIC TESTS The student will be able to - discriminate parametric and non-parametric tests. - use chi-square test and c-coefficient of contingency in appropriate situations. Introduction In order to understand this module, a more knowledge about statistics is needed. First of all you should be familiar with the descriptive and inferential statistics. Descriptive statistics describes the data as does arithmetic mean or median or standard deviation or correlation. But statistics is mainly used to predict or infer the unknown. This function of statistics is done by inferential statistics. Two terms related to inferential statistics are parameter and statistics. Usually we are conducting research on a sample, not on the entire population. But the purpose of the study will be to know about the population and not the sample. The value calculated from the sample is called statistic and the corresponding value for the population is parameter. For example, if arithmetic mean of a sample is, it is a statistic. The corresponding arithmetic mean of the population can be taken as µ, the parameter. Statistics tries to predict the unknown parameter from the known statistics. In the process of predicting or inferring population values, hypotheses are formulated and tested. A hypothesis is a tentative solution to the problem which is tested through the study. Hypotheses are stated as affirmative sentences. Inferential statistics tests these hypotheses using the data available through the study of sample, a representative group of the population. Hypotheses can be directional or non directional. The statement, Boys are better than girls in mechanical aptitude is a directional hypothesis as there is a clear indication of direction of change. But the statement, Boys and girls differ in their mechanical aptitude is a non directional hypothesis as there is no indication of direction of change. Statistical procedures are there to test hypothesis in which no change or no effect is stated. Such hypothesis is known as null hypothesis denoted as H0. H0 is tested against the alternative hypothesis, H1. An alternative hypothesis is the hypothesis that is accepted when the null hypothesis is rejected. Example: H0: There is no significant mean difference in mechanical aptitude between boys and girls.(µ1= µ). H1: There is significant difference in mechanical aptitude between boys and girls. ( µ1 µ). While testing a hypothesis, there is a possibility of occurrence of two types of errors. One error is rejecting H0 when it is true and the other is accepting H0 when it is not true. The first error is known as Type I error and the second is Type II error. That is any decision regarding hypothesis testing may include these two types of errors. Probability of Type I Error ie., rejecting H0 when it is true is known as Level of Significance. In hypothesis testing level of significance is determined by the researcher in advance itself. In behavioural sciences, it is taken commonly as.05 or.01. As the probability of type I error decreases, probability of type II error increases and vice versa. Psychological Statistics Page 14

15 Decisions regarding hypothesis testing are made based on the concept of a Normal distribution. Normal distribution is a theoretical model of distribution of quantitative data. A normal probability curve is a bell shaped curve which is symmetric with respect to the ordinate at mean. It is non-skewed, lepto kurtic curve for which mean, median and mode coincide. It has wide application in statistics. The two tails of the curve determines the chance of accepting or rejecting a null hypothesis. The area for which the hypothesis is rejected is known as rejection area or critical region. (More about normal curve is included in fourth semester). Parametric and Non-parametric Tests The researches in behavioural science need different types of statistical tests to test hypotheses about specific population parameters (Values corresponding to population are called parameters and those of sample are statistics). A number of statistical techniques like 't' tests and ANOVA make assumptions about the shape of the population distribution and about other population parameters. As these tests concern parameters and require assumptions about parameters, they are called 'Parametric tests'. Parametric tests require a numerical score for each individual in the sample. The scores then are added, squared, averaged and otherwise manipulated using basic arithmetic. That is, parametric tests require data from an interval or ratio scale. If some assumptions are not met, it may not be appropriate to use a parametric test. Violation of assumptions of a test may lead to erroneous interpretation of the data. In such situations, there are several hypothesis testing techniques that provide alternatives to parametric tests. These tests are known as 'non-parametric' tests. Non-parametric tests usually do not state hypotheses in terms of a specific parameter and there may few (if any) assumptions about population distribution. (Because of this reason, non-parametric tests are also known as distribution free tests) Chi-square test, Sign test etc are non-parametric tests. For non-parametric tests, the subjects (sample elements) are usually classified into categories. (These measures are on nominal or ordinal scales and they do not produce numerical values that can be used to calculate means and variances). The data for many non-parametric tests are frequencies. Non-parametric tests are not as sensitive as parametric tests. (i.e., nonparametric tests are more likely to fail in detecting a real difference between two treatments). Hence whenever there is a chance to use parametric test, it must be preferred than a non-parametric test. Chi-Square Test Chi-square ( ) test is a non-parametric test with wide applications in the field of social sciences. (The name of the test is related with the Greek letter '' 'Kye' used to identify the test statistic). Chi-square test is used when the data is in the form of frequencies (i e., the measurement is on nominal scale). Mainly the test is used for two purposes- testing goodness of fit and to test the independence between two nominal variables. Psychological Statistics Page 15

16 The Chi-square test for Goodness of fit Here Chi-square test is used for testing hypothesis related to sample proportions with respect to the corresponding population proportions. The Chi-square test for goodness of fit determines how well the obtained sample proportions fit the population proportions specified by the null hypothesis. Formula for calculating is ( O E E ) where O is the observed frequency and E is the expected frequency (as per the null hypothesis) Usually in the test of Goodness of fit, the null hypothesis will fall into one of the following categories. 1. No preference: The null hypothesis states that there is no preference among the different categories. ie., Ho states that the population is divided equally among the categories.. No difference from other population The null hypothesis can state that the frequency distribution for one population is not different from the distribution that is known to exist for another population. The null hypothesis for the goodness of fit test specifies an exact distribution for the population. So the alternative hypothesis (H1) state that the population distribution has a different shape from that specified in Ho. For example, if Ho states that the population is equally divided among three categories, the alternative hypothesis will say that the population is not divided equally. measures the discrepancy between the observed frequencies (the data) and the expected frequencies (Ho). When the calculated value of is large, observed and expected frequencies differ highly and hence Ho is rejected. If value is small, the difference between the observed and expected frequencies are less, then Ho is accepted. To determine whether the value is large or small, the theoretical distribution of is used. The table of gives values for different levels of significance and degrees of freedom(df). In goodness of fit test, the degree of freedom of is C-1 where C is the number of categories. If there are three groups or categories in the specified population, the degrees of freedom will be 3-1=. If there are five categories, the df will be 5-1=4. In a table, the first column lists df, the top row lists proportions of area in the extreme right hand tail of the distribution (or level of significance). The numbers in the body of the table are the critical values of Chi-square. To test the hypothesis the critical value of for given df and level of significance is located from the table. Then the calculated value is compared with the tabled value. If the calculated value is greater than the tabled value, Ho is rejected and if it is less than the tabled value, Ho is accepted. Psychological Statistics Page 16

17 Illustration 1 Forty eight subjects were asked to express their response to an item in an attitude scale in three categories Agree, Undecided and Disagree. Of the members in the group, 4 marked 'agree', 1 'undecided' and 1 'disagree'. Do these results indicate a significant difference among groups? Here the null hypothesis Ho can be stated as Ho: There is no significant difference among groups H1: The group differ significantly. Category O E Agree 4 16 Undecided 1 16 Disagree 1 16 Since the assumption is that the groups do not differ in their size, the total 48 can be equally divided into three groups. So the expected frequency of each group is Next step is finding by summing ( O E E ) (4 16) = 16 (1 16) = = 6 16 (1 16) 16 ( O E) E With degree of freedom c-1=3-1=. Fix the level of significance as From table of, the value of ( df) 0.05 level is Since the calculated value is greater than the tabled value of, Ho is rejected. That is one cannot expect equal preference among the subjects at 0.05 level of significance or the subjects showed a significant difference in the response to the given item. Illustration- In a study to know if high-performance over powered cars are more likely to be involved in accidents than other types of cars, the investigator classified 50 cars as high performance, sub compact, mid-size, or full size. The observed frequencies are given below. Psychological Statistics Page 17

18 High performance Subcompact Mid size Full size It is assumed that only 10% of the cars in the population are the highperformance variety. The subcompact, mid size and full size cars are 40%, 30% and 0% respectively. Can the researcher conclude that the observed pattern of accidents does not fit the predicted values (Test with =0.05). Ho: In the population, no particular type of car shows a disproportionate number of accidents. (or the observed frequencies are in agreement with the expected frequencies) H1: In the population, a disproportionate number of accidents occur with certain types of cars (There is disagreement with O and E). O E (O-E) (O-E) /E Here expected frequencies are calculated according to the assumption. So the 10 expected frequency of the first group is 50 x = 5x1= The E of second group is 0 (50 x ) The E of third group is 50 x = and that of fourth group is 50 x = ( O E) = E degrees of freedom = C-1=4-1=3 The tabled value of (3 df) at 0.05 level is Since the calculated value is greater than the tabled value of for 3df at 0.05 level, Ho is rejected. That is a disproportionate number of accidents occur with certain types of cars. Psychological Statistics Page 18

19 Chi-square test of independence Chi-square test can be used to test whether two nominal variables (values expressed as frequencies) are independent or not (associated or not). The formula for calculating is same as that of Goodness of fit. ( O E E ) Degrees of freedom in the case of Test of Independence is (C-1) (R-1) where C is the number of columns and R number of rows. The steps involved in the test of independence are Calculate the expected frequencies. Expected frequency for each cell can be calculated by RTxCT E N E- Expected frequency RT- CT- The row total for the row containing the cell The column total for the column containing the cell N- The total number of observations. Take the difference between observed and expected frequencies and obtain the squares of these differences, i.e., obtain the values of (O-E). Divide the values of (O -E) obtained in step by the respective expected ( O E) frequency and obtain the total E The calculated value of is compared with the table value. Example: Two hundred students were classified by personality and colour preference as given below. Test whether colour preference and personality are dependent for the student population. Red Yellow Green Blue Introvert Extrovert N=00 Here the null hypothesis is Ho: Colour preference and personality of students are independent. The expected frequencies of the cells are calculated by multiplying the corresponding row total and column total and then dividing it by total number of participants. Psychological Statistics Page 19

20 The expected (theoretical) frequencies will be Red Yellow Green Blue Introvert 50x =5 50x0 = x40 = x40 =10 00 Extrovert 150x =75 150x0 00 =15 150x40 00 =30 150x40 00 =30 ( O E E ) substituting the values (10 5) (3 5) (18 15) + 15 = 37.4 Degrees of freedom (df) is (15 10) + 10 (R-1) (C-1) = (-1) (4-1) = 1x3=3 ( 10) + 10 (90 75) + 75 The table value for 3 df at 0.05 level is and at 0.01 level is (17 15) + 15 (5 15) + 15 Since the calculated value of is greater than the tabled value for significance at 0.01 level, the null hypothesis is rejected. That is, the hypothesis "Colour preference and personality of students are independent' is rejected. Hence the two variables colour preference and personality are associated or dependent. Note: If two variables are found to be associated using test of independence, the strength of association can be calculated using c-coefficient of contingency. C N Basic Assumption Chi-square test is a non-parametric test and hence is distribution free. That is, as in parametric tests, variables need not be normally distributed. But, the variables are to be measured in a nominal scale. That is, the data is in the form of frequencies. Psychological Statistics Page 0

21 Objectives MODULE 3 MEDIAN TEST Introduction The student will know about various non-parametric tests like Man-Whitney U test, Sign test, Wilcoxon Matched pairs signed ranks test. Non-parametric tests can be used in situations where - the sample size is quite small (i.e., when N=5 or 6) - assumptions like normality of the distribution of values in the population is not ensured. - When the measurement of data is either in ordinal (ranks) or nominal (frequencies). As non-parametric tests are less powerful than the parametric tests, when ever possible, parametric tests are to be used. But one cannot neglect the nonparametric tests because researchers in psychology need such tests for statistical inferences in many situations. Some major non-parametric tests are discussed below. 1. Mann-Whitney U Test To test the significance of difference between two independent means, usually t- test (parametric) is used. But when the normality of the variables are questioned, t-test is not applicable. Then Mann-Whitney U test can be used, but it tries to find out the difference between population distributions, and not the population means. It is a powerful test to test the difference between two independent samples having uncorrelated data. The basic assumption for Mann-Whitney U test is given below: "If two sets of data differ significantly, generally, the scores in one sample will be larger than the scores in the other. If the two samples are combined and all the scores are placed in rank order on a line, then the scores from one sample will concentrate at one end of the line, the scores of the other sample will concentrate at the other end. But if there is no significant difference between the groups, the large and small values will be mixed evenly in the two samples, values of one sample not concentrating at one side of the line. Step 1 The null hypothesis Ho can be written as there is no tendency for the ranks in one group to be systematically higher or lower than the ranks in the other group. Fix the level of significance as 0.05 or Step Two samples are there sample A and sample B. Let na be the number of subjects in sample A and nb be the number of subjects (sample size) of sample B. Then combine the scores of the two groups and the na+ nb scores are ranked. Psychological Statistics Page 1

22 Step 3 The ranks in each group are summed up to get RAand RB (RA+ RB = N (N+1)/ where N= na+ nb) Step 4 Step 5 Then the Mann-Whitney U for the two sets are calculated using the formula n (n + 1) U = n n + R n (n + 1) U = n n + R Take the Mann-Whitney U as the smaller of UA and UB. Refer the table of critical values of Mann-Whitney U for the tabled value of U corresponding to the obtained UA and UB and the level of significance fixed. If the tabled value is less than the obtained U value, Ho can be accepted and if the obtained U value is less than the critical value Ho is rejected. Example: The time in seconds recorded for 13 children (5 boys and 8 girls) in a blockmanipulation task are given below. Test whether there is consistent difference between boys and girls in the time required for completing the task. Step 1 Boys Girls The null hypothesis Ho: There is no consistent difference between boys and girls in the time needed for completion of the task. Alternate hypothesis: H1: There is a systematic difference. Take level of significance as =0.05. Step na = 5 nb = 8 na + nb = 13 Boys Ranks Girls RA = RB = 69 Psychological Statistics Page

23 U = n n + n (n + 1) R U = = = 40-7 = 33 U = n n + 5(5 + 1) n (n + 1) = ( ) 69 = = 7 R The least value of UA and UB is 7 hence U=7 The critical value is 6 (for na =5 & nb =8) at 0.05 level of significance. Since the obtained U value is greater than the tabled value, accept Ho. At the 0.05 level of significance, the data do not provide sufficient evidence to conclude that there is significant difference between boys and girls in time needed for completing the task.. The Sign Test Sign test is the most simple non-parametric test. It is used for comparing two correlated samples, i.e., two parallel set of scores which are paired off in some way (dependent samples). This test uses plus and minus signs instead of quantitative values and hence the name 'sign test'. Sign test is useful when 1. Normality and homogeneity of variance of the variables are not sure, but the variable considered has a continuous distribution.. All the subjects are not from the same population. 3. Two correlated samples are to be compared, with the null hypothesis the median difference between the pairs is zero. 4. There are two sets of measurements which can be matched (paired) with respect to relevant extraneous variables. 5. When the data is not in interval or ratio scale, especially when the direction of change is given, not the quantitative measure. If the number of individuals in the single sample is less than or equal to 5, the table of probabilities associated with values as small as observed values of x (number of fewer signs) in the Binomial test. If N is greater than 5, the normal approximation to the binomial distribution or may be used; Here Psychological Statistics Page 3

24 = ( ) 1/. With Yate's correction this formula can be rewritten as = ( ±. ) / where x+.5 is taken when <,.5 is taken when > ' ' is the total of '+' or '-' signs. Example: A researcher tests the effect of a diet plan on body weight. A sample of 50 people is selected. The observation made at the start of the diet and one month later revealed that 37 people lost weight, 1 gained and one had no change in weight. Test whether the treatment leads to gain in weight. Number of positive signs (initially high value, second one smaller, i.e., loss in weight) = 37 Number of negative signs (gain in weight) =1 Here N = Number of positive signs + Number of negative signs =49 (not the sample size 50). Ho: The diet has no effect; H1: The diet has some effect. ( ± 0.5) / = x can be taken as 37 or 1 leading to the same numerical value of z. The only change will be in the sign. = (. ) / =.. =. = 3.43 Since this value is greater than 1.96, the critical value for significance at 0.05 level, Ho is rejected. That is the assumption that 'the diet has no effect on weight' is rejected. (If N is small, instead of calculating z, the value of N and x is taken for finding the value of P from a table of probabilities associated with values in the Binomial test. If it is greater than 0.05, the null hypothesis is accepted, if it is less than 0.05, the null hypothesis is rejected). 3. Wilcoxon Matched-pairs Signed Ranks Test Wilcoxon Matched- Pairs Signed Ranks Test is more powerful than the sign test because it tests not only direction but also magnitude of differences within pairs of matched group. This test is used to test the difference between two related (dependent) samples/matched pairs of individuals and is not applicable to independent groups. Method Step 1 Let d1 be the difference of scores for any matched pair, representing the difference between pair's scores under two treatments A and B. Psychological Statistics Page 4

25 Step Step 3 Delete all such pairs for which d1 = 0. Rank all the s with out regard to sign, giving rank 1 to the smallest d1, rank to the next smallest and so on. Step 4 Indicate which ranks arose from negative d 1s and which ranks arouse from positive d ' 1 s by giving sign of difference to each rank. Step 5 Sum the ranks for the positive differences and sum the ranks for the negative differences. Under the null hypothesis, the two sums are expected to be equal. Z is calculated using the formula ( ) = where ( )( ) N- Number of pairs related. T- Sum of the ranks of the smaller, of the like signed ranks Illusteration The local Red Cross has conducted an intensive campaign to increase blood donations. This campaign has been concentrated in 10 local businesses. In each company, the goal was to increase the percentage of employees who participate in the blood donation program. Using Wilcoxon test to decide whether these data provide evidence that the campaign had a significant impact on blood donations. The data from each company has listed in a rank order to the absolute value of the difference scores. Company Percentage of participation Rank Before After Difference A B C D E F G H I J Psychological Statistics Page 5

26 Step 1: The null hypotheses state that the campaign has no effect. Therefore, any differences are due to chance, and there should be no consistent pattern. Step : the two companies with difference scores of zero are discarded, and n is reduced to 8 and α=.05, the critical value for the Wilcoxon test is T=3. A sample value that is less than or equal to 3 will lead us to reject Ho. Step 3: For these data, the positive differences have ranks of 3, 4, 5.5, 5.5, 7, and 8; R+ = 33 The negative differences have ranks of 1 and ; = 3 The Wilcoxon T is the smaller of these sums, so T = 3. Step 4: T vlue from the data is in the critical region. This value is very unlikely to occur by chance (p<.05); therefore, we reject Ho and conclude that there is a significant change in participation after Red Cross campaign. (source: Gravetter, F. J., & Wallnau, L. B. (000). Statistics for the Behavioral Sciences (5 th ed) Psychological Statistics Page 6

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007)

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007) FROM: PAGANO, R. R. (007) I. INTRODUCTION: DISTINCTION BETWEEN PARAMETRIC AND NON-PARAMETRIC TESTS Statistical inference tests are often classified as to whether they are parametric or nonparametric Parameter

More information

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages: Glossary The ISI glossary of statistical terms provides definitions in a number of different languages: http://isi.cbs.nl/glossary/index.htm Adjusted r 2 Adjusted R squared measures the proportion of the

More information

Chapter 15: Nonparametric Statistics Section 15.1: An Overview of Nonparametric Statistics

Chapter 15: Nonparametric Statistics Section 15.1: An Overview of Nonparametric Statistics Section 15.1: An Overview of Nonparametric Statistics Understand Difference between Parametric and Nonparametric Statistical Procedures Parametric statistical procedures inferential procedures that rely

More information

Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami

Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami Parametric Assumptions The observations must be independent. Dependent variable should be continuous

More information

Psych 230. Psychological Measurement and Statistics

Psych 230. Psychological Measurement and Statistics Psych 230 Psychological Measurement and Statistics Pedro Wolf December 9, 2009 This Time. Non-Parametric statistics Chi-Square test One-way Two-way Statistical Testing 1. Decide which test to use 2. State

More information

Statistics: revision

Statistics: revision NST 1B Experimental Psychology Statistics practical 5 Statistics: revision Rudolf Cardinal & Mike Aitken 29 / 30 April 2004 Department of Experimental Psychology University of Cambridge Handouts: Answers

More information

Lecture Slides. Elementary Statistics. by Mario F. Triola. and the Triola Statistics Series

Lecture Slides. Elementary Statistics. by Mario F. Triola. and the Triola Statistics Series Lecture Slides Elementary Statistics Tenth Edition and the Triola Statistics Series by Mario F. Triola Slide 1 Chapter 13 Nonparametric Statistics 13-1 Overview 13-2 Sign Test 13-3 Wilcoxon Signed-Ranks

More information

Lecture Slides. Section 13-1 Overview. Elementary Statistics Tenth Edition. Chapter 13 Nonparametric Statistics. by Mario F.

Lecture Slides. Section 13-1 Overview. Elementary Statistics Tenth Edition. Chapter 13 Nonparametric Statistics. by Mario F. Lecture Slides Elementary Statistics Tenth Edition and the Triola Statistics Series by Mario F. Triola Slide 1 Chapter 13 Nonparametric Statistics 13-1 Overview 13-2 Sign Test 13-3 Wilcoxon Signed-Ranks

More information

Upon completion of this chapter, you should be able to:

Upon completion of this chapter, you should be able to: 1 Chaptter 7:: CORRELATIION Upon completion of this chapter, you should be able to: Explain the concept of relationship between variables Discuss the use of the statistical tests to determine correlation

More information

QUANTITATIVE TECHNIQUES

QUANTITATIVE TECHNIQUES UNIVERSITY OF CALICUT SCHOOL OF DISTANCE EDUCATION (For B Com. IV Semester & BBA III Semester) COMPLEMENTARY COURSE QUANTITATIVE TECHNIQUES QUESTION BANK 1. The techniques which provide the decision maker

More information

Nonparametric Statistics

Nonparametric Statistics Nonparametric Statistics Nonparametric or Distribution-free statistics: used when data are ordinal (i.e., rankings) used when ratio/interval data are not normally distributed (data are converted to ranks)

More information

Background to Statistics

Background to Statistics FACT SHEET Background to Statistics Introduction Statistics include a broad range of methods for manipulating, presenting and interpreting data. Professional scientists of all kinds need to be proficient

More information

Inferential statistics

Inferential statistics Inferential statistics Inference involves making a Generalization about a larger group of individuals on the basis of a subset or sample. Ahmed-Refat-ZU Null and alternative hypotheses In hypotheses testing,

More information

Introduction and Descriptive Statistics p. 1 Introduction to Statistics p. 3 Statistics, Science, and Observations p. 5 Populations and Samples p.

Introduction and Descriptive Statistics p. 1 Introduction to Statistics p. 3 Statistics, Science, and Observations p. 5 Populations and Samples p. Preface p. xi Introduction and Descriptive Statistics p. 1 Introduction to Statistics p. 3 Statistics, Science, and Observations p. 5 Populations and Samples p. 6 The Scientific Method and the Design of

More information

HYPOTHESIS TESTING II TESTS ON MEANS. Sorana D. Bolboacă

HYPOTHESIS TESTING II TESTS ON MEANS. Sorana D. Bolboacă HYPOTHESIS TESTING II TESTS ON MEANS Sorana D. Bolboacă OBJECTIVES Significance value vs p value Parametric vs non parametric tests Tests on means: 1 Dec 14 2 SIGNIFICANCE LEVEL VS. p VALUE Materials and

More information

Everything is not normal

Everything is not normal Everything is not normal According to the dictionary, one thing is considered normal when it s in its natural state or conforms to standards set in advance. And this is its normal meaning. But, like many

More information

Nonparametric Statistics. Leah Wright, Tyler Ross, Taylor Brown

Nonparametric Statistics. Leah Wright, Tyler Ross, Taylor Brown Nonparametric Statistics Leah Wright, Tyler Ross, Taylor Brown Before we get to nonparametric statistics, what are parametric statistics? These statistics estimate and test population means, while holding

More information

Non-parametric methods

Non-parametric methods Eastern Mediterranean University Faculty of Medicine Biostatistics course Non-parametric methods March 4&7, 2016 Instructor: Dr. Nimet İlke Akçay (ilke.cetin@emu.edu.tr) Learning Objectives 1. Distinguish

More information

Statistics Introductory Correlation

Statistics Introductory Correlation Statistics Introductory Correlation Session 10 oscardavid.barrerarodriguez@sciencespo.fr April 9, 2018 Outline 1 Statistics are not used only to describe central tendency and variability for a single variable.

More information

Non-parametric tests, part A:

Non-parametric tests, part A: Two types of statistical test: Non-parametric tests, part A: Parametric tests: Based on assumption that the data have certain characteristics or "parameters": Results are only valid if (a) the data are

More information

NON-PARAMETRIC STATISTICS * (http://www.statsoft.com)

NON-PARAMETRIC STATISTICS * (http://www.statsoft.com) NON-PARAMETRIC STATISTICS * (http://www.statsoft.com) 1. GENERAL PURPOSE 1.1 Brief review of the idea of significance testing To understand the idea of non-parametric statistics (the term non-parametric

More information

STAT Section 3.4: The Sign Test. The sign test, as we will typically use it, is a method for analyzing paired data.

STAT Section 3.4: The Sign Test. The sign test, as we will typically use it, is a method for analyzing paired data. STAT 518 --- Section 3.4: The Sign Test The sign test, as we will typically use it, is a method for analyzing paired data. Examples of Paired Data: Similar subjects are paired off and one of two treatments

More information

4/6/16. Non-parametric Test. Overview. Stephen Opiyo. Distinguish Parametric and Nonparametric Test Procedures

4/6/16. Non-parametric Test. Overview. Stephen Opiyo. Distinguish Parametric and Nonparametric Test Procedures Non-parametric Test Stephen Opiyo Overview Distinguish Parametric and Nonparametric Test Procedures Explain commonly used Nonparametric Test Procedures Perform Hypothesis Tests Using Nonparametric Procedures

More information

Two-Sample Inferential Statistics

Two-Sample Inferential Statistics The t Test for Two Independent Samples 1 Two-Sample Inferential Statistics In an experiment there are two or more conditions One condition is often called the control condition in which the treatment is

More information

Statistics Handbook. All statistical tables were computed by the author.

Statistics Handbook. All statistical tables were computed by the author. Statistics Handbook Contents Page Wilcoxon rank-sum test (Mann-Whitney equivalent) Wilcoxon matched-pairs test 3 Normal Distribution 4 Z-test Related samples t-test 5 Unrelated samples t-test 6 Variance

More information

Analysis of Variance and Co-variance. By Manza Ramesh

Analysis of Variance and Co-variance. By Manza Ramesh Analysis of Variance and Co-variance By Manza Ramesh Contents Analysis of Variance (ANOVA) What is ANOVA? The Basic Principle of ANOVA ANOVA Technique Setting up Analysis of Variance Table Short-cut Method

More information

DETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics

DETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics DETAILED CONTENTS About the Author Preface to the Instructor To the Student How to Use SPSS With This Book PART I INTRODUCTION AND DESCRIPTIVE STATISTICS 1. Introduction to Statistics 1.1 Descriptive and

More information

3. Nonparametric methods

3. Nonparametric methods 3. Nonparametric methods If the probability distributions of the statistical variables are unknown or are not as required (e.g. normality assumption violated), then we may still apply nonparametric tests

More information

Exam details. Final Review Session. Things to Review

Exam details. Final Review Session. Things to Review Exam details Final Review Session Short answer, similar to book problems Formulae and tables will be given You CAN use a calculator Date and Time: Dec. 7, 006, 1-1:30 pm Location: Osborne Centre, Unit

More information

t-test for b Copyright 2000 Tom Malloy. All rights reserved. Regression

t-test for b Copyright 2000 Tom Malloy. All rights reserved. Regression t-test for b Copyright 2000 Tom Malloy. All rights reserved. Regression Recall, back some time ago, we used a descriptive statistic which allowed us to draw the best fit line through a scatter plot. We

More information

Black White Total Observed Expected χ 2 = (f observed f expected ) 2 f expected (83 126) 2 ( )2 126

Black White Total Observed Expected χ 2 = (f observed f expected ) 2 f expected (83 126) 2 ( )2 126 Psychology 60 Fall 2013 Practice Final Actual Exam: This Wednesday. Good luck! Name: To view the solutions, check the link at the end of the document. This practice final should supplement your studying;

More information

Correlation and regression

Correlation and regression NST 1B Experimental Psychology Statistics practical 1 Correlation and regression Rudolf Cardinal & Mike Aitken 11 / 12 November 2003 Department of Experimental Psychology University of Cambridge Handouts:

More information

Glossary for the Triola Statistics Series

Glossary for the Triola Statistics Series Glossary for the Triola Statistics Series Absolute deviation The measure of variation equal to the sum of the deviations of each value from the mean, divided by the number of values Acceptance sampling

More information

An Analysis of College Algebra Exam Scores December 14, James D Jones Math Section 01

An Analysis of College Algebra Exam Scores December 14, James D Jones Math Section 01 An Analysis of College Algebra Exam s December, 000 James D Jones Math - Section 0 An Analysis of College Algebra Exam s Introduction Students often complain about a test being too difficult. Are there

More information

Review of Statistics 101

Review of Statistics 101 Review of Statistics 101 We review some important themes from the course 1. Introduction Statistics- Set of methods for collecting/analyzing data (the art and science of learning from data). Provides methods

More information

Unit 27 One-Way Analysis of Variance

Unit 27 One-Way Analysis of Variance Unit 27 One-Way Analysis of Variance Objectives: To perform the hypothesis test in a one-way analysis of variance for comparing more than two population means Recall that a two sample t test is applied

More information

UNIT 4 RANK CORRELATION (Rho AND KENDALL RANK CORRELATION

UNIT 4 RANK CORRELATION (Rho AND KENDALL RANK CORRELATION UNIT 4 RANK CORRELATION (Rho AND KENDALL RANK CORRELATION Structure 4.0 Introduction 4.1 Objectives 4. Rank-Order s 4..1 Rank-order data 4.. Assumptions Underlying Pearson s r are Not Satisfied 4.3 Spearman

More information

Contents Kruskal-Wallis Test Friedman s Two-way Analysis of Variance by Ranks... 47

Contents Kruskal-Wallis Test Friedman s Two-way Analysis of Variance by Ranks... 47 Contents 1 Non-parametric Tests 3 1.1 Introduction....................................... 3 1.2 Advantages of Non-parametric Tests......................... 4 1.3 Disadvantages of Non-parametric Tests........................

More information

Measuring Associations : Pearson s correlation

Measuring Associations : Pearson s correlation Measuring Associations : Pearson s correlation Scatter Diagram A scatter diagram is a graph that shows that the relationship between two variables measured on the same individual. Each individual in the

More information

PSY 307 Statistics for the Behavioral Sciences. Chapter 20 Tests for Ranked Data, Choosing Statistical Tests

PSY 307 Statistics for the Behavioral Sciences. Chapter 20 Tests for Ranked Data, Choosing Statistical Tests PSY 307 Statistics for the Behavioral Sciences Chapter 20 Tests for Ranked Data, Choosing Statistical Tests What To Do with Non-normal Distributions Tranformations (pg 382): The shape of the distribution

More information

ANOVA - analysis of variance - used to compare the means of several populations.

ANOVA - analysis of variance - used to compare the means of several populations. 12.1 One-Way Analysis of Variance ANOVA - analysis of variance - used to compare the means of several populations. Assumptions for One-Way ANOVA: 1. Independent samples are taken using a randomized design.

More information

Transition Passage to Descriptive Statistics 28

Transition Passage to Descriptive Statistics 28 viii Preface xiv chapter 1 Introduction 1 Disciplines That Use Quantitative Data 5 What Do You Mean, Statistics? 6 Statistics: A Dynamic Discipline 8 Some Terminology 9 Problems and Answers 12 Scales of

More information

Tribhuvan University Institute of Science and Technology 2065

Tribhuvan University Institute of Science and Technology 2065 1CSc. Stat. 108-2065 Tribhuvan University Institute of Science and Technology 2065 Bachelor Level/First Year/ First Semester/ Science Full Marks: 60 Computer Science and Information Technology (Stat. 108)

More information

Can you tell the relationship between students SAT scores and their college grades?

Can you tell the relationship between students SAT scores and their college grades? Correlation One Challenge Can you tell the relationship between students SAT scores and their college grades? A: The higher SAT scores are, the better GPA may be. B: The higher SAT scores are, the lower

More information

Relationships between variables. Visualizing Bivariate Distributions: Scatter Plots

Relationships between variables. Visualizing Bivariate Distributions: Scatter Plots SFBS Course Notes Part 7: Correlation Bivariate relationships (p. 1) Linear transformations (p. 3) Pearson r : Measuring a relationship (p. 5) Interpretation of correlations (p. 10) Relationships between

More information

REVIEW 8/2/2017 陈芳华东师大英语系

REVIEW 8/2/2017 陈芳华东师大英语系 REVIEW Hypothesis testing starts with a null hypothesis and a null distribution. We compare what we have to the null distribution, if the result is too extreme to belong to the null distribution (p

More information

Data are sometimes not compatible with the assumptions of parametric statistical tests (i.e. t-test, regression, ANOVA)

Data are sometimes not compatible with the assumptions of parametric statistical tests (i.e. t-test, regression, ANOVA) BSTT523 Pagano & Gauvreau Chapter 13 1 Nonparametric Statistics Data are sometimes not compatible with the assumptions of parametric statistical tests (i.e. t-test, regression, ANOVA) In particular, data

More information

CORELATION - Pearson-r - Spearman-rho

CORELATION - Pearson-r - Spearman-rho CORELATION - Pearson-r - Spearman-rho Scatter Diagram A scatter diagram is a graph that shows that the relationship between two variables measured on the same individual. Each individual in the set is

More information

COVENANT UNIVERSITY NIGERIA TUTORIAL KIT OMEGA SEMESTER PROGRAMME: ECONOMICS

COVENANT UNIVERSITY NIGERIA TUTORIAL KIT OMEGA SEMESTER PROGRAMME: ECONOMICS COVENANT UNIVERSITY NIGERIA TUTORIAL KIT OMEGA SEMESTER PROGRAMME: ECONOMICS COURSE: CBS 221 DISCLAIMER The contents of this document are intended for practice and leaning purposes at the undergraduate

More information

Nominal Data. Parametric Statistics. Nonparametric Statistics. Parametric vs Nonparametric Tests. Greg C Elvers

Nominal Data. Parametric Statistics. Nonparametric Statistics. Parametric vs Nonparametric Tests. Greg C Elvers Nominal Data Greg C Elvers 1 Parametric Statistics The inferential statistics that we have discussed, such as t and ANOVA, are parametric statistics A parametric statistic is a statistic that makes certain

More information

Intuitive Biostatistics: Choosing a statistical test

Intuitive Biostatistics: Choosing a statistical test pagina 1 van 5 < BACK Intuitive Biostatistics: Choosing a statistical This is chapter 37 of Intuitive Biostatistics (ISBN 0-19-508607-4) by Harvey Motulsky. Copyright 1995 by Oxfd University Press Inc.

More information

Nonparametric statistic methods. Waraphon Phimpraphai DVM, PhD Department of Veterinary Public Health

Nonparametric statistic methods. Waraphon Phimpraphai DVM, PhD Department of Veterinary Public Health Nonparametric statistic methods Waraphon Phimpraphai DVM, PhD Department of Veterinary Public Health Measurement What are the 4 levels of measurement discussed? 1. Nominal or Classificatory Scale Gender,

More information

Review for Final. Chapter 1 Type of studies: anecdotal, observational, experimental Random sampling

Review for Final. Chapter 1 Type of studies: anecdotal, observational, experimental Random sampling Review for Final For a detailed review of Chapters 1 7, please see the review sheets for exam 1 and. The following only briefly covers these sections. The final exam could contain problems that are included

More information

Do not copy, post, or distribute. Independent-Samples t Test and Mann- C h a p t e r 13

Do not copy, post, or distribute. Independent-Samples t Test and Mann- C h a p t e r 13 C h a p t e r 13 Independent-Samples t Test and Mann- Whitney U Test 13.1 Introduction and Objectives This chapter continues the theme of hypothesis testing as an inferential statistical procedure. In

More information

QT (Al Jamia Arts and Science College, Poopalam)

QT (Al Jamia Arts and Science College, Poopalam) QUANTITATIVE TECHNIQUES Quantitative techniques may be defined as those techniques which provide the decision makes a systematic and powerful means of analysis, based on quantitative data. It is a scientific

More information

PSY 305. Module 3. Page Title. Introduction to Hypothesis Testing Z-tests. Five steps in hypothesis testing

PSY 305. Module 3. Page Title. Introduction to Hypothesis Testing Z-tests. Five steps in hypothesis testing Page Title PSY 305 Module 3 Introduction to Hypothesis Testing Z-tests Five steps in hypothesis testing State the research and null hypothesis Determine characteristics of comparison distribution Five

More information

Analysing data: regression and correlation S6 and S7

Analysing data: regression and correlation S6 and S7 Basic medical statistics for clinical and experimental research Analysing data: regression and correlation S6 and S7 K. Jozwiak k.jozwiak@nki.nl 2 / 49 Correlation So far we have looked at the association

More information

Do not copy, post, or distribute

Do not copy, post, or distribute 14 CORRELATION ANALYSIS AND LINEAR REGRESSION Assessing the Covariability of Two Quantitative Properties 14.0 LEARNING OBJECTIVES In this chapter, we discuss two related techniques for assessing a possible

More information

Introduction to Business Statistics QM 220 Chapter 12

Introduction to Business Statistics QM 220 Chapter 12 Department of Quantitative Methods & Information Systems Introduction to Business Statistics QM 220 Chapter 12 Dr. Mohammad Zainal 12.1 The F distribution We already covered this topic in Ch. 10 QM-220,

More information

UNIT 3 CONCEPT OF DISPERSION

UNIT 3 CONCEPT OF DISPERSION UNIT 3 CONCEPT OF DISPERSION Structure 3.0 Introduction 3.1 Objectives 3.2 Concept of Dispersion 3.2.1 Functions of Dispersion 3.2.2 Measures of Dispersion 3.2.3 Meaning of Dispersion 3.2.4 Absolute Dispersion

More information

Correlation. A statistics method to measure the relationship between two variables. Three characteristics

Correlation. A statistics method to measure the relationship between two variables. Three characteristics Correlation Correlation A statistics method to measure the relationship between two variables Three characteristics Direction of the relationship Form of the relationship Strength/Consistency Direction

More information

PSYCHOLOGICAL STATISTICS

PSYCHOLOGICAL STATISTICS PSYCHOLOGICAL STATISTICS B Sc. Counselling Psychology 011 Admission onwards II SEMESTER COMPLIMETARY COURSE UIVERSITY OF CALICUT SCHOOL OF DISTACE EDUCATIO CALICUT UIVERSITY.P.O., MALAPPURAM, KERALA, IDIA

More information

LOOKING FOR RELATIONSHIPS

LOOKING FOR RELATIONSHIPS LOOKING FOR RELATIONSHIPS One of most common types of investigation we do is to look for relationships between variables. Variables may be nominal (categorical), for example looking at the effect of an

More information

Degrees of freedom df=1. Limitations OR in SPSS LIM: Knowing σ and µ is unlikely in large

Degrees of freedom df=1. Limitations OR in SPSS LIM: Knowing σ and µ is unlikely in large Z Test Comparing a group mean to a hypothesis T test (about 1 mean) T test (about 2 means) Comparing mean to sample mean. Similar means = will have same response to treatment Two unknown means are different

More information

DE CHAZAL DU MEE BUSINESS SCHOOL AUGUST 2003 MOCK EXAMINATIONS IOP 201-Q (INDUSTRIAL PSYCHOLOGICAL RESEARCH)

DE CHAZAL DU MEE BUSINESS SCHOOL AUGUST 2003 MOCK EXAMINATIONS IOP 201-Q (INDUSTRIAL PSYCHOLOGICAL RESEARCH) DE CHAZAL DU MEE BUSINESS SCHOOL AUGUST 003 MOCK EXAMINATIONS IOP 01-Q (INDUSTRIAL PSYCHOLOGICAL RESEARCH) Time: hours READ THE INSTRUCTIONS BELOW VERY CAREFULLY. Do not open this question paper until

More information

Contents. Acknowledgments. xix

Contents. Acknowledgments. xix Table of Preface Acknowledgments page xv xix 1 Introduction 1 The Role of the Computer in Data Analysis 1 Statistics: Descriptive and Inferential 2 Variables and Constants 3 The Measurement of Variables

More information

Inferences About the Difference Between Two Means

Inferences About the Difference Between Two Means 7 Inferences About the Difference Between Two Means Chapter Outline 7.1 New Concepts 7.1.1 Independent Versus Dependent Samples 7.1. Hypotheses 7. Inferences About Two Independent Means 7..1 Independent

More information

What Are Nonparametric Statistics and When Do You Use Them? Jennifer Catrambone

What Are Nonparametric Statistics and When Do You Use Them? Jennifer Catrambone What Are Nonparametric Statistics and When Do You Use Them? Jennifer Catrambone First, a bit about Parametric Statistics Data are expected to be randomly drawn from a normal population Minimum sample size

More information

Measures of Central Tendency and their dispersion and applications. Acknowledgement: Dr Muslima Ejaz

Measures of Central Tendency and their dispersion and applications. Acknowledgement: Dr Muslima Ejaz Measures of Central Tendency and their dispersion and applications Acknowledgement: Dr Muslima Ejaz LEARNING OBJECTIVES: Compute and distinguish between the uses of measures of central tendency: mean,

More information

ADMS2320.com. We Make Stats Easy. Chapter 4. ADMS2320.com Tutorials Past Tests. Tutorial Length 1 Hour 45 Minutes

ADMS2320.com. We Make Stats Easy. Chapter 4. ADMS2320.com Tutorials Past Tests. Tutorial Length 1 Hour 45 Minutes We Make Stats Easy. Chapter 4 Tutorial Length 1 Hour 45 Minutes Tutorials Past Tests Chapter 4 Page 1 Chapter 4 Note The following topics will be covered in this chapter: Measures of central location Measures

More information

Introduction to Statistics

Introduction to Statistics Introduction to Statistics Data and Statistics Data consists of information coming from observations, counts, measurements, or responses. Statistics is the science of collecting, organizing, analyzing,

More information

Physics 509: Non-Parametric Statistics and Correlation Testing

Physics 509: Non-Parametric Statistics and Correlation Testing Physics 509: Non-Parametric Statistics and Correlation Testing Scott Oser Lecture #19 Physics 509 1 What is non-parametric statistics? Non-parametric statistics is the application of statistical tests

More information

Statistics Primer. ORC Staff: Jayme Palka Peter Boedeker Marcus Fagan Trey Dejong

Statistics Primer. ORC Staff: Jayme Palka Peter Boedeker Marcus Fagan Trey Dejong Statistics Primer ORC Staff: Jayme Palka Peter Boedeker Marcus Fagan Trey Dejong 1 Quick Overview of Statistics 2 Descriptive vs. Inferential Statistics Descriptive Statistics: summarize and describe data

More information

Workshop Research Methods and Statistical Analysis

Workshop Research Methods and Statistical Analysis Workshop Research Methods and Statistical Analysis Session 2 Data Analysis Sandra Poeschl 08.04.2013 Page 1 Research process Research Question State of Research / Theoretical Background Design Data Collection

More information

CORRELATION. compiled by Dr Kunal Pathak

CORRELATION. compiled by Dr Kunal Pathak CORRELATION compiled by Dr Kunal Pathak Flow of Presentation Definition Types of correlation Method of studying correlation a) Scatter diagram b) Karl Pearson s coefficient of correlation c) Spearman s

More information

Diploma Part 2. Quantitative Methods. Examiners Suggested Answers

Diploma Part 2. Quantitative Methods. Examiners Suggested Answers Diploma Part 2 Quantitative Methods Examiners Suggested Answers Q1 (a) A frequency distribution is a table or graph (i.e. a histogram) that shows the total number of measurements that fall in each of a

More information

Rama Nada. -Ensherah Mokheemer. 1 P a g e

Rama Nada. -Ensherah Mokheemer. 1 P a g e - 9 - Rama Nada -Ensherah Mokheemer - 1 P a g e Quick revision: Remember from the last lecture that chi square is an example of nonparametric test, other examples include Kruskal Wallis, Mann Whitney and

More information

LC OL - Statistics. Types of Data

LC OL - Statistics. Types of Data LC OL - Statistics Types of Data Question 1 Characterise each of the following variables as numerical or categorical. In each case, list any three possible values for the variable. (i) Eye colours in a

More information

10/31/2012. One-Way ANOVA F-test

10/31/2012. One-Way ANOVA F-test PSY 511: Advanced Statistics for Psychological and Behavioral Research 1 1. Situation/hypotheses 2. Test statistic 3.Distribution 4. Assumptions One-Way ANOVA F-test One factor J>2 independent samples

More information

Correlation. We don't consider one variable independent and the other dependent. Does x go up as y goes up? Does x go down as y goes up?

Correlation. We don't consider one variable independent and the other dependent. Does x go up as y goes up? Does x go down as y goes up? Comment: notes are adapted from BIOL 214/312. I. Correlation. Correlation A) Correlation is used when we want to examine the relationship of two continuous variables. We are not interested in prediction.

More information

Math 101: Elementary Statistics Tests of Hypothesis

Math 101: Elementary Statistics Tests of Hypothesis Tests of Hypothesis Department of Mathematics and Computer Science University of the Philippines Baguio November 15, 2018 Basic Concepts of Statistical Hypothesis Testing A statistical hypothesis is an

More information

Sampling Distributions: Central Limit Theorem

Sampling Distributions: Central Limit Theorem Review for Exam 2 Sampling Distributions: Central Limit Theorem Conceptually, we can break up the theorem into three parts: 1. The mean (µ M ) of a population of sample means (M) is equal to the mean (µ)

More information

CIVL /8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8

CIVL /8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8 CIVL - 7904/8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8 Chi-square Test How to determine the interval from a continuous distribution I = Range 1 + 3.322(logN) I-> Range of the class interval

More information

Types of Statistical Tests DR. MIKE MARRAPODI

Types of Statistical Tests DR. MIKE MARRAPODI Types of Statistical Tests DR. MIKE MARRAPODI Tests t tests ANOVA Correlation Regression Multivariate Techniques Non-parametric t tests One sample t test Independent t test Paired sample t test One sample

More information

Answer keys for Assignment 10: Measurement of study variables (The correct answer is underlined in bold text)

Answer keys for Assignment 10: Measurement of study variables (The correct answer is underlined in bold text) Answer keys for Assignment 10: Measurement of study variables (The correct answer is underlined in bold text) 1. A quick and easy indicator of dispersion is a. Arithmetic mean b. Variance c. Standard deviation

More information

Ch. 16: Correlation and Regression

Ch. 16: Correlation and Regression Ch. 1: Correlation and Regression With the shift to correlational analyses, we change the very nature of the question we are asking of our data. Heretofore, we were asking if a difference was likely to

More information

Essential Maths Skills. for AS/A-level. Psychology Answers. Molly Marshall

Essential Maths Skills. for AS/A-level. Psychology Answers. Molly Marshall Essential Maths Skills for AS/A-level Psychology Answers Molly Marshall 1 Arithmetic and numerical computation Expressions in decimal and standard form Decimal form Guided question (p.9) 1 Table A.1 Millions

More information

Copyright, Nick E. Nolfi MPM1D9 Unit 6 Statistics (Data Analysis) STA-1

Copyright, Nick E. Nolfi MPM1D9 Unit 6 Statistics (Data Analysis) STA-1 UNIT 6 STATISTICS (DATA ANALYSIS) UNIT 6 STATISTICS (DATA ANALYSIS)... 1 INTRODUCTION TO STATISTICS... 2 UNDERSTANDING STATISTICS REQUIRES A CHANGE IN MINDSET... 2 UNDERSTANDING SCATTER PLOTS #1... 3 UNDERSTANDING

More information

ESP 178 Applied Research Methods. 2/23: Quantitative Analysis

ESP 178 Applied Research Methods. 2/23: Quantitative Analysis ESP 178 Applied Research Methods 2/23: Quantitative Analysis Data Preparation Data coding create codebook that defines each variable, its response scale, how it was coded Data entry for mail surveys and

More information

CHI SQUARE ANALYSIS 8/18/2011 HYPOTHESIS TESTS SO FAR PARAMETRIC VS. NON-PARAMETRIC

CHI SQUARE ANALYSIS 8/18/2011 HYPOTHESIS TESTS SO FAR PARAMETRIC VS. NON-PARAMETRIC CHI SQUARE ANALYSIS I N T R O D U C T I O N T O N O N - P A R A M E T R I C A N A L Y S E S HYPOTHESIS TESTS SO FAR We ve discussed One-sample t-test Dependent Sample t-tests Independent Samples t-tests

More information

Chapter 7 Comparison of two independent samples

Chapter 7 Comparison of two independent samples Chapter 7 Comparison of two independent samples 7.1 Introduction Population 1 µ σ 1 1 N 1 Sample 1 y s 1 1 n 1 Population µ σ N Sample y s n 1, : population means 1, : population standard deviations N

More information

UNIVERSITY OF CALICUT

UNIVERSITY OF CALICUT QUANTITATIVE TECHNIQUES FOR BUSINESS COMPLEMENTARY COURSE BBA (III Semester) B Com( (IV Semester) (2011 Admission) UNIVERSITY OF CALICUT SCHOOL OF DISTANCE EDUCATION Calicut University P.O. Malappuram,

More information

Statistical Inference: Estimation and Confidence Intervals Hypothesis Testing

Statistical Inference: Estimation and Confidence Intervals Hypothesis Testing Statistical Inference: Estimation and Confidence Intervals Hypothesis Testing 1 In most statistics problems, we assume that the data have been generated from some unknown probability distribution. We desire

More information

Chapter 16: Correlation

Chapter 16: Correlation Chapter : Correlation So far We ve focused on hypothesis testing Is the relationship we observe between x and y in our sample true generally (i.e. for the population from which the sample came) Which answers

More information

The goodness-of-fit test Having discussed how to make comparisons between two proportions, we now consider comparisons of multiple proportions.

The goodness-of-fit test Having discussed how to make comparisons between two proportions, we now consider comparisons of multiple proportions. The goodness-of-fit test Having discussed how to make comparisons between two proportions, we now consider comparisons of multiple proportions. A common problem of this type is concerned with determining

More information

Chapter Fifteen. Frequency Distribution, Cross-Tabulation, and Hypothesis Testing

Chapter Fifteen. Frequency Distribution, Cross-Tabulation, and Hypothesis Testing Chapter Fifteen Frequency Distribution, Cross-Tabulation, and Hypothesis Testing Copyright 2010 Pearson Education, Inc. publishing as Prentice Hall 15-1 Internet Usage Data Table 15.1 Respondent Sex Familiarity

More information

Module 03 Lecture 14 Inferential Statistics ANOVA and TOI

Module 03 Lecture 14 Inferential Statistics ANOVA and TOI Introduction of Data Analytics Prof. Nandan Sudarsanam and Prof. B Ravindran Department of Management Studies and Department of Computer Science and Engineering Indian Institute of Technology, Madras Module

More information

determine whether or not this relationship is.

determine whether or not this relationship is. Section 9-1 Correlation A correlation is a between two. The data can be represented by ordered pairs (x,y) where x is the (or ) variable and y is the (or ) variable. There are several types of correlations

More information

THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE

THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE THE ROYAL STATISTICAL SOCIETY 004 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE PAPER II STATISTICAL METHODS The Society provides these solutions to assist candidates preparing for the examinations in future

More information

Dover- Sherborn High School Mathematics Curriculum Probability and Statistics

Dover- Sherborn High School Mathematics Curriculum Probability and Statistics Mathematics Curriculum A. DESCRIPTION This is a full year courses designed to introduce students to the basic elements of statistics and probability. Emphasis is placed on understanding terminology and

More information