Elementary Statistics in Social Research Essentials Jack Levin James Alan Fox Third Edition

Similar documents
Introductory Statistics Neil A. Weiss Ninth Edition

Elementary Linear Algebra with Applications Bernard Kolman David Hill Ninth Edition

The Practice Book for Conceptual Physics. Paul G. Hewitt Eleventh Edition

Differential Equations and Linear Algebra C. Henry Edwards David E. Penney Third Edition

William R. Wade Fourth Edition

Introductory Chemistry Essentials Nivaldo J. Tro Fourth Edition

Physics for Scientists & Engineers with Modern Physics Douglas C. Giancoli Fourth Edition

Pearson Education Limited Edinburgh Gate Harlow Essex CM20 2JE England and Associated Companies throughout the world

Pearson Education Limited Edinburgh Gate Harlow Essex CM20 2JE England and Associated Companies throughout the world

Student Workbook for Physics for Scientists and Engineers: A Strategic Approach with Modern Physics Randall D. Knight Third Edition

Student Workbook for College Physics: A Strategic Approach Volume 2 Knight Jones Field Andrews Second Edition

Pearson Education Limited Edinburgh Gate Harlow Essex CM20 2JE England and Associated Companies throughout the world

Multivariate Data Analysis Joseph F. Hair Jr. William C. Black Barry J. Babin Rolph E. Anderson Seventh Edition

Pearson Education Limited Edinburgh Gate Harlow Essex CM20 2JE England and Associated Companies throughout the world

Pearson Education Limited Edinburgh Gate Harlow Essex CM20 2JE England and Associated Companies throughout the world

A Second Course in Statistics Regression Analysis William Mendenhall Terry Sincich Seventh Edition......

A First Course in Probability Sheldon Ross Ninth Edition

Applied Multivariate Statistical Analysis Richard Johnson Dean Wichern Sixth Edition

Pearson Education Limited Edinburgh Gate Harlow Essex CM20 2JE England and Associated Companies throughout the world

Process Control Instrumentation Technology Curtis D. Johnson Eighth Edition

Pearson Education Limited Edinburgh Gate Harlow Essex CM20 2JE England and Associated Companies throughout the world

Chemistry: The Central Science Brown LeMay Bursten Murphy Woodward Twelfth Edition

Introduction to Electrodynamics David J. Griffiths Fourth Edition

Essential Organic Chemistry. Paula Y. Bruice Second Edition

Karen C. Timberlake William Timberlake Fourth Edition

Pearson Education Limited Edinburgh Gate Harlow Essex CM20 2JE England and Associated Companies throughout the world

History of Mathematics. Victor J. Katz Third Edition

Introduction to Mathematical Statistics and Its Applications Richard J. Larsen Morris L. Marx Fifth Edition

Figure Histogram (a) and normal probability plot (b) of butterfly wings data

Pearson Education Limited Edinburgh Gate Harlow Essex CM20 2JE England and Associated Companies throughout the world

Organic Chemistry Paula Y. Bruice Seventh Edition

Pearson Education Limited Edinburgh Gate Harlow Essex CM20 2JE England and Associated Companies throughout the world

Pearson Education Limited Edinburgh Gate Harlow Essex CM20 2JE England and Associated Companies throughout the world

GLOBAL EDITION. Thomas. CALCULUS Early Transcendentals Thirteenth Edition in SI Units

Pearson Education Limited. Edinburgh Gate Harlow Essex CM20 2JE England. and Associated Companies throughout the world

Pearson Education Limited Edinburgh Gate Harlow Essex CM20 2JE England and Associated Companies throughout the world

Suzanne Bell Second Edition

Series editor: Anita Straker

Physical Chemistry Thomas Engel Philip Reid Third Edition

Pearson Education Limited Edinburgh Gate Harlow Essex CM20 2JE England and Associated Companies throughout the world

Proportional Relationships

Essentials of Geology F.K. Lutgens E.J. Tarbuck D.G. Tasa Eleventh Edition

Pearson Education Limited Edinburgh Gate Harlow Essex CM20 2JE England and Associated Companies throughout the world

Probability and Statistical Inference NINTH EDITION

Earth Life System. An Introduction to the

Pearson Education Limited Edinburgh Gate Harlow Essex CM20 2JE England and Associated Companies throughout the world

MAPWORK PRACTICE EXERCISES 13+ ABERDEEN

Physics Technology Update James S. Walker Fourth Edition

Pearson Education Limited Edinburgh Gate Harlow Essex CM20 2JE England and Associated Companies throughout the world

Modern Control Systems. Richard C. Dorf Robert H. Bishop Twelfth Edition

MATH STUDENT BOOK. 6th Grade Unit 9

Statistics for Managers Using Microsoft Excel 7 th Edition

Hypothesis Tests and Estimation for Population Variances. Copyright 2014 Pearson Education, Inc.

Seasons. by Helen Stanley. illustrated by Gerardo Suzan HOUGHTON MIFFLIN

Blast Off to the Moon!

Acknowledgements of third party content appear on page 33, which constitutes an extension of this copyright page.

SAT & ACT Foundations

APPLIED STRUCTURAL EQUATION MODELLING FOR RESEARCHERS AND PRACTITIONERS. Using R and Stata for Behavioural Research

SAMPLE. The SSAT Course Book MIDDLE & UPPER LEVEL QUANTITATIVE. Focusing on the Individual Student

Practical Statistics for Geographers and Earth Scientists

This content has been downloaded from IOPscience. Please scroll down to see the full text.

Advanced Engineering. Dynamics. H. R. Harrison. T. Nettleton. Formerly Department of Mechanical Engineering & Aeronautics City University London

Statistics for Managers using Microsoft Excel 6 th Edition

MATH STUDENT BOOK. 11th Grade Unit 9

Our true, holographic blueprint of the human matrix system

M208 Pure Mathematics AA1. Numbers

2002 HSC Notes from the Marking Centre Geography

Introduction to Mathematical Statistics Robert V. Hogg Joeseph McKean Allen T. Craig Seventh Edition

This content has been downloaded from IOPscience. Please scroll down to see the full text.

Sample file. Page 1 of 18. Copyright 2013 A+ Interactive MATH (an A+ TutorSoft Inc. company), All Rights Reserved.

Outline Maps of Canada

by Claire Tan HOUGHTON MIFFLIN

ArcGIS Pro: Essential Workflows STUDENT EDITION

Satanism, Magic and Mysticism in Fin-de-siècle France

Astronomy with a Budget Telescope

Practice General Test # 4 with Answers and Explanations. Large Print (18 point) Edition

ì<(sk$m)=bdhded< +^-Ä-U-Ä-U

Applied Structural Equation Modelling for Researchers and Practitioners Using R and Stata for Behavioural Research

Acknowledgements of third party content appear on page 18, which constitutes an extension of this copyright page.

SAMPLE. stinction. Secondary. Ivan Lau Kim Soon. ~ Marshall Cavendish ~Education

Where do I live? Geography teaching resource. Paula Owens. Primary. Locating own home address

Inequalities. Inequalities. Curriculum Ready.

ELEMENTARY ENGINEERING MECHANICS

FOUNDATION SCIENCE FOR ENGINEERS

ArcGIS 9 ArcGIS StreetMap Tutorial

Finite Element Analysis for Heat Transfer. Theory and Software

New Uses of Sulfur II

Arrow Pushing in Organic Chemistry

Mark Scheme (Results) June GCE Decision D2 (6690) Paper 1

Mark Scheme (Results) June GCSE Mathematics (2MB01) Foundation 5MB2F (Non-Calculator) Paper 01

The normal distribution Mixed exercise 3

Photo Credits: All images Harcourt

Pearson Education Limited Edinburgh Gate Harlow Essex CM20 2JE England and Associated Companies throughout the world

Alternative Presentation of the Standard Normal Distribution

EL-W535B Calculator Teaching Activities For The Classroom

HOUGHTON MIFFLIN. by Miguela Halcón

Preparing Spatial Data

The Shape, Center and Spread of a Normal Distribution - Basic

Mark Scheme (Results) June GCE Core Mathematics C1 (6663) Paper 1

Suggested levels for Guided Reading, DRA, Lexile, and Reading Recovery are provided in the Pearson Scott Foresman Leveling Guide.

Transcription:

Elementary Statistics in Social Research Essentials Jack Levin James Alan Fox Third Edition

Pearson Education Limited Edinburgh Gate Harlow Essex CM20 2JE England and Associated Companies throughout the world Visit us on the World Wide Web at: www.pearsoned.co.uk Pearson Education Limited 2014 All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, electronic, mechanical, photocopying, recording or otherwise, without either the prior written permission of the publisher or a licence permitting restricted copying in the United Kingdom issued by the Copyright Licensing Agency Ltd, Saffron House, 6 10 Kirby Street, London EC1N 8TS. All trademarks used herein are the property of their respective owners. The use of any trademark in this text does not vest in the author or publisher any trademark ownership rights in such trademarks, nor does the use of such trademarks imply any affiliation with or endorsement of this book by such owners. ISBN 10: 1-292-02718-5 ISBN 10: 1-269-37450-8 ISBN 13: 978-1-292-02718-0 ISBN 13: 978-1-269-37450-7 British Library Cataloguing-in-Publication Data A catalogue record for this book is available from the British Library Printed in the United States of America

m +1s +2s +1.40s FIGURE 11 The position of a raw score that lies 1.40s above m than 2s from the mean. Thus, we know this distance from the mean would include more than 34.13% but less than 47.72% of the total area under the normal curve. To determine the exact percentage within this interval, we must employ the table below. This shows the percent under the normal curve (1) between the mean and various sigma distances from the mean (in column b) and (2) at or beyond various scores toward either tail of the distribution (in column c). These sigma distances (from 0.00 to 4.00) are labeled z in the left-hand column (column a) and have been given to two decimal places. Notice that the symmetry of the normal curve makes it possible to give percentages for only one side of the mean, that is, only one-half of the curve (50%). Values in the table below represent either side. (a) z (b) Area between Mean and z (c) Area beyond z.00.00 50.00.01.40 49.60.02.80 49.20.03 1.20 48.80.04 1.60 48.40.05 1.99 48.01.06 2.39 47.61.07 2.79 47.21.08 3.19 46.81.09 3.59 46.41 104

When learning to use and understand this table, we might first attempt to locate the percent of cases between a sigma distance of 1.00 and the mean (the reason being that we already know that 34.13% of the total area falls between these points on the base line). Looking at column b, we see it indeed indicates that exactly 34.13% of the total frequency falls between the mean and a sigma distance of 1.00. Likewise, we see that the area between the mean and the sigma distance 2.00 includes exactly 47.72% of the total area under the curve. But what about finding the percent of cases between the mean and a sigma distance of 1.40? This was the problem in Figure 11, which necessitated the use of the table in the first place. The entry in column b corresponding to a sigma distance of 1.40 includes 41.92% of the total area under the curve. Finally, how do we determine the percent of cases at or beyond 1.40 standard deviations from the mean? Without a table to help us, we might locate the percentage in this area under the normal curve by simply subtracting our earlier answer from 50%, because this is the total area lying on either side of the mean. However, this has already been done in column c, where we see that exactly 8.08% 150-41.92 = 8.082 of the cases fall at or above the score that is 1.40 standard deviations from the mean. Standard Scores and the Normal Curve We are now prepared to find the percent of the total area under the normal curve associated with any given sigma distance from the mean. However, at least one more important question remains to be answered: How do we determine the sigma distance of any given raw score? That is, how do we translate our raw score the score that we originally collected from our respondents into units of standard deviation? If we wished to translate feet into yards, we would simply divide the number of feet by 3, because there are 3 ft in a yard. Likewise, if we were translating minutes into hours, we would divide the number of minutes by 60, because there are 60 min in every hour. In precisely the same manner, we can translate any given raw score into sigma units by dividing the distance of the raw score from the mean by the standard deviation. To illustrate, let us imagine a raw score of 16 from a distribution in which m is 13 and s is 2. Taking the difference between the raw score and the mean and obtaining a deviation 116-132, we see that a raw score of 16 is 3 raw-score units above the mean. Dividing this raw-score distance by s = 2, we see that this raw score is 1.5 (one and one-half) standard deviations above the mean. In other words, the sigma distance of a raw score of 16 in this particular distribution is 1.5 standard deviations above the mean. We should note that regardless of the measurement situation, there are always 3 ft in a yard and 60 min in an hour. The constancy that marks these other standard measures is not shared by the standard deviation. It changes from one distribution to another. For this reason, we must know the standard deviation of a distribution by calculating it, estimating it, or being given it by someone else before we are able to translate any particular raw score into units of standard deviation. The process that we have just illustrated that of finding sigma distance from the mean, yields a value called a z score or standard score, which indicates the direction and degree that any given raw score deviates from the mean of a distribution on a scale of sigma units. Thus, a z score of +1.4 indicates that the raw score lies 1.4 s (or almost 1 1 2 s ) above the mean, 105

z = 2.1 m z = +1.4 FIGURE 12 The position of z = -2.1 and z = +1.4 in a normal distribution whereas a z score of -2.1 means that the raw score falls slightly more than 2s below the mean (see Figure 12). We obtain a z score by finding the deviation 1X - m2, which gives the distance of the raw score from the mean, and then dividing this raw-score deviation by the standard deviation. Computed by formula, z = X - m s where m = mean of a distribution s = standard deviation of a distribution z = standard score As an example, suppose we are studying the distribution of annual income for home healthcare workers in a large agency in which the mean annual income is $20,000 and the standard deviation is $1,500. Assuming that the distribution of annual income is normally distributed we can translate the raw score from this distribution, $22,000, into a standard score in the following manner: z = 22,000-20,000 1,500 = +1.33 Thus, an annual income of $22,000 is 1.33 standard deviations above the mean annual income of $20,000 (see Figure 13). 106

$20,000 $22,000 z = +1.33 FIGURE 13 The position of z = +1.33 for the raw score $22,000 As another example, suppose that we are working with a normal distribution of scores representing job satisfaction among a group of city workers. The scale ranges from 0 to 20, with higher scores representing greater satisfaction with the job. Let us say this distribution has a mean of 10 and a standard deviation of 3. To determine how many standard deviations a score of 3 lies from the mean of 10, we obtain the difference between this score and the mean, that is, X - m = 3-10 We then divide by the standard deviation: = -7 3 = -7 z = X - m s = -2.33 Thus, as shown in Figure 14, a raw score of 3 in this distribution of scores falls 2.33 standard deviations below the mean. Finding Probability under the Normal Curve As we shall now see, the normal curve can be used in conjunction with z scores and the table shown earlier to determine the probability of obtaining any raw score in a distribution. In the 107

3 z = 2.33 10 FIGURE 14 The position of z = -2.33 for the raw score 3 present context, the normal curve is a distribution in which it is possible to determine probabilities associated with various points along its baseline. As noted earlier, the normal curve is a probability distribution in which the total area under the curve equals 100%; it contains a central area surrounding the mean, where scores occur most frequently, and smaller areas toward either end, where there is a gradual flattening out and thus a smaller proportion of extremely high and low scores. In probability terms, then, we can say that probability decreases as we travel along the baseline away from the mean in either direction. Thus, to say that 68.26% of the total frequency under the normal curve falls between -1s and +1s from the mean is to say that the probability is approximately 68 in 100 that any given raw score will fall within this interval. Similarly, to say that 95.44% of the total frequency under the normal curve falls between -2s and +2s from the mean is also to say that the probability is approximately 95 in 100 that any raw score will fall within this interval, and so on. This is precisely the same concept of probability or relative frequency that we saw in operation when flipping pairs of coins. Note, however, that the probabilities associated with areas under the normal curve are always given relative to 100%, which is the entire area under the curve (for example, 68 in 100, 95 in 100, 99 in 100). BOX 1 Step-by-Step Illustration: Probability under the Normal Curve To apply the concept of probability in relation to the normal distribution, let us return to an earlier example. We were then asked to translate into its z-score equivalent a raw score from an agency s distribution of annual salary for healthcare workers, which we assumed approximated a normal curve. This distribution of income had a mean of $20,000 with a standard deviation of $1,500. 108