UNIVERSITY OF MASSACHUSETTS Department of Biostatistics and Epidemiology BioEpi 540W - Introduction to Biostatistics Fall 2004

Size: px
Start display at page:

Download "UNIVERSITY OF MASSACHUSETTS Department of Biostatistics and Epidemiology BioEpi 540W - Introduction to Biostatistics Fall 2004"

Transcription

1 UNIVERSITY OF MASSACHUSETTS Department of Biostatistics and Epidemiology BioEpi 50W - Introduction to Biostatistics Fall 00 Exercises with Solutions Topic Summarizing Data Due: Monday September 7, 00 READINGS. Text (Rosner B. Fundamentals of Biostatistics, 5 th Edition) Chapters and.. Study Guide (Rosner B. Study Guide for Fundamentals of Biostatistics, 5 th Edition) Chapter. EXERCISES:. For each of the following variables indicate whether it is quantitative or qualitative and specify the measurement scale that is employed when taking measurements on each: (source: Daniel, page, problem #6.) a) Class standing of members of this class relative to each other b) Admitting diagnosis of patients admitted to a mental health clinic c) Weights of babies born in a hospital during a year d) Gender of babies born in a hospital during a year e) Range of motion of elbow joint of students enrolled in a university health sciences curriculum f) Under-arm temperature of day-old infants born in a hospital

2 . Using the data below (source: Daniel, 6 th edition page 30, problem.3.5), a. Construct a stem and leaf display. b. Construct a frequency table with columns for frequency, relative frequency, cumulative frequency, and cumulative relative frequency. c. Construct a histogram. d. Construct a frequency polygon 3. Data were recorded on the age in years and height in cm of 0 high school students in a classroom. Females Males Age Height Age Height

3 3 a. Create a frequency table for age, with columns for frequency, relative frequency, cumulative frequency, and cumulative relative frequency. b. Create a histogram for age. c. For each sex, create a stem-and-leaf display for height. What does a comparison of the displays suggest about the students? d. For each sex, create histograms for height using the same scale.. Let x =3, x =, x 3 =, and x =6 a. Express the following sum in sigma notation and evaluate numerically. (x + x + x 3 + x ) b. Express the following sum in sigma notation and evaluate numerically. x + x + x 3 + x c. Evaluate the following numerically. Σ (X i ) for i=. d. Evaluate the following numerically. Σ 3X i for i=. 5. a. The following are behavioral ratings as measured by the Zang Anxiety Scale (ZAS) for 6 persons with a diagnosis of panic disorder: Compute the mean, median, mode, range, variance, and standard deviation, and the 5th and 75th percentiles.

4 b. The following are behavioral ratings as measured by the Zang Anxiety Scale (ZAS) for healthy controls: Compute the mean, median, mode, range, variance, and standard deviation, and the 5th and 75th percentiles. c. Construct Box and Whisker plots using the data from parts "a" and "b". In one or two sentences, compare the two groups of sentences. 6. The following table shows the age distribution of cases of a certain disease reported during a year in a particular state. Age Number of Cases TOTAL 75 6a. Construct a frequency table with columns for class endpoints, class midpoint, frequency, relative frequency, cumulative frequency, and cumulative relative frequency. 6b. Construct a cumulative relative frequency plot of the data. Use this plot to estimate the 0th, 5th, 50th, and 75th percentiles. 6c. Compute the mean, median, variance, and standard deviation.

5 5 7. For women undergoing in vitro fertilization, various therapies are used to stimulate the ovaries. In one study comparing the effectiveness of a new hormone therapy on three groups of women with different types of fertility problems, an outcome of interest is the number of oocytes that 'ripened'. Some summary statistics on the number of ripened oocytes per woman for each of the three groups are reported below. Group Statistic 3 n 38 9 mean median P5 5 5 P75 minimum 5 maximum 0 3 7a. Compute box and whisker plots for the three groups. 7b. In your opinion, which statistics are best for comparing these three groups? Why?

6 6 SOLUTIONS #. a. Qualitative - ordinal b. Qualitative - nominal c. Quantitative - ratio d. Qualitative - nominal e. Quantitative - ratio f. Quantitative - interval #. How to Create a STATISTIX Data Set Containing This Information STEP. Enter the data. ) Begin the software program STATISTIX ) Click x on DATA. 3) Click x on INSERT ) Click x on VARIABLES. 5) Click x inside the NEW VARIABLE NAMES dialog box. Type X <enter>. 6) Click x on OK At this point you will see a spread sheet with space for only one record. You ll need to instruct STATISTIX that you actually have a data set of 5 records 7) Click x on DATA 8) Click x on INSERT 9) Click x on CASES 0) Click x inside the FIRST NEW CASE NUMBER dialog box Type <enter> ) Click x inside the NUMBER OF CASES TO INSERT dialog box. Type <enter> 7) To enter your data by rows: Type in the first value. Press the RIGHT arrow. Type in the second value. Press <enter>. All other values are entered the same as the second value. To enter your data by columns: Type in the first value. Press the DOWN arrow. Type in the second value. Press <enter>.

7 7 All other values are entered the same as the second value. Continue until all values of one column are entered. Use the arrow keys to get to the first record in the next column and proceed like with the first column. You should have 5 records with values per record. If you have 6 records, click x on the record number 6. Click x on EDIT. Click x on CUT. STEP. Save the data. ) Click x on FILE. ) Click x on SAVE AS. 3) *.sx should be highlighted in the FILE NAME dialog box. Type unit3ex.sx <enter>. #a. Here is the stem and leaf diagram I constructed Stem Leaf Other groupings for the stem are okay.

8 8 How to Request a Stem and Leaf Plot in STATISTIX Following assumes that you are already in STATISTIX and have already opened unit3ex.sx ) Click x on STATISTICS ) Click x on SUMMARY STATISTICS. 3) Click x on STEM AND LEAF PLOT ) Click x on the variable X Then click on the RIGHT ARROW 5) Click x on OK You should get the following STEM AND LEAF PLOT OF X LEAF DIGIT UNIT = MINIMUM REPRESENTS. MEDIAN MAXIMUM STEM LEAVES (7) CASES INCLUDED 0 MISSING CASES To the reader: Can you guess what the numbers at the far left are telling you? Hint Read from top to bottom, then from bottom to top!

9 9 How To Insert Text STATISTIX Results into a WORD Document Following assumes that you are in STATISTIX and have just gotten the plot ) Click x on FILE ) Click x on SAVE AS. 3) (Check to be sure results are being saved to the directory of your choosing) ) Type unit3exa.txt in the file name dialog box 5) (Minimize the STATISTIX window so that you can work in WORD) Next assumes that you are in WORD 6) Position cursor to where you want the results to be placed 7) Click x on INSERT 8) Click x on FILE 9) Using the LOOK IN feature, position yourself in the directory containing your results 0) Click x on unit3exa.txt

10 0 #b. Here is what I constructed by hand Class Relative Cumulative Cumulative Interval Frequency Frequency Frequency Rel. Frequency TOTAL Other class intervals are okay.

11 How to Request a Frequency Distribution in STATISTIX Following assumes that you are already in STATISTIX and have already opened unit3ex.sx AND that you have just finished your stem and leaf diagram ) Click x on WINDOW ) Click x on unit3ex.sx. At this point, STATISTIX should have returned you to your spreadsheet of data 3) Click x on STATISTICS 3) Click x on SUMMARY STATISTICS ) Click x on FREQUENCY DISTRIBUTION ) Click x on the variable X Then click on the RIGHT ARROW 5) Click x on OK You should get the following FREQUENCY DISTRIBUTION OF X CUMULATIVE VALUE FREQ PERCENT FREQ PERCENT TOTAL

12 #c. How to Request a Histogram in STATISTIX Following assumes that you are already in STATISTIX and have already opened unit3ex.sx AND that you have just finished some other description or analysis ) Click x on WINDOW ) Click x on unit3ex.sx. 3) Click x on STATISTICS ) Click x on SUMMARY STATISTICS 5) Click x on HISTOGRAM 6) Click x on the variable X Then click on the RIGHT ARROW 6) Click x on OK You should get the following (note on titles below ) 0 Histogram for #c 8 Frequency X To get the titles as shown here 7) Click x on RESULTS 8) In the TOP TITLE dialog box, type Histogram for #c

13 3 How To Insert GRAPHICAL STATISTIX Results into a WORD Document Following assumes that you are in STATISTIX and have just gotten the plot ) Click x on FILE ) Click x on SAVE AS. ) (Check to be sure results are being saved to the directory of your choosing) ) Type unit3exc.emf in the file name dialog box 5) (Minimize the STATISTIX window so that you can work in WORD) Next assumes that you are in WORD 6) Position cursor to where you want the results to be placed 7) Click x on INSERT 8) Click x on PICTURE 9) Click x on FROM FILE 0) Using the LOOK IN feature, position yourself in the directory containing your results ) Click x on unit3exc.emf

14 #d. How to Request a Cumulative Distribution in STATISTIX Following assumes that you are already in STATISTIX and have already opened unit3ex.sx AND that you have just finished some other description or analysis ) Click x on WINDOW ) Click x on unit3ex.sx. 3) Click x on STATISTICS ) Click x on SUMMARY STATISTICS 5) Click x on HISTOGRAM 6) Click x on the variable X Then click on the RIGHT ARROW 7) CLICK x on the CUMULATIVE DISTRIBUTION option 8) Click x on OK You should get the following (except that I changed the title a bit ) 00 Cumulative Distribution 80 Percent X

15 5 #3a. Relative Cumulative Cumulative Age Frequency Frequency Frequency Rel. Frequency TOTAL 0.00 #3b. Histogram of Age Frequency Age, years

16 6 #3c. Here is what I get by hand (I like it better than the STATISTIX output below) Females Stem Males How To Supply Labels for Values of Discrete Variables (so that output is easier to read!) Following assumes that you are in STATISTIX and you wish to assign labels to the values 0 and for the variable SEX so that 0=FEMALE and =MALE ) Click x on WINDOW ) Click x on unit3ex3 spreadsheet. 3) Click x on DATA ) Click x on LABELS 5) Click x on VALUE LABELS 6) Click x on the variable SEX and RIGHT ARROW it to the SOURCE VARIABLE box 7) In the VALUE box type 0. Below this, in the LABEL box type 0=Female 8) In the VALUE box type. Below this, in the LABEL box, type =Male. 9) Click x on SAVE. 0) Click x on CLOSE.

17 7 This is what I get in STATISTIX STEM AND LEAF PLOT OF HEIGHT FOR SEX = 0=female LEAF DIGIT UNIT = MINIMUM REPRESENTS 5. MEDIAN MAXIMUM 7.00 STEM LEAVES () CASES INCLUDED 0 MISSING CASES STEM AND LEAF PLOT OF HEIGHT FOR SEX = =male LEAF DIGIT UNIT = MINIMUM REPRESENTS 67. MEDIAN MAXIMUM STEM LEAVES () CASES INCLUDED 0 MISSING CASES Males tend to be taller than females #3d. Class FEMALES MALES Interval Freq. Re. Freq. Freq. Rel. Freq

18 8 How To Select A Subset of the Data for Analysis We wish to construct separate histograms, first for females, then for males. STATISTIX is a bit awkward in this. To Select Females, STATISTIX requires that you omit Males ) Click x on WINDOW ) Click x on unit3ex3 spreadsheet. 3) Click x on DATA ) Click x on OMIT/SELECT/RESTORE CASES 5) Type Omit sex= 6) Click x on GO 7) Click x on CLOSE Construct your histogram using instructions per above. Take care to title it clearly. Before you can select males, you must restore the entire data set ) Click x on WINDOW ) Click x on unit3ex3 spreadsheet 3) Click x on OMIT/SELECT/RESTORE CASES ) Type restore To Select Males, STATISTIX requires that you omit Females ) Type Omit sex=0 ) Click x on GO 3) Click x on CLOSE

19 9 This is what I get in STATISTIX Histogram of Heights - FEMALES 5 Frequency HEIGHT Histogram of Heights - MALES 5 Frequency HEIGHT

20 0 A. ( ) X + X + X 3 + X = Σ X i = = = = 96. i ( ) B. X + X + X + X = Σ X 3 = 3 i= i = = 6. C. Note: Σ( ) = ( 3 ) + ( ) + ( ) + ( 6 ) X i i= Σ = = = 38. ( Xi ) = Σ [ Xi Xi + ] i= i= = Σ X - Σ X + Σ i i i= i= i= = 6 - = 38. ( )( ) + ( )( )

21 D. Σ 3 Xi = 3 Σ X i i= i= ( ) = 3 = 5 A stem and leaf diagram might come in handy: MEAN MEDIAN 6 x = Σ X n i= i ( 56 ) =. 6 so = 5. = n x n First solve = = 35. th th Median is midpoint of 3 and observation. ~ x = ( + 3) so ~ x = MODE This sample is tri - modal 38, 5, RANGE Maximum - Minimum = 69 3 so range = 38

22 VARIANCE Let s save ourselves the trouble of a very long brute force formula by using the formula for grouped data. Let j index the unique values. There are unique values. j X j f j ( x j x) ( j ) f x x j TOTALS S j= = ( j ) Σ f x x j Σ j= f j = So S = Standard deviation S = S So S = 89.

23 3 5th Percentile First solve (.5) (n) = (.5) (6) = 6.5 So 5th percentile is the 7 th observation P 5 = 38 75th Percentile First solve (.75) (n) = (.75) (6) = 9.5 So 75th percentile is the 0 th observation P 75 = 5 5B MEAN x = Σ X ( ) i = 568 = 7. 0 So x = 7. 0 n i= MEDIAN n + + Solving = = Median is the th observation. So ~ x = 6 MODE mode = 5 RANGE Maximum - Minimum = 3 5. So Range = 9

24 Variance There are 6 unique values. j X j f j ( x j x) ( j ) f x x j TOTALS 9 98 S j= = ( j ) 6 Σ f x x j 6 Σ j= Standard deviation f j 67 = So S = S= S = 835. So S= 89. 5th Percentile Solving (.5) (n) = (.5) () = 5.5 So 5th percentile is 6th observation P 5 = 5 75th Percentile Solving (.75) (n) = (.75) () = 5.75 So 75th percentile is 6th observation P 75

25 5 5C. REMINDER - Use the same scale when comparing two groups. Group Patients Controls Mean Median 6 P P Interquartile Range (IQR) 3 3 P5-(.5)(IQR) P75+(.5)(IQR) * Min 3* 5* Max 69* 3 *=Whisker Notes on Whiskers ) IF P 5 - (.5) (IQR) < minimum, use minimum instead ) IF P 75 + (.5) (IQR) > maximum, use maximum instead Patients with panic disorder have Exercise ZAS scores #C that - Box are and higher Whisker than those Plotof controls. As well, ZAS scores of patients with panic disorder have more variability ZAS Score HEALTHY PANIC Healthy (n=), Panic Disorder (n=6)

26 6 6A. Class Class Relative Cumulative Cumulative Endpoints Midpoint Frequency Frequency Frequency Relative Freq TOTALS.000 6B. 00 Exercise #3b - Cumulative Rel. Frequency 80 Percent AGE Estimates are P 0 = 7 P 50 = 36 P 5 = 6 P 75 =.5

27 7 6C. Midpoint X j Frequency f j X j f j ( x x) ( ) j f j x j x MEAN x 6 Σ fx j j j= 680 = 6 = So x = Σ f 75 j= j 357. MEDIAN Note to reader I ve consulted a number of texts on this. There is no single correct answer. With interval data, whatever median you calculate is an approximation. Here is what is suggested in Think and Explain with Statistics (Lincoln E. Moses, page 6) First solve n = = 38 th observation Examination of the table reveals that the 38 th observation is in the interval 35 to.99 Set the following quantities: The letter l = lower limit of interval = 35 The letter u = upper limit of interval =.99 R = cumulative frequency up to the lower limit of interval = 35 M = # observations contained in interval = N = total # observations = 75 An approximate solution for the median is calculated as L NM ~ N/ - R x= l + ( u l) = 35 + M QP O L NM 75/ - 35 O QP ( ) = or 37

28 8 VARIANCE S j= = ( j ) 6 Σ f x x 6 j Σ j= f j = so S = Standard deviation S = S so S = 30. 7A. Remember to use the same scale. 3 median mean left box edge = P right box edge = P 75 IQR = P 75 -P P 5 -(.5)(IQR) P 75 +(.5)(IQR) left whisker 5 right whisker 0 3

29 9 7B. When data are skewed by extreme values, medians and quartiles give a better feel for the bulk of the data than do means and standard deviations. This example also illustrates that, as sample size increases, the range can only increase. Notice that the extreme value of 0 occurred in the sample with the largest sample size.

PubHlth 540 Fall Summarizing Data Page 1 of 18. Unit 1 - Summarizing Data Practice Problems. Solutions

PubHlth 540 Fall Summarizing Data Page 1 of 18. Unit 1 - Summarizing Data Practice Problems. Solutions PubHlth 50 Fall 0. Summarizing Data Page of 8 Unit - Summarizing Data Practice Problems Solutions #. a. Qualitative - ordinal b. Qualitative - nominal c. Quantitative continuous, ratio d. Qualitative -

More information

Practice problems from chapters 2 and 3

Practice problems from chapters 2 and 3 Practice problems from chapters and 3 Question-1. For each of the following variables, indicate whether it is quantitative or qualitative and specify which of the four levels of measurement (nominal, ordinal,

More information

Unit 1 Summarizing Data

Unit 1 Summarizing Data BIOSTATS 540 Fall 2018 1. Summarizing Page 1 of 47 Unit 1 Summarizing It is difficult to understand why statisticians commonly limit their enquiries to averages, and do not revel in more comprehensive

More information

What is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty.

What is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty. What is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty. Statistics is a field of study concerned with the data collection,

More information

CHAPTER 1. Introduction

CHAPTER 1. Introduction CHAPTER 1 Introduction Engineers and scientists are constantly exposed to collections of facts, or data. The discipline of statistics provides methods for organizing and summarizing data, and for drawing

More information

1. Let A and B be two events such that P(A)=0.6 and P(B)=0.6. Which of the following MUST be true?

1. Let A and B be two events such that P(A)=0.6 and P(B)=0.6. Which of the following MUST be true? 1 UNIVERSITY OF MASSACHUSETTS Department of Biostatistics and Epidemiology BioEpi 540W - Introduction to Biostatistics Fall 2004 Exercises with Solutions Topic 2 Introduction to Probability Due: Monday

More information

F78SC2 Notes 2 RJRC. If the interest rate is 5%, we substitute x = 0.05 in the formula. This gives

F78SC2 Notes 2 RJRC. If the interest rate is 5%, we substitute x = 0.05 in the formula. This gives F78SC2 Notes 2 RJRC Algebra It is useful to use letters to represent numbers. We can use the rules of arithmetic to manipulate the formula and just substitute in the numbers at the end. Example: 100 invested

More information

What is statistics? Statistics is the science of: Collecting information. Organizing and summarizing the information collected

What is statistics? Statistics is the science of: Collecting information. Organizing and summarizing the information collected What is statistics? Statistics is the science of: Collecting information Organizing and summarizing the information collected Analyzing the information collected in order to draw conclusions Two types

More information

UNIVERSITY OF MASSACHUSETTS Department of Biostatistics and Epidemiology BE540w - Introduction to Biostatistics Fall 2004

UNIVERSITY OF MASSACHUSETTS Department of Biostatistics and Epidemiology BE540w - Introduction to Biostatistics Fall 2004 Page 1 of 9 UNIVERSITY O MASSACHUSETTS Department of Biostatistics and Epidemiology BE540w - Introduction to Biostatistics all 2004 Exercises with Solutions Topic 5 Normal Distribution Due: Monday November

More information

Chapter 2: Tools for Exploring Univariate Data

Chapter 2: Tools for Exploring Univariate Data Stats 11 (Fall 2004) Lecture Note Introduction to Statistical Methods for Business and Economics Instructor: Hongquan Xu Chapter 2: Tools for Exploring Univariate Data Section 2.1: Introduction What is

More information

Using Microsoft Excel

Using Microsoft Excel Using Microsoft Excel Objective: Students will gain familiarity with using Excel to record data, display data properly, use built-in formulae to do calculations, and plot and fit data with linear functions.

More information

1. Exploratory Data Analysis

1. Exploratory Data Analysis 1. Exploratory Data Analysis 1.1 Methods of Displaying Data A visual display aids understanding and can highlight features which may be worth exploring more formally. Displays should have impact and be

More information

(quantitative or categorical variables) Numerical descriptions of center, variability, position (quantitative variables)

(quantitative or categorical variables) Numerical descriptions of center, variability, position (quantitative variables) 3. Descriptive Statistics Describing data with tables and graphs (quantitative or categorical variables) Numerical descriptions of center, variability, position (quantitative variables) Bivariate descriptions

More information

Summarising numerical data

Summarising numerical data 2 Core: Data analysis Chapter 2 Summarising numerical data 42 Core Chapter 2 Summarising numerical data 2A Dot plots and stem plots Even when we have constructed a frequency table, or a histogram to display

More information

Histograms allow a visual interpretation

Histograms allow a visual interpretation Chapter 4: Displaying and Summarizing i Quantitative Data s allow a visual interpretation of quantitative (numerical) data by indicating the number of data points that lie within a range of values, called

More information

University of Jordan Fall 2009/2010 Department of Mathematics

University of Jordan Fall 2009/2010 Department of Mathematics handouts Part 1 (Chapter 1 - Chapter 5) University of Jordan Fall 009/010 Department of Mathematics Chapter 1 Introduction to Introduction; Some Basic Concepts Statistics is a science related to making

More information

A is one of the categories into which qualitative data can be classified.

A is one of the categories into which qualitative data can be classified. Chapter 2 Methods for Describing Sets of Data 2.1 Describing qualitative data Recall qualitative data: non-numerical or categorical data Basic definitions: A is one of the categories into which qualitative

More information

BIOL 51A - Biostatistics 1 1. Lecture 1: Intro to Biostatistics. Smoking: hazardous? FEV (l) Smoke

BIOL 51A - Biostatistics 1 1. Lecture 1: Intro to Biostatistics. Smoking: hazardous? FEV (l) Smoke BIOL 51A - Biostatistics 1 1 Lecture 1: Intro to Biostatistics Smoking: hazardous? FEV (l) 1 2 3 4 5 No Yes Smoke BIOL 51A - Biostatistics 1 2 Box Plot a.k.a box-and-whisker diagram or candlestick chart

More information

Unit 1 Summarizing Data

Unit 1 Summarizing Data PubHtlth 540 Fall 2014 1. Summarizing Page 1 of 54 Unit 1 Summarizing It is difficult to understand why statisticians commonly limit their enquiries to averages, and do not revel in more comprehensive

More information

Unit 1 Summarizing Data

Unit 1 Summarizing Data BIOSTATS 540 Fall 2016 1. Summarizing Page 1 of 42 Unit 1 Summarizing It is difficult to understand why statisticians commonly limit their enquiries to averages, and do not revel in more comprehensive

More information

QUANTITATIVE DATA. UNIVARIATE DATA data for one variable

QUANTITATIVE DATA. UNIVARIATE DATA data for one variable QUANTITATIVE DATA Recall that quantitative (numeric) data values are numbers where data take numerical values for which it is sensible to find averages, such as height, hourly pay, and pulse rates. UNIVARIATE

More information

Chapter 6. The Standard Deviation as a Ruler and the Normal Model 1 /67

Chapter 6. The Standard Deviation as a Ruler and the Normal Model 1 /67 Chapter 6 The Standard Deviation as a Ruler and the Normal Model 1 /67 Homework Read Chpt 6 Complete Reading Notes Do P129 1, 3, 5, 7, 15, 17, 23, 27, 29, 31, 37, 39, 43 2 /67 Objective Students calculate

More information

Further Mathematics 2018 CORE: Data analysis Chapter 2 Summarising numerical data

Further Mathematics 2018 CORE: Data analysis Chapter 2 Summarising numerical data Chapter 2: Summarising numerical data Further Mathematics 2018 CORE: Data analysis Chapter 2 Summarising numerical data Extract from Study Design Key knowledge Types of data: categorical (nominal and ordinal)

More information

Example 2. Given the data below, complete the chart:

Example 2. Given the data below, complete the chart: Statistics 2035 Quiz 1 Solutions Example 1. 2 64 150 150 2 128 150 2 256 150 8 8 Example 2. Given the data below, complete the chart: 52.4, 68.1, 66.5, 75.0, 60.5, 78.8, 63.5, 48.9, 81.3 n=9 The data is

More information

MEASURING THE SPREAD OF DATA: 6F

MEASURING THE SPREAD OF DATA: 6F CONTINUING WITH DESCRIPTIVE STATS 6E,6F,6G,6H,6I MEASURING THE SPREAD OF DATA: 6F othink about this example: Suppose you are at a high school football game and you sample 40 people from the student section

More information

COMPLEMENTARY EXERCISES WITH DESCRIPTIVE STATISTICS

COMPLEMENTARY EXERCISES WITH DESCRIPTIVE STATISTICS COMPLEMENTARY EXERCISES WITH DESCRIPTIVE STATISTICS EX 1 Given the following series of data on Gender and Height for 8 patients, fill in two frequency tables one for each Variable, according to the model

More information

Units. Exploratory Data Analysis. Variables. Student Data

Units. Exploratory Data Analysis. Variables. Student Data Units Exploratory Data Analysis Bret Larget Departments of Botany and of Statistics University of Wisconsin Madison Statistics 371 13th September 2005 A unit is an object that can be measured, such as

More information

Review for Exam #1. Chapter 1. The Nature of Data. Definitions. Population. Sample. Quantitative data. Qualitative (attribute) data

Review for Exam #1. Chapter 1. The Nature of Data. Definitions. Population. Sample. Quantitative data. Qualitative (attribute) data Review for Exam #1 1 Chapter 1 Population the complete collection of elements (scores, people, measurements, etc.) to be studied Sample a subcollection of elements drawn from a population 11 The Nature

More information

Unit 1 Summarizing Data

Unit 1 Summarizing Data PubHtlth 540 Fall 2013 1. Summarizing Page 1 of 51 Unit 1 Summarizing It is difficult to understand why statisticians commonly limit their enquiries to averages, and do not revel in more comprehensive

More information

Description of Samples and Populations

Description of Samples and Populations Description of Samples and Populations Random Variables Data are generated by some underlying random process or phenomenon. Any datum (data point) represents the outcome of a random variable. We represent

More information

CIVL 7012/8012. Collection and Analysis of Information

CIVL 7012/8012. Collection and Analysis of Information CIVL 7012/8012 Collection and Analysis of Information Uncertainty in Engineering Statistics deals with the collection and analysis of data to solve real-world problems. Uncertainty is inherent in all real

More information

Using SPSS for One Way Analysis of Variance

Using SPSS for One Way Analysis of Variance Using SPSS for One Way Analysis of Variance This tutorial will show you how to use SPSS version 12 to perform a one-way, between- subjects analysis of variance and related post-hoc tests. This tutorial

More information

Unit 1 Summarizing Data

Unit 1 Summarizing Data PubHtlth 540 Fall 2011 1. Summarizing Data Page 1 of 51 Unit 1 Summarizing Data It is difficult to understand why statisticians commonly limit their enquiries to averages, and do not revel in more comprehensive

More information

Unit 1 Review of BIOSTATS 540 Practice Problems SOLUTIONS - Stata Users

Unit 1 Review of BIOSTATS 540 Practice Problems SOLUTIONS - Stata Users BIOSTATS 640 Spring 2017 Review of Introductory Biostatistics STATA solutions Page 1 of 16 Unit 1 Review of BIOSTATS 540 Practice Problems SOLUTIONS - Stata Users #1. The following table lists length of

More information

Experiment 0 ~ Introduction to Statistics and Excel Tutorial. Introduction to Statistics, Error and Measurement

Experiment 0 ~ Introduction to Statistics and Excel Tutorial. Introduction to Statistics, Error and Measurement Experiment 0 ~ Introduction to Statistics and Excel Tutorial Many of you already went through the introduction to laboratory practice and excel tutorial in Physics 1011. For that reason, we aren t going

More information

Statistics 1. Edexcel Notes S1. Mathematical Model. A mathematical model is a simplification of a real world problem.

Statistics 1. Edexcel Notes S1. Mathematical Model. A mathematical model is a simplification of a real world problem. Statistics 1 Mathematical Model A mathematical model is a simplification of a real world problem. 1. A real world problem is observed. 2. A mathematical model is thought up. 3. The model is used to make

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 Data Analysis and Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasini/teaching.html Suhasini Subba Rao Review Our objective: to make confident statements about a parameter (aspect) in

More information

Introduction to Statistics

Introduction to Statistics Introduction to Statistics Data and Statistics Data consists of information coming from observations, counts, measurements, or responses. Statistics is the science of collecting, organizing, analyzing,

More information

P8130: Biostatistical Methods I

P8130: Biostatistical Methods I P8130: Biostatistical Methods I Lecture 2: Descriptive Statistics Cody Chiuzan, PhD Department of Biostatistics Mailman School of Public Health (MSPH) Lecture 1: Recap Intro to Biostatistics Types of Data

More information

PS2.1 & 2.2: Linear Correlations PS2: Bivariate Statistics

PS2.1 & 2.2: Linear Correlations PS2: Bivariate Statistics PS2.1 & 2.2: Linear Correlations PS2: Bivariate Statistics LT1: Basics of Correlation LT2: Measuring Correlation and Line of best fit by eye Univariate (one variable) Displays Frequency tables Bar graphs

More information

Objective A: Mean, Median and Mode Three measures of central of tendency: the mean, the median, and the mode.

Objective A: Mean, Median and Mode Three measures of central of tendency: the mean, the median, and the mode. Chapter 3 Numerically Summarizing Data Chapter 3.1 Measures of Central Tendency Objective A: Mean, Median and Mode Three measures of central of tendency: the mean, the median, and the mode. A1. Mean The

More information

Exercises from Chapter 3, Section 1

Exercises from Chapter 3, Section 1 Exercises from Chapter 3, Section 1 1. Consider the following sample consisting of 20 numbers. (a) Find the mode of the data 21 23 24 24 25 26 29 30 32 34 39 41 41 41 42 43 48 51 53 53 (b) Find the median

More information

Contents. 13. Graphs of Trigonometric Functions 2 Example Example

Contents. 13. Graphs of Trigonometric Functions 2 Example Example Contents 13. Graphs of Trigonometric Functions 2 Example 13.19............................... 2 Example 13.22............................... 5 1 Peterson, Technical Mathematics, 3rd edition 2 Example 13.19

More information

Lecture 1 : Basic Statistical Measures

Lecture 1 : Basic Statistical Measures Lecture 1 : Basic Statistical Measures Jonathan Marchini October 11, 2004 In this lecture we will learn about different types of data encountered in practice different ways of plotting data to explore

More information

3.1 Measure of Center

3.1 Measure of Center 3.1 Measure of Center Calculate the mean for a given data set Find the median, and describe why the median is sometimes preferable to the mean Find the mode of a data set Describe how skewness affects

More information

1 Introduction to Minitab

1 Introduction to Minitab 1 Introduction to Minitab Minitab is a statistical analysis software package. The software is freely available to all students and is downloadable through the Technology Tab at my.calpoly.edu. When you

More information

Math 14 Lecture Notes Ch Percentile

Math 14 Lecture Notes Ch Percentile .3 Measures of the Location of the Data Percentile g A measure of position, the percentile, p, is an integer (1 p 99) such that the p th percentile is the position of a data value where p% of the data

More information

Lecture 2. Quantitative variables. There are three main graphical methods for describing, summarizing, and detecting patterns in quantitative data:

Lecture 2. Quantitative variables. There are three main graphical methods for describing, summarizing, and detecting patterns in quantitative data: Lecture 2 Quantitative variables There are three main graphical methods for describing, summarizing, and detecting patterns in quantitative data: Stemplot (stem-and-leaf plot) Histogram Dot plot Stemplots

More information

AP Final Review II Exploring Data (20% 30%)

AP Final Review II Exploring Data (20% 30%) AP Final Review II Exploring Data (20% 30%) Quantitative vs Categorical Variables Quantitative variables are numerical values for which arithmetic operations such as means make sense. It is usually a measure

More information

MATH 1150 Chapter 2 Notation and Terminology

MATH 1150 Chapter 2 Notation and Terminology MATH 1150 Chapter 2 Notation and Terminology Categorical Data The following is a dataset for 30 randomly selected adults in the U.S., showing the values of two categorical variables: whether or not the

More information

The empirical ( ) rule

The empirical ( ) rule The empirical (68-95-99.7) rule With a bell shaped distribution, about 68% of the data fall within a distance of 1 standard deviation from the mean. 95% fall within 2 standard deviations of the mean. 99.7%

More information

1-1. Chapter 1. Sampling and Descriptive Statistics by The McGraw-Hill Companies, Inc. All rights reserved.

1-1. Chapter 1. Sampling and Descriptive Statistics by The McGraw-Hill Companies, Inc. All rights reserved. 1-1 Chapter 1 Sampling and Descriptive Statistics 1-2 Why Statistics? Deal with uncertainty in repeated scientific measurements Draw conclusions from data Design valid experiments and draw reliable conclusions

More information

STP 420 INTRODUCTION TO APPLIED STATISTICS NOTES

STP 420 INTRODUCTION TO APPLIED STATISTICS NOTES INTRODUCTION TO APPLIED STATISTICS NOTES PART - DATA CHAPTER LOOKING AT DATA - DISTRIBUTIONS Individuals objects described by a set of data (people, animals, things) - all the data for one individual make

More information

BIOS 2041: Introduction to Statistical Methods

BIOS 2041: Introduction to Statistical Methods BIOS 2041: Introduction to Statistical Methods Abdus S Wahed* *Some of the materials in this chapter has been adapted from Dr. John Wilson s lecture notes for the same course. Chapter 0 2 Chapter 1 Introduction

More information

Lecture 6: Chapter 4, Section 2 Quantitative Variables (Displays, Begin Summaries)

Lecture 6: Chapter 4, Section 2 Quantitative Variables (Displays, Begin Summaries) Lecture 6: Chapter 4, Section 2 Quantitative Variables (Displays, Begin Summaries) Summarize with Shape, Center, Spread Displays: Stemplots, Histograms Five Number Summary, Outliers, Boxplots Cengage Learning

More information

STAT 200 Chapter 1 Looking at Data - Distributions

STAT 200 Chapter 1 Looking at Data - Distributions STAT 200 Chapter 1 Looking at Data - Distributions What is Statistics? Statistics is a science that involves the design of studies, data collection, summarizing and analyzing the data, interpreting the

More information

Motion II. Goals and Introduction

Motion II. Goals and Introduction Motion II Goals and Introduction As you have probably already seen in lecture or homework, and if you ve performed the experiment Motion I, it is important to develop a strong understanding of how to model

More information

LAB 3 INSTRUCTIONS SIMPLE LINEAR REGRESSION

LAB 3 INSTRUCTIONS SIMPLE LINEAR REGRESSION LAB 3 INSTRUCTIONS SIMPLE LINEAR REGRESSION In this lab you will first learn how to display the relationship between two quantitative variables with a scatterplot and also how to measure the strength of

More information

Chapter 2 Class Notes Sample & Population Descriptions Classifying variables

Chapter 2 Class Notes Sample & Population Descriptions Classifying variables Chapter 2 Class Notes Sample & Population Descriptions Classifying variables Random Variables (RVs) are discrete quantitative continuous nominal qualitative ordinal Notation and Definitions: a Sample is

More information

MATH 117 Statistical Methods for Management I Chapter Three

MATH 117 Statistical Methods for Management I Chapter Three Jubail University College MATH 117 Statistical Methods for Management I Chapter Three This chapter covers the following topics: I. Measures of Center Tendency. 1. Mean for Ungrouped Data (Raw Data) 2.

More information

Chapter 3. Measuring data

Chapter 3. Measuring data Chapter 3 Measuring data 1 Measuring data versus presenting data We present data to help us draw meaning from it But pictures of data are subjective They re also not susceptible to rigorous inference Measuring

More information

Describing distributions with numbers

Describing distributions with numbers Describing distributions with numbers A large number or numerical methods are available for describing quantitative data sets. Most of these methods measure one of two data characteristics: The central

More information

Elementary Statistics

Elementary Statistics Elementary Statistics Q: What is data? Q: What does the data look like? Q: What conclusions can we draw from the data? Q: Where is the middle of the data? Q: Why is the spread of the data important? Q:

More information

are the objects described by a set of data. They may be people, animals or things.

are the objects described by a set of data. They may be people, animals or things. ( c ) E p s t e i n, C a r t e r a n d B o l l i n g e r 2016 C h a p t e r 5 : E x p l o r i n g D a t a : D i s t r i b u t i o n s P a g e 1 CHAPTER 5: EXPLORING DATA DISTRIBUTIONS 5.1 Creating Histograms

More information

Chapter 6 The Standard Deviation as a Ruler and the Normal Model

Chapter 6 The Standard Deviation as a Ruler and the Normal Model Chapter 6 The Standard Deviation as a Ruler and the Normal Model Overview Key Concepts Understand how adding (subtracting) a constant or multiplying (dividing) by a constant changes the center and/or spread

More information

Investigating Factors that Influence Climate

Investigating Factors that Influence Climate Investigating Factors that Influence Climate Description In this lesson* students investigate the climate of a particular latitude and longitude in North America by collecting real data from My NASA Data

More information

STATISTICS 1 REVISION NOTES

STATISTICS 1 REVISION NOTES STATISTICS 1 REVISION NOTES Statistical Model Representing and summarising Sample Data Key words: Quantitative Data This is data in NUMERICAL FORM such as shoe size, height etc. Qualitative Data This is

More information

MATH 10 INTRODUCTORY STATISTICS

MATH 10 INTRODUCTORY STATISTICS MATH 10 INTRODUCTORY STATISTICS Tommy Khoo Your friendly neighbourhood graduate student. Week 1 Chapter 1 Introduction What is Statistics? Why do you need to know Statistics? Technical lingo and concepts:

More information

Using Tables and Graphing Calculators in Math 11

Using Tables and Graphing Calculators in Math 11 Using Tables and Graphing Calculators in Math 11 Graphing calculators are not required for Math 11, but they are likely to be helpful, primarily because they allow you to avoid the use of tables in some

More information

1.3: Describing Quantitative Data with Numbers

1.3: Describing Quantitative Data with Numbers 1.3: Describing Quantitative Data with Numbers Section 1.3 Describing Quantitative Data with Numbers After this section, you should be able to MEASURE center with the mean and median MEASURE spread with

More information

Shape, Outliers, Center, Spread Frequency and Relative Histograms Related to other types of graphical displays

Shape, Outliers, Center, Spread Frequency and Relative Histograms Related to other types of graphical displays Histograms: Shape, Outliers, Center, Spread Frequency and Relative Histograms Related to other types of graphical displays Sep 9 1:13 PM Shape: Skewed left Bell shaped Symmetric Bi modal Symmetric Skewed

More information

MATH 1015: Life Science Statistics. Lecture Pack for Chapter 1 Weeks 1-3. Lecturer: Jennifer Chan Room: Carslaw Room 817 Telephone:

MATH 1015: Life Science Statistics. Lecture Pack for Chapter 1 Weeks 1-3. Lecturer: Jennifer Chan Room: Carslaw Room 817 Telephone: MATH 1015: Life Science Statistics Lecture Pack for Chapter 1 Weeks 1-3. Lecturer: Jennifer Chan Room: Carslaw Room 817 Telephone: 9351 4873. Text: Phipps, M. and Quine, M. (2001) A Primer of Statistics

More information

IB Questionbank Mathematical Studies 3rd edition. Grouped discrete. 184 min 183 marks

IB Questionbank Mathematical Studies 3rd edition. Grouped discrete. 184 min 183 marks IB Questionbank Mathematical Studies 3rd edition Grouped discrete 184 min 183 marks 1. The weights in kg, of 80 adult males, were collected and are summarized in the box and whisker plot shown below. Write

More information

This is particularly true if you see long tails in your data. What are you testing? That the two distributions are the same!

This is particularly true if you see long tails in your data. What are you testing? That the two distributions are the same! Two sample tests (part II): What to do if your data are not distributed normally: Option 1: if your sample size is large enough, don't worry - go ahead and use a t-test (the CLT will take care of non-normal

More information

BNG 495 Capstone Design. Descriptive Statistics

BNG 495 Capstone Design. Descriptive Statistics BNG 495 Capstone Design Descriptive Statistics Overview The overall goal of this short course in statistics is to provide an introduction to descriptive and inferential statistical methods, with a focus

More information

Physics E-1ax, Fall 2014 Experiment 3. Experiment 3: Force. 2. Find your center of mass by balancing yourself on two force plates.

Physics E-1ax, Fall 2014 Experiment 3. Experiment 3: Force. 2. Find your center of mass by balancing yourself on two force plates. Learning Goals Experiment 3: Force After you finish this lab, you will be able to: 1. Use Logger Pro to analyze video and calculate position, velocity, and acceleration. 2. Find your center of mass by

More information

Tastitsticsss? What s that? Principles of Biostatistics and Informatics. Variables, outcomes. Tastitsticsss? What s that?

Tastitsticsss? What s that? Principles of Biostatistics and Informatics. Variables, outcomes. Tastitsticsss? What s that? Tastitsticsss? What s that? Statistics describes random mass phanomenons. Principles of Biostatistics and Informatics nd Lecture: Descriptive Statistics 3 th September Dániel VERES Data Collecting (Sampling)

More information

1.3.1 Measuring Center: The Mean

1.3.1 Measuring Center: The Mean 1.3.1 Measuring Center: The Mean Mean - The arithmetic average. To find the mean (pronounced x bar) of a set of observations, add their values and divide by the number of observations. If the n observations

More information

6 THE NORMAL DISTRIBUTION

6 THE NORMAL DISTRIBUTION CHAPTER 6 THE NORMAL DISTRIBUTION 341 6 THE NORMAL DISTRIBUTION Figure 6.1 If you ask enough people about their shoe size, you will find that your graphed data is shaped like a bell curve and can be described

More information

Chapter 6 Group Activity - SOLUTIONS

Chapter 6 Group Activity - SOLUTIONS Chapter 6 Group Activity - SOLUTIONS Group Activity Summarizing a Distribution 1. The following data are the number of credit hours taken by Math 105 students during a summer term. You will be analyzing

More information

OCR Maths S1. Topic Questions from Papers. Representation of Data

OCR Maths S1. Topic Questions from Papers. Representation of Data OCR Maths S1 Topic Questions from Papers Representation of Data PhysicsAndMathsTutor.com 12 The back-to-back stem-and-leaf diagram below shows the number of hours of television watched per week by each

More information

Stat 101 Exam 1 Important Formulas and Concepts 1

Stat 101 Exam 1 Important Formulas and Concepts 1 1 Chapter 1 1.1 Definitions Stat 101 Exam 1 Important Formulas and Concepts 1 1. Data Any collection of numbers, characters, images, or other items that provide information about something. 2. Categorical/Qualitative

More information

Statistics in medicine

Statistics in medicine Statistics in medicine Lecture 1- part 1: Describing variation, and graphical presentation Outline Sources of variation Types of variables Fatma Shebl, MD, MS, MPH, PhD Assistant Professor Chronic Disease

More information

Math 7 /Unit 5 Practice Test: Statistics

Math 7 /Unit 5 Practice Test: Statistics Math 7 /Unit 5 Practice Test: Statistics Name: Date: Define the terms below and give an example. 1. population 2. random sample 3. interquartile range (IQR) 4. Determine whether each sample is a random

More information

Describing Distributions

Describing Distributions Describing Distributions With Numbers April 18, 2012 Summary Statistics. Measures of Center. Percentiles. Measures of Spread. A Summary Statement. Choosing Numerical Summaries. 1.0 What Are Summary Statistics?

More information

Descriptive Statistics

Descriptive Statistics Descriptive Statistics CHAPTER OUTLINE 6-1 Numerical Summaries of Data 6- Stem-and-Leaf Diagrams 6-3 Frequency Distributions and Histograms 6-4 Box Plots 6-5 Time Sequence Plots 6-6 Probability Plots Chapter

More information

Describing distributions with numbers

Describing distributions with numbers Describing distributions with numbers A large number or numerical methods are available for describing quantitative data sets. Most of these methods measure one of two data characteristics: The central

More information

Continuous random variables

Continuous random variables Continuous random variables A continuous random variable X takes all values in an interval of numbers. The probability distribution of X is described by a density curve. The total area under a density

More information

Lecture 3B: Chapter 4, Section 2 Quantitative Variables (Displays, Begin Summaries)

Lecture 3B: Chapter 4, Section 2 Quantitative Variables (Displays, Begin Summaries) Lecture 3B: Chapter 4, Section 2 Quantitative Variables (Displays, Begin Summaries) Summarize with Shape, Center, Spread Displays: Stemplots, Histograms Five Number Summary, Outliers, Boxplots Mean vs.

More information

Chapter 2 Solutions Page 15 of 28

Chapter 2 Solutions Page 15 of 28 Chapter Solutions Page 15 of 8.50 a. The median is 55. The mean is about 105. b. The median is a more representative average" than the median here. Notice in the stem-and-leaf plot on p.3 of the text that

More information

STT 315 This lecture is based on Chapter 2 of the textbook.

STT 315 This lecture is based on Chapter 2 of the textbook. STT 315 This lecture is based on Chapter 2 of the textbook. Acknowledgement: Author is thankful to Dr. Ashok Sinha, Dr. Jennifer Kaplan and Dr. Parthanil Roy for allowing him to use/edit some of their

More information

Please bring the task to your first physics lesson and hand it to the teacher.

Please bring the task to your first physics lesson and hand it to the teacher. Pre-enrolment task for 2014 entry Physics Why do I need to complete a pre-enrolment task? This bridging pack serves a number of purposes. It gives you practice in some of the important skills you will

More information

Lecture 1: Description of Data. Readings: Sections 1.2,

Lecture 1: Description of Data. Readings: Sections 1.2, Lecture 1: Description of Data Readings: Sections 1.,.1-.3 1 Variable Example 1 a. Write two complete and grammatically correct sentences, explaining your primary reason for taking this course and then

More information

Resistant Measure - A statistic that is not affected very much by extreme observations.

Resistant Measure - A statistic that is not affected very much by extreme observations. Chapter 1.3 Lecture Notes & Examples Section 1.3 Describing Quantitative Data with Numbers (pp. 50-74) 1.3.1 Measuring Center: The Mean Mean - The arithmetic average. To find the mean (pronounced x bar)

More information

Determining the Spread of a Distribution

Determining the Spread of a Distribution Determining the Spread of a Distribution 1.3-1.5 Cathy Poliak, Ph.D. cathy@math.uh.edu Department of Mathematics University of Houston Lecture 3-2311 Lecture 3-2311 1 / 58 Outline 1 Describing Quantitative

More information

Determining the Spread of a Distribution

Determining the Spread of a Distribution Determining the Spread of a Distribution 1.3-1.5 Cathy Poliak, Ph.D. cathy@math.uh.edu Department of Mathematics University of Houston Lecture 3-2311 Lecture 3-2311 1 / 58 Outline 1 Describing Quantitative

More information

Math 223 Lecture Notes 3/15/04 From The Basic Practice of Statistics, bymoore

Math 223 Lecture Notes 3/15/04 From The Basic Practice of Statistics, bymoore Math 223 Lecture Notes 3/15/04 From The Basic Practice of Statistics, bymoore Chapter 3 continued Describing distributions with numbers Measuring spread of data: Quartiles Definition 1: The interquartile

More information

Lecture Notes for BUSINESS STATISTICS - BMGT 571. Chapters 1 through 6. Professor Ahmadi, Ph.D. Department of Management

Lecture Notes for BUSINESS STATISTICS - BMGT 571. Chapters 1 through 6. Professor Ahmadi, Ph.D. Department of Management Lecture Notes for BUSINESS STATISTICS - BMGT 571 Chapters 1 through 6 Professor Ahmadi, Ph.D. Department of Management Revised May 005 Glossary of Terms: Statistics Chapter 1 Data Data Set Elements Variable

More information

Chapter 3. Data Description

Chapter 3. Data Description Chapter 3. Data Description Graphical Methods Pie chart It is used to display the percentage of the total number of measurements falling into each of the categories of the variable by partition a circle.

More information

LABORATORY 1: KINEMATICS written by Melissa J. Wafer '95 June 1993

LABORATORY 1: KINEMATICS written by Melissa J. Wafer '95 June 1993 LABORATORY 1: KINEMATICS written by Melissa J. Wafer '95 June 1993 The purpose of this exercise is to re-enforce what you have learned about kinematics in class and to familiarize you with computer resources

More information