Clinical Research Module: Biostatistics
|
|
- Eustace Lewis French
- 5 years ago
- Views:
Transcription
1 Clinical Research Module: Biostatistics Lecture 1 Alberto Nettel-Aguirre, PhD, PStat These lecture notes based on others developed by Drs. Peter Faris, Sarah Rose Luz Palacios-Derflingher and myself
2 Who am I? Alberto Nettel-Aguirre, PhD, PStat Assocciate professor with the Departments of Paediatrics and Community Health Sciences, Adjunct Professor Faculty of Kinesiology Statistician with the Research Methods Team Biostats, Social network Analysis, Statistical Learning, Functional Data Analysis; injury prevention, Nephro, Neo collaborations.
3 Material Covered in the Biostatistics Lectures Lecture 1A- Background material: Why statistics? Variables and scales of measurement Distributions and measures of central tendency and dispersion Data presentation
4 Why Statistics? Statistics provides a means of evaluating evidence from medical research. Allows one to address questions such as: How likely are the effects observed in a study not due to chance? Can we trust the validity of conclusions? How accurate are the estimates provided by the research? Were the researchers conclusions trustworthy, given the number of patients they used to test their hypotheses?
5 A field of study concerned with the 1. Collection, organization, summarization, and analysis of data; 2. Drawing of inferences about a body of data (population) when only a part of the data is observed (sample) Statistics
6 Population vs. Sample Population the collection of all individuals or items under consideration in a study. Sample a part of the population from which information is obtained. Population Sample
7 Why are Statistics Important in health research? Statistical Analysis (correctly done) is a vital part of good research Statistics are needed to draw conclusions from any data you have collected Your statistical analysis plan should be worked into the design of your study from the beginning Often after you have collected your data it is too late to change your analysis plan
8 Sampling Designs Probabilistic designs Simple Random Sampling Systematic Sampling Cluster Sampling Stratified Sampling Non-probabilistic designs Convenience Ad hoc quota Volunteer Purposive Snowball sampling
9 Statistical Analysis Methods Describe the descriptive statistics used to describe the population Assuming you have a representative sample of the population Identify each statistical test or method used for each comparison, in the order in which they were addressed Cite a reference for complex or uncommon statistical tests used to analyze the data If necessary more detail can be provided online in a supplement
10 Statistical Analysis Methods Describe how you checked the assumptions of your statistical methods. Provide a sample size calculation (which should have been done at the proposal stage)
11 Statistical Analysis Methods Prior to Report Careful Graphical and Exploratory Analysis Aids in data cleaning and the decision about the appropriateness of different statistical methods In the Report Descriptive Statistics (informs the reader about the participants that you are examining) Comparison of individual variables between two groups Multivariable Analysis (another time)
12 Material Covered in the Biostatistics Lectures Lecture 1A- Background material: Why biostatistics? Variables and scales of measurement Distributions and measures of central tendency and dispersion Data presentation
13 Types of Measures To use statistical methods we make use of quantifying our observations Sometimes this is simple: e.g. blood pressure, height, weight, body mass index. Quantification can present challenges: e.g. how should we quantify tissue damage? Socioeconomic status? We must also decide on an appropriate way to measure to be able to assess an effect Proportion of successes? Number of relapses per patients? Difference in means? Comparison of changes?
14 Types of Measures We then have to choose appropriate methods to display data and perform statistical tests. The statistical methods we use will depend on the scale of measurement whether the variable is measured on a categorical (nominal or ordinal), numerical (interval or ratio) scale and the design of the study. In the case of numerical, we have two types of variables: continuous and discrete.
15 Types of Data Categorical data(categories) Nominal variables (names): Labels used to identify an attribute of each element two or more categories, no ordering Ordinal variables (order): Have the properties of nominal data and can be used to rank or order the observations for the variable two or more categories, ordered in magnitude Difference between successive not necessarily the same. Numerical data Discrete: can only take on certain numerical values Continuous: Interval : can take any numerical value within a range Ratio data:have all the properties of interval data, and the ratio of two observations is meaningful
16 Nominal Data Examples of Nominal Variables binary or dichotomous variables such as male/female, right or left side injuries Categories of disease, where there is no rationale for ordering the diseases. e.g. Pneumonia, asthma, emphysema
17 Ordinal Data Examples of ordinal variables: Level of severity: fatal, severe, moderate, minor. Degree of difficulty: difficult, doable, easy. Student grades A = excellent understanding, B = Adequate understanding, C = Passable.
18 Discrete Numerical Data Counts of events: Number of children Number of injuries Number of episodes of hypoglycaemia
19 Interval Data Interval variables have properties of order and magnitude. The magnitude between adjacent values are the same (with respect to the attribute being measured) regardless of where these are in the scale. e.g. the difference in kinetic energy between temperatures of 22 and 23 Celsius is the same as that between temperatures of 30 and 31 (1 degree celsius). Dates can be considered as an interval variable. Interval variables are continuous variables that can take on (theoreticaly) any value within a range.
20 Ratio Variables Ratio variables have the properties of order and magnitude, but also have the property that zero indicates the absence of the attribute. Examples include: Weight, height, serum values, age, time to an event, temperature in Kelvin. Interval and ratio variables can be turned into nominal and ordinal variables (careful, loss of info sometimes)
21 Special cases of variables: Dichotomous (Binary) Variables Dichotomous variables have two possible values. These values may/may not reflect the magnitude of some underlying characteristic. e.g. gender (Male/Female), mortality (Alive/Dead), age (Young/Old), diagnosis (Yes/No).
22 Examples: Types of Data Time from treatment to death Serum folate Blood pressure Number of relapses per patient Death Sex Quality of life IQ 7 point question re: satisfaction with treatment
23 Tables / Summaries Categorical data: Summarizing Data Frequencies, Percentages/proportions Numerical data: Measures of central tendency, measures of dispersion (mean or median, SD or IQR (in general))
24 Graphs Categorical data: Summarizing Data Bar charts: the height of each bar shows the frequency (or percentage) per category. The bars don't need to be contiguous (stay away from stacked). Numerical data: Histograms: Classes or bins with group values within a continuous range. The number (percentage or density) in the group is the height of the bar. The bars need to be contiguous. Boxplots: 3 number summary plus outlier candidates for numerical data
25 Material Covered in the Biostatistics Lectures Lecture 1A- Background material: Why biostatistics? Variables and scales of measurement Distributions and measures of central tendency and dispersion Data presentation
26 Probability Distributions Think first of material distribution (boxes of gloves per ward, swabs, etc) Each ward gets a certain amount of material In probability distributions, each possible event gets/has a certain amount of probability (chance) of happening.
27 Features of Distributions Central tendency or location parameters The value that observations are centred around. Dispersion how spread out the observations are in the distribution (sometimes around the centre of). Skewness are observations symmetrically distributed? Do they tend to extend farther to the left or to the right?
28 Measures of Central Tendency (location) Mean average value sum of values divided by the number in the sample Median Value at which 50% of the observations have been accumulated. To calculate: rank data, and find the value for which 50% of observations fall above and 50% below, or use rank formula Mode most frequently observed value
29 Calculations (for completeness) Data: 32, 42, 46, 46, 54 Mean: ( )/5 = 44 Median(via rank formula): Index i = (p th percentile*#observations)/ th percentile*5 observations/100=2.5 If i is not an integer, next value is pth percentile If i is an integer, average i th and i th +1 Median=46 Mode: most frequent value = 46
30 Using Measures of Central Tendency Mean a good representation for a sample when data aren t skewed and no extreme data present. Highly influenced by extreme values. If symmetric distribution, mean median.» Median Better representation when data are skewed or when there are outliers. Robust to extreme values. Mode Check for bimodal distribution. Not useful for small samples.
31 Cont. IF Data now : 32, 42, 46, 46, 254 Mean: ( )/5 = 84 Median: Median= STILL 46 Mode: most frequent value =
32 Effects of Skewness on Measures of Central Tendency
33 The data sets have the same Mean, Median, and Mode yet clearly differ! Measures of Variation or Measures of Spread
34 Measures of dispersion Range Difference between largest and smallest values Interquartile range Difference between the third quartile (75 th percentile) and first quartile (25 th percentile) central 50% of the data Sample variance sum of squared deviations from the mean divided by n-1 For sample, denoted as S 2 Standard deviation square root of the variance For sample, denoted as S
35 Calculating the variance and standard deviation (X=total cholesterol) Subject (i) X
36 Calculating the variance and standard deviation (X=total cholesterol) X (X- X)
37 Calculating the variance and standard deviation (X=total cholesterol) i X X (X- X) 2 (X- X) å X n 1 = å X n i= 1 i = 4.99 S 2 = n å i= 1 ( X i n -1 - X ) 2 = 0.87 S = 0.87 = 0.93
38 Material Covered in the Biostatistics Lectures Lecture 1- Background material: Why biostatistics? Variables and scales of measurement Measures of central tendency and dispersion Data presentation
39 Some plots for numerical data: Good graphical methods are essential for understanding and describing data Histograms: Bars representing frequency or percentages of data observed within values or groups of values. Boxplots: 3 number summary plus outlier identification 25%, 50%, 75% Best for visually comparing two or more groups
40 Frequency Distribution of Immunoglobulin IgM data for 298 healthy children (age 2-6 yrs) Class Frequency Class Frequency 0.1 to < to < to < to < to < to < to < to < to < to < to < to < to < to < to < to < to < to < to < to < to < to < to < to < to < to < to < to < Data from Altman, 1995, Practical Statistics for Medical Research, Table 3.2, p. 23
41 Histogram of IgM (g/l) Frequency IgM (g/l)
42 CTFC In coronary angiography, the corrected TIMI frame count (CTFC) is the number of frames required for dye to reach a standardized distal landmark. The following boxplots show the CTFC before and after angioplasty in approximately 1000 patients. Note the skewness in the distributions.
43 The Box and Whisker Plot: Anatomy Birth Weight (grams) 1,000 2,000 3,000 4,000 5,000 Box Outliers Whiskers Outliers
44 CTFC pre and post-angioplasty difference in CTFC CTFC Post-CTFC - pre-ctfc Pre-pci Post-pci
45 Boxplots The top of the box is the 75 th percentile (i.e. 75% of the observations fall below this value) The bottom of the box is the 25 th percentile. Hence the box has the middle 50% of data. The line in the middle of the box is the median. Points above and below the whiskers are considered to be outliers. These outliers are not representative of the general distribution of values.
46 Displaying Categorical Data Tables / Summaries Frequencies Percentages/proportions Tabulations and cross-tabulations Graphs Bar charts: the height of each bar shows the frequency (or percentage) per category. The bars don't need to be contiguous.
47 Displaying Categorical Data - Example In a study, there are 747 men and 434 women. Of these, 276 men and 195 women had cardiac arrests. Gender Frequency % Male Female Total Cardiac arrest Frequency % Yes No Total
48 Displaying Categorical Data - Example N Row % Cardiac arrest No cardiac arrest Total Col % Male Female Total
49 Example 2
50 DO NOT!! Mean Hba1c Use Barplots with SD for numerical Data. A bar is useful when it represents accumulation as in a histogram (% or frequency from 0 up) not when it is supposed to represent actual numbers.
51 Stay away!!! From stacked bar graphs. The idea of a graph is to make comparison easy and not repeat all numbers, otherwise use a table. In stacked bars you can not easily tell different heights of different bars, whereas in side by side you can. 70% 60% 50% 40% 30% Banana Orange Apples 20% 10% 0% Sandra Susie Jamie Jorge Alberto
52 Stay away!! From adding dimensions that are not real dimensions and distort the graph. From Pie charts; you usually end up having to add numbers to each slice as the slices are not easily comparable. Hence it is just a colored disarrayed table!!!!
BIOL 51A - Biostatistics 1 1. Lecture 1: Intro to Biostatistics. Smoking: hazardous? FEV (l) Smoke
BIOL 51A - Biostatistics 1 1 Lecture 1: Intro to Biostatistics Smoking: hazardous? FEV (l) 1 2 3 4 5 No Yes Smoke BIOL 51A - Biostatistics 1 2 Box Plot a.k.a box-and-whisker diagram or candlestick chart
More informationWhat is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty.
What is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty. Statistics is a field of study concerned with the data collection,
More informationSTAT 200 Chapter 1 Looking at Data - Distributions
STAT 200 Chapter 1 Looking at Data - Distributions What is Statistics? Statistics is a science that involves the design of studies, data collection, summarizing and analyzing the data, interpreting the
More informationCIVL 7012/8012. Collection and Analysis of Information
CIVL 7012/8012 Collection and Analysis of Information Uncertainty in Engineering Statistics deals with the collection and analysis of data to solve real-world problems. Uncertainty is inherent in all real
More informationP8130: Biostatistical Methods I
P8130: Biostatistical Methods I Lecture 2: Descriptive Statistics Cody Chiuzan, PhD Department of Biostatistics Mailman School of Public Health (MSPH) Lecture 1: Recap Intro to Biostatistics Types of Data
More informationTastitsticsss? What s that? Principles of Biostatistics and Informatics. Variables, outcomes. Tastitsticsss? What s that?
Tastitsticsss? What s that? Statistics describes random mass phanomenons. Principles of Biostatistics and Informatics nd Lecture: Descriptive Statistics 3 th September Dániel VERES Data Collecting (Sampling)
More informationChapter 2: Tools for Exploring Univariate Data
Stats 11 (Fall 2004) Lecture Note Introduction to Statistical Methods for Business and Economics Instructor: Hongquan Xu Chapter 2: Tools for Exploring Univariate Data Section 2.1: Introduction What is
More informationChapter 1. Looking at Data
Chapter 1 Looking at Data Types of variables Looking at Data Be sure that each variable really does measure what you want it to. A poor choice of variables can lead to misleading conclusions!! For example,
More informationStatistics in medicine
Statistics in medicine Lecture 1- part 1: Describing variation, and graphical presentation Outline Sources of variation Types of variables Fatma Shebl, MD, MS, MPH, PhD Assistant Professor Chronic Disease
More informationECLT 5810 Data Preprocessing. Prof. Wai Lam
ECLT 5810 Data Preprocessing Prof. Wai Lam Why Data Preprocessing? Data in the real world is imperfect incomplete: lacking attribute values, lacking certain attributes of interest, or containing only aggregate
More informationUnits. Exploratory Data Analysis. Variables. Student Data
Units Exploratory Data Analysis Bret Larget Departments of Botany and of Statistics University of Wisconsin Madison Statistics 371 13th September 2005 A unit is an object that can be measured, such as
More informationDescriptive statistics
Patrick Breheny February 6 Patrick Breheny to Biostatistics (171:161) 1/25 Tables and figures Human beings are not good at sifting through large streams of data; we understand data much better when it
More informationDescriptive Univariate Statistics and Bivariate Correlation
ESC 100 Exploring Engineering Descriptive Univariate Statistics and Bivariate Correlation Instructor: Sudhir Khetan, Ph.D. Wednesday/Friday, October 17/19, 2012 The Central Dogma of Statistics used to
More information1-1. Chapter 1. Sampling and Descriptive Statistics by The McGraw-Hill Companies, Inc. All rights reserved.
1-1 Chapter 1 Sampling and Descriptive Statistics 1-2 Why Statistics? Deal with uncertainty in repeated scientific measurements Draw conclusions from data Design valid experiments and draw reliable conclusions
More informationLast Lecture. Distinguish Populations from Samples. Knowing different Sampling Techniques. Distinguish Parameters from Statistics
Last Lecture Distinguish Populations from Samples Importance of identifying a population and well chosen sample Knowing different Sampling Techniques Distinguish Parameters from Statistics Knowing different
More informationWhat is statistics? Statistics is the science of: Collecting information. Organizing and summarizing the information collected
What is statistics? Statistics is the science of: Collecting information Organizing and summarizing the information collected Analyzing the information collected in order to draw conclusions Two types
More informationChapter 2 Class Notes Sample & Population Descriptions Classifying variables
Chapter 2 Class Notes Sample & Population Descriptions Classifying variables Random Variables (RVs) are discrete quantitative continuous nominal qualitative ordinal Notation and Definitions: a Sample is
More informationDiscrete Multivariate Statistics
Discrete Multivariate Statistics Univariate Discrete Random variables Let X be a discrete random variable which, in this module, will be assumed to take a finite number of t different values which are
More informationFundamentals to Biostatistics. Prof. Chandan Chakraborty Associate Professor School of Medical Science & Technology IIT Kharagpur
Fundamentals to Biostatistics Prof. Chandan Chakraborty Associate Professor School of Medical Science & Technology IIT Kharagpur Statistics collection, analysis, interpretation of data development of new
More informationBiostatistics for biomedical profession. BIMM34 Karin Källen & Linda Hartman November-December 2015
Biostatistics for biomedical profession BIMM34 Karin Källen & Linda Hartman November-December 2015 12015-11-02 Who needs a course in biostatistics? - Anyone who uses quntitative methods to interpret biological
More informationExploring, summarizing and presenting data. Berghold, IMI, MUG
Exploring, summarizing and presenting data Example Patient Nr Gender Age Weight Height PAVK-Grade W alking Distance Physical Functioning Scale Total Cholesterol Triglycerides 01 m 65 90 185 II b 200 70
More informationStat 101 Exam 1 Important Formulas and Concepts 1
1 Chapter 1 1.1 Definitions Stat 101 Exam 1 Important Formulas and Concepts 1 1. Data Any collection of numbers, characters, images, or other items that provide information about something. 2. Categorical/Qualitative
More informationDescribing distributions with numbers
Describing distributions with numbers A large number or numerical methods are available for describing quantitative data sets. Most of these methods measure one of two data characteristics: The central
More informationDescribing distributions with numbers
Describing distributions with numbers A large number or numerical methods are available for describing quantitative data sets. Most of these methods measure one of two data characteristics: The central
More informationBNG 495 Capstone Design. Descriptive Statistics
BNG 495 Capstone Design Descriptive Statistics Overview The overall goal of this short course in statistics is to provide an introduction to descriptive and inferential statistical methods, with a focus
More informationStatistics I Chapter 2: Univariate data analysis
Statistics I Chapter 2: Univariate data analysis Chapter 2: Univariate data analysis Contents Graphical displays for categorical data (barchart, piechart) Graphical displays for numerical data data (histogram,
More informationAnnouncements. Lecture 1 - Data and Data Summaries. Data. Numerical Data. all variables. continuous discrete. Homework 1 - Out 1/15, due 1/22
Announcements Announcements Lecture 1 - Data and Data Summaries Statistics 102 Colin Rundel January 13, 2013 Homework 1 - Out 1/15, due 1/22 Lab 1 - Tomorrow RStudio accounts created this evening Try logging
More informationChapter2 Description of samples and populations. 2.1 Introduction.
Chapter2 Description of samples and populations. 2.1 Introduction. Statistics=science of analyzing data. Information collected (data) is gathered in terms of variables (characteristics of a subject that
More informationElementary Statistics
Elementary Statistics Q: What is data? Q: What does the data look like? Q: What conclusions can we draw from the data? Q: Where is the middle of the data? Q: Why is the spread of the data important? Q:
More informationLecture Notes 2: Variables and graphics
Highlights: Lecture Notes 2: Variables and graphics Quantitative vs. qualitative variables Continuous vs. discrete and ordinal vs. nominal variables Frequency distributions Pie charts Bar charts Histograms
More informationStatistics I Chapter 2: Univariate data analysis
Statistics I Chapter 2: Univariate data analysis Chapter 2: Univariate data analysis Contents Graphical displays for categorical data (barchart, piechart) Graphical displays for numerical data data (histogram,
More informationChapter 4. Displaying and Summarizing. Quantitative Data
STAT 141 Introduction to Statistics Chapter 4 Displaying and Summarizing Quantitative Data Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter 2015 1 / 31 4.1 Histograms 1 We divide the range
More informationST Presenting & Summarising Data Descriptive Statistics. Frequency Distribution, Histogram & Bar Chart
ST2001 2. Presenting & Summarising Data Descriptive Statistics Frequency Distribution, Histogram & Bar Chart Summary of Previous Lecture u A study often involves taking a sample from a population that
More information1. Exploratory Data Analysis
1. Exploratory Data Analysis 1.1 Methods of Displaying Data A visual display aids understanding and can highlight features which may be worth exploring more formally. Displays should have impact and be
More informationUniversity of Jordan Fall 2009/2010 Department of Mathematics
handouts Part 1 (Chapter 1 - Chapter 5) University of Jordan Fall 009/010 Department of Mathematics Chapter 1 Introduction to Introduction; Some Basic Concepts Statistics is a science related to making
More informationAP Final Review II Exploring Data (20% 30%)
AP Final Review II Exploring Data (20% 30%) Quantitative vs Categorical Variables Quantitative variables are numerical values for which arithmetic operations such as means make sense. It is usually a measure
More informationChapter 3. Data Description
Chapter 3. Data Description Graphical Methods Pie chart It is used to display the percentage of the total number of measurements falling into each of the categories of the variable by partition a circle.
More informationDescriptive Statistics
Descriptive Statistics CHAPTER OUTLINE 6-1 Numerical Summaries of Data 6- Stem-and-Leaf Diagrams 6-3 Frequency Distributions and Histograms 6-4 Box Plots 6-5 Time Sequence Plots 6-6 Probability Plots Chapter
More informationUnit 1 Summarizing Data
BIOSTATS 540 Fall 2016 1. Summarizing Page 1 of 42 Unit 1 Summarizing It is difficult to understand why statisticians commonly limit their enquiries to averages, and do not revel in more comprehensive
More informationSTT 315 This lecture is based on Chapter 2 of the textbook.
STT 315 This lecture is based on Chapter 2 of the textbook. Acknowledgement: Author is thankful to Dr. Ashok Sinha, Dr. Jennifer Kaplan and Dr. Parthanil Roy for allowing him to use/edit some of their
More informationMATH 1150 Chapter 2 Notation and Terminology
MATH 1150 Chapter 2 Notation and Terminology Categorical Data The following is a dataset for 30 randomly selected adults in the U.S., showing the values of two categorical variables: whether or not the
More informationIntroduction to Statistics
Introduction to Statistics Data and Statistics Data consists of information coming from observations, counts, measurements, or responses. Statistics is the science of collecting, organizing, analyzing,
More informationQUANTITATIVE DATA. UNIVARIATE DATA data for one variable
QUANTITATIVE DATA Recall that quantitative (numeric) data values are numbers where data take numerical values for which it is sensible to find averages, such as height, hourly pay, and pulse rates. UNIVARIATE
More informationVariables, distributions, and samples (cont.) Phil 12: Logic and Decision Making Fall 2010 UC San Diego 10/18/2010
Variables, distributions, and samples (cont.) Phil 12: Logic and Decision Making Fall 2010 UC San Diego 10/18/2010 Review Recording observations - Must extract that which is to be analyzed: coding systems,
More informationNicole Dalzell. July 2, 2014
UNIT 1: INTRODUCTION TO DATA LECTURE 3: EDA (CONT.) AND INTRODUCTION TO STATISTICAL INFERENCE VIA SIMULATION STATISTICS 101 Nicole Dalzell July 2, 2014 Teams and Announcements Team1 = Houdan Sai Cui Huanqi
More informationMATH 10 INTRODUCTORY STATISTICS
MATH 10 INTRODUCTORY STATISTICS Tommy Khoo Your friendly neighbourhood graduate student. Week 1 Chapter 1 Introduction What is Statistics? Why do you need to know Statistics? Technical lingo and concepts:
More informationFurther Mathematics 2018 CORE: Data analysis Chapter 2 Summarising numerical data
Chapter 2: Summarising numerical data Further Mathematics 2018 CORE: Data analysis Chapter 2 Summarising numerical data Extract from Study Design Key knowledge Types of data: categorical (nominal and ordinal)
More informationTopic 3: Introduction to Statistics. Algebra 1. Collecting Data. Table of Contents. Categorical or Quantitative? What is the Study of Statistics?!
Topic 3: Introduction to Statistics Collecting Data We collect data through observation, surveys and experiments. We can collect two different types of data: Categorical Quantitative Algebra 1 Table of
More informationDescriptive Data Summarization
Descriptive Data Summarization Descriptive data summarization gives the general characteristics of the data and identify the presence of noise or outliers, which is useful for successful data cleaning
More informationData Analysis and Statistical Methods Statistics 651
Data Analysis and Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasini/teaching/ Suhasini Subba Rao Review In the previous lecture we looked at the statistics of M&Ms. This example illustrates
More informationAnswer keys for Assignment 10: Measurement of study variables (The correct answer is underlined in bold text)
Answer keys for Assignment 10: Measurement of study variables (The correct answer is underlined in bold text) 1. A quick and easy indicator of dispersion is a. Arithmetic mean b. Variance c. Standard deviation
More informationMath 120 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency
Math 1 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency The word average: is very ambiguous and can actually refer to the mean, median, mode or midrange. Notation:
More informationA is one of the categories into which qualitative data can be classified.
Chapter 2 Methods for Describing Sets of Data 2.1 Describing qualitative data Recall qualitative data: non-numerical or categorical data Basic definitions: A is one of the categories into which qualitative
More informationUnit 2. Describing Data: Numerical
Unit 2 Describing Data: Numerical Describing Data Numerically Describing Data Numerically Central Tendency Arithmetic Mean Median Mode Variation Range Interquartile Range Variance Standard Deviation Coefficient
More informationCOMPLEMENTARY EXERCISES WITH DESCRIPTIVE STATISTICS
COMPLEMENTARY EXERCISES WITH DESCRIPTIVE STATISTICS EX 1 Given the following series of data on Gender and Height for 8 patients, fill in two frequency tables one for each Variable, according to the model
More informationLecture 1: Descriptive Statistics
Lecture 1: Descriptive Statistics MSU-STT-351-Sum 15 (P. Vellaisamy: MSU-STT-351-Sum 15) Probability & Statistics for Engineers 1 / 56 Contents 1 Introduction 2 Branches of Statistics Descriptive Statistics
More informationNemours Biomedical Research Statistics Course. Li Xie Nemours Biostatistics Core October 14, 2014
Nemours Biomedical Research Statistics Course Li Xie Nemours Biostatistics Core October 14, 2014 Outline Recap Introduction to Logistic Regression Recap Descriptive statistics Variable type Example of
More informationDescriptive Statistics-I. Dr Mahmoud Alhussami
Descriptive Statistics-I Dr Mahmoud Alhussami Biostatistics What is the biostatistics? A branch of applied math. that deals with collecting, organizing and interpreting data using well-defined procedures.
More informationMeasures of Central Tendency and their dispersion and applications. Acknowledgement: Dr Muslima Ejaz
Measures of Central Tendency and their dispersion and applications Acknowledgement: Dr Muslima Ejaz LEARNING OBJECTIVES: Compute and distinguish between the uses of measures of central tendency: mean,
More informationOverview. INFOWO Statistics lecture S1: Descriptive statistics. Detailed Overview of the Statistics track. Definition
Overview INFOWO Statistics lecture S1: Descriptive statistics Peter de Waal Introduction to statistics Descriptive statistics Department of Information and Computing Sciences Faculty of Science, Universiteit
More informationLecture Slides. Elementary Statistics Twelfth Edition. by Mario F. Triola. and the Triola Statistics Series. Section 3.1- #
Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series by Mario F. Triola Chapter 3 Statistics for Describing, Exploring, and Comparing Data 3-1 Review and Preview 3-2 Measures
More informationCHAPTER 2 Description of Samples and Populations
Chapter 2 27 CHAPTER 2 Description of Samples and Populations 2.1.1 (a) i) Molar width ii) Continuous variable iii) A molar iv) 36 (b) i) Birthweight, date of birth, and race ii) Birthweight is continuous,
More informationGlossary. The ISI glossary of statistical terms provides definitions in a number of different languages:
Glossary The ISI glossary of statistical terms provides definitions in a number of different languages: http://isi.cbs.nl/glossary/index.htm Adjusted r 2 Adjusted R squared measures the proportion of the
More informationCHAPTER 2: Describing Distributions with Numbers
CHAPTER 2: Describing Distributions with Numbers The Basic Practice of Statistics 6 th Edition Moore / Notz / Fligner Lecture PowerPoint Slides Chapter 2 Concepts 2 Measuring Center: Mean and Median Measuring
More informationLecture 2. Quantitative variables. There are three main graphical methods for describing, summarizing, and detecting patterns in quantitative data:
Lecture 2 Quantitative variables There are three main graphical methods for describing, summarizing, and detecting patterns in quantitative data: Stemplot (stem-and-leaf plot) Histogram Dot plot Stemplots
More informationBIOS 2041: Introduction to Statistical Methods
BIOS 2041: Introduction to Statistical Methods Abdus S Wahed* *Some of the materials in this chapter has been adapted from Dr. John Wilson s lecture notes for the same course. Chapter 0 2 Chapter 1 Introduction
More informationExploratory data analysis
Exploratory data analysis November 29, 2017 Dr. Khajonpong Akkarajitsakul Department of Computer Engineering, Faculty of Engineering King Mongkut s University of Technology Thonburi Module III Overview
More informationVisualizing and summarizing data
Visualizing and summarizing data Ken Rice, Dept of Biostatistics HUBIO 530 January 2015 Q. What s your talk about? Today I will describe: How to visualize small datasets How to summarize small datasets
More informationChapter 2 Solutions Page 15 of 28
Chapter Solutions Page 15 of 8.50 a. The median is 55. The mean is about 105. b. The median is a more representative average" than the median here. Notice in the stem-and-leaf plot on p.3 of the text that
More informationLecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1
Lecture Slides Elementary Statistics Tenth Edition and the Triola Statistics Series by Mario F. Triola Slide 1 Chapter 3 Statistics for Describing, Exploring, and Comparing Data 3-1 Overview 3-2 Measures
More informationChapter 1: Exploring Data
Chapter 1: Exploring Data Section 1.3 with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE Chapter 1 Exploring Data Introduction: Data Analysis: Making Sense of Data 1.1
More informationWhat is Statistics? Simple Summaries and Plots. Brian D. Ripley
What is Statistics? Simple Summaries and Plots Brian D. Ripley Statistics The name derives from state-istics, quantification of the state, an area now know as official statistics. Health statistics have
More informationANÁLISE DOS DADOS. Daniela Barreiro Claro
ANÁLISE DOS DADOS Daniela Barreiro Claro Outline Data types Graphical Analysis Proimity measures Prof. Daniela Barreiro Claro Types of Data Sets Record Ordered Relational records Video data: sequence of
More informationExample 2. Given the data below, complete the chart:
Statistics 2035 Quiz 1 Solutions Example 1. 2 64 150 150 2 128 150 2 256 150 8 8 Example 2. Given the data below, complete the chart: 52.4, 68.1, 66.5, 75.0, 60.5, 78.8, 63.5, 48.9, 81.3 n=9 The data is
More informationDEPARTMENT OF QUANTITATIVE METHODS & INFORMATION SYSTEMS QM 120. Spring 2008
DEPARTMENT OF QUANTITATIVE METHODS & INFORMATION SYSTEMS Introduction to Business Statistics QM 120 Chapter 3 Spring 2008 Measures of central tendency for ungrouped data 2 Graphs are very helpful to describe
More informationReview for Exam #1. Chapter 1. The Nature of Data. Definitions. Population. Sample. Quantitative data. Qualitative (attribute) data
Review for Exam #1 1 Chapter 1 Population the complete collection of elements (scores, people, measurements, etc.) to be studied Sample a subcollection of elements drawn from a population 11 The Nature
More informationMEASURES OF LOCATION AND SPREAD
MEASURES OF LOCATION AND SPREAD Frequency distributions and other methods of data summarization and presentation explained in the previous lectures provide a fairly detailed description of the data and
More information2 Descriptive Statistics
2 Descriptive Statistics Reading: SW Chapter 2, Sections 1-6 A natural first step towards answering a research question is for the experimenter to design a study or experiment to collect data from the
More informationMeasures of center. The mean The mean of a distribution is the arithmetic average of the observations:
Measures of center The mean The mean of a distribution is the arithmetic average of the observations: x = x 1 + + x n n n = 1 x i n i=1 The median The median is the midpoint of a distribution: the number
More informationSummarizing and Displaying Measurement Data/Understanding and Comparing Distributions
Summarizing and Displaying Measurement Data/Understanding and Comparing Distributions Histograms, Mean, Median, Five-Number Summary and Boxplots, Standard Deviation Thought Questions 1. If you were to
More informationLearning Objectives for Stat 225
Learning Objectives for Stat 225 08/20/12 Introduction to Probability: Get some general ideas about probability, and learn how to use sample space to compute the probability of a specific event. Set Theory:
More informationStat Lecture Slides Exploring Numerical Data. Yibi Huang Department of Statistics University of Chicago
Stat 22000 Lecture Slides Exploring Numerical Data Yibi Huang Department of Statistics University of Chicago Outline In this slide, we cover mostly Section 1.2 & 1.6 in the text. Data and Types of Variables
More information(quantitative or categorical variables) Numerical descriptions of center, variability, position (quantitative variables)
3. Descriptive Statistics Describing data with tables and graphs (quantitative or categorical variables) Numerical descriptions of center, variability, position (quantitative variables) Bivariate descriptions
More informationPractice problems from chapters 2 and 3
Practice problems from chapters and 3 Question-1. For each of the following variables, indicate whether it is quantitative or qualitative and specify which of the four levels of measurement (nominal, ordinal,
More informationUnit 1 Summarizing Data
BIOSTATS 540 Fall 2018 1. Summarizing Page 1 of 47 Unit 1 Summarizing It is difficult to understand why statisticians commonly limit their enquiries to averages, and do not revel in more comprehensive
More informationVocabulary: Samples and Populations
Vocabulary: Samples and Populations Concept Different types of data Categorical data results when the question asked in a survey or sample can be answered with a nonnumerical answer. For example if we
More informationStatistics lecture 3. Bell-Shaped Curves and Other Shapes
Statistics lecture 3 Bell-Shaped Curves and Other Shapes Goals for lecture 3 Realize many measurements in nature follow a bell-shaped ( normal ) curve Understand and learn to compute a standardized score
More informationComparing Measures of Central Tendency *
OpenStax-CNX module: m11011 1 Comparing Measures of Central Tendency * David Lane This work is produced by OpenStax-CNX and licensed under the Creative Commons Attribution License 1.0 1 Comparing Measures
More informationIntroduction to Statistics
Why Statistics? Introduction to Statistics To develop an appreciation for variability and how it effects products and processes. Study methods that can be used to help solve problems, build knowledge and
More informationUnit Two Descriptive Biostatistics. Dr Mahmoud Alhussami
Unit Two Descriptive Biostatistics Dr Mahmoud Alhussami Descriptive Biostatistics The best way to work with data is to summarize and organize them. Numbers that have not been summarized and organized are
More informationFREQUENCY DISTRIBUTIONS AND PERCENTILES
FREQUENCY DISTRIBUTIONS AND PERCENTILES New Statistical Notation Frequency (f): the number of times a score occurs N: sample size Simple Frequency Distributions Raw Scores The scores that we have directly
More information2011 Pearson Education, Inc
Statistics for Business and Economics Chapter 2 Methods for Describing Sets of Data Summary of Central Tendency Measures Measure Formula Description Mean x i / n Balance Point Median ( n +1) Middle Value
More informationChapter 1: Introduction. Material from Devore s book (Ed 8), and Cengagebrain.com
1 Chapter 1: Introduction Material from Devore s book (Ed 8), and Cengagebrain.com Populations and Samples An investigation of some characteristic of a population of interest. Example: Say you want to
More informationChapter 1 - Lecture 3 Measures of Location
Chapter 1 - Lecture 3 of Location August 31st, 2009 Chapter 1 - Lecture 3 of Location General Types of measures Median Skewness Chapter 1 - Lecture 3 of Location Outline General Types of measures What
More informationTOPIC: Descriptive Statistics Single Variable
TOPIC: Descriptive Statistics Single Variable I. Numerical data summary measurements A. Measures of Location. Measures of central tendency Mean; Median; Mode. Quantiles - measures of noncentral tendency
More informationChapter 1:Descriptive statistics
Slide 1.1 Chapter 1:Descriptive statistics Descriptive statistics summarises a mass of information. We may use graphical and/or numerical methods Examples of the former are the bar chart and XY chart,
More informationMeasures of disease spread
Measures of disease spread Marco De Nardi Milk Safety Project 1 Objectives 1. Describe the following measures of spread: range, interquartile range, variance, and standard deviation 2. Discuss examples
More informationUNIVERSITY OF MASSACHUSETTS Department of Biostatistics and Epidemiology BioEpi 540W - Introduction to Biostatistics Fall 2004
UNIVERSITY OF MASSACHUSETTS Department of Biostatistics and Epidemiology BioEpi 50W - Introduction to Biostatistics Fall 00 Exercises with Solutions Topic Summarizing Data Due: Monday September 7, 00 READINGS.
More informationHistograms allow a visual interpretation
Chapter 4: Displaying and Summarizing i Quantitative Data s allow a visual interpretation of quantitative (numerical) data by indicating the number of data points that lie within a range of values, called
More informationStatistics for Managers using Microsoft Excel 6 th Edition
Statistics for Managers using Microsoft Excel 6 th Edition Chapter 3 Numerical Descriptive Measures 3-1 Learning Objectives In this chapter, you learn: To describe the properties of central tendency, variation,
More information