Statistics I Chapter 1: Introduction

Contents

- What is Statistics? Definition
- Key words: population, parameter, sample, statistic, population size, sample size, individuals, objects
- Types of variables: categorical (ordinal, nominal) and numerical (discrete, continuous)
- Why sample?
- Definition of a simple random sample
- Frequencies and frequency distribution/table: absolute, absolute cumulative, relative, relative cumulative. Properties.

Recommended reading

- Peña, D., Romo, J., Introducción a la Estadística para las Ciencias Sociales. Chapters 1, 2, 3.
- Newbold, P., Estadística para los Negocios y la Economía (2009). Chapter 1; Sections 2.1, 2.4, 2.7.
- How to Lie with Statistics.

Definition of Statistics

Def. Statistics is a science that deals with:
- collecting, organizing, summarizing, presenting, interpreting, and processing data to transform data into information (descriptive statistics)
- predictions, forecasts, and estimation (inferential statistics)

On what occasions have you heard or seen the word "statistics"?
- football/tennis match summaries
- unemployment rates, number of people injured in car accidents

There is much more to statistics than percentages and counts!

Key words

- A population is the complete collection of all items/individuals/objects/subjects of interest or under investigation. N represents the population size.
- A sample is an observed subset of the population, typically chosen to investigate the properties of the parent population. n represents the sample size.
- A parameter is a specific characteristic of a population (fixed).
- A statistic is a specific characteristic of a sample (varies from sample to sample).
- A variable is a characteristic of an individual.

Examples

1. Population: all students at UC3M. Variable: height, with values in (0, ∞). Parameter: average height of all students. Statistic: average height of the sampled students.
2. Population: all fish in a sea. Variable: size, with values in {L, M, S}. Parameter: number of small fish in the entire sea. Statistic: number of small fish caught.
3. Population: all patients of Getafe Hospital. Variable: blood type, with values in {A, B, AB, O}. Parameter: percentage of all patients with AB. Statistic: percentage of sampled patients with AB.
4. Population: all Philips light bulbs. Variable: life expectancy in days, with values in {0, 1, 2, ...}. Parameter: variation in life expectancy of all light bulbs. Statistic: variation in life expectancy of the sampled light bulbs.

Types of data

Data (variable)
  Categorical (qualitative)
    Ordinal: classes can be ranked. Example: clothes size, L > M > S.
    Nominal: no natural order. Example: blood type, A, B, AB, O.
  Numerical (quantitative)
    Discrete: integer values. Example: number of children, 0, 1, 2, ...
    Continuous: non-integer values are possible. Example: height, 1.55 m, 1.71 m.

Notation: the letters X, Y, Z are typically used. Example:
- X = height in m (upper-case letters for the variable itself)
- x = 1.55 (lower-case letters for specific values)
- x_1 = 1.55, x_2 = 1.71 (add subscripts if there is more than one value)

Why sample?

In practice we do not study the whole population because:
- We may destroy the population (e.g. life expectancy of a light bulb)
- The population may exist as a concept but not in reality (e.g. population of defective items)
- It is impractical (e.g. population of all fish in a sea)
- It is too expensive
- It is too time-consuming

Definition of a simple random sample (SRS)

Def. A simple random sample is obtained in such a way that:
- each member of the population is chosen strictly by chance,
- each member of the population is equally likely to be chosen, and
- every possible sample of n objects is equally likely to be chosen.

Notation: a sample of size n from a variable X means that:
- We have n individuals selected at random from a population.
- For each of the individuals we report the value of the variable X.
- If X is categorical or discrete, it is convenient to write the different sample values that X takes as x_1, x_2, ..., x_k, with k ≤ n (ranked from the smallest to the largest, unless X is nominal).

Frequencies and frequency distribution

Def. A frequency distribution is a list or a table
- containing class groupings (categories or ranges within which the data fall)
- and the corresponding frequencies with which data fall within each class or category.

Frequencies:
- absolute (number of times the value appeared in the sample)
- relative (proportion of times the value appeared in the sample)
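The simple random sample defined above can be sketched in a few lines of Python. This is only an illustration: the population of 1000 student ID numbers and the helper name simple_random_sample are assumptions made for the example, not part of the course material. The standard-library call random.Random.sample draws without replacement, so every subset of size n is equally likely.

```python
import random

def simple_random_sample(population, n, seed=None):
    """Draw n distinct members so that every subset of size n is equally likely."""
    rng = random.Random(seed)  # a seed only makes the draw reproducible
    return rng.sample(list(population), n)

# Hypothetical population: ID numbers of N = 1000 students.
population = range(1, 1001)
sample = simple_random_sample(population, n=10, seed=42)
print(sample)
```

Running it with a different seed gives a different sample, which is why a statistic computed from the sample varies from sample to sample while the population parameter stays fixed.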

Why use frequency distributions?

- A frequency distribution is a way to summarize data.
- The distribution condenses the raw data into a more useful form
- and allows for a quick visual interpretation of the data.

Grouping by classes: categorical and discrete data

Class    Absolute     Relative          Cumulative absolute    Cumulative relative
x_i      freq. n_i    freq. f_i         freq. N_i              freq. F_i
x_1      n_1          f_1 = n_1 / n     N_1 = n_1              F_1 = f_1
x_2      n_2          f_2 = n_2 / n     N_2 = N_1 + n_2        F_2 = F_1 + f_2
...      ...          ...               ...                    ...
x_k      n_k          f_k = n_k / n     N_k = n                F_k = 1
Total    n            1                 (empty)                (empty)

Note:
- n_i = number of x_i in the sample, f_i = (number of x_i) / n
- N_i = N_{i-1} + n_i, F_i = F_{i-1} + f_i
- 0 ≤ f_i ≤ 1 and 0 ≤ F_i ≤ 1
- F_i and N_i do not make sense for categorical-nominal variables
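The table layout above can be reproduced programmatically. The following is a minimal sketch, assuming a hypothetical sample of a discrete variable (number of children); the function name frequency_table and the data are illustrative only. Each row holds (x_i, n_i, f_i, N_i, F_i) as defined in the note.

```python
from collections import Counter

def frequency_table(sample, order=None):
    """Return rows (x_i, n_i, f_i, N_i, F_i) for the distinct values in sample."""
    counts = Counter(sample)
    # Rank the classes from smallest to largest unless an explicit order is given.
    classes = order if order is not None else sorted(counts)
    n = len(sample)
    rows, cum_n = [], 0
    for x in classes:
        n_i = counts[x]
        cum_n += n_i                      # N_i = N_{i-1} + n_i
        rows.append((x, n_i, n_i / n, cum_n, cum_n / n))
    return rows

# Hypothetical sample of size n = 10.
children = [0, 2, 1, 1, 0, 3, 2, 1, 0, 1]
for row in frequency_table(children):
    print(row)
```

For a nominal variable the cumulative columns of each row should simply be ignored, since the classes have no natural order.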

Grouping by classes

Example 1: The data below show the blood types reported for a sample of 40 individuals.

AB, A, B, O, A, A, A, B, O, AB, B, O, B, B, B, A, A, A, AB, B, O, A, A, A, AB, AB, O, B, B, AB, O, B, O, O, A, A, O, B, AB, AB

- What kind of variable is blood type?
- Find a frequency distribution of the data.
- What percentage of the sampled people have blood type A?
- What percentage of the individuals have a blood type other than O?

Example 1 cont.: Categorical, nominal, with 4 different classes. The frequency distribution is:

Class    Absolute frequency    Relative frequency
A        12                    0.30
B        11                    0.275
AB       8                     0.20
O        9                     0.225
Total    40                    1

- Blood type A: 30%
- Blood type other than O: 100% − 22.5% = 77.5%
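The counts in Example 1 can be checked directly from the raw data. This short sketch uses only the 40 observations listed in the example; the variable names are illustrative.

```python
from collections import Counter

# The 40 blood types from Example 1, in the order given.
blood = ("AB A B O A A A B O AB B O B B B A A A AB B "
         "O A A A AB AB O B B AB O B O O A A O B AB AB").split()

counts = Counter(blood)          # absolute frequencies n_i
n = len(blood)                   # sample size

pct_a = 100 * counts["A"] / n              # percentage with blood type A
pct_not_o = 100 * (n - counts["O"]) / n    # percentage with a type other than O

print(dict(counts), pct_a, pct_not_o)
```

Because blood type is nominal, only absolute and relative frequencies are computed here; cumulative frequencies would have no meaning.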

Grouping by classes

Example 2: The table below shows different levels of satisfaction (S = satisfied, V = very, U = unsatisfied) for 901 employees.

Class    Absolute frequency
VU       62
U        108
S        319
VS       412
Total    901

- What type of variable is being studied?
- Find a frequency distribution of the data.
- What percentage of the sampled people are satisfied?
- How many individuals are unsatisfied or worse? In %?
- How many individuals are at least satisfied? In %?

Example 2 cont.: Categorical, ordinal, with 4 different classes. The frequency distribution is:

Class    Absolute     Relative     Cumulative absolute    Cumulative relative
         frequency    frequency    frequency              frequency
VU       62           0.07         62                     0.07
U        108          0.12         170                    0.19
S        319          0.35         489                    0.54
VS       412          0.46         901                    1
Total    901          1

- Satisfied: 35%
- Unsatisfied or worse: 170, which is 19%
- At least satisfied: 319 + 412 = 731 or 901 − 170 = 731; in percentages, 35% + 46% = 81% or 100% − 19% = 81%
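For an ordinal variable the cumulative columns carry the answers. The sketch below starts from the absolute frequencies given in Example 2 and uses itertools.accumulate for the running totals N_i; the variable names are illustrative.

```python
from itertools import accumulate

classes = ["VU", "U", "S", "VS"]   # ordered from worst to best
counts = [62, 108, 319, 412]       # absolute frequencies n_i from Example 2

n = sum(counts)
cum_counts = list(accumulate(counts))          # N_i = N_{i-1} + n_i
rel = [c / n for c in counts]                  # f_i = n_i / n
cum_rel = [c / n for c in cum_counts]          # F_i = N_i / n

# "Unsatisfied or worse" is the cumulative count up to class U.
unsatisfied_or_worse = cum_counts[classes.index("U")]
# "At least satisfied" is the complement of that count.
at_least_satisfied = n - unsatisfied_or_worse

print(cum_counts, unsatisfied_or_worse, at_least_satisfied)
print(round(100 * unsatisfied_or_worse / n), round(100 * at_least_satisfied / n))
```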

Grouping by classes

Example 3: To evaluate the performance of a new pesticide, a sample of 50 plants was selected from those treated with the new pesticide. The number of leaves attacked by a pest was counted for each of the sampled plants. The results are shown below.

x_i      Absolute frequency
[the individual values and frequencies are missing from this transcription]
Total    50

Example 3 cont.:
- What can you say about the variable in the study?
- Find its frequency distribution.
- What percentage of the sampled plants had only 3 leaves attacked?
- How many plants had no more than 3 leaves attacked?
- How many plants had at least 6 leaves attacked?
- What percentage of plants had between 3 and 5 leaves attacked?
- What percentage of plants had at least 8 leaves attacked?
- What percentage of plants had at most 2 leaves attacked?

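Grouping by classes

Example 3 cont.: Numerical, discrete, with 9 different values. The frequency distribution is:

x_i      Absolute     Relative     Cumulative absolute    Cumulative relative
         frequency    frequency    frequency              frequency
[the individual rows are missing from this transcription]
Total    50           1

Example 3 cont.:
- Only 3 leaves attacked: 16%
- No more than 3 leaves attacked: 36 plants (56% + 16% = 72% of 50)
- At least 6 leaves attacked: 5 plants (50 − 45 = 5)
- Between 3 and 5 leaves attacked: 16% + 10% + 8% = 34% or (8 + 5 + 4)/50 = 34%
- At least 8 leaves attacked: 2% + 2% = 4% or 100% − 96% = 4%
- At most 2 leaves attacked: 56%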

Grouping by class intervals: continuous (and discrete) data

Class interval       Midpoint                      n_i    f_i    N_i    F_i
[l_{i-1}, l_i)       x_i = (l_{i-1} + l_i) / 2
[l_0, l_1)           x_1                           n_1    f_1    N_1    F_1
[l_1, l_2)           x_2                           n_2    f_2    N_2    F_2
...                  ...                           ...    ...    ...    ...
[l_{k-1}, l_k]       x_k                           n_k    f_k    n      1
Total                                              n      1      (empty) (empty)

Note:
- The left end-point is included, but the right end-point is excluded (typical convention).
- The reverse end-point convention can be applied; check your software for its definition.
- Class intervals are also useful for tabulating discrete data if X takes many values.

Choosing the intervals:
- Very often class intervals have the same width.
- Determine the width w of each interval by
  w = (largest number − smallest number) / (number of desired intervals)
- How many intervals? Roughly between 5 and 20. More specifically:
  k ≈ √n if n is small
  k ≈ log(n) if n is large
- Intervals never overlap.
- Round up the interval width to get desirable interval end-points.
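The rules for choosing the number and width of intervals can be sketched as follows. Several details are assumptions made for illustration: only the square-root rule is implemented (the slides do not say where "small" ends and "large" begins), the result is clamped to the 5 to 20 range mentioned above, and "round up" is interpreted as rounding up to a multiple of a chosen unit. The function names and the sample figures are hypothetical.

```python
import math

def suggested_number_of_classes(n):
    """Square-root rule of thumb, kept within the rough range 5..20."""
    return min(20, max(5, round(math.sqrt(n))))

def interval_width(smallest, largest, k, unit=1):
    """w = (largest - smallest) / k, rounded up to a multiple of unit."""
    raw = (largest - smallest) / k
    return math.ceil(raw / unit) * unit

# Hypothetical sample: n = 36 observations ranging from 100 to 187.
k = suggested_number_of_classes(36)
w = interval_width(100, 187, k, unit=5)
print(k, w)
```

Rounding the width up rather than down ensures that k intervals of width w are wide enough to span the whole range of the data.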

Grouping by class intervals: continuous (and discrete) data

Example 4: A manufacturer of insulation randomly selects 20 winter days and records the daily high temperature (in degrees Fahrenheit):

24, 35, 17, 21, 24, 37, 26, 46, 58, 30, 32, 13, 12, 38, 41, 43, 44, 27, 53, 27

Find the frequency distribution of the data.

- Sort the raw data in ascending order: 12, 13, 17, 21, 24, 24, 26, 27, 27, 30, 32, 35, 37, 38, 41, 43, 44, 46, 53, 58
- Find the range: 58 − 12 = 46
- Select the number of classes: say k = 5
- Compute the interval width: 10 (46/5 = 9.2, then round up)
- Determine the end-points: 10 but less than 20, 20 but less than 30, etc.
- Count the observations and assign them to classes

Example 4 cont.:

Class interval    Midpoint    n_i    f_i     N_i    F_i
[10, 20)          15          3      0.15    3      0.15
[20, 30)          25          6      0.30    9      0.45
[30, 40)          35          5      0.25    14     0.70
[40, 50)          45          4      0.20    18     0.90
[50, 60]          55          2      0.10    20     1
Total                         20     1

- On how many days was the temperature below 30F? In %? (3 + 6 = 9, which is 45%)
- On how many days (approximately) was the temperature at least 45F? In %? (4/2 + 2 = 4, which is 20%)
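The counting step of Example 4 can be sketched with the standard-library bisect module. The data and end-points are those of the example; the helper name count_by_interval is illustrative. Intervals are closed on the left and open on the right, except the last one, which also includes its right end-point.

```python
from bisect import bisect_right

temps = [24, 35, 17, 21, 24, 37, 26, 46, 58, 30,
         32, 13, 12, 38, 41, 43, 44, 27, 53, 27]
edges = [10, 20, 30, 40, 50, 60]   # end-points l_0, ..., l_k from Example 4

def count_by_interval(data, edges):
    """Count values in [edges[i], edges[i+1]); the last interval is closed on the right."""
    counts = [0] * (len(edges) - 1)
    for x in data:
        if not edges[0] <= x <= edges[-1]:
            raise ValueError(f"{x} lies outside the class intervals")
        # bisect_right returns the index of the first end-point greater than x.
        i = min(bisect_right(edges, x) - 1, len(counts) - 1)
        counts[i] += 1
    return counts

counts = count_by_interval(temps, edges)
midpoints = [(lo + hi) / 2 for lo, hi in zip(edges, edges[1:])]

below_30 = sum(counts[:2])                    # days in [10, 20) and [20, 30)
# Approximation from the grouped table: half of [40, 50) plus all of [50, 60].
approx_at_least_45 = counts[3] / 2 + counts[4]

print(counts, midpoints, below_30, approx_at_least_45)
```

The approximation for "at least 45F" assumes observations are spread evenly within [40, 50); the raw data contain only three such days (46, 53, 58), which is why the question asks for an approximate answer.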


More information

Chapter 1: Introduction. Material from Devore s book (Ed 8), and Cengagebrain.com

Chapter 1: Introduction. Material from Devore s book (Ed 8), and Cengagebrain.com 1 Chapter 1: Introduction Material from Devore s book (Ed 8), and Cengagebrain.com Populations and Samples An investigation of some characteristic of a population of interest. Example: Say you want to

More information

Vocabulary: Samples and Populations

Vocabulary: Samples and Populations Vocabulary: Samples and Populations Concept Different types of data Categorical data results when the question asked in a survey or sample can be answered with a nonnumerical answer. For example if we

More information

Multiple Choice. Chapter 2 Test Bank

Multiple Choice. Chapter 2 Test Bank Straightforward Statistics 1st Edition Bowen Test Bank Full Download: https://testbanklive.com/download/straightforward-statistics-1st-edition-bowen-test-bank/ Chapter 2 Test Bank Multiple Choice 1. Data

More information

ST Presenting & Summarising Data Descriptive Statistics. Frequency Distribution, Histogram & Bar Chart

ST Presenting & Summarising Data Descriptive Statistics. Frequency Distribution, Histogram & Bar Chart ST2001 2. Presenting & Summarising Data Descriptive Statistics Frequency Distribution, Histogram & Bar Chart Summary of Previous Lecture u A study often involves taking a sample from a population that

More information

Chapter 01 : What is Statistics?

Chapter 01 : What is Statistics? Chapter 01 : What is Statistics? Feras Awad Data: The information coming from observations, counts, measurements, and responses. Statistics: The science of collecting, organizing, analyzing, and interpreting

More information

Statistical Process Control

Statistical Process Control Statistical Process Control What is a process? Inputs PROCESS Outputs A process can be described as a transformation of set of inputs into desired outputs. Types of Measures Measures where the metric is

More information

Basic Statistics and Probability Chapter 3: Probability

Basic Statistics and Probability Chapter 3: Probability Basic Statistics and Probability Chapter 3: Probability Events, Sample Spaces and Probability Unions and Intersections Complementary Events Additive Rule. Mutually Exclusive Events Conditional Probability

More information

THE SAMPLING DISTRIBUTION OF THE MEAN

THE SAMPLING DISTRIBUTION OF THE MEAN THE SAMPLING DISTRIBUTION OF THE MEAN COGS 14B JANUARY 26, 2017 TODAY Sampling Distributions Sampling Distribution of the Mean Central Limit Theorem INFERENTIAL STATISTICS Inferential statistics: allows

More information

Last Lecture. Distinguish Populations from Samples. Knowing different Sampling Techniques. Distinguish Parameters from Statistics

Last Lecture. Distinguish Populations from Samples. Knowing different Sampling Techniques. Distinguish Parameters from Statistics Last Lecture Distinguish Populations from Samples Importance of identifying a population and well chosen sample Knowing different Sampling Techniques Distinguish Parameters from Statistics Knowing different

More information

Tabulation means putting data into tables. A table is a matrix of data in rows and columns, with the rows and the columns having titles.

Tabulation means putting data into tables. A table is a matrix of data in rows and columns, with the rows and the columns having titles. 1 Tabulation means putting data into tables. A table is a matrix of data in rows and columns, with the rows and the columns having titles. 2 converting the set of numbers into the form of a grouped frequency

More information

LC OL - Statistics. Types of Data

LC OL - Statistics. Types of Data LC OL - Statistics Types of Data Question 1 Characterise each of the following variables as numerical or categorical. In each case, list any three possible values for the variable. (i) Eye colours in a

More information

Statistical thinking will one day be as necessary for efficient citizenship as the ability to read and write. H.G. Wells

Statistical thinking will one day be as necessary for efficient citizenship as the ability to read and write. H.G. Wells Statistical thinking will one day be as necessary for efficient citizenship as the ability to read and write. H.G. Wells 1 Statistics is a science which deals with collection, tabulation, presentation,

More information

Goodness of Fit Tests

Goodness of Fit Tests Goodness of Fit Tests Marc H. Mehlman marcmehlman@yahoo.com University of New Haven (University of New Haven) Goodness of Fit Tests 1 / 38 Table of Contents 1 Goodness of Fit Chi Squared Test 2 Tests of

More information

S1600 #2. Data Presentation #1. January 14, 2016

S1600 #2. Data Presentation #1. January 14, 2016 S1600 #2 Data Presentation #1 January 14, 2016 Outline 1 Data Presentation #1 Statistics and Data Variable Types Summarizing Categorical Data (WMU) S1600 #2 S1600, Lecture 2 2 / 14 Statistics and Data

More information

psychological statistics

psychological statistics psychological statistics B Sc. Counselling Psychology 011 Admission onwards III SEMESTER COMPLEMENTARY COURSE UNIVERSITY OF CALICUT SCHOOL OF DISTANCE EDUCATION CALICUT UNIVERSITY.P.O., MALAPPURAM, KERALA,

More information

Introduction to Statistical Data Analysis Lecture 3: Probability Distributions

Introduction to Statistical Data Analysis Lecture 3: Probability Distributions Introduction to Statistical Data Analysis Lecture 3: Probability Distributions James V. Lambers Department of Mathematics The University of Southern Mississippi James V. Lambers Statistical Data Analysis

More information

Frequency Distribution Cross-Tabulation

Frequency Distribution Cross-Tabulation Frequency Distribution Cross-Tabulation 1) Overview 2) Frequency Distribution 3) Statistics Associated with Frequency Distribution i. Measures of Location ii. Measures of Variability iii. Measures of Shape

More information

Chapitre 3. 5: Several Useful Discrete Distributions

Chapitre 3. 5: Several Useful Discrete Distributions Chapitre 3 5: Several Useful Discrete Distributions 5.3 The random variable x is not a binomial random variable since the balls are selected without replacement. For this reason, the probability p of choosing

More information

Revision Topic 13: Statistics 1

Revision Topic 13: Statistics 1 Revision Topic 13: Statistics 1 Averages There are three common types of average: the mean, median and mode. The mode (or modal value) is the data value (or values) that occurs the most often. The median

More information

Descriptive Statistics Methods of organizing and summarizing any data/information.

Descriptive Statistics Methods of organizing and summarizing any data/information. Introductory Statistics, 10 th ed. by Neil A. Weiss Chapter 1 The Nature of Statistics 1.1 Statistics Basics There are lies, damn lies, and statistics - Mark Twain Descriptive Statistics Methods of organizing

More information

Statistic: a that can be from a sample without making use of any unknown. In practice we will use to establish unknown parameters.

Statistic: a that can be from a sample without making use of any unknown. In practice we will use to establish unknown parameters. Chapter 9: Sampling Distributions 9.1: Sampling Distributions IDEA: How often would a given method of sampling give a correct answer if it was repeated many times? That is, if you took repeated samples

More information

Calculus for the Life Sciences

Calculus for the Life Sciences Calculus for the Life Sciences Integration Joseph M. Mahaffy, jmahaffy@mail.sdsu.edu Department of Mathematics and Statistics Dynamical Systems Group Computational Sciences Research Center San Diego State

More information

Types of Information. Topic 2 - Descriptive Statistics. Examples. Sample and Sample Size. Background Reading. Variables classified as STAT 511

Types of Information. Topic 2 - Descriptive Statistics. Examples. Sample and Sample Size. Background Reading. Variables classified as STAT 511 Topic 2 - Descriptive Statistics STAT 511 Professor Bruce Craig Types of Information Variables classified as Categorical (qualitative) - variable classifies individual into one of several groups or categories

More information

Atomic structure. Resources and methods for learning about these subjects (list a few here, in preparation for your research):

Atomic structure. Resources and methods for learning about these subjects (list a few here, in preparation for your research): Atomic structure This worksheet and all related files are licensed under the Creative Commons Attribution License, version 1.0. To view a copy of this license, visit http://creativecommons.org/licenses/by/1.0/,

More information

Author : Dr. Pushpinder Kaur. Educational Statistics: Mean Median and Mode

Author : Dr. Pushpinder Kaur. Educational Statistics: Mean Median and Mode B.ED. PART- II ACADEMIC SESSION : 2017-2018 PAPER XVIII Assessment for Learning Lesson No. 8 Author : Dr. Pushpinder Kaur Educational Statistics: Mean Median and Mode MEAN : The mean is the average value

More information

Week 1: Intro to R and EDA

Week 1: Intro to R and EDA Statistical Methods APPM 4570/5570, STAT 4000/5000 Populations and Samples 1 Week 1: Intro to R and EDA Introduction to EDA Objective: study of a characteristic (measurable quantity, random variable) for

More information

CISC 1100: Structures of Computer Science

CISC 1100: Structures of Computer Science CISC 1100: Structures of Computer Science Chapter 2 Sets and Sequences Fordham University Department of Computer and Information Sciences Fall, 2010 CISC 1100/Fall, 2010/Chapter 2 1 / 49 Outline Sets Basic

More information

Sampling Populations limited in the scope enumerate

Sampling Populations limited in the scope enumerate Sampling Populations Typically, when we collect data, we are somewhat limited in the scope of what information we can reasonably collect Ideally, we would enumerate each and every member of a population

More information

Chapter (3) Describing Data Numerical Measures Examples

Chapter (3) Describing Data Numerical Measures Examples Chapter (3) Describing Data Numerical Measures Examples Numeric Measurers Measures of Central Tendency Measures of Dispersion Arithmetic mean Mode Median Geometric Mean Range Variance &Standard deviation

More information

Chapter 2: Summarizing and Graphing Data

Chapter 2: Summarizing and Graphing Data Chapter 2: Summarizing and Graphing Data 9 Chapter 2: Summarizing and Graphing Data Section 2-2 1. No. For each class, the frequency tells us how many values fall within the given range of values, but

More information

Teaching Research Methods: Resources for HE Social Sciences Practitioners. Sampling

Teaching Research Methods: Resources for HE Social Sciences Practitioners. Sampling Sampling Session Objectives By the end of the session you will be able to: Explain what sampling means in research List the different sampling methods available Have had an introduction to confidence levels

More information