Σ x i. Sigma Notation

Size: px
Start display at page:

Download "Σ x i. Sigma Notation"

Transcription

1 Sigma Notation The mathematical notation that is used most often in the formulation of statistics is the summation notation The uppercase Greek letter Σ (sigma) is used as shorthand, as a way to indicate that a sum is to be taken: Σ x i The notation above is equivalent to writing x 1 + x 2 + x x n-1 + x n

2 Sigma Notation - Components Looking at sigma notation in more detail: refers to where the sum of terms ends indicates we are taking a sum Σ x i indicates what we are summing up refers to where the sum of terms begins

3 Sigma Notation - Indices For example, with the following indices: i=5 Σ x = x i 3 + x 4 + x 5 i=3 The above sum indicates that only observations 3 through 5 should be included in the sum, despite the fact that x 1 through x n might exist in the sample x 1 + x 2 + x x n-1 + x n

4 Sigma Notation - Rules Constants (denoted by an a in the example below) in summation notation can be treated as follows: n times Σ a = (a + a + a) = na Here we are simply using the summation notation to carry out a multiplication, e.g. i=3 Σ 4 = (4+4+4) = 4*3 = 12

5 Sigma Notation - Rules If we have a constant within a summation, that constant can be taken outside the summation sign: Σ ax i = a Σ x i Σ ax i = ax 1 + ax ax n a Σ x i = a(x 1 + x x n )

6 Sigma Notation - Rules The order in which addition operations are carried out is unimportant: Σ (x i + y i ) = Σ x i + Σ y i (x 1 + x 2 + x x n-1 + x n ) + (y 1 + y 2 + y y n-1 + y n )

7 Sigma Notation - Rules Exponents are handled differently depending on whether they are applied to the observation term or the whole sum: Σ x i m = x 1m + x 2m + + x m n ( Σ x ) m = (x i 1 + x x n ) m

8 Sigma Notation - Rules Products are handled much like exponents: Σ x i y i = (x 1 y 1 + x 2 y x n y n ) Σ x i * Σ x i y i = Σ x i * Σ y i Σ y i =(x 1 + x x n ) * (y 1 + y y n )

9 Sigma Notation - Simplification A summation will often be written leaving out the upper and/or lower limits of the summation, and in these cases we must assume that all of the terms available are to be summed: Σ x i = n Σ x i = i Σ x i = i Σ x i

10 Sigma Notation Compound Sums We frequently use tabular data (or data drawn from matrices), with which we can construct sums of both the rows and the columns (compounds sums), using subscript i to denote the row index and the subscript j to denote the column index: i rows j columns x 11 x 12 x 13 x 21 x 22 x 23 Σ j=m Σ x ij = (x 11 + x 12 + x 13 j=1 + x 21 + x 22 + x 23 )

11 Pi Notation Just as we have summation notation that we can use as a shorthand for sums, there is also product notation for multiplication The uppercase Greek letter Π (pi) is used as shorthand, as a way to indicate that a product is to be calculated: Π x i The notation above is equivalent to writing x 1 * x 2 * x 3 * * x n-1 * x n

12 Pi Notation While the product operator is less commonly applied in the statistical context, one place where it can be used is in the calculation of combinations, which make use of the factorial (!) operator:!n = n * n-1 * n-2 * * 3 * 2 * 1 = Π i!6 = 6 * 5 * 4 * 3 * 2 * 1 = 720 = i=6 Π i

13 Some Basic Terminology Population In statistical terms, a set of things (potentially any sort of things), but in particular referring to the complete set of all that are like them Variables The properties of a population that are to be measured (i.e. how do parts of the population differ?) Sample A subset (part of) the population Parent Population Refers to the more general population of origin for any given sub-population / sample / subset

14 Some Basic Terminology Representative A sample is representative if it is an accurate reflection of its parent population. Ensuring that samples are representative of the population from which they are drawn is a primary problem in statistics Constant Something that does not vary Parameter A constant measure which describes the characteristic of a population Statistic The corresponding measure for a sample

15 Some Common Corresponding Parameters and Descriptive Statistics (Population) Parameter (Sample) Statistic Mean µ x Standard Deviation σ S Variance σ 2 S 2

16 Simple Descriptive Statistics Descriptive statistics provide an organization and summary of a dataset A small number of summary measures replaces the entirety of a dataset You re likely already familiar with some simple descriptive summary measures: 1. Ratios 2. Proportions 3. Percentages 4. Rates of Change 5. (Location Quotients)

17 Simple Descriptive Summary Measures 1. Ratios - # of observations in A # of observations in B e.g. A - 6 overcast, B 24 mostly cloudy days 2. Proportions Relates one part or category of data to the entire set of observations, e.g. a box of marbles that contains 4 yellow, 6 red, 5 blue, and 2 green gives a yellow proportion of 4/17 or a color ={yellow, red, blue, green} a a count ={4, 6, 5, 2} proportion = i Σ a i

18 Simple Descriptive Summary Measures 2. Proportions cont. Sum of all proportions = 1 These are useful for comparing two sets of data w/ different sizes and category counts e.g. a different box of marbles gives a yellow proportion of 2/23, and in order for this to be a reasonable comparison we need to know the totals for both samples 3. Percentages Calculated by proportions x 100, e.g. 2/23 = 8.696%, use of these should be restricted to larger sample sizes, perhaps 20+ observations

19 Simple Descriptive Summary Measures 4. Rates of Change Expressing the change in a variable with respect to its original value, e.g. Z = x(t 2) x(t 1 ) x(t 1 ) = change in x original value of x e.g. if we had 20 marbles and then added 10, the rate of change = (30-20)/20 = 10/20 = Location Quotients An index of relative concentration in space, a comparison of a region s share of something to the total

20 Simple Descriptive Summary Measures 5. Location Quotients cont. For example, suppose we have a region of 1000 km 2 which we subdivide into three smaller areas of 200, 300, and 500 km 2 respectively (labeled A, B, and C) The region has an influenza outbreak with 150 cases in the first region, 100 in the second, and 350 in the third (a total of 600 flu cases): Proportion of Area Proportion of Cases Location Quotient A 200/1000= /600= /0.2=1.25 B 300/1000= /600= /0.3=0.57 C 500/1000= /600= /0.5=1.17 Location Quotient = Prop. of Cases / Prop. of Area

21 Simple Descriptive Statistics These are ways to summarize a number set quickly and accurately The most common way of describing a variable distribution is in terms of two of its properties: Central tendency describes the central value of the distribution, around which the observations cluster Dispersion describes how the observations are distributed First we ll look at measures of central tendency

22 Measures of Central Tendency Think of this from the following point of view: We have some distribution in which we want to locate the center, and we need to choose an appropriate measure of central tendency. We can choose from: 1. Mode 2. Median 3. Mean Each of these measures is appropriate to different distributions / under different circumstances

23 Measures of Central Tendency - Mode 1. Mode This is the most frequently occurring value in the distribution In the event that multiple values tie for the highest frequency, we have a problem A potential solution in this situation involves constructing frequency classes and identify the most frequently occurring class This is the only measure of central tendency that can be used with nominal data The mode allows the distribution s peak to be located quickly

24 Measures of Central Tendency - Median 2. Median This is the value of a variable such that half of the observations are above and half are below this value i.e. this value divides the distribution into two groups of equal size Note: When the distribution has an even number of observations, finding the median requires averaging two numbers The key advantage of the median is that its value is unaffected by extreme values at the end of a distribution (which potentially are outliers)

25 Measures of Central Tendency - Median 2. Median cont. When we find the median of a distribution, we do so by dividing it into two equal parts. We can divide distributions into a greater number of parts as well: Quartiles each contains 25% of all values Quintiles each contains 20% of all values Deciles each contains 10% of all values Percentiles each contains 1% of all values Except for quintiles (because they give an odd number of parts) each of these also gives us the median

26 Measures of Central Tendency - Mean 3. Mean a.k.a. average, the most commonly used measure of central tendency x = Σ x i n Sample mean µ = i=n Σ x i N Population mean When we compute a mean using these basic formulae, we are assuming that each observation is equally significant

27 Measures of Central Tendency - Mean 3. Mean cont. We can also calculate a weighted mean using some weighting factor: x = Σ w i x i Σ w i e.g. What is the average income of all people in cities A, B, and C: City Avg. Income Population A $23, ,000 B $20,000 50,000 C $25, ,000 Weighted mean Here, population is the weighting factor w i and the average income is the variable of interest x i

28 Measures of Central Tendency - Mean 3. Mean cont. We can also calculate a grouped mean using the mid-points and frequencies of groups: x = Σ m i f i Σ f i Grouped mean e.g. Suppose we had grouped some data in a frequency table and wanted to calculate the grouped mean: Group Freq. Midpoint

29 Measures of Central Tendency - Mean 3. Mean cont. A standard geographic application of the mean is to locate the center (a.k.a. centroid) of a spatial distribution by assigning to each member of the spatial distribution a gridded coordinate and calculating the mean value in each coordinate direction Bivariate mean or mean center For a set of (x,y) coordinates, the mean center (x,y) is computed using: x = Σ x i n y = Σ y i n

30 Measures of Central Tendency - Mean 3. Mean cont. We can also calculate a weighted mean center in much the same way, but by using weights: For a set of (x,y) coordinates, the weighted mean center (x,y) is computed using: x = Σ w i x i Σ w i y = Σ w i y i Σ w i e.g. suppose we had the centroids and areas of 3 polygons Here we weight by area, but other weightings possible

31 Measures of Central Tendency - Mean 3. Mean cont. NOTE: Mean centers and weighted mean centers are very sensitive to outliers! For example: Suppose we calculated the geographic center of the United States when considering all states. Consider the influence that Hawaii and Alaska would have on the result if we chose to include them in the calculation. Clearly we would get a very different result with and without these two states

32 Selecting a Measure of Central Tendency Most often, the mean is selected by default The mean s key advantage is that is sensitive to any change in the value of any observation However, we really must consider the nature of the distribution to choose properly: 1. Multi-modal distribution The mode must be used, as the median or mean would be rather meaningless 2. Unimodal symmetric The mean is fine if the distribution approaches being symmetric

33 Selecting a Measure of Central Tendency 3. Unimodal skewed A skewed distribution creates significant differences between the measures of central tendency: Negatively skewed Mean < Median < Mode Positively skewed Mode < Median < Mean In both cases of skew, the median is appropriate, especially in cases with extreme outliers (e.g. dist. of salaries of UNC Geography graduates) 4. Ordinal Data Median is well applied 5. Nominal Data Mode is the only choice

2/2/2015 GEOGRAPHY 204: STATISTICAL PROBLEM SOLVING IN GEOGRAPHY MEASURES OF CENTRAL TENDENCY CHAPTER 3: DESCRIPTIVE STATISTICS AND GRAPHICS

2/2/2015 GEOGRAPHY 204: STATISTICAL PROBLEM SOLVING IN GEOGRAPHY MEASURES OF CENTRAL TENDENCY CHAPTER 3: DESCRIPTIVE STATISTICS AND GRAPHICS Spring 2015: Lembo GEOGRAPHY 204: STATISTICAL PROBLEM SOLVING IN GEOGRAPHY CHAPTER 3: DESCRIPTIVE STATISTICS AND GRAPHICS Descriptive statistics concise and easily understood summary of data set characteristics

More information

Unit Two Descriptive Biostatistics. Dr Mahmoud Alhussami

Unit Two Descriptive Biostatistics. Dr Mahmoud Alhussami Unit Two Descriptive Biostatistics Dr Mahmoud Alhussami Descriptive Biostatistics The best way to work with data is to summarize and organize them. Numbers that have not been summarized and organized are

More information

Chapter 4. Displaying and Summarizing. Quantitative Data

Chapter 4. Displaying and Summarizing. Quantitative Data STAT 141 Introduction to Statistics Chapter 4 Displaying and Summarizing Quantitative Data Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter 2015 1 / 31 4.1 Histograms 1 We divide the range

More information

Chapter Four. Numerical Descriptive Techniques. Range, Standard Deviation, Variance, Coefficient of Variation

Chapter Four. Numerical Descriptive Techniques. Range, Standard Deviation, Variance, Coefficient of Variation Chapter Four Numerical Descriptive Techniques 4.1 Numerical Descriptive Techniques Measures of Central Location Mean, Median, Mode Measures of Variability Range, Standard Deviation, Variance, Coefficient

More information

Measures of Central Tendency

Measures of Central Tendency Measures of Central Tendency Summary Measures Summary Measures Central Tendency Mean Median Mode Quartile Range Variance Variation Coefficient of Variation Standard Deviation Measures of Central Tendency

More information

Descriptive Statistics-I. Dr Mahmoud Alhussami

Descriptive Statistics-I. Dr Mahmoud Alhussami Descriptive Statistics-I Dr Mahmoud Alhussami Biostatistics What is the biostatistics? A branch of applied math. that deals with collecting, organizing and interpreting data using well-defined procedures.

More information

Meelis Kull Autumn Meelis Kull - Autumn MTAT Data Mining - Lecture 03

Meelis Kull Autumn Meelis Kull - Autumn MTAT Data Mining - Lecture 03 Meelis Kull meelis.kull@ut.ee Autumn 2017 1 Demo: Data science mini-project CRISP-DM: cross-industrial standard process for data mining Data understanding: Types of data Data understanding: First look

More information

Chapter 1 - Lecture 3 Measures of Location

Chapter 1 - Lecture 3 Measures of Location Chapter 1 - Lecture 3 of Location August 31st, 2009 Chapter 1 - Lecture 3 of Location General Types of measures Median Skewness Chapter 1 - Lecture 3 of Location Outline General Types of measures What

More information

Statistics and parameters

Statistics and parameters Statistics and parameters Tables, histograms and other charts are used to summarize large amounts of data. Often, an even more extreme summary is desirable. Statistics and parameters are numbers that characterize

More information

1 Measures of the Center of a Distribution

1 Measures of the Center of a Distribution 1 Measures of the Center of a Distribution Qualitative descriptions of the shape of a distribution are important and useful. But we will often desire the precision of numerical summaries as well. Two aspects

More information

Review for Exam #1. Chapter 1. The Nature of Data. Definitions. Population. Sample. Quantitative data. Qualitative (attribute) data

Review for Exam #1. Chapter 1. The Nature of Data. Definitions. Population. Sample. Quantitative data. Qualitative (attribute) data Review for Exam #1 1 Chapter 1 Population the complete collection of elements (scores, people, measurements, etc.) to be studied Sample a subcollection of elements drawn from a population 11 The Nature

More information

8. Sequences, Series, and Probability 8.1. SEQUENCES AND SERIES

8. Sequences, Series, and Probability 8.1. SEQUENCES AND SERIES 8. Sequences, Series, and Probability 8.1. SEQUENCES AND SERIES What You Should Learn Use sequence notation to write the terms of sequences. Use factorial notation. Use summation notation to write sums.

More information

MATH 10 INTRODUCTORY STATISTICS

MATH 10 INTRODUCTORY STATISTICS MATH 10 INTRODUCTORY STATISTICS Tommy Khoo Your friendly neighbourhood graduate student. Week 1 Chapter 1 Introduction What is Statistics? Why do you need to know Statistics? Technical lingo and concepts:

More information

MATH 117 Statistical Methods for Management I Chapter Three

MATH 117 Statistical Methods for Management I Chapter Three Jubail University College MATH 117 Statistical Methods for Management I Chapter Three This chapter covers the following topics: I. Measures of Center Tendency. 1. Mean for Ungrouped Data (Raw Data) 2.

More information

Ø Set of mutually exclusive categories. Ø Classify or categorize subject. Ø No meaningful order to categorization.

Ø Set of mutually exclusive categories. Ø Classify or categorize subject. Ø No meaningful order to categorization. Statistical Tools in Evaluation HPS 41 Dr. Joe G. Schmalfeldt Types of Scores Continuous Scores scores with a potentially infinite number of values. Discrete Scores scores limited to a specific number

More information

CHAPTER 8 INTRODUCTION TO STATISTICAL ANALYSIS

CHAPTER 8 INTRODUCTION TO STATISTICAL ANALYSIS CHAPTER 8 INTRODUCTION TO STATISTICAL ANALYSIS LEARNING OBJECTIVES: After studying this chapter, a student should understand: notation used in statistics; how to represent variables in a mathematical form

More information

21 ST CENTURY LEARNING CURRICULUM FRAMEWORK PERFORMANCE RUBRICS FOR MATHEMATICS PRE-CALCULUS

21 ST CENTURY LEARNING CURRICULUM FRAMEWORK PERFORMANCE RUBRICS FOR MATHEMATICS PRE-CALCULUS 21 ST CENTURY LEARNING CURRICULUM FRAMEWORK PERFORMANCE RUBRICS FOR MATHEMATICS PRE-CALCULUS Table of Contents Functions... 2 Polynomials and Rational Functions... 3 Exponential Functions... 4 Logarithmic

More information

Descriptive Data Summarization

Descriptive Data Summarization Descriptive Data Summarization Descriptive data summarization gives the general characteristics of the data and identify the presence of noise or outliers, which is useful for successful data cleaning

More information

are the objects described by a set of data. They may be people, animals or things.

are the objects described by a set of data. They may be people, animals or things. ( c ) E p s t e i n, C a r t e r a n d B o l l i n g e r 2016 C h a p t e r 5 : E x p l o r i n g D a t a : D i s t r i b u t i o n s P a g e 1 CHAPTER 5: EXPLORING DATA DISTRIBUTIONS 5.1 Creating Histograms

More information

1-1. Chapter 1. Sampling and Descriptive Statistics by The McGraw-Hill Companies, Inc. All rights reserved.

1-1. Chapter 1. Sampling and Descriptive Statistics by The McGraw-Hill Companies, Inc. All rights reserved. 1-1 Chapter 1 Sampling and Descriptive Statistics 1-2 Why Statistics? Deal with uncertainty in repeated scientific measurements Draw conclusions from data Design valid experiments and draw reliable conclusions

More information

Tastitsticsss? What s that? Principles of Biostatistics and Informatics. Variables, outcomes. Tastitsticsss? What s that?

Tastitsticsss? What s that? Principles of Biostatistics and Informatics. Variables, outcomes. Tastitsticsss? What s that? Tastitsticsss? What s that? Statistics describes random mass phanomenons. Principles of Biostatistics and Informatics nd Lecture: Descriptive Statistics 3 th September Dániel VERES Data Collecting (Sampling)

More information

MATH 1150 Chapter 2 Notation and Terminology

MATH 1150 Chapter 2 Notation and Terminology MATH 1150 Chapter 2 Notation and Terminology Categorical Data The following is a dataset for 30 randomly selected adults in the U.S., showing the values of two categorical variables: whether or not the

More information

2 Descriptive Statistics

2 Descriptive Statistics 2 Descriptive Statistics Reading: SW Chapter 2, Sections 1-6 A natural first step towards answering a research question is for the experimenter to design a study or experiment to collect data from the

More information

Chapter 4.notebook. August 30, 2017

Chapter 4.notebook. August 30, 2017 Sep 1 7:53 AM Sep 1 8:21 AM Sep 1 8:21 AM 1 Sep 1 8:23 AM Sep 1 8:23 AM Sep 1 8:23 AM SOCS When describing a distribution, make sure to always tell about three things: shape, outliers, center, and spread

More information

Midrange: mean of highest and lowest scores. easy to compute, rough estimate, rarely used

Midrange: mean of highest and lowest scores. easy to compute, rough estimate, rarely used Measures of Central Tendency Mode: most frequent score. best average for nominal data sometimes none or multiple modes in a sample bimodal or multimodal distributions indicate several groups included in

More information

Sampling Populations limited in the scope enumerate

Sampling Populations limited in the scope enumerate Sampling Populations Typically, when we collect data, we are somewhat limited in the scope of what information we can reasonably collect Ideally, we would enumerate each and every member of a population

More information

1. Exploratory Data Analysis

1. Exploratory Data Analysis 1. Exploratory Data Analysis 1.1 Methods of Displaying Data A visual display aids understanding and can highlight features which may be worth exploring more formally. Displays should have impact and be

More information

Preliminary Statistics course. Lecture 1: Descriptive Statistics

Preliminary Statistics course. Lecture 1: Descriptive Statistics Preliminary Statistics course Lecture 1: Descriptive Statistics Rory Macqueen (rm43@soas.ac.uk), September 2015 Organisational Sessions: 16-21 Sep. 10.00-13.00, V111 22-23 Sep. 15.00-18.00, V111 24 Sep.

More information

Describing distributions with numbers

Describing distributions with numbers Describing distributions with numbers A large number or numerical methods are available for describing quantitative data sets. Most of these methods measure one of two data characteristics: The central

More information

Last Lecture. Distinguish Populations from Samples. Knowing different Sampling Techniques. Distinguish Parameters from Statistics

Last Lecture. Distinguish Populations from Samples. Knowing different Sampling Techniques. Distinguish Parameters from Statistics Last Lecture Distinguish Populations from Samples Importance of identifying a population and well chosen sample Knowing different Sampling Techniques Distinguish Parameters from Statistics Knowing different

More information

Ø Set of mutually exclusive categories. Ø Classify or categorize subject. Ø No meaningful order to categorization.

Ø Set of mutually exclusive categories. Ø Classify or categorize subject. Ø No meaningful order to categorization. Statistical Tools in Evaluation HPS 41 Fall 213 Dr. Joe G. Schmalfeldt Types of Scores Continuous Scores scores with a potentially infinite number of values. Discrete Scores scores limited to a specific

More information

ECLT 5810 Data Preprocessing. Prof. Wai Lam

ECLT 5810 Data Preprocessing. Prof. Wai Lam ECLT 5810 Data Preprocessing Prof. Wai Lam Why Data Preprocessing? Data in the real world is imperfect incomplete: lacking attribute values, lacking certain attributes of interest, or containing only aggregate

More information

3.1 Measure of Center

3.1 Measure of Center 3.1 Measure of Center Calculate the mean for a given data set Find the median, and describe why the median is sometimes preferable to the mean Find the mode of a data set Describe how skewness affects

More information

3.1 Measures of Central Tendency: Mode, Median and Mean. Average a single number that is used to describe the entire sample or population

3.1 Measures of Central Tendency: Mode, Median and Mean. Average a single number that is used to describe the entire sample or population . Measures of Central Tendency: Mode, Median and Mean Average a single number that is used to describe the entire sample or population. Mode a. Easiest to compute, but not too stable i. Changing just one

More information

Chapter 1: Introduction. Material from Devore s book (Ed 8), and Cengagebrain.com

Chapter 1: Introduction. Material from Devore s book (Ed 8), and Cengagebrain.com 1 Chapter 1: Introduction Material from Devore s book (Ed 8), and Cengagebrain.com Populations and Samples An investigation of some characteristic of a population of interest. Example: Say you want to

More information

Unit 2. Describing Data: Numerical

Unit 2. Describing Data: Numerical Unit 2 Describing Data: Numerical Describing Data Numerically Describing Data Numerically Central Tendency Arithmetic Mean Median Mode Variation Range Interquartile Range Variance Standard Deviation Coefficient

More information

Introduction to Statistics

Introduction to Statistics Introduction to Statistics By A.V. Vedpuriswar October 2, 2016 Introduction The word Statistics is derived from the Italian word stato, which means state. Statista refers to a person involved with the

More information

P8130: Biostatistical Methods I

P8130: Biostatistical Methods I P8130: Biostatistical Methods I Lecture 2: Descriptive Statistics Cody Chiuzan, PhD Department of Biostatistics Mailman School of Public Health (MSPH) Lecture 1: Recap Intro to Biostatistics Types of Data

More information

ST Presenting & Summarising Data Descriptive Statistics. Frequency Distribution, Histogram & Bar Chart

ST Presenting & Summarising Data Descriptive Statistics. Frequency Distribution, Histogram & Bar Chart ST2001 2. Presenting & Summarising Data Descriptive Statistics Frequency Distribution, Histogram & Bar Chart Summary of Previous Lecture u A study often involves taking a sample from a population that

More information

9/2/2010. Wildlife Management is a very quantitative field of study. throughout this course and throughout your career.

9/2/2010. Wildlife Management is a very quantitative field of study. throughout this course and throughout your career. Introduction to Data and Analysis Wildlife Management is a very quantitative field of study Results from studies will be used throughout this course and throughout your career. Sampling design influences

More information

Chapter 2: Tools for Exploring Univariate Data

Chapter 2: Tools for Exploring Univariate Data Stats 11 (Fall 2004) Lecture Note Introduction to Statistical Methods for Business and Economics Instructor: Hongquan Xu Chapter 2: Tools for Exploring Univariate Data Section 2.1: Introduction What is

More information

Sets and Set notation. Algebra 2 Unit 8 Notes

Sets and Set notation. Algebra 2 Unit 8 Notes Sets and Set notation Section 11-2 Probability Experimental Probability experimental probability of an event: Theoretical Probability number of time the event occurs P(event) = number of trials Sample

More information

SUMMARIZING MEASURED DATA. Gaia Maselli

SUMMARIZING MEASURED DATA. Gaia Maselli SUMMARIZING MEASURED DATA Gaia Maselli maselli@di.uniroma1.it Computer Network Performance 2 Overview Basic concepts Summarizing measured data Summarizing data by a single number Summarizing variability

More information

Statistical Concepts. Constructing a Trend Plot

Statistical Concepts. Constructing a Trend Plot Module 1: Review of Basic Statistical Concepts 1.2 Plotting Data, Measures of Central Tendency and Dispersion, and Correlation Constructing a Trend Plot A trend plot graphs the data against a variable

More information

ADMS2320.com. We Make Stats Easy. Chapter 4. ADMS2320.com Tutorials Past Tests. Tutorial Length 1 Hour 45 Minutes

ADMS2320.com. We Make Stats Easy. Chapter 4. ADMS2320.com Tutorials Past Tests. Tutorial Length 1 Hour 45 Minutes We Make Stats Easy. Chapter 4 Tutorial Length 1 Hour 45 Minutes Tutorials Past Tests Chapter 4 Page 1 Chapter 4 Note The following topics will be covered in this chapter: Measures of central location Measures

More information

Chapter 4. Characterizing Data Numerically: Descriptive Statistics

Chapter 4. Characterizing Data Numerically: Descriptive Statistics Chapter 4 Characterizing Data Numerically: Descriptive Statistics While visual representations of data are very useful, they are only a beginning point from which we may gain more information. Numerical

More information

For math conventions used on the GRE, refer to this link:

For math conventions used on the GRE, refer to this link: GRE Review ISU Student Success Center Quantitative Workshop One Quantitative Section: Overview Your test will include either two or three 35-minute quantitative sections. There will be 20 questions in

More information

Introduction to Statistics

Introduction to Statistics Introduction to Statistics Data and Statistics Data consists of information coming from observations, counts, measurements, or responses. Statistics is the science of collecting, organizing, analyzing,

More information

Quartiles, Deciles, and Percentiles

Quartiles, Deciles, and Percentiles Quartiles, Deciles, and Percentiles From the definition of median that it s the middle point in the axis frequency distribution curve, and it is divided the area under the curve for two areas have the

More information

Topic-1 Describing Data with Numerical Measures

Topic-1 Describing Data with Numerical Measures Topic-1 Describing Data with Numerical Measures Central Tendency (Center) and Dispersion (Variability) Central tendency: measures of the degree to which scores are clustered around the mean of a distribution

More information

CS 147: Computer Systems Performance Analysis

CS 147: Computer Systems Performance Analysis CS 147: Computer Systems Performance Analysis Summarizing Variability and Determining Distributions CS 147: Computer Systems Performance Analysis Summarizing Variability and Determining Distributions 1

More information

Example 2. Given the data below, complete the chart:

Example 2. Given the data below, complete the chart: Statistics 2035 Quiz 1 Solutions Example 1. 2 64 150 150 2 128 150 2 256 150 8 8 Example 2. Given the data below, complete the chart: 52.4, 68.1, 66.5, 75.0, 60.5, 78.8, 63.5, 48.9, 81.3 n=9 The data is

More information

Chapter 3. Data Description. McGraw-Hill, Bluman, 7 th ed, Chapter 3

Chapter 3. Data Description. McGraw-Hill, Bluman, 7 th ed, Chapter 3 Chapter 3 Data Description McGraw-Hill, Bluman, 7 th ed, Chapter 3 1 Chapter 3 Overview Introduction 3-1 Measures of Central Tendency 3-2 Measures of Variation 3-3 Measures of Position 3-4 Exploratory

More information

What is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty.

What is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty. What is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty. Statistics is a field of study concerned with the data collection,

More information

TOPIC: Descriptive Statistics Single Variable

TOPIC: Descriptive Statistics Single Variable TOPIC: Descriptive Statistics Single Variable I. Numerical data summary measurements A. Measures of Location. Measures of central tendency Mean; Median; Mode. Quantiles - measures of noncentral tendency

More information

Histograms allow a visual interpretation

Histograms allow a visual interpretation Chapter 4: Displaying and Summarizing i Quantitative Data s allow a visual interpretation of quantitative (numerical) data by indicating the number of data points that lie within a range of values, called

More information

Summarizing Measured Data

Summarizing Measured Data Summarizing Measured Data 12-1 Overview Basic Probability and Statistics Concepts: CDF, PDF, PMF, Mean, Variance, CoV, Normal Distribution Summarizing Data by a Single Number: Mean, Median, and Mode, Arithmetic,

More information

Describing distributions with numbers

Describing distributions with numbers Describing distributions with numbers A large number or numerical methods are available for describing quantitative data sets. Most of these methods measure one of two data characteristics: The central

More information

1. AN INTRODUCTION TO DESCRIPTIVE STATISTICS. No great deed, private or public, has ever been undertaken in a bliss of certainty.

1. AN INTRODUCTION TO DESCRIPTIVE STATISTICS. No great deed, private or public, has ever been undertaken in a bliss of certainty. CIVL 3103 Approximation and Uncertainty J.W. Hurley, R.W. Meier 1. AN INTRODUCTION TO DESCRIPTIVE STATISTICS No great deed, private or public, has ever been undertaken in a bliss of certainty. - Leon Wieseltier

More information

Week 1: Intro to R and EDA

Week 1: Intro to R and EDA Statistical Methods APPM 4570/5570, STAT 4000/5000 Populations and Samples 1 Week 1: Intro to R and EDA Introduction to EDA Objective: study of a characteristic (measurable quantity, random variable) for

More information

KCP e-learning. test user - ability basic maths revision. During your training, we will need to cover some ground using statistics.

KCP e-learning. test user - ability basic maths revision. During your training, we will need to cover some ground using statistics. During your training, we will need to cover some ground using statistics. The very mention of this word can sometimes alarm delegates who may not have done any maths or statistics since leaving school.

More information

Descriptive Univariate Statistics and Bivariate Correlation

Descriptive Univariate Statistics and Bivariate Correlation ESC 100 Exploring Engineering Descriptive Univariate Statistics and Bivariate Correlation Instructor: Sudhir Khetan, Ph.D. Wednesday/Friday, October 17/19, 2012 The Central Dogma of Statistics used to

More information

Chapter 1:Descriptive statistics

Chapter 1:Descriptive statistics Slide 1.1 Chapter 1:Descriptive statistics Descriptive statistics summarises a mass of information. We may use graphical and/or numerical methods Examples of the former are the bar chart and XY chart,

More information

F78SC2 Notes 2 RJRC. If the interest rate is 5%, we substitute x = 0.05 in the formula. This gives

F78SC2 Notes 2 RJRC. If the interest rate is 5%, we substitute x = 0.05 in the formula. This gives F78SC2 Notes 2 RJRC Algebra It is useful to use letters to represent numbers. We can use the rules of arithmetic to manipulate the formula and just substitute in the numbers at the end. Example: 100 invested

More information

Module 1. Identify parts of an expression using vocabulary such as term, equation, inequality

Module 1. Identify parts of an expression using vocabulary such as term, equation, inequality Common Core Standards Major Topic Key Skills Chapters Key Vocabulary Essential Questions Module 1 Pre- Requisites Skills: Students need to know how to add, subtract, multiply and divide. Students need

More information

Final Exam STAT On a Pareto chart, the frequency should be represented on the A) X-axis B) regression C) Y-axis D) none of the above

Final Exam STAT On a Pareto chart, the frequency should be represented on the A) X-axis B) regression C) Y-axis D) none of the above King Abdul Aziz University Faculty of Sciences Statistics Department Final Exam STAT 0 First Term 49-430 A 40 Name No ID: Section: You have 40 questions in 9 pages. You have 90 minutes to solve the exam.

More information

Essentials of Statistics and Probability

Essentials of Statistics and Probability May 22, 2007 Department of Statistics, NC State University dbsharma@ncsu.edu SAMSI Undergrad Workshop Overview Practical Statistical Thinking Introduction Data and Distributions Variables and Distributions

More information

Overview of Dispersion. Standard. Deviation

Overview of Dispersion. Standard. Deviation 15.30 STATISTICS UNIT II: DISPERSION After reading this chapter, students will be able to understand: LEARNING OBJECTIVES To understand different measures of Dispersion i.e Range, Quartile Deviation, Mean

More information

CHAPTER 3. YAKUP ARI,Ph.D.(C)

CHAPTER 3. YAKUP ARI,Ph.D.(C) CHAPTER 3 YAKUP ARI,Ph.D.(C) math.stat.yeditepe@gmail.com REMEMBER!!! The purpose of descriptive statistics is to summarize and organize a set of scores. One of methods of descriptive statistics is to

More information

Summarizing and Displaying Measurement Data/Understanding and Comparing Distributions

Summarizing and Displaying Measurement Data/Understanding and Comparing Distributions Summarizing and Displaying Measurement Data/Understanding and Comparing Distributions Histograms, Mean, Median, Five-Number Summary and Boxplots, Standard Deviation Thought Questions 1. If you were to

More information

MEASURES OF LOCATION AND SPREAD

MEASURES OF LOCATION AND SPREAD MEASURES OF LOCATION AND SPREAD Frequency distributions and other methods of data summarization and presentation explained in the previous lectures provide a fairly detailed description of the data and

More information

Probabilities and Statistics Probabilities and Statistics Probabilities and Statistics

Probabilities and Statistics Probabilities and Statistics Probabilities and Statistics - Lecture 8 Olariu E. Florentin April, 2018 Table of contents 1 Introduction Vocabulary 2 Descriptive Variables Graphical representations Measures of the Central Tendency The Mean The Median The Mode Comparing

More information

Perhaps the most important measure of location is the mean (average). Sample mean: where n = sample size. Arrange the values from smallest to largest:

Perhaps the most important measure of location is the mean (average). Sample mean: where n = sample size. Arrange the values from smallest to largest: 1 Chapter 3 - Descriptive stats: Numerical measures 3.1 Measures of Location Mean Perhaps the most important measure of location is the mean (average). Sample mean: where n = sample size Example: The number

More information

1.3.1 Measuring Center: The Mean

1.3.1 Measuring Center: The Mean 1.3.1 Measuring Center: The Mean Mean - The arithmetic average. To find the mean (pronounced x bar) of a set of observations, add their values and divide by the number of observations. If the n observations

More information

ANÁLISE DOS DADOS. Daniela Barreiro Claro

ANÁLISE DOS DADOS. Daniela Barreiro Claro ANÁLISE DOS DADOS Daniela Barreiro Claro Outline Data types Graphical Analysis Proimity measures Prof. Daniela Barreiro Claro Types of Data Sets Record Ordered Relational records Video data: sequence of

More information

Marquette University MATH 1700 Class 5 Copyright 2017 by D.B. Rowe

Marquette University MATH 1700 Class 5 Copyright 2017 by D.B. Rowe Class 5 Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science Copyright 2017 by D.B. Rowe 1 Agenda: Recap Chapter 3.2-3.3 Lecture Chapter 4.1-4.2 Review Chapter 1 3.1 (Exam

More information

Comparing Measures of Central Tendency *

Comparing Measures of Central Tendency * OpenStax-CNX module: m11011 1 Comparing Measures of Central Tendency * David Lane This work is produced by OpenStax-CNX and licensed under the Creative Commons Attribution License 1.0 1 Comparing Measures

More information

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1 Lecture Slides Elementary Statistics Tenth Edition and the Triola Statistics Series by Mario F. Triola Slide 1 Chapter 3 Statistics for Describing, Exploring, and Comparing Data 3-1 Overview 3-2 Measures

More information

Measures of Central Tendency:

Measures of Central Tendency: Measures of Central Tendency: Mean, Median, Mode CasperWendy Measures of Central Tendency Measure of central tendency provides a very convenient way of describing a set of scores with a single number that

More information

CIVL 7012/8012. Collection and Analysis of Information

CIVL 7012/8012. Collection and Analysis of Information CIVL 7012/8012 Collection and Analysis of Information Uncertainty in Engineering Statistics deals with the collection and analysis of data to solve real-world problems. Uncertainty is inherent in all real

More information

Chapter 3. Data Description

Chapter 3. Data Description Chapter 3. Data Description Graphical Methods Pie chart It is used to display the percentage of the total number of measurements falling into each of the categories of the variable by partition a circle.

More information

2.0 Lesson Plan. Answer Questions. Summary Statistics. Histograms. The Normal Distribution. Using the Standard Normal Table

2.0 Lesson Plan. Answer Questions. Summary Statistics. Histograms. The Normal Distribution. Using the Standard Normal Table 2.0 Lesson Plan Answer Questions 1 Summary Statistics Histograms The Normal Distribution Using the Standard Normal Table 2. Summary Statistics Given a collection of data, one needs to find representations

More information

Resistant Measure - A statistic that is not affected very much by extreme observations.

Resistant Measure - A statistic that is not affected very much by extreme observations. Chapter 1.3 Lecture Notes & Examples Section 1.3 Describing Quantitative Data with Numbers (pp. 50-74) 1.3.1 Measuring Center: The Mean Mean - The arithmetic average. To find the mean (pronounced x bar)

More information

MAT Mathematics in Today's World

MAT Mathematics in Today's World MAT 1000 Mathematics in Today's World Last Time 1. Three keys to summarize a collection of data: shape, center, spread. 2. Can measure spread with the fivenumber summary. 3. The five-number summary can

More information

The science of learning from data.

The science of learning from data. STATISTICS (PART 1) The science of learning from data. Numerical facts Collection of methods for planning experiments, obtaining data and organizing, analyzing, interpreting and drawing the conclusions

More information

Review Notes for IB Standard Level Math

Review Notes for IB Standard Level Math Review Notes for IB Standard Level Math 1 Contents 1 Algebra 8 1.1 Rules of Basic Operations............................... 8 1.2 Rules of Roots..................................... 8 1.3 Rules of Exponents...................................

More information

Hypothesis Testing hypothesis testing approach

Hypothesis Testing hypothesis testing approach Hypothesis Testing In this case, we d be trying to form an inference about that neighborhood: Do people there shop more often those people who are members of the larger population To ascertain this, we

More information

For instance, we want to know whether freshmen with parents of BA degree are predicted to get higher GPA than those with parents without BA degree.

For instance, we want to know whether freshmen with parents of BA degree are predicted to get higher GPA than those with parents without BA degree. DESCRIPTIVE ANALYSIS For instance, we want to know whether freshmen with parents of BA degree are predicted to get higher GPA than those with parents without BA degree. Assume that we have data; what information

More information

Quantitative Methods Chapter 0: Review of Basic Concepts 0.1 Business Applications (II) 0.2 Business Applications (III)

Quantitative Methods Chapter 0: Review of Basic Concepts 0.1 Business Applications (II) 0.2 Business Applications (III) Quantitative Methods Chapter 0: Review of Basic Concepts 0.1 Business Applications (II) 0.1.1 Simple Interest 0.2 Business Applications (III) 0.2.1 Expenses Involved in Buying a Car 0.2.2 Expenses Involved

More information

Math Conventions. for the Quantitative Reasoning measure of the GRE General Test.

Math Conventions. for the Quantitative Reasoning measure of the GRE General Test. Math Conventions for the Quantitative Reasoning measure of the GRE General Test www.ets.org The mathematical symbols and terminology used in the Quantitative Reasoning measure of the test are conventional

More information

Class 11 Maths Chapter 15. Statistics

Class 11 Maths Chapter 15. Statistics 1 P a g e Class 11 Maths Chapter 15. Statistics Statistics is the Science of collection, organization, presentation, analysis and interpretation of the numerical data. Useful Terms 1. Limit of the Class

More information

Numerical Measures of Central Tendency

Numerical Measures of Central Tendency ҧ Numerical Measures of Central Tendency The central tendency of the set of measurements that is, the tendency of the data to cluster, or center, about certain numerical values; usually the Mean, Median

More information

Further Mathematics 2018 CORE: Data analysis Chapter 2 Summarising numerical data

Further Mathematics 2018 CORE: Data analysis Chapter 2 Summarising numerical data Chapter 2: Summarising numerical data Further Mathematics 2018 CORE: Data analysis Chapter 2 Summarising numerical data Extract from Study Design Key knowledge Types of data: categorical (nominal and ordinal)

More information

Introduction to Statistical Data Analysis Lecture 1: Working with Data Sets

Introduction to Statistical Data Analysis Lecture 1: Working with Data Sets Introduction to Statistical Data Analysis Lecture 1: Working with Data Sets James V. Lambers Department of Mathematics The University of Southern Mississippi James V. Lambers Statistical Data Analysis

More information

Lecture 1 : Basic Statistical Measures

Lecture 1 : Basic Statistical Measures Lecture 1 : Basic Statistical Measures Jonathan Marchini October 11, 2004 In this lecture we will learn about different types of data encountered in practice different ways of plotting data to explore

More information

Lecture 2. Descriptive Statistics: Measures of Center

Lecture 2. Descriptive Statistics: Measures of Center Lecture 2. Descriptive Statistics: Measures of Center Descriptive Statistics summarize or describe the important characteristics of a known set of data Inferential Statistics use sample data to make inferences

More information

Chapter 1: Introduction. Material from Devore s book (Ed 8), and Cengagebrain.com

Chapter 1: Introduction. Material from Devore s book (Ed 8), and Cengagebrain.com 1 Chapter 1: Introduction Material from Devore s book (Ed 8), and Cengagebrain.com Populations and Samples An investigation of some characteristic of a population of interest. Example: Say you want to

More information

Higher Secondary - First year STATISTICS Practical Book

Higher Secondary - First year STATISTICS Practical Book Higher Secondary - First year STATISTICS Practical Book th_statistics_practicals.indd 07-09-08 8:00:9 Introduction Statistical tools are important for us in daily life. They are used in the analysis of

More information

Lecture 2. Quantitative variables. There are three main graphical methods for describing, summarizing, and detecting patterns in quantitative data:

Lecture 2. Quantitative variables. There are three main graphical methods for describing, summarizing, and detecting patterns in quantitative data: Lecture 2 Quantitative variables There are three main graphical methods for describing, summarizing, and detecting patterns in quantitative data: Stemplot (stem-and-leaf plot) Histogram Dot plot Stemplots

More information

Algebra I. Standard Processes or Content Strand CLE Course Level Expectation SPI State Performance Indicator Check for Understanding

Algebra I. Standard Processes or Content Strand CLE Course Level Expectation SPI State Performance Indicator Check for Understanding Algebra I 111 Standard Processes or Content Strand CLE Course Level Expectation SPI State Performance Indicator Check for Understanding Standard 1 Mathematical Processes CLE 3102.1.1 Use mathematical language,

More information