Sampling Populations limited in the scope enumerate

Size: px
Start display at page:

Download "Sampling Populations limited in the scope enumerate"

Transcription

1 Sampling Populations Typically, when we collect data, we are somewhat limited in the scope of what information we can reasonably collect Ideally, we would enumerate each and every member of a population so we could know its parameters perfectly In most cases this is not possible, because of the size of the population (infinite populations?) and associated costs (time, money, etc.) Usually it is not necessary, because by collecting data on an appropriate subset of the population we can create statistics that are adequate estimates of population parameters Instead, we sample a population, trying to get information about a representative subset of the population

2 Sampling Concepts We must define the sampling unit - the smallest subdivision of the population that becomes part of our sample We want to minimize sampling error when we design how we will collect data: Typically the sampling error as the sample size because larger samples make up a larger proportion of the population (and a complete census, for example, theoretically has no sampling error) We want to try and avoid sampling bias when we design how we will collect data: Bias here is referring to a systematic tendency in the selection of members of a population to be included in a sample, i.e. any given member of a population should have an equal chance of being included in the sample (for random sampling)

3 Probability Sampling Designs - Random Random sampling - In general, we need some degree of randomness in the selection of a sample to be able to draw any meaningful inferences about a population, but in some cases this may conflict with representativeness These are drawn in such a way that every unit of a population has an equal chance of being chosen and the selection of one unit has no impact on whether or not another individual will be selected (independence) This can be done with or without replacement (which determines whether the same unit can be drawn twice) We can generate random numbers using a table of random numerals, or using a computer, and we can scale to any required range of values

4 Transect Placement Software selects a random starting position for each transect, applying criteria Software assigns a random to each direction transect

5 Probability Sampling Designs - Systematic Representative approaches place restrictions on selection: Systematic sampling - This approach uses every k th element of the sampling frame, by beginning at a randomly chosen point in the frame, e.g. given a sampling frame of size = 200, to create a sample of size n=10 from such a sample, select a random point to begin within the frame and then include every 20 th value in the systematic sample This approach assumes that the assignment of the individuals in the sampling frame is random (i.e. they have not been placed in the frame in some order or grouping), and this should be checked before systematically sampling from a frame

6 Probability Sampling Designs - Systematic Some problems with systematic sampling: The possible values of sample size n are somewhat restricted by the size of the sampling frame, since the interval should divide evenly into the size of the sampling frame If the population itself exhibits some periodicity, then a stratified sample is likely to not be representative In geographic applications, with could be applied in 2 dimensions in (x,y) space with with xand y(which are not necessarily the same) specifying a systematic grid, but the sample size is still restricted by the extent of the study area (since the grid must fit evenly)

7 Probability Sampling Designs - Stratified We may need to place restrictions on how we select units for inclusion in a sample to ensure a representative sample. Stratified sampling - Divide the population into categories and select a random sample from each of these This approach can be used to decrease the likelihood of an unrepresentative sample if the classes/categories/strata are selected carefully (the individuals within a strata must be very much alike, which means that the population must be able to divided into relatively homogeneous groups) We need to know something about the population in order to make good decisions about stratification

8 Probability Sampling Designs - Stratified We can take a stratified sample that is Proportional - Where the random sample drawn from each class/category/stratum is the same size OR Disproportional - Where random samples of different sizes are drawn from each class/category/stratum, with the sample size usually being chosen on the basis of the size of that sub-population. This approach is best used when the sizes of the categories are significantly different, although it can also be applied to mitigate cost issues (i.e. it may be more costly to sample in a swamp than in a grassy field, so we might choose to take less samples in the swamp, although this clearly would be nothing to enhance representativeness in our sample)

9 Pond Branch Catchment Control Color Infrared Digital Orthophotography

10 Pond Branch Catchment Stratified TMI Sampling Pond Branch TMI Histogram TMI Values at Soil Moisture Sampling Locations using 11.25m PG DEM Percent of cells in catchment Topographic Moisture Index Topographic Moisture Index Pond Branch Glyndon

11 Probability Sampling Designs - Stratified WARNING: A class/category/stratum that is homogeneous with respect to one variable may have high variation with respect to another variable! Thus, stratification must be performed with some foreknowledge of how the sample will be analyzed, and if the sampling is being performed in a preliminary fashion (still seeking the relationships), there is a danger that the stratification will be found to be inappropriate after the fact E.g. my soils sampling may have been stratified with respect to TMI, but if I want to check if upstream landuse is a factor in Glyndon, I may find my samples are not representatively distributed with respect to land use

12 Random Spatial Sampling We can choose a random point in (x,y) space by choosing pairs of random numbers this produces a Poisson distribution if we divide the area into quadrats and count This is easy with rectangular study areas, otherwise we also need to reject any points outside the study area (e.g. my method for selecting the beginning of a transect) We can also produce stratified and systematic point samples by dividing the area into a group of mutually exclusive and collective exhaustive strata:

13 Data Portrayal Once we have sampled some geographic phenomenon, it is often useful to portray it in some fashion that allows you to get a sense of the values in the dataset Many portrayal approaches still involve reducing the volume of data (and information content), but if applied properly, they can help you see the interesting characteristics of data For the various scales of measurement, there are different approaches that are applicable

14 Scales of Measurement Thematic data can be divided into four types 1. The Nominal Scale 2. The Ordinal Scale 3. The Interval Scale 4. The Ratio Scale As we progress through these scales, the types of data they describe have increasing information content

15 Nominal Data From one of my dissertation transect samples, the set of types of segments are nominal data: Class Frequency % of Total Woody Herbaceous Water Normalizing Ground 6 the data, 1.88 Road 23 expressing it 7.21 relative to the Pavement 22 total (some 6.90 Structures 11 caveats here) 3.45

16 Nominal Data Class Frequency % of Total Woody Herbaceous Water Ground Road Pavement Structures This is a tabular presentation of data has the advantage of giving the exact quantities, but can be busy, especially in larger tables

17 Nominal Data Class Frequency Woody 105 Segment Type Frequency Herbaceous Water Ground 6 Road Pavement 22 Segment Types Structures 11 The frequency of nominal data classes can be well displayed by a bar graph Frequency Woody Herbaceous Water Ground Road Pavement Structures

18 Class Woody Herbaceous Water 0.31 Ground 1.88 Road 7.21 Pavement 6.90 Structures 3.45 Nominal Data % of Total Structures 3% Pavement 7% Segment Types Once normalized, the values are well displayed in a pie chart, which emphasizes each category s portion of the whole Road 7% Ground 2% Water 0% Herbaceous 48% Woody 33% Woody Herbaceous Water Ground Road Pavement Structures

19 Ordinal, Interval, & Ratio Data From my dissertation, the set of all topographic moisture index values drawn from a raster data layer is an example of an interval dataset:

20 Ordinal, Interval, & Ratio Data Pond Branch is a hectare watershed, which is equivalent to 375,500 m 2 (1 hectare = 10,000 m 2 ) Using 11.25m x 11.25m pixels ( m 2 ), there are ~ 2966 pixels from which we can draw TMI values

21 Ordinal, Interval, & Ratio Data It would clearly be impractical to try and get a sense of the distribution of TMI values in Pond Branch by looking at a table of 2966 values We need a data reduction approach by which we can reduce the number of values to a manageable amount, which in turn lends itself to some sort of graphical display For ordinal, interval, and ratio scale data, we can make use of histograms for this purpose, and building a histogram involves following a multistep procedure

22 Building a Histogram 1. Developing an ungrouped frequency table That is, we build a table that counts the number of occurrences of each variable value from lowest to highest: TMI Value Ungrouped Freq We could attempt to construct a bar chart from this table, but it would have too many bars to really be useful

23 Building a Histogram 2. Construct a grouped frequency table This table has classes of values (in a sense we are reducing our data back to the ordinal scale for display purposes) The decision on how to perform the grouping is a subjective one, but there are some common guidelines: Use class intervals with simple bounds and a common width (i.e. categories have same range) Adjacent intervals should not overlap (each datum should fit into one class)

24 Building a Histogram 3. Select an appropriate number of classes There are formulae available to make this decision objectively, but in reality it is a somewhat subjective decision If you have more observations, you usually need more classes, because when you put observations together in a class, you are considering them to have the same value for display purposes there is a trade-off here between simplicity and loss of information (e.g. Pond Branch TMI observations grouped into 10 classes)

25 Building a Histogram 3. Select an appropriate number of classes cont. Class Frequency

26 Building a Histogram 4. Plot the frequencies of each class All that remains is to create the plot: Pond Branch TMI Histogram Percent of cells in catchment Topographic Moisture Index

27 Frequencies & Distributions A histogram is one way to depict a frequency distribution. A loose definition of a frequency: The number of times a variable takes on a particular value (note that any variable has a frequency distribution) E.g. roll a pair of dice several times and record the resulting values (constrained to being between and 2 and 12), counting the number of times any given value occurs (the frequency of that value occurring), and take these all together to form a frequency distribution

28 Frequencies & Distributions Frequencies can be absolute (when the frequency provided is the actual count of the occurrences of that particular frequency) or they can be relative (when they are normalized by dividing the absolute frequency by the total number of observations to yield a relative frequency between 0 and 1) Relative frequencies are particularly useful if you want to compare distributions drawn from two different sources, i.e. while the numbers of observations of each source may be different, by normalizing them, they can be reasonably compared

29 Glyndon Segment Length Distributions Upper Baismans Run Percent of all segments in class Percent of all segments in class Segment length (meters) Woody Herbaceous Pavement Roads Structures Segment length (meters) Woody Herbaceous Pavement Roads Structures

30 Frequencies & Distributions In addition to the conventional frequencies described thusfar, there is another type of frequency known as a cumulative frequency. Cumulative frequencies are calculated by starting with the lowest class of an observed variable and its frequency and then adding each successive variable value to the preceding sum. Cumulative frequencies are desirable when we want to know what proportion of observations have a value less than some threshold

31 Frequencies & Distributions For example, here s some frequency data for the woody vegetation class segments distance from streams in Upper Baisman s Run: CLASS MIN. VALUE FREQ. CUM FREQ

32 Conventional Baismans Run Primary Class Distance from Stream Distributions Cumulative Percent of all cells in class Percent of all cells in class Distance to stream along D8 flow paths (meters) Woody Herbaceous Pavement and Road Structures Ground Distance to stream along D8 flow paths (m eters) Woody Herbaceous Pavement and Road Structures Ground

33 Frequencies & Distributions By examining the shape of freq. distribution curves we can gain some sense of the distribution through some general characteristics: 1. Modality Most distributions are unimodal, but we might also see bimodal or multi-modal dists. (if unimodal, we can also consider): 2. Symmetry a.k.a. skewness of the distribution Is it positively or negatively skewed? 3. Kurtosis Describes the degree of peakedness or flatness of the curve

34 Shapes of Histograms Bell Shaped Bimodal Mode: value with highest frequency Range: largest value-smallest value Skewed Random Developing a histogram from attribute data is one level of data reduction; we can describe bell shaped distributions using parameters that provide a more concise summary

David Tenenbaum GEOG 090 UNC-CH Spring 2005

David Tenenbaum GEOG 090 UNC-CH Spring 2005 Statistical Thinking, Data Types, and Geographical Primitives The scientific method in geography, two kinds of approaches, and the sorts of statistics used to support those approaches Some characteristics

More information

Wet May 29/30 Avg. June 26/28 Dry August 22 R 2 =0.79 R 2 =0.24

Wet May 29/30 Avg. June 26/28 Dry August 22 R 2 =0.79 R 2 =0.24 GEOG 090 Quantitative Methods in Geography Wet May 29/30 Avg. June 26/28 Dry August 22 0.6 0.6 0.6 Pond Branch - PG 11.25m DEM Theta 0.5 0.4 0.3 0.2 0.1 0.0 R 2 =0.71 4 5 6 7 8 9 10 11 12 13 TMI Theta

More information

Introduction to Statistics

Introduction to Statistics Why Statistics? Introduction to Statistics To develop an appreciation for variability and how it effects products and processes. Study methods that can be used to help solve problems, build knowledge and

More information

David Tenenbaum GEOG 070 UNC-CH Spring 2005

David Tenenbaum GEOG 070 UNC-CH Spring 2005 GEOG 070 Introduction to Geographic Information GEOG 070 Introduction to Geographic Information Course Description: There is a spatial component to all that occurs on our planet. Everything happens somewhere

More information

Sampling The World. presented by: Tim Haithcoat University of Missouri Columbia

Sampling The World. presented by: Tim Haithcoat University of Missouri Columbia Sampling The World presented by: Tim Haithcoat University of Missouri Columbia Compiled with materials from: Charles Parson, Bemidji State University and Timothy Nyerges, University of Washington Introduction

More information

1.0 Continuous Distributions. 5.0 Shapes of Distributions. 6.0 The Normal Curve. 7.0 Discrete Distributions. 8.0 Tolerances. 11.

1.0 Continuous Distributions. 5.0 Shapes of Distributions. 6.0 The Normal Curve. 7.0 Discrete Distributions. 8.0 Tolerances. 11. Chapter 4 Statistics 45 CHAPTER 4 BASIC QUALITY CONCEPTS 1.0 Continuous Distributions.0 Measures of Central Tendency 3.0 Measures of Spread or Dispersion 4.0 Histograms and Frequency Distributions 5.0

More information

What is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty.

What is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty. What is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty. Statistics is a field of study concerned with the data collection,

More information

FREQUENCY DISTRIBUTIONS AND PERCENTILES

FREQUENCY DISTRIBUTIONS AND PERCENTILES FREQUENCY DISTRIBUTIONS AND PERCENTILES New Statistical Notation Frequency (f): the number of times a score occurs N: sample size Simple Frequency Distributions Raw Scores The scores that we have directly

More information

Simple Linear Regression

Simple Linear Regression Simple Linear Regression Up until this point, the quantitative methods we have been studying have been designed to help us understand a single random variable In many cases, we are interested in examining

More information

Stochastic calculus for summable processes 1

Stochastic calculus for summable processes 1 Stochastic calculus for summable processes 1 Lecture I Definition 1. Statistics is the science of collecting, organizing, summarizing and analyzing the information in order to draw conclusions. It is a

More information

The science of learning from data.

The science of learning from data. STATISTICS (PART 1) The science of learning from data. Numerical facts Collection of methods for planning experiments, obtaining data and organizing, analyzing, interpreting and drawing the conclusions

More information

Variables, distributions, and samples (cont.) Phil 12: Logic and Decision Making Fall 2010 UC San Diego 10/18/2010

Variables, distributions, and samples (cont.) Phil 12: Logic and Decision Making Fall 2010 UC San Diego 10/18/2010 Variables, distributions, and samples (cont.) Phil 12: Logic and Decision Making Fall 2010 UC San Diego 10/18/2010 Review Recording observations - Must extract that which is to be analyzed: coding systems,

More information

Σ x i. Sigma Notation

Σ x i. Sigma Notation Sigma Notation The mathematical notation that is used most often in the formulation of statistics is the summation notation The uppercase Greek letter Σ (sigma) is used as shorthand, as a way to indicate

More information

CIVL 7012/8012. Collection and Analysis of Information

CIVL 7012/8012. Collection and Analysis of Information CIVL 7012/8012 Collection and Analysis of Information Uncertainty in Engineering Statistics deals with the collection and analysis of data to solve real-world problems. Uncertainty is inherent in all real

More information

Sampling. Where we re heading: Last time. What is the sample? Next week: Lecture Monday. **Lab Tuesday leaving at 11:00 instead of 1:00** Tomorrow:

Sampling. Where we re heading: Last time. What is the sample? Next week: Lecture Monday. **Lab Tuesday leaving at 11:00 instead of 1:00** Tomorrow: Sampling Questions Define: Sampling, statistical inference, statistical vs. biological population, accuracy, precision, bias, random sampling Why do people use sampling techniques in monitoring? How do

More information

Introduction to Statistics

Introduction to Statistics Introduction to Statistics Data and Statistics Data consists of information coming from observations, counts, measurements, or responses. Statistics is the science of collecting, organizing, analyzing,

More information

2/2/2015 GEOGRAPHY 204: STATISTICAL PROBLEM SOLVING IN GEOGRAPHY MEASURES OF CENTRAL TENDENCY CHAPTER 3: DESCRIPTIVE STATISTICS AND GRAPHICS

2/2/2015 GEOGRAPHY 204: STATISTICAL PROBLEM SOLVING IN GEOGRAPHY MEASURES OF CENTRAL TENDENCY CHAPTER 3: DESCRIPTIVE STATISTICS AND GRAPHICS Spring 2015: Lembo GEOGRAPHY 204: STATISTICAL PROBLEM SOLVING IN GEOGRAPHY CHAPTER 3: DESCRIPTIVE STATISTICS AND GRAPHICS Descriptive statistics concise and easily understood summary of data set characteristics

More information

Ø Set of mutually exclusive categories. Ø Classify or categorize subject. Ø No meaningful order to categorization.

Ø Set of mutually exclusive categories. Ø Classify or categorize subject. Ø No meaningful order to categorization. Statistical Tools in Evaluation HPS 41 Dr. Joe G. Schmalfeldt Types of Scores Continuous Scores scores with a potentially infinite number of values. Discrete Scores scores limited to a specific number

More information

Governing Rules of Water Movement

Governing Rules of Water Movement Governing Rules of Water Movement Like all physical processes, the flow of water always occurs across some form of energy gradient from high to low e.g., a topographic (slope) gradient from high to low

More information

STATISTICS ANCILLARY SYLLABUS. (W.E.F. the session ) Semester Paper Code Marks Credits Topic

STATISTICS ANCILLARY SYLLABUS. (W.E.F. the session ) Semester Paper Code Marks Credits Topic STATISTICS ANCILLARY SYLLABUS (W.E.F. the session 2014-15) Semester Paper Code Marks Credits Topic 1 ST21012T 70 4 Descriptive Statistics 1 & Probability Theory 1 ST21012P 30 1 Practical- Using Minitab

More information

Learning Objectives for Stat 225

Learning Objectives for Stat 225 Learning Objectives for Stat 225 08/20/12 Introduction to Probability: Get some general ideas about probability, and learn how to use sample space to compute the probability of a specific event. Set Theory:

More information

Last Lecture. Distinguish Populations from Samples. Knowing different Sampling Techniques. Distinguish Parameters from Statistics

Last Lecture. Distinguish Populations from Samples. Knowing different Sampling Techniques. Distinguish Parameters from Statistics Last Lecture Distinguish Populations from Samples Importance of identifying a population and well chosen sample Knowing different Sampling Techniques Distinguish Parameters from Statistics Knowing different

More information

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages: Glossary The ISI glossary of statistical terms provides definitions in a number of different languages: http://isi.cbs.nl/glossary/index.htm Adjusted r 2 Adjusted R squared measures the proportion of the

More information

Types of spatial data. The Nature of Geographic Data. Types of spatial data. Spatial Autocorrelation. Continuous spatial data: geostatistics

Types of spatial data. The Nature of Geographic Data. Types of spatial data. Spatial Autocorrelation. Continuous spatial data: geostatistics The Nature of Geographic Data Types of spatial data Continuous spatial data: geostatistics Samples may be taken at intervals, but the spatial process is continuous e.g. soil quality Discrete data Irregular:

More information

FCE 3900 EDUCATIONAL RESEARCH LECTURE 8 P O P U L A T I O N A N D S A M P L I N G T E C H N I Q U E

FCE 3900 EDUCATIONAL RESEARCH LECTURE 8 P O P U L A T I O N A N D S A M P L I N G T E C H N I Q U E FCE 3900 EDUCATIONAL RESEARCH LECTURE 8 P O P U L A T I O N A N D S A M P L I N G T E C H N I Q U E OBJECTIVE COURSE Understand the concept of population and sampling in the research. Identify the type

More information

Chapter 2: Descriptive Analysis and Presentation of Single- Variable Data

Chapter 2: Descriptive Analysis and Presentation of Single- Variable Data Chapter 2: Descriptive Analysis and Presentation of Single- Variable Data Mean 26.86667 Standard Error 2.816392 Median 25 Mode 20 Standard Deviation 10.90784 Sample Variance 118.981 Kurtosis -0.61717 Skewness

More information

Part 7: Glossary Overview

Part 7: Glossary Overview Part 7: Glossary Overview In this Part This Part covers the following topic Topic See Page 7-1-1 Introduction This section provides an alphabetical list of all the terms used in a STEPS surveillance with

More information

Introduction to Statistics

Introduction to Statistics Introduction to Statistics By A.V. Vedpuriswar October 2, 2016 Introduction The word Statistics is derived from the Italian word stato, which means state. Statista refers to a person involved with the

More information

Statistics 301: Probability and Statistics Introduction to Statistics Module

Statistics 301: Probability and Statistics Introduction to Statistics Module Statistics 301: Probability and Statistics Introduction to Statistics Module 1 2018 Introduction to Statistics Statistics is a science, not a branch of mathematics, but uses mathematical models as essential

More information

Chapter 2: Tools for Exploring Univariate Data

Chapter 2: Tools for Exploring Univariate Data Stats 11 (Fall 2004) Lecture Note Introduction to Statistical Methods for Business and Economics Instructor: Hongquan Xu Chapter 2: Tools for Exploring Univariate Data Section 2.1: Introduction What is

More information

Introducing GIS analysis

Introducing GIS analysis 1 Introducing GIS analysis GIS analysis lets you see patterns and relationships in your geographic data. The results of your analysis will give you insight into a place, help you focus your actions, or

More information

Review of the Normal Distribution

Review of the Normal Distribution Sampling and s Normal Distribution Aims of Sampling Basic Principles of Probability Types of Random Samples s of the Mean Standard Error of the Mean The Central Limit Theorem Review of the Normal Distribution

More information

Examine characteristics of a sample and make inferences about the population

Examine characteristics of a sample and make inferences about the population Chapter 11 Introduction to Inferential Analysis Learning Objectives Understand inferential statistics Explain the difference between a population and a sample Explain the difference between parameter and

More information

Figure Figure

Figure Figure Figure 4-12. Equal probability of selection with simple random sampling of equal-sized clusters at first stage and simple random sampling of equal number at second stage. The next sampling approach, shown

More information

Descriptive Data Summarization

Descriptive Data Summarization Descriptive Data Summarization Descriptive data summarization gives the general characteristics of the data and identify the presence of noise or outliers, which is useful for successful data cleaning

More information

POPULATION AND SAMPLE

POPULATION AND SAMPLE 1 POPULATION AND SAMPLE Population. A population refers to any collection of specified group of human beings or of non-human entities such as objects, educational institutions, time units, geographical

More information

Biostatistics Presentation of data DR. AMEER KADHIM HUSSEIN M.B.CH.B.FICMS (COM.)

Biostatistics Presentation of data DR. AMEER KADHIM HUSSEIN M.B.CH.B.FICMS (COM.) Biostatistics Presentation of data DR. AMEER KADHIM HUSSEIN M.B.CH.B.FICMS (COM.) PRESENTATION OF DATA 1. Mathematical presentation (measures of central tendency and measures of dispersion). 2. Tabular

More information

Lecture 5: Sampling Methods

Lecture 5: Sampling Methods Lecture 5: Sampling Methods What is sampling? Is the process of selecting part of a larger group of participants with the intent of generalizing the results from the smaller group, called the sample, to

More information

Histograms, Central Tendency, and Variability

Histograms, Central Tendency, and Variability The Economist, September 6, 214 1 Histograms, Central Tendency, and Variability Lecture 2 Reading: Sections 5 5.6 Includes ALL margin notes and boxes: For Example, Guided Example, Notation Alert, Just

More information

Ø Set of mutually exclusive categories. Ø Classify or categorize subject. Ø No meaningful order to categorization.

Ø Set of mutually exclusive categories. Ø Classify or categorize subject. Ø No meaningful order to categorization. Statistical Tools in Evaluation HPS 41 Fall 213 Dr. Joe G. Schmalfeldt Types of Scores Continuous Scores scores with a potentially infinite number of values. Discrete Scores scores limited to a specific

More information

Probability and Inference. POLI 205 Doing Research in Politics. Populations and Samples. Probability. Fall 2015

Probability and Inference. POLI 205 Doing Research in Politics. Populations and Samples. Probability. Fall 2015 Fall 2015 Population versus Sample Population: data for every possible relevant case Sample: a subset of cases that is drawn from an underlying population Inference Parameters and Statistics A parameter

More information

Sampling Theory in Statistics Explained - SSC CGL Tier II Notes in PDF

Sampling Theory in Statistics Explained - SSC CGL Tier II Notes in PDF Sampling Theory in Statistics Explained - SSC CGL Tier II Notes in PDF The latest SSC Exam Dates Calendar is out. According to the latest update, SSC CGL Tier II Exam will be conducted from 18th to 20th

More information

Descriptive Statistics Methods of organizing and summarizing any data/information.

Descriptive Statistics Methods of organizing and summarizing any data/information. Introductory Statistics, 10 th ed. by Neil A. Weiss Chapter 1 The Nature of Statistics 1.1 Statistics Basics There are lies, damn lies, and statistics - Mark Twain Descriptive Statistics Methods of organizing

More information

Now we will define some common sampling plans and discuss their strengths and limitations.

Now we will define some common sampling plans and discuss their strengths and limitations. Now we will define some common sampling plans and discuss their strengths and limitations. 1 For volunteer samples individuals are self selected. Participants decide to include themselves in the study.

More information

A SHORT INTRODUCTION TO PROBABILITY

A SHORT INTRODUCTION TO PROBABILITY A Lecture for B.Sc. 2 nd Semester, Statistics (General) A SHORT INTRODUCTION TO PROBABILITY By Dr. Ajit Goswami Dept. of Statistics MDKG College, Dibrugarh 19-Apr-18 1 Terminology The possible outcomes

More information

Lesson 6 Population & Sampling

Lesson 6 Population & Sampling Lesson 6 Population & Sampling Lecturer: Dr. Emmanuel Adjei Department of Information Studies Contact Information: eadjei@ug.edu.gh College of Education School of Continuing and Distance Education 2014/2015

More information

Topic 3 Populations and Samples

Topic 3 Populations and Samples BioEpi540W Populations and Samples Page 1 of 33 Topic 3 Populations and Samples Topics 1. A Feeling for Populations v Samples 2 2. Target Populations, Sampled Populations, Sampling Frames 5 3. On Making

More information

FAQ: Linear and Multiple Regression Analysis: Coefficients

FAQ: Linear and Multiple Regression Analysis: Coefficients Question 1: How do I calculate a least squares regression line? Answer 1: Regression analysis is a statistical tool that utilizes the relation between two or more quantitative variables so that one variable

More information

Chapter 1. Looking at Data

Chapter 1. Looking at Data Chapter 1 Looking at Data Types of variables Looking at Data Be sure that each variable really does measure what you want it to. A poor choice of variables can lead to misleading conclusions!! For example,

More information

1. AN INTRODUCTION TO DESCRIPTIVE STATISTICS. No great deed, private or public, has ever been undertaken in a bliss of certainty.

1. AN INTRODUCTION TO DESCRIPTIVE STATISTICS. No great deed, private or public, has ever been undertaken in a bliss of certainty. CIVL 3103 Approximation and Uncertainty J.W. Hurley, R.W. Meier 1. AN INTRODUCTION TO DESCRIPTIVE STATISTICS No great deed, private or public, has ever been undertaken in a bliss of certainty. - Leon Wieseltier

More information

Lecturer: Dr. Adote Anum, Dept. of Psychology Contact Information:

Lecturer: Dr. Adote Anum, Dept. of Psychology Contact Information: Lecturer: Dr. Adote Anum, Dept. of Psychology Contact Information: aanum@ug.edu.gh College of Education School of Continuing and Distance Education 2014/2015 2016/2017 Session Overview In this Session

More information

Chapter 7: Statistics Describing Data. Chapter 7: Statistics Describing Data 1 / 27

Chapter 7: Statistics Describing Data. Chapter 7: Statistics Describing Data 1 / 27 Chapter 7: Statistics Describing Data Chapter 7: Statistics Describing Data 1 / 27 Categorical Data Four ways to display categorical data: 1 Frequency and Relative Frequency Table 2 Bar graph (Pareto chart)

More information

Experimental Design, Data, and Data Summary

Experimental Design, Data, and Data Summary Chapter Six Experimental Design, Data, and Data Summary Tests of Hypotheses Because science advances by tests of hypotheses, scientists spend much of their time devising ways to test hypotheses. There

More information

ST Presenting & Summarising Data Descriptive Statistics. Frequency Distribution, Histogram & Bar Chart

ST Presenting & Summarising Data Descriptive Statistics. Frequency Distribution, Histogram & Bar Chart ST2001 2. Presenting & Summarising Data Descriptive Statistics Frequency Distribution, Histogram & Bar Chart Summary of Previous Lecture u A study often involves taking a sample from a population that

More information

Glossary for the Triola Statistics Series

Glossary for the Triola Statistics Series Glossary for the Triola Statistics Series Absolute deviation The measure of variation equal to the sum of the deviations of each value from the mean, divided by the number of values Acceptance sampling

More information

Vehicle Freq Rel. Freq Frequency distribution. Statistics

Vehicle Freq Rel. Freq Frequency distribution. Statistics 1.1 STATISTICS Statistics is the science of data. This involves collecting, summarizing, organizing, and analyzing data in order to draw meaningful conclusions about the universe from which the data is

More information

Elements of probability theory

Elements of probability theory The role of probability theory in statistics We collect data so as to provide evidentiary support for answers we give to our many questions about the world (and in our particular case, about the business

More information

9/2/2010. Wildlife Management is a very quantitative field of study. throughout this course and throughout your career.

9/2/2010. Wildlife Management is a very quantitative field of study. throughout this course and throughout your career. Introduction to Data and Analysis Wildlife Management is a very quantitative field of study Results from studies will be used throughout this course and throughout your career. Sampling design influences

More information

Name: Lab Partner: Section: In this experiment error analysis and propagation will be explored.

Name: Lab Partner: Section: In this experiment error analysis and propagation will be explored. Chapter 2 Error Analysis Name: Lab Partner: Section: 2.1 Purpose In this experiment error analysis and propagation will be explored. 2.2 Introduction Experimental physics is the foundation upon which the

More information

ECON1310 Quantitative Economic and Business Analysis A

ECON1310 Quantitative Economic and Business Analysis A ECON1310 Quantitative Economic and Business Analysis A Topic 1 Descriptive Statistics 1 Main points - Statistics descriptive collecting/presenting data; inferential drawing conclusions from - Data types

More information

MIDTERM EXAMINATION (Spring 2011) STA301- Statistics and Probability

MIDTERM EXAMINATION (Spring 2011) STA301- Statistics and Probability STA301- Statistics and Probability Solved MCQS From Midterm Papers March 19,2012 MC100401285 Moaaz.pk@gmail.com Mc100401285@gmail.com PSMD01 MIDTERM EXAMINATION (Spring 2011) STA301- Statistics and Probability

More information

GEOGRAPHY 204: STATISTICAL PROBLEM SOLVING IN GEOGRAPHY

GEOGRAPHY 204: STATISTICAL PROBLEM SOLVING IN GEOGRAPHY GEOGRAPHY 204: STATISTICAL PROBLEM SOLVING IN GEOGRAPHY CHAPTER 2: GEOGRAPHIC DATA Primary data - acquired directly from the original source In situ or in the field Costly time and money! Campus bike racks

More information

Quality and Coverage of Data Sources

Quality and Coverage of Data Sources Quality and Coverage of Data Sources Objectives Selecting an appropriate source for each item of information to be stored in the GIS database is very important for GIS Data Capture. Selection of quality

More information

TECH 646 Analysis of Research in Industry and Technology

TECH 646 Analysis of Research in Industry and Technology TECH 646 Analysis of Research in Industry and Technology PART III The Sources and Collection of data: Measurement, Measurement Scales, Questionnaires & Instruments, Sampling Ch. 14 Sampling Lecture note

More information

Statistical Methods: Introduction, Applications, Histograms, Ch

Statistical Methods: Introduction, Applications, Histograms, Ch Outlines Statistical Methods: Introduction, Applications, Histograms, Characteristics November 4, 2004 Outlines Part I: Statistical Methods: Introduction and Applications Part II: Statistical Methods:

More information

1-1. Chapter 1. Sampling and Descriptive Statistics by The McGraw-Hill Companies, Inc. All rights reserved.

1-1. Chapter 1. Sampling and Descriptive Statistics by The McGraw-Hill Companies, Inc. All rights reserved. 1-1 Chapter 1 Sampling and Descriptive Statistics 1-2 Why Statistics? Deal with uncertainty in repeated scientific measurements Draw conclusions from data Design valid experiments and draw reliable conclusions

More information

MATH 10 INTRODUCTORY STATISTICS

MATH 10 INTRODUCTORY STATISTICS MATH 10 INTRODUCTORY STATISTICS Tommy Khoo Your friendly neighbourhood graduate student. Week 1 Chapter 1 Introduction What is Statistics? Why do you need to know Statistics? Technical lingo and concepts:

More information

Sets and Set notation. Algebra 2 Unit 8 Notes

Sets and Set notation. Algebra 2 Unit 8 Notes Sets and Set notation Section 11-2 Probability Experimental Probability experimental probability of an event: Theoretical Probability number of time the event occurs P(event) = number of trials Sample

More information

Types of Information. Topic 2 - Descriptive Statistics. Examples. Sample and Sample Size. Background Reading. Variables classified as STAT 511

Types of Information. Topic 2 - Descriptive Statistics. Examples. Sample and Sample Size. Background Reading. Variables classified as STAT 511 Topic 2 - Descriptive Statistics STAT 511 Professor Bruce Craig Types of Information Variables classified as Categorical (qualitative) - variable classifies individual into one of several groups or categories

More information

Review for Exam #1. Chapter 1. The Nature of Data. Definitions. Population. Sample. Quantitative data. Qualitative (attribute) data

Review for Exam #1. Chapter 1. The Nature of Data. Definitions. Population. Sample. Quantitative data. Qualitative (attribute) data Review for Exam #1 1 Chapter 1 Population the complete collection of elements (scores, people, measurements, etc.) to be studied Sample a subcollection of elements drawn from a population 11 The Nature

More information

REVIEW: Midterm Exam. Spring 2012

REVIEW: Midterm Exam. Spring 2012 REVIEW: Midterm Exam Spring 2012 Introduction Important Definitions: - Data - Statistics - A Population - A census - A sample Types of Data Parameter (Describing a characteristic of the Population) Statistic

More information

Destination Math California Intervention

Destination Math California Intervention Destination Math California Intervention correlated to the California Intervention 4 7 s McDougal Littell Riverdeep STANDARDS MAPS for a Mathematics Intervention Program (Grades 4-7) The standards maps

More information

Hypothesis Testing hypothesis testing approach

Hypothesis Testing hypothesis testing approach Hypothesis Testing In this case, we d be trying to form an inference about that neighborhood: Do people there shop more often those people who are members of the larger population To ascertain this, we

More information

Louisiana Transportation Engineering Conference. Monday, February 12, 2007

Louisiana Transportation Engineering Conference. Monday, February 12, 2007 Louisiana Transportation Engineering Conference Monday, February 12, 2007 Agenda Project Background Goal of EIS Why Use GIS? What is GIS? How used on this Project Other site selection tools I-69 Corridor

More information

BIOL 51A - Biostatistics 1 1. Lecture 1: Intro to Biostatistics. Smoking: hazardous? FEV (l) Smoke

BIOL 51A - Biostatistics 1 1. Lecture 1: Intro to Biostatistics. Smoking: hazardous? FEV (l) Smoke BIOL 51A - Biostatistics 1 1 Lecture 1: Intro to Biostatistics Smoking: hazardous? FEV (l) 1 2 3 4 5 No Yes Smoke BIOL 51A - Biostatistics 1 2 Box Plot a.k.a box-and-whisker diagram or candlestick chart

More information

Statistics Statistical Process Control & Control Charting

Statistics Statistical Process Control & Control Charting Statistics Statistical Process Control & Control Charting Cayman Systems International 1/22/98 1 Recommended Statistical Course Attendance Basic Business Office, Staff, & Management Advanced Business Selected

More information

Module 16. Sampling and Sampling Distributions: Random Sampling, Non Random Sampling

Module 16. Sampling and Sampling Distributions: Random Sampling, Non Random Sampling Module 16 Sampling and Sampling Distributions: Random Sampling, Non Random Sampling Principal Investigator Co-Principal Investigator Paper Coordinator Content Writer Prof. S P Bansal Vice Chancellor Maharaja

More information

Field data acquisition

Field data acquisition Lesson : Primary sources Unit 3: Field data B-DC Lesson / Unit 3 Claude Collet D Department of Geosciences - Geography Content of Lesson Unit 1: Unit : Unit 3: Unit 4: Digital sources Remote sensing Field

More information

QUANTITATIVE DATA. UNIVARIATE DATA data for one variable

QUANTITATIVE DATA. UNIVARIATE DATA data for one variable QUANTITATIVE DATA Recall that quantitative (numeric) data values are numbers where data take numerical values for which it is sensible to find averages, such as height, hourly pay, and pulse rates. UNIVARIATE

More information

Introduction to Basic Statistics Version 2

Introduction to Basic Statistics Version 2 Introduction to Basic Statistics Version 2 Pat Hammett, Ph.D. University of Michigan 2014 Instructor Comments: This document contains a brief overview of basic statistics and core terminology/concepts

More information

Chapter 4. Displaying and Summarizing. Quantitative Data

Chapter 4. Displaying and Summarizing. Quantitative Data STAT 141 Introduction to Statistics Chapter 4 Displaying and Summarizing Quantitative Data Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter 2015 1 / 31 4.1 Histograms 1 We divide the range

More information

Outline. Geographic Information Analysis & Spatial Data. Spatial Analysis is a Key Term. Lecture #1

Outline. Geographic Information Analysis & Spatial Data. Spatial Analysis is a Key Term. Lecture #1 Geographic Information Analysis & Spatial Data Lecture #1 Outline Introduction Spatial Data Types: Objects vs. Fields Scale of Attribute Measures GIS and Spatial Analysis Spatial Analysis is a Key Term

More information

Unit 4 Probability. Dr Mahmoud Alhussami

Unit 4 Probability. Dr Mahmoud Alhussami Unit 4 Probability Dr Mahmoud Alhussami Probability Probability theory developed from the study of games of chance like dice and cards. A process like flipping a coin, rolling a die or drawing a card from

More information

Part III: Unstructured Data. Lecture timetable. Analysis of data. Data Retrieval: III.1 Unstructured data and data retrieval

Part III: Unstructured Data. Lecture timetable. Analysis of data. Data Retrieval: III.1 Unstructured data and data retrieval Inf1-DA 2010 20 III: 28 / 89 Part III Unstructured Data Data Retrieval: III.1 Unstructured data and data retrieval Statistical Analysis of Data: III.2 Data scales and summary statistics III.3 Hypothesis

More information

University of Jordan Fall 2009/2010 Department of Mathematics

University of Jordan Fall 2009/2010 Department of Mathematics handouts Part 1 (Chapter 1 - Chapter 5) University of Jordan Fall 009/010 Department of Mathematics Chapter 1 Introduction to Introduction; Some Basic Concepts Statistics is a science related to making

More information

Monitoring Design: Study Area,

Monitoring Design: Study Area, Monitoring Design: Study Area, Reporting Units, Stratification Owyhee Canyonlands - Photo: Scott Carter What do you mean by monitoring design? Loosely defined term that encompasses technical aspects of

More information

What is sampling? shortcut whole population small part Why sample? not enough; time, energy, money, labour/man power, equipment, access measure

What is sampling? shortcut whole population small part Why sample? not enough; time, energy, money, labour/man power, equipment, access measure What is sampling? A shortcut method for investigating a whole population Data is gathered on a small part of the whole parent population or sampling frame, and used to inform what the whole picture is

More information

GLOSSARY. a n + n. a n 1 b + + n. a n r b r + + n C 1. C r. C n

GLOSSARY. a n + n. a n 1 b + + n. a n r b r + + n C 1. C r. C n GLOSSARY A absolute cell referencing A spreadsheet feature that blocks automatic adjustment of cell references when formulas are moved or copied. References preceded by a dollar sign $A$1, for example

More information

APS Eighth Grade Math District Benchmark Assessment NM Math Standards Alignment

APS Eighth Grade Math District Benchmark Assessment NM Math Standards Alignment EIGHTH GRADE NM STANDARDS Strand: NUMBER AND OPERATIONS Standard: Students will understand numerical concepts and mathematical operations. 5-8 Benchmark N.: Understand numbers, ways of representing numbers,

More information

AP Statistics Cumulative AP Exam Study Guide

AP Statistics Cumulative AP Exam Study Guide AP Statistics Cumulative AP Eam Study Guide Chapters & 3 - Graphs Statistics the science of collecting, analyzing, and drawing conclusions from data. Descriptive methods of organizing and summarizing statistics

More information

Chapter 3. Data Description

Chapter 3. Data Description Chapter 3. Data Description Graphical Methods Pie chart It is used to display the percentage of the total number of measurements falling into each of the categories of the variable by partition a circle.

More information

Chapter. Organizing and Summarizing Data. Copyright 2013, 2010 and 2007 Pearson Education, Inc.

Chapter. Organizing and Summarizing Data. Copyright 2013, 2010 and 2007 Pearson Education, Inc. Chapter 2 Organizing and Summarizing Data Section 2.1 Organizing Qualitative Data Objectives 1. Organize Qualitative Data in Tables 2. Construct Bar Graphs 3. Construct Pie Charts When data is collected

More information

CHAPTER 8 INTRODUCTION TO STATISTICAL ANALYSIS

CHAPTER 8 INTRODUCTION TO STATISTICAL ANALYSIS CHAPTER 8 INTRODUCTION TO STATISTICAL ANALYSIS LEARNING OBJECTIVES: After studying this chapter, a student should understand: notation used in statistics; how to represent variables in a mathematical form

More information

Inferential Statistics. Chapter 5

Inferential Statistics. Chapter 5 Inferential Statistics Chapter 5 Keep in Mind! 1) Statistics are useful for figuring out random noise from real effects. 2) Numbers are not absolute, and they can be easily manipulated. 3) Always scrutinize

More information

CHOOSING THE RIGHT SAMPLING TECHNIQUE FOR YOUR RESEARCH. Awanis Ku Ishak, PhD SBM

CHOOSING THE RIGHT SAMPLING TECHNIQUE FOR YOUR RESEARCH. Awanis Ku Ishak, PhD SBM CHOOSING THE RIGHT SAMPLING TECHNIQUE FOR YOUR RESEARCH Awanis Ku Ishak, PhD SBM Sampling The process of selecting a number of individuals for a study in such a way that the individuals represent the larger

More information

3. When a researcher wants to identify particular types of cases for in-depth investigation; purpose less to generalize to larger population than to g

3. When a researcher wants to identify particular types of cases for in-depth investigation; purpose less to generalize to larger population than to g Chapter 7: Qualitative and Quantitative Sampling Introduction Quantitative researchers more concerned with sampling; primary goal to get a representative sample (smaller set of cases a researcher selects

More information

Statistics 511 Additional Materials

Statistics 511 Additional Materials Graphical Summaries Consider the following data x: 78, 24, 57, 39, 28, 30, 29, 18, 102, 34, 52, 54, 57, 82, 90, 94, 38, 59, 27, 68, 61, 39, 81, 43, 90, 40, 39, 33, 42, 15, 88, 94, 50, 66, 75, 79, 83, 34,31,36,

More information

Remote Sensing and Geospatial Application for Wetlands Mapping, Assessment, and Mitigation

Remote Sensing and Geospatial Application for Wetlands Mapping, Assessment, and Mitigation Remote Sensing and Geospatial Application for Wetlands Mapping, Assessment, and Mitigation Hydrology Soils MSU Seminar Series Remote Sensing and Geospatial Applications September 4, 2002 Vegetation NEPA

More information

a table or a graph or an equation.

a table or a graph or an equation. Topic (8) POPULATION DISTRIBUTIONS 8-1 So far: Topic (8) POPULATION DISTRIBUTIONS We ve seen some ways to summarize a set of data, including numerical summaries. We ve heard a little about how to sample

More information

ECLT 5810 Data Preprocessing. Prof. Wai Lam

ECLT 5810 Data Preprocessing. Prof. Wai Lam ECLT 5810 Data Preprocessing Prof. Wai Lam Why Data Preprocessing? Data in the real world is imperfect incomplete: lacking attribute values, lacking certain attributes of interest, or containing only aggregate

More information