Quantitative Methods Geography 441. Course Requirements

Similar documents
1 Descriptive statistics. 2 Scores and probability distributions. 3 Hypothesis testing and one-sample t-test. 4 More on t-tests

Contingency Tables. Safety equipment in use Fatal Non-fatal Total. None 1, , ,128 Seat belt , ,878

Harvard University. Rigorous Research in Engineering Education

Module 03 Lecture 14 Inferential Statistics ANOVA and TOI

POLI 443 Applied Political Research

Ordinary Least Squares Regression Explained: Vartanian

Mathematical Notation Math Introduction to Applied Statistics

Black White Total Observed Expected χ 2 = (f observed f expected ) 2 f expected (83 126) 2 ( )2 126

Topic 21 Goodness of Fit

Correlation Examining the relationship between interval-ratio variables

Ordinary Least Squares Regression Explained: Vartanian

The t-statistic. Student s t Test

STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots. March 8, 2015

PSYC 331 STATISTICS FOR PSYCHOLOGISTS

AGEC 621 Lecture 16 David Bessler

Nature of Spatial Data. Outline. Spatial Is Special

Inferential statistics

Sampling Distributions

Types of spatial data. The Nature of Geographic Data. Types of spatial data. Spatial Autocorrelation. Continuous spatial data: geostatistics

Chapter 8 Handout: Interval Estimates and Hypothesis Testing

Contingency Tables. Contingency tables are used when we want to looking at two (or more) factors. Each factor might have two more or levels.

Lecture 5: Sampling Methods

Lab #12: Exam 3 Review Key

Quantitative Analysis and Empirical Methods

Lab #11. Variable B. Variable A Y a b a+b N c d c+d a+c b+d N = a+b+c+d

Originality in the Arts and Sciences: Lecture 2: Probability and Statistics

Using Microsoft Excel

Is there a connection between gender, maths grade, hair colour and eye colour? Contents

Data Collection: What Is Sampling?

23. MORE HYPOTHESIS TESTING

THE PEARSON CORRELATION COEFFICIENT

Lecture 11: Two Way Analysis of Variance

Lecture 8. Spatial Estimation

Descriptive Statistics

Mock Exam - 2 hours - use of basic (non-programmable) calculator is allowed - all exercises carry the same marks - exam is strictly individual

College Physics II Lab 5: Equipotential Lines

Statistics for Managers Using Microsoft Excel

Psych 230. Psychological Measurement and Statistics

Lecture 28 Chi-Square Analysis

16.400/453J Human Factors Engineering. Design of Experiments II

Module 8: Linear Regression. The Applied Research Center

Final Exam STAT On a Pareto chart, the frequency should be represented on the A) X-axis B) regression C) Y-axis D) none of the above

Chapter 7 Comparison of two independent samples

Now we will define some common sampling plans and discuss their strengths and limitations.

Approach to Field Research Data Generation and Field Logistics Part 1. Road Map 8/26/2016

Chapter 6. Logistic Regression. 6.1 A linear model for the log odds

Regression With a Categorical Independent Variable: Mean Comparisons

Applied Statistics for the Behavioral Sciences

Business Analytics and Data Mining Modeling Using R Prof. Gaurav Dixit Department of Management Studies Indian Institute of Technology, Roorkee

Lecture 5 Geostatistics

Data analysis and Geostatistics - lecture VI

Mathematical Logic Prof. Arindama Singh Department of Mathematics Indian Institute of Technology, Madras. Lecture - 15 Propositional Calculus (PC)

ECO220Y Review and Introduction to Hypothesis Testing Readings: Chapter 12

STATISTICS 141 Final Review

WELCOME! Lecture 13 Thommy Perlinger

LABORATORY 4: ROTATIONAL MOTION PLAYGROUND DYNAMICS: THE MERRY-GO-ROUND Written May-June 1993 by Melissa Wafer '95

Your schedule of coming weeks. One-way ANOVA, II. Review from last time. Review from last time /22/2004. Create ANOVA table

Lecture 9. Selected material from: Ch. 12 The analysis of categorical data and goodness of fit tests

Normal (Gaussian) distribution The normal distribution is often relevant because of the Central Limit Theorem (CLT):

S12 - HS Regression Labs Workshop. Linear. Quadratic (not required) Logarithmic. Exponential. Power

Sampling. Module II Chapter 3

Weldon s dice. Lecture 15 - χ 2 Tests. Labby s dice. Labby s dice (cont.)

An inferential procedure to use sample data to understand a population Procedures

Confidence intervals

Chapter Eight: Assessment of Relationships 1/42

Regression, part II. I. What does it all mean? A) Notice that so far all we ve done is math.

Last week: Sample, population and sampling distributions finished with estimation & confidence intervals

EXPERIMENTAL UNCERTAINTY

Multiple Regression Analysis

Laboratory Topics 4 & 5

Lectures of STA 231: Biostatistics

Do students sleep the recommended 8 hours a night on average?

Chi Square Analysis M&M Statistics. Name Period Date

Hypothesis Testing. We normally talk about two types of hypothesis: the null hypothesis and the research or alternative hypothesis.

Lecture 41 Sections Mon, Apr 7, 2008

Lecturer: Dr. Adote Anum, Dept. of Psychology Contact Information:

Rama Nada. -Ensherah Mokheemer. 1 P a g e

Syllabus. Physics 0847, How Things Work Section II Fall 2014

Statistical Analysis for QBIC Genetics Adapted by Ellen G. Dow 2017

Cornell Science Inquiry Partnerships

10/31/2012. One-Way ANOVA F-test

The t-test: A z-score for a sample mean tells us where in the distribution the particular mean lies

Hypothesis Testing hypothesis testing approach

Two-Sample Inferential Statistics

Introduction to the Analysis of Variance (ANOVA) Computing One-Way Independent Measures (Between Subjects) ANOVAs

Chapter 7. Inference for Distributions. Introduction to the Practice of STATISTICS SEVENTH. Moore / McCabe / Craig. Lecture Presentation Slides

Answer keys for Assignment 10: Measurement of study variables (The correct answer is underlined in bold text)

Applied Multivariate Statistical Modeling Prof. J. Maiti Department of Industrial Engineering and Management Indian Institute of Technology, Kharagpur

Mathematical Statistics

GEO391 (Statistical Data Analysis for GIS) Winter Quarter 2015 Department of Geography DePaul University

PHYSICS 15a, Fall 2006 SPEED OF SOUND LAB Due: Tuesday, November 14

Lecture 21: October 19

MATH Notebook 3 Spring 2018

Last two weeks: Sample, population and sampling distributions finished with estimation & confidence intervals

Two sample Hypothesis tests in R.

SIGNIFICANT FIGURES BEGIN

Mean Vector Inferences

experiment3 Introduction to Data Analysis

Occupancy models. Gurutzeta Guillera-Arroita University of Kent, UK National Centre for Statistical Ecology

Mr. Stein s Words of Wisdom

Transcription:

Quantitative Methods Geography 441 Course Requirements

Equipment: 1. Calculator with statistical functions 2. Three-ring binder 3. A thumb dive 4. Textbook: Statistical Methods for Geography. By Peter A. Rogerson (latest edition)

Exercises Must be handed in on time. Must be TYPED. Must follow the format provided. Some slight variation if fine, as long as it is legible. IF you use Microsoft WORD, get used to working with the EQUATION EDITOR.

The Tests There are 2 test in this class divided into 2 sections: 1. An in-class test 2. A take home test The in-class test 1. Standard format. 2. Bring a calculator to the tests. The take home test will: 1. Focus on using the computer. 2. Allow you to work in groups. 3. Come BEFORE the in-class test. 4. Hand it in ON the in-class test day at the beginning of class.

Responsibilities 1. All work MUST be original. That means no cutting and pasting of other people s work, even if you are in the same group. 2. You must know how to perform all statistical tests by hand and by computer. 3. Hand in all material on time and in the correct format.

Answer Format The answer format is to be used on ALL material that you hand in. The reason for this is simple: 1. Makes it easier for me to read and grade. 2. Makes it easier for you to be sure you have included all relevant information.

Quantitative Methods (Geo 441) Dr. Paul Marr The formats below are to be used when completing the exercises and tests for this course. Format for Hand Calculation: Research Question: One hundred and fifty students in Geo 441 were surveyed to determine if the use of the correct answer format was associated with student survival. H o : Student survival is independent of using the correct answer format. H A : Student survival is contingent upon using the correct answer format. σ = 0.05 Clearly stated research question Null and alternate hypotheses statements Alpha level Observed Dead Alive Total Clearly typed problem Used Format 17 54 71 Did Not Use Format 67 12 79 Total 84 66 150 Expected Dead Alive Used Format 39.8 31.2 Did Not Use Format 44.2 34.8 2 2 2 2 17 39.8 54 31.2 67 44.2 12 34.8 39.8 χ 2 = 56.4 Df = 1 χ 2 c 0.05, 1 = 3.841 Therefore, reject H o. 31.2 44.2 Student survival is contingent upon using the correct answer format when completing Geography 441 exercises (χ 2 = 56.4, p < 0.001). 34.8 2 56.4 Clearly typed answer Listing of the test statistic, degrees of freedom, and critical value Statement of the results, test statistic, and probability

Additional topics to cover: 1. Web page navigation. a) Lecture notes. b) SPSS lab and mini lab sheets. c) Exercises. d) Statistical tables. e) Statistical examples. f) Other information. 2. Using Word Equation Editor.

The Scientific Method

For the record The term data is plural. Collecting these data required considerable personal expenditure.

The scientific method is used because it is defensible AND repeatable. The scientific method is a defensible technique that helps to minimize external influences. Statistical analyses are repeatable. The scientific method consists of hypotheses, models, laws, and theories.

Hypothesis a proposition or statement whose truth or falsity is capable of being tested. Models a means of simplifying reality so that relationships between variables can be more clearly studied. Laws universal statements of unrestricted range. Theories a collection of laws that gives greater insight than produced by individual laws.

Spatial Data Considerations

Spatial Sampling Sample frame: the list of all elements within a population. e.g. All townships within a county. Sampling method: the rules which govern how samples of a population are taken. The three most common are: 1. Random 2. Stratified random 3. Stratified systematic

Random sampling is often used when it is fairly certain that location does not influence independence. Stratified random sampling is often used when location does influence independence and the degree of influence is known and can be mapped. Stratified systematic sampling is often used when location does influence independence but the degree of influence is unknown and cannot be mapped.

Special Considerations with Spatial Data 1. Modifiable areal unit problem 2. Boundary problem 3. Spatial sampling procedures 4. Spatial autocorrelation

1. Modifiable areal unit problem: Changes in areal units influence test results. Cross boundary migration No cross boundary migration

2. Boundary problem Changes in the study area boundary influence test results. Small study area, even spread of points Large study area, clustering of points

3. Spatial sampling procedures Sampling methods influence test results. Random Stratified, Random Stratified, Regular

4. Spatial autocorrelation Location near each other often (but not always) influence each other. Non-independence. These locations have a greater chance of being similar since they are close together.

Spatial Data and Non-independence One and two sample tests assume that the data are independent (i.e. one observation does not influence another). Unfortunately, with spatial data one observation often does influence another. For example, two ponds near each other may have have frogs because they both are located in the correct microclimate.

Implications of Non-independence Non-independence reduces the effective sample size. Inflates the degrees of freedom. Reduces the critical value. Artificially increases the likelihood of getting significant results.

C In this example, A and B are study sites. C is a very distinct habitat type. A B Habitat C influences the observation values and makes testing between A and B difficult. Better to test between A, B, and C.

Spatial Unit Size and Probability Problem Probabilities tend to be larger with larger spatial units. - The probability that an event will occur is higher with larger spatial units. - Small areal units tend to have very low frequencies for random events unless the event is extremely common. - Under these conditions a normal distribution is almost impossible to attain.

The chance of a state having a dot in it is high. The chance of a county having a dot in it is low.

Small Frequency and Small Spatial Unit Problem Frequencies tend to be smaller with smaller spatial units. - The frequency with which an event occurs decreases as the spatial units decreases, all other things being equal.

21 out of 50 states have dots. (21/50)x100 = 42% 25 out of 3140 counties have dots. (25/3140)x100 = 0.8%

Additional Issues with Spatial Data 1. Ecological Fallacy Interpretation errors that occur when results derived from aggregate data are applied to disaggregate data. It is something akin to trying to determine someone s grade in a class by looking at the class average.

2. The Problem of Scale Scale only has meaning within the context of the measuring techniques applied and the questions being asked. Does it make sense to measure percent forest cover to 5 decimal places? Are we creating differences where none actually exist?

3. Locational Fallacy Not all processes have a spatial expression. Certain phenomenon are aspatial. For example, burglaries have a spatial component and can be mapped. Embezzlement does not.

4. Atomistic Fallacy An attempt to use individual or disaggregate data to explain aggregate data while ignoring the context of the individuals. For example, if we notice that no one is smoking in a bar, then we may conclude that people are smoking less or it could be a smoke-free bar.