Overview of Spatial analysis in ecology
- Austin Cameron
- 5 years ago
Spatial Point Patterns & Complete Spatial Randomness - II
Geog 210C Introduction to Spatial Data Analysis
Chris Funk
Lecture 8

Overview of Spatial analysis in ecology
- The 1st step in understanding an ecological process is to identify patterns
- Spatial auto-correlation might indicate patterns or processes
- Processes can operate on multiple scales, patches, gradients
- Auto-correlation may be spurious, interpolative, true or induced
  - True = caused by interaction among neighboring locations
  - Induced = caused by a causal relationship with another correlated variable(s) which itself is auto-correlated
- Nearest neighbor distances (average $d_i$)
  - Under CSR, counts follow a Poisson distribution, and average $d_i$ follows a Weibull distribution
  - $E(\text{average } d_i) = c_i (A/n)^{0.5}$, where A = area, n = number of points, and $c_i$ = a constant which varies as a function of the i-th neighbor
- Ripley's K function
  - Under CSR, the expected # of points within distance d of an event is $\lambda \pi d^2$, where d is the distance lag
- Ripley's L(d) function linearizes K and stabilizes the variances
  - $L(d) = (K(d)/\pi)^{0.5} - d$
  - Under CSR E(L(d)) = 0; positive values imply clustering, negative values imply stratification

Complete Spatial Randomness (CSR)
Loose definition
A spatial process, here a spatial point process, serving as a generating mechanism of spatial point patterns, with the following characteristics:
- Intensity $\lambda$ (mean # of events per unit area) is constant in any subregion s of the study domain D: no environmental or first-order effects
- Position or occurrence of any event is independent of the occurrence of any other event: no event-to-event interaction or second-order effects
Two versions of CSR point process models
- Binomial point process: there are n events in study domain D, which are located at random
- Poisson point process: the number of events n is a realization from a Poisson distribution; once a realization $n_l$ of n is generated, these $n_l$ points are located at random within D
For a Poisson point process, the number of events n in study region D varies from realization to realization, whereas this number is fixed for a Binomial point process. In other words, if we generate L sets of simulated point patterns from a Poisson point process, the numbers of events will generally differ across the L realizations; for the Binomial process, these L numbers will all be the same.

Homogeneous Poisson Point Process
Formal definition
- The number of events y = y(s), a count, within an arbitrary subregion s with area |s| is a realization of a random variable Y = Y(s) with a Poisson PDF: $P\{Y(s) = y\} = \frac{(\lambda |s|)^{y} e^{-\lambda |s|}}{y!}$, for y = 0, 1, 2, ...
- Any two RVs Y(s) and Y(s') defined over two non-overlapping subregions s and s' are independent
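
The difference between the two CSR models can be illustrated with a minimal Python sketch. The function names, the 10 x 10 rectangular domain and the intensity of 0.5 are assumptions chosen for illustration only, and the Poisson draw uses Knuth's multiplication method, which is not something the lecture prescribes.

```python
import math
import random


def poisson_draw(mean, rng):
    """Draw one Poisson count with Knuth's multiplication method."""
    limit = math.exp(-mean)
    count, product = 0, rng.random()
    while product > limit:
        count += 1
        product *= rng.random()
    return count


def binomial_pattern(n, width, height, rng):
    """Place exactly n events uniformly at random in a width x height rectangle."""
    return [(rng.uniform(0, width), rng.uniform(0, height)) for _ in range(n)]


def poisson_pattern(intensity, width, height, rng):
    """Draw the number of events from a Poisson distribution, then place them at random."""
    n = poisson_draw(intensity * width * height, rng)
    return binomial_pattern(n, width, height, rng)


rng = random.Random(0)
binomial_counts = [len(binomial_pattern(50, 10.0, 10.0, rng)) for _ in range(5)]
poisson_counts = [len(poisson_pattern(0.5, 10.0, 10.0, rng)) for _ in range(5)]
print("Binomial counts:", binomial_counts)  # fixed n in every realization
print("Poisson counts: ", poisson_counts)   # n is itself random
```

Both models share the same expected count here (0.5 x 100 = 50); only the Poisson version lets the realized count fluctuate around it.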

Homogeneous Poisson Point Process: Simulation (I)
Setting
- Consider a study region D of size |D| = 100x100 and an overall intensity $\lambda$ = 0.01, leading to an expected count of $E\{Y(D)\} = \lambda |D|$ = 100 events within D.
- Let D be partitioned into Q = 25 square quadrats of equal size $|s_q|$ = 20x20, for all q. One can now define a set of Q = 25 random variables $\{Y(s_q), q = 1, \ldots, Q\}$, one per quadrat.
- Under CSR, the RV Y(s) associated with any quadrat has an expected count of $E\{Y(s)\} = \lambda |s|$ = 4 events (per 1st-order stationarity), and counts across different quadrats are independent.
Objective
- Generate a realization (a point pattern) from a homogeneous Poisson process; in other words, simulate counts from the Q = 25 RVs $\{Y(s_q); q = 1, \ldots, Q\}$. Once a count y(s) is simulated for quadrat s, y(s) events (points) are placed at random within s.
- Since E{Y(s)} = 4, we need to generate, on average, 4 events within any quadrat s. Since counts across quadrats are independent, simulated events within s do not influence the generation of events outside s.
- All this amounts to zooming in to a particular quadrat s, generating a count y(s) from a Poisson distribution with mean E{Y(s)} = 4, and then repeating for all Q quadrats.
- This is the same as generating, on average, 100 events randomly within D from a RV Y(D) with a Poisson distribution with mean $E\{Y(D)\} = \lambda |D|$ = 100; then, y(D) would denote a simulated count over D.

Homogeneous Poisson Point Process: Flowchart (II)
Let L be the number of realizations (alternative point patterns) to generate, and $n_l$ be the number of events of the (to be) simulated point pattern in the l-th realization (using the previous notation, $n_l \equiv y(D)$).
1. Generate L numbers (counts) $\{n_l; l = 1, \ldots, L\}$ from a Poisson distribution with mean $\lambda |D|$ (= expected # of events); these L counts serve as numbers of events for the point patterns to be simulated.
2. For the l-th realization, simulate the locations of $n_l$ events in D, by generating $n_l$ values of x- and y-coordinates, independent and uniformly distributed along the two sides of a rectangle enclosing D.
3. Reject any events that do not lie in D, and repeat step 2 until $n_l$ events are obtained within D; steps 2 & 3 constitute a realization from a Binomial process with $n_l$ events.
4. Repeat steps 2 and 3 with another # of events $n_{l'}$, to generate another realization, i.e., the l'-th simulated point pattern.
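
The flowchart steps can be sketched in Python as follows. This is only an illustration under stated assumptions: the study domain D is taken to be a hypothetical disc of radius 50 inside a 100 x 100 enclosing rectangle (so that the rejection step has something to reject), the intensity of 0.01 and L = 200 are arbitrary, and all function names are made up for this example.

```python
import math
import random

RADIUS = 50.0           # hypothetical circular domain D ...
CENTER = (50.0, 50.0)   # ... enclosed by the rectangle [0, 100] x [0, 100]


def poisson_draw(mean, rng):
    """Draw one Poisson count with Knuth's multiplication method."""
    limit = math.exp(-mean)
    count, product = 0, rng.random()
    while product > limit:
        count += 1
        product *= rng.random()
    return count


def in_domain(x, y):
    """True if the location (x, y) lies inside the circular domain D."""
    return math.hypot(x - CENTER[0], y - CENTER[1]) <= RADIUS


def simulate_realization(intensity, rng):
    """One realization of a homogeneous Poisson process on D."""
    domain_area = math.pi * RADIUS ** 2
    n_events = poisson_draw(intensity * domain_area, rng)  # step 1: draw the count
    events = []
    while len(events) < n_events:
        # step 2: uniform coordinates along the sides of the enclosing rectangle
        x, y = rng.uniform(0.0, 100.0), rng.uniform(0.0, 100.0)
        # step 3: keep the event only if it falls inside D
        if in_domain(x, y):
            events.append((x, y))
    return events


rng = random.Random(42)
patterns = [simulate_realization(0.01, rng) for _ in range(200)]  # step 4: repeat L times
counts = [len(p) for p in patterns]
print("mean simulated count:", sum(counts) / len(counts))
print("expected count:      ", 0.01 * math.pi * RADIUS ** 2)
```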

Realizations from a Binomial Point Process
[Figure: two realizations from a Binomial spatial point process with n = 50 events]
- Events can appear clustered, but this is due to chance
- If 1st-order effects were present, i.e., if $\lambda$ varied through the study region, more events should appear at the same places from one realization to another; hence, clusters would be formed around high intensity areas in each realization, even if no interaction was included in the model
- If strong 2nd-order effects were present, events would appear clustered in every realization; such clusters, however, would appear in different places from one realization to another if no 1st-order effects were present

Sampling Distribution of a Statistic Under CSR (I)
Sample statistic
Mean event-to-nearest-event (ENE) distance; here the variable of interest is the distance (ENE) between any event and its nearest neighbor event, and the selected summary statistic is the mean of those distances: $\bar{d}_{min} = \frac{1}{n} \sum_{i=1}^{n} d_{min}(u_i)$
Constructing the sampling distribution of mean ENE via simulation
1. Adopt a null hypothesis, here CSR, as a mechanism for generating point patterns; that null hypothesis also includes the parameters, here $\lambda$, of the population
2. Generate (simulate) one realization of a point pattern under CSR
3. Compute the simulated average $d_{min}$ value from that realization
4. Repeat steps (2) and (3), say, L = 000 times to obtain L simulated average $d_{min}$ values
5. Histogram of the L simulated average $d_{min}$ values = sampling distribution of mean ENE distance under the null hypothesis
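
The simulation recipe for the sampling distribution of the mean ENE distance can be sketched as below. The sketch assumes a Binomial CSR process with n = 50 events on a hypothetical 100 x 100 square and 200 simulations; the function names are illustrative, and the brute-force nearest-neighbor search is chosen for clarity rather than speed.

```python
import math
import random


def mean_nn_distance(events):
    """Mean event-to-nearest-event (ENE) distance of a point pattern."""
    total = 0.0
    for i, (xi, yi) in enumerate(events):
        total += min(
            math.hypot(xi - xj, yi - yj)
            for j, (xj, yj) in enumerate(events)
            if j != i
        )
    return total / len(events)


def csr_sampling_distribution(n, width, height, n_sims, rng):
    """Simulated mean ENE distances under CSR (Binomial process with n events)."""
    values = []
    for _ in range(n_sims):
        pattern = [(rng.uniform(0, width), rng.uniform(0, height)) for _ in range(n)]
        values.append(mean_nn_distance(pattern))
    return values


rng = random.Random(1)
simulated_means = csr_sampling_distribution(50, 100.0, 100.0, 200, rng)
print("smallest simulated mean ENE:", min(simulated_means))
print("largest simulated mean ENE: ", max(simulated_means))
```

A histogram of `simulated_means` is then the simulated sampling distribution of the statistic under the null hypothesis.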

Sampling Distribution of a Statistic Under CSR (II)
[Figure: two realizations of a Binomial point process with n = 50 events, and the sampling distribution (histogram) of average $d_{min}$ values from 500 simulated (under CSR) point patterns, each having n = 50 events]

Sampling Distribution of a Statistic Under CSR (III)
[Figure: two realizations of a Binomial point process with n = 00 events, and the sampling distribution (histogram) of average $d_{min}$ values from 500 simulated (under CSR) point patterns, each having n = 00 events]

Looking at Observed Point Patterns (I)
[Figure: sampling distribution of average $d_{min}$ values under CSR, and two observed point patterns with n = 00 events]
Question: Could these two point patterns be realizations under CSR?
Answer: No, and this can be said with great confidence; the pattern on the left (right) has a much larger (smaller) mean ENE distance than expected under CSR

Looking at Observed Point Patterns (II)
[Figure: observed point pattern with n = 00 events, and sampling distribution of average $d_{min}$ under CSR]
Question: Is the observed point pattern more clustered than a CSR-generated one?
Answer: Most probably no, since the observed average $d_{min}$ = 5.8 (black vertical bar) lies at the center of the sampling distribution of average $d_{min}$ values under CSR

Looking at Observed Point Patterns (III)
[Figure: observed point pattern with n = 00 events, and sampling distribution of average $d_{min}$ under CSR]
Question: Is this pattern more clustered than a CSR-generated one?
Equivalent question: Since small average $d_{min}$ values indicate clustering, what is the area to the left of the observed average $d_{min}$ on the sampling distribution under CSR?
Answer: The area under the curve of the sampling distribution to the left of the observed average $d_{min}$ = 4.65 is an indication of how unlikely the observed pattern is to be generated by CSR: the smaller that area, the more unlikely the pattern is to be a realization under CSR.
NOTE: if we were asking whether the observed point pattern was more even (less clustered) than a CSR-generated one, we would be looking at the area under the curve to the right of 4.65, since we would be interested in larger (than CSR-related) such distance values

P-Value of An Observed Sample Statistic
- P-value: Area under the curve of the sampling distribution in the direction of the alternative hypothesis from the observed statistic = probability of observing a statistic at least that extreme by chance (i.e., under the null hypothesis). Here, the probability of an average $d_{min}$ value $\le$ 4.65
- Direction dependence in defining the P-value comes into play for one-sided tests; when we are just interested in whether the null hypothesis holds or not, no matter the direction of the alternative hypothesis (two-sided test), the final P-value is defined as twice the above P-value (for a symmetric sampling distribution)
- Interpretation: The P-value is a measure of how unlikely the observed pattern is to be generated by the null hypothesis: the smaller the P-value, the more unlikely the pattern is to be a realization under the null hypothesis, here CSR
- Any P-value is associated with a null hypothesis, since a P-value is computed from a sampling distribution which in turn is generated under a null hypothesis; here, the null hypothesis involves a spatial point process model (CSR) and some fixed quantities, i.e., # of events and the particular domain (with its boundaries)
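
As a small worked illustration of this definition, the P-value can be approximated by the proportion of simulated statistics lying in the direction of the alternative hypothesis. The function name, the labels "clustered", "regular" and "two-sided", and the toy numbers are assumptions made for this sketch; the two-sided rule simply doubles the smaller one-sided value, as described for a symmetric sampling distribution.

```python
def monte_carlo_p_value(simulated, observed, alternative="clustered"):
    """Proportion of simulated statistics at least as extreme as the observed one.

    alternative="clustered": area to the left  (small mean ENE distances)
    alternative="regular":   area to the right (large mean ENE distances)
    alternative="two-sided": twice the smaller one-sided area, capped at 1
    """
    n_sims = len(simulated)
    left = sum(value <= observed for value in simulated) / n_sims
    right = sum(value >= observed for value in simulated) / n_sims
    if alternative == "clustered":
        return left
    if alternative == "regular":
        return right
    if alternative == "two-sided":
        return min(1.0, 2.0 * min(left, right))
    raise ValueError("unknown alternative: " + alternative)


# Toy sampling distribution: ten simulated mean ENE values (made-up numbers).
toy_simulated = [float(v) for v in range(1, 11)]
p_clustered = monte_carlo_p_value(toy_simulated, 2.0, "clustered")
p_regular = monte_carlo_p_value(toy_simulated, 2.0, "regular")
p_two_sided = monte_carlo_p_value(toy_simulated, 2.0, "two-sided")
print(p_clustered, p_regular, p_two_sided)
```

In practice `simulated` would hold the L simulated average $d_{min}$ values and `observed` the value computed from the observed pattern.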

Sampling Distribution of G Function Under CSR
- Interpretation: Plots provide the envelope of simulated minimum and maximum G(d) curves under the null hypothesis of CSR, for a given overall intensity computed as $n/|D|$, hence tied to the # of events considered and the particular domain. The larger n is (more events in the domain), the tighter that envelope.
- Link to hypothesis testing: To assess whether an observed point pattern can be regarded as a realization from a CSR null process, evaluate the relative position (within that envelope) of the observed G(d) curve

Testing Observed Ghat Plots Against CSR (I)
[Figure: two observed point patterns with n = 00 events]
Question: Could these two point patterns be realizations under CSR?
Answer: Most probably no, since the observed G(d) curve lies outside the simulation envelope

Testing Observed Ghat Plots Against CSR (II)
[Figure: observed point pattern with n = 00 events]
Question: Could this point pattern be a realization under CSR?
Answer: Most probably yes, since the observed G(d) curve lies very close to the mean simulated plot, and is well within the simulation envelope

Analytically-Derived Sampling Distributions
Concept
- For simple domains, e.g., rectangles, there exist mathematical formulae that provide the expected values of sample statistics under CSR; in other words, people have already calculated the mean of a very large number of simulated average $d_{min}$ or G(d) values under CSR, without ever touching a computer
- These formulae were derived before the advent of powerful computers, and have been used for a long time in point pattern analysis
- Since no simulation runs are involved, such analytically-derived formulae can be easily used without resorting to computer-intensive simulation procedures
Limitations
- Analytically-derived formulae need to account for the fact that events near the boundary of the study region do not have the same number of neighbors as events in the middle of that region
- Such edge effects can be taken care of when the study region has simple geometry, e.g., for rectangles
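
The envelope test for the G function can be sketched as follows. Everything specific here is an assumption for illustration: the 100 x 100 square domain, n = 100 events, 99 simulations, the distance cutoffs, and the evenly spaced grid used as the "observed" pattern. No edge correction is applied.

```python
import math
import random


def nn_distances(events):
    """Distance from each event to its nearest neighboring event."""
    return [
        min(math.hypot(xi - xj, yi - yj)
            for j, (xj, yj) in enumerate(events) if j != i)
        for i, (xi, yi) in enumerate(events)
    ]


def g_hat(events, cutoffs):
    """Empirical G function: proportion of ENE distances no greater than each cutoff."""
    distances = nn_distances(events)
    return [sum(d <= c for d in distances) / len(distances) for c in cutoffs]


def csr_envelope(n, width, height, cutoffs, n_sims, rng):
    """Pointwise minimum and maximum of G curves simulated under CSR."""
    curves = []
    for _ in range(n_sims):
        pattern = [(rng.uniform(0, width), rng.uniform(0, height)) for _ in range(n)]
        curves.append(g_hat(pattern, cutoffs))
    lower = [min(curve[k] for curve in curves) for k in range(len(cutoffs))]
    upper = [max(curve[k] for curve in curves) for k in range(len(cutoffs))]
    return lower, upper


def outside_envelope(observed_curve, lower, upper):
    """True if the observed G curve leaves the simulation envelope at any cutoff."""
    return any(g < lo or g > hi for g, lo, hi in zip(observed_curve, lower, upper))


rng = random.Random(7)
cutoffs = [1.0, 3.0, 5.0, 7.0, 9.0]
lower, upper = csr_envelope(100, 100.0, 100.0, cutoffs, 99, rng)

# Hypothetical "observed" pattern: 100 evenly spaced events (spacing 10).
grid = [(5.0 + 10.0 * i, 5.0 + 10.0 * j) for i in range(10) for j in range(10)]
grid_curve = g_hat(grid, cutoffs)
print("grid leaves the CSR envelope:", outside_envelope(grid_curve, lower, upper))
```

For the evenly spaced grid every ENE distance equals 10, so its G curve stays at 0 for all cutoffs below 10 and falls below the simulated envelope.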

CSR-Expected Mean Nearest Neighbor Distance
Definition
- Average of all n ENE values: $\bar{d}_{min} = \frac{1}{n} \sum_{i=1}^{n} d_{min}(u_i)$
- Note that a single number does not suffice to describe a point pattern
Checking for CSR
1. Compute the expected value of the mean nearest-neighbor distance under CSR: $E\{\bar{d}_{min}\} = \frac{1}{2\sqrt{\lambda}}$
2. Form the ratio R: $R = \bar{d}_{min} / E\{\bar{d}_{min}\} = 2 \bar{d}_{min} \sqrt{\lambda}$
Interpretation:
- R < 1: observed nearest neighbor distances are shorter than expected, i.e., a tendency towards clustering
- R > 1: a tendency towards evenly spaced events
- The result depends heavily upon the study area definition (used to compute $\lambda$)

CSR-Expected G and F Functions
- G function definition: Proportion of event-to-nearest-event distances $d_{min}(u_i)$ no greater than a given distance cutoff d, i.e., the cumulative distribution function (CDF) of all n event-to-nearest-event distances: $\hat{G}(d) = \frac{\#\{d_{min}(u_i) \le d,\ i = 1, \ldots, n\}}{n}$
- F function definition: Proportion of point-to-nearest-event distances $d_{min}(t_p)$ no greater than a given distance cutoff d, i.e., the CDF of all m point-to-nearest-event distances: $\hat{F}(d) = \frac{\#\{d_{min}(t_p) \le d,\ p = 1, \ldots, m\}}{m}$
- Expected G and F functions under CSR, for relatively small distances to avoid edge effects: $E\{G(d)\} = E\{F(d)\} = 1 - \exp(-\lambda \pi d^2)$
- Checking for CSR: compare the empirical functions $\hat{G}(d)$ and $\hat{F}(d)$ with their theoretical counterparts E{G(d)} and E{F(d)} under CSR
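
A short Python sketch of the ratio R and of the CSR-expected G function is given below. It assumes the standard CSR results $E\{\bar{d}_{min}\} = 1/(2\sqrt{\lambda})$ and $E\{G(d)\} = 1 - \exp(-\lambda \pi d^2)$ with $\lambda = n/|D|$ and no edge correction; the function names and the two test patterns (an evenly spaced grid and tight pairs of events) are hypothetical.

```python
import math


def mean_nn_distance(events):
    """Mean event-to-nearest-event distance."""
    total = 0.0
    for i, (xi, yi) in enumerate(events):
        total += min(math.hypot(xi - xj, yi - yj)
                     for j, (xj, yj) in enumerate(events) if j != i)
    return total / len(events)


def nearest_neighbor_ratio(events, area):
    """R = observed mean ENE distance / CSR-expected mean ENE distance."""
    intensity = len(events) / area               # lambda = n / |D|
    expected = 1.0 / (2.0 * math.sqrt(intensity))
    return mean_nn_distance(events) / expected


def expected_g(d, intensity):
    """CSR-expected G (and F) function, ignoring edge effects."""
    return 1.0 - math.exp(-intensity * math.pi * d ** 2)


area = 100.0 * 100.0
# Evenly spaced pattern: 100 events on a grid with spacing 10.
grid = [(5.0 + 10.0 * i, 5.0 + 10.0 * j) for i in range(10) for j in range(10)]
# Clustered pattern: 25 tight pairs of events, the two members 1 unit apart.
pairs = [(10.0 + 20.0 * i + dx, 10.0 + 20.0 * j)
         for i in range(5) for j in range(5) for dx in (0.0, 1.0)]

r_grid = nearest_neighbor_ratio(grid, area)
r_pairs = nearest_neighbor_ratio(pairs, area)
print("R for grid: ", r_grid)    # R > 1, evenly spaced
print("R for pairs:", r_pairs)   # R < 1, clustered
print("E{G(5)} at intensity 0.01:", expected_g(5.0, 0.01))
```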

Examples of Observed and CSR-Expected G Functions
[Figure: observed and CSR-expected G functions]

Examples of Observed and CSR-Expected F Functions
[Figure: observed and CSR-expected F functions]

Example with Evenly Spaced Events
[Figure: example with evenly spaced events]

The K Function
1. Construct a set of concentric circles (of increasing radius d) around each event
2. Compute the # of events in each distance band, excluding the event at the center
3. The cumulative number of events up to radius d around all events becomes the sample K function K(d)

CSR-Expected K Function
K(d) & L(d) functions under CSR
- $E\{K(d)\} = \pi d^2$; this can become a very large number (due to $d^2$), and consequently small differences between K(d) and E{K(d)} cannot be easily resolved
- Use the L function instead: $L(d) = \sqrt{K(d)/\pi} - d$, with E{L(d)} = 0

Interpreting the L function
- L(d) > 0 implies clustering
- L(d) < 0 implies stratification
- Watch out for edge effects
- Reality tends to be patchy
- Can we use Monte Carlo simulations instead of edge effect corrections?

Examples of L Functions
[Figure: examples of L functions]
L(d) > 0: more events are separated by distance d than expected under CSR, i.e., clustering
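
The K and L functions can be sketched with a naive estimator. The version below assumes $\hat{K}(d) = \frac{|D|}{n^2} \sum_i \sum_{j \ne i} 1(d_{ij} \le d)$ with no edge correction, which is only one of several normalizations in use; the function names and the two example patterns are hypothetical.

```python
import math


def k_hat(events, area, d):
    """Naive sample K function (no edge correction)."""
    n = len(events)
    close_pairs = sum(
        1
        for i, (xi, yi) in enumerate(events)
        for j, (xj, yj) in enumerate(events)
        if i != j and math.hypot(xi - xj, yi - yj) <= d
    )
    return area * close_pairs / (n * n)


def l_hat(events, area, d):
    """L(d) = sqrt(K(d) / pi) - d, which has expectation 0 under CSR."""
    return math.sqrt(k_hat(events, area, d) / math.pi) - d


area = 100.0 * 100.0
# Evenly spaced pattern: no event has a neighbor closer than 10 units.
grid = [(5.0 + 10.0 * i, 5.0 + 10.0 * j) for i in range(10) for j in range(10)]
# Clustered pattern: 25 tight pairs of events, the two members 1 unit apart.
pairs = [(10.0 + 20.0 * i + dx, 10.0 + 20.0 * j)
         for i in range(5) for j in range(5) for dx in (0.0, 1.0)]

l_grid = l_hat(grid, area, 5.0)
l_pairs = l_hat(pairs, area, 2.0)
print("L(5) for grid: ", l_grid)    # negative: fewer close neighbors than CSR
print("L(2) for pairs:", l_pairs)   # positive: more close neighbors than CSR
```

Without an edge correction the estimator is biased downward at larger distances, which is one reason to compare against Monte Carlo simulations on the same domain.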

Other Spatial Point Process Models
Heterogeneous with no second-order effects
- Heterogeneous Poisson process: the intensity is made spatially varying, $\lambda(u)$, and could be linked to covariates. Simulation proceeds by generating events from a homogeneous Poisson process with intensity $\lambda_{max} = \max\{\lambda(u)\}$, and then independently keeping an event at u with probability $\lambda(u)/\lambda_{max}$
- Cox process: spatially varying intensity $\lambda(u)$ in a non-deterministic way (doubly stochastic process); a field of $\lambda(u)$-values is first simulated, and then simulation proceeds as in the heterogeneous Poisson model
Homogeneous with second-order effects
- Poisson cluster process:
  i) Simulate centroids of parent events from a homogeneous Poisson process
  ii) Associate a simulated number of offspring with each parent centroid
  iii) Simulate the locations of offspring around each parent centroid according to some bivariate PDF
  iv) Keep only the locations of the offspring as the final simulated point pattern
There also exist processes with both first- and second-order effects, e.g., the inhomogeneous Poisson cluster process

Recap (I)
Confirmatory analysis of spatial point patterns
- Allows us to quantify the departure of results obtained via exploratory tools, e.g., average $d_{min}$ or G(d), from expected results derived under a specific null hypothesis (here CSR)
- Can be used to assess to what extent observed point patterns can be regarded as realizations from a particular spatial process (here CSR)
- CSR involves: (i) a constant intensity and (ii) no event-to-event interaction
Sampling distribution of a test statistic
- Lies at the heart of any statistical hypothesis testing procedure, and is tied to a particular null hypothesis (and a particular study domain)
- Simulation and analytical derivations are two alternative ways of computing such sampling distributions (the latter being increasingly replaced by the former)
- Watch out for edge effects when using analytically derived sampling distributions
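
The thinning procedure for the heterogeneous Poisson process can be sketched as follows. The intensity surface used here (increasing linearly from 0 at the left edge to a maximum of 0.05 at the right edge of a 100 x 100 square) is purely hypothetical, as are the function and constant names.

```python
import math
import random

WIDTH, HEIGHT = 100.0, 100.0
LAMBDA_MAX = 0.05  # hypothetical maximum intensity over the domain


def poisson_draw(mean, rng):
    """Draw one Poisson count with Knuth's multiplication method."""
    limit = math.exp(-mean)
    count, product = 0, rng.random()
    while product > limit:
        count += 1
        product *= rng.random()
    return count


def intensity(x, y):
    """Hypothetical intensity surface: grows linearly with x, constant in y."""
    return LAMBDA_MAX * x / WIDTH


def simulate_heterogeneous(rng):
    """Thinning: simulate at intensity LAMBDA_MAX, keep each event with prob lambda(u)/LAMBDA_MAX."""
    n_candidates = poisson_draw(LAMBDA_MAX * WIDTH * HEIGHT, rng)
    kept = []
    for _ in range(n_candidates):
        x, y = rng.uniform(0.0, WIDTH), rng.uniform(0.0, HEIGHT)
        if rng.random() < intensity(x, y) / LAMBDA_MAX:
            kept.append((x, y))
    return kept


rng = random.Random(3)
events = simulate_heterogeneous(rng)
left_half = sum(1 for x, _ in events if x < WIDTH / 2)
right_half = len(events) - left_half
print("events in left half: ", left_half)
print("events in right half:", right_half)  # the high-intensity side holds more events
```

A Cox process could reuse the same thinning step after first simulating the intensity surface itself.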

Recap (II)
More interesting spatial point process models
- Heterogeneous Poisson process, Cox process, Poisson cluster process
- Note: It is almost impossible to assess whether an observed point pattern (a single realization from a hypothesized point process) stems from a process with only first- or only second-order effects or a combination thereof; different processes could yield indistinguishable realizations under certain parameter combinations (equi-finality)
Parameter estimation?
- In practice, we are most often dealing with the problem of estimating the parameters of a spatial point process model from data, i.e., from an observed spatial point pattern. This is an inverse problem, as opposed to the forward problem of generating patterns from processes.
- The inverse problem, however, is under-determined, mostly because we only have one realization (the observed pattern) from a hypothesized process
[Figure: "Schrodinger's Box": the forward problem leads from the data-generating process to the data/map; the inverse problem runs in the opposite direction]
Intensity Analysis of Spatial Point Patterns Geog 210C Introduction to Spatial Data Analysis
Intensity Analysis of Spatial Point Patterns Geog 210C Introduction to Spatial Data Analysis Chris Funk Lecture 5 Topic Overview 1) Introduction/Unvariate Statistics 2) Bootstrapping/Monte Carlo Simulation/Kernel
More informationGIST 4302/5302: Spatial Analysis and Modeling Point Pattern Analysis
GIST 4302/5302: Spatial Analysis and Modeling Point Pattern Analysis Guofeng Cao www.spatial.ttu.edu Department of Geosciences Texas Tech University guofeng.cao@ttu.edu Fall 2018 Spatial Point Patterns
More informationInteraction Analysis of Spatial Point Patterns
Interaction Analysis of Spatial Point Patterns Geog 2C Introduction to Spatial Data Analysis Phaedon C Kyriakidis wwwgeogucsbedu/ phaedon Department of Geography University of California Santa Barbara
More informationIntensity Analysis of Spatial Point Patterns Geog 210C Introduction to Spatial Data Analysis
Intensity Analysis of Spatial Point Patterns Geog 210C Introduction to Spatial Data Analysis Chris Funk Lecture 4 Spatial Point Patterns Definition Set of point locations with recorded events" within study
More informationSpatial Point Pattern Analysis
Spatial Point Pattern Analysis Jiquan Chen Prof of Ecology, University of Toledo EEES698/MATH5798, UT Point variables in nature A point process is a discrete stochastic process of which the underlying
More informationChapter 6 Spatial Analysis
6.1 Introduction Chapter 6 Spatial Analysis Spatial analysis, in a narrow sense, is a set of mathematical (and usually statistical) tools used to find order and patterns in spatial phenomena. Spatial patterns
More informationSpatial Analysis I. Spatial data analysis Spatial analysis and inference
Spatial Analysis I Spatial data analysis Spatial analysis and inference Roadmap Outline: What is spatial analysis? Spatial Joins Step 1: Analysis of attributes Step 2: Preparing for analyses: working with
More informationPoint Pattern Analysis
Point Pattern Analysis Nearest Neighbor Statistics Luc Anselin http://spatial.uchicago.edu principle G function F function J function Principle Terminology events and points event: observed location of
More informationSimulation. Where real stuff starts
1 Simulation Where real stuff starts ToC 1. What is a simulation? 2. Accuracy of output 3. Random Number Generators 4. How to sample 5. Monte Carlo 6. Bootstrap 2 1. What is a simulation? 3 What is a simulation?
More informationLab #3 Background Material Quantifying Point and Gradient Patterns
Lab #3 Background Material Quantifying Point and Gradient Patterns Dispersion metrics Dispersion indices that measure the degree of non-randomness Plot-based metrics Distance-based metrics First-order
More informationPoints. Luc Anselin. Copyright 2017 by Luc Anselin, All Rights Reserved
Points Luc Anselin http://spatial.uchicago.edu 1 classic point pattern analysis spatial randomness intensity distance-based statistics points on networks 2 Classic Point Pattern Analysis 3 Classic Examples
More informationPractical Statistics
Practical Statistics Lecture 1 (Nov. 9): - Correlation - Hypothesis Testing Lecture 2 (Nov. 16): - Error Estimation - Bayesian Analysis - Rejecting Outliers Lecture 3 (Nov. 18) - Monte Carlo Modeling -
More informationMath Review Sheet, Fall 2008
1 Descriptive Statistics Math 3070-5 Review Sheet, Fall 2008 First we need to know about the relationship among Population Samples Objects The distribution of the population can be given in one of the
More informationOverview of Statistical Analysis of Spatial Data
Overview of Statistical Analysis of Spatial Data Geog 2C Introduction to Spatial Data Analysis Phaedon C. Kyriakidis www.geog.ucsb.edu/ phaedon Department of Geography University of California Santa Barbara
More informationDover- Sherborn High School Mathematics Curriculum Probability and Statistics
Mathematics Curriculum A. DESCRIPTION This is a full year courses designed to introduce students to the basic elements of statistics and probability. Emphasis is placed on understanding terminology and
More informationSimulation. Where real stuff starts
Simulation Where real stuff starts March 2019 1 ToC 1. What is a simulation? 2. Accuracy of output 3. Random Number Generators 4. How to sample 5. Monte Carlo 6. Bootstrap 2 1. What is a simulation? 3
More informationA Spatio-Temporal Point Process Model for Firemen Demand in Twente
University of Twente A Spatio-Temporal Point Process Model for Firemen Demand in Twente Bachelor Thesis Author: Mike Wendels Supervisor: prof. dr. M.N.M. van Lieshout Stochastic Operations Research Applied
More informationGlossary. The ISI glossary of statistical terms provides definitions in a number of different languages:
Glossary The ISI glossary of statistical terms provides definitions in a number of different languages: http://isi.cbs.nl/glossary/index.htm Adjusted r 2 Adjusted R squared measures the proportion of the
More informationData Analysis I. Dr Martin Hendry, Dept of Physics and Astronomy University of Glasgow, UK. 10 lectures, beginning October 2006
Astronomical p( y x, I) p( x, I) p ( x y, I) = p( y, I) Data Analysis I Dr Martin Hendry, Dept of Physics and Astronomy University of Glasgow, UK 10 lectures, beginning October 2006 4. Monte Carlo Methods
More information6. Spatial analysis of multivariate ecological data
Université Laval Analyse multivariable - mars-avril 2008 1 6. Spatial analysis of multivariate ecological data 6.1 Introduction 6.1.1 Conceptual importance Ecological models have long assumed, for simplicity,
More informationLecture 26 Section 8.4. Wed, Oct 14, 2009
PDFs n = Lecture 26 Section 8.4 Hampden-Sydney College Wed, Oct 14, 2009 Outline PDFs n = 1 2 PDFs n = 3 4 5 6 Outline PDFs n = 1 2 PDFs n = 3 4 5 6 PDFs n = Exercise 8.12, page 528. Suppose that 60% of
More informationRandom Number Generation. CS1538: Introduction to simulations
Random Number Generation CS1538: Introduction to simulations Random Numbers Stochastic simulations require random data True random data cannot come from an algorithm We must obtain it from some process
More informationCS 543 Page 1 John E. Boon, Jr.
CS 543 Machine Learning Spring 2010 Lecture 05 Evaluating Hypotheses I. Overview A. Given observed accuracy of a hypothesis over a limited sample of data, how well does this estimate its accuracy over
More informationAP Statistics Cumulative AP Exam Study Guide
AP Statistics Cumulative AP Eam Study Guide Chapters & 3 - Graphs Statistics the science of collecting, analyzing, and drawing conclusions from data. Descriptive methods of organizing and summarizing statistics
More informationSo we will instead use the Jacobian method for inferring the PDF of functionally related random variables; see Bertsekas & Tsitsiklis Sec. 4.1.
2011 Page 1 Simulating Gaussian Random Variables Monday, February 14, 2011 2:00 PM Readings: Kloeden and Platen Sec. 1.3 Why does the Box Muller method work? How was it derived? The basic idea involves
More informationSemester , Example Exam 1
Semester 1 2017, Example Exam 1 1 of 10 Instructions The exam consists of 4 questions, 1-4. Each question has four items, a-d. Within each question: Item (a) carries a weight of 8 marks. Item (b) carries
More informationProbability and Stochastic Processes
Probability and Stochastic Processes A Friendly Introduction Electrical and Computer Engineers Third Edition Roy D. Yates Rutgers, The State University of New Jersey David J. Goodman New York University
More informationGIST 4302/5302: Spatial Analysis and Modeling
GIST 4302/5302: Spatial Analysis and Modeling Review Guofeng Cao www.gis.ttu.edu/starlab Department of Geosciences Texas Tech University guofeng.cao@ttu.edu Spring 2016 Course Outlines Spatial Point Pattern
More informationSpatial Autocorrelation
Spatial Autocorrelation Luc Anselin http://spatial.uchicago.edu spatial randomness positive and negative spatial autocorrelation spatial autocorrelation statistics spatial weights Spatial Randomness The
More informationReliability Theory of Dynamically Loaded Structures (cont.)
Outline of Reliability Theory of Dynamically Loaded Structures (cont.) Probability Density Function of Local Maxima in a Stationary Gaussian Process. Distribution of Extreme Values. Monte Carlo Simulation
More informationThe Chi-Square Distributions
MATH 03 The Chi-Square Distributions Dr. Neal, Spring 009 The chi-square distributions can be used in statistics to analyze the standard deviation of a normally distributed measurement and to test the
More informationMonte Carlo Studies. The response in a Monte Carlo study is a random variable.
Monte Carlo Studies The response in a Monte Carlo study is a random variable. The response in a Monte Carlo study has a variance that comes from the variance of the stochastic elements in the data-generating
More informationChapter 1 Statistical Inference
Chapter 1 Statistical Inference causal inference To infer causality, you need a randomized experiment (or a huge observational study and lots of outside information). inference to populations Generalizations
More informationInstitute of Actuaries of India
Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics For 2018 Examinations Subject CT3 Probability and Mathematical Statistics Core Technical Syllabus 1 June 2017 Aim The
More informationModeling and Performance Analysis with Discrete-Event Simulation
Simulation Modeling and Performance Analysis with Discrete-Event Simulation Chapter 9 Input Modeling Contents Data Collection Identifying the Distribution with Data Parameter Estimation Goodness-of-Fit
More informationthe amount of the data corresponding to the subinterval the width of the subinterval e x2 to the left by 5 units results in another PDF g(x) = 1 π
Math 10A with Professor Stankova Worksheet, Discussion #42; Friday, 12/8/2017 GSI name: Roy Zhao Problems 1. For each of the following distributions, derive/find all of the following: PMF/PDF, CDF, median,
More informationPractice Problems Section Problems
Practice Problems Section 4-4-3 4-4 4-5 4-6 4-7 4-8 4-10 Supplemental Problems 4-1 to 4-9 4-13, 14, 15, 17, 19, 0 4-3, 34, 36, 38 4-47, 49, 5, 54, 55 4-59, 60, 63 4-66, 68, 69, 70, 74 4-79, 81, 84 4-85,
More informationMonte Carlo Integration II & Sampling from PDFs
Monte Carlo Integration II & Sampling from PDFs CS295, Spring 2017 Shuang Zhao Computer Science Department University of California, Irvine CS295, Spring 2017 Shuang Zhao 1 Last Lecture Direct illumination
More informationECO220Y Continuous Probability Distributions: Uniform and Triangle Readings: Chapter 9, sections
ECO220Y Continuous Probability Distributions: Uniform and Triangle Readings: Chapter 9, sections 9.8-9.9 Fall 2011 Lecture 8 Part 1 (Fall 2011) Probability Distributions Lecture 8 Part 1 1 / 19 Probability
More informationOikos. Appendix 1 and 2. o20751
Oikos o20751 Rosindell, J. and Cornell, S. J. 2013. Universal scaling of species-abundance distributions across multiple scales. Oikos 122: 1101 1111. Appendix 1 and 2 Universal scaling of species-abundance
More informationB.N.Bandodkar College of Science, Thane. Random-Number Generation. Mrs M.J.Gholba
B.N.Bandodkar College of Science, Thane Random-Number Generation Mrs M.J.Gholba Properties of Random Numbers A sequence of random numbers, R, R,., must have two important statistical properties, uniformity
More informationSIMULATION SEMINAR SERIES INPUT PROBABILITY DISTRIBUTIONS
SIMULATION SEMINAR SERIES INPUT PROBABILITY DISTRIBUTIONS Zeynep F. EREN DOGU PURPOSE & OVERVIEW Stochastic simulations involve random inputs, so produce random outputs too. The quality of the output is
More informationSpatial Clusters of Rates
Spatial Clusters of Rates Luc Anselin http://spatial.uchicago.edu concepts EBI local Moran scan statistics Concepts Rates as Risk from counts (spatially extensive) to rates (spatially intensive) rate =
More informationIf we want to analyze experimental or simulated data we might encounter the following tasks:
Chapter 1 Introduction If we want to analyze experimental or simulated data we might encounter the following tasks: Characterization of the source of the signal and diagnosis Studying dependencies Prediction
More informationRecall the Basics of Hypothesis Testing
Recall the Basics of Hypothesis Testing The level of significance α, (size of test) is defined as the probability of X falling in w (rejecting H 0 ) when H 0 is true: P(X w H 0 ) = α. H 0 TRUE H 1 TRUE
More informationModelling the risk process
Modelling the risk process Krzysztof Burnecki Hugo Steinhaus Center Wroc law University of Technology www.im.pwr.wroc.pl/ hugo Modelling the risk process 1 Risk process If (Ω, F, P) is a probability space
More informationStochastic Processes
Stochastic Processes Stochastic Process Non Formal Definition: Non formal: A stochastic process (random process) is the opposite of a deterministic process such as one defined by a differential equation.
More informationTwo-Sample Inferential Statistics
The t Test for Two Independent Samples 1 Two-Sample Inferential Statistics In an experiment there are two or more conditions One condition is often called the control condition in which the treatment is
More informationMA 575 Linear Models: Cedric E. Ginestet, Boston University Non-parametric Inference, Polynomial Regression Week 9, Lecture 2
MA 575 Linear Models: Cedric E. Ginestet, Boston University Non-parametric Inference, Polynomial Regression Week 9, Lecture 2 1 Bootstrapped Bias and CIs Given a multiple regression model with mean and
More informationTest of Complete Spatial Randomness on Networks
Test of Complete Spatial Randomness on Networks A PROJECT SUBMITTED TO THE FACULTY OF THE GRADUATE SCHOOL OF THE UNIVERSITY OF MINNESOTA BY Xinyue Chang IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE
More informationSTAT Chapter 13: Categorical Data. Recall we have studied binomial data, in which each trial falls into one of 2 categories (success/failure).
STAT 515 -- Chapter 13: Categorical Data Recall we have studied binomial data, in which each trial falls into one of 2 categories (success/failure). Many studies allow for more than 2 categories. Example
More informationMS&E 226: Small Data
MS&E 226: Small Data Lecture 15: Examples of hypothesis tests (v5) Ramesh Johari ramesh.johari@stanford.edu 1 / 32 The recipe 2 / 32 The hypothesis testing recipe In this lecture we repeatedly apply the
More informationChapter 6 Expectation and Conditional Expectation. Lectures Definition 6.1. Two random variables defined on a probability space are said to be
Chapter 6 Expectation and Conditional Expectation Lectures 24-30 In this chapter, we introduce expected value or the mean of a random variable. First we define expectation for discrete random variables
More informationRecap. Probability, stochastic processes, Markov chains. ELEC-C7210 Modeling and analysis of communication networks
Recap Probability, stochastic processes, Markov chains ELEC-C7210 Modeling and analysis of communication networks 1 Recap: Probability theory important distributions Discrete distributions Geometric distribution
Chapter 22. Comparing Two Proportions 1 /29
Chapter 22 Comparing Two Proportions 1 /29 Homework p519 2, 4, 12, 13, 15, 17, 18, 19, 24 2 /29 Objective Students test null and alternate hypothesis about two population proportions. 3 /29 Comparing Two
Monte Carlo Simulation. CWR 6536 Stochastic Subsurface Hydrology
Monte Carlo Simulation CWR 6536 Stochastic Subsurface Hydrology Steps in Monte Carlo Simulation Create input sample space with known distribution, e.g. ensemble of all possible combinations of v, D, q,
Subject CS1 Actuarial Statistics 1 Core Principles
Institute of Actuaries of India Subject CS1 Actuarial Statistics 1 Core Principles For 2019 Examinations Aim The aim of the Actuarial Statistics 1 subject is to provide a grounding in mathematical and
Testing of mark independence for marked point patterns
9th SSIAB Workshop, Avignon - May 9-11, 2012 Testing of mark independence for marked point patterns Mari Myllymäki Department of Biomedical Engineering and Computational Science Aalto University mari.myllymaki@aalto.fi
Learning Objectives for Stat 225
Learning Objectives for Stat 225 08/20/12 Introduction to Probability: Get some general ideas about probability, and learn how to use sample space to compute the probability of a specific event. Set Theory:
Fundamentals of Applied Probability and Random Processes
Fundamentals of Applied Probability and Random Processes, 2nd Edition Oliver C. Ibe University of Massachusetts, Lowell, Massachusetts AMSTERDAM BOSTON HEIDELBERG LONDON NEW YORK OXFORD PARIS
Review. DS GA 1002 Statistical and Mathematical Models. Carlos Fernandez-Granda
Review DS GA 1002 Statistical and Mathematical Models http://www.cims.nyu.edu/~cfgranda/pages/dsga1002_fall16 Carlos Fernandez-Granda Probability and statistics Probability: Framework for dealing with
Statistical Data Analysis
DS-GA 1002 Lecture notes 8 Fall 2016 1 Descriptive statistics Statistical Data Analysis In this section we consider the problem of analyzing a set of data. We describe several techniques for visualizing the
ENGRG Introduction to GIS
ENGRG 59910 Introduction to GIS Michael Piasecki October 13, 2017 Lecture 06: Spatial Analysis Outline Today Concepts What is spatial interpolation Why is necessary Sample of interpolation (size and pattern)
LECTURE 5. Introduction to Econometrics. Hypothesis testing
LECTURE 5 Introduction to Econometrics Hypothesis testing October 18, 2016 1 / 26 ON TODAY'S LECTURE We are going to discuss how hypotheses about coefficients can be tested in regression models We will
Chapter 22. Comparing Two Proportions 1 /30
Chapter 22 Comparing Two Proportions 1 /30 Homework p519 2, 4, 12, 13, 15, 17, 18, 19, 24 2 /30 3 /30 Objective Students test null and alternate hypothesis about two population proportions. 4 /30 Comparing
BTRY 4830/6830: Quantitative Genomics and Genetics Fall 2014
BTRY 4830/6830: Quantitative Genomics and Genetics Fall 2014 Homework 4 (version 3) - posted October 3 Assigned October 2; Due 11:59PM October 9 Problem 1 (Easy) a. For the genetic regression model: Y
Chapter 4: Monte Carlo Methods. Paisan Nakmahachalasint
Chapter 4: Monte Carlo Methods Paisan Nakmahachalasint Introduction Monte Carlo Methods are a class of computational algorithms that rely on repeated random sampling to compute their results. Monte Carlo
Universitat Autònoma de Barcelona Facultat de Filosofia i Lletres Departament de Prehistòria Doctorat en arqueologia prehistòrica
Universitat Autònoma de Barcelona Facultat de Filosofia i Lletres Departament de Prehistòria Doctorat en arqueologia prehistòrica FROM MICRO TO MACRO SPATIAL DYNAMICS IN THE VILLAGGIO DELLE MACINE BETWEEN
16 : Markov Chain Monte Carlo (MCMC)
10-708: Probabilistic Graphical Models 10-708, Spring 2014 16 : Markov Chain Monte Carlo MCMC Lecturer: Matthew Gormley Scribes: Yining Wang, Renato Negrinho 1 Sampling from low-dimensional distributions
Types of spatial data. The Nature of Geographic Data. Types of spatial data. Spatial Autocorrelation. Continuous spatial data: geostatistics
The Nature of Geographic Data Types of spatial data Continuous spatial data: geostatistics Samples may be taken at intervals, but the spatial process is continuous e.g. soil quality Discrete data Irregular:
6 Single Sample Methods for a Location Parameter
6 Single Sample Methods for a Location Parameter If there are serious departures from parametric test assumptions (e.g., normality or symmetry), nonparametric tests on a measure of central tendency (usually
Introduction. Spatial Processes & Spatial Patterns
Introduction Spatial data: set of geo-referenced attribute measurements: each measurement is associated with a location (point) or an entity (area/region/object) in geographical (or other) space; the domain
Chapter The McGraw-Hill Companies, Inc. All rights reserved.
Chapter 15 Chi-Square Tests: Chi-Square Tests for Goodness-of-Fit, Uniform Goodness-of-Fit, Poisson Goodness-of-Fit, ECDF Tests (Optional). Contingency Tables A contingency table is a cross-tabulation of n paired observations
Northwestern University Department of Electrical Engineering and Computer Science
Northwestern University Department of Electrical Engineering and Computer Science EECS 454: Modeling and Analysis of Communication Networks Spring 2008 Probability Review As discussed in Lecture 1, probability
Inferential Statistics
Inferential Statistics Part 1 Sampling Distributions, Point Estimates & Confidence Intervals Inferential statistics are used to draw inferences (make conclusions/judgements) about a population from a sample.
Stochastic Simulation
Stochastic Simulation APPM 7400 Lesson 11: Spatial Poisson Processes October 3, 2018 Lesson 11: Spatial Poisson Processes Stochastic Simulation October 3, 2018 1 / 24 Consider a spatial configuration of
The multigroup Monte Carlo method part 1
The multigroup Monte Carlo method part 1 Alain Hébert alain.hebert@polymtl.ca Institut de génie nucléaire École Polytechnique de Montréal ENE6103: Week 11 The multigroup Monte Carlo method part 1 1/23
Hierarchical Modeling and Analysis for Spatial Data
Hierarchical Modeling and Analysis for Spatial Data Bradley P. Carlin, Sudipto Banerjee, and Alan E. Gelfand brad@biostat.umn.edu, sudiptob@biostat.umn.edu, and alan@stat.duke.edu University of Minnesota
Principal Component Analysis-I Geog 210C Introduction to Spatial Data Analysis. Chris Funk. Lecture 17
Principal Component Analysis-I Geog 210C Introduction to Spatial Data Analysis Chris Funk Lecture 17 Outline Filters and Rotations Generating co-varying random fields Translating co-varying fields into
BIOL 51A - Biostatistics 1. Lecture 1: Intro to Biostatistics
BIOL 51A - Biostatistics 1. Lecture 1: Intro to Biostatistics. [Figure: "Smoking: hazardous?"; FEV (l) by Smoke (No/Yes)] Box Plot, a.k.a. box-and-whisker diagram or candlestick chart
Lecture 4: Testing Stuff
Lecture 4: Testing Stuff. Testing hypotheses usually has three steps: a. First specify a null hypothesis, usually denoted H_0, which describes a model of interest. Usually, we express H_0 as a restricted
Dr. Maddah ENMG 617 EM Statistics 10/15/12. Nonparametric Statistics (2) (Goodness of fit tests)
Dr. Maddah ENMG 617 EM Statistics 10/15/12 Nonparametric Statistics (2) (Goodness of fit tests) Introduction Probability models used in decision making (Operations Research) and other fields require fitting
UNIT 5: Random number generation And Variation Generation
UNIT 5:Random number generation And Variation Generation RANDOM-NUMBER GENERATION Random numbers are a necessary basic ingredient in the simulation of almost all discrete systems. Most computer languages
CHAPTER 21: TIME SERIES ECONOMETRICS: SOME BASIC CONCEPTS
CHAPTER 21: TIME SERIES ECONOMETRICS: SOME BASIC CONCEPTS 21.1 A stochastic process is said to be weakly stationary if its mean and variance are constant over time and if the value of the covariance between
The Chi-Square Distributions
MATH 183 The Chi-Square Distributions Dr. Neal, WKU The chi-square distributions can be used in statistics to analyze the standard deviation σ of a normally distributed measurement and to test the goodness
Why Is It There? Attribute Data Describe with statistics Analyze with hypothesis testing Spatial Data Describe with maps Analyze with spatial analysis
6 Why Is It There? Why Is It There? Getting Started with Geographic Information Systems Chapter 6 6.1 Describing Attributes 6.2 Statistical Analysis 6.3 Spatial Description 6.4 Spatial Analysis 6.5 Searching
Lecture 6. Probability events. Definition 1. The sample space, S, of a. probability experiment is the collection of all
Lecture 6 1 Lecture 6 Probability events Definition 1. The sample space, S, of a probability experiment is the collection of all possible outcomes of an experiment. One such outcome is called a simple
Questions 3.83, 6.11, 6.12, 6.17, 6.25, 6.29, 6.33, 6.35, 6.50, 6.51, 6.53, 6.55, 6.59, 6.60, 6.65, 6.69, 6.70, 6.77, 6.79, 6.89, 6.
Chapter 7 Reading 7.1, 7.2 Questions 3.83, 6.11, 6.12, 6.17, 6.25, 6.29, 6.33, 6.35, 6.50, 6.51, 6.53, 6.55, 6.59, 6.60, 6.65, 6.69, 6.70, 6.77, 6.79, 6.89, 6.112 Introduction In Chapter 5 and 6, we emphasized
Statistical Data Analysis Stat 3: p-values, parameter estimation
Statistical Data Analysis Stat 3: p-values, parameter estimation London Postgraduate Lectures on Particle Physics; University of London MSci course PH4515 Glen Cowan Physics Department Royal Holloway,
Math 494: Mathematical Statistics
Math 494: Mathematical Statistics Instructor: Jimin Ding jmding@wustl.edu Department of Mathematics Washington University in St. Louis Class materials are available on course website (www.math.wustl.edu/
CONDUCTING INFERENCE ON RIPLEY'S K-FUNCTION OF SPATIAL POINT PROCESSES WITH APPLICATIONS
CONDUCTING INFERENCE ON RIPLEY'S K-FUNCTION OF SPATIAL POINT PROCESSES WITH APPLICATIONS By MICHAEL ALLEN HYMAN A DISSERTATION PRESENTED TO THE GRADUATE SCHOOL OF THE UNIVERSITY OF FLORIDA IN PARTIAL FULFILLMENT
Introduction to Statistics and Error Analysis II
Introduction to Statistics and Error Analysis II Physics116C, 4/14/06 D. Pellett References: Data Reduction and Error Analysis for the Physical Sciences by Bevington and Robinson Particle Data Group notes
Modeling Uncertainty in the Earth Sciences Jef Caers Stanford University
Probability theory and statistical analysis: a review Modeling Uncertainty in the Earth Sciences Jef Caers Stanford University Concepts assumed known Histograms, mean, median, spread, quantiles Probability,
Spatial point processes
Mathematical sciences Chalmers University of Technology and University of Gothenburg Gothenburg, Sweden June 25, 2014 Definition A point process N is a stochastic mechanism or rule to produce point patterns
Lecture 3: Mixture Models for Microbiome data
Lecture 3: Mixture Models for Microbiome data 1 Lecture 3: Mixture Models for Microbiome data Outline: - Mixture Models (Negative Binomial) - DESeq2 / Don't Rarefy. Ever. 2 Hypothesis Tests - reminder
1. Exploratory Data Analysis
1. Exploratory Data Analysis 1.1 Methods of Displaying Data A visual display aids understanding and can highlight features which may be worth exploring more formally. Displays should have impact and be
Lecture 4: Statistical Hypothesis Testing
EAS31136/B9036: Statistics in Earth & Atmospheric Sciences Lecture 4: Statistical Hypothesis Testing Instructor: Prof. Johnny Luo www.sci.ccny.cuny.edu/~luo Dates Topic Reading (Based on the 2 nd Edition
Lecture 5: Sampling Methods
Lecture 5: Sampling Methods What is sampling? Is the process of selecting part of a larger group of participants with the intent of generalizing the results from the smaller group, called the sample, to
* Tuesday 17 January :30-16:30 (2 hours) Recorded on ESSE3 General introduction to the course.
Name of the course Statistical methods and data analysis Audience The course is intended for students of the first or second year of the Graduate School in Materials Engineering. The aim of the course
$Y_i = \eta + \epsilon_i, \quad i = 1, \dots, n.$
Nonparametric tests If data do not come from a normal population (and if the sample is not large), we cannot use a t-test. One useful approach to creating test statistics is through the use of rank statistics.