Data analysis and Geostatistics - lecture XI
|
|
- Ferdinand Maxwell
- 6 years ago
- Views:
Transcription
1 Data analysis and Geostatistics - lecture XI Clustering and spatial analysis of data Cluster analysis Group samples into clusters based on similarity Cluster analysis requires substantial user input (selection of number of clusters, clustering routine, similarity criteria, etc) and results can therefore be ambiguous: always give detailed information on how your cluster analysis was performed
2 Cluster analysis Group samples into clusters based on similarity cluster mean A whichever deviation between sample and cluster mean is smallest: assigned to that cluster cluster mean B _ (i-b) _ (i-c) _ (i-a) cluster mean C Cluster analysis is again controlled by the sum of squares: SSwithin SSwithin SSwithin SSbetween A-B SSbetween B-C SSbetween A-C small variance within: tight clusters large variance between: good separation increasing the number of clusters will decrease the within variance, until all samples are their own cluster. That result is however meaningless... Cluster analysis - sample assignment criteria range of techniques that can be used to determine similarity cluster mean B cluster mean C cluster mean A Wide range of techniques - see book for details Euclidian distance - r or r 2 city block of Manhattan distance - this is useful when the two variables are separate characteristics (fossil length and width, the diagonal is not of interest) correlation similarity - sample with the same correlation are grouped together: deals with dilution effects association values - especially useful when you have only presence/absence data - specialized
3 Cluster analysis - two types Two varieties of clustering: hierarchical and partitioning methods hierarchical techniques: represent similarity in a tree or dendrogram the method: 1. all samples are a separate cluster 2. link the two most similar samples 3. link two other samples to form a new cluster or add a third sample to the first cluster depending on similarities 4. continue until only one cluster remains in this technique all intermediate steps and cluster associations are immediately available - depends on the user to select an appropriate pruning level in the tree there are many ways to link samples and these do result in different trees (see book for details) Hierarchical cluster analysis An eample of hierarchical clustering: the composition of a number of lava samples from Kawah Ijen volcano: degree of dissimilarity sample KV01 KV20 KV41 KV43 KV08 KV10 KV12 KV14 KV14 KV21 KV21 clusters dissimilarity based upon nearest neighbour criterium the resulting tree can be pruned at any level: up to the user to select should test if difference between groups is significant (which test?) basalt andesite dacite duplicate
4 Clustering - partitioning techniques Two varieties of clustering: hierarchical and partitioning methods partitioning techniques: assigns samples to a known number of clusters based upon similarity criteria the method: 1. samples are assigned to the cluster they are most similar to in multi-dimensional space 2. each assignment results in a shift in the characteristics of the cluster centre (means + variance or only variance) 3. samples are re-assigned where necessary and this routine is iterated until the system stabilizes There are two main approaches: clustering with specified cluster means (i.e. known groups) and clustering where the means are obtained during clustering both have their pros and cons: Partitioning techniques advantages disadvantages specified/ fied you always get the same answer during classification groups can relate to real dividing phenomena unknowns are (generally) easily classified boundaries commonly based on consensus (artificial) 2 samples close together can be in different clusters 2 very different samples can be in same cluster assigned/ sought data groups not split up over different clusters boundaries always in regions of low data density easy to apply to data sets with many variables instability issues: more data will result in shift in cluster means and sample assignment no fied boundaries so unsuitable for classification schemes
5 Clustering with hard boundaries 2 samples close together can be in different clusters 2 very different samples can be in same cluster Partitioning techniques advantages disadvantages specified/ fied you always get the same answer during classification groups can relate to real dividing phenomena unknowns are (generally) easily classified boundaries commonly based on consensus (artificial) 2 samples close together can be in different clusters 2 very different samples can be in same cluster assigned/ sought data groups not split up over different clusters boundaries always in regions of low data density easy to apply to data sets with many variables instability issues: more data will result in shift in cluster means and sample assignment no fied boundaries so unsuitable for classification schemes
6 Cluster means assigned during clustering: when cluster means are specified: use minimum distance to mean to assign if not: randomly assign each sample to a cluster and iterate to stable solution both cluster means and cluster assignment change during the iteration process stops when samples no longer change their assignment cluster A cluster B cluster C center Cluster analysis - method of assignment Samples are normally assigned to a cluster in a hard way: samples are unambiguously attributed to a specific cluster - 0 or 1 assignment However, mother nature is rarely so black and white... middle age cluster depends very much on percon/country/continent 1 0 young A middle age B old 1 0 young middle age old if age is between A and B: middle age fuzzy approach: samples have cluster memberships between 0 and 1
7 Fuzzy clustering fuzzy clustering has a number of distinct benefits: can deal with intermediate cases - not force-assigned samples have share multiple clusters - etra information: (0.7 young middle age versus 0.5 young middle age) ensures that single samples do not overly control individual clusters can have a separate outlier assignment most fleible and powerful: fuzzy clustering with seeking of cluster means hard clustering strained assignment due to outlier and intermediate value fuzzy clustering outlier not a problem and intermediate shown Clustering in NCSS - the eating habits of Europe can we distinguish the Europeans by their eating habits? the data (missing value = -999): Real coffee Nescafe Tea Sweetener Biscuits Pack. soup Tinned soup Frozen fish Frozen veg. Apples Tinned fruit Jam Garlic Butter Margerine Olive oil Yoghurt
8 Clustering in NCSS - the eating habits of Europe hierarchical clustering of this data set: clear clustering lots of options available: use parametric and nonparametric data and even mi these (length + color) variety of linkage types: nearest neighbour, furthest neighbour, Ward s method distance: Euclidian or Manhattan city block see the NCSS hierarchical clustering tutorial for more information Clustering in NCSS - the eating habits of Europe hard and fuzzy clustering of this data set: K-means - hard fuzzy-prob 1 fuzzy-prob 2 fuzzy-prob 3 fuzzy-prob 4 Germany Italy France Netherlands Belgium Luembourg Britain Portugal Austria Switzerland Sweden Denmark Norway Finland Spain Ireland
9 Clustering - number of clusters the main difficulty in cluster analysis is choosing the no. of clusters NCSS and other clustering packages will calculate assignments for a cluster number range the residual variance will decrease with every additional cluster so this is not a good indicator of optimal no. of clusters instead: choose no. of clusters where variance no longer strongly decreases use the averaged silhouette value: comparison between a value s dissimilarity with its cluster and the dissimilarity with its nearest neighbour: ranges from 1 to -1: > 0.75: good model < 0.25: poor model Use the fuzziness of the model (0; completely fuzzy to 1; hard) Fc(U) and Dc(U) parameters: ma Fc(U) + min Dc(U) = best model Plotting clusters on maps - Massif Central dataset Will look at an eample from the Massif Central in France. A dataset of the chemical composition of stream sediments collected in an area with a diverse geology and old, now abandoned, mining for Sb, As, Pb, Au, Ba & F Geology consists of: + Ally + Chilhac felsic gneisses Cronce river + + Lavoute-Chilhac St Cirgues + Reilhac Langeac recent volcanics mafic gneisses and schists Védrines + St Loup + Chastel + Lestival + Chanteuges + Pinols + Marsanges + Barlet (meta) - granite + Desges Desges river + Chazelles + Pebrac + + Charrai Prades sediment (incl coal) 5 km
10 Clustering - groups in Massif Central dataset The dataset is best described when split up into si clusters clear link to the bedrock geology, but not 1 to 1 Clustering - properties per cluster when the data have been clustered: can look at the characteristics of each cluster (mean + stdev) and correlations within this SiO2 MgO log (Li) K2O Li V
11 Clustering - groups in Massif Central dataset Can plot clusters individually to look at spatial distribution and contents cluster 1 cluster 2 cluster 3 cluster 4 cluster separation isn t perfect only cluster 4 is distinct in Li: multi-element separation Plotting data on maps: bubble plots Data are plotted at their spatial coordinates with a symbol whose size represents the value of the data point 140 Can apply eactly the same tools as used on the element map: adjust contrast, isolate features and perform data transformations can also overlay these bubbles on another layer, such as a topo map, geol map, stream map etc
12 Plotting data on maps: bubble plots Stream sediments as a reflection of the local geology: Beryllium Be concentrations without processing: 500 sometimes it just works! Plotting data on maps: bubble plots Silver concentrations: working with a non-normal distribution Ag optimized linear scale square root scale
13 Plotting data on maps: bubble plots Don t have to plot all the data in the dataset: applying a cut-off at low values will highlight interesting samples, whereas a high cut-off removes outliers Zn, only data with > 50 ppm Plotting data on maps: bubble plots Looking for element associations by combining bubble plots Cd Zn Sb W
14 Plotting data on maps Combining elements by using multi-coloured bubble plots is useful, but fast becomes confusing: can lead you to miss interesting samples Can also calculate such associations beforehand and plot them directly: Sb + Zn Sb / Zn Or you can apply logical rules to the data before plotting: plot Sb if S > 200 ppm if SiO2 > 60 wt% then plot K / Zr Note that such properties are calculated much easier and faster in programs designed for such calculations: e.g. Ecel or Quattro Pro Plotting data on maps Not limited to plotting data, but can also plot derived properties such as the mean, median, standard deviation, etc and not just values, but also other observations: geol code / vegetation / mode in multi-modal distribution
15 Plotting data on maps: bubble plots Plotting processed data - standard deviation: the variability at a sample site Plotting data on maps: artefacts in the gold map
16 Spatial data visualization To be able to calculate contours and surfaces: interpolation need to know the concentration at any point in the sampling space to be able to draw smooth contours: o interpolate between values interpolation on As content grid; 40 nearest neighbour 20 o o radius technique: 1/r o radius technique: 1/r 2 Spatial data visualization Results of different interpolation techniques:
17 Spatial data visualization To be able to calculate contours and surfaces: interpolation 140 interpolation on As content grid; 120 nearest neighbour 100 o radius technique: 1/r o o radius technique: 1/r 2 main issue: what samples should be included in the interpolation: 20 o what should the maimum radius be? Interpolation radius Spatial data have a very useful property: adjacent samples should be most similar, whereas samples that are far apart can be distinctly different, or: the variance for a small interpolation radius is small, as the variance between adjacent samples is small the variance increases as the interpolation radius increases (i.e. as samples further away from the point of interest are included) at some radius the variance will no longer increase as we have reached the overall variance, which is called the regional variance including values beyond the regional variance radius is pointless as such samples do not contain any information on the value at the point of interest
18 Interpolation radius Interpolation radius in a sedimentary core: radius concentration adjacent samples are most similar: as interpolation radius increases so does the variances when you enter another unit the variance increases significantly: such samples should not be included in your interpolation Interpolation radius
19 Interpolation radius variance radius semivariance radius Interpolation radius semivariance semivariance radius
20 Semivariance and semivariograms This concept is semivariance and is shown in a semivariogram semivariance: the variance between samples a specified interval or distance apart as the interval increases, the semivariance will approach the total variance of the data set, so it is a spatially controlled partial variance of the data γ h = Σ (z i - z i+h ) 2 2(n - h) with: γ = semivariance for interval h n = total number of samples zi = value at position i as h increases, the relatedness of the samples decreases and the variance will therefore increase: Semivariance and semivariograms plotting the semivariance against h: semivriogram concentration semivariance no relation with distance: random distance interval concentration semivariance gradual changes in concentration distance interval concentration semivariance continuous variation with distance: trend distance interval
21 Semivariance and semivariograms properties of a semivariogram : sill drift semivariance range range interval interval the range is the interval within which there is similarity between the samples Semivariance and semivariograms Semivariograms provide our maimum radius criterion: only samples that fall within the range are included in interpolation before we continue, a few notes: semivariograms have to be determined for each variable as each has its own range: interpolation has to be performed separately as well semivariograms are generally different for different spatial directions (N, SW, etc). Such anisotropy can point to an underlying geological phenomenon such as layering or a fault control on conc. This can be corrected for either manually by stretching the coordinate system perpendicular to the main ais, or automatically by kriging software most semivariagrams have an apparent cut-off at zero distance that has a semivariance 0. This is called the nugget effect and is caused by sample heterogeneity (= field duplicate variance)
22 Nugget effect in semi-variograms sill There is always some uncertainty at a given sample site, which you could quantify by taking field duplicates. semivariance nugget range interval This sample site variance is the nugget in a semivariogram (in essence the variance at zero distance) Every element will have such a nugget, but the effect is strongest for elements that are heterogeneously distributed, such as gold present as nuggets in a sediment because we use mean + var Using semivariogram information: kriging The interpolation technique that employs the range information as obtained from semivariograms is called kriging in kriging, only samples that are within the range are used to determine the value at a given intermediate position and the weighing for each sample is derived from its associated semivariance A (i,yi) = wt1 * A (1,y1) + wt2 * A (2,y2) + wt3 * A (3,y3) +... as an added bonus this also gives us the variance associated with each interpolated value (the uncertainty), so we can immediately see where our interpolations are reliable and where they are not because weights are based on the semivariance, obvious trends in the data should be removed as this leads to a continuous rise in the semivariance: can be done by first subtracting a trend surface
23 Estimate of uncertainty for each interpolated value source: wikipedia.org Uncertainty in block kriging of grades Kriging is commonly applied to estimate the grade of blocks in open pit mining using a sample grid or the grade of adjacent blocks (or both). In such cases it is invaluable to know the uncertainty on the grade estimate
24 Flavours of kriging There are many flavours of kriging and discussing them all would be a course in its own right. A few terms that you come across commonly: Simple/Ordinary kriging: no trend in the data, so there is a constant mean in the dataset and the variance is calculated as the difference from this mean. This mean is either known (Simple) or calculated from the data (Ordinary) Universal kriging: there is a spatial trend in the data, so the mean varies with the spatial coordinates. Instead of using universal kriging, you can also remove the trend in pre-processing of the data Indicator kriging: rather than estimating a numerical value at a given point, you estimate if it is higher or lower than a set value, and the prob. of this Co-kriging: a second variable is included in the kriging which is correlated with the first variable. This should improve estimates of the first and main variable Good kriging resource: Clark & Harper (2000) Practical Geostatistics ISBN , or you can download the 1979 original at Back to our eample Results of different interpolation techniques:
25 And now using kriging as the interpolation method Results of kriging on this data set: Kriging and trends The effect of a strong spatial trend in the data
26 Kriging and trends The effect of a strong spatial trend in the data Some data are not suited to interpolation/kriging There is a strong tendency to directly start with the most comple or fancy technique, such as kriging. However, kriging is not always appropriate! raw concentrations plotted optimized kriging map
27 Kriging and sample coverage Kriging works best when you have a high sample density and a more or less uniform distribution of data over the sample are. If not get artefacts Areas without samples need to be blanketed out, not just removed afterwards
Univariate Data Presentation: The Contouring Conundrum and Philosophical Arguments Regarding the Contouring of Geochemical Data Clifford R.
Univariate Data Presentation: The Contouring Conundrum and Philosophical Arguments Regarding the Contouring of Geochemical Data Clifford R. Stanley Dept. of Geology, Acadia University, Wolfville, Nova
More informationSpatial Data Mining. Regression and Classification Techniques
Spatial Data Mining Regression and Classification Techniques 1 Spatial Regression and Classisfication Discrete class labels (left) vs. continues quantities (right) measured at locations (2D for geographic
More informationLecture 5 Geostatistics
Lecture 5 Geostatistics Lecture Outline Spatial Estimation Spatial Interpolation Spatial Prediction Sampling Spatial Interpolation Methods Spatial Prediction Methods Interpolating Raster Surfaces with
More informationSpatial Analysis II. Spatial data analysis Spatial analysis and inference
Spatial Analysis II Spatial data analysis Spatial analysis and inference Roadmap Spatial Analysis I Outline: What is spatial analysis? Spatial Joins Step 1: Analysis of attributes Step 2: Preparing for
More informationIntroduction. Semivariogram Cloud
Introduction Data: set of n attribute measurements {z(s i ), i = 1,, n}, available at n sample locations {s i, i = 1,, n} Objectives: Slide 1 quantify spatial auto-correlation, or attribute dissimilarity
More informationApplying cluster analysis to 2011 Census local authority data
Applying cluster analysis to 2011 Census local authority data Kitty.Lymperopoulou@manchester.ac.uk SPSS User Group Conference November, 10 2017 Outline Basic ideas of cluster analysis How to choose variables
More informationA robust statistically based approach to estimating the probability of contamination occurring between sampling locations
A robust statistically based approach to estimating the probability of contamination occurring between sampling locations Peter Beck Principal Environmental Scientist Image placeholder Image placeholder
More informationSTATISTICA MULTIVARIATA 2
1 / 73 STATISTICA MULTIVARIATA 2 Fabio Rapallo Dipartimento di Scienze e Innovazione Tecnologica Università del Piemonte Orientale, Alessandria (Italy) fabio.rapallo@uniupo.it Alessandria, May 2016 2 /
More informationContents 1 Introduction 2 Statistical Tools and Concepts
1 Introduction... 1 1.1 Objectives and Approach... 1 1.2 Scope of Resource Modeling... 2 1.3 Critical Aspects... 2 1.3.1 Data Assembly and Data Quality... 2 1.3.2 Geologic Model and Definition of Estimation
More informationExploring the World of Ordinary Kriging. Dennis J. J. Walvoort. Wageningen University & Research Center Wageningen, The Netherlands
Exploring the World of Ordinary Kriging Wageningen University & Research Center Wageningen, The Netherlands July 2004 (version 0.2) What is? What is it about? Potential Users a computer program for exploring
More informationSupplementary material: Methodological annex
1 Supplementary material: Methodological annex Correcting the spatial representation bias: the grid sample approach Our land-use time series used non-ideal data sources, which differed in spatial and thematic
More informationAn Introduction to Pattern Statistics
An Introduction to Pattern Statistics Nearest Neighbors The CSR hypothesis Clark/Evans and modification Cuzick and Edwards and controls All events k function Weighted k function Comparative k functions
More informationData Preprocessing. Cluster Similarity
1 Cluster Similarity Similarity is most often measured with the help of a distance function. The smaller the distance, the more similar the data objects (points). A function d: M M R is a distance on M
More informationMultivariate Analysis
Prof. Dr. J. Franke All of Statistics 3.1 Multivariate Analysis High dimensional data X 1,..., X N, i.i.d. random vectors in R p. As a data matrix X: objects values of p features 1 X 11 X 12... X 1p 2.
More informationTypes of Spatial Data
Spatial Data Types of Spatial Data Point pattern Point referenced geostatistical Block referenced Raster / lattice / grid Vector / polygon Point Pattern Data Interested in the location of points, not their
More informationTricks to Creating a Resource Block Model. St John s, Newfoundland and Labrador November 4, 2015
Tricks to Creating a Resource Block Model St John s, Newfoundland and Labrador November 4, 2015 Agenda 2 Domain Selection Top Cut (Grade Capping) Compositing Specific Gravity Variograms Block Size Search
More informationENGRG Introduction to GIS
ENGRG 59910 Introduction to GIS Michael Piasecki October 13, 2017 Lecture 06: Spatial Analysis Outline Today Concepts What is spatial interpolation Why is necessary Sample of interpolation (size and pattern)
More information11/8/2018. Spatial Interpolation & Geostatistics. Kriging Step 1
(Z i Z j ) 2 / 2 (Z i Zj) 2 / 2 Semivariance y 11/8/2018 Spatial Interpolation & Geostatistics Kriging Step 1 Describe spatial variation with Semivariogram Lag Distance between pairs of points Lag Mean
More informationGeostatistics: Kriging
Geostatistics: Kriging 8.10.2015 Konetekniikka 1, Otakaari 4, 150 10-12 Rangsima Sunila, D.Sc. Background What is Geostatitics Concepts Variogram: experimental, theoretical Anisotropy, Isotropy Lag, Sill,
More informationAutomatic Determination of Uncertainty versus Data Density
Automatic Determination of Uncertainty versus Data Density Brandon Wilde and Clayton V. Deutsch It is useful to know how various measures of uncertainty respond to changes in data density. Calculating
More informationSotkamo Silver Ag-Zn-Pb-Au Deposit: Mineral Resource Estimate Update
Sotkamo Silver Ag-Zn-Pb-Au Deposit: Mineral Resource Estimate Update Jan 15. 2014 By Jyrki Parkkinen Ph.D. Eurogeologist (850) Parkkinen Geoconsulting Jyrki.parkkinen@elisanet.fi LinkedIn Parkkinen: Sotkamo
More informationAssessing Pillar Geometries in the Witbank and Highveld Coalfields Using Geostatistical Techniques
Assessing Pillar Geometries in the Witbank and Highveld Coalfields Using Geostatistical Techniques Gavin H Lind Department of Mining Engineering University of the Witwatersrand Private Bag X3, WITS, 2050
More informationSINGULARITY ANALYSIS FOR IMAGE PROCESSING AND ANOMALY ENHANCEMENT. Qiuming CHENG
SINGULARITY ANALYSIS FOR IMAGE PROCESSING AND ANOMALY ENHANCEMENT Qiuming CHENG Department of Earth and Atmospheric Science, Department of Geography, York University, Toronto, 4700 Keele Street, Ont. M3J
More informationSpatial Interpolation & Geostatistics
(Z i Z j ) 2 / 2 Spatial Interpolation & Geostatistics Lag Lag Mean Distance between pairs of points 1 y Kriging Step 1 Describe spatial variation with Semivariogram (Z i Z j ) 2 / 2 Point cloud Map 3
More informationVocabulary: Data About Us
Vocabulary: Data About Us Two Types of Data Concept Numerical data: is data about some attribute that must be organized by numerical order to show how the data varies. For example: Number of pets Measure
More informationPrinciples of Pattern Recognition. C. A. Murthy Machine Intelligence Unit Indian Statistical Institute Kolkata
Principles of Pattern Recognition C. A. Murthy Machine Intelligence Unit Indian Statistical Institute Kolkata e-mail: murthy@isical.ac.in Pattern Recognition Measurement Space > Feature Space >Decision
More informationI don t have much to say here: data are often sampled this way but we more typically model them in continuous space, or on a graph
Spatial analysis Huge topic! Key references Diggle (point patterns); Cressie (everything); Diggle and Ribeiro (geostatistics); Dormann et al (GLMMs for species presence/abundance); Haining; (Pinheiro and
More informationModelling and projecting the postponement of childbearing in low-fertility countries
of childbearing in low-fertility countries Nan Li and Patrick Gerland, United Nations * Abstract In most developed countries, total fertility reached below-replacement level and stopped changing notably.
More informationManuscript of paper for APCOM 2003.
1 Manuscript of paper for APCOM 2003. AN ANALYSIS OF THE PRACTICAL AND ECONOMIC IMPLICATIONS OF SYSTEMATIC UNDERGROUND DRILLING IN DEEP SOUTH AFRICAN GOLD MINES W. ASSIBEY-BONSU Consultant: Geostatistics
More informationRefinement of the OECD regional typology: Economic Performance of Remote Rural Regions
[Preliminary draft April 2010] Refinement of the OECD regional typology: Economic Performance of Remote Rural Regions by Lewis Dijkstra* and Vicente Ruiz** Abstract To account for differences among rural
More informationLecture 9: Location Effects, Economic Geography and Regional Policy
Lecture 9: Location Effects, Economic Geography and Regional Policy G. Di Bartolomeo Index, EU-25 = 100 < 30 30-50 50-75 75-100 100-125 >= 125 Canarias (E) Guadeloupe Martinique RÈunion (F) (F) (F) Guyane
More informationBeta-Binomial Kriging: An Improved Model for Spatial Rates
Available online at www.sciencedirect.com ScienceDirect Procedia Environmental Sciences 27 (2015 ) 30 37 Spatial Statistics 2015: Emerging Patterns - Part 2 Beta-Binomial Kriging: An Improved Model for
More informationSpatial analysis. 0 move the objects and the results change
0 Outline: Roadmap 0 What is spatial analysis? 0 Transformations 0 Introduction to spatial interpolation 0 Classification of spatial interpolation methods 0 Interpolation methods 0 Areal interpolation
More informationCOMPARISON OF DIGITAL ELEVATION MODELLING METHODS FOR URBAN ENVIRONMENT
COMPARISON OF DIGITAL ELEVATION MODELLING METHODS FOR URBAN ENVIRONMENT Cahyono Susetyo Department of Urban and Regional Planning, Institut Teknologi Sepuluh Nopember, Indonesia Gedung PWK, Kampus ITS,
More informationEarth Science Chapter 6 Rocks
Earth Science Chapter 6 Rocks I. Rocks and the Rock Cycle * Material that makes up the solid part of the Earth. * Made of a variety of different combinations of minerals and organic matter. A. Three Major
More informationGeochemical Data Evaluation and Interpretation
Geochemical Data Evaluation and Interpretation Eric Grunsky Geological Survey of Canada Workshop 2: Exploration Geochemistry Basic Principles & Concepts Exploration 07 8-Sep-2007 Outline What is geochemical
More informationSTATISTICS Relationships between variables: Correlation
STATISTICS 16 Relationships between variables: Correlation The gentleman pictured above is Sir Francis Galton. Galton invented the statistical concept of correlation and the use of the regression line.
More informationPRODUCING PROBABILITY MAPS TO ASSESS RISK OF EXCEEDING CRITICAL THRESHOLD VALUE OF SOIL EC USING GEOSTATISTICAL APPROACH
PRODUCING PROBABILITY MAPS TO ASSESS RISK OF EXCEEDING CRITICAL THRESHOLD VALUE OF SOIL EC USING GEOSTATISTICAL APPROACH SURESH TRIPATHI Geostatistical Society of India Assumptions and Geostatistical Variogram
More informationOptimizing Sampling Schemes for Mapping and Dredging Polluted Sediment Layers
This file was created by scanning the printed publication. Errors identified by the software have been corrected; however, some errors may remain. Optimizing Sampling Schemes for Mapping and Dredging Polluted
More informationOnline Appendix for Cultural Biases in Economic Exchange? Luigi Guiso Paola Sapienza Luigi Zingales
Online Appendix for Cultural Biases in Economic Exchange? Luigi Guiso Paola Sapienza Luigi Zingales 1 Table A.1 The Eurobarometer Surveys The Eurobarometer surveys are the products of a unique program
More informationGeostatistical Interpolation: Kriging and the Fukushima Data. Erik Hoel Colligium Ramazzini October 30, 2011
Geostatistical Interpolation: Kriging and the Fukushima Data Erik Hoel Colligium Ramazzini October 30, 2011 Agenda Basics of geostatistical interpolation Fukushima radiation Database Web site Geoanalytic
More informationAngel International School - Manipay 1 st Term Examination November, 2015
Grade 08 Angel International School - Manipay 1 st Term Examination November, 2015 Chemistry Duration: 2 Hours Index No:- Underline the correct answer. 1. Which of the following is a fossil fuel? a) Natural
More information7 Geostatistics. Figure 7.1 Focus of geostatistics
7 Geostatistics 7.1 Introduction Geostatistics is the part of statistics that is concerned with geo-referenced data, i.e. data that are linked to spatial coordinates. To describe the spatial variation
More informationFinding the Nearest Positive Definite Matrix for Input to Semiautomatic Variogram Fitting (varfit_lmc)
Finding the Nearest Positive Definite Matrix for Input to Semiautomatic Variogram Fitting (varfit_lmc) Arja Jewbali (arja.jewbali@riotinto.com) Resource Estimation Geologist Rio Tinto Iron Ore In resource
More informationCREATION OF DEM BY KRIGING METHOD AND EVALUATION OF THE RESULTS
CREATION OF DEM BY KRIGING METHOD AND EVALUATION OF THE RESULTS JANA SVOBODOVÁ, PAVEL TUČEK* Jana Svobodová, Pavel Tuček: Creation of DEM by kriging method and evaluation of the results. Geomorphologia
More informationIsatis applications for soil pollution mapping and risk assessment
Isatis applications for soil pollution mapping and risk assessment Nicolas JEANNEE Contact: Email: jeannee@geovariances.fr - Tel: +33 ()4.67.6.61.96 ISATIS User s Meeting Fontainebleau, Sept. 26 Objective
More informationData analysis and Geostatistics - lecture VI
Data analysis and Geostatistics - lecture VI Statistical testing with population distributions Statistical testing - the steps 1. Define a hypothesis to test in statistics only a hypothesis rejection is
More informationToward an automatic real-time mapping system for radiation hazards
Toward an automatic real-time mapping system for radiation hazards Paul H. Hiemstra 1, Edzer J. Pebesma 2, Chris J.W. Twenhöfel 3, Gerard B.M. Heuvelink 4 1 Faculty of Geosciences / University of Utrecht
More informationCombining geological surface data and geostatistical model for Enhanced Subsurface geological model
Combining geological surface data and geostatistical model for Enhanced Subsurface geological model M. Kurniawan Alfadli, Nanda Natasia, Iyan Haryanto Faculty of Geological Engineering Jalan Raya Bandung
More informationData Exploration and Unsupervised Learning with Clustering
Data Exploration and Unsupervised Learning with Clustering Paul F Rodriguez,PhD San Diego Supercomputer Center Predictive Analytic Center of Excellence Clustering Idea Given a set of data can we find a
More information2. Sample representativeness. That means some type of probability/random sampling.
1 Neuendorf Cluster Analysis Model: X1 X2 X3 X4 X5 Clusters (Nominal variable) Y1 Y2 Y3 Clustering/Internal Variables External Variables Assumes: 1. Actually, any level of measurement (nominal, ordinal,
More informationCHEK Proficiency study 671. CMI, MI and free formaldehyde in body lotion. Date 30 November 2017 Version 1. Report number CHEK
CHEK Proficiency study 671 CMI, MI and free formaldehyde in body lotion Date 30 November 2017 Version 1 Status Final Report number CHEK-17-671 page 0 Colophon Number 671 Name CMI, MI and free formaldehyde
More informationGeostatistical Density Mapping
Geostatistical Density Mapping Sean A. McKenna and Barry Roberts (SNL) Brent Pulsipher & John Hathaway (PNNL) 2008 Partners in Environmental Technology Technical Symposium & Workshop Sandia is a multiprogram
More informationThe Snap lake diamond deposit - mineable resource.
Title Page: Author: Designation: Affiliation: Address: Tel: Email: The Snap lake diamond deposit - mineable resource. Fanie Nel Senior Mineral Resource Analyst De Beers Consolidated Mines Mineral Resource
More informationIntroduction. Table 27. Median concentrations of Eu in the FOREGS samples and in some reference data sets.
Eu Europium Introduction See section on Rare Earth Elements (REEs). Table 27 compares the median concentrations of Eu in the FOREGS samples and in some reference datasets. Table 27. Median concentrations
More informationAzerbaijan International Mining Company Limited
Updated Mineral Resources Gedabek Mineral Deposit, Republic of Azerbaijan Azerbaijan International Mining Company Limited Prepared by CAE Mining CAE Mining 8585 Cote-de-Liesse Saint-Laurent Quebec H4T
More informationSPATIAL-TEMPORAL TECHNIQUES FOR PREDICTION AND COMPRESSION OF SOIL FERTILITY DATA
SPATIAL-TEMPORAL TECHNIQUES FOR PREDICTION AND COMPRESSION OF SOIL FERTILITY DATA D. Pokrajac Center for Information Science and Technology Temple University Philadelphia, Pennsylvania A. Lazarevic Computer
More informationLecture 9: Introduction to Kriging
Lecture 9: Introduction to Kriging Math 586 Beginning remarks Kriging is a commonly used method of interpolation (prediction) for spatial data. The data are a set of observations of some variable(s) of
More informationSpatiotemporal Analysis of Environmental Radiation in Korea
WM 0 Conference, February 25 - March, 200, Tucson, AZ Spatiotemporal Analysis of Environmental Radiation in Korea J.Y. Kim, B.C. Lee FNC Technology Co., Ltd. Main Bldg. 56, Seoul National University Research
More informationINTEGRATION OF BOREHOLE INFORMATION AND RESISTIVITY DATA FOR AQUIFER VULNERABILITY
INTEGRATION OF BOREHOLE INFORMATION AND RESISTIVITY DATA FOR AQUIFER VULNERABILITY ANDERS V. CHRISTIANSEN 1, ESBEN AUKEN 1 AND KURT SØRENSEN 1 1 HydroGeophysics Group, Aarhus University, Finlandsgade 8,
More informationRegression Revisited (again) Isobel Clark Geostokos Limited, Scotland. Abstract
Regression Revisited (again) Isobel Clark Geostokos Limited, Scotland Abstract One of the seminal pioneering papers in reserve evaluation was published by Danie Krige in 1951. In this paper he introduced
More informationSpatial Data Analysis in Archaeology Anthropology 589b. Kriging Artifact Density Surfaces in ArcGIS
Spatial Data Analysis in Archaeology Anthropology 589b Fraser D. Neiman University of Virginia 2.19.07 Spring 2007 Kriging Artifact Density Surfaces in ArcGIS 1. The ingredients. -A data file -- in.dbf
More informationDrift Inflates Variance among Populations. Geographic Population Structure. Variance among groups increases across generations (Buri 1956)
Geographic Population Structure Alan R Rogers December 4, 205 / 26 Drift Inflates Variance among Populations 2 / 26 Variance among groups increases across generations Buri 956) 3 / 26 Gene flow migration)
More informationEstimation of hydrogeological parameters for groundwater modelling with fuzzy geostatistics: closer to nature?
Calibration and Reliability in Groundwater Modelling (Proceedings of the ModelCARE 96 Conference held at Golden, Colorado, September 1996). IAHS Publ. no. 237, 1996. 511 Estimation of hydrogeological parameters
More informationFittings and Pipe Connections
KWD Market Report Fittings and Pipe Connections Europe 2016 Situation and Trends for Heating & Plumbing Pipe Systems Contents: 1. Fittings and pipe connections Europe 2016 page 6 2. Terms and definitions
More information4th HR-HU and 15th HU geomathematical congress Geomathematics as Geoscience Reliability enhancement of groundwater estimations
Reliability enhancement of groundwater estimations Zoltán Zsolt Fehér 1,2, János Rakonczai 1, 1 Institute of Geoscience, University of Szeged, H-6722 Szeged, Hungary, 2 e-mail: zzfeher@geo.u-szeged.hu
More informationEstimation of direction of increase of gold mineralisation using pair-copulas
22nd International Congress on Modelling and Simulation, Hobart, Tasmania, Australia, 3 to 8 December 2017 mssanz.org.au/modsim2017 Estimation of direction of increase of gold mineralisation using pair-copulas
More informationSelf Organizing Maps. We are drowning in information and starving for knowledge. A New Approach for Integrated Analysis of Geological Data.
Radiometrics Processed Landsat TM Self Organizing Maps A New Approach for Integrated Analysis of Geological Data. We are drowning in information and starving for knowledge. Rutherford D. Roger Stephen.Fraser@csiro.au
More informationAdvanced analysis and modelling tools for spatial environmental data. Case study: indoor radon data in Switzerland
EnviroInfo 2004 (Geneva) Sh@ring EnviroInfo 2004 Advanced analysis and modelling tools for spatial environmental data. Case study: indoor radon data in Switzerland Mikhail Kanevski 1, Michel Maignan 1
More informationIssues and Techniques in Pattern Classification
Issues and Techniques in Pattern Classification Carlotta Domeniconi www.ise.gmu.edu/~carlotta Machine Learning Given a collection of data, a machine learner eplains the underlying process that generated
More information3. Bedrock from which entire geologic time period is missing between rock units 5 and 6? The fault most likely occurred after
Homework # 4 Relative Dating Name: Section: Date: 1. The cross section below represents a portion of Earth s crust. Letters through D are locations within the rock units. 5. The sequence of rock types
More informationMultivariate Analysis Cluster Analysis
Multivariate Analysis Cluster Analysis Prof. Dr. Anselmo E de Oliveira anselmo.quimica.ufg.br anselmo.disciplinas@gmail.com Cluster Analysis System Samples Measurements Similarities Distances Clusters
More informationDesign of Experiments
Design of Experiments D R. S H A S H A N K S H E K H A R M S E, I I T K A N P U R F E B 19 TH 2 0 1 6 T E Q I P ( I I T K A N P U R ) Data Analysis 2 Draw Conclusions Ask a Question Analyze data What to
More informationTransiogram: A spatial relationship measure for categorical data
International Journal of Geographical Information Science Vol. 20, No. 6, July 2006, 693 699 Technical Note Transiogram: A spatial relationship measure for categorical data WEIDONG LI* Department of Geography,
More informationInternational Journal of Remote Sensing, in press, 2006.
International Journal of Remote Sensing, in press, 2006. Parameter Selection for Region-Growing Image Segmentation Algorithms using Spatial Autocorrelation G. M. ESPINDOLA, G. CAMARA*, I. A. REIS, L. S.
More informationAs compaction and cementation of these sediments eventually occur, which area will become siltstone? A) A B) B C) C D) D
1. A student obtains a cup of quartz sand from a beach. A saltwater solution is poured into the sand and allowed to evaporate. The mineral residue from the saltwater solution cements the sand grains together,
More informationPROSPECTIVE AREAS FOR CRITICAL RAW MATERIALS IN THE EU
This project has received funding from the European Union s Horizon 2020 research and innovation programme under grant agreement No. 776487 Ref. Ares(2017)5879386-30/11/2017 PROSPECTIVE AREAS FOR CRITICAL
More informationTime: 1 hour 30 minutes
Paper Reference(s) 6683/01 Edexcel GCE Statistics S1 Silver Level S4 Time: 1 hour 30 minutes Materials required for examination papers Mathematical Formulae (Green) Items included with question Nil Candidates
More informationVISUAL ANALYTICS APPROACH FOR CONSIDERING UNCERTAINTY INFORMATION IN CHANGE ANALYSIS PROCESSES
VISUAL ANALYTICS APPROACH FOR CONSIDERING UNCERTAINTY INFORMATION IN CHANGE ANALYSIS PROCESSES J. Schiewe HafenCity University Hamburg, Lab for Geoinformatics and Geovisualization, Hebebrandstr. 1, 22297
More informationTypes of spatial data. The Nature of Geographic Data. Types of spatial data. Spatial Autocorrelation. Continuous spatial data: geostatistics
The Nature of Geographic Data Types of spatial data Continuous spatial data: geostatistics Samples may be taken at intervals, but the spatial process is continuous e.g. soil quality Discrete data Irregular:
More informationStatistical Perspectives on Geographic Information Science. Michael F. Goodchild University of California Santa Barbara
Statistical Perspectives on Geographic Information Science Michael F. Goodchild University of California Santa Barbara Statistical geometry Geometric phenomena subject to chance spatial phenomena emphasis
More informationGEOSTATISTICS. Dr. Spyros Fountas
GEOSTATISTICS Dr. Spyros Fountas Northing (m) 140550 140450 140350 Trent field Disturbed area Andover 140250 Panholes 436950 437050 437150 437250 437350 Easting (m) Trent Field Westover Farm (Blackmore,
More informationMerging Rain-Gauge and Radar Data
Merging Rain-Gauge and Radar Data Dr Sharon Jewell, Obserations R&D, Met Office, FitzRoy Road, Exeter sharon.jewell@metoffice.gov.uk Outline 1. Introduction The Gauge and radar network Interpolation techniques
More informationChapter 4 Rocks & Igneous Rocks
Chapter 4 Rocks & Igneous Rocks Rock Definition A naturally occurring consolidated mixture of one or more minerals e.g, marble, granite, sandstone, limestone Rock Definition Must naturally occur in nature,
More informationSKINAKAS OBSERVATORY. Astronomy Projects for University Students PROJECT GALAXIES
PROJECT 7 GALAXIES Objective: The topics covered in the previous lessons target celestial objects located in our neighbourhood, i.e. objects which are within our own Galaxy. However, the Universe extends
More informationGridding of precipitation and air temperature observations in Belgium. Michel Journée Royal Meteorological Institute of Belgium (RMI)
Gridding of precipitation and air temperature observations in Belgium Michel Journée Royal Meteorological Institute of Belgium (RMI) Gridding of meteorological data A variety of hydrologic, ecological,
More informationA MultiGaussian Approach to Assess Block Grade Uncertainty
A MultiGaussian Approach to Assess Block Grade Uncertainty Julián M. Ortiz 1, Oy Leuangthong 2, and Clayton V. Deutsch 2 1 Department of Mining Engineering, University of Chile 2 Department of Civil &
More informationROeS Seminar, November
IASC Introduction: Spatial Interpolation Estimation at a certain location Geostatistische Modelle für Fließgewässer e.g. Air pollutant concentrations were measured at different locations. What is the concentration
More informationConcepts and Applications of Kriging
2013 Esri International User Conference July 8 12, 2013 San Diego, California Technical Workshop Concepts and Applications of Kriging Eric Krause Konstantin Krivoruchko Outline Intro to interpolation Exploratory
More informationThe development of a Kriging based Gauge and Radar merged product for real-time rainfall accumulation estimation
The development of a Kriging based Gauge and Radar merged product for real-time rainfall accumulation estimation Sharon Jewell and Katie Norman Met Office, FitzRoy Road, Exeter, UK (Dated: 16th July 2014)
More informationThe Raman Spectroscopy of Graphene and the Determination of Layer Thickness
Application Note: 52252 The Raman Spectroscopy of Graphene and the Determination of Layer Thickness Mark Wall, Ph.D., Thermo Fisher Scientific, Madison, WI, USA Key Words DXR Raman Microscope 2D Band D
More informationEmpirical Bayesian Kriging
Empirical Bayesian Kriging Implemented in ArcGIS Geostatistical Analyst By Konstantin Krivoruchko, Senior Research Associate, Software Development Team, Esri Obtaining reliable environmental measurements
More informationRegression revisited (again)
by I. Clark* Synopsis One of the seminal pioneering papers in reserve evaluation was published by Danie Krige in 1951. In that paper he introduced the concept of regression techniques in providing better
More informationConcepts and Applications of Kriging
Esri International User Conference San Diego, California Technical Workshops July 24, 2012 Concepts and Applications of Kriging Konstantin Krivoruchko Eric Krause Outline Intro to interpolation Exploratory
More informationIntroduction GeoXp : an R package for interactive exploratory spatial data analysis. Illustration with a data set of schools in Midi-Pyrénées.
Presentation of Presentation of Use of Introduction : an R package for interactive exploratory spatial data analysis. Illustration with a data set of schools in Midi-Pyrénées. Authors of : Christine Thomas-Agnan,
More informationThe observed global warming of the lower atmosphere
WATER AND CLIMATE CHANGE: CHANGES IN THE WATER CYCLE 3.1 3.1.6 Variability of European precipitation within industrial time CHRISTIAN-D. SCHÖNWIESE, SILKE TRÖMEL & REINHARD JANOSCHITZ SUMMARY: Precipitation
More informationMultitask Learning of Environmental Spatial Data
9th International Congress on Environmental Modelling and Software Brigham Young University BYU ScholarsArchive 6th International Congress on Environmental Modelling and Software - Leipzig, Germany - July
More informationReview of Lecture 1. Across records. Within records. Classification, Clustering, Outlier detection. Associations
Review of Lecture 1 This course is about finding novel actionable patterns in data. We can divide data mining algorithms (and the patterns they find) into five groups Across records Classification, Clustering,
More informationRadial basis functions and kriging a gold case study
Page Radial basis functions and kriging a gold case study D Kentwell, Principal Consultant, SRK Consulting This article was first published in The AusIMM Bulletin, December. Introduction Recent advances
More informationCombining airborne electromagnetics with geotechnical data for automated depth to bedrock tracking
Combining airborne electromagnetics with geotechnical data for automated depth to bedrock tracking Craig W Christensen a,b,1, Andreas A Pfaffhuber a, Helgard Anschütz a and Tone F Smaavik a a Norwegian
More information