The use of Geographic Information System (GIS) and non-gis methods to assess the external validity of samples postcollection

Size: px
Start display at page:

Download "The use of Geographic Information System (GIS) and non-gis methods to assess the external validity of samples postcollection"

Transcription

1 J Vet Diagn Invest 21: (2009) The use of Geographic Information System (GIS) and non-gis methods to assess the external validity of samples postcollection Esther Richardson, 1 Margaret Good, Guy McGrath, Simon J. More Abstract. External validity is fundamental to veterinary diagnostic investigation, reflecting the accuracy with which sample results can be extrapolated to a broader population of interest. Probability sampling methods are routinely used during the collection of samples from populations, specifically to maximize external validity. Nonprobability sampling (e.g., of blood samples collected as part of routine surveillance programs or laboratory submissions) may provide useful data for further posthoc epidemiological analysis, adding value to the collection and submission of samples. As the sample has already been submitted, the analyst or investigator does not have any control over the sampling methodology, and hence external validity as routine probability sampling methods may not have been employed. The current study describes several Geographic Information System (GIS) and non-gis methods, applied posthoc, to assess the external validity of samples collected using both probability and nonprobability sampling methods. These methods could equally be employed for inspecting other datasets. Mapping was conducted using ArcView 9.1. Based on this posthoc assessment, results from the random field sample could provide an externally valid, albeit relatively imprecise, estimate of national disease prevalence, of disease prevalence in 3 of the 4 provinces (all but Ulster, in the north and northwest, where sample size was small), and in beef and dairy herds. This study provides practical methods for examining the external validity of samples postcollection. Key words: External validity; Geographic Information System; opportunistic; random; sampling bias. Introduction External validity is fundamental to veterinary diagnostic investigation, reflecting the accuracy with which sample results can be extrapolated to a broader population of interest. 6 External validity can be affected by sampling bias. 8 Probability sampling methods are routinely used during the collection of population samples, specifically to minimize the potential of sampling bias. Using probability sampling, each member of the population has a known nonzero probability of being selected. 5 Using these methods, the results are expressed in a probabilistic manner; that is, an estimate of prevalence (with a defined precision and confidence) in the broader population, or substantiation of disease freedom (at a defined level of confidence and given a minimum disease prevalence, if infections were present) in the broader population. 4,14 Simple random sampling is the purest form of probability sampling. For From the Moorepark Dairy Production Research Centre, Moorepark, Fermoy, Co. Cork (Richardson), the Department of Agriculture and Food, Dublin (Good), and the Centre for Veterinary Epidemiology and Risk Analysis, Veterinary Sciences Centre, University College Dublin, Belfield, Dublin (McGrath, More), Ireland. 1 Corresponding Author: Esther Richardson, Moorepark Dairy Production Research Centre, Moorepark, Fermoy, Co. Cork, Ireland. richardson.esther@gmail.com example, in a previous study 13 of farm-level risk factors for porcine reproductive and respiratory syndrome (PRRS) in Denmark, a random sample of sow herds was selected from the PRRS register, which provided a complete list of all PRRS investigations conducted in Denmark. 13 However, this method is frequently unworkable, as a complete list of the population (a sampling frame) may not be available. Several other random sampling strategies are more commonly used in veterinary medicine, including systematic sampling and multistage sampling. 2 Variants of the latter include stratified and cluster sampling. 10,11 When nonprobability methods (e.g., purpose, opportunistic, and snowball) are used during sample selection, it is not possible to formally determine the degree of sampling bias present. 12 In veterinary medicine, posthoc stratification has routinely been applied following sample collection to compare characteristics of the sample with those of the broader population. 6 Broad guidelines are now available from the medical literature to enhance the quality and transparency in health research (www. equator-network.org; e.g., the Strengthening the Reporting of Observational studies in Epidemiology [STROBE] statement for observational epidemiological studies, the Standards for Reporting of Diagnostic Accuracy statement, the Consolidated Standards of Reporting Trials statement). 1,3,17 Relevant to 633

2 634 Richardson et al. external validity, the STROBE statement, 17 which covers observational epidemiological studies in general, and cohort, case-control, and cross-sectional studies in more detail, provides the following reporting guidelines: N In the Materials and Methods, it is important that authors present detail of selection criteria for participants, source and method of selection, and a flow diagram. N In the Results, detail should be presented of the number of participants potentially eligible, confirmed eligible, included in study, and lost to follow-up as well as descriptive data on characteristics of each participant group. N In the Discussion, authors should interpret the external validity of the study. Guidelines specifically for reporting surveys are also available. 9 Nonprobability sampling (e.g., of blood specimens collected as part of routine surveillance programs or laboratory submissions) may provide useful data for further posthoc epidemiological analysis about the condition(s) for which the samples were originally submitted for testing. Following additional testing, similar analyses can also be conducted about other condition(s), thereby adding value to existing sample collection. As the data had already been collected, the analyst or investigator does not have any control over the sampling methodology, and routine probability sampling methods may not have been employed, which means the external validity of the sample should be assessed prior to using probability-based statistics for making inferences to other groups from the data. In 2005, a survey was conducted in Ireland to determine the serological prevalence and within-herd prevalence of Johne s disease in Irish cattle. Random sampling of 1,000 herds was implicit in the study design; however, as a result of errors during specimen collection, there was a shortfall in the number targeted for collection. Further specimens were collected using opportunistic (nonprobability) sampling methods. As a consequence, the external validity of the overall sample needed to be verified. The current paper describes various Geographic Information System (GIS) and non-gis methods, applied posthoc, to assess the external validity of samples collected using both probability and nonprobability sampling methods. Materials and methods The data In Ireland, all cattle herds have a unique herd number and are registered in the national Animal Health Computer System. As part of the national brucellosis eradication program, blood is collected annually from all cattle (females and intact males) over 1 year of age. In 2005, brucellosis sampling was conducted on approximately 132,000 herds. In the current study, the reference population included a subset of these herds, specifically the 96,163 (breeding) herds where at least 1 home-born calf had been registered in A simple random sample of 1,000 herd numbers was selected from the reference population, using computergenerated random numbers. This sample size is sufficient to estimate a national herd level prevalence of 10%, with a 95% confidence interval and a precision of 2%. Arrangements were then made, through the national brucellosis testing laboratory, to collect all sera from each of these herds. A computer program was used to notify staff when relevant samples were available. Between June and November 2005, the sera from 640 herds (the random field sample ) were collected in this way. However, a computer error occurred for a period mid-year, and a proportion of the target samples were not collected. Once the error was detected, sera from a further 367 herds were collected based on opportunistic sampling during early winter 2005 to make up for the earlier missed samples. Samples in this opportunistic field sample were selected by laboratory staff from the brucellosis submissions as they came in, without any formal methodology of sampling employed. These datasets were checked for entry errors, including multiple entries. Duplicate entries for 4 herds (1 in the random field sample, 3 in the opportunistic field sample) were removed. A further 3 herd samples were artificially generated to provide comparison groups to the field samples, using the random number selection in the Animal Movement extension of ArcView version 2.4. a These generated samples included random A (605 herds, for comparison with the random field sample), random B (295 herds, for comparison with the opportunistic field sample), and random A+B (random A and B combined, 900 herds, for comparison with the combined random and opportunistic field samples). Data analyses General GIS information. In Ireland, herd numbers are geo-referenced to digitized parcels of land, constructed for calculating European Union land-based farming aid. The geo-referenced farm database does not include farms that do not manually complete an application for farming aid, including those that were automatically eligible and those where a claim for farming aid was not made. In the current study, the position of each farm was based on the center of the largest land fragment. Mapping was conducted using ArcView version 9.1. a Posthoc assessment of external validity. Non-GIS methods. The percentage of herds within each of the 26 counties of Ireland was determined based on available herd-level data from the reference population (n 5 96,163 herds), the random field sample (n 5 639), the opportunistic field sample (n 5 364), and the combined field sample (n 5 1,003). The percentages were graphed, and a

3 Assessing external validity following sample collection 635 chi-square test b was used to separately compare county herd distribution in the random field sample versus the reference population and the opportunistic field sample versus the reference population. The field samples and reference population were also stratified by province (4 categories: Connaught, Leinster, Munster, Ulster), county (26) and enterprise type (3: dairy, beef, mixed). Enterprise type was determined based on the proportion of beef and dairy breeds among mature cows in each herd (beef:.66% beef breeds; dairy:.66% dairy breeds; mixed: all other herds). A chi-square test b wasusedtoseparately compare the distribution of herds in the random field sample versus the reference population, and the opportunistic field sample versus the reference population, by province and by enterprise type. GIS methods. The spatial distributions of the random and opportunistic field samples were visually compared. Then, the average nearest neighbor method (ArcView 9.1) was used on both the field (random, opportunistic, combined) and generated (random A, B, A+B) samples to determine the degree of clustering in each. This method is a simple technique that assumes all points are free to locate anywhere in a study window. Using this assumption, the software calculates the hypothetical average distance between points and compares this with the actual observed distance between points. A normal distribution for the hypothetical calculated average distance is created and the ratio of the observed to expected average values are placed on this curve using the Z-score (that is, the probability that the data is not random). Confidence intervals of 90%, 95% and 99% from the curve are used to determine significance levels that define qualitative definitions of the likelihood that the observed data are randomly distributed. For example, within the confidence limits of 90% (significance level.0.10) there is a high likelihood that the points are neither dispersed nor clustered; furthermore, between 90 and 95% (significance level ), there is a medium likelihood data are dispersed or clustered but they are still possibly random. However, outside of the 95% confidence interval (significance level 0.05), there is a low likelihood that data are randomly distributed. Outside of 99% confidence interval, there is only a very low likelihood that data are randomly distributed, so it is highly likely that they are dispersed or clustered. It is a simple and robust test that defines observed data on a range of qualitative values from clustered to evenly dispersed, with random data falling in the middle of the range. Nearest neighbor ratios and qualitative outputs from ArcView were reported. Because of software restrictions, the nearest neighbor analysis could only be carried out in a rectangular area or window (Fig. 1) that incorporated as much of Ireland (but as little sea or Northern Ireland) as possible. Further to this, a kernel density plot (ArcView 9.1) was developed for each sample, except the combined field sample and random A+B, using the same window as a visual representation of the differences in sample distribution to support the findings of the nearest neighbor analysis. The kernel density plot was conducted using a 10-km search radius, with a 100-m grid. Results are presented as points per square kilometer. Figure 1. The rectangular area or window that was used during the nearest neighbor analysis. The window incorporates as much of Ireland (but as little sea or Northern Ireland) as possible. Results The samples During 2003, there were 96,163 (breeding) herds in Ireland where at least 1 home-born calf had been registered. Of these, 90,807 (93.7%) were geo-referenced, including 58,285 (60.6% of all breeding herds) located within the target window. Table 1 presents the total number of herds, the number geo-referenced Table 1. The number of herds, including geo-referenced herds (in total and within the target window), in the field and in generated samples. Geo-referenced herds All herds In total Within target window Field sample Random Opportunistic Combined 1, Generated samples Random A Random B Random A+B

4 636 Richardson et al. Figure 2. The percentage of herds in each county, by reference population, random field sample, opportunistic field sample, and combined field sample. Counties are alphabetized from Carlow to Wicklow and labeled from A to Z in that order. in total and within the target window in each of the field, and generated sample datasets. Posthoc assessment of external validity Non-GIS methods. Herd distribution. The percentage of herds in each county is presented in Figure 2. Compared with the reference population, there were significant differences in the county herd distribution in the opportunistic (P, 0.001) but not the random (P ) field samples. The distribution of herds by province and by enterprise type is presented in Table 2. Compared with the reference population, there were significant differences in herd distribution for the opportunistic field samples, both by province and enterprise type (each P, 0.001), but not in the random field sample (by province: P ; by enterprise type: P ). GIS methods. Visual assessment. The distribution of all geo-referenced cattle herds across the country was visually homogeneous, except for areas obviously not suitable for grazing, such as lakes, bogs, forest, and upland areas. Geo-referenced herds in the opportunistic field sample (n 5 295) were distributed in clusters across the country, whereas those in the random field sample (n 5 605) were evenly dispersed (Fig. 3). Nearest neighbor analysis. Based on results of the nearest neighbor analysis, herds in the field samples were more clustered than those in the generated samples (Table 3). Furthermore, the herds in the opportunistic and combine field samples tested as more likely to be clustered than those in the random field sample. Kernel density plots. Areas of higher sampling density are present in the kernel density plots of the opportunistic field sample and to a lesser degree the random field sample, but not of the equivalent generated samples (Fig. 4).

5 Assessing external validity following sample collection 637 Table 2. Number and percentage of herds in the field and opportunistic field samples, and in the reference population, as well as comparison between random field sample and reference population and between opportunistic field sample and reference population, by province and by enterprise type.* Random field sample Opportunistic field sample Reference population Strata Number P value{ Number P value{ Number Province 0.91,0.001 Connaught 182 (28) 76 (22) 29,522 (31) Leinster 143 (22) 160 (48) 21,252 (22) Munster 242 (38) 66 (20) 32,888 (34) Ulster 78 (12) 33 (10) 12,501 (13) Total herd number{ 645 (100) 335 (100) 96,163 (100) Enterprise type 0.66,0.001 Dairy 167 (26) 37 (10.3) 24,041 (25) Beef 459 (71.5) 316 (88.3) 69,237 (72) Mixed 16 (2.5) 5 (1.4) 2,885 (3) Total herd number{ 642 (100) 358 (100) 96,163 (100) * Numbers in parentheses are percentages. { Result of the chi-square test comparing the distribution of herds in the random field sample versus the reference population, and the opportunistic field sample versus the reference population, by province and by enterprise type. { Data on breed and/or province were not available for all herds, as reflected in the total herd counts. The percentages reflect the distribution of herds within each strata using available data. Discussion External validity is a central concept in observational epidemiological studies, reflecting the degree to which study results can be extrapolated from the sample to the broader reference population. This issue was of particular importance in the current context, as the study was seeking to provide accurate estimates of herd and within-herd prevalence of Johne s disease in the Irish cattle population. Although the cost of sample collection was relatively modest (as it was conducted in association with a national program of brucellosis surveillance), there were substantial costs associated with sample testing and subsequent analysis. Figure 3. Distribution of geo-referenced herds in the random (A; n 5 605) and opportunistic (B; n 5 295) field samples.

6 638 Richardson et al. Table 3. Assessment of herd clustering in each of the field and generated samples, based on the nearest neighbor analysis. Nearest neighbor analysis Ratio Z score Qualitative interpretation* Field samples Random Low (possibly clustered) Opportunistic Very low (clustered) Combined Very low (clustered) Generated samples Random A High (random) Random B Medium (possibly random) Random A+B High (random) * Chance that observed spatial pattern could have occurred at random. In the present study, various non-gis and GIS methods were used to assess the external validity of the 2 field datasets. The non-gis methods included the distribution of herds by province, county, and enterprise type. Contrary to many similar situations, the data were available for the reference population as well as the field samples. The utility of 2 GIS methods was also examined. These methods have a long history in a number of disciplines, including geography and ecology. They are also increasingly common in veterinary medicine, to assess spatial issues relating to both populations and samples. 15,16 These methods can be conducted without prior knowledge of the prevalence of disease surveyed, which was of particular benefit here, where little a priori data were available (prior to the start of the study) regarding the Figure 4. Kernel density plots of herds in (A) the random field sample and (B) random A generated sample datasets within the target window (each representing 605 farms); (C) the opportunistic field sample and (D) random B generated sample datasets within the target window (each representing 295 farms).

7 Assessing external validity following sample collection 639 national prevalence of Johne s disease. Nearest neighbor analysis (Table 3) provided a transparent and useful method to compare the distribution of each dataset with the target population. Further, density plots (Fig. 4) proved useful to explain the outcome of the nearest neighbor analysis. In each case, the generated sample was used to facilitate robust comparison. The current study identified substantial differences between herd distribution in the opportunistic field sample and the reference population. This difference was seen most clearly when comparing the percentage of herds by county in the reference population and the opportunistic field sample (P, 0.001). The artificially even herd distribution in the opportunistic field sample is immediately evident (Fig. 2). Similarly, compared with the reference population, there was a significant difference in herd distribution in the opportunistic field sample by province and enterprise type (P, 0.001). The opportunistic field sample was heavily biased towards beef cattle and herds in Leinster (the eastern province) compared with Connaght (west), Munster (south), and Ulster (north, northwest; Table 2). These 2 factors are interlinked, noting that Munster has the highest proportion of Ireland s dairy herds, and therefore fewer Munster herds will also result in fewer dairy herds. These findings were consistent with results from the GIS methods. Based on results from the nearest neighbor analysis, there was a very low chance that the observed spatial pattern from the opportunistic field sample could have occurred at random (Table 3). The opportunistic field sample was clustered in the northwest, based on both visual appraisal (Fig. 3) and density analysis (Fig. 4). It is rarely possible to produce reliable inferences about a population when opportunistic (nonprobability) sampling is used. 7 The distribution of herds in the random field sample and the reference population were similar. In these 2 datasets, there was no significant difference in herd distribution by county P ; Fig. 2). Further, compared with the reference population, the herd distribution in the random field sample did not differ significantly, either by province (P ) or enterprise type (P ; Table 2). Nonetheless, using GIS methods, there is some evidence of clustering, both in the kernel plots (Fig. 4) and nearest neighbor analysis (Table 3). Although caution is needed, it is likely that disease prevalence in the random field database is a reliable estimate of disease prevalence in the reference population. Note that the field sample includes a relatively small number of herds, particularly within defined strata. Based on this posthoc assessment, one can conclude that the results from the random field sample would provide an externally valid, albeit relatively imprecise, estimate of national disease prevalence, of disease prevalence in 3 of the 4 provinces (all but Ulster, in the north and northwest, where sample size was small), and in beef and dairy herds. This paper examines a range of GIS and non-gis methods to assess the external validity of field samples. Both methods, but particularly the GIS methods, require some background information about the reference population. The non-gis methods are well-suited to situations where data about the reference population are limited. Nonetheless, summary information (for example, the percentage of herds by geographic area) is needed. Using these methods, herd distribution in the field sample(s) and reference population can be compared over a range of strata; in this example, by province, county, and enterprise type. Using the GIS methods, much greater detail about the reference population is required. These methods are appropriate within an Irish context, where geo-referenced data are available about all cattle farms within the country; in other countries, such extensive and accurate information may not be available. In the current study, the GIS methods provided a relatively simple and readily interpretable means to assess the spatial distribution of the field samples and the reference population. Both visual (kernel density plots) and statistical (nearest neighbor analysis) methods are available to assist in these assessments. Sources and manufacturers a. Environmental Systems Research Institute Inc., Redlands, CA. b by GraphPad Software. Inc.; accessed March 5, References 1. Altman DG, Schulz KF, Moher D, et al.: 2001, The revised CONSORT statement for reporting randomized trials: explanation and elaboration. Ann Intern Med 134: Bager F: 2000, DANMAP: monitoring antimicrobial resistance in Denmark. Int J Antimicrob Agents 14: Bossuyt PM, Reitsma JB, Bruns DE, et al.: 2003, Standards for reporting of diagnostic accuracy: towards complete and accurate reporting of studies of diagnostic accuracy: the STARD initiative. Br Med J 326: Cameron AR, Baldock FC: 1998, Two-stage sampling in surveys to substantiate freedom from disease. Prev Vet Med 34: Dohoo I, Martin W, Stryhn H: 2003, Veterinary epidemiologic research. AVC Inc., Charlottetown, Prince Edward Island, Canada. 6. Greiner M, Gardner IA: 2000, Epidemiologic issues in the validation of veterinary diagnostic tests. Prev Vet Med 45: Jannink I, Bennen JN, Blaauw J, et al.: 1995, At opportunistic and systematic random sampling: effects on the prognostic

8 640 Richardson et al. value of nuclear area assessments in breast cancer patients. Breast Cancer Res Treat 36: Johnson LC, Beaton R, Murphy SA: 2004, First record: a methodological approach to counter sampling bias. Psychol Rep 95: Kelley K, Clark B, Brown V, Sitzia J: 2003, Good practice in the conduct and reporting of survey research. Int J Qual Health Care 15: Lucchelli A, Lance SE, Bartlett PB, et al.: 1992, Prevalence of bovine group A rotavirus shedding among dairy calves in Ohio. Am J Vet Res 53: Marcotty T, Simukoko H, Berkvens D, et al.: 2008, Evaluating the use of packed cell volume as an indicator of trypanosomal infections in cattle in eastern Zambia. Prev Vet Med 87: Martin SW, Meek AH, Willeberg P: 1986, Veterinary epidemiology, principles and methods. Iowa State University Press, Ames, IA. 13. Mortensen S, Stryhn H, Søgaard R, et al.: 2002, Risk factors for infection of sow herds with porcine reproductive and respiratory syndrome (PRRS) virus. Prev Vet Med 53: Muskens J, Barkema HW, Russchen E, et al.: 2000, Prevalence and regional distribution of paratuberculosis in dairy herds in the Netherlands. Prev Vet Med 77: Nødtvedt A, Guitian J, Egenvall A, et al.: 2007, The spatial distribution of atopic dermatitis cases in a population of insured Swedish dogs. Prev Vet Med 78: Stevenson MA, Morris RS, Pfeiffer DU, et al.: 2000, Descriptive spatial analysis of the epidemic of bovine spongiform encephalopathy in Great Britain to June Vet Rec 147: Vandenbroucke JP, von Elm E, Altman DG, et al.: 2007, Strengthening the Reporting of Observational Studies in Epidemiology (STROBE): explanation and elaboration. PLOS Med 16:e297.

Sampling. General introduction to sampling methods in epidemiology and some applications to food microbiology study October Hanoi

Sampling. General introduction to sampling methods in epidemiology and some applications to food microbiology study October Hanoi Sampling General introduction to sampling methods in epidemiology and some applications to food microbiology study October 2006 - Hanoi Stéphanie Desvaux, François Roger, Sophie Molia CIRAD Research Unit

More information

Sampling approaches in animal health studies. Al Ain November 2017

Sampling approaches in animal health studies. Al Ain November 2017 Sampling approaches in animal health studies Al Ain 13-14 November 2017 Definitions Populations and Samples From a statistical point of view: Population is a set of measurements, that may be finite or

More information

CHOOSING THE RIGHT SAMPLING TECHNIQUE FOR YOUR RESEARCH. Awanis Ku Ishak, PhD SBM

CHOOSING THE RIGHT SAMPLING TECHNIQUE FOR YOUR RESEARCH. Awanis Ku Ishak, PhD SBM CHOOSING THE RIGHT SAMPLING TECHNIQUE FOR YOUR RESEARCH Awanis Ku Ishak, PhD SBM Sampling The process of selecting a number of individuals for a study in such a way that the individuals represent the larger

More information

Role of GIS in Tracking and Controlling Spread of Disease

Role of GIS in Tracking and Controlling Spread of Disease Role of GIS in Tracking and Controlling Spread of Disease For Dr. Baqer Al-Ramadan By Syed Imran Quadri CRP 514: Introduction to GIS Introduction Problem Statement Objectives Methodology of Study Literature

More information

Lecture 5: Sampling Methods

Lecture 5: Sampling Methods Lecture 5: Sampling Methods What is sampling? Is the process of selecting part of a larger group of participants with the intent of generalizing the results from the smaller group, called the sample, to

More information

Lecturer: Dr. Adote Anum, Dept. of Psychology Contact Information:

Lecturer: Dr. Adote Anum, Dept. of Psychology Contact Information: Lecturer: Dr. Adote Anum, Dept. of Psychology Contact Information: aanum@ug.edu.gh College of Education School of Continuing and Distance Education 2014/2015 2016/2017 Session Overview In this Session

More information

Sampling. Module II Chapter 3

Sampling. Module II Chapter 3 Sampling Module II Chapter 3 Topics Introduction Terms in Sampling Techniques of Sampling Essentials of Good Sampling Introduction In research terms a sample is a group of people, objects, or items that

More information

Cluster Analysis using SaTScan

Cluster Analysis using SaTScan Cluster Analysis using SaTScan Summary 1. Statistical methods for spatial epidemiology 2. Cluster Detection What is a cluster? Few issues 3. Spatial and spatio-temporal Scan Statistic Methods Probability

More information

Part 7: Glossary Overview

Part 7: Glossary Overview Part 7: Glossary Overview In this Part This Part covers the following topic Topic See Page 7-1-1 Introduction This section provides an alphabetical list of all the terms used in a STEPS surveillance with

More information

FCE 3900 EDUCATIONAL RESEARCH LECTURE 8 P O P U L A T I O N A N D S A M P L I N G T E C H N I Q U E

FCE 3900 EDUCATIONAL RESEARCH LECTURE 8 P O P U L A T I O N A N D S A M P L I N G T E C H N I Q U E FCE 3900 EDUCATIONAL RESEARCH LECTURE 8 P O P U L A T I O N A N D S A M P L I N G T E C H N I Q U E OBJECTIVE COURSE Understand the concept of population and sampling in the research. Identify the type

More information

PRE DICT. PREDICT Project. Prevent and Respond to Epidemics and Demonstrate Information and Communication Technologies.

PRE DICT. PREDICT Project. Prevent and Respond to Epidemics and Demonstrate Information and Communication Technologies. PRE DICT PREDICT Project Prevent and Respond to Epidemics and Demonstrate Information and Communication Technologies IAC Conference October 5 th 2011 DSV Context About 75 % of the new diseases having affected

More information

Module 4 Approaches to Sampling. Georgia Kayser, PhD The Water Institute

Module 4 Approaches to Sampling. Georgia Kayser, PhD The Water Institute Module 4 Approaches to Sampling Georgia Kayser, PhD 2014 The Water Institute Objectives To understand the reasons for sampling populations To understand the basic questions and issues in selecting a sample.

More information

Types of spatial data. The Nature of Geographic Data. Types of spatial data. Spatial Autocorrelation. Continuous spatial data: geostatistics

Types of spatial data. The Nature of Geographic Data. Types of spatial data. Spatial Autocorrelation. Continuous spatial data: geostatistics The Nature of Geographic Data Types of spatial data Continuous spatial data: geostatistics Samples may be taken at intervals, but the spatial process is continuous e.g. soil quality Discrete data Irregular:

More information

USING CLUSTERING SOFTWARE FOR EXPLORING SPATIAL AND TEMPORAL PATTERNS IN NON-COMMUNICABLE DISEASES

USING CLUSTERING SOFTWARE FOR EXPLORING SPATIAL AND TEMPORAL PATTERNS IN NON-COMMUNICABLE DISEASES USING CLUSTERING SOFTWARE FOR EXPLORING SPATIAL AND TEMPORAL PATTERNS IN NON-COMMUNICABLE DISEASES Mariana Nagy "Aurel Vlaicu" University of Arad Romania Department of Mathematics and Computer Science

More information

An all-island approach to mapping bovine tuberculosis in Ireland

An all-island approach to mapping bovine tuberculosis in Ireland PEER reviewed An all-island approach to mapping bovine tuberculosis in Ireland McGrath G 1, Abernethy D 2, Stringer L 2,3 and More SJ 1 1 Centre for Veterinary Epidemiology and Risk Analysis, Veterinary

More information

Spatial Analysis I. Spatial data analysis Spatial analysis and inference

Spatial Analysis I. Spatial data analysis Spatial analysis and inference Spatial Analysis I Spatial data analysis Spatial analysis and inference Roadmap Outline: What is spatial analysis? Spatial Joins Step 1: Analysis of attributes Step 2: Preparing for analyses: working with

More information

Project Appraisal Guidelines

Project Appraisal Guidelines Project Appraisal Guidelines Unit 16.2 Expansion Factors for Short Period Traffic Counts August 2012 Project Appraisal Guidelines Unit 16.2 Expansion Factors for Short Period Traffic Counts Version Date

More information

Cluster Analysis using SaTScan. Patrick DeLuca, M.A. APHEO 2007 Conference, Ottawa October 16 th, 2007

Cluster Analysis using SaTScan. Patrick DeLuca, M.A. APHEO 2007 Conference, Ottawa October 16 th, 2007 Cluster Analysis using SaTScan Patrick DeLuca, M.A. APHEO 2007 Conference, Ottawa October 16 th, 2007 Outline Clusters & Cluster Detection Spatial Scan Statistic Case Study 28 September 2007 APHEO Conference

More information

Stochastic calculus for summable processes 1

Stochastic calculus for summable processes 1 Stochastic calculus for summable processes 1 Lecture I Definition 1. Statistics is the science of collecting, organizing, summarizing and analyzing the information in order to draw conclusions. It is a

More information

Part A: Salmonella prevalence estimates. (Question N EFSA-Q ) Adopted by The Task Force on 28 March 2007

Part A: Salmonella prevalence estimates. (Question N EFSA-Q ) Adopted by The Task Force on 28 March 2007 The EFSA Journal (2007) 98, 1-85 Report of the Task Force on Zoonoses Data Collection on the Analysis of the baseline survey on the prevalence of Salmonella in broiler flocks of Gallus gallus, in the EU,

More information

Sampling Techniques. Esra Akdeniz. February 9th, 2016

Sampling Techniques. Esra Akdeniz. February 9th, 2016 Sampling Techniques Esra Akdeniz February 9th, 2016 HOW TO DO RESEARCH? Question. Literature research. Hypothesis. Collect data. Analyze data. Interpret and present results. HOW TO DO RESEARCH? Collect

More information

SAMPLING- Method of Psychology. By- Mrs Neelam Rathee, Dept of Psychology. PGGCG-11, Chandigarh.

SAMPLING- Method of Psychology. By- Mrs Neelam Rathee, Dept of Psychology. PGGCG-11, Chandigarh. By- Mrs Neelam Rathee, Dept of 2 Sampling is that part of statistical practice concerned with the selection of a subset of individual observations within a population of individuals intended to yield some

More information

Introduction to risk assessment

Introduction to risk assessment Introduction to risk assessment Inception workshop of the project Strengthening the regional preparedness, prevention and response against lumpy skin disease in Belarus, Moldova and Ukraine (TCP/RER/3605)

More information

2011/04 LEUKAEMIA IN WALES Welsh Cancer Intelligence and Surveillance Unit

2011/04 LEUKAEMIA IN WALES Welsh Cancer Intelligence and Surveillance Unit 2011/04 LEUKAEMIA IN WALES 1994-2008 Welsh Cancer Intelligence and Surveillance Unit Table of Contents 1 Definitions and Statistical Methods... 2 2 Results 7 2.1 Leukaemia....... 7 2.2 Acute Lymphoblastic

More information

Georgia Kayser, PhD. Module 4 Approaches to Sampling. Hello and Welcome to Monitoring Evaluation and Learning: Approaches to Sampling.

Georgia Kayser, PhD. Module 4 Approaches to Sampling. Hello and Welcome to Monitoring Evaluation and Learning: Approaches to Sampling. Slide 1 Module 4 Approaches to Sampling Georgia Kayser, PhD Hello and Welcome to Monitoring Evaluation and Learning: Approaches to Sampling Slide 2 Objectives To understand the reasons for sampling populations

More information

Temporal and spatial mapping of hand, foot and mouth disease in Sarawak, Malaysia

Temporal and spatial mapping of hand, foot and mouth disease in Sarawak, Malaysia Geospatial Health 8(2), 2014, pp. 503-507 Temporal and spatial mapping of hand, foot and mouth disease in Sarawak, Malaysia Noraishah M. Sham 1, Isthrinayagy Krishnarajah 1,2, Noor Akma Ibrahim 1,2, Munn-Sann

More information

ESRI 2008 Health GIS Conference

ESRI 2008 Health GIS Conference ESRI 2008 Health GIS Conference An Exploration of Geographically Weighted Regression on Spatial Non- Stationarity and Principal Component Extraction of Determinative Information from Robust Datasets A

More information

A Note on Bayesian Inference After Multiple Imputation

A Note on Bayesian Inference After Multiple Imputation A Note on Bayesian Inference After Multiple Imputation Xiang Zhou and Jerome P. Reiter Abstract This article is aimed at practitioners who plan to use Bayesian inference on multiplyimputed datasets in

More information

Spatial Statistical Analysis in Cow Disease Monitoring Based on GIS

Spatial Statistical Analysis in Cow Disease Monitoring Based on GIS Spatial Statistical Analysis in Cow Disease Monitoring Based on GIS Lin Li 1, Yong Yang 2,*, Hongbin Wang 3, Jing Dong 1, Yujun Zhao 1, and Jianbin He 1 1 School of Animal Husbandry and Veterinary Medicine,

More information

DATA DISAGGREGATION BY GEOGRAPHIC

DATA DISAGGREGATION BY GEOGRAPHIC PROGRAM CYCLE ADS 201 Additional Help DATA DISAGGREGATION BY GEOGRAPHIC LOCATION Introduction This document provides supplemental guidance to ADS 201.3.5.7.G Indicator Disaggregation, and discusses concepts

More information

Examine characteristics of a sample and make inferences about the population

Examine characteristics of a sample and make inferences about the population Chapter 11 Introduction to Inferential Analysis Learning Objectives Understand inferential statistics Explain the difference between a population and a sample Explain the difference between parameter and

More information

INTRODUCTION. In March 1998, the tender for project CT.98.EP.04 was awarded to the Department of Medicines Management, Keele University, UK.

INTRODUCTION. In March 1998, the tender for project CT.98.EP.04 was awarded to the Department of Medicines Management, Keele University, UK. INTRODUCTION In many areas of Europe patterns of drug use are changing. The mechanisms of diffusion are diverse: introduction of new practices by new users, tourism and migration, cross-border contact,

More information

Analyzing the Geospatial Rates of the Primary Care Physician Labor Supply in the Contiguous United States

Analyzing the Geospatial Rates of the Primary Care Physician Labor Supply in the Contiguous United States Analyzing the Geospatial Rates of the Primary Care Physician Labor Supply in the Contiguous United States By Russ Frith Advisor: Dr. Raid Amin University of W. Florida Capstone Project in Statistics April,

More information

Facilitating Survey Sampling in Low- and Middle-Income Contexts Through Satellite Maps and Basic Mobile Computing Technology

Facilitating Survey Sampling in Low- and Middle-Income Contexts Through Satellite Maps and Basic Mobile Computing Technology Facilitating Survey Sampling in Low- and Middle-Income Contexts Through Satellite Maps and Basic Mobile Computing Technology 6 th European Survey Association Conference Survey Research in Developing Countries

More information

EpiMAN-TB, a decision support system using spatial information for the management of tuberculosis in cattle and deer in New Zealand

EpiMAN-TB, a decision support system using spatial information for the management of tuberculosis in cattle and deer in New Zealand EpiMAN-TB, a decision support system using spatial information for the management of tuberculosis in cattle and deer in New Zealand J.S. McKenzie 1, R.S. Morris 1, C.J. Tutty 2, D.U. Pfeiffer 1 Dept of

More information

MN 400: Research Methods. CHAPTER 7 Sample Design

MN 400: Research Methods. CHAPTER 7 Sample Design MN 400: Research Methods CHAPTER 7 Sample Design 1 Some fundamental terminology Population the entire group of objects about which information is wanted Unit, object any individual member of the population

More information

Probability and Inference. POLI 205 Doing Research in Politics. Populations and Samples. Probability. Fall 2015

Probability and Inference. POLI 205 Doing Research in Politics. Populations and Samples. Probability. Fall 2015 Fall 2015 Population versus Sample Population: data for every possible relevant case Sample: a subset of cases that is drawn from an underlying population Inference Parameters and Statistics A parameter

More information

Vehicle Freq Rel. Freq Frequency distribution. Statistics

Vehicle Freq Rel. Freq Frequency distribution. Statistics 1.1 STATISTICS Statistics is the science of data. This involves collecting, summarizing, organizing, and analyzing data in order to draw meaningful conclusions about the universe from which the data is

More information

SYA 3300 Research Methods and Lab Summer A, 2000

SYA 3300 Research Methods and Lab Summer A, 2000 May 17, 2000 Sampling Why sample? Types of sampling methods Probability Non-probability Sampling distributions Purposes of Today s Class Define generalizability and its relation to different sampling strategies

More information

Use of the scan statistic on disaggregated province-based data: Foot-and-mouth disease in Iran

Use of the scan statistic on disaggregated province-based data: Foot-and-mouth disease in Iran Preventive Veterinary Medicine 71 (2005) 197 207 www.elsevier.com/locate/prevetmed Use of the scan statistic on disaggregated province-based data: Foot-and-mouth disease in Iran Andres M. Perez *, Mark

More information

Statistical Practice

Statistical Practice Statistical Practice A Note on Bayesian Inference After Multiple Imputation Xiang ZHOU and Jerome P. REITER This article is aimed at practitioners who plan to use Bayesian inference on multiply-imputed

More information

Long Island Breast Cancer Study and the GIS-H (Health)

Long Island Breast Cancer Study and the GIS-H (Health) Long Island Breast Cancer Study and the GIS-H (Health) Edward J. Trapido, Sc.D. Associate Director Epidemiology and Genetics Research Program, DCCPS/NCI COMPREHENSIVE APPROACHES TO CANCER CONTROL September,

More information

CROSS SECTIONAL STUDY & SAMPLING METHOD

CROSS SECTIONAL STUDY & SAMPLING METHOD CROSS SECTIONAL STUDY & SAMPLING METHOD Prof. Dr. Zaleha Md. Isa, BSc(Hons) Clin. Biochemistry; PhD (Public Health), Department of Community Health, Faculty of Medicine, Universiti Kebangsaan Malaysia

More information

Inferential Statistics. Chapter 5

Inferential Statistics. Chapter 5 Inferential Statistics Chapter 5 Keep in Mind! 1) Statistics are useful for figuring out random noise from real effects. 2) Numbers are not absolute, and they can be easily manipulated. 3) Always scrutinize

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION A: The weakening geographical relationship between climate and malaria endemicity 1900-2007 Temperature and rainfall are two climatic variables known to assert fundamental influence on local environmental

More information

Now we will define some common sampling plans and discuss their strengths and limitations.

Now we will define some common sampling plans and discuss their strengths and limitations. Now we will define some common sampling plans and discuss their strengths and limitations. 1 For volunteer samples individuals are self selected. Participants decide to include themselves in the study.

More information

What is GIS? ESRI Canada. August 2011

What is GIS? ESRI Canada. August 2011 What is GIS? ESRI Canada August 2011 Geography Matters! Environmental Park Management Agriculture Public Utilities Health Care Emergency 911 Real Estate Marketing Environmental What are the effects of

More information

Taking into account sampling design in DAD. Population SAMPLING DESIGN AND DAD

Taking into account sampling design in DAD. Population SAMPLING DESIGN AND DAD Taking into account sampling design in DAD SAMPLING DESIGN AND DAD With version 4.2 and higher of DAD, the Sampling Design (SD) of the database can be specified in order to calculate the correct asymptotic

More information

Spatio-temporal evaluation of cattle trade in Sweden: description of a grid network visualization technique

Spatio-temporal evaluation of cattle trade in Sweden: description of a grid network visualization technique Geospatial Health 5(1), 2010, pp. 119-130 Spatio-temporal evaluation of cattle trade in Sweden: description of a grid network visualization technique Stefan Widgren 1, Jenny Frössling 1,2 1 Department

More information

Measures from the Adult Social Care Outcomes Framework, England

Measures from the Adult Social Care Outcomes Framework, England s from the Adult Social Care Outcomes Framework, England 2016-17 Guidance for using CSV file Published 25 October 2017 Copyright 2017 Health and Social Care Information Centre. The Health and Social Care

More information

The Use of Random Geographic Cluster Sampling to Survey Pastoralists. Kristen Himelein, World Bank Addis Ababa, Ethiopia January 23, 2013

The Use of Random Geographic Cluster Sampling to Survey Pastoralists. Kristen Himelein, World Bank Addis Ababa, Ethiopia January 23, 2013 The Use of Random Geographic Cluster Sampling to Survey Pastoralists Kristen Himelein, World Bank Addis Ababa, Ethiopia January 23, 2013 Paper This presentation is based on the paper by Kristen Himelein

More information

Explain the role of sampling in the research process Distinguish between probability and nonprobability sampling Understand the factors to consider

Explain the role of sampling in the research process Distinguish between probability and nonprobability sampling Understand the factors to consider Visanou Hansana Explain the role of sampling in the research process Distinguish between probability and nonprobability sampling Understand the factors to consider when determining sample size Understand

More information

3. When a researcher wants to identify particular types of cases for in-depth investigation; purpose less to generalize to larger population than to g

3. When a researcher wants to identify particular types of cases for in-depth investigation; purpose less to generalize to larger population than to g Chapter 7: Qualitative and Quantitative Sampling Introduction Quantitative researchers more concerned with sampling; primary goal to get a representative sample (smaller set of cases a researcher selects

More information

Teaching Research Methods: Resources for HE Social Sciences Practitioners. Sampling

Teaching Research Methods: Resources for HE Social Sciences Practitioners. Sampling Sampling Session Objectives By the end of the session you will be able to: Explain what sampling means in research List the different sampling methods available Have had an introduction to confidence levels

More information

3/9/2015. Overview. Introduction - sampling. Introduction - sampling. Before Sampling

3/9/2015. Overview. Introduction - sampling. Introduction - sampling. Before Sampling PLP 6404 Epidemiology of Plant Diseases Spring 2015 Lecture 19: Spatial variability, sampling and interplotinterference Prof. Dr. Ariena van Bruggen Emerging Pathogens Institute and Plant Pathology Department,

More information

CHL 5225 H Crossover Trials. CHL 5225 H Crossover Trials

CHL 5225 H Crossover Trials. CHL 5225 H Crossover Trials CHL 55 H Crossover Trials The Two-sequence, Two-Treatment, Two-period Crossover Trial Definition A trial in which patients are randomly allocated to one of two sequences of treatments (either 1 then, or

More information

University of Lusaka

University of Lusaka University of Lusaka BSPH 315 Health Mapping & GIS Topic: Background of GIS Content: 1. Aim of the course 2. What do you know about GIS? 3. Difference between geographic data and geographic information

More information

MISSING or INCOMPLETE DATA

MISSING or INCOMPLETE DATA MISSING or INCOMPLETE DATA A (fairly) complete review of basic practice Don McLeish and Cyntha Struthers University of Waterloo Dec 5, 2015 Structure of the Workshop Session 1 Common methods for dealing

More information

Economic, Productive & Financial Performance Of Alberta Cow/Calf Operations

Economic, Productive & Financial Performance Of Alberta Cow/Calf Operations Benchmarks for Cattlemen Economics & Competitiveness Economic, Productive & Financial Performance Of Cow/Calf Operations Moist Mixed land Production Year - 1999 Nov-03 Foreword This AgriProfit$ & Returns

More information

The National Spatial Strategy

The National Spatial Strategy Purpose of this Consultation Paper This paper seeks the views of a wide range of bodies, interests and members of the public on the issues which the National Spatial Strategy should address. These views

More information

On the Use of the Bross Formula for Prioritizing Covariates in the High-Dimensional Propensity Score Algorithm

On the Use of the Bross Formula for Prioritizing Covariates in the High-Dimensional Propensity Score Algorithm On the Use of the Bross Formula for Prioritizing Covariates in the High-Dimensional Propensity Score Algorithm Richard Wyss 1, Bruce Fireman 2, Jeremy A. Rassen 3, Sebastian Schneeweiss 1 Author Affiliations:

More information

Big-Geo-Data EHR Infrastructure Development for On-Demand Analytics

Big-Geo-Data EHR Infrastructure Development for On-Demand Analytics Big-Geo-Data EHR Infrastructure Development for On-Demand Analytics Sohayla Pruitt, MA Senior Geospatial Scientist Duke Medicine DUHS DHTS EIM HIRS Page 1 Institute of Medicine, World Health Organization,

More information

GEO 463-Geographic Information Systems Applications. Lecture 1

GEO 463-Geographic Information Systems Applications. Lecture 1 GEO 463-Geographic Information Systems Applications Lecture 1 Rules of engagement No Mobile Submit course work- scratch my back.i..? Software- Quantum GIS vrs ArcGIS Open source vrs Commercial Free vrs

More information

Spatial Analyses of Bowhead Whale Calls by Type of Call. Heidi Batchelor and Gerald L. D Spain. Marine Physical Laboratory

Spatial Analyses of Bowhead Whale Calls by Type of Call. Heidi Batchelor and Gerald L. D Spain. Marine Physical Laboratory 1 Spatial Analyses of Bowhead Whale Calls by Type of Call 2 3 Heidi Batchelor and Gerald L. D Spain 4 5 Marine Physical Laboratory 6 Scripps Institution of Oceanography 7 291 Rosecrans St., San Diego,

More information

Public Disclosure Copy. Implementation Status & Results Report AGRICULTURAL COMPETITIVENESS IMPROVEMENT PROJECT (P122812)

Public Disclosure Copy. Implementation Status & Results Report AGRICULTURAL COMPETITIVENESS IMPROVEMENT PROJECT (P122812) Public Disclosure Authorized EUROPE AND CENTRAL ASIA Azerbaijan Agriculture and Rural Development Global Practice IBRD/IDA Investment Project Financing FY 2014 Seq No: 3 ARCHIVED on 10-Apr-2015 ISR18747

More information

METHODS FOR STATISTICS

METHODS FOR STATISTICS DYNAMIC CARTOGRAPHIC METHODS FOR VISUALIZATION OF HEALTH STATISTICS Radim Stampach M.Sc. Assoc. Prof. Milan Konecny Ph.D. Petr Kubicek Ph.D. Laboratory on Geoinformatics and Cartography, Department of

More information

Sampling in Space and Time. Natural experiment? Analytical Surveys

Sampling in Space and Time. Natural experiment? Analytical Surveys Sampling in Space and Time Overview of Sampling Approaches Sampling versus Experimental Design Experiments deliberately perturb a portion of population to determine effect objective is to compare the mean

More information

TYPES OF SAMPLING TECHNIQUES IN PHYSICAL EDUCATION AND SPORTS

TYPES OF SAMPLING TECHNIQUES IN PHYSICAL EDUCATION AND SPORTS TYPES OF SAMPLING TECHNIQUES IN PHYSICAL EDUCATION AND SPORTS Daksh Sharma Assistant Professor of Phy.Edu, SGGS Khalsa Mahilpur ABSTRACT In sports statistics, sampling techniques has a immense importance

More information

Determining Sufficient Number of Imputations Using Variance of Imputation Variances: Data from 2012 NAMCS Physician Workflow Mail Survey *

Determining Sufficient Number of Imputations Using Variance of Imputation Variances: Data from 2012 NAMCS Physician Workflow Mail Survey * Applied Mathematics, 2014,, 3421-3430 Published Online December 2014 in SciRes. http://www.scirp.org/journal/am http://dx.doi.org/10.4236/am.2014.21319 Determining Sufficient Number of Imputations Using

More information

In matrix algebra notation, a linear model is written as

In matrix algebra notation, a linear model is written as DM3 Calculation of health disparity Indices Using Data Mining and the SAS Bridge to ESRI Mussie Tesfamicael, University of Louisville, Louisville, KY Abstract Socioeconomic indices are strongly believed

More information

a system for input, storage, manipulation, and output of geographic information. GIS combines software with hardware,

a system for input, storage, manipulation, and output of geographic information. GIS combines software with hardware, Introduction to GIS Dr. Pranjit Kr. Sarma Assistant Professor Department of Geography Mangaldi College Mobile: +91 94357 04398 What is a GIS a system for input, storage, manipulation, and output of geographic

More information

Field data acquisition

Field data acquisition Lesson : Primary sources Unit 3: Field data B-DC Lesson / Unit 3 Claude Collet D Department of Geosciences - Geography Content of Lesson Unit 1: Unit : Unit 3: Unit 4: Digital sources Remote sensing Field

More information

Core Courses for Students Who Enrolled Prior to Fall 2018

Core Courses for Students Who Enrolled Prior to Fall 2018 Biostatistics and Applied Data Analysis Students must take one of the following two sequences: Sequence 1 Biostatistics and Data Analysis I (PHP 2507) This course, the first in a year long, two-course

More information

Information and surveillance system of vesicular diseases in the Americas. Use of grid maps for monitoring, data collection and reporting

Information and surveillance system of vesicular diseases in the Americas. Use of grid maps for monitoring, data collection and reporting Rev. sci. tech. Off. mt. Epiz., 1983, 2 (3), 739-749. Information and surveillance system of vesicular diseases in the Americas. Use of grid maps for monitoring, data collection and reporting Vicente M.

More information

Applied Spatial Analysis in Epidemiology

Applied Spatial Analysis in Epidemiology Applied Spatial Analysis in Epidemiology COURSE DURATION Course material will be available from: June 1- June 30, 2018 INSTRUCTOR Rena Jones, PhD MS rena.jones@yale.edu COURSE DESCRIPTION This course will

More information

An Introduction to SaTScan

An Introduction to SaTScan An Introduction to SaTScan Software to measure spatial, temporal or space-time clusters using a spatial scan approach Marilyn O Hara University of Illinois moruiz@illinois.edu Lecture for the Pre-conference

More information

BROOKINGS May

BROOKINGS May Appendix 1. Technical Methodology This study combines detailed data on transit systems, demographics, and employment to determine the accessibility of jobs via transit within and across the country s 100

More information

GIS and Health Geography. What is epidemiology?

GIS and Health Geography. What is epidemiology? GIS and Health Geography { What is epidemiology? TOC GIS and health geography Major applications for GIS Epidemiology What is health (and how location matters) What is a disease (and how to identify one)

More information

NEW YORK DEPARTMENT OF SANITATION. Spatial Analysis of Complaints

NEW YORK DEPARTMENT OF SANITATION. Spatial Analysis of Complaints NEW YORK DEPARTMENT OF SANITATION Spatial Analysis of Complaints Spatial Information Design Lab Columbia University Graduate School of Architecture, Planning and Preservation November 2007 Title New York

More information

A case study for self-organized criticality and complexity in forest landscape ecology

A case study for self-organized criticality and complexity in forest landscape ecology Chapter 1 A case study for self-organized criticality and complexity in forest landscape ecology Janine Bolliger Swiss Federal Research Institute (WSL) Zürcherstrasse 111; CH-8903 Birmendsdorf, Switzerland

More information

Lecture 01: Introduction

Lecture 01: Introduction Lecture 01: Introduction Dipankar Bandyopadhyay, Ph.D. BMTRY 711: Analysis of Categorical Data Spring 2011 Division of Biostatistics and Epidemiology Medical University of South Carolina Lecture 01: Introduction

More information

Approach to Field Research Data Generation and Field Logistics Part 1. Road Map 8/26/2016

Approach to Field Research Data Generation and Field Logistics Part 1. Road Map 8/26/2016 Approach to Field Research Data Generation and Field Logistics Part 1 Lecture 3 AEC 460 Road Map How we do ecology Part 1 Recap Types of data Sampling abundance and density methods Part 2 Sampling design

More information

Supplementary material: Methodological annex

Supplementary material: Methodological annex 1 Supplementary material: Methodological annex Correcting the spatial representation bias: the grid sample approach Our land-use time series used non-ideal data sources, which differed in spatial and thematic

More information

Acknowledgments xiii Preface xv. GIS Tutorial 1 Introducing GIS and health applications 1. What is GIS? 2

Acknowledgments xiii Preface xv. GIS Tutorial 1 Introducing GIS and health applications 1. What is GIS? 2 Acknowledgments xiii Preface xv GIS Tutorial 1 Introducing GIS and health applications 1 What is GIS? 2 Spatial data 2 Digital map infrastructure 4 Unique capabilities of GIS 5 Installing ArcView and the

More information

Urbanization, Land Cover, Weather, and Incidence Rates of Neuroinvasive West Nile Virus Infections In Illinois

Urbanization, Land Cover, Weather, and Incidence Rates of Neuroinvasive West Nile Virus Infections In Illinois Urbanization, Land Cover, Weather, and Incidence Rates of Neuroinvasive West Nile Virus Infections In Illinois JUNE 23, 2016 H ANNAH MATZ KE Background Uganda 1937 United States -1999 New York Quickly

More information

Agent-Based Modeling in ArcGIS. Kevin M. Johnston

Agent-Based Modeling in ArcGIS. Kevin M. Johnston Agent-Based Modeling in ArcGIS Kevin M. Johnston The problem Have a phenomenon that changes with time and space Want to model time and space explicitly not as a snap shot Want to model the interactions

More information

Quantifying uncertainty of geological 3D layer models, constructed with a-priori

Quantifying uncertainty of geological 3D layer models, constructed with a-priori Quantifying uncertainty of geological 3D layer models, constructed with a-priori geological expertise Jan Gunnink, Denise Maljers 2 and Jan Hummelman 2, TNO Built Environment and Geosciences Geological

More information

Chapter 6 ESTIMATION OF PARAMETERS

Chapter 6 ESTIMATION OF PARAMETERS Chapter 6 ESTIMATION OF PARAMETERS Recall that one of the objectives of statistics is to make inferences concerning a population. And these inferences are based only in partial information regarding the

More information

Chapter 6 Spatial Analysis

Chapter 6 Spatial Analysis 6.1 Introduction Chapter 6 Spatial Analysis Spatial analysis, in a narrow sense, is a set of mathematical (and usually statistical) tools used to find order and patterns in spatial phenomena. Spatial patterns

More information

USING DOWNSCALED POPULATION IN LOCAL DATA GENERATION

USING DOWNSCALED POPULATION IN LOCAL DATA GENERATION USING DOWNSCALED POPULATION IN LOCAL DATA GENERATION A COUNTRY-LEVEL EXAMINATION CONTENT Research Context and Approach. This part outlines the background to and methodology of the examination of downscaled

More information

Module 9: Sampling IPDET. Sampling. Intro Concepts Types Confidence/ Precision? How Large? Intervention or Policy. Evaluation Questions

Module 9: Sampling IPDET. Sampling. Intro Concepts Types Confidence/ Precision? How Large? Intervention or Policy. Evaluation Questions IPDET Module 9: Sampling Sampling Intervention or Policy Evaluation Questions Design Approaches Data Collection Intro Concepts Types Confidence/ Precision? How Large? Introduction Introduction to Sampling

More information

An Internet-based Agricultural Land Use Trends Visualization System (AgLuT)

An Internet-based Agricultural Land Use Trends Visualization System (AgLuT) An Internet-based Agricultural Land Use Trends Visualization System (AgLuT) Prepared for Missouri Department of Natural Resources Missouri Department of Conservation 07-01-2000-12-31-2001 Submitted by

More information

A4. Methodology Annex: Sampling Design (2008) Methodology Annex: Sampling design 1

A4. Methodology Annex: Sampling Design (2008) Methodology Annex: Sampling design 1 A4. Methodology Annex: Sampling Design (2008) Methodology Annex: Sampling design 1 Introduction The evaluation strategy for the One Million Initiative is based on a panel survey. In a programme such as

More information

GIS and Community Health. GIS and Community Health. Institutional Context and Interests in GIS Development. GIS and Community Health

GIS and Community Health. GIS and Community Health. Institutional Context and Interests in GIS Development. GIS and Community Health GIS and Community Health GIS and Community Health Some critiques of GIS emphasize the potentially harmful social consequences of the diffusion of GIS technology, including reinforcing the power of state

More information

What is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty.

What is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty. What is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty. Statistics is a field of study concerned with the data collection,

More information

Sampling. Where we re heading: Last time. What is the sample? Next week: Lecture Monday. **Lab Tuesday leaving at 11:00 instead of 1:00** Tomorrow:

Sampling. Where we re heading: Last time. What is the sample? Next week: Lecture Monday. **Lab Tuesday leaving at 11:00 instead of 1:00** Tomorrow: Sampling Questions Define: Sampling, statistical inference, statistical vs. biological population, accuracy, precision, bias, random sampling Why do people use sampling techniques in monitoring? How do

More information

SRJC Applied Technology 54A Introduction to GIS

SRJC Applied Technology 54A Introduction to GIS SRJC Applied Technology 54A Introduction to GIS Overview Lecture of Geographic Information Systems Fall 2004 Santa Rosa Junior College Presented By: Tim Pudoff, GIS Coordinator, County of Sonoma, Information

More information

STATISTICS Relationships between variables: Correlation

STATISTICS Relationships between variables: Correlation STATISTICS 16 Relationships between variables: Correlation The gentleman pictured above is Sir Francis Galton. Galton invented the statistical concept of correlation and the use of the regression line.

More information

Inference Methods for the Conditional Logistic Regression Model with Longitudinal Data Arising from Animal Habitat Selection Studies

Inference Methods for the Conditional Logistic Regression Model with Longitudinal Data Arising from Animal Habitat Selection Studies Inference Methods for the Conditional Logistic Regression Model with Longitudinal Data Arising from Animal Habitat Selection Studies Thierry Duchesne 1 (Thierry.Duchesne@mat.ulaval.ca) with Radu Craiu,

More information

OUTBREAK INVESTIGATION AND SPATIAL ANALYSIS OF SURVEILLANCE DATA: CLUSTER DATA ANALYSIS. Preliminary Programme / Draft_2. Serbia, Mai 2013

OUTBREAK INVESTIGATION AND SPATIAL ANALYSIS OF SURVEILLANCE DATA: CLUSTER DATA ANALYSIS. Preliminary Programme / Draft_2. Serbia, Mai 2013 OUTBREAK INVESTIGATION AND SPATIAL ANALYSIS OF SURVEILLANCE DATA: CLUSTER DATA ANALYSIS Preliminary Programme / Draft_2 Serbia, 20-24 Mai 2013 1. Course presentation Outbreak investigation is one of the

More information